BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 005253
         (706 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255553723|ref|XP_002517902.1| cleavage and polyadenylation specificity factor, putative [Ricinus
           communis]
 gi|223542884|gb|EEF44420.1| cleavage and polyadenylation specificity factor, putative [Ricinus
           communis]
          Length = 740

 Score = 1254 bits (3245), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 616/742 (83%), Positives = 656/742 (88%), Gaps = 38/742 (5%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQVTPL+GV+NENPLSYL+SID FN LIDCGWNDHFDPSLLQPLS+VASTIDAVLL
Sbjct: 1   MGTSVQVTPLNGVYNENPLSYLISIDNFNLLIDCGWNDHFDPSLLQPLSRVASTIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SH DTLHLGALPYAMKQLGLSAPV+STEPVYRLGLLTMYDQYLSR+ VSEFDLF+LDDID
Sbjct: 61  SHSDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQYLSRKAVSEFDLFSLDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           SAFQ++TRLTYSQN+HLSGKGEGIV+APHVAGHLLGGTVWKITKDGEDV+YAVD+N RKE
Sbjct: 121 SAFQNITRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTVWKITKDGEDVVYAVDFNHRKE 180

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR--EMFQDAISKTLRAGGNVLLPVDSA 238
           +HLNGTVLESFVRPAVLITDAYNAL NQPPRQQR  E  +  I KTL AGGNVLLPVD+A
Sbjct: 181 RHLNGTVLESFVRPAVLITDAYNALSNQPPRQQRDKEFLEKTILKTLEAGGNVLLPVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAF 298
           GRVLELLLILE +WA   LNYPI+FLTYVSSSTIDYVKSFLEWM DSI KSFETSRDNAF
Sbjct: 241 GRVLELLLILEQFWAHRLLNYPIFFLTYVSSSTIDYVKSFLEWMSDSIAKSFETSRDNAF 300

Query: 299 LLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFG 358
           LLKHVTLLINK+ELDNAP+ PK+VLASMASLEAGFSHDIFVEWA+DVKNLVLFTERGQFG
Sbjct: 301 LLKHVTLLINKNELDNAPNVPKVVLASMASLEAGFSHDIFVEWAADVKNLVLFTERGQFG 360

Query: 359 TLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASL 418
           TLARMLQADPPPKAVKVTMSRRVPLVG+ELIAYEEEQ RLKKEE L AS++KEEE+K S 
Sbjct: 361 TLARMLQADPPPKAVKVTMSRRVPLVGDELIAYEEEQKRLKKEEELNASMIKEEEAKVSH 420

Query: 419 GPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEW 478
           GPD+NLS DPM+IDA+N NAS D V   G  YRDIL DGFVPPSTSVAPMFPFYEN +EW
Sbjct: 421 GPDSNLS-DPMIIDASNNNASLDAVGSQGTGYRDILFDGFVPPSTSVAPMFPFYENTTEW 479

Query: 479 DDFGEVINPDDYIIKDEDMDQAAMHIGGD-DGKLDEGSASLILDAKPSKVVSNELT---- 533
           DDFGEVINPDDY+IKD+DMDQ  MH+GGD DGK DEGSAS ILD KPSKVVS+ELT    
Sbjct: 480 DDFGEVINPDDYVIKDDDMDQ-PMHVGGDIDGKFDEGSASWILDTKPSKVVSSELTVQVK 538

Query: 534 -----------------------------VLVHGSAEATEHLKQHCLKHVCPHVYTPQIE 564
                                        VLVHGSAE+TEHLKQHCLKHVCPHVY PQIE
Sbjct: 539 CSLIYMDYEGRSDGRSIKSILAHVAPLKLVLVHGSAESTEHLKQHCLKHVCPHVYAPQIE 598

Query: 565 ETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPP 624
           ETIDVTSDLCAYKVQLSEKLMSNVLFKKLGD+EIAWVDAEVGKTE+  LSLLPIST APP
Sbjct: 599 ETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDFEIAWVDAEVGKTESDALSLLPISTSAPP 658

Query: 625 HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVI 684
           HKSVLVGDLKMAD K FL+SKG+QVEFAGGALRCGEYVT+RKVG   QKGGGSGTQQIVI
Sbjct: 659 HKSVLVGDLKMADFKQFLASKGVQVEFAGGALRCGEYVTLRKVGNINQKGGGSGTQQIVI 718

Query: 685 EGPLCEDYYKIRAYLYSQFYLL 706
           EGPLCEDYYKIR YLYSQFYLL
Sbjct: 719 EGPLCEDYYKIREYLYSQFYLL 740


>gi|449446027|ref|XP_004140773.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like [Cucumis sativus]
          Length = 738

 Score = 1227 bits (3175), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 602/741 (81%), Positives = 654/741 (88%), Gaps = 38/741 (5%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQVTPL GV+NENPLSYLVS+D FNFLIDCGWNDHFDP+LLQPLS+VASTIDAVL+
Sbjct: 1   MGTSVQVTPLCGVYNENPLSYLVSVDDFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLI 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQ+++R+QVSEFDLFTLDDID
Sbjct: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQFIARKQVSEFDLFTLDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           SAFQ VTRLTYSQN+HLSGKGEGIV+APHVAGHLLGGT+WKITKDGEDVIYAVD+N RKE
Sbjct: 121 SAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKE 180

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAG 239
           +HLNGT+LESFVRPAVLITDAYNAL+NQP R+Q++  F D I KTLRA GNVLLPVD+AG
Sbjct: 181 RHLNGTILESFVRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAG 240

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
           RVLEL+ ILE YW E SLNYPI+FLTYV+SSTIDY+KSFLEWM D+I KSFE +R+NAFL
Sbjct: 241 RVLELIQILEWYWEEESLNYPIFFLTYVASSTIDYIKSFLEWMSDTIAKSFEHTRNNAFL 300

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
           LKHVTLLINKSELDNAPDGPK+VLASMASLEAG+SHDIFV+WA D KNLVLF+ERGQFGT
Sbjct: 301 LKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDIFVDWAMDAKNLVLFSERGQFGT 360

Query: 360 LARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLG 419
           LARMLQADPPPKAVKVT+S+RVPL G+ELIAYEEEQ R KKEEALKASL+KEE+SKAS G
Sbjct: 361 LARMLQADPPPKAVKVTVSKRVPLTGDELIAYEEEQNR-KKEEALKASLLKEEQSKASHG 419

Query: 420 PDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWD 479
            DN+ +GDPM+IDA ++N + DV   HGG YRDILIDGFVPPST VAPMFPFYEN S WD
Sbjct: 420 ADND-TGDPMIIDA-SSNVAPDVGSSHGGAYRDILIDGFVPPSTGVAPMFPFYENTSAWD 477

Query: 480 DFGEVINPDDYIIKDEDMDQAAMHIGGD-DGKLDEGSASLILDAKPSKVVSNELT----- 533
           DFGEVINPDDY+IKDEDMDQAAMH GGD DGKLDE +A+LILD KPSKVVSNELT     
Sbjct: 478 DFGEVINPDDYVIKDEDMDQAAMHAGGDVDGKLDETAANLILDMKPSKVVSNELTVQVKC 537

Query: 534 ----------------------------VLVHGSAEATEHLKQHCLKHVCPHVYTPQIEE 565
                                       VLVHG+AEATEHLKQHCLK+VCPHVY PQIEE
Sbjct: 538 SLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEE 597

Query: 566 TIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPPH 625
           TIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEI W+DAEVGKTENG LSLLP+S    PH
Sbjct: 598 TIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEITWLDAEVGKTENGTLSLLPLSKAPAPH 657

Query: 626 KSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIE 685
           KSVLVGDLKMAD K FL+SKGIQVEFAGGALRCGEYVT+RKV  A QKGGGSGTQQ+VIE
Sbjct: 658 KSVLVGDLKMADFKQFLASKGIQVEFAGGALRCGEYVTLRKVTDASQKGGGSGTQQVVIE 717

Query: 686 GPLCEDYYKIRAYLYSQFYLL 706
           GPLCEDYYKIR  LYSQFYLL
Sbjct: 718 GPLCEDYYKIRELLYSQFYLL 738


>gi|224121102|ref|XP_002330904.1| predicted protein [Populus trichocarpa]
 gi|222872726|gb|EEF09857.1| predicted protein [Populus trichocarpa]
          Length = 740

 Score = 1219 bits (3155), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 600/741 (80%), Positives = 647/741 (87%), Gaps = 36/741 (4%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQVTPLSGV+NENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVAS IDAVLL
Sbjct: 1   MGTSVQVTPLSGVYNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASKIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+ D LHLGALP+AMKQ GL+APVFSTEPVYRLGLLTMYDQ  SR+ VSEFDLF+LDDID
Sbjct: 61  SYGDMLHLGALPFAMKQFGLNAPVFSTEPVYRLGLLTMYDQSFSRKAVSEFDLFSLDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           SAFQ+ TRLTYSQN+HLSGKGEGIV+APHVAGHLLGGTVWKITKDGEDV+YAVD+N RKE
Sbjct: 121 SAFQNFTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTVWKITKDGEDVVYAVDFNHRKE 180

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAG 239
           +HLNGTVLESF RPAVLITDAYNAL++QP RQQR+  F + I KTL  GGNVLLPVDSAG
Sbjct: 181 RHLNGTVLESFYRPAVLITDAYNALNSQPSRQQRDKQFLETILKTLEGGGNVLLPVDSAG 240

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
           RVLELLLILE +W +  LNYPI+FL+YVSSSTIDY+KSFLEWM DSI KSFETSRDNAFL
Sbjct: 241 RVLELLLILEQFWGQRFLNYPIFFLSYVSSSTIDYIKSFLEWMSDSIAKSFETSRDNAFL 300

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
           +KHVTLLI+K ELDNA  GPK+VLAS+ASLEAGFSHDIF EWA+DVKNLVLFTERGQFGT
Sbjct: 301 MKHVTLLISKDELDNASTGPKVVLASVASLEAGFSHDIFAEWAADVKNLVLFTERGQFGT 360

Query: 360 LARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLG 419
           LARMLQADPPPKAVK+TMSRRVPLVG+ELIAYEEEQ RLK+EE LKASL+KEEESK S G
Sbjct: 361 LARMLQADPPPKAVKMTMSRRVPLVGDELIAYEEEQKRLKREEELKASLIKEEESKVSHG 420

Query: 420 PDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWD 479
           PDNNLS DPMVID+ N ++  DVV   G  +RDILIDGFVPPSTSVAPMFPFYEN+ EWD
Sbjct: 421 PDNNLS-DPMVIDSGNTHSPLDVVGSRGSGHRDILIDGFVPPSTSVAPMFPFYENSLEWD 479

Query: 480 DFGEVINPDDYIIKDEDMDQAAMHIGGD-DGKLDEGSASLILDAKPSKVVSNELT----- 533
           +FGEVINPDDY+++DEDMDQAAMH+G D DGKLDEGSASLILD KPSKVVSNELT     
Sbjct: 480 EFGEVINPDDYVVQDEDMDQAAMHVGADIDGKLDEGSASLILDTKPSKVVSNELTVQVKC 539

Query: 534 ----------------------------VLVHGSAEATEHLKQHCLKHVCPHVYTPQIEE 565
                                       V+VHGSAEATEHLKQH L      VY PQIEE
Sbjct: 540 SLIYMDYEGRSDGRSIKSILTHVAPLKLVMVHGSAEATEHLKQHFLNIKNVQVYAPQIEE 599

Query: 566 TIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPPH 625
           TIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYE+AWVDAEVGKTENGMLSLLPIS+PAPPH
Sbjct: 600 TIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEVAWVDAEVGKTENGMLSLLPISSPAPPH 659

Query: 626 KSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIE 685
           KSVLVGDLKMAD K FL+SKG+QVEFAGGALRCGEYVT+RKVG   QKGG SGTQQI+IE
Sbjct: 660 KSVLVGDLKMADFKQFLASKGVQVEFAGGALRCGEYVTLRKVGNPSQKGGASGTQQIIIE 719

Query: 686 GPLCEDYYKIRAYLYSQFYLL 706
           GPLCEDYYKIR YLYSQFYLL
Sbjct: 720 GPLCEDYYKIREYLYSQFYLL 740


>gi|356530856|ref|XP_003533995.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like isoform 1 [Glycine max]
          Length = 736

 Score = 1217 bits (3149), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 603/741 (81%), Positives = 650/741 (87%), Gaps = 40/741 (5%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQVTPL GV+NENPLSYLVSIDGFNFL+DCGWNDHFDPS LQPL++VASTIDAVLL
Sbjct: 1   MGTSVQVTPLCGVYNENPLSYLVSIDGFNFLVDCGWNDHFDPSHLQPLARVASTIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SH DTLHLGALPYAMK+LGLSAPV+STEPVYRLGLLTMYDQYLSR+QVSEFDLFTLDDID
Sbjct: 61  SHADTLHLGALPYAMKRLGLSAPVYSTEPVYRLGLLTMYDQYLSRKQVSEFDLFTLDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           SAFQSVTRLTYSQN+H SGKGEGIV+APHVAGHLLGGT+WKITKDGEDVIYAVD+N RKE
Sbjct: 121 SAFQSVTRLTYSQNHHFSGKGEGIVIAPHVAGHLLGGTIWKITKDGEDVIYAVDFNHRKE 180

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAG 239
           +HLNGTVL SFVRPAVLITDAYNAL+NQP R+Q +  F D + KTLRAGGNVLLPVD+ G
Sbjct: 181 RHLNGTVLGSFVRPAVLITDAYNALNNQPYRRQNDKEFGDILKKTLRAGGNVLLPVDTVG 240

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
           RVLEL+L+LE YWA+ +LNYPIYFLTYV+SSTIDYVKSFLEWM D+I KSFE +R+N FL
Sbjct: 241 RVLELILMLELYWADENLNYPIYFLTYVASSTIDYVKSFLEWMSDTIAKSFEKTRENIFL 300

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
           LK+VTLLINK+ELDNAPDGPK+VLASMASLEAGFSHDIFVEWA+DVKNLVLFTERGQF T
Sbjct: 301 LKYVTLLINKTELDNAPDGPKVVLASMASLEAGFSHDIFVEWANDVKNLVLFTERGQFAT 360

Query: 360 LARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLG 419
           LARMLQADPPPKAVKV +S+RVPLVGEELIAYEEEQ R+KK EALKASL+KEEE K S G
Sbjct: 361 LARMLQADPPPKAVKVVVSKRVPLVGEELIAYEEEQNRIKK-EALKASLMKEEELKTSHG 419

Query: 420 PDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWD 479
            DN++S DPMVID+ N +   DV  P GG YRDI IDGFVPPSTSVAP+FP YEN SEWD
Sbjct: 420 ADNDIS-DPMVIDSGNNH---DVTGPRGGGYRDIFIDGFVPPSTSVAPIFPCYENTSEWD 475

Query: 480 DFGEVINPDDYIIKDEDMDQAAMHIGGD-DGKLDEGSASLILDAKPSKVVSNELT----- 533
           DFGEVINPDDY+IKDEDMDQ AMH G D +GKLDEG+ASLILD KPSKVVS+E T     
Sbjct: 476 DFGEVINPDDYVIKDEDMDQTAMHGGSDINGKLDEGAASLILDTKPSKVVSDERTVQVRC 535

Query: 534 ----------------------------VLVHGSAEATEHLKQHCLKHVCPHVYTPQIEE 565
                                       VLVHGSAEATEHLKQHCLKHVCPHVY PQIEE
Sbjct: 536 SLVYMDFEGRSDGRSIKNILSHVAPLKLVLVHGSAEATEHLKQHCLKHVCPHVYAPQIEE 595

Query: 566 TIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPPH 625
           TIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA VGKTEN  LSLLP+S  APPH
Sbjct: 596 TIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAVVGKTENDPLSLLPVSGAAPPH 655

Query: 626 KSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIE 685
           KSVLVGDLK+AD+K FLSSKG+QVEFAGGALRCGEYVT+RKVG A QKGGGSG QQIVIE
Sbjct: 656 KSVLVGDLKLADIKQFLSSKGVQVEFAGGALRCGEYVTLRKVGDASQKGGGSGAQQIVIE 715

Query: 686 GPLCEDYYKIRAYLYSQFYLL 706
           GPLCEDYYKIR YLYSQFYLL
Sbjct: 716 GPLCEDYYKIRDYLYSQFYLL 736


>gi|356530858|ref|XP_003533996.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like isoform 2 [Glycine max]
          Length = 742

 Score = 1209 bits (3129), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 601/744 (80%), Positives = 650/744 (87%), Gaps = 40/744 (5%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQVTPL GV+NENPLSYLVSIDGFNFL+DCGWNDHFDPS LQPL++VASTIDAVLL
Sbjct: 1   MGTSVQVTPLCGVYNENPLSYLVSIDGFNFLVDCGWNDHFDPSHLQPLARVASTIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SH DTLHLGALPYAMK+LGLSAPV+STEPVYRLGLLTMYDQYLSR+QVSEFDLFTLDDID
Sbjct: 61  SHADTLHLGALPYAMKRLGLSAPVYSTEPVYRLGLLTMYDQYLSRKQVSEFDLFTLDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           SAFQSVTRLTYSQN+H SGKGEGIV+APHVAGHLLGGT+WKITKDGEDVIYAVD+N RKE
Sbjct: 121 SAFQSVTRLTYSQNHHFSGKGEGIVIAPHVAGHLLGGTIWKITKDGEDVIYAVDFNHRKE 180

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQ--REMFQDAIS--KTLRAGGNVLLPVD 236
           +HLNGTVL SFVRPAVLITDAYNAL+NQP R+Q  +E   + +   KTLRAGGNVLLPVD
Sbjct: 181 RHLNGTVLGSFVRPAVLITDAYNALNNQPYRRQNDKEFGGNHLFNLKTLRAGGNVLLPVD 240

Query: 237 SAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN 296
           + GRVLEL+L+LE YWA+ +LNYPIYFLTYV+SSTIDYVKSFLEWM D+I KSFE +R+N
Sbjct: 241 TVGRVLELILMLELYWADENLNYPIYFLTYVASSTIDYVKSFLEWMSDTIAKSFEKTREN 300

Query: 297 AFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQ 356
            FLLK+VTLLINK+ELDNAPDGPK+VLASMASLEAGFSHDIFVEWA+DVKNLVLFTERGQ
Sbjct: 301 IFLLKYVTLLINKTELDNAPDGPKVVLASMASLEAGFSHDIFVEWANDVKNLVLFTERGQ 360

Query: 357 FGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKA 416
           F TLARMLQADPPPKAVKV +S+RVPLVGEELIAYEEEQ R+KK EALKASL+KEEE K 
Sbjct: 361 FATLARMLQADPPPKAVKVVVSKRVPLVGEELIAYEEEQNRIKK-EALKASLMKEEELKT 419

Query: 417 SLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNS 476
           S G DN++S DPMVID+ N +   +V  P GG YRDI IDGFVPPSTSVAP+FP YEN S
Sbjct: 420 SHGADNDIS-DPMVIDSGNNHVPPEVTGPRGGGYRDIFIDGFVPPSTSVAPIFPCYENTS 478

Query: 477 EWDDFGEVINPDDYIIKDEDMDQAAMHIGGD-DGKLDEGSASLILDAKPSKVVSNELT-- 533
           EWDDFGEVINPDDY+IKDEDMDQ AMH G D +GKLDEG+ASLILD KPSKVVS+E T  
Sbjct: 479 EWDDFGEVINPDDYVIKDEDMDQTAMHGGSDINGKLDEGAASLILDTKPSKVVSDERTVQ 538

Query: 534 -------------------------------VLVHGSAEATEHLKQHCLKHVCPHVYTPQ 562
                                          VLVHGSAEATEHLKQHCLKHVCPHVY PQ
Sbjct: 539 VRCSLVYMDFEGRSDGRSIKNILSHVAPLKLVLVHGSAEATEHLKQHCLKHVCPHVYAPQ 598

Query: 563 IEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPA 622
           IEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA VGKTEN  LSLLP+S  A
Sbjct: 599 IEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAVVGKTENDPLSLLPVSGAA 658

Query: 623 PPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQI 682
           PPHKSVLVGDLK+AD+K FLSSKG+QVEFAGGALRCGEYVT+RKVG A QKGGGSG QQI
Sbjct: 659 PPHKSVLVGDLKLADIKQFLSSKGVQVEFAGGALRCGEYVTLRKVGDASQKGGGSGAQQI 718

Query: 683 VIEGPLCEDYYKIRAYLYSQFYLL 706
           VIEGPLCEDYYKIR YLYSQFYLL
Sbjct: 719 VIEGPLCEDYYKIRDYLYSQFYLL 742


>gi|356559788|ref|XP_003548179.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like isoform 1 [Glycine max]
          Length = 738

 Score = 1208 bits (3125), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 595/740 (80%), Positives = 643/740 (86%), Gaps = 36/740 (4%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQVTPL GV+NENPLSYLVSIDGFNFL+DCGWNDHFDPSLLQPL++VASTIDAVLL
Sbjct: 1   MGTSVQVTPLCGVYNENPLSYLVSIDGFNFLVDCGWNDHFDPSLLQPLARVASTIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SH DTLHLGALPYAMKQLGLSAPV+STEPVYRLGLLTMYDQYLSR+QVSEFDLFTLDDID
Sbjct: 61  SHADTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQYLSRKQVSEFDLFTLDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           S+FQSVTRLTYSQN+H SGKGEGIV+APHVAGHLLGGT+WKITKDGEDVIYAVD+N RKE
Sbjct: 121 SSFQSVTRLTYSQNHHFSGKGEGIVIAPHVAGHLLGGTIWKITKDGEDVIYAVDFNHRKE 180

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAG 239
           +HLNGTVL SFVRPAVLITDAYNAL+NQP R+Q +  F D + KTLR GGNVLLPVD+ G
Sbjct: 181 RHLNGTVLGSFVRPAVLITDAYNALNNQPYRRQNDKEFGDILKKTLREGGNVLLPVDTVG 240

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
           RVLEL+L+LE YW + +LNYPIYFLTYV+SSTIDYVKSFLEWM D+I KSFE +R+N FL
Sbjct: 241 RVLELILMLESYWTDENLNYPIYFLTYVASSTIDYVKSFLEWMSDTIAKSFEKTRENIFL 300

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
           LK+VTLLINK+ELDNAPDGPK+VLASMASLEAGFSH+IFVEWA+DVKNLVLFTERGQF T
Sbjct: 301 LKYVTLLINKTELDNAPDGPKVVLASMASLEAGFSHEIFVEWANDVKNLVLFTERGQFAT 360

Query: 360 LARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLG 419
           LARMLQADPPPKAVKV +S+RV LVGEELIAYEEEQ R+KK EALKASL+KEEE K S G
Sbjct: 361 LARMLQADPPPKAVKVVVSKRVALVGEELIAYEEEQNRIKK-EALKASLMKEEEFKTSHG 419

Query: 420 PDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWD 479
            DNN S D MVID+ N +   +V  P GG YRDI IDGFVPP TSVAPMFP YEN SEWD
Sbjct: 420 ADNNTS-DSMVIDSGNNHVPPEVSGPRGGGYRDIFIDGFVPPLTSVAPMFPCYENTSEWD 478

Query: 480 DFGEVINPDDYIIKDEDMDQAAMHIGGDDGKLDEGSASLILDAKPSKVVSNELT------ 533
           DFGEVINPDDY+IKDEDMDQ AMH G  +GKLDEG+ASLILD KPSKVVS+E T      
Sbjct: 479 DFGEVINPDDYVIKDEDMDQTAMHGGDINGKLDEGAASLILDTKPSKVVSDERTVQVRCS 538

Query: 534 ---------------------------VLVHGSAEATEHLKQHCLKHVCPHVYTPQIEET 566
                                      VLVHGSAEATEHLKQHCLKHVCPHVY PQ+EET
Sbjct: 539 LVYMDFEGRSDGRSIKNILSHVAPLKLVLVHGSAEATEHLKQHCLKHVCPHVYAPQLEET 598

Query: 567 IDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPPHK 626
           IDVTSDLCAYKV LSEKLMSNVLFKKLGDYE+AWVDA VGKTEN  LSLLP+S  APPHK
Sbjct: 599 IDVTSDLCAYKVLLSEKLMSNVLFKKLGDYELAWVDAVVGKTENDPLSLLPVSGAAPPHK 658

Query: 627 SVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEG 686
           SVLVGDLK+AD+K FLSSKG+QVEFAGGALRCGEYVT+RKVG A QKGGGSG QQIVIEG
Sbjct: 659 SVLVGDLKLADIKQFLSSKGVQVEFAGGALRCGEYVTLRKVGDASQKGGGSGAQQIVIEG 718

Query: 687 PLCEDYYKIRAYLYSQFYLL 706
           PLCEDYYKIR YLYSQFYLL
Sbjct: 719 PLCEDYYKIRDYLYSQFYLL 738


>gi|225464483|ref|XP_002268591.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 [Vitis vinifera]
 gi|302143847|emb|CBI22708.3| unnamed protein product [Vitis vinifera]
          Length = 740

 Score = 1201 bits (3108), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 596/741 (80%), Positives = 647/741 (87%), Gaps = 36/741 (4%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQVTPL GV+NENPLSYLVSIDGFNFL+DCGWNDHFDPS LQPL++VASTIDAVLL
Sbjct: 1   MGTSVQVTPLCGVYNENPLSYLVSIDGFNFLVDCGWNDHFDPSFLQPLARVASTIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           +HPDTLHLGALPYAMKQLGLSAPV+STEPVYRLGLLTMYDQYLSR+QVS+FDLFTLDDID
Sbjct: 61  AHPDTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQYLSRKQVSDFDLFTLDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           SAFQ+VTRLTYSQNYHL GKGEGIV+APHVAGHLLGGTVWKITKDGEDVIYAVD+N RKE
Sbjct: 121 SAFQNVTRLTYSQNYHLFGKGEGIVIAPHVAGHLLGGTVWKITKDGEDVIYAVDFNHRKE 180

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAG 239
           + LNGTVLESFVRPAVLITDAYNAL+NQP R+QR+  F D I KTLR  GNVLLPVD+AG
Sbjct: 181 RLLNGTVLESFVRPAVLITDAYNALNNQPSRRQRDQEFLDVILKTLRGDGNVLLPVDTAG 240

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
           RVLEL+LILE YW +H LNYPI+FLTYV+SSTIDYVKSFLEWM DSI KSFE +RDNAFL
Sbjct: 241 RVLELMLILEQYWTQHHLNYPIFFLTYVASSTIDYVKSFLEWMSDSIAKSFEHTRDNAFL 300

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
           LKHVTLLI+KSEL+  PDGPK+VLASMASLEAGFSHDIFVEWA+D KNLVLF+ERGQF T
Sbjct: 301 LKHVTLLISKSELEKVPDGPKIVLASMASLEAGFSHDIFVEWATDAKNLVLFSERGQFAT 360

Query: 360 LARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLG 419
           LARMLQADPPPKAVKVTMS+RVPLVGEEL AYEEEQ R+KKEEALKASL KE+E KAS G
Sbjct: 361 LARMLQADPPPKAVKVTMSKRVPLVGEELAAYEEEQERIKKEEALKASLSKEDEMKASRG 420

Query: 420 PDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWD 479
            DN L GDPMVID     AS+DV  PH G +RDILIDGFVPPSTSVAPMFPFYEN+SEWD
Sbjct: 421 SDNKL-GDPMVIDTTTPPASSDVAVPHVGGHRDILIDGFVPPSTSVAPMFPFYENSSEWD 479

Query: 480 DFGEVINPDDYIIKDEDMDQAAMHIGGD-DGKLDEGSASLILDAKPSKVVSNELT----- 533
           DFGEVINP+DY+IKDEDMDQA M +G D +GKLDEG+ASLI D  PSKV+SNELT     
Sbjct: 480 DFGEVINPEDYVIKDEDMDQATMQVGDDLNGKLDEGAASLIFDTTPSKVISNELTVQVKC 539

Query: 534 ----------------------------VLVHGSAEATEHLKQHCLKHVCPHVYTPQIEE 565
                                       VLVHGSAEATEHLKQHCLKHVCPHVY PQI E
Sbjct: 540 MLVYMDFEGRSDGRSIKSILSHVAPLKLVLVHGSAEATEHLKQHCLKHVCPHVYAPQIGE 599

Query: 566 TIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPPH 625
           TIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYE+AWVDAEVGKTE+G LSLLP+STP P H
Sbjct: 600 TIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEVAWVDAEVGKTESGSLSLLPLSTPPPSH 659

Query: 626 KSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIE 685
            +V VGD+KMAD K FL+SKGIQVEF+GGALRCGEYVT+RKVG A QKGGG+  QQIV+E
Sbjct: 660 DTVFVGDIKMADFKQFLASKGIQVEFSGGALRCGEYVTLRKVGDASQKGGGAIIQQIVME 719

Query: 686 GPLCEDYYKIRAYLYSQFYLL 706
           GPLC++YYKIR YLYSQ+YLL
Sbjct: 720 GPLCDEYYKIREYLYSQYYLL 740


>gi|356559790|ref|XP_003548180.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like isoform 2 [Glycine max]
          Length = 743

 Score = 1201 bits (3107), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 595/746 (79%), Positives = 643/746 (86%), Gaps = 43/746 (5%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQVTPL GV+NENPLSYLVSIDGFNFL+DCGWNDHFDPSLLQPL++VASTIDAVLL
Sbjct: 1   MGTSVQVTPLCGVYNENPLSYLVSIDGFNFLVDCGWNDHFDPSLLQPLARVASTIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SH DTLHLGALPYAMKQLGLSAPV+STEPVYRLGLLTMYDQYLSR+QVSEFDLFTLDDID
Sbjct: 61  SHADTLHLGALPYAMKQLGLSAPVYSTEPVYRLGLLTMYDQYLSRKQVSEFDLFTLDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           S+FQSVTRLTYSQN+H SGKGEGIV+APHVAGHLLGGT+WKITKDGEDVIYAVD+N RKE
Sbjct: 121 SSFQSVTRLTYSQNHHFSGKGEGIVIAPHVAGHLLGGTIWKITKDGEDVIYAVDFNHRKE 180

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-------MFQDAISKTLRAGGNVLL 233
           +HLNGTVL SFVRPAVLITDAYNAL+NQP R+Q +       +F   I KTLR GGNVLL
Sbjct: 181 RHLNGTVLGSFVRPAVLITDAYNALNNQPYRRQNDKEFGGNHLFNLVI-KTLREGGNVLL 239

Query: 234 PVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
           PVD+ GRVLEL+L+LE YW + +LNYPIYFLTYV+SSTIDYVKSFLEWM D+I KSFE +
Sbjct: 240 PVDTVGRVLELILMLESYWTDENLNYPIYFLTYVASSTIDYVKSFLEWMSDTIAKSFEKT 299

Query: 294 RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
           R+N FLLK+VTLLINK+ELDNAPDGPK+VLASMASLEAGFSH+IFVEWA+DVKNLVLFTE
Sbjct: 300 RENIFLLKYVTLLINKTELDNAPDGPKVVLASMASLEAGFSHEIFVEWANDVKNLVLFTE 359

Query: 354 RGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEE 413
           RGQF TLARMLQADPPPKAVKV +S+RV LVGEELIAYEEEQ R+KK EALKASL+KEEE
Sbjct: 360 RGQFATLARMLQADPPPKAVKVVVSKRVALVGEELIAYEEEQNRIKK-EALKASLMKEEE 418

Query: 414 SKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYE 473
            K S G DNN S D MVID+ N +   +V  P GG YRDI IDGFVPP TSVAPMFP YE
Sbjct: 419 FKTSHGADNNTS-DSMVIDSGNNHVPPEVSGPRGGGYRDIFIDGFVPPLTSVAPMFPCYE 477

Query: 474 NNSEWDDFGEVINPDDYIIKDEDMDQAAMHIGGDDGKLDEGSASLILDAKPSKVVSNELT 533
           N SEWDDFGEVINPDDY+IKDEDMDQ AMH G  +GKLDEG+ASLILD KPSKVVS+E T
Sbjct: 478 NTSEWDDFGEVINPDDYVIKDEDMDQTAMHGGDINGKLDEGAASLILDTKPSKVVSDERT 537

Query: 534 ---------------------------------VLVHGSAEATEHLKQHCLKHVCPHVYT 560
                                            VLVHGSAEATEHLKQHCLKHVCPHVY 
Sbjct: 538 VQVRCSLVYMDFEGRSDGRSIKNILSHVAPLKLVLVHGSAEATEHLKQHCLKHVCPHVYA 597

Query: 561 PQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPIST 620
           PQ+EETIDVTSDLCAYKV LSEKLMSNVLFKKLGDYE+AWVDA VGKTEN  LSLLP+S 
Sbjct: 598 PQLEETIDVTSDLCAYKVLLSEKLMSNVLFKKLGDYELAWVDAVVGKTENDPLSLLPVSG 657

Query: 621 PAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQ 680
            APPHKSVLVGDLK+AD+K FLSSKG+QVEFAGGALRCGEYVT+RKVG A QKGGGSG Q
Sbjct: 658 AAPPHKSVLVGDLKLADIKQFLSSKGVQVEFAGGALRCGEYVTLRKVGDASQKGGGSGAQ 717

Query: 681 QIVIEGPLCEDYYKIRAYLYSQFYLL 706
           QIVIEGPLCEDYYKIR YLYSQFYLL
Sbjct: 718 QIVIEGPLCEDYYKIRDYLYSQFYLL 743


>gi|297808393|ref|XP_002872080.1| CPSF100 [Arabidopsis lyrata subsp. lyrata]
 gi|297317917|gb|EFH48339.1| CPSF100 [Arabidopsis lyrata subsp. lyrata]
          Length = 739

 Score = 1175 bits (3039), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 576/743 (77%), Positives = 643/743 (86%), Gaps = 41/743 (5%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQVTPLSGV+NENPLSYLVSIDGFNFLIDCGWND FD SLL+PLS+VAS+IDAVLL
Sbjct: 1   MGTSVQVTPLSGVYNENPLSYLVSIDGFNFLIDCGWNDLFDTSLLEPLSRVASSIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPDTLHLGALPYAMKQLGLSAPV++TEPV+RLGLLTMYDQ+LSR+QVS+FDLFTLDDID
Sbjct: 61  SHPDTLHLGALPYAMKQLGLSAPVYATEPVHRLGLLTMYDQFLSRKQVSDFDLFTLDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           SAFQ+V RLTYSQNYHLSGKGEGIV+APHVAGH+LGG++W+ITKDGEDVIYAVDYN RKE
Sbjct: 121 SAFQNVIRLTYSQNYHLSGKGEGIVIAPHVAGHMLGGSIWRITKDGEDVIYAVDYNHRKE 180

Query: 181 KHLNGTVLESFVRPAVLITDAYNALH-NQPPRQQREM-FQDAISKTLRAGGNVLLPVDSA 238
           +HLNGTVL+SFVRPAVLITDAY+AL+ NQ  RQQR+  F D ISK L  GGNVLLPVD+A
Sbjct: 181 RHLNGTVLQSFVRPAVLITDAYHALYTNQTARQQRDKEFLDTISKHLEVGGNVLLPVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAF 298
           GRVLELLLILE +W++   ++PIYFLTYVSSSTIDYVKSFLEWM DSI+KSFETSRDNAF
Sbjct: 241 GRVLELLLILEQHWSQRGFSFPIYFLTYVSSSTIDYVKSFLEWMSDSISKSFETSRDNAF 300

Query: 299 LLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFG 358
           LL+HVTLLINK++LDNAP GPK+VLASMASLEAGF+ +IFVEWA+D +NLVLFTE GQFG
Sbjct: 301 LLRHVTLLINKTDLDNAPPGPKVVLASMASLEAGFAREIFVEWANDPRNLVLFTETGQFG 360

Query: 359 TLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASL 418
           TLARMLQ+ PPPK VKVTMS+RVPL GEELIAYEEEQ RLK+EEAL+ASLVKEEE+KAS 
Sbjct: 361 TLARMLQSAPPPKFVKVTMSKRVPLAGEELIAYEEEQNRLKREEALRASLVKEEETKASH 420

Query: 419 GPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEW 478
           G D+N S +PMVID    +   DVV  HG  Y+DILIDGFVPPS+SVAPMFPFY+N SEW
Sbjct: 421 GSDDN-SSEPMVIDTKTTH---DVVGSHGPAYKDILIDGFVPPSSSVAPMFPFYDNTSEW 476

Query: 479 DDFGEVINPDDYIIKDEDMDQAAMHIGGD-DGKLDEGSASLILDAKPSKVVSNELT---- 533
           DDFGE+INPDDY+IKDEDMD+ AMH GGD DG+LDE +ASL+LD +PSKV+SNEL     
Sbjct: 477 DDFGEIINPDDYVIKDEDMDRGAMHNGGDVDGRLDEATASLMLDTRPSKVISNELIVTVS 536

Query: 534 -----------------------------VLVHGSAEATEHLKQHCLKHVCPHVYTPQIE 564
                                        VLVH  AEATEHLKQHCL ++CPHVY PQIE
Sbjct: 537 CSLVKMDYEGRSDGRSIKSMIAHVSPLKLVLVHAIAEATEHLKQHCLNNICPHVYAPQIE 596

Query: 565 ETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPP 624
           ET+DVTSDLCAYKVQLSEKLMSNV+FKKLGD E+AWVD+EVGKTE+ M SLLP+S  A P
Sbjct: 597 ETVDVTSDLCAYKVQLSEKLMSNVIFKKLGDSEVAWVDSEVGKTESDMRSLLPMSGAASP 656

Query: 625 HKSVLVGDLKMADLKPFLSSKGIQVEFA-GGALRCGEYVTIRKVGPAGQKGGGSGTQQIV 683
           HK VLVGDLK+AD K FLSSKG+QVEFA GGALRCGEYVT+RKVGP GQKGG SG QQI+
Sbjct: 657 HKPVLVGDLKIADFKQFLSSKGVQVEFAGGGALRCGEYVTLRKVGPTGQKGGASGPQQIL 716

Query: 684 IEGPLCEDYYKIRAYLYSQFYLL 706
           IEGPLCEDYYKIR YLYSQFYLL
Sbjct: 717 IEGPLCEDYYKIRDYLYSQFYLL 739


>gi|15237845|ref|NP_197776.1| cleavage and polyadenylation specificity factor subunit 2
           [Arabidopsis thaliana]
 gi|18203240|sp|Q9LKF9.2|CPSF2_ARATH RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 2; AltName: Full=Cleavage and polyadenylation
           specificity factor 100 kDa subunit; Short=AtCPSF100;
           Short=CPSF 100 kDa subunit; AltName: Full=Protein EMBRYO
           DEFECTIVE 1265; AltName: Full=Protein ENHANCED SILENCING
           PHENOTYPE 5
 gi|10176855|dbj|BAB10061.1| cleavage and polyadenylation specificity factor [Arabidopsis
           thaliana]
 gi|14334618|gb|AAK59487.1| putative cleavage and polyadenylation specificity factor
           [Arabidopsis thaliana]
 gi|28393921|gb|AAO42368.1| putative cleavage and polyadenylation specificity factor
           [Arabidopsis thaliana]
 gi|332005845|gb|AED93228.1| cleavage and polyadenylation specificity factor subunit 2
           [Arabidopsis thaliana]
          Length = 739

 Score = 1167 bits (3018), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 571/743 (76%), Positives = 640/743 (86%), Gaps = 41/743 (5%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQVTPL GV+NENPLSYLVSIDGFNFLIDCGWND FD SLL+PLS+VASTIDAVLL
Sbjct: 1   MGTSVQVTPLCGVYNENPLSYLVSIDGFNFLIDCGWNDLFDTSLLEPLSRVASTIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPDTLH+GALPYAMKQLGLSAPV++TEPV+RLGLLTMYDQ+LSR+QVS+FDLFTLDDID
Sbjct: 61  SHPDTLHIGALPYAMKQLGLSAPVYATEPVHRLGLLTMYDQFLSRKQVSDFDLFTLDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           SAFQ+V RLTYSQNYHLSGKGEGIV+APHVAGH+LGG++W+ITKDGEDVIYAVDYN RKE
Sbjct: 121 SAFQNVIRLTYSQNYHLSGKGEGIVIAPHVAGHMLGGSIWRITKDGEDVIYAVDYNHRKE 180

Query: 181 KHLNGTVLESFVRPAVLITDAYNALH-NQPPRQQREM-FQDAISKTLRAGGNVLLPVDSA 238
           +HLNGTVL+SFVRPAVLITDAY+AL+ NQ  RQQR+  F D ISK L  GGNVLLPVD+A
Sbjct: 181 RHLNGTVLQSFVRPAVLITDAYHALYTNQTARQQRDKEFLDTISKHLEVGGNVLLPVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAF 298
           GRVLELLLILE +W++   ++PIYFLTYVSSSTIDYVKSFLEWM DSI+KSFETSRDNAF
Sbjct: 241 GRVLELLLILEQHWSQRGFSFPIYFLTYVSSSTIDYVKSFLEWMSDSISKSFETSRDNAF 300

Query: 299 LLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFG 358
           LL+HVTLLINK++LDNAP GPK+VLASMASLEAGF+ +IFVEWA+D +NLVLFTE GQFG
Sbjct: 301 LLRHVTLLINKTDLDNAPPGPKVVLASMASLEAGFAREIFVEWANDPRNLVLFTETGQFG 360

Query: 359 TLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASL 418
           TLARMLQ+ PPPK VKVTMS+RVPL GEELIAYEEEQ RLK+EEAL+ASLVKEEE+KAS 
Sbjct: 361 TLARMLQSAPPPKFVKVTMSKRVPLAGEELIAYEEEQNRLKREEALRASLVKEEETKASH 420

Query: 419 GPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEW 478
           G D+N S +PM+ID    +   DV+  HG  Y+DILIDGFVPPS+SVAPMFP+Y+N SEW
Sbjct: 421 GSDDN-SSEPMIIDTKTTH---DVIGSHGPAYKDILIDGFVPPSSSVAPMFPYYDNTSEW 476

Query: 479 DDFGEVINPDDYIIKDEDMDQAAMHIGGD-DGKLDEGSASLILDAKPSKVVSNELT---- 533
           DDFGE+INPDDY+IKDEDMD+ AMH GGD DG+LDE +ASL+LD +PSKV+SNEL     
Sbjct: 477 DDFGEIINPDDYVIKDEDMDRGAMHNGGDVDGRLDEATASLMLDTRPSKVMSNELIVTVS 536

Query: 534 -----------------------------VLVHGSAEATEHLKQHCLKHVCPHVYTPQIE 564
                                        VLVH  AEATEHLKQHCL ++CPHVY PQIE
Sbjct: 537 CSLVKMDYEGRSDGRSIKSMIAHVSPLKLVLVHAIAEATEHLKQHCLNNICPHVYAPQIE 596

Query: 565 ETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPP 624
           ET+DVTSDLCAYKVQLSEKLMSNV+FKKLGD E+AWVD+EVGKTE  M SLLP+   A P
Sbjct: 597 ETVDVTSDLCAYKVQLSEKLMSNVIFKKLGDSEVAWVDSEVGKTERDMRSLLPMPGAASP 656

Query: 625 HKSVLVGDLKMADLKPFLSSKGIQVEFA-GGALRCGEYVTIRKVGPAGQKGGGSGTQQIV 683
           HK VLVGDLK+AD K FLSSKG+QVEFA GGALRCGEYVT+RKVGP GQKGG SG QQI+
Sbjct: 657 HKPVLVGDLKIADFKQFLSSKGVQVEFAGGGALRCGEYVTLRKVGPTGQKGGASGPQQIL 716

Query: 684 IEGPLCEDYYKIRAYLYSQFYLL 706
           IEGPLCEDYYKIR YLYSQFYLL
Sbjct: 717 IEGPLCEDYYKIRDYLYSQFYLL 739


>gi|9082326|gb|AAF82809.1|AF283277_1 polyadenylation cleavage/specificity factor 100 kDa subunit
           [Arabidopsis thaliana]
          Length = 739

 Score = 1164 bits (3012), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 571/743 (76%), Positives = 639/743 (86%), Gaps = 41/743 (5%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQVTPL GV+NENPLSYLVSIDGFNFLIDCGWND FD SLL+PL +VASTIDAVLL
Sbjct: 1   MGTSVQVTPLCGVYNENPLSYLVSIDGFNFLIDCGWNDLFDTSLLEPLPRVASTIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPDTLH+GALPYAMKQLGLSAPV++TEPV+RLGLLTMYDQ+LSR+QVS+FDLFTLDDID
Sbjct: 61  SHPDTLHIGALPYAMKQLGLSAPVYATEPVHRLGLLTMYDQFLSRKQVSDFDLFTLDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           SAFQ+V RLTYSQNYHLSGKGEGIV+APHVAGH+LGG++W+ITKDGEDVIYAVDYN RKE
Sbjct: 121 SAFQNVIRLTYSQNYHLSGKGEGIVIAPHVAGHMLGGSIWRITKDGEDVIYAVDYNHRKE 180

Query: 181 KHLNGTVLESFVRPAVLITDAYNALH-NQPPRQQREM-FQDAISKTLRAGGNVLLPVDSA 238
           +HLNGTVL+SFVRPAVLITDAY+AL+ NQ  RQQR+  F D ISK L  GGNVLLPVD+A
Sbjct: 181 RHLNGTVLQSFVRPAVLITDAYHALYTNQTARQQRDKEFLDTISKHLEVGGNVLLPVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAF 298
           GRVLELLLILE +W++   ++PIYFLTYVSSSTIDYVKSFLEWM DSI+KSFETSRDNAF
Sbjct: 241 GRVLELLLILEQHWSQRGFSFPIYFLTYVSSSTIDYVKSFLEWMSDSISKSFETSRDNAF 300

Query: 299 LLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFG 358
           LL+HVTLLINK++LDNAP GPK+VLASMASLEAGF+ +IFVEWA+D +NLVLFTE GQFG
Sbjct: 301 LLRHVTLLINKTDLDNAPPGPKVVLASMASLEAGFAREIFVEWANDPRNLVLFTETGQFG 360

Query: 359 TLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASL 418
           TLARMLQ+ PPPK VKVTMS+RVPL GEELIAYEEEQ RLK+EEAL+ASLVKEEE+KAS 
Sbjct: 361 TLARMLQSAPPPKFVKVTMSKRVPLAGEELIAYEEEQNRLKREEALRASLVKEEETKASH 420

Query: 419 GPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEW 478
           G D+N S +PM+ID    +   DVV  HG  Y+DILIDGFVPPS+SVAPMFP+Y+N SEW
Sbjct: 421 GSDDN-SSEPMIIDTKTTH---DVVGSHGPAYKDILIDGFVPPSSSVAPMFPYYDNTSEW 476

Query: 479 DDFGEVINPDDYIIKDEDMDQAAMHIGGD-DGKLDEGSASLILDAKPSKVVSNELT---- 533
           DDFGE+INPDDY+IKDEDMD+ AMH GGD DG+LDE +ASL+LD +PSKV+SNEL     
Sbjct: 477 DDFGEIINPDDYVIKDEDMDRGAMHNGGDVDGRLDEATASLMLDTRPSKVMSNELIVTVS 536

Query: 534 -----------------------------VLVHGSAEATEHLKQHCLKHVCPHVYTPQIE 564
                                        VLVH  AEATEHLKQHCL ++CPHVY PQIE
Sbjct: 537 CSLVKMDYEGRSDGRSIKSMIAHVSPLKLVLVHAIAEATEHLKQHCLNNICPHVYAPQIE 596

Query: 565 ETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPP 624
           ET+DVTSDLCAYKVQLSEKLMSNV+FKKLGD E+AWVD+EVGKTE  M SLLP+   A P
Sbjct: 597 ETVDVTSDLCAYKVQLSEKLMSNVIFKKLGDSEVAWVDSEVGKTERDMRSLLPMPGAASP 656

Query: 625 HKSVLVGDLKMADLKPFLSSKGIQVEFA-GGALRCGEYVTIRKVGPAGQKGGGSGTQQIV 683
           HK VLVGDLK+AD K FLSSKG+QVEFA GGALRCGEYVT+RKVGP GQKGG SG QQI+
Sbjct: 657 HKPVLVGDLKIADFKQFLSSKGVQVEFAGGGALRCGEYVTLRKVGPTGQKGGASGPQQIL 716

Query: 684 IEGPLCEDYYKIRAYLYSQFYLL 706
           IEGPLCEDYYKIR YLYSQFYLL
Sbjct: 717 IEGPLCEDYYKIRDYLYSQFYLL 739


>gi|115480769|ref|NP_001063978.1| Os09g0569400 [Oryza sativa Japonica Group]
 gi|75253249|sp|Q652P4.1|CPSF2_ORYSJ RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 2; AltName: Full=Cleavage and polyadenylation
           specificity factor 100 kDa subunit; Short=CPSF 100 kDa
           subunit
 gi|52077178|dbj|BAD46223.1| putative cleavage and polyadenylation specificity factor [Oryza
           sativa Japonica Group]
 gi|113632211|dbj|BAF25892.1| Os09g0569400 [Oryza sativa Japonica Group]
          Length = 738

 Score = 1049 bits (2712), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 526/742 (70%), Positives = 603/742 (81%), Gaps = 40/742 (5%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQVTPLSG + E PL YL+++DGF FL+DCGW D  DPS LQPL+KVA TIDAVLL
Sbjct: 1   MGTSVQVTPLSGAYGEGPLCYLLAVDGFRFLLDCGWTDLCDPSHLQPLAKVAPTIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SH DT+HLGALPYAMK LGLSAPV++TEPV+RLG+LT+YD ++SRRQVS+FDLFTLDDID
Sbjct: 61  SHADTMHLGALPYAMKHLGLSAPVYATEPVFRLGILTLYDYFISRRQVSDFDLFTLDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           +AFQ+V RL YSQN+ L+ KGEGIV+APHVAGH LGGTVWKITKDGEDV+YAVD+N RKE
Sbjct: 121 AAFQNVVRLKYSQNHLLNDKGEGIVIAPHVAGHDLGGTVWKITKDGEDVVYAVDFNHRKE 180

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQP-PRQQREMFQDAISKTLRAGGNVLLPVDSAG 239
           +HLNGT L SFVRPAVLITDAYNAL+N    RQQ + F DA+ K L  GG+VLLP+D+AG
Sbjct: 181 RHLNGTALGSFVRPAVLITDAYNALNNHVYKRQQDQDFIDALVKVLTGGGSVLLPIDTAG 240

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
           RVLE+LLILE YWA+  L YPIYFLT VS+ST+DYVKSFLEWM DSI+KSFE +RDNAFL
Sbjct: 241 RVLEILLILEQYWAQRHLIYPIYFLTNVSTSTVDYVKSFLEWMNDSISKSFEHTRDNAFL 300

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
           LK VT +INK EL+   D PK+VLASMASLE GFSHDIFV+ A++ KNLVLFTE+GQFGT
Sbjct: 301 LKCVTQIINKDELEKLGDAPKVVLASMASLEVGFSHDIFVDMANEAKNLVLFTEKGQFGT 360

Query: 360 LARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLG 419
           LARMLQ DPPPKAVKVTMS+R+PLVG+EL AYEEEQ R+KKEEALKASL KEEE KASLG
Sbjct: 361 LARMLQVDPPPKAVKVTMSKRIPLVGDELKAYEEEQERIKKEEALKASLNKEEEKKASLG 420

Query: 420 PDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWD 479
             N  + DPMVIDA+ +   ++     GG   DILIDGFVPPS+SVAPMFPF+EN SEWD
Sbjct: 421 -SNAKASDPMVIDASTSRKPSNAGSKFGGNV-DILIDGFVPPSSSVAPMFPFFENTSEWD 478

Query: 480 DFGEVINPDDYIIKDEDMDQAAMHIGGD--DGKLDEGSASLILDAKPSKVVSNELT---- 533
           DFGEVINP+DY++K E+MD   M   GD  D  LDEGSA L+LD+ PSKV+SNE+T    
Sbjct: 479 DFGEVINPEDYLMKQEEMDNTLMPGAGDGMDSMLDEGSARLLLDSTPSKVISNEMTVQVK 538

Query: 534 -----------------------------VLVHGSAEATEHLKQHCLKHVCPHVYTPQIE 564
                                        VLVHGSAEATEHLK HC K+   HVY PQIE
Sbjct: 539 CSLAYMDFEGRSDGRSVKSVIAHVAPLKLVLVHGSAEATEHLKMHCSKNSDLHVYAPQIE 598

Query: 565 ETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPP 624
           ETIDVTSDLCAYKVQLSEKLMSNV+ KKLG++EIAWVDAEVGKT++ +  L P STPA  
Sbjct: 599 ETIDVTSDLCAYKVQLSEKLMSNVISKKLGEHEIAWVDAEVGKTDDKLTLLPPSSTPA-A 657

Query: 625 HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVI 684
           HKSVLVGDLK+AD K FL++KG+QVEFAGGALRCGEY+T+RK+G AGQK G +G+QQIVI
Sbjct: 658 HKSVLVGDLKLADFKQFLANKGLQVEFAGGALRCGEYITLRKIGDAGQK-GSTGSQQIVI 716

Query: 685 EGPLCEDYYKIRAYLYSQFYLL 706
           EGPLCEDYYKIR  LYSQFYLL
Sbjct: 717 EGPLCEDYYKIRELLYSQFYLL 738


>gi|357127861|ref|XP_003565596.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like [Brachypodium distachyon]
          Length = 738

 Score = 1043 bits (2698), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 518/742 (69%), Positives = 605/742 (81%), Gaps = 40/742 (5%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQVTPLSG + E PL YL+++DGF FL+DCGW DH DPSLLQPL++VA TIDAVLL
Sbjct: 1   MGTSVQVTPLSGAYGEGPLCYLLAVDGFRFLLDCGWTDHCDPSLLQPLARVAPTIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD +HLGALPYAMK LGLSAPV++TEPV+RLGLLTMYD +LSR QV++FDLFTLDDID
Sbjct: 61  SHPDIMHLGALPYAMKHLGLSAPVYATEPVFRLGLLTMYDYFLSRWQVADFDLFTLDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           +AFQ+V RL YSQN+ L+ KGEGIV+APHV+GHLLGGTVWKITKDGEDV+YAVD+N RKE
Sbjct: 121 AAFQNVVRLKYSQNHLLNDKGEGIVIAPHVSGHLLGGTVWKITKDGEDVVYAVDFNHRKE 180

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQP-PRQQREMFQDAISKTLRAGGNVLLPVDSAG 239
           +HLNGT L SFVRPAVLITDAYNAL+NQ   RQQ + F D++ K L +GG+VLLPVD+AG
Sbjct: 181 RHLNGTALGSFVRPAVLITDAYNALNNQVYKRQQDQDFIDSMVKVLASGGSVLLPVDTAG 240

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
           RVLELLLI+E YWA+  L YPIYFLT VS+ST+DYVKSFLEWM DSI+KSFE +RDNAFL
Sbjct: 241 RVLELLLIMEQYWAQRHLVYPIYFLTNVSTSTVDYVKSFLEWMSDSISKSFEHTRDNAFL 300

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
           L++V+L+INK EL+   D PK+VLASMASLE GFSHDIFVE A++ KNLVLFTE+GQFGT
Sbjct: 301 LRYVSLIINKEELEKLGDAPKVVLASMASLEVGFSHDIFVEMANEAKNLVLFTEKGQFGT 360

Query: 360 LARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLG 419
           LARMLQ DPPPKAVKVTM +R+PLVG+EL AYEEEQ R+KKEE LKASL K+EE KAS G
Sbjct: 361 LARMLQVDPPPKAVKVTMGKRIPLVGDELKAYEEEQERIKKEELLKASLSKDEELKASHG 420

Query: 420 PDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWD 479
             N  + DPMV+DA+++  S++     GG   DILIDGFVP +TS APMFPF+EN ++WD
Sbjct: 421 -SNAKASDPMVVDASSSRKSSNAGSHVGGNV-DILIDGFVPSTTSFAPMFPFFENTADWD 478

Query: 480 DFGEVINPDDYIIKDEDMDQAAMHIGGD--DGKLDEGSASLILDAKPSKVVSNELT---- 533
           DFGEVINPDDY++K ++MD   M   GD  DGKLDEGSA L+LD+ PSKV+SNE+T    
Sbjct: 479 DFGEVINPDDYMMKQDEMDNNMMLGAGDGMDGKLDEGSARLLLDSAPSKVISNEMTVQVK 538

Query: 534 -----------------------------VLVHGSAEATEHLKQHCLKHVCPHVYTPQIE 564
                                        VLVHGSAEATEHLK HC K+   HVY PQIE
Sbjct: 539 CSLAYMDFEGRSDGRSVKSVIAHVAPLKLVLVHGSAEATEHLKMHCAKNSDLHVYAPQIE 598

Query: 565 ETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPP 624
           ETIDVTSDLCAYKVQLSEKLMSNV+ KKLG++EIAWVDAEVGK +   L+LLP S+    
Sbjct: 599 ETIDVTSDLCAYKVQLSEKLMSNVISKKLGEHEIAWVDAEVGKVDEK-LNLLPPSSTPSA 657

Query: 625 HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVI 684
           HKSVLVGDLK+AD K FL++KG+QVEFAGGALRCGEY+T+RK+G + QK G +G+QQIVI
Sbjct: 658 HKSVLVGDLKLADFKQFLANKGLQVEFAGGALRCGEYITVRKIGDSNQK-GSTGSQQIVI 716

Query: 685 EGPLCEDYYKIRAYLYSQFYLL 706
           EGPLCEDYYKIR  LYSQF+LL
Sbjct: 717 EGPLCEDYYKIRELLYSQFFLL 738


>gi|357160194|ref|XP_003578687.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like [Brachypodium distachyon]
          Length = 738

 Score = 1040 bits (2689), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 518/742 (69%), Positives = 604/742 (81%), Gaps = 40/742 (5%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQVTPLSG + E PL YL+++DGF FL+DCGW DH DPSLLQPL++VA TIDAVLL
Sbjct: 1   MGTSVQVTPLSGAYGEGPLCYLLAVDGFRFLLDCGWTDHCDPSLLQPLARVAPTIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD +HLGALPYAMK LGLSAPV+ TEPV+RLGLLTMYD +LSR QV++FDLFTLDDID
Sbjct: 61  SHPDIMHLGALPYAMKHLGLSAPVYVTEPVFRLGLLTMYDYFLSRWQVADFDLFTLDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           +AFQ+V RL YSQN+ L+ KGEGIV+APHV+GHLLGGTVWKITKDGEDV+YAVD+N RKE
Sbjct: 121 AAFQNVVRLKYSQNHLLNDKGEGIVIAPHVSGHLLGGTVWKITKDGEDVVYAVDFNHRKE 180

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQP-PRQQREMFQDAISKTLRAGGNVLLPVDSAG 239
           +HLNGT L SFVRPAVLITDAYNAL+NQ   RQQ + F D++ K L +GG+VLLPVD+AG
Sbjct: 181 RHLNGTALGSFVRPAVLITDAYNALNNQVYKRQQDQDFIDSMVKVLASGGSVLLPVDTAG 240

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
           RVLELLLI+E YWA+  L YPIYFLT VS+ST+DYVKSFLEWM DSI+KSFE +RDNAFL
Sbjct: 241 RVLELLLIMEQYWAQRHLVYPIYFLTNVSTSTVDYVKSFLEWMSDSISKSFEHTRDNAFL 300

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
           L++V+L+INK EL+   D PK+VLASMASLE GFSHDIFVE A++ KNLVLFTE+GQFGT
Sbjct: 301 LRYVSLIINKEELEKLGDAPKVVLASMASLEVGFSHDIFVEMANEAKNLVLFTEKGQFGT 360

Query: 360 LARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLG 419
           LARMLQ DPPPKAVKVTM +R+PLVG+EL AYEEEQ R+KKEE LKASL K+EE KAS G
Sbjct: 361 LARMLQVDPPPKAVKVTMGKRIPLVGDELKAYEEEQERIKKEELLKASLSKDEELKASHG 420

Query: 420 PDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWD 479
             N  + DPMV+DA+++  S++     GG   DILIDGFVP +TSVAPMFPF+EN ++WD
Sbjct: 421 -SNAKASDPMVVDASSSRKSSNAGSHVGGNV-DILIDGFVPSTTSVAPMFPFFENTADWD 478

Query: 480 DFGEVINPDDYIIKDEDMDQAAMHIGGD--DGKLDEGSASLILDAKPSKVVSNELT---- 533
           DFGEVINPDDY++K ++MD   M   GD  DGKLDEGSA L+LD+ PSKV+SNE+T    
Sbjct: 479 DFGEVINPDDYMMKQDEMDNNMMLGAGDGMDGKLDEGSARLLLDSAPSKVISNEMTVQVK 538

Query: 534 -----------------------------VLVHGSAEATEHLKQHCLKHVCPHVYTPQIE 564
                                        VLVHGSAEATEHLK HC K+   HVY PQIE
Sbjct: 539 CSLVYMDFEGRSDGRSVKSVIAHVAPLKLVLVHGSAEATEHLKMHCAKNSDLHVYAPQIE 598

Query: 565 ETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPP 624
           ETIDVTSDLCAYKVQLSEKLMSNV+ KKLG++EIAWVDAEVGK +   L+LLP S+    
Sbjct: 599 ETIDVTSDLCAYKVQLSEKLMSNVISKKLGEHEIAWVDAEVGKVDEK-LNLLPPSSTPSA 657

Query: 625 HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVI 684
           HKSVLVGDLK+AD K FL++KG+QVEFAGGALRCGEY+T+RK+G + QK G + +QQIVI
Sbjct: 658 HKSVLVGDLKLADFKQFLANKGLQVEFAGGALRCGEYITVRKIGDSNQK-GSTVSQQIVI 716

Query: 685 EGPLCEDYYKIRAYLYSQFYLL 706
           EGPLCEDYYKIR  LYSQF+LL
Sbjct: 717 EGPLCEDYYKIRELLYSQFFLL 738


>gi|218202664|gb|EEC85091.1| hypothetical protein OsI_32459 [Oryza sativa Indica Group]
          Length = 1195

 Score = 1023 bits (2646), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 517/736 (70%), Positives = 595/736 (80%), Gaps = 44/736 (5%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQVTPLSG + E PL YL+++DGF FL+DCGW D  DPS LQPL+KVA TIDAVLL
Sbjct: 1   MGTSVQVTPLSGAYGEGPLCYLLAVDGFRFLLDCGWTDLCDPSHLQPLAKVAPTIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SH DT+HLGALPYAMK LGLSAPV++TEPV+RLG+LT+YD ++SRRQVS+FDLFTLDDID
Sbjct: 61  SHADTMHLGALPYAMKHLGLSAPVYATEPVFRLGILTLYDYFISRRQVSDFDLFTLDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           +AFQ+V RL YSQN+ L+ KGEGIV+APHVAGH LGGTVWKITKDGEDV+YAVD+N RKE
Sbjct: 121 AAFQNVVRLKYSQNHLLNDKGEGIVIAPHVAGHDLGGTVWKITKDGEDVVYAVDFNHRKE 180

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQP-PRQQREMFQDAISKTLRAGGNVLLPVDSAG 239
           +HLNGT L SFVRPAVLITDAYNAL+N    RQQ + F DA+ K L  GG+VLLP+D+AG
Sbjct: 181 RHLNGTALGSFVRPAVLITDAYNALNNHVYKRQQDQDFIDALVKVLTGGGSVLLPIDTAG 240

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
           RVLE+LLILE YWA+  L YPIYFLT VS+ST+DYVKSFLEWM DSI+KSFE +RDNAFL
Sbjct: 241 RVLEILLILEQYWAQRHLIYPIYFLTNVSTSTVDYVKSFLEWMNDSISKSFEHTRDNAFL 300

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
           LK VT +INK EL+   D PK+VLASMASLE GFSHDIFV+ A++ KNLVLFTE+GQFGT
Sbjct: 301 LKCVTQIINKDELEKLGDAPKVVLASMASLEVGFSHDIFVDMANEAKNLVLFTEKGQFGT 360

Query: 360 LARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLG 419
           LARMLQ DPPPKAVKVTMS+R+PLVG+EL AYEEEQ R+KKEEALKASL KEEE KASLG
Sbjct: 361 LARMLQVDPPPKAVKVTMSKRIPLVGDELKAYEEEQERIKKEEALKASLNKEEEKKASLG 420

Query: 420 PDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWD 479
             N  + DPMVIDA+ +   ++     GG   DILIDGFVPPS+SVAPMFPF+EN SEWD
Sbjct: 421 -SNAKASDPMVIDASTSRKPSNAGSKFGGNV-DILIDGFVPPSSSVAPMFPFFENTSEWD 478

Query: 480 DFGEVINPDDYIIKDEDMDQAAMHIGGD--DGKLDEGSASLILDAKPSKVVSNELT---- 533
           DFGEVINP+DY++K E+MD   M   GD  D  LDEGSA L+LD+ PSKV+SNE+T    
Sbjct: 479 DFGEVINPEDYLMKQEEMDNTLMPGAGDGMDSMLDEGSARLLLDSTPSKVISNEMTVQVK 538

Query: 534 -----------------------------VLVHGSAEATEHLKQHCLKHVCPHVYTPQIE 564
                                        VLVHGSAEATEHLK HC K+   HVY PQIE
Sbjct: 539 CSLAYMDFEGRSDGRSVKSVIAHVAPLKLVLVHGSAEATEHLKMHCSKNSDLHVYAPQIE 598

Query: 565 ETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPP 624
           ETIDVTSDLCAYKVQLSEKLMSNV+ KKLG++EIAWVDAEVGKT++ +  L P STPA  
Sbjct: 599 ETIDVTSDLCAYKVQLSEKLMSNVISKKLGEHEIAWVDAEVGKTDDKLTLLPPSSTPA-A 657

Query: 625 HKSVLVGDLKMADLKPFLSSKG----IQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQ 680
           HKSVLVGDLK+AD K FL++KG    +QVEFAGGALRCGEY+T+RK+G AGQK G +G+Q
Sbjct: 658 HKSVLVGDLKLADFKQFLANKGLRDFLQVEFAGGALRCGEYITLRKIGDAGQK-GSTGSQ 716

Query: 681 QIVIEGPLCEDYYKIR 696
           QIVIEGPLCEDYYKI+
Sbjct: 717 QIVIEGPLCEDYYKIQ 732


>gi|242037469|ref|XP_002466129.1| hypothetical protein SORBIDRAFT_01g001930 [Sorghum bicolor]
 gi|241919983|gb|EER93127.1| hypothetical protein SORBIDRAFT_01g001930 [Sorghum bicolor]
          Length = 738

 Score = 1013 bits (2620), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 521/742 (70%), Positives = 603/742 (81%), Gaps = 40/742 (5%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQVTPLSG + E PL YL+++DGF FL+DCGW D  D S LQPL+KVA T+DAVLL
Sbjct: 1   MGTSVQVTPLSGAYGEGPLCYLLAVDGFRFLLDCGWTDLCDTSQLQPLAKVAPTVDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD +HLGALPYAMK LGLSAPV++TEPV+RLGLLTMYD +LSR QVS+FDLFTLDD+D
Sbjct: 61  SHPDMMHLGALPYAMKHLGLSAPVYATEPVFRLGLLTMYDHFLSRWQVSDFDLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           +AFQ+V RL YSQNY L+ KGEGIV+APHVAGHLLGGTVWKITKDGEDV+YAVD+N RKE
Sbjct: 121 AAFQNVVRLKYSQNYLLNDKGEGIVIAPHVAGHLLGGTVWKITKDGEDVVYAVDFNHRKE 180

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAG 239
           +HLNGTVL SFVRPAVLITDAYNAL+NQ  R++++  F D++ K L  GG+VLLPVD+AG
Sbjct: 181 RHLNGTVLGSFVRPAVLITDAYNALNNQGYRKKQDQDFIDSLIKVLATGGSVLLPVDTAG 240

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
           RVLELLL+L+ YW E  L YPIYFLT VS+ST+DYVKSFLEWM D I KSFE++R NAFL
Sbjct: 241 RVLELLLLLDTYWDERRLQYPIYFLTNVSTSTVDYVKSFLEWMRDQIAKSFESNRANAFL 300

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
           LK V L+INK EL+   D PK+VLASMASLE GFSHDIFVE A++ +NLVLFTE+GQFGT
Sbjct: 301 LKKVMLIINKEELEKLGDAPKVVLASMASLEVGFSHDIFVEMANEARNLVLFTEKGQFGT 360

Query: 360 LARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLG 419
           LARMLQ DPPPKAVKVTMS+R+PLVG+EL AYEEEQ R+KKE+ALKASLVKEEE KASLG
Sbjct: 361 LARMLQVDPPPKAVKVTMSKRIPLVGDELKAYEEEQERIKKEKALKASLVKEEELKASLG 420

Query: 420 PDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWD 479
             N  + DPMVIDA+++  SA+     GG   DILIDGFVPPSTSVAPMFPF+EN +EWD
Sbjct: 421 -SNAKASDPMVIDASSSRKSANAGSHFGGN-TDILIDGFVPPSTSVAPMFPFFENTAEWD 478

Query: 480 DFGEVINPDDYIIKDEDMDQAAMHIGGD--DGKLDEGSASLILDAKPSKVVSNELT---- 533
           DFGEVINPDDY++K E+MD   M   GD  DGK+D+GSA L+LD+ PSKV+SNE+T    
Sbjct: 479 DFGEVINPDDYMMKQEEMDNTLMLGPGDGLDGKIDDGSARLLLDSTPSKVISNEMTVQVK 538

Query: 534 -----------------------------VLVHGSAEATEHLKQHCLKHVCPHVYTPQIE 564
                                        VLVHGSAEATEHLK HC K++  HV+ PQIE
Sbjct: 539 CSLVYMDFEGRSDGRSVKSVIAHVAPLKLVLVHGSAEATEHLKMHCTKNLDLHVHAPQIE 598

Query: 565 ETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPP 624
           ETIDVTSDLCAYKVQLSEKLMSN++ KKLG++EIAWVDAEVGK E+  L LLP S+  PP
Sbjct: 599 ETIDVTSDLCAYKVQLSEKLMSNIISKKLGEHEIAWVDAEVGK-EDEKLILLPPSSTPPP 657

Query: 625 HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVI 684
           HK VLVGDLK++D K FL +KG QVEFAGGALRCGEY+ +RK+G + QK G +G+QQIVI
Sbjct: 658 HKPVLVGDLKLSDFKQFLENKGWQVEFAGGALRCGEYIMVRKIGDSSQK-GSTGSQQIVI 716

Query: 685 EGPLCEDYYKIRAYLYSQFYLL 706
           EGPLCEDYYKIR  LYSQFYLL
Sbjct: 717 EGPLCEDYYKIRELLYSQFYLL 738


>gi|326495752|dbj|BAJ85972.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 726

 Score = 1005 bits (2598), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 500/729 (68%), Positives = 588/729 (80%), Gaps = 40/729 (5%)

Query: 14  FNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPY 73
           + E PL YL+++DGF FL+DCGW DH DP+LLQPL++VA TIDAVLLSHPD +HLGALPY
Sbjct: 2   YGEGPLCYLLAVDGFRFLLDCGWTDHCDPALLQPLARVAPTIDAVLLSHPDMMHLGALPY 61

Query: 74  AMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQ 133
           A+K LGLSAPV++TEPVYRLGLLTMYD +LSR QV++FDLF+LDDID+AFQ+V RL YSQ
Sbjct: 62  AIKHLGLSAPVYATEPVYRLGLLTMYDYFLSRWQVADFDLFSLDDIDAAFQNVARLKYSQ 121

Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 193
           N+ L  KGEGIV+APHV+GHLLGGTVWKITKDGEDV+YAVD+N RKE+HLNGT L SFVR
Sbjct: 122 NHLLKDKGEGIVIAPHVSGHLLGGTVWKITKDGEDVVYAVDFNHRKERHLNGTTLGSFVR 181

Query: 194 PAVLITDAYNALHNQP-PRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 252
           PAVLITDAYNAL+NQ   RQQ + F D++ K L  GG+VLLPVD+AGRVLELLL +E YW
Sbjct: 182 PAVLITDAYNALNNQVYKRQQDQDFIDSMVKVLSGGGSVLLPVDTAGRVLELLLTMEQYW 241

Query: 253 AEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           A+  L YPIYFLT VS+ST+D+VKSFLEWM DSI+KSFE +RDNAFLL+HV+L+INK EL
Sbjct: 242 AQRHLVYPIYFLTNVSTSTVDFVKSFLEWMSDSISKSFEHTRDNAFLLRHVSLIINKEEL 301

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           +   D PK+VLASM+SLE GFSHDIFVE A++ KNLVLFTE+GQFGTLARMLQ DPPPKA
Sbjct: 302 EKLGDAPKVVLASMSSLEVGFSHDIFVEMANEAKNLVLFTEKGQFGTLARMLQVDPPPKA 361

Query: 373 VKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVID 432
           VKVTMS+RVPLVG+EL AYEEEQ R+KKEE LKASL KE+E KAS    N  + DPMV+D
Sbjct: 362 VKVTMSKRVPLVGDELKAYEEEQERIKKEEVLKASLSKEKELKAS-HESNAKASDPMVVD 420

Query: 433 ANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYII 492
           A+ +  S++     GG   DILIDGFV P+TS+APMFPF+EN ++WDDFGEVINPDDY++
Sbjct: 421 ASLSRKSSNAGSHVGGNV-DILIDGFVSPATSIAPMFPFFENTADWDDFGEVINPDDYMM 479

Query: 493 KDEDMDQAAMHIGGD--DGKLDEGSASLILDAKPSKVVSNELT----------------- 533
           K +++D   M   GD  DGKLDEGSA L+LD+ PSKV+SNELT                 
Sbjct: 480 KQDEVDNNMMLGVGDGMDGKLDEGSARLLLDSAPSKVISNELTVQVKCSLAYMDFEGRSD 539

Query: 534 ----------------VLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYK 577
                           VLVHGSAEATEHLK HC K+   HVY PQ+EETIDVTSDLCAYK
Sbjct: 540 GRSVKSVIAHVAPLKLVLVHGSAEATEHLKMHCAKNSDLHVYAPQLEETIDVTSDLCAYK 599

Query: 578 VQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPPHKSVLVGDLKMAD 637
           VQLSEKLMSNV+ KKLG++EIAWVDA VGK +   LSL+P S+    H SVLVGDLK+AD
Sbjct: 600 VQLSEKLMSNVISKKLGEHEIAWVDAGVGKADEK-LSLVPPSSIPAAHNSVLVGDLKLAD 658

Query: 638 LKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRA 697
            K FL++KG+QVEFAGGALRCGEY+T+RK+G + QK G +G+QQIVIEGPLCEDYYKIR 
Sbjct: 659 FKQFLANKGLQVEFAGGALRCGEYITVRKIGDSNQK-GSTGSQQIVIEGPLCEDYYKIRE 717

Query: 698 YLYSQFYLL 706
            LYSQF+LL
Sbjct: 718 LLYSQFFLL 726


>gi|219886123|gb|ACL53436.1| unknown [Zea mays]
 gi|414881946|tpg|DAA59077.1| TPA: cleavage and polyadenylation specificity factor, subunit
           isoform 1 [Zea mays]
 gi|414881947|tpg|DAA59078.1| TPA: cleavage and polyadenylation specificity factor, subunit
           isoform 2 [Zea mays]
 gi|414881948|tpg|DAA59079.1| TPA: cleavage and polyadenylation specificity factor, subunit
           isoform 3 [Zea mays]
          Length = 737

 Score = 1003 bits (2594), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 516/742 (69%), Positives = 600/742 (80%), Gaps = 41/742 (5%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQVTPLSG + E PL YL+++DGF FL+DCGW D  D S LQPL+KVA T+DAVLL
Sbjct: 1   MGTSVQVTPLSGAYGEGPLCYLLAVDGFRFLLDCGWTDLCDTSQLQPLAKVAPTVDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD +HLGALPYAMK LGLSAPV++TEPV+RLGLLTMYD +LSR QVS+FDLFTLDD+D
Sbjct: 61  SHPDMMHLGALPYAMKHLGLSAPVYATEPVFRLGLLTMYDHFLSRWQVSDFDLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           +AFQ+V RL YSQNY L+ KGEG+V+APHVAGHLLGGTVWKITKDGEDV+YAVD+N RKE
Sbjct: 121 AAFQNVVRLKYSQNYLLNDKGEGVVIAPHVAGHLLGGTVWKITKDGEDVVYAVDFNHRKE 180

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAG 239
           +HLNGTVL SFVRPAVLITDAYNAL+NQ  R++++  F +++ K L  GG+VLLPVD+AG
Sbjct: 181 RHLNGTVLGSFVRPAVLITDAYNALNNQGYRKKQDQDFIESLIKVLATGGSVLLPVDTAG 240

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
           RVLELLL+L+ YW E  L YPIYFLT VS+ST+DYVKSFLEWMGD I KSFE+SR NAFL
Sbjct: 241 RVLELLLLLDMYWDERRLQYPIYFLTNVSTSTVDYVKSFLEWMGDQIAKSFESSRANAFL 300

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
           LK VTL+INK EL+   D PK+VLASMASLE GFSHDIFVE A++ +NLVLFTE+GQFGT
Sbjct: 301 LKKVTLIINKEELEKLGDAPKVVLASMASLEVGFSHDIFVEMANEARNLVLFTEKGQFGT 360

Query: 360 LARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLG 419
           LARMLQ DPPPKA+KVTMS+R+PLVG EL AYEEEQ R+KKE++LKASLVKEEE KAS G
Sbjct: 361 LARMLQVDPPPKALKVTMSKRIPLVGNELKAYEEEQERIKKEKSLKASLVKEEELKASHG 420

Query: 420 PDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWD 479
             N  + +PMVIDA+++  S +    H G   DILIDGFVPP TSVAPMFPF+EN +EWD
Sbjct: 421 -SNTKASEPMVIDASSSRKSVNA--SHFGGNNDILIDGFVPPLTSVAPMFPFFENTAEWD 477

Query: 480 DFGEVINPDDYIIKDEDMDQAAMHIGGD--DGKLDEGSASLILDAKPSKVVSNELTV--- 534
           DFGEVINPDDY++K E+MD   M   GD  DG++D+GSA L+LD+ PSKV+SNE+TV   
Sbjct: 478 DFGEVINPDDYMMKQEEMDNTLMLGPGDGLDGRIDDGSARLLLDSTPSKVISNEMTVQVK 537

Query: 535 ------------------------------LVHGSAEATEHLKQHCLKHVCPHVYTPQIE 564
                                         LVHGSAEATEHLK HC K++  HVY PQIE
Sbjct: 538 CSLVYMDFEGRSDGRSVKSIIAHVAPLKLILVHGSAEATEHLKMHCAKNLDLHVYAPQIE 597

Query: 565 ETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPP 624
           ETIDVTSDLCAYKVQLSEKLMSN++ KKLG++EIAWVDAEVGK E+  L LLP S+  PP
Sbjct: 598 ETIDVTSDLCAYKVQLSEKLMSNIISKKLGEHEIAWVDAEVGK-EDEKLILLPPSSTPPP 656

Query: 625 HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVI 684
           HK VLVGDLK++D K FL +KG QVEFAGGALRCGEY+ +RKVG +  K G +G+QQIVI
Sbjct: 657 HKPVLVGDLKLSDFKQFLENKGWQVEFAGGALRCGEYIMVRKVGDSILK-GSTGSQQIVI 715

Query: 685 EGPLCEDYYKIRAYLYSQFYLL 706
           EGPLCEDYYKIR  LYSQFYLL
Sbjct: 716 EGPLCEDYYKIRELLYSQFYLL 737


>gi|414881949|tpg|DAA59080.1| TPA: hypothetical protein ZEAMMB73_548570 [Zea mays]
          Length = 766

 Score =  986 bits (2549), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 516/771 (66%), Positives = 601/771 (77%), Gaps = 70/771 (9%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQVTPLSG + E PL YL+++DGF FL+DCGW D  D S LQPL+KVA T+DAVLL
Sbjct: 1   MGTSVQVTPLSGAYGEGPLCYLLAVDGFRFLLDCGWTDLCDTSQLQPLAKVAPTVDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD +HLGALPYAMK LGLSAPV++TEPV+RLGLLTMYD +LSR QVS+FDLFTLDD+D
Sbjct: 61  SHPDMMHLGALPYAMKHLGLSAPVYATEPVFRLGLLTMYDHFLSRWQVSDFDLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           +AFQ+V RL YSQNY L+ KGEG+V+APHVAGHLLGGTVWKITKDGEDV+YAVD+N RKE
Sbjct: 121 AAFQNVVRLKYSQNYLLNDKGEGVVIAPHVAGHLLGGTVWKITKDGEDVVYAVDFNHRKE 180

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAG 239
           +HLNGTVL SFVRPAVLITDAYNAL+NQ  R++++  F +++ K L  GG+VLLPVD+AG
Sbjct: 181 RHLNGTVLGSFVRPAVLITDAYNALNNQGYRKKQDQDFIESLIKVLATGGSVLLPVDTAG 240

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
           RVLELLL+L+ YW E  L YPIYFLT VS+ST+DYVKSFLEWMGD I KSFE+SR NAFL
Sbjct: 241 RVLELLLLLDMYWDERRLQYPIYFLTNVSTSTVDYVKSFLEWMGDQIAKSFESSRANAFL 300

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG---- 355
           LK VTL+INK EL+   D PK+VLASMASLE GFSHDIFVE A++ +NLVLFTE+G    
Sbjct: 301 LKKVTLIINKEELEKLGDAPKVVLASMASLEVGFSHDIFVEMANEARNLVLFTEKGQKIF 360

Query: 356 --QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEE 413
             QFGTLARMLQ DPPPKA+KVTMS+R+PLVG EL AYEEEQ R+KKE++LKASLVKEEE
Sbjct: 361 ALQFGTLARMLQVDPPPKALKVTMSKRIPLVGNELKAYEEEQERIKKEKSLKASLVKEEE 420

Query: 414 SKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYE 473
            KAS G  N  + +PMVIDA+++  S +    H G   DILIDGFVPP TSVAPMFPF+E
Sbjct: 421 LKASHG-SNTKASEPMVIDASSSRKSVNA--SHFGGNNDILIDGFVPPLTSVAPMFPFFE 477

Query: 474 NNSEWDDFGEVINPDDYIIKDEDMDQAAMHIGGD--DGKLDEGSASLILDAKPSKVVSNE 531
           N +EWDDFGEVINPDDY++K E+MD   M   GD  DG++D+GSA L+LD+ PSKV+SNE
Sbjct: 478 NTAEWDDFGEVINPDDYMMKQEEMDNTLMLGPGDGLDGRIDDGSARLLLDSTPSKVISNE 537

Query: 532 LTV---------------------------------LVHGSAEATEHLKQHCLKHVCPHV 558
           +TV                                 LVHGSAEATEHLK HC K++  HV
Sbjct: 538 MTVQVKCSLVYMDFEGRSDGRSVKSIIAHVAPLKLILVHGSAEATEHLKMHCAKNLDLHV 597

Query: 559 YTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPI 618
           Y PQIEETIDVTSDLCAYKVQLSEKLMSN++ KKLG++EIAWVDAEVGK E+  L LLP 
Sbjct: 598 YAPQIEETIDVTSDLCAYKVQLSEKLMSNIISKKLGEHEIAWVDAEVGK-EDEKLILLPP 656

Query: 619 STPAPPHKSVLVGDLKMADLKPFLSSKG-----------------------IQVEFAGGA 655
           S+  PPHK VLVGDLK++D K FL +KG                       +QVEFAGGA
Sbjct: 657 SSTPPPHKPVLVGDLKLSDFKQFLENKGWQDFSVERERIKYVEIQSLRKELLQVEFAGGA 716

Query: 656 LRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
           LRCGEY+ +RKVG +  K G +G+QQIVIEGPLCEDYYKIR  LYSQFYLL
Sbjct: 717 LRCGEYIMVRKVGDSILK-GSTGSQQIVIEGPLCEDYYKIRELLYSQFYLL 766


>gi|226492345|ref|NP_001151557.1| LOC100285191 [Zea mays]
 gi|195647682|gb|ACG43309.1| cleavage and polyadenylation specificity factor, 100 kDa subunit
           [Zea mays]
          Length = 673

 Score =  908 bits (2347), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 471/677 (69%), Positives = 547/677 (80%), Gaps = 41/677 (6%)

Query: 66  LHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQS 125
           +HLGALPYAMK LGLSAPV++TEPV+RLGLLTMYD +LSR QVS+FDLFTLDD+D+AFQ+
Sbjct: 2   MHLGALPYAMKHLGLSAPVYATEPVFRLGLLTMYDHFLSRWQVSDFDLFTLDDVDAAFQN 61

Query: 126 VTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNG 185
           V RL YSQNY L+ KGEG+V+APHVAGHLLGGTVWKITKDGEDV+YAVD+N RKE+HLNG
Sbjct: 62  VVRLKYSQNYLLNDKGEGVVIAPHVAGHLLGGTVWKITKDGEDVVYAVDFNHRKERHLNG 121

Query: 186 TVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLEL 244
           TVL SFVRPAVLITDAYNAL+NQ  R++++  F D++ K L  GG+VLLPVD+AGRVLEL
Sbjct: 122 TVLGSFVRPAVLITDAYNALNNQGYRKKQDQDFIDSLIKVLATGGSVLLPVDTAGRVLEL 181

Query: 245 LLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVT 304
           LL+L+ YW E  L YPIYFLT VS+ST+DYVKSFLEWMGD I KSFE+SR NAFLLK VT
Sbjct: 182 LLLLDMYWDERRLQYPIYFLTNVSTSTVDYVKSFLEWMGDQIAKSFESSRANAFLLKKVT 241

Query: 305 LLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML 364
           L+INK EL+   D PK+VLASMASLE GFSHDIFVE A++ +NLVLFTE+GQFGTLARML
Sbjct: 242 LIINKEELEKLGDAPKVVLASMASLEVGFSHDIFVEMANEARNLVLFTEKGQFGTLARML 301

Query: 365 QADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNL 424
           Q DPPPKA+KVTMS+R+PLVG EL AYEEEQ R+KKE++LKASLVKEEE KAS G  N  
Sbjct: 302 QVDPPPKALKVTMSKRIPLVGNELKAYEEEQERIKKEKSLKASLVKEEELKASHGS-NTK 360

Query: 425 SGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEV 484
           + +PMVIDA+++  S +    H G   DILIDGFVPP TSVAPMFPF+EN +EWDDFGEV
Sbjct: 361 ASEPMVIDASSSRKSVNA--SHFGGNNDILIDGFVPPLTSVAPMFPFFENTAEWDDFGEV 418

Query: 485 INPDDYIIKDEDMDQAAMHIGGD--DGKLDEGSASLILDAKPSKVVSNELTV-------- 534
           INPDDY++K E+MD   M   GD  DG++D+GSA L+LD+ PSKV+SNE+TV        
Sbjct: 419 INPDDYMMKQEEMDNTLMLGPGDGLDGRIDDGSARLLLDSTPSKVISNEMTVQVKCSLVY 478

Query: 535 -------------------------LVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDV 569
                                    LVHGSAEATEHLK HC K++  HVY PQIEETIDV
Sbjct: 479 MDFEGRSDGRSVKSIIAHVAPLKLILVHGSAEATEHLKMHCAKNLDLHVYAPQIEETIDV 538

Query: 570 TSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPPHKSVL 629
           TSDLCAYKVQLSEKLMSN++ KKLG++EIAWVDAEVGK E+  L LLP S+  PPHK VL
Sbjct: 539 TSDLCAYKVQLSEKLMSNIISKKLGEHEIAWVDAEVGK-EDEKLILLPPSSTPPPHKPVL 597

Query: 630 VGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLC 689
           VGDLK++D K FL +KG QVEFAGGALRCGEY+ +RKVG +  K G +G+QQIVIEGPLC
Sbjct: 598 VGDLKLSDFKQFLENKGWQVEFAGGALRCGEYIMVRKVGDSILK-GSTGSQQIVIEGPLC 656

Query: 690 EDYYKIRAYLYSQFYLL 706
           EDYYKIR  LYSQFYLL
Sbjct: 657 EDYYKIRELLYSQFYLL 673


>gi|449528453|ref|XP_004171219.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
           specificity factor subunit 2-like, partial [Cucumis
           sativus]
          Length = 501

 Score =  879 bits (2272), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 424/504 (84%), Positives = 467/504 (92%), Gaps = 4/504 (0%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQVTPL GV+NENPLSYLVS+D FNFLIDCGWNDHFDP+LLQPLS+VASTIDAVL+
Sbjct: 1   MGTSVQVTPLCGVYNENPLSYLVSVDDFNFLIDCGWNDHFDPALLQPLSRVASTIDAVLI 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQ+++R+QVSEFDLFTLDDID
Sbjct: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQFIARKQVSEFDLFTLDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           SAFQ VTRLTYSQN+HLSGKGEGIV+APHVAGHLLGGT+WKITKDGEDVIYAVD+N RKE
Sbjct: 121 SAFQVVTRLTYSQNHHLSGKGEGIVIAPHVAGHLLGGTLWKITKDGEDVIYAVDFNHRKE 180

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAG 239
           +HLNGT+LESFVRPAVLITDAYNAL+NQP R+Q++  F D I KTLRA GNVLLPVD+AG
Sbjct: 181 RHLNGTILESFVRPAVLITDAYNALNNQPYRRQKDKEFGDTIQKTLRANGNVLLPVDTAG 240

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
           RVLEL+ ILE YW E SLNYPI+FLTYV+SSTIDY+KSFLEWM D+I KSFE +R+NAFL
Sbjct: 241 RVLELIQILEWYWEEESLNYPIFFLTYVASSTIDYIKSFLEWMSDTIAKSFEHTRNNAFL 300

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
           LKHVTLLINKSELDNAPDGPK+VLASMASLEAG+SHD FV+WA D KNLVLF+ERGQFGT
Sbjct: 301 LKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDXFVDWAMDAKNLVLFSERGQFGT 360

Query: 360 LARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLG 419
           LARMLQADPPPKAVKVT+S+RVPL G+ELIAYEEEQ R KKEEALKASL+KEE+SKAS G
Sbjct: 361 LARMLQADPPPKAVKVTVSKRVPLTGDELIAYEEEQNR-KKEEALKASLLKEEQSKASHG 419

Query: 420 PDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWD 479
            DN+ +GDPM+IDA ++N + DV   HGG YRDILIDGFVPPST VAPMFPFYEN S WD
Sbjct: 420 ADND-TGDPMIIDA-SSNVAPDVGSSHGGAYRDILIDGFVPPSTGVAPMFPFYENTSAWD 477

Query: 480 DFGEVINPDDYIIKDEDMDQAAMH 503
           DFGEVINPDDY+IKDEDMDQAAMH
Sbjct: 478 DFGEVINPDDYVIKDEDMDQAAMH 501


>gi|222642134|gb|EEE70266.1| hypothetical protein OsJ_30409 [Oryza sativa Japonica Group]
          Length = 1073

 Score =  875 bits (2262), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 440/628 (70%), Positives = 503/628 (80%), Gaps = 38/628 (6%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQVTPLSG + E PL YL+++DGF FL+DCGW D  DPS LQPL+KVA TIDAVLL
Sbjct: 1   MGTSVQVTPLSGAYGEGPLCYLLAVDGFRFLLDCGWTDLCDPSHLQPLAKVAPTIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SH DT+HLGALPYAMK LGLSAPV++TEPV+RLG+LT+YD ++SRRQVS+FDLFTLDDID
Sbjct: 61  SHADTMHLGALPYAMKHLGLSAPVYATEPVFRLGILTLYDYFISRRQVSDFDLFTLDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           +AFQ+V RL YSQN+ L+ KGEGIV+APHVAGH LGGTVWKITKDGEDV+YAVD+N RKE
Sbjct: 121 AAFQNVVRLKYSQNHLLNDKGEGIVIAPHVAGHDLGGTVWKITKDGEDVVYAVDFNHRKE 180

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQP-PRQQREMFQDAISKTLRAGGNVLLPVDSAG 239
           +HLNGT L SFVRPAVLITDAYNAL+N    RQQ + F DA+ K L  GG+VLLP+D+AG
Sbjct: 181 RHLNGTALGSFVRPAVLITDAYNALNNHVYKRQQDQDFIDALVKVLTGGGSVLLPIDTAG 240

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
           RVLE+LLILE YWA+  L YPIYFLT VS+ST+DYVKSFLEWM DSI+KSFE +RDNAFL
Sbjct: 241 RVLEILLILEQYWAQRHLIYPIYFLTNVSTSTVDYVKSFLEWMNDSISKSFEHTRDNAFL 300

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
           LK VT +INK EL+   D PK+VLASMASLE GFSHDIFV+ A++ KNLVLFTE+GQFGT
Sbjct: 301 LKCVTQIINKDELEKLGDAPKVVLASMASLEVGFSHDIFVDMANEAKNLVLFTEKGQFGT 360

Query: 360 LARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLG 419
           LARMLQ DPPPKAVKVTMS+R+PLVG+EL AYEEEQ R+KKEEALKASL KEEE KASLG
Sbjct: 361 LARMLQVDPPPKAVKVTMSKRIPLVGDELKAYEEEQERIKKEEALKASLNKEEEKKASLG 420

Query: 420 PDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWD 479
             N  + DPMVIDA+ +   ++     GG   DILIDGFVPPS+SVAPMFPF+EN SEWD
Sbjct: 421 -SNAKASDPMVIDASTSRKPSNAGSKFGGNV-DILIDGFVPPSSSVAPMFPFFENTSEWD 478

Query: 480 DFGEVINPDDYIIKDEDMDQAAMHIGGD--DGKLDEGSASLILDAKPSKVVSNELT---- 533
           DFGEVINP+DY++K E+MD   M   GD  D  LDEGSA L+LD+ PSKV+SNE+T    
Sbjct: 479 DFGEVINPEDYLMKQEEMDNTLMPGAGDGMDSMLDEGSARLLLDSTPSKVISNEMTVQVK 538

Query: 534 -----------------------------VLVHGSAEATEHLKQHCLKHVCPHVYTPQIE 564
                                        VLVHGSAEATEHLK HC K+   HVY PQIE
Sbjct: 539 CSLAYMDFEGRSDGRSVKSVIAHVAPLKLVLVHGSAEATEHLKMHCSKNSDLHVYAPQIE 598

Query: 565 ETIDVTSDLCAYKVQLSEKLMSNVLFKK 592
           ETIDVTSDLCAYKVQLSEKLMSNV+ KK
Sbjct: 599 ETIDVTSDLCAYKVQLSEKLMSNVISKK 626


>gi|168010331|ref|XP_001757858.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691134|gb|EDQ77498.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 724

 Score =  831 bits (2146), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 427/751 (56%), Positives = 525/751 (69%), Gaps = 72/751 (9%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQVTPLSG  +E PL YL+ +DGF FL+DCGW D FD SLL+PL  VA TIDAVLL
Sbjct: 1   MGTSVQVTPLSGAHSEAPLCYLLQVDGFRFLLDCGWTDSFDLSLLEPLKSVAPTIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+PDT+HLGA  YA  +LGL A ++ T PV+ +G + MYD  LSR+ VS FDLFTLDD+D
Sbjct: 61  SYPDTIHLGAFTYAFAKLGLQATMYCTLPVHHMGQMYMYDHVLSRKAVSNFDLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           ++F +  +L Y Q+Y L GKGEG+ + P+ AGHLLGGT+WKITKD E++IYAVD+N RKE
Sbjct: 121 TSFANSVQLKYQQHYQLQGKGEGMTITPYAAGHLLGGTIWKITKDTEEIIYAVDFNHRKE 180

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAG 239
           +HLN TVLE+FVRPAVLITDAYNAL+NQPPR+QR+  F D I K LRA GNVLLPV++AG
Sbjct: 181 RHLNKTVLENFVRPAVLITDAYNALNNQPPRKQRDQEFIDMILKVLRAEGNVLLPVETAG 240

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
           RVLEL+L LE  WA   L+YP+  LT VS ST+++ KS LEWM DSI +SF +SR+N+FL
Sbjct: 241 RVLELILHLESNWAHQRLSYPVALLTNVSYSTVEFAKSLLEWMSDSIARSFGSSRENSFL 300

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
           LK++ L  ++ E D  P GPK+V ASMASLE GF+ D+FVEWA+D +NLVLFTERGQ GT
Sbjct: 301 LKYLKLCHDRKEFDELPSGPKVVFASMASLEGGFARDLFVEWATDSRNLVLFTERGQMGT 360

Query: 360 LARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKE-----EES 414
           LA+ LQA+PPPK VKVTMS+++PL GEEL AYE EQ RLK     +  LV+E      E+
Sbjct: 361 LAKKLQAEPPPKIVKVTMSQKIPLTGEELQAYELEQ-RLKMATETEVDLVEEVGPNSPEA 419

Query: 415 KASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYEN 474
           KA  GP      +P    A N   S           R ILIDGF     +  PMFP YEN
Sbjct: 420 KAVTGPLPLTVAEP----ATNEIPSQ----------RQILIDGFTASDKTAGPMFPLYEN 465

Query: 475 NSEWDDFGEVINPDDYIIKDEDM-----DQAAMHIGGDDGKLDEGSASLILDAKPSKVVS 529
            S+WD++GEVINP+DY ++D +M      Q A     +D    E  A  IL  +PSKVV 
Sbjct: 466 PSDWDEYGEVINPEDYRVEDTEMMDYQSSQQAPVADVEDNTDQEAEA--ILADRPSKVVV 523

Query: 530 NELT---------------------------------VLVHGSAEATEHLKQHCLKHVCP 556
            + T                                 VLVHGSAEATEHL+QHC+K+VC 
Sbjct: 524 KDYTVYVKCALYYMDFEGRSDGRSIKNILAHVAPIKLVLVHGSAEATEHLRQHCVKNVCR 583

Query: 557 HVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTEN-GMLSL 615
            VY P+I ET DVTSDLCAYKV+L+E+LMS+VLF+KLGDYE+AW+D E+G  E+ GML L
Sbjct: 584 DVYAPRIGETQDVTSDLCAYKVRLTERLMSSVLFRKLGDYEVAWIDGEIGSQESEGMLPL 643

Query: 616 LPISTPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGG 675
           LP  TP PPHKSV VGDL++AD K  L++KGIQ EFAGG LRCG+   +R+ G       
Sbjct: 644 LPSETP-PPHKSVFVGDLRLADFKQLLATKGIQAEFAGGVLRCGDAFAVRRSG------- 695

Query: 676 GSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
             G+QQ+VIEGPL E+YYK+R  LYSQFY+L
Sbjct: 696 --GSQQLVIEGPLSEEYYKLRDLLYSQFYML 724


>gi|302819854|ref|XP_002991596.1| hypothetical protein SELMODRAFT_429848 [Selaginella moellendorffii]
 gi|300140629|gb|EFJ07350.1| hypothetical protein SELMODRAFT_429848 [Selaginella moellendorffii]
          Length = 715

 Score =  792 bits (2045), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 401/735 (54%), Positives = 527/735 (71%), Gaps = 49/735 (6%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQ+TPL+G  +E PL YL+ +D F FL+DCGWND FD SLLQPL  VA TIDAVLL
Sbjct: 1   MGTSVQLTPLAGAHSEGPLCYLLQVDDFRFLLDCGWNDVFDVSLLQPLVSVAPTIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SH DTLHLGALPYA+ +LGL+A V+ T P+  +G + MYD  LSR  VS FDLF+LDD+D
Sbjct: 61  SHSDTLHLGALPYAIAKLGLNATVYCTHPIRSMGHMQMYDHCLSRTAVSHFDLFSLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           +AF +   L YSQ++ L GKG+GI++ P  A  LLGGT+WKITKD ED+IYAVD+N RKE
Sbjct: 121 TAFSNTCPLKYSQHFPLQGKGQGIIITPFPAARLLGGTIWKITKDTEDIIYAVDFNHRKE 180

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAG 239
           +HLN TVLESF RPAVLITDAYNAL++QP R+QR+  F D I +TLR+ GNVLLPV+ +G
Sbjct: 181 RHLNATVLESFTRPAVLITDAYNALNSQPVRRQRDQEFLDIILRTLRSSGNVLLPVEPSG 240

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
           RVLE++L L+ +W++H +N P+ FLTYV  S  D+VKS LEWM D+I K+FE +R+N F 
Sbjct: 241 RVLEIILYLDQHWSQHRINVPLVFLTYVVGSVTDFVKSSLEWMNDAIGKAFEQNRENPFA 300

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
           L+ V L  ++ +L+  P GP++VLASMASLE GF+ ++F+EWA D KNLVLFTER Q GT
Sbjct: 301 LRSVKLCTSRKQLEELPPGPRVVLASMASLETGFAKELFLEWAVDPKNLVLFTERAQVGT 360

Query: 360 LARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLG 419
           LAR LQ +PPPK VK+T+S++V LVGEEL AYE EQ+RL +EEA  A+  +E    AS  
Sbjct: 361 LARQLQVEPPPKIVKITISKKVLLVGEELEAYEREQSRL-REEARNAASQQEPVQPAS-- 417

Query: 420 PDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWD 479
                S D M    + ++  ++  +     + DI IDGF  P+ +VAPMFP Y++++E D
Sbjct: 418 ----SSDDLMPSSPDESSTPSEGKQQAVTVHHDIFIDGFTVPADTVAPMFPVYDDSNERD 473

Query: 480 DFGEVINPDDYIIKDEDMDQAAMHIGGDDGKLD-EGSASLILDAKPSKVVSNELT----- 533
           ++GE+INPDD++IK+E MD +      ++ KL+ EG  S     KPSKVV+ +       
Sbjct: 474 EYGEIINPDDFVIKEEFMDYSQTQANANNIKLETEGDTSA---EKPSKVVTTDTAVVPLC 530

Query: 534 ----------------------VLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDVTS 571
                                 VL+HGSAE+TEHLKQHCLK+VCP VYTP++ E ++VTS
Sbjct: 531 ALTFMDFEGRADGRSIKSILAHVLIHGSAESTEHLKQHCLKNVCPFVYTPRVGENMNVTS 590

Query: 572 DLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPPHKSVLVG 631
           DL AYK++L+E++MS+VLF+KLGDYE+AWVD E+G+ E  +L LLP+    PPHK+V VG
Sbjct: 591 DLNAYKLRLTERIMSSVLFRKLGDYELAWVDGEIGQNEEDLLPLLPLDGTPPPHKTVFVG 650

Query: 632 DLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCED 691
           DL++AD K  L++KGIQ EFAGG LRC + + +RK          SG QQ+VIEG L +D
Sbjct: 651 DLRLADFKQLLATKGIQAEFAGGVLRCADNIAVRK----------SGGQQLVIEGSLSDD 700

Query: 692 YYKIRAYLYSQFYLL 706
           YYK+R  LYSQ++++
Sbjct: 701 YYKVRELLYSQYHIV 715


>gi|302776792|ref|XP_002971541.1| hypothetical protein SELMODRAFT_441578 [Selaginella moellendorffii]
 gi|300160673|gb|EFJ27290.1| hypothetical protein SELMODRAFT_441578 [Selaginella moellendorffii]
          Length = 721

 Score =  790 bits (2040), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 404/747 (54%), Positives = 529/747 (70%), Gaps = 67/747 (8%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQ+TPL+G  +E PL YL+ +D F FL+DCGWND FD SLLQPL  VA TIDAVLL
Sbjct: 1   MGTSVQLTPLAGAHSEGPLCYLLQVDDFRFLLDCGWNDVFDVSLLQPLVSVAPTIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SH DTLHLGALPYA+ +LGL+A V+ T P+  +G + MYD  LSR  VS FDLF+LDD+D
Sbjct: 61  SHSDTLHLGALPYAIAKLGLNATVYCTHPIRSMGHMQMYDHCLSRTAVSHFDLFSLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           +AF +   L YSQ++ L GKG+GI + P  A  LLGGT+WKITKD ED+IYAVD+N RKE
Sbjct: 121 TAFSNTCPLKYSQHFPLQGKGQGITITPFPAARLLGGTIWKITKDTEDIIYAVDFNHRKE 180

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAG 239
           +HLN TVLESF RPAVLITDAYNAL++QP R+QR+  F D I +TLR+ GNVLLPV+ +G
Sbjct: 181 RHLNATVLESFTRPAVLITDAYNALNSQPVRRQRDQEFLDIILRTLRSSGNVLLPVEPSG 240

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
           RVLE++L L+ +W++H +N P+ FLTYV  S  D+VKS LEWM D+I K+FE +R+N F 
Sbjct: 241 RVLEIILYLDQHWSQHRINVPLVFLTYVVGSVTDFVKSSLEWMNDAIGKAFEQNRENPFA 300

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
           L+ V L  ++ +LD  P GP++VLASMASLE GF+ ++F+EWA D KNLVLFTER Q GT
Sbjct: 301 LRSVKLCTSRKQLDELPPGPRVVLASMASLETGFAKELFLEWAVDPKNLVLFTERAQVGT 360

Query: 360 LARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLG 419
           LAR LQ +PPPK VK+T+S++V LVGEEL AYE EQ+RL +EEA  A+  +E    AS  
Sbjct: 361 LARQLQVEPPPKIVKITISKKVLLVGEELEAYEREQSRL-REEARNAASQQEPVQPAS-- 417

Query: 420 PDNNLSGDPMVIDANNANASADVVEPHGGR------YRDILIDGFVPPSTSVAPMFPFYE 473
                S D ++  A + +++     P  G+      + DI IDGF  P+ +VAPMFP Y+
Sbjct: 418 -----SSDDLMPSAPDESST-----PSEGKQQAVTVHHDIFIDGFTVPADTVAPMFPVYD 467

Query: 474 NNSEWDDFGEVINPDDYIIKDEDMDQAAMHIGGDDGKLD-EGSASLILDAKPSKVVSNEL 532
           +++E D++GE+INPDD++IK+E MD +      ++ KL+ EG  S     KPSKVV+ + 
Sbjct: 468 DSNERDEYGEIINPDDFVIKEEFMDYSQTQANANNIKLETEGDTSA---EKPSKVVTTDT 524

Query: 533 T---------------------------------VLVHGSAEATEHLKQHCLKHVCPHVY 559
                                             VL+HGSAE+TEHLKQHCLK+VCP VY
Sbjct: 525 AVVPLCALTFMDFEGRADGRSIKSILAHVAPLKLVLIHGSAESTEHLKQHCLKNVCPFVY 584

Query: 560 TPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPIS 619
           TP++ E ++VTSDL AYK++L+E++MS+VLF+KLGDYE+AWVD E+G+ E  +L LLP+ 
Sbjct: 585 TPRVGENMNVTSDLNAYKLRLTERIMSSVLFRKLGDYELAWVDGEIGQNEEDLLPLLPLD 644

Query: 620 TPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGT 679
              PPHK+V VGDL++AD K  L++KGIQ EFAGG LRC + + +RK          SG 
Sbjct: 645 GTPPPHKTVFVGDLRLADFKQLLATKGIQAEFAGGVLRCADNIAVRK----------SGG 694

Query: 680 QQIVIEGPLCEDYYKIRAYLYSQFYLL 706
           QQ+VIEG L +DYYK+R  LYSQ++++
Sbjct: 695 QQLVIEGSLSDDYYKVRELLYSQYHIV 721


>gi|297808389|ref|XP_002872078.1| hypothetical protein ARALYDRAFT_910398 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297317915|gb|EFH48337.1| hypothetical protein ARALYDRAFT_910398 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 544

 Score =  678 bits (1750), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 360/562 (64%), Positives = 427/562 (75%), Gaps = 65/562 (11%)

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQN 134
           MKQLGLSAPV++TEPV+RLGLLTMYDQ+LSR+QVS+FDLFTLDDIDSAFQ+V RLTYSQN
Sbjct: 1   MKQLGLSAPVYATEPVHRLGLLTMYDQFLSRKQVSDFDLFTLDDIDSAFQNVIRLTYSQN 60

Query: 135 YHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRP 194
           YHLSG+G  IV+APHVAGH+LGG++W+ITKDGEDVIYAVDYN RKE+HLNGTVL+SFVRP
Sbjct: 61  YHLSGRG--IVIAPHVAGHMLGGSIWRITKDGEDVIYAVDYNHRKERHLNGTVLQSFVRP 118

Query: 195 AVLITDAYNALH-NQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 252
           AVLITDAY+AL+ NQ  RQQR+  F D ISK L  GGNVLLPVD+AGRVLELLLILE +W
Sbjct: 119 AVLITDAYHALYTNQTARQQRDKEFLDTISKHLEVGGNVLLPVDTAGRVLELLLILEQHW 178

Query: 253 AEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           ++   ++PIYFLTYVSSSTIDYVKSFLEWM DSI+KSFETSRDNAFLL            
Sbjct: 179 SQRGFSFPIYFLTYVSSSTIDYVKSFLEWMSDSISKSFETSRDNAFLL------------ 226

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
                          SLEAGF+ +IFVEWA+D +NLVLFTE GQFGTLARMLQ+ PPPK 
Sbjct: 227 ---------------SLEAGFAREIFVEWANDPRNLVLFTETGQFGTLARMLQSAPPPKF 271

Query: 373 VKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVID 432
           VKVTMS+RVPL GEELIAYEEEQ RLK+EEAL+ASLVKE E+KAS G D+N S +PMVID
Sbjct: 272 VKVTMSKRVPLAGEELIAYEEEQNRLKREEALRASLVKEVETKASHGSDDN-SSEPMVID 330

Query: 433 ANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYII 492
               +   DVV  HG  Y+DILIDGFVPPS+SVAPMFPFY+N SEWDDFGEVINPDDY+I
Sbjct: 331 TKTTH---DVVGSHGPAYKDILIDGFVPPSSSVAPMFPFYDNTSEWDDFGEVINPDDYVI 387

Query: 493 KDEDMDQAAMHIGGD-DGKLDEGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQHCL 551
           KDEDMD+ AMH GGD DG+LDE +ASL+LD +PSKV+SNEL             ++    
Sbjct: 388 KDEDMDRGAMHNGGDVDGRLDEATASLMLDTRPSKVISNEL-------------IRIGFT 434

Query: 552 KHVCPHVYTPQI----EETIDVTSDLCAYKVQL-SEKLMS--------NVLF-KKLGDYE 597
           +H+   ++TP++    E  + V      Y ++   EKL+          VL+ KKLG+  
Sbjct: 435 RHLRGGLFTPKVACFKEGVMFVKRKKYYYSLKFYHEKLIKTFTEMQRLRVLYGKKLGNNS 494

Query: 598 IAWVDAEVGKTENGMLSLLPIS 619
              + +E  +T+ G L LL ++
Sbjct: 495 RLLLWSE--QTQTGNLKLLDLN 514


>gi|357440035|ref|XP_003590295.1| Cleavage and polyadenylation specificity factor subunit [Medicago
           truncatula]
 gi|355479343|gb|AES60546.1| Cleavage and polyadenylation specificity factor subunit [Medicago
           truncatula]
          Length = 630

 Score =  539 bits (1388), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 253/301 (84%), Positives = 280/301 (93%), Gaps = 1/301 (0%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQVTPL GV+NENPLSYLVSID FN LIDCGWNDHFDPSLLQPLS+VASTIDAVLL
Sbjct: 1   MGTSVQVTPLCGVYNENPLSYLVSIDSFNILIDCGWNDHFDPSLLQPLSRVASTIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPDTLHL ALPYA+K LGLSAPV+STEPVYRLGLLTMYD +LSR+QVS+FDLFTLDDID
Sbjct: 61  SHPDTLHLAALPYAIKHLGLSAPVYSTEPVYRLGLLTMYDHFLSRKQVSDFDLFTLDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           SAFQ+VTRLTYSQN+HLSGKGEGIV+APH AGHLLGGT+WKITKDGEDVIYAVD+N RKE
Sbjct: 121 SAFQTVTRLTYSQNHHLSGKGEGIVIAPHTAGHLLGGTIWKITKDGEDVIYAVDFNHRKE 180

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAG 239
           +HLNGTVL SFVRPAVLITDAYNAL+NQP R+Q++  F D + KTLRAGGNVLLPVD+AG
Sbjct: 181 RHLNGTVLGSFVRPAVLITDAYNALNNQPYRRQKDKEFGDILKKTLRAGGNVLLPVDTAG 240

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
           R+LEL+L+LE YWA+ +LNYPIYFLTYV+SSTIDYVKSFLEWM DSI KSFE +R+N FL
Sbjct: 241 RILELILMLESYWADENLNYPIYFLTYVASSTIDYVKSFLEWMSDSIAKSFEQTRENIFL 300

Query: 300 L 300
           L
Sbjct: 301 L 301


>gi|157112944|ref|XP_001657690.1| cleavage and polyadenylation specificity factor [Aedes aegypti]
 gi|108884656|gb|EAT48881.1| AAEL000118-PA [Aedes aegypti]
          Length = 744

 Score =  496 bits (1277), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 287/771 (37%), Positives = 434/771 (56%), Gaps = 92/771 (11%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  +SG  +E+P  Y++ +D   FL+DCGW++ FDP+ ++ L K   TIDAVLL
Sbjct: 1   MTSIIKLHAISGAMDESPPCYILQVDEVRFLLDCGWDEKFDPNFIKELKKYVHTIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+PD LHLGALPY + +LGL+ P+++T PVY++G + MYD ++S   + +FDLFTLDD+D
Sbjct: 61  SYPDGLHLGALPYLVGKLGLNCPIYATIPVYKMGQMFMYDLFMSHYNMYDFDLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L Y+Q+  L GKG GI + P  AGHL+GGT+WK+ K G ED++YA D+N +K
Sbjct: 121 AAFDRIIQLKYNQSVSLKGKGYGITITPLPAGHLIGGTIWKVMKVGEEDIVYATDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG  LE   RP++LITDAYNA + Q  R+ R E F   I +TLR  GNVL+ VD+A
Sbjct: 181 ERHLNGCELEKLQRPSLLITDAYNAKYQQARRRARDEKFMTNILQTLRNNGNVLVTVDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W       + Y +  L  VS + +++ KS +EWM D + KSFE +R+
Sbjct: 241 GRVLELAHMLDQLWKNKESGLMAYSLALLNNVSYNVVEFAKSQIEWMSDKLMKSFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L    +EL   P  PK+VLAS A +E+GFS ++FV+WAS+V N ++ T R 
Sbjct: 301 NPFQFKHLRLCHTMAELAKVP-SPKVVLASSADMESGFSRELFVQWASNVNNSIIITCRS 359

Query: 356 QFGTLAR-MLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEES 414
             GTLAR +++     + +++ + RRV L G EL    EE  R + E+  ++ +  + + 
Sbjct: 360 SPGTLARDLIENGGNGRKIELDVRRRVELEGAEL----EEYMRTEGEKHNRSIIKSDMDL 415

Query: 415 KASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYEN 474
            +S   D+ L    M +     +    VV P G  +      GF   S     MFPF+E 
Sbjct: 416 DSSSDSDDELE---MSVITGKHDI---VVRPEGRSH-----TGFFKSSKKQYAMFPFHEE 464

Query: 475 NSEWDDFGEVINPDDYIIKDED-----MDQAAMHIGGDDGKLDEGSASLILDAKPSKVVS 529
             ++D++GE+I PDDY + D        D     I  +D K ++     +LD KP+K +S
Sbjct: 465 KIKFDEYGEIIQPDDYKMIDLGPDGGFEDNKENQIKPEDIKKEKDEELSVLD-KPTKCIS 523

Query: 530 N---------------------------------ELTVLVHGSAEATEHLKQHCLKHVCP 556
           +                                    V++ GS + T H+ +HC  ++  
Sbjct: 524 SRKLVEVNAQVQFIDFEGRSDGESMLKILSQLRPRRVVVIRGSPQNTAHIAEHCQLNIGA 583

Query: 557 HVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEV----------- 605
            V+TP   E ID T++   Y+V+L+E L+S + F+K  D E+AW+DA++           
Sbjct: 584 RVFTPNRGEIIDATTETHIYQVRLTEALISQLEFQKGKDAEVAWIDAQIVIPAASDTPMD 643

Query: 606 --------GKTENGMLSLLPISTPA-PPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGAL 656
                    K++  +L+L P+     P H SV + +LK+ D K  L    I  EF+GG L
Sbjct: 644 VDQVEGNDDKSDRQILTLEPMKNDELPAHHSVFINELKLIDFKQVLMKANISSEFSGGVL 703

Query: 657 RCGE-YVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
            C    V +R+V           T ++ +EG L E+YYKIR  LY Q+ ++
Sbjct: 704 WCNNGTVALRRV----------DTGKVTVEGCLSEEYYKIRELLYEQYAIV 744


>gi|195054718|ref|XP_001994270.1| GH10247 [Drosophila grimshawi]
 gi|193896140|gb|EDV95006.1| GH10247 [Drosophila grimshawi]
          Length = 754

 Score =  493 bits (1269), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 285/779 (36%), Positives = 436/779 (55%), Gaps = 98/779 (12%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  +SG  +E+P  Y++ ID    L+DCGW++ FDP+ ++ L +   T+DAVLL
Sbjct: 1   MTSIIKLHTISGAMDESPPCYILQIDDVRILLDCGWDEKFDPNFIKELKRQVHTLDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD  HLGALPY + +LGL+ P+++T PV+++G + MYD Y+S   + +FDLF+LDD+D
Sbjct: 61  SHPDAYHLGALPYLVGKLGLNCPIYATIPVFKMGQMFMYDLYMSHFNMYDFDLFSLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGE-DVIYAVDYNRRK 179
           +AF  +T+L Y+Q   L GKG GI + P  AGH++GGT+WKI K GE D++YA+D+N +K
Sbjct: 121 TAFDKITQLKYNQTVSLKGKGYGISITPLSAGHMIGGTIWKIVKVGEEDIVYAIDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HL+G  L+   RP++LITDAYNA + Q  R+ R E     I +T+R  GNVL+ VD+A
Sbjct: 181 ERHLSGCELDRLQRPSLLITDAYNAQYQQARRRARDEKLMTNILQTVRNNGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W       + Y +  L  VS + I++ KS +EWM D +TK+FE +R+
Sbjct: 241 GRVLELAHMLDQLWKNKESGLMAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L    +++   P GPK+VLAS   +E+GF+ D+FV+WA +  N ++FT R 
Sbjct: 301 NPFQFKHINLCHTLADVYKLPVGPKVVLASTPDMESGFTRDLFVQWAGNPNNSIIFTTRT 360

Query: 356 QFGTLA-RMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEES 414
             G+L+  +++   P + +++ + RRV L G EL  Y   Q      E L   +VK E  
Sbjct: 361 GPGSLSMELVENSVPGRQLELDVRRRVELEGAELEEYLRTQG-----EKLNPLIVKPEVE 415

Query: 415 KASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYEN 474
           ++S     +       I+ +      D+V    GR+      GF   +     MFPF+E 
Sbjct: 416 ESSSSESED------DIEMSVITGKHDIVVRAEGRHHS----GFFKSNKRHHVMFPFHEE 465

Query: 475 NSEWDDFGEVINPDDYIIKDEDMDQAAMH-IGGDDGKLDEGSASL----------ILDAK 523
             ++DD+GEVIN DDY I D + D  AM     ++ K +E  A L           L  K
Sbjct: 466 KIKYDDYGEVINLDDYRIVDANYDYTAMDDQNKENVKKEEPHAELHSNGNLDNDVQLLEK 525

Query: 524 PSKVVSNELTV---------------------------------LVHGSAEATEHLKQHC 550
           P+K++S   T+                                 +VHG+AE T+ + +HC
Sbjct: 526 PTKLISQRKTIEVHAQIQRIDFEGRSDGESMLKILSQLRPRRVIVVHGTAEGTQVVAKHC 585

Query: 551 LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVG---- 606
            ++V   V+TPQ  E IDVT+++  Y+V+L+E L+S + F+K  D E+AW+D  +G    
Sbjct: 586 EQNVGARVFTPQKGEIIDVTTEIHIYQVRLTEGLVSQLQFQKGKDAEVAWIDGRLGMRLQ 645

Query: 607 -----------------KTENGMLSLLPIST-PAPPHKSVLVGDLKMADLKPFLSSKGIQ 648
                              E   L+L  ++    P H SVL+ +LK++D K  L    I 
Sbjct: 646 AIDAPNQSEITVEQDVAAQEGKTLTLETLAEDEIPVHNSVLINELKLSDFKQVLMRNSIN 705

Query: 649 VEFAGGALRCGE-YVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
            EF+GG L C    + +R+V           T ++ +EG + E+YYKIR  LY Q+ ++
Sbjct: 706 SEFSGGVLWCCNGTLALRRV----------DTGKVAMEGCISEEYYKIRELLYEQYAIV 754


>gi|390333491|ref|XP_780045.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 isoform 1 [Strongylocentrotus purpuratus]
          Length = 773

 Score =  492 bits (1267), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 293/811 (36%), Positives = 423/811 (52%), Gaps = 143/811 (17%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++TP SGV +E+P  Y++ +D F FL+DCGW++HF    ++ L K    +DAVLL
Sbjct: 1   MTSIIKLTPFSGVLDESPPCYMLQVDEFRFLLDCGWDEHFTMENIEGLKKHIHQVDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+PD LHLGALPY + +  L+ P+++T PVY++G + MYD Y S+    EFDLF LDD+D
Sbjct: 61  SYPDNLHLGALPYLVGKCNLTCPIYATVPVYKMGQMFMYDLYQSKHNYEEFDLFNLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L YSQ+  L GKG G+ + P   GH++GGT+WKI KDG E++IYAVDYN +K
Sbjct: 121 AAFDRIIQLKYSQSVTLKGKGHGLTITPLSGGHMIGGTIWKIVKDGEEEIIYAVDYNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG VLE+  RP++LITD +NA + Q  R+ R E   D I  T+R  GNVL+ VD+A
Sbjct: 181 ERHLNGAVLETISRPSLLITDCFNATYVQARRRARDEKLMDIILNTMRNEGNVLISVDTA 240

Query: 239 GRVLELLLILEDYWAEHSL---NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRV+EL L+L+  W        NY +  L  VS + +++ KS +EWM D + ++FE  R+
Sbjct: 241 GRVVELSLLLDQLWRNQDSGLGNYNLAMLNNVSYNVVEFAKSQVEWMSDKVMRAFEDRRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L  N  EL   PD PK+VLAS+  LE G+S ++F++W+ D KN V+ T R 
Sbjct: 301 NPFQFKHLKLCHNLKELAKVPD-PKVVLASVPDLECGYSRELFIQWSGDAKNSVILTNRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAY---EEEQTRLKK-EEALKASL--- 408
             GTLAR L   P P  +K+ +S+RV L  EEL  Y   E+E+ R +K +EA +  L   
Sbjct: 360 SHGTLARRLIETPNPNQLKLRVSKRVKLEKEELDEYRIHEKEKERQRKVDEAAQRRLEGD 419

Query: 409 ----VKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTS 464
                +EE     +G         M  D      S                  F      
Sbjct: 420 SSDESEEEMEVDDMGRSRTKHDLMMNTDTGKKGTS------------------FFKTVKK 461

Query: 465 VAPMFPFYENNSEWDDFGEVINPDDYIIKDE---------------DMDQAAMHIGGD-- 507
             PMFPF+E    WDD+GEVI P+DY+IK+                D + AA    GD  
Sbjct: 462 SYPMFPFHEERLRWDDYGEVIKPEDYMIKETVQTEEEKEVKEEENADFEDAA---EGDIP 518

Query: 508 ---------------------DGKLD-EGSASLILDAKPSKVVSNELTVLVHGSAEATEH 545
                                +G+ D E    LI   KP ++      VLV G   AT+H
Sbjct: 519 TKCIASQIIVDVKCSITFIDFEGRSDGESMKKLITQVKPRQL------VLVRGQMNATQH 572

Query: 546 LKQHC-LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAE 604
           L ++C L+     V+ P++ E  D T +   Y+V+L + L+S++LF K  D E++W+D  
Sbjct: 573 LAEYCHLQLAGVKVFIPRMNEICDATMESHIYQVKLKDSLVSSLLFSKTRDTELSWIDGC 632

Query: 605 V----------GKTENGMLS----------------------------------LLPI-- 618
           +          GK   G  S                                  ++P+  
Sbjct: 633 LDLQSAGDKLAGKAIKGSDSSPNGDEKSFGDEKKKTPGLGLGNESEDSSDDEDDIIPVLD 692

Query: 619 ---STPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGG 675
              +    PH+ V V   +  D K  L+  GI+ EF GG L C   V I++     +KG 
Sbjct: 693 AVQTNEVTPHRQVYVNPPRFLDFKQVLAKNGIRAEFTGGVLVCNNTVAIKR----NEKG- 747

Query: 676 GSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                 + +EG +C+DYY +R  LY Q+ ++
Sbjct: 748 -----HLTLEGAVCDDYYTVRELLYEQYAIV 773


>gi|156399337|ref|XP_001638458.1| predicted protein [Nematostella vectensis]
 gi|156225579|gb|EDO46395.1| predicted protein [Nematostella vectensis]
          Length = 737

 Score =  491 bits (1264), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 285/772 (36%), Positives = 433/772 (56%), Gaps = 101/772 (13%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  LSG  +E PL YL+ +D F FL+DCGWN+  D  +++ + +    +DAVL+
Sbjct: 1   MTSIIKLNVLSGAHDEAPLCYLLQVDEFRFLLDCGWNETLDMEIMESIKRHVQQVDAVLV 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S PD  H+G LPY + + GL  P+++T PVY++G + MYD Y   +   EFD+F+LDD+D
Sbjct: 61  SFPDIYHMGGLPYLVGKCGLHCPIYTTIPVYKMGQMFMYDWYQCHQNSEEFDVFSLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           + F  + +L YSQ   L GKG GI + P+ AGH++GGT+WKI KDG ED+IYAVDYN +K
Sbjct: 121 AVFDKIIQLKYSQTVSLKGKGHGITITPYAAGHMIGGTMWKIVKDGEEDIIYAVDYNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG VLE+  RP++LITD++NAL+ Q  R++R+      I KT+R  GNV++ +D+A
Sbjct: 181 ERHLNGAVLETLSRPSLLITDSFNALNIQTRRRERDTQLMGEILKTMRRHGNVMIAIDTA 240

Query: 239 GRVLELLLILEDYWA--EHSLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W   +  L+ Y +  L  VS + I++ KS +EWM D I K+FE  R+
Sbjct: 241 GRVLELSQLLDQLWRNLDSGLSAYSLAMLNNVSYNVIEFAKSQVEWMSDKIMKAFEIGRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N +  ++  L  + ++L   P+ PK+VLASM  L AGFS D+FVEWA + KN V+FT R 
Sbjct: 301 NPYQFRYCHLCHSLADLARVPE-PKVVLASMMDLTAGFSRDLFVEWADNPKNTVIFTARS 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKA--SLVKEEE 413
             GTLAR L  +   K V++ + +RV L GEEL  Y EE  + +K+  + A  +LV E++
Sbjct: 360 SPGTLARTLIDNLELKQVELEVKQRVRLGGEELERYLEENKKKEKDYPVLAISTLVAEDD 419

Query: 414 SKASLGPDNNLSGDPMVIDANNANASADVV--EPHGGRYRDILIDGFVPPSTSVAPMFPF 471
           S            D  V D   + A  D++  E   GR        F   + S  PMFP 
Sbjct: 420 S------------DSEVEDEVASGARHDLMMAEQKSGRK-----SSFFKQARSF-PMFPC 461

Query: 472 YENNSEWDDFGEVINPDDYIIKD----EDMDQ-----------------------AAMHI 504
           +E  ++WDD+GE I P+DY+ ++    E+  Q                         +  
Sbjct: 462 HEEKAKWDDYGEFIRPEDYMQRELSATEEEKQKVVRDLSKVPTKCISQKKTVSIRCTLAF 521

Query: 505 GGDDGKLDEGSASLILD-AKPSKVVSNELTVLVHGSAEATEHLKQHC---LKHVCPHVYT 560
              +G+ D  S   IL+   P K+      VLVHG +++T+HL  +C          V+T
Sbjct: 522 IDFEGRSDGESIKRILNLVNPRKL------VLVHGDSKSTQHLADYCQSSSSIQVSQVFT 575

Query: 561 PQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKT------------ 608
           P + ET++ T +   Y+V+L + L+S++ F +  D E+AW+D ++               
Sbjct: 576 PAVGETVEATGERHIYQVKLRDALVSSLQFAQARDAELAWIDGQLDMKLAPANQDLMGDK 635

Query: 609 ---------ENGMLSLLPI-----STPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGG 654
                    ++  L  +P+     S+    H SV + + +++D K  L+  GIQ EFAGG
Sbjct: 636 PGEEKMETDQDEALDTVPVLEQNTSSKIAGHVSVFINEPRLSDFKQVLNKAGIQAEFAGG 695

Query: 655 ALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
            L C   V +R+          + T ++ +EG +CEDYY IR  LYSQ+ ++
Sbjct: 696 VLICNNVVCVRR----------NETGRVGLEGTVCEDYYTIRDLLYSQYAIV 737


>gi|443725188|gb|ELU12868.1| hypothetical protein CAPTEDRAFT_155355 [Capitella teleta]
          Length = 728

 Score =  491 bits (1263), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 290/763 (38%), Positives = 423/763 (55%), Gaps = 92/763 (12%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++ P SGV  E+P  Y++ +D F+FL+DCGW++ FDP  ++ L K    IDAVLL
Sbjct: 1   MTSIIKLQPFSGVDGESPPCYMLQVDEFHFLLDCGWDEEFDPVFMENLKKHLPQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+PD  HLGALPY + + G++ P++ST PVY++G + MYD Y S     EF+LF+LDD+D
Sbjct: 61  SYPDPQHLGALPYLVGKCGMTCPIYSTLPVYKMGQMFMYDLYQSHHNSEEFNLFSLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L YSQ  +L GKG G+ + P  AGH++GGT+WKI KDG E++IYAVDYN ++
Sbjct: 121 AAFDRIQQLKYSQTINLKGKGHGLQITPLPAGHMIGGTIWKIVKDGEEEIIYAVDYNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG VLE+  RP +LITDAYNA  NQ  R+ R E     I +TLR  GN L+ +D+A
Sbjct: 181 ERHLNGCVLETINRPHLLITDAYNADFNQARRRLRDEQLMTTILQTLRNDGNCLVALDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GR+LEL  +L+  W       + Y +  L  V+ + +++ KS +EWM D I +SFE  R+
Sbjct: 241 GRILELAHLLDQMWRNQESGLMAYSLALLNNVAYNVVEFAKSQVEWMSDKIMRSFEERRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L  + +EL   P+ PK+VLAS   L+ GFS ++FV+W S+ KN ++ T R 
Sbjct: 301 NPFQFKHLQLCHSMAELAKVPE-PKVVLASTPDLQTGFSRELFVQWCSNPKNCIILTNRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
              TL R L   P   +V++ + RRV L G  L  +      L+ E   KA + +E+  K
Sbjct: 360 APPTLCRQLIDYPNRGSVRLEVKRRVRLEGRALEDF------LRAERERKAEVEREKAEK 413

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILI-----DGFVPPSTSVAPMFP 470
                +   S D           SAD     GGR+ D+++      GF         MFP
Sbjct: 414 ERREREGLESSDD----------SADEEVGDGGRH-DLMVKMEKGKGFFKQVKKSQAMFP 462

Query: 471 FYENNSEWDDFGEVINPDDYIIKD-EDMDQAAMH-----------------IGGD----- 507
           F E   +WD++GE+I  +DYIIK+   M+   MH                 I        
Sbjct: 463 FEEEKLKWDEYGEIIRIEDYIIKEATTMEDEPMHNELKSFVTEKTEVPTKCISSSETLEL 522

Query: 508 ---------DGKLDEGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQHCLKHVCP-- 556
                    +G+ D  S   I+    S+V   +L +LV GS E+TE L   C     P  
Sbjct: 523 RANILYIDFEGRSDGDSMRKII----SQVRPRQL-ILVRGSRESTESLAAFCRD--APDI 575

Query: 557 -HVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA-----EVGKTEN 610
             VYTP++ E +D T++   ++V+L + ++S + F K  D EIAW+DA     +    E+
Sbjct: 576 GKVYTPRLNELVDATTESKIFQVRLKDSVVSALNFSKARDAEIAWIDAMLDLNQAEAMED 635

Query: 611 GM----LSLLPISTPAP---PHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVT 663
           G        +P+  P     PH +V V + K++D K  L + G+Q EF+ G L C   V 
Sbjct: 636 GENPEDEEAVPVVIPTSQIRPHGAVFVNEPKLSDFKQTLVNLGVQAEFSAGVLICNSVVA 695

Query: 664 IRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
           +RK   AG         ++ +EG LC+DYY+IR  LY QF ++
Sbjct: 696 VRK-NEAG---------RLQLEGTLCDDYYRIRQLLYEQFAIV 728


>gi|195109795|ref|XP_001999467.1| GI23051 [Drosophila mojavensis]
 gi|193916061|gb|EDW14928.1| GI23051 [Drosophila mojavensis]
          Length = 754

 Score =  490 bits (1262), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 285/785 (36%), Positives = 432/785 (55%), Gaps = 110/785 (14%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  +SG  +E+P  Y++ ID    L+DCGW++ FDP+ ++ L +   T+DAVLL
Sbjct: 1   MTSIIKLHTISGAMDESPPCYILQIDDVRILLDCGWDEKFDPNFIKELKRQVHTLDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD  HLGALPY + +LGL+ P+++T PV+++G + MYD Y+S   + +FDLF+LDD+D
Sbjct: 61  SHPDVYHLGALPYLVGKLGLNCPIYATIPVFKMGQMFMYDLYMSHFNMYDFDLFSLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF+ +T+L Y+Q   L GKG GI + P  AGH++GGTVWKI K G ED+IYAVD+N +K
Sbjct: 121 TAFEKITQLKYNQTVSLKGKGYGISITPLSAGHMIGGTVWKIVKVGEEDIIYAVDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HL+G  L+   RP++LITDAYNA + Q  R+ R E     I +T+R  GNVL+ VD+A
Sbjct: 181 ERHLSGCELDRLQRPSLLITDAYNAQYQQARRRARDEKLMTNILQTVRNNGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W       + Y +  L  VS + I++ KS +EWM D +TK+FE +R+
Sbjct: 241 GRVLELAHMLDQLWKNKESGLMAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L    +++   P GPK+VLAS   +E+GF+ D+FV+WA +  N ++FT R 
Sbjct: 301 NPFQFKHINLCHTLADIYKLPAGPKVVLASTPDMESGFTRDLFVQWAGNPNNSIIFTTRT 360

Query: 356 QFGTLAR-MLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEES 414
             G+L+  +++   P + +++ + RRV L G EL  Y   Q      E L   +VK E  
Sbjct: 361 GPGSLSMDLVENYSPGRQIELDLRRRVELEGAELEEYLRTQG-----EKLNPLIVKPEVE 415

Query: 415 KASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYEN 474
           + S     +       I+ +      D+V    GR+      GF   +     MFP++E 
Sbjct: 416 EESSSESED------DIEMSVITGKHDIVVRSEGRHH----SGFFKSNKRHHVMFPYHEE 465

Query: 475 NSEWDDFGEVINPDDYIIKDEDM------DQAAMHIGGDDGKLDEGSASLI-----LDAK 523
             ++DD+GEVIN DDY I D         DQ   +I  ++  ++  S   +     L  K
Sbjct: 466 KIKYDDYGEVINLDDYRIVDTGYDYAPTDDQNKENIKKEEPHVEPQSNGNLNNDVQLLEK 525

Query: 524 PSKVVSNELT---------------------------------VLVHGSAEATEHLKQHC 550
           P+K++S   T                                 ++VHG+AE T+ + +HC
Sbjct: 526 PTKLISQRKTIEVNAQIQRIDFEGRSDGESMLKILSQLRPRRVIVVHGTAEGTQIVAKHC 585

Query: 551 LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTEN 610
            ++V   V+TPQ  E IDVT+++  Y+V+L+E L+S + F+K  D E+AW+D  +G    
Sbjct: 586 EQNVGARVFTPQKGEIIDVTTEIHIYQVRLTEGLVSQLQFQKGKDAEVAWIDGRLG---- 641

Query: 611 GMLSLLPISTPA----------------------------PPHKSVLVGDLKMADLKPFL 642
             + L  I  P                             P H SVL+ +LK++D K  L
Sbjct: 642 --MRLQAIDAPTQSEVTVEQDVAALEGKTLTLEMLEEDEIPVHNSVLINELKLSDFKQVL 699

Query: 643 SSKGIQVEFAGGALRCGE-YVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYS 701
               I  EF+GG L C    + +R+V             ++ +EG L EDYYKIR  LY 
Sbjct: 700 MRNNINSEFSGGVLWCCNGTLALRRVDVG----------KVAMEGCLSEDYYKIRELLYE 749

Query: 702 QFYLL 706
           Q+ ++
Sbjct: 750 QYAIV 754


>gi|194745794|ref|XP_001955372.1| GF16269 [Drosophila ananassae]
 gi|190628409|gb|EDV43933.1| GF16269 [Drosophila ananassae]
          Length = 756

 Score =  489 bits (1259), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 277/784 (35%), Positives = 430/784 (54%), Gaps = 106/784 (13%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  +SG  +E+P  Y++ ID    L+DCGW++ FDP+ ++ L +   T+DAVLL
Sbjct: 1   MTSIIKLHTISGAMDESPPCYILQIDDVRILLDCGWDEKFDPNFIKELKRQVHTLDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD  HLGALPY + +LGL+ P+++T PV+++G + MYD Y+S   + +FDLF+LDD+D
Sbjct: 61  SHPDAYHLGALPYLVGKLGLNCPIYATIPVFKMGQMFMYDLYMSHFNMGDFDLFSLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF+ +T+L Y+Q   L GKG GI + P  AGH++GGT+WKI K G ED++YA D+N +K
Sbjct: 121 TAFEKITQLKYNQTVSLKGKGYGISITPLNAGHMIGGTIWKIVKVGEEDIVYATDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HL+G  L+   RP++LITDAYNA + Q  R+ R E     I +T+R  GNVL+ VD+A
Sbjct: 181 ERHLSGCELDRLQRPSLLITDAYNAQYQQARRRARDEKLMTNILQTVRNNGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W       + Y +  L  VS + I++ KS +EWM D +TK+FE +R+
Sbjct: 241 GRVLELAHMLDQLWKNKESGLMAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L  + +++   P GPK+VLAS   LE+GF+ D+FV+WAS+  N ++ T R 
Sbjct: 301 NPFQFKHIQLCHSLADIYKLPAGPKVVLASTPDLESGFTRDLFVQWASNSNNSIILTTRT 360

Query: 356 QFGTLA-RMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEES 414
             GTLA  +++   P + +++ + RRV L G EL  Y   Q      E L   +VK    
Sbjct: 361 SPGTLAMELVENCTPGRQIELDIRRRVELEGAELDEYLRTQG-----EKLNPLIVK---- 411

Query: 415 KASLGPDNNLSGDPMV---IDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPF 471
                PD            I+ +      D+V    GR+      GF   +     MFP+
Sbjct: 412 -----PDVEEESSSESEDDIEMSVITGKHDIVVRPEGRHH----SGFFKSNKRHHVMFPY 462

Query: 472 YENNSEWDDFGEVINPDDYIIKD---------EDMDQAAMHIGGDDGKLDEGSASLILDA 522
           +E   ++D++GE+IN DDY I D         E+ ++  +        +D  +   I D 
Sbjct: 463 HEEKVKYDEYGEIINLDDYRIADTSGYDFVPMEEQNKENVKKEEPGSGIDHQTNGTIGDT 522

Query: 523 ------KPSKVVSNELT---------------------------------VLVHGSAEAT 543
                 KP+K+++   T                                 +++HG+AE T
Sbjct: 523 DVQLLEKPTKLINQRKTIEVNAQIQRIDFEGRSDGESMLKILSQLRPRRVIVIHGTAEGT 582

Query: 544 EHLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA 603
           + + +HC ++V   V+TPQ  E IDVT+++  Y+V+L+E L+S + F+K  D E+AWVD 
Sbjct: 583 QVVAKHCEQNVGARVFTPQKGEIIDVTTEIHIYQVRLTEGLVSQLQFQKGKDAEVAWVDG 642

Query: 604 EVGKTENGMLSLLPISTPA--------------------PPHKSVLVGDLKMADLKPFLS 643
            +G     + + + ++                       P H SVL+ +LK++D K  L 
Sbjct: 643 RLGMRLKAIDAAMDVTAEQDNSAQEAKTLTLETLAEDEIPVHNSVLINELKLSDFKQILM 702

Query: 644 SKGIQVEFAGGALRCGE-YVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQ 702
              I  EF+GG L C    + +R+V             ++ +EG L E+YYKIR  LY Q
Sbjct: 703 RNNINSEFSGGVLWCSNGTLALRRVDAG----------KVAMEGCLSEEYYKIRELLYEQ 752

Query: 703 FYLL 706
           + ++
Sbjct: 753 YAIV 756


>gi|194906654|ref|XP_001981406.1| GG11633 [Drosophila erecta]
 gi|190656044|gb|EDV53276.1| GG11633 [Drosophila erecta]
          Length = 756

 Score =  489 bits (1258), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 278/783 (35%), Positives = 432/783 (55%), Gaps = 104/783 (13%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  +SG  +E+P  Y++ ID    L+DCGW++ FD + ++ L +   T+DAVLL
Sbjct: 1   MTSIIKLHTISGAMDESPPCYILQIDDVRILLDCGWDEKFDANFIKELKRQVHTLDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD  HLGALPY + +LGL+ P+++T PV+++G + MYD Y+S   + +FDLF+LDD+D
Sbjct: 61  SHPDAYHLGALPYLVGKLGLNCPIYATIPVFKMGQMFMYDLYMSHFNMGDFDLFSLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF+ +T+L Y+Q   L GKG GI + P  AGH++GGT+WKI K G ED++YA D+N +K
Sbjct: 121 TAFEKITQLKYNQTVSLKGKGYGISITPLNAGHMIGGTIWKIVKVGEEDIVYATDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HL+G  L+   RP++LITDAYNA + Q  R+ R E     I +T+R  GNVL+ VD+A
Sbjct: 181 ERHLSGCELDRLQRPSLLITDAYNAQYQQARRRARDEKLMTNILQTVRNNGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W       + Y +  L  VS + I++ KS +EWM D +TK+FE +R+
Sbjct: 241 GRVLELAHMLDQLWKNKESGLMAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L  + +++   P GPK+VLAS   LE+GF+ D+FV+WAS+  N ++ T R 
Sbjct: 301 NPFQFKHIQLCHSLADVYKLPAGPKVVLASTPDLESGFTRDLFVQWASNANNSIILTTRT 360

Query: 356 QFGTLA-RMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEES 414
             GTLA  +++   P K +++ + RRV L G EL  Y   Q      E L   +VK +  
Sbjct: 361 SPGTLAMELVENCAPGKQIELDVRRRVELEGAELEEYLRTQG-----EKLNPLIVKPDVE 415

Query: 415 KASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYEN 474
             S       S     I+ +      D+V    GR+      GF   +     MFP++E 
Sbjct: 416 DES------SSESEDDIEMSVITGKHDIVVRPEGRHH----SGFFKSNKRHHVMFPYHEE 465

Query: 475 NSEWDDFGEVINPDDYIIKD--------------EDMDQAAMHIGGD---DGKLDEGSAS 517
             + D++GE+IN DDY I D              E++ +    +G D   +G + +    
Sbjct: 466 KVKCDEYGEIINLDDYRIADATGYDFVPMEEQNKENVKKEEPGMGADQQANGGIGDNDVQ 525

Query: 518 LILDAKPSKVVSNELT---------------------------------VLVHGSAEATE 544
           L+   KP+K+++   T                                 +++HG+AE T+
Sbjct: 526 LL--EKPTKLINQRKTIEVNAQVQRIDFEGRSDGESMLKILSQLRPRRVIVIHGTAEGTQ 583

Query: 545 HLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAE 604
            + +HC ++V   V+TPQ  E IDVT+++  Y+V+L+E L+S + F+K  D E+AWVD  
Sbjct: 584 VVARHCEQNVGARVFTPQKGEIIDVTTEIHIYQVRLTEGLVSQLQFQKGKDAEVAWVDGR 643

Query: 605 VGKTENGMLSLLPISTPA--------------------PPHKSVLVGDLKMADLKPFLSS 644
           +G     + + + ++                       P H SVL+ +LK++D K  L  
Sbjct: 644 LGMRVKAIEAPMDVTVEQDASVQEGKTLTLETLDDDEIPIHNSVLINELKLSDFKQILMR 703

Query: 645 KGIQVEFAGGALRCGE-YVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQF 703
             I  EF+GG L C    + +R+V             ++ +EG L E+YYKIR  LY Q+
Sbjct: 704 NNINSEFSGGVLWCSNGTLALRRVDAG----------KVAMEGCLSEEYYKIRELLYEQY 753

Query: 704 YLL 706
            ++
Sbjct: 754 AIV 756


>gi|195503417|ref|XP_002098643.1| GE26465, isoform A [Drosophila yakuba]
 gi|194184744|gb|EDW98355.1| GE26465, isoform A [Drosophila yakuba]
          Length = 756

 Score =  486 bits (1252), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 281/786 (35%), Positives = 431/786 (54%), Gaps = 110/786 (13%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  +SG  +E+P  Y++ ID    L+DCGW++ FD + ++ L +   T+DAVLL
Sbjct: 1   MTSIIKLHTISGAMDESPPCYILQIDDVRILLDCGWDEKFDANFIKELKRQVHTLDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD  HLGALPY + +LGL+ P+++T PV+++G + MYD Y+S   + +FDLF+LDD+D
Sbjct: 61  SHPDAYHLGALPYLVGKLGLNCPIYATIPVFKMGQMFMYDLYMSHFNMGDFDLFSLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF+ +T+L Y+Q   L GKG GI + P  AGH++GGT+WKI K G ED++YA D+N +K
Sbjct: 121 TAFEKITQLKYNQTVSLKGKGYGISITPLNAGHMIGGTIWKIVKVGEEDIVYATDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HL+G  L+   RP++LITDAYNA + Q  R+ R E     I +T+R  GNVL+ VD+A
Sbjct: 181 ERHLSGCELDRLQRPSLLITDAYNAQYQQARRRARDEKLMTNILQTVRNNGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W       + Y +  L  VS + I++ KS +EWM D +TK+FE +R+
Sbjct: 241 GRVLELAHMLDQLWKNKESGLMAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L  + +++   P GPK+VLAS   LE+GF+ D+FV+WAS+  N ++ T R 
Sbjct: 301 NPFQFKHIQLCHSLADVYKLPAGPKVVLASTPDLESGFTRDLFVQWASNANNSIILTTRT 360

Query: 356 QFGTLA-RMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEES 414
             GTLA  +++   P K +++ + RRV L G EL  Y   Q      E L   +VK    
Sbjct: 361 SPGTLAMELVENCAPGKQIELDVRRRVELEGAELEEYLRTQG-----EKLNPLIVK---- 411

Query: 415 KASLGPDNNLSGDPMV---IDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPF 471
                PD            I+ +      D+V    GR+      GF   +     MFP+
Sbjct: 412 -----PDVEEESSSESEDDIEMSVITGKHDIVVRPEGRHH----SGFFKSNKRHHVMFPY 462

Query: 472 YENNSEWDDFGEVINPDDYIIKD--------------EDMDQAAMHIGGD---DGKLDEG 514
           +E   + D++GE+IN DDY I D              E++ +    +G D   +G + + 
Sbjct: 463 HEEKVKCDEYGEIINLDDYRIADATGYDFVPMEEQNKENVKKEEPGLGADQQTNGGIGDN 522

Query: 515 SASLILDAKPSKVVSNELT---------------------------------VLVHGSAE 541
              L+   KP+K+ +   T                                 +++HG+AE
Sbjct: 523 DVQLL--EKPTKLXNQRKTIEVNAQVQRIDFEGRSDGESMLKILSQLRPRRVIVIHGTAE 580

Query: 542 ATEHLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWV 601
            T+ + +HC ++V   V+TPQ  E IDVT+++  Y+V+L+E L+S + F+K  D E+AWV
Sbjct: 581 GTQVVARHCEQNVGARVFTPQKGEIIDVTTEIHIYQVRLTEGLVSQLQFQKGKDAEVAWV 640

Query: 602 DAEVGK-------------------TENGMLSLLPIS-TPAPPHKSVLVGDLKMADLKPF 641
           D  +G                     E   L+L  ++    P H SVL+ +LK++D K  
Sbjct: 641 DGRLGMRVKAIEAPMDVTVEQDASVQEGKTLTLETLADDEIPIHNSVLINELKLSDFKQI 700

Query: 642 LSSKGIQVEFAGGALRCGE-YVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLY 700
           L    I  EF+GG L C    + +R+V             ++ +EG L E+YYKIR  LY
Sbjct: 701 LMRNNINSEFSGGVLWCSNGTLALRRVDAG----------KVAMEGCLSEEYYKIRELLY 750

Query: 701 SQFYLL 706
            Q+ ++
Sbjct: 751 EQYAIV 756


>gi|21358013|ref|NP_651658.1| cleavage and polyadenylation specificity factor 100, isoform A
           [Drosophila melanogaster]
 gi|18203548|sp|Q9V3D6.1|CPSF2_DROME RecName: Full=Probable cleavage and polyadenylation specificity
           factor subunit 2; AltName: Full=Cleavage and
           polyadenylation specificity factor 100 kDa subunit;
           Short=CPSF 100 kDa subunit
 gi|5679134|gb|AAD46873.1|AF160933_1 LD14168p [Drosophila melanogaster]
 gi|7301732|gb|AAF56844.1| cleavage and polyadenylation specificity factor 100, isoform A
           [Drosophila melanogaster]
          Length = 756

 Score =  485 bits (1248), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 282/786 (35%), Positives = 431/786 (54%), Gaps = 110/786 (13%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  +SG  +E+P  Y++ ID    L+DCGW++ FD + ++ L +   T+DAVLL
Sbjct: 1   MTSIIKLHTISGAMDESPPCYILQIDDVRILLDCGWDEKFDANFIKELKRQVHTLDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD  HLGALPY + +LGL+ P+++T PV+++G + MYD Y+S   + +FDLF+LDD+D
Sbjct: 61  SHPDAYHLGALPYLVGKLGLNCPIYATIPVFKMGQMFMYDLYMSHFNMGDFDLFSLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF+ +T+L Y+Q   L  KG GI + P  AGH++GGT+WKI K G ED++YA D+N +K
Sbjct: 121 TAFEKITQLKYNQTVSLKDKGYGISITPLNAGHMIGGTIWKIVKVGEEDIVYATDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HL+G  L+   RP++LITDAYNA + Q  R+ R E     I +T+R  GNVL+ VD+A
Sbjct: 181 ERHLSGCELDRLQRPSLLITDAYNAQYQQARRRARDEKLMTNILQTVRNNGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W       + Y +  L  VS + I++ KS +EWM D +TK+FE +R+
Sbjct: 241 GRVLELAHMLDQLWKNKESGLMAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L  + +++   P GPK+VLAS   LE+GF+ D+FV+WAS+  N ++ T R 
Sbjct: 301 NPFQFKHIQLCHSLADVYKLPAGPKVVLASTPDLESGFTRDLFVQWASNANNSIILTTRT 360

Query: 356 QFGTLA-RMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEES 414
             GTLA  +++   P K +++ + RRV L G EL  Y   Q      E L   +VK    
Sbjct: 361 SPGTLAMELVENCAPGKQIELDVRRRVDLEGAELEEYLRTQG-----EKLNPLIVK---- 411

Query: 415 KASLGPDNNLSGDPMV---IDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPF 471
                PD            I+ +      D+V    GR+      GF   +     MFP+
Sbjct: 412 -----PDVEEESSSESEDDIEMSVITGKHDIVVRPEGRHH----SGFFKSNKRHHVMFPY 462

Query: 472 YENNSEWDDFGEVINPDDYIIKD--------------EDMDQAAMHIGGD---DGKLDEG 514
           +E   + D++GE+IN DDY I D              E++ +    IG +   +G + + 
Sbjct: 463 HEEKVKCDEYGEIINLDDYRIADATGYEFVPMEEQNKENVKKEEPGIGAEQQANGGIVDN 522

Query: 515 SASLILDAKPSKVVSNELT---------------------------------VLVHGSAE 541
              L+   KP+K++S   T                                 +++HG+AE
Sbjct: 523 DVQLL--EKPTKLISQRKTIEVNAQVQRIDFEGRSDGESMLKILSQLRPRRVIVIHGTAE 580

Query: 542 ATEHLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWV 601
            T+ + +HC ++V   V+TPQ  E IDVTS++  Y+V+L+E L+S + F+K  D E+AWV
Sbjct: 581 GTQVVARHCEQNVGARVFTPQKGEIIDVTSEIHIYQVRLTEGLVSQLQFQKGKDAEVAWV 640

Query: 602 DAEVGK-------------------TENGMLSLLPIS-TPAPPHKSVLVGDLKMADLKPF 641
           D  +G                     E   L+L  ++    P H SVL+ +LK++D K  
Sbjct: 641 DGRLGMRVKAIEAPMDVTVEQDASVQEGKTLTLETLADDEIPIHNSVLINELKLSDFKQT 700

Query: 642 LSSKGIQVEFAGGALRCGE-YVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLY 700
           L    I  EF+GG L C    + +R+V             ++ +EG L E+YYKIR  LY
Sbjct: 701 LMRNNINSEFSGGVLWCSNGTLALRRVDAG----------KVAMEGCLSEEYYKIRELLY 750

Query: 701 SQFYLL 706
            Q+ ++
Sbjct: 751 EQYAIV 756


>gi|195449222|ref|XP_002071979.1| GK22564 [Drosophila willistoni]
 gi|194168064|gb|EDW82965.1| GK22564 [Drosophila willistoni]
          Length = 757

 Score =  483 bits (1244), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 282/784 (35%), Positives = 428/784 (54%), Gaps = 105/784 (13%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  +SG  +E+P  Y++ ID    L+DCGW++ FD + ++ L +   T+DAVLL
Sbjct: 1   MTSIIKLHTISGAMDESPPCYILQIDDVRILLDCGWDEKFDANFIRDLKRQVHTLDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD  HLGALPY + +LGL+ P+++T PV+++G + MYD Y+S   + +FDLF+LDD+D
Sbjct: 61  SHPDAYHLGALPYLVGKLGLNCPIYATIPVFKMGQMFMYDLYMSHFNMGDFDLFSLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF+ +T+L Y+Q   L GKG GI + P  AGH++GGT+WKI K G ED++YA D+N +K
Sbjct: 121 TAFEKITQLKYNQTVSLKGKGYGISITPLNAGHMIGGTIWKIVKVGEEDIVYATDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HL+G  LE   RP++LITDAYNAL+ Q  R+ R E     I +T+R  GNVL+ VD+A
Sbjct: 181 ERHLSGCELERLQRPSLLITDAYNALYQQARRRARDEKLMTNILQTVRNNGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W       + Y +  L  VS + I++ KS +EWM D +TK+FE +R+
Sbjct: 241 GRVLELAHMLDQLWKNKESGLMAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L  + +++   P GPK+VLAS   +E+GF+ D+FV+WA++  N ++FT R 
Sbjct: 301 NPFQFKHINLCHSLADVFKLPAGPKVVLASTPDMESGFTRDLFVQWAANPNNSIIFTTRT 360

Query: 356 QFGTLA-RMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEES 414
             G+LA  +++   P + +++ + RRV L G EL  Y   Q      E L   ++K    
Sbjct: 361 SPGSLAMELVENAVPGRKIELDVRRRVELEGPELEEYLRTQG-----EKLNPLIIK---- 411

Query: 415 KASLGPDNNLSGDPMV---IDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPF 471
                PD            I+ +      D+V    GR+      GF   +     MFP+
Sbjct: 412 -----PDVEEESSSESEDDIEMSVITGKHDIVVRPEGRH----TSGFFKSNKRHHVMFPY 462

Query: 472 YENNSEWDDFGEVINPDDYIIKD---------EDMDQAAM--------------HIGGD- 507
           +E   ++D++GE+IN DDY I D         E+ ++  +              H  GD 
Sbjct: 463 HEEKIKYDEYGEIINLDDYRIADLGGYDYLPAEEQNKENVKKEEPGGGQQDQQQHANGDM 522

Query: 508 --DGKLDEGSASLILDAKPSKVVSN------------------------ELTVLVHGSAE 541
             D +L E    LI   K  +V +                            ++VHG+AE
Sbjct: 523 DTDVQLLEKPTKLINQRKTIEVNAQIQRIDFEGRSDGESMLKILSQLRPRRVIVVHGTAE 582

Query: 542 ATEHLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWV 601
            T+ + +HC ++V   V+TP   E IDVT+++  Y+V+L+E L+S + F+K  + E+AWV
Sbjct: 583 GTKAVARHCEQNVGARVFTPNKGEIIDVTTEIHIYQVRLTEGLVSQLQFQKAKNAEVAWV 642

Query: 602 DA------------------EVGKTENGMLSLLPIST-PAPPHKSVLVGDLKMADLKPFL 642
           D                   EV   E   L+L  +     P H SVL+ +LK++D K  L
Sbjct: 643 DGRLGMRLKAIDGATNPTEQEVSIQEGQTLTLETLEEDEIPVHNSVLINELKLSDFKQIL 702

Query: 643 SSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQ 702
               I  EF+GG L C       +   AG         ++ +EG L EDYYKIR  LY Q
Sbjct: 703 MRNNINSEFSGGVLWCSNNTLALRRIDAG---------KVSMEGCLSEDYYKIRELLYEQ 753

Query: 703 FYLL 706
           + ++
Sbjct: 754 YAIV 757


>gi|196012036|ref|XP_002115881.1| hypothetical protein TRIADDRAFT_30006 [Trichoplax adhaerens]
 gi|190581657|gb|EDV21733.1| hypothetical protein TRIADDRAFT_30006 [Trichoplax adhaerens]
          Length = 745

 Score =  483 bits (1244), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 289/779 (37%), Positives = 437/779 (56%), Gaps = 107/779 (13%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSG  +E P  YL+ +D FNFL+DCGW+++FD  +++ + +    IDAVLL
Sbjct: 1   MTSIIRMTVLSGGQDEGPPCYLLQVDEFNFLLDCGWDENFDMEMMERVKRHIHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGA+PY + +  L  P+++T PV+++G + MYD +LSR    +FDLF+LDDID
Sbjct: 61  SHPDLLHLGAIPYLVGKCQLKCPIYATVPVHKMGQMFMYDLFLSRNDYEDFDLFSLDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGE-DVIYAVDYNRRK 179
            AF  +T L YSQ+ HL+GKG G+ + P+ AGH++GGT+WKI KDGE D+IYAVDYN +K
Sbjct: 121 DAFSRITALKYSQHVHLTGKGNGLTITPYAAGHMVGGTIWKIIKDGEEDIIYAVDYNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTL---RAGGNVLLPVD 236
           E+HLNG+VLE+   P++LITDAYNA +NQ  R+ R+  Q  IS+ L   R+GGNVL+ VD
Sbjct: 181 ERHLNGSVLETLTHPSLLITDAYNAQYNQAKRRDRD--QKLISRVLNALRSGGNVLIAVD 238

Query: 237 SAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSR 294
           +AGRVLEL L+L+  W +      YPI  L +VS + +++ KS +EWM D +  +FE +R
Sbjct: 239 TAGRVLELSLLLDHLWRKDPGLSAYPIALLNHVSYNVVEFAKSQVEWMCDKVLVAFEDNR 298

Query: 295 DNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
           +N F  K++ L  + +EL   P+ PK+VLAS   L  GF+ D+F++WA + KNL +FT R
Sbjct: 299 NNPFQFKYIQLCHSLNELSGLPE-PKVVLASSPDLTCGFARDLFLQWAGNSKNLTIFTGR 357

Query: 355 GQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEES 414
              GTL R +  D  P+++ VT+  RV L G EL  Y +++   +K + L          
Sbjct: 358 SSPGTLGRHI-LDERPQSIDVTVKTRVELSGNELEEYLQKEREKEKVKELDGLKF----- 411

Query: 415 KASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILI------DGFVPPSTSVAPM 468
                         + ID+++   +       G   RD++I        F   +  V PM
Sbjct: 412 --------------VTIDSDDELTTITGGYHTGKVKRDLMIKDDDRRSSFFKKAV-VHPM 456

Query: 469 FPFYENNSEWDDFGEVINPDDYIIKD---EDMDQAAMH------IGGDDGKLDEGSASLI 519
           +PF E   +WD++GE+INP+D+ + D   ED  +   H      +   + K+     S +
Sbjct: 457 YPFSETRIKWDEYGEIINPEDFTLIDVSEEDKPKKVTHSDRHYFLNKGNPKIPTKCVSFL 516

Query: 520 ----LDAKPSKV----------VSNELT-------VLVHGSAEATEHLKQHCLKHV---C 555
               ++ + S +          + N L+       VLV GS+ A + L   C +      
Sbjct: 517 KHIDINCRISLIDFEGRSDGESIRNILSLVNPRHLVLVRGSSAAVQELGNFCRQSKEMGV 576

Query: 556 PHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSL 615
             V+TP + +T+D T +   Y+V+L + L+S++ +    D E+AWVD  V  T  G   L
Sbjct: 577 RKVFTPVVGQTVDATFESHLYQVRLRDSLVSSLYYCNAKDAELAWVDGRVTVTAKGHERL 636

Query: 616 L-----------------------PISTP-----APPHKSVLVGDLKMADLKPFLSSKGI 647
           L                       PI  P      P HKSV + D +++DLK  L+  GI
Sbjct: 637 LDKNNKNEDEAMDTDNTSITEAVVPILEPLLQSEIPGHKSVFINDPRLSDLKQTLTKAGI 696

Query: 648 QVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
           Q EF GG + C + + +R+          + T +I +EG +C DYY +R  LY Q+ ++
Sbjct: 697 QAEFVGGVIVCNDKIAVRR----------TETGKITLEGAICNDYYTVRDILYQQYAII 745


>gi|410916717|ref|XP_003971833.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like [Takifugu rubripes]
          Length = 787

 Score =  482 bits (1241), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 290/810 (35%), Positives = 438/810 (54%), Gaps = 133/810 (16%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T +SGV  E+ L YL+ +D F FL+DCGW+++F   ++  + +    +DAVLL
Sbjct: 1   MTSIIKLTAVSGVQEESALCYLLQVDEFRFLLDCGWDENFSMDIIDAMKRYVHQVDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD +HLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPIHLGALPYAVGKLGLNCTIYATIPVYKMGQMFMYDLYQSRNNSEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           SAF  + +L YSQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 SAFDKIQQLKYSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LES  RP++LITD++NA + QP R+QR EM    + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCTLESISRPSLLITDSFNATYVQPRRKQRDEMLLTNVMETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLN---YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W         YP+  L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYPLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H+TL  + ++L   P  PK+VL S   LE+GFS ++F++W+ D KN ++ T R 
Sbjct: 301 NPFQFRHLTLCHSLADLARVP-SPKVVLCSQPDLESGFSRELFIQWSKDAKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTL+R L  +P  K + + + +RV L G EL  Y  E+ R+KKE A K    KE +  
Sbjct: 360 TPGTLSRYLIDNPGEKHLDLEVRKRVKLEGRELEEY-LEKDRVKKEAAKKLEQAKEVDVD 418

Query: 416 ASLGPDNNLSGD-PMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYEN 474
           +S   D +   + P ++ + + +    +++  G R        F   +    PMFP +E 
Sbjct: 419 SSDESDIDDDLEQPTIVKSKHHDL---MMKSEGSRK-----GSFFKQAKKSYPMFPTHEE 470

Query: 475 NSEWDDFGEVINPDDYIIK-------------------DEDMDQ-------------AAM 502
             +WD++GE+I  +D+++                    DE MDQ              ++
Sbjct: 471 RIKWDEYGEIIRLEDFLVPELQATEEEKSKFDSGLTNGDEPMDQDLSVLPTKCISNVESL 530

Query: 503 HIGGDDGKLD-EGSASLILDAKPSKVVSNELT----VLVHGSAEATEHLKQHCL---KHV 554
            I      +D EG +    D    K + N++     V+VHG  EA+  L + C    K +
Sbjct: 531 EIRARVTYIDYEGRS----DGDSIKKIINQMKPRQLVIVHGPPEASLDLAESCKAFSKDI 586

Query: 555 CPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVGKTEN 610
              VYTP+++ETID TS+   Y+V+L + L+S++ F K  D E+AW+D      V K + 
Sbjct: 587 --KVYTPKLQETIDATSETHIYQVRLKDSLVSSLQFCKAKDTELAWIDGVLDMRVVKVDT 644

Query: 611 GML----------------------------------------------------SLLPI 618
           G++                                                     ++P 
Sbjct: 645 GVMLEDGVKEEGEDSELSMEVTPDLGIEPSAIAVAAQRAMKNLFGEEEKELSEESDIIPT 704

Query: 619 STPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQK 673
             P P      H+SV + + +++D K  L  +GIQ EF GG L C   V +R+   AG+ 
Sbjct: 705 LEPLPTPEVPGHQSVFINEPRLSDFKQVLLREGIQAEFVGGVLVCNNMVAVRRT-EAGRI 763

Query: 674 GGGSGTQQIVIEGPLCEDYYKIRAYLYSQF 703
           G         +EG LCEDYYKIR  LY Q+
Sbjct: 764 G---------LEGCLCEDYYKIRELLYQQY 784


>gi|50539828|ref|NP_001002384.1| cleavage and polyadenylation specificity factor subunit 2 [Danio
           rerio]
 gi|49903850|gb|AAH76029.1| Cleavage and polyadenylation specific factor 2 [Danio rerio]
          Length = 790

 Score =  478 bits (1230), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 284/812 (34%), Positives = 441/812 (54%), Gaps = 128/812 (15%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++ F   ++  L +    +DAVLL
Sbjct: 1   MTSIIKLTALSGVQEESALCYLLQVDEFRFLLDCGWDETFSMDIIDSLKRYVHQVDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD +HLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDHVHLGALPYAVGKLGLNCTIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           SAF  + +L YSQ  +L GKG G+ + P  AGH++GGT+WKI KDG E++IY VD+N ++
Sbjct: 121 SAFDKIQQLKYSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIIYGVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LES  RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLESLSRPSLLITDSFNASYVQPRRKQRDEQLLTNVMETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L  + S+L   P  PK+VL S   LE+GFS ++F++W  D KN V+ T R 
Sbjct: 301 NPFQFRHLSLCHSLSDLARVP-SPKVVLCSQPDLESGFSRELFIQWCQDAKNSVILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K +++ + +R  L G EL  Y E++ R+KKE A K    KE +  
Sbjct: 360 TPGTLARYLIDNPGEKRIELEIRKRCRLEGRELEEYMEKE-RMKKEAAKKLEQAKEVDLD 418

Query: 416 ASLGPDNNLSGD---PMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFY 472
           +S   ++++  D   P V+   + +    +++  GGR       GF   +     MFP +
Sbjct: 419 SS--DESDMEDDLEQPAVVKTKHHDL---MMKGEGGRK-----GGFFKQAKKSYSMFPTH 468

Query: 473 ENNSEWDDFGEVINPDDYIIKD---EDMDQAAMHIGGDDG-------------KLDEGSA 516
           E   +WD++GE+I P+D+++ +    + +++ +  G  +G             K    + 
Sbjct: 469 EERIKWDEYGEIIRPEDFLVPELQATEEEKSKLESGLTNGEEPMEQDLSDVPTKCTSTTQ 528

Query: 517 SLILDAK-------------PSKVVSNELT----VLVHGSAEATEHLKQHCLKHVCP--H 557
           +L + A+               K + N++     ++VHG  +A++ L + C  +      
Sbjct: 529 TLDIRARVMYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPDASQDLAESCKAYSGKDIK 588

Query: 558 VYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA-------------- 603
           VY P+++ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D               
Sbjct: 589 VYIPKLQETVDATSETHIYQVRLKDSLVSSLQFCKARDTELAWIDGVLDMRVEKVDTGVI 648

Query: 604 -EVGKT--------ENGM-----LSLLPISTPA--------------------------- 622
            E+G+         E GM     L+  P +  A                           
Sbjct: 649 VELGEAKDEAEEGGEQGMEVTEELNTEPSTAAAANQRAMKTLFGEDEKEISEESDVIPTL 708

Query: 623 ---PPH-----KSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKG 674
              P H     +SV + + +++D K  L  +GIQ EF GG L C   V +R+   AG   
Sbjct: 709 EPLPAHEVPGHQSVFINEPRLSDFKQVLLREGIQAEFVGGVLVCNNLVAVRRT-EAG--- 764

Query: 675 GGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                 +I +EG  C+DYY+IR  LY Q+ ++
Sbjct: 765 ------RICLEGCHCDDYYRIRELLYEQYAVV 790


>gi|195341087|ref|XP_002037143.1| GM12754 [Drosophila sechellia]
 gi|194131259|gb|EDW53302.1| GM12754 [Drosophila sechellia]
          Length = 743

 Score =  478 bits (1229), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 278/783 (35%), Positives = 431/783 (55%), Gaps = 117/783 (14%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  +SG  +E+P  Y++ ID    L+DCGW++ FD + ++ L +   T+DAVLL
Sbjct: 1   MTSIIKLHTISGAMDESPPCYILQIDDVRILLDCGWDEKFDANFIKELKRQVHTLDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD  HLGALPY + +LGL+ P+++T PV+++G + MYD Y+S   + +FDLF+LDD+D
Sbjct: 61  SHPDAYHLGALPYLVGKLGLNCPIYATIPVFKMGQMFMYDLYMSHFNMGDFDLFSLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF+ +T+L Y+Q   L GKG GI + P  AGH++GGT+WKI K G ED++YA D+N +K
Sbjct: 121 TAFEKITQLKYNQTVSLKGKGYGISITPLNAGHMIGGTIWKIVKVGEEDIVYATDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HL+G  L+   RP++LITDAYNA + Q  R+ R E     I +T+R  GNVL+ VD+A
Sbjct: 181 ERHLSGCELDRLQRPSLLITDAYNAQYQQARRRARDEKLMTNILQTVRNNGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W       + Y +  L  VS + I++ KS +EWM D +TK+FE +R+
Sbjct: 241 GRVLELAHMLDQLWKNKESGLMAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L  + +++   P GPK+VLAS   LE+GF+ D+FV+WAS+  N ++ T R 
Sbjct: 301 NPFQFKHIQLCHSLADVYKLPAGPKVVLASTPDLESGFTRDLFVQWASNANNSIILTTRT 360

Query: 356 QFGTLA-RMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEES 414
             GTLA  +++   P K +++ + RRV L G EL  Y   Q      E L   +VK    
Sbjct: 361 SPGTLAMELVENCAPGKQIELDVRRRVDLEGAELEEYLRTQG-----EKLNPLIVK---- 411

Query: 415 KASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYEN 474
                        P V + +++ +  D+         DI+       +     MFP++E 
Sbjct: 412 -------------PDVEEESSSESEDDIEMSVITGKHDIV------SNKRHHVMFPYHEE 452

Query: 475 NSEWDDFGEVINPDDYIIKD--------------EDMDQAAMHIGGD---DGKLDEGSAS 517
             + D++GE+IN DDY I D              E++ +    IG D   +G + +    
Sbjct: 453 KVKCDEYGEIINLDDYRIADATGYEFVPMEEQNKENVKKEEPGIGADQQANGAIVDNDVQ 512

Query: 518 LILDAKPSKVVSNELT---------------------------------VLVHGSAEATE 544
           L+   KP+K+++   T                                 +++HG+AE T+
Sbjct: 513 LL--EKPTKLINQRKTIEVNAQVQRIDFEGRSDGESMLKILSQLRPRRVIVIHGTAEGTQ 570

Query: 545 HLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAE 604
            + +HC ++V   V+TPQ  E IDVT+++  Y+V+L+E L+S + F+K  D E+AWVD  
Sbjct: 571 VVARHCEQNVGARVFTPQKGEIIDVTTEIHIYQVRLTEGLVSQLQFQKGKDAEVAWVDGR 630

Query: 605 VGK-------------------TENGMLSLLPIS-TPAPPHKSVLVGDLKMADLKPFLSS 644
           +G                     E   L+L  ++    P H SVL+ +LK++D K  L  
Sbjct: 631 LGMRVKAIEAPMDVTVEQDASVQEGKTLTLETLADDEIPIHNSVLINELKLSDFKQTLLR 690

Query: 645 KGIQVEFAGGALRCGE-YVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQF 703
             I  EF+GG L C    + +R+V             ++ +EG L E+YYKIR  LY Q+
Sbjct: 691 NNINSEFSGGVLWCSNGTLALRRVDAG----------KVAMEGCLSEEYYKIRELLYEQY 740

Query: 704 YLL 706
            ++
Sbjct: 741 AIV 743


>gi|242021798|ref|XP_002431330.1| Cleavage and polyadenylation specificity factor 100 kDa subunit,
           putative [Pediculus humanus corporis]
 gi|212516598|gb|EEB18592.1| Cleavage and polyadenylation specificity factor 100 kDa subunit,
           putative [Pediculus humanus corporis]
          Length = 731

 Score =  477 bits (1228), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 274/767 (35%), Positives = 427/767 (55%), Gaps = 97/767 (12%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + ++   +SG  +E+P  +++ +D F FL+DCGW++ FD   ++ L K    IDAV+L
Sbjct: 1   MTSIIKFQAISGAMDESPPCFILQVDEFRFLLDCGWDEKFDQEYMKELKKHVPLIDAVIL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPY + +  LS P+++T PVY++G + MYD Y SR  + EFDLFTLDD+D
Sbjct: 61  SHPDPLHLGALPYLVGKCSLSCPIYATIPVYKMGQMFMYDLYQSRYNMEEFDLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L Y+Q+  + GKG GI + P  AGH++GG++WKI K G ED+IYAVDYN +K
Sbjct: 121 AAFDKIIQLKYNQSIAMKGKGYGITITPLPAGHMIGGSIWKIFKVGEEDIIYAVDYNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG  LE   RP++LITDA+NA + Q  R+ R E     I +TLR+ GNVL+ VD+A
Sbjct: 181 ERHLNGCELEKIQRPSLLITDAFNATYQQQRRRVRDEKLMTNILQTLRSNGNVLVTVDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +LE  W       L Y + FL  VS +T+++ KS +EWM + + +SFE +R+
Sbjct: 241 GRVLELAHMLEQLWRNKESGLLAYSLAFLNNVSYNTVEFAKSQIEWMSEKLMRSFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  K+V L  + SEL   P  PK+VLAS   +E+GFS ++F++W+S+  N ++ T R 
Sbjct: 301 NPFQFKYVQLCHSFSELSKVP-SPKVVLASTPDMESGFSRELFLQWSSNPLNSIILTSRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +   + + + + +RV L GEEL  Y + +   +++E     +  + + +
Sbjct: 360 SPGTLARDLIENGGDRIISIEIKKRVKLEGEELEEYFKNEEERREQERENVDVSSDSDDE 419

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENN 475
             +   +    D +V D+          +PH          GF   +     MFPFYE+ 
Sbjct: 420 LEMIQVSKGRHDFLVKDS----------KPHS---------GFFKTNKKQNAMFPFYEHK 460

Query: 476 SEWDDFGEVINPDDYI----------IKDEDMDQ-----------------------AAM 502
            ++DD+GE+INPD Y           +KDE MD+                       A +
Sbjct: 461 VKFDDYGEIINPDFYKLEGEKEKMDDVKDEAMDEEERVEDQEVPTKCISYTKEIMIKAQI 520

Query: 503 HIGGDDGKLD-EGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQHCLKHVCPHVYTP 561
                +G+ D E    +I   +P ++      +L+ G+ E+T+ L     K     ++ P
Sbjct: 521 QFIDFEGRSDGESIQKIISQIRPRRL------ILIRGTGESTKSLVNIVSKSTDAKIFAP 574

Query: 562 QIE-ETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEV--------------- 605
           Q + E +D T++   Y+++L+++L+S++ F+K  + E+AW+DA+V               
Sbjct: 575 QKKSEVVDATTETYIYQIRLTDQLISSLYFQKGKEAEVAWLDAQVLTKNRSADARPSEEE 634

Query: 606 ------GKTENGMLSLLPISTPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCG 659
                  K E   L LLP+    P H++  + +LK++D K  L+   I  EF+GG LRC 
Sbjct: 635 MEIDEELKDEILTLDLLPVED-IPGHETSYINELKLSDFKQILNKNNINCEFSGGVLRCC 693

Query: 660 EYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                 +   AG         ++++EG L EDYYK++  L  Q+ ++
Sbjct: 694 HGSVAVRRHEAG---------RVILEGCLSEDYYKVKELLCQQYAIV 731


>gi|322783252|gb|EFZ10838.1| hypothetical protein SINV_80021 [Solenopsis invicta]
          Length = 737

 Score =  477 bits (1228), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 282/781 (36%), Positives = 426/781 (54%), Gaps = 119/781 (15%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  +SG  NE+P  Y++ +D    L+DCGW+++FD   ++ L +    IDAVLL
Sbjct: 1   MTSIIKLHAISGAMNESPPCYILQVDELRILLDCGWDENFDQDFIKELKRHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+PD LHLGALPY + + G++ P+++T PVY++G + MYD Y SR  + +FDLFTLDD+D
Sbjct: 61  SYPDPLHLGALPYLVGKCGMNCPIYATIPVYKMGQMFMYDIYQSRHNMEDFDLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L Y+Q+  + GKG G+ + P  AGH++GGT+WKI K G ED+IYAVD+N +K
Sbjct: 121 AAFDKIVQLKYNQSISMKGKGYGVTLTPLPAGHMIGGTIWKIVKVGEEDIIYAVDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG  LE   RP++LITDA+NA + Q  R+ R E     I +TLR GGNVL+ VD+A
Sbjct: 181 ERHLNGCELERLQRPSLLITDAFNATYQQARRRTRDEKLMTNILQTLRGGGNVLVSVDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W       L Y +  L  VS + +++ KS +EWM D + +SFE +R+
Sbjct: 241 GRVLELAHMLDQLWRNKESGLLAYSLALLNNVSYNVVEFAKSQIEWMSDKLMRSFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L  + +EL+  P  PK+VLAS   +E GFS ++F++W S+ +N ++ T R 
Sbjct: 301 NPFQFKHLQLCHSMAELNQVP-SPKVVLASTPDMECGFSRELFLQWCSNTQNSIILTSRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L      + + + + RRV L G EL  Y+       K E LK   +K+E+ +
Sbjct: 360 SPGTLARDLVEKGGNRNITLDVKRRVKLEGIELEEYQ-------KREKLKQEQMKQEQME 412

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILID-----GFVPPSTSVAPMF 469
                            A+ ++ S D +E   GR + D+L+      GF   S    PMF
Sbjct: 413 T----------------ADVSSESEDEIEVGSGRGKHDLLVKQESKPGFFKQSKKQHPMF 456

Query: 470 PFYENNSEWDDFGEVINPDDYII----------------KDEDMD---QAAMHI------ 504
           PF E   + D++GE+I P+DY I                K E+ +   + AM I      
Sbjct: 457 PFVEEKIKIDEYGEIIKPEDYKIAETVPEIEDNKENVEMKQEETNYHPEVAMDIPTKCVQ 516

Query: 505 -----------------GGDDGKLDEGSASLILDAKPSKVVSNELTVLVHGSAEATEHLK 547
                            G  DG   E    ++   +P +V      VLV GS + TE L 
Sbjct: 517 VSRTMTVNAAVTYIDFEGRSDG---ESLQKILAQLRPRRV------VLVRGSPKDTEILA 567

Query: 548 QHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKK-LGDYEIAWVDAEV- 605
           Q   +     V+ P   ET+D T++   Y+V+L++ L+S + F K  GD E+AW+DA + 
Sbjct: 568 QQA-QSTGARVFVPGRGETLDATTETHIYQVRLTDALVSGLNFSKGKGDSEVAWIDAMIT 626

Query: 606 ------------GKTENGM--------LSLLPISTPAPPHKSVLVGDLKMADLKPFLSSK 645
                        ++EN +        L  LPI+   P H++  + +LK++D K  L+  
Sbjct: 627 ARDQICRDAIADTESENAIDESDKILTLEPLPINE-VPGHQTTFINELKLSDFKQVLNKS 685

Query: 646 GIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYL 705
            I  EF+GG L C       +   AG         ++++EG + EDYYK+R  LY Q+ +
Sbjct: 686 NIPSEFSGGVLWCCNNTIAVRRHEAG---------KVILEGCISEDYYKVRELLYEQYAI 736

Query: 706 L 706
           +
Sbjct: 737 V 737


>gi|383852782|ref|XP_003701904.1| PREDICTED: probable cleavage and polyadenylation specificity factor
           subunit 2-like [Megachile rotundata]
          Length = 737

 Score =  476 bits (1225), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 282/777 (36%), Positives = 429/777 (55%), Gaps = 111/777 (14%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  +SG  +E+P  Y++ +D    L+DCGW+++FD   ++ L +    IDAVLL
Sbjct: 1   MTSIIKLHAVSGAMDESPPCYILQVDELRILLDCGWDENFDQEFIKELKRHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+PD LHLGALPY + + GL+ P+++T PVY++G + MYD Y SR  + +FDLFTLDD+D
Sbjct: 61  SYPDPLHLGALPYLVGKCGLNCPIYATIPVYKMGQMFMYDMYQSRHNMEDFDLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L Y+Q+  + GKG G+ + P  AGH++GGT+WKI K G ED+IYAVD+N +K
Sbjct: 121 AAFDKIVQLKYNQSISMKGKGYGVTLTPLPAGHMIGGTIWKIVKVGEEDIIYAVDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG  LE   RP++LITDA+NA + Q  R+ R E     I +TLR GGNVL+ VD+A
Sbjct: 181 ERHLNGCELERLQRPSLLITDAFNATYQQARRRTRDEKLMTNILQTLRGGGNVLVSVDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W       L Y +  L  VS + +++ KS +EWM D + +SFE +R+
Sbjct: 241 GRVLELAHMLDQLWRNKESGLLAYSLALLNNVSYNVVEFAKSQIEWMSDKLMRSFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L  + +EL+  P  PK+VLAS   +E GFS ++F++W  + +N ++ T R 
Sbjct: 301 NPFQFKHLQLCHSMAELNQVP-SPKVVLASTPDMECGFSRELFLQWCGNSQNSIILTSRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L      + + + + RR+ L G EL  Y+       ++E LK   +K+E+ +
Sbjct: 360 SPGTLARDLVEKGGNRNITLEVKRRIKLEGLELEEYQ-------RKEKLKQEQLKQEQME 412

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILID-----GFVPPSTSVAPMF 469
                            A+ ++ S D +E  GGR + D+L+      GF   S    PMF
Sbjct: 413 T----------------ADVSSESEDEIEVGGGRGKHDLLVKQESKPGFFKQSKKQHPMF 456

Query: 470 PFYENNSEWDDFGEVINPDDYIIKD---------EDM----DQAAMH--IGGD------- 507
           PF E   + D++GE+I P+DY I +         E+M    + AA H  +  D       
Sbjct: 457 PFVEEKIKIDEYGEIIRPEDYKIAETMPEIDDNKENMETKQEDAAHHPEVATDIPTKCIQ 516

Query: 508 ----------------DGKLDEGSASLIL-DAKPSKVVSNELTVLVHGSAEATEHLKQHC 550
                           +G+ D  S   IL   +P +V      VLV GS + TE L Q  
Sbjct: 517 VTRTMTVNASVTYIDFEGRSDGESLQKILAQLRPRRV------VLVRGSPKDTEILAQQA 570

Query: 551 LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKK-LGDYEIAWVDA------ 603
            +     V+ P   ET+D T++   Y+V+L++ L+S + F K  GD E+AW+DA      
Sbjct: 571 -QSAGARVFIPGRGETLDATTETHIYQVRLTDALVSGLNFSKGKGDSEVAWIDAMITARD 629

Query: 604 -----EVGKTE--------NGMLSLLPIS-TPAPPHKSVLVGDLKMADLKPFLSSKGIQV 649
                 V  TE        + +L+L P+     P H++  + +LK++D K  L+   I  
Sbjct: 630 QVCRDAVADTEPDSTIDQSDKILTLEPLPLNEVPGHQTTFINELKLSDFKQILNKSNIPS 689

Query: 650 EFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
           EF+GG L C       +   AG         ++++EG + EDYYK+R  LY Q+ ++
Sbjct: 690 EFSGGVLWCCNNTIAVRRHEAG---------KVILEGCISEDYYKVRELLYEQYAIV 737


>gi|332028657|gb|EGI68691.1| Putative cleavage and polyadenylation specificity factor subunit 2
           [Acromyrmex echinatior]
          Length = 737

 Score =  474 bits (1220), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 282/777 (36%), Positives = 427/777 (54%), Gaps = 111/777 (14%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  +SG  NE+P  Y++ +D    L+DCGW+++FD   ++ L +    IDAVLL
Sbjct: 1   MTSIIKLHAISGAMNESPPCYILQVDELRILLDCGWDENFDQDFIKELKRHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+PD LHLGALPY + + GL+ P+++T PVY++G + MYD Y SR  + +FDLFTLDD+D
Sbjct: 61  SYPDPLHLGALPYLVGKCGLNCPIYATIPVYKMGQMFMYDIYQSRHNMEDFDLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGE-DVIYAVDYNRRK 179
           +AF  + +L Y+Q+  + GKG G+ + P  AGH++GGT+WKI K GE D+IYAVD+N +K
Sbjct: 121 AAFDKIVQLKYNQSISMKGKGYGVTLTPLPAGHMIGGTIWKIVKVGEEDIIYAVDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG  LE   RP++LITDA+NA + Q  R+ R E     I +TLR GGNVL+ VD+A
Sbjct: 181 ERHLNGCELERLQRPSLLITDAFNATYQQARRRTRDEKLMTNILQTLRGGGNVLVSVDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W       L Y +  L  VS + +++ KS +EWM D + +SFE +R+
Sbjct: 241 GRVLELAHMLDQLWRNKESGLLAYSLALLNNVSYNVVEFAKSQIEWMSDKLMRSFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L  + +EL+  P  PK+VLAS   +E GFS ++F++W S+ +N ++ T R 
Sbjct: 301 NPFQFKHLQLCHSMAELNQVP-SPKVVLASTPDMECGFSRELFLQWCSNPQNSIILTSRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L      + + + + RRV L G EL  Y+       K E LK   +K+E+ +
Sbjct: 360 SPGTLARDLVEKGGNRNITLEVKRRVKLEGIELEEYQ-------KREKLKQEQLKQEQME 412

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILID-----GFVPPSTSVAPMF 469
                            A+ ++ S D +E  G R + D+L+      GF   S    PMF
Sbjct: 413 T----------------ADVSSESEDEIEVGGSRGKHDLLVKQESKPGFFKQSKKQHPMF 456

Query: 470 PFYENNSEWDDFGEVINPDDYII-----------KDEDMDQ------------------- 499
           PF E   + D++GE+I P+DY I           ++ +M Q                   
Sbjct: 457 PFVEEKIKIDEYGEIIKPEDYKIAEIVPEVEDNKENVEMKQDEFNYHPEVAVDIPTKCVQ 516

Query: 500 --------AAMHIGGDDGKLDEGSASLIL-DAKPSKVVSNELTVLVHGSAEATEHLKQHC 550
                   AA+     +G+ D  S   IL   +P +V      VLV GS + TE L Q  
Sbjct: 517 VSRMMTVNAAVTYIDFEGRSDGESLQKILAQLRPRRV------VLVRGSPKDTEILAQQA 570

Query: 551 LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKK-LGDYEIAWVDAEV---- 605
            +     V+ P   ET+D T++   Y+V+L++ L+S + F K  GD E+AW+DA +    
Sbjct: 571 -QSTGARVFIPGRGETLDATTETHIYQVRLTDALVSGLNFSKGKGDSEVAWIDAMITARD 629

Query: 606 ---------GKTENG------MLSLLPIS-TPAPPHKSVLVGDLKMADLKPFLSSKGIQV 649
                     ++EN       +L+L P+     P H++  + +LK++D K  L+   I  
Sbjct: 630 QICRDAIADTESENAIDESDKILTLEPLPLNEVPGHQTTFINELKLSDFKQVLNKSNIPS 689

Query: 650 EFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
           EF+GG L C       +   AG         ++++EG + EDYYK+R  LY Q+ ++
Sbjct: 690 EFSGGVLWCCNNTIAVRRHEAG---------KVILEGCISEDYYKVRELLYEQYAIV 737


>gi|340713940|ref|XP_003395491.1| PREDICTED: probable cleavage and polyadenylation specificity factor
           subunit 2-like isoform 1 [Bombus terrestris]
 gi|340713942|ref|XP_003395492.1| PREDICTED: probable cleavage and polyadenylation specificity factor
           subunit 2-like isoform 2 [Bombus terrestris]
          Length = 737

 Score =  474 bits (1219), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 281/777 (36%), Positives = 424/777 (54%), Gaps = 111/777 (14%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  +SG  +E+P  Y++ +D    L+DCGW+++FD   ++ L +    IDAVLL
Sbjct: 1   MTSIIKLHAISGAMDESPPCYILQVDELRILLDCGWDENFDQEFIKELKRHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+PD LHLGALPY + + GL+ P+++T PVY++G + MYD Y SR  + +FDLFTLDD+D
Sbjct: 61  SYPDPLHLGALPYLVGKCGLNCPIYATIPVYKMGQMFMYDMYQSRHNMEDFDLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L Y+Q+  + GKG G+ + P  AGH++GGT+WKI K G ED+IYAVD+N +K
Sbjct: 121 AAFDKIVQLKYNQSISMKGKGYGVTLTPLPAGHMIGGTIWKIVKVGEEDIIYAVDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG  LE   RP++LITDA+NA + Q  R+ R E     I +TLR GGNVL+ VD+A
Sbjct: 181 ERHLNGCELERLQRPSLLITDAFNATYQQARRRTRDEKLMTNILQTLRGGGNVLVSVDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W       L Y +  L  VS + +++ KS +EWM D + +SFE +R+
Sbjct: 241 GRVLELAHMLDQLWRNKESGLLAYSLALLNNVSYNVVEFAKSQIEWMSDKLMRSFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L  + +EL+  P  PK+VLAS   +E GFS ++F++W  + +N ++ T R 
Sbjct: 301 NPFQFKHLQLCHSMAELNQVP-SPKVVLASTPDMECGFSRELFLQWCGNPQNSIILTSRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L      + + + + RR+ L G EL  Y+       ++E LK   +K+E+ +
Sbjct: 360 SPGTLARDLVEKGGNRNITLEVKRRIKLEGLELEEYQ-------RKEKLKQEQLKQEQME 412

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILID-----GFVPPSTSVAPMF 469
                            A+ ++ S D +E  GGR + D+L+      GF   S    PMF
Sbjct: 413 T----------------ADVSSESEDEIEVGGGRGKHDLLVKQESKPGFFKQSKKQHPMF 456

Query: 470 PFYENNSEWDDFGEVINPDDYII----------------KDEDMDQ-------------- 499
           PF E   + D++GE+I P+DY I                K ED                 
Sbjct: 457 PFLEEKIKIDEYGEIIRPEDYKIAETMPEVDDNKENLETKQEDTTHHPEIPTDIPTKCIQ 516

Query: 500 --AAMHIGGD------DGKLDEGSASLIL-DAKPSKVVSNELTVLVHGSAEATEHLKQHC 550
               M +         +G+ D  S   IL   +P +V      VLV GS + TE L Q  
Sbjct: 517 VTRTMTVNASVTYIDFEGRSDGESLQKILAQLRPRRV------VLVRGSQKDTEILAQQA 570

Query: 551 LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKK-LGDYEIAWVDA------ 603
            +     V+ P   ET+D T++   Y+V+L++ L+S + F K  GD E+AWVDA      
Sbjct: 571 -QSAGARVFIPGRGETLDATTETHIYQVRLTDALVSGLNFSKGKGDSEVAWVDAMITARD 629

Query: 604 -----EVGKTE--------NGMLSLLPIS-TPAPPHKSVLVGDLKMADLKPFLSSKGIQV 649
                 V  TE        + +L+L P+     P H++  + +LK++D K  L+   I  
Sbjct: 630 QICRDAVAGTESDDVIDQSDKILTLEPLPLNEVPGHQTTFINELKLSDFKQILNKSNIPS 689

Query: 650 EFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
           EF+GG L C       +   AG         ++++EG + EDYYK+R  LY Q+ ++
Sbjct: 690 EFSGGVLWCCNNTIAVRRHEAG---------KVILEGCISEDYYKVRELLYEQYAIV 737


>gi|354494117|ref|XP_003509185.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 [Cricetulus griseus]
          Length = 782

 Score =  474 bits (1219), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 276/812 (33%), Positives = 434/812 (53%), Gaps = 136/812 (16%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALP+A+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPFAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPSEKVTEIELRKRVKLEGKELEEYVEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
                    S +  +  ++ ++   D+ +P   + + D+++ G       F   +    P
Sbjct: 412 ---------SKEADIDSSDESDVEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYP 462

Query: 468 MFPFYENNSEWDDFGEVINPDDYII-----KDEDMDQAAMHIGGDDGKLDE--------- 513
           MFP  E   +WD++GE+I P+D+++      +E+ ++    +   D  +D+         
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKNKLESGLTNGDEPMDQDLSDVPTKC 522

Query: 514 --GSASLILDAKPS-------------KVVSNELT----VLVHGSAEATEHLKQHCL--- 551
              + S+ + A+ +             K + N++     ++VHG  EA++ L + C    
Sbjct: 523 VSATESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAECCRAFG 582

Query: 552 -KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVG 606
            K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D      V 
Sbjct: 583 GKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDMRVS 640

Query: 607 KTENGML-----------------------------------------------SLLPIS 619
           K + G++                                                ++P  
Sbjct: 641 KVDTGVILEEGELKDDGEDSEMQVDGPSDSSAIAQQKAMKSLFGDDDKELGEESEIIPTL 700

Query: 620 TPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKG 674
            P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+        
Sbjct: 701 EPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR-------- 752

Query: 675 GGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
             + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 --TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>gi|8393762|ref|NP_058552.1| cleavage and polyadenylation specificity factor subunit 2 [Mus
           musculus]
 gi|18202027|sp|O35218.1|CPSF2_MOUSE RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 2; AltName: Full=Cleavage and polyadenylation
           specificity factor 100 kDa subunit; Short=CPSF 100 kDa
           subunit
 gi|2331036|gb|AAB66830.1| cleavage and polyadenylation specificity factor [Mus musculus]
 gi|15489017|gb|AAH13628.1| Cleavage and polyadenylation specific factor 2 [Mus musculus]
 gi|148686924|gb|EDL18871.1| cleavage and polyadenylation specific factor 2 [Mus musculus]
          Length = 782

 Score =  474 bits (1219), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 277/812 (34%), Positives = 434/812 (53%), Gaps = 136/812 (16%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSVDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALP+A+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPFAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPTEKVTEIELRKRVKLEGKELEEYVEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
                    S +  +  ++ ++   DV +P   + + D+++ G       F   +    P
Sbjct: 412 ---------SKEADIDSSDESDVEEDVDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYP 462

Query: 468 MFPFYENNSEWDDFGEVINPDDYIIKD---EDMDQAAMHIGGDDG-------------KL 511
           MFP  E   +WD++GE+I P+D+++ +    + +++ +  G  +G             K 
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGEEPMDQDLSDVPTKC 522

Query: 512 DEGSASLILDAKPS-------------KVVSNELT----VLVHGSAEATEHLKQHCL--- 551
              + S+ + A+ +             K + N++     ++VHG  EA++ L + C    
Sbjct: 523 VSATESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAECCRAFG 582

Query: 552 -KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVG 606
            K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D      V 
Sbjct: 583 GKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDMRVS 640

Query: 607 KTENGML-----------------------------------------------SLLPIS 619
           K + G++                                                ++P  
Sbjct: 641 KVDTGVILEEGELKDDGEDSEMQVDAPSDSSAMAQQKAMKSLFGEDEKELGEETEIIPTL 700

Query: 620 TPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKG 674
            P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+        
Sbjct: 701 EPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR-------- 752

Query: 675 GGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
             + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 --TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>gi|350400562|ref|XP_003485880.1| PREDICTED: probable cleavage and polyadenylation specificity factor
           subunit 2-like [Bombus impatiens]
          Length = 737

 Score =  473 bits (1218), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 280/777 (36%), Positives = 424/777 (54%), Gaps = 111/777 (14%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  +SG  +E+P  Y++ +D    L+DCGW+++FD   ++ L +    IDAVLL
Sbjct: 1   MTSIIKLHAISGAMDESPPCYILQVDELRILLDCGWDENFDQEFIKELKRHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+PD LHLGALPY + + GL+ P+++T PVY++G + MYD Y SR  + +FDLFTLDD+D
Sbjct: 61  SYPDPLHLGALPYLVGKCGLNCPIYATIPVYKMGQMFMYDMYQSRHNMEDFDLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L Y+Q+  + GKG G+ + P  AGH++GGT+WKI K G ED+IYAVD+N +K
Sbjct: 121 AAFDKIVQLKYNQSISMKGKGYGVTLTPLPAGHMIGGTIWKIVKVGEEDIIYAVDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG  LE   RP++LITDA+NA + Q  R+ R E     I +TLR GGNVL+ VD+A
Sbjct: 181 ERHLNGCELERLQRPSLLITDAFNATYQQARRRTRDEKLMTNILQTLRGGGNVLVSVDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W       L Y +  L  VS + +++ KS +EWM D + +SFE +R+
Sbjct: 241 GRVLELAHMLDQLWRNKESGLLAYSLALLNNVSYNVVEFAKSQIEWMSDKLMRSFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L  + +EL+  P  PK+VLAS   +E GFS ++F++W  + +N ++ T R 
Sbjct: 301 NPFQFKHLQLCHSMAELNQVP-SPKVVLASTPDMECGFSRELFLQWCGNPQNSIILTSRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L      + + + + RR+ L G EL  Y+       ++E LK   +K+E+ +
Sbjct: 360 SPGTLARDLVEKGGNRNITLEVKRRIKLEGLELEEYQ-------RKEKLKQEQLKQEQME 412

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILID-----GFVPPSTSVAPMF 469
                            A+ ++ S D +E  GGR + D+L+      GF   S    PMF
Sbjct: 413 T----------------ADVSSESEDEIEVGGGRGKHDLLVKQESKPGFFKQSKKQHPMF 456

Query: 470 PFYENNSEWDDFGEVINPDDYII----------------KDEDMDQ-------------- 499
           PF E   + D++GE+I P+DY I                + ED                 
Sbjct: 457 PFLEEKIKIDEYGEIIRPEDYKIAETMPEVDDNKENLETRQEDTTHHPEIPTDIPTKCIQ 516

Query: 500 --AAMHIGGD------DGKLDEGSASLIL-DAKPSKVVSNELTVLVHGSAEATEHLKQHC 550
               M +         +G+ D  S   IL   +P +V      VLV GS + TE L Q  
Sbjct: 517 VTRTMTVNASVTYIDFEGRSDGESLQKILAQLRPRRV------VLVRGSQKDTEILAQQA 570

Query: 551 LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKK-LGDYEIAWVDA------ 603
            +     V+ P   ET+D T++   Y+V+L++ L+S + F K  GD E+AWVDA      
Sbjct: 571 -QSAGARVFIPGRGETLDATTETHIYQVRLTDALVSGLNFSKGKGDSEVAWVDAMITARD 629

Query: 604 -----EVGKTE--------NGMLSLLPIS-TPAPPHKSVLVGDLKMADLKPFLSSKGIQV 649
                 V  TE        + +L+L P+     P H++  + +LK++D K  L+   I  
Sbjct: 630 QICRDAVAGTESDDVIDQSDKILTLEPLPLNEVPGHQTTFINELKLSDFKQILNKSNIPS 689

Query: 650 EFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
           EF+GG L C       +   AG         ++++EG + EDYYK+R  LY Q+ ++
Sbjct: 690 EFSGGVLWCCNNTIAVRRHEAG---------KVILEGCISEDYYKVRELLYEQYAIV 737


>gi|380025109|ref|XP_003696322.1| PREDICTED: probable cleavage and polyadenylation specificity factor
           subunit 2-like [Apis florea]
          Length = 737

 Score =  473 bits (1216), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 281/777 (36%), Positives = 423/777 (54%), Gaps = 111/777 (14%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  +SG  +E+P  Y++ +D    L+DCGW+++FD   ++ L +    IDAVLL
Sbjct: 1   MTSIIKLHAVSGAMDESPPCYILQVDELRILLDCGWDENFDQEFIKELKRHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+PD LHLGALPY + + GL+ P+++T PVY++G + MYD Y SR  + +FDLFTLDD+D
Sbjct: 61  SYPDPLHLGALPYLVGKCGLNCPIYATIPVYKMGQMFMYDMYQSRHNMEDFDLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L Y+Q+  + GKG G+ + P  AGH++GGT+WKI K G ED+IYAVD+N +K
Sbjct: 121 AAFDKIVQLKYNQSISMKGKGYGVTLTPLPAGHMIGGTIWKIVKVGEEDIIYAVDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG  LE   RP++LITDA+NA + Q  R+ R E     I +TLR GGNVL+ VD+A
Sbjct: 181 ERHLNGCELERLQRPSLLITDAFNATYQQARRRTRDEKLMTNILQTLRGGGNVLVSVDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W       L Y +  L  VS + +++ KS +EWM D + +SFE +R+
Sbjct: 241 GRVLELAHMLDQLWRNKESGLLAYSLALLNNVSYNVVEFAKSQIEWMSDKLMRSFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L  + +EL+  P  PK+VLAS   +E GFS ++F++W  + +N ++ T R 
Sbjct: 301 NPFQFKHLQLCHSMAELNQVP-SPKVVLASTPDMECGFSRELFLQWCGNPQNSIILTSRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L      + + + + RR+ L G EL  Y+       ++E LK   +K+E+ +
Sbjct: 360 SPGTLARDLVEKGGNRNITLEVKRRIKLEGLELEEYQ-------RKEKLKQEQLKQEQME 412

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILID-----GFVPPSTSVAPMF 469
                            A+ ++ S D +E  GGR + D+L+      GF   S    PMF
Sbjct: 413 T----------------ADVSSESEDEIEVGGGRGKHDLLVKQESKPGFFKQSKKQHPMF 456

Query: 470 PFYENNSEWDDFGEVINPDDYII----------------KDEDMDQ-------------- 499
           PF E   + D++GE+I P+DY I                K ED                 
Sbjct: 457 PFVEEKIKIDEYGEIIRPEDYKIAETMPEVDDNKENLETKQEDTAHHPEIPTDIPTKCIQ 516

Query: 500 --AAMHIGGD------DGKLDEGSASLIL-DAKPSKVVSNELTVLVHGSAEATEHLKQHC 550
               M +         +G+ D  S   IL   +P +V      VLV GS   TE L Q  
Sbjct: 517 VTRTMTVNASVTYIDFEGRSDGESLQKILAQLRPRRV------VLVRGSQRDTEILAQQA 570

Query: 551 LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKK-LGDYEIAWVDA------ 603
            +     V+ P   ET+D T++   Y+V+L++ L+S + F K  GD E+AWVDA      
Sbjct: 571 -QSAGARVFIPGRGETLDATTETHIYQVRLTDALVSGLNFSKGKGDSEVAWVDAMITARD 629

Query: 604 -----EVGKTE--------NGMLSLLPIS-TPAPPHKSVLVGDLKMADLKPFLSSKGIQV 649
                 V  TE        + +L+L P+     P H++  + +LK++D K  L+   I  
Sbjct: 630 QICRDAVAGTEPNDAIDQSDKILTLEPLPLNEVPGHQTTFINELKLSDFKQILNKSNIPS 689

Query: 650 EFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
           EF+GG L C       +   AG         ++++EG + EDYYK+R  LY Q+ ++
Sbjct: 690 EFSGGVLWCCNNTIAVRRHEAG---------KVILEGCISEDYYKVRELLYEQYAIV 737


>gi|157822735|ref|NP_001100223.1| cleavage and polyadenylation specificity factor subunit 2 [Rattus
           norvegicus]
 gi|149025374|gb|EDL81741.1| cleavage and polyadenylation specific factor 2 (predicted) [Rattus
           norvegicus]
          Length = 782

 Score =  473 bits (1216), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 277/812 (34%), Positives = 434/812 (53%), Gaps = 136/812 (16%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSVDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALP+A+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPFAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPSEKVTEIELRKRVKLEGKELEEYVEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
                    S +  +  ++ ++   DV +P   + + D+++ G       F   +    P
Sbjct: 412 ---------SKEADIDSSDESDVEEDVDQPTAHKTKHDLMMKGEGSRKGSFFKQAKKSYP 462

Query: 468 MFPFYENNSEWDDFGEVINPDDYIIKD---EDMDQAAMHIGGDDG-------------KL 511
           MFP  E   +WD++GE+I P+D+++ +    + +++ +  G  +G             K 
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGEEPMDQDLSDVPTKC 522

Query: 512 DEGSASLILDAKPS-------------KVVSNELT----VLVHGSAEATEHLKQHCL--- 551
              + S+ + A+ +             K + N++     ++VHG  EA++ L + C    
Sbjct: 523 VSATESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAECCRAFG 582

Query: 552 -KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVG 606
            K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D      V 
Sbjct: 583 GKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDMRVS 640

Query: 607 KTENGML-----------------------------------------------SLLPIS 619
           K + G++                                                ++P  
Sbjct: 641 KVDTGVILEEGELKDDGEDSEMQVDAPSDSSVIAQQKAMKSLFGDDEKELGEESEVIPTL 700

Query: 620 TPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKG 674
            P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+        
Sbjct: 701 EPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR-------- 752

Query: 675 GGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
             + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 --TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>gi|28461235|ref|NP_787002.1| cleavage and polyadenylation specificity factor subunit 2 [Bos
           taurus]
 gi|426248504|ref|XP_004018003.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 [Ovis aries]
 gi|1706103|sp|Q10568.1|CPSF2_BOVIN RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 2; AltName: Full=Cleavage and polyadenylation
           specificity factor 100 kDa subunit; Short=CPSF 100 kDa
           subunit
 gi|599683|emb|CAA53535.1| Cleavage and Polyadenylation specificity factor (CPSF) 100kD
           subunit [Bos taurus]
 gi|296475169|tpg|DAA17284.1| TPA: cleavage and polyadenylation specificity factor subunit 2 [Bos
           taurus]
 gi|440892550|gb|ELR45701.1| Cleavage and polyadenylation specificity factor subunit 2 [Bos
           grunniens mutus]
          Length = 782

 Score =  472 bits (1215), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 283/818 (34%), Positives = 433/818 (52%), Gaps = 148/818 (18%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPSEKVTEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
                    S +  +  ++ ++A  D+ +P   + + D+++ G       F   +    P
Sbjct: 412 ---------SKEADIDSSDESDAEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYP 462

Query: 468 MFPFYENNSEWDDFGEVINPDDYIIK-------------------DEDMDQ--------- 499
           MFP  E   +WD++GE+I P+D+++                    DE MDQ         
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKC 522

Query: 500 ----AAMHIGGD------DGKLDEGSASLILDA-KPSKVVSNELTVLVHGSAEATEHLKQ 548
                ++ I         +G+ D  S   I++  KP ++      ++VHG  EA++ L +
Sbjct: 523 ISTTESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQL------IIVHGPPEASQDLAE 576

Query: 549 HCL----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA- 603
            C     K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D  
Sbjct: 577 CCRAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGV 634

Query: 604 ---EVGKTENGML----------------------------------------------- 613
               V K + G++                                               
Sbjct: 635 LDMRVSKVDTGVILEEGELKDDGEDSEMQVDAPSDSSVIAQQKAMKSLFGDDEKETGEES 694

Query: 614 SLLPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVG 668
            ++P   P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+  
Sbjct: 695 EIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR-- 752

Query: 669 PAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                   + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 --------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>gi|71894931|ref|NP_001026379.1| cleavage and polyadenylation specificity factor subunit 2 [Gallus
           gallus]
 gi|60098929|emb|CAH65295.1| hypothetical protein RCJMB04_15m16 [Gallus gallus]
          Length = 782

 Score =  471 bits (1212), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 283/817 (34%), Positives = 434/817 (53%), Gaps = 146/817 (17%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW+++F   ++  L K    +DAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDENFSMDIIDSLKKHVHQVDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ ++GL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKMGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L  + S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHSLSDLARVP-CPKVVLASQPDLECGFSRDLFIQWCQDSKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K + + + RRV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPSEKVIDIELRRRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEP--HGGRYRDILIDG-------FVPPSTSVA 466
                    S +  +  ++ ++A  D+ +P  H  ++ D+++ G       F   +    
Sbjct: 412 ---------SKEADIDSSDESDAEEDIDQPTVHKTKH-DLMMKGEGSRKGSFFKQAKKSY 461

Query: 467 PMFPFYENNSEWDDFGEVINPDDYIIK-------------------DEDMDQ-------- 499
           PMFP  E   +WD++GE+I P+D+++                    +E MDQ        
Sbjct: 462 PMFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGEEPMDQDLSDVPTK 521

Query: 500 -----AAMHIGGDDGKLD-EGSASLILDAKPSKVVSNELT----VLVHGSAEATEHLKQH 549
                 +M I      +D EG +    D    K + N++     V+VHG  EA++ L + 
Sbjct: 522 CISATESMEIKARVTYIDYEGRS----DGDSIKKIINQMKPRQLVIVHGPPEASQDLAEC 577

Query: 550 CL----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA-- 603
           C     K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D   
Sbjct: 578 CRAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVL 635

Query: 604 --EVGKTENGML-----------------------------------------------S 614
              V K + G++                                                
Sbjct: 636 DMRVSKVDTGVILEEGELREDEELEMQVDMPSSDSSVIAQQKAMKSLFGDDDKEMCEESE 695

Query: 615 LLPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGP 669
           ++P   P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+   
Sbjct: 696 IIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNMVAVRR--- 752

Query: 670 AGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                  + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 -------TETGRIGLEGCLCQDFYRIRELLYKQYAIV 782


>gi|307189918|gb|EFN74154.1| Probable cleavage and polyadenylation specificity factor subunit 2
           [Camponotus floridanus]
          Length = 737

 Score =  471 bits (1212), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 279/777 (35%), Positives = 426/777 (54%), Gaps = 111/777 (14%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  +SG  +E+P  Y++ +D    L+DCGW+++FD   ++ L +  + IDAVLL
Sbjct: 1   MTSIIKLHAVSGAMDESPPCYILQVDELRILLDCGWDENFDQDFIKELKRHVNQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+PD LHLGALPY + + GL+ P+++T PVY++G + MYD Y SR  + +FDLFTLDD+D
Sbjct: 61  SYPDPLHLGALPYLVGKCGLNCPIYATIPVYKMGQMFMYDMYQSRHNMEDFDLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L Y+Q+  + GKG G+ + P  AGH++GGT+WKI K G ED+IYAVD+N +K
Sbjct: 121 AAFDKIVQLKYNQSISMKGKGYGVTLTPLPAGHMIGGTIWKIVKVGEEDIIYAVDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG  LE   RP++LITDA+NA + Q  R+ R E     I +TLR GGNVL+ VD+A
Sbjct: 181 ERHLNGCELERLQRPSLLITDAFNATYQQARRRTRDEKLMTNILQTLRGGGNVLVSVDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W       L Y +  L  VS + +++ KS +EWM D + +SFE +R+
Sbjct: 241 GRVLELAHMLDQLWRNKESGLLAYSLALLNNVSYNVVEFAKSQIEWMSDKLMRSFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L  +  EL+  P  PK+VLAS   +E GFS ++F++W ++ +N ++ T R 
Sbjct: 301 NPFQFKHLQLCHSMVELNQVP-SPKVVLASTPDMECGFSRELFLQWCTNPQNSIIITSRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L      + + + + RRV L G EL  Y+       K E LK   +K+E+ +
Sbjct: 360 SPGTLARDLVEKGGNRNITLDVKRRVKLEGIELEEYQ-------KREKLKQEQMKQEQME 412

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILID-----GFVPPSTSVAPMF 469
                            A+ ++ S D +E  G R + D+L+      GF   S    PMF
Sbjct: 413 T----------------ADVSSESEDEIEVGGARGKHDLLVKQESKPGFFKQSKKQYPMF 456

Query: 470 PFYENNSEWDDFGEVINPDDYIIKDE-----------DMDQAAMH----IGGD------- 507
           PF E   + D++GE+I P+DY I +            +M Q   +    I  D       
Sbjct: 457 PFVEEKIKIDEYGEIIKPEDYKIAETAPEVEDNKENVEMKQEETNHHPEIAADIPTKCVQ 516

Query: 508 ----------------DGKLDEGSASLIL-DAKPSKVVSNELTVLVHGSAEATEHLKQHC 550
                           +G+ D  S   IL   +P +V      VLV GS + TE L Q  
Sbjct: 517 VSRTMTVNAAVTYIDFEGRSDGESLQKILAQLRPRRV------VLVRGSPKDTEILAQQA 570

Query: 551 LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKK-LGDYEIAWVDAEV---- 605
            +     V+ P   ET+D T++   Y+V+L++ L+S + F K  GD E+AW+DA +    
Sbjct: 571 -QSAGARVFIPGRGETLDATTETHIYQVRLTDALVSGLNFSKGKGDSEVAWIDAMITARD 629

Query: 606 ---------GKTENG------MLSLLPIS-TPAPPHKSVLVGDLKMADLKPFLSSKGIQV 649
                     ++EN       +L+L P+     P H++  + +LK++D K  L+   I  
Sbjct: 630 QICRDAVADTESENAINESDKILTLEPLPLNEVPGHQTTFINELKLSDFKQVLNKSNIPS 689

Query: 650 EFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
           EF+GG L C       +   AG         ++++EG + EDYYK+R  L+ Q+ ++
Sbjct: 690 EFSGGVLWCCNNTIAVRRHEAG---------KVILEGCISEDYYKVRELLFEQYAIV 737


>gi|291406601|ref|XP_002719640.1| PREDICTED: cleavage and polyadenylation specific factor 2
           [Oryctolagus cuniculus]
          Length = 782

 Score =  471 bits (1211), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 282/816 (34%), Positives = 431/816 (52%), Gaps = 144/816 (17%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPSEKVTEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
                    S +  +  ++ ++   D+ +P   + + D+++ G       F   +    P
Sbjct: 412 ---------SKEADIDSSDESDVEEDIDQPSAHKMKHDLMMKGEGSRKGSFFKQAKKSYP 462

Query: 468 MFPFYENNSEWDDFGEVINPDDYIIK-------------------DEDMDQ--------- 499
           MFP  E   +WD++GE+I P+D+++                    DE MDQ         
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKC 522

Query: 500 ----AAMHIGGDDGKLD-EGSASLILDAKPSKVVSNELT----VLVHGSAEATEHLKQHC 550
                ++ I      +D EG +    D    K + N++     ++VHG  EA++ L + C
Sbjct: 523 ISTTESIEIKARVTYIDYEGRS----DGDSIKKIINQMKPRQLIIVHGPPEASQDLAECC 578

Query: 551 L----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA--- 603
                K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D    
Sbjct: 579 RAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLD 636

Query: 604 -EVGKTENGML-----------------------------------------------SL 615
             V K + G++                                                +
Sbjct: 637 MRVSKVDTGVILEEGELKDDGEDAEMQVDAPSDSSVIAQQKAMKSLFGDDEKEAGEESEI 696

Query: 616 LPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPA 670
           +P   P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+    
Sbjct: 697 IPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR---- 752

Query: 671 GQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                 + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 ------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>gi|326920924|ref|XP_003206716.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like [Meleagris gallopavo]
          Length = 782

 Score =  471 bits (1211), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 282/817 (34%), Positives = 434/817 (53%), Gaps = 146/817 (17%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW+++F   ++  L K    +DAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDENFSMDIIDSLKKHVHQVDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ ++GL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKMGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L  + S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHSLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDSKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K + + + RRV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPSEKVIDIELRRRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEP--HGGRYRDILIDG-------FVPPSTSVA 466
                    S +  +  ++ ++A  D+ +P  H  ++ D+++ G       F   +    
Sbjct: 412 ---------SKEADIDSSDESDAEEDIDQPTVHKTKH-DLMMKGEGSRKGSFFKQAKKSY 461

Query: 467 PMFPFYENNSEWDDFGEVINPDDYIIK-------------------DEDMDQ-------- 499
           PMFP  E   +WD++GE+I P+D+++                    +E MDQ        
Sbjct: 462 PMFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGEEPMDQDLSDVPTK 521

Query: 500 -----AAMHIGGDDGKLD-EGSASLILDAKPSKVVSNELT----VLVHGSAEATEHLKQH 549
                 +M I      +D EG +    D    K + N++     ++VHG  EA++ L + 
Sbjct: 522 CISATESMEIKARVTYIDYEGRS----DGDSIKKIINQMKPRQLIIVHGPPEASQDLAEC 577

Query: 550 CL----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA-- 603
           C     K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D   
Sbjct: 578 CRAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVL 635

Query: 604 --EVGKTENGML-----------------------------------------------S 614
              V K + G++                                                
Sbjct: 636 DMRVSKVDTGVILEEGELREDEELEMQVDMPSSDSSVIAQQKAMKSLFGDDDKEMCEESE 695

Query: 615 LLPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGP 669
           ++P   P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+   
Sbjct: 696 IIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNMVAVRR--- 752

Query: 670 AGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                  + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 -------TETGRIGLEGCLCQDFYRIRELLYKQYAIV 782


>gi|170046825|ref|XP_001850949.1| cleavage and polyadenylation specificity factor subunit 2 [Culex
           quinquefasciatus]
 gi|167869453|gb|EDS32836.1| cleavage and polyadenylation specificity factor subunit 2 [Culex
           quinquefasciatus]
          Length = 747

 Score =  470 bits (1210), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 280/774 (36%), Positives = 428/774 (55%), Gaps = 95/774 (12%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  +SG  +E+P  Y++ +D   FL+DCGW++ FDP+ ++ L K   TIDAVLL
Sbjct: 1   MTSIIKLHAISGAMDESPPCYILQVDEVRFLLDCGWDEKFDPNFIKELKKYVHTIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+PD LHLGALPY + +LGL+ P+++T PVY++G + MYD Y+S   + +FDLFTLDD+D
Sbjct: 61  SYPDGLHLGALPYLVGKLGLNCPIYATIPVYKMGQMFMYDLYMSHYNMYDFDLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L Y+Q+  L GKG GI + P  AGHL+GGT+WK+ K G ED++YA D+N +K
Sbjct: 121 AAFDKIIQLKYNQSVSLKGKGYGITITPLPAGHLIGGTIWKVVKVGEEDIVYATDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG  LE   RP++LITDAYNA + Q  R+ R E F   I +TLR  GNVL+ VD+A
Sbjct: 181 ERHLNGCELEKLQRPSLLITDAYNARYQQARRRARDEKFMTNILQTLRNNGNVLVTVDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W       + Y +  L  VS + +++ KS +EWM D + KSFE +R+
Sbjct: 241 GRVLELAHMLDQLWKNKESGLMAYSLALLNNVSYNVVEFAKSQIEWMSDKLMKSFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L    ++L   P  PK+VLAS   +E+GFS ++FV+WA +V N ++ T R 
Sbjct: 301 NPFQFKHLRLCHTMADLAKVP-SPKVVLASSPDMESGFSRELFVQWAGNVNNSIIITCRS 359

Query: 356 QFGTLAR-MLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEES 414
             GTLAR ++      + +++ + RRV L G EL  Y   +      E    S++K +  
Sbjct: 360 SPGTLARDLIDNGGNGRKLELDVRRRVELEGAELDEYMRTEG-----EKHNRSVIKSDMD 414

Query: 415 KASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYEN 474
             S     +     ++   ++      VV P G  +      GF   S     MFPF+E 
Sbjct: 415 LDSSSDSEDELEMSVITGKHDI-----VVRPEGRSHT-----GFFKSSKKQYAMFPFHEE 464

Query: 475 NSEWDDFGEVINPDDYIIKDEDMDQAA-----MHIGGDDGKLDEGSASLILDAKPSKVVS 529
             ++D++GE+I  D+Y + D   D A        I  +D K ++     +LD KP+K ++
Sbjct: 465 KIKFDEYGEIIQADEYRMVDLGPDGAEDNKENHQIKPEDIKKEKMDDMTVLD-KPTKCIN 523

Query: 530 NELTVLVH---------------------------------GSAEATEHLKQHCLKHVCP 556
           +   V V+                                 GS++ T H+ +HC  ++  
Sbjct: 524 SRKLVEVNAQVQFIDFEGRSDGESMLKILSQLRPRRVVVVRGSSQNTSHISEHCQLNIGA 583

Query: 557 HVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEV----------- 605
            V++P   E ID T++   Y+V+L+E L+S + F+K  D E+AWVDA++           
Sbjct: 584 RVFSPNRGEIIDATTETHIYQVRLTEALVSQLEFQKGKDAEVAWVDAQIVIRNKQFTSDQ 643

Query: 606 -----------GKTENGMLSLLP-ISTPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAG 653
                       K++  +L+L P ++   P H SV + +LK+ D K  L    I  EF+G
Sbjct: 644 PMDVDQVEITEDKSDKQILTLDPLLNDQLPAHNSVFINELKLIDFKQVLMKANIASEFSG 703

Query: 654 GALRCGE-YVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
           G L C    + +R++           T ++ IEG L EDYY+IR  LY Q+ ++
Sbjct: 704 GVLWCSNGTLALRRI----------DTGKVTIEGCLSEDYYRIRELLYEQYAIV 747


>gi|73962293|ref|XP_537353.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 isoform 1 [Canis lupus familiaris]
          Length = 782

 Score =  470 bits (1210), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 282/816 (34%), Positives = 431/816 (52%), Gaps = 144/816 (17%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
                    S +  +  ++ ++   D+ +P   + + D+++ G       F   +    P
Sbjct: 412 ---------SKEADIDSSDESDVEEDIDQPSAHKMKHDLMMKGEGSRKGSFFKQAKKSYP 462

Query: 468 MFPFYENNSEWDDFGEVINPDDYIIK-------------------DEDMDQ--------- 499
           MFP  E   +WD++GE+I P+D+++                    DE MDQ         
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKC 522

Query: 500 ----AAMHIGGDDGKLD-EGSASLILDAKPSKVVSNELT----VLVHGSAEATEHLKQHC 550
                ++ I      +D EG +    D    K + N++     ++VHG  EA++ L + C
Sbjct: 523 ISTTESIEIKARVTYIDYEGRS----DGDSIKKIINQMKPRQLIIVHGPPEASQDLAECC 578

Query: 551 L----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA--- 603
                K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D    
Sbjct: 579 RAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLD 636

Query: 604 -EVGKTENGML-----------------------------------------------SL 615
             V K + G++                                                +
Sbjct: 637 MRVSKVDTGVILEEGELKDDGEDSEMQVDAPSDSSVIAQQKAMKSLFGDDEKETGEESEI 696

Query: 616 LPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPA 670
           +P   P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+    
Sbjct: 697 IPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR---- 752

Query: 671 GQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                 + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 ------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>gi|126282067|ref|XP_001365312.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 isoform 1 [Monodelphis domestica]
          Length = 782

 Score =  470 bits (1210), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 281/816 (34%), Positives = 431/816 (52%), Gaps = 144/816 (17%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    +DAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQVDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDSKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
                    S +  +  ++ ++   D+ +P   + + D+++ G       F   +    P
Sbjct: 412 ---------SKEADIDSSDESDVEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYP 462

Query: 468 MFPFYENNSEWDDFGEVINPDDYIIK-------------------DEDMDQ--------- 499
           MFP  E   +WD++GE+I P+D+++                    DE MDQ         
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKC 522

Query: 500 ----AAMHIGGDDGKLD-EGSASLILDAKPSKVVSNELT----VLVHGSAEATEHLKQHC 550
                ++ I      +D EG +    D    K + N++     ++VHG  EA++ L + C
Sbjct: 523 ISATESIEIKARVTYIDYEGRS----DGDSIKKIINQMKPRQLIIVHGPPEASQDLAECC 578

Query: 551 L----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA--- 603
                K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D    
Sbjct: 579 RAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLD 636

Query: 604 -EVGKTENGML-----------------------------------------------SL 615
             V K + G++                                                +
Sbjct: 637 MRVSKVDTGVILEEGELKDDGEDSEMQVDTPSDASVIAQQKAMKSLFGDDDKETGEESEI 696

Query: 616 LPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPA 670
           +P   P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+    
Sbjct: 697 IPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNLVAVRR---- 752

Query: 671 GQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                 + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 ------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>gi|395503674|ref|XP_003756188.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 [Sarcophilus harrisii]
          Length = 782

 Score =  470 bits (1209), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 281/816 (34%), Positives = 431/816 (52%), Gaps = 144/816 (17%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    +DAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQVDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDSKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
                    S +  +  ++ ++   D+ +P   + + D+++ G       F   +    P
Sbjct: 412 ---------SKEADIDSSDESDVEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYP 462

Query: 468 MFPFYENNSEWDDFGEVINPDDYIIK-------------------DEDMDQ--------- 499
           MFP  E   +WD++GE+I P+D+++                    DE MDQ         
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKC 522

Query: 500 ----AAMHIGGDDGKLD-EGSASLILDAKPSKVVSNELT----VLVHGSAEATEHLKQHC 550
                ++ I      +D EG +    D    K + N++     ++VHG  EA++ L + C
Sbjct: 523 ISATESIEIKARVTYIDYEGRS----DGDSIKKIINQMKPRQLIIVHGPPEASQDLAECC 578

Query: 551 L----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA--- 603
                K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D    
Sbjct: 579 RAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLD 636

Query: 604 -EVGKTENGML-----------------------------------------------SL 615
             V K + G++                                                +
Sbjct: 637 MRVSKVDTGVILEEGELKDDGEDSEMQVDAPSDASVIAQQKAMKSLFGDDDKETGEESEI 696

Query: 616 LPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPA 670
           +P   P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+    
Sbjct: 697 IPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNLVAVRR---- 752

Query: 671 GQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                 + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 ------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>gi|348553776|ref|XP_003462702.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like [Cavia porcellus]
          Length = 782

 Score =  470 bits (1209), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 282/816 (34%), Positives = 431/816 (52%), Gaps = 144/816 (17%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
                    S +  +  ++ ++   D+ +P   + + D+++ G       F   +    P
Sbjct: 412 ---------SKEADIDSSDESDVEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYP 462

Query: 468 MFPFYENNSEWDDFGEVINPDDYIIK-------------------DEDMDQ--------- 499
           MFP  E   +WD++GE+I P+D+++                    DE MDQ         
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKC 522

Query: 500 ----AAMHIGGDDGKLD-EGSASLILDAKPSKVVSNELT----VLVHGSAEATEHLKQHC 550
                ++ I      +D EG +    D    K + N++     ++VHG  EA++ L + C
Sbjct: 523 ISTTESIEIKARVTYIDYEGRS----DGDSIKKIINQMKPRQLIIVHGPPEASQDLAECC 578

Query: 551 L----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA--- 603
                K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D    
Sbjct: 579 RAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLD 636

Query: 604 -EVGKTENGML-----------------------------------------------SL 615
             V K + G++                                                +
Sbjct: 637 MRVSKVDTGVILEEGELKDDGEDSEMQVDAPSDSSVIAQQKAMKSLFGDDEKETGEESEI 696

Query: 616 LPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPA 670
           +P   P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+    
Sbjct: 697 IPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR---- 752

Query: 671 GQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                 + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 ------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>gi|344274144|ref|XP_003408878.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 [Loxodonta africana]
          Length = 782

 Score =  470 bits (1209), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 282/818 (34%), Positives = 432/818 (52%), Gaps = 148/818 (18%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
                    S +  +  ++ ++   D+ +P   + + D+++ G       F   +    P
Sbjct: 412 ---------SKEADIDSSDESDVEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYP 462

Query: 468 MFPFYENNSEWDDFGEVINPDDYIIK-------------------DEDMDQ--------- 499
           MFP  E   +WD++GE+I P+D+++                    DE MDQ         
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKC 522

Query: 500 ----AAMHIGGD------DGKLDEGSASLILDA-KPSKVVSNELTVLVHGSAEATEHLKQ 548
                ++ I         +G+ D  S   I++  KP ++      ++VHG  EA++ L +
Sbjct: 523 ISTTESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQL------IIVHGPPEASQDLAE 576

Query: 549 HCL----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA- 603
            C     K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D  
Sbjct: 577 CCRAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGV 634

Query: 604 ---EVGKTENGML----------------------------------------------- 613
               V K + G++                                               
Sbjct: 635 LDMRVSKVDTGVILEEGELKDDGEDSEMQVEASSDSSVIAQQKAMKSLFGDDEKETGEES 694

Query: 614 SLLPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVG 668
            ++P   P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+  
Sbjct: 695 EIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR-- 752

Query: 669 PAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                   + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 --------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>gi|431839217|gb|ELK01144.1| Cleavage and polyadenylation specificity factor subunit 2 [Pteropus
           alecto]
          Length = 782

 Score =  470 bits (1209), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 282/816 (34%), Positives = 431/816 (52%), Gaps = 144/816 (17%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
                    S +  +  ++ ++   D+ +P   + + D+++ G       F   +    P
Sbjct: 412 ---------SKEADIDSSDESDVEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYP 462

Query: 468 MFPFYENNSEWDDFGEVINPDDYIIK-------------------DEDMDQ--------- 499
           MFP  E   +WD++GE+I P+D+++                    DE MDQ         
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKC 522

Query: 500 ----AAMHIGGDDGKLD-EGSASLILDAKPSKVVSNELT----VLVHGSAEATEHLKQHC 550
                ++ I      +D EG +    D    K + N++     ++VHG  EA++ L + C
Sbjct: 523 ISMTESIEIKARVTYIDYEGRS----DGDSIKKIINQMKPRQLIIVHGPPEASQDLAECC 578

Query: 551 L----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA--- 603
                K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D    
Sbjct: 579 RAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLD 636

Query: 604 -EVGKTENGML-----------------------------------------------SL 615
             V K + G++                                                +
Sbjct: 637 MRVSKVDTGVILEEGELKDDGEDSEMQVDAPSDSSVIAQQKAMKSLFGDDEKETGEESEI 696

Query: 616 LPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPA 670
           +P   P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+    
Sbjct: 697 IPTLEPLPPNEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR---- 752

Query: 671 GQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                 + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 ------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>gi|149737455|ref|XP_001497134.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like isoform 1 [Equus caballus]
          Length = 782

 Score =  470 bits (1209), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 282/816 (34%), Positives = 431/816 (52%), Gaps = 144/816 (17%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
                    S +  +  ++ ++   D+ +P   + + D+++ G       F   +    P
Sbjct: 412 ---------SKEADIDSSDESDVEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYP 462

Query: 468 MFPFYENNSEWDDFGEVINPDDYIIK-------------------DEDMDQ--------- 499
           MFP  E   +WD++GE+I P+D+++                    DE MDQ         
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKC 522

Query: 500 ----AAMHIGGDDGKLD-EGSASLILDAKPSKVVSNELT----VLVHGSAEATEHLKQHC 550
                ++ I      +D EG +    D    K + N++     ++VHG  EA++ L + C
Sbjct: 523 ISTTESIEIKARVTYIDYEGRS----DGDSIKKIINQMKPRQLIIVHGPPEASQDLAECC 578

Query: 551 L----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA--- 603
                K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D    
Sbjct: 579 RAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLD 636

Query: 604 -EVGKTENGML-----------------------------------------------SL 615
             V K + G++                                                +
Sbjct: 637 MRVSKVDTGVILEEGELKDDGEDSEMQVDAPSDSSVLAQQKAMKSLFGDDEKDTGEESEI 696

Query: 616 LPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPA 670
           +P   P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+    
Sbjct: 697 IPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR---- 752

Query: 671 GQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                 + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 ------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>gi|449280731|gb|EMC87967.1| Cleavage and polyadenylation specificity factor subunit 2 [Columba
           livia]
          Length = 782

 Score =  470 bits (1209), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 283/817 (34%), Positives = 434/817 (53%), Gaps = 146/817 (17%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW+++F   ++  L K    +DAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDENFSMDIIDSLRKHVHQVDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ ++GL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKMGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLLRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L  + S+L   P  PK+VLAS   LE GFS D+F++W  D KN V+ T R 
Sbjct: 301 NPFQFRHLSLCHSLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDSKNSVILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K + + + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPSEKVIDIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEP--HGGRYRDILIDG-------FVPPSTSVA 466
                    S +  +  ++ ++A  D+ +P  H  ++ D+++ G       F   +    
Sbjct: 412 ---------SKEADIDSSDESDAEEDIDQPTVHKTKH-DLMMKGEGSRKGSFFKQAKKSY 461

Query: 467 PMFPFYENNSEWDDFGEVINPDDYIIK-------------------DEDMDQ-------- 499
           PMFP  E   +WD++GE+I P+D+++                    +E MDQ        
Sbjct: 462 PMFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGEEPMDQDLSDVPTK 521

Query: 500 -----AAMHIGGDDGKLD-EGSASLILDAKPSKVVSNELT----VLVHGSAEATEHLKQH 549
                 +M I      +D EG +    D    K + N++     V+VHG  EA++ L + 
Sbjct: 522 CISATESMEIKARVTYIDYEGRS----DGDSIKKIINQMKPRQLVIVHGPPEASQDLAEC 577

Query: 550 CL----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA-- 603
           C     K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D   
Sbjct: 578 CRAFGGKDI--KVYVPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVL 635

Query: 604 --EVGKTENGML-----------------------------------------------S 614
              V K + G++                                                
Sbjct: 636 DMRVSKVDTGVILEEGELREDEDTEMQVDMPSSDSSVIAQQKAMKSLFGDDDKEMCEESE 695

Query: 615 LLPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGP 669
           ++P   P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+   
Sbjct: 696 IIPTLEPLPPHEVIGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNLVAVRR--- 752

Query: 670 AGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                  + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 -------TETGRIGLEGCLCQDFYRIRDLLYKQYAIV 782


>gi|34101288|ref|NP_059133.1| cleavage and polyadenylation specificity factor subunit 2 [Homo
           sapiens]
 gi|114654441|ref|XP_001147277.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 isoform 3 [Pan troglodytes]
 gi|397525769|ref|XP_003832826.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 [Pan paniscus]
 gi|51338827|sp|Q9P2I0.2|CPSF2_HUMAN RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 2; AltName: Full=Cleavage and polyadenylation
           specificity factor 100 kDa subunit; Short=CPSF 100 kDa
           subunit
 gi|119601886|gb|EAW81480.1| cleavage and polyadenylation specific factor 2, 100kDa, isoform
           CRA_a [Homo sapiens]
 gi|119601888|gb|EAW81482.1| cleavage and polyadenylation specific factor 2, 100kDa, isoform
           CRA_a [Homo sapiens]
 gi|193786082|dbj|BAG50953.1| unnamed protein product [Homo sapiens]
 gi|410221574|gb|JAA08006.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
           troglodytes]
 gi|410221576|gb|JAA08007.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
           troglodytes]
 gi|410221578|gb|JAA08008.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
           troglodytes]
 gi|410252002|gb|JAA13968.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
           troglodytes]
 gi|410307320|gb|JAA32260.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
           troglodytes]
 gi|410307322|gb|JAA32261.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
           troglodytes]
 gi|410339303|gb|JAA38598.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
           troglodytes]
 gi|410339305|gb|JAA38599.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
           troglodytes]
 gi|410339307|gb|JAA38600.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
           troglodytes]
 gi|410339309|gb|JAA38601.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
           troglodytes]
 gi|410339311|gb|JAA38602.1| cleavage and polyadenylation specific factor 2, 100kDa [Pan
           troglodytes]
          Length = 782

 Score =  469 bits (1208), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 282/816 (34%), Positives = 431/816 (52%), Gaps = 144/816 (17%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
                    S +  +  ++ ++   D+ +P   + + D+++ G       F   +    P
Sbjct: 412 ---------SKEADIDSSDESDIEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYP 462

Query: 468 MFPFYENNSEWDDFGEVINPDDYIIK-------------------DEDMDQ--------- 499
           MFP  E   +WD++GE+I P+D+++                    DE MDQ         
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKC 522

Query: 500 ----AAMHIGGDDGKLD-EGSASLILDAKPSKVVSNELT----VLVHGSAEATEHLKQHC 550
                ++ I      +D EG +    D    K + N++     ++VHG  EA++ L + C
Sbjct: 523 ISTTESIEIKARVTYIDYEGRS----DGDSIKKIINQMKPRQLIIVHGPPEASQDLAECC 578

Query: 551 L----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA--- 603
                K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D    
Sbjct: 579 RAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLD 636

Query: 604 -EVGKTENGML-----------------------------------------------SL 615
             V K + G++                                                +
Sbjct: 637 MRVSKVDTGVILEEGELKDDGEDSEMQVEAPSDSSVIAQQKAMKSLFGDDEKETGEESEI 696

Query: 616 LPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPA 670
           +P   P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+    
Sbjct: 697 IPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR---- 752

Query: 671 GQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                 + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 ------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>gi|224051637|ref|XP_002200593.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 [Taeniopygia guttata]
          Length = 782

 Score =  469 bits (1208), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 282/817 (34%), Positives = 434/817 (53%), Gaps = 146/817 (17%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW+++F   ++  L K    +DAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDENFSMDIIDSLRKHVHQVDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ ++GL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKMGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L  + S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHSLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDSKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K + + + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPSEKVIDIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEP--HGGRYRDILIDG-------FVPPSTSVA 466
                    S +  +  ++ ++A  D+ +P  H  ++ D+++ G       F   +    
Sbjct: 412 ---------SKEADIDSSDESDAEEDIDQPTLHKTKH-DLMMKGEGSRKGSFFKQAKKSY 461

Query: 467 PMFPFYENNSEWDDFGEVINPDDYIIK-------------------DEDMDQ-------- 499
           PMFP  E   +WD++GE+I P+D+++                    +E MDQ        
Sbjct: 462 PMFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGEEPMDQDLSDVPTK 521

Query: 500 -----AAMHIGGDDGKLD-EGSASLILDAKPSKVVSNELT----VLVHGSAEATEHLKQH 549
                 +M I      +D EG +    D    K + N++     V+VHG  EA++ L + 
Sbjct: 522 CISATESMEIKARVTYIDYEGRS----DGDSIKKIINQMKPRQLVIVHGPPEASQDLAEC 577

Query: 550 CL----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA-- 603
           C     K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D   
Sbjct: 578 CRAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVL 635

Query: 604 --EVGKTENGML-----------------------------------------------S 614
              V K + G++                                                
Sbjct: 636 DMRVSKVDTGVILEEGELREDEDLEMQVDVPSSDSSVIAQQKAMKSLFGDDDKEMCEESE 695

Query: 615 LLPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGP 669
           ++P   P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+   
Sbjct: 696 IIPTLEPMPPHEVLGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNLVAVRR--- 752

Query: 670 AGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                  + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 -------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>gi|296215760|ref|XP_002754257.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 [Callithrix jacchus]
 gi|403298149|ref|XP_003939897.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 isoform 1 [Saimiri boliviensis boliviensis]
          Length = 782

 Score =  469 bits (1208), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 282/816 (34%), Positives = 431/816 (52%), Gaps = 144/816 (17%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
                    S +  +  ++ ++   D+ +P   + + D+++ G       F   +    P
Sbjct: 412 ---------SKEADIDSSDESDIEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYP 462

Query: 468 MFPFYENNSEWDDFGEVINPDDYIIK-------------------DEDMDQ--------- 499
           MFP  E   +WD++GE+I P+D+++                    DE MDQ         
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKC 522

Query: 500 ----AAMHIGGDDGKLD-EGSASLILDAKPSKVVSNELT----VLVHGSAEATEHLKQHC 550
                ++ I      +D EG +    D    K + N++     ++VHG  EA++ L + C
Sbjct: 523 ISTTESIEIKARVTYIDYEGRS----DGDSIKKIINQMKPRQLIIVHGPPEASQDLAECC 578

Query: 551 L----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA--- 603
                K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D    
Sbjct: 579 RAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLD 636

Query: 604 -EVGKTENGML-----------------------------------------------SL 615
             V K + G++                                                +
Sbjct: 637 MRVSKVDTGVILEEGELKDDGEDSEMQVDAPSDASVIAQQKAMKSLFGDDEKETGEESEI 696

Query: 616 LPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPA 670
           +P   P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+    
Sbjct: 697 IPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR---- 752

Query: 671 GQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                 + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 ------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>gi|383872268|ref|NP_001244509.1| cleavage and polyadenylation specificity factor subunit 2 [Macaca
           mulatta]
 gi|402876992|ref|XP_003902228.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 [Papio anubis]
 gi|355693514|gb|EHH28117.1| hypothetical protein EGK_18472 [Macaca mulatta]
 gi|355778801|gb|EHH63837.1| hypothetical protein EGM_16889 [Macaca fascicularis]
 gi|380783537|gb|AFE63644.1| cleavage and polyadenylation specificity factor subunit 2 [Macaca
           mulatta]
 gi|383412079|gb|AFH29253.1| cleavage and polyadenylation specificity factor subunit 2 [Macaca
           mulatta]
 gi|384942144|gb|AFI34677.1| cleavage and polyadenylation specificity factor subunit 2 [Macaca
           mulatta]
          Length = 782

 Score =  469 bits (1208), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 282/816 (34%), Positives = 431/816 (52%), Gaps = 144/816 (17%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
                    S +  +  ++ ++   D+ +P   + + D+++ G       F   +    P
Sbjct: 412 ---------SKEADIDSSDESDIEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYP 462

Query: 468 MFPFYENNSEWDDFGEVINPDDYIIK-------------------DEDMDQ--------- 499
           MFP  E   +WD++GE+I P+D+++                    DE MDQ         
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKC 522

Query: 500 ----AAMHIGGDDGKLD-EGSASLILDAKPSKVVSNELT----VLVHGSAEATEHLKQHC 550
                ++ I      +D EG +    D    K + N++     ++VHG  EA++ L + C
Sbjct: 523 ISTTESIEIKARVTYIDYEGRS----DGDSIKKIINQMKPRQLIIVHGPPEASQDLAECC 578

Query: 551 L----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA--- 603
                K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D    
Sbjct: 579 RAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLD 636

Query: 604 -EVGKTENGML-----------------------------------------------SL 615
             V K + G++                                                +
Sbjct: 637 MRVSKVDTGVILEEGELKDDGEDSEMQVDAPSDSSVIAQQKAMKSLFGDDEKETGEESEI 696

Query: 616 LPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPA 670
           +P   P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+    
Sbjct: 697 IPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR---- 752

Query: 671 GQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                 + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 ------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>gi|149531954|ref|XP_001507374.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like [Ornithorhynchus anatinus]
          Length = 782

 Score =  469 bits (1208), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 281/818 (34%), Positives = 432/818 (52%), Gaps = 148/818 (18%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    +DAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLKKHVHQVDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
                    S +  +  ++ ++   D+ +P   + + D+++ G       F   +    P
Sbjct: 412 ---------SKEADIDSSDESDVEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYP 462

Query: 468 MFPFYENNSEWDDFGEVINPDDYIIK-------------------DEDMDQ--------- 499
           MFP  E   +WD++GE+I P+D+++                    DE MDQ         
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQAAEEEKSKLESGLTNGDEPMDQDLSDVPTKC 522

Query: 500 ----AAMHIGGD------DGKLDEGSASLILDA-KPSKVVSNELTVLVHGSAEATEHLKQ 548
                ++ I         +G+ D  S   I++  KP ++      ++VHG  EA++ L +
Sbjct: 523 ISTTESLEIKARVTYIDYEGRSDGDSIKKIINQMKPRQL------IIVHGPPEASQDLAE 576

Query: 549 HCL----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA- 603
            C     K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D  
Sbjct: 577 CCRAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGV 634

Query: 604 ---EVGKTENGML----------------------------------------------- 613
               V K + G++                                               
Sbjct: 635 LDMRVSKVDTGVILEEGELKDDGEESEMQVDPPSDSSTLAQQKAMKSLFGDDDKETGEES 694

Query: 614 SLLPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVG 668
            ++P   P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+  
Sbjct: 695 EIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNLVAVRR-- 752

Query: 669 PAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                   + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 --------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>gi|158290938|ref|XP_312464.4| AGAP002474-PA [Anopheles gambiae str. PEST]
 gi|157018137|gb|EAA08192.4| AGAP002474-PA [Anopheles gambiae str. PEST]
          Length = 745

 Score =  469 bits (1206), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 278/777 (35%), Positives = 418/777 (53%), Gaps = 103/777 (13%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  +SG  +E+P  Y++ +D    L+DCGW++ FD   ++ + K   TIDAVLL
Sbjct: 1   MTSIIKMHAISGAMDESPPCYILQVDDVRILLDCGWDEKFDQGFIKEIKKYVHTIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+PD  HLGALPY + +LGL+ P+++T PVY++G + MYD ++S   + +FDLF+LDD+D
Sbjct: 61  SYPDGSHLGALPYLVGKLGLNCPIYATIPVYKMGQMFMYDMFMSHYNMHDFDLFSLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L Y+Q+  + GKG GI + P  AGHL+GGT+WKI K G ED++YA D+N +K
Sbjct: 121 AAFDKIVQLKYNQSVAMKGKGYGITITPLPAGHLIGGTIWKIVKVGEEDIVYATDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG  LE   RP++LITDAYNA + Q  R+ R E F   I +TLR  GNVL+ VD+A
Sbjct: 181 ERHLNGCELEKLQRPSLLITDAYNARYQQARRRARDEKFMTNILQTLRNNGNVLVTVDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W       + Y +  L   S + +++ KS +EWM D + KSFE +R+
Sbjct: 241 GRVLELAHMLDQLWKNKESGLMAYSLALLNNQSYNVVEFAKSQIEWMSDKLMKSFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L    ++L   P  PK+VLAS   LE+GFS ++F++WA +  N ++ T R 
Sbjct: 301 NPFTFKHLRLCHTMADLAKVP-SPKVVLASSPDLESGFSRELFIQWAPNASNSIIITSRS 359

Query: 356 QFGTLAR-MLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEES 414
             GTLAR +++     + +++ + RRV L G EL  Y   +         K  L +    
Sbjct: 360 SPGTLARDLIENGGNGRKIEMDIRRRVELEGAELEEYMRTEGEKLNRSIKKRDLDESSSD 419

Query: 415 KASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYEN 474
                  N ++G   +           VV P G  +      GF   S     MFPF+E 
Sbjct: 420 SDDELEMNVITGKHDI-----------VVRPEGRSHT-----GFFKSSKKNYAMFPFHEE 463

Query: 475 NSEWDDFGEVINPDDYIIKDEDMDQAAMHIGGDDGKLDEGSAS-----------LILDAK 523
             ++D++GE+I PDDY +    +D      GGDD K + G  +            +LD K
Sbjct: 464 KIKYDEYGEIIQPDDYRM----VDLGPETNGGDDNKENGGIKTEDIKKEKEDEVTVLD-K 518

Query: 524 PSKVVSNELTVLVH---------------------------------GSAEATEHLKQHC 550
           P+K V +   + V+                                 GS   T H+ +HC
Sbjct: 519 PTKCVQSRKPIEVNAQVQFIDFEGRSDGESLLKILSQLRPRRVVVVRGSPANTSHIAEHC 578

Query: 551 LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEV----- 605
            +++   V+TP   E ID T++   Y+V+L+E L+S + F+K  D E+AWVDA++     
Sbjct: 579 QQNIGARVFTPNRGEIIDATTETHIYQVRLTEALVSQLEFQKGKDAEVAWVDAQIVIRNK 638

Query: 606 --------------GKTENGMLSLLPISTP-APPHKSVLVGDLKMADLKPFLSSKGIQVE 650
                          K +  +L+L P++    PPH  V + +LK+ D K  L    I  E
Sbjct: 639 RIDTMEVDDVDTIDDKMDKQILTLEPLAQEDLPPHNPVFINELKLIDFKQILMKSNIASE 698

Query: 651 FAGGALRCGE-YVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
           F+GG L C    V +R+V           T ++ IEG + EDYYKIR  LY Q+ ++
Sbjct: 699 FSGGVLWCSNGTVALRRV----------DTGRVTIEGCISEDYYKIRELLYEQYAII 745


>gi|417404575|gb|JAA49034.1| Putative mrna cleavage and polyadenylation factor ii complex
           subunit cft2 cpsf subunit [Desmodus rotundus]
          Length = 782

 Score =  468 bits (1205), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 284/817 (34%), Positives = 431/817 (52%), Gaps = 146/817 (17%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALP+A+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPFAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCEDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+         KE +  
Sbjct: 360 TPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-SKEADID 418

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDG-------FVPPSTSVAPM 468
           +S              D ++     D    H  ++ D+++ G       F   +    PM
Sbjct: 419 SS--------------DESDVEEDTDQPSAHKAKH-DLMMKGEGSRKGSFFKQAKKSYPM 463

Query: 469 FPFYENNSEWDDFGEVINPDDYIIK-------------------DEDMDQ---------- 499
           FP  E   +WD++GE+I P+D+++                    DE MDQ          
Sbjct: 464 FPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKCI 523

Query: 500 ---AAMHIGGD------DGKLDEGSASLILDA-KPSKVVSNELTVLVHGSAEATEHLKQH 549
               ++ I         +G+ D  S   I++  KP ++      ++VHG  EA++ L + 
Sbjct: 524 SMTESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQL------IIVHGPPEASQDLAEC 577

Query: 550 CL----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA-- 603
           C     K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D   
Sbjct: 578 CRAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVL 635

Query: 604 --EVGKTENGML-----------------------------------------------S 614
              V K + G++                                                
Sbjct: 636 DMRVSKVDTGVILEEGELKDDGEDSEMQVDAPSDSSVIAQQKAMKSLFGDDEKETGEESE 695

Query: 615 LLPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGP 669
           ++P   P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+   
Sbjct: 696 IIPTLEPLPPNEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR--- 752

Query: 670 AGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                  + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 -------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>gi|351699560|gb|EHB02479.1| Cleavage and polyadenylation specificity factor subunit 2
           [Heterocephalus glaber]
          Length = 782

 Score =  468 bits (1204), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 277/812 (34%), Positives = 433/812 (53%), Gaps = 136/812 (16%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
                    S +  +  ++ ++   D+ +P   + + D+++ G       F   +    P
Sbjct: 412 ---------SKEVDIDSSDESDVEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYP 462

Query: 468 MFPFYENNSEWDDFGEVINPDDYII-----KDEDMDQAAMHIGGDDGKLDEG-------- 514
           MFP  E   +WD++GE+I P+D+++      +E+  +    +   D  +D+         
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPIDQDLSDVPTKC 522

Query: 515 ---SASLILDAKPS-------------KVVSNELT----VLVHGSAEATEHLKQHCL--- 551
              + S+ + A+ +             K + N++     ++VHG  EA++ L + C    
Sbjct: 523 ISTTESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAECCRAFG 582

Query: 552 -KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVG 606
            K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D      V 
Sbjct: 583 GKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDMRVS 640

Query: 607 KTENGML-----------------------------------------------SLLPIS 619
           K + G++                                                ++P  
Sbjct: 641 KVDTGVILEEGELKDDGEDSEMQVDAPSDSSVIAQQKAMKSLFGDDEKETGEESEIIPTL 700

Query: 620 TPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKG 674
            P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+        
Sbjct: 701 EPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR-------- 752

Query: 675 GGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
             + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 --TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>gi|456753050|gb|JAA74086.1| cleavage and polyadenylation specific factor 2, 100kDa [Sus scrofa]
          Length = 782

 Score =  468 bits (1204), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 281/816 (34%), Positives = 431/816 (52%), Gaps = 144/816 (17%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  G+VL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGSVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
                    S +  +  ++ ++   D+ +P   + + D+++ G       F   +    P
Sbjct: 412 ---------SKEADIDSSDESDVEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYP 462

Query: 468 MFPFYENNSEWDDFGEVINPDDYIIK-------------------DEDMDQ--------- 499
           MFP  E   +WD++GE+I P+D+++                    DE MDQ         
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKC 522

Query: 500 ----AAMHIGGDDGKLD-EGSASLILDAKPSKVVSNELT----VLVHGSAEATEHLKQHC 550
                ++ I      +D EG +    D    K + N++     ++VHG  EA++ L + C
Sbjct: 523 ISTTESIEIKARVTYIDYEGRS----DGDSIKKIINQMKPRQLIIVHGPPEASQDLAECC 578

Query: 551 L----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA--- 603
                K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D    
Sbjct: 579 RAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLD 636

Query: 604 -EVGKTENGML-----------------------------------------------SL 615
             V K + G++                                                +
Sbjct: 637 MRVSKVDTGVILEEGELKDDGEDAEMQVDAPSDSSVIAQQKAMKSLFGDDEKETGEESEI 696

Query: 616 LPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPA 670
           +P   P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+    
Sbjct: 697 IPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR---- 752

Query: 671 GQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                 + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 ------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>gi|340370496|ref|XP_003383782.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like [Amphimedon queenslandica]
          Length = 730

 Score =  467 bits (1202), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 281/757 (37%), Positives = 411/757 (54%), Gaps = 78/757 (10%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + ++ T LSG   E P  YL+ +D F FL+DCGW++ F P + + + K    IDAVLL
Sbjct: 1   MTSIIKFTALSGAKGEGPPCYLLQVDEFCFLLDCGWDEFFSPEIAENIKKHIHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD +HLGALPY + +LGL  PV++T PVY++G + MYD Y +R    EFDLF+LDD+D
Sbjct: 61  SHPDVVHLGALPYVVGRLGLRCPVYATIPVYKMGQMFMYDLYQARHNSEEFDLFSLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
            +F  V ++ YSQ   L GKG G+ + P+ AGH++GGT+WKI KDG E+++YAVDYN +K
Sbjct: 121 QSFDLVVQVKYSQTVQLKGKGHGLTITPYPAGHMVGGTIWKIVKDGEEEIVYAVDYNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSA 238
           E+HL+G V ++F RP +LITDAYNAL  Q  R++R+    D I  TLR  GNVL+ VD+A
Sbjct: 181 ERHLDGAVFDNFSRPHLLITDAYNALSVQARRKERDKALLDKIVNTLRKNGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLN---YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W    L    Y I  L+ VS + +++ KS +EWM + + ++FE SR 
Sbjct: 241 GRVLELSQLLDQMWRHQELGFGAYSIVLLSNVSYNVVEFAKSQVEWMSEKLMRTFEDSRT 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H+ L  N  EL    + PK VL S   LE GFS D+F+ W+++  N ++FT + 
Sbjct: 301 NPFQFQHINLCHNLEELAKVSN-PKAVLVSPPDLECGFSRDLFLHWSNNPHNSIIFTSKT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
              TLAR L  +     + + + RRVPL G EL     E+  +K++E  KA    ++++K
Sbjct: 360 AHNTLARTLVDNLKIITIDMDVKRRVPLEGAEL-----EEYLMKEKE--KAKTANDDDAK 412

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPS-----TSVAPMFP 470
            S   D  +  +                 P   +Y  ++ D     S     T   PM+ 
Sbjct: 413 DSDESDEEMEVEGTTKPTTPTTPRCLSKTP---KYDLMMTDEGKAKSSFFKQTKSFPMYH 469

Query: 471 FYENNSEWDDFGEVINPDDYIIKDEDMDQAAMHIGGDDGKLDEGSASLILDAKPSKVVSN 530
           F     +WD++GE    +DY + D    +      G DG   E +  +     P+K VS 
Sbjct: 470 FKGEKIKWDEYGEPFRHEDYQLNDVFFKEDKEPEDGGDGVTKEVTKVI-----PTKCVSF 524

Query: 531 ELTV---------------------------------LVHGSAEATEHLK--QHCLKHVC 555
           + TV                                 L+HGS E+T+ L    H +  + 
Sbjct: 525 KKTVPVRSSLSFIDFEGRSDGDSIKRILTIMKPRQLILIHGSLESTKCLVDFSHSVLGMD 584

Query: 556 P-HVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLS 614
           P  V+ P + ETID T++   Y V+L++ LMS   F    D E+AWVD ++  + +G  S
Sbjct: 585 PKKVFAPAVGETIDATTESQLYIVKLTDALMSGTRFAPGKDAELAWVDGQIRLSSDGTDS 644

Query: 615 LLPI-----STPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGP 669
            +P+     +     HK+V +   +++D K  L+  GIQ EF GGAL C   V I++   
Sbjct: 645 -IPVLDVFHNKQVADHKNVFINPPRLSDFKNTLTKAGIQAEFCGGALICNGVVAIKRT-- 701

Query: 670 AGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
              +GG     +I IEG + +DYY IR  LY QF ++
Sbjct: 702 ---EGG-----KISIEGSVSDDYYLIRKLLYEQFAIV 730


>gi|387015290|gb|AFJ49764.1| Cleavage and polyadenylation specificity factor subunit 2-like
           [Crotalus adamanteus]
          Length = 783

 Score =  467 bits (1202), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 284/817 (34%), Positives = 435/817 (53%), Gaps = 145/817 (17%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW+++F   ++  L K    +DAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDENFSMDIIDSLRKHVHQVDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ ++GL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKMGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     I +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNILETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    ++L   P  PK+VLAS   L+ GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLADLARVP-SPKVVLASQPDLDCGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K + +   +RV L G+EL  Y E++         K +  K E+SK
Sbjct: 360 TPGTLARFLIDNPSEKVIDIEFRKRVKLEGKELEEYLEKEK------IKKEAAKKLEQSK 413

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
            +            +  ++ ++A  D+ +P   + + D+++ G       F   +    P
Sbjct: 414 EA-----------DIDSSDESDAEEDIDQPSVHKTKHDLMMKGEGNRKGSFFKQAKKSYP 462

Query: 468 MFPFYENNSEWDDFGEVINPDDYIIK-------------------DEDMDQ--------- 499
           MFP  E   +WD++GE+I P+D+++                    +E MDQ         
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEDEKNKLESGLTNGEEPMDQDLSDVPTKC 522

Query: 500 ----AAMHIGGDDGKLD-EGSASLILDAKPSKVVSNELT----VLVHGSAEATEHLKQHC 550
                +M I      +D EG +    D    K + N++     ++VHG  EA++ L + C
Sbjct: 523 ISAMESMEIKARVTYIDYEGRS----DGDSIKKIINQMKPRQLIIVHGPPEASQDLTESC 578

Query: 551 L----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA--- 603
                K +   VY P++ ETID TS+   Y+V+L + L+S++ F K  D E+AW+D    
Sbjct: 579 RAFGGKDI--KVYMPKLHETIDATSETHIYQVRLKDSLVSSLHFCKAKDAELAWIDGVLD 636

Query: 604 -EVGKTENGML------------------------------------------------S 614
             V K + G++                                                 
Sbjct: 637 MRVSKVDTGVILEEGELRDDGEDTEMQVDAPASDSSAMAQQKAIKSLFGDDDKEICEESE 696

Query: 615 LLPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGP 669
           ++P   P PP     H+SV + + +++D K  L  +G+Q EF GG L C   V +R+   
Sbjct: 697 IIPTLEPLPPNEVPGHQSVFMNEPRLSDFKQVLLREGVQAEFVGGVLVCNNLVAVRR--- 753

Query: 670 AGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                  + T +I +EG LCED+YKIR  LY Q+ ++
Sbjct: 754 -------TETGRIGLEGCLCEDFYKIRDLLYEQYAIV 783


>gi|312375001|gb|EFR22454.1| hypothetical protein AND_15244 [Anopheles darlingi]
          Length = 772

 Score =  467 bits (1201), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 281/800 (35%), Positives = 419/800 (52%), Gaps = 122/800 (15%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  +SG  +E+P  Y++ +D   FL+DCGW++ FD   ++ + K   TIDAVLL
Sbjct: 1   MTSIIKLHAVSGAMDESPPCYILQVDDVRFLLDCGWDEKFDQVFIKEIKKYVHTIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+PD  HLGALPY + +LGL+ P+++T PVY++G + MYD ++S   + +FDLF+LDD+D
Sbjct: 61  SYPDGSHLGALPYLVGKLGLNCPIYATIPVYKMGQMFMYDMFMSHYNMHDFDLFSLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGE-DVIYAVDYNRRK 179
           +AF  + +L Y+Q+  + GKG GI + P  AGHL+GGT+WKI K GE D++YA D+N +K
Sbjct: 121 AAFDKIVQLKYNQSVAMKGKGYGITITPLPAGHLVGGTIWKIVKVGEEDIVYATDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG  LE   RP++LITDAYNA + Q  R+ R E F   I +TLR  GNVL+ VD+A
Sbjct: 181 ERHLNGCELEKLQRPSLLITDAYNARYQQARRRARDEKFMTNILQTLRNNGNVLVTVDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W       + Y +  LT VS + +++ KS +EWM D + KSFE +R+
Sbjct: 241 GRVLELAHMLDQLWKNKESGLMAYSLALLTNVSYNVVEFAKSQIEWMSDKLMKSFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L    ++L   P  PK+VLAS A +E+GFS ++F++WA    N ++ T R 
Sbjct: 301 NPFTFKHLRLCHTMADLAKVP-SPKVVLASSADMESGFSRELFIQWAPQATNSIIITNRS 359

Query: 356 QFGTLAR-MLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEES 414
             GTLAR ++      + +++ + RRV L G EL  Y   +         K  L +    
Sbjct: 360 SPGTLARDLIDNGGNGRKIEMDVRRRVELEGAELEEYMRTEGEKLNRSIKKRDLDESSSD 419

Query: 415 KASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYEN 474
                  N ++G   +           VV P G  +      GF   S     MFPF+E 
Sbjct: 420 SDDELEMNVITGKHDI-----------VVRPEGRSHT-----GFFKSSKKHYAMFPFHEE 463

Query: 475 NSEWDDFGEVINPDDYIIKD-----EDMDQAAMHIGGDDGKLDEGSASLILDAKPSKVVS 529
             ++D++GE+I P+DY + D        D     I  +D K ++     +LD KP+K V 
Sbjct: 464 KIKYDEYGEIIQPEDYRMVDLGPETNGDDNKENGIKTEDIKKEKDEDVTLLD-KPTKCVQ 522

Query: 530 NELTVLVH---------------------------------GSAEATEHLKQHCLKHVCP 556
           +  T+ VH                                 GSA  T H+ +HC +++  
Sbjct: 523 SRKTIEVHAQVQFIDFEGRSDGESLLKILSQLRPRRVIVVRGSAANTAHIAEHCQQNIGA 582

Query: 557 HVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEV----------- 605
            V+TP   E ID T++   Y+V+L+E L+S + F+K  D E+AWVDA++           
Sbjct: 583 RVFTPNRGEIIDATTETHIYQVRLTEALVSQLEFQKGKDAEVAWVDAQIVIRNKRIDTVA 642

Query: 606 -------------------------------------GKTENGMLSLLP-ISTPAPPHKS 627
                                                 K +  +L+L P +    PPH  
Sbjct: 643 EKDASGTGAALSANPVTGAASIATDSAMDVDEVDVLEDKLDKRILTLEPMVPEELPPHNP 702

Query: 628 VLVGDLKMADLKPFLSSKGIQVEFAGGALRCGE-YVTIRKVGPAGQKGGGSGTQQIVIEG 686
           V + +LK+ D K  L    I  EF+GG L C    V +R+V           T ++ IEG
Sbjct: 703 VFINELKLIDFKQVLMRSNITSEFSGGVLWCSNGTVALRRV----------DTGRVTIEG 752

Query: 687 PLCEDYYKIRAYLYSQFYLL 706
            + EDYYKIR  LY Q+ ++
Sbjct: 753 CISEDYYKIRELLYEQYAII 772


>gi|47125306|gb|AAH70095.1| Cleavage and polyadenylation specific factor 2, 100kDa [Homo
           sapiens]
          Length = 782

 Score =  466 bits (1199), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 281/818 (34%), Positives = 431/818 (52%), Gaps = 148/818 (18%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM   + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSGKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
                    S +  +  ++ ++   D+ +P   + + D+++ G       F   +    P
Sbjct: 412 ---------SKEADIDSSDESDIEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYP 462

Query: 468 MFPFYENNSEWDDFGEVINPDDYIIK-------------------DEDMDQ--------- 499
           MFP  E   +WD++GE+I P+D+++                    DE MDQ         
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKC 522

Query: 500 ----AAMHIGGD------DGKLDEGSASLILDA-KPSKVVSNELTVLVHGSAEATEHLKQ 548
                ++ I         +G+ D  S   I++  KP ++      ++VHG  EA++ L +
Sbjct: 523 ISTTESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQL------IIVHGPPEASQDLAE 576

Query: 549 HCL----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA- 603
            C     K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D  
Sbjct: 577 CCRAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGV 634

Query: 604 ---EVGKTENGML----------------------------------------------- 613
               V K + G++                                               
Sbjct: 635 LDMRVSKVDTGVILEEGELRDDGEDSEMQVEAPSDSSVIAQQKAMKSLFGDDEKETGEES 694

Query: 614 SLLPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVG 668
            ++P   P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+  
Sbjct: 695 EIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR-- 752

Query: 669 PAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                   + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 --------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>gi|327259138|ref|XP_003214395.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like [Anolis carolinensis]
          Length = 783

 Score =  465 bits (1197), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 281/819 (34%), Positives = 432/819 (52%), Gaps = 149/819 (18%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW+++F   ++  L K    +DAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDENFSMDIIDSLRKHVHQVDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ ++GL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKMGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   L+ GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLDCGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +   K + + + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNSSEKVIDMELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
                    S +  +  ++ ++A  D+ +P   + + D+++ G       F   +    P
Sbjct: 412 ---------SKEADIDSSDESDAEEDIDQPSVHKTKHDLMMKGEGNRKGSFFKQAKKAYP 462

Query: 468 MFPFYENNSEWDDFGEVINPDDYIIK-------------------DEDMDQ--------- 499
           MFP  E   +WD++GE+I P+D+++                    +E MDQ         
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKNKLESGLTNGEEPMDQDLSDVPTKC 522

Query: 500 ----AAMHIGGD------DGKLDEGSASLILDA-KPSKVVSNELTVLVHGSAEATEHLKQ 548
                +M I         +G+ D  S   I++  KP ++      V+VHG  EA++ L +
Sbjct: 523 VSTTESMEIKARVTYIDYEGRSDGDSIKKIINQMKPRQL------VIVHGPPEASQDLAE 576

Query: 549 HCL----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA- 603
            C     K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D  
Sbjct: 577 SCRAFGGKDI--KVYVPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGV 634

Query: 604 ---EVGKTENGML----------------------------------------------- 613
               V K + G++                                               
Sbjct: 635 LDMRVSKVDTGVILEEGELRDDGEDTEMQVETSSSETSTVAQQKAIKSLFGDDDKEICEE 694

Query: 614 -SLLPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKV 667
             ++P   P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+ 
Sbjct: 695 SEIIPTLEPLPPNEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNLVAVRR- 753

Query: 668 GPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                    + T +I +EG LCED+YKIR  LY Q+ ++
Sbjct: 754 ---------TETGRIGLEGCLCEDFYKIRDLLYEQYAIV 783


>gi|345480428|ref|XP_001601407.2| PREDICTED: probable cleavage and polyadenylation specificity factor
           subunit 2-like [Nasonia vitripennis]
          Length = 739

 Score =  465 bits (1197), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 276/779 (35%), Positives = 424/779 (54%), Gaps = 113/779 (14%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  +SG  +E+P  Y++ +D    L+DCGW++ FDP  ++ L +    IDAVLL
Sbjct: 1   MTSIIKLHAISGALDESPPCYILQVDELRILLDCGWDEKFDPDFIKELKRHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+PD LHLGALPY + + GLS P+++T PVY++G + MYD Y SR  + +F+LFTLDD+D
Sbjct: 61  SYPDPLHLGALPYLVGKCGLSCPIYATIPVYKMGQMFMYDIYQSRHNMEDFNLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L Y+Q+  + GKG G+ + P  AGH++GGT+WKI K G ED+IYAVD+N +K
Sbjct: 121 AAFDKIVQLKYNQSISMKGKGYGVTLTPLPAGHMIGGTIWKIVKVGEEDIIYAVDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG  LE   RP++LITD++NA + Q  R+ R E     I +TLR GGNVL+ VD+A
Sbjct: 181 ERHLNGCELERLQRPSLLITDSFNATYQQARRRTRDEKLMTNILQTLRGGGNVLVGVDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W       L Y +  L  VS + +++ KS +EWM D + KSFE +R+
Sbjct: 241 GRVLELAHMLDQLWRNKESGLLAYSLALLNNVSYNVVEFAKSQIEWMSDKLMKSFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L  + +EL+  P  PK+VLAS   +E GFS D+F++W S+ +N ++ T R 
Sbjct: 301 NPFQFKHLQLCHSMAELNQVP-SPKVVLASTPDMECGFSRDLFLQWCSNPQNSIIITSRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +   + + + + ++V L G EL  Y        K+E +K   +K+E+ +
Sbjct: 360 SPGTLARDLVENGGNRNITLEIKKKVRLEGAELEEY-------MKKEKVKQEQLKQEKME 412

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILID-----GFVPPSTSVAPMF 469
                            A+ ++ S D +E  G + + D+L+      GF   S    PMF
Sbjct: 413 T----------------ADVSSESEDEIEVGGAKGKHDLLVKQEHKPGFFKQSKKQHPMF 456

Query: 470 PFYENNSEWDDFGEVINPDDYII----------------KDEDMDQAAMHIGGD------ 507
           PF E   + D++GE+I P+DY I                K E+  Q       D      
Sbjct: 457 PFVEEKIKVDEYGEIIKPEDYKIAEVLPEAEDNKENIEVKQEEQVQHPAETMSDIPTKCV 516

Query: 508 -----------------DGKLDEGSASLIL-DAKPSKVVSNELTVLVHGSAEATEHLKQH 549
                            +G+ D  S   IL   +P ++      VLV GS + TE L   
Sbjct: 517 QTTRTIAVNASVTYIDFEGRSDGESLQKILAQLRPRRI------VLVRGSPKDTELLAAQ 570

Query: 550 CLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKK-LGDYEIAWVDA----- 603
             ++V   V+ P   ET+D T++   Y+V+L++ L+S + F +  GD E+AWVDA     
Sbjct: 571 A-RNVGARVFIPSRGETLDATTETHIYQVRLTDALVSGLNFSRGKGDSEVAWVDALITAR 629

Query: 604 ---------------EVGKTENGM-LSLLPISTPAPPHKSVLVGDLKMADLKPFLSSKGI 647
                           + +TE  + L  LP++     +++  + +LK++D K  L+   I
Sbjct: 630 DQVCRDVFMDNENEDLIDRTEKILTLEPLPLNEVIRVYQTTFINELKLSDFKQILTKANI 689

Query: 648 QVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
             EF+GG L C       +   AG         +I++EG L EDYY+++  LY Q+ ++
Sbjct: 690 PSEFSGGVLWCCNNTIAVRRHEAG---------KIIMEGCLSEDYYRVKELLYEQYAIV 739


>gi|321462132|gb|EFX73157.1| hypothetical protein DAPPUDRAFT_58164 [Daphnia pulex]
          Length = 735

 Score =  464 bits (1194), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 288/779 (36%), Positives = 418/779 (53%), Gaps = 117/779 (15%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + ++   LSG  +++P SYL+ +D F FL+DCGW++      +  L K  + IDAVLL
Sbjct: 1   MTSIIKFCALSGALDDSPHSYLLKVDDFTFLLDCGWDEKCSEGFIHELKKHVNKIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+PD LHLGALPYA+ +LGL+ PV++T PVY++G + MYD Y S+  + +FDLFTLDD+D
Sbjct: 61  SYPDQLHLGALPYAVGKLGLTCPVYATVPVYKMGQMFMYDWYQSKDNMEDFDLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           ++F  V +L YSQ+  L GKG+G+++ P  AGH+LGGTVWKI KDG ED+IYAVDYN +K
Sbjct: 121 NSFDKVVQLKYSQSVPLKGKGQGLIITPLPAGHMLGGTVWKIVKDGEEDIIYAVDYNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG  LE   RP++LITDAYN L+ QP R+ R E     I +TLR GGNVL+ VD+A
Sbjct: 181 ERHLNGCELEKIQRPSLLITDAYNTLYAQPRRRSRDEKLMTNILQTLRGGGNVLVAVDTA 240

Query: 239 GRVLELLLILEDYW--AEHSLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +LE  W   E  L  Y +  L  V+ +  ++ KS +EWM D + KSFE +R+
Sbjct: 241 GRVLELAHMLEQLWRNQESGLRAYSLALLNNVAYNVNEFAKSQIEWMSDKLMKSFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  K++ L     E+     G K+VL+S   LE GF+ D+F  W SD +N ++ T R 
Sbjct: 301 NPFGFKYLQLCHTLPEVLRIA-GSKVVLSSCPDLECGFARDLFALWCSDARNSIILTSRS 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTL + L      K+V + + +RV L G EL     E+ R K+ E             
Sbjct: 360 GQGTLGQRLHDQRNLKSVTLELKQRVKLEGAEL-----EEFRRKEREK------------ 402

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDG----------FVPPSTSV 465
                 N LSG   + D   A +S    E   GR+ DI++            F   S   
Sbjct: 403 ------NILSG-IKIKDQTAAESSESEDEVKKGRH-DIVVRSDDKTTGAVQHFFKSSKKH 454

Query: 466 APMFPFYENNSEWDDFGEVINPDDYIIKDED----------------------------- 496
             MFP++E+  ++D++GE+I P+DY+I + +                             
Sbjct: 455 PTMFPYFEDKIKFDEYGEIIRPEDYVIAESEDHEMADYSVEKPKWEEEPEAECPTKCIST 514

Query: 497 -----MDQAAMHI---GGDDGKLDEGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQ 548
                ++ + MHI   G  DG   E    LI   KP +      T++V GS+E+ + L+ 
Sbjct: 515 TTTLAINASIMHIDFEGRSDG---ESIIKLIESMKPKR------TIVVRGSSESCQALQN 565

Query: 549 HCLKHVCP--HVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA--- 603
            CL         +  +  ETID T +   Y+V+L + L+S++ F K  D E+AW+DA   
Sbjct: 566 LCLSTGSSDNKAFIARKGETIDATIESHIYQVRLKDSLLSSLSFGKAKDAEVAWIDARLT 625

Query: 604 ---------EVGKTENGML--SLLPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGI 647
                    ++   EN  L     P+  P  P     H++  + +LK++D K  L   GI
Sbjct: 626 YQVNLTDLRDLDDKENNSLRKEQAPLLEPLEPKDIPGHETSYINELKLSDFKQVLVRNGI 685

Query: 648 QVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
             EF GG L C         G    +   SG  ++ +EG + +DYY++R  LY Q+ ++
Sbjct: 686 SSEFIGGVLWCCN-------GNVALRRNESG--RVTLEGCISDDYYRVRELLYEQYAII 735


>gi|147901518|ref|NP_001081123.1| cleavage and polyadenylation specificity factor subunit 2 [Xenopus
           laevis]
 gi|18203567|sp|Q9W799.1|CPSF2_XENLA RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 2; AltName: Full=Cleavage and polyadenylation
           specificity factor 100 kDa subunit; Short=CPSF 100 kDa
           subunit
 gi|4927240|gb|AAD33061.1|AF139986_1 cleavage and polyadenylation specificity factor 100 kDa subunit
           [Xenopus laevis]
          Length = 783

 Score =  464 bits (1193), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 283/810 (34%), Positives = 429/810 (52%), Gaps = 131/810 (16%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T L G   E+ + YL+ +D F FL+DCGW+++F   ++  + K    +DAVLL
Sbjct: 1   MTSIIKLTTLVGAQEESAVCYLLQVDEFRFLLDCGWDENFSMDIIDSVKKYVHQVDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LF+LDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFSLFSLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
            AF  + +L Y+Q  HL GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 CAFDKIQQLKYNQIVHLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMINRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H+TL    S+L   P  PK+VLAS   LE GFS ++F++W  D KN V+ T R 
Sbjct: 301 NPFQFRHLTLCHGYSDLARVP-SPKVVLASQPDLECGFSRELFIQWCQDPKNSVILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L   P  + + + + +RV L G+EL  Y E++         K +  K E+SK
Sbjct: 360 TPGTLARFLIDHPSERIIDIELRKRVKLEGKELEEYVEKEK------LKKEAAKKLEQSK 413

Query: 416 ASLGPDNNLSGDPMVIDA-NNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYEN 474
            +    ++ S     ID   +  A  D++  + G  +      F   +    PMFP  E+
Sbjct: 414 EADLDSSDDSDVEEDIDQITSHKAKHDLMMKNEGSRK----GSFFKQAKKSYPMFPAPED 469

Query: 475 NSEWDDFGEVINPDDYIIK-------------------DEDMDQ-------------AAM 502
             +WD++GE+I P+D+++                    DE MDQ              +M
Sbjct: 470 RIKWDEYGEIIKPEDFLVPELQVTEDEKTKLESGLTNGDEPMDQDLSDVPTKCVSTTESM 529

Query: 503 HIGGDDGKLD-EGSASLILDAKPSKVVSNELT----VLVHGSAEATEHLKQHCL----KH 553
            I      +D EG +    D    K + N++     ++VHG  +AT+ L + C     K 
Sbjct: 530 EIKARVTYIDYEGRS----DGDSIKKIINQMKPRQLIIVHGPPDATQDLAEACRAFGGKD 585

Query: 554 VCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVGKTE 609
           +   VYTP++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D      V K +
Sbjct: 586 I--KVYTPKLHETVDATSETHIYQVRLKDSLVSSLKFCKAKDTELAWIDGVLDMRVSKVD 643

Query: 610 NGML----------------------------------------------------SLLP 617
            G++                                                    +L P
Sbjct: 644 TGVILEERELKDEGEDMEMQVDTQVMDASTIAQQKVIKSLFGDDDKEFSEESEIIPTLEP 703

Query: 618 I-STPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGG 676
           + S   P H+SV + + +++D K  L  +GI  EF GG L C   V +R+          
Sbjct: 704 LPSNEVPGHQSVFMNEPRLSDFKQVLLREGIHAEFVGGVLVCNNMVAVRR---------- 753

Query: 677 SGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
           + T +I +EG LCED++KIR  LY Q+ ++
Sbjct: 754 TETGRIGLEGCLCEDFFKIRELLYEQYAIV 783


>gi|195392300|ref|XP_002054797.1| GJ24636 [Drosophila virilis]
 gi|194152883|gb|EDW68317.1| GJ24636 [Drosophila virilis]
          Length = 693

 Score =  463 bits (1191), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 274/740 (37%), Positives = 419/740 (56%), Gaps = 81/740 (10%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  +SG  +E+P  Y++ ID    L+DCGW++ FDP+ ++ L +   T+DAVLL
Sbjct: 1   MTSIIKLHTISGAMDESPPCYILQIDDVRILLDCGWDEKFDPNFIKELKRQVHTLDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD  HLGALPY + +LGL+ P+++T PV+++G + MYD Y+S   + +FDLF+LDD+D
Sbjct: 61  SHPDAYHLGALPYLVGKLGLNCPIYATIPVFKMGQMFMYDLYMSHFNMYDFDLFSLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF+ +T+L Y+Q   L GKG GI + P  AGH++GGT+WKI K G ED+IYA+D+N +K
Sbjct: 121 TAFEKITQLKYNQTVSLKGKGYGISITPLSAGHMIGGTIWKIVKVGEEDIIYAIDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HL+G  L+   RP++LITDAYNA + Q  R+ R E     I +T+R  GNVL+ VD+A
Sbjct: 181 ERHLSGCELDRLQRPSLLITDAYNAQYQQARRRARDEKLMTNILQTVRNNGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W       + Y +  L  VS + I++ KS +EWM D +TK+FE +R+
Sbjct: 241 GRVLELAHMLDQLWKNKESGLMAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L    +++   P GPK+VLAS   +E+GF+ D+FV+WAS+  N ++FT R 
Sbjct: 301 NPFQFKHIHLCHTLADIYKLPAGPKVVLASTPDMESGFTRDLFVQWASNPNNSIIFTTRT 360

Query: 356 QFGTLA-RMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVK---E 411
             G+L+  +++   P + +++ + RRV L G EL  Y   Q      E L   +VK   E
Sbjct: 361 GPGSLSMELVENSTPGRQIELDVRRRVELEGAELEEYLRTQG-----EKLNPLIVKPEVE 415

Query: 412 EESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPF 471
           EES +    D  +S    VI   +   +    EPH            V   T        
Sbjct: 416 EESSSESEDDIEMS----VITGKHDIVNVKKEEPH------------VEQQT-------- 451

Query: 472 YENNSEWDDFGEVINPDDYIIKDEDMDQAAMHIGGD-DGKLDEGSASLILDA-KPSKVVS 529
             N ++ +D   +  P   I + + ++  A     D +G+ D  S   IL   +P +V  
Sbjct: 452 --NGNQDNDVQMLEKPTKLISQRKTIEVNAQIQRIDFEGRSDGESMLKILSQLRPRRV-- 507

Query: 530 NELTVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVL 589
               ++VHG+AE T+ + +HC ++V   V+ PQ  E IDVT+++  Y+V+L+E L+S + 
Sbjct: 508 ----IVVHGTAEGTQVVAKHCEQNVGARVFAPQKGEIIDVTTEIHIYQVRLTEGLVSQLQ 563

Query: 590 FKKLGDYEIAWVDAEVG---------------------KTENGMLSLLPIST-PAPPHKS 627
           F+K  D E+AW+D  +G                       E   L+L  +     P H S
Sbjct: 564 FQKGKDAEVAWIDGRLGMRLQAIDAPNQSEVTVEQDVAAQEGKTLTLETLEEDEIPVHNS 623

Query: 628 VLVGDLKMADLKPFLSSKGIQVEFAGGALRCGE-YVTIRKVGPAGQKGGGSGTQQIVIEG 686
           VL+ +LK++D K  L    I  EF+GG L C    + +R+V             ++ +EG
Sbjct: 624 VLINELKLSDFKQVLMRNNINSEFSGGVLWCSNGTLALRRVDAG----------KVAMEG 673

Query: 687 PLCEDYYKIRAYLYSQFYLL 706
            L EDYYKIR  LY Q+ ++
Sbjct: 674 CLSEDYYKIRELLYEQYAIV 693


>gi|328780437|ref|XP_394940.3| PREDICTED: probable cleavage and polyadenylation specificity factor
           subunit 2 [Apis mellifera]
          Length = 730

 Score =  463 bits (1191), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 277/765 (36%), Positives = 415/765 (54%), Gaps = 111/765 (14%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  +SG  +E+P  Y++ +D    L+DCGW+++FD   ++ L +    IDAVLL
Sbjct: 1   MTSIIKLHAVSGAMDESPPCYILQVDELRILLDCGWDENFDQEFIRELKRHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+PD LHLGALPY + + GL+ P+++T PVY++G + MYD Y SR  + +FDLFTLDD+D
Sbjct: 61  SYPDPLHLGALPYLVGKCGLNCPIYATIPVYKMGQMFMYDMYQSRHNMEDFDLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L Y+Q+  + GKG G+ + P  AGH++GGT+WKI K G ED+IYAVD+N +K
Sbjct: 121 AAFDKIVQLKYNQSISMKGKGYGVTLTPLPAGHMIGGTIWKIVKVGEEDIIYAVDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG  LE   RP++LITDA+NA + Q  R+ R E     I +TLR GGNVL+ VD+A
Sbjct: 181 ERHLNGCELERLQRPSLLITDAFNATYQQARRRTRDEKLMTNILQTLRGGGNVLVSVDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W       L Y +  L  VS + +++ KS +EWM D + +SFE +R+
Sbjct: 241 GRVLELAHMLDQLWRNKESGLLAYSLALLNNVSYNVVEFAKSQIEWMSDKLMRSFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L  + +EL+  P  PK+VLAS   +E GFS ++F++W  + +N ++ T R 
Sbjct: 301 NPFQFKHLQLCHSMAELNQVP-SPKVVLASTPDMECGFSRELFLQWCGNPQNSIILTSRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L      + + + + RR+ L G EL  Y+       ++E LK   +K+E+ +
Sbjct: 360 SPGTLARDLVEKGGNRNITLEVKRRIKLEGLELEEYQ-------RKEKLKQEQLKQEQME 412

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILID-----GFVPPSTSVAPMF 469
                            A+ ++ S D +E  GGR + D+L+      GF   S    PMF
Sbjct: 413 T----------------ADVSSESEDEIEVGGGRGKHDLLVKQESKPGFFKQSKKQHPMF 456

Query: 470 PFYENNSEWDDFGEVINPDDYII----------------KDEDMDQ-------------- 499
           PF E   + D++GE+I P+DY I                K ED                 
Sbjct: 457 PFVEEKIKIDEYGEIIRPEDYKIAETMPEVDDNKENLETKQEDTAHHPEIPTDIPTKCIQ 516

Query: 500 --AAMHIGGD------DGKLDEGSASLIL-DAKPSKVVSNELTVLVHGSAEATEHLKQHC 550
               M +         +G+ D  S   IL   +P +V      VLV GS   TE L Q  
Sbjct: 517 VTRTMTVNASVTYIDFEGRSDGESLQKILAQLRPRRV------VLVRGSQRDTEILAQQA 570

Query: 551 LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKK-LGDYEIAWVDA------ 603
            +     V+ P   ET+D T++   Y+V+L++ L+S + F K  GD E+AWVDA      
Sbjct: 571 -QSAGARVFIPGRGETLDATTETHIYQVRLTDALVSGLNFSKGKGDSEVAWVDAMITARD 629

Query: 604 -----EVGKTENG--------MLSLLPIS-TPAPPHKSVLVGDLKMADLKPFLSSKGIQV 649
                 V  TE+         +L+L P+     P H++  + +LK++D K  L+   I  
Sbjct: 630 QICRDAVAGTESNDAIDQSDKILTLEPLPLNEVPGHQTTFINELKLSDFKQILNKSNIPS 689

Query: 650 EFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYK 694
           EF+GG L C       +   AG         ++++EG + EDYYK
Sbjct: 690 EFSGGVLWCCNNTIAVRRHEAG---------KVILEGCISEDYYK 725


>gi|187608214|ref|NP_001120452.1| cleavage and polyadenylation specific factor 2, 100kDa [Xenopus
           (Silurana) tropicalis]
 gi|170285004|gb|AAI61233.1| LOC100145546 protein [Xenopus (Silurana) tropicalis]
          Length = 783

 Score =  462 bits (1190), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 285/810 (35%), Positives = 430/810 (53%), Gaps = 131/810 (16%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T L+G   E+ + YL+ +D F FL+DCGW+++F   ++  + K    +DAVLL
Sbjct: 1   MTSIIKLTTLAGAQEESAVCYLLQVDEFRFLLDCGWDENFSMDIIDSVKKYVHQVDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  ++ST PVY++G + MYD Y SR    +F LF+LDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYSTIPVYKMGQMFMYDLYQSRHNTEDFSLFSLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
            AF  + +L YSQ  HL GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 CAFDKIQQLKYSQIVHLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD +NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMISRPSLLITDCFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H+TL    S+L   P  PK+VLAS   LE GFS ++F++W  D KN V+ T R 
Sbjct: 301 NPFQFRHLTLCHGFSDLARVP-SPKVVLASQPDLECGFSRELFIQWCQDPKNSVILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L   P  + + + + +RV L G+EL  Y E++         K +  K E+SK
Sbjct: 360 TPGTLARFLIDHPSERIIDIELRKRVKLEGKELEEYLEKEK------LKKEAAKKLEQSK 413

Query: 416 ASLGPDNNLSGDPMVIDANNAN-ASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYEN 474
            +    ++ S     ID   ++ A  D++  + G  +      F   +    PMFP  E 
Sbjct: 414 EADLDSSDDSDAEEDIDQTTSHKAKHDLMMKNEGSRK----GSFFKQAKKSYPMFPAPEE 469

Query: 475 NSEWDDFGEVINPDDYIIK-------------------DEDMDQ-------------AAM 502
             +WD++GE+I P+D+++                    +E MDQ              +M
Sbjct: 470 RIKWDEYGEIIKPEDFLVPELQATEDEKTKLESGLTNGEEPMDQDLSDVPTKCISATESM 529

Query: 503 HIGGDDGKLD-EGSASLILDAKPSKVVSNELT----VLVHGSAEATEHLKQHCL----KH 553
            I      +D EG +    D    K + N++     ++VHG  +AT+ L + C     K 
Sbjct: 530 EIKARVTYIDYEGRS----DGDSIKKIINQMKPRQLIIVHGPPDATQDLAEACRAFGGKD 585

Query: 554 VCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVGKTE 609
           +   VYTP++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D      V K +
Sbjct: 586 I--KVYTPKLHETVDATSETHIYQVRLKDSLVSSLKFCKAKDTELAWIDGVLDMRVSKVD 643

Query: 610 NGML----------------------------------------------------SLLP 617
            G++                                                    +L P
Sbjct: 644 TGVILEEGELKDEGEDSEMQVDTQALDASAIAQQKAIKSLFGDDDKEFSEESEIIPTLEP 703

Query: 618 I-STPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGG 676
           + S   P H+SV + + +++D K  L  +GIQ EF GG L C   V +R+          
Sbjct: 704 LPSNEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNMVAVRR---------- 753

Query: 677 SGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
           + T +I +EG LCED++KIR  LY Q+ ++
Sbjct: 754 TETGRIGLEGCLCEDFFKIRELLYEQYAIV 783


>gi|270010824|gb|EFA07272.1| hypothetical protein TcasGA2_TC014506 [Tribolium castaneum]
          Length = 733

 Score =  462 bits (1189), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 267/773 (34%), Positives = 422/773 (54%), Gaps = 107/773 (13%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  LSG  +E+P  Y++ +D    L+DCGW++HFD  +++ + +   TIDAVL+
Sbjct: 1   MTSIIKLQALSGAMDESPPCYILQVDEVRILLDCGWDEHFDMEIIKEMRRHVHTIDAVLI 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+PD  HLGALPY + +LGL+ P+++T PVY++G + MYD + S   + +FDLFTLDD+D
Sbjct: 61  SYPDVAHLGALPYLVGKLGLNCPIYATIPVYKMGQMFMYDLFQSHYNMEDFDLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           + F+ V +L Y+Q+  L GKG G+ + P  AGH++GGT+WKI K G ED+IYA D+N +K
Sbjct: 121 ATFEKVIQLKYNQSVPLKGKGYGLTITPLPAGHMIGGTIWKIMKVGEEDIIYANDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG  LE   RP++ ITDA+NA + Q  R+ R E     I +TLR  GNVL+ VD+A
Sbjct: 181 ERHLNGCELEKLQRPSLFITDAFNATYQQARRRARDEKLMTNILQTLRNNGNVLVAVDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W       L Y +  L+ VS + +++ KS +EWM D + +SFE +R+
Sbjct: 241 GRVLELAHMLDQLWRNKESGLLVYSLALLSNVSYNVVEFAKSQIEWMSDKLMRSFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L  +  EL      PK+VLAS   +E+GFS ++F++W S+  N ++ T R 
Sbjct: 301 NPFQFKHLQLCHSLHELQKV-SSPKVVLASSPDMESGFSRELFLQWCSNPNNSIIITTRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +   + + + + RRV L G EL  Y++ Q R K+EE        + +  
Sbjct: 360 SPGTLARDLVDNGGNRQIDLVVKRRVKLEGSELEEYQKSQ-REKREENSSRDEESDSDDD 418

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENN 475
             +           VI    +    D+V    G+       GF   +    P++PF+E  
Sbjct: 419 IEMS----------VI----SKGRHDIVIKQEGKTS----GGFFKVTKKQYPIYPFHEEK 460

Query: 476 SEWDDFGEVINPDDY----------------IIKDED---------------------MD 498
            + D++GE+I P+DY                +IK E+                     ++
Sbjct: 461 IKCDEYGEIIKPEDYKLADVVTETEDNKENVVIKKEEEVIPEVAETPSKCIVLSRTVQVN 520

Query: 499 QAAMHI---GGDDGKLDEGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQHCLKHVC 555
               +I   G  DG   E    ++   +P +V      ++V GS E+T  +K HC +++ 
Sbjct: 521 CQVQYIDFEGRSDG---ESLMKILSQLRPRRV------IIVRGSPESTNTIKNHCQENLD 571

Query: 556 PHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA------------ 603
             V+ P   E +D T++   Y+V+L++ L+S + F+K  D E+AW++A            
Sbjct: 572 ARVFAPVRGEVVDATTETHIYQVRLTDALVSQLNFQKAKDAEVAWLNAQIVVRESQLDAR 631

Query: 604 ---------EVGKTENGMLSLLPISTPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGG 654
                    EV + E+ +L+L P      PH +V + +LK+++ K  L+   I  EF+GG
Sbjct: 632 RMNVDNEPMEVDEEESKILTLEPYGDNI-PHDTVFINELKLSEFKQILAKSNINSEFSGG 690

Query: 655 ALRCGE-YVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
            L C    + IR+V           T ++++EG + EDYYK++  LY Q+ +L
Sbjct: 691 VLWCSNGTLAIRRV----------ETGRVILEGCISEDYYKVKELLYEQYAVL 733


>gi|414881945|tpg|DAA59076.1| TPA: hypothetical protein ZEAMMB73_548570 [Zea mays]
          Length = 309

 Score =  461 bits (1186), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 226/303 (74%), Positives = 264/303 (87%), Gaps = 1/303 (0%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQVTPLSG + E PL YL+++DGF FL+DCGW D  D S LQPL+KVA T+DAVLL
Sbjct: 1   MGTSVQVTPLSGAYGEGPLCYLLAVDGFRFLLDCGWTDLCDTSQLQPLAKVAPTVDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD +HLGALPYAMK LGLSAPV++TEPV+RLGLLTMYD +LSR QVS+FDLFTLDD+D
Sbjct: 61  SHPDMMHLGALPYAMKHLGLSAPVYATEPVFRLGLLTMYDHFLSRWQVSDFDLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           +AFQ+V RL YSQNY L+ KGEG+V+APHVAGHLLGGTVWKITKDGEDV+YAVD+N RKE
Sbjct: 121 AAFQNVVRLKYSQNYLLNDKGEGVVIAPHVAGHLLGGTVWKITKDGEDVVYAVDFNHRKE 180

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAG 239
           +HLNGTVL SFVRPAVLITDAYNAL+NQ  R++++  F +++ K L  GG+VLLPVD+AG
Sbjct: 181 RHLNGTVLGSFVRPAVLITDAYNALNNQGYRKKQDQDFIESLIKVLATGGSVLLPVDTAG 240

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
           RVLELLL+L+ YW E  L YPIYFLT VS+ST+DYVKSFLEWMGD I KSFE+SR NAFL
Sbjct: 241 RVLELLLLLDMYWDERRLQYPIYFLTNVSTSTVDYVKSFLEWMGDQIAKSFESSRANAFL 300

Query: 300 LKH 302
           LK+
Sbjct: 301 LKY 303


>gi|391325231|ref|XP_003737142.1| PREDICTED: probable cleavage and polyadenylation specificity factor
           subunit 2-like isoform 1 [Metaseiulus occidentalis]
          Length = 741

 Score =  461 bits (1185), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 277/778 (35%), Positives = 421/778 (54%), Gaps = 109/778 (14%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + V++  +SGV +E+P  YL+ ID F  L+D GW++ F+P  ++ LS++ S +D +LL
Sbjct: 1   MTSIVKIHAISGVHDESPHCYLLQIDEFKILLDLGWDEFFNPKPIRELSRLVSQVDVILL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+PD LHLGA P+   ++    PV++T PVY++G L MYD + S + + +F++F+LDD+D
Sbjct: 61  SYPDPLHLGAFPHLRHEI--KCPVYATVPVYKMGQLFMYDLHESHKSMEDFNIFSLDDVD 118

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
            AF  +T+L Y+Q     GKG+GI + P  AGH++GGTVW+ITKDG ED+IYAVDYN ++
Sbjct: 119 EAFDMITQLKYNQTLPFKGKGQGISITPLPAGHMIGGTVWRITKDGEEDIIYAVDYNHKR 178

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG  LES  RP++LITDA+NA + QP R+ R E     I +T+RAGGNVL+ VD+A
Sbjct: 179 ERHLNGCALESIQRPSLLITDAFNANYIQPRRRSRDEKLLTTIIQTMRAGGNVLIGVDTA 238

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +LE  W       + Y +   + V++  I++ KS +EWM D + +SFE +R 
Sbjct: 239 GRVLELAHMLEQLWRNQESGLMAYSLIMASNVAAHVIEFAKSQVEWMSDKVMRSFEGARS 298

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  K++    +  E+ +  + PK+VLASM  LE+G+  D+F+ WAS+ KN V+ T R 
Sbjct: 299 NPFQFKYLIPCHSHGEIQSVSE-PKVVLASMPDLESGYGRDLFMLWASNPKNSVILTSRS 357

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  D  PK V +T+ +RV L  +EL  +   + RLKKE+  K     E+ S 
Sbjct: 358 SPGTLARNL-VDNRPKFVHLTLKQRVALEADELEEHVRNE-RLKKEKETKI----EDSSD 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENN 475
            S   D  L+   +++ A+  +  +                 F  P+     MFP  E  
Sbjct: 412 ESDIEDEALAAAAVIVGASIEDRQS----------------FFQKPTKKSHLMFPLKEEK 455

Query: 476 SEWDDFGEVIN-----------PDDYIIKDEDMDQAAMHIG--GDDGKLDEGSASLILDA 522
            +WD++GE+IN           P D +       Q   H+    DD K ++ +    +  
Sbjct: 456 LKWDEYGEIINTDMFSNMGLNAPGDILEPSVLGQQQQQHVSDRKDDAKKEQVTEQAEI-- 513

Query: 523 KPSKVVSNELT---------------------------------VLVHGSAEA-TEHLKQ 548
            P+K ++ E+T                                 V+V G  EA T     
Sbjct: 514 -PTKCIAKEVTIQVNCSIDYIDFEGRSDGESIRQLVQMMKPKRLVIVRGGDEANTAAFYD 572

Query: 549 HCLKHVC---PHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAE- 604
           +C+   C     V+ P+  E +D T++   Y+V+L E L++ + F+K  + E+AW+DAE 
Sbjct: 573 YCVNSGCVQDNRVFAPKAHEVVDATTESHIYQVKLKESLLARLRFRKAKNAELAWLDAEI 632

Query: 605 ---------VGK----TENGMLSLLPI---STPAPPHKSVLVGDLKMADLKPFLSSKGIQ 648
                    VGK    T+  ++ L P+   +    PH  + + DLK++D K  L   GI 
Sbjct: 633 AEPEEDNDLVGKGDEETKEKLMVLQPLGDSNRVVAPHNPLFINDLKLSDFKQVLVKSGIS 692

Query: 649 VEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
            EF+GG L C       K    G         ++ +EG L +DY++IR  LY Q+ +L
Sbjct: 693 AEFSGGVLYCNNCSVAVKRNETG---------RLSVEGALTDDYFRIRELLYDQYAIL 741


>gi|332223568|ref|XP_003260944.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 isoform 1 [Nomascus leucogenys]
          Length = 782

 Score =  460 bits (1184), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 280/816 (34%), Positives = 427/816 (52%), Gaps = 144/816 (17%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
                    S +  +  ++ ++   D+ +P   + + D+++ G       F   +    P
Sbjct: 412 ---------SKEADIDSSDESDIEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYP 462

Query: 468 MFPFYENNSEWDDFGEVINPDDYIIK-------------------DEDMDQ--------- 499
           MFP  E   +WDD   +  P+D+++                    DE MDQ         
Sbjct: 463 MFPAPEERIKWDDRDLLFRPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKC 522

Query: 500 ----AAMHIGGDDGKLD-EGSASLILDAKPSKVVSNELT----VLVHGSAEATEHLKQHC 550
                ++ I      +D EG +    D    K + N++     ++VHG  EA++ L + C
Sbjct: 523 VSTTESIEIKARVTYIDYEGRS----DGDSIKKIINQMKPRQLIIVHGPPEASQDLAECC 578

Query: 551 L----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA--- 603
                K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D    
Sbjct: 579 RAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLD 636

Query: 604 -EVGKTENGML-----------------------------------------------SL 615
             V K + G++                                                +
Sbjct: 637 MRVSKVDTGVILEEGELKDDGEDSEMQVEAPSDSSVIAQQKAMKSLFGDDEKETGEESEI 696

Query: 616 LPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPA 670
           +P   P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+    
Sbjct: 697 IPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR---- 752

Query: 671 GQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                 + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 ------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>gi|308799055|ref|XP_003074308.1| polyadenylation cleavage/specificity factor 100 kDa subunit (ISS)
           [Ostreococcus tauri]
 gi|116000479|emb|CAL50159.1| polyadenylation cleavage/specificity factor 100 kDa subunit (ISS)
           [Ostreococcus tauri]
          Length = 807

 Score =  460 bits (1183), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 304/811 (37%), Positives = 427/811 (52%), Gaps = 131/811 (16%)

Query: 2   GTSVQVTPLSGV-------FNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVAST 54
           G  V VTPL GV         E  + Y VSIDG N L+DCGW D FD  +L+PL  +A  
Sbjct: 22  GNKVLVTPLYGVRGVDFDGAGERAMCYHVSIDGCNILLDCGWTDAFDVEMLKPLEAIAKD 81

Query: 55  IDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEF-DL 113
           +DAVL+SHPDT HLGALPYA  +LG++  V++T PV+++G + MYD +L+R+   +F + 
Sbjct: 82  VDAVLISHPDTAHLGALPYAFGKLGMNCKVYATLPVHKMGQMYMYDHFLTRQDQEDFQET 141

Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAV 173
           F+LDD+D AF +   + Y Q   L GKGEGI V  + AGH LGG +WKI KD ED+IYAV
Sbjct: 142 FSLDDVDKAFAAFVPVKYQQLSMLRGKGEGISVMAYAAGHTLGGAMWKIGKDAEDIIYAV 201

Query: 174 DYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQP----PRQQREMFQDAISKTLRAGG 229
           DYN RKE+HLNG   +S  RPA+LITDA +     P    PR  +    D I  +LR  G
Sbjct: 202 DYNVRKERHLNGATFDSIHRPALLITDASSVEREVPKSTVPRDTK--LVDTILSSLRMNG 259

Query: 230 NVLLPVDSAGRVLELLLILEDYWAEHSL-NYPIYFLTYVSSSTIDYVKSFLEWMGDSITK 288
           NVL+P+D AGRVLEL+L+LE+ W +  L +Y I  LT V+ +T+D+ KS LEWMGD +T 
Sbjct: 260 NVLIPIDPAGRVLELILLLEEKWQQRQLGSYQIVLLTNVAYNTLDFAKSHLEWMGDLVTS 319

Query: 289 SFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNL 348
           +FE  R+N F  K +T+     EL   P GPK+VLAS  SLEAG +  +F EWA D  NL
Sbjct: 320 AFERRRENPFNTKFITICHTMDELKALPPGPKVVLASFGSLEAGPARHLFAEWAGDKSNL 379

Query: 349 VLFTERGQFGTL----ARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEAL 404
           V+ T + + G+L     R+       K VK T+SRRVPL GEEL  +E  +   K ++  
Sbjct: 380 VVLTGQPEEGSLMEEVVRVSSKPAAKKNVKFTLSRRVPLEGEELATHESTRKADKSKKEE 439

Query: 405 KASL--VKEEESKASLGPDNNLSGDPM----VIDANNANASADVVEPHGGRYRDILIDGF 458
           +     V  EE    + P      +PM     +    + A AD+      R R+ L +GF
Sbjct: 440 EKKPEHVSVEEEMVDIKPVEPDEPEPMDVLFGVTTVGSTAEADL------RRRETLTEGF 493

Query: 459 VPPSTSVAPMFPFYENNSEWD----DFGEVINPDDYIIKDEDMDQAAMHIGGDDGK---- 510
            P  T   PMF     +  WD    D+G+ I+ + ++   +   QA+  +  +  K    
Sbjct: 494 TPIMTQHGPMFA----DEVWDPVMTDYGQEIDIELFMRTSQ---QASGRMVPELAKEPST 546

Query: 511 -LDEGSASLILDAK--------------PSKVVSNELTV--------------------- 534
             ++ S  +I + +              P+K+VS  + V                     
Sbjct: 547 MFEDPSVEMIEEQQLVEAAQEAEEDEEIPTKLVSEAVEVSVKATILTIDFEGKADGQSVR 606

Query: 535 ------------LVHGSAEATEHLK-QHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLS 581
                       LVHG+A+ T+ LK Q  L      +YTP   +T++ TS +  YK++LS
Sbjct: 607 TLIEQAAPRQIVLVHGNAKETKLLKDQLVLTLPGVDIYTPNAGKTVECTSSMATYKIRLS 666

Query: 582 EKLMSNVLFKKLGDYEIAWVDAEVGKT--ENGMLSLLPIST------------------- 620
           + L      + +  Y + WV+  VGK   E G   LLP+ST                   
Sbjct: 667 DALFQKAKMRDMSGYRVGWVNGIVGKALEEGGAPMLLPMSTLSTKADAGALVTTTSNEMA 726

Query: 621 ----PAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGE-YVTIRKVGPAGQKGG 675
                A    SV +GDL++ D +  L+ +GI  EF+GG L C +  VTIRK         
Sbjct: 727 IMKRAAAQPGSVFLGDLRLVDFRQALAQEGITAEFSGGVLVCADGRVTIRK--------- 777

Query: 676 GSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
               +++VIEG L +D+++IR  LYSQ+ +L
Sbjct: 778 -DSDEKLVIEGALSQDFFEIRQILYSQYQIL 807


>gi|391325233|ref|XP_003737143.1| PREDICTED: probable cleavage and polyadenylation specificity factor
           subunit 2-like isoform 2 [Metaseiulus occidentalis]
          Length = 745

 Score =  458 bits (1178), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 282/783 (36%), Positives = 422/783 (53%), Gaps = 115/783 (14%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + V++  +SGV +E+P  YL+ ID F  L+D GW++ F+P  ++ LS++ S +D +LL
Sbjct: 1   MTSIVKIHAISGVHDESPHCYLLQIDEFKILLDLGWDEFFNPKPIRELSRLVSQVDVILL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+PD LHLGA P+   ++    PV++T PVY++G L MYD + S + + +F++F+LDD+D
Sbjct: 61  SYPDPLHLGAFPHLRHEI--KCPVYATVPVYKMGQLFMYDLHESHKSMEDFNIFSLDDVD 118

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
            AF  +T+L Y+Q     GKG+GI + P  AGH++GGTVW+ITKDG ED+IYAVDYN ++
Sbjct: 119 EAFDMITQLKYNQTLPFKGKGQGISITPLPAGHMIGGTVWRITKDGEEDIIYAVDYNHKR 178

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG  LES  RP++LITDA+NA + QP R+ R E     I +T+RAGGNVL+ VD+A
Sbjct: 179 ERHLNGCALESIQRPSLLITDAFNANYIQPRRRSRDEKLLTTIIQTMRAGGNVLIGVDTA 238

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +LE  W       + Y +   + V++  I++ KS +EWM D + +SFE +R 
Sbjct: 239 GRVLELAHMLEQLWRNQESGLMAYSLIMASNVAAHVIEFAKSQVEWMSDKVMRSFEGARS 298

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  K++    +  E+ +  + PK+VLASM  LE+G+  D+F+ WAS+ KN V+ T R 
Sbjct: 299 NPFQFKYLIPCHSHGEIQSVSE-PKVVLASMPDLESGYGRDLFMLWASNPKNSVILTSRS 357

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALK------ASLV 409
             GTLAR L  D  PK V +T+ +RV L  +EL  +   + RLKKE+  K       S +
Sbjct: 358 SPGTLARNL-VDNRPKFVHLTLKQRVALEADELEEHVRNE-RLKKEKETKIEDSSDESDI 415

Query: 410 KEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMF 469
           ++E   A+  P   LSG           +S D+ E             F  P+     MF
Sbjct: 416 EDEALAAAARP--RLSG-----------SSGDLTERQS---------FFQKPTKKSHLMF 453

Query: 470 PFYENNSEWDDFGEVIN-----------PDDYIIKDEDMDQAAMHIGGDDGKLDEGSASL 518
           P  E   +WD++GE+IN           P D +       Q   H+   D K D     +
Sbjct: 454 PLKEEKLKWDEYGEIINTDMFSNMGLNAPGDILEPSVLGQQQQQHVS--DRKDDAKKEQV 511

Query: 519 ILDAK-PSKVVSNELT---------------------------------VLVHGSAEA-T 543
              A+ P+K ++ E+T                                 V+V G  EA T
Sbjct: 512 TEQAEIPTKCIAKEVTIQVNCSIDYIDFEGRSDGESIRQLVQMMKPKRLVIVRGGDEANT 571

Query: 544 EHLKQHCLKHVC---PHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAW 600
                +C+   C     V+ P+  E +D T++   Y+V+L E L++ + F+K  + E+AW
Sbjct: 572 AAFYDYCVNSGCVQDNRVFAPKAHEVVDATTESHIYQVKLKESLLARLRFRKAKNAELAW 631

Query: 601 VDAE----------VGK----TENGMLSLLPI---STPAPPHKSVLVGDLKMADLKPFLS 643
           +DAE          VGK    T+  ++ L P+   +    PH  + + DLK++D K  L 
Sbjct: 632 LDAEIAEPEEDNDLVGKGDEETKEKLMVLQPLGDSNRVVAPHNPLFINDLKLSDFKQVLV 691

Query: 644 SKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQF 703
             GI  EF+GG L C       K    G         ++ +EG L +DY++IR  LY Q+
Sbjct: 692 KSGISAEFSGGVLYCNNCSVAVKRNETG---------RLSVEGALTDDYFRIRELLYDQY 742

Query: 704 YLL 706
            +L
Sbjct: 743 AIL 745


>gi|432944969|ref|XP_004083472.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like [Oryzias latipes]
          Length = 787

 Score =  457 bits (1175), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 276/819 (33%), Positives = 427/819 (52%), Gaps = 145/819 (17%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T +SGV  E+ L YL+ +D F  L+DCGW++HF   ++  + +    +DAVLL
Sbjct: 1   MTSIIKLTAVSGVQEESALCYLLQVDEFRILLDCGWDEHFSMDIIDAMKRYVHQVDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD +HLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPIHLGALPYAVGKLGLNCTIYATIPVYKMGQMFMYDLYQSRNNSEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           SAF  + +L YSQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 SAFDKIQQLKYSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LES  RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCTLESINRPSLLITDSFNATYVQPRRKQRDEQLLTNVMETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSL---NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W         YP+  L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGTYPLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H+ L  + ++L   P  PK+VL S   LE+GFS ++F++W  + KN ++ T R 
Sbjct: 301 NPFQFRHLNLCHSLADLARVP-SPKVVLCSQPDLESGFSRELFIQWCQNSKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVG-----EELIAYEEEQTRLKKEEALKASLVK 410
             GTL R L   P  K + + + +RV L G            +++   K E+A +  +  
Sbjct: 360 TPGTLGRYLIDHPGEKMLDLEVRKRVKLEGKELEEYLEKEKIKKEAAKKLEQAKEVDVDS 419

Query: 411 EEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFP 470
            +ES      D      P+ +   + +    +++  G R        F   +    PMFP
Sbjct: 420 SDESDMEDDLDQ-----PVAVKTKHHDL---MMKSEGSRK-----GSFFKQAKKSYPMFP 466

Query: 471 FYENNSEWDDFGEVINPDDYIIK-------------------DEDMDQ------------ 499
            +E   +WD++GE+I  +D+++                    DE MDQ            
Sbjct: 467 THEERIKWDEYGEIIRLEDFLVPELQAAEDEKSKLDSGLTNGDEPMDQDLSVVPTKCISN 526

Query: 500 -------AAMHIGGDDGKLDEGSASLILDA-KPSKVVSNELTVLVHGSAEATEHLKQHCL 551
                  A +     +G+ D  S   I++  KP ++      V+VHG  EA++ L + C 
Sbjct: 527 MENLEIRARITYIDYEGRSDGDSIKKIINQMKPRQL------VIVHGPPEASQDLAESCK 580

Query: 552 ---KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----E 604
              K +   VYTP+++ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D      
Sbjct: 581 AFSKDI--KVYTPKLQETVDATSETHIYQVRLKDSLVSSLQFCKAKDTELAWIDGVLDMR 638

Query: 605 VGKTENGML--------------------------------------------------- 613
           V K + G++                                                   
Sbjct: 639 VVKVDTGVMLEDRVKEEEEDGEMPMETGQEVGIDHNATAVAAQRAMKNLFGEDEKEVSEE 698

Query: 614 -SLLPISTPAP-----PHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKV 667
             ++P   P P      H++V + + +++D K  L  +GIQ EF GG L C   V +R+ 
Sbjct: 699 SDVIPTLEPLPLTEIPGHQAVFINEPRLSDFKQVLLREGIQAEFVGGVLVCNNMVAVRRT 758

Query: 668 GPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
             AG+ G         +EG LC+DYYKIR  LY Q+ ++
Sbjct: 759 -EAGRIG---------LEGCLCDDYYKIRELLYQQYAVV 787


>gi|391325235|ref|XP_003737144.1| PREDICTED: probable cleavage and polyadenylation specificity factor
           subunit 2-like isoform 3 [Metaseiulus occidentalis]
          Length = 754

 Score =  456 bits (1174), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 279/781 (35%), Positives = 420/781 (53%), Gaps = 102/781 (13%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + V++  +SGV +E+P  YL+ ID F  L+D GW++ F+P  ++ LS++ S +D +LL
Sbjct: 1   MTSIVKIHAISGVHDESPHCYLLQIDEFKILLDLGWDEFFNPKPIRELSRLVSQVDVILL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+PD LHLGA P+   ++    PV++T PVY++G L MYD + S + + +F++F+LDD+D
Sbjct: 61  SYPDPLHLGAFPHLRHEI--KCPVYATVPVYKMGQLFMYDLHESHKSMEDFNIFSLDDVD 118

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
            AF  +T+L Y+Q     GKG+GI + P  AGH++GGTVW+ITKDG ED+IYAVDYN ++
Sbjct: 119 EAFDMITQLKYNQTLPFKGKGQGISITPLPAGHMIGGTVWRITKDGEEDIIYAVDYNHKR 178

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG  LES  RP++LITDA+NA + QP R+ R E     I +T+RAGGNVL+ VD+A
Sbjct: 179 ERHLNGCALESIQRPSLLITDAFNANYIQPRRRSRDEKLLTTIIQTMRAGGNVLIGVDTA 238

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +LE  W       + Y +   + V++  I++ KS +EWM D + +SFE +R 
Sbjct: 239 GRVLELAHMLEQLWRNQESGLMAYSLIMASNVAAHVIEFAKSQVEWMSDKVMRSFEGARS 298

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  K++    +  E+ +  + PK+VLASM  LE+G+  D+F+ WAS+ KN V+ T R 
Sbjct: 299 NPFQFKYLIPCHSHGEIQSVSE-PKVVLASMPDLESGYGRDLFMLWASNPKNSVILTSRS 357

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  D  PK V +T+ +RV L  +EL    EE  R ++       L KE+E+K
Sbjct: 358 SPGTLARNL-VDNRPKFVHLTLKQRVALEADEL----EEHVRNER-------LKKEKETK 405

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPH-GGRYRDILIDG--FVPPSTSVAPMFPFY 472
                D +   D  +  A   +       P   G   D+      F  P+     MFP  
Sbjct: 406 IEDSSDESDIEDEALAAAAQHHHQDHTKRPRLSGSSGDLTERQSFFQKPTKKSHLMFPLK 465

Query: 473 ENNSEWDDFGEVIN-----------PDDYIIKDEDMDQAAMHIG--GDDGKLDEGSASLI 519
           E   +WD++GE+IN           P D +       Q   H+    DD K ++ +    
Sbjct: 466 EEKLKWDEYGEIINTDMFSNMGLNAPGDILEPSVLGQQQQQHVSDRKDDAKKEQVTEQAE 525

Query: 520 LDAKPSKVVSNELT---------------------------------VLVHGSAEA-TEH 545
           +   P+K ++ E+T                                 V+V G  EA T  
Sbjct: 526 I---PTKCIAKEVTIQVNCSIDYIDFEGRSDGESIRQLVQMMKPKRLVIVRGGDEANTAA 582

Query: 546 LKQHCLKHVC---PHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVD 602
              +C+   C     V+ P+  E +D T++   Y+V+L E L++ + F+K  + E+AW+D
Sbjct: 583 FYDYCVNSGCVQDNRVFAPKAHEVVDATTESHIYQVKLKESLLARLRFRKAKNAELAWLD 642

Query: 603 AE----------VGK----TENGMLSLLPI---STPAPPHKSVLVGDLKMADLKPFLSSK 645
           AE          VGK    T+  ++ L P+   +    PH  + + DLK++D K  L   
Sbjct: 643 AEIAEPEEDNDLVGKGDEETKEKLMVLQPLGDSNRVVAPHNPLFINDLKLSDFKQVLVKS 702

Query: 646 GIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYL 705
           GI  EF+GG L C       K    G         ++ +EG L +DY++IR  LY Q+ +
Sbjct: 703 GISAEFSGGVLYCNNCSVAVKRNETG---------RLSVEGALTDDYFRIRELLYDQYAI 753

Query: 706 L 706
           L
Sbjct: 754 L 754


>gi|198452192|ref|XP_002137430.1| GA26549 [Drosophila pseudoobscura pseudoobscura]
 gi|198131825|gb|EDY67988.1| GA26549 [Drosophila pseudoobscura pseudoobscura]
          Length = 757

 Score =  455 bits (1171), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 269/781 (34%), Positives = 418/781 (53%), Gaps = 99/781 (12%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  +SG  +E+P  Y++ ID    L+DCGW++ FD + ++ L +   T+DAVLL
Sbjct: 1   MTSIIKLHTISGAMDESPPCYILQIDDVRILLDCGWDEKFDANFIKELKRQVHTLDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD  HLGALPY + +LGL+ P+++T PV+++G + MYD Y+S   + +FDLF+LDD+D
Sbjct: 61  SHPDAYHLGALPYLVGKLGLNCPIYATIPVFKMGQMFMYDLYMSHFNMGDFDLFSLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF+ +T+L Y+Q   L GKG GI + P  AGH++GGT+WKI K G ED++YA D+N +K
Sbjct: 121 TAFEKITQLKYNQTVSLKGKGYGISITPLNAGHMIGGTIWKIVKVGEEDIVYATDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HL+G  L+   RP++LITDAYNA + Q  R+ R E     I +T+R  GNVL+  D+A
Sbjct: 181 ERHLSGCELDRLQRPSLLITDAYNAQYQQARRRARDEKLMTNILQTVRNNGNVLIAADTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GR+LEL  +L+  W       + Y +  L  VS + +++ KS +EWM D +TK+FE +R+
Sbjct: 241 GRMLELAHMLDQLWKNKESGLMAYSLALLNNVSYNVVEFAKSQIEWMSDKLTKAFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L    +++   P GPK+VLAS   LE+GF+ D+F++WA +  N ++ T R 
Sbjct: 301 NPFQFKHIQLCHTLADVYKLPAGPKVVLASTPDLESGFTRDLFIQWAGNANNSIILTTRT 360

Query: 356 QFGTLA-RMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEES 414
             GTLA  +++   P + +++ + RRV L G EL  Y   +T+ +K   L A    EEES
Sbjct: 361 SPGTLAMELVENYAPGRQIELDVRRRVELEGAELEEY--LRTQGEKINPLIAKPEPEEES 418

Query: 415 KASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYEN 474
            +    D         I+ +      D+V    GR+      GF   +     MFP++E 
Sbjct: 419 SSESEDD---------IEMSVITGKHDIVVRPEGRHH----SGFFKSNKRHHVMFPYHEE 465

Query: 475 NSEWDDFGEVINPDDYIIKD-------------EDMDQAAMHIGGDDGKLDEGSASLILD 521
             ++D++GE+IN DDY I D             E++ +    IG +          + L 
Sbjct: 466 KIKYDEYGEIINLDDYRIADMNNTEFPPEEQNKENVKKEEPGIGIEQQANGAMDTDVQLL 525

Query: 522 AKPSKVVSNELT---------------------------------VLVHGSAEATEHLKQ 548
            KP+K+++   T                                 ++VHG+ E T+ + +
Sbjct: 526 EKPTKLINQRKTIEVNAQIQRIDFEGRSDGESMLKILSQLRPRRVIVVHGTEEGTQVVAK 585

Query: 549 HCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVG-- 606
           HC ++V   V+TPQ  E IDVT+++  Y+V+L+E L+S + F+K  D E+AWVD  +G  
Sbjct: 586 HCEQNVGARVFTPQKGEIIDVTTEIHIYQVRLTEGLVSQLQFQKGKDAEVAWVDGRLGMR 645

Query: 607 --------------------KTENGMLSLLPIST-PAPPHKSVLVGDLKMADLKPFLSSK 645
                                 E   L+L  +     P H SVL+ +LK++D K  L   
Sbjct: 646 LKAIDAPPTAMDVTVEQDAAMQEGKTLTLETLEEDEIPVHNSVLINELKLSDFKQILLRX 705

Query: 646 GIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYL 705
                                   AG         ++ +EG L E+YYKIR  LY Q+ +
Sbjct: 706 XXXXXXXXXXXXXXXXXXXXXXXDAG---------KVAMEGCLSEEYYKIRELLYEQYAI 756

Query: 706 L 706
           +
Sbjct: 757 V 757


>gi|145340766|ref|XP_001415490.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144575713|gb|ABO93782.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 715

 Score =  454 bits (1169), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 294/766 (38%), Positives = 411/766 (53%), Gaps = 129/766 (16%)

Query: 19  LSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQL 78
           + Y VSIDG N L+DCGWND FD  +L+PL+ +A  +DAVL+SHPDT HLGALPYA  +L
Sbjct: 1   MCYHVSIDGCNILLDCGWNDKFDVDMLKPLAAIAPKVDAVLISHPDTAHLGALPYAFGKL 60

Query: 79  GLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEF-DLFTLDDIDSAFQSVTRLTYSQNYHL 137
           G++  V++T PV+++G + MYD +L+R+   +F ++F+LDD+D+AF +   + Y Q   L
Sbjct: 61  GMNCKVYATLPVHKMGQMYMYDHFLTRQDQGDFQEVFSLDDVDTAFAAFVPVKYMQLSML 120

Query: 138 SGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVL 197
            GKG+GI V  + AGH LGG VWKI KD EDV+YAVDYN RKE+HLNGT  ++  RPA+L
Sbjct: 121 RGKGDGISVMAYAAGHTLGGAVWKIGKDAEDVVYAVDYNVRKERHLNGTSFDAIHRPALL 180

Query: 198 ITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS 256
           ITDA +     P +  R+    D+I  +LR  GNVL+P+D AGRVLEL+L+LE+ WA+  
Sbjct: 181 ITDASSVDREVPNKTTRDAKLIDSILSSLRMNGNVLIPIDPAGRVLELILLLEEKWAQRQ 240

Query: 257 L-NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNA 315
           L +Y I  LT V+ +T+D+ KS LEWMGD +T +FE  R+N F  K +TL  +  EL   
Sbjct: 241 LGSYQIVLLTNVAYNTLDFAKSHLEWMGDHVTNAFERRRENPFNTKFLTLCHSMEELQAL 300

Query: 316 PDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLA----RMLQADPPPK 371
           P GPK+VLAS  SLEAG S  +F EWA D  NLV+ T + + G+L     ++       K
Sbjct: 301 PPGPKVVLASFGSLEAGPSRHLFAEWAEDKSNLVILTGQPEHGSLTEQVVQLSAKATAKK 360

Query: 372 AVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVI 431
            +K+T+SRR+PL G EL  +E  +      E  K    KE E++A L             
Sbjct: 361 KIKLTLSRRIPLEGSELAEHESSRKSSTSTELEK----KESETEADL------------- 403

Query: 432 DANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVIN----- 486
                            R RD L +GF P ST   PMFP         D+G+ I+     
Sbjct: 404 -----------------RRRDTLTEGFTPISTPHGPMFPDEVWEPTMTDYGQEIDIETFH 446

Query: 487 ----------------------------------------PDDYIIKDEDMDQAAMHIGG 506
                                                   P   + +  +++  A  I  
Sbjct: 447 QISQMSSGIPIPEPMKETTVVDDLDVANIEEDEEEEPQEVPTKLVTETREINIRATIITV 506

Query: 507 D-DGKLDEGSA-SLILDAKPSKVVSNELTVLVHGSAEATEHLKQHCLKHVCPHVY--TPQ 562
           D +GK D  S  +LI  A P +V      VLVHG A+ T+ LK   L    P V    P 
Sbjct: 507 DFEGKADGKSVRTLITQAAPRRV------VLVHGDAKETKTLKD-ALTAGLPGVQIDAPD 559

Query: 563 IEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKT--ENGMLSLLPIS- 619
             +TI+ TS    YK+++S+ L      + +  Y++ WV+  VGK   E G   LLP+S 
Sbjct: 560 AGKTIECTSASATYKIRVSDALFQKANMRDMAGYKVGWVNGVVGKALEEGGAPMLLPVSA 619

Query: 620 --------TPAPPHK----------SVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGE- 660
                     AP +           SV +GDL+++D +  L+ +GI  EFA G L C   
Sbjct: 620 LNSNADGMALAPSNATMTKVSAQPGSVFLGDLRLSDFRQALAQEGIIAEFADGVLVCANG 679

Query: 661 YVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
            VT+RK           G +++V+EG L +DY+++R  LYSQ+ +L
Sbjct: 680 RVTVRK----------DGDEKLVVEGALSQDYFEVRQILYSQYSIL 715


>gi|348517622|ref|XP_003446332.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like [Oreochromis niloticus]
          Length = 787

 Score =  452 bits (1164), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 276/814 (33%), Positives = 430/814 (52%), Gaps = 135/814 (16%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T +SGV  E  L YL+ +D F FL+DCGW+++F   ++  + +    +DAVLL
Sbjct: 1   MTSIIKLTAVSGVQEETALCYLLQVDEFRFLLDCGWDENFSMEIIDVMKRHVHQVDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD +HLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPIHLGALPYAVGKLGLNCTIYATIPVYKMGQMFMYDLYQSRNNSEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L YSQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKYSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LES  RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLESLSRPSLLITDSFNAAYVQPRRKQRDEQLLTNVMETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLN---YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W         YP+  L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGAYPLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H+TL  + ++L   P  PK+VL S   LE+GFS ++F++W  + KN ++ T R 
Sbjct: 301 NPFQFRHLTLCHSLADLARVP-SPKVVLCSQPDLESGFSRELFIQWCQNAKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K + + + +RV L G+EL  Y E++   K+         + +   
Sbjct: 360 TPGTLARYLIDNPGEKMLDLEVKKRVKLEGKELEEYLEKEKLKKETAKKLEQAKEVDVDS 419

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENN 475
           +     ++      V+   + +    +++  G R        F   +    PMFP +E  
Sbjct: 420 SDESDMDDDLDQSAVVKTKHHDL---MMKGEGSRK-----GSFFKQAKKSYPMFPTHEER 471

Query: 476 SEWDDFGEVINPDDYIIK-------------------DEDMDQ-------------AAMH 503
            +WD++GE+I  +++++                    DE MDQ              ++ 
Sbjct: 472 IKWDEYGEIIRLEEFLVPELQATEEEKSKLESGLTNGDEPMDQDLSVVPTKCISSTESLE 531

Query: 504 IGGD------DGKLDEGSASLILDA-KPSKVVSNELTVLVHGSAEATEHLKQHCL---KH 553
           I         +G+ D  S   I++  KP ++      V+V G  EA+  L + C    K 
Sbjct: 532 IRARVTYIDYEGRSDGDSIKKIINQMKPRQL------VIVRGPPEASLDLAESCKAFSKD 585

Query: 554 VCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVGKTE 609
           +   VYTP+++ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D      V K +
Sbjct: 586 I--KVYTPKLQETVDATSETHIYQVRLKDSLVSSLQFCKAKDTELAWIDGVLDMRVVKVD 643

Query: 610 NGML----------------------------------------------------SLLP 617
            G++                                                     ++P
Sbjct: 644 TGVILEEGVKDEAEESELAMDIAPDLGTDPVNIAVAAQRAMKNLFGEDEKEFSEESDVIP 703

Query: 618 ISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQ 672
              P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+   AG+
Sbjct: 704 TLEPLPPNETPGHQSVFINEPRLSDFKQVLLREGIQAEFVGGVLVCNNMVAVRRT-EAGR 762

Query: 673 KGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
            G         +EG LC+DYYKIR  LY Q+ ++
Sbjct: 763 IG---------LEGCLCDDYYKIRELLYQQYAVV 787


>gi|223648270|gb|ACN10893.1| Cleavage and polyadenylation specificity factor subunit 2 [Salmo
           salar]
          Length = 796

 Score =  451 bits (1161), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 248/667 (37%), Positives = 385/667 (57%), Gaps = 73/667 (10%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T +SGV  E+ L YL+ +D F FL+DCGW++ F   ++  + +    +DAVLL
Sbjct: 1   MTSIIKLTAVSGVQEESALCYLLQVDEFRFLLDCGWDESFSMDIIDSMKRYVHQVDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+ P+++T PVY++G + MYD Y SR    +F+LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCPIYATIPVYKMGQMFMYDLYQSRNNTEDFNLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
            AF  + +L YSQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 CAFDKIQQLKYSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LES  RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCTLESVSRPSLLITDSFNATYVQPRRKQRDEQLLTNVMETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L  + ++L   P  PK+VL S   LE+GFS ++F++W  D KN V+ T R 
Sbjct: 301 NPFQFRHLSLCHSLADLARVP-SPKVVLCSQPDLESGFSRELFIQWCQDAKNSVILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTL R L  +P  K + + + +RV L G EL  Y E++ R+KKE A K  L +E+E  
Sbjct: 360 TPGTLGRYLIDNPGEKMLDLEIRKRVKLEGRELEEYLEKE-RMKKEAAKK--LEQEKEVD 416

Query: 416 ASLGPDNNLSGD---PMVIDANNANASADVVEPHGGRYRDILIDG-------FVPPSTSV 465
                ++++  D   P V+                 ++ D+++ G       F   +   
Sbjct: 417 VDSSDESDMEDDLELPAVVKT---------------KHHDLMMKGDGIRKGSFFKQAKKS 461

Query: 466 APMFPFYENNSEWDDFGEVINPDDYII-----KDEDMDQAAMHIGGDDGKLDEGSASLI- 519
            PMFP +E   +WD++GE+I P+D+++      +E+ ++    +   D  +D+ S+S + 
Sbjct: 462 YPMFPTHEERVKWDEYGEIIRPEDFLVPELQATEEEKNKLESGMANGDEPMDQDSSSKVP 521

Query: 520 ------------------------LDAKPSKVVSNELT----VLVHGSAEATEHLKQHCL 551
                                    D    K + N++     V+VHG  EA+  L + C 
Sbjct: 522 TKCTSTTENLEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLVIVHGPPEASLDLAESCK 581

Query: 552 KHVCP-HVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVG 606
                  VYTP+++ET+D TS+   Y+V+L + L+S++ F +  D E+AW+D      V 
Sbjct: 582 AFTKDIKVYTPKLQETVDATSETHIYQVRLKDSLVSSLQFCRAKDTELAWIDGVLDMRVV 641

Query: 607 KTENGML 613
           K + G+L
Sbjct: 642 KVDTGVL 648



 Score = 69.3 bits (168), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 34/84 (40%), Positives = 49/84 (58%), Gaps = 10/84 (11%)

Query: 623 PPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQI 682
           P H+SV + + +++D K  L  +GIQ EF GG L C   V +R+   AG+ G        
Sbjct: 723 PGHQSVFINEPRLSDFKQVLLREGIQAEFVGGVLVCNNMVAVRRT-EAGRIG-------- 773

Query: 683 VIEGPLCEDYYKIRAYLYSQFYLL 706
            +EG LC+DYYKIR  LY Q+ ++
Sbjct: 774 -LEGCLCDDYYKIRELLYQQYAVV 796


>gi|213514628|ref|NP_001134023.1| cleavage and polyadenylation specificity factor subunit 2 [Salmo
           salar]
 gi|209156194|gb|ACI34329.1| Cleavage and polyadenylation specificity factor subunit 2 [Salmo
           salar]
          Length = 796

 Score =  449 bits (1156), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 247/657 (37%), Positives = 381/657 (57%), Gaps = 53/657 (8%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T +SGV  E+ L YL+ +D F FL+DCGW++ F   ++  + +    +DAVLL
Sbjct: 1   MTSIIKLTAVSGVQEESALCYLLQVDEFRFLLDCGWDESFSMDIIDAMKRYVHQVDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+ P+++T PVY++G + MYD Y SR    +F+LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCPIYATIPVYKMGQMFMYDLYQSRNNTEDFNLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
            AF  + +L YSQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 CAFDKIQQLKYSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LES  RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCTLESVSRPSLLITDSFNATYVQPRRKQRDEQLLTNVMETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L  + ++L   P  PK+VL S   LE+GFS ++F++W  + KN V+ T R 
Sbjct: 301 NPFQFRHLSLCHSLADLARVP-SPKVVLCSQPDLESGFSRELFIQWCQEAKNSVILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTL R L  +P  K + + + +RV L G EL  Y E++ R+KKE A K    KE +  
Sbjct: 360 TPGTLGRYLIDNPGEKMLDLEIRKRVKLEGRELEEYLEKE-RMKKEAAKKLEQEKEVDVD 418

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENN 475
           +S   D +   D + + A       D++    G    +    F   +    PMFP +E  
Sbjct: 419 SS---DESDMEDDLELPAMVKTKHHDLMMKGDG----VRKGSFFKQAKKSYPMFPTHEER 471

Query: 476 SEWDDFGEVINPDDYII-----KDEDMDQAAMHIGGDDGKLDEGSASLI----------- 519
            +WD++GE+I P+D+++      +E+ ++    +   D  +D+ S+S +           
Sbjct: 472 VKWDEYGEIIRPEDFLVPELQATEEEKNKLESCMAKGDEPMDQDSSSKVPTKCTSTTENL 531

Query: 520 --------------LDAKPSKVVSNELT----VLVHGSAEATEHLKQHCLKHVCP-HVYT 560
                          D    K + N++     V+VHG  EA+  L + C        VYT
Sbjct: 532 EIKARVTYIDYEGRSDGDSIKKIINQMKPRQLVIVHGPPEASLDLAESCKAFTKDIKVYT 591

Query: 561 PQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVGKTENGML 613
           P+++ET+D TS+   Y+V+L + L+S++ F +  D E+AW+D      V K + G+L
Sbjct: 592 PKLQETVDATSETHIYQVRLKDSLVSSLQFCRAKDTELAWIDGVLDMRVVKVDTGVL 648



 Score = 68.9 bits (167), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 34/84 (40%), Positives = 49/84 (58%), Gaps = 10/84 (11%)

Query: 623 PPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQI 682
           P H+SV + + +++D K  L  +GIQ EF GG L C   V +R+   AG+ G        
Sbjct: 723 PGHQSVFINEPRLSDFKQVLLREGIQAEFVGGVLVCNNIVAVRRT-EAGRIG-------- 773

Query: 683 VIEGPLCEDYYKIRAYLYSQFYLL 706
            +EG LC+DYYKIR  LY Q+ ++
Sbjct: 774 -LEGCLCDDYYKIRELLYQQYAVV 796


>gi|328722057|ref|XP_001949295.2| PREDICTED: probable cleavage and polyadenylation specificity factor
           subunit 2-like [Acyrthosiphon pisum]
          Length = 724

 Score =  449 bits (1155), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 274/771 (35%), Positives = 421/771 (54%), Gaps = 112/771 (14%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + ++   LSG  NE+P  YL+ ID F FL+DCGW++ F   ++  L +    IDAVLL
Sbjct: 1   MTSIIKFYTLSGAHNESPPCYLLQIDEFKFLLDCGWDELFSMGVVNKLKRYIHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD  HLG LPY + + GL+ PV++T PVY++G + MYD + S     +F+LF LDD+D
Sbjct: 61  SHPDRFHLGILPYLVGKCGLNCPVYATIPVYQMGQMFMYDLHQSLCNAEDFNLFNLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  V ++ Y+Q   L GKG G+ +    +GH++GGT+WKI+K G ED++YAVD+N RK
Sbjct: 121 AAFDKVIQVKYNQIVSLKGKGIGLRIVALASGHMVGGTIWKISKVGEEDIVYAVDFNHRK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG+ LE   RP++LI D +NA + QP R+ R E     I  TLR  GNVL+ VD+A
Sbjct: 181 ERHLNGSDLEKLGRPSLLILDCFNAAYAQPRRRSRDEALMTCILTTLRVKGNVLMAVDTA 240

Query: 239 GRVLELLLILEDYW--AEHSLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL+ +L+  W   E  L  Y + FLT VS +T+++ KS +EWM D + KSFE +R+
Sbjct: 241 GRVLELIHMLDQLWRNKESGLGVYSLVFLTNVSYNTVEFAKSQIEWMSDKLMKSFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F+ KHV L  N ++L    + PK+VLAS   LE GFS ++F+ WAS+ KN ++ T+R 
Sbjct: 301 NPFIFKHVKLCHNMNDLKKVSE-PKVVLASHGDLENGFSREVFIMWASNPKNSIILTDRA 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L      + +K+ + +RVPL   EL   EE   + +KE        K E SK
Sbjct: 360 APGTLARNLIDGGSDRNIKLIVKKRVPLDENEL---EEYNIKYEKE--------KMEGSK 408

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAP------MF 469
                      DP+  D      S D  E   G+Y D+L+D     S   +       MF
Sbjct: 409 M----------DPVSSD------SEDEQEVMRGKY-DLLVDADTLSSKKSSKKEFSHNMF 451

Query: 470 PFYENNSEWDDFGEVINPDDYII---------------KDEDMDQAAMHI---------- 504
           P+YE+  ++D +GE+I P+D+I                K  D+++   ++          
Sbjct: 452 PYYEDKCKFDQYGEIIKPEDFIKFDVAPVDKPTLDEPNKKSDIEENLYNVPSKCVKYEQN 511

Query: 505 -------------GGDDGKLDEGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQHCL 551
                        G  DG   E    ++L  KP ++      +LV G++ +T+ +     
Sbjct: 512 IYVAAKIVYIDFEGRSDG---ESIKQMVLALKPRRL------ILVRGNSYSTKVVYNFAK 562

Query: 552 KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAE------- 604
             +   V+TP+I + ++VT++   Y+V+L++ L+S + FKK  +  +A+++A+       
Sbjct: 563 VFIDGKVFTPRIGQCMNVTTESHIYQVRLTDTLLSKINFKKGPNGNLAYMNAKLKLNSRD 622

Query: 605 --------VGKTENGMLSLLPIST-PAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGA 655
                   + +  + + +L P++     PHK+V +  LK++D K  LS K I  E + G 
Sbjct: 623 TVMEVDNVISEKNDQIFTLEPLADHEIHPHKTVFINRLKLSDFKQILSKKNIPCELSKGV 682

Query: 656 LRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
           L C       +   +G         ++++EG +   YY IR+ LYSQF ++
Sbjct: 683 LWCCNRTVCVRRNSSG---------KVLMEGIISRQYYYIRSLLYSQFIII 724


>gi|198428144|ref|XP_002129804.1| PREDICTED: similar to cleavage and polyadenylation specific factor
           2 [Ciona intestinalis]
          Length = 784

 Score =  445 bits (1144), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 262/811 (32%), Positives = 412/811 (50%), Gaps = 132/811 (16%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + ++ TPL+G  NE P  YL+ +D F FL+DCGW++ FD  ++  + K  S +DA+LL
Sbjct: 1   MTSIIKFTPLAGALNEGPNCYLLQVDEFTFLLDCGWSEDFDMDVINNVMKHISQVDAMLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           + PD  H+GALPY   ++GL+  +++T PVY++G + +YD Y S   + +FD FTLDD+D
Sbjct: 61  TFPDIQHIGALPYLAGKIGLNCAIYATVPVYKMGQMFLYDLYQSHHNIEDFDKFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGE--DVIYAVDYNRR 178
           SAF  +T++ ++Q   L  KG G+ + P  AGH++GGT WKI KD E  +++YAVD+N +
Sbjct: 121 SAFDKITQVKHNQTITLKDKGLGLSITPVHAGHMIGGTAWKIIKDDEEGEIVYAVDFNHK 180

Query: 179 KEKHLNGTVL-------ESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGN 230
           +E+HLNG  L        S   P ++ITD YNA++ Q  R+ R E     I +T+R  GN
Sbjct: 181 RERHLNGCSLFESSGETWSGKPPQLMITDGYNAMYQQARRKLRDEQLLTRIIETMRGDGN 240

Query: 231 VLLPVDSAGRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSIT 287
           VL+ VD+AGRVLEL ++L+  W +       Y +  +  V+ + +++ K  +EWM D I 
Sbjct: 241 VLIAVDTAGRVLELAILLDQLWRDTRSGLCAYSLAMINNVTYNVVEFAKFMVEWMSDKII 300

Query: 288 KSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKN 347
            SF   R+N F  KH+ L  N  +L   P  PK VLAS A +E GF+  +F+ WA+D +N
Sbjct: 301 NSFTDQRNNPFHFKHLKLCHNLGDLAQVPQ-PKCVLASTADMECGFARQLFIRWAADPRN 359

Query: 348 LVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKAS 407
            V+ T R   GTL+R L  DP    +K+ M +RVP++GEEL  YE      +   A KA+
Sbjct: 360 TVIITSRSTKGTLSRTLVDDPTVSRLKLEMKKRVPIIGEELDQYE------RNRAAKKAT 413

Query: 408 LVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAP 467
            VK  E ++S   D + + +P+    N      D + P+    +      F        P
Sbjct: 414 EVKVFEEESS---DESDAEEPV----NTIQNRHDFIVPNEVPKKS---GSFFKQLKKTFP 463

Query: 468 MFPFYENNSEWDDFGEVINPDDY----IIK-DEDMDQAAMHIGGDDGKLDEGSASLILDA 522
           M+PF E   +WD++GE+INPDD+    II+ DE++    +    +  K D      +++ 
Sbjct: 464 MYPFIEPRIKWDEYGEIINPDDFRMSNIIQVDEEVKAEIIKTKMEVDKTDSNPLQSVVEE 523

Query: 523 KPSKVVSNEL---------------------------------TVLVHGSAEATEHLKQH 549
            P+K V+  +                                  ++V    + T++  + 
Sbjct: 524 APTKCVTETVFIEMKCTISFIDFEGRSDGESMLKIIQQIKPREVIVVRADTKTTKYYAEA 583

Query: 550 CLKHVCP---HVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVG 606
             K +      V+TP + E +D T +   Y+V+L + L+  + F    D EI W+DA+V 
Sbjct: 584 IRKALTSSGVEVFTPAVNEVVDTTKERHIYQVKLKDSLVGTLRFSNARDSEICWIDAKVD 643

Query: 607 KTEN----------------------------------------------GMLSLLPIST 620
            +EN                                               + +++P   
Sbjct: 644 CSENVNDSSKVLTDSQIREAKEIADKEEFTMDHDGEDIIASQKSSNAINTQVANIIPSLE 703

Query: 621 P-----APPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGG 675
           P      P H++  + +L+++D K  L+ +G Q EF GG L C   + IR+     Q+G 
Sbjct: 704 PLSIEDTPGHQTCFINELRLSDFKQVLTKEGYQAEFIGGVLVCNNMLAIRR----NQQG- 758

Query: 676 GSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                 I +EG L E+YY IR  LY Q+ ++
Sbjct: 759 -----HIDLEGTLTEEYYAIRDLLYQQYAVV 784


>gi|193676458|ref|XP_001951701.1| PREDICTED: probable cleavage and polyadenylation specificity factor
           subunit 2-like [Acyrthosiphon pisum]
          Length = 729

 Score =  444 bits (1143), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 273/777 (35%), Positives = 425/777 (54%), Gaps = 119/777 (15%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + ++   LSG  NE+P  YL+ ID F FL+DCGW++ F   ++  L +    IDAVLL
Sbjct: 1   MTSIIKFYTLSGAHNESPPCYLLQIDEFKFLLDCGWDERFSMGVVNKLKRYIHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD  HLG LPY + + GL+ PV++T PVY++G + MYD + S     +FDLF LDD+D
Sbjct: 61  SHPDRFHLGILPYLVGKCGLNCPVYATIPVYQMGQMFMYDLHQSLCNAEDFDLFNLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGE-DVIYAVDYNRRK 179
           +AF  V ++ Y+Q   L GKG G+ +    AGH++GGT+W+I+K GE D++YAVD+N +K
Sbjct: 121 AAFDKVIQVKYNQIVSLKGKGIGLRIVALPAGHMVGGTIWRISKVGEEDIVYAVDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG+ LE   RP++LI D +NA ++QP R+ R E     I  TLRA GNVL+ +D+A
Sbjct: 181 ERHLNGSDLERLGRPSLLILDCFNAAYSQPRRRSRDEALMTCILTTLRAKGNVLMAIDTA 240

Query: 239 GRVLELLLILEDYW--AEHSLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL+ +L+  W   E  L  Y + FLT VS +T+++ KS +EWM D + KSFE +R+
Sbjct: 241 GRVLELMHMLDQLWRNKESGLGVYSLVFLTNVSYNTVEFAKSQIEWMSDKLMKSFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KHV L  N ++L+   + PK+VLAS   LE+GFS ++F+ WAS+ KN ++ T+R 
Sbjct: 301 NPFFFKHVKLCHNMNDLNKVSE-PKVVLASNGDLESGFSREVFIMWASNSKNSIILTDRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +   + +K+ + +RVPL   EL    EE      EE ++AS +      
Sbjct: 360 APGTLARDLIDEGGDRNIKLIVKKRVPLDDNEL----EEYNIKHDEEKMEASKI------ 409

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDG-----FVPPSTSVAPMFP 470
                      DP+  D      S D  E   G+Y D+L+D               PMFP
Sbjct: 410 -----------DPVSSD------SEDEQEVMRGKY-DLLVDADTLSSKKSSKKEFPPMFP 451

Query: 471 FYENNSEWDDFGEVINPDDYIIKD-----------------------------------E 495
           +YE   ++D +GE+I  +D+I  D                                   +
Sbjct: 452 YYEEKCKFDPYGEIIKQEDFIKFDVAPGDKPTVDEQNKKSDEDEEEDLNDVPSKCVEYEQ 511

Query: 496 DMDQAA--MHI---GGDDGKLDEGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQHC 550
           ++  AA  +HI   G  DG   E    ++L  KP +++      LV G+  +T+ +    
Sbjct: 512 NIYVAAKIVHIDFEGRSDG---ESIKQIVLALKPRRLI------LVRGNPYSTKVVYNFA 562

Query: 551 LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVG---- 606
              +   V+TP+I + ++VT++   Y+V+L++ L+S + FKK  + ++A+++A++     
Sbjct: 563 KVFIDGKVFTPRIGQCLNVTTESHIYQVRLTDALLSKINFKKGPNGDLAYMNAKLKLNSR 622

Query: 607 ---------------KTENGMLSLLPIST-PAPPHKSVLVGDLKMADLKPFLSSKGIQVE 650
                          + ++ + +L P++     P K+V +  LK++D K  LS   I  E
Sbjct: 623 DTVMEVDNVVSEKMPRIDDQIFTLEPLAEHEIHPRKTVFINRLKLSDFKQILSKNNIPCE 682

Query: 651 FAGGAL-RCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
            + G L  C   V +R+          + + ++++EG +   YY IR+ LYSQF ++
Sbjct: 683 LSKGVLWCCNRTVCVRR----------NSSGKVLMEGIISRQYYYIRSLLYSQFIII 729


>gi|357610700|gb|EHJ67102.1| putative cleavage and polyadenylation specificity factor 100 kDa
           subunit [Danaus plexippus]
          Length = 818

 Score =  444 bits (1143), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 263/764 (34%), Positives = 410/764 (53%), Gaps = 97/764 (12%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + ++   LSG  +E+P  Y++ +D F FL+DCGW++ FD   ++ L +  ++IDAVLL
Sbjct: 1   MTSIIKFHCLSGAGDESPPCYVLQVDEFKFLLDCGWDEKFDMDFIKELKRHVNSIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SH D LHLGALPYA+ QLGL+ P+++T P+Y++G + MYD Y S + VSEFDLFTLDD+D
Sbjct: 61  SHSDPLHLGALPYAVGQLGLNCPIYATLPIYKMGQMFMYDLYQSHKNVSEFDLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  +T+L Y+Q+  + GKG G+ + P  AGHLLGGTVW+I   G ED++YA D+N +K
Sbjct: 121 TAFDRITQLKYNQSVDMKGKGLGLRITPLPAGHLLGGTVWRIAAPGEEDIVYAPDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG  +E  +RP++L+  A NA + Q  R+ R E     I  TLR GG+VL+  D+A
Sbjct: 181 ERHLNGCEIEKIMRPSLLLLGAMNADYVQQRRRLRDEKLMTTILSTLRGGGSVLVCTDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W       + Y +  L+ VS + +++ KS +EWM D +T++FE +R 
Sbjct: 241 GRVLELAHMLDQLWRNKDSGLVAYSLLLLSNVSYNVVEFAKSQIEWMSDKLTRAFEGARS 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F L+H+ L  +  E+   P GPK+VLAS   LE GF+ D+F++WA + +N ++ T R 
Sbjct: 301 NPFALRHLQLCHSVVEVTRTP-GPKVVLASFPDLETGFARDLFLQWAPNSQNSIVLTART 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLK---KEEALKASLVKEE 412
             GTLAR L      + +++T+ RRV L G EL  + +++ ++    KEE    S   E 
Sbjct: 360 SPGTLARDLIEKGGDRTIELTVRRRVRLEGAELEEFMQQRVKVNNSVKEETGGISSDSES 419

Query: 413 ESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFY 472
           E +  +         P+  DA  A         H                     M+P  
Sbjct: 420 EGELEMCVVTGKHDIPVRGDARPAGCFKSNKRHHA--------------------MYPCT 459

Query: 473 ENNSEWDDFGEVINPDDYIIKD--------EDMDQAAMHIGGD----------------- 507
           E  +  DD+GE+I P+DY + +         D+  A  H                     
Sbjct: 460 EERARADDYGEIIRPEDYRLAEVVDAEGEIRDVPPAPTHTQEPEEEITEIPSKCITATKQ 519

Query: 508 ------------DGKLD-EGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQHCLKHV 554
                       +G+ D E    ++  AKP  VV+      +     A   LK+HC    
Sbjct: 520 LQVKASIQYIELEGRCDGESLLRVVAAAKPRAVVA------LRAGPTALATLKKHCDSEG 573

Query: 555 CPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENG--- 611
              V+TP   +T+D T++   Y+V+L++ +M  + ++  GD E+AW+ A V +       
Sbjct: 574 IEKVFTPGRGDTVDATTESHIYQVKLTDSVMCGLSWRSAGDAELAWLSAVVAQPRTRDTP 633

Query: 612 --------MLSLLPISTPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGE-YV 662
                   M+SL   +    PH +  V  +++++L+  L+  G+  EF+ GAL C    +
Sbjct: 634 SEEVADVEMMSLE--AAEGVPHGAWFVNSVRLSELRAALARNGLGAEFSAGALECCNGTI 691

Query: 663 TIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
            IR++    + G      ++ +EG L E+Y+K+R  LY QF ++
Sbjct: 692 AIRRL----ENG------RVALEGVLSEEYFKVRELLYDQFAIV 725


>gi|393910519|gb|EFO19846.2| cleavage and polyadenylation specificity factor subunit 2 [Loa loa]
          Length = 828

 Score =  443 bits (1139), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 280/842 (33%), Positives = 436/842 (51%), Gaps = 150/842 (17%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  LSGV ++ PL YL+ +D   FL+DCGW++ FD + ++ + +    I+AVLL
Sbjct: 1   MTSIIKLEALSGVQDDGPLCYLLQVDQVYFLLDCGWDERFDMAYIEAVKRRVPLINAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+ D  HLGALPY +++ GL+ P+++T PVY++G + +YD   +   V +F+LF LDDID
Sbjct: 61  SYADIPHLGALPYLVRKCGLNCPIYATVPVYKMGQMFLYDWVNNHTSVEDFNLFNLDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF+ V ++ YSQ   L G   G+ + P  AGH++GG +W+ITK G E+++YAVD+N RK
Sbjct: 121 AAFERVQQVKYSQTILLKGDN-GLQITPLPAGHMIGGAIWRITKMGDEEIVYAVDFNHRK 179

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG   E   RP +LITD++NAL+NQP R+QR E     +  T+R GG+V++ +D+A
Sbjct: 180 ERHLNGCTFEGIGRPNLLITDSFNALYNQPRRKQRDEQLVTRLLGTVRDGGDVMIVIDTA 239

Query: 239 GRVLELLLILEDYW--AEHSL-NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLE+  +L+  W  AE  L  Y +  L++V+SS +++ KS +EWM D + KSFE  R 
Sbjct: 240 GRVLEIAHLLDQLWHNAEAGLMTYNLVMLSHVASSVVEFAKSQVEWMSDKVLKSFEVGRY 299

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +HV L     +L      PK+VL S   +E+GFS ++F+EW +D+KN V+ T R 
Sbjct: 300 NPFQFRHVQLCHTHIDLLRV-RSPKVVLVSGLDMESGFSRELFLEWCTDIKNSVIVTGRS 358

Query: 356 QFGTL-ARML----QADPPP-----KAVKVTMSRRVPLVGEELIAY-------EEEQTRL 398
              TL AR++    QA   P     + + + + RR+ L G EL  Y       E E TR+
Sbjct: 359 GDRTLGARLIRMAEQAAENPNGTINRNLTLEVKRRIRLEGAELENYRAKKRAEEREATRI 418

Query: 399 KKEEALKASLVKEE----------------------ESKASLGPDNNLSGDPMVIDANNA 436
           + E + + + +++                         K +    N  S   +      A
Sbjct: 419 RLEASRRNARLEQADSSDDSDDDAVMVVPATTSGVLNGKMTNSKRNVTSSFSVSTTTTTA 478

Query: 437 NASADVVEPHGGRYRDILI-------DGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDD 489
           + SA  +     R  DI+          F   S    PMFP+ E  + WDD+GE+I P++
Sbjct: 479 DMSAAQIAEQ--RSHDIMWKWEQQQKSSFFKQSKKSFPMFPYIEEKTRWDDYGEIIRPEE 536

Query: 490 YIIKDEDM--DQAAMHIGGDDGKLDEGSASLILDAK-PSKVVSNELT------------- 533
           Y+I D  +       H  G DG  D     L  + + PSK +S  +              
Sbjct: 537 YMIADTPVVPQIPPEHKDGADGTFDGQVVPLYEEREWPSKCISQIMKMEVLCKVDFIDFE 596

Query: 534 --------------------VLVHGSAEATEHLKQHCLKH--VCPHVYTPQIEETIDVTS 571
                               ++VHGS+ AT HL Q+  ++  V   ++TP++ E +D T 
Sbjct: 597 GRSDGESAKKILSQIKPKQLIIVHGSSAATRHLAQYAQQNGIVQGKIFTPRLGEIVDATI 656

Query: 572 DLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEV--------GKTENG------------ 611
           +   Y+V LS+ +MS+++F+ + D E++W+DA +        G+T+N             
Sbjct: 657 ESHIYQVTLSDAVMSSLIFQTVKDAELSWLDARIVRRKTVTPGQTQNADEENCETNGNKE 716

Query: 612 --------------------------MLSLLPI-STPAPPHKSVLVGDLKMADLKPFLSS 644
                                        L PI S   PPH++V V D K++D+K  L+S
Sbjct: 717 EVEEMEQDGDEVEGKRLSNLKVAAADTFCLEPILSANIPPHQTVFVNDPKLSDVKQLLAS 776

Query: 645 KGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFY 704
            G + EF+ G L      +IR+   AG         +  +EG  CEDYYKIR  +Y+QF 
Sbjct: 777 NGFRAEFSSGILYINNIASIRR-NEAG---------RFHVEGCACEDYYKIRDIVYAQFA 826

Query: 705 LL 706
           ++
Sbjct: 827 VV 828


>gi|312084310|ref|XP_003144223.1| cleavage and polyadenylation specificity factor subunit 2 [Loa loa]
          Length = 837

 Score =  439 bits (1130), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 277/849 (32%), Positives = 437/849 (51%), Gaps = 155/849 (18%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  LSGV ++ PL YL+ +D   FL+DCGW++ FD + ++ + +    I+AVLL
Sbjct: 1   MTSIIKLEALSGVQDDGPLCYLLQVDQVYFLLDCGWDERFDMAYIEAVKRRVPLINAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+ D  HLGALPY +++ GL+ P+++T PVY++G + +YD   +   V +F+LF LDDID
Sbjct: 61  SYADIPHLGALPYLVRKCGLNCPIYATVPVYKMGQMFLYDWVNNHTSVEDFNLFNLDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF+ V ++ YSQ   L G   G+ + P  AGH++GG +W+ITK G E+++YAVD+N RK
Sbjct: 121 AAFERVQQVKYSQTILLKGDN-GLQITPLPAGHMIGGAIWRITKMGDEEIVYAVDFNHRK 179

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG   E   RP +LITD++NAL+NQP R+QR E     +  T+R GG+V++ +D+A
Sbjct: 180 ERHLNGCTFEGIGRPNLLITDSFNALYNQPRRKQRDEQLVTRLLGTVRDGGDVMIVIDTA 239

Query: 239 GRVLELLLILEDYW--AEHSL-NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLE+  +L+  W  AE  L  Y +  L++V+SS +++ KS +EWM D + KSFE  R 
Sbjct: 240 GRVLEIAHLLDQLWHNAEAGLMTYNLVMLSHVASSVVEFAKSQVEWMSDKVLKSFEVGRY 299

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +HV L     +L      PK+VL S   +E+GFS ++F+EW +D+KN V+ T R 
Sbjct: 300 NPFQFRHVQLCHTHIDLLRV-RSPKVVLVSGLDMESGFSRELFLEWCTDIKNSVIVTGRS 358

Query: 356 QFGTL-ARML----QADPPP-----KAVKVTMSRRVPLVGEELIAY-------EEEQTRL 398
              TL AR++    QA   P     + + + + RR+ L G EL  Y       E E TR+
Sbjct: 359 GDRTLGARLIRMAEQAAENPNGTINRNLTLEVKRRIRLEGAELENYRAKKRAEEREATRI 418

Query: 399 KKEEALKASLVKEE----------------------ESKASLGPDN-----------NLS 425
           + E + + + +++                         K +    N             +
Sbjct: 419 RLEASRRNARLEQADSSDDSDDDAVMVVPATTSGVLNGKMTNSKRNVTSSFSVSTTTTTA 478

Query: 426 GDPM---VIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWDDFG 482
           G+P+   + D + A  +         ++       F   S    PMFP+ E  + WDD+G
Sbjct: 479 GNPLKSFLTDMSAAQIAEQRSHDIMWKWEQQQKSSFFKQSKKSFPMFPYIEEKTRWDDYG 538

Query: 483 EVINPDDYIIKDEDM--DQAAMHIGGDDGKLDEGSASLILDAK-PSKVVSNELT------ 533
           E+I P++Y+I D  +       H  G DG  D     L  + + PSK +S  +       
Sbjct: 539 EIIRPEEYMIADTPVVPQIPPEHKDGADGTFDGQVVPLYEEREWPSKCISQIMKMEVLCK 598

Query: 534 ---------------------------VLVHGSAEATEHLKQHCLKH--VCPHVYTPQIE 564
                                      ++VHGS+ AT HL Q+  ++  V   ++TP++ 
Sbjct: 599 VDFIDFEGRSDGESAKKILSQIKPKQLIIVHGSSAATRHLAQYAQQNGIVQGKIFTPRLG 658

Query: 565 ETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEV--------GKTENG----- 611
           E +D T +   Y+V LS+ +MS+++F+ + D E++W+DA +        G+T+N      
Sbjct: 659 EIVDATIESHIYQVTLSDAVMSSLIFQTVKDAELSWLDARIVRRKTVTPGQTQNADEENC 718

Query: 612 ---------------------------------MLSLLPI-STPAPPHKSVLVGDLKMAD 637
                                               L PI S   PPH++V V D K++D
Sbjct: 719 ETNGNKEEVEEMEQDGDEVEGKRLSNLKVAAADTFCLEPILSANIPPHQTVFVNDPKLSD 778

Query: 638 LKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRA 697
           +K  L+S G + EF+ G L      +IR+   AG         +  +EG  CEDYYKIR 
Sbjct: 779 VKQLLASNGFRAEFSSGILYINNIASIRR-NEAG---------RFHVEGCACEDYYKIRD 828

Query: 698 YLYSQFYLL 706
            +Y+QF ++
Sbjct: 829 IVYAQFAVV 837


>gi|384251490|gb|EIE24968.1| hypothetical protein COCSUDRAFT_83661 [Coccomyxa subellipsoidea
           C-169]
          Length = 731

 Score =  439 bits (1129), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 287/752 (38%), Positives = 427/752 (56%), Gaps = 79/752 (10%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPD 64
           +QVTPL G   + P+  L+ ID    L+DCGW+D +D  LL PL  V   +  VL++HPD
Sbjct: 3   IQVTPLYGAGTDGPVCNLLQIDQLLLLLDCGWDDAYDMELLHPLKNVIGHVHGVLITHPD 62

Query: 65  TLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQ 124
             HLGALPY + +L LS PV++T PV ++G + MYDQY++R  V++F  F LDD+D AF 
Sbjct: 63  PAHLGALPYLVGRLKLSVPVYATFPVQKMGEIFMYDQYVTRHAVTDFAAFNLDDVDEAFA 122

Query: 125 SVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK-DGEDVIYAVDYNRRKEKHL 183
            +T L Y Q   L G GEG  + P  AGHLLGG +W+IT  + E ++YAV YN +KE+HL
Sbjct: 123 RITPLKYQQTLTLEGPGEGFSITPFAAGHLLGGCIWRITTPEEEHIVYAVHYNHKKERHL 182

Query: 184 NGTVLES-FVRPAVLITDAYNALHNQPPRQQREM---FQDAISKTLRAGGNVLLPVDSAG 239
           NG VL+S F RPA+LITDA N++     R +  +    ++A+  T+RA GNVL+PVD+AG
Sbjct: 183 NGGVLDSAFSRPAILITDADNSMLEGAVRSRETLDKELREAVMATVRANGNVLIPVDAAG 242

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
           R+LEL+L+LE++W +  L YP+  L+ ++ + ++   S LEWM   I + FE ++ N F 
Sbjct: 243 RLLELVLLLEEHWDKQKLTYPLVLLSPMAYNVLELASSQLEWMSHYIGQMFERTKQNPFS 302

Query: 300 L---KHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQ 356
           +   K + L     EL   P GP++V+A++ SLEAG S  +  EWA++  NL+LF  R  
Sbjct: 303 VRQAKKLKLCRTTEELAKLPPGPRVVMATLPSLEAGASRQLLTEWATNPANLILFPGRAP 362

Query: 357 FGTLARMLQAD---PPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEE 413
             TLA +LQ +     P  V + +S+R+PL G EL A++E QT         A +++EEE
Sbjct: 363 NDTLAGLLQQNMQSGQPFTVPIRLSKRMPLQGAELQAWQESQT---------AHVLEEEE 413

Query: 414 SKA----SLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMF 469
             A    S+G  +  + D   +   +   S+    P       +LIDGFV P  +VAPMF
Sbjct: 414 EPAISTESIGKISRATSDGAKLAPASLQPSSMASLPAA----RVLIDGFVVPEGAVAPMF 469

Query: 470 PFYENNSEWDDFGEVINPDDY-------IIKDEDMD------------------------ 498
           P  ++++E+DD+G +++P ++            DMD                        
Sbjct: 470 PSEDDDNEYDDYGALLHPGEFQQAGGTATAMSMDMDDGEDSPEEEEVPTKVVFEDIKLPV 529

Query: 499 QAAMHIGGDDGKLDEGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQHC---LKHVC 555
            A + +   DG+ D  S  LIL     KV    L VLVHG+ +AT+ L+  C   L  V 
Sbjct: 530 HARLLLLDYDGRSDGRSMRLIL----GKVAPRHL-VLVHGTPQATQVLRDACGDDLYSVN 584

Query: 556 PHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLG-DYEIAWVDAEVGKTENGML- 613
             V+ P   ET+DV++   +++V LS+ L++ +  +++G +Y +AWV   V    +G L 
Sbjct: 585 GQVHCPANGETVDVSAGTSSFQVGLSDGLLAQLRMRQMGSEYALAWVHGVVASVNSGALP 644

Query: 614 SLLPISTPAPP--HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAG 671
            +LP S  A       V +GD K++DLK  L  +GI   F  G L+C   V++++  P  
Sbjct: 645 EVLPASASAGEALEGGVFIGDAKLSDLKTALEKEGIAAVFVEGNLQCSGSVSVKRTVP-- 702

Query: 672 QKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQF 703
           + GG      I++EGPL +DYY+IR  LYSQ+
Sbjct: 703 EDGG------IILEGPLSDDYYRIRTVLYSQY 728


>gi|325187176|emb|CCA21717.1| cleavage and polyadenylation specificity factor subunit putative
           [Albugo laibachii Nc14]
 gi|325187319|emb|CCA21858.1| cleavage and polyadenylation specificity factor subunit putative
           [Albugo laibachii Nc14]
          Length = 731

 Score =  438 bits (1127), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 264/754 (35%), Positives = 415/754 (55%), Gaps = 78/754 (10%)

Query: 5   VQVTPLSGVFNENPL-SYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHP 63
           +  TPL GV++ +P  +YL+ ID    L+DCGW D +D  LL+PL KVA  ID VL+SHP
Sbjct: 4   ITFTPLYGVYSRDPCCAYLLEIDEVCILLDCGWTDQYDTELLKPLQKVADRIDLVLISHP 63

Query: 64  DTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLS-RRQVSEFDLFTLDDIDSA 122
           D  H+GALPYA+ +LGL AP++ T PV+RLG + +YD Y +  +   +F+L+ LD +D+ 
Sbjct: 64  DMAHIGALPYAIGKLGLKAPIYGTLPVHRLGQINLYDAYQAIVKSDGDFNLYNLDHVDAV 123

Query: 123 FQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKH 182
           F++  +L YS+   L+  GEGIV+ PH +GHL+GG++W+I K+ +++IYAVDYN R E  
Sbjct: 124 FENFKQLKYSEKLTLTSSGEGIVITPHASGHLIGGSMWRIMKETDEIIYAVDYNHRSEHV 183

Query: 183 LNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRV 241
           L  +VL SF RP +LITD+ +    QP  + R+      I KTLR+GGNVLLP DSAGRV
Sbjct: 184 LPKSVLSSFTRPTLLITDSLSLHTKQPKLKDRDSKIMVEILKTLRSGGNVLLPTDSAGRV 243

Query: 242 LELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLK 301
           LEL+ +L+ YW ++ L  PI  L  +S  T    ++ LEW  + I ++F+  R N F   
Sbjct: 244 LELMRVLDQYWIQNKLRDPIALLHDMSYYTPKAAEAMLEWCNEQIARNFDAGRQNPFQFS 303

Query: 302 HVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER---GQFG 358
           H+ L+ +  EL+     PK+VLA+ A+LE G++ ++F+++A+D +N ++FT       FG
Sbjct: 304 HIHLIHSIEELEKL-SSPKVVLATSATLECGYAKELFIKYAADTRNSIIFTTTPPPRSFG 362

Query: 359 TLARMLQADPP--PKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKA 416
             AR+L  +     + V  ++++RV L G EL  YE ++ R  + EA         E +A
Sbjct: 363 --ARILDMNKKNDSRVVTCSVAKRVLLEGTELALYEAKERRRLRLEA---------EQRA 411

Query: 417 SLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNS 476
               D  +    M I+   ++A     EP+  + R     G    ++   PMF   E   
Sbjct: 412 KEMEDAAMEDMMMGIEEYESDAED---EPN-TQLRGTFKFGLGQIASIRYPMFFCTEPKV 467

Query: 477 EWDDFGEVINPDDY---------IIKD-----EDMDQAAMHIGGDDGKLDEGSASLILD- 521
           EWD++GE+I P+D+         +I+      +D+D+    I   D  +D      +++ 
Sbjct: 468 EWDEYGEIIRPEDFRDTSLSANLLIRKALPGLDDVDRDTTMIDDQDTVVDSRPMKTVVEH 527

Query: 522 ----------------AKPSKVVSNELT-------VLVHGSAEATEHLKQ--HCLKHVCP 556
                               + + N L+       +LVHG+ E T  LKQ      ++C 
Sbjct: 528 LHVTVNARILWVDFDGIADGRAIRNCLSNVKPRKLILVHGTEETTADLKQFVESTINLCE 587

Query: 557 HVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGML-SL 615
            ++TP++ E ID+ SD   YK+ L E L + + F K+G++++A+V  +V  +    + +L
Sbjct: 588 AIFTPKVMECIDIESDTSIYKLALKESLYTAMNFHKVGNHDVAYVTGQVSTSATSSIPTL 647

Query: 616 LPIS-TPAPPHKSVLVGD--LKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQ 672
            P S +    HK +L+ D  LK+  +K  L   G   +F  G L C + V +++      
Sbjct: 648 QPRSDSNMTEHKPLLLSDGKLKLDIMKQVLGRAGFDAKFRSGMLICNDGVVLKR------ 701

Query: 673 KGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
               +   +IV+EG L   YY+IR+ LY QF L+
Sbjct: 702 ----AHNNEIVVEGVLSASYYRIRSLLYEQFTLI 731


>gi|170581110|ref|XP_001895540.1| cleavage and polyadenylation specificity factor [Brugia malayi]
 gi|158597460|gb|EDP35606.1| cleavage and polyadenylation specificity factor, putative [Brugia
           malayi]
          Length = 831

 Score =  435 bits (1119), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 277/846 (32%), Positives = 438/846 (51%), Gaps = 155/846 (18%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  LSGV ++ PL YL+ +D   FL+DCGW++ FD + ++ + +    I+AVLL
Sbjct: 1   MTSIIKLEALSGVQDDGPLCYLLQVDQVYFLLDCGWDERFDMAYIEAVKRRVPLINAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+ D  HLGALPY +++ GL+ P+++T PVY++G + +YD   +   V +F+LF LDDID
Sbjct: 61  SYADIPHLGALPYLVRKCGLNCPIYATVPVYKMGQMFLYDWVNNHTSVEDFNLFNLDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF+ V ++ YSQ   L G   G+ + P  AGH++GG +W+ITK G E+++YAVD+N RK
Sbjct: 121 AAFERVQQVKYSQTILLKGDN-GLQITPLPAGHMIGGAIWRITKMGDEEIVYAVDFNHRK 179

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG   E   RP +LITD++NAL+NQP R+QR E     +  T+R GG+V++ +D+A
Sbjct: 180 ERHLNGCTFEGIGRPNLLITDSFNALYNQPRRKQRDEQLVTRLLGTVRDGGDVMIVIDTA 239

Query: 239 GRVLELLLILEDYW--AEHSL-NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLE+  +L+  W  AE  L  Y +  L++V+SS +++ KS +EWM D + KSFE  R 
Sbjct: 240 GRVLEIAHLLDQLWHNAEAGLMTYNLVMLSHVASSVVEFAKSQVEWMSDKVLKSFEVGRY 299

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +HV L     +L      PK+VL S   +E+GFS ++F+EW +D+KN V+ T R 
Sbjct: 300 NPFQFRHVQLCHTHIDLMRV-RSPKVVLVSGLDMESGFSRELFLEWCTDIKNSVIVTGRS 358

Query: 356 QFGTL-ARML----QADPPP-----KAVKVTMSRRVPLVGEELIAY-------EEEQTRL 398
              TL AR++    QA   P     + + + + RR+ L G EL  Y       E E TR+
Sbjct: 359 GDRTLGARLIRMAEQAAENPNGTINRNLTLEVKRRIRLDGVELENYRAKKRAEEREATRI 418

Query: 399 KKEEALKASLVKE------EESKASLGPDNNLSGDPMVIDANNANASADV---------- 442
           + E + + + +++       +  A +      SG   +++    N+  ++          
Sbjct: 419 RLEASRRNARLEQADSSDDSDDDAVMVVPATTSG---ILNGKMTNSKRNIASSFSASTTT 475

Query: 443 --------VEPHGGRYRDILI-------DGFVPPSTSVAPMFPFYENNSEWDDFGEVINP 487
                    +    R  DI+          F   S    PMFP+ E  + WDD+GE+I P
Sbjct: 476 STTADLSAAQIAEQRSHDIMWKWEQQQKSSFFKQSKKSFPMFPYIEEKTRWDDYGEIIRP 535

Query: 488 DDYIIKDEDM--DQAAMHIGGDDGKLDEGSASLILDAK-PSKVVSNELT----------- 533
           ++Y+I D  +       H  G D   D     L  + + PSK +S  +            
Sbjct: 536 EEYMIVDTPVVPQIPPEHKDGTDSTFDGQVVPLYEEREWPSKCISQIMKMEVLCKVDFID 595

Query: 534 ----------------------VLVHGSAEATEHLKQHCLKH--VCPHVYTPQIEETIDV 569
                                 ++VHGS+ AT HL Q+  ++  V   ++TP++ E +D 
Sbjct: 596 FEGRSDGESAKKILSQIKPKQLIIVHGSSAATRHLAQYAQQNGIVQGKIFTPRLGEIVDA 655

Query: 570 TSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEV--------GKTEN----------- 610
           T +   Y+V LS+ +MS+++F+ + D E++W+DA +        G+T N           
Sbjct: 656 TIESHIYQVTLSDAVMSSLIFQTVKDAELSWLDARIVRRKTVTPGQTRNTAEENLETNGN 715

Query: 611 ------------------GMLSLLPI------------STPAPPHKSVLVGDLKMADLKP 640
                               LS L +            S   PPH++V V D K++D+K 
Sbjct: 716 KEEEVEEMEQDDSDQVEGKRLSNLKVAAADTFCLEPMLSANIPPHQAVFVNDPKLSDMKQ 775

Query: 641 FLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLY 700
            L+S G + EF+ G L      +IR+   AG         +  +EG  CEDYYKIR  +Y
Sbjct: 776 LLASNGFRAEFSSGVLYINNIASIRR-NEAG---------RFHVEGCACEDYYKIRDIVY 825

Query: 701 SQFYLL 706
           +QF ++
Sbjct: 826 AQFAVV 831


>gi|324503279|gb|ADY41427.1| Cleavage and polyadenylation specificity factor subunit 2 [Ascaris
           suum]
          Length = 841

 Score =  430 bits (1106), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 281/859 (32%), Positives = 439/859 (51%), Gaps = 171/859 (19%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  LSGV ++ PL YL+ +D   FL+DCGW++ FD + ++ + +    I+AVLL
Sbjct: 1   MTSIIKLEALSGVQDDGPLCYLLQVDQVFFLLDCGWDERFDMAYIEAVKRRVPQINAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+ D LHLGALPY +++ G++ P+++T PVY++G + +YD       V +F LF LDDID
Sbjct: 61  SYADILHLGALPYLVRKCGMNCPIYATVPVYKMGQMFLYDWVNGHTSVEDFTLFNLDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
            AF+ V ++ YSQ   L G   G+ + P  AGH++GG +W+ITK G E+++YAVD+N +K
Sbjct: 121 GAFERVQQVKYSQTILLKGDN-GLQITPLPAGHMIGGAIWRITKMGDEEIVYAVDFNHKK 179

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG   E   RP ++ITDA+NAL+NQP R+QR E     +  T+R GG+V++ +D+A
Sbjct: 180 ERHLNGCTFEGIGRPNLMITDAFNALYNQPRRKQRDEQLVTKLLGTVRDGGDVMIVIDTA 239

Query: 239 GRVLELLLILEDYW--AEHSL-NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GR+LE+  +L+  W  AE  L  Y +  L++V+SS +++ KS +EWM D I KSFE  R 
Sbjct: 240 GRILEIAHLLDQLWHNAEAGLMTYNLVMLSHVASSVVEFAKSQVEWMSDKILKSFEVGRY 299

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +HV L     +L      PK+VL S   +E GFS +IF+EW +DV+N V+ T R 
Sbjct: 300 NPFQFRHVQLCHTHMDLLRI-RSPKVVLVSGLDMECGFSREIFLEWCADVRNTVIVTGRS 358

Query: 356 QFGTL-ARMLQ-----ADPPP---KAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEA--- 403
              TL AR+++     A+ P    + + + + RR+ L G EL  Y  ++   ++E A   
Sbjct: 359 GDRTLGARLIRMAEQMAENPSTVNRNLTLEVKRRIRLEGVELENYRAKKRADEREAARKR 418

Query: 404 LKASL--VKEEESKASLGPDNN----LSGDPMVIDANNANA--------SADVVEPHGG- 448
           L+AS    + E +++S   D+     ++G+ M I A NA +         +     HGG 
Sbjct: 419 LEASRRNARLEHAESSDDSDDETVMVVTGNNMGISAGNAKSLTTNTPSRHSSSTSIHGGN 478

Query: 449 ------------------RYRDILI-------DGFVPPSTSVAPMFPFYENNSEWDDFGE 483
                             R  DI+          F   +    P+FP+ E  + WDD+GE
Sbjct: 479 PTSPINSTTLTPAQLAEQRSHDIMWKWEQQQKSSFFKQNKKAFPVFPYIEEKTRWDDYGE 538

Query: 484 VINPDDYIIKDEDM------DQAAMHIGGD------------------------------ 507
           +I P++Y+I D  +      ++ A  I G                               
Sbjct: 539 IIRPEEYMIVDSSVVPHITTERMAESIPGTPHSENGQTVPHYEEREWPTKCISQITKMEV 598

Query: 508 ---------DGKLDEGSASLIL-DAKPSKVVSNELTVLVHGSAEATEHLKQHCLKH--VC 555
                    +G+ D  S   IL   KP ++      V+VHGSA AT HL Q+  +   V 
Sbjct: 599 LCKVEFIDFEGRSDGESMKKILSQVKPKQL------VIVHGSAAATRHLAQYASETGIVQ 652

Query: 556 PHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGK-------- 607
             ++TP++ E +D T +   Y+V LS+ LMS+++F+ + D E++W+DA + +        
Sbjct: 653 GKIFTPRLGEIVDATIESHIYQVTLSDALMSSLIFQTVKDAELSWLDARIARRKAITGAT 712

Query: 608 ------TENG---------------------------------MLSLLPI-STPAPPHKS 627
                  E G                                    L P+ S+  P H++
Sbjct: 713 SAVKENREEGEEMPNEDETMEQGGEEETGDGERLSNKKAAAADTFCLEPMPSSNIPSHQA 772

Query: 628 VLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGP 687
           V V D K++D+K  L + G   EF+ G L      +IR+   AG         +  +EG 
Sbjct: 773 VFVNDPKLSDMKQLLMANGFHAEFSSGVLYINNVASIRR-NEAG---------RFHVEGC 822

Query: 688 LCEDYYKIRAYLYSQFYLL 706
             EDYYKIR  +Y+QF ++
Sbjct: 823 ASEDYYKIRDIVYAQFAIV 841


>gi|13938095|gb|AAH07163.1| Cpsf2 protein, partial [Mus musculus]
          Length = 732

 Score =  424 bits (1091), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 255/758 (33%), Positives = 400/758 (52%), Gaps = 136/758 (17%)

Query: 55  IDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLF 114
           IDAVLLSHPD LHLGALP+A+ +LGL+  +++T PVY++G + MYD Y SR    +F LF
Sbjct: 5   IDAVLLSHPDPLHLGALPFAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLF 64

Query: 115 TLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAV 173
           TLDD+D+AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAV
Sbjct: 65  TLDDVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAV 124

Query: 174 DYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVL 232
           D+N ++E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL
Sbjct: 125 DFNHKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVL 184

Query: 233 LPVDSAGRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKS 289
           + VD+AGRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + 
Sbjct: 185 IAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRC 244

Query: 290 FETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLV 349
           FE  R+N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN +
Sbjct: 245 FEDKRNNPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSI 303

Query: 350 LFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLV 409
           + T R   GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+         
Sbjct: 304 ILTYRTTPGTLARFLIDNPTEKVTEIELRKRVKLEGKELEEYVEKEKLKKEAAKKLEQ-- 361

Query: 410 KEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPP 461
                          S +  +  ++ ++   DV +P   + + D+++ G       F   
Sbjct: 362 ---------------SKEADIDSSDESDVEEDVDQPSAHKTKHDLMMKGEGSRKGSFFKQ 406

Query: 462 STSVAPMFPFYENNSEWDDFGEVINPDDYIIKD---EDMDQAAMHIGGDDG--------- 509
           +    PMFP  E   +WD++GE+I P+D+++ +    + +++ +  G  +G         
Sbjct: 407 AKKSYPMFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGEEPMDQDLS 466

Query: 510 ----KLDEGSASLILDAKPS-------------KVVSNELT----VLVHGSAEATEHLKQ 548
               K    + S+ + A+ +             K + N++     ++VHG  EA++ L +
Sbjct: 467 DVPTKCVSATESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAE 526

Query: 549 HCL----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA- 603
            C     K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D  
Sbjct: 527 CCRAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGV 584

Query: 604 ---EVGKTENGML----------------------------------------------- 613
               V K + G++                                               
Sbjct: 585 LDMRVSKVDTGVILEEGELKDDGEDSEMQVDAPSDSSAMAQQKAMKSLFGEDEKELGEET 644

Query: 614 SLLPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVG 668
            ++P   P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+  
Sbjct: 645 EIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR-- 702

Query: 669 PAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                   + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 703 --------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 732


>gi|328768987|gb|EGF79032.1| hypothetical protein BATDEDRAFT_12823 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 719

 Score =  422 bits (1085), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 271/757 (35%), Positives = 396/757 (52%), Gaps = 89/757 (11%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + V+ T + G  ++ PL YL+ ID    L+DCGW++  DPS L  L KVA  IDA+LL
Sbjct: 1   MSSFVKFTAILGAHDQGPLCYLLEIDEAKLLLDCGWSESTDPSQLAALEKVARQIDALLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SH D  HLGA PYA K LGL+ PVF+T PV+ +G   M+D   ++    EF LFT DDID
Sbjct: 61  SHADLDHLGAFPYAAKHLGLTCPVFATTPVHDMGQACMHDLIQAKLNQEEFHLFTKDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           +AF   T L YSQ   L+GK +GI V+   AGH +GGT+WKI KD E+++YAVDYN RKE
Sbjct: 121 TAFAKTTILRYSQPTVLTGKCQGITVSAFSAGHTIGGTIWKIKKDTEEIVYAVDYNHRKE 180

Query: 181 KHLNGTVL---ESFVRPAVLITDAYNALHNQP-PRQQRE-MFQDAISKTLRAGGNVLLPV 235
           +HLNGTVL   ++ +RP +LITDA+N L   P PR+QR+    ++I+  L   GNVL+P 
Sbjct: 181 RHLNGTVLLSTDTLIRPTLLITDAFNTLMPDPAPRKQRDAALIESIATVLSEHGNVLIPS 240

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           DS+ RVLELL +L+ +WA H   Y + FLT  S + I+  KS LEWMGD I ++F T+R+
Sbjct: 241 DSSTRVLELLYMLDQHWAFHRYTYHLVFLTNQSQNAINLAKSTLEWMGDGIAQAF-TARE 299

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
             F  K + ++ +  ELDN   GPK+VLAS   +  GFS D+ +EW SD +N+++  +R 
Sbjct: 300 LPFEFKCLKMIHSIDELDNLM-GPKVVLASFPGMMTGFSQDLLIEWGSDPRNMIILPDRA 358

Query: 356 QFGTLARMLQAD--PPPKAVKVTMSRRVPLVGEELIAY------EEEQTRL--KKEEALK 405
           Q GTL RM+  D     K   + + ++VPLVG+EL  Y      EEE  RL    +  L 
Sbjct: 359 QPGTLGRMMFDDWFESAKMADMNLKKQVPLVGDELDEYMSKKQAEEEHARLMHSHQLGLD 418

Query: 406 ASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSV 465
            S   +      +     +  D  V D N +                    GF   + + 
Sbjct: 419 DSSDSDMSDTEEVAKPQPMQFDIYVKDVNRST-------------------GFFKQAQAF 459

Query: 466 APMFPFYENNSEWDDFGEVINPD--------------------------------DYIIK 493
             M+P +E+    DD+GE+I+ D                                 Y+++
Sbjct: 460 -KMYPVHEHRPRVDDYGELIDLDMYAKLELQHNLAPNEPEENEKVVAPVKKVVPSKYVVE 518

Query: 494 DEDMD-QAAMHIGGDDGKLDEGSA-SLILDAKPSKVVSNELTVLVHGSAEATEHLKQHCL 551
           D  +  +  M     +G+ D  S  ++I    P K+      + VHG   +T    ++C 
Sbjct: 519 DILLSLKCRMQYIDFEGRSDGKSVKNIIAQVAPRKL------LFVHGDKASTMAFAEYCR 572

Query: 552 KH--VCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTE 609
            +  +   VY P   E ++V+S    ++V L++ LM       +    I   D+  G T 
Sbjct: 573 TNESLTNEVYDPVQGECVNVSSATNLFRVVLTDTLMDEYSLSYITGV-IKLQDSVTGGT- 630

Query: 610 NGMLSLLPISTPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGP 669
             ML ++P+ T       ++VG+ K++ ++  L S+G +  FA G L   E   + K   
Sbjct: 631 RAMLEVVPVETQLTRQHVMVVGEAKLSQVRKVLDSQGFRTAFASGVLVVNEGKALIK--- 687

Query: 670 AGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
              + G  G+  + +EG +  DYYKIR  LYS   +L
Sbjct: 688 ---RSGTDGS--LALEGSISRDYYKIRELLYSTLAIL 719


>gi|255070137|ref|XP_002507150.1| predicted protein [Micromonas sp. RCC299]
 gi|255070139|ref|XP_002507151.1| predicted protein [Micromonas sp. RCC299]
 gi|226522425|gb|ACO68408.1| predicted protein [Micromonas sp. RCC299]
 gi|226522426|gb|ACO68409.1| predicted protein [Micromonas sp. RCC299]
          Length = 808

 Score =  419 bits (1076), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 285/815 (34%), Positives = 433/815 (53%), Gaps = 132/815 (16%)

Query: 2   GTSVQVTPLSGV--FNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVL 59
           GT ++ +PL GV    E+P  Y++ +DGF  L+DCGWND FD +LL+PL+KVA+ +DAVL
Sbjct: 5   GTRIKFSPLYGVQGIGEDPFCYVLDLDGFKILLDCGWNDSFDVNLLEPLAKVAAEVDAVL 64

Query: 60  LSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDI 119
           +SHPDT HLGALPYA  +LG+   V++T PV+++GL+ MYD +LSR    +F +FTLDDI
Sbjct: 65  ISHPDTEHLGALPYAFGKLGMRCKVYATLPVHKMGLMFMYDHFLSRNANEDFRVFTLDDI 124

Query: 120 DSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRK 179
           D+AF +   + Y+Q   L G G GI + P+ AGH+LGG +WK+ K+ +DV+YAV++N R+
Sbjct: 125 DTAFSAFVPVRYAQRSALVGHGAGITITPYAAGHMLGGALWKVHKETDDVVYAVNFNHRR 184

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAG 239
           EKHLNGTVLES  RPAVLITDA NA    P + +     +AI +T+R  GNVL+P+D AG
Sbjct: 185 EKHLNGTVLESIKRPAVLITDASNARRLPPSKTRENDLIEAILRTVRQDGNVLIPIDPAG 244

Query: 240 RVLELLLILEDYWAEHSLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAF 298
           RVLELLL+LE+ W++  L  Y +  LT V+ +T+++ +S LEWMG+ + + F+  R NAF
Sbjct: 245 RVLELLLVLEERWSQKQLAAYQLVLLTKVAYNTLEFARSHLEWMGEHVGQYFDRERHNAF 304

Query: 299 LLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFG 358
             +H+ L  +  E    P GPK+VLAS  SL+AG S  IFVEWA D +NL++FT+R Q G
Sbjct: 305 NTRHLKLCHSIDEFRALPQGPKVVLASFGSLDAGASRHIFVEWAPDPRNLIVFTDRLQPG 364

Query: 359 TLARMLQ--ADPPPKA---VKVTMSRRVPLVGEELIAYE-----EEQTRLKKEEALKASL 408
           +L+R +   +  PP A   +++++S+R+ LVG+EL+ ++       Q  +  + + K  +
Sbjct: 365 SLSREVCRLSQLPPGARLPLRISLSQRLKLVGDELLEWQGKEISRSQALVPIKSSTKYRV 424

Query: 409 VKE-----EESKASLGPD--------NNLSGDPMVIDANNANASADV-VEPHGGRYRDIL 454
           ++E     E  K +L           ++  G   V+D  N   +A+V +      Y ++L
Sbjct: 425 LREPKPVIESCKPNLDTQCTTMHSQASHRGGRCYVLDGINQVNNANVAIFDDESWYPNVL 484

Query: 455 IDGFVPPSTSVAPMFPFY-----ENNSEWDD--------FGEV-----INPDDYIIKDED 496
             G     T  +  F  Y     +N+    D        FG       + PD   +  ED
Sbjct: 485 DFG----ETITSETFEGYVQIGLQNDHRSGDRIEERPGEFGHTSDPGRVYPDTQFMGLED 540

Query: 497 MD------------QAAMHIGGDDGKLDEGSASLILDA-KPSKVVSNELTVLVHGSAEAT 543
                         +AA+HI   +G  D  S   IL   +P +V+      LV G+   T
Sbjct: 541 SPTKILTETHDVYLRAAVHICDFEGNSDGHSIQTILTHLEPRRVI------LVRGNPSDT 594

Query: 544 EHLKQHCLKHVC-PHVYTPQIEETIDVTSDLCAYKVQLSEKLMS---------------- 586
           + L+    K +    ++ P+  + ++  S+   ++++LS+ L+S                
Sbjct: 595 DFLRMQLQKSLLRAEIHAPKQSQMVECISENTTFRLELSQDLLSHTHMRDVAGYQVGWVE 654

Query: 587 -NVLFKKLG---------------------------------DYEIAWVDAEVGKT---- 608
            NVL  + G                                   E    DA VG      
Sbjct: 655 GNVLISRGGGDPAATLVPAKSGMICEAQRTGLQPNTGASQTATRETRTQDARVGLDFSRE 714

Query: 609 --ENGMLSLLPISTPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRC-GEYVTIR 665
             E    S L +        + LVG LK++D +  L++ G   EF GGAL C G+ V +R
Sbjct: 715 IDEQSTASELFLDELVVKKPAALVGSLKLSDSRLALAAAGCATEFRGGALMCTGDKVRVR 774

Query: 666 KVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLY 700
           K           G + +++EG LC+ ++ +R+ LY
Sbjct: 775 KTVNV------MGAENLLLEGNLCDTFFSVRSTLY 803


>gi|427789025|gb|JAA59964.1| Putative mrna cleavage and polyadenylation factor ii complex
           subunit cft2 cpsf subunit [Rhipicephalus pulchellus]
          Length = 646

 Score =  412 bits (1059), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 244/674 (36%), Positives = 368/674 (54%), Gaps = 88/674 (13%)

Query: 93  LGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAG 152
           +G + MYD + SR  + +F LFTLDD+D+AF  + +L YSQ  +L GKG+G+ + P  AG
Sbjct: 1   MGQMFMYDLFQSRHNMEDFTLFTLDDVDAAFDKIIQLKYSQTVNLKGKGQGLSITPLPAG 60

Query: 153 HLLGGTVWKITKDGE-DVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPR 211
           H++GGTVW+I KDGE D++YAVD+N +KE+HLNG  LE+  RP++LITD YNA + Q  R
Sbjct: 61  HMIGGTVWRIVKDGEEDIVYAVDFNHKKERHLNGCALETISRPSLLITDCYNANYVQARR 120

Query: 212 QQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS---LNYPIYFLTYV 267
           + R E     I +TLR  GNVL+ VD+AGRVLEL  +LE  W       + Y +  L  V
Sbjct: 121 RTRDEQLMTNILQTLRNSGNVLVAVDTAGRVLELAHMLEQLWRNQDSGLMAYSLALLNNV 180

Query: 268 SSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMA 327
           S + +++ KS +EWM D + +SFE +R+N F  +H+ L    +EL   P+ PK+VLASMA
Sbjct: 181 SYNVVEFAKSQVEWMSDKVMRSFEGARNNPFQFRHLQLCHGMAELARVPE-PKVVLASMA 239

Query: 328 SLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEE 387
            +E GFS ++F++W S  +N V+ T R   GTLAR L  +P  +++ +T+ +RV L G E
Sbjct: 240 DMECGFSRELFIQWCSSPRNSVVLTSRSAPGTLARQLIENPHQQSLTITVKKRVRLEGSE 299

Query: 388 LIAYEEEQTRLKKEEALKASLVK-EEESKASLGPDNNLSGDPMVIDANNANASADVVEPH 446
           L  Y      ++KE+ L A+  K E +++      +  S D M ID           EP 
Sbjct: 300 LEEY------MRKEKELAAARHKAERDTELDASDSSEESEDDMDIDEKKPQP-----EPK 348

Query: 447 GGRYRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKDEDMDQAAMHIGG 506
           G      +  GF   +     MFP  E   +WDD+GE+I P+D+++    +D+AA     
Sbjct: 349 GEAKSKSM--GFFKQAKKSYLMFPVKEEKIKWDDYGEIIRPEDFVV----VDKAAQEEET 402

Query: 507 DDGKLDEG--------------SASLILDAKPS-------------------KVVSNELT 533
           D+ K ++                +SL LD   S                   +++  +  
Sbjct: 403 DETKAEDDDLMQDVTEVPTKCLESSLQLDVNASLQFIDFEGRSDGESVRKIVQMMKPQRV 462

Query: 534 VLVHGSAEATEHLKQHCLKH--VCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFK 591
           +LV GS EAT+ +   C     V   V+TP+I E +D T++   Y+V+L + L+S++ F 
Sbjct: 463 ILVRGSPEATQAMAAFCRSSGSVQGRVFTPRIGEVVDATTESHIYQVKLRDSLVSSLQFA 522

Query: 592 KLGDYEIAWVDAEVGKTEN------------------GMLSLLPI-STPAPPHKSVLVGD 632
           +  + E+AW+D E+   E+                   M  L P+  +  P H ++ V +
Sbjct: 523 RAKNAELAWLDGEIATEEHLAPDGTRDETIDEDESRESMYILQPLPPSQVPGHATIFVNE 582

Query: 633 LKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDY 692
           LK++D K  L   G+Q EF+GG L C   V +R+   AG         +I IEG LCEDY
Sbjct: 583 LKLSDFKQVLLRNGVQAEFSGGVLYCNGIVAVRR-NEAG---------RINIEGCLCEDY 632

Query: 693 YKIRAYLYSQFYLL 706
           +K+R  LY Q+ ++
Sbjct: 633 FKVREILYQQYAII 646


>gi|307203591|gb|EFN82620.1| Probable cleavage and polyadenylation specificity factor subunit 2
           [Harpegnathos saltator]
          Length = 685

 Score =  410 bits (1055), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 259/775 (33%), Positives = 398/775 (51%), Gaps = 159/775 (20%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  +SG  +E+P  Y++ +D    L+DCGW+++FD   ++ L +  + IDAVLL
Sbjct: 1   MTSIIKLHAISGAMDESPPCYILQVDELRILLDCGWDENFDQDFIKELKRHVNQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+PD LHLGALPY + + GL+ P+++T PVY++G + MYD Y SR  + +FDLFTLDD+D
Sbjct: 61  SYPDPLHLGALPYLVGKCGLNCPIYATIPVYKMGQMFMYDMYQSRHNMEDFDLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L Y+Q+  + GKG G+ + P  AGH++GGT+WKI K G ED+IYAVD+N +K
Sbjct: 121 AAFDKIVQLKYNQSISMKGKGYGVTLTPLPAGHMIGGTIWKIVKVGEEDIIYAVDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG  LE   RP++LITDA+NA + Q  R+ R E     I +TLR GGNVL+ VD+A
Sbjct: 181 ERHLNGCELERLQRPSLLITDAFNATYQQARRRTRDEKLMTNILQTLRGGGNVLVSVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAF 298
           GRVLEL  +L+  W                                        ++++  
Sbjct: 241 GRVLELAHMLDQLW---------------------------------------RNKESGL 261

Query: 299 LLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFG 358
           L   + LL N            +VLAS   +E GFS ++F++W ++ +N ++ T R   G
Sbjct: 262 LAYSLALLNN------------VVLASTPDMECGFSRELFLQWCTNPQNSIILTSRTSPG 309

Query: 359 TLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASL 418
           TLAR L      + + + + RRV L G EL  Y+       K E LK   +K+E+ +   
Sbjct: 310 TLARDLVEKGGNRNITLEVKRRVKLEGIELEEYQ-------KREKLKQEQLKQEQMEI-- 360

Query: 419 GPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILID-----GFVPPSTSVAPMFPFY 472
                         A+ ++ S D +E  G R + D+L+      GF   S    PMFPF 
Sbjct: 361 --------------ADVSSESEDEIEVGGARGKHDLLVKQESKPGFFKQSKKQHPMFPFV 406

Query: 473 ENNSEWDDFGEVINPDDYIIKDE-----------DMDQAAMH----IGGD---------- 507
           E   + D++GE+I P+DY I +            +M Q  ++    I  D          
Sbjct: 407 EEKIKIDEYGEIIKPEDYKIAETLPEVEDNKENVEMKQEEINHHPEIAADIPTKCIQVSR 466

Query: 508 -------------DGKLDEGSASLIL-DAKPSKVVSNELTVLVHGSAEATEHLKQHCLKH 553
                        +G+ D  S   IL   +P +V      VLV GS++ TE L Q   + 
Sbjct: 467 AMTVNAAVTYIDFEGRSDGESLQKILAQLRPRRV------VLVRGSSKDTEILAQQA-QS 519

Query: 554 VCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKK-LGDYEIAWVDA--------- 603
               V+ P   ET+D T++   Y+V+L++ L+S + F K  GD E+AW+DA         
Sbjct: 520 AGARVFIPARGETLDATTETHIYQVRLTDALVSGLNFSKGKGDSEVAWIDAMITARDQIC 579

Query: 604 --EVGKTE---------NGMLSLLPIS-TPAPPHKSVLVGDLKMADLKPFLSSKGIQVEF 651
              +  TE         + +L+L P+     P H++  + +LK++D K  L+   I  EF
Sbjct: 580 RDAIADTEPEDAIMDESDKILTLEPLPLNEVPGHQTTFINELKLSDFKQVLNKSNISSEF 639

Query: 652 AGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
           +GG L C       +   AG         ++++EG + EDYYK+R  LY Q+ ++
Sbjct: 640 SGGVLWCCNNTIAVRRHEAG---------KVILEGCISEDYYKVRELLYEQYAIV 685


>gi|428169733|gb|EKX38664.1| hypothetical protein GUITHDRAFT_89302 [Guillardia theta CCMP2712]
          Length = 770

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 267/805 (33%), Positives = 402/805 (49%), Gaps = 134/805 (16%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + V+ TPL G  +E PL YL+ ID    L+DCGW+++FD   L+ L K+A T+DA+LL
Sbjct: 1   MSSLVKFTPLCGARSEEPLCYLLEIDEACILLDCGWDENFDVVSLRKLIKIAPTLDAILL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           +H D  HLGALPY ++   + A V++T PV ++G LTMYD   SR    +F  FTL DID
Sbjct: 61  THCDLGHLGALPYIIRNCNVKAKVYATIPVQKMGQLTMYDMVESRMAKEDFKQFTLADID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
            A+ +   L Y Q+  LSGK EGI ++P  AGH++GG +WKITK+ E+++YAVDYN  ++
Sbjct: 121 MAWDNFVVLRYQQSCSLSGKAEGITISPLNAGHMIGGALWKITKESEEIVYAVDYNHAQD 180

Query: 181 KHLNGTVLESFVRPAVLITDAY-----NALHNQPPRQQREMFQDAISKTLRAGGNVLLPV 235
           +HL+GTVL    RP +LITDAY     N L  +  R+QR +  + +   +R  GNVL+PV
Sbjct: 181 RHLDGTVLVDLPRPNILITDAYTALDKNTLGGKKAREQRLI--EHVMSAIRQDGNVLIPV 238

Query: 236 DSAGRVLELLLILEDYWAE--HSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
           DS GRVLELL++L++ W +  H     + FL+  S S ID   S  EW+   + + F  S
Sbjct: 239 DSTGRVLELLIVLDELWQQNPHLRGVTLAFLSPESRSIIDMAMSQTEWLSKHVNQRFIQS 298

Query: 294 RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLE-AGFSHDIFVEWASDVKNLVLFT 352
           R N F L++V    ++ EL   P  P++VLAS   LE + FS D+F EWA D KNLVL T
Sbjct: 299 RHNVFHLENVHRCCSREELGRLP-YPQVVLASGLDLETSSFSLDLFAEWAPDSKNLVLLT 357

Query: 353 ERGQFGTLARMLQ-----ADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALK-- 405
           ++ + G+ AR  Q       P P  + + M RRVPL G EL  +EE Q RLK  EA +  
Sbjct: 358 QKARPGSRARQFQDLMGSGLPLPSNLMLQMHRRVPLEGRELREHEE-QERLKALEARRQL 416

Query: 406 -----------------ASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGG 448
                            A  V E +    +G   +        D + +  +    +  GG
Sbjct: 417 EEEAEEAEEEEEEEEENAGAVGEAKEGEEVGKKASTPRAGKGADWSGSTPNKRHKKGRGG 476

Query: 449 RYRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKDEDMD---------- 498
             R +              MFP +E    +D++GEV++   Y+ +D+  +          
Sbjct: 477 ESRFL--------------MFPHHEEIYSFDEYGEVMDTSIYLKEDQQEEVQGFVEETIS 522

Query: 499 -----------------------------------QAAMHIGGDDGKLDEGSASLILD-A 522
                                                 M      G+ D  S   IL+  
Sbjct: 523 YSGSATSELRPVAHQLHAAAAIPTKSLTYTIRTQLNCGMAFLDYGGRSDSSSVHTILEHL 582

Query: 523 KPSKVVSNELTVLVHGSAEATEHLKQHCLKHVC--PHVYTPQIEETIDVTSDLCAYKVQL 580
           KP+KV      +++HGS +ATE L+  C++ V    + + P + E +  +SD   YK++L
Sbjct: 583 KPAKV------IVIHGSEKATEELQNFCIRKVTEPENTFAPPVGEAVMASSDTNIYKIKL 636

Query: 581 SEKLMSNVLFKKLGDYEIAWVDAEVGKTENGML---SLLPIS--------TPAPPHKS-- 627
            + L   + F ++G Y++A++DA +   +   +   S LP+         T  P  +   
Sbjct: 637 DKALAQGLQFVRVGGYDVAYIDASITCPDENSVDNSSTLPVGQNKDKQMPTLVPRQQEDG 696

Query: 628 ------VLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQ 681
                   +GD+K++DLK  L  +  + E   G L     + IRK G            +
Sbjct: 697 GGRKPFAFIGDVKLSDLKVLLEKQKYKTELKAGMLVVNGSIIIRKSG-----------SR 745

Query: 682 IVIEGPLCEDYYKIRAYLYSQFYLL 706
           ++ EG +C +Y  +R+ L SQ++ L
Sbjct: 746 MIFEGTICTEYAAVRSLLMSQYHTL 770


>gi|402591052|gb|EJW84982.1| cleavage and polyadenylation specificity factor subunit 2
           [Wuchereria bancrofti]
          Length = 809

 Score =  407 bits (1047), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 263/832 (31%), Positives = 415/832 (49%), Gaps = 149/832 (17%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  LSGV ++ PL YL+ +D   FL+DCGW++ FD + ++ + +    I+AVLL
Sbjct: 1   MTSIIKLEALSGVQDDGPLCYLLQVDQVYFLLDCGWDERFDMAYIEAVKRRVPLINAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+ D  HLGALPY +++ GL+ P+++T PVY++G + +YD   +   V +F+LF LDDID
Sbjct: 61  SYADIPHLGALPYLVRKCGLNCPIYATVPVYKMGQMFLYDWVNNHTSVEDFNLFNLDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF+ V ++ YSQ   L G   G+ + P  AGH++GG +W+ITK G E+++YAVD+N RK
Sbjct: 121 AAFERVQQVKYSQTILLKGDN-GLQITPLPAGHMIGGAIWRITKMGDEEIVYAVDFNHRK 179

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG   E   RP +LITD++NAL+NQP R+QR E     +  T+R GG+V++ +D+A
Sbjct: 180 ERHLNGCTFEGIGRPNLLITDSFNALYNQPRRKQRDEQLVTRLLGTVRDGGDVMIVIDTA 239

Query: 239 GRVLELLLILEDYW--AEHSL-NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLE+  +L+  W  AE  L  Y +  L++V+SS +++ KS +EWM D + KSFE  R 
Sbjct: 240 GRVLEIAHLLDQLWHNAEAGLMTYNLVMLSHVASSVVEFAKSQVEWMSDKVLKSFEVGRY 299

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +HV L     +L      PK+VL S   +E+G S D  +   + +  +   T   
Sbjct: 300 NPFQFRHVQLCHTHIDLMRV-RSPKVVLVSGLDMESGRSGDRTL--GARLIRMAEQTAEN 356

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAY-------EEEQTRLKKEEALKASL 408
             GT+ R L          + + RR+ L G EL  Y       E E TR++ E + + + 
Sbjct: 357 PNGTINRNL---------TLEVKRRIRLEGVELENYRAKKRAEEREATRIRLEASRRNAR 407

Query: 409 V---------------------------KEEESKASLGPDNNLSGDPMVIDANNANASAD 441
           +                           K   SK ++    + S      D + A  +  
Sbjct: 408 LEQADSSDDSDDDAVMVVPATTSGILNGKMTNSKRNIASSFSASTTISTTDLSAAQIAEQ 467

Query: 442 VVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKDEDM--DQ 499
                  ++       F   S    PMFP+ E  + WDD+GE+I P++Y+I D  +    
Sbjct: 468 RSHDIMWKWEQQQKSSFFKQSKKSFPMFPYIEEKTRWDDYGEIIRPEEYMIADTPVVPQI 527

Query: 500 AAMHIGGDDGKLDEGSASLILDAK-PSKVVSNELT------------------------- 533
              H  G D   D     L  + + PSK +S  +                          
Sbjct: 528 PPEHKDGTDSTFDGQVVPLYEEREWPSKCISQIMKMEVLCKVDFIDFEGRSDGESAKKIL 587

Query: 534 --------VLVHGSAEATEHLKQHCLKH--VCPHVYTPQIEETIDVTSDLCAYKVQLSEK 583
                   ++VHGS+ AT HL Q+  ++  V   ++TP++ E +D T +   Y+V LS+ 
Sbjct: 588 SQIKPKQLIIVHGSSAATRHLAQYAQQNGIVQGKIFTPRLGEIVDATIESHIYQVTLSDA 647

Query: 584 LMSNVLFKKLGDYEIAWVDAEV--------GKTENG------------------------ 611
           +MS+++F+ + D E++W+DA +        G+ +N                         
Sbjct: 648 VMSSLIFQTVKDAELSWLDARIVRRKTVTPGQAQNAGEENLETNGNKEEEVEEMEQDGSD 707

Query: 612 ----------------MLSLLP-ISTPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGG 654
                              L P +S   PPH++V V D K++D+K  L+S G + EF+ G
Sbjct: 708 QVEGKRLSNLKVAVADTFCLEPMLSANIPPHQAVFVNDPKLSDMKQLLASNGFRAEFSSG 767

Query: 655 ALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
            L      +IR+   AG         +  +EG  CEDYYKIR  +Y+QF ++
Sbjct: 768 VLYINNIASIRR-NEAG---------RFHVEGYACEDYYKIRDIVYAQFAVV 809


>gi|346465041|gb|AEO32365.1| hypothetical protein [Amblyomma maculatum]
          Length = 644

 Score =  407 bits (1046), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 238/666 (35%), Positives = 361/666 (54%), Gaps = 78/666 (11%)

Query: 93  LGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAG 152
           +G + MYD + SR  + +F LFTLDD+D+AF  + +L YSQ  +L GKG+G+ + P  AG
Sbjct: 1   MGQMFMYDLFQSRHNMEDFTLFTLDDVDAAFDKIIQLKYSQTVNLKGKGQGLSITPLPAG 60

Query: 153 HLLGGTVWKITKDGE-DVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPR 211
           H++GGTVW+I KDGE D++YAVD+N +KE+HLNG  LE+  RP++LITD YNA + Q  R
Sbjct: 61  HMIGGTVWRIVKDGEEDIVYAVDFNHKKERHLNGCALETISRPSLLITDCYNANYVQARR 120

Query: 212 QQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS---LNYPIYFLTYV 267
           + R E     I +TLR GGNVL+ VD+AGRVLEL  +LE  W       + Y +  L  V
Sbjct: 121 RTRDEQLMTNILQTLRNGGNVLVAVDTAGRVLELAHMLEQLWRNQDSGLMAYSLALLNNV 180

Query: 268 SSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMA 327
           S + +++ KS +EWM D + +SFE +R+N F  +H+ L    +EL   P+ PK+VLASMA
Sbjct: 181 SYNVVEFAKSQVEWMSDKVMRSFEGARNNPFQFRHLQLCHGLAELARVPE-PKVVLASMA 239

Query: 328 SLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEE 387
            +E GFS D+F++W S  +N V+ T R   GTLAR L  +P  +A+ +TM +RV L G E
Sbjct: 240 DMECGFSRDLFIQWCSSPRNSVVLTSRTAPGTLARQLIENPHQQALTITMKKRVRLEGSE 299

Query: 388 LIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHG 447
           L  Y      ++KE+ L A+  K E     L   ++       +D +       + EP G
Sbjct: 300 LEEY------MRKEKELAAARHKAERD-TELDASDSSEESEDDMDVDEKKP---LPEPKG 349

Query: 448 GRYRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKD------------- 494
                 +  GF   +     MF   E   +WDD+GEVI P+D+++ D             
Sbjct: 350 ESKAKSM--GFFKQAKKSYLMFQVKEEKIKWDDYGEVIRPEDFVVVDKTTQEEEADEAKA 407

Query: 495 ------EDMDQAAMHIGGDDGKLDEGSASLILD----------AKPSKVVSNELTVLVHG 538
                 +D+ +          +LD  ++   +D           K  +++  +  +LV G
Sbjct: 408 EDDDLTQDVTEVPTKCLESSLQLDVNASLQFIDFEGRSDGESVRKIVQMMKPQRVILVRG 467

Query: 539 SAEATEHLKQHCLKH--VCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDY 596
           S EAT+ +   C     V   V+TP++ E +D T++   Y+V+L + L+S++ F +  + 
Sbjct: 468 SPEATQAMAAFCRSSGAVQGRVFTPRMGELVDATTESHIYQVKLRDSLVSSLQFARAKNA 527

Query: 597 EIAWVDAEVGKTE------------------NGMLSLLPI-STPAPPHKSVLVGDLKMAD 637
           E+AW+D E+   E                  + M  L P+  +  P H ++ + ++K++D
Sbjct: 528 ELAWLDGEIATEEHLAPDGAQDDSLDMDEPRDSMYILQPLPPSQVPGHATIFINEIKLSD 587

Query: 638 LKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRA 697
            K  L   G+Q EF+GG L C   V +R+   AG         +I IEG LCEDY+K+R 
Sbjct: 588 FKQVLLRNGVQAEFSGGVLYCNGIVAVRR-NEAG---------RINIEGCLCEDYFKVRE 637

Query: 698 YLYSQF 703
            LY Q+
Sbjct: 638 ILYQQY 643


>gi|281208327|gb|EFA82503.1| beta-lactamase domain-containing protein [Polysphondylium pallidum
           PN500]
          Length = 738

 Score =  406 bits (1043), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 270/762 (35%), Positives = 414/762 (54%), Gaps = 80/762 (10%)

Query: 1   MGTSVQVTPLSGVFNE-NPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVL 59
           M + ++ TPLSG  NE +P  YL+ ID F  L+DCGWN   D S+L+PL  VA+ IDA+L
Sbjct: 1   MTSIIKFTPLSGGANEISPPCYLLEIDEFTILLDCGWNHSLDLSILEPLKAVANKIDAIL 60

Query: 60  LSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDI 119
           LS+PD  HLGALPYA+ +LGL+  ++ T P++++G + +YD Y +     +FD F LDD+
Sbjct: 61  LSYPDIEHLGALPYAVSKLGLTGTIYGTTPIFKMGQMFLYDLYSNHMAQEDFDRFDLDDV 120

Query: 120 DSAF--QSVTRLTYSQNYHLSGKGEG-IVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
           D  F  +    L++SQ+Y L+      I + P+ AGH++GG+VWKITK+ + +IYA+D+N
Sbjct: 121 DLCFDKKRFKELSFSQHYTLTTPSSATITITPYSAGHMIGGSVWKITKETDTIIYAIDFN 180

Query: 177 RRKEKHLNG--TVLES--FVRPAVLITDAYNALHNQPPRQQREMFQD-----AISKTLRA 227
            RKE HL G   VL+    ++P  LITDA +A    PP   + + +D      + KTLR 
Sbjct: 181 HRKEGHLEGFFPVLQGQDLLKPTHLITDARHA--RTPPTALKRIEKDKALYSTLLKTLRE 238

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHSLN--YPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GGNVLLPVD+AGR LELL  +E +WA+  L+  Y + FL  V+ +  ++ KS LE+M  +
Sbjct: 239 GGNVLLPVDTAGRSLELLQSIESHWAQQRLSGAYTVIFLNNVTYNVCEFAKSQLEFMSTA 298

Query: 286 ITKSFETSRDNAFLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHDIFVEWAS 343
               FE   +N F  K++ L  +  +L+N        +VLAS   LE+G++ ++F++WA+
Sbjct: 299 AGLKFEQRNENIFAFKNIKLCHSIYDLENLMGLSSNYVVLASGKDLESGYARELFIKWAA 358

Query: 344 DVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEA 403
           D KNL+L T+  + GTLA  L  D P ++V + + RRV L GEEL AYEEE+ R K+EE 
Sbjct: 359 DSKNLILMTDSVEEGTLASHLLNDQP-ESVTLELGRRVELEGEELRAYEEERQRQKEEER 417

Query: 404 LKASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPST 463
             A  +K+EE        N +  +P ++D    + +     P G    D+  D F     
Sbjct: 418 AAAEKLKQEEEAL-----NQMVLEPDILDDKIIDITFKK-NPFGSNRYDLTRDQFA--ME 469

Query: 464 SVAPMFPFYENNSEWDDFGEVINPDDYIIKDEDMDQAAMHIGGDDGKLDE-GSASLILDA 522
            + PMFPF E   + D++GE         +D+++ + A  +  +D ++++       ++ 
Sbjct: 470 GMQPMFPFIEKVFKVDEYGE---------QDDELLEIARKLNQEDQEMEQLDEVDEKIEE 520

Query: 523 KPSKVVSNELTVLVHGSAEATEHL-------------------------KQHCLKHVCPH 557
            P K+V   LTV +  S +  E+                           Q C+  +  H
Sbjct: 521 TPKKIVKETLTVDLKCSVQYIEYEGCSDGKSIKTIIQKIAPSKLILVRGNQDCIAELETH 580

Query: 558 V---------YTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKT 608
           V         Y P I +TID+TS+   Y V L + L+S++   KL DY+IA++ A+V   
Sbjct: 581 VKQNMRVKGLYKPIINQTIDLTSETNVYNVVLKDSLISSLASSKLMDYDIAYIQAKVILN 640

Query: 609 ENGM----LSLLPISTPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTI 664
           E  M    +  L       PH S  +GD+K+++ K  L   G QV+F  G +      T+
Sbjct: 641 ETNMKAPPVLELLAEEEIEPHNSSFIGDIKLSEFKQLLIDSGYQVQFDQGIIAVSMKTTL 700

Query: 665 RKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
             +      G  S    I I+G L ++YY++R  LY QF ++
Sbjct: 701 IYIWREEVDGNSS----IQIDGILSDEYYQVRELLYQQFQII 738


>gi|66826811|ref|XP_646760.1| beta-lactamase domain-containing protein [Dictyostelium discoideum
           AX4]
 gi|74858209|sp|Q55BS1.1|CPSF2_DICDI RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 2; AltName: Full=Cleavage and polyadenylation
           specificity factor 100 kDa subunit; Short=CPSF 100 kDa
           subunit
 gi|60474609|gb|EAL72546.1| beta-lactamase domain-containing protein [Dictyostelium discoideum
           AX4]
          Length = 784

 Score =  404 bits (1037), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 256/805 (31%), Positives = 420/805 (52%), Gaps = 120/805 (14%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + ++ T LSG  +E+P  YL+ ID F  L+DCG + + D SLL+PL KVA  IDAVLL
Sbjct: 1   MASIIKFTALSGAKDESPPCYLLEIDDFCILLDCGLSYNLDFSLLEPLEKVAKKIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SH DT H+G LPY + + GL+  ++ T PV ++G + +YD Y ++    EF  ++LD+ID
Sbjct: 61  SHSDTTHIGGLPYVVGKYGLTGTIYGTTPVLKMGTMFLYDLYENKMSQEEFQQYSLDNID 120

Query: 121 SAF--QSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRR 178
           S F       L++SQ+Y LSGKG+GI + P++AGH +G +VWKITK    ++YA+DYN R
Sbjct: 121 SCFGEDRFKELSFSQHYSLSGKGKGISITPYLAGHTIGASVWKITKGTYSIVYAIDYNHR 180

Query: 179 KEKHLNGTVLES-FVRPAVLITDAYN-----ALHNQPPRQQREMFQDAISKTLRAGGNVL 232
            E HL+   L S  ++P++LITD+       A      R Q  +F+  I++ LR GGNVL
Sbjct: 181 NEGHLDSLQLTSDILKPSLLITDSKGVDKTLAFKKTITRDQ-SLFE-QINRNLRDGGNVL 238

Query: 233 LPVDSAGRVLELLLILEDYWAEH-SLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF 290
           +PVD+AGRVLELLL +E+YW+++ SL  Y + FL   S S   + +S LE+M  + +  F
Sbjct: 239 IPVDTAGRVLELLLCIENYWSKNKSLALYSVVFLGRFSFSVCQFARSQLEFMSSTASVKF 298

Query: 291 ETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
           E + +N F  KH+ +L +  EL   PD  K++L S   LE GFS ++F++W SD K L+L
Sbjct: 299 EQNIENPFSFKHIKILSSLEELQELPDTNKVILTSSQDLETGFSRELFIQWCSDPKTLIL 358

Query: 351 FTERGQFGTLA-RMLQADPPP----KAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALK 405
           FT++    +LA ++++    P    K +++    RVPL G+EL+ YE EQ + ++E+ L+
Sbjct: 359 FTQKIPKDSLADKLIKQYSTPNGRGKCIEIVQGSRVPLTGDELLQYEMEQAKQREEKRLE 418

Query: 406 ASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVP----- 460
              +++E+ +              +++A N +    +++    + R I+ D  V      
Sbjct: 419 Q--LRKEQEEREERERLEEEEREQLLNATNQDQLQQLLQLQQQKERGIIDDSMVHMKNPF 476

Query: 461 ------------PSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKDEDMD---------- 498
                          S+  MFP++E + +W ++GE    DD I++++D            
Sbjct: 477 ENDRFDLLDSEFKKQSMITMFPYFEKHLKWGEYGE--EDDDLILRNQDKKVEEVTMEEDE 534

Query: 499 -----------------------QAAMHIGGDDGKLDEGSASLILDAKPSKVVSNELTVL 535
                                  Q   + G  DG+      ++I    P+K+      VL
Sbjct: 535 IQEQEIPKKIITQTLRLPINCKIQTIDYEGCSDGR---SIKAIIQQIAPTKL------VL 585

Query: 536 VHGSAEATEHLKQHCLKHV-CPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLG 594
           + GS + ++ ++ +  +++    +Y P I E +D+TSD   Y++ L + L++ +   K+ 
Sbjct: 586 IRGSEQQSQSIENYVKENIRTKGIYIPSIGEQLDLTSDTNVYELLLKDSLVNTLKTSKIL 645

Query: 595 DYEIAWVDAEVGKTENGMLSLLPISTPAP------------------------------- 623
           DYE++++  +V   +   + +L +    P                               
Sbjct: 646 DYEVSYIQGKVDILDGSNVPVLDLIQSIPINNNNNNNNNNNNNNNNNNNNTTMMTTTTTT 705

Query: 624 --PHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQ 681
              H    +GD+K++DLK  L + GIQV+F  G L CG  V I +    G      G   
Sbjct: 706 TNGHDESFIGDIKLSDLKQVLVNAGIQVQFDQGILNCGGLVYIWRDEDHG------GNSI 759

Query: 682 IVIEGPLCEDYYKIRAYLYSQFYLL 706
           I ++G + ++YY I+  LY QF ++
Sbjct: 760 INVDGIISDEYYLIKELLYKQFQIV 784


>gi|452822529|gb|EME29547.1| cleavage and polyadenylation specificity factor subunit 2
           [Galdieria sulphuraria]
          Length = 747

 Score =  395 bits (1016), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 252/758 (33%), Positives = 399/758 (52%), Gaps = 118/758 (15%)

Query: 1   MGTSVQVTPLSGVFNEN-PLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVL 59
           M + ++ TPL GV  E+  + YL+ ID F  L+DCGWND F+ +LL+PL ++A  +DAVL
Sbjct: 1   MSSILRFTPLYGVKTEDLAVCYLLEIDDFRILLDCGWNDRFEETLLEPLRRIAPRVDAVL 60

Query: 60  LSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDI 119
           +SHPD  HLGALPYA+ +LGL AP ++T PV+R+G L MYD + SR    +F +F LDD+
Sbjct: 61  ISHPDLFHLGALPYAVAKLGLRAPTYATLPVWRMGQLFMYDAHQSRAMQEDFQVFDLDDV 120

Query: 120 DSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRK 179
           DSAF++  +L Y Q  + S +G+GI + PH AGH++GGTVWKI  + E+++YA D+N ++
Sbjct: 121 DSAFENFIQLKYQQIVNFSERGKGITITPHPAGHMIGGTVWKIQSETEEIVYANDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNAL----------HNQPPRQQR---------EMFQDA 220
           E+HLN T L+   RP+ LI  A  AL            Q P+  +         E+ ++A
Sbjct: 181 ERHLNPTTLQYLTRPSHLIISASQALVRPSSSSSISGQQFPKGSQIYSRSNPLTEICEEA 240

Query: 221 ISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL--NYPIYFLTYVSSSTIDYVKSF 278
           +S TLR GG+V++PVD+AGRVLEL L  ED+WA   L  +Y +  + +VS +TID+ KS 
Sbjct: 241 LS-TLRQGGDVVIPVDTAGRVLELALGFEDFWATEKLGSSYAVAIIEHVSFNTIDFAKSM 299

Query: 279 LEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIF 338
           +EWM D++   F+T+R+N F LKH+ L  +     ++   PK++L S+ASLE GFS ++ 
Sbjct: 300 MEWMSDAVINKFDTTRENPFHLKHIHL-CHSRSELSSLLSPKVILTSVASLECGFSRELV 358

Query: 339 VEWASDVKNLVLFTERGQFGTLAR----MLQADPPPKAVKV-----TMSRRVPLVGEELI 389
           VE  S+ KN ++  +R +  TLA     +L+ +   K V++      ++RRVPL G EL 
Sbjct: 359 VEMVSNKKNKLILVDRLEPNTLAHSIYNVLEDESEGKTVQLPRIALRLNRRVPLQGAEL- 417

Query: 390 AYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANN-----ANASADVVE 444
              EE     K      SL+   ++ + +  +N LS               ++   D   
Sbjct: 418 ---EEYYANMKTSNESVSLL---QNPSEMHFENRLSSSTEEEQEEEDLSSMSDDEKDKAT 471

Query: 445 PHGGRYRDILIDGFVPPSTSVAPMFPFYENNSE----WDDFGEVINPDDYIIKDEDMDQA 500
            H G +      G      + + M  F     +    WDD+G VI+   ++I ++  +  
Sbjct: 472 NHFGSF-----SGESKIDKARSEMIVFSNARKQTDDIWDDYGLVIDTKCFMIGEDPGE-- 524

Query: 501 AMHIGGDDGKLDEGSASLILDAK-------------PSKV-------------------- 527
              I GD  +  E S    L+               P+K                     
Sbjct: 525 ---IEGDSEEFSETSMDDALNNPVDFRGLFQEDEQVPTKCIQVNVNLEVACQIRYVGCAG 581

Query: 528 -------------VSNELTVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDVTSDLC 574
                        V+    ++VHGS + T  +K+ C + +   ++ P+  ETID+T+D  
Sbjct: 582 LSDGRSLRQLLTAVAPRRVIIVHGSRKETAAIKEFCERGLTKDIFCPRAMETIDITTDTS 641

Query: 575 AYKVQLSEKLMSNVLFKKLGDYEIAWVDA-------------EVGKTENGMLSLLPISTP 621
            +++ L ++L+S+ ++K++GDYE++++D              E   +      L   S+ 
Sbjct: 642 IFRLTLRDRLLSSCIWKRIGDYELSFLDGTIRVENESSPKEKETNVSHTQEYVLEQRSSL 701

Query: 622 APPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCG 659
              H  V +G+ K++DL+P LS  GI  +F G ++  G
Sbjct: 702 DSGHPIVFIGEGKLSDLRPALSRVGIPSDFIGDSVSNG 739


>gi|74183852|dbj|BAE24504.1| unnamed protein product [Mus musculus]
          Length = 493

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 205/505 (40%), Positives = 306/505 (60%), Gaps = 31/505 (6%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSVDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALP+A+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPFAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPTEKVTEIELRKRVKLEGKELEEYVEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
                    S +  +  ++ ++   DV +P   + + D+++ G       F   +    P
Sbjct: 412 ---------SKEADIDSSDESDVEEDVDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYP 462

Query: 468 MFPFYENNSEWDDFGEVINPDDYII 492
           MFP  E   +WD++GE+I P+D+++
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLV 487


>gi|195503420|ref|XP_002098644.1| GE26465, isoform B [Drosophila yakuba]
 gi|194184745|gb|EDW98356.1| GE26465, isoform B [Drosophila yakuba]
          Length = 548

 Score =  394 bits (1012), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 215/561 (38%), Positives = 332/561 (59%), Gaps = 40/561 (7%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  +SG  +E+P  Y++ ID    L+DCGW++ FD + ++ L +   T+DAVLL
Sbjct: 1   MTSIIKLHTISGAMDESPPCYILQIDDVRILLDCGWDEKFDANFIKELKRQVHTLDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD  HLGALPY + +LGL+ P+++T PV+++G + MYD Y+S   + +FDLF+LDD+D
Sbjct: 61  SHPDAYHLGALPYLVGKLGLNCPIYATIPVFKMGQMFMYDLYMSHFNMGDFDLFSLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF+ +T+L Y+Q   L GKG GI + P  AGH++GGT+WKI K G ED++YA D+N +K
Sbjct: 121 TAFEKITQLKYNQTVSLKGKGYGISITPLNAGHMIGGTIWKIVKVGEEDIVYATDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HL+G  L+   RP++LITDAYNA + Q  R+ R E     I +T+R  GNVL+ VD+A
Sbjct: 181 ERHLSGCELDRLQRPSLLITDAYNAQYQQARRRARDEKLMTNILQTVRNNGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W       + Y +  L  VS + I++ KS +EWM D +TK+FE +R+
Sbjct: 241 GRVLELAHMLDQLWKNKESGLMAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L  + +++   P GPK+VLAS   LE+GF+ D+FV+WAS+  N ++ T R 
Sbjct: 301 NPFQFKHIQLCHSLADVYKLPAGPKVVLASTPDLESGFTRDLFVQWASNANNSIILTTRT 360

Query: 356 QFGTLA-RMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEES 414
             GTLA  +++   P K +++ + RRV L G EL  Y   Q      E L   +VK +  
Sbjct: 361 SPGTLAMELVENCAPGKQIELDVRRRVELEGAELEEYLRTQG-----EKLNPLIVKPDVE 415

Query: 415 KASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYEN 474
           + S     +       I+ +      D+V    GR+      GF   +     MFP++E 
Sbjct: 416 EESSSESED------DIEMSVITGKHDIVVRPEGRHH----SGFFKSNKRHHVMFPYHEE 465

Query: 475 NSEWDDFGEVINPDDYIIKD--------------EDMDQAAMHIGGD---DGKLDEGSAS 517
             + D++GE+IN DDY I D              E++ +    +G D   +G + +    
Sbjct: 466 KVKCDEYGEIINLDDYRIADATGYDFVPMEEQNKENVKKEEPGLGADQQTNGGIGDNDVQ 525

Query: 518 LILDAKPSKVVSNELTVLVHG 538
           L+   KP+K+++   T+ V+ 
Sbjct: 526 LL--EKPTKLINQRKTIEVNA 544


>gi|195574631|ref|XP_002105288.1| GD21403 [Drosophila simulans]
 gi|194201215|gb|EDX14791.1| GD21403 [Drosophila simulans]
          Length = 664

 Score =  394 bits (1012), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 242/694 (34%), Positives = 368/694 (53%), Gaps = 110/694 (15%)

Query: 93  LGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAG 152
           +G + MYD Y+S   + +FDLF+LDD+D+AF+ +T+L Y+Q   L GKG GI + P  AG
Sbjct: 1   MGQMFMYDLYMSHFNMGDFDLFSLDDVDTAFEKITQLKYNQTVSLKGKGYGISITPLNAG 60

Query: 153 HLLGGTVWKITKDGE-DVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPR 211
           H++GGT+WKI K GE D++YA D+N +KE+HL+G  L+   RP++LITDAYNA + Q  R
Sbjct: 61  HMIGGTIWKIVKVGEEDIVYATDFNHKKERHLSGCELDRLQRPSLLITDAYNAQYQQARR 120

Query: 212 QQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS---LNYPIYFLTYV 267
           + R E     I +T+R  GNVL+ VD+AGRVLEL  +L+  W       + Y +  L  V
Sbjct: 121 RARDEKLMTNILQTVRNNGNVLIAVDTAGRVLELAHMLDQLWKNKDSGLMAYSLALLNNV 180

Query: 268 SSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMA 327
           S + I++ KS +EWM D +TK+FE +R+N F  KH+ L  + +++ N P GPK+VLAS  
Sbjct: 181 SYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADVYNLPAGPKVVLASTP 240

Query: 328 SLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLA-RMLQADPPPKAVKVTMSRRVPLVGE 386
            LE+GF+ D+FV+WAS+  N ++ T R   GTLA  +++   P K +++ + RRV L G 
Sbjct: 241 DLESGFTRDLFVQWASNANNSIILTTRTSPGTLAMELVENCAPGKQIELDVRRRVDLEGA 300

Query: 387 ELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMV---IDANNANASADVV 443
           EL  Y   Q      E L   +VK         PD            I+ +      D+V
Sbjct: 301 ELEEYLRTQG-----EKLNPLIVK---------PDVEEESSSESEDDIEMSVITGKHDIV 346

Query: 444 EPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKD--------- 494
               GR+      GF   +     MFP++E   + D++GE+IN DDY I D         
Sbjct: 347 VRPEGRHH----SGFFKSNKRHHVMFPYHEEKVKCDEYGEIINLDDYRIADATGYEFVPM 402

Query: 495 -----EDMDQAAMHIGGD---DGKLDEGSASLILDAKPSKVVSNELT------------- 533
                E++ +    +G D   +G + +    L+   KP+K+++   T             
Sbjct: 403 EEQNKENVKKEEPGMGADQQANGAIVDNDVQLL--EKPTKLINQRKTIEVNAQVQRIDFE 460

Query: 534 --------------------VLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDVTSDL 573
                               +++HG+AE T+ + +HC ++V   V+TPQ  E IDVT+++
Sbjct: 461 GRSDGESMLKILSQLRPRRVIVIHGTAEGTQVVARHCEQNVGARVFTPQKGEIIDVTTEI 520

Query: 574 CAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGK-------------------TENGMLS 614
             Y+V+L+E L+S + F+K  D E+AWVD  +G                     E   L+
Sbjct: 521 HIYQVRLTEGLVSQLQFQKGKDAEVAWVDGRLGMRVKAIEAPMDVTVEQDASVQEGKTLT 580

Query: 615 LLPIS-TPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGE-YVTIRKVGPAGQ 672
           L  ++    P H SVL+ +LK++D K  L    I  EF+GG L C    + +R+V     
Sbjct: 581 LETLADDEIPIHNSVLINELKLSDFKQTLMRNNINSEFSGGVLWCSNGTLALRRVDAG-- 638

Query: 673 KGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                   ++ +EG L E+YYKIR  LY Q+ ++
Sbjct: 639 --------KVAMEGCLSEEYYKIRELLYEQYAIV 664


>gi|355680846|gb|AER96660.1| cleavage and polyadenylation specific factor 2, 100kDa [Mustela
           putorius furo]
          Length = 569

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 205/505 (40%), Positives = 306/505 (60%), Gaps = 31/505 (6%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
                    S +  +  ++ ++   D+ +P   + + D+++ G       F   +    P
Sbjct: 412 ---------SKEADIDSSDESDVEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYP 462

Query: 468 MFPFYENNSEWDDFGEVINPDDYII 492
           MFP  E   +WD++GE+I P+D+++
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLV 487


>gi|24650920|ref|NP_733264.1| cleavage and polyadenylation specificity factor 100, isoform B
           [Drosophila melanogaster]
 gi|23172526|gb|AAN14148.1| cleavage and polyadenylation specificity factor 100, isoform B
           [Drosophila melanogaster]
          Length = 664

 Score =  390 bits (1002), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 242/694 (34%), Positives = 366/694 (52%), Gaps = 110/694 (15%)

Query: 93  LGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAG 152
           +G + MYD Y+S   + +FDLF+LDD+D+AF+ +T+L Y+Q   L  KG GI + P  AG
Sbjct: 1   MGQMFMYDLYMSHFNMGDFDLFSLDDVDTAFEKITQLKYNQTVSLKDKGYGISITPLNAG 60

Query: 153 HLLGGTVWKITKDGE-DVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPR 211
           H++GGT+WKI K GE D++YA D+N +KE+HL+G  L+   RP++LITDAYNA + Q  R
Sbjct: 61  HMIGGTIWKIVKVGEEDIVYATDFNHKKERHLSGCELDRLQRPSLLITDAYNAQYQQARR 120

Query: 212 QQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS---LNYPIYFLTYV 267
           + R E     I +T+R  GNVL+ VD+AGRVLEL  +L+  W       + Y +  L  V
Sbjct: 121 RARDEKLMTNILQTVRNNGNVLIAVDTAGRVLELAHMLDQLWKNKESGLMAYSLALLNNV 180

Query: 268 SSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMA 327
           S + I++ KS +EWM D +TK+FE +R+N F  KH+ L  + +++   P GPK+VLAS  
Sbjct: 181 SYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADVYKLPAGPKVVLASTP 240

Query: 328 SLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLA-RMLQADPPPKAVKVTMSRRVPLVGE 386
            LE+GF+ D+FV+WAS+  N ++ T R   GTLA  +++   P K +++ + RRV L G 
Sbjct: 241 DLESGFTRDLFVQWASNANNSIILTTRTSPGTLAMELVENCAPGKQIELDVRRRVDLEGA 300

Query: 387 ELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMV---IDANNANASADVV 443
           EL  Y   Q      E L   +VK         PD            I+ +      D+V
Sbjct: 301 ELEEYLRTQG-----EKLNPLIVK---------PDVEEESSSESEDDIEMSVITGKHDIV 346

Query: 444 EPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKD--------- 494
               GR+      GF   +     MFP++E   + D++GE+IN DDY I D         
Sbjct: 347 VRPEGRHH----SGFFKSNKRHHVMFPYHEEKVKCDEYGEIINLDDYRIADATGYEFVPM 402

Query: 495 -----EDMDQAAMHIGGD---DGKLDEGSASLILDAKPSKVVSNELT------------- 533
                E++ +    IG +   +G + +    L+   KP+K++S   T             
Sbjct: 403 EEQNKENVKKEEPGIGAEQQANGGIVDNDVQLL--EKPTKLISQRKTIEVNAQVQRIDFE 460

Query: 534 --------------------VLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDVTSDL 573
                               +++HG+AE T+ + +HC ++V   V+TPQ  E IDVTS++
Sbjct: 461 GRSDGESMLKILSQLRPRRVIVIHGTAEGTQVVARHCEQNVGARVFTPQKGEIIDVTSEI 520

Query: 574 CAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGK-------------------TENGMLS 614
             Y+V+L+E L+S + F+K  D E+AWVD  +G                     E   L+
Sbjct: 521 HIYQVRLTEGLVSQLQFQKGKDAEVAWVDGRLGMRVKAIEAPMDVTVEQDASVQEGKTLT 580

Query: 615 LLPIS-TPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGE-YVTIRKVGPAGQ 672
           L  ++    P H SVL+ +LK++D K  L    I  EF+GG L C    + +R+V     
Sbjct: 581 LETLADDEIPIHNSVLINELKLSDFKQTLMRNNINSEFSGGVLWCSNGTLALRRVDAG-- 638

Query: 673 KGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                   ++ +EG L E+YYKIR  LY Q+ ++
Sbjct: 639 --------KVAMEGCLSEEYYKIRELLYEQYAIV 664


>gi|440797154|gb|ELR18249.1| cleavage and polyadenylation specificity factor subunit 2, putative
           [Acanthamoeba castellanii str. Neff]
          Length = 799

 Score =  387 bits (994), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 260/759 (34%), Positives = 393/759 (51%), Gaps = 127/759 (16%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M   V+ TP+ G   E P   L+ ID +  L+DCGW+D FD   L+ +      IDAVLL
Sbjct: 1   MTAIVKYTPIYGSKTEGPFCSLLEIDEYRILLDCGWDDKFDIEALENVKAYIPKIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHL                                      +  +FD++ LDD+D
Sbjct: 61  SHPDLLHL--------------------------------------KDEDFDVWNLDDVD 82

Query: 121 SAF--QSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRR 178
           +AF  +   +L YSQ+  L+G+G GI + P+V GH++GGTVWKITK+ E+++YAVDYN +
Sbjct: 83  AAFNEERFEQLKYSQHVRLTGRGAGIELTPYVGGHMIGGTVWKITKETEEILYAVDYNHK 142

Query: 179 KEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDS 237
           KE+HLN TVLE+  RP +LITDA+N L  Q  R+ R+M   D   KTL+  GNVLLP D+
Sbjct: 143 KERHLNPTVLETLNRPTLLITDAFNGLSTQSSRRSRDMDLLDTTMKTLKGDGNVLLPTDT 202

Query: 238 AGRVLELLLILEDYWAEHSLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN 296
           AGRVLELLL  + +WA + L+ Y +  L   + +TI++ KS LEWM  ++ KSF+  R N
Sbjct: 203 AGRVLELLLTFDQHWAYYRLSQYGLVLLEKQAYNTIEFAKSQLEWMSTAVQKSFDLDRVN 262

Query: 297 AFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQ 356
            F  K V L  +  EL+  P  P +VLA+ ASLE GF+ D+FVEW+S+ ++ V+FT+R Q
Sbjct: 263 PFEFKFVRLCHSVEELEALPK-PLVVLATTASLEWGFARDLFVEWSSNPRHAVIFTDRPQ 321

Query: 357 FGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALK----------- 405
            GTL  ++    PP A+ + + RRVPL G EL  + ++Q   K  + L+           
Sbjct: 322 PGTLGHLVLTQQPP-ALGLELHRRVPLEGAELREWRQKQQEEKARKLLEEQQKVHGDLCG 380

Query: 406 ASL--VKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPST 463
           ASL  ++EEE + +   + +   D + +  +    S +  + +         D F P ++
Sbjct: 381 ASLKHLQEEEKRKNEAEEIDEEEDDVSLLFHTTAHSFNPFKEN--------CDWFAPKNS 432

Query: 464 ------SVAPMFPFYENNSEWDDFGEVINPDDYI----IKDEDMDQAAMHIGGDDGKLDE 513
                  V P+FP  +   ++DD+G++I+   ++     +D  +   +++  G+ G   E
Sbjct: 433 GNYYEPQVCPLFPHEDVRQKFDDYGQMIDLQHFLHPPSQRDFPLTADSLNARGEGGDKME 492

Query: 514 GSASLILDAK-----PSKVVSNEL-----------------------TVLVHGSAEA--- 542
                   A      P+K ++ E                        T+L H +      
Sbjct: 493 TEGGEGQAAAEEEAVPTKCITVERKVEVKCTIKYIDFEGRSDGRSIKTILAHVAPRKMVL 552

Query: 543 --TEHLKQHC--LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNV--LFKKLGDY 596
              EHLK++C   + VC  VYTP   ET+D+TSD   Y+V++ E L+ ++   F K+GD 
Sbjct: 553 FHVEHLKEYCADTRTVCNSVYTPDDNETLDLTSDTNIYRVKVKEALLKSLEEEFMKVGDR 612

Query: 597 EIAWVDAEVGKT------ENGM---LSLLPISTPAPPHKSVLVGDLKMADLKPFLSSKGI 647
           E+A+V+  +  T        GM   L   P     PPH  V VG+++++D K  L+  G 
Sbjct: 613 EVAYVNGVLNPTGFAPRRGEGMELELEQAPEEI-IPPHDPVFVGEVRLSDFKDILTQHGF 671

Query: 648 QVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEG 686
           + EFA G L C   V ++K     +  G SG  +I + G
Sbjct: 672 RTEFAAGVLICNGVVMLKK-----ETEGLSGRSKISVNG 705



 Score = 75.5 bits (184), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 34/84 (40%), Positives = 51/84 (60%), Gaps = 5/84 (5%)

Query: 623 PPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQI 682
           PPH  V VG+++++D K  L+  G + EFA G L C   V ++K     +  G SG  +I
Sbjct: 721 PPHDPVFVGEVRLSDFKDILTQHGFRTEFAAGVLICNGVVMLKK-----ETEGLSGRSKI 775

Query: 683 VIEGPLCEDYYKIRAYLYSQFYLL 706
            + G LC+DY+ +R  LYSQF++L
Sbjct: 776 SVNGALCDDYFAVRDLLYSQFHIL 799


>gi|330803886|ref|XP_003289932.1| hypothetical protein DICPUDRAFT_80682 [Dictyostelium purpureum]
 gi|325079974|gb|EGC33550.1| hypothetical protein DICPUDRAFT_80682 [Dictyostelium purpureum]
          Length = 752

 Score =  384 bits (987), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 254/768 (33%), Positives = 414/768 (53%), Gaps = 78/768 (10%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MG+ V+ T LSG  NE P  YL+ ID F  L+DCG +   D SLL+PL K A  IDAVLL
Sbjct: 1   MGSIVKFTALSGGDNEKPPCYLLEIDDFCILLDCGLSYDLDFSLLEPLKKYADKIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+ D LH+G LPYA+ +LGL+  ++ T PV ++G + +YD Y ++    EFD F LD++D
Sbjct: 61  SNSDLLHIGGLPYAVGKLGLTGTIYGTTPVLKMGTMFLYDLYENKMAQEEFDQFNLDNVD 120

Query: 121 SAF--QSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRR 178
           + F       L++SQ+Y L GKG+GI + P++AGH++G +VW+ITK    +IYA+D+N R
Sbjct: 121 ACFGEDRFKELSFSQHYLLQGKGKGISITPYLAGHMVGSSVWRITKGTYSIIYALDFNHR 180

Query: 179 KEKHLNGTVLES-FVRPAVLITDAYNALHNQPPRQ---QREMFQDAISKTLRAGGNVLLP 234
            E HL+   L S  ++P++LITD+       P ++   + +   + I  +LRAGGNVLLP
Sbjct: 181 NEGHLDSLQLTSDILKPSLLITDSKGVDRTLPYKKIATRDQALLEKIHNSLRAGGNVLLP 240

Query: 235 VDSAGRVLELLLILEDYWAEHSLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
           VD+AGRVLELLL +E+YW ++ L+ Y + FL   S +   + KS LE+M  S +  FE  
Sbjct: 241 VDTAGRVLELLLCIENYWVKNRLSLYTVGFLGRFSFNVCQFAKSQLEFMSSSASVRFEQK 300

Query: 294 RDNAFLLKHVTLLINKSELDNAP--DGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
            DN F  + + +    S L+  P  + PK++L S   LE G+S D+F++W+SD KNL+LF
Sbjct: 301 IDNPFTFRQIKIF---STLEEIPETNTPKVILTSSQDLETGYSRDLFIKWSSDPKNLILF 357

Query: 352 TERGQFGTLARML------QADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALK 405
           T     G+LA  +      ++    K +++    RVPL GEEL+ YE+   + K+E+ L+
Sbjct: 358 TNYIPEGSLASKVINIASNKSSGSNKTIEIQQGSRVPLQGEELLEYEQRIAKEKEEKLLE 417

Query: 406 ASLVKEEESKASLGPDNNLSGDPMVIDANN-----ANASADVVEPHGGRYRDILIDGFVP 460
               ++EE +     +    G  M +D NN      N   +   P+G    D L   F  
Sbjct: 418 QLKKEQEEQEERERLEMEEKG--MNLDDNNDEIMITNGVNEPSLPNGTIINDSL-SNFKN 474

Query: 461 P-------------STSVAPMFPFYENNSEWDDFGEVINPDDYIIKDEDMD--------- 498
           P                +  MFP+YE + +W D+GE    +++I K+++           
Sbjct: 475 PFENKYDLSRGQFRREGMVAMFPYYEKHVKWGDYGE--EDEEFIEKNQNQKVEEVAMEED 532

Query: 499 -----------QAAMHIGGDDGKLDEGSASLILDAKPSKVVSNELT----VLVHGSAEAT 543
                          H    + K+D      I D +  K +  ++     VL+ G  + +
Sbjct: 533 EENEQEVPKKIVVTTHQCEVNCKVDTIDYEGISDGRSIKTIIQQIAPTNLVLIRGKKDQS 592

Query: 544 EHLKQHCLKHV-CPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVD 602
           ++++ +  +++    +++P I E +D+TS    Y++ L + L++ +   K+ D E++++ 
Sbjct: 593 KNIENYVKENMRTKGIFSPAINEELDLTSGTNVYELVLRDTLVNTLKPSKILDCEVSFIQ 652

Query: 603 AEVG---KTENGMLSLLPISTPAPPHKSVLVGDLKMADLKPFLSSKGI-QVEFAGGALRC 658
            +V    +  +  L ++P S     H    +GD+K+ADLK  L   GI +V+F  G + C
Sbjct: 653 GKVEYNPENNSSYLDIIP-SEQNNGHDESFIGDIKLADLKQVLVKAGIKKVQFDQGIINC 711

Query: 659 GEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
            + V I +     +  GG+    I ++G + ++YY ++  LY QF ++
Sbjct: 712 NDLVYIWR-----EDVGGNSI--INVDGIISDEYYLVKELLYRQFQIV 752


>gi|339247939|ref|XP_003375603.1| cleavage and polyadenylation specificity factor subunit 2
           [Trichinella spiralis]
 gi|316971010|gb|EFV54853.1| cleavage and polyadenylation specificity factor subunit 2
           [Trichinella spiralis]
          Length = 1188

 Score =  382 bits (980), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 248/758 (32%), Positives = 386/758 (50%), Gaps = 124/758 (16%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + ++   LSGV +++P  Y++ +  F+F++DCGW+  F+   ++   K A  IDAVLL
Sbjct: 1   MTSLIRFEALSGVMDDSPPCYVLEVGEFHFMLDCGWDSSFNMDFIERAQKWAPRIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+PD  H+GALPY + + GLS P+++T PVYR+G + +YD Y S +   +F +F+LDD+D
Sbjct: 61  SYPDIAHIGALPYLVGKCGLSCPIYATVPVYRMGQMFLYDWYQSFQNYEDFQIFSLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED-VIYAVDYNRRK 179
             F  V ++ Y+Q   + G+G G+ + P  AGH++GGT+W+ITK GE+ ++YAVD+N +K
Sbjct: 121 QVFDKVLQVKYNQQVSMKGRGHGLQIVPLPAGHMIGGTIWRITKMGEEEIVYAVDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG  LES  RP +LITDAY        R+ R E     I KTLR+GGNVL+ VD+A
Sbjct: 181 ERHLNGCPLESIARPNLLITDAYMCGTALLRRKFRDEALLSTILKTLRSGGNVLIVVDTA 240

Query: 239 GRVLELLLILEDYW--AEHSLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL+ +L+  W  AE  L  Y + F+  V+ + +++ KS +EWM + + + FE  R 
Sbjct: 241 GRVLELVQLLDQLWHNAEAGLLLYSLIFMNSVAFNVVEFAKSQVEWMSERMLRMFEEGRS 300

Query: 296 NAFLLKHVTLLINKSELD-----------NAPDGPKLVLASMASLEAGFSHDIFVEWASD 344
           N F  +H  L  + +EL            +A    K+VLAS   L++GFS ++F++W  D
Sbjct: 301 NPFQFRHAQLCHSLAELTRLRSPKVLSFRDAFFSDKVVLASQPDLDSGFSRELFLDWCID 360

Query: 345 VKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEAL 404
            KN ++ T R + G+L   L          + M      +G + I  + ++      E +
Sbjct: 361 AKNCIILTSRARIGSLCSKL----------IEMVSSPERIGTKQITVQVKRRFDDYGEVI 410

Query: 405 KASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDG--FVPPS 462
            A    + E+K  +    +L  D M  D  N      V  P  G  +DI      FV   
Sbjct: 411 HAKSYLQLETKVRM---VDLMRDRMGEDQENG-----VTTP--GEVQDIPTKCIQFVQTV 460

Query: 463 TSVAPMFPFYENNSEWDDFGEVINPDDYIIKDEDMDQAAMHIGGDDGKLDEGSASLILD- 521
              A +        E+ DF                          +G+ D  S   IL  
Sbjct: 461 EVFAQL--------EFIDF--------------------------EGRTDVDSLKKILQM 486

Query: 522 AKPSKVVSNELTVLVHGSAEATEHLKQHCLKHVC---PHVYTPQIEETIDVTSDLCAYKV 578
           +KP +++      LVHG AE TE L  +C K +      V+TP++ + +D T +   Y++
Sbjct: 487 SKPKQII------LVHGMAEQTEKLANYCRKSLNMAEDKVFTPRLGDLVDATIESHMYQL 540

Query: 579 QLSEKLMSNVLFKKLGDYEIAWVDA-------------------------------EVGK 607
           +L++ L++++ F  + D EIAWV+                                ++G 
Sbjct: 541 KLTDALLNSLKFIHVKDVEIAWVNGLIKHNCSEEETEDQKIAAMDVDDEKNAENAVDIGS 600

Query: 608 TENGMLSLLPISTPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKV 667
                L LLP S+  P H +V VGD K++DLK  L   G Q EF+ G L     ++IRK 
Sbjct: 601 DNIPYLDLLP-SSEIPSHDAVFVGDPKLSDLKQALMLDGFQAEFSHGVLVVNNVLSIRKR 659

Query: 668 GPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYL 705
                        Q+ +EG +C+DYY IR   ++ ++ 
Sbjct: 660 ADG----------QLHVEGIVCKDYYAIRDQFHANYFF 687


>gi|167535876|ref|XP_001749611.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163772003|gb|EDQ85662.1| predicted protein [Monosiga brevicollis MX1]
          Length = 770

 Score =  380 bits (976), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 248/785 (31%), Positives = 397/785 (50%), Gaps = 100/785 (12%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M   V+V  LSGV +E+P  YL+ +DG   L+DCGW++HFD + L  L+KVASTID VLL
Sbjct: 1   MAFIVRVEALSGVLDESPPCYLLELDGVRILLDCGWSEHFDTTQLDALAKVASTIDLVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S PD  HLGALPYA ++LGL+ P ++T P+ +LGLL +YD + +R +  +F+ F+LD ID
Sbjct: 61  SQPDIHHLGALPYAYEKLGLTCPCYATLPIKQLGLLFLYDAFQARMEQEDFETFSLDGID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
            +F ++T + YSQ   ++G   GI +    AGH+LGGTVW+ITKD EDV+YA++YN R E
Sbjct: 121 ESFANITSVKYSQAIEVAGT--GITLLALQAGHMLGGTVWRITKDDEDVVYALNYNHRSE 178

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQ--PPRQQREMFQDAISKTLRAGGNVLLPVDSA 238
           +HL   V +   RP++LIT A NA       P+++         +T+R+ G +++  D+A
Sbjct: 179 RHLRPAVFQLLTRPSLLITGARNASTEMVLKPKEREAKLLSLAEQTMRSDGTMVVVADTA 238

Query: 239 GRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN 296
           GR LEL+ + E +W ++     YP++FL++ S + +++ ++ +E+M D +    +T   N
Sbjct: 239 GRTLELVQLFESHWNDNPGLKTYPVFFLSHNSYNVLEFAQTLIEFMSDKMLVKLQTMTHN 298

Query: 297 AFLLKHVTLLINKSELDNA--PDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
            F   ++     +  +D      G K+V+   +SLEAGF  ++    A + +N  LF  R
Sbjct: 299 PFACPNIKC---QKTVDGVMRSAGAKVVIVPHSSLEAGFGRELLFRLAGEARNRFLFIAR 355

Query: 355 GQFGTL-ARMLQADPPPKAVKVTMSRRVPLVGEELIAYEE---EQTRLKKEEALKASLVK 410
               +L AR+L        ++     RV L GEEL AY +   E+ + +KE+AL  +  +
Sbjct: 356 PPPHSLGARLLAKSGQIHTIQFEHRFRVQLEGEELKAYRQHKAEEAKQQKEDALAQARAE 415

Query: 411 ----EEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVA 466
                 +S+     D++++  PM +     +  A    P   R +D          T+  
Sbjct: 416 GTFVGSDSEDDEDEDDHVADLPMRLPGTQPSIDAVHHTPQQTRAKDRTFRSRRQALTT-- 473

Query: 467 PMFPFYEN---------------NSEWDDFGEVINPDDYIIKDEDMDQAAMHIGGD---- 507
             FPF  N                 EWDD+G   + +   + D  +         D    
Sbjct: 474 --FPFQSNKVVRASTYDSFMGAQKVEWDDYGMTFDREKLKLLDSHLATGLEAPAADEADK 531

Query: 508 ---DGKLD----EGSASLILDAKPSKVVSNE----------------------------- 531
              D  L+    E +AS+    +PSKVV+ +                             
Sbjct: 532 PAEDSNLEAMQAELTASIQEAERPSKVVAQQRDLSVRCQVEYLDLEGLSDRESMLNILER 591

Query: 532 ----LTVLVHGSAEATEHLKQHCLKHV--CPHVYTPQIEETIDVTSDLCAYKVQLSEKLM 585
                 VL+HG+ + TE L   C+  +     +  P+  E +D+  +   ++++L + L+
Sbjct: 592 MRPRFLVLLHGTEDETEELADSCVHKLRDLERIVMPKRFERVDIAGERNIFQLRLRDALV 651

Query: 586 SNVLFKKLGDYEIAWVDAEVGKTE-------NGMLSLLPISTPAPPHKSVLVGDLKMADL 638
           S++ F + G+Y+IAW+D  +  TE          L  L  +T A  H +V VGD++++ L
Sbjct: 652 SSLKFSEAGEYKIAWIDGVLAHTEGDETSSKRAKLPQLEAATEAAEHNAVFVGDIRLSQL 711

Query: 639 KPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAY 698
           K  L +  ++V +    L C   V + K        GGS +    I+GPLCE YYK+R  
Sbjct: 712 KTVLENHQVEVSWWVEKLVCNNQVVVGK-----DPLGGSFS----IDGPLCETYYKVREL 762

Query: 699 LYSQF 703
           LY QF
Sbjct: 763 LYQQF 767


>gi|74194185|dbj|BAE24650.1| unnamed protein product [Mus musculus]
          Length = 396

 Score =  376 bits (966), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 184/396 (46%), Positives = 261/396 (65%), Gaps = 6/396 (1%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSVDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALP+A+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPFAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAY 391
             GTLAR L  +P  K  ++ + +RV L G+EL  Y
Sbjct: 360 TPGTLARFLIDNPTEKVTEIELRKRVKLEGKELEEY 395


>gi|74188762|dbj|BAE28111.1| unnamed protein product [Mus musculus]
          Length = 412

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 183/393 (46%), Positives = 260/393 (66%), Gaps = 6/393 (1%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSVDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALP+A+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPFAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEEL 388
             GTLAR L  +P  K  ++ + +RV L G+EL
Sbjct: 360 TPGTLARFLIDNPTEKVTEIELRKRVKLEGKEL 392


>gi|172087214|ref|XP_001913149.1| cleavage and polyadenylation factor [Oikopleura dioica]
 gi|18029276|gb|AAL56454.1| cleavage and polyadenylation factor-like protein [Oikopleura
           dioica]
          Length = 765

 Score =  369 bits (948), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 239/784 (30%), Positives = 390/784 (49%), Gaps = 97/784 (12%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + V+   LSG  +E P  +L+ ID F FL+DCGW +     ++  L +    IDA+L+
Sbjct: 1   MTSIVKFQSLSGFDDEAPHCHLLQIDDFKFLLDCGWAEQHHEKIIDGLKRHGRQIDAILI 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LH G LPY + +LG++ P++ T P  ++G + +YD  LSR  V +FD+FTLDD+D
Sbjct: 61  SHPDLLHCGMLPY-LSKLGITCPIYMTMPACKMGQMFLYDFVLSRTAVEDFDMFTLDDVD 119

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKD-GEDVIYAVDYNRRK 179
           + F   T+L ++Q   + G+  GI + P  AGH++GGT WKI KD  E+ +Y VD N ++
Sbjct: 120 AVFDRATQLKHNQTEAVRGQDYGIQIMPVQAGHMIGGTTWKIMKDEEEEYVYCVDVNHKR 179

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  L++F +P ++ITD     + Q  R +R E     I  T   GGNVL+  D+A
Sbjct: 180 ETHLNGIQLDAFDKPTLMITDCSTYGYQQERRAKRTERLVQRIQNTTSKGGNVLITTDTA 239

Query: 239 GRVLELLLILEDYWAEHSL---NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GR LE+ L+LE  W +         +  ++ V++STI+  K  +EWM + I   F   R+
Sbjct: 240 GRSLEMALMLEGIWNDERYGLGRVNLVMVSNVATSTIEAAKGMIEWMSEKIISKFTHKRE 299

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F L  + L  +  E+   P+ PK++LA+   ++ GFS ++FV  A+  KN V+ + R 
Sbjct: 300 NIFDLTKMKLRSSIQEIARIPE-PKVILATPMDMDTGFSRELFVMMAAHPKNAVIMSGRS 358

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             G+L R +  +    ++ + M++R+PLVG EL  YE+++ + +    +K  L +E   +
Sbjct: 359 TKGSLCRKIIENEGMSSITLEMNKRLPLVGPELEEYEKQKEQERNANLIK-RLEEESSDE 417

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVA-PMFPFYEN 474
           +       +S     +     +   D++ PH  + ++    GF   +     P+FPF EN
Sbjct: 418 SENEMSETISVRKKTVKGKRTH---DIIMPHHVQKKE---GGFFKKARKEKFPLFPFNEN 471

Query: 475 NSEWDDFGEVINPDDYI------------IKDEDMDQAAMHIGG---DDGKLDEGSASLI 519
             +WDD+GE+INPDDY             I +   +Q ++  G    +D +  +    + 
Sbjct: 472 RIKWDDYGEIINPDDYKTHELIPESEPVNINNLTENQQSVTFGRHKPNDSRKKQKEEPVE 531

Query: 520 LDAKPSKVVSNELTVLVHGSAE-----------------------------ATEHLKQHC 550
            +  P+K +     V +  S E                               E  K+  
Sbjct: 532 EEKAPTKCIKTREQVSIRCSIEFINFEGRVDGESQLQLLSTIKPKELILIRTKEKYKEKL 591

Query: 551 LKHVCPHV-----YTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLG--DYEIAWVDA 603
            K +   V     + P   E ID T +   Y+++L + L+SN+ F ++G  D E+A +  
Sbjct: 592 FKDIKSRVQGIRIHMPVHHELIDATKESFIYQLKLKDSLLSNLNFVRVGSKDIEVARIRG 651

Query: 604 EVG--------KTENG------------MLSLLPISTP-APPHKSVLVGDLKMADLKPFL 642
            V         + ENG            + +L P++   +  H S+ + D K+ +LK  L
Sbjct: 652 RVDYFGGRLELEAENGENDEPKKLEIDDIPTLQPVTNNYSSGHDSIFINDTKLTELKSNL 711

Query: 643 SSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQ 702
              G+Q EF GG L C   V+I++          S    I +EG L EDY+ +R  +Y  
Sbjct: 712 IDCGMQAEFIGGNLVCNNKVSIKR----------SANGVIQVEGTLSEDYFIVRKMVYDN 761

Query: 703 FYLL 706
           + ++
Sbjct: 762 YAIV 765


>gi|410962841|ref|XP_003987977.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 [Felis catus]
          Length = 690

 Score =  366 bits (939), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 234/726 (32%), Positives = 365/726 (50%), Gaps = 148/726 (20%)

Query: 93  LGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAG 152
           +G + MYD Y SR    +F LFTLDD+D+AF  + +L +SQ  +L GKG G+ + P  AG
Sbjct: 1   MGQMFMYDLYQSRHNTEDFTLFTLDDVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAG 60

Query: 153 HLLGGTVWKITKDG-EDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPR 211
           H++GGT+WKI KDG E+++YAVD+N ++E HLNG  LE   RP++LITD++NA + QP R
Sbjct: 61  HMIGGTIWKIVKDGEEEIVYAVDFNHKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRR 120

Query: 212 QQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIY---FLTYV 267
           +QR E     + +TLR  GNVL+ VD+AGRVLEL  +L+  W        +Y    L  V
Sbjct: 121 KQRDEQLLTNVLETLRGDGNVLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNV 180

Query: 268 SSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMA 327
           S + +++ KS +EWM D + + FE  R+N F  +H++L    S+L   P  PK+VLAS  
Sbjct: 181 SYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVP-SPKVVLASQP 239

Query: 328 SLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEE 387
            LE GFS D+F++W  D KN ++ T R   GTLAR L  +P  K  ++ + +RV L G+E
Sbjct: 240 DLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEGKE 299

Query: 388 LIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHG 447
           L  Y E++   K+                        S +  +  ++ ++   D+ +P  
Sbjct: 300 LEEYLEKEKLKKEAAKKLEQ-----------------SKEADIDSSDESDVEEDIDQPSA 342

Query: 448 GRYR-DILIDG-------FVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIK------ 493
            + + D+++ G       F   +    PMFP  E   +WD++GE+I P+D+++       
Sbjct: 343 HKTKHDLMMKGEGSRKGSFFKQAKKSYPMFPAPEERIKWDEYGEIIKPEDFLVPELQATE 402

Query: 494 -------------DEDMDQ-------------AAMHIGGD------DGKLDEGSASLILD 521
                        DE MDQ              ++ I         +G+ D  S   I++
Sbjct: 403 EEKSKLESGLTNGDEPMDQDLSDVPTKCISTTESIEIKARVTYIDYEGRSDGDSIKKIIN 462

Query: 522 A-KPSKVVSNELTVLVHGSAEATEHLKQHCL----KHVCPHVYTPQIEETIDVTSDLCAY 576
             KP ++      ++VHG  EA++ L + C     K +   VY P++ ET+D TS+   Y
Sbjct: 463 QMKPRQL------IIVHGPPEASQDLAECCRAFGGKDI--KVYMPKLHETVDATSETHIY 514

Query: 577 KVQLSEKLMSNVLFKKLGDYEIAWVDA----EVGKTENGML------------------- 613
           +V+L + L+S++ F K  D E+AW+D      V K + G++                   
Sbjct: 515 QVRLKDSLVSSLQFCKAKDAELAWIDGVLDMRVSKVDTGVILEEGELKDDGEDSEMQVDA 574

Query: 614 ----------------------------SLLPISTPAPP-----HKSVLVGDLKMADLKP 640
                                        ++P   P PP     H+SV + + +++D K 
Sbjct: 575 PSDSSVLAQQKAMKSLFGDDEKETGEESEIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQ 634

Query: 641 FLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLY 700
            L  +GIQ EF GG L C   V +R+          + T +I +EG LC+D+Y+IR  LY
Sbjct: 635 VLLREGIQAEFVGGVLVCNNQVAVRR----------TETGRIGLEGCLCQDFYRIRDLLY 684

Query: 701 SQFYLL 706
            Q+ ++
Sbjct: 685 EQYAIV 690


>gi|426377790|ref|XP_004055637.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 [Gorilla gorilla gorilla]
 gi|193785772|dbj|BAG51207.1| unnamed protein product [Homo sapiens]
          Length = 690

 Score =  365 bits (938), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 234/726 (32%), Positives = 365/726 (50%), Gaps = 148/726 (20%)

Query: 93  LGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAG 152
           +G + MYD Y SR    +F LFTLDD+D+AF  + +L +SQ  +L GKG G+ + P  AG
Sbjct: 1   MGQMFMYDLYQSRHNTEDFTLFTLDDVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAG 60

Query: 153 HLLGGTVWKITKDG-EDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPR 211
           H++GGT+WKI KDG E+++YAVD+N ++E HLNG  LE   RP++LITD++NA + QP R
Sbjct: 61  HMIGGTIWKIVKDGEEEIVYAVDFNHKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRR 120

Query: 212 QQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIY---FLTYV 267
           +QR E     + +TLR  GNVL+ VD+AGRVLEL  +L+  W        +Y    L  V
Sbjct: 121 KQRDEQLLTNVLETLRGDGNVLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNV 180

Query: 268 SSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMA 327
           S + +++ KS +EWM D + + FE  R+N F  +H++L    S+L   P  PK+VLAS  
Sbjct: 181 SYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVP-SPKVVLASQP 239

Query: 328 SLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEE 387
            LE GFS D+F++W  D KN ++ T R   GTLAR L  +P  K  ++ + +RV L G+E
Sbjct: 240 DLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEGKE 299

Query: 388 LIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHG 447
           L  Y E++   K+                        S +  +  ++ ++   D+ +P  
Sbjct: 300 LEEYLEKEKLKKEAAKKLEQ-----------------SKEADIDSSDESDIEEDIDQPSA 342

Query: 448 GRYR-DILIDG-------FVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIK------ 493
            + + D+++ G       F   +    PMFP  E   +WD++GE+I P+D+++       
Sbjct: 343 HKTKHDLMMKGEGSRKGSFFKQAKKSYPMFPAPEERIKWDEYGEIIKPEDFLVPELQATE 402

Query: 494 -------------DEDMDQ-------------AAMHIGGD------DGKLDEGSASLILD 521
                        DE MDQ              ++ I         +G+ D  S   I++
Sbjct: 403 EEKSKLESGLTNGDEPMDQDLSDVPTKCISTTESIEIKARVTYIDYEGRSDGDSIKKIIN 462

Query: 522 A-KPSKVVSNELTVLVHGSAEATEHLKQHCL----KHVCPHVYTPQIEETIDVTSDLCAY 576
             KP ++      ++VHG  EA++ L + C     K +   VY P++ ET+D TS+   Y
Sbjct: 463 QMKPRQL------IIVHGPPEASQDLAECCRAFGGKDI--KVYMPKLHETVDATSETHIY 514

Query: 577 KVQLSEKLMSNVLFKKLGDYEIAWVDA----EVGKTENGML------------------- 613
           +V+L + L+S++ F K  D E+AW+D      V K + G++                   
Sbjct: 515 QVRLKDSLVSSLQFCKAKDAELAWIDGVLDMRVSKVDTGVILEEGELKDDGEDSEMQVEA 574

Query: 614 ----------------------------SLLPISTPAPP-----HKSVLVGDLKMADLKP 640
                                        ++P   P PP     H+SV + + +++D K 
Sbjct: 575 PSDSSVIAQQKAMKSLFGDDEKETGEESEIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQ 634

Query: 641 FLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLY 700
            L  +GIQ EF GG L C   V +R+          + T +I +EG LC+D+Y+IR  LY
Sbjct: 635 VLLREGIQAEFVGGVLVCNNQVAVRR----------TETGRIGLEGCLCQDFYRIRDLLY 684

Query: 701 SQFYLL 706
            Q+ ++
Sbjct: 685 EQYAIV 690


>gi|313232558|emb|CBY19228.1| unnamed protein product [Oikopleura dioica]
          Length = 764

 Score =  365 bits (937), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 238/784 (30%), Positives = 389/784 (49%), Gaps = 98/784 (12%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + V+   LSG  +E P  +L+ ID F FL+DCGW +     ++  L +    IDA+L+
Sbjct: 1   MTSIVKFQSLSGFDDEAPHCHLLQIDDFKFLLDCGWAEQHHEKIIDGLKRHGRQIDAILI 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LH G LPY + +LG++ P++ T P  ++G + +YD  LSR  V +FD+FTLDD+D
Sbjct: 61  SHPDLLHCGMLPY-LSKLGITCPIYMTMPACKMGQMFLYDFVLSRTAVEDFDMFTLDDVD 119

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKD-GEDVIYAVDYNRRK 179
           + F   T+L ++Q   + G+  GI + P V GH++GGT WKI KD  E+ +Y VD N ++
Sbjct: 120 AVFDRATQLKHNQTEAVRGQDYGIQIMP-VQGHMIGGTTWKIMKDEEEEYVYCVDVNHKR 178

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  L++F +P ++ITD     + Q  R +R E     I  T   GGNVL+  D+A
Sbjct: 179 ETHLNGIQLDAFDKPTLMITDCSTYGYQQERRAKRTERLVQRIQNTTSKGGNVLITTDTA 238

Query: 239 GRVLELLLILEDYWAEHSL---NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GR LE+ L+LE  W +         +  ++ V++STI+  K  +EWM + I   F   R+
Sbjct: 239 GRSLEMALMLEGIWNDERYGLGRVNLVMVSNVATSTIEAAKGMIEWMSEKIISKFTHKRE 298

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F L  + L  +  E+   P+ PK++LA+   ++ GFS ++FV  A+  KN V+ + R 
Sbjct: 299 NIFDLTKMKLRSSIQEIARIPE-PKVILATPMDMDTGFSRELFVMMAAHPKNAVIMSGRS 357

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             G+L R +  +    ++ + M++R+PLVG EL  YE+++ + +    +K  L +E   +
Sbjct: 358 TKGSLCRKIIENEGMSSITLEMNKRLPLVGPELEEYEKQKEQERNANLIK-RLEEESSDE 416

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVA-PMFPFYEN 474
           +       +S     +     +   D++ PH  + ++    GF   +     P+FPF EN
Sbjct: 417 SENEMSETISVRKKTVKGKRTH---DIIMPHHVQKKE---GGFFKKARKEKFPLFPFNEN 470

Query: 475 NSEWDDFGEVINPDDYI------------IKDEDMDQAAMHIGG---DDGKLDEGSASLI 519
             +WDD+GE+INPDDY             I +   +Q ++  G    +D +  +    + 
Sbjct: 471 RIKWDDYGEIINPDDYKTHELIPESEPVNINNLTENQQSVTFGRHKPNDSRKKQKEEPVE 530

Query: 520 LDAKPSKVVSNELTVLVHGSAE-----------------------------ATEHLKQHC 550
            +  P+K +     V +  S E                               E  K+  
Sbjct: 531 EEKAPTKCIKTREQVSIRCSIEFINFEGRVDGESQLQLLSTIKPKELILIRTKEKYKEKL 590

Query: 551 LKHVCPHV-----YTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLG--DYEIAWVDA 603
            K +   V     + P   E ID T +   Y+++L + L+SN+ F ++G  D E+A +  
Sbjct: 591 FKDIKSRVQGIRIHMPVHHELIDATKESFIYQLKLKDSLLSNLNFVRVGSKDIEVARIRG 650

Query: 604 EVG--------KTENG------------MLSLLPISTP-APPHKSVLVGDLKMADLKPFL 642
            V         + ENG            + +L P++   +  H S+ + D K+ +LK  L
Sbjct: 651 RVDYFGGRLELEAENGENDEPKKLEIDDIPTLQPVTNNYSSGHDSIFINDTKLTELKSNL 710

Query: 643 SSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQ 702
              G+  EF GG L C   V+I++          S    I +EG L EDY+ +R  +Y  
Sbjct: 711 IDCGMHAEFIGGNLVCNNKVSIKR----------SANGVIQVEGTLSEDYFIVRKMVYDN 760

Query: 703 FYLL 706
           + ++
Sbjct: 761 YAIV 764


>gi|393910520|gb|EJD75913.1| cleavage and polyadenylation specificity factor subunit 2, variant
           [Loa loa]
          Length = 664

 Score =  363 bits (933), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 222/636 (34%), Positives = 340/636 (53%), Gaps = 91/636 (14%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  LSGV ++ PL YL+ +D   FL+DCGW++ FD + ++ + +    I+AVLL
Sbjct: 1   MTSIIKLEALSGVQDDGPLCYLLQVDQVYFLLDCGWDERFDMAYIEAVKRRVPLINAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           S+ D  HLGALPY +++ GL+ P+++T PVY++G + +YD   +   V +F+LF LDDID
Sbjct: 61  SYADIPHLGALPYLVRKCGLNCPIYATVPVYKMGQMFLYDWVNNHTSVEDFNLFNLDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF+ V ++ YSQ   L G   G+ + P  AGH++GG +W+ITK G E+++YAVD+N RK
Sbjct: 121 AAFERVQQVKYSQTILLKGDN-GLQITPLPAGHMIGGAIWRITKMGDEEIVYAVDFNHRK 179

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG   E   RP +LITD++NAL+NQP R+QR E     +  T+R GG+V++ +D+A
Sbjct: 180 ERHLNGCTFEGIGRPNLLITDSFNALYNQPRRKQRDEQLVTRLLGTVRDGGDVMIVIDTA 239

Query: 239 GRVLELLLILEDYW--AEHSL-NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLE+  +L+  W  AE  L  Y +  L++V+SS +++ KS +EWM D + KSFE  R 
Sbjct: 240 GRVLEIAHLLDQLWHNAEAGLMTYNLVMLSHVASSVVEFAKSQVEWMSDKVLKSFEVGRY 299

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +HV L     +L      PK+VL S   +E+GFS ++F+EW +D+KN V+ T R 
Sbjct: 300 NPFQFRHVQLCHTHIDLLRV-RSPKVVLVSGLDMESGFSRELFLEWCTDIKNSVIVTGRS 358

Query: 356 QFGTL-ARML----QADPPP-----KAVKVTMSRRVPLVGEELIAY-------EEEQTRL 398
              TL AR++    QA   P     + + + + RR+ L G EL  Y       E E TR+
Sbjct: 359 GDRTLGARLIRMAEQAAENPNGTINRNLTLEVKRRIRLEGAELENYRAKKRAEEREATRI 418

Query: 399 KKEEALKASLVKEE----------------------ESKASLGPDNNLSGDPMVIDANNA 436
           + E + + + +++                         K +    N  S   +      A
Sbjct: 419 RLEASRRNARLEQADSSDDSDDDAVMVVPATTSGVLNGKMTNSKRNVTSSFSVSTTTTTA 478

Query: 437 NASADVVEPHGGRYRDILI-------DGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDD 489
           + SA  +     R  DI+          F   S    PMFP+ E  + WDD+GE+I P++
Sbjct: 479 DMSAAQIAEQ--RSHDIMWKWEQQQKSSFFKQSKKSFPMFPYIEEKTRWDDYGEIIRPEE 536

Query: 490 YIIKDEDM--DQAAMHIGGDDGKLDEGSASLILDAK-PSKVVSNELT------------- 533
           Y+I D  +       H  G DG  D     L  + + PSK +S  +              
Sbjct: 537 YMIADTPVVPQIPPEHKDGADGTFDGQVVPLYEEREWPSKCISQIMKMEVLCKVDFIDFE 596

Query: 534 --------------------VLVHGSAEATEHLKQH 549
                               ++VHGS+ AT HL Q+
Sbjct: 597 GRSDGESAKKILSQIKPKQLIIVHGSSAATRHLAQY 632


>gi|47224566|emb|CAG03550.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 765

 Score =  362 bits (928), Expect = 5e-97,   Method: Compositional matrix adjust.
 Identities = 253/831 (30%), Positives = 390/831 (46%), Gaps = 197/831 (23%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T +SGV  E+ L YL+ +D F FL+DCGW+++F   ++  + +    +DAVLL
Sbjct: 1   MTSIIKLTAVSGVQEESALCYLLQVDEFRFLLDCGWDENFSMDIIDAMKRYVHQVDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD +HLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPIHLGALPYAVGKLGLNCTIYATIPVYKMGQMFMYDLYQSRNNSEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQ------------------------NYHLSGKGEGIVVAPHVAGHLLG 156
           SAF  + +L YSQ                         ++ +GKG G+ + P  AGH++G
Sbjct: 121 SAFDKIQQLKYSQIVSLKGKLASKRLFTWSKLPKYVMAFYATGKGHGLSITPLPAGHMIG 180

Query: 157 GTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM 216
           GT+WKI KD    +           H    V+  ++     +   YN + +   R     
Sbjct: 181 GTIWKIVKDVTSTV----------AHWRALVVLPYLSQTPSMQHMYNHVASSGTRCS--- 227

Query: 217 FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVK 276
               I +T  AG  V                           YP+  L  VS + +++ K
Sbjct: 228 ---LIWRTKDAGLGV---------------------------YPLALLNNVSYNVVEFSK 257

Query: 277 SFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHD 336
           S +EWM D + + FE  R+N F  +H+TL  + ++L   P  PK+VL S   LE+GFS +
Sbjct: 258 SQVEWMSDKLMRCFEDKRNNPFQFRHLTLCHSLADLARVP-SPKVVLCSQPDLESGFSRE 316

Query: 337 IFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQT 396
           +F++W+ D KN ++ T R   GTLAR L  +P  K + + + +RV L G EL  Y  E+ 
Sbjct: 317 LFIQWSKDSKNSIILTYRTTPGTLARYLIDNPGEKHLDLEVRKRVRLEGRELEEY-LEKD 375

Query: 397 RLKKEEALKASLVKE---EESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDI 453
           R+KKE A K    KE   + S  S   D++    P  + + + +    +++  G R    
Sbjct: 376 RIKKEAAKKLEQAKEVDVDSSDESDMDDDDDLDQPTTVKSKHHDL---MMKSEGSRK--- 429

Query: 454 LIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIK-------------------D 494
               F   +    PMFP +E   +WD++GE+I  +D+++                    D
Sbjct: 430 --GSFFKQAKKSYPMFPTHEERIKWDEYGEIIRLEDFLVPELQATEEEKSKLDSGLTNGD 487

Query: 495 EDMDQ-------------AAMHIGGDDGKLD-EGSASLILDAKPSKVVSNELT----VLV 536
           E MDQ              ++ I      +D EG +    D    K + N++     V+V
Sbjct: 488 EPMDQDLSVLPTKCISNVESLEIRARVTYIDYEGRS----DGDSIKKIINQMKPRQLVIV 543

Query: 537 HGSAEATEHLKQHCL---KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKL 593
           HG  EA+  L + C    K +   VYTP+++ETID TS+   Y+V+L + L+S++ F K 
Sbjct: 544 HGPPEASLDLAESCKAFSKDI--KVYTPKLQETIDATSETHIYQVRLKDSLVSSLQFCKA 601

Query: 594 GDYEIAWVDA----EVGKTENGML------------------------------------ 613
            D E+AW+D      V K + G++                                    
Sbjct: 602 KDTELAWIDGVLDMRVVKVDTGVMLEDGVKEEAEDSELGMEITPDLGIEASSIAVAAHRA 661

Query: 614 ----------------SLLPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFA 652
                            ++P   P P      H+SV + + +++D K  L  +GIQ EF 
Sbjct: 662 MKNLFGEEEKEVSEESDIIPTLEPLPTPEVPGHQSVFINEPRLSDFKQVLLREGIQAEFV 721

Query: 653 GGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQF 703
           GG L C   V +R+   AG         +I +EG LCEDYYKIR  LY Q+
Sbjct: 722 GGVLVCNNMVAVRRT-EAG---------RISLEGCLCEDYYKIRELLYQQY 762


>gi|350587145|ref|XP_001926907.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 [Sus scrofa]
          Length = 438

 Score =  361 bits (927), Expect = 6e-97,   Method: Compositional matrix adjust.
 Identities = 178/383 (46%), Positives = 252/383 (65%), Gaps = 6/383 (1%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  G+VL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGSVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMS 378
             GTLAR L  +P  K  ++  S
Sbjct: 360 TPGTLARFLIDNPSEKITEIESS 382


>gi|384484008|gb|EIE76188.1| hypothetical protein RO3G_00892 [Rhizopus delemar RA 99-880]
          Length = 657

 Score =  357 bits (916), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 228/635 (35%), Positives = 341/635 (53%), Gaps = 90/635 (14%)

Query: 50  KVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           KV+  IDAVLLSH D  HLGA PYA   LG++ PV+ST PV  +G + MYD Y SR    
Sbjct: 2   KVSKQIDAVLLSHSDLGHLGAYPYARNHLGMTCPVYSTVPVVNMGKMCMYDLYQSRTNEL 61

Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
           EF  FTL+D+D+AF  +T L YSQ + L GK +GI +  + A H +GGT+WKI +D +++
Sbjct: 62  EFKTFTLEDVDNAFDKITPLRYSQPFSLPGKCQGITITAYAAAHTVGGTIWKIKQDTDEI 121

Query: 170 IYAVDYNRRKEKHLNGT-------VLESFVRPAVLITDAYNALHNQPPRQQR--EMFQDA 220
           +YAVD+N RKE HL+GT       VL+S  RP++LITDAYN+    P R+ R   MF D 
Sbjct: 122 VYAVDFNHRKEYHLDGTVLHSGGVVLDSLTRPSLLITDAYNSQVVHPARKDRYAAMF-DT 180

Query: 221 ISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLE 280
           +  +L  GG+VLLP DS+ RVLEL  +L+ +W+++ LNYP+  L+  S  T+ + K  LE
Sbjct: 181 MLTSLNKGGSVLLPTDSSARVLELAYLLDQHWSQNQLNYPLIMLSNTSYHTVHFAKIMLE 240

Query: 281 WMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVE 340
           WMG+ +T+ F  SR+N +  K+V L     +LDN P GPK+V+AS  SLE GF+ ++F+ 
Sbjct: 241 WMGEELTRKFSQSRENPYEFKYVRLCHKIEDLDNYP-GPKIVMASHHSLETGFARELFLR 299

Query: 341 W-ASDVKNLVLFTERGQFGTLARMLQAD------------------------PPPKAVKV 375
           W  +D +N ++ T+R   GTLAR L  D                         P  A + 
Sbjct: 300 WMTNDPQNTLILTDRSAPGTLARRLYDDWEQQTNKTATTTTVVNNNRTKVLVKPAIAYEN 359

Query: 376 TMS----RRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVI 431
           T+     +RVPL G EL  YE  Q    ++EA +A+++    SK  +  D +   D   +
Sbjct: 360 TIDLRVYKRVPLEGAELQEYEAAQRAKAEKEAAQAAMLA--RSKIIMEEDESDVSD---M 414

Query: 432 DANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYI 491
           D  + +    +        RD    G          MFP+ E   + DD+GE I  + Y+
Sbjct: 415 DEGDEDVEGLLTRQFDLYVRDTGKSGGFFKHAHSYRMFPYLEKRKKMDDYGEAIQIEHYM 474

Query: 492 IKD--EDMDQAAMHI--GGDDGKLDEGSASL---IL---DAKPSKVVSNELT-------- 533
                E M+Q   ++  G + GK D+    L   IL   D  P+K +S++ T        
Sbjct: 475 KASELERMEQEKKNLGQGANFGKEDDMQIDLQEPILPGRDETPTKYISSDETFLVRCQLR 534

Query: 534 -------------------------VLVHGSAEATEHLKQHC--LKHVCPHVYTPQIEET 566
                                    ++VHGS  +T+ L+  C  +++    ++TP + E 
Sbjct: 535 YVDLEGLSDGRSMKTILPQIAPRKLIIVHGSESSTKDLESACQGIEYFTKEIFTPSVGEV 594

Query: 567 IDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWV 601
           ++V++    Y+V+L++ ++S++ F KL DYE+A V
Sbjct: 595 LNVSAATNIYRVKLTDSMVSSLRFSKLDDYELARV 629


>gi|341883504|gb|EGT39439.1| CBN-CPSF-2 protein [Caenorhabditis brenneri]
          Length = 822

 Score =  356 bits (913), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 253/838 (30%), Positives = 408/838 (48%), Gaps = 148/838 (17%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++   SG  +E PL YL+ +D    L+DCGW++ F+    + L      I AVL+
Sbjct: 1   MTSIIKLKVFSGAKDEGPLCYLLQVDSDYILLDCGWDERFELKYFEELKPFIPKISAVLI 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLG LPY + + GL+APV++T PVY++G + +YD   S   V EFD +TLDD+D
Sbjct: 61  SHPDPLHLGGLPYLVAKCGLTAPVYATVPVYKMGQMFIYDLVYSHLDVEEFDHYTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK-DGEDVIYAVDYNRRK 179
            AF+ V ++ Y+Q   L G   G+      AGH++GG++W+I +  GED+IY VD+N +K
Sbjct: 121 MAFEKVEQVKYNQTVVLKGDS-GVHFTAIPAGHMIGGSIWRICRVTGEDIIYCVDFNHKK 179

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HL+G   ++F RP +LIT A++    Q  R+ R E+    I +T+R  G+ ++ +D+A
Sbjct: 180 ERHLSGCSFDNFNRPHLLITGAHHISLPQMKRKDRDELLVTKILRTVRQKGDCMVVIDTA 239

Query: 239 GRVLELLLILEDYWAEHSL---NYPIYFLTYVSSSTIDYVKSFLEWMGDSITK-SFETSR 294
           GRVLE+  +L+  W+        Y +  +++V+SS + + KS LEWM +S+ K    ++R
Sbjct: 240 GRVLEIAYLLDQLWSNADAGLSTYNLVMMSHVASSVVQFAKSQLEWMHESLFKYDSNSTR 299

Query: 295 DNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
            N F LK+VTL  +  EL      PK+VL S   +EAGFS ++F++W SD +N V+ T R
Sbjct: 300 YNPFTLKNVTLCHSHQELLRVR-SPKVVLCSSQDMEAGFSRELFLDWCSDSRNGVILTAR 358

Query: 355 GQFGTLARML-----QAD-----PPPKAVKVTMSRRVPLVGEELIAYEE-------EQTR 397
               TLA  L     +A+     P  + + + + +RVPL GEEL+ Y+        E+TR
Sbjct: 359 PSSFTLAAKLVNLAERANDGILRPEDRLISLLVKKRVPLEGEELLEYKRRKAERDAEETR 418

Query: 398 LKKEEALKASLVKEEESK------ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR 451
           ++ E A + +   E +        A + P ++           N +   D++     ++ 
Sbjct: 419 MRMERARRQAQANESDDSDDDDMAAPIVPRHSEKDFRSFDGIENDSHCFDIM----AKWD 474

Query: 452 DILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYII-------KDEDMDQAAM-- 502
           +     F   +    PM+P+ E   +WDD+GEVI P+DY +       K ++ D+  +  
Sbjct: 475 NQQKASFFKTTKKSFPMYPYIEEKIKWDDYGEVIKPEDYTVISKIDLRKGQNKDEPVVVQ 534

Query: 503 -------------HIGGDDGKLDEGSASL-------------ILDAKPSKVVSNELT--- 533
                        H+     K  E    +             I D + +K +   LT   
Sbjct: 535 KREDEEEVYNPNDHVEEMPTKCVEFKNRIEVCCRVEFIDYEGISDGESTKKMLAGLTPRQ 594

Query: 534 -VLVHGSAEATEHLKQHCLKH--VCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLF 590
            ++VHGS + T  L  +   +      + TP   + ID + +   ++V LS+ L++ + F
Sbjct: 595 IIIVHGSRDDTRDLYAYFSDNGIKSDMMKTPVAGDLIDASVESFIFQVSLSDALLAELQF 654

Query: 591 KKLGD-YEIAWVDAEVGKTEN-------GMLSLL----------------PIST------ 620
           K++ +   +AW+DA+V + EN       G  +L+                P+ T      
Sbjct: 655 KQVSEGNSLAWLDAKVTEKENLDNMLISGTSNLMIGNGNHDTSGSDQNEEPMETDENGLQ 714

Query: 621 ------------PAPPHK-------------------SVLVGDLKMADLKPFLSSKGIQV 649
                       P  P K                   ++ V D KM+D K  L  +G + 
Sbjct: 715 ENGNSDRNGFKKPKEPEKIRGTLILDPLQRSRIPVHQAIFVNDPKMSDFKNLLVERGYKA 774

Query: 650 EFAGGALRC-GEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
           EF  G L   G   +IR+          S T    +EG   +DYYK+R   Y QF +L
Sbjct: 775 EFLSGTLIINGGKCSIRR----------SETGSFQMEGAFTKDYYKVRKLFYDQFAVL 822


>gi|395827898|ref|XP_003787126.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 [Otolemur garnettii]
          Length = 750

 Score =  353 bits (907), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 176/390 (45%), Positives = 252/390 (64%), Gaps = 7/390 (1%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS + +  +     +   R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVCFTCNKEV-CYXDKRN 299

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 300 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 358

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVG 385
             GTLAR L  +P  K  ++ + +RV L G
Sbjct: 359 TPGTLARFLIDNPSEKITEIELRKRVKLEG 388



 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 84/345 (24%), Positives = 145/345 (42%), Gaps = 111/345 (32%)

Query: 458 FVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKD---EDMDQAAMHIGGDDGKLDEG 514
           F   +    PMFP  E   +WD++GE+I P+D+++ +    + +++ +  G  +G  DE 
Sbjct: 421 FFKQAKKSYPMFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNG--DEP 478

Query: 515 SASLILDAKPSKVVSNELTVLVH---------------------------------GSAE 541
               + D  P+K +S   ++ +                                  G  E
Sbjct: 479 MNQDLSDV-PTKCISTTESIEIKARVTYIDYEGRSDGDSIKKIINQMKXXXXXXXXGPPE 537

Query: 542 ATEHLKQHCL----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYE 597
           A++ L + C     K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E
Sbjct: 538 ASQDLAECCRAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAE 595

Query: 598 IAWVDA----EVGKTENGML---------------------------------------- 613
           +AW+D      V K + G++                                        
Sbjct: 596 LAWIDGVLDMRVSKVDTGVILEEGELKDDGEDSEMQVDAPSDSSVIAQQKAMKSLFGDDE 655

Query: 614 -------SLLPISTPAPPH-----KSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEY 661
                   ++P   P PPH     +SV + + +++D K  L  +GIQ EF GG L C   
Sbjct: 656 KETGEESEIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQ 715

Query: 662 VTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
           V +R+          + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 716 VAVRR----------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 750


>gi|290981012|ref|XP_002673225.1| predicted protein [Naegleria gruberi]
 gi|284086807|gb|EFC40481.1| predicted protein [Naegleria gruberi]
          Length = 808

 Score =  352 bits (904), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 253/840 (30%), Positives = 411/840 (48%), Gaps = 171/840 (20%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF---DPSLLQPLSKVASTIDA 57
           M +S+Q  PL G  NE P+  ++ +D +  L+DCGW+++F   D  + + ++     IDA
Sbjct: 1   MSSSIQFVPLVGSQNEGPVCSILIVDDYYILLDCGWDENFNTKDSHIQEIINNYRDKIDA 60

Query: 58  VLLSHPDTLHLGALPYAMKQLGL-----SAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           +L+S  D  H GALPY + + G+      A +F+T P+ ++G + +YD Y + RQ  +F+
Sbjct: 61  ILISQSDIYHCGALPYLVGKCGILENKKKAKIFATLPIVKMGQMHLYDAYQNIRQHQDFE 120

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGK-------------------------------- 140
            F LDD+D  F S+ +L YSQ Y LS +                                
Sbjct: 121 TFDLDDVDLCFDSIHQLKYSQRYPLSQQTTIITQIEETDENGEEGEGGVVGSSGSVAEME 180

Query: 141 GEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVL-ESFVRPAVLIT 199
           GE +V+ P +AGH LGGT+WK+TK+ ++++YA+D+N + E+HLNG+VL E   +PA+LIT
Sbjct: 181 GEKLVICPFLAGHTLGGTIWKLTKETDEIVYAIDFNIKTERHLNGSVLGELGGKPALLIT 240

Query: 200 DAYN----------ALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILE 249
           DAYN           +   P  +       +I+ TL  GGNVL+P+++AGRV EL+L+LE
Sbjct: 241 DAYNVKPIPSSDLGGVDKAPAIK----IMKSITDTLTGGGNVLVPIETAGRVFELMLLLE 296

Query: 250 DYWAE--HSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLI 307
           + W       N+ +  LT V+  TI++    LEWM D I K F+  R+N F  ++ ++  
Sbjct: 297 ERWKRDPQMANFELILLTNVAYRTIEFASHQLEWMSDKIMKGFDEKRENPFKFQYFSVCH 356

Query: 308 NKSEL-----------------------DNAPDG---PKLVLASMASLEAGFSHDIFVEW 341
           N  EL                       + A  G   P +VLAS  +L+ G++ ++FV+W
Sbjct: 357 NVEELMDKLQKKEQMRMMMENQMNDEDEETATTGKHTPMVVLASSNTLDYGYARELFVKW 416

Query: 342 ASDVKNLVLFTERGQFGTLARML-------QADPPPKAVKVTMSRRVPLVGEELIAYEEE 394
             D +NLV+F ER    +L+R L       +++   + + +T+ RRV L GEEL  YE+E
Sbjct: 417 CEDQRNLVMFIERSAPNSLSRKLINKLRAKKSERLDENMSLTLYRRVALKGEELEKYEKE 476

Query: 395 QTRLKKEEA---------------LKASLVKEEESKASLGPDNNLSGDPMVIDANNANAS 439
           Q +LK+E                 ++    ++ + K S      L+G       +++   
Sbjct: 477 Q-QLKQEAEKKRREEEERNKRVIHVRDEDDEDLDLKKSKQFREELTGGA----DDDSQTH 531

Query: 440 ADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKDEDMDQ 499
           A +  P   RY             S   MFP  E     D++GE ++P+D+ ++    DQ
Sbjct: 532 ARLYLPENMRYH------------SQYLMFPCIERGISKDEYGESVDPEDFKLRLLQADQ 579

Query: 500 AAMHIGGDDGKLDEGS---------------------ASLILDAKPSKV-VSNELT---- 533
           +   I  D+   +E                       A L  + + S V + N L     
Sbjct: 580 SE-QIMADNTIHEEEDYYEPPSKIESENVSVRILCKLAYLDFEGRSSPVDIKNILQKINP 638

Query: 534 ---VLVHGSAEATEHLKQHC-LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVL 589
              +L+HGS E+   L  +C  K +   + TP   E +D+T D   +KV+L + L+S + 
Sbjct: 639 RKLILIHGSQESIIELSDYCETKKISEQIKTPMDLEVMDMTMDTNMFKVKLKQDLLSQIH 698

Query: 590 FKKLG-DYEIAWVDAEVGKTENGMLSLLPISTPAPP---HKSVLVGDLKMADLKPFLSSK 645
           + K G +Y++A+++  + + E G  S +P   P P    H ++L+GDLK+      L   
Sbjct: 699 YIKSGTNYDMAYIEG-IYRVEEG--SDIPCIHPNPKPKGHPTMLIGDLKLNQFFKLLKES 755

Query: 646 GIQVEF-AGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFY 704
           G+  EF  GG L C + V ++K   +G         +I + G L   Y+++R  LY +FY
Sbjct: 756 GLSAEFQQGGVLVCNDEVMLQKDKKSG---------EIQVFGSLSPTYFQVRELLY-KFY 805


>gi|328866931|gb|EGG15314.1| beta-lactamase domain-containing protein [Dictyostelium
           fasciculatum]
          Length = 768

 Score =  350 bits (898), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 254/800 (31%), Positives = 401/800 (50%), Gaps = 126/800 (15%)

Query: 1   MGTSVQVTPLSGVFNE-NPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVL 59
           M + ++ TPL G   +  P  YL+ ID F  L+DCGWN   D SLL  L KVA+ +DA+L
Sbjct: 1   MTSVIKFTPLCGGAGQITPPCYLLEIDNFCILLDCGWNAKLDISLLDELKKVANKVDAIL 60

Query: 60  LSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDI 119
           L++PDT H+GALPYA+ +LGL+  ++ T P++++G + MYD Y SR    EFD F LD++
Sbjct: 61  LTYPDTEHIGALPYAIGKLGLTGKIYGTTPIHKMGQIFMYDLYTSRMAQEEFDRFDLDEV 120

Query: 120 DSAFQS--VTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNR 177
           D  F       L+YSQ+Y +    +GI++ P++AGH++GG+VW+I K+ + ++YAVD N 
Sbjct: 121 DMCFDQSRFKELSYSQHYEIPD-SDGIIITPYLAGHMVGGSVWRIAKESDVIVYAVDINH 179

Query: 178 RKEKHL-----NGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDA----ISKTLRAG 228
           R+E HL     NG +     +P  LITDA + L   PP Q++     A    + K+LR G
Sbjct: 180 RRESHLEGFLQNGLLSPELAKPTHLITDALHIL--DPPPQKKADKDTAMLAQLRKSLRDG 237

Query: 229 GNVLLPVDSAGRVLELLLILEDYWAEHSLN--YPIYFLTYVSSSTIDYVKSFLEWMGDSI 286
           GN+L+  D+AGRVLELLL ++ YW++H L   Y + F   V+    ++ KS LE+M  + 
Sbjct: 238 GNILVATDTAGRVLELLLTIDQYWSQHRLGSAYSVVFFNSVTYYVREFAKSQLEFMSTAA 297

Query: 287 TKSFETSRDNAFLLKHVTLLINKSELDNAPDGPK--LVLASMASLEAGFSHDIFVEWASD 344
           +  FE   +N F  +++ +  +  +L+  P+  +  +VLAS   LE GF+ D+F++WA+D
Sbjct: 298 SSKFEQKNENIFNFRNIKICNSFKQLEELPNLTRNYVVLASSKDLETGFAKDLFIQWAND 357

Query: 345 VKNLVLFTERGQFGTLARML-QADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEA 403
            KN+V+ T+    GTL   L +      +++VT  +RV L GEEL  YEE   R K EE 
Sbjct: 358 PKNMVMLTDNMDEGTLGDQLSKCQSGIDSIQVTHGKRVELEGEELREYEETIQRKKDEEK 417

Query: 404 LKASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPST 463
                 + +E KA+     ++     +I   N         P   R+ D+    F+  + 
Sbjct: 418 RLEEEKRLQEEKANRKERMDVDDQEELITKKN---------PLLNRF-DMHRSDFI--NE 465

Query: 464 SVAPMFPFYENNSEWDDFGEVINPDDYIIKDEDMDQAAMHIGGDDGKLDEGSASLILDAK 523
              PMFPF E   +WD++GE  + +   I  E  DQ    +  DD  ++E +     + K
Sbjct: 466 HYIPMFPFTEPIVKWDEYGEQ-DEELLNIAKELKDQKDKEM-KDDVVMEEENKQEEEETK 523

Query: 524 PSKVVS-NELT--------------------------------VLVHGSAEATEHLKQHC 550
           P K+V+ N +                                 +LV G+ +  + L    
Sbjct: 524 PKKIVTFNTMVKVNCSVTRFDYQGCSDGQSLKTIIQKIAPTNLILVRGNQQCVDELLDFA 583

Query: 551 LKHV-CPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEV---- 605
            K +    +++P I   ID+TS       +  + L+ ++   KL DYEIA+++A+V    
Sbjct: 584 KKSLRVKGLFSPAISNQIDLTS-------ETHDSLIKSLNTSKLMDYEIAYIEAKVHIED 636

Query: 606 ----GKTENG-----------------------------------MLSLLPISTPAPPHK 626
               G T                                      +L ++P+   +  H 
Sbjct: 637 IILNGATNAATPLAITSPTTSTAITTTNDSKALTVVQPKEKKIIPLLDIMPVE-ESKGHN 695

Query: 627 SVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEG 686
              VGD+K+++ K  L+ +G QV+F  G L C   V +        +    G   I I+G
Sbjct: 696 VSFVGDVKLSEFKDVLTREGFQVQFDKGILSCNGLVYL-------WREEVDGNSCINIDG 748

Query: 687 PLCEDYYKIRAYLYSQFYLL 706
            + E+YY ++  LYSQF +L
Sbjct: 749 VMSEEYYLVKELLYSQFKIL 768


>gi|297695726|ref|XP_002825082.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
           specificity factor subunit 2 [Pongo abelii]
          Length = 747

 Score =  349 bits (895), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 239/767 (31%), Positives = 372/767 (48%), Gaps = 184/767 (23%)

Query: 55  IDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLF 114
           IDAVLLSHPD LHLGA PYA+ +LGL   +++  PVY++G + MYD Y  R         
Sbjct: 50  IDAVLLSHPDPLHLGAXPYAVGKLGLKCAIYAPIPVYKMGQMXMYDLYQFR--------- 100

Query: 115 TLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED-VIYAV 173
                                   GKG G+ + P  AGH++GGT+WKI KDGE+ ++YAV
Sbjct: 101 ------------------------GKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAV 136

Query: 174 DYNRRKEK-HLNGTVLES--FVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGG 229
           D+N ++E  +L+G    S  +  P++LITD++NA + QP R+QR E     + +TLR  G
Sbjct: 137 DFNHKREMLNLSGKPFSSTMYYSPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDG 196

Query: 230 NVLLPVDSAGRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSI 286
           NVL+ VD+AGRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D +
Sbjct: 197 NVLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKL 256

Query: 287 TKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVK 346
            + FE  R+N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D K
Sbjct: 257 MRCFEDKRNNPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPK 315

Query: 347 NLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKA 406
           N ++ T R   GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+      
Sbjct: 316 NSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLE 375

Query: 407 SLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------F 458
                             S +  +  ++ ++   D+ +P   + + D+++ G       F
Sbjct: 376 Q-----------------SKEADIDSSDESDIEEDIDQPSAHKTKHDLMMKGEGSRKGSF 418

Query: 459 VPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIK-------------------DEDMDQ 499
              +    PMFP  E   +WD++GE+I P+D+++                    DE MDQ
Sbjct: 419 FKQAKKSYPMFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQ 478

Query: 500 -------------AAMHIGGD------DGKLDEGSASLILDA-KPSKVVSNELTVLVHGS 539
                         ++ I         +G+ D  S   I++  KP ++      ++VHG 
Sbjct: 479 DLSDVPTKCISTTESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQL------IIVHGP 532

Query: 540 AEATEHLKQHCL----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGD 595
            EA++ L + C     K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D
Sbjct: 533 PEASQDLAECCRAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKD 590

Query: 596 YEIAWVDA----EVGKTENGML-------------------------------------- 613
            E+AW+D      V K + G++                                      
Sbjct: 591 AELAWIDGVLDMRVSKVDTGVILEEGELKDDGEDSEMQVEAPSDSSVIAQQKAMKSLFGD 650

Query: 614 ---------SLLPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCG 659
                     ++P   P PP     H+SV + + +++D K  L  +GIQ EF GG L C 
Sbjct: 651 DEKETGEESEIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCN 710

Query: 660 EYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
             V +R+          + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 711 NQVAVRR----------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 747


>gi|449518417|ref|XP_004166238.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like, partial [Cucumis sativus]
          Length = 237

 Score =  347 bits (891), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 177/236 (75%), Positives = 186/236 (78%), Gaps = 34/236 (14%)

Query: 505 GGD-DGKLDEGSASLILDAKPSKVVSNELTV----------------------------- 534
           GGD DGKLDE +A+LILD KPSKVVSNELTV                             
Sbjct: 2   GGDVDGKLDETAANLILDMKPSKVVSNELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAP 61

Query: 535 ----LVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLF 590
               LVHG+AEATEHLKQHCLK+VCPHVY PQIEETIDVTSDLCAYKVQLSEKLMSNVLF
Sbjct: 62  LKLVLVHGTAEATEHLKQHCLKNVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSNVLF 121

Query: 591 KKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPPHKSVLVGDLKMADLKPFLSSKGIQVE 650
           KKLGDYEI W+DAEVGKTENG LSLLP+S    PHKSVLVGDLKMAD K FL+SKGIQVE
Sbjct: 122 KKLGDYEITWLDAEVGKTENGTLSLLPLSKAPAPHKSVLVGDLKMADFKQFLASKGIQVE 181

Query: 651 FAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
           FAGGALRCGEYVT+RKV  A QKGGGSGTQQ+VIEGPLCEDYYKIR  LYSQFYLL
Sbjct: 182 FAGGALRCGEYVTLRKVTDASQKGGGSGTQQVVIEGPLCEDYYKIRELLYSQFYLL 237


>gi|17559452|ref|NP_504822.1| Protein CPSF-2 [Caenorhabditis elegans]
 gi|18201967|sp|O17403.1|CPSF2_CAEEL RecName: Full=Probable cleavage and polyadenylation specificity
           factor subunit 2; AltName: Full=Cleavage and
           polyadenylation specificity factor 100 kDa subunit;
           Short=CPSF 100 kDa subunit
 gi|351057814|emb|CCD64424.1| Protein CPSF-2 [Caenorhabditis elegans]
          Length = 843

 Score =  332 bits (850), Expect = 6e-88,   Method: Compositional matrix adjust.
 Identities = 219/697 (31%), Positives = 358/697 (51%), Gaps = 97/697 (13%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++   SG  +E PL YL+ +DG   L+DCGW++ F     + L      I AVL+
Sbjct: 1   MTSIIKLKVFSGAKDEGPLCYLLQVDGDYILLDCGWDERFGLQYFEELKPFIPKISAVLI 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLG LPY + + GL+APV++T PVY++G + +YD   S   V EF+ +TLDD+D
Sbjct: 61  SHPDPLHLGGLPYLVSKCGLTAPVYATVPVYKMGQMFIYDMVYSHLDVEEFEHYTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK-DGEDVIYAVDYNRRK 179
           +AF+ V ++ Y+Q   L G   G+      AGH+LGG++W+I +  GED++Y VD+N +K
Sbjct: 121 TAFEKVEQVKYNQTVVLKGDS-GVHFTALPAGHMLGGSIWRICRVTGEDIVYCVDFNHKK 179

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG   ++F RP +LIT A++    Q  R+ R E     I +T+R  G+ ++ +D+A
Sbjct: 180 ERHLNGCSFDNFNRPHLLITGAHHISLPQMRRKDRDEQLVTKILRTVRQKGDCMIVIDTA 239

Query: 239 GRVLELLLILEDYWAEHSL---NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS-R 294
           GRVLEL  +L+  W+        Y +  +++V+SS + + KS LEWM + + K   +S R
Sbjct: 240 GRVLELAHLLDQLWSNADAGLSTYNLVMMSHVASSVVQFAKSQLEWMNEKLFKYDSSSAR 299

Query: 295 DNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
            N F LKHVTL  +  EL      PK+VL S   +E+GFS ++F++W SD +N V+ T R
Sbjct: 300 YNPFTLKHVTLCHSHQELMRVR-SPKVVLCSSQDMESGFSRELFLDWCSDPRNGVILTAR 358

Query: 355 GQFGTLARML-----QADP-----PPKAVKVTMSRRVPLVGEELIAYEE-------EQTR 397
               TLA  L     +A+        + + + + +RV L GEEL+ Y+        E+TR
Sbjct: 359 PASFTLAAKLVNMAERANDGVLKHEDRLISLVVKKRVALEGEELLEYKRRKAERDAEETR 418

Query: 398 LKKEEALKASLVKEEESK------ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR 451
           L+ E A + +   E +        A + P ++         + N   + D++     ++ 
Sbjct: 419 LRMERARRQAQANESDDSDDDDIAAPIVPRHSEKDFRSFDGSENDAHTFDIM----AKWD 474

Query: 452 DILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYII-------KDEDMDQAAM-- 502
           +     F   +    PMFP+ E   +WDD+GEVI P+DY +       K ++ D+  +  
Sbjct: 475 NQQKASFFKTTKKSFPMFPYIEEKVKWDDYGEVIKPEDYTVISKIDLRKGQNKDEPVVVK 534

Query: 503 -------------HI--------------------------GGDDGKLDEGSASLILDAK 523
                        H+                          G  DG   E +  L+    
Sbjct: 535 KREEEEEVYNPNDHVEEMPTKCVEFKNRVEVSCRIEFIEYEGISDG---ESTKKLLAGLL 591

Query: 524 PSKVVSNELTVLVHGSAEATEHLKQHCLKHV--CPHVYTPQIEETIDVTSDLCAYKVQLS 581
           P ++      ++VHGS + T  L  +          +  P+    +D + +   Y+V LS
Sbjct: 592 PRQI------IVVHGSRDDTRDLVAYFADSGFDTTMLKAPEAGALVDASVESFIYQVALS 645

Query: 582 EKLMSNVLFKKLGD-YEIAWVDAEVGKTE--NGMLSL 615
           + L++++ FK++ +   +AW+DA V + E  + ML++
Sbjct: 646 DALLADIQFKEVSEGNSLAWIDARVMEKEAIDNMLAV 682



 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 43/85 (50%), Gaps = 11/85 (12%)

Query: 623 PPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRC-GEYVTIRKVGPAGQKGGGSGTQQ 681
           P H++V V D K++D K  L+ KG + EF  G L   G   +IR+          + T  
Sbjct: 769 PIHQAVFVNDPKLSDFKNLLTDKGYKAEFLSGTLLINGGNCSIRR----------NDTGV 818

Query: 682 IVIEGPLCEDYYKIRAYLYSQFYLL 706
             +EG   +DYYK+R   Y QF +L
Sbjct: 819 FQMEGAFTKDYYKLRRLFYDQFAVL 843


>gi|308480408|ref|XP_003102411.1| CRE-CPSF-2 protein [Caenorhabditis remanei]
 gi|308262077|gb|EFP06030.1| CRE-CPSF-2 protein [Caenorhabditis remanei]
          Length = 850

 Score =  331 bits (849), Expect = 8e-88,   Method: Compositional matrix adjust.
 Identities = 217/692 (31%), Positives = 353/692 (51%), Gaps = 99/692 (14%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++   SG  +E PL YL+ +D    L+DCGW++ F+    + L      I AVL+
Sbjct: 1   MTSIIKLRVFSGAKDEGPLCYLLQVDNDYILLDCGWDERFELKYFEDLKPFIPKISAVLI 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLG LPY + + GL+APV++T PVY++G + +YD   S   V EF+ +TLDD+D
Sbjct: 61  SHPDPLHLGGLPYLVAKCGLTAPVYATVPVYKMGQMFIYDMVYSHLDVEEFEHYTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK-DGEDVIYAVDYNRRK 179
            AF+ V ++ Y+Q   L G   G+      AGH++GG++W+I +  GED+IY VD+N +K
Sbjct: 121 MAFEKVEQVKYNQTVVLKGDS-GVHFTAMPAGHMIGGSIWRICRVTGEDIIYCVDFNHKK 179

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           ++HLNG   ++F RP +LIT A++    Q  R  R +     I +T+R  G+ ++ +D+A
Sbjct: 180 DRHLNGCSFDNFNRPHLLITGAHHISLPQMKRMDRDQQLVTKILRTVRQKGDCMIVIDTA 239

Query: 239 GRVLELLLILEDYW--AEHSLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITK-SFETSR 294
           GRVLEL  +L+  W  A+  L+ Y +  +++V+SS + + KS LEWM + + K    ++R
Sbjct: 240 GRVLELAYLLDQLWGNADAGLSTYNLVMMSHVASSVVQFAKSQLEWMNEKLFKYDSNSAR 299

Query: 295 DNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
            N F LKH+TL  +  EL      PK+VL S   +E+GFS ++F++W SD +N V+ T R
Sbjct: 300 YNPFTLKHITLCHSHQELMRVR-SPKVVLCSSQDMESGFSRELFLDWCSDSRNGVILTAR 358

Query: 355 GQFGTLARML-----QADP-----PPKAVKVTMSRRVPLVGEELIAYEE-------EQTR 397
               TLA  L     +A+        + + +++ +RVPL GEEL+ Y+        E+TR
Sbjct: 359 PSSFTLAAKLVNLAERANDGVLRNEDRLISLSVKKRVPLEGEELLEYKRRKAERDAEETR 418

Query: 398 LKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANNAN--ASADVVEPHGGRYRDILI 455
           ++ E A + +   E +              P+ +  ++     S D +E       DI+ 
Sbjct: 419 IRMERARRQAQANESDDSDDD-----DMAAPINVTRHSEKDYRSFDGIESDNTHCFDIMS 473

Query: 456 D-------GFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYII-----------KD--- 494
                    F   +    PM+P+ E   +WDD+GEVI P+DY +           KD   
Sbjct: 474 KWDNQQKASFFKSTKKSFPMYPYIEEKVKWDDYGEVIKPEDYTVISKIDLRKGGNKDEPV 533

Query: 495 ------------------EDMDQAAMHI----------------GGDDGKLDEGSASLIL 520
                             E+M    +                  G  DG   E +  ++ 
Sbjct: 534 VVKKREEEEEVYNPNDHVEEMPTKCVEFKNRIEISCRVEFIEYEGISDG---ESTKKMLA 590

Query: 521 DAKPSKVVSNELTVLVHGSAEATEHLKQHCLKH--VCPHVYTPQIEETIDVTSDLCAYKV 578
              P ++      ++VHGS + T  L  +   +      + TP   + ID + +   Y+V
Sbjct: 591 GLHPRQI------IIVHGSRDDTRDLYAYFCDNGFAADMMKTPVAGDLIDASVESFIYQV 644

Query: 579 QLSEKLMSNVLFKKLGD-YEIAWVDAEVGKTE 609
            LS+ L++ + FK++ +   +AW+DA V + E
Sbjct: 645 ALSDALLAEIHFKEVSEGNSLAWMDARVMEKE 676



 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 35/105 (33%), Positives = 48/105 (45%), Gaps = 13/105 (12%)

Query: 604 EVGKTENGMLSLLPISTP-APPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRC-GEY 661
           E      G L L P+     P H+++ V D K++D K  L  KG + EF  G L   G  
Sbjct: 757 EAAAKPRGNLILEPLPKKLIPIHQAIFVNDPKLSDFKNLLVEKGYKAEFLSGTLLINGGK 816

Query: 662 VTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
            +IR+           G     +EG L +DYYK+R   Y QF +L
Sbjct: 817 CSIRR-----------GEMGFSMEGALSKDYYKLRNLFYDQFAIL 850


>gi|229553940|sp|A8XUS3.2|CPSF2_CAEBR RecName: Full=Probable cleavage and polyadenylation specificity
           factor subunit 2; AltName: Full=Cleavage and
           polyadenylation specificity factor 100 kDa subunit;
           Short=CPSF 100 kDa subunit
          Length = 842

 Score =  327 bits (837), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 211/685 (30%), Positives = 352/685 (51%), Gaps = 85/685 (12%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++   SG  +E PL YL+ +D    L+DCGW++ F+    + L      I AVL+
Sbjct: 1   MTSIIKLKVFSGAKDEGPLCYLLQVDNDYILLDCGWDERFELKYFEELRPYIPKISAVLI 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLG LPY + + GL+APV+ T PVY++G + +YD   S   V EF  ++LDD+D
Sbjct: 61  SHPDPLHLGGLPYLVAKCGLTAPVYCTVPVYKMGQMFIYDLVYSHLDVEEFQHYSLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK-DGEDVIYAVDYNRRK 179
            AF+ V ++ Y+Q   L G   G+      AGH++GG++W+I +  GED+IY VD+N RK
Sbjct: 121 MAFEKVEQVKYNQTVVLKGDS-GVNFTAMPAGHMIGGSMWRICRITGEDIIYCVDFNHRK 179

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           ++HL+G   ++F RP +LIT A++    Q  R+ R E     I +T+R  G+ ++ +D+A
Sbjct: 180 DRHLSGCSFDNFNRPHLLITGAHHISLPQMKRKDRDEQLVTKILRTVRQKGDCMIVIDTA 239

Query: 239 GRVLELLLILEDYWAEHSL---NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS-R 294
           GRVLEL  +L+  WA        Y +  +++V+SS + + KS LEWM + + +   +S R
Sbjct: 240 GRVLELAYLLDQLWANQDAGLSTYNLVMMSHVASSVVQFAKSQLEWMDEKLFRYDSSSAR 299

Query: 295 DNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
            N F LK+V L+ +  EL      PK+VL S   +E GFS ++F++W +D +N V+ T R
Sbjct: 300 YNPFTLKNVNLVHSHLELIKIR-SPKVVLCSSQDMETGFSRELFLDWCADQRNGVILTAR 358

Query: 355 -GQFGTLARMLQADPPP---------KAVKVTMSRRVPLVGEELIAYEE-------EQTR 397
              F   AR+++              K + + + +RVPL GEEL+ Y+        E+TR
Sbjct: 359 PASFTLAARLVELAERANDGVLRNEDKHLSLLVRKRVPLEGEELLEYKRRKAERDAEETR 418

Query: 398 LKKEEALKASLVKEEESKA----------SLGPDNNLSGDPMVIDANNANASADVVEPHG 447
           ++ E A + +   E +              L   ++ S D +  D++  +  A       
Sbjct: 419 IRMERARRQAQANESDDSDDDDIAAPIVPRLSEKDHRSFDAIENDSHCFDIMA------- 471

Query: 448 GRYRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDY-IIKDEDMDQA------ 500
            ++ +     F   +    PM+P+ E   +WDD+GEVI P+DY +I   DM +       
Sbjct: 472 -KWDNQQKASFFKSTKKSFPMYPYIEEKVKWDDYGEVIKPEDYTVISKIDMRKGKNKDEP 530

Query: 501 -AMHIGGDDGKL------DEGSASLILDAKPSKVVSNEL--------------------- 532
             +H   D+ ++      DE   +  ++ +    +S  +                     
Sbjct: 531 VVVHKREDEEEVYNPNDHDEEMPTKCVEFRNRIEISCRVEFIEYEGISDGESTKKMLAGL 590

Query: 533 ----TVLVHGSAEATEHLKQHCLKHVCP--HVYTPQIEETIDVTSDLCAYKVQLSEKLMS 586
                ++VHGS + T  L  +   +      + TP   E ID + +   Y+V LS+ L++
Sbjct: 591 MPRQIIIVHGSRDDTRDLYAYFTDNGFKKDQLNTPVANELIDASVESFIYQVSLSDALLA 650

Query: 587 NVLFKKLGD-YEIAWVDAEVGKTEN 610
            + FK++ +   +AW+DA + + E+
Sbjct: 651 EIQFKEVSEGNSLAWIDARIQEKES 675


>gi|268558798|ref|XP_002637390.1| Hypothetical protein CBG19097 [Caenorhabditis briggsae]
          Length = 838

 Score =  327 bits (837), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 211/685 (30%), Positives = 352/685 (51%), Gaps = 85/685 (12%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++   SG  +E PL YL+ +D    L+DCGW++ F+    + L      I AVL+
Sbjct: 1   MTSIIKLKVFSGAKDEGPLCYLLQVDNDYILLDCGWDERFELKYFEELRPYIPKISAVLI 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLG LPY + + GL+APV+ T PVY++G + +YD   S   V EF  ++LDD+D
Sbjct: 61  SHPDPLHLGGLPYLVAKCGLTAPVYCTVPVYKMGQMFIYDLVYSHLDVEEFQHYSLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK-DGEDVIYAVDYNRRK 179
            AF+ V ++ Y+Q   L G   G+      AGH++GG++W+I +  GED+IY VD+N RK
Sbjct: 121 MAFEKVEQVKYNQTVVLKGDS-GVNFTAMPAGHMIGGSMWRICRITGEDIIYCVDFNHRK 179

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           ++HL+G   ++F RP +LIT A++    Q  R+ R E     I +T+R  G+ ++ +D+A
Sbjct: 180 DRHLSGCSFDNFNRPHLLITGAHHISLPQMKRKDRDEQLVTKILRTVRQKGDCMIVIDTA 239

Query: 239 GRVLELLLILEDYWAEHSL---NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS-R 294
           GRVLEL  +L+  WA        Y +  +++V+SS + + KS LEWM + + +   +S R
Sbjct: 240 GRVLELAYLLDQLWANQDAGLSTYNLVMMSHVASSVVQFAKSQLEWMDEKLFRYDSSSAR 299

Query: 295 DNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
            N F LK+V L+ +  EL      PK+VL S   +E GFS ++F++W +D +N V+ T R
Sbjct: 300 YNPFTLKNVNLVHSHLELIKIR-SPKVVLCSSQDMETGFSRELFLDWCADQRNGVILTAR 358

Query: 355 -GQFGTLARMLQADPPP---------KAVKVTMSRRVPLVGEELIAYEE-------EQTR 397
              F   AR+++              K + + + +RVPL GEEL+ Y+        E+TR
Sbjct: 359 PASFTLAARLVELAERANDGVLRNEDKHLSLLVRKRVPLEGEELLEYKRRKAERDAEETR 418

Query: 398 LKKEEALKASLVKEEESKA----------SLGPDNNLSGDPMVIDANNANASADVVEPHG 447
           ++ E A + +   E +              L   ++ S D +  D++  +  A       
Sbjct: 419 IRMERARRQAQANESDDSDDDDIAAPIVPRLSEKDHRSFDAIENDSHCFDIMA------- 471

Query: 448 GRYRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDY-IIKDEDMDQA------ 500
            ++ +     F   +    PM+P+ E   +WDD+GEVI P+DY +I   DM +       
Sbjct: 472 -KWDNQQKASFFKSTKKSFPMYPYIEEKVKWDDYGEVIKPEDYTVISKIDMRKGKNKDEP 530

Query: 501 -AMHIGGDDGKL------DEGSASLILDAKPSKVVSNEL--------------------- 532
             +H   D+ ++      DE   +  ++ +    +S  +                     
Sbjct: 531 VVVHKREDEEEVYNPNDHDEEMPTKCVEFRNRIEISCRVEFIEYEGISDGESTKKMLAGL 590

Query: 533 ----TVLVHGSAEATEHLKQHCLKHVCP--HVYTPQIEETIDVTSDLCAYKVQLSEKLMS 586
                ++VHGS + T  L  +   +      + TP   E ID + +   Y+V LS+ L++
Sbjct: 591 MPRQIIIVHGSRDDTRDLYAYFTDNGFKKDQLNTPVANELIDASVESFIYQVSLSDALLA 650

Query: 587 NVLFKKLGD-YEIAWVDAEVGKTEN 610
            + FK++ +   +AW+DA + + E+
Sbjct: 651 EIQFKEVSEGNSLAWIDARIQEKES 675



 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 33/98 (33%), Positives = 48/98 (48%), Gaps = 14/98 (14%)

Query: 611 GMLSLLPI-STPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRC-GEYVTIRKVG 668
           G L L P+     P H+++ V D K+++ K  L  KG + EF  G L   G   +IR   
Sbjct: 753 GTLILTPLPKKQIPVHQAIFVNDPKLSEFKNLLVDKGYKAEFFSGTLLINGGKCSIR--- 809

Query: 669 PAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                 G +G Q   +EG   +D+YK+R   Y QF +L
Sbjct: 810 ------GETGFQ---MEGAFTKDFYKLRKLFYDQFAVL 838


>gi|213407230|ref|XP_002174386.1| cleavage factor two Cft2/polyadenylation factor CPSF-73
           [Schizosaccharomyces japonicus yFS275]
 gi|212002433|gb|EEB08093.1| cleavage factor two Cft2/polyadenylation factor CPSF-73
           [Schizosaccharomyces japonicus yFS275]
          Length = 786

 Score =  326 bits (835), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 246/796 (30%), Positives = 394/796 (49%), Gaps = 137/796 (17%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL- 80
           L+ +DG + LID G     D SL  P   V    D +LLSH D  HLG L YA +     
Sbjct: 17  LLELDGVHILIDPG----SDNSLTHPSIDVVP--DLILLSHSDLAHLGGLVYACRHYNWK 70

Query: 81  SAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGK 140
           +A +++T PV  +G +TMYD   S          T+ D+D  F S+T L YSQ   L GK
Sbjct: 71  TAFIYATLPVINMGRMTMYDAIKSNLVTD----ITIADVDLVFDSITTLRYSQPASLMGK 126

Query: 141 GEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT-------VLESFVR 193
             GI +    AGH LGGT+W ITK+ E ++YAVD+N  K+KHLNGT       +LE   R
Sbjct: 127 CNGINITAFNAGHTLGGTLWSITKESESLVYAVDWNHSKDKHLNGTALYSNGQILEILTR 186

Query: 194 PAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 252
           P  L+TDA NAL + P R++R E   +A+  TL  GG+VLLP+D+A RV+EL   L+ +W
Sbjct: 187 PNTLVTDANNALISIPARKKRDEALIEAVMSTLLKGGSVLLPMDAASRVIELCYFLDTHW 246

Query: 253 AEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKS 310
           A     L++PIYFL+Y S+ TI Y KS +EWMGD+I + F  + ++    +H+  + + S
Sbjct: 247 ASSQPPLSFPIYFLSYSSAKTIGYAKSMIEWMGDNIVRDFGMN-ESLLEFRHIQTITHPS 305

Query: 311 ELDNAPDGPKLVLASMASLEAGFSHDIFVEWASD-VKNLVLFTERGQF--GTLARML--- 364
           +L     GPK+++A+  +LE+GFS ++ ++   D   NL+L T++ ++   +LA+     
Sbjct: 306 QLSQISPGPKVIIATSLTLESGFSQNVLLDIMPDNSNNLILLTQKSRYSENSLAKQFYRY 365

Query: 365 ----QADPPPKAVKVTM--------SRRVPLVGEELIAYEE-EQTRLKKE------EALK 405
                   P     V M            PL GEEL  ++E EQ++  ++      E   
Sbjct: 366 WERASRKSPENFSSVGMYFEQSIQVKHSEPLQGEELREFQEKEQSKRTRDAEDIALELRN 425

Query: 406 ASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDI-LIDGFVPPSTS 464
            +++ E+ES+ S   ++ L+  P + + N  +A+        G+  D+ L D  +    S
Sbjct: 426 RTILDEDESEESSSDEDELTQVPELSNTNLGSAAF-----MSGKTFDLNLRDPNIASLQS 480

Query: 465 VAPMFPFYENNSEWDDFGEVINPDDYIIK---------DEDMDQAAMHIGG--------D 507
              MFP+ E    +DD+GE++  +D+ ++         +E+ D A  H           +
Sbjct: 481 KFKMFPYVEKRRRFDDYGEILRQEDFAMEERTAGIVEGEENEDYAPAHESTGKRKWAEVN 540

Query: 508 DGKLDEGSASLILDAKPSKVVSN---------------------------------ELTV 534
           +G++ E   +  +   PSK+V+                                     V
Sbjct: 541 NGQISENQLNEDMPDVPSKIVTTTRYLKISCQVAFIDMEGLHDGRSLKTIIPQVNPRRLV 600

Query: 535 LVHGSAEATEHLKQHC--LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKK 592
           L+H + E    +K+ C  L      VY P  +E ++V+ D+ ++ ++LS++L+ ++++KK
Sbjct: 601 LIHATDEERADMKKTCAALTAFTKDVYCPDYKEVVNVSIDVNSFNMKLSDELVKSLIWKK 660

Query: 593 LGDYEIAWVDAEVGKTEN----GMLSLLPISTP-----------------APPHKSVLVG 631
           LG+YE+A + A++   EN       S  P+                    AP    + VG
Sbjct: 661 LGNYEVAHLMAKIRMPENVDEEAEESKEPVDPKDNLPILDSLKTQQDFALAPRAAPIFVG 720

Query: 632 DLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCE 690
           ++++A L+  L  +GI VE  G G L CG  V IRK+             +IVIEG +  
Sbjct: 721 NVRLAALRKTLMDQGISVELKGEGVLLCGGIVAIRKLDNG----------RIVIEGGISN 770

Query: 691 DYYKIRAYLYSQFYLL 706
            +++IR  +Y    ++
Sbjct: 771 RFFEIRKTIYDTLAMV 786


>gi|430813604|emb|CCJ29043.1| unnamed protein product [Pneumocystis jirovecii]
 gi|430813606|emb|CCJ29045.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 772

 Score =  324 bits (830), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 233/784 (29%), Positives = 390/784 (49%), Gaps = 128/784 (16%)

Query: 16  ENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAM 75
           E   + ++S      L+D G ND     LL    ++    D +L SH D  H+G+  +  
Sbjct: 11  ERSSASVLSFGEIKILLDPGAND-----LLSEFLELDFIPDLILFSHSDVSHVGSFVHGF 65

Query: 76  KQLGL-SAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQN 134
           K  G    P+++T P++ +G +TM D Y   + + + +  +  DID+AF S+  L YSQ 
Sbjct: 66  KHSGWHDVPIYATLPIFNMGRVTMSDCY---KNIMD-NTISTKDIDNAFDSIITLRYSQP 121

Query: 135 YHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHL-------NGTV 187
             LSGK  GI +  + +GH LGGT+WKITKD E+++Y V++N  K+ HL       NGT+
Sbjct: 122 ISLSGKLNGISITAYNSGHSLGGTIWKITKDSENIVYCVNWNHSKDSHLNGSILYSNGTI 181

Query: 188 LESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLL 246
           L++ +RP +LITDA N+  + P R++R E F D+I  TL   GNVL+P D+A R LE   
Sbjct: 182 LDALIRPTILITDAINSNISIPSRKKRTEAFFDSIKNTLAQQGNVLIPTDAATRSLEFCW 241

Query: 247 ILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLL 306
           IL+ YW +H+L YPIYFL++  +  I Y +S +EWM DSI   + +S  + F   +V ++
Sbjct: 242 ILDRYWKQHNLQYPIYFLSHTGNKAISYAQSMIEWMSDSIISEYGSS-GSVFEFTYVKVI 300

Query: 307 INKSELDNAPDGPKLVLASMASLEAGFSHDIFVE-WASDVKNLVLFTERGQF--GTLARM 363
            N+ +  +   GPK++LA+ ++++ GFS  IF++  A D KNLV+ +++  +   +L++ 
Sbjct: 301 TNEFQFLSMVSGPKVILATSSNMDCGFSQKIFLDSIAKDSKNLVILSQKSIYYENSLSKD 360

Query: 364 L------------QADPPPKAVK----VTMSRRVPLVGEELIAYEEEQTRLKKEEALKAS 407
           L            Q  PP   +     VT+   VPLVG EL  Y+E++   +++EA  A 
Sbjct: 361 LLDRWNLAIEHSDQLIPPAVILNFNRTVTIRTSVPLVGSELEKYQEKEKLRREKEA--AK 418

Query: 408 LVKEEESK------------ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILI 455
           L+ E +++             S     +   D M+     A  SA ++    G +   L 
Sbjct: 419 LIMELQNRDLFDSSDSDLNDDSNDRKTHFRNDSMI-----AKGSASLLT--SGVHDLYLQ 471

Query: 456 DGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYI-IKDEDMDQAA------------- 501
              +   +    MFP  E    +DDFGE+I P+ +  I +ED++  A             
Sbjct: 472 TNEIRKMSPRFKMFPTLEKRRRFDDFGEIIIPEKFFRIIEEDLEFNANNELNKSINTMTK 531

Query: 502 ----------MHIGGDDGKLDEGSASLIL-------------------DAKPSK----VV 528
                     +  G  D  ++  S ++I                    D K  K    +V
Sbjct: 532 KRKWAGISNNIQNGNIDKDINVPSKTIITEEKILIKCSVRYIDMEGLHDGKSLKTIIPMV 591

Query: 529 SNELTVLVHGSAEATEHLKQHC--LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMS 586
           +    VL++ + EA +++   C  L      +Y+P   E + +   L +Y ++LS+ +++
Sbjct: 592 NPRKLVLINSTQEAKDNMMATCRSLTSFTNDIYSPLQGEVLKIGIKLNSYNLKLSDNIIN 651

Query: 587 NVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPPHKSV---------LVGDLKMAD 637
            + +KKLGDY ++ V  ++  + +   + LPI      H ++          VGD+K+  
Sbjct: 652 TLRWKKLGDYNVSHVIGKLKLSADFTETNLPILEILSTHSNIRNIPQSHPLFVGDVKLTQ 711

Query: 638 LKPFLSSKGIQVEFAG-GALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIR 696
           +K  L  +G   E  G G L C   VT+RK+G     GG     ++++EG + +++Y +R
Sbjct: 712 VKQLLQDQGHVAELIGEGVLLCDGLVTVRKIG-----GG-----KVILEGGVSQEFYDVR 761

Query: 697 AYLY 700
             +Y
Sbjct: 762 KIVY 765


>gi|298708373|emb|CBJ48436.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 997

 Score =  320 bits (819), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 159/378 (42%), Positives = 232/378 (61%), Gaps = 16/378 (4%)

Query: 5   VQVTPL----SGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           V  TPL     G     P+S ++ + G   L+DCGW+ HFD +LL+PL +V   ID VL+
Sbjct: 127 VVFTPLYGCDEGATGVEPVSSILEVGGVTILLDCGWDIHFDTALLEPLREVVKRIDLVLI 186

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD  HLG LPYA  +LG+ A V++T PV+++G + +YD Y+SR     F  F LDD+D
Sbjct: 187 SHPDLEHLGGLPYAFGKLGMRAKVYATLPVWKMGQMAVYDAYISRTHEGNFQAFDLDDVD 246

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGE--DVIYAVDYNRR 178
           +AF     L +SQ+   SG+G G+ + P+ AG ++G  VW+++   E  D++YA  YN  
Sbjct: 247 AAFARFKTLKFSQHLTFSGRGAGVTITPYAAGRMIGAAVWRVSWQTEDNDIVYATAYNND 306

Query: 179 KEKHLNGTVLESFVRPAVLITDAYNALH-----NQPPRQQREMFQ----DAISKTLRAGG 229
            E+HL  + L +  RP+VLITDA+NAL       + P  +R++ +      +  T+R GG
Sbjct: 307 HERHLRASALGTLTRPSVLITDAHNALTGGGMIRKDPSSKRKLREVELISTVMDTVRGGG 366

Query: 230 NVLLPVDSAGRVLELLLILEDYWAEHSL-NYPIYFLTYVSSSTIDYVKSFLEWMGDSITK 288
           NVLLP D+AGRVLELL++L DYW +H L +Y +  L   + +T ++ KS LEWM + I +
Sbjct: 367 NVLLPTDTAGRVLELLVLLNDYWQKHRLGSYKLVLLHNTAFNTCEFAKSQLEWMSEDIGR 426

Query: 289 SFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNL 348
           +F+  R N F L++V ++ +  ELD   D PK+V+A+  SL+ GFS  + + WAS   N 
Sbjct: 427 AFDLQRSNPFELRNVHIMHSLEELDELGDDPKVVMATDMSLDFGFSKALLLRWASGGANT 486

Query: 349 VLFTERGQFGTLARMLQA 366
           +L T RG   T AR L A
Sbjct: 487 ILLTGRGHGNTTARTLIA 504


>gi|358338982|dbj|GAA43367.2| cleavage and polyadenylation specificity factor subunit 2, partial
           [Clonorchis sinensis]
          Length = 995

 Score =  307 bits (786), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 143/334 (42%), Positives = 211/334 (63%), Gaps = 5/334 (1%)

Query: 25  IDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPV 84
           +D F+ L+DCGW+D  D   ++ L++    IDAVLLSH    HLG LP+ +   GL  PV
Sbjct: 1   VDEFHCLLDCGWSDGLDKEYVKRLTQWTRHIDAVLLSHQSLRHLGLLPFLVGSCGLKCPV 60

Query: 85  FSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGI 144
           ++T PVY++G LT+YD Y S     +F  FTLDD+D+AF  V ++ Y Q  +L G+G G+
Sbjct: 61  YATTPVYKMGQLTLYDFYQSMYASEDFTAFTLDDVDAAFDLVVQVKYQQTINLPGRGRGL 120

Query: 145 VVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNA 204
            + P  +GH LGGT+WK+ K+  D++YAVD+N +KE+HLNG   ++ +RP +LI DA N 
Sbjct: 121 CITPLPSGHTLGGTIWKLVKEDTDIVYAVDFNHKKERHLNGATFDACMRPHLLIMDASNT 180

Query: 205 LHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS---LNYP 260
           ++  P R+ R E  + +I KTLR GGN+L+ VD+AGR LE+   LE  W       + Y 
Sbjct: 181 MYTHPRRKDRDETLRHSILKTLRRGGNILVAVDTAGRCLEVAHFLEQCWLNQDSGMMAYG 240

Query: 261 IYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPK 320
           +  L++V+ + +D+ KS +EWM + + ++FE  R N F  +HV L     +LD  P+ PK
Sbjct: 241 LAMLSFVAFNVVDFAKSMVEWMSEKVMRTFEDQRTNPFHFRHVQLCHTLEQLDTVPE-PK 299

Query: 321 LVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
           +VLAS + L  GF+  +F EWA +  N V+ T R
Sbjct: 300 VVLASASDLSCGFARQLFAEWADNDLNTVILTSR 333



 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 60/241 (24%), Positives = 100/241 (41%), Gaps = 61/241 (25%)

Query: 505 GGDDGKLDEGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQHC---LKHVCPHVYTP 561
           G  DG   E    +++  +P +++      LV  S   TE L  +C   +      V+TP
Sbjct: 656 GRSDG---EAMKRIVVGLRPQELI------LVGNSRADTEQLATYCRTVMLLASNLVHTP 706

Query: 562 QIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENG---------- 611
                I+ T +   Y+ ++ + L+S++ F K+ DYE+AWV+A +  T+N           
Sbjct: 707 SACSVINCTKEGDIYQARMKDSLVSSLRFTKIRDYELAWVEANIDLTDNASSDPDHSESA 766

Query: 612 ----------------------------MLSLLPIST-PAPPHKSVLVGDLKMADLKPFL 642
                                        L +L + T P   HK+V V + K++DLK  L
Sbjct: 767 SDDLNMPNASGDDNPPSPPKTRSSLAADRLPVLGLPTGPVGAHKTVFVNEPKLSDLKQLL 826

Query: 643 SSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQ 702
            + G+  EF  G L     V I++          S   ++++EG L   Y+ +R  LY Q
Sbjct: 827 LANGLVAEFVSGVLVVDNCVAIKR----------SEAGKLLLEGLLSRTYFTVRQVLYQQ 876

Query: 703 F 703
            
Sbjct: 877 L 877


>gi|256077070|ref|XP_002574831.1| cleavage and polyadenylation specificity factor [Schistosoma
           mansoni]
          Length = 928

 Score =  306 bits (785), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 147/359 (40%), Positives = 220/359 (61%), Gaps = 6/359 (1%)

Query: 1   MGTSV-QVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVL 59
           M TS+ ++  LSG  +     YL+ +D F+ L+DCGW +  D   ++ +SK A  +DAVL
Sbjct: 1   MATSIIKLHTLSGAGDNGSPCYLLQVDEFHCLLDCGWCEKLDSDYVKEVSKWAKHVDAVL 60

Query: 60  LSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDI 119
           LSH    HLG LPY +   GL+ PV++T PVY++G + MYD + SR    +F  +TLDD+
Sbjct: 61  LSHQSLRHLGLLPYLVGTCGLNCPVYATTPVYKMGQMFMYDFFQSRHASEDFSHYTLDDV 120

Query: 120 DSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRK 179
           D AF  V ++ Y Q   L G+G G+ + P  +GH LGGT+WK+ K+   ++YA+D+N +K
Sbjct: 121 DLAFDHVHQVKYQQTISLHGRGHGLCITPLPSGHTLGGTIWKLVKEDTSIVYALDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG   ++ +RP +LI D  N L+ QP R+ R E  +  + K+LR GGNVL+ VD+A
Sbjct: 181 ERHLNGATFDACIRPHLLIMDGSNTLYTQPRRKDRDENLRQTVLKSLRRGGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GR LE+   LE  W       + Y +  L YV+ + +D+ KS +EWM + + +SFE  R 
Sbjct: 241 GRCLEVAHFLEQCWLNQESGLMAYGLAMLNYVALNVVDFAKSMVEWMSEKVMRSFEDQRS 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
           N F  +H+ L     +LD A   PK+VL+S++ L  GFS  +F EWA +  N ++ T +
Sbjct: 301 NPFHFRHMQLCHTLEQLD-AVSEPKVVLSSLSDLSCGFSRQLFAEWADNDLNTIILTSQ 358



 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 63/250 (25%), Positives = 109/250 (43%), Gaps = 67/250 (26%)

Query: 505 GGDDGKLDEGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQHCLKHVC---PHVYTP 561
           G  DG   E    +++  +P +++      LV  +A A +HL  +C   +     +++ P
Sbjct: 698 GRSDG---EAMKRILIGLRPQEII------LVGNNAPAIDHLANYCRGVMLLDPNYIHIP 748

Query: 562 QIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVG--------------- 606
              E ++ T +   Y+ ++ + L+S++ F K+ DYE+AWV+A V                
Sbjct: 749 HPREIVNCTKEGDIYQARMKDSLVSSLKFTKIRDYELAWVEATVSLDDKFDYHIKEKRNN 808

Query: 607 -----------------KTENGM------------LSLLPIST-PAPPHKSVLVGDLKMA 636
                             T N +            L +L + T P   HK+V V + K++
Sbjct: 809 NNTGNNDNDDDNGDVEMSTGNNLELRSRTPLAADQLPVLSLPTGPIGQHKTVFVNEPKLS 868

Query: 637 DLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIR 696
           DLK  L S+G+  EF  G L     V I++          S   ++++EG LC  Y+++R
Sbjct: 869 DLKQLLLSQGLMAEFVSGILVVDNCVAIKR----------SEAGKLLLEGLLCGTYFEVR 918

Query: 697 AYLYSQFYLL 706
             LY QF +L
Sbjct: 919 RILYQQFAIL 928


>gi|449662070|ref|XP_004205466.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like, partial [Hydra magnipapillata]
          Length = 568

 Score =  305 bits (782), Expect = 4e-80,   Method: Compositional matrix adjust.
 Identities = 200/586 (34%), Positives = 305/586 (52%), Gaps = 79/586 (13%)

Query: 182 HLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGR 240
           HLNG VLE+  RPA+LITD+Y AL NQ  R++R++   ++I   LR  GNVLL VD+AGR
Sbjct: 1   HLNGAVLETLSRPALLITDSYAALCNQERRKERDIQLMNSILSALRQDGNVLLAVDTAGR 60

Query: 241 VLELLLILEDYWA--EHSLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA 297
           +LEL+ +L+  W+  E  L+ Y +  L  VS + +++ KS +EWM D + KSFE  R N 
Sbjct: 61  ILELMQLLDQMWSAKESGLSVYSLALLNNVSYNVVEFAKSQVEWMSDRMMKSFEVDRRNP 120

Query: 298 FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQF 357
           F  KH+TL     ELD  P  PK+VLAS A +  GFS D+FV+WAS+ KN V+FT +   
Sbjct: 121 FAFKHITLCHFLKELDQLP-SPKVVLASAADMNCGFSKDLFVQWASNPKNSVIFTFKTSP 179

Query: 358 GTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKAS 417
           G+LAR L  +P  ++V++ + +RV L G EL  Y E +    ++  L+  L + +  + +
Sbjct: 180 GSLARTLIDNPKIESVELEVFKRVRLEGVELSQYLEVEKEKARQAKLQRKLTEVDVRQEN 239

Query: 418 LGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSE 477
           +  D + S + M  +  N +    ++     R++             + PMFPF E   +
Sbjct: 240 VFKDESESEEEMEEENLNKSKYDLMITNEKLRHKSSFF-----KQAKIYPMFPFKEERLK 294

Query: 478 WDDFGEVINPDDYIIKDED-MDQAAMHIGGDDGKLDEGSASLILDAKPSKVVSNELTV-- 534
           WDD+GE+I P+DY+I + + M++    I  +D K  E   +L +   P+K VS  + V  
Sbjct: 295 WDDYGEIIRPEDYVIIENNLMEEEGPKITIEDMK--EDLEALEIKEPPTKSVSEMVKVDV 352

Query: 535 -------------------------------LVHGSAEATEHLKQHC---LKHVCPHVYT 560
                                          L+HGS  ATE L ++C    +     VYT
Sbjct: 353 RCKISYIDFEGRSDGESVRRILSIVKPRQLILIHGSPAATEALSRYCQTSTQFNVSKVYT 412

Query: 561 PQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENG--------- 611
           P   E +D T +   Y+V+L + L+S++ F    D E+AWVD ++     G         
Sbjct: 413 PYTNEMVDATRESHIYQVKLKDSLVSSLKFAVARDTELAWVDGQLVMEARGEKFNQIEQE 472

Query: 612 ------MLSLLPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGE 660
                    ++P+    PP     H +V + + +++D K  L+  GIQ EF GG L C  
Sbjct: 473 NSEKVEKQDVVPVLEQLPPEMIPGHATVFIDEPRLSDFKQVLTKAGIQAEFTGGVLVCNN 532

Query: 661 YVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
            V +R+    G++G      +I IEG LCE+YY IR  LY Q+ ++
Sbjct: 533 VVAVRR----GEQG------KISIEGGLCEEYYVIRQLLYDQYAIV 568


>gi|449018596|dbj|BAM81998.1| cleavage and polyadenylation specific factor 2, 100kD subunit
           [Cyanidioschyzon merolae strain 10D]
          Length = 884

 Score =  304 bits (779), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 167/412 (40%), Positives = 254/412 (61%), Gaps = 19/412 (4%)

Query: 1   MGTSVQVTPLSGVFNENP-LSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVAST-IDAV 58
           M +S++VTPL G     P L  ++ ID   FL+DCGWND FD +LL+PL  V +  IDAV
Sbjct: 1   MASSIRVTPLYGAHTSAPPLCTVLEIDDGVFLLDCGWNDRFDVALLEPLRPVITRGIDAV 60

Query: 59  LLSHPDTLHLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTL 116
            L+HPD  HLGALPY + +LGL  S P+++T PV  LG + +YD +  R    +F+ FTL
Sbjct: 61  FLTHPDLAHLGALPYLVGKLGLPASVPIYATTPVQILGQMFLYDAHQHRYYGEDFETFTL 120

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
           DD+D AF+ +  + Y Q   L+   + +    + AGHLLGG +WK  K+ E+++Y VD N
Sbjct: 121 DDVDEAFERMRPVKYQQVIELA---QNVFATAYPAGHLLGGAIWKFQKESEEIVYCVDVN 177

Query: 177 RRKEKHLNG--TVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLL 233
            R+E+ LNG  +  +   +P+ LI  A   L    P Q++E    +A+ +TLR GG+VL+
Sbjct: 178 HRRERLLNGCASTPQLITKPSHLIVGASGVL--TAPSQKKETDLWEAVVETLRGGGDVLM 235

Query: 234 PVDSAGRVLELLLILEDYWAEH---SLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF 290
           PVDSAGR LELL+  +++W  H   +  YP+ F  +V   TI++ KS +EWM D++  +F
Sbjct: 236 PVDSAGRCLELLVAADEFWTAHPDVAALYPVVFAQHVGIHTIEFAKSLIEWMSDAVVSAF 295

Query: 291 ETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEW-ASDVKNLV 349
           ++ R+N F L+HV ++    + D  P  PK+V+A + SL+ GFS  +F++  A+D + +V
Sbjct: 296 DSRRENPFRLRHVQVVHGLDQADALP-SPKVVMAPLPSLDYGFSRVLFLQRIAADPRAMV 354

Query: 350 LFTERGQFGTLARMLQADPPPKAVK--VTMSRRVPLVGEELIAYEEEQTRLK 399
           L ++R + GT A  L  +     V+  +T + RVPL GEEL  ++ EQ + +
Sbjct: 355 LMSDRLESGTFAFRLAVEKEKLRVREPLTYAERVPLQGEELERWQREQEKAR 406



 Score = 68.6 bits (166), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 54/236 (22%), Positives = 97/236 (41%), Gaps = 53/236 (22%)

Query: 518 LILDAKPSKVVSNELTVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYK 577
           LI+   P +V+      ++HGS   T  L ++  K     +Y P+  E +DV+SD   Y+
Sbjct: 655 LIVSMAPQRVI------IIHGSERETAALTEYLGKKNFTRLYAPRAREMVDVSSDTSVYR 708

Query: 578 VQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPPHKSV--------- 628
           ++L + L+    ++++ DYE+AW D  +    +G L L+ +       + +         
Sbjct: 709 IKLDDSLLRRCFWRRMQDYELAWFDGYIQTDPDGQLRLVSVERQTEQEQQLPEGTESGVD 768

Query: 629 -----------------LVGDLKMADLKPF-LSSKGIQVEFAG---GALRCGEY--VTIR 665
                            LV   + A+   F L ++  QV       G LR  +   +  +
Sbjct: 769 AAWLAAKTTDAASAATALVDGDRTANTTTFALVTERTQVGHLNVFVGDLRLSDLKEIMTK 828

Query: 666 KVGPAGQKGGGSGTQQ---------------IVIEGPLCEDYYKIRAYLYSQFYLL 706
            + PA   GG    +                +VIEG L  +Y+ +R  +YSQ+ +L
Sbjct: 829 SLMPAEFAGGALCVENDRPPSIVLVRKRQHDLVIEGSLSAEYFDVRDLVYSQYMIL 884


>gi|26344199|dbj|BAC35756.1| unnamed protein product [Mus musculus]
          Length = 296

 Score =  299 bits (765), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 142/296 (47%), Positives = 202/296 (68%), Gaps = 5/296 (1%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSVDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALP+A+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPFAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFE 291
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFE 296


>gi|19112240|ref|NP_595448.1| cleavage factor two Cft2/polyadenylation factor CPSF-73 (predicted)
           [Schizosaccharomyces pombe 972h-]
 gi|74582548|sp|O74740.1|CFT2_SCHPO RecName: Full=Cleavage factor two protein 2
 gi|3738153|emb|CAA21254.1| cleavage factor two Cft2/polyadenylation factor CPSF-73 (predicted)
           [Schizosaccharomyces pombe]
          Length = 797

 Score =  291 bits (744), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 237/804 (29%), Positives = 380/804 (47%), Gaps = 156/804 (19%)

Query: 23  VSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL-S 81
           + +DG +  ID G +D    SL  P  +V    D +LLSH D  H+G L YA  +    +
Sbjct: 18  IELDGIHIYIDPGSDD----SLKHP--EVPEQPDLILLSHSDLAHIGGLVYAYYKYDWKN 71

Query: 82  APVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKG 141
           A +++T P   +G +TM D  +    +S+    +  D+D+ F S+  L Y Q   L GK 
Sbjct: 72  AYIYATLPTINMGRMTMLDA-IKSNYISDM---SKADVDAVFDSIIPLRYQQPTLLLGKC 127

Query: 142 EGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT-------VLESFVRP 194
            G+ +  + AGH LGGT+W + K+ E V+YAVD+N  K+KHLNG        +LE+  RP
Sbjct: 128 SGLTITAYNAGHTLGGTLWSLIKESESVLYAVDWNHSKDKHLNGAALYSNGHILEALNRP 187

Query: 195 AVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 253
             LITDA N+L + P R++R E F +++  +L  GG VLLPVD+A RVLEL  IL+++W+
Sbjct: 188 NTLITDANNSLVSIPSRKKRDEAFIESVMSSLLKGGTVLLPVDAASRVLELCCILDNHWS 247

Query: 254 EHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSE 311
                L +PI FL+  S+ TIDY KS +EWMGD+I + F  + +N    +++  + + S+
Sbjct: 248 ASQPPLPFPILFLSPTSTKTIDYAKSMIEWMGDNIVRDFGIN-ENLLEFRNINTITDFSQ 306

Query: 312 LDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKN-LVLFTERG------------QFG 358
           + +   GPK++LA+  +LE GFS  I ++  S+  N L+LFT+R             ++ 
Sbjct: 307 ISHIGPGPKVILATALTLECGFSQRILLDLMSENSNDLILFTQRSRCPQNSLANQFIRYW 366

Query: 359 TLARMLQADPP-------PKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKE 411
             A   + D P        +AVK+    + PL GEEL +Y+E +   + ++A   +L   
Sbjct: 367 ERASKKKRDIPHPVGLYAEQAVKIKT--KEPLEGEELRSYQELEFSKRNKDAEDTAL--- 421

Query: 412 EESKASLGPDNNLSGDPMVIDANNANASADVVEPH----------GGRYRDILIDGFVPP 461
           E    ++  ++  S      D  + N       PH          G  +   L D  V  
Sbjct: 422 EFRNRTILDEDLSSSSSSEDDDLDLNTEV----PHVALGSSAFLMGKSFDLNLRDPAVQA 477

Query: 462 STSVAPMFPFYENNSEWDDFGEVINPDDYIIKDEDMDQAAMHIGGDDGKLDEGSA----S 517
             +   MFP+ E     D++GE+I   D+ + +E  +   +    DD  L   +     S
Sbjct: 478 LHTKYKMFPYIEKRRRIDEYGEIIKHQDFSMINEPANTLELENDSDDNALSNSNGKRKWS 537

Query: 518 LILDA------------KPSKVVSNELT-------------------------------- 533
            I D              PSK++++E T                                
Sbjct: 538 EINDGLQQKKEEEDEDEVPSKIITDEKTIRVSCQVQFIDIEGLHDGRSLKTIIPQVNPRR 597

Query: 534 -VLVHGSAEATEHLKQHC--LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLF 590
            VL+H S E  E +K+ C  L      VY P   E I+V+ D+ A+ ++L++ L+ N+++
Sbjct: 598 LVLIHASTEEKEDMKKTCASLSAFTKDVYIPNYGEIINVSIDVNAFSLKLADDLIKNLIW 657

Query: 591 KKLGDYEIAWVDAEVGKTENGM---------------------------------LSLLP 617
            K+G+ E++ + A+V  ++                                    L+L  
Sbjct: 658 TKVGNCEVSHMLAKVEISKPSEEEDKKEEVEKKDGDKERNEEKKEEKETLPVLNALTLRS 717

Query: 618 ISTPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKVGPAGQKGGG 676
               AP    +LVG++++A L+  L  +GI  E  G G L CG  V +RK+      GG 
Sbjct: 718 DLARAPRAAPLLVGNIRLAYLRKALLDQGISAELKGEGVLLCGGAVAVRKLS-----GG- 771

Query: 677 SGTQQIVIEGPLCEDYYKIRAYLY 700
               +I +EG L   +++IR  +Y
Sbjct: 772 ----KISVEGSLSNRFFEIRKLVY 791


>gi|195145330|ref|XP_002013649.1| GL24248 [Drosophila persimilis]
 gi|194102592|gb|EDW24635.1| GL24248 [Drosophila persimilis]
          Length = 583

 Score =  288 bits (737), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 195/608 (32%), Positives = 303/608 (49%), Gaps = 100/608 (16%)

Query: 174 DYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVL 232
           D   RKE+HL+G  L+   RP++LITDAYNA + Q  R+ R E     I +T+R  GNVL
Sbjct: 1   DSTTRKERHLSGCELDRLQRPSLLITDAYNAQYQQARRRARDEKLMTNILQTVRNNGNVL 60

Query: 233 LPVDSAGRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           +  D+AGR+LEL  +L+  W       + Y +  L  VS + +++ KS +EWM D +TK+
Sbjct: 61  IAADTAGRMLELAHMLDQLWKNKESGLMAYSLALLNNVSYNVVEFAKSQIEWMSDKLTKA 120

Query: 290 FETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLV 349
           FE +R+N F  KH+ L    +++   P GPK+VLAS   LE+GF+ D+F++WAS+  N +
Sbjct: 121 FEGARNNPFQFKHIQLCHTLADVYKLPAGPKVVLASTPDLESGFTRDLFIQWASNANNSI 180

Query: 350 LFTERGQFGTLA-RMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASL 408
           + T R   GTLA  +++   P + +++ + RRV L G EL  Y   +T+ +K   L A  
Sbjct: 181 ILTTRTSPGTLAMELVENYAPGRQIELDVRRRVELEGAELEEY--LRTQGEKINPLIAKP 238

Query: 409 VKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPM 468
             EEES +    D         I+ +      D+V    GR+      GF   +     M
Sbjct: 239 EPEEESSSESEDD---------IEMSVITGKHDIVVRPEGRHH----SGFFKSNKRHHVM 285

Query: 469 FPFYENNSEWDDFGEVINPDDYIIKD-------------EDMDQAAMHIGGDDGKLDEGS 515
           FP++E   ++D++GE+IN DDY I D             E++ +    IG +        
Sbjct: 286 FPYHEEKIKYDEYGEIINLDDYRIADMNNTEFPPEEQNKENVKKEEPGIGIEQQANGAMD 345

Query: 516 ASLILDAKPSKVVSNELT---------------------------------VLVHGSAEA 542
             + L  KP+K+++   T                                 ++VHG+ E 
Sbjct: 346 TDVQLLEKPTKLINQRKTIEVNAQIQRIDFEGRSDGESMLKILSQLRPRRVIVVHGTEEG 405

Query: 543 TEHLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVD 602
           T+ + +HC ++V   V+TPQ  E IDVT+++  Y+V+L+E L+S + F+K  D E+AWVD
Sbjct: 406 TQVVAKHCEQNVGARVFTPQKGEIIDVTTEIHIYQVRLTEGLVSQLQFQKGKDAEVAWVD 465

Query: 603 AEVG----------------------KTENGMLSLLPIST-PAPPHKSVLVGDLKMADLK 639
             +G                        E   L+L  +     P H SVL+ +LK++D K
Sbjct: 466 GRLGMRLKAIDAPPTAMDVTVEQDAAMQEGKTLTLETLEEDEIPVHNSVLINELKLSDFK 525

Query: 640 PFLSSKGIQVEFAGGALRCGE-YVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAY 698
             L    I  EF+GG L C    + +R+V             ++ +EG L E+YYKIR  
Sbjct: 526 QILLRNNINSEFSGGVLWCTNGTLALRRVDAG----------KVAMEGCLSEEYYKIREL 575

Query: 699 LYSQFYLL 706
           LY Q+ ++
Sbjct: 576 LYEQYAIV 583


>gi|119601889|gb|EAW81483.1| cleavage and polyadenylation specific factor 2, 100kDa, isoform
           CRA_c [Homo sapiens]
          Length = 690

 Score =  287 bits (735), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 138/284 (48%), Positives = 194/284 (68%), Gaps = 5/284 (1%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFL 279
           GRVLEL  +L+  W        +Y    L  VS + +++ KS L
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQL 284



 Score =  111 bits (277), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 94/384 (24%), Positives = 162/384 (42%), Gaps = 126/384 (32%)

Query: 433 ANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAPMFPFYENNSEWDDFGEV 484
           ++ ++   D+ +P   + + D+++ G       F   +    PMFP  E   +WD++GE+
Sbjct: 323 SDESDIEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYPMFPAPEERIKWDEYGEI 382

Query: 485 IN-----PDDYIIK-------------------DEDMDQ-------------AAMHIGGD 507
           I      P+D+++                    DE MDQ              ++ I   
Sbjct: 383 IKDLLFRPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKCISTTESIEIKAR 442

Query: 508 DGKLD-EGSASLILDAKPSKVVSNELT----VLVHGSAEATEHLKQHCL----KHVCPHV 558
              +D EG +    D    K + N++     ++VHG  EA++ L + C     K +   V
Sbjct: 443 VTYIDYEGRS----DGDSIKKIINQMKPRQLIIVHGPPEASQDLAECCRAFGGKDI--KV 496

Query: 559 YTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVGKTENGML- 613
           Y P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D      V K + G++ 
Sbjct: 497 YMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDMRVSKVDTGVIL 556

Query: 614 ----------------------------------------------SLLPISTPAPPH-- 625
                                                          ++P   P PPH  
Sbjct: 557 EEGELKDDGEDSEMQVEAPSDSSVIAQQKAMKSLFGDDEKETGEESEIIPTLEPLPPHEV 616

Query: 626 ---KSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQI 682
              +SV + + +++D K  L  +GIQ EF GG L C   V +R+          + T +I
Sbjct: 617 PGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR----------TETGRI 666

Query: 683 VIEGPLCEDYYKIRAYLYSQFYLL 706
            +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 667 GLEGCLCQDFYRIRDLLYEQYAIV 690


>gi|444714932|gb|ELW55806.1| Cleavage and polyadenylation specificity factor subunit 2 [Tupaia
           chinensis]
          Length = 723

 Score =  285 bits (730), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 137/284 (48%), Positives = 194/284 (68%), Gaps = 5/284 (1%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALP+A+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPFAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFL 279
           GRVLEL  +L+  W        +Y    L  VS + +++ KS L
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQL 284



 Score = 97.4 bits (241), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 83/341 (24%), Positives = 142/341 (41%), Gaps = 115/341 (33%)

Query: 433 ANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAPMFPFYENNSEWDDFGEV 484
           ++ ++A  DV +P   + + D+++ G       F   +    PMFP  E   +WD++GE+
Sbjct: 323 SDESDAEEDVDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYPMFPAPEERIKWDEYGEI 382

Query: 485 INPDDYII-------------------KDEDMDQ-------------AAMHIGGD----- 507
           I P+D+++                    DE MDQ              ++ I        
Sbjct: 383 IKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKCISTTESIEIKARVTYID 442

Query: 508 -DGKLDEGSASLILDA-KPSKVVSNELTVLVHGSAEATEHLKQHCL----KHVCPHVYTP 561
            +G+ D  S   I++  KP +++      +VHG  EA++ L + C     K +   VY P
Sbjct: 443 YEGRSDGDSIKKIINQMKPRQLI------IVHGPPEASQDLAECCRAFGGKDI--KVYMP 494

Query: 562 QIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVGKTENGML---- 613
           ++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D      V K + G++    
Sbjct: 495 KLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDMRVSKVDTGVILEEG 554

Query: 614 -------------------------------------------SLLPISTPAPP-----H 625
                                                       ++P   P PP     H
Sbjct: 555 ELKDDGEDSEMQVDAPSDSSAIAQQKAMKSLFGDDEKETGEESEIIPTLEPLPPHEVPGH 614

Query: 626 KSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRK 666
           +SV + + +++D K  L  +GIQ EF GG L C   V +R+
Sbjct: 615 QSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR 655


>gi|302833565|ref|XP_002948346.1| hypothetical protein VOLCADRAFT_31342 [Volvox carteri f.
           nagariensis]
 gi|300266566|gb|EFJ50753.1| hypothetical protein VOLCADRAFT_31342 [Volvox carteri f.
           nagariensis]
          Length = 375

 Score =  283 bits (723), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 160/377 (42%), Positives = 244/377 (64%), Gaps = 21/377 (5%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M TSV+ TPLSGV  E+PL YL+ ID F  L+DCGW+++FD S L+P+ +V   ++AVLL
Sbjct: 1   METSVRFTPLSGVDAESPLCYLLEIDSFTILLDCGWDENFDESALEPIKRVLPRVNAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVS--EFDLFTLDD 118
           SHPD  HLGALPY + + GL+AP+FST+PV R+G + M++ YL+++  +  +F +F LDD
Sbjct: 61  SHPDVAHLGALPYLVGKCGLTAPIFSTKPVRRMGEMFMFESYLAKQASTSIDFAIFDLDD 120

Query: 119 IDSAFQ---SVTRLTYSQNYHLSGK-----GEGIVVAPHVAGHLLGGTVWKITKD-GEDV 169
           +D+AF+     T L +SQ + L        G GI +A H AG   GG VW+I+   GE+V
Sbjct: 121 VDAAFRLNPRWTELRFSQRHQLLAAMPATAGGGIAIAAHAAGRYPGGAVWRISLGCGEEV 180

Query: 170 IYAVDYNRRKEKHLNGTVLESFV---RPAVLITDAYNALHNQPPRQQR-EMFQDAISKTL 225
           +YAVDYN RKE+ LN T L+  +   +PA+LI+D  N L     R +R E F DAI+ T+
Sbjct: 181 VYAVDYNHRKERLLNRTNLDELLSSQQPALLISDCLNGLTENTDRHRRDEEFLDAITATV 240

Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWA----EHSLNYPIYFLTYVSSSTIDYVKSFLEW 281
            A G+VL+P D+AGRVLEL L+L+++++    +     P+  L+    + +++ ++ LE+
Sbjct: 241 EAEGSVLIPTDAAGRVLELALLLDEHFSRARYDKGTTSPV-LLSATIKTVLEFARTQLEY 299

Query: 282 MGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEW 341
           +G  + ++F   R   F  + ++++    EL   P GPK+VLA M SLE+G + ++ V+W
Sbjct: 300 LGSELVQAFSLKRSVPFSFRKLSVITRLEELGAFP-GPKVVLAPMPSLESGPARELLVQW 358

Query: 342 ASDVKNLVLFTERGQFG 358
            +  +N ++FTER Q G
Sbjct: 359 GALPRNTIIFTERAQVG 375


>gi|393241063|gb|EJD48587.1| hypothetical protein AURDEDRAFT_183466 [Auricularia delicata
           TFB-10046 SS5]
          Length = 893

 Score =  279 bits (713), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 243/887 (27%), Positives = 387/887 (43%), Gaps = 193/887 (21%)

Query: 5   VQVTPLSGVFNE---NPLSYLVSIDGFNFLIDCG---WNDHFDPS--------LLQPLSK 50
           +  TPLSG  +E   NPL+YL+ +D    L+DCG   WN  F             Q L  
Sbjct: 2   ITFTPLSGDAHESNGNPLAYLLQVDDVKILLDCGSPDWNPEFIDEDGDAPWTPYCQALRS 61

Query: 51  VASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSE 110
            A +ID VLLSH D  H G  PYA     L AP + T P+  +G + + D+  + R    
Sbjct: 62  FAHSIDLVLLSHGDLQHCGLYPYAFAHWNLRAPAYCTYPIQAMGRVAVLDELEALRAEQS 121

Query: 111 FD-----------------------------LFTLDDIDSAFQSVTRLTYSQNYHLSGKG 141
           F                              +    D+  AF S+  + YSQ  HL GK 
Sbjct: 122 FAETDAANDADPPVDADGDAIMQSRASRSKYVAQRKDVQDAFDSLITMRYSQPTHLQGKC 181

Query: 142 EGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRKEKHLNGTVL------------ 188
           +G+ + P  AGH LGGT+WKI       ++YAVD N  +E+HL+GTVL            
Sbjct: 182 QGLTITPFSAGHTLGGTIWKIRSPSVGTIVYAVDMNHMRERHLDGTVLFRSAPGAGATIF 241

Query: 189 ESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGG-NVLLPVDSAGRVLELLL 246
           E   RP VLITDA   L     R+ R+    + +S TL     ++L+P DS+ RVLELL+
Sbjct: 242 EPLARPDVLITDADKTLVVNARRKDRDAALLELVSDTLGTRSHSLLMPCDSSTRVLELLV 301

Query: 247 ILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITK--------------SFET 292
           + + +W+   +  PI  ++   +  + +V+S +EW+G +I+K              + + 
Sbjct: 302 LFDQHWSFSKMRAPICLVSRTGAEMLTFVRSMMEWLGGTISKEDVGEKPDNNNKGGNRKR 361

Query: 293 SRDN---------AFLLKHVTLLINKSELDNA--PDGPKLVLASMASLEAGFSHDIFVEW 341
            RD+         A   +H+      ++L +      PKL+LA   ++  G S  IF ++
Sbjct: 362 KRDDEEEDAIGAFALRFRHLEFFTTYAQLTSTYPSSKPKLILAVPQNISHGSSRAIFTDF 421

Query: 342 ASDVKNLVLFTERGQFGTLARML-------QADPPP-------------KAVKVTMSRRV 381
           AS V N+V+ T +G+ GTL+RML       Q D                + +K+ M  +V
Sbjct: 422 ASVVGNVVVLTSKGEQGTLSRMLFDKWNEAQRDGDQYGAGTVGEPVTLNETLKLRMHTKV 481

Query: 382 PLVGEELIAYEEEQTRLKKEE------------ALKASLVKEEESKASLGPDNNLSGDPM 429
           PL G EL  + + +   ++ E              +A   + +  ++   PD++  G P 
Sbjct: 482 PLQGAELETHLQAERAAQEREAKQAAALARAQLEAEADDEESDSDESQSEPDDDGDGKPA 541

Query: 430 --VIDANNANASADVVEPHGGRYRDILIDGFVPPSTSV----------APMFPFYENNSE 477
             + DA + ++  D  + +   + DI + G V   TS             MFP+ E    
Sbjct: 542 EPLRDAWHFDSGGDTADANRISF-DIYMKGSVARPTSFFKATEGQTQRFKMFPYVERRRR 600

Query: 478 WDDFGEVINPDDYIIKDEDMDQAAMH---IGGDDGKLDEGSASLILDAKPSKVVSNELTV 534
            D FGEV++   ++ K + ++  A     +     K  E  A       PSK V+ E  V
Sbjct: 601 VDAFGEVVDVAMWLRKGKALETGAESEEALEAKRKKAAEEEAKKAQAEPPSKFVTTEAEV 660

Query: 535 ---------------------------------LVHGSAEATEHLKQHC--LKHVCPHVY 559
                                            LVH +  AT  LK+ C  ++ +   +Y
Sbjct: 661 QLACRLFFVDMEGLNDSRAVKTIVPQVNPRKMILVHSTTAATNALKESCSSIRAMTKDIY 720

Query: 560 TPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEV-------------- 605
           TP + +++ +   + ++ + LSE+L++++   +  D E+ +V   +              
Sbjct: 721 TPWLGDSVQIGEHINSFSLSLSEELLASIKMSRFEDTEVGYVAGRLVAHASSSIPVLEPL 780

Query: 606 --GKTENGMLSLLPISTP-----APPHKSVLVGDLKMADLKPFLSSKGIQVEFAG-GALR 657
             GKTE+G L     +       A   +S ++GDLK+  LK  L++ GI  EFAG G L 
Sbjct: 781 AGGKTEDGALQAAAPAARRQLGVAQLPQSTMIGDLKLTALKARLAAIGIPAEFAGEGVLV 840

Query: 658 CGEYVTIRKVGP---AGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYS 701
           CG++V      P      +  G G  ++VIEG +C+ YY IR  +Y+
Sbjct: 841 CGDFVRDPDADPNAVVAVRKMGRG--KVVIEGGVCDVYYTIRREVYA 885


>gi|353237084|emb|CCA69065.1| hypothetical protein PIIN_02923 [Piriformospora indica DSM 11827]
          Length = 887

 Score =  277 bits (709), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 251/887 (28%), Positives = 393/887 (44%), Gaps = 201/887 (22%)

Query: 5   VQVTPLSGVFNEN---PLSYLVSIDGFNFLIDCGWND-HFDPSL-------------LQP 47
           V  TPL+G  +     PL+YL+ IDG   L+DCG  D H D  L                
Sbjct: 2   VSFTPLAGGAHSASTIPLAYLLDIDGAKILLDCGSPDWHLDDDLKVGEEQKQIFESYCAQ 61

Query: 48  LSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQ 107
           L +++  ID VLLSH D  H G   YA  + GL+A  ++T PV     L   ++ ++ R 
Sbjct: 62  LQRISPDIDLVLLSHGDLAHAGLYAYANARWGLTATAYATLPVQATARLATLEESITLRG 121

Query: 108 VSEFD--------------------------LFTLDDIDSAFQSVTRLTYSQNYHLSGKG 141
             + D                          +    +I+ AFQS+  L YSQ   L+GK 
Sbjct: 122 EEQIDSDPQPTPETDGMEITPAEEKKRTKIRVAKPQEINDAFQSIITLRYSQPTQLAGKC 181

Query: 142 EGIVVAPHVAGHLLGGTVWKITKD-GEDVIYAVDYNRRKEKHLNGTVL---------ESF 191
           +GI + P  AGH +GGT+WKI       ++YAV+ N  KE+HL+G+VL         E  
Sbjct: 182 QGITITPFSAGHTIGGTIWKIRSSLAGTIVYAVNLNHLKERHLDGSVLTLSTGGNVFEPL 241

Query: 192 VRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILED 250
            RP VLITDA  AL     R+ R+    D I++T+ +G ++LLPVDS+ R+LELL++ + 
Sbjct: 242 ARPEVLITDAERALTIGSKRKDRDRALLDLITETIESGHSLLLPVDSSTRLLELLVLTDQ 301

Query: 251 YWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSIT--------KSFETSRDN------ 296
           +WA   +  PI  ++  S   +  V++ +EW+G +I+        K+    RD       
Sbjct: 302 HWAYSKMRAPICLISKTSRQLLSMVRNMMEWLGGTISKEDLGDSAKNQRRRRDEDDEALG 361

Query: 297 --AFLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFT 352
             A   K V    N  E+ N  +   PKL+L+  ASL  G S  +F ++A +  N+V+ T
Sbjct: 362 ALALRFKFVEFFSNPDEMINIFSSREPKLILSVPASLSHGPSRSLFADFAVNEGNMVVLT 421

Query: 353 ERGQFGTLARML-------QADPPP-----KAVKVTMSR--------RVPLVGEELIAY- 391
           +R   GTL R L       Q D          V V++ R        +VPL G EL  Y 
Sbjct: 422 QRTGMGTLNRFLLDRWEAGQEDSQRWQDGHIGVPVSLDRPIDMELRIKVPLQGVELEEYR 481

Query: 392 EEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGG--- 448
           E+E+   ++  A KA+  ++++ +      +    D      +  + +A+V E   G   
Sbjct: 482 EKEKLAKEQANAKKAAAARQQQMREEEVESSGSESDDSDDSDSGEDVTAEVTEEMEGVDW 541

Query: 449 ----------RYR--DILIDG-------FVPPSTSVAP---MFPFYENNSEWDDFGEVIN 486
                     RY+  DI + G       F   + +  P   +FPF E     DDFGEVI+
Sbjct: 542 TILDQEEVGLRYQSYDIYVKGHQNKTSNFFKSNDASVPRFRVFPFIEKRKRVDDFGEVID 601

Query: 487 PDDYIIKDEDMDQAAMHIGGDDGKLDEGSASLIL--DAKPSKVVSNELT----------- 533
              ++ K + MDQ A        +L   +       +  PSK ++ +++           
Sbjct: 602 VSSWLRKGKIMDQNAESEQSKANRLKAAAKEKEQQPEEAPSKFIAEQISIDMRCKVMFVD 661

Query: 534 ----------------------VLVHGSAEATEHLKQHC--LKHVCPHVYTPQIEETIDV 569
                                 ++V  ++EATE L + C  +K +   +YTP++ ETI +
Sbjct: 662 LEGVHDGRALKNILPQVNPRRLIIVQATSEATESLAEACKAIKSMSAEIYTPRVGETIRI 721

Query: 570 TSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWV--------------------------DA 603
             ++  Y + LS+ LM+++      D EIA+V                          D 
Sbjct: 722 GENMENYTIALSDALMNSLKMATYEDNEIAFVRGRLSNPTSTGIYVLEPPRLGMQRTTDV 781

Query: 604 EVGKTENGMLSLLPISTPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAG-GALRCG--- 659
           E+ + ENG+ +    ST A   +++++GDLK+  LK  L+  GI  EFAG G L C    
Sbjct: 782 EMAEKENGVAAAKDSSTAAVIPRAIMIGDLKLTALKIRLNRLGIAAEFAGEGFLVCRSKP 841

Query: 660 ------EYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLY 700
                 + V +RK     +KG      ++ +EG     +Y +R  +Y
Sbjct: 842 IDDDEEDTVAVRKT----RKG------EVRVEGDASPLFYMVREEIY 878


>gi|123476407|ref|XP_001321376.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121904201|gb|EAY09153.1| hypothetical protein TVAG_363680 [Trichomonas vaginalis G3]
          Length = 700

 Score =  277 bits (708), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 204/724 (28%), Positives = 353/724 (48%), Gaps = 69/724 (9%)

Query: 3   TSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSH 62
           TS+   PLSG  +  P +YL+ +D F FL+DCGW + F    +Q   ++ S ++AVLLSH
Sbjct: 6   TSISFQPLSGAQSTTPFAYLLHVDEFTFLLDCGWTEDFRLEDIQTQIEICSHVNAVLLSH 65

Query: 63  PDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSA 122
               H+GALPY     GLSAP+F+T P+  LG L +YD YL+ R   EF  F  +DID A
Sbjct: 66  ASIEHIGALPYLCSH-GLSAPIFATMPIPALGSLLIYDSYLNIRDEEEFKEFNANDIDQA 124

Query: 123 FQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKH 182
           FQ + R+TY Q+  L GK   I + P+ AG+ LGGTVW+I K   +VIY+V       K+
Sbjct: 125 FQKINRMTYQQSEQLDGK--NITITPYNAGNTLGGTVWRIVKGQNEVIYSVSVGDHS-KY 181

Query: 183 LNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVL 242
           L+   LES + P + I DA     ++  ++  + F   I   L  G  ++ P D     L
Sbjct: 182 LSSFSLESGLHPTLWILDARGPESHRDGKE--DEFWRQIFGKLNGGKTIIFPTDGVSGSL 239

Query: 243 ELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD----NAF 298
           E++  L++ W + +  + IYFL++ S + +   +S   ++   I +   +       N  
Sbjct: 240 EVISRLKEQWKKVNWKWKIYFLSHSSPAVLKNAQSLSNYLSLDIQEKINSGEYPFEFNDP 299

Query: 299 LLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFG 358
            L + + + +  ++D +     +V++S  +LE GFS  +F++ A+   NL++FT+R    
Sbjct: 300 DLSYFSCVTSIKDIDFSQGC--VVISSTDTLERGFSRKLFLDKANS-DNLIIFTQREPPY 356

Query: 359 TLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASL 418
           +LA  L+ +   +  +  +  R PL GEEL+ + E+Q+ L+++       + +E  + S 
Sbjct: 357 SLAEALRTNNAHRTFRFIIKHREPLTGEELVKFMEKQSALQEKANEIEGDISDESDEVSQ 416

Query: 419 GPDNNLSGDPMVIDANNANASADVVEPHGGRYR------------DILIDGFVPPSTSVA 466
                        +  N++  A  ++ H  +++            +I+++ ++  +  +A
Sbjct: 417 E------------NIENSSQIAQSLKKHFFQFKRKETSDLSDYGANIVVENYLKGANPMA 464

Query: 467 P----MFPFYENNSEWDDFGE--VINPDDYIIKDEDMDQAAMHIGGDDGKLDEGS--ASL 518
           P         +++    +F +  V  P  ++I   D +     +  +  +  + S  A  
Sbjct: 465 PSKMDTSKMIDSSLTQQNFIQELVYKPSKFMITQYDYNFVGTAVFWNLERTSDYSTIAYN 524

Query: 519 ILDAKPSKVVSNELTVLVHGSAEATEHLKQHCLKHVCPH---VYTPQIEETIDVTSDLCA 575
           +    P+ +      +++    E  E L +  LK   P    +Y P I E + +  DL  
Sbjct: 525 VTSFNPTDI------IIIGAKKENCEELMK-ILKGKSPQNTRIYIPAIGEKVSLQRDLTT 577

Query: 576 YKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTEN-GMLSLLPISTPAPPHKSVLVGDLK 634
            K+ LS  L+S + F   G  +IA+++A +   E+   +   P+ + A  H++  VG + 
Sbjct: 578 RKISLSRALLSGIDFVNCGVNDIAYIEATLKADEHQQFVQARPVESSA-GHQATFVGTID 636

Query: 635 MADLKPFLSSKGIQVEF-AGGALRCG-EYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDY 692
           M+ L   L S GI  +F AGG L CG   V +R V            + I +EG +C DY
Sbjct: 637 MSQLSSKLDSLGINNDFKAGGVLECGRRRVKVRLVNE----------KSITVEGMICPDY 686

Query: 693 YKIR 696
            K+R
Sbjct: 687 IKVR 690


>gi|357440001|ref|XP_003590278.1| Cleavage and polyadenylation specificity factor subunit [Medicago
           truncatula]
 gi|355479326|gb|AES60529.1| Cleavage and polyadenylation specificity factor subunit [Medicago
           truncatula]
          Length = 196

 Score =  276 bits (707), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 131/155 (84%), Positives = 140/155 (90%)

Query: 552 KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENG 611
           K VCPHVY PQIEETIDVTSDLCAYKVQLSEKLMS+VLFKKLG+YE+AWVDAE GKTEN 
Sbjct: 42  KDVCPHVYAPQIEETIDVTSDLCAYKVQLSEKLMSSVLFKKLGEYEVAWVDAEAGKTEND 101

Query: 612 MLSLLPISTPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAG 671
           MLSLLP+S    PHKSVLVGDLK+AD K FLS+KG+ VEFAGGALRCGEYVT+RKVG A 
Sbjct: 102 MLSLLPVSGAPHPHKSVLVGDLKLADFKQFLSTKGVPVEFAGGALRCGEYVTVRKVGDAT 161

Query: 672 QKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
           QKG GSGTQQI+IEGPLCEDYYKIR YLYSQFYLL
Sbjct: 162 QKGAGSGTQQIIIEGPLCEDYYKIRDYLYSQFYLL 196


>gi|170090732|ref|XP_001876588.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164648081|gb|EDR12324.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 901

 Score =  275 bits (703), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 250/922 (27%), Positives = 386/922 (41%), Gaps = 244/922 (26%)

Query: 5   VQVTPLSGVF---NENPLSYLVSIDGFNFLIDCG---WNDHFDP---------------S 43
           +  TPLSG     N  PL+YL+ +D    L+DCG   W+    P                
Sbjct: 2   ITFTPLSGAAHSSNATPLAYLLQVDDVRILLDCGSPDWSPEPSPFEEHPEHDSGDVPWTK 61

Query: 44  LLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYL 103
             + L K A T+D VLLSH D  H G  P+A    GL AP ++T PV  +G + + +   
Sbjct: 62  YCEALQKCAPTVDLVLLSHGDLAHCGLYPWAYTNWGLKAPAYTTLPVQAMGRIAVTEDIE 121

Query: 104 SRRQVSEFD-----------------------------------LFTLDDIDSAFQSVTR 128
             R     D                                   + T  ++  AF+S+  
Sbjct: 122 GIRDEENVDGEREAEPDKQKQDTDGTEEISAESPSFIFNPKRKFVSTTAEVQDAFESINT 181

Query: 129 LTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKI-TKDGEDVIYAVDYNRRKEKHLNGTV 187
           L YSQ  HL GK +G+ + P  AGH LGGT+WKI +     ++YAV+ N  +E+HL+GTV
Sbjct: 182 LRYSQPTHLQGKCQGLTITPFNAGHTLGGTIWKIRSPSSGTIVYAVNVNHMRERHLDGTV 241

Query: 188 L---------ESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDS 237
           L         +   RP +LITDA  A      R+ R+    D IS TL +  ++LLP DS
Sbjct: 242 LIRQAAGGIFDPLARPDLLITDAERASVTTSRRKDRDAALIDTISATLGSRSSLLLPCDS 301

Query: 238 AGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITK----SFETS 293
           + RVLELL++L+ +W    L YPI  L+      + +V+S +EW+G +I+K       T 
Sbjct: 302 STRVLELLVLLDQHWNYSRLRYPICLLSRTGREMLTFVRSMMEWLGGTISKEDVGEEGTG 361

Query: 294 RDNA-----------------FLLKHVTLLINKSEL--DNAPDGPKLVLASMASLEAGFS 334
           R N                     +H+    N   L    +   PKL+LA  ASL  G S
Sbjct: 362 RQNQNKRRRDEEGDEDALGALTFFRHLEFFPNPQALLQTYSSKDPKLILAVPASLSHGPS 421

Query: 335 HDIFVEWASDVKNLVLFTERGQFGTLARML------QADPPPK--------------AVK 374
            ++F ++A+   N+VL T R + GTL R L         P  K              A+ 
Sbjct: 422 RNMFSDFAAVPDNVVLLTGRSEEGTLGRALFDKWNNSQRPDDKWDKGKIGSNVMMDGAIT 481

Query: 375 VTMSRRVPLVGEELIAY-EEEQTRLKKEEALKASLVK----------------------E 411
           + M+ +VPL G EL A+ +EE+   +KE A +A+L +                      E
Sbjct: 482 IKMNHKVPLQGAELEAHLQEERVAKEKEAAHQAALARNQRMLEADEDDSDSDLDSDADEE 541

Query: 412 EESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAP---- 467
            E + +LG D        ++D ++       +        DI I G V  +TS       
Sbjct: 542 AEVRQALGGD--------MMDTDDGEGLTKQLLSF-----DIYIKGNVSKATSFFKISGS 588

Query: 468 ------MFPFYENNSEWDDFGEVINPDDYIIKDEDMDQAAMHIGGDDGKLD---EGSASL 518
                 MFP+ E     D++GE I+   ++ K + +++ A      D K     E  A  
Sbjct: 589 QTQRFRMFPYVEKKRRVDEYGETIDVGMWLRKGKVLEEEAESDEVKDYKRRTQAEEEAKA 648

Query: 519 ILDAKPSKVVSNEL---------------------------------TVLVHGSAEATEH 545
            +   PSK V+ E+                                  ++VH    ATE 
Sbjct: 649 SIREPPSKYVTTEIEIQLACRLLFVDMEGLNDGRAVKTIVPQVNPRKMIIVHAPPNATEA 708

Query: 546 LKQHC--LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA 603
           L + C  ++ +   +Y P + E+I +     ++ + +S++L++++      D +IA+V  
Sbjct: 709 LIESCGNIRAMTKDIYAPTVGESIQIGQQTNSFSISISDELLASLKMSSFEDNQIAYVRG 768

Query: 604 E-VGKTENGMLSLLPISTP------------------------APPHKSVLVGDLKMADL 638
             V    + + +L P+S+                         A PH S ++G+LK+  L
Sbjct: 769 RIVAHATSTIPTLEPVSSSTLSEDPVDSKVTVKRRTLGSRQQVALPH-STMIGELKLTAL 827

Query: 639 KPFLSSKGIQVEFAG-GALRC-------------GEYVTIRKVGPAGQKGGGSGTQQIVI 684
           K  L+S G+Q E  G G L C             GE V++RK+          GT  + +
Sbjct: 828 KARLASIGVQAELIGEGVLICGAGAKRNASSDTLGESVSVRKL--------ARGT--VEL 877

Query: 685 EGPLCEDYYKIRAYLYSQFYLL 706
           EG + E YY +R  +YS   L+
Sbjct: 878 EGNVSEVYYMVRREIYSLHALV 899


>gi|169861678|ref|XP_001837473.1| cleavage and polyadenylation specificity factor subunit
           [Coprinopsis cinerea okayama7#130]
 gi|116501494|gb|EAU84389.1| cleavage and polyadenylation specificity factor subunit
           [Coprinopsis cinerea okayama7#130]
          Length = 926

 Score =  274 bits (701), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 250/939 (26%), Positives = 393/939 (41%), Gaps = 254/939 (27%)

Query: 5   VQVTPLSGVFNEN---PLSYLVSIDGFNFLIDCGWNDHF-DPSLLQ-------------- 46
           +  TPL+G        PLSY++ +D    L+DCG  D   +PS  Q              
Sbjct: 2   ITFTPLAGSAKSKSTTPLSYVLQVDDVRILLDCGSPDWVQEPSPFQDGADMEDDSNVKST 61

Query: 47  ---------PLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLT 97
                     + KVA TID VLLSH D  H G  P+A  + GL+AP ++T PV  +G + 
Sbjct: 62  SPPWQAYCEAMKKVAPTIDLVLLSHGDLAHCGLYPWAYSRWGLTAPAYTTLPVQAMGRIA 121

Query: 98  MYDQYLSRRQVSEFDL----------------------------------FTLDDIDSAF 123
           + +     R   E D+                                   TL ++ +AF
Sbjct: 122 VTEDIEGIRGEIEVDIEEPVEEDAQKQDGGLEVEEQEKALPTMGAKGMCVATLIEVHNAF 181

Query: 124 QSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKI-TKDGEDVIYAVDYNRRKEKH 182
            S+  L YSQ  HL GK +G+ + P  AGH +GGT+WKI +     ++YAV+ N  KE+H
Sbjct: 182 DSINTLRYSQPIHLQGKCQGLTITPFNAGHSIGGTIWKIRSPSSGTILYAVNLNHMKERH 241

Query: 183 LNGTVL----------ESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNV 231
           L+GTV+          ES VRP +LITDA  A      R+ R+    D I+ TL +  ++
Sbjct: 242 LDGTVMMVRPGGSGVFESLVRPDLLITDAERASVITSRRKDRDAALIDTITATLTSRSSL 301

Query: 232 LLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS-- 289
           LLP DS+ R+LELL++L+ +W    L YPI  L+      + +V+S +EW+G +I+K   
Sbjct: 302 LLPCDSSTRILELLVLLDQHWNYSRLTYPICLLSRTGREMLTFVRSMMEWLGGTISKEDV 361

Query: 290 ----------FETSRDN-----------AFLLKHVTLLINKSEL--DNAPDGPKLVLASM 326
                      +  RD+           A   KH+    N   L   ++   PKL+LA  
Sbjct: 362 GEEGNKRQDRNKRRRDDEDGVEEALGALALRFKHLEFFPNPQALLQRHSSKDPKLILAVP 421

Query: 327 ASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML--------QADPP--------- 369
           ASL  G S  +F ++A+   N+VL T RG  GTL R L        + D           
Sbjct: 422 ASLSHGPSRQLFADFAAVPDNVVLLTTRGAEGTLGRALFDKWNNSQRGDDKWDKGRIGRN 481

Query: 370 ---PKAVKVTMSRRVPLVGEELIAY-EEEQTRLKKEEALKASLVKEEE------------ 413
                A+K+ M  +VPL G EL  Y  +E+   +KE A +A++ + +             
Sbjct: 482 VMMDGAIKIKMYHKVPLQGAELEEYLAKERAAKEKEAAQQAAMARNQRMLEADEDDSDSE 541

Query: 414 ------SKASLGPDNNLSGDPMVIDANN---------ANASADVVEPHGGRYR-----DI 453
                 +         L GD  V +A N         ++  AD  +   G  +     DI
Sbjct: 542 SDSDSDADDEEEVREALGGDMDVDEAGNRRRRRGMKKSSDGADWGDGDEGYTKQLLSFDI 601

Query: 454 LIDGFVPPSTSVAP----------MFPFYENNSEWDDFGEVIN----------------- 486
            + G V  STS             MFP+ E     D++GE ++                 
Sbjct: 602 YLKGKVSKSTSFFKSVGGQTQRFRMFPYVEKKRRVDEYGETVDVGLWLRKGKALEEEAEK 661

Query: 487 -------------------PDDYIIKDEDMDQAAMHIGGDDGKLDEGSA--SLILDAKPS 525
                              P  Y+  + ++  A   +  D   L++G A  +++    P 
Sbjct: 662 KEKMEEGATIEEEDKIAEPPSKYVTSEVEVQLACRLLFIDMEGLNDGRAVKTIVPQVNPR 721

Query: 526 KVVSNELTVLVHGSAEATEHLKQHC--LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEK 583
           ++      ++VH S EAT  L + C  +K +   +  P + E+I +   +  + + +S++
Sbjct: 722 RM------IVVHASEEATNALIESCGSIKAMTKDILAPVVNESIQIGQQINNFSISISDE 775

Query: 584 LMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLL-PISTPAP------------------- 623
           +++++   +  D EI +V   V    N ++ +L P S+  P                   
Sbjct: 776 MLASLRMSRFEDNEIGYVRGRVVMHSNSIIPILEPASSAFPSSQTPTTKQVLNKRKLGSR 835

Query: 624 -----PHKSVLVGDLKMADLKPFLSSKGIQVEFAG-GALRCG----------EYVTIRKV 667
                PH S ++G+LK+  LK  L+  GIQ E  G G L CG          E V +RKV
Sbjct: 836 PQVALPH-STMIGELKLTALKARLAKVGIQAELVGEGVLICGAGVGSLDNLAETVAVRKV 894

Query: 668 GPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                      + ++ +EG + + YY +R  +Y    L+
Sbjct: 895 ----------ASGRVELEGNVSDVYYTVRKEIYQLHALV 923


>gi|159465769|ref|XP_001691095.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158279781|gb|EDP05541.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 389

 Score =  270 bits (691), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 159/391 (40%), Positives = 236/391 (60%), Gaps = 29/391 (7%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M T V+ TPL GV  ++PL  L+ ID +  L+DCGW+D FD +LL P+ KV   IDAVLL
Sbjct: 1   METVVRYTPLCGVGEDSPLCSLLEIDDYTILLDCGWDDSFDVALLDPVLKVLPRIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHP   HLG+LPY + + GL+APVFST+P  R+G + M++  L+ + VS+F  + LDD+D
Sbjct: 61  SHPSPAHLGSLPYLVGRCGLAAPVFSTKPTRRMGEMFMFEACLAHQAVSDFAAYDLDDVD 120

Query: 121 SAFQ---SVTRLTYSQNYHL--------------SGKGEGIVVAPHVAGHLLGGTVWKIT 163
           + F+     T L YSQ + L                 G GI + P  AG   GG VW++T
Sbjct: 121 AGFRLHPRWTELRYSQKHLLLPPAAPAGAAGGGQGPAGGGIAITPLPAGRYPGGAVWRLT 180

Query: 164 --KDGEDVIYAVDYNRRKEKHLNGTVLES---FVRPAVLITDAYNALH-NQPPRQQR-EM 216
               G++V+YAVD+N RKE+ LN T   +    ++PA+LI DA N L    PPR +R E 
Sbjct: 181 LLGSGQEVVYAVDFNHRKERLLNETTFTTALAALQPALLIGDAVNGLAPPAPPRHKRDEE 240

Query: 217 FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL---NYPIYFLTYVSSSTID 273
           F DAI+ T+   GNVL+P D+AGRVLEL L+L++++A         P+  L+Y   + ++
Sbjct: 241 FLDAITATVEGEGNVLIPTDAAGRVLELALLLDEHFARARCVIAATPV-VLSYTIKTVLE 299

Query: 274 YVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGF 333
           + ++ LE++G  + ++F   R   F  + + ++    +L   P GPK+VLA++ SL+ G 
Sbjct: 300 FARTQLEYLGSEMVQAFSHKRTIPFTFRKLAVITRLEDLGAIP-GPKVVLATLPSLDCGP 358

Query: 334 SHDIFVEWASDVKNLVLFTERGQFGTLARML 364
           +  + V+WA+  +N ++FTER   GTLA  L
Sbjct: 359 ARQLLVDWAAAPRNTIIFTERANPGTLAHAL 389


>gi|349604123|gb|AEP99763.1| Cleavage and polyadenylation specificity factor subunit 2-like
           protein, partial [Equus caballus]
          Length = 281

 Score =  258 bits (658), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 129/280 (46%), Positives = 180/280 (64%), Gaps = 6/280 (2%)

Query: 94  GLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGH 153
           G + MYD Y SR    +F LFTLDD+D+AF  + +L +SQ  +L GKG G+ + P  AGH
Sbjct: 1   GQMFMYDLYQSRHNTEDFTLFTLDDVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGH 60

Query: 154 LLGGTVWKITKDG-EDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQ 212
           ++GGT+WKI KDG E+++YAVD+N ++E HLNG  LE   RP++LITD++NA + QP R+
Sbjct: 61  MIGGTIWKIVKDGEEEIVYAVDFNHKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRRK 120

Query: 213 QR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIY---FLTYVS 268
           QR E     + +TLR  GNVL+ VD+AGRVLEL  +L+  W        +Y    L  VS
Sbjct: 121 QRDEQLLTNVLETLRGDGNVLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVS 180

Query: 269 SSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMAS 328
            + +++ KS +EWM D + + FE  R+N F  +H++L    S+L   P  PK+VLAS   
Sbjct: 181 YNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVP-SPKVVLASQPD 239

Query: 329 LEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           LE GFS D+F++W  D KN ++ T R   GTLAR L  +P
Sbjct: 240 LECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNP 279


>gi|395330425|gb|EJF62808.1| hypothetical protein DICSQDRAFT_135076 [Dichomitus squalens
           LYAD-421 SS1]
          Length = 943

 Score =  258 bits (658), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 252/957 (26%), Positives = 385/957 (40%), Gaps = 272/957 (28%)

Query: 5   VQVTPLSGVFNEN---PLSYLVSIDGFNFLIDCG---W-----NDHFDPSLLQP------ 47
           +  TPLSG        PL+YL+ +D    L+DCG   W      D  + S L P      
Sbjct: 2   ITFTPLSGPARSARTVPLAYLLQVDDVRILLDCGSPDWCPETTQDGTEESELAPWEKYCD 61

Query: 48  -LSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRR 106
            L + AS++D VLLSH D  H G  PYA    GL+AP ++T PV  +  + + +     R
Sbjct: 62  SLKECASSVDLVLLSHGDLSHCGLYPYAHAHWGLTAPAYTTLPVQAMARVAVTEDVEGIR 121

Query: 107 QVSEF---------------------------------------DLFTLDDIDSAFQSVT 127
              +                                        ++ TL ++  AF+SV 
Sbjct: 122 DEQDVGDTTEAKGTQESSSEPSGSPVLGENVSSPPPSSEGKRRKNVATLQEVVDAFESVN 181

Query: 128 RLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKI-TKDGEDVIYAVDYNRRKEKHLNGT 186
            L YSQ  HL GK +G+ + P  AGH LGGT+WKI +     ++YAVD N  +E+HL+GT
Sbjct: 182 VLRYSQPCHLQGKCQGLTIIPFNAGHSLGGTIWKIRSPSAGTILYAVDMNHMRERHLDGT 241

Query: 187 VL-----------ESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLP 234
           VL           ES  RP +LITDA  A      R+ R+    D ++ TL +  ++LLP
Sbjct: 242 VLIRQASAGGGVFESLARPDLLITDAERANVTTARRKDRDAALLDCVTATLSSRNSLLLP 301

Query: 235 VDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSR 294
            D++ RVLELL++L+ +W    L YPI  L+      + +V+S +EW G +I+K  E   
Sbjct: 302 CDASTRVLELLVLLDQHWNYSRLKYPICLLSRTGQEMLTFVRSMMEWFGGTISK--EDVG 359

Query: 295 DN-----------------------AFLLKHVTLLINKSELDN--APDGPKLVLASMASL 329
           +N                       A   KHV   ++   L +  +   PKL+LA  A+L
Sbjct: 360 ENGENGRRDRRRRDDDHDEEALGAFALRFKHVEFFLSPQALMSTYSSKDPKLILAVPATL 419

Query: 330 EAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML-------QADPPP------------ 370
             G S  IF E+A    N+VL T RG+ GTL R+L       Q +               
Sbjct: 420 SHGPSRAIFAEFAEIPDNVVLLTGRGEPGTLGRLLFDKWNDSQREEAKWDRGKIGNNIMM 479

Query: 371 -KAVKVTMSRRVPLVGEELIAY-----------EEEQTRLKKEEALKASLV--------- 409
              +++ M  +VPL GEEL  Y             +Q  L + + +  +           
Sbjct: 480 DGVLRLEMHSKVPLQGEELEEYLAKERAAREKAAAQQAALARTQRMLEADEAESESEDDT 539

Query: 410 --------KEEESKASLGP---DNNLSGDPMVIDANNAN----------ASADVV----E 444
                   +E E + +LG    D    G P+     N            A  D V    E
Sbjct: 540 DESGSDSDEESEVERTLGEDFMDTAEEGKPVRTGRTNGRRKRKRAEGGGADGDWVVGGNE 599

Query: 445 PHGGRYR----DILIDGFVPPSTSVAP----------MFPFYENNSEWDDFGEVIN---- 486
           P  G       DI + G V  +TS             MFP+ E     D++GE ++    
Sbjct: 600 PEDGAVTRISFDIYLKGNVTKATSFFKSAEGQTQRFRMFPYVEKKRRVDEYGETVDVGMW 659

Query: 487 -----------------------------------PDDYIIKDEDMDQAAMHIGGDDGKL 511
                                              P  Y+    ++  A      D   L
Sbjct: 660 LRKGKVFEESTESEESKEAKRRKEEEEAKKTPREPPSKYVTSVAEVQLACRLFFVDLEGL 719

Query: 512 DEGSA--SLILDAKPSKVVSNELTVLVHGSAEATEHLKQHC--LKHVCPHVYTPQIEETI 567
           ++G A  +++    P K+      +LVH    AT+ L + C  +K +   +Y P   ETI
Sbjct: 720 NDGRAVKTIVPQVNPRKM------ILVHAPQAATDALIESCASIKAMTKEIYAPPQGETI 773

Query: 568 DVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLL---------PI 618
            +     ++ + LS++L++++   +  D E+A+V   V    +  + +L         P 
Sbjct: 774 QIGQHTNSFSISLSDELLASLKMSRFEDNEVAYVSGRVSSLASSTIPVLEPAAITHFQPA 833

Query: 619 STPAPPHK--------------SVLVGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVT 663
           S P  P +              S ++G+LK+  LK  L+S G+Q E  G G L C     
Sbjct: 834 SAPHQPLRGRMLGSRPTQALPQSTMIGELKLTALKTRLASIGVQAELVGEGVLIC----- 888

Query: 664 IRKVGPAGQKGGGSGTQ--------------QIVIEGPLCEDYYKIRAYLYSQFYLL 706
               G A +KG G G                ++ +EG + + Y+ +R  +YS   L+
Sbjct: 889 ----GAAAKKGAGVGLDSLGDSVAVRKTARGRVEVEGSVSDVYHTVRREVYSLLALV 941


>gi|67968123|dbj|BAE00542.1| unnamed protein product [Macaca fascicularis]
          Length = 592

 Score =  255 bits (651), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 183/625 (29%), Positives = 294/625 (47%), Gaps = 147/625 (23%)

Query: 193 RPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDY 251
           RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+AGRVLEL  +L+  
Sbjct: 4   RPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTAGRVLELAQLLDQI 63

Query: 252 WAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLIN 308
           W        +Y    L  VS + +++ KS +EWM D + + FE  R+N F  +H++L   
Sbjct: 64  WRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHG 123

Query: 309 KSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
            S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R   GTLAR L  +P
Sbjct: 124 LSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNP 182

Query: 369 PPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDP 428
             K  ++ + +RV L G+EL  Y E++   K+                        S + 
Sbjct: 183 SEKITEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-----------------SKEA 225

Query: 429 MVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAPMFPFYENNSEWDD 480
            +  ++ ++   D+ +P   + + D+++ G       F   +    PMFP  E   +WD+
Sbjct: 226 DIDSSDESDIEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYPMFPAPEERIKWDE 285

Query: 481 FGEVINPDDYIIK-------------------DEDMDQ-------------AAMHIGGD- 507
           +GE+I P+D+++                    DE MDQ              ++ I    
Sbjct: 286 YGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKCISTTESIEIKARV 345

Query: 508 -----DGKLDEGSASLILDA-KPSKVVSNELTVLVHGSAEATEHLKQHCL----KHVCPH 557
                +G+ D  S   I++  KP ++      ++VHG  EA++ L + C     K +   
Sbjct: 346 TYIDYEGRSDGDSIKKIINQMKPRQL------IIVHGPPEASQDLAECCRAFGGKDI--K 397

Query: 558 VYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVGKTENGML 613
           VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D      V K + G++
Sbjct: 398 VYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDMRVSKVDTGVI 457

Query: 614 -----------------------------------------------SLLPISTPAPP-- 624
                                                           ++P   P PP  
Sbjct: 458 LEEGELKDDGEDSEMQVDAPSDSSVIAQQKAMKSLFGDDEKETGEESEIIPTLEPLPPHE 517

Query: 625 ---HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQ 681
              H+SV + + +++D K  L  +GIQ EF GG L C   V +R+          + T +
Sbjct: 518 VPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR----------TETGR 567

Query: 682 IVIEGPLCEDYYKIRAYLYSQFYLL 706
           I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 568 IGLEGCLCQDFYRIRDLLYEQYAIV 592


>gi|392593024|gb|EIW82350.1| hypothetical protein CONPUDRAFT_54247 [Coniophora puteana
           RWD-64-598 SS2]
          Length = 926

 Score =  254 bits (650), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 248/933 (26%), Positives = 391/933 (41%), Gaps = 253/933 (27%)

Query: 5   VQVTPLSGVFNEN---PLSYLVSIDGFNFLIDCG---WNDHFDPSL-------------- 44
           +  TPLSG    +   PL+YL+ ID    L+DCG   WN    PS               
Sbjct: 2   ITFTPLSGAARSSVTSPLAYLLQIDDVKILLDCGSPDWNPEKIPSTSTESDSSPYFWQDY 61

Query: 45  LQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLS 104
              L + A ++D VLLSH D  H G   YA  + GL APV+ST PV  +G +   +    
Sbjct: 62  CNALKQCAPSVDLVLLSHGDLSHCGLFAYAYSRWGLKAPVYSTLPVQAMGRIATTEDVDG 121

Query: 105 RR--------QVSEFD-------------------------LFTLDDIDSAFQSVTRLTY 131
            R           +FD                         + T+ ++  AF S+  L Y
Sbjct: 122 LRDEGIHDPENEQDFDEEHKEENENEEGFSTEQKEHTSIKFIATMQEVHEAFDSINTLRY 181

Query: 132 SQNYHLSGKGEGIVVAPHVAGHLLGGTVWKI-TKDGEDVIYAVDYNRRKEKHLNGTVL-- 188
           SQ  HL G+ +GI V P  AGH LGGT+WKI +     ++YAV+ N  +E+HL+GT+L  
Sbjct: 182 SQPTHLQGRCQGITVTPFNAGHTLGGTIWKIRSPSAGTILYAVNINHMRERHLDGTILVR 241

Query: 189 -------ESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGR 240
                  E   RP +LITDA  A      R+ R+    D IS TL +  ++LLP DS+ R
Sbjct: 242 SAGGGVFEQLARPDLLITDADRANVVTSRRKDRDAALMDCISATLSSRSSLLLPCDSSTR 301

Query: 241 VLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS----------- 289
           VLELL++L+ +W  H   YPI FL+      + +V+S +EW+G ++ K            
Sbjct: 302 VLELLVLLDQHWKFHDYRYPICFLSRNGREMLTFVRSMMEWLGGTVNKEDVGVDGSGRMG 361

Query: 290 ---------FETSRDNAFLLK--HVTLLINKSEL--DNAPDGPKLVLASMASLEAGFSHD 336
                     +     AF L+  H+    N   L    +   PK++LA  ASL  G S  
Sbjct: 362 GNKRRRDDDADDDALGAFALRFPHLEFFPNPDALLQTYSSKDPKIILAVPASLSHGPSRS 421

Query: 337 IFVEWASDVKNLVLFTERGQFGTLARML--------QADPP------------PKAVKVT 376
           +FV++A+   N+VL T RG+ GTL ++L        +AD                A+++ 
Sbjct: 422 LFVDFAAVPDNVVLLTGRGEEGTLGQILFGRWNDSQRADDKWDKGKIGRNVMMDGAMRLK 481

Query: 377 MSRRVPLVGEELIAY-EEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANN 435
           MS +VPL G EL  Y  +E+   +KE A +A++ + +    +   +++   D    +   
Sbjct: 482 MSSKVPLQGTELELYLAKERATKEKEVAQQAAMARNQRMLEADEDESDEESDSDAEEDEV 541

Query: 436 ANA-------SADVVEPH-GGRYR------------------------DILIDGFVPPST 463
           A A       S D+  P+ G R R                        DI + G +  +T
Sbjct: 542 ARALGVTTLDSDDISSPNLGLRKRKGESAEDGEWADMDEGLTKQVLSFDIYLKGNMSKAT 601

Query: 464 SVAP----------MFPFYENNSEWDDFGEVINPDDYIIKDEDMDQAAMHIGGDDGK--- 510
           S             MFP+ E     D++GE I+   ++ K + M++ +    GD+ K   
Sbjct: 602 SFFKTSSNQSQRFRMFPYVEKKRRVDEYGETIDVGMWLRKGKVMEEDSQ---GDEAKDVK 658

Query: 511 ----LDEGSASLILDAKPSKVVSNELTV-------------------------------- 534
                +E          P K V++E+ V                                
Sbjct: 659 RRQAEEEEKFQKAAQEPPYKFVTSEIEVQLACRLLFIDMQGLNDGRSVKTIIPQMNPRKM 718

Query: 535 -LVHGSAEATEHLKQHC--LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFK 591
            +VH S  A+E L   C  +  +   +Y PQ+ +++ +     ++ + LS++L++ +   
Sbjct: 719 IIVHASESASEALISSCANIHAMTKDIYAPQVGDSVQIGQQTNSFSISLSDELIAGLKMS 778

Query: 592 KLGDYEIAWVDAEVGKTENGMLSLLPISTPA-----------------PPHK-------- 626
           +  D E+A+V    G+  +   S +PI  PA                 PP +        
Sbjct: 779 RFEDNEVAYV---TGRVISHFSSTIPILGPAYAVPPARQSSVVSENVEPPKRRTLGSRSK 835

Query: 627 -----SVLVGDLKMADLKPFLSSKGIQVEFAG-GALRCG-------------EYVTIRKV 667
                S ++G+LK+  LK  L++ GI  E  G G L CG             + V +RK 
Sbjct: 836 IDLPHSTMIGELKLTSLKSRLAAVGIHAELIGEGVLICGAGAKRDQASQNLHDTVAVRK- 894

Query: 668 GPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLY 700
                    + + ++ +EG + + YY +R  +Y
Sbjct: 895 ---------TTSGKVELEGNVSDVYYNVRNEIY 918


>gi|409079696|gb|EKM80057.1| hypothetical protein AGABI1DRAFT_72888 [Agaricus bisporus var.
           burnettii JB137-S8]
 gi|426198540|gb|EKV48466.1| hypothetical protein AGABI2DRAFT_220282 [Agaricus bisporus var.
           bisporus H97]
          Length = 919

 Score =  254 bits (648), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 251/926 (27%), Positives = 378/926 (40%), Gaps = 245/926 (26%)

Query: 5   VQVTPLSGVFNEN---PLSYLVSIDGFNFLIDCG---WNDHFDPSL-------------- 44
           +  TPLSG    +   PLSYL+ +D    L+DCG   W    D S               
Sbjct: 2   ITFTPLSGAARSDSPSPLSYLLQVDDVRMLLDCGSPDWAPENDASTDGENESEEPRHSWS 61

Query: 45  --LQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLG-LLTMYD- 100
              + L ++A TID VLLSH D  H G  PYA  + GL AP +ST PV   G +  M D 
Sbjct: 62  DYCETLRRIAPTIDLVLLSHGDLSHSGLYPYAYSRWGLKAPAYSTLPVQATGKIAAMEDV 121

Query: 101 ------QYLSRRQVSEFD---------------------------LFTLDDIDSAFQSVT 127
                 Q +    + E +                           L TL ++  AF+ + 
Sbjct: 122 EGIRDEQDIGDEPIQEAEHQELQSGEDAGVHKESSLNPTTKTGKFLATLVEVQDAFEYLN 181

Query: 128 RLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKI-TKDGEDVIYAVDYNRRKEKHLNGT 186
            L YSQ  HL GK +GI + P  AGH LGGT+WKI +     +IYAV  N  KE+HL+GT
Sbjct: 182 TLRYSQPMHLQGKCQGITITPFNAGHTLGGTIWKIRSPTSGTIIYAVHMNHMKERHLDGT 241

Query: 187 VL---------ESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVD 236
           VL         E   RP +LITDA  A      R+ R+    D I+ TL +  ++LLP D
Sbjct: 242 VLMKNASGGIFEPLARPDLLITDADRANVITSRRKDRDAALIDTITATLSSRSSLLLPCD 301

Query: 237 SAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS---FETS 293
           S+ R+LELL++L+ +W+   L YPI  L       + +V+S +EW+G +I+K     E +
Sbjct: 302 SSTRILELLVLLDQHWSYSRLRYPICLLARTGRDMLAFVRSMMEWLGGTISKEDVGVEAT 361

Query: 294 RDN------------------AFLLKHVTLLINKSEL--DNAPDGPKLVLASMASLEAGF 333
                                A   KH+    N   L    +   PKL+LA  ASL  G 
Sbjct: 362 AKQRNKRKRDDDDDNEALGALALRFKHLEFFPNPQALLQTYSSKDPKLILAVPASLSHGP 421

Query: 334 SHDIFVEWASDVKNLVLFTERGQFGTLARML--------QADPP------------PKAV 373
           S ++FV++A    N+VL T RG+ G+L R L        + D                  
Sbjct: 422 SRNLFVDFAVVPDNVVLLTGRGEEGSLGRALFNKWNDRQRVDDKWDKGKIGSNIMLDGGF 481

Query: 374 KVTMSRRVPLVGEELIAY-EEEQTRLKKEEALKASLVKEEES------------------ 414
           ++ M  +VPL G EL AY ++E+ +  KE A +A+L + +                    
Sbjct: 482 RMKMRSKVPLQGAELEAYLQQEKEKKDKEVAQQAALARSQRMLEADEDESDSDSDTDEEE 541

Query: 415 --KASLGPDNNLSGDPMV-------IDANNANASADVVEPHGGRYRDILIDGFVPPSTSV 465
             + +L  D  + GD +         DA +    AD          DI + G V  +TS 
Sbjct: 542 EVRRTLEGDMEVDGDGISRRRKRDDTDATDWALDADEGLTKQFLSFDIYLKGNVSRATSF 601

Query: 466 AP----------MFPFYENNSEWDDFGEVIN----------------------------- 486
                       MFP+ E     D++GE I+                             
Sbjct: 602 FKTAGGQTQRFRMFPYVEKKRRVDEYGETIDVGMWLRKGMVLEEEAESDEIKDYKKKLQE 661

Query: 487 ----------PDDYIIKDEDMDQAAMHIGGDDGKLDEGSA--SLILDAKPSKVVSNELTV 534
                     P  ++  D D+  A   +  D   L++G A  +++    P K+      +
Sbjct: 662 EEEAKKIKEPPSKFVTMDVDVQLACRLLFVDMEGLNDGRAVKTIVPQINPRKM------I 715

Query: 535 LVHGSAEATEHLKQHC--LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKK 592
           LV  S  A+  L + C  ++ +   +Y+P + E++ +      + + +SE L++++   +
Sbjct: 716 LVSASESASNALIESCSNIRAMTKDIYSPAVGESVQIGQQTNTFSISISEDLLTSLRMSR 775

Query: 593 LGDYEIAWVDAEVGKTENGMLSLLPISTPAPPH------------------------KSV 628
             D EI +V   V       +  L   +  PP                         +S 
Sbjct: 776 FEDNEIGYVRGRVVAHATSTIPTLESVSSLPPTTDRTVVSDPSKSRILGSRPKVALPQST 835

Query: 629 LVGDLKMADLKPFLSSKGIQVEFAG-GALRCG------------EYVTIRKVGPAGQKGG 675
           ++G+LK+  LK  L++  I  E  G G L CG            E V +RK      K  
Sbjct: 836 MIGELKLTALKQRLAAVNIPAELIGEGVLICGGIRQTDNMDTSEETVAVRK------KAK 889

Query: 676 GSGTQQIVIEGPLCEDYYKIRAYLYS 701
           GS    + +EG + E YYK+R  +Y+
Sbjct: 890 GS----VELEGNVSELYYKVRREIYN 911


>gi|301092283|ref|XP_002997000.1| cleavage and polyadenylation specificity factor subunit, putative
           [Phytophthora infestans T30-4]
 gi|262112189|gb|EEY70241.1| cleavage and polyadenylation specificity factor subunit, putative
           [Phytophthora infestans T30-4]
          Length = 513

 Score =  250 bits (638), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 171/540 (31%), Positives = 269/540 (49%), Gaps = 84/540 (15%)

Query: 221 ISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLE 280
           I KT+R GGNVL+P DS+GRVLEL+ +L+ YW ++ L  PI  L  +S  T    ++ LE
Sbjct: 4   ILKTVRNGGNVLIPTDSSGRVLELMRVLDQYWIQNKLRDPIALLHDMSYYTPKAAQAMLE 63

Query: 281 WMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVE 340
           W  D I K+F+  R N F   H+ L+    ELD  P+ PK+VLA+  SLE GF+ DIF+ 
Sbjct: 64  WCNDRIAKNFDVGRQNPFQFTHIHLVHTLEELDALPN-PKVVLATSPSLECGFAKDIFIR 122

Query: 341 WASDVKNLVLF---TERGQFGTLARMLQADPPP-KAVKVTMSRRVPLVGEELIAYE-EEQ 395
           WA D +N ++F   T    F +    L  DP   K +  T++++V L G EL  YE +E+
Sbjct: 123 WAPDPRNSIIFSSTTSETSFASRVVKLSKDPSAEKNISCTVTQKVFLEGAELALYEVKER 182

Query: 396 TRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILI 455
            RL+ E   KA  ++E   +  +          M I+   + +  +   P   + R    
Sbjct: 183 KRLRTEAENKAKEIEEAAMEDMM----------MGIEDFESESEEEETTPQEVQLRGTFK 232

Query: 456 DGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKDEDM---DQAAMHIGGD---DG 509
            G    ++   PMF   E+ +EWD++GE+INPDD+  KD  +    QA  +I  D   D 
Sbjct: 233 VGLGQFASVRYPMFFAVESKTEWDEYGEIINPDDF--KDATLLANRQARRNIIEDADGDE 290

Query: 510 KLDEGSASLILDAKPSKVVSNELTV---------------------------------LV 536
            ++  +    ++ +P+K ++NE+ V                                 LV
Sbjct: 291 DMENANQEAAVETRPTKTITNEVVVNIAARITQVDFDGIADGRAIRNCLGNVKPRKLILV 350

Query: 537 HGSAEATEHLKQHCLKHV--CPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLG 594
           HG+ + T  LKQ     +  C  V+TP + E ID+ SD   YK+ + E L ++     +G
Sbjct: 351 HGTEKTTSELKQFVESSIPMCEAVFTPDVMECIDIESDTNVYKLSVKESLYTSA----VG 406

Query: 595 DYEIAWVDAEVGKTENGMLSLLPISTP------APPHKSVLVGD--LKMADLKPFLSSKG 646
            +E+++V  ++  +EN   S +P+  P         H+ +L+ D  +K+  +K  L   G
Sbjct: 407 SHEVSYVTGQLVLSEN---SSVPVLQPLNENGGQATHEPILLSDGKMKLDVMKQVLGKAG 463

Query: 647 IQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
            Q +F GG L C + V +++          +   +IV+EG L  +YY+IRA LY QF L+
Sbjct: 464 FQAKFRGGMLVCNDGVVLKR----------AMNNEIVMEGTLSRNYYRIRALLYEQFTLV 513


>gi|412994069|emb|CCO14580.1| predicted protein [Bathycoccus prasinos]
          Length = 1092

 Score =  249 bits (636), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 171/487 (35%), Positives = 251/487 (51%), Gaps = 70/487 (14%)

Query: 2   GTSVQVTPLSGVFNEN------------PLSYLVSIDGFNFLIDCGWNDHFDPS-LLQPL 48
           G  V +TPL G   E+            PL YL+ ID  N L+DCGW+D FD +  ++ L
Sbjct: 158 GNKVALTPLLGGIREDDGARGGTTTTTEPLCYLLQIDQANILLDCGWDDRFDQTEYVKEL 217

Query: 49  SKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAP---VFSTEPVYRLGLLTMYDQYLS- 104
            K+A T+D VL+SH    H+GA+P    +     P   ++++ P ++LG +  YD  L  
Sbjct: 218 EKIAPTLDCVLISHCTQRHVGAVPLLFSERVKCNPNCKIYASIPTHKLGQMLCYDIALGY 277

Query: 105 ---RRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEG------------------ 143
              R +  E   ++LDD+D AF     + Y Q+  +S + E                   
Sbjct: 278 SEFRGEFGEDVGYSLDDVDLAFSKFVPVKYQQHSRVSVRRESAGGGGGGESDAGTNSKNS 337

Query: 144 -------IVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVL-ESFVRPA 195
                  IVV    AGH LGG+ W+I+KD ED++YAVDYN RKE+HL GT L E+  RP+
Sbjct: 338 GGATNSDIVVEAINAGHTLGGSCWRISKDAEDIVYAVDYNMRKERHLAGTSLAETVHRPS 397

Query: 196 VLITDAYNALHNQPPR--QQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 252
           VLITD  N     P    Q R++   D + K  R  GNV++  D+ GR LEL L+LE+ W
Sbjct: 398 VLITDCRNVDRKAPESRLQVRDLPLVDCVLKHARMEGNVVICCDAVGRTLELALLLEETW 457

Query: 253 AEHSL-NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSE 311
              +L +Y +     V+++ +++ +S LEWM + +   F+++R N F +K +    +  +
Sbjct: 458 KNQNLGSYQLVLFNNVAANALEFARSHLEWMNEDVGLKFDSTRQNVFDVKRLFPCHSYED 517

Query: 312 LDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF-TERGQFGTLARMLQADPPP 370
               P GPK+VLAS+ASLE GF+  +FVEWASD KN  ++  E G+   LAR +      
Sbjct: 518 FTRLPPGPKVVLASLASLEGGFARKLFVEWASDAKNCFIWPDEIGRQVGLAREIVEKCSK 577

Query: 371 KA--------------VKVTMSRRVPLVGEELIAYEEEQTRLK-----KEEALKASLVKE 411
                           +KV ++RR  L G+EL A+E EQ   +     + E     L +E
Sbjct: 578 GGAKTTSSKTKKKDVIMKVELARRELLSGKELEAWEHEQEEKRLEAEKRREEEAKRLAEE 637

Query: 412 EESKASL 418
           EE K  L
Sbjct: 638 EEKKRML 644



 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 54/236 (22%), Positives = 98/236 (41%), Gaps = 74/236 (31%)

Query: 534  VLVHGSAEATEHLKQHCLKHVCPH------VYTPQIEETIDVTSDLCAYKVQLSEKLMSN 587
            +LV G+ +  E L  H L +   H      +  P+  ET+D +S    YKV+LSE ++S+
Sbjct: 868  ILVSGTVKDAEKLASH-LYNDSEHFPKSSKIDYPKNNETLDASSVHPTYKVRLSEAVLSS 926

Query: 588  VLFKKLGDYEIAWVDAEVGKT-ENGML-SLLPISTPA----------------------- 622
               +++  Y + W+D  +G   E+G    LLP+   A                       
Sbjct: 927  ARLRQVSGYAVGWIDGVIGPIPEDGSAPELLPVPVNALKLTVSKTVKDESLLAGKVTGPS 986

Query: 623  ----PPHKSVLV-------------------------GDLKMADLKPFLSSKGIQVEFA- 652
                 P  + LV                         GD+++++ + +L   G+  EF  
Sbjct: 987  LIKKEPTAAALVVEDNEENEGTEINIVTKHHRRSAFVGDVRLSEFRRYLQRMGVPAEFGE 1046

Query: 653  GGALRC--GEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
            GGAL C  G+ V  R+          +   ++++EG + + Y+ +R  LY+Q+ ++
Sbjct: 1047 GGALVCANGQVVVRRR----------AEDDELIVEGSISDAYFNVRDMLYAQYSII 1092


>gi|260822471|ref|XP_002606625.1| hypothetical protein BRAFLDRAFT_209615 [Branchiostoma floridae]
 gi|229291969|gb|EEN62635.1| hypothetical protein BRAFLDRAFT_209615 [Branchiostoma floridae]
          Length = 607

 Score =  247 bits (631), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 193/630 (30%), Positives = 293/630 (46%), Gaps = 128/630 (20%)

Query: 182 HLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRV 241
           +LN   L   +R   L++ +Y        + + E     I  T+R  GNVL+ +D+AGRV
Sbjct: 1   YLNYVQLRRKLRDEQLLSKSYLNYVQLRRKLRDEQLLTEIFNTVRDDGNVLVSIDTAGRV 60

Query: 242 LELLLILEDYW--AEHSLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAF 298
           LEL  +LE YW  AE  L  Y +  L  V+ + +++ KS +EWM D I + FE +R+N F
Sbjct: 61  LELSQLLEQYWQNAETGLQAYNLCLLNNVAYNVVEFAKSQVEWMSDKIMRVFEDNRNNPF 120

Query: 299 LLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFG 358
             KH+ L  + SEL   PD PK+VLAS+  LE+GFS ++FV+W  + KN V+ T R   G
Sbjct: 121 QFKHLKLCHSLSELHKVPD-PKVVLASVPDLESGFSRELFVQWCQNQKNTVVLTSRPGPG 179

Query: 359 TLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASL 418
           TL RML  +P  K   +   +RV L G EL  Y +E+ + K+E+  + S  K +ES    
Sbjct: 180 TLGRMLIDNPKMKTFTLQARKRVRLEGPELEEYLQEEKKEKEEKKRRESKAKGDES---- 235

Query: 419 GPDNNLSGDPMVIDANNANASADVVEPH-------GGRYRDILIDGFVPPSTSVAPMFPF 471
             D + S D M ++ ++       V  H       GGR       GF   +    PMFP 
Sbjct: 236 --DTSESEDEMEVEGSSFPGGVKGVAKHDLMMQAEGGRK-----GGFFKQAKKAYPMFPA 288

Query: 472 YENNSEWDDFGEVINPDDYIIKDEDMDQAAMHIGGDDGKLDEGSASLILDAKPSKVVSNE 531
            E   +WDD+GE+I P+DY++ +    +        +    E  A  + D  P+K +  E
Sbjct: 289 PEERVKWDDYGEIIKPEDYMVVEMTQAEEEKAKAEGEAAAQEEFAEELTDV-PTKSIVQE 347

Query: 532 LT---------------------------------VLVHGSAEATEHLKQHCLK---HVC 555
           LT                                 V+VHG++E+T  L + C      V 
Sbjct: 348 LTLDIKCRVVYIDFEGRSDGESMKKILTQLKPRQLVIVHGNSESTLLLAEVCRSTAGMVQ 407

Query: 556 PHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVD------------- 602
             V+TP++ ET+D T +   Y+V+L + L+S++ F K  D E+AWVD             
Sbjct: 408 EKVFTPRLNETVDATMESHIYQVKLKDSLVSSLQFYKARDTELAWVDGQLDLTTPTTDTS 467

Query: 603 -----AEVGKTE------------------NGMLSLLPISTPA----------------- 622
                 EV + E                  +G L  LP +  +                 
Sbjct: 468 ALLEEGEVQEMEDLEEEQFFKARDTELAWVDGPLLTLPFTCKSAKAAAEESRETVPTLEA 527

Query: 623 ------PPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGG 676
                 P H++V +   +++D+K  L  +GIQ EF+GG L C   V +++          
Sbjct: 528 LPISQIPGHEAVFINKPRLSDIKQVLQKEGIQAEFSGGVLICNNVVALKR--------NE 579

Query: 677 SGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
           SG  +I +EG +CEDYYK+R  LY Q+ ++
Sbjct: 580 SG--RIGMEGCICEDYYKVRKLLYEQYAIV 607


>gi|348689662|gb|EGZ29476.1| hypothetical protein PHYSODRAFT_552782 [Phytophthora sojae]
          Length = 513

 Score =  244 bits (622), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 177/540 (32%), Positives = 267/540 (49%), Gaps = 84/540 (15%)

Query: 221 ISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLE 280
           I KT+R GGNVL+P DS+GRVLEL+ +L+ YW ++ L  PI  L  +S  T    ++ LE
Sbjct: 4   ILKTVRNGGNVLIPTDSSGRVLELMRVLDQYWIQNKLRDPIALLHDMSYYTPKAAQAMLE 63

Query: 281 WMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVE 340
           W  D I K+F+  R N F   H+ L+    ELD  P  PK+VLA+  SLE GF+ DIF+ 
Sbjct: 64  WCNDRIAKNFDVGRQNPFQFSHIHLVHTLEELDALP-SPKVVLATSPSLECGFAKDIFIR 122

Query: 341 WASDVKNLVLFTERGQFGTLA-RMLQADPPPKAVKV---TMSRRVPLVGEELIAYE-EEQ 395
           WA D +N ++FT      + A R+L+    P A KV   T++++V L G EL  YE +E+
Sbjct: 123 WAPDPRNSIIFTSTTPETSFASRVLKIAKDPSAAKVISCTVTKKVFLEGAELALYEVKER 182

Query: 396 TRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILI 455
            RL+ E   KA  ++E   +  +          M I+   + +  +       + R    
Sbjct: 183 KRLRTEAENKAKEIEEAAMEDMM----------MGIEDFESESEEEETTQQEVQLRGTFK 232

Query: 456 DGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKDEDM---DQAAMHIGGD-DGKL 511
            G    ++   PMF   E   EWD++GE+INPDD+  KD  +    QA  +I  D DG  
Sbjct: 233 VGLGQFASVRYPMFFAVEPKIEWDEYGEIINPDDF--KDATLLANRQARRNIIEDADGDE 290

Query: 512 DEGSA--SLILDAKPSKVVSNELTV---------------------------------LV 536
           D  SA      + +P+K ++NE+TV                                 LV
Sbjct: 291 DMESADKEAAAETRPTKTITNEVTVSIAARITQVDFDGIADGRAIRNCLGNVKPRKLILV 350

Query: 537 HGSAEATEHLKQHCLKHV--CPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLG 594
           HG+   T  LK+     +  C  V+TP + E ID+ SD   YK+ + E L ++     +G
Sbjct: 351 HGTETTTNELKKFVESSIPLCEAVFTPNVMECIDIESDTNVYKLSVKESLYTSA----VG 406

Query: 595 DYEIAWVDAEVGKTENGMLSLLPISTP------APPHKSVLVGD--LKMADLKPFLSSKG 646
            +E+A+V  ++   EN   S +P+  P         H+ +L+ D  +K+  +K  L   G
Sbjct: 407 SHEVAYVTGQLALPEN---SSVPVLQPLNENGGQTTHEPILLSDGKMKLDVMKQVLGKAG 463

Query: 647 IQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
            Q +F GG L C + V +++          +   +IV+EG L  +YY+IRA LY QF L+
Sbjct: 464 FQAKFRGGMLVCNDGVVLKR----------AMNNEIVMEGTLSRNYYRIRALLYEQFTLV 513


>gi|392568293|gb|EIW61467.1| hypothetical protein TRAVEDRAFT_162694 [Trametes versicolor
           FP-101664 SS1]
          Length = 943

 Score =  242 bits (618), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 237/956 (24%), Positives = 381/956 (39%), Gaps = 270/956 (28%)

Query: 5   VQVTPLSG---VFNENPLSYLVSIDGFNFLIDCG------------------WNDHFDPS 43
           +  TPLSG        PL+YL+ +D    L+DCG                  W  + D  
Sbjct: 2   ITFTPLSGAAGTVRTVPLAYLLQVDDVRILLDCGSPDWCPEPSSEEGDDVLSWTKYCDA- 60

Query: 44  LLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQY- 102
               L + A ++D VLLSH D  H G  PYA    GL+AP ++T P+  +      +   
Sbjct: 61  ----LKECAPSVDLVLLSHGDLSHSGLYPYAYSHWGLTAPAYTTLPIQAMAKTAATEDVE 116

Query: 103 ------------------------------------------LSRRQVSEFDLFTLDDID 120
                                                      S R V    + T+  + 
Sbjct: 117 AIRDEQPVEDIAPPSEESLAPEGSVSPSPNNATPPASSPTPSPSSRAVKHRYVATVQQVH 176

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKI-TKDGEDVIYAVDYNRRK 179
            AF SV  L YSQ  HL GK +G+ + P  AGH LGGT+WKI +     ++YAVD N  +
Sbjct: 177 DAFDSVNVLRYSQPCHLQGKCQGLTIIPFNAGHTLGGTIWKIRSPSAGTILYAVDMNHMR 236

Query: 180 EKHLNGTVL----------ESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAG 228
           E+HL+GTVL          ES  RP +LITDA  A      R+ R+    D ++ TL + 
Sbjct: 237 ERHLDGTVLIRQGSTGGVFESLARPDLLITDAERANVTTARRKDRDSALLDCVTATLSSR 296

Query: 229 GNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITK 288
            ++LLP DS+ RVLELL++L+ +W    L YPI  L+      + +V+S +EW+G +I+K
Sbjct: 297 NSLLLPCDSSTRVLELLVLLDQHWNYSRLKYPICLLSRTGREMLTFVRSMMEWLGGTISK 356

Query: 289 SFETSRDN----------------------AFLLKHVTLLINKSELDN--APDGPKLVLA 324
             +   D                       A   +H+    +   L +  +   PKL+LA
Sbjct: 357 E-DVGEDGTNHGRDRRRRDEDNDEEALGAFALRFRHLEFFSSPQALMSTYSTKDPKLILA 415

Query: 325 SMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML-------QADPPP------- 370
             A+L  G S  +F  +A    N+VL T R + GTL R+L       Q +          
Sbjct: 416 VPATLSHGPSRSLFAHFAEIPDNVVLLTGRSEPGTLGRILFDKWNNSQREEAKWDRGKIG 475

Query: 371 ------KAVKVTMSRRVPLVGEELIAY-EEEQTRLKKEEALKASLVKEEES--------- 414
                   +++ + ++VPL G+EL  +  +E+   +KE A +A+L + +           
Sbjct: 476 NNIMMDGVLRLEIHKKVPLQGDELEEFLAKERAVKEKEAAHQAALARTQRMLEADEGQSD 535

Query: 415 -----------------KASLGPDNNLSGDPMVIDANNANAS-----------------A 440
                            +  LG D   + D +       NA+                  
Sbjct: 536 SDSDDEDESDDDEEDEVERELGEDLMDATDDLKRSRQGPNATTRSGTKRKRGEGGGGDGT 595

Query: 441 DVV---EPHGGRYR---DILIDGFVPPSTSVAP----------MFPFYENNSEWDDFGEV 484
           D V   E   G  R   DI + G V  +TS             MFP+ E   + D++GE 
Sbjct: 596 DWVLGNEADEGATRISFDIYLKGNVAKATSFFKSADGQTQRFRMFPYVEKKRKVDEYGET 655

Query: 485 INPDDYIIKDEDMDQAAMHIGGDDGKL--DEGSASLILDAKPSKVVSN------------ 530
           ++   ++ K + +++ A      D +   +E  A       PSK V++            
Sbjct: 656 VDVGTWLRKGKVLEEDAEDEETKDARRRKEEEEAKKAPQEPPSKFVTSIAEVQLACRLFF 715

Query: 531 ---------------------ELTVLVHGSAEATEHLKQHC--LKHVCPHVYTPQIEETI 567
                                   +L+H    AT+ L + C  ++ +   +Y P   ET+
Sbjct: 716 VDLEGLNDGRAVKTIVPQVNPRKMILIHAPQAATDALIESCANIRAMTKEIYAPAQGETV 775

Query: 568 DVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGML-------------- 613
            +     ++ + LS++L++++   +  D E+ +V   +      M+              
Sbjct: 776 QIGQQTNSFSISLSDELLASIKMSRFEDNEVGYVAGRIASLATSMIPVLQPASSASLQTQ 835

Query: 614 --SLLPI------STPAPP-HKSVLVGDLKMADLKPFLSSKGIQVEFAG-GALRCG---- 659
             SL P+      S P  P  +S ++G+LK+  LK  L+  G+Q E  G G L CG    
Sbjct: 836 AASLQPVQVRMLGSRPKQPLPQSTMIGELKLTSLKARLAQVGVQAELVGEGVLICGAAAK 895

Query: 660 ---------EYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                    + V +RK          +G  ++ +EG + + YYK+R  +Y+   L+
Sbjct: 896 KGASADALEDSVAVRK----------TGRGRVELEGSISDIYYKVRKEIYALHALV 941


>gi|7243115|dbj|BAA92605.1| KIAA1367 protein [Homo sapiens]
          Length = 579

 Score =  239 bits (611), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 177/615 (28%), Positives = 284/615 (46%), Gaps = 147/615 (23%)

Query: 203 NALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPI 261
           NA + QP R+QR E     + +TLR  GNVL+ VD+AGRVLEL  +L+  W        +
Sbjct: 1   NATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTAGRVLELAQLLDQIWRTKDAGLGV 60

Query: 262 Y---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDG 318
           Y    L  VS + +++ KS +EWM D + + FE  R+N F  +H++L    S+L   P  
Sbjct: 61  YSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVP-S 119

Query: 319 PKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMS 378
           PK+VLAS   LE GFS D+F++W  D KN ++ T R   GTLAR L  +P  K  ++ + 
Sbjct: 120 PKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELR 179

Query: 379 RRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANA 438
           +RV L G+EL  Y E++   K+                        S +  +  ++ ++ 
Sbjct: 180 KRVKLEGKELEEYLEKEKLKKEAAKKLEQ-----------------SKEADIDSSDESDI 222

Query: 439 SADVVEPHGGRYR-DILIDG-------FVPPSTSVAPMFPFYENNSEWDDFGEVINPDDY 490
             D+ +P   + + D+++ G       F   +    PMFP  E   +WD++GE+I P+D+
Sbjct: 223 EEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYPMFPAPEERIKWDEYGEIIKPEDF 282

Query: 491 IIK-------------------DEDMDQ-------------AAMHIGGD------DGKLD 512
           ++                    DE MDQ              ++ I         +G+ D
Sbjct: 283 LVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKCISTTESIEIKARVTYIDYEGRSD 342

Query: 513 EGSASLILDA-KPSKVVSNELTVLVHGSAEATEHLKQHCL----KHVCPHVYTPQIEETI 567
             S   I++  KP ++      ++VHG  EA++ L + C     K +   VY P++ ET+
Sbjct: 343 GDSIKKIINQMKPRQL------IIVHGPPEASQDLAECCRAFGGKDI--KVYMPKLHETV 394

Query: 568 DVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVGKTENGML---------- 613
           D TS+   Y+V+L + L+S++ F K  D E+AW+D      V K + G++          
Sbjct: 395 DATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDMRVSKVDTGVILEEGELKDDG 454

Query: 614 -------------------------------------SLLPISTPAPP-----HKSVLVG 631
                                                 ++P   P PP     H+SV + 
Sbjct: 455 EDSEMQVEAPSDSSVIAQQKAMKSLFGDDEKETGEESEIIPTLEPLPPHEVPGHQSVFMN 514

Query: 632 DLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCED 691
           + +++D K  L  +GIQ EF GG L C   V +R+          + T +I +EG LC+D
Sbjct: 515 EPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR----------TETGRIGLEGCLCQD 564

Query: 692 YYKIRAYLYSQFYLL 706
           +Y+IR  LY Q+ ++
Sbjct: 565 FYRIRDLLYEQYAIV 579


>gi|326436560|gb|EGD82130.1| hypothetical protein PTSG_02804 [Salpingoeca sp. ATCC 50818]
          Length = 630

 Score =  236 bits (602), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 178/630 (28%), Positives = 288/630 (45%), Gaps = 103/630 (16%)

Query: 104 SRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKIT 163
           + R   +F  FTLDD+D AF ++TR+ YSQ  +L G G  I   P  AGH++GG+VW+IT
Sbjct: 21  AHRAQEDFSTFTLDDVDQAFDNITRIKYSQTVNLPGVGISITAYP--AGHMIGGSVWRIT 78

Query: 164 KDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQD---A 220
           KDGE+V+YAVDYN R+E HLN T L+    PA+LITD  N  +  P R  RE+      A
Sbjct: 79  KDGENVVYAVDYNHRREWHLNSTSLDILTWPAILITDTLNVAYTSPKR--REVLGQLLAA 136

Query: 221 ISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLE 280
           + ++L    NVL+  D+AGR  ELL +L+    + S     +F+   +   +D V + ++
Sbjct: 137 VRESLNKQANVLVLADTAGRSFELLQVLDQLAGKMSGASQFFFVGACTQVVMDTVTTMVD 196

Query: 281 WMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVE 340
           ++ D +       +   F   ++  + +   + NA  GPK+V+ +   LEAGFS  +F +
Sbjct: 197 FLSDGLQAQMNEHKAMPFRFPNIKRVQSLDAI-NAHPGPKVVVTAELGLEAGFSRQLFAQ 255

Query: 341 WASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKK 400
           WA++  N ++FT R    TLA  +  +  P  +++ +  RV L GEEL A+  E+   + 
Sbjct: 256 WAANPDNAIIFTRRPDEDTLAHSIYHNTAPDTLQLRLGARVELEGEELEAHRAER---EM 312

Query: 401 EEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFV- 459
            E +  +    + +   +G +       M +D      S+D  +       D+L   F  
Sbjct: 313 REHMDETAAASDAAADGMGRE-------MGMDVQEEQLSSDDEDHEPYERHDLL--AFTA 363

Query: 460 ----PPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIK-----DEDMDQAAMHIGGDDGK 510
               P       +FP   +  +WDD+G  ++   Y I+      E   + AM     D +
Sbjct: 364 SKAGPVQRRRNAVFPEDTHTMDWDDYGLKVDMSRYRIEVVPEAPEPAAETAM-----DQR 418

Query: 511 LDEGSASLILDAKPSKVVSN--ELT-------------------------------VLVH 537
            D  +    L  KP+KVV +  E++                               VLV 
Sbjct: 419 EDSSAILTALLEKPTKVVEHVVEISLKCKVHRFDVEGRTDGESMKRIMEHVKPRNLVLVQ 478

Query: 538 GSAEATEHLKQHCLKHV-CPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDY 596
           G    T+   + C   +   ++ TP     +++TS    ++V+L E L+S +  ++ GDY
Sbjct: 479 GPPAETKTFAEFCQSKLGIENIVTPAFGRPVEITSGRNIFQVKLREALVSALDLRRAGDY 538

Query: 597 EIAWVDAEVGK------------------------TENGMLSL----------LPISTPA 622
           E+AWVD  + K                         + G L+           L +    
Sbjct: 539 EVAWVDGVMAKGIKPAAPEGEGGDGEGGNGEGGEDADAGSLTSNIDMDAGVPELGVDEEP 598

Query: 623 PPHKSVLVGDLKMADLKPFLSSKGIQVEFA 652
            PH  V VGDL+++D K  L  +G +  F+
Sbjct: 599 EPHDVVFVGDLRLSDFKRLLIDEGYEPPFS 628


>gi|443926973|gb|ELU45512.1| cleavage and polyadenylation specificity factor subunit
           [Rhizoctonia solani AG-1 IA]
          Length = 854

 Score =  235 bits (600), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 225/859 (26%), Positives = 363/859 (42%), Gaps = 206/859 (23%)

Query: 18  PLSYLVSIDGFNFLIDCGWND-HFDPSL-------------------LQPLSKVASTIDA 57
           PL Y++ ID    L+DCG  D H +PS                     + L+  A T+D 
Sbjct: 18  PLCYILQIDDVRILLDCGAPDWHPEPSTETSSTPGESQQVEPHWVRYCEQLAVQAPTVDL 77

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD----- 112
           VLLSH D  H+G  PYA  + GL AP +++ PV  +G + + D   S R     D     
Sbjct: 78  VLLSHADVAHVGLFPYAHAKYGLRAPAYASLPVQAMGRMAVLDNIESIRSEEPVDDPANS 137

Query: 113 ----------------------LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHV 150
                                 + ++ + + AF S+  L YSQ  HL    +GI + P  
Sbjct: 138 DTGLDIALPTFGLTPDPSKQRKIASIKETNDAFDSLHALRYSQPAHL----QGITITPFS 193

Query: 151 AGHLLGGTVWKI-TKDGEDVIYAVDYNRRKEKHLNGTVL----------ESFVRPAVLIT 199
           AGH +GGT+WKI +     V+YAV+ N  KE+HL+GTVL          ES  RP +LIT
Sbjct: 194 AGHTIGGTIWKIRSPSAGTVVYAVNLNHTKERHLDGTVLLKGGAGGGVLESLSRPDLLIT 253

Query: 200 DAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLN 258
           DA   L     R+ R+    DA++  L++G +VL+P D++ R+LELL++ + +W+   L 
Sbjct: 254 DAERTLVVSARRKDRDAALLDAVTNVLQSGHSVLMPCDASTRILELLVLFDQHWSFSKLR 313

Query: 259 YPIYFLTYVSSSTIDYVKSFLEWMGDSITK--SFETSRDNAF---------LLKHVTLLI 307
            P+  ++  ++  +  V+S +EW G ++TK  +F+   +             L  + L  
Sbjct: 314 APLCLVSRTANDMLTLVRSMMEWFGGTVTKEEAFDAGNNKKRKRNQEGEDDALGTLALRF 373

Query: 308 NKSELDNAPDG---------PKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFG 358
              E+  +PD          PKL+L   A+L  G S  IF E+AS   N V+ +   + G
Sbjct: 374 KHLEIFPSPDALVSRYPSSMPKLLLVVPATLSHGNSRRIFAEFASVPGNAVILSTPSEPG 433

Query: 359 TLARML-------QADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEAL-KASLVK 410
           TLA  L       Q+D   +    ++ + + L     + Y E++   K+ +A  +A+L +
Sbjct: 434 TLANTLFNEWNLGQSD-NERFGHGSVGQPIQLNSTMTLTYLEKERAAKERQATQRAALAR 492

Query: 411 EEESKASLGPDNNLSGDPMVI---------DANNANASADVVEPHGGRYRDILIDGFVPP 461
            +    +   D++ S               D +N     D  E       DI + G V  
Sbjct: 493 SQRLLEADEADSDSSNSEADEEEVEDALGDDMDNGVPEGD--ESAKQLSFDIFLKGNVSR 550

Query: 462 STSVAP---------MFPFYENNSEWDDFGEVIN-------------------------- 486
           + S            MFP  E     D++GE I+                          
Sbjct: 551 AASFFKTAGQASRFRMFPHIERKRRVDEYGETIDVAAWLRKDRALAVAVEAEEAREAQQK 610

Query: 487 --------------PDDYIIKDEDMDQAAMHIGGDDGKLDEGSA--SLILDAKPSKVVSN 530
                         P  +I++  ++      +  D   L++G +  ++I    P K+   
Sbjct: 611 KQEEEEKSKTPAEPPSKFIVETIEVQLRCKLLFVDMDGLNDGRSVKTIIPQVNPRKM--- 667

Query: 531 ELTVLVHGSAEATEHLKQHCL--KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNV 588
              ++VH   EAT+ LK+ CL  K +   ++ P + + + +      + V LS++L+   
Sbjct: 668 ---IIVHSHREATDALKESCLSIKAMTRDIHAPDVGDVVQIGQQTNVFTVALSDELI--- 721

Query: 589 LFKKLGDYEIAWVDAEVGKTENGMLSLL----PIST-------PAPPHKSVL-------V 630
                 D EI +V   V    N  +S+L    P+S+       PA   + VL       +
Sbjct: 722 ----FEDNEIGFVHGRVTGNANSTVSVLEPTMPVSSSGDAENIPASDVRPVLSLPWSTMI 777

Query: 631 GDLKMADLKPFLSSKGIQVEFAG-GALRCG--------EYVTIRKVGPAGQKGGGSGTQQ 681
           GDL++  LK  L   GI  EF G G L CG        + V +RK          +   Q
Sbjct: 778 GDLRLTALKTRLGVLGIAAEFIGEGVLVCGTRTSGTLDDVVAVRK----------TARGQ 827

Query: 682 IVIEGPLCEDYYKIRAYLY 700
           +V+EG + + YY +R  +Y
Sbjct: 828 VVVEGSISDVYYTVRREVY 846


>gi|402226056|gb|EJU06116.1| hypothetical protein DACRYDRAFT_73414 [Dacryopinax sp. DJM-731 SS1]
          Length = 925

 Score =  230 bits (587), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 234/935 (25%), Positives = 368/935 (39%), Gaps = 258/935 (27%)

Query: 8   TPLSGVFNE----NPLSYLVSIDGFNFLIDCGWND----------------------HFD 41
           TPL G        N   YL+ ID    L+DCG  D                       + 
Sbjct: 5   TPLCGSAQSTSVPNAFCYLLQIDDIRVLLDCGAPDWRLGAGEDVEGEDEAASRRETKKWW 64

Query: 42  PSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQ 101
              L  L++ A  ID VL +H    H+G   YA  +LGLSAP F+T PV  LG + + + 
Sbjct: 65  SEYLSLLTRTAPEIDLVLFTHGSLQHIGLYSYARAKLGLSAPAFATLPVQALGRIAVLED 124

Query: 102 YLSRRQVSEFD-------------------------LFTLDDIDSAFQSVTRLTYSQNYH 136
               R   + D                         + T D +  AF S+T L YSQ   
Sbjct: 125 VEGWRAEVDVDNEVPEEYSGDGDVKMESGIQLLHKAIATADVVKEAFDSITTLKYSQATQ 184

Query: 137 LSGKGEGIVVAPHVAGHLLGGTVWKI-TKDGEDVIYAVDYNRRKEKHLNGTVL------- 188
           L+GK + + +  + A H LGGT+WK+ +     ++YAV  N  KE+HL+GT L       
Sbjct: 185 LTGKLQALTLTAYSASHTLGGTLWKLRSASSGTLLYAVGLNHMKEQHLDGTALVRPGGGG 244

Query: 189 --ESFVRPAVLITDAYN-ALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELL 245
             E   RP +LITDA    + +   R++ E F ++I+ TLR+ G+VL+PVD++ R++ELL
Sbjct: 245 VGEGLGRPDLLITDAGRVGIISVRRREREEAFLESITNTLRSSGSVLIPVDASTRLVELL 304

Query: 246 LILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET---SRDNA----- 297
           +IL+ +W +     P+  ++      + +V+S +EWMG  IT+  E     +D+      
Sbjct: 305 IILDQHWTQAKTRAPLCLVSRTGKECVTFVRSLMEWMGGWITREGEVPTIGKDSKKRKRR 364

Query: 298 -------------------FLLKHVTLLINKSELDNA--PDGPKLVLASMASLEAGFSHD 336
                                 KH+ +  +   L +A  P  PK++LA+  ++  G S  
Sbjct: 365 NRKDEEDIEEEDALLANMILRFKHLQIFPSPEALMDAIHPSAPKVILATPLTMSHGASRA 424

Query: 337 IFVEWASDVKNLVLFTERGQFGTLARML-------QADPPP----------KA---VKVT 376
           +F  ++S   NL+L     + GTLAR L       QA+             KA   + V 
Sbjct: 425 MFESFSSMRNNLLLLVNIAEKGTLARSLWDIWQREQAETAKWGKGRLGAIVKAETDISVR 484

Query: 377 MSRRVPLVGEELIAY-------------------------EEEQTRLKKEEALKASLVKE 411
           M+ +VPL G EL  Y                         +++     ++EA  AS    
Sbjct: 485 MNAKVPLAGVELEEYLNAEKAAKEKAAAEAAARPQLLLEADDDDEGDSEDEASDASSELA 544

Query: 412 EESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR---DILIDGFVPPST----- 463
            E +   G D  ++       +    A A+  E    R +   DI + G V  +T     
Sbjct: 545 VEEELGGGTDEGVATRHFAEGSGAKGAGAEEEEADSARQQLSFDIYLKGKVARATFFKSS 604

Query: 464 -----SVAPMFPFYENNSEWDDFGEVIN-------------------------------- 486
                +   MFP+ E     D++GE I+                                
Sbjct: 605 SGAQATRYRMFPYVEKRRRIDEWGETIDVGTWMRRGKKWEEEEETEENQAAKEARRKRQE 664

Query: 487 -----------PDDYIIKDEDMDQAAMHIGGDDGKLDEGSAS--LILDAKPSKVVSNELT 533
                      P  YI +   +D        D   L++G A+  ++    P K+      
Sbjct: 665 EEQAQHAPPEPPSKYITEQHSIDVRCKVYFVDFEGLNDGRATKMIVPQVNPRKM------ 718

Query: 534 VLVHGSAEATEHLKQHC--LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFK 591
           +LV    EAT  L Q C  ++ +   + TP + E + +     +Y + + E L S +   
Sbjct: 719 ILVASQPEATAELMQACGEIRSMTREISTPGVGEEVKIGEHSHSYSISVGETLFSTLKMS 778

Query: 592 KLGDYEIAWVDAEVGKTENGMLSLLPISTPAPPHKS------------------------ 627
           K  D E+A+V   +    N   S +P+  PA   KS                        
Sbjct: 779 KFEDNEVAFVSGRIAFNPN---SAIPVLEPAASAKSQDSAVVPTGTDQAREEQTMIATVP 835

Query: 628 -------VLVGDLKMADLKPFLSSKGIQVEFAG-GALRCG-----------EYVTIRKVG 668
                   L+GDL++  LK  LS+ GI  +FAG G L CG           + V++RK+G
Sbjct: 836 AQILPQTTLIGDLRLTALKARLSTLGITADFAGEGVLICGLSQTGNGGSDTDIVSVRKMG 895

Query: 669 PAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQF 703
                       ++ + G + + YY +R  LY  +
Sbjct: 896 RG----------RVEVAGNVSDVYYTVRRELYGLY 920


>gi|301092285|ref|XP_002997001.1| cleavage and polyadenylation specificity factor subunit, putative
           [Phytophthora infestans T30-4]
 gi|262112190|gb|EEY70242.1| cleavage and polyadenylation specificity factor subunit, putative
           [Phytophthora infestans T30-4]
          Length = 222

 Score =  224 bits (572), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 105/213 (49%), Positives = 146/213 (68%), Gaps = 2/213 (0%)

Query: 5   VQVTPLSGVFNENPL-SYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHP 63
           +  TPL GV +  P  +YL+ +D    L+DCGW D +D  LL+PL +V   ID VL+SH 
Sbjct: 4   ITFTPLYGVHSTAPCCAYLLEVDEVCILLDCGWTDAYDVELLKPLQRVVDRIDLVLVSHL 63

Query: 64  DTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSR-RQVSEFDLFTLDDIDSA 122
           D  H+GALPYAM +LGLSAPV+ T PV+R+G + +YD + ++ +  S+F LF+LDD+D  
Sbjct: 64  DLAHMGALPYAMGKLGLSAPVYGTLPVHRMGQIALYDAFQAKTKHDSDFSLFSLDDVDLV 123

Query: 123 FQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKH 182
           F+   +L YS+   L+  GEGIV+ PHVAGHL+GG +W+I K+ +D+IYAVDYN R E  
Sbjct: 124 FERFKQLKYSEKLTLTSSGEGIVITPHVAGHLIGGALWRIMKETDDIIYAVDYNHRSEHV 183

Query: 183 LNGTVLESFVRPAVLITDAYNALHNQPPRQQRE 215
           L  T+L+SF RP +LITD+ N    QP  + R+
Sbjct: 184 LQKTILDSFTRPTLLITDSMNLHAEQPKLKDRD 216


>gi|348689663|gb|EGZ29477.1| hypothetical protein PHYSODRAFT_473604 [Phytophthora sojae]
          Length = 221

 Score =  224 bits (570), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 104/213 (48%), Positives = 146/213 (68%), Gaps = 2/213 (0%)

Query: 5   VQVTPLSGVFNENPL-SYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHP 63
           +  TPL GV +  P  +YL+ +D    L+DCGW D +D  LL+PL +V   ID VL+SH 
Sbjct: 4   ITFTPLYGVHSSAPCCAYLLEVDEVCILLDCGWTDEYDVELLKPLQRVVDRIDLVLVSHL 63

Query: 64  DTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSR-RQVSEFDLFTLDDIDSA 122
           D  H+GALPYAM +LGL+APV+ T PV+R+G + +YD + ++ +  S+F LF+LDD+D  
Sbjct: 64  DLAHMGALPYAMGKLGLNAPVYGTLPVHRMGQIALYDAFQAKTKHDSDFSLFSLDDVDLV 123

Query: 123 FQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKH 182
           F+   +L YS+   L+  GEGIV+ PHVAGHL+GG +W+I K+ +D+IYAVDYN R E  
Sbjct: 124 FERFKQLKYSEKLTLTSSGEGIVITPHVAGHLIGGALWRIMKETDDIIYAVDYNHRSEHV 183

Query: 183 LNGTVLESFVRPAVLITDAYNALHNQPPRQQRE 215
           L  T+L+SF RP +LITD+ N    QP  + R+
Sbjct: 184 LQKTILDSFTRPTLLITDSMNLHAEQPKLKDRD 216


>gi|390601510|gb|EIN10904.1| hypothetical protein PUNSTDRAFT_112695 [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 937

 Score =  223 bits (568), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 233/933 (24%), Positives = 372/933 (39%), Gaps = 231/933 (24%)

Query: 5   VQVTPLSG---VFNENPLSYLVSIDGFNFLIDCG---WNDHFDPS--------------- 43
           +  TPLSG        PL+YL+ +D    L+DCG   W     PS               
Sbjct: 2   ITFTPLSGGAKSTRTTPLAYLLQVDDVRILLDCGSPDWCPERSPSSSAVTTESLSYPWDE 61

Query: 44  LLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYL 103
               L + A ++D VLLSH D  H G   YA  + GL AP ++T PV  +  +   +   
Sbjct: 62  YCDALRENAPSVDLVLLSHADLAHSGLYAYAYSRWGLKAPTYTTLPVQAMARVATLEDVE 121

Query: 104 SRRQVSEFD----------------------------------LFTLDDIDSAFQSVTRL 129
             R   + D                                  + T  ++  AF SV  L
Sbjct: 122 GVRDEEDVDPPEQQDEDQAEGDGDEKAFEGEKTKPVQRKTRKYVATAFEVHEAFDSVNTL 181

Query: 130 TYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKI-TKDGEDVIYAVDYNRRKEKHLNGTVL 188
            YSQ  HL GK +GI + P  AGH LGG +WKI +     ++YAV+ N  +E+HL+GTVL
Sbjct: 182 RYSQPCHLQGKCQGITITPFNAGHTLGGAIWKIRSPSAGTIVYAVNLNHMRERHLDGTVL 241

Query: 189 ---------ESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSA 238
                    E   RP +LITDA         R+ R+    D I+  L    ++ +P DS+
Sbjct: 242 IRPGGGGVFEPLARPDLLITDAERTNVVSSRRKDRDAALIDTITAALARRSSLFMPCDSS 301

Query: 239 GRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS--------- 289
            R+LELL++L+ +WA   L YPI  L+      + +V++ +EW+G +I+K          
Sbjct: 302 TRLLELLVLLDQHWAYQRLRYPICLLSRTGREMLTFVRAMMEWLGGTISKEDVGVGEDGQ 361

Query: 290 ------FETSRDN------------AFLLKHVTLLINKSELDN--APDGPKLVLASMASL 329
                     R N            A   +H+    N   L N  +   PKL+LA  ASL
Sbjct: 362 GGGKQDKRRRRVNDDEEGEDALGALALRFRHLEFFPNPQALLNTYSSKDPKLILAVPASL 421

Query: 330 EAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML--------QADPPPKAVKV------ 375
             G S  +F  +A+   N+++ T+RG+ GTL   L        +A+      K+      
Sbjct: 422 SHGPSRALFSTFAAVPDNVIILTQRGEEGTLGNDLFKKWNNSQRAEHKWDKGKIGSNVML 481

Query: 376 ------TMSRRVPLVGEELIAY-EEEQTRLKKEEALKASLVKEEES-------------- 414
                  M+ +VPL G+EL A+  +E+  ++KE A K +   E++               
Sbjct: 482 DGNMILKMNSKVPLQGDELEAFLAKERAAMEKEAAEKTADDFEQQRMLEADEEDTDTDED 541

Query: 415 -KASLGPDNNLSGDPMVIDAN-NANASADVVEPHGGRYR--------------------- 451
                  + +L+ D    + + +A A     EP G   R                     
Sbjct: 542 SDDEDEVERSLAADVAEAEPDPDAPAGGAFAEPGGQSRRSKRVRGVDDADWGLDADEGLN 601

Query: 452 ------DILIDGFVPPSTSVAP-----------MFPFYENNSEWDDFGEVINPDDYIIKD 494
                 D+ I G V  + S              MFP+ E     DD+GE+I+   ++ K 
Sbjct: 602 RQVLSFDVYIKGNVSRAASFFKSADGQSQQRFRMFPYIEKKRRVDDYGELIDVGMWLRKG 661

Query: 495 EDMDQAAMHIGGDDGKLDEGSASLILDA---KPSKVVSNELTV----------------- 534
           +  ++ A      + K ++      + A    PSK VS+E+ V                 
Sbjct: 662 KVFEEEAESNESKELKRNQAEEEAKVSAFEEPPSKFVSSEVEVQLACRLLFVDMEGLNDG 721

Query: 535 ----------------LVHGSAEATEHLKQHC--LKHVCPHVYTPQIEETIDVTSDLCAY 576
                           +VH   EAT  L + C  ++ +   +Y P++ +++ +     ++
Sbjct: 722 RAVKTIVPQVNPRKMIIVHAPTEATGSLIESCGNIRAMTKEIYAPELLQSVSIGQQTNSF 781

Query: 577 KVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLL-PI----------STPAPPH 625
            + LSE L++++      D E+ +V   V       + +L P+          + PA P 
Sbjct: 782 SISLSEDLITSIKMSSFEDNEVGYVTGRVAIHAGSAVPVLEPLAGSAATRKTKTLPARPG 841

Query: 626 -----------KSVLVGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKVGPAGQK 673
                      +S L+G+LK+  LK  L+S GI+ E  G G L CG+  +  +       
Sbjct: 842 VIGMRAPIDLPRSTLIGELKLTTLKSRLASVGIRAELVGEGVLICGKRRSASEPLEGTVA 901

Query: 674 GGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
              S    + +EG   + YY +R  +Y    L+
Sbjct: 902 VRKSTRGHVELEGTASDVYYIVRREIYKLHALV 934


>gi|320163729|gb|EFW40628.1| cleavage and polyadenylation specificity factor [Capsaspora
           owczarzaki ATCC 30864]
          Length = 744

 Score =  221 bits (564), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 124/327 (37%), Positives = 197/327 (60%), Gaps = 19/327 (5%)

Query: 93  LGLLTMYDQYLSRRQVS-EFDL-FTLDDIDSAFQSVTRLTYSQNY--HLSGKGEGIVVAP 148
           +G + MYD ++S  ++  E  L FTLDD+D+AF+ +T L + Q     L  K + I + P
Sbjct: 1   MGQMFMYDLWMSHAEMQGEGALPFTLDDVDAAFERITTLKFQQRVVVPLGAKTKPITIIP 60

Query: 149 HVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLE---SFVRPAVLITDAYNAL 205
           H AGH++GGT+W+I  +GED++YAVD+N + E+HLN T L+    + RP++LI++++N  
Sbjct: 61  HAAGHMVGGTIWRIITEGEDIVYAVDFNHQLERHLNPTELKDLFQYERPSILISNSFNYG 120

Query: 206 HNQPPRQQRE-MFQDAISKTL------RAGGNVLLPVDSAGRVLELLLILEDYWAEHSLN 258
               PR+ R+ +F D+I  TL       AGG+VL+P D+AGRVLEL  +L+  W ++  N
Sbjct: 121 AESVPRKTRDRLFLDSIVNTLINPKDGSAGGSVLIPTDTAGRVLELAQVLDKQWEKYK-N 179

Query: 259 YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDN-APD 317
           +PI  L+++S + +++  + +EWM   + K FET+R N F   H+ +     EL   A +
Sbjct: 180 FPIVVLSHISRTVMNFAMAQIEWMSAKMQKEFETTRSNPFSFAHIKMCQTMEELAQVAKE 239

Query: 318 G-PKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVT 376
           G P +VLASM  L +GF+ D+ ++WA + KNL++F        LA+ L      + + + 
Sbjct: 240 GTPVVVLASMEGLTSGFARDLMLKWAENPKNLIIFPNNSPASDLAKSLVEK--NRQIVID 297

Query: 377 MSRRVPLVGEELIAYEEEQTRLKKEEA 403
           +  R+ L GEEL  Y  EQ   + E A
Sbjct: 298 VKTRIALEGEELDEYLREQEEAEMELA 324



 Score = 97.8 bits (242), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 88/310 (28%), Positives = 125/310 (40%), Gaps = 78/310 (25%)

Query: 463 TSVAPMFPFYENN-SEWDDFGEVINPDDYIIKDEDMDQA-----------------AMHI 504
           T   PMFPF E +  + D++GEVI   DY I  E+                     AM  
Sbjct: 447 TRTFPMFPFVEQHRKKADEWGEVIRRSDYQILTEEFTDTLKPLASTSSSAGTSHATAMVT 506

Query: 505 GGDDG------KLDEGSASLILDA----KPSKVVSNELT--------------------- 533
           G ++       KLD       L A    +PSK VS ++                      
Sbjct: 507 GEEETGLESTLKLDTSQIKQQLHATAHNRPSKTVSKQVALQIQCTVKHVDLEGRADSMSL 566

Query: 534 ------------VLVHGSAEATEHLKQHCLKHVCPH--VYTPQIEETIDVTSDLCAYKVQ 579
                       +LVHGSA ++  L +  L+   P   V    +  TID +S+   Y+V+
Sbjct: 567 ATIFESVNARQLILVHGSATSSNEL-ESALRVKMPQCKVTIAALNTTIDASSEHNIYQVR 625

Query: 580 LSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPA---PPHKSVLVGDLKMA 636
           L + LMS + F   G +E+A+   ++     G  +L     PA   P H  V VGD K+ 
Sbjct: 626 LRDSLMSTLKFSTTGMFELAYFHGQIHVPTGGKTTLELDVLPAHLVPGHAQVFVGDPKLY 685

Query: 637 DLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIR 696
           ++K  L   G   EF  G L C + + IRK             Q   IEG L EDY+ +R
Sbjct: 686 EVKEVLIEAGFHAEFVQGVLVCNDTIAIRK-----------QDQAFAIEGGLSEDYFAVR 734

Query: 697 AYLYSQFYLL 706
             LY QF ++
Sbjct: 735 DVLYDQFAIV 744


>gi|422293869|gb|EKU21169.1| cleavage and polyadenylation specificity factor subunit 2
           [Nannochloropsis gaditana CCMP526]
          Length = 925

 Score =  221 bits (563), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 149/437 (34%), Positives = 233/437 (53%), Gaps = 31/437 (7%)

Query: 2   GTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLS 61
           G  +    L GV    PL YL+ +     L+DCGW+   D +LL+PL  V   +  VLLS
Sbjct: 59  GEGLTFRVLYGVLEHEPLCYLLKVGEATLLLDCGWDVQLDEALLEPLLPVLPQVQLVLLS 118

Query: 62  HPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSR-----RQVSEFDLFTL 116
            PD  H+GALP+  K L    P+++T+PV+++  + +YD YL++        +    FTL
Sbjct: 119 FPDLSHMGALPWVAKHLRPGVPIYTTQPVFKMAQMVLYDLYLNKCMDTASGAAGCPAFTL 178

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEG-IVVAPHVAGHLLGGTVWKIT-KDGEDVIYAVD 174
           D++D+A      L +SQ   +  +G   + V P+ AG +LGG  W++  K  E+++YAVD
Sbjct: 179 DEVDAAMARFQLLKFSQPLEVRQQGRFYLSVTPYPAGRILGGCFWRVNYKKMEEIVYAVD 238

Query: 175 YNRRKEKHLNGTVLESF--------VRPAVLITDAYNALH-NQPPRQQREMFQDAISKTL 225
           +N + E+HL G V E+F         RP + ITDA  + + +   R+    F  A + TL
Sbjct: 239 FNLKSERHLTGAV-EAFNALSADKEQRPCLFITDARPSPNLSTDERKVETEFLAAATGTL 297

Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWAEHSL--NYPIYFLTYVSSSTIDYVKSFLEWMG 283
           R GG+VL+PV+++GR  ELLL L  +W    L   Y I  L +++ + + + KS +E+M 
Sbjct: 298 RKGGHVLIPVETSGRAQELLLALNGHWRSDRLLWGYKIVLLHHMARNVLHFTKSMVEYMH 357

Query: 284 DSITKSFETSRDNAFLLKHVTLLINKSELDNAPD---GPKLVLASMASLEAGFSHDIFVE 340
             + + F+ S  N F LKHV    +  EL+ A      P +VLAS   ++ GFS  +   
Sbjct: 358 PEVIRDFDRSLRNPFSLKHVVPAQSMLELEAAMGEYRNPVVVLASDEGMDTGFSRALATR 417

Query: 341 WASDVKNLVLFTERGQFGTLARML-QADPPPKAVKVTMSRRVP----LVGEELIAYEEEQ 395
           WAS  +N +L     + G+LA    +    PKA    +S  VP    +VGEEL    E++
Sbjct: 418 WASGPENALLLCGHLRKGSLAESFWKLRHLPKA---ALSFSVPVIERIVGEELAGLREKE 474

Query: 396 TRLKKEEALKASLVKEE 412
            R ++ +AL+A   + +
Sbjct: 475 DR-ERRKALEAEEFRRQ 490


>gi|388579716|gb|EIM20037.1| hypothetical protein WALSEDRAFT_61199 [Wallemia sebi CBS 633.66]
          Length = 844

 Score =  219 bits (557), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 212/845 (25%), Positives = 357/845 (42%), Gaps = 160/845 (18%)

Query: 4   SVQVTPLSGVFNEN--------PLSYLVSIDGFNFLIDCG---W--NDHFDPSLLQPLSK 50
           ++ VTPL+G    N        P  YL+ I+    L+DCG   W  ND       + L +
Sbjct: 2   AITVTPLAGSGRVNTEERNTGEPFCYLLEIEDARILLDCGSRDWEANDESAFYYEKKLRE 61

Query: 51  VASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSE 110
           +A TID VLLSH  T H G   YA    GL  P + + PV  L  L+  +  +  R   +
Sbjct: 62  IAPTIDLVLLSHASTKHSGFYAYAYTHYGLKCPAYCSLPVKELARLSTLEDIIGWRGERD 121

Query: 111 FDLFTLDD----------IDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVW 160
            +    DD            +A+ SV  + Y Q  HL GK  G+ +  + +GH LGGT+W
Sbjct: 122 IEGLHNDDELWCVPTREENRAAWTSVKDVRYHQPQHLYGKLRGVTITAYSSGHTLGGTLW 181

Query: 161 KITKDG-EDVIYAVDYNRRKEKHLNGTVL-----------ESFVRPAVLITDAYNALHNQ 208
           KI       ++YAV  N  KE+HL+GT L           E  VRP ++ITD+       
Sbjct: 182 KIRAPSVGTILYAVGINHMKERHLDGTALIRGDQGGLTVHEQLVRPGLVITDSERGDCVN 241

Query: 209 PPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA-----EHSLNYPIY 262
             R+ R+    D I++TL++G ++LLP D   R+LELL++L+ +W      + S   P+ 
Sbjct: 242 AKRKDRDAALLDIINRTLQSGNSLLLPCDPTSRILELLVLLDQHWTYIRDKDPSFRIPLC 301

Query: 263 FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA----------FLLKHVTLLINKSEL 312
            ++   +  + +V+  +E+ G + T + + SR+ A             K + +  +   L
Sbjct: 302 LISNTGTDMLKFVRGLMEFFGGA-TAAGDNSREEAERRYKENRGVLDFKTLNIFTSVDAL 360

Query: 313 DNA-PDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML------- 364
           + A P  PKLVLA   S+  G S  +F  ++++  N ++ T RG  G+LAR L       
Sbjct: 361 EAAYPGTPKLVLAVPYSMSYGGSRRLFHSFSNNPGNAIVLTSRGAPGSLARDLFDRWNGK 420

Query: 365 -----------QADPPPKAVKVTMSRRVPLVGEELIAYE-EEQTRLKKEEALKASLVKEE 412
                      +A      + +T   +VPL+GEEL AY+  E+   ++E A +A+  +  
Sbjct: 421 QNDKWGSGKLGEAVQGDWNIPITEHSKVPLLGEELEAYQATERINREQEAARQAADSRRR 480

Query: 413 E-----------------SKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILI 455
                               +S   D+ +          + N    +         DI +
Sbjct: 481 RMMEADAQEEDDEEDDFEGDSSSDEDDKVVEKEEQQKEEDGNGLQQIS-------YDIYL 533

Query: 456 DG--------FVPPSTSVAP---MFPFYENNSEWDDFGEVINPDDYIIKDEDMDQAAMHI 504
            G        F   +   AP   MFPF +   + D +GEVI+ + ++ +  ++++ A+  
Sbjct: 534 KGHSTRGATSFFKSAQGSAPRFRMFPFNDIKRKMDSYGEVIDAESWVSRGRELERQAIEQ 593

Query: 505 GGDD----GKLDEGSASLILDAKPSKVVSNELTV-------------------------- 534
             +      K++E + +  L+  PSK +S  + V                          
Sbjct: 594 DQEHEAKRRKMEEEADATPLEP-PSKYISENVEVGVNCQVMYIDLEGLNDSRAIKNIMPR 652

Query: 535 -------LVHGSAEATEHLKQ--HCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLM 585
                  LV G+  ++  L      +  +   +Y P + ETI +     +Y   L + L+
Sbjct: 653 LNPRKMILVGGTQTSSNSLINAFEAISAMTKDIYVPNMGETIKIGEHTHSYTFTLGDSLV 712

Query: 586 SNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPPH------KSVLVGDLKMADLK 639
           +NV      D+ +     ++   E  ++    ++T A          S+ +GD+K+  LK
Sbjct: 713 NNVHMAPFEDFVVGHAIGKMAYHEEALVPTFEVATSAAQETTANVPTSLYIGDMKLTSLK 772

Query: 640 PFLSSKGIQVEFAG-GALRCGEYVTIRKVGPA---GQKGGGSGTQQIVIEGPLCEDYYKI 695
             L   G+  EF G G L C   +   +   A     KG  + T  ++ +G +   YY +
Sbjct: 773 AKLVGLGLSAEFGGEGVLVCWNEMNSEEGAVAISKNSKGELNMTSSLIGDGDI---YYTV 829

Query: 696 RAYLY 700
           R  +Y
Sbjct: 830 RDAVY 834


>gi|422294077|gb|EKU21377.1| cleavage and polyadenylation specificity factor subunit 2, partial
           [Nannochloropsis gaditana CCMP526]
          Length = 429

 Score =  215 bits (547), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 145/418 (34%), Positives = 221/418 (52%), Gaps = 30/418 (7%)

Query: 2   GTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLS 61
           G  +    L GV    PL YL+ +     L+DCGW+   D +LL+PL  V   +  VLLS
Sbjct: 16  GEGLTFRVLYGVLEHEPLCYLLKVGEATLLLDCGWDVQLDEALLEPLLPVLPQVQLVLLS 75

Query: 62  HPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQ-----VSEFDLFTL 116
            PD  H+GALP+  K L    P+++T+PV+++  + +YD YL++        +    FTL
Sbjct: 76  FPDLSHMGALPWVAKHLRPGVPIYTTQPVFKMAQMVLYDLYLNKCMDTASGAAGCPAFTL 135

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEG-IVVAPHVAGHLLGGTVWKIT-KDGEDVIYAVD 174
           D++D+A      L +SQ   +  +G   + V P+ AG +LGG  W++  K  E+++YAVD
Sbjct: 136 DEVDAAMARFQLLKFSQPLEVRQQGRFYLSVTPYPAGRILGGCFWRVNYKKMEEIVYAVD 195

Query: 175 YNRRKEKHLNGTVLESF--------VRPAVLITDAYNALH-NQPPRQQREMFQDAISKTL 225
           +N + E+HL G V E+F         RP + ITDA  + + +   R+    F  A + TL
Sbjct: 196 FNLKSERHLTGAV-EAFNALSADKEQRPCLFITDARPSPNLSTDERKVETEFLAAATGTL 254

Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWAEHSL--NYPIYFLTYVSSSTIDYVKSFLEWMG 283
           R GG+VL+PV+++GR  ELLL L  +W    L   Y I  L +++ + + + KS +E+M 
Sbjct: 255 RKGGHVLIPVETSGRAQELLLALNGHWRSDRLLWGYKIVLLHHMARNVLHFTKSMVEYMH 314

Query: 284 DSITKSFETSRDNAFLLKHVTLLINKSELDNAP---DGPKLVLASMASLEAGFSHDIFVE 340
             + + F+ S  N F LKHV    +  EL+ A      P +VLAS   ++ GFS  +   
Sbjct: 315 PEVIRDFDRSLRNPFSLKHVVPAQSMLELEAAMGEYRNPVVVLASDEGMDTGFSRALATR 374

Query: 341 WASDVKNLVLFTERGQFGTLAR-MLQADPPPKAVKVTMSRRVP----LVGEELIAYEE 393
           WAS  +N +L     + G+LA    +    PKA    +S  VP    +VGEEL    E
Sbjct: 375 WASGPENALLLCGHLRKGSLAESFWKLRHLPKA---ALSFSVPVIERIVGEELAGLRE 429


>gi|302694097|ref|XP_003036727.1| hypothetical protein SCHCODRAFT_72177 [Schizophyllum commune H4-8]
 gi|300110424|gb|EFJ01825.1| hypothetical protein SCHCODRAFT_72177 [Schizophyllum commune H4-8]
          Length = 913

 Score =  213 bits (541), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 229/909 (25%), Positives = 373/909 (41%), Gaps = 212/909 (23%)

Query: 8   TPLSGVFNEN---PLSYLVSIDGFNFLIDCG---WNDHFDPSLLQP-------------L 48
           TPL+G    N   PL +++ +D    L+DCG   W+     S ++              L
Sbjct: 5   TPLAGAACSNRTTPLCFILQVDDVKILLDCGSPDWSPEPSTSEVKVEDTSYSWEEYCSIL 64

Query: 49  SKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQV 108
            + A+++D VLLSH D  H G  PYA  + GL A  ++T PV  +  +   +     R  
Sbjct: 65  RQHAASVDLVLLSHGDLQHSGLYPYAYSRWGLKAQTYTTLPVQAMARIAAAEDVEGLRDE 124

Query: 109 SEFD-------------------------------------LFTLDDIDSAFQSVTRLTY 131
            + D                                     + TL ++  AF SV  L Y
Sbjct: 125 EDVDAEGLLVPEATQPTEEQPEGQEEGEKQEPKMRKLRGKYVATLQEVQDAFDSVNVLRY 184

Query: 132 SQNYHLSGKGEGIVVAPHVAGHLLGGTVWKI-TKDGEDVIYAVDYNRRKEKHLNGTVL-- 188
           SQ  HL GK +GI + P  AGH LGGT+WKI +     ++YAV+ N  +E+HL+GTVL  
Sbjct: 185 SQPCHLQGKCQGITITPFNAGHTLGGTIWKIRSPSSGTILYAVNMNHMRERHLDGTVLIR 244

Query: 189 ------ESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRV 241
                 E   RP + ITDA  A      R+ R+    D ++  L +  ++LLP DS  R+
Sbjct: 245 QAGGIFEPLARPDLFITDADRANVITSRRKDRDASLIDTVTTALSSRSSLLLPCDSGTRL 304

Query: 242 LELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS------------ 289
           LELL++L+ +W    L YPI  ++      + +V+S +EW+G +I+K             
Sbjct: 305 LELLVLLDQHWNYSRLRYPICLVSRTGREMLTFVRSMMEWLGGTISKEDVGEDGMKGRHG 364

Query: 290 ---FETSRDN------AFLLK--HVTLLINKSEL--DNAPDGPKLVLASMASLEAGFSHD 336
                   DN      AF L+  H+        L    +   PKL+LA   +L  G S  
Sbjct: 365 NKRKRADDDNDEDALGAFALRFQHLEFFPTPQALLQTYSSKDPKLILAVPLNLSHGPSRS 424

Query: 337 IFVEWASDVKNLVLFTERGQFGTLARML--------QADPPPKAVKV------------T 376
           IF E+A+   N++L T+RG  GTLAR L        +A+      KV             
Sbjct: 425 IFSEFAAIPDNVILLTQRGDPGTLARALFEKWNDSQRAEAKWDKGKVGSNVMLDDNLTLK 484

Query: 377 MSRRVPLVGEELIAYEEEQ------------TRLKKEEALKA----SLVKEEESKASLGP 420
           M R+VPL G+EL AY  ++               + +  L+A    S    +        
Sbjct: 485 MRRKVPLQGDELEAYLAKERAAKEKEAAQQAAAARNQRMLEADEGDSESDSDSDGEDDAS 544

Query: 421 DNNLSGDPMVIDANNANASADVV-------EPHGGRYRDILIDGFVPPSTSV-------- 465
           +   + + M +DA      AD          P      DI + G V  +TS         
Sbjct: 545 EKAFNEEVMDLDAERRKGEADWAGLDGDDEHPKQLVSFDIYLKGNVSKATSFFRNAGAAA 604

Query: 466 ---APMFPFYENNSEWDDFGEVINPDDYIIKDEDMDQAAMHIGGDDGKL---DEGSASLI 519
                MFP+ E     D++GE ++   ++ K +  ++ A      + +    +E  A   
Sbjct: 605 QQRFRMFPYVEKKRRVDEYGETVDVGMWLRKGKVFEEEAESEEVKEARRKQQEEEEAKKA 664

Query: 520 LDAKPSKVVSNELTV---------------------------------LVHGSAEATEHL 546
           +   PSK V  E+ V                                 +VH +++A + L
Sbjct: 665 ILEPPSKFVETEVEVQMACRLLFVDMEGLNDSRAVKTIVPKVNPRKMIIVHATSDAADSL 724

Query: 547 KQHC--LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAE 604
            + C  ++ +   +Y P+  +++ +     ++ + +S++L++++   +  D E+ ++   
Sbjct: 725 IESCGNIQAMTKDIYAPEFGQSVQIGQQTSSFSISISDELLASLRMSRFEDNEVGYITGR 784

Query: 605 VGKTENGML-------------SLLPISTP--------APPHKSVLVGDLKMADLKPFLS 643
           V      +L             + LP+  P        A   +S ++G+LK+  LK  L+
Sbjct: 785 VVMHATTLLPTLEPAAKTAAAATRLPLRAPRVLGSRPAAQLPRSTMIGELKLTALKARLA 844

Query: 644 SKGIQVEFAG-GALRCGEYVTIRK---VGPAGQKGGGSGTQQ--IVIEGPLCEDYYKIRA 697
             G+  E  G G L CG  VT RK     P  +      T +  + +EG + E YY +R 
Sbjct: 845 QVGVHAELVGEGVLICG--VTHRKGDGADPLAESVAVRKTARGNVEMEGNVSETYYAVRK 902

Query: 698 YLYSQFYLL 706
            +Y+   L+
Sbjct: 903 EIYNLHALV 911


>gi|164663111|ref|XP_001732677.1| hypothetical protein MGL_0452 [Malassezia globosa CBS 7966]
 gi|159106580|gb|EDP45463.1| hypothetical protein MGL_0452 [Malassezia globosa CBS 7966]
          Length = 862

 Score =  210 bits (535), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 215/839 (25%), Positives = 358/839 (42%), Gaps = 177/839 (21%)

Query: 19  LSYLVSIDGFNFLIDCGWNDHF----DPSLLQP------------LSKVASTIDAVLLSH 62
           LSYL+ ID    L+DCG  +      D  L Q             L ++  TID VLL+H
Sbjct: 36  LSYLLEIDQCRILLDCGAPEDLTFVDDTQLKQEGSHVWRGTLPDILERIGPTIDVVLLTH 95

Query: 63  PDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLF-------- 114
            +  HLG   YA    GL  PV++T PV  +G L M +   S R   + +L         
Sbjct: 96  AEMSHLGLYAYAYANYGLQCPVYATLPVQTMGRLQMLEIVRSWRAEVDANLTSSKSEANS 155

Query: 115 -------TLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKI-TKDG 166
                  T   +D AF ++  L Y +   L GK  G+V+  + AGH LGGTVWK+ +   
Sbjct: 156 GLKRYIPTEAQVDDAFDAIRPLRYLEPTPLDGKCAGLVLTAYNAGHSLGGTVWKLRSPTV 215

Query: 167 EDVIYAVDYNRRKEKHLNGTVL----------ESFVRPAVLITDAYNALHNQPPRQQRE- 215
             ++ A+D+N  +E+HL+GT L           +  RP VLITD    L     R+ R+ 
Sbjct: 216 GTIVMALDWNHHRERHLDGTALLSVGAAAPLAHAIGRPDVLITDIERGLFTNARRKDRDA 275

Query: 216 MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA---EHSLNYPIYFLTYVSSSTI 272
                I +TL +G +VL+PVDSA R+LE+L++L+ +WA   +H   +P+  +++     +
Sbjct: 276 ALLSQIHRTLTSGHSVLIPVDSAARLLEILVLLDQHWAFSYQHQ-RFPLCLVSHTGQEVV 334

Query: 273 DYVKSFLEWMGDSITKSFETSRDNAFL--------------------LKHVTLLINKSEL 312
           +  ++F+EWM          + + +                         +    +   L
Sbjct: 335 ERARTFMEWMSREWAIQLLDAPEASSRRKTTSSSSSSSAATAKSPLDFSGLRFYSSVEAL 394

Query: 313 DNA--PDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML------ 364
             A  P   K+VLA+  +L  G S  +  E+  D   L++ T RG   +L R L      
Sbjct: 395 HQALTPSQVKVVLATPPALSHGLSRQLLPEFLCDPDALLILTSRGTPSSLVRNLWDRWNA 454

Query: 365 -QADPP---------PKAVKVTMS----RRVPLVGEELIAY-EEEQTRLKKEEALKASLV 409
            QAD           P +V   +S    RRVPL G+EL  Y E ++ R    +A +A + 
Sbjct: 455 KQADRDAWRQGHVGVPVSVGGQLSYELRRRVPLAGDELRTYVERQKAREAAADAPRARIQ 514

Query: 410 KEEES----------KASLGPDNNLSGDPMVIDANNANA--------SADVVEPHGGRYR 451
           + +             +    D+   G P  + +    A        +A   EP G  + 
Sbjct: 515 QPQREADDVDDDDASSSDSSSDDEFDGQPSRLPSTRTIAPERAQMQLNAAAPEPVGMSF- 573

Query: 452 DILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKDEDMD------------- 498
           DI + G V        MFP  E   + D +GE I+   ++ +   ++             
Sbjct: 574 DIFLRGQVSRDAVHYRMFPHIERKRKVDGYGESIDTSRWLARRRRLEAEQEEQLNPERLK 633

Query: 499 --------------------QAAMH---IGGDDGKLDEGSA--SLILDAKPSKVVSNELT 533
                                AA+    +  D   L++G A  +L+   +P ++      
Sbjct: 634 PQKKRTRPVDVPCKYTSDTLNAAVRCHVLYVDLQGLNDGRALTTLVPQLQPRRL------ 687

Query: 534 VLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKL 593
           ++V+G    T  ++    +     +YTP + +T+ V     +Y V+L + LM ++ +  +
Sbjct: 688 IMVNGDEATTLAVRAKLSR--THDLYTPDLGQTVSVGGLSNSYSVRLGDALMGSLRWHPM 745

Query: 594 GDYEIAWVDAEVG-KTENGMLSLLPISTPAPPH-----KSVLVGDLKMADLKPFLSSK-G 646
            DY I  +       +++   +L+P++  A  H      ++ +GDL++  LK +L+ +  
Sbjct: 746 QDYNIVHLHVSPDFASDSDTPTLVPVNDAATVHTAQAPSTLYIGDLRLPALKAYLARQHR 805

Query: 647 IQVEFAG-GALRCGEY----VTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLY 700
           I+ +FAG G L CG+     VT+ K           GT +IV+EG L  +  ++R  +Y
Sbjct: 806 IRADFAGEGVLVCGDRDERNVTVTK----------QGTGRIVVEGSLSTNLARVRQSIY 854


>gi|409049761|gb|EKM59238.1| hypothetical protein PHACADRAFT_249539 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 951

 Score =  209 bits (533), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 157/524 (29%), Positives = 236/524 (45%), Gaps = 121/524 (23%)

Query: 5   VQVTPLSGVFNEN---PLSYLVSIDGFNFLIDCGWND-------------------HFDP 42
           +  TPLSG    +   PL+YL+ +D    L+DCG  D                   H   
Sbjct: 2   ITFTPLSGAARSSRTVPLAYLLQVDDVRILLDCGAPDWCPEDTSSAVKEEDLQETHHHWE 61

Query: 43  SLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTM---- 98
              Q L + A TID VL+SH D  H G  PYA  + GL+AP ++T PV  +  +      
Sbjct: 62  QYCQTLKEYAPTIDLVLMSHGDLQHTGLYPYAYSRWGLTAPAYTTLPVQAMARIAATEDV 121

Query: 99  ------------------------YDQYLSRRQVSEFD-----------LFTLDDIDSAF 123
                                    D++  + Q  E             + T+ ++  AF
Sbjct: 122 EGIQDQEDISDDLAMPEDVEVQDAQDKHDEKSQSPELKSAAPEPRSRKYVATVQEVHDAF 181

Query: 124 QSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKI-TKDGEDVIYAVDYNRRKEKH 182
            SV  L YSQ  HL GK +G+ + P  AGH LGGT+WKI +     ++YAVD N  +E+H
Sbjct: 182 DSVNVLRYSQPCHLQGKCQGLTIIPFNAGHTLGGTIWKIRSPTAGTILYAVDMNHMRERH 241

Query: 183 LNGTVL-----------ESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGN 230
           L+GTVL           E+ VRP +LITDA  A      R+ R+    D ++ TL +  +
Sbjct: 242 LDGTVLMRQGSSNTGIFETLVRPDLLITDAERANVTTARRKDRDAALLDCVTATLTSRNS 301

Query: 231 VLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS- 289
           +LLP D++ RVLELL++L+ +W+   L +PI  L+      + +V+S +EW+G +++K  
Sbjct: 302 LLLPCDASTRVLELLVLLDQHWSYSRLKFPICLLSRAGHEMLTFVRSMMEWLGGTVSKED 361

Query: 290 ----------------------FETSRDNAFLLK--HVTLLINKSELDN--APDGPKLVL 323
                                  +     AF L+  H+ +  N + +    +   PKL+L
Sbjct: 362 VGVEGQDGKHGKDRKRKRVDDDDDNEALGAFALRFPHLEIFPNPAAMMQRYSSKDPKLIL 421

Query: 324 ASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML-------QADPPP------ 370
           A  +SL  G S  +F E+A    N+VL T RG+ GTL R+L       Q D         
Sbjct: 422 AVPSSLSHGPSRALFSEFAEIPDNVVLLTGRGEEGTLGRILFERWDNSQRDDTKWDRGKI 481

Query: 371 -------KAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKAS 407
                    + + +S +VPL G EL  +   +   K+ EA K +
Sbjct: 482 GNNVMMDGTLHLKISSKVPLQGAELEEHLARERAAKEREAAKKA 525



 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 66/310 (21%), Positives = 133/310 (42%), Gaps = 81/310 (26%)

Query: 468 MFPFYENNSEWDDFGEVINPDDYIIKDEDMDQAAMH------------------------ 503
           MFP+ E   + DD+GE+++ + ++ K + +++ A +                        
Sbjct: 650 MFPYVERKRKIDDYGELVDVEMWMRKGKALEENAENEDLKEMKMKTEEEEKPQEPPSKFV 709

Query: 504 ------------IGGDDGKLDEGSA--SLILDAKPSKVVSNELTVLVHGSAEATEHLKQH 549
                       +  D   L++G A  +++    P K++      +VH    AT+HL + 
Sbjct: 710 TTEVEVQLACRLLFVDLEGLNDGRAVKTIVPQVNPRKMI------IVHAPQAATDHLIEA 763

Query: 550 C--LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGK 607
           C  ++ +   +Y P + E++ +     ++ + LS++L++++   +  D E+A+V    G+
Sbjct: 764 CAGIRAMTKDIYAPAVGESVQIGQHTNSFSISLSDELLASLKMSRFEDNEVAYV---TGR 820

Query: 608 TENGMLSLLPI---------------------------STPAPPHKSVLVGDLKMADLKP 640
             +   S +PI                            T A P +S ++G+LK+  LK 
Sbjct: 821 VSSLATSTIPILESVGSSSVGRAVTARHTARGRILGSRPTRALP-QSTMIGELKLTALKA 879

Query: 641 FLSSKGIQVEFAG-GALRCGEYVTIRKVGPAGQKG---GGSGTQQIVIEGPLCEDYYKIR 696
            L++ G+Q E  G G L CG          A Q+      +G  ++ +EG + + YYK+R
Sbjct: 880 RLAAVGVQAELVGEGVLICGAAARRGSAPDALQESVAVKKTGRGKLELEGAVSDVYYKVR 939

Query: 697 AYLYSQFYLL 706
             +Y+   L+
Sbjct: 940 REVYNLHALV 949


>gi|389746898|gb|EIM88077.1| hypothetical protein STEHIDRAFT_94995 [Stereum hirsutum FP-91666
           SS1]
          Length = 968

 Score =  208 bits (530), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 147/451 (32%), Positives = 212/451 (47%), Gaps = 95/451 (21%)

Query: 9   PLSGVFNEN---PLSYLVSIDGFNFLIDCG---WNDHFDPSL---------LQPLSKVAS 53
           PLSG    +   PL+YL+ +D  + L+DCG   W   FD  L          Q L + A 
Sbjct: 6   PLSGAAKSDRLVPLAYLLQVDDVHILLDCGSPDWCPEFDDGLNVSAHWETYCQSLKEAAP 65

Query: 54  TIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD- 112
           TID VLLSH D  H G  PYA  + GL AP +ST PV  +  +   ++  S R   + D 
Sbjct: 66  TIDLVLLSHGDLAHSGLYPYAYARWGLKAPAYSTLPVQAMARIAATEESESIRDEQDVDA 125

Query: 113 ------------------------------------LFTLDDIDSAFQSVTRLTYSQNYH 136
                                               + T  ++  AF S+  L YSQ  H
Sbjct: 126 GYQSDQPQDGEDKVEDSGERVDESGPSSAVQRKAKYVATPSEVQEAFDSINTLRYSQPTH 185

Query: 137 LSGKGEGIVVAPHVAGHLLGGTVWKI-TKDGEDVIYAVDYNRRKEKHLNGTVL------- 188
           L GK +G+ + P  AGH LGGT+WKI +     ++YAV+ N  +E+HL+GTVL       
Sbjct: 186 LQGKCQGVTITPFNAGHTLGGTIWKIRSPSAGTIMYAVNMNHMRERHLDGTVLMRQGGGI 245

Query: 189 -----ESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVL 242
                E   RP +LITDA  A      R+ R+    D I+  L +  ++LLP D++ RVL
Sbjct: 246 APGVFEPLARPDLLITDAARADVLSSRRKDRDASLIDTITAALSSRSSLLLPCDASTRVL 305

Query: 243 ELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS------------- 289
           ELL++L+ +W+   L YPI  L+      + +V+S +EW+G +++K              
Sbjct: 306 ELLVLLDQHWSFARLKYPICLLSRSGREMLTFVRSMMEWLGGTVSKEDVGEEVTSGGRDG 365

Query: 290 --------FETSRDN------AFLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGF 333
                    +   D+      A   KH+   +N   L    +   PKL+LA  ASL  G 
Sbjct: 366 GKRGKKRKKDNDEDDDVIGAFALRFKHLEFFLNPQALQQTYSSKDPKLILAVPASLSHGP 425

Query: 334 SHDIFVEWASDVKNLVLFTERGQFGTLARML 364
           S  +F ++AS   N+VL T RG+ GTL+R+L
Sbjct: 426 SRSLFADFASIPDNVVLLTSRGEEGTLSRVL 456



 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 71/325 (21%), Positives = 130/325 (40%), Gaps = 109/325 (33%)

Query: 468 MFPFYENNSEWDDFGEVIN------------------------------------PDDYI 491
           MFP+ E   + D++GEV++                                    P  ++
Sbjct: 653 MFPYVEKRRKVDEYGEVLDVGMWVRRGKILEEDSNEDAREEKEKEEEAKRAPREPPSKFV 712

Query: 492 IKDEDMDQAAMHIGGDDGKLDEGSAS--LILDAKPSKVVSNELTVLVHGSAEATEHLKQH 549
            +  ++  A   +  D   L++G A+  +I    P K++      +VHGS  ATE L   
Sbjct: 713 SRIVEVQLACRLLFVDLEGLNDGRATKTIIPQVNPRKMI------IVHGSPSATEALIDS 766

Query: 550 C--LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGK 607
           C  ++ +   V+ P + E++ +  +  ++ + LS+ L++++   +  D E+ +V   +  
Sbjct: 767 CSNIRAMTKDVFAPSVGESVQIGQNTSSFSISLSDDLLASMKMSRFEDNEVGYVTGRIAI 826

Query: 608 TENGMLSLL------------------------PIST----PAP---------PHKSVLV 630
           T +  + +L                        P+ T    P P         PH S ++
Sbjct: 827 TASSTVPILQPLSNAPTSPSTTTSTSTSSPSPMPLRTLPDRPRPIGSLPTLRLPH-STMI 885

Query: 631 GDLKMADLKPFLSSKGIQVEFAG-GALRCG--------------EYVTIRKVGPAGQKGG 675
           G+LK+  LK  L+S GIQ E  G G L CG              E V +RKVG       
Sbjct: 886 GELKLTALKSRLASIGIQSELVGEGVLICGTKGGGGLSLGESLGESVAVRKVGRG----- 940

Query: 676 GSGTQQIVIEGPLCEDYYKIRAYLY 700
                ++ +EG + + Y+++R  +Y
Sbjct: 941 -----RVELEGGVSDVYFRVRKEIY 960


>gi|449549925|gb|EMD40890.1| hypothetical protein CERSUDRAFT_111471 [Ceriporiopsis subvermispora
           B]
          Length = 934

 Score =  207 bits (528), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 157/502 (31%), Positives = 227/502 (45%), Gaps = 116/502 (23%)

Query: 5   VQVTPLSGVFNEN---PLSYLVSIDGFNFLIDCGWNDHF--DPSLL-------QP----- 47
           +  TPLSG    +   PL+YL+ +D    L+DCG  D    D S         QP     
Sbjct: 2   ITFTPLSGSARTSSTIPLAYLLQVDDVRILLDCGSPDWCPEDASTSEDAEQKPQPWEKYS 61

Query: 48  --LSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMY------ 99
             L + A T+D VLLSH D  H G  PYA    GL APV++T PV  +G +         
Sbjct: 62  EALKECAPTVDLVLLSHGDLSHSGLYPYAYAHWGLKAPVYTTLPVQAMGRIAATEDVESL 121

Query: 100 ----------------------------------DQYLSRRQVSEFDLFTLDDIDSAFQS 125
                                             D  +SR++ + + + T+ ++  AF S
Sbjct: 122 RDEMQVEEEEEAPSSPTASPEAEAGPSTPPPPASDTSVSRKKKARY-VATIQEVHDAFDS 180

Query: 126 VTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKI-TKDGEDVIYAVDYNRRKEKHLN 184
           +  L YSQ  HL GK +G+ + P  AGH LGGT+WKI +     ++YAVD N  +E HL+
Sbjct: 181 INVLRYSQPCHLQGKCQGLTIIPFNAGHTLGGTIWKIRSPTAGTILYAVDMNHMREHHLD 240

Query: 185 GTVL-----------ESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVL 232
           GTVL           ES  RP + ITDA  A      R+ R     D ++ TL +  ++L
Sbjct: 241 GTVLIRQANAGGGVFESLARPDLFITDAERAHVTTARRKDRVAALLDCVTATLTSRNSLL 300

Query: 233 LPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS--- 289
           LP DS+ RVLELL++L+ +W    L +PI  L+      + +V+S +EW+G +I+K    
Sbjct: 301 LPCDSSTRVLELLVLLDQHWNYSRLKFPICLLSRTGREMLTFVRSMMEWLGGTISKEDVG 360

Query: 290 FETSRDN------------------AFLLKHVTLLINKSELDN--APDGPKLVLASMASL 329
            + S +N                  A   +H+    N   L    +   PKL+LA  A+L
Sbjct: 361 EDGSSNNKKRRRADDDADDEALGAFALRFRHLEFFPNPQALMQTYSSKDPKLILAVPATL 420

Query: 330 EAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML-------QADPPP------------ 370
             G S  +F ++A    N+VL T R + GTL R+L       Q D               
Sbjct: 421 SHGPSRALFTQFAEMPDNVVLLTGRSEEGTLGRILFDRWNAAQRDEAKWDRGKIGSNVMM 480

Query: 371 -KAVKVTMSRRVPLVGEELIAY 391
              +++ M+ +VPL G EL  Y
Sbjct: 481 DGTLRLKMNSKVPLQGAELEVY 502



 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 69/335 (20%), Positives = 133/335 (39%), Gaps = 89/335 (26%)

Query: 452 DILIDGFVPPSTSVAP---------MFPFYENNSEWDDFGEVINPDDYIIKDEDMDQAAM 502
           DI + G V  +TS            MFP+ E     D++GEV++   ++ K + +++ A 
Sbjct: 607 DIYLKGNVAKTTSFFKSEGQAQRYRMFPYMEKKRRVDEYGEVLDVGMWLRKGKVLEEDAE 666

Query: 503 HIGGDDGKLDEGSASLILDAKP-SKVVSNELTV--------------------------- 534
                + +  E        A+P SK ++ E+ V                           
Sbjct: 667 SEETKEARRREEEDVKKAPAEPPSKFITTEVEVQLACRLLFVDMEGLNDGRAVKTIVPQV 726

Query: 535 ------LVHGSAEATEHLKQHC--LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMS 586
                 +VH   E T+ L + C  ++ +   +Y PQ  E + +     ++ + LS++L++
Sbjct: 727 NPRKMIVVHAPPEGTDVLMESCANIRAMTRDIYAPQQGEMVQIGQHTNSFSISLSDELLA 786

Query: 587 NVLFKKLGDYEIAWVDAEVGKTENGMLSLL-PI--------------------STP-APP 624
           ++   +  D E+ +V   +    +  + +L P+                    S P A  
Sbjct: 787 SIKMSRFEDNEVGYVTGRIASLASSTIPVLEPVSSSSLPSTQSRKALRGRNLGSRPTATL 846

Query: 625 HKSVLVGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKVGPAGQKGGGSGT---- 679
            +S ++G+LK+  LK  L++ G+  E  G G L CG          A +KG  S +    
Sbjct: 847 PQSTMIGELKLTALKARLAAVGVHAELIGEGVLICGA---------AAKKGSTSDSLEDS 897

Query: 680 --------QQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                    ++ +EG + + YY +R  +Y+   L+
Sbjct: 898 VAVKKTARGRVELEGSVSDVYYTVRREIYNMHALV 932


>gi|432115811|gb|ELK36959.1| Cleavage and polyadenylation specificity factor subunit 2 [Myotis
           davidii]
          Length = 687

 Score =  204 bits (519), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 158/564 (28%), Positives = 256/564 (45%), Gaps = 146/564 (25%)

Query: 221 ISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKS 277
           + +TLR  GNVL+ VD+AGRVLEL  +L+  W        +Y    L  VS + +++ KS
Sbjct: 141 VLETLRGDGNVLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKS 200

Query: 278 FLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
            +EWM D + + FE  R+N F  +H++L    S+L   P  PK+VLAS   LE GFS D+
Sbjct: 201 QVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDL 259

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADPPP----------KAVKVTMSRRVPLVGEE 387
           F++W  D KN ++ T R   GTLAR L  +P P          K  ++ + +RV L G+E
Sbjct: 260 FIQWCEDPKNSIILTYRTTPGTLARFLIDNPLPHPSPSLHFAEKVTEIELRKRVKLEGKE 319

Query: 388 LIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHG 447
           L  Y      L++E+  K +  K E+SK +            +  ++ ++   D+ +P  
Sbjct: 320 LEEY------LEREKLKKEAAKKLEQSKEA-----------DIDSSDESDVEEDIDQPSA 362

Query: 448 GRYR-DILIDG-------FVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIK------ 493
            + + D+++ G       F   +    PMFP  E   +WD++GE+I P+D+++       
Sbjct: 363 HKTKHDLMMKGEGSRKGSFFKQAKKSYPMFPAPEERIKWDEYGEIIKPEDFLVPELQATE 422

Query: 494 -------------DEDMDQ-------------AAMHIGGD------DGKLDEGSASLILD 521
                        DE MDQ              ++ I         +G+ D  S   I++
Sbjct: 423 EEKSKLESGLTNGDEPMDQDLSDVPTKCISMTESIEIKARVTYIDYEGRSDGDSIKKIIN 482

Query: 522 A-KPSKVVSNELTVLVHGSAEATEHLKQHCL----KHVCPHVYTPQIEETIDVTSDLCAY 576
             KP ++      ++VHG  EA++ L + C     K +   VY P++ ET+D TS+   Y
Sbjct: 483 QMKPRQL------IIVHGPPEASQDLAECCRAFGGKDI--KVYMPKLHETVDATSETHIY 534

Query: 577 KVQLSEKLMSNVLFKKLGDYEIAWVDA----EVGKTENGML------------------- 613
           +V+L + L+S++ F K  D E+AW+D      V K + G++                   
Sbjct: 535 QVRLKDSLVSSLQFCKAKDAELAWIDGVLDMRVSKVDTGVILEEGELKDDGEDSEMQVDP 594

Query: 614 ----------------------------SLLPISTPAPP-----HKSVLVGDLKMADLKP 640
                                        ++P   P PP     H+SV + + ++ D K 
Sbjct: 595 PSDSSVIAQQKAMKSLFGDDEKETGEESEIIPTLEPLPPNEVPGHQSVFMNEPRLFDFKQ 654

Query: 641 FLSSKGIQVEFAGGALRCGEYVTI 664
            L  + IQ EF GG L C   +++
Sbjct: 655 VLLREWIQAEFVGGVLVCNNQISV 678



 Score =  158 bits (400), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 80/185 (43%), Positives = 119/185 (64%), Gaps = 13/185 (7%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSG------KGEG-IVVAPHVAGHLLG-----GTVWKITKDGED 168
           +AF  + +L +SQ  +L        +G+G +++A   AG +L        +W+ TKD   
Sbjct: 121 AAFDKIQQLKFSQIVNLKANVLETLRGDGNVLIAVDTAGRVLELAQLLDQIWR-TKDAGL 179

Query: 169 VIYAV 173
            +Y++
Sbjct: 180 GVYSL 184


>gi|336373839|gb|EGO02177.1| hypothetical protein SERLA73DRAFT_86401 [Serpula lacrymans var.
           lacrymans S7.3]
 gi|336386654|gb|EGO27800.1| hypothetical protein SERLADRAFT_447017 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 930

 Score =  204 bits (518), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 159/520 (30%), Positives = 241/520 (46%), Gaps = 112/520 (21%)

Query: 5   VQVTPLSGVFNEN---PLSYLVSIDGFNFLIDCG---WNDHFDPSLL------------- 45
           +  TPLSG    +   PL+YL+ +D    L+DCG   W+     S +             
Sbjct: 2   ITFTPLSGAARSSRTVPLAYLLQVDDVRILLDCGSPDWSPEPSSSAVKSEDLRQHSYHWE 61

Query: 46  ---QPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQY 102
              Q L + + T+D VLLSH D  H G   YA  + GL AP +ST PV   G +   +  
Sbjct: 62  EYCQALRECSPTVDLVLLSHGDLAHTGLYAYAYSRWGLKAPAYSTLPVQATGRIATNEDV 121

Query: 103 LSRRQVSEFD----------------------------------LFTLDDIDSAFQSVTR 128
              R+  + D                                  + T+ ++  A+ ++  
Sbjct: 122 EGIREEQDVDTDSENQHHNSALEGTESGSQKSPESQPKKTSGKYIATVLEVHDAYDAMNT 181

Query: 129 LTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKI-TKDGEDVIYAVDYNRRKEKHLNGTV 187
           L YSQ  HL GK +GI + P+ AGH LGGT+WKI +     ++YAVD N  +E+HL+GTV
Sbjct: 182 LRYSQPTHLQGKCQGITITPYNAGHSLGGTIWKIRSPSAGTILYAVDINHMRERHLDGTV 241

Query: 188 L---------ESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDS 237
           L         E+  RP +LITDA  A      R+ R+    D IS TL +  ++LLP DS
Sbjct: 242 LVRPASGGIVEALARPDLLITDAERANVTTSRRKDRDAALIDTISATLSSRSSLLLPCDS 301

Query: 238 AGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITK--------- 288
           + RVLELL++L+ +W      YPI  L+      + +V+S +EW+G +++K         
Sbjct: 302 STRVLELLVLLDQHWKFADFRYPICLLSRNGREMLTFVRSMMEWLGGTVSKEDVGVDGSG 361

Query: 289 ---SFETSRDN----------AFLLKHVTLLINKSEL--DNAPDGPKLVLASMASLEAGF 333
                +  RD+          A   KH+    N   L    +   PKL+LA  ASL  G 
Sbjct: 362 KSGGNKRRRDDEGEDEALGAFALRFKHLEFFPNPQALLQTYSSKDPKLILAVPASLSHGP 421

Query: 334 SHDIFVEWASDVKNLVLFTERGQFGTLARML--------QADPP------------PKAV 373
           S  +F ++A    N+VL T RG+ GTL R+L        +AD                 +
Sbjct: 422 SRLLFSDFAVVPDNVVLLTSRGEEGTLGRILFDKWNDSQRADDKWDKGKIGSNIMMDGTM 481

Query: 374 KVTMSRRVPLVGEELIAYEEEQTRLKKEEAL-KASLVKEE 412
           K+ ++ ++PL G EL  Y  ++   K++EA+ +A+L + +
Sbjct: 482 KLKINSKIPLQGAELEEYLAKERVAKEKEAVQQAALARNQ 521



 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 79/344 (22%), Positives = 132/344 (38%), Gaps = 103/344 (29%)

Query: 452 DILIDGFVPPSTSVAP----------MFPFYENNSEWDDFGEVINPDDYIIKDEDMDQAA 501
           DI I G V  STS             MFP+ E     D++GE I+   ++ K + +++ A
Sbjct: 599 DIYIKGNVSKSTSFFKTVGGQPQRFRMFPYVEKKRRVDEYGETIDVGMWLRKGKVLEEDA 658

Query: 502 M--HIGGDDGKLDEGSASLILDAKPSKVVSNEL--------------------------- 532
               +     K  E  A  I+   PSK V++++                           
Sbjct: 659 ESDELKEAKRKQAEEEAKKIVREPPSKFVTSDVEIQLACRLLFVDMEGLNDGRAVKTIVP 718

Query: 533 ------TVLVHGSAEATEHLKQHC--LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKL 584
                  ++VH    AT  L   C  ++ +   +Y P   ETI +      + + LS++L
Sbjct: 719 QVNPRKMIIVHAPDSATSALIDSCANIRAMTKDIYAPSTGETIRLGQQTNTFSILLSDEL 778

Query: 585 MSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAP--------------------- 623
           ++ +   +  D E+ +V    G+  + + S +P+  PA                      
Sbjct: 779 LNTLKMSRFEDNEVGYV---TGRVASHVSSTIPVLEPAISSALPSDSSDRKLFLRGRQLG 835

Query: 624 -------PHKSVLVGDLKMADLKPFLSSKGIQVEFAG-GALRCG-------------EYV 662
                  PH S ++G+LK+  LK  L+S GIQ E  G G L CG             E V
Sbjct: 836 SRPTQTLPH-STMIGELKLTALKTRLASVGIQAELIGEGVLICGAGAKRNQPSDTLEETV 894

Query: 663 TIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
           ++RK              ++ +EG + + YY +R  +YS   L+
Sbjct: 895 SVRKTARG----------RVELEGNVSDVYYTVRKEIYSLHALV 928


>gi|403298151|ref|XP_003939898.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2 isoform 2 [Saimiri boliviensis boliviensis]
          Length = 648

 Score =  201 bits (512), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 155/572 (27%), Positives = 255/572 (44%), Gaps = 146/572 (25%)

Query: 245 LLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLK 301
           L  L+D W        +Y    L  VS + +++ KS +EWM D + + FE  R+N F  +
Sbjct: 113 LFTLDDIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFR 172

Query: 302 HVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLA 361
           H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R   GTLA
Sbjct: 173 HLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLA 231

Query: 362 RMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLGPD 421
           R L  +P  K  ++ + +RV L G+EL  Y E++   K+                     
Sbjct: 232 RFLIDNPSEKITEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------------- 277

Query: 422 NNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAPMFPFYE 473
              S +  +  ++ ++   D+ +P   + + D+++ G       F   +    PMFP  E
Sbjct: 278 ---SKEADIDSSDESDIEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYPMFPAPE 334

Query: 474 NNSEWDDFGEVINPDDYIIK-------------------DEDMDQ-------------AA 501
              +WD++GE+I P+D+++                    DE MDQ              +
Sbjct: 335 ERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKCISTTES 394

Query: 502 MHIGGD------DGKLDEGSASLILDA-KPSKVVSNELTVLVHGSAEATEHLKQHCL--- 551
           + I         +G+ D  S   I++  KP ++      ++VHG  EA++ L + C    
Sbjct: 395 IEIKARVTYIDYEGRSDGDSIKKIINQMKPRQL------IIVHGPPEASQDLAECCRAFG 448

Query: 552 -KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVG 606
            K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D      V 
Sbjct: 449 GKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDMRVS 506

Query: 607 KTENGML-----------------------------------------------SLLPIS 619
           K + G++                                                ++P  
Sbjct: 507 KVDTGVILEEGELKDDGEDSEMQVDAPSDASVIAQQKAMKSLFGDDEKETGEESEIIPTL 566

Query: 620 TPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKG 674
            P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+        
Sbjct: 567 EPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR-------- 618

Query: 675 GGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
             + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 619 --TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 648



 Score =  139 bits (349), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 63/119 (52%), Positives = 85/119 (71%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDI 119
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDDI
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDI 119


>gi|396500483|ref|XP_003845730.1| similar to cleavage and polyadenylation specificity factor subunit
           2 [Leptosphaeria maculans JN3]
 gi|312222311|emb|CBY02251.1| similar to cleavage and polyadenylation specificity factor subunit
           2 [Leptosphaeria maculans JN3]
          Length = 954

 Score =  200 bits (508), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 181/623 (29%), Positives = 260/623 (41%), Gaps = 122/623 (19%)

Query: 8   TPLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDT 65
           TPL G    +P S  L+  DG    L+D GW++ FD   L+ + K   TI  +LL+H  T
Sbjct: 5   TPLLGALTSSPASQSLLEFDGGIQILVDIGWDESFDVEKLKEIEKHVPTISLILLTHATT 64

Query: 66  LHLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLSRRQVSE------------- 110
            HLGA  +  K   L    PV++T+PV  LG   + D Y S    S              
Sbjct: 65  AHLGAYVHCCKNFPLFTRIPVYATKPVISLGRTLLQDLYASSPLASSIIPNQTLNESAYT 124

Query: 111 --------------FDLFTLDDIDSAFQSVTRLTYSQNYH-----LSGKGEGIVVAPHVA 151
                             T ++I   F  +  L YSQ +       S    G+ +  + A
Sbjct: 125 FSTGLIAGHDPNILLQAPTPEEIGEYFARINPLRYSQPHEPLLAPHSPPPNGLTITAYSA 184

Query: 152 GHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLIT 199
           GH LGG++W I    E V+YAVD+N+  E  L+G             VL+   RP  LI 
Sbjct: 185 GHTLGGSIWHIQHGMESVVYAVDWNQATEHVLSGAAWLGGPGAGGSEVLKQLRRPTALIC 244

Query: 200 DAYNA---LHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS 256
            +         +PP ++ E     + +T+  GG+VL+P DS+ R+LEL  +LE+ W   S
Sbjct: 245 SSKGTELVKVARPPSKRDEALLALVRETVANGGSVLIPSDSSARILELAYLLEETWQRDS 304

Query: 257 LN---------YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS------RDNA---- 297
           +N           +Y  +    +T+ Y +S LEWM + I K FE +      +D++    
Sbjct: 305 INSDGDSPLKSAKVYLASRTGGATMRYARSMLEWMEEGIVKEFEVASGANNGKDDSKAAR 364

Query: 298 --FLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
             F  KH+TLL  K+ +    A  GP+++LAS  +LE GFS D     ASD KNL+L TE
Sbjct: 365 VPFDFKHITLLERKTRVARMLATSGPRVILASDTTLEWGFSKDAIKSLASDEKNLILLTE 424

Query: 354 RG-----QFGTLARML----------QADPPPKAVKVTMS---------RRVPLVGEELI 389
           R      Q  +L R L           +   P A  V  S         R V L G EL 
Sbjct: 425 RAGEPSSQKKSLGRYLWDLWHERSAASSHEAPSATVVDASGDNAPVCNIRAVSLEGNELS 484

Query: 390 AYE-------EEQTRLKKEEA----LKASLVKEEESKASLGPDNNLSGDPMVIDANNANA 438
            Y+       + Q  +  E A    +   +V +  S  S   D   SGD     A NA  
Sbjct: 485 LYQQYLASQRQRQNTMGGESAVMLEMPTDVVDDRSSTESESSDG--SGDGYRGKALNATV 542

Query: 439 SADVVEPHGGR-----------YRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINP 487
           +        G             R  + D  V        +FPF       DDFGE+I  
Sbjct: 543 ALQHARNKLGLTDAELGVKVLVQRKNIYDFEVQGKKGKDKVFPFQRKKKRADDFGELIRA 602

Query: 488 DDYIIKDEDMDQAAMHIGGDDGK 510
           +D+   +E+ + A   + G+  K
Sbjct: 603 EDFARVEEEDNVAGEALRGEGTK 625


>gi|224161209|ref|XP_002338303.1| predicted protein [Populus trichocarpa]
 gi|222871828|gb|EEF08959.1| predicted protein [Populus trichocarpa]
          Length = 106

 Score =  199 bits (506), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 95/106 (89%), Positives = 100/106 (94%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQVTPLSGV+NENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVAS IDAVLL
Sbjct: 1   MGTSVQVTPLSGVYNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASKIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRR 106
           S+ D LHLGALP+AMKQ GL+APVFSTEPVYRLGLLTMYDQ  SR+
Sbjct: 61  SYGDMLHLGALPFAMKQFGLNAPVFSTEPVYRLGLLTMYDQSFSRK 106


>gi|193786016|dbj|BAG50992.1| unnamed protein product [Homo sapiens]
          Length = 644

 Score =  199 bits (505), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 151/555 (27%), Positives = 250/555 (45%), Gaps = 143/555 (25%)

Query: 259 YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDG 318
           Y +  L  VS + +++ KS +EWM D + + FE  R+N F  +H++L    S+L   P  
Sbjct: 126 YSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVP-S 184

Query: 319 PKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMS 378
           PK+VLAS   LE GFS D+F++W  D KN ++ T R   GTLAR L  +P  K  ++ + 
Sbjct: 185 PKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELR 244

Query: 379 RRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANA 438
           +RV L G+EL  Y E++   K+                        S +  +  ++ ++ 
Sbjct: 245 KRVKLEGKELEEYLEKEKLKKEAAKKLEQ-----------------SKEADIDSSDESDI 287

Query: 439 SADVVEPHGGRYR-DILIDG-------FVPPSTSVAPMFPFYENNSEWDDFGEVINPDDY 490
             D+ +P   + + D+++ G       F   +    PMFP  E   +WD++GE+I P+D+
Sbjct: 288 EEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYPMFPAPEERIKWDEYGEIIKPEDF 347

Query: 491 IIK-------------------DEDMDQ-------------AAMHIGGD------DGKLD 512
           ++                    DE MDQ              ++ I         +G+ D
Sbjct: 348 LVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTKCISTTESIEIKARVTYIDYEGRSD 407

Query: 513 EGSASLILDA-KPSKVVSNELTVLVHGSAEATEHLKQHCL----KHVCPHVYTPQIEETI 567
             S   I++  KP ++      ++VHG  EA++ L + C     K +   VY P++ ET+
Sbjct: 408 GDSIKKIINQMKPRQL------IIVHGPPEASQDLAECCRAFGGKDI--KVYMPKLHETV 459

Query: 568 DVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVGKTENGML---------- 613
           D TS+   Y+V+L + L+S++ F K  D E+AW+D      V K + G++          
Sbjct: 460 DATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDMRVSKVDTGVILEEGELKDDG 519

Query: 614 -------------------------------------SLLPISTPAPP-----HKSVLVG 631
                                                 ++P   P PP     H+SV + 
Sbjct: 520 EDSEMQVEAPSDSSVIAQQKAMKSLFGDDEKETGEESEIIPTLEPLPPHEVPGHQSVFMN 579

Query: 632 DLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCED 691
           + +++D K  L  +GIQ EF GG L C   V +R+          + T +I +EG LC+D
Sbjct: 580 EPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR----------TETGRIGLEGCLCQD 629

Query: 692 YYKIRAYLYSQFYLL 706
           +Y+IR  LY Q+ ++
Sbjct: 630 FYRIRDLLYEQYAIV 644



 Score =  141 bits (355), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 63/123 (51%), Positives = 87/123 (70%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAF 123
           +  
Sbjct: 121 AGL 123


>gi|406604299|emb|CCH44271.1| Cleavage and polyadenylation specificity factor subunit
           [Wickerhamomyces ciferrii]
          Length = 795

 Score =  198 bits (503), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 161/524 (30%), Positives = 249/524 (47%), Gaps = 56/524 (10%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPY-AMKQLGL 80
           L+  DG   L D GW+   D S L    K+  TID ++LSHP T  +G   Y A + L +
Sbjct: 18  LLEFDGVRVLADPGWDGITDISYL---DKILPTIDIIVLSHPTTNFIGCYAYLAFRDLNI 74

Query: 81  SAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD--LFTLDDIDSAFQSVTRLTYSQNYHLS 138
             PV++T P   LG +   D Y S   +       F L D++ AF  +  + +SQ   L 
Sbjct: 75  --PVYATLPTTNLGRVATLDLYRSVGLIGPLKNTEFELKDVEEAFDKIITVKHSQTIDLR 132

Query: 139 GKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVL---ESFVRPA 195
           GK +G+ +    AGH LGGT+W   K+ E +IYA  +N  K+  LNG  L    + +RP+
Sbjct: 133 GKYDGLSITAINAGHTLGGTIWAFNKNPEKIIYAPQWNHSKDSFLNGADLLQNSTLMRPS 192

Query: 196 VLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           V+IT +  A+ +  P ++R E F + +  TL  GG VLLP    GR+LEL+ +++++   
Sbjct: 193 VIITSS--AIGSVLPHKKRVEKFFELVDATLGRGGTVLLPTSIGGRMLELVHLIDEHL-- 248

Query: 255 HSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDN 314
            S   P+  L+Y  +  + Y  S LEWM  ++ + +ET     F    V  +I  +EL N
Sbjct: 249 QSAPIPVLMLSYTKARNLTYAGSMLEWMAPAVIREWETRGQPPFDSSRVQ-VIEPNELLN 307

Query: 315 APDGPKLVLASMASLEAG-FSHDIFVEWASDVKNLVLFTER--------GQFGTLARMLQ 365
            P G K+V AS A  E G  +         D K  ++ TE+          F T   + Q
Sbjct: 308 MP-GAKVVFASGAGFEDGSVAQAALTTLCDDEKTTIILTEKTVENTIGNDLFYTWRSLAQ 366

Query: 366 ADPP------------PKAVKVTMSRRVPLVGEELIAYEE--EQTRLKKEEALKASLVKE 411
           A+ P             K + V   R   L+G+ELI YE   +Q RL KE+  K  L ++
Sbjct: 367 ANSPDGKAQDGVPVVLQKQLNVKPIREEELLGDELINYENHVKQRRLLKEQTKKNKLSEK 426

Query: 412 EESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPF 471
           +E++                D + + +  + +     +   I ID  V  S   A MF F
Sbjct: 427 KETQ--------------FEDESESESEDEDILGEEKKIETIPIDVDVRSSKGRAKMFQF 472

Query: 472 YENNSEWDDFGEVINPDDYIIKDE-DMDQAAMHIGGDDGKLDEG 514
               +++DD+GE+IN  D+  ++E D+ +   H    + K+  G
Sbjct: 473 VPRKAKFDDYGEIINHSDFTREEEKDVGKMKRHKQNQNNKVQIG 516


>gi|281344001|gb|EFB19585.1| hypothetical protein PANDA_019064 [Ailuropoda melanoleuca]
          Length = 237

 Score =  197 bits (501), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 88/172 (51%), Positives = 123/172 (71%), Gaps = 1/172 (0%)

Query: 10  LSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLG 69
           L     E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLLSHPD LHLG
Sbjct: 65  LDSTREESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLLSHPDPLHLG 124

Query: 70  ALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRL 129
           ALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D+AF  + +L
Sbjct: 125 ALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVDAAFDKIQQL 184

Query: 130 TYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRKE 180
            +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++E
Sbjct: 185 KFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKRE 236


>gi|300121266|emb|CBK21646.2| unnamed protein product [Blastocystis hominis]
          Length = 400

 Score =  195 bits (496), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 111/362 (30%), Positives = 195/362 (53%), Gaps = 14/362 (3%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M ++ + TPL G  N+ P+  ++ ID  + ++DCGW++  +  +L P+      ++AVL+
Sbjct: 1   MPSTFKFTPLYGAENDGPVCSILQIDSIHIMLDCGWDERLETDMLSPIKDYIPLLNAVLI 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SH D LHLGALPY   +   + P+F  +  + L    M D   +R    E  +F  DDI 
Sbjct: 61  SHADFLHLGALPYVYSRWDCNVPIFINKDAFLLARFCMEDVMENRLLGEEDCIFGKDDIS 120

Query: 121 SAFQSVTRLTYSQNYH-LSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRK 179
              +    + Y+Q    +S  G+ + +    AGH++GG++W I  + + ++Y+++ N + 
Sbjct: 121 KVCECFRTVVYNQQERIMSETGDVVYINAREAGHMIGGSIWDIITETDHLVYSMNINPQP 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNAL------HNQPPRQQREMFQDAISKTLR-AGGNVL 232
           + HL G   +     ++LITDA   +      ++Q  + +   F   I+ TLR   G+VL
Sbjct: 181 DNHLRGASSDVSGNISLLITDACEHMTEKSRYNSQLEKAKFGHFSYLITDTLRDKHGSVL 240

Query: 233 LPVDSAGRVLELLLILEDYWAEHSL-NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE 291
           +PVDS GR LE++L+LE  W E +L NY + FL+  SS T++Y++     + + I +   
Sbjct: 241 IPVDSVGRCLEVILLLERVWKESNLENYKVLFLSSRSSQTVNYIQGIASNLNERILQQSA 300

Query: 292 TSRDNAFLLKHVTLLINKSELDNA--PDGPKLVLASMASLEAGFSHDIFVEWASDVKNLV 349
            +   AF L+ VT +   S ++N       K+V+A++  LE  F+  +  +W +  +NL+
Sbjct: 301 EAERKAFDLQFVTCV---SIVENVLESQASKVVIATLPGLETSFAQTLLKKWCTRSENLL 357

Query: 350 LF 351
           LF
Sbjct: 358 LF 359


>gi|169599735|ref|XP_001793290.1| hypothetical protein SNOG_02691 [Phaeosphaeria nodorum SN15]
 gi|160705309|gb|EAT89422.2| hypothetical protein SNOG_02691 [Phaeosphaeria nodorum SN15]
          Length = 957

 Score =  192 bits (488), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 173/617 (28%), Positives = 262/617 (42%), Gaps = 151/617 (24%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G   LID GW++ FD + L+ + +  ST+  VLL+H  T HLGA  +  K   L    PV
Sbjct: 26  GIKILIDVGWDESFDVAKLKEIERHVSTLSFVLLTHATTAHLGAYVHCCKNFPLFSRVPV 85

Query: 85  FSTEPVYRLGLLTMYDQYLSRRQVSE---------------------------FDLFTLD 117
           ++T PV  LG   + D Y S    S                                T +
Sbjct: 86  YATVPVISLGRTLLQDLYASTPLASSILPTDALTESAYSFPSALKGGKNPNILLQAPTQE 145

Query: 118 DIDSAFQSVTRLTYSQNYH-----LSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
           +I + F ++T L YSQ +       S    G+ +  + AGH LGG++W I    E V+YA
Sbjct: 146 EIANYFGAITPLRYSQPHQPIPSSFSPPLNGLTITAYSAGHTLGGSIWHIQHGMESVVYA 205

Query: 173 VDYNRRKEKHLNGT-----------VLESFVRPAVLITDAYNA----LHNQPPRQQREMF 217
           VD+N+ +E  L+G            VLE   RP  +I  + N+    +   P ++  E+ 
Sbjct: 206 VDWNQAREHVLSGAAWLGTGTGGSEVLEQLRRPTAMICSSKNSGLVKVAKAPSKRDEELL 265

Query: 218 QDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW----AEHSLNYPI-----YFLTYVS 268
              I  T+  GG+VL+P DS+ R+LE+  +LE  W    A    N P+     Y  +   
Sbjct: 266 S-MIRDTVAKGGSVLIPCDSSARILEIAYLLEKSWHSETARSENNSPLKNAKAYLASRTG 324

Query: 269 SSTIDYVKSFLEWMGDSITKSFETS-----------------RDNA------FLLKHVTL 305
            +T+ YV+S LEWMG+ I K FE +                 RD+       F  +H+TL
Sbjct: 325 GATMRYVRSMLEWMGEGIVKEFEAASGAAEGQGQRNVRGAPGRDDGRGIRTPFDFQHITL 384

Query: 306 LINKSELD---NAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER-GQFGT-- 359
           L  K+ +    NA + P+++LAS  SLE GFS D     ASD KNLV+ TER G+ GT  
Sbjct: 385 LEKKARVTRMLNATE-PRVILASDTSLEWGFSKDAIRSLASDEKNLVILTERVGELGTQE 443

Query: 360 --LARML-------------------QADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRL 398
             L R L                     D   +   ++  R V L G+E+  Y   Q  L
Sbjct: 444 KGLGRYLWDLWNERSVNSGDDSLDSTMVDVSGQQASISTVRTVALEGDEVPLY---QQFL 500

Query: 399 KKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGR--------- 449
            ++  L  ++    +   +L    ++  D     + ++  SAD    HGG+         
Sbjct: 501 ARQRQLHNTMTG--DGGTTLETSADVVDDRSSTTSESSEESAD---GHGGKILNTTAALQ 555

Query: 450 -------------------YRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDY 490
                               R  + D  V        +FP  +   + DDFG++I P+++
Sbjct: 556 HARNKLGLTDAELGVNILIRRKNVYDYEVRGKKGKEKLFPHQQKRRKQDDFGDLIRPEEF 615

Query: 491 IIKDEDMDQAAMHIGGD 507
              DE+ +     +GGD
Sbjct: 616 ARADEEDN-----VGGD 627



 Score = 46.6 bits (109), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 50/209 (23%), Positives = 80/209 (38%), Gaps = 61/209 (29%)

Query: 523 KPSKVVSNELTVLVHGSAEATEHLKQHCLKHV--------CPHVYTPQIEETIDVTSDLC 574
           KP K++      L+ G    T  L + C   +           V+TP I   +D + D  
Sbjct: 732 KPRKLI------LIGGEEAETMELAEICRTALNVGLEASAAIDVFTPTIGIVVDASVDTN 785

Query: 575 AYKVQLSEKLMSNVLFKKL---------GDYEIAWVDAEVGKTENGMLSLLPISTPAPPH 625
           A+ V+LS  ++ N+ ++ +         G    A +DA   + E        +  PA P 
Sbjct: 786 AWTVKLSRTMVRNLHWQNVRGMGVVAITGRLAAATLDAPPKEEEGSAKKKARLDAPAVPV 845

Query: 626 KSVL---------------------------VGDLKMADLKPFLSSKGIQVEFAG-GALR 657
            S+L                           VGDL++ADL+  + S G++ EF G G L 
Sbjct: 846 SSLLESSSTPILDVVPANMATAVRSVAQPFHVGDLRLADLRKLMKSNGMEAEFRGEGVLV 905

Query: 658 CGEYVTIRKVGPAGQKGGGSGTQQIVIEG 686
               V +RK          + T QI ++G
Sbjct: 906 INGTVAVRK----------TATGQIEVDG 924


>gi|10241720|emb|CAC09445.1| hypothetical protein [Homo sapiens]
          Length = 504

 Score =  192 bits (487), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 147/540 (27%), Positives = 242/540 (44%), Gaps = 143/540 (26%)

Query: 274 YVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGF 333
           + KS +EWM D + + FE  R+N F  +H++L    S+L   P  PK+VLAS   LE GF
Sbjct: 1   FSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGF 59

Query: 334 SHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEE 393
           S D+F++W  D KN ++ T R   GTLAR L  +P  K  ++ + +RV L G+EL  Y E
Sbjct: 60  SRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLE 119

Query: 394 EQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-D 452
           ++   K+                        S +  +  ++ ++   D+ +P   + + D
Sbjct: 120 KEKLKKEAAKKLEQ-----------------SKEADIDSSDESDIEEDIDQPSAHKTKHD 162

Query: 453 ILIDG-------FVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIK------------ 493
           +++ G       F   +    PMFP  E   +WD++GE+I P+D+++             
Sbjct: 163 LMMKGEGSRKGSFFKQAKKSYPMFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKL 222

Query: 494 -------DEDMDQ-------------AAMHIGGD------DGKLDEGSASLILDA-KPSK 526
                  DE MDQ              ++ I         +G+ D  S   I++  KP +
Sbjct: 223 ESGLTNGDEPMDQDLSDVPTKCISTTESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQ 282

Query: 527 VVSNELTVLVHGSAEATEHLKQHCL----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSE 582
           +      ++VHG  EA++ L + C     K +   VY P++ ET+D TS+   Y+V+L +
Sbjct: 283 L------IIVHGPPEASQDLAECCRAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKD 334

Query: 583 KLMSNVLFKKLGDYEIAWVDA----EVGKTENGML------------------------- 613
            L+S++ F K  D E+AW+D      V K + G++                         
Sbjct: 335 SLVSSLQFCKAKDAELAWIDGVLDMRVSKVDTGVILEEGELKDDGEDSEMQVEAPSDSSV 394

Query: 614 ----------------------SLLPISTPAPP-----HKSVLVGDLKMADLKPFLSSKG 646
                                  ++P   P PP     H+SV + + +++D K  L  +G
Sbjct: 395 IAQQKAMKSLFGDDEKETGEESEIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREG 454

Query: 647 IQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
           IQ EF GG L C   V +R+          + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 455 IQAEFVGGVLVCNNQVAVRR----------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 504


>gi|47224568|emb|CAG03552.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 206

 Score =  191 bits (486), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 88/205 (42%), Positives = 133/205 (64%), Gaps = 25/205 (12%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T +SGV  E+ L YL+ +D F FL+DCGW+++F   ++  + +    +DAVLL
Sbjct: 1   MTSIIKLTAVSGVQEESALCYLLQVDEFRFLLDCGWDENFSMDIIDAMKRYVHQVDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD +HLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPIHLGALPYAVGKLGLNCTIYATIPVYKMGQMFMYDLYQSRNNSEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQ------------------------NYHLSGKGEGIVVAPHVAGHLLG 156
           SAF  + +L YSQ                         ++ +GKG G+ + P  AGH++G
Sbjct: 121 SAFDKIQQLKYSQIVSLKGKLACKRLFTWSKLPKYVMAFYATGKGHGLSITPLPAGHMIG 180

Query: 157 GTVWKITKDG-EDVIYAVDYNRRKE 180
           GT+WKI KDG E+++YAVD+N ++E
Sbjct: 181 GTIWKIVKDGEEEIVYAVDFNHKRE 205


>gi|393215649|gb|EJD01140.1| cleavage and polyadenylation specificity factor subunit
           [Fomitiporia mediterranea MF3/22]
          Length = 922

 Score =  191 bits (486), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 154/489 (31%), Positives = 228/489 (46%), Gaps = 103/489 (21%)

Query: 5   VQVTPLSG---VFNENPLSYLVSIDGFNFLIDCG---W---------NDHFDP------S 43
           +  TPLSG   +    PLSYL+ +D    L+DCG   W          D  D       S
Sbjct: 2   ITFTPLSGGARLSKTIPLSYLLQVDDVRILLDCGSPGWCPEHAIAGSEDSSDSQSFSWES 61

Query: 44  LLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYL 103
             + L + A T+D VL+SH D  H G   YA    GL AP ++T PV     L   ++  
Sbjct: 62  YCKALKECAPTVDLVLISHGDLQHAGLYAYAYAHWGLRAPTYTTLPVQATARLAAVEEAE 121

Query: 104 SRRQVSEFD-------------------------LFTLDDIDSAFQSVTRLTYSQNYHLS 138
           S R   + D                         + + DD+  A+ S+  L YSQ  HL 
Sbjct: 122 SIRSEEDVDNRNETSNDAEANDRMDVDDVLRRKFVPSPDDVREAYDSIHTLRYSQPAHLQ 181

Query: 139 GKGEGIVVAPHVAGHLLGGTVWKI-TKDGEDVIYAVDYNRRKEKHLNGTVL--------- 188
           GK +G+ +    AGH LGGT+WKI +     ++YAVD N  +E+HL+GTV+         
Sbjct: 182 GKCQGLTITAFNAGHTLGGTIWKIRSPSAGTILYAVDLNHLRERHLDGTVILRGAGAGGV 241

Query: 189 -ESFVRPAVLITDAYNALHNQPPRQQREMFQ--DAISKTLRAGGNVLLPVDSAGRVLELL 245
            E+  RP ++ITDA + ++N   R++    Q  D ++ TL +  +VL+P DS+ R+LELL
Sbjct: 242 YEALARPDLMITDA-DRVNNISCRKKDRDAQLIDTVTSTLSSRHSVLMPCDSSTRLLELL 300

Query: 246 LILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITK-----------SFETSR 294
           ++L+ +W      +PI  ++      + +V+S +EW+G +I+K           + +  R
Sbjct: 301 VLLDQHWTYSRFKFPICLVSRTGREMLTFVRSMMEWLGGTISKEDVGEDTGNNANNKRRR 360

Query: 295 DN----------AFLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHDIFVEWA 342
           D+          A   K++    N   L +  +   PKL+LA   SL  G S  IF E+A
Sbjct: 361 DDDNEEEALGALALRFKYLEFFPNPQALLHTYSSKDPKLILAVPVSLSHGSSRSIFSEFA 420

Query: 343 SDVKNLVLFTERGQFGTLARML--------QADPP------------PKAVKVTMSRRVP 382
           S   N+VL T  G+ GTLAR L        + D               K +K+TM  +VP
Sbjct: 421 SVADNVVLLTSPGEDGTLARTLFDMWNDEQREDDKWNKGKLGRNVMLDKTLKLTMKSKVP 480

Query: 383 LVGEELIAY 391
           L G EL  Y
Sbjct: 481 LQGVELEEY 489



 Score = 70.5 bits (171), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 63/247 (25%), Positives = 108/247 (43%), Gaps = 46/247 (18%)

Query: 490 YIIKDEDMDQAAMHIGGDDGKLDEGSA--SLILDAKPSKVVSNELTVLVHGSAEATEHLK 547
           YI  D D+  A   +  D   L++G A   +     P K++      +VH S++  + L 
Sbjct: 678 YISYDVDVQLACRLLFVDMEGLNDGRAVKKIAAHVNPRKLI------IVHSSSDGAQSLI 731

Query: 548 QHC--LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEV 605
           + C  ++ +   +Y P I E + +     +Y + LSE+L+++V      D E+ ++   +
Sbjct: 732 EACGAVRALTKEIYAPDIGEQVQIGQHTNSYSISLSEELLASVRMSNFEDNEVGFIQGCI 791

Query: 606 GKTENGMLSLL-PIST-----------------PA-----PPHK---SVLVGDLKMADLK 639
               +  + +L P+S                  PA     P  K   S ++GDLK+  LK
Sbjct: 792 ASLASSTIPILEPVSNLTSRLEDVPMESEQLVKPARLGSRPATKLPRSTMIGDLKLTALK 851

Query: 640 PFLSSKGIQVEFAG-GALRC-----GEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYY 693
             LS  G+  EFAG G L C      E V+   +    +K  G    ++ +EG + E YY
Sbjct: 852 ARLSKMGVHTEFAGEGVLLCRNSSSDEDVSTESIVAVRKKADG----KVELEGTVTEVYY 907

Query: 694 KIRAYLY 700
            +R  +Y
Sbjct: 908 TVRRAIY 914


>gi|296424981|ref|XP_002842022.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295638279|emb|CAZ86213.1| unnamed protein product [Tuber melanosporum]
          Length = 975

 Score =  191 bits (485), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 178/612 (29%), Positives = 264/612 (43%), Gaps = 121/612 (19%)

Query: 8   TPLSGVFNENPL--SYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDT 65
           TPL G  +++    S L   +G   LID GW++ FD  +L  L +   TID +LL+HP  
Sbjct: 5   TPLLGAQSDSQACQSLLELENGIKVLIDVGWDESFDVKMLAELERHTPTIDLILLTHPTL 64

Query: 66  LHLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLF--------- 114
            H+GA  +A K +    S PV+ST PV  LG L + D YLS    S   L          
Sbjct: 65  AHMGAYAHACKHIPSFSSIPVYSTFPVSNLGRLLLQDIYLSTPLASTRLLDSAAPPVPLP 124

Query: 115 -TLDDIDSAFQSVTRLTYSQNY-------HLSGKGEGIVVAPHVAGHLLGGTVWKITKDG 166
            T  +IDS    +  L YSQ          +SGK   I +  + AGH LGGT+WKI +  
Sbjct: 125 PTSAEIDSYCTKIVTLKYSQPTPLHSAVARVSGKLGSITITAYSAGHSLGGTIWKIQQAQ 184

Query: 167 EDVIYAVDYNRRKEKHLNGT--------VLESFVRPAVLITDAYNA--LHNQPPRQQR-E 215
           E ++YAVD+N  +E  L G          +E+  +P  LI  A N+  +     R++R E
Sbjct: 185 ESIVYAVDWNHSRENCLRGAGFLSGGGVSVETLGKPTALICSARNSEVVSMAGGRKKRDE 244

Query: 216 MFQDAISKT-LRAGGNVLLPVDSAGRVLELLLILEDYWAE--------HSLNYPIYFLTY 266
           M  DAI KT L+  G VL+P DS GRVLEL+ +LE  W +              ++ +  
Sbjct: 245 MLLDAIKKTALKNSGTVLIPTDSVGRVLELVYLLEHAWRKDQELSSRAKGKGIGLFLVGR 304

Query: 267 VSSSTIDYVKSFLEWMGDSITKSFET----------SRDNA------------FLLKHVT 304
                   V S LEWM + + + FE+           RD+A            F   H+ 
Sbjct: 305 RVRRLGQVVGSMLEWMDEGVVREFESIAGGDRRGNRQRDDAEGKGNDGNKAGPFDFLHLN 364

Query: 305 LLINKSELD----NAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER--GQFG 358
           L+  +  L+    +  +  K+++AS +SL  GFS +  +  ASD KNLV+ TER  G+ G
Sbjct: 365 LVSTQGHLNRILNDGNERGKVIIASDSSLGWGFSREALMRLASDEKNLVVLTERSDGKLG 424

Query: 359 TLARMLQ-------------------ADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLK 399
               + Q                        +  ++ +  R PL G+EL AY       +
Sbjct: 425 WAGNLWQQWKEKTGSGGEANATDWQEVSLDGQRAELDIPHRTPLEGQELEAYNRHFAAQQ 484

Query: 400 KEEALKASLVKEEESKASLGPD---------------NNLSGDPMVIDANNANASADVVE 444
              +   SL+      +S+G +               +   G  +   AN+   SA  V 
Sbjct: 485 ALTSQHQSLLSNSGLPSSMGAEPDDDDASSSSDDDSDSERQGKALTT-ANSKKISAATVM 543

Query: 445 PHGG---RYR--------DILIDGF------VPPSTSVAPMFPFYENNSEWDDFGEVINP 487
             G    RY         +IL+ G       V  +     MFPF       D++GEV+  
Sbjct: 544 LGGATPSRYGAGKVDIGINILLRGKGVYDYDVRGAKGRNRMFPFVMRRRRVDEYGEVVRA 603

Query: 488 DDYIIKDEDMDQ 499
           D+Y+  +E  ++
Sbjct: 604 DEYMRAEEKAEE 615


>gi|345563127|gb|EGX46131.1| hypothetical protein AOL_s00110g295 [Arthrobotrys oligospora ATCC
           24927]
          Length = 982

 Score =  190 bits (483), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 132/396 (33%), Positives = 192/396 (48%), Gaps = 62/396 (15%)

Query: 26  DGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLG--LSAP 83
           +G   L+DCGW++ F+   LQ + K A TI  +LL+HP   H+G+  +    +      P
Sbjct: 25  NGIKILVDCGWSEPFNVDDLQQIEKHAPTISLILLTHPTLSHIGSYAHCCAHIPHFSRIP 84

Query: 84  VFSTEPVYRLGLLTMYDQYLSRRQVSEF-----DLFTL--------DDIDSAFQSVTRLT 130
           V+ T PV  LG   + D YLS   ++       DL  L        DDID  F S + L 
Sbjct: 85  VYCTYPVANLGRSLLQDAYLSTPLITSTYPPTSDLSPLVLRNPPSSDDIDRYFDSFSSLK 144

Query: 131 YSQNYHL-SGKGEGIVVAPHVAGHLLGGTVWKI--TKDGEDVIYAVDYNRRKEKHLNGT- 186
           YSQ +   S    G+ +  + AGH LGGT+W+I  +   E+++YAV +N  ++ HL+   
Sbjct: 145 YSQPFTFPSPPLAGLTITAYRAGHTLGGTIWRIQHSHSSENILYAVSWNHLRDAHLSSAS 204

Query: 187 -------VLESFVRPAVLITDAYNALHNQ--PPRQQR-EMFQDAISKTLRAGGNVLLPVD 236
                  V E F+ P  LI   YN L  Q   PR++R E+   AI K   AGG VL+P D
Sbjct: 205 FLPGPTGVSEEFLNPTALICSPYNCLPGQVSTPRKKRDELLLSAIRKAAFAGGTVLIPTD 264

Query: 237 SAGRVLELLLILEDYWAEHSLNY-----PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE 291
           S+ R+LEL  +LE  +   S N+      I      +  T  YV++ LEWM +S+ K FE
Sbjct: 265 SSARILELAYLLEHDFRSKSSNWGSSGATISLAVRTAGRTFRYVRALLEWMDESMVKEFE 324

Query: 292 TSRDN--------------------------AFLLKHVTLLINKSELDN--APDGPKLVL 323
           +   N                           F  +H+ L+ +K +L    +  G K+V+
Sbjct: 325 SVTHNNNPSSRRKPKSSNTGAGDKEDDKLYGPFDFRHLKLVEHKHQLTKILSRKGGKVVI 384

Query: 324 ASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
            S  SLE GFS ++    A D +NL++ TERG  GT
Sbjct: 385 TSDKSLEWGFSTEVVKSIADDERNLIVLTERGSEGT 420


>gi|452004821|gb|EMD97277.1| hypothetical protein COCHEDRAFT_1163978 [Cochliobolus
           heterostrophus C5]
          Length = 948

 Score =  189 bits (480), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 135/408 (33%), Positives = 186/408 (45%), Gaps = 76/408 (18%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G   LID GW++ F    L+ + +   T+  +LL+H  T HLGA  +  K   L    PV
Sbjct: 26  GIQILIDVGWDEDFSVEQLKEIERHVPTLSFILLTHATTAHLGAYVHCCKNFPLFTRIPV 85

Query: 85  FSTEPVYRLGLLTMYDQYLSRRQVSE---------------------------FDLFTLD 117
           ++T PV  LG   + D Y S    S                                T  
Sbjct: 86  YATNPVISLGRTLLQDLYESTPLASSIIPTEALNESAYSFSSALKGGKNPNILLQAPTAQ 145

Query: 118 DIDSAFQSVTRLTYSQNYH-----LSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
           +I   F  +  L YSQ +       S    G+ +  + AGH LGG++W I    E V+YA
Sbjct: 146 EIADYFARINPLRYSQPHQPIPSPHSPPLNGLTITAYSAGHTLGGSIWHIQHGMESVVYA 205

Query: 173 VDYNRRKEKHLNGT------------VLESFVRPAVLITDAYNALH---NQPPRQQREMF 217
           VD+N+ +E  L+G             VLE   RP  LI  + N       +PP ++ E  
Sbjct: 206 VDWNQAREHVLSGAAWLGGPGTSGSEVLEQLRRPTALICSSRNTDMVKVAKPPSKRDEAL 265

Query: 218 QDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--------LNYPIYFLTYVSS 269
            + I  T+  GG VL+P DS+ RVLEL  +LE+ W   +         N  IY  +  + 
Sbjct: 266 IEMIRDTVANGGTVLIPSDSSARVLELAYLLEETWHRETAEGGNGPLANTKIYLASRTAG 325

Query: 270 STIDYVKSFLEWMGDSITKSFETS-----RDNA-----------FLLKHVTLLINKSELD 313
           +T+ YV+S LEWM + I K FE S     R N            F  +HVTLL  K+ + 
Sbjct: 326 ATMRYVRSMLEWMEEGIVKEFEASAADQDRRNKGGKDEDRAKIPFDFRHVTLLERKTRVA 385

Query: 314 N--APDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER-GQFG 358
              A DGP+++LAS  +LE GFS D     ASD KNLV+ TER G+ G
Sbjct: 386 RMLAADGPRVILASDTTLEWGFSKDALRSLASDEKNLVILTERSGELG 433


>gi|451853389|gb|EMD66683.1| hypothetical protein COCSADRAFT_35187 [Cochliobolus sativus ND90Pr]
          Length = 948

 Score =  189 bits (479), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 135/414 (32%), Positives = 188/414 (45%), Gaps = 76/414 (18%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G   LID GW++ F    L+ + +   T+  +LL+H  T HLGA  +  K   L    PV
Sbjct: 26  GIQILIDVGWDEDFSVEQLKEIERHVPTLSFILLTHATTAHLGAYVHCCKNFPLFTRIPV 85

Query: 85  FSTEPVYRLGLLTMYDQYLSRRQVSE---------------------------FDLFTLD 117
           ++T PV  LG   + D Y S    S                                T  
Sbjct: 86  YATNPVISLGRTLLQDLYESTPLASSIIPTEALNESAYSFSSALKGGKNPNILLQAPTAQ 145

Query: 118 DIDSAFQSVTRLTYSQNYH-----LSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
           +I   F  +  L YSQ +       S    G+ +  + AGH LGG++W I    E V+YA
Sbjct: 146 EIADYFARINPLRYSQPHQPIPSPHSPPLNGLTITAYSAGHTLGGSIWHIQHGMESVVYA 205

Query: 173 VDYNRRKEKHLNGT------------VLESFVRPAVLITDAYNALH---NQPPRQQREMF 217
           VD+N+ +E  L+G             VLE   RP  LI  + N       +PP ++ E  
Sbjct: 206 VDWNQAREHVLSGAAWLGGPGTGGSEVLEQLRRPTALICSSRNTDMVKVAKPPSKRDEAL 265

Query: 218 QDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--------LNYPIYFLTYVSS 269
            + I  T+  GG VL+P DS+ RVLEL  +LE+ W   +         N  IY  +  + 
Sbjct: 266 IEMIRDTVANGGTVLIPSDSSARVLELAYLLEETWHRETAEGGNSPLTNAKIYLASRTAG 325

Query: 270 STIDYVKSFLEWMGDSITKSFETS-----RDNA-----------FLLKHVTLLINKSELD 313
           +T+ YV+S LEWM + I K FE S     R N            F  +H+TLL  K+ + 
Sbjct: 326 ATMRYVRSMLEWMEEGIVKEFEASAADQDRRNKGGKDEDRAKIPFDFRHITLLERKTRVA 385

Query: 314 N--APDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER-GQFGTLARML 364
              A DGP+++LAS  +LE GFS D     ASD KNLV+ TER G+ G   + L
Sbjct: 386 RMLAADGPRVILASDTTLEWGFSKDALRSLASDEKNLVILTERSGELGAQRKGL 439


>gi|189192102|ref|XP_001932390.1| cleavage and polyadenylation specificity factor subunit 2
           [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187973996|gb|EDU41495.1| cleavage and polyadenylation specificity factor subunit 2
           [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 954

 Score =  188 bits (477), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 172/618 (27%), Positives = 246/618 (39%), Gaps = 132/618 (21%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G   LID GW++ F+   L+ + +   TI  +LL+H  T HLGA  +  K   L    PV
Sbjct: 26  GIQILIDVGWDEQFNVEKLKEIERHVPTISFILLTHATTAHLGAYVHCCKNFPLFTRIPV 85

Query: 85  FSTEPVYRLGLLTMYDQYLSRRQVSE---------------------------FDLFTLD 117
           ++T PV  LG   + D Y S    S                                T  
Sbjct: 86  YATNPVISLGRTLLQDLYESTPLASSIIPTEALNESAYSFSSALKGGNNPNILLQAPTSQ 145

Query: 118 DIDSAFQSVTRLTYSQNYH-----LSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
           +I   F  +  L YSQ +       S    G+ +  + AGH LGG++W I    E V+YA
Sbjct: 146 EIADYFARINPLRYSQPHEPIPSPHSPPLNGLTITAYSAGHTLGGSIWHIQHGMESVVYA 205

Query: 173 VDYNRRKEKHLNGT------------VLESFVRPAVLITDAYNA---LHNQPPRQQREMF 217
           VD+N+ +E  L+G             VLE    P  LI    N       + P ++ E  
Sbjct: 206 VDWNQAREHVLSGAAWLGGPGTGGSEVLEQLRHPTALICSTKNTGMVKKARSPNERDEAL 265

Query: 218 QDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL---------NYPIYFLTYVS 268
            + I  T+  GG VL+P DS+ R+LEL  +LED W                 +Y  +   
Sbjct: 266 LEMIRNTISNGGTVLIPSDSSARILELAYLLEDTWEREVTEGDGSGPLSTTKLYLASRTG 325

Query: 269 SSTIDYVKSFLEWMGDSITKSFETSRDNA-----------------FLLKHVTLLINKSE 311
            +T+ YV+S LEWM + I K FE S  +                  F  +H+TLL  K+ 
Sbjct: 326 GATMRYVRSMLEWMEEGIVKEFEASAADQDRRTKEGQEEERVAKVPFDFRHITLLERKTR 385

Query: 312 LDN--APDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER-GQFGT----LARML 364
           +    A  GP+++LAS A+LE GFS D     ASD KNLV+ TER G+ G+    L R L
Sbjct: 386 VARMLAGAGPRVILASDATLEWGFSKDAIRSLASDEKNLVILTERSGELGSQKKGLGRYL 445

Query: 365 -----------QADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEE 413
                        D P   V      + PL     +A + ++  L ++      L  + +
Sbjct: 446 WDLWNQRNASPGEDAPSTTVIDASGNQAPLDTVRTVALQGDEVPLYQQ-----FLASQRQ 500

Query: 414 SKASLGPDNN--LSGDPMVID--------------------ANNANASADVVEPHGGR-- 449
            + ++G DN   L     V+D                    A NA  +        G   
Sbjct: 501 RQTTMGGDNAAMLETSADVVDDRSSTESESSEGSGDGYRGKALNATVALQHARNKLGMTD 560

Query: 450 ---------YRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKDEDMDQA 500
                     R  + D  V        MFPF       DDFG++I P+D+  + E+ D A
Sbjct: 561 AELGVNVLIRRKNVYDYEVQGKKGKERMFPFQAKKRRTDDFGDLIRPEDF-ARAEERDNA 619

Query: 501 AMHIGGDDGKLDEGSASL 518
           A      DG   E +  L
Sbjct: 620 AGEALRGDGTKKENAVGL 637


>gi|344253621|gb|EGW09725.1| Sodium/potassium/calcium exchanger 4 [Cricetulus griseus]
          Length = 1206

 Score =  188 bits (477), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 131/412 (31%), Positives = 207/412 (50%), Gaps = 57/412 (13%)

Query: 221 ISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKS 277
           + +TLR  GNVL+ VD+AGRVLEL  +L+  W        +Y    L  VS + +++ KS
Sbjct: 141 VLETLRGDGNVLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKS 200

Query: 278 FLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
            +EWM D + + FE  R+N F  +H++L    S+L   P  PK+VLAS   LE GFS D+
Sbjct: 201 QVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDL 259

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTR 397
           F++W  D KN ++ T R   GTLAR L  +P  K  ++ + +RV L G+EL  Y E++  
Sbjct: 260 FIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKVTEIELRKRVKLEGKELEEYVEKEKL 319

Query: 398 LKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILID 456
            K+         KE +  +S                + ++   D+ +P   + + D+++ 
Sbjct: 320 KKEAAKKLEQ-SKEADIDSS----------------DESDVEEDIDQPSAHKTKHDLMMK 362

Query: 457 G-------FVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKDEDMDQAAMHIGGDDG 509
           G       F   +    PMFP  E   +WD++GE+I      I  E         G  DG
Sbjct: 363 GEGSRKGSFFKQAKKSYPMFPAPEERIKWDEYGEIIKARVTYIDYE---------GRSDG 413

Query: 510 KLDEGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQHCL----KHVCPHVYTPQIEE 565
              +    +I   KP ++      ++VHG  EA++ L + C     K +   VY P++ E
Sbjct: 414 ---DSIKKIINQMKPRQL------IIVHGPPEASQDLAECCRAFGGKDI--KVYMPKLHE 462

Query: 566 TIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVGKTENGML 613
           T+D TS+   Y+V+L + L+S++ F K  D E+AW+D      V K + G++
Sbjct: 463 TVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDMRVSKVDTGVI 514



 Score =  155 bits (392), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 79/185 (42%), Positives = 119/185 (64%), Gaps = 13/185 (7%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALP+A+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPFAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSG------KGEG-IVVAPHVAGHLLG-----GTVWKITKDGED 168
           +AF  + +L +SQ  +L        +G+G +++A   AG +L        +W+ TKD   
Sbjct: 121 AAFDKIQQLKFSQIVNLKANVLETLRGDGNVLIAVDTAGRVLELAQLLDQIWR-TKDAGL 179

Query: 169 VIYAV 173
            +Y++
Sbjct: 180 GVYSL 184


>gi|330920784|ref|XP_003299151.1| hypothetical protein PTT_10086 [Pyrenophora teres f. teres 0-1]
 gi|311327303|gb|EFQ92764.1| hypothetical protein PTT_10086 [Pyrenophora teres f. teres 0-1]
          Length = 953

 Score =  186 bits (472), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 167/610 (27%), Positives = 246/610 (40%), Gaps = 131/610 (21%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G   LID GW++ F+   L+ + +   TI  +LL+H  T HLGA  +  K   L    PV
Sbjct: 26  GIQILIDVGWDEQFNVEKLKEIERHVPTISFILLTHATTAHLGAYVHCCKNFPLFTRIPV 85

Query: 85  FSTEPVYRLGLLTMYDQYLSRRQVSE---------------------------FDLFTLD 117
           ++T PV  LG   + D Y S    S                                T  
Sbjct: 86  YATNPVISLGRTLLQDLYESTPLASSIIPTEALNESAYSFSSALKGGKNPNILLQAPTSQ 145

Query: 118 DIDSAFQSVTRLTYSQNYH-----LSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
           +I   F  ++ L YSQ +       S    G+ +  + AGH LGG++W I    E V+YA
Sbjct: 146 EIGDYFARISPLRYSQPHQPIPSPHSPPLNGLTITAYSAGHTLGGSIWHIQHGMESVVYA 205

Query: 173 VDYNRRKEKHLNGT------------VLESFVRPAVLITDAYNA---LHNQPPRQQREMF 217
           VD+N+ +E  L+G             VLE    P  LI  + N       + P ++ E  
Sbjct: 206 VDWNQAREHVLSGAAWLGGPGTGGSEVLEQLRHPTALICSSKNTGMVKKARSPNERDEAL 265

Query: 218 QDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLN---------YPIYFLTYVS 268
            + I  T+  GG VL+P DS+ R+LEL  +LE+ W                 +Y  +   
Sbjct: 266 LEMIRNTVSNGGTVLIPSDSSARILELAYLLEETWEREETQGDGSGPLSTTKLYLASRTG 325

Query: 269 SSTIDYVKSFLEWMGDSITKSFETSRDNA-----------------FLLKHVTLLINKSE 311
            +T+ YV+S LEWM + I K FE S  +                  F  +H+TLL  K+ 
Sbjct: 326 GATMRYVRSMLEWMEEGIVKEFEASAADQDRRTKGGKEDERVAKVPFDFRHITLLERKTR 385

Query: 312 LDN--APDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER-GQFGT----LARML 364
           +    A  GP+++LAS A+LE GFS D     ASD KNLV+ TER G+ G+    L R L
Sbjct: 386 VARMLAGAGPRVILASDATLEWGFSKDAIRTLASDEKNLVILTERSGELGSQKKGLGRYL 445

Query: 365 -----------QADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEE 413
                        D P   V      + PL     +A + ++  L ++      L  + +
Sbjct: 446 WDLWNQRNASPGEDAPSTTVIDASGNQAPLDTIRTVALQGDEVPLYQQ-----FLASQRQ 500

Query: 414 SKASLGPDNN--LSGDPMVID--------------------ANNANASADVVEPHGGR-- 449
            + ++G DN   L     V+D                    A NA  +        G   
Sbjct: 501 RQTTMGGDNAAMLETSADVVDDRSSTESESSEGSGDGYRGKALNATVALQHARNKLGMTD 560

Query: 450 ---------YRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKDEDMDQA 500
                     R  + D  V        MFPF       DDFG++I P+D+   +E+ + A
Sbjct: 561 AELGVNVLIRRKNVYDYEVQGKKGKERMFPFQAKKRRTDDFGDLIRPEDFARAEEEDNTA 620

Query: 501 AMHIGGDDGK 510
              + G+  K
Sbjct: 621 GEALRGEGTK 630


>gi|119601887|gb|EAW81481.1| cleavage and polyadenylation specific factor 2, 100kDa, isoform
           CRA_b [Homo sapiens]
          Length = 496

 Score =  181 bits (460), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 143/532 (26%), Positives = 236/532 (44%), Gaps = 143/532 (26%)

Query: 282 MGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEW 341
           M D + + FE  R+N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W
Sbjct: 1   MSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQW 59

Query: 342 ASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKE 401
             D KN ++ T R   GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+ 
Sbjct: 60  CQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKEKLKKEA 119

Query: 402 EALKASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG--- 457
                                  S +  +  ++ ++   D+ +P   + + D+++ G   
Sbjct: 120 AKKLEQ-----------------SKEADIDSSDESDIEEDIDQPSAHKTKHDLMMKGEGS 162

Query: 458 ----FVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIK-------------------D 494
               F   +    PMFP  E   +WD++GE+I P+D+++                    D
Sbjct: 163 RKGSFFKQAKKSYPMFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGD 222

Query: 495 EDMDQ-------------AAMHIGGD------DGKLDEGSASLILDA-KPSKVVSNELTV 534
           E MDQ              ++ I         +G+ D  S   I++  KP ++      +
Sbjct: 223 EPMDQDLSDVPTKCISTTESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQL------I 276

Query: 535 LVHGSAEATEHLKQHCL----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLF 590
           +VHG  EA++ L + C     K +   VY P++ ET+D TS+   Y+V+L + L+S++ F
Sbjct: 277 IVHGPPEASQDLAECCRAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQF 334

Query: 591 KKLGDYEIAWVDA----EVGKTENGML--------------------------------- 613
            K  D E+AW+D      V K + G++                                 
Sbjct: 335 CKAKDAELAWIDGVLDMRVSKVDTGVILEEGELKDDGEDSEMQVEAPSDSSVIAQQKAMK 394

Query: 614 --------------SLLPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGG 654
                          ++P   P PP     H+SV + + +++D K  L  +GIQ EF GG
Sbjct: 395 SLFGDDEKETGEESEIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGG 454

Query: 655 ALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
            L C   V +R+          + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 455 VLVCNNQVAVRR----------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 496


>gi|58266278|ref|XP_570295.1| cleavage and polyadenylation specificity factor subunit
           [Cryptococcus neoformans var. neoformans JEC21]
 gi|134111080|ref|XP_775682.1| hypothetical protein CNBD4110 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50258346|gb|EAL21035.1| hypothetical protein CNBD4110 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|57226528|gb|AAW42988.1| cleavage and polyadenylation specificity factor subunit, putative
           [Cryptococcus neoformans var. neoformans JEC21]
          Length = 899

 Score =  179 bits (455), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 226/907 (24%), Positives = 373/907 (41%), Gaps = 229/907 (25%)

Query: 5   VQVTPLSGVFNEN----PLSYLVSIDGFNFLIDCGWNDHFDPSLL------QPLSKVAST 54
           + +TPLS    E     P+ YL+ +D    L+D G  D+   S        + +  +A T
Sbjct: 2   ITLTPLSASAAETSPSEPICYLLELDDARILLDMGQRDYRASSQQCSWDYEEAVRDLAPT 61

Query: 55  IDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD-- 112
           +  VLLSH  + +L   PYA  + GL+ PV++T+P   +G +    +  S R     D  
Sbjct: 62  LSLVLLSHSSSNYLSLYPYARARWGLTCPVYATQPTVEMGRVVCLAEAESWRSECPVDSE 121

Query: 113 ----------------LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLG 156
                           + T+++I  AF  +  + YSQ  HL G    +++ P  +GH LG
Sbjct: 122 KVAADDGSKKPLRGPFVPTVEEIHEAFDWIKAVRYSQPLHLGGDFSHLLLTPFASGHTLG 181

Query: 157 GTVWKI-TKDGEDVIYAVDYNRRKEKHLNGTV---------LESFVRPAVLITDAYNALH 206
           G+++KI +     V+YAV  N   E+HL+G V          +  +RP +LI +   ++ 
Sbjct: 182 GSLFKIRSPTSGTVLYAVGINHTSERHLDGMVGVQNGPTGYADGVLRPDLLIVEGGRSMV 241

Query: 207 NQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA--------EHSL 257
             P R++RE    D I+ TL +  +VLLPVD + R+LEL+++L+ +W         +   
Sbjct: 242 VNPKRKEREAALIDTITSTLESNHSVLLPVDPSPRLLELMILLDQHWTFKRTPKVKQRRY 301

Query: 258 N--------YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF---------ETSRDNAFLL 300
           N        YP+  ++  +   + + +S ++WMG  +  S          + +R     L
Sbjct: 302 NEPPADLWPYPLCIVSKTAQDMVAFARSLIDWMGGVVKDSAGDMVDVGRGKRARGARMAL 361

Query: 301 ---------KHVTLLINKSE-LDNAP-DGPKLVLASMASLEAGFSHDIFVEWASDVKNLV 349
                    +HV   +N ++ L   P   PKLVLA   ++  G S  +F   A+   N++
Sbjct: 362 GSEYGVLDFRHVQFFLNTTDLLQTYPLTRPKLVLAVPPTMSHGPSRFLFTAMANTEGNVI 421

Query: 350 LFTERGQFGTLARML--------------------QADPPPKAVKVTMSRRVPLVGEELI 389
           + T R +  TLAR L                            ++V +  +VPL G EL 
Sbjct: 422 MLTGRSEEQTLARDLYNRWERSQTTGSKWGEGKIGHLTQLEGKLQVEVDSKVPLSGAELE 481

Query: 390 AY-EEEQTRLKKEEALKASLVK----------EEESKASLGPDNNLSGDPMVIDANNANA 438
           A+ E E+ + +KE A KA++ +          E +S +    D + +GD  V     ANA
Sbjct: 482 AHVESERLQKEKEAAHKAAVDRSRRMLEADDLESDSDSESEADGH-AGDITVRRTEGANA 540

Query: 439 SADVVEPHGGRYRDILIDGFVPPSTSVAP-----MFPFYENNS-EWDDFGEVIN------ 486
            A   E       DI + G    S   A      MFPF E    + D FGE ++      
Sbjct: 541 YAGDGEDVRTMSFDIYVKGQQMRSGRGAEMARFRMFPFVERKGRKIDQFGEGLDIGQWMR 600

Query: 487 ----------------------------------PDDYIIKDEDMDQAAMHIGGDDGKLD 512
                                             P  Y+ ++  ++  AM    D   L 
Sbjct: 601 KGREIAEEGETEEVREAKKRKEEEEEKAKQAPEPPSKYVSEEVGVELKAMIGFVDMEGLH 660

Query: 513 EGSA--SLILDAKPSKVVSNELTVLVHGSAEATEHLKQH--CLKHVCPHVYTPQIEETID 568
           +G +  ++I D +P K+      ++V  S E+T++L      +      +++P + E I 
Sbjct: 661 DGQSIKTIISDLQPRKL------IIVRSSKESTQNLISFLGSVTGFTRDIFSPSLTEEIK 714

Query: 569 VTSDLCAYKVQLSEKLMSNVLFKKLGD---YEIAWVDAEVGKTENGMLSLL--------- 616
           +   + +Y + L + + S+ L KK  D   YE+ +VD ++       + +L         
Sbjct: 715 IGEHVQSYSLTLGDSI-SSALAKKWSDFEGYEVTFVDGKIVLPAGSTIPILETPSLVGPL 773

Query: 617 ---------------------------PIST--PAPPHKSVLVGDLKMADLKPFLS--SK 645
                                      PIS+  P P   S  +GDL++A LK  LS  + 
Sbjct: 774 VKTEAEGDDADDEAKPSAEELAAASAPPISSSAPLPLPTSTFIGDLRLARLKHRLSLLNP 833

Query: 646 GIQVEFAG-GALRCG-----------EYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYY 693
            I  EFAG G L CG             V++RK+G            +IV+EG +   Y 
Sbjct: 834 PIPAEFAGEGVLVCGPGIAQEAQGAASVVSVRKIGEG----------KIVLEGCIGRVYV 883

Query: 694 KIRAYLY 700
           ++R  LY
Sbjct: 884 EVRKALY 890


>gi|407929750|gb|EKG22561.1| RNA-metabolising metallo-beta-lactamase [Macrophomina phaseolina
           MS6]
          Length = 974

 Score =  179 bits (454), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 131/432 (30%), Positives = 203/432 (46%), Gaps = 85/432 (19%)

Query: 8   TPLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDT 65
           TPL G  + +  S  L+ +DG    LID GW++ FD   L+ L +   T+  VLL+H  T
Sbjct: 5   TPLLGAQSTSTASQSLLELDGGIKILIDVGWDETFDAEKLKELERQIPTLSCVLLTHATT 64

Query: 66  LHLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQY----LSRRQVSEFDLF----- 114
            HLGA  +  K   L    P+++T PV  LG   + D Y    L+   + E  L      
Sbjct: 65  AHLGAFAHCCKHFPLFTRIPIYATTPVISLGRTLLQDLYTSTPLASSIIPEAALSDSAYS 124

Query: 115 -----------------TLDDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAG 152
                            T ++I + F  +  L YSQ +            G+ +  + AG
Sbjct: 125 FPALQGGNHPNILLQPPTTEEIANYFSLIHGLKYSQPHQPLPSPFSPPLNGLTITAYSAG 184

Query: 153 HLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITD 200
           H LGGT+W I    E ++YAVD+N+ +E  L+G             V+E   RP  ++  
Sbjct: 185 HTLGGTIWHIQHGLESIVYAVDWNQAREHVLSGAAWLGGSGAGGAEVIEQLRRPTAMVCS 244

Query: 201 AYNA--LHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW---AE 254
           +  A  +     RQ+R E+  + I +T+  GG+VL+P DS+ RVLEL  +LE+ W   A+
Sbjct: 245 SRGAERIALAGGRQKRDELLLEMIKETVCNGGSVLIPSDSSARVLELAYLLENAWQADAQ 304

Query: 255 HSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS--------------------- 293
              N P+Y  +   ++T+ Y +S LEWM + I + FE +                     
Sbjct: 305 SFGNAPLYLASRTCAATMRYARSMLEWMDEGIVREFEAASSGQGTDDNKRSRTQQGSGRS 364

Query: 294 --------RDNA-FLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHDIFVEWA 342
                   + NA F  + + L+  ++++    A +GPK++LAS  SLE GFS +     A
Sbjct: 365 KEGKEDAKKPNAPFDFRSLRLVERRTQVSRMLAAEGPKVILASDVSLEWGFSKEAVRALA 424

Query: 343 SDVKNLVLFTER 354
           +D +NLV+ TER
Sbjct: 425 ADSRNLVILTER 436


>gi|224009389|ref|XP_002293653.1| cleavage and polyadenylation specificity factor [Thalassiosira
           pseudonana CCMP1335]
 gi|220971053|gb|EED89389.1| cleavage and polyadenylation specificity factor [Thalassiosira
           pseudonana CCMP1335]
          Length = 347

 Score =  173 bits (439), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 116/351 (33%), Positives = 186/351 (52%), Gaps = 20/351 (5%)

Query: 18  PLSYLVSIDGFNFLIDCGWNDHFDP--SLLQPLSKVASTIDAVLLSHPDTLHLGALPY-- 73
           P   LV   G   L++ GW++      S+   +      +DA+L++      LG LP   
Sbjct: 1   PSCTLVEYAGMKLLLNAGWDETLPAATSVSDIIPNELPDVDAILITDSTLSSLGGLPMYF 60

Query: 74  -AMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAF--QSVTRLT 130
              +    + P  +T P  ++G +T+YD + S         ++LDD+D+ F  +SV  L 
Sbjct: 61  GGNQDKKRNPPFLATYPTVKMGQMTLYDHHASLSLDGTHPGYSLDDVDAVFGEESVITLK 120

Query: 131 YSQNYHLSGKGEGIVVAPHVAGHLLGGT--VWKITKDGEDVIYAVDYNRRKEKHLNGTVL 188
           YSQ  +     + + + PH++GH++GG   V K   D  +VI A  Y+  KEKHL G+ L
Sbjct: 121 YSQTLNSKTSNKLLSITPHLSGHVVGGCYYVLKQLADDTEVILAPTYHHAKEKHLAGSTL 180

Query: 189 ESF-VRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLI 247
             F V    L+T    A  N   R + EM +  ++  LR  GNVLLPVD++GRVLELLLI
Sbjct: 181 HKFGVNADALLTMPGGARGN---RSEAEMIESMMA-ALRRDGNVLLPVDASGRVLELLLI 236

Query: 248 LEDYWAEHSLN--YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTL 305
           L+ YW    L   Y + ++  ++ +TI++ +S LEWM + +   F++ R + + LK V +
Sbjct: 237 LDRYWERQRLGGAYNLCWVGPMALNTIEFARSQLEWMAEPLGAQFDSQRGHPYALKSVRI 296

Query: 306 LINKSELDNAPD----GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFT 352
             + +EL++  +     P  VLAS +SL+ G + D+ ++W  +  NLVL T
Sbjct: 297 CSSVAELESVIESSNGNPTAVLASGSSLDHGPARDLLLKWGDNPDNLVLIT 347


>gi|403418874|emb|CCM05574.1| predicted protein [Fibroporia radiculosa]
          Length = 826

 Score =  172 bits (437), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 104/290 (35%), Positives = 154/290 (53%), Gaps = 38/290 (13%)

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKI-TKDGEDVIY 171
           + T+ D++ AF S+  L YSQ  HL GK +G+ + P  AGH LGGT+WKI +     ++Y
Sbjct: 56  IATIQDVNEAFDSMNVLRYSQPCHLQGKCQGLTIIPFNAGHTLGGTIWKIRSPTAGTILY 115

Query: 172 AVDYNRRKEKHLNGTVL-----------ESFVRPAVLITDAYNALHNQPPRQQREM-FQD 219
           AVD N  +E+HL+GTVL           ES  RP +LITDA  A      R+ R+    D
Sbjct: 116 AVDMNHTRERHLDGTVLVRQASAGGGIFESLARPDLLITDAERANVTTARRKDRDAALLD 175

Query: 220 AISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFL 279
            ++ TL +  ++LLP D++ RVLELL++L+ +W    L +PI  L+      + +V+S +
Sbjct: 176 CVTATLTSRNSLLLPCDASTRVLELLVLLDQHWNYSRLKFPICLLSRTGREMLTFVRSMM 235

Query: 280 EWMGDSITKS-------------FETSRDN----------AFLLKHVTLLINKSELDN-- 314
           EW+G +++K               +  RD           A   +H+    N   L +  
Sbjct: 236 EWLGGTVSKEDVGEEATGGQGKGNKRRRDEDGDEEALGAFALRFRHLEFFPNPQALLHTY 295

Query: 315 APDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML 364
           +   PKL+LA  ASL  G S  +F E+A    N+VL T RG+ GTL R+L
Sbjct: 296 SSKDPKLILAVPASLSHGPSRVLFTEFAETPDNVVLLTGRGEEGTLGRIL 345



 Score = 61.2 bits (147), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 71/355 (20%), Positives = 132/355 (37%), Gaps = 105/355 (29%)

Query: 441 DVVEPHGGRYRDILIDGFVPPSTSVAP----------MFPFYENNSEWDDFGEVIN---- 486
           D  EP      DI + G V  +TS             MFP+ E     D++GE ++    
Sbjct: 486 DSDEPMRALSFDIYLKGNVARTTSFFKSAEGQSQRFRMFPYVEKKRRVDEYGETVDVGMW 545

Query: 487 ----------------------------------PDDYIIKDEDMDQAAMHIGGDDGKLD 512
                                             P  +I  + D+  A      D   L+
Sbjct: 546 LRKGKVLEEDAESEETKELRRKAEEEAKKVPVELPSKFITTEVDVQLACRLFFVDLEGLN 605

Query: 513 EGSA--SLILDAKPSKVVSNELTVLVHGSAEATEHLKQHC--LKHVCPHVYTPQIEETID 568
           +G A  +++    P K++      +VH  +  T+ L + C  ++ +   +Y P   E I 
Sbjct: 606 DGRAVKTIVPQVNPRKMI------VVHAPSNYTDALIESCSNIRAMTKDIYAPAQGECIQ 659

Query: 569 VTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLL-PISTPAPPH-- 625
           +     ++ + LS++L++++   +  D E+ +V   +    +  + +L P+S  +     
Sbjct: 660 IGQHTNSFSISLSDELLTSLKMSQFEDNEVGYVTGRIASLASSTIPVLEPVSFTSAQFEA 719

Query: 626 --------------------KSVLVGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTI 664
                               +S ++G+LK+  LK  L++ G+  E  G G L CG     
Sbjct: 720 KSRKSLQSRMLGSRPTLTLPQSTMIGELKLTALKSRLATVGVHAELIGEGVLICG----- 774

Query: 665 RKVGPAGQKGGGSGTQ-------------QIVIEGPLCEDYYKIRAYLYSQFYLL 706
                A  K GGSG               ++ +EG + + YY +R  +Y+   L+
Sbjct: 775 -----AAAKKGGSGESLEDSVTVKKMTRGRVELEGSVSDIYYTVRKEIYNLHALV 824


>gi|378733596|gb|EHY60055.1| hypothetical protein HMPREF1120_08027 [Exophiala dermatitidis
           NIH/UT8656]
          Length = 948

 Score =  172 bits (436), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 120/421 (28%), Positives = 197/421 (46%), Gaps = 74/421 (17%)

Query: 8   TPLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDT 65
           TPL G  +++  S  L+ +DG    L+D GW++ FD   L  + K  ST+  +LL+HP  
Sbjct: 5   TPLLGAQSDSRASQSLLELDGGVKILVDVGWDERFDTRQLTEIEKHTSTLSFILLTHPTI 64

Query: 66  LHLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLSRRQVSEF------------ 111
            H+GA  +  K + L    P+++T PV   G   + D Y S    + F            
Sbjct: 65  SHIGAFAHCCKHIPLFSQVPIYATPPVIAFGRTLLEDLYSSSPLAATFIPGSASPEDGTS 124

Query: 112 -----------DLFTLDDIDSAFQSVTRLTYSQ-----NYHLSGKGEGIVVAPHVAGHLL 155
                         T ++I+  FQ ++ L YSQ         S   EG+ +  + AGH L
Sbjct: 125 ADDKSRSNILRQAPTFEEINKYFQLISPLKYSQPLQPTASQFSAPVEGLTLTAYNAGHTL 184

Query: 156 GGTVWKITKDGEDVIYAVDYNRRKEKHLNGT----------VLESFVRPAVLITDAYNAL 205
           GGT+W I +  E ++YAVD+N+ +E  + G           V+E   +P+ L+  +  A 
Sbjct: 185 GGTIWHIQQGMESIVYAVDWNQARENVVAGAAWFGGVGGAEVIEQLRKPSALVCSSVGAT 244

Query: 206 H---NQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE--HSLNY- 259
               +   + + +     I  ++  GG VL+P DS+ RVLEL  +LE  W++  HS ++ 
Sbjct: 245 RVALSGGRKARDDALLGHIKTSVAKGGTVLIPTDSSARVLELAWLLEKAWSDPAHSASFK 304

Query: 260 --PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN--------------------- 296
              +Y  +   ++T+ + +S LEWM DSI + FE   +N                     
Sbjct: 305 DVKVYMASRSGNATLRHARSLLEWMDDSIVREFEGEDENPTTQPYNRRGGNKAAGTNKPS 364

Query: 297 -AFLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
             F  K+V ++  K +L+     +GP+++LAS  +L+ GFS  +        +NLV+ TE
Sbjct: 365 RPFEFKNVKVVERKHQLEKLLKVEGPRVILASDVTLDWGFSRSLLEHVVQKPENLVILTE 424

Query: 354 R 354
           R
Sbjct: 425 R 425


>gi|357624104|gb|EHJ75000.1| hypothetical protein KGM_18742 [Danaus plexippus]
          Length = 595

 Score =  172 bits (436), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 111/366 (30%), Positives = 187/366 (51%), Gaps = 21/366 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           +++TPL    +      L+S+ G N ++DCG    +ND     D S + P   + S ID 
Sbjct: 4   IKITPLGAGQDVGRSCILLSMGGKNIMLDCGMHMGYNDERRFPDFSYIVPEGPITSQIDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G + P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYMSEMVGYTGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + VT +T  Q+  +  + E   +  + AGH+LG  ++ I    + V+Y  DYN
Sbjct: 124 QMIKDCIKKVTAVTLHQSVMVDNELE---IKAYYAGHVLGAAMFWIRVGSQSVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LI+++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLISESTYATTIRDSKRCRERDFLKKVHECVEKGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L YP+YF   ++    +Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETYWERMNLKYPVYFALGLTEKANNYYKMFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL---FT 352
           N F  KH+    +KS +DN   G  +V A+   L AG S +IF +WA   +N+++   F 
Sbjct: 298 NMFDFKHIKPF-DKSYIDNP--GAMVVFATPGMLHAGLSLNIFKKWAPYEQNMLIMPGFC 354

Query: 353 ERGQFG 358
            +G  G
Sbjct: 355 VQGTVG 360


>gi|449299688|gb|EMC95701.1| hypothetical protein BAUCODRAFT_71003 [Baudoinia compniacensis UAMH
           10762]
          Length = 938

 Score =  171 bits (434), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 130/410 (31%), Positives = 189/410 (46%), Gaps = 58/410 (14%)

Query: 8   TPLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDT 65
           TPL G   E+  S  L+ +DG    L+D GW+  FD   L  + +  ST+  VLL+H  T
Sbjct: 5   TPLLGAQAESAASQSLLELDGGIKVLVDVGWDAAFDAQRLDAIERQTSTLSLVLLTHATT 64

Query: 66  LHLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLF--------- 114
            HLGA  +  K + L    PV++T PV  LG   + D Y S    +              
Sbjct: 65  EHLGAYAHCCKHIPLFSKVPVYATTPVINLGRTLLLDLYASSPLAASIIHTSSISSSSTT 124

Query: 115 --------------TLDDIDSAFQSVTRLTYSQNYH-----LSGKGEGIVVAPHVAGHLL 155
                         T ++I + F S+  L YSQ +       S    G+ +  + AGH L
Sbjct: 125 SKADSSPNLLLQPPTPEEIATYFASINALKYSQPHQPVASSWSPALGGLTITAYGAGHTL 184

Query: 156 GGTVWKITKDGEDVIYAVDYNRRKEKHLNGT--------VLESFVRPAVLITDAYNALHN 207
           GGTVW I +  E ++YA D+N+ +E  L G         ++E   RP  LI  +      
Sbjct: 185 GGTVWHIQQGLESIVYAADWNQGRENLLPGAALLSGGQEIIEPLQRPTALICSSKGVEKA 244

Query: 208 QP-PRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE-----HSLNY- 259
           Q   R+ R+ M    +  T+  GG VL+P DS+ R+LEL  +L + W E     H+  Y 
Sbjct: 245 QSQSRKDRDGMLLSLVRDTIAQGGKVLIPTDSSARMLELAFLLNEAWKENLDGPHAATYR 304

Query: 260 --PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET------SRDNAFLLKHVTLLINKSE 311
              +Y  +   S++I Y++S LEW+ +S+    E          N    +HV L+   S 
Sbjct: 305 SARVYMASKSGSASIRYLQSMLEWVEESVRAEAEAHLTKTKGSTNPLNWQHVKLVERNST 364

Query: 312 LDNA--PDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
           L+ A     P + LAS ASLE GFS       A+D KNLV+ TE+   G+
Sbjct: 365 LERAVQRSQPCVFLASDASLEWGFSRLALESLATDTKNLVILTEKSAPGS 414


>gi|157107341|ref|XP_001649735.1| cleavage and polyadenylation specificity factor [Aedes aegypti]
 gi|108879612|gb|EAT43837.1| AAEL004757-PA [Aedes aegypti]
          Length = 613

 Score =  171 bits (434), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 107/356 (30%), Positives = 180/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           +++TPL    +      L+S+ G N ++DCG    +ND     D S + P   + + ID 
Sbjct: 4   IKITPLGAGQDVGRSCILLSMGGKNIMLDCGMHMGYNDERRFPDFSFIVPEGPITNHIDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G + P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYMTEMIGYTGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTT 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +T  Q+  +  + E   +  + AGH+LG  ++ I    + V+Y  DYN
Sbjct: 124 QMIKDCMKKVIAVTLHQSVMVDSELE---IKAYYAGHVLGAAMFWIRVGSQSVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   +P +LIT++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CKPDLLITESTYATTIRDSKRCRERDFLKKVHECVAKGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L YPIYF   ++    +Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETYWERMNLKYPIYFAVGLTEKANNYYKMFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +K  +DN   G  +V A+   L AG S  IF +WA +  N+V+ 
Sbjct: 298 NMFDFKHIKPF-DKGYIDNP--GAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350


>gi|321257420|ref|XP_003193582.1| cleavage and polyadenylation specificity factor subunit
           [Cryptococcus gattii WM276]
 gi|317460052|gb|ADV21795.1| Cleavage and polyadenylation specificity factor subunit, putative
           [Cryptococcus gattii WM276]
          Length = 900

 Score =  171 bits (433), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 220/889 (24%), Positives = 363/889 (40%), Gaps = 224/889 (25%)

Query: 19  LSYLVSIDGFNFLIDCGWNDHFDPSLL------QPLSKVASTIDAVLLSHPDTLHLGALP 72
           + YL+ +D    L+D G  D+   +        + +  +A T+  VLLSH  + +L   P
Sbjct: 20  ICYLLELDDARILLDMGQRDYRSSTQQGRWDYEEAVRDLAPTLSLVLLSHSSSNYLSLYP 79

Query: 73  YAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD-------------------L 113
           YA  + GL+ PV++T+P   +G +    +  S R     +                   +
Sbjct: 80  YARARWGLTCPVYATQPTVEMGRVVCLAEAESWRSECPVESEGEVAGDDGSKKPFKGPFV 139

Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKI-TKDGEDVIYA 172
            T+++I  AF  +  + YSQ  HL G    +++ P  +GH LGG+++KI +     V+YA
Sbjct: 140 PTVEEIHEAFDWIKAVRYSQPLHLGGDFSHLLLTPFASGHTLGGSLFKIRSPTSGTVLYA 199

Query: 173 VDYNRRKEKHLNGTV---------LESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAIS 222
           V  N   E+HL+G V         ++  +RP +LI +   ++   P R++RE    D I+
Sbjct: 200 VGVNHTSERHLDGMVGVQNGPTGYVDGVLRPDLLIVEGGRSMVINPKRKEREAALIDTIT 259

Query: 223 KTLRAGGNVLLPVDSAGRVLELLLILEDYWA--------EHSLN--------YPIYFLTY 266
            TL +  +VLLPVD + R+LEL+++L+ +W         +   N        YP+  ++ 
Sbjct: 260 STLESNHSVLLPVDPSPRLLELMVLLDQHWTFKRTPKVKQQRYNEPPADLWPYPLCIVSK 319

Query: 267 VSSSTIDYVKSFLEWMGDSITKSF---------ETSRDNAFLL---------KHVTLLIN 308
            +   + + +S ++WMG  +  S          + +R     L         +HV   +N
Sbjct: 320 TAQDMVAFARSLIDWMGGVVKDSAGDMVDVGRGKRARGARMALGSEYGVLDFRHVQFFLN 379

Query: 309 KSE-LDNAP-DGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML-- 364
            ++ L   P   PKLVLA   ++  G S  +F   A+   N+++ T R +  TLAR L  
Sbjct: 380 PTDLLQTYPLTRPKLVLAIPPTMSHGPSRFLFTAMANTEGNVIMLTGRSEEQTLARDLFN 439

Query: 365 ------------------QADPPPKAVKVTMSRRVPLVGEELIAY-EEEQTRLKKEEALK 405
                                     ++V M  +VPL G EL A+ E E+ + +KE A K
Sbjct: 440 RWERSQTVGSKWGEGKIGHLTQLEGKLQVEMDSKVPLSGAELEAHMESERLQKEKEAAHK 499

Query: 406 ASLVKEEE---------SKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILID 456
           A++ +               S    + L+G   V     ANA A   E       DI + 
Sbjct: 500 AAVDRSRRMLEADDLESDSESESEADGLAGGITVRRTEGANAYAGDGEDVRTMSFDIYVK 559

Query: 457 GFVPPSTSVAP-----MFPFYENNS-EWDDFGEVIN------------------------ 486
           G    S   A      MFPF E    + D FGE ++                        
Sbjct: 560 GQQMRSGRGAEMARFRMFPFVERKGRKIDQFGEGLDIGQWMRKGREIAEEGETEEVRDAK 619

Query: 487 ----------------PDDYIIKDEDMDQAAMHIGGDDGKLDEGSA--SLILDAKPSKVV 528
                           P  Y+ ++  ++  AM    D   L +G +  ++I D +P K+ 
Sbjct: 620 KRKEEEEEKAKQAPEPPSKYVSEEVGVELKAMIGFVDMEGLHDGQSIKTIISDLQPRKL- 678

Query: 529 SNELTVLVHGSAEATEHLKQH--CLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMS 586
                ++V  S E+T++L      +      +++P + E I +   + +Y + L + + S
Sbjct: 679 -----IIVRSSKESTQNLISFLGSVTGFTKDIFSPSLTEEIKIGEHVQSYSLTLGDSI-S 732

Query: 587 NVLFKKLGD---YEIAWVDAEV----GKT------------------------------- 608
           + L KK  D   YE+ +VD ++    G T                               
Sbjct: 733 SALAKKWSDFEGYEVTFVDGKIVLPAGSTIPILETPSLVGPLIKTEAEGDEADGESKPSA 792

Query: 609 -ENGMLSLLPIST--PAPPHKSVLVGDLKMADLKPFLS--SKGIQVEFAG-GALRCG--- 659
            E    S  PIS+  P P   S  +GDL++A LK  LS  +  I  EFAG G L CG   
Sbjct: 793 EELAAASTPPISSSAPLPLPTSTFIGDLRLARLKHRLSLLNPPIPAEFAGEGVLVCGPGI 852

Query: 660 --------EYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLY 700
                     V++RK+G            +IV+EG +   Y ++R  LY
Sbjct: 853 AQEAQGAASIVSVRKIGEG----------KIVLEGCIGRVYVEVRKALY 891


>gi|170052069|ref|XP_001862054.1| cleavage and polyadenylation specificity factor subunit 3 [Culex
           quinquefasciatus]
 gi|167873079|gb|EDS36462.1| cleavage and polyadenylation specificity factor subunit 3 [Culex
           quinquefasciatus]
          Length = 615

 Score =  171 bits (432), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 108/369 (29%), Positives = 184/369 (49%), Gaps = 18/369 (4%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           +++TPL    +      L+S+ G N ++DCG    +ND     D S + P   + + ID 
Sbjct: 4   IKITPLGAGQDVGRSCILLSMGGKNIMLDCGMHMGYNDERRFPDFSFIVPEGPITNHIDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G + P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYMTEMVGYTGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTT 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +T  Q+  +  + E   +  + AGH+LG  ++ I    + V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVTLHQSVMVDSELE---IKAYYAGHVLGAAMFWIRVGSQSVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   +P +LIT++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CKPDLLITESTYATTIRDSKRCRERDFLKKVHECVAKGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L YP+YF   ++    +Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETYWERMNLKYPVYFAVGLTEKANNYYKMFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+    +K  +DN   G  +V A+   L AG S  IF +WA +  N+V+     
Sbjct: 298 NMFDFKHIKPF-DKGYIDNP--GAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIMPGYC 354

Query: 356 QFGTLARML 364
             GT+   +
Sbjct: 355 VQGTVGHKI 363


>gi|326426580|gb|EGD72150.1| cleavage and polyadenylation specificity factor subunit 3
           [Salpingoeca sp. ATCC 50818]
          Length = 790

 Score =  171 bits (432), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 111/372 (29%), Positives = 193/372 (51%), Gaps = 21/372 (5%)

Query: 7   VTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQP-LSKVA-STIDAVLLSHPD 64
           +TPL          +++   GF  ++DCG +         P +S++  + ID VL++H  
Sbjct: 53  ITPLGAGQEVGRSCHILKFKGFTIMLDCGIHPGLKGKASLPFVSQIELNKIDLVLITHFH 112

Query: 65  TLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEF-DLFTLDDID 120
             H GALP+ +++   S  VF   +T+ +YR     + + Y+    +S F ++++L+D++
Sbjct: 113 LDHCGALPWLLERSTFSGRVFMTPATKAIYRW----ILEDYVRVSNISNFAEMYSLEDVE 168

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           ++   +  ++Y Q  ++    +G+   P+ AGH+LG  ++ I   G  ++Y  D++R ++
Sbjct: 169 NSLAKIETISYHQETNM----DGVRFTPYCAGHVLGACMFDIEIAGVRLVYTGDFSREED 224

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAG 239
           +HL    +     P +LIT++   +     RQ RE  F   I   +  GG  L+PV + G
Sbjct: 225 RHLMAAEVPPN-SPDILITESTFGVRQHESRQTREHRFTKTIHDVVDRGGRCLIPVFALG 283

Query: 240 RVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA 297
           R  ELLLIL+DYW  H      PIY+ + ++   +   K+++  M +SI K+   S +N 
Sbjct: 284 RAQELLLILDDYWQNHDELHRVPIYYASALARRCMAVYKTYVNVMKESIQKTI--SINNP 341

Query: 298 FLLKHVTLLINKSELDNA-PDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQ 356
           F  +HV+ + N  + D     GP ++LAS   L++G S +IF  WAS+  N VL      
Sbjct: 342 FNFRHVSYIRNLHQFDGEYGGGPCVMLASPGMLQSGLSREIFERWASNKANCVLLAGYVV 401

Query: 357 FGTLARMLQADP 368
            GTLA+ L   P
Sbjct: 402 NGTLAKDLLKAP 413


>gi|452840080|gb|EME42018.1| hypothetical protein DOTSEDRAFT_133466 [Dothistroma septosporum
           NZE10]
          Length = 1101

 Score =  171 bits (432), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 129/407 (31%), Positives = 193/407 (47%), Gaps = 61/407 (14%)

Query: 8   TPLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDT 65
           TPL G  +++P S  L+ +DG    L+D GW++ FD   L  + +  ST+  VLL+HP  
Sbjct: 5   TPLLGAQSDSPASQSLLELDGGVKILVDVGWDETFDAEKLHAIEQHVSTLSIVLLTHPTL 64

Query: 66  LHLGALPYAMKQL-GLSA-PVFSTEPVYRLGLLTMYDQYLSRRQVSE------------- 110
            H+GA  +  K + G S  PV++T PV  LG   + D Y S    +              
Sbjct: 65  DHIGAYAHCCKHIPGFSRIPVYATTPVVNLGRTLLADLYHSAPLTTSIIPTSAILSSPIA 124

Query: 111 ----------FDLFTLDDIDSAFQSVTRLTYSQNYH----LSGKGEG-IVVAPHVAGHLL 155
                     +   T D+I + F ++  L YSQ +      SG G G +V+  + AGH  
Sbjct: 125 ADPHTTPNLLYQHPTPDEIAAYFNAINPLKYSQPHQPIGVASGPGLGNLVITAYSAGHTP 184

Query: 156 GGTVWKITKDGEDVIYAVDYNRRKEKHLNGT--------VLESFVRPAVLITDAYNALHN 207
           GGT+W I    E ++YA D+N+ +E  L+G         ++E   RP  L+  +      
Sbjct: 185 GGTIWHIQHGLESIVYAADWNQGRENLLSGAAWLGTSSEIIEPLRRPTALVCSSKGVQKT 244

Query: 208 QP-PRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE-----HSLNY- 259
              PR++R E+    I +T+  GG VL+P DS+ RVLEL  IL   W E     H+  Y 
Sbjct: 245 DTLPRKKRDELLVSLIRETVAQGGKVLIPTDSSARVLELAFILNHTWRENITGPHADTYR 304

Query: 260 --PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLIN--------- 308
              I+  +  S+ST+  +   LEWM D+I +  E +       K +  +++         
Sbjct: 305 HARIFMASKSSTSTMRQLHGMLEWMDDAIQRHAEAAMGQGGDDKKIPSMLDWRFVKQIER 364

Query: 309 KSELDNA--PDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
           KS+LD       P ++LAS ASLE G S       A D +NLV+ TE
Sbjct: 365 KSQLDKVLQRQNPCIILASDASLEWGLSQHALKALAGDARNLVILTE 411


>gi|358394479|gb|EHK43872.1| hypothetical protein TRIATDRAFT_79096 [Trichoderma atroviride IMI
           206040]
          Length = 957

 Score =  170 bits (430), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 169/631 (26%), Positives = 262/631 (41%), Gaps = 128/631 (20%)

Query: 9   PLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTL 66
           PL G  +E+  S  L+ +DG    L+D GW++ F    L+ L K   T+  +LL+H  T 
Sbjct: 6   PLQGALSESLASQSLLELDGGVKVLVDLGWDESFSSEKLEELEKQVPTLSLILLTHATTS 65

Query: 67  HLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLSR-------RQVSEFDLF--- 114
           HL A  +  K + L    PV++T PV  LG     D Y S        RQ S  +     
Sbjct: 66  HLAAYAHCCKNIALFTRIPVYATRPVIDLGRTLTQDLYSSTPAAATTIRQSSLSETTYAY 125

Query: 115 ---------------TLDDIDSAFQSVTRLTYSQNYH-----LSGKGEGIVVAPHVAGHL 154
                          T ++I   F  +  L YSQ +       S    G+ +  + +GH 
Sbjct: 126 SQTATTAQNLLLQSPTPEEIARYFSLIQPLKYSQPHQPLSSPFSPPLNGLTITAYNSGHT 185

Query: 155 LGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAY 202
           LGGT+W I    E ++YAVD+N+ +E    G             V+E   +P  LI  + 
Sbjct: 186 LGGTIWHIQHGLESIVYAVDWNQARENVFAGAAWLGGAGGGGAEVIEQLRKPTALICSSR 245

Query: 203 NALHNQPP--RQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNY 259
            A  +     R +R E   D I   +  GG VL+PVDS+ RVLE+  +LE+ W   + N 
Sbjct: 246 GADKSAQAGGRAKRDEHLIDMIKSCVSRGGTVLIPVDSSARVLEISYLLENAWRTDAANR 305

Query: 260 -------PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET-------------SRDNA-F 298
                   +Y      SST+ Y +S LEWM ++I + FE               ++ A F
Sbjct: 306 DGVLKFSKLYLAGRNVSSTMRYARSMLEWMDNNIVQEFEAFAEGQRKTNGGSEKKEGAPF 365

Query: 299 LLKHVTLLINKSE--------LDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
             K++ LL  K++        ++N     +++LAS  S++ GFS D+    A D +NLV+
Sbjct: 366 DFKYLRLLERKAQIAKLLSQSIENGETQGRVILASDVSMDWGFSKDLIKGLAKDTRNLVI 425

Query: 351 FTERGQFG-----TLARML------QAD-----------------PPPKAVKVTMSRRVP 382
            TER         +++RM+      + D                    + ++V  +RR P
Sbjct: 426 LTERPSLANTDAPSISRMMWEWWKERRDGISTEHASNGDSLETIYSGGRELEVREARREP 485

Query: 383 LVGEELIAYEEE-QTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANASAD 441
           L G+EL  Y++   T+ + +   +A      E+ A +  D +        +       A 
Sbjct: 486 LEGDELAIYQQWLATQRQLQATQQAGGAGALEASADVVDDASSESSSDSEEEGEQQGKAL 545

Query: 442 VVEPHGGRY---------RDILIDGFVPPST----------SVAPMFPFYENNSEWDDFG 482
            V    G+           D+ I+  +   T               FP        DDFG
Sbjct: 546 NVSATMGQAGRKNVVLKDEDLGINILIKKKTVFDFDTRGKRGRERSFPMAIRRKRHDDFG 605

Query: 483 EVINPDDYIIKDEDMDQAA--MHIGGDDGKL 511
           E+I P+DY+  +E  D AA    I  +D KL
Sbjct: 606 ELIRPEDYLRAEEKEDDAADGAQIAAEDEKL 636


>gi|405120276|gb|AFR95047.1| cleavage and polyadenylation specificity factor subunit
           [Cryptococcus neoformans var. grubii H99]
          Length = 899

 Score =  170 bits (430), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 223/907 (24%), Positives = 371/907 (40%), Gaps = 229/907 (25%)

Query: 5   VQVTPLSGVFNEN----PLSYLVSIDGFNFLIDCGWNDHFDPSLL------QPLSKVAST 54
           + +TPLS    E     P+ YL+ +D    L+D G  D+   +        + +  +A T
Sbjct: 2   ITLTPLSASAAETSPSEPICYLLELDDARILLDMGQRDYRASAQQSSWDYEEAVRDLAPT 61

Query: 55  IDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRR-------- 106
           +  VLLSH  + +L   PYA  + GL+ PV++T+P   +G +    +  S R        
Sbjct: 62  LSLVLLSHSSSNYLSLYPYARARWGLTCPVYATQPTVEMGRVVCLAEAESWRAECPVESE 121

Query: 107 QVSEFD----------LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLG 156
            V+E D          + T++++  AF  +  + YSQ  HL G    +++ P  +GH LG
Sbjct: 122 DVAEDDGSKKPLKGPFVPTVEEVHEAFDWIKAVRYSQPLHLGGDFSHLLLTPFASGHTLG 181

Query: 157 GTVWKI-TKDGEDVIYAVDYNRRKEKHLNGTV---------LESFVRPAVLITDAYNALH 206
           G+++KI +     V+YAV  N   E+HL+G V          +  +RP +LI +   ++ 
Sbjct: 182 GSLFKIRSPTSGTVLYAVGVNHTSERHLDGMVGVQNGPTGYADGVLRPDLLIAEGGRSMV 241

Query: 207 NQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA--------EHSL 257
             P R++RE    D I+ TL +  +VLLPVD + R+LEL+++L+ +W         +   
Sbjct: 242 VNPKRKEREAALIDTITSTLESNHSVLLPVDPSPRLLELMILLDQHWTFKRTPKVKQQRY 301

Query: 258 N--------YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF---------ETSRDNAFLL 300
           N        YP+  ++  +   + + +S ++WMG  +  S          + +R     L
Sbjct: 302 NEPPADLWPYPLCIVSKTAQDMVAFARSLIDWMGGVVKDSAGDMVDVGRGKRARGARMAL 361

Query: 301 ---------KHVTLLINKSE-LDNAP-DGPKLVLASMASLEAGFSHDIFVEWASDVKNLV 349
                    +HV   +N ++ L   P   PKLVLA   ++  G S  +F   A+   N++
Sbjct: 362 GSEYGVLDFRHVLFFLNTTDLLQTYPLTRPKLVLAVPPTMSHGPSRFLFTAMANTEGNVI 421

Query: 350 LFTERGQFGTLARML--QADPPPKA------------------VKVTMSRRVPLVGEELI 389
           + T R +  TLAR L  + +    A                  ++V +  +VPL G EL 
Sbjct: 422 MLTGRSEEQTLARDLYNRWERSQTAGSKWGEGKIGHLTRLEGKLQVEVDSKVPLSGAELE 481

Query: 390 AY-EEEQTRLKKEEALKASLVK----------EEESKASLGPDNNLSGDPMVIDANNANA 438
           A+ E E+ + +KE A KA++ +          E +S +    D + +G   V     ANA
Sbjct: 482 AHVESERLQKEKEAAHKAAVDRSRRMLEADDLESDSDSESEADGH-TGGITVRRTEGANA 540

Query: 439 SADVVEPHGGRYRDILIDGFVPPSTSVAP-----MFPFYENNS-EWDDFGEVIN------ 486
            A   E       DI + G    S   A      MFPF E    + D FGE ++      
Sbjct: 541 YAGDGEDVRTMSFDIYVKGQQMRSGRGAEMARFRMFPFVERKGRKIDQFGEGLDIGQWMR 600

Query: 487 ----------------------------------PDDYIIKDEDMDQAAMHIGGDDGKLD 512
                                             P  Y+ +   ++  AM    D   L 
Sbjct: 601 KGREIAEEGETEEVREAKKRKEEEEEKAKQAPEPPSKYVSEKVGVEMKAMIGFVDMEGLH 660

Query: 513 EGSA--SLILDAKPSKVVSNELTVLVHGSAEATEHLKQH--CLKHVCPHVYTPQIEETID 568
           +G +  ++I D +P K+      ++V  S E+T  L             +++P + E I 
Sbjct: 661 DGQSIKTIISDLQPRKL------IIVRSSKESTRDLISFLGSATGFTKEIFSPSLTEEIK 714

Query: 569 VTSDLCAYKVQLSEKLMSNVLFKKLGD---YEIAWVDAEVGKTENGMLSLLPIST----- 620
           +   + +Y + L + + S+ L KK  D   YE+ +VD ++       + +L   +     
Sbjct: 715 IGEHVQSYSLTLGDSI-SSALAKKWSDFEGYEVTFVDGKIVLPAGSTIPILETPSLVGPL 773

Query: 621 ---------------------------------PAPPHKSVLVGDLKMADLKPFLS--SK 645
                                            P P   S  +GDL++A LK  LS  + 
Sbjct: 774 VKTEAEGDDAEDEAKPSAEELAAASASPISSSVPLPLPTSTFIGDLRLARLKHRLSLLNP 833

Query: 646 GIQVEFAG-GALRCG-----------EYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYY 693
            I  EFAG G L CG             V++RK+G            +IV+EG +   Y 
Sbjct: 834 PIPAEFAGEGVLVCGPGIAQEAQGAASVVSVRKIGEG----------KIVLEGCIGRVYV 883

Query: 694 KIRAYLY 700
           ++R  LY
Sbjct: 884 EVRKALY 890


>gi|398396344|ref|XP_003851630.1| hypothetical protein MYCGRDRAFT_109995 [Zymoseptoria tritici
           IPO323]
 gi|339471510|gb|EGP86606.1| hypothetical protein MYCGRDRAFT_109995 [Zymoseptoria tritici
           IPO323]
          Length = 1108

 Score =  169 bits (429), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 130/420 (30%), Positives = 190/420 (45%), Gaps = 67/420 (15%)

Query: 8   TPLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDT 65
           T L G  +++P S  L+ +DG    L+D GW++ FD   LQ L K  ST+  +LL+H   
Sbjct: 5   TALLGAQSDSPASQSLLELDGGVKLLVDVGWDETFDAEKLQTLEKHVSTLSVILLTHATV 64

Query: 66  LHLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLSRRQVSE------------- 110
            H+GA  +  K +      PV++T PV  LG   + D Y S    +              
Sbjct: 65  EHIGAYAHCCKHIPAFNKIPVYATTPVINLGRTLIADIYASSPLAASVIPTSSISSSPVA 124

Query: 111 ----------FDLFTLDDIDSAFQSVTRLTYSQNYH-----LSGKGEGIVVAPHVAGHLL 155
                     F   T D+I S F  +  L YSQ +       S     + +  + AGH +
Sbjct: 125 LAPESTPNLLFQPPTADEIASYFNLIHPLKYSQPHQPIPSPWSPSLGNLTITAYSAGHTI 184

Query: 156 GGTVWKITKDGEDVIYAVDYNRRKEKHLNGT-----------VLESFVRPAVLITDAYNA 204
           GGT+W I    E ++YA D+N+ +E  L+G            ++E+  RP  LI  +   
Sbjct: 185 GGTIWHIQHSMESIVYAADWNQGRENLLSGAAWLGSTSGGAEIIEALRRPTALICSSKGV 244

Query: 205 LHNQP-PRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYP-- 260
                 PR++R E     I  T+  GG VL+P DS+ RVLEL  +L   W E+ +N P  
Sbjct: 245 EKTDTMPRKKRDETLVGLIRDTIAQGGKVLIPTDSSARVLELAFVLNQNWKEN-INGPHA 303

Query: 261 -------IYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL----------LKHV 303
                  IY  +  SSST+  ++  LEW+ +SI +  E +     +           + V
Sbjct: 304 DTYRHAKIYMASKTSSSTVRQLQGMLEWLDESIIRDAEVAMGQQQVENQKVPTLLDWRFV 363

Query: 304 TLLINKSELDNA--PDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLA 361
             +  KS+ D A     P ++LAS ASLE GFS       ASD +NLV+ TE    G  A
Sbjct: 364 KQIERKSQFDRALKRSSPCILLASDASLEWGFSRSALESLASDSRNLVVLTETVSHGKSA 423


>gi|417403203|gb|JAA48419.1| Putative mrna cleavage and polyadenylation factor ii complex brr5
           cpsf subunit [Desmodus rotundus]
          Length = 603

 Score =  169 bits (429), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIGGKNVMLDCGMHMGFNDDRRFPDFSYITRNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +    E + +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVD---EELQIKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPTLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|388853919|emb|CCF52417.1| uncharacterized protein [Ustilago hordei]
          Length = 1033

 Score =  169 bits (428), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 141/510 (27%), Positives = 227/510 (44%), Gaps = 126/510 (24%)

Query: 15  NENP--LSYLVSIDGFNFLIDCGWNDHF------------------------------DP 42
            E+P  L+YL+ +D    LIDCG  + F                              DP
Sbjct: 31  QEHPRALAYLLQMDDVRVLIDCGSPEDFVFSNSVSASTSDNHDGKAESSSMAQQREASDP 90

Query: 43  S------------LLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPV 90
           +            L   L ++A TID VLLSH    HLG   YA  +LGL   V++T PV
Sbjct: 91  TASFDLDQLKAAPLDTLLRQLAPTIDLVLLSHSSLDHLGLFAYAHAKLGLRCQVYATMPV 150

Query: 91  YRLGLLTMYDQYLSRRQVSEFD---------------LFTLDDIDSAFQSVTRLTYSQNY 135
             +G LT+ +   + R  SE D               L T ++++ AF+ +  + Y Q  
Sbjct: 151 QSMGKLTVLEAIQTWR--SEVDIEKESSSSSFNTHRCLPTANEVEDAFEEIKTVRYMQPT 208

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKI-TKDGEDVIYAVDYNRRKEKHLNGTVL------ 188
           HL GK   + +  + AGH LGG +WKI +     V+ A+D+N  +E+HL+GT+L      
Sbjct: 209 HLEGKCASLTLTAYNAGHSLGGAIWKIRSPTSGTVVVALDWNHNRERHLDGTILLSSSAA 268

Query: 189 -----------ESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVD 236
                      ++  RP +LIT+    L     R+ R+    D +  T++AG ++L P+D
Sbjct: 269 APGAPGSGSGSDAVRRPDLLITEIERGLVTNTRRKDRDAALIDLVHTTIQAGNSLLFPID 328

Query: 237 SAGRVLELLLILEDYWA---EHSLNYPIYFLTYVSSSTIDYVKSFLEWMG-DSITKSFET 292
           ++ R+LEL+++L+ +WA    H+  +P+  ++      I+  ++++EWM  +  TK+ ET
Sbjct: 329 ASARLLELMVLLDQHWAYAYPHA-RFPLCLISRTGKEVIERSRTYMEWMTREWATKANET 387

Query: 293 SRDNA------------------FLLKHVTLLINKSELDNA--PDGPKLVLASMASLEAG 332
              N                      K+V +  +   +D A   D  K+VLA   S+  G
Sbjct: 388 IEANQDKSKPPNRGNRSAAASSPLDFKYVKVYSSLQAMDEAIPQDQAKVVLAVPPSMTHG 447

Query: 333 FSHDIFVEWASDVKNLVLFTERGQFGTLARML--------------------QADPPPKA 372
            S  +   +A +  ++V+   RG+ G+L R L                    +A  P   
Sbjct: 448 PSRRLLARFAKNPNDVVVLISRGEPGSLCRQLWDAWNTNQGKGFAWAQGKLGEAVTPNTR 507

Query: 373 VKVTMSRRVPLVGEELIAY-EEEQTRLKKE 401
           V+  +  RVPL GEEL A+ E EQ    ++
Sbjct: 508 VRFELKSRVPLEGEELRAHLEAEQAERDRQ 537


>gi|350288464|gb|EGZ69700.1| hypothetical protein NEUTE2DRAFT_152270 [Neurospora tetrasperma
           FGSC 2509]
          Length = 1070

 Score =  169 bits (428), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 170/635 (26%), Positives = 252/635 (39%), Gaps = 146/635 (22%)

Query: 9   PLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTL 66
           PL G  +++  S  L+ +DG    LID GW++ FD   L+ L K A T+  +LL+H    
Sbjct: 74  PLQGALSDSSASQSLLELDGGVKILIDVGWDETFDVEKLKELGKQAPTLSLILLTHATVP 133

Query: 67  HLGALPYAMKQLG--LSAPVFSTEPVYRLGLLTMYDQYLS-------------------- 104
           HL A  +  K        PV++T PV  LG     D Y S                    
Sbjct: 134 HLAAYAHCCKHFPPFQRIPVYATRPVIDLGRTLTQDLYASTPLAATTISSASLAEVSYAS 193

Query: 105 --RRQVSEFDLFTL-----DDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAG 152
              +  S  + F L     ++I   F  +  L YSQ +            G+ +  + +G
Sbjct: 194 GYSQAASAENTFLLQPPTPEEITKYFSLIQPLKYSQPHQPLPSPFSPPLNGLTITAYNSG 253

Query: 153 HLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT--------------VLESFVRPAVLI 198
             LGGT+W I    E ++YAVD+N+ +E    G               V+E   +P  L+
Sbjct: 254 RTLGGTIWHIQHGLESIVYAVDWNQARENVFAGAAWLGGNHGGAGGTQVIEQLRKPTALV 313

Query: 199 TDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL- 257
             +       P  ++ E   ++I   +  GG VL+PVDS+ RVLEL  +LE  W +    
Sbjct: 314 CSSRTPDAALPRAKRDEQLMESIKLCIARGGTVLIPVDSSARVLELSYLLEHAWRKEVAK 373

Query: 258 ------NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET----SRDN----------- 296
                 +  ++      SST+   +S LEWM DSI + FE     SR N           
Sbjct: 374 DNDVFKSAKLFLAGRTISSTMKNARSMLEWMDDSIIREFEAFADESRRNNRRDEGNHQTG 433

Query: 297 --AFLLKHVTLLINKSEL-------DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKN 347
              F  K++ LL  K+++       D+A    K++LAS  SL+ GFS DI    A+D +N
Sbjct: 434 PGPFDFKYLRLLERKAQIDKILQQSDDAEPRAKVILASDTSLDWGFSKDILKSIAADARN 493

Query: 348 LVLFTER-----GQFGTLARML-----------------------QADPPPKAVKVTMSR 379
           LV+ TE+      Q  +++R L                       Q     + +++  + 
Sbjct: 494 LVILTEKPNLEPNQKPSISRTLWEWWKERRDGVATERTSNGDTFEQVYAGNRELEIETAE 553

Query: 380 RVPLVGEELIAYEEEQTRLKKEEALKASL-----------------VKEEESKASLGPDN 422
           R  L G+EL  Y   Q  L  +  L+A+L                    +    S G D 
Sbjct: 554 RKGLEGDELNVY---QQWLATQRQLQATLQSGGTNLLEAPGDVLDDADSDTDSESEGSDT 610

Query: 423 NLSGDPMVIDANNANASADVVEPHGGRYRD------ILI------DGFVPPSTSVAPMFP 470
              G  + I    A AS   V       RD      ILI      D  V  +     MFP
Sbjct: 611 EQQGKALNIANTMAQASRKKVV-----LRDEDLGVTILIKKENVYDFNVRGTKGRDRMFP 665

Query: 471 FYENNSEWDDFGEVINPDDYIIKDEDMDQAAMHIG 505
                   D+FGE+I P+DY+  +E  D      G
Sbjct: 666 VAMRRRRADEFGELIRPEDYLRAEEREDAENQEAG 700


>gi|441671688|ref|XP_004093259.1| PREDICTED: LOW QUALITY PROTEIN: integrator complex subunit 11
           [Nomascus leucogenys]
          Length = 585

 Score =  169 bits (428), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 XALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|149024842|gb|EDL81339.1| similar to RIKEN cDNA 2410006F12 [Rattus norvegicus]
          Length = 601

 Score =  169 bits (428), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 112/360 (31%), Positives = 181/360 (50%), Gaps = 18/360 (5%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVAS 53
           M   ++VTPL    +      LVSI G N ++DCG    +ND   F D S +    ++  
Sbjct: 1   MMPEIRVTPLGAGQDVGRSCILVSISGKNVMLDCGMHMGYNDDRRFPDFSYITQSGRLTD 60

Query: 54  TIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFD 112
            +D V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E +
Sbjct: 61  FLDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEAN 120

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
            FT   I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y 
Sbjct: 121 FFTSQMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYT 177

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNV 231
            DYN   ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG V
Sbjct: 178 GDYNMTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKV 236

Query: 232 LLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE 291
           L+PV + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F 
Sbjct: 237 LIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFITWTNQKIRKTF- 295

Query: 292 TSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
             + N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 296 -VQRNMFEFKHIKAF-DRTFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 351


>gi|365990355|ref|XP_003672007.1| hypothetical protein NDAI_0I01950 [Naumovozyma dairenensis CBS 421]
 gi|343770781|emb|CCD26764.1| hypothetical protein NDAI_0I01950 [Naumovozyma dairenensis CBS 421]
          Length = 757

 Score =  169 bits (428), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 112/371 (30%), Positives = 189/371 (50%), Gaps = 23/371 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLSRRQVS 109
           ST+D +L+SH    H  +LPY M++   +  VF T P   +YR  LL  + +  S    S
Sbjct: 25  STVDVLLISHFHLDHAASLPYVMQKTNFNGRVFMTHPTKAIYRW-LLRDFVRVTSIGVNS 83

Query: 110 EFD----LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKD 165
             D    L+T +D+  +F  +  +    +YH +    GI      AGH+LG  +++I   
Sbjct: 84  PLDREENLYTNEDLVESFDKIETV----DYHSTIDVNGIKFTAFHAGHVLGAAMFQIEIA 139

Query: 166 GEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTL 225
           G  V++  DY+R K++HLN   +       +++   +    ++P   + +     I  T+
Sbjct: 140 GMRVLFTGDYSREKDRHLNSAEVPPLSSNILIVESTFGTATHEPRLNREKKLTQMIHHTV 199

Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWAEHSL-----NYPIYFLTYVSSSTIDYVKSFLE 280
             GG VL+PV + GR  EL+LIL++YWA+H+        PIY+ + ++   +   ++++ 
Sbjct: 200 SHGGRVLMPVFALGRAQELMLILDEYWAQHAEELGDGQVPIYYASNLARKCMSVFQTYVN 259

Query: 281 WMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVE 340
            M D I K F  S+ N F+ K+++ L N  E  +   GP ++LAS   L++G S D+   
Sbjct: 260 MMNDDIRKKFRDSQTNPFIFKNISYLKNLEEFQDL--GPSVMLASPGMLQSGLSRDLLER 317

Query: 341 WASDVKNLVLFTERGQFGTLAR--MLQADPPPKAV--KVTMSRRVPLVGEELIAYEEEQT 396
           W  D KNLVL T     GT+A+  ML+ D  P     +VT++RR  +      A+ + Q 
Sbjct: 318 WCPDEKNLVLITGYSIEGTMAKYLMLEPDTIPSVNNPEVTVARRCNIEEISFAAHVDFQE 377

Query: 397 RLKKEEALKAS 407
            L+  + + A+
Sbjct: 378 NLEFIQKINAT 388


>gi|312381513|gb|EFR27247.1| hypothetical protein AND_06171 [Anopheles darlingi]
          Length = 624

 Score =  169 bits (428), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 106/356 (29%), Positives = 180/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           +++TPL    +      L+S+ G N ++DCG    +ND     D S + P   + + ID 
Sbjct: 4   IKITPLGAGQDVGRSCILLSMGGKNIMLDCGMHMGYNDERRFPDFSFIIPEGPITNHIDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G + P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYMTEMVGYTGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTP 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +T  Q+  +  + E   +  + AGH+LG  ++ I    + V+Y  DYN
Sbjct: 124 QMIKDCMKKVIAVTLHQSVMVDSELE---IKAYYAGHVLGAAMFWIRVGSQSVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   +P +LIT++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CKPDLLITESTYATTIRDSKRCRERDFLKKVHECVAKGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L YP+YF   ++    +Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETYWERMNLKYPVYFAVGLTEKANNYYKMFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +K  +DN   G  +V A+   L AG S  IF +WA +  N+V+ 
Sbjct: 298 NMFDFKHIKPF-DKGYIDNP--GAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350


>gi|66472504|ref|NP_001018457.1| integrator complex subunit 11 [Danio rerio]
 gi|82192739|sp|Q503E1.1|INT11_DANRE RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
           Full=Cleavage and polyadenylation-specific factor 3-like
           protein; Short=CPSF3-like protein
 gi|63102425|gb|AAH95364.1| Zgc:110671 [Danio rerio]
          Length = 598

 Score =  169 bits (427), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 110/356 (30%), Positives = 177/356 (49%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IKVTPLGAGQDVGRSCILVSIGGKNIMLDCGMHMGFNDDRRFPDFSYITQNGRLTEFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYMSEMVGYDGPIYMTHPTKAICPILLEDFRKITVDKKGETNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  L   Q   +  + E   +  + AGH+LG  + +I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVPLNLHQTVQVDDELE---IKAYYAGHVLGAAMVQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LI+++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDILISESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    ++S  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRSYADNP--GPMVVFATPGMLHAGQSLQIFKKWAGNEKNMVIM 350


>gi|158298905|ref|XP_319042.4| AGAP009923-PA [Anopheles gambiae str. PEST]
 gi|157014111|gb|EAA13845.4| AGAP009923-PA [Anopheles gambiae str. PEST]
          Length = 608

 Score =  169 bits (427), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 108/366 (29%), Positives = 183/366 (50%), Gaps = 18/366 (4%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           +++TPL    +      L+S+ G N ++DCG    +ND     D S + P   + + ID 
Sbjct: 4   IKITPLGAGQDVGRSCILLSMAGKNIMLDCGMHMGYNDERRFPDFSFIIPEGPITNHIDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G + P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYMTEMVGYTGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTP 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +T  Q+  +  + E   +  + AGH+LG  ++ I    + V+Y  DYN
Sbjct: 124 QMIKDCMKKVIAVTLHQSVMVDSELE---IKAYYAGHVLGAAMFWIRVGSQSVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   +P +LIT++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CKPDLLITESTYATTIRDSKRCRERDFLKKVHECVAKGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L YP+YF   ++    +Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETYWERMNLKYPVYFAVGLTEKANNYYKMFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+    +K  +DN   G  +V A+   L AG S  IF +WA +  N+V+     
Sbjct: 298 NMFDFKHIKPF-DKGYIDNP--GAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIMPGYC 354

Query: 356 QFGTLA 361
             GT+ 
Sbjct: 355 VQGTVG 360


>gi|164424681|ref|XP_958078.2| hypothetical protein NCU06869 [Neurospora crassa OR74A]
 gi|157070616|gb|EAA28842.2| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 986

 Score =  169 bits (427), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 170/635 (26%), Positives = 252/635 (39%), Gaps = 146/635 (22%)

Query: 9   PLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTL 66
           PL G  +++  S  L+ +DG    LID GW++ FD   L+ L K A T+  +LL+H    
Sbjct: 6   PLQGALSDSSASQSLLELDGGVKILIDVGWDETFDVEKLKELGKQAPTLSLILLTHATVP 65

Query: 67  HLGALPYAMKQLG--LSAPVFSTEPVYRLGLLTMYDQYLS-------------------- 104
           HL A  +  K        PV++T PV  LG     D Y S                    
Sbjct: 66  HLAAYAHCCKHFPPFQRIPVYATRPVIDLGRTLTQDLYASTPLAATTISSASLAEVSYAS 125

Query: 105 --RRQVSEFDLFTL-----DDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAG 152
              +  S  + F L     ++I   F  +  L YSQ +            G+ +  + +G
Sbjct: 126 GYSQAASAENTFLLQPPTPEEITKYFSLIQPLKYSQPHQPLPSPFSPPLNGLTITAYNSG 185

Query: 153 HLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT--------------VLESFVRPAVLI 198
             LGGT+W I    E ++YAVD+N+ +E    G               V+E   +P  L+
Sbjct: 186 RTLGGTIWHIQHGLESIVYAVDWNQARENVFAGAAWLGGNHGGAGGTQVIEQLRKPTALV 245

Query: 199 TDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL- 257
             +       P  ++ E   ++I   +  GG VL+PVDS+ RVLEL  +LE  W +    
Sbjct: 246 CSSRTPDAALPRAKRDEQLMESIKLCIARGGTVLIPVDSSARVLELSYLLEHAWRKEVAK 305

Query: 258 ------NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET----SRDN----------- 296
                 +  ++      SST+   +S LEWM DSI + FE     SR N           
Sbjct: 306 DNDVFKSAKLFLAGRTISSTMKNARSMLEWMDDSIIREFEAFADESRRNNRRDEGNHQTG 365

Query: 297 --AFLLKHVTLLINKSEL-------DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKN 347
              F  K++ LL  K+++       D+A    K++LAS  SL+ GFS DI    A+D +N
Sbjct: 366 PGPFDFKYLRLLERKAQIDKILQQSDDAEPRAKVILASDTSLDWGFSKDILKSIAADARN 425

Query: 348 LVLFTER-----GQFGTLARML-----------------------QADPPPKAVKVTMSR 379
           LV+ TE+      Q  +++R L                       Q     + +++  + 
Sbjct: 426 LVILTEKPNLEPNQKPSISRTLWEWWKERRDGVATERTSNGDTFEQVYAGNRELEIETAE 485

Query: 380 RVPLVGEELIAYEEEQTRLKKEEALKASL-----------------VKEEESKASLGPDN 422
           R  L G+EL  Y   Q  L  +  L+A+L                    +    S G D 
Sbjct: 486 RKGLEGDELNVY---QQWLATQRQLQATLQSGGTNLLEAPGDVLDDADSDTDSESEGSDT 542

Query: 423 NLSGDPMVIDANNANASADVVEPHGGRYRD------ILI------DGFVPPSTSVAPMFP 470
              G  + I    A AS   V       RD      ILI      D  V  +     MFP
Sbjct: 543 EQQGKALNIANTMAQASRKKVV-----LRDEDLGVTILIKKENVYDFNVRGTKGRDRMFP 597

Query: 471 FYENNSEWDDFGEVINPDDYIIKDEDMDQAAMHIG 505
                   D+FGE+I P+DY+  +E  D      G
Sbjct: 598 VAMRRRRADEFGELIRPEDYLRAEEREDAENQEAG 632


>gi|351697497|gb|EHB00416.1| Integrator complex subunit 11 [Heterocephalus glaber]
          Length = 672

 Score =  169 bits (427), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 77  IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 136

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 137 VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 196

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 197 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 253

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 254 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 312

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 313 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 370

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 371 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 423


>gi|195394529|ref|XP_002055895.1| GJ10637 [Drosophila virilis]
 gi|194142604|gb|EDW59007.1| GJ10637 [Drosophila virilis]
          Length = 597

 Score =  169 bits (427), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           +++TPL    +      L+S+ G N ++DCG    +ND     D S + P   + S ID 
Sbjct: 4   IKITPLGAGQDVGRSCLLLSMGGKNIMLDCGMHMGYNDERRFPDFSYIVPEGPITSHIDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G + P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYMSEIVGYTGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTT 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +T  Q+  +    E   +  + AGH+LG  ++ I    + V+Y  DYN
Sbjct: 124 QMIKDCMKKVIPVTLHQSMMVDTDLE---IKAYYAGHVLGAAMFWIKVGSQSVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDFLKKVHECVSKGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L YPIYF   ++     Y K F+ W    I K+F     
Sbjct: 240 FALGRAQELCILLETYWERMNLKYPIYFALGLTEKANTYYKMFITWTNQKIRKTF--VHR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +K+ +DN   G  +V A+   L AG S  IF +WA +  N+V+ 
Sbjct: 298 NMFDFKHIKPF-DKAYIDNP--GAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350


>gi|354495797|ref|XP_003510015.1| PREDICTED: integrator complex subunit 11-like [Cricetulus griseus]
 gi|344251677|gb|EGW07781.1| Integrator complex subunit 11 [Cricetulus griseus]
          Length = 600

 Score =  169 bits (427), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 111/356 (31%), Positives = 180/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG    +ND   F D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSISGKNVMLDCGMHMGYNDDRRFPDFSYITQSGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRTFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|336466927|gb|EGO55091.1| hypothetical protein NEUTE1DRAFT_130968 [Neurospora tetrasperma
           FGSC 2508]
          Length = 1051

 Score =  169 bits (427), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 170/635 (26%), Positives = 252/635 (39%), Gaps = 146/635 (22%)

Query: 9   PLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTL 66
           PL G  +++  S  L+ +DG    LID GW++ FD   L+ L K A T+  +LL+H    
Sbjct: 55  PLQGALSDSSASQSLLELDGGVKILIDVGWDETFDVEKLKELGKQAPTLSLILLTHATVP 114

Query: 67  HLGALPYAMKQLG--LSAPVFSTEPVYRLGLLTMYDQYLS-------------------- 104
           HL A  +  K        PV++T PV  LG     D Y S                    
Sbjct: 115 HLAAYAHCCKHFPPFQRIPVYATRPVIDLGRTLTQDLYASTPLAATTISSASLAEVSYAS 174

Query: 105 --RRQVSEFDLFTL-----DDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAG 152
              +  S  + F L     ++I   F  +  L YSQ +            G+ +  + +G
Sbjct: 175 GYSQAASAENTFLLQPPTPEEITKYFSLIQPLKYSQPHQPLPSPFSPPLNGLTITAYNSG 234

Query: 153 HLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT--------------VLESFVRPAVLI 198
             LGGT+W I    E ++YAVD+N+ +E    G               V+E   +P  L+
Sbjct: 235 RTLGGTIWHIQHGLESIVYAVDWNQARENVFAGAAWLGGNHGGAGGTQVIEQLRKPTALV 294

Query: 199 TDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL- 257
             +       P  ++ E   ++I   +  GG VL+PVDS+ RVLEL  +LE  W +    
Sbjct: 295 CSSRTPDAALPRAKRDEQLMESIKLCIARGGTVLIPVDSSARVLELSYLLEHAWRKEVAK 354

Query: 258 ------NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET----SRDN----------- 296
                 +  ++      SST+   +S LEWM DSI + FE     SR N           
Sbjct: 355 DNDVFKSAKLFLAGRTISSTMKNARSMLEWMDDSIIREFEAFADESRRNNRRDEGNHQTG 414

Query: 297 --AFLLKHVTLLINKSEL-------DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKN 347
              F  K++ LL  K+++       D+A    K++LAS  SL+ GFS DI    A+D +N
Sbjct: 415 PGPFDFKYLRLLERKAQIDKILQQSDDAEPRAKVILASDTSLDWGFSKDILKSIAADARN 474

Query: 348 LVLFTER-----GQFGTLARML-----------------------QADPPPKAVKVTMSR 379
           LV+ TE+      Q  +++R L                       Q     + +++  + 
Sbjct: 475 LVILTEKPNLEPNQKPSISRTLWEWWKERRDGVATERTSNGDTFEQVYAGNRELEIETAE 534

Query: 380 RVPLVGEELIAYEEEQTRLKKEEALKASL-----------------VKEEESKASLGPDN 422
           R  L G+EL  Y   Q  L  +  L+A+L                    +    S G D 
Sbjct: 535 RKGLEGDELNVY---QQWLATQRQLQATLQSGGTNLLEAPGDVLDDADSDTDSESEGSDT 591

Query: 423 NLSGDPMVIDANNANASADVVEPHGGRYRD------ILI------DGFVPPSTSVAPMFP 470
              G  + I    A AS   V       RD      ILI      D  V  +     MFP
Sbjct: 592 EQQGKALNIANTMAQASRKKVV-----LRDEDLGVTILIKKENVYDFNVRGTKGRDRMFP 646

Query: 471 FYENNSEWDDFGEVINPDDYIIKDEDMDQAAMHIG 505
                   D+FGE+I P+DY+  +E  D      G
Sbjct: 647 VAMRRRRADEFGELIRPEDYLRAEEREDAENQEAG 681


>gi|395840791|ref|XP_003793235.1| PREDICTED: integrator complex subunit 11 isoform 1 [Otolemur
           garnettii]
          Length = 600

 Score =  168 bits (426), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFSDDRRFPDFSYITQSGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|336261956|ref|XP_003345764.1| hypothetical protein SMAC_05921 [Sordaria macrospora k-hell]
 gi|380090100|emb|CCC12183.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 1003

 Score =  168 bits (426), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 171/636 (26%), Positives = 252/636 (39%), Gaps = 148/636 (23%)

Query: 9   PLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTL 66
           PL G  +++  S  L+ +DG    LID GW++ FD   L+ L ++A T+  +LL+H    
Sbjct: 6   PLQGALSDSSASQSLLELDGGVKILIDVGWDETFDVEKLRELGRIAPTLSLILLTHATVP 65

Query: 67  HLGALPYAMKQLG--LSAPVFSTEPVYRLGLLTMYDQYLS-------------------- 104
           HL A  +  K        PV++T PV  LG     D Y S                    
Sbjct: 66  HLAAYAHCCKHFPPFQRIPVYATRPVIDLGRTLTQDLYASTPLAATTISSASLAEVSYAS 125

Query: 105 --RRQVSEFDLFTL-----DDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAG 152
              +  S  + F L     ++I   F  +  L YSQ +            G+ +  + +G
Sbjct: 126 GYSQAASAENTFLLQPPTPEEITKYFSLIQPLKYSQPHQPLPSPFSPPLNGLTITAYNSG 185

Query: 153 HLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT--------------VLESFVRPAVLI 198
           H LGGT+W I    E ++YAVD+N  +E    G               V+E   +P  L+
Sbjct: 186 HTLGGTIWHIQHGLESIVYAVDWNHSRENVFAGAAWLSGNHGGAGSTQVIEQLHKPTALV 245

Query: 199 TDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL- 257
             +     +    ++ E   ++I   +  GG VL+PVDS+ RVLEL  +LE  W +    
Sbjct: 246 CSSRTPDASLSRLKRDEQLMESIKLCIARGGTVLIPVDSSARVLELSYLLEHAWRKEVAK 305

Query: 258 ------NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET----SRDN----------- 296
                 +  ++      SST+   +S LEWM D+I K FE     SR N           
Sbjct: 306 DNDVFKSAKLFLAGRTISSTMKNARSMLEWMDDNIIKEFEAFADESRRNNRRDEGNHQTG 365

Query: 297 --AFLLKHVTLLINKSEL--------DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVK 346
              F  K++ LL  K+++        D  P   K++LAS ASL+ GFS DI    A+D +
Sbjct: 366 PGPFDFKYLRLLERKAQIEKILKQSEDTEPRA-KVILASDASLDWGFSKDILKSIAADAR 424

Query: 347 NLVLFTERGQFG-----TLARML-----------------------QADPPPKAVKVTMS 378
           NLV+ TE+  F      ++AR L                       Q     + ++V  +
Sbjct: 425 NLVILTEKPNFEPNHKPSIARTLWEWWKERRDGVATERTSNGDTFEQVYAGNRELEVETA 484

Query: 379 RRVPLVGEELIAYEEEQTRLKKEEALKASL-----------------VKEEESKASLGPD 421
            R  L G+EL  Y   Q  L  +  L+A+L                    +    S G D
Sbjct: 485 ERKGLEGDELNVY---QQWLATQRQLQATLQSGGTTTLEAPGDVLDDADTDTDTDSEGSD 541

Query: 422 NNLSGDPMVIDANNANASADVVEPHGGRYRD------ILI------DGFVPPSTSVAPMF 469
               G  + I    A AS   V       +D      ILI      D  V        MF
Sbjct: 542 TEQQGKALNIATTMAQASRKKVA-----LKDEDLGVTILIKKENTYDFNVRGKKGRDRMF 596

Query: 470 PFYENNSEWDDFGEVINPDDYIIKDEDMDQAAMHIG 505
           P        D+FGE+I P+DY+  +E  D      G
Sbjct: 597 PVAMRRRRADEFGELIRPEDYLRAEEREDAENAEAG 632


>gi|21312614|ref|NP_082296.1| integrator complex subunit 11 [Mus musculus]
 gi|81904239|sp|Q9CWS4.1|INT11_MOUSE RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
           Full=Cleavage and polyadenylation-specific factor 3-like
           protein; Short=CPSF3-like protein
 gi|12845859|dbj|BAB26928.1| unnamed protein product [Mus musculus]
 gi|26355309|dbj|BAC41135.1| unnamed protein product [Mus musculus]
 gi|74192536|dbj|BAE43054.1| unnamed protein product [Mus musculus]
 gi|74219576|dbj|BAE29558.1| unnamed protein product [Mus musculus]
 gi|148683102|gb|EDL15049.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_b
           [Mus musculus]
          Length = 600

 Score =  168 bits (426), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 111/356 (31%), Positives = 180/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG    +ND   F D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSISGKNVMLDCGMHMGYNDDRRFPDFSYITQSGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRTFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|74198351|dbj|BAE39661.1| unnamed protein product [Mus musculus]
          Length = 600

 Score =  168 bits (426), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 111/356 (31%), Positives = 180/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG    +ND   F D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSISGKNVMLDCGMHMGYNDDRRFPDFSYITQSGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRTFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|76559911|ref|NP_001029064.1| integrator complex subunit 11 [Rattus norvegicus]
 gi|119371245|sp|Q3MHC2.1|INT11_RAT RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
           Full=Cleavage and polyadenylation-specific factor 3-like
           protein; Short=CPSF3-like protein
 gi|75867808|gb|AAI05304.1| Cleavage and polyadenylation specific factor 3-like [Rattus
           norvegicus]
          Length = 600

 Score =  168 bits (426), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 111/356 (31%), Positives = 180/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG    +ND   F D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSISGKNVMLDCGMHMGYNDDRRFPDFSYITQSGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRTFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|431922648|gb|ELK19568.1| Integrator complex subunit 11 [Pteropus alecto]
          Length = 603

 Score =  168 bits (426), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITRNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +    E + +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVD---EELEIKAYYAGHVLGAAMFQIRVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDR-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|74220481|dbj|BAE31460.1| unnamed protein product [Mus musculus]
          Length = 600

 Score =  168 bits (426), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 111/356 (31%), Positives = 180/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG    +ND   F D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSISGKNVMLDCGMHMGYNDDRRFPDFSYITQSGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQG 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRTFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|444519369|gb|ELV12789.1| Integrator complex subunit 11 [Tupaia chinensis]
          Length = 601

 Score =  168 bits (426), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 111/357 (31%), Positives = 180/357 (50%), Gaps = 19/357 (5%)

Query: 5   VQVTPLSGVFNENPLS-YLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTID 56
           ++VTPL G   +   S  LVSI G N ++DCG +  F       D S +    ++   +D
Sbjct: 4   IRVTPLVGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLD 63

Query: 57  AVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFT 115
            V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT
Sbjct: 64  CVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFT 123

Query: 116 LDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDY 175
              I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DY
Sbjct: 124 SQMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDY 180

Query: 176 NRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLP 234
           N   ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+P
Sbjct: 181 NMTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIP 239

Query: 235 VDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSR 294
           V + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   +
Sbjct: 240 VFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQ 297

Query: 295 DNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
            N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 RNMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 351


>gi|197099184|ref|NP_001124760.1| integrator complex subunit 11 [Pongo abelii]
 gi|55725797|emb|CAH89679.1| hypothetical protein [Pongo abelii]
          Length = 655

 Score =  168 bits (426), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFTDNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|348551496|ref|XP_003461566.1| PREDICTED: integrator complex subunit 11 [Cavia porcellus]
          Length = 600

 Score =  168 bits (425), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|440801023|gb|ELR22048.1| cleavage and polyadenylation specific factor 3like, putative
           [Acanthamoeba castellanii str. Neff]
          Length = 657

 Score =  168 bits (425), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 110/371 (29%), Positives = 182/371 (49%), Gaps = 18/371 (4%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQP----LSK---VASTIDA 57
           ++VTPL    +      LVS+ G N + DCG +  +D +   P    +SK     + ID 
Sbjct: 3   IKVTPLGAGQDVGRSCILVSLGGKNIMFDCGMHMGYDDARRFPDFNFISKSGNFTNAIDC 62

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           ++++H    H GALPY  +  G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 63  IIITHFHLDHCGALPYFTEMCGYDGPIYMTHPTKAICPILLEDYRKITVERKGETNFFTS 122

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  L   Q   +    E + +  + AGH+LG  ++ +    + V+Y  DYN
Sbjct: 123 QMIKDCMKKVVGLNVHQTVQVD---EELEIRAYYAGHVLGAAMFYVRVGDQSVVYTGDYN 179

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    +E  +RP VLIT++  A   +  ++ RE  F   +   +  GG VL+PV
Sbjct: 180 MTPDRHLGAAWIEK-LRPDVLITESTYATTIRDSKRWRERDFLKRVHSCVEKGGKVLIPV 238

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L  PIYF   ++    +Y K F+ W  + I ++F     
Sbjct: 239 FALGRAQELCILLETYWERMNLTVPIYFSAGLTEKATNYYKLFIHWTNEKIKRTF--VHR 296

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH++    +  L + P GP ++ A+   L AG S ++F +WA + KNLV+     
Sbjct: 297 NMFDFKHISTF--ERGLADQP-GPMVLFATPGMLHAGTSLEVFKKWAPNEKNLVIIPGYC 353

Query: 356 QFGTLARMLQA 366
             GT+   L A
Sbjct: 354 VVGTVGNKLAA 364


>gi|402852593|ref|XP_003891002.1| PREDICTED: integrator complex subunit 11 isoform 1 [Papio anubis]
 gi|355557446|gb|EHH14226.1| hypothetical protein EGK_00111 [Macaca mulatta]
 gi|387540112|gb|AFJ70683.1| integrator complex subunit 11 [Macaca mulatta]
          Length = 600

 Score =  168 bits (425), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|195112455|ref|XP_002000788.1| GI10422 [Drosophila mojavensis]
 gi|193917382|gb|EDW16249.1| GI10422 [Drosophila mojavensis]
          Length = 597

 Score =  168 bits (425), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           +++TPL    +      L+S+ G N ++DCG    +ND     D S + P   + S ID 
Sbjct: 4   IKITPLGAGQDVGRSCLLLSMGGKNIMLDCGMHMGYNDERRFPDFSYIVPEGPITSHIDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G + P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYMSEIVGYTGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTT 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +T  Q+  +    E   +  + AGH+LG  ++ I    + V+Y  DYN
Sbjct: 124 QMIKDCMKKVIPVTLHQSMMVDTDLE---IKAYYAGHVLGAAMFWIKVGSQSVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDFLKKVHECVLKGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L YPIYF   ++     Y K F+ W    I K+F     
Sbjct: 240 FALGRAQELCILLETYWERMNLKYPIYFALGLTEKANTYYKMFITWTNQKIRKTF--VHR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +K+ +DN   G  +V A+   L AG S  IF +WA +  N+V+ 
Sbjct: 298 NMFDFKHIKPF-DKAYIDNP--GAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350


>gi|118572558|sp|Q5NVE6.2|INT11_PONAB RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
           Full=Cleavage and polyadenylation-specific factor 3-like
           protein; Short=CPSF3-like protein
          Length = 600

 Score =  168 bits (425), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|426327390|ref|XP_004024501.1| PREDICTED: integrator complex subunit 11 isoform 1 [Gorilla gorilla
           gorilla]
          Length = 600

 Score =  168 bits (425), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|156546030|ref|XP_001608037.1| PREDICTED: integrator complex subunit 11-like isoform 1 [Nasonia
           vitripennis]
 gi|345498393|ref|XP_003428220.1| PREDICTED: integrator complex subunit 11-like isoform 2 [Nasonia
           vitripennis]
 gi|345498395|ref|XP_003428221.1| PREDICTED: integrator complex subunit 11-like isoform 3 [Nasonia
           vitripennis]
          Length = 595

 Score =  168 bits (425), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 109/366 (29%), Positives = 182/366 (49%), Gaps = 21/366 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVS+ G N ++DCG +  F       D S + P     + ID 
Sbjct: 4   IKVTPLGAGQDVGRSCILVSVGGKNIMLDCGMHMGFNDERRFPDFSYIVPEGPATNYIDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G + P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFTEMVGYTGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +T  Q+  +  + E   +  + AGH+LG  ++ I    + ++Y  DYN
Sbjct: 124 QMIKDCMKKVIAVTLHQSVMVDSELE---IKAYYAGHVLGAAMFWIRVGSQSIVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LI+++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLISESTYATTIRDSKRCRERDFLKKVHECIDKGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L  P+YF   ++    +Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETYWERMNLKAPVYFALGLTEKANNYYKMFITWTNQKIKKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL---FT 352
           N F  KH+    +KS +DN   G  +V A+   L AG S  IF +WA +  N+V+   F 
Sbjct: 298 NMFDFKHIKPF-DKSYIDNP--GAMVVFATPGMLHAGLSLQIFKKWAPNEANMVIMPGFC 354

Query: 353 ERGQFG 358
            +G  G
Sbjct: 355 VQGTVG 360


>gi|33300633|ref|NP_060341.2| integrator complex subunit 11 isoform 2 [Homo sapiens]
 gi|118572557|sp|Q5TA45.2|INT11_HUMAN RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
           Full=Cleavage and polyadenylation-specific factor 3-like
           protein; Short=CPSF3-like protein; AltName: Full=Protein
           related to CPSF subunits of 68 kDa; Short=RC-68
 gi|14124912|gb|AAH07978.1| Cleavage and polyadenylation specific factor 3-like [Homo sapiens]
 gi|60650138|tpg|DAA05669.1| TPA_exp: beta-lactamase fold protein family member RC-68 [Homo
           sapiens]
 gi|78100161|tpg|DAA05728.1| TPA_exp: integrator complex subunit 11 [Homo sapiens]
 gi|119576636|gb|EAW56232.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_a
           [Homo sapiens]
 gi|119576638|gb|EAW56234.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_a
           [Homo sapiens]
          Length = 600

 Score =  168 bits (425), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|343958192|dbj|BAK62951.1| protein related to CPSF subunits 68 kDa [Pan troglodytes]
          Length = 600

 Score =  168 bits (425), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|255084461|ref|XP_002508805.1| predicted protein [Micromonas sp. RCC299]
 gi|226524082|gb|ACO70063.1| predicted protein [Micromonas sp. RCC299]
          Length = 728

 Score =  168 bits (425), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 114/385 (29%), Positives = 188/385 (48%), Gaps = 17/385 (4%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSH 62
           +++ PL           L S      + DCG +  +      P       ST+DA+L++H
Sbjct: 27  LEIMPLGAGSEVGRSCVLASYKNKTVMFDCGVHPGYAGIASLPYFDEVDLSTVDAMLITH 86

Query: 63  PDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDDIDS 121
               H  A+P+ + +      +  T P   +  + M D   L+++  +   LF   D+  
Sbjct: 87  FHLDHCAAVPFVVGRTNFKGRILMTHPTKAIFAMLMNDFVKLNKQGDNSEALFGEKDVQE 146

Query: 122 AFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEK 181
             + +  + + Q   +    +G+ V P+ AGH+LG  ++ +   G  V+Y  DY+R  ++
Sbjct: 147 CMRRIEVIDFHQEMDI----DGVKVTPYRAGHVLGACMFYVDIGGLRVLYTGDYSRTPDR 202

Query: 182 HLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGR 240
           HL G  L   + P V+I +A   +    PR++RE  F D + + L  GG VLLPV + GR
Sbjct: 203 HLPGADLPP-IPPHVVIVEATYGVSPHSPREERERRFTDMVHRVLTRGGKVLLPVVALGR 261

Query: 241 VLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAF 298
             E+LLILEDYW +H      PIY  + ++   +   ++++  +   +  +FE S  N F
Sbjct: 262 AQEVLLILEDYWVKHPELKGVPIYQASALAKRAMTVYQTYINVLNSDMKAAFEES--NPF 319

Query: 299 LLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFG 358
           +  HV  L N S LD+   GP +VLA+ + L++G S D+F  W  D KN V+  +    G
Sbjct: 320 VFNHVNHLANSSGLDDV--GPCVVLATPSMLQSGLSRDLFESWCGDSKNGVIICDFAVQG 377

Query: 359 TLARMLQADPPPKAVKVTMSRRVPL 383
           TLAR + +D   K V     + +PL
Sbjct: 378 TLAREILSD--CKTVTSRTGQELPL 400


>gi|397476276|ref|XP_003809533.1| PREDICTED: integrator complex subunit 11 isoform 1 [Pan paniscus]
 gi|410206788|gb|JAA00613.1| cleavage and polyadenylation specific factor 3-like [Pan
           troglodytes]
 gi|410251172|gb|JAA13553.1| cleavage and polyadenylation specific factor 3-like [Pan
           troglodytes]
 gi|410297680|gb|JAA27440.1| cleavage and polyadenylation specific factor 3-like [Pan
           troglodytes]
 gi|410349815|gb|JAA41511.1| cleavage and polyadenylation specific factor 3-like [Pan
           troglodytes]
          Length = 600

 Score =  168 bits (425), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|340966678|gb|EGS22185.1| putative cleavage and polyadenylation protein [Chaetomium
           thermophilum var. thermophilum DSM 1495]
          Length = 998

 Score =  168 bits (425), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 171/634 (26%), Positives = 247/634 (38%), Gaps = 162/634 (25%)

Query: 8   TPLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDT 65
           TPL G  +E+  S  L+ +DG    LID GW++ FDPSLL+ L K   T+  +LL+H   
Sbjct: 5   TPLLGARSESTASQSLLELDGGVKVLIDVGWDESFDPSLLRELEKHVPTLSLILLTHATI 64

Query: 66  LHLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLSRRQVSE------------- 110
            HLGA  +  K   L    PV++T PV  LG     D Y S  + +              
Sbjct: 65  NHLGAYAHCCKHFPLFTRIPVYATRPVIDLGRTLTQDLYASNPRAATTIPKSSLAETAFA 124

Query: 111 ---------------FDLFTLDDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHV 150
                              T D+I   F  +  L YSQ +            G+ +  + 
Sbjct: 125 FPQAAGGAELPSSLLLQPPTPDEIIRYFSLIQPLKYSQPHQPLPSPFSPPLNGLTITAYN 184

Query: 151 AGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT--------------VLESFVRPAV 196
           +GH LGGT+W I    E ++YAVD+N+ +E    G               V+E   +P  
Sbjct: 185 SGHSLGGTIWHIQHGLESIVYAVDWNQARENVFAGAAWLGGGHGAAVGTEVIEPLRKPTA 244

Query: 197 LITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA--- 253
           L+  +       P  ++ E   +++   +  GG VL+PVDS+ RVLEL  +LE  W    
Sbjct: 245 LVCSSRTPDAALPRARRDEQLLESVKLCIARGGTVLIPVDSSARVLELAYLLEHAWRTEV 304

Query: 254 ----EHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA------------ 297
               E      +Y       ST+   +S LEWM DSI + FE     A            
Sbjct: 305 AKENEVFKGTKLYLAGRSVGSTMRNARSMLEWMDDSIVREFEAVAGGARTTNGGANASGG 364

Query: 298 --------FLLKHVTLLINKSELDNA-------PDG--PK--LVLASMASLEAGFSHDIF 338
                   F  K++ LL  K++++         P+G  PK  ++LA+  SL+ GFS D+ 
Sbjct: 365 NKAKEAGPFDFKYLRLLERKAQIERVLQQATSPPEGESPKGTVILATDTSLDWGFSKDVL 424

Query: 339 VEWASDVKNLVLFTERGQFG-----TLARML-----------------------QADPPP 370
              ASD +NLV+ TE+         ++ARML                       Q     
Sbjct: 425 KAIASDARNLVILTEKPNLANPDRPSIARMLWDWWRERRDGVAVEQTASGDTFEQVYGGG 484

Query: 371 KAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMV 430
           + + V  S R PL G EL  Y   Q  L  +  L+A+L        S G    L     V
Sbjct: 485 RELSVPESTRHPLEGSELTVY---QQWLATQRQLQATL-------RSGGAAGALEASADV 534

Query: 431 ID-----------------ANNANASADVVEPHGGRYRDILID---GF------------ 458
           +D                     N S  + +    R + +L D   G             
Sbjct: 535 VDDASETTTESEESETEQQGKALNVSTTIGQ--ASRKKVVLTDEDLGITILLKKKGVYDF 592

Query: 459 -VPPSTSVAPMFPFYENNSEWDDFGEVINPDDYI 491
            V        MFP        D+FGE+I P+DY+
Sbjct: 593 DVRNKKGRERMFPTVLRRKRVDEFGELIRPEDYL 626


>gi|343958314|dbj|BAK63012.1| protein related to CPSF subunits 68 kDa [Pan troglodytes]
          Length = 600

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVHDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|296479091|tpg|DAA21206.1| TPA: cleavage and polyadenylation specific factor 3-like [Bos
           taurus]
          Length = 599

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 177/356 (49%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFSDDRRFPDFSYITRSGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T+P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTQPTQAICPILLEDYRKIAVDKXGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP++LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPSLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W    L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMDLKAPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+          ++P GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF--DRAFADSP-GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|403297738|ref|XP_003939709.1| PREDICTED: integrator complex subunit 11 isoform 1 [Saimiri
           boliviensis boliviensis]
          Length = 600

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVEHGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|296206477|ref|XP_002750225.1| PREDICTED: integrator complex subunit 11 isoform 1 [Callithrix
           jacchus]
          Length = 600

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|440911726|gb|ELR61363.1| Integrator complex subunit 11 [Bos grunniens mutus]
          Length = 599

 Score =  167 bits (423), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 177/356 (49%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFSDDRRFPDFSYITRSGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T+P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTQPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP++LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPSLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W    L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMDLKAPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+          ++P GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF--DRAFADSP-GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|194906134|ref|XP_001981318.1| GG11690 [Drosophila erecta]
 gi|190655956|gb|EDV53188.1| GG11690 [Drosophila erecta]
          Length = 597

 Score =  167 bits (423), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 107/356 (30%), Positives = 177/356 (49%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           +++TPL    +      L+S+ G N ++DCG +  F       D S + P   + S ID 
Sbjct: 4   IKITPLGAGQDVGRSCLLLSMGGKNIMLDCGMHMGFNDERRFPDFSYIVPEGPITSHIDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G + P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYMSEIVGYTGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTT 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +T  Q+  +    E   +  + AGH+LG  ++ I    + V+Y  DYN
Sbjct: 124 QMIKDCMKKVIPVTLHQSMMVDTDLE---IKAYYAGHVLGAAMFWIKVGSQSVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LI+++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLISESTYATTIRDSKRCRERDFLKKVHECVAKGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L YPIYF   ++     Y K F+ W    I K+F     
Sbjct: 240 FALGRAQELCILLETYWERMNLKYPIYFALGLTEKANTYYKMFITWTNQKIRKTF--VHR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +K+ +DN   G  +V A+   L AG S  IF +WA +  N+V+ 
Sbjct: 298 NMFDFKHIKPF-DKAYIDNP--GAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350


>gi|326932364|ref|XP_003212289.1| PREDICTED: integrator complex subunit 11-like [Meleagris gallopavo]
          Length = 600

 Score =  167 bits (423), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 111/356 (31%), Positives = 180/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG    +ND   F D S +    ++   +D 
Sbjct: 4   IKVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGYNDDRRFPDFSYITQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTKAICPILLEDYRKITVDKKGETNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +    E + +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVD---EELEIKAYYAGHVLGAAMFQIKVGCESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|195445135|ref|XP_002070189.1| GK11920 [Drosophila willistoni]
 gi|194166274|gb|EDW81175.1| GK11920 [Drosophila willistoni]
          Length = 597

 Score =  167 bits (423), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           +++TPL    +      L+S+ G N ++DCG    +ND     D S + P   + S ID 
Sbjct: 4   IKITPLGAGQDVGRSCLLLSMGGKNIMLDCGMHMGYNDERRFPDFSYIVPEGPITSHIDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G + P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYMSEIVGYTGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTT 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +T  Q+  +    E   +  + AGH+LG  ++ I    + V+Y  DYN
Sbjct: 124 QMIKDCMKKVIPVTLHQSMMVDTDLE---IKAYYAGHVLGAAMFWIKVGSQSVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LI+++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLISESTYATTIRDSKRCRERDFLKKVHECVAKGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L YPIYF   ++     Y K F+ W    I K+F     
Sbjct: 240 FALGRAQELCILLETYWERMNLKYPIYFALGLTEKANTYYKMFITWTNQKIRKTF--VHR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +K+ +DN   G  +V A+   L AG S  IF +WA +  N+V+ 
Sbjct: 298 NMFDFKHIKPF-DKAYIDNP--GAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350


>gi|61098197|ref|NP_001012854.1| integrator complex subunit 11 [Gallus gallus]
 gi|75571225|sp|Q5ZIH0.1|INT11_CHICK RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
           Full=Cleavage and polyadenylation-specific factor 3-like
           protein; Short=CPSF3-like protein
 gi|53135966|emb|CAG32473.1| hypothetical protein RCJMB04_26e19 [Gallus gallus]
          Length = 600

 Score =  167 bits (423), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 111/356 (31%), Positives = 180/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG    +ND   F D S +    ++   +D 
Sbjct: 4   IKVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGYNDDRRFPDFSYITQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTKAICPILLEDYRKITVDKKGETNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +    E + +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVD---EELEIKAYYAGHVLGAAMFQIKVGCESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|274326663|ref|NP_001094578.1| integrator complex subunit 11 [Bos taurus]
 gi|152941100|gb|ABS44987.1| related to CPSF subunits 68 kDa [Bos taurus]
          Length = 599

 Score =  167 bits (423), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 177/356 (49%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFSDDRRFPDFSYITRSGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T+P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTQPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP++LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPSLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W    L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMDLKAPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+          ++P GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF--DRAFADSP-GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|449268484|gb|EMC79348.1| Integrator complex subunit 11 [Columba livia]
          Length = 600

 Score =  167 bits (423), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 111/356 (31%), Positives = 180/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG    +ND   F D S +    ++   +D 
Sbjct: 4   IKVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGYNDDRRFPDFSYITQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTKAICPILLEDYRKITVDKKGETNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +    E + +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVD---EELEIKAYYAGHVLGAAMFQIKVGCESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|21358523|ref|NP_651721.1| integrator 11 [Drosophila melanogaster]
 gi|7301822|gb|AAF56931.1| integrator 11 [Drosophila melanogaster]
 gi|16768852|gb|AAL28645.1| LD08814p [Drosophila melanogaster]
 gi|220943570|gb|ACL84328.1| CG1972-PA [synthetic construct]
 gi|220953494|gb|ACL89290.1| CG1972-PA [synthetic construct]
          Length = 597

 Score =  167 bits (422), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           +++TPL    +      L+S+ G N ++DCG    +ND     D S + P   + S ID 
Sbjct: 4   IKITPLGAGQDVGRSCLLLSMGGKNIMLDCGMHMGYNDERRFPDFSYIVPEGPITSHIDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G + P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYMSEIVGYTGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTT 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +T  Q+  +    E   +  + AGH+LG  ++ I    + V+Y  DYN
Sbjct: 124 QMIKDCMKKVIPVTLHQSMMVDTDLE---IKAYYAGHVLGAAMFWIKVGSQSVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LI+++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLISESTYATTIRDSKRCRERDFLKKVHECVAKGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L YPIYF   ++     Y K F+ W    I K+F     
Sbjct: 240 FALGRAQELCILLETYWERMNLKYPIYFALGLTEKANTYYKMFITWTNQKIRKTF--VHR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +K+ +DN   G  +V A+   L AG S  IF +WA +  N+V+ 
Sbjct: 298 NMFDFKHIKPF-DKAYIDNP--GAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350


>gi|313238583|emb|CBY13629.1| unnamed protein product [Oikopleura dioica]
          Length = 618

 Score =  167 bits (422), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 112/393 (28%), Positives = 190/393 (48%), Gaps = 22/393 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQP---------LSKVASTI 55
           +++ PL    +      LVSI   N + DCG +  +  +   P          + +   I
Sbjct: 4   IRIVPLGAGQDVGRSCILVSIGNKNVMFDCGMHMGYQDARRFPDFNYITGGDQTTLTPHI 63

Query: 56  DAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLG--LLTMYDQYLSRRQV-SEFD 112
           DAV++SH    H GALPY  +Q+G   P++ T P   +   LL  + + +++R   +E +
Sbjct: 64  DAVIISHFHLDHCGALPYMSEQVGYEGPIYMTMPTKVICPILLEDFRKVVTKRSAGAETN 123

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
            FT + I +  + V  +   Q  ++    + + +  + AGH+LG  ++KIT   E V+Y 
Sbjct: 124 FFTSEMIKNCMRKVEIVGLHQVINVD---DELSIKAYYAGHVLGAAMFKITVGDESVLYT 180

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNV 231
            D+N   ++HL G       +P VLI+++  A   +  ++ RE  F   I + +  GG V
Sbjct: 181 GDFNMTPDRHL-GAAWADRCKPTVLISESTYATTIRDSKRSRERDFLKKIHRCVENGGKV 239

Query: 232 LLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE 291
           L+PV + GR  EL ++LE YW    LN P+YF   ++    +Y K F+ W  + I  SF 
Sbjct: 240 LIPVFALGRAQELCILLEQYWDRMKLNVPVYFTAGLAEKATNYYKLFVNWTNEKIKSSF- 298

Query: 292 TSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
               N F  K++     + E+     GP++  A+   L AG S +IF  W +D KN ++ 
Sbjct: 299 -VERNLFDFKYIKAF--QKEIHMNQSGPQVCFATPGMLHAGMSLEIFQNWCTDEKNCIIM 355

Query: 352 TERGQFGTLA-RMLQADPPPKAVKVTMSRRVPL 383
                 GT+  R+L  +   K   V ++ R+ +
Sbjct: 356 PGYCVAGTVGHRLLHGERHFKFNGVNVTSRIKV 388


>gi|195503187|ref|XP_002098546.1| GE23879 [Drosophila yakuba]
 gi|194184647|gb|EDW98258.1| GE23879 [Drosophila yakuba]
          Length = 597

 Score =  167 bits (422), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           +++TPL    +      L+S+ G N ++DCG    +ND     D S + P   + S ID 
Sbjct: 4   IKITPLGAGQDVGRSCLLLSMGGKNIMLDCGMHMGYNDERRFPDFSYIVPEGPITSHIDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G + P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYMSEIVGYTGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTT 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +T  Q+  +    E   +  + AGH+LG  ++ I    + V+Y  DYN
Sbjct: 124 QMIKDCMKKVIPVTLHQSMMVDTDLE---IKAYYAGHVLGAAMFWIKVGSQSVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LI+++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLISESTYATTIRDSKRCRERDFLKKVHECVAKGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L YPIYF   ++     Y K F+ W    I K+F     
Sbjct: 240 FALGRAQELCILLETYWERMNLKYPIYFALGLTEKANTYYKMFITWTNQKIRKTF--VHR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +K+ +DN   G  +V A+   L AG S  IF +WA +  N+V+ 
Sbjct: 298 NMFDFKHIKPF-DKAYIDNP--GAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350


>gi|195341281|ref|XP_002037239.1| GM12816 [Drosophila sechellia]
 gi|195574829|ref|XP_002105386.1| GD21460 [Drosophila simulans]
 gi|194131355|gb|EDW53398.1| GM12816 [Drosophila sechellia]
 gi|194201313|gb|EDX14889.1| GD21460 [Drosophila simulans]
          Length = 597

 Score =  167 bits (422), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           +++TPL    +      L+S+ G N ++DCG    +ND     D S + P   + S ID 
Sbjct: 4   IKITPLGAGQDVGRSCLLLSMGGKNIMLDCGMHMGYNDERRFPDFSYIVPEGPITSHIDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G + P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYMSEIVGYTGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTT 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +T  Q+  +    E   +  + AGH+LG  ++ I    + V+Y  DYN
Sbjct: 124 QMIKDCMKKVIPVTLHQSMMVDTDLE---IKAYYAGHVLGAAMFWIKVGSQSVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LI+++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLISESTYATTIRDSKRCRERDFLKKVHECVAKGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L YPIYF   ++     Y K F+ W    I K+F     
Sbjct: 240 FALGRAQELCILLETYWERMNLKYPIYFALGLTEKANTYYKMFITWTNQKIRKTF--VHR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +K+ +DN   G  +V A+   L AG S  IF +WA +  N+V+ 
Sbjct: 298 NMFDFKHIKPF-DKNYIDNP--GAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350


>gi|195062087|ref|XP_001996130.1| GH14325 [Drosophila grimshawi]
 gi|193891922|gb|EDV90788.1| GH14325 [Drosophila grimshawi]
          Length = 597

 Score =  167 bits (422), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           +++TPL    +      L+S+ G N ++DCG    +ND     D S + P   + S ID 
Sbjct: 4   IKITPLGAGQDVGRSCLLLSMGGKNIMLDCGMHMGYNDERRFPDFSYIVPEGPITSHIDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G + P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYMSEIVGYAGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTT 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +T  Q+  +    E   +  + AGH+LG  ++ I    + V+Y  DYN
Sbjct: 124 QMIKDCMKKVIPVTLHQSMMVDTDLE---IKAYYAGHVLGAAMFWIKVGSQSVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LI+++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLISESTYATTIRDSKRCRERDFLKKVHECVAKGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L YPIYF   ++     Y K F+ W    I K+F     
Sbjct: 240 FALGRAQELCILLETYWERMNLKYPIYFALGLTEKANTYYKMFITWTNQKIRKTF--VHR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +K+ +DN   G  +V A+   L AG S  IF +WA +  N+V+ 
Sbjct: 298 NMFDFKHIKPF-DKAYIDNP--GAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350


>gi|355680857|gb|AER96662.1| cleavage and polyadenylation specific factor 3-like protein
           [Mustela putorius furo]
          Length = 440

 Score =  167 bits (422), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 108/355 (30%), Positives = 177/355 (49%), Gaps = 18/355 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 13  IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFSDDRRFPDFSYITRNGRLTDFLDC 72

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 73  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 132

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 133 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 189

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 190 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHEAVERGGKVLIPV 248

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 249 FALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 306

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+
Sbjct: 307 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVI 358


>gi|301618510|ref|XP_002938656.1| PREDICTED: integrator complex subunit 11 isoform 1 [Xenopus
           (Silurana) tropicalis]
 gi|301618512|ref|XP_002938657.1| PREDICTED: integrator complex subunit 11 isoform 2 [Xenopus
           (Silurana) tropicalis]
          Length = 600

 Score =  167 bits (422), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 110/356 (30%), Positives = 180/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG    +ND   F D S +    ++   +D 
Sbjct: 4   IKVTPLGAGQDVGRSCILVSIGGKNVMLDCGMHMGYNDDRRFPDFSYITQNGRLTEFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYMSEMVGYDGPIYMTHPTKAICPILLEDYRKITVDKKGETNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVNLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LI+++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLISESTYATTIRDSKRCRERDFLKKVHETVEKGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNDKNMVIM 350


>gi|207079923|ref|NP_001128922.1| DKFZP459J1110 protein [Pongo abelii]
 gi|56403907|emb|CAI29738.1| hypothetical protein [Pongo abelii]
          Length = 600

 Score =  167 bits (422), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 177/356 (49%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYVTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT +  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITGSTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|410989914|ref|XP_004001198.1| PREDICTED: LOW QUALITY PROTEIN: integrator complex subunit 11
           [Felis catus]
          Length = 598

 Score =  166 bits (421), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 177/356 (49%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIGGKNVMLDCGMHMGFNDDRRFPDFSYITRNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHEAVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|260790823|ref|XP_002590440.1| hypothetical protein BRAFLDRAFT_289082 [Branchiostoma floridae]
 gi|229275634|gb|EEN46451.1| hypothetical protein BRAFLDRAFT_289082 [Branchiostoma floridae]
          Length = 597

 Score =  166 bits (421), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 107/351 (30%), Positives = 174/351 (49%), Gaps = 20/351 (5%)

Query: 22  LVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
           LVSI G N ++DCG    +ND     D + +     +   +D V++SH    H G LPY 
Sbjct: 12  LVSIGGKNIMLDCGMHMGYNDERRFPDFTYITQSGTLNDHLDCVIISHFHLDHCGCLPYM 71

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTMYDQY---LSRRQVSEFDLFTLDDIDSAFQSVTRLTY 131
            + +G   P++ T P   +  + + D     + R+  S+ + FT   I    + V  +  
Sbjct: 72  TEMVGYDGPIYMTHPTKAICPILLEDYRKITVDRKGESQANFFTSQMIKDCMKKVIPVNL 131

Query: 132 SQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESF 191
            Q   +  + E   +  + AGH+LG  ++ I    E V+Y  DYN   ++HL    ++  
Sbjct: 132 HQTVQVDDELE---IKAYYAGHVLGAAMFLIKVGSESVVYTGDYNMTPDRHLGAAWIDK- 187

Query: 192 VRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILED 250
            RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV + GR  EL ++LE 
Sbjct: 188 CRPDLLITESTYATTIRDSKRCRERDFLKKVHETIEKGGKVLIPVFALGRAQELCILLET 247

Query: 251 YWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKS 310
           +W   ++  PIYF T ++    +Y + F+ W    I K+F   + N F  KH+    ++S
Sbjct: 248 FWERMNIKAPIYFSTGLTEKANNYYRLFITWTNQKIRKTF--VKRNMFEFKHIKAF-DRS 304

Query: 311 ELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLA 361
            +DN   GP +V A+   L AG S  IF +WA D KN+V+       GT+ 
Sbjct: 305 YIDNP--GPMVVFATPGMLHAGLSLQIFKKWAPDSKNMVIMPGYCVAGTVG 353


>gi|366992944|ref|XP_003676237.1| hypothetical protein NCAS_0D02950 [Naumovozyma castellii CBS 4309]
 gi|342302103|emb|CCC69876.1| hypothetical protein NCAS_0D02950 [Naumovozyma castellii CBS 4309]
          Length = 771

 Score =  166 bits (421), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 111/371 (29%), Positives = 189/371 (50%), Gaps = 23/371 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLSRRQVS 109
           STID +L+SH    H  +LPY M++      VF T P   +YR  LL  + +  S    S
Sbjct: 59  STIDVLLISHFHLDHAASLPYVMQKTNFKGRVFMTHPTKAIYRW-LLRDFVRVTSIGVNS 117

Query: 110 EF----DLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKD 165
                 +++T +D+  +F  +  +    +YH +   +GI      AGH+LG  +++I   
Sbjct: 118 TIGNDDNIYTDEDLAESFDKIETV----DYHSTVDVDGIKFTAFHAGHVLGAAMFQIEIA 173

Query: 166 GEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTL 225
           G  V++  DY+R  ++HLN   + S     +++   +    ++P   + +     I  T+
Sbjct: 174 GLRVLFTGDYSREMDRHLNSAEVPSLPSDVLIVESTFGTATHEPRLNREKNLTQLIHSTV 233

Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWAEH-----SLNYPIYFLTYVSSSTIDYVKSFLE 280
             GG VLLPV + GR  E++LIL++YW++H     S   PIY+ + ++   +   ++++ 
Sbjct: 234 SRGGRVLLPVFALGRAQEIMLILDEYWSQHAEELGSGQVPIYYASNLAKKCMSVFQTYVN 293

Query: 281 WMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVE 340
            M D I + F  S+ N F+ K+++ L N  E  +   GP ++LAS   L++G S D+  +
Sbjct: 294 MMNDDIRRKFRDSQTNPFIFKNISYLRNLEEFQDF--GPSVMLASPGMLQSGLSRDVLEK 351

Query: 341 WASDVKNLVLFTERGQFGTLAR--MLQADPPPKAV--KVTMSRRVPLVGEELIAYEEEQT 396
           W  D KNLVL T     GT+A+  ML+ D  P     +VT+ RR  +      A+ + Q 
Sbjct: 352 WCPDEKNLVLITGYSIEGTMAKFIMLEPDTIPSINNPEVTVPRRCNVEEISFAAHVDFQE 411

Query: 397 RLKKEEALKAS 407
            L+  E + A+
Sbjct: 412 NLEFIEKISAN 422


>gi|15029864|gb|AAH11155.1| Cleavage and polyadenylation specific factor 3-like [Mus musculus]
          Length = 600

 Score =  166 bits (421), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 111/356 (31%), Positives = 179/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG    +ND   F D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSISGKNVMLDCGMHMGYNDDRRFPDFSYITQSGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V      Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVADHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRTFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|453084596|gb|EMF12640.1| Metallo-hydrolase/oxidoreductase [Mycosphaerella populorum SO2202]
          Length = 964

 Score =  166 bits (421), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 126/407 (30%), Positives = 189/407 (46%), Gaps = 61/407 (14%)

Query: 8   TPLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDT 65
           TPL G  +++P S  L+ +DG    L+D GW++ FD   L  L +  +T+  VLL+H   
Sbjct: 5   TPLLGAQSDSPASQSLLELDGGVKILVDVGWDETFDAEQLHALERHVATLSVVLLTHATL 64

Query: 66  LHLGALPYAMKQLG--LSAPVFSTEPVYRLGLLTMYDQYLS---------RRQVSE---- 110
            HLGA  +  K +    + PV++T PV  LG   + D Y S          R ++     
Sbjct: 65  DHLGAYAHCCKHIPHFRNVPVYATTPVVNLGRTLITDLYASAPLAAGVIPARAIAANTAL 124

Query: 111 ---------FDLFTLDDIDSAFQSVTRLTYSQNYH-----LSGKGEGIVVAPHVAGHLLG 156
                    F   + D+I + F ++  L YSQ +       S     + +  + AGH  G
Sbjct: 125 APDATPSLLFPAPSADEIAAYFGAIHPLRYSQPHQPVPSPFSAPVGNLTITAYSAGHTPG 184

Query: 157 GTVWKITKDGEDVIYAVDYNRRKEKHLNG--------TVLESFVRPAVLITDAYNALHNQ 208
           GT+W I    E ++YA D+N+ +E  L+G         + E   RP  LI  +      +
Sbjct: 185 GTIWHIQHSLESIVYAADWNQGRENLLSGAAWLSGGSNITEGLQRPTALICSSRGVEKTE 244

Query: 209 P-PRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--------LN 258
              R++R E     I +T+  GG VL+P DS+ RVLEL  IL   W E+          N
Sbjct: 245 TLTRKKRDEALISLIRETIAQGGKVLIPTDSSARVLELAFILNHTWRENVEGPHADTYRN 304

Query: 259 YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD----------NAFLLKHVTLLIN 308
             IY  +  S ST+  + S LEWM D+I +  E +            N    + +  + +
Sbjct: 305 ARIYMASKTSKSTVRQLSSMLEWMDDAIIRDAEAAMSKTQADEGRVPNLLDWQFIQQIES 364

Query: 309 KSELDNA--PDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
           K++LD A     P ++LAS ASLE GFS     + A D +NLV+ TE
Sbjct: 365 KNKLDQALRRRRPCILLASDASLEWGFSRQAMEKLAEDPRNLVILTE 411


>gi|194765324|ref|XP_001964777.1| GF23370 [Drosophila ananassae]
 gi|190615049|gb|EDV30573.1| GF23370 [Drosophila ananassae]
          Length = 597

 Score =  166 bits (421), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           +++TPL    +      L+S+ G N ++DCG    +ND     D S + P   + S ID 
Sbjct: 4   IKITPLGAGQDVGRSCLLLSMGGKNIMLDCGMHMGYNDERRFPDFSYIVPDGPITSHIDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G + P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYMSEIVGYTGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTT 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +T  Q+  +    E   +  + AGH+LG  ++ I    + V+Y  DYN
Sbjct: 124 QMIKDCMKKVIPVTLHQSMMVDTDLE---IKAYYAGHVLGAAMFWIKVGSQSVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LI+++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLISESTYATTIRDSKRCRERDFLKKVHECVAKGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L YPIYF   ++     Y K F+ W    I K+F     
Sbjct: 240 FALGRAQELCILLETYWDRMNLKYPIYFALGLTEKANTYYKMFITWTNQKIRKTF--VHR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +K+ +DN   G  +V A+   L AG S  IF +WA +  N+V+ 
Sbjct: 298 NMFDFKHIKPF-DKAYIDNP--GAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350


>gi|348503157|ref|XP_003439132.1| PREDICTED: integrator complex subunit 11-like [Oreochromis
           niloticus]
          Length = 601

 Score =  166 bits (421), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 112/366 (30%), Positives = 180/366 (49%), Gaps = 18/366 (4%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG    +ND     D S +    ++   +D 
Sbjct: 4   IKVTPLGAGQDVGRSCILVSIGGKNIMLDCGMHMGYNDDRRFPDFSYITQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYMSEMVGYDGPIYMTHPTKAICPILLEDFRKITVDKKGETNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  L   Q   +  + E   +  + AGH+LG  +  I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVIPLNLHQTVQVDDELE---IKAYYAGHVLGAAMVHIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LI+++  A   +  ++ RE  F   + +++  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDILISESTYATTIRDSKRCRERDFLKKVHESIERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+    ++S  DN   GP +V A+   L AG S  IF +WA + KN+V+     
Sbjct: 298 NMFEFKHIKAF-DRSYADNP--GPMVVFATPGMLHAGQSLQIFKKWAGNEKNMVIMPGYC 354

Query: 356 QFGTLA 361
             GT+ 
Sbjct: 355 VQGTIG 360


>gi|91086147|ref|XP_969343.1| PREDICTED: similar to CG1972 CG1972-PA [Tribolium castaneum]
 gi|270009886|gb|EFA06334.1| hypothetical protein TcasGA2_TC009205 [Tribolium castaneum]
          Length = 595

 Score =  166 bits (420), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 109/366 (29%), Positives = 184/366 (50%), Gaps = 21/366 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           +++TPL    +      L+++ G N ++DCG    +ND     D S +     + S ID 
Sbjct: 4   IKITPLGAGQDVGRSCILLTMGGKNIMLDCGMHMGYNDERRFPDFSYISQEGPLTSYIDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G S P++ T P   +  + + D + +S  +  + + FT 
Sbjct: 64  VIISHFHLDHCGALPYMSEMVGYSGPIYMTHPTKAIAPILLEDMRKVSVEKKGDQNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +T  Q+  +  +   I +  + AGH+LG  ++ I    + V+Y  DYN
Sbjct: 124 QMIKDCMKKVIAVTLHQSLMVDNE---IEIKAYYAGHVLGAAMFWIRVGAQSVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LI+++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLISESTYATTIRDSKRCRERDFLKKVHECMDRGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L  P+YF   ++    +Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETYWERMNLKAPVYFALGLTEKANNYYKMFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL---FT 352
           N F  KH+    ++S +DN   GP +V A+   L AG S  IF +WA +  N+V+   F 
Sbjct: 298 NMFDFKHIKPF-DRSYIDNP--GPMVVFATPGMLHAGLSLQIFKKWAPNENNMVIMPGFC 354

Query: 353 ERGQFG 358
            +G  G
Sbjct: 355 VQGTVG 360


>gi|56403864|emb|CAI29717.1| hypothetical protein [Pongo abelii]
          Length = 600

 Score =  166 bits (420), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+  + + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKT--SVQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|224079882|ref|XP_002197797.1| PREDICTED: integrator complex subunit 11 [Taeniopygia guttata]
          Length = 600

 Score =  166 bits (420), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 110/356 (30%), Positives = 180/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG    +ND   F D S +    ++   +D 
Sbjct: 4   IKVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGYNDDRRFPDFSYITQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ + P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMSHPTKAICPILLEDYRKITVDKKGETNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +    E + +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVD---EELEIKAYYAGHVLGAAMFQIKVGCESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|432866809|ref|XP_004070946.1| PREDICTED: integrator complex subunit 11-like [Oryzias latipes]
          Length = 599

 Score =  166 bits (420), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 112/366 (30%), Positives = 180/366 (49%), Gaps = 18/366 (4%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG    +ND     D S +    ++   +D 
Sbjct: 4   IKVTPLGAGQDVGRSCILVSIGGKNIMLDCGMHMGYNDDRRFPDFSYVTQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYMSEMVGYDGPIYMTHPTKAICPILLEDFRKITVDKKGETNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  L   Q   +  + E   +  + AGH+LG  +  I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVPLNLHQTVQVDDELE---IKAYYAGHVLGAAMVYIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LI+++  A   +  ++ RE  F   + +++  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDILISESTYATTIRDSKRCRERDFLKKVHESIERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKAPIYFSTGMTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+    ++S  DN   GP +V A+   L AG S  IF +WA + KN+V+     
Sbjct: 298 NMFEFKHIKAF-DRSYADNP--GPMVVFATPGMLHAGQSLQIFKKWAGNEKNMVIMPGYC 354

Query: 356 QFGTLA 361
             GT+ 
Sbjct: 355 VQGTIG 360


>gi|303275006|ref|XP_003056813.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226461165|gb|EEH58458.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 803

 Score =  166 bits (419), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 110/368 (29%), Positives = 187/368 (50%), Gaps = 14/368 (3%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQP-LSKV-ASTIDAVLLSH 62
           +++TPL           + +  G + + DCG +  +      P   +V  ST+DA+L++H
Sbjct: 18  LRITPLGAGSEVGRSCVMATYKGKSVMFDCGVHPGYAGIASLPYFDEVDLSTVDALLVTH 77

Query: 63  PDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSA 122
               H  A+P+ +        +  T P   +  + M D    ++      LFT  D+ +A
Sbjct: 78  FHLDHCAAVPFLVGHTNFKGRILMTHPTKAIFNMLMTDFVKLQKNNDSEALFTEQDLKAA 137

Query: 123 FQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKH 182
              +  + + Q   +    +G+ V P+ AGH+LG  ++ +  DG  V+Y  DY+R  ++H
Sbjct: 138 IAMIEVVDFHQEIVI----DGMKVTPYRAGHVLGACMFFVDIDGLRVLYTGDYSRTPDRH 193

Query: 183 LNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRV 241
           L G  L S V P V+I+++   +    PR++RE  F D + + L  GG VLLPV + GR 
Sbjct: 194 LPGADLPS-VPPHVVISESTYGVSPHTPREEREKRFTDRVYQILNRGGKVLLPVVALGRA 252

Query: 242 LELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
            ELLLILED+W +H    N PIY  + ++   +   ++++  +   +  +FE +  N F+
Sbjct: 253 QELLLILEDHWKKHPELANVPIYQASALARRAMTVYQTYINVLNSDMKAAFEEA--NPFV 310

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
             HV  L +   LD+   GP +VLA+ + L++G S ++F  W  D  N V+  +    GT
Sbjct: 311 FNHVQHLSHAGGLDDV--GPCVVLATPSMLQSGLSRELFEMWCGDANNGVIIADFAVQGT 368

Query: 360 LARMLQAD 367
           LAR + +D
Sbjct: 369 LAREILSD 376


>gi|383859336|ref|XP_003705151.1| PREDICTED: integrator complex subunit 11-like isoform 1 [Megachile
           rotundata]
          Length = 595

 Score =  166 bits (419), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 108/366 (29%), Positives = 182/366 (49%), Gaps = 21/366 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVS+ G N ++DCG +  F       D S + P     + ID 
Sbjct: 4   IKVTPLGAGQDVGRSCILVSMGGKNIMLDCGMHMGFNDERRFPDFSYIVPEGPATNYIDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G + P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFTEMVGYTGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +T  Q+  +  + E   +  + AGH+LG  ++ I    + ++Y  DYN
Sbjct: 124 QMIKDCMKKVIAVTLHQSVMVDSELE---IKAYYAGHVLGAAMFWIRVGSQSIVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LI+++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLISESTYATTIRDSKRCRERDFLKKVHECIDRGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L  P+YF   ++    +Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETYWERMNLKVPVYFALGLTEKANNYYKMFITWTNQKIKKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL---FT 352
           N F  KH+    +K+ +DN   G  +V A+   L AG S  IF +WA +  N+V+   F 
Sbjct: 298 NMFDFKHIKPF-DKAYIDNP--GAMVVFATPGMLHAGLSLQIFKKWAPNEANMVIMPGFC 354

Query: 353 ERGQFG 358
            +G  G
Sbjct: 355 VQGTVG 360


>gi|301788922|ref|XP_002929872.1| PREDICTED: integrator complex subunit 11-like [Ailuropoda
           melanoleuca]
          Length = 600

 Score =  166 bits (419), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 177/356 (49%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITRNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDR-CRPNLLITESTYATTIRDSKRCRERDFLKKVHEAVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|359319514|ref|XP_003639102.1| PREDICTED: integrator complex subunit 11-like [Canis lupus
           familiaris]
          Length = 600

 Score =  166 bits (419), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 177/356 (49%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITRNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHEAVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|410928941|ref|XP_003977858.1| PREDICTED: integrator complex subunit 11-like [Takifugu rubripes]
          Length = 601

 Score =  166 bits (419), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 111/366 (30%), Positives = 181/366 (49%), Gaps = 18/366 (4%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG    +ND     D S +    ++   +D 
Sbjct: 4   IKVTPLGAGQDVGRSCILVSIGGKNIMLDCGMHMGYNDDRRFPDFSYVTQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALP+  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPFMSEMVGYDGPIYMTHPTKAICPILLEDFRKITVDKKGETNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  L   Q   +  + E   +  + AGH+LG  + +I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVPLNLHQTVQVDDELE---IKAYYAGHVLGAAMVQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LI+++  A   +  ++ RE  F   + +++  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDILISESTYATTIRDSKRCRERDFLKKVHESIERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+    ++S  DN   GP +V A+   L AG S  IF +WA + KN+V+     
Sbjct: 298 NMFEFKHIKAF-DRSYADNP--GPMVVFATPGMLHAGQSLQIFKKWAGNEKNMVIMPGYC 354

Query: 356 QFGTLA 361
             GT+ 
Sbjct: 355 VQGTIG 360


>gi|125773833|ref|XP_001358175.1| GA15164 [Drosophila pseudoobscura pseudoobscura]
 gi|54637910|gb|EAL27312.1| GA15164 [Drosophila pseudoobscura pseudoobscura]
          Length = 597

 Score =  166 bits (419), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 107/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           +++TPL    +      L+++ G N ++DCG    +ND     D S + P   + S ID 
Sbjct: 4   IKITPLGAGQDVGRSCLLLTMGGKNIMLDCGMHMGYNDERRFPDFSYIVPEGPITSHIDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G + P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYMSEIVGYNGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTT 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +T  Q+  +    E   +  + AGH+LG  ++ I    + V+Y  DYN
Sbjct: 124 QMIKDCMKKVIPVTLHQSMMVDTDLE---IKAYYAGHVLGAAMFWIKVGSQSVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LI+++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLISESTYATTIRDSKRCRERDFLKKVHECVARGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L YPIYF   ++     Y K F+ W    I K+F     
Sbjct: 240 FALGRAQELCILLETYWERMNLKYPIYFALGLTEKANTYYKMFITWTNQKIRKTF--VHR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +K+ +DN   G  +V A+   L AG S  IF +WA +  N+V+ 
Sbjct: 298 NMFDFKHIKPF-DKAYIDNP--GAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350


>gi|426240429|ref|XP_004014105.1| PREDICTED: integrator complex subunit 11 [Ovis aries]
          Length = 515

 Score =  166 bits (419), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 176/356 (49%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFSDDRRFPDFSYITRSGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T+P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTQPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W    L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMDLKAPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+          ++P GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF--DRAFADSP-GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|392580514|gb|EIW73641.1| hypothetical protein TREMEDRAFT_67471 [Tremella mesenterica DSM
           1558]
          Length = 944

 Score =  166 bits (419), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 187/781 (23%), Positives = 314/781 (40%), Gaps = 181/781 (23%)

Query: 18  PLSYLVSIDGFNFLIDCGWND------HFDPSLLQPLSKVASTIDAVLLSHPDTLHLGAL 71
           PL YL+ +D    L+D G +D      H        + ++A T+  VLLSH  T +L   
Sbjct: 19  PLCYLLEVDDARILLDMGQSDYTAASSHSSYEYENKVRELAPTLSLVLLSHSQTRYLSLY 78

Query: 72  PYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD------------------- 112
           P+A  + GL  PV++T+P   +G +    +  S R     D                   
Sbjct: 79  PFARARWGLQCPVYATQPTVEMGRVVCLSEVYSWRSEHAVDDTSDHSANHSSGGSPDKGK 138

Query: 113 -------LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKI-TK 164
                  + T++++  AF  +  + Y+Q  HL G    +++ P  +GH LGGT++KI + 
Sbjct: 139 QPLRGPFVPTVEEVHEAFDWIKAVRYNQPLHLDGGLSHLLLTPFRSGHTLGGTLFKIRSP 198

Query: 165 DGEDVIYAVDYNRRKEKHLNGTVL---------ESFVRPAVLITDAYNALHNQPPRQQRE 215
               V+YAV  N   E+HL+G V          E  +RP +LI +   A    P R++RE
Sbjct: 199 TSGTVLYAVGMNHTGERHLDGMVSGQGGPSGYEEGVLRPDLLIVEGSRATVVNPKRRERE 258

Query: 216 M-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW-------------------AEH 255
               D +S TL A  +VL+PVD + R+LELL++ + +W                   AE 
Sbjct: 259 TALIDVVSSTLEASRSVLMPVDPSPRLLELLILFDQHWTFKQIPPEKRNHLYVPKEEAER 318

Query: 256 SLNYPIYFLTYVSSSTIDYVKSFLEWMG------------DSITKSFETSRDNAFLL--- 300
              YP+  ++        + +S +EWMG            D +    +  R     L   
Sbjct: 319 QWPYPLCLVSRTGHDMASFARSLIEWMGGIVREAGGEEVVDDLPTGGKKGRRKPIGLGNS 378

Query: 301 -------KHVTLLINKSELDNAP--DGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
                  +HV    +  +L      + PKLVLA   ++  G S  +F    S   N++L 
Sbjct: 379 EYGLLDFRHVRFFASPMDLLQGLGLNRPKLVLAIPPAMNHGPSRWLFTAMGSVEGNVILL 438

Query: 352 TERGQFGTLARMLQAD---PPPKAVK-----------------VTMSRRVPLVGEELIAY 391
           T  GQ  +LAR L  +     P   K                 V ++ +VPL+G EL A+
Sbjct: 439 TSTGQDQSLARDLYNEWEKSQPSGCKWGEGKIGKLHRLDGSMTVELNSKVPLIGAELEAH 498

Query: 392 -EEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDA------NNANASADVVE 444
            E E+   ++E A +A+L + E    +   +++   D   +DA       N    A+   
Sbjct: 499 VEAERLEKEREAAHQAALNRSERMLEADDLESDSDSDTESLDAATGGLVRNRAEGANAYA 558

Query: 445 PHGGRYRDILIDGFVPPST-----------SVAPMFPFYENNS-EWDDFGEVINPDDYII 492
             G   R +  D FV               +   MFPF E    + DD+GE ++   ++ 
Sbjct: 559 GDGEDVRTMSFDIFVKGQQMRTGRGTEGGMARFRMFPFLERRGRKIDDYGEGLDIGQWVR 618

Query: 493 K----------------------DEDMDQ-------------------AAMHIGGDDGKL 511
           K                      DE+  Q                   A +     DG+L
Sbjct: 619 KGKEIEEEGETEEVREAKRRKEMDEEKHQDAPEPPSKYVTEIKTVELHAYVFFVDMDGQL 678

Query: 512 D-EGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQH--CLKHVCPHVYTPQIEETID 568
           D +   ++I D +P K+      ++V  + +ATE+L  +      +   ++ P + +T+ 
Sbjct: 679 DGQALKTVITDLQPRKI------IIVRSTPQATENLLDYFRSASLITHDIHIPALYQTLR 732

Query: 569 VTSDLCAYKVQLSEKLMSNVLFK--KLGDYEIAWVDAEVGKTENGMLSLLPIST----PA 622
           +   + +Y + L + + +++  K  K   +EI  VD ++  +    +  L  S     PA
Sbjct: 733 IGEHVQSYSLILGDSISASLAGKWSKFEGFEITMVDGKIAFSAGSTVPHLETSNAVIEPA 792

Query: 623 P 623
           P
Sbjct: 793 P 793



 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 35/92 (38%), Positives = 50/92 (54%), Gaps = 3/92 (3%)

Query: 618 ISTPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKVGPAGQKGG- 675
           + T  P   S+ VGDL++A LK  L+S  I  EFAG G L CG  V+  +   AG     
Sbjct: 850 VQTAVPLPTSLFVGDLRLAVLKNKLASLNIPAEFAGEGVLVCGPGVSTPETAKAGSLVAV 909

Query: 676 -GSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
              GT +IV+EG + + Y+ +R  LY  F ++
Sbjct: 910 RKVGTGEIVLEGTVGKVYFDVRKALYGSFAMV 941


>gi|328776642|ref|XP_003249190.1| PREDICTED: integrator complex subunit 11-like [Apis mellifera]
          Length = 603

 Score =  166 bits (419), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 108/366 (29%), Positives = 182/366 (49%), Gaps = 21/366 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVS+ G N ++DCG +  F       D S + P     + ID 
Sbjct: 4   IKVTPLGAGQDVGRSCILVSMGGKNIMLDCGMHMGFNDERRFPDFSYIIPEGPATNYIDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G + P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFTEMVGYTGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +T  Q+  +  + E   +  + AGH+LG  ++ I    + ++Y  DYN
Sbjct: 124 QMIKDCMKKVIAVTLHQSVMVDSELE---IKAYYAGHVLGAAMFWIRVGSQSIVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LI+++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLISESTYATTIRDSKRCRERDFLKKVHECIDRGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L  P+YF   ++    +Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETYWERMNLKVPVYFALGLTEKANNYYKMFITWTNQKIKKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL---FT 352
           N F  KH+    +K+ +DN   G  +V A+   L AG S  IF +WA +  N+V+   F 
Sbjct: 298 NMFDFKHIKPF-DKAYIDNP--GAMVVFATPGMLHAGLSLQIFKKWAPNEANMVIMPGFC 354

Query: 353 ERGQFG 358
            +G  G
Sbjct: 355 VQGTVG 360


>gi|195143691|ref|XP_002012831.1| GL23717 [Drosophila persimilis]
 gi|194101774|gb|EDW23817.1| GL23717 [Drosophila persimilis]
          Length = 597

 Score =  166 bits (419), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 107/356 (30%), Positives = 179/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           +++TPL    +      L+++ G N ++DCG    +ND     D S + P   + S ID 
Sbjct: 4   IKITPLGAGQDVGRSCLLLTMGGKNIMLDCGMHMGYNDERRFPDFSYIVPEGPITSHIDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G + P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYMSEIVGYNGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTT 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +T  Q+  +    E   +  + AGH+LG  ++ I    + V+Y  DYN
Sbjct: 124 QMIKDCMKKVIPVTLHQSMMVDTDLE---IKAYYAGHVLGAAMFWINVGSQSVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    +++  RP +LI+++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDN-ARPDLLISESTYATTIRDSKRCRERDFLKKVHECVARGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L YPIYF   ++     Y K F+ W    I K+F     
Sbjct: 240 FALGRAQELCILLETYWERMNLKYPIYFALGLTEKANTYYKMFITWTNQKIRKTF--VHR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +K+ +DN   G  +V A+   L AG S  IF +WA +  N+V+ 
Sbjct: 298 NMFDFKHIKPF-DKAYIDNP--GAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIM 350


>gi|401827835|ref|XP_003888210.1| putative RNA-processing beta-lactamase-fold exonuclease
           [Encephalitozoon hellem ATCC 50504]
 gi|392999410|gb|AFM99229.1| putative RNA-processing beta-lactamase-fold exonuclease
           [Encephalitozoon hellem ATCC 50504]
          Length = 496

 Score =  165 bits (418), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 112/366 (30%), Positives = 179/366 (48%), Gaps = 23/366 (6%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQP----LSKVAS---TIDA 57
           + V PL    +      LV+I G   + DCG +  F+     P    +SK  S    ID 
Sbjct: 1   MNVVPLGAGQDVGRSCVLVTIGGRTIMFDCGMHMGFNDERRFPDFSYISKTKSFDKVIDC 60

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLD 117
           V++SH    H GALPY  +  G + P++ T P   +    + D +         ++F+  
Sbjct: 61  VIISHFHLDHCGALPYFTEVCGYNGPIYMTLPTKEV-CPVLLDDFRKIVGAKGDNIFSYQ 119

Query: 118 DIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNR 177
           DI +  + VT ++ S+ Y      E   + P+ AGH+LG  ++ +    + V+Y  DY+ 
Sbjct: 120 DIVNCMKKVTTISMSETYK---HDEDFYITPYYAGHVLGAAMFHVVVGDQSVVYTGDYST 176

Query: 178 RKEKHLNGTVLESFVRPAVLITDA-YNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVD 236
             +KHL    ++  VRP +LIT++ Y ++     R +   F  AIS  +  GG VL+P+ 
Sbjct: 177 TPDKHLGPASIKC-VRPDLLITESTYGSITRDCRRVKEREFLKAISDCIARGGRVLIPIF 235

Query: 237 SAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS-FETSRD 295
           + GR  EL L+L+ YW    L  P+YF + ++    +  K F+ +  +++ K  FE    
Sbjct: 236 ALGRAQELCLLLDGYWERTGLKVPVYFSSGLTEKANEIYKKFISYTNETVKKKIFER--- 292

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL---FT 352
           N F  KH+     K  +DN  +GP ++ AS   L +G S  +F EW SD KNLV+   + 
Sbjct: 293 NVFEYKHIKPF-QKYYMDN--EGPMVLFASPGMLHSGMSLRMFKEWCSDEKNLVIIPGYC 349

Query: 353 ERGQFG 358
            RG  G
Sbjct: 350 VRGTIG 355


>gi|380011463|ref|XP_003689822.1| PREDICTED: integrator complex subunit 11-like [Apis florea]
          Length = 595

 Score =  165 bits (418), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 108/366 (29%), Positives = 182/366 (49%), Gaps = 21/366 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVS+ G N ++DCG +  F       D S + P     + ID 
Sbjct: 4   IKVTPLGAGQDVGRSCILVSMGGKNIMLDCGMHMGFNDERRFPDFSYIIPEGPATNYIDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G + P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFTEMVGYTGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +T  Q+  +  + E   +  + AGH+LG  ++ I    + ++Y  DYN
Sbjct: 124 QMIKDCMKKVIAVTLHQSVMVDSELE---IKAYYAGHVLGAAMFWIRVGSQSIVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LI+++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLISESTYATTIRDSKRCRERDFLKKVHECIDRGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L  P+YF   ++    +Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETYWERMNLKVPVYFALGLTEKANNYYKMFITWTNQKIKKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL---FT 352
           N F  KH+    +K+ +DN   G  +V A+   L AG S  IF +WA +  N+V+   F 
Sbjct: 298 NMFDFKHIKPF-DKAYIDNP--GAMVVFATPGMLHAGLSLQIFKKWAPNEANMVIMPGFC 354

Query: 353 ERGQFG 358
            +G  G
Sbjct: 355 VQGTVG 360


>gi|340728535|ref|XP_003402577.1| PREDICTED: integrator complex subunit 11-like [Bombus terrestris]
 gi|350421011|ref|XP_003492700.1| PREDICTED: integrator complex subunit 11-like [Bombus impatiens]
          Length = 595

 Score =  165 bits (418), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 108/366 (29%), Positives = 182/366 (49%), Gaps = 21/366 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVS+ G N ++DCG +  F       D S + P     + ID 
Sbjct: 4   IKVTPLGAGQDVGRSCILVSMGGKNIMLDCGMHMGFNDERRFPDFSYIIPEGPTTNYIDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G + P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFTEMVGYTGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +T  Q+  +  + E   +  + AGH+LG  ++ I    + ++Y  DYN
Sbjct: 124 QMIKDCMKKVIAVTLHQSVMVDSELE---IKAYYAGHVLGAAMFWIRVGSQSIVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LI+++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLISESTYATTIRDSKRCRERDFLKKVHECIDRGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L  P+YF   ++    +Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETYWERMNLKVPVYFALGLTEKANNYYKMFITWTNQKIKKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL---FT 352
           N F  KH+    +K+ +DN   G  +V A+   L AG S  IF +WA +  N+V+   F 
Sbjct: 298 NMFDFKHIKPF-DKAYIDNP--GAMVVFATPGMLHAGLSLQIFKKWAPNEANMVIMPGFC 354

Query: 353 ERGQFG 358
            +G  G
Sbjct: 355 VQGTVG 360


>gi|432090010|gb|ELK23618.1| Integrator complex subunit 11 [Myotis davidii]
          Length = 561

 Score =  165 bits (418), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 105/339 (30%), Positives = 171/339 (50%), Gaps = 18/339 (5%)

Query: 22  LVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
           LVSI G N ++DCG +  F       D S +    ++   +D V++SH    H GALPY 
Sbjct: 55  LVSIGGKNVMLDCGMHMGFNDDRRFPDFSYITRNGRLTDFLDCVIISHFHLDHCGALPYF 114

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQ 133
            + +G   P++ T P   +  + + D + ++  +  E + FT   I    + V  +   Q
Sbjct: 115 SEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDCMKKVVAVRLHQ 174

Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 193
              +    E + +  + AGH+LG  +++I    E V+Y  DYN   ++HL    ++   R
Sbjct: 175 TVQVD---EELQIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CR 230

Query: 194 PAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 252
           P +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV + GR  EL ++LE +W
Sbjct: 231 PNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLETFW 290

Query: 253 AEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
              +L  PIYF T ++     Y K F+ W    I K+F   + N F  KH+    +++  
Sbjct: 291 ERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQRNMFEFKHIKAF-DRAFA 347

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 348 DNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 384


>gi|118572556|sp|Q2YDM2.2|INT11_BOVIN RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
           Full=Cleavage and polyadenylation-specific factor 3-like
           protein; Short=CPSF3-like protein
 gi|158455110|gb|AAI10156.2| CPSF3L protein [Bos taurus]
          Length = 599

 Score =  165 bits (417), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 176/356 (49%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S      ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFSDDRRFPDFSYNTRSGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T+P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTQPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP++LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPSLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W    L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMDLKAPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+          ++P GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF--DRAFADSP-GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|302832928|ref|XP_002948028.1| hypothetical protein VOLCADRAFT_79885 [Volvox carteri f.
           nagariensis]
 gi|300266830|gb|EFJ51016.1| hypothetical protein VOLCADRAFT_79885 [Volvox carteri f.
           nagariensis]
          Length = 728

 Score =  165 bits (417), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 104/361 (28%), Positives = 174/361 (48%), Gaps = 15/361 (4%)

Query: 29  NFLIDCGWNDHFDPSLLQPL--SKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFS 86
             + DCG +  F      PL      +T+D  L++H    H  A+PY +++      +F 
Sbjct: 48  TVMFDCGIHPAFKGMDSLPLLDDIDIATVDVALITHFHLDHCAAVPYLLRKTRFKGRIFM 107

Query: 87  TEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVV 146
           T P   +    + D     +  SE  LF  +D+D++ + +  + + Q   +SG    + +
Sbjct: 108 THPTKAIYYSLLRDLAKGAKHSSEEALFNEEDLDASMEQIEVVDFYQTIEVSG----MQI 163

Query: 147 APHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALH 206
            P+ AGH+LG  ++ +   G   +Y  DY+R  ++HL G      V P ++I ++     
Sbjct: 164 TPYRAGHVLGAAMFMVEVAGLRCLYTGDYSRLPDRHLPGADTPP-VTPHIVIVESTYGTS 222

Query: 207 NQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL---NYPIY 262
              PRQQRE +  D I  TL  GG VL+P+ + GR  ELLL+L++YW  H       PIY
Sbjct: 223 RHLPRQQREQLLIDNIRTTLNRGGRVLMPIVALGRAQELLLLLDEYWEAHKSELGGIPIY 282

Query: 263 FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLV 322
             + + S  +   ++++E + + I K F     N F  +HV  L N +       GP ++
Sbjct: 283 QASSMMSKALGVYQTYVESLNEDIKKVFHDR--NPFKFRHVQTLKNPAHFIADYSGPCVI 340

Query: 323 LASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVP 382
           +A+ + L++G S D F  W  D +N  +  +    GTLA+ +     P ++     RRVP
Sbjct: 341 MATPSGLQSGASRDFFEAWCEDARNTCIICDFAVQGTLAKEILGG--PSSITTREGRRVP 398

Query: 383 L 383
           L
Sbjct: 399 L 399


>gi|350585498|ref|XP_003127541.3| PREDICTED: integrator complex subunit 11-like [Sus scrofa]
          Length = 599

 Score =  164 bits (416), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 107/356 (30%), Positives = 175/356 (49%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIGGKNVMLDCGMHMGFSDDRRFPDFSYITRHGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T+P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTQPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    +    +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKAVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W    L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMDLKAPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+          ++P GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF--DRAFADSP-GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|12053137|emb|CAB66747.1| hypothetical protein [Homo sapiens]
 gi|49065540|emb|CAG38588.1| FLJ20542 [Homo sapiens]
 gi|117645260|emb|CAL38096.1| hypothetical protein [synthetic construct]
 gi|208966056|dbj|BAG73042.1| cleavage and polyadenylation specific factor 3-like [synthetic
           construct]
          Length = 600

 Score =  164 bits (416), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 177/356 (49%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T     +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHSTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|367047989|ref|XP_003654374.1| hypothetical protein THITE_2117338 [Thielavia terrestris NRRL 8126]
 gi|347001637|gb|AEO68038.1| hypothetical protein THITE_2117338 [Thielavia terrestris NRRL 8126]
          Length = 1015

 Score =  164 bits (414), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 169/635 (26%), Positives = 260/635 (40%), Gaps = 129/635 (20%)

Query: 8   TPLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDT 65
           TPL G  +E+  S  L+ +DG    L+D GW++ FD   L+ L K   T+  +LL+H   
Sbjct: 5   TPLQGALSESTASQSLLELDGGVKVLVDVGWDESFDAERLRELEKHIPTLSLILLTHATV 64

Query: 66  LHLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLSR------------------ 105
            HLGA  +  K   L    PV++T PV  LG     D Y S                   
Sbjct: 65  DHLGAYAHCCKHFPLFTRIPVYATRPVIDLGRTLTQDLYASTPVAATTISPTSLAEVAYS 124

Query: 106 -RQVSEFDLFTL------DDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGH 153
             Q S  D   L      ++I   F  +  L YSQ +            G+ +  + +GH
Sbjct: 125 YAQTSSADHNLLLQPPTPEEIARYFSLIQPLKYSQPHQPLPSPFSPPLNGLTITAYNSGH 184

Query: 154 LLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT--------------VLESFVRPAVLIT 199
            LGGT+W I    E ++YAVD+N+ +E   +G               V+E   +P  L+ 
Sbjct: 185 TLGGTIWHIQHGLESIVYAVDWNQARENVFSGAAWLGGGLGGAGGAEVIEQLRKPTALVC 244

Query: 200 DAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW-AEHSLN 258
            +          ++ E   ++I   +  GG VL+PVDS+ RVLEL  +LE  W AE + +
Sbjct: 245 SSRTPETAIARGRRDEQLLESIKLCIARGGTVLIPVDSSARVLELAYLLEHAWRAEVAKD 304

Query: 259 YPIYFLTYVS------SSTIDYVKSFLEWMGDSITKSFET----------------SRDN 296
             ++  T V        ST+   +S LEWM DSI + FE                  RD 
Sbjct: 305 NDVFKSTKVYLAGRSIGSTMRNARSMLEWMDDSIVREFEAVAGGTRGANSGAGGGKGRDA 364

Query: 297 A-FLLKHVTLLINKSEL---------DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVK 346
             F  K++ LL  K+++         D+ P G K++LA+ ASLE GFS ++    A D +
Sbjct: 365 GPFDFKYLRLLERKAQVERILQQEAGDSEPKG-KVILATDASLEWGFSKEVLKAIAGDAR 423

Query: 347 NLVLFTERGQFG----TLARML-----------------------QADPPPKAVKVTMSR 379
           NLV+ TE+        ++AR L                       Q     + +++T + 
Sbjct: 424 NLVVLTEKPNLSHGRTSIARTLWEWWKERKDGVAVEQTSSGDTFEQVYGGGRELELTETT 483

Query: 380 RVPLVGEELIAYEEE-QTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANA 438
           R  L G+EL  Y++   T+ + +  L++S     ES A +  D + +             
Sbjct: 484 RQALEGDELGLYQQWLATQRQLQATLQSSGAAALESSAEVVDDASETTTESEESETERQG 543

Query: 439 SADVVEP---HGGRYRDILID---GF-------------VPPSTSVAPMFPFYENNSEWD 479
            A  V        R + +L D   G              V        MFP        D
Sbjct: 544 KALNVSTTIGQASRKKVVLKDEDLGITILLKKRGVYDFDVRGKKGRERMFPTVIRRKRND 603

Query: 480 DFGEVINPDDYIIKDEDMDQAAMHIGGDDGKLDEG 514
           +FGE+I P++Y+  +E  D        D  + ++G
Sbjct: 604 EFGELIRPEEYLRAEERADADGQEEAQDGNRQEQG 638


>gi|397476278|ref|XP_003809534.1| PREDICTED: integrator complex subunit 11 isoform 2 [Pan paniscus]
          Length = 606

 Score =  164 bits (414), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 105/339 (30%), Positives = 171/339 (50%), Gaps = 18/339 (5%)

Query: 22  LVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
           LVSI G N ++DCG +  F       D S +    ++   +D V++SH    H GALPY 
Sbjct: 27  LVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHFHLDHCGALPYF 86

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQ 133
            + +G   P++ T P   +  + + D + ++  +  E + FT   I    + V  +   Q
Sbjct: 87  SEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDCMKKVVAVHLHQ 146

Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 193
              +  + E   +  + AGH+LG  +++I    E V+Y  DYN   ++HL    ++   R
Sbjct: 147 TVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CR 202

Query: 194 PAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 252
           P +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV + GR  EL ++LE +W
Sbjct: 203 PNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLETFW 262

Query: 253 AEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
              +L  PIYF T ++     Y K F+ W    I K+F   + N F  KH+    +++  
Sbjct: 263 ERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQRNMFEFKHIKAF-DRAFA 319

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 320 DNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 356


>gi|193786492|dbj|BAG51775.1| unnamed protein product [Homo sapiens]
          Length = 606

 Score =  164 bits (414), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 105/339 (30%), Positives = 171/339 (50%), Gaps = 18/339 (5%)

Query: 22  LVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
           LVSI G N ++DCG +  F       D S +    ++   +D V++SH    H GALPY 
Sbjct: 27  LVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHFHLDHCGALPYF 86

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQ 133
            + +G   P++ T P   +  + + D + ++  +  E + FT   I    + V  +   Q
Sbjct: 87  SEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDCMKKVVAVHLHQ 146

Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 193
              +  + E   +  + AGH+LG  +++I    E V+Y  DYN   ++HL    ++   R
Sbjct: 147 TVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CR 202

Query: 194 PAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 252
           P +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV + GR  EL ++LE +W
Sbjct: 203 PNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLETFW 262

Query: 253 AEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
              +L  PIYF T ++     Y K F+ W    I K+F   + N F  KH+    +++  
Sbjct: 263 ERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQRNMFEFKHIKAF-DRAFA 319

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 320 DNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 356


>gi|374253819|ref|NP_001243385.1| integrator complex subunit 11 isoform 1 [Homo sapiens]
 gi|119576642|gb|EAW56238.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_f
           [Homo sapiens]
          Length = 606

 Score =  164 bits (414), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 105/339 (30%), Positives = 171/339 (50%), Gaps = 18/339 (5%)

Query: 22  LVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
           LVSI G N ++DCG +  F       D S +    ++   +D V++SH    H GALPY 
Sbjct: 27  LVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHFHLDHCGALPYF 86

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQ 133
            + +G   P++ T P   +  + + D + ++  +  E + FT   I    + V  +   Q
Sbjct: 87  SEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDCMKKVVAVHLHQ 146

Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 193
              +  + E   +  + AGH+LG  +++I    E V+Y  DYN   ++HL    ++   R
Sbjct: 147 TVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CR 202

Query: 194 PAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 252
           P +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV + GR  EL ++LE +W
Sbjct: 203 PNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLETFW 262

Query: 253 AEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
              +L  PIYF T ++     Y K F+ W    I K+F   + N F  KH+    +++  
Sbjct: 263 ERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQRNMFEFKHIKAF-DRAFA 319

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 320 DNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 356


>gi|426327392|ref|XP_004024502.1| PREDICTED: integrator complex subunit 11 isoform 2 [Gorilla gorilla
           gorilla]
          Length = 606

 Score =  164 bits (414), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 105/339 (30%), Positives = 171/339 (50%), Gaps = 18/339 (5%)

Query: 22  LVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
           LVSI G N ++DCG +  F       D S +    ++   +D V++SH    H GALPY 
Sbjct: 27  LVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHFHLDHCGALPYF 86

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQ 133
            + +G   P++ T P   +  + + D + ++  +  E + FT   I    + V  +   Q
Sbjct: 87  SEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDCMKKVVAVHLHQ 146

Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 193
              +  + E   +  + AGH+LG  +++I    E V+Y  DYN   ++HL    ++   R
Sbjct: 147 TVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CR 202

Query: 194 PAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 252
           P +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV + GR  EL ++LE +W
Sbjct: 203 PNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLETFW 262

Query: 253 AEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
              +L  PIYF T ++     Y K F+ W    I K+F   + N F  KH+    +++  
Sbjct: 263 ERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQRNMFEFKHIKAF-DRAFA 319

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 320 DNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 356


>gi|158256210|dbj|BAF84076.1| unnamed protein product [Homo sapiens]
          Length = 606

 Score =  163 bits (413), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 105/339 (30%), Positives = 171/339 (50%), Gaps = 18/339 (5%)

Query: 22  LVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
           LVSI G N ++DCG +  F       D S +    ++   +D V++SH    H GALPY 
Sbjct: 27  LVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHFHLDHCGALPYF 86

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQ 133
            + +G   P++ T P   +  + + D + ++  +  E + FT   I    + V  +   Q
Sbjct: 87  SEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDCMKKVVAVHLHQ 146

Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 193
              +  + E   +  + AGH+LG  +++I    E V+Y  DYN   ++HL    ++   R
Sbjct: 147 TVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CR 202

Query: 194 PAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 252
           P +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV + GR  EL ++LE +W
Sbjct: 203 PNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLETFW 262

Query: 253 AEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
              +L  PIYF T ++     Y K F+ W    I K+F   + N F  KH+    +++  
Sbjct: 263 ERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQRNMFEFKHIKAF-DRAFA 319

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 320 DNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 356


>gi|380798915|gb|AFE71333.1| integrator complex subunit 11 isoform 2, partial [Macaca mulatta]
          Length = 588

 Score =  163 bits (413), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 105/339 (30%), Positives = 171/339 (50%), Gaps = 18/339 (5%)

Query: 22  LVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
           LVSI G N ++DCG +  F       D S +    ++   +D V++SH    H GALPY 
Sbjct: 9   LVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHFHLDHCGALPYF 68

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQ 133
            + +G   P++ T P   +  + + D + ++  +  E + FT   I    + V  +   Q
Sbjct: 69  SEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDCMKKVVAVHLHQ 128

Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 193
              +  + E   +  + AGH+LG  +++I    E V+Y  DYN   ++HL    ++   R
Sbjct: 129 TVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CR 184

Query: 194 PAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 252
           P +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV + GR  EL ++LE +W
Sbjct: 185 PNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLETFW 244

Query: 253 AEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
              +L  PIYF T ++     Y K F+ W    I K+F   + N F  KH+    +++  
Sbjct: 245 ERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQRNMFEFKHIKAF-DRAFA 301

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 302 DNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 338


>gi|427785581|gb|JAA58242.1| Putative mrna cleavage and polyadenylation factor ii complex brr5
           cpsf subunit [Rhipicephalus pulchellus]
          Length = 587

 Score =  163 bits (412), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 110/366 (30%), Positives = 177/366 (48%), Gaps = 18/366 (4%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           + VTPL    +      L+SI G N ++DCG +  F       D S +     +   +D 
Sbjct: 4   ISVTPLGAGQDVGRSCILLSIGGKNVMLDCGMHMGFNDERRFPDFSYITQEGPLNEHLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G S PV+ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYMTEMVGYSGPVYMTHPTKAICPILLEDFRKITVDRKGETNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    + V+Y  DYN
Sbjct: 124 AMIRDCMRKVVAVNLHQAVQVDDELE---IKAYYAGHVLGAAMFRIRVGSQSVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    L+   RP +LIT++  A   +  ++ RE  F   +   +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWLDK-CRPDLLITESTYATTIRDSKRCRERDFLTKVHDCIDKGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L  PIYF   ++    +Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETYWDRMNLRVPIYFAVGLTEKATNYYKMFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+    +++ +DN   GP +V A+   L AG S  IF +WA    N+V+     
Sbjct: 298 NMFDFKHIKPF-DRAFIDNP--GPMVVFATPGMLHAGLSLQIFKKWAPFEANMVIMPGYC 354

Query: 356 QFGTLA 361
             GT+ 
Sbjct: 355 VAGTVG 360


>gi|346976151|gb|EGY19603.1| cleavage and polyadenylation specificity factor subunit 2
           [Verticillium dahliae VdLs.17]
          Length = 972

 Score =  162 bits (411), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 170/620 (27%), Positives = 257/620 (41%), Gaps = 129/620 (20%)

Query: 10  LSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLH 67
           L G  +E+  S  ++ +DG    LID GW++ FD   L+ L K   T+  +LL+H  T H
Sbjct: 8   LQGARSESAASQSILELDGGVKVLIDIGWDESFDVEKLKELEKQVPTLSLILLTHATTSH 67

Query: 68  LGALPYAMKQLG--LSAPVFSTEPVYRLGLLTMYDQYLSR-RQVSEFDLFTLDDI----- 119
           L A  +  K        PV++T PV  LG     D Y S  R  +     +L ++     
Sbjct: 68  LAAFAHCCKNFPQFTRIPVYATRPVIDLGRTLTQDLYSSTPRAATTIPHDSLSEVAYSYS 127

Query: 120 -----DSAF-------QSVTR-------LTYSQNYH-----LSGKGEGIVVAPHVAGHLL 155
                DS F       + +TR       L YSQ +       S    G+ +    AGH L
Sbjct: 128 QQPTSDSNFLLQAPTPEEITRYFSLIQPLKYSQPHEPLPSPFSPPLNGLTITAFNAGHTL 187

Query: 156 GGTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAYN 203
           GGT+W I    E ++YAVD+N+ +E    G             V+E   +P  LI  +  
Sbjct: 188 GGTIWHIQHGLESIVYAVDWNQARENVFAGAAWLGGAGAGGAEVIEQLRKPTALICSSRG 247

Query: 204 ALHNQPP---RQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW------AE 254
           A  N P    R++ E   D I   +  GG VL+P DS+GRVLEL  +LE  W       +
Sbjct: 248 ADRNAPSGGRRKRDEQLIDMIKLCVSRGGTVLIPADSSGRVLELAYLLEHAWRLEVGKTD 307

Query: 255 HSLNYP-IYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA---------------- 297
            +L    +Y      SST+ Y +S LEWM D+I + FE + D                  
Sbjct: 308 SALRAAKLYLAGRNVSSTLRYARSMLEWMDDNIVREFEATADGQRKANGNDGKHAKDAAP 367

Query: 298 FLLKHVTLLINKSEL--------DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLV 349
           F  + + L+  ++++        DN     ++++AS  SLE GFS  +  E A D +NL+
Sbjct: 368 FDFRFMRLVEREAQIRKLLSQTSDNVQSEGRVIVASDNSLEWGFSQQLLRELAKDSRNLL 427

Query: 350 LFTER---GQFG--TLARML--------------QADPPP---------KAVKVTMSRRV 381
           + T++    Q G  ++AR L              Q+D            +A+ VT ++R 
Sbjct: 428 ILTDKPSLAQSGQPSIARTLWDWWQERKDGVSIDQSDSNDSIELVYGGGRALSVTDAKRQ 487

Query: 382 PLVGEELIAYEEE-QTRLKKEEALKASLVKEEESKASL-------------GPDNNLSGD 427
            L G+EL  Y++   T+ + +  L A +    E+ A +               DN   G 
Sbjct: 488 GLEGDELSTYQQWLATQRQLQATLNAGVAGSLEAPADVGDDGSSESSSDSGESDNEQQGK 547

Query: 428 PMVIDANNANAS-ADVVEPHGGRYRDILI------DGFVPPSTSVAPMFPFYENNSEWDD 480
            + I      A+   VV        ++L       D  V         FP        D 
Sbjct: 548 ALNISTTMGQATRKKVVLSDEDLGINVLTKKLGASDYDVRAKRGRERCFPLTIRRKRDDQ 607

Query: 481 FGEVINPDDYIIKDEDMDQA 500
           FGE I P+DY+  +E  + A
Sbjct: 608 FGEAIRPEDYLRAEEKEEDA 627


>gi|344283025|ref|XP_003413273.1| PREDICTED: integrator complex subunit 11-like [Loxodonta africana]
          Length = 719

 Score =  162 bits (411), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 108/354 (30%), Positives = 177/354 (50%), Gaps = 19/354 (5%)

Query: 8   TPLSGVFNENPLS-YLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDAVL 59
           TP +G   +   S  LVS+ G N ++DCG +  F       D S +    ++   +D V+
Sbjct: 125 TPRAGAGQDVGRSCILVSVAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVI 184

Query: 60  LSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDD 118
           +SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT   
Sbjct: 185 ISHFHLDHCGALPYFSEMVGYDGPIYMTPPTQAICPILLEDYRKIAVDKKGEANFFTSQM 244

Query: 119 IDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRR 178
           I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN  
Sbjct: 245 IKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMT 301

Query: 179 KEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDS 237
            ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV +
Sbjct: 302 PDRHLGAAWIDR-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFA 360

Query: 238 AGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA 297
            GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + N 
Sbjct: 361 LGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQRNM 418

Query: 298 FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 419 FEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 469


>gi|159488791|ref|XP_001702386.1| subunit of mRNA cleavage and polyadenylation specificity factor
           [Chlamydomonas reinhardtii]
 gi|158271180|gb|EDO97006.1| subunit of mRNA cleavage and polyadenylation specificity factor
           [Chlamydomonas reinhardtii]
          Length = 690

 Score =  162 bits (411), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 101/361 (27%), Positives = 174/361 (48%), Gaps = 15/361 (4%)

Query: 29  NFLIDCGWNDHFDPSLLQPLSKVAS--TIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFS 86
             + DCG +  F      PL       T+D  L++H    H  A+PY +++      +F 
Sbjct: 23  TVMFDCGIHPAFKGMDSLPLLDEIDIDTVDVALITHFHLDHCAAVPYLLRKTRFKGRIFM 82

Query: 87  TEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVV 146
           T P   +    + D     +  SE  LF  DD++++ Q +  + + Q   ++G    + +
Sbjct: 83  THPTKAIYYSLLRDLAKGSKHSSEEALFNEDDLEASMQRIEVVDFYQTIEVAG----MQI 138

Query: 147 APHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALH 206
            P+ AGH+LG  ++ +   G   +Y  DY+R  ++HL    +   V+P ++I ++     
Sbjct: 139 TPYRAGHVLGAAMFLVEVAGCRCLYTGDYSRLPDRHLPAADIPP-VKPHIVIVESTYGTS 197

Query: 207 NQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS---LNYPIY 262
              PR QRE +  D I  T+  GG V++PV + GR  ELLL+L++YW  H       PIY
Sbjct: 198 RHLPRLQREQLLLDTIRNTINRGGRVIMPVVALGRAQELLLLLDEYWEAHKSELSGIPIY 257

Query: 263 FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLV 322
             + + S  +   ++++E + D I + F     N F  +HV  L N +   +   GP ++
Sbjct: 258 QASSMMSKALGVYQTYVESLNDDIKRVFHER--NPFKFRHVQTLKNPAHFISDYSGPCVI 315

Query: 323 LASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVP 382
           +A+ + L++G S D F  W  D +N  +  +    GTLA+ +     P ++     RRVP
Sbjct: 316 MATPSGLQSGASRDFFEAWCEDSRNTCIICDFAVQGTLAKEILGG--PSSITTREGRRVP 373

Query: 383 L 383
           L
Sbjct: 374 L 374


>gi|134083194|emb|CAK42833.1| unnamed protein product [Aspergillus niger]
          Length = 865

 Score =  162 bits (411), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 114/379 (30%), Positives = 173/379 (45%), Gaps = 52/379 (13%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G   L+D GW+D FDP  LQ L K   T+  +LL+H    H+GA  +  K   L    PV
Sbjct: 27  GIKILVDVGWDDTFDPLDLQELEKHVPTLSLILLTHATPAHIGAFAHCCKTFPLFTQIPV 86

Query: 85  FSTEPVYRLGLLTMYDQYLSRRQVSEF---------------DLFTLDDIDSAFQSVTRL 129
           ++T PV  LG   + D Y S    + F                  T ++I   F  +  L
Sbjct: 87  YATSPVIALGRTLLQDLYASSPLAATFLPKATEATHAGRILLQPPTAEEIARYFSLIHPL 146

Query: 130 TYSQNYH-----LSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN 184
            YSQ +       S    G+ +  + AGH +GGT+W I    E ++YAVD+N+ +E  + 
Sbjct: 147 KYSQPHQPLPSPFSPPLNGLTLTAYNAGHTVGGTIWHIQHGMESIVYAVDWNQARESVVA 206

Query: 185 GT------------VLESFVRPAVLITDAYNALHNQPP--RQQR-EMFQDAISKTLRAGG 229
           G             V+E   +P  L+           P  R++R ++  D I  T+  GG
Sbjct: 207 GAAWFGGSGASGTEVIEQLRKPTALVCSTRGGERFALPGGRKKRDDLLLDMIRSTIAKGG 266

Query: 230 NVLLPVDSAGRVLELLLILEDYWAEHS---------LNYPIYFLTYVSSSTIDYVKSFLE 280
            VL+P D++ RVLEL   LE  W + +             +Y     +++T+   +S LE
Sbjct: 267 TVLIPTDTSARVLELAYALEHAWRDAAGSGQGDDVLKGAGLYLAGRKANTTMRLARSMLE 326

Query: 281 WMGDSITKSFETSRDNA----FLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFS 334
           WM ++I + FE + +      F  KH+ +L  K  L+   +   PK++LAS  SL+ GF+
Sbjct: 327 WMDENIVREFEAAEEGKGVGPFTFKHLRILERKKRLEKILSDQKPKVILASDTSLDWGFA 386

Query: 335 HDIFVEWASDVKNLVLFTE 353
            D     A    NL+L TE
Sbjct: 387 KDSLRLVAEGANNLLLLTE 405



 Score = 44.7 bits (104), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 60/294 (20%), Positives = 104/294 (35%), Gaps = 95/294 (32%)

Query: 468 MFPFYENNSEWDDFGEVINPDDY-----IIKDEDMDQAAMHIGGDDGKLDEG-------S 515
           MFP+     + D+FGE I P+D      + +D ++D A       +G+  EG        
Sbjct: 528 MFPYVAPRKKGDEFGEFIRPEDTADELSLAEDGEVDAAVSSEDEVEGQSFEGPAKAVYEK 587

Query: 516 ASLILDAKPSKV-----------------VSNELTVLVHGSAEATEHLKQHCLKHVCPH- 557
           A+L ++A+ + V                 +     +LV G  + T  L   C K +    
Sbjct: 588 ATLTINARLAYVDFTGLHDKRSLEMLIPLIQPRKLILVGGMKQETTALATECQKLLAAKS 647

Query: 558 -----------VYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVG 606
                      ++TP   E +D + D  A+ V+LS  L+  + ++ +    +  + A++ 
Sbjct: 648 GMDVSAADSAVIFTPVNGEVVDASVDTNAWMVKLSNNLVRRLKWQHVRSLGVVTLTAQLR 707

Query: 607 KTENGML-----------------------SLLPISTPAPPH------------------ 625
             E  +L                           ++T APP                   
Sbjct: 708 GPEQAVLEDSTEENPSKKPKLLEEEKKEEGGSTEVATNAPPEGAKPSADKSEVYPLLDVL 767

Query: 626 ------------KSVLVGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRK 666
                       + + VGDL++ADL+  +   G   EF G G L     V +RK
Sbjct: 768 PVNMAAGTRSMTRPLHVGDLRLADLRKIMQGAGHTAEFRGEGTLLIDGMVAVRK 821


>gi|417403209|gb|JAA48422.1| Putative mrna cleavage and polyadenylation factor ii complex brr5
           cpsf subunit [Desmodus rotundus]
          Length = 604

 Score =  162 bits (411), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 106/356 (29%), Positives = 175/356 (49%), Gaps = 17/356 (4%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIGGKNVMLDCGMHMGFNDDRRFPDFSYITRNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +    E + +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVD---EELQIKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPTLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++     P    +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAXXXAHPCA-MVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 351


>gi|307215032|gb|EFN89859.1| Integrator complex subunit 11 [Harpegnathos saltator]
          Length = 594

 Score =  162 bits (411), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 108/365 (29%), Positives = 184/365 (50%), Gaps = 20/365 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQP----LSKVAST--IDAV 58
           ++VTPL    +      LVS+ G N ++DCG +  F+     P    +S+ A+T  ID V
Sbjct: 4   IKVTPLGAGQDVGRSCILVSMGGKNIMLDCGMHMGFNDERRFPDFSYISEGAATDHIDCV 63

Query: 59  LLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLD 117
           ++SH    H GALPY  + +G + P++ T P   +  + + D + ++  +  E + FT  
Sbjct: 64  IISHFHLDHCGALPYFTEMVGYTGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTSQ 123

Query: 118 DIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNR 177
            I    + V  +T  Q+  +    E   +  + AGH+LG  ++ +    + ++Y  DYN 
Sbjct: 124 MIKDCIKKVIAVTLHQSVMVDPDLE---IKAYYAGHVLGAAMFWVRVGSQSIVYTGDYNM 180

Query: 178 RKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVD 236
             ++HL    ++   RP +LI+++  A   +  ++ RE  F   + + +  GG VL+PV 
Sbjct: 181 TPDRHLGAAWIDK-CRPDLLISESTYATTIRDSKRCRERDFLKKVHECIDRGGKVLIPVF 239

Query: 237 SAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN 296
           + GR  EL ++LE YW   +L  P+YF   ++    +Y K F+ W    I K+F   + N
Sbjct: 240 ALGRAQELCILLETYWERMNLKVPVYFALGLTEKANNYYKMFITWTNQKIKKTF--VQRN 297

Query: 297 AFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL---FTE 353
            F  KH+    +K+ +DN   G  +V A+   L AG S  IF +WA +  N+V+   F  
Sbjct: 298 MFDFKHIKPF-DKAYIDNP--GAMVVFATPGMLHAGLSLQIFKKWAPNESNMVIMPGFCV 354

Query: 354 RGQFG 358
           +G  G
Sbjct: 355 QGTVG 359


>gi|343429654|emb|CBQ73226.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
          Length = 1039

 Score =  162 bits (410), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 126/434 (29%), Positives = 204/434 (47%), Gaps = 84/434 (19%)

Query: 48  LSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQ 107
           L ++A TID VLLSH    HLG   YA  +LGL   V++T PV  +G LT+ +   + R 
Sbjct: 129 LRQLAPTIDLVLLSHSSLDHLGLYAYAHAKLGLRCQVYATMPVQSMGKLTVLEAIQTWR- 187

Query: 108 VSEFD-------------LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHL 154
            SE D             L T D ++ AF+ +  + Y Q  HL GK   + +  + AGH 
Sbjct: 188 -SEVDIEREAPSGLARRCLATPDQVEEAFEQIKTVRYMQPTHLEGKCASLTLTAYNAGHS 246

Query: 155 LGGTVWKI-TKDGEDVIYAVDYNRRKEKHLNGTVL-----------------ESFVRPAV 196
           LGG VWKI +     V+ A+D+N  +E+HL+GT+L                 ++  RP +
Sbjct: 247 LGGAVWKIRSPTSGTVVIALDWNHNRERHLDGTILLSSSAAAPGAPGAASGADAVRRPDL 306

Query: 197 LITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA-- 253
           LIT+    L     R+ R+    D +  T++AG ++L P+D++ R+LEL+++L+ +WA  
Sbjct: 307 LITEIERGLVVNTRRKDRDAALIDLVHTTIQAGHSLLFPIDASARLLELMVLLDQHWAYA 366

Query: 254 -EHSLNYPIYFLTYVSSSTIDYVKSFLEWMG-DSITKSFET------------------- 292
             H+  +P+  ++      I+  ++++EWM  +  TK+ ET                   
Sbjct: 367 YPHA-RFPLCLISRTGKEVIERSRTYMEWMTREWATKAGETIEAEKDKQPQRNARGGPNR 425

Query: 293 --SRDNAFLLKHVTLLINKSELDNA--PDGPKLVLASMASLEAGFSHDIFVEWASDVKNL 348
             +  +    K+V +  +   +D A   D  K+VLA   S+  G S  +   +A +  ++
Sbjct: 426 SAAASSPLDFKYVRVFPSLQAMDEAIPHDQAKVVLAVPPSMTHGPSRRLLARFAQNPNDV 485

Query: 349 VLFTERGQFGTLARMLQ---------------------ADPPPKAVKVTMSRRVPLVGEE 387
           V+   RG+ G+L R L                        P   A++  +  +VPL GEE
Sbjct: 486 VVLISRGEPGSLCRELWNAWNTHQSKGFSWAQGKLGQIVTPTKTALRFELKSKVPLEGEE 545

Query: 388 LIAY-EEEQTRLKK 400
           L A+ E EQ    K
Sbjct: 546 LRAHLEAEQAERDK 559


>gi|327288530|ref|XP_003228979.1| PREDICTED: integrator complex subunit 11-like [Anolis carolinensis]
          Length = 600

 Score =  162 bits (410), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 179/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG    +ND   F D S +    ++   +D 
Sbjct: 4   IKVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGYNDDRRFPDFSYITQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           +++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  LIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTKAICPILLEDFRKITVDKKGETNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGCESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LI+++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLISESTYATTIRDSKRCRERDFLKKVHETIERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF   ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKAPIYFSMGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFKKWAGNEKNMVIM 350


>gi|307170840|gb|EFN62951.1| Integrator complex subunit 11 [Camponotus floridanus]
          Length = 595

 Score =  162 bits (409), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 106/366 (28%), Positives = 181/366 (49%), Gaps = 21/366 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           +++TPL    +      LVS+ G N ++DCG +  F       D S +       + ID 
Sbjct: 4   IKITPLGAGQDVGRSCILVSMGGKNIMLDCGMHMGFNDERRFPDFSYIVAEGPATNYIDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G + P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFTEMVGYTGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +T  Q+  +  + E   +  + AGH+LG  ++ I    + ++Y  DYN
Sbjct: 124 QMIKDCMKKVVAVTLHQSVMVDPELE---IKAYYAGHVLGAAMFWIRVGSQSIVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LI+++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLISESTYATTIRDSKRCRERDFLKKVHECIDKGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L  P+YF   ++    +Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETYWERMNLKVPVYFALGLTEKANNYYKMFITWTNQKIKKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL---FT 352
           N F  KH+    +K+ +DN   G  +V A+   L AG S  IF +WA +  N+V+   F 
Sbjct: 298 NMFEFKHIKPF-DKAYIDNP--GAMVVFATPGMLHAGLSLQIFKKWAPNEANMVIMPGFC 354

Query: 353 ERGQFG 358
            +G  G
Sbjct: 355 VQGTVG 360


>gi|428177137|gb|EKX46018.1| hypothetical protein GUITHDRAFT_70813 [Guillardia theta CCMP2712]
          Length = 485

 Score =  162 bits (409), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 109/368 (29%), Positives = 179/368 (48%), Gaps = 20/368 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHFDPSLLQPLSK---VASTIDA 57
           ++VTPL    +      LV+I G N ++DCG    +ND       + +SK       ID 
Sbjct: 3   IKVTPLGAGQDVGKSCILVTIGGKNIMLDCGMHPGYNDERRFPDFRYISKEGNFTGLIDL 62

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQY---LSRRQVSEFDLF 114
           V++SH    H G+LPY  + LG   P+++T P   +  + + D     + RR V E D+F
Sbjct: 63  VIISHFHLDHCGSLPYFTEVLGYDGPMYATHPTKAIMPILLEDYRKISVERRGVEEKDMF 122

Query: 115 TLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVD 174
           +   I      VT     +   +    E   + P+ AGH+LG  ++ I    + ++Y  D
Sbjct: 123 SSQQIKDCMMKVTPCALEETIMIE---EDFEIRPYYAGHVLGAAMFYIRVGQQSILYTGD 179

Query: 175 YNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLL 233
           YN   ++HL G+     +RP +LIT++  A   +  ++ RE    + +S+ +R GG VL+
Sbjct: 180 YNMTPDRHL-GSARCDKLRPDLLITESTYATTIRESKRWRERDMLNQVSECVRNGGKVLI 238

Query: 234 PVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
           PV + GR  EL L+L+ +W    L  PIYF   ++     Y K ++ W    I  +F   
Sbjct: 239 PVFALGRAQELCLLLDAFWERTGLKVPIYFSAGLTEKANLYYKMYISWTNQKIKDTF--V 296

Query: 294 RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
           + N F  +H+    +++ +D    GP ++ A+   L  G S ++F +WA   KNLV+   
Sbjct: 297 KRNVFDFQHIQPF-DRAFIDRP--GPMVLFATPGMLHGGLSMEVFKKWAPSDKNLVIMPG 353

Query: 354 RGQFGTLA 361
               GTL 
Sbjct: 354 YCVAGTLG 361


>gi|156840674|ref|XP_001643716.1| hypothetical protein Kpol_1009p4 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156114339|gb|EDO15858.1| hypothetical protein Kpol_1009p4 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 778

 Score =  162 bits (409), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 99/326 (30%), Positives = 163/326 (50%), Gaps = 16/326 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLSRRQVS 109
           S ID +L+SH    H  +LPY MK+      VF T P   +YR  L             S
Sbjct: 59  SKIDVLLISHFHLDHAASLPYVMKRTNFQGRVFMTHPTKAIYRWLLRDFVRVTSIGTTSS 118

Query: 110 EFD--LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGE 167
           E D  L+T +D+  +F  +  +    +YH +    GI      AGH+LG  +++I   G 
Sbjct: 119 EKDENLYTDEDLADSFDKIETI----DYHSTMDVNGIKFTAFHAGHVLGAAMFQIEIAGL 174

Query: 168 DVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRA 227
            V++  DY+R  ++HLN   +       +++   +    ++P   + +     I  T+  
Sbjct: 175 RVLFTGDYSREMDRHLNSAEVPPLPSDVLIVESTFGTATHEPRLNREKKLTQLIHSTVGR 234

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEH-----SLNYPIYFLTYVSSSTIDYVKSFLEWM 282
           GG VL+PV + GR  EL+LIL++YW++H     S   PIY+ + ++   +   ++++  M
Sbjct: 235 GGRVLMPVFALGRAQELMLILDEYWSQHADELGSGQVPIYYASNLAKKCMSVYQTYVNMM 294

Query: 283 GDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWA 342
            D I K F  S+ N F+ KH++ L N  E  +   GP ++LAS   L+ G S D+  +W 
Sbjct: 295 NDDIRKKFRDSQTNPFIFKHISYLKNLDEFQDF--GPSVMLASPGMLQNGLSRDLLEKWC 352

Query: 343 SDVKNLVLFTERGQFGTLARMLQADP 368
            + KN+VL T     GT+A+ +  +P
Sbjct: 353 PEDKNMVLITGYSVEGTMAKYIMLEP 378


>gi|281348165|gb|EFB23749.1| hypothetical protein PANDA_020173 [Ailuropoda melanoleuca]
          Length = 591

 Score =  161 bits (408), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 104/339 (30%), Positives = 170/339 (50%), Gaps = 18/339 (5%)

Query: 22  LVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
           LVSI G N ++DCG +  F       D S +    ++   +D V++SH    H GALPY 
Sbjct: 12  LVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITRNGRLTDFLDCVIISHFHLDHCGALPYF 71

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQ 133
            + +G   P++ T P   +  + + D + ++  +  E + FT   I    + V  +   Q
Sbjct: 72  SEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDCMKKVVAVHLHQ 131

Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 193
              +  + E   +  + AGH+LG  +++I    E V+Y  DYN   ++HL    ++   R
Sbjct: 132 TVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDR-CR 187

Query: 194 PAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 252
           P +LIT++  A   +  ++ RE  F   + + +  GG VL+PV + GR  EL ++LE +W
Sbjct: 188 PNLLITESTYATTIRDSKRCRERDFLKKVHEAVERGGKVLIPVFALGRAQELCILLETFW 247

Query: 253 AEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
              +L  PIYF T ++     Y K F+ W    I K+F   + N F  KH+    +++  
Sbjct: 248 ERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQRNMFEFKHIKAF-DRAFA 304

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 305 DNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 341


>gi|452981499|gb|EME81259.1| hypothetical protein MYCFIDRAFT_140021 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 938

 Score =  161 bits (408), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 124/409 (30%), Positives = 191/409 (46%), Gaps = 63/409 (15%)

Query: 8   TPLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDT 65
           TPL G    +P S  L+ +DG    L+D GW++ FD   LQ L K  ST+  +LL+H   
Sbjct: 5   TPLLGAQTASPASQSLLELDGGVKILVDVGWDETFDTGKLQALEKHVSTLSVILLTHATI 64

Query: 66  LHLGALPYAMKQL-GLS-APVFSTEPVYRLGLLTMYDQYLSRRQVS-------------- 109
            H+GA  +  K + G +  PV++T PV  LG     D Y S    +              
Sbjct: 65  EHIGAYAHCCKHVPGFAKVPVYATTPVVNLGRTLAADIYASSPSAAITIPASSIGPLNSN 124

Query: 110 -EFDLF----TLDDIDSAFQSVTRLTYSQNYHLSGKGE-----GIVVAPHVAGHLLGGTV 159
              +L     T +++ + F ++  L YSQ +             + +  + AGH  GGT+
Sbjct: 125 ATPNLLLPAPTAEEVATYFSAIHPLKYSQPHQPLPSPWSPPLGNLTITAYSAGHTPGGTI 184

Query: 160 WKITKDGEDVIYAVDYNRRKEKHLNGT-------------VLESFVRPAVLITDAYNALH 206
           W I    E ++YA D+N+ +E  L+G              ++E   RP  L+  +     
Sbjct: 185 WHIQHSLESIVYAADWNQGRENLLSGAAWLGGSGAGGGAEIIEPLRRPTALVCSSRGVEK 244

Query: 207 NQP-PRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS-------- 256
               PR++R E     I +T+  GG VL+P DS+ RVLEL  IL   W E++        
Sbjct: 245 TDVLPRKKRDETLISLIRETIAQGGKVLIPTDSSARVLELAFILNHTWRENTSGPHADTY 304

Query: 257 LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS-----RD-----NAFLLKHVTLL 306
            N  IY  +  S+ST+  ++S LEWM D+I +  E +     RD     N    K V  +
Sbjct: 305 RNAKIYMASKSSTSTVRQLQSMLEWMDDTIIQDAERAMNKGQRDDDKAPNLLDWKFVKQI 364

Query: 307 INKSELDNA--PDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
             +++ D A     P ++LAS AS+E G+S     + ++D +NLV+ TE
Sbjct: 365 ERQTQFDRALRRRSPCIMLASDASMEWGYSRQALEKLSADPRNLVVLTE 413



 Score = 39.7 bits (91), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 17/45 (37%), Positives = 26/45 (57%), Gaps = 2/45 (4%)

Query: 468 MFPFYENNSEWDDFGEVINPDDYIIKDE--DMDQAAMHIGGDDGK 510
           MFPF     + DD+G++I P+DY+  +E  D+D   M  G   G+
Sbjct: 572 MFPFVSRRPKHDDYGDIIKPEDYLRAEERDDVDGVDMRDGAKQGE 616


>gi|303391170|ref|XP_003073815.1| putative beta-lactamase fold-containing exonuclease
           [Encephalitozoon intestinalis ATCC 50506]
 gi|303302963|gb|ADM12455.1| putative beta-lactamase fold-containing exonuclease
           [Encephalitozoon intestinalis ATCC 50506]
          Length = 496

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 110/394 (27%), Positives = 188/394 (47%), Gaps = 34/394 (8%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           + V PL    +      LV+I+G   + DCG +  F       D S +         ID 
Sbjct: 1   MNVVPLGAGQDVGRSCILVTINGRTVMFDCGMHMGFNDERRFPDFSYISKTKNFDKVIDC 60

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLG--LLTMYDQYLSRRQVSEFDLFT 115
           +++SH    H GALPY  +  G S P++ T P   +   LL  + + +  +  S   +F+
Sbjct: 61  IIISHFHLDHCGALPYFTEVCGYSGPIYMTLPTKEVCPVLLDDFRKIVGGKGDS---IFS 117

Query: 116 LDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDY 175
             DI +  + V  ++ ++ Y      E   + P+ AGH+LG  ++ ++   + V+Y  DY
Sbjct: 118 YQDISNCMKKVVTISMNETYK---HDENFYITPYYAGHVLGAAMFHVSVGDQSVVYTGDY 174

Query: 176 NRRKEKHLNGTVLESFVRPAVLITDA-YNALHNQPPRQQREMFQDAISKTLRAGGNVLLP 234
           +   +KHL    ++  +RP +LIT++ Y ++     R +   F  A+S  +  GG VL+P
Sbjct: 175 STTPDKHLGPASIKC-IRPDLLITESTYGSITRDCRRVKEREFLKAVSDCIARGGRVLIP 233

Query: 235 VDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSIT-KSFETS 293
           + + GR  EL L+L+ YW    L  P+YF + ++    +  K F+ +  +++  K FE  
Sbjct: 234 IFALGRAQELCLLLDGYWERTGLEIPVYFSSGLTEKANEIYKKFIGYTNETVKRKIFER- 292

Query: 294 RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
             N F  KH+     +  +DN   GP ++ AS   L +G S  IF EW  D KNLV+   
Sbjct: 293 --NVFEYKHIKPF-QRYYMDNK--GPMVLFASPGMLHSGMSLRIFKEWCEDEKNLVIIPG 347

Query: 354 RGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEE 387
               GT+   +          +  ++R+ ++GEE
Sbjct: 348 YCVRGTIGEKI----------LNGAKRLEILGEE 371


>gi|355744837|gb|EHH49462.1| hypothetical protein EGM_00117, partial [Macaca fascicularis]
          Length = 592

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 105/339 (30%), Positives = 169/339 (49%), Gaps = 18/339 (5%)

Query: 22  LVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
           LVSI G N ++DCG +  F       D S +    ++   +D V++SH    H GALPY 
Sbjct: 13  LVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHFHLDHCGALPYF 72

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQ 133
            + +G   P++ T P   +  + + D + ++  +  E + FT   I    +        Q
Sbjct: 73  SEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDCMKKEVAGHLHQ 132

Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 193
              +  + E   +  + AGH+LG  +++I    E V+Y  DYN   E+HL    ++   R
Sbjct: 133 TVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPERHLGAAWIDK-CR 188

Query: 194 PAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 252
           P +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV + GR  EL ++LE +W
Sbjct: 189 PNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLETFW 248

Query: 253 AEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
              +L  PIYF T ++     Y K F+ W    I K+F   + N F  KH+    +++  
Sbjct: 249 ERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQRNMFEFKHIKAF-DRAFA 305

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 306 DNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 342


>gi|334321967|ref|XP_001364674.2| PREDICTED: integrator complex subunit 11-like [Monodelphis
           domestica]
          Length = 600

 Score =  160 bits (406), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 106/339 (31%), Positives = 172/339 (50%), Gaps = 18/339 (5%)

Query: 22  LVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
           LVSI G N ++DCG    +ND   F D S +    ++   +D V++SH    H GALPY 
Sbjct: 21  LVSIAGKNVMLDCGMHMGYNDDRRFPDFSYITQNGRLTDFLDCVIISHFHLDHCGALPYF 80

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQ 133
            + +G   P++ T P   +  + + D + ++  +  E + FT   I    + V  +   Q
Sbjct: 81  SEMVGYDGPIYMTHPTKAICPILLEDYRKITVDKKGETNFFTSQMIKDCMKKVVAVHLHQ 140

Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 193
              +  + E   +  + AGH+LG  +++I    E  +Y  DYN   ++HL    ++   R
Sbjct: 141 TVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESAVYTGDYNMTPDRHLGAAWIDK-CR 196

Query: 194 PAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 252
           P +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV + GR  EL ++LE +W
Sbjct: 197 PNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLETFW 256

Query: 253 AEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
              +L  PIYF T ++     Y K F+ W    I K+F   + N F  KH+    +++  
Sbjct: 257 ERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQRNMFEFKHIKAF-DRAFA 313

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 314 DNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350


>gi|356525973|ref|XP_003531594.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-I-like [Glycine max]
          Length = 688

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 113/372 (30%), Positives = 188/372 (50%), Gaps = 26/372 (6%)

Query: 7   VTPLSGVFNENPLSYL-VSIDGFNFLIDCGWNDHFDP-SLLQPLSKV-ASTIDAVLLSHP 63
           VTPL G  NE   S + +S  G + L DCG +  F   S L    ++  ST+D +L++H 
Sbjct: 22  VTPL-GAGNEVGRSCVYMSYKGKSILFDCGIHLGFSGMSALPYFDEIDPSTLDVLLITHF 80

Query: 64  DTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDI 119
              H  +LPY +++      VF   +T+ +Y+L    +   ++   +VS  D LF   DI
Sbjct: 81  HLDHAASLPYFLEKTTFRGRVFMTYATKAIYKL----LLSDFVKVSKVSVEDMLFDEQDI 136

Query: 120 DSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRK 179
           + +   +  + + Q   ++G    I    + AGH+LG  ++ +   G  V+Y  DY+R +
Sbjct: 137 NRSMDKIEVIDFHQTVEVNG----IRFWCYAAGHVLGAAMFMVDIAGVRVLYTGDYSREE 192

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAG 239
           ++HL    +  F     +I   Y   H+QP   + + F D I  T+  GG VL+P  + G
Sbjct: 193 DRHLRAAEIPQFSPDVCIIESTYGVQHHQPRHTREKRFTDVIHSTISQGGRVLIPAYALG 252

Query: 240 RVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA 297
           R  ELLLIL++YWA H    N PIY+ + ++   +   +++   M D +    + ++ N 
Sbjct: 253 RAQELLLILDEYWANHPELHNIPIYYASPLAKKCLTVYETYTLSMNDRV----QNAKSNP 308

Query: 298 FLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQ 356
           F  KH++ L   S ++   D GP +V+AS   L++G S  +F +W SD KN  +      
Sbjct: 309 FSFKHISAL---SSIEVFKDVGPSVVMASPGGLQSGLSRQLFDKWCSDKKNTCVLPGFVV 365

Query: 357 FGTLARMLQADP 368
            GTLA+ +  +P
Sbjct: 366 EGTLAKTIMTEP 377


>gi|374110195|gb|AEY99100.1| FAGR279Cp [Ashbya gossypii FDAG1]
          Length = 771

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 95/329 (28%), Positives = 171/329 (51%), Gaps = 20/329 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYL-----S 104
           S ++ +L+SH    H  +LPY M++      VF T P   +YR  LL+ + +       S
Sbjct: 61  SQVEVLLISHFHLDHAASLPYVMQRTNFQGRVFMTHPTKAIYRW-LLSDFVKVTNIGNDS 119

Query: 105 RRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
              VS+ +L+T +D+  +F  +  +    +YH +    GI    + AGH+LG  ++++  
Sbjct: 120 AGGVSDENLYTDEDLAESFDRIETV----DYHSTIDVNGIKFTAYHAGHVLGAAMFQVEI 175

Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
            G  +++  DY+R  ++HLN   + +     +++   +    ++P   + +     I  T
Sbjct: 176 AGLRILFTGDYSRELDRHLNSAEIPTLPSDILIVESTFGTATHEPRTSKEKKLTQLIHTT 235

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNY-----PIYFLTYVSSSTIDYVKSFL 279
           +  GG VLLPV + GR  E++LIL++YW++H+        PI++ + ++   +   ++++
Sbjct: 236 VSKGGRVLLPVFALGRAQEIMLILDEYWSQHAEQLGNGQVPIFYASNLARKCMSVFQTYV 295

Query: 280 EWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFV 339
             M D I K F  S+ N F+ K+++ L N  E  +   GP ++LAS   L+ G S D+  
Sbjct: 296 NMMNDKIRKKFRDSQTNPFIFKNISYLKNLDEFQDF--GPSVMLASPGMLQNGLSRDLLE 353

Query: 340 EWASDVKNLVLFTERGQFGTLARMLQADP 368
           +W  D KNLVL T     GT+A+ L  +P
Sbjct: 354 KWCPDEKNLVLITGYSVEGTMAKFLMLEP 382


>gi|71017515|ref|XP_758988.1| hypothetical protein UM02841.1 [Ustilago maydis 521]
 gi|46098766|gb|EAK83999.1| hypothetical protein UM02841.1 [Ustilago maydis 521]
          Length = 979

 Score =  160 bits (404), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 144/521 (27%), Positives = 228/521 (43%), Gaps = 137/521 (26%)

Query: 15  NENP--LSYLVSIDGFNFLIDCGWNDHF----------------DPSLLQP--------- 47
            E+P  L+YL+ +D    LIDCG  + F                  S  QP         
Sbjct: 42  QEHPRALAYLLQMDDVRVLIDCGSTEDFLFHGTSSQSDDSADAEAESQPQPESSSMAQQR 101

Query: 48  ------------------LSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP 89
                             L ++ASTID VLLSH    HLG   YA   LGL   V++T P
Sbjct: 102 QASDLDINHLKAAPLDTLLRQLASTIDLVLLSHSSLDHLGLYAYAHANLGLRCQVYATMP 161

Query: 90  VYRLGLLTMYDQYLSRRQVSEFD-------------LFTLDDIDSAFQSVTRLTYSQNYH 136
           V  +G LT+ +   + R  SE D             L T D ++ AF+ +  + Y Q  H
Sbjct: 162 VQSMGKLTVLEAIQTWR--SEVDIEKECTSASTRRCLATPDQVEDAFEEIKTVRYMQPTH 219

Query: 137 LSGKGEGIVVAPHVAGHLLGGTVWKI-TKDGEDVIYAVDYNRRKEKHLNGTVL------- 188
           L GK   + +  + AGH LGG VWKI +     V+ A+D+N  +E+HL+GT+L       
Sbjct: 220 LEGKCASLTLTAYNAGHSLGGAVWKIRSPTSGTVVIALDWNHNRERHLDGTILLSSSAAA 279

Query: 189 -----------ESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVD 236
                      ++  RP +LIT+    L     R+ R+    D +  T++AG ++L PVD
Sbjct: 280 PGAPGSGASASDAVRRPDLLITEIERGLVVNTRRKDRDAALIDLVHTTIQAGNSLLFPVD 339

Query: 237 SAGRVLELLLILEDYWA---EHSLNYPIYFLTYVSSSTIDYVKSFLEWMG-DSITKSFET 292
           ++ R+LEL+++L+ +WA    H+  +P+  ++      I+  ++++EWM  +  TK+ ET
Sbjct: 340 ASARLLELMVLLDQHWAYAYPHA-RFPLCLISRTGKEVIERSRTYMEWMTREWATKANET 398

Query: 293 ------------SRDNA-------------FLLKHVTLLINKSELDNA--PDGPKLVLAS 325
                        + NA                K+V +      +D A   D  K+VLA 
Sbjct: 399 IEADKDTLPAKMQQRNARGGGLRPAAASSPLDFKYVKVFPTLQAMDEAIPQDQAKVVLAV 458

Query: 326 MASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML--------------------Q 365
             S+  G S  +   +A +  ++V+   RG+ G+L R L                    Q
Sbjct: 459 PPSMTHGPSRKLLARFAQNPNDVVVLISRGEPGSLCRELWDAWNTNQSKGFSWSQGKLGQ 518

Query: 366 A-DPPPKAVKVTMSRRVPLVGEELIAYEE----EQTRLKKE 401
           A      +++  +  +VPL G+EL A+ E    E+ RL ++
Sbjct: 519 AVVASNTSLRFELKSKVPLEGDELRAHREAEQAERERLAQQ 559


>gi|429243009|ref|NP_594263.2| mRNA cleavage and polyadenylation specificity factor complex
           endoribonuclease subunit Ysh1 [Schizosaccharomyces pombe
           972h-]
 gi|384872669|sp|O13794.2|YSH1_SCHPO RecName: Full=Endoribonuclease ysh1; AltName: Full=mRNA
           3'-end-processing protein ysh1
 gi|347834169|emb|CAB16227.2| mRNA cleavage and polyadenylation specificity factor complex
           endoribonuclease subunit Ysh1 [Schizosaccharomyces
           pombe]
          Length = 757

 Score =  160 bits (404), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 101/320 (31%), Positives = 171/320 (53%), Gaps = 14/320 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVS-EF 111
           ST+D +L+SH    H+ +LPY M++      VF T P   +    + D Y+    V  E 
Sbjct: 69  STVDVLLISHFHLDHVASLPYVMQKTNFRGRVFMTHPTKAVCKWLLSD-YVKVSNVGMED 127

Query: 112 DLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIY 171
            L+   D+ +AF  +  +    +YH + + EGI   P+ AGH+LG  ++ +   G ++++
Sbjct: 128 QLYDEKDLLAAFDRIEAV----DYHSTIEVEGIKFTPYHAGHVLGACMYFVEMAGVNILF 183

Query: 172 AVDYNRRKEKHLNGTVLESFVRPAVLITDA-YNALHNQPPRQQREMFQDAISKTLRAGGN 230
             DY+R +++HL+   +    RP VLIT++ Y    +QP  ++     + I  T+R GG 
Sbjct: 184 TGDYSREEDRHLHVAEVPP-KRPDVLITESTYGTASHQPRLEKEARLLNIIHSTIRNGGR 242

Query: 231 VLLPVDSAGRVLELLLILEDYWAEH--SLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITK 288
           VL+PV + GR  ELLLIL++YW  H    + PIY+ + ++   +   ++++  M D+I K
Sbjct: 243 VLMPVFALGRAQELLLILDEYWNNHLDLRSVPIYYASSLARKCMAIFQTYVNMMNDNIRK 302

Query: 289 SFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNL 348
            F  +  N F+ + V  L N  + D+   GP ++LAS   L+ G S  +   WA D +N 
Sbjct: 303 IF--AERNPFIFRFVKSLRNLEKFDDI--GPSVILASPGMLQNGVSRTLLERWAPDPRNT 358

Query: 349 VLFTERGQFGTLARMLQADP 368
           +L T     GT+A+ +  +P
Sbjct: 359 LLLTGYSVEGTMAKQITNEP 378


>gi|241245173|ref|XP_002402434.1| cleavage and polyadenylation specificity factor, putative [Ixodes
           scapularis]
 gi|215496345|gb|EEC05985.1| cleavage and polyadenylation specificity factor, putative [Ixodes
           scapularis]
          Length = 596

 Score =  160 bits (404), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 115/390 (29%), Positives = 188/390 (48%), Gaps = 26/390 (6%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           + VTPL    +      L+SI G N ++DCG    +ND     D S +     +   +D 
Sbjct: 4   ISVTPLGAGQDVGRSCILLSIGGKNIMLDCGMHMGYNDERRFPDFSYVTQEGPLNDHLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           +++SH    H GALPY  + +G + PV+ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  LIISHFHLDHCGALPYMTEMVGYAGPVYMTHPTKAICPILLEDFRKITVDRKGETNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  ++ I    + V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVNLHQTVQVDDELE---IKAYYAGHVLGAAMFHIRVGSQSVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   +   +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDFLTKVHDCIDKGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L  PIYF   ++    +Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETYWDRMNLKVPIYFAVGLTEKATNYYKMFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+    +++ +DN   GP +V A+   L AG S  IF +WA    N+V+     
Sbjct: 298 NMFDFKHIKPF-DRAFIDNP--GPMVVFATPGMLHAGLSLQIFKKWAPVEGNMVIMPGYC 354

Query: 356 QFGTL-------ARMLQADPPPKAVKVTMS 378
             GT+       AR ++ D   + V+V MS
Sbjct: 355 VAGTVGHKILSGARKVELD-NRQVVEVKMS 383


>gi|347838796|emb|CCD53368.1| similar to cleavage and polyadenylation specificity factor subunit
           2 [Botryotinia fuckeliana]
          Length = 934

 Score =  159 bits (403), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 157/582 (26%), Positives = 236/582 (40%), Gaps = 120/582 (20%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G   LID GW++ FD   L+ L K   T+  +LL+H    H+ A  +  K   L    PV
Sbjct: 26  GIKVLIDVGWDETFDVEKLRELEKQIPTLSLILLTHATVPHIAAYAHCCKHFPLFTRIPV 85

Query: 85  FSTEPVYRLGLLTMYDQYLSRR-------QVSEFDLF--TLDDIDSAFQSVTRLTYSQNY 135
           ++T PV  LG   + D Y S           S F L   T ++I+  F  V  L YSQ +
Sbjct: 86  YATHPVIALGRTLLQDLYCSTPLASTIIPTTSSFLLQSPTKEEINYYFSLVRPLKYSQPH 145

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEK------------HL 183
                  G+ +  + AGH LGGT+W I    E ++YAVD+N+ +E               
Sbjct: 146 Q---PLNGVTITAYNAGHSLGGTIWHIQHGLESIVYAVDWNQARENVLAGAAWLGGAGAG 202

Query: 184 NGTVLESFVRPAVLITDAYNALHNQPP--RQQR-EMFQDAISKTLRAGGNVLLPVDSAGR 240
              V+E   +P  LI  +        P  R +R E+  D I  ++  GG VL+P DS  R
Sbjct: 203 GAEVIEQLRKPTALICSSKGGERVALPGGRAKRDELLLDMIRSSISRGGIVLIPTDSGAR 262

Query: 241 VLELLLILEDYWAEHSL-------NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET- 292
           ++EL  +LE  W   +        +   Y     S  T+ Y +S  EWM ++I + FE  
Sbjct: 263 MMELAYLLEHAWRTENQEEESAFKSAKPYLAVSTSEMTMRYTRSMFEWMDEAIIREFEAQ 322

Query: 293 ----------SRDNA---------FLLKHVTLLINKSELD---NAPDG-----PKLVLAS 325
                      R NA         F  KH+ LL  K ++D   N  D       K++LAS
Sbjct: 323 PGHEEQRTGQQRRNAEEAKQHIGPFEFKHLRLLGRKGQIDRMLNETDNLGRSVGKVILAS 382

Query: 326 MASLEAGFSHDIFVEWASDVKNLVLFTER-----GQFGTLARML---------------- 364
             S+E GFS ++  + A D KNL++ TER     G  G L R L                
Sbjct: 383 DTSIEWGFSKEVLCKIADDDKNLLILTERLNPISGAPG-LGRTLWSWWEERRDGVISEPS 441

Query: 365 -------QADPPPKAVKVTMSRRVPLVGEELIAYEEE-QTRLKKEEALKASLVKEEESKA 416
                  Q     + +++   +R+PL G +L  Y++   T+ + +  L+       E+ A
Sbjct: 442 SNGGVLEQVYGGGRDLEIKEPKRIPLEGNDLTVYQQWLATQRQLQTTLQPGGATALEASA 501

Query: 417 SL-------------GPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILI-------- 455
            +               +N   G  + I A    A+   +   G    D+ +        
Sbjct: 502 DIVDDASSDSSSDSDDSENEQQGKALNISATMGQANRKKI---GLSDEDLGVNILLRKKG 558

Query: 456 --DGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKDE 495
             D  V        MFP        DDFGE+I P +++  +E
Sbjct: 559 VHDFDVRGKKGRDKMFPMAIRRKRNDDFGELIRPGEFLRAEE 600


>gi|297837375|ref|XP_002886569.1| hypothetical protein ARALYDRAFT_475225 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297332410|gb|EFH62828.1| hypothetical protein ARALYDRAFT_475225 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 693

 Score =  159 bits (403), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 114/385 (29%), Positives = 188/385 (48%), Gaps = 40/385 (10%)

Query: 2   GTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCG------------WNDHFDPSLLQPLS 49
           G  + VTPL            +S  G N L DCG            + D  DPS      
Sbjct: 19  GDQLIVTPLGAGSEVGRSCVYMSFRGKNILFDCGIHPAYSGMAALPYFDEIDPS------ 72

Query: 50  KVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRR 106
               +ID +L++H    H  +LPY +++   +  VF   +T+ +Y+L LLT Y + +S+ 
Sbjct: 73  ----SIDVLLITHFHIDHAASLPYFLEKTTFNGRVFMTHATKAIYKL-LLTDYVK-VSKV 126

Query: 107 QVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG 166
            V +  LF   DI+ +   +  + + Q   ++G    I    + AGH+LG  ++ +   G
Sbjct: 127 SVEDM-LFDEQDINKSMDKIEVIDFHQTVEVNG----IKFWCYTAGHVLGAAMFMVDIAG 181

Query: 167 EDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTL 225
             ++Y  DY+R +++HL    L  F  P + I ++ + +     R  RE  F D I  T+
Sbjct: 182 VRILYTGDYSREEDRHLRAAELPQF-SPDICIIESTSGVQLHQSRHIREKRFTDVIHSTV 240

Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMG 283
             GG VL+P  + GR  ELLLIL++YWA H    N PIY+ + ++   +   ++++  M 
Sbjct: 241 AQGGRVLIPAFALGRAQELLLILDEYWANHPDLHNIPIYYASPLAKKCMAVYQTYILSMN 300

Query: 284 DSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWAS 343
           D I   F  S  N F+ KH++ L +  + ++   GP +V+A+   L++G S  +F  W S
Sbjct: 301 DRIRNQFANS--NPFVFKHISPLNSIDDFNDV--GPSVVMATPGGLQSGLSRQLFDSWCS 356

Query: 344 DVKNLVLFTERGQFGTLARMLQADP 368
           D KN  +       GTLA+ +  +P
Sbjct: 357 DKKNACIIPGYMVEGTLAKTIINEP 381


>gi|321468347|gb|EFX79332.1| hypothetical protein DAPPUDRAFT_304859 [Daphnia pulex]
          Length = 597

 Score =  159 bits (402), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 103/367 (28%), Positives = 179/367 (48%), Gaps = 17/367 (4%)

Query: 3   TSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQP-LSKVA-----STID 56
           T ++VTPL    +      L+ + G N ++DCG +  ++     P  S +A      ++D
Sbjct: 2   TDIKVTPLGAGQDVGRSCILLQMGGKNIMLDCGMHMGYNDERRFPDFSYIADGNLTESLD 61

Query: 57  AVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFT 115
            V++SH    H GALP+  + +G + P++ T P   +  + + D + ++  +  E + FT
Sbjct: 62  CVIISHFHLDHCGALPFMTEMVGYNGPIYMTHPTKAIAPILLEDMRKVAVERKGETNFFT 121

Query: 116 LDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDY 175
              I    + V  +T  Q   +  + E   +  + AGH+LG  ++ +    + V+Y  DY
Sbjct: 122 SAHIKDCMKKVIAVTLHQTVQVDSEIE---IKAYYAGHVLGAAMFHVKVGNQSVVYTGDY 178

Query: 176 NRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLP 234
           N   ++HL    ++   RP +LI+++  A   +  ++ RE  F   +   +  GG VL+P
Sbjct: 179 NMTPDRHLGAAWIDK-CRPNILISESTYATTIRDSKRCRERDFLKKVHDCVDRGGKVLIP 237

Query: 235 VDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSR 294
           V + GR  EL ++LE YW   +L  PIYF   ++    +Y K F+ W    I K+F   +
Sbjct: 238 VFALGRAQELCILLETYWERMNLKAPIYFAVGLTEKANNYYKMFITWTNQKIRKTF--VQ 295

Query: 295 DNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
            N F  KH+    +KS  D    GP +V A+   L AG S  +F +WA +  N+++    
Sbjct: 296 RNMFEFKHIRPF-DKSYADTP--GPMVVFATPGMLHAGLSLQLFKKWAPNENNMLIMPGY 352

Query: 355 GQFGTLA 361
              GT+ 
Sbjct: 353 CVSGTVG 359


>gi|195145328|ref|XP_002013648.1| GL24247 [Drosophila persimilis]
 gi|194102591|gb|EDW24634.1| GL24247 [Drosophila persimilis]
          Length = 154

 Score =  159 bits (402), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 68/148 (45%), Positives = 105/148 (70%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  +SG  +E+P  Y++ ID    L+DCGW++ FD + ++ L +   T+DAVLL
Sbjct: 1   MTSIIKLHTISGAMDESPPCYILQIDDVRILLDCGWDEKFDANFIKELKRQVHTLDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD  HLGALPY + +LGL+ P+F+T PV+++G + MYD Y+S   + +FDLF+LDD+D
Sbjct: 61  SHPDAYHLGALPYLVGKLGLNCPIFATIPVFKMGQMFMYDLYMSHFNMGDFDLFSLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAP 148
           +AF+ +T+L Y+Q   L GKG GI + P
Sbjct: 121 TAFEKITQLKYNQTVSLKGKGYGISITP 148


>gi|302309512|ref|NP_986945.2| AGR279Cp [Ashbya gossypii ATCC 10895]
 gi|442570103|sp|Q74ZC0.2|YSH1_ASHGO RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
           3'-end-processing protein YSH1
 gi|299788393|gb|AAS54769.2| AGR279Cp [Ashbya gossypii ATCC 10895]
          Length = 771

 Score =  159 bits (402), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 94/329 (28%), Positives = 171/329 (51%), Gaps = 20/329 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLSRRQ-- 107
           S ++ +L+SH    H  +LPY M++      VF T P   +YR  LL+ + +  +     
Sbjct: 61  SQVEVLLISHFHLDHAASLPYVMQRTNFQGRVFMTHPTKAIYRW-LLSDFVKVTNIGNDN 119

Query: 108 ---VSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
              VS+ +L+T +D+  +F  +  +    +YH +    GI    + AGH+LG  ++++  
Sbjct: 120 AGGVSDENLYTDEDLAESFDRIETV----DYHSTIDVNGIKFTAYHAGHVLGAAMFQVEI 175

Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
            G  +++  DY+R  ++HLN   + +     +++   +    ++P   + +     I  T
Sbjct: 176 AGLRILFTGDYSRELDRHLNSAEIPTLPSDILIVESTFGTATHEPRTSKEKKLTQLIHTT 235

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNY-----PIYFLTYVSSSTIDYVKSFL 279
           +  GG VLLPV + GR  E++LIL++YW++H+        PI++ + ++   +   ++++
Sbjct: 236 VSKGGRVLLPVFALGRAQEIMLILDEYWSQHAEQLGNGQVPIFYASNLARKCMSVFQTYV 295

Query: 280 EWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFV 339
             M D I K F  S+ N F+ K+++ L N  E  +   GP ++LAS   L+ G S D+  
Sbjct: 296 NMMNDKIRKKFRDSQTNPFIFKNISYLKNLDEFQDF--GPSVMLASPGMLQNGLSRDLLE 353

Query: 340 EWASDVKNLVLFTERGQFGTLARMLQADP 368
           +W  D KNLVL T     GT+A+ L  +P
Sbjct: 354 KWCPDEKNLVLITGYSVEGTMAKFLMLEP 382


>gi|254582142|ref|XP_002497056.1| ZYRO0D14410p [Zygosaccharomyces rouxii]
 gi|238939948|emb|CAR28123.1| ZYRO0D14410p [Zygosaccharomyces rouxii]
          Length = 772

 Score =  159 bits (402), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 106/369 (28%), Positives = 185/369 (50%), Gaps = 22/369 (5%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLSRRQVS 109
           S +D +L+SH    H  +LPY M++      VF T P   +YR  LL  + +  S    +
Sbjct: 60  SKVDILLISHFHVDHAASLPYVMQKTNFQGRVFMTHPTKAIYRW-LLRDFVRVTSIGNSA 118

Query: 110 ---EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG 166
              + +L+T +D+  +F  +  +    +YH +    GI    + AGH+LG  +++I   G
Sbjct: 119 TGKDENLYTDEDLAESFDRIETI----DYHSTVDVGGIKFTAYHAGHVLGAAMFQIEIAG 174

Query: 167 EDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLR 226
             V++  DY+R  ++HLN   +  F    +++   +    ++P   +       I  T+ 
Sbjct: 175 LRVLFTGDYSRELDRHLNSAEIPPFPSDVLIVESTFGTATHEPRINRERKLTQLIHSTVT 234

Query: 227 AGGNVLLPVDSAGRVLELLLILEDYWAEHSL-----NYPIYFLTYVSSSTIDYVKSFLEW 281
            GG VLLPV + GR  EL+LIL++YW++H+        PIY+ + ++   +   ++++  
Sbjct: 235 KGGRVLLPVFALGRAQELMLILDEYWSQHAEELGGGQVPIYYASNLARKCMSVFQTYVNM 294

Query: 282 MGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEW 341
           M D I + F  S+ N F+ K+++ L N  E  +   GP ++LAS   L+ G S ++   W
Sbjct: 295 MNDDIRRKFRDSQTNPFVFKNISYLKNIDEFQDF--GPSVMLASPGMLQNGLSREVLERW 352

Query: 342 ASDVKNLVLFTERGQFGTLAR--MLQAD--PPPKAVKVTMSRRVPLVGEELIAYEEEQTR 397
             + KNLVL T     GT+A+  ML+ D  P     ++T+ RR  +      A+ + Q  
Sbjct: 353 CPEGKNLVLITGYSVEGTMAKFLMLEPDTIPSINNPEITIPRRCQIEEISFAAHVDFQEN 412

Query: 398 LKKEEALKA 406
           L+  E + A
Sbjct: 413 LEFIEKISA 421


>gi|321457255|gb|EFX68345.1| hypothetical protein DAPPUDRAFT_218302 [Daphnia pulex]
          Length = 597

 Score =  159 bits (402), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 103/367 (28%), Positives = 179/367 (48%), Gaps = 17/367 (4%)

Query: 3   TSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQP-LSKVA-----STID 56
           T ++VTPL    +      L+ + G N ++DCG +  ++     P  S +A      ++D
Sbjct: 2   TDIKVTPLGAGQDVGRSCILLQMGGKNIMLDCGMHMGYNDERRFPDFSYIADGNLTESLD 61

Query: 57  AVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFT 115
            V++SH    H GALP+  + +G + P++ T P   +  + + D + ++  +  E + FT
Sbjct: 62  CVIISHFHLDHCGALPFMTEMVGYNGPIYMTHPTKAIAPILLEDMRKVAVERKGETNFFT 121

Query: 116 LDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDY 175
              I    + V  +T  Q   +  + E   +  + AGH+LG  ++ +    + V+Y  DY
Sbjct: 122 SAHIKDCMKKVIAVTLHQTVQVDSEIE---IKAYYAGHVLGAAMFHVKVGNQSVVYTGDY 178

Query: 176 NRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLP 234
           N   ++HL    ++   RP +LI+++  A   +  ++ RE  F   +   +  GG VL+P
Sbjct: 179 NMTPDRHLGAAWIDK-CRPNILISESTYATTIRDSKRCRERDFLKKVHDCVDRGGKVLIP 237

Query: 235 VDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSR 294
           V + GR  EL ++LE YW   +L  PIYF   ++    +Y K F+ W    I K+F   +
Sbjct: 238 VFALGRAQELCILLETYWERMNLKAPIYFAVGLTEKANNYYKMFITWTNQKIRKTF--VQ 295

Query: 295 DNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
            N F  KH+    +KS  D    GP +V A+   L AG S  +F +WA +  N+++    
Sbjct: 296 RNMFEFKHIRPF-DKSYADTP--GPMVVFATPGMLHAGLSLQLFKKWAPNENNMLIMPGY 352

Query: 355 GQFGTLA 361
              GT+ 
Sbjct: 353 CVSGTVG 359


>gi|15219848|ref|NP_176297.1| cleavage and polyadenylation specificity factor subunit 3-I
           [Arabidopsis thaliana]
 gi|30696512|ref|NP_849835.1| cleavage and polyadenylation specificity factor subunit 3-I
           [Arabidopsis thaliana]
 gi|79320389|ref|NP_001031215.1| cleavage and polyadenylation specificity factor subunit 3-I
           [Arabidopsis thaliana]
 gi|75262219|sp|Q9C952.1|CPSF3_ARATH RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 3-I; AltName: Full=Cleavage and polyadenylation
           specificity factor 73 kDa subunit I; Short=AtCPSF73-I;
           Short=CPSF 73 kDa subunit I
 gi|12323330|gb|AAG51638.1|AC018908_4 putative cleavage and polyadenylation specificity factor;
           72745-70039 [Arabidopsis thaliana]
 gi|23297661|gb|AAN13003.1| putative cleavage and polyadenylation specificity factor
           [Arabidopsis thaliana]
 gi|24415578|gb|AAN41458.1| putative cleavage and polyadenylation specificity factor 73 kDa
           subunit [Arabidopsis thaliana]
 gi|222422865|dbj|BAH19419.1| AT1G61010 [Arabidopsis thaliana]
 gi|222423059|dbj|BAH19511.1| AT1G61010 [Arabidopsis thaliana]
 gi|332195645|gb|AEE33766.1| cleavage and polyadenylation specificity factor subunit 3-I
           [Arabidopsis thaliana]
 gi|332195646|gb|AEE33767.1| cleavage and polyadenylation specificity factor subunit 3-I
           [Arabidopsis thaliana]
 gi|332195647|gb|AEE33768.1| cleavage and polyadenylation specificity factor subunit 3-I
           [Arabidopsis thaliana]
          Length = 693

 Score =  159 bits (402), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 114/385 (29%), Positives = 188/385 (48%), Gaps = 40/385 (10%)

Query: 2   GTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCG------------WNDHFDPSLLQPLS 49
           G  + VTPL            +S  G N L DCG            + D  DPS      
Sbjct: 19  GDQLIVTPLGAGSEVGRSCVYMSFRGKNILFDCGIHPAYSGMAALPYFDEIDPS------ 72

Query: 50  KVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRR 106
               +ID +L++H    H  +LPY +++   +  VF   +T+ +Y+L LLT Y + +S+ 
Sbjct: 73  ----SIDVLLITHFHIDHAASLPYFLEKTTFNGRVFMTHATKAIYKL-LLTDYVK-VSKV 126

Query: 107 QVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG 166
            V +  LF   DI+ +   +  + + Q   ++G    I    + AGH+LG  ++ +   G
Sbjct: 127 SVEDM-LFDEQDINKSMDKIEVIDFHQTVEVNG----IKFWCYTAGHVLGAAMFMVDIAG 181

Query: 167 EDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTL 225
             ++Y  DY+R +++HL    L  F  P + I ++ + +     R  RE  F D I  T+
Sbjct: 182 VRILYTGDYSREEDRHLRAAELPQF-SPDICIIESTSGVQLHQSRHIREKRFTDVIHSTV 240

Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMG 283
             GG VL+P  + GR  ELLLIL++YWA H    N PIY+ + ++   +   ++++  M 
Sbjct: 241 AQGGRVLIPAFALGRAQELLLILDEYWANHPDLHNIPIYYASPLAKKCMAVYQTYILSMN 300

Query: 284 DSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWAS 343
           D I   F  S  N F+ KH++ L +  + ++   GP +V+A+   L++G S  +F  W S
Sbjct: 301 DRIRNQFANS--NPFVFKHISPLNSIDDFNDV--GPSVVMATPGGLQSGLSRQLFDSWCS 356

Query: 344 DVKNLVLFTERGQFGTLARMLQADP 368
           D KN  +       GTLA+ +  +P
Sbjct: 357 DKKNACIIPGYMVEGTLAKTIINEP 381


>gi|396082329|gb|AFN83939.1| putative beta-lactamase fold-containingexonuclease [Encephalitozoon
           romaleae SJ-2008]
          Length = 496

 Score =  159 bits (402), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 109/366 (29%), Positives = 176/366 (48%), Gaps = 23/366 (6%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQP----LSKVAS---TIDA 57
           + V PL    +      LV+I G   + DCG +  F+     P    +SK  S    ID 
Sbjct: 1   MNVVPLGAGQDVGRSCVLVTIGGRTIMFDCGMHMGFNDERRFPDFSYISKTKSFDKAIDC 60

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLD 117
           V++SH    H GALPY  +  G + PV+ T P   +    + D +    +     +FT  
Sbjct: 61  VVISHFHLDHCGALPYFTEVCGYNGPVYMTLPTKEV-CPVLLDDFRKIVEGKGDSIFTYQ 119

Query: 118 DIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNR 177
           DI +  + VT +  ++ Y      E   + P+ AGH+LG  ++ +    + V+Y  DY+ 
Sbjct: 120 DILNCMKKVTTINMNETYK---HDEDFYITPYYAGHVLGAAMFHVVVGDQSVVYTGDYST 176

Query: 178 RKEKHLNGTVLESFVRPAVLITDA-YNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVD 236
             +KHL    ++  VRP +LIT++ Y ++     R +   F  A+S  +  GG VL+P+ 
Sbjct: 177 TPDKHLGPASIKC-VRPDLLITESTYGSITRDCRRVKEREFLKAVSDCIARGGRVLIPIF 235

Query: 237 SAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS-FETSRD 295
           + GR  EL L+L+ YW    L  P+YF + ++    +  K F+ +  +++ +  FE    
Sbjct: 236 ALGRAQELCLLLDGYWERTGLKIPVYFSSGLTEKANEIYKKFISYTNETVKRKIFER--- 292

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL---FT 352
           N F  KH+     K  ++N   GP ++ AS   L +G S  +F EW  D KNLV+   + 
Sbjct: 293 NVFEYKHIKPF-QKYYMENK--GPMVLFASPGMLHSGMSLRMFKEWCEDEKNLVIIPGYC 349

Query: 353 ERGQFG 358
            RG  G
Sbjct: 350 VRGTIG 355


>gi|154292337|ref|XP_001546744.1| hypothetical protein BC1G_14624 [Botryotinia fuckeliana B05.10]
          Length = 901

 Score =  159 bits (402), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 157/582 (26%), Positives = 238/582 (40%), Gaps = 120/582 (20%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G   LID GW++ FD   L+ L K   T+  +LL+H    H+ A  +  K   L    PV
Sbjct: 26  GIKVLIDVGWDETFDVEKLRELEKQIPTLSLILLTHATVPHIAAYAHCCKHFPLFTRIPV 85

Query: 85  FSTEPVYRLGLLTMYDQYLSRR-------QVSEFDLF--TLDDIDSAFQSVTRLTYSQNY 135
           ++T PV  LG   + D Y S           S F L   T ++I+  F  V  L YSQ +
Sbjct: 86  YATHPVIALGRTLLQDLYCSTPLASTIIPTTSSFLLQSPTKEEINYYFSLVRPLKYSQPH 145

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEK------------HL 183
                  G+ +  + AGH LGGT+W I    E ++YAVD+N+ +E               
Sbjct: 146 Q---PLNGVTITAYNAGHSLGGTIWHIQHGLESIVYAVDWNQARENVLAGAAWLGGAGAG 202

Query: 184 NGTVLESFVRPAVLITDAYNALHNQPP--RQQR-EMFQDAISKTLRAGGNVLLPVDSAGR 240
              V+E   +P  LI  +        P  R +R E+  D I  ++  GG VL+P DS  R
Sbjct: 203 GAEVIEQLRKPTALICSSKGGERVALPGGRAKRDELLLDMIRSSISRGGIVLIPTDSGAR 262

Query: 241 VLELLLILEDYWAEHSL-------NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET- 292
           ++EL  +LE  W   +        +   Y     S  T+ Y +S  EWM ++I + FE  
Sbjct: 263 MMELAYLLEHAWRTENQEEESAFKSAKPYLAVSTSEMTMRYTRSMFEWMDEAIIREFEAQ 322

Query: 293 ----------SRDNA---------FLLKHVTLLINKSELD---NAPDG-----PKLVLAS 325
                      R NA         F  KH+ LL  K ++D   N  D       K++LAS
Sbjct: 323 PGHEEQRTGQQRRNAEEAKQHIGPFEFKHLRLLGRKGQIDRMLNETDNLGRSVGKVILAS 382

Query: 326 MASLEAGFSHDIFVEWASDVKNLVLFTER-----GQFGTLARMLQ-----------ADPP 369
             S+E GFS ++  + A D KNL++ TER     G  G L R L            ++P 
Sbjct: 383 DTSIEWGFSKEVLCKIADDDKNLLILTERLNPISGAPG-LGRTLWSWWEERRDGVISEPS 441

Query: 370 P------------KAVKVTMSRRVPLVGEELIAYEEE-QTRLKKEEALKASLVKEEESKA 416
                        + +++   +R+PL G +L  Y++   T+ + +  L+       E+ A
Sbjct: 442 SNGGVLEQVYGGGRDLEIKEPKRIPLEGNDLTVYQQWLATQRQLQTTLQPGGATALEASA 501

Query: 417 SL-------------GPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILI-------- 455
            +               +N   G  + I A    A+   +   G    D+ +        
Sbjct: 502 DIVDDASSDSSSDSDDSENEQQGKALNISATMGQANRKKI---GLSDEDLGVNILLRKKG 558

Query: 456 --DGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKDE 495
             D  V        MFP        DDFGE+I P +++  +E
Sbjct: 559 VHDFDVRGKKGRDKMFPMAIRRKRNDDFGELIRPGEFLRAEE 600


>gi|302899216|ref|XP_003048005.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256728937|gb|EEU42292.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 958

 Score =  159 bits (402), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 123/424 (29%), Positives = 188/424 (44%), Gaps = 78/424 (18%)

Query: 9   PLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTL 66
           PL G  +E+P S  L+ +DG    L+D GW++ FD   L+ + K  +T+  +L++H    
Sbjct: 6   PLQGALSESPASQSLLELDGGVKVLVDLGWDESFDAGKLKEIEKQVTTLSLILVTHATAS 65

Query: 67  HLGALPYAMKQLG--LSAPVFSTEPVYRLGLLTMYDQYLS---------RRQVSEFDLF- 114
           HL A  +  K +      PV++T PV  LG   + D Y S         +  +SE     
Sbjct: 66  HLAAYAHCCKNIPQFTRIPVYATRPVIDLGRTLIQDLYNSSPAAATTIPQSSLSETAFSF 125

Query: 115 ---------------TLDDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGHL 154
                          T +DI   F  +  L YSQ +            G+ +  + +GH 
Sbjct: 126 AQTATTAQNLLLQSPTNEDIARYFSLIQPLKYSQPHQPLPSPFSPPLNGLTITAYNSGHT 185

Query: 155 LGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAY 202
           LGGT+W I    E ++YAVD+N+ +E    G             V+E   +P  LI  + 
Sbjct: 186 LGGTIWHIQHGLESIVYAVDWNQARENVFAGAAWLGGAGGGGAEVIEQLRKPTALICSSR 245

Query: 203 NALHNQPP--RQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLN- 258
            A        R +R E   D I   +  GG VL+PVDS+ RVLEL  +LE  W   + + 
Sbjct: 246 GADRTAQAGGRAKRDEQLIDTIKACVTRGGTVLIPVDSSARVLELSYLLEHAWRTDAASE 305

Query: 259 ------YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET--------------SRDNAF 298
                   +Y      SST+ Y +S LEWM D+I + FE                    F
Sbjct: 306 DGVLKAAKLYLAGRNMSSTMRYARSMLEWMDDTIVQEFEAFAEGQRKVNGAGDKKEGGPF 365

Query: 299 LLKHVTLLINKSEL--------DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
             K++ LL  K+++        +N     +++LAS +S+E GFS D+    A D +NLV+
Sbjct: 366 DFKYLRLLERKAQIVRLLSRGFENVETEGRVILASDSSIEWGFSKDLIKGLARDSRNLVI 425

Query: 351 FTER 354
            T++
Sbjct: 426 LTDK 429



 Score = 44.3 bits (103), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 54/234 (23%), Positives = 88/234 (37%), Gaps = 61/234 (26%)

Query: 523 KPSKVVSNELTVLVHGSAEATEHLKQHCLKHV-------------CPHVYTPQIEETIDV 569
           KP K++      LV G  E T  L + C + +                VYTP+I   +D 
Sbjct: 733 KPRKLI------LVGGGREETLALAEDCRRALGGDAAAGDGSSERTVDVYTPEIGTLVDA 786

Query: 570 TSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWV----------DAEVG---------KTEN 610
           + D  A+ V+L++ L+  + ++ +    I  +          DA  G         KTE 
Sbjct: 787 SVDTNAWVVKLADSLVKKIKWQNVRGLGIVTITGQLLATKLDDAPAGDQDAANKRQKTEE 846

Query: 611 GMLSLLPISTPAP-PHKSVL----------------VGDLKMADLKPFLSSKGIQVEFAG 653
              + L     +P P   VL                VGDL++ADL+  + S G   EF G
Sbjct: 847 SSTTALSTVVASPMPTLDVLPANLVSAVRSAAQPLHVGDLRLADLRRAMQSAGHTAEFRG 906

Query: 654 -GALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
            G L     V +RK        G    + + +       +Y++R  +Y    ++
Sbjct: 907 EGTLVVDGTVAVRKTA-----AGRVEVESVGMPTARRSTFYEVRKVIYDNLAVV 955


>gi|18377654|gb|AAL66977.1| putative cleavage and polyadenylation specificity factor
           [Arabidopsis thaliana]
          Length = 693

 Score =  159 bits (401), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 114/385 (29%), Positives = 188/385 (48%), Gaps = 40/385 (10%)

Query: 2   GTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCG------------WNDHFDPSLLQPLS 49
           G  + VTPL            +S  G N L DCG            + D  DPS      
Sbjct: 19  GDQLIVTPLGAGSEVGRSCVYMSFRGKNILFDCGIHPAYSGMAALPYFDEIDPS------ 72

Query: 50  KVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRR 106
               +ID +L++H    H  +LPY +++   +  VF   +T+ +Y+L LLT Y + +S+ 
Sbjct: 73  ----SIDVLLITHFHIDHAASLPYFLEKTTFNGRVFMTHATKAIYKL-LLTDYVK-VSKV 126

Query: 107 QVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG 166
            V +  LF   DI+ +   +  + + Q   ++G    I    + AGH+LG  ++ +   G
Sbjct: 127 SVEDM-LFDEQDINKSMDKIEVIDFHQTVEVNG----IKFWCYTAGHVLGAAMFMVDIAG 181

Query: 167 EDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTL 225
             ++Y  DY+R +++HL    L  F  P + I ++ + +     R  RE  F D I  T+
Sbjct: 182 VRILYTGDYSREEDRHLRAAELPQF-SPDICIIESTSGVQLHQSRHIREKRFTDVIHSTV 240

Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMG 283
             GG VL+P  + GR  ELLLIL++YWA H    N PIY+ + ++   +   ++++  M 
Sbjct: 241 AQGGRVLIPAFALGRAQELLLILDEYWANHPDLHNIPIYYASPLAKKCMAVYQTYILSMN 300

Query: 284 DSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWAS 343
           D I   F  S  N F+ KH++ L +  + ++   GP +V+A+   L++G S  +F  W S
Sbjct: 301 DRIRNQFANS--NPFVFKHISPLNSIDDFNDV--GPSVVMATPGGLQSGLSRQLFDSWCS 356

Query: 344 DVKNLVLFTERGQFGTLARMLQADP 368
           D KN  +       GTLA+ +  +P
Sbjct: 357 DKKNACIIPGYMVEGTLAKTIINEP 381


>gi|442751667|gb|JAA67993.1| Putative cleavage and polyadenylation specificity factor cpsf
           subunit [Ixodes ricinus]
          Length = 596

 Score =  159 bits (401), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 114/390 (29%), Positives = 187/390 (47%), Gaps = 26/390 (6%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           + VTPL    +      L+SI G N ++DCG    +ND     D S +     +   +D 
Sbjct: 4   ISVTPLGAGQDVGRSCILLSIGGKNIMLDCGMHMGYNDERRFPDFSYVTQEGPLNDHLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           +++ H    H GALPY  + +G + PV+ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  LIIGHFHLDHCGALPYMTEMVGYAGPVYMTHPTKAICPILLEDFRKITVDRKGETNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  ++ I    + V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVNLHQTVQVDDELE---IKAYYAGHVLGAAMFHIRVGSQSVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   +   +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDFLTKVHDCIDEGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L  PIYF   ++    +Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETYWDRMNLKVPIYFAVGLTEKATNYYKMFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+    +++ +DN   GP +V A+   L AG S  IF +WA    N+V+     
Sbjct: 298 NMFDFKHIKPF-DRAFIDNP--GPMVVFATPGMLHAGLSLQIFKKWAPVEGNMVIMPGYC 354

Query: 356 QFGTL-------ARMLQADPPPKAVKVTMS 378
             GT+       AR ++ D   + V+V MS
Sbjct: 355 VAGTVGHKILSGARKVELD-NRQVVEVKMS 383


>gi|410074967|ref|XP_003955066.1| hypothetical protein KAFR_0A04950 [Kazachstania africana CBS 2517]
 gi|372461648|emb|CCF55931.1| hypothetical protein KAFR_0A04950 [Kazachstania africana CBS 2517]
          Length = 769

 Score =  159 bits (401), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 105/370 (28%), Positives = 187/370 (50%), Gaps = 22/370 (5%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLS---RR 106
           S++D +L+SH    H  +LPY M++      VF T P   +YR  LL  + +  S     
Sbjct: 59  SSVDILLISHFHLDHAASLPYVMQRTNFKGRVFMTHPTKAIYRW-LLRDFVRVTSIGINS 117

Query: 107 QVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG 166
              + +L+T +D+  +F  +  +    +YH +    GI    + AGH+LG  +++I   G
Sbjct: 118 TGEDDNLYTDEDLVESFDKIETI----DYHSTVDVNGIKFTAYHAGHVLGAAMFQIEIAG 173

Query: 167 EDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLR 226
             V++  DY+R  ++HLN   +       +++   +    ++P   + +     I  T+ 
Sbjct: 174 LRVLFTGDYSRETDRHLNSAEVPPLSSDILIVESTFGTATHEPRLSREKKLTQLIHTTVS 233

Query: 227 AGGNVLLPVDSAGRVLELLLILEDYWAEHS-----LNYPIYFLTYVSSSTIDYVKSFLEW 281
            GG VL+PV + GR  EL+LIL+++W++H+        PI++ + ++   +   ++++  
Sbjct: 234 QGGRVLMPVFALGRAQELMLILDEFWSQHADELGGGQVPIFYASDLARKCMSVFQTYVNM 293

Query: 282 MGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEW 341
           M D I K F  S+ N F+ K+++ L N  E  +   GP ++LAS   L++G S D+   W
Sbjct: 294 MNDDIRKKFRDSQTNPFIFKNISYLKNLEEFQDF--GPSVMLASPGMLQSGISRDLLERW 351

Query: 342 ASDVKNLVLFTERGQFGTLAR--MLQADPPPKAV--KVTMSRRVPLVGEELIAYEEEQTR 397
             D KNLVL T     GT+A+  ML+ D  P     ++T+ RR  +      A+ + Q  
Sbjct: 352 CPDDKNLVLITGYSVEGTMAKFIMLEPDTIPSVNNPEITIPRRCQVEEISFAAHVDFQEN 411

Query: 398 LKKEEALKAS 407
           L+  E + A+
Sbjct: 412 LEFIEKINAN 421


>gi|50287519|ref|XP_446189.1| hypothetical protein [Candida glabrata CBS 138]
 gi|74637743|sp|Q6FUA5.1|YSH1_CANGA RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
           3'-end-processing protein YSH1
 gi|49525496|emb|CAG59113.1| unnamed protein product [Candida glabrata]
          Length = 771

 Score =  159 bits (401), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 106/371 (28%), Positives = 183/371 (49%), Gaps = 23/371 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLS----R 105
           S +D +L+SH    H  +LPY M++      VF T P   +YR  LL  + +  S     
Sbjct: 60  SIVDVLLISHFHLDHAASLPYVMQKTNFKGRVFMTHPTKAIYRW-LLRDFVRVTSIGSQS 118

Query: 106 RQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKD 165
               + +L++ +D+  +F  +  +    +YH      GI      AGH+LG  +++I   
Sbjct: 119 SNAEDDNLYSNEDLIESFDKIETI----DYHSMIDVNGIKFTAFHAGHVLGAAMFQIEIA 174

Query: 166 GEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTL 225
           G  V++  DY+R  ++HLN   +       +++   +    ++P   + +     I  T+
Sbjct: 175 GLRVLFTGDYSREIDRHLNSAEVPPLPSDILIVESTFGTATHEPRLHREKKLTQLIHSTV 234

Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWAEH-----SLNYPIYFLTYVSSSTIDYVKSFLE 280
             GG VL+PV + GR  EL+LIL++YW++H     S   PI++ + ++   +   ++++ 
Sbjct: 235 NKGGRVLMPVFALGRAQELMLILDEYWSQHKEELGSNQIPIFYASNLARKCLSVFQTYVN 294

Query: 281 WMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVE 340
            M D+I K F  S+ N F+ K++  + N  E  +   GP ++LAS   L+ G S D+   
Sbjct: 295 MMNDNIRKKFRDSQTNPFIFKNIAYIKNLDEFQDF--GPSVMLASPGMLQNGLSRDLLER 352

Query: 341 WASDVKNLVLFTERGQFGTLAR--MLQADPPPKAV--KVTMSRRVPLVGEELIAYEEEQT 396
           W  D KNLVL T     GT+A+  +L+ D  P     +VT+ RR  +      A+ + Q 
Sbjct: 353 WCPDEKNLVLITGYSVEGTMAKYLLLEPDTIPSVSNPEVTIPRRCRVEELSFAAHVDFQE 412

Query: 397 RLKKEEALKAS 407
            L+  E + AS
Sbjct: 413 NLEFIEQINAS 423


>gi|310799284|gb|EFQ34177.1| RNA-metabolising metallo-beta-lactamase [Glomerella graminicola
           M1.001]
          Length = 984

 Score =  158 bits (400), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 165/621 (26%), Positives = 249/621 (40%), Gaps = 148/621 (23%)

Query: 9   PLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTL 66
           PL G  +E+  S  ++ +DG    LID GW++ FD   LQ L K   T+  +LL+H  T 
Sbjct: 6   PLQGALSESSASQSILELDGGVKILIDLGWDESFDVEKLQELEKQVPTLSLILLTHATTS 65

Query: 67  HLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLS-------------------- 104
           HL A  +  K   L    PV++T PV  LG     D Y S                    
Sbjct: 66  HLAAFAHCCKNFPLFTRIPVYATRPVIDLGRTLTQDLYASTPLAATKIPLGSLTEAAYSF 125

Query: 105 RRQVSEFDLFTLD-----DIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGHL 154
            +Q +    F L      +I   F  +  L YSQ +            G+++  + AGH 
Sbjct: 126 SQQSTAGSEFLLQAPSPAEITRYFSLIQPLKYSQPHEPLPSPFSPPLNGLIITAYNAGHS 185

Query: 155 LGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAY 202
           LGGT+W I    E ++YAVD+N+ KE    G             V+E   +P  L+  + 
Sbjct: 186 LGGTIWHIQHGLESIVYAVDWNQAKENVFAGAAWLGGAGGGGADVIEQLRKPTALVCSSR 245

Query: 203 NA--LHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW------- 252
            A  +     R +R E   D I   +  GG  L+PVDS+ RVLE+  +LE  W       
Sbjct: 246 GAEKVAQAGGRAKRDEQLIDMIKTCVARGGTALIPVDSSARVLEIAYLLEHAWRADSESD 305

Query: 253 AEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA--------------- 297
           +    +  +Y      SST+ Y +S LEWM D+I + FE+  D                 
Sbjct: 306 SSSLKSAKLYLAGRNMSSTLRYARSMLEWMDDNIVREFESVADGQRKANGTEAKSKEGVP 365

Query: 298 FLLKHVTLLINKSEL--------DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLV 349
           F  +++ L+  ++++        DN     +++LAS  +LE GFS D+    A D +NLV
Sbjct: 366 FDFRYLKLVERRAQIEKLLSGSGDNVQAEGRVILASDDTLEWGFSKDLIRGLAKDSRNLV 425

Query: 350 LFTE-----RGQFGTLARML------QADPPP-----------------KAVKVTMSRRV 381
           + T+     R +  ++AR L      + D                    + ++V  ++R 
Sbjct: 426 ILTDKPAKSRAEQPSIARTLWDWWTERRDGVAVEQSSNGNNLELVYGGGRELEVQEAKRQ 485

Query: 382 PLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLGP-------------------DN 422
            L GEEL  Y   Q  L  +  L+A+L  +    ASL                     DN
Sbjct: 486 ALEGEELNVY---QQWLATQRQLQATL--QSGGGASLQAPADAADDVSSDSSTDSGESDN 540

Query: 423 NLSGDPMVIDANNANASA------------DVVEPHGGRYRDILIDGFVPPSTSVAPMFP 470
              G  + I      A+             +++    G Y D  + G      S    FP
Sbjct: 541 EQQGKALNISTTMGQATRKKVVLTDEDLGINILTKKRGAY-DFDVRGKKGRERS----FP 595

Query: 471 FYENNSEWDDFGEVINPDDYI 491
                   D FG+VI P+DY+
Sbjct: 596 LVMRRRRDDQFGDVIRPEDYL 616


>gi|443725897|gb|ELU13297.1| hypothetical protein CAPTEDRAFT_184406 [Capitella teleta]
          Length = 668

 Score =  158 bits (400), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 93/317 (29%), Positives = 165/317 (52%), Gaps = 12/317 (3%)

Query: 55  IDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLF 114
           ID +L+SH    H G LP+ +++ G     F T     +    + D        +E  L+
Sbjct: 52  IDLLLVSHFHLDHAGGLPWFLEKTGFKGRCFMTHASKAIYRWLLSDYVKVSNIATEQQLY 111

Query: 115 TLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVD 174
              DI+++   +  +    N+H   +  GI    + AGH+LG  ++ I   G  V+Y  D
Sbjct: 112 QDSDIEASMDKIETV----NFHQETEVNGIKFCAYTAGHVLGAAMFMIEIAGVKVLYTGD 167

Query: 175 YNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLL 233
           ++R +++HL    + + V+P VLIT++    H   PR++RE  F   IS  +  GG  L+
Sbjct: 168 FSREEDRHLMAAEIPN-VKPDVLITESTYGTHIHEPREEREGRFTSLISDIVNRGGRCLI 226

Query: 234 PVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE 291
           PV + GR  ELLLIL++YW++H    + PIY+ + ++   +   ++++  M D I +   
Sbjct: 227 PVFALGRAQELLLILDEYWSQHPELQDIPIYYASSLAKKCMSVYQTYINAMNDKIKRQIN 286

Query: 292 TSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           T  +N F+ KH++ L +    D+   GP +V+AS   +++G S ++F  W +D +N  + 
Sbjct: 287 T--NNPFVFKHISNLKSMEHFDDI--GPSVVMASPGMMQSGLSRELFENWCTDKRNGCII 342

Query: 352 TERGQFGTLARMLQADP 368
                 GTLA+ + ++P
Sbjct: 343 AGYCVEGTLAKHILSEP 359


>gi|125546484|gb|EAY92623.1| hypothetical protein OsI_14368 [Oryza sativa Indica Group]
          Length = 700

 Score =  158 bits (400), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 117/387 (30%), Positives = 184/387 (47%), Gaps = 44/387 (11%)

Query: 2   GTSVQVTPLSGVFNENPLSYL-VSIDGFNFLIDCG------------WNDHFDPSLLQPL 48
           G  + +TPL G  NE   S + +S  G   L DCG            + D  DPS     
Sbjct: 28  GDQLIITPL-GAGNEVGRSCVYMSFKGRTVLFDCGIHPAYSGMAALPYFDEIDPS----- 81

Query: 49  SKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSR 105
                TID +L++H    H  +LPY +++      VF   +T+ +YRL    +   Y+  
Sbjct: 82  -----TIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYRL----LLSDYVKV 132

Query: 106 RQVSEFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
            +VS  D LF   DI  +   +  + + Q   ++G    I    + AGH+LG  ++ +  
Sbjct: 133 SKVSVEDMLFDEQDILRSMDKIEVIDFHQTLEVNG----IRFWCYTAGHVLGAAMFMVDI 188

Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
            G  V+Y  DY+R +++HL    L  F     +I   Y    +QP   + + F D I  T
Sbjct: 189 AGVRVLYTGDYSREEDRHLKAAELPQFSPDICIIESTYGVQQHQPRHVREKRFTDVIHTT 248

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWM 282
           +  GG VL+P  + GR  ELLLIL++YWA H      PIY+ + ++   +   ++++  M
Sbjct: 249 VSQGGRVLIPAFALGRAQELLLILDEYWANHPELHKIPIYYASPLAKKCMAVYQTYINSM 308

Query: 283 GDSITKSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEW 341
            + I   F  S  N F  KH+  L   + +DN  D GP +V+AS   L++G S  +F +W
Sbjct: 309 NERIRNQFAQS--NPFHFKHIESL---NSIDNFHDVGPSVVMASPGGLQSGLSRQLFDKW 363

Query: 342 ASDVKNLVLFTERGQFGTLARMLQADP 368
            +D KN  +       GTLA+ +  +P
Sbjct: 364 CTDKKNSCVIPGYVVEGTLAKTIINEP 390


>gi|115456655|ref|NP_001051928.1| Os03g0852900 [Oryza sativa Japonica Group]
 gi|27573349|gb|AAO20067.1| putative cleavage and polyadenylation specifity factor protein
           [Oryza sativa Japonica Group]
 gi|29126360|gb|AAO66552.1| putative cleavage and polyadenylation specifity factor [Oryza
           sativa Japonica Group]
 gi|108712151|gb|ABF99946.1| Cleavage and polyadenylation specificity factor, 73 kDa subunit,
           putative, expressed [Oryza sativa Japonica Group]
 gi|113550399|dbj|BAF13842.1| Os03g0852900 [Oryza sativa Japonica Group]
 gi|125588676|gb|EAZ29340.1| hypothetical protein OsJ_13407 [Oryza sativa Japonica Group]
          Length = 700

 Score =  158 bits (400), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 117/387 (30%), Positives = 184/387 (47%), Gaps = 44/387 (11%)

Query: 2   GTSVQVTPLSGVFNENPLSYL-VSIDGFNFLIDCG------------WNDHFDPSLLQPL 48
           G  + +TPL G  NE   S + +S  G   L DCG            + D  DPS     
Sbjct: 28  GDQLIITPL-GAGNEVGRSCVYMSFKGRTVLFDCGIHPAYSGMAALPYFDEIDPS----- 81

Query: 49  SKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSR 105
                TID +L++H    H  +LPY +++      VF   +T+ +YRL    +   Y+  
Sbjct: 82  -----TIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYRL----LLSDYVKV 132

Query: 106 RQVSEFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
            +VS  D LF   DI  +   +  + + Q   ++G    I    + AGH+LG  ++ +  
Sbjct: 133 SKVSVEDMLFDEQDILRSMDKIEVIDFHQTLEVNG----IRFWCYTAGHVLGAAMFMVDI 188

Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
            G  V+Y  DY+R +++HL    L  F     +I   Y    +QP   + + F D I  T
Sbjct: 189 AGVRVLYTGDYSREEDRHLKAAELPQFSPDICIIESTYGVQQHQPRHVREKRFTDVIHTT 248

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWM 282
           +  GG VL+P  + GR  ELLLIL++YWA H      PIY+ + ++   +   ++++  M
Sbjct: 249 VSQGGRVLIPAFALGRAQELLLILDEYWANHPELHKIPIYYASPLAKKCMAVYQTYINSM 308

Query: 283 GDSITKSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEW 341
            + I   F  S  N F  KH+  L   + +DN  D GP +V+AS   L++G S  +F +W
Sbjct: 309 NERIRNQFAQS--NPFHFKHIESL---NSIDNFHDVGPSVVMASPGGLQSGLSRQLFDKW 363

Query: 342 ASDVKNLVLFTERGQFGTLARMLQADP 368
            +D KN  +       GTLA+ +  +P
Sbjct: 364 CTDKKNSCVIPGYVVEGTLAKTIINEP 390


>gi|380480161|emb|CCF42595.1| RNA-metabolising metallo-beta-lactamase [Colletotrichum
           higginsianum]
          Length = 979

 Score =  158 bits (399), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 126/440 (28%), Positives = 193/440 (43%), Gaps = 84/440 (19%)

Query: 9   PLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTL 66
           PL G  +E+  S  ++ +DG    LID GW++ FD   LQ L K   T+  +LL+H    
Sbjct: 6   PLQGALSESAASQSILELDGGVKILIDLGWDESFDVEKLQELEKQVPTLSLILLTHATAS 65

Query: 67  HLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQY---------------------L 103
           HL A  +  K   L    PV++T PV  LG     D Y                      
Sbjct: 66  HLAAFAHCCKNFPLFTRIPVYATRPVIDLGRTLTQDLYASTPLAATKIPHGSLNEAAYSF 125

Query: 104 SRRQVSEFDLF----TLDDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGHL 154
           S++  ++ D      T ++I   F  +  L YSQ +            G+++  + AGH 
Sbjct: 126 SQQPTADSDFLLQAPTPEEITRYFSLIQPLKYSQPHEPLPSPFSPPLNGLMITAYNAGHS 185

Query: 155 LGGTVWKITKDGEDVIYAVDYNRRKEK------------HLNGTVLESFVRPAVLITDAY 202
           LGGT+W I    E ++YAVD+N+ KE                  V+E   +P  L+  + 
Sbjct: 186 LGGTIWHIQHGLESIVYAVDWNQAKENVFAGAAWLGGAGGGGAEVIEQLRKPTALVCSSR 245

Query: 203 NA--LHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--- 256
            A  +     R +R E   D I   +  GG  L+PVDS+ RVLE+  +LE  W   S   
Sbjct: 246 GAEKVAQAGGRAKRDEQLVDMIKTCVSRGGTALVPVDSSARVLEIAYLLEHAWRVDSESD 305

Query: 257 ----LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA--------------- 297
                +  +Y      SST+ Y +S LEWM D+I + FE+  D                 
Sbjct: 306 NSSLKSAKLYLAGRNMSSTLRYARSMLEWMDDNIVREFESVADGQRRTNGAEAKSKEGVP 365

Query: 298 FLLKHVTLLINKSEL--------DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLV 349
           F  +++ L+  ++++        DN     +++LAS  +LE GFS D+    A D +NLV
Sbjct: 366 FDFRYLKLVERRAQIEKLLSGSGDNVQAEGRVILASDDTLEWGFSKDLIRGLAKDSRNLV 425

Query: 350 LFTE-----RGQFGTLARML 364
           + T+     R +  ++AR L
Sbjct: 426 ILTDKPAKSRAEQPSIARTL 445


>gi|50304897|ref|XP_452404.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|74636942|sp|Q6CUI5.1|YSH1_KLULA RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
           3'-end-processing protein YSH1
 gi|49641537|emb|CAH01255.1| KLLA0C04598p [Kluyveromyces lactis]
          Length = 764

 Score =  158 bits (399), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 101/348 (29%), Positives = 175/348 (50%), Gaps = 24/348 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLS----- 104
           STID +L+SH    H  +LPY M++      VF T P   +YR  LL  + +  S     
Sbjct: 64  STIDLLLISHFHLDHAASLPYVMQRTNFRGRVFMTHPTKAIYRW-LLNDFVKVTSIGDSP 122

Query: 105 RRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
            +  S  +L++ +D+  +F  +  +    +YH + +  GI      AGH+LG  +++I  
Sbjct: 123 GQDSSNDNLYSDEDLAESFDRIETI----DYHSTMEVNGIKFTAFHAGHVLGAAMFQIEI 178

Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
            G  V++  DY+R  ++HLN   +       +++   +    ++P + +       I   
Sbjct: 179 AGVRVLFTGDYSREVDRHLNSAEVPPQSSDVIIVESTFGTATHEPRQNRERKLTQLIHTV 238

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL-----NYPIYFLTYVSSSTIDYVKSFL 279
           +  GG VLLPV + GR  E++LIL++YW  H         PI++ + ++   +   ++++
Sbjct: 239 VSKGGRVLLPVFALGRAQEIMLILDEYWQNHKEELGNGQVPIFYASNLAKKCMSVFQTYV 298

Query: 280 EWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFV 339
             M D I K F+ S+ N F+ K+++ L N  E ++   GP ++LAS   L+ G S DI  
Sbjct: 299 NMMNDDIRKKFKDSQTNPFIFKNISYLKNLDEFEDF--GPSVMLASPGMLQNGLSRDILE 356

Query: 340 EWASDVKNLVLFTERGQFGTLARML----QADPPPKAVKVTMSRRVPL 383
           +W  + KNLVL T     GT+A+ L    +A P     ++T+ RR  +
Sbjct: 357 KWCPEEKNLVLVTGYSVEGTMAKYLLLEPEAIPSVHNPEITIPRRCQV 404


>gi|429857613|gb|ELA32471.1| cleavage and polyadenylylation specificity [Colletotrichum
           gloeosporioides Nara gc5]
          Length = 962

 Score =  158 bits (399), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 166/644 (25%), Positives = 261/644 (40%), Gaps = 151/644 (23%)

Query: 9   PLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTL 66
           PL G  +E+  S  ++ +DG    LID GW++ FD   L+ L K   T+  +LL+H  T 
Sbjct: 6   PLQGALSESSASQSILELDGGVKILIDLGWDESFDVEKLRELEKQVPTLSIILLTHATTS 65

Query: 67  HLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLS-------------------- 104
           HL A  +  K   L    PV++T PV  LG     D Y S                    
Sbjct: 66  HLAAFAHCCKNFPLFTRIPVYATRPVIDLGRTLTQDLYASTPLAATKIPHGSLSEAAYSY 125

Query: 105 -RRQVSEFDLF----TLDDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGHL 154
            ++   + D      T ++I   F  +  L YSQ +            G+++  + AGH 
Sbjct: 126 SQQPTGDSDFLLQAPTPEEITRYFSLIQPLKYSQPHEPLPSPFSPPLNGLMITAYNAGHS 185

Query: 155 LGGTVWKITKDGEDVIYAVDYNRRKEK------------HLNGTVLESFVRPAVLITDAY 202
           LGGT+W I    E ++YAVD+N+ +E                  V+E   +P  L+  + 
Sbjct: 186 LGGTIWHIQHGLESIVYAVDWNQARENVYAGAAWLGGAGGGGAEVIEQLRKPTALVCSSR 245

Query: 203 NA--LHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA------ 253
            A  +     R +R E   D I   +  GG  L+PVDS+ RVLE+  +LE  W       
Sbjct: 246 GAEKVAQAGGRAKRDEQLVDIIKLCVSRGGTCLIPVDSSARVLEIAYLLEHTWQVDSETD 305

Query: 254 EHSLNYP-IYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA--------------- 297
           ++SL    +Y      SST+ Y +S LEWM D+I + FE+  D                 
Sbjct: 306 DNSLKAAKLYLAGRNMSSTLRYARSMLEWMDDNIVREFESVADGQRKANGADGKTKEAVP 365

Query: 298 FLLKHVTLLINKSELD--------NAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLV 349
           F  +++ L+  +++++        N     +++LAS  +LE GFS D+    A D +NLV
Sbjct: 366 FDFRYLKLVERRAQIEKLLSSSGGNVQSEGRVILASDDTLEWGFSKDLIKGLAKDSRNLV 425

Query: 350 LFTE-----RGQFGTLARML------QADPPP-----------------KAVKVTMSRRV 381
           + T+     R +  ++AR L      + D                    + +++  ++R 
Sbjct: 426 VLTDKPPKSRAEQPSIARTLWDWWTERQDGATVEQTSSGDSIEFVYGGGRELEIQEAKRQ 485

Query: 382 PLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLGP-------------------DN 422
            L G+EL  Y   Q  L  +  L+A+L  +    ASL                     DN
Sbjct: 486 ALEGDELTVY---QQWLATQRQLQATL--QSGGGASLQAPADAADDVSSESSSDSGESDN 540

Query: 423 NLSGDPMVIDANNANASA------------DVVEPHGGRYRDILIDGFVPPSTSVAPMFP 470
              G  + I      A+             +++    G Y D  + G      S    FP
Sbjct: 541 EQQGKALNISTTMGQATRKKVVLTDEDLGINILTKKRGAY-DFDVRGKKGRERS----FP 595

Query: 471 FYENNSEWDDFGEVINPDDYII---KDEDMDQAAMHIGGDDGKL 511
                   D FG+VI P+DY+    K+ED+    M    D+ +L
Sbjct: 596 LVMRRRRDDQFGDVIRPEDYLRAEEKEEDVPDTEMRGDDDEDRL 639


>gi|357445375|ref|XP_003592965.1| Cleavage and polyadenylation specificity factor subunit 3-I
           [Medicago truncatula]
 gi|357445453|ref|XP_003593004.1| Cleavage and polyadenylation specificity factor subunit 3-I
           [Medicago truncatula]
 gi|355482013|gb|AES63216.1| Cleavage and polyadenylation specificity factor subunit 3-I
           [Medicago truncatula]
 gi|355482052|gb|AES63255.1| Cleavage and polyadenylation specificity factor subunit 3-I
           [Medicago truncatula]
          Length = 690

 Score =  158 bits (399), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 114/382 (29%), Positives = 183/382 (47%), Gaps = 46/382 (12%)

Query: 7   VTPLSGVFNENPLSYL-VSIDGFNFLIDCG------------WNDHFDPSLLQPLSKVAS 53
           VTPL G  NE   S + ++  G   L DCG            + D  DPS          
Sbjct: 24  VTPL-GAGNEVGRSCVYMTYKGKTVLFDCGIHPGYSGMAALPYFDEIDPS---------- 72

Query: 54  TIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSE 110
           T+D +L++H    H  +LPY +++      VF   +T+ +Y+L    +   Y+   +VS 
Sbjct: 73  TVDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKL----LLSDYVKVSKVSV 128

Query: 111 FD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
            D L+   DI+ +   +  + + Q   ++G    I    + AGH+LG  ++ +   G  V
Sbjct: 129 DDMLYDEQDINRSMDKIEVIDFHQTVEVNG----IRFWCYTAGHVLGAAMFMVDIAGVRV 184

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGG 229
           +Y  DY+R +++HL       F     +I   Y   H+QP   + + F D I  T+  GG
Sbjct: 185 LYTGDYSREEDRHLRAAETPQFSPDVCIIESTYGVQHHQPRHTREKRFTDVIHSTISQGG 244

Query: 230 NVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSIT 287
            VL+P  + GR  ELLLIL++YWA H    N PIY+ + ++   +   +++   M D I 
Sbjct: 245 RVLIPAYALGRAQELLLILDEYWANHPELQNIPIYYASPLAKKCLTVYETYTLSMNDRI- 303

Query: 288 KSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVK 346
              + ++ N F  KH++ L   S +D   D GP +V+AS   L++G S  +F  W SD K
Sbjct: 304 ---QNAKSNPFAFKHISAL---SSIDIFKDVGPSVVMASPGGLQSGLSRQLFDMWCSDKK 357

Query: 347 NLVLFTERGQFGTLARMLQADP 368
           N  +       GTLA+ +  +P
Sbjct: 358 NSCVIPGYVVEGTLAKTILNEP 379


>gi|367031802|ref|XP_003665184.1| hypothetical protein MYCTH_2308652 [Myceliophthora thermophila ATCC
           42464]
 gi|347012455|gb|AEO59939.1| hypothetical protein MYCTH_2308652 [Myceliophthora thermophila ATCC
           42464]
          Length = 1035

 Score =  157 bits (398), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 129/446 (28%), Positives = 195/446 (43%), Gaps = 89/446 (19%)

Query: 8   TPLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDT 65
           +PL G   E+  S  L+ +DG    L+D GW++ FD   L+ L K   T+  +LL+H   
Sbjct: 5   SPLQGALTESAASQSLLELDGGVKVLVDVGWDETFDVEKLRELEKQVPTLSLILLTHATI 64

Query: 66  LHLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLS---------RRQVSEFDLF 114
            HLGA  +  K   L    PV++T PV  LG     D Y S         +  ++E    
Sbjct: 65  NHLGAYAHCCKNFPLFTRIPVYATRPVIDLGRTLTQDLYASTPMAATTIPQTSLAESSYS 124

Query: 115 ----------------TLDDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGH 153
                           T D+I   F  +  L YSQ +            G+ +  + +GH
Sbjct: 125 YAQASSADHKLLLQPPTPDEIARYFSLIQPLKYSQPHQPLPSPFSPPLNGLTITAYNSGH 184

Query: 154 LLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT--------------VLESFVRPAVLIT 199
            LGGT+W I    E ++YAVD+++ +E   +G               V+E   +P  L+ 
Sbjct: 185 TLGGTIWHIQHGLESIVYAVDWSQARENVFSGAAWLGGGHGAAGGAEVIEQLRKPTALVC 244

Query: 200 DAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA------ 253
            +       P  ++ E   ++I   +  GG VL+PVDS+ RVLEL  +LE  W       
Sbjct: 245 SSRTPETALPRGRRDEQLLESIKLCIARGGTVLIPVDSSARVLELSYLLEHAWRSEVAKD 304

Query: 254 -EHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE-----TSRDNA---------- 297
            E   +  +Y       ST+   +S LEWM DSI + FE     T   N+          
Sbjct: 305 NEVFKSTKVYLAGRSVGSTMRNARSMLEWMDDSIVREFEAVAGGTRTGNSGGGAGSGAKG 364

Query: 298 -----FLLKHVTLLINKSEL----------DNAPDGPKLVLASMASLEAGFSHDIFVEWA 342
                F  KH+ LL  K+++          D+A    +++LA+ +SLE GFS D+    A
Sbjct: 365 KEAGPFDFKHLRLLERKAQVERVLQQATATDDAEPRGRVILATDSSLEWGFSKDVMRAIA 424

Query: 343 SDVKNLVLFTERGQFG----TLARML 364
            D +NLV+ TE+        ++ARML
Sbjct: 425 EDPRNLVILTEKPSLNPGKPSIARML 450


>gi|290978816|ref|XP_002672131.1| predicted protein [Naegleria gruberi]
 gi|284085705|gb|EFC39387.1| predicted protein [Naegleria gruberi]
          Length = 749

 Score =  157 bits (398), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 122/411 (29%), Positives = 206/411 (50%), Gaps = 25/411 (6%)

Query: 2   GTSVQVTPLSGVFNENPLS-YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAV 58
           G  + VTPL G  NE   S  L+   G   L DCG +  F      P       S ID V
Sbjct: 36  GEKLVVTPL-GAGNEVGRSAVLLQFKGKTVLFDCGIHPAFTGMASLPFFDTIEPSEIDLV 94

Query: 59  LLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLSRRQVSEFDLFT 115
           L++H    H GALPY  +       VF T P   +Y+L LLT + + +S   V +  LFT
Sbjct: 95  LVTHFHLDHCGALPYFTEHTNFQGRVFMTHPTKAIYKL-LLTDFVK-VSDVHVDD-QLFT 151

Query: 116 LDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDY 175
             ++  + + +  +    +YH   +  GI    + AGH+LG  ++ +   G  V+Y  D+
Sbjct: 152 EQNLLDSLKKIELI----DYHQELEHNGIKFWCYNAGHVLGAAMFMVEIAGVRVLYTGDF 207

Query: 176 NRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLP 234
           +R+ ++HL G    + + P VLI ++   +     + +RE  F   +++ ++ GG  L+P
Sbjct: 208 SRQPDRHLLGAETPT-MSPDVLIVESTYGIQVHESQSEREKRFTQMVTEIVKRGGRCLIP 266

Query: 235 VDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET 292
           V + GR  ELLLIL+++W  H    + PIY+ + ++   +   ++++  M D I K F+ 
Sbjct: 267 VFALGRAQELLLILDEFWETHQDLQHIPIYYASSLAKKCMTIFQTYINMMNDKIRKQFDI 326

Query: 293 SRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFT 352
              N F+ KH++ L  +S  D   +GP +++AS   L++G S ++F  W  D KN V+  
Sbjct: 327 H--NPFVFKHISNL--RSIEDFQDNGPCVIMASPGMLQSGLSKELFELWCQDAKNGVIIA 382

Query: 353 ERGQFGTLARMLQADPPPKAVKVTMSRRVPL-VGEELIAYEEEQTRLKKEE 402
                GTLA+ + ++  P+ V ++    VPL +    I++     + + EE
Sbjct: 383 GYSVDGTLAKKIMSE--PETVTLSNGNTVPLRMSVRTISFSAHSDKAQTEE 431


>gi|408391611|gb|EKJ70983.1| hypothetical protein FPSE_08842 [Fusarium pseudograminearum CS3096]
          Length = 963

 Score =  157 bits (398), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 123/424 (29%), Positives = 191/424 (45%), Gaps = 78/424 (18%)

Query: 9   PLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTL 66
           PL G  +++  S  L+ +DG    L+D GW++ FD   L+ + K  +T+  +L++H    
Sbjct: 6   PLQGALSDSSASQSLLELDGGVKVLVDLGWDESFDVEKLKEIEKQVTTLSLILVTHATAS 65

Query: 67  HLGALPYAMKQLG--LSAPVFSTEPVYRLGLLTMYDQY---------------------L 103
           HL A  +  K +      PV++T PV  LG   + D Y                     L
Sbjct: 66  HLAAYAHCCKNIPQFTRIPVYATRPVIDLGRTLIQDLYTSSPAAATTIPQSSLTESAYSL 125

Query: 104 SRRQVSEFDLF----TLDDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGHL 154
           ++   +  +L       ++I   F  +  L YSQ +            G+ +  + +GH 
Sbjct: 126 TQTATTAQNLLLQSPNSEEIARYFSLIQPLKYSQPHQPLPSPFSPPLNGLTITAYNSGHT 185

Query: 155 LGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAY 202
           LGGT+W I    E ++YAVD+N+ +E    G             V+E   +P  LI  + 
Sbjct: 186 LGGTIWHIQHGLESIVYAVDWNQARENVFAGAAWLGGAGGGGAEVIEQLRKPTALICSSR 245

Query: 203 NALHNQPP--RQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL-- 257
            A     P  R +R E   D I   +  GG VL+PVDS+ RVLEL  +LE  W   +   
Sbjct: 246 GADRTAQPGGRTKRDEQLIDTIKACVTRGGTVLIPVDSSARVLELSYLLEHAWRTDAASE 305

Query: 258 -----NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET-----SRDNA---------F 298
                +  +Y      SST+ Y +S LEWM DSI + FE       R N          F
Sbjct: 306 GGVLKSAKLYLAGRNMSSTMRYARSMLEWMDDSIVQEFEAFAEDQRRVNGANNKKEGGPF 365

Query: 299 LLKHVTLLINKSEL--------DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
             K++ LL  K+++        +NA    +++LAS +S+E GFS D+    A D +NLV+
Sbjct: 366 DFKYLRLLERKAQIARLLSQNVENAGTEGRVILASDSSIEWGFSKDLIKGLAQDSRNLVI 425

Query: 351 FTER 354
            T++
Sbjct: 426 LTDK 429


>gi|444315239|ref|XP_004178277.1| hypothetical protein TBLA_0A09750 [Tetrapisispora blattae CBS 6284]
 gi|387511316|emb|CCH58758.1| hypothetical protein TBLA_0A09750 [Tetrapisispora blattae CBS 6284]
          Length = 781

 Score =  157 bits (398), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 109/372 (29%), Positives = 184/372 (49%), Gaps = 26/372 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLS---RR 106
           STID +L+SH    H  +LPY M++      VF T P   +YR  LL  + +  S     
Sbjct: 69  STIDVLLISHFHLDHAASLPYVMQRTNFRGRVFMTHPTKAIYRW-LLRDFVKVTSIGGDA 127

Query: 107 QVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG 166
           +  + +L+  +D+  +F  +  +    +YH +    GI    + AGH+LG  +++I   G
Sbjct: 128 ENKDENLYNDEDLVESFDRIETI----DYHSTIDVNGIKFTAYHAGHVLGAAMFQIEIAG 183

Query: 167 EDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTL 225
             +++  DY+R  ++HLN   +       +LI ++        PR  REM     +   +
Sbjct: 184 LRILFTGDYSRELDRHLNSAEIPPLASD-ILIVESTFGTATHEPRLNREMKLTQLVHSIV 242

Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWAEHSL-----NYPIYFLTYVSSSTIDYVKSFLE 280
             GG VL+PV + GR  E++LIL++YW  H         PIY+ + ++   +   ++++ 
Sbjct: 243 SRGGRVLMPVFALGRAQEIMLILDEYWNNHHEELGGGQVPIYYASSLAKKCMSVFQTYVN 302

Query: 281 WMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFV 339
            M D I K F  S+ N F+ K+++ L N   LDN  D GP ++LAS   L++G S D+  
Sbjct: 303 MMNDDIRKKFRDSQTNPFIFKNISYLRN---LDNFEDFGPSVLLASPGMLQSGISRDLLE 359

Query: 340 EWASDVKNLVLFTERGQFGTLARMLQADPPP----KAVKVTMSRRVPLVGEELIAYEEEQ 395
            W  + KN+VL T     GT+A+ L  +P         ++++ RR  +      A+ + Q
Sbjct: 360 RWCPEDKNMVLITGYSVEGTMAKYLMVEPDTIPSINNPEISIPRRCKIEEISFAAHVDFQ 419

Query: 396 TRLKKEEALKAS 407
             L+  E + AS
Sbjct: 420 ENLEFIEKINAS 431


>gi|32564696|ref|NP_495706.2| Protein F10B5.8 [Caenorhabditis elegans]
 gi|26985793|emb|CAB54223.2| Protein F10B5.8 [Caenorhabditis elegans]
          Length = 608

 Score =  157 bits (398), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 100/373 (26%), Positives = 182/373 (48%), Gaps = 18/373 (4%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           +++ PL    +      L++I G N ++DCG +  +       D S +    ++   +D 
Sbjct: 8   IKIVPLGAGQDVGRSCILITIGGKNIMVDCGMHMGYQDDRRFPDFSYIGGGGRLTDYLDC 67

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVS-EFDLFTL 116
           V++SH    H G+LP+  + +G   P++ T P   +  + + D    +  +  E + FT 
Sbjct: 68  VIISHFHLDHCGSLPHMSEIVGYDGPIYMTYPTKAICPVLLEDYRKVQCDIKGETNFFTS 127

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
           DDI +  + V      +  H+  +   + +    AGH+LG  +++I      V+Y  DYN
Sbjct: 128 DDIKNCMKKVVGCALHEIIHVDNE---LSIRAFYAGHVLGAAMFEIRLGDHSVLYTGDYN 184

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    +   VRP VLI+++  A   +  ++ RE  F   + + +  GG V++PV
Sbjct: 185 MTPDRHLGAARVLPGVRPTVLISESTYATTIRDSKRARERDFLRKVHECVMKGGKVIIPV 244

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +LN PIYF   ++     Y + F+ W  ++I K+F     
Sbjct: 245 FALGRAQELCILLESYWERMALNVPIYFSQGLAERANQYYRLFISWTNENIKKTF--VER 302

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+  +    E  + P GP+++ ++   L  G S  +F +W SD  N+++     
Sbjct: 303 NMFEFKHIKPMEKGCE--DQP-GPQVLFSTPGMLHGGQSLKVFKKWCSDPLNMIIMPGYC 359

Query: 356 QFGTL-ARMLQAD 367
             GT+ AR++  +
Sbjct: 360 VAGTVGARVINGE 372


>gi|356543411|ref|XP_003540154.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-I-like [Glycine max]
          Length = 689

 Score =  157 bits (397), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 115/382 (30%), Positives = 183/382 (47%), Gaps = 46/382 (12%)

Query: 7   VTPLSGVFNENPLSYL-VSIDGFNFLIDCG------------WNDHFDPSLLQPLSKVAS 53
           VTPL G  NE   S + +S  G   L DCG            + D  DPS          
Sbjct: 23  VTPL-GAGNEVGRSCVYMSYKGKTVLFDCGIHPAYSGMAALPYFDEIDPS---------- 71

Query: 54  TIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSE 110
           T+D +L++H    H  +LPY +++      VF   +T+ +Y+L    +   ++   +VS 
Sbjct: 72  TVDVLLITHFHLDHAASLPYFLEKTTFRGRVFMTYATKAIYKL----LLSDFVKVSKVSV 127

Query: 111 FD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
            D LF   DI+ +   +  + + Q   ++G    I    + AGH+LG  ++ +   G  V
Sbjct: 128 EDMLFDEQDINRSMDKIEVIDFHQTVEVNG----IRFWCYTAGHVLGAAMFMVDIAGVRV 183

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGG 229
           +Y  DY+R +++HL       F     +I   Y   H+QP   + + F D I  T+  GG
Sbjct: 184 LYTGDYSREEDRHLRAAETPQFSPDVCIIESTYGVQHHQPRHTREKRFTDVIHSTISQGG 243

Query: 230 NVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSIT 287
            VL+P  + GR  ELLLIL++YWA H    N PIY+ + ++   +   +++   M D I 
Sbjct: 244 RVLIPAFALGRAQELLLILDEYWANHPELQNIPIYYASPLAKKCLTVYETYTLSMNDRI- 302

Query: 288 KSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVK 346
              + ++ N F  KHV+ L   S ++   D GP +V+AS   L++G S  +F  W SD K
Sbjct: 303 ---QNAKSNPFSFKHVSAL---SSIEVFKDVGPSVVMASPGGLQSGLSRQLFDMWCSDKK 356

Query: 347 NLVLFTERGQFGTLARMLQADP 368
           N  +       GTLA+ +  +P
Sbjct: 357 NSCVLPGYVVEGTLAKTIINEP 378


>gi|443898849|dbj|GAC76183.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT2
           [Pseudozyma antarctica T-34]
          Length = 1135

 Score =  157 bits (397), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 119/433 (27%), Positives = 200/433 (46%), Gaps = 86/433 (19%)

Query: 48  LSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQ 107
           L ++A TID VLLSH    HLG   YA  +LGL   V++T PV  +G LT+ +   + R 
Sbjct: 195 LRELAPTIDLVLLSHSSLDHLGLYAYAYAKLGLRCLVYATMPVQSMGKLTVLEATQTWRN 254

Query: 108 VSEFD------------------LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPH 149
             + D                  L T  +I+ AF+ +  + Y Q  HL GK   + +  +
Sbjct: 255 EVDIDAEEAASNKAGSLASKRRCLATTAEIEDAFEHIKTVRYMQPTHLEGKCASLTLTAY 314

Query: 150 VAGHLLGGTVWKI-TKDGEDVIYAVDYNRRKEKHLNGTVL-----------------ESF 191
            AGH LGG +WKI +     V+ A+D+N  +E+HL+GT+L                 ++ 
Sbjct: 315 NAGHSLGGAIWKIRSPTSGTVVVALDWNHNRERHLDGTILLSSSAAGPGMSSSGSGADAV 374

Query: 192 VRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILED 250
            RP +LIT+    L     R+ R+    D + KT+++G +VL P+D++ R+LEL+++L+ 
Sbjct: 375 RRPDLLITEIERGLVVNTRRKDRDAAIIDLVHKTIQSGHSVLFPIDASARLLELMVLLDQ 434

Query: 251 YWA---EHSLNYPIYFLTYVSSSTIDYVKSFLEWM----GDSITKSFETSRD-------- 295
           +WA    H+  +P+  ++      I+  ++++EWM         ++ E  +D        
Sbjct: 435 HWAYAYPHA-RFPLCLISRTGKEVIERSRTYMEWMTREWATKANETIEADKDRQPDAHRA 493

Query: 296 ----------NAFLLKHVTLLINKSELDNA--PDGPKLVLASMASLEAGFSHDIFVEWAS 343
                     +    K+V +  +  ++D A   D  ++VLA   S+  G S  +   +A 
Sbjct: 494 GRGARNAAASSPLDFKYVRVFASLQQMDEAIPQDQARVVLAVPPSMTHGPSRRLLARFAR 553

Query: 344 DVKNLVLFTERGQFGTLARML-----QADPP---------------PKAVKVTMSRRVPL 383
           +  + ++   RG+ G+L R L     Q  P                   V+  +  +VPL
Sbjct: 554 NPNDAIVLISRGEPGSLCRQLWDAWNQRQPKGFSWTKGKLGEVVSGEATVRYELQSKVPL 613

Query: 384 VGEEL-IAYEEEQ 395
            GEEL +  E EQ
Sbjct: 614 EGEELRLHLESEQ 626


>gi|367016955|ref|XP_003682976.1| hypothetical protein TDEL_0G03980 [Torulaspora delbrueckii]
 gi|359750639|emb|CCE93765.1| hypothetical protein TDEL_0G03980 [Torulaspora delbrueckii]
          Length = 775

 Score =  157 bits (397), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 98/328 (29%), Positives = 169/328 (51%), Gaps = 20/328 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLSRRQVS 109
           S ID +L+SH    H  +LPY M++      VF T P   +YR  LL  + +  S    S
Sbjct: 59  SKIDVLLISHFHVDHAASLPYVMQKTNFQGRVFMTHPTKAIYRW-LLRDFVRVTSIGVSS 117

Query: 110 ---EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG 166
              + +L+T +D+  +F  +  +    ++H +    GI    + AGH+LG  +++I   G
Sbjct: 118 GGKDDNLYTDEDLAESFDRIETI----DFHSTVDVNGIKFTAYHAGHVLGAAMFQIEIAG 173

Query: 167 EDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLR 226
             +++  DY+R  ++HLN   + +      ++   +    ++P   +       I  T+ 
Sbjct: 174 VRILFTGDYSRELDRHLNSAEVPTLPSDVHIVESTFGTATHEPRVNRERKLTQLIHSTVS 233

Query: 227 AGGNVLLPVDSAGRVLELLLILEDYWAEHS-----LNYPIYFLTYVSSSTIDYVKSFLEW 281
            GG VLLPV + GR  E++LIL++YW +HS        PIY+ + ++   +   ++++  
Sbjct: 234 RGGRVLLPVFALGRAQEIMLILDEYWTQHSDELGGGQVPIYYASNLAKKCMSVFQTYVNM 293

Query: 282 MGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVE 340
           M D I K F  S+ N F+ K+++ L N   +D+  D GP ++LAS   L++G S D+  +
Sbjct: 294 MNDDIRKKFRDSQTNPFVFKNISYLRN---IDDFQDFGPSVMLASPGMLQSGLSRDVLEK 350

Query: 341 WASDVKNLVLFTERGQFGTLARMLQADP 368
           W  + KNLVL T     GT+A+ L  +P
Sbjct: 351 WCPEDKNLVLITGYSVEGTMAKFLMLEP 378


>gi|198421242|ref|XP_002128016.1| PREDICTED: similar to Cleavage and polyadenylation specificity
           factor subunit 3 (Cleavage and polyadenylation
           specificity factor 73 kDa subunit) (CPSF 73 kDa subunit)
           (mRNA 3-end-processing endonuclease CPSF-73) [Ciona
           intestinalis]
          Length = 690

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 116/391 (29%), Positives = 194/391 (49%), Gaps = 30/391 (7%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVAST----IDAVLL 60
           +++TPL          +L+       ++DCG   H   S L  L  +  T    ID +L+
Sbjct: 17  LKITPLGAGQEVGRSCHLLEFKEKKIMLDCGI--HPGISGLAGLPYIDFTEPEKIDLLLV 74

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTL 116
           +H    H G LP+ +++      VF   +T+ +YR     +   Y+    +S  D L+T 
Sbjct: 75  THFHLDHAGGLPWFLQKTTFKGRVFMTHATKAIYRW----LLSDYIKVSNISTEDQLYTE 130

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
            D++ +   +  +    N+H      GI    + AGH+LG  ++ I   G  V+Y  DY+
Sbjct: 131 ADLEDSMARIETI----NFHEEKMVGGIKFWCYHAGHVLGAAMFMIQIAGVRVLYTGDYS 186

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPV 235
           R +++HL    + + VRP VLIT+A    H   PR++RE  F + +   +  GG  L+PV
Sbjct: 187 REEDRHLMAAEIPA-VRPDVLITEATYGTHIHEPREEREARFTNTVQDIVNRGGRCLIPV 245

Query: 236 DSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
            + GR  ELLLIL+DYWA H    + PIY+ + ++   +   +++   M   I K    S
Sbjct: 246 FALGRAQELLLILDDYWANHPELHDIPIYYASSLAKKCMAVYQTYSNAMNQKIQKQLNIS 305

Query: 294 RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
             N F  KH++ L      D+   GP +V+AS   +++G S ++F  W +D +N V+   
Sbjct: 306 --NPFQFKHISNLKGMEHFDDV--GPSVVMASPGMMQSGLSRELFESWCNDRRNGVIVAG 361

Query: 354 RGQFGTLARMLQADPPPKAVKVTMS-RRVPL 383
               GTLA+ + ++P      V+MS +++PL
Sbjct: 362 YCVEGTLAKHILSEPEE---VVSMSGQKIPL 389


>gi|342882935|gb|EGU83499.1| hypothetical protein FOXB_05909 [Fusarium oxysporum Fo5176]
          Length = 950

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 121/424 (28%), Positives = 189/424 (44%), Gaps = 78/424 (18%)

Query: 9   PLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTL 66
           PL G  +++P S  L+ +DG    L+D GW++ FD   L+ + K  +T+  +L++H    
Sbjct: 6   PLQGALSDSPASQSLLELDGGVKVLVDLGWDETFDVEKLKEIEKQVTTLSLILVTHATAS 65

Query: 67  HLGALPYAMKQLG--LSAPVFSTEPVYRLGLLTMYDQY---------------------L 103
           HL A  +  K +      PV++T PV  LG   + D Y                     L
Sbjct: 66  HLAAYAHCCKNIPQFTRIPVYATRPVIDLGRTLIQDLYTSSPAAATTIPQSSLTESAYSL 125

Query: 104 SRRQVSEFDLF----TLDDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGHL 154
           ++   +  +L     T ++I   F  +  L YSQ +            G+ +  + +GH 
Sbjct: 126 TQTATTAQNLLLQSPTNEEIARYFSLIQPLKYSQPHQPLPSPFSPPLNGLTITAYNSGHT 185

Query: 155 LGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAY 202
           LGGT+W I    E ++YAVD+N+ +E    G             V+E   +P  LI  + 
Sbjct: 186 LGGTIWHIQHGLESIVYAVDWNQARENVFAGAAWLGGAGGGGAEVIEQLRKPTALICSSR 245

Query: 203 NALHNQPP--RQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLN- 258
            A        R +R E   D I   +  GG VL+PVDS+ RVLEL  +LE  W   + + 
Sbjct: 246 GADRTAQTGGRAKRDEQLIDTIKACVTRGGTVLIPVDSSARVLELSYLLEHAWRTDAASD 305

Query: 259 ------YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET--------------SRDNAF 298
                   +Y      SST+ Y +S LEWM +SI + FE                    F
Sbjct: 306 AGVLKTAKLYLAGRNMSSTMRYARSMLEWMDESIVQEFEAFAEGQRKVNGANDKKEGGPF 365

Query: 299 LLKHVTLLINKSEL--------DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
             K++ LL  K+++        DN     +++LAS +S+E GFS D+    A D +NLV+
Sbjct: 366 DFKYLRLLERKAQIARLLSQNPDNVSTEGRVILASDSSIEWGFSKDLIKGLARDSRNLVI 425

Query: 351 FTER 354
            T++
Sbjct: 426 LTDK 429


>gi|256084683|ref|XP_002578556.1| cleavage and polyadenylation specificity factor [Schistosoma
           mansoni]
 gi|350644758|emb|CCD60512.1| cleavage and polyadenylation specificity factor,putative
           [Schistosoma mansoni]
          Length = 619

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 108/357 (30%), Positives = 177/357 (49%), Gaps = 18/357 (5%)

Query: 3   TSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTI 55
           +S++V PL    +      LV++ G N + DCG    +ND     D + +     +   +
Sbjct: 2   SSIRVIPLGAGQDVGRSCILVTLGGKNIMFDCGMHMGYNDDRKFPDFTYITDKGGLNEYL 61

Query: 56  DAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLF 114
           D V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  + + F
Sbjct: 62  DCVIISHFHLDHCGALPYMTEVIGYDGPIYMTHPTKAICPILLEDYRKINVERRGDQNFF 121

Query: 115 TLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVD 174
           T D I      V  +   Q   +  + E   +    AGH+LG  ++ +      V+Y  D
Sbjct: 122 TSDMIYRCMTKVRCVYIHQTVKVDDELE---IQAFYAGHVLGAAMFLVRVGTNSVLYTGD 178

Query: 175 YNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLL 233
           YN   ++HL G    S  RP +LIT++  A   +  ++ RE  F + I   + AGG VL+
Sbjct: 179 YNMTPDRHL-GAAWVSRCRPDLLITESTYATTIRDSKRTREREFLEKIHARVEAGGKVLI 237

Query: 234 PVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
           PV + GR  EL ++LE YW   +++ PIYF   ++    +Y K F+ W    I ++F   
Sbjct: 238 PVFALGRAQELCILLETYWERMNISVPIYFSMGMAEKANEYYKLFISWTNQKIKETF--V 295

Query: 294 RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
           + N F  KH+  L  +  +DN   GP +V A+   L AG S  IF +WASD +N+V+
Sbjct: 296 KRNMFDFKHIKPL-GQGTVDNP--GPMVVFATPGMLHAGQSLHIFRKWASDERNMVV 349


>gi|46138561|ref|XP_390971.1| hypothetical protein FG10795.1 [Gibberella zeae PH-1]
          Length = 964

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 123/425 (28%), Positives = 191/425 (44%), Gaps = 79/425 (18%)

Query: 9   PLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTL 66
           PL G  +++  S  L+ +DG    L+D GW++ FD   L+ + K  +T+  +L++H    
Sbjct: 6   PLQGALSDSSASQSLLELDGGVKVLVDLGWDETFDVEKLKEIEKQVTTLSLILVTHATAS 65

Query: 67  HLGALPYAMKQLG--LSAPVFSTEPVYRLGLLTMYDQY---------------------L 103
           HL A  +  K +      PV++T PV  LG   + D Y                     L
Sbjct: 66  HLAAYAHCCKNIPQFTRIPVYATRPVIDLGRTLIQDLYTSSPAAATTIPQSSLTESAYSL 125

Query: 104 SRRQVSEFDLF----TLDDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGHL 154
           ++   +  +L       ++I   F  +  L YSQ +            G+ +  + +GH 
Sbjct: 126 TQTATTARNLLLQSPNSEEIARYFSLIQPLKYSQPHQPLPSPFSPPLNGLTITAYNSGHT 185

Query: 155 LGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAY 202
           LGGT+W I    E ++YAVD+N+ +E    G             V+E   +P  LI  + 
Sbjct: 186 LGGTIWHIQHGLESIVYAVDWNQARENVFAGAAWLGGAGGGGAEVIEQLRKPTALICSSR 245

Query: 203 NALHNQPP--RQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL-- 257
            A     P  R +R E   D I   +  GG VL+PVDS+ RVLEL  +LE  W   +   
Sbjct: 246 GADRTAQPGGRTKRDEQLIDTIKACVTRGGTVLIPVDSSARVLELSYLLEHAWRTDAASE 305

Query: 258 -----NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET-----SRDNA---------- 297
                +  +Y      SST+ Y +S LEWM DSI + FE       R N           
Sbjct: 306 GGVLKSAKLYLAGRNMSSTMRYARSMLEWMDDSIVQEFEAFAEDQRRVNGANNKKEGGGP 365

Query: 298 FLLKHVTLLINKSEL--------DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLV 349
           F  K++ LL  K+++        +NA    +++LAS +S+E GFS D+    A D +NLV
Sbjct: 366 FDFKYLRLLERKAQIARLLSQNVENAGTEGRVILASDSSIEWGFSKDLIKGLAQDSRNLV 425

Query: 350 LFTER 354
           + T++
Sbjct: 426 ILTDK 430


>gi|145350779|ref|XP_001419775.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580007|gb|ABO98068.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 767

 Score =  156 bits (395), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 96/318 (30%), Positives = 172/318 (54%), Gaps = 15/318 (4%)

Query: 55  IDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD--QYLSRRQVSEFD 112
           +DA+ ++H    H  A+P+   +   +  +F T P   +  + M D  + L  ++ SE  
Sbjct: 64  VDALFVTHFHLDHCAAVPFLCGRTDFNGRIFMTHPTKAIYHMLMQDFCRLLKNQEPSE-Q 122

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
           LF   D++++ + +  + + Q   +    +G+ V P+ AGH+LG  ++ +   G  V+Y 
Sbjct: 123 LFGEKDLEASMKKIEVIDFHQEVDV----DGVKVTPYRAGHVLGACMFNVDIGGLRVLYT 178

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNV 231
            DY+R  ++HL    + + + P V+I ++   +    PR++RE+ F + +   LR GG V
Sbjct: 179 GDYSRIADRHLPAADVPA-IPPHVVIVESTYGVSPHSPREEREIRFTEKVQTILRRGGRV 237

Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           LLPV + GR  ELLLILED+WA++      PIY  + ++   +   ++++  +   +  +
Sbjct: 238 LLPVVALGRAQELLLILEDFWAQNPDLQRVPIYQASALARKAMTIYQTYINVLNSDMKAA 297

Query: 290 FETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLV 349
           FE +  N F+  HV  +   SELD+   GP +VLA+ + L++G S ++F  W  D KN V
Sbjct: 298 FEEA--NPFVFNHVKHVSKSSELDDV--GPCVVLATPSMLQSGLSRELFESWCEDPKNGV 353

Query: 350 LFTERGQFGTLARMLQAD 367
           +  +    GTLAR + +D
Sbjct: 354 IIADFAVQGTLAREILSD 371


>gi|443694305|gb|ELT95478.1| hypothetical protein CAPTEDRAFT_151615 [Capitella teleta]
          Length = 600

 Score =  156 bits (395), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 107/366 (29%), Positives = 171/366 (46%), Gaps = 18/366 (4%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           ++V PL    +      LVSI G N ++DCG    +ND     D S +     +   +D 
Sbjct: 4   IRVVPLGAGQDVGRSCILVSIGGKNLMLDCGMHMGYNDERRFPDFSYINKEGPLTDYLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYMSEMVGFDGPIYMTHPTKAICPILLEDYRKITVERKGETNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
           + I S  +    +   Q   +  + E   +  + AGH+LG  +  I    + V+Y  DYN
Sbjct: 124 EMIKSCMKKTIAMNLHQTIQVDDELE---IKAYYAGHVLGAAMIHIRVGEQSVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   +   +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDR-CRPDLLITESTYATTIRDSKRCRERDFLKKVHDAVDKGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L  PIYF   ++     Y K F+ W    I  +F   + 
Sbjct: 240 FALGRAQELCILLETYWDRMNLKVPIYFSMGLTEKANHYYKMFITWTNQKIKNTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+    +K   DN   GP +V A+   L  G S  IF +W    KN+V+     
Sbjct: 298 NMFDFKHIKPF-DKVYADNP--GPMVVFATPGMLHGGLSLQIFKKWCGGEKNMVIMPGYC 354

Query: 356 QFGTLA 361
             GT+ 
Sbjct: 355 VSGTIG 360


>gi|401624491|gb|EJS42547.1| ysh1p [Saccharomyces arboricola H-6]
          Length = 779

 Score =  156 bits (395), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 105/371 (28%), Positives = 182/371 (49%), Gaps = 23/371 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGL-----LTMYDQYLS 104
           S ID +L+SH    H  +LPY M++      VF T P   +YR  L     +T      S
Sbjct: 59  SKIDILLISHFHLDHAASLPYVMQRTNFQGRVFMTHPTKAIYRWLLRDFVRVTSIGSSSS 118

Query: 105 RRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
                +  LF+ +D+  +F  +  +    +YH +    GI      AGH+LG  +++I  
Sbjct: 119 SMGGKDESLFSDEDLVDSFDKIETV----DYHSTVDVNGIKFTAFHAGHVLGAAMFQIEI 174

Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
            G  V++  DY+R  ++HLN   +       +++   +    ++P   + +     I  T
Sbjct: 175 AGLRVLFTGDYSREVDRHLNSAEVPPLSSNVLIVESTFGTATHEPRLNREKKLTQLIHST 234

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL-----NYPIYFLTYVSSSTIDYVKSFL 279
           +  GG VLLPV + GR  E++LIL++YW++H+        PI++ + ++   +   ++++
Sbjct: 235 VMRGGRVLLPVFALGRAQEIMLILDEYWSQHTDELGGGQVPIFYASNLAKKCMSVFQTYV 294

Query: 280 EWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFV 339
             M D I K F  S+ N F+ K+++ L N  +  +   GP ++LAS   L++G S D+  
Sbjct: 295 NMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF--GPSVMLASPGMLQSGLSRDLLE 352

Query: 340 EWASDVKNLVLFTERGQFGTLAR--MLQADPPPKAV--KVTMSRRVPLVGEELIAYEEEQ 395
            W  + KNLVL T     GT+A+  ML+ D  P     ++T+ RR  +      A+ + Q
Sbjct: 353 RWCPEDKNLVLITGYSIEGTMAKFIMLEPDTIPSINNPEITIPRRCQVEEISFAAHVDFQ 412

Query: 396 TRLKKEEALKA 406
             L+  E + A
Sbjct: 413 ENLEFIEKISA 423


>gi|403346510|gb|EJY72653.1| putative cleavage and polyadenylation specificity factor subunit 2
           [Oxytricha trifallax]
          Length = 853

 Score =  156 bits (395), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 95/365 (26%), Positives = 182/365 (49%), Gaps = 36/365 (9%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPY--AMKQ 77
           L+ +     L+DCG N+ +    L  L  +     +D + LSH   +H+GA+PY  A   
Sbjct: 58  LLKVGDLTILLDCGANESYSLDQLNLLRDIIKEQNVDFIFLSHASMMHVGAIPYLQANGC 117

Query: 78  LGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHL 137
           L     V ST P  ++G LTMY+ ++ +++ + FD FTL D++ +F+ +  ++Y++N  +
Sbjct: 118 LDFQLKVMSTSPTAKMGALTMYEFFIQKKESANFDYFTLQDVEKSFERIELVSYNENRKI 177

Query: 138 SGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTV---LESFVRP 194
             +   ++++   +G+ +GG  WKI  + + ++YAV+ N   +K L+ T+    E F   
Sbjct: 178 RMRETELILSALPSGNSIGGACWKIEYNKQTIVYAVELN---DKPLHITIPMKFEDFKNA 234

Query: 195 AVLITDAY----NALHNQPPRQQREMFQDAISKTLRAG---------GNVLLPVDSAGRV 241
            +LIT+A+    +   NQ  +Q  +++Q    + L+           G +L+PV    R+
Sbjct: 235 NILITNAFLTPKSFKSNQKIQQAPKIYQFLSEEKLKIKLEKVIADNMGQILIPVTDKNRI 294

Query: 242 LELLLILEDYWAEHS-------------LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITK 288
           L+ L++LE+ +  +S             +  PI +L Y+S  T+   +S L WM     K
Sbjct: 295 LQCLIMLENMFQTNSKLQSVFKNPQNQLMTMPIVYLEYMSRDTLGVGRSHLGWMNFQDNK 354

Query: 289 SFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNL 348
            F+   +N    + V  +    E       P++++ S+AS   G++  +  E++   KN 
Sbjct: 355 VFQDIDENPINFQFVKDIFTLDEYRKLEHSPRIIVTSLASFSQGYTKQLIYEFSQVPKNE 414

Query: 349 VLFTE 353
           ++F +
Sbjct: 415 IVFLQ 419


>gi|326495416|dbj|BAJ85804.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 704

 Score =  156 bits (395), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 110/386 (28%), Positives = 180/386 (46%), Gaps = 42/386 (10%)

Query: 2   GTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCG------------WNDHFDPSLLQPLS 49
           G  + VTPL            ++  G   L DCG            + D  DPS      
Sbjct: 33  GDQMVVTPLGAGSEVGRSCVHMTFKGRTVLFDCGIHPAYSGMAALPYFDEIDPS------ 86

Query: 50  KVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRR 106
                ID +L++H    H  +LPY +++      VF   +T+ +YRL    +   Y+   
Sbjct: 87  ----AIDVLLVTHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYRL----LLSDYVKVS 138

Query: 107 QVSEFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKD 165
           +VS  D LF   DI  +   +  + + Q   ++G    I    + AGH+LG  ++ +   
Sbjct: 139 KVSVEDMLFDEQDIIRSMDKIEVIDFHQTLEVNG----IRFWCYTAGHVLGAAMFMVDIA 194

Query: 166 GEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTL 225
           G  ++Y  DY+R +++HL    +  F     +I   Y    +QP   + + F DAI  T+
Sbjct: 195 GVRILYTGDYSREEDRHLKAAEIPQFSPDICIIESTYGVQQHQPRHVREKRFTDAIHNTV 254

Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMG 283
             GG VL+P  + GR  ELLLIL++YW+ H      PIY+ + ++   +   ++++  M 
Sbjct: 255 SQGGRVLIPAFALGRAQELLLILDEYWSNHPELHKIPIYYASPLAKKCMAVYQTYINSMN 314

Query: 284 DSITKSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWA 342
           + I   F  S  N F  KH+  L   + +DN  D GP +V+AS  SL++G S  +F +W 
Sbjct: 315 ERIRNQFAQS--NPFHFKHIDPL---NSIDNFHDVGPSVVMASPGSLQSGLSRQLFDKWC 369

Query: 343 SDVKNLVLFTERGQFGTLARMLQADP 368
           +D KN  +       G+LA+ +  +P
Sbjct: 370 TDKKNTCVIPGYAVEGSLAKTIINEP 395


>gi|210075949|ref|XP_504965.2| YALI0F03817p [Yarrowia lipolytica]
 gi|223634672|sp|Q6C2Z7.2|YSH1_YARLI RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
           3'-end-processing protein YSH1
 gi|199424917|emb|CAG77772.2| YALI0F03817p [Yarrowia lipolytica CLIB122]
          Length = 827

 Score =  156 bits (394), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 105/366 (28%), Positives = 182/366 (49%), Gaps = 37/366 (10%)

Query: 21  YLVSIDGFNFLIDCG------------WNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHL 68
           +++S  G   ++D G            + D FD           STID +L+SH    H 
Sbjct: 53  HVISFKGKTIMLDAGVHPAHSGLASLPFYDEFD----------LSTIDILLISHFHLDHA 102

Query: 69  GALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQS 125
            +LPY M++      VF T P   +YR  LL+ + +  S  + S+ DL++  D+ ++F  
Sbjct: 103 ASLPYVMQKTNFKGRVFMTHPTKGIYRW-LLSDFVRVTSGAE-SDPDLYSEADLTASFNK 160

Query: 126 VTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNG 185
           +  +    +YH + +  G+    + AGH+LG  ++ I   G  V++  DY+R +++HLN 
Sbjct: 161 IETI----DYHSTMEVNGVKFTAYHAGHVLGAAMYTIEVGGVKVLFTGDYSREEDRHLNQ 216

Query: 186 TVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLEL 244
             +   ++P +LI ++        PR +RE      I  TL  GG  LLPV + GR  E+
Sbjct: 217 AEVPP-MKPDILICESTYGTGTHLPRLEREQRLTGLIHSTLDKGGKCLLPVFALGRAQEI 275

Query: 245 LLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKH 302
           LLIL++YW  H     + IY+ + ++   I   ++++  M D+I + F   + N F  K+
Sbjct: 276 LLILDEYWEAHPDLQEFSIYYASALAKKCIAVYQTYINMMNDNIRRRFRDQKTNPFRFKY 335

Query: 303 VTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLAR 362
           +  + N    D+   GP +++AS   L++G S  +   WA D KN ++ T     GT+A+
Sbjct: 336 IKNIKNLDRFDDM--GPCVMVASPGMLQSGVSRSLLERWAPDPKNTLILTGYSVEGTMAK 393

Query: 363 MLQADP 368
            +  +P
Sbjct: 394 QIINEP 399


>gi|401837471|gb|EJT41396.1| YSH1-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 779

 Score =  156 bits (394), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 105/371 (28%), Positives = 181/371 (48%), Gaps = 23/371 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGL-----LTMYDQYLS 104
           S ID +L+SH    H  +LPY M++      VF T P   +YR  L     +T      S
Sbjct: 59  SKIDILLISHFHLDHAASLPYVMQRTNFQGRVFMTHPTKAIYRWLLRDFVRVTSIGSSSS 118

Query: 105 RRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
                +  LF+ +D+  +F  +  +    +YH +    GI      AGH+LG  +++I  
Sbjct: 119 SMGGKDESLFSDEDLVDSFDKIETV----DYHSTVDVNGIKFTAFHAGHVLGAAMFQIEI 174

Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
            G  V++  DY+R  ++HLN   +       +++   +    ++P   +       I  T
Sbjct: 175 AGLRVLFTGDYSREVDRHLNSAEVPPLSSNVLIVESTFGTATHEPRLNRERKLTQLIHST 234

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHS-----LNYPIYFLTYVSSSTIDYVKSFL 279
           +  GG VLLPV + GR  E++LIL++YW++H+        PI++ + ++   +   ++++
Sbjct: 235 VMRGGRVLLPVFALGRAQEIMLILDEYWSQHADELGGGQVPIFYASNLAKKCMSVFQTYV 294

Query: 280 EWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFV 339
             M D I K F  S+ N F+ K+++ L N  +  +   GP ++LAS   L++G S D+  
Sbjct: 295 NMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF--GPSVMLASPGMLQSGLSRDLLE 352

Query: 340 EWASDVKNLVLFTERGQFGTLAR--MLQAD--PPPKAVKVTMSRRVPLVGEELIAYEEEQ 395
            W  + KNLVL T     GT+A+  ML+ D  P     ++T+ RR  +      A+ + Q
Sbjct: 353 RWCPEDKNLVLITGYSIEGTMAKFIMLEPDTIPSINNSEITIPRRCQVEEISFAAHVDFQ 412

Query: 396 TRLKKEEALKA 406
             L+  E + A
Sbjct: 413 ENLEFIEKISA 423


>gi|198413502|ref|XP_002128796.1| PREDICTED: similar to cleavage and polyadenylation specific factor
           3-like [Ciona intestinalis]
          Length = 605

 Score =  156 bits (394), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 102/367 (27%), Positives = 174/367 (47%), Gaps = 19/367 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPL--------SKVASTID 56
           +++ PL    +      +V++ G N ++DCG +  F+     P           +   ID
Sbjct: 4   IKLVPLGAGQDVGRSCIIVTLGGKNIMLDCGMHMGFNDERRFPYFDYITGGKGTLTEHID 63

Query: 57  AVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFT 115
            V++SH    H GALPY  +  G   P++ T P   +  + + D + ++  +  E + F 
Sbjct: 64  CVIISHFHLDHCGALPYMSEMKGYDGPIYMTHPTKAICPILLEDYRKITVDRKGETNFFD 123

Query: 116 LDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDY 175
              I    + V  +   Q  H+  + E   +  + AGH+LG  ++ +    + V+Y  DY
Sbjct: 124 SKMIKDCMKKVIPVNLHQTIHVDDQLE---IKAYYAGHVLGAAMFLLKVGTDSVLYTGDY 180

Query: 176 NRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLP 234
           N   ++HL    ++   RP VLIT++  A   +  ++ RE  F   + + +  GG VL+P
Sbjct: 181 NMTPDRHLGAAWVDK-CRPDVLITESTYATTIRDSKRCRERDFLKKVHERVEDGGKVLIP 239

Query: 235 VDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSR 294
           V + GR  EL ++LE YW   +L  PIYF   +++   +Y K F+ W    I  +F    
Sbjct: 240 VFALGRAQELCILLESYWDRMNLKVPIYFSAGLTNKATEYYKLFITWTNQKIKDTF--VE 297

Query: 295 DNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
            N F  KH+    N+S +DN   GP +V A+   L  G S +IF  W ++ KN+++    
Sbjct: 298 RNMFDFKHIKEF-NRSYIDNP--GPMVVFATPGMLHGGLSLEIFKRWCTNEKNMIIMPGY 354

Query: 355 GQFGTLA 361
              GT+ 
Sbjct: 355 CVAGTVG 361


>gi|326487902|dbj|BAJ89790.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 704

 Score =  156 bits (394), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 110/386 (28%), Positives = 180/386 (46%), Gaps = 42/386 (10%)

Query: 2   GTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCG------------WNDHFDPSLLQPLS 49
           G  + VTPL            ++  G   L DCG            + D  DPS      
Sbjct: 33  GDQMVVTPLGAGSEVGRSCVHMTFKGRTVLFDCGIHPAYSGMAALPYFDEIDPS------ 86

Query: 50  KVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRR 106
                ID +L++H    H  +LPY +++      VF   +T+ +YRL    +   Y+   
Sbjct: 87  ----AIDVLLVTHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYRL----LLSDYVKVS 138

Query: 107 QVSEFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKD 165
           +VS  D LF   DI  +   +  + + Q   ++G    I    + AGH+LG  ++ +   
Sbjct: 139 KVSVEDMLFDEQDIIRSMDKIEVIDFHQTLEVNG----IRFWCYTAGHVLGAAMFMVDIA 194

Query: 166 GEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTL 225
           G  ++Y  DY+R +++HL    +  F     +I   Y    +QP   + + F DAI  T+
Sbjct: 195 GVRILYTGDYSREEDRHLKAAEIPQFSPDICIIESTYGVQQHQPRHVREKRFTDAIHNTV 254

Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMG 283
             GG VL+P  + GR  ELLLIL++YW+ H      PIY+ + ++   +   ++++  M 
Sbjct: 255 SQGGRVLIPAFALGRAQELLLILDEYWSNHPELHKIPIYYASPLAKKCMAVYQTYINSMN 314

Query: 284 DSITKSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWA 342
           + I   F  S  N F  KH+  L   + +DN  D GP +V+AS  SL++G S  +F +W 
Sbjct: 315 ERIRNQFAQS--NPFHFKHIDPL---NSIDNFHDVGPSVVMASPGSLQSGLSRQLFDKWC 369

Query: 343 SDVKNLVLFTERGQFGTLARMLQADP 368
           +D KN  +       G+LA+ +  +P
Sbjct: 370 TDKKNTCVIPGYAVEGSLAKTIINEP 395


>gi|19074744|ref|NP_586250.1| similarity to HYPOTHETICAL PROTEIN YO47_METJA [Encephalitozoon
           cuniculi GB-M1]
 gi|19069386|emb|CAD25854.1| similarity to HYPOTHETICAL PROTEIN YO47_METJA [Encephalitozoon
           cuniculi GB-M1]
 gi|449329879|gb|AGE96147.1| hypothetical protein ECU10_1350 [Encephalitozoon cuniculi]
          Length = 496

 Score =  156 bits (394), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 107/368 (29%), Positives = 179/368 (48%), Gaps = 27/368 (7%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQP----LSKVAS---TIDA 57
           + V PL    +      LVSI G   + DCG +  F+     P    +SK  S    ID 
Sbjct: 1   MNVIPLGAGQDVGRSCILVSIKGRTIMFDCGMHMGFNDERRFPDFSYISKTKSFDKVIDC 60

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLG--LLTMYDQYLSRRQVSEFDLFT 115
           +++SH    H GALPY  +  G   P++ T P   +   LL  + + ++ +  S   +FT
Sbjct: 61  IIISHFHLDHCGALPYFTEVCGYGGPIYMTLPTKEVCPVLLDDFRKIVAGKGDS---IFT 117

Query: 116 LDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDY 175
             DI +  + V  ++ ++ Y      E   + P+ AGH+LG  ++ +    + V+Y  DY
Sbjct: 118 YQDISNCMKKVVTISMNETYK---HDEDFYITPYYAGHVLGAAMFHVVVGDQSVVYTGDY 174

Query: 176 NRRKEKHLNGTVLESFVRPAVLITDA-YNALHNQPPRQQREMFQDAISKTLRAGGNVLLP 234
           +   +KHL    ++  +RP +LIT++ Y ++     + +   F  A+S  +  GG VL+P
Sbjct: 175 STTPDKHLGPASIKC-IRPDLLITESTYGSITRDCRKVKEREFLKAVSDCVARGGRVLIP 233

Query: 235 VDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS-FETS 293
           + + GR  EL L+L+ YW    L  P+YF + ++    +  K F+ +  +++ K  FE  
Sbjct: 234 IFALGRAQELCLLLDGYWERTGLKTPVYFSSGLTEKANEIYKKFISYTNETVRKKIFER- 292

Query: 294 RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL--- 350
             N F  KH+     +  +++   GP ++ AS   L +G S  IF EW  D KNLV+   
Sbjct: 293 --NMFEYKHIKPF-QRHYMESK--GPMVLFASPGMLHSGMSLKIFKEWCEDEKNLVIIPG 347

Query: 351 FTERGQFG 358
           +  RG  G
Sbjct: 348 YCVRGTIG 355


>gi|357117889|ref|XP_003560694.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-I-like [Brachypodium distachyon]
          Length = 690

 Score =  156 bits (394), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 109/386 (28%), Positives = 180/386 (46%), Gaps = 42/386 (10%)

Query: 2   GTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCG------------WNDHFDPSLLQPLS 49
           G  + +TPL            ++  G   L DCG            + D  DPS      
Sbjct: 18  GDQMVITPLGAGSEVGRSCVHMTFKGRTVLFDCGIHPAYSGMAALPYFDEIDPS------ 71

Query: 50  KVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRR 106
                ID +L++H    H  +LPY +++      VF   +T+ +YRL    +   Y+   
Sbjct: 72  ----AIDVLLVTHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYRL----LLSDYVKVS 123

Query: 107 QVSEFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKD 165
           +VS  D LF   DI  +   +  + + Q   ++G    I    + AGH+LG  ++ +   
Sbjct: 124 KVSVEDMLFDEQDIIRSMDKIEVIDFHQTLEVNG----IRFWCYTAGHVLGAAMFMVDIA 179

Query: 166 GEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTL 225
           G  ++Y  DY+R +++HL    +  F     ++   Y    +QP   + + F DAI  T+
Sbjct: 180 GVRILYTGDYSREEDRHLKAAEIPQFSPDVCIVESTYGVQQHQPRHVREKRFTDAIHNTV 239

Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMG 283
             GG VL+P  + GR  ELLLIL++YW+ H      PIY+ + ++   +   ++++  M 
Sbjct: 240 SQGGRVLIPAFALGRAQELLLILDEYWSNHPELQKIPIYYASPLAKKCMAVYQTYINSMN 299

Query: 284 DSITKSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWA 342
           + I   F  S  N F  KH+  L   + +DN  D GP +V+AS  SL++G S  +F +W 
Sbjct: 300 ERIRNQFAQS--NPFHFKHIEPL---NSIDNFHDVGPSVVMASPGSLQSGLSRQLFDKWC 354

Query: 343 SDVKNLVLFTERGQFGTLARMLQADP 368
           +D KN  +       GTLA+ +  +P
Sbjct: 355 TDKKNTCVIPGYVIEGTLAKTIINEP 380


>gi|384486005|gb|EIE78185.1| hypothetical protein RO3G_02889 [Rhizopus delemar RA 99-880]
          Length = 613

 Score =  156 bits (394), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 95/312 (30%), Positives = 162/312 (51%), Gaps = 11/312 (3%)

Query: 41  DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD 100
           D S +         IDAV++SH    H GALP+  + LG   P++ T P   +  + + D
Sbjct: 24  DFSYISKTGNFTDIIDAVIISHFHLDHCGALPFFTEMLGYDGPIYMTHPTKAICPILLED 83

Query: 101 -QYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTV 159
            + ++  +  E + FT   I +  + V  ++  Q   +  + E   +  + AGH+LG  +
Sbjct: 84  YRKITVERKGETNFFTSAMIKNCMKKVHAVSLHQTIKVDDELE---IKAYYAGHVLGAAM 140

Query: 160 WKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQ 218
           + +    E V+Y  DYN   ++HL    ++  VRP VL+T++  A   +  ++ RE  F 
Sbjct: 141 FYVRVGQESVVYTGDYNMTPDRHLGSAWIDK-VRPDVLVTESTYATTIRDSKRSRERDFL 199

Query: 219 DAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSF 278
             + + +  GGNV++PV + GR  EL +++E YW    L+ PIYF T ++    ++ K F
Sbjct: 200 TKVHECVLNGGNVIIPVFALGRAQELCILIESYWDRMGLDVPIYFSTGLTERATEFYKLF 259

Query: 279 LEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIF 338
           + W    I  +F  S+ N F  KH+    N++ +D    GPK++ A+   L AG S ++F
Sbjct: 260 INWTNQKIKSTF--SQRNMFDFKHIKTW-NRNYIDQP--GPKVLFATPGMLNAGTSLEVF 314

Query: 339 VEWASDVKNLVL 350
            +WA D KN+V+
Sbjct: 315 KKWAPDPKNMVI 326


>gi|406601461|emb|CCH46911.1| hypothetical protein BN7_6516 [Wickerhamomyces ciferrii]
          Length = 679

 Score =  155 bits (393), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 100/322 (31%), Positives = 169/322 (52%), Gaps = 14/322 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLSRRQVS 109
           ST+D +L+SH    H  +LPY M+       VF T P   +YR  LL+ + +  S    S
Sbjct: 25  STVDILLISHFHLDHAASLPYVMQHTNFKGRVFMTHPTKAIYRW-LLSDFVKVTSIGSSS 83

Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
              L+T +D+  +F  +  +    +YH + + +GI    + AGH+LG  ++ I   G  +
Sbjct: 84  SSALYTDEDLSESFDRIETI----DYHSTIEVDGIRFTAYHAGHVLGAAMFFIEIGGLKL 139

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAG 228
           ++  DY+R + +HLN   +    +P V++T++        PR ++E+   + I  TL  G
Sbjct: 140 LFTGDYSREENRHLNPAEVPP-TKPDVMVTESTFGTATHEPRLEKEVRLTNLIHSTLIKG 198

Query: 229 GNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSI 286
           G VLLPV + G   ELLLIL++YW++H    N  +Y+ + ++   +   ++++  M D+I
Sbjct: 199 GRVLLPVFALGTAQELLLILDEYWSQHQDLENVNVYYASSLAKKCLAVFQTYINMMNDNI 258

Query: 287 TKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVK 346
            K F     N F  K++  + N  + D+   GP +V+AS   L+ G S ++   WA D +
Sbjct: 259 RKQFRDQNSNPFQFKYIKNIKNLDKFDDF--GPCVVVASPGMLQNGVSRELLERWAPDSR 316

Query: 347 NLVLFTERGQFGTLARMLQADP 368
           N V+ T     GTLA+ L  +P
Sbjct: 317 NSVILTGYSVEGTLAKTLLTEP 338


>gi|363750442|ref|XP_003645438.1| hypothetical protein Ecym_3113 [Eremothecium cymbalariae
           DBVPG#7215]
 gi|356889072|gb|AET38621.1| Hypothetical protein Ecym_3113 [Eremothecium cymbalariae
           DBVPG#7215]
          Length = 773

 Score =  155 bits (393), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 97/346 (28%), Positives = 174/346 (50%), Gaps = 24/346 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYL-----S 104
           S +D +L+SH    H  +LPY M++      VF T P   +YR  LL+ + +       +
Sbjct: 61  SKVDVLLISHFHLDHAASLPYVMQRTNFQGRVFMTHPTKAIYRW-LLSDFVKVTNIGNGT 119

Query: 105 RRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
                + +L+T +D+  +F  +  +    ++H +    GI    + AGH+LG  ++++  
Sbjct: 120 AASSGDENLYTDEDLAESFDKIETV----DFHSTIDVNGIKFTAYHAGHVLGAAMFQVEI 175

Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
            G  +++  DY+R  ++HLN   + S     +++   +    ++P   +       I  T
Sbjct: 176 AGLRILFTGDYSRELDRHLNSAEVPSLPSDILIVESTFGTATHEPRVSKERKLTQLIHTT 235

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL-----NYPIYFLTYVSSSTIDYVKSFL 279
           +  GG VLLPV + GR  E++LIL++YW++H+        PI++ + ++   +   ++++
Sbjct: 236 VAKGGRVLLPVFALGRAQEIMLILDEYWSQHAEELGTGQVPIFYASNLARKCMSVFQTYV 295

Query: 280 EWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFV 339
             M D I K F  S+ N F+ K+++ L N  E  +   GP ++LAS   L+ G S D+  
Sbjct: 296 NMMNDKIRKKFRDSQTNPFIFKNISYLKNLDEFQDF--GPSVMLASPGMLQNGLSRDLLE 353

Query: 340 EWASDVKNLVLFTERGQFGTLARML----QADPPPKAVKVTMSRRV 381
           +W  D KNLVL T     GT+A+ L    ++ P      VT+ RR 
Sbjct: 354 KWCPDEKNLVLITGYSVEGTMAKFLILEPESIPSINNPDVTIPRRC 399


>gi|402080824|gb|EJT75969.1| hypothetical protein GGTG_05894 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 974

 Score =  155 bits (393), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 125/428 (29%), Positives = 185/428 (43%), Gaps = 81/428 (18%)

Query: 8   TPLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDT 65
           +PL G  +E   S  L+ +DG    LID GW++  D   L+ L K   T+  +LL+H   
Sbjct: 5   SPLQGALSEATASQSLLELDGGVKVLIDVGWDETLDIEKLKELEKQVPTLSLILLTHATV 64

Query: 66  LHLGALPYAMKQLGLSA--PVFSTEPVYRLGLLTMYDQY--------------------- 102
            HL A  +  K   L A  PV++T+PV  LG   + D Y                     
Sbjct: 65  PHLSAFVHCCKHFPLFARIPVYATQPVIDLGRTLIQDLYSSTPLAATTIPDTSLAEAAFS 124

Query: 103 LSRRQVSEFDLF---TLDDIDSAFQSVTRLTYSQNYH-----LSGKGEGIVVAPHVAGHL 154
            S+ Q S   L    T ++I   F  +  L YSQ +       S    G+ +  + +GH 
Sbjct: 125 YSQPQFSNNFLLQAPTTEEIAKYFSLIQPLKYSQPHQPLASPFSPPLNGLTITAYNSGHS 184

Query: 155 LGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT-------------VLESFVRPAVLITDA 201
           LGGT+W I    E ++YAVD+N  ++    G              V+E   +P  LI  A
Sbjct: 185 LGGTIWHIQHGLESIVYAVDWNLARDNVYAGAAWMGSGHGSGGAEVMEQLRKPTALICSA 244

Query: 202 YNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNY-- 259
                      + +   D + +T+  GG VL+P+DS+ RVLEL  +LE  W   +     
Sbjct: 245 RAGEGGLSRGARDQQLLDTMRRTVARGGTVLIPIDSSARVLELAYLLEHAWRSEASGVTE 304

Query: 260 -------PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA--------------- 297
                   +Y      +STI   KS  EWM DSI + FE   D                 
Sbjct: 305 AGALGTAKLYLAGRSVNSTIRLAKSMFEWMDDSIVQEFEAVADQGGKRTNGNTDGGRGRD 364

Query: 298 ---FLLKHVTLLINKSELD------NAPD--GPKLVLASMASLEAGFSHDIFVEWASDVK 346
              F  K++ +L  K++++      + P+    K++LAS  SLE GFS D+    A D +
Sbjct: 365 AGPFDFKYLRVLDRKAQVEKVLSQSSTPNELRGKVILASDTSLEWGFSKDVMARIADDSR 424

Query: 347 NLVLFTER 354
           NLV+ TE+
Sbjct: 425 NLVILTEK 432



 Score = 49.7 bits (117), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 30/90 (33%), Positives = 48/90 (53%), Gaps = 1/90 (1%)

Query: 534 VLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKL 593
           +LV GSA+ TE +   C ++    VYTP +  ++D + D  A+ V+LSE L+  + ++ +
Sbjct: 744 ILVAGSADETEAVADDCRRNAI-EVYTPPVGASVDASVDTNAWVVKLSEPLVKRLRWQTV 802

Query: 594 GDYEIAWVDAEVGKTENGMLSLLPISTPAP 623
               I  V A +  T     SL P S+ AP
Sbjct: 803 RGLGIVTVTAHLTATPVAQKSLPPPSSTAP 832


>gi|213409816|ref|XP_002175678.1| endoribonuclease ysh1 [Schizosaccharomyces japonicus yFS275]
 gi|212003725|gb|EEB09385.1| endoribonuclease ysh1 [Schizosaccharomyces japonicus yFS275]
          Length = 771

 Score =  155 bits (393), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 94/319 (29%), Positives = 168/319 (52%), Gaps = 12/319 (3%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L++H    H  ALPY M++      VF T P   +    + D         E  
Sbjct: 40  STVDILLITHFHLDHAAALPYVMQKTNFRGRVFMTHPTKAVCKWLLSDYVRVSNVGVEDQ 99

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
           L+   D+ +AF+ +  +    +YH + + EG+   P  AGH+LG  ++ I   G  ++Y 
Sbjct: 100 LYDEKDLAAAFERMEAV----DYHSTIEVEGVKFTPFHAGHVLGACMYFIEIAGVKLLYT 155

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDA-YNALHNQPPRQQREMFQDAISKTLRAGGNV 231
            D++R +++HLN   +    +P +LI+++ Y    +QP   +     + +  T+R GG V
Sbjct: 156 GDFSREEDRHLNIAEVPP-QKPNILISESTYGTASHQPRLDKEARLLNLVHTTVRNGGRV 214

Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           L+PV + GR  ELLLIL++YW  H+   + PIY+ + ++   +   ++++  M D I K+
Sbjct: 215 LMPVFALGRAQELLLILDEYWHSHAELRSVPIYYASSLARKCMAVYQTYINMMNDKIRKA 274

Query: 290 FETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLV 349
           F  +  N F+ +++  L +  + D+   GP ++LAS   L+ G S  +   WA D +N +
Sbjct: 275 F--AERNPFIFRYIKSLRSIDKFDDI--GPSVILASPGMLQNGVSRTLLERWAPDARNTL 330

Query: 350 LFTERGQFGTLARMLQADP 368
           L T     GT+A+++  +P
Sbjct: 331 LLTGYSVEGTMAKLIANEP 349


>gi|406865774|gb|EKD18815.1| RNA-metabolising metallo-beta-lactamase [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 1331

 Score =  155 bits (393), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 121/414 (29%), Positives = 179/414 (43%), Gaps = 86/414 (20%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSA--PV 84
           G   LID GW++ FD + L+ L K   T+  +LL+H    H+ A  +  K   L +  PV
Sbjct: 26  GVKVLIDVGWDETFDVAKLKELEKQVPTLSIILLTHATVSHIAAFAHCCKHFPLFSRIPV 85

Query: 85  FSTEPVYRLGLLTM-------------------------YDQYLSRRQVSEFDLF--TLD 117
           ++T PV  LG   +                         Y Q +S  Q +   L   T +
Sbjct: 86  YATLPVISLGRTLVQNIYASTPLSATIIPHSALSEASYAYSQTISANQDANILLQPPTSE 145

Query: 118 DIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
           +I S F  +  L YSQ +            G+ +  + AGH LGGT+W I    E ++YA
Sbjct: 146 EIASYFALIHPLKYSQPHQPLPSPFSPPLNGLAITAYNAGHTLGGTIWHIQHGLESIVYA 205

Query: 173 VDYNRRKEK------------HLNGTVLESFVRPAVLITDAYNALHNQPP--RQQR-EMF 217
           VD+N+ +E                  V+E   +P  LI  +     +  P  R +R E+ 
Sbjct: 206 VDWNQARENVLAGAAWLGGAGAGGAEVIEQLRKPTALICSSRGGERHALPGGRAKRDELL 265

Query: 218 QDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA-------EHSLNYPIYFLTYVSSS 270
            + I  ++  GG VL+P DS+ RVLEL  +LE  W         H     +Y  +    +
Sbjct: 266 LEMIKTSVSQGGIVLIPTDSSARVLELAYLLEHVWRTESKDEDSHLRGAKLYLASRNIGA 325

Query: 271 TIDYVKSFLEWMGDSITKSFE--------------------TSRDNAFLLKHVTLLINKS 310
           T+ Y +S LEWM D+I + FE                    +S    F  KH+ LL  K 
Sbjct: 326 TMRYARSMLEWMDDAIIREFEANAGINQKETGSKAAGDAKGSSDGGPFDFKHLRLLERKG 385

Query: 311 ELD----------NAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
           ++D          +     K++LAS ASLE GFS DI    A D +NL++ TE+
Sbjct: 386 QIDRIMGQTDIDRHGRSIGKVILASDASLEWGFSRDILKAVADDTRNLIILTEK 439


>gi|340381556|ref|XP_003389287.1| PREDICTED: integrator complex subunit 11-like [Amphimedon
           queenslandica]
          Length = 610

 Score =  155 bits (393), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 106/359 (29%), Positives = 171/359 (47%), Gaps = 19/359 (5%)

Query: 3   TSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHFDPSLLQPLSKVAST---- 54
           + +++ PL    +      LVS+ G N + DCG    +ND         ++    T    
Sbjct: 2   SDIRIVPLGAGQDVGRSCILVSMGGKNIMFDCGMHMGYNDERRFPDFTYITDTGQTLHDY 61

Query: 55  IDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDL 113
           I+ V+LSH    H GALPY  +  G + P++ T P   +  + + D + +   +  E + 
Sbjct: 62  INCVILSHFHLDHCGALPYFTEMCGYNGPIYMTHPTKAICPVLLEDFRRVCVDKKGEQNF 121

Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAV 173
           FT   I    + V  +   Q   +  + E   +  + AGH+LG  ++ +    + V+Y  
Sbjct: 122 FTSQMIKDCMRKVITVNLHQCVKVDDQLE---IKAYYAGHVLGAAMFHVRVGHQSVVYTG 178

Query: 174 DYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVL 232
           DYN   ++HL G+      RP +LIT++  A   +  ++ RE  F   + + L   G VL
Sbjct: 179 DYNMTPDRHL-GSAWIDRCRPDLLITESTYATTIRDSKRCRERDFLKKLHECLERDGKVL 237

Query: 233 LPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET 292
           +PV + GR  EL ++LE YW   +L YPIYF T ++     Y K F+ W    I  +F  
Sbjct: 238 IPVFALGRAQELCILLESYWERMNLKYPIYFSTGLTEKANHYYKLFISWTNQKIKNTF-- 295

Query: 293 SRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
              N F  KH+    ++S +D    GP +V A+   L AG S  IF +WA D KN+++ 
Sbjct: 296 IHRNMFDFKHIKAF-DRSYIDQP--GPMIVFATPGMLHAGLSLQIFKKWAEDEKNMLIM 351


>gi|403216468|emb|CCK70965.1| hypothetical protein KNAG_0F03030 [Kazachstania naganishii CBS
           8797]
          Length = 820

 Score =  155 bits (393), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 97/331 (29%), Positives = 169/331 (51%), Gaps = 19/331 (5%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGL---LTMYDQYLSRR 106
           ST+D +L+SH    H  +LPY M++      VF T P   +YR  L   + +    +   
Sbjct: 59  STVDILLISHFHLDHAASLPYVMQRTPFKGRVFMTHPTKAIYRWLLRDFVRVTAIGVDST 118

Query: 107 QVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG 166
             +E  L+T +D+  +F  +  +    +YH + +  GI    + AGH+LG  +++I   G
Sbjct: 119 LAAEESLYTDEDLAESFDKIETI----DYHSTVEVNGIKFTAYHAGHVLGAAMFQIEIAG 174

Query: 167 EDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLR 226
             +++  DY+R  ++HLN   +       +++   +    ++P   +       I  T+ 
Sbjct: 175 LKILFTGDYSREMDRHLNSAEVPPQSSDILVVESTFGTATHEPRLHRENKLTQLIHTTVG 234

Query: 227 AGGNVLLPVDSAGRVLELLLILEDYWAEH-----SLNYPIYFLTYVSSSTIDYVKSFLEW 281
            GG VL+PV + GR  EL+LIL++YW +H     S   PI++ + ++   +   ++++  
Sbjct: 235 RGGRVLMPVFALGRAQELMLILDEYWQKHSDELGSGQVPIFYASDLARKCMSVFQTYVNM 294

Query: 282 MGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEW 341
           M D I K F  S+ N F+ K+++ L N  E  +   GP ++LAS   L++G S D+  +W
Sbjct: 295 MNDDIRKKFRDSQTNPFIFKNISYLKNLEEFQDF--GPSVMLASPGMLQSGLSRDLLEKW 352

Query: 342 ASDVKNLVLFTERGQFGTLAR--MLQADPPP 370
             + KNLVL T     GT+A+  ML+ D  P
Sbjct: 353 CPEQKNLVLITGYSVEGTMAKYIMLEPDTIP 383


>gi|297739612|emb|CBI29794.3| unnamed protein product [Vitis vinifera]
          Length = 581

 Score =  155 bits (392), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 116/396 (29%), Positives = 186/396 (46%), Gaps = 44/396 (11%)

Query: 2   GTSVQVTPLSGVFNENPLSYL-VSIDGFNFLIDCG------------WNDHFDPSLLQPL 48
           G  + +TPL G  NE   S + +S  G   L DCG            + D  DPS     
Sbjct: 20  GDQLIITPL-GAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPS----- 73

Query: 49  SKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSR 105
                TID +L++H    H  +LPY +++      VF   +T+ +Y+L    +   Y+  
Sbjct: 74  -----TIDVLLVTHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKL----LLSDYVKV 124

Query: 106 RQVSEFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
            +VS  D L+   DI  +   +  + + Q   ++G    I    + AGH+LG  ++ +  
Sbjct: 125 SKVSVEDMLYDEQDILRSMDKIEVIDFHQTLEVNG----IRFWCYTAGHVLGAAMFMVDI 180

Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
            G  V+Y  DY+R +++HL    +  F     +I   Y    +QP   + + F D I  T
Sbjct: 181 AGVRVLYTGDYSREEDRHLRAAEIPQFCPDICIIESTYGVQLHQPRHVREKRFTDVIHST 240

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWM 282
           +  GG VL+P  + GR  ELLLIL++YW+ H    N PIY+ + ++   +   ++++  M
Sbjct: 241 ISQGGRVLIPAYALGRAQELLLILDEYWSNHPELHNVPIYYASPLAKRCMAVYQTYINSM 300

Query: 283 GDSITKSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEW 341
            + I   F  S  N F  KH++ L     ++N  D GP +V+AS   L++G S  +F  W
Sbjct: 301 NERIRNQFANS--NPFDFKHISPL---KSIENFNDVGPSVVMASPGGLQSGLSRQLFDMW 355

Query: 342 ASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTM 377
            SD KN  +       GTLA+ +  +P      V M
Sbjct: 356 CSDKKNACVIPGYVVGGTLAKTIINEPKENCQSVEM 391


>gi|156379813|ref|XP_001631650.1| predicted protein [Nematostella vectensis]
 gi|156218694|gb|EDO39587.1| predicted protein [Nematostella vectensis]
          Length = 688

 Score =  155 bits (391), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 104/387 (26%), Positives = 193/387 (49%), Gaps = 24/387 (6%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVAST--IDAVLLSH 62
           +++TPL          +++   G   ++DCG +         P      T  ID +L+SH
Sbjct: 21  LRITPLGSGQEVGRSCHILEFKGKKVMLDCGIHPGMTGVESLPFLDEIDTAEIDLLLVSH 80

Query: 63  PDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDD 118
               H G+LP+ +++      VF   +T+ +YR     +   Y+    ++  D LFT  D
Sbjct: 81  FHLDHCGSLPWLLEKTTFKGRVFMTHATKAIYRW----LLSDYVKVSNIAAEDMLFTESD 136

Query: 119 IDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRR 178
           ++ +   +  L + Q   + G    I    + AGH+LG  ++ +   G  ++Y  D++R+
Sbjct: 137 LEKSMDKIETLHFHQEKEVGG----IKFWCYHAGHVLGACMFMLEIAGVKILYTGDFSRQ 192

Query: 179 KEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDS 237
           +++HL    + S + P VLI ++    H    R++RE  F   +   +  GG  L+PV +
Sbjct: 193 EDRHLMAAEIPS-ISPDVLIIESTYGTHIHEKREEREARFTGTVHDIVNRGGRCLIPVFA 251

Query: 238 AGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            GR  ELLLIL++YW  H    + PIY+ + ++   +   ++++  M D I K    S  
Sbjct: 252 LGRAQELLLILDEYWQNHPELHDIPIYYASQLAKKCMSVFQTYVNAMNDKIKKQIAIS-- 309

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F+ KH++ L +  + D+   GP +V+AS   +++G S ++F +W +D +N V+     
Sbjct: 310 NPFVFKHISNLKSIDQFDDI--GPSVVMASPGMMQSGLSRELFEQWCTDRRNGVIIAGYC 367

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVP 382
             GTLA+ L ++  P+ V+    +++P
Sbjct: 368 VEGTLAKNLMSE--PEEVQTMSGQKIP 392


>gi|406694795|gb|EKC98117.1| cleavage and polyadenylation specificity factor subunit
           [Trichosporon asahii var. asahii CBS 8904]
          Length = 958

 Score =  155 bits (391), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 156/573 (27%), Positives = 251/573 (43%), Gaps = 91/573 (15%)

Query: 5   VQVTPLSG----VFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           + +TPLS     V  + P+SY + +D    L+D G  D +  S  Q   +    I     
Sbjct: 2   ITLTPLSSSATSVSPDEPVSYFLELDDARILLDMGQRD-YRASAQQTSWEYEEKI----- 55

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRR-------QVSEFD- 112
               T +LG   YA    GL  PV++T+P   +G +    +  S R       +  EF  
Sbjct: 56  -RDPTQYLGLYAYARAHWGLKCPVYATQPTVEMGRVVSLAEAESWRAECPVSDEEGEFKG 114

Query: 113 --LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKI-TKDGEDV 169
             + T ++I  AF  +  + Y+Q  HL G+   +++ P  +GH+LGGT++KI +     V
Sbjct: 115 PFVPTTEEIHEAFDHIKAIRYNQPLHLGGELSHLLLTPFPSGHVLGGTLFKIRSPTSGTV 174

Query: 170 IYAVDYNRRKEKHLNGTVL---------ESFVRPAVLITDAYNALHNQPPRQQREM-FQD 219
           +YAV  N   E+HL+G V          E   RP +LI +   +      R++RE    D
Sbjct: 175 LYAVGINHTGERHLDGMVTGQGGLQGYAEDIRRPDLLIVEGGRSNAVNAKRRERETAILD 234

Query: 220 AISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA----------EHSLNYPIYFLTYVSS 269
            ++ TL  G +VL+P D++ R+LELL++L+ +W+              N+P+  ++  + 
Sbjct: 235 LVTATLAGGRSVLMPCDASPRLLELLVLLDQHWSFKRTAAPGGPAAQWNHPLCLVSRTAQ 294

Query: 270 STIDYVKSFLEWMG--------DSITKSFETSRDN-------------AFLLKHVTLLIN 308
             + + +S LEWMG        D +  + +  +               A    HV     
Sbjct: 295 DMVSFARSLLEWMGGVVRESGADDVVAALDRRKGRKRKALVNLGSEYGALDFSHVQFFAT 354

Query: 309 KSE-LDNAP-DGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML-- 364
             E L+  P + PKLVLA   ++  G S  +F   AS   N+VL T  G+  TLAR L  
Sbjct: 355 PEELLEKYPANRPKLVLAIPPTMSHGPSRTLFASMASVTGNVVLLTGHGEDRTLARELYA 414

Query: 365 ------------------QADPPPKAVKVTMSRRVPLVGEELIAYE-EEQTRLKKEEALK 405
                              A P    +++ +  + PL GEEL AYE  E+ + ++E A +
Sbjct: 415 RWEAHQDEGAHYGHGKIGHATPMEGRLELELDAKEPLSGEELEAYETAEREKREREAAHQ 474

Query: 406 ASLVK-----EEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVP 460
           A+L +     E +   S    ++ +GD   +    ANA A   E       DI + G   
Sbjct: 475 AALERNNRMLEADDLESDSDSDSEAGDLAGLHQEGANAFAGDGEDARTMSFDIFVKGQSV 534

Query: 461 PSTSVAPMFPFYENNSEWDDFGEVINPDDYIIK 493
              +   MFP+     + D FGE ++   +I K
Sbjct: 535 LRGTRFRMFPYIAKGRKVDSFGEGLDVGQWIRK 567



 Score = 44.3 bits (103), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 31/97 (31%), Positives = 48/97 (49%), Gaps = 20/97 (20%)

Query: 617 PISTPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAG-GALRC---------GEYVTIRK 666
           P S P     S+ +GDL++  LK  L + GI  +FAG G L C         G  V +RK
Sbjct: 867 PPSGPLTLPSSLFIGDLRLLALKNRLGTLGIPAQFAGEGVLVCGPGVEPGAKGSIVAVRK 926

Query: 667 VGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQF 703
           +    ++G      ++V+EGP+   Y+ +R  LY  +
Sbjct: 927 L----EEG------RVVLEGPVSGTYFAVRRELYGSY 953


>gi|242013971|ref|XP_002427672.1| Endoribonuclease YSH1, putative [Pediculus humanus corporis]
 gi|212512102|gb|EEB14934.1| Endoribonuclease YSH1, putative [Pediculus humanus corporis]
          Length = 572

 Score =  155 bits (391), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 95/311 (30%), Positives = 157/311 (50%), Gaps = 11/311 (3%)

Query: 43  SLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-Q 101
           S + P   + + ID V++SH    H GALPY  + +G + P++ T P   +  + + D +
Sbjct: 26  SFISPEGPITNFIDCVIISHFHLDHCGALPYLTEMVGYNGPIYMTHPTKAISPILLEDMR 85

Query: 102 YLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWK 161
            +S  +  E + FT   I    + V  +T  Q+  +  + E   +  + AGH+LG  ++ 
Sbjct: 86  KISVEKKGEVNFFTSQMIKDCMKKVITVTLHQSIMVDSQLE---IKAYYAGHVLGAAMFW 142

Query: 162 ITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDA 220
           I      V+Y  DYN   ++HL    ++   RP +LIT++  A   +  ++ RE  F   
Sbjct: 143 IRVGNLSVVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDFLKK 201

Query: 221 ISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLE 280
           + + +  GG VL+PV + GR  EL ++LE YW   +L  PIYF   ++    +Y K F+ 
Sbjct: 202 VHECIEKGGKVLIPVFALGRAQELCILLETYWERMNLKVPIYFAVGLTEKANNYYKMFIT 261

Query: 281 WMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVE 340
           W    I K+F   + N F  KH+    ++S +D A   P +V A+   L AG S  IF +
Sbjct: 262 WTNQKIRKTF--VQRNMFDFKHIKPF-DRSYIDQA--WPMVVFATPGMLHAGLSLQIFKK 316

Query: 341 WASDVKNLVLF 351
           WA +  N+V+ 
Sbjct: 317 WAPNENNMVIM 327


>gi|156042700|ref|XP_001587907.1| hypothetical protein SS1G_11148 [Sclerotinia sclerotiorum 1980]
 gi|154695534|gb|EDN95272.1| hypothetical protein SS1G_11148 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 936

 Score =  155 bits (391), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 162/595 (27%), Positives = 243/595 (40%), Gaps = 144/595 (24%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G   LID GW++ FD   L+ L K   T+  +LL+H    H+ A  +  K   L    PV
Sbjct: 26  GVKVLIDVGWDETFDVEKLRELEKQIPTLSLILLTHATVPHIAAYAHCCKHFPLFTRIPV 85

Query: 85  FSTEPVYRLGLLTMYDQYLSRR-------QVSEFDLF--TLDDIDSAFQSVTRLTYSQNY 135
           ++T PV  LG   + D Y S           S F L   T ++I+  F  V  L YSQ +
Sbjct: 86  YATHPVIALGRTLLQDLYSSTPLASTVIPTTSSFLLQPPTKEEINYYFSLVRPLKYSQPH 145

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEK------------HL 183
                  G+ +  + AGH LGGT+W I    E ++YAVD+N+ +E               
Sbjct: 146 Q---PLNGVTITAYNAGHSLGGTIWHIQHGLESIVYAVDWNQARENVLAGAAWLGGAGAG 202

Query: 184 NGTVLESFVRPAVLITDAYNALHNQPP--RQQR-EMFQDAISKTLRAGGNVLLPVDSAGR 240
              V+E   +P  LI  +        P  R +R E+  D I  +++ GG VL+P DS  R
Sbjct: 203 GAEVIEQLRKPTALICSSKGGERVALPGGRAKRDELLLDMIKSSIKRGGIVLIPTDSGAR 262

Query: 241 VLELLLILEDYW-----AEHSL--NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET- 292
           ++EL  +LE  W      E S   +   Y     S  T+ Y +S  EWM ++I + FE  
Sbjct: 263 MMELAYLLEHAWRTGNQEEESAFRSAKPYLAVSTSEMTMRYTRSMFEWMDEAIIREFEAQ 322

Query: 293 -------------------SRDNA--FLLKHVTLLINKSELD---NAPDG-----PKLVL 323
                              S+ NA  F  KH+ LL  K ++D   N  D       K++L
Sbjct: 323 PGHEEQQTGQQRRHAYSDESKQNAGPFEFKHLRLLGRKGQIDRMLNETDNLGRSVGKVIL 382

Query: 324 ASMASLEAGFSHDIFVEWASDVKNLVLFTER-----GQFGTLARML-------------- 364
           AS  S+E GFS ++  + A D KNL++ TE+     G  G L R L              
Sbjct: 383 ASDTSIEWGFSKEVLRKIADDDKNLLILTEKLNRIDGVTG-LGRTLWSWWEERRNGVATE 441

Query: 365 ---------QADPPPKAVKVTMSRRVPLVGEELIAYEE---EQTRLKKE------EALKA 406
                    Q     + +++   +R+PL G +L  Y++    Q +L+         AL+A
Sbjct: 442 PSSNGGNLEQVYGGGRDLEIREPKRIPLEGNDLTVYQQWLATQRQLQNTLQPGGATALEA 501

Query: 407 S-----------------LVKEEESK-----ASLGPDNN----LSGDPMVIDANNANASA 440
           S                    E++ K     A++G  N     LS + + I+        
Sbjct: 502 SADIVDDASSDSSSDSDDSETEQQGKALNISATMGQANRKKIGLSDEDLGINILLRKKGV 561

Query: 441 DVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKDE 495
              +  G + RD               MFP        DDFGE+I P +++  +E
Sbjct: 562 HDFDVRGKKGRD--------------KMFPMAIRRKRNDDFGELIRPGEFLRAEE 602


>gi|156403103|ref|XP_001639929.1| predicted protein [Nematostella vectensis]
 gi|156227060|gb|EDO47866.1| predicted protein [Nematostella vectensis]
          Length = 527

 Score =  155 bits (391), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 103/345 (29%), Positives = 170/345 (49%), Gaps = 18/345 (5%)

Query: 31  LIDCG----WNDHF---DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAP 83
           ++DCG    +ND     D   +    K+   +D VL+SH    H GALPY  + +G   P
Sbjct: 1   MLDCGMHMGYNDERRFPDFDYITRSGKLTEHLDCVLISHFHLDHCGALPYFSEMVGYDGP 60

Query: 84  VFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGE 142
           ++ T P   +  + + D + ++  +  E + FT   I    + V  +   Q+  +  + E
Sbjct: 61  IYMTHPTKAICPILLEDYRKITVERKGETNFFTSQMIKDCMKKVVPINLHQSIKVDDELE 120

Query: 143 GIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAY 202
              +  + AGH+LG  ++ +    E V+Y  DYN   ++HL    ++   RP +LIT++ 
Sbjct: 121 ---IKAYYAGHVLGAVMFHMRVGTESVVYTGDYNMTPDRHLGSAWIDK-CRPDILITEST 176

Query: 203 NALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPI 261
            A   +  ++ RE  F   + +T+  GG VL+PV + GR  EL ++LE YW   +L  PI
Sbjct: 177 YATTIRDSKRCRERDFLKKVHETMEKGGKVLIPVFALGRAQELCILLETYWERMNLKAPI 236

Query: 262 YFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKL 321
           YF T ++     Y K F+ W    I  +F   + N F  +H+    ++S +DN   GP +
Sbjct: 237 YFSTGLTEKANHYYKLFITWTNQKIKNTF--VQRNMFEFEHIKPF-DRSYIDNP--GPMV 291

Query: 322 VLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQA 366
           V A+   L AG S  IF +WAS+  N+V+       GT+   + A
Sbjct: 292 VFATPGMLHAGLSLQIFKKWASNENNMVVIPGYCVAGTVGHKVLA 336


>gi|326508058|dbj|BAJ86772.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 704

 Score =  154 bits (390), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 110/386 (28%), Positives = 179/386 (46%), Gaps = 42/386 (10%)

Query: 2   GTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCG------------WNDHFDPSLLQPLS 49
           G  + VTPL            ++  G   L DCG            + D  DPS      
Sbjct: 33  GDQMVVTPLGAGSEVGRSCVHMTFKGRTVLFDCGIHPAYSGMAALPYFDEIDPS------ 86

Query: 50  KVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRR 106
                ID +L++H    H  +LPY +++      VF   +T+ +YRL    +   Y+   
Sbjct: 87  ----AIDVLLVTHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYRL----LLSDYVKVS 138

Query: 107 QVSEFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKD 165
           +VS  D LF   DI  +   +  + + Q   ++G    I    + AGH+LG  ++ +   
Sbjct: 139 KVSVEDMLFDEQDIIRSMDKIEVIDFHQTLEVNG----IRFWCYTAGHVLGAAMFMVDIA 194

Query: 166 GEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTL 225
           G  + Y  DY+R +++HL    +  F     +I   Y    +QP   + + F DAI  T+
Sbjct: 195 GVRIRYTGDYSREEDRHLKAAEIPQFSPDICIIESTYGVQQHQPRHVREKRFTDAIHNTV 254

Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMG 283
             GG VL+P  + GR  ELLLIL++YW+ H      PIY+ + ++   +   ++++  M 
Sbjct: 255 SQGGRVLIPAFALGRAQELLLILDEYWSNHPELHKIPIYYASPLAKKCMAVYQTYINSMN 314

Query: 284 DSITKSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWA 342
           + I   F  S  N F  KH+  L   + +DN  D GP +V+AS  SL++G S  +F +W 
Sbjct: 315 ERIRNQFAQS--NPFHFKHIDPL---NSIDNFHDVGPSVVMASPGSLQSGLSRQLFDKWC 369

Query: 343 SDVKNLVLFTERGQFGTLARMLQADP 368
           +D KN  +       G+LA+ +  +P
Sbjct: 370 TDKKNTCVIPGYAVEGSLAKTIINEP 395


>gi|374253821|ref|NP_001243389.1| integrator complex subunit 11 isoform 3 [Homo sapiens]
 gi|194386866|dbj|BAG59799.1| unnamed protein product [Homo sapiens]
          Length = 571

 Score =  154 bits (390), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 99/330 (30%), Positives = 165/330 (50%), Gaps = 18/330 (5%)

Query: 31  LIDCGWNDHF-------DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAP 83
           ++DCG +  F       D S +    ++   +D V++SH    H GALPY  + +G   P
Sbjct: 1   MLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHFHLDHCGALPYFSEMVGYDGP 60

Query: 84  VFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGE 142
           ++ T P   +  + + D + ++  +  E + FT   I    + V  +   Q   +  + E
Sbjct: 61  IYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDCMKKVVAVHLHQTVQVDDELE 120

Query: 143 GIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAY 202
              +  + AGH+LG  +++I    E V+Y  DYN   ++HL    ++   RP +LIT++ 
Sbjct: 121 ---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CRPNLLITEST 176

Query: 203 NALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPI 261
            A   +  ++ RE  F   + +T+  GG VL+PV + GR  EL ++LE +W   +L  PI
Sbjct: 177 YATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPI 236

Query: 262 YFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKL 321
           YF T ++     Y K F+ W    I K+F   + N F  KH+    +++  DN   GP +
Sbjct: 237 YFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQRNMFEFKHIKAF-DRAFADNP--GPMV 291

Query: 322 VLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 292 VFATPGMLHAGQSLQIFRKWAGNEKNMVIM 321


>gi|323307973|gb|EGA61229.1| Ysh1p [Saccharomyces cerevisiae FostersO]
          Length = 727

 Score =  154 bits (390), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 104/371 (28%), Positives = 181/371 (48%), Gaps = 23/371 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGL-----LTMYDQYLS 104
           S +D +L+SH    H  +LPY M++      VF T P   +YR  L     +T      S
Sbjct: 25  SKVDILLISHFHLDHAASLPYVMQRTNFQGRVFMTHPTKAIYRWLLRDFVRVTSIGSSSS 84

Query: 105 RRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
                +  LF+ +D+  +F  +  +    +YH +    GI      AGH+LG  +++I  
Sbjct: 85  SMGTKDEGLFSDEDLVDSFDKIETV----DYHSTVDVNGIKFTAFHAGHVLGAAMFQIEI 140

Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
            G  V++  DY+R  ++HLN   +       +++   +    ++P   +       I  T
Sbjct: 141 AGLRVLFTGDYSREVDRHLNSAEVPPLSSNVLIVESTFGTATHEPRLNRERKLTQLIHST 200

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHS-----LNYPIYFLTYVSSSTIDYVKSFL 279
           +  GG VLLPV + GR  E++LIL++YW++H+        PI++ + ++   +   ++++
Sbjct: 201 VMRGGRVLLPVFALGRAQEIMLILDEYWSQHADELGGGQVPIFYASNLAKKCMSVFQTYV 260

Query: 280 EWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFV 339
             M D I K F  S+ N F+ K+++ L N  +  +   GP ++LAS   L++G S D+  
Sbjct: 261 NMMNDDIXKKFRDSQTNPFIFKNISYLRNLEDFQDF--GPSVMLASPGMLQSGLSRDLLE 318

Query: 340 EWASDVKNLVLFTERGQFGTLAR--MLQADPPPKAV--KVTMSRRVPLVGEELIAYEEEQ 395
            W  + KNLVL T     GT+A+  ML+ D  P     ++T+ RR  +      A+ + Q
Sbjct: 319 RWCPEDKNLVLITGYSIEGTMAKFIMLEPDTIPSINNPEITIPRRCQVEEISFAAHVDFQ 378

Query: 396 TRLKKEEALKA 406
             L+  E + A
Sbjct: 379 ENLEFIEKISA 389


>gi|255718827|ref|XP_002555694.1| KLTH0G15202p [Lachancea thermotolerans]
 gi|238937078|emb|CAR25257.1| KLTH0G15202p [Lachancea thermotolerans CBS 6340]
          Length = 755

 Score =  154 bits (390), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 93/328 (28%), Positives = 167/328 (50%), Gaps = 19/328 (5%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLSRRQVS 109
           ST+D +L+SH    H  +LPY M++      VF T P   +YR  LL+ + +  S    S
Sbjct: 63  STVDVLLISHFHLDHAASLPYVMQRTNFRGRVFMTHPTKAIYRW-LLSDFVKVTSIGSTS 121

Query: 110 EFD----LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKD 165
             D    L+T +D+  +F  +  +    ++H +    GI      AGH+LG  ++++   
Sbjct: 122 FSDKDENLYTDEDLAESFDRIETI----DFHSTIDVNGIKFVAFHAGHVLGAAMFQVEIA 177

Query: 166 GEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTL 225
           G  +++  DY+R  ++HLN   +       +++   +    ++P   + +     I  T+
Sbjct: 178 GLKILFTGDYSRETDRHLNSAEVPPSSSDVLIVESTFGTATHEPRINREKKLTQLIHSTV 237

Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWAEHS-----LNYPIYFLTYVSSSTIDYVKSFLE 280
             GG VLLPV + GR  E++LIL++YW++H+        P+++ + ++   +   ++++ 
Sbjct: 238 MRGGRVLLPVFALGRAQEIMLILDEYWSQHADELGNGQVPVFYASNLAKKCMSVFQTYVN 297

Query: 281 WMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVE 340
            M D I K F  S+ N F+ K+++ L N  E  +   GP ++LAS   L+ G S D+  +
Sbjct: 298 MMNDDIRKKFRDSQSNPFIFKNISYLKNLDEFQDF--GPSVMLASPGMLQNGLSRDLLEK 355

Query: 341 WASDVKNLVLFTERGQFGTLARMLQADP 368
           W    KNLVL T     GT+A+ +  +P
Sbjct: 356 WCPGEKNLVLITGYSVEGTMAKFIMLEP 383


>gi|426327394|ref|XP_004024503.1| PREDICTED: integrator complex subunit 11 isoform 3 [Gorilla gorilla
           gorilla]
          Length = 571

 Score =  154 bits (390), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 99/330 (30%), Positives = 165/330 (50%), Gaps = 18/330 (5%)

Query: 31  LIDCGWNDHF-------DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAP 83
           ++DCG +  F       D S +    ++   +D V++SH    H GALPY  + +G   P
Sbjct: 1   MLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHFHLDHCGALPYFSEMVGYDGP 60

Query: 84  VFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGE 142
           ++ T P   +  + + D + ++  +  E + FT   I    + V  +   Q   +  + E
Sbjct: 61  IYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDCMKKVVAVHLHQTVQVDDELE 120

Query: 143 GIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAY 202
              +  + AGH+LG  +++I    E V+Y  DYN   ++HL    ++   RP +LIT++ 
Sbjct: 121 ---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CRPNLLITEST 176

Query: 203 NALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPI 261
            A   +  ++ RE  F   + +T+  GG VL+PV + GR  EL ++LE +W   +L  PI
Sbjct: 177 YATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPI 236

Query: 262 YFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKL 321
           YF T ++     Y K F+ W    I K+F   + N F  KH+    +++  DN   GP +
Sbjct: 237 YFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQRNMFEFKHIKAF-DRAFADNP--GPMV 291

Query: 322 VLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 292 VFATPGMLHAGQSLQIFRKWAGNEKNMVIM 321


>gi|242032211|ref|XP_002463500.1| hypothetical protein SORBIDRAFT_01g000850 [Sorghum bicolor]
 gi|241917354|gb|EER90498.1| hypothetical protein SORBIDRAFT_01g000850 [Sorghum bicolor]
          Length = 695

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 108/386 (27%), Positives = 182/386 (47%), Gaps = 42/386 (10%)

Query: 2   GTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCG------------WNDHFDPSLLQPLS 49
           G  + +TPL            ++  G   L DCG            + D  DPS      
Sbjct: 25  GDQMVITPLGAGSEVGRSCVHMTFKGRTVLFDCGIHPAYSGMAALPYFDEIDPS------ 78

Query: 50  KVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRR 106
               TID +L++H    H  +LPY +++      VF   +T+ +YRL    +   Y+   
Sbjct: 79  ----TIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYRL----LLSDYVKVS 130

Query: 107 QVSEFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKD 165
           +VS  D L+  +DI  + + +  + + Q   ++G    I    + AGH+LG  ++ +   
Sbjct: 131 KVSVEDMLYDENDIARSMEKIEVIDFHQTLEVNG----IRFWCYTAGHVLGAAMFMVDIA 186

Query: 166 GEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTL 225
           G  ++Y  DY+R +++HL    L  F     +I   Y    +QP   + + F + I  T+
Sbjct: 187 GVRILYTGDYSREEDRHLRAAELPQFSPDICIIESTYGVQQHQPRIVREKRFTEVIHNTV 246

Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMG 283
             GG VL+P  + GR  ELLLIL++YW++H      PIY+ + ++   +   ++++  M 
Sbjct: 247 SQGGRVLIPAFALGRAQELLLILDEYWSKHPELHKIPIYYASPLAKRCMAVYQTYINSMN 306

Query: 284 DSITKSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWA 342
           + I   F  S  N F  KH+  L   + +DN  D GP +V+AS   L++G S  +F +W 
Sbjct: 307 ERIRNQFAQS--NPFHFKHIESL---NSIDNFHDVGPSVVMASPGGLQSGLSRQLFDKWC 361

Query: 343 SDVKNLVLFTERGQFGTLARMLQADP 368
           +D KN  +       GTLA+ +  +P
Sbjct: 362 TDKKNACVIPGYVVEGTLAKTIINEP 387


>gi|349579985|dbj|GAA25146.1| K7_Ysh1p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 779

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 104/371 (28%), Positives = 181/371 (48%), Gaps = 23/371 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGL-----LTMYDQYLS 104
           S +D +L+SH    H  +LPY M++      VF T P   +YR  L     +T      S
Sbjct: 59  SKVDILLISHFHLDHAASLPYVMQRTNFQGKVFMTHPTKAIYRWLLRDFVRVTSIGSSSS 118

Query: 105 RRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
                +  LF+ +D+  +F  +  +    +YH +    GI      AGH+LG  +++I  
Sbjct: 119 SMGTKDEGLFSDEDLVDSFDKIETV----DYHSTVDVNGIKFTAFHAGHVLGAAMFQIEI 174

Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
            G  V++  DY+R  ++HLN   +       +++   +    ++P   +       I  T
Sbjct: 175 AGLRVLFTGDYSREVDRHLNSAEVPPLSSNVLIVESTFGTATHEPRLNRERKLTQLIHST 234

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHS-----LNYPIYFLTYVSSSTIDYVKSFL 279
           +  GG VLLPV + GR  E++LIL++YW++H+        PI++ + ++   +   ++++
Sbjct: 235 VMRGGRVLLPVFALGRAQEIMLILDEYWSQHADELGGGQVPIFYASNLAKKCMSVFQTYV 294

Query: 280 EWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFV 339
             M D I K F  S+ N F+ K+++ L N  +  +   GP ++LAS   L++G S D+  
Sbjct: 295 NMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF--GPSVMLASPGMLQSGLSRDLLE 352

Query: 340 EWASDVKNLVLFTERGQFGTLAR--MLQADPPPKAV--KVTMSRRVPLVGEELIAYEEEQ 395
            W  + KNLVL T     GT+A+  ML+ D  P     ++T+ RR  +      A+ + Q
Sbjct: 353 RWCPEDKNLVLITGYSIEGTMAKFIMLEPDTIPSINNPEITIPRRCQVEEISFAAHVDFQ 412

Query: 396 TRLKKEEALKA 406
             L+  E + A
Sbjct: 413 ENLEFIEKISA 423


>gi|254567914|ref|XP_002491067.1| hypothetical protein [Komagataella pastoris GS115]
 gi|238030864|emb|CAY68787.1| hypothetical protein PAS_chr2-1_0816 [Komagataella pastoris GS115]
 gi|328352406|emb|CCA38805.1| Cleavage and polyadenylation specificity factor subunit 2
           [Komagataella pastoris CBS 7435]
          Length = 854

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 147/540 (27%), Positives = 235/540 (43%), Gaps = 74/540 (13%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G N   D  W+   D   L  L K+   I+ +LLSHP    +G   Y +++  +  + P+
Sbjct: 26  GINIFADPSWDGVAD---LSYLDKIIPQINVILLSHPTADFIGGFVYLLQKYPVLKTLPI 82

Query: 85  FSTEPVYRLGLLTMYDQYLSRRQVS--EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGE 142
           +ST P+  LG ++  + Y ++  V   E  +    DID  F S+  L YSQ+  L+G  +
Sbjct: 83  YSTYPITNLGKVSTTELYRAKGLVGPLEGSIMEKSDIDECFDSIIPLKYSQSTPLTGIAQ 142

Query: 143 GIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN-GTVLES-------FVRP 194
           G+ V P+ AGH LGGT W I  + E ++YA  +N  K+  LN  T L+S        V+P
Sbjct: 143 GLSVTPYNAGHSLGGTFWSINYNNEKIVYAPAWNHSKDSFLNSATFLQSNGHPIPQLVKP 202

Query: 195 AVLIT--DAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 252
           A +IT  D  ++L      ++ E F   +  T+   G V LP   +GR LELL +++ + 
Sbjct: 203 ASVITGSDLGSSLSYN---KKLEKFFTLVDATIAQNGTVFLPTSMSGRFLELLHLMDQHL 259

Query: 253 AEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
               +  P+  + +  S ++    + LEWM   I K +E   +  F    V  L++  +L
Sbjct: 260 GNQPI--PVLLVAFTGSKSLSLAGNMLEWMSPKIIKDWEERNETPFDPSRVQ-LVDVDDL 316

Query: 313 DNAPDGPKLVLASMASLEAG-FSHDIFVEWASDVKNLVLFTER---GQFG---------- 358
              P G K+V  + A L  G  +H        D KN ++FTER     FG          
Sbjct: 317 VQLP-GAKVVFTADADLTIGSTAHSTLASICIDEKNTIIFTERPTNSSFGASIYEIWEKL 375

Query: 359 TLARMLQAD---PPPKAVKVTMSRRV--PLVGEELIAYEEEQTRLKKEEALKASLVKEEE 413
           TL R  + +   P P    +T SR     L G EL  Y E     K+E+  K  + K   
Sbjct: 376 TLERNGKLEDGFPVPFEKLLTFSRVTLKKLTGLELAQYTEIVNERKQEKRKKRQVEKMNT 435

Query: 414 S---KASLGPDNNLSG-DPMVIDA------------------------NNANASADVVEP 445
           +     S+  +  +S  DP  + A                           N +   V  
Sbjct: 436 TILADKSIDINKPISEFDPAAVKALEEDEDEDEEEDKEDIGVEETANDERGNTTTTAVAS 495

Query: 446 HGGRYRDIL---IDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKDEDMDQAAM 502
              + +DI    +D  V  +     +FP++    E DD+G  I+  D++ +D+  + + M
Sbjct: 496 TKKQEKDIYKIPLDFDVRNAKGRNRLFPYHSRIQETDDYGIKIDHSDFVKEDKSEEFSRM 555


>gi|224140921|ref|XP_002323825.1| predicted protein [Populus trichocarpa]
 gi|222866827|gb|EEF03958.1| predicted protein [Populus trichocarpa]
          Length = 696

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 121/400 (30%), Positives = 194/400 (48%), Gaps = 42/400 (10%)

Query: 2   GTSVQVTPLSGVFNENPLSYL-VSIDGFNFLIDCG------------WNDHFDPSLLQPL 48
           G  + +TPL G  NE   S + +S  G   L DCG            + D  DPS     
Sbjct: 22  GDQLTLTPL-GAGNEVGRSCVYMSFKGKTVLFDCGIHLAYSGMAALPYFDEIDPS----- 75

Query: 49  SKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSR 105
                TID +L++H    H  +LPY +++      VF   +T+ +++L LLT Y + +S+
Sbjct: 76  -----TIDVLLVTHFHLDHAASLPYFLEKTTFRGRVFMTHATKAIFKL-LLTNYVK-VSK 128

Query: 106 RQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKD 165
             V +  LF   DI+ +   +  + + Q   ++G    I    + AGH+LG  ++ +   
Sbjct: 129 VSVEDM-LFDEKDINRSMDKIEVIDFHQTVDVNG----IKFWCYTAGHVLGAAMFMVDIA 183

Query: 166 GEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTL 225
           G  V+Y  DY+R +++HL    +  F     +I   Y    +QP   + + F D I  T+
Sbjct: 184 GVRVLYTGDYSREEDRHLCAAEMPQFSPDICIIESTYGVQLHQPRHLREKRFTDVIHSTI 243

Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMG 283
             GG VL+P  + GR  ELLLIL++YW+ H    N PIY+ + ++   +   ++++  M 
Sbjct: 244 SLGGRVLIPAFALGRAQELLLILDEYWSNHPELHNIPIYYASPLAKKCMTVYQTYILSMN 303

Query: 284 DSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWAS 343
           + I   F  S  N F  KH++ L N  E D +  GP +V+AS   L++G S  +F  W S
Sbjct: 304 ERIRNQFANS--NPFKFKHISPL-NSIE-DFSDVGPSVVMASPGGLQSGLSRQLFDMWCS 359

Query: 344 DVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
           D KN  +       GTLA+ +  +  PK V++      PL
Sbjct: 360 DKKNACVIPGYVVEGTLAKTIINE--PKEVQLMNGLTAPL 397


>gi|323336337|gb|EGA77605.1| Ysh1p [Saccharomyces cerevisiae Vin13]
          Length = 745

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 104/371 (28%), Positives = 181/371 (48%), Gaps = 23/371 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGL-----LTMYDQYLS 104
           S +D +L+SH    H  +LPY M++      VF T P   +YR  L     +T      S
Sbjct: 25  SKVDILLISHFHLDHAASLPYVMQRTNFQGRVFMTHPTKAIYRWLLRDFVRVTSIGSSSS 84

Query: 105 RRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
                +  LF+ +D+  +F  +  +    +YH +    GI      AGH+LG  +++I  
Sbjct: 85  SMGTKDEGLFSDEDLVDSFDKIETV----DYHSTVDVNGIKFTAFHAGHVLGAAMFQIEI 140

Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
            G  V++  DY+R  ++HLN   +       +++   +    ++P   +       I  T
Sbjct: 141 AGLRVLFTGDYSREVDRHLNSAEVPPLSSNVLIVESTFGTATHEPRLNRERKLTQLIHST 200

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL-----NYPIYFLTYVSSSTIDYVKSFL 279
           +  GG VLLPV + GR  E++LIL++YW++H+        PI++ + ++   +   ++++
Sbjct: 201 VMRGGRVLLPVFALGRAQEIMLILDEYWSQHADELGGGQVPIFYASNLAKKCMSVFQTYV 260

Query: 280 EWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFV 339
             M D I K F  S+ N F+ K+++ L N  +  +   GP ++LAS   L++G S D+  
Sbjct: 261 NMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF--GPSVMLASPGMLQSGLSRDLLE 318

Query: 340 EWASDVKNLVLFTERGQFGTLAR--MLQADPPPKAV--KVTMSRRVPLVGEELIAYEEEQ 395
            W  + KNLVL T     GT+A+  ML+ D  P     ++T+ RR  +      A+ + Q
Sbjct: 319 RWCPEDKNLVLITGYSIEGTMAKFIMLEPDTIPSINNPEITIPRRCQVEEISFAAHVDFQ 378

Query: 396 TRLKKEEALKA 406
             L+  E + A
Sbjct: 379 ENLEFIEKISA 389


>gi|323303815|gb|EGA57598.1| Ysh1p [Saccharomyces cerevisiae FostersB]
          Length = 727

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 104/371 (28%), Positives = 181/371 (48%), Gaps = 23/371 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGL-----LTMYDQYLS 104
           S +D +L+SH    H  +LPY M++      VF T P   +YR  L     +T      S
Sbjct: 25  SKVDILLISHFHLDHAASLPYVMQRTNFQGRVFMTHPTKAIYRWLLRDFVRVTSIGSSSS 84

Query: 105 RRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
                +  LF+ +D+  +F  +  +    +YH +    GI      AGH+LG  +++I  
Sbjct: 85  SMGTKDEGLFSDEDLVDSFDKIETV----DYHSTVDVNGIKFTAFHAGHVLGAAMFQIEI 140

Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
            G  V++  DY+R  ++HLN   +       +++   +    ++P   +       I  T
Sbjct: 141 AGLRVLFTGDYSREVDRHLNSAEVPPLSSNVLIVESTFGTATHEPRLNRERKLTQLIHST 200

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL-----NYPIYFLTYVSSSTIDYVKSFL 279
           +  GG VLLPV + GR  E++LIL++YW++H+        PI++ + ++   +   ++++
Sbjct: 201 VMRGGRVLLPVFALGRAQEIMLILDEYWSQHADELGGGQVPIFYASNLAKKCMSVFQTYV 260

Query: 280 EWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFV 339
             M D I K F  S+ N F+ K+++ L N  +  +   GP ++LAS   L++G S D+  
Sbjct: 261 NMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF--GPSVMLASPGMLQSGLSRDLLE 318

Query: 340 EWASDVKNLVLFTERGQFGTLAR--MLQADPPPKAV--KVTMSRRVPLVGEELIAYEEEQ 395
            W  + KNLVL T     GT+A+  ML+ D  P     ++T+ RR  +      A+ + Q
Sbjct: 319 RWCPEDKNLVLITGYSIEGTMAKFIMLEPDTIPSINNPEITIPRRCQVEEISFAAHVDFQ 378

Query: 396 TRLKKEEALKA 406
             L+  E + A
Sbjct: 379 ENLEFIEKISA 389


>gi|358385845|gb|EHK23441.1| hypothetical protein TRIVIDRAFT_37526 [Trichoderma virens Gv29-8]
          Length = 957

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 127/428 (29%), Positives = 191/428 (44%), Gaps = 86/428 (20%)

Query: 9   PLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTL 66
           PL G  +E+  S  L+ +DG    L+D GW++ F    L+ L K   T+  +LL+H    
Sbjct: 6   PLQGALSESLASQSLLELDGGVKVLVDLGWDESFSSDKLEELEKQVPTLSLILLTHATVS 65

Query: 67  HLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLSR-------RQVSEFDLF--- 114
           HL A  +  K + L    PV++T PV  LG     D Y S        RQ S  +     
Sbjct: 66  HLAAYAHCCKNIALFTRIPVYATRPVIDLGRTLTQDLYSSTPAAATTIRQSSLSETAYAY 125

Query: 115 ---------------TLDDIDSAFQSVTRLTYSQNYH-----LSGKGEGIVVAPHVAGHL 154
                          T ++I   F  +  L YSQ +       S    G+ +  + +GH 
Sbjct: 126 SQTVTTAQNLLLQSPTPEEIARYFSLIQPLKYSQPHQPLSSPFSPPLNGLTITAYNSGHT 185

Query: 155 LGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAY 202
           LGGT+W I    E ++YAVD+N+ +E    G             V+E   +P  LI  + 
Sbjct: 186 LGGTIWHIQHGLESIVYAVDWNQARENVFAGAAWLGGAGGGGAEVIEQLRKPTALICSSR 245

Query: 203 NALHNQPP--RQQR-----EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEH 255
            A  N     R +R     EM +  +S+    GG VL+PVDS+ RVLE+  +LE  W   
Sbjct: 246 GADKNAQAGGRAKRDEHLIEMIKTCVSR----GGTVLIPVDSSARVLEISYLLEYAWRTD 301

Query: 256 SLNY-------PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET-------------SRD 295
           + N         +Y      SST+ Y +S LEWM ++I + FE               ++
Sbjct: 302 AANKDGVLKYSKLYLAGRNVSSTMRYARSMLEWMDNNIVQEFEAFAEGQRKVNGGNEKKE 361

Query: 296 NA-FLLKHVTLLINKSE--------LDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVK 346
            A F  K++ LL  K++        ++N     +++LAS  S++ GFS D+    A D +
Sbjct: 362 GAPFDFKYLRLLERKAQITKLLSQNIENGETQGRVILASDVSMDWGFSKDLVKGLAKDSR 421

Query: 347 NLVLFTER 354
           NLV+ TER
Sbjct: 422 NLVILTER 429


>gi|401885166|gb|EJT49292.1| cleavage and polyadenylation specificity factor subunit
           [Trichosporon asahii var. asahii CBS 2479]
          Length = 958

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 156/573 (27%), Positives = 251/573 (43%), Gaps = 91/573 (15%)

Query: 5   VQVTPLSG----VFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           + +TPLS     V  + P+SY + +D    L+D G  D +  S  Q   +    I     
Sbjct: 2   ITLTPLSSSATSVSPDEPVSYFLELDDARILLDMGQRD-YRASAQQTSWEYEEKI----- 55

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRR-------QVSEFD- 112
               T +LG   YA    GL  PV++T+P   +G +    +  S R       +  EF  
Sbjct: 56  -RDPTQYLGLYAYARAHWGLKCPVYATQPTVEMGRVVSLAEAESWRAECPVSDEEGEFKG 114

Query: 113 --LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKI-TKDGEDV 169
             + T ++I  AF  +  + Y+Q  HL G+   +++ P  +GH+LGGT++KI +     V
Sbjct: 115 PFVPTTEEIHEAFDHIKAIRYNQPLHLGGELSHLLLTPFPSGHVLGGTLFKIRSPTSGTV 174

Query: 170 IYAVDYNRRKEKHLNGTVL---------ESFVRPAVLITDAYNALHNQPPRQQREM-FQD 219
           +YAV  N   E+HL+G V          E   RP +LI +   +      R++RE    D
Sbjct: 175 LYAVGINHTGERHLDGMVTGQGGLQGYAEDIRRPDLLIVEGGRSNAVNAKRRERETAILD 234

Query: 220 AISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA----------EHSLNYPIYFLTYVSS 269
            ++ TL  G +VL+P D++ R+LELL++L+ +W+              N+P+  ++  + 
Sbjct: 235 LVTATLAGGRSVLMPCDASPRLLELLVLLDQHWSFKRTAAPGGPAAQWNHPLCLVSRTAQ 294

Query: 270 STIDYVKSFLEWMG--------DSITKSFETSRDN-------------AFLLKHVTLLIN 308
             + + +S LEWMG        D +  + +  +               A    HV     
Sbjct: 295 DMVSFARSLLEWMGGVVRESGADDVVAALDRRKGRKRKALVNLGSEYGALDFSHVQFFAT 354

Query: 309 KSE-LDNAP-DGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML-- 364
             E L+  P + PKLVLA   ++  G S  +F   AS   N+VL T  G+  TLAR L  
Sbjct: 355 PEELLEKYPANRPKLVLAIPPTMSHGPSRTLFASMASVPGNVVLLTGHGEDRTLARELYA 414

Query: 365 ------------------QADPPPKAVKVTMSRRVPLVGEELIAYE-EEQTRLKKEEALK 405
                              A P    +++ +  + PL GEEL AYE  E+ + ++E A +
Sbjct: 415 RWEAHQDEGAHYGHGKIGHATPMEGRLELELDAKEPLSGEELEAYETAEREKREREAAHQ 474

Query: 406 ASLVK-----EEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVP 460
           A+L +     E +   S    ++ +GD   +    ANA A   E       DI + G   
Sbjct: 475 AALERNNRMLEADDLESDSDSDSEAGDLAGLHQEGANAFAGDGEDARTMSFDIFVKGQSV 534

Query: 461 PSTSVAPMFPFYENNSEWDDFGEVINPDDYIIK 493
              +   MFP+     + D FGE ++   +I K
Sbjct: 535 LRGTRFRMFPYIAKGRKVDSFGEGLDVGQWIRK 567



 Score = 44.3 bits (103), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 31/97 (31%), Positives = 48/97 (49%), Gaps = 20/97 (20%)

Query: 617 PISTPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAG-GALRC---------GEYVTIRK 666
           P S P     S+ +GDL++  LK  L + GI  +FAG G L C         G  V +RK
Sbjct: 867 PPSGPLTLPSSLFIGDLRLLALKNRLGTLGIPAQFAGEGVLVCGPGVEPGAKGSIVAVRK 926

Query: 667 VGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQF 703
           +    ++G      ++V+EGP+   Y+ +R  LY  +
Sbjct: 927 L----EEG------RVVLEGPVSGTYFAVRRELYGSY 953


>gi|6323307|ref|NP_013379.1| Ysh1p [Saccharomyces cerevisiae S288c]
 gi|74644951|sp|Q06224.1|YSH1_YEAST RecName: Full=Endoribonuclease YSH1; AltName: Full=Yeast 73 kDa
           homolog 1; AltName: Full=mRNA 3'-end-processing protein
           YSH1
 gi|577190|gb|AAB67367.1| Ysh1p: subunit of polyadenylation factor I (PF I) [Saccharomyces
           cerevisiae]
 gi|151940984|gb|EDN59365.1| cleavage factor II (CF II) component [Saccharomyces cerevisiae
           YJM789]
 gi|190405336|gb|EDV08603.1| hypothetical protein SCRG_04228 [Saccharomyces cerevisiae RM11-1a]
 gi|256269831|gb|EEU05091.1| Ysh1p [Saccharomyces cerevisiae JAY291]
 gi|285813694|tpg|DAA09590.1| TPA: Ysh1p [Saccharomyces cerevisiae S288c]
 gi|323332373|gb|EGA73782.1| Ysh1p [Saccharomyces cerevisiae AWRI796]
          Length = 779

 Score =  154 bits (388), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 104/371 (28%), Positives = 181/371 (48%), Gaps = 23/371 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGL-----LTMYDQYLS 104
           S +D +L+SH    H  +LPY M++      VF T P   +YR  L     +T      S
Sbjct: 59  SKVDILLISHFHLDHAASLPYVMQRTNFQGRVFMTHPTKAIYRWLLRDFVRVTSIGSSSS 118

Query: 105 RRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
                +  LF+ +D+  +F  +  +    +YH +    GI      AGH+LG  +++I  
Sbjct: 119 SMGTKDEGLFSDEDLVDSFDKIETV----DYHSTVDVNGIKFTAFHAGHVLGAAMFQIEI 174

Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
            G  V++  DY+R  ++HLN   +       +++   +    ++P   +       I  T
Sbjct: 175 AGLRVLFTGDYSREVDRHLNSAEVPPLSSNVLIVESTFGTATHEPRLNRERKLTQLIHST 234

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHS-----LNYPIYFLTYVSSSTIDYVKSFL 279
           +  GG VLLPV + GR  E++LIL++YW++H+        PI++ + ++   +   ++++
Sbjct: 235 VMRGGRVLLPVFALGRAQEIMLILDEYWSQHADELGGGQVPIFYASNLAKKCMSVFQTYV 294

Query: 280 EWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFV 339
             M D I K F  S+ N F+ K+++ L N  +  +   GP ++LAS   L++G S D+  
Sbjct: 295 NMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF--GPSVMLASPGMLQSGLSRDLLE 352

Query: 340 EWASDVKNLVLFTERGQFGTLAR--MLQADPPPKAV--KVTMSRRVPLVGEELIAYEEEQ 395
            W  + KNLVL T     GT+A+  ML+ D  P     ++T+ RR  +      A+ + Q
Sbjct: 353 RWCPEDKNLVLITGYSIEGTMAKFIMLEPDTIPSINNPEITIPRRCQVEEISFAAHVDFQ 412

Query: 396 TRLKKEEALKA 406
             L+  E + A
Sbjct: 413 ENLEFIEKISA 423


>gi|259148260|emb|CAY81507.1| Ysh1p [Saccharomyces cerevisiae EC1118]
          Length = 779

 Score =  154 bits (388), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 104/371 (28%), Positives = 181/371 (48%), Gaps = 23/371 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGL-----LTMYDQYLS 104
           S +D +L+SH    H  +LPY M++      VF T P   +YR  L     +T      S
Sbjct: 59  SKVDILLISHFHLDHAASLPYVMQRTNFEGRVFMTHPTKAIYRWLLRDFVRVTSIGSSSS 118

Query: 105 RRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
                +  LF+ +D+  +F  +  +    +YH +    GI      AGH+LG  +++I  
Sbjct: 119 SMGTKDEGLFSDEDLVDSFDKIETV----DYHSTVDVNGIKFTAFHAGHVLGAAMFQIEI 174

Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
            G  V++  DY+R  ++HLN   +       +++   +    ++P   +       I  T
Sbjct: 175 AGLRVLFTGDYSREVDRHLNSAEVPPLSSNVLIVESTFGTATHEPRLNRERKLTQLIHST 234

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHS-----LNYPIYFLTYVSSSTIDYVKSFL 279
           +  GG VLLPV + GR  E++LIL++YW++H+        PI++ + ++   +   ++++
Sbjct: 235 VMRGGRVLLPVFALGRAQEIMLILDEYWSQHADELGGGQVPIFYASNLAKKCMSVFQTYV 294

Query: 280 EWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFV 339
             M D I K F  S+ N F+ K+++ L N  +  +   GP ++LAS   L++G S D+  
Sbjct: 295 NMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF--GPSVMLASPGMLQSGLSRDLLE 352

Query: 340 EWASDVKNLVLFTERGQFGTLAR--MLQADPPPKAV--KVTMSRRVPLVGEELIAYEEEQ 395
            W  + KNLVL T     GT+A+  ML+ D  P     ++T+ RR  +      A+ + Q
Sbjct: 353 RWCPEDKNLVLITGYSIEGTMAKFIMLEPDTIPSINNPEITIPRRCQVEEISFAAHVDFQ 412

Query: 396 TRLKKEEALKA 406
             L+  E + A
Sbjct: 413 ENLEFIEKISA 423


>gi|320170221|gb|EFW47120.1| integrator complex subunit 11 [Capsaspora owczarzaki ATCC 30864]
          Length = 661

 Score =  154 bits (388), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 123/460 (26%), Positives = 214/460 (46%), Gaps = 29/460 (6%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
           ++V PL    +      LVSI G N + DCG    +ND   F D + ++        ID 
Sbjct: 3   IRVRPLGAGQDVGRSCLLVSIGGKNIMFDCGMHMGYNDARRFPDFASIKRTGPYTDVIDC 62

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GA+ +  +  G   P++ T P   +  + + D + L+  +  E + FT 
Sbjct: 63  VIVSHFHLDHCGAIVHFSEVCGYDGPIYMTHPTKAICPILLEDYRKLTVERKGETNFFTS 122

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
            +I +  + V  +   ++  +  + E   +  + AGH+LG  ++ +    E V+Y  D+N
Sbjct: 123 ANIKACMKKVIAVNLHESVRVDDEIE---IKAYYAGHVLGAAMFHVRVGSESVVYTGDFN 179

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   I + +  GG VL+PV
Sbjct: 180 MTPDRHLGAAWIDR-CRPDLLITESTYATTIRDSKRNREGEFLRKIHECVEQGGKVLIPV 238

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL +++E YW    L  P+YF   +++   +Y K F+ W    I ++F     
Sbjct: 239 FALGRAQELCILVETYWERLGLTVPVYFSAGLTAKANNYYKLFITWTNQKIKRTF--VER 296

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+    +++ LDN   GP ++ A+   L AG S D F +WA + KN+V+     
Sbjct: 297 NMFEFKHIKPF-DRAFLDNP--GPMVLFATPGMLHAGMSLDAFRKWAPNDKNMVILPGYC 353

Query: 356 QFGT-----LARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQ---TRLKKEEALKAS 407
             GT     LA   Q + P +A +  +  R+ +      A+ + +     ++  E     
Sbjct: 354 VAGTVGNKVLAGHKQIEMPDRA-RTVIDVRLSVQNLSFSAHADAKGIVQLIRHAEPRNVM 412

Query: 408 LVKEEESKASLGPDNNLS--GDPMVIDANNANASADVVEP 445
           LV  E++K +      +S  G P    AN A  + +   P
Sbjct: 413 LVHGEKAKMAFLKAKIISEIGIPCFDPANGATVTIETAHP 452


>gi|322700762|gb|EFY92515.1| cleavage and polyadenylylation specificity factor, putative
           [Metarhizium acridum CQMa 102]
          Length = 960

 Score =  154 bits (388), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 129/426 (30%), Positives = 187/426 (43%), Gaps = 80/426 (18%)

Query: 9   PLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTL 66
           PL G  +E+  S  L+ +DG    L+  GW++ FD   L+ L K   T+  +LL+H    
Sbjct: 6   PLQGALSESTASQSLLELDGGVKVLVGLGWDETFDVRKLEELEKQVPTLSLILLTHATAS 65

Query: 67  HLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLSR-------RQVSEFDLF--- 114
           HL A  +  K   L    P ++T PV  LG   + D Y S        RQ S  ++    
Sbjct: 66  HLAAYVHCCKNFPLFTRIPAYATRPVIDLGRSLIQDLYSSTPAASTTIRQSSLSEIAYAY 125

Query: 115 ---------------TLDDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGHL 154
                          T D I   F  +  L YSQ +            G+ +  + +GH 
Sbjct: 126 TQTAATAQNLLLQSPTPDQIARYFSLIQPLKYSQPHQPLPSPFSPPLNGLTITAYNSGHT 185

Query: 155 LGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAY 202
           LGGT+W I    E ++YAVD+N+ +E    G             V+E   +P  LI  + 
Sbjct: 186 LGGTIWHIQHGLESIVYAVDWNQARENVFAGAAWLGGAGGGGAEVIEQLRKPTALICSSR 245

Query: 203 NALHNQPP--RQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--- 256
            A  +     R +R E   + I   +  GG VL+PVDS+ RVLEL  +LE  W   +   
Sbjct: 246 GAQKSAQTAGRAKRDEQLLEMIKTCVTKGGTVLIPVDSSARVLELSYLLEHAWRADAASD 305

Query: 257 ---LNYP-IYFLTYVSSSTIDYVKSFLEWMGDSITKSFET-------------SRDNA-F 298
              LN   +Y      SST+ Y +S LEWM D+I + FE               +D   F
Sbjct: 306 NGVLNSAKLYLAGRNMSSTMRYARSMLEWMDDNIVQEFEAFAEGQRKANGTVEKKDGGPF 365

Query: 299 LLKHVTLLINKSELDNAPD----------GPKLVLASMASLEAGFSHDIFVEWASDVKNL 348
             K++ LL  K+++    D            +++LAS AS+E GFS D+  E A D  NL
Sbjct: 366 DFKYLRLLERKAQVSKLLDQVASAQGEAAKGRVILASDASMEWGFSKDVLRELAKDPNNL 425

Query: 349 VLFTER 354
           V+ T+R
Sbjct: 426 VILTDR 431


>gi|242786013|ref|XP_002480717.1| cleavage and polyadenylylation specificity factor, putative
           [Talaromyces stipitatus ATCC 10500]
 gi|218720864|gb|EED20283.1| cleavage and polyadenylylation specificity factor, putative
           [Talaromyces stipitatus ATCC 10500]
          Length = 1017

 Score =  154 bits (388), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 142/515 (27%), Positives = 215/515 (41%), Gaps = 128/515 (24%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G   L+D GW++ FD   L  L K   T+  +LL+H    H+GA  +  K   L    PV
Sbjct: 27  GIKILVDVGWDETFDVLELAELEKHIPTLSLILLTHATISHIGAFAHCCKTFPLFTQIPV 86

Query: 85  FSTEPVYRLGLLTMYDQYLS---------RRQVSE------------------------- 110
           ++T PV  LG   + D Y S         +  +SE                         
Sbjct: 87  YATGPVISLGRTLLQDMYTSAPLAATFLPKVSISEPGASTSAASAAAATVSTEGDGRSSS 146

Query: 111 ---------FDLFTLDDIDSAFQSVTRLTYSQNYH-----LSGKGEGIVVAPHVAGHLLG 156
                        + ++I   F  +  L YSQ +       S   +G+ +  + AGH +G
Sbjct: 147 MLATTGRILLQPPSAEEIARYFSLIHPLKYSQPHSPLCSPFSPPLDGLTLTAYSAGHTVG 206

Query: 157 GTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAYNA 204
           GT+W I    E ++YAVD+N+ +E  + G             V+E   +P  LI  +   
Sbjct: 207 GTIWHIQHGMESIVYAVDWNQARENVVAGAAWFGGSGTSGTEVIEQLRKPTALICSSKGG 266

Query: 205 LHNQPPR--QQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW--AEHSLN- 258
               PP   Q+R+ +  D I  +L  GG+VL+P D++ RVLEL   LE  W  A  S N 
Sbjct: 267 DKFAPPGGLQKRDALLFDMIRSSLAKGGSVLIPTDTSARVLELSYALEHAWRDAADSSNG 326

Query: 259 ------YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE--------------------- 291
                   IY     + ST+   +S LEWM + I + FE                     
Sbjct: 327 EDVFKKAEIYLAGKKAHSTMRLARSMLEWMDEGIVREFEAVEGGDAAAARGHKRTDSQSR 386

Query: 292 ---TSRDNA------FLLKHVTLLINKSELDNA-PDG-PKLVLASMASLEAGFSHDIFVE 340
              +SRDN       F LKH+ ++  K +L+    DG PK+++AS  SL+ G+S + F  
Sbjct: 387 TTGSSRDNKATKLGPFTLKHLKIVEQKRKLEKILGDGIPKVIIASDTSLDWGYSKETFRT 446

Query: 341 WASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKK 400
            A D +NL++ TE     TL    Q D P +  K+T+ R +         YEE +  +  
Sbjct: 447 LAEDSQNLIILTE-----TLPSRYQTDDPEQPDKMTLGRMI------WHWYEERKDGVAM 495

Query: 401 EEALKASLVKEEES-----------KASLGPDNNL 424
           E A    L+++  S           +A+L PD  +
Sbjct: 496 ETASSGELLEQIHSGGREITLVDVERAALDPDEQV 530


>gi|359486187|ref|XP_002271646.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-I-like [Vitis vinifera]
          Length = 693

 Score =  154 bits (388), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 114/387 (29%), Positives = 184/387 (47%), Gaps = 44/387 (11%)

Query: 2   GTSVQVTPLSGVFNENPLSYL-VSIDGFNFLIDCG------------WNDHFDPSLLQPL 48
           G  + +TPL G  NE   S + +S  G   L DCG            + D  DPS     
Sbjct: 20  GDQLIITPL-GAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPS----- 73

Query: 49  SKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSR 105
                TID +L++H    H  +LPY +++      VF   +T+ +Y+L    +   Y+  
Sbjct: 74  -----TIDVLLVTHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKL----LLSDYVKV 124

Query: 106 RQVSEFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
            +VS  D L+   DI  +   +  + + Q   ++G    I    + AGH+LG  ++ +  
Sbjct: 125 SKVSVEDMLYDEQDILRSMDKIEVIDFHQTLEVNG----IRFWCYTAGHVLGAAMFMVDI 180

Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
            G  V+Y  DY+R +++HL    +  F     +I   Y    +QP   + + F D I  T
Sbjct: 181 AGVRVLYTGDYSREEDRHLRAAEIPQFCPDICIIESTYGVQLHQPRHVREKRFTDVIHST 240

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWM 282
           +  GG VL+P  + GR  ELLLIL++YW+ H    N PIY+ + ++   +   ++++  M
Sbjct: 241 ISQGGRVLIPAYALGRAQELLLILDEYWSNHPELHNVPIYYASPLAKRCMAVYQTYINSM 300

Query: 283 GDSITKSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEW 341
            + I   F  S  N F  KH++ L     ++N  D GP +V+AS   L++G S  +F  W
Sbjct: 301 NERIRNQFANS--NPFDFKHISPL---KSIENFNDVGPSVVMASPGGLQSGLSRQLFDMW 355

Query: 342 ASDVKNLVLFTERGQFGTLARMLQADP 368
            SD KN  +       GTLA+ +  +P
Sbjct: 356 CSDKKNACVIPGYVVGGTLAKTIINEP 382


>gi|326503296|dbj|BAJ99273.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 693

 Score =  154 bits (388), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 109/386 (28%), Positives = 179/386 (46%), Gaps = 42/386 (10%)

Query: 2   GTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCG------------WNDHFDPSLLQPLS 49
           G  + VTPL            +S  G   L DCG            + D  DPS      
Sbjct: 21  GDHMVVTPLGAGGEVGRSCVHMSFKGRTVLFDCGIHPAYSGMAALPYFDEIDPS------ 74

Query: 50  KVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRR 106
                ID +L++H    H  +LPY +++      VF   +T+ +YRL    +   Y+   
Sbjct: 75  ----AIDVLLVTHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYRL----LLSDYVKVS 126

Query: 107 QVSEFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKD 165
           +VS  D LF   D+  +   +  + + Q   ++G    I    + AGH+LG  ++ +   
Sbjct: 127 KVSVEDMLFDEQDVIRSMDKIEVIDFHQTLEVNG----IRFWCYTAGHVLGAAMFMVDIA 182

Query: 166 GEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTL 225
           G  ++Y  DY+R +++HL    +  F     +I   Y    +QP   + + F DAI  T+
Sbjct: 183 GVRILYTGDYSREEDRHLKAAEVPQFSPDICIIESTYGVQQHQPRHVREKRFTDAIHNTV 242

Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMG 283
             GG VL+P  + GR  ELLLIL++YW+ H      PIY+ + ++   +   ++++  M 
Sbjct: 243 SQGGRVLIPAYALGRAQELLLILDEYWSNHPELHKIPIYYASPLAKKCMAVYQTYINSMN 302

Query: 284 DSITKSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWA 342
           + I   F  S  N F  KH+  L   + +DN  D GP +V+AS  SL++G S  +F +W 
Sbjct: 303 ERIRNQFAQS--NPFHFKHIEPL---NSIDNFHDVGPSVVMASPGSLQSGLSRQLFDKWC 357

Query: 343 SDVKNLVLFTERGQFGTLARMLQADP 368
           +D KN  +       G+L + +  +P
Sbjct: 358 TDKKNTCVIPGFAVEGSLVKTIINEP 383


>gi|291233360|ref|XP_002736621.1| PREDICTED: cleavage and polyadenylation specific factor 3,
           73kDa-like [Saccoglossus kowalevskii]
          Length = 715

 Score =  154 bits (388), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 107/378 (28%), Positives = 193/378 (51%), Gaps = 38/378 (10%)

Query: 22  LVSIDGFNFLIDCGWND---------HFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALP 72
           ++   G   ++DCG +          +FD  L++P       ID +L+SH    H GALP
Sbjct: 36  MLEFKGKKIMLDCGIHPGLSGMDALPYFD--LIEP-----DEIDLLLISHFHLDHCGALP 88

Query: 73  YAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVS-EFDLFTLDDIDSAFQSVTR 128
           + +++      VF   +T+ +YR  L      Y+    +S E  L+T +D++++   +  
Sbjct: 89  WFLQKTNFQGRVFMTHATKAIYRWLL----SDYVKVSNISTEQMLYTDNDLENSMDRIET 144

Query: 129 LTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVL 188
           +    ++H+  +  G+    + AGH+LG  ++ I   G  ++Y  D++R++++HL    L
Sbjct: 145 I----DFHVETEVLGVKFWCYNAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEL 200

Query: 189 ESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLI 247
            S VRP VLI ++    H    R++RE  F   +   +  GG  L+PV + GR  ELLLI
Sbjct: 201 PS-VRPDVLIIESTYGTHIHEKREEREARFTGTVHDIVNRGGRCLIPVFALGRAQELLLI 259

Query: 248 LEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTL 305
           L++YWA H    + PIY+ + ++   +   ++++  M D I +    S  N F+ KH++ 
Sbjct: 260 LDEYWANHPELHDIPIYYASSLAKKCMSVYQTYINAMNDKIKRQITIS--NPFVFKHISN 317

Query: 306 LINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQ 365
           L      D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + 
Sbjct: 318 LRGMDHFDDI--GPSVVMASPGMMQSGLSRELFESWCTDRRNGVIIAGYCVEGTLAKHIL 375

Query: 366 ADPPPKAVKVTMSRRVPL 383
           +   P+ V     +++PL
Sbjct: 376 SQ--PEEVTTMSGQKLPL 391


>gi|167525469|ref|XP_001747069.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774364|gb|EDQ87993.1| predicted protein [Monosiga brevicollis MX1]
          Length = 730

 Score =  154 bits (388), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 105/366 (28%), Positives = 178/366 (48%), Gaps = 19/366 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQP-LSKVAST-----IDAV 58
           ++V PL    +      LV++ G   + DCG +  ++ +   P  ++VA       ID  
Sbjct: 10  IRVVPLGAGQDVGRSCVLVTMGGRTIMFDCGMHMGYNDARRFPDFTQVAQGPLTDHIDLA 69

Query: 59  LLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLG--LLTMYDQYLSRRQVSEFDLFTL 116
           +++H    H GALPY  +Q+G   P++ T P   +   LL  Y +    RQ  E + FT 
Sbjct: 70  IITHFHLDHCGALPYFTEQVGYDGPLYMTMPTRAIAQVLLEDYRKIAVSRQ-GEKNFFTR 128

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
           DDI +     T +   Q   +    E   +  + AGH+LG  ++ +    + V+Y  DYN
Sbjct: 129 DDIKTCLNKATTIDLHQTVVIDQDFE---IKAYYAGHVLGAAMFYVRVGNQSVVYTGDYN 185

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++    P V+I+++  A   +  R+ RE      I++ ++ GG VLLPV
Sbjct: 186 MSPDRHLGAAWIDR-CEPDVIISESTYATTIRDSRRAREHDLLTKITQCVQRGGKVLLPV 244

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W    +  PIYF T +++   +Y K F+ W    + ++F     
Sbjct: 245 FALGRAQELCILLETHWQRTGMRVPIYFSTGLTARANEYYKLFITWTNQKLKETF--VER 302

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +HV    ++S L++A  GP+++ A+   L AG S   F  W  D +N+V+     
Sbjct: 303 NLFDFQHVQPF-DRSYLEHA--GPQVLFATPGMLHAGTSLLAFTHWCEDPRNMVILPGYC 359

Query: 356 QFGTLA 361
             GT+ 
Sbjct: 360 TAGTVG 365


>gi|346327110|gb|EGX96706.1| cleavage and polyadenylylation specificity factor, putative
           [Cordyceps militaris CM01]
          Length = 1024

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 126/432 (29%), Positives = 187/432 (43%), Gaps = 78/432 (18%)

Query: 1   MGTSVQVTPLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAV 58
           + T     PL G  +E+  S  L+ +DG    L+D GW++ FD + L+ L K   T+  +
Sbjct: 32  IATMFTFCPLQGAQSESLASQSLLELDGGVKVLVDLGWDESFDVAKLEELEKQVPTLSLI 91

Query: 59  LLSHPDTLHLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLS------------ 104
           LL+H    H+ A  +  K + L    PV++T PV  LG     D Y S            
Sbjct: 92  LLTHATASHIAAYVHCCKNIPLFTRIPVYATRPVIDLGRTLTQDLYSSTPAAATTVPPAA 151

Query: 105 ---------RRQVSEFDLF----TLDDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVV 146
                    +   +  +L     T DDI   F  +  L YSQ +            G+ +
Sbjct: 152 LSASAYAYTQAATTTQNLLLQSPTPDDIARYFSLIQPLKYSQPHQPLPSPFSPPLNGLTI 211

Query: 147 APHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEK------------HLNGTVLESFVRP 194
             + AGH LGGT+W I    E ++YAVD+N+ +E                  V+E   +P
Sbjct: 212 TAYNAGHTLGGTIWHIQHGLESIVYAVDWNQARENVFAGAAWLGGAGGGGAQVIEQLRKP 271

Query: 195 AVLITDAYNALHNQPP--RQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDY 251
             LI  +  A  N     R +R E   + I   +  GG VL+PVDS+ RVLEL  +LE  
Sbjct: 272 TALICSSRGAERNAQAGGRAKRDEQLLETIKAAVARGGTVLIPVDSSARVLELAYLLEHA 331

Query: 252 WAEHSLNYP-------IYFLTYVSSSTIDYVKSFLEWMGDSITKSFET-----SRDNAFL 299
           W   S +         +Y      +ST+ Y +S LEWM D I + FE       R N   
Sbjct: 332 WRTDSASAAGVFKAAKLYLAGRNMASTMRYARSMLEWMDDGIVQEFEAFAEGQKRTNGAS 391

Query: 300 LKHV---------TLLINKSEL--------DNAPDGPKLVLASMASLEAGFSHDIFVEWA 342
            K V          LL  K+++        +N     +++LAS  S++ GFS D+    A
Sbjct: 392 DKKVGGPLDFRFMRLLDRKAQIAKLLSTAVNNGESKGRVILASDTSMDWGFSKDLLRGLA 451

Query: 343 SDVKNLVLFTER 354
           SD  N+V+ T++
Sbjct: 452 SDPNNVVILTDK 463


>gi|303310723|ref|XP_003065373.1| hypothetical protein CPC735_045980 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240105035|gb|EER23228.1| hypothetical protein CPC735_045980 [Coccidioides posadasii C735
           delta SOWgp]
          Length = 1026

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 125/441 (28%), Positives = 184/441 (41%), Gaps = 114/441 (25%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSA--PV 84
           G   LID GW++ FDPS L+ L K   T+  +LL+H    H+GA  Y  K   L A  PV
Sbjct: 27  GVKILIDVGWDETFDPSALKELEKHIPTLSLILLTHATPSHIGAFVYCCKTFPLFAQIPV 86

Query: 85  FSTEPVYRLGLLTMYDQYLSRRQVSEF-------------------------------DL 113
           ++T PV   G   + D Y S    S F                               D 
Sbjct: 87  YATYPVISFGRSLLQDLYSSAPLASTFLPTTSSISDSNGSNSLPTQDPTAPAGALTEGDT 146

Query: 114 F-------------TLDDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGHLL 155
                         T +DI   F  +  L YSQ +            G+ +  + AGH +
Sbjct: 147 LNSTTAGKILLPSPTSEDIARHFSLIHPLKYSQPHQPLPSPFSPPLNGLTITAYNAGHTV 206

Query: 156 GGTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAYN 203
           GGT+W I    E ++YAVD+N+ +E  + G             V+E   +P  L+  A  
Sbjct: 207 GGTIWHIQHGMESIVYAVDWNQARENVIAGAAWFGSSGANRTDVIEQLRKPTALVCSAKG 266

Query: 204 ALHNQP--PRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW-------- 252
                P   R++R ++  D I   +   G VLLP D++ RVLEL  +LE  W        
Sbjct: 267 GDKFAPGGGRKKRDDLLLDMIRSCIAKKGTVLLPTDTSARVLELAYVLEHAWREAADGPD 326

Query: 253 AEHSL-NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE-------------------- 291
            E+SL N  +Y       ST+   +S LEWM +SI + FE                    
Sbjct: 327 GENSLKNATLYLAGKKVHSTMRLARSMLEWMDESIVREFEGGDGGESLGAGRSSGAASGQ 386

Query: 292 ---------TSRDNA--------FLLKHVTLLINKSELDN--APDGPKLVLASMASLEAG 332
                    + + +A        F  +H+ ++  K++L+N    +GPK+++AS ASL+ G
Sbjct: 387 QSKGTPGQTSDKKSAGPHKGLGPFTFRHLKIIERKTKLENILRSEGPKVIIASDASLDWG 446

Query: 333 FSHDIFVEWASDVKNLVLFTE 353
           FS +I    A   +NLV+ TE
Sbjct: 447 FSKEILRHVAQGAENLVILTE 467


>gi|320034772|gb|EFW16715.1| cleavage and polyadenylylation specificity factor [Coccidioides
           posadasii str. Silveira]
          Length = 1026

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 126/441 (28%), Positives = 183/441 (41%), Gaps = 114/441 (25%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSA--PV 84
           G   LID GW++ FDPS L+ L K   T+  +LL+H    H+GA  Y  K   L A  PV
Sbjct: 27  GVKILIDVGWDETFDPSALKELEKHIPTLSLILLTHATPSHIGAFVYCCKTFPLFAQIPV 86

Query: 85  FSTEPVYRLGLLTMYDQYLSRRQVSEF-------------------------------DL 113
           ++T PV   G   + D Y S    S F                               D 
Sbjct: 87  YATYPVISFGRSLLQDLYSSAPLASTFLPTTSSISDSNGSNSLPTQDPTAPAGALTEGDT 146

Query: 114 F-------------TLDDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGHLL 155
                         T +DI   F  +  L YSQ +            G+ +  + AGH +
Sbjct: 147 LNSTTAGKILLPSPTSEDIARHFSLIHPLKYSQPHQPLPSPFSPPLNGLTITAYNAGHTV 206

Query: 156 GGTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAYN 203
           GGT+W I    E ++YAVD+N+ +E  + G             V+E   +P  L+  A  
Sbjct: 207 GGTIWHIQHGMESIVYAVDWNQARENVIAGAAWFGSSGANRTDVIEQLRKPTALVCSAKG 266

Query: 204 ALHNQP--PRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW-------- 252
                P   R++R ++  D I   +   G VLLP D++ RVLEL  +LE  W        
Sbjct: 267 GDKFAPGGGRKKRDDLLLDMIRSCIAKKGTVLLPTDTSARVLELAYVLEHAWREAANGPD 326

Query: 253 AEHSL-NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE-------------------- 291
            E+SL N  +Y       ST+   +S LEWM +SI + FE                    
Sbjct: 327 GENSLKNATLYLAGKKVHSTMRLARSMLEWMDESIVREFEGGDGGESLGAGRSSGAASGQ 386

Query: 292 --------TSRDNA---------FLLKHVTLLINKSELDN--APDGPKLVLASMASLEAG 332
                   TS   +         F  +H+ ++  K++L+N    +GPK+++AS ASL+ G
Sbjct: 387 QSKGTPGQTSDKKSAGPHKGLGPFTFRHLKIIERKTKLENILRSEGPKVIIASDASLDWG 446

Query: 333 FSHDIFVEWASDVKNLVLFTE 353
           FS +I    A   +NLV+ TE
Sbjct: 447 FSKEILRHVAQGAENLVILTE 467


>gi|338722203|ref|XP_001496423.3| PREDICTED: integrator complex subunit 11 [Equus caballus]
          Length = 571

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 99/330 (30%), Positives = 165/330 (50%), Gaps = 18/330 (5%)

Query: 31  LIDCGWNDHF-------DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAP 83
           ++DCG +  F       D S +    ++   +D V++SH    H GALPY  + +G   P
Sbjct: 1   MLDCGMHMGFNDDRRFPDFSYITRNGRLTDFLDCVIISHFHLDHCGALPYFSEMVGYDGP 60

Query: 84  VFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGE 142
           ++ T P   +  + + D + ++  +  E + FT   I    + V  +   Q   +  + E
Sbjct: 61  IYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDCMKKVVAVHLHQTVQVDDELE 120

Query: 143 GIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAY 202
              +  + AGH+LG  +++I    E V+Y  DYN   ++HL    ++   RP +LIT++ 
Sbjct: 121 ---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CRPNLLITEST 176

Query: 203 NALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPI 261
            A   +  ++ RE  F   + +T+  GG VL+PV + GR  EL ++LE +W   +L  PI
Sbjct: 177 YATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLETFWERVNLKAPI 236

Query: 262 YFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKL 321
           YF T ++     Y K F+ W    I K+F   + N F  KH+    +++  DN   GP +
Sbjct: 237 YFSTGLTEKANHYYKLFITWTNQKIRKTF--VQRNMFEFKHIKAF-DRAFADNP--GPMV 291

Query: 322 VLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 292 VFATPGMLHAGQSLQIFRKWAGNEKNMVIM 321


>gi|359486185|ref|XP_003633408.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-I-like [Vitis vinifera]
          Length = 694

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 114/387 (29%), Positives = 185/387 (47%), Gaps = 44/387 (11%)

Query: 2   GTSVQVTPLSGVFNENPLSYL-VSIDGFNFLIDCG------------WNDHFDPSLLQPL 48
           G  + +TPL G  NE   S + +S  G   L DCG            + D  DPS     
Sbjct: 21  GDQLIITPL-GAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPS----- 74

Query: 49  SKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSR 105
                TID +L++H    H  +LPY +++      VF   +T+ +Y+L    +   Y+  
Sbjct: 75  -----TIDVLLVTHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKL----LLSDYVKV 125

Query: 106 RQVSEFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
            +VS  D L+   DI  +   +  + + Q   ++G    I    + AGH+LG  ++ +  
Sbjct: 126 SKVSVEDMLYDEQDILRSMDKIEVIDFHQTLEVNG----IRFWCYTAGHVLGAAMFMVDI 181

Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
            G  V+Y  DY+R +++HL    +  F     +I   Y    +QP   + + F D I  T
Sbjct: 182 AGVRVLYTGDYSREEDRHLRAAEIPQFSPDICIIESTYGVQLHQPRHVREKRFTDVIHST 241

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWM 282
           +  GG VL+P  + GR  ELLLIL++YW+ H    N PIY+ + ++   +   ++++  M
Sbjct: 242 ISQGGRVLIPAFALGRAQELLLILDEYWSNHPELHNIPIYYASPLAKRCMAVYQTYINSM 301

Query: 283 GDSITKSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEW 341
            + I   F  S  N F  KH++ L     ++N  D GP +V+AS + L++G S  +F  W
Sbjct: 302 NERIRNQFANS--NPFDFKHISPL---KSIENFNDVGPSVVMASPSGLQSGLSRQLFDMW 356

Query: 342 ASDVKNLVLFTERGQFGTLARMLQADP 368
            SD KN  +       GTLA+ +  +P
Sbjct: 357 CSDKKNACVIPGYVVEGTLAKTIINEP 383


>gi|116203607|ref|XP_001227614.1| hypothetical protein CHGG_09687 [Chaetomium globosum CBS 148.51]
 gi|88175815|gb|EAQ83283.1| hypothetical protein CHGG_09687 [Chaetomium globosum CBS 148.51]
          Length = 956

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 125/427 (29%), Positives = 191/427 (44%), Gaps = 81/427 (18%)

Query: 8   TPLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDT 65
           +PL G  +E+  S  L+ +DG    LID GW++ FD   L+ L K   T+  +LL+H   
Sbjct: 5   SPLQGALSESTASQSLLELDGGVKVLIDVGWDEAFDVEKLRELEKQIPTLSLILLTHATV 64

Query: 66  LHLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLSR------------------ 105
            HLGA  +  K   L    PV++T PV  LG     D Y S                   
Sbjct: 65  DHLGAYAHCCKNFPLFTRVPVYATRPVIDLGRTLTQDLYASTPVAATTISPTSLAEASYS 124

Query: 106 -RQVSEFDLFTL------DDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGH 153
             Q S  D   L      ++I   F  +  L YSQ +            G+ +  + +GH
Sbjct: 125 YAQTSSADHKLLLQPPTPEEIARYFSLIQPLKYSQPHQPLPSPFSPPLNGLTITAYNSGH 184

Query: 154 LLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT--------------VLESFVRPAVLIT 199
            LGGT+W I    E ++YAVD+N+ +E   +G               V+E   +P  L+ 
Sbjct: 185 TLGGTIWHIQHGLESIVYAVDWNQARENVFSGAAWLGGGHGGAGGAEVIEQLRKPTALVC 244

Query: 200 DAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW-AEHSLN 258
            +       P  ++ E   ++I   +  GG VL+PVDS+ RVLEL   LE  W AE + +
Sbjct: 245 SSRTPETALPRGRRDEQLLESIKLCIARGGTVLIPVDSSARVLELSYFLEHAWRAEIAKD 304

Query: 259 YPIYFLT--YVSSSTIDYV----KSFLEWMGDSITKSFET----------------SRDN 296
             ++  T  Y++  TI+      +S LEWM DSI + FE                     
Sbjct: 305 NEVFKSTKAYLAGRTINSTMRNARSMLEWMDDSIVREFEAVAGGQRGNGGSGGGKGKDAG 364

Query: 297 AFLLKHVTLLINKSELDNA---------PDGPKLVLASMASLEAGFSHDIFVEWASDVKN 347
            F  K++ LL  K++++           P G ++++A+ +SLE GFS ++    A D +N
Sbjct: 365 PFDFKYLRLLERKAQVERVLQQAADASEPKG-RVIVATDSSLEWGFSKEVMRAIAGDPRN 423

Query: 348 LVLFTER 354
           LV+ TE+
Sbjct: 424 LVILTEK 430


>gi|400602286|gb|EJP69888.1| RNA-metabolising metallo-beta-lactamase [Beauveria bassiana ARSEF
           2860]
          Length = 962

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 122/425 (28%), Positives = 185/425 (43%), Gaps = 78/425 (18%)

Query: 8   TPLSGVFNEN-PLSYLVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDT 65
           +PL G  +E+     L+ +DG    L+  GW++ FD + L+ L K   T+  +LL+H   
Sbjct: 5   SPLQGAQSESLATQSLLELDGGVKILVGLGWDESFDVAKLEELEKQVPTLSLILLTHATA 64

Query: 66  LHLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLS------------------- 104
            HL A  +  K + L    PV++T PV  LG     D Y S                   
Sbjct: 65  PHLAAYAHCCKNIPLFTRIPVYATRPVIDLGRTLTQDLYSSTPAAATTIPQAALSASAYA 124

Query: 105 --RRQVSEFDLF----TLDDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGH 153
             +   +  +L     T D+I   F  +  L YSQ +            G+ +  + AGH
Sbjct: 125 YAQTATTAQNLLLQSPTPDEIARFFSLIQPLKYSQPHQPLPSPFSPPLNGLTITAYNAGH 184

Query: 154 LLGGTVWKITKDGEDVIYAVDYNRRKEK------------HLNGTVLESFVRPAVLITDA 201
            LGGT+W I    E ++YAVD+N+ +E                  V+E   +P  LI  +
Sbjct: 185 TLGGTIWHIQHGLESIVYAVDWNQARENVFAGAAWLGGAGGGGAQVIEQLRKPTALICSS 244

Query: 202 YNALHNQPP--RQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLN 258
             A  N     R +R E   + I   +  GG VL+PVDS+ RVLEL  +LE  W   S +
Sbjct: 245 RGAERNAQAGGRAKRDEQLLETIKAAVARGGTVLIPVDSSARVLELAYLLEHAWRTDSAS 304

Query: 259 -------YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET-----SRDNA--------- 297
                    +Y      +ST+ Y +S LEWM DSI + FE       R N          
Sbjct: 305 ATGVLKAAKLYLAGRNMASTMRYARSMLEWMDDSIVQEFEAFAEGQKRTNGNSDKKVGGP 364

Query: 298 FLLKHVTLLINKSEL--------DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLV 349
           F  + + LL  K+++        +N     +++LAS   ++ GFS D+    ASD  N+V
Sbjct: 365 FDFRFMRLLDRKAQIAKLLTTAVNNGESRGRVILASDTCMDWGFSKDLLRGLASDANNVV 424

Query: 350 LFTER 354
           + T++
Sbjct: 425 ILTDK 429


>gi|297739590|emb|CBI29772.3| unnamed protein product [Vitis vinifera]
          Length = 680

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 114/387 (29%), Positives = 185/387 (47%), Gaps = 44/387 (11%)

Query: 2   GTSVQVTPLSGVFNENPLSYL-VSIDGFNFLIDCG------------WNDHFDPSLLQPL 48
           G  + +TPL G  NE   S + +S  G   L DCG            + D  DPS     
Sbjct: 21  GDQLIITPL-GAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPS----- 74

Query: 49  SKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSR 105
                TID +L++H    H  +LPY +++      VF   +T+ +Y+L    +   Y+  
Sbjct: 75  -----TIDVLLVTHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKL----LLSDYVKV 125

Query: 106 RQVSEFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
            +VS  D L+   DI  +   +  + + Q   ++G    I    + AGH+LG  ++ +  
Sbjct: 126 SKVSVEDMLYDEQDILRSMDKIEVIDFHQTLEVNG----IRFWCYTAGHVLGAAMFMVDI 181

Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
            G  V+Y  DY+R +++HL    +  F     +I   Y    +QP   + + F D I  T
Sbjct: 182 AGVRVLYTGDYSREEDRHLRAAEIPQFSPDICIIESTYGVQLHQPRHVREKRFTDVIHST 241

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWM 282
           +  GG VL+P  + GR  ELLLIL++YW+ H    N PIY+ + ++   +   ++++  M
Sbjct: 242 ISQGGRVLIPAFALGRAQELLLILDEYWSNHPELHNIPIYYASPLAKRCMAVYQTYINSM 301

Query: 283 GDSITKSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEW 341
            + I   F  S  N F  KH++ L     ++N  D GP +V+AS + L++G S  +F  W
Sbjct: 302 NERIRNQFANS--NPFDFKHISPL---KSIENFNDVGPSVVMASPSGLQSGLSRQLFDMW 356

Query: 342 ASDVKNLVLFTERGQFGTLARMLQADP 368
            SD KN  +       GTLA+ +  +P
Sbjct: 357 CSDKKNACVIPGYVVEGTLAKTIINEP 383


>gi|367005895|ref|XP_003687679.1| hypothetical protein TPHA_0K01110 [Tetrapisispora phaffii CBS 4417]
 gi|357525984|emb|CCE65245.1| hypothetical protein TPHA_0K01110 [Tetrapisispora phaffii CBS 4417]
          Length = 790

 Score =  153 bits (386), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 104/371 (28%), Positives = 186/371 (50%), Gaps = 24/371 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLS---RR 106
           ST+D +L+SH    H  +LPY M++   +  VF T P   +YR  LL  + +  S     
Sbjct: 59  STVDILLISHFHLDHAASLPYVMQRTNFNGRVFMTHPTKAIYRW-LLKDFVRVTSIGGSP 117

Query: 107 QVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG 166
              + +L+T +D+  +F  +  +    +YH +    GI      AGH+LG  +++I    
Sbjct: 118 NEKDDNLYTDEDLSESFDRIETI----DYHSTMDVNGIKFTAFHAGHVLGAAMFQIELGS 173

Query: 167 EDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLR 226
             V++  DY+R  ++HLN   +       +++   +    ++P   + +     I  T+ 
Sbjct: 174 LRVLFTGDYSRELDRHLNSAEIPPLASDVLIVESTFGTATHEPRLSREKKLTQLIHSTVT 233

Query: 227 AGGNVLLPVDSAGRVLELLLILEDYWA--EHSL---NYPIYFLTYVSSSTIDYVKSFLEW 281
            GG VL+PV + GR  EL+LIL++YW+  E  L     PIY+ + ++  ++   ++++  
Sbjct: 234 KGGRVLMPVFALGRAQELMLILDEYWSHNEEELGNGQVPIYYASNLAKRSMSVFQTYVNM 293

Query: 282 MGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVE 340
           M DSI K F  S+ N F+ K+++ L N   +D+  D GP ++LA+   L+ G S D+  +
Sbjct: 294 MNDSIRKKFRDSKTNPFIFKNISYLKN---IDSFQDFGPSVMLAAPGMLQNGLSRDLLEK 350

Query: 341 WASDVKNLVLFTERGQFGTLARMLQADPPP----KAVKVTMSRRVPLVGEELIAYEEEQT 396
           W  + KN+VL T     G++A+ L  +P         +V + RR  +      A+ + Q 
Sbjct: 351 WCPEPKNMVLITGYSVEGSMAKYLMLEPENIPSVNNPEVNIPRRCQVEEISFAAHVDFQE 410

Query: 397 RLKKEEALKAS 407
            +   E ++AS
Sbjct: 411 NIDFIEQIRAS 421


>gi|392297785|gb|EIW08884.1| Ysh1p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 772

 Score =  153 bits (386), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 102/366 (27%), Positives = 176/366 (48%), Gaps = 20/366 (5%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLSRRQVS 109
           S +D +L+SH    H  +LPY M++      VF T P   +YR  L              
Sbjct: 59  SKVDILLISHFHLDHAASLPYVMQRTNFQGRVFMTHPTKAIYRWLLRDFVXXXXXXXXXX 118

Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
              LF+ +D+  +F  +  +    +YH +    GI      AGH+LG  +++I   G  V
Sbjct: 119 --GLFSDEDLVDSFDKIETV----DYHSTVDVNGIKFTAFHAGHVLGAAMFQIEIAGLRV 172

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGG 229
           ++  DY+R  ++HLN   +       +++   +    ++P   +       I  T+  GG
Sbjct: 173 LFTGDYSREVDRHLNSAEVPPLSSNVLIVESTFGTATHEPRLNRERKLTQLIHSTVMRGG 232

Query: 230 NVLLPVDSAGRVLELLLILEDYWAEHS-----LNYPIYFLTYVSSSTIDYVKSFLEWMGD 284
            VLLPV + GR  E++LIL++YW+ H+        PI++ + ++   +   ++++  M D
Sbjct: 233 RVLLPVFALGRAQEIMLILDEYWSRHADELGGGQVPIFYASNLAKKCMSVFQTYVNMMND 292

Query: 285 SITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASD 344
            I K F  S+ N F+ K+++ L N  +  +   GP ++LAS   L++G S D+   W  +
Sbjct: 293 DIRKKFRDSQTNPFIFKNISYLRNLEDFQDF--GPSVMLASPGMLQSGLSRDLLERWCPE 350

Query: 345 VKNLVLFTERGQFGTLAR--MLQADPPPKAV--KVTMSRRVPLVGEELIAYEEEQTRLKK 400
            KNLVL T     GT+A+  ML+ D  P     ++T+ RR  +      A+ + Q  L+ 
Sbjct: 351 DKNLVLITGYSIEGTMAKFIMLEPDTIPSINNPEITIPRRCQVEEISFAAHVDFQENLEF 410

Query: 401 EEALKA 406
            E + A
Sbjct: 411 IEKISA 416


>gi|260942135|ref|XP_002615366.1| hypothetical protein CLUG_04248 [Clavispora lusitaniae ATCC 42720]
 gi|238850656|gb|EEQ40120.1| hypothetical protein CLUG_04248 [Clavispora lusitaniae ATCC 42720]
          Length = 940

 Score =  153 bits (386), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 153/543 (28%), Positives = 237/543 (43%), Gaps = 81/543 (14%)

Query: 30  FLIDCGWN-DHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMK--QLGLSAPVFS 86
            L D GWN ++ D  L          +  +  S P+ +  G +   MK   L  + PV++
Sbjct: 29  ILADPGWNGENPDDCLFMEKHLSDVDLLLLSQSTPEFIG-GYILLCMKFPSLMSAIPVYT 87

Query: 87  TEPVYRLGLLTMYDQYLSRRQVSEFD--LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGI 144
           T  + +LG ++  + Y SR  +         + D+D  F  +T + Y QN  ++     I
Sbjct: 88  TVAISQLGRVSTVEFYRSRGHLGPLQSAFMEVSDVDEWFDKMTSVKYFQN--MTALENRI 145

Query: 145 VVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN---------GTVLESFVRPA 195
           ++  + +GH LGG+ W ITK  E +IYA  +N  K+  LN         G+ + S VRP+
Sbjct: 146 LLTAYNSGHTLGGSFWLITKRLEKIIYAPTWNHSKDSFLNSASFLSPTTGSPISSLVRPS 205

Query: 196 VLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE- 254
            +IT       N   +++ E F   +  TL  GG VLLP   +GR LELL I++++ A  
Sbjct: 206 AIITSTELG-SNMSHKKRMEKFLQLVDATLANGGAVLLPTTISGRFLELLRIIDEHLANL 264

Query: 255 HSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE--TSRDNA-----FLLKHVTLLI 307
                P+YFL+Y  +  + Y  + L+WM   + K +E   + D A     F    V LL 
Sbjct: 265 QGAAIPVYFLSYSGTKVLSYAANLLDWMSSQLIKEYEGIAAEDRAYSRVPFEPSKVDLLS 324

Query: 308 NKSELDNAPDGPKLVLASMASLEAG-FSHDIFVEWASDVKNLVLFTERGQFG-------- 358
           N  EL   P GPK+V AS    + G  S         D K  ++ TE+  F         
Sbjct: 325 NPQELIQLP-GPKIVFASGIDFKDGDMSTQALQLLCQDEKTTIILTEKSSFARDNTCTTD 383

Query: 359 ------TLARMLQADPPPKAVKVTMSRRVPLVG----EELIAYEEEQTRLKKEEALKASL 408
                 TLA           V V + + +PL      EEL   E ++ + K  +A +  L
Sbjct: 384 LFQEWYTLASAKNNGVAEDGVPVPLEKAIPLTSWTREEELKDVELQRFKEKVAQARRQKL 443

Query: 409 ---VKEEESKASLGPDNN----------LSGD-------PMVIDANNANASAD---VVEP 445
              V+++++K  L  D N          +S D         VI +  AN  AD   V+  
Sbjct: 444 LNKVRDKKNKNILNADLNSDDSSSDEDEISTDEEEKGIEANVISSTTANGQADATSVLNS 503

Query: 446 HGGRYRDILIDGF---VPPSTSVA-------PMFPFYENNSEW--DDFGEVINPDDYIIK 493
           H     D + +      P  T V+        MFPF+ ++ +   DD+GEVI+P D+   
Sbjct: 504 HEVFVTDYVTENLEANKPVDTRVSYKLKPRQAMFPFFPSSKKRKHDDYGEVIDPKDFQRS 563

Query: 494 DED 496
           DE+
Sbjct: 564 DEN 566


>gi|321264788|ref|XP_003197111.1| cleavage and polyadenylation specificity factor [Cryptococcus
           gattii WM276]
 gi|317463589|gb|ADV25324.1| Cleavage and polyadenylation specificity factor, putative
           [Cryptococcus gattii WM276]
          Length = 778

 Score =  153 bits (386), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 100/324 (30%), Positives = 169/324 (52%), Gaps = 14/324 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGL---SAPVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+DA+L++H    H  ALPY M++      +  V+ T     +  LTM D      Q  
Sbjct: 79  STVDALLITHFHVDHAAALPYIMEKTNFKDGNGKVYMTHATKAIYGLTMMDTVRLNDQNP 138

Query: 110 EFD--LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGE 167
           +    L+   D+ S++QS   + Y Q+  ++G   G+   P+ AGH+LG +++ I   G 
Sbjct: 139 DTSGRLYDEADVQSSWQSTIAVDYHQDIVIAG---GLRFTPYHAGHVLGASMFLIEIAGL 195

Query: 168 DVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLR 226
            ++Y  DY+R +++HL    +   V+P V+I ++   +H  P R+++E  F   ++  +R
Sbjct: 196 KILYTGDYSREEDRHLVMAEIPP-VKPDVMICESTFGVHTLPDRKEKEEQFTTLVANIVR 254

Query: 227 AGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGD 284
            GG  L+P+ S G   EL L+L++YW +H    N P+YF + +    +   K+++  M  
Sbjct: 255 RGGRCLMPIPSFGNGQELALLLDEYWNDHPELQNIPVYFASSLFQRGMRVYKTYVHTMNA 314

Query: 285 SITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASD 344
           +I   F   RDN F  + V  L +  +L     GP ++++S   +  G S D+  EWA D
Sbjct: 315 NIRSRF-ARRDNPFDFRFVKWLKDPQKLREN-KGPCVIMSSPQFMSFGLSRDLLEEWAPD 372

Query: 345 VKNLVLFTERGQFGTLARMLQADP 368
            KN V+ T     GT+AR L ++P
Sbjct: 373 SKNGVIVTGYSIEGTMARTLLSEP 396


>gi|147787280|emb|CAN71414.1| hypothetical protein VITISV_029216 [Vitis vinifera]
          Length = 687

 Score =  153 bits (386), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 114/387 (29%), Positives = 185/387 (47%), Gaps = 44/387 (11%)

Query: 2   GTSVQVTPLSGVFNENPLSYL-VSIDGFNFLIDCG------------WNDHFDPSLLQPL 48
           G  + +TPL G  NE   S + +S  G   L DCG            + D  DPS     
Sbjct: 14  GDQLIITPL-GAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPS----- 67

Query: 49  SKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSR 105
                TID +L++H    H  +LPY +++      VF   +T+ +Y+L    +   Y+  
Sbjct: 68  -----TIDVLLVTHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKL----LLSDYVKV 118

Query: 106 RQVSEFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
            +VS  D L+   DI  +   +  + + Q   ++G    I    + AGH+LG  ++ +  
Sbjct: 119 SKVSVEDMLYDEQDILRSMDKIEVIDFHQTLEVNG----IRFWCYTAGHVLGAAMFMVDI 174

Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
            G  V+Y  DY+R +++HL    +  F     +I   Y    +QP   + + F D I  T
Sbjct: 175 AGVRVLYTGDYSREEDRHLRAAEIPQFSPDICIIESTYGVQLHQPRHVREKRFTDVIHST 234

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWM 282
           +  GG VL+P  + GR  ELLLIL++YW+ H    N PIY+ + ++   +   ++++  M
Sbjct: 235 ISQGGRVLIPAFALGRAQELLLILDEYWSNHPELHNIPIYYASPLAKRCMAVYQTYINSM 294

Query: 283 GDSITKSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEW 341
            + I   F  S  N F  KH++ L     ++N  D GP +V+AS + L++G S  +F  W
Sbjct: 295 NERIRNQFANS--NPFDFKHISPL---KSIENFNDVGPSVVMASPSGLQSGLSRQLFDMW 349

Query: 342 ASDVKNLVLFTERGQFGTLARMLQADP 368
            SD KN  +       GTLA+ +  +P
Sbjct: 350 CSDKKNACVIPGYVVEGTLAKTIINEP 376


>gi|322786053|gb|EFZ12664.1| hypothetical protein SINV_01905 [Solenopsis invicta]
          Length = 686

 Score =  153 bits (386), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 102/372 (27%), Positives = 192/372 (51%), Gaps = 26/372 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +         P   +  A  ID +L+SH    H GALP+ +++  
Sbjct: 36  MLEFKGKKIMLDCGIHPGLSGMDALPFVDLVEADEIDLLLISHFHLDHCGALPWFLQKTS 95

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQV-SEFDLFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR  L      Y+    + +E  L+T  D++++   +  +    N+
Sbjct: 96  FKGRCFMTHATKAIYRWLL----SDYIKVSNIATEQMLYTESDLETSMDKIETI----NF 147

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H      GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + + P 
Sbjct: 148 HEEKDVFGIKFWAYNAGHVLGAAMFMIEIAGVKILYTGDFSRQEDRHLMAAEIPN-IHPD 206

Query: 196 VLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT++    H    R+ RE  F + + + +  GG  L+PV + GR  ELLLIL++YW++
Sbjct: 207 VLITESTYGTHIHEKREDREGRFTNLVHEIVNRGGRCLIPVFALGRAQELLLILDEYWSQ 266

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           HS     PIY+ + ++   +   ++++  M D I +  + + +N F+ KH++   N   +
Sbjct: 267 HSELHEIPIYYASSLAKKCMAVYQTYVNAMNDKIRR--QIAINNPFVFKHIS---NLKGI 321

Query: 313 DNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPK 371
           D+  D GP +V+AS   +++G S ++F  W +D KN V+       GTLA+ + ++  P+
Sbjct: 322 DHFEDIGPCVVMASPGMMQSGLSRELFESWCTDAKNGVIIAGYCVEGTLAKTILSE--PE 379

Query: 372 AVKVTMSRRVPL 383
            +     +++PL
Sbjct: 380 EITTMSGQKLPL 391


>gi|340518710|gb|EGR48950.1| predicted protein [Trichoderma reesei QM6a]
          Length = 962

 Score =  152 bits (385), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 125/424 (29%), Positives = 188/424 (44%), Gaps = 78/424 (18%)

Query: 9   PLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTL 66
           PL G  +E+  S  L+ +DG    L+D GW++ F    L+ L K   T+  +LL+H    
Sbjct: 6   PLQGALSESLASQSLLELDGGVKVLVDLGWDETFSSDKLEELEKQVPTLSLILLTHATVS 65

Query: 67  HLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLSR-------RQVSEFDLF--- 114
           HL A  +  K + L    PV++T PV  LG     D Y S        RQ S  +     
Sbjct: 66  HLAAYAHCCKNIALFTRIPVYATRPVIDLGRTLTQDLYSSTPAAATTIRQSSLSETAYAY 125

Query: 115 ---------------TLDDIDSAFQSVTRLTYSQNYH-----LSGKGEGIVVAPHVAGHL 154
                          T ++I   F  +  L YSQ +       S    G+ +  + +GH 
Sbjct: 126 SQTATTAQNLLLQSPTPEEIARYFSLIQPLKYSQPHQPLSSPFSPPLNGLTITAYNSGHT 185

Query: 155 LGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAY 202
           LGGT+W I    E ++YAVD+N+ +E    G             V+E   +P  LI  + 
Sbjct: 186 LGGTIWHIQHGLESIVYAVDWNQARENVFAGAAWLGGAGGGGAEVIEQLRKPTALICSSR 245

Query: 203 NALHNQPP--RQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNY 259
            A        R +R E   + I   +  GG VL+PVDS+ RVLE+  +LE  W   + N 
Sbjct: 246 GADRTAQAGGRAKRDEHLLEMIKTCVSRGGTVLIPVDSSARVLEISYLLEHAWRTDAANR 305

Query: 260 -------PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET-------------SRDNA-F 298
                   +Y      SST+ Y +S LEWM ++I + FE               ++ A F
Sbjct: 306 DGVLKYSKLYLAGRNVSSTMRYARSMLEWMDNNIVQEFEAFAEGQRKVNGGSEKKEGAPF 365

Query: 299 LLKHVTLLINKSE--------LDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
             K++ LL  K++        ++N     +++LAS  ++E GFS D+    A D +NLV+
Sbjct: 366 DFKYLRLLERKAQIIKLLSQNIENGETHGRVILASDITMEWGFSKDLVKGLARDSRNLVI 425

Query: 351 FTER 354
            TER
Sbjct: 426 LTER 429


>gi|58270576|ref|XP_572444.1| hypothetical protein CNH02710 [Cryptococcus neoformans var.
           neoformans JEC21]
 gi|134118056|ref|XP_772409.1| hypothetical protein CNBL2750 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|338819805|sp|P0CM89.1|YSH1_CRYNB RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
           3'-end-processing protein YSH1
 gi|338819806|sp|P0CM88.1|YSH1_CRYNJ RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
           3'-end-processing protein YSH1
 gi|50255022|gb|EAL17762.1| hypothetical protein CNBL2750 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|57228702|gb|AAW45137.1| hypothetical protein CNH02710 [Cryptococcus neoformans var.
           neoformans JEC21]
          Length = 773

 Score =  152 bits (385), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 100/324 (30%), Positives = 169/324 (52%), Gaps = 14/324 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGL---SAPVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+DA+L++H    H  ALPY M++      +  V+ T     +  LTM D      Q  
Sbjct: 79  STVDAMLITHFHVDHAAALPYIMEKTNFKDGNGKVYMTHATKAIYGLTMMDTVRLNDQNP 138

Query: 110 EFD--LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGE 167
           +    L+   D+ S++QS   + Y Q+  ++G   G+   P+ AGH+LG +++ I   G 
Sbjct: 139 DTSGRLYDEADVQSSWQSTIAVDYHQDIVIAG---GLRFTPYHAGHVLGASMFLIEIAGL 195

Query: 168 DVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLR 226
            ++Y  DY+R +++HL    +   V+P V+I ++   +H  P R+++E  F   ++  +R
Sbjct: 196 KILYTGDYSREEDRHLVMAEIPP-VKPDVMICESTFGVHTLPDRKEKEEQFTTLVANIVR 254

Query: 227 AGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGD 284
            GG  L+P+ S G   EL L+L++YW +H    N P+YF + +    +   K+++  M  
Sbjct: 255 RGGRCLMPIPSFGNGQELALLLDEYWNDHPELQNIPVYFASSLFQRGMRVYKTYVHTMNA 314

Query: 285 SITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASD 344
           +I   F   RDN F  + V  L +  +L     GP ++++S   +  G S D+  EWA D
Sbjct: 315 NIRSRF-ARRDNPFDFRFVKWLKDPQKLREN-KGPCVIMSSPQFMSFGLSRDLLEEWAPD 372

Query: 345 VKNLVLFTERGQFGTLARMLQADP 368
            KN V+ T     GT+AR L ++P
Sbjct: 373 SKNGVIVTGYSIEGTMARTLLSEP 396


>gi|268530366|ref|XP_002630309.1| Hypothetical protein CBG00745 [Caenorhabditis briggsae]
          Length = 637

 Score =  152 bits (385), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 98/374 (26%), Positives = 182/374 (48%), Gaps = 18/374 (4%)

Query: 4   SVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTID 56
           ++++ PL    +      L++I   N ++DCG +  +       D S +    ++   +D
Sbjct: 33  NIKIVPLGAGQDVGRSCILITIGTKNIMVDCGMHMGYQDDRRFPDFSYIGGGGRLTDYLD 92

Query: 57  AVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVS-EFDLFT 115
            V++SH    H G+LP+  + +G   P++ T P   +  + + D    +  +  E + FT
Sbjct: 93  CVIISHFHLDHCGSLPHMSEIVGYDGPIYMTYPTKAICPVLLEDYRKVQCDIKGETNFFT 152

Query: 116 LDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDY 175
            DDI +  + V      +   +  +   + +    AGH+LG  +++I      V+Y  DY
Sbjct: 153 SDDIKNCMKKVIGCALHEIIQVDDQ---LSIRAFYAGHVLGAAMFEIRVGDHSVLYTGDY 209

Query: 176 NRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLP 234
           N   ++HL    +   VRP +LI+++  A   +  ++ RE  F   + +T+  GG V++P
Sbjct: 210 NMTPDRHLGAARVLPGVRPTILISESTYATTIRDSKRARERDFLRKVHETVMKGGKVIIP 269

Query: 235 VDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSR 294
           V + GR  EL ++LE YW   +LN PIYF   ++     Y + F+ W  ++I K+F    
Sbjct: 270 VFALGRAQELCILLESYWERMALNVPIYFSQGLAERANQYYRLFISWTNENIKKTF--VE 327

Query: 295 DNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
            N F  KH+  +    E  + P GP+++ ++   L  G S  +F +W SD  N+++    
Sbjct: 328 RNMFEFKHIRPMEKGCE--DQP-GPQVLFSTPGMLHGGQSLKVFKKWCSDPLNMIIMPGY 384

Query: 355 GQFGTL-ARMLQAD 367
              GT+ AR++  +
Sbjct: 385 CVAGTVGARVINGE 398


>gi|242007002|ref|XP_002424331.1| Cleavage and polyadenylation specificity factor 73 kDa subunit,
           putative [Pediculus humanus corporis]
 gi|212507731|gb|EEB11593.1| Cleavage and polyadenylation specificity factor 73 kDa subunit,
           putative [Pediculus humanus corporis]
          Length = 692

 Score =  152 bits (385), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 105/358 (29%), Positives = 186/358 (51%), Gaps = 26/358 (7%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKV----ASTIDAVLLSHPDTLHLGALPYAMKQ 77
           ++   G N ++DCG   H   S L  L  V    A  ID +L++H    H GALP+ + +
Sbjct: 37  MLEFKGKNVMLDCGI--HPGLSGLDALPFVDLIEADEIDLLLVTHFHLDHSGALPWFLLK 94

Query: 78  LGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVS-EFDLFTLDDIDSAFQSVTRLTYSQ 133
                  F   +T+ +YR     +   Y+    +S E  L+T  D++ + + +  +    
Sbjct: 95  TKFKGRCFMTHATKAIYRW----LLSDYIKVSNISTEQMLYTDHDLEESMEKIETI---- 146

Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 193
           N+H   +  GI    + AGH+LG  ++ I   G  V+Y  D++R++++HL    + S ++
Sbjct: 147 NFHEEKEIFGIKFWAYHAGHVLGAAMFMIEIAGVRVLYTGDFSRQEDRHLMAAEIPS-IK 205

Query: 194 PAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 252
           P VLIT++    H    R++RE  F + I   +  GG  L+PV + GR  ELLLIL+DYW
Sbjct: 206 PDVLITESTYGTHIHEKREERETRFTNLIHTIINRGGRCLIPVFALGRAQELLLILDDYW 265

Query: 253 AEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKS 310
           ++H    + PIY+ + ++   +   ++++  M D I +  + + +N F+ +H+  L    
Sbjct: 266 SQHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRR--QIAINNPFIFRHIHNLKGID 323

Query: 311 ELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
             D+   GP +V+AS   +++G S ++F  W +D KN V+       GTLA+ + ++P
Sbjct: 324 HFDDI--GPCVVMASPGMMQSGLSRELFELWCTDSKNGVIIAGYCVEGTLAKQILSEP 379


>gi|302808975|ref|XP_002986181.1| hypothetical protein SELMODRAFT_234972 [Selaginella moellendorffii]
 gi|300146040|gb|EFJ12712.1| hypothetical protein SELMODRAFT_234972 [Selaginella moellendorffii]
          Length = 684

 Score =  152 bits (385), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 110/399 (27%), Positives = 189/399 (47%), Gaps = 40/399 (10%)

Query: 2   GTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCG------------WNDHFDPSLLQPLS 49
           G  +++ PL            ++  G   L DCG            + D  DPS      
Sbjct: 20  GEKMEIMPLGAGSEVGRSCCHMTYKGKTILFDCGIHPGYTGMAALPYFDEIDPS------ 73

Query: 50  KVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRR 106
               TID +L++H    H  +LPY +++      VF   +T+ +Y+L LLT Y + +S+ 
Sbjct: 74  ----TIDVLLVTHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKL-LLTDYVK-ISKG 127

Query: 107 QVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG 166
            V +  L+   D+      +  + + Q   ++G    I    + AGH+LG  ++ +   G
Sbjct: 128 SVEDM-LYDEQDVLKTMDKIEVIDFHQTMEVNG----IRFWCYTAGHVLGAAMFMVDIAG 182

Query: 167 EDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLR 226
             V+Y  DY+R +++HL    +  F     +I   Y    +QP   + + F + I++T+ 
Sbjct: 183 IRVLYTGDYSREEDRHLKAAEMPEFSPDVCIIESTYGVQIHQPRHVREKRFTETIAQTVS 242

Query: 227 AGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGD 284
            GG VL+P  + GR  ELLLIL++YW  H    + PIY+ + ++   +   ++++  M D
Sbjct: 243 HGGRVLIPAFALGRAQELLLILDEYWEAHPELQHIPIYYASPLAKKCMAVYQTYINSMND 302

Query: 285 SITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASD 344
            I   +E S  N F  KH++ L +  + ++   GP +V+AS + L++G S  +F  W  D
Sbjct: 303 KIKSQYENS--NPFNFKHISPLKSIEQFEDV--GPSIVMASPSGLQSGLSRQLFDRWCQD 358

Query: 345 VKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
            KN  +       GTLA+ +  +  PK V +     VPL
Sbjct: 359 RKNACVIPGYVVEGTLAKTILNE--PKEVALVSGLVVPL 395


>gi|224140919|ref|XP_002323824.1| predicted protein [Populus trichocarpa]
 gi|222866826|gb|EEF03957.1| predicted protein [Populus trichocarpa]
          Length = 699

 Score =  152 bits (385), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 117/400 (29%), Positives = 193/400 (48%), Gaps = 42/400 (10%)

Query: 2   GTSVQVTPLSGVFNENPLSYL-VSIDGFNFLIDCG------------WNDHFDPSLLQPL 48
           G  + +TPL G  NE   S + +S  G   L DCG            + D  DPS     
Sbjct: 22  GDQLTLTPL-GAGNEVGRSCVYMSFKGKTVLFDCGIHPAYSGMAALPYFDEIDPS----- 75

Query: 49  SKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSR 105
                TID +L++H    H  +LPY +++      VF   +T+ +Y+L LLT Y + +S+
Sbjct: 76  -----TIDVLLVTHFHLDHAASLPYFLEKTTFRGRVFMTHATKAIYKL-LLTDYVK-VSK 128

Query: 106 RQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKD 165
             V +  LF   DI+ +   +  + + Q   ++G    I    + AGH+LG  ++ +   
Sbjct: 129 VSVEDM-LFDEKDINRSMDKIEVIDFHQTVDVNG----IKFWCYTAGHVLGAAMFMVDIA 183

Query: 166 GEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTL 225
           G  V+Y  DY+R +++HL    +  F     +I   Y    +QP   + + F D I  T+
Sbjct: 184 GVRVLYTGDYSREEDRHLRAAEMPQFSPDICIIESTYGVQLHQPRHIREKRFTDVIHSTI 243

Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMG 283
             GG VL+P  + GR  ELLLIL++YW+ H    N P+Y+ + ++   +   ++++  M 
Sbjct: 244 SLGGRVLIPAFALGRAQELLLILDEYWSNHPELHNIPVYYASPLAKKCMTVYQTYILSMN 303

Query: 284 DSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWAS 343
           + I   F  S  N F  KH++ L +  +  +   GP +V+A+   L++G S  +F  W S
Sbjct: 304 ERIRNQFADS--NPFKFKHISPLNSIEDFTDV--GPSVVMATPGGLQSGLSRQLFDMWCS 359

Query: 344 DVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
           D KN  +       GTLA+ +  +  PK V++      PL
Sbjct: 360 DKKNACVIPGFLVEGTLAKTIINE--PKEVQLMNGLTAPL 397


>gi|308509314|ref|XP_003116840.1| hypothetical protein CRE_01624 [Caenorhabditis remanei]
 gi|308241754|gb|EFO85706.1| hypothetical protein CRE_01624 [Caenorhabditis remanei]
          Length = 612

 Score =  152 bits (385), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 98/374 (26%), Positives = 182/374 (48%), Gaps = 18/374 (4%)

Query: 4   SVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTID 56
           ++++ PL    +      L++I G N ++DCG +  +       D S +    ++   +D
Sbjct: 7   TIKIVPLGAGQDVGRSCILITIGGKNIMVDCGMHMGYQDDRRFPDFSYIGGGGRLTDYLD 66

Query: 57  AVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVS-EFDLFT 115
            V++SH    H G+LP+  + +G   P++ T P   +  + + D    +  +  E + FT
Sbjct: 67  CVIISHFHLDHCGSLPHMSEIVGYDGPIYMTYPTKAICPVLLEDYRKVQCDIKGESNFFT 126

Query: 116 LDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDY 175
            DDI +  + V      +   +  +   + +    AGH+LG  +++I      V+Y  DY
Sbjct: 127 SDDIKNCMKKVIGCALHEIIQVDDQ---LSIRAFYAGHVLGAAMFEIRLGDHSVLYTGDY 183

Query: 176 NRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLP 234
           N   ++HL    +   VRP VLI+++  A   +  ++ RE  F   + +T+  GG V++P
Sbjct: 184 NMTPDRHLGAARVLPGVRPTVLISESTYATTIRDSKRARERDFLRKVHETVMKGGKVIIP 243

Query: 235 VDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSR 294
           V + GR  EL ++LE YW   +L+ PIYF   ++     Y + F+ W  ++I K+F    
Sbjct: 244 VFALGRAQELCILLESYWERMALSVPIYFSQGLAERANQYYRLFISWTNENIKKTF--VE 301

Query: 295 DNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
            N F  KH+  +    E  + P GP+++ ++   L  G S  +F +W  D  N+++    
Sbjct: 302 RNMFEFKHIRPMEKGCE--DQP-GPQVLFSTPGMLHGGQSLKVFKKWCGDPLNMIIMPGY 358

Query: 355 GQFGTL-ARMLQAD 367
              GT+ AR++  +
Sbjct: 359 CVAGTVGARVINGE 372


>gi|357114659|ref|XP_003559115.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-I-like [Brachypodium distachyon]
          Length = 768

 Score =  152 bits (385), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 107/386 (27%), Positives = 180/386 (46%), Gaps = 42/386 (10%)

Query: 2   GTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCG------------WNDHFDPSLLQPLS 49
           G  + +TPL            ++  G   L DCG            + D  DPS      
Sbjct: 96  GDQMVITPLGAGSEVGRSCVHMTFKGRTVLFDCGIHPAYSGMAALPYFDEIDPS------ 149

Query: 50  KVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRR 106
                ID +L++H    H  +LPY +++      VF   +T+ +YRL    +   Y+   
Sbjct: 150 ----AIDVLLVTHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYRL----LLSDYVKVS 201

Query: 107 QVSEFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKD 165
           +VS  D LF   DI  +   +  + + Q   ++G    I    + AGH+LG  ++ +   
Sbjct: 202 KVSVEDMLFDEQDIIRSMDKIEVIDFHQTLEVNG----IRFWCYTAGHVLGAAMFMVDIA 257

Query: 166 GEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTL 225
           G  ++Y  DY+R +++HL    +  F     ++   Y    +QP   + + F DAI  T+
Sbjct: 258 GVRILYTGDYSREEDRHLKAAEIPQFSPDVCIVESTYGVQQHQPRHVREKRFTDAIHNTV 317

Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMG 283
             GG VL+P  + GR  ELLLIL++YW+ H      PIY+ + ++   +   ++++  M 
Sbjct: 318 SQGGRVLIPAFALGRAQELLLILDEYWSNHPELHKIPIYYASPLAKKCMAVYQTYINSMN 377

Query: 284 DSITKSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWA 342
           + I   F  S  N F  KH+  L   + +DN  D GP +V+AS  +L++G S  +F +W 
Sbjct: 378 ERIRNQFAQS--NPFHFKHIEPL---NSIDNFHDVGPSVVMASPGTLQSGLSRQLFDKWC 432

Query: 343 SDVKNLVLFTERGQFGTLARMLQADP 368
           +D KN  +       GTL++ +  +P
Sbjct: 433 TDKKNTCVIPGFVIEGTLSKTIINEP 458


>gi|302806483|ref|XP_002984991.1| hypothetical protein SELMODRAFT_234671 [Selaginella moellendorffii]
 gi|302825687|ref|XP_002994439.1| hypothetical protein SELMODRAFT_236963 [Selaginella moellendorffii]
 gi|300137630|gb|EFJ04498.1| hypothetical protein SELMODRAFT_236963 [Selaginella moellendorffii]
 gi|300147201|gb|EFJ13866.1| hypothetical protein SELMODRAFT_234671 [Selaginella moellendorffii]
          Length = 677

 Score =  152 bits (384), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 110/399 (27%), Positives = 189/399 (47%), Gaps = 40/399 (10%)

Query: 2   GTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCG------------WNDHFDPSLLQPLS 49
           G  +++ PL            ++  G   L DCG            + D  DPS      
Sbjct: 13  GEKMEIMPLGAGSEVGRSCCHMTYKGKTILFDCGIHPGYTGMAALPYFDEIDPS------ 66

Query: 50  KVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRR 106
               TID +L++H    H  +LPY +++      VF   +T+ +Y+L LLT Y + +S+ 
Sbjct: 67  ----TIDVLLVTHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKL-LLTDYVK-ISKG 120

Query: 107 QVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG 166
            V +  L+   D+      +  + + Q   ++G    I    + AGH+LG  ++ +   G
Sbjct: 121 SVEDM-LYDEQDVLKTMDKIEVIDFHQTMEVNG----IRFWCYTAGHVLGAAMFMVDIAG 175

Query: 167 EDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLR 226
             V+Y  DY+R +++HL    +  F     +I   Y    +QP   + + F + I++T+ 
Sbjct: 176 IRVLYTGDYSREEDRHLKAAEMPEFSPDVCIIESTYGVQIHQPRHVREKRFTETIAQTVS 235

Query: 227 AGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGD 284
            GG VL+P  + GR  ELLLIL++YW  H    + PIY+ + ++   +   ++++  M D
Sbjct: 236 HGGRVLIPAFALGRAQELLLILDEYWEAHPELQHIPIYYASPLAKKCMAVYQTYINSMND 295

Query: 285 SITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASD 344
            I   +E S  N F  KH++ L +  + ++   GP +V+AS + L++G S  +F  W  D
Sbjct: 296 KIKSQYENS--NPFNFKHISPLKSIEQFEDV--GPSIVMASPSGLQSGLSRQLFDRWCQD 351

Query: 345 VKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
            KN  +       GTLA+ +  +  PK V +     VPL
Sbjct: 352 RKNACVIPGYVVEGTLAKTILNE--PKEVALVSGLVVPL 388


>gi|168034228|ref|XP_001769615.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162679157|gb|EDQ65608.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 563

 Score =  152 bits (384), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 105/372 (28%), Positives = 179/372 (48%), Gaps = 23/372 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
           +V+I G N + DCG +  +       D S +         ID V+++H    H+GALPY 
Sbjct: 14  IVTIGGKNIMFDCGMHMGYQDERRYPDFSFISKSGDFTHVIDCVIVTHFHLDHIGALPYF 73

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTM--YDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYS 132
            +  G   P++ T P   L  L +  Y + +  R+  E + F++  I    + VT +   
Sbjct: 74  TEVCGYDGPIYMTYPTKALAPLMLEDYRKVMVERK-GEQEQFSVLQIQKCMKKVTAVDLR 132

Query: 133 QNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFV 192
           Q   +   G  +    + AGH+LG  ++ +    + V+Y  DYN   ++HL    ++  +
Sbjct: 133 QTIKV---GADLEFRAYYAGHVLGAAMFWVKAGDDTVVYTGDYNMTPDRHLGAAQIDR-L 188

Query: 193 RPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDY 251
            P +LIT++  A   +  ++ RE  F  A+ K + AGG VL+PV + GR  EL ++L++Y
Sbjct: 189 EPDLLITESTYATTVRDSKRAREREFLKAVHKCVAAGGKVLIPVFALGRAQELCILLDEY 248

Query: 252 WAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSE 311
           W   +L+ PIY    ++     Y K  + W    +  ++ T   N F  KHV +   +S+
Sbjct: 249 WERTNLDMPIYISAGLTMQANVYYKLLISWTNQKVKDTYVTR--NTFDFKHV-IPFERSK 305

Query: 312 LDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPK 371
           +D AP GP ++ A+   L  G S ++F  WA    N+++       GT+   L    P K
Sbjct: 306 ID-AP-GPCVLFATPGMLSGGLSLEVFKHWAPSESNMIILPGFCVAGTVGSKLM---PGK 360

Query: 372 AVKVTMSRRVPL 383
             K+ + +R  L
Sbjct: 361 PAKIDLDKRTTL 372


>gi|384252038|gb|EIE25515.1| Metallo-hydrolase/oxidoreductase [Coccomyxa subellipsoidea C-169]
          Length = 696

 Score =  152 bits (383), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 101/347 (29%), Positives = 172/347 (49%), Gaps = 13/347 (3%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPL--SKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPV 84
           G   + DCG +  F      P   S    ++D +L++H    H  A+PY + +      +
Sbjct: 33  GKTVMFDCGVHPGFSGEQSLPYFDSIDLDSVDLMLVTHFHLDHCAAVPYVVGKTVFKGRI 92

Query: 85  FSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGI 144
           F T P   +  + + D     R  ++  L++  D+++A +    L + Q   +    +GI
Sbjct: 93  FMTHPTKAIFGMLLKDSVKVSRGATDAGLYSEKDVEAALERTELLDFHQTIDV----DGI 148

Query: 145 VVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNA 204
            V    AGH+LG  ++ +   G   +Y  DY+R  ++H++   L S   P ++I +A   
Sbjct: 149 KVTAWRAGHVLGAAMFMVEIAGMRALYTGDYSRLADRHMSAADLPS-PPPHIVIVEATYG 207

Query: 205 LHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPI 261
           +    PR+ RE  F + I   ++ GG  LLPV + GR  EL+LILEDYW  ++     PI
Sbjct: 208 VSRHLPREGREQRFVNMIRAVVQRGGRCLLPVVALGRAQELMLILEDYWDRNADLRGVPI 267

Query: 262 YFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKL 321
           Y  + ++   +   ++++  M D I  +F  S  N F  K++T L  +  LD+   GP +
Sbjct: 268 YQASGLARRALGIFQTYIAMMNDDIKAAFGQSA-NPFNFKYITELKTQGGLDDV--GPCV 324

Query: 322 VLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           VLA+ + L++G S ++F  W  D +N V+  +    GTLAR + A P
Sbjct: 325 VLATPSMLQSGLSRELFDAWCEDKRNGVIIADFAVQGTLARDILASP 371


>gi|156552097|ref|XP_001605081.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Nasonia vitripennis]
          Length = 688

 Score =  152 bits (383), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 100/356 (28%), Positives = 182/356 (51%), Gaps = 22/356 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +         P   +  A  ID +L+SH    H GALP+ +++  
Sbjct: 39  MLEFKGKKIMLDCGIHPGLSGLDALPFVDIIEADEIDLLLISHFHLDHCGALPWFLQKTN 98

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQV-SEFDLFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR  L      Y+    + +E  L+T  D++S+   +  +    N+
Sbjct: 99  FKGRCFMTHATKAIYRWLL----SDYIKVSNIATEQMLYTEADLESSMDKIETI----NF 150

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H      GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + V P 
Sbjct: 151 HEEKDVYGIKFWAYNAGHVLGAAMFMIEIAGVKILYTGDFSRQEDRHLMAAEIPN-VHPD 209

Query: 196 VLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT++    H    R+ RE  F + + + +  GG  L+PV + GR  ELLLIL++YW++
Sbjct: 210 VLITESTYGTHIHEKREDREGRFTNLVHEIVNRGGRCLIPVFALGRAQELLLILDEYWSQ 269

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H      PIY+ + ++   +   ++++  M D I +  + + +N F+ KH++ L      
Sbjct: 270 HPELHEIPIYYASSLAKKCMAVYQTYVNAMNDKIRR--QIAINNPFVFKHISNLKGIDHF 327

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           D+   GP +V+AS   +++G S ++F  W +D KN V+       GTLA+ + ++P
Sbjct: 328 DDI--GPCVVMASPGMMQSGLSRELFESWCTDAKNGVIIAGYCVEGTLAKTILSEP 381


>gi|307199387|gb|EFN80012.1| Cleavage and polyadenylation specificity factor subunit 3
           [Harpegnathos saltator]
          Length = 685

 Score =  152 bits (383), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 99/368 (26%), Positives = 185/368 (50%), Gaps = 18/368 (4%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +         P   +  A  ID +L+SH    H GALP+ +++  
Sbjct: 35  MLEFKGKRIMLDCGIHPGLSGMDALPFVDLVEADEIDLLLISHFHLDHCGALPWFLQKTS 94

Query: 80  LSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSG 139
                F T     +    + D        +E  L+T  D++++   +  +    N+H   
Sbjct: 95  FKGRCFMTHATKAIYRWLLSDYIKVSNIATEQMLYTESDLETSMDKIETI----NFHEEK 150

Query: 140 KGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLIT 199
              GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + + P VLIT
Sbjct: 151 DVFGIKFWAYNAGHVLGAAMFMIEIAGVKILYTGDFSRQEDRHLMAAEIPN-IHPDVLIT 209

Query: 200 DAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS-- 256
           ++    H    R+ RE  F + + + +  GG  L+PV + GR  ELLLIL++YW +HS  
Sbjct: 210 ESTYGTHIHEKREDREGRFTNLVHEIVNRGGRCLIPVFALGRAQELLLILDEYWGQHSEL 269

Query: 257 LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAP 316
              PIY+ + ++   +   ++++  M D I +  + + +N F+ KH++   N   +D+  
Sbjct: 270 HEIPIYYASSLAKKCMAVYQTYVNAMNDKIRR--QIAINNPFVFKHIS---NLKGIDHFE 324

Query: 317 D-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKV 375
           D GP +V+AS   +++G S ++F  W +D KN V+       GTLA+ + ++  P+ +  
Sbjct: 325 DIGPCVVMASPGMMQSGLSRELFESWCTDAKNGVIIAGYCVEGTLAKGILSE--PEEITT 382

Query: 376 TMSRRVPL 383
              +++PL
Sbjct: 383 MSGQKLPL 390


>gi|307177772|gb|EFN66769.1| Cleavage and polyadenylation specificity factor subunit 3 [Camponotus
            floridanus]
          Length = 1750

 Score =  151 bits (382), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 99/368 (26%), Positives = 186/368 (50%), Gaps = 18/368 (4%)

Query: 22   LVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQLG 79
            ++   G   ++DCG +         P   +  A  ID +L+SH    H GALP+ +++  
Sbjct: 1100 MLEFKGKKIMLDCGIHPGLSGMDALPFVDLVEADEIDLLLISHFHLDHCGALPWFLQKTS 1159

Query: 80   LSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSG 139
                 F T     +    + D        +E  L+T  D++++   +  +    N+H   
Sbjct: 1160 FKGRCFMTHATKAIYRWLLSDYIKVSNIATEQMLYTESDLETSMDKIETI----NFHEEK 1215

Query: 140  KGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLIT 199
               GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + + P VLIT
Sbjct: 1216 DVFGIKFWAYNAGHVLGAAMFMIEIAGVKILYTGDFSRQEDRHLMAAEIPN-IHPDVLIT 1274

Query: 200  DAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS-- 256
            ++    H    R+ RE  F + + + +  GG  L+PV + GR  ELLLIL++YW++HS  
Sbjct: 1275 ESTYGTHIHEKREDREGRFTNLVHEIVNRGGRCLIPVFALGRAQELLLILDEYWSQHSEL 1334

Query: 257  LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAP 316
               PIY+ + ++   +   ++++  M D I +  + + +N F+ KH++   N   +D+  
Sbjct: 1335 HEIPIYYASSLAKKCMAVYQTYVNAMNDKIRR--QIAINNPFVFKHIS---NLKGIDHFE 1389

Query: 317  D-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKV 375
            D GP +V+AS   +++G S ++F  W +D KN V+       GTLA+ + ++  P+ +  
Sbjct: 1390 DIGPCVVMASPGMMQSGLSRELFESWCTDAKNGVIIAGYCVEGTLAKTILSE--PEEITT 1447

Query: 376  TMSRRVPL 383
               +++PL
Sbjct: 1448 MSGQKLPL 1455


>gi|326435554|gb|EGD81124.1| integrator complex subunit 11 [Salpingoeca sp. ATCC 50818]
          Length = 620

 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 102/355 (28%), Positives = 173/355 (48%), Gaps = 17/355 (4%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HFDPSLLQPLSKVASTIDAV 58
           + V PL    +      +V ++G   + DCG    +ND   F    +     + S ID V
Sbjct: 38  IVVLPLGAGQDVGRSCIIVEMNGRTIMFDCGMHMGYNDDRRFPDFSVLADGDLTSRIDVV 97

Query: 59  LLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLD 117
           ++SH    H GALP+  +  G   P++ T P   +  L + D + +S  +  E + FT  
Sbjct: 98  IISHFHLDHCGALPFFSEMCGYDKPIYMTYPTKAICPLLLEDYRKISVERKGERNFFTSQ 157

Query: 118 DIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNR 177
            I      V  +   Q+  L G    I +  + AGH+LG  ++ +    + V+Y  DYN 
Sbjct: 158 MIKDCMSKVQPVDLHQSVTLPGD---IEIKAYYAGHVLGAAMFHVRVGDKSVVYTGDYNM 214

Query: 178 RKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVD 236
             ++HL GT    F +P  +IT++  A   +  ++ RE  F   + + ++ GG VL+PV 
Sbjct: 215 TPDRHL-GTAWIDFCQPDAIITESTYATTIRDSKRCRERDFLTKVHRCVKNGGKVLIPVF 273

Query: 237 SAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN 296
           + GR  EL ++LE YW  + L+ PIYF T ++    +Y + F+ +    I  +F     N
Sbjct: 274 ALGRAQELCILLETYWERYKLDTPIYFSTGLTEKANEYYRLFVMYTNQKIKDTFVDR--N 331

Query: 297 AFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
            F  KH+    ++S  D    GP+++ A+   L AG + ++F +WA D +N+V+ 
Sbjct: 332 LFDFKHIRAF-DRSYADQP--GPQVLFATPGMLHAGVALEVFAKWAGDPRNMVIL 383


>gi|405124298|gb|AFR99060.1| endoribonuclease YSH1 [Cryptococcus neoformans var. grubii H99]
          Length = 770

 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 100/324 (30%), Positives = 169/324 (52%), Gaps = 14/324 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGL---SAPVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+DA+L++H    H  ALPY M++      +  V+ T     +  LTM D      Q  
Sbjct: 79  STVDAMLITHFHVDHAAALPYIMEKTNFKDGNGKVYMTHATKAIYGLTMMDTVRLNDQNP 138

Query: 110 EFD--LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGE 167
           +    L+   D+ S++QS   + Y Q+  ++G   G+   P+ AGH+LG +++ I   G 
Sbjct: 139 DTSGRLYDEADVQSSWQSTIAVDYHQDIVIAG---GLRFTPYHAGHVLGASMFLIEIAGL 195

Query: 168 DVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLR 226
            ++Y  DY+R +++HL    +   V+P V+I ++   +H  P R+++E  F   ++  +R
Sbjct: 196 MILYTGDYSREEDRHLVMAEIPP-VKPDVMICESTFGVHTLPDRKEKEEQFTTLVANIVR 254

Query: 227 AGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGD 284
            GG  L+P+ S G   EL L+L++YW +H    N P+YF + +    +   K+++  M  
Sbjct: 255 RGGRCLMPIPSFGNGQELALLLDEYWHDHPELQNIPVYFASSLFQRGMRVYKTYVHTMNA 314

Query: 285 SITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASD 344
           +I   F   RDN F  + V  L +  +L     GP ++++S   +  G S D+  EWA D
Sbjct: 315 NIRSRF-ARRDNPFDFRFVKWLKDPQKLREN-KGPCVIMSSPQFMSFGLSRDLLEEWAPD 372

Query: 345 VKNLVLFTERGQFGTLARMLQADP 368
            KN V+ T     GT+AR L ++P
Sbjct: 373 SKNGVIVTGYSIEGTMARTLLSEP 396


>gi|328873132|gb|EGG21499.1| integrator complex subunit 11 [Dictyostelium fasciculatum]
          Length = 645

 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 104/381 (27%), Positives = 182/381 (47%), Gaps = 19/381 (4%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++V PL    +      +VSI   N + DCG +  +       D S +    +   T+D 
Sbjct: 3   IKVVPLGAGQDVGRSCVIVSIGNKNIMFDCGMHMGYHDERRFPDFSFISKTKQFTKTLDC 62

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           ++++H    H GALPY  +  G   P++ T P   +  + + D + +S  +  E + FT 
Sbjct: 63  IIITHFHLDHCGALPYFTEMCGYDGPIYMTLPTKAIVPILLEDYRKISVDRKGETNFFTP 122

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +    + + +  + AGH+LG  ++      E V+Y  DYN
Sbjct: 123 QMIKDCMKKVIPIALHQTIKVD---DELSIKAYYAGHVLGAAMFYAKVGEESVVYTGDYN 179

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++  VRP +LIT+   A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 180 MTPDRHLGSAWIDQ-VRPNLLITETTYATTIRDSKRGRERDFLKRVHECVEKGGKVLIPV 238

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GRV EL ++++ YW + +LN PIYF   ++     Y K F+ W    I ++F   + 
Sbjct: 239 FALGRVQELCILIDSYWEQMNLNVPIYFSEGLAEKANFYYKLFITWTNQKIKQTF--VKR 296

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+        L +AP GP ++ A+   L AG S ++F +WA +  N+ +     
Sbjct: 297 NMFDFKHIKPF--DRHLADAP-GPMVLFATPGMLHAGASLEVFKKWAPNELNMTIIPGYC 353

Query: 356 QFGTLA-RMLQADPPPKAVKV 375
             GT+  ++L     P+ V++
Sbjct: 354 VVGTVGNKLLSNAGGPQMVEI 374


>gi|241953057|ref|XP_002419250.1| subunit of mRNA cleavage and polyadenylation factor, putative
           [Candida dubliniensis CD36]
 gi|223642590|emb|CAX42840.1| subunit of mRNA cleavage and polyadenylation factor, putative
           [Candida dubliniensis CD36]
          Length = 930

 Score =  151 bits (381), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 146/547 (26%), Positives = 239/547 (43%), Gaps = 88/547 (16%)

Query: 28  FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGA---LPYAMKQLGLSAPV 84
           F  + D  WN   D +    + +     +A+LLSH     +     L      L  + P+
Sbjct: 27  FKLIADPFWNG-VDVNAAMFMEEHLKETNAILLSHSTAEFISGFILLFIKFPNLMSTIPI 85

Query: 85  FSTEPVYRLGLLTMYDQYLSRRQVSEFD--LFTLDDIDSAFQSVTRLTYSQNYHLSGKGE 142
           +ST PV +LG ++  + Y +   +   D  +  LD++D+ F  V  L Y Q+ +L     
Sbjct: 86  YSTLPVNQLGRVSTVEYYRAMGILGPVDTAILELDEVDNWFDKVNLLKYQQSLNLFD--N 143

Query: 143 GIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN---------GTVLESFVR 193
            +VV P+ AGH LGGT W ITK  + VIYA  +N  K+  LN         G    S +R
Sbjct: 144 KVVVTPYNAGHSLGGTFWLITKRIDRVIYAPAWNHSKDSFLNSASFISSSTGNPHLSLLR 203

Query: 194 PAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 253
           P   IT A +       R++ E F   +  TL  GG  +LP   +GR LEL  +++++  
Sbjct: 204 PTAFIT-ATDMGSVMSHRKRTEKFLQLVDATLANGGAAVLPTSLSGRFLELFHLIDEHLK 262

Query: 254 EHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELD 313
              +  P+YFL+Y  +  + Y  + L+WM  S TK +E      F    V LL++ SEL 
Sbjct: 263 GAPI--PVYFLSYSGTKILTYASNLLDWMSKSFTKEWEELSSVPFNPSKVDLLLDPSELL 320

Query: 314 NAPDGPKLVLASMASLEAG-FSHDIFVEWASDVKNLVLFTER--------------GQFG 358
           N   GPK+V  S   L +G  S + F    +D +  ++ TE+               ++ 
Sbjct: 321 NL-SGPKIVFCSGIDLRSGDISAEAFQYLCNDERTTIILTEKTTMSLESSLSSILYTEWD 379

Query: 359 TLARMLQADPPPKAVKVTM---------SRRVPLVGEELIAYEEEQTRLKKEEALKASLV 409
           TLA+          + V +         ++ + L G EL  ++E+  + +KE+ L  + V
Sbjct: 380 TLAKKRGGGESADGIAVPIDKNISLKNWTKEIELTGTELTEFQEKVAQKRKEKLL--AKV 437

Query: 410 KEEESKASLGPD--------------------------NNLSGDPMVIDANNANASADVV 443
           ++++++  L  D                          N L      I+  N+N SA+ V
Sbjct: 438 RDQKNQNILSADTVDSEDSSDDDEGDEEREKQKSDDASNLLIKQYQSINVANSNVSANEV 497

Query: 444 EP---HGGRYRDIL---IDGFVPPSTSVA-------PMFPFY--ENNSEWDDFGEVINPD 488
            P   H     D +   ++  +P    +          FP++   +  ++DD+GEVIN +
Sbjct: 498 NPLAIHEAFITDHIKQSLEKNLPIDLRITHKLRPRQATFPYFATSHKQKFDDYGEVINIE 557

Query: 489 DYIIKDE 495
           DY   DE
Sbjct: 558 DYQRHDE 564


>gi|332019331|gb|EGI59837.1| Cleavage and polyadenylation specificity factor subunit 3
           [Acromyrmex echinatior]
          Length = 685

 Score =  151 bits (381), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 101/372 (27%), Positives = 191/372 (51%), Gaps = 26/372 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +         P   +  A  ID +L+SH    H GALP+ + +  
Sbjct: 36  MLEFKGKKIMLDCGIHPGLSGMDALPFVDLVEADEIDLLLISHFHLDHCGALPWFLLKTS 95

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQV-SEFDLFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    + +E  L+T  D++++   +  +    N+
Sbjct: 96  FKGRCFMTHATKAIYRW----LLSDYIKVSNIATEQMLYTESDLETSMDKIETI----NF 147

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H      GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + + P 
Sbjct: 148 HEEKDMFGIKFWAYNAGHVLGAAMFMIEIAGVKILYTGDFSRQEDRHLMAAEIPN-IHPD 206

Query: 196 VLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT++    H    R+ RE  F + + + +  GG  L+PV + GR  ELLLIL++YW++
Sbjct: 207 VLITESTYGTHIHEKREDREGRFTNLVHEIVNRGGRCLIPVFALGRAQELLLILDEYWSQ 266

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           HS     PIY+ + ++   +   ++++  M D I +  + + +N F+ KH++   N   +
Sbjct: 267 HSELHEIPIYYASSLAKKCMAVYQTYVNAMNDKIRR--QIAINNPFVFKHIS---NLKGI 321

Query: 313 DNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPK 371
           D+  D GP +V+AS   +++G S ++F  W +D KN V+       GTLA+ + ++  P+
Sbjct: 322 DHFEDIGPCVVMASPGMMQSGLSRELFESWCTDAKNGVIIAGYCVEGTLAKTILSE--PE 379

Query: 372 AVKVTMSRRVPL 383
            +     +++PL
Sbjct: 380 EITTMSGQKLPL 391


>gi|168007963|ref|XP_001756677.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162692273|gb|EDQ78631.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 682

 Score =  151 bits (381), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 115/398 (28%), Positives = 191/398 (47%), Gaps = 38/398 (9%)

Query: 2   GTSVQVTPLSGVFNENPLSYL-VSIDGFNFLIDCGWND---------HFDPSLLQPLSKV 51
           G  ++VTPL G  NE   S + ++  G   + DCG +          +FD   + P+S  
Sbjct: 15  GDKLEVTPL-GAGNEVGRSCVYMTYKGKTVMFDCGIHPGYSGMAALPYFDE--IDPIS-- 69

Query: 52  ASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQV 108
              ID +L++H    H  +LPY +++      VF   +T+ +Y+L    +   ++   +V
Sbjct: 70  ---IDVLLVTHFHLDHCASLPYFLEKTNFKGRVFMTHATKAIYKL----LLSDFVKISKV 122

Query: 109 SEFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGE 167
           S  D L+   DI    + +  + + Q   ++G    I    + AGH+LG  ++ +   G 
Sbjct: 123 SVDDMLYDEHDIARTMEKIEVIDFHQTMEVNG----IRFWCYTAGHVLGAAMFMVDIAGM 178

Query: 168 DVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRA 227
            V+Y  DY+  +++HL    +  F     +I   Y    +QP   +   F D +++T+  
Sbjct: 179 RVLYTGDYSCEEDRHLRAAEMPHFSPDVCIIESTYGVQIHQPRIMRERRFTDTVAQTVSQ 238

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG VL+P  + GR  ELLLIL++YW  H    + PIY+ + ++   +   ++++  M D 
Sbjct: 239 GGKVLIPAFALGRAQELLLILDEYWEAHPELQHIPIYYASPLAKKCMAVYQTYINAMNDR 298

Query: 286 ITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDV 345
           I K FE S  N F  KH+  L N    D+   GP +V+AS   L++G S  +F  W  D 
Sbjct: 299 IQKQFEVS--NPFDFKHIQPLKNIDGFDDI--GPAVVMASPGGLQSGLSRQLFDIWCQDK 354

Query: 346 KNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
           KN  +       GTLA+ +  +  PK V +     VPL
Sbjct: 355 KNSCIIPGYVVEGTLAKAIMNE--PKEVTLLSGLVVPL 390


>gi|427779771|gb|JAA55337.1| Putative mrna cleavage and polyadenylation factor ii complex brr5
           cpsf subunit [Rhipicephalus pulchellus]
          Length = 621

 Score =  151 bits (381), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 114/400 (28%), Positives = 180/400 (45%), Gaps = 52/400 (13%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           + VTPL    +      L+SI G N ++DCG +  F       D S +     +   +D 
Sbjct: 4   ISVTPLGAGQDVGRSCILLSIGGKNVMLDCGMHMGFNDERRFPDFSYITQEGPLNEHLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G S PV+ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYMTEMVGYSGPVYMTHPTKAICPILLEDFRKITVDRKGETNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    + V+Y  DYN
Sbjct: 124 AMIRDCMRKVVAVNLHQAVQVDDELE---IKAYYAGHVLGAAMFRIRVGSQSVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-----FQDAISK-------- 223
              ++HL    L+   RP +LIT++  A   +  ++ RE        D I K        
Sbjct: 181 MTPDRHLGAAWLDK-CRPDLLITESTYATTIRDSKRCRERDFLTKVHDCIDKGGKVLIPV 239

Query: 224 ---TLR-------------------AGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPI 261
              T+R                    GG VL+PV + GR  EL ++LE YW   +L  PI
Sbjct: 240 FXTTIRDSKRCRERDFLTKVHDCIDKGGKVLIPVFALGRAQELCILLETYWDRMNLRVPI 299

Query: 262 YFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKL 321
           YF   ++    +Y K F+ W    I K+F   + N F  KH+    +++ +DN   GP +
Sbjct: 300 YFAVGLTEKATNYYKMFITWTNQKIRKTF--VQRNMFDFKHIKPF-DRAFIDNP--GPMV 354

Query: 322 VLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLA 361
           V A+   L AG S  IF +WA    N+V+       GT+ 
Sbjct: 355 VFATPGMLHAGLSLQIFKKWAPFEANMVIMPGYCVAGTVG 394


>gi|383861262|ref|XP_003706105.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Megachile rotundata]
          Length = 686

 Score =  151 bits (381), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 101/372 (27%), Positives = 191/372 (51%), Gaps = 26/372 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +         P   +  A  ID +L+SH    H GALP+ +++  
Sbjct: 36  MLEFKGKKIMLDCGIHPGLSGMDALPFVDLVEADEIDLLLISHFHLDHCGALPWFLQKTS 95

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQV-SEFDLFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR  L      Y+    + +E  L+T  D++++   +  +    N+
Sbjct: 96  FKGRCFMTHATKAIYRWLL----SDYIKVSNIATEQMLYTESDLETSMDKIETI----NF 147

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H      GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + + P 
Sbjct: 148 HEEKDVFGIKFWAYNAGHVLGAAMFMIEIAGVKILYTGDFSRQEDRHLMAAEIPN-IHPD 206

Query: 196 VLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT++    H    R+ RE  F + + + +  GG  L+PV + GR  ELLLIL++YW++
Sbjct: 207 VLITESTYGTHIHEKREDREGRFTNLVHEIVNRGGRCLIPVFALGRAQELLLILDEYWSQ 266

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H      PIY+ + ++   +   ++++  M D I +  + + +N F+ KH++   N   +
Sbjct: 267 HPELHEIPIYYASSLAKKCMAVYQTYVNAMNDKIRR--QIAINNPFVFKHIS---NLKGI 321

Query: 313 DNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPK 371
           D+  D GP +V+AS   +++G S ++F  W +D KN V+       GTLA+ + ++  P+
Sbjct: 322 DHFEDIGPCVVMASPGMMQSGLSRELFESWCTDAKNGVIIAGYCVEGTLAKTILSE--PE 379

Query: 372 AVKVTMSRRVPL 383
            +     +++PL
Sbjct: 380 EITTMSGQKLPL 391


>gi|226505292|ref|NP_001151522.1| cleavage and polyadenylation specificity factor, 73 kDa subunit
           [Zea mays]
 gi|195647398|gb|ACG43167.1| cleavage and polyadenylation specificity factor, 73 kDa subunit
           [Zea mays]
 gi|224034229|gb|ACN36190.1| unknown [Zea mays]
 gi|413932397|gb|AFW66948.1| cleavage and polyadenylation specificity factor, subunit [Zea mays]
          Length = 694

 Score =  151 bits (381), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 106/386 (27%), Positives = 181/386 (46%), Gaps = 42/386 (10%)

Query: 2   GTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCG------------WNDHFDPSLLQPLS 49
           G  + +TPL            ++  G   L DCG            + D  DPS      
Sbjct: 25  GDQMVITPLGAGSEVGRSCVHMTFKGRTVLFDCGIHPAYTGMAALPYFDEIDPS------ 78

Query: 50  KVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRR 106
                ID +L++H    H  +LPY +++      VF   +T+ +Y+L    +   Y+   
Sbjct: 79  ----AIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKL----LLSDYVKVS 130

Query: 107 QVSEFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKD 165
           +VS  D LF   DI  + + +  + + Q   ++G    I    + AGH+LG  ++ +   
Sbjct: 131 KVSVEDMLFDESDIARSMEKIEVIDFHQTLEVNG----IRFWCYTAGHVLGAAMFMVDIA 186

Query: 166 GEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTL 225
           G  ++Y  DY+R +++HL    L  F     +I   Y    +QP   + + F + I  T+
Sbjct: 187 GVRILYTGDYSREEDRHLRAAELPQFSPDICIIESTYGVQQHQPRIVREKRFTEVIHNTV 246

Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMG 283
             GG VL+P  + GR  ELLLIL++YW++H      PIY+ + ++   +   ++++  M 
Sbjct: 247 SQGGRVLIPAFALGRAQELLLILDEYWSKHPELHKIPIYYASPLAKRCMAVYQTYINSMN 306

Query: 284 DSITKSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWA 342
           + I   F  S  N F  KH+  L   + +DN  D GP +V+AS + L++G S  +F +W 
Sbjct: 307 ERIRNQFAQS--NPFHFKHIESL---NSIDNFHDVGPSVVMASPSGLQSGLSRQLFDKWC 361

Query: 343 SDVKNLVLFTERGQFGTLARMLQADP 368
           +D +N  +       GTLA+ +  +P
Sbjct: 362 TDKRNACVIPGYVVEGTLAKTIINEP 387


>gi|297279172|ref|XP_001092173.2| PREDICTED: integrator complex subunit 11 isoform 3 [Macaca mulatta]
          Length = 579

 Score =  151 bits (381), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 95/313 (30%), Positives = 158/313 (50%), Gaps = 11/313 (3%)

Query: 41  DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD 100
           D S +    ++   +D V++SH    H GALPY  + +G   P++ T P   +  + + D
Sbjct: 26  DFSYITQNGRLTDFLDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLED 85

Query: 101 -QYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTV 159
            + ++  +  E + FT   I    + V  +   Q   +  + E   +  + AGH+LG  +
Sbjct: 86  YRKIAVDKKGEANFFTSQMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAM 142

Query: 160 WKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQ 218
           ++I    E V+Y  DYN   ++HL    ++   RP +LIT++  A   +  ++ RE  F 
Sbjct: 143 FQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFL 201

Query: 219 DAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSF 278
             + +T+  GG VL+PV + GR  EL ++LE +W   +L  PIYF T ++     Y K F
Sbjct: 202 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 261

Query: 279 LEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIF 338
           + W    I K+F   + N F  KH+    +++  DN   GP +V A+   L AG S  IF
Sbjct: 262 IPWTNQKIRKTF--VQRNMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIF 316

Query: 339 VEWASDVKNLVLF 351
            +WA + KN+V+ 
Sbjct: 317 RKWAGNEKNMVIM 329


>gi|213512037|ref|NP_001133354.1| cleavage and polyadenylation specificity factor subunit 3 [Salmo
           salar]
 gi|209151738|gb|ACI33081.1| Cleavage and polyadenylation specificity factor subunit 3 [Salmo
           salar]
          Length = 690

 Score =  151 bits (381), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 98/356 (27%), Positives = 182/356 (51%), Gaps = 22/356 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 36  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 95

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 96  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 147

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + S V+P 
Sbjct: 148 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKILYTGDFSRQEDRHLMAAEIPS-VKPD 206

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LIT++    H    R++RE  F + I   +   G  L+PV + GR  ELLLIL++YW  
Sbjct: 207 ILITESTYGTHIHEKREEREARFCNTIHDIVNREGRCLIPVFALGRAQELLLILDEYWQN 266

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K+     +N F+ KH++ L +    
Sbjct: 267 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKAINV--NNPFVFKHISNLKSMDHF 324

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++P
Sbjct: 325 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYSVEGTLAKHIMSEP 378


>gi|380012076|ref|XP_003690115.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Apis florea]
          Length = 686

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 101/372 (27%), Positives = 191/372 (51%), Gaps = 26/372 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +         P   +  A  ID +L+SH    H GALP+ +++  
Sbjct: 36  MLEFKGKKIMLDCGIHPGLSGMDALPFVDLVEADEIDLLLISHFHLDHCGALPWFLQKTS 95

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQV-SEFDLFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR  L      Y+    + +E  L+T  D++++   +  +    N+
Sbjct: 96  FKGRCFMTHATKAIYRWLL----SDYIKVSNIATEQMLYTESDLETSMDKIETI----NF 147

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H      GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + + P 
Sbjct: 148 HEEKDVFGIKFWAYNAGHVLGAAMFMIEIAGVKILYTGDFSRQEDRHLMAAEIPN-IHPD 206

Query: 196 VLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT++    H    R+ RE  F + + + +  GG  L+PV + GR  ELLLIL++YW++
Sbjct: 207 VLITESTYGTHIHEKREDREGRFTNLVHEIVNRGGRCLIPVFALGRAQELLLILDEYWSQ 266

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H      PIY+ + ++   +   ++++  M D I +  + + +N F+ KH++   N   +
Sbjct: 267 HPELHEIPIYYASSLAKKCMAVYQTYVNAMNDKIRR--QIAINNPFVFKHIS---NLKGI 321

Query: 313 DNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPK 371
           D+  D GP +V+AS   +++G S ++F  W +D KN V+       GTLA+ + ++  P+
Sbjct: 322 DHFEDIGPCVVMASPGMMQSGLSRELFESWCTDAKNGVIIAGYCVEGTLAKTILSE--PE 379

Query: 372 AVKVTMSRRVPL 383
            +     +++PL
Sbjct: 380 EITTMSGQKLPL 391


>gi|312083284|ref|XP_003143797.1| RNA-metabolising metallo-beta-lactamase [Loa loa]
 gi|307761039|gb|EFO20273.1| RNA-metabolising metallo-beta-lactamase [Loa loa]
          Length = 644

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 95/358 (26%), Positives = 174/358 (48%), Gaps = 23/358 (6%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           +++ PL    +      LVSI G N ++DCG +  +       D S +     +   +D 
Sbjct: 59  IKIVPLGAGRDVGRSCILVSIGGKNVMLDCGMHMGYSDERRFPDFSFISGGGSLTEFLDC 118

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEF----DL 113
           V+++H    H G+LP+  + +G   P++ T P   +  + + D    R+  +EF    + 
Sbjct: 119 VIITHFHLDHCGSLPHMSEVIGYDGPIYMTYPTKAIAPVLLEDY---RKIQTEFKGDKNF 175

Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAV 173
           FT   I +  + V  +   +   +  +   + +    AGH+LG  +++I    E V+Y  
Sbjct: 176 FTSQMIKNCMKKVIAINIHEKIDIDNE---LSIRAFYAGHVLGAAMFQIMVGSESVLYTG 232

Query: 174 DYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVL 232
           D+N   ++HL    +E  ++P +LI+++  A   +  ++ RE  F   +  T+  GG VL
Sbjct: 233 DFNTTPDRHLGAARVEPGLKPDLLISESTYATTIRDSKRARERDFLKKVHDTVSNGGKVL 292

Query: 233 LPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET 292
           +PV + GR  EL ++LE YW   +L YPI+F   ++     Y + F+ W  + I ++F  
Sbjct: 293 IPVFALGRAQELCILLESYWERMNLKYPIFFSQGLAEKANQYYRLFISWTNEKIKRTF-- 350

Query: 293 SRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
              N F  KH+     +    ++P GP ++ ++   L  G S  +F +W SD KNL++
Sbjct: 351 VERNMFDFKHIRPF--EQSYTDSP-GPMVLFSTPGMLHGGQSLRVFTKWCSDEKNLII 405


>gi|300706889|ref|XP_002995677.1| hypothetical protein NCER_101357 [Nosema ceranae BRL01]
 gi|239604869|gb|EEQ82006.1| hypothetical protein NCER_101357 [Nosema ceranae BRL01]
          Length = 500

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 97/355 (27%), Positives = 172/355 (48%), Gaps = 18/355 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           +++ PL    +      +V+I+G   ++DCG    +ND     D S L         ID 
Sbjct: 1   MKIIPLGAGQDVGRSCIIVNIEGRTIMLDCGMHMGYNDQRRFPDFSALSKTGDFNKLIDC 60

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           +++SH    H GALP+  +      P++ T+P   +  + + D + +S  + S+   F+ 
Sbjct: 61  IIISHFHLDHTGALPFFTEICKYDGPIYMTKPTKAVIPILLEDFRKISAPKSSDGKFFSY 120

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
            DI +  + +  + +++ Y      E   + P+ AGH++G  ++ +      V+Y  DYN
Sbjct: 121 QDIQNCLKKIITINFNETYK---HDENFFITPYYAGHVIGAAMFHVQVGSRSVVYTGDYN 177

Query: 177 RRKEKHLNGTVLESFVRPAVLITDA-YNALHNQPPRQQREMFQDAISKTLRAGGNVLLPV 235
              ++HL    +   +RP +LIT++ Y ++     + +   F  A+   +  GG VL+P+
Sbjct: 178 MTPDRHLGAASIPC-LRPDLLITESTYGSITRDCRKSKEREFFKAVLDCVSNGGKVLIPI 236

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL L+L+ +W    L  PIYF + ++    +  K FL +  ++I K+      
Sbjct: 237 FALGRAQELCLLLDSHWERMQLKVPIYFSSGLTEKANNIYKQFLSYTNETIKKN--AFNH 294

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
           N F  KH T    K  LD   + P ++ AS   L +G S  +F EW +D KNLV+
Sbjct: 295 NVFDFKHTTTF-QKHFLD--LNIPMVLFASPGMLHSGMSLKVFKEWCTDPKNLVI 346


>gi|341890123|gb|EGT46058.1| hypothetical protein CAEBREN_05882 [Caenorhabditis brenneri]
          Length = 618

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 95/366 (25%), Positives = 175/366 (47%), Gaps = 17/366 (4%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           +++ PL    +      L++I G N ++DCG +  +       D S +    ++   +D 
Sbjct: 8   IKIVPLGAGQDVGRSCILITIGGKNVMVDCGMHMGYQDDRRFPDFSYIGGGGRLTDYLDC 67

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVS-EFDLFTL 116
           V++SH    H G+LP+  + +G   P++ T P   +  + + D    +  +  E + FT 
Sbjct: 68  VIISHFHLDHCGSLPHMSEIVGYDGPIYMTYPTKAIAQVLLEDYRKVQCDIKGETNFFTS 127

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
           DDI +  +        +   +  +   + +    AGH+LG  +++I      V+Y  DYN
Sbjct: 128 DDIKNCMKKCIGCALHEVIQVDDQ---LSIRAFYAGHVLGAAMFEIRVGDHSVLYTGDYN 184

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    +   VRP VLI+++  A   +  ++ RE  F   + +++  GG V++PV
Sbjct: 185 MTPDRHLGAARVLPGVRPTVLISESTYATTIRDSKRARERDFLRKVHESVMKGGKVIIPV 244

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L  PIYF   ++     Y + F+ W  ++I K+F     
Sbjct: 245 FALGRAQELCILLESYWERMALTVPIYFSQGLAERANQYYRLFISWTNENIKKTF--VER 302

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+  +    E  + P GP+++ ++   L  G S  +F +W SD  N+++     
Sbjct: 303 NMFEFKHIRPMEKGCE--DMP-GPQVLFSTPGMLHGGQSLKVFKKWCSDPINMIIMPGYC 359

Query: 356 QFGTLA 361
             GT+ 
Sbjct: 360 VAGTVG 365


>gi|196007172|ref|XP_002113452.1| hypothetical protein TRIADDRAFT_57642 [Trichoplax adhaerens]
 gi|190583856|gb|EDV23926.1| hypothetical protein TRIADDRAFT_57642 [Trichoplax adhaerens]
          Length = 596

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 104/366 (28%), Positives = 172/366 (46%), Gaps = 18/366 (4%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           ++V PL    +      LV+I   N + DCG    +ND     D + +     +   +D 
Sbjct: 4   IKVVPLGAGQDVGRSCILVTIGCKNIMFDCGMHMGYNDDRRFPDFTYITRSGSLTQFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  +      P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMCKYDGPIYMTHPTKAICPILLEDYRKITVDRKGEKNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +    E   +  + AGH+LG  ++ +    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVKAINLHQTVKVDDDLE---IKAYYAGHVLGAAMFLVKVGCESVLYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + + +  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLTKVHECVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L  PIYF T ++     Y K F+ W    I ++F   + 
Sbjct: 240 FALGRAQELCILLETYWDRMNLKVPIYFSTGLTEKANHYYKLFITWTNQKIRRTF--VQH 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+    +++ +DN    P +V A+   L  G S  IF +WA D KN+V+     
Sbjct: 298 NMFEFKHIKPF-DRALIDNP--NPMVVFATPGMLHGGLSLQIFKKWAPDDKNMVILPGYC 354

Query: 356 QFGTLA 361
             GT+ 
Sbjct: 355 VAGTVG 360


>gi|170595519|ref|XP_001902415.1| RNA-metabolising metallo-beta-lactamase family protein [Brugia
           malayi]
 gi|158589929|gb|EDP28737.1| RNA-metabolising metallo-beta-lactamase family protein [Brugia
           malayi]
          Length = 589

 Score =  150 bits (379), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 96/358 (26%), Positives = 175/358 (48%), Gaps = 23/358 (6%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++V PL    +      LVSI G N ++DCG +  +       D S +     +   +D 
Sbjct: 4   IKVVPLGAGRDVGRSCILVSIGGRNVMLDCGMHMGYSDERRFPDFSFINGGGSLTEFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEF----DL 113
           V+++H    H G+LP+  + +G   P++ T P   +  + + D    R+  +EF    + 
Sbjct: 64  VIITHFHLDHCGSLPHMSEVVGYDGPIYMTYPTKAIAPVLLEDY---RKVQTEFKGDKNF 120

Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAV 173
           FT   I +  + V  +   +   +  +   + +    AGH+LG  +++I    E V+Y  
Sbjct: 121 FTSQMIKNCMKKVIAINIHEKIDVDNE---LSIRAFYAGHVLGAAMFQIMVGSESVLYTG 177

Query: 174 DYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVL 232
           D+N   ++HL    +E  ++P +LI+++  A   +  ++ RE  F   +  T+  GG VL
Sbjct: 178 DFNTTPDRHLGAARVEPGLKPDLLISESTYATTIRDSKRARERDFLKKVHDTVSNGGKVL 237

Query: 233 LPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET 292
           +PV + GR  EL ++LE YW   +L YPI+F   ++     Y + F+ W  + I ++F  
Sbjct: 238 IPVFALGRAQELCILLESYWERMNLKYPIFFSQGLAEKANQYYRLFISWTNEKIKRTF-- 295

Query: 293 SRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
              N F  KH+     +S +++   GP ++ ++   L  G S  +F +W SD KNL++
Sbjct: 296 VERNMFDFKHIRPF-EQSYIESP--GPMVLFSTPGMLHGGQSLRVFTKWCSDEKNLII 350


>gi|51467896|ref|NP_001003836.1| cleavage and polyadenylation specificity factor subunit 3 [Danio
           rerio]
 gi|49619053|gb|AAT68111.1| cleavage and polyadenylation specificity factor 3 [Danio rerio]
          Length = 690

 Score =  150 bits (379), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 100/371 (26%), Positives = 189/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 36  ILEFKGRKIMVDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 95

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR  L      Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 96  FKGRTFMTHATKAIYRWLL----SDYVKVSNISADDMLYTETDLEESMDKIETI----NF 147

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + S V+P 
Sbjct: 148 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPS-VKPD 206

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LIT++    H    R++RE  F + +   +   G  L+PV + GR  ELLLIL++YW  
Sbjct: 207 ILITESTYGTHIHEKREEREARFCNTVHDIVNREGRCLIPVFALGRAQELLLILDEYWQN 266

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K+     +N F+ KH++ L +    
Sbjct: 267 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKAINI--NNPFVFKHISNLKSMDHF 324

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 325 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 380

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 381 ITTMSGQKLPL 391


>gi|341903207|gb|EGT59142.1| hypothetical protein CAEBREN_31222 [Caenorhabditis brenneri]
          Length = 571

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 95/366 (25%), Positives = 175/366 (47%), Gaps = 17/366 (4%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           +++ PL    +      L++I G N ++DCG +  +       D S +    ++   +D 
Sbjct: 11  LKIVPLGAGQDVGRSCILITIGGKNVMVDCGMHMGYQDDRRFPDFSYIGGGGRLTDYLDC 70

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVS-EFDLFTL 116
           V++SH    H G+LP+  + +G   P++ T P   +  + + D    +  +  E + FT 
Sbjct: 71  VIISHFHLDHCGSLPHMSEIVGYDGPIYMTYPTKAIAQVLLEDYRKVQCDIKGETNFFTS 130

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
           DDI +  +        +   +  +   + +    AGH+LG  +++I      V+Y  DYN
Sbjct: 131 DDIKNCMKKCIGCALHEVIQVDDQ---LSIRAFYAGHVLGAAMFEIRVGDHSVLYTGDYN 187

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    +   VRP VLI+++  A   +  ++ RE  F   + +++  GG V++PV
Sbjct: 188 MTPDRHLGAARVLPGVRPTVLISESTYATTIRDSKRARERDFLRKVHESVMKGGKVIIPV 247

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW   +L  PIYF   ++     Y + F+ W  ++I K+F     
Sbjct: 248 FALGRAQELCILLESYWERMALTVPIYFSQGLAERANQYYRLFISWTNENIKKTF--VER 305

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+  +    E  + P GP+++ ++   L  G S  +F +W SD  N+++     
Sbjct: 306 NMFEFKHIRPMEKGCE--DMP-GPQVLFSTPGMLHGGQSLKVFKKWCSDPINMIIMPGYC 362

Query: 356 QFGTLA 361
             GT+ 
Sbjct: 363 VAGTVG 368


>gi|397639513|gb|EJK73612.1| hypothetical protein THAOC_04754 [Thalassiosira oceanica]
          Length = 454

 Score =  150 bits (379), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 118/400 (29%), Positives = 189/400 (47%), Gaps = 24/400 (6%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAV 58
           M  ++Q+TPL          +L++  G   L+DCG +  +D     P        ++D +
Sbjct: 1   MEDTMQITPLGSGQEVGRSCHLLTFRGTTVLLDCGIHPGYDGMAGLPFFDRVDPESVDVL 60

Query: 59  LLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLSRRQVS------ 109
           L++H    H  +LPY  ++ G    VF T P   V RL LL  Y + ++ +  S      
Sbjct: 61  LVTHFHLDHAASLPYFTERTGFRGRVFMTHPTKAVIRL-LLGDYLRLMAVKHGSSGGELN 119

Query: 110 -EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
            E  L+T  ++ S    +  + Y Q   L+    G+      AGH+LG  ++ I   G  
Sbjct: 120 PEDVLYTEAELQSCVDKIELIDYHQTIDLN-LPSGLKFHALNAGHVLGAAMFYIEIGGRS 178

Query: 169 VIYAVDYNRRKEKHLNGTVLESF-VRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLR 226
           V+Y  DY+  +++HL    L  +   P VLI ++   +   P R +RE  F   I + + 
Sbjct: 179 VLYTGDYSMEEDRHLMAAELPRYHASPDVLIVESTYGVQVHPTRAEREARFTGTIERIVT 238

Query: 227 AGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGD 284
            GG  L+PV + GR  ELLLIL++YW EH    + P+Y+ + ++S  +   +++   M  
Sbjct: 239 GGGRCLIPVFALGRAQELLLILDEYWQEHPHLQSVPVYYASKMASRALRVYQTYANMMNA 298

Query: 285 SITKSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWAS 343
            I    +    N F  +H+  L    +++N  D GP +V AS   L++G S  +F  WA+
Sbjct: 299 RIRTQMDLG--NPFSFRHIRNL-KSIDVNNFDDRGPSVVFASPGMLQSGVSRQLFDRWAT 355

Query: 344 DVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
           D KN VL        TLA+ + +   PK V     RR PL
Sbjct: 356 DPKNGVLIAGYAVEHTLAKEIMSQ--PKEVVTMEGRRQPL 393


>gi|410928245|ref|XP_003977511.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Takifugu rubripes]
          Length = 696

 Score =  150 bits (378), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 97/356 (27%), Positives = 183/356 (51%), Gaps = 22/356 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 36  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 95

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ + + +  +    N+
Sbjct: 96  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMEKIETI----NF 147

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + S V+P 
Sbjct: 148 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPS-VKPD 206

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LIT++    H    R++RE  F + +   +   G  L+PV + GR  ELLLIL++YW  
Sbjct: 207 ILITESTYGTHIHEKREEREARFCNTVHDIVNREGRCLIPVFALGRAQELLLILDEYWQN 266

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K+     +N F+ KH++ L +    
Sbjct: 267 HPELHDIPIYYASSLARKCMAVYQTYINAMNDKIRKAINI--NNPFVFKHISNLKSMDHF 324

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++P
Sbjct: 325 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP 378


>gi|89267474|emb|CAJ83498.1| cleavage and polyadenylation specific factor 3 [Xenopus (Silurana)
           tropicalis]
          Length = 692

 Score =  150 bits (378), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 102/372 (27%), Positives = 190/372 (51%), Gaps = 26/372 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 36  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 95

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 96  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 147

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + V+P 
Sbjct: 148 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-VKPD 206

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 207 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRALIPVFALGRAQELLLILDEYWQN 266

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 267 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 324

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++P   A
Sbjct: 325 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEIA 382

Query: 373 VKVTMS-RRVPL 383
              TMS +++PL
Sbjct: 383 ---TMSGQKLPL 391


>gi|392575747|gb|EIW68879.1| hypothetical protein TREMEDRAFT_44189 [Tremella mesenterica DSM
           1558]
          Length = 738

 Score =  150 bits (378), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 102/326 (31%), Positives = 170/326 (52%), Gaps = 18/326 (5%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGL---SAPVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+DA+L++H    H  ALPY M++      +  V+ T     +  LTM D      Q +
Sbjct: 75  STVDAILITHFHVDHAAALPYIMERTNFKDGAGKVYMTHATKAIYGLTMMDAVRISDQNA 134

Query: 110 EF--DLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGE 167
           +    L+T  D+ S++Q+   + Y Q+  +SG   G+   P+ AGH+LG +++ I   G 
Sbjct: 135 DNAGRLYTEADVQSSWQNTIAVDYHQDIVVSG---GLRFTPYHAGHVLGASMFMIEIAGL 191

Query: 168 DVIYAVDYNRRKEKHLNGTVLESF--VRPAVLITDAYNALHNQPPRQQRE-MFQDAISKT 224
            ++Y  DY+R +++HL   V+     V+P V+I ++   +H  P R+++E  F   +S  
Sbjct: 192 KILYTGDYSREEDRHL---VIAEVPPVKPDVMICESTFGVHTLPDRKEKEEQFTTLVSNI 248

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWM 282
           ++ GG  L+P+ S G   EL L+L++YW +H    N PI+F + +    +   K+++  M
Sbjct: 249 VKRGGRCLMPIPSFGNGQELALLLDEYWHDHPELQNIPIFFASGLFQRGMRVYKTYVHTM 308

Query: 283 GDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWA 342
             +I   F   RDN F  K+V  L +    DN    P +V+AS   +  G S ++  +WA
Sbjct: 309 NANIRSRF-ARRDNPFDFKYVKPLKDGRRGDNF-KSPCVVMASAQFMSFGLSRELLEDWA 366

Query: 343 SDVKNLVLFTERGQFGTLARMLQADP 368
              KN V+ T     GT+AR L  +P
Sbjct: 367 PGEKNGVIVTGYSIEGTMARTLLGEP 392


>gi|55741994|ref|NP_001006770.1| cleavage and polyadenylation specificity factor 3 [Xenopus
           (Silurana) tropicalis]
 gi|49522504|gb|AAH75564.1| cleavage and polyadenylation specific factor 3, 73kDa [Xenopus
           (Silurana) tropicalis]
          Length = 692

 Score =  150 bits (378), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 102/372 (27%), Positives = 190/372 (51%), Gaps = 26/372 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 36  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 95

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 96  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 147

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + V+P 
Sbjct: 148 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-VKPD 206

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 207 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRALIPVFALGRAQELLLILDEYWQN 266

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 267 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 324

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++P   A
Sbjct: 325 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEIA 382

Query: 373 VKVTMS-RRVPL 383
              TMS +++PL
Sbjct: 383 ---TMSGQKLPL 391


>gi|24648013|ref|NP_650738.1| cleavage and polyadenylation specificity factor 73 [Drosophila
           melanogaster]
 gi|21430620|gb|AAM50988.1| RE31408p [Drosophila melanogaster]
 gi|23171662|gb|AAF55578.2| cleavage and polyadenylation specificity factor 73 [Drosophila
           melanogaster]
 gi|220948314|gb|ACL86700.1| CG7698-PA [synthetic construct]
          Length = 684

 Score =  150 bits (378), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 103/391 (26%), Positives = 198/391 (50%), Gaps = 30/391 (7%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV----ASTIDAVLL 60
           +Q+ PL           ++   G   ++DCG   H   S +  L  V    A  ID + +
Sbjct: 18  LQIKPLGAGQEVGRSCIMLEFKGKKIMLDCGI--HPGLSGMDALPYVDLIEADEIDLLFI 75

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVS-EFDLFTL 116
           SH    H GALP+ + +       F   +T+ +YR     M   Y+    +S E  L+T 
Sbjct: 76  SHFHLDHCGALPWFLMKTSFKGRCFMTHATKAIYRW----MLSDYIKISNISTEQMLYTE 131

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
            D++++ + +  +    N+H      G+    ++AGH+LG  ++ I   G  ++Y  D++
Sbjct: 132 ADLEASMEKIETI----NFHEERDVMGVRFCAYIAGHVLGAAMFMIEIAGIKILYTGDFS 187

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPV 235
           R++++HL    +   ++P VLIT++    H    R+ RE  F   + K ++ GG  L+PV
Sbjct: 188 RQEDRHLMAAEVPP-MKPDVLITESTYGTHIHEKREDRENRFTSLVQKIVQQGGRCLIPV 246

Query: 236 DSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
            + GR  ELLLIL+++W+++      PIY+ + ++   +   ++++  M D I +  + +
Sbjct: 247 FALGRAQELLLILDEFWSQNPDLHEIPIYYASSLAKKCMAVYQTYINAMNDRIRR--QIA 304

Query: 294 RDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFT 352
            +N F+ +H++   N   +D+  D GP +++AS   +++G S ++F  W +D KN V+  
Sbjct: 305 VNNPFVFRHIS---NLKGIDHFEDIGPCVIMASPGMMQSGLSRELFESWCTDPKNGVIIA 361

Query: 353 ERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
                GTLA+ + ++  P+ +     +++PL
Sbjct: 362 GYCVEGTLAKAVLSE--PEEITTLSGQKLPL 390


>gi|195343244|ref|XP_002038208.1| GM18692 [Drosophila sechellia]
 gi|194133058|gb|EDW54626.1| GM18692 [Drosophila sechellia]
          Length = 684

 Score =  150 bits (378), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 103/391 (26%), Positives = 198/391 (50%), Gaps = 30/391 (7%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV----ASTIDAVLL 60
           +Q+ PL           ++   G   ++DCG   H   S +  L  V    A  ID + +
Sbjct: 18  LQIKPLGAGQEVGRSCIMLEFKGKKIMLDCGI--HPGLSGMDALPYVDLIEADEIDLLFI 75

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVS-EFDLFTL 116
           SH    H GALP+ + +       F   +T+ +YR     M   Y+    +S E  L+T 
Sbjct: 76  SHFHLDHCGALPWFLMKTSFKGRCFMTHATKAIYRW----MLSDYIKISNISTEQMLYTE 131

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
            D++++ + +  +    N+H      G+    ++AGH+LG  ++ I   G  ++Y  D++
Sbjct: 132 ADLEASMEKIETI----NFHEERDVMGVRFCAYIAGHVLGAAMFMIEIAGIKILYTGDFS 187

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPV 235
           R++++HL    +   ++P VLIT++    H    R+ RE  F   + K ++ GG  L+PV
Sbjct: 188 RQEDRHLMAAEVPP-MKPDVLITESTYGTHIHEKREDRENRFTSLVQKIVQQGGRCLIPV 246

Query: 236 DSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
            + GR  ELLLIL+++W+++      PIY+ + ++   +   ++++  M D I +  + +
Sbjct: 247 FALGRAQELLLILDEFWSQNPDLHEIPIYYASSLAKKCMAVYQTYINAMNDRIRR--QIA 304

Query: 294 RDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFT 352
            +N F+ +H++   N   +D+  D GP +++AS   +++G S ++F  W +D KN V+  
Sbjct: 305 VNNPFVFRHIS---NLKGIDHFEDIGPCVIMASPGMMQSGLSRELFESWCTDPKNGVIIA 361

Query: 353 ERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
                GTLA+ + ++  P+ +     +++PL
Sbjct: 362 GYCVEGTLAKTVLSE--PEEITTLSGQKLPL 390


>gi|195569857|ref|XP_002102925.1| GD20157 [Drosophila simulans]
 gi|194198852|gb|EDX12428.1| GD20157 [Drosophila simulans]
          Length = 684

 Score =  150 bits (378), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 103/391 (26%), Positives = 198/391 (50%), Gaps = 30/391 (7%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV----ASTIDAVLL 60
           +Q+ PL           ++   G   ++DCG   H   S +  L  V    A  ID + +
Sbjct: 18  LQIKPLGAGQEVGRSCIMLEFKGKKIMLDCGI--HPGLSGMDALPYVDLIEADEIDLLFI 75

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVS-EFDLFTL 116
           SH    H GALP+ + +       F   +T+ +YR     M   Y+    +S E  L+T 
Sbjct: 76  SHFHLDHCGALPWFLMKTSFKGRCFMTHATKAIYRW----MLSDYIKISNISTEQMLYTE 131

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
            D++++ + +  +    N+H      G+    ++AGH+LG  ++ I   G  ++Y  D++
Sbjct: 132 ADLEASMEKIETI----NFHEERDVMGVRFCAYIAGHVLGAAMFMIEIAGIKILYTGDFS 187

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPV 235
           R++++HL    +   ++P VLIT++    H    R+ RE  F   + K ++ GG  L+PV
Sbjct: 188 RQEDRHLMAAEVPP-MKPDVLITESTYGTHIHEKREDRENRFTSLVQKIVQQGGRCLIPV 246

Query: 236 DSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
            + GR  ELLLIL+++W+++      PIY+ + ++   +   ++++  M D I +  + +
Sbjct: 247 FALGRAQELLLILDEFWSQNPDLHEIPIYYASSLAKKCMAVYQTYINAMNDRIRR--QIA 304

Query: 294 RDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFT 352
            +N F+ +H++   N   +D+  D GP +++AS   +++G S ++F  W +D KN V+  
Sbjct: 305 VNNPFVFRHIS---NLKGIDHFEDIGPCVIMASPGMMQSGLSRELFESWCTDPKNGVIIA 361

Query: 353 ERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
                GTLA+ + ++  P+ +     +++PL
Sbjct: 362 GYCVEGTLAKTVLSE--PEEITTLSGQKLPL 390


>gi|168026077|ref|XP_001765559.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683197|gb|EDQ69609.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 682

 Score =  150 bits (378), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 114/398 (28%), Positives = 191/398 (47%), Gaps = 38/398 (9%)

Query: 2   GTSVQVTPLSGVFNENPLSYL-VSIDGFNFLIDCGWND---------HFDPSLLQPLSKV 51
           G  ++VTPL G  NE   S + ++  G   + DCG +          +FD   + P+S  
Sbjct: 15  GDKLEVTPL-GAGNEVGRSCVYMTYKGKTVMFDCGIHPGYSGMAALPYFDE--IDPIS-- 69

Query: 52  ASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQV 108
              ID +L++H    H  +LPY +++      VF   +T+ +Y+L    +   ++   +V
Sbjct: 70  ---IDVLLVTHFHLDHCASLPYFLEKTNFKGRVFMTHATKAIYKL----LLSDFVKISKV 122

Query: 109 SEFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGE 167
           S  D L+   DI    + +  + + Q   ++G    I    + AGH+LG  ++ +   G 
Sbjct: 123 SVDDMLYDEHDIARTMEKIEVIDFHQTMEVNG----IRFWCYTAGHVLGAAMFMVDIAGM 178

Query: 168 DVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRA 227
            V+Y  DY+  +++HL    +  F     +I   Y    +QP   +   F D +++T+  
Sbjct: 179 RVLYTGDYSCEEDRHLRAAEMPRFSPDVCIIESTYGVQIHQPRIMRERRFTDTVAQTVSQ 238

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG VL+P  + GR  ELLLIL++YW  H    + PIY+ + ++   +   ++++  M + 
Sbjct: 239 GGKVLIPAFALGRAQELLLILDEYWEAHPELQHIPIYYASPLAKKCMAVYQTYINAMNER 298

Query: 286 ITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDV 345
           I K FE S  N F  KH+  L N  E D+   GP +V+AS   L++G S  +F  W  D 
Sbjct: 299 IQKQFEVS--NPFDFKHIQPLKNIDEFDDI--GPAVVMASPGGLQSGLSRQLFDIWCQDK 354

Query: 346 KNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
           KN  +       GT A+ +  +  PK V +     VPL
Sbjct: 355 KNSCVIPGYVVEGTPAKAIMNE--PKEVTLLSGLVVPL 390


>gi|389638668|ref|XP_003716967.1| hypothetical protein MGG_06570 [Magnaporthe oryzae 70-15]
 gi|351642786|gb|EHA50648.1| hypothetical protein MGG_06570 [Magnaporthe oryzae 70-15]
 gi|440474177|gb|ELQ42934.1| cleavage and polyadenylation specificity factor subunit 2
           [Magnaporthe oryzae Y34]
 gi|440484966|gb|ELQ64966.1| cleavage and polyadenylation specificity factor subunit 2
           [Magnaporthe oryzae P131]
          Length = 962

 Score =  149 bits (377), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 157/618 (25%), Positives = 242/618 (39%), Gaps = 141/618 (22%)

Query: 8   TPLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDT 65
           +PL G  +E   S  L+ +DG    LID GW++ FD   L+ + K   T+  +LL+H   
Sbjct: 5   SPLQGALSEATASQSLLELDGGVKVLIDIGWDETFDVEKLKEVEKQVPTLSLILLTHATV 64

Query: 66  LHLGALPYAMKQLGLSA--PVFSTEPVYRLGLLTMYDQY--------------------- 102
            HL AL +  K   L A  P+++T+P   LG   + D Y                     
Sbjct: 65  PHLSALVHCCKNFPLFARIPIYATQPAIDLGRTLIQDLYSSTPAAATSIPDSALAEASYS 124

Query: 103 LSRRQVSEFDLF----TLDDIDSAFQSVTRLTYSQNYH-----LSGKGEGIVVAPHVAGH 153
            S+ Q +         + D+I   F  +  L YSQ +       S    G+ +  + AGH
Sbjct: 125 FSQTQTNGHGFLLQAPSPDEIAKYFSLIQPLKYSQPHQPLASPFSPPLNGLTITAYNAGH 184

Query: 154 LLGGTVWKITKDGEDVIYAVDYNRRKEK-------------HLNGTVLESFVRPAVLITD 200
            LGGT+W I    E ++YAVD+N  ++                   V+E   +P  L+  
Sbjct: 185 SLGGTIWHIQHGMESIVYAVDWNLARDNVYAGAAWMGGGHGGGGAEVIEQLRKPTALVCS 244

Query: 201 AYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS---- 256
              A        + +   D +   +  GG VL+PVDS+ RVLEL  +LE  W   +    
Sbjct: 245 TRTAEGGLTRAARDKQLLDTMRMAISRGGTVLIPVDSSARVLELAYLLEHAWRSEASTEG 304

Query: 257 ---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL-------------- 299
                  +Y       STI   KS  EWM +SI + FE   D  F               
Sbjct: 305 GGLSTAKLYLAGRSVHSTIKLAKSMFEWMDNSIVQEFEAGADQGFRRTNGAGGNADAKGK 364

Query: 300 ------LKHVTLLINKSE----LDNAPD--GPKLVLASMASLEAGFSHDIFVEWASDVKN 347
                  K++ LL  K++    L+ + D    K++LA+  SLE GFS DI    A+D +N
Sbjct: 365 DGGPFDFKYLRLLDRKAQVLKLLEPSTDELRGKVILATDTSLEWGFSKDIISAIANDSRN 424

Query: 348 LVLFTERGQFG-----TLARML-----------------------QADPPPKAVKVTMSR 379
           +V+  E+         +++R L                       Q     + +++  S+
Sbjct: 425 MVILPEKPAESSRDNPSISRQLWRWWKERRDGVADEQSSGAGSAEQVFAGGRELQIRESK 484

Query: 380 RVPLVGEELIAYEE---EQTRLKK------EEALKAS-----------------LVKEEE 413
           +VPL   EL  Y++    Q +L          AL+AS                    E++
Sbjct: 485 KVPLADSELSIYQQWLATQRQLNATVQGGGASALEASADVADDVSSESSSDSDDSENEQQ 544

Query: 414 SKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYE 473
            KA        S   +V+   +      + +P  G Y D  + G          MFP   
Sbjct: 545 GKALNASTTQASRKKVVLQDEDLGVMILLKKP--GVY-DFPVKG----KKGRERMFPLAV 597

Query: 474 NNSEWDDFGEVINPDDYI 491
                D+FGE+I P+DY+
Sbjct: 598 RRKRNDEFGELIRPEDYL 615



 Score = 42.7 bits (99), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 40/180 (22%), Positives = 76/180 (42%), Gaps = 47/180 (26%)

Query: 534 VLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKL 593
           +LV GSA+ TE +   C ++    V+TP +   +D + D  A+ V+L++ L+  + ++++
Sbjct: 740 ILVAGSADETEAVADDCRRNAI-EVFTPPVGAVVDASVDTNAWVVKLADPLVKRLKWQQV 798

Query: 594 GDYEIAWVDAEVGKT----ENGM------------------------------------- 612
               I  V A++  T    +NG+                                     
Sbjct: 799 RGLGIVTVTAQLTATPAAQKNGIPLLIADDDGANKRQKIKATGVDDQEPTAEDEDVGVMP 858

Query: 613 -LSLLPISTPAPPHKSVL---VGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKV 667
            L +LP++  +    +     VG+L++ADL+  + + G   +F G G L     V +RK 
Sbjct: 859 TLDVLPVAMVSASRSAAQVLHVGELRLADLRRTMQNLGHSADFRGEGTLLIDGTVVVRKT 918


>gi|345563625|gb|EGX46611.1| hypothetical protein AOL_s00097g515 [Arthrobotrys oligospora ATCC
           24927]
          Length = 791

 Score =  149 bits (377), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 105/371 (28%), Positives = 173/371 (46%), Gaps = 29/371 (7%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           ++V   G   ++D G +  +D     P       ST+D +L+SH    H G+LPY + + 
Sbjct: 37  HIVQYKGKTVMLDAGVHPAYDGISSLPFYDDFDLSTVDILLISHFHLDHAGSLPYVLTKT 96

Query: 79  GLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSE--FDLFTLDDIDSAFQSVTRLTYSQNYH 136
                VF T P   +    M D        SE    LF+  D  S+F  ++ + Y Q  H
Sbjct: 97  NFRGRVFMTHPTKAIYKWLMSDSVRVSNTTSEQTTQLFSETDHLSSFSQISAIDYYQTLH 156

Query: 137 LSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAV 196
            S     I + P+ AGH+LG  ++ I   G  +++  DY+R  ++HL    L   ++P +
Sbjct: 157 HSS----IAITPYPAGHVLGAAMFLIEIAGLKILFTGDYSREDDRHLVSASLPKHIKPDI 212

Query: 197 LITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEH 255
           LIT++     +  PR ++E  F   ++  L  GG VL+PV + GR  ELLLILE+YW  H
Sbjct: 213 LITESTYGTASHMPRPEKEARFISLVTSILDRGGRVLMPVFALGRAQELLLILEEYWEVH 272

Query: 256 S--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSR----------------DNA 297
                YPIY+ + ++   +   ++++  M D+I   F +                   N 
Sbjct: 273 ERYRQYPIYYASSLARRCMSVYQTYIHAMNDNIKALFRSKMAAIGEAAGKDGQVIGGTNP 332

Query: 298 FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQF 357
           F ++ V  L +    D+   G  ++LA+   ++ G S ++   W  D KN V+ T     
Sbjct: 333 FEMRWVRSLKSLDRFDDV--GGCVMLAAPGMMQNGVSRELLERWCPDPKNGVILTGYSVE 390

Query: 358 GTLARMLQADP 368
           GTLA+ +  +P
Sbjct: 391 GTLAKSILNEP 401


>gi|348518441|ref|XP_003446740.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Oreochromis niloticus]
          Length = 686

 Score =  149 bits (377), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 97/356 (27%), Positives = 182/356 (51%), Gaps = 22/356 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 36  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 95

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ + + +  +    N+
Sbjct: 96  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEDSMEKIETI----NF 147

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + S V+P 
Sbjct: 148 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPS-VKPD 206

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LIT++    H    R++RE  F + +   +   G  L+PV + GR  ELLLIL++YW  
Sbjct: 207 ILITESTYGTHIHEKREEREARFCNTVHDIVNREGRCLIPVFALGRAQELLLILDEYWQN 266

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K+     +N F+ KH++ L +    
Sbjct: 267 HPELHDIPIYYASSLARKCMAVYQTYINAMNDKIRKAINI--NNPFVFKHISNLKSMDHF 324

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ +  +P
Sbjct: 325 DDI--GPSVVMASPGMMQSGLSRELFESWCTDRRNGVIIAGYCVEGTLAKHIMTEP 378


>gi|388507878|gb|AFK42005.1| unknown [Medicago truncatula]
          Length = 534

 Score =  149 bits (377), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 107/360 (29%), Positives = 174/360 (48%), Gaps = 20/360 (5%)

Query: 22  LVSIDGFNFLIDCGWN-DHFDPSLLQPLSKVAST------IDAVLLSHPDTLHLGALPYA 74
           +V I+G   + DCG    H D S      K++ +      +D ++++H    H+GAL Y 
Sbjct: 20  IVKINGKRIMFDCGMRMRHTDHSRYPDFKKISDSGNFNDALDCIIITHFHLDHVGALAYF 79

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTM--YDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYS 132
            +  G S PV+ T P+  L  L +  Y + +  R+  E + FT D I    + V  +   
Sbjct: 80  TEVCGYSGPVYMTYPIKALSPLMLEDYRKVMVDRRGEE-EQFTSDHIAECMKKVIAVDLK 138

Query: 133 QNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFV 192
           Q   +    E + +  + AGH++G  ++ +     +++Y  DYN   ++HL    ++  +
Sbjct: 139 QTVQVD---EDLQIRAYYAGHVIGAAMFYVKVGDAEMVYTGDYNMTPDRHLGAAQIDR-L 194

Query: 193 RPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDY 251
           R  +LIT++  A   +  +  RE  F  A+ K +  GG VL+P  + GR  EL ++L+DY
Sbjct: 195 RLDLLITESTYATTIRDSKYAREREFLKAVHKCVSGGGKVLIPTFALGRAQELRILLDDY 254

Query: 252 WAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSE 311
           W   +L  PIYF + ++     Y K  + W    I  ++ T   NAF  K+V     +S 
Sbjct: 255 WERMNLKVPIYFSSGLTIQANTYHKMLIGWTSQKIKDTYSTH--NAFDFKNVHKF-ERSM 311

Query: 312 LDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPK 371
           LD AP GP ++ A+   L  GFS ++F  WA   KNLV        GT+   L +  P K
Sbjct: 312 LD-AP-GPCVLFATPGMLIGGFSLEVFKHWAPSEKNLVALPGYCMAGTVGHRLTSGKPTK 369


>gi|195497711|ref|XP_002096215.1| GE25184 [Drosophila yakuba]
 gi|194182316|gb|EDW95927.1| GE25184 [Drosophila yakuba]
          Length = 684

 Score =  149 bits (377), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 103/391 (26%), Positives = 198/391 (50%), Gaps = 30/391 (7%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV----ASTIDAVLL 60
           +Q+ PL           ++   G   ++DCG   H   S +  L  V    A  ID + +
Sbjct: 18  LQIKPLGAGQEVGRSCIMLEFKGKKIMLDCGI--HPGLSGMDALPYVDLIEADEIDLLFI 75

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVS-EFDLFTL 116
           SH    H GALP+ + +       F   +T+ +YR     M   Y+    +S E  L+T 
Sbjct: 76  SHFHLDHCGALPWFLMKTSFKGRCFMTHATKAIYRW----MLSDYIKISNISTEQMLYTE 131

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
            D++++ + +  +    N+H      G+    ++AGH+LG  ++ I   G  ++Y  D++
Sbjct: 132 ADLEASMEKIETI----NFHEERDVMGVRFCAYIAGHVLGAAMFMIEIAGIKILYTGDFS 187

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPV 235
           R++++HL    +   ++P VLIT++    H    R+ RE  F   + K ++ GG  L+PV
Sbjct: 188 RQEDRHLMAAEVPP-MKPDVLITESTYGTHIHEKREDRENRFTSLVQKIVQQGGRCLIPV 246

Query: 236 DSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
            + GR  ELLLIL+++W+++      PIY+ + ++   +   ++++  M D I +  + +
Sbjct: 247 FALGRAQELLLILDEFWSQNPDLHEIPIYYASSLAKKCMAVYQTYINAMNDRIRR--QIA 304

Query: 294 RDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFT 352
            +N F+ +H++   N   +D+  D GP +++AS   +++G S ++F  W +D KN V+  
Sbjct: 305 VNNPFVFRHIS---NLKGIDHFEDIGPCVIMASPGMMQSGLSRELFESWCTDPKNGVIIA 361

Query: 353 ERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
                GTLA+ + ++  P+ +     +++PL
Sbjct: 362 GYCVEGTLAKAVLSE--PEEITTLSGQKLPL 390


>gi|55250298|gb|AAH85402.1| Cleavage and polyadenylation specific factor 3 [Danio rerio]
 gi|182889046|gb|AAI64567.1| Cpsf3 protein [Danio rerio]
          Length = 690

 Score =  149 bits (377), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 100/371 (26%), Positives = 189/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 36  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 95

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR  L      Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 96  FKGRTFMTHATKAIYRWLL----SDYVKVSNISADDMLYTETDLEESMDKIETI----NF 147

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + S V+P 
Sbjct: 148 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPS-VKPD 206

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LIT++    H    R++RE  F + +   +   G  L+PV + GR  ELLLIL++YW  
Sbjct: 207 ILITESTYGTHIHEKREEREARFCNTVHDIVNREGRCLIPVFALGRAQELLLILDEYWQN 266

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K+     +N F+ KH++ L +    
Sbjct: 267 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKAINI--NNPFVFKHISNLKSMDHF 324

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 325 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 380

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 381 ITTMSGQKLPL 391


>gi|194900154|ref|XP_001979622.1| GG16362 [Drosophila erecta]
 gi|190651325|gb|EDV48580.1| GG16362 [Drosophila erecta]
          Length = 684

 Score =  149 bits (377), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 103/391 (26%), Positives = 198/391 (50%), Gaps = 30/391 (7%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV----ASTIDAVLL 60
           +Q+ PL           ++   G   ++DCG   H   S +  L  V    A  ID + +
Sbjct: 18  LQIKPLGAGQEVGRSCIMLEFKGKKIMLDCGI--HPGLSGMDALPYVDLIEADEIDLLFI 75

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVS-EFDLFTL 116
           SH    H GALP+ + +       F   +T+ +YR     M   Y+    +S E  L+T 
Sbjct: 76  SHFHLDHCGALPWFLMKTSFKGRCFMTHATKAIYRW----MLSDYIKISNISTEQMLYTE 131

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
            D++++ + +  +    N+H      G+    ++AGH+LG  ++ I   G  ++Y  D++
Sbjct: 132 ADLEASMEKIETI----NFHEERDVMGVRFCAYIAGHVLGAAMFMIEIAGIKILYTGDFS 187

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPV 235
           R++++HL    +   ++P VLIT++    H    R+ RE  F   + K ++ GG  L+PV
Sbjct: 188 RQEDRHLMAAEVPP-MKPDVLITESTYGTHVHEKREDRENRFTSLVQKIVQQGGRCLIPV 246

Query: 236 DSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
            + GR  ELLLIL+++W+++      PIY+ + ++   +   ++++  M D I +  + +
Sbjct: 247 FALGRAQELLLILDEFWSQNPDLHEIPIYYASSLAKKCMAVYQTYINAMNDRIRR--QIA 304

Query: 294 RDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFT 352
            +N F+ +H++   N   +D+  D GP +++AS   +++G S ++F  W +D KN V+  
Sbjct: 305 VNNPFVFRHIS---NLKGIDHFEDIGPCVIMASPGMMQSGLSRELFESWCTDPKNGVIIA 361

Query: 353 ERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
                GTLA+ + ++  P+ +     +++PL
Sbjct: 362 GYCVEGTLAKAVLSE--PEEITTLSGQKLPL 390


>gi|392862603|gb|EAS36741.2| cleavage and polyadenylylation specificity factor [Coccidioides
           immitis RS]
          Length = 1026

 Score =  149 bits (377), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 123/441 (27%), Positives = 182/441 (41%), Gaps = 114/441 (25%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSA--PV 84
           G   LID GW++ FDPS L+ L K   T+  +LL+H    H+GA  Y  K   L A  PV
Sbjct: 27  GVKILIDVGWDETFDPSALKELEKHIPTLSLILLTHATPSHIGAFVYCCKTFPLFAQIPV 86

Query: 85  FSTEPVYRLGLLTMYDQYLSRRQVSEF-------------------------------DL 113
           ++T PV   G   + D Y S    S F                               D 
Sbjct: 87  YATYPVISFGRSLLQDLYSSAPLASTFLPTTSSISDSNGSGSVPTQDPTAPAGALTEGDT 146

Query: 114 F-------------TLDDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGHLL 155
                         T +DI   F  +  L YSQ +            G+ +  + AGH +
Sbjct: 147 LNSTTAGKILLPSPTSEDIARHFSLIHPLKYSQPHQPLPSPFSPPLNGLTITAYNAGHTV 206

Query: 156 GGTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAYN 203
           GGT+W I    E ++YAVD+N+ +E  + G             V+E   +P  L+  A  
Sbjct: 207 GGTIWHIQHGMESIVYAVDWNQARENVIAGAAWFGSSGANRTDVIEQLRKPTALVCSAKG 266

Query: 204 ALHNQP--PRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW-------- 252
                P   R++R ++  D I   +   G VLLP D++ RVLEL  +LE  W        
Sbjct: 267 GDKFAPGGGRKKRDDLLLDMIRSCIARKGTVLLPTDTSARVLELAYVLEHAWREAADGPD 326

Query: 253 AEHSL-NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE-------------------- 291
            E+SL N  +Y        T+   +S LEWM +SI + FE                    
Sbjct: 327 GENSLKNANLYLAGKKVHGTMRLARSMLEWMDESIVREFEGGDGGESLGAGRSSGAASGQ 386

Query: 292 ---------TSRDNA--------FLLKHVTLLINKSELDN--APDGPKLVLASMASLEAG 332
                    + + +A        F  +H+ ++  K++L+N    +GPK+++AS  SL+ G
Sbjct: 387 QSKGTPGQTSDKKSAGPHKGLGPFTFRHLKIIERKTKLENILRSEGPKVIIASDTSLDWG 446

Query: 333 FSHDIFVEWASDVKNLVLFTE 353
           FS +I    A   +NLV+ TE
Sbjct: 447 FSKEILRHVAQGAENLVILTE 467


>gi|194743214|ref|XP_001954095.1| GF18101 [Drosophila ananassae]
 gi|190627132|gb|EDV42656.1| GF18101 [Drosophila ananassae]
          Length = 684

 Score =  149 bits (377), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 104/391 (26%), Positives = 198/391 (50%), Gaps = 30/391 (7%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV----ASTIDAVLL 60
           +Q+ PL           ++   G   ++DCG   H   S +  L  V    A  ID + +
Sbjct: 18  LQIKPLGAGQEVGRSCIMLEFKGKKIMLDCGI--HPGLSGMDALPYVDLIEADEIDLLFI 75

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVS-EFDLFTL 116
           SH    H GALP+ + +       F   +T+ +YR     M   Y+    +S E  L+T 
Sbjct: 76  SHFHLDHCGALPWFLMKTSFKGRCFMTHATKAIYRW----MLSDYIKISNISTEQMLYTD 131

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
            D++++ + +  +    N+H      G+    + AGH+LG  ++ I   G  ++Y  D++
Sbjct: 132 ADLEASMEKIETI----NFHEERDVMGVRFCAYNAGHVLGAAMFMIEIAGIKILYTGDFS 187

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPV 235
           R++++HL    +   ++P VLIT++    H    R+ RE  F   + KT++ GG  L+PV
Sbjct: 188 RQEDRHLMAAEVPP-MKPDVLITESTYGTHIHEKREDRENRFTSLVQKTVQQGGRCLIPV 246

Query: 236 DSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
            + GR  ELLLIL+++W+++      PIY+ + ++   +   ++++  M D I +  + +
Sbjct: 247 FALGRAQELLLILDEFWSQNPDLHEIPIYYASSLAKKCMAVYQTYINAMNDRIRR--QIA 304

Query: 294 RDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFT 352
            +N F+ +H++   N   +D+  D GP +++AS   +++G S ++F  W +D KN V+  
Sbjct: 305 VNNPFVFRHIS---NLKGIDHFEDIGPCVIMASPGMMQSGLSRELFESWCTDPKNGVIIA 361

Query: 353 ERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
                GTLA+ + ++  P+ +     +++PL
Sbjct: 362 GYCVEGTLAKTILSE--PEEITTLSGQKLPL 390


>gi|226497180|ref|NP_001146407.1| uncharacterized protein LOC100279987 [Zea mays]
 gi|219887045|gb|ACL53897.1| unknown [Zea mays]
 gi|414873991|tpg|DAA52548.1| TPA: hypothetical protein ZEAMMB73_264007 [Zea mays]
          Length = 697

 Score =  149 bits (377), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 107/386 (27%), Positives = 179/386 (46%), Gaps = 42/386 (10%)

Query: 2   GTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCG------------WNDHFDPSLLQPLS 49
           G  + VTPL            ++  G   L DCG            + D  DPS      
Sbjct: 25  GDQMVVTPLGAGSEVGRSCVHMTFKGRTVLFDCGIHPAYSGMAALPYFDEIDPS------ 78

Query: 50  KVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRR 106
                ID +L++H    H  +LPY +++      VF   +T+ +Y+L    +   Y+   
Sbjct: 79  ----AIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKL----LLSDYVKVS 130

Query: 107 QVSEFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKD 165
           +VS  D L+   DI  +   +  + + Q   ++G    I    + AGH+LG  ++ +   
Sbjct: 131 KVSVEDMLYDESDIARSMDKIEVIDFHQTLEVNG----IRFWCYTAGHVLGAAMFMVDIA 186

Query: 166 GEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTL 225
           G  ++Y  DY+R +++HL    L  F     +I   Y    +QP   + + F + I  T+
Sbjct: 187 GVRILYTGDYSREEDRHLRAAELPQFSPDICIIESTYGVQQHQPRIIREKRFTEVIHNTV 246

Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMG 283
             GG VL+P  + GR  ELLLIL++YW++H      PIY+ + ++   +   ++++  M 
Sbjct: 247 SQGGRVLIPAFALGRAQELLLILDEYWSKHPELHKIPIYYASPLAKRCMAVYQTYINSMN 306

Query: 284 DSITKSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWA 342
           + I   F  S  N F  KH+  L   + +DN  D GP +V+AS   L++G S  +F +W 
Sbjct: 307 ERIRNQFAQS--NPFHFKHIESL---NSIDNFHDVGPSVVMASPGGLQSGLSRQLFDKWC 361

Query: 343 SDVKNLVLFTERGQFGTLARMLQADP 368
           +D KN  +       GTLA+ +  +P
Sbjct: 362 TDKKNACVIPGYVVEGTLAKTIINEP 387


>gi|403337788|gb|EJY68117.1| Integrator complex subunit 11 [Oxytricha trifallax]
          Length = 771

 Score =  149 bits (376), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 115/392 (29%), Positives = 173/392 (44%), Gaps = 45/392 (11%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGW---------NDHF-DPSLLQPLSKVAST 54
           ++V PL    +      +V + G   + DCG          + HF   S  QPL    + 
Sbjct: 3   IKVIPLGAGQDVGRSCVIVELGGRRLMFDCGIHMVNQQQFPDFHFLQGSQQQPLD-FTNH 61

Query: 55  IDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDL- 113
           ID VL++H    H GAL Y  + +G   P+ +T P   +  L + D     R+VS     
Sbjct: 62  IDCVLITHFHLDHCGALTYFTEGVGYHGPILATPPTKAIIPLMLED----FRKVSSMQQG 117

Query: 114 --------------------FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGH 153
                               FT D I +    ++ +   +   + G    I V  + AGH
Sbjct: 118 QKGGGQGSGGNQNSMNQDTAFTSDMIKACIAKISTIQLHETQVIKG---DIKVTAYYAGH 174

Query: 154 LLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQ 213
           +LG  ++ +  +GE V+Y  DYN   ++HL    ++  +RP V IT+   A   +  ++ 
Sbjct: 175 VLGACMFYVECNGESVVYTGDYNMTADRHLGAAWIDK-LRPDVCITETTYATTIRDSKRS 233

Query: 214 REM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTI 272
           RE  F   + +TL  GG VL+PV + GR  EL ++LE YW   +L YPIYF   ++    
Sbjct: 234 REREFLKVVHETLDNGGKVLIPVFALGRAQELCVLLETYWNRTNLQYPIYFSGGLTEKAN 293

Query: 273 DYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAG 332
            Y K F+ W  + I K+F T   N F  +HV  L   S      D P +  AS   L  G
Sbjct: 294 FYYKLFINWTNEKIKKTF-TKNQNMFQFQHVKTLDTASI---KSDQPMVCFASPGMLHGG 349

Query: 333 FSHDIFVEWASDVKNLVLFTERGQFGTLARML 364
           +S  IF +WA   KN ++       GT+   L
Sbjct: 350 YSLQIFKDWAGQEKNTLIIPGYCMPGTVGNKL 381


>gi|322708414|gb|EFY99991.1| cleavage and polyadenylylation specificity factor, putative
           [Metarhizium anisopliae ARSEF 23]
          Length = 960

 Score =  149 bits (376), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 124/426 (29%), Positives = 182/426 (42%), Gaps = 80/426 (18%)

Query: 9   PLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTL 66
           PL G  +E+  S  L+ +DG    L+  GW++ FD   L+ L K   T+  +LL+H    
Sbjct: 6   PLQGALSESTASQSLLELDGGVKVLVGLGWDETFDLGKLEELEKQVPTLSLILLTHATAS 65

Query: 67  HLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLSR-------RQVSEFDLF--- 114
           HL A  +  K   L    P ++T PV  LG   + D Y S        RQ S  ++    
Sbjct: 66  HLAAYVHCCKNFPLFTRIPAYATRPVIDLGRSLIQDLYSSTPAASTTIRQTSLSEIAYAY 125

Query: 115 ---------------TLDDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGHL 154
                          T D I   F  +  L YSQ +            G+ +  + +GH 
Sbjct: 126 TQTAATAQNLLLQSPTPDQIARYFSLIQPLKYSQPHQPLPSPFSPPLNGLTITAYNSGHT 185

Query: 155 LGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAY 202
           LGGT+W I    E ++YAVD+N+ +E    G             V+E   +P  LI  + 
Sbjct: 186 LGGTIWHIQHGLESIVYAVDWNQARENVFAGAAWLGGAGGGGAEVIEQLRKPTALICSSR 245

Query: 203 NALHNQPP--RQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--- 256
            A  +     R +R E   + I   +  GG VL+PVDS+ RVLEL  +LE  W   +   
Sbjct: 246 GAQKSAQTAGRAKRDEQLLEMIKTCVTKGGTVLIPVDSSARVLELSYLLEHAWRADAASD 305

Query: 257 ----LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET--------------SRDNAF 298
                +  +Y      SST+ Y +S LEWM D+I + FE                    F
Sbjct: 306 NGVLTSAKLYLAGRNMSSTMRYARSMLEWMDDNIVQEFEAFAEGQRKANGAVEKKEGGPF 365

Query: 299 LLKHVTLLINKSELDNAPD----------GPKLVLASMASLEAGFSHDIFVEWASDVKNL 348
             K++ LL  K+++    D            +++LAS  S+E GFS D+    A D  NL
Sbjct: 366 DFKYLRLLERKAQVSKLLDQVASAQGEVAKGRVILASDTSMEWGFSKDVLKGLAKDPNNL 425

Query: 349 VLFTER 354
           V+ T+R
Sbjct: 426 VILTDR 431


>gi|320163324|gb|EFW40223.1| CPSF3 protein [Capsaspora owczarzaki ATCC 30864]
          Length = 802

 Score =  149 bits (376), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 106/392 (27%), Positives = 182/392 (46%), Gaps = 40/392 (10%)

Query: 7   VTPLSGVFNENPLSYLVSIDGFNFLIDCGWN------------DHFDPSLLQPLSKVAST 54
           +TPL          +++   G   + DCG +            D FDP L         +
Sbjct: 44  LTPLGAGQEVGRSCFVLQFKGKTIMFDCGLHPAYSGQAALPFFDSFDPGL--------DS 95

Query: 55  IDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLF 114
           ID +L++H        +PY M +      VF T P   +    + D        ++  LF
Sbjct: 96  IDVLLVTH------AGVPYIMTKTNFKGRVFMTHPTKAIYKWMVADFIRVSNVSADEMLF 149

Query: 115 TLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVD 174
              DID+    +  +    +YH   +  GI    + AGH+LG  ++ +   G  ++Y  D
Sbjct: 150 NERDIDNTMARIETI----DYHQEKEVNGIKFWCYNAGHVLGACMFMVEIAGVKLLYTGD 205

Query: 175 YNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLL 233
           Y+R +++HL    + + + P VL  ++   +    PR +RE  F   +   +  GG  LL
Sbjct: 206 YSRHEDRHLMPAEIPT-IAPDVLCVESTYGVRVHEPRVEREGRFTKDVHDIVMRGGKCLL 264

Query: 234 PVDSAGRVLELLLILEDYWAEHSL--NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE 291
           PV + GR  ELLLIL+++W       N PIY+ + ++   +   ++++  M + I + F 
Sbjct: 265 PVFALGRAQELLLILDEFWESKPALHNIPIYYASSLARKCMAIYQTYINQMNERIRRQFA 324

Query: 292 TSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
            S  N F+ KH+  + + SE+D +  GP +++AS   L+ G S D+F +W  D +N V+ 
Sbjct: 325 IS--NPFMFKHIASIKSASEIDQS--GPMVMMASPGMLQNGLSRDLFEQWCPDSRNGVIV 380

Query: 352 TERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
           T     GTLA+ + +   PK V     +++PL
Sbjct: 381 TGYSVEGTLAKSILS--APKEVPSLTGQKLPL 410


>gi|440632320|gb|ELR02239.1| hypothetical protein GMDG_05312 [Geomyces destructans 20631-21]
          Length = 988

 Score =  149 bits (376), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 123/410 (30%), Positives = 174/410 (42%), Gaps = 82/410 (20%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G   LID GW++ FD   L+ L K    I  VLL+H    HL A  +  K   L    P+
Sbjct: 26  GVKVLIDVGWDETFDVEKLRNLEKHVPAISIVLLTHATVGHLAAYAHCCKHFPLFTRIPI 85

Query: 85  FSTEPVYRLGLLTMYDQYLSRRQVS---------------------EFDLFTL------D 117
           ++T PV  LG   + D Y S    S                     E D   L      +
Sbjct: 86  YATTPVISLGRTLLQDLYASTPLASTIIPSSLLSETSYSYSKPGSGEDDSHILLQSPTHE 145

Query: 118 DIDSAFQSVTRLTYSQNYH-----LSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
           +I + F  +  L YSQ +       S    G+ +  + AGH LGGT+W I    E ++YA
Sbjct: 146 EIANYFSLIHPLKYSQPHQPLPSPFSQPLNGLTITAYNAGHTLGGTIWHIQHGLESIVYA 205

Query: 173 VDYNRRKEK------------HLNGTVLESFVRPAVLITDAYNA--LHNQPPRQQR-EMF 217
           VD+N+ +E                  V+E   +P  LI  +  A  +     R +R E  
Sbjct: 206 VDWNQARENILAGAAWLGGAGAGGAEVIEQLRKPTALICSSKGAERIALVGGRTKRDEAL 265

Query: 218 QDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS-------LNYPIYFLTYVSSS 270
            D I   +  GG VL+P DS+ RVLEL  +LE  W + +        N  +Y  +    +
Sbjct: 266 LDMIKSAIAKGGTVLIPTDSSARVLELAYLLEHAWRKDASNPESPFQNANLYLCSKNIGA 325

Query: 271 TIDYVKSFLEWMGDSITKSFET-----------------SRDNAFLLKHVTLLINKSEL- 312
           T+ Y +S LEWM D I + FE                  +    F  KH+ L+  K  + 
Sbjct: 326 TMRYTRSMLEWMDDGIIREFEAIAGGIDRQPNKPSEPRQAGAGPFDFKHLRLIEKKGGVS 385

Query: 313 -----DNAPDG---PKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
                D   DG    K++LAS  SL+ GFS DI    A+D +NLV+ TE+
Sbjct: 386 AVLNNDATKDGKPMAKVILASDRSLDWGFSKDILRNIAADSRNLVILTEK 435


>gi|170060909|ref|XP_001866010.1| cleavage and polyadenylation specificity factor [Culex
           quinquefasciatus]
 gi|167879247|gb|EDS42630.1| cleavage and polyadenylation specificity factor [Culex
           quinquefasciatus]
          Length = 688

 Score =  149 bits (375), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 97/357 (27%), Positives = 185/357 (51%), Gaps = 24/357 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +         P   +  A  +D + +SH    H GALP+ +++  
Sbjct: 36  MLEFKGKKIMLDCGIHPGLSGMDALPFVDLIEADEVDLLFISHFHLDHCGALPWFLQKTS 95

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVS-EFDLFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     M   Y+    +S E  L+T  D++++ + +  +    N+
Sbjct: 96  FKGRCFMTHATKAIYRW----MLSDYIKVSNISTEQMLYTEADLEASMEKIETI----NF 147

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H      G+    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 148 HEERDVMGVRFWAYNAGHVLGAAMFMIEIAGVKILYTGDFSRQEDRHLMAAEIPT-MKPD 206

Query: 196 VLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT++    H    R+ RE  F   + K ++ GG  L+PV + GR  ELLLIL++YW++
Sbjct: 207 VLITESTYGTHIHEKREDRESRFTSLVQKIVQQGGRCLIPVFALGRAQELLLILDEYWSQ 266

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           +      PIY+ + ++   +   ++++  M D I +  + + +N F+ +H++   N   +
Sbjct: 267 NPELQEIPIYYASSLAKKCMAVYQTYINAMNDKIRR--QIAVNNPFVFRHIS---NLKGI 321

Query: 313 DNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           D+  D GP +V+AS   +++G S ++F  W SD KN V+       GTLA+ + ++P
Sbjct: 322 DHFEDIGPCVVMASPGMMQSGLSRELFETWCSDPKNGVIIAGYCVEGTLAKTVLSEP 378


>gi|147905468|ref|NP_001088278.1| cleavage and polyadenylation specific factor 3, 73kDa [Xenopus
           laevis]
 gi|54038587|gb|AAH84286.1| LOC495111 protein [Xenopus laevis]
          Length = 692

 Score =  149 bits (375), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 102/372 (27%), Positives = 190/372 (51%), Gaps = 26/372 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 36  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 95

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR  L      Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 96  FKGRTFMTHATKAIYRWLL----SDYVKVSNISADDMLYTETDLEESMDKIETI----NF 147

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 148 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 206

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 207 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRSLIPVFALGRAQELLLILDEYWQN 266

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 267 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 324

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++P    
Sbjct: 325 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEE-- 380

Query: 373 VKVTMS-RRVPL 383
             VTMS +++PL
Sbjct: 381 -IVTMSGQKLPL 391


>gi|157117185|ref|XP_001652976.1| cleavage and polyadenylation specificity factor [Aedes aegypti]
 gi|108876120|gb|EAT40345.1| AAEL007904-PA [Aedes aegypti]
          Length = 687

 Score =  149 bits (375), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 95/357 (26%), Positives = 185/357 (51%), Gaps = 24/357 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +         P   +  A  +D + +SH    H GALP+ +++  
Sbjct: 36  MLEFKGKKIMLDCGIHPGLSGMDALPFVDLIEADEVDLLFISHFHLDHCGALPWFLQKTS 95

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     M   Y+    +S    L+T  D++++ + +  +    N+
Sbjct: 96  FKGRCFMTHATKAIYRW----MLSDYIKVSNISTDQMLYTEADLEASMEKIETI----NF 147

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H      G+    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 148 HEERDVMGVRFWAYNAGHVLGAAMFMIEIAGVKILYTGDFSRQEDRHLMAAEIPA-MKPD 206

Query: 196 VLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT++    H    R+ RE  F   + K ++ GG  L+PV + GR  ELLLIL++YW++
Sbjct: 207 VLITESTYGTHIHEKREDRESRFTSLVQKIVQQGGRCLIPVFALGRAQELLLILDEYWSQ 266

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           +     +PIY+ + ++   +   ++++  M D I +  + + +N F+ +H++   N   +
Sbjct: 267 NPELQEFPIYYASSLAKKCMAVYQTYINAMNDKIRR--QIAVNNPFVFRHIS---NLKGI 321

Query: 313 DNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           D+  D GP +V+AS   +++G S ++F  W +D KN V+       GTLA+ + ++P
Sbjct: 322 DHFEDIGPCVVMASPGMMQSGLSRELFETWCTDPKNGVIIAGYCVEGTLAKTILSEP 378


>gi|212543221|ref|XP_002151765.1| cleavage and polyadenylylation specificity factor, putative
           [Talaromyces marneffei ATCC 18224]
 gi|210066672|gb|EEA20765.1| cleavage and polyadenylylation specificity factor, putative
           [Talaromyces marneffei ATCC 18224]
          Length = 1015

 Score =  149 bits (375), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 136/515 (26%), Positives = 209/515 (40%), Gaps = 128/515 (24%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G   L+D GW++ FD   L  L K   T+  VLL+H    H+GA  +  K   L    PV
Sbjct: 27  GIKILVDVGWDETFDVLELAELEKHIPTLSLVLLTHATISHIGAFAHCCKIFPLFTQIPV 86

Query: 85  FSTEPVYRLGLLTMYDQYLS---------RRQVSEFDLF--------------------- 114
           ++T PV  LG   + D Y S         +  +SE                         
Sbjct: 87  YATGPVISLGRTLLQDMYTSAPLAATFLPKASISELGASTSAASAAVATASAEGDDQSSK 146

Query: 115 -------------TLDDIDSAFQSVTRLTYSQNYH-----LSGKGEGIVVAPHVAGHLLG 156
                        T ++I   F  +  L YSQ +       S   +G+ +  + AGH +G
Sbjct: 147 KLGTTGRILLQPPTGEEIARYFSLIHPLKYSQPHSPLCSPFSPPLDGLTLTAYSAGHTVG 206

Query: 157 GTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAYNA 204
           GT+W I    E ++YAVD+N+ +E  + G             V+E   +P  LI  +   
Sbjct: 207 GTIWHIQHGMESIVYAVDWNQARENVVAGAAWFGGSGTSGTEVIEQLRKPTALICSSKGG 266

Query: 205 LHNQPP---RQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS----- 256
               PP    ++  +  D I  +L  GG+VL+P D++ RVLEL   LE  W + +     
Sbjct: 267 DKFAPPGGLHKRDALLFDMIRSSLAKGGSVLIPTDTSARVLELSYALEHAWRDAADSADS 326

Query: 257 ----LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET-------------------- 292
                   +Y     + ST+   +S LEWM + I + FE                     
Sbjct: 327 EDVFKKAELYLAGRKAHSTMRLARSMLEWMDEGIVREFEAVEGGDAAAVRGHKTTDSQNR 386

Query: 293 ----SRDNA------FLLKHVTLLINKSELDNA-PDG-PKLVLASMASLEAGFSHDIFVE 340
               +RD        F LKH+ ++  K +L+    DG PK+++AS  SL+ G+S + F  
Sbjct: 387 NAGVTRDKQGTKLGPFTLKHLKIVEQKRKLEKVLADGIPKVIIASDTSLDWGYSKETFRT 446

Query: 341 WASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKK 400
            A   +NL+L TE     TL    Q D P +  K+T+ R +         YEE +  +  
Sbjct: 447 LAQGSQNLILLTE-----TLPIRYQTDDPEQPDKMTLGRMI------WRWYEERRDGVAM 495

Query: 401 EEALKASLVKEEES-----------KASLGPDNNL 424
           E A    L+++  S           +A+L PD  +
Sbjct: 496 ETASNGELLEQIHSGGREISIVDVERAALDPDEQV 530


>gi|384499309|gb|EIE89800.1| hypothetical protein RO3G_14511 [Rhizopus delemar RA 99-880]
          Length = 654

 Score =  149 bits (375), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 114/391 (29%), Positives = 190/391 (48%), Gaps = 34/391 (8%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPL--SKVASTIDAVLLSH 62
           +++TPL         S L+   G   L+D G +  ++     P       ++ID +L++H
Sbjct: 7   LKITPLGSGNEVGRSSILMEYKGKTILLDAGIHPAYNGLASLPFFDEMDPASIDVLLVTH 66

Query: 63  PDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDS 121
               H  ++PY M +      VF T P   +    + D YL    + E D L+T +D+ +
Sbjct: 67  FHVDHAASVPYLMGK----GRVFMTHPTKAIFKWLLSD-YLRVSHIGEEDQLYTEEDLLN 121

Query: 122 AFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEK 181
           +F  +  + Y Q   +    EGI    + AGH+LG  ++ I   G  V+Y  DY+R +++
Sbjct: 122 SFHRIEAIDYHQQVEV----EGIKFTAYNAGHVLGAAMFLIEIAGVKVLYTGDYSREEDR 177

Query: 182 HL------NGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLP 234
           HL       G+V        VLIT++   + +  PR  +E  F   +   +  GG  L+P
Sbjct: 178 HLMAAEKPEGSV-------DVLITESTYGVQSHEPRIAKETRFTSLVHNIVTRGGRCLMP 230

Query: 235 VDSAGRVLELLLILEDYWAEHSL--NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET 292
           V + GR  ELLLIL+++W  H    + PIY+ + ++   +   ++++  M   I K F  
Sbjct: 231 VFALGRAQELLLILDEFWEAHPELDSIPIYYASSLAKRCMAVYQTYINMMNARIRKQFAI 290

Query: 293 SRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFT 352
           S  N F+ KH++ L N  + +++  GP +++AS   L+ G S ++F  WA D KN ++ T
Sbjct: 291 S--NPFVFKHISNLKNVEQFEDS--GPCVMMASPGMLQNGLSRELFERWAPDKKNGLVIT 346

Query: 353 ERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
                 TLAR  QA   P   +    R+VPL
Sbjct: 347 GYCVENTLAR--QAMNEPSDFQAMDGRKVPL 375


>gi|351704796|gb|EHB07715.1| Cleavage and polyadenylation specificity factor subunit 3
           [Heterocephalus glaber]
          Length = 692

 Score =  149 bits (375), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 101/372 (27%), Positives = 190/372 (51%), Gaps = 26/372 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVHAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++P   A
Sbjct: 318 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEIA 375

Query: 373 VKVTMS-RRVPL 383
              TMS +++PL
Sbjct: 376 ---TMSGQKLPL 384


>gi|324504608|gb|ADY41989.1| Integrator complex subunit 11 [Ascaris suum]
          Length = 588

 Score =  149 bits (375), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 97/355 (27%), Positives = 169/355 (47%), Gaps = 17/355 (4%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      L+SI G N ++DCG +  +       D S +     +   +  
Sbjct: 4   LKVTPLGAGQDVGRSCILLSIGGKNVMLDCGMHMGYQDERRFPDFSYISGGVPLTDYLHC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + +      E + FT 
Sbjct: 64  VIISHFHLDHCGALPYMTEMVGYEGPIYMTYPTKAIAPVLLEDFRKVQTEYRGETNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I +  + VT +  ++  ++  K   + +    AGH+LG  ++ I    E VIY  D+N
Sbjct: 124 QMIKTCMRKVTPVNVNEEVNVDDK---LSIQAFYAGHVLGAAMFLIKVGSESVIYTGDFN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    +E  ++P +LI++   A   +  ++ RE  F   +   +  GG VL+PV
Sbjct: 181 TTADRHLGAAHVEPGLKPDLLISETTYATTIRDSKRARERDFLKKVHDCVANGGKVLIPV 240

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE YW    L  PI+F   ++     Y + F+ W  + I ++F     
Sbjct: 241 FALGRAQELCILLESYWERMDLTVPIFFSHGLAEKATQYYRLFISWTNEKIKRTF--VHR 298

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
           N F  KH+          ++P GP ++ ++   L  G S  +F +W SD KN+V+
Sbjct: 299 NMFDFKHIRPF--DQSFSDSP-GPMVLFSTPGMLHGGQSLRVFKKWCSDEKNMVI 350


>gi|432954006|ref|XP_004085503.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Oryzias latipes]
          Length = 686

 Score =  148 bits (374), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 96/356 (26%), Positives = 182/356 (51%), Gaps = 22/356 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 36  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 95

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  + L+T  D++ + + +  +    N+
Sbjct: 96  FKGRTFMTHATKAIYRW----LLSDYIKVSNISADEMLYTETDLEDSMEKIETI----NF 147

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + S V+P 
Sbjct: 148 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPS-VKPD 206

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LIT++    H    R++RE  F + +   +   G  L+PV + GR  ELLLIL++YW  
Sbjct: 207 ILITESTYGTHIHEKREEREARFCNTVHDIVNREGRCLIPVFALGRAQELLLILDEYWQN 266

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K+     +N F+ KH++ L +    
Sbjct: 267 HPELHDIPIYYASSLARKCMAVYQTYINAMNDKIRKAINV--NNPFVFKHISNLKSMDHF 324

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ +  +P
Sbjct: 325 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMTEP 378


>gi|198451826|ref|XP_001358526.2| GA20526 [Drosophila pseudoobscura pseudoobscura]
 gi|198131664|gb|EAL27667.2| GA20526 [Drosophila pseudoobscura pseudoobscura]
          Length = 684

 Score =  148 bits (374), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 104/391 (26%), Positives = 197/391 (50%), Gaps = 30/391 (7%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV----ASTIDAVLL 60
           +Q+ PL           ++   G   ++DCG   H   S +  L  V    A  ID + +
Sbjct: 18  LQIKPLGAGQEVGRSCIMLEFKGKKIMLDCGI--HPGLSGMDALPYVDLIEADEIDLLFI 75

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVS-EFDLFTL 116
           SH    H GALP+ + +       F   +T+ +YR     M   Y+    +S E  L+T 
Sbjct: 76  SHFHLDHCGALPWFLMKTSFKGRCFMTHATKAIYRW----MLSDYIKISNISTEQMLYTE 131

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
            D++++ + +  +    N+H      G+    + AGH+LG  ++ I   G  ++Y  D++
Sbjct: 132 ADLEASMEKIETI----NFHEERDVMGVRFCAYNAGHVLGAAMFMIEIAGIKILYTGDFS 187

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPV 235
           R++++HL    +   ++P VLIT++    H    R+ RE  F   + KT+  GG  L+PV
Sbjct: 188 RQEDRHLMAAEVPP-MKPDVLITESTYGTHIHEKREDRENRFTSLVQKTVLQGGRCLIPV 246

Query: 236 DSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
            + GR  ELLLIL+++W+++      PIY+ + ++   +   ++++  M D I +  + +
Sbjct: 247 FALGRAQELLLILDEFWSQNPELHEIPIYYASSLAKKCMAVYQTYINAMNDRIRR--QIA 304

Query: 294 RDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFT 352
            +N F+ +H++   N   +D+  D GP +++AS   +++G S ++F  W +D KN V+  
Sbjct: 305 VNNPFVFRHIS---NLKGIDHFEDIGPCVIMASPGMMQSGLSRELFESWCTDPKNGVIIA 361

Query: 353 ERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
                GTLA+ + ++  P+ +     +++PL
Sbjct: 362 GYCVEGTLAKTILSE--PEEITTLSGQKLPL 390


>gi|400600571|gb|EJP68245.1| metallo-beta-lactamase superfamily protein [Beauveria bassiana
           ARSEF 2860]
          Length = 866

 Score =  148 bits (374), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 108/381 (28%), Positives = 185/381 (48%), Gaps = 29/381 (7%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           +++   G   ++D G +  +D     P       ST+D +L+SH    H  +LPY + + 
Sbjct: 41  HIIQYKGKTVMLDAGQHPAYDGLAALPFYDDFDLSTVDVLLISHFHIDHAASLPYVLAKT 100

Query: 79  GLSAPVFSTEPVYRLGLLTMYDQY----LSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQN 134
                VF T P   +    + D       S  Q ++  L+T  D  + F  +  + Y   
Sbjct: 101 NFRGRVFMTHPTKAIYKWLIQDSVRVGNTSANQTTQ-PLYTEQDHLNTFPQIEAIDYHTT 159

Query: 135 YHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRP 194
           + +S     I + P+ AGH+LG  ++ I   G ++ +  DY+R +++HL    +   V+ 
Sbjct: 160 HTISS----IRITPYPAGHVLGAAMFLIEIAGLNIFFTGDYSREQDRHLVSAEVPKGVKI 215

Query: 195 AVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 253
            VLIT++   + +  PR +RE     +I+  L  GG  LLPV + GR  ELLLIL++YW 
Sbjct: 216 DVLITESTYGIASHVPRLEREQALMKSITNILNRGGRALLPVFALGRAQELLLILDEYWG 275

Query: 254 EHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF-------ETSRDNA----FLL 300
           +HS    YPIY+ + ++   +   ++++  M D+I + F       ETS +      +  
Sbjct: 276 KHSEFQKYPIYYASNLAKKCMLIYQTYVGAMNDNIKRLFRERMAEAETSGEAGAGGPWDF 335

Query: 301 KHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTL 360
           K++  L N    D+   G  ++LAS   L+ G S ++F  WA   KN V+ T     GT+
Sbjct: 336 KYIRSLKNLDRFDDV--GGCVMLASPGMLQNGVSRELFERWAPSDKNGVIITGYSVEGTM 393

Query: 361 ARMLQADPPPKAVKVTMSRRV 381
           AR +  +  P+ ++  MSR +
Sbjct: 394 ARQIMKE--PEQIQAVMSRSI 412


>gi|344301243|gb|EGW31555.1| hypothetical protein SPAPADRAFT_67601 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 1032

 Score =  148 bits (374), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 138/452 (30%), Positives = 215/452 (47%), Gaps = 61/452 (13%)

Query: 22  LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSH--PDTLHLGALPYAMK-- 76
           L+S D  F  L D  W D  D + +  + +  S ++AVLLSH  PD +  G +   +K  
Sbjct: 20  LLSFDNEFKLLADPSW-DGKDANAVLFMEQHLSEVNAVLLSHSTPDFIS-GYVLLCLKFP 77

Query: 77  QLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD--LFTLDDIDSAFQSVTRLTYSQN 134
            L  + PV+ST PV +LG ++  + Y +   +   D  +  +D++D+ F  VT L Y Q+
Sbjct: 78  NLMSTMPVYSTLPVNQLGRISTVEYYRANGVLGPLDSAILEIDEVDNWFDRVTLLKYQQS 137

Query: 135 YHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN---------G 185
            +L      + + P+ AGH LGG  W I K  + VIYA  +N  K+  LN         G
Sbjct: 138 TNL--MDNKVTITPYNAGHTLGGAFWLIVKRIDKVIYAPAWNHSKDSFLNSASFISTSTG 195

Query: 186 TVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELL 245
             L S +RP   IT A +     P +++ E F   +  TL  GG  LLP   +GR LEL 
Sbjct: 196 NPLLSLLRPTAFIT-APDLGSTMPHKRRTEKFLQLVDATLANGGAALLPTSLSGRFLELF 254

Query: 246 LILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF-ETSRDNA------- 297
            +++++     +  P+YFL+Y  +  + Y  + L+WM  S  KS+ ETS D         
Sbjct: 255 HLIDEHLQGAPI--PVYFLSYSGTRILSYASNLLDWMSGSFIKSWDETSGDGGRGGGKAL 312

Query: 298 ----FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAG-FSHDIFVEWASDVKNLVLFT 352
               F    V LL++ SEL     GPK+V  S   +++G  S + F    ++ K  V+ T
Sbjct: 313 SSMPFDPSKVDLLLDPSELIQL-SGPKIVFCSGIDIKSGDISSETFQYLCNNEKTTVILT 371

Query: 353 ERGQF--GTLARML-------------------QADPPPKAVKVT-MSRRVPLVGEELIA 390
           E+ Q   G L  ML                    A P  K V +   +R   L G EL  
Sbjct: 372 EKSQLENGGLNSMLYKEWYELTKKKLGGKIEDGTAVPLDKTVSIEDWTRETNLEGRELSD 431

Query: 391 YEEEQTRLKKEEALKASLVKEEESKASLGPDN 422
           ++E  T+ +KE+ L  + V++++++  L  +N
Sbjct: 432 FQERITQQRKEKLL--AKVRDKKNQNILNAEN 461


>gi|193608339|ref|XP_001949326.1| PREDICTED: integrator complex subunit 11-like isoform 1
           [Acyrthosiphon pisum]
 gi|328710634|ref|XP_003244318.1| PREDICTED: integrator complex subunit 11-like isoform 2
           [Acyrthosiphon pisum]
          Length = 603

 Score =  148 bits (374), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 105/397 (26%), Positives = 188/397 (47%), Gaps = 32/397 (8%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVAS 53
           +   + VTPL    +      L++I   N ++DCG +  +       D S +     +  
Sbjct: 3   ISNRIIVTPLGAGQDVGRSCILITIGNRNIMLDCGMHMGYQDERKFPDFSYITSDGNITD 62

Query: 54  TIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD- 112
            ID V++SH    H GAL Y  + LG   P++ T P   +  + + D    R+ + E++ 
Sbjct: 63  IIDCVIISHFHLDHCGALSYLTEHLGYHGPIYMTHPTKAIAPILLEDM---RKHLVEYEE 119

Query: 113 ---LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
               FT   I    + VT +     + +    + I +  + AGH+LG  ++ I    + V
Sbjct: 120 EAKYFTSSAIRDCMKKVTAVNL---HEVVTVKDDIELKAYYAGHVLGAAMFYIKVGNDSV 176

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAG 228
           +Y  D++   ++HL    ++   RP +LIT++  A   +  ++ RE  F   + + +  G
Sbjct: 177 VYTGDFSMTPDRHLGAAWIDK-CRPTLLITESTYATTIRDSKRCRERDFLKNVHECIDRG 235

Query: 229 GNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITK 288
           G VL+P+ + GR  EL ++++ YW    L  P+YF   ++     Y K F+ W    + +
Sbjct: 236 GKVLIPIFALGRAQELCILIDTYWDRMGLKVPVYFAAGLTEKANSYYKMFITWTNQKVRQ 295

Query: 289 SFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNL 348
           +F   + N F  KH+    +K+ + N   GP +V A+   L AG S +IF +WA D KN+
Sbjct: 296 TF--VQRNMFDFKHIKPF-DKTYMHNP--GPMVVFATPGMLHAGLSLNIFKKWAPDEKNM 350

Query: 349 VLFTERGQFGTL-------ARMLQADPPPKAVKVTMS 378
           ++       GT+       ++ ++A+ P K + V MS
Sbjct: 351 LIVPGYCVSGTVGNKVLSGSKKIEAE-PNKFIDVKMS 386


>gi|393245131|gb|EJD52642.1| Metallo-hydrolase/oxidoreductase [Auricularia delicata TFB-10046
           SS5]
          Length = 751

 Score =  148 bits (374), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 102/324 (31%), Positives = 171/324 (52%), Gaps = 14/324 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGL---SAPVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+DA+L++H    H  +L Y M++      +  V+ T P   +    M D ++     S
Sbjct: 57  STVDALLITHFHLDHAASLTYIMEKTNFRDGNGKVYMTHPTKAVYKFMMQD-FVRMSAAS 115

Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
              LFT  D+  +  S+  ++  Q   +     G+   P+ AGH+LG  ++ I   G  V
Sbjct: 116 TDALFTPLDLSMSLASIIPISAHQ---VISPCPGLTFTPYHAGHVLGACMFHIDIAGVKV 172

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAG 228
           +Y  DY+R +++HL    +   VRP VLI ++   + +   R+++E  F   I + ++ G
Sbjct: 173 LYTGDYSREEDRHLVKAEIPP-VRPDVLIVESTYGVQSVGNREEKEGRFLSLIHEIIKRG 231

Query: 229 GNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSI 286
           G+ LLPV + GR  ELLL+L+DYWA+H    + PIY+ + ++   +   ++++  M  +I
Sbjct: 232 GHALLPVFALGRAQELLLVLDDYWAKHPELHSVPIYYASNLARKCMAVYQTYIHTMNSNI 291

Query: 287 TKSFETSRDNAFLLKHVTLLINKSELDNA-PDGPK-LVLASMASLEAGFSHDIFVEWASD 344
            + F   RDN F+ KH++ L     L+    DGP  +VLAS   L++G S ++   WA D
Sbjct: 292 RQRF-ARRDNPFIFKHISHLPQTRGLERKIADGPPCVVLASPGMLQSGTSRELLELWAPD 350

Query: 345 VKNLVLFTERGQFGTLARMLQADP 368
            +N ++ T     GTLAR +  DP
Sbjct: 351 PRNALVVTGYSVEGTLARDILNDP 374


>gi|195145744|ref|XP_002013850.1| GL23169 [Drosophila persimilis]
 gi|194102793|gb|EDW24836.1| GL23169 [Drosophila persimilis]
          Length = 684

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 104/391 (26%), Positives = 197/391 (50%), Gaps = 30/391 (7%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV----ASTIDAVLL 60
           +Q+ PL           ++   G   ++DCG   H   S +  L  V    A  ID + +
Sbjct: 18  LQIKPLGAGQEVGRSCIMLEFKGKKIMLDCGI--HPGLSGMDALPYVDLIEADEIDLLFI 75

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVS-EFDLFTL 116
           SH    H GALP+ + +       F   +T+ +YR     M   Y+    +S E  L+T 
Sbjct: 76  SHFHLDHCGALPWFLMKTSFKGRCFMTHATKAIYRW----MLSDYIKISNISTEQMLYTE 131

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
            D++++ + +  +    N+H      G+    + AGH+LG  ++ I   G  ++Y  D++
Sbjct: 132 ADLEASMEKIETI----NFHEERDVMGVRFCAYNAGHVLGAAMFMIEIAGIKILYTGDFS 187

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPV 235
           R++++HL    +   ++P VLIT++    H    R+ RE  F   + KT+  GG  L+PV
Sbjct: 188 RQEDRHLMAAEVPP-MKPDVLITESTYGTHIHEKREDRENRFTSLVQKTVLQGGRCLIPV 246

Query: 236 DSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
            + GR  ELLLIL+++W+++      PIY+ + ++   +   ++++  M D I +  + +
Sbjct: 247 FALGRAQELLLILDEFWSQNPELHEIPIYYASSLAKKCMAVYQTYINAMNDRIRR--QIA 304

Query: 294 RDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFT 352
            +N F+ +H++   N   +D+  D GP +++AS   +++G S ++F  W +D KN V+  
Sbjct: 305 VNNPFVFRHIS---NLKGIDHFEDIGPCVIMASPGMMQSGLSRELFESWCTDPKNGVIIA 361

Query: 353 ERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
                GTLA+ + ++  P+ +     +++PL
Sbjct: 362 GYCVEGTLAKTILSE--PEEITTLSGQKLPL 390


>gi|348558392|ref|XP_003465002.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Cavia porcellus]
          Length = 684

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 101/372 (27%), Positives = 190/372 (51%), Gaps = 26/372 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++P   A
Sbjct: 318 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEIA 375

Query: 373 VKVTMS-RRVPL 383
              TMS +++PL
Sbjct: 376 ---TMSGQKLPL 384


>gi|326916480|ref|XP_003204535.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Meleagris gallopavo]
          Length = 759

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 103 ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 162

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 163 FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 214

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 215 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 273

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 274 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 333

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 334 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 391

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 392 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 447

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 448 ITTMSGQKLPL 458


>gi|308492421|ref|XP_003108401.1| CRE-CPSF-3 protein [Caenorhabditis remanei]
 gi|308249249|gb|EFO93201.1| CRE-CPSF-3 protein [Caenorhabditis remanei]
          Length = 712

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 102/373 (27%), Positives = 176/373 (47%), Gaps = 18/373 (4%)

Query: 4   SVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVAS--TIDAVLLS 61
           S+  TPL          +L+   G   ++DCG +         P         ID +L++
Sbjct: 10  SLSFTPLGSGQEVGRSCHLLEYKGKRVMLDCGVHPGLHGVDALPFVDFVEIENIDLLLIT 69

Query: 62  HPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDD 118
           H    H GALP+ +++       F   +T+ +YR+ LL  Y +           L+T DD
Sbjct: 70  HFHLDHCGALPWLLQKTAFRGKCFMTHATKAIYRM-LLGDYVRISKYGGADRNQLYTEDD 128

Query: 119 IDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRR 178
           ++ +   +  + + +   ++G    I   P+VAGH+LG   + I   G  V+Y  D++  
Sbjct: 129 LEKSMAKIETIDFREQKEVNG----IRFWPYVAGHVLGACQFMIEIAGVRVLYTGDFSCL 184

Query: 179 KEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDS 237
           +++HL    +   + P VLIT++         R  RE  F   +   +  GG  L+P  +
Sbjct: 185 EDRHLCAAEIPP-ITPQVLITESTYGTQTHEERSVREKRFTQMVHDIVTRGGRCLIPAFA 243

Query: 238 AGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            G   EL+LIL++YW  H    + P+Y+ + ++   +   ++F+  M   I K  + +  
Sbjct: 244 IGPAQELMLILDEYWESHQELHDIPVYYASSLAKKCMSVYQTFVNGMNSRIQK--QIAVK 301

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F+ KHV+ L    + ++A  GP +VLA+   L++GFS ++F  W SD KN  +     
Sbjct: 302 NPFIFKHVSTLRGMDQFEDA--GPCVVLATPGMLQSGFSRELFENWCSDSKNGCIIAGYC 359

Query: 356 QFGTLARMLQADP 368
             GTLAR +  +P
Sbjct: 360 VEGTLARHILTEP 372


>gi|391330858|ref|XP_003739869.1| PREDICTED: integrator complex subunit 11-like [Metaseiulus
           occidentalis]
          Length = 601

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 98/368 (26%), Positives = 173/368 (47%), Gaps = 18/368 (4%)

Query: 3   TSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTI 55
           + + +TPL    +      L+S+ G N ++DCG +  +       D S +     +   +
Sbjct: 2   SEITITPLGAGQDVGRSCILISMGGKNIMLDCGMHMGYQDERRFPDFSYINNGGPLDDFL 61

Query: 56  DAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLF 114
           D V++SH    H GALP+  + +G + P++ T P   +  + + D + +   +  E + F
Sbjct: 62  DCVIISHFHLDHCGALPFMSEMIGYTGPIYMTHPTKAICPILLEDFRKICVDKKGEQNFF 121

Query: 115 TLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVD 174
           +   I    + V      +   +  + E   +  + AGH+LG  ++ I      ++Y  D
Sbjct: 122 SQGMIRDCMKKVIPCNLHETIKVDSELE---IKAYYAGHVLGAAMFHIKVGHISIVYTGD 178

Query: 175 YNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLL 233
           YN   ++HL    ++   RP +LIT++  A   +  ++ RE  F + +   +  GG VL+
Sbjct: 179 YNMTPDRHLGAAWIDR-CRPDLLITESTYATTIRDSKRCRERDFLNKVHDCIERGGKVLI 237

Query: 234 PVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
           P  + GR  EL ++LE YW   +L  PIYF   ++    +Y K F+ W    I  +F   
Sbjct: 238 PAFALGRAQELCILLETYWERMNLKCPIYFAAGLTEKATNYYKMFITWTNQKIRNTF--V 295

Query: 294 RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
             N F  KH+    +++ +DN   GP +V A+   L AG S  IF +WA   +N+V+   
Sbjct: 296 DHNMFDFKHIKPF-DRAYIDNP--GPMVVFATPGMLHAGLSLQIFKKWAPFEENMVIMPG 352

Query: 354 RGQFGTLA 361
               GT+ 
Sbjct: 353 YCVSGTVG 360


>gi|260815130|ref|XP_002602327.1| hypothetical protein BRAFLDRAFT_282200 [Branchiostoma floridae]
 gi|229287635|gb|EEN58339.1| hypothetical protein BRAFLDRAFT_282200 [Branchiostoma floridae]
          Length = 687

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 93/336 (27%), Positives = 171/336 (50%), Gaps = 22/336 (6%)

Query: 55  IDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEF 111
           ID +L+SH    H G LPY + +      VF   +T+ +Y+     +   Y+    +S  
Sbjct: 71  IDLLLISHFHLDHCGGLPYFLTKTSFRGRVFMTHATKAIYKW----LLSDYIKVSNISSE 126

Query: 112 D-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVI 170
           D L+T +D+ ++   +  +    N+H      GI    + AGH+LG  ++ I   G  ++
Sbjct: 127 DMLYTENDLSASMDKIETV----NFHQETDVNGIKFWCYNAGHVLGAAMFMIEIAGVKIL 182

Query: 171 YAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGG 229
           Y  D++R++++HL    + + + P VLI +A    H    R++RE  F   +   +  GG
Sbjct: 183 YTGDFSRQEDRHLMAAEVPA-IHPDVLIIEATYGTHIHEKREEREARFTSTVHDIVNRGG 241

Query: 230 NVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSIT 287
             L+PV + GR  ELLLIL++YW+ H    + PIY+ + ++   +   ++++  M + I 
Sbjct: 242 RCLIPVFALGRAQELLLILDEYWSNHPELHDIPIYYASSLAKKCMAVYQTYINAMNEKIR 301

Query: 288 KSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKN 347
           K    S  N F+ KH++ L      D+   GP +V+AS   +++G S ++F  W +D +N
Sbjct: 302 KQISVS--NPFVFKHISNLKGMDHFDDI--GPSVVMASPGMMQSGLSRELFESWCTDRRN 357

Query: 348 LVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
             +       GTLA+ + ++  P+ +     +++PL
Sbjct: 358 GCIIAGYCVEGTLAKHIMSE--PEEITTMSGQKIPL 391


>gi|403270697|ref|XP_003927303.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3 [Saimiri boliviensis boliviensis]
          Length = 658

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 33  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 92

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 93  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 144

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 145 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 203

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 204 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 263

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 264 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 321

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 322 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 377

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 378 ITTMSGQKLPL 388


>gi|431911821|gb|ELK13965.1| Cleavage and polyadenylation specificity factor subunit 3, partial
           [Pteropus alecto]
          Length = 667

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 12  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 71

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 72  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 123

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 124 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 182

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 183 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 242

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 243 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 300

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 301 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 356

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 357 ITTMSGQKLPL 367


>gi|363732494|ref|XP_419942.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3 [Gallus gallus]
          Length = 672

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 98/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 16  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 75

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR  L      Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 76  FKGRTFMTHATKAIYRWLL----SDYVKVSNISADDMLYTETDLEESMDKIETI----NF 127

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 128 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 186

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 187 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 246

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 247 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 304

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 305 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 360

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 361 ITTMSGQKLPL 371


>gi|194220982|ref|XP_001502516.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Equus caballus]
 gi|301775721|ref|XP_002923277.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Ailuropoda melanoleuca]
          Length = 684

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>gi|410955844|ref|XP_003984560.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3 [Felis catus]
          Length = 686

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>gi|221106537|ref|XP_002161150.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Hydra magnipapillata]
          Length = 677

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 97/346 (28%), Positives = 173/346 (50%), Gaps = 24/346 (6%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFS 86
           G+N L    + D  DP            +D +L+SH    H G LP+ +++      VF 
Sbjct: 44  GYNGLDSLPFIDEIDPG----------EVDLLLISHFHLDHCGGLPWFLEKTHFKGRVFM 93

Query: 87  TEPVYRLGLLTMYDQYLSRRQVS-EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIV 145
           T P   +    + D Y+    +S +  L+T  D++ +   +  + + Q   +SG    I 
Sbjct: 94  THPTKAIYRWLLAD-YIKVSNISADQMLYTEKDLEKSMDKIETMHFHQEKEVSG----IK 148

Query: 146 VAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNAL 205
              + AGH+LG  ++ I   G +++Y  D++R++++HL    + + + P VLI ++    
Sbjct: 149 FWAYNAGHVLGAAMFMIEIAGVNILYTGDFSRQEDRHLMSAEIPN-ISPDVLIMESTYGT 207

Query: 206 HNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIY 262
           H    R+QRE  F   I   +  GG  L+PV + GR  ELLLIL++YW +H    + P+Y
Sbjct: 208 HVHEKREQREKRFTSTIHNIISRGGRCLIPVFALGRAQELLLILDEYWNQHPELQDVPVY 267

Query: 263 FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLV 322
           + + ++   +   ++++  M + I +    S  N F+ KH++ L      D+   GP +V
Sbjct: 268 YASSLAKKCMAVYQTYISAMNEKIRRQISIS--NPFVFKHISNLKGIDSFDDI--GPSVV 323

Query: 323 LASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           LAS   +++G S ++F  W +D +N V+       GTLA+ L ++P
Sbjct: 324 LASPGMMQSGLSRELFETWCTDPRNGVIIAGYCVEGTLAKELMSEP 369


>gi|126303222|ref|XP_001371997.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Monodelphis domestica]
          Length = 684

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>gi|449498153|ref|XP_002196255.2| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
           specificity factor subunit 3 [Taeniopygia guttata]
          Length = 746

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 91  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 150

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 151 FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 202

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 203 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 261

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 262 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 321

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 322 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 379

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 380 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 435

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 436 ITTMSGQKLPL 446


>gi|296224527|ref|XP_002758090.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3 [Callithrix jacchus]
          Length = 684

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>gi|350539083|ref|NP_001233296.1| cleavage and polyadenylation specificity factor subunit 3 [Pan
           troglodytes]
 gi|397513374|ref|XP_003826991.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3 [Pan paniscus]
 gi|426334660|ref|XP_004028859.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3 [Gorilla gorilla gorilla]
 gi|343961085|dbj|BAK62132.1| cleavage and polyadenylation specificity factor 73 kDa subunit [Pan
           troglodytes]
 gi|343961781|dbj|BAK62478.1| cleavage and polyadenylation specificity factor 73 kDa subunit [Pan
           troglodytes]
 gi|410254182|gb|JAA15058.1| cleavage and polyadenylation specific factor 3, 73kDa [Pan
           troglodytes]
 gi|410291448|gb|JAA24324.1| cleavage and polyadenylation specific factor 3, 73kDa [Pan
           troglodytes]
 gi|410339611|gb|JAA38752.1| cleavage and polyadenylation specific factor 3, 73kDa [Pan
           troglodytes]
          Length = 684

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>gi|332247248|ref|XP_003272765.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3 [Nomascus leucogenys]
 gi|67969340|dbj|BAE01022.1| unnamed protein product [Macaca fascicularis]
 gi|355751093|gb|EHH55348.1| hypothetical protein EGM_04543 [Macaca fascicularis]
 gi|380813676|gb|AFE78712.1| cleavage and polyadenylation specificity factor subunit 3 [Macaca
           mulatta]
 gi|383419123|gb|AFH32775.1| cleavage and polyadenylation specificity factor subunit 3 [Macaca
           mulatta]
 gi|384940728|gb|AFI33969.1| cleavage and polyadenylation specificity factor subunit 3 [Macaca
           mulatta]
          Length = 684

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>gi|335285899|ref|XP_003354974.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Sus scrofa]
          Length = 684

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>gi|402890043|ref|XP_003908303.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3 [Papio anubis]
          Length = 684

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>gi|7706427|ref|NP_057291.1| cleavage and polyadenylation specificity factor subunit 3 [Homo
           sapiens]
 gi|18203503|sp|Q9UKF6.1|CPSF3_HUMAN RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 3; AltName: Full=Cleavage and polyadenylation
           specificity factor 73 kDa subunit; Short=CPSF 73 kDa
           subunit; AltName: Full=mRNA 3'-end-processing
           endonuclease CPSF-73
 gi|6002955|gb|AAF00224.1|AF171877_1 cleavage and polyadenylation specificity factor 73 kDa subunit
           [Homo sapiens]
 gi|18044212|gb|AAH20211.1| Cleavage and polyadenylation specific factor 3, 73kDa [Homo
           sapiens]
 gi|62822309|gb|AAY14858.1| unknown [Homo sapiens]
 gi|119621394|gb|EAX00989.1| cleavage and polyadenylation specific factor 3, 73kDa, isoform
           CRA_a [Homo sapiens]
          Length = 684

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>gi|27805863|ref|NP_776709.1| cleavage and polyadenylation specificity factor subunit 3 [Bos
           taurus]
 gi|426223116|ref|XP_004005724.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3 [Ovis aries]
 gi|18202362|sp|P79101.1|CPSF3_BOVIN RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 3; AltName: Full=Cleavage and polyadenylation
           specificity factor 73 kDa subunit; Short=CPSF 73 kDa
           subunit; AltName: Full=mRNA 3'-end-processing
           endonuclease CPSF-73
 gi|1707412|emb|CAA65151.1| Cleavage and Polyadenylation Specifity Factor protein [Bos taurus]
 gi|75773721|gb|AAI04554.1| Cleavage and polyadenylation specific factor 3, 73kDa [Bos taurus]
 gi|296482248|tpg|DAA24363.1| TPA: cleavage and polyadenylation specificity factor subunit 3 [Bos
           taurus]
 gi|440897562|gb|ELR49218.1| Cleavage and polyadenylation specificity factor subunit 3 [Bos
           grunniens mutus]
          Length = 684

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>gi|346472285|gb|AEO35987.1| hypothetical protein [Amblyomma maculatum]
          Length = 510

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 99/340 (29%), Positives = 164/340 (48%), Gaps = 18/340 (5%)

Query: 31  LIDCGWNDHF-------DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAP 83
           ++DCG +  F       D S +     +   +D V++SH    H GALPY  + +G S P
Sbjct: 1   MLDCGMHMGFNDERRFPDFSYITQEGPLNEHLDCVIISHFHLDHCGALPYMTEMVGYSGP 60

Query: 84  VFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGE 142
           ++ T P   +  + + D + ++  +  E + FT   I    + V  +   Q   +  + E
Sbjct: 61  IYMTHPTKAICPILLEDYRKITVDRKGETNFFTSAMIRDCMRKVVAVNLHQAVQVDDELE 120

Query: 143 GIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAY 202
              +  + AGH+LG  ++ I    + V+Y  DYN   ++HL    ++   RP +LIT++ 
Sbjct: 121 ---IKAYYAGHVLGAAMFWIRVGSQSVVYTGDYNMTPDRHLGAAWVDK-CRPDLLITEST 176

Query: 203 NALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPI 261
            A   +  ++ RE  F   +   +  GG VL+PV + GR  EL ++LE YW   +L  PI
Sbjct: 177 YATTIRDSKRCRERDFLTKVHDCIDKGGKVLIPVFALGRAQELCILLETYWDRMNLRVPI 236

Query: 262 YFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKL 321
           YF   ++    +Y K F+ W    I K+F   + N F  KH+    +++ +DN   GP +
Sbjct: 237 YFAVGLTEKATNYYKMFITWTNQKIRKTF--VQRNMFDFKHIKPF-DRAFIDNP--GPMV 291

Query: 322 VLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLA 361
           V A+   L AG S  IF +WA    N+V+       GT+ 
Sbjct: 292 VFATPGMLHAGLSLQIFKKWAPFEANMVIMPGYCVAGTVG 331


>gi|395507218|ref|XP_003757924.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3 [Sarcophilus harrisii]
          Length = 684

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>gi|291412514|ref|XP_002722528.1| PREDICTED: cleavage and polyadenylation specific factor 3, 73kDa
           [Oryctolagus cuniculus]
          Length = 684

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>gi|15079675|gb|AAH11654.1| Cleavage and polyadenylation specific factor 3, 73kDa [Homo
           sapiens]
 gi|157929136|gb|ABW03853.1| cleavage and polyadenylation specific factor 3, 73kDa [synthetic
           construct]
          Length = 684

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HGVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>gi|50549403|ref|XP_502172.1| YALI0C23232p [Yarrowia lipolytica]
 gi|49648039|emb|CAG82492.1| YALI0C23232p [Yarrowia lipolytica CLIB122]
          Length = 799

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 99/353 (28%), Positives = 173/353 (49%), Gaps = 45/353 (12%)

Query: 54  TIDAVLLSHPDTLHLGALPYAMKQL-GLSA-PVFSTEPVYRLGLLTMYDQYLSRRQVSEF 111
           T++ VL +H +  HLGA   A K    L+A P + T PV  +G +   + Y S+  +S  
Sbjct: 41  TLNLVLFTHANAAHLGAYALACKLYPALAAVPAYGTLPVINMGRIATLEAYRSQGLLSS- 99

Query: 112 DLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEG-------------------------IVV 146
           +  T  +I+  F ++T + Y Q   +  + +G                         + +
Sbjct: 100 EHITATEIEIIFDNITSIKYLQPIGIGVRSKGEVATTATEDGNSTELTTTQVTTHETLTI 159

Query: 147 APHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT--------VLESFVRPAVLI 198
               +GH LGGT+W++    ++V+YAVD+N  K+ HL+G         ++ +  RP V++
Sbjct: 160 TAFNSGHSLGGTIWRLQHQQDNVVYAVDWNHAKDSHLSGAAFLQKGGQIVSALHRPTVMV 219

Query: 199 TDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEH--- 255
             +   L     +++  +   +I K L+ GG+VLLP     RVLE++ +L+D W  +   
Sbjct: 220 CGSQTGLR---LKRRDILLWSSIQKALKRGGSVLLPTSVGSRVLEVIHMLDDLWTNNQNS 276

Query: 256 SLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELD-- 313
                +  LT++ +  ++Y  S LEWM  SI   +E   ++ F  ++  ++ +  + D  
Sbjct: 277 QQGVTLVLLTHLGARLLEYASSMLEWMSPSIIAEWEKKNESPFQTRNFKIVHSMDQFDKV 336

Query: 314 -NAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQ 365
               +G  +V++    LE+GFS  +F   ASD +N VLFTER +  +LA  LQ
Sbjct: 337 VKGGNGQFVVVSVGEDLESGFSRLLFNRLASDERNSVLFTERSEGNSLATELQ 389



 Score = 45.8 bits (107), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 41/143 (28%), Positives = 69/143 (48%), Gaps = 24/143 (16%)

Query: 575 AYKVQLSEKLMSNVLFKKL-GDYEIAWVDAEVGKTEN-------GMLSLLPISTPA---- 622
           A  +QL+ +L   + +++L G   +A V  +V K E+         L+L PI   A    
Sbjct: 663 AVDIQLTPELSRLLNWQQLSGGLSLAHVVGKVAKNEDKSEDTPLAALALQPIVDAADLAV 722

Query: 623 -PPHKSVLVGDLKMADLKPFLSSKGIQVEF-AGGALRCGEYVTIRKVGPAGQKGGGSGTQ 680
            P  + + VGD+++A+LK  L   G +  F AGG L     V+IRKV  +          
Sbjct: 723 APRIEPLRVGDIRLAELKQALGKLGFRAVFQAGGVLVVDGKVSIRKVDES---------- 772

Query: 681 QIVIEGPLCEDYYKIRAYLYSQF 703
            +V++G +  D+Y I+  + +Q 
Sbjct: 773 NLVVDGGIGSDFYAIKEVVRAQL 795


>gi|407919362|gb|EKG12612.1| Beta-lactamase-like protein [Macrophomina phaseolina MS6]
          Length = 842

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 104/386 (26%), Positives = 183/386 (47%), Gaps = 29/386 (7%)

Query: 20  SYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQ 77
           S+++   G   ++D G +  +D     P       ST+D +L+SH    H  +LPY + +
Sbjct: 39  SHIIQYKGKTVMLDAGMHPAYDGLAALPFYDEFDLSTVDVLLISHFHIDHAASLPYVLSK 98

Query: 78  LGLSAPVFSTEPVYRLGLLTMYDQY----LSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQ 133
                 VF T P   +    + D      +S    S   L+T  D  S F  +  + Y  
Sbjct: 99  TNFKGRVFMTHPTKAIYKWLIQDSVRVGNISSSSESRIQLYTEADHLSTFPQIEAIDYYT 158

Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 193
            + +S     I + P+ AGH+LG  ++ I   G  +++  DY+R +++HL    +   V+
Sbjct: 159 THTISS----IRITPYPAGHVLGAAMFLIEIAGLKILFTGDYSREEDRHLISAEVPKNVK 214

Query: 194 PAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 252
             VLIT++   + +  PR +RE     +I+  +  GG  LLPV + GR  ELLLIL++YW
Sbjct: 215 VDVLITESTFGIASHVPRLEREAALMKSITGIINRGGRALLPVFALGRAQELLLILDEYW 274

Query: 253 AEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF-----------ETSRDNAFL 299
           A+H      PIY+ + ++   +   ++++  M D+I + F           + S+   + 
Sbjct: 275 AKHPEFQKIPIYYASNIARKCMVVYQTYVYAMNDNIKRLFRERMEEAERNGDASKAGPWD 334

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
            K+V  L +    D+   G  ++LAS   ++ G S ++   WA D +N V+ T     GT
Sbjct: 335 FKYVRSLKSLERFDDV--GSCVMLASPGMMQNGVSRELLERWAPDQRNGVIMTGYSVEGT 392

Query: 360 LARMLQADP---PPKAVKVTMSRRVP 382
           + +M+  +P   P    +  ++RR P
Sbjct: 393 MGKMILHEPEQIPAVMTRANVARRGP 418


>gi|359321645|ref|XP_003639652.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Canis lupus familiaris]
          Length = 717

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 62  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 121

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 122 FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 173

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 174 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 232

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 233 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 292

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 293 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 350

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 351 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 406

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 407 ITTMSGQKLPL 417


>gi|300676780|gb|ADK26656.1| cleavage and polyadenylation specific factor 3, 73kDa [Zonotrichia
           albicollis]
          Length = 721

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 66  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 125

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 126 FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 177

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 178 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 236

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 237 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 296

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 297 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 354

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 355 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 410

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 411 ITTMSGQKLPL 421


>gi|417412420|gb|JAA52597.1| Putative cleavage and polyadenylation specificity factor cpsf
           subunit, partial [Desmodus rotundus]
          Length = 714

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 59  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 118

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 119 FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 170

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 171 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 229

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 230 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 289

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 290 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 347

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 348 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 403

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 404 ITTMSGQKLPL 414


>gi|432100623|gb|ELK29151.1| Cleavage and polyadenylation specificity factor subunit 3 [Myotis
           davidii]
          Length = 684

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 99/375 (26%), Positives = 190/375 (50%), Gaps = 32/375 (8%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSL--LQPLSKV----ASTIDAVLLSHPDTLHLGALPYAM 75
           ++   G   ++DCG      P L  +  L+ +     + ID +L+SH    H GALP+ +
Sbjct: 29  ILEFKGRKIMLDCG----IHPGLEGMDALAYIDLIDPAEIDLLLISHFHLDHCGALPWFL 84

Query: 76  KQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTY 131
           ++       F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +  
Sbjct: 85  QKTSFKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI-- 138

Query: 132 SQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESF 191
             N+H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + 
Sbjct: 139 --NFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN- 195

Query: 192 VRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILED 250
           ++P +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++
Sbjct: 196 IKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDE 255

Query: 251 YWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLIN 308
           YW  H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +
Sbjct: 256 YWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKS 313

Query: 309 KSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
               D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++ 
Sbjct: 314 MDHFDDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE- 370

Query: 369 PPKAVKVTMSRRVPL 383
            P+ +     +++PL
Sbjct: 371 -PEEITTMSGQKLPL 384


>gi|344280152|ref|XP_003411849.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Loxodonta africana]
          Length = 903

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 101/385 (26%), Positives = 192/385 (49%), Gaps = 25/385 (6%)

Query: 9   PLSGVFNENPLSYLV-SIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDT 65
           P  G   E   S ++    G   ++DCG +   +     P   +   + ID +L+SH   
Sbjct: 234 PFPGAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHL 293

Query: 66  LHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDS 121
            H GALP+ +++       F   +T+ +YR     +   Y+    +S  D L+T  D++ 
Sbjct: 294 DHCGALPWFLQKTSFKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLED 349

Query: 122 AFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEK 181
           +   +  +    N+H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++
Sbjct: 350 SMDKIETI----NFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDR 405

Query: 182 HLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGR 240
           HL    + + ++P +LI ++    H    R++RE  F + +   +  GG  L+PV + GR
Sbjct: 406 HLMAAEIPN-IKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGR 464

Query: 241 VLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAF 298
             ELLLIL++YW  H    + PIY+ + ++   +   ++++  M D I K      +N F
Sbjct: 465 AQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPF 522

Query: 299 LLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFG 358
           + KH++ L +    D+   GP +V+AS   +++G S ++F  W +D +N V+       G
Sbjct: 523 VFKHISNLKSMDHFDDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEG 580

Query: 359 TLARMLQADPPPKAVKVTMSRRVPL 383
           TLA+ + ++  P+ +     +++PL
Sbjct: 581 TLAKHIMSE--PEEITTMSGQKLPL 603


>gi|126030713|pdb|2I7T|A Chain A, Structure Of Human Cpsf-73
 gi|126030714|pdb|2I7V|A Chain A, Structure Of Human Cpsf-73
          Length = 459

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 98/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR  L      Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRWLL----SDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>gi|67969643|dbj|BAE01170.1| unnamed protein product [Macaca fascicularis]
          Length = 684

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 98/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR  L      Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRWLL----SDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>gi|119576641|gb|EAW56237.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_e
           [Homo sapiens]
          Length = 578

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 169/356 (47%), Gaps = 40/356 (11%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V                   VA H     L  TV +I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKV-------------------VAVH-----LHQTV-QIKVGSESVVYTGDYN 158

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 159 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 217

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 218 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQR 275

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 276 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 328


>gi|320590943|gb|EFX03384.1| polyadenylation specificity factor [Grosmannia clavigera kw1407]
          Length = 1036

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 121/450 (26%), Positives = 186/450 (41%), Gaps = 106/450 (23%)

Query: 9   PLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTL 66
           PL G  +E+  S  L+ +DG    LID GW++ FD   L+ L K   TI  VLL+H    
Sbjct: 6   PLLGAKSESTASQSLLELDGGVKVLIDVGWDESFDAEKLRELEKQVPTISLVLLTHATVS 65

Query: 67  HLGALPYAMKQLG--LSAPVFSTEPVYRLGLLTMYDQYLS---------RRQVSE----- 110
           H+ A  +  K     +  P+F+T+PV  LG   + D Y S         R  ++E     
Sbjct: 66  HIAAFAHCCKNFPQFVRIPIFATKPVIDLGRTLLQDLYASTPLAASTIPRGSLAEASYSY 125

Query: 111 ------------FDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGE-----GIVVAPHVAGH 153
                           T D+I   F  +  L YSQ +            G+ +  + +G 
Sbjct: 126 SQSLSAEHSQFLLQAPTADEITRYFSLIRELKYSQPHQPQAPPSLPPLNGLTITAYNSGR 185

Query: 154 LLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDA 201
            LGGT+W I    E ++Y VD+ + KE   +G             V E   +P  L++ +
Sbjct: 186 TLGGTIWHIQLGLESIVYGVDWGQYKENVFSGAAWIGGGGSGGSEVNEQLRKPTALVSSS 245

Query: 202 YNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL---- 257
                 +P  +  ++ Q AI   +  GG VL+PVDS+ RVLEL  +LE  W + +     
Sbjct: 246 RAPAVLRPGLRDEQLLQ-AIRVCVARGGTVLIPVDSSARVLELAYLLEHAWRKDAAAAAA 304

Query: 258 ------------NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA-------- 297
                          ++     S S + + ++ LEWM D I + FE   D +        
Sbjct: 305 GSNGKEDIGLLARSKLFLAGRTSGSLMRHARTLLEWMNDGIVQEFEAVADGSKQQTNNGG 364

Query: 298 ----------------------------FLLKHVTLLINKSELDNA------PDGPKLVL 323
                                       F +KH+ LL  +++++        P G K++L
Sbjct: 365 NRGRGGGGGGGGGGGNGADDNKNRESGPFDMKHLRLLERRAQVERVLNSQSPPGGGKVIL 424

Query: 324 ASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
           AS AS+E GFS ++    A   +NLVL TE
Sbjct: 425 ASDASMEWGFSKEVLRRIADKPRNLVLLTE 454


>gi|348531581|ref|XP_003453287.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Oreochromis niloticus]
          Length = 690

 Score =  147 bits (370), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 97/356 (27%), Positives = 181/356 (50%), Gaps = 22/356 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 36  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 95

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR  L      Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 96  FKGRTFMTHATKAIYRWLL----SDYVKVSNISADDMLYTETDLEESMDKIETI----NF 147

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + S V+P 
Sbjct: 148 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPS-VKPD 206

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +   G  L+PV + GR  ELLLIL++YW  
Sbjct: 207 ILIIESTYGTHIHEKREEREARFCNTVHDIVNREGRCLIPVFALGRAQELLLILDEYWQN 266

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K+     +N F+ KH++ L +    
Sbjct: 267 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKAINI--NNPFVFKHISNLKSMDHF 324

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++P
Sbjct: 325 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP 378


>gi|321461562|gb|EFX72593.1| hypothetical protein DAPPUDRAFT_308207 [Daphnia pulex]
          Length = 689

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 96/356 (26%), Positives = 182/356 (51%), Gaps = 22/356 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +         P   +  A  ID +L+SH    H GALP+ +++  
Sbjct: 35  MLEFKGKKIMLDCGIHPGLSGMDALPFVDLIEADQIDLLLISHFHLDHCGALPWFLQKTT 94

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVS-EFDLFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S +  L+T  D++++ + +  +    N+
Sbjct: 95  FKGRCFMTHATKAIYRW----LLSDYIKVSNISTDQMLYTEADLEASMEKIEVI----NF 146

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H      G+    + AGH+LG  ++ I   G  V+Y  D++R++++HL    + + VRP 
Sbjct: 147 HEEKDVGGVRFWAYNAGHVLGAAMFMIEIAGVKVLYTGDFSRQEDRHLMAAEIPT-VRPD 205

Query: 196 VLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LIT++    H    R+ RE  F   I + +  GG  L+PV + GR  ELLLIL++YW+ 
Sbjct: 206 ILITESTYGTHIHEKREDRESRFTGLIHEIVNRGGRCLIPVFALGRAQELLLILDEYWSL 265

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H      PIY+ + ++   +   ++++  M D I +  + + +N F+ KH++ L    + 
Sbjct: 266 HPELHEIPIYYASSLAQKCMAVYQTYINAMNDKIRR--QIAINNPFIFKHISSLKGIDQF 323

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           ++   GP +++AS   +++G S ++F  W +D KN  +       GTLA+ + ++P
Sbjct: 324 EDV--GPCVIMASPGMMQSGLSRELFEAWCTDPKNGCIIAGYCVEGTLAKHVLSEP 377


>gi|149050991|gb|EDM03164.1| cleavage and polyadenylation specificity factor 3, isoform CRA_a
           [Rattus norvegicus]
          Length = 685

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 187/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   ++ G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>gi|71795627|ref|NP_001025201.1| cleavage and polyadenylation specificity factor subunit 3 [Rattus
           norvegicus]
 gi|71121802|gb|AAH99817.1| Cleavage and polyadenylation specificity factor 3 [Rattus
           norvegicus]
          Length = 685

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 187/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGMKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   ++ G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>gi|71682600|gb|AAI00570.1| Cpsf3 protein [Mus musculus]
          Length = 512

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   + +   P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGTDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   ++ G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>gi|148702078|gb|EDL34025.1| cleavage and polyadenylation specificity factor 3, isoform CRA_b
           [Mus musculus]
          Length = 701

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 187/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 46  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 105

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 106 FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 157

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 158 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 216

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 217 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 276

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 277 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 334

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   ++ G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 335 DDI--GPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 390

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 391 ITTMSGQKLPL 401


>gi|31980904|ref|NP_061283.2| cleavage and polyadenylation specificity factor subunit 3 [Mus
           musculus]
 gi|341940395|sp|Q9QXK7.2|CPSF3_MOUSE RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 3; AltName: Full=Cleavage and polyadenylation
           specificity factor 73 kDa subunit; Short=CPSF 73 kDa
           subunit; Short=mRNA 3'-end-processing endonuclease
           CPSF-73
 gi|23271024|gb|AAH23297.1| Cleavage and polyadenylation specificity factor 3 [Mus musculus]
          Length = 684

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 187/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   ++ G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>gi|74221128|dbj|BAE42066.1| unnamed protein product [Mus musculus]
          Length = 684

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 98/371 (26%), Positives = 187/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR  L      Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRWLL----SDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   ++ G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>gi|219123319|ref|XP_002181974.1| cleavage and polyadenylation specific factor [Phaeodactylum
           tricornutum CCAP 1055/1]
 gi|217406575|gb|EEC46514.1| cleavage and polyadenylation specific factor [Phaeodactylum
           tricornutum CCAP 1055/1]
          Length = 1001

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 104/338 (30%), Positives = 169/338 (50%), Gaps = 38/338 (11%)

Query: 56  DAVLLSHPDTLHLGALPYAMKQLGLSAP------VFSTEPVYRLGLLTMYDQYLSRRQVS 109
           D ++L+      LG LP   +Q+  + P      +++T P  ++G +T+YDQ+ +     
Sbjct: 72  DCLVLTDSTLQALGGLPMYYRQMKDTQPDLPLPPIYATFPTVKMGQMTLYDQHAAISLDG 131

Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHL-----SGKGEGIVVAPHVAGHLLGGTVWKITK 164
               +TL D+D  F SV  + YSQ   +     + K   + V  H AGH++GG  + + +
Sbjct: 132 GQPPYTLRDLDDVFASVHAIKYSQAMRVYPRDTNTKHASLSVTAHRAGHVVGGAFYVVQR 191

Query: 165 --DGEDVIYAVDYNRRKEKHLNG-TVLESFVRPAVLITD--------AYNALHNQ----- 208
             D   V+    Y+  KE HL+  T+L+    P VL+T         A + + N      
Sbjct: 192 LRDETVVVLTTQYHVAKELHLDSSTILKHATTPDVLVTHPGGPALRLARSNVQNTVTPLV 251

Query: 209 PPR---QQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL--NYPIYF 263
           PP+   Q   +  + +   LR  GNVLLP D +GRVLE+LL L ++W  H L  +Y + +
Sbjct: 252 PPQMVTQVERVLVETVLSVLRRDGNVLLPCDVSGRVLEVLLALHNHWDRHRLAASYHLIW 311

Query: 264 LTYVSSSTIDYVKSFLEWMGDSITKSFET-SRDNAFLLKHVTLLINKSELDN----APDG 318
              ++ + +D+ +S LEWMG  +   F+  +  +   L HV +  N  EL+      P+ 
Sbjct: 312 CGPMAPNVLDFARSQLEWMGTKLGHVFDAQAGPHPLTLPHVHVCTNTRELEKFLAENPN- 370

Query: 319 PKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQ 356
           P  V+AS  SLE G + D+ + WA +V N +LFT+  Q
Sbjct: 371 PACVVASGLSLEGGPARDLLLSWADNVDNAILFTDASQ 408



 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 51/186 (27%), Positives = 89/186 (47%), Gaps = 21/186 (11%)

Query: 528  VSNELTVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSN 587
            +++E+TVL   +   T+ +           V  P   E I++     AY V+L +     
Sbjct: 830  LTDEVTVLAKATKAFTQGMHD---------VRMPSDGEVIELKVGHAAYAVRLIDTPYHP 880

Query: 588  VLFKKLGDYE---IAWVDAEVGK--TENGMLSLLPISTPAPPHKSVLV--GDLKMADLKP 640
            +  ++  D     I   +A+VG+    +G + L P  + A    S+ +  GD+ + DL+ 
Sbjct: 881  LKEREAADLSHEPIESFEAKVGQKVAADGSIVLAPKDSGANDDPSIYLSDGDVLLTDLRA 940

Query: 641  FLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLY 700
             L +KG++ E++  A    + V   KV    QK   SG  Q+ +EGPLCED+Y +R  + 
Sbjct: 941  ELIAKGMKAEYSTKA-GVAQLVVNGKV--LVQKAQDSG--QLEVEGPLCEDFYLVRGVVC 995

Query: 701  SQFYLL 706
             QF ++
Sbjct: 996  GQFTVV 1001


>gi|171679503|ref|XP_001904698.1| hypothetical protein [Podospora anserina S mat+]
 gi|170939377|emb|CAP64605.1| unnamed protein product [Podospora anserina S mat+]
          Length = 967

 Score =  146 bits (369), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 118/431 (27%), Positives = 181/431 (41%), Gaps = 86/431 (19%)

Query: 9   PLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTL 66
           PL G  +E+  S  L+ +DG    L+D GW++ F    L+ L K   T+  +LL+H    
Sbjct: 6   PLQGALSESTASQSLLELDGGVKILVDVGWDETFAVEKLRELEKQVPTLSFILLTHATVA 65

Query: 67  HLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLS-------------------R 105
           H+GA  +  K + L  + P ++T PV  LG     D Y S                    
Sbjct: 66  HIGAYAHCCKHIPLFSTIPAYATRPVIDLGRTLTQDLYASTPLAATTIPTSSLAEVAYAS 125

Query: 106 RQVSEFDLFTL------DDIDSAFQSVTRLTYSQNYH-----LSGKGEGIVVAPHVAGHL 154
            Q    +   L      ++I   F ++  + YSQ         S     + V  + +G  
Sbjct: 126 SQAPSLNPNLLLQPPSPEEITRYFANIQAVQYSQPQQPRSSPFSPDITNLTVTAYNSGRT 185

Query: 155 LGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT--------------VLESFVRPAVLITD 200
           LGG +W I    E ++YAVD+N+ KE   +G               V+E   +P  L+  
Sbjct: 186 LGGAIWHIQHGLESIVYAVDWNQGKENVFSGAAWLSGGHGGGGSTEVIEQLRKPTALVCS 245

Query: 201 AYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA------- 253
           +          ++ E   ++I   +  GG VL+PVDS+ RVLEL  +LE  W        
Sbjct: 246 SRTPDATLSRAKRDEQLLESIKLCIARGGTVLIPVDSSARVLELSYLLEHAWRNEVDNNN 305

Query: 254 -EHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA--------------- 297
            E   N  +Y   +   ST+ + +S  EWM D I + FE +                   
Sbjct: 306 NETFRNAQLYLAGHSIGSTLKHARSLFEWMDDKIVREFEAAAGGKESHSRGQRGGHHHDH 365

Query: 298 -----FLLKHVTLLINKSEL---------DNAPDGPKLVLASMASLEAGFSHDIFVEWAS 343
                F  KH+ LL  K ++         D  P G +++LA+ +SLE GFS ++    A 
Sbjct: 366 KVAGPFDFKHLRLLERKGQVSWVLKQALEDLEPKG-RVILATDSSLEWGFSKEVLKSIAG 424

Query: 344 DVKNLVLFTER 354
           D +NLVL TE+
Sbjct: 425 DARNLVLLTEK 435


>gi|195995883|ref|XP_002107810.1| hypothetical protein TRIADDRAFT_19764 [Trichoplax adhaerens]
 gi|190588586|gb|EDV28608.1| hypothetical protein TRIADDRAFT_19764 [Trichoplax adhaerens]
          Length = 636

 Score =  146 bits (369), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 100/359 (27%), Positives = 184/359 (51%), Gaps = 24/359 (6%)

Query: 20  SYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVAS--TIDAVLLSHPDTLHLGALPYAMKQ 77
            +++       ++DCG +         P + + +   ID +L+SH    H GALP+ +++
Sbjct: 38  CHIIQYKNKTIMLDCGIHPGRHGVEALPYTDIIAEDQIDLLLISHFHLDHCGALPWFLER 97

Query: 78  LGLSAPVF---STEPVYRLGLLTMYDQY--LSRRQVSEFDLFTLDDIDSAFQSVTRLTYS 132
                 VF   +T+ +YR  LL  Y +   +S  Q+    L+T  D++ +   +  +   
Sbjct: 98  TSFKGRVFMTHATKAIYRW-LLADYVKVSNISTDQM----LYTEKDLEKSMTKIETI--- 149

Query: 133 QNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFV 192
            ++H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + S V
Sbjct: 150 -HFHQEKEVNGIKFWCYNAGHVLGAAMFMIEIAGVKILYTGDFSRQEDRHLMAAEIPS-V 207

Query: 193 RPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDY 251
           +P VLI ++   +H    R+ RE  F   +   +  GG  L+PV + GR  ELLLIL++Y
Sbjct: 208 KPDVLIIESTYGVHIHEKREIREKRFTSTVHDIVNRGGRCLIPVFALGRAQELLLILDEY 267

Query: 252 WAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINK 309
           W+ H+   + PIY+ + ++   +   ++++  M D I      S  N F+ KH++ L   
Sbjct: 268 WSNHTELHDIPIYYASSLAKKCMAVYQTYVSAMNDKIRNQIAIS--NPFIFKHISNLKGI 325

Query: 310 SELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
              D+   GP +V+AS   +++G S ++F +W +D KN V+       GTLA+ + ++P
Sbjct: 326 DHFDDI--GPCVVMASPGMMQSGLSRELFEKWCTDSKNGVVIAGYCVEGTLAKEVMSEP 382


>gi|74178650|dbj|BAE33998.1| unnamed protein product [Mus musculus]
          Length = 684

 Score =  146 bits (369), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 187/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   ++ G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>gi|195037533|ref|XP_001990215.1| GH19212 [Drosophila grimshawi]
 gi|193894411|gb|EDV93277.1| GH19212 [Drosophila grimshawi]
          Length = 686

 Score =  146 bits (368), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 100/375 (26%), Positives = 186/375 (49%), Gaps = 26/375 (6%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV----ASTIDAVLL 60
           +Q+ PL           ++   G   ++DCG   H   S +  L  V    A  ID + +
Sbjct: 20  LQIKPLGAGQEVGRSCIMLEFKGKKIMLDCGI--HPGLSGMDALPYVDLIEADEIDLLFI 77

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTL 116
           SH    H GALP+ + +       F   +T+ +YR     M   Y+    +S    L+T 
Sbjct: 78  SHFHLDHCGALPWFLMKTSFRGRCFMTHATKAIYRW----MLSDYIKISNISTDQMLYTE 133

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
            D++++ + +  +    N+H      G+    + AGH+LG  ++ I   G  ++Y  D++
Sbjct: 134 ADLEASMEKIETI----NFHEERDVMGVRFCAYNAGHVLGAAMFMIEIAGIKILYTGDFS 189

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPV 235
           R++++HL    +    +P VLIT++    H    R+ RE  F   + K ++ GG  L+PV
Sbjct: 190 RQEDRHLMAAEVPP-KKPDVLITESTYGTHIHEKREDRESRFTTLVQKIVQQGGRCLIPV 248

Query: 236 DSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
            + GR  ELLLIL++YW+++      PIY+ + ++   +   ++++  M D I +  + +
Sbjct: 249 FALGRAQELLLILDEYWSQNPDLHEIPIYYASSLAKKCMAVYQTYINAMNDRIRR--QIA 306

Query: 294 RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
            +N F+ +H++ L      D+   GP +++AS   +++G S ++F  W +D KN V+   
Sbjct: 307 VNNPFVFRHISNLKGIDHFDDI--GPCVIMASPGMMQSGLSRELFESWCTDPKNGVIIAG 364

Query: 354 RGQFGTLARMLQADP 368
               GTLA+ + ++P
Sbjct: 365 YCVEGTLAKTILSEP 379


>gi|452819966|gb|EME27015.1| cleavage and polyadenylation specifity factor protein [Galdieria
           sulphuraria]
          Length = 717

 Score =  146 bits (368), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 107/385 (27%), Positives = 187/385 (48%), Gaps = 44/385 (11%)

Query: 4   SVQVTPLSGVFNENPLS-YLVSIDGFNFLIDCG------------WNDHFDPSLLQPLSK 50
           ++Q+TPL G  NE   S  L++      + DCG            + D  DP        
Sbjct: 23  TLQITPL-GAGNEVGRSCVLLTYKNKTIMFDCGVHPAYSGLASLPFFDEMDPR------- 74

Query: 51  VASTIDAVLLSHPDTLHLGALPYAMKQLGLS--APVFSTEPVYRLGLLTMYDQYLS---R 105
              +ID +L++H    H  ALPY +++   +  A VF T P        +Y   LS   R
Sbjct: 75  ---SIDLILITHFHLDHCAALPYLLEKTNCNPNARVFMTHPTK-----AIYKTLLSDFVR 126

Query: 106 RQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKD 165
              +E  L++  D+    + +  L    +YH      GI    + AGH+LG  ++ +   
Sbjct: 127 VSSNEDVLYSEQDLSRTMKRIETL----DYHQEMNWNGIRFWAYNAGHVLGAAMFLVEIA 182

Query: 166 GEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTL 225
           G  V+Y  D++R++++HL    +  F    +++   Y    ++P + +   F   +++ +
Sbjct: 183 GVRVLYTGDFSRQEDRHLKEAEIPPFPPDIIIVESTYGVQVHEPRKIREARFTQKVAEIV 242

Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMG 283
           R GG VLLPV + GR  ELLLILE+YW  H    + PIY+ + ++   +   ++++  M 
Sbjct: 243 RRGGRVLLPVFALGRAQELLLILEEYWEAHPDLQDIPIYYASSLAKRCMSVYQTYINMMN 302

Query: 284 DSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWAS 343
           D+I K +E S  N F  K+V  + N  + D++  GP + +AS   L++G S ++   W +
Sbjct: 303 DNIRKRYEVS--NPFAFKYVLNVKNIQDFDDS--GPCVFMASPGMLQSGLSRELCERWCT 358

Query: 344 DVKNLVLFTERGQFGTLARMLQADP 368
           D +N ++       GTLA+ + ++P
Sbjct: 359 DRRNGIILPGYSVEGTLAKHILSEP 383


>gi|403373777|gb|EJY86813.1| Cleavage and polyadenylation specificity factor subunit 3
           [Oxytricha trifallax]
          Length = 755

 Score =  146 bits (368), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 99/371 (26%), Positives = 175/371 (47%), Gaps = 15/371 (4%)

Query: 2   GTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVAS--TIDAVL 59
           G  +++TPL            +   G   ++DCG +   D     P   V +   +D +L
Sbjct: 24  GDFLEITPLGAGCEVGRSCIYLECKGKKIMLDCGIHPGKDGVQALPYFDVINPKELDLIL 83

Query: 60  LSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDI 119
           ++H    H   LPY +++      V+ T P   +    M D         +  LF  +D+
Sbjct: 84  ITHFHVDHCAGLPYFLEKTDFKGKVYMTHPTKSIYNYVMQDFVKVSNIAIDEKLFDENDL 143

Query: 120 DSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRK 179
            +    +    Y  +YH   +  GI  + + AGH+LG +++ I  DG  ++Y  DY+R +
Sbjct: 144 KNTLDKI----YMLDYHQEVEENGIKFSCYRAGHVLGASMFLIEIDGVKILYTGDYSREE 199

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAG 239
           ++HL    L +     +++   Y    ++   ++ E F   +   ++ GG  LLPV + G
Sbjct: 200 DRHLKPAELPNCEVDVLIVESTYGVQIHEQRDKREERFTKLVHDIVKRGGKCLLPVFALG 259

Query: 240 RVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA 297
           R  E+LLIL +YW ++    N PIY+   ++  ++   +++   MGD +    E S +N 
Sbjct: 260 RAQEILLILNEYWQKNPDIQNVPIYYSGSLAQKSLTVFQTYRNMMGDQLRMELE-SGNNP 318

Query: 298 FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQF 357
           F  + +T   ++SE       P +++AS   L+ G S D+FV+WA D KN ++FT     
Sbjct: 319 FHFEPITTFNDESEF------PLVIMASPGMLQNGQSRDLFVKWAPDPKNGIVFTGYSVE 372

Query: 358 GTLARMLQADP 368
           GTLA+ +   P
Sbjct: 373 GTLAKSVMNRP 383


>gi|354504216|ref|XP_003514173.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3 [Cricetulus griseus]
          Length = 684

 Score =  146 bits (368), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 187/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   ++ G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>gi|302793925|ref|XP_002978727.1| hypothetical protein SELMODRAFT_109555 [Selaginella moellendorffii]
 gi|300153536|gb|EFJ20174.1| hypothetical protein SELMODRAFT_109555 [Selaginella moellendorffii]
          Length = 522

 Score =  146 bits (368), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 100/360 (27%), Positives = 174/360 (48%), Gaps = 20/360 (5%)

Query: 22  LVSIDGFNFLIDCGWNDHF-DPSLLQPLSKVAST------IDAVLLSHPDTLHLGALPYA 74
           +VS+ G   + DCG +  + D       S+++ T      ID V+++H    H+GALPY 
Sbjct: 17  IVSMGGKKIMFDCGMHMGYQDERRFPDFSQISKTGDFTHEIDCVIVTHFHLDHVGALPYF 76

Query: 75  MKQLGLSAPVFSTEPVYRLG--LLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYS 132
            +  G   PV+ T P   L   +L  Y + +  R+  E    TL  I    + V  +   
Sbjct: 77  TEVCGYEGPVYMTYPTKALAPIMLEDYRKIMVDRRGEEEQFSTLH-IQQCMKKVIAVDLR 135

Query: 133 QNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFV 192
           Q   +S   + +    + AGH+LG  ++ +      V+Y  DYN   ++HL    ++  +
Sbjct: 136 QTIRVS---KDLAFRAYYAGHVLGAAMFYVKAGNSTVVYTGDYNMTPDRHLGAAQIDR-L 191

Query: 193 RPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDY 251
           +P +LIT++  A   +  R  +E  F + +   +  GG VL+P+ + GR  EL ++L++Y
Sbjct: 192 KPDLLITESTYATTIRESRLAKEAEFLNVVHTCVSKGGKVLIPISALGRAQELCILLDEY 251

Query: 252 WAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSE 311
           W   +L  PIYF   ++  +  Y K  + W    I  ++ T   NAF  KHV    ++++
Sbjct: 252 WERMNLKVPIYFSAGLTMQSNAYYKLLISWTNQRIKDTYVTR--NAFDFKHV-FPFDRTQ 308

Query: 312 LDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPK 371
           LD   +GP ++ A+   L  G S ++   WA   +NL++       GT+A+ L +  P +
Sbjct: 309 LDG--NGPCILFATPGMLTGGLSLEVLKHWAPVEQNLLIIPGFCLAGTVAQKLCSGKPTR 366


>gi|149050992|gb|EDM03165.1| cleavage and polyadenylation specificity factor 3, isoform CRA_b
           [Rattus norvegicus]
          Length = 605

 Score =  146 bits (368), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 98/371 (26%), Positives = 187/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR  L      Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRWLL----SDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   ++ G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>gi|405963469|gb|EKC29039.1| Cleavage and polyadenylation specificity factor subunit 3
           [Crassostrea gigas]
          Length = 686

 Score =  146 bits (368), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 96/358 (26%), Positives = 177/358 (49%), Gaps = 22/358 (6%)

Query: 20  SYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVAST--IDAVLLSHPDTLHLGALPYAMKQ 77
            +L+   G   ++DCG +   +     P   +     +D +L+SH    H GALPY +++
Sbjct: 32  CHLLEFKGKKIMLDCGIHPGLNGFASLPFLDLVEVEEVDLLLISHFHLDHCGALPYFLEK 91

Query: 78  LGLSAPVFST---EPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQ 133
                  F T   + +YR  L      Y+    ++  D L+T  DI+++   +  +    
Sbjct: 92  TQFKGRCFMTHASKAIYRWLL----SDYVKVSNIATEDMLYTESDIENSMDKIETI---- 143

Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 193
           N+H   +  GI    + AGH+LG  ++ I   G  V+Y  D++R++++HL    +   + 
Sbjct: 144 NFHQEVEVNGIKFWCYTAGHVLGAAMFMIEIAGVRVLYTGDFSRQEDRHLMAAEIPR-IH 202

Query: 194 PAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 252
           P V+I ++    H    R+ RE  F   +   +  GG  L+PV + GR  ELLLIL++YW
Sbjct: 203 PDVVIIESTYGTHIHEKREDREARFTGLVHDIVSRGGRCLIPVFALGRAQELLLILDEYW 262

Query: 253 AEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKS 310
           + H    + PIY+ + ++   +   ++++  M + I +    S  N F+ KH++ L +  
Sbjct: 263 SNHPELHDIPIYYASSLAKKCMSVYQTYINAMNEKIRRQINIS--NPFVFKHISNLKSME 320

Query: 311 ELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
             ++   GP +VLAS   +++G S ++F  W +D +N  +       GTLA+ + ++P
Sbjct: 321 HFEDI--GPSVVLASPGMMQSGLSRELFESWCTDKRNGCIIAGYCVEGTLAKHILSEP 376


>gi|344232758|gb|EGV64631.1| Metallo-hydrolase/oxidoreductase [Candida tenuis ATCC 10573]
          Length = 782

 Score =  146 bits (368), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 108/354 (30%), Positives = 178/354 (50%), Gaps = 32/354 (9%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVS 109
           S +D +L+SH    H  +LPY M+       VF   +T+ +YR  LLT + +  S    +
Sbjct: 60  SKVDLLLVSHFHLDHAASLPYVMQHTNFRGRVFMTHATKAIYRW-LLTDFVRVTSLSSNT 118

Query: 110 EFD---------LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVW 160
             D         L+T +D+  +F  +  +    ++H + + +GI    + AGH+LG  ++
Sbjct: 119 SNDPNSGGTSANLYTDEDLMKSFDRIETV----DFHSTMELDGIRFTAYHAGHVLGACLY 174

Query: 161 KITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQD 219
            I   G   ++  DY+R + +HL    + S V+P +LIT++        PR ++E     
Sbjct: 175 LIEIGGLKALFTGDYSREENRHLPVAEVPS-VKPDILITESTFGTATHEPRMEKENRMTR 233

Query: 220 AISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKS 277
            I  TL  GG VL+PV + G   ELLLILE+YW+++    N  +YF + ++   +   ++
Sbjct: 234 IIHSTLSKGGRVLMPVFALGTAQELLLILEEYWSQNKDLQNIDVYFASSLARKCLAVYQT 293

Query: 278 FLEWMGDSITKSFETS---RDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGF 333
           +   M D I     +S   R N F  K++  L     LD   D GP +V+AS   L++GF
Sbjct: 294 YTNIMNDKIRSMASSSSYDRKNPFTFKYIKTL---KSLDRFQDFGPSVVIASPGMLQSGF 350

Query: 334 SHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPP----KAVKVTMSRRVPL 383
           S  +  +WA D KN VL T     GT+A+ L  +PP        ++T++RR+ +
Sbjct: 351 SRQLLEKWAPDPKNTVLMTGYSVEGTMAKDLLIEPPTIPSVNNPEMTITRRLSI 404


>gi|281351872|gb|EFB27456.1| hypothetical protein PANDA_012399 [Ailuropoda melanoleuca]
          Length = 648

 Score =  145 bits (367), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 98/363 (26%), Positives = 185/363 (50%), Gaps = 24/363 (6%)

Query: 30  FLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF-- 85
           F +DCG +   +     P   +   + ID +L+SH    H GALP+ +++       F  
Sbjct: 1   FQLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTSFKGRTFMT 60

Query: 86  -STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEG 143
            +T+ +YR  L      Y+    +S  D L+T  D++ +   +  +    N+H   +  G
Sbjct: 61  HATKAIYRWLL----SDYVKVSNISADDMLYTETDLEESMDKIETI----NFHEVKEVAG 112

Query: 144 IVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYN 203
           I    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P +LI ++  
Sbjct: 113 IKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPDILIIESTY 171

Query: 204 ALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYP 260
             H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  H    + P
Sbjct: 172 GTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIP 231

Query: 261 IYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPK 320
           IY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    D+   GP 
Sbjct: 232 IYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHFDDI--GPS 287

Query: 321 LVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRR 380
           +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ +     ++
Sbjct: 288 VVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEEITTMSGQK 345

Query: 381 VPL 383
           +PL
Sbjct: 346 LPL 348


>gi|427779921|gb|JAA55412.1| Putative cleavage and polyadenylation specificity factor cpsf
           subunit [Rhipicephalus pulchellus]
          Length = 737

 Score =  145 bits (367), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 101/373 (27%), Positives = 192/373 (51%), Gaps = 28/373 (7%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKV----ASTIDAVLLSHPDTLHLGALPYAMKQ 77
           ++   G   ++DCG   H   S L  L  V    A  ID +L+SH    H GALP+ +++
Sbjct: 85  MLEFKGKRIMLDCGI--HPGMSGLDALPYVDLIEADEIDLLLVSHFHLDHCGALPWFLQK 142

Query: 78  LGLSAPVF---STEPVYRLGLLTMYDQYLSRRQV-SEFDLFTLDDIDSAFQSVTRLTYSQ 133
                  F   +T+ +YR     +   Y+    + +E  L++  D++S+ + +  +    
Sbjct: 143 TTFKGRCFMTHATKAIYRW----LLADYIKVSNIGTEQMLYSEADLESSMEKIETI---- 194

Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 193
           N+H      GI    + AGH+LG  ++ I   G  V+Y  D++R++++HL    + + + 
Sbjct: 195 NFHEEKDVNGIRFWCYNAGHVLGAAMFMIEIAGVKVLYTGDFSRQEDRHLMAAEIPN-IH 253

Query: 194 PAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 252
           P VLI ++    H    R++RE  F   +   +  GG  L+PV + GR  ELLLIL++YW
Sbjct: 254 PDVLIIESTYGTHIHEKREEREARFTGLVHDIVNRGGRCLIPVFALGRAQELLLILDEYW 313

Query: 253 AEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKS 310
           + H    + PIY+ + ++   +   ++++  M + I +  + + +N F+ KH++ L +  
Sbjct: 314 SNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNERIRR--QITINNPFVFKHISNLKSIE 371

Query: 311 ELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPP 370
             ++   GP +V+AS   +++G S ++F  W +D KN V+       GTLA+ + ++  P
Sbjct: 372 HFEDI--GPCVVMASPGMMQSGLSRELFESWCTDPKNGVIIAGYCVEGTLAKTILSE--P 427

Query: 371 KAVKVTMSRRVPL 383
           + +   + +++PL
Sbjct: 428 EEISTMVGQKLPL 440


>gi|74211665|dbj|BAE29190.1| unnamed protein product [Mus musculus]
          Length = 684

 Score =  145 bits (367), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 98/371 (26%), Positives = 187/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR  L      Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRWLL----SDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYRTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   ++ G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>gi|240975718|ref|XP_002402161.1| cleavage and polyadenylation specificity factor, putative [Ixodes
           scapularis]
 gi|215491113|gb|EEC00754.1| cleavage and polyadenylation specificity factor, putative [Ixodes
           scapularis]
          Length = 694

 Score =  145 bits (367), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 100/373 (26%), Positives = 193/373 (51%), Gaps = 28/373 (7%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKV----ASTIDAVLLSHPDTLHLGALPYAMKQ 77
           ++   G   ++DCG   H   S L  L  V    A  ID +L+SH    H GALP+ +++
Sbjct: 42  ILEFKGKRIMLDCGI--HPGMSGLDALPYVDLIEADEIDLLLVSHFHLDHCGALPWFLQK 99

Query: 78  LGLSAPVF---STEPVYRLGLLTMYDQYLSRRQV-SEFDLFTLDDIDSAFQSVTRLTYSQ 133
                  F   +T+ +YR     +   Y+    + +E  L++  D++++ + +  +    
Sbjct: 100 TTFKGRCFMTHATKAIYRW----LLADYIKVSNIGTEQMLYSETDLEASMEKIETI---- 151

Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 193
           N+H   +  GI    + AGH+LG  ++ I   G  V+Y  D++R++++HL    + + + 
Sbjct: 152 NFHEEKEVNGIRFWCYNAGHVLGAAMFMIEIAGVKVLYTGDFSRQEDRHLMAAEIPN-IH 210

Query: 194 PAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 252
           P VLI ++    H    R++RE  F   +   +  GG  L+PV + GR  ELLLIL++YW
Sbjct: 211 PDVLIIESTYGTHIHEKREEREARFTGLVHDIVNRGGRCLIPVFALGRAQELLLILDEYW 270

Query: 253 AEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKS 310
           + H    + PIY+ + ++   +   ++++  M + I +  + + +N F+ KH++ L +  
Sbjct: 271 SNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNERIRR--QITINNPFVFKHISNLKSIE 328

Query: 311 ELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPP 370
             ++   GP +V+AS   +++G S ++F  W +D KN V+       GTLA+ + ++  P
Sbjct: 329 HFEDV--GPCVVMASPGMMQSGLSRELFESWCTDPKNGVIIAGYCVEGTLAKTILSE--P 384

Query: 371 KAVKVTMSRRVPL 383
           + +   + +++PL
Sbjct: 385 EEISTMVGQKLPL 397


>gi|190346159|gb|EDK38177.2| hypothetical protein PGUG_02275 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 770

 Score =  145 bits (367), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 104/359 (28%), Positives = 181/359 (50%), Gaps = 39/359 (10%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLS----- 104
           S +D +L+SH    H  +LPY M+    +  VF   +T+ +YR  LL+ + +  S     
Sbjct: 58  SKVDILLISHFHLDHAASLPYVMQHTNFNGRVFMTHATKAIYRW-LLSDFVRVTSIGGGG 116

Query: 105 -------RRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGG 157
                      +  +L+T DD+  +F  +  +    +YH + + EGI    + AGH+LG 
Sbjct: 117 DSRLNSGNETATSSNLYTDDDLIRSFDRIETI----DYHSTIEVEGIRFTAYHAGHVLGA 172

Query: 158 TVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM- 216
            ++ +   G  V++  DY+R +++HL    +   +RP +LIT++        PR ++E  
Sbjct: 173 CMYFVEIGGLKVLFTGDYSREEDRHLQVAEVPP-MRPDILITESTFGTATHEPRLEKEAR 231

Query: 217 FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE----HSLNYPIYFLTYVSSSTI 272
               I  TL  GG +L+PV + GR  ELLLILE+YW++    H++N  ++F + ++   +
Sbjct: 232 MTKIIHSTLLKGGRILMPVFALGRAQELLLILEEYWSQNEDLHNIN--VFFASSLARKCM 289

Query: 273 DYVKSFLEWMGDSITKSFETS---RDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMAS 328
              +++   M D+I     ++   + N F  KH+ L+     LD   D GP +V+A+   
Sbjct: 290 AVYQTYTNIMNDNIRHGVSSASGGKSNPFQFKHIKLI---RSLDKFQDIGPCVVVAAPGM 346

Query: 329 LEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP----PPKAVKVTMSRRVPL 383
           L+ G S ++   WA D KN V+ T     GT+A+ L  +P      +   VT+ RR+ +
Sbjct: 347 LQNGVSRELLERWAPDAKNAVIMTGYSVEGTMAKELLTEPHTIQSSQNADVTIPRRMAI 405


>gi|355565449|gb|EHH21878.1| hypothetical protein EGK_05038 [Macaca mulatta]
          Length = 650

 Score =  145 bits (367), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 92/338 (27%), Positives = 176/338 (52%), Gaps = 22/338 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVS 109
           + ID +L+SH    H GALP+ +++       F   +T+ +YR     +   Y+    +S
Sbjct: 59  AEIDLLLISHFHLDHCGALPWFLQKTSFKGRTFMTHATKAIYRW----LLSDYVKVSNIS 114

Query: 110 EFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
             D L+T  D++ +   +  +    N+H   +  GI    + AGH+LG  ++ I   G  
Sbjct: 115 ADDMLYTETDLEESMDKIETI----NFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVK 170

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRA 227
           ++Y  D++R++++HL    + + ++P +LI ++    H    R++RE  F + +   +  
Sbjct: 171 LLYTGDFSRQEDRHLMAAEIPN-IKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNR 229

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG  L+PV + GR  ELLLIL++YW  H    + PIY+ + ++   +   ++++  M D 
Sbjct: 230 GGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDK 289

Query: 286 ITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDV 345
           I K      +N F+ KH++ L +    D+   GP +V+AS   +++G S ++F  W +D 
Sbjct: 290 IRKQINI--NNPFVFKHISNLKSMDHFDDI--GPSVVMASPGMMQSGLSRELFESWCTDK 345

Query: 346 KNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
           +N V+       GTLA+ + ++  P+ +     +++PL
Sbjct: 346 RNGVIIAGYCVEGTLAKHIMSE--PEEITTMSGQKLPL 381


>gi|195452860|ref|XP_002073532.1| GK13096 [Drosophila willistoni]
 gi|194169617|gb|EDW84518.1| GK13096 [Drosophila willistoni]
          Length = 684

 Score =  145 bits (366), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 102/391 (26%), Positives = 196/391 (50%), Gaps = 30/391 (7%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV----ASTIDAVLL 60
           +Q+ PL           ++   G   ++DCG   H   S +  L  V    A  ID + +
Sbjct: 18  LQIKPLGAGQEVGRSCIMLEFKGKKIMLDCGI--HPGLSGMDALPYVDLIEADEIDLLFI 75

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVS-EFDLFTL 116
           SH    H GALP+ + +       F   +T+ +YR     M   ++    +S +  L+T 
Sbjct: 76  SHFHIDHCGALPWFLMKTSFKGRCFMTHATKAIYRW----MLSDFIKISNISTDQMLYTE 131

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
            D++++ + +  +    N+H      G+    + AGH+LG  ++ I   G  ++Y  D++
Sbjct: 132 ADLEASMEKIETI----NFHEERDVMGVRFCAYNAGHVLGAAMFMIEIAGIKILYTGDFS 187

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPV 235
           R++++HL    +    +P VLIT++    H    R+ RE  F   + KT+  GG  L+PV
Sbjct: 188 RQEDRHLMAAEVPP-TKPDVLITESTYGTHIHEKREDRESRFTSLVQKTVMQGGRCLIPV 246

Query: 236 DSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
            + GR  ELLLIL+++W+++      PIY+ + ++   +   ++++  M D I +  + +
Sbjct: 247 FALGRAQELLLILDEFWSQNPDLHEIPIYYASSLAKKCMAVYQTYINAMNDRIRR--QIA 304

Query: 294 RDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFT 352
            +N F+ +H++   N   +D+  D GP +++AS   +++G S ++F  W +D KN V+  
Sbjct: 305 VNNPFVFRHIS---NLKGIDHFEDIGPCVIMASPGMMQSGLSRELFESWCTDPKNGVIVA 361

Query: 353 ERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
                GTLA+ + ++  P+ +     +++PL
Sbjct: 362 GYCVEGTLAKTILSE--PEEITTLSGQKLPL 390


>gi|6625904|gb|AAF19420.1|AF203969_1 cleavage and polyadenylation specificity factor 73 kDa subunit [Mus
           musculus]
          Length = 684

 Score =  145 bits (366), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 186/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F   +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFWHTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   ++ G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>gi|223647718|gb|ACN10617.1| Integrator complex subunit 11 [Salmo salar]
          Length = 343

 Score =  145 bits (366), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 94/319 (29%), Positives = 156/319 (48%), Gaps = 16/319 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IKVTPLGAGQDVGRSCILVSIGGKNIMLDCGMHMGFNDDRRFPDFSYITQQGRLTEFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYMSEMVGYDGPIYMTHPTKAICPILLEDFRKITVDKKGETNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  L   Q   +  + E   +  + AGH+LG  + +I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVPLNLHQTVQVDDELE---IKAYYAGHVLGAAMVQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LI+++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDILISESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   ++  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNMKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDN 314
           N F  KH+    ++S  DN
Sbjct: 298 NMFEFKHIKAF-DRSYADN 315


>gi|388852694|emb|CCF53612.1| related to YSH1-component of pre-mRNA polyadenylation factor PF I
           [Ustilago hordei]
          Length = 888

 Score =  145 bits (366), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 92/322 (28%), Positives = 167/322 (51%), Gaps = 13/322 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLS---APVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+DA+L++H    H  AL Y M++         V+ T P   +    M D        +
Sbjct: 74  STVDAILITHFHLDHAAALTYIMEKTNFRDGHGKVYMTHPTKAVYRFLMSDFVRISNAGN 133

Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
           E  LF  +++ ++++ +  + + Q+  ++G   G+    + AGH+LG  ++ I   G  +
Sbjct: 134 EDHLFDENEMLASWRQIEAVDFHQDVSIAG---GLRFTAYHAGHVLGACMFLIEIAGLRI 190

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAG 228
           +Y  D++R +++HL    +   V+P VLI ++        PR  +E  F   I   ++ G
Sbjct: 191 LYTGDFSREEDRHLVQAEIPP-VKPDVLICESTYGTQTHEPRHDKEHRFTSQIHHIIKRG 249

Query: 229 GNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSI 286
           G VLLPV   GR  ELLL+L++YWA H    + PIY+ + ++   I   ++++  M D I
Sbjct: 250 GRVLLPVFVLGRAQELLLLLDEYWAAHPELQSVPIYYASALAKKCISVYQTYIHTMNDHI 309

Query: 287 TKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVK 346
              F   RDN F+ KH++ L +  + ++   GP +++AS   +++G S ++   WA D +
Sbjct: 310 RTRF-NRRDNPFVFKHISNLRSLEKFEDR--GPCVMMASPGFMQSGVSRELLERWAPDKR 366

Query: 347 NLVLFTERGQFGTLARMLQADP 368
           N ++ +     GT+AR +  +P
Sbjct: 367 NGLIVSGYSVEGTMARNILNEP 388


>gi|197102904|ref|NP_001127045.1| cleavage and polyadenylation specificity factor subunit 3 [Pongo
           abelii]
 gi|55733623|emb|CAH93488.1| hypothetical protein [Pongo abelii]
          Length = 647

 Score =  145 bits (366), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 93/338 (27%), Positives = 176/338 (52%), Gaps = 22/338 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVS 109
           + ID +L+SH    H GALP+ +++       F   +T+ +YR  L      Y+    +S
Sbjct: 25  AEIDLLLISHFHLDHCGALPWFLQKTSFKGRTFMTHATKAIYRWLL----SDYVKVSNIS 80

Query: 110 EFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
             D L+T  D++ +   +  +    N+H   +  GI    + AGH+LG  ++ I   G  
Sbjct: 81  ADDMLYTETDLEESMDKIETI----NFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVK 136

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRA 227
           ++Y  D++R++++HL    + + ++P +LI ++    H    R++RE  F + +   +  
Sbjct: 137 LLYTGDFSRQEDRHLMAAEIPN-IKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNR 195

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG  L+PV + GR  ELLLIL++YW  H    + PIY+ + ++   +   ++++  M D 
Sbjct: 196 GGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDK 255

Query: 286 ITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDV 345
           I K      +N F+ KH++ L +    D+   GP +V+AS   +++G S ++F  W +D 
Sbjct: 256 IRKQINI--NNPFVFKHISNLKSMDHFDDI--GPSVVMASPGMMQSGLSRELFESWCTDK 311

Query: 346 KNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
           +N V+       GTLA+ + ++  P+ +     +++PL
Sbjct: 312 RNGVIIAGYCVEGTLAKHIMSE--PEEITTMSGQKLPL 347


>gi|195108751|ref|XP_001998956.1| GI24246 [Drosophila mojavensis]
 gi|193915550|gb|EDW14417.1| GI24246 [Drosophila mojavensis]
          Length = 686

 Score =  145 bits (366), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 102/390 (26%), Positives = 193/390 (49%), Gaps = 28/390 (7%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV----ASTIDAVLL 60
           +Q+ PL           ++   G   ++DCG   H   S +  L  V    A  ID + +
Sbjct: 20  LQIKPLGAGQEVGRSCIMLEFKGKKIMLDCGI--HPGLSGMDALPYVDLIEADEIDLLFI 77

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVS-EFDLFTL 116
           SH    H GALP+ + +       F   +T+ +YR     M   Y+    +S E  L+T 
Sbjct: 78  SHFHLDHCGALPWFLMKTSFRGRCFMTHATKAIYRW----MLSDYIKISNISTEQMLYTE 133

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
            D++++ + +  +    N+H      G+    + AGH+LG  ++ I   G  ++Y  D++
Sbjct: 134 ADLEASMEKIETI----NFHEERDVMGVRFCAYNAGHVLGAAMFMIEIAGIKILYTGDFS 189

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPV 235
           R++++HL    +    +P VLIT++    H    R+ RE  F   + K +  GG  L+PV
Sbjct: 190 RQEDRHLMAAEVPP-KKPDVLITESTYGTHIHEKREDRESRFTSLVQKIVMQGGRCLIPV 248

Query: 236 DSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
            + GR  ELLLIL+++W+++      PIY+ + ++   +   ++++  M D I +  + +
Sbjct: 249 FALGRAQELLLILDEFWSQNPDLHEIPIYYASSLAKKCMAVYQTYINAMNDRIRR--QIA 306

Query: 294 RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
            +N F+ +H++ L      D+   GP +++AS   +++G S ++F  W +D KN V+   
Sbjct: 307 VNNPFVFRHISNLKGIDHFDDI--GPCVIMASPGMMQSGLSRELFESWCTDPKNGVIIAG 364

Query: 354 RGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
               GTLA+ + ++  P+ +     +++PL
Sbjct: 365 YCVEGTLAKTILSE--PEEITTLSGQKLPL 392


>gi|119621395|gb|EAX00990.1| cleavage and polyadenylation specific factor 3, 73kDa, isoform
           CRA_b [Homo sapiens]
          Length = 647

 Score =  145 bits (366), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 93/338 (27%), Positives = 176/338 (52%), Gaps = 22/338 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVS 109
           + ID +L+SH    H GALP+ +++       F   +T+ +YR  L      Y+    +S
Sbjct: 25  AEIDLLLISHFHLDHCGALPWFLQKTSFKGRTFMTHATKAIYRWLL----SDYVKVSNIS 80

Query: 110 EFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
             D L+T  D++ +   +  +    N+H   +  GI    + AGH+LG  ++ I   G  
Sbjct: 81  ADDMLYTETDLEESMDKIETI----NFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVK 136

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRA 227
           ++Y  D++R++++HL    + + ++P +LI ++    H    R++RE  F + +   +  
Sbjct: 137 LLYTGDFSRQEDRHLMAAEIPN-IKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNR 195

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG  L+PV + GR  ELLLIL++YW  H    + PIY+ + ++   +   ++++  M D 
Sbjct: 196 GGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDK 255

Query: 286 ITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDV 345
           I K      +N F+ KH++ L +    D+   GP +V+AS   +++G S ++F  W +D 
Sbjct: 256 IRKQINI--NNPFVFKHISNLKSMDHFDDI--GPSVVMASPGMMQSGLSRELFESWCTDK 311

Query: 346 KNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
           +N V+       GTLA+ + ++  P+ +     +++PL
Sbjct: 312 RNGVIIAGYCVEGTLAKHIMSE--PEEITTMSGQKLPL 347


>gi|307110126|gb|EFN58363.1| hypothetical protein CHLNCDRAFT_142438 [Chlorella variabilis]
          Length = 709

 Score =  145 bits (366), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 101/369 (27%), Positives = 168/369 (45%), Gaps = 28/369 (7%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSH 62
           VQ+ PL           +V   G   ++DCG +  F      P       S +DA+L++H
Sbjct: 25  VQILPLGAGQEVGRSCIIVRYCGKTVMLDCGVHPGFFGIASLPFFDEVDLSEVDAMLVTH 84

Query: 63  PDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSA 122
               H  A+PY          V  T P   +    + D     +  S   L++  D+D+A
Sbjct: 85  FHLDHCAAVPYVTGHTSFRGRVLMTHPTKAIVHTLLKDFVKVSKGGSGEGLYSERDLDAA 144

Query: 123 FQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKH 182
            +    + + Q   L    +GI V  + AGH+LG  ++ +   G  ++Y  DY+R  ++H
Sbjct: 145 MERTEVIDFHQTVDL----DGIRVTAYRAGHVLGAAMFMVEVGGMRLLYTGDYSRIPDRH 200

Query: 183 LNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRV 241
           +    L +  RP +++ ++   +    PR++RE  F   I   +  GG VLLPV + GR 
Sbjct: 201 MPAADLPA-QRPHIVVVESTYGVSRHLPREEREQRFVQRIHTAVARGGRVLLPVVALGRA 259

Query: 242 LELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
            ELLLILE+YW  H      PIY  + ++   I   K+++E M + I ++F  +  N F 
Sbjct: 260 QELLLILEEYWERHPELHGVPIYQASGLARRAISVYKAYIEMMNEDIKRAFTVA--NPFE 317

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
            KH++ L + +  D+                +G S ++F  W  D +N V+  +    GT
Sbjct: 318 FKHISHLKSAAHFDD----------------SGMSRELFEAWCEDARNCVVIADFAVQGT 361

Query: 360 LARMLQADP 368
           LAR +  +P
Sbjct: 362 LARDILGNP 370


>gi|195395198|ref|XP_002056223.1| GJ10819 [Drosophila virilis]
 gi|194142932|gb|EDW59335.1| GJ10819 [Drosophila virilis]
          Length = 686

 Score =  145 bits (365), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 102/390 (26%), Positives = 193/390 (49%), Gaps = 28/390 (7%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV----ASTIDAVLL 60
           +Q+ PL           ++   G   ++DCG   H   S +  L  V    A  ID + +
Sbjct: 20  LQIKPLGAGQEVGRSCIMLEFKGKKIMLDCGI--HPGLSGMDALPYVDLIEADEIDLLFI 77

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVS-EFDLFTL 116
           SH    H GALP+ + +       F   +T+ +YR     M   Y+    +S E  L+T 
Sbjct: 78  SHFHLDHCGALPWFLMKTSFRGRCFMTHATKAIYRW----MLSDYIKISNISTEQMLYTE 133

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
            D++++ + +  +    N+H      G+    + AGH+LG  ++ I   G  ++Y  D++
Sbjct: 134 ADLEASMEKIETI----NFHEERDVMGVRFCAYNAGHVLGAAMFMIEIAGIKILYTGDFS 189

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPV 235
           R++++HL    +    +P VLIT++    H    R+ RE  F   + K +  GG  L+PV
Sbjct: 190 RQEDRHLMAAEVPP-KKPDVLITESTYGTHIHEKREDRESRFTSLVQKIVMQGGRCLIPV 248

Query: 236 DSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
            + GR  ELLLIL+++W+++      PIY+ + ++   +   ++++  M D I +  + +
Sbjct: 249 FALGRAQELLLILDEFWSQNPDLHEIPIYYASSLAKKCMAVYQTYINAMNDRIRR--QIA 306

Query: 294 RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
            +N F+ +H++ L      D+   GP +++AS   +++G S ++F  W +D KN V+   
Sbjct: 307 VNNPFVFRHISNLKGIDHFDDI--GPCVIMASPGMMQSGLSRELFESWCTDPKNGVIIAG 364

Query: 354 RGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
               GTLA+ + ++  P+ +     +++PL
Sbjct: 365 YCVEGTLAKTILSE--PEEITTLSGQKLPL 392


>gi|355680849|gb|AER96661.1| cleavage and polyadenylation specific factor 3, 73kDa [Mustela
           putorius furo]
          Length = 600

 Score =  145 bits (365), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 93/338 (27%), Positives = 176/338 (52%), Gaps = 22/338 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVS 109
           + ID +L+SH    H GALP+ +++       F   +T+ +YR  L      Y+    +S
Sbjct: 11  AEIDLLLISHFHLDHCGALPWFLQKTSFKGRTFMTHATKAIYRWLL----SDYVKVSNIS 66

Query: 110 EFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
             D L+T  D++ +   +  +    N+H   +  GI    + AGH+LG  ++ I   G  
Sbjct: 67  ADDMLYTETDLEESMDKIETI----NFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVK 122

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRA 227
           ++Y  D++R++++HL    + + ++P +LI ++    H    R++RE  F + +   +  
Sbjct: 123 LLYTGDFSRQEDRHLMAAEIPN-IKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNR 181

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG  L+PV + GR  ELLLIL++YW  H    + PIY+ + ++   +   ++++  M D 
Sbjct: 182 GGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDK 241

Query: 286 ITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDV 345
           I K      +N F+ KH++ L +    D+   GP +V+AS   +++G S ++F  W +D 
Sbjct: 242 IRKQINI--NNPFVFKHISNLKSMDHFDDI--GPSVVMASPGMMQSGLSRELFESWCTDK 297

Query: 346 KNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
           +N V+       GTLA+ + ++  P+ +     +++PL
Sbjct: 298 RNGVIIAGYCVEGTLAKHIMSE--PEEITTMSGQKLPL 333


>gi|443899092|dbj|GAC76423.1| mRNA cleavage and polyadenylation factor II complex, BRR5
           [Pseudozyma antarctica T-34]
          Length = 884

 Score =  145 bits (365), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 92/322 (28%), Positives = 168/322 (52%), Gaps = 13/322 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLS---APVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+DA+L++H    H  AL Y M++         V+ T P   +    M D        +
Sbjct: 74  STVDAILITHFHLDHAAALTYIMEKTNFRDGHGKVYMTHPTKAVYRFLMSDFVRISNAGN 133

Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
           + +LF  +++ ++++ +  + + Q+  ++G   G+    + AGH+LG  ++ I   G  +
Sbjct: 134 DDNLFDENEMLASWRQIEAVDFHQDVSIAG---GLRFTSYHAGHVLGACMFLIEIAGLRI 190

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAG 228
           +Y  D++R +++HL    +   VRP VLI ++        PR  +E  F   I   ++ G
Sbjct: 191 LYTGDFSREEDRHLVQAEIPP-VRPDVLICESTYGTQTHEPRLDKEHRFTSQIHHIIKRG 249

Query: 229 GNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSI 286
           G VLLPV   GR  ELLL+L++YWA H    + PIY+ + ++   I   ++++  M D I
Sbjct: 250 GRVLLPVFVLGRAQELLLLLDEYWAAHPELHSVPIYYASALAKKCISVYQTYIHTMNDHI 309

Query: 287 TKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVK 346
              F   RDN F+ KH++ L +  + ++   GP +++AS   +++G S ++   WA D +
Sbjct: 310 RTRF-NRRDNPFVFKHISNLRSLEKFEDR--GPCVMMASPGFMQSGVSRELLERWAPDKR 366

Query: 347 NLVLFTERGQFGTLARMLQADP 368
           N ++ +     GT+AR +  +P
Sbjct: 367 NGLIVSGYSVEGTMARNILNEP 388


>gi|325186851|emb|CCA21396.1| cleavage and polyadenylation specific factor 3 puta [Albugo
           laibachii Nc14]
          Length = 759

 Score =  145 bits (365), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 101/370 (27%), Positives = 178/370 (48%), Gaps = 16/370 (4%)

Query: 5   VQVTPLSGVFNENPLSYLV-SIDGFNFLIDCGWNDHFDPSLLQPL--SKVASTIDAVLLS 61
           +++ PL G  NE   S ++    G   ++DCG +  +      P      A  ID +L++
Sbjct: 18  MRIMPL-GAGNEVGRSCIILKFKGKTIMLDCGVHPGYSGHGSLPFFDGVEAEEIDLLLVT 76

Query: 62  HPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDID 120
           H    H+ ALP+  ++      VF T P   +  + + D +L    +S  D ++   D++
Sbjct: 77  HFHIDHVAALPHFTEKTNFKGRVFMTHPTKAVMQMMLRD-FLRVSNISVDDQIYDDKDLN 135

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           +    V  +    ++H      GI   P+ AGH+LG  ++ I   G  V+Y  DY+   +
Sbjct: 136 NCVAKVEII----DFHQEKTHNGIKFTPYNAGHVLGACMYLIEIGGVKVLYTGDYSLEND 191

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGR 240
           +HL    L +     +++   Y    +Q   ++   F   +   +R GG  L+PV + GR
Sbjct: 192 RHLMAAELPACSPDVLIVESTYGVQVHQSVVEREGRFTGQVESVIRRGGRCLIPVFALGR 251

Query: 241 VLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAF 298
             ELLLIL+++W  H    + PIYF + +++  +   ++++  M D I K    S  N F
Sbjct: 252 TQELLLILDEHWQAHPDLHDIPIYFASKLAAKALRVYQTYINMMNDRIRKQIAVS--NPF 309

Query: 299 LLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFG 358
           L  H++ L +  + D++  GP +V+AS   L++G S  +F  W SD +N  L       G
Sbjct: 310 LFDHISNLKSMDDFDDS--GPCVVMASPGMLQSGVSRQLFERWCSDKRNACLIPGYVVEG 367

Query: 359 TLARMLQADP 368
           TLA+ + ++P
Sbjct: 368 TLAKKILSEP 377


>gi|317036117|ref|XP_001397647.2| cleavage and polyadenylylation specificity factor [Aspergillus
           niger CBS 513.88]
          Length = 1015

 Score =  145 bits (365), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 116/426 (27%), Positives = 175/426 (41%), Gaps = 99/426 (23%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G   L+D GW+D FDP  LQ L K   T+  +LL+H    H+GA  +  K   L    PV
Sbjct: 27  GIKILVDVGWDDTFDPLDLQELEKHVPTLSLILLTHATPAHIGAFAHCCKTFPLFTQIPV 86

Query: 85  FSTEPVYRLGLLTMYDQYLS---------RRQVSE------------------------- 110
           ++T PV  LG   + D Y S         +  +SE                         
Sbjct: 87  YATSPVIALGRTLLQDLYASSPLAATFLPKASISEPGASTAAASAAASVAEGDESTEATH 146

Query: 111 -----FDLFTLDDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGHLLGGTVW 160
                    T ++I   F  +  L YSQ +            G+ +  + AGH +GGT+W
Sbjct: 147 AGRILLQPPTAEEIARYFSLIHPLKYSQPHQPLPSPFSPPLNGLTLTAYNAGHTVGGTIW 206

Query: 161 KITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAYNALHNQ 208
            I    E ++YAVD+N+ +E  + G             V+E   +P  L+          
Sbjct: 207 HIQHGMESIVYAVDWNQARESVVAGAAWFGGSGASGTEVIEQLRKPTALVCSTRGGERFA 266

Query: 209 PP--RQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--------- 256
            P  R++R ++  D I  T+  GG VL+P D++ RVLEL   LE  W + +         
Sbjct: 267 LPGGRKKRDDLLLDMIRSTIAKGGTVLIPTDTSARVLELAYALEHAWRDAAGSGQGDDVL 326

Query: 257 LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE--------TSRDNA----------- 297
               +Y     +++T+   +S LEWM ++I + FE        T + N            
Sbjct: 327 KGAGLYLAGRKANTTMRLARSMLEWMDENIVREFEAAEGVDAATGQSNTEGQRAGQNQGK 386

Query: 298 --------FLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHDIFVEWASDVKN 347
                   F  KH+ +L  K  L+   +   PK++LAS  SL+ GF+ D     A    N
Sbjct: 387 TEGKGVGPFTFKHLRILERKKRLEKILSDQKPKVILASDTSLDWGFAKDSLRLVAEGANN 446

Query: 348 LVLFTE 353
           L+L TE
Sbjct: 447 LLLLTE 452


>gi|116283804|gb|AAH30988.1| CPSF3 protein [Homo sapiens]
          Length = 554

 Score =  145 bits (365), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 95/352 (26%), Positives = 179/352 (50%), Gaps = 22/352 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML 364
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA++L
Sbjct: 318 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKIL 367


>gi|356502382|ref|XP_003519998.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-II-like [Glycine max]
          Length = 516

 Score =  145 bits (365), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 105/360 (29%), Positives = 174/360 (48%), Gaps = 20/360 (5%)

Query: 22  LVSIDGFNFLIDCGWN----DHF---DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
           +V+I+    + DCG +    DH    D + + P   + S +  ++++H    H+GAL Y 
Sbjct: 20  VVTINAKRIMFDCGMHMGYLDHRRYPDFTRISPSRDLNSALSCIIITHFHLDHVGALAYF 79

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTM--YDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYS 132
            + LG + PV+ T P   L  L +  Y + +  R+  E +LF+ D I    + V  +   
Sbjct: 80  TEVLGYNGPVYMTYPTKALAPLMLEDYRKVMVDRRGEE-ELFSSDQIAECMKKVIAVDLR 138

Query: 133 QNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFV 192
           Q   +    + + +  + AGH++G  ++       +++Y  DYN   ++HL    ++  +
Sbjct: 139 QTVQVE---KDLQIRAYYAGHVIGAAMFYAKVGDAEMVYTGDYNMTPDRHLGAAQIDR-L 194

Query: 193 RPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDY 251
           R  +LIT++  A   +  R  RE  F  A+ K +  GG VL+P  + GR  EL ++LEDY
Sbjct: 195 RLDLLITESTYATTIRDSRYAREREFLKAVHKCVSCGGKVLIPTFALGRAQELCILLEDY 254

Query: 252 WAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSE 311
           W   +L  PIYF   ++     Y K  + W    I  ++  S+ NAF  K+V     +S 
Sbjct: 255 WERMNLKVPIYFSAGLTIQANAYYKMLIRWTRQKIKDTY--SKHNAFDFKNVQKF-ERSM 311

Query: 312 LDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPK 371
           +D AP GP ++ A+   L  GFS ++F  WA    NLV        GT+   L +D   K
Sbjct: 312 ID-AP-GPCVLFATPGMLSGGFSVEVFKHWAVSENNLVSLPGYCVPGTIGHKLMSDKHDK 369


>gi|350633583|gb|EHA21948.1| hypothetical protein ASPNIDRAFT_41125 [Aspergillus niger ATCC 1015]
          Length = 1015

 Score =  145 bits (365), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 116/426 (27%), Positives = 175/426 (41%), Gaps = 99/426 (23%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G   L+D GW+D FDP  LQ L K   T+  +LL+H    H+GA  +  K   L    PV
Sbjct: 27  GIKILVDVGWDDTFDPLDLQELEKHVPTLSLILLTHATPAHIGAFAHCCKTFPLFTQIPV 86

Query: 85  FSTEPVYRLGLLTMYDQYLS---------RRQVSE------------------------- 110
           ++T PV  LG   + D Y S         +  +SE                         
Sbjct: 87  YATSPVIALGRTLLQDLYASSPLAATFLPKASISEPGASTAAASAAASVAEGDESTEATH 146

Query: 111 -----FDLFTLDDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGHLLGGTVW 160
                    T ++I   F  +  L YSQ +            G+ +  + AGH +GGT+W
Sbjct: 147 AGRILLQPPTAEEIARYFSLIHPLKYSQPHQPLPSPFSPPLNGLTLTAYNAGHTVGGTIW 206

Query: 161 KITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAYNALHNQ 208
            I    E ++YAVD+N+ +E  + G             V+E   +P  L+          
Sbjct: 207 HIQHGMESIVYAVDWNQARESVVAGAAWFGGSGASGTEVIEQLRKPTALVCSTRGGERFA 266

Query: 209 PP--RQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--------- 256
            P  R++R ++  D I  T+  GG VL+P D++ RVLEL   LE  W + +         
Sbjct: 267 LPGGRKKRDDLLLDMIRSTIAKGGTVLIPTDTSARVLELAYALEHAWRDAAGSGQGDDVL 326

Query: 257 LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE--------TSRDNA----------- 297
               +Y     +++T+   +S LEWM ++I + FE        T + N            
Sbjct: 327 KGAGLYLAGRKANTTMRLARSMLEWMDENIVREFEAAEGVDAATGQSNTEGQRAGQNQGK 386

Query: 298 --------FLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHDIFVEWASDVKN 347
                   F  KH+ +L  K  L+   +   PK++LAS  SL+ GF+ D     A    N
Sbjct: 387 TEGKGVGPFTFKHLRILERKKRLEKILSDQKPKVILASDTSLDWGFAKDSLRLVAEGANN 446

Query: 348 LVLFTE 353
           L+L TE
Sbjct: 447 LLLLTE 452


>gi|119576637|gb|EAW56233.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_b
           [Homo sapiens]
          Length = 329

 Score =  145 bits (365), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 108/354 (30%), Positives = 168/354 (47%), Gaps = 40/354 (11%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V                   VA H     L  TV +I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKV-------------------VAVH-----LHQTV-QIKVGSESVVYTGDYN 158

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 159 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 217

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 218 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQR 275

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLV 349
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V
Sbjct: 276 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMV 326


>gi|302927041|ref|XP_003054415.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256735356|gb|EEU48702.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 827

 Score =  145 bits (365), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 105/379 (27%), Positives = 181/379 (47%), Gaps = 28/379 (7%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           +++   G   ++D G +  +D     P       ST+D +L+SH    H  +LPY + + 
Sbjct: 41  HIIQYKGKTVMLDAGQHPAYDGLAALPFYDDFDLSTVDVLLISHFHIDHAASLPYVLART 100

Query: 79  GLSAPVFSTEPVYRLGLLTMYDQYL---SRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNY 135
                VF T P   +    + D      +    +   ++T  D  S F  +  + Y   +
Sbjct: 101 NFRGRVFMTHPTKAIYKWLIQDSVRVGNTSSNPTSQPIYTEQDHLSTFPQIEAIDYHTTH 160

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
            +S     I + P+ AGH+LG  ++ I   G ++ +  DY+R +++HL    +   V+  
Sbjct: 161 TISS----IRITPYPAGHVLGAAMFLIEIGGLNIFFTGDYSREQDRHLVSAEVPKGVKID 216

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT++   + +  PR +RE     +I+  L  GG VL+PV + GR  ELLLIL++YW +
Sbjct: 217 VLITESTYGIASHVPRLEREQALMKSITSILNRGGRVLMPVFALGRAQELLLILDEYWGK 276

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF-------ETSRDNA-----FLL 300
           HS    YPIY+ + ++   +   ++++  M D+I + F       E S D A     +  
Sbjct: 277 HSDFQKYPIYYASNLARKCMLVYQTYVGAMNDNIKRLFRERMAEAEASGDGAGKGGPWDF 336

Query: 301 KHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTL 360
           K++  L N    D+   G  ++LAS   L+ G S ++   WA   KN V+ T     GT+
Sbjct: 337 KYIRSLKNLDRFDDV--GGCVMLASPGMLQNGVSRELLERWAPSEKNGVIITGYSVEGTM 394

Query: 361 ARMLQADPPPKAVKVTMSR 379
           A+ +  +  P  ++  MSR
Sbjct: 395 AKQIMQE--PDQIQAVMSR 411


>gi|268552491|ref|XP_002634228.1| Hypothetical protein CBG01798 [Caenorhabditis briggsae]
          Length = 722

 Score =  145 bits (365), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 101/373 (27%), Positives = 176/373 (47%), Gaps = 18/373 (4%)

Query: 4   SVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVAS--TIDAVLLS 61
           ++  TPL          +L+   G   ++DCG +         P         ID +L++
Sbjct: 10  ALSFTPLGSGQEVGRSCHLLEYKGKRVMLDCGVHPGLHGVDALPFVDFVEIENIDLLLIT 69

Query: 62  HPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDD 118
           H    H GALP+ +++       F   +T+ +YR+ LL  Y +           L+T DD
Sbjct: 70  HFHLDHCGALPWLLQKTAFRGKCFMTHATKAIYRM-LLGDYVRISKYGGADRNQLYTEDD 128

Query: 119 IDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRR 178
           ++ +   +  + + +   ++G    I   P+VAGH+LG   + I   G  V+Y  D++  
Sbjct: 129 LEKSMAKIETIDFREQKEVNG----IRFWPYVAGHVLGACQFMIEIAGVRVLYTGDFSCL 184

Query: 179 KEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDS 237
           +++HL    +   V P VLIT++         R  RE  F   +   +  GG  L+P  +
Sbjct: 185 EDRHLCAAEIPP-VSPQVLITESTYGTQTHEDRSVREKRFTQMVHDIVTRGGRCLIPAFA 243

Query: 238 AGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            G   EL+LIL++YW  H    + P+Y+ + ++   +   ++F+  M   I K  + +  
Sbjct: 244 IGPAQELMLILDEYWEAHQELHDIPVYYASSLAKKCMSVYQTFVNGMNSRIQK--QIAIK 301

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F+ KHV+ L    + ++A  GP +VLA+   L++GFS ++F  W SD KN  +     
Sbjct: 302 NPFIFKHVSTLRGMDQFEDA--GPCVVLATPGMLQSGFSRELFENWCSDSKNGCIIAGYC 359

Query: 356 QFGTLARMLQADP 368
             GTLA+ +  +P
Sbjct: 360 VEGTLAKHILTEP 372


>gi|145478255|ref|XP_001425150.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392218|emb|CAK57752.1| unnamed protein product [Paramecium tetraurelia]
          Length = 690

 Score =  145 bits (365), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 96/319 (30%), Positives = 154/319 (48%), Gaps = 14/319 (4%)

Query: 55  IDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLF 114
           ID +L++H    H GALPY +K       ++ T P   +  L + D    + +    DL 
Sbjct: 63  IDLILITHFHLDHCGALPYFLKNYKFKGKIYMTTPTKEIYGLVLKDSIKVKSEDFSQDLI 122

Query: 115 TLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVD 174
               I+ + +++  + Y Q  H     +GI +  + AGH+LG  ++ +  DG  V+Y  D
Sbjct: 123 NEQSIEQSLKNIDCIDYDQEIHY----QGIKLKCYNAGHVLGAAMFMVEIDGVRVLYTGD 178

Query: 175 YNRRKEKHLNGTVLESFVRPAVLITDA-YNALHNQPPRQQREMFQDAISKTLRAGGNVLL 233
           Y+  KE+HL    L    +  VLI +A Y    ++   ++ E F   I  TL  GGNVLL
Sbjct: 179 YSTEKERHLRPAQL-PLEKIHVLIVEATYGDTQHETRTKREENFLKEIVSTLNGGGNVLL 237

Query: 234 PVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE 291
           PV + GR  ELL+IL++YW+++     +PIY    ++       +     +G+   K   
Sbjct: 238 PVFATGRCHELLIILDEYWSKNPQVQQFPIYSTCTLAIKCTHIFQKHFNKLGNKYHKG-- 295

Query: 292 TSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
              +N F   H+    +  ++ N    PK+V+AS   L++G S  I+  W  D KN V+ 
Sbjct: 296 ---ENLFKFNHINTKKHLQDILNN-QKPKVVMASPGLLQSGHSKQIYEYWCKDEKNQVII 351

Query: 352 TERGQFGTLARMLQADPPP 370
           T     GT+A  L  +P P
Sbjct: 352 TGPAVQGTIAHQLIHNPEP 370


>gi|322699261|gb|EFY91024.1| cleavage and polyadenylation specifity factor [Metarhizium acridum
           CQMa 102]
          Length = 829

 Score =  145 bits (365), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 105/381 (27%), Positives = 182/381 (47%), Gaps = 28/381 (7%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           +++   G   ++D G +  +D     P       ST+D +L+SH    H  +LPY + + 
Sbjct: 43  HIIQYKGKTVMLDAGQHPAYDGLAALPFYDDFDLSTVDVLLISHFHVDHAASLPYVLAKT 102

Query: 79  GLSAPVFSTEPVYRLGLLTMYDQYL---SRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNY 135
                VF T P   +    + D      +    +   ++T  D  + F  +  + Y   +
Sbjct: 103 NFRGRVFMTHPTKAIYKWLIQDSVRVGNTSSNSTTQPVYTEQDHLNTFSQIEAIDYHTTH 162

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
            +S     I + P+ AGH+LG  ++ I   G ++ +  DY+R +++HL    +   V+  
Sbjct: 163 TISS----IRITPYPAGHVLGAAMFLIEIAGLNIFFTGDYSREQDRHLVSAEVPKDVKID 218

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT++   + +  PR +RE     +I+  L  GG  LLPV + GR  ELLLIL++YW +
Sbjct: 219 VLITESTYGIASHVPRLEREQALMKSITGILNRGGRALLPVFALGRAQELLLILDEYWGK 278

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF-------ETSRDNA-----FLL 300
           H     YPIY+ + ++   +   ++++  M D+I + F       E S D A     +  
Sbjct: 279 HPEFQKYPIYYASNLARKCMVVYQTYVGAMNDNIKRLFRERMAEAEASGDGAGQGGPWDF 338

Query: 301 KHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTL 360
           K++  L N    D+   G  ++LAS   L++G S ++F  WA + KN V+ T     GT+
Sbjct: 339 KYIRSLKNLDRFDDV--GGCVMLASPGMLQSGVSRELFERWAPNEKNGVIITGYSVEGTM 396

Query: 361 ARMLQADPPPKAVKVTMSRRV 381
           AR +  +  P  +   MSR +
Sbjct: 397 ARQIMQE--PDQIPAVMSRNL 415


>gi|302787435|ref|XP_002975487.1| hypothetical protein SELMODRAFT_52099 [Selaginella moellendorffii]
 gi|300156488|gb|EFJ23116.1| hypothetical protein SELMODRAFT_52099 [Selaginella moellendorffii]
          Length = 517

 Score =  145 bits (365), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 100/360 (27%), Positives = 172/360 (47%), Gaps = 20/360 (5%)

Query: 22  LVSIDGFNFLIDCGWNDHF-DPSLLQPLSKVAST------IDAVLLSHPDTLHLGALPYA 74
           +VS+ G   + DCG +  + D       S+++ T      ID V+++H    H+GALPY 
Sbjct: 12  IVSMGGKKIMFDCGMHMGYQDERRFPDFSQISKTGDFTHEIDCVIVTHFHLDHVGALPYF 71

Query: 75  MKQLGLSAPVFSTEPVYRLG--LLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYS 132
            +  G   PV+ T P   L   +L  Y + +  R+  E    TL  I    + V  +   
Sbjct: 72  TEVCGYEGPVYMTYPTKALAPIMLEDYRKIMVDRRGEEEQFSTLH-IQQCMKKVIAVDLR 130

Query: 133 QNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFV 192
           Q   +S     +    + AGH+LG  ++ +      V+Y  DYN   ++HL    ++  +
Sbjct: 131 QTIRVS---RDLAFRAYYAGHVLGAAMFYVKAGNSTVVYTGDYNMTPDRHLGAAQIDR-L 186

Query: 193 RPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDY 251
           +P +LIT++  A   +  R  +E  F + +   +  GG VL+P+ + GR  EL ++L++Y
Sbjct: 187 KPDLLITESTYATTIRESRLAKEAEFLNVVHTCVSKGGKVLIPISALGRAQELCILLDEY 246

Query: 252 WAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSE 311
           W   +L  PIYF   ++  +  Y K  + W    I  ++ T   NAF  KHV    ++++
Sbjct: 247 WERMNLKVPIYFSAGLTMQSNAYYKLLISWTNQRIKDTYVTR--NAFDFKHV-FPFDRTQ 303

Query: 312 LDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPK 371
           LD    GP ++ A+   L  G S ++   WA   +NL++       GT+A+ L +  P +
Sbjct: 304 LDGP--GPCILFATPGMLTGGLSLEVLKHWAPVEQNLLIIPGFCLAGTVAQKLCSGKPTR 361


>gi|425768274|gb|EKV06801.1| Cleavage and polyadenylylation specificity factor, putative
           [Penicillium digitatum Pd1]
 gi|425770355|gb|EKV08828.1| Cleavage and polyadenylylation specificity factor, putative
           [Penicillium digitatum PHI26]
          Length = 1001

 Score =  145 bits (365), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 114/414 (27%), Positives = 176/414 (42%), Gaps = 87/414 (21%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G   L+D GW+D F+   L  L K   T+  +LL+H    H+GAL +  +   L    P+
Sbjct: 27  GIKILVDVGWDDTFNTLDLAELEKHIPTLSLILLTHATPAHIGALVHCCRTFPLFTQIPI 86

Query: 85  FSTEPVYRLGLLTMYDQYLS---------RRQVSE------------------------- 110
           ++T PV   G   + D Y S         +  VSE                         
Sbjct: 87  YATNPVIAFGRTLLQDLYASAPLAATFLPKASVSEPGASSAGSATVSGADAEAAGNTSRI 146

Query: 111 -FDLFTLDDIDSAFQSVTRLTYSQNYH-----LSGKGEGIVVAPHVAGHLLGGTVWKITK 164
                T ++I   F  +  L YSQ +       S    G+ +  + AGH +GGT+W I  
Sbjct: 147 LLQSPTAEEISRYFSLIQPLKYSQPHQPLSSPFSPPLNGLTLTAYNAGHTVGGTIWHIQH 206

Query: 165 DGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLI--TDAYNALHNQPP 210
             E ++YA+D+N+ +E  + G             V+E   +P  LI  T   + L     
Sbjct: 207 GLESIVYAMDWNQARESVVAGAAWFGGSGASGTEVIEQLRKPTALICSTTGGDKLAPSGG 266

Query: 211 RQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--------LNYPI 261
           R++R ++  D I  +L  GG VL+P D++ RVLEL   LE  W + +            +
Sbjct: 267 RKKRDDLLLDMIRSSLAKGGTVLIPTDTSARVLELAYSLEHSWRDAANGDKEDVLQGAGL 326

Query: 262 YFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN--------------------AFLLK 301
           Y      ++TI   +S LEWM ++I + FE +  +                     F  K
Sbjct: 327 YLAGKKVTNTIRLARSMLEWMDENIVREFEAAESSDVTNGQRTGAQEKSSNKGGGPFTFK 386

Query: 302 HVTLLINKSELDN--APDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
           H+ ++  K  L+   A  GPK++LAS  S++ GFS D   + A    NL+L TE
Sbjct: 387 HLKIIERKKRLEKLLAEPGPKVILASDTSMDWGFSKDALRQVAEGPNNLLLLTE 440


>gi|391348443|ref|XP_003748457.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Metaseiulus occidentalis]
          Length = 673

 Score =  145 bits (365), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 111/420 (26%), Positives = 207/420 (49%), Gaps = 29/420 (6%)

Query: 52  ASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQV 108
           A  ID +L+SH    H GALP+ +++       F   +T+ +YR  LL    +  +    
Sbjct: 57  ADEIDLLLVSHFHLDHCGALPWFLQKTTFKGRCFMTHATKAIYRW-LLADCIKVSNIGST 115

Query: 109 SEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
           S  +L+T  D++++   +  +    N+H   +  GI    + AGH+LG  ++ I   G  
Sbjct: 116 SSNNLYTEADLEASMDKIEVI----NFHEEKEINGIRFWCYHAGHVLGAAMFFIEIAGVK 171

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRA 227
           ++Y  D++R++++HL    + S V+P VLI ++    H    RQ RE  F   + + +  
Sbjct: 172 ILYTGDFSRQEDRHLMSAEIPS-VKPDVLIIESTYGTHIHEKRQDREHRFTHLVQEIVTR 230

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG  L+PV + GR  ELLLIL++YW  H    + PIY+ + ++   +   ++++  M + 
Sbjct: 231 GGRCLIPVFALGRAQELLLILDEYWGLHPELHDIPIYYASSLAKKCMAVYQTYVNAMNER 290

Query: 286 ITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDV 345
           I +    S  N F+ KH++ L +    D+   GP +++A+   +++G S ++F  W  D 
Sbjct: 291 IRRQIAIS--NPFVFKHISNLKSIDHFDDV--GPCVIMATPGMMQSGLSRELFEAWCGDT 346

Query: 346 KNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPL-VGEELIAYEEEQTRLKKEEAL 404
           KN V+       GTLA+ + ++  P+ V     +++PL +  + I++       +  E +
Sbjct: 347 KNGVIIAGYCVEGTLAKQILSE--PQEVTSMNGQKMPLKMSVDYISFSAHTDYQQTSEFI 404

Query: 405 KA------SLV---KEEESKASLGPDNNLSGDPMVIDANN-ANASADVVEPHGGRYRDIL 454
           +A       LV   + E S+     +    G+ + +D  N AN  A  ++  G R   ++
Sbjct: 405 RALKPPNIILVHGEQNEMSRLKAAIEREYEGEDLKMDVYNPANGHAVTLKFRGERLAKVM 464


>gi|389740019|gb|EIM81211.1| mRNA 3'-end-processing protein YSH1 [Stereum hirsutum FP-91666 SS1]
          Length = 841

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 99/324 (30%), Positives = 164/324 (50%), Gaps = 14/324 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGL---SAPVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+DA+L++H    H  AL Y M++         V+ T P   L    M D ++     S
Sbjct: 57  STVDAILITHFHLDHAAALTYIMEKTNFRDGKGKVYMTHPTKALHKFMMQD-FVRMSNSS 115

Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
              L +  D+  +  S+  ++  Q   L     G+   P+ AGH+LG  ++ I   G  +
Sbjct: 116 TDALISPLDLSMSISSIIPVSAHQ---LITPCPGVTFTPYHAGHVLGACMYLIDMAGIKI 172

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAG 228
           +Y  DY+R +++HL    +   VRP VLI ++   + +   R ++E+ F   +   +R G
Sbjct: 173 LYTGDYSREEDRHLVKAEVPP-VRPDVLIVESTYGVQSLEARDEKELRFTSLVHSIIRRG 231

Query: 229 GNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSI 286
           G+VLLP  + GR  ELLLIL++YW +H    N PIY+ + ++   +   ++++  M  +I
Sbjct: 232 GHVLLPAFALGRAQELLLILDEYWKKHPDLHNVPIYYASSLARKCMAVYQTYIHTMNSNI 291

Query: 287 TKSFETSRDNAFLLKHVTLLINKS--ELDNAPDGPKLVLASMASLEAGFSHDIFVEWASD 344
              F   RDN F+ KH++ +   S  E   A   P +VLAS   +++G S  +   WA D
Sbjct: 292 RTRF-AKRDNPFVFKHISNMPQSSGWERKIAEGPPCVVLASPGFMQSGPSRQLLELWAPD 350

Query: 345 VKNLVLFTERGQFGTLARMLQADP 368
            +N ++ T     GTLAR +  +P
Sbjct: 351 SRNGLIVTGYSVEGTLAREIMTEP 374


>gi|347965534|ref|XP_321933.5| AGAP001224-PA [Anopheles gambiae str. PEST]
 gi|333470467|gb|EAA01794.5| AGAP001224-PA [Anopheles gambiae str. PEST]
          Length = 690

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 96/356 (26%), Positives = 181/356 (50%), Gaps = 22/356 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +         P   +  A  ID + +SH    H GALP+ +++  
Sbjct: 37  MLEFKGKKIMLDCGIHPGLSGMDALPFVDLIDADQIDLLFISHFHLDHCGALPWFLQKTS 96

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVS-EFDLFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     M   Y+    +S +  L+T  D++++ + +  +    N+
Sbjct: 97  FKGRCFMTHATKAIYRW----MLSDYIKVSNISTDQMLYTEADLEASMEKIETI----NF 148

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H      G+    + AGH+LG  ++ I   G  V+Y  D++R++++HL    + + +RP 
Sbjct: 149 HEERDILGVRFWAYNAGHVLGAAMFMIEIAGIRVLYTGDFSRQEDRHLMAAEIPA-MRPD 207

Query: 196 VLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT++    H    R+ RE  F   + K ++ GG  L+PV + GR  ELLLIL++YW++
Sbjct: 208 VLITESTYGTHIHEKREDRENRFTSLVQKIVQQGGRCLIPVFALGRAQELLLILDEYWSQ 267

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           +      PIY+ + ++   +   ++++  M D I +  + + +N F+ + ++ L      
Sbjct: 268 NPDLQEIPIYYASSLAKKCMAVYQTYINAMNDKIRR--QIAINNPFVFRFISNLKGIDHF 325

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           D+   GP +V+AS   +++G S ++F  W +D KN V+       GTLA+ +  +P
Sbjct: 326 DDV--GPCVVMASPGMMQSGLSRELFESWCTDPKNGVIIAGYCVEGTLAKTILFEP 379


>gi|349579839|dbj|GAA25000.1| K7_Cft2p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 859

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 132/485 (27%), Positives = 220/485 (45%), Gaps = 67/485 (13%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPS------LLQPLSKVASTIDAVLLSHPDTLHLGA---LP 72
           +V  D    LID GWN    PS       ++   KV   ID ++LS P    LGA   L 
Sbjct: 19  VVRFDNVTLLIDPGWN----PSKVSYEQCIKYWEKVIPEIDVIILSQPTIECLGAHSLLY 74

Query: 73  YAMKQLGLSA-PVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLD--DIDSAFQSVTRL 129
           Y      +S   V++T PV  LG ++  D Y S   +  +D   LD  DI+ +F  +  L
Sbjct: 75  YNFTSHFISRIQVYATLPVINLGRVSTIDSYASAGVIGPYDTNKLDLEDIEVSFDHIVPL 134

Query: 130 TYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN----- 184
            YSQ   L  + +G+ +  + AG   GG++W I+   E ++YA  +N  ++  LN     
Sbjct: 135 KYSQLVDLRSRYDGLTLLAYNAGVCPGGSIWCISTYSEKLVYAKRWNHTRDNILNAASIL 194

Query: 185 ---GTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRV 241
              G  L + +RP+ +IT       +QP +++ ++F+D + K L + G+V++PVD +G+ 
Sbjct: 195 DATGKPLSTLMRPSAIITTLDRFGSSQPFKKRSKIFKDTLKKGLSSDGSVIIPVDMSGKF 254

Query: 242 LELL-----LILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN 296
           L+L      L+ E          P+  L+Y    T+ Y KS LEW+  S+ K++E +R+N
Sbjct: 255 LDLFTQVHELLFESTKINAHTQVPVLILSYARGRTLTYAKSMLEWLSPSLLKTWE-NRNN 313

Query: 297 A--FLLKHVTLLINKSELDNAPDGPKLVLASMAS-------LEAGFSHDIFV-------E 340
              F +     +I  +EL   P G K+   S          ++ G S    +       E
Sbjct: 314 TSPFEIGSRIKIIAPNELSKYP-GSKICFVSEVGALINEVIIKVGNSEKTTLILTKPSFE 372

Query: 341 WASDVKNLVLFTERGQ--FGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRL 398
            AS +  ++   E+G+  + T     ++      + +   +  PL  EE  A++ +    
Sbjct: 373 CASSLDKILEIVEQGERNWKTFPEDGKSFLCDNYISIDTIKEEPLSKEETEAFKVQLKEK 432

Query: 399 KKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGF 458
           K++   K  LVK E  K +       +G+ ++ D N   A          R +DIL++  
Sbjct: 433 KRDRNKKILLVKRESKKLA-------NGNAIIDDTNGERAM---------RNQDILVENV 476

Query: 459 --VPP 461
             VPP
Sbjct: 477 NGVPP 481


>gi|405958713|gb|EKC24813.1| Integrator complex subunit 11 [Crassostrea gigas]
          Length = 575

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 91/317 (28%), Positives = 157/317 (49%), Gaps = 11/317 (3%)

Query: 50  KVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQV 108
           K+   +D V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  + 
Sbjct: 29  KLTDHLDCVIISHFHLDHCGALPYMSEMVGYDGPIYMTHPTKAICPILLEDYRKITVERK 88

Query: 109 SEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
            E + FT + I +  + V  +   +   +    E + +  + AGH+LG  ++ I    + 
Sbjct: 89  GEENFFTSEMIKNCMKKVVVVNLHETKQVD---EELEIKAYYAGHVLGAAMFHIKVGQQS 145

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRA 227
           V+Y  DYN   ++HL    ++   RP +LIT++  A   +  ++ RE  F   +   +  
Sbjct: 146 VVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDFLKKVHDCVEK 204

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSIT 287
           GG VL+PV + GR  EL ++LE YW   ++  PIYF   ++     Y K F+ W    I 
Sbjct: 205 GGKVLIPVFALGRAQELCILLESYWDRMNIKVPIYFSLGLTEKANHYYKLFITWTSQKIK 264

Query: 288 KSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKN 347
           K+F   + N F  KH+    +++ +DN   GP +V A+   L AG S  IF +WA +  N
Sbjct: 265 KTF--VQRNMFEFKHIKPF-DRAFIDNP--GPMVVFATPGMLHAGLSLQIFKKWAPNELN 319

Query: 348 LVLFTERGQFGTLARML 364
           +V+       GT+   +
Sbjct: 320 MVIMPGYCVAGTVGHKI 336


>gi|327261273|ref|XP_003215455.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Anolis carolinensis]
          Length = 651

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 93/338 (27%), Positives = 175/338 (51%), Gaps = 22/338 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVS 109
           + ID +L+SH    H GALP+ +++       F   +T+ +YR  L      Y+    +S
Sbjct: 28  AEIDLLLISHFHLDHCGALPWFLQKTSFKGRTFMTHATKAIYRWLL----SDYVKVSNIS 83

Query: 110 EFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
             D L+T  D++ +   +  +    N+H   +  GI    + AGH+LG  ++ I   G  
Sbjct: 84  ADDMLYTETDLEESMDKIETI----NFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVK 139

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRA 227
           ++Y  D++R++++HL    + + ++P +LI ++    H    R++RE  F + +   +  
Sbjct: 140 LLYTGDFSRQEDRHLMAAEIPN-IKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNR 198

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG  L+PV + GR  ELLLIL++YW  H      PIY+ + ++   +   ++++  M D 
Sbjct: 199 GGRGLIPVFALGRAQELLLILDEYWQNHPELHEIPIYYASSLAKKCMAVYQTYVNAMNDK 258

Query: 286 ITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDV 345
           I K      +N F+ KH++ L +    D+   GP +V+AS   +++G S ++F  W +D 
Sbjct: 259 IRKQINI--NNPFVFKHISNLKSMDHFDDI--GPSVVMASPGMMQSGLSRELFESWCTDK 314

Query: 346 KNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
           +N V+       GTLA+ + ++  P+ +     +++PL
Sbjct: 315 RNGVIIAGYCVEGTLAKHIMSE--PEEITTMSGQKLPL 350


>gi|328350068|emb|CCA36468.1| hypothetical protein PP7435_Chr1-0308 [Komagataella pastoris CBS
           7435]
          Length = 741

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 94/321 (29%), Positives = 170/321 (52%), Gaps = 15/321 (4%)

Query: 54  TIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLSRRQVSE 110
           T+D +L+SH    H  +LPY M++      VF T P   +YR  LL  + +  +    S 
Sbjct: 14  TVDVLLISHFHLDHAASLPYVMQKTNFKGRVFMTHPTKAIYRW-LLNDFVRVTAIDDDSN 72

Query: 111 FDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVI 170
             L++  D+  +F  +  +    ++H + + +GI    + AGH+LG  ++ I   G  V+
Sbjct: 73  -QLYSDKDLKDSFDRIETI----DFHSTIEIDGIRFTAYQAGHVLGAAMFFIEIAGIKVL 127

Query: 171 YAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGG 229
           +  D++R +++HL+   +   VRP VLIT++        PR+++E      I  TL  GG
Sbjct: 128 FTGDFSREEDRHLSVAEVPP-VRPDVLITESTFGTATHEPREEKEKKLTTMIHSTLANGG 186

Query: 230 NVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSIT 287
            VL+PV + GR  ELLLIL++YW++H    N  +Y+ + ++   +   ++++  M ++I 
Sbjct: 187 RVLMPVFALGRAQELLLILDEYWSQHQDLENIKVYYASDLARKCLAVYQTYINMMNENIR 246

Query: 288 KSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKN 347
           K F  +  N F  +++  + N S+ D+    P +V+AS   L+ G S  +  +WA D +N
Sbjct: 247 KKFRDTNKNPFQFQYIKNIKNLSKFDDF--QPSVVVASPGMLQNGVSRALLEKWAPDPRN 304

Query: 348 LVLFTERGQFGTLARMLQADP 368
            ++ T     GT+A+ +  +P
Sbjct: 305 TLIMTGYSVEGTMAKEILLEP 325


>gi|238880762|gb|EEQ44400.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 931

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 142/548 (25%), Positives = 233/548 (42%), Gaps = 89/548 (16%)

Query: 28  FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGA---LPYAMKQLGLSAPV 84
           F  + D  WN   D +    + +     +A+LLSH     +     L      L  S PV
Sbjct: 27  FKLIADPSWNG-VDVNAAMFMEEHLKETNAILLSHSTAEFISGFILLCIKFPILMSSVPV 85

Query: 85  FSTEPVYRLGLLTMYDQYLSRRQVSEFD--LFTLDDIDSAFQSVTRLTYSQNYHLSGKGE 142
           +ST PV +LG ++  + Y +   +   D  +  LD++D+ F  V  L Y Q+ +L     
Sbjct: 86  YSTLPVNQLGRVSTVEYYRAMGFLGPVDSAILELDEVDNWFDKVNLLKYQQSLNLFD--N 143

Query: 143 GIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN---------GTVLESFVR 193
            +VV P+ AGH LGGT W ITK  + VIYA  +N  K+  LN         G    S +R
Sbjct: 144 KVVVTPYNAGHSLGGTFWLITKRIDRVIYAPAWNHSKDSFLNSASFISPSTGNPHLSLLR 203

Query: 194 PAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 253
           P   IT A +       R++ E F   +  TL  GG  +LP   +GR LEL  +++++  
Sbjct: 204 PTAFIT-ATDMGSVMSHRKRTEKFLQLVDATLANGGAAVLPTSLSGRFLELFHLIDEHLK 262

Query: 254 EHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELD 313
              +  P+YFL+Y  +  + Y  + L+WM  S TK +E      F    V LL++ SEL 
Sbjct: 263 GAPI--PVYFLSYSGTKILTYASNLLDWMSKSFTKEWEELSSVPFNPSKVDLLLDPSELL 320

Query: 314 NAPDGPKLVLASMASLEAG-FSHDIFVEWASDVKNLVLFTER--------------GQFG 358
               GPK+V  S   L +G  S + F    +D +  ++ TE+               ++ 
Sbjct: 321 KL-SGPKIVFCSGIDLRSGDISAEAFQYLCNDERTTIILTEKTTMNFASSLSSVLYTEWD 379

Query: 359 TLARMLQADPPPKAVKVTM---------SRRVPLVGEELIAYEEEQTRLKKEEALKASLV 409
           +LA+          + V +         ++ V L G EL  ++E+  + +KE+ L  + V
Sbjct: 380 SLAKKRGGGESEDGIAVPIDKNISLKNWTKEVELTGTELTEFQEKVAQKRKEKLL--AKV 437

Query: 410 KEEESKASLGPDN----------------------NLSGDPMVIDANNANASADVVEPHG 447
           ++++++  L  D                       N S + ++    N N +   V P+ 
Sbjct: 438 RDQKNQNILSADTVDSEDSSDDDDEGDNEAEKQKGNTSSNLLIKQYQNINVADSNVAPNE 497

Query: 448 ----GRYRDILIDGFVPPSTSVAPM--------------FPFY--ENNSEWDDFGEVINP 487
                 +   + D          P+              FP++   +  ++DD+GEVI  
Sbjct: 498 VNPLATHEAFITDHIKQSLEKNLPIDLKITHKLRPRQATFPYFATAHKQKFDDYGEVIKI 557

Query: 488 DDYIIKDE 495
           +DY   DE
Sbjct: 558 EDYQRHDE 565


>gi|323303882|gb|EGA57663.1| Cft2p [Saccharomyces cerevisiae FostersB]
          Length = 859

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 132/485 (27%), Positives = 220/485 (45%), Gaps = 67/485 (13%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPS------LLQPLSKVASTIDAVLLSHPDTLHLGA---LP 72
           +V  D    LID GWN    PS       ++   KV   ID ++LS P    LGA   L 
Sbjct: 19  VVRFDNVTLLIDPGWN----PSKVSYEQCIKYWEKVIPEIDVIILSQPTIECLGAHSLLY 74

Query: 73  YAMKQLGLSA-PVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLD--DIDSAFQSVTRL 129
           Y      +S   V++T PV  LG ++  D Y S   +  +D   LD  DI+ +F  +  L
Sbjct: 75  YNFTSHFISRIQVYATLPVINLGRVSTIDSYASAGVIGPYDTNKLDLEDIEISFDHIVPL 134

Query: 130 TYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN----- 184
            YSQ   L  + +G+ +  + AG   GG++W I+   E ++YA  +N  ++  LN     
Sbjct: 135 KYSQLVDLRSRYDGLTLLAYNAGVCPGGSIWCISTYSEKLVYAKRWNHTRDNILNAASIL 194

Query: 185 ---GTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRV 241
              G  L + +RP+ +IT       +QP +++ ++F+D + K L + G+V++PVD +G+ 
Sbjct: 195 DATGKPLSTLMRPSAIITTLDRFGSSQPFKKRSKIFKDTLKKGLSSDGSVIIPVDMSGKF 254

Query: 242 LELL-----LILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN 296
           L+L      L+ E          P+  L+Y    T+ Y KS LEW+  S+ K++E +R+N
Sbjct: 255 LDLFTQVHELLFESTKINAHTQVPVLILSYARGRTLTYAKSMLEWLSPSLLKTWE-NRNN 313

Query: 297 A--FLLKHVTLLINKSELDNAPDGPKLVLASMAS-------LEAGFSHDIFV-------E 340
              F +     +I  +EL   P G K+   S          ++ G S    +       E
Sbjct: 314 TSPFEIGSRIKIIAPNELSKYP-GSKICFVSEVGALINEVIIKVGNSEKTTLILTKPSFE 372

Query: 341 WASDVKNLVLFTERGQ--FGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRL 398
            AS +  ++   E+G+  + T     ++      + +   +  PL  EE  A++ +    
Sbjct: 373 CASSLDKILEIVEQGERNWKTFPEDGKSFLCDNYISIDTIKEEPLSKEETEAFKVQLKEK 432

Query: 399 KKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGF 458
           K++   K  LVK E  K +       +G+ ++ D N   A          R +DIL++  
Sbjct: 433 KRDRNKKILLVKRESKKLA-------NGNAIIDDTNGERAM---------RNQDILVENV 476

Query: 459 --VPP 461
             VPP
Sbjct: 477 NGVPP 481


>gi|358368318|dbj|GAA84935.1| cleavage and polyadenylylation specificity factor [Aspergillus
           kawachii IFO 4308]
          Length = 1015

 Score =  144 bits (363), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 115/426 (26%), Positives = 175/426 (41%), Gaps = 99/426 (23%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G   L+D GW+D FDP  LQ L K   T+  +LL+H    H+GA  +  K   L    PV
Sbjct: 27  GIKILVDVGWDDTFDPLDLQELEKHVPTLSLILLTHATPAHIGAFAHCCKTFPLFTQIPV 86

Query: 85  FSTEPVYRLGLLTMYDQYLS---------RRQVSE------------------------- 110
           ++T PV  LG   + D Y S         +  +SE                         
Sbjct: 87  YATSPVIALGRTLLQDLYASSPLAATFLPKASISEPGASTAAASAAASVAEGDESAEATH 146

Query: 111 -----FDLFTLDDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGHLLGGTVW 160
                    T ++I   F  +  L YSQ +            G+ +  + AGH +GGT+W
Sbjct: 147 AGRILLQPPTAEEIARYFSLIHPLKYSQPHQPLPSPFSPPLNGLTLTAYNAGHTVGGTIW 206

Query: 161 KITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAYNALHNQ 208
            +    E ++YAVD+N+ +E  + G             V+E   +P  L+          
Sbjct: 207 HVQHGMESIVYAVDWNQARESVVAGAAWFGGSGASGTEVIEQLRKPTALVCSTRGGERFA 266

Query: 209 PP--RQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--------- 256
            P  R++R ++  D I  T+  GG VL+P D++ RVLEL   LE  W + +         
Sbjct: 267 LPGGRKKRDDLLLDMIRSTIAKGGTVLIPTDTSARVLELAYALEHAWRDAAGSGQGDDTL 326

Query: 257 LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE--------TSRDNA----------- 297
               +Y     +++T+   +S LEWM ++I + FE        T + N            
Sbjct: 327 KGAGLYLAGRKANTTMRLARSMLEWMDENIVREFEAAEGVDAATGQSNTEGQRAGQNQGK 386

Query: 298 --------FLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHDIFVEWASDVKN 347
                   F  KH+ +L  K  L+   +   PK++LAS  SL+ GF+ D     A    N
Sbjct: 387 AEGKGVGPFTFKHLRILERKKRLEKILSDQKPKVILASDTSLDWGFAKDSLRLVAEGANN 446

Query: 348 LVLFTE 353
           L+L TE
Sbjct: 447 LLLLTE 452


>gi|358396914|gb|EHK46289.1| hypothetical protein TRIATDRAFT_132454 [Trichoderma atroviride IMI
           206040]
          Length = 881

 Score =  144 bits (363), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 100/346 (28%), Positives = 171/346 (49%), Gaps = 25/346 (7%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD--QYLSRRQVSE 110
           ST+D +L+SH    H  +LPY + +      VF T P   +    + D  +  +    S 
Sbjct: 86  STVDVLLISHFHIDHAASLPYVLAKTNFRGRVFMTHPTKAIYKWLIQDSVRVANTASNSA 145

Query: 111 FDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVI 170
             L+T  D  + F  +  + Y   + +S     I + P+ AGH+LG  ++ I   G ++ 
Sbjct: 146 TQLYTEQDHLNTFPQIEAIDYHTTHTISS----IRITPYPAGHVLGAAMFLIEIAGLNIF 201

Query: 171 YAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGG 229
           +  DY+R +++HL    +   ++  VLIT++   + +  PR +RE     +I+  L  GG
Sbjct: 202 FTGDYSREQDRHLVSAEVPKGLKIDVLITESTYGIASHVPRVEREQALMKSITGILNRGG 261

Query: 230 NVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSIT 287
             LLPV + GR  ELLLIL++YW +H+    +PIY+ + ++   +   ++++  M D+I 
Sbjct: 262 RALLPVFALGRAQELLLILDEYWGKHTEFQKFPIYYASNLARKCMVIYQTYVGAMNDNIK 321

Query: 288 KSF-------ETSRDNA-----FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSH 335
           + F       E S D A     +  K++  L N    D+   G  ++LAS   L+ G S 
Sbjct: 322 RLFRERMAEAEASGDGAGKNGPWDFKYIRSLKNLDRFDDV--GGCVMLASPGMLQNGVSR 379

Query: 336 DIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRV 381
           ++F  WA   KN V+ T     GT+AR +  +  P  ++  MSR +
Sbjct: 380 ELFERWAPSEKNGVIITGYSVEGTMARQIMQE--PDQIQAVMSRSI 423


>gi|62898706|dbj|BAD97207.1| cleavage and polyadenylation specific factor 3, 73kDa variant [Homo
           sapiens]
          Length = 684

 Score =  144 bits (363), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 96/371 (25%), Positives = 187/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T   ++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETVLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>gi|303391080|ref|XP_003073770.1| putative beta-lactamase fold-containing exonuclease
           [Encephalitozoon intestinalis ATCC 50506]
 gi|303302918|gb|ADM12410.1| putative beta-lactamase fold-containing exonuclease
           [Encephalitozoon intestinalis ATCC 50506]
          Length = 696

 Score =  144 bits (363), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 106/386 (27%), Positives = 187/386 (48%), Gaps = 20/386 (5%)

Query: 5   VQVTPLSGVFNENPLS-YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLS 61
           ++V PL G  NE   S  +V   G   ++DCG +  +      P   +   S IDA+ ++
Sbjct: 7   IKVMPL-GAGNEVGRSCVIVECGGRTIMLDCGVHPAYTGVASLPFLDLVDLSKIDAIFVT 65

Query: 62  HPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDS 121
           H    H  ALP+  ++      V+ T P   +    + D        S+ D +T  D+  
Sbjct: 66  HFHLDHAAALPFLTEKTSFKGKVYMTHPTKAILKWLLNDYIRLINAASDADFYTETDLVK 125

Query: 122 AFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEK 181
            +  +  + Y Q  ++    +GI V    AGH+LG  ++ +  +   ++Y  D++R +++
Sbjct: 126 CYDRIIPIDYHQEVNV----KGIKVKALNAGHVLGAAMFLVEIEKSKILYTGDFSREEDR 181

Query: 182 HLNGTVLES-FVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAG 239
           HL     ES   +   LIT++   +    PR +RE  F   +   ++ GG  LLPV + G
Sbjct: 182 HLKAA--ESPGCKIDALITESTYGVQCHLPRSEREGRFTSIVQNVVQRGGRCLLPVFALG 239

Query: 240 RVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA 297
           R  ELLLILE++W+ ++     PIY+ + ++   +   ++++  M + I K   +   N 
Sbjct: 240 RAQELLLILEEHWSSNASLQKIPIYYASALAKRCMGVYQTYIGMMNERIQKL--SLVRNP 297

Query: 298 FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQF 357
           F  K+V  L      D+  +GP +++AS   L++G S D+F  W SD KN V+       
Sbjct: 298 FAFKYVKNLKGIDSFDD--EGPCVIMASPGMLQSGLSRDLFERWCSDSKNAVIIPGYCVD 355

Query: 358 GTLARMLQADPPPKAVKVTMSRRVPL 383
           GTLA+ + ++  PK ++    +R+ L
Sbjct: 356 GTLAKEILSE--PKEIEALNGKRLRL 379


>gi|412990885|emb|CCO18257.1| predicted protein [Bathycoccus prasinos]
          Length = 825

 Score =  144 bits (363), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 112/374 (29%), Positives = 178/374 (47%), Gaps = 29/374 (7%)

Query: 29  NFLIDCGWNDHFDP-SLLQPLSKV-ASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFS 86
           N + DCG +  F   S L    ++  S ID +L++H    H  A+P+ + +      VF 
Sbjct: 74  NVMFDCGIHPGFSGLSSLPYFDEIDVSAIDVLLVTHFHLDHCAAVPFLVNRTNFKGRVFM 133

Query: 87  TEPVYRLGLLTMYD-QYLSRRQ-------------VSEFDLFTLDDIDSAFQSVTRLTYS 132
           T     +  + M D   LS RQ               E  L+   D+ +A   +  + + 
Sbjct: 134 THATKAIFHMLMSDFVRLSARQQPKAKGSEEKEEEEDESQLWDAKDLKAAMDKIEVIDFH 193

Query: 133 QNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFV 192
           Q  ++    +GI V P+ AGH+LG   +++   G  V+Y  DY+R  ++HL    +    
Sbjct: 194 QEINI----DGIKVTPYRAGHVLGACQFEVNVGGCRVLYTGDYSRVADRHLPAADIPKKT 249

Query: 193 RPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDY 251
            P V+I ++   +    P+++RE  F D I   L  GG  LLPV + GR  ELLLILEDY
Sbjct: 250 -PHVVIVESTYGVSPHTPKEEREARFTDKIHGILGRGGKCLLPVVALGRAQELLLILEDY 308

Query: 252 WAEH--SLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINK 309
           W +H    + P+Y  + ++   +   ++++  +   I + FE    N F  KHV  L   
Sbjct: 309 WEKHPEMSHVPVYQASALARKAMTVFETYINVLNADIKRQFEEK--NPFNFKHVQSLNRA 366

Query: 310 SELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPP 369
           S+LD    GP +VLA+ + L++G S ++F  W     N V+  +    GTLAR + +D  
Sbjct: 367 SDLDGNT-GPCVVLATPSMLQSGTSRELFENWCESSDNGVVICDFAVQGTLAREILSD-- 423

Query: 370 PKAVKVTMSRRVPL 383
            K VK    R + L
Sbjct: 424 VKTVKARDGRELQL 437


>gi|330796066|ref|XP_003286090.1| hypothetical protein DICPUDRAFT_30371 [Dictyostelium purpureum]
 gi|325083909|gb|EGC37349.1| hypothetical protein DICPUDRAFT_30371 [Dictyostelium purpureum]
          Length = 468

 Score =  144 bits (363), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 101/374 (27%), Positives = 180/374 (48%), Gaps = 19/374 (5%)

Query: 4   SVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTID 56
           +++V PL    +      +V+I   N + DCG +  +       D S +    +    ID
Sbjct: 2   TIKVVPLGAGQDVGRSCVIVTIGNKNIMFDCGMHMGYYDERRFPDFSYISKNKQFTKIID 61

Query: 57  AVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFT 115
            V+++H    H GALPY  + +G   P++ T P   +  + + D + ++  +  + + FT
Sbjct: 62  CVIITHFHLDHCGALPYFTEMVGYDGPIYMTLPTKAITPILLEDYRKITVDRKGDTNFFT 121

Query: 116 LDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDY 175
              I    + V  +   Q   +    E + +  + AGH+LG  ++      E V+Y  DY
Sbjct: 122 PQMIKDCMKKVIPIDLHQTIKVD---EELSIKAYYAGHVLGAAMFYAKVGDESVVYTGDY 178

Query: 176 NRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLP 234
           N   ++HL    ++  V+P VLIT+   A   +  ++ RE  F   + + +  GG VL+P
Sbjct: 179 NMTPDRHLGSAWIDQ-VKPDVLITETTYATTIRDSKRGRERDFLKRVHECVEKGGKVLIP 237

Query: 235 VDSAGRVLELLLILEDYWAEHSLNY-PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
           V + GRV EL ++++ YW + +L++ PIYF   ++     Y K F+ W    I ++F   
Sbjct: 238 VFALGRVQELCILIDSYWEQMNLSHVPIYFSAGLAEKANLYYKLFINWTNQKIKQTF--V 295

Query: 294 RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
           + N F  KH+     +S L +AP G  ++ A+   L AG S ++F +WA +  N+ +   
Sbjct: 296 KRNMFDFKHIKPF--QSHLVDAP-GAMVLFATPGMLHAGASLEVFKKWAPNELNMTIIPG 352

Query: 354 RGQFGTLARMLQAD 367
               GT+   L A+
Sbjct: 353 YCVVGTVGNKLLAN 366


>gi|344257704|gb|EGW13808.1| Cleavage and polyadenylation specificity factor subunit 3
           [Cricetulus griseus]
          Length = 647

 Score =  144 bits (363), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 93/338 (27%), Positives = 175/338 (51%), Gaps = 22/338 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVS 109
           + ID +L+SH    H GALP+ +++       F   +T+ +YR  L      Y+    +S
Sbjct: 25  AEIDLLLISHFHLDHCGALPWFLQKTSFKGRTFMTHATKAIYRWLL----SDYVKVSNIS 80

Query: 110 EFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
             D L+T  D++ +   +  +    N+H   +  GI    + AGH+LG  ++ I   G  
Sbjct: 81  ADDMLYTETDLEESMDKIETI----NFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVK 136

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRA 227
           ++Y  D++R++++HL    + + ++P +LI ++    H    R++RE  F + +   +  
Sbjct: 137 LLYTGDFSRQEDRHLMAAEIPN-IKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNR 195

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG  L+PV + GR  ELLLIL++YW  H    + PIY+ + ++   +   ++++  M D 
Sbjct: 196 GGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDK 255

Query: 286 ITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDV 345
           I K      +N F+ KH++ L +    D+   GP +V+AS   ++ G S ++F  W +D 
Sbjct: 256 IRKQINI--NNPFVFKHISNLKSMDHFDDI--GPSVVMASPGMIQNGLSRELFESWCTDK 311

Query: 346 KNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
           +N V+       GTLA+ + ++  P+ +     +++PL
Sbjct: 312 RNGVIIAGYCVEGTLAKHIMSE--PEEITTMSGQKLPL 347


>gi|322710530|gb|EFZ02104.1| cleavage and polyadenylation specifity factor [Metarhizium
           anisopliae ARSEF 23]
          Length = 831

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 105/381 (27%), Positives = 181/381 (47%), Gaps = 28/381 (7%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           +++   G   ++D G +  +D     P       ST+D +L+SH    H  +LPY + + 
Sbjct: 43  HIIQYKGKTVMLDAGQHPAYDGLAALPFYDDFDLSTVDVLLISHFHVDHAASLPYVLAKT 102

Query: 79  GLSAPVFSTEPVYRLGLLTMYDQYL---SRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNY 135
                VF T P   +    + D      +    +   ++T  D  + F  +  + Y   +
Sbjct: 103 NFRGRVFMTHPTKAIYKWLIQDSVRVGNTSSNSTTQPVYTEQDHLNTFSQIEAIDYHTTH 162

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
            +S     I + P+ AGH+LG  ++ I   G ++ +  DY+R +++HL    +   V+  
Sbjct: 163 TISS----IRITPYPAGHVLGAAMFLIEIAGLNIFFTGDYSREQDRHLVSAEVPKDVKID 218

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT++   + +  PR +RE     +I+  L  GG  LLPV + GR  ELLLIL++YW +
Sbjct: 219 VLITESTYGIASHVPRLEREQALMKSITGILNRGGRALLPVFALGRAQELLLILDEYWGK 278

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF-------ETSRDNA-----FLL 300
           H     YPIY+ + ++   +   ++++  M D+I + F       E S D A     +  
Sbjct: 279 HPEFQKYPIYYASNLARKCMVVYQTYVGAMNDNIKRLFRERMAEAEASGDGAGQGGPWDF 338

Query: 301 KHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTL 360
           K++  L N    D+   G  ++LAS   L++G S ++F  WA   KN V+ T     GT+
Sbjct: 339 KYIRSLKNLDRFDDV--GGCVMLASPGMLQSGVSRELFERWAPSEKNGVIITGYSVEGTM 396

Query: 361 ARMLQADPPPKAVKVTMSRRV 381
           AR +  +  P  +   MSR +
Sbjct: 397 ARQIMQE--PDQIPAVMSRNL 415


>gi|300706475|ref|XP_002995499.1| hypothetical protein NCER_101581 [Nosema ceranae BRL01]
 gi|239604633|gb|EEQ81828.1| hypothetical protein NCER_101581 [Nosema ceranae BRL01]
          Length = 671

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 112/376 (29%), Positives = 184/376 (48%), Gaps = 24/376 (6%)

Query: 3   TSVQVTPLSGVFNENPLS-YLVSIDGFNFLIDCGWND-HFDPSLLQPLSKV-ASTIDAVL 59
             ++V PL G  NE   S  L+S +  N + DCG +  H   + L  L  V  ST+DA  
Sbjct: 29  NKIKVKPL-GAGNEVGRSCILISYNNKNIMFDCGVHSAHTGIASLPFLDTVDLSTVDACF 87

Query: 60  LSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDI 119
           ++H    H   LPY  ++      VF T P   +    + D        S+ D +T  D+
Sbjct: 88  ITHFHLDHAAGLPYLTEKTNFKGKVFMTHPTKAILRWMLNDYVRIINASSDVDFYTEKDL 147

Query: 120 DSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRK 179
           ++ +  +  + Y Q  ++    EGI V    AGH+LG  ++ I  +   ++Y  DY+R +
Sbjct: 148 NNCYNKIIPIDYHQEINI----EGIKVIGLNAGHVLGAAMFLIKIEDSVMLYTGDYSREE 203

Query: 180 EKHLNGTVLES-FVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDS 237
           ++HL     ES   +   LIT++   +     R +RE  F   I+K +  GG  LLPV +
Sbjct: 204 DRHLKAA--ESPNCKIHALITESTYGVQCHLSRDERESRFTSTITKIVTRGGRCLLPVFA 261

Query: 238 AGRVLELLLILEDYWAE----HSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
            GR  ELLLIL+++W+     HS+  PIY+ + ++   I   ++++  M D I KS  + 
Sbjct: 262 LGRAQELLLILDEHWSNNPQLHSI--PIYYASALAKKCIGIYQTYINMMNDHIKKS--SL 317

Query: 294 RDNAFLLKHVTLLINKSELDNAPDG-PKLVLASMASLEAGFSHDIFVEWASDVKNLVLFT 352
             N F  ++V    N   +D   D  P +++AS   L++G S ++F +W  D +N V+  
Sbjct: 318 IKNPFAFQYVK---NLKSIDFFEDNSPCVIMASPGMLQSGLSRELFEKWCGDRRNGVIIP 374

Query: 353 ERGQFGTLARMLQADP 368
                GTLA+ +  +P
Sbjct: 375 GYSVDGTLAKEILNEP 390


>gi|348686031|gb|EGZ25846.1| hypothetical protein PHYSODRAFT_478942 [Phytophthora sojae]
          Length = 733

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 101/370 (27%), Positives = 178/370 (48%), Gaps = 16/370 (4%)

Query: 5   VQVTPLSGVFNENPLSYLV-SIDGFNFLIDCGWNDHFDPSLLQPL--SKVASTIDAVLLS 61
           +++ PL G  NE   S +V    G   ++DCG +  +      P      A  ID +L++
Sbjct: 17  MRIMPL-GAGNEVGRSCIVLKFKGKTIMLDCGVHPGYSGHGSLPFFDGVEAEEIDLLLIT 75

Query: 62  HPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDID 120
           H    H+ ALP+  ++      VF T P   +  + + D +L    +S  D ++   D++
Sbjct: 76  HFHIDHVAALPHFTEKTNFKGRVFMTHPTKAVMQMMLRD-FLRVSNISVDDQIYDDKDLN 134

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           +    V  +    ++H      GI   P+ AGH+LG  ++ I   G  V+Y  DY+   +
Sbjct: 135 NCVSKVEII----DFHQEIMHNGIKFTPYNAGHVLGACMYLIEIGGVKVLYTGDYSLEND 190

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGR 240
           +HL    L +     +++   Y    +Q   ++   F   +   +R GG  L+PV + GR
Sbjct: 191 RHLMAAELPACSPDVLIVESTYGVQVHQSVVEREGRFTGQVEAVVRRGGRCLIPVFALGR 250

Query: 241 VLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAF 298
             ELLLIL+++W  H    + PIYF + +++  +   ++++  M D I K    S  N F
Sbjct: 251 TQELLLILDEHWRSHPDLQDIPIYFASKLAAKALRVYQTYINMMNDRIRKQIAIS--NPF 308

Query: 299 LLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFG 358
             +H++ L +  + D++  GP +V+AS   L++G S  +F  W SD +N  L       G
Sbjct: 309 QFEHISNLKSMDDFDDS--GPSVVMASPGMLQSGVSRQLFERWCSDKRNACLIPGYVVEG 366

Query: 359 TLARMLQADP 368
           TLA+ + ++P
Sbjct: 367 TLAKKILSEP 376


>gi|395518397|ref|XP_003763348.1| PREDICTED: integrator complex subunit 11 [Sarcophilus harrisii]
          Length = 393

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 102/357 (28%), Positives = 169/357 (47%), Gaps = 25/357 (7%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG    +ND     D S +    ++   +D 
Sbjct: 4   IKVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGYNDDRRFPDFSYITQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTKAICPILLEDYRKITVDKKGETNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E  +Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESAVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFT 352
           N F  KH+    +++  DN   GP +        E G   D+   WA + +    F 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVG-------EGGPWLDLVQAWAGEEEGAATFC 344


>gi|47230093|emb|CAG10507.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 730

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 94/356 (26%), Positives = 180/356 (50%), Gaps = 22/356 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 26  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 85

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  + L+   D++ +   +  +    N+
Sbjct: 86  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADEMLYAETDLEESMDKIETI----NF 137

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + S V+P 
Sbjct: 138 HEVREVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPS-VKPD 196

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +   G  L+PV + GR  ELLLIL++YW  
Sbjct: 197 ILIIESTYGTHIHEKREEREARFCNTVHDIVNREGRCLIPVFALGRAQELLLILDEYWQN 256

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K+     +N F+ KH++ L +    
Sbjct: 257 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKAINI--NNPFVFKHISNLKSMDHF 314

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++P
Sbjct: 315 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP 368


>gi|301111988|ref|XP_002905073.1| cleavage and polyadenylation specificity factor subunit 3
           [Phytophthora infestans T30-4]
 gi|262095403|gb|EEY53455.1| cleavage and polyadenylation specificity factor subunit 3
           [Phytophthora infestans T30-4]
          Length = 724

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 101/370 (27%), Positives = 178/370 (48%), Gaps = 16/370 (4%)

Query: 5   VQVTPLSGVFNENPLSYLV-SIDGFNFLIDCGWNDHFDPSLLQPL--SKVASTIDAVLLS 61
           +++ PL G  NE   S +V    G   ++DCG +  +      P      A  ID +L++
Sbjct: 17  MRIMPL-GAGNEVGRSCIVLKFKGKTIMLDCGVHPGYSGHGSLPFFDGVEAEEIDLLLIT 75

Query: 62  HPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDID 120
           H    H+ ALP+  ++      VF T P   +  + + D +L    +S  D ++   D++
Sbjct: 76  HFHIDHVAALPHFTEKTNFKGRVFMTHPTKAVMQMMLRD-FLRVSNISVDDQIYDDKDLN 134

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           +    V  +    ++H      GI   P+ AGH+LG  ++ I   G  V+Y  DY+   +
Sbjct: 135 NCVSKVEII----DFHQEMMHNGIKFTPYNAGHVLGACMYLIEIGGVKVLYTGDYSLEND 190

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGR 240
           +HL    L +     +++   Y    +Q   ++   F   +   +R GG  L+PV + GR
Sbjct: 191 RHLMAAELPACSPDVLIVESTYGVQVHQSVVEREGRFTGQVEAVVRRGGRCLIPVFALGR 250

Query: 241 VLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAF 298
             ELLLIL+++W  H    + PIYF + +++  +   ++++  M D I K    S  N F
Sbjct: 251 TQELLLILDEHWRSHPDLQDIPIYFASKLAAKALRVYQTYINMMNDRIRKQIAIS--NPF 308

Query: 299 LLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFG 358
             +H++ L +  + D++  GP +V+AS   L++G S  +F  W SD +N  L       G
Sbjct: 309 QFEHISNLKSMDDFDDS--GPSVVMASPGMLQSGVSRQLFERWCSDKRNACLIPGYVVEG 366

Query: 359 TLARMLQADP 368
           TLA+ + ++P
Sbjct: 367 TLAKKILSEP 376


>gi|328867689|gb|EGG16071.1| beta-lactamase domain-containing protein [Dictyostelium
           fasciculatum]
          Length = 786

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 96/320 (30%), Positives = 167/320 (52%), Gaps = 17/320 (5%)

Query: 55  IDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQY-LSRRQVSEFDL 113
           ID +L+SH    H  A+PY +++      V+ T P  ++  + + D   +S   V+E   
Sbjct: 83  IDLLLVSHFHLDHAAAVPYFVQKTDFKGKVYMTHPTKKIYKVLLSDYVKVSNISVAEDMP 142

Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAV 173
           F   D++++   +  +    NYH   +  GI    + AGH+LG  ++ +   G  ++Y  
Sbjct: 143 FDEQDLNASLPKIEHI----NYHQKIEHNGIKFCCYNAGHVLGAAMFMVEIAGVRILYTG 198

Query: 174 DYNRRKEKHLNGTVLESF-VRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNV 231
           D++R++++HL G   ES  V   VLI ++   +    PR +RE  F  +I + +R GG  
Sbjct: 199 DFSRQEDRHLMGA--ESPPVDVDVLIIESTYGVQVHEPRLERERRFTTSIHEIVRRGGRC 256

Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           L+PV + GR  ELLLIL++YW  H      PIY+ + ++   +   +++++ M + I   
Sbjct: 257 LIPVFALGRAQELLLILDEYWIAHPELHGIPIYYASALAKKCMKVYQTYIQMMNERIRAQ 316

Query: 290 FETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNL 348
           F  S  N F+ KH+    + + +DN  D GP + +AS   L++G S  +F  W SD +N 
Sbjct: 317 FAVS--NPFIFKHIK---DINGIDNFNDNGPCVFMASPGMLQSGLSRQLFERWCSDRRNG 371

Query: 349 VLFTERGQFGTLARMLQADP 368
           V+       GTLA+ + ++P
Sbjct: 372 VVIPGYSVEGTLAKHIMSEP 391


>gi|449283675|gb|EMC90280.1| Cleavage and polyadenylation specificity factor subunit 3, partial
           [Columba livia]
          Length = 667

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 91/352 (25%), Positives = 175/352 (49%), Gaps = 14/352 (3%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 12  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 71

Query: 80  LSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSG 139
                F T     +    + D        ++  L+T  D++ +   +  +    N+H   
Sbjct: 72  FKGRTFMTHATKAIYKWLLSDCVKVSNISADDMLYTETDLEESMDKIETI----NFHEVK 127

Query: 140 KGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLIT 199
           +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P +LI 
Sbjct: 128 EVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPDILII 186

Query: 200 DAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS-- 256
           ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  H   
Sbjct: 187 ESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPEL 246

Query: 257 LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAP 316
            + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    D+  
Sbjct: 247 HDIPIYYASSLAKKCMSVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHFDDI- 303

Query: 317 DGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
            GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++P
Sbjct: 304 -GPSIVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHVMSEP 354


>gi|313216448|emb|CBY37756.1| unnamed protein product [Oikopleura dioica]
          Length = 690

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 106/364 (29%), Positives = 180/364 (49%), Gaps = 31/364 (8%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF- 85
           G N L    + D+ DP            ID +L+SH    H G LP+ + +      VF 
Sbjct: 44  GINGLNGLPFMDYTDPD----------KIDILLISHFHLDHCGGLPWFLTKTQFKGRVFM 93

Query: 86  --STEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEG 143
             +T+ +YR  LL+ Y + +S   V E  LFT  D++     +  + +    H++G    
Sbjct: 94  TYATKAIYRW-LLSDYIK-VSNVGVEEL-LFTEKDLEETLDRIETVKFHAEKHING---- 146

Query: 144 IVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYN 203
           I    + AGH+LG   + +   G  V++  D++R +++HL    +    +P +LI ++  
Sbjct: 147 IKFCAYHAGHVLGAAQFMVEIAGVKVLFTGDFSREEDRHLMAAEVPP-QKPDILIMESTY 205

Query: 204 ALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYP 260
             H    R++RE  F   I   +  GG  L+PV + GR  ELLLIL+DYWA+H    + P
Sbjct: 206 GTHLHEKREEREHRFTSVIHDIINRGGRCLIPVFALGRAQELLLILDDYWAQHPELHDIP 265

Query: 261 IYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPK 320
           IY+ + ++   +   +++   M   I K+  T   N F  +H++ L      D+   GP 
Sbjct: 266 IYYASTLAKKCMSVYQTYTNAMNSKIQKAITTR--NPFQFRHISNLKGMEAFDDDI-GPS 322

Query: 321 LVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMS-R 379
           +VLAS   +++G S ++F +W ++ +N V+       GTLA  ++ +P      VTMS +
Sbjct: 323 VVLASPGMMQSGLSRELFEKWCTNKRNGVILAGYAVEGTLAHQIKTEPDE---IVTMSGQ 379

Query: 380 RVPL 383
           ++PL
Sbjct: 380 KLPL 383


>gi|313244184|emb|CBY15021.1| unnamed protein product [Oikopleura dioica]
          Length = 690

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 106/364 (29%), Positives = 180/364 (49%), Gaps = 31/364 (8%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF- 85
           G N L    + D+ DP            ID +L+SH    H G LP+ + +      VF 
Sbjct: 44  GINGLNGLPFMDYTDPD----------KIDILLISHFHLDHCGGLPWFLTKTQFKGRVFM 93

Query: 86  --STEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEG 143
             +T+ +YR  LL+ Y + +S   V E  LFT  D++     +  + +    H++G    
Sbjct: 94  TYATKAIYRW-LLSDYIK-VSNVGVEEL-LFTEKDLEETLDRIETVKFHAEKHING---- 146

Query: 144 IVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYN 203
           I    + AGH+LG   + +   G  V++  D++R +++HL    +    +P +LI ++  
Sbjct: 147 IKFCAYHAGHVLGAAQFMVEIAGVKVLFTGDFSREEDRHLMAAEVPP-QKPDILIMESTY 205

Query: 204 ALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYP 260
             H    R++RE  F   I   +  GG  L+PV + GR  ELLLIL+DYWA+H    + P
Sbjct: 206 GTHLHEKREEREHRFTSVIHDIINRGGRCLIPVFALGRAQELLLILDDYWAQHPELHDIP 265

Query: 261 IYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPK 320
           IY+ + ++   +   +++   M   I K+  T   N F  +H++ L      D+   GP 
Sbjct: 266 IYYASTLAKKCMSVYQTYTNAMNSKIQKAITTR--NPFQFRHISNLKGMEAFDDDI-GPS 322

Query: 321 LVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMS-R 379
           +VLAS   +++G S ++F +W ++ +N V+       GTLA  ++ +P      VTMS +
Sbjct: 323 VVLASPGMMQSGLSRELFEKWCTNKRNGVILAGYAVEGTLAHQIKTEPDE---IVTMSGQ 379

Query: 380 RVPL 383
           ++PL
Sbjct: 380 KLPL 383


>gi|32566029|ref|NP_502553.2| Protein CPSF-3 [Caenorhabditis elegans]
 gi|26985920|emb|CAC44310.2| Protein CPSF-3 [Caenorhabditis elegans]
          Length = 707

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 100/373 (26%), Positives = 176/373 (47%), Gaps = 18/373 (4%)

Query: 4   SVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVAS--TIDAVLLS 61
           S+  TPL          +L+   G   ++DCG +         P         ID +L++
Sbjct: 10  SLCFTPLGSGQEVGRSCHLLEYKGKRVMLDCGVHPGLHGVDALPFVDFVEIENIDLLLIT 69

Query: 62  HPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDD 118
           H    H GALP+ +++       F   +T+ +YR+ LL  Y +           L+T DD
Sbjct: 70  HFHLDHCGALPWLLQKTAFQGKCFMTHATKAIYRM-LLGDYVRISKYGGPDRNQLYTEDD 128

Query: 119 IDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRR 178
           ++ +   +  + + +   ++G    I   P+VAGH+LG   + I   G  V+Y  D++  
Sbjct: 129 LEKSMAKIETIDFREQKEVNG----IRFWPYVAGHVLGACQFMIEIAGVRVLYTGDFSCL 184

Query: 179 KEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDS 237
           +++HL    +   + P VLIT++         R  RE  F   +   +  GG  L+P  +
Sbjct: 185 EDRHLCAAEIPP-ITPQVLITESTYGTQTHEDRAVREKRFTQMVHDIVTRGGRCLIPAFA 243

Query: 238 AGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            G   EL+LIL++YW  H    + P+Y+ + ++   +   ++F+  M   I K  + +  
Sbjct: 244 IGPAQELMLILDEYWESHQELHDIPVYYASSLAKKCMSVYQTFVNGMNSRIQK--QIAVK 301

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F+ KHV+ L    + ++A  GP +VLA+   L++GFS ++F  W  D KN  +     
Sbjct: 302 NPFIFKHVSTLRGMDQFEDA--GPCVVLATPGMLQSGFSRELFESWCPDTKNGCIIAGYC 359

Query: 356 QFGTLARMLQADP 368
             GTLA+ + ++P
Sbjct: 360 VEGTLAKHILSEP 372


>gi|242220452|ref|XP_002475992.1| predicted protein [Postia placenta Mad-698-R]
 gi|220724781|gb|EED78801.1| predicted protein [Postia placenta Mad-698-R]
          Length = 825

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 96/324 (29%), Positives = 166/324 (51%), Gaps = 14/324 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGL---SAPVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+D +L++H    H  AL Y  ++         V+ T P   L    M D ++     +
Sbjct: 48  STVDVLLITHFHLDHAAALTYITEKTNFRDGKGKVYMTHPTKALHKFMMQD-FVRMSSST 106

Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
              LF+  DI  +  S+  ++  Q   +     G+   P+ AGH+LG  ++ I   G  +
Sbjct: 107 SDALFSPLDIQMSLSSIIPVSAHQ---VITPCPGVTFTPYHAGHVLGACMFLIDIAGLKI 163

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAG 228
           +Y  DY+R +++HL    +   +RP VLI ++   +     R+++E+ F + +   +R G
Sbjct: 164 LYTGDYSREEDRHLVKAEVPP-IRPDVLIIESTYGVQTLEGREEKELRFTNLVHSIIRRG 222

Query: 229 GNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSI 286
           G+VLLP  + GR  ELLLIL++YW +H    N PIY+ + ++  ++   ++++  M  ++
Sbjct: 223 GHVLLPTFALGRAQELLLILDEYWKKHPDLQNVPIYYASSLARKSMAVYQTYIHTMNSNV 282

Query: 287 TKSFETSRDNAFLLKHVTLLINKS--ELDNAPDGPKLVLASMASLEAGFSHDIFVEWASD 344
              F   RDN F+ KH++ L      E   A   P +VLAS   + +G S ++   WA D
Sbjct: 283 RSRF-AKRDNPFVFKHISNLPQSKGWERKIAEGPPCVVLASPGFMTSGASRELLELWAPD 341

Query: 345 VKNLVLFTERGQFGTLARMLQADP 368
            +N V+ T     GT+AR +Q++P
Sbjct: 342 SRNGVIITGYSIEGTMAREIQSEP 365


>gi|221484558|gb|EEE22852.1| cleavage and polyadenylation specificity factor, putative
           [Toxoplasma gondii GT1]
          Length = 1100

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 111/398 (27%), Positives = 181/398 (45%), Gaps = 44/398 (11%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSH 62
           V++TPL           +    G   + DCG +  +      P+      +++D  L++H
Sbjct: 110 VEITPLGAGCEVGRSCVIARYKGLTVMFDCGVHPAYSGLGALPIFDAVDMTSVDVCLVTH 169

Query: 63  PDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD---------- 112
               H GALPY + +      VF TEP   +  L     +L   ++S F           
Sbjct: 170 FHLDHCGALPYLVTKTAFRGRVFMTEPTRVISKLV----WLDYARMSAFSQGSRDNQGAA 225

Query: 113 -----------------LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLL 155
                            L+  DD+D+  + V  L + Q   +     GI V+   AGH+L
Sbjct: 226 AAQAAAGSQAEKAGGAFLYDEDDVDATVRMVECLDFHQQVEVG----GIKVSCFGAGHVL 281

Query: 156 GGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE 215
           G  ++ I   G  ++Y  D++R +++H+    +   V   +LI ++   +H    RQ RE
Sbjct: 282 GACMFLIEIGGVRMLYTGDFSRERDRHVPIAEVPP-VDVQLLICESTYGIHVHDDRQLRE 340

Query: 216 -MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTI 272
             F  A+   +  GG  LLPV + GR  ELLLILE+YW  H    + PI FL+ +SS   
Sbjct: 341 RRFLKAVVDIVNRGGKCLLPVFALGRAQELLLILEEYWTAHPEIRHVPILFLSPLSSKCA 400

Query: 273 DYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLL--INKSELDNAPDGPKLVLASMASLE 330
               +F++  G+++ +S     +N F  + V  +  +  + +    DGP +V+A+   L+
Sbjct: 401 VVFDAFVDMCGEAV-RSRALRGENPFAFRFVKNVKSVEAARVYIHHDGPAVVMAAPGMLQ 459

Query: 331 AGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           +G S +IF  WA D KN V+ T     GTLA  L+ +P
Sbjct: 460 SGASREIFEAWAPDAKNGVILTGYSVKGTLADELKREP 497


>gi|254565077|ref|XP_002489649.1| Putative endoribonuclease [Komagataella pastoris GS115]
 gi|238029445|emb|CAY67368.1| Putative endoribonuclease [Komagataella pastoris GS115]
          Length = 784

 Score =  143 bits (361), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 97/342 (28%), Positives = 175/342 (51%), Gaps = 17/342 (4%)

Query: 20  SYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQ 77
           S+++   G   ++D G +  F      P        T+D +L+SH    H  +LPY M++
Sbjct: 31  SHIIQFKGKTVMLDAGVHPAFQGMASLPFYDEFDLGTVDVLLISHFHLDHAASLPYVMQK 90

Query: 78  LGLSAPVFSTEP---VYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQN 134
                 VF T P   +YR  LL  + +  +    S   L++  D+  +F  +  +    +
Sbjct: 91  TNFKGRVFMTHPTKAIYRW-LLNDFVRVTAIDDDSN-QLYSDKDLKDSFDRIETI----D 144

Query: 135 YHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRP 194
           +H + + +GI    + AGH+LG  ++ I   G  V++  D++R +++HL+   +   VRP
Sbjct: 145 FHSTIEIDGIRFTAYQAGHVLGAAMFFIEIAGIKVLFTGDFSREEDRHLSVAEVPP-VRP 203

Query: 195 AVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 253
            VLIT++        PR+++E      I  TL  GG VL+PV + GR  ELLLIL++YW+
Sbjct: 204 DVLITESTFGTATHEPREEKEKKLTTMIHSTLANGGRVLMPVFALGRAQELLLILDEYWS 263

Query: 254 EHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSE 311
           +H    N  +Y+ + ++   +   ++++  M ++I K F  +  N F  +++  + N S+
Sbjct: 264 QHQDLENIKVYYASDLARKCLAVYQTYINMMNENIRKKFRDTNKNPFQFQYIKNIKNLSK 323

Query: 312 LDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
            D+    P +V+AS   L+ G S  +  +WA D +N ++ TE
Sbjct: 324 FDDF--QPSVVVASPGMLQNGVSRALLEKWAPDPRNTLIMTE 363


>gi|312372474|gb|EFR20427.1| hypothetical protein AND_20124 [Anopheles darlingi]
          Length = 692

 Score =  143 bits (361), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 96/356 (26%), Positives = 180/356 (50%), Gaps = 22/356 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +         P   +  A  ID + +SH    H GALP+ +++  
Sbjct: 39  MLEFKGKKIMLDCGIHPGLSGMDALPFVDLIDADQIDLLFISHFHLDHCGALPWFLQKTS 98

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVS-EFDLFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     M   Y+    +S +  L+T  D++++ + +  +    N+
Sbjct: 99  FKGRCFMTHATKAIYRW----MLSDYIKVSNISTDQMLYTEADLEASMEKIETI----NF 150

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H      G+    + AGH+LG  ++ I   G  V+Y  D++R++++HL    + + +RP 
Sbjct: 151 HEERDILGVRFWAYNAGHVLGAAMFMIEIAGIRVLYTGDFSRQEDRHLMAAEIPA-MRPD 209

Query: 196 VLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT++    H    R+ RE  F   + K +  GG  L+PV + GR  ELLLIL++YW++
Sbjct: 210 VLITESTYGTHIHEKREDRENRFTSLVQKIVTQGGRCLIPVFALGRAQELLLILDEYWSQ 269

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           +      PIY+ + ++   +   ++++  M D I +  + + +N F+ + ++ L      
Sbjct: 270 NPDLQEIPIYYASSLAKKCMAVYQTYINAMNDKIRR--QIAINNPFVFRFISNLKGIDHF 327

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           D+   GP +V+AS   +++G S ++F  W +D KN V+       GTLA+ +  +P
Sbjct: 328 DDV--GPCVVMASPGMMQSGLSRELFETWCTDPKNGVIIAGYCVEGTLAKTILFEP 381


>gi|221504752|gb|EEE30417.1| cleavage and polyadenylation specificity factor, putative
           [Toxoplasma gondii VEG]
          Length = 1100

 Score =  143 bits (361), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 111/398 (27%), Positives = 181/398 (45%), Gaps = 44/398 (11%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSH 62
           V++TPL           +    G   + DCG +  +      P+      +++D  L++H
Sbjct: 110 VEITPLGAGCEVGRSCVIARYKGLTVMFDCGVHPAYSGLGALPIFDAVDMTSVDVCLVTH 169

Query: 63  PDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD---------- 112
               H GALPY + +      VF TEP   +  L     +L   ++S F           
Sbjct: 170 FHLDHCGALPYLVTKTAFRGRVFMTEPTRVISKLV----WLDYARMSAFSQGSRDNQGAA 225

Query: 113 -----------------LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLL 155
                            L+  DD+D+  + V  L + Q   +     GI V+   AGH+L
Sbjct: 226 AAQAAAGSQAEKAGGAFLYDEDDVDATVRMVECLDFHQQVEVG----GIKVSCFGAGHVL 281

Query: 156 GGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE 215
           G  ++ I   G  ++Y  D++R +++H+    +   V   +LI ++   +H    RQ RE
Sbjct: 282 GACMFLIEIGGVRMLYTGDFSRERDRHVPIAEVPP-VDVQLLICESTYGIHVHDDRQLRE 340

Query: 216 -MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTI 272
             F  A+   +  GG  LLPV + GR  ELLLILE+YW  H    + PI FL+ +SS   
Sbjct: 341 RRFLKAVVDIVNRGGKCLLPVFALGRAQELLLILEEYWTAHPEIRHVPILFLSPLSSKCA 400

Query: 273 DYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLL--INKSELDNAPDGPKLVLASMASLE 330
               +F++  G+++ +S     +N F  + V  +  +  + +    DGP +V+A+   L+
Sbjct: 401 VVFDAFVDMCGEAV-RSRALRGENPFAFRFVKNVKSVEAARVYIHHDGPAVVMAAPGMLQ 459

Query: 331 AGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           +G S +IF  WA D KN V+ T     GTLA  L+ +P
Sbjct: 460 SGASREIFEAWAPDAKNGVILTGYSVKGTLADELKREP 497


>gi|343428147|emb|CBQ71677.1| related to YSH1-component of pre-mRNA polyadenylation factor PF I
           [Sporisorium reilianum SRZ2]
          Length = 878

 Score =  143 bits (361), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 91/322 (28%), Positives = 168/322 (52%), Gaps = 13/322 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLS---APVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+DA+L++H    H  AL Y M++         V+ T P   +    M D        +
Sbjct: 74  STVDAILITHFHLDHAAALTYIMEKTNFRDGHGKVYMTHPTKAVYRFLMSDFVRISNAGN 133

Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
           + +LF  +++ ++++ +  + + Q+  ++G   G+    + AGH+LG  ++ I   G  +
Sbjct: 134 DDNLFDENEMFASWRQIEAVDFHQDVSIAG---GLRFTAYHAGHVLGACMFLIEIAGLRI 190

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAG 228
           +Y  D++R +++HL    +   V+P VLI ++        PR  +E  F   I   ++ G
Sbjct: 191 LYTGDFSREEDRHLVQAEIPP-VKPDVLICESTYGTQTHEPRLDKEHRFTSQIHHIIKRG 249

Query: 229 GNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSI 286
           G VLLPV   GR  ELLL+L++YWA H    + PIY+ + ++   I   ++++  M D I
Sbjct: 250 GRVLLPVFVLGRAQELLLLLDEYWAAHPELHSVPIYYASALAKKCISVYQTYIHTMNDHI 309

Query: 287 TKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVK 346
              F   RDN F+ KH++ L +  + ++   GP +++AS   +++G S ++   WA D +
Sbjct: 310 RTRF-NRRDNPFVFKHISNLRSLEKFEDR--GPCVMMASPGFMQSGVSRELLERWAPDKR 366

Query: 347 NLVLFTERGQFGTLARMLQADP 368
           N ++ +     GT+AR +  +P
Sbjct: 367 NGLIVSGYSVEGTMARNILNEP 388


>gi|71005902|ref|XP_757617.1| hypothetical protein UM01470.1 [Ustilago maydis 521]
 gi|74703664|sp|Q4PEJ3.1|YSH1_USTMA RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
           3'-end-processing protein YSH1
 gi|46097110|gb|EAK82343.1| hypothetical protein UM01470.1 [Ustilago maydis 521]
          Length = 880

 Score =  143 bits (361), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 91/322 (28%), Positives = 168/322 (52%), Gaps = 13/322 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLS---APVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+DA+L++H    H  AL Y M++         V+ T P   +    M D        +
Sbjct: 74  STVDAILITHFHLDHAAALTYIMEKTNFRDGHGKVYMTHPTKAVYRFLMSDFVRISNAGN 133

Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
           + +LF  +++ ++++ +  + + Q+  ++G   G+    + AGH+LG  ++ I   G  +
Sbjct: 134 DDNLFDENEMLASWRQIEAVDFHQDVSIAG---GLRFTSYHAGHVLGACMFLIEIAGLRI 190

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAG 228
           +Y  D++R +++HL    +   V+P VLI ++        PR  +E  F   I   ++ G
Sbjct: 191 LYTGDFSREEDRHLVQAEIPP-VKPDVLICESTYGTQTHEPRLDKEHRFTSQIHHIIKRG 249

Query: 229 GNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSI 286
           G VLLPV   GR  ELLL+L++YWA H    + PIY+ + ++   I   ++++  M D I
Sbjct: 250 GRVLLPVFVLGRAQELLLLLDEYWAAHPELHSVPIYYASALAKKCISVYQTYIHTMNDHI 309

Query: 287 TKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVK 346
              F   RDN F+ KH++ L +  + ++   GP +++AS   +++G S ++   WA D +
Sbjct: 310 RTRF-NRRDNPFVFKHISNLRSLEKFEDR--GPCVMMASPGFMQSGVSRELLERWAPDKR 366

Query: 347 NLVLFTERGQFGTLARMLQADP 368
           N ++ +     GT+AR +  +P
Sbjct: 367 NGLIVSGYSVEGTMARNILNEP 388


>gi|126030715|pdb|2I7X|A Chain A, Structure Of Yeast Cpsf-100 (Ydh1p)
          Length = 717

 Score =  143 bits (360), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 135/494 (27%), Positives = 214/494 (43%), Gaps = 85/494 (17%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPS------LLQPLSKVASTIDAVLLSHPDTLHLGA---LP 72
           +V  D    LID GWN    PS       ++   KV   ID ++LS P    LGA   L 
Sbjct: 19  VVRFDNVTLLIDPGWN----PSKVSYEQCIKYWEKVIPEIDVIILSQPTIECLGAHSLLY 74

Query: 73  YAMKQLGLSA-PVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLD--DIDSAFQSVTRL 129
           Y      +S   V++T PV  LG ++  D Y S   +  +D   LD  DI+ +F  +  L
Sbjct: 75  YNFTSHFISRIQVYATLPVINLGRVSTIDSYASAGVIGPYDTNKLDLEDIEISFDHIVPL 134

Query: 130 TYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN----- 184
            YSQ   L  + +G+ +  + AG   GG++W I+   E ++YA  +N  ++  LN     
Sbjct: 135 KYSQLVDLRSRYDGLTLLAYNAGVCPGGSIWCISTYSEKLVYAKRWNHTRDNILNAASIL 194

Query: 185 ---GTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRV 241
              G  L + +RP+ +IT       +QP +++ ++F+D + K L + G+V++PVD +G+ 
Sbjct: 195 DATGKPLSTLMRPSAIITTLDRFGSSQPFKKRSKIFKDTLKKGLSSDGSVIIPVDMSGKF 254

Query: 242 LELL-----LILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN 296
           L+L      L+ E          P+  L+Y    T+ Y KS LEW+  S+ K++E +R+N
Sbjct: 255 LDLFTQVHELLFESTKINAHTQVPVLILSYARGRTLTYAKSMLEWLSPSLLKTWE-NRNN 313

Query: 297 A--FLLKHVTLLINKSELDNAPDGPKLVLASMA------------------------SLE 330
              F +     +I  +EL   P G K+   S                          S E
Sbjct: 314 TSPFEIGSRIKIIAPNELSKYP-GSKICFVSEVGALINEVIIKVGNSEKTTLILTKPSFE 372

Query: 331 AGFSHDIFVEWA-SDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELI 389
              S D  +E    D +N   F E G+       +  D           +  PL  EE  
Sbjct: 373 CASSLDKILEIVEQDERNWKTFPEDGKSFLCDNYISID---------TIKEEPLSKEETE 423

Query: 390 AYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGR 449
           A++ +    K++   K  LVK E  K +       +G+ ++ D N   A          R
Sbjct: 424 AFKVQLKEKKRDRNKKILLVKRESKKLA-------NGNAIIDDTNGERAM---------R 467

Query: 450 YRDILIDGF--VPP 461
            +DIL++    VPP
Sbjct: 468 NQDILVENVNGVPP 481


>gi|410898094|ref|XP_003962533.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Takifugu rubripes]
          Length = 691

 Score =  143 bits (360), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 97/355 (27%), Positives = 182/355 (51%), Gaps = 20/355 (5%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 36  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 95

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYH 136
                F   +T+ +YR  LL+ Y + +S     E  L+   D++ +   +  +    N+H
Sbjct: 96  FKGRTFMTHATKAIYRW-LLSDYVK-VSNISADEM-LYAETDLEESMDKIETI----NFH 148

Query: 137 LSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAV 196
              +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + S V+P +
Sbjct: 149 EVREVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPS-VKPDI 207

Query: 197 LITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEH 255
           LI ++    H    R++RE  F + +   +   G  L+PV + GR  ELLLIL++YW  H
Sbjct: 208 LIIESTYGTHIHEKREEREARFCNTVHDIVNREGRCLIPVFALGRAQELLLILDEYWQNH 267

Query: 256 S--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELD 313
               + PIY+ + ++   +   ++++  M D I K+     +N F+ KH++ L +    D
Sbjct: 268 PELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKAINI--NNPFVFKHISNLKSMDHFD 325

Query: 314 NAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           +   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++P
Sbjct: 326 DI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP 378


>gi|401827745|ref|XP_003888165.1| putative RNA-processing beta-lactamase-fold exonuclease
           [Encephalitozoon hellem ATCC 50504]
 gi|392999365|gb|AFM99184.1| putative RNA-processing beta-lactamase-fold exonuclease
           [Encephalitozoon hellem ATCC 50504]
          Length = 643

 Score =  143 bits (360), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 107/387 (27%), Positives = 185/387 (47%), Gaps = 24/387 (6%)

Query: 5   VQVTPLSGVFNENPLS-YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLS 61
           +++ PL G  NE   S  +V   G   ++DCG +  +      P   +   S IDA+ ++
Sbjct: 7   IKIMPL-GAGNEVGRSCVIVECGGRTIMLDCGVHPAYTGVASLPFLDLVDLSKIDAIFIT 65

Query: 62  HPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDS 121
           H    H  ALP+  ++      V+ T P   +    + D        S+ D +T  D+  
Sbjct: 66  HFHLDHAAALPFLTEKTSFKGKVYMTHPTKAILKWLLNDYIRLINAASDADFYTETDLVK 125

Query: 122 AFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEK 181
            +  +  + Y Q  ++    +GI V    AGH+LG  ++ I  +   V+Y  D++R +++
Sbjct: 126 CYDRIIPIDYHQEVNV----KGIKVKALNAGHVLGAAMFLIEIEKSKVLYTGDFSREEDR 181

Query: 182 HLNGTVLES-FVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAG 239
           HL     ES   +   LIT++   +    PR +RE  F   +   ++ GG  LLPV + G
Sbjct: 182 HLKAA--ESPGCKIDALITESTYGVQCHLPRAEREGRFTSIVQNVVQRGGRCLLPVFALG 239

Query: 240 RVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA 297
           R  ELLLILE++W  ++     PIY+ + ++   +   ++++  M + I K     R N 
Sbjct: 240 RAQELLLILEEHWGSNASLQKIPIYYASALAKRCMGVYQTYIGMMNERIQK-LSLVR-NP 297

Query: 298 FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQF 357
           F  K+V  L      D+  +GP +++AS   L++G S D+F  W SD +N V+       
Sbjct: 298 FAFKYVKNLKGIDSFDD--EGPCVIMASPGMLQSGLSRDLFERWCSDSRNAVIIPGYCVD 355

Query: 358 GTLARMLQADPPP------KAVKVTMS 378
           GTLA+ + ++P        K +++ MS
Sbjct: 356 GTLAKEILSEPKEIEALNGKKLRLNMS 382


>gi|395332776|gb|EJF65154.1| Metallo-hydrolase/oxidoreductase [Dichomitus squalens LYAD-421 SS1]
          Length = 809

 Score =  143 bits (360), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 100/325 (30%), Positives = 163/325 (50%), Gaps = 16/325 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGL---SAPVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+D +L++H    H  AL Y M++         V+ T P   L    M D    R   S
Sbjct: 57  STVDVLLITHFHLDHAAALTYIMEKTNFRDGKGKVYMTHPTKALHKFMMQD--FVRMSTS 114

Query: 110 EFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
             D LFT  ++  +  S+  ++  Q   +     G+   P+ AGH+LG  ++ I   G  
Sbjct: 115 SADTLFTPLEMSMSLASIIPVSAHQ---VITPCPGVTFTPYHAGHVLGACMFLIDIAGLK 171

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRA 227
           ++Y  DY+R +++HL    +   + P VLI ++   + +  PR  +E  F + +   +R 
Sbjct: 172 ILYTGDYSREEDRHLVKAEIPP-IHPDVLIVESTYGVQSHEPRDDKEARFTNLVHSIIRR 230

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG+VLLP  + GR  ELLLIL++YWA+H    N PIY+ + ++   +   ++++  M  +
Sbjct: 231 GGHVLLPTFALGRAQELLLILDEYWAKHPDLHNVPIYYASSLARKCMAVYQTYIHTMNSN 290

Query: 286 ITKSFETSRDNAFLLKHVTLLINKS--ELDNAPDGPKLVLASMASLEAGFSHDIFVEWAS 343
           +   F   RDN F+ KH+T +      E   A   P +VLAS   + +G S ++   WAS
Sbjct: 291 VRTRF-AKRDNPFVFKHITNVPGTRGWERKIAEGPPCVVLASPGFMNSGPSRELLELWAS 349

Query: 344 DVKNLVLFTERGQFGTLARMLQADP 368
           D KN  + T     GT+AR +  +P
Sbjct: 350 DSKNGCIVTGYSVEGTMARDILNEP 374


>gi|255570075|ref|XP_002526000.1| cleavage and polyadenylation specificity factor, putative [Ricinus
           communis]
 gi|223534732|gb|EEF36424.1| cleavage and polyadenylation specificity factor, putative [Ricinus
           communis]
          Length = 963

 Score =  143 bits (360), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 108/360 (30%), Positives = 174/360 (48%), Gaps = 20/360 (5%)

Query: 22  LVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
           +V+I+G   + DCG    ++DH    D SL+       S +  V+++H    H+GALPY 
Sbjct: 20  VVTINGKRIMFDCGMHMGYDDHRRYPDFSLISKSGDFDSALHCVIITHFHLDHVGALPYF 79

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTM--YDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYS 132
            +  G + PV+ T P   L  L +  Y + +  R+  E + FT D I      V  +   
Sbjct: 80  TEVCGYNGPVYMTYPTKALSPLMLEDYRKVMVDRR-GEEEQFTADHIKQCLNKVIAVDLK 138

Query: 133 QNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFV 192
           Q   +    + + +  + AGH+LG  ++        ++Y  DYN   ++HL    ++  +
Sbjct: 139 QTVQVD---KDLQIRAYYAGHVLGAAMFYAKVGDSAMVYTGDYNMTPDRHLGAAQIDR-L 194

Query: 193 RPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDY 251
           +  +LIT++  A   +  +  RE  F   + K +  GG VL+P  + GR  EL L+L+DY
Sbjct: 195 QLDLLITESTYATTIRDSKYAREREFLKVVHKCVAGGGKVLIPTFALGRAQELCLLLDDY 254

Query: 252 WAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSE 311
           W   +L  PIYF   ++     Y K  + W    I +++ TSR NAF  K+V    ++S 
Sbjct: 255 WERMNLKVPIYFSAGLTIQANMYYKMLIGWTSQKIKETY-TSR-NAFDFKNVYTF-DRSL 311

Query: 312 LDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPK 371
           LD AP GP ++ A+   +  GFS ++F  WA    NLV        GT+   L +  P K
Sbjct: 312 LD-AP-GPCVLFATPGMISGGFSLEVFKRWAPCEMNLVTLPGYCVAGTIGHKLMSGKPSK 369


>gi|255934198|ref|XP_002558380.1| Pc12g15810 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211582999|emb|CAP81208.1| Pc12g15810 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 893

 Score =  143 bits (360), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 120/436 (27%), Positives = 183/436 (41%), Gaps = 90/436 (20%)

Query: 8   TPLSGV---FNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPD 64
           TPL G    ++    S L    G   L+D GW++ F+   L  L K   T+  +LL+H  
Sbjct: 5   TPLLGAQSSYSRASQSILELDGGIKILVDVGWDEKFNTLDLAELEKHIPTLSLILLTHAT 64

Query: 65  TLHLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLS---------RRQVSE--- 110
             H+GAL +  +   L    P+++T PV   G   + D Y S         +  VSE   
Sbjct: 65  PAHIGALVHCCRTFPLFTQIPIYATNPVIAFGRTLLQDLYASAPLAATFLPKASVSEPGA 124

Query: 111 -----------------------FDLFTLDDIDSAFQSVTRLTYSQNYHLSGKG-----E 142
                                      T ++I   F  +  L YSQ +            
Sbjct: 125 SSAGSATVSGGDTEAAGSASRILLQSPTAEEISRYFSLIQPLKYSQPHQPLPSPFSPPLN 184

Query: 143 GIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------VLES 190
           G+ +  + AGH +GGT+W I    E ++YAVD+N+ +E  + G             V+E 
Sbjct: 185 GLTLTAYNAGHTVGGTIWHIQHGLESIVYAVDWNQARESVVAGAAWFGGSGASGTEVIEQ 244

Query: 191 FVRPAVLI--TDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLI 247
             +P  LI  T   + L     R++R ++  D I  +L  GG VL+P D++ RVLEL   
Sbjct: 245 LRKPTALICSTTGGDKLAPSGGRKKRDDLLLDMIRSSLAKGGTVLIPTDTSARVLELAYA 304

Query: 248 LEDYWAEHS--------LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE-------- 291
           LE  W + +            +Y      ++TI   +S LEWM ++I + FE        
Sbjct: 305 LEHSWRDAANGDKEDVLQGAGLYLAGKKVTNTIRLARSMLEWMDENIVREFEAAESADVT 364

Query: 292 -----------TSRDNA-FLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHDI 337
                      TS+    F  KH+ ++  K  L+   A  GPK++LAS  S++ GFS   
Sbjct: 365 NGQRTGGQDKSTSKGGGPFTFKHLKIIERKKRLEKLLAEPGPKVILASDTSMDWGFSKHA 424

Query: 338 FVEWASDVKNLVLFTE 353
             + A    NL+L TE
Sbjct: 425 LRQVAEGPNNLLLMTE 440



 Score = 59.7 bits (143), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 71/304 (23%), Positives = 113/304 (37%), Gaps = 96/304 (31%)

Query: 468 MFPFYENNSEWDDFGEVINPDDYIIKDEDMDQAAMHIGGDDGKLDEGSASLILDAKPSKV 527
           MFP+     + D++GE I P+D +   ED D AA      +G+  EG A ++ + +   +
Sbjct: 565 MFPYVAPRKKGDEYGEFIRPEDLVSDGEDADVAAESEDEVEGQSFEGPAKVVYNTQTITI 624

Query: 528 ------------------------VSNELTVLVHGSAEATEHLKQHCLKHVCPH------ 557
                                   +  +  +LV G  E T  L   C K +         
Sbjct: 625 NARIAFIDFMGLHDKRSLEMLIPLIQPQKLILVGGMKEETSALAAECQKLLTVKLGATVS 684

Query: 558 ---------VYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFK----------------- 591
                    ++TP   E ID + D  A+ V+LS  L+  + ++                 
Sbjct: 685 DPAFDSAAIIFTPANREVIDASVDTNAWNVKLSNTLVRRLNWQHVRSLGVVALTAQLRGP 744

Query: 592 ---KLGDYEIAW--------------VDAEVGKTENG---------MLSLLPISTPAPPH 625
              ++GD E +               V  E+G+ +           +L  LP S  A   
Sbjct: 745 EPAEIGDVETSGKKMKQLKDEAASSAVAPELGQADTKIIDKVEVYPLLDTLPASMAAGTR 804

Query: 626 ---KSVLVGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKVGPAGQKGGGSGTQQ 681
              + + VGDL++ADL+  + S G   EF G G L   + V +RK          SGT +
Sbjct: 805 SMARPLHVGDLRLADLRKLMQSAGHTAEFRGEGTLLIDKSVAVRK----------SGTGK 854

Query: 682 IVIE 685
           I IE
Sbjct: 855 IEIE 858


>gi|68471691|ref|XP_720152.1| hypothetical protein CaO19.7957 [Candida albicans SC5314]
 gi|68471954|ref|XP_720020.1| hypothetical protein CaO19.325 [Candida albicans SC5314]
 gi|46441870|gb|EAL01164.1| hypothetical protein CaO19.325 [Candida albicans SC5314]
 gi|46442007|gb|EAL01300.1| hypothetical protein CaO19.7957 [Candida albicans SC5314]
          Length = 931

 Score =  142 bits (359), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 142/548 (25%), Positives = 232/548 (42%), Gaps = 89/548 (16%)

Query: 28  FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGA---LPYAMKQLGLSAPV 84
           F  + D  WN   D +    + +     +A+LLSH     +     L      L  S PV
Sbjct: 27  FKLIADPSWNG-VDVNAAMFMEEHLKETNAILLSHSTAEFISGFILLCIKFPILMSSIPV 85

Query: 85  FSTEPVYRLGLLTMYDQYLSRRQVSEFD--LFTLDDIDSAFQSVTRLTYSQNYHLSGKGE 142
           +ST PV +LG ++  + Y +   +   D  +  LD++D+ F  V  L Y Q+ +L     
Sbjct: 86  YSTLPVNQLGRVSTVEYYRAMGFLGPVDSAILELDEVDNWFDKVNLLKYQQSLNLFD--N 143

Query: 143 GIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN---------GTVLESFVR 193
            +VV P+ AGH LGGT W ITK  + VIYA  +N  K+  LN         G    S +R
Sbjct: 144 KVVVTPYNAGHSLGGTFWLITKRIDRVIYAPAWNHSKDSFLNSASFISPSTGNPHLSLLR 203

Query: 194 PAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 253
           P   IT A +       R++ E F   +  TL  GG  +LP   +GR LEL  +++++  
Sbjct: 204 PTAFIT-ATDMGSVMSHRKRTEKFLQLVDATLANGGAAVLPTSLSGRFLELFHLIDEHLK 262

Query: 254 EHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELD 313
              +  P+YFL+Y  +  + Y  + L+WM  S TK +E      F    V LL++ SEL 
Sbjct: 263 GAPI--PVYFLSYSGTKILTYASNLLDWMSKSFTKEWEELSSVPFNPSKVDLLLDPSELL 320

Query: 314 NAPDGPKLVLASMASLEAG-FSHDIFVEWASDVKNLVLFTER--------------GQFG 358
               GPK+V  S   L +G  S + F    +D    ++ TE+               ++ 
Sbjct: 321 KL-SGPKIVFCSGIDLRSGDISAEAFQYLCNDEHTTIILTEKTTMNFASSLSSVLYTEWD 379

Query: 359 TLARMLQADPPPKAVKVTM---------SRRVPLVGEELIAYEEEQTRLKKEEALKASLV 409
           +LA+          + V +         ++ V L G EL  ++E+  + +KE+ L  + V
Sbjct: 380 SLAKKRGGGESEDGIAVPIDKNISLKNWTKEVELTGTELTEFQEKVAQKRKEKLL--AKV 437

Query: 410 KEEESKASLGPDN----------------------NLSGDPMVIDANNANASADVVEPHG 447
           ++++++  L  D                       N S + ++    N N +   V P+ 
Sbjct: 438 RDQKNQNILSADTVDSEDSSDDDDEGDNEAEKQKGNTSSNLLIKQYQNINVADSNVAPNE 497

Query: 448 ----GRYRDILIDGFVPPSTSVAPM--------------FPFY--ENNSEWDDFGEVINP 487
                 +   + D          P+              FP++   +  ++DD+GEVI  
Sbjct: 498 VNPLATHEAFITDHIKQSLEKNLPIDLKITHKLRPRQATFPYFATAHKQKFDDYGEVIKI 557

Query: 488 DDYIIKDE 495
           +DY   DE
Sbjct: 558 EDYQRHDE 565


>gi|392512873|emb|CAD25809.2| similarity to HYPOTHETICAL PROTEIN Y162_METJA [Encephalitozoon
           cuniculi GB-M1]
          Length = 643

 Score =  142 bits (359), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 107/387 (27%), Positives = 185/387 (47%), Gaps = 24/387 (6%)

Query: 5   VQVTPLSGVFNENPLS-YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLS 61
           +++ PL G  NE   S  +V   G   ++DCG +  +      P   +   S IDAV ++
Sbjct: 7   IKIMPL-GAGNEVGRSCVIVECGGRTIMLDCGVHPAYTGMASLPFLDLVDLSKIDAVFIT 65

Query: 62  HPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDS 121
           H    H  ALP+  ++      V+ T P   +    + D        S+ D +T  D+  
Sbjct: 66  HFHLDHAAALPFLTEKTSFRGKVYMTHPTKAILKWLLNDYIRIINASSDTDFYTETDLVK 125

Query: 122 AFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEK 181
            +  +  + Y Q  ++    +GI V    AGH+LG  ++ +  +   ++Y  D++R +++
Sbjct: 126 CYDRIIPIDYHQEVNV----KGIKVKALNAGHVLGAAMFLVEIEKSKILYTGDFSREEDR 181

Query: 182 HLNGTVLES-FVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAG 239
           HL     ES   +   LIT++   +    PR +RE  F   +   ++ GG  LLPV + G
Sbjct: 182 HLKAA--ESPGCKIDALITESTYGVQCHLPRAEREGRFTSIVQNVVQRGGRCLLPVFALG 239

Query: 240 RVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA 297
           R  ELLLILE++W  ++     PIY+ + ++   +   ++++  M + I K     R N 
Sbjct: 240 RAQELLLILEEHWGSNTSLQKIPIYYASALAKRCMGVYQTYIGMMNERIQK-LSLVR-NP 297

Query: 298 FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQF 357
           F  K+V  L      D+  +GP +++AS   L++G S D+F  W SD KN V+       
Sbjct: 298 FAFKYVKNLKGIDSFDD--EGPCVIMASPGMLQSGLSRDLFERWCSDSKNAVIIPGYCVD 355

Query: 358 GTLARMLQADPPP------KAVKVTMS 378
           GTLA+ + ++P        K +++ MS
Sbjct: 356 GTLAKEILSEPKEIEAMNGKKLRLNMS 382


>gi|342320223|gb|EGU12165.1| Cleavage and polyadenylation specificity factor subunit [Rhodotorula
            glutinis ATCC 204091]
          Length = 1010

 Score =  142 bits (359), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 187/816 (22%), Positives = 315/816 (38%), Gaps = 253/816 (31%)

Query: 114  FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKI----------T 163
             T  +I  AF ++  + ++Q  HL+G  +G  +  H +GH LGG+++ +           
Sbjct: 214  LTTQEIRDAFLAINAVRWTQPIHLTGPLKGYTLVAHRSGHTLGGSLYTLRPSLSSSLSPA 273

Query: 164  KDGEDVIYAVDYNRRKEKHLN-------GTVLESFVRPAVLITDAYNALHNQPPRQQREM 216
                 ++YA  +N  KE HL+       G V ++F R  V+I  A  +      R  RE 
Sbjct: 274  SSASSLLYAPLFNHVKEHHLDPTSLLNAGNVDDNFRRMGVMIVGAERSKVVNIKRIDRER 333

Query: 217  -FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL--NYPIYFLTYVSSSTID 273
               D I+ TL+AGG++LLP D + R+ ELL++LE +W   +L   +P+  ++      + 
Sbjct: 334  KMLDLITSTLQAGGSILLPTDPSARLFELLILLETHWQFANLGQQFPLCLISRTGREAVG 393

Query: 274  YVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDN-----APDGPKLVLASMAS 328
            +V+S  EWMG  I  S       A  LK   L I  S LD       P  PKL+L   ++
Sbjct: 394  FVRSLTEWMGGQIAGS------GADKLKFANLRIFSS-LDEIATTIPPSVPKLILTVPST 446

Query: 329  LEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQAD--------------------- 367
            L  G+S  +F+++A +  NLVL T   + G+LAR L  +                     
Sbjct: 447  LSYGYSRALFLDFARNAANLVLLTGLSEPGSLARWLAREVWEPQQEKGCKYGEGKVGKEV 506

Query: 368  PPPKAVKVTMSRRVPLVGEEL---IAYEEEQTRL--KKEEALKASLVKEEESKASLGPDN 422
               + +++ + R+V L G+EL   +A E E   L  +++ AL+ S    +++      D 
Sbjct: 507  KMDQTIELEIKRKVYLEGDELEAHLAAEREAAELVARQQAALERSRRMLQDNAGGDSDDE 566

Query: 423  NLSG------------------DPM-------------------VIDANNANASADVVEP 445
            + S                    PM                    +DA   + SA     
Sbjct: 567  SDSEGEEADAAEEANGAAVDEDQPMPVRRRRLGGFTGGAGAWDEFLDAETLSGSA----- 621

Query: 446  HGGRYRDILIDGFVPPSTSVA-----PMFPFYENNSEWDDFGEVINPDDYIIKDEDMD-- 498
             GG+  DI + G     ++        MFP  E     D +GE I+ + ++ + +D D  
Sbjct: 622  -GGQVFDIYVRGSYGVRSAAGGLPRFRMFPVVERKRRVDAYGEAIDVEGWLRRGQDDDPL 680

Query: 499  ----------------------------------------QAAMHIGGDDGKLDEGSA-- 516
                                                    QA + +   +G L +G A  
Sbjct: 681  SPNNAQVLGKRAREEEKEPEPEEKPDPPHKYVVDRVEVPLQALLFVVDMEG-LSDGRALK 739

Query: 517  SLILDAKPSKVVSNELTVLVHGSAEATEHLKQHC--LKHVCPHVYTPQIEETIDVTSDLC 574
            +++    P K+      V+V G +EA + L   C  +  +   +YTP + ETI V  +  
Sbjct: 740  TILPQINPRKL------VIVDGPSEAIQDLAGACKAVTSMTEDIYTPSLGETIKVGEETK 793

Query: 575  AYKVQLSEKLMSNVLFKKLGDYEIAWV--------------------------------- 601
             + ++L + +M+ +   ++ DY++A+V                                 
Sbjct: 794  NFSIRLGDSIMATLRLSRVEDYDVAYVSGIVHIDPESDLPVLERPTFADAASAPSALPAP 853

Query: 602  -----------------DAEVGKTENGML------SLLPISTPAPPHKSVLVGDLKMADL 638
                             +AE    E G        S+LP   P     S+ +GDL++A L
Sbjct: 854  DGTDTTIASGDGGPAPTEAEQADAEEGASEEPADPSILPALKP-----SLFIGDLRLALL 908

Query: 639  KPFLSSKGIQVEFAG-GALRCG--------------------------EYVTIRKVGPAG 671
            K  L++  +  EF G G L CG                          ++V    +  A 
Sbjct: 909  KERLAALKVPSEFTGEGILVCGPAPPEAFDFDFSGAASRAGIDTRKGAKFVRDALLNEAM 968

Query: 672  QKGGGS------GTQQIVIEGPLCEDYYKIRAYLYS 701
            +  GG       G  ++V+EG   E Y+ +R  +Y+
Sbjct: 969  EASGGRVAVRKVGRGRLVLEGGPGETYFVVRRAVYA 1004



 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 40/147 (27%), Positives = 69/147 (46%), Gaps = 25/147 (17%)

Query: 4   SVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLL-----------QP----- 47
           S+ +TPLS   +  P +YL+++D    L+DCG  D    + L           +P     
Sbjct: 2   SITITPLSA--HPLPPTYLLTVDNAQILLDCGSYDKGREATLPSTSTSSALTDEPTSEQV 59

Query: 48  ------LSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQ 101
                 L K+A +++ VLLSHP    LG LP+   + GL  PV+ T P   +G   + ++
Sbjct: 60  TEYLSILRKLAPSLNLVLLSHPLLTSLGLLPFLRARCGLRCPVYGTLPTREMGRYAV-EE 118

Query: 102 YLSRRQVSEFDLFTLDDIDSAFQSVTR 128
           ++  R  +E +    + ++ A  +  R
Sbjct: 119 WVEARSAAEKNEIRYEALEQAVGASKR 145


>gi|6323144|ref|NP_013216.1| Cft2p [Saccharomyces cerevisiae S288c]
 gi|74645023|sp|Q12102.1|CFT2_YEAST RecName: Full=Cleavage factor two protein 2; AltName: Full=105 kDa
           protein associated with polyadenylation factor I
 gi|1256878|gb|AAB67560.1| Ydh1p: 105 kDa protein associated with polyadenylation factor 1 (PF
           I) [Saccharomyces cerevisiae]
 gi|1297030|emb|CAA61694.1| L2946 [Saccharomyces cerevisiae]
 gi|1360512|emb|CAA97682.1| CFT2 [Saccharomyces cerevisiae]
 gi|151941280|gb|EDN59658.1| cleavage factor II (CF II) component [Saccharomyces cerevisiae
           YJM789]
 gi|256271979|gb|EEU06997.1| Cft2p [Saccharomyces cerevisiae JAY291]
 gi|285813533|tpg|DAA09429.1| TPA: Cft2p [Saccharomyces cerevisiae S288c]
 gi|392297633|gb|EIW08732.1| Cft2p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 859

 Score =  142 bits (359), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 135/494 (27%), Positives = 214/494 (43%), Gaps = 85/494 (17%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPS------LLQPLSKVASTIDAVLLSHPDTLHLGA---LP 72
           +V  D    LID GWN    PS       ++   KV   ID ++LS P    LGA   L 
Sbjct: 19  VVRFDNVTLLIDPGWN----PSKVSYEQCIKYWEKVIPEIDVIILSQPTIECLGAHSLLY 74

Query: 73  YAMKQLGLSA-PVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLD--DIDSAFQSVTRL 129
           Y      +S   V++T PV  LG ++  D Y S   +  +D   LD  DI+ +F  +  L
Sbjct: 75  YNFTSHFISRIQVYATLPVINLGRVSTIDSYASAGVIGPYDTNKLDLEDIEISFDHIVPL 134

Query: 130 TYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN----- 184
            YSQ   L  + +G+ +  + AG   GG++W I+   E ++YA  +N  ++  LN     
Sbjct: 135 KYSQLVDLRSRYDGLTLLAYNAGVCPGGSIWCISTYSEKLVYAKRWNHTRDNILNAASIL 194

Query: 185 ---GTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRV 241
              G  L + +RP+ +IT       +QP +++ ++F+D + K L + G+V++PVD +G+ 
Sbjct: 195 DATGKPLSTLMRPSAIITTLDRFGSSQPFKKRSKIFKDTLKKGLSSDGSVIIPVDMSGKF 254

Query: 242 LELL-----LILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN 296
           L+L      L+ E          P+  L+Y    T+ Y KS LEW+  S+ K++E +R+N
Sbjct: 255 LDLFTQVHELLFESTKINAHTQVPVLILSYARGRTLTYAKSMLEWLSPSLLKTWE-NRNN 313

Query: 297 A--FLLKHVTLLINKSELDNAPDGPKLVLASMA------------------------SLE 330
              F +     +I  +EL   P G K+   S                          S E
Sbjct: 314 TSPFEIGSRIKIIAPNELSKYP-GSKICFVSEVGALINEVIIKVGNSEKTTLILTKPSFE 372

Query: 331 AGFSHDIFVEWA-SDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELI 389
              S D  +E    D +N   F E G+       +  D           +  PL  EE  
Sbjct: 373 CASSLDKILEIVEQDERNWKTFPEDGKSFLCDNYISID---------TIKEEPLSKEETE 423

Query: 390 AYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGR 449
           A++ +    K++   K  LVK E  K +       +G+ ++ D N   A          R
Sbjct: 424 AFKVQLKEKKRDRNKKILLVKRESKKLA-------NGNAIIDDTNGERAM---------R 467

Query: 450 YRDILIDGF--VPP 461
            +DIL++    VPP
Sbjct: 468 NQDILVENVNGVPP 481


>gi|119195099|ref|XP_001248153.1| hypothetical protein CIMG_01924 [Coccidioides immitis RS]
          Length = 1015

 Score =  142 bits (359), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 121/431 (28%), Positives = 183/431 (42%), Gaps = 105/431 (24%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAM----------- 75
           G   LID GW++ FDPS L+ L K   T+  +LL+H    H+GA  Y +           
Sbjct: 27  GVKILIDVGWDETFDPSALKELEKHIPTLSLILLTHATPSHIGAFVYCLYATYPVISFGR 86

Query: 76  ---KQLGLSAPVFST--------------------EPVYRLGLLTMYDQYLSRRQVSEFD 112
              + L  SAP+ ST                    +P    G LT  D  L+     +  
Sbjct: 87  SLLQDLYSSAPLASTFLPTTSSISDSNGSGSVPTQDPTAPAGALTEGD-TLNSTTAGKIL 145

Query: 113 LF--TLDDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGHLLGGTVWKITKD 165
           L   T +DI   F  +  L YSQ +            G+ +  + AGH +GGT+W I   
Sbjct: 146 LPSPTSEDIARHFSLIHPLKYSQPHQPLPSPFSPPLNGLTITAYNAGHTVGGTIWHIQHG 205

Query: 166 GEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAYNALHNQP--PR 211
            E ++YAVD+N+ +E  + G             V+E   +P  L+  A       P   R
Sbjct: 206 MESIVYAVDWNQARENVIAGAAWFGSSGANRTDVIEQLRKPTALVCSAKGGDKFAPGGGR 265

Query: 212 QQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW--------AEHSL-NYPI 261
           ++R ++  D I   +   G VLLP D++ RVLEL  +LE  W         E+SL N  +
Sbjct: 266 KKRDDLLLDMIRSCIARKGTVLLPTDTSARVLELAYVLEHAWREAADGPDGENSLKNANL 325

Query: 262 YFLTYVSSSTIDYVKSFLEWMGDSITKSFE-----------------------------T 292
           Y        T+   +S LEWM +SI + FE                             +
Sbjct: 326 YLAGKKVHGTMRLARSMLEWMDESIVREFEGGDGGESLGAGRSSGAASGQQSKGTPGQTS 385

Query: 293 SRDNA--------FLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHDIFVEWA 342
            + +A        F  +H+ ++  K++L+N    +GPK+++AS  SL+ GFS +I    A
Sbjct: 386 DKKSAGPHKGLGPFTFRHLKIIERKTKLENILRSEGPKVIIASDTSLDWGFSKEILRHVA 445

Query: 343 SDVKNLVLFTE 353
              +NLV+ TE
Sbjct: 446 QGAENLVILTE 456


>gi|190406148|gb|EDV09415.1| 105 kDa protein associated with polyadenylation factor 1
           [Saccharomyces cerevisiae RM11-1a]
 gi|207343065|gb|EDZ70642.1| YLR115Wp-like protein [Saccharomyces cerevisiae AWRI1631]
          Length = 859

 Score =  142 bits (359), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 135/494 (27%), Positives = 214/494 (43%), Gaps = 85/494 (17%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPS------LLQPLSKVASTIDAVLLSHPDTLHLGA---LP 72
           +V  D    LID GWN    PS       ++   KV   ID ++LS P    LGA   L 
Sbjct: 19  VVRFDNVTLLIDPGWN----PSKVSYEQCIKYWEKVIPEIDVIILSQPTIECLGAHSLLY 74

Query: 73  YAMKQLGLSA-PVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLD--DIDSAFQSVTRL 129
           Y      +S   V++T PV  LG ++  D Y S   +  +D   LD  DI+ +F  +  L
Sbjct: 75  YNFTSHFISRIQVYATLPVINLGRVSTIDSYASAGVIGPYDTNKLDLEDIEISFDHIVPL 134

Query: 130 TYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN----- 184
            YSQ   L  + +G+ +  + AG   GG++W I+   E ++YA  +N  ++  LN     
Sbjct: 135 KYSQLVDLRSRYDGLTLLAYNAGVCPGGSIWCISTYSEKLVYAKRWNHTRDNILNAASIL 194

Query: 185 ---GTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRV 241
              G  L + +RP+ +IT       +QP +++ ++F+D + K L + G+V++PVD +G+ 
Sbjct: 195 DATGKPLSTLMRPSAIITTLDRFGSSQPFKKRSKIFKDTLKKGLSSDGSVIIPVDMSGKF 254

Query: 242 LELL-----LILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN 296
           L+L      L+ E          P+  L+Y    T+ Y KS LEW+  S+ K++E +R+N
Sbjct: 255 LDLFTQVHELLFESTKINAHTQVPVLILSYARGRTLTYAKSMLEWLSPSLLKTWE-NRNN 313

Query: 297 A--FLLKHVTLLINKSELDNAPDGPKLVLASMA------------------------SLE 330
              F +     +I  +EL   P G K+   S                          S E
Sbjct: 314 TSPFEIGSRIKIIAPNELSKYP-GSKICFVSEVGALINEVIIKVGNSEKTTLILTKPSFE 372

Query: 331 AGFSHDIFVEWA-SDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELI 389
              S D  +E    D +N   F E G+       +  D           +  PL  EE  
Sbjct: 373 CASSLDKILEIVEQDERNWKTFPEDGKSFLCDNYISID---------TIKEEPLSKEETE 423

Query: 390 AYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGR 449
           A++ +    K++   K  LVK E  K +       +G+ ++ D N   A          R
Sbjct: 424 AFKVQLKEKKRDRNKKILLVKRESKKLA-------NGNAIIDDTNGERAM---------R 467

Query: 450 YRDILIDGF--VPP 461
            +DIL++    VPP
Sbjct: 468 NQDILVENVNGVPP 481


>gi|396082284|gb|AFN83894.1| putative beta-lactamase fold-containing exonuclease
           [Encephalitozoon romaleae SJ-2008]
          Length = 643

 Score =  142 bits (359), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 108/387 (27%), Positives = 185/387 (47%), Gaps = 24/387 (6%)

Query: 5   VQVTPLSGVFNENPLS-YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLS 61
           +++ PL G  NE   S  +V   G   ++DCG +  +      P   +   S IDA+ ++
Sbjct: 7   IKIMPL-GAGNEVGRSCVIVECGGRTIMLDCGVHPAYTGVASLPFLDLVDLSKIDAIFIT 65

Query: 62  HPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDS 121
           H    H  ALP+  ++      V+ T P   +    + D        S+ D +T  D+  
Sbjct: 66  HFHLDHAAALPFLTEKTSFKGKVYMTHPTKAILKWLLNDYIRLINAASDADFYTESDLIK 125

Query: 122 AFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEK 181
            +  +  + Y Q  ++    +GI V    AGH+LG  ++ I  +   V+Y  D++R +++
Sbjct: 126 CYDRIIPIDYHQEVNV----KGIKVKALNAGHVLGAAMFLIEIEKSKVLYTGDFSREEDR 181

Query: 182 HLNGTVLES-FVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAG 239
           HL     ES   +   LIT++   +    PR +RE  F   +   ++ GG  LLPV + G
Sbjct: 182 HLKAA--ESPGCKIDGLITESTYGVQCHLPRAEREGRFTSIVQNVVQRGGRCLLPVFALG 239

Query: 240 RVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA 297
           R  ELLLILE++W  ++     PIY+ + ++   +   ++++  M + I K     R N 
Sbjct: 240 RAQELLLILEEHWNSNTSLQKIPIYYASALAKRCMGVYQTYIGMMNERIQK-LSLVR-NP 297

Query: 298 FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQF 357
           F  K+V  L      D+  +GP +++AS   L++G S D+F  W SD KN V+       
Sbjct: 298 FAFKYVKNLKGIDSFDD--EGPCVIMASPGMLQSGLSRDLFERWCSDSKNAVIIPGYCVD 355

Query: 358 GTLARMLQADPPP------KAVKVTMS 378
           GTLA+ + ++P        K +++ MS
Sbjct: 356 GTLAKEILSEPKEIEALNGKKLRLNMS 382


>gi|340383473|ref|XP_003390242.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Amphimedon queenslandica]
          Length = 726

 Score =  142 bits (359), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 103/366 (28%), Positives = 188/366 (51%), Gaps = 28/366 (7%)

Query: 29  NFLIDCGWNDHFDPSLLQPLSKVAST--IDAVLLSHPDTLHLGALPYAMKQLGLSAPVF- 85
             ++DCG +         P + +  +  ID +L++H    H GALP+ +++      VF 
Sbjct: 87  KIMLDCGIHPGLSGMDALPYTDMIESDEIDLLLITHFHLDHCGALPWFLEKTTFKGRVFM 146

Query: 86  --STEPVYRLGLLTMYDQYLSRRQVS-EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGE 142
             +T+ +YR     +   Y+    +S +  L+T  D++ +   +  + + Q   +SG   
Sbjct: 147 TPATKAIYRW----LLSDYIKVSNISSDHMLYTEKDLEKSMDKIEIINFHQEVDVSG--- 199

Query: 143 GIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAY 202
            I    + AGH+LG  ++ I   G  V+Y  D++R +++HL    + +   P +LI+++ 
Sbjct: 200 -IKFTAYNAGHVLGAAMFMIEIAGVKVLYTGDFSRVEDRHLMAAEVPN-SSPDILISEST 257

Query: 203 NALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNY 259
              H    R+QRE  F   I   +  GG+ L+PV + GR  ELLLIL++YW+ H    + 
Sbjct: 258 YGTHIHEKREQREARFTTKIHDIVTRGGHCLIPVFALGRAQELLLILDEYWSCHPELHDI 317

Query: 260 PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPD-G 318
           PIY+ + ++   +   ++++  M + I +    S  N F+ KH++ L N   +DN  D G
Sbjct: 318 PIYYASSLAKKCMAVYQTYIGAMNERIRRQIGIS--NPFVFKHISSLKN---IDNFDDIG 372

Query: 319 PKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMS 378
           P ++LAS   +++G S  +F  W +D +N V+       GTLA+ + ++P      VTM+
Sbjct: 373 PCVILASPGMMQSGLSRQLFESWCTDKRNGVVVAGYCVEGTLAKHILSEPSE---VVTMN 429

Query: 379 -RRVPL 383
            +++PL
Sbjct: 430 GQKLPL 435


>gi|237839761|ref|XP_002369178.1| cleavage and polyadenylation specificity factor, putative
           [Toxoplasma gondii ME49]
 gi|211966842|gb|EEB02038.1| cleavage and polyadenylation specificity factor, putative
           [Toxoplasma gondii ME49]
          Length = 1100

 Score =  142 bits (358), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 111/398 (27%), Positives = 180/398 (45%), Gaps = 44/398 (11%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSH 62
           V++TPL           +    G   + DCG +  +      P+      +++D  L++H
Sbjct: 110 VEITPLGAGCEVGRSCVIARYKGLTVMFDCGVHPAYSGLGALPIFDAVDMTSVDVCLVTH 169

Query: 63  PDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD---------- 112
               H GALPY + +      VF TEP   +  L     +L   ++S F           
Sbjct: 170 FHLDHCGALPYLVTKTAFRGRVFMTEPTRVISKLV----WLDYARMSAFSQGSRDNQGAA 225

Query: 113 -----------------LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLL 155
                            L+  DD+D+  + V  L + Q   +     GI V+   AGH+L
Sbjct: 226 AAQAAAGSQAEKAGGAFLYDEDDVDATVRMVECLDFHQQVEVG----GIKVSCFGAGHVL 281

Query: 156 GGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE 215
           G  ++ I   G  ++Y  D++R  ++H+    +   V   +LI ++   +H    RQ RE
Sbjct: 282 GACMFLIEIGGVRMLYTGDFSRESDRHVPIAEVPP-VDVQLLICESTYGIHVHDDRQLRE 340

Query: 216 -MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTI 272
             F  A+   +  GG  LLPV + GR  ELLLILE+YW  H    + PI FL+ +SS   
Sbjct: 341 RRFLKAVVDIVNRGGKCLLPVFALGRAQELLLILEEYWTAHPEIRHVPILFLSPLSSKCA 400

Query: 273 DYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLL--INKSELDNAPDGPKLVLASMASLE 330
               +F++  G+++ +S     +N F  + V  +  +  + +    DGP +V+A+   L+
Sbjct: 401 VVFDAFVDMCGEAV-RSRALRGENPFAFRFVKNVKSVEAARVYIHHDGPAVVMAAPGMLQ 459

Query: 331 AGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           +G S +IF  WA D KN V+ T     GTLA  L+ +P
Sbjct: 460 SGASREIFEAWAPDAKNGVILTGYSVKGTLADELKREP 497


>gi|378756364|gb|EHY66388.1| cleavage and polyadenylation specificity factor [Nematocida sp. 1
           ERTm2]
          Length = 692

 Score =  142 bits (358), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 101/371 (27%), Positives = 173/371 (46%), Gaps = 14/371 (3%)

Query: 3   TSVQVTPLSGVFNENPLSYLVS-IDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVL 59
           T+ ++ PL G  +E   S +V+   G   + DCG +  +      P   +   + +D +L
Sbjct: 8   TAARILPL-GAGSEVGRSCVVTKFQGVTVMFDCGVHPAYTGISSLPFFDLIDPTEVDVIL 66

Query: 60  LSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDI 119
           ++H    H GALPY  ++ G    V+ T P   +    + D        SE DLFT  ++
Sbjct: 67  VTHFHLDHAGALPYFTERSGFKGKVYMTHPTRAIFRWLLNDYVRVSNVSSENDLFTEKEL 126

Query: 120 DSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRK 179
              +  +  + Y Q   L    + I +  + AGH+LG  ++ +  +   ++Y  DY+R +
Sbjct: 127 SQCYDRIIPIDYGQEITL----KNITIIAYNAGHVLGAAMFLVKNENISLLYTGDYSREE 182

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAG 239
           ++HL   V+       ++    Y    +Q   ++   F   +S  ++ GG  LLPV + G
Sbjct: 183 DRHLKAAVIPPMPIDILISESTYGVQCHQSKEEREHRFITGVSDVVKRGGKCLLPVFALG 242

Query: 240 RVLELLLILEDYW-AEHSLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA 297
           R  ELLLIL+++W A   L   PI + + ++   +   +++L  M D I    E S  N 
Sbjct: 243 RAQELLLILDEFWEARKDLQGIPILYASALAKRFMAVYQTYLNMMNDRIQGMAEIS--NP 300

Query: 298 FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQF 357
           F  KHV  + N    ++   GP +++AS   L+ G S D+F  W  D +N  +       
Sbjct: 301 FHFKHVQNIKNIEAYEDR--GPCVMMASPGMLQNGLSRDLFEMWCGDKRNGCIIPGYCVE 358

Query: 358 GTLARMLQADP 368
           GTLA+ L  +P
Sbjct: 359 GTLAKDLLCEP 369


>gi|66820693|ref|XP_643926.1| beta-lactamase domain-containing protein [Dictyostelium discoideum
           AX4]
 gi|74860395|sp|Q86A79.1|CPSF3_DICDI RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 3; Short=Cleavage and polyadenylation
           specificity factor 3
 gi|60472339|gb|EAL70292.1| beta-lactamase domain-containing protein [Dictyostelium discoideum
           AX4]
          Length = 774

 Score =  142 bits (358), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 100/373 (26%), Positives = 180/373 (48%), Gaps = 19/373 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVAST----IDAVLL 60
           +++TP+           L+   G   + DCG +  +   +  P      +    ID +L+
Sbjct: 36  LEITPIGSGSEVGRSCVLLKYKGKKVMFDCGVHPAYSGLVSLPFFDSIESDIPDIDLLLV 95

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD--LFTLDD 118
           SH    H  A+PY + +      VF T P   +  + + D Y+    ++  D  LF   D
Sbjct: 96  SHFHLDHAAAVPYFVGKTKFKGRVFMTHPTKAIYGMLLSD-YVKVSNITRDDDMLFDKSD 154

Query: 119 IDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRR 178
           +D + + + ++ Y Q      +  GI V    AGH+LG  ++ I   G  ++Y  D++R+
Sbjct: 155 LDRSLEKIEKVRYRQKV----EHNGIKVTCFNAGHVLGAAMFMIEIAGVKILYTGDFSRQ 210

Query: 179 KEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDS 237
           +++HL G      V+  VLI ++   +    PR +RE  F  ++ + +   G  L+PV +
Sbjct: 211 EDRHLMGAETPP-VKVDVLIIESTYGVQVHEPRLEREKRFTSSVHQVVERNGKCLIPVFA 269

Query: 238 AGRVLELLLILEDYW-AEHSLNY-PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            GR  ELLLIL++YW A   L++ PIY+ + ++   +   ++++  M D +   F+ S  
Sbjct: 270 LGRAQELLLILDEYWIANPQLHHVPIYYASALAKKCMGVYRTYINMMNDRVRAQFDVS-- 327

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+  +      D+   GP + +AS   L++G S  +F  W SD +N ++     
Sbjct: 328 NPFEFKHIKNIKGIESFDDR--GPCVFMASPGMLQSGLSRQLFERWCSDKRNGIVIPGYS 385

Query: 356 QFGTLARMLQADP 368
             GTLA+ + ++P
Sbjct: 386 VEGTLAKHIMSEP 398


>gi|19074699|ref|NP_586205.1| similarity to HYPOTHETICAL PROTEIN Y162_METJA [Encephalitozoon
           cuniculi GB-M1]
          Length = 730

 Score =  142 bits (358), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 103/371 (27%), Positives = 179/371 (48%), Gaps = 18/371 (4%)

Query: 5   VQVTPLSGVFNENPLS-YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLS 61
           +++ PL G  NE   S  +V   G   ++DCG +  +      P   +   S IDAV ++
Sbjct: 94  IKIMPL-GAGNEVGRSCVIVECGGRTIMLDCGVHPAYTGMASLPFLDLVDLSKIDAVFIT 152

Query: 62  HPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDS 121
           H    H  ALP+  ++      V+ T P   +    + D        S+ D +T  D+  
Sbjct: 153 HFHLDHAAALPFLTEKTSFRGKVYMTHPTKAILKWLLNDYIRIINASSDTDFYTETDLVK 212

Query: 122 AFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEK 181
            +  +  + Y Q  ++    +GI V    AGH+LG  ++ +  +   ++Y  D++R +++
Sbjct: 213 CYDRIIPIDYHQEVNV----KGIKVKALNAGHVLGAAMFLVEIEKSKILYTGDFSREEDR 268

Query: 182 HLNGTVLES-FVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAG 239
           HL     ES   +   LIT++   +    PR +RE  F   +   ++ GG  LLPV + G
Sbjct: 269 HLKAA--ESPGCKIDALITESTYGVQCHLPRAEREGRFTSIVQNVVQRGGRCLLPVFALG 326

Query: 240 RVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA 297
           R  ELLLILE++W  ++     PIY+ + ++   +   ++++  M + I K   +   N 
Sbjct: 327 RAQELLLILEEHWGSNTSLQKIPIYYASALAKRCMGVYQTYIGMMNERIQKL--SLVRNP 384

Query: 298 FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQF 357
           F  K+V  L      D+  +GP +++AS   L++G S D+F  W SD KN V+       
Sbjct: 385 FAFKYVKNLKGIDSFDD--EGPCVIMASPGMLQSGLSRDLFERWCSDSKNAVIIPGYCVD 442

Query: 358 GTLARMLQADP 368
           GTLA+ + ++P
Sbjct: 443 GTLAKEILSEP 453


>gi|260942735|ref|XP_002615666.1| hypothetical protein CLUG_04548 [Clavispora lusitaniae ATCC 42720]
 gi|238850956|gb|EEQ40420.1| hypothetical protein CLUG_04548 [Clavispora lusitaniae ATCC 42720]
          Length = 797

 Score =  142 bits (358), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 102/339 (30%), Positives = 170/339 (50%), Gaps = 32/339 (9%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLS----- 104
           S +D +L+SH    H  +LPY M+Q      VF   +T+ +YR  LL+ + +  S     
Sbjct: 64  SKVDILLISHFHLDHAASLPYVMQQTSFRGRVFMTHATKAIYRW-LLSDFVRVTSLSGSG 122

Query: 105 ----------RRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHL 154
                         +  +L+T +D+ S+F  +  +    +YH + + EGI    + AGH+
Sbjct: 123 DEGRSMNGSQNSGTTSANLYTDEDLMSSFDKIETI----DYHSTMEIEGIRFTAYHAGHV 178

Query: 155 LGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR 214
           LG  ++ +   G  V++  DY+R +++HL    +    RP +LIT++        PR ++
Sbjct: 179 LGACMYFVEIGGLKVLFTGDYSREEDRHLKVAEVPP-TRPDILITESTFGTATHEPRLEK 237

Query: 215 EM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA--EHSLNYPIYFLTYVSSST 271
           E      I  T+  GG +L+PV + GR  ELLLILE+YW+  E   N  IY+ + ++   
Sbjct: 238 ETRMMKNIHSTILKGGRILMPVFALGRAQELLLILEEYWSLNEDIQNVNIYYASNLARKC 297

Query: 272 IDYVKSFLEWMGDSITKSFETS-RDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASL 329
           +   +++   M + I  S  +S + N F  KH+  +     +D   D GP +V+AS   L
Sbjct: 298 MAVYQTYTSIMNEKIRLSASSSEKTNPFQFKHIKSI---KSIDKIQDMGPCVVVASPGML 354

Query: 330 EAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           ++G S  +   WA D KN V+ T     GT+A+ L A+P
Sbjct: 355 QSGVSRQLLERWAPDPKNAVILTGYSVEGTMAKELLAEP 393


>gi|387594235|gb|EIJ89259.1| integrator complex subunit 11 [Nematocida parisii ERTm3]
 gi|387594982|gb|EIJ92609.1| integrator complex subunit 11 [Nematocida parisii ERTm1]
          Length = 502

 Score =  142 bits (358), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 96/353 (27%), Positives = 172/353 (48%), Gaps = 23/353 (6%)

Query: 22  LVSIDGFNFLIDCG-------WNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
           +V+I     + DCG       +    D  LL P       ID V+++H    H G LPY 
Sbjct: 18  VVTIQNRTIMFDCGMHMGHSDYRRFPDFKLLGP-GPYTGVIDCVIITHFHMDHCGGLPYF 76

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTMYDQ---YLSRRQVSEFD--LFTLDDIDSAFQSVTRL 129
            ++   S P++ T P   +  + + D    Y  R  V +F    +  ++I +  + +  +
Sbjct: 77  TERCKYSGPIYMTPPTKAVLPIILQDYCKVYNERDDVGKFQHPTYNEENIKNCMKKIIPI 136

Query: 130 TYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLE 189
           +  +   +    +   + P+ AGH+LG  ++ +    E V+Y  DYN   ++HL+G  + 
Sbjct: 137 SIEETVEIE---KDFTITPYYAGHVLGAAMYHVKVGDESVVYTGDYNMTPDRHLDGAWMP 193

Query: 190 SFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLIL 248
             V P+VLIT++  AL  +  R+++E  F +++ + ++ GG VL+PV + GR  EL L+L
Sbjct: 194 K-VYPSVLITESTYALLVRDCRREKERDFIESVVQCVKNGGKVLIPVFALGRAHELCLLL 252

Query: 249 EDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLIN 308
           + +W +  L+ PIY    ++    D  K F+++  + I  +    + N F  +HV     
Sbjct: 253 DTHWEKTKLDIPIYTSATLTHKANDIYKQFIDYTHEHIRSTLH--KRNLFDFRHVKQF-- 308

Query: 309 KSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLA 361
            S L +  +GP ++ +S   L +G S  IF +W  D  N+V+F      GT+ 
Sbjct: 309 DSNLASL-EGPMILFSSPGMLHSGPSLSIFKKWCGDPNNMVIFPGYCVRGTIG 360


>gi|392569726|gb|EIW62899.1| mRNA 3'-end-processing protein YSH1 [Trametes versicolor FP-101664
           SS1]
          Length = 805

 Score =  142 bits (358), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 96/324 (29%), Positives = 164/324 (50%), Gaps = 14/324 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLS---APVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+D +L++H    H  AL Y M++         V+ T P   L    M D ++     S
Sbjct: 57  STVDVLLITHFHLDHAAALTYIMEKTNFKNGKGKVYMTHPTKALHKFMMQD-FVRMSSSS 115

Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
              LFT  ++  +  S+T ++  Q   +     G+   P+ AGH+LG  ++ I   G  +
Sbjct: 116 TDTLFTPLEMSMSLASITTVSAHQ---VINPCPGVTFTPYHAGHVLGACMFLIDIAGLKI 172

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAG 228
           +Y  DY+R +++HL    +   V P VLI ++   + +  PR+ +E  F + +   +R G
Sbjct: 173 LYTGDYSREEDRHLVKAEIPP-VHPDVLIVESTYGVQSHEPREDKETRFTNLVHSIIRRG 231

Query: 229 GNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSI 286
           G+VLLP  + GR  ELLLIL++YWA+H    N P+Y+ + ++   +   ++++  M  ++
Sbjct: 232 GHVLLPTFALGRAQELLLILDEYWAKHPDLHNVPVYYASSLARKCMAVYQTYIHTMNANV 291

Query: 287 TKSFETSRDNAFLLKHVTLLINKS--ELDNAPDGPKLVLASMASLEAGFSHDIFVEWASD 344
              F    DN F+ KH+T +      E   A   P +VLAS   ++ G S ++   WA D
Sbjct: 292 RTRF-AKHDNPFVFKHITNVPGTRGWERKIAEGPPCVVLASPGFMQTGPSRELLELWAPD 350

Query: 345 VKNLVLFTERGQFGTLARMLQADP 368
            +N ++ T     GT+AR +  +P
Sbjct: 351 GRNGLIVTGYSIEGTMAREILTEP 374


>gi|323353975|gb|EGA85828.1| Cft2p [Saccharomyces cerevisiae VL3]
          Length = 859

 Score =  142 bits (358), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 131/485 (27%), Positives = 219/485 (45%), Gaps = 67/485 (13%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPS------LLQPLSKVASTIDAVLLSHPDTLHLGA---LP 72
           +V  D    LID GWN    PS       ++   KV   ID ++LS P    LGA   L 
Sbjct: 19  VVRFDNVTLLIDPGWN----PSKVSYEQCIKYWEKVIPEIDVIILSQPTIECLGAHSLLY 74

Query: 73  YAMKQLGLSA-PVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLD--DIDSAFQSVTRL 129
           Y      +S   V++T PV  LG ++  D Y S   +  +D   LD  DI+ +F  +  L
Sbjct: 75  YNFTSHFISRIQVYATLPVINLGRVSTIDSYASAGVIGPYDTNKLDLEDIEISFDHIVPL 134

Query: 130 TYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN----- 184
            YSQ   L  + +G+ +  + AG   GG++W I+   E ++YA  +N  ++  LN     
Sbjct: 135 KYSQLVDLRSRYDGLTLLAYNAGVCPGGSIWCISTYSEKLVYAKRWNHTRDNILNAASIL 194

Query: 185 ---GTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRV 241
              G  L + +RP+ +IT       +QP +++ ++F+D + K L + G+V++PVD +G+ 
Sbjct: 195 DATGKPLSTLMRPSAIITTLDRFGSSQPFKKRSKIFKDTLKKGLSSDGSVIIPVDMSGKF 254

Query: 242 LELL-----LILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN 296
           L+L      L+ E          P+  L+Y    T+ Y KS LEW+  S+ K++E +R+N
Sbjct: 255 LDLFTQVHELLFESTKINAHTQVPVLILSYARGRTLTYAKSMLEWLSPSLLKTWE-NRNN 313

Query: 297 A--FLLKHVTLLINKSELDNAPDGPKLVLASMAS-------LEAGFSHDIFV-------E 340
              F +     +I  +EL   P G K+   S          ++ G S    +       E
Sbjct: 314 TSPFEIGSRIKIIAPNELSKYP-GSKICFVSEVGALINEVIIKVGNSEKTTLILTKPSFE 372

Query: 341 WASDVKNLVLFTERGQ--FGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRL 398
            AS +  ++   E+ +  + T     ++      + +   +  PL  EE  A++ +    
Sbjct: 373 CASSLDKILXIVEQDERXWKTFPEDGKSFLCDNYISIDTIKEEPLSKEETEAFKVQLKEK 432

Query: 399 KKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGF 458
           K++   K  LVK E  K +       +G+ ++ D N   A          R +DIL++  
Sbjct: 433 KRDRNKKILLVKRESKKLA-------NGNAIIDDTNGERAM---------RNQDILVENV 476

Query: 459 --VPP 461
             VPP
Sbjct: 477 NGVPP 481


>gi|339237605|ref|XP_003380357.1| cleavage and polyadenylation specificity factor subunit 3
           [Trichinella spiralis]
 gi|316976818|gb|EFV60027.1| cleavage and polyadenylation specificity factor subunit 3
           [Trichinella spiralis]
          Length = 687

 Score =  142 bits (357), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 98/354 (27%), Positives = 175/354 (49%), Gaps = 18/354 (5%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQLG 79
           L+   G + L+DCG +   +     P         +D +L++H    H G LP+ +++  
Sbjct: 37  LIQFKGKSILLDCGIHPGLNGVDALPFVDTIDCEKVDLLLVTHFHLDHCGGLPWFLEKTT 96

Query: 80  LSAPVFSTEPVYRLGLLTMYDQYLSRRQVS-EFDLFTLDDIDSAFQSVTRLTYSQNYHLS 138
                F T     +  + + D Y+    +  +  L++ D+++ +   +  +    ++H  
Sbjct: 97  FRGRCFMTHATKAIYPIILSD-YVKVSNIGLDQMLYSEDELEKSMDKIELI----DFHEQ 151

Query: 139 GKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLI 198
            +  GI    +VAGH+LG  ++ I   G  ++Y  DY+R +++HL    + S +RP VLI
Sbjct: 152 KEVNGIKFWCYVAGHVLGACMFMIEIAGVRILYTGDYSRLEDRHLCAAEVPS-IRPDVLI 210

Query: 199 TDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS- 256
            ++         R+ RE  F   +   +  GG  L+PV + GR  ELLLIL+++W +H+ 
Sbjct: 211 AESTYGTQIHENREDREHRFTSMVYTIVSRGGRCLIPVFALGRAQELLLILDEFWTKHAE 270

Query: 257 -LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNA 315
             N PI+F + ++   +   ++F+  M  +I K  + +  N FL KHV  L     +D  
Sbjct: 271 LQNIPIFFASSLAKKCMAVYQTFISGMNQNIQK--QIAVQNPFLFKHVRSL---RSIDFF 325

Query: 316 PD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
            D GP +VLAS   L++G S ++F  W +D KN  +       GTLA+ + ++P
Sbjct: 326 EDIGPCVVLASPGMLQSGLSRELFEMWCTDTKNGCIIAGYCVEGTLAKHILSEP 379


>gi|442570104|sp|Q4IPN9.2|YSH1_GIBZE RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
           3'-end-processing protein YSH1
          Length = 833

 Score =  142 bits (357), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 103/379 (27%), Positives = 181/379 (47%), Gaps = 28/379 (7%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           +++   G   ++D G +  +D     P       ST+D +L+SH    H  +LPY + + 
Sbjct: 41  HIIQYKGKTVMLDAGQHPAYDGLAALPFYDDFDLSTVDVLLISHFHIDHAASLPYVLAKT 100

Query: 79  GLSAPVFSTEPVYRLGLLTMYDQYL---SRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNY 135
                VF T P   +    + D      +    +   ++T  D  + F  +  + Y   +
Sbjct: 101 NFRGRVFMTHPTKAIYKWLIQDSVRVGNTSSNPTTQPVYTEQDHLNTFPQIEAIDYHTTH 160

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
            +S     I + P+ AGH+LG  ++ I   G ++ +  DY+R +++HL    +   V+  
Sbjct: 161 TISS----IRITPYPAGHVLGAAMFLIEIAGLNIFFTGDYSREQDRHLVSAEVPKGVKID 216

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT++   + +  PR +RE     +I+  L  GG VL+PV + GR  ELLLIL++YW +
Sbjct: 217 VLITESTYGIASHVPRLEREQALMKSITSILNRGGRVLMPVFALGRAQELLLILDEYWGK 276

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF-------ETSRDNA-----FLL 300
           H+    YPIY+ + ++   +   ++++  M D+I + F       E S D A     +  
Sbjct: 277 HADFQKYPIYYASNLARKCMLIYQTYVGAMNDNIKRLFRERMAEAEASGDGAGKGGPWDF 336

Query: 301 KHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTL 360
           K++  L N    D+   G  ++LAS   L+ G S ++   WA   KN V+ T     GT+
Sbjct: 337 KYIRSLKNLDRFDDV--GGCVMLASPGMLQNGVSRELLERWAPSEKNGVIITGYSVEGTM 394

Query: 361 ARMLQADPPPKAVKVTMSR 379
           A+ +  +  P  ++  MSR
Sbjct: 395 AKQIMQE--PDQIQAVMSR 411


>gi|302415331|ref|XP_003005497.1| cleavage and polyadenylation specificity factor subunit 2
           [Verticillium albo-atrum VaMs.102]
 gi|261354913|gb|EEY17341.1| cleavage and polyadenylation specificity factor subunit 2
           [Verticillium albo-atrum VaMs.102]
          Length = 739

 Score =  142 bits (357), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 133/491 (27%), Positives = 204/491 (41%), Gaps = 137/491 (27%)

Query: 9   PLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTL 66
           PL G  +E+  S  ++ +DG    LID GW++ FD   L+ L K+               
Sbjct: 6   PLQGACSESAASQSILELDGGVKVLIDLGWDESFDVEKLKALEKI--------------- 50

Query: 67  HLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLS--------------------RR 106
                           PV++T PV  LG     D Y S                     +
Sbjct: 51  ----------------PVYATRPVIDLGRTLTQDLYSSTPRAATTIPHDSLSEVAYSYSQ 94

Query: 107 QVSEFDLFTL-----DDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGHLLG 156
           Q +    F L     ++I   F  +  L YSQ +            G+ +    AGH LG
Sbjct: 95  QPTTGSNFLLQAPTPEEITRYFSLIQPLKYSQPHEPLPSPFSPPLNGLTITAFNAGHTLG 154

Query: 157 GTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAYNA 204
           GT+W I    E ++YAVD+N+ +E    G             V+E   +P  LI  +  A
Sbjct: 155 GTIWHIQHGLESIVYAVDWNQARENVFAGAAWLGGAGAGGAEVIEQLRKPTALICSSRGA 214

Query: 205 LHNQPP---RQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW------AEH 255
             N P    R++ E   D I   +  GG VL+P DS+GRVLEL  +LE  W       + 
Sbjct: 215 DRNAPSGGRRKRDEQLIDMIKLCVSRGGTVLIPADSSGRVLELAYLLEHAWRLEAGKTDS 274

Query: 256 SLNYP-IYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA----------------F 298
           +L    +Y      SST+ Y +S LEWM D+I + FE + D                  F
Sbjct: 275 ALRAAKLYLAGRNVSSTLRYARSMLEWMDDNIVREFEATADGQRKANGNDGKHAKDAAPF 334

Query: 299 LLKHVTLLINKSEL--------DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
             + + L+  ++++        +N     ++++AS  SLE GFSH++  E A D +NL++
Sbjct: 335 DFRFMRLVEREAQIRKLLSQTSENVRSDGRVIVASDNSLEWGFSHELLRELAKDSRNLLI 394

Query: 351 FTER---GQFG--TLARML--------------QADPPP---------KAVKVTMSRRVP 382
            T++    Q G  ++AR+L              Q+D            +A+ VT +RR  
Sbjct: 395 LTDKPSLAQSGQPSIARILWDWWQERRDGVSIDQSDSNDSIELVYGGGRALTVTDARRQG 454

Query: 383 LVGEELIAYEE 393
           L G+EL  Y++
Sbjct: 455 LEGDELSTYQQ 465


>gi|67525249|ref|XP_660686.1| hypothetical protein AN3082.2 [Aspergillus nidulans FGSC A4]
 gi|40744477|gb|EAA63653.1| hypothetical protein AN3082.2 [Aspergillus nidulans FGSC A4]
 gi|259485970|tpe|CBF83440.1| TPA: cleavage and polyadenylylation specificity factor, putative
           (AFU_orthologue; AFUA_3G09720) [Aspergillus nidulans
           FGSC A4]
          Length = 1005

 Score =  142 bits (357), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 120/423 (28%), Positives = 178/423 (42%), Gaps = 97/423 (22%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G   L+D GW+D FDP  L  L K  ST+  +LL+H    H+GA  +  K   L    PV
Sbjct: 27  GVKILVDVGWDDTFDPLDLVELEKHVSTLSLILLTHATPSHIGAYVHCCKTFPLFTQIPV 86

Query: 85  FSTEPVYRLGLLTMYDQY---------LSRRQVSE------------------------- 110
           ++T PV  LG   + D Y         L +  +SE                         
Sbjct: 87  YATSPVIALGRTLLQDVYESAPLAATFLPKASISEPGASTSAASAASVTEADGSADATSA 146

Query: 111 ----FDLFTLDDIDSAFQSVTRLTYSQNYH-----LSGKGEGIVVAPHVAGHLLGGTVWK 161
                   T ++I   F  +  L YSQ +       S    G+ +  + AGH +GGT+W 
Sbjct: 147 GRILLQPPTTEEIARYFALIQPLKYSQPHQPIPSPFSPPLNGLTLTAYNAGHTVGGTIWH 206

Query: 162 ITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAYNALHNQP 209
           I    E ++YAVD+N+ +E  + G             V+E   +P  LI           
Sbjct: 207 IQHGMESIVYAVDWNQARESVVAGAAWFGGSGASGTEVIEQLRKPTALICSTRGGDKFAL 266

Query: 210 P--RQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYP------ 260
           P  R++R E+  D I  TL  GG VL+P D++ RVLEL   LE  W + + +        
Sbjct: 267 PGGRKKRDEILLDMIRSTLVKGGTVLIPTDTSARVLELAYALEHAWRDAARDTQDDVLKR 326

Query: 261 --IYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSR---------------------DNA 297
             +Y      ++T+   +S LEWM +SI + FE +                      DN 
Sbjct: 327 GGLYLAGRKVNTTMRLARSMLEWMDESIVREFEAAEAADTAGQNNDGQRSDQRQGKTDNK 386

Query: 298 ----FLLKHVTLLINKSELD---NAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
               F  KH+  +  K +L+   N P  PK++LAS +SL+ GF+ +     A    NL+L
Sbjct: 387 GLGPFTFKHLKTVERKKKLEQLLNDPT-PKVILASDSSLDWGFAKESLRLLAGGENNLLL 445

Query: 351 FTE 353
            T+
Sbjct: 446 LTD 448


>gi|392593709|gb|EIW83034.1| Metallo-hydrolase oxidoreductase [Coniophora puteana RWD-64-598
           SS2]
          Length = 770

 Score =  142 bits (357), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 101/324 (31%), Positives = 161/324 (49%), Gaps = 14/324 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGL---SAPVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+DA+L++H    H  AL Y M++         V+ T P   L    M D Y+     S
Sbjct: 57  STVDALLVTHFHLDHAAALTYIMEKTNFRDGKGKVYMTHPTKALHKFMMQD-YVRMSSSS 115

Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
              LFT  D+  +  S+  ++  Q   L     G+   P+ AGH+LG  ++ I   G  +
Sbjct: 116 SDALFTPLDMSMSLSSIIAISAHQ---LITPCPGVTFTPYHAGHVLGACMFLIDIAGLKI 172

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAG 228
           +Y  DY+R +++HL    +   VRP VLI ++   + +   R+ +E  F   +   +R G
Sbjct: 173 LYTGDYSREEDRHLVKAEVPP-VRPDVLIVESTYGVQSLECREDKEARFTGLVHSIIRRG 231

Query: 229 GNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSI 286
           G+VLLP  + GR  ELLLIL++YW  H    N PIY+ + ++   +   ++++  M  +I
Sbjct: 232 GHVLLPAFALGRAQELLLILDEYWKRHPDLHNVPIYYASNLARKCMAVYQTYIHTMNSNI 291

Query: 287 TKSFETSRDNAFLLKHVTLLINKS--ELDNAPDGPKLVLASMASLEAGFSHDIFVEWASD 344
              F   RDN F+ KH++ L      E   A   P +VLAS    ++G S ++   WA D
Sbjct: 292 RTRF-AKRDNPFVFKHISNLPQPKGWERKIAEGPPCVVLASPGFCQSGPSRELLELWAPD 350

Query: 345 VKNLVLFTERGQFGTLARMLQADP 368
            +N  + T     GT+AR +  +P
Sbjct: 351 ARNGFILTGYSVEGTMARDILNEP 374


>gi|429963183|gb|ELA42727.1| hypothetical protein VICG_00042 [Vittaforma corneae ATCC 50505]
          Length = 642

 Score =  142 bits (357), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 98/348 (28%), Positives = 172/348 (49%), Gaps = 24/348 (6%)

Query: 31  LIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTE 88
           L+DCG +  +      P   +   S IDA+L++H    H  ALP+  ++      V+ T 
Sbjct: 33  LLDCGVHPAYTGVSSLPFLDLVDLSKIDAILVTHFHLDHAAALPFLTEKTEFKGKVYMTH 92

Query: 89  PVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAP 148
           P   +    + D        SE D +T  D+ S +  +  + Y Q  ++    EGI V  
Sbjct: 93  PTKAILKWLLNDYIRVINSSSEQDFYTEQDLQSCYDKIIPIDYHQQINI----EGIKVTA 148

Query: 149 HVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN-----GTVLESFVRPAVLITDAYN 203
             AGH+LG  ++ +  +   ++Y  D++R +++HL      G  L++      LIT++  
Sbjct: 149 LNAGHVLGAAMFLLEIEKSKILYTGDFSREEDRHLKAAESPGCCLDA------LITESTY 202

Query: 204 ALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE--HSLNYP 260
            +    PR +RE  F   +S  +  GG  LLPV + GR  ELLLILE++W E  H    P
Sbjct: 203 GVQCHLPRYEREARFTSIVSHVVLRGGRCLLPVFALGRAQELLLILEEHWDENPHLKGIP 262

Query: 261 IYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPK 320
           IY+ + ++   +   ++++  M + I K+  +   N F  ++V  + +     +   GP 
Sbjct: 263 IYYASALAQKCMSVYQTYINMMNERIQKA--SLVKNPFDFRNVESIKDIQSFKDT--GPC 318

Query: 321 LVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           +++AS   L++GFS ++F +W S+ KN V+       GTLA+ + ++P
Sbjct: 319 VMMASPGMLQSGFSRELFEKWCSNEKNGVVIPGYCVEGTLAKEILSEP 366


>gi|154336691|ref|XP_001564581.1| putative cleavage and polyadenylation specificity factor
           [Leishmania braziliensis MHOM/BR/75/M2904]
 gi|134061616|emb|CAM38647.1| putative cleavage and polyadenylation specificity factor
           [Leishmania braziliensis MHOM/BR/75/M2904]
          Length = 756

 Score =  142 bits (357), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 106/371 (28%), Positives = 175/371 (47%), Gaps = 19/371 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPL----SKVASTIDAVLL 60
           V+V P+           +V   G   ++DCG  +H   S L  L    S     ID VL+
Sbjct: 26  VEVLPIGSGGEVGRSCVVVHYKGRGVMLDCG--NHPAKSGLDSLPFFDSIKCDEIDVVLI 83

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           +H    H GALPY   Q      VF T        + M D    R      DL T + + 
Sbjct: 84  THFHLDHCGALPYFCNQTSFKGRVFMTSATKAFYKMVMND--FLRIGAGASDLVTSEWLQ 141

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           S    +  + Y +   ++G    I   P  AGH+LG  ++ +   G   +Y  D++R  +
Sbjct: 142 STIDRIETIEYHEEVTVNG----ISFQPFNAGHVLGAAMFMVDIAGMRALYTGDFSRVPD 197

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAG 239
           +HL G  +  +  P +LI ++ N +     R++R  +F  ++   +R GG  L+PV + G
Sbjct: 198 RHLLGAEVPPY-SPDILIAESTNGIRELESREEREHLFTSSVHDVVRRGGRCLVPVFALG 256

Query: 240 RVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA 297
           R  ELLLILE++W  H    N PIY+ + ++   +   ++F+  M D + K    +  N 
Sbjct: 257 RAQELLLILEEFWDAHKELQNIPIYYASSLAQRCMKLYQTFVSAMNDRV-KQQHANHHNP 315

Query: 298 FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQF 357
           F+ K++  LI+    ++  +GP +VLAS   L++G S ++F  W  D +N ++       
Sbjct: 316 FVFKYIHSLIDTKSFED--NGPCVVLASPGMLQSGISLELFERWCGDRRNGIIMAGYCVD 373

Query: 358 GTLARMLQADP 368
           GT+A+ + A P
Sbjct: 374 GTIAKDVLAKP 384


>gi|299752177|ref|XP_001830756.2| mRNA 3'-end-processing protein YSH1 [Coprinopsis cinerea
           okayama7#130]
 gi|298409712|gb|EAU91125.2| mRNA 3'-end-processing protein YSH1 [Coprinopsis cinerea
           okayama7#130]
          Length = 846

 Score =  142 bits (357), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 100/325 (30%), Positives = 166/325 (51%), Gaps = 16/325 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGL---SAPVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+DA+L++H    H  AL Y  ++         V+ T P   +    M D   +R   S
Sbjct: 57  STVDAILVTHFHLDHAAALTYITEKTNFRDGKGKVYMTHPTKAVHKFMMQD--FARMSSS 114

Query: 110 EFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
             D LF+  D+  +  S+  ++  Q  ++     G+   P+ AGH+LG  ++ I   G  
Sbjct: 115 TSDALFSPLDMQMSLASIIPVSAHQLINVC---PGVSFTPYHAGHVLGACMFLIDIAGLK 171

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRA 227
           ++Y  DY+R +++HL    L   +RP VLI ++   +H    R+++E  F   +   +R 
Sbjct: 172 ILYTGDYSREEDRHLVKAELPP-IRPDVLIVESTYGVHTLEGREEKEARFTTLVHSIIRR 230

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG+VLLP  + GR  ELLLIL++YW +H    N PIY+ + ++   +   ++++  M  +
Sbjct: 231 GGHVLLPAFALGRAQELLLILDEYWKKHPDLHNVPIYYASSLARKCMAVYQTYIHTMNAN 290

Query: 286 ITKSFETSRDNAFLLKHVTLLINKS--ELDNAPDGPKLVLASMASLEAGFSHDIFVEWAS 343
           I   F   RDN F+ K+++ L      E   A   P +VLAS   ++ G S ++F  WA 
Sbjct: 291 IRTRF-AKRDNPFVFKYISNLPQTRGWEKKIAEGPPCVVLASPGFMQVGPSRELFELWAP 349

Query: 344 DVKNLVLFTERGQFGTLARMLQADP 368
           D +N ++ T     GTLAR +  +P
Sbjct: 350 DARNGLIITGYSIEGTLARDIMTEP 374


>gi|66816359|ref|XP_642189.1| integrator complex subunit 11 [Dictyostelium discoideum AX4]
 gi|74856745|sp|Q54YL3.1|INT11_DICDI RecName: Full=Integrator complex subunit 11 homolog
 gi|60470287|gb|EAL68267.1| integrator complex subunit 11 [Dictyostelium discoideum AX4]
          Length = 744

 Score =  142 bits (357), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 104/371 (28%), Positives = 177/371 (47%), Gaps = 19/371 (5%)

Query: 4   SVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGW----ND--HF-DPSLLQPLSKVASTID 56
           +++V PL    +      +V+I   N + DCG     ND   F D S +    +    ID
Sbjct: 2   TIKVVPLGAGQDVGRSCVIVTIGNKNIMFDCGMHMGMNDARRFPDFSYISKNGQFTKVID 61

Query: 57  AVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFT 115
            V+++H    H GALP+  +  G   P++ T P   +  + + D + ++  +  E + FT
Sbjct: 62  CVIITHFHLDHCGALPFFTEMCGYDGPIYMTLPTKAICPILLEDYRKITVEKKGETNFFT 121

Query: 116 LDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDY 175
              I    + V  +   Q   +    E + +  + AGH+LG  ++      E V+Y  DY
Sbjct: 122 AQMIKDCMKKVIPVNLHQTIKVD---EELSIKAYYAGHVLGAAMFYAKVGDESVVYTGDY 178

Query: 176 NRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLP 234
           N   ++HL    ++  V+P VLIT+   A   +  ++ RE  F   I + +  GG VL+P
Sbjct: 179 NMTPDRHLGSAWIDQ-VKPDVLITETTYATTIRDSKRGRERDFLKRIHECVEKGGKVLIP 237

Query: 235 VDSAGRVLELLLILEDYWAEHSLNY-PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
           V + GRV EL ++++ YW + +L + PIYF   ++     Y K F+ W    I ++F   
Sbjct: 238 VFALGRVQELCILIDSYWEQMNLGHIPIYFSAGLAEKANLYYKLFINWTNQKIKQTF--V 295

Query: 294 RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
           + N F  KH+     +S L +AP G  ++ A+   L AG S ++F +WA +  N+ +   
Sbjct: 296 KRNMFDFKHIKPF--QSHLVDAP-GAMVLFATPGMLHAGASLEVFKKWAPNELNMTIIPG 352

Query: 354 RGQFGTLARML 364
               GT+   L
Sbjct: 353 YCVVGTVGNKL 363


>gi|408390480|gb|EKJ69876.1| hypothetical protein FPSE_09963 [Fusarium pseudograminearum CS3096]
          Length = 833

 Score =  141 bits (356), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 103/379 (27%), Positives = 181/379 (47%), Gaps = 28/379 (7%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           +++   G   ++D G +  +D     P       ST+D +L+SH    H  +LPY + + 
Sbjct: 41  HIIQYKGKTVMLDAGQHPAYDGLAALPFYDDFDLSTVDVLLISHFHIDHAASLPYVLAKT 100

Query: 79  GLSAPVFSTEPVYRLGLLTMYDQYL---SRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNY 135
                VF T P   +    + D      +    +   ++T  D  + F  +  + Y   +
Sbjct: 101 NFRGRVFMTHPTKAIYKWLIQDSVRVGNTSSNPTTQPVYTEQDHLNTFPQIEAIDYHTTH 160

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
            +S     I + P+ AGH+LG  ++ I   G ++ +  DY+R +++HL    +   V+  
Sbjct: 161 TISS----IRITPYPAGHVLGAAMFLIEIAGLNIFFTGDYSREQDRHLVSAEVPKGVKID 216

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT++   + +  PR +RE     +I+  L  GG VL+PV + GR  ELLLIL++YW +
Sbjct: 217 VLITESTYGIASHVPRLEREQALMKSITSILNRGGRVLMPVFALGRAQELLLILDEYWGK 276

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF-------ETSRDNA-----FLL 300
           H+    YPIY+ + ++   +   ++++  M D+I + F       E S D A     +  
Sbjct: 277 HADFQKYPIYYASNLARKCMLIYQTYVGAMNDNIKRLFRERMAEAEASGDGAGKGGPWDF 336

Query: 301 KHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTL 360
           K++  L N    D+   G  ++LAS   L+ G S ++   WA   KN V+ T     GT+
Sbjct: 337 KYIRSLKNLDRFDDV--GGCVMLASPGMLQNGVSRELLERWAPSEKNGVIITGYSVEGTM 394

Query: 361 ARMLQADPPPKAVKVTMSR 379
           A+ +  +  P  ++  MSR
Sbjct: 395 AKQIMQE--PDQIQAVMSR 411


>gi|346466613|gb|AEO33151.1| hypothetical protein [Amblyomma maculatum]
          Length = 618

 Score =  141 bits (356), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 92/337 (27%), Positives = 179/337 (53%), Gaps = 24/337 (7%)

Query: 55  IDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQV-SE 110
           ID +L+SH    H GALP+ + +       F   +T+ +YR     +   Y+    + +E
Sbjct: 1   IDLLLVSHFHWYHCGALPWFLLKTTFKGRCFMTHATKAIYRW----LLADYIKVSNIGTE 56

Query: 111 FDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVI 170
             L+T  D++++ + +  +    N+H   +  GI    + AGH+LG  ++ I   G  V+
Sbjct: 57  QMLYTEADLEASMEKIETI----NFHEEKEVNGIRFWCYNAGHVLGAAMFMIEIAGVKVL 112

Query: 171 YAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGG 229
           Y  D++R++++HL    + + + P VLI ++    H    R++RE  F   +   +  GG
Sbjct: 113 YTGDFSRQEDRHLMAAEIPN-IHPDVLIIESTYGTHIHEKREEREARFTGLVHDIVNRGG 171

Query: 230 NVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSIT 287
             L+PV + GR  ELLLIL++YW+ H    + PIY+ + ++   +   ++++  M + I 
Sbjct: 172 RCLIPVFALGRAQELLLILDEYWSNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNERIR 231

Query: 288 KSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVK 346
           +  + + +N F+ KH++   N   +++  D GP +V+AS   +++G S ++F  W +D K
Sbjct: 232 R--QITINNPFVFKHIS---NLKSIEHFEDIGPCVVMASPGMMQSGLSRELFESWCTDPK 286

Query: 347 NLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
           N V+       GTLA+ + ++  P+ +   + +++PL
Sbjct: 287 NGVIIAGYCVEGTLAKTILSE--PEEISTMVGQKLPL 321


>gi|302679538|ref|XP_003029451.1| hypothetical protein SCHCODRAFT_59058 [Schizophyllum commune H4-8]
 gi|300103141|gb|EFI94548.1| hypothetical protein SCHCODRAFT_59058 [Schizophyllum commune H4-8]
          Length = 786

 Score =  141 bits (356), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 100/335 (29%), Positives = 171/335 (51%), Gaps = 23/335 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGL---SAPVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+DA+L++H    H  +L Y  ++         ++ T P   L    M D ++     S
Sbjct: 57  STVDAILITHFHLDHAASLTYITEKTNFRDGKGKIYMTHPTKALHKFMMQD-FVRTGSSS 115

Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
              LF+  DI  +  S+  ++  Q   L     G+   P+ AGH+LG  ++ I   G  +
Sbjct: 116 SDALFSPLDISMSLASIIPVSAHQ---LITPCPGVSFTPYHAGHVLGACMFLIDMAGLRI 172

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAG 228
           +Y  DY+R +++HL    L   +RP VLI ++   + +  PR ++E+ F + +   +R G
Sbjct: 173 LYTGDYSREEDRHLVKAELPP-IRPDVLIVESTYGVQSHEPRDEKELRFTNLVHSIIRRG 231

Query: 229 GNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSI 286
           G+VLLP  + GR  ELLLIL++YW +H    N PIY+ + ++  ++   ++++  M  +I
Sbjct: 232 GHVLLPQFALGRAQELLLILDEYWKKHPDLHNVPIYYASGLARKSMAVYQTYIHTMNSNI 291

Query: 287 TKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVK 346
              F   RDN F+         K ++   P  P +VLA+   ++ G S ++F  WA D +
Sbjct: 292 RSRF-AKRDNPFVF--------KCKIAEGP--PCVVLATPGFMQTGSSRELFELWAPDSR 340

Query: 347 NLVLFTERGQFGTLARMLQADPPP-KAVKVTMSRR 380
           N ++ T     GTLAR +  +P   ++VK  M +R
Sbjct: 341 NGLIVTGYSVEGTLARDIMTEPEEFQSVKGHMIQR 375


>gi|409080187|gb|EKM80547.1| hypothetical protein AGABI1DRAFT_70926 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 841

 Score =  141 bits (356), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 98/333 (29%), Positives = 164/333 (49%), Gaps = 22/333 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLS---APVFSTEPVYRLGLLTMYDQYLSRR--- 106
           S++DA+L++H    H  AL Y  ++         V+ T P   L    M D   +RR   
Sbjct: 57  SSVDAILITHFHLDHAAALTYITEKTNFKDGKGKVYMTHPTKALHKFMMQDFVRTRRANF 116

Query: 107 ------QVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVW 160
                   S   LF+  D+  +  S+  ++  Q   L     G+   P+ AGH+LG  ++
Sbjct: 117 VKCPHSSASSDALFSPLDMQMSLASIIAVSAHQ---LITVCPGVSFIPYHAGHVLGACMF 173

Query: 161 KITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQD 219
            I   G  ++Y  DY+R +++HL    L   +RP VL+ ++   +H    R+++E  F  
Sbjct: 174 LIDIAGLKILYTGDYSREEDRHLIKAELPP-IRPDVLVVESTYGVHTGESREEKEHRFTS 232

Query: 220 AISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKS 277
            +   +R GG+VLLP  + GR  ELLLIL+DYW +H    N P+Y+ + ++   +   ++
Sbjct: 233 LVHSIIRRGGHVLLPTFALGRAQELLLILDDYWKKHPDLHNVPVYYASGLARKCMAVYQT 292

Query: 278 FLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNA-PDGPK-LVLASMASLEAGFSH 335
           ++  M  +I   F   RDN F+ KH++ +      +    DGP  +VLAS   ++ G S 
Sbjct: 293 YIHTMNANIRSRF-ARRDNPFVFKHISNVPQTRGWEKKIADGPPCVVLASPGFMQVGPSR 351

Query: 336 DIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           ++F  W  D +N ++ T     GT AR +  +P
Sbjct: 352 ELFEHWCPDARNGLIITGYSIEGTPARDIMTEP 384


>gi|146421308|ref|XP_001486604.1| hypothetical protein PGUG_02275 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 770

 Score =  141 bits (356), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 104/359 (28%), Positives = 180/359 (50%), Gaps = 39/359 (10%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLS----- 104
           S +D +L+SH    H  +LPY M+    +  VF   +T+ +YR  LL+ + +  S     
Sbjct: 58  SKVDILLISHFHLDHAASLPYVMQHTNFNGRVFMTHATKAIYRW-LLSDFVRVTSIGGGG 116

Query: 105 -------RRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGG 157
                      +  +L+T DD+  +F  +  +    +YH + + EGI    + AGH+LG 
Sbjct: 117 DSRLNSGNETATSSNLYTDDDLIRSFDRIETI----DYHSTIEVEGIRFTAYHAGHVLGA 172

Query: 158 TVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM- 216
            ++ +   G  V++  DY+R +++HL    +   +RP +LIT++        PR ++E  
Sbjct: 173 CMYFVEIGGLKVLFTGDYSREEDRHLQVAEVPP-MRPDILITESTFGTATHEPRLEKEAR 231

Query: 217 FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE----HSLNYPIYFLTYVSSSTI 272
               I  TL  GG +L+PV + GR  ELLLILE+YW +    H++N  ++F + ++   +
Sbjct: 232 MTKIIHLTLLKGGRILMPVFALGRAQELLLILEEYWLQNEDLHNIN--VFFASSLARKCM 289

Query: 273 DYVKSFLEWMGDSITKSFETS---RDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMAS 328
              +++   M D+I     ++   + N F  KH+ L+     LD   D GP +V+A+   
Sbjct: 290 AVYQTYTNIMNDNIRHGVSSASGGKLNPFQFKHIKLI---RSLDKFQDIGPCVVVAAPGM 346

Query: 329 LEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPP----KAVKVTMSRRVPL 383
           L+ G S ++   WA D KN V+ T     GT+A+ L  +P      +   VT+ RR+ +
Sbjct: 347 LQNGVSRELLERWAPDAKNAVIMTGYSVEGTMAKELLTEPHTIQSLQNADVTIPRRMAI 405


>gi|401428833|ref|XP_003878899.1| cleavage and polyadenylation specificity factor,putative
           [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|322495148|emb|CBZ30452.1| cleavage and polyadenylation specificity factor,putative
           [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 756

 Score =  141 bits (356), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 108/372 (29%), Positives = 178/372 (47%), Gaps = 21/372 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPL----SKVASTIDAVLL 60
           V+V P+           +V   G   ++DCG  +H   S L  L    S     ID VL+
Sbjct: 26  VEVLPIGSGGEVGRSCVVVRYKGRGVMLDCG--NHPAKSGLDSLPFFDSIKCDEIDVVLI 83

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           +H    H GALPY   Q      +F T        + M D    R      DL T + + 
Sbjct: 84  THFHLDHCGALPYFCNQTSFKGRIFMTSATKAFYKMVMND--FLRIGAGASDLVTSEWLQ 141

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           S    +  + Y +   ++G    I   P  AGH+LG  ++ +   G   +Y  D++R  +
Sbjct: 142 STIDRIETVEYHEEVTVNG----ISFQPFNAGHVLGAAMFMVDIAGMRALYTGDFSRVPD 197

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAG 239
           +HL G  +  +  P +LI ++ N +     R++R ++F  ++ + +R GG  L+PV + G
Sbjct: 198 RHLLGAEVPPY-SPDILIAESTNGIRELESREEREQLFTGSVHEVVRRGGRCLVPVFALG 256

Query: 240 RVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA 297
           R  ELLLILE++W  H    N PIY+ + ++   +   ++F+  M D + K    +  N 
Sbjct: 257 RAQELLLILEEFWDAHKELQNIPIYYASSLAQRCMKLYQTFVSAMNDRV-KQQHANHHNP 315

Query: 298 FLLKHV-TLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQ 356
           F+ K++ +L+  KS  DN   GP +VLAS   L++G S ++F  W  D +N ++      
Sbjct: 316 FVFKYIHSLMDTKSFEDN---GPCVVLASPGMLQSGISLELFERWCGDRRNGIIMAGYCV 372

Query: 357 FGTLARMLQADP 368
            GT+A+ + A P
Sbjct: 373 DGTIAKDVLAKP 384


>gi|170587204|ref|XP_001898368.1| cpsf3-prov protein [Brugia malayi]
 gi|158594194|gb|EDP32780.1| cpsf3-prov protein, putative [Brugia malayi]
          Length = 700

 Score =  141 bits (356), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 102/378 (26%), Positives = 183/378 (48%), Gaps = 33/378 (8%)

Query: 7   VTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVAST--IDAVLLSHPD 64
           +TPL          + ++  G   L+DCG +         P         +D +L++H  
Sbjct: 15  ITPLGSGQEVGRSCHYLTFKGKKILLDCGIHPGMSGVDALPFVDFVDCEELDLLLVTHFH 74

Query: 65  TLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-------LF 114
             H GALP+ +++       F   +T+ +YR+ +      YL   +VS++        L+
Sbjct: 75  LDHCGALPWLLEKTAFRGRCFMTHATKAIYRMSI----GDYL---KVSKYGGSSDNRMLY 127

Query: 115 TLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVD 174
             +D++ + + +  +    ++H   +  GI    HVAGH+LG  ++ I   G  ++Y  D
Sbjct: 128 NEEDLEKSMEKIEVI----DFHEQKEVNGIKFWCHVAGHVLGACMFMIEIAGVRILYTGD 183

Query: 175 YNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLL 233
           ++R +++HL    L + V P VLI ++         R +RE  F   + + +  GG  L+
Sbjct: 184 FSRLEDRHLCAAELPT-VSPDVLICESTYGTQVHESRDEREKRFTSIVHEIVGRGGRCLI 242

Query: 234 PVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE 291
           P  + GR  ELLLIL++YW  H    + P+Y+ + ++   +   ++F+  M   I K  +
Sbjct: 243 PAFALGRAQELLLILDEYWESHPELQDIPVYYASSLAKKCMAVYQTFVSGMNSRIQK--Q 300

Query: 292 TSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
            + +N F+ KHV+   N   +D+  D GP +VLAS   L+ G S ++F  W +D KN  +
Sbjct: 301 IALNNPFVFKHVS---NLKSIDHFEDVGPCVVLASPGMLQNGLSRELFENWCTDSKNGCI 357

Query: 351 FTERGQFGTLARMLQADP 368
                  GTLA+ + ++P
Sbjct: 358 IAGYCVEGTLAKHILSEP 375


>gi|157876175|ref|XP_001686447.1| putative cleavage and polyadenylation specificity factor
           [Leishmania major strain Friedlin]
 gi|68129521|emb|CAJ08064.1| putative cleavage and polyadenylation specificity factor
           [Leishmania major strain Friedlin]
          Length = 756

 Score =  141 bits (356), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 105/371 (28%), Positives = 175/371 (47%), Gaps = 19/371 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPL----SKVASTIDAVLL 60
           V+V P+           +V   G   ++DCG  +H   S L  L    S     ID VL+
Sbjct: 26  VEVLPIGSGGEVGRSCVVVQYKGRGVMLDCG--NHPAKSGLDSLPFFDSIKCDEIDVVLI 83

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           +H    H GALPY   Q      VF T        + M D    R      DL T + + 
Sbjct: 84  THFHLDHCGALPYFCNQTSFKGRVFMTSATKAFYKMVMND--FLRIGAGASDLVTSEWLQ 141

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           S    +  + Y +   ++G    I   P  AGH+LG  ++ +   G   +Y  D++R  +
Sbjct: 142 STIDRIETVEYHEEVTVNG----ISFQPFNAGHVLGAAMFMVDIAGMRALYTGDFSRVPD 197

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAG 239
           +HL G  +  +  P +LI ++ N +     R++R  +F  ++   +R GG  L+PV + G
Sbjct: 198 RHLLGAEVPPY-SPDILIAESTNGIRELESREEREHLFTSSVHDVVRRGGRCLVPVFALG 256

Query: 240 RVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA 297
           R  ELLLILE++W  H    N PIY+ + ++   +   ++F+  M D + K    +  N 
Sbjct: 257 RAQELLLILEEFWDAHKELQNIPIYYASSLAQRCMKLYQTFVSAMNDRV-KQQHANHHNP 315

Query: 298 FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQF 357
           F+ K++  L++    ++  +GP +VLAS   L++G S ++F  W  D +N ++       
Sbjct: 316 FVFKYIRSLMDTKSFED--NGPCVVLASPGMLQSGISLELFERWCGDRRNGIIMAGYCVD 373

Query: 358 GTLARMLQADP 368
           GT+A+ + A P
Sbjct: 374 GTIAKDVLAKP 384


>gi|209876680|ref|XP_002139782.1| cleavage and polyadenylation specificity factor subunit 3
           [Cryptosporidium muris RN66]
 gi|209555388|gb|EEA05433.1| cleavage and polyadenylation specificity factor subunit 3, putative
           [Cryptosporidium muris RN66]
          Length = 767

 Score =  141 bits (356), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 104/368 (28%), Positives = 177/368 (48%), Gaps = 27/368 (7%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           +V+  G + + DCG +  F      P+      S+ID  L++H    H GA+PY +    
Sbjct: 41  VVTFKGRSVMFDCGIHPAFSGIGSLPVFDAVDISSIDLCLVTHFHLDHSGAIPYFVSSTD 100

Query: 80  LSAPVFSTEPVYRLGLLTMYDQYLSRR--------------QVSEFDLFTLDDIDSAFQS 125
            +  +F TEP   +  L   D     R               VS  +L+T  DI+ A + 
Sbjct: 101 FNGRIFMTEPTKAICKLVWQDYARMNRFSTNSPVPVDSDEAPVSCVNLYTEPDIEKAMKR 160

Query: 126 VTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNG 185
           +  + + Q   +    +G+ ++ + AGH+LG  ++ +   G  ++Y  DY+R  ++H+  
Sbjct: 161 IEIIDFRQQAEI----DGVRISCYGAGHVLGACMFLVEIGGVRILYTGDYSREDDRHVPR 216

Query: 186 TVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLEL 244
             +   V   VLI ++        PR+ RE  F   +   L   G  LLPV + GR  EL
Sbjct: 217 AEIPP-VDVHVLICESTYGTRLHEPRKDREKRFLGCVQSILSRQGKCLLPVFAIGRAQEL 275

Query: 245 LLILEDYWAEHSL--NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKH 302
           LLIL+++WA+ S   N PIY+ + +S   +   ++++   GD++ K  +    N F  + 
Sbjct: 276 LLILDEHWAQTSCLHNIPIYYASPMSVKCMRVFETYINQCGDAVRKQADMGI-NPFNFQF 334

Query: 303 VTLLINKSELDNA--PDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTL 360
           V  + + SE+ +A   +GP +++A+   L+ G S DIF  WA D +N V+ T     GT 
Sbjct: 335 VKTVNSISEIKDAIYSEGPCVIMAAPGMLQNGTSRDIFEVWAPDKRNGVILTGYAIRGTP 394

Query: 361 ARMLQADP 368
           A  L+ +P
Sbjct: 395 AYELRREP 402


>gi|393217572|gb|EJD03061.1| Metallo-hydrolase/oxidoreductase [Fomitiporia mediterranea MF3/22]
          Length = 826

 Score =  141 bits (356), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 98/324 (30%), Positives = 163/324 (50%), Gaps = 14/324 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGL---SAPVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+DA+L++H    H  +L Y M++         V+ T P   +    M D ++     S
Sbjct: 57  STVDAILVTHFHIDHAASLTYIMEKTNFRDGKGKVYMTHPTKGVYRFLMQD-FMRISSTS 115

Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
              LFT  ++  +  S+  ++  Q   +S    G+   P+ AGH+LG  ++ I   G  +
Sbjct: 116 TDGLFTSVELSMSLASIMTVSAHQLITVS---PGLSFTPYHAGHVLGACMFLIDIAGLRI 172

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAG 228
           +Y  DY+R +++HL    +   VRP VLI ++   +     R  +E  F + +   +R G
Sbjct: 173 LYTGDYSREEDRHLVKAEIPP-VRPDVLIVESTYGVQGHEERDTKEHRFTNLVHSIIRRG 231

Query: 229 GNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSI 286
           G+ LLPV + GR  ELLLILEDYW +H    N PIY+ + ++   +   ++++  M  +I
Sbjct: 232 GHALLPVFALGRAQELLLILEDYWKKHPDLHNVPIYYASNLARKCMAVYQTYIHTMNSNI 291

Query: 287 TKSFETSRDNAFLLKHVTLL--INKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASD 344
              F   RDN F+ KHV+ +  +   E   A   P ++L +   L+ G S ++   WA D
Sbjct: 292 RSRF-AKRDNPFVFKHVSNIPQVRGWEKRIAEGPPCVILCTPGMLQPGPSRELLELWAPD 350

Query: 345 VKNLVLFTERGQFGTLARMLQADP 368
            +N ++ T     GTLAR +  +P
Sbjct: 351 PRNGLIITGYSVEGTLARDIVNEP 374


>gi|414881435|tpg|DAA58566.1| TPA: putative RNA-metabolising metallo-beta-lactamase [Zea mays]
          Length = 558

 Score =  141 bits (355), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 100/363 (27%), Positives = 166/363 (45%), Gaps = 23/363 (6%)

Query: 22  LVSIDGFNFLIDCGW-----NDHFDPSLLQPLSK-----VASTIDAVLLSHPDTLHLGAL 71
           +V+I G   + DCG      +D   P   + L+        + I  V+++H    H+GAL
Sbjct: 20  VVTIGGKRVMFDCGMHMGYHDDRHYPDFARALAAWGAPDFTTAISCVVITHFHMDHIGAL 79

Query: 72  PYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDDIDSAFQSVTRLT 130
           PY  +  G   P++ T P   L    + D + ++  Q  E   ++ +DI    + VT + 
Sbjct: 80  PYFTEVCGYHGPIYMTYPTKALAPFMLEDYRKVTMGQRGEEKQYSYEDILRCMKKVTPMD 139

Query: 131 YSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLES 190
             Q   +    + +V+  + AGH++G  +         ++Y  DYN   ++HL    ++ 
Sbjct: 140 LKQTVQVD---KDLVIRAYYAGHVIGAAMIYAKVGDAAMVYTGDYNMTPDRHLGAAQIDR 196

Query: 191 FVRPAVLITDAYNA--LHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLIL 248
            ++  VLIT++  A  + +  P ++RE F  A+ K +  GG VL+P  + GR  EL ++L
Sbjct: 197 -LKLDVLITESTYAKSIRDSKPARERE-FLKAVHKCVSGGGKVLIPTFALGRAQELCMLL 254

Query: 249 EDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLIN 308
           +DYW    L  PIYF   ++     Y K  + W    I  S      N F  KHV     
Sbjct: 255 DDYWERMGLKVPIYFSAGLTIQANVYYKMLIGWTSQKIKDSHTVH--NPFDFKHVCHF-E 311

Query: 309 KSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           +S ++N   GP ++ A+   +  GFS + F +WA   KNLV        GT+   L    
Sbjct: 312 RSFINNP--GPCVLFATPGMITGGFSLEAFKKWAPSEKNLVTLPGYCVSGTIGHKLMCGK 369

Query: 369 PPK 371
           P +
Sbjct: 370 PTR 372


>gi|291238246|ref|XP_002739041.1| PREDICTED: cleavage and polyadenylation specific factor 3-like
           [Saccoglossus kowalevskii]
          Length = 573

 Score =  141 bits (355), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 96/355 (27%), Positives = 161/355 (45%), Gaps = 43/355 (12%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           ++V PL    +      LVSI G N + DCG    +ND     D S +     +   +D 
Sbjct: 4   IKVVPLGAGQDVGRSCVLVSIGGKNIMFDCGMHMGYNDERRFPDFSYITRAGTLTEHLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H G+LP+  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGSLPHMSEMIGFDGPIYMTIPTKAICPILLEDYRKITVEKKGETNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  ++ +    + V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVNLHQTVQVDDELE---IKAYYAGHVLGAAMFHVKVGSQSVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVD 236
              ++HL                            ++R+  Q  +   +  GG VL+PV 
Sbjct: 181 MTADRHLGC--------------------------RERDFLQK-VHDCVEKGGKVLIPVF 213

Query: 237 SAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN 296
           + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + N
Sbjct: 214 ALGRAQELCILLETFWDRMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQRN 271

Query: 297 AFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
            F  +H+    ++S  DN   GP +V A+   L  G S  +F +WAS+ KN+V+ 
Sbjct: 272 MFEFRHIKPF-DRSYTDNP--GPMVVFATPGMLHGGLSLHVFKKWASNEKNMVIM 323


>gi|115396064|ref|XP_001213671.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114193240|gb|EAU34940.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 1005

 Score =  141 bits (355), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 114/420 (27%), Positives = 176/420 (41%), Gaps = 93/420 (22%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G   L+D GW+D FDP +LQ L K   T+  +LL+H    H+GA  +  K   L    PV
Sbjct: 27  GIKILVDVGWDDTFDPLVLQELEKHVPTLSLILLTHATPAHIGAFVHCCKTFPLFTQIPV 86

Query: 85  FSTEPVYRLGLLTMYDQYLS---------RRQVSE------------------------- 110
           ++T PV  LG   + D Y S         +  +SE                         
Sbjct: 87  YATSPVIALGRTLLQDLYASAPLAATFLPKASISEPGAGTSAASAGATATEGEGSADAPH 146

Query: 111 -----FDLFTLDDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGHLLGGTVW 160
                    T ++I   F  +  L YSQ +  S         G+ +  + AGH +GGT+W
Sbjct: 147 PSRILLQPPTNEEIARYFSLIHPLKYSQPHQPSPSPFSPPLNGLTLTAYNAGHTVGGTIW 206

Query: 161 KITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAYNALHNQ 208
            I    E ++YAVD+N+ +E  + G             V+E   +P  L+          
Sbjct: 207 HIQHGMESIVYAVDWNQARESVVAGAAWFGGPGASGTEVIEQLRKPTALVCSTRGGDKFA 266

Query: 209 PP--RQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLN------- 258
            P  R++R ++  D I  TL  GG VL+P D++ RVLEL   LE  W + + +       
Sbjct: 267 LPGGRKKRDDLLLDMIRSTLAKGGTVLIPTDTSARVLELAYALEHAWRDAAASGSEDKTL 326

Query: 259 --YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD--------------------- 295
               +Y       +T+   +S LEWM ++I + FE +                       
Sbjct: 327 KEAGLYLAGRKVHTTMRLARSMLEWMDENIVREFEAAEGVDATTGQSIQRPGGQKDEKGV 386

Query: 296 NAFLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
             F  K++ L+  + +L+   A   PK++LAS +SL+ GF+ +     A    NL+L TE
Sbjct: 387 GPFTFKNLKLVERRKKLEKILADQTPKVILASDSSLDWGFAKESLRLIAEGSNNLLLLTE 446


>gi|224108267|ref|XP_002314781.1| predicted protein [Populus trichocarpa]
 gi|222863821|gb|EEF00952.1| predicted protein [Populus trichocarpa]
          Length = 639

 Score =  141 bits (355), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 102/360 (28%), Positives = 171/360 (47%), Gaps = 20/360 (5%)

Query: 22  LVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
           +V+I+G   + DCG    ++DH    D SL+        ++D V+++H    H+GALPY 
Sbjct: 20  VVTINGKRIMFDCGMHMGYDDHRRYPDFSLISKSRDFDHSLDCVIITHFHLDHVGALPYF 79

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTMYD--QYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYS 132
            +  G + P++ T P   L  L + D  + L  R+  E + FT   I    + V  +   
Sbjct: 80  TEVCGYNGPIYMTYPTKALAPLMLEDFRKVLVDRRGEE-EQFTSLHISQCMEKVIAVDLK 138

Query: 133 QNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFV 192
           Q   +    + + +  + AGH+LG  ++        ++Y  DYN   ++HL    ++  +
Sbjct: 139 QTVQVD---DDLQIRAYYAGHVLGAAMFYAKVGDSAMVYTGDYNMTPDRHLGAAQIDR-L 194

Query: 193 RPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDY 251
              +LIT++  A   +  +  RE  F  A+ + +  GG VL+P  + GR  EL ++L+DY
Sbjct: 195 ELDLLITESTYATTIRDSKYAREREFLKAVHECVAGGGKVLIPTFALGRAQELCILLDDY 254

Query: 252 WAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSE 311
           W   +L  PIYF   ++     Y K  + W    + +++ T   NAF  KHV        
Sbjct: 255 WERMNLKVPIYFSAGLTIQANLYYKILISWTSQKVKETYATR--NAFDFKHVHNF--DRS 310

Query: 312 LDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPK 371
           L NAP GP ++ A+   +  GFS ++F +WA    NL+        GT+   L +  P K
Sbjct: 311 LINAP-GPCVLFATPGMISGGFSLEVFKQWAPCEMNLITLPGYCVAGTVGHKLMSGKPTK 369


>gi|342180524|emb|CCC90000.1| putative cleavage and polyadenylation specificity factor subunit
           [Trypanosoma congolense IL3000]
          Length = 766

 Score =  141 bits (355), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 103/371 (27%), Positives = 179/371 (48%), Gaps = 19/371 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPL----SKVASTIDAVLL 60
           V++ P+           +V   G + ++DCG  +H   S L  L    S     ID VL+
Sbjct: 38  VEILPIGSGGEVGRSCIVVRYKGRSVMLDCG--NHPAKSGLDSLPFFDSIRCEEIDVVLI 95

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           +H    H GALPY  +Q      +F T        + M D    R   S  D+   + + 
Sbjct: 96  THFHLDHCGALPYFCEQTAFKGRIFMTSATKAFYKMVMND--FLRVGASAEDIVNNEWLQ 153

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           S  + +  + Y +   ++G    I   P  AGH+LG  ++ +   G  V+Y  D++R  +
Sbjct: 154 STIEKIETVEYHEEVTVNG----IHFQPFNAGHVLGAALFMVDIAGMKVLYTGDFSRVPD 209

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAG 239
           +HL G  +  +  P +LI ++ N +     R++RE +F   +   ++ GG  L+PV + G
Sbjct: 210 RHLLGAEVPPY-SPDILIAESTNGIRELESREERETLFTTWVHDVVKGGGRCLIPVFALG 268

Query: 240 RVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA 297
           R  ELLLILE+YW  H    + PIY+ + ++   +   ++F+  M D + +  E  R N 
Sbjct: 269 RAQELLLILEEYWEAHKELQHIPIYYASSLAQRCMKLYQTFVSAMNDRVKEQHENHR-NP 327

Query: 298 FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQF 357
           F+ K++  L++    ++   GP +VLAS   L++G S ++F  W  D +N ++       
Sbjct: 328 FVFKYIQSLLDTRSFEDT--GPCVVLASPGMLQSGISLELFERWCGDKRNGIIVAGYCVD 385

Query: 358 GTLARMLQADP 368
           GT+A+ + + P
Sbjct: 386 GTIAKEILSKP 396


>gi|426197081|gb|EKV47008.1| hypothetical protein AGABI2DRAFT_203789 [Agaricus bisporus var.
           bisporus H97]
          Length = 794

 Score =  141 bits (355), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 98/333 (29%), Positives = 165/333 (49%), Gaps = 22/333 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLS---APVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           S++DA+L++H    H  AL Y  ++         V+ T P   L    M D   +RR +S
Sbjct: 57  SSVDAILITHFHLDHAAALTYITEKTNFKDGKGKVYMTHPTKALHKFMMQDFVRTRRALS 116

Query: 110 ---------EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVW 160
                       LF+  D+  +  S+  ++  Q   L     G+   P+ AGH+LG  ++
Sbjct: 117 VKCPHSSASSDALFSPLDMQMSLASIIAVSAHQ---LITVCPGVSFIPYHAGHVLGACMF 173

Query: 161 KITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQD 219
            I   G  ++Y  DY+R +++HL    L   +RP VL+ ++   +H    R+++E  F  
Sbjct: 174 LIDIAGLKILYTGDYSREEDRHLIKAELPP-IRPDVLVVESTYGVHTGESREEKEHRFTS 232

Query: 220 AISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKS 277
            +   +R GG+VLLP  + GR  ELLLIL+DYW +H    N P+Y+ + ++   +   ++
Sbjct: 233 LVHSIIRRGGHVLLPTFALGRAQELLLILDDYWKKHPDLHNVPVYYASGLARKCMAVYQT 292

Query: 278 FLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNA-PDGPK-LVLASMASLEAGFSH 335
           ++  M  +I   F   RDN F+ KH++ +      +    DGP  +VLAS   ++ G S 
Sbjct: 293 YIHTMNANIRSRF-ARRDNPFVFKHISNVPQTRGWEKKIADGPPCVVLASPGFMQVGPSR 351

Query: 336 DIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           ++F  W  D +N ++ T     GT AR +  +P
Sbjct: 352 ELFEHWCPDARNGLIITGYSIEGTPARDIMTEP 384


>gi|302309220|ref|NP_986485.2| AGL182Cp [Ashbya gossypii ATCC 10895]
 gi|299788256|gb|AAS54309.2| AGL182Cp [Ashbya gossypii ATCC 10895]
 gi|374109730|gb|AEY98635.1| FAGL182Cp [Ashbya gossypii FDAG1]
          Length = 803

 Score =  141 bits (355), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 132/524 (25%), Positives = 232/524 (44%), Gaps = 72/524 (13%)

Query: 22  LVSIDGFNFLIDCGWND--HFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA----- 74
           ++S D    LID GW+    +D  +     +    +D +LLS P    +GA  YA     
Sbjct: 19  ILSFDNCTLLIDPGWSGGCSYDECMAY-WKEWIPQVDIILLSQPIQECIGA--YAALFFD 75

Query: 75  -MKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLD--DIDSAFQSVTRLTY 131
            +        V+ST PV  LG +   D Y S   +  FD   +D  DID+AF  +  + Y
Sbjct: 76  YISHFNSRIQVYSTLPVANLGRVATVDLYASLGIIGPFDTNRIDIEDIDTAFDHLNTVKY 135

Query: 132 SQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVL--- 188
           SQ   L  + +G+ +  + +G   GGT+W      E V+YA  +N  ++  LN   L   
Sbjct: 136 SQLVDLKSRFDGLSLVAYSSGFAPGGTIWCANTYSEKVLYAPRWNHTRDTILNSADLLDK 195

Query: 189 -----ESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLE 243
                 + +RP+ +I  A +   + P R++ + F++ I K L A  +V+LP    G+ LE
Sbjct: 196 GGKPSTALMRPSAVIMSAAHVGPSTPYRKRSQKFKEVIKKALSANTSVILPSAIGGKFLE 255

Query: 244 LLLILEDYWAEH-----SLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA- 297
           L +++ D   E+       + P+  L+Y    T+ Y +S LEW+   + K++E SRDN  
Sbjct: 256 LFVLVHDILHENKKSGLQADAPVLLLSYSRGRTLTYARSMLEWLSSQLVKTWE-SRDNKS 314

Query: 298 -FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQ 356
            F L +   ++N ++L N P G K+   S         +D   +  +  K +++ TE+  
Sbjct: 315 PFDLGNRLKIVNVNDLANYP-GTKICFISQVET---LINDALSKVCTKEKAMLVLTEKPT 370

Query: 357 F-----GTLAR---------------MLQADPPP--KAVKVTMSRRVPLVGEELIAYEEE 394
           +       LA+                ++ +P    +++ +  S+  PL G +L   EE 
Sbjct: 371 YYSHTIAILAKAYAKWERALNSNNLNAVEGNPIAYSESLSLQFSKTKPLTGSDL---EEF 427

Query: 395 QTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDIL 454
           + R++     +A L+   +S      DN           ++ +   DV+ PHG       
Sbjct: 428 KERIEARRKERAELLSSFQSN-----DNPAGASAFTAIEDDDDEEEDVLRPHGAGALSTK 482

Query: 455 IDGFVPPSTSVAP-------MFPFYENNSEWDDFGEVINPDDYI 491
           ++  +P    + P       MFPF       DD+GE+++ + ++
Sbjct: 483 VE--IPTDLIIQPNALPKHKMFPFQPGKVAHDDYGELVDFERFL 524


>gi|414881434|tpg|DAA58565.1| TPA: putative RNA-metabolising metallo-beta-lactamase [Zea mays]
          Length = 400

 Score =  141 bits (355), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 100/363 (27%), Positives = 166/363 (45%), Gaps = 23/363 (6%)

Query: 22  LVSIDGFNFLIDCGW-----NDHFDPSLLQPLSK-----VASTIDAVLLSHPDTLHLGAL 71
           +V+I G   + DCG      +D   P   + L+        + I  V+++H    H+GAL
Sbjct: 20  VVTIGGKRVMFDCGMHMGYHDDRHYPDFARALAAWGAPDFTTAISCVVITHFHMDHIGAL 79

Query: 72  PYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDDIDSAFQSVTRLT 130
           PY  +  G   P++ T P   L    + D + ++  Q  E   ++ +DI    + VT + 
Sbjct: 80  PYFTEVCGYHGPIYMTYPTKALAPFMLEDYRKVTMGQRGEEKQYSYEDILRCMKKVTPMD 139

Query: 131 YSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLES 190
             Q   +    + +V+  + AGH++G  +         ++Y  DYN   ++HL    ++ 
Sbjct: 140 LKQTVQVD---KDLVIRAYYAGHVIGAAMIYAKVGDAAMVYTGDYNMTPDRHLGAAQIDR 196

Query: 191 FVRPAVLITDAYNA--LHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLIL 248
            ++  VLIT++  A  + +  P ++RE F  A+ K +  GG VL+P  + GR  EL ++L
Sbjct: 197 -LKLDVLITESTYAKSIRDSKPARERE-FLKAVHKCVSGGGKVLIPTFALGRAQELCMLL 254

Query: 249 EDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLIN 308
           +DYW    L  PIYF   ++     Y K  + W    I  S      N F  KHV     
Sbjct: 255 DDYWERMGLKVPIYFSAGLTIQANVYYKMLIGWTSQKIKDSHTVH--NPFDFKHVCHF-E 311

Query: 309 KSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           +S ++N   GP ++ A+   +  GFS + F +WA   KNLV        GT+   L    
Sbjct: 312 RSFINNP--GPCVLFATPGMITGGFSLEAFKKWAPSEKNLVTLPGYCVSGTIGHKLMCGK 369

Query: 369 PPK 371
           P +
Sbjct: 370 PTR 372


>gi|294656507|ref|XP_002770276.1| DEHA2D07304p [Debaryomyces hansenii CBS767]
 gi|199431523|emb|CAR65632.1| DEHA2D07304p [Debaryomyces hansenii CBS767]
          Length = 959

 Score =  141 bits (355), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 125/449 (27%), Positives = 200/449 (44%), Gaps = 58/449 (12%)

Query: 22  LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGA---LPYAMKQ 77
           L+S D     L D  WN +    +L  + +    +D +LLSH     +     L      
Sbjct: 20  LLSFDNDIKILADPSWNGNNHNDILY-MEQYLKEVDIILLSHSTPEFISGFVLLCIKFPN 78

Query: 78  LGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD--LFTLDDIDSAFQSVTRLTYSQNY 135
           L  + P++ST PV +LG ++  + Y +   +   +  +  +D++D  F  +  L + Q  
Sbjct: 79  LMSNIPIYSTLPVNQLGRVSTVEYYRANGVLGPLNNSILEVDEVDEWFDKIIPLKFFQT- 137

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN---------GT 186
            LS     +V+ P+ AGH LGGT W IT+  E +IYA  +N  K+  LN         G 
Sbjct: 138 -LSVFDNRLVITPYNAGHTLGGTFWLITRRLEKIIYAPSWNHSKDSFLNSASFLSSSSGN 196

Query: 187 VLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLL 246
            L   +RP VLIT+  +       +++ E F + +  TL  GG VLLP   +GR LELL 
Sbjct: 197 PLSQLMRPTVLITNT-DLGSTMSHKKRTEKFLNLVDATLANGGAVLLPTSLSGRFLELLH 255

Query: 247 ILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA--------- 297
           +++ +    S   P+YFL+Y  +  + Y  + LEWM   + K +E +             
Sbjct: 256 LIDQHL--QSAPIPVYFLSYSGTKVLSYASNLLEWMSSQLVKEWEEASSVNNNSSNKNNF 313

Query: 298 -FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAG-FSHDIFVEWASDVKNLVLFTERG 355
            F    V LL + SEL     GPK+V  S   L+ G  S +       D K  ++ TE+ 
Sbjct: 314 PFDPSKVDLLSDPSELVQL-SGPKIVFCSGIDLKNGDMSSEALQYLCQDEKTTIVLTEKT 372

Query: 356 QFG--------------TLARMLQADPPPKAVKVTM---------SRRVPLVGEELIAYE 392
            FG               L +  Q       V V +         +R  PL+G EL  + 
Sbjct: 373 HFGLDNTINSQLYHDWYNLTKQKQGGTVEDGVAVPLEKVISLENWNREEPLIGAELTDF- 431

Query: 393 EEQTRLKKEEALKASLVKEEESKASLGPD 421
           +E+  L++++ L A  V++ +++  L  D
Sbjct: 432 QEKINLQRKQKLLAK-VRDRKNQNLLNAD 459


>gi|378756880|gb|EHY66904.1| cleavage and polyadenylation specificity factor subunit 3
           [Nematocida sp. 1 ERTm2]
          Length = 501

 Score =  141 bits (355), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 98/352 (27%), Positives = 171/352 (48%), Gaps = 21/352 (5%)

Query: 22  LVSIDGFNFLIDCGWN----DH--FDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAM 75
           +VSI     + DCG +    DH  F    L         ID V+++H    H G LPY  
Sbjct: 18  VVSIQNKTIMFDCGMHMGHSDHRRFPDFKLLGAGPYTGVIDCVIITHFHMDHCGGLPYFT 77

Query: 76  KQLGLSAPVFSTEPVYRLGLLTMYDQ---YLSRRQVSEFDL--FTLDDIDSAFQSVTRLT 130
           ++   + P++ T P   +  + + D    Y  R   S+F    +  ++I +  + V  + 
Sbjct: 78  ERCKYAGPIYMTPPTKAVLPIILQDYCKVYNERDDSSKFQYPTYNEENIKACMKKVIPIA 137

Query: 131 YSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLES 190
             +   +    +   + P+ AGH+LG  ++ +    E V+Y  DYN   ++HL+G  +  
Sbjct: 138 MDETVEIE---KDFTITPYYAGHVLGAAMFHVRVGDESVVYTGDYNMTPDRHLDGAWMPK 194

Query: 191 FVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILE 249
            V P VLIT++  AL  +  R+++E  F +++ + ++ GG VL+PV + GR  EL L+L+
Sbjct: 195 -VYPNVLITESTYALLVRDCRREKEREFIESVVQCVKNGGKVLIPVFALGRAHELCLLLD 253

Query: 250 DYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINK 309
            +W +  L+ PIY    ++    D  K F+++  + I  +    + N F  +HV      
Sbjct: 254 THWEKSKLSIPIYTSATLTHKANDIYKQFIDYTHEHIRNTMH--KRNLFDFQHVKQF--D 309

Query: 310 SELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLA 361
           S L +  +GP ++ +S   L +G S  IF +W  D KN+V+F      GT+ 
Sbjct: 310 SNLASL-EGPMILFSSPGMLHSGPSLSIFKKWCGDPKNMVIFPGYCVRGTIG 360


>gi|402590428|gb|EJW84358.1| RNA-metabolising metallo-beta-lactamase [Wuchereria bancrofti]
          Length = 579

 Score =  141 bits (355), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 88/332 (26%), Positives = 164/332 (49%), Gaps = 23/332 (6%)

Query: 31  LIDCGWNDHF-------DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAP 83
           ++DCG +  +       D S +     +  ++D V+++H    H G+LP+  + +G   P
Sbjct: 1   MLDCGMHMGYSDERRFPDFSFINGGGSLTESLDCVIITHFHLDHCGSLPHMSEVVGYDGP 60

Query: 84  VFSTEPVYRLGLLTMYDQYLSRRQVSEF----DLFTLDDIDSAFQSVTRLTYSQNYHLSG 139
           ++ T P   +  + + D    R+  +EF    + FT   I +  + V  +   +   +  
Sbjct: 61  IYMTYPTKAIAPVLLEDY---RKVQTEFKGDKNFFTSQMIKNCMKKVIAINIHEKIDVDN 117

Query: 140 KGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLIT 199
           +   + +    AGH+LG  +++I    E V+Y  D+N   ++HL    +E  ++P +LI+
Sbjct: 118 E---LSIRAFYAGHVLGAAMFQIMVGSESVLYTGDFNTTPDRHLGAARVEPGLKPDLLIS 174

Query: 200 DAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLN 258
           ++  A   +  ++ RE  F   +  T+  GG VL+PV + GR  EL ++LE YW   +L 
Sbjct: 175 ESTYATTIRDSKRARERDFLKKVHDTVSNGGKVLIPVFALGRAQELCILLESYWERMNLK 234

Query: 259 YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDG 318
           YPI+F   ++     Y + F+ W  + I ++F     N F  KH+     +S +D+   G
Sbjct: 235 YPIFFSQGLAEKANQYYRLFISWTNEKIKRTF--VERNMFDFKHIRPF-EQSYIDSP--G 289

Query: 319 PKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
           P ++ ++   L  G S  +F +W SD KNL++
Sbjct: 290 PMVLFSTPGMLHGGQSLRVFTKWCSDEKNLII 321


>gi|378730429|gb|EHY56888.1| endoribonuclease ysh1 [Exophiala dermatitidis NIH/UT8656]
          Length = 868

 Score =  140 bits (354), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 98/339 (28%), Positives = 161/339 (47%), Gaps = 29/339 (8%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H  ALPY + +      VF T P   +    + D        S  D
Sbjct: 75  STVDILLISHFHLDHAAALPYVLAKTDFKGRVFMTHPTKAIYKWLIQDSVRVSNTSSTSD 134

Query: 113 ----LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
               L+T  D  S    +  + +   + +SG    + + P+ AGH+LG  ++ I   G +
Sbjct: 135 QRTSLYTEADHISTLPQIETIDFYTTHTVSG----VRITPYPAGHVLGAAMFLINIAGLN 190

Query: 169 VIYAVDYNRRKEKHLNGTVL---ESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKT 224
           + +  DY+R +++HL    +    +  +  +LIT++   + N PPR +RE     A++  
Sbjct: 191 IWFTADYSREQDRHLVAAEVPNKSTVGKIDLLITESTFGISNAPPRAEREAGLLKAVTNI 250

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWM 282
           L  GG VL+PV + GR  ELLLILEDYW++H     YPIY+    +   +   ++++  M
Sbjct: 251 LNRGGKVLMPVFALGRAQELLLILEDYWSKHPELQKYPIYYTGNTARKCMVVYQTYINAM 310

Query: 283 GDSITKSFETSRDNA-------------FLLKHVTLLINKSELDNAPDGPKLVLASMASL 329
            D+I + F      A             +  + V  L N    D+   G  ++LAS   L
Sbjct: 311 NDNIKRIFRERMAEAEAAGNAKGVSAGPWDFRFVRSLRNLDRFDDV--GGCVMLASPGML 368

Query: 330 EAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           ++G S  +   WA D +N V+ T     GT+AR + ++P
Sbjct: 369 QSGMSRVLLERWAPDPRNGVIMTGYNVEGTMARTILSEP 407


>gi|429963288|gb|ELA42832.1| hypothetical protein VICG_00147 [Vittaforma corneae ATCC 50505]
          Length = 513

 Score =  140 bits (354), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 91/349 (26%), Positives = 168/349 (48%), Gaps = 22/349 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSL----LQPLSKVAS---TIDAVLLSHPDTLHLGALPYA 74
           +V+I+    + DCG +  +  S      Q LSK  +    +D +L+SH    H GALPY 
Sbjct: 18  VVNINNKTIMFDCGMHMGYSDSRKFPDFQALSKTGNFDKIVDCILISHFHLDHCGALPYF 77

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQ 133
            + LG   P++ T P   +  + + D Q +   +  + ++++ +DI    + +  +  ++
Sbjct: 78  TEVLGYKGPIYMTYPTKAVLPILLEDCQKILSMKSHDSNIYSFEDIKKCMEKIVPINMNE 137

Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 193
              +S   +G  +  + AGH++G  ++ +    + V+Y  DY+   ++HL GT     +R
Sbjct: 138 TVEVS---KGFTITAYYAGHVIGAAMFYVKVGDQSVVYTGDYSTTADQHL-GTAWIDTLR 193

Query: 194 PAVLITDA-YNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 252
           P ++IT++ Y ++     + +   F  +I   +  GG  L+P+ + GR  E+ LI+E YW
Sbjct: 194 PDLMITESTYGSVIRDCRKAKEREFLQSIHNCIERGGKTLIPIFALGRAQEICLIVESYW 253

Query: 253 AEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
               L  P+YF   ++    +  K F+ +  +S+ +  +    N F   H+      SEL
Sbjct: 254 ERMGLEIPVYFAGGMTEKANEIYKRFINYTNESVRE--KILEKNVFEFSHIKPYRKGSEL 311

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL---FTERGQFG 358
                GP ++ +S   L +G S  IF    SD +NLV+   +  RG  G
Sbjct: 312 Q----GPCVIFSSPGMLHSGTSLRIFKNICSDPRNLVILPGYCVRGTLG 356


>gi|401624663|gb|EJS42715.1| cft2p [Saccharomyces arboricola H-6]
          Length = 858

 Score =  140 bits (354), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 154/561 (27%), Positives = 240/561 (42%), Gaps = 97/561 (17%)

Query: 15  NENPLSYLVSIDGFNFLIDCGWNDHFDPS------LLQPLSKVASTIDAVLLSHPDTLHL 68
           +E  +  +V  D    LID GWN    PS       ++   KV   ID V+LS P T  L
Sbjct: 12  SETTVGSVVRFDNVTLLIDPGWN----PSKVSYEQCVKYWEKVIPEIDVVILSQPTTECL 67

Query: 69  GA---LPYAMKQLGLSA-PVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLD--DIDSA 122
           GA   L Y      +S   V++T PV  LG ++  D Y S   +  +D   LD  DI+ +
Sbjct: 68  GAHSLLYYNFISHFISRIHVYATLPVINLGRVSTIDSYASAGVIGPYDTNKLDLEDIEKS 127

Query: 123 FQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKH 182
           F  +  L YSQ   L  + +G+ +  + AG   GG++W I+   E +IYA  +N  ++  
Sbjct: 128 FDHIVPLKYSQLVDLRSRYDGLTLLAYNAGVCPGGSIWCISTYSEKLIYAKRWNHTRDNI 187

Query: 183 LN--------GTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLP 234
           LN        G  L + +RP+ +IT       +QP +++ + F+D + K L + G+V++P
Sbjct: 188 LNAASILDATGKPLSTLMRPSAIITTLDKFGSSQPFKKRTKTFKDTLKKGLSSDGSVIIP 247

Query: 235 VDSAGRVLELL-----LILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           VD +G+ LEL      L+ E          P+  L+Y    TI Y KS LEW+  S+ K+
Sbjct: 248 VDMSGKFLELFTQVHELLFESTKINVHTQVPVLILSYARGRTITYAKSMLEWLSPSLLKT 307

Query: 290 FETSRDNA--FLLKHVTLLINKSELDNAPDGPKLVLASMAS-------LEAGFSHDIFV- 339
           +E +R+N   F +     +I+  EL N   G K+   S           + G S    + 
Sbjct: 308 WE-NRNNTSPFEIGSRIKIISPKEL-NRYVGSKICFVSEVDALINEVITKVGNSEKTTLI 365

Query: 340 ------EWASDVKNLVLFTERGQFGTLARMLQADPPPKA---VKVTMSRRVPLVGEELIA 390
                 E AS +  ++ F       T     + D P      + +   +   L  +EL A
Sbjct: 366 LTKPKFESASSLNKIINFLSENDRKT---SFKEDKPYTCDSYISIDTIKEEALNKDELEA 422

Query: 391 YEEEQTRLKKEEALKASLVKEEESKASLG--------PDNNLSGDPMVIDANNANASADV 442
           ++ +    KK  + K SLVK E  K S G         D  ++G  ++  A NA+    V
Sbjct: 423 FKLQIKEKKKNRSKKISLVKRESKKLSNGNATIDGSTADRTINGQDIL--AENADEEQAV 480

Query: 443 VEPHG----------------------------GRYRDILIDGFVPPS-TSVAPMFPFYE 473
           V   G                             +  ++ +D  +  S TS   MFPF  
Sbjct: 481 VSIMGEDDDEEEEEEENDNLLSLLKDNTHKSAVKKNTEVPVDIIIQTSATSKHKMFPFNP 540

Query: 474 NNSEWDDFGEVIN-----PDD 489
              + DD+G V++     PDD
Sbjct: 541 AKIKKDDYGAVVDFTMFIPDD 561


>gi|255721479|ref|XP_002545674.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
 gi|240136163|gb|EER35716.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
          Length = 870

 Score =  140 bits (354), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 139/543 (25%), Positives = 236/543 (43%), Gaps = 85/543 (15%)

Query: 28  FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGA---LPYAMKQLGLSAPV 84
           F  L D  WN   D   +  + +    ID +LLSH     +     L      L  + P+
Sbjct: 27  FKILTDPSWNG-VDVDSVLFIEQHLKEIDVILLSHSTEEFISGFMLLCIKFPNLMSTIPI 85

Query: 85  FSTEPVYRLGLLTMYDQYLSRRQVSEFD--LFTLDDIDSAFQSVTRLTYSQNYHLSGKGE 142
           +ST PV +LG ++  + Y +   +   D  +  LD++D+ F  +  L Y Q+ +L     
Sbjct: 86  YSTLPVNQLGRVSTVECYRASGILGPVDSAIIELDEVDNWFDKINLLKYQQSVNLFD--N 143

Query: 143 GIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN---------GTVLESFVR 193
            +V+ P+ AGH LGGT W ITK  + VIYA  +N  K+  LN         G+   S +R
Sbjct: 144 KVVITPYNAGHTLGGTFWLITKRVDRVIYAPAWNHSKDSFLNSASFISPSTGSPHLSLLR 203

Query: 194 PAVLI--TDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDY 251
           P   +  TD  +A+ +   +++ E F   +  TL  GG V+LP   +GR LEL  +++++
Sbjct: 204 PTAFVTATDMGSAMSH---KKRTEKFLQLVDATLANGGAVVLPTSLSGRFLELFHLVDEH 260

Query: 252 WAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSE 311
                +  P+YFL+Y  +  + Y  S  +WM ++++K +E      F    V LL++ +E
Sbjct: 261 LKGAPI--PVYFLSYSGTKVLSYASSMSDWMSNTLSKQWEELSTVPFNPSKVDLLLDPAE 318

Query: 312 LDNAPDGPKLVLASMASLEAG-FSHDIFVEWASDVKNLVLFTERG--------------- 355
           L     GPK+V  S   L+ G  S + F    +D    V+ TE+                
Sbjct: 319 LIKL-SGPKIVFCSGIDLKDGDISSEAFQYLCNDTSTTVILTEKSCIDSRNGLGAELYKE 377

Query: 356 --------QFGTLARMLQADPPPKAVKV-TMSRRVPLVGEELIAYEEEQTRLKKEE---- 402
                     G  A+   A P  + + +   ++ V L G++L+ ++E+  + +KE+    
Sbjct: 378 WYTSASNKSTGNGAKDGIAVPIDRTISLQNQTKEVDLTGQDLLNFQEKVAQKRKEKLMAK 437

Query: 403 --------ALKASLV--------------------KEEESKASLGPDNNLSGDPMVIDAN 434
                    L A  V                     EE  K  L  +  +S   +   AN
Sbjct: 438 VRDQKNQNILSADTVDAEDSSDDDREDEDEEGHYSDEELKKLELAKNTAVSTSQVADLAN 497

Query: 435 NANASADVVEPHGGRYR--DILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYII 492
           +     D ++ +  +    D+ I   + P  ++ P FP   +  ++DD+GEVI+   +  
Sbjct: 498 HEAFVMDTIKQNLEKNLPIDLKITHKLKPRQAMFPYFP-TAHREKFDDYGEVIDIKKFQK 556

Query: 493 KDE 495
            DE
Sbjct: 557 NDE 559


>gi|449296201|gb|EMC92221.1| hypothetical protein BAUCODRAFT_569527 [Baudoinia compniacensis
           UAMH 10762]
          Length = 834

 Score =  140 bits (353), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 102/382 (26%), Positives = 177/382 (46%), Gaps = 28/382 (7%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           +++   G   ++D G +  +D     P       ST+D +L++H    H+ +LPY + + 
Sbjct: 42  HIIQYKGKTVMLDAGIHPAYDGLAALPFYDEFDLSTVDVLLITHFHMDHVASLPYVLAKT 101

Query: 79  GLSAPVFSTEPVYRLGLLTMYD----QYLSRRQVSEFD-----LFTLDDIDSAFQSVTRL 129
             +  V+ T P   +    M D    Q       S  D     LF   DI +    +  +
Sbjct: 102 PFAGRVYMTHPTKAIYKHLMTDSVRVQNTHTSATSGTDGYVAQLFNEQDILTTMPQIQTI 161

Query: 130 TYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLE 189
           +++   H+     GI   P+ AGH+LG  ++ I   G ++++  DY+R   +HL    + 
Sbjct: 162 SFNTT-HIHN---GIKFTPYPAGHVLGACMYLIEIAGLNILFTGDYSREDNRHLMPASIP 217

Query: 190 SFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLIL 248
             V    LIT++   +    PR +RE     +I+  L  GG  LLP  + G   ELLLIL
Sbjct: 218 RHVNVDCLITESTFGISTHVPRAERETALMRSITGILNRGGRALLPTFALGGAQELLLIL 277

Query: 249 EDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN------AFLL 300
           EDYWA H     +PIYF + ++   +   +++++ M ++I   F+ ++ N       +  
Sbjct: 278 EDYWARHPEYQRFPIYFASSLARKCMVVYQTYIDAMNENIRTKFQAAQANPDGVGGPWDF 337

Query: 301 KHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTL 360
           +H+  L +    D+   G  ++LAS   L+ G S  +   WA D KN V+ T     GT+
Sbjct: 338 QHIRSLKSLERFDDV--GGCVMLASPGMLQNGVSRSLLERWAPDAKNGVIITGYSVEGTM 395

Query: 361 ARMLQADPPPKAVKVTMSRRVP 382
           A+ +  +  P ++   M+ R P
Sbjct: 396 AKSIMLE--PDSIPAVMTNRQP 415


>gi|414881433|tpg|DAA58564.1| TPA: putative RNA-metabolising metallo-beta-lactamase [Zea mays]
          Length = 400

 Score =  140 bits (353), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 100/363 (27%), Positives = 166/363 (45%), Gaps = 23/363 (6%)

Query: 22  LVSIDGFNFLIDCGW-----NDHFDPSLLQPLSK-----VASTIDAVLLSHPDTLHLGAL 71
           +V+I G   + DCG      +D   P   + L+        + I  V+++H    H+GAL
Sbjct: 20  VVTIGGKRVMFDCGMHMGYHDDRHYPDFARALAAWGAPDFTTAISCVVITHFHMDHIGAL 79

Query: 72  PYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDDIDSAFQSVTRLT 130
           PY  +  G   P++ T P   L    + D + ++  Q  E   ++ +DI    + VT + 
Sbjct: 80  PYFTEVCGYHGPIYMTYPTKALAPFMLEDYRKVTMGQRGEEKQYSYEDILRCMKKVTPMD 139

Query: 131 YSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLES 190
             Q   +    + +V+  + AGH++G  +         ++Y  DYN   ++HL    ++ 
Sbjct: 140 LKQTVQVD---KDLVIRAYYAGHVIGAAMIYAKVGDAAMVYTGDYNMTPDRHLGAAQIDR 196

Query: 191 FVRPAVLITDAYNA--LHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLIL 248
            ++  VLIT++  A  + +  P ++RE F  A+ K +  GG VL+P  + GR  EL ++L
Sbjct: 197 -LKLDVLITESTYAKSIRDSKPARERE-FLKAVHKCVSGGGKVLIPTFALGRAQELCMLL 254

Query: 249 EDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLIN 308
           +DYW    L  PIYF   ++     Y K  + W    I  S      N F  KHV     
Sbjct: 255 DDYWERMGLKVPIYFSAGLTIQANVYYKMLIGWTSQKIKDSHTVH--NPFDFKHVCHF-E 311

Query: 309 KSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           +S ++N   GP ++ A+   +  GFS + F +WA   KNLV        GT+   L    
Sbjct: 312 RSFINNP--GPCVLFATPGMITGGFSLEAFKKWAPSEKNLVTLPGYCVSGTIGHKLMCGK 369

Query: 369 PPK 371
           P +
Sbjct: 370 PTR 372


>gi|391871950|gb|EIT81099.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT2
           [Aspergillus oryzae 3.042]
          Length = 1010

 Score =  140 bits (353), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 116/431 (26%), Positives = 176/431 (40%), Gaps = 104/431 (24%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G   L+D GW+D FDP  LQ L K   T+  +LL+H    H+GA  +  K   L    PV
Sbjct: 27  GIKILVDVGWDDTFDPLDLQELEKHVPTLSLILLTHATPAHIGAFAHCCKTFPLFTQIPV 86

Query: 85  FSTEPVYRLGLLTMYDQYLS---------RRQVSE------------------------- 110
           ++T PV  LG   + D Y S         +  +SE                         
Sbjct: 87  YATSPVIALGRTLLQDLYASSPLAATFLPKASISEPGASTSAASAAASAAASAPEGEGGA 146

Query: 111 ---------FDLFTLDDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGHLLG 156
                        T ++I   F  +  L YSQ +            G+ +  + AGH +G
Sbjct: 147 DASHSGRILLQPPTAEEIARYFSLIHPLKYSQPHQPLPSPFSPPLNGLTLTAYNAGHTVG 206

Query: 157 GTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAYNA 204
           GT+W I    E ++YAVD+N+ +E  + G             V+E   +P  L+      
Sbjct: 207 GTIWHIQHGMESIVYAVDWNQARESVMAGAAWFGGSGASGTEVIEQLRKPTALVCSTRGG 266

Query: 205 LHNQPP--RQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS----- 256
                P  R++R+ +  D I  TL  GG VL+P D++ RVLEL   LE  W + +     
Sbjct: 267 DKFALPGGRKKRDDLLLDMIRSTLAKGGTVLIPTDTSARVLELAYALEHAWRDAAGTGQE 326

Query: 257 ----LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET----------SRDNA----- 297
                   +Y     +++T+   +S LEWM ++I + FE           SR N      
Sbjct: 327 DNVLKEAGLYLAGRKANTTMRLARSMLEWMDENIVREFEAAEGVDAATGQSRANPGGQRS 386

Query: 298 -------------FLLKHVTLLINKSELDNAPD--GPKLVLASMASLEAGFSHDIFVEWA 342
                        F  KH+ ++  K +L+   +   PK++LAS  SL+ GF+ +     A
Sbjct: 387 GQNQGKEEKGTGPFTFKHLKIVERKKKLEKILNNQAPKVILASDTSLDWGFAKESLRLVA 446

Query: 343 SDVKNLVLFTE 353
               NL+L TE
Sbjct: 447 GGPNNLLLLTE 457


>gi|146099573|ref|XP_001468678.1| putative cleavage and polyadenylation specificity factor
           [Leishmania infantum JPCM5]
 gi|134073046|emb|CAM71766.1| putative cleavage and polyadenylation specificity factor
           [Leishmania infantum JPCM5]
          Length = 756

 Score =  140 bits (352), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 108/372 (29%), Positives = 176/372 (47%), Gaps = 21/372 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPL----SKVASTIDAVLL 60
           V+V P+           +V   G   ++DCG  +H   S L  L    S     ID VL+
Sbjct: 26  VEVLPIGSGGEVGRSCVVVRYKGRGVMLDCG--NHPAKSGLDSLPFFDSIKCDEIDVVLI 83

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           +H    H GALPY   Q      +F T        + M D    R      DL T + + 
Sbjct: 84  THFHLDHCGALPYFCNQTSFKGRIFMTSATKAFYKMVMND--FLRIGAGASDLVTSEWLQ 141

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           S    +  + Y +   ++G    I   P  AGH+LG  ++ +   G   +Y  D++R  +
Sbjct: 142 STIDRIETVEYHEEVTVNG----ISFQPFNAGHVLGAAMFMVDIAGMRALYTGDFSRVPD 197

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAG 239
           +HL G  +  +  P +LI ++ N +     R++R  +F  ++   +R GG  L+PV + G
Sbjct: 198 RHLLGAEVPPY-SPDILIAESTNGIRELESREEREHLFTSSVHDVVRRGGRCLVPVFALG 256

Query: 240 RVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA 297
           R  ELLLILE++W  H    N PIY+ + ++   +   ++F+  M D + K    +  N 
Sbjct: 257 RAQELLLILEEFWDAHKELQNIPIYYASSLAQRCMKLYQTFVSAMNDRV-KQQHANHHNP 315

Query: 298 FLLKHV-TLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQ 356
           F+ K++ +L+  KS  DN   GP +VLAS   L++G S ++F  W  D +N ++      
Sbjct: 316 FVFKYIHSLMDTKSFEDN---GPCVVLASPGMLQSGISLELFERWCGDRRNGIIMAGYCV 372

Query: 357 FGTLARMLQADP 368
            GT+A+ + A P
Sbjct: 373 DGTIAKDVLAKP 384


>gi|398022636|ref|XP_003864480.1| cleavage and polyadenylation specificity factor, putative
           [Leishmania donovani]
 gi|322502715|emb|CBZ37798.1| cleavage and polyadenylation specificity factor, putative
           [Leishmania donovani]
          Length = 756

 Score =  140 bits (352), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 108/372 (29%), Positives = 176/372 (47%), Gaps = 21/372 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPL----SKVASTIDAVLL 60
           V+V P+           +V   G   ++DCG  +H   S L  L    S     ID VL+
Sbjct: 26  VEVLPIGSGGEVGRSCVVVRYKGRGVMLDCG--NHPAKSGLDSLPFFDSIKCDEIDVVLI 83

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           +H    H GALPY   Q      +F T        + M D    R      DL T + + 
Sbjct: 84  THFHLDHCGALPYFCNQTSFKGRIFMTSATKAFYKMVMND--FLRIGAGASDLVTSEWLQ 141

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           S    +  + Y +   ++G    I   P  AGH+LG  ++ +   G   +Y  D++R  +
Sbjct: 142 STIDRIETVEYHEEVTVNG----ISFQPFNAGHVLGAAMFMVDIAGMRALYTGDFSRVPD 197

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAG 239
           +HL G  +  +  P +LI ++ N +     R++R  +F  ++   +R GG  L+PV + G
Sbjct: 198 RHLLGAEVPPY-SPDILIAESTNGIRELESREEREHLFTSSVHDVVRRGGRCLVPVFALG 256

Query: 240 RVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA 297
           R  ELLLILE++W  H    N PIY+ + ++   +   ++F+  M D + K    +  N 
Sbjct: 257 RAQELLLILEEFWDAHKELQNIPIYYASSLAQRCMKLYQTFVSAMNDRV-KQQHANHHNP 315

Query: 298 FLLKHV-TLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQ 356
           F+ K++ +L+  KS  DN   GP +VLAS   L++G S ++F  W  D +N ++      
Sbjct: 316 FVFKYIHSLMDTKSFEDN---GPCVVLASPGMLQSGISLELFERWCGDRRNGIIMAGYCV 372

Query: 357 FGTLARMLQADP 368
            GT+A+ + A P
Sbjct: 373 DGTIAKDVLAKP 384


>gi|358333178|dbj|GAA51732.1| integrator complex subunit 11 [Clonorchis sinensis]
          Length = 649

 Score =  140 bits (352), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 96/299 (32%), Positives = 150/299 (50%), Gaps = 13/299 (4%)

Query: 55  IDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLG--LLTMYDQYLSRRQVSEFD 112
           +D V++SH    H GALPY  + +G   P++ T P   +   LL  Y +    R+  E +
Sbjct: 130 LDCVIISHFHLDHCGALPYMTEIVGYDGPIYMTHPTKAICPILLDDYRKITVERR-GEQN 188

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
            FT + I      V  +   Q   +  + E   +    AGH+LG  ++ I    + V+Y 
Sbjct: 189 FFTSEMIYRCMSKVKCVYVHQTVKVDDELE---LQAFYAGHVLGAAMFLIRVGSQSVLYT 245

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNV 231
            DYN   ++HL G    S   P +LIT++  A   +  ++ RE  F + I   + AGG V
Sbjct: 246 GDYNMTPDRHL-GAAWVSRCCPDILITESTYATTIRDSKRAREREFLEKIHARVEAGGKV 304

Query: 232 LLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE 291
           L+PV + GR  EL ++LE YW   +++ PIYF   ++    +Y K F+ W    I ++F 
Sbjct: 305 LIPVFALGRAQELCILLETYWERMNISVPIYFSMGMAEKANEYYKLFISWTNQKIKETF- 363

Query: 292 TSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
             + N F  KH+  L  +  +DN   GP +V A+   L AG S  IF +WA D +N+V+
Sbjct: 364 -VKRNMFEFKHIKPL-GQGIVDNP--GPMVVFATPGMLHAGQSLHIFRKWAPDERNMVV 418


>gi|387594760|gb|EIJ89784.1| cleavage and polyadenylation specificity factor 3 [Nematocida
           parisii ERTm3]
 gi|387596392|gb|EIJ94013.1| cleavage and polyadenylation specificity factor 3 [Nematocida
           parisii ERTm1]
          Length = 696

 Score =  140 bits (352), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 99/371 (26%), Positives = 171/371 (46%), Gaps = 14/371 (3%)

Query: 3   TSVQVTPLSGVFNENPLSYLVS-IDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVL 59
           T+ ++ PL G  +E   S +V+   G   + DCG +  +      P   +   + ID +L
Sbjct: 8   TAARILPL-GAGSEVGRSCVVTKFRGVTVMFDCGVHPAYTGVSSLPFFDLIDPAEIDVIL 66

Query: 60  LSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDI 119
           ++H    H GALPY  ++ G    ++ T P   +    + D        SE DLFT  ++
Sbjct: 67  VTHFHLDHAGALPYFTERSGFKGKIYMTHPTRAIFRWLLNDYVRVSNVSSENDLFTEKEL 126

Query: 120 DSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRK 179
              +  +  + Y Q   L    + I +  + AGH+LG  ++ +  +   ++Y  DY+R +
Sbjct: 127 AQCYDKIIPIDYGQEIPL----KNITIIAYNAGHVLGAAMFLVKNEDISLLYTGDYSREE 182

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAG 239
           ++HL   V+       ++    Y    +Q   ++   F   +S  ++ GG  LLPV + G
Sbjct: 183 DRHLKAAVIPPMPIDILISESTYGVQCHQSKEERETRFITGVSDVVKRGGKCLLPVFALG 242

Query: 240 RVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA 297
           R  ELLLIL+++W         PI + + ++   +   +++L  M D I    E S  N 
Sbjct: 243 RAQELLLILDEFWDSRKDLQGIPILYASALAKRFMAVYQTYLNMMNDRIQGMAEIS--NP 300

Query: 298 FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQF 357
           F  KHV  + N    ++   GP +++AS   L+ G S D+F  W  D +N  +       
Sbjct: 301 FHFKHVQSIKNIEAYEDR--GPCVMMASPGMLQNGLSRDLFEMWCGDKRNGCIIPGYCVE 358

Query: 358 GTLARMLQADP 368
           GTLA+ L  +P
Sbjct: 359 GTLAKDLLCEP 369


>gi|320583131|gb|EFW97347.1| Putative endoribonuclease [Ogataea parapolymorpha DL-1]
          Length = 702

 Score =  140 bits (352), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 94/326 (28%), Positives = 165/326 (50%), Gaps = 18/326 (5%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLS----R 105
           S +D +L+SH    H  +LPY M+       VF T P   +Y+  LL  + +  S     
Sbjct: 55  SKVDVLLISHFHLDHAASLPYVMQHTNFKGRVFMTYPTKAIYKW-LLNDFVRVTSIADDN 113

Query: 106 RQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKD 165
            + S   L+T +D++ +   +  +    +YH + + EGI    + AGH+LG  ++ +   
Sbjct: 114 DENSANFLYTDEDLNESLDRIETI----DYHSTIEVEGIRFTAYHAGHVLGAAMFFVELG 169

Query: 166 GEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKT 224
           G   ++  DY+R +++HL+   L    RP +LIT++        PR +RE      I  T
Sbjct: 170 GLKFLFTGDYSREEDRHLSSAELPP-SRPDLLITESTFGTATHVPRVEREAKLTHVIHST 228

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWM 282
           ++ GG  LLPV + GR  E+LLIL++YW  +    N PIY+ + ++   +   + ++  M
Sbjct: 229 IQQGGRCLLPVFALGRAQEILLILDEYWQNNPELQNVPIYYASDLAKKCMAVYQRYVNMM 288

Query: 283 GDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWA 342
            DSI K F  +  N F  K++  + N  ++++      +++AS   L+ G S  I  +W+
Sbjct: 289 NDSIRKKFTETNQNPFHFKYIKNITNIEKINDLDSS--VLIASPGMLQNGISRKILEKWS 346

Query: 343 SDVKNLVLFTERGQFGTLARMLQADP 368
            D +N  + T     GT+A++L  +P
Sbjct: 347 PDPRNSCILTGYSVEGTMAKILLTEP 372


>gi|336371935|gb|EGO00275.1| hypothetical protein SERLA73DRAFT_73000 [Serpula lacrymans var.
           lacrymans S7.3]
 gi|336384684|gb|EGO25832.1| hypothetical protein SERLADRAFT_437559 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 748

 Score =  139 bits (351), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 99/324 (30%), Positives = 165/324 (50%), Gaps = 14/324 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGL---SAPVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+DA+L++H    H  AL Y M++         V+ T P   +    M D Y+     S
Sbjct: 57  STVDAILITHFHLDHAAALTYIMEKTNFRDGKGKVYMTHPTKAVHKFMMQD-YVRMSTSS 115

Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
              LF+  ++  +  S+  ++  Q   L     G+   P+ AGH+LG  ++ I   G  +
Sbjct: 116 TDALFSPLEMTMSLSSIIPVSAHQ---LISPCPGVTFTPYHAGHVLGACMFLIDIAGLKI 172

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAG 228
           +Y  DY+R +++HL    +   VRP VLI ++   + +   R ++E+ F   +   +R G
Sbjct: 173 LYTGDYSREEDRHLVSAEVPP-VRPDVLIVESTYGVQSLEARDEKEVRFTSLVHSIIRRG 231

Query: 229 GNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSI 286
           G+VLLP  + GR  ELLLIL++YW +H    N  IY+ + ++   +   ++++  M  +I
Sbjct: 232 GHVLLPTFALGRAQELLLILDEYWKKHPDLHNVTIYYASSLARKCMAVYQTYIHTMNANI 291

Query: 287 TKSFETSRDNAFLLKHVTLLINKSELDNA-PDGPK-LVLASMASLEAGFSHDIFVEWASD 344
              F   RDN F+ KH++ L      +    DGP  +VLAS   L++G S ++   WA D
Sbjct: 292 RSRF-AKRDNPFVFKHISNLAQPRGWERKIADGPPCVVLASPGFLQSGPSRELLELWAPD 350

Query: 345 VKNLVLFTERGQFGTLARMLQADP 368
            +N ++ T     GTLAR +  +P
Sbjct: 351 PRNGLIVTGYSVEGTLARDIMNEP 374


>gi|296418744|ref|XP_002838985.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295634979|emb|CAZ83176.1| unnamed protein product [Tuber melanosporum]
          Length = 783

 Score =  139 bits (351), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 94/332 (28%), Positives = 164/332 (49%), Gaps = 12/332 (3%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQY-LSRRQVSEF 111
           ST+D +L+SH    H  +LPY M +      VF T P   +    + D   +     S  
Sbjct: 72  STVDVLLISHFHLDHAASLPYVMTKTNFRGRVFMTHPTKAIYKWLIQDSVRVGNVHNSPD 131

Query: 112 DLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIY 171
           +L+T  D  S++  +  +    +YH +    GI + P+ AGH+LGG ++ I   G  +++
Sbjct: 132 NLYTESDHLSSYSRIEAI----DYHTTLTHAGISITPYHAGHVLGGAMFFIEIAGLKILF 187

Query: 172 AVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGN 230
             DY+R  ++HL    +    +P +LI ++        PR ++E       ++ L  GG 
Sbjct: 188 TGDYSREDDRHLVSAEV-PHQKPDLLICESTYGTATHMPRLEKEARLMKMTTEILNRGGR 246

Query: 231 VLLPVDSAGRVLELLLILEDYWAEHSL--NYPIYFLTYVSSSTIDYVKSFLEWMGDSITK 288
           VL+PV + GR  ELLLIL++YW +H    +YPIY+ + ++   +D  ++++  M D I +
Sbjct: 247 VLMPVFALGRAQELLLILDEYWEKHPAYQSYPIYYASNLARKCMDVYRTYINTMNDKIKR 306

Query: 289 S-FETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKN 347
           + FE    N +  + V  L      ++   G  ++LAS   L+ G S ++   W  D +N
Sbjct: 307 AMFEGEGRNPWDFRWVRSLKTIDRFEDV--GGCVMLASPGMLQNGVSRELLERWCPDPRN 364

Query: 348 LVLFTERGQFGTLARMLQADPPPKAVKVTMSR 379
            ++ T     GT+A+ +  +P      VT +R
Sbjct: 365 GLVITGYSVEGTMAKQIMNEPTEIPAVVTANR 396


>gi|330842661|ref|XP_003293292.1| hypothetical protein DICPUDRAFT_158104 [Dictyostelium purpureum]
 gi|325076396|gb|EGC30185.1| hypothetical protein DICPUDRAFT_158104 [Dictyostelium purpureum]
          Length = 789

 Score =  139 bits (351), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 103/374 (27%), Positives = 187/374 (50%), Gaps = 21/374 (5%)

Query: 5   VQVTPLSGVFNENPLS-YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVAST----IDAVL 59
           +++TP+ G  NE   S  L+   G   + DCG +  +   +  P      +    ID +L
Sbjct: 31  LEITPI-GSGNEVGRSCVLLKYKGKKIMFDCGVHPAYSGLVSLPFFDSVESDIPDIDLLL 89

Query: 60  LSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD--LFTLD 117
           +SH    H  A+PY + +   S  VF T P   +  + + D ++    ++  D  LF   
Sbjct: 90  VSHFHLDHAAAVPYFVGKTKFSGRVFMTHPTKAIYGMLLAD-FVKVTTITRDDDMLFDEK 148

Query: 118 DIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNR 177
           D++S+ + + ++ Y Q      +  GI V    AGH+LG  ++ +   G  ++Y  D++R
Sbjct: 149 DLNSSLEKIEKVRYRQKV----EHNGIKVTCFNAGHVLGAAMFMVEIAGVKILYTGDFSR 204

Query: 178 RKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVD 236
           ++++HL G      V+  VLI ++   +    PR +RE  F  ++   +  GG  L+PV 
Sbjct: 205 QEDRHLMGAETPP-VKVDVLIIESTYGVQVHEPRLEREKRFTTSVHDVVSRGGRCLIPVF 263

Query: 237 SAGRVLELLLILEDYW-AEHSLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSR 294
           + GR  ELLLIL++YW A  SL+  PIY+ + ++   +   ++++  M D +   F+ S 
Sbjct: 264 ALGRAQELLLILDEYWIANPSLHGIPIYYASALAKKCMGVYRTYINMMNDRVRAQFDVS- 322

Query: 295 DNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
            N F  K+++ +      D+  +GP + +AS   L++G S  +F  W +  +N V+    
Sbjct: 323 -NPFEFKYISNIKGIESFDD--NGPCVFMASPGMLQSGLSRQLFERWCTSKRNGVVIPGY 379

Query: 355 GQFGTLARMLQADP 368
              GTLA+ + ++P
Sbjct: 380 SVEGTLAKHIMSEP 393


>gi|328773999|gb|EGF84036.1| hypothetical protein BATDEDRAFT_9083 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 669

 Score =  139 bits (351), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 106/382 (27%), Positives = 183/382 (47%), Gaps = 38/382 (9%)

Query: 5   VQVTPLSGVFNENPLS-YLVSIDGFNFLIDCGWN------------DHFDPSLLQPLSKV 51
           +++TPL G  NE   S  L+   G   ++DCG +            D+ DP         
Sbjct: 57  LKITPL-GAGNEVGRSCILLEFKGKTIMLDCGLHPAHSGLAALPFFDNIDPE-------- 107

Query: 52  ASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEF 111
             ++D VL++H    H   LPY M++      VF T P   +    + D Y+    +S  
Sbjct: 108 --SVDLVLITHFHVDHAAGLPYFMEKTAFKGRVFMTHPTRAIYKWLVSD-YIKISSLSPD 164

Query: 112 D-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVI 170
           D L++  D+ +++  +  + Y Q   L G    I   P+ AGH+LG  ++ +   G  ++
Sbjct: 165 DQLYSDKDLANSYGRIEVIDYHQEVDLGG----IKFTPYYAGHVLGAAMFLLEIAGVRLL 220

Query: 171 YAVDYNRRKEKHLNGTVLE-SFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAG 228
           Y  DY+R +++HL       S + P VLI ++   +    PR  RE  F   +   ++ G
Sbjct: 221 YTGDYSREEDRHLMAAERPPSSIIPEVLICESTFGVQTLEPRLDREQRFTRMVHTIVKRG 280

Query: 229 GNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSI 286
           G  LLPV + GR  ELLLIL++YW  H+   + PIY+ + ++   +   +++   M   I
Sbjct: 281 GRCLLPVFALGRAQELLLILDEYWHAHADLHSVPIYYASAIAKKCMAVYQTYTNMMNGRI 340

Query: 287 TKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVK 346
            +  + S  N F  KH++ L + ++ D+   GP +++AS   L++G S ++   W  D +
Sbjct: 341 REMAKIS--NPFQFKHISNLKSIAQFDDV--GPCVMMASPGMLQSGLSRELLELWCVDKR 396

Query: 347 NLVLFTERGQFGTLARMLQADP 368
           N V+       GTL + + + P
Sbjct: 397 NGVIIPGYVVEGTLGKQILSQP 418


>gi|380494427|emb|CCF33158.1| endoribonuclease YSH1 [Colletotrichum higginsianum]
          Length = 846

 Score =  139 bits (351), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 102/379 (26%), Positives = 181/379 (47%), Gaps = 28/379 (7%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           +++   G   ++D G +  +D     P       ST+D +L+SH    H  +LPY + + 
Sbjct: 41  HIIQYKGKTVMLDAGQHPAYDGLAGLPFFDDFDLSTVDVLLISHFHVDHAASLPYVLSKT 100

Query: 79  GLSAPVFSTEPVYRLGLLTMYDQYL---SRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNY 135
                VF T P   +    + D      +    +   ++T  D  + F  +  + Y   +
Sbjct: 101 NFRGRVFMTHPTKAIYKWLIQDSVRVGNTSSNPTSQPVYTEADHLNTFPQIEAIDYHTTH 160

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
            +S     I + P+ AGH+LG  ++ I   G  + +  DY+R +++HL    +   V+  
Sbjct: 161 TISS----IRITPYPAGHVLGAAMFLIEIAGLKIFFTGDYSREQDRHLVSAEVPKGVKID 216

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT++   + +  PR +RE     +I+  L  GG VL+PV + GR  ELLLIL++YW +
Sbjct: 217 VLITESTYGIASHVPRLEREQALMKSITSILNRGGRVLMPVFALGRAQELLLILDEYWGK 276

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF-------ETSRDNA-----FLL 300
           HS    +PIY+ + ++   +   ++++  M D+I + F       E S D +     +  
Sbjct: 277 HSEFQKFPIYYASNLARKCMVVYQTYVGAMNDNIKRLFRERMAEAEASGDGSGKGGPWDF 336

Query: 301 KHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTL 360
           K++  L N    D+   G  ++LAS   L+ G S ++   WA + KN V+ T     GT+
Sbjct: 337 KYIRSLKNLDRFDDV--GGCVMLASPGMLQNGVSRELLERWAPNDKNGVIITGYSVEGTM 394

Query: 361 ARMLQADPPPKAVKVTMSR 379
           A+ +  +  P  ++  MSR
Sbjct: 395 AKQIMQE--PDQIQAVMSR 411


>gi|346323812|gb|EGX93410.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
           putative [Cordyceps militaris CM01]
          Length = 879

 Score =  139 bits (351), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 105/390 (26%), Positives = 183/390 (46%), Gaps = 38/390 (9%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLS---------HPDTLHLG 69
           +++   G   ++D G +  +D     P       ST+D +L+S         H    H  
Sbjct: 41  HIIQYKGKTVMLDAGQHPAYDGLAALPFYDDFDLSTVDVLLISQSELRYPMRHFHIDHAA 100

Query: 70  ALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQY----LSRRQVSEFDLFTLDDIDSAFQS 125
           +LPY + +      VF T P   +    + D       S  Q ++  L+T  D  + F  
Sbjct: 101 SLPYVLAKTNFRGRVFMTHPTKAIYKWLIQDSVRVGNTSANQTTQ-PLYTEQDHLNTFPQ 159

Query: 126 VTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNG 185
           +  + Y   + +S     I + P+ AGH+LG  ++ I   G ++ +  DY+R +++HL  
Sbjct: 160 IEAIDYHTTHTISS----IRITPYPAGHVLGAAMFLIEIAGLNIFFTGDYSREQDRHLVS 215

Query: 186 TVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLEL 244
             +   V+  VLIT++   + +  PR +RE     +I+  L  GG  LLPV + GR  EL
Sbjct: 216 AEVPKGVKIDVLITESTYGIASHVPRLEREQALMKSITNILNRGGRALLPVFALGRAQEL 275

Query: 245 LLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA----- 297
           LLIL++YW +H+    YPIY+ + ++   +   ++++  M D+I + F      A     
Sbjct: 276 LLILDEYWGKHAEFQKYPIYYASNLAKKCMLIYQTYVGAMNDNIKRLFRERMAEAETSGG 335

Query: 298 ------FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
                 +  K++  L N    D+   G  ++LAS   L+ G S ++F  WA + KN V+ 
Sbjct: 336 AGAGGPWDFKYIRSLKNLDRFDDV--GGCVMLASPGMLQNGVSRELFERWAPNDKNGVII 393

Query: 352 TERGQFGTLARMLQADPPPKAVKVTMSRRV 381
           T     GT+AR +  +  P+ ++  MSR +
Sbjct: 394 TGYSVEGTMARQIMKE--PEQIQAVMSRSI 421


>gi|171689890|ref|XP_001909884.1| hypothetical protein [Podospora anserina S mat+]
 gi|170944907|emb|CAP71018.1| unnamed protein product [Podospora anserina S mat+]
          Length = 835

 Score =  139 bits (350), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 104/380 (27%), Positives = 183/380 (48%), Gaps = 30/380 (7%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           +++   G   ++D G +  +D     P       ST+D +L+SH    H  +LPY + + 
Sbjct: 42  HIIQYKGKTVMLDAGQHPAYDGLAALPFFDDFDLSTVDVLLISHFHIDHAASLPYVLAKT 101

Query: 79  GLSAPVFSTEPVYRLGLLTMYDQY----LSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQN 134
                VF T P   +    + D       S    ++  ++T  D  + F  +  + Y   
Sbjct: 102 NFRGRVFMTHPTKAIYKWLIQDSVRVGNTSSNPTTQL-VYTEQDHLNTFPQIEAIDYHTT 160

Query: 135 YHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRP 194
           + +SG    I V P+ AGH+LG  ++ I   G ++ +  DY+R +++HL    +   V+ 
Sbjct: 161 HTISG----IRVTPYPAGHVLGAAMFLIEIAGLNIFFTGDYSREQDRHLVSAQVPRGVKI 216

Query: 195 AVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 253
            VLIT++   + +  PR +RE     +I+  L  GG VL+PV + GR  ELLLIL++YW 
Sbjct: 217 DVLITESTYGIASHVPRLEREQALMKSITGILNRGGRVLMPVFALGRAQELLLILDEYWG 276

Query: 254 EHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF-------ETSRDNA-----FL 299
           +H+    YPIY+ + ++   +   ++++  M D+I + F       E S D A     + 
Sbjct: 277 KHAEYQKYPIYYASNLARKCMLVYQTYVGAMNDNIKRLFRERMAEAEASGDGAGKGGPWD 336

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
            K +  L +    ++   G  ++LAS   L+ G S ++   WA   KN V+ T     GT
Sbjct: 337 FKFIRSLKSIDRFEDV--GGCVMLASPGMLQNGVSRELLERWAPSEKNGVIITGYSVEGT 394

Query: 360 LARMLQADPPPKAVKVTMSR 379
           +A+ +  +  P+ ++  MSR
Sbjct: 395 MAKQIMQE--PEHIQAVMSR 412


>gi|209875817|ref|XP_002139351.1| RNA-metabolising metallo-beta-lactamase family protein
           [Cryptosporidium muris RN66]
 gi|209554957|gb|EEA05002.1| RNA-metabolising metallo-beta-lactamase family protein
           [Cryptosporidium muris RN66]
          Length = 797

 Score =  139 bits (350), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 112/393 (28%), Positives = 186/393 (47%), Gaps = 49/393 (12%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGW-----NDHFDP------SLLQPLSKVAS 53
           + VTPL    +      LV I     ++DCG      +D   P      S L P+  + S
Sbjct: 3   ITVTPLGAGQDVGRSCILVRIYEKVVMLDCGMHMGYKDDRRYPDFTLISSSLDPVV-INS 61

Query: 54  TIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQ---VSE 110
            +D V++SH    H GALPY  +++G S P+  T P   +  + + D      Q   +S+
Sbjct: 62  LVDVVVISHYHLDHCGALPYFTEKIGYSGPIIMTYPTKAVSPILLADCCKVMEQKNILSK 121

Query: 111 F---------DL--------FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGH 153
           F         D+        F++ D+    + VT +   Q   ++G    I + P+ AGH
Sbjct: 122 FGSDINTESTDILKPVDPQHFSVGDVWKCMEKVTAIQLHQTISVNG----INITPYYAGH 177

Query: 154 LLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQ 213
           +LG +++ +    E ++Y  DYN  +++HL    ++    P VL++++  A + +P R+ 
Sbjct: 178 VLGASMFHVEVGNESIVYTGDYNMVRDRHLGPASIKKLF-PDVLLSESTYATYIRPSRRS 236

Query: 214 RE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTI 272
            E +F + + + L  GG VL+PV + GR  EL ++LE +W    L YPIYF   ++  + 
Sbjct: 237 TERIFCEMVLQCLEKGGKVLIPVFAVGRAQELCILLEFFWRRMQLRYPIYFGGAMTEKSS 296

Query: 273 DYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAG 332
            Y + +  W   +++       D+ F   HV L  ++S L N   GP ++ A+   L AG
Sbjct: 297 LYYQLYTNWTNTALS-------DDLFSFPHV-LPYDRSVLTNT--GPAVLFATPGMLHAG 346

Query: 333 FSHDIFVEWASDVKNLVLFTERGQFGTL-ARML 364
            S   F  WA D  NL +       GTL AR++
Sbjct: 347 LSLQAFKCWAPDPNNLTIIPGFCVAGTLGARII 379


>gi|315054255|ref|XP_003176502.1| cleavage and polyadenylation specificity factor subunit 2
           [Arthroderma gypseum CBS 118893]
 gi|311338348|gb|EFQ97550.1| cleavage and polyadenylation specificity factor subunit 2
           [Arthroderma gypseum CBS 118893]
          Length = 1024

 Score =  139 bits (350), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 114/432 (26%), Positives = 177/432 (40%), Gaps = 105/432 (24%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G   L+D GW++ FD S L+ L +   T+  +LL+H    HLGA  +  +   L    P+
Sbjct: 27  GVKILVDVGWDESFDTSALKELERHIPTLSLILLTHATPSHLGAFVHCCRTYPLFTQIPI 86

Query: 85  FSTEPVYRLGLLTMYDQYLSRRQVSEF---DLFTLDDIDSAFQSVTRLTYSQN---YHLS 138
           ++T PV   G   + + Y S    + F      T  D  S     +  T S+    Y  +
Sbjct: 87  YATIPVIAFGRTYLQNLYASAPLAATFLPSTSVTASDPSSGLTIQSATTASEGPSGYENT 146

Query: 139 GKGE---------------------------------------GIVVAPHVAGHLLGGTV 159
           G G                                        G+ +  + AGH +GGT+
Sbjct: 147 GSGRILLPPPSNEDIARYFSLIHPLKYSQPLQPLPSPFSPPLNGLTITAYNAGHTVGGTI 206

Query: 160 WKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAYNALHN 207
           W I    E ++YAVD+++ +E  + G             V+E   +P  LI  A      
Sbjct: 207 WHIQHGMESIVYAVDWSQARENVIAGAAWFGSSIGSGTEVIEQLRKPTALICSASGGDKF 266

Query: 208 QPP--RQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS-------- 256
             P  R++R+ +  D I  ++  GG VLLP DS+ RVLE+  +LE  W E +        
Sbjct: 267 ALPGGRKKRDGLLLDMIRSSVAKGGTVLLPTDSSARVLEIAYVLEHAWREAADSGDPNDP 326

Query: 257 -LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE------------------------ 291
             N P+Y     +  T+   +S LEWM ++I + FE                        
Sbjct: 327 LKNAPLYLAGKKAHGTMRLARSMLEWMDENIVREFEGNDGVEVTAGKAAGGAANQSSKGA 386

Query: 292 TSRDNA--------FLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHDIFVEW 341
            S+ +A        F  KH+ L+ +K++LD      GPK++L+   SLE G S  I    
Sbjct: 387 QSQKSATGQKSLGPFTFKHLNLVEHKAKLDGILESKGPKVILSPDTSLEWGLSKHILKHI 446

Query: 342 ASDVKNLVLFTE 353
           A   +NL++ TE
Sbjct: 447 AEGSENLIIMTE 458


>gi|409044817|gb|EKM54298.1| hypothetical protein PHACADRAFT_146128 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 869

 Score =  139 bits (350), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 92/324 (28%), Positives = 167/324 (51%), Gaps = 14/324 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGL---SAPVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+D +L++H    H  AL Y  ++         ++ T P   L    M D ++     S
Sbjct: 58  STVDVILITHFHLDHAAALTYITEKTNFRDGKGKIYMTHPTKALHKFMMQD-FVRMGSSS 116

Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
              LF+  ++  +  S+  ++  Q   +     G+   P+ AGH+LG  ++ I   G  +
Sbjct: 117 SDALFSPMELSVSLASIIPVSAHQ---VISPCPGVTFTPYHAGHVLGACMFLIDIAGLKI 173

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAG 228
           +Y  DY+R +++HL    +   +RP VLI ++   +     R+++E+ F + +   +R G
Sbjct: 174 LYTGDYSREEDRHLVKAEVPP-IRPDVLIVESTFGVQTLEGREEKELRFTNLVHNIIRRG 232

Query: 229 GNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSI 286
           G+VLLP  + GR  ELLLIL++YW +H    N P+Y+ + ++   +   ++++  M  ++
Sbjct: 233 GHVLLPTFALGRAQELLLILDEYWKKHPDLHNVPVYYASSLARKCMAVYQTYIHTMNSNV 292

Query: 287 TKSFETSRDNAFLLKHVTLLINKSELDNA-PDGPK-LVLASMASLEAGFSHDIFVEWASD 344
              F   RDN F+ KH++ + +    +    +GP  +VLAS   +E+G S ++   WA D
Sbjct: 293 RSRF-AKRDNPFVFKHISNVPHSRGWERKIAEGPSCVVLASPGFMESGPSRELLELWAPD 351

Query: 345 VKNLVLFTERGQFGTLARMLQADP 368
            +N V+ T     GT+AR +Q +P
Sbjct: 352 SRNGVILTGYSIEGTMARDIQTEP 375


>gi|403419016|emb|CCM05716.1| predicted protein [Fibroporia radiculosa]
          Length = 828

 Score =  139 bits (350), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 95/324 (29%), Positives = 163/324 (50%), Gaps = 14/324 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGL---SAPVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+D +L++H    H  AL Y  ++         V+ T P   L    M D ++     +
Sbjct: 57  STVDVLLITHFHLDHAAALTYITEKTNFRDGKGKVYMTHPTKALHKFMMQD-FMRMSSST 115

Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
              LF+  D+  +  S+  ++  Q   +     G+   P+ AGH+LG  ++ I   G  +
Sbjct: 116 SDALFSPLDLSMSLSSIIPVSAHQ---VITPCPGVTFTPYHAGHVLGACMFLIDIAGLKI 172

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAG 228
           +Y  DY+R ++ HL    +  F RP VLI ++   +     R+ +E  F + +   +R G
Sbjct: 173 LYTGDYSREEDCHLVKAEVPPF-RPDVLIIESTYGVQTLECREDKEQRFTNLVHSIIRRG 231

Query: 229 GNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSI 286
           G+VLLP  + GR  ELLLIL++YW +H    N PIY+ + ++   +   ++++  M  ++
Sbjct: 232 GHVLLPTFALGRAQELLLILDEYWKKHPDLHNVPIYYASSLARKCMAVYQTYIHTMNANV 291

Query: 287 TKSFETSRDNAFLLKHVTLLINKSELDN-APDGPK-LVLASMASLEAGFSHDIFVEWASD 344
              F   RDN F+ KH++ L +    +    DGP  +VLAS   +  G S ++   WA D
Sbjct: 292 RSRF-AKRDNPFVFKHISNLPHTRGWERKVADGPPCVVLASPGFVTVGASRELLEMWAPD 350

Query: 345 VKNLVLFTERGQFGTLARMLQADP 368
            +N ++ T     GT+AR +Q++P
Sbjct: 351 SRNGIIITGYSIEGTMARDIQSEP 374


>gi|294945374|ref|XP_002784648.1| cleavage and polyadenylation specificity factor, putative
           [Perkinsus marinus ATCC 50983]
 gi|239897833|gb|EER16444.1| cleavage and polyadenylation specificity factor, putative
           [Perkinsus marinus ATCC 50983]
          Length = 1115

 Score =  139 bits (350), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 137/452 (30%), Positives = 196/452 (43%), Gaps = 101/452 (22%)

Query: 2   GTSVQVTPLSGVFNENPLSYL-----VSIDGFNFLIDCGWNDHFDPSLLQPL-------- 48
           G SV++ P+S   ++  ++ L     V+    + L+DCGW +  DP +L PL        
Sbjct: 12  GVSVEILPISKDTSQYQMAVLKLTDDVTNTSCSVLLDCGWTEEMDPDMLGPLVAEQQPSG 71

Query: 49  SKVASTIDAVLLSHPDTLHLGALPYAM------------------------------KQL 78
           +++   ID  LLS  D  H GA PY                                 Q 
Sbjct: 72  ARLVDQIDVCLLSFADLQHCGAWPYVYCHLRPKKLQYAVAPPPVGEADAAASSSKNSNQP 131

Query: 79  GLSAPVFSTEPVYRLGLLTM------YDQYLSRRQVSEFDLFTLDDIDSAFQ-SVTRLTY 131
              A V +TEPV RLG LT+       D+       +   L T+DD   AF  +VT L Y
Sbjct: 132 SNGAMVLATEPVRRLGELTLTALHEDIDKMRDAVTTTNDWLLTIDDTIMAFNGAVTPLQY 191

Query: 132 SQNYHLS--------GKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHL 183
            +    +         KG  +   P  AG +LGG  W+I    + ++YAVDY    ++HL
Sbjct: 192 GEGVMFTMRGDAGANAKGPTVRFTPLPAGRMLGGAYWRIDVGSQSMVYAVDYQMAGDRHL 251

Query: 184 NGTVL--ESFVRPAVLITD---------------------------AYNA-----LHNQP 209
           NG  L       P+VLIT+                            Y+A       N+ 
Sbjct: 252 NGMELPPPEQAPPSVLITNTMPPAVEGAVTCAGQGATSNVATESRRTYDAGITASRSNRR 311

Query: 210 PRQQREMFQDAISKTLRAGGNVLLPVD--SAGRVLELLLILEDYWAEHS--LNYPIYFLT 265
             Q  E     + ++LR  G VLLPVD  S GRVLELLL+LE  WA  +    YP+ +++
Sbjct: 312 YAQAEEALLGMVLRSLRKDGTVLLPVDCCSTGRVLELLLLLEAAWAADAGLQVYPVVYVS 371

Query: 266 YVSSSTIDYVKSFLEWMGDSITKSFETSRD---NAFLLKHVTLLINKSEL-DNAP-DGPK 320
            +    +D +K  +EWM   +   F+TS     + FL +HV L  +  +   N P   PK
Sbjct: 372 PLGDVVLDQIKIRMEWMSRVVHNDFDTSMGFMYHPFLFQHVQLCSSFQDFAQNYPARKPK 431

Query: 321 LVLASMASLEAGFSHDIFVEWASDVKNLVLFT 352
           +VLAS ASLE G + +IF     D  + V+FT
Sbjct: 432 VVLASSASLEIGDAREIFCRMCGDPNSTVIFT 463


>gi|398406895|ref|XP_003854913.1| hypothetical protein MYCGRDRAFT_55193, partial [Zymoseptoria
           tritici IPO323]
 gi|339474797|gb|EGP89889.1| hypothetical protein MYCGRDRAFT_55193 [Zymoseptoria tritici IPO323]
          Length = 855

 Score =  139 bits (350), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 94/337 (27%), Positives = 162/337 (48%), Gaps = 25/337 (7%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGL---LTMYDQYLSRR 106
           ST+D +L++H    H  +LPY + +   +  V+ T P   +Y+      + +++ +    
Sbjct: 76  STVDLLLITHFHQDHSASLPYVLAKTNFAGRVYMTHPTKAIYKWTTQDAVRVHNTHTPAS 135

Query: 107 QVSEFD-----LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWK 161
             S  D     L+T  DI S    +  +++    H +    GI   P+ AGH+LG  ++ 
Sbjct: 136 STSGTDGYVSQLYTEQDILSTLPMIQTISF----HTTHSHNGIRFTPYPAGHVLGACMYL 191

Query: 162 ITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDA 220
           I   G ++++  DY+R  ++HL    +   V+   LIT++   +  + PRQ+RE     +
Sbjct: 192 IEIAGLNILFTGDYSRETDRHLIPAAVPRNVKIDCLITESTFGISTRTPRQERENALIKS 251

Query: 221 ISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSF 278
           I+  L  GG VL+P  + G   EL+LILEDYW  H     +P+Y+ + ++   +   +++
Sbjct: 252 ITGILNRGGRVLMPTTAVGNTQELMLILEDYWQRHEEYRRFPMYYASGLAKKVMIVYQTY 311

Query: 279 LEWMGDSITKSFETSRDNAFLLKH------VTLLINKSELDNAPD-GPKLVLASMASLEA 331
           +E M D+I   F+ S   A              +     +D   D GP +VLAS   L+ 
Sbjct: 312 VETMNDTIKAKFQASAAAASDSSGAGGPWDFNFIRQLKSMDRYEDVGPSVVLASPGMLQN 371

Query: 332 GFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           G S  +   WA D KN V+ T     GT+A+ +  +P
Sbjct: 372 GPSRTLLERWAPDAKNGVIITGYSVEGTMAKTIMTEP 408


>gi|72387720|ref|XP_844284.1| cleavage and polyadenylation specificity factor subunit
           [Trypanosoma brucei brucei strain 927/4 GUTat10.1]
 gi|62359436|gb|AAX79873.1| cleavage and polyadenylation specificity factor subunit, putative
           [Trypanosoma brucei]
 gi|70800817|gb|AAZ10725.1| cleavage and polyadenylation specificity factor subunit, putative
           [Trypanosoma brucei brucei strain 927/4 GUTat10.1]
          Length = 770

 Score =  139 bits (350), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 103/371 (27%), Positives = 178/371 (47%), Gaps = 19/371 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPL----SKVASTIDAVLL 60
           V++ P+           +V   G + ++DCG  +H   S L  L    S     ID VL+
Sbjct: 39  VEILPIGSGGEVGRSCVVVRYKGRSVMLDCG--NHPAKSGLDSLPFFDSIRCDEIDLVLI 96

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           +H    H GALPY  +Q      +F T        + M D    R   S  D+   + + 
Sbjct: 97  THFHLDHCGALPYFCEQTSFRGRIFMTSATKAFYKMVMND--FLRIGASAEDIVNNEWLQ 154

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           S  + +  + Y +   ++G    I   P  AGH+LG  ++ +   G  ++Y  D++R  +
Sbjct: 155 STIEKIETVEYHEEVTVNG----IHFQPFNAGHVLGAALFMVDIAGMKLLYTGDFSRVPD 210

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAG 239
           +HL G  +  +  P +LI ++ N +     R++RE  F   +   ++ GG  L+PV + G
Sbjct: 211 RHLLGAEVPPY-SPDILIAESTNGIRELESREERESLFTTWVHDVVKGGGRCLVPVFALG 269

Query: 240 RVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA 297
           R  ELLLILE+YW  H    + PIY+ + ++   +   ++F+  M D + K  E  R N 
Sbjct: 270 RAQELLLILEEYWEAHKELQHIPIYYASSLAQRCMKLYQTFVSAMNDRVKKQHENHR-NP 328

Query: 298 FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQF 357
           F+ K++  L++    ++   GP +VLAS   L++G S ++F  W  D +N ++       
Sbjct: 329 FVFKYIQSLLDTRSFEDT--GPCVVLASPGMLQSGISLELFERWCGDKRNGIIVAGYCVD 386

Query: 358 GTLARMLQADP 368
           GT+A+ + + P
Sbjct: 387 GTIAKDILSKP 397


>gi|330923041|ref|XP_003300074.1| hypothetical protein PTT_11224 [Pyrenophora teres f. teres 0-1]
 gi|311325959|gb|EFQ91831.1| hypothetical protein PTT_11224 [Pyrenophora teres f. teres 0-1]
          Length = 705

 Score =  139 bits (350), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 99/370 (26%), Positives = 179/370 (48%), Gaps = 33/370 (8%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQY----LSRRQV 108
           ST+D +L+SH    H  +LPY + +      VF T P   +    + D      +S    
Sbjct: 74  STVDVLLISHFHVDHAASLPYVLAKTNFKGRVFMTHPTKAIYKWLIQDSVRVGNMSSNSE 133

Query: 109 SEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
           ++  ++T  D  + +  +  + +   + +SG    + + P+ AGH+LG  ++ +   G  
Sbjct: 134 TKIQMYTEADHLNTYPMIESIDFYTTHTVSG----VRITPYPAGHVLGAAMFLMEIAGLK 189

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRA 227
           +++  DY+R  ++HL    + + V+  VLIT++   +    PR +RE     AI+  L  
Sbjct: 190 ILFTGDYSREDDRHLVSASVPAGVKVDVLITESTFGISMHTPRVEREAQLMKAITDVLNR 249

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG  LLPV + GR  ELLLIL++YW++H      PIY+ + ++   +   ++++  M D+
Sbjct: 250 GGRALLPVFALGRAQELLLILDEYWSKHPEVQKIPIYYNSSLARKCMQVYQTYVSAMNDN 309

Query: 286 ITKSF-----------ETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFS 334
           I + F           +T R  A+  K V  L +    D+   G  ++LAS   +++G S
Sbjct: 310 IKRLFAERMAEAEAAGDTGRRGAWDFKFVRSLKSLERFDDL--GGCVMLASPGMMQSGTS 367

Query: 335 HDIFVEWASDVKNLVLFTERGQFGTLARMLQADP---PPKAVKVTMSRRVPLVGEELIAY 391
            ++   WA D +N V+ T     GT+A+ +  +P   P    + + + R P  G+     
Sbjct: 368 RELLERWAPDPRNGVIITGYSVEGTMAKQIVHEPDQIPAIMTRASNTARRP--GQR---- 421

Query: 392 EEEQTRLKKE 401
           E EQT + + 
Sbjct: 422 ENEQTMIPRR 431


>gi|261327437|emb|CBH10412.1| cleavage and polyadenylation specificity factor subunit, putative
           [Trypanosoma brucei gambiense DAL972]
          Length = 770

 Score =  139 bits (350), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 103/371 (27%), Positives = 178/371 (47%), Gaps = 19/371 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPL----SKVASTIDAVLL 60
           V++ P+           +V   G + ++DCG  +H   S L  L    S     ID VL+
Sbjct: 39  VEILPIGSGGEVGRSCVVVRYKGRSVMLDCG--NHPAKSGLDSLPFFDSIRCDEIDLVLI 96

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           +H    H GALPY  +Q      +F T        + M D    R   S  D+   + + 
Sbjct: 97  THFHLDHCGALPYFCEQTSFRGRIFMTSATKAFYKMVMND--FLRIGASAEDIVNNEWLQ 154

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           S  + +  + Y +   ++G    I   P  AGH+LG  ++ +   G  ++Y  D++R  +
Sbjct: 155 STIEKIETVEYHEEVTVNG----IHFQPFNAGHVLGAALFMVDIAGMKLLYTGDFSRVPD 210

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAG 239
           +HL G  +  +  P +LI ++ N +     R++RE  F   +   ++ GG  L+PV + G
Sbjct: 211 RHLLGAEVPPY-SPDILIAESTNGIRELESREERESLFTTWVHDVVKGGGRCLVPVFALG 269

Query: 240 RVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA 297
           R  ELLLILE+YW  H    + PIY+ + ++   +   ++F+  M D + K  E  R N 
Sbjct: 270 RAQELLLILEEYWEAHKELQHIPIYYASSLAQRCMKLYQTFVSAMNDRVKKQHENHR-NP 328

Query: 298 FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQF 357
           F+ K++  L++    ++   GP +VLAS   L++G S ++F  W  D +N ++       
Sbjct: 329 FVFKYIQSLLDTRSFEDT--GPCVVLASPGMLQSGISLELFERWCGDKRNGIIVAGYCVD 386

Query: 358 GTLARMLQADP 368
           GT+A+ + + P
Sbjct: 387 GTIAKDILSKP 397


>gi|296815164|ref|XP_002847919.1| cleavage and polyadenylation specificity factor subunit 2
           [Arthroderma otae CBS 113480]
 gi|238840944|gb|EEQ30606.1| cleavage and polyadenylation specificity factor subunit 2
           [Arthroderma otae CBS 113480]
          Length = 1000

 Score =  139 bits (350), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 113/407 (27%), Positives = 175/407 (42%), Gaps = 80/407 (19%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA------MKQLGL 80
           G   L+D GW++ FD S L+ L +   T+  +LL+H    HLGA  +       ++ L  
Sbjct: 27  GVKILVDVGWDESFDTSALKELERHIPTLSLILLTHATPSHLGAFVHCSFGRTYLQNLYA 86

Query: 81  SAPVFST------------EPVYRLGLLTMYDQYLSRRQVSEFDLFTL-----DDIDSAF 123
           SAP+ +T                 +   T   Q LS    +      L     +DI   F
Sbjct: 87  SAPLAATFLPSTSVTASDGSSGLAIPSTTPTSQGLSGPDNTGSGRILLPPPSNEDIARYF 146

Query: 124 QSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRR 178
             +  L YSQ              G+ +  + AGH +GGT+W I    E ++YAVD+++ 
Sbjct: 147 SLIHPLKYSQPLQPLPSPFSPPLNGLTITAYNAGHTVGGTIWHIQHGMESIVYAVDWSQA 206

Query: 179 KEKHLNGT------------VLESFVRPAVLITDAYNALHNQPP--RQQRE-MFQDAISK 223
           +E  + G             V+E   +P  LI  A        P  R++R+ +  D I  
Sbjct: 207 RENVIAGAAWFGSSGGSGTEVIEQLRKPTALICSASGGDKFALPGGRKKRDGLLLDMIRS 266

Query: 224 TLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS---------LNYPIYFLTYVSSSTIDY 274
            +  GG VLLP DS+ RVLE+  +LE  W E +          N P+Y     +  T+  
Sbjct: 267 CVAKGGTVLLPTDSSARVLEIAYVLEHAWREAADSGDSNEVLKNAPLYLAGKKAHGTMRL 326

Query: 275 VKSFLEWMGDSITKSFETSRD--------------------------NAFLLKHVTLLIN 308
            +S LEWM ++I + FE +                              F  KH+ L+ +
Sbjct: 327 ARSMLEWMDENIVREFEGNDGVEVGAGKSGGGAANQPSKSAQGQKSLGPFTFKHLNLVEH 386

Query: 309 KSELDNAPD--GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
           K++LD+  D  GPK++L+  ASLE G S  +  + A+   NL++ TE
Sbjct: 387 KAKLDSILDSKGPKVILSPDASLEWGLSRHVLRQIAAGSDNLIIMTE 433


>gi|324506922|gb|ADY42942.1| Cleavage and polyadenylation specificity factor subunit 3 [Ascaris
           suum]
          Length = 706

 Score =  139 bits (349), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 104/390 (26%), Positives = 189/390 (48%), Gaps = 34/390 (8%)

Query: 4   SVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVAST--IDAVLLS 61
           S+  TPL          + ++  G   L+DCG +         P         +D +L++
Sbjct: 21  SLTFTPLGSGQEVGRSCHYLTFKGKKILLDCGIHPGMSGVDALPFVDFVDCEELDLLLVT 80

Query: 62  HPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD------ 112
           H    H GA+P+ +++       F   +T+ +YR+    +   YL   +VS++       
Sbjct: 81  HFHLDHCGAVPWLLEKTAFRGRCFMTHATKAIYRM----LIGDYL---KVSKYGGGSDNR 133

Query: 113 -LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIY 171
            L+T +D++ + + +  +    ++H   +  GI    +VAGH+LG  ++ I   G  V+Y
Sbjct: 134 LLYTEEDLEKSMEKIEVI----DFHEQKEVNGIKFWCYVAGHVLGACMFMIEIAGVRVLY 189

Query: 172 AVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGN 230
             D++R +++HL    L + V P VLI ++         R++RE  F   + + +  GG 
Sbjct: 190 TGDFSRLEDRHLCAAELPT-VSPDVLICESTYGTQVHEGREEREKRFTSTVHEIVGRGGR 248

Query: 231 VLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITK 288
            L+P  + GR  ELLLIL++YW  H    + P+Y+ + ++   +   ++F+  M   I K
Sbjct: 249 CLIPAFALGRAQELLLILDEYWEAHPELQDIPVYYASSLAKKCMAVYQTFVSGMNSRIQK 308

Query: 289 SFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNL 348
             + + +N F+ +HV+ L +    ++   GP +VLAS   L+ G S ++F  W +D KN 
Sbjct: 309 --QIALNNPFVFRHVSNLKSIEHFEDV--GPCVVLASPGMLQNGLSRELFENWCTDSKNG 364

Query: 349 VLFTERGQFGTLARMLQADPPPKAVKVTMS 378
            +       GTLA+ + ++P      VTMS
Sbjct: 365 CIIAGYCVEGTLAKHILSEPEE---IVTMS 391


>gi|310796189|gb|EFQ31650.1| metallo-beta-lactamase superfamily protein [Glomerella graminicola
           M1.001]
          Length = 855

 Score =  139 bits (349), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 101/379 (26%), Positives = 181/379 (47%), Gaps = 28/379 (7%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           +++   G   ++D G +  +D     P       ST+D +L+SH    H  +LPY + + 
Sbjct: 41  HIIQYKGKTVMLDAGQHPAYDGLAALPFFDDFDLSTVDVLLISHFHVDHAASLPYVLSKT 100

Query: 79  GLSAPVFSTEPVYRLGLLTMYDQYL---SRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNY 135
                VF T P   +    + D      +    +   ++T  D  + F  +  + Y   +
Sbjct: 101 NFRGRVFMTHPTKAIYKWLIQDSVRVGNTSSNPTSQPVYTEADHLNTFPQIEAIDYHTTH 160

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
            +S     I + P+ AGH+LG  ++ I   G  + +  DY+R +++HL    +   V+  
Sbjct: 161 TISS----IRITPYPAGHVLGAAMFLIEIAGLKIFFTGDYSREQDRHLVSAEVPKGVKID 216

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT++   + +  PR +RE     +I+  L  GG VL+PV + GR  ELLLIL++YW +
Sbjct: 217 VLITESTYGIASHVPRLEREQALMKSITSILNRGGRVLMPVFALGRAQELLLILDEYWGK 276

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF-------ETSRDNA-----FLL 300
           H+    +PIY+ + ++   +   ++++  M D+I + F       E S D +     +  
Sbjct: 277 HAEFQKFPIYYASNLARKCMVVYQTYVGAMNDNIKRLFRERMAEAEASGDGSGKGGPWDF 336

Query: 301 KHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTL 360
           K++  L N    D+   G  ++LAS   L+ G S ++   WA + KN V+ T     GT+
Sbjct: 337 KYIRSLKNLDRFDDV--GGCVMLASPGMLQNGVSRELLERWAPNDKNGVIITGYSVEGTM 394

Query: 361 ARMLQADPPPKAVKVTMSR 379
           A+ +  +  P  ++  MSR
Sbjct: 395 AKQIMQE--PDQIQAVMSR 411


>gi|402471873|gb|EJW05382.1| hypothetical protein EDEG_00046 [Edhazardia aedis USNM 41457]
          Length = 507

 Score =  139 bits (349), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 93/365 (25%), Positives = 181/365 (49%), Gaps = 20/365 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           + V PL    +      L +++G   ++DCG    +ND+    D S +         ID 
Sbjct: 1   MHVIPLGAGQDVGRSCILATLEGRTIMLDCGMHMGYNDYRKFPDFSYISKQLGFNRLIDC 60

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLD 117
           +++SH    H GALPY  + LG   P++ T P   +  + + D     R+ ++   +  +
Sbjct: 61  IIISHFHIDHCGALPYFTEVLGYDGPIYMTHPTKAICQILLEDTRKIARKNNDKMTYNKE 120

Query: 118 DIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNR 177
           DI++  + V  +  ++ Y         ++ P+ AGH+LG  ++ +    E ++Y  DYN 
Sbjct: 121 DIENCMKKVIPINMNETYE---HDVDFIIKPYPAGHVLGAAMFYVKVGCESLVYTGDYNT 177

Query: 178 RKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVD 236
             ++HL G  ++  +RP + IT++      +  R+ +E  F  +I + ++ GG VL+P  
Sbjct: 178 TPDRHLGGAWIDC-LRPDLFITESTYGSTIRDCRKAKEREFLSSIYECVKNGGKVLIPTF 236

Query: 237 SAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN 296
           + GR  E+ L+++ YW + +L+ P+YF   ++       + ++ +  ++I K  +    N
Sbjct: 237 ALGRAQEMCLLIDSYWEKMNLSVPVYFTAGMAERANQIYRLYINYTNETIRK--KILERN 294

Query: 297 AFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL---FTE 353
            F  K++  L +K  +D  P GP ++LAS   L +G S ++F++   D  N+++   +  
Sbjct: 295 LFEYKYIKSL-DKGVID-LP-GPMVILASPGMLHSGNSLNLFLKICHDKNNMIVIPGYCV 351

Query: 354 RGQFG 358
           RG  G
Sbjct: 352 RGTVG 356


>gi|190346294|gb|EDK38344.2| hypothetical protein PGUG_02442 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 821

 Score =  139 bits (349), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 136/453 (30%), Positives = 202/453 (44%), Gaps = 92/453 (20%)

Query: 129 LTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHL----- 183
           L YSQ   LS     +++ P+ AGH LGGT W ITK  E VIYA  +N  K+  L     
Sbjct: 19  LKYSQT--LSLFENKMIITPYNAGHTLGGTFWCITKRLEKVIYAPSWNHSKDSFLSSSSF 76

Query: 184 ----NGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAG 239
                G  L   +RP VLIT+  +   N P +++ E F   +  TL  GG V+LP   +G
Sbjct: 77  LSASTGNPLSQLMRPTVLITNT-DLGSNLPHKKRAEKFLQLMDATLANGGAVVLPTSLSG 135

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE-------- 291
           R LELL +++ +     +  P+YFL+Y  +  ++Y  S LEWM  S+ K +E        
Sbjct: 136 RFLELLHLVDHHLQSQPI--PVYFLSYSGTKVLNYASSLLEWMSTSLVKEWEAASSASMN 193

Query: 292 -TSRDN-AFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAG-FSHDIFVEWASDVKNL 348
            T+++N  F    V LL +  EL     GPK+VL +   + +G  S ++     SD KN 
Sbjct: 194 STNKNNFPFDPSKVDLLSDPKELIQL-SGPKIVLCAGIDMNSGDVSFEVLKYLCSDQKNT 252

Query: 349 VLFTERGQFGT--------------------------LARMLQADPPPKAVKVTMSRRVP 382
           VL TE+  FG                           LA   +   P +     +SR  P
Sbjct: 253 VLLTEKTHFGADFSINAQLFTDWVRLSREKYGNAEDGLAIGYEGTIPLRG----LSREDP 308

Query: 383 LVGEELIAYEE-----------EQTRLKKEEA-LKASLVKEEESKASLGPDNNLSGD--- 427
           L G EL +++E           EQ R +K +  L A  ++EE+S +  G D   S +   
Sbjct: 309 LSGSELTSFQERINHQRKKKLFEQVRDRKNQNLLNADNLEEEDSSSDDGEDAESSDEEMP 368

Query: 428 ----------PMVIDAN-NANASADVVEPHGGRYR-------DILIDGFVPPSTSVAPMF 469
                     P  ID N NA  + D       +         D+ I   + P  ++ P  
Sbjct: 369 TTTETEAGAMPGAIDTNVNAIVTQDAFVADQVKQTLDDELPLDVKITHKLKPRQAMFPYI 428

Query: 470 PFYENNSEWDDFGEVINPDDYIIKDEDMDQAAM 502
           P ++   ++DD+GEVI+  DY  + ED+  A +
Sbjct: 429 PPHKR--KFDDYGEVIDIKDY-QRAEDLTNAKL 458


>gi|169767492|ref|XP_001818217.1| cleavage and polyadenylylation specificity factor [Aspergillus
           oryzae RIB40]
 gi|83766072|dbj|BAE56215.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 1014

 Score =  139 bits (349), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 116/435 (26%), Positives = 176/435 (40%), Gaps = 108/435 (24%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G   L+D GW+D FDP  LQ L K   T+  +LL+H    H+GA  +  K   L    PV
Sbjct: 27  GIKILVDVGWDDTFDPLDLQELEKHVPTLSLILLTHATPAHIGAFAHCCKTFPLFTQIPV 86

Query: 85  FSTEPVYRLGLLTMYDQYLS---------RRQVSE------------------------- 110
           ++T PV  LG   + D Y S         +  +SE                         
Sbjct: 87  YATSPVIALGRTLLQDLYASSPLAATFLPKASISEPGASTSAASAAASAAASAAASAPEG 146

Query: 111 -------------FDLFTLDDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAG 152
                            T ++I   F  +  L YSQ +            G+ +  + AG
Sbjct: 147 EGGADASHSGRILLQPPTAEEIARYFSLIHPLKYSQPHQPLPSPFSPPLNGLTLTAYNAG 206

Query: 153 HLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITD 200
           H +GGT+W I    E ++YAVD+N+ +E  + G             V+E   +P  L+  
Sbjct: 207 HTVGGTIWHIQHGMESIVYAVDWNQARESVMAGAAWFGGSGASGTEVIEQLRKPTALVCS 266

Query: 201 AYNALHNQPP--RQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS- 256
                    P  R++R+ +  D I  TL  GG VL+P D++ RVLEL   LE  W + + 
Sbjct: 267 TRGGDKFALPGGRKKRDDLLLDMIRSTLAKGGTVLIPTDTSARVLELAYALEHAWRDAAG 326

Query: 257 --------LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET----------SRDNA- 297
                       +Y     +++T+   +S LEWM ++I + FE           SR N  
Sbjct: 327 TGQEDNVLKEAGLYLAGRKANTTMRLARSMLEWMDENIVREFEAAEGVDAATGQSRANPG 386

Query: 298 -----------------FLLKHVTLLINKSELDNAPD--GPKLVLASMASLEAGFSHDIF 338
                            F  KH+ ++  K +L+   +   PK++LAS  SL+ GF+ +  
Sbjct: 387 GQRSGQNQGKEEKGTGPFTFKHLKIVERKKKLEKILNNQAPKVILASDTSLDWGFAKESL 446

Query: 339 VEWASDVKNLVLFTE 353
              A    NL+L TE
Sbjct: 447 RLVAGGPNNLLLLTE 461


>gi|357158307|ref|XP_003578085.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-II-like [Brachypodium distachyon]
          Length = 553

 Score =  139 bits (349), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 100/363 (27%), Positives = 165/363 (45%), Gaps = 22/363 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFD-----PSLLQPLSKVASTID------AVLLSHPDTLHLGA 70
           +V+I G   + DCG +  +      P   + L+    T D       V+++H    H+GA
Sbjct: 20  VVTIGGKRIMFDCGMHMGYHDCNRYPDFARILAAAPETTDFTSAISCVIITHFHLDHIGA 79

Query: 71  LPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDDIDSAFQSVTRL 129
           LPY  +  G   P++ T P   L  L + D + +   Q  E + ++ +DI    + V  +
Sbjct: 80  LPYFTEVCGYHGPIYMTYPTKALAPLMLEDYRKVMVDQRGEEEQYSYEDILRCMKKVIPV 139

Query: 130 TYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLE 189
              Q   ++     +V+  + AGH+LG  +         ++Y  DYN   ++HL    +E
Sbjct: 140 DLKQTIQVN---RDLVIRAYYAGHVLGAAMVYAKVGDAAMVYTGDYNMTPDRHLGAAQIE 196

Query: 190 SFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLIL 248
             ++  +LIT++  A   +  +  RE  F  A+ K +  GG VL+P  + GR  EL ++L
Sbjct: 197 R-LKLDLLITESTYAKTIRDSKHAREREFLKAVHKCVSEGGKVLIPTFALGRAQELCILL 255

Query: 249 EDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLIN 308
           +DYW   +L  PIYF   ++     Y K  + W    I  S+     N F  KHV     
Sbjct: 256 DDYWERMNLKIPIYFSAGLTIQANMYYKMLIGWTSQKIKDSYTVQ--NPFDFKHVCHF-- 311

Query: 309 KSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           +    N P GP ++ A+   +  GFS ++F  WA+  KNLV        GT+   L +  
Sbjct: 312 ERSFINDP-GPCVLFATPGMISGGFSLEVFKRWATSDKNLVTLPGYCVAGTIGHKLMSGK 370

Query: 369 PPK 371
           P +
Sbjct: 371 PTR 373


>gi|448124505|ref|XP_004204939.1| Piso0_000226 [Millerozyma farinosa CBS 7064]
 gi|358249572|emb|CCE72638.1| Piso0_000226 [Millerozyma farinosa CBS 7064]
          Length = 948

 Score =  139 bits (349), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 126/453 (27%), Positives = 196/453 (43%), Gaps = 66/453 (14%)

Query: 22  LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGA-------LPY 73
           L+S D     L D  WN   +P  +  L K    ID +LLSH     +          PY
Sbjct: 20  LLSFDNEIKILADPSWNGK-NPDSVLYLEKYLKEIDLILLSHATAEFISGYVLLCVKFPY 78

Query: 74  AMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEF--DLFTLDDIDSAFQSVTRLTY 131
            M  +     V+ST PV +LG ++  + Y S   +      +   D++D  F  V  L Y
Sbjct: 79  LMSNIA----VYSTLPVNQLGRISTIEYYRSSGILGPLKDSILEADEVDEWFDKVKPLKY 134

Query: 132 SQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLES- 190
            Q  +L      +V+ P+ AGH LGGT W +T+  E VIYA  +N  K+  LN     S 
Sbjct: 135 MQTLNLFD--SKLVITPYNAGHTLGGTFWLLTRQLEKVIYAPAWNHSKDSFLNNATFLSS 192

Query: 191 --------FVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVL 242
                    +RP  LIT+  +       +++ E F   +  TL  GG VLLP   AGR L
Sbjct: 193 STGNPSSQLLRPTALITNT-DLGSTMSHKKRTEKFLQLVDATLANGGTVLLPTSLAGRFL 251

Query: 243 ELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA----- 297
           ELL +++ +    S   P+YFL+Y  +  ++Y  + LEWM   + K +E +  +      
Sbjct: 252 ELLHLVDQHL--QSAPIPVYFLSYSGTRVLNYASNLLEWMSGQLIKEWEEASSSTNNSSN 309

Query: 298 -----FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAG-FSHDIFVEWASDVKNLVLF 351
                F    V LL + +EL     GPK+V  S    + G  S ++      D K  ++ 
Sbjct: 310 KNNFPFDPSKVDLLSDPNELIQL-SGPKIVFCSGLDFKDGDVSFEVLSYLCQDEKTTIIL 368

Query: 352 TERGQFGT----------------------LARMLQADPPPKAVKVT-MSRRVPLVGEEL 388
           TE+  FG+                      L     A P  K + +   ++  PL+G EL
Sbjct: 369 TEKTHFGSDDTINSQLYREWYELTKQRNGGLVEDGTAVPLEKIINLQHWTKEEPLIGTEL 428

Query: 389 IAYEEEQTRLKKEEALKASLVKEEESKASLGPD 421
             ++E  ++ +K+  L    V++ +++  L  D
Sbjct: 429 SDFQERISQQRKQRLLAK--VRDRKNQNLLNAD 459


>gi|449460766|ref|XP_004148116.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-II-like [Cucumis sativus]
          Length = 649

 Score =  138 bits (348), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 98/360 (27%), Positives = 176/360 (48%), Gaps = 20/360 (5%)

Query: 22  LVSIDGFNFLIDCGWN-DHFDPSLLQPLSKVASTID------AVLLSHPDTLHLGALPYA 74
           +V+I+G   + DCG +  + D       S+++++ D       ++++H    H+GALPY 
Sbjct: 20  VVTINGKRIMFDCGMHLGYVDHRRYPDFSRISASHDYNNVLSCIIITHFHLDHIGALPYF 79

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTM--YDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYS 132
            +  G + P++ T P   L  +T+  Y + +  R+  E + FT D I    + V  +   
Sbjct: 80  TEVCGYNGPIYMTYPTMALAPITLEDYRKVMVDRR-GEAEQFTNDHIMECLKKVVPVDLK 138

Query: 133 QNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFV 192
           Q   +    E + +  + AGH+LG  ++        ++Y  DYN   ++HL    ++  +
Sbjct: 139 QTIQVD---EDLQIRAYYAGHVLGAAMFYAKVGDAAMVYTGDYNMTPDRHLGAAQIDR-M 194

Query: 193 RPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDY 251
           +  +LIT++  A   +  +  RE  F  A+   L +GG VL+P  + GR  EL ++L+DY
Sbjct: 195 QLDLLITESTYATTIRDSKYAREREFLKAVHNCLASGGKVLIPTFALGRAQELCVLLDDY 254

Query: 252 WAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSE 311
           W   +L +PIY    ++     Y K  + W    + +++ T   NAF  K+V    ++S 
Sbjct: 255 WERMNLKFPIYVSAGLTVQANMYYKMLISWTSQKVKETYTTR--NAFDFKNVQKF-DRSM 311

Query: 312 LDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPK 371
           +D AP GP ++ A+   + +GFS ++F  WA    NL+        GT+   L +  P K
Sbjct: 312 ID-AP-GPCVLFATPGMISSGFSLEVFKRWAPSKLNLITLPGYCVAGTVGHKLMSGKPTK 369


>gi|396488788|ref|XP_003842943.1| similar to cleavage and polyadenylation specifity factor
           [Leptosphaeria maculans JN3]
 gi|312219521|emb|CBX99464.1| similar to cleavage and polyadenylation specifity factor
           [Leptosphaeria maculans JN3]
          Length = 861

 Score =  138 bits (348), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 95/368 (25%), Positives = 177/368 (48%), Gaps = 26/368 (7%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           +++   G   ++D G +  ++     P       ST+D +L+SH    H  +LPY + + 
Sbjct: 40  HIIQYKGKTVMLDAGMHPAYEGLSAMPFYDEFDLSTVDVLLISHFHVDHAASLPYVLAKT 99

Query: 79  GLSAPVFSTEPVYRLGLLTMYDQY----LSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQN 134
                VF T P   +    + D      +S    ++  ++T  D  + +  +  + +   
Sbjct: 100 NFKGRVFMTHPTKAIYKWLIQDSVRVGNMSSNSETKIQMYTEQDHLNTYPMIESIDFYTT 159

Query: 135 YHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRP 194
           + +SG    + + P+ AGH+LG  ++ +   G  +++  DY+R  ++HL    + + V+ 
Sbjct: 160 HTVSG----VRITPYPAGHVLGAAMFLMEIAGLKILFTGDYSREDDRHLVSASVPAGVKV 215

Query: 195 AVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 253
            VLIT++   +    PR +RE     AI+  L  GG  LLPV + GR  ELLLIL++YW+
Sbjct: 216 DVLITESTFGISMHTPRVEREAQLMKAITDILNRGGRALLPVFALGRAQELLLILDEYWS 275

Query: 254 EHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF-----------ETSRDNAFLL 300
           +H      PIY+ + ++   +   ++++  M D+I + F           +T R  A+  
Sbjct: 276 KHPEVQKIPIYYNSSLARKCMQVYQTYVSAMNDNIKRLFAERMAEAEAAGDTGRRGAWDF 335

Query: 301 KHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTL 360
           K V  L +    D+   G  ++LAS   +++G S ++   WA D +N V+ T     GT+
Sbjct: 336 KFVRSLKSLERFDDL--GGCVMLASPGMMQSGTSRELLERWAPDPRNGVIITGYSVEGTM 393

Query: 361 ARMLQADP 368
           A+ +  +P
Sbjct: 394 AKQIVHEP 401


>gi|401882746|gb|EJT46990.1| cleavage and polyadenylation specificity factor [Trichosporon
           asahii var. asahii CBS 2479]
 gi|406700483|gb|EKD03650.1| cleavage and polyadenylation specificity factor [Trichosporon
           asahii var. asahii CBS 8904]
          Length = 738

 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 104/348 (29%), Positives = 169/348 (48%), Gaps = 41/348 (11%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP-----VYRLGLLTMYDQYLSR-- 105
           ST+DA+L++H    H  ALPY M+++ L    +           R G+    D    R  
Sbjct: 77  STVDAILITHFHVDHAAALPYIMEKVRLMVLCWELTSDELPGRKRQGVHDARDACHLRTD 136

Query: 106 ------RQVSEF--DLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGG 157
                  Q +E    L+   D+ +++++   + Y Q+ ++SG   G+   P+ AGH+LG 
Sbjct: 137 DDGHRPHQNAEAAGRLYNEADVQASWENTIAVDYHQDINISG---GLRFTPYHAGHVLGA 193

Query: 158 TVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESF--VRPAVLITDAYNALHNQPPRQQRE 215
           +++ I   G  V+Y  DY+R +++HL   V+     V+P V+I ++   +H  P R+ +E
Sbjct: 194 SMFLIEIAGLKVLYTGDYSREEDRHL---VIAEVPPVKPDVMICESTFGVHTLPDRKDKE 250

Query: 216 -------------MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYP 260
                        +    +S  +R GG VL+P+ S G   EL L+L+DYW +H      P
Sbjct: 251 EQFTSELISRATQLTSALVSNIVRRGGKVLMPIPSFGNGQELALLLDDYWNDHPELQGVP 310

Query: 261 IYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPK 320
           IYF + +    +   K ++  M  +I   F   RDN F  K+V  L +   LD+    P 
Sbjct: 311 IYFASGLFQRGMRVYKKYVHTMNANIRSRF-ARRDNPFDFKYVKWLKDPKRLDHKQ--PC 367

Query: 321 LVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           +V+AS   +  G S ++  EWA D KN V+ T     GT+AR L  +P
Sbjct: 368 VVMASAQFMSFGLSRELLEEWAPDPKNGVIVTGYSIEGTMARTLLGEP 415


>gi|401404496|ref|XP_003881737.1| hypothetical protein NCLIV_014990 [Neospora caninum Liverpool]
 gi|325116150|emb|CBZ51704.1| hypothetical protein NCLIV_014990 [Neospora caninum Liverpool]
          Length = 1033

 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 109/395 (27%), Positives = 178/395 (45%), Gaps = 37/395 (9%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSH 62
           V++TPL           +V   G   + DCG +  +      P+      +++D  L++H
Sbjct: 106 VEITPLGAGCEVGRSCVIVRYKGVTVMFDCGVHPAYSGLGALPIFDAVDMTSVDVCLITH 165

Query: 63  PDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-------------------QYL 103
               H GALPY + +      VF TEP   +  L   D                   Q  
Sbjct: 166 FHLDHCGALPYLVTKTAFRGRVFMTEPTRVISKLVWLDYARMSAFSQAPEQANAAASQRA 225

Query: 104 SRRQVSEFD----LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTV 159
           S  Q  +      L+  DD+D   Q    L + Q   + G    + ++   AGH+LG  +
Sbjct: 226 SSGQGDKSGAGNYLYDEDDVDKTVQMAECLDFHQQVEVGG----VKISCFGAGHVLGACM 281

Query: 160 WKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE--MF 217
           + I   G  ++Y  D++R K++H+    +   V   +LI ++   +H    RQ RE    
Sbjct: 282 FLIEIGGVRMLYTGDFSREKDRHVPIAEVPP-VDVQLLICESTYGIHVHDDRQLRERRFL 340

Query: 218 QDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYV 275
           +  +   L  GG  LLPV + GR  ELLLILE+YW  H    + PI FL+ +SS  +   
Sbjct: 341 KAVVDIVLNRGGKCLLPVFALGRAQELLLILEEYWTAHPEVCHVPILFLSPLSSKCMVVF 400

Query: 276 KSFLEWMGDSITKSFETSRDNAFLLKHVTLL--INKSELDNAPDGPKLVLASMASLEAGF 333
            +F++  GD++ ++     +N F  + V  L  +  + +    DGP +++A+   L++G 
Sbjct: 401 DAFVDMCGDAV-RNRALRGENPFAFRFVKNLKSVESARVYIHHDGPAVIMAAPGMLQSGA 459

Query: 334 SHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           S +IF   A + KN V+ T     GTLA  L+ +P
Sbjct: 460 SREIFEALAPESKNGVILTGYSVKGTLADELKREP 494


>gi|167526212|ref|XP_001747440.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774275|gb|EDQ87907.1| predicted protein [Monosiga brevicollis MX1]
          Length = 668

 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 98/355 (27%), Positives = 166/355 (46%), Gaps = 18/355 (5%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV----ASTIDAVLLSHPDTLHLGALPYAMK 76
           ++++  GF  ++DCG   H   S L  L  V     S +D   ++H    H GALP+ + 
Sbjct: 40  HIITYKGFTIMLDCG--THPAKSGLAQLPYVDEVDLSQVDFCFVTHFHVDHCGALPWLLS 97

Query: 77  QLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYH 136
           +      VF T     +    + D         +  LF+  DI++  + +  + + Q   
Sbjct: 98  KTPFKGRVFMTHATKAVYQWMLTDYVRINATTDDNQLFSDKDIENTMKRIETVDFEQTVM 157

Query: 137 LSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAV 196
           L     G+   P+ AGH+LG  +++I   G  ++Y  D++R +++HL    +   ++P +
Sbjct: 158 L----RGLSFTPYSAGHVLGACMFEIDIAGVKLLYTGDFSRDEDRHLMAASIPP-IKPDI 212

Query: 197 LITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEH 255
           LI ++         RQ RE  F   +   ++ GG  L+PV + GR  ELLLIL++YW +H
Sbjct: 213 LIAESTLGDLEHENRQDRERRFTKEVHTIVQRGGRCLIPVFALGRAQELLLILDEYWQQH 272

Query: 256 S--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELD 313
               N PIY+ + ++   +   K+F+  M   I +  + S  N F  + +  L    E D
Sbjct: 273 PELHNVPIYYASALAKRCMGVFKAFVNMMNPKIQQQMKIS--NPFQFQFIHNLRKLDEFD 330

Query: 314 NAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           +   G  +VLA+   L+ G S ++F  WA +  N V+       GTLA  L   P
Sbjct: 331 D--HGSSVVLATPGMLQNGLSRELFERWAPNRHNGVILAGYHVEGTLAHELLKQP 383


>gi|407411604|gb|EKF33594.1| cleavage and polyadenylation specificity factor, putative
           [Trypanosoma cruzi marinkellei]
          Length = 763

 Score =  138 bits (347), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 102/371 (27%), Positives = 176/371 (47%), Gaps = 19/371 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPL----SKVASTIDAVLL 60
           V++ P+           ++   G + ++DCG  +H   S L  L    S     ID VL+
Sbjct: 39  VEILPIGSGGEVGRSCVILRYKGRSVMLDCG--NHPAKSGLDSLPFFDSIRCDEIDLVLI 96

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           +H    H GALPY  +Q      VF T        + M D    R   S  D+ T + + 
Sbjct: 97  THFHLDHCGALPYFCEQTAFKGRVFMTSATKAFYKMVMND--FLRVGASANDIVTNEWLQ 154

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           S  + +  + Y +   ++G    I   P  AGH+LG  ++ +   G   +Y  D++R  +
Sbjct: 155 STIEKIETVEYHEEVTVNG----IRFQPFNAGHVLGAALFMVDIAGMKTLYTGDFSRVPD 210

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAG 239
           +HL G  + S+  P +LI ++ N +     R++R  +F   +   ++ GG  L+PV + G
Sbjct: 211 RHLLGAEVPSY-SPDILIAESTNGIRELESREERETLFTTWVHDVVKGGGRCLVPVFALG 269

Query: 240 RVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA 297
           R  ELLLILE+YW  H    + PIY+ + ++   +   ++F+  M D + +     R N 
Sbjct: 270 RAQELLLILEEYWEAHKELQHIPIYYASSLAQRCMKLYQTFVSAMNDRVKQQHANHR-NP 328

Query: 298 FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQF 357
           F+ K++  L+     ++   GP +VLAS   L++G S ++F  W  D +N ++       
Sbjct: 329 FVFKYIHSLMETRSFEDT--GPCVVLASPGMLQSGISLELFERWCGDRRNGIIIAGYCVD 386

Query: 358 GTLARMLQADP 368
           GT+A+ +   P
Sbjct: 387 GTIAKDILTKP 397


>gi|50363261|gb|AAT75333.1| cleavage polyadenylation specificity factor CPSF73 [Trypanosoma
           cruzi]
          Length = 762

 Score =  138 bits (347), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 102/371 (27%), Positives = 176/371 (47%), Gaps = 19/371 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPL----SKVASTIDAVLL 60
           V++ P+           ++   G + ++DCG  +H   S L  L    S     ID VL+
Sbjct: 38  VEILPIGSGGEVGRSCVILRYKGRSVMLDCG--NHPAKSGLDSLPFFDSIRCDEIDLVLI 95

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           +H    H GALPY  +Q      VF T        + M D    R   S  D+ T + + 
Sbjct: 96  THFHLDHCGALPYFCEQTAFKGRVFMTSATKAFYKMVMND--FLRVGASANDIVTNEWLQ 153

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           S  + +  + Y +   ++G    I   P  AGH+LG  ++ +   G   +Y  D++R  +
Sbjct: 154 STIEKIETVEYHEEVTVNG----IRFQPFNAGHVLGAALFMVDIAGMKTLYTGDFSRVPD 209

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAG 239
           +HL G  + S+  P +LI ++ N +     R++R  +F   +   ++ GG  L+PV + G
Sbjct: 210 RHLLGAEVPSY-SPDILIAESTNGIRELESREERETLFTTWVHDVVKGGGRCLVPVFALG 268

Query: 240 RVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA 297
           R  ELLLILE+YW  H    + PIY+ + ++   +   ++F+  M D + +     R N 
Sbjct: 269 RAQELLLILEEYWEAHKELQHIPIYYASSLAQRCMKLYQTFVSAMNDRVKQQHANHR-NP 327

Query: 298 FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQF 357
           F+ K++  L+     ++   GP +VLAS   L++G S ++F  W  D +N ++       
Sbjct: 328 FVFKYIHSLMETRSFEDT--GPCVVLASPGMLQSGISLELFERWCGDRRNGIIIAGYCVD 385

Query: 358 GTLARMLQADP 368
           GT+A+ +   P
Sbjct: 386 GTIAKDILTKP 396


>gi|407851025|gb|EKG05159.1| cleavage and polyadenylation specificity factor, putative
           [Trypanosoma cruzi]
          Length = 762

 Score =  138 bits (347), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 102/371 (27%), Positives = 176/371 (47%), Gaps = 19/371 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPL----SKVASTIDAVLL 60
           V++ P+           ++   G + ++DCG  +H   S L  L    S     ID VL+
Sbjct: 38  VEILPIGSGGEVGRSCVILRYKGRSVMLDCG--NHPAKSGLDSLPFFDSIRCDEIDLVLI 95

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           +H    H GALPY  +Q      VF T        + M D    R   S  D+ T + + 
Sbjct: 96  THFHLDHCGALPYFCEQTAFKGRVFMTSATKAFYKMVMND--FLRVGASANDIVTNEWLQ 153

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           S  + +  + Y +   ++G    I   P  AGH+LG  ++ +   G   +Y  D++R  +
Sbjct: 154 STIEKIETVEYHEEVTVNG----IRFQPFNAGHVLGAALFMVDIAGMKTLYTGDFSRVPD 209

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAG 239
           +HL G  + S+  P +LI ++ N +     R++R  +F   +   ++ GG  L+PV + G
Sbjct: 210 RHLLGAEVPSY-SPDILIAESTNGIRELESREERETLFTTWVHDVVKGGGRCLVPVFALG 268

Query: 240 RVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA 297
           R  ELLLILE+YW  H    + PIY+ + ++   +   ++F+  M D + +     R N 
Sbjct: 269 RAQELLLILEEYWEAHKELQHIPIYYASSLAQRCMKLYQTFVSAMNDRVKQQHANHR-NP 327

Query: 298 FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQF 357
           F+ K++  L+     ++   GP +VLAS   L++G S ++F  W  D +N ++       
Sbjct: 328 FVFKYIHSLMETRSFEDT--GPCVVLASPGMLQSGISLELFERWCGDRRNGIIIAGYCVD 385

Query: 358 GTLARMLQADP 368
           GT+A+ +   P
Sbjct: 386 GTIAKDILTKP 396


>gi|367034742|ref|XP_003666653.1| hypothetical protein MYCTH_2311535 [Myceliophthora thermophila ATCC
           42464]
 gi|347013926|gb|AEO61408.1| hypothetical protein MYCTH_2311535 [Myceliophthora thermophila ATCC
           42464]
          Length = 879

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 102/380 (26%), Positives = 183/380 (48%), Gaps = 30/380 (7%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           +++   G   ++D G +  +D     P       ST+D +L+SH    H  +LPY + + 
Sbjct: 41  HIIQYKGKTVMLDAGQHPAYDGLAALPFFDDFDLSTVDVLLISHFHIDHAASLPYVLAKT 100

Query: 79  GLSAPVFSTEPVYRLGLLTMYDQY----LSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQN 134
                VF T P   +    + D       S    ++  ++T  D  + F  +  + Y   
Sbjct: 101 NFRGRVFMTHPTKAIYKWLIQDSVRVGNTSSNPTTQL-VYTEQDHLNTFPMIEAIDYHTT 159

Query: 135 YHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRP 194
           + +S     I + P+ AGH+LG  ++ I   G ++++  DY+R +++HL    +   V+ 
Sbjct: 160 HTISS----IRITPYPAGHVLGAAMFLIEIAGLNILFTGDYSREQDRHLVSAEVPKGVKI 215

Query: 195 AVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 253
            VLIT++   + +  PR +RE     +I+  L  GG VL+PV + GR  ELLLIL++YWA
Sbjct: 216 DVLITESTYGIASHVPRLEREQALMKSITSVLNRGGRVLMPVFALGRAQELLLILDEYWA 275

Query: 254 EHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF-------ETSRDNA-----FL 299
           +H     YPIY+ + ++   +   ++++  M D+I + F       E S D A     + 
Sbjct: 276 KHKEYQKYPIYYASNLARKCMLVYQTYVGAMNDNIKRLFRERMAEAEASGDGAGKGGPWD 335

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
            K +  L +    ++   G  ++LAS   L+ G S ++   WA   KN V+ T     GT
Sbjct: 336 FKFIRSLKSIDRFEDV--GGCVMLASPGMLQNGVSRELLERWAPSEKNGVIITGYSVEGT 393

Query: 360 LARMLQADPPPKAVKVTMSR 379
           +A+ +  +  P+ ++  M+R
Sbjct: 394 MAKHIMQE--PEQIQAVMTR 411


>gi|402594378|gb|EJW88304.1| cleavage and polyadenylation specificity factor subunit 3
           [Wuchereria bancrofti]
          Length = 694

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 103/378 (27%), Positives = 179/378 (47%), Gaps = 39/378 (10%)

Query: 7   VTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVAST--IDAVLLSHPD 64
           +TPL          + ++  G   L+DCG +         P         +D +L++H  
Sbjct: 15  ITPLGSGQEVGRSCHYLTFKGKKILLDCGIHPGMSGVDALPFVDFVDCEELDLLLVTHFH 74

Query: 65  TLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDS 121
             H GALP+ +++       F   +T+ +YR+ +      YL   +VS++          
Sbjct: 75  LDHCGALPWLLEKTAFRGRCFMTHATKAIYRMSI----GDYL---KVSKY---------- 117

Query: 122 AFQSVTRLTYSQ-------NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVD 174
              S  R+ Y++       ++H   +  GI    HVAGH+LG  ++ I   G  ++Y  D
Sbjct: 118 GGSSDNRMLYNEEDLEKVIDFHEQKEVNGIKFWCHVAGHVLGACMFMIEIAGVRILYTGD 177

Query: 175 YNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLL 233
           ++R +++HL    L + V P VLI ++         R +RE  F   + + +  GG  L+
Sbjct: 178 FSRLEDRHLCAAELPT-VSPDVLICESTYGTQVHESRDEREKRFTSIVHEIVGRGGRCLI 236

Query: 234 PVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE 291
           P  + GR  ELLLIL++YW  H    + P+Y+ + ++   +   ++F+  M   I K  +
Sbjct: 237 PAFALGRAQELLLILDEYWESHPELQDIPVYYASSLAKKCMAVYQTFVSGMNSRIQK--Q 294

Query: 292 TSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
            + +N F+ KHV+   N   +D+  D GP +VLAS   L+ G S ++F  W +D KN  +
Sbjct: 295 IALNNPFVFKHVS---NLKSIDHFEDVGPCVVLASPGMLQNGLSRELFENWCTDSKNGCI 351

Query: 351 FTERGQFGTLARMLQADP 368
                  GTLA+ + ++P
Sbjct: 352 IAGYCVEGTLAKHILSEP 369


>gi|402084516|gb|EJT79534.1| endoribonuclease YSH1 [Gaeumannomyces graminis var. tritici
           R3-111a-1]
          Length = 868

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 100/369 (27%), Positives = 171/369 (46%), Gaps = 26/369 (7%)

Query: 20  SYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQ 77
            +++   G   ++D G +  +D     P       ST+D +L+SH    H  +LPY + +
Sbjct: 40  CHIIQYRGKTVMLDAGQHPAYDGLAALPFFDDFDLSTVDVLLISHFHIDHAASLPYVLAK 99

Query: 78  LGLSAPVFSTEPVYRLGLLTMYDQYL---SRRQVSEFDLFTLDDIDSAFQSVTRLTYSQN 134
                 VF T P   +    M D      +    +   ++T  D  + F  +  + Y   
Sbjct: 100 TNFKGRVFMTHPTKAIYKWLMQDSVRVGNTSSNPTSQPVYTEQDHLNTFPQIEAIDYYTT 159

Query: 135 YHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRP 194
           + +S     I + P+ AGH+LG  ++ I   G +V +  DY+R +++HL    +   V+ 
Sbjct: 160 HTISS----IRITPYPAGHVLGAAMFLIEIAGLNVFFTGDYSREQDRHLVSAEVPRGVQI 215

Query: 195 AVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 253
            VLIT++   + +  PR +RE     +I+  L  GG VL+PV + GR  ELLLIL++YW 
Sbjct: 216 DVLITESTYGIASHVPRMEREQALMKSITGILNRGGRVLMPVFALGRAQELLLILDEYWD 275

Query: 254 EHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA------------FL 299
            HS     PIY+ + ++   +   ++++  M D+I + F      A            + 
Sbjct: 276 RHSEYQKVPIYYASNLARKCMVVYQTYVGAMNDNIKRLFRERLAEAEAAGNVGTGGGPWD 335

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
            K +  L N    D+   GP ++LAS   L+ G S ++   WA   KN V+ T     GT
Sbjct: 336 FKFIRSLKNLDRFDDL--GPCVMLASPGMLQTGVSRELLERWAPSDKNGVVITGYSVEGT 393

Query: 360 LARMLQADP 368
           +A+ +  +P
Sbjct: 394 MAKQIMQEP 402


>gi|429862463|gb|ELA37111.1| cleavage and polyadenylation specifity 73 kda [Colletotrichum
           gloeosporioides Nara gc5]
          Length = 831

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 104/393 (26%), Positives = 181/393 (46%), Gaps = 33/393 (8%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           +++   G   ++D G +  +D     P       ST+D +L+SH    H  +LPY + + 
Sbjct: 37  HIIQYKGKTVMLDAGQHPAYDGLAALPFFDDFDLSTVDVLLISHFHVDHAASLPYVLSKT 96

Query: 79  GLSAPVFSTEPVYRLGLLTMYDQYL---SRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNY 135
                VF T P   +    + D      +    +   ++T  D  + F  +  + Y   +
Sbjct: 97  NFRGRVFMTHPTKAIYKWLIQDSVRVGNTSSNPTSQPVYTEADHLNTFPQIEAIDYHTTH 156

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
            +S     I + P+ AGH+LG  ++ I   G  + +  DY+R +++HL    +   V+  
Sbjct: 157 TISS----IRITPYPAGHVLGAAMFLIEIAGLKIFFTGDYSREQDRHLVSAEVPKGVKID 212

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT++   + +  PR +RE     +I+  L  GG VL+PV + GR  ELLLIL++YW +
Sbjct: 213 VLITESTYGIASHVPRLEREQALMKSITGILNRGGRVLMPVFALGRAQELLLILDEYWGK 272

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF-------ETSRDNA-----FLL 300
           H     +PIY+ + ++   +   ++++  M D+I + F       E S D +     +  
Sbjct: 273 HGEYQKFPIYYASNLARKCMVVYQTYVGAMNDNIKRLFRERMAEAEASGDGSGKGGPWDF 332

Query: 301 KHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTL 360
           K++  L N    D+   G  ++LAS   L+ G S ++   WA + KN V+ T     GT+
Sbjct: 333 KYIRSLKNLDRFDDV--GGCVMLASPGMLQNGVSRELLERWAPNEKNGVIITGYSVEGTM 390

Query: 361 ARMLQADP-------PPKAVKVTMSRRVPLVGE 386
           A+ +  +P       PP A       R   V E
Sbjct: 391 AKQIMQEPDQIQAVMPPPARDADPEERARSVAE 423


>gi|326473038|gb|EGD97047.1| cleavage and polyadenylylation specificity factor [Trichophyton
           tonsurans CBS 112818]
          Length = 1024

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 114/432 (26%), Positives = 179/432 (41%), Gaps = 105/432 (24%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQ--LGLSAPV 84
           G   L+D GW++ FD S+L+ L +   T+  +LL+H    HLGA  +  +   L +  P+
Sbjct: 27  GVKILVDVGWDESFDTSVLKELERHIPTLSLILLTHATPSHLGAFVHCCRTYPLFMQIPI 86

Query: 85  FSTEPVYRLGLLTMYDQYLSRRQVSEF---DLFTLDDIDSAFQSVTRLTYSQN---YHLS 138
           ++T PV   G   + + Y S    + F      T  D  S     +  + SQ    Y  +
Sbjct: 87  YATIPVIAFGRTYLQNLYASAPLAATFLPSTSVTASDPSSGLTIQSATSPSQGPSGYETT 146

Query: 139 GKGE---------------------------------------GIVVAPHVAGHLLGGTV 159
           G G                                        G+ +  + AGH +GGT+
Sbjct: 147 GSGRILLPPPTNEDIARYFSLIHPLKYSQPLQPLPSPFSPPLNGLTITAYNAGHTVGGTI 206

Query: 160 WKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAYNALHN 207
           W I    E ++YAVD+++ +E  + G             V+E   +P  LI+ A      
Sbjct: 207 WHIQHGMESIVYAVDWSQARENVIAGAAWFGSSIGSGTEVIEQLRKPTALISSASGGDKF 266

Query: 208 QPP--RQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW-----AEHS--- 256
             P  R++R+ +  D I      GG VLLP DS+ RVLE+  +LE  W     +E S   
Sbjct: 267 ALPGGRKKRDGLLLDMIRSCAAKGGTVLLPTDSSARVLEIAYVLEHAWRGAADSEDSNDP 326

Query: 257 -LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE------------------------ 291
             N P+Y     +  T+   +S LEWM ++I + FE                        
Sbjct: 327 LKNTPLYLAGKKAHGTMRLARSMLEWMDENIVREFEGNDGVEATTGKAAGGTSTQPSKAA 386

Query: 292 TSRDNA--------FLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHDIFVEW 341
            S+ +A        F  KH+ L+ +K++LD      GPK++L+   SLE G S  +    
Sbjct: 387 QSQKSATGQKSLGPFTFKHLNLVEHKAKLDGILESKGPKVILSPDTSLEWGLSRHVLKHI 446

Query: 342 ASDVKNLVLFTE 353
           A   +NL++ TE
Sbjct: 447 AEGNENLIIMTE 458


>gi|360043111|emb|CCD78523.1| cleavage and polyadenylation specificity factor-related
           [Schistosoma mansoni]
          Length = 670

 Score =  137 bits (346), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 92/356 (25%), Positives = 180/356 (50%), Gaps = 19/356 (5%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA---STIDAVLLSHPDTLHLGALPYAMKQL 78
           L++  G   ++DCG +         P         T D +L+SH    H G LP+ + + 
Sbjct: 30  LLTFKGKKIILDCGIHPGLRNRESLPFIDAIPDIQTTDLILISHFHLDHCGGLPHLLLKT 89

Query: 79  GLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNY 135
           G  +  +   +T+ +YR  LL  + +  +   + +  L++  DI ++   +  + + Q  
Sbjct: 90  GAKSKCYMTHATKAIYRY-LLADFVRVSNSGGLPDQLLYSDRDIVASLDHIDTIDFHQEL 148

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
            ++G    I  + + AGH+LG  ++ I   G  ++Y  D++R++++HL    +   +RP 
Sbjct: 149 EVNG----IKFSAYHAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMCAEIPP-IRPD 203

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT+A   +H    R++RE  F   +   +  GG  L+P  + GR  EL+LIL++YW  
Sbjct: 204 VLITEATYGIHIHDKREEREARFTSLVHDIVTRGGRCLIPAFALGRAQELMLILDEYWDN 263

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M + I    + + +N F  +H++ L +    
Sbjct: 264 HPELHDIPIYYASQLARKCMAVYQTYIYAMNERIRN--QLANNNPFCFRHISNLKSIEHF 321

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           D++  GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + + P
Sbjct: 322 DDS--GPCVVMASPGMMQSGLSRELFENWCTDKRNGVIIAGYCVEGTLAKQILSLP 375


>gi|170093225|ref|XP_001877834.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164647693|gb|EDR11937.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 772

 Score =  137 bits (346), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 97/322 (30%), Positives = 162/322 (50%), Gaps = 21/322 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGL---SAPVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+DA+L++H    H  AL Y  ++         V+ T P   +    M D Y+     +
Sbjct: 57  STVDAILITHFHLDHAAALTYITEKTNFRDGKGKVYMTHPTKAVHKFMMQD-YVRMGSST 115

Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
              LF+  D+  +  S+  ++  Q   L     G+   P+ AGH+LG  ++ I   G  +
Sbjct: 116 SDALFSPLDMTMSLASIIPVSAHQ---LITICPGVSFTPYHAGHVLGACMFLIDIAGLKI 172

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAG 228
           +Y  DY+R +++HL    L   VRP VLI ++   + +   R+++E  F + +   +R G
Sbjct: 173 LYTGDYSREEDRHLVKAELPP-VRPDVLIVESTYGVQSLEGREEKEQRFTNLVHSVIRRG 231

Query: 229 GNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSI 286
           G+VLLP  + GR  ELLLIL++YW +H    N PIY+ + ++   +   ++++  M ++I
Sbjct: 232 GHVLLPAFALGRAQELLLILDEYWKKHPDLHNVPIYYASSLARKCMAVYQTYIHTMNNNI 291

Query: 287 TKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVK 346
              F   RDN F+ K   +         A   P +VLAS   ++ G S ++F  WA D +
Sbjct: 292 RSRF-AKRDNPFVFKCKKI---------AEGPPCVVLASPGFMQVGPSRELFELWAPDAR 341

Query: 347 NLVLFTERGQFGTLARMLQADP 368
           N ++ T     GTLAR +  +P
Sbjct: 342 NGLIITGYSIEGTLARDIMTEP 363


>gi|116200035|ref|XP_001225829.1| hypothetical protein CHGG_08173 [Chaetomium globosum CBS 148.51]
 gi|88179452|gb|EAQ86920.1| hypothetical protein CHGG_08173 [Chaetomium globosum CBS 148.51]
          Length = 854

 Score =  137 bits (346), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 102/380 (26%), Positives = 182/380 (47%), Gaps = 30/380 (7%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           +++   G   ++D G +  +D     P       ST+D +L+SH    H  +LPY + + 
Sbjct: 41  HIIQYKGKTVMLDAGQHPAYDGLAALPFFDDFDLSTVDVLLISHFHIDHAASLPYVLAKT 100

Query: 79  GLSAPVFSTEPVYRLGLLTMYDQY----LSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQN 134
                VF T P   +    + D       S    ++  ++T  D  + F  +  + Y   
Sbjct: 101 NFRGRVFMTHPTKAIYKWLIQDSVRVGNTSSNPTTQL-VYTEQDHLNTFPMIEAIDYHTT 159

Query: 135 YHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRP 194
           + +S     I + P+ AGH+LG  ++ I   G ++++  DY+R +++HL    +   VR 
Sbjct: 160 HTISS----IRITPYPAGHVLGAAMFLIEIAGLNILFTGDYSREQDRHLVSAEVPKGVRV 215

Query: 195 AVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 253
            VLIT++   + +  PR +RE     +I+  L  GG VL+PV + GR  ELLLIL++YW 
Sbjct: 216 DVLITESTYGIASHVPRLEREQALMKSITGVLNRGGRVLMPVFALGRAQELLLILDEYWG 275

Query: 254 EHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF-------ETSRDNA-----FL 299
           +H     YPIY+ + ++   +   ++++  M D+I + F       E S D A     + 
Sbjct: 276 KHRDYQRYPIYYASNLARKCMLVYQTYVGAMNDNIKRLFRERMAEAEASGDGAGKGGPWD 335

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
            K +  L +    ++   G  ++LAS   L+ G S ++   WA   KN V+ T     GT
Sbjct: 336 FKFIRSLKSIDRFEDV--GGCVMLASPGMLQNGVSRELLERWAPSEKNGVIITGYSVEGT 393

Query: 360 LARMLQADPPPKAVKVTMSR 379
           +A+ +  +  P+ ++  M+R
Sbjct: 394 MAKQIMQE--PEQIQAVMTR 411


>gi|451852830|gb|EMD66124.1| hypothetical protein COCSADRAFT_34708 [Cochliobolus sativus ND90Pr]
          Length = 872

 Score =  137 bits (346), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 99/370 (26%), Positives = 178/370 (48%), Gaps = 33/370 (8%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQY----LSRRQV 108
           ST+D +L+SH    H  +LPY + +      VF T P   +    + D      +S    
Sbjct: 74  STVDVLLISHFHVDHAASLPYVLAKTNFKGRVFMTHPTKAIYKWLIQDSVRVGNMSSNSE 133

Query: 109 SEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
           ++  ++T  D  + +  +  + +   + +SG    + + P+ AGH+LG  ++ +   G  
Sbjct: 134 TKIQMYTEADHLNTYPMIESIDFYTTHTVSG----VRITPYPAGHVLGAAMFLMEIAGLK 189

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRA 227
           +++  DY+R  ++HL    +   V+  VLIT++   +    PR +RE     AI+  L  
Sbjct: 190 ILFTGDYSREDDRHLVSASVPPGVKIDVLITESTFGISMHTPRVEREAQLMKAITDVLNR 249

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG  LLPV + GR  ELLLIL++YW++H      PIY+ + ++   +   ++++  M D+
Sbjct: 250 GGRALLPVFALGRAQELLLILDEYWSKHPEVQKIPIYYNSSLARKCMQVYQTYVSAMNDN 309

Query: 286 ITKSF-----------ETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFS 334
           I + F           +T R  A+  K V  L +    D+   G  ++LAS   +++G S
Sbjct: 310 IKRLFAERMAEAEAAGDTGRRGAWDFKFVRSLKSLERFDDL--GGCVMLASPGMMQSGTS 367

Query: 335 HDIFVEWASDVKNLVLFTERGQFGTLARMLQADP---PPKAVKVTMSRRVPLVGEELIAY 391
            ++   WA D +N V+ T     GT+A+ +  +P   P    + + + R P  G+     
Sbjct: 368 RELLERWAPDPRNGVIITGYSVEGTMAKQIVHEPDQIPAIMTRASNTARRP--GQR---- 421

Query: 392 EEEQTRLKKE 401
           E EQT + + 
Sbjct: 422 ENEQTMIPRR 431


>gi|449016323|dbj|BAM79725.1| cleavage and polyadenylation specifity factor protein
           [Cyanidioschyzon merolae strain 10D]
          Length = 749

 Score =  137 bits (346), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 100/353 (28%), Positives = 173/353 (49%), Gaps = 26/353 (7%)

Query: 29  NFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLGLS--APV 84
             L DCG +  +      P       S ID +L++H    H   LPY + Q  L+  A +
Sbjct: 34  TILFDCGVHPAYSGLAALPFFDEIDPSEIDVILITHFHLDHCAGLPYLVTQTNLNPRARI 93

Query: 85  FSTEP---VYRLGLLTMYDQYLSRRQVSEFD--LFTLDDIDSAFQSVTRLTYSQNYHLSG 139
             T P   VYR    ++   ++ R   S++   ++T  D++     +  + Y Q+  +SG
Sbjct: 94  LMTHPTKAVYR----SLIGDFV-RVGSSDYAGIIYTESDLNQTMARIECIDYHQHIDVSG 148

Query: 140 KGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLIT 199
               + ++ + AGH+LG  ++ +   G  V+Y  D++R++++HL    +   +   VLI 
Sbjct: 149 ----VRISAYNAGHVLGAAMFLVEVAGVSVLYTGDFSRQEDRHLMEAEIPRGIHIDVLIC 204

Query: 200 DAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS-- 256
           ++   +    PR+ RE  F   +++ ++ GG  LLPV + GR  ELLLILE+YW  H   
Sbjct: 205 ESTYGVQVHEPRRVREARFTQRVAEVVKRGGRCLLPVFALGRAQELLLILEEYWDAHPEL 264

Query: 257 LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAP 316
              PIY+ + ++   +    +++  M  +I + +     N F  K+V   +N   LD   
Sbjct: 265 QEIPIYYSSSIAKRCMAIYSTYIHQMNQNIQQRYRRF-GNPFAFKYV---MNIRSLDEFE 320

Query: 317 D-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           D GP + +AS   L++G S  +F +W SD +N V+       GTLA+ +  DP
Sbjct: 321 DSGPCVFMASPGMLQSGMSRRLFEKWCSDRRNGVILPGYSVQGTLAKYILTDP 373


>gi|256086716|ref|XP_002579538.1| cleavage and polyadenylation specificity factor-related
           [Schistosoma mansoni]
          Length = 670

 Score =  137 bits (346), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 92/356 (25%), Positives = 180/356 (50%), Gaps = 19/356 (5%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA---STIDAVLLSHPDTLHLGALPYAMKQL 78
           L++  G   ++DCG +         P         T D +L+SH    H G LP+ + + 
Sbjct: 30  LLTFKGKKIILDCGIHPGLRNRESLPFIDAIPDIQTTDLILISHFHLDHCGGLPHLLLKT 89

Query: 79  GLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNY 135
           G  +  +   +T+ +YR  LL  + +  +   + +  L++  DI ++   +  + + Q  
Sbjct: 90  GAKSKCYMTHATKAIYRY-LLADFVRVSNSGGLPDQLLYSDRDIVASLDHIDTIDFHQEL 148

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
            ++G    I  + + AGH+LG  ++ I   G  ++Y  D++R++++HL    +   +RP 
Sbjct: 149 EVNG----IKFSAYHAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMCAEIPP-IRPD 203

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT+A   +H    R++RE  F   +   +  GG  L+P  + GR  EL+LIL++YW  
Sbjct: 204 VLITEATYGIHIHDKREEREARFTSLVHDIVTRGGRCLIPAFALGRAQELMLILDEYWDN 263

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M + I    + + +N F  +H++ L +    
Sbjct: 264 HPELHDIPIYYASQLARKCMAVYQTYIYAMNERIRN--QLASNNPFCFRHISNLKSIEHF 321

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           D++  GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + + P
Sbjct: 322 DDS--GPCVVMASPGMMQSGLSRELFENWCTDKRNGVIIAGYCVEGTLAKQILSLP 375


>gi|341038970|gb|EGS23962.1| hypothetical protein CTHT_0006720 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 894

 Score =  137 bits (346), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 103/380 (27%), Positives = 183/380 (48%), Gaps = 30/380 (7%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           +++   G   ++D G +  +D     P       S +D +L+SH    H  +LPY + + 
Sbjct: 40  HIIQYKGKTVMLDAGQHPAYDGLAALPFFDDFDLSQVDVLLISHFHIDHAASLPYVLAKT 99

Query: 79  GLSAPVFSTEPVYRLGLLTMYDQY----LSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQN 134
                VF T P   +    + D       S    S+  ++T  D  + F  +  + Y   
Sbjct: 100 NFRGRVFMTHPTKAIYKWLIQDSVRVGNTSSNPTSQL-VYTEQDHLNTFPMIEAIDYYTT 158

Query: 135 YHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRP 194
           + +S     I + P+ AGH+LG  ++ I   G ++++  DY+R +++HL    +   V+ 
Sbjct: 159 HTISS----IRITPYPAGHVLGAAMFLIEIAGLNILFTGDYSREQDRHLVSAQVPKGVKI 214

Query: 195 AVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 253
            VLIT++   +    PR +RE     +I+  L  GG VL+PV + GR  ELLLIL++YWA
Sbjct: 215 DVLITESTYGIATHVPRLEREQALMKSITGILNRGGRVLMPVFALGRAQELLLILDEYWA 274

Query: 254 EHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF-------ETSRDNA-----FL 299
           +H     YPIY+ + ++   +   ++++  M D+I + F       E S D+A     + 
Sbjct: 275 KHKEYQKYPIYYASNLARKCMLVYQTYVGAMNDNIKRLFRERMAEAEASGDSAGKGGPWD 334

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
            K +  L +    ++   G  ++LAS   L+ G S ++   WA + KN V+ T     GT
Sbjct: 335 FKFIRSLKSIDRFEDV--GGCVMLASPGMLQNGVSRELLERWAPNEKNGVIITGYSVEGT 392

Query: 360 LARMLQADPPPKAVKVTMSR 379
           +A+ L  +  P+ ++  M+R
Sbjct: 393 MAKQLMQE--PEQIQAVMTR 410


>gi|189208340|ref|XP_001940503.1| endoribonuclease YSH1 [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187976596|gb|EDU43222.1| endoribonuclease YSH1 [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 871

 Score =  137 bits (345), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 98/370 (26%), Positives = 179/370 (48%), Gaps = 33/370 (8%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQY----LSRRQV 108
           ST+D +L+SH    H  +LPY + +      VF T P   +    + D      +S    
Sbjct: 74  STVDVLLISHFHVDHAASLPYVLAKTNFKGRVFMTHPTKAIYKWLIQDSVRVGNMSSNSE 133

Query: 109 SEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
           ++  ++T  D  + +  +  + +   + ++G    + + P+ AGH+LG  ++ +   G  
Sbjct: 134 TKIQMYTEADHLNTYPMIESIDFYTTHTVAG----VRITPYPAGHVLGAAMFLMEIAGLK 189

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRA 227
           +++  DY+R  ++HL    + + V+  VLIT++   +    PR +RE     AI+  L  
Sbjct: 190 ILFTGDYSREDDRHLVSASVPAGVKVDVLITESTFGISMHTPRVEREAQLMKAITDVLNR 249

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG  LLPV + GR  ELLLIL++YW++H      PIY+ + ++   +   ++++  M D+
Sbjct: 250 GGRALLPVFALGRAQELLLILDEYWSKHPEVQKIPIYYNSSLARKCMQVYQTYVSAMNDN 309

Query: 286 ITKSF-----------ETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFS 334
           I + F           +T R  A+  K V  L +    D+   G  ++LAS   +++G S
Sbjct: 310 IKRLFAERMAEAEAAGDTGRRGAWDFKFVRSLKSLERFDDL--GGCVMLASPGMMQSGTS 367

Query: 335 HDIFVEWASDVKNLVLFTERGQFGTLARMLQADP---PPKAVKVTMSRRVPLVGEELIAY 391
            ++   WA D +N V+ T     GT+A+ +  +P   P    + + + R P  G+     
Sbjct: 368 RELLERWAPDPRNGVIITGYSVEGTMAKQIVHEPDQIPAIMTRASNTARRP--GQR---- 421

Query: 392 EEEQTRLKKE 401
           E EQT + + 
Sbjct: 422 ENEQTMIPRR 431


>gi|326477880|gb|EGE01890.1| cleavage and polyadenylylation specificity factor [Trichophyton
           equinum CBS 127.97]
          Length = 1024

 Score =  137 bits (345), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 114/432 (26%), Positives = 178/432 (41%), Gaps = 105/432 (24%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQ--LGLSAPV 84
           G   L+D GW++ FD S+L+ L +   T+  +LL+H    HLGA  +  +   L +  P+
Sbjct: 27  GVKILVDVGWDESFDTSVLKELERHIPTLSLILLTHATPSHLGAFVHCCRTYPLFMQIPI 86

Query: 85  FSTEPVYRLGLLTMYDQYLSRRQVSEF---DLFTLDDIDSAFQSVTRLTYSQN---YHLS 138
           ++T PV   G   + + Y S    + F      T  D  S     +  + SQ    Y  +
Sbjct: 87  YATIPVIAFGRTYLQNLYASAPLAATFLPSTSVTASDPSSGLTIQSATSPSQGPSGYETT 146

Query: 139 GKGE---------------------------------------GIVVAPHVAGHLLGGTV 159
           G G                                        G+ +  + AGH +GGT+
Sbjct: 147 GSGRILLPPPTNEDIARYFSLIHPLKYSQPLQPLPSPFSPPLNGLTITAYNAGHTVGGTI 206

Query: 160 WKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAYNALHN 207
           W I    E ++YAVD+++ +E  + G             V+E   +P  LI  A      
Sbjct: 207 WHIQHGMESIVYAVDWSQARENVIAGAAWFGSSIGSGTEVIEQLRKPTALICSASGGDKF 266

Query: 208 QPP--RQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW-----AEHS--- 256
             P  R++R+ +  D I      GG VLLP DS+ RVLE+  +LE  W     +E S   
Sbjct: 267 ALPGGRKKRDGLLLDMIRSCAAKGGTVLLPTDSSARVLEIAYVLEHAWRGAADSEDSNDP 326

Query: 257 -LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE------------------------ 291
             N P+Y     +  T+   +S LEWM ++I + FE                        
Sbjct: 327 LKNTPLYLAGKKAHGTMRLARSMLEWMDENIVREFEGNDGVEATTGKAAGGTSTQPSKAA 386

Query: 292 TSRDNA--------FLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHDIFVEW 341
            S+ +A        F  KH+ L+ +K++LD      GPK++L+   SLE G S  +    
Sbjct: 387 QSQKSATGQKSLGPFTFKHLNLVEHKAKLDGILESKGPKVILSPDTSLEWGLSRHVLKHI 446

Query: 342 ASDVKNLVLFTE 353
           A   +NL++ TE
Sbjct: 447 AEGNENLIIMTE 458


>gi|320593246|gb|EFX05655.1| cleavage and polyadenylation specificity factor subunit [Grosmannia
           clavigera kw1407]
          Length = 857

 Score =  137 bits (345), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 103/380 (27%), Positives = 179/380 (47%), Gaps = 30/380 (7%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           +++   G   ++D G +  +D     P       ST+D +L+SH    H  +LPY + + 
Sbjct: 40  HIIQYKGKTVMLDAGQHPAYDGLASLPFFDDFDLSTVDVLLISHFHVDHAASLPYVLAKT 99

Query: 79  GLSAPVFSTEPVYRLGLLTMYDQYL---SRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNY 135
                VF T P   +    + D      +    +   ++T  D  S F+ +  +    +Y
Sbjct: 100 NFRGRVFMTHPTKAIYKWLIQDSVRVGNTSSNPTSQPVYTEQDHLSTFRQIEAI----DY 155

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H +     I + P+ AGH+LG  ++ I   G  +++  DY+R  ++HL    +   V+  
Sbjct: 156 HTTHTVSSIRITPYPAGHVLGAAMFLIEIAGLKIMFTGDYSRELDRHLVSATVPKGVKVD 215

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT++   + +  PR +RE     +I+  L  GG VL+PV + GR  ELLLIL++YW++
Sbjct: 216 VLITESTYGIASHVPRLEREQALMKSITGILNRGGRVLMPVFALGRAQELLLILDEYWSK 275

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA------------FLL 300
           HS   NYPIY+ + ++   +   +++   M D+I + +      A            +  
Sbjct: 276 HSDFQNYPIYYASNLAKKCMVVYQTYTGAMNDNIKRLYAERAKEAEATGNSAGGGGPWDF 335

Query: 301 KHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
           + +  L N   LD   D G  ++LAS   L+ G S ++   WA   KN V+ T     GT
Sbjct: 336 RFIRSLKN---LDRFEDIGGCVMLASPGMLQNGVSRELLERWAPSDKNGVIITGYSVEGT 392

Query: 360 LARMLQADPPPKAVKVTMSR 379
           +A+ +  +  P  ++  MSR
Sbjct: 393 MAKQIMQE--PDHIQAVMSR 410


>gi|452002411|gb|EMD94869.1| hypothetical protein COCHEDRAFT_1222148 [Cochliobolus
           heterostrophus C5]
          Length = 872

 Score =  137 bits (345), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 99/370 (26%), Positives = 178/370 (48%), Gaps = 33/370 (8%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQY----LSRRQV 108
           ST+D +L+SH    H  +LPY + +      VF T P   +    + D      +S    
Sbjct: 74  STVDVLLISHFHVDHAASLPYVLAKTNFKGRVFMTHPTKAIYKWLIQDSVRVGNMSSNSE 133

Query: 109 SEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
           ++  ++T  D  + +  +  + +   + +SG    + + P+ AGH+LG  ++ +   G  
Sbjct: 134 TKIQMYTEADHLNTYPMIESIDFYTTHTVSG----VRITPYPAGHVLGAAMFLMEIAGLK 189

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRA 227
           +++  DY+R  ++HL    +   V+  VLIT++   +    PR +RE     AI+  L  
Sbjct: 190 ILFTGDYSREDDRHLVSASVPPGVKIDVLITESTFGISMHTPRVEREAQLMKAITDVLNR 249

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG  LLPV + GR  ELLLIL++YW++H      PIY+ + ++   +   ++++  M D+
Sbjct: 250 GGRALLPVFALGRAQELLLILDEYWSKHPEVQKIPIYYNSSLARKCMQVYQTYVSAMNDN 309

Query: 286 ITKSF-----------ETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFS 334
           I + F           +T R  A+  K V  L +    D+   G  ++LAS   +++G S
Sbjct: 310 IKRLFAERMAEAEAAGDTGRRGAWDFKFVRSLKSLERFDDL--GGCVMLASPGMMQSGTS 367

Query: 335 HDIFVEWASDVKNLVLFTERGQFGTLARMLQADP---PPKAVKVTMSRRVPLVGEELIAY 391
            ++   WA D +N V+ T     GT+A+ +  +P   P    + + + R P  G+     
Sbjct: 368 RELLERWAPDPRNGVIITGYSVEGTMAKHIVHEPDQIPAIMTRASNTARRP--GQR---- 421

Query: 392 EEEQTRLKKE 401
           E EQT + + 
Sbjct: 422 ENEQTMIPRR 431


>gi|429966185|gb|ELA48182.1| hypothetical protein VCUG_00420 [Vavraia culicis 'floridensis']
          Length = 669

 Score =  137 bits (345), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 104/388 (26%), Positives = 179/388 (46%), Gaps = 37/388 (9%)

Query: 4   SVQVTPLSGVFNENPLSYL-VSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLL 60
           ++ + PL G  NE   S + ++    + L+DCG +  +  +   P   +   ST+DAV +
Sbjct: 6   NLTIMPL-GAGNEVGRSCIHITYKSLSILLDCGVHPAYTGTSSLPFLDLINLSTVDAVFI 64

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           +H    H GALPY  ++   +  VF T P   +    + D        +E D ++  D++
Sbjct: 65  THFHLDHAGALPYLTEKTNFAGKVFMTHPTKAILRWLLNDYIRIINANTEIDFYSEKDLN 124

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           + +  +  + Y+Q   +    +   V+   AGH+LG  ++ I  D   ++Y  DY+  ++
Sbjct: 125 NCYDKIIAIDYNQTVVV----KDFKVSALNAGHVLGAAMFMIENDRVKILYTGDYSTEED 180

Query: 181 KHLNGTVL-----------------ESFVRPAVLITDAYNALHNQPPRQQREM-FQDAIS 222
           +HL G                    E+     VLI ++   +    PR++RE  F   ++
Sbjct: 181 RHLKGADTAWISKYGNMDEKEHSNDETVHHLDVLICESTYGVQCHLPREERERRFTQVVN 240

Query: 223 KTLRAGGNVLLPVDSAGRVLELLLILEDYWAE--HSLNYPIYFLTYVSSSTIDYVKSFLE 280
             +  GG  LLPV + GR  ELLLILEDYW    H  N PIY+ + +++  +   +++  
Sbjct: 241 DIVTRGGKCLLPVFALGRAQELLLILEDYWDRNPHLHNIPIYYASALANRCLSIYQAYTH 300

Query: 281 WMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVE 340
            M   I K       +AF  KH+  L  KS  ++      +V+AS   L++G S ++F  
Sbjct: 301 MMNLKIKK-------DAFNFKHIRNL--KSVDNHLIKNACVVMASPGMLQSGLSRELFES 351

Query: 341 WASDVKNLVLFTERGQFGTLARMLQADP 368
           W  D  N  +       GTLA+ +  +P
Sbjct: 352 WCEDANNGTVIPGYCVQGTLAKEIMTEP 379


>gi|346972312|gb|EGY15764.1| endoribonuclease YSH1 [Verticillium dahliae VdLs.17]
          Length = 837

 Score =  137 bits (345), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 102/379 (26%), Positives = 179/379 (47%), Gaps = 28/379 (7%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           +++   G   ++D G +  +D     P       ST+D +L+SH    H  +LPY + + 
Sbjct: 41  HIIQYKGKTVMLDAGQHPAYDGLAALPFFDDFDLSTVDVLLISHFHVDHAASLPYVLAKT 100

Query: 79  GLSAPVFSTEPVYRLGLLTMYDQYL---SRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNY 135
                VF T P   +    + D      +    S   ++T  D  + F  +  + Y   +
Sbjct: 101 NFRGRVFMTHPTKAIYKWLIQDSVRVGNTSSNPSTQPVYTEADHMNTFPQIEAIDYHTTH 160

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
            +S     I + P+ AGH+LG  ++ I   G  + +  DY+R +++HL    +   V+  
Sbjct: 161 TISS----IRITPYPAGHVLGAAMFLIEIAGLKIFFTGDYSREQDRHLVSAEVPKGVKID 216

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT++   + +  PR +RE     +I+  L  GG VL+PV + GR  ELLLIL++YW +
Sbjct: 217 VLITESTYGIASHVPRVEREQALVKSITGILNRGGRVLMPVFALGRAQELLLILDEYWGK 276

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF-------ETSRDNA-----FLL 300
           H     YPIY+ + ++   +   ++++  M D+I + F       E S D +     +  
Sbjct: 277 HPDYQKYPIYYASNLARKCMVVYQTYVGAMNDNIKRLFREGMAQAEASGDGSGKGGPWDF 336

Query: 301 KHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTL 360
            ++  L N    D+   G  ++LAS   L+ G S ++   WA + KN V+ T     GT+
Sbjct: 337 NYIRSLKNLDRFDDL--GGCVMLASPGMLQNGVSRELLERWAPNDKNGVIITGYSVEGTM 394

Query: 361 ARMLQADPPPKAVKVTMSR 379
           A+ +  +  P  ++  MSR
Sbjct: 395 AKQIMQE--PDQIQAVMSR 411


>gi|448122146|ref|XP_004204382.1| Piso0_000226 [Millerozyma farinosa CBS 7064]
 gi|358349921|emb|CCE73200.1| Piso0_000226 [Millerozyma farinosa CBS 7064]
          Length = 948

 Score =  137 bits (345), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 147/568 (25%), Positives = 240/568 (42%), Gaps = 110/568 (19%)

Query: 22  LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGA-------LPY 73
           L+S D     L D  WN   +P  +  L K     D +LLSH     +          PY
Sbjct: 20  LLSFDNEIKILADPSWNGK-NPDSILYLEKYLKETDLILLSHATAEFISGYVLLCVKFPY 78

Query: 74  AMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEF--DLFTLDDIDSAFQSVTRLTY 131
            M  +     V+ST PV +LG ++  + Y S   +      +   D++D  F  V  L Y
Sbjct: 79  LMSNIA----VYSTLPVNQLGRISTIEYYRSSGILGPLKDSILEADEVDEWFDKVKPLKY 134

Query: 132 SQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLES- 190
            Q  +L      +V+ P+ AGH LGGT W +T+  E VIYA  +N  K+  LN     S 
Sbjct: 135 MQTLNLFD--SKMVITPYNAGHTLGGTFWLLTRQLEKVIYAPAWNHSKDSFLNNATFLSS 192

Query: 191 --------FVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVL 242
                    +RP  LIT+  +       +++ E F   +  TL  GG VLLP   AGR L
Sbjct: 193 STGNPSSQLLRPTALITNT-DLGSTMSHKKRTEKFLSLVDATLANGGTVLLPTSLAGRFL 251

Query: 243 ELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA----- 297
           ELL +++ +    S   P+YFL+Y  +  ++Y  + LEWM   + K +E +  +      
Sbjct: 252 ELLHLVDQHL--QSAPIPVYFLSYSGTRVLNYASNLLEWMSGQLIKEWEEASSSTNNSSN 309

Query: 298 -----FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAG-FSHDIFVEWASDVKNLVLF 351
                F    V LL + +EL     GPK+V  S    + G  S ++      D K  ++ 
Sbjct: 310 KNNFPFDPSKVDLLSDPNELIQL-SGPKIVFCSGLDFKDGDVSFEVLSYLCQDEKTTIIL 368

Query: 352 TERGQFGT--------------LARMLQ--------ADPPPKAVKV-TMSRRVPLVGEEL 388
           TE+  FG+              LA+           A P  K + +   ++  PL+G +L
Sbjct: 369 TEKTHFGSDDTINSQLYREWYDLAKQRNGGLVEDGAAVPLEKIINLQNWTKEEPLIGSDL 428

Query: 389 IAYEEEQTRLKKEEALKASLVKEEESKASLGPD--------------------------- 421
             ++E  ++ +K+  L    V++ +++  L  D                           
Sbjct: 429 SDFQERISQQRKQRLLAK--VRDRKNQNLLNADTLSDDDSSDEEENTTDEESEALKMTST 486

Query: 422 ----NNLSGD----PMVID---ANNANASADVVEP-HGGRYRDILIDGFVPPSTSVAPMF 469
               N+++G+    P+ +D   ++ A  S+ + +     R  D+ I   + P  +   MF
Sbjct: 487 TIKSNSVTGNNTTAPVRVDDLSSHEAFISSHIKQTLQDNRPLDLKITYKLKPRHA---MF 543

Query: 470 PFY--ENNSEWDDFGEVINPDDYIIKDE 495
           PF    +  + DD+GE+IN +D+   D+
Sbjct: 544 PFMVVSHKPKVDDYGEMINIEDFQKNDD 571


>gi|340521586|gb|EGR51820.1| predicted protein [Trichoderma reesei QM6a]
          Length = 887

 Score =  137 bits (344), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 98/343 (28%), Positives = 167/343 (48%), Gaps = 28/343 (8%)

Query: 59  LLSHPDTLHL---GALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSE--FDL 113
           LL+  D+ H+    +LPY + +      VF T P   +    + D        S     L
Sbjct: 115 LLTRGDSFHIDHAASLPYVLAKTNFRGRVFMTHPTKAIYKWLIQDSVRVGNTASNSATQL 174

Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAV 173
           +T  D  + F  +  + Y   + +S     I + P+ AGH+LG  ++ I   G ++ +  
Sbjct: 175 YTEQDHLNTFPQIEAIDYHTTHTISS----IRITPYPAGHVLGAAMFLIEIAGLNIFFTG 230

Query: 174 DYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVL 232
           DY+R +++HL    +   ++  VLIT++   + +  PR +RE     +I+  L  GG  L
Sbjct: 231 DYSREQDRHLVSAEVPKGIKIDVLITESTYGIASHVPRLEREQALMKSITGILNRGGRAL 290

Query: 233 LPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF 290
           LPV + GR  ELLLIL++YWA+H     +PIY+ + ++   +   ++++  M D+I + F
Sbjct: 291 LPVFALGRAQELLLILDEYWAKHPEYQKFPIYYASNLARKCMVIYQTYVGAMNDNIKRLF 350

Query: 291 -------ETSRDNA-----FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIF 338
                  E S D+A     +  K++  L N    D+   G  ++LAS   L+ G S ++F
Sbjct: 351 RERMAEAEASGDSAGKNGPWDFKYIRSLKNLDRFDDV--GGCVMLASPGMLQNGVSRELF 408

Query: 339 VEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRV 381
             WA   KN V+ T     GT+AR +  +  P  ++  MSR +
Sbjct: 409 ERWAPSEKNGVIITGYSVEGTMARQIMQE--PDQIQAVMSRSI 449


>gi|164658265|ref|XP_001730258.1| hypothetical protein MGL_2640 [Malassezia globosa CBS 7966]
 gi|159104153|gb|EDP43044.1| hypothetical protein MGL_2640 [Malassezia globosa CBS 7966]
          Length = 741

 Score =  137 bits (344), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 89/288 (30%), Positives = 149/288 (51%), Gaps = 10/288 (3%)

Query: 84  VFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEG 143
           V+ T P   +    M D        S+  LF   ++ ++++ +  + Y Q   L G   G
Sbjct: 13  VYMTHPTKAIYRFLMSDFVRISNAGSDRMLFDEAEMLASWRQIEAVDYHQEVVLGG---G 69

Query: 144 IVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYN 203
           +   P+ AGH+LG  ++ I   G  V+Y  DY+R +++HL    +   +RP VLI ++  
Sbjct: 70  LRFTPYHAGHVLGACMFMIDMAGLRVLYTGDYSREEDRHLVQAEVPP-MRPDVLICESTY 128

Query: 204 ALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYP 260
              +  PR  +EM F   I   +R GG VLLPV   GR  ELLL+L++YW  H    + P
Sbjct: 129 GTQSLEPRLDKEMRFTSLIHSIIRRGGRVLLPVFVLGRAQELLLLLDEYWEAHPELHSVP 188

Query: 261 IYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPK 320
           IY+ + ++   +   ++++  M   I   F   RDN F+ KHV+ L +  + D+   GP 
Sbjct: 189 IYYASSLARKCMSIYQTYIHTMNQHIRARFH-RRDNPFVFKHVSNLRSLDKFDD--KGPC 245

Query: 321 LVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           +++AS   +++G S ++   WA D +N V+ +     GT+AR + +DP
Sbjct: 246 VMMASPGFMQSGISRELLERWAPDKRNGVIVSGYSVEGTMARDILSDP 293


>gi|302661813|ref|XP_003022569.1| hypothetical protein TRV_03308 [Trichophyton verrucosum HKI 0517]
 gi|291186522|gb|EFE41951.1| hypothetical protein TRV_03308 [Trichophyton verrucosum HKI 0517]
          Length = 1024

 Score =  137 bits (344), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 113/432 (26%), Positives = 175/432 (40%), Gaps = 105/432 (24%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G   L+D GW++ FD S+L+ L +   T+  +LL+H    HLGA  +  +   L    P+
Sbjct: 27  GVKILVDVGWDESFDTSVLKELERHIPTLSLILLTHATPSHLGAFVHCCRAYPLFTQIPI 86

Query: 85  FSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTL---------------------------- 116
           ++T PV   G   + + Y S    + F   T                             
Sbjct: 87  YATIPVIAFGRTYLQNLYASAPLAATFLPSTSVTASDPSSGLTIQSATSSAQGPSGYENT 146

Query: 117 ------------DDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGHLLGGTV 159
                       +DI   F  +  L YSQ              G+ +  + AGH +GGT+
Sbjct: 147 GSGRILLPPPTNEDIARYFSLIHPLKYSQPLQPLPSPFSPPLNGLTITAYNAGHTVGGTI 206

Query: 160 WKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAYNALHN 207
           W I    E ++YAVD+++ +E  + G             V+E   +P  LI  A      
Sbjct: 207 WHIQHGMESIVYAVDWSQARENVIAGAAWFGSSIGSGTEVIEQLRKPTALICSASGGDKF 266

Query: 208 QPP--RQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS-------- 256
             P  R++R+ +  D I      GG VLLP DS+ RVLE+  +LE  W E +        
Sbjct: 267 ALPGGRKKRDGLLLDMIRSCAAKGGTVLLPTDSSARVLEIAYVLEHAWREAADSEDSNDP 326

Query: 257 -LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE------------------------ 291
             N P+Y     +  T+   +S LEWM ++I + FE                        
Sbjct: 327 LKNTPLYLAGKKAHGTMRLARSMLEWMDENIVREFEGNDGVEATTGKAAGGASNQPSKGA 386

Query: 292 TSRDNA--------FLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHDIFVEW 341
            S+ +A        F  KH+ L+ +K++LD      GPK++L+   SLE G S  +    
Sbjct: 387 QSQKSATGQKSLGPFTFKHLNLVEHKAKLDGILESKGPKVILSPDTSLEWGLSKHVLKHI 446

Query: 342 ASDVKNLVLFTE 353
           A   +NL++ TE
Sbjct: 447 AEGNENLIIMTE 458


>gi|259148102|emb|CAY81351.1| Cft2p [Saccharomyces cerevisiae EC1118]
          Length = 859

 Score =  137 bits (344), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 136/494 (27%), Positives = 214/494 (43%), Gaps = 85/494 (17%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPS------LLQPLSKVASTIDAVLLSHPDTLHLGA---LP 72
           +V  D    LID GWN    PS       ++   KV   ID ++LS P    LGA   L 
Sbjct: 19  VVRFDNVTLLIDPGWN----PSKVSYEQCIKYWEKVIPEIDVIILSQPTIECLGAHSLLY 74

Query: 73  YAMKQLGLSA-PVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLD--DIDSAFQSVTRL 129
           Y      +S   V++T PV  LG ++  D Y S   +  +D   LD  DI+ +F  +  L
Sbjct: 75  YNFTSHFISRIQVYATLPVINLGRVSTIDSYASAGVIGPYDTNKLDLEDIEISFDHIVPL 134

Query: 130 TYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN----- 184
            YSQ   L  + +G+ +  + AG   GG++W I+   E ++YA  +N  ++  LN     
Sbjct: 135 KYSQLVDLRSRYDGLTLLAYNAGVCPGGSIWCISTYSEKLVYAKRWNHTRDNILNAASIL 194

Query: 185 ---GTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRV 241
              G  L + +RP+ +IT       +QP +++ ++F+D + K L + G+V++PVD +G+ 
Sbjct: 195 DATGKPLSTLMRPSAIITTLDRFGSSQPFKKRSKIFKDTLKKGLSSDGSVIIPVDMSGKF 254

Query: 242 LELL-----LILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN 296
           L+L      L+ E          P+  L+Y    T+ Y KS LEW+  S+ K++E +R+N
Sbjct: 255 LDLFTQVHELLFESTKINAHTQVPVLILSYARGRTLTYAKSMLEWLSPSLLKTWE-NRNN 313

Query: 297 A--FLLKHVTLLINKSELDNAPDGPKLVLASMA------------------------SLE 330
              F +     +I  +EL   P G K+   S                          S E
Sbjct: 314 TSPFEIGSRIKIIAPNELSKYP-GSKICFVSEVGALINEVIIKVGNSEKTTLILTKPSFE 372

Query: 331 AGFSHDIFVEWA-SDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELI 389
              S D  +E    D +N   F E G+       +  D           +  PL  EE  
Sbjct: 373 CASSLDKILEIVEQDERNWKTFPEDGKSFLCDNYISID---------TIKEEPLSKEETE 423

Query: 390 AYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGR 449
           A++ +    K+E   K  LVK E  K +       +G+ ++ D N   A          R
Sbjct: 424 AFKVQLKEKKRERNKKILLVKRESKKLA-------NGNAIIDDTNGERAM---------R 467

Query: 450 YRDILIDGF--VPP 461
            +DIL++    VPP
Sbjct: 468 NQDILVENVNGVPP 481


>gi|67517547|ref|XP_658594.1| hypothetical protein AN0990.2 [Aspergillus nidulans FGSC A4]
 gi|74598547|sp|Q5BEP0.1|YSH1_EMENI RecName: Full=Endoribonuclease ysh1; AltName: Full=mRNA
           3'-end-processing protein ysh1
 gi|40746402|gb|EAA65558.1| hypothetical protein AN0990.2 [Aspergillus nidulans FGSC A4]
 gi|259488717|tpe|CBF88384.1| TPA: Endoribonuclease ysh1 (EC 3.1.27.-)(mRNA 3'-end-processing
           protein ysh1) [Source:UniProtKB/Swiss-Prot;Acc:Q5BEP0]
           [Aspergillus nidulans FGSC A4]
          Length = 884

 Score =  136 bits (343), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 100/361 (27%), Positives = 172/361 (47%), Gaps = 19/361 (5%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H  ALPY + +      VF T     +    + D        S  D
Sbjct: 74  STVDILLISHFHVDHSSALPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVNNTASSSD 133

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
             T    +    S   L  + +++ +     I + P+ AGH+LG  ++ I+  G ++++ 
Sbjct: 134 QRTTLYTEHDHLSTLPLIETIDFNTTHTINSIRITPYPAGHVLGAAMFLISIAGLNILFT 193

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNV 231
            DY+R +++HL    +   V+  VLIT++   + + PPR +RE     +I+  L  GG V
Sbjct: 194 GDYSREEDRHLIPATVPRGVKIDVLITESTFGISSNPPRLEREAALMKSITGVLNRGGRV 253

Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           L+PV + GR  ELLLILE+YW  H      PIY++   +   +   ++++  M D+I + 
Sbjct: 254 LMPVFALGRAQELLLILEEYWETHPELQKIPIYYIGNTARRCMVVYQTYIGAMNDNIKRL 313

Query: 290 F-------ETSRDNA-----FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
           F       E S D +     +  K+V  L +    D+   G  ++LAS   L+ G S ++
Sbjct: 314 FRQRMAEAEASGDKSVSAGPWDFKYVRSLRSLERFDDV--GGCVMLASPGMLQTGTSREL 371

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTR 397
              WA + +N V+ T     GT+A+ L  +  P  +   MSR    +G   +   +E+ +
Sbjct: 372 LERWAPNERNGVVMTGYSVEGTMAKQLLNE--PDQIHAVMSRAATGMGRTRMNGNDEEQK 429

Query: 398 L 398
           +
Sbjct: 430 I 430


>gi|71654879|ref|XP_816051.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
           strain CL Brener]
 gi|70881152|gb|EAN94200.1| cleavage and polyadenylation specificity factor, putative
           [Trypanosoma cruzi]
          Length = 430

 Score =  136 bits (343), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 101/365 (27%), Positives = 174/365 (47%), Gaps = 19/365 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPL----SKVASTIDAVLL 60
           V++ P+           ++   G + ++DCG  +H   S L  L    S     ID VL+
Sbjct: 38  VEILPIGSGGEVGRSCVILRYKGRSVMLDCG--NHPAKSGLDSLPFFDSIRCDEIDLVLI 95

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           +H    H GALPY  +Q      VF T        + M D    R   S  D+ T + + 
Sbjct: 96  THFHLDHCGALPYFCEQTAFKGRVFMTSATKAFYKMVMND--FLRVGASANDIVTNEWLQ 153

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           S  + +  + Y +   ++G    I   P  AGH+LG  ++ +   G   +Y  D++R  +
Sbjct: 154 STIEKIETVEYHEEVTVNG----IRFQPFNAGHVLGAALFMVDIAGMKTLYTGDFSRVPD 209

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAG 239
           +HL G  + S+  P +LI ++ N +     R++R  +F   +   ++ GG  L+PV + G
Sbjct: 210 RHLLGAEVPSY-SPDILIAESTNGIRELESREERETLFTTWVHDVVKGGGRCLVPVFALG 268

Query: 240 RVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA 297
           R  ELLLILE+YW  H    + PIY+ + ++   +   ++F+  M D + +     R N 
Sbjct: 269 RAQELLLILEEYWEAHKELQHIPIYYASSLAQRCMKLYQTFVSAMNDRVKQQHANHR-NP 327

Query: 298 FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQF 357
           F+ K++  L+     ++   GP +VLAS   L++G S ++F  W  D +N ++       
Sbjct: 328 FVFKYIHSLMETRSFEDT--GPCVVLASPGMLQSGISLELFERWCGDRRNGIIIAGYCVD 385

Query: 358 GTLAR 362
           GT+A+
Sbjct: 386 GTIAK 390


>gi|367054168|ref|XP_003657462.1| hypothetical protein THITE_2123200 [Thielavia terrestris NRRL 8126]
 gi|347004728|gb|AEO71126.1| hypothetical protein THITE_2123200 [Thielavia terrestris NRRL 8126]
          Length = 859

 Score =  136 bits (343), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 106/387 (27%), Positives = 182/387 (47%), Gaps = 32/387 (8%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           +++   G   ++D G +  +D     P       ST+D +L+SH    H  +LPY + + 
Sbjct: 41  HIIQYKGKTVMLDAGQHPAYDGLAALPFFDDFDLSTVDVLLISHFHIDHAASLPYVLAKT 100

Query: 79  GLSAPVFSTEPVYRLGLLTMYDQY----LSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQN 134
                VF T P   +    + D       S    ++  ++T  D  + F  +  + Y   
Sbjct: 101 NFRGRVFMTHPTKAIYKWLIQDSVRVGNTSSNPTTQL-VYTEQDHLNTFPMIEAIDYHTT 159

Query: 135 YHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRP 194
           + +S     I + P+ AGH+LG  ++ I   G ++++  DY+R +++HL    +   V+ 
Sbjct: 160 HTISS----IRITPYPAGHVLGAAMFLIEIAGLNILFTGDYSREQDRHLVSAEVPKGVKI 215

Query: 195 AVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 253
            VLIT++   + +  PR +RE     +I+  L  GG VL+PV + GR  ELLLIL++YW 
Sbjct: 216 DVLITESTYGVASHIPRLEREQALMKSITGILNRGGRVLMPVFALGRAQELLLILDEYWG 275

Query: 254 EHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF-------ETSRDNA-----FL 299
           +H     YPIY+ + ++   +   ++++  M D+I + F       E S D A     + 
Sbjct: 276 KHKEYQKYPIYYASNLARKCMLVYQTYVGAMNDNIKRLFRERMAEAEASGDAAGKGGPWD 335

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
            K +  L +    D+   G  ++LAS   L+ G S ++   WA   KN V+ T     GT
Sbjct: 336 FKFIRSLKSIDRFDDV--GGCVMLASPGMLQNGVSRELLERWAPSEKNGVIITGYSVEGT 393

Query: 360 LARMLQADPPPKAVKVTMS----RRVP 382
           +A+ L  +P      +T S    RR P
Sbjct: 394 MAKQLMQEPDQIQAVMTRSSAGGRRAP 420


>gi|223997482|ref|XP_002288414.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220975522|gb|EED93850.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 557

 Score =  136 bits (343), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 112/397 (28%), Positives = 181/397 (45%), Gaps = 25/397 (6%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSH 62
           + +TPL          +L++      L+DCG +  +D     P         +D +L++H
Sbjct: 5   MTITPLGSGQEVGRSCHLLTFRSTTILLDCGIHPGYDGMAGLPFFDRVDPEQVDVLLITH 64

Query: 63  PDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLSRRQVS--------EF 111
               H  +LPY  ++ G    +F T P   V RL LL  Y + +  ++ S        + 
Sbjct: 65  FHLDHAASLPYFTERTGFKGRIFMTHPTKAVIRL-LLGDYLKLMMMKKGSGGADKDDNQD 123

Query: 112 DLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIY 171
            L+T  D+ S    +  + Y Q   L+    G+      AGH+LG  ++ I   G  V+Y
Sbjct: 124 VLYTEADLQSCVDKIELIDYHQTIDLN-LPSGLKFHALNAGHVLGAAMFFIEVGGRSVLY 182

Query: 172 AVDYNRRKEKHLNGTVLESF-VRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGG 229
             DY+  +++HL    L  +   P +LI ++   +     R +RE  F   I + +  GG
Sbjct: 183 TGDYSMEEDRHLMAAELPKYHASPDLLIVESTYGVQVHASRAEREARFTGTIERIVTGGG 242

Query: 230 NVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSIT 287
             L+PV + GR  ELLLIL++YW EH    + PIY+ + ++S  +   +++   M   I 
Sbjct: 243 RCLIPVFALGRAQELLLILDEYWQEHPHLQSIPIYYASKMASRALRVYQTYANMMNARIR 302

Query: 288 KSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVK 346
              +    N F   H+  L    +++N  D GP +V AS   L++G S  +F  WA D K
Sbjct: 303 AQMDLG--NPFHFSHIRNL-KSIDVNNFDDRGPSVVFASPGMLQSGVSRQLFDRWAGDPK 359

Query: 347 NLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
           N V+        TLA+ + +   PK V     RR PL
Sbjct: 360 NGVMLAGYAVEHTLAKEIMSQ--PKEVVTLEGRRQPL 394


>gi|448118544|ref|XP_004203525.1| Piso0_001136 [Millerozyma farinosa CBS 7064]
 gi|448120951|ref|XP_004204108.1| Piso0_001136 [Millerozyma farinosa CBS 7064]
 gi|359384393|emb|CCE79097.1| Piso0_001136 [Millerozyma farinosa CBS 7064]
 gi|359384976|emb|CCE78511.1| Piso0_001136 [Millerozyma farinosa CBS 7064]
          Length = 809

 Score =  136 bits (343), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 106/360 (29%), Positives = 176/360 (48%), Gaps = 42/360 (11%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLS----- 104
           S +D +L+SH    H  +LPY M+       VF   +T+ +YR  LL+ + +  S     
Sbjct: 64  SKVDILLISHFHLDHAASLPYVMQHTNFKGRVFMTHATKAIYRW-LLSDFVKVTSIGGGG 122

Query: 105 ---------RRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLL 155
                        S  +L+T DD+  +F  +  +    +YH + + +GI    + AGH+L
Sbjct: 123 DPRMNNDDSSLNTSSGNLYTDDDLMRSFDRIETI----DYHSTIEVDGIRFTAYHAGHVL 178

Query: 156 GGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE 215
           G  ++ I   G  V++  D++  +++HL    +   V+P +LI+++        PR ++E
Sbjct: 179 GACMYLIEIGGLKVLFTGDFSCEEDRHLQVAEIPP-VKPDILISESTFGTATHEPRLEKE 237

Query: 216 -MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA----EHSLNYPIYFLTYVSSS 270
                 I  TL  GG +L+PV + GR  ELLLILE+YW      H++N  IYF + ++  
Sbjct: 238 ARMTSIIHSTLLKGGRILMPVFALGRAQELLLILEEYWGLNDDLHNIN--IYFASSLARK 295

Query: 271 TIDYVKSFLEWMGDSITKSFETS----RDNAFLLKHVTLLINKSELDNAPD-GPKLVLAS 325
            +   +++   M DSI  S  ++    + N F  K++    N   LD   D GP +V+AS
Sbjct: 296 CMAVYQTYTNIMNDSIRLSTSSTNSGEKRNPFQFKYIK---NIRSLDKFQDFGPCVVVAS 352

Query: 326 MASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPP----PKAVKVTMSRRV 381
              L+ G S ++   WA D +N V+ T     GT+A+ L  +PP         VT+ RR+
Sbjct: 353 PGMLQNGVSRELLERWAPDPRNAVIMTGYSVEGTMAKELLTEPPTIQSATNADVTIPRRI 412


>gi|323347464|gb|EGA81734.1| Cft2p [Saccharomyces cerevisiae Lalvin QA23]
          Length = 859

 Score =  136 bits (343), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 101/331 (30%), Positives = 162/331 (48%), Gaps = 33/331 (9%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPS------LLQPLSKVASTIDAVLLSHPDTLHLGA---LP 72
           +V  D    LID GWN    PS       ++   KV   ID ++LS P    LGA   L 
Sbjct: 19  VVRFDNVTLLIDPGWN----PSKVSYEQCIKYWEKVIPEIDVIILSQPTIECLGAHSLLY 74

Query: 73  YAMKQLGLSA-PVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLD--DIDSAFQSVTRL 129
           Y      +S   V++T PV  LG ++  D Y S   +  +D   LD  DI+ +F  +  L
Sbjct: 75  YNFTSHFISRIQVYATLPVINLGRVSTIDSYASAGVIGPYDTNKLDLEDIEISFDHIVPL 134

Query: 130 TYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN----- 184
            YSQ   L  + +G+ +  + AG   GG++W I+   E ++YA  +N  ++  LN     
Sbjct: 135 KYSQLVDLRSRYDGLTLLAYNAGVCPGGSIWCISTYSEKLVYAKRWNHTRDNILNAASIL 194

Query: 185 ---GTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRV 241
              G  L + +RP+ +IT       +QP +++ ++F+D + K L + G+V++PVD +G+ 
Sbjct: 195 DATGKPLSTLMRPSAIITTLDRFGSSQPFKKRSKIFKDTLKKGLSSDGSVIIPVDMSGKF 254

Query: 242 LELL-----LILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN 296
           L+L      L+ E          P+  L+Y    T+ Y KS LEW+  S+ K++E +R+N
Sbjct: 255 LDLFTQVHELLFESTKINAHTQVPVLILSYARGRTLTYAKSMLEWLSPSLLKTWE-NRNN 313

Query: 297 A--FLLKHVTLLINKSELDNAPDGPKLVLAS 325
              F +     +I  +EL   P G K+   S
Sbjct: 314 TSPFEIGSRIKIIAPNELSKYP-GSKICFVS 343


>gi|254581424|ref|XP_002496697.1| ZYRO0D06028p [Zygosaccharomyces rouxii]
 gi|238939589|emb|CAR27764.1| ZYRO0D06028p [Zygosaccharomyces rouxii]
          Length = 835

 Score =  136 bits (343), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 147/555 (26%), Positives = 244/555 (43%), Gaps = 95/555 (17%)

Query: 17  NPLSYLVSIDGFNFLIDCGW---NDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGA--- 70
           N +  +V  D    LID GW      ++ S+ +  S +   ++ +LLS      LGA   
Sbjct: 14  NTIGTIVRFDNVTILIDPGWFSSKVSYEDSV-KYWSNLIPEVNIILLSQSSVDCLGAYTM 72

Query: 71  -----LPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDL--FTLDDIDSAF 123
                LP+ + ++     V++T PV  LG ++ +D Y SR  V  +D     +DD++ AF
Sbjct: 73  LYHNFLPHFISRI----QVYATLPVTNLGRVSTFDLYASRGLVGPYDTNQIDVDDVERAF 128

Query: 124 QSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHL 183
           + +  L YSQ   L  K +G+ +  + +G   GG++W I+   E +IYA  +N  ++  L
Sbjct: 129 EHIESLKYSQLVDLRSKFDGLTLVAYNSGVSPGGSIWCISTYLEKLIYARRWNHTRDTIL 188

Query: 184 NGTVL--------ESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPV 235
           NG  L         + +RP+ +IT        +P  ++   F+D++ + L + G++L+PV
Sbjct: 189 NGASLLDGSGKPISTLLRPSAIITTFEKFGSPKPHARRMRCFKDSMKQALTSNGSILIPV 248

Query: 236 DSAGRVLELLLILEDYWAEHSLN-----YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF 290
           +  G  L++L+ + D+  E+S N      P+  ++Y     + Y KS LEW+  S  K++
Sbjct: 249 EMGGNFLDILVSVHDFLYENSKNKLYSQVPVILVSYSRGRALTYAKSMLEWLSSSAIKTW 308

Query: 291 ETSRDNA--FLLKHVTLLINKSELDNAPDGPKLVLASMAS--LEAGFSHDIFVEWASDVK 346
           E SRDN   F L     +    EL N   G K+   S     ++    H   +E A+ + 
Sbjct: 309 E-SRDNRTPFDLGRRFHVATPEELTNY-SGSKICFVSQVDSLVDEVIKHLCQLERATIL- 365

Query: 347 NLVLFTERGQFGTLARML-------------QADPPPKAVKVTMS--RRVPLVGEELIAY 391
            L  FT+ G    LA M              +  P   +  +T+   +  PLV +EL  Y
Sbjct: 366 -LPGFTQ-GYPSALATMYKKWEQASKQQNLEEGKPVSYSGHITLKNIKLDPLVNKELEHY 423

Query: 392 EEEQT-RLKKEEALKASLVKEEESKASL-----GPDN----------------------- 422
            E+ T R    + L A+L++E +   S+     G  N                       
Sbjct: 424 LEQVTERRDSRQELTATLIREAKKTNSIETFAGGAANGQPGALGLGGIGEGDFDDEEEED 483

Query: 423 NLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVA-PMFPFYENNSEWDDF 481
           NL G  M+ D   A        P G +  +I  D ++   T     MFPF     + DD+
Sbjct: 484 NLIG--MLRDGTTA--------PTGKQAVEIPTDIYIQEGTPAKHRMFPFQPPRIKRDDY 533

Query: 482 GEVINPDDYIIKDED 496
           G +I+    I  D+D
Sbjct: 534 GSIIDFSMLIPSDDD 548


>gi|219121689|ref|XP_002181194.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217407180|gb|EEC47117.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 602

 Score =  136 bits (342), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 110/389 (28%), Positives = 181/389 (46%), Gaps = 21/389 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDP-SLLQPLSKVA-STIDAVLLSH 62
           + +TPL          +L+   G   L+DCG +  +D  + L  L ++    +D +L++H
Sbjct: 5   MSITPLGSGQEVGRSCHLLEFRGMTILLDCGIHPGYDGLNGLPYLDRIEPDQVDVLLITH 64

Query: 63  PDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLSRRQVSEFDLFTLDDI 119
               H+ +LPY  ++      +F T P   V RL L         +    E  L+T  D+
Sbjct: 65  FHLDHVASLPYLTERTSFKGRIFMTHPTKAVTRLLLGDYLRLLQMKNAKPEDVLYTEADL 124

Query: 120 DSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRK 179
            S    +  +    ++H +    G+      AGH+LG  ++ ++  G  ++Y  DY+   
Sbjct: 125 QSCIDKIELM----DFHTTVTVGGLSFYALNAGHVLGACMFFLSLGGRKILYTGDYSMED 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSA 238
           ++HL    + +   P VLI +A   +     R +RE  F   I + +  GG  L+PV + 
Sbjct: 181 DRHLMAAEIPA-ESPDVLIVEATYGVQVHASRAEREARFTGTIERVISRGGRCLIPVFAL 239

Query: 239 GRVLELLLILEDYWAE--HSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN 296
           GR  ELLLIL++YW    H  N PI++ + ++S  +   +++   M   I    + S  N
Sbjct: 240 GRAQELLLILDEYWQANPHLQNIPIWYASKLASRALRVYQTYANMMNARIRSQMDVS--N 297

Query: 297 AFLLKHVTLL--INKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
            F  + +  L  I+ +  D++  GP +V AS   L++G S  +F  WASD KN VL    
Sbjct: 298 PFRFRFIQNLKSIDVNSFDDS--GPSVVFASPGMLQSGVSRQLFDRWASDHKNGVLIAGY 355

Query: 355 GQFGTLARMLQADPPPKAVKVTMSRRVPL 383
               TLA+ + A   PK V     RR PL
Sbjct: 356 AVEHTLAKEIMAQ--PKEVVTLEGRRQPL 382


>gi|66357778|ref|XP_626067.1| CPSF metallobeta-lactamase [Cryptosporidium parvum Iowa II]
 gi|46227299|gb|EAK88249.1| CPSF metallobeta-lactamase [Cryptosporidium parvum Iowa II]
          Length = 751

 Score =  136 bits (342), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 100/362 (27%), Positives = 164/362 (45%), Gaps = 48/362 (13%)

Query: 31  LIDCGWNDHFDPSLLQPLSKVAST----------IDAVLLSHPDTLHLGALPYAMKQLGL 80
           + DCG +  F      P  ++ S           ID V++SH    H GALP+  +++G 
Sbjct: 31  MFDCGMHMGFKDERKYPDFRLISATLDPLIINEYIDLVIISHYHLDHCGALPFFTEKIGY 90

Query: 81  SAPVFSTEPVYRLGLLTMYDQ--------YLSRRQV-----------SEFDLFTLDDIDS 121
             P+  T P   +  + + D          L +  V           +E+  FT+ D+ S
Sbjct: 91  KGPIVMTYPTKSVSSVLLSDCCKIMEQKLLLQKTNVDVAPPNETVYNNEYGFFTVSDVWS 150

Query: 122 AFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEK 181
             + V  +   Q   +SG    I + P+ AGH+LG +++ +    E ++Y  D+N  +++
Sbjct: 151 CMEKVKAIQLHQTIVISG----IKITPYYAGHVLGASMFHVQVSDESIVYTGDFNMVRDR 206

Query: 182 HLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGR 240
           HL G  L   + P++LI+++  A + +P R+  E  F + +   L+ GG VL+PV + GR
Sbjct: 207 HL-GPALIPKLLPSLLISESTYATYIRPSRRSTERTFCEMVYSCLKRGGKVLIPVFAIGR 265

Query: 241 VLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLL 300
             EL ++LE YW    + +PI+F   ++     Y + F  W    +        DN F  
Sbjct: 266 AQELCILLEIYWRRMQIRFPIFFGGSMTEKANSYYQLFTNWTNTPLA-------DNIFTF 318

Query: 301 KHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL---FTERGQF 357
            HV L  +KS L     GP ++ A+   L  G S   F  WA D  NL +   F   G  
Sbjct: 319 PHV-LPYDKSIL--TLSGPAVLFATPGMLHTGLSLQAFKMWAPDSNNLTIIPGFCVSGTI 375

Query: 358 GT 359
           G+
Sbjct: 376 GS 377


>gi|67624341|ref|XP_668453.1| ENSANGP00000013258 [Cryptosporidium hominis TU502]
 gi|54659666|gb|EAL38233.1| ENSANGP00000013258 [Cryptosporidium hominis]
          Length = 750

 Score =  136 bits (342), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 100/362 (27%), Positives = 165/362 (45%), Gaps = 48/362 (13%)

Query: 31  LIDCGWNDHFDPSLLQPLSKVAST----------IDAVLLSHPDTLHLGALPYAMKQLGL 80
           + DCG +  F      P  ++ S           ID V++SH    H GALP+  +++G 
Sbjct: 29  MFDCGMHMGFKDERKYPDFRLISATLDPLIINEYIDLVIISHYHLDHCGALPFFTEKIGY 88

Query: 81  SAPVFSTEPVYRLGLLTMYD------QYLSRRQVS-------------EFDLFTLDDIDS 121
             P+  T P   +  + + D      Q L  ++ +             E+  FT+ D+ S
Sbjct: 89  KGPIVMTYPTKSVSSVLLSDCCKIMEQKLLLQKTNADVVPPNETVYNNEYGFFTVSDVWS 148

Query: 122 AFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEK 181
             + V  +   Q   +SG    I + P+ AGH+LG +++ +    E ++Y  D+N  +++
Sbjct: 149 CMEKVKAIQLHQTIVISG----IKITPYYAGHVLGASMFHVQVSDESIVYTGDFNMVRDR 204

Query: 182 HLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGR 240
           HL G  L   + P++LI+++  A + +P R+  E  F + +   L+ GG VL+PV + GR
Sbjct: 205 HL-GPALIPKLLPSLLISESTYATYIRPSRRSTERTFCEMVYSCLKRGGKVLIPVFAIGR 263

Query: 241 VLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLL 300
             EL ++LE YW    + +PI+F   ++     Y + F  W    +        DN F  
Sbjct: 264 AQELCILLEIYWRRMQIRFPIFFGGSMTEKANSYYQLFTNWTNTPLA-------DNIFTF 316

Query: 301 KHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL---FTERGQF 357
            HV L  +KS L     GP ++ A+   L  G S   F  WA D  NL +   F   G  
Sbjct: 317 PHV-LPYDKSIL--TLSGPAVLFATPGMLHTGLSLQAFKMWAPDSNNLTIIPGFCVSGTI 373

Query: 358 GT 359
           G+
Sbjct: 374 GS 375


>gi|154322621|ref|XP_001560625.1| hypothetical protein BC1G_00653 [Botryotinia fuckeliana B05.10]
 gi|347837188|emb|CCD51760.1| similar to cleavage and polyadenylation specifity factor
           [Botryotinia fuckeliana]
          Length = 828

 Score =  136 bits (342), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 100/368 (27%), Positives = 167/368 (45%), Gaps = 26/368 (7%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           +++   G   ++D G +  +D     P       ST+D +L+SH    H  +LPY + + 
Sbjct: 40  HIIQYKGKTVMLDAGMHAGYDGLAALPFYDDFDLSTVDLLLISHFHVDHAASLPYVLAKT 99

Query: 79  GLSAPVFSTEP---VYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNY 135
                VF T P   +Y+  ++       +        ++T  D  + F  +  + Y   +
Sbjct: 100 NFKGRVFMTHPTKAIYKWLIIDSVRVGGASSGGGSQPVYTEADHLTTFAQIEAIDYHTTH 159

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
            +S     I V P+ AGH+LG  ++ I   G  + +  DY+R  ++HL    +   V+  
Sbjct: 160 TISS----IRVTPYPAGHVLGAAMFLIEIAGLKIFFTGDYSREDDRHLVSAEVPKGVKID 215

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT++   + +  PR +RE     +++  L  GG VL+PV + GR  ELLLIL++YW +
Sbjct: 216 VLITESTYGIASHIPRLEREQALMKSVTSILNRGGRVLMPVFALGRAQELLLILDEYWGK 275

Query: 255 HS--LNYPIYFL------------TYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLL 300
           H      PIY+             TYV S   +  + F E M ++   S    R   +  
Sbjct: 276 HPEFQKIPIYYASNLARKCMLVYQTYVGSMNENIKRLFRERMAEAEANSTSGGRGGPWDF 335

Query: 301 KHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTL 360
           K++  L N    D+   G  ++LAS   L+ G S  +   WA   KN V+ T     GT+
Sbjct: 336 KYIRSLKNLDRFDDV--GGCVILASPGMLQNGISRQLLERWAPSDKNGVIITGYSVEGTM 393

Query: 361 ARMLQADP 368
           A+ +  +P
Sbjct: 394 AKQIMQEP 401


>gi|328766828|gb|EGF76880.1| hypothetical protein BATDEDRAFT_14507, partial [Batrachochytrium
           dendrobatidis JAM81]
          Length = 475

 Score =  136 bits (342), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 103/376 (27%), Positives = 175/376 (46%), Gaps = 30/376 (7%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDA 57
           ++V PL    +      LV++   N + DCG    ++DH    D + +       S ID 
Sbjct: 8   IRVIPLGAGQDVGRSCVLVTMGSKNIMFDCGMHMGYSDHRRFPDFTYISKSGDYTSMIDC 67

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  +  G   P++ T P   +  + + D + +   +  E D FT 
Sbjct: 68  VIISHFHLDHCGALPYFTEICGYDGPIYMTGPTKAIAPILLEDMRKVVVERKGETDFFTS 127

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKI-TKDG----EDVIY 171
            DI +  Q V  +   +   +  + E   + P+ AGH+LG  ++ +   DG    + V+Y
Sbjct: 128 VDIKNCMQKVIAVNLMETVQVDAQLE---IRPYYAGHVLGAAMFYVRVTDGYGVTQSVVY 184

Query: 172 AVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGN 230
             DYN   ++HL    ++    P ++IT+   A   +  ++ RE  F   +   +  GG 
Sbjct: 185 TGDYNMTPDRHLGAAQIDG-CEPDLIITETTYATTIRDSKRARERDFLKKVHDCVSGGGK 243

Query: 231 VLLPVDSAGRVLELLLILEDYWAEH---SLNYPIYFLTYVSSSTIDYVKSFLEWMGDSIT 287
           VL+PV + GR  ELL+++E YW          P+YF T ++    +Y K F+ W  +++ 
Sbjct: 244 VLVPVFALGRAQELLILIESYWRRMDDLCDKVPVYFSTGLTERANEYYKLFISWTNENV- 302

Query: 288 KSFETSRDNAFLLKHVTLLINKSELDNAPD--GPKLVLASMASLEAGFSHDIFVEWASDV 345
           KS    R N F   H+     +S   +  D  G  ++ A+   L AG S ++F +W  D 
Sbjct: 303 KSALVER-NMFDFAHI-----RSWSHSFADEPGAMVLFATPGMLHAGTSLEVFKKWCHDP 356

Query: 346 KNLVLFTERGQFGTLA 361
           KN+++       GT+ 
Sbjct: 357 KNMIIMPGYCVAGTVG 372


>gi|323336644|gb|EGA77910.1| Cft2p [Saccharomyces cerevisiae Vin13]
          Length = 859

 Score =  136 bits (342), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 101/331 (30%), Positives = 162/331 (48%), Gaps = 33/331 (9%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPS------LLQPLSKVASTIDAVLLSHPDTLHLGA---LP 72
           +V  D    LID GWN    PS       ++   KV   ID ++LS P    LGA   L 
Sbjct: 19  VVRFDNVTLLIDPGWN----PSKVSYEQCIKYWEKVIPEIDVIILSQPTIECLGAHSLLY 74

Query: 73  YAMKQLGLSA-PVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLD--DIDSAFQSVTRL 129
           Y      +S   V++T PV  LG ++  D Y S   +  +D   LD  DI+ +F  +  L
Sbjct: 75  YNFTSHFISRIQVYATLPVINLGRVSTIDSYASAGVIGPYDTNKLDLEDIEISFDHIVPL 134

Query: 130 TYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN----- 184
            YSQ   L  + +G+ +  + AG   GG++W I+   E ++YA  +N  ++  LN     
Sbjct: 135 KYSQLVDLRSRYDGLTLLAYNAGVCPGGSIWCISTYSEKLVYAKRWNHTRDNILNAASIL 194

Query: 185 ---GTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRV 241
              G  L + +RP+ +IT       +QP +++ ++F+D + K L + G+V++PVD +G+ 
Sbjct: 195 DATGKPLSTLMRPSAIITTLDRFGSSQPFKKRSKIFKDTLKKGLSSDGSVIIPVDMSGKF 254

Query: 242 LELL-----LILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN 296
           L+L      L+ E          P+  L+Y    T+ Y KS LEW+  S+ K++E +R+N
Sbjct: 255 LDLFTQVHELLFESTKINAHTQVPVLILSYARGRTLTYAKSMLEWLSPSLLKTWE-NRNN 313

Query: 297 A--FLLKHVTLLINKSELDNAPDGPKLVLAS 325
              F +     +I  +EL   P G K+   S
Sbjct: 314 TSPFEIGSRIKIIAPNELSKYP-GSKICFVS 343


>gi|50286175|ref|XP_445516.1| hypothetical protein [Candida glabrata CBS 138]
 gi|49524821|emb|CAG58427.1| unnamed protein product [Candida glabrata]
          Length = 843

 Score =  136 bits (342), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 140/583 (24%), Positives = 263/583 (45%), Gaps = 77/583 (13%)

Query: 22  LVSIDGFNFLIDCGWNDH---FDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA---- 74
           ++  D    L+D GW+ +   ++ S+    S + + +D +L+S P T  LGA  +     
Sbjct: 19  ILRFDNVTILLDPGWSSYKVSYEDSV-AFWSNIIAEVDIILISQPTTECLGAYTFLYYNF 77

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEF--DLFTLDDIDSAFQSVTRLTYS 132
           +        V++T PV  LG ++  + Y+++  +  +  +   +DD++ AF  +  L YS
Sbjct: 78  ISHFISHIQVYATLPVANLGRVSTIEFYVTKGIIGPYQTNQLDIDDVEKAFDFIDVLKYS 137

Query: 133 QNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN-------- 184
           Q   L  K +G+ +  + +G+  GG +W IT   E +IYA  +N  ++  LN        
Sbjct: 138 QLVDLRSKYDGLSLFAYNSGYAPGGAIWCITTYSEKLIYAPRWNHTRDTILNAANLLDNT 197

Query: 185 GTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLEL 244
           G  L S +RP+ ++T+  +   +QP R++ + F+D +   L   GN+L+PVD  G+ L+L
Sbjct: 198 GKPLSSLMRPSAIVTNFDHFGSSQPFRKRAKSFKDILKTKLSNNGNILIPVDIGGKFLDL 257

Query: 245 LLILEDYWAEHS-----LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET-SRDNAF 298
            +++ D+  E+       N PI  L+Y  + ++ Y KS  EW      K++E  ++  AF
Sbjct: 258 FVLVHDFLYENGRNNKLANIPIVLLSYTKARSLTYAKSMTEWFSSISAKTWENRNQKTAF 317

Query: 299 LLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQ-- 356
            L     +++ +EL N   GPK+   S  ++E   +  + +  + +   LVL T+  Q  
Sbjct: 318 DLDTPFSVVDSNELANLK-GPKICFVS--NVETLVNDALSILGSDNNTLLVLTTDNRQEV 374

Query: 357 ------------FGTLARMLQADPPPKAVKVTMSRRV--PLVGEELIAY-EEEQTRLKKE 401
                         T + +  A+      K+T++      L  EEL AY  + + R +K+
Sbjct: 375 PALHTIYDYWKENNTESSIESANVLKLNQKITINTTTFKELQNEELDAYLSKLEQRKRKQ 434

Query: 402 EALKASLVKEEESKASLGPDNNLSGDP-------MVIDANNANASA-------------- 440
              + +  K  +  A++    NL+ D        +V D  N                   
Sbjct: 435 LITEITTRKGLKKGAAVALPTNLASDEGQKTEVDLVDDITNTEDLEKLLEEEEEDEDEDN 494

Query: 441 -----DVVEPH---GGRYRDILIDGFVPPSTSVA-PMFPFYENNSEWDDFGEVINPDDYI 491
                +++E      G    I +D  + P  +    +FPF     + DD+G V+  D ++
Sbjct: 495 EDNLINILEDEDRADGIEESIPVDIIITPGVNNKHKIFPFQPLRQKKDDYGIVVKFDQFV 554

Query: 492 IKD--EDMDQAAMHIGGDDGKLDEGSASLILDAKPSKVVSNEL 532
             +  +D+  +  HI GD+ + D     +I +A   K+ S+ +
Sbjct: 555 PAEDKDDITPSKRHINGDNEE-DMDDDYVIKEASNKKIKSDSV 596


>gi|365764103|gb|EHN05628.1| Ysh1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 699

 Score =  136 bits (342), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 95/349 (27%), Positives = 168/349 (48%), Gaps = 23/349 (6%)

Query: 75  MKQLGLSAPVFSTEP---VYRLGL-----LTMYDQYLSRRQVSEFDLFTLDDIDSAFQSV 126
           M++      VF T P   +YR  L     +T      S     +  LF+ +D+  +F  +
Sbjct: 1   MQRTNFQGRVFMTHPTKAIYRWLLRDFVRVTSIGSSSSSMGTKDEGLFSDEDLVDSFDKI 60

Query: 127 TRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT 186
             +    +YH +    GI      AGH+LG  +++I   G  V++  DY+R  ++HLN  
Sbjct: 61  ETV----DYHSTVDVNGIKFTAFHAGHVLGAAMFQIEIAGLRVLFTGDYSREVDRHLNSA 116

Query: 187 VLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLL 246
            +       +++   +    ++P   +       I  T+  GG VLLPV + GR  E++L
Sbjct: 117 EVPPLSSNVLIVESTFGTATHEPRLNRERKLTQLIHSTVMRGGRVLLPVFALGRAQEIML 176

Query: 247 ILEDYWAEHSL-----NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLK 301
           IL++YW++H+        PI++ + ++   +   ++++  M D I K F  S+ N F+ K
Sbjct: 177 ILDEYWSQHADELGGGQVPIFYASNLAKKCMSVFQTYVNMMNDDIRKKFRDSQTNPFIFK 236

Query: 302 HVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLA 361
           +++ L N  +  +   GP ++LAS   L++G S D+   W  + KNLVL T     GT+A
Sbjct: 237 NISYLRNLEDFQDF--GPSVMLASPGMLQSGLSRDLLERWCPEDKNLVLITGYSIEGTMA 294

Query: 362 R--MLQADPPPKAV--KVTMSRRVPLVGEELIAYEEEQTRLKKEEALKA 406
           +  ML+ D  P     ++T+ RR  +      A+ + Q  L+  E + A
Sbjct: 295 KFIMLEPDTIPSINNPEITIPRRCQVEEISFAAHVDFQENLEFIEKISA 343


>gi|327308534|ref|XP_003238958.1| cleavage and polyadenylylation specificity factor [Trichophyton
           rubrum CBS 118892]
 gi|326459214|gb|EGD84667.1| cleavage and polyadenylylation specificity factor [Trichophyton
           rubrum CBS 118892]
          Length = 1024

 Score =  136 bits (342), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 111/432 (25%), Positives = 176/432 (40%), Gaps = 105/432 (24%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G   L+D GW++ FD S+L+ L +   T+  +LL+H    HLGA  +  +   L    P+
Sbjct: 27  GVKILVDVGWDESFDTSVLKELERHIPTLSLILLTHATPSHLGAFVHCCRTYPLFTQIPI 86

Query: 85  FSTEPVYRLGLLTMYDQYLSRRQVSEF---DLFTLDDIDSAFQSVTRLTYSQN---YHLS 138
           ++T PV   G   + + Y S    + F      T  D  S     +  + SQ    Y ++
Sbjct: 87  YATIPVIAFGRTYLQNLYASAPLAATFLPSTSVTASDPSSGLTIQSATSSSQGPSGYEIT 146

Query: 139 GKGE---------------------------------------GIVVAPHVAGHLLGGTV 159
           G G                                        G+ +  + AGH +GGT+
Sbjct: 147 GSGRILLPPPTNEDIARYFSLIHPLKYSQPLQPLPSPFSPPLNGLTITAYNAGHTVGGTI 206

Query: 160 WKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAYNALHN 207
           W I    E ++YAVD+++ +E  + G             V+E   +P  LI  A      
Sbjct: 207 WHIQHGMESIVYAVDWSQARENVIAGAAWFGSSIGSGTEVIEQLRKPTALICSASGGDKF 266

Query: 208 QPP--RQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS-------- 256
             P  R++R+ +  D I      GG VLLP DS+ R+LE+  +LE  W E +        
Sbjct: 267 ALPGGRKKRDGLLLDMIRSCAAKGGTVLLPTDSSARILEIAYVLEHAWREAADSEDLNDP 326

Query: 257 -LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE------------------------ 291
             N P+Y     +  T+   +S LEWM ++I + FE                        
Sbjct: 327 LKNTPLYLAGKKAHGTMRLARSMLEWMDENIVREFEGNDGVEATTGKAAGGASNQPSKGA 386

Query: 292 TSRDNA--------FLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHDIFVEW 341
            S+ +A        F  KH+ L+ +K++LD      G K++L+   SLE G S  +    
Sbjct: 387 QSQKSATGQKSLGPFTFKHLNLVEHKAKLDGILESKGSKVILSPDTSLEWGLSKHVLKHI 446

Query: 342 ASDVKNLVLFTE 353
           A   +NL++ TE
Sbjct: 447 AEGNENLIIMTE 458


>gi|156064885|ref|XP_001598364.1| conserved hypothetical protein [Sclerotinia sclerotiorum 1980]
 gi|154691312|gb|EDN91050.1| conserved hypothetical protein [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 820

 Score =  136 bits (342), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 100/368 (27%), Positives = 167/368 (45%), Gaps = 26/368 (7%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           +++   G   ++D G +  +D     P       ST+D +L+SH    H  +LPY + + 
Sbjct: 40  HIIQYKGKTVMLDAGMHAGYDGLAALPFYDDFDLSTVDLLLISHFHVDHAASLPYVLAKT 99

Query: 79  GLSAPVFSTEP---VYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNY 135
                VF T P   +Y+  ++       +        ++T  D  + F  +  + Y   +
Sbjct: 100 NFKGRVFMTHPTKAIYKWLIIDSVRVGGASSNGGSHSVYTEADHLTTFAQIEAIDYHTTH 159

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
            +S     I V P+ AGH+LG  ++ I   G  + +  DY+R  ++HL    +   V+  
Sbjct: 160 TISS----IRVTPYPAGHVLGAAMFLIEIAGLKIFFTGDYSREDDRHLVSAEVPKGVKID 215

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT++   + +  PR +RE     +++  L  GG VL+PV + GR  ELLLIL++YW +
Sbjct: 216 VLITESTYGIASHIPRLEREQALMKSVTSILNRGGRVLMPVFALGRAQELLLILDEYWDK 275

Query: 255 HS--LNYPIYFL------------TYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLL 300
           H      PIY+             TYV S   +  + F E M ++   S    R   +  
Sbjct: 276 HPEFQKIPIYYASNLARKCMLVYQTYVGSMNENIKRLFRERMAEAEANSTSGGRGGPWDF 335

Query: 301 KHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTL 360
           K++  L N    D+   G  ++LAS   L+ G S  +   WA   KN V+ T     GT+
Sbjct: 336 KYIRSLKNLDRFDDV--GGCVILASPGMLQNGISRQLLERWAPSDKNGVIITGYSVEGTM 393

Query: 361 ARMLQADP 368
           A+ +  +P
Sbjct: 394 AKQIMQEP 401


>gi|358378169|gb|EHK15851.1| hypothetical protein TRIVIDRAFT_65314 [Trichoderma virens Gv29-8]
          Length = 873

 Score =  136 bits (342), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 103/366 (28%), Positives = 175/366 (47%), Gaps = 30/366 (8%)

Query: 38  DHFDPSLLQPL--SKVASTIDAVLLSHPDTLHL---GALPYAMKQLGLSAPVFSTEPVYR 92
           D FD S +  L  S+      ++LL+  D+ H+    +LPY + +      VF T P   
Sbjct: 70  DDFDLSTVDVLLISQTLHDASSLLLTRGDSFHIDHAASLPYVLAKTNFRGRVFMTHPTKA 129

Query: 93  LGLLTMYDQYLSRRQVSE--FDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHV 150
           +    + D        S     L+T  D  + F  +  + Y   + +S     I + P+ 
Sbjct: 130 IYKWLIQDSVRVGNTASNSATQLYTEQDHLNTFPQIEAIDYHTTHTISS----IRITPYP 185

Query: 151 AGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPP 210
           AGH+LG  ++ I   G ++ +  DY+R +++HL    +   ++  VLIT++   + +  P
Sbjct: 186 AGHVLGAAMFLIEIAGLNIFFTGDYSREQDRHLVSAEVPKGLKIDVLITESTYGIASHVP 245

Query: 211 RQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYV 267
           R +RE     +I+  L  GG  LLPV + GR  ELLLIL++YW +H     +PIY+ + +
Sbjct: 246 RLEREQALMKSITGILNRGGRALLPVFALGRAQELLLILDEYWGKHPEFQRFPIYYASNL 305

Query: 268 SSSTIDYVKSFLEWMGDSITKSF-------ETSRDNA-----FLLKHVTLLINKSELDNA 315
           +   +   ++++  M D+I + F       E S D A     +  K++  L N    D+ 
Sbjct: 306 ARKCMVIYQTYVGAMNDNIKRLFRERMAEAEASGDAAGKNGPWDFKYIRSLKNLDRFDDV 365

Query: 316 PDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKV 375
             G  ++LAS   L+ G S ++F  WA   KN V+ T     GT+AR +  +  P  ++ 
Sbjct: 366 --GGCVMLASPGMLQNGVSRELFERWAPSEKNGVIITGYSVEGTMARQIMQE--PDQIQA 421

Query: 376 TMSRRV 381
            MSR +
Sbjct: 422 VMSRSI 427


>gi|146417489|ref|XP_001484713.1| hypothetical protein PGUG_02442 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 821

 Score =  135 bits (341), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 149/537 (27%), Positives = 229/537 (42%), Gaps = 106/537 (19%)

Query: 129 LTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHL----- 183
           L YSQ   LS     +++ P+ AGH LGGT W ITK  E VIYA  +N  K+  L     
Sbjct: 19  LKYSQT--LSLFENKMIITPYNAGHTLGGTFWCITKRLEKVIYAPSWNHSKDSFLSSSSF 76

Query: 184 ----NGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAG 239
                G  L   +RP VLIT+  +   N P +++ E F   +  TL  GG V+LP   +G
Sbjct: 77  LSASTGNPLSQLMRPTVLITNT-DLGSNLPHKKRAEKFLQLMDATLANGGAVVLPTSLSG 135

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE-------- 291
           R LELL +++ +     +  P+YFL+Y  +  ++Y  S LEWM   + K +E        
Sbjct: 136 RFLELLHLVDHHLQSQPI--PVYFLSYSGTKVLNYASSLLEWMSTLLVKEWEAASSASMN 193

Query: 292 -TSRDN-AFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAG-FSHDIFVEWASDVKNL 348
            T+++N  F    V LL++  EL     GPK+VL +   + +G  S ++      D KN 
Sbjct: 194 STNKNNFPFDPSKVDLLLDPKELIQL-SGPKIVLCAGIDMNSGDVSFEVLKYLCLDQKNT 252

Query: 349 VLFTERGQFGT--------------------------LARMLQADPPPKAVKVTMSRRVP 382
           VL TE+  FG                           LA   +   P +     +SR  P
Sbjct: 253 VLLTEKTHFGADFSINAQLFTDWVRLSREKYGNAEDGLAIGYEGTIPLRG----LSREDP 308

Query: 383 LVGEELIAYEE-----------EQTRLKKEEA-LKASLVKEEESKASLGPDNNLSGD--- 427
           L G EL +++E           EQ R +K +  L A  ++EE+S +  G D   S +   
Sbjct: 309 LSGSELTSFQERINHQRKKKLFEQVRDRKNQNLLNADNLEEEDSSSDDGEDAESSDEEMP 368

Query: 428 ----------PMVIDAN-NANASADVVEPHGGRYR-------DILIDGFVPPSTSVAPMF 469
                     P  ID N NA  + D       +         D+ I   + P  ++ P  
Sbjct: 369 TTTETEAGAMPGAIDTNVNAIVTQDAFVADQVKQTLDDELPLDVKITHKLKPRQAMFPYI 428

Query: 470 PFYENNSEWDDFGEVINPDDYIIKDEDMDQAAMHIGG-----DDGKLDEGSASLILDAKP 524
           P ++   ++DD+GEVI+  DY  + ED+  A + +        + KL  G+       + 
Sbjct: 429 PPHKR--KFDDYGEVIDIKDY-QRAEDLTNAKLILDSKRKFEQEDKLKWGNDDDRRSGRG 485

Query: 525 SKVVSNELTVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLS 581
             + +N LT          E L    L+     ++ P+    +  T DL  ++  LS
Sbjct: 486 GGIQTNRLT--------PQETLNNQILQKNLHTLFQPRKRVIVTKTQDL-KFRCSLS 533


>gi|401841928|gb|EJT44237.1| CFT2-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 861

 Score =  135 bits (341), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 101/331 (30%), Positives = 163/331 (49%), Gaps = 33/331 (9%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPS------LLQPLSKVASTIDAVLLSHPDTLHLGA---LP 72
           +V  D    LID GWN    PS       ++   KV   +D ++LS P T  LGA   L 
Sbjct: 19  VVQFDNVTLLIDPGWN----PSKVSYEQCIKYWEKVIPEVDVIILSQPTTECLGAHSLLY 74

Query: 73  YAMKQLGLSA-PVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLD--DIDSAFQSVTRL 129
           Y      +S   V++T PV  LG ++  D Y S   +  +D   LD  DID +F  +  L
Sbjct: 75  YNFVSHFISRIQVYATLPVINLGRVSTIDSYASAGVIGPYDTNKLDLEDIDKSFDHIVPL 134

Query: 130 TYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN----- 184
            YSQ   L  + +G+ +  + AG   GG++W I+   E +IYA  +N  ++  LN     
Sbjct: 135 KYSQLVDLRSRYDGLTLLAYNAGVCPGGSIWCISTYSEKLIYAKRWNHTRDNILNAASIL 194

Query: 185 ---GTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRV 241
              G  L + +RP+ +IT       +QP +++ + F+D + + L + G+V++PVD +G+ 
Sbjct: 195 DSAGKPLSTLMRPSAIITTLDKFGSSQPFKKRSKSFKDTLKRGLSSDGSVIIPVDMSGKF 254

Query: 242 LELL-----LILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN 296
           L+L      L+ E          P+  ++Y    T+ Y KS LEW+  S+ K++E +R+N
Sbjct: 255 LDLFTQVHELLFESTKINAHTQVPVLIVSYARGRTLTYAKSMLEWLSPSLLKTWE-NRNN 313

Query: 297 A--FLLKHVTLLINKSELDNAPDGPKLVLAS 325
              F +     +I+ +EL N   G K+   S
Sbjct: 314 TSPFEIGSRIKIISPNEL-NKYAGTKICFVS 343


>gi|240280758|gb|EER44262.1| cleavage and polyadenylation specificity factor subunit 2
           [Ajellomyces capsulatus H143]
          Length = 1010

 Score =  135 bits (341), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 131/538 (24%), Positives = 206/538 (38%), Gaps = 122/538 (22%)

Query: 8   TPLSGV--FNENPLSYLVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPD 64
           TPL G        +  ++ +DG    L+D GW++ FD S L  L +   T+  VLL+H  
Sbjct: 5   TPLLGAQSSGSRAVQSILELDGGVKILVDVGWDESFDVSALAELERQIPTLSLVLLTHAT 64

Query: 65  TLHLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLSRRQVSEF----------- 111
             H+GA  +  K   L    P+++T PV  LG   + D Y S    + F           
Sbjct: 65  PSHIGAFAHCCKTFPLFNQIPIYATSPVIALGRTLLQDLYSSAPLAATFLSKATSADSSP 124

Query: 112 ------------DLFTLDDIDSA---------------FQSVTRLTYSQNYHLSGKG--- 141
                       D   +D  DS                F  +  L YSQ +         
Sbjct: 125 SSPISSRAENVADTANIDHNDSPRILLPPPTSEEIARYFSLIHPLKYSQPHQPLPSPFSP 184

Query: 142 --EGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------V 187
              G+ +  + AGH +GGT+W I    E +IYAVD+N+ +E  + G             V
Sbjct: 185 PLNGLTLTAYNAGHTVGGTIWHIQHGMESIIYAVDWNQARENVIAGAAWFGGSGASGTEV 244

Query: 188 LESFVRPAVLITDAYNALHNQPP--RQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLEL 244
           +E   +P   +           P  R++R ++  D I      GG VL+P D++ R LEL
Sbjct: 245 VEQLRKPTAFVCSTRGGDKFSLPGGRKKRDDLLMDMIRNCFSKGGTVLIPTDTSARALEL 304

Query: 245 LLILEDYWAEHS---------LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
             +LE  W E +          +  +Y        T+   +S LEWM + I + FE    
Sbjct: 305 AYVLEHAWRESAETADGEDPLKSGELYLAGKKGYGTMRLARSMLEWMDEGIVREFEAGHG 364

Query: 296 ------------------------------------NAFLLKHVTLLINKSELDN--APD 317
                                                 F  KH+ ++  K++L+     +
Sbjct: 365 GDPVAAGGKGRQDGPNQRTPSAAMTDKRGDSSFKNLGPFTFKHLKIVERKAKLEKILGSN 424

Query: 318 GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTM 377
            PK++L S  SL+ G+S  +  + AS  +NLV+ TE   F           P K +   +
Sbjct: 425 TPKVILTSDTSLDWGYSKHVLQKIASGSENLVILTE--SFSV--------SPNKQMVDGI 474

Query: 378 SRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANN 435
             R  L  E    YEE +  +  E  +   L+++  S   L    ++   P+  DAN+
Sbjct: 475 RSRPSLAHEIWTIYEERKDGVSSETTINGELLEQVHSGGRLLTVTDVEKTPL--DAND 530



 Score = 44.7 bits (104), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 50/199 (25%), Positives = 76/199 (38%), Gaps = 59/199 (29%)

Query: 534 VLVHGSAEATEHLKQHCLKHVCPH--------------VYTPQIEETIDVTSDLCAYKVQ 579
           +L  G  E TE L   C   +                 ++TP I ET+D + D  A+ V+
Sbjct: 789 ILTAGLKEETEALAAECRNLLTAKAGLELGSSSQSVVDIFTPVIGETVDASVDTNAWMVK 848

Query: 580 LS----------------------------EKLMSNVLFKKLGDYEIAWVDAEVGKTENG 611
           LS                            +++ S       G+ +   V     K    
Sbjct: 849 LSSVVALTGELRGPEPMVADEDGPGMSQKKQRMFSENASSSEGNEQKQLVPR---KHSFP 905

Query: 612 MLSLLPISTPAPPH---KSVLVGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKV 667
           +L +LP++  A      + + VGDL++ADL+  + S G   EF G G L    +V +RK 
Sbjct: 906 LLDVLPVNMAAATRSVTRPLHVGDLRLADLRKLMQSSGHTAEFRGEGTLLIDGFVAVRK- 964

Query: 668 GPAGQKGGGSGTQQIVIEG 686
                    SGT +I IEG
Sbjct: 965 ---------SGTGKIEIEG 974


>gi|389634325|ref|XP_003714815.1| endoribonuclease YSH1 [Magnaporthe oryzae 70-15]
 gi|351647148|gb|EHA55008.1| endoribonuclease YSH1 [Magnaporthe oryzae 70-15]
 gi|440467574|gb|ELQ36790.1| endoribonuclease YSH1 [Magnaporthe oryzae Y34]
 gi|440483131|gb|ELQ63565.1| endoribonuclease YSH1 [Magnaporthe oryzae P131]
          Length = 829

 Score =  135 bits (341), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 97/370 (26%), Positives = 172/370 (46%), Gaps = 27/370 (7%)

Query: 20  SYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQ 77
            +++   G   ++D G +  +D     P       ST+D +L+SH    H  +LPY + +
Sbjct: 40  CHIIQYKGKTVMLDAGQHPAYDGLAALPFFDDFDLSTVDVLLISHFHVDHAASLPYVLSK 99

Query: 78  LGLSAPVFSTEPVYRLGLLTMYDQYL---SRRQVSEFDLFTLDDIDSAFQSVTRLTYSQN 134
                 VF T P   +    + D      +    +   ++T  D  + F  +  + Y   
Sbjct: 100 TNFKGRVFMTHPTKAIYKWLIQDSVRVGNTSSNPTSQPVYTEQDHLNTFPQIEAIDYYTT 159

Query: 135 YHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRP 194
           + +S     I + P+ AGH+LG  ++ I   G ++ +  DY+R +++HL    +   V+ 
Sbjct: 160 HTISS----IRITPYPAGHVLGAAMFLIEIAGMNIFFTGDYSREQDRHLVSAEVPRGVKI 215

Query: 195 AVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 253
            VLIT++   + +  PR +RE     +I+  L  GG VL+PV + GR  ELLLIL++YW 
Sbjct: 216 DVLITESTYGIASHVPRVEREQALMKSITGILNRGGRVLMPVFALGRAQELLLILDEYWG 275

Query: 254 EHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA-------------F 298
           +H      PIY+ + ++   +   ++++  M D+I + F      A             +
Sbjct: 276 KHQEYQKVPIYYASNLARKCMVVYQTYVGAMNDNIKRLFRERLAEAEASGKSGAGGGGPW 335

Query: 299 LLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFG 358
             K++  L N    D+   GP ++LAS   L+ G S ++   WA   KN V+ T     G
Sbjct: 336 DFKYIRSLKNLDRFDDL--GPCVMLASPGMLQNGVSRELLERWAPSDKNGVVITGYSVEG 393

Query: 359 TLARMLQADP 368
           T+A+ +  +P
Sbjct: 394 TMAKQIMQEP 403


>gi|242053629|ref|XP_002455960.1| hypothetical protein SORBIDRAFT_03g028040 [Sorghum bicolor]
 gi|241927935|gb|EES01080.1| hypothetical protein SORBIDRAFT_03g028040 [Sorghum bicolor]
          Length = 558

 Score =  135 bits (340), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 97/362 (26%), Positives = 165/362 (45%), Gaps = 21/362 (5%)

Query: 22  LVSIDGFNFLIDCG----WNDHFD-PSLLQPLSK-----VASTIDAVLLSHPDTLHLGAL 71
           +V+I G   + DCG    ++DH   P   + L+        + I  V+++H    H+GAL
Sbjct: 20  VVTIGGKRVMFDCGMHMGYHDHRHYPDFARALAAWGAPDFTTAISCVVITHFHLDHIGAL 79

Query: 72  PYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDDIDSAFQSVTRLT 130
           PY  +  G   P++ T P   L    + D + ++  Q  E + ++ +DI    + V  + 
Sbjct: 80  PYFTEICGYHGPIYMTYPTKALAPFMLEDYRKVTMDQRGEEEQYSYEDILRCMKKVIPMD 139

Query: 131 YSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLES 190
             Q   +    + +V+  + AGH++G  +         ++Y  DYN   ++HL    ++ 
Sbjct: 140 LKQTIQVD---KDLVIRAYYAGHVIGAAMIYAKVGDAAMVYTGDYNMTPDRHLGAAQIDH 196

Query: 191 FVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILE 249
            ++  +LIT++  A   +  +  RE  F  A+ K +  GG VL+P  + GR  EL ++L+
Sbjct: 197 -LKLDLLITESTYAKTIRDSKHAREREFLKAVHKCVSGGGKVLIPTFALGRAQELCMLLD 255

Query: 250 DYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINK 309
           DYW    L  PIYF   ++     Y K  + W    I  S      N F  KHV     +
Sbjct: 256 DYWERMDLKVPIYFSAGLTIQANVYYKMLIGWTSQKIKDSHAVH--NPFDFKHVCHF-ER 312

Query: 310 SELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPP 369
           S ++N   GP ++ A+   +  GFS + F +WA   KNL+        GT+   L    P
Sbjct: 313 SFINNP--GPCVLFATPGMISGGFSLEAFKKWAPSEKNLITLPGYCVSGTIGHKLMCGKP 370

Query: 370 PK 371
            +
Sbjct: 371 TR 372


>gi|255542245|ref|XP_002512186.1| Cleavage and polyadenylation specificity factor 73 kDa subunit,
           putative [Ricinus communis]
 gi|223548730|gb|EEF50220.1| Cleavage and polyadenylation specificity factor 73 kDa subunit,
           putative [Ricinus communis]
          Length = 361

 Score =  135 bits (340), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 105/352 (29%), Positives = 171/352 (48%), Gaps = 42/352 (11%)

Query: 2   GTSVQVTPLSGVFNENPLSYL-VSIDGFNFLIDCG------------WNDHFDPSLLQPL 48
           G  + VTPL G  NE   S + +S  G   L DCG            + D  DPS     
Sbjct: 21  GDVLTVTPL-GAGNEVGRSCVYMSYKGKIVLFDCGIHPAYSGMAALPYFDEIDPS----- 74

Query: 49  SKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSR 105
                TID +L++H    H  +LPY +++      VF   +T+ +Y+L LLT    Y+  
Sbjct: 75  -----TIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKL-LLT---DYVKV 125

Query: 106 RQVSEFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
            +VS  D LF   DI+ +   +  +    ++H + +  GI    + AGH+LG  ++ +  
Sbjct: 126 SKVSIEDMLFDEQDINRSMDKIEVI----DFHQTVEVNGIKFWCYTAGHVLGAAMFMVDI 181

Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
            G  ++Y  DY+R +++HL    +  F     +I   Y    +QP   + + F D I  T
Sbjct: 182 AGVRLLYTGDYSREEDRHLRAAEMPQFSPDICIIESTYGVQLHQPRHIREKRFTDVIHST 241

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWM 282
           +  GG VL+P  + GR  ELLLIL++YW+ H    N PIY+ + ++   +   ++++  M
Sbjct: 242 ISQGGRVLIPAFALGRAQELLLILDEYWSNHPELHNVPIYYASPLAKKCMTVYQTYILSM 301

Query: 283 GDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFS 334
            + I   F  S  N F  KH++ L +  +  +   GP +V+AS   L++G S
Sbjct: 302 NERIRNQFANS--NPFKFKHISPLNSIEDFTDV--GPSVVMASPGGLQSGLS 349


>gi|146170679|ref|XP_001017643.2| metallo beta lactamase domain containing protein [Tetrahymena
           thermophila]
 gi|146145062|gb|EAR97398.2| metallo beta lactamase domain containing protein [Tetrahymena
           thermophila SB210]
          Length = 675

 Score =  135 bits (340), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 94/338 (27%), Positives = 154/338 (45%), Gaps = 33/338 (9%)

Query: 50  KVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRL--GLLTMYDQYLSRRQ 107
           K    ID VL+SH    H+GALPY  +      P++ T P   L   +   + + ++  Q
Sbjct: 68  KWDQIIDLVLISHFHLDHIGALPYFTEIYNYDGPIYMTSPTKALLPYMCEDFRKVITESQ 127

Query: 108 VSEFD--------------------LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVA 147
             EF                     ++T ++I   FQ    +   +   ++G    I + 
Sbjct: 128 KKEFTDDSIPQTPAQKIINDSRYPLIYTQENIQKCFQKAKTIQLLETIDVNG----IKIK 183

Query: 148 PHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDA-YNALH 206
           P+ AGH+LG  ++ I      V+Y  D++   ++HL    +E  V+P +LI++  Y  + 
Sbjct: 184 PYYAGHVLGACMFMIEYRNVKVVYTGDFHSNADRHLGAAWIEK-VKPDLLISECTYGTII 242

Query: 207 NQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTY 266
               R + + F   I +T+  GG VL+PV + GR  EL ++LE YW       P+YF   
Sbjct: 243 RDSKRAREKNFLKQIQETIDQGGKVLIPVFALGRAQELCILLETYWQRTQSQVPVYFAAG 302

Query: 267 VSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASM 326
           +      Y K F+ W  + I  S+ T  DN F  K++    ++S +    +GP ++ A+ 
Sbjct: 303 MIEKANFYYKLFVNWTNEKIKSSYLT--DNMFDFKYIKPF-SRSLI--KTNGPMVLFATP 357

Query: 327 ASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML 364
             L AG S  +F EW  D KN ++       GTL  +L
Sbjct: 358 GMLHAGLSMQVFKEWCYDEKNTLIIPGYCVAGTLGCVL 395


>gi|328704356|ref|XP_001945120.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like [Acyrthosiphon pisum]
          Length = 694

 Score =  135 bits (339), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 98/372 (26%), Positives = 187/372 (50%), Gaps = 26/372 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +         P   +  A+ ID +L++H    H GALP+ + +  
Sbjct: 39  VMEFKGKKIMLDCGIHPGLQGLDALPFVDLIEANEIDLLLITHFHLDHSGALPWFLLKTK 98

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQV-SEFDLFTLDDIDSAFQSVTRLTYSQNY 135
                +   +T+ +YR  L      Y+    + +E  L+T  D++ +   +  +    N+
Sbjct: 99  FKGKCYMTHATKAIYRWLL----SDYIKVSNIGTEQMLYTEADLEKSMDRIETI----NF 150

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H      GI    + AGH+LG  ++ I   G  V+Y  D++R++++HL    +    RP 
Sbjct: 151 HEEKDVGGIRFCAYNAGHVLGAAMFMIEIAGVKVLYTGDFSRQEDRHLMAAEIPP-SRPE 209

Query: 196 VLITDAYNALH-NQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LIT++    H ++   ++   F   ++  +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 210 ILITESTYGTHIHEKREERERRFTMLVNDIVNRGGRCLIPVFALGRAQELLLILDEYWGL 269

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I +  + + +N F+ KH+T   N   +
Sbjct: 270 HPELHDIPIYYASSLAKKCMAVYQTYINAMNDRIKR--QIAVNNPFVFKHIT---NLKSI 324

Query: 313 DNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPK 371
           D+  D GP +++AS   +E+G S ++F  W +D KN V+       GTLA+ + ++  P+
Sbjct: 325 DHFEDIGPCVIMASPGVMESGLSRELFEMWCTDSKNGVIIAGYVVQGTLAKAILSE--PE 382

Query: 372 AVKVTMSRRVPL 383
            +     +++PL
Sbjct: 383 DITTMTGQKLPL 394


>gi|344302811|gb|EGW33085.1| hypothetical protein SPAPADRAFT_66091 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 762

 Score =  135 bits (339), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 96/332 (28%), Positives = 170/332 (51%), Gaps = 25/332 (7%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYR------LGLLTMYDQYL 103
           S +D +L+SH    H  +LPY M+Q      VF   +T+ +YR      + + ++    +
Sbjct: 62  SKVDILLISHFHLDHAASLPYVMQQTTFKGRVFMTQATKAIYRWLLQDFVRVTSIGTTKM 121

Query: 104 SRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKIT 163
              +    +L+T DDI  +F  +  +    +YH + + EGI    + AGH+LG  ++ I 
Sbjct: 122 EGGEGQSSNLYTADDIMKSFDRIETI----DYHSTMEIEGIKFTAYHAGHVLGACMYFIE 177

Query: 164 KDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAY---NALHNQPPRQQREMFQDA 220
             G  V++  DY+R + +HL+   +   V+P +LI+++      L ++   +++    + 
Sbjct: 178 IGGLKVLFTGDYSREENRHLHAAEIPP-VKPDILISESTFGTGTLESKADLEKK--LTNH 234

Query: 221 ISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA--EHSLNYPIYFLTYVSSSTIDYVKSF 278
           I  TL  GG VLLPV + G   ELLLIL++YW   E   N  +Y+ + ++   +   +++
Sbjct: 235 IHATLTKGGRVLLPVFALGNTQELLLILDEYWNNNEDLQNINVYYASSLAKKCMAVYETY 294

Query: 279 LEWMGDSITKSFETS--RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHD 336
              M D I  S  +S  + N F  K++  + +  +  +   GP +V+A+   L+AG S  
Sbjct: 295 TSIMNDKIRLSASSSGHKSNPFDFKYIKSIRDLGKFQDM--GPSVVIAAPGMLQAGISRQ 352

Query: 337 IFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           +  +WA D KNLV+ T     GT+A+ L  +P
Sbjct: 353 LLEKWAPDPKNLVILTGYSVEGTMAKELLKEP 384


>gi|225560694|gb|EEH08975.1| cleavage and polyadenylation specificity factor subunit 2
           [Ajellomyces capsulatus G186AR]
          Length = 1010

 Score =  135 bits (339), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 131/540 (24%), Positives = 207/540 (38%), Gaps = 126/540 (23%)

Query: 8   TPLSGVFNE--NPLSYLVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPD 64
           TPL G  +     +  ++ +DG    L+D GW++ FD S L  L +   T+  VLL+H  
Sbjct: 5   TPLLGAQSSGSRAVQSILELDGGVKILVDVGWDESFDVSALAELERQIPTLSLVLLTHAT 64

Query: 65  TLHLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLSRRQVSEF----------- 111
             H+GA  +  K   L    P+++T PV  LG   + D Y S    + F           
Sbjct: 65  PSHIGAFAHCCKTFPLFNQIPIYATSPVIALGRTLLQDLYSSAPLAATFLPKATSADSSP 124

Query: 112 ------------DLFTLDDIDSA---------------FQSVTRLTYSQNYHLSGKG--- 141
                       D   +D  DS                F  +  L YSQ +         
Sbjct: 125 SSPISSRAENVADTANIDHNDSPRILLPPPTTEEIARYFSLIHPLKYSQPHQPLPSPFSP 184

Query: 142 --EGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------V 187
              G+ +  + AGH +GGT+W I    E +IYAVD+N+ +E  + G             V
Sbjct: 185 PLNGLTLTAYNAGHTVGGTIWHIQHGMESIIYAVDWNQARENVIAGAAWFGGSGASGTEV 244

Query: 188 LESFVRPAVLIT-----DAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVL 242
           +E   +P   +      D ++ L  +  R   ++  D I      GG VL+P D++ R L
Sbjct: 245 VEQLRKPTAFVCSTRGGDKFSLLGGRKKRD--DLLMDMIRNCFSKGGTVLIPTDTSARAL 302

Query: 243 ELLLILEDYWAEHS---------LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
           EL  +LE  W E +          +  +Y        T+   +S LEWM + I + FE  
Sbjct: 303 ELAYVLEHAWRESAETADGEDPLKSGELYLAGKKGYGTMRLARSMLEWMDEGIVREFEAG 362

Query: 294 RD------------------------------------NAFLLKHVTLLINKSELDN--A 315
                                                   F  KH+ ++  K++L+    
Sbjct: 363 HGGDPVAAGGKGRQDGPNQRTPSAAMTDKRGDSSFKNLGPFTFKHLKIVERKAKLEKILG 422

Query: 316 PDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKV 375
            + PK++L S  SL+ G+S  +  + AS  +NLV+ TE   F           P K +  
Sbjct: 423 SNTPKVILTSDTSLDWGYSKHVLQKIASGSENLVILTE--SFSV--------SPNKQMVD 472

Query: 376 TMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANN 435
               R  L  E    YEE +  +  E  +   L+++  S   L    ++   P+  DAN+
Sbjct: 473 NFRFRPSLAHEIWTIYEERKDGVSSETTVNGELLEQVHSGGRLLTVTDVEKTPL--DAND 530



 Score = 44.3 bits (103), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 52/199 (26%), Positives = 77/199 (38%), Gaps = 59/199 (29%)

Query: 534 VLVHGSAEATEHLKQHCLKHVCPH--------------VYTPQIEETIDVTSDLCAYKVQ 579
           +L  G  E TE L   C   +                 ++TP I ET+D + D  A+ V+
Sbjct: 789 ILTAGLKEETEALAAECRNLLTAKAGLELGSSSQSVVDIFTPVIGETVDASVDTNAWMVK 848

Query: 580 LSEKLMSNVLFKKLGDYEIAWVD--------------AEVGKTENG-------------- 611
           LS  +    L  +L   E    D              +E   +  G              
Sbjct: 849 LSSVV---ALTGELRGPEPMVADEDGPGMSQKKQRMFSENASSSEGIEQKQLVPRKHSFP 905

Query: 612 MLSLLPISTPAPPH---KSVLVGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKV 667
           +L +LP++  A      + + VGDL++ADL+  + S G   EF G G L    +V +RK 
Sbjct: 906 LLDVLPVNMAAATRSVTRPLHVGDLRLADLRKLMQSSGHTAEFRGEGTLLIDGFVAVRK- 964

Query: 668 GPAGQKGGGSGTQQIVIEG 686
                    SGT +I IEG
Sbjct: 965 ---------SGTGKIEIEG 974


>gi|46360445|gb|AAS80153.1| ACT11D09.9 [Cucumis melo]
          Length = 708

 Score =  135 bits (339), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 100/361 (27%), Positives = 175/361 (48%), Gaps = 21/361 (5%)

Query: 22  LVSIDGFNFLIDCGWN----DHF---DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
           +V+I+G   + DCG +    DH    D S +       +T+  ++++H    H+GALPY 
Sbjct: 52  VVTINGKRIMFDCGMHLGYVDHRRYPDFSRISASRDYNNTLSCIIITHFHLDHIGALPYF 111

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTM--YDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYS 132
            +  G + P++ T P   L  +T+  Y + +  R+  E + FT D I    + V  +   
Sbjct: 112 TEICGYNGPIYMTYPTMALAPITLEDYRKVMVDRR-GEAEQFTNDHIMECLKKVVPVDLK 170

Query: 133 QNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFV 192
           Q   +    E + +  + AGH+LG  ++        ++Y  DYN   ++HL    ++  +
Sbjct: 171 QTIQVD---EDLQIRAYYAGHVLGAAMFYAKVGDAAMVYTGDYNMTPDRHLGAAQIDR-M 226

Query: 193 RPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRV-LELLLILED 250
           +  +LIT++  A   +  +  RE  F  A+   L +GG VL+P  + GR   EL ++L+D
Sbjct: 227 QLDLLITESTYATTIRDSKYAREREFLKAVHNCLASGGKVLIPTFALGRAQQELCVLLDD 286

Query: 251 YWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKS 310
           YW   +L +PIY    ++     Y K  + W    + +++ T   NAF  K+V    ++S
Sbjct: 287 YWERMNLKFPIYVSAGLTVQANMYYKMLISWTSQKVKETYTTR--NAFDFKNVQKF-DRS 343

Query: 311 ELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPP 370
            +D AP GP ++ A+   + +GFS ++F  WA    NL+        GT+   L +  P 
Sbjct: 344 MID-AP-GPCVLFATPGMISSGFSLEVFKRWAPSKLNLITLPGYCVAGTVGHKLMSGKPT 401

Query: 371 K 371
           K
Sbjct: 402 K 402


>gi|85079519|ref|XP_956368.1| hypothetical protein NCU03479 [Neurospora crassa OR74A]
 gi|74630409|sp|Q8WZS6.1|YSH1_NEUCR RecName: Full=Endoribonuclease ysh-1; AltName: Full=mRNA
           3'-end-processing protein ysh-1
 gi|18376069|emb|CAD21097.1| related to BRR5 (component of pre-mRNA polyadenylation factor PF I)
           [Neurospora crassa]
 gi|28917429|gb|EAA27132.1| hypothetical protein NCU03479 [Neurospora crassa OR74A]
          Length = 850

 Score =  135 bits (339), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 103/382 (26%), Positives = 184/382 (48%), Gaps = 30/382 (7%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           +++   G   ++D G +  +D     P       ST+D +L+SH    H  +LPY + + 
Sbjct: 40  HIIQYKGKTVMLDAGQHPAYDGLAALPFFDDFDLSTVDVLLISHFHIDHAASLPYVLAKT 99

Query: 79  GLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNY 135
                VF   +T+ +Y+  +        +        ++T +D    F  +  + Y+  +
Sbjct: 100 NFRGRVFMTHATKAIYKWLIQDSVRVGNTSSNPQSSLVYTEEDHLKTFPMIEAIDYNTTH 159

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
            +S     I + P+ AGH+LG  ++ I   G  + +  DY+R +++HL    +   V+  
Sbjct: 160 TISS----IRITPYPAGHVLGAAMFLIEIAGLKIFFTGDYSREEDRHLISAKVPKGVKID 215

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT++   + +  PR +RE     +I+  L  GG VL+PV + GR  ELLLIL++YW +
Sbjct: 216 VLITESTYGIASHIPRPEREQALMKSITGILNRGGRVLMPVFALGRAQELLLILDEYWGK 275

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF-------ETSRDNA-----FLL 300
           H+    YPIY+ + ++   +   ++++  M D+I + F       E+S D A     +  
Sbjct: 276 HAEYQKYPIYYASNLARKCMLVYQTYVGSMNDNIKRLFRERLAESESSGDGAGKGGPWDF 335

Query: 301 KHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
           + +  L     LD   D G  ++LAS   L+ G S ++   WA   KN V+ T     GT
Sbjct: 336 RFIRSL---KSLDRFEDVGGCVMLASPGMLQNGVSRELLERWAPSEKNGVIITGYSVEGT 392

Query: 360 LARMLQADPPPKAVKVTMSRRV 381
           +A+ L  +  P+ ++  MSR +
Sbjct: 393 MAKQLLQE--PEQIQAVMSRNI 412


>gi|336468884|gb|EGO57047.1| hypothetical protein NEUTE1DRAFT_84705 [Neurospora tetrasperma FGSC
           2508]
 gi|350288819|gb|EGZ70044.1| Endoribonuclease ysh-1 [Neurospora tetrasperma FGSC 2509]
          Length = 853

 Score =  135 bits (339), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 103/382 (26%), Positives = 184/382 (48%), Gaps = 30/382 (7%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           +++   G   ++D G +  +D     P       ST+D +L+SH    H  +LPY + + 
Sbjct: 40  HIIQYKGKTVMLDAGQHPAYDGLAALPFFDDFDLSTVDVLLISHFHIDHAASLPYVLAKT 99

Query: 79  GLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNY 135
                VF   +T+ +Y+  +        +        ++T +D    F  +  + Y+  +
Sbjct: 100 NFRGRVFMTHATKAIYKWLIQDSVRVGNTSSNPQSSLVYTEEDHLKTFPMIEAIDYNTTH 159

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
            +S     I + P+ AGH+LG  ++ I   G  + +  DY+R +++HL    +   V+  
Sbjct: 160 TISS----IRITPYPAGHVLGAAMFLIEIAGLKIFFTGDYSREEDRHLISAKVPKGVKID 215

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT++   + +  PR +RE     +I+  L  GG VL+PV + GR  ELLLIL++YW +
Sbjct: 216 VLITESTYGIASHIPRPEREQALMKSITGILNRGGRVLMPVFALGRAQELLLILDEYWGK 275

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF-------ETSRDNA-----FLL 300
           H+    YPIY+ + ++   +   ++++  M D+I + F       E+S D A     +  
Sbjct: 276 HAEYQKYPIYYASNLARKCMLVYQTYVGSMNDNIKRLFRERLAESESSGDGAGKGGPWDF 335

Query: 301 KHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
           + +  L     LD   D G  ++LAS   L+ G S ++   WA   KN V+ T     GT
Sbjct: 336 RFIRSL---KSLDRFEDVGGCVMLASPGMLQNGVSRELLERWAPSEKNGVIITGYSVEGT 392

Query: 360 LARMLQADPPPKAVKVTMSRRV 381
           +A+ L  +  P+ ++  MSR +
Sbjct: 393 MAKQLLQE--PEQIQAVMSRNI 412


>gi|325088985|gb|EGC42295.1| cleavage and polyadenylation specific subunit [Ajellomyces
           capsulatus H88]
          Length = 1010

 Score =  135 bits (339), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 130/538 (24%), Positives = 206/538 (38%), Gaps = 122/538 (22%)

Query: 8   TPLSGV--FNENPLSYLVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPD 64
           TPL G        +  ++ +DG    L+D GW++ FD S L  L +   T+  VLL+H  
Sbjct: 5   TPLLGAQSSGSRAVQSILELDGGVKILVDVGWDESFDVSALAELERQIPTLSLVLLTHAT 64

Query: 65  TLHLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLSRRQVSEF----------- 111
             H+GA  +  K   L    P+++T PV  LG   + D Y S    + F           
Sbjct: 65  PSHIGAFAHCCKTFPLFNQIPIYATSPVIALGRTLLQDLYSSAPLAATFLSKATSADSSP 124

Query: 112 ------------DLFTLDDIDSA---------------FQSVTRLTYSQNYHLSGKG--- 141
                       D   +D  DS                F  +  L YSQ +         
Sbjct: 125 SSPISSRAENVADTANIDHNDSPRILLPPPTSEEIARYFSLIHPLKYSQPHQPLPSPFSP 184

Query: 142 --EGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------V 187
              G+ +  + AGH +GGT+W I    E +IYAVD+N+ +E  + G             V
Sbjct: 185 PLNGLTLTAYNAGHTVGGTIWHIQHGMESIIYAVDWNQARENVIAGAAWFGGSGASGTEV 244

Query: 188 LESFVRPAVLITDAYNALHNQPP--RQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLEL 244
           +E   +P   +           P  R++R ++  D I      GG VL+P D++ R LEL
Sbjct: 245 VEQLRKPTAFVCSTRGGDKFSLPGGRKKRDDLLMDMIRNCFSKGGTVLIPTDTSARALEL 304

Query: 245 LLILEDYWAEHS---------LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
             +LE  W E +          +  +Y        T+   +S LEWM + I + FE    
Sbjct: 305 AYVLEHAWRESAETADGEDPLKSGELYLAGKKGYGTMRLARSMLEWMDEGIVREFEAGHG 364

Query: 296 ------------------------------------NAFLLKHVTLLINKSELDN--APD 317
                                                 F  KH+ ++  K++++     +
Sbjct: 365 GDPVAAGGKGRQDGPNQRTPSAAMTDKRGDSSFKNLGPFTFKHLKIVERKAKIEKILGSN 424

Query: 318 GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTM 377
            PK++L S  SL+ G+S  +  + AS  +NLV+ TE   F           P K +   +
Sbjct: 425 TPKVILTSDTSLDWGYSKHVLQKIASGSENLVILTE--SFSV--------SPNKQMVDGI 474

Query: 378 SRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANN 435
             R  L  E    YEE +  +  E  +   L+++  S   L    ++   P+  DAN+
Sbjct: 475 RSRPSLAHEIWTIYEERKDGVSSETTINGELLEQVHSGGRLLTVTDVEKTPL--DAND 530



 Score = 44.3 bits (103), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 50/199 (25%), Positives = 76/199 (38%), Gaps = 59/199 (29%)

Query: 534 VLVHGSAEATEHLKQHCLKHVCPH--------------VYTPQIEETIDVTSDLCAYKVQ 579
           +L  G  E TE L   C   +                 ++TP I ET+D + D  A+ V+
Sbjct: 789 ILTAGLKEETEALAAECRNLLTAKAGLELGSSSQSVVDIFTPVIGETVDASVDTNAWMVK 848

Query: 580 LS----------------------------EKLMSNVLFKKLGDYEIAWVDAEVGKTENG 611
           LS                            +++ S       G+ +   V     K    
Sbjct: 849 LSSVVALTGELRGPEPMVADEDGPGMSQKKQRMFSENASSSEGNEQKQLVPR---KHSFP 905

Query: 612 MLSLLPISTPAPPH---KSVLVGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKV 667
           +L +LP++  A      + + VGDL++ADL+  + S G   EF G G L    +V +RK 
Sbjct: 906 LLDVLPVNMAAATRSVTRPLHVGDLRLADLRKLMQSSGHTAEFRGEGTLLIDGFVAVRK- 964

Query: 668 GPAGQKGGGSGTQQIVIEG 686
                    SGT +I IEG
Sbjct: 965 ---------SGTGKIEIEG 974


>gi|344229479|gb|EGV61364.1| hypothetical protein CANTEDRAFT_98614 [Candida tenuis ATCC 10573]
          Length = 943

 Score =  134 bits (338), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 112/385 (29%), Positives = 175/385 (45%), Gaps = 39/385 (10%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDG-FNFLIDCGWN--DHFDPSLLQPLSKVASTIDA 57
           M T   +TP  G    +  + L++IDG  N L D  WN  DH D   LQ   K   +++ 
Sbjct: 1   MFTFTLLTPADG---HSSKASLMTIDGDVNILADISWNGKDHHDLDYLQDTLK---SVNL 54

Query: 58  VLLSHPDTLHLGA---LPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD-- 112
           VLLSH     +G    L     +L  +  V++T  V +LG ++  + Y S   +      
Sbjct: 55  VLLSHSTPEFIGGYALLCLKFPELMKNIKVYATSAVSQLGRVSTVELYRSVGLIGPLKDA 114

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
           +  + D+D  F  V  L Y   Y  +   E + + P+ +GH LGG+ W + +  E +IYA
Sbjct: 115 VLEVSDVDEYFDRVISLKY---YQSTNALERLAITPYNSGHTLGGSFWLLQRKLEKIIYA 171

Query: 173 VDYNRRKE---------KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISK 223
             +N  K+             G  L   VRP  L+T   +   N   +++ E F   +  
Sbjct: 172 PSWNHSKDSFLSAASFLSSSTGNPLSQLVRPTALVT-GTDVGSNLSHKKRSEKFLQLVDG 230

Query: 224 TLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMG 283
           TL  GG VLLP   +GR LELL +++++    S   P+ FL+Y  ++ + Y  + LEWM 
Sbjct: 231 TLANGGTVLLPTTISGRFLELLHLVDEHL--QSAPIPVLFLSYSGTNVLRYATNLLEWMS 288

Query: 284 DSITKSFETSRD---NAFLLKHVTLLINKSELDNAP------DGPKLVLASMASLEAG-F 333
            S++K  E +     N     H     +K +L + P       GPK+V  S   L +G  
Sbjct: 289 PSLSKELENANSIVTNTGNRNHFPFDPSKVDLVSTPYELTQMAGPKVVFTSGVDLNSGEL 348

Query: 334 SHDIFVEWASDVKNLVLFTERGQFG 358
           S +      +D K  ++ TE+  FG
Sbjct: 349 SSEALRVLCNDEKTTIILTEKTHFG 373


>gi|291000374|ref|XP_002682754.1| predicted protein [Naegleria gruberi]
 gi|284096382|gb|EFC50010.1| predicted protein [Naegleria gruberi]
          Length = 458

 Score =  134 bits (338), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 100/377 (26%), Positives = 173/377 (45%), Gaps = 26/377 (6%)

Query: 22  LVSIDGFNFLIDCG----WNDHF---DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
           +V+I     + DCG    +ND     D   +    +   TID V++SH    H GALPY 
Sbjct: 13  IVTIGRKTIMFDCGMHMGYNDERRFPDFKFISKNGQFTQTIDCVIISHFHLDHCGALPYF 72

Query: 75  MKQLGLSAPVFSTEPVYRLG--LLTMYDQYLSRRQVSEFD--LFTLDDIDSAFQSVTRLT 130
            +  G   P++ T P   +   LL  + + +  R+    +   F+ +D+ +  + V  L 
Sbjct: 73  TEVCGYDGPIYMTYPTKAIAPILLEDFRRVMVDRKGDNLNQGFFSSEDVKNCIKKVQPLN 132

Query: 131 YSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKD---GEDVIYAVDYNRRKEKHLNGTV 187
             Q   L  + E   + P+ AGH+LG  ++ + KD   G  V+Y  DYN   ++HL    
Sbjct: 133 LHQTIILDDELE---IKPYYAGHVLGAAMFYV-KDLATGASVVYTGDYNMTADRHLGSAT 188

Query: 188 LESFVRPAVLITDAYNA--LHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELL 245
           ++   RP +LIT+   A  + +    ++R+  +      +   G VL+PV + GRV EL 
Sbjct: 189 IDR-CRPDLLITETTYATTIRDSKSSRERDFCKQVYDTVVNKKGKVLIPVFALGRVQELC 247

Query: 246 LILEDYWAEHSL--NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHV 303
           ++LE YW   +L  + PIYF   +      Y + ++ W  + I  +    + N F   ++
Sbjct: 248 ILLETYWERKNLGKSVPIYFSAGMVEKANYYYQLYINWTNEKIKTTLFDQKRNLFNFSNI 307

Query: 304 TLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARM 363
                +  +DN   GP ++ A+   L AG S ++F +WA    N V+       GT+   
Sbjct: 308 QSF-ERFLMDNP--GPMVLFATPGMLHAGMSLEVFKKWAPGENNKVILPGYCVEGTVGNK 364

Query: 364 LQADPPPKAVKVTMSRR 380
           +  +   K+ K+ +  R
Sbjct: 365 VLRNKDLKSSKIEIDSR 381


>gi|115479027|ref|NP_001063107.1| Os09g0397900 [Oryza sativa Japonica Group]
 gi|50252615|dbj|BAD28786.1| putative FEG protein [Oryza sativa Japonica Group]
 gi|113631340|dbj|BAF25021.1| Os09g0397900 [Oryza sativa Japonica Group]
 gi|218202115|gb|EEC84542.1| hypothetical protein OsI_31281 [Oryza sativa Indica Group]
 gi|222641522|gb|EEE69654.1| hypothetical protein OsJ_29268 [Oryza sativa Japonica Group]
          Length = 559

 Score =  134 bits (338), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 97/356 (27%), Positives = 160/356 (44%), Gaps = 20/356 (5%)

Query: 27  GFNFLIDCGWN---------DHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQ 77
           G   + DCG +           FD  L    +   + I  V+++H    H+GALPY  + 
Sbjct: 26  GKRVMFDCGMHMGHRDSRRYPDFDRLLADGAADYTAAISCVVITHFHLDHIGALPYFTEV 85

Query: 78  LGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYH 136
            G   PV+ T P   L  L + D + +      E + ++ +DI    + V  L   Q   
Sbjct: 86  CGYHGPVYMTYPTKALAPLMLEDYRKVMVDHRGEEEQYSYEDILRCMRKVIPLDLKQTIQ 145

Query: 137 LSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAV 196
           +    + + +  + AGH+LG  +         ++Y  DYN   ++HL    ++  ++  +
Sbjct: 146 VD---KDLSIRAYYAGHVLGAAMIYAKVGDAAIVYTGDYNMTPDRHLGAAQIDR-LKLDL 201

Query: 197 LITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEH 255
           LIT++  A   +  +  RE  F  A+ K +  GG VL+P  + GR  EL ++L+DYW   
Sbjct: 202 LITESTYAKTVRDSKHAREREFLKAVHKCVSGGGKVLIPAFALGRAQELCILLDDYWERM 261

Query: 256 SLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNA 315
           +L  PIYF   ++     Y K  + W    I  S+     N F  KHV     +S ++N 
Sbjct: 262 NLKIPIYFSAGLTIQANMYYKMLIGWTSQKIKNSYTVH--NPFDFKHVCHF-ERSFINNP 318

Query: 316 PDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPK 371
             GP ++ A+   +  GFS ++F +WA   KNLV        GT+   L +  P +
Sbjct: 319 --GPCVLFATPGMISGGFSLEVFKKWAPSEKNLVTLPGYCVAGTIGHKLMSGKPTR 372


>gi|123439147|ref|XP_001310348.1| RNA-metabolising metallo-beta-lactamase family protein [Trichomonas
           vaginalis G3]
 gi|121892114|gb|EAX97418.1| RNA-metabolising metallo-beta-lactamase family protein [Trichomonas
           vaginalis G3]
          Length = 679

 Score =  134 bits (337), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 98/349 (28%), Positives = 165/349 (47%), Gaps = 23/349 (6%)

Query: 31  LIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTE 88
           ++DCG +  ++     P       + ID +L++H    H+ A+P+ + Q   S P F T 
Sbjct: 37  MLDCGIHPAYENFGGLPFIDAIDPAKIDVLLITHFHIDHITAVPWFLTQTNFSGPCFMTH 96

Query: 89  PVYRLGLLTMYDQY-LSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVA 147
               +    + D   +S R   E +LFT  D+ +    +T +    NYH +   +GI + 
Sbjct: 97  TTKTISKTLLVDYVGVSGRGSEEPNLFTRADVANVQNMITAV----NYHQTVTHQGIKMT 152

Query: 148 PHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLES-----FVRPAVLITDAY 202
            + AGH+LG  +W +  DG  V+Y  D++   E+HL G  +        +RP VLI ++ 
Sbjct: 153 CYPAGHVLGACMWLVEIDGVKVLYTGDFSLENERHLQGAEIPKSLSGEIIRPDVLIMEST 212

Query: 203 NALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL--NY 259
           + L     R  RE  F D ++K ++ GG  L+P+ + GR  ELL+IL++YW  H      
Sbjct: 213 HGLARIESRVDREYRFIDNVTKIIKRGGRCLIPIFALGRAQELLIILDEYWESHPEYNGV 272

Query: 260 PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGP 319
           PIY+ + ++   I    +F +     +  +        F   +V   I   + D++   P
Sbjct: 273 PIYYGSNLAKQAIAAYNAFYQDHNSRVVTA-----KGKFEFSYVK-YIRDYDFDDSL--P 324

Query: 320 KLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
            +VL S A L+ G S  IF  W S+  N ++       GTL ++L  +P
Sbjct: 325 CVVLCSPAMLQNGMSRKIFEAWCSNSVNGLIIPGYIVDGTLPQVLMKNP 373


>gi|453087099|gb|EMF15140.1| Metallo-hydrolase/oxidoreductase [Mycosphaerella populorum SO2202]
          Length = 845

 Score =  134 bits (337), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 95/340 (27%), Positives = 161/340 (47%), Gaps = 30/340 (8%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGL---LTMYDQYLSRR 106
           ST+D +L++H    H  +LPY + +   +  VF T P   +Y+      + +++ +    
Sbjct: 76  STVDLLLITHFHQDHSASLPYVLSKTNFAGKVFMTHPTKAIYKWTTQDAVRVHNTHAPAS 135

Query: 107 QVSEFD-----LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWK 161
             S  D     L+T  DI S    +  +++    H +    GI   P+ AGH+LG  ++ 
Sbjct: 136 STSGTDGYVSQLYTEQDILSTLPMIQTISF----HTTHSHNGIRFTPYPAGHVLGACMYL 191

Query: 162 ITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDA 220
           I   G +V++  DY+R  ++HL    +   V+   LIT++   +  + PRQ+RE     +
Sbjct: 192 IEIAGLNVLFTGDYSRENDRHLIPAAVPRNVKVDCLITESTFGISTRTPRQERENALIKS 251

Query: 221 ISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSF 278
           I+  L  GG VL+P  + G   ELLLILED+W  H     +PIY+ + ++   +   +++
Sbjct: 252 ITTILNRGGRVLMPTTAVGNTQELLLILEDHWHRHEEYRRFPIYYASGLARKVMVVYQTY 311

Query: 279 LEWMGDSITKSFETSRDNAFL----------LKHVTLLINKSELDNAPDGPKLVLASMAS 328
           ++ M D I   F+ S     +           + V  L      D+   G  +VLAS   
Sbjct: 312 VDDMNDRIKAKFQASATGPSVGDGGTAGPWDFQFVRALKGVDRFDDV--GGSVVLASPGM 369

Query: 329 LEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           L+ G S  +   WA D KN V+ T     GT+A+ +  +P
Sbjct: 370 LQNGPSRALLERWAPDSKNGVIITGYSVEGTMAKNILLEP 409


>gi|336259697|ref|XP_003344648.1| hypothetical protein SMAC_07216 [Sordaria macrospora k-hell]
 gi|380088385|emb|CCC13649.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 857

 Score =  134 bits (337), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 100/381 (26%), Positives = 184/381 (48%), Gaps = 28/381 (7%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           +++   G   ++D G +  +D     P       ST+D +L+SH    H  +LPY + + 
Sbjct: 40  HIIQYKGKTVMLDAGQHPAYDGLAALPFFDDFDLSTVDVLLISHFHIDHAASLPYVLAKT 99

Query: 79  GLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNY 135
                VF   +T+ +Y+  +        +    +   ++T +D    F  +  + Y+  +
Sbjct: 100 NFRGRVFMTHATKAIYKWLIQDSVRVGNTSSNPTSSLVYTEEDHLKTFPMIEAIDYNTTH 159

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
            +S     I + P+ AGH+LG  ++ I   G  + +  DY+R +++HL    +   V+  
Sbjct: 160 TISS----IRITPYPAGHVLGAAMFLIEIAGLKIFFTGDYSREEDRHLISAEVPKGVKID 215

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT++   + +  PR +RE     +I+  L  GG VL+PV + GR  ELLLIL++YW +
Sbjct: 216 VLITESTYGIASHIPRVEREQALMKSITGILNRGGRVLMPVFALGRAQELLLILDEYWGK 275

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF-------ETSRDNA-----FLL 300
           H+    YPIY+ + ++   +   ++++  M D+I + F       E+S D A     +  
Sbjct: 276 HAEFQKYPIYYASNLARKCMLVYQTYVGSMNDNIKRLFRERLAESESSGDGAGKGGPWDF 335

Query: 301 KHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTL 360
           K +  L +    ++   G  ++LAS   L+ G S ++   WA   KN V+ T     GT+
Sbjct: 336 KFIRSLKSIDRFEDV--GGCVMLASPGMLQNGVSRELLERWAPSEKNGVIITGYSVEGTM 393

Query: 361 ARMLQADPPPKAVKVTMSRRV 381
           A+ +  +  P  ++  MSR +
Sbjct: 394 AKHIMQE--PDTIQAVMSRNI 412


>gi|281201684|gb|EFA75892.1| integrator complex subunit 11 [Polysphondylium pallidum PN500]
          Length = 648

 Score =  134 bits (336), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 105/409 (25%), Positives = 181/409 (44%), Gaps = 48/409 (11%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++V PL    +      +VSI   N + DCG +  +       D S +    +    +D 
Sbjct: 3   IKVVPLGAGQDVGRSCVIVSIGNKNIMFDCGMHMGYHDERRFPDFSFISKTKQFTKVLDC 62

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTE--------PVYRLG--------------- 94
           V+++H    H GALPY  +  G   P++ T          +Y+                 
Sbjct: 63  VIITHFHLDHCGALPYFTEICGYDGPIYMTVCYKCLISISIYKYNYNSLTFMLQLIQLPT 122

Query: 95  ------LLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAP 148
                 LL  Y + +  R+  E + FT   I    + V  +   Q   +    + + + P
Sbjct: 123 KAIVPILLEDYRKIVVDRK-GETNFFTPQMIKDCMKKVIPVALHQTIDVD---DELSIKP 178

Query: 149 HVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQ 208
           + AGH+LG  ++      E V+Y  DYN   ++HL    +++ V P +LIT+   A   +
Sbjct: 179 YYAGHVLGAAMFYCKVGEESVVYTGDYNMTPDRHLGSAWIDA-VNPTLLITETTYATTIR 237

Query: 209 PPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYV 267
             ++ RE  F   + + +  GG VL+PV + GRV EL ++++ YW +  L+ PIYF   +
Sbjct: 238 DSKRGRERDFLKRVHECVEKGGKVLIPVFALGRVQELCILIDTYWEQMGLSVPIYFSEGL 297

Query: 268 SSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMA 327
           +     Y K F+ W    I ++F   + N F  KH+        L +AP GP ++ A+  
Sbjct: 298 AEKANFYYKLFIGWTNQKIKQTF--VKRNMFDFKHIKPF--DRMLVDAP-GPMVLFATPG 352

Query: 328 SLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLA-RMLQADPPPKAVKV 375
            L AG S ++F +WA    N+ +       GT+  ++L     P+ V++
Sbjct: 353 MLHAGASLEVFKKWAPSELNMTIIPGYCVVGTVGNKLLSNASGPQMVEI 401


>gi|428172766|gb|EKX41673.1| hypothetical protein GUITHDRAFT_74597 [Guillardia theta CCMP2712]
          Length = 615

 Score =  134 bits (336), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 102/370 (27%), Positives = 176/370 (47%), Gaps = 19/370 (5%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQLG 79
           L+   G   + DCG +  +      P      A +ID +L++H    H  ++PY + +  
Sbjct: 41  LLKFKGKTIMFDCGAHPGYRGEESLPFFDEVDAESIDLLLVTHFHVDHAASVPYFLTKTT 100

Query: 80  LSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEF---DLFTLDDIDSAFQSVTRLTYSQNYH 136
               V+ T P   +  L   D ++    +SE     L+T  DI      +  + Y Q   
Sbjct: 101 FKGKVYMTYPTLAICKLVWSD-FIKVSGISEQYGGSLYTEKDIQETVNKIICIDYHQEVE 159

Query: 137 LSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAV 196
           +    EG+    + AGH+LG  ++ +   G  ++Y  DY+R++++HL    + S V+  V
Sbjct: 160 V----EGVKFWCYNAGHVLGACMFIVQIAGVRLLYTGDYSRQEDRHLMAAEMPS-VQVHV 214

Query: 197 LITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEH 255
           L+ ++   +    PR+ RE  F +A+  TL+ GG VLLPV + GR  ELLL+L++YW ++
Sbjct: 215 LVVESTYGVQTHEPRRSREKRFLEAVVSTLQLGGRVLLPVFAIGRAQELLLLLDEYWRKN 274

Query: 256 S--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELD 313
                YPI  L+ ++   I   ++++  M + I    +   +N F  +H+  +   +E  
Sbjct: 275 PELHRYPIICLSGMAKRCIASYQTYINQMNNRIRHLNDI--ENPFEFRHIRYMTTMAEFQ 332

Query: 314 NAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAV 373
           +  + P +V+AS   L+ G S D+F  W     N V+ T      TLA+ L  D  P   
Sbjct: 333 D--NCPCVVMASPGMLQNGPSRDLFDRWCEYRHNSVVITGYCVQNTLAKEL-LDAQPATH 389

Query: 374 KVTMSRRVPL 383
            +   + VPL
Sbjct: 390 TLQDGKEVPL 399


>gi|440795785|gb|ELR16901.1| putative cleavage and polyadenylation specificity factor, putative
           [Acanthamoeba castellanii str. Neff]
          Length = 589

 Score =  134 bits (336), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 76/252 (30%), Positives = 132/252 (52%), Gaps = 8/252 (3%)

Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 193
           NYH   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL      ++  
Sbjct: 19  NYHQQIEANGIKFWCYNAGHVLGAAMFMIEIAGVRILYTGDFSRQEDRHLMAAETPAYTA 78

Query: 194 PAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 253
             V++   Y    ++P  ++   F   +   +R GG  LLPV + GR  ELLLIL++YW 
Sbjct: 79  DIVIVESTYGVQIHEPRIERETRFTKLVHTIVRRGGRCLLPVFALGRAQELLLILDEYWE 138

Query: 254 EHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSE 311
            H      PIY+ + ++   +   ++++  M ++I K F  S  N F+ KH++ L     
Sbjct: 139 AHPELHKVPIYYASSLAKKCMTVYQTYINMMNENIRKQFAVS--NPFVFKHISNLKGMQH 196

Query: 312 LDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPK 371
            D++  GP +V+AS   L++G S ++F +W S+ KN V+       GTLA+ + ++  P 
Sbjct: 197 FDDS--GPCVVMASPGMLQSGLSRELFEKWCSNAKNGVIIPGYCVEGTLAKHIMSE--PS 252

Query: 372 AVKVTMSRRVPL 383
            V     R +PL
Sbjct: 253 EVTAMDGRMLPL 264


>gi|308807807|ref|XP_003081214.1| mRNA cleavage and polyadenylation factor II complex, BRR5 (CPSF
           subunit) (ISS) [Ostreococcus tauri]
 gi|116059676|emb|CAL55383.1| mRNA cleavage and polyadenylation factor II complex, BRR5 (CPSF
           subunit) (ISS) [Ostreococcus tauri]
          Length = 572

 Score =  134 bits (336), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 177/371 (47%), Gaps = 29/371 (7%)

Query: 2   GTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQP-LSKV-ASTIDAVL 59
           G  +++ PL           + +  G   + DCG +  F      P L  V  S +DA+L
Sbjct: 13  GEMLEIIPLGAGSEVGRSCVVATFRGKTLMFDCGIHPGFSGIASLPYLDDVDLSAVDALL 72

Query: 60  LSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDD 118
           ++H    H  A+P+ + +      +F T P   +  + M D   L ++   E  LFT  D
Sbjct: 73  VTHFHLDHCAAVPFLVGRTDFRGRIFMTHPTKAIYHMLMQDFVRLMKQGGGEEPLFTDAD 132

Query: 119 IDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRR 178
           ++++ + +  + + Q   +    +G+ V P+ AGH+LG  ++ +   G  V+Y  DY+R 
Sbjct: 133 LEASMKRIEVVDFHQEIDV----DGVKVTPYRAGHVLGACMFNVDIGGLRVLYTGDYSRI 188

Query: 179 KEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSA 238
            ++HL    + + + P V+I ++   +    PR++RE+                      
Sbjct: 189 ADRHLPAADIPA-IPPHVVIVESTYGVSPHSPREEREIRXXXXXXX-------------- 233

Query: 239 GRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN 296
            R  ELLLILED+WA++      PIY  + ++   +   ++++  +   +  +FE +  N
Sbjct: 234 -RAQELLLILEDFWAQNPDLQRVPIYQASTLARKAMTIYQTYINVLNADMKAAFEEA--N 290

Query: 297 AFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQ 356
            F+  HV  +   SELD+   GP +VLA+ + L++G S ++F  W  + KN V+  +   
Sbjct: 291 PFVFNHVKHISKASELDDV--GPCVVLATPSMLQSGLSRELFESWCEEPKNGVIIADFAV 348

Query: 357 FGTLARMLQAD 367
            GTLAR + +D
Sbjct: 349 QGTLAREILSD 359


>gi|295659367|ref|XP_002790242.1| cleavage and polyadenylation specific factor 2 [Paracoccidioides
           sp. 'lutzii' Pb01]
 gi|226281947|gb|EEH37513.1| cleavage and polyadenylation specific factor 2 [Paracoccidioides
           sp. 'lutzii' Pb01]
          Length = 999

 Score =  133 bits (335), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 113/456 (24%), Positives = 183/456 (40%), Gaps = 110/456 (24%)

Query: 8   TPLSGV--FNENPLSYLVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPD 64
           TPL G    +   +  ++ +DG    L+D GW+  FD S L  L +   T+  +LL+H  
Sbjct: 5   TPLLGAQSSSSRAVQSILELDGGVKILVDVGWDHSFDTSALAELERQIPTLSLILLTHAT 64

Query: 65  TLHLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLSRRQVSEF----------- 111
             H+GA  +  K   L    PV++T PV   G   + D Y S    + F           
Sbjct: 65  PSHIGAFAHCCKTFPLFTQIPVYATSPVIAFGRSLLQDLYASAPLAATFWPPATAGASSP 124

Query: 112 ---------------------------DLFTLDDIDSAFQSVTRLTYSQNYHLSGKG--- 141
                                         + ++I   F  +  L YSQ +         
Sbjct: 125 TSAAASRAAISPESADTDQNERPRILLPPPSTEEIARYFSLIQPLKYSQPHQPLPSPFSP 184

Query: 142 --EGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------V 187
              G+ +  + AGH +GGT+W I    E +IYAVD+N+ +E  + G             V
Sbjct: 185 PLNGLTLTAYNAGHTVGGTIWHIQHGMESIIYAVDWNQARENVIAGASWFGGSGGSGTEV 244

Query: 188 LESFVRPAVLI--TDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLEL 244
           +E   +P  L+  T   + L     R++R ++  D +      GG VL+P+D++ RVLEL
Sbjct: 245 VEQLRKPTALVCSTRGGDKLVLSGGRKRRDDLLLDMLRSCFSKGGTVLIPMDTSARVLEL 304

Query: 245 LLILEDYWAEHS---------LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSR- 294
             +LE  W E +             +Y     +  T+   +S LEWM + I + FE    
Sbjct: 305 AYVLEHAWRESAETADGEDPLKGVGLYLAGRKAHGTMRLARSMLEWMDEGIVREFEAGHG 364

Query: 295 -----------------------------DNA------FLLKHVTLLINKSELDN--APD 317
                                        DNA      F  +H+ ++  K++LD     +
Sbjct: 365 RDPVTGGGKGRSDGPSQRNAPASIPDKKGDNASKGLGPFTFRHLKIVERKTKLDKILGSN 424

Query: 318 GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
            P+++L S  SLE G+S  +  + A+  +NL++ TE
Sbjct: 425 APQVILTSDTSLEWGYSKHVLQKIAAGSENLIILTE 460


>gi|444314085|ref|XP_004177700.1| hypothetical protein TBLA_0A03830 [Tetrapisispora blattae CBS 6284]
 gi|387510739|emb|CCH58181.1| hypothetical protein TBLA_0A03830 [Tetrapisispora blattae CBS 6284]
          Length = 842

 Score =  133 bits (335), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 139/538 (25%), Positives = 250/538 (46%), Gaps = 72/538 (13%)

Query: 22  LVSIDGFNFLIDCGWNDHF--DPSLLQPLSKVASTIDAVLLSHPDTLHLGA---LPYAMK 76
           +V  D    LID  WN         ++  S + S +D +LLS P+   LGA   L Y   
Sbjct: 19  IVRFDSVTLLIDPAWNSSTLSYSQCVKYWSNIISEVDIILLSQPNVDFLGAYSLLYYNFL 78

Query: 77  QLGLSA-PVFSTEPVYRLGLLTMYDQYLSRRQVSEFDL--FTLDDIDSAFQSVTRLTYSQ 133
              +S   V+ST P+  +G ++  D Y S+  +  ++     L+DI+ +F  +T + YSQ
Sbjct: 79  SHFISRIEVYSTLPIANIGRVSTIDLYASKGILGPYETSQLELEDIEKSFDHITSIKYSQ 138

Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHL--------NG 185
              L  + +G+    + +G   GGT+W IT + E ++Y   +N  K+  L        NG
Sbjct: 139 LVDLRARYDGLSFVAYSSGVNPGGTIWNITSNSEKILYTPQWNHTKDTILPGSGLIDTNG 198

Query: 186 TVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELL 245
             L + ++P+ +IT+        P R++   F+D + + L++  ++++PVD  G++L+LL
Sbjct: 199 KPLSTVMKPSAIITNFEKFGSITPYRKRSHQFRDFLKERLKSHHSIMIPVDLGGKLLDLL 258

Query: 246 LILEDYWAEHSL-----NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN--AF 298
           + + D++ E+S+     N PI+ + Y     + Y +S LEW+  SI +++ + RDN   F
Sbjct: 259 VQINDFFYENSMEKRFHNIPIFIIAYSRGRILTYARSMLEWLSASILQTW-SRRDNLSPF 317

Query: 299 LLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQ-- 356
             K+   +I+  +L +   G K+   S   +      ++  +  +D K  +L T  G   
Sbjct: 318 DFKNKVEVISPDQL-SKHKGQKICFVSDVDI---LIDEVISKICTDDKMTILLTNTGPSE 373

Query: 357 ---FGTL-----------ARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEE-QTRLKKE 401
                +L            R++  +      KV    +  L G++L +Y E+ QTR ++ 
Sbjct: 374 EPVLNSLNKYWLKSNSNDGRIVHCNYNMTVKKVN---KRSLKGKDLESYTEKIQTRREQR 430

Query: 402 EALKASLVKE----------------EESKASLGP-DNNLSGDPMVIDANNANASADVVE 444
           ++L+  L KE                +E  +SLG  +  + G+    D +  +   +++ 
Sbjct: 431 KSLELQLRKEAKMNNKSLNLVVGSASKEGSSSLGATEGRIRGEEEEEDDDEDDDEDNLIN 490

Query: 445 PHGG------RYRDILIDGFVPP-STSVAPMFPFYENNSEWDDFGEVINPDDYIIKDE 495
             GG        +DI ID  V   + S   MFPF  +  + DD+G + N D  I K+E
Sbjct: 491 MLGGGTKLSATKKDIPIDIIVQSDAASKHSMFPFTNSRIKKDDYGTISNFDMLIPKEE 548


>gi|50308971|ref|XP_454491.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|49643626|emb|CAG99578.1| KLLA0E12013p [Kluyveromyces lactis]
          Length = 812

 Score =  133 bits (335), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 131/531 (24%), Positives = 237/531 (44%), Gaps = 62/531 (11%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPL-SKVASTIDAVLLSHPDTLHLGALPYAMKQL-- 78
           +V  +    L+D GWN        +   ++  S +D VL+S P    LG+     KQ   
Sbjct: 19  IVRFNNVIVLLDPGWNGEGSYEECEEFWTQYISEVDIVLISQPTIECLGSYAMMFKQFLP 78

Query: 79  --GLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD--LFTLDDIDSAFQSVTRLTYSQN 134
                  V+ T PV  LG +   D   S   +  F   +  L+DI+S+F  +  + YSQ 
Sbjct: 79  HFRSRIQVYGTLPVSNLGRVNSVDLLTSVGILGPFSNAVMDLEDIESSFDLIETVKYSQT 138

Query: 135 YHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN--------GT 186
             L  K +G+ +  H +G+  GGT+W I    E ++YA  +N  ++  LN        G 
Sbjct: 139 VDLKNKFDGLSLEAHNSGYAPGGTIWTIITSSEKILYAPRWNHTRDTILNSADLLDNTGN 198

Query: 187 VLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLL 246
              S + P  +IT+       +P R++ E F D + + ++   ++L+PV+  G++LE+L+
Sbjct: 199 PTSSMMHPTSVITNLSIIGSAEPQRKRVEHFTDTMKRAIQMNNSLLVPVEVGGKLLEVLV 258

Query: 247 ILEDYWAEH---SLNY--PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLK 301
           ++ ++  E+    L Y  P++ ++Y    ++ Y KS LEW+   + K++E SRDN     
Sbjct: 259 LVNNFLYENMRGGLKYDIPVFLISYSRGRSLTYAKSMLEWLSSQVIKTWE-SRDNRSPFD 317

Query: 302 HVTLL--INKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
            V+ L  I   EL     G K+ L S   ++   S  I    + D K  ++ TER     
Sbjct: 318 VVSRLRIITPEEL-GGYTGQKICLVS--EVDDILSQTINKLCSKD-KVTIILTERHPNTP 373

Query: 360 LARMLQA-----------------DPPPKAVKVTMSRRVPLVGEELIAYEEEQTR--LKK 400
               L+                  D  P ++  +MS R+ +    L   + E+ R  +K 
Sbjct: 374 AQHPLRKLNDKWQQAIKNGSRSALDGNPISISDSMSLRI-MKRTILNKKDAEKVREMIKT 432

Query: 401 EEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRD-------- 452
              ++  +++E  +K     ++      ++ D ++ ++  + V+    R ++        
Sbjct: 433 RNEVREKIIEEYTAKT----NDKAQTKTILFDVDDESSDEEGVDSMDARGKNGSGNVKVE 488

Query: 453 ILIDGFVPPSTSVAP---MFPFYENNSEWDDFGEVINPDDYIIKDEDMDQA 500
           I +D     S S      MFPF+    + DD+G+V+N   ++ ++E  DQA
Sbjct: 489 IPVDITSNDSVSTNEKHLMFPFHPAKLKSDDYGDVVNLKRFLPQEESYDQA 539


>gi|403158620|ref|XP_003319317.2| hypothetical protein PGTG_01491 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375166386|gb|EFP74898.2| hypothetical protein PGTG_01491 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 778

 Score =  133 bits (334), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 94/332 (28%), Positives = 167/332 (50%), Gaps = 22/332 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLS---APVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           S++DA+L++H    H  +L Y M+          VF T P   +    M D        +
Sbjct: 82  SSVDAILITHFHLDHAASLTYIMENTNFKEGHGKVFMTHPTKAVYRFLMQDFVRMSTIGT 141

Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
           + +LF  + + +++ S+  + Y Q   L      +    + AGH+LG  ++ I   G  V
Sbjct: 142 DSELFNEEQMIASYDSINAIDYHQEISLGC----LRFTSYPAGHVLGAAMFLIEISGIRV 197

Query: 170 IYAVDYNRRKEKHLNGTVLESF-VRPAVLITDAYNALHNQPPR-QQREMFQDAISKTLRA 227
           +Y  DY+  +++HL    + ++  +P V+I ++   + +  PR ++ E F   +   L+ 
Sbjct: 198 LYTGDYSTEEDRHLIPARVPNWNEKPDVMICESTYGVQSLEPRFEKEERFTTLVQSILKR 257

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEH-SLN-YPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG VL+PV + GR  ELLLIL++YWA H  LN  PIY+++ +++  +   ++F+  M D 
Sbjct: 258 GGRVLMPVFALGRAQELLLILDEYWANHPELNQIPIYYISNLAAKCMKVYQTFIHGMNDQ 317

Query: 286 ITKSFETS-------RDNAFLLK--HVTLLINKSELDNAPDGPKLVLASMASLEAGFSHD 336
           I + F          R+   + K  +VT L    + D+   GP +V+AS   +++G S +
Sbjct: 318 IKRKFNQGINPWTFYREGKGVFKKGYVTNLKAIDKFDDR--GPCVVMASPGFMQSGVSRE 375

Query: 337 IFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           +   WA D +N +L T     GT+AR +  +P
Sbjct: 376 LLERWAPDRRNALLVTGYSIEGTMAREMLKEP 407


>gi|425780830|gb|EKV18826.1| Endoribonuclease ysh1 [Penicillium digitatum PHI26]
 gi|425783067|gb|EKV20936.1| Endoribonuclease ysh1 [Penicillium digitatum Pd1]
          Length = 862

 Score =  133 bits (334), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 101/356 (28%), Positives = 166/356 (46%), Gaps = 23/356 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H  ALPY + +      VF T     +    + D        S  D
Sbjct: 75  STVDVLLISHFHVDHSSALPYVLSKTNFKGRVFMTPATRAIYKWLIQDNVRVSNTSSSSD 134

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
             T    +    S   L  + +++ +    GI + P+ AGH+LG  ++KI   G   ++ 
Sbjct: 135 QRTTLYTERDHLSTLPLIETIDFYTTHTINGIRITPYPAGHVLGAAMFKIDIAGLVTLFT 194

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNV 231
            DY+R +++HL    + S  +  VLIT++   + + PPR +RE     +I+  L  GG V
Sbjct: 195 GDYSREEDRHLIPAAVPSGTKIDVLITESTFGISSNPPRLEREAALMKSITSILNRGGRV 254

Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           L+PV + GR  ELLLIL++YW  H     +PIY++  ++   +   ++++  M D+I + 
Sbjct: 255 LMPVFALGRAQELLLILDEYWETHPELQKFPIYYIGNMARRCMVVYQTYIGAMNDNIKRL 314

Query: 290 FETSRDNA------------FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
           F      A            +  + V  L +    D+   G  ++LAS   L+ G S ++
Sbjct: 315 FRQRMAEAEASGNKSVSVGPWDFRFVRSLRSLERFDDV--GGCVMLASPGMLQTGTSREL 372

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADP---PPKAVKVTMSR---RVPLVGEE 387
              WA   +N V+ T     GT+A+ L  +P   P    KV+      RVP V +E
Sbjct: 373 LERWAPSDRNGVVMTGYSVEGTMAKGLLNEPDQIPAVMSKVSTGHGRGRVPGVNDE 428


>gi|269860830|ref|XP_002650133.1| cleavage and polyadenylation specificity factor, 73 kDa subunit
           [Enterocytozoon bieneusi H348]
 gi|220066453|gb|EED43934.1| cleavage and polyadenylation specificity factor, 73 kDa subunit
           [Enterocytozoon bieneusi H348]
          Length = 657

 Score =  133 bits (334), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 94/344 (27%), Positives = 164/344 (47%), Gaps = 14/344 (4%)

Query: 30  FLIDCGWNDHFDPSLLQPLSKVAS--TIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFST 87
           FL+DCG +  +      P   + +   IDAV ++H    H  ALP+  ++      V+ T
Sbjct: 35  FLMDCGVHPAYTGVSCLPFLDLINLEEIDAVFITHFHLDHAAALPFLTEKTAFKGKVYMT 94

Query: 88  EPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVA 147
            P   +    + D        S+ D +T  D+++ +  +  + Y Q   + G    I   
Sbjct: 95  HPTKAILKWLLNDYIRIINSASDEDFYTEKDLENCYNKIIPIDYHQVIDVVG----IKFT 150

Query: 148 PHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHN 207
              AGH+LG  ++ +      ++Y  D++R  ++HL      +  +  +LIT++      
Sbjct: 151 ALNAGHVLGAAMFLLEIGQTKLLYTGDFSREDDRHLKSAETPN-CKLDILITESTYGTQC 209

Query: 208 QPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE--HSLNYPIYFL 264
             PR +RE  F   +S  +  GG  LLPV + GR  ELLLIL++YW E  H    PI++ 
Sbjct: 210 HLPRIERENRFTKVVSDVVERGGKCLLPVFALGRAQELLLILDEYWEENPHLKKIPIFYA 269

Query: 265 TYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLA 324
           + ++   +   ++++  M + + K    +R N F  K+V  + +   + +   GP +++A
Sbjct: 270 SALAKKCMGIYQTYVNMMNERMQK-LNLTR-NPFDFKNVENIKDAKTVRDG--GPCVIMA 325

Query: 325 SMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           S   L++G S DIF  W SD KN V+       GTLA+ +  +P
Sbjct: 326 SPGMLQSGVSRDIFERWCSDSKNGVVIAGYCVEGTLAKEVLKEP 369


>gi|294658126|ref|XP_460457.2| DEHA2F02134p [Debaryomyces hansenii CBS767]
 gi|218511903|sp|Q6BMW3.2|YSH1_DEBHA RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
           3'-end-processing protein YSH1
 gi|202952895|emb|CAG88764.2| DEHA2F02134p [Debaryomyces hansenii CBS767]
          Length = 815

 Score =  133 bits (334), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 99/341 (29%), Positives = 169/341 (49%), Gaps = 34/341 (9%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLS----- 104
           S +D +L+SH    H  +LPY M+    +  VF   +T+ +YR  LL+ + +  S     
Sbjct: 64  SKVDILLVSHFHLDHAASLPYVMQHTNFNGRVFMTHATKAIYRW-LLSDFVKVTSIGGGS 122

Query: 105 ---------RRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLL 155
                           +L+T DD+  +F  +  +    +YH + + +GI    + AGH+L
Sbjct: 123 DARLNNSDPNANTGSSNLYTDDDLMRSFDRIETI----DYHSTIELDGIRFTAYHAGHVL 178

Query: 156 GGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE 215
           G  ++ I   G  V++  DY+  +++HL    +   ++P +LIT++        PR ++E
Sbjct: 179 GACMYFIEIGGLKVLFTGDYSSEEDRHLQVAEVPP-IKPDILITESTFGTATHEPRLEKE 237

Query: 216 M-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA--EHSLNYPIYFLTYVSSSTI 272
               + I  TL  GG +L+PV + GR  ELLLILE+YW+  +   N  IY+ + ++   +
Sbjct: 238 TRMTNIIHSTLLKGGRILMPVFALGRAQELLLILEEYWSLNDDLQNINIYYASSLARKCM 297

Query: 273 DYVKSFLEWMGDSI----TKSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMA 327
              +++   M DSI    + +  + + N F  K +  + N   LD   D GP +V+AS  
Sbjct: 298 AVYQTYTNIMNDSIRLTTSATNSSKKQNPFQFKFIKSIKN---LDKFQDFGPCVVVASPG 354

Query: 328 SLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
            L+ G S ++   WA D KN V+ T     GT+A+ L  +P
Sbjct: 355 MLQNGVSRELLERWAPDPKNAVIMTGYSVEGTMAKDLLTEP 395


>gi|226288011|gb|EEH43524.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
          Length = 999

 Score =  132 bits (333), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 114/456 (25%), Positives = 183/456 (40%), Gaps = 110/456 (24%)

Query: 8   TPLSGV--FNENPLSYLVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPD 64
           TPL G    +   +  ++ +DG    L+D GW+  FD S L  L +   T+  +LL+H  
Sbjct: 5   TPLLGAQSSSSRAVQSILELDGGVKILVDVGWDHSFDTSALAELERQIPTLSLILLTHAT 64

Query: 65  TLHLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLS------------------ 104
             H+GA  +  K   L    PV++T PV   G   + D Y S                  
Sbjct: 65  PSHIGAFAHCCKTFPLFTQIPVYATSPVIAFGRSLLQDLYASAPLAATFWPPATAGASSP 124

Query: 105 ------RRQVSEFDLFT--------------LDDIDSAFQSVTRLTYSQNYHLSGKG--- 141
                 R  +S     T               ++I   F  +  L YSQ +         
Sbjct: 125 TSAAASRTAISPESADTDQNERPRILLPPPSTEEIARYFSLIQPLKYSQPHQPLPSPFSP 184

Query: 142 --EGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------V 187
              G+ +  + AGH +GGT+W I    E +IYAVD+N+ +E  + G             V
Sbjct: 185 PLNGLTLTAYNAGHTVGGTIWHIQHGMESIIYAVDWNQARENVIAGAAWFGGSGGSGTEV 244

Query: 188 LESFVRPAVLI--TDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLEL 244
           +E   +P  L+  T   + L     R++R ++  D +      GG VL+P+D++ RVLEL
Sbjct: 245 VEQLRKPTALVCSTRGGDKLALSGGRKRRDDLLLDMLRSCFSKGGTVLIPMDTSARVLEL 304

Query: 245 LLILEDYWAEHS---------LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSR- 294
             +LE  W E +             +Y     +  T+   +S LEWM + I + FE    
Sbjct: 305 AYVLEHAWRESAETADGEDPLKGAGLYLAGRKAHGTMRLARSMLEWMDEGIVREFEAGHG 364

Query: 295 -----------------------------DNA------FLLKHVTLLINKSELDN--APD 317
                                        DNA      F  +H+ ++  K++LD     +
Sbjct: 365 RDPVTGGGKGRSDGPSQRNAPASVPDKKSDNASKGLGPFTFRHLKIVERKTKLDKILGSN 424

Query: 318 GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
            P+++L    SLE G+S  +  + A+  +NL++ TE
Sbjct: 425 APQVILTPDTSLEWGYSKHVLQKIAAGSENLIILTE 460


>gi|150865856|ref|XP_001385241.2| hypothetical protein PICST_89936 [Scheffersomyces stipitis CBS
           6054]
 gi|149387112|gb|ABN67212.2| predicted protein [Scheffersomyces stipitis CBS 6054]
          Length = 793

 Score =  132 bits (333), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 104/349 (29%), Positives = 181/349 (51%), Gaps = 29/349 (8%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLS----- 104
           S +D +L+SH    H  +LPY M+       VF   +T+ +YR  LL  + +  S     
Sbjct: 64  SKVDILLISHFHLDHAASLPYVMQHTTFKGRVFMTHATKAIYRW-LLQDFVRVTSIGAGS 122

Query: 105 RRQVSE---FDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWK 161
           R + S+    +L+T DDI S+F  +  +    +YH + + +GI    + AGH+LG  ++ 
Sbjct: 123 RAEGSDETSTNLYTDDDIISSFDRIETI----DYHSTMEIDGIRFTAYHAGHVLGACMYF 178

Query: 162 ITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQ--QREMFQD 219
           +   G  V++  DY+R + +HL+   +    RP +LIT++        P+   ++ + Q+
Sbjct: 179 VEIGGLKVLFTGDYSREENRHLHAAEVPP-TRPDILITESTFGTGTLEPKADLEKRLVQN 237

Query: 220 AISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKS 277
            I  TL  GG VL+PV S G   ELLLIL++YW ++    N  ++F + ++   +   ++
Sbjct: 238 -IHATLTKGGRVLMPVFSLGNAQELLLILDEYWEKNEDLQNISVFFASKLARKCMAVYQT 296

Query: 278 FLEWMGDSITKSFETSRDNA-FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHD 336
           +   M D+I  S    + ++ F  K++  + +  +  +   GP +V+AS   L+AG S  
Sbjct: 297 YTSIMNDNIRLSSRIGQKSSPFDFKYIKSIKDLGKFSDM--GPSVVVASPGMLQAGVSRQ 354

Query: 337 IFVEWASDVKNLVLFTERGQFGTLARMLQADPP--PKAVK--VTMSRRV 381
           +  +WA D KNLV+ T     GT+A+ L  +P     AV   +T+ RR+
Sbjct: 355 LLEKWAPDPKNLVVMTGYSVEGTMAKDLLNEPHTIKSAVNPDITIPRRI 403


>gi|238882385|gb|EEQ46023.1| hypothetical protein CAWG_04366 [Candida albicans WO-1]
          Length = 783

 Score =  132 bits (333), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 97/331 (29%), Positives = 169/331 (51%), Gaps = 23/331 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVS 109
           S +D +L+SH    H  +LPY M+Q      VF   +T+ +YR  L+  + +  S     
Sbjct: 63  SKVDILLISHFHVDHSASLPYVMQQSNFRGKVFMTHATKAIYRW-LMQDFVRVTSIGNSR 121

Query: 110 EFD--------LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWK 161
             D        L+T DDI  +F  +  +    +YH + + +GI    + AGH+LG  ++ 
Sbjct: 122 SEDGGGGEGSNLYTDDDIMKSFDRIETI----DYHSTMEIDGIRFTAYHAGHVLGACMYF 177

Query: 162 ITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDA 220
           I   G  V++  DY+R + +HL+   +   ++P +LI+++        PR + E      
Sbjct: 178 IEIGGLKVLFTGDYSREENRHLHAAEVPP-LKPDILISESTFGTGTLEPRIELERKLTTH 236

Query: 221 ISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSF 278
           I  T+  GG VLLPV + G   ELLLIL++YW+++    N  +++ + ++   +   +++
Sbjct: 237 IHATIAKGGRVLLPVFALGNAQELLLILDEYWSQNEDLQNVNVFYASNLAKKCMAVYETY 296

Query: 279 LEWMGDSITKSFETS-RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
              M D I  S  +S + N F  K++  + + S+  +   GP +V+A+   L+AG S  +
Sbjct: 297 TGIMNDKIRLSSASSEKSNPFDFKYIKSIKDLSKFQDM--GPSVVVATPGMLQAGVSRQL 354

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADP 368
             +WA D KNLV+ T     GT+A+ L  +P
Sbjct: 355 LEKWAPDGKNLVILTGYSVEGTMAKELLKEP 385


>gi|68489322|ref|XP_711502.1| hypothetical protein CaO19.12941 [Candida albicans SC5314]
 gi|68489371|ref|XP_711478.1| hypothetical protein CaO19.5486 [Candida albicans SC5314]
 gi|74584420|sp|Q59P50.1|YSH1_CANAL RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
           3'-end-processing protein YSH1
 gi|46432783|gb|EAK92250.1| hypothetical protein CaO19.5486 [Candida albicans SC5314]
 gi|46432809|gb|EAK92275.1| hypothetical protein CaO19.12941 [Candida albicans SC5314]
          Length = 870

 Score =  132 bits (333), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 97/331 (29%), Positives = 169/331 (51%), Gaps = 23/331 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVS 109
           S +D +L+SH    H  +LPY M+Q      VF   +T+ +YR  L+  + +  S     
Sbjct: 150 SKVDILLISHFHVDHSASLPYVMQQSNFRGKVFMTHATKAIYRW-LMQDFVRVTSIGNSR 208

Query: 110 EFD--------LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWK 161
             D        L+T DDI  +F  +  +    +YH + + +GI    + AGH+LG  ++ 
Sbjct: 209 SEDGGGGEGSNLYTDDDIMKSFDRIETI----DYHSTMEIDGIRFTAYHAGHVLGACMYF 264

Query: 162 ITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDA 220
           I   G  V++  DY+R + +HL+   +   ++P +LI+++        PR + E      
Sbjct: 265 IEIGGLKVLFTGDYSREENRHLHAAEVPP-LKPDILISESTFGTGTLEPRIELERKLTTH 323

Query: 221 ISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSF 278
           I  T+  GG VLLPV + G   ELLLIL++YW+++    N  +++ + ++   +   +++
Sbjct: 324 IHATIAKGGRVLLPVFALGNAQELLLILDEYWSQNEDLQNVNVFYASNLAKKCMAVYETY 383

Query: 279 LEWMGDSITKSFETS-RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
              M D I  S  +S + N F  K++  + + S+  +   GP +V+A+   L+AG S  +
Sbjct: 384 TGIMNDKIRLSSASSEKSNPFDFKYIKSIKDLSKFQDM--GPSVVVATPGMLQAGVSRQL 441

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADP 368
             +WA D KNLV+ T     GT+A+ L  +P
Sbjct: 442 LEKWAPDGKNLVILTGYSVEGTMAKELLKEP 472


>gi|30677952|ref|NP_178282.2| cleavage and polyadenylation specificity factor subunit 3-II
           [Arabidopsis thaliana]
 gi|332278175|sp|Q8GUU3.2|CPS3B_ARATH RecName: Full=Cleavage and polyadenylation specificity factor
           subunit 3-II; AltName: Full=Cleavage and polyadenylation
           specificity factor 73 kDa subunit II; Short=AtCPSF73-II;
           Short=CPSF 73 kDa subunit II; AltName: Full=Protein
           EMBRYO SAC DEVELOPMENT ARREST 26
 gi|62320470|dbj|BAD94982.1| putative cleavage and polyadenylation specifity factor [Arabidopsis
           thaliana]
 gi|330250395|gb|AEC05489.1| cleavage and polyadenylation specificity factor subunit 3-II
           [Arabidopsis thaliana]
          Length = 613

 Score =  132 bits (333), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 97/358 (27%), Positives = 167/358 (46%), Gaps = 20/358 (5%)

Query: 22  LVSIDGFNFLIDCGW-------NDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
           +V+I+G   + DCG        N + + SL+       + I  ++++H    H+GALPY 
Sbjct: 20  VVTINGKKIMFDCGMHMGCDDHNRYPNFSLISKSGDFDNAISCIIITHFHMDHVGALPYF 79

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTM--YDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYS 132
            +  G + P++ + P   L  L +  Y + +  R+  E +LFT   I +  + V  +   
Sbjct: 80  TEVCGYNGPIYMSYPTKALSPLMLEDYRRVMVDRRGEE-ELFTTTHIANCMKKVIAIDLK 138

Query: 133 QNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFV 192
           Q   +    E + +  + AGH+LG  +         ++Y  DYN   ++HL    ++  +
Sbjct: 139 QTIQVD---EDLQIRAYYAGHVLGAVMVYAKMGDAAIVYTGDYNMTTDRHLGAAKIDR-L 194

Query: 193 RPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDY 251
           +  +LI+++  A   +  +  RE  F  A+ K +  GG  L+P  + GR  EL ++L+DY
Sbjct: 195 QLDLLISESTYATTIRGSKYPREREFLQAVHKCVAGGGKALIPSFALGRAQELCMLLDDY 254

Query: 252 WAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSE 311
           W   ++  PIYF + ++     Y K  + W   ++ +   T   N F  K+V        
Sbjct: 255 WERMNIKVPIYFSSGLTIQANMYYKMLISWTSQNVKEKHNTH--NPFDFKNVKDF--DRS 310

Query: 312 LDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPP 369
           L +AP GP ++ A+   L AGFS ++F  WA    NLV        GT+   L A  P
Sbjct: 311 LIHAP-GPCVLFATPGMLCAGFSLEVFKHWAPSPLNLVALPGYSVAGTVGHKLMAGKP 367


>gi|388579831|gb|EIM20151.1| Metallo-hydrolase/oxidoreductase [Wallemia sebi CBS 633.66]
          Length = 626

 Score =  132 bits (333), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 90/325 (27%), Positives = 164/325 (50%), Gaps = 19/325 (5%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLS---APVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+DA+L++H    H  AL Y M++         V+ T P   +    M D        +
Sbjct: 81  STVDALLITHFHLDHAAALTYIMEKTNFKEGKGKVYMTSPTKAVYRFMMQDFVRISTTSA 140

Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
           E  LFT  ++ ++++S+    ++Q         G+   P+ AGH+LG  ++ I   G  V
Sbjct: 141 EDQLFTESEMIASWRSIQVSDFNQEI---VPASGVRFTPYPAGHVLGAAMFLIEIAGLKV 197

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAY--NALHNQPPRQQREMFQDAISKTLRA 227
           +Y  DY+R +++HL+   +       +++   Y    L N+P +++R  F + +   +R 
Sbjct: 198 LYTGDYSREEDRHLHAAEIPKEQTDVLIVESTYGVQTLENRPEKEKR--FTELVHNIIRR 255

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAE----HSLNYPIYFLTYVSSSTIDYVKSFLEWMG 283
           GG VL+P  + GR  ELLLIL++YW      HS+  PIY+ + ++   +   ++++  M 
Sbjct: 256 GGRVLMPSFALGRAQELLLILDEYWQRNPDLHSI--PIYYASNLARKCMAVYQAYIRTMN 313

Query: 284 DSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWAS 343
            +I + F+ S +N F  K ++ L +  +  +   GP ++LAS   L++G S ++   WA 
Sbjct: 314 KNINRRFD-SGENPFQFKFISELGDLRKWQD--KGPCVMLASPGMLQSGTSRELLERWAP 370

Query: 344 DVKNLVLFTERGQFGTLARMLQADP 368
           D KN ++       GT+A  +  +P
Sbjct: 371 DPKNGLIICGYSVEGTMAHSIVNEP 395


>gi|452985743|gb|EME85499.1| hypothetical protein MYCFIDRAFT_130659 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 844

 Score =  132 bits (332), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 97/359 (27%), Positives = 170/359 (47%), Gaps = 31/359 (8%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGL---LTMYDQYLSRR 106
           ST+D +L++H    H  +LPY + +   +  V+ T P   +Y+      + +++ +    
Sbjct: 76  STVDLLLITHFHQDHSASLPYVLSKTNFAGRVYMTHPTKAIYKWTTQDAVRVHNTHTPAS 135

Query: 107 QVSEFD-----LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWK 161
             S  D     L+T  DI S    +  +++    H +    GI   P+ AGH+LG  ++ 
Sbjct: 136 SSSGTDGYVSQLYTEQDILSTMPMIQTISF----HTTHSHNGIRFTPYPAGHVLGACMYL 191

Query: 162 ITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDA 220
           I   G ++++  DY+R  ++HL    +   V+   LIT++   +  + PRQ+RE     +
Sbjct: 192 IEIAGLNILFTGDYSRETDRHLIPATVPRNVKVDCLITESTFGISTRTPRQERENALIKS 251

Query: 221 ISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSF 278
           I+  L  GG VL+P  + G   ELLLILEDYW  H     +PIY+ + ++   +   +++
Sbjct: 252 ITTILNRGGRVLMPTTAVGNTQELLLILEDYWQRHEEYRKFPIYYASGLARKVMVVYQTY 311

Query: 279 LEWMGDSITKSFETS----------RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMAS 328
           ++ M D+I   F+ S              +  + V  L      ++   G  +VLAS   
Sbjct: 312 VDDMNDTIKAKFQASAVGQSVGEGGTAGPWDFQFVRALKGIDRFEDV--GGSVVLASPGM 369

Query: 329 LEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPP-KAVKVTMSRRVPLVGE 386
           L+ G S  +   WA + KN V+ T     GT+A+ +  +P    AV    S  +P +G+
Sbjct: 370 LQNGPSRALLERWAPEAKNGVVITGYSVEGTMAKTILMEPDEIPAVTQNRSANIPSMGK 428


>gi|358365452|dbj|GAA82074.1| cleavage and polyadenylation specifity factor, 73 kDa subunit
           [Aspergillus kawachii IFO 4308]
          Length = 882

 Score =  132 bits (332), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 101/362 (27%), Positives = 172/362 (47%), Gaps = 20/362 (5%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H  ALPY + +      VF T     +    + D        S  D
Sbjct: 75  STVDILLISHFHVDHSSALPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVSSTASSSD 134

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
             T    +    S   L  + +++ +     I + P  AGH+LG  ++ I+  G ++++ 
Sbjct: 135 QRTTLYTEQDHLSTLPLIETIDFNTTHTINSIRITPFPAGHVLGAAMFLISIAGLNILFT 194

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNV 231
            DY+R +++HL    +   V+  VLIT++   + + PPR +RE     AI+  L  GG V
Sbjct: 195 GDYSREEDRHLIPAEVPKGVKIDVLITESTFGISSNPPRLEREAALMKAITGVLNRGGRV 254

Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           L+PV + GR  ELLLIL++YW  H      PIY++   +   +   ++++  M D+I + 
Sbjct: 255 LMPVFALGRAQELLLILDEYWETHPELQKIPIYYIGNTARRCMVVYQTYIGAMNDNIKRL 314

Query: 290 F-------ETSRDNA-----FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
           F       E S D +     +  + V  L +    D+   G  ++LAS   L+ G S ++
Sbjct: 315 FRQRMAEAEASGDKSVSAGPWDFRFVRSLRSLERFDDV--GGCVMLASPGMLQTGTSREL 372

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVP-LVGEELIAYEEEQT 396
              WA + +N V+ T     GT+A+ +  +  P+ +   MSR    LV   + A  EE+ 
Sbjct: 373 LERWAPNERNGVVMTGYSVEGTMAKQILNE--PEQIPAVMSRATTGLVRRGMAAGNEEEQ 430

Query: 397 RL 398
           ++
Sbjct: 431 KV 432


>gi|145230249|ref|XP_001389433.1| endoribonuclease ysh1 [Aspergillus niger CBS 513.88]
 gi|134055550|emb|CAK37196.1| unnamed protein product [Aspergillus niger]
          Length = 874

 Score =  132 bits (332), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 101/362 (27%), Positives = 172/362 (47%), Gaps = 20/362 (5%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H  ALPY + +      VF T     +    + D        S  D
Sbjct: 75  STVDILLISHFHVDHSSALPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVSSTASSSD 134

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
             T    +    S   L  + +++ +     I + P  AGH+LG  ++ I+  G ++++ 
Sbjct: 135 QRTTLYTEQDHLSTLPLIETIDFNTTHTINSIRITPFPAGHVLGAAMFLISIAGLNILFT 194

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNV 231
            DY+R +++HL    +   V+  VLIT++   + + PPR +RE     AI+  L  GG V
Sbjct: 195 GDYSREEDRHLIPAEVPKGVKIDVLITESTFGISSNPPRLEREAALMKAITGVLNRGGRV 254

Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           L+PV + GR  ELLLIL++YW  H      PIY++   +   +   ++++  M D+I + 
Sbjct: 255 LMPVFALGRAQELLLILDEYWETHPELQKIPIYYIGNTARRCMVVYQTYIGAMNDNIKRL 314

Query: 290 F-------ETSRDNA-----FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
           F       E S D +     +  + V  L +    D+   G  ++LAS   L+ G S ++
Sbjct: 315 FRQRMAEAEASGDKSVSAGPWDFRFVRSLRSLERFDDV--GGCVMLASPGMLQTGTSREL 372

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVP-LVGEELIAYEEEQT 396
              WA + +N V+ T     GT+A+ +  +  P+ +   MSR    LV   + A  EE+ 
Sbjct: 373 LERWAPNERNGVVMTGYSVEGTMAKQILNE--PEQIPAVMSRATTGLVRRGMAAGNEEEQ 430

Query: 397 RL 398
           ++
Sbjct: 431 KV 432


>gi|4220489|gb|AAD12712.1| putative cleavage and polyadenylation specifity factor [Arabidopsis
           thaliana]
          Length = 837

 Score =  132 bits (331), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 97/358 (27%), Positives = 167/358 (46%), Gaps = 20/358 (5%)

Query: 22  LVSIDGFNFLIDCGW-------NDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
           +V+I+G   + DCG        N + + SL+       + I  ++++H    H+GALPY 
Sbjct: 20  VVTINGKKIMFDCGMHMGCDDHNRYPNFSLISKSGDFDNAISCIIITHFHMDHVGALPYF 79

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTM--YDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYS 132
            +  G + P++ + P   L  L +  Y + +  R+  E +LFT   I +  + V  +   
Sbjct: 80  TEVCGYNGPIYMSYPTKALSPLMLEDYRRVMVDRRGEE-ELFTTTHIANCMKKVIAIDLK 138

Query: 133 QNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFV 192
           Q   +    E + +  + AGH+LG  +         ++Y  DYN   ++HL    ++  +
Sbjct: 139 QTIQVD---EDLQIRAYYAGHVLGAVMVYAKMGDAAIVYTGDYNMTTDRHLGAAKIDR-L 194

Query: 193 RPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDY 251
           +  +LI+++  A   +  +  RE  F  A+ K +  GG  L+P  + GR  EL ++L+DY
Sbjct: 195 QLDLLISESTYATTIRGSKYPREREFLQAVHKCVAGGGKALIPSFALGRAQELCMLLDDY 254

Query: 252 WAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSE 311
           W   ++  PIYF + ++     Y K  + W   ++ +   T   N F  K+V        
Sbjct: 255 WERMNIKVPIYFSSGLTIQANMYYKMLISWTSQNVKEKHNTH--NPFDFKNVKDF--DRS 310

Query: 312 LDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPP 369
           L +AP GP ++ A+   L AGFS ++F  WA    NLV        GT+   L A  P
Sbjct: 311 LIHAP-GPCVLFATPGMLCAGFSLEVFKHWAPSPLNLVALPGYSVAGTVGHKLMAGKP 367


>gi|255957115|ref|XP_002569310.1| Pc21g23430 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211591021|emb|CAP97240.1| Pc21g23430 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 862

 Score =  132 bits (331), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 100/356 (28%), Positives = 166/356 (46%), Gaps = 23/356 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H  ALPY + +      VF T     +    + D        S  D
Sbjct: 75  STVDVLLISHFHVDHSSALPYVLSKTNFKGRVFMTPATRAIYKWLIQDNVRVSNTSSSSD 134

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
             T    +    S   +  + +++ +    GI + P+ AGH+LG  ++KI   G   ++ 
Sbjct: 135 QRTTLYTERDHLSTLPMIETIDFYTTHTINGIRITPYPAGHVLGAAMFKIDIAGLVTLFT 194

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNV 231
            DY+R +++HL    + S  +  VLIT++   + + PPR +RE     +I+  L  GG V
Sbjct: 195 GDYSREEDRHLIPAAVPSGTKIDVLITESTFGISSNPPRLEREAALMKSITGILNRGGRV 254

Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           L+PV + GR  ELLLIL++YW  H     +PIY++  ++   +   ++++  M D+I + 
Sbjct: 255 LMPVFALGRAQELLLILDEYWETHPELQKFPIYYIGNMARRCMVVYQTYIGAMNDNIKRL 314

Query: 290 FETSRDNA------------FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
           F      A            +  + V  L +    D+   G  ++LAS   L+ G S ++
Sbjct: 315 FRQRMAEAEASGNKSVSVGPWDFRFVRSLRSLERFDDV--GGCVMLASPGMLQTGTSREL 372

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADP---PPKAVKVTMSR---RVPLVGEE 387
              WA   +N V+ T     GT+A+ L  +P   P    KV+      RVP V +E
Sbjct: 373 LERWAPSDRNGVVMTGYSVEGTMAKGLLNEPDQIPAVMSKVSTGHGRGRVPGVNDE 428


>gi|46107872|ref|XP_380995.1| hypothetical protein FG00819.1 [Gibberella zeae PH-1]
          Length = 864

 Score =  132 bits (331), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 106/410 (25%), Positives = 184/410 (44%), Gaps = 59/410 (14%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHP--DTL---------- 66
           +++   G   ++D G +  +D     P       ST+D +L+SHP  DT           
Sbjct: 41  HIIQYKGKTVMLDAGQHPAYDGLAALPFYDDFDLSTVDVLLISHPVQDTTALYCHGQYCA 100

Query: 67  -------------------HLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYL---S 104
                              H  +LPY + +      VF T P   +    + D      +
Sbjct: 101 CVMSISMIMLLIGHSFHIDHAASLPYVLAKTNFRGRVFMTHPTKAIYKWLIQDSVRVGNT 160

Query: 105 RRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
               +   ++T  D  + F  +  + Y   + +S     I + P+ AGH+LG  ++ I  
Sbjct: 161 SSNPTTQPVYTEQDHLNTFPQIEAIDYHTTHTISS----IRITPYPAGHVLGAAMFLIEI 216

Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISK 223
            G ++ +  DY+R +++HL    +   V+  VLIT++   + +  PR +RE     +I+ 
Sbjct: 217 AGLNIFFTGDYSREQDRHLVSAEVPKGVKIDVLITESTYGIASHVPRLEREQALMKSITS 276

Query: 224 TLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEW 281
            L  GG VL+PV + GR  ELLLIL++YW +H+    YPIY+ + ++   +   ++++  
Sbjct: 277 ILNRGGRVLMPVFALGRAQELLLILDEYWGKHADFQKYPIYYASNLARKCMLIYQTYVGA 336

Query: 282 MGDSITKSF-------ETSRDNA-----FLLKHVTLLINKSELDNAPDGPKLVLASMASL 329
           M D+I + F       E S D A     +  K++  L N    D+   G  ++LAS   L
Sbjct: 337 MNDNIKRLFRERMAEAEASGDGAGKGGPWDFKYIRSLKNLDRFDDV--GGCVMLASPGML 394

Query: 330 EAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSR 379
           + G S ++   WA   KN V+ T     GT+A+ +  +  P  ++  MSR
Sbjct: 395 QNGVSRELLERWAPSEKNGVIITGYSVEGTMAKQIMQE--PDQIQAVMSR 442


>gi|255724858|ref|XP_002547358.1| hypothetical protein CTRG_01665 [Candida tropicalis MYA-3404]
 gi|240135249|gb|EER34803.1| hypothetical protein CTRG_01665 [Candida tropicalis MYA-3404]
          Length = 783

 Score =  132 bits (331), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 97/330 (29%), Positives = 168/330 (50%), Gaps = 21/330 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRL---GLLTMYDQYLSRR 106
           S +D +L+SH    H  +LPY M+Q      VF   +T+ +YR      + +     SR 
Sbjct: 63  SKVDILLISHFHVDHSASLPYIMQQSNFRGKVFMTHATKAIYRWLMQDFVRVTSIGSSRA 122

Query: 107 QVSEFD----LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKI 162
           +    D    L+T DDI  +F  +  +    +YH + + +GI    + AGH+LG  ++ I
Sbjct: 123 EAGGKDEGSNLYTDDDIMKSFDRIETI----DYHSTMEIDGIRFTAYHAGHVLGACMYFI 178

Query: 163 TKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAI 221
              G  V++  DY+R + +HL+   +   ++P +LI+++        PR + E      I
Sbjct: 179 EIGGLKVLFTGDYSREENRHLHAAEVPP-LKPDILISESTFGTGTLEPRVELERKLTTHI 237

Query: 222 SKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFL 279
             T+  GG VLLPV + G   ELLLIL++YW+++    N  +++ + ++   +   +++ 
Sbjct: 238 HATVTKGGRVLLPVFALGNAQELLLILDEYWSQNEDLQNVNVFYASNLAKKCMAVYETYT 297

Query: 280 EWMGDSITKSFET-SRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIF 338
             M D I  S  +  + N F LK +  + + S+  +   GP +V+A+   L+AG S  + 
Sbjct: 298 GIMNDKIRLSSSSGEKSNPFDLKFIKSIKDLSKFQDM--GPSVVVATPGMLQAGVSRQLL 355

Query: 339 VEWASDVKNLVLFTERGQFGTLARMLQADP 368
            +WA D KNLV+ T     GT+A+ L  +P
Sbjct: 356 EKWAPDNKNLVILTGYSVEGTMAKELLKEP 385


>gi|406866779|gb|EKD19818.1| metallo-beta-lactamase superfamily protein [Marssonina brunnea f.
           sp. 'multigermtubi' MB_m1]
          Length = 823

 Score =  132 bits (331), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 99/345 (28%), Positives = 164/345 (47%), Gaps = 26/345 (7%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H  +LPY + +      VF T P   +    + D        S+  
Sbjct: 76  STVDVLLISHFHVDHAASLPYVLAKTNFKGRVFMTHPTKAIYKWLIQDSIRVGGASSDSK 135

Query: 113 ---LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
              ++T  D  S F  +  + Y   + +S     I + P+ AGH+LG  ++ I   G  +
Sbjct: 136 GQPVYTEADHLSTFPMIEAIDYHTTHTISS----IRITPYPAGHVLGAAMFLIEIAGLKI 191

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAG 228
            +  DY+R  ++HL    +   V+  VLIT++   +    PR +RE     +I+  L  G
Sbjct: 192 FFTGDYSREDDRHLVSAEVPKGVKIDVLITESTYGIAAHVPRVEREQQLMKSITSILNRG 251

Query: 229 GNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSI 286
           G VL+PV + GR  ELLLIL++YWA H      PIY+ + ++   +   ++++  M ++I
Sbjct: 252 GRVLMPVFALGRAQELLLILDEYWALHPEFQKIPIYYASNLARKCMLVYQTYVGAMNENI 311

Query: 287 TKSF-------ETSRDNA-----FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFS 334
            + F       E S D A     +  K++  L N    D+   G  ++LAS   L+ G S
Sbjct: 312 KRLFRERMAEAEASSDTAAKGGPWDFKYIRSLKNLDRFDDV--GRCVMLASPGMLQNGVS 369

Query: 335 HDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSR 379
            ++   WA   KN V+ T     GT+A+ +  +  P  ++  MSR
Sbjct: 370 RELLERWAPSEKNGVVITGYSVEGTMAKQIMQE--PDQIQAIMSR 412


>gi|403223285|dbj|BAM41416.1| uncharacterized protein TOT_030000678 [Theileria orientalis strain
           Shintoku]
          Length = 706

 Score =  131 bits (330), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 99/347 (28%), Positives = 166/347 (47%), Gaps = 21/347 (6%)

Query: 43  SLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQY 102
           +L + L+ V +TID+ ++SH    H+GALP+  +++G S PV+ T P   L  L + D  
Sbjct: 109 ALKKALNNVTNTIDSAIISHFHIDHVGALPFLTEEIGYSGPVYMTYPTKALSPLLLRDSG 168

Query: 103 LSRRQVSEFDLFTLDDIDS----------AFQSVT---RLTYSQNYHLSGKGEGIVVAPH 149
           ++ +  S   L   D              +F SV    + +       + K EG+ V+P 
Sbjct: 169 IAAKTASVKSLLNFDKRRKVEERPDPWGYSFNSVAECMKRSIPLQLRSAEKVEGLTVSPF 228

Query: 150 VAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITD-AYNALHNQ 208
            AGH+LG  ++    DG  V+Y  D+N   +KHL    + S + P VLI +  Y     Q
Sbjct: 229 YAGHVLGAAMFLAESDGFKVLYTGDFNTVPDKHLGPAKVPS-LEPDVLICETTYATFVRQ 287

Query: 209 PPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVS 268
             +       + +  TL  GG VL+PV + GR  EL +IL +YW   SL +PIYF   +S
Sbjct: 288 SKKATEVELCNLVHDTLINGGKVLIPVFAVGRAQELAIILNNYWNNLSLLFPIYFGGGLS 347

Query: 269 SSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMAS 328
               +Y K    W  ++   +    ++N F ++++ L  ++S L++  + P ++ A+   
Sbjct: 348 EKATNYYKLHSSWTDNN---NISKLKENPFAMENL-LQFDQSFLND--NRPMVLFATPGM 401

Query: 329 LEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKV 375
           +  G S      W+S+ KNL+L       GT+   L +    +  K+
Sbjct: 402 VHTGLSLKACKIWSSNPKNLILIPGYCVQGTVGNKLISGTKGREYKI 448


>gi|121700651|ref|XP_001268590.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
           putative [Aspergillus clavatus NRRL 1]
 gi|119396733|gb|EAW07164.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
           putative [Aspergillus clavatus NRRL 1]
          Length = 878

 Score =  131 bits (330), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 97/346 (28%), Positives = 165/346 (47%), Gaps = 27/346 (7%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H  ALPY + +      VF T     +    + D        S  D
Sbjct: 74  STVDILLISHFHVDHSSALPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVSNTASSSD 133

Query: 113 ----LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
               L+T +D  S    +  + ++  + ++     I + P  AGH+LG  ++ ++  G +
Sbjct: 134 QRTTLYTENDHLSTLPLIETIDFNTTHTINS----IRITPFPAGHVLGAAMFLVSIAGLN 189

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRA 227
           +++  DY+R +++HL    +   ++  VLIT++   +   PPR +RE     AI+  L  
Sbjct: 190 ILFTGDYSREEDRHLIPAEVPKGIKIDVLITESTFGISTNPPRLEREAALMKAITGVLNR 249

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG VL+PV + GR  ELLLILE+YW  H      PIY++   +   +   ++++  M D+
Sbjct: 250 GGRVLMPVFALGRAQELLLILEEYWETHPDLQKIPIYYIGNTARRCMVVYQTYIGAMNDN 309

Query: 286 ITKSF-------ETSRDNA-----FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGF 333
           I + F       E S D +     +  + V  L +    D+   G  ++LAS   L+ G 
Sbjct: 310 IKRLFRQRMAEAEASGDKSASAGPWDFRFVRSLRSLERFDDV--GGCVMLASPGMLQTGT 367

Query: 334 SHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSR 379
           S ++   WA + +N V+ T     GT+A+ L  +  P  +   MSR
Sbjct: 368 SRELLERWAPNERNGVVMTGYSVEGTMAKQLLNE--PDQIPAVMSR 411


>gi|70996586|ref|XP_753048.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
           putative [Aspergillus fumigatus Af293]
 gi|74672067|sp|Q4WRC2.1|YSH1_ASPFU RecName: Full=Endoribonuclease ysh1; AltName: Full=mRNA
           3'-end-processing protein ysh1
 gi|66850683|gb|EAL91010.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
           putative [Aspergillus fumigatus Af293]
 gi|159131784|gb|EDP56897.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
           putative [Aspergillus fumigatus A1163]
          Length = 872

 Score =  131 bits (330), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 99/361 (27%), Positives = 170/361 (47%), Gaps = 19/361 (5%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H  ALPY + +      VF T     +    + D        S  D
Sbjct: 75  STVDILLISHFHVDHSSALPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVSNTASSSD 134

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
             T    +    S   L  + +++ +     I + P  AGH+LG  ++ I+  G ++++ 
Sbjct: 135 QRTTLYTEHDHLSTLPLIETIDFNTTHTVNSIRITPFPAGHVLGAAMFLISIAGLNILFT 194

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNV 231
            DY+R +++HL    +   ++  VLIT++   +   PPR +RE     +I+  L  GG V
Sbjct: 195 GDYSREEDRHLIPAEVPKGIKIDVLITESTFGISTNPPRLEREAALMKSITGILNRGGRV 254

Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           L+PV + GR  ELLLIL++YW  H      PIY++   +   +   ++++  M D+I + 
Sbjct: 255 LMPVFALGRAQELLLILDEYWETHPELQKIPIYYIGNTARRCMVVYQTYIGAMNDNIKRL 314

Query: 290 F-------ETSRDNA-----FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
           F       E S D +     +  K V  L +    D+   G  ++LAS   L+ G S ++
Sbjct: 315 FRQRMAEAEASGDKSASAGPWDFKFVRSLRSLERFDDV--GGCVMLASPGMLQTGTSREL 372

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTR 397
              WA + +N V+ T     GT+A+ L  +  P+ +   MSR    V    +A  +E+ +
Sbjct: 373 LERWAPNERNGVVMTGYSVEGTMAKQLLNE--PEQIPAVMSRSAGGVSRRGLAGTDEEQK 430

Query: 398 L 398
           +
Sbjct: 431 I 431


>gi|297814408|ref|XP_002875087.1| hypothetical protein ARALYDRAFT_322516 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297320925|gb|EFH51346.1| hypothetical protein ARALYDRAFT_322516 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 819

 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 97/358 (27%), Positives = 167/358 (46%), Gaps = 20/358 (5%)

Query: 22  LVSIDGFNFLIDCGW-------NDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
           +V+I+G   + DCG        N + D SL+       + I  ++++H    H+GALPY 
Sbjct: 20  VVTINGKRIMFDCGMHMGCDDHNRYPDFSLVSKSGDFDNAISCIIITHFHMDHVGALPYF 79

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTM--YDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYS 132
            +  G + P++ + P   L  L +  Y + +  R+  E +LFT   I +  + V  +   
Sbjct: 80  TEVCGYNGPIYMSYPTKALSPLMLEDYRRVMVDRR-GEDELFTTAHIANCMKKVIAIDLK 138

Query: 133 QNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFV 192
           Q   +    E + +  + AGH+LG  +         ++Y  DYN   ++HL    ++  +
Sbjct: 139 QTIQVD---EDLQIRAYYAGHVLGAVMVYAKVGDAAIVYTGDYNMTTDRHLGAAKIDR-L 194

Query: 193 RPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDY 251
           +  +LI+++  A   +  +  RE  F  A+ K +  GG  L+P  + GR  EL ++L+DY
Sbjct: 195 QLDLLISESTYATTIRGSKYPREREFLQAVHKCVAGGGKALIPSFALGRAQELCMLLDDY 254

Query: 252 WAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSE 311
           W   ++  PIYF + ++     Y K  + W   ++ +   T   N F  K+V        
Sbjct: 255 WERMNIKVPIYFSSGLTIQANMYYKMLISWTSQNVKEKHNTH--NPFDFKNVKDF--DRS 310

Query: 312 LDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPP 369
           L +AP GP ++ A+   L AGFS ++F  WA    NLV        GT+   L +  P
Sbjct: 311 LIHAP-GPCVLFATPGMLCAGFSLEVFKHWAPSPLNLVALPGYSVAGTVGHKLMSGKP 367


>gi|241951638|ref|XP_002418541.1| cleavage and polyadenylation factor specificity complex subunit,
           putative; endonuclease, putative [Candida dubliniensis
           CD36]
 gi|223641880|emb|CAX43843.1| cleavage and polyadenylation factor specificity complex subunit,
           putative [Candida dubliniensis CD36]
          Length = 787

 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 93/331 (28%), Positives = 164/331 (49%), Gaps = 22/331 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGL--------LTMYDQ 101
           S +D +L+SH    H  +LPY M+Q      VF   +T+ +YR  +        +     
Sbjct: 63  SKVDILLISHFHVDHSASLPYVMQQSNFRGKVFMTHATKAIYRWLMQDFVRVTSIGNSRS 122

Query: 102 YLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWK 161
                     +L+T DDI  +F  +  +    +YH + + +GI    + AGH+LG  ++ 
Sbjct: 123 GDGSGGGEGSNLYTDDDIMKSFDRIETI----DYHSTMEIDGIRFTAYHAGHVLGACMYF 178

Query: 162 ITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDA 220
           +   G  V++  DY+R + +HL+   +   ++P +LI ++        PR + E      
Sbjct: 179 VEIGGLKVLFTGDYSREENRHLHAAEVPP-LKPDILICESTFGTGTLEPRLELERKLTTH 237

Query: 221 ISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSF 278
           I  T+  GG VLLPV + G   ELLLIL++YW+++    N  +++ + ++   +   +++
Sbjct: 238 IHATIAKGGRVLLPVFALGNAQELLLILDEYWSQNEDLQNVNVFYASNLAKKCMAVYETY 297

Query: 279 LEWMGDSITKSFETS-RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
              M D I  S  +S + N F  K +  + + S+  +   GP +V+A+   L+AG S  +
Sbjct: 298 TGIMNDKIRLSSASSKKSNPFDFKFIKSIKDLSKFQDM--GPSVVVATPGMLQAGVSRQL 355

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADP 368
             +WA D KNLV+ T     GT+A+ L  +P
Sbjct: 356 LEKWAPDGKNLVILTGYSVEGTMAKELLKEP 386


>gi|449435478|ref|XP_004135522.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
           specificity factor subunit 3-I-like [Cucumis sativus]
          Length = 481

 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 116/437 (26%), Positives = 201/437 (45%), Gaps = 49/437 (11%)

Query: 7   VTPLSGVFNENPLSYL-VSIDGFNFLIDCG------------WNDHFDPSLLQPLSKVAS 53
           +TPL G  NE   S + +S  G   L DCG            + D  DPS          
Sbjct: 26  ITPL-GAGNEVGRSCVYMSYKGKIVLFDCGIHPAYSGMAALPYFDEIDPS---------- 74

Query: 54  TIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSE 110
           TID +L++H    H  +LPY +++      VF   +T+ +Y+L LL     ++   +VS 
Sbjct: 75  TIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLLL----DFVKVSKVSV 130

Query: 111 FD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
            D L+   DI  +   +  + + Q   ++G      +   +   +LG  ++ +   G  V
Sbjct: 131 EDMLYDEQDISRSMDKIEVIDFHQTVEVNGIR---FLWCXLIRKMLGAAMFMVDIAGVRV 187

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGG 229
           +Y  DY+R +++HL    +  F     +I   Y    +QP   + + F D +  T+  GG
Sbjct: 188 LYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHSTISQGG 247

Query: 230 NVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSIT 287
            VL+P  + GR  ELLLIL++YWA H    N PIY+ + ++   +   +++   M D I 
Sbjct: 248 RVLIPAFALGRAQELLLILDEYWANHPELHNIPIYYASPLAKRCLTVYETYTLSMNDRI- 306

Query: 288 KSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKN 347
              + ++ N F  K+++ L +     +   GP +V+AS + L++G S  +F  W S+   
Sbjct: 307 ---QNAKSNPFRFKYISPLKSIEVFKDV--GPSVVMASPSGLQSGLSRQLFEMWCSEKHV 361

Query: 348 LVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKAS 407
            + +T       L+ M+       A+ + ++R VP V  E  A + E+  +KK E +  +
Sbjct: 362 SLHWTS----DPLSDMVSDS--VVALILNINREVPKVIVESEAVKTEEENVKKAEKVIHA 415

Query: 408 LVKEEESKASLGPDNNL 424
           L+        LG +  L
Sbjct: 416 LLVSLFGDVKLGENGKL 432


>gi|119494361|ref|XP_001264076.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
           putative [Neosartorya fischeri NRRL 181]
 gi|119412238|gb|EAW22179.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
           putative [Neosartorya fischeri NRRL 181]
          Length = 878

 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 98/361 (27%), Positives = 170/361 (47%), Gaps = 19/361 (5%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H  ALPY + +      VF T     +    + D        S  D
Sbjct: 75  STVDILLISHFHVDHSSALPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVSNTASSSD 134

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
             T    +    S   L  + +++ +     I + P  AGH+LG  ++ ++  G ++++ 
Sbjct: 135 QRTTLYTEHDHLSTLPLIETIDFNTTHTINSIRITPFPAGHVLGAAMFLVSIAGLNILFT 194

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNV 231
            DY+R +++HL    +   ++  VLIT++   +   PPR +RE     +I+  L  GG V
Sbjct: 195 GDYSREEDRHLIPAEVPKGIKIDVLITESTFGISTNPPRLEREAALMKSITGILNRGGRV 254

Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           L+PV + GR  ELLLIL++YW  H      PIY++   +   +   ++++  M D+I + 
Sbjct: 255 LMPVFALGRAQELLLILDEYWETHPELQKIPIYYIGNTARRCMVVYQTYIGAMNDNIKRL 314

Query: 290 F-------ETSRDNA-----FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
           F       E S D +     +  K V  L +    D+   G  ++LAS   L+ G S ++
Sbjct: 315 FRQRMAEAEASGDKSASAGPWDFKFVRSLRSLERFDDL--GGCVMLASPGMLQTGTSREL 372

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTR 397
              WA + +N V+ T     GT+A+ L  +  P+ +   MSR    V    +A  +E+ +
Sbjct: 373 LERWAPNERNGVVMTGYSVEGTMAKQLLNE--PEQIPAVMSRSAGGVSRRGLAGTDEEQK 430

Query: 398 L 398
           +
Sbjct: 431 I 431


>gi|410730217|ref|XP_003671288.2| hypothetical protein NDAI_0G02680 [Naumovozyma dairenensis CBS 421]
 gi|401780106|emb|CCD26045.2| hypothetical protein NDAI_0G02680 [Naumovozyma dairenensis CBS 421]
          Length = 846

 Score =  130 bits (328), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 134/524 (25%), Positives = 234/524 (44%), Gaps = 76/524 (14%)

Query: 30  FLIDCGWNDH---FDPSLLQPLSKVASTIDAVLLSHPDTLHLGA--LPYA--MKQLGLSA 82
            LID GW      ++ S+ +  + V   +D +LLS P    LGA  L Y   +       
Sbjct: 28  ILIDPGWASSAVSYEDSV-RYWTNVIPEVDIILLSQPTGECLGAYTLLYTNFLSHFKSRI 86

Query: 83  PVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLD--DIDSAFQSVTRLTYSQNYHLSGK 140
            V+ST P+  LG ++M + Y S+  +  ++   LD  DI+ +F  ++ L YSQ   L  K
Sbjct: 87  EVYSTLPIANLGRVSMIESYASKGIIGPYNTNRLDLEDIEKSFDHISILKYSQTVDLRSK 146

Query: 141 GEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN---------GTVLESF 191
            +G+ +  + +G   GGT+W I+   E +IY   +N  ++  LN         G  L S 
Sbjct: 147 FDGLSLIAYNSGSNPGGTIWSISTYSEKLIYVHRWNHTRDSILNPASLLDQTTGKPLASL 206

Query: 192 VRPAVLIT--DAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVD-SAGRVLELLLIL 248
           ++P+ +IT  D + ++   P +++ ++F+  +  +L   G+VL+PV+  +G+ L++L+I+
Sbjct: 207 LKPSGVITTLDKFGSI--DPFKRRVKLFKGTVWNSLNNNGSVLIPVEMGSGKFLDILVII 264

Query: 249 EDYWAEHSLN-----YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN--AFLLK 301
            ++  E+  N      P+  ++Y     + Y KS LEW+  S+ K++E+   N   F L 
Sbjct: 265 HEFLFENGKNPFYKHLPVLLVSYSKGRALTYTKSMLEWLSSSLLKTWESRSSNPSPFDLG 324

Query: 302 HVTLLINKSELDNAPDGPKLVLASMAS--LEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
           +   ++   EL   P+  K+ L S     L+   +H    +     K  +L T     G 
Sbjct: 325 NRFKVVTSDELSKYPNS-KICLVSNVDILLDETVAHLCDSKSQHQNKTTILLTSNMNNGI 383

Query: 360 LARMLQADPPPKA-----VKVTMSRRV------PLVGEELIAYEEE-QTRLKKEEALKAS 407
           L  M +     K      +K   +  V      PL  EEL  Y+   + R  KE+ +  S
Sbjct: 384 LQNMKECWEEQKVKEGDLIKFNKTISVHNIQLDPLNDEELSEYKSVLEERKNKEKLIIES 443

Query: 408 LVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEP---------------------- 445
           + + +     L  D  L G   ++DA+  ++  D+                         
Sbjct: 444 IKRGKHKDKILTLD--LHGKDSILDASRKSSIIDLTNADEEEEDEEEDEDEDDALSSKAL 501

Query: 446 HGGRYR---DILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVIN 486
           +  R     DI+I   +PP +    MF FY    + DD+G VI+
Sbjct: 502 YAKRIHTPVDIIIQPNLPPKSK---MFQFYPTKLKTDDYGTVID 542


>gi|149245580|ref|XP_001527267.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146449661|gb|EDK43917.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 1067

 Score =  130 bits (327), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 129/465 (27%), Positives = 210/465 (45%), Gaps = 71/465 (15%)

Query: 16  ENPLSYLVSIDGFN----FLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGA- 70
           EN  S+  S+  F+     L D  W+   D  +++ + +   +IDA+++SH  T  +   
Sbjct: 11  ENDRSFKASLLTFDNEHRILADPSWSGS-DALVVKFMEQYLPSIDAIIISHSTTEFISGY 69

Query: 71  --LPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEF--DLFTLDDIDSAFQSV 126
             L     ++ L+ PV+ST PV +LG ++  + Y S+  +      L  LD+ID+ F   
Sbjct: 70  ILLCIYFPKIMLTIPVYSTLPVNQLGRISTVEYYRSQGVLGPVLSSLIELDEIDNWFDKF 129

Query: 127 TRLTYSQNYHLS-GKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN- 184
             + Y QN  L  GK   + + P+ +GH LGGT W I K  + VIYA  +N  ++  LN 
Sbjct: 130 KTVKYLQNITLCDGK---LTMTPYNSGHSLGGTFWLIVKRIDRVIYAPSWNHSRDSLLNN 186

Query: 185 --------GTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVD 236
                   G      +RP   +T A +   N   +++ E F   +  TL  GG  ++P  
Sbjct: 187 AGFINTQTGMPHVGLLRPTAFVTGA-DLGSNLSHKKRCEKFLQLVDATLNNGGAAIIPTS 245

Query: 237 SAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF--ETSR 294
            +GR LEL  +++ +     +  P+YF +Y  +  + Y    ++WM  S  K++  E  R
Sbjct: 246 ISGRFLELFHLVDQHLKGAPI--PVYFFSYSGTKILSYASGLMDWMSSSFNKAWNIENLR 303

Query: 295 DNA--FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAG-FSHDIFVEWASDVKNLVLF 351
           D+   F    V LL++ SEL     GPK++  S   L  G  S  +F+   +D K  V+ 
Sbjct: 304 DDQLPFNPSKVDLLLDPSELMQM-RGPKIIFCSGIDLTNGDLSSKVFLYLCNDEKTTVIL 362

Query: 352 TERGQFGTLARMLQADPPPKA-----------VKVTMSRR--------VPL--------- 383
           TE+    +L   LQ D                VK+  SR         VPL         
Sbjct: 363 TEK---PSLLLALQKDSGNSMASISKELYNNWVKLAKSRTGKATDGVAVPLETVLKLDQW 419

Query: 384 ------VGEELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDN 422
                  GE+LI +  E T  +KE+ +  + V++++ +  L  DN
Sbjct: 420 MVEEEVTGEDLINFRNEITAKRKEKLI--AKVRDQKIQNLLNTDN 462


>gi|320581695|gb|EFW95914.1| Ca2+/calmodulin-dependent protein kinase [Ogataea parapolymorpha
           DL-1]
          Length = 1184

 Score =  130 bits (327), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 141/532 (26%), Positives = 231/532 (43%), Gaps = 70/532 (13%)

Query: 22  LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL 80
           L++ DG  N L D GW+   D S L+P       I  ++LS   T +LGA  Y + +  +
Sbjct: 59  LLTFDGQLNILADPGWDGVSDISYLEPH---IPNIHLIILSQTTTEYLGAFAYLLYKYPI 115

Query: 81  SAPV--FSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLD--DIDSAFQSVTRLTYSQNYH 136
              V  ++T PV +LG L   + Y S   V       LD  D+++ F S+  + YSQ+  
Sbjct: 116 LRKVKTYATLPVSKLGRLATIELYRSAGLVGPLKGAVLDVEDVENYFNSIITVNYSQSVS 175

Query: 137 LSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN-GTV-LESFVRP 194
           L+G   GI +  + +GH LGG+ W + KD E ++YA  +N  K+  L  G + L + +R 
Sbjct: 176 LTGNLSGITITAYNSGHTLGGSFWLLNKDAEKIVYAPTWNHSKDYFLKPGRLNLPNLLRA 235

Query: 195 AVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
             LI+   +   +   + +   F + +  TL  G ++LLP    GR+LELL +L+     
Sbjct: 236 TTLIS-GSDLGSSLSHKMRISKFMELVKLTLMNGTSILLPTSVTGRLLELLPLLDQ---- 290

Query: 255 HSLNYPI----YFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKS 310
              N P+    Y L++    ++++  + LEWM   ITK++E      F    + L I+  
Sbjct: 291 ---NVPVDINFYLLSFTGKKSLEFSGNMLEWMSPDITKNWENQNQTPFESNRLKL-ISLR 346

Query: 311 ELDNAPDGPKLVLASMASLEAG-FSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPP 369
           +L +    PK++      L  G  S D F+E  S     ++ TER +  T A  +  +  
Sbjct: 347 DLASLDHRPKIIFVDGTDLNEGSLSRDCFIELCSKHNTALIMTERPEVNTTAYDVYKEWE 406

Query: 370 PKA-----------------VKVTMSRRVPLVGEELIAYE---EEQTRLKKEEALKASLV 409
            K                  + ++ +R   L G EL AY+   EE+ + +KE+ ++  L 
Sbjct: 407 SKVKNDNNLKDGALTILEKQMSLSATREEKLRGSELNAYKKSVEERRQRRKEQEVQERLN 466

Query: 410 KE-------EESKASLGPDNNLSGDPMVIDANNANA-----------------SADVVEP 445
            +       E+            G+    DA N                    SA   E 
Sbjct: 467 NDLLDTLIGEDEDDDDDDSEFSDGEDAGADAENGENGEVKTTTTSTALTQSTHSAKDEEE 526

Query: 446 H--GGRYRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKDE 495
           H    +   + +D  V  +     MFPF       DD+GEVI   D++ ++E
Sbjct: 527 HITVDQILQMPMDFDVRNAKGRNRMFPFIVKKVSVDDYGEVIRHSDFMREEE 578



 Score = 48.1 bits (113), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 40/147 (27%), Positives = 74/147 (50%), Gaps = 16/147 (10%)

Query: 562 QIEETIDVTSDLCAYKVQLSEKLMSNVLFKKL-GDYEIAWVDAEVGKTENG--MLSLLPI 618
           +  E ID+ + + +Y + +S +L + + ++ + G Y IA V  EV     G   L L+P 
Sbjct: 732 KFNEKIDLGNVVTSYDLVISNELNNTLNWQAITGGYSIAHVYGEVVPVAPGDKHLKLVPP 791

Query: 619 STP--APPHKSVLVGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKVGPAGQKGG 675
           +     P   S+ +GD+K+A+L+  L+     VEF G G L     + +RKV        
Sbjct: 792 TNTNLMPVSNSISIGDIKLAELRRKLTELNHAVEFRGDGTLVVNNQLAVRKVTDGN---- 847

Query: 676 GSGTQQIVIEGPLCEDYYKIRAYLYSQ 702
                 +VI+G + + +Y++R+ + S+
Sbjct: 848 ------LVIDGAMGQLFYQVRSLVMSK 868


>gi|242778797|ref|XP_002479311.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
           putative [Talaromyces stipitatus ATCC 10500]
 gi|218722930|gb|EED22348.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
           putative [Talaromyces stipitatus ATCC 10500]
          Length = 861

 Score =  130 bits (327), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 94/342 (27%), Positives = 163/342 (47%), Gaps = 19/342 (5%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +LLSH    H  ALPY + +      V +T     +    + D        S  D
Sbjct: 75  STVDILLLSHFHVDHSSALPYVLSKTNFKGRVLTTHATKAIYKWLIQDNVRVSNTSSSSD 134

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
             T    +    S   L  + +++ +     I V P+ AGH+LG  ++ ++  G ++++ 
Sbjct: 135 QRTTLYTEHDHLSTLPLIETIDFYTTHTINSIRVTPYPAGHVLGAAMFLVSIAGLNILFT 194

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNV 231
            DY+R +++HL    +   ++  VLIT++   + + PPR +RE     +I+  L  GG V
Sbjct: 195 GDYSREEDRHLIPAEVPRGIKIDVLITESTFGISSNPPRLEREAALMKSITGILNRGGRV 254

Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           L+PV + GR  ELLLILE+YW  H      PIY++  ++   +   ++++  M D+I + 
Sbjct: 255 LMPVFALGRAQELLLILEEYWERHPEYQKVPIYYIGNMARRCMVVYQTYIGAMNDNIKRL 314

Query: 290 FETSRDNA------------FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
           F      A            +  ++V  L +    D+   G  ++LAS   L+ G S ++
Sbjct: 315 FRQRMAEAEASGNKNVAAGPWDFRYVRSLRSLERFDDI--GSCVMLASPGMLQTGTSREL 372

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSR 379
              WA   +N V+ T     GT+A+ L  +  P+ +  TMS+
Sbjct: 373 LERWAPSERNGVVMTGYSVEGTMAKQLLNE--PEQIPATMSK 412


>gi|452845681|gb|EME47614.1| hypothetical protein DOTSEDRAFT_146416 [Dothistroma septosporum
           NZE10]
          Length = 839

 Score =  130 bits (327), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 98/372 (26%), Positives = 173/372 (46%), Gaps = 30/372 (8%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           ++V   G   ++D G +  ++     P       ST+D +L++H    H  +LPY + + 
Sbjct: 43  HIVQYKGKTVMLDAGIHPSYEGLGALPFYDEFDLSTVDLLLITHFHQDHSASLPYVLAKT 102

Query: 79  GLSAPVFSTEP---VYRLGL---LTMYDQYLSRRQVSEFD-----LFTLDDIDSAFQSVT 127
                VF T P   +Y+      + +++ +      S  D     L+T  DI S    + 
Sbjct: 103 DFHGKVFMTHPTKAIYKWTTQDAVRVHNTHTPASSTSGTDGYVSQLYTEQDILSTLPMIQ 162

Query: 128 RLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTV 187
            ++++  +       GI   P+ AGH+LG  ++ I   G ++++  DY+R  ++HL    
Sbjct: 163 TISFNTTH----SHNGIRFTPYPAGHVLGACMYHIEIAGLNILFTGDYSREIDRHLIPAT 218

Query: 188 LESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLL 246
           +   V+   LIT++   +  + PRQ+RE     +++  L  GG VL+P  + G   ELLL
Sbjct: 219 IPPNVKIDCLITESTFGISTREPRQERENQLMKSVTNILNRGGRVLMPTTAVGNTQELLL 278

Query: 247 ILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA------- 297
           ILEDYW  H     +PIY+ + ++   +   +++++ M D I   F+ S   A       
Sbjct: 279 ILEDYWQRHEEYRRFPIYYASGLARKVMVVYQTYVDNMNDRIKAKFQASAAAAGDGGAAG 338

Query: 298 -FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQ 356
            +  + V  L      ++   G  +VLAS   L+ G S  +   WA D KN V+ T    
Sbjct: 339 PWDFQFVRALKGVDRFEDV--GGSVVLASPGMLQNGPSRALLERWAPDPKNGVVITGYSV 396

Query: 357 FGTLARMLQADP 368
            GT+A+ +  +P
Sbjct: 397 EGTMAKQIMLEP 408


>gi|342319748|gb|EGU11695.1| Endoribonuclease YSH1 [Rhodotorula glutinis ATCC 204091]
          Length = 857

 Score =  130 bits (326), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 92/331 (27%), Positives = 162/331 (48%), Gaps = 18/331 (5%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGL---SAPVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+DA+L++H    H   L Y M++      +  V+ + P   +    M D        S
Sbjct: 80  STVDAILITHFHLDHAACLTYVMEKTNFKEGNGVVYMSHPTKAVYRYLMSDFVRVSTAGS 139

Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHL---SGKGEGIVVAPHVAGHLLGGTVWKITKDG 166
           + +LFT  ++ ++F  +    + Q   L   S     +      AGH+LG  ++ I   G
Sbjct: 140 DDNLFTESEMLASFDQIQSFDFEQEILLPPSSTSSASVRFTSFAAGHVLGACMFLIEVAG 199

Query: 167 EDVIYAVDYNRRKEKHLNGTVLESFVRPA-VLITDAYNALHNQPPRQQRE-MFQDAISKT 224
             V+Y  DY+  +++HL    + ++ RP  V+I ++   + +  PR ++E  F + +   
Sbjct: 200 ARVLYTGDYSTEEDRHLVPAKVPNWERPPDVMICESTYGVQSHEPRLEKEAQFTNLVRSI 259

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWM 282
           L+ GG VLLPV + GR  ELLLIL++YWAEH    + PIY+++ ++   +D  + ++  M
Sbjct: 260 LKRGGRVLLPVFALGRAQELLLILDEYWAEHPELQHIPIYYVSSLAIKCMDVYRQYIHTM 319

Query: 283 GDSITKSFETSRDNAFLLKHVTLLINK-----SELDNAPDGPKLVLASMASLEAGFSHDI 337
             ++   F     N F  K     I       S+L++    P +V+AS   L +G S ++
Sbjct: 320 SPNVRSKFARG-INPFDFKRKDSFIRPLDRGISKLNDR--NPCVVMASPGFLTSGVSREL 376

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADP 368
             +WA D +N ++ T     G +AR +  +P
Sbjct: 377 LEKWAPDPRNGLIITGYSVEGVMARTIMNEP 407


>gi|169767044|ref|XP_001817993.1| endoribonuclease ysh1 [Aspergillus oryzae RIB40]
 gi|83765848|dbj|BAE55991.1| unnamed protein product [Aspergillus oryzae RIB40]
 gi|391872741|gb|EIT81836.1| mRNA cleavage and polyadenylation factor II complex, BRR5
           [Aspergillus oryzae 3.042]
          Length = 870

 Score =  130 bits (326), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 95/342 (27%), Positives = 163/342 (47%), Gaps = 19/342 (5%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H  ALPY + +      VF T     +    + D        S  D
Sbjct: 75  STVDILLISHFHVDHSSALPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVSNTASSSD 134

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
             T    +    S   L  + +++ +     I + P  AGH+LG  ++ I+  G ++++ 
Sbjct: 135 QRTTLYTEHDHLSTLPLIETIDFNTTHTINSIRITPFPAGHVLGAAMFLISIAGLNILFT 194

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNV 231
            DY+R +++HL    +   ++  VLIT++   + + PPR +RE     +I+  L  GG V
Sbjct: 195 GDYSREEDRHLIPAEVPKGIKIDVLITESTFGISSNPPRLEREAALMKSITGVLNRGGRV 254

Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           L+PV + GR  ELLLIL++YW +H      PIY++   +   +   ++++  M D+I + 
Sbjct: 255 LMPVFALGRAQELLLILDEYWEKHPELQKVPIYYIGNTARRCMVVYQTYIGAMNDNIKRL 314

Query: 290 F-------ETSRDNAFL-----LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
           F       E S D +        + V  L +    D+   G  ++LAS   L+ G S ++
Sbjct: 315 FRQRMAEAEASGDKSISAGPWDFRFVRSLRSLERFDDV--GGCVMLASPGMLQTGTSREL 372

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSR 379
              WA + +N V+ T     GT+A+ L  +  P+ +   MSR
Sbjct: 373 LERWAPNERNGVVMTGYSVEGTMAKQLLNE--PEQIPAVMSR 412


>gi|238483863|ref|XP_002373170.1| cleavage and polyadenylation specifity factor, 73 kDa subunit
           [Aspergillus flavus NRRL3357]
 gi|220701220|gb|EED57558.1| cleavage and polyadenylation specifity factor, 73 kDa subunit
           [Aspergillus flavus NRRL3357]
          Length = 870

 Score =  130 bits (326), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 95/342 (27%), Positives = 163/342 (47%), Gaps = 19/342 (5%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H  ALPY + +      VF T     +    + D        S  D
Sbjct: 75  STVDILLISHFHVDHSSALPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVSNTASSSD 134

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
             T    +    S   L  + +++ +     I + P  AGH+LG  ++ I+  G ++++ 
Sbjct: 135 QRTTLYTEHDHLSTLPLIETIDFNTTHTINSIRITPFPAGHVLGAAMFLISIAGLNILFT 194

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNV 231
            DY+R +++HL    +   ++  VLIT++   + + PPR +RE     +I+  L  GG V
Sbjct: 195 GDYSREEDRHLIPAEVPKGIKIDVLITESTFGISSNPPRLEREAALMKSITGVLNRGGRV 254

Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           L+PV + GR  ELLLIL++YW +H      PIY++   +   +   ++++  M D+I + 
Sbjct: 255 LMPVFALGRAQELLLILDEYWEKHPELQKVPIYYIGNTARRCMVVYQTYIGAMNDNIKRL 314

Query: 290 F-------ETSRDNAFL-----LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
           F       E S D +        + V  L +    D+   G  ++LAS   L+ G S ++
Sbjct: 315 FRQRMAEAEASGDKSISAGPWDFRFVRSLRSLERFDDV--GGCVMLASPGMLQTGTSREL 372

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSR 379
              WA + +N V+ T     GT+A+ L  +  P+ +   MSR
Sbjct: 373 LERWAPNERNGVVMTGYSVEGTMAKQLLNE--PEQIPAVMSR 412


>gi|121705410|ref|XP_001270968.1| cleavage and polyadenylylation specificity factor, putative
           [Aspergillus clavatus NRRL 1]
 gi|119399114|gb|EAW09542.1| cleavage and polyadenylylation specificity factor, putative
           [Aspergillus clavatus NRRL 1]
          Length = 1014

 Score =  130 bits (326), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 112/430 (26%), Positives = 172/430 (40%), Gaps = 103/430 (23%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G   L+D GW+D FD   L  L K   T+  +LL+H    H+GA  +  K   L    PV
Sbjct: 27  GIKILVDVGWDDTFDTLDLLELEKHIPTLSLILLTHATPSHIGAFVHCCKTFPLFTQIPV 86

Query: 85  FSTEPVYRLGLLTMYDQYLS---------RRQVSE------------------------- 110
           ++T PV  LG   + D Y S         +  +SE                         
Sbjct: 87  YATSPVISLGRTLLQDLYASSPLAATFLPKASISEPGASTSAASAAASVTDGQGSSDASN 146

Query: 111 -----FDLFTLDDIDSAFQSVTRLTYSQNYH-----LSGKGEGIVVAPHVAGHLLGGTVW 160
                    T ++I   F  +  L YSQ +       S    G+ +  + AGH +GGT+W
Sbjct: 147 AGRILLQPPTTEEIARYFSLIHPLKYSQPHQPLSSPFSSPLNGLTLTAYNAGHTVGGTIW 206

Query: 161 KITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAYNALHNQ 208
            I    E ++YAVD+N+ +E  + G             V+E   +P  L+          
Sbjct: 207 HIQHGMESIVYAVDWNQARESVVAGAAWFGGSGASGTEVIEQLRKPTALVCSTRGGDKFA 266

Query: 209 PP--RQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYP----- 260
            P  R++R+ +  D I  +L  GG VL+P D++ RVLEL   LE  W + +         
Sbjct: 267 LPGGRKKRDDLLLDMIRSSLAKGGTVLIPTDTSARVLELAYALEHAWRDAAAGNSESDNV 326

Query: 261 -----IYFLTYVSSSTIDYVKSFLEWMGDSITKSFET----------SRDNA-------- 297
                +Y       +T+   +S +EWM ++I + FE           S+ N         
Sbjct: 327 LKGAGLYMAGRKGHTTMRLARSMIEWMDENIVREFEAAEGVDAVTGQSQSNTDGQRSGGQ 386

Query: 298 ------------FLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHDIFVEWAS 343
                       F  KH+ ++  K  L+   A   PK+++AS  SL+ GF+ +     A 
Sbjct: 387 GQGKTGPKGVGPFTFKHLKIVERKKRLEKLLADQTPKVIIASDTSLDWGFAKESLRLVAE 446

Query: 344 DVKNLVLFTE 353
              NL+L TE
Sbjct: 447 GPNNLLLLTE 456


>gi|342879865|gb|EGU81098.1| hypothetical protein FOXB_08372 [Fusarium oxysporum Fo5176]
          Length = 858

 Score =  130 bits (326), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 92/331 (27%), Positives = 160/331 (48%), Gaps = 26/331 (7%)

Query: 67  HLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYL---SRRQVSEFDLFTLDDIDSAF 123
           H  +LPY + +      VF T P   +    + D      +    +   ++T  D  + F
Sbjct: 115 HAASLPYVLAKTNFRGRVFMTHPTKAIYKWLIQDSVRVGNTSSNPTTQPVYTEQDHLNTF 174

Query: 124 QSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHL 183
             +  + Y   + +S     I + P+ AGH+LG  ++ I   G ++ +  DY+R +++HL
Sbjct: 175 PQIEAIDYHTTHTISS----IRITPYPAGHVLGAAMFLIEIAGLNIFFTGDYSREQDRHL 230

Query: 184 NGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVL 242
               +   V+  VLIT++   + +  PR +RE     +I+  L  GG VL+PV + GR  
Sbjct: 231 VSAEVPKGVKIDVLITESTYGIASHVPRLEREQALMKSITSILNRGGRVLMPVFALGRAQ 290

Query: 243 ELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF-------ETS 293
           ELLLIL++YW +H+    YPIY+ + ++   +   ++++  M D+I + F       E S
Sbjct: 291 ELLLILDEYWGKHADFQKYPIYYASNLARKCMLIYQTYVGAMNDNIKRLFRERMAEAEAS 350

Query: 294 RDNA-----FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNL 348
            D A     +  K++  L N    D+   G  ++LAS   L+ G S ++   WA   KN 
Sbjct: 351 GDGAGKGGPWDFKYIRSLKNLDRFDDV--GGCVMLASPGMLQNGVSRELLERWAPSEKNG 408

Query: 349 VLFTERGQFGTLARMLQADPPPKAVKVTMSR 379
           V+ T     GT+A+ +  +  P  ++  MSR
Sbjct: 409 VIITGYSVEGTMAKQIMQE--PDQIQAVMSR 437


>gi|354543512|emb|CCE40231.1| hypothetical protein CPAR2_102690 [Candida parapsilosis]
          Length = 938

 Score =  130 bits (326), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 96/345 (27%), Positives = 163/345 (47%), Gaps = 30/345 (8%)

Query: 31  LIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGA---LPYAMKQLGLSAPVFST 87
           L D  WN   D    Q +       DA+++SH     +     L      +  + PV+ST
Sbjct: 30  LADPSWNG-VDAKAAQFMESHLQQTDAIIISHSTDEFISGYILLCITFPNIMSNMPVYST 88

Query: 88  EPVYRLGLLTMYDQYLSRRQVSEF--DLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIV 145
            PV +LG ++  + Y S+  +     +L  LD+ID+ F   T + Y QN  +  +   I 
Sbjct: 89  LPVNQLGRISTVEYYRSQGILGPLLSNLIELDEIDNWFDKFTIVKYQQNVTICDRK--IT 146

Query: 146 VAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVL---------ESFVRPAV 196
           + P+ +GH LGGT W   K  + ++YA  +N  K+  LNG             S +RP  
Sbjct: 147 MTPYNSGHSLGGTFWLFVKRIDRIVYAPSWNHSKDAFLNGANFINSTSGNPHVSLLRPTA 206

Query: 197 LI--TDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
            I  TD  +A+ +   +++ E F   +  TL  GG+ ++P   +GR LE+  +++++   
Sbjct: 207 FITATDLGSAMSH---KKRCEKFLQLVDATLANGGSAIIPTSISGRFLEVFHLVDEHLKG 263

Query: 255 HSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLL----KHVTLLINKS 310
             +  P+YF++Y  +  + Y  S ++WM     K++ T   N  LL      V LL++ S
Sbjct: 264 API--PVYFISYSGTKVLSYASSLMDWMSSDFNKTWNTDGGNNSLLPFNPSKVDLLLDPS 321

Query: 311 ELDNAPDGPKLVLASMASLEAG-FSHDIFVEWASDVKNLVLFTER 354
           EL   P G K++  +   L+ G  S  +F    +D +  V+ TE+
Sbjct: 322 ELTQTP-GAKIIFCAGLDLKNGDLSSKVFSYLCNDERTTVILTEK 365



 Score = 40.4 bits (93), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 43/83 (51%), Gaps = 6/83 (7%)

Query: 615 LLPISTPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKVGPAGQK 673
           LL +   AP    + +G++++ DLK  L+S  + VEF G G L     + IRKV     +
Sbjct: 851 LLMVIANAP---KLAIGNIRLPDLKNKLTSLNLNVEFKGEGTLVVNNALAIRKVAYGSLE 907

Query: 674 GGGSGTQQIVIEGPLCEDYYKIR 696
              SG   IVI+G     YYK++
Sbjct: 908 SDDSG--DIVIDGNAGPLYYKVK 928


>gi|115397403|ref|XP_001214293.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114192484|gb|EAU34184.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 870

 Score =  130 bits (326), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 97/342 (28%), Positives = 165/342 (48%), Gaps = 19/342 (5%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H  ALPY + +      VF T     +    + D        S  D
Sbjct: 75  STVDVLLISHFHVDHSSALPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVSNTASSSD 134

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
             T    +    S   L  + +++ +     I + P  AGH+LG  ++ ++  G ++++ 
Sbjct: 135 QRTTLYTEHDHLSTLPLIETIDFNTTHTVNSIHITPFPAGHVLGAAMFLVSIAGLNILFT 194

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNV 231
            DY+R +++HL    +   V+  VLIT++   + + PPR +RE     +I+  L  GG V
Sbjct: 195 GDYSREEDRHLIPAEVPKGVKIDVLITESTFGISSNPPRLEREAALMKSITGVLNRGGRV 254

Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           L+PV + GR  ELLLIL++YW  H      PIY++   +   +   ++++  M D+I + 
Sbjct: 255 LMPVFALGRAQELLLILDEYWETHPELQKIPIYYIGNTARRCMVVYQTYIGAMNDNIKRL 314

Query: 290 F-------ETSRD-NA----FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
           F       E S D NA    +  + V  L +    D+   G  ++LAS   L++G S ++
Sbjct: 315 FRQRMAEAEASGDKNASAGPWDFRFVRSLRSLERFDDV--GGCVMLASPGMLQSGTSREL 372

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSR 379
              WA + +N V+ T     GT+A+ L  +  P+ +   MSR
Sbjct: 373 LERWAPNERNGVIMTGYSVEGTMAKQLLNE--PEQIPAVMSR 412


>gi|212533753|ref|XP_002147033.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
           putative [Talaromyces marneffei ATCC 18224]
 gi|210072397|gb|EEA26486.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
           putative [Talaromyces marneffei ATCC 18224]
          Length = 866

 Score =  129 bits (325), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 94/342 (27%), Positives = 163/342 (47%), Gaps = 19/342 (5%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +LLSH    H  ALPY + +      V +T     +    + D        S  D
Sbjct: 75  STVDILLLSHFHVDHSSALPYVLSKTNFKGRVLTTHATKAIYKWLIQDNVRVSNTSSSSD 134

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
             T    +    S   L  + +++ +     I V P+ AGH+LG  ++ ++  G ++++ 
Sbjct: 135 QRTSLYTEHDHLSTLPLIETIDFYTTHTINSIRVTPYPAGHVLGAAMFLVSIAGLNILFT 194

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNV 231
            DY+R +++HL    +   ++  VLIT++   + + PPR +RE     +I+  L  GG V
Sbjct: 195 GDYSREEDRHLIPAEVPRGIKIDVLITESTFGISSNPPRLEREAALMKSITGILNRGGRV 254

Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           L+PV + GR  ELLLILE+YW  H      PIY++  ++   +   ++++  M D+I + 
Sbjct: 255 LMPVFALGRAQELLLILEEYWERHPEFQKIPIYYIGNMARRCMVVYQTYIGAMNDNIKRL 314

Query: 290 FETSRDNA------------FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
           F      A            +  ++V  L +    D+   G  ++LAS   L+ G S ++
Sbjct: 315 FRQRMAEAEASGNKNVAAGPWDFRYVRSLRSLERFDDI--GSCVMLASPGMLQTGTSREL 372

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSR 379
              WA   +N V+ T     GT+A+ L  +  P+ +  TMS+
Sbjct: 373 LERWAPSERNGVVMTGYSVEGTMAKQLLNE--PEQIPATMSK 412


>gi|449546825|gb|EMD37794.1| hypothetical protein CERSUDRAFT_154677 [Ceriporiopsis subvermispora
           B]
          Length = 820

 Score =  129 bits (325), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 91/324 (28%), Positives = 161/324 (49%), Gaps = 14/324 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGL---SAPVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+D +L++H    H  AL Y  ++         V+ T P   L    M D ++     +
Sbjct: 57  STVDVLLITHFHLDHAAALTYITEKTNFRDGKGKVYMTHPTKALHKFMMQD-FVRMSSST 115

Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
              LF+  D+  +  ++  ++  Q   +     G+   P+ AGH+LG  ++ I   G  +
Sbjct: 116 SDALFSPLDLSMSMSAIIPVSAHQ---VITPCPGVSFTPYHAGHVLGACMFLIDIAGLKI 172

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAG 228
           +Y  DY+R +++HL    +   +RP VLI ++   +     R+++E  F   +   +R G
Sbjct: 173 LYTGDYSREEDRHLVKAEVPP-IRPDVLIVESTYGVQTLEGREEKEQRFTTLVHNIIRRG 231

Query: 229 GNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSI 286
           G+VLLP  + GR  ELLLIL++YW +H    N PIY+ + ++   +   ++++  M  ++
Sbjct: 232 GHVLLPTFALGRAQELLLILDEYWKKHPDLHNVPIYYASSLARKCMAVYQTYIHTMNANV 291

Query: 287 TKSFETSRDNAFLLKHVTLLINKS--ELDNAPDGPKLVLASMASLEAGFSHDIFVEWASD 344
              F   RDN F+ KH++ +      E   A   P +VLAS   + +G S ++   WA D
Sbjct: 292 RTRF-AKRDNPFVFKHISNVPQARGWERKIAEGPPCVVLASPGFVTSGPSRELLELWAPD 350

Query: 345 VKNLVLFTERGQFGTLARMLQADP 368
            +N ++ T     GT+AR +  +P
Sbjct: 351 SRNGIIVTGYSVEGTMARDILNEP 374


>gi|440638117|gb|ELR08036.1| hypothetical protein GMDG_02874 [Geomyces destructans 20631-21]
          Length = 831

 Score =  129 bits (324), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 95/373 (25%), Positives = 174/373 (46%), Gaps = 18/373 (4%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           +++   G   ++D G +  FD     P       ST+D +L+SH    H  +LPY + + 
Sbjct: 40  HIIQYKGKTVMLDAGMHPAFDGLSALPFYDDFDLSTVDVLLISHFHIDHAASLPYVLAKT 99

Query: 79  GLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLS 138
                VF T P   +    + D        S  +  +    ++   S   +  + +YH +
Sbjct: 100 NFKGRVFMTHPTKAIYKWLIQDSVRVSSNSSSTEQSSTPYTEADHASTFPMIEAIDYHTT 159

Query: 139 GKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLI 198
                I + P  AGH+LG  ++ I+  G  +++  DY+   ++HL    + + V+  VLI
Sbjct: 160 HTISSIRITPLPAGHVLGAAMFLISISGLTILFTGDYSIEPDRHLISASVPANVKVDVLI 219

Query: 199 TDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS- 256
           T++   + +  PR +RE     +I+  L  GG VL+PV + GR  ELLLIL++YW+ H  
Sbjct: 220 TESTYGVASHVPRLEREQALMKSITGILNRGGRVLMPVFALGRAQELLLILDEYWSRHKD 279

Query: 257 -LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET---------SRDNAFLLKHVTLL 306
             N PIY+ + ++   +   ++++  M ++I + F           +    +  K++  L
Sbjct: 280 LQNIPIYYASNLARKCMLVYQTYVGAMNENIKRLFRERMAESEAGGTNGGPWDFKYIRSL 339

Query: 307 INKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQA 366
            +    D+   G  ++LAS   ++ G S ++   WA   KN V+ T     GT+A+ +  
Sbjct: 340 KSLERFDDV--GSCVMLASPGMMQNGVSRELLERWAPSDKNGVVITGYSVEGTMAKSIMQ 397

Query: 367 DPPPKAVKVTMSR 379
           +  P  ++  MSR
Sbjct: 398 E--PDQIQAIMSR 408


>gi|358058074|dbj|GAA96053.1| hypothetical protein E5Q_02714 [Mixia osmundae IAM 14324]
          Length = 896

 Score =  129 bits (323), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 165/681 (24%), Positives = 288/681 (42%), Gaps = 145/681 (21%)

Query: 115 TLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKI-TKDGEDVIYAV 173
           ++ ++  AF  +  + +S   HL G+   + +    +G  LGGT++ + +     ++YA 
Sbjct: 137 SMREVREAFDRIRTIRWSSPLHLEGRNAPLTLLAQPSGTHLGGTLFFVRSPTMPPILYAP 196

Query: 174 DYNRRKEKHLNGTVLESFVRPA-------VLITDAYNAL-HNQPPRQQREMFQDAISKTL 225
            +N  KEKHL+     S V           LIT    A    Q    +       I+ TL
Sbjct: 197 VFNHIKEKHLDSAA--SIVLGGAETKGLGTLITSVEKAQSKGQKTVARNSAMLQTITSTL 254

Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWA-EHSLNYPIYFLTYVSSSTIDYVKSFLEWMGD 284
           +AG +VL+PVD+AGR+ ELL++L+ +W   H  ++P+  ++         +++  E+ G 
Sbjct: 255 QAGRSVLMPVDAAGRIAELLVLLDQHWTFSHLGDFPLCLVSPTGPPLQMTLRNLHEFFGS 314

Query: 285 SITKSFETSRDNAFLLKHVTLLINKSELDNA--PDGPKLVLASMASLEAGFSHDIFVEWA 342
           ++ K      +    L ++ +  +   L     P  PK+VLA+   L  G S  +F E A
Sbjct: 315 NLGK------EGIGRLANLKIFPSLDSLYAVIPPHVPKVVLAAPLPLSYGSSRKVFTEMA 368

Query: 343 SDVKNLVLFTERGQFGTLARML-----QADPPPK---------------AVKVTMSRRVP 382
           +   NL+L T  G  G+L+R L     +A  P +               AV + M  +V 
Sbjct: 369 AQAGNLLLLTSPGPAGSLSRSLFDKWNEAQTPAQRMGTGEIGQTITLNEAVSLPMRSKVI 428

Query: 383 LVGEELIAYEEEQTRLKKEEALKASLVKEEESKAS-----LGPDNNLSGDPMVIDANNAN 437
           L GEEL  + + Q   K+  A + ++++  +  A         ++  S D   ++A NA 
Sbjct: 429 LQGEELQEFLDNQRAAKERHAKQKAMLERSQRMAEADADASDSEDGDSSDEDELEAPNAG 488

Query: 438 A-------SADVVEPHGGRYR------------DILIDGFVPPST--------SVAP--- 467
                   + DV+   G R              D  +D   PP T        S+A    
Sbjct: 489 EILPQQGDNVDVMAEPGARRDGEPGSMRGTGVWDEFLDEDAPPGTLDVYVRGRSIAAFLN 548

Query: 468 -----------MFPFYENNSEWDDFGEVINPDDYIIK----DEDMDQAAMH---IGGDDG 509
                      M+PF E   + D +GEVI+   ++ +    +E+ ++ AM+   +G    
Sbjct: 549 GMPDTTSSRLRMYPFTERRRKVDAYGEVIDVQGWLRRGRNDEEEQEENAMNNALLGKRKR 608

Query: 510 KLDEG----------SASLIL-------------DAKPSKVVSNELT----VLVHGSAEA 542
           + DE              ++L             D +  K +   L     +LV+GS+ A
Sbjct: 609 QQDEQVEPPHKFLIEERQVMLRCQLFAVDLEGRADGRALKDIIPRLAPKRLILVNGSSAA 668

Query: 543 TEHLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIA--- 599
            + + + C   V P +  P + E +    ++ ++ ++L ++L+S++   K+ +YE+A   
Sbjct: 669 AQDIARACHDFV-PVIEAPALGERVIAGIEIQSFAIRLGDELLSSLKLSKVEEYEMARIS 727

Query: 600 ----WVDAEVGKTENGMLSLLPIS----------------TPAPPHKSVLVGDLKMADLK 639
               +VD E   T    L+   IS                + AP   S+ +GD+K+A L+
Sbjct: 728 GILRFVDGEDIPTLEPSLAQAAISEDLLVDGADTEMTKKGSLAPLKPSMFIGDVKLAALR 787

Query: 640 PFLSSKGIQVEFAG-GALRCG 659
             L S  IQ  FAG G L CG
Sbjct: 788 QRLLSAKIQASFAGAGVLVCG 808


>gi|340509014|gb|EGR34593.1| hypothetical protein IMG5_006210 [Ichthyophthirius multifiliis]
          Length = 456

 Score =  128 bits (322), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 91/335 (27%), Positives = 152/335 (45%), Gaps = 30/335 (8%)

Query: 49  SKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPV----------YRLGLLTM 98
           ++    ID VL+SH    H+GALPY  +      P++ T P           YR  +   
Sbjct: 64  TQYTDIIDLVLISHFHLDHIGALPYFSEIYQYDGPIYMTAPTKALFPYMCEDYRKVISDT 123

Query: 99  YDQ--------YLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHV 150
           Y +           + Q   F +++ ++I ++FQ V  +   +   ++G    I + P+ 
Sbjct: 124 YKKENMIDDNNNNDQLQKMPF-VYSQENIQNSFQKVQTIQLLETIDVNG----IKIKPYY 178

Query: 151 AGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDA-YNALHNQP 209
           AGH+LG  ++ I   G  V+Y  D++   ++HL    ++  + P +LI++  Y  +  + 
Sbjct: 179 AGHVLGACMFLIEYKGIKVVYTGDFHSNADRHLGAAWIDK-INPDLLISECTYGTIVRES 237

Query: 210 PRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSS 269
            R +   F   + +T+  GG VL+PV + GR  EL ++LE YW       P+YF   +  
Sbjct: 238 KRARERTFLQQVQETIDQGGKVLIPVFALGRAQELCVLLETYWQRTQNQAPVYFAAGMIE 297

Query: 270 STIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASL 329
               Y K F+ W  + I   +    DN F  KH+     KS +    + P ++ A+   L
Sbjct: 298 KANFYYKLFVNWTNEKIKSCYLI--DNMFNFKHIKPF-QKSLIK--ANMPMVLFATPGML 352

Query: 330 EAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML 364
            AG S  +F EW  D KN ++       GTL   L
Sbjct: 353 HAGLSMQVFKEWCYDSKNTLIIPGYCVAGTLGNKL 387


>gi|27372065|gb|AAN87883.1| FEG protein [Arabidopsis thaliana]
          Length = 613

 Score =  128 bits (322), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 96/358 (26%), Positives = 165/358 (46%), Gaps = 20/358 (5%)

Query: 22  LVSIDGFNFLIDCGW-------NDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
           +V+I+G   + DCG        N + + SL+       + I  ++++H    H+GALPY 
Sbjct: 20  VVTINGKKIMFDCGMHMGCDDHNRYPNFSLISKSGDFDNAISCIIITHFHMDHVGALPYF 79

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTM--YDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYS 132
            +  G + P++ + P   L  L +  Y + +  R+  E +LFT   I +  + V  +   
Sbjct: 80  TEVCGYNGPIYMSYPTKALSPLMLEDYRRVMVDRR-GEEELFTTTHIANCMKKVIAIDLK 138

Query: 133 QNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFV 192
           Q   +    E + +  + AGH+LG  +         ++Y  DYN   ++HL    ++  +
Sbjct: 139 QTIQVD---EDLQIRAYYAGHVLGAVMVYAKMGDAAIVYTGDYNMTTDRHLGAAKIDR-L 194

Query: 193 RPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDY 251
           +  +LI+++  A   +  +  RE  F  A+ K +  GG  L+P  + GR  EL ++L+DY
Sbjct: 195 QLDLLISESTYATTIRGSKYPREREFLQAVHKCVAGGGKALIPSFALGRAQELCMLLDDY 254

Query: 252 WAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSE 311
           W   ++  PIYF + ++     Y K  + W   ++ +   T   N F  K+V        
Sbjct: 255 WERMNIKVPIYFSSGLTIQANMYYKMLISWTSQNVKEKHNTH--NPFDFKNVKDF--DRS 310

Query: 312 LDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPP 369
           L +AP GP ++ A    L AG S ++F  WA    NLV        GT+   L A  P
Sbjct: 311 LIHAP-GPCVLFAIPGMLCAGLSLEVFKHWAPSPLNLVALLGYSVAGTVGHKLMAGKP 367


>gi|389583415|dbj|GAB66150.1| RNA-metabolising metallo-beta-lactamase domain containing protein
           [Plasmodium cynomolgi strain B]
          Length = 713

 Score =  128 bits (322), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 91/351 (25%), Positives = 160/351 (45%), Gaps = 41/351 (11%)

Query: 44  LLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD--- 100
           L++ LS++   ID V++SH    H+GALP+  + L     +  + P   L    + D   
Sbjct: 99  LIKNLSRINEIIDCVIISHFHMDHIGALPFFTEILKYRGTIIMSYPTKALSPTLLLDGCR 158

Query: 101 --------QYLSRR---------QVSEFDLFTL---------DDIDSAFQSVTRLTYSQN 134
                   Q   R+         ++  +++ +L         D I S    V  L  ++ 
Sbjct: 159 VADIKWEKQNFERQIKLLNEKSDELLNYNISSLKKDPWNISEDHIYSCIGKVVGLQINET 218

Query: 135 YHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRP 194
           + +      + + P+ AGH+LG  ++KI  +   VIY  DYN   +KHL  T + S   P
Sbjct: 219 FEMG----NMSITPYYAGHVLGACIFKIEVNNFSVIYTGDYNTVPDKHLGSTKIPSLT-P 273

Query: 195 AVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 253
            + I+++  A + +P R+  E+   + + + +  GG VL+PV + GR  EL ++L+ YW 
Sbjct: 274 EIFISESTYATYVRPTRKASELDLCNLVHECVHKGGKVLIPVFAIGRAQELSILLDSYWR 333

Query: 254 EHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELD 313
           +  +NYPIYF   ++ +   Y + +  W+  S      T + N F   +++  +N    +
Sbjct: 334 KMKINYPIYFGCGLTENANKYYRIYSSWVNSSCV---STDKKNLFDFANISPFVNNYLGE 390

Query: 314 NAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML 364
           N    P ++ A+   L  G S   F  WA   KNL++       GT+   L
Sbjct: 391 NR---PMVLFATPGMLHTGLSLKAFKAWAGSSKNLIVLPGYCVQGTVGHKL 438


>gi|308198072|ref|XP_001387057.2| predicted protein [Scheffersomyces stipitis CBS 6054]
 gi|149389019|gb|EAZ63034.2| predicted protein [Scheffersomyces stipitis CBS 6054]
          Length = 934

 Score =  128 bits (322), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 145/561 (25%), Positives = 238/561 (42%), Gaps = 102/561 (18%)

Query: 22  LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSH--PDTLHLGALPYAMK-- 76
           L+S D  F  L D  WN   D + +  + +     + +LLSH  P+ +  G +   +K  
Sbjct: 20  LLSFDNEFRVLADPSWNGK-DVNSVMFMEQHLRNTNIILLSHSTPEFIS-GYVLMCLKFP 77

Query: 77  QLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD--LFTLDDIDSAFQSVTRLTYSQN 134
            L  +  V+ST PV +LG L+  + Y +   +   +  L  LD++D  F  ++ L Y Q 
Sbjct: 78  NLMANIQVYSTLPVNQLGRLSTVEFYRANGMLGPLNTALLELDEVDEWFDKISLLKYLQ- 136

Query: 135 YHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTV------- 187
             L+     +V+ P+ AGH LGGT W ITK  + VIYA  +N  K+  LNG         
Sbjct: 137 -ILNVFDNKVVITPYNAGHTLGGTFWLITKRSDRVIYAPAWNHSKDSFLNGASFLSSSSG 195

Query: 188 --LESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELL 245
             L   +RP   IT + +       +++ E F   +  TL  GG  ++P   +GR LEL 
Sbjct: 196 NPLSQLLRPTAFIT-STDMGSVMSHKKRTEKFLQLVDATLANGGAAVIPTSLSGRFLELF 254

Query: 246 LILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA------FL 299
            +++++     +  P+YFL+Y  +  + Y  + ++WM  S+   +E +  +       F 
Sbjct: 255 HLIDEHLQGAPI--PVYFLSYSGTKVLSYASNLIDWMSSSVQSQWEEAESSTNYKNLPFD 312

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAG-FSHDIFVEWASDVKNLVLFTERGQFG 358
              V LL++  EL     GPK+V  S   L  G  S + F     D K+ +L TE+  FG
Sbjct: 313 PSKVDLLLSPEELIQLS-GPKIVFCSGIDLRNGELSAEAFQYLCQDEKSTILLTEKSLFG 371

Query: 359 ---TLARMLQAD-------------------PPPKAVKV-TMSRRVPLVGEELIAYEEEQ 395
              TL  +L  +                   P  +   +   +R   L G  L  ++E  
Sbjct: 372 VDETLNTVLYKEWHSLTKQKLGGKVEDGVAVPLERVFSIDDWTREENLSGTALTDFQERI 431

Query: 396 TRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANAS---------------- 439
              +KE+ L    V++ +++  L  D  L G+    +  + N+S                
Sbjct: 432 AVRRKEKLLAK--VRDRKNQNLLNSD--LVGEEDSSEDEDGNSSDEETKVSETTETTTVV 487

Query: 440 ----------ADVVEPHGG-------------RYRDILIDGFVPPSTSVAPMFPFYEN-- 474
                     AD +  H               R  D+ I   + P  +   MFP++ N  
Sbjct: 488 ASTVASGPSVADELAAHEAFITDHIKQSLEENRPLDLKITYKLKPRQA---MFPYFINTH 544

Query: 475 NSEWDDFGEVINPDDYIIKDE 495
             ++DD+GEVI+  D+   DE
Sbjct: 545 KQKFDDYGEVIDVKDFQKTDE 565


>gi|159487337|ref|XP_001701679.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280898|gb|EDP06654.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 460

 Score =  128 bits (322), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 101/359 (28%), Positives = 168/359 (46%), Gaps = 32/359 (8%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPS-------LLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
           +V + G   + DCG +  F  +       LL    +    IDA++++H    H+GALPY 
Sbjct: 17  IVRMAGRTVMFDCGAHFGFRDARRFPEFGLLSRAGRFTELIDALVITHFHIDHIGALPYF 76

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTMYDQY-LSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQ 133
            +  G   PV  T P + +  + + D   ++  +  E   +T   +    + VT +   Q
Sbjct: 77  TEVCGYRGPVLMTYPTFAMAPIMLEDYVKVNADRPGEVLPYTEQHVRDCLRRVTAVDLHQ 136

Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLE---- 189
              +     G+    H AGH+LG  +  +T      +Y  D+N   ++HL    L     
Sbjct: 137 ---VVAVAPGLSFTFHYAGHVLGAAMVTMTAGHLTALYTGDFNSAPDRHLGSAELAAGGA 193

Query: 190 ----SFVR-PAVLITDAYNA--LHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVL 242
                 +R P VLI++A  A  L +    ++R++ Q A+  T+ AGG VL+P  + GR  
Sbjct: 194 GPAGCLMREPDVLISEATYAASLRDSKRGRERDLLQ-AVEDTVAAGGKVLIPTFAMGRAQ 252

Query: 243 ELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKH 302
           ELL++L D W    L  PIYF + ++S  + Y +  L W   ++ K+ E      F    
Sbjct: 253 ELLMLLADCWRRKGLTVPIYFSSAMASRALTYYQLLLNWTNANVRKAVEADVYGMFR--- 309

Query: 303 VTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL---FTERGQFG 358
            T   ++S L  AP GP ++ AS  ++ +G S + F  WA   +NLV+   +  RG++G
Sbjct: 310 -TRPWDRSLL-QAP-GPAVLFASPGNITSGVSLEAFRAWAGSSRNLVVLAGYQVRGEWG 365


>gi|385305954|gb|EIF49896.1| mrna cleavage and polyadenylation specificity factor complex
           subunit ysh1 [Dekkera bruxellensis AWRI1499]
          Length = 295

 Score =  128 bits (322), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 77/259 (29%), Positives = 135/259 (52%), Gaps = 10/259 (3%)

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
           L+T +D++S+   +  L    +YH + + +GI      AGH+LG  ++ +   G   ++ 
Sbjct: 37  LYTDEDLNSSLDRIEXL----DYHSTIEVDGIRFTAFPAGHVLGAAMFLVEMGGLKFLFT 92

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNV 231
            DY+R +++HL+   +   V P +LI ++        PR +RE      I  TL+ GG  
Sbjct: 93  GDYSREEDRHLSSAEVPD-VTPDLLIVESTFGTATHVPRLERENKLTTVIHSTLQQGGRC 151

Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           LLPV + GR  E+LLIL++YW  H    N PIY+ + ++   +   + ++  M DSI K 
Sbjct: 152 LLPVFALGRAQEILLILDEYWQRHKDLQNVPIYYASSLAKKCMAVYERYINMMNDSIRKK 211

Query: 290 FETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLV 349
           F  + +N F  K++  + +   +D+    P +++AS   L+ G S  +  +W  D +N V
Sbjct: 212 FTETNENPFHFKYIKNVAHADRIDDL--NPCVMIASPGMLQNGVSRQLLEKWCPDPRNTV 269

Query: 350 LFTERGQFGTLARMLQADP 368
           + T     GT+A+ L  +P
Sbjct: 270 IMTGYSVDGTMAKKLLTEP 288


>gi|2394306|gb|AAB70268.1| 73 kDA subunit of cleavage and polyadenylation specificity factor
           [Homo sapiens]
          Length = 379

 Score =  128 bits (321), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 75/274 (27%), Positives = 146/274 (53%), Gaps = 14/274 (5%)

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
           L+T  D++ +   +  +    N+H   +  GI    + AGH+LG  ++ I   G  ++Y 
Sbjct: 3   LYTETDLEESMDKIETI----NFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYT 58

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNV 231
            D++R++++HL    + + ++P +LI ++    H    R++RE  F + +   +  GG  
Sbjct: 59  GDFSRQEDRHLMAAEIPN-IKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRG 117

Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           L+PV + GR  ELLLIL++YW  H    + PIY+ + ++   +   ++++  M D I K 
Sbjct: 118 LIPVFALGRAQELLLILDEYWQNHPELXDXPIYYASSLAKKCMAVYQTYVNAMNDKIRKQ 177

Query: 290 FETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLV 349
                +N F+ KH++ L +    D+   GP +V+AS   +++G S ++F  W +D +N V
Sbjct: 178 INI--NNPFVFKHISNLKSMDHFDDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGV 233

Query: 350 LFTERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
           +       GTLA+ + ++  P+ +     +++PL
Sbjct: 234 IIAGYCVEGTLAKHIMSE--PEEITTMSGQKLPL 265


>gi|328853485|gb|EGG02623.1| hypothetical protein MELLADRAFT_38438 [Melampsora larici-populina
           98AG31]
          Length = 672

 Score =  128 bits (321), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 94/331 (28%), Positives = 163/331 (49%), Gaps = 20/331 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGL---SAPVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+DA+L++H    H  +L Y M+       +  VF T P   +    M D        +
Sbjct: 47  STVDAILITHFHLDHAASLTYIMENTNFKEGNGKVFMTHPTKAVYRFLMQDFVRMSTIGT 106

Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
           + +LF  + +  +++S+  + Y Q   L      +    + AGH+LG  ++ I   G  V
Sbjct: 107 DGELFNEEQMTLSYESINAIDYHQEISLGS----LRFTSYPAGHVLGAAMFLIEIAGIRV 162

Query: 170 IYAVDYNRRKEKHLNGTVLESF-VRPAVLITDAYNALHNQPPR-QQREMFQDAISKTLRA 227
           +Y  DY+  +++HL    + ++  +P V+I ++   + +  PR ++ E F   +   L+ 
Sbjct: 163 LYTGDYSTEEDRHLIPAKVPNWNEKPDVMICESTYGVQSLEPRPEKEERFTALVQMILKR 222

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEH-SLN-YPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG VL+PV + GR  ELLLIL++YW+ H  LN  PIY+++ +++  +   ++F+  M + 
Sbjct: 223 GGRVLMPVFALGRAQELLLILDEYWSNHPELNSIPIYYISNLAAKCMKVYQTFIHGMNEE 282

Query: 286 ITKSFETS-------RDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDI 337
           I   F          R+   L K    + N   LD   D GP +V+AS   +  G S ++
Sbjct: 283 IKSKFNKGINPWTFFREGKGLFKK-GYVTNLKTLDKFDDRGPCVVMASPGFMTNGASREL 341

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADP 368
              WA D +N +L T     GT+AR +  +P
Sbjct: 342 LERWAPDRRNGLLVTGYSIEGTMAREMLKEP 372


>gi|221055463|ref|XP_002258870.1| RNA-metabolising metallo-beta-lactamase [Plasmodium knowlesi strain
           H]
 gi|193808940|emb|CAQ39643.1| RNA-metabolising metallo-beta-lactamase,putative [Plasmodium
           knowlesi strain H]
          Length = 914

 Score =  128 bits (321), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 89/351 (25%), Positives = 160/351 (45%), Gaps = 41/351 (11%)

Query: 44  LLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD--- 100
           L++ LS++   ID V++SH    H+GALP+  + L     +  + P   L  + + D   
Sbjct: 99  LIEKLSRINEIIDCVIISHFHMDHIGALPFFTEILKYRGTIIMSYPTKALSPILLLDGCR 158

Query: 101 -------QYLSRRQVS----------EFDLFTL---------DDIDSAFQSVTRLTYSQN 134
                  +    RQ+            +++ +L         + I S    V  L  ++ 
Sbjct: 159 VADLKWEKKNFERQIKLLNEKSDELLNYNISSLKKDPWNISEEHIYSCIGKVVGLQINET 218

Query: 135 YHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRP 194
           Y +      + + P+ AGH+LG  ++KI  +   VIY  DYN   +KHL  T + S + P
Sbjct: 219 YEMG----NMSITPYYAGHVLGACIYKIEVNNFSVIYTGDYNTVPDKHLGSTKIPS-LNP 273

Query: 195 AVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 253
            + I+++  A + +P R+  E+   + + + +  GG VL+PV + GR  EL ++L+ YW 
Sbjct: 274 EIFISESTYATYVRPTRKASELDLCNLVHECVHKGGKVLIPVFAIGRAQELSILLDSYWR 333

Query: 254 EHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELD 313
           +  +NYPIYF   ++ +   Y + +  W+    +    T + N F   +++  +N    +
Sbjct: 334 KMKINYPIYFGCGLTENANKYYRIYSSWVN---SNCVSTDKKNLFDFANISPFVNNYLDE 390

Query: 314 NAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML 364
           N    P ++ A+   L  G S   F  WA    NL++       GT+   L
Sbjct: 391 NR---PMVLFATPGMLHTGLSLKAFKAWAGSSNNLIVLPGYCVQGTVGHKL 438


>gi|224140917|ref|XP_002323823.1| predicted protein [Populus trichocarpa]
 gi|222866825|gb|EEF03956.1| predicted protein [Populus trichocarpa]
          Length = 250

 Score =  127 bits (320), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 77/254 (30%), Positives = 127/254 (50%), Gaps = 10/254 (3%)

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
           LF   DI+ +   +  + + Q   ++G    I    + AGH+LG  ++ +   G  V+Y 
Sbjct: 3   LFDEKDINRSMDKIEVIDFHQTLDVNG----IKFWCYTAGHVLGAAMFMVDIAGVRVLYT 58

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVL 232
            DY+R +++HL    +  F     +I   Y    +QP   + + F D I  T+  GG VL
Sbjct: 59  GDYSREEDRHLRAAEMPQFSPDICIIESTYGVQLHQPRHLREKRFTDVIHSTISLGGRVL 118

Query: 233 LPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF 290
           +P  + GR  ELLLIL++YWA H    N PIY+ + ++   +   ++++  M + I   F
Sbjct: 119 IPAFALGRAQELLLILDEYWANHPELHNIPIYYASPLAKKCMTVYQTYILSMNERIRNQF 178

Query: 291 ETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
             S  N F  KH++ L +  +  +   GP +V+AS   L++G S  +F  W SD KN  +
Sbjct: 179 ANS--NPFKFKHISPLNSIEDFSDV--GPSVVMASPGGLQSGLSRQLFDMWCSDKKNACV 234

Query: 351 FTERGQFGTLARML 364
                  GTLA+ +
Sbjct: 235 LPGYVVEGTLAKTI 248


>gi|448517227|ref|XP_003867743.1| endoribonuclease [Candida orthopsilosis Co 90-125]
 gi|380352082|emb|CCG22306.1| endoribonuclease [Candida orthopsilosis]
          Length = 769

 Score =  127 bits (320), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 98/333 (29%), Positives = 169/333 (50%), Gaps = 25/333 (7%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLS----- 104
           S +D +L+SH    H  +LPY M+Q      VF   +T+ +YR  L+  + +  S     
Sbjct: 64  SKVDILLVSHFHVDHSASLPYVMQQSNFRGKVFMTHATKAIYRW-LMQDFVRVTSIGNSR 122

Query: 105 ----RRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVW 160
                      +L+T DDI  +F  +  +    ++H + + +GI    + AGH+LG  ++
Sbjct: 123 TEGGGGNDEGGNLYTDDDIFKSFDRIETI----DFHSTMEVDGIRFTAYYAGHVLGACMY 178

Query: 161 KITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQD 219
            I   G  V++  DY+R + +HL    +   V+P VLIT++        P+ + E    +
Sbjct: 179 LIEIGGLKVLFTGDYSREENRHLPSAEVPP-VKPDVLITESTFGTGTLEPKAELEKKLTN 237

Query: 220 AISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKS 277
            I  T+  GG VLLPV + G   ELLLIL++YW ++    N  +Y+ + ++   +   ++
Sbjct: 238 HIHATITKGGRVLLPVFALGNAQELLLILDEYWEKNEDLQNVSVYYCSDLARKCMAVYET 297

Query: 278 FLEWMGDSI--TKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSH 335
           +   M D I  + S + S+ N F  K++  + N S+  +   GP +V+A+   L+AG S 
Sbjct: 298 YTGIMNDKIRLSSSSDDSKSNPFDFKYIKSIRNLSKFSDL--GPSVVVATPGMLQAGVSR 355

Query: 336 DIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
            +  +WA + KNLV+ T     GT+A+ L  +P
Sbjct: 356 QLLEKWAPEQKNLVILTGYSVEGTMAKDLLKEP 388


>gi|156343760|ref|XP_001621104.1| hypothetical protein NEMVEDRAFT_v1g222359 [Nematostella vectensis]
 gi|156206741|gb|EDO29004.1| predicted protein [Nematostella vectensis]
          Length = 388

 Score =  127 bits (319), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 84/287 (29%), Positives = 145/287 (50%), Gaps = 16/287 (5%)

Query: 143 GIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAY 202
           GI    + AGH+LG  ++ +   G  ++Y  D++R++++HL    + S + P VLI ++ 
Sbjct: 83  GIKFWCYHAGHVLGACMFMLEIAGVKILYTGDFSRQEDRHLMAAEIPS-ISPDVLIIEST 141

Query: 203 NALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNY 259
              H    R++RE  F   +   +  GG  L+PV + GR  ELLLIL++YW  H    + 
Sbjct: 142 YGTHIHEKREEREARFTGTVHDIVNRGGRCLIPVFALGRAQELLLILDEYWQNHPELHDI 201

Query: 260 PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGP 319
           PIY+ + ++   +   ++++  M D I K    S  N F+ KH++ L +  + D+   GP
Sbjct: 202 PIYYASQLAKKCMSVFQTYVNAMNDKIKKQIAIS--NPFVFKHISNLKSIDQFDDI--GP 257

Query: 320 KLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLAR--MLQADPPPKAVKVTM 377
            +V+AS   +++G S ++F +W +D +N V+       GTLA+   L    PP    V +
Sbjct: 258 SVVMASPGMMQSGLSRELFEQWCTDRRNGVIIAGYCVEGTLAKEVSLVVHNPPNCQSVEL 317

Query: 378 SRRVPLVGEELIAYEEEQTRLKKEEA--LKASLVKEEESKASLGPDN 422
             R    GE++     +  R K E    L   L+K   +   + PD+
Sbjct: 318 YFR----GEKMAKVMGQMAREKPEHGKPLSGILIKRGFNYHLIAPDD 360


>gi|149641381|ref|XP_001505542.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-like, partial [Ornithorhynchus anatinus]
          Length = 595

 Score =  127 bits (319), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/274 (27%), Positives = 146/274 (53%), Gaps = 14/274 (5%)

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
           L+T  D++ +   +  +    N+H   +  GI    + AGH+LG  ++ I   G  ++Y 
Sbjct: 33  LYTETDLEESMDKIETI----NFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYT 88

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNV 231
            D++R++++HL    + + ++P +LI ++    H    R++RE  F + +   +  GG  
Sbjct: 89  GDFSRQEDRHLMAAEIPN-IKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRG 147

Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           L+PV + GR  ELLLIL++YW  H    + PIY+ + ++   +   ++++  M D I K 
Sbjct: 148 LIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQ 207

Query: 290 FETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLV 349
                +N F+ KH++ L +    D+   GP +V+AS   +++G S ++F  W +D +N V
Sbjct: 208 INI--NNPFVFKHISNLKSMDHFDDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGV 263

Query: 350 LFTERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
           +       GTLA+ + ++  P+ +     +++PL
Sbjct: 264 IIAGYCVEGTLAKHIMSE--PEEITTMSGQKLPL 295


>gi|444731702|gb|ELW72051.1| Cleavage and polyadenylation specificity factor subunit 3 [Tupaia
           chinensis]
          Length = 587

 Score =  127 bits (319), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/274 (27%), Positives = 146/274 (53%), Gaps = 14/274 (5%)

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
           L+T  D++ +   +  +    N+H   +  GI    + AGH+LG  ++ I   G  ++Y 
Sbjct: 25  LYTETDLEESMDKIETI----NFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYT 80

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNV 231
            D++R++++HL    + + ++P +LI ++    H    R++RE  F + +   +  GG  
Sbjct: 81  GDFSRQEDRHLMAAEIPN-IKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRG 139

Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           L+PV + GR  ELLLIL++YW  H    + PIY+ + ++   +   ++++  M D I K 
Sbjct: 140 LIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQ 199

Query: 290 FETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLV 349
                +N F+ KH++ L +    D+   GP +V+AS   +++G S ++F  W +D +N V
Sbjct: 200 INI--NNPFVFKHISNLKSMDHFDDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGV 255

Query: 350 LFTERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
           +       GTLA+ + ++  P+ +     +++PL
Sbjct: 256 IIAGYCVEGTLAKHIMSE--PEEITTMSGQKLPL 287


>gi|66356658|ref|XP_625507.1| cleavage and polyadenylation specifity factor protein, CPSF
           metallobeta-lactamase [Cryptosporidium parvum Iowa II]
 gi|46226496|gb|EAK87490.1| cleavage and polyadenylation specifity factor protein, CPSF
           metallobeta-lactamase [Cryptosporidium parvum Iowa II]
          Length = 780

 Score =  127 bits (318), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 100/365 (27%), Positives = 167/365 (45%), Gaps = 24/365 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQLG 79
           +VS  G + + DCG +  F      P+      STID  L++H    H GA PY +    
Sbjct: 41  VVSFKGRSVMFDCGIHPAFSGIGSLPVFDAIDVSTIDLCLITHFHLDHSGATPYFVSLTD 100

Query: 80  LSAPVFSTEPVYRLGLLTMYDQYLSRR-----------QVSEFDLFTLDDIDSAFQSVTR 128
            +  VF TEP   +  L   D     +            +S  +L+T  DI+ A      
Sbjct: 101 FNGKVFMTEPTKAICKLVWQDYARVNKFSAGSIESEEAPLSSINLYTEKDIEKAINMTEI 160

Query: 129 LTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVL 188
           + + Q   L    +GI  + + AGH+LG  ++ +   G  ++Y  DY+R  ++H+    +
Sbjct: 161 IDFRQQVEL----DGIRFSCYGAGHVLGACMFLVEIGGVRILYTGDYSREDDRHVPRAEI 216

Query: 189 ESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLI 247
              +   VLI ++        PR  RE  F   +   +   G  LLPV + GR  ELLLI
Sbjct: 217 PP-IDVHVLICESTYGTRIHEPRIDREKRFLGGVQSIITRKGKCLLPVFAIGRAQELLLI 275

Query: 248 LEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTL 305
           LE++W+      N PI + + +S   +   ++++   GDS+ +  +    N F   ++  
Sbjct: 276 LEEHWSRTPSIQNVPIIYASPMSIKCMRVFETYINQCGDSVRRQADLGI-NPFQFNYIKT 334

Query: 306 LINKSELDNA--PDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARM 363
           + + +E+ +     GP +V+A+   L+ G S DIF  WA D +N ++ T     GT A  
Sbjct: 335 VNSLNEIKDIIYNPGPCVVMAAPGMLQNGTSRDIFEIWAPDKRNGIILTGYAVRGTPAYE 394

Query: 364 LQADP 368
           L+ +P
Sbjct: 395 LRKEP 399


>gi|156096985|ref|XP_001614526.1| RNA-metabolising metallo-beta-lactamase domain containing protein
           [Plasmodium vivax Sal-1]
 gi|148803400|gb|EDL44799.1| RNA-metabolising metallo-beta-lactamase domain containing protein
           [Plasmodium vivax]
          Length = 911

 Score =  127 bits (318), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 91/347 (26%), Positives = 157/347 (45%), Gaps = 33/347 (9%)

Query: 44  LLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD--- 100
           L+  L ++   ID V++SH    H+GALP+  + L     +  + P   L  + + D   
Sbjct: 99  LINNLKRINEMIDCVIISHFHMDHIGALPFFTEILKYRGTILMSYPTKALSPILLLDGCR 158

Query: 101 -------QYLSRRQVS----EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGI----- 144
                  +    RQ+     + D     +I S  +    ++  Q Y   GK  G+     
Sbjct: 159 VADLKWEKQNFERQIKLLNEKSDELLNYNISSLKKDPWNISEEQIYSCIGKVVGLQINET 218

Query: 145 ------VVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLI 198
                  + P+ AGH+LG  ++KI  +   VIY  DYN   +KHL  T + S   P + I
Sbjct: 219 FQMGNMSITPYYAGHVLGACIFKIEVNNFSVIYTGDYNTVPDKHLGSTKIPSLT-PEIFI 277

Query: 199 TDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL 257
           +++  A + +P R+  E+   + + + +  GG VL+PV + GR  EL ++L+ YW +  +
Sbjct: 278 SESTYATYVRPTRKASELDLCNLVHECVHKGGKVLIPVFAIGRAQELSILLDSYWKKMKI 337

Query: 258 NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPD 317
           NYPIYF   ++ +   Y + +  W+  S      T + N F   +++  +N    +N   
Sbjct: 338 NYPIYFGCGLTENANKYYRIYSSWVNSSCV---STDKKNLFDFANISPFVNSYLGENR-- 392

Query: 318 GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML 364
            P ++ A+   L  G S   F  W+   KNL++       GT+   L
Sbjct: 393 -PMVLFATPGMLHTGLSLKAFKAWSGCSKNLIVLPGYCVQGTVGHKL 438


>gi|295657429|ref|XP_002789283.1| endoribonuclease ysh1 [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226283953|gb|EEH39519.1| endoribonuclease ysh1 [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 892

 Score =  126 bits (317), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 86/335 (25%), Positives = 160/335 (47%), Gaps = 25/335 (7%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           S++D +L+SH    H   LPY + +      VF T     +    + D        S  D
Sbjct: 79  SSVDILLISHFHLDHSAGLPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVSNTSSSSD 138

Query: 113 ----LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
               L+T ++  S    +  + ++  + ++     I + P  AGH+LG  ++ I+  G +
Sbjct: 139 QRTTLYTEEEHLSTLPQIEAIDFNTTHTINS----IRITPFPAGHVLGAAMFVISIAGLN 194

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRA 227
           +++  DY+R +++HL    +   ++  VLIT++   + + PPR +RE     +I+  L  
Sbjct: 195 ILFTGDYSREEDRHLISAEVPKGIKIDVLITESTFGISSNPPRLEREAALMKSITTILNR 254

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG VL+PV + GR  ELLLIL++YW+ H      PIY++  ++   I   ++++  M ++
Sbjct: 255 GGRVLMPVFALGRAQELLLILDEYWSRHPELQKIPIYYIGNMARKCIIVYQTYIGAMNEN 314

Query: 286 ITKSFETSRDNA------------FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGF 333
           I + F      A            +  ++V  + N    D+   G  ++LAS   L+ G 
Sbjct: 315 IKRVFRERMAEADAAGANSATAGPWNFRYVRSVKNIERFDDV--GGCVMLASPGMLQTGT 372

Query: 334 SHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           S ++   WA + +N V+ T     GT+ + +  +P
Sbjct: 373 SRELLERWAPNERNGVIMTGYSVEGTMGKQILNEP 407


>gi|156848581|ref|XP_001647172.1| hypothetical protein Kpol_1036p59 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156117856|gb|EDO19314.1| hypothetical protein Kpol_1036p59 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 821

 Score =  126 bits (317), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 131/530 (24%), Positives = 233/530 (43%), Gaps = 75/530 (14%)

Query: 22  LVSIDGFNFLIDCGWN----DHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQ 77
           +V  D    LID  WN     + D   ++  S +   +D +LLS P    LGA  Y+M  
Sbjct: 19  IVRFDNVTILIDPSWNGKNVSYADS--IKYWSTIIPEVDIILLSQPSLECLGA--YSMLY 74

Query: 78  LGLSA------PVFSTEPVYRLGLLTMYDQYLSRRQVS--EFDLFTLDDIDSAFQSVTRL 129
               +       V++T PV  LG +++ +QY     +   E +   L+DI+ +F ++  +
Sbjct: 75  YNFVSHFVSRIDVYATLPVSNLGRISVIEQYACAGIIGPYETNEMDLEDIEKSFDNIKTV 134

Query: 130 TYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVL- 188
            YSQ   L  K +G+ +  + +G   GG++W +    E ++YA  +N  K+  LNG  L 
Sbjct: 135 KYSQLVDLRSKFDGLTLVAYNSGVNAGGSIWCLLTYSEKLVYAPHWNHTKDTILNGAALL 194

Query: 189 -------ESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRV 241
                   + ++P  +IT           R++ + F D++ + L   G++++PVD  G+ 
Sbjct: 195 DNTGKPLSTLMKPTAIITSLGRFGSALSFRKRSKNFNDSLKRGLSNNGSIMIPVDITGKF 254

Query: 242 LELLLILEDYWAEHSLN-----YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN 296
           L+L + + ++  E+S +       +  + Y     + Y +S LEW+  S+ K++E SRDN
Sbjct: 255 LDLFVQVHNFLYENSKSGSYNQTHVLLIAYFRGKVLTYARSMLEWLSSSLMKTWE-SRDN 313

Query: 297 A--FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
           A  F +     +I+ SE+ N P G K+   S   +     +++  +  S  K  VL T  
Sbjct: 314 ASPFDIGSKFKVIDPSEISNFP-GSKVCFVSQVDI---LLNEVLTKLCSMNKTTVLMTST 369

Query: 355 GQFGTL-----------ARMLQADPPPKAVKVT------MSRRVPLVGEELIAYEEEQTR 397
               T            A+ LQ       +  T      ++   PLV E+L   EE   R
Sbjct: 370 NTNNTQILETMYEKWEKAKTLQKLQDGSTISFTDTVLLKIASYKPLVNEQL---EEYNAR 426

Query: 398 LK-KEEALKAS---LVKEEESKASLGPDNNLSGDPMVIDANNANASA------DVVEPHG 447
           LK + +  K +   L KE +    +G      G  ++   N+           +++    
Sbjct: 427 LKERRDKCKETVEILKKEAKLGTRIGDMYRSEGVGLIHSLNDEEDEDEDEEEENILNSTS 486

Query: 448 GRYR------DILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYI 491
            + +      DI I+     +TS   MFPF    ++ DD+G V++ + +I
Sbjct: 487 SQTKSFTVPVDIKIN---RSATSKHKMFPFQPGRTKIDDYGSVVDFNMFI 533


>gi|119185911|ref|XP_001243562.1| hypothetical protein CIMG_03003 [Coccidioides immitis RS]
 gi|392870265|gb|EJB11994.1| endoribonuclease ysh1 [Coccidioides immitis RS]
          Length = 881

 Score =  126 bits (317), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 90/344 (26%), Positives = 158/344 (45%), Gaps = 19/344 (5%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H  ALPY + +      VF T     +    + D        S  D
Sbjct: 75  STVDVLLVSHFHLDHSAALPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVSNTSSSSD 134

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
             T    +    S   L  + +++ +     I + P  AGH+LG  ++ I+  G ++++ 
Sbjct: 135 QRTTLYTEQDHLSTLPLIEAIDFNTTHTINSIRITPFPAGHVLGAAMFLISIAGLNILFT 194

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNV 231
            DY+R +++HL    +   ++  VLI ++   + + PPR +RE     +++  L  GG V
Sbjct: 195 GDYSREEDRHLVSAEVPKGIKIDVLIAESTFGISSNPPRLERETALMKSVTSVLNRGGRV 254

Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           L+PV + GR  ELLLIL++YW  H      PIY++  ++   +   ++++  M D+I + 
Sbjct: 255 LMPVFALGRAQELLLILDEYWGRHPELQKIPIYYIGNMARRCMVVYQTYIGAMNDNIKRL 314

Query: 290 FETSRDNA------------FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
           F      A            +  K V  + N    D+   G  ++LAS   L+ G S ++
Sbjct: 315 FRQRMAEAEARGDKSTTAGPWDFKFVRSVRNLERFDDV--GGCVMLASPGMLQTGTSREL 372

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRV 381
              WA   +N V+ T     GT+ + +  +  P+ +   MS R 
Sbjct: 373 LERWAPSERNGVIMTGYSVEGTMGKQILNE--PEQIPAVMSART 414


>gi|320032162|gb|EFW14117.1| cleavage and polyadenylation specificity factor [Coccidioides
           posadasii str. Silveira]
          Length = 881

 Score =  126 bits (317), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 90/344 (26%), Positives = 158/344 (45%), Gaps = 19/344 (5%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H  ALPY + +      VF T     +    + D        S  D
Sbjct: 75  STVDVLLVSHFHLDHSAALPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVSNTSSSSD 134

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
             T    +    S   L  + +++ +     I + P  AGH+LG  ++ I+  G ++++ 
Sbjct: 135 QRTTLYTEQDHLSTLPLIEAIDFNTTHTINSIRITPFPAGHVLGAAMFLISIAGLNILFT 194

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNV 231
            DY+R +++HL    +   ++  VLI ++   + + PPR +RE     +++  L  GG V
Sbjct: 195 GDYSREEDRHLVSAEVPKGIKIDVLIAESTFGISSNPPRLERETALMKSVTSVLNRGGRV 254

Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           L+PV + GR  ELLLIL++YW  H      PIY++  ++   +   ++++  M D+I + 
Sbjct: 255 LMPVFALGRAQELLLILDEYWGRHPELQKIPIYYIGNMARRCMVVYQTYIGAMNDNIKRL 314

Query: 290 FETSRDNA------------FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
           F      A            +  K V  + N    D+   G  ++LAS   L+ G S ++
Sbjct: 315 FRQRMAEAEARGDKSTTAGPWDFKFVRSVRNLERFDDV--GGCVMLASPGMLQTGTSREL 372

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRV 381
              WA   +N V+ T     GT+ + +  +  P+ +   MS R 
Sbjct: 373 LERWAPSERNGVIMTGYSVEGTMGKQILNE--PEQIPAVMSART 414


>gi|303323846|ref|XP_003071912.1| metallo-beta-lactamase superfamily protein [Coccidioides posadasii
           C735 delta SOWgp]
 gi|240111619|gb|EER29767.1| metallo-beta-lactamase superfamily protein [Coccidioides posadasii
           C735 delta SOWgp]
          Length = 881

 Score =  126 bits (316), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 90/344 (26%), Positives = 158/344 (45%), Gaps = 19/344 (5%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H  ALPY + +      VF T     +    + D        S  D
Sbjct: 75  STVDVLLVSHFHLDHSAALPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVSNTSSSSD 134

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
             T    +    S   L  + +++ +     I + P  AGH+LG  ++ I+  G ++++ 
Sbjct: 135 QRTTLYTEQDHLSTLPLIEAIDFNTTHTINSIRITPFPAGHVLGAAMFLISIAGLNILFT 194

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNV 231
            DY+R +++HL    +   ++  VLI ++   + + PPR +RE     +++  L  GG V
Sbjct: 195 GDYSREEDRHLVSAEVPKGIKIDVLIAESTFGISSNPPRLERETALMKSVTSVLNRGGRV 254

Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           L+PV + GR  ELLLIL++YW  H      PIY++  ++   +   ++++  M D+I + 
Sbjct: 255 LMPVFALGRAQELLLILDEYWGRHPELQKIPIYYIGNMARRCMVVYQTYIGAMNDNIKRL 314

Query: 290 FETSRDNA------------FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
           F      A            +  K V  + N    D+   G  ++LAS   L+ G S ++
Sbjct: 315 FRQRMAEAEARGDKSTTAGPWDFKFVRSVRNLERFDDV--GGCVMLASPGMLQTGTSREL 372

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRV 381
              WA   +N V+ T     GT+ + +  +  P+ +   MS R 
Sbjct: 373 LERWAPSERNGVIMTGYSVEGTMGKQILNE--PEQIPAVMSART 414


>gi|390602470|gb|EIN11863.1| Metallo-hydrolase/oxidoreductase, partial [Punctularia
           strigosozonata HHB-11173 SS5]
          Length = 721

 Score =  126 bits (316), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 93/324 (28%), Positives = 163/324 (50%), Gaps = 14/324 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLS---APVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+D +L++H    H  +L Y M++         V+ T P   +    M D ++     S
Sbjct: 57  STVDVLLITHFHLDHAASLTYIMEKTNFRDGHGKVYMTHPTKAVYKFMMQD-FVRMSSSS 115

Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
              LF+  D+  +  S+  ++  Q   L     GI   P+ AGH+LG  ++ I   G  +
Sbjct: 116 SDALFSPLDLSMSLSSIIPVSAHQ---LITPFPGISFTPYHAGHVLGACMFLIDIAGLKI 172

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAG 228
           +Y  DY+R +++HL    L   +RP VLI ++   + +   R+++E  F + +   ++ G
Sbjct: 173 LYTGDYSREEDRHLVKAELPP-IRPDVLIAESTWGVQSGDSREEKEARFTNIVHSIIKRG 231

Query: 229 GNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSI 286
           G+VL+P  + GR  ELLLIL++YW++H    N PIY+ + ++   +   ++++  M  +I
Sbjct: 232 GHVLMPTFAIGRAQELLLILDEYWSKHPELHNVPIYYASSLARKCMAVYQTYIHTMNSNI 291

Query: 287 TKSFETSRDNAFLLKHVTLLINKS--ELDNAPDGPKLVLASMASLEAGFSHDIFVEWASD 344
              F   RDN F+ KH++        E   A   P ++LAS   L++G S ++    A D
Sbjct: 292 RSRF-AKRDNPFVFKHISHAPQNRGWERKLAEGPPCVILASPGMLQSGPSRELLELLAPD 350

Query: 345 VKNLVLFTERGQFGTLARMLQADP 368
            +N ++ T     GT AR +  +P
Sbjct: 351 SRNGLVLTGYSVEGTPARDIINEP 374


>gi|339244969|ref|XP_003378410.1| putative metallo-beta-lactamase domain protein [Trichinella
           spiralis]
 gi|316972680|gb|EFV56345.1| putative metallo-beta-lactamase domain protein [Trichinella
           spiralis]
          Length = 562

 Score =  126 bits (316), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 102/373 (27%), Positives = 166/373 (44%), Gaps = 55/373 (14%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           +++ PL           LV+I G N ++DCG +  F       D S +    K+   ID 
Sbjct: 4   IKIVPLGAGQEVGRSCILVTIGGKNVMLDCGMHMGFNDERRFPDFSYITQKGKLDDFIDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD---LF 114
           V++SH    H GALPY  + +G + P++ T P   +  + + D    + QV   +   +F
Sbjct: 64  VIISHFHLDHCGALPYMTEMVGYNGPIYMTIPTKAIVPVLLED--FRKVQVKYRNDPFIF 121

Query: 115 TLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVD 174
           T + I      V  ++  +                    L+G                 D
Sbjct: 122 TSNMIKDCMNKVKTISLHE-------------------ELMG-----------------D 145

Query: 175 YNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLL 233
           +N   ++HL    ++   RP VLI+++  A   +  ++ RE  F   +   +  GG VL+
Sbjct: 146 FNMTPDRHLGPAEIDR-CRPDVLISESTYATTIRDSKRARERDFLKKVHDCINNGGKVLI 204

Query: 234 PVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
           PV + GR  EL ++LE YW   +L+ PIY    ++   +DY K F+ W  + I K+F   
Sbjct: 205 PVFALGRAQELCILLESYWERMNLSIPIYVSKGMAEKAVDYYKLFVTWTSEKIKKTF--V 262

Query: 294 RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
           + N F  KHV  L  +    + P GP +V A+   L +G S  IF +WA++ KN+V+   
Sbjct: 263 KRNMFDFKHV--LPFEDSFADTP-GPMVVFATPGMLHSGQSLKIFKKWATNEKNMVIMPG 319

Query: 354 RGQFGTLARMLQA 366
               GT+   L A
Sbjct: 320 YCVQGTVGSKLIA 332


>gi|443926404|gb|ELU45071.1| mRNA 3'-end-processing protein YSH1 [Rhizoctonia solani AG-1 IA]
          Length = 409

 Score =  126 bits (316), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 80/262 (30%), Positives = 135/262 (51%), Gaps = 10/262 (3%)

Query: 112 DLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIY 171
            L+T  D+  +   +  ++  Q   L     G+   P+ AGH+LG  ++ I   G  ++Y
Sbjct: 86  SLYTPLDVSLSLSHIIPISAHQ---LISPTPGLSFTPYHAGHVLGACMFLIDIAGLQILY 142

Query: 172 AVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGN 230
             DY+R +++HL    L   +RP +LI ++   +     R+ RE  F  ++   ++ GG+
Sbjct: 143 TGDYSREEDRHLVRAELPP-IRPDLLIVESTYGVQGHEARESREARFTSSVHTIVKRGGH 201

Query: 231 VLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITK 288
           VLLPV + GR  ELLLIL++YWA H      P+Y+ + ++   +   ++++  M   I  
Sbjct: 202 VLLPVFALGRAQELLLILDEYWAAHPELHGVPVYYASNLARKCMAVYQTYIHTMNSHIRS 261

Query: 289 SFETSRDNAFLLKHVTLL--INKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVK 346
            F   +DN F+ KH++ L      E   A  GP ++LAS   + +G S ++   WA D K
Sbjct: 262 RF-ARKDNPFVFKHISHLPATRGWERKIAEAGPCVILASPGFMSSGPSRELLELWAPDAK 320

Query: 347 NLVLFTERGQFGTLARMLQADP 368
           N V+ T     GT+AR +  +P
Sbjct: 321 NGVIITGYSIEGTMARDIILEP 342


>gi|226295077|gb|EEH50497.1| endoribonuclease ysh1 [Paracoccidioides brasiliensis Pb18]
          Length = 888

 Score =  126 bits (316), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 88/347 (25%), Positives = 165/347 (47%), Gaps = 27/347 (7%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           S++D +L+SH    H   LPY + +      VF T     +    + D        S  D
Sbjct: 75  SSVDILLISHFHLDHSAGLPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVSNTSSSSD 134

Query: 113 ----LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
               L+T ++  S    +  + ++  + ++     I + P  AGH+LG  ++ I+  G +
Sbjct: 135 QRTTLYTEEEHLSTLPQIEAIDFNTTHTINS----IRITPFPAGHVLGAAMFLISIAGLN 190

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRA 227
           +++  DY+R +++HL    +   ++  VLIT++   + + PPR +RE     +I+  L  
Sbjct: 191 ILFTGDYSREEDRHLISAEVPKGIKIDVLITESTFGISSNPPRLEREAALMKSITTILNR 250

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG VL+PV + GR  ELLLIL++YW+ H      PIY++  ++   I   ++++  M ++
Sbjct: 251 GGRVLMPVFALGRAQELLLILDEYWSRHPELQKIPIYYIGNMARKCIIVYQTYIGAMNEN 310

Query: 286 ITKSFETSRDNA------------FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGF 333
           I + F      A            +  ++V  + N    D+   G  ++LAS   L+ G 
Sbjct: 311 IKRVFRERMAEADAAGANSATAGPWNFRYVRSVKNIERFDDV--GGCVMLASPGMLQTGT 368

Query: 334 SHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRR 380
           S ++   WA + +N ++ T     GT+ + +  +  P+ +   MS R
Sbjct: 369 SRELLERWAPNERNGIIMTGYSVEGTMGKQILNE--PEQIPAVMSGR 413


>gi|225677757|gb|EEH16041.1| endoribonuclease ysh1 [Paracoccidioides brasiliensis Pb03]
          Length = 888

 Score =  126 bits (316), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 88/347 (25%), Positives = 165/347 (47%), Gaps = 27/347 (7%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           S++D +L+SH    H   LPY + +      VF T     +    + D        S  D
Sbjct: 75  SSVDILLISHFHLDHSAGLPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVSNTSSSSD 134

Query: 113 ----LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
               L+T ++  S    +  + ++  + ++     I + P  AGH+LG  ++ I+  G +
Sbjct: 135 QRTTLYTEEEHLSTLPQIEAIDFNTTHTINS----IRITPFPAGHVLGAAMFLISIAGLN 190

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRA 227
           +++  DY+R +++HL    +   ++  VLIT++   + + PPR +RE     +I+  L  
Sbjct: 191 ILFTGDYSREEDRHLISAEVPKGIKIDVLITESTFGISSNPPRLEREAALMKSITTILNR 250

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG VL+PV + GR  ELLLIL++YW+ H      PIY++  ++   I   ++++  M ++
Sbjct: 251 GGRVLMPVFALGRAQELLLILDEYWSRHPELQKIPIYYIGNMARKCIIVYQTYIGAMNEN 310

Query: 286 ITKSFETSRDNA------------FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGF 333
           I + F      A            +  ++V  + N    D+   G  ++LAS   L+ G 
Sbjct: 311 IKRVFRERMAEADAAGANSATAGPWNFRYVRSVKNIERFDDV--GGCVMLASPGMLQTGT 368

Query: 334 SHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRR 380
           S ++   WA + +N ++ T     GT+ + +  +  P+ +   MS R
Sbjct: 369 SRELLERWAPNERNGIIMTGYSVEGTMGKQILNE--PEQIPAVMSGR 413


>gi|403216796|emb|CCK71292.1| hypothetical protein KNAG_0G02340 [Kazachstania naganishii CBS
           8797]
          Length = 823

 Score =  125 bits (315), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 126/555 (22%), Positives = 246/555 (44%), Gaps = 81/555 (14%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLL------QPLSKVASTIDAVLLSHPDTLHLGA--LPY 73
           L+  D    L+D GW     P L+      +  S + + +D ++LS P    LGA  L Y
Sbjct: 19  LIKFDNVTILLDPGWF----PGLVSVDDTVKYWSNIIADVDIIILSQPTKECLGAYSLLY 74

Query: 74  A--MKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEF--DLFTLDDIDSAFQSVTRL 129
              +        V++T P+  LG +   D Y S+  +  +  ++  +DD++ +F  +  L
Sbjct: 75  VNFLSHFISRIEVYATLPIANLGRVATIDLYASQGVIGPYLSNIMDVDDVEKSFDCIKTL 134

Query: 130 TYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN----- 184
            YSQ   L  K EG+    + +G   GG++W I+   E ++Y   +N  K   LN     
Sbjct: 135 KYSQVVDLRYKFEGLTFVAYNSGSAPGGSIWCISTYVEKLVYVKRWNHTKNNLLNAASIW 194

Query: 185 ---GTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRV 241
              G  + +  +P+ +IT        +P R++ + F+D ++++L++ G++L+PVD  G  
Sbjct: 195 DSGGKPISALSKPSAIITTFDKLGSTKPLRRRTKEFRDILTRSLQSSGSLLIPVDIGGDF 254

Query: 242 LEL------LLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           L L      +L+     +    N PI F++Y    T+ Y KS LEW      K++ET +D
Sbjct: 255 LNLFVSVQSILLTTHRGSRKYGNIPILFISYARGRTLTYAKSMLEWFSSESMKNWET-KD 313

Query: 296 NA--FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF-- 351
           N   F + +    I+ +EL   P G K+ L S  +++A  +  I   + ++  N++L   
Sbjct: 314 NQSPFDIDNRLHFISPNELSKYP-GSKICLVS--NMDALLNETILKLYKTENLNVILTDG 370

Query: 352 --TERGQFGTL-----------ARMLQAD--PPPKAVKVTMSRRVPLVGEELIAYEEEQT 396
             ++     T+           + +L+ D  P  + V + +  +  L  + L  ++ +  
Sbjct: 371 FDSDATMISTMLQKWNKSCLDNSNILEGDMLPFSQTVPIKVWTKQALKSDALDTFKNQIE 430

Query: 397 RLKKEEALKASLVKEEESKASLGP--DNNLSGDPMVIDANN------------------- 435
           + + E + K + +K +   ++ GP  D  ++G+  +    N                   
Sbjct: 431 KRRLERSEKEATLKRDAKTSANGPAADAAMNGNGSLAVGQNGIGINDDDDDDDDDNDVLS 490

Query: 436 ANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVIN------PDD 489
           A  S       G ++ +  +D ++  + S   MF F     + DD+G +++       DD
Sbjct: 491 ARKSDGKNNSKGAKFMEPPVDLYLNEN-SKQKMFLFNPKREKRDDYGIMVDFSMFAPKDD 549

Query: 490 YIIKDEDMDQAAMHI 504
            I++  D++ ++  +
Sbjct: 550 EIVETSDVNISSKEV 564


>gi|258578481|ref|XP_002543422.1| predicted protein [Uncinocarpus reesii 1704]
 gi|237903688|gb|EEP78089.1| predicted protein [Uncinocarpus reesii 1704]
          Length = 875

 Score =  125 bits (315), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 89/343 (25%), Positives = 160/343 (46%), Gaps = 19/343 (5%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H  ALPY + +      +F T     +    + D        S  D
Sbjct: 75  STVDVLLVSHFHLDHSAALPYVLSKTNFKGRIFMTHATKAIYKWLIQDNVRVSNTSSSSD 134

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
             T    +    S   L  + +++ +     I + P  AGH+LG  ++ I+  G ++++ 
Sbjct: 135 QRTTLYTEQDHLSTLPLIEAIDFNTTHTINSIRITPFPAGHVLGAAMFLISIAGLNILFT 194

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNV 231
            DY+R +++HL    +   ++  VLI ++   + + PPR +RE     +++  L  GG V
Sbjct: 195 GDYSREEDRHLISAEVPKGIKIDVLIAESTFGISSSPPRLERETALMKSVTSILNRGGRV 254

Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFL------------TYVSSSTIDYVKS 277
           L+PV + GR  ELLLIL++YW+ H      PI+++            TY+ +   +  + 
Sbjct: 255 LMPVFALGRAQELLLILDEYWSRHPDLQKVPIFYIGNMARRCMVVYQTYIGAMNDNIKRL 314

Query: 278 FLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
           F E M ++  K  +++    +  K V  + N    D+   G  ++LAS   L+ G S ++
Sbjct: 315 FRERMAEAEAKGDKSTTAGPWDFKFVRSVRNLERFDDV--GGCVMLASPGMLQTGTSREL 372

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRR 380
              WA   +N V+ T     GT+ + +  +  P+ +   MS R
Sbjct: 373 LERWAPSERNGVIMTGYSVEGTMGKQILNE--PEQIPAVMSAR 413


>gi|353239750|emb|CCA71648.1| related to YSH1-component of pre-mRNA polyadenylation factor PF I
           [Piriformospora indica DSM 11827]
          Length = 756

 Score =  125 bits (314), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 94/327 (28%), Positives = 162/327 (49%), Gaps = 20/327 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQL------GLSAPVFSTEPVYRLGLLTMYDQYLSRR 106
           ST+D +L++H    H   L Y M++       G      +T+ VY+     +   +L   
Sbjct: 56  STVDVILITHFHLDHAAGLTYIMEKTNFREGKGKVYMTLATKAVYKF----IMQDFLRMS 111

Query: 107 QVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG 166
             S   LF+  D   +F S+  +   Q   +     GI   P+ AGH+LG  ++ I   G
Sbjct: 112 SSSTEPLFSPLDFSMSFSSIITVAAHQ---VIVPCPGISFTPYHAGHVLGACMFLIDIAG 168

Query: 167 EDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTL 225
             V+Y  DY+R +++HL    + S +RP VLI ++   +        RE  F D ++  +
Sbjct: 169 LKVLYTGDYSREEDRHLVQAQVPS-IRPDVLICESTYGVQKHEELSGREKRFVDLVTAVV 227

Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMG 283
           + GG+VLLP  + GR  E+LLILE++W+ +      PIY+++ ++   +   ++ +  M 
Sbjct: 228 KRGGHVLLPAFALGRAQEILLILEEHWSRNPDLHGVPIYYVSSLAKKCMAVYQTNISSMN 287

Query: 284 DSITKSFETSRDNAFLLKHVTLL--INKSELDNAPDGPKLVLASMASLEAGFSHDIFVEW 341
             I + ++  ++N F+ K++T L     +E   A   P +VLAS   ++ G S ++   W
Sbjct: 288 SKIQERWK-KQENPFVFKYITNLPQTRGAEKKVAEGPPCVVLASPGFMDNGSSRELLELW 346

Query: 342 ASDVKNLVLFTERGQFGTLARMLQADP 368
           A D +N V+ T     GT+AR +Q  P
Sbjct: 347 APDPRNAVIVTGYSVEGTMARDIQNSP 373


>gi|354543719|emb|CCE40441.1| hypothetical protein CPAR2_104770 [Candida parapsilosis]
          Length = 776

 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 99/335 (29%), Positives = 170/335 (50%), Gaps = 26/335 (7%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRL---GLLTMYDQYLSRR 106
           S +D +L+SH    H  +LPY M+Q      VF   +T+ +YR      + +     SR 
Sbjct: 64  SKVDILLVSHFHVDHSASLPYVMQQSNFRGKVFMTHATKAIYRWLMQDFVRVTSIGNSRT 123

Query: 107 Q----VSEFD----LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGT 158
           +     S  D    ++T DDI  +F  +  +    ++H + + +GI    + AGH+LG  
Sbjct: 124 EGGGSTSSNDEGGNIYTDDDIFKSFDRIETI----DFHSTMEVDGIRFTAYYAGHVLGAC 179

Query: 159 VWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-F 217
           ++ I   G  +++  DY+R + +HL    +   V+P VLIT++        PR + E   
Sbjct: 180 MYLIEIGGLKILFTGDYSREENRHLPSAEVPP-VKPDVLITESTFGTGTLEPRAELETKL 238

Query: 218 QDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYV 275
            + I  TL  GG VLLPV + G   ELLLIL++YW ++    N  +Y+ + ++   +   
Sbjct: 239 TNHIHATLTKGGRVLLPVFALGNAQELLLILDEYWEKNEDLQNVSVYYCSDLARKCMAVY 298

Query: 276 KSFLEWMGDSI--TKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGF 333
           +++   M D I  + S + S+ + F  K++  + N S+  +   GP +V+A+   L+AG 
Sbjct: 299 ETYTGIMNDKIRLSSSSDDSKSSPFDFKYIKSIRNLSKFSDL--GPSVVVATPGMLQAGV 356

Query: 334 SHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           S  +  +WA + KNLV+ T     GT+A+ L  +P
Sbjct: 357 SRQLLEKWAPEQKNLVILTGYSVEGTMAKDLLKEP 391


>gi|299116292|emb|CBN76100.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 752

 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 79/238 (33%), Positives = 130/238 (54%), Gaps = 14/238 (5%)

Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 193
           ++H   + EGI    + AGH+LG  ++ I   G  V+Y  DY+   ++HL    + S   
Sbjct: 37  DFHQVLEHEGIKFWCYNAGHVLGAAMFMIEIAGVHVLYTGDYSMEADRHLMAAEMPS-TS 95

Query: 194 PAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 252
           P VLI ++   +    PR++RE  F   +SK ++ GG  L+PV + GR  ELLLIL++YW
Sbjct: 96  PDVLIVESTYGVQVHEPRKERESRFVGTVSKAVKKGGRCLIPVFALGRAQELLLILDEYW 155

Query: 253 AEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKS 310
            +H    + PIY+ + ++S      K+++  M + I +  + +  N F  +H+T L +  
Sbjct: 156 QQHRELHHIPIYYASRLAS------KTYINMMNEHIRQQMDVA--NPFKFQHITNLKSID 207

Query: 311 ELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           + D++  GP +V+AS   L++G S  +F  W +D KN VL       GTLA+ L + P
Sbjct: 208 QFDDS--GPSVVMASPGMLQSGVSRMLFDRWCTDDKNSVLIPGYSVEGTLAKKLLSMP 263


>gi|395828536|ref|XP_003787428.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
           specificity factor subunit 3 [Otolemur garnettii]
          Length = 634

 Score =  124 bits (312), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 73/255 (28%), Positives = 137/255 (53%), Gaps = 12/255 (4%)

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
           L+T  D++ +   +  +    N+H   +  GI    + AGH+LG  ++ I   G  ++Y 
Sbjct: 121 LYTETDLEESMDKIETI----NFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYT 176

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNV 231
            D++R++++HL    + + ++P +LI ++    H    R++RE  F + +   +  GG  
Sbjct: 177 GDFSRQEDRHLMAAEIPN-IKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRG 235

Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           L+PV + GR  ELLLIL++YW  H    + PIY+ + ++   +   ++++  M D I K 
Sbjct: 236 LIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQ 295

Query: 290 FETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLV 349
                +N F+ KH++ L +    D+   GP +V+AS   +++G S ++F  W +D +N V
Sbjct: 296 INI--NNPFVFKHISNLKSMDHFDDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGV 351

Query: 350 LFTERGQFGTLARML 364
           +       GTLA++L
Sbjct: 352 IIAGYCVEGTLAKIL 366


>gi|361125691|gb|EHK97723.1| putative Cleavage factor two protein 2 [Glarea lozoyensis 74030]
          Length = 835

 Score =  124 bits (312), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 118/444 (26%), Positives = 186/444 (41%), Gaps = 105/444 (23%)

Query: 152 GHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLIT 199
           GH LGGT+W+I    E ++YAVD+N+ +E  L+G             V+E   +P  LI 
Sbjct: 61  GHTLGGTIWQIQAGLESIVYAVDWNQSRENILSGAAWLGGAGGGGAEVIEQLRKPTALIC 120

Query: 200 DAYNALH---NQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS 256
            +            +++ E+  D I   +  GG VL+P DS+ RVLEL  +LE  W E +
Sbjct: 121 SSKGGEKVAIAGGKKKRDELLLDNIKSCVSKGGIVLIPTDSSARVLELAYLLEHAWREDA 180

Query: 257 -------LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE-----TSRDN-------- 296
                  ++   Y  +    +T+ Y +S LEWM +SI + FE       +D+        
Sbjct: 181 ESDDSTLMSARPYLASKNIQATMRYARSMLEWMDESIVREFEAVAGQNKQDDDPDAKLRG 240

Query: 297 ---AFLLKHVTLLINKSELD--------NAPDGPKLVLASMASLEAGFSHDIFVEWASDV 345
               F  KH+ LL  KS++D        +     K++LAS  SLE GFS ++F     D 
Sbjct: 241 IGGPFDFKHLRLLERKSQIDKIMQEVDNHGRSIGKVILASDTSLEWGFSKEVFRRICDDR 300

Query: 346 KNLVLFTER-GQ-------FGTLARML-----------------------QADPPPKAVK 374
           +NLV+FTER GQ        G +AR L                       Q     + ++
Sbjct: 301 RNLVIFTERMGQPKMENPKLG-MARTLWSWWEDRSDGVATETAASGDVLEQVYGGGRQLE 359

Query: 375 VTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASL--------------GP 420
           +  + RV L G++L AY+      ++ +  +A      ES A +                
Sbjct: 360 MRETTRVALEGDDLAAYQNWLATQRQLQTTQAGGATSLESSADMIDDAVSDSSDSDDDDE 419

Query: 421 DNNLSGDPMVIDANNANASADVVEPHGGRYRDILI----------DGFVPPSTSVAPMFP 470
           +N   G  + I A    A+   +   G    D+ I          D  V        MFP
Sbjct: 420 ENEQQGKALNISATMGQANRKKI---GLTDEDLGINILLRKKGVYDYDVRGKKGREKMFP 476

Query: 471 FYENNSEWDDFGEVINPDDYIIKD 494
                   D++GE++ P+D++++D
Sbjct: 477 LVVRRKRTDEYGELVRPEDFVMQD 500


>gi|255718601|ref|XP_002555581.1| KLTH0G12606p [Lachancea thermotolerans]
 gi|238936965|emb|CAR25144.1| KLTH0G12606p [Lachancea thermotolerans CBS 6340]
          Length = 816

 Score =  124 bits (312), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 130/531 (24%), Positives = 224/531 (42%), Gaps = 67/531 (12%)

Query: 22  LVSIDGFNFLIDCGWNDH--FDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAM---K 76
           ++  +    ++D  W     +    +    ++ +  D +LLS P    LGA  YAM   K
Sbjct: 19  VLRFENVTIMVDPAWEGRGSWSSEQIDFWGELVAQADIILLSQPTAEFLGA--YAMLYFK 76

Query: 77  QLG---LSAPVFSTEPVYRLGLLTMYDQYLSRRQVS--EFDLFTLDDIDSAFQSVTRLTY 131
            LG       VF+T PV  LG +T  D Y S+  V   + +   L+DI+ AF  V  + +
Sbjct: 77  FLGHFKTRIAVFATLPVANLGRVTTLDLYASQGLVGPVQTNALDLNDIEEAFDHVITVKH 136

Query: 132 SQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN------- 184
           SQ   L  K +G+ V P+ +G+  GG+++ IT   + +IYA  +N  K+  LN       
Sbjct: 137 SQILDLKSKYDGLTVIPYSSGYAPGGSIFCITTYSDKIIYAPRWNHTKDTILNSAAVLNS 196

Query: 185 -GTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLE 243
            G    S +RP+ ++T       + P +++   F++ + + L   G  ++P D  G+ L+
Sbjct: 197 SGKPTPSMMRPSAVVTTTARIGSSVPYKKRAARFKELLREALPKNGTAIIPTDIGGKFLD 256

Query: 244 LLLILEDYWAEHSLN-----YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA- 297
           LL+++ DY  E   N       +  ++Y    T+ Y +S LEW+  SI K +E   + + 
Sbjct: 257 LLVLVHDYLYEMKQNRNQSDVSVLLVSYSRGRTLTYARSMLEWLSPSIVKVWEGRNNRSP 316

Query: 298 FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQF 357
           F       +++  EL     G K+   S         + +     S  +  V+ TE    
Sbjct: 317 FDFGSRLKIVSPEELKRY-SGSKICFVSRVD---RLINAVVQTLCSSERTTVILTEPLVL 372

Query: 358 GTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEE-QTRLKKEEALKASLVKEEESK- 415
            + +  + A    K  +   ++    +    ++Y E    +  K E LK+  ++E ESK 
Sbjct: 373 QSESSKVLAAMHSKWARANKAQDSRALNNRHVSYSENVAIQTAKTEPLKSQDLQEFESKI 432

Query: 416 -----------ASLGPDNNLSGD--------------------PMVIDANNANASADVVE 444
                      + L  +  + GD                    P  I A N  +S  V +
Sbjct: 433 EIRRREHKDLLSKLETETAVVGDMSSNGGMLDVAEEEEDEDDIPDFITAVNRKSSRSVTK 492

Query: 445 PHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKDE 495
           P      DI I     P      MFPF+    + DD+G+V++   +I KD+
Sbjct: 493 PIEIPV-DIHIQSDAQPRHK---MFPFHAMKVKKDDYGDVVDFTQFIPKDQ 539


>gi|389601462|ref|XP_001565522.2| putative cleavage and polyadenylation specificity factor
           [Leishmania braziliensis MHOM/BR/75/M2904]
 gi|322505052|emb|CAM39016.2| putative cleavage and polyadenylation specificity factor
           [Leishmania braziliensis MHOM/BR/75/M2904]
          Length = 829

 Score =  124 bits (311), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 134/476 (28%), Positives = 216/476 (45%), Gaps = 52/476 (10%)

Query: 4   SVQVTPLSGVFNEN-PLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSH 62
           S+++T +      N P +YLV IDG   L DCGWN+ FD S L  L    +T+ AV+LS 
Sbjct: 8   SIRLTSVYECTTPNAPYAYLVEIDGVRILFDCGWNEEFDTSFLAKLKPYLATVHAVILSS 67

Query: 63  PDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDD---- 118
           P     GALP+ +  +     V +     ++G+ ++   +L   Q      FTL D    
Sbjct: 68  PHITACGALPFVLTHIAPGTFVAAAGATSKIGVHSVLHSFLY--QYPNSHTFTLADGEGF 125

Query: 119 ---IDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHV--AGHLLGGTVWKITKDGEDVIYAV 173
              +DS + S   L       ++ K E + V      AG +LGG  W I    +++ Y  
Sbjct: 126 TMTVDSIYHSFRSLREPYGGKVTVKNEDVEVNCFAVFAGRMLGGYSWTIKYQIDELFYCP 185

Query: 174 DYNRRKEKHLNGTVLESFVRPA----VLITD--AYNALHNQPPR---QQREMFQDAISKT 224
           D++ +         L+SF  P     VL++    +  + N+  +   Q + +F++ +  T
Sbjct: 186 DFSVKP-----SYALKSFDVPTTANIVLVSSFPFHMTVSNRTTKYEEQLKSLFKE-LQHT 239

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLN-YPIYFLTYVSSSTIDYVKSFLEWMG 283
           LR G +VL+PV+ AGR LE+L IL    AE   + Y +  +   +   +D   +  E + 
Sbjct: 240 LRGGSDVLVPVNVAGRGLEVLNILVHLLAEQGGDKYKVVLVAAQAQELLDKAGTMTEALQ 299

Query: 284 DSITKSFETSRDNAFLLKHVTLLINKSELDNAP-DGPKLVLASMASLEAGFSHDI---FV 339
           D +        D+  L  +V  L  +S  +  P  GPK+ +A  ASL+ G S ++   FV
Sbjct: 300 DYLI------LDDKRLFANV--LTCRSAEEVLPIQGPKICVADGASLDFGPSAELLEYFV 351

Query: 340 EWASD-VKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAY------E 392
           +   D   +L++ TE    GT A ++ A    + +   ++RR  L GEEL  Y      +
Sbjct: 352 KGNRDGADHLIVLTEPPLPGTNATVVTAAGDGERLHFQITRRSRLSGEELEEYYIDLEHD 411

Query: 393 EEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGG 448
            EQ R + E      +V ++E  A+     N  GD    D ++    A     HGG
Sbjct: 412 VEQRRRELEAQSIFQVVPDDEEDAA-----NTKGDADDDDDDDGEWVAAAATSHGG 462


>gi|71754401|ref|XP_828115.1| cleavage and polyadenylation specificity factor [Trypanosoma
           brucei]
 gi|70833501|gb|EAN79003.1| cleavage and polyadenylation specificity factor, putative
           [Trypanosoma brucei brucei strain 927/4 GUTat10.1]
          Length = 818

 Score =  124 bits (311), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 108/387 (27%), Positives = 177/387 (45%), Gaps = 35/387 (9%)

Query: 18  PLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQ 77
           P++YL+ IDG   L+DCGWND F+ S L  L      + AVL S P+    GALP+ M+ 
Sbjct: 28  PMAYLLEIDGVRILMDCGWNDGFETSYLDALLPYLGDLHAVLFSTPELSSCGALPFVMEH 87

Query: 78  LGLSAPVFSTEPVYRLGLLTMYDQYL---------SRRQVSEFDLFTLDDIDSAFQSVTR 128
           +     V +     ++GL  +   +L           +   EF++ T+D I SAF+SV R
Sbjct: 88  ITAETHVAAAGATAKMGLHGLLHPFLYLFPNTNTWKLQSGVEFEM-TVDKIYSAFRSV-R 145

Query: 129 LTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVL 188
             Y     +  +   +   P  +G +LGG  W I    +++ Y  D++ +    LN    
Sbjct: 146 EPYGGKVTIRHRDVEVECFPVFSGRMLGGCGWLIKYQIDELFYCPDFSLKPSYALN---- 201

Query: 189 ESFVRP---AVLITDA--YNALHNQPPR--QQREMFQDAISKTLRAGGNVLLPVDSAGRV 241
             F  P    +L  D   ++ L N   +  +Q  +F   +  TLR G +VL+PV   GR 
Sbjct: 202 -RFAPPTTATLLFIDGSPFHLLGNSGKKYEEQLNVFIREVLSTLRNGKDVLVPVSVPGRG 260

Query: 242 LELLLILEDYWAEHSL-NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLL 300
           LE+L I+     E    NY I   +  ++  I    +  E + D +  S      N    
Sbjct: 261 LEVLTIIMHLLTEKGGDNYSIVLASVQAAEVIGKASTMTESLKDEVILSEHQLFANVITC 320

Query: 301 KHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI---FVEWA-SDVKNLVLFTERGQ 356
           K    +++ +       GPK+ LA   +L+ G + D+   F++ +  D ++L++F    +
Sbjct: 321 KTAQEVMSVA-------GPKVCLADGETLDYGVAADLLEYFLQGSDEDREHLIVFPWTPK 373

Query: 357 FGTLARMLQADPPPKAVKVTMSRRVPL 383
             T A  + A     A+KV  +RR+PL
Sbjct: 374 RDTTAFSVAAAAKGDAIKVQYTRRIPL 400


>gi|154282371|ref|XP_001541981.1| hypothetical protein HCAG_02152 [Ajellomyces capsulatus NAm1]
 gi|150410161|gb|EDN05549.1| hypothetical protein HCAG_02152 [Ajellomyces capsulatus NAm1]
          Length = 925

 Score =  124 bits (311), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 92/347 (26%), Positives = 165/347 (47%), Gaps = 27/347 (7%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H  +LPY + +      VF T     +    + D        S  D
Sbjct: 75  STVDILLISHFHLDHSASLPYVLSKTNFRGRVFMTHATKAIYKWLIQDNVRVSNTSSSSD 134

Query: 113 ----LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
               L+T  D  S    +  + ++  + ++     I + P  AGH+LG  ++ I+  G +
Sbjct: 135 QRTTLYTEQDHLSTLSHIEAIDFNTTHTINN----IRITPFPAGHVLGAAMFLISIAGLN 190

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRA 227
           +++  DY+R +++HL        V+  VLIT++   + + PPR +RE     +I+  L  
Sbjct: 191 ILFTGDYSREEDRHLISAEAPKGVKVDVLITESTFGVSSNPPRLEREAALIKSITSILNR 250

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG VL+PV + GR  ELLLIL++YW+ H      PIY++  ++   +   ++++  M ++
Sbjct: 251 GGRVLMPVFALGRAQELLLILDEYWSRHPELQKIPIYYIGNIARRCMVVYQTYIGAMNEN 310

Query: 286 ITKSF-------ETSRDNAFL-----LKHVTLLINKSELDNAPDGPKLVLASMASLEAGF 333
           I + F       E S D +        + V  + +    D+   G  ++LAS   L+ G 
Sbjct: 311 IKRLFRQRMAEAEASGDKSISAGPWDFRFVRSVRSIERFDDV--GGCVMLASPGMLQTGT 368

Query: 334 SHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRR 380
           S ++   WA + +N V+ T     GT+ + +  +  P+ +   MS R
Sbjct: 369 SRELLERWAPNERNGVIMTGYSVEGTMGKQILNE--PEQIPAVMSGR 413


>gi|448516292|ref|XP_003867539.1| mRNA cleavage and polyadenlylation factor [Candida orthopsilosis Co
           90-125]
 gi|380351878|emb|CCG22102.1| mRNA cleavage and polyadenlylation factor [Candida orthopsilosis]
          Length = 936

 Score =  124 bits (311), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 93/344 (27%), Positives = 158/344 (45%), Gaps = 26/344 (7%)

Query: 30  FLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGA---LPYAMKQLGLSAPVFS 86
            L D  WN   DP   + +       DA+++SH     +     L      +  + PV+S
Sbjct: 29  ILADPSWNG-IDPKAAKFMELHLQQTDAIIISHSTNEFISGYILLCITFPNIMSNIPVYS 87

Query: 87  TEPVYRLGLLTMYDQYLSRRQVSEF--DLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGI 144
           T PV +LG ++  + Y S   +     +L  LD+ID  F     + Y QN  +  +   I
Sbjct: 88  TLPVNQLGRISTVEYYRSSGILGPLLSNLVELDEIDYWFDKFIIVKYQQNVTICDRK--I 145

Query: 145 VVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN---------GTVLESFVRPA 195
            + P+ +GH LGGT W + K  + +IYA  +N  K+  LN         G    + +RP 
Sbjct: 146 TMTPYNSGHSLGGTFWLLVKKIDRIIYAPSWNHSKDAFLNSANFINSTSGNPHLALLRPT 205

Query: 196 VLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEH 255
             IT A +     P +++ E F   +  TL  GG+ ++P   +GR LE+  +++++    
Sbjct: 206 AFIT-ATDLGSAMPHKKRCEKFLQLVDATLANGGSAIIPTSISGRFLEVFHLVDEHLKGA 264

Query: 256 SLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLL----KHVTLLINKSE 311
            +  P+YFL+Y  +  + Y  S ++WM      ++ +   N  LL      V LL++ SE
Sbjct: 265 PI--PVYFLSYSGTKILSYASSLMDWMSSGFNNTWNSDIGNNSLLPFNPSKVDLLLDPSE 322

Query: 312 LDNAPDGPKLVLASMASLEAG-FSHDIFVEWASDVKNLVLFTER 354
           L   P G K++  +    + G  S  +F    +D +  V+ TE+
Sbjct: 323 LTQIP-GAKIIFCAGLDFKNGDLSSKVFSYLCNDERTTVILTEK 365



 Score = 42.7 bits (99), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 45/83 (54%), Gaps = 6/83 (7%)

Query: 615 LLPISTPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKVGPAGQK 673
           LL + + AP    + +G++++ DLK  LSS  + VEF G G L   + + IRK+     +
Sbjct: 849 LLMVISNAP---RLAIGNIRLPDLKKKLSSLNLNVEFKGEGTLVVNDVLAIRKIAYGSLE 905

Query: 674 GGGSGTQQIVIEGPLCEDYYKIR 696
              SG   IVI+G     YYK++
Sbjct: 906 SDDSG--DIVIDGNAGPLYYKVK 926


>gi|325090760|gb|EGC44070.1| endoribonuclease ysh1 [Ajellomyces capsulatus H88]
          Length = 893

 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 92/347 (26%), Positives = 165/347 (47%), Gaps = 27/347 (7%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H  +LPY + +      VF T     +    + D        S  D
Sbjct: 75  STVDILLISHFHLDHSASLPYVLSKTNFRGRVFMTHATKAIYKWLIQDNVRVSNTSSSSD 134

Query: 113 ----LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
               L+T  D  S    +  + ++  + ++     I + P  AGH+LG  ++ I+  G +
Sbjct: 135 QRTTLYTEQDHLSTLSHIEAIDFNTTHTINN----IRITPFPAGHVLGAAMFLISIAGLN 190

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRA 227
           +++  DY+R +++HL        V+  VLIT++   + + PPR +RE     +I+  L  
Sbjct: 191 ILFTGDYSREEDRHLISAEAPKGVKVDVLITESTFGVSSNPPRLEREAALIKSITSILNR 250

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG VL+PV + GR  ELLLIL++YW+ H      PIY++  ++   +   ++++  M ++
Sbjct: 251 GGRVLMPVFALGRAQELLLILDEYWSRHPELQKIPIYYIGNIARRCMVVYQTYIGAMNEN 310

Query: 286 ITKSF-------ETSRDNAFL-----LKHVTLLINKSELDNAPDGPKLVLASMASLEAGF 333
           I + F       E S D +        + V  + +    D+   G  ++LAS   L+ G 
Sbjct: 311 IKRLFRQRMAEAEASGDKSISAGPWDFRFVRSVRSIERFDDV--GGCVMLASPGMLQTGT 368

Query: 334 SHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRR 380
           S ++   WA + +N V+ T     GT+ + +  +  P+ +   MS R
Sbjct: 369 SRELLERWAPNERNGVIMTGYSVEGTMGKQILNE--PEQIPAVMSGR 413


>gi|261333901|emb|CBH16895.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 818

 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 108/387 (27%), Positives = 177/387 (45%), Gaps = 35/387 (9%)

Query: 18  PLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQ 77
           P++YL+ IDG   L+DCGWND F+ S L  L      + AVL S P+    GALP+ M+ 
Sbjct: 28  PMAYLLEIDGVRILMDCGWNDGFETSYLDALLPYLGDLHAVLFSTPELSSCGALPFVMEH 87

Query: 78  LGLSAPVFSTEPVYRLGLLTMYDQYL---------SRRQVSEFDLFTLDDIDSAFQSVTR 128
           +     V +     ++GL  +   +L           +   EF++ T+D I SAF+SV R
Sbjct: 88  ITAETHVAAAGATAKMGLHGLLHPFLYLFPNNNTWKLQSGVEFEM-TVDKIYSAFRSV-R 145

Query: 129 LTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVL 188
             Y     +  +   +   P  +G +LGG  W I    +++ Y  D++ +    LN    
Sbjct: 146 EPYGGKVTIRHRDVEVECFPVFSGRMLGGCGWLIKYQIDELFYCPDFSLKPSYALN---- 201

Query: 189 ESFVRP---AVLITDA--YNALHNQPPR--QQREMFQDAISKTLRAGGNVLLPVDSAGRV 241
             F  P    +L  D   ++ L N   +  +Q  +F   +  TLR G +VL+PV   GR 
Sbjct: 202 -RFAPPTTATLLFIDGSPFHLLGNSGKKYEEQLNVFIREVLSTLRNGKDVLVPVSVPGRG 260

Query: 242 LELLLILEDYWAEHSL-NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLL 300
           LE+L I+     E    NY I   +  ++  I    +  E + D +  S      N    
Sbjct: 261 LEVLTIIMHLLTEKGGDNYSIVLASVQAAEVIGKASTMTESLKDEVILSEHQLFANVITC 320

Query: 301 KHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI---FVEWA-SDVKNLVLFTERGQ 356
           K    +++ +       GPK+ LA   +L+ G + D+   F++ +  D ++L++F    +
Sbjct: 321 KTAQEVMSVA-------GPKVCLADGETLDYGVAADLLEYFLQSSDEDREHLIVFPWTPK 373

Query: 357 FGTLARMLQADPPPKAVKVTMSRRVPL 383
             T A  + A     A+KV  +RR+PL
Sbjct: 374 RDTTAFSVAAAAKGDAIKVQYTRRIPL 400


>gi|395840793|ref|XP_003793236.1| PREDICTED: integrator complex subunit 11 isoform 2 [Otolemur
           garnettii]
          Length = 499

 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 71/207 (34%), Positives = 112/207 (54%), Gaps = 7/207 (3%)

Query: 146 VAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNAL 205
           +  + AGH+LG  +++I    E V+Y  DYN   ++HL    ++   RP +LIT++  A 
Sbjct: 49  IKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CRPNLLITESTYAT 107

Query: 206 HNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFL 264
             +  ++ RE  F   + +T+  GG VL+PV + GR  EL ++LE +W   +L  PIYF 
Sbjct: 108 TIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFS 167

Query: 265 TYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLA 324
           T ++     Y K F+ W    I K+F   + N F  KH+    +++  DN   GP +V A
Sbjct: 168 TGLTEKANHYYKLFITWTNQKIRKTF--VQRNMFEFKHIKAF-DRAFADNP--GPMVVFA 222

Query: 325 SMASLEAGFSHDIFVEWASDVKNLVLF 351
           +   L AG S  IF +WA + KN+V+ 
Sbjct: 223 TPGMLHAGQSLQIFRKWAGNEKNMVIM 249


>gi|302501173|ref|XP_003012579.1| hypothetical protein ARB_01192 [Arthroderma benhamiae CBS 112371]
 gi|291176138|gb|EFE31939.1| hypothetical protein ARB_01192 [Arthroderma benhamiae CBS 112371]
          Length = 991

 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 112/403 (27%), Positives = 172/403 (42%), Gaps = 80/403 (19%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL----SA 82
           G   L+D GW++ FD S+L+ L +      A L S   T +L  L YA   L      S 
Sbjct: 27  GVKILVDVGWDESFDTSVLKELERFVCPYTAALGSFGRT-YLQNL-YASAPLAATFLPST 84

Query: 83  PVFSTEPVYRLGLLTM---------YDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQ 133
            V +++P   L + ++         Y+   S R +      T +DI   F  +  L YSQ
Sbjct: 85  SVTASDPSSGLTIQSVTSSSQGPSGYENTGSGRIL--LPPPTNEDIARYFSLIHPLKYSQ 142

Query: 134 NYHLSGKG-----EGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT-- 186
                         G+ +  + AGH +GGT+W I    E ++YAVD+++ +E  + G   
Sbjct: 143 PLQPLPSPFSPPLNGLTITAYNAGHTVGGTIWHIQHGMESIVYAVDWSQARENVIAGAAW 202

Query: 187 ----------VLESFVRPAVLITDAYNALHNQPP--RQQRE-MFQDAISKTLRAGGNVLL 233
                     V+E   +P  LI  A        P  R++R+ +  D I      GG VLL
Sbjct: 203 FGSSIGSGTEVIEQLRKPTALICSASGGDKFALPGGRKKRDGLLLDMIRSCAAKGGTVLL 262

Query: 234 PVDSAGRVLELLLILEDYWAEHS---------LNYPIYFLTYVSSSTIDYVKSFLEWMGD 284
           P DS+ RVLE+  +LE  W E +          N P+Y     +  T+   +S LEWM +
Sbjct: 263 PTDSSARVLEIAYVLEHAWREAADSEDSNDPLKNTPLYLAGKKAHDTMRLARSMLEWMDE 322

Query: 285 SITKSFE------------------------TSRDNA--------FLLKHVTLLINKSEL 312
           +I + FE                         S+ +A        F  KH+ L+ +K++L
Sbjct: 323 NIVREFEGNDGVEATTGKAAGGASNQPSKGVQSQKSATGQKSLGPFTFKHLNLVEHKAKL 382

Query: 313 DNA--PDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
           D      GPK++L+   SLE G S  +    A   +NL++ TE
Sbjct: 383 DGVLESKGPKVILSPDTSLEWGLSKHVLKHIAEGNENLIIMTE 425


>gi|296803464|ref|XP_002842585.1| endoribonuclease ysh1 [Arthroderma otae CBS 113480]
 gi|238838904|gb|EEQ28566.1| endoribonuclease ysh1 [Arthroderma otae CBS 113480]
          Length = 854

 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 86/335 (25%), Positives = 160/335 (47%), Gaps = 25/335 (7%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H G+LPY + +      VF T     +    + D        S  D
Sbjct: 74  STVDILLISHFHLDHSGSLPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVSNTSSSSD 133

Query: 113 ----LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
               L+T  D  S    +  + ++  + ++     I + P  AGH+LG  ++ I+  G +
Sbjct: 134 QRTSLYTEHDHLSTLPIIETIDFNTTHTINS----IRITPFPAGHVLGAAMFLISIAGLN 189

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRA 227
           +++  DY+R +++HL    +   V+  V+IT++   + + PPR +RE     +++  +  
Sbjct: 190 ILFTGDYSREEDRHLISAEVPKGVKIDVMITESTFGISSNPPRLEREAALIKSVTSIINR 249

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG VL+PV + GR  ELLLIL++YW+ H      PIY++  ++   +   ++++  M ++
Sbjct: 250 GGRVLMPVFALGRAQELLLILDEYWSRHPELQKVPIYYIGNMARRCMVVYQTYIGAMNEN 309

Query: 286 ITKSFETSRDNA------------FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGF 333
           I + F      A            +  + V  L N    ++   G  ++LAS   L+ G 
Sbjct: 310 IKRLFRQRMAEAEARGDKSVTAGPWDFRFVRSLRNLDRFEDV--GGCVMLASPGMLQTGT 367

Query: 334 SHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           S ++   WA + +N V+ T     GT+ + +  +P
Sbjct: 368 SRELLERWAPNERNGVIMTGYSVEGTMGKQIINEP 402


>gi|154422115|ref|XP_001584070.1| RNA-metabolising metallo-beta-lactamase family protein [Trichomonas
           vaginalis G3]
 gi|121918315|gb|EAY23084.1| RNA-metabolising metallo-beta-lactamase family protein [Trichomonas
           vaginalis G3]
          Length = 588

 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 107/389 (27%), Positives = 181/389 (46%), Gaps = 29/389 (7%)

Query: 20  SYLVSIDGFNFLIDCGWN----DHFD--PSLLQPLSKVASTIDAVLLSHPDTLHLGALPY 73
           S LV I     L+DCG N    D  D  P+   P  KV    D VL+SH  T HL A+PY
Sbjct: 28  SILVEIGSKKVLLDCGVNFTATDEKDRLPAYQDPFPKV----DLVLISHIHTDHLAAVPY 83

Query: 74  AMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQ 133
             + L   APV+ T    ++ +  M D +L   +V+E   +  +D+ +    +  + +  
Sbjct: 84  LTEVLKCQAPVYMTR-ASQMMMPIMLDDFL---KVTENPPYKAEDLTNCKPKIKVVEFYS 139

Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 193
            +  +    GI V    AGH+LG   + +   G   IY  D++   + HL+G  +     
Sbjct: 140 RFEAA---PGIFVQAFPAGHILGAACFFVQVRGLSFIYTGDFSAIADHHLSGHAVPRLF- 195

Query: 194 PAVLITDAY--NALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDY 251
           P +LIT++   N + +   +++R   Q  + + +  GG VL+PV + GR+ E+ L+LEDY
Sbjct: 196 PDLLITESTYGNQVRDSIAKRERSFVQ-MVHQVVGEGGKVLIPVFAVGRLQEICLMLEDY 254

Query: 252 WAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHV-TLLINKS 310
           W       PIY+ T +  + +   K  + WM  ++  +   +   AF   +       KS
Sbjct: 255 WNRMGYTEPIYYTTNLGENCMKVYKQCVNWMNPTVQTNLFDNGSTAFKFTYSRNFNPKKS 314

Query: 311 ELDNAPDGPKLVLASMASLEAG---FSHDIFVEWASDVKNLVLFTERGQFGTLAR-MLQA 366
           ++D +     ++LA+   L  G   F+  +  +W  D +N+V+F       T  R +L  
Sbjct: 315 KIDESRG--LVMLATSGMLNPGTPAFNFFVNEKWYDDPRNMVIFPGYCGPNTFGRAVLTR 372

Query: 367 DPPPKAVKVTMSRRVPLVGEELIAYEEEQ 395
           D     V+ T SRR  +  + +I  + E+
Sbjct: 373 DLTTNRVQFT-SRRPAMTVDIIIKCKVER 400


>gi|426327398|ref|XP_004024505.1| PREDICTED: integrator complex subunit 11 isoform 5 [Gorilla gorilla
           gorilla]
          Length = 502

 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 72/214 (33%), Positives = 115/214 (53%), Gaps = 7/214 (3%)

Query: 139 GKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLI 198
           G  + + +  + AGH+LG  +++I    E V+Y  DYN   ++HL    ++   RP +LI
Sbjct: 45  GVNDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CRPNLLI 103

Query: 199 TDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL 257
           T++  A   +  ++ RE  F   + +T+  GG VL+PV + GR  EL ++LE +W   +L
Sbjct: 104 TESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNL 163

Query: 258 NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPD 317
             PIYF T ++     Y K F+ W    I K+F   + N F  KH+    +++  DN   
Sbjct: 164 KVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQRNMFEFKHIKAF-DRAFADNP-- 218

Query: 318 GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 219 GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 252


>gi|225561321|gb|EEH09601.1| endoribonuclease ysh1 [Ajellomyces capsulatus G186AR]
          Length = 903

 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 92/347 (26%), Positives = 165/347 (47%), Gaps = 27/347 (7%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H  +LPY + +      VF T     +    + D        S  D
Sbjct: 75  STVDILLISHFHLDHSASLPYVLSKTNFRGRVFMTHATKAIYKWLIQDNVRVSNTSSSSD 134

Query: 113 ----LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
               L+T  D  S    +  + ++  + ++     I + P  AGH+LG  ++ I+  G +
Sbjct: 135 QRTTLYTEQDHLSTLSHIEAIDFNTTHTINN----IRITPFPAGHVLGAAMFLISIAGLN 190

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRA 227
           +++  DY+R +++HL        V+  VLIT++   + + PPR +RE     +I+  L  
Sbjct: 191 ILFTGDYSREEDRHLISAEAPKGVKVDVLITESTFGVSSNPPRLEREAALIKSITSILNR 250

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG VL+PV + GR  ELLLIL++YW+ H      PIY++  ++   +   ++++  M ++
Sbjct: 251 GGRVLMPVFALGRAQELLLILDEYWSRHPELQKIPIYYIGNIARRCMVVYQTYIGAMNEN 310

Query: 286 ITKSF-------ETSRDNAFL-----LKHVTLLINKSELDNAPDGPKLVLASMASLEAGF 333
           I + F       E S D +        + V  + +    D+   G  ++LAS   L+ G 
Sbjct: 311 IKRLFRQRMAEAEASGDKSISAGPWDFRFVRSVRSIERFDDV--GGCVMLASPGMLQTGT 368

Query: 334 SHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRR 380
           S ++   WA + +N V+ T     GT+ + +  +  P+ +   MS R
Sbjct: 369 SRELLERWAPNERNGVIMTGYSVEGTMGKQILNE--PEQIPAVMSGR 413


>gi|426327396|ref|XP_004024504.1| PREDICTED: integrator complex subunit 11 isoform 4 [Gorilla gorilla
           gorilla]
          Length = 499

 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 71/207 (34%), Positives = 112/207 (54%), Gaps = 7/207 (3%)

Query: 146 VAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNAL 205
           +  + AGH+LG  +++I    E V+Y  DYN   ++HL    ++   RP +LIT++  A 
Sbjct: 49  IKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CRPNLLITESTYAT 107

Query: 206 HNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFL 264
             +  ++ RE  F   + +T+  GG VL+PV + GR  EL ++LE +W   +L  PIYF 
Sbjct: 108 TIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFS 167

Query: 265 TYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLA 324
           T ++     Y K F+ W    I K+F   + N F  KH+    +++  DN   GP +V A
Sbjct: 168 TGLTEKANHYYKLFIPWTNQKIRKTF--VQRNMFEFKHIKAF-DRAFADNP--GPMVVFA 222

Query: 325 SMASLEAGFSHDIFVEWASDVKNLVLF 351
           +   L AG S  IF +WA + KN+V+ 
Sbjct: 223 TPGMLHAGQSLQIFRKWAGNEKNMVIM 249


>gi|402852595|ref|XP_003891003.1| PREDICTED: integrator complex subunit 11 isoform 2 [Papio anubis]
          Length = 499

 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 71/207 (34%), Positives = 112/207 (54%), Gaps = 7/207 (3%)

Query: 146 VAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNAL 205
           +  + AGH+LG  +++I    E V+Y  DYN   ++HL    ++   RP +LIT++  A 
Sbjct: 49  IKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CRPNLLITESTYAT 107

Query: 206 HNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFL 264
             +  ++ RE  F   + +T+  GG VL+PV + GR  EL ++LE +W   +L  PIYF 
Sbjct: 108 TIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFS 167

Query: 265 TYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLA 324
           T ++     Y K F+ W    I K+F   + N F  KH+    +++  DN   GP +V A
Sbjct: 168 TGLTEKANHYYKLFIPWTNQKIRKTF--VQRNMFEFKHIKAF-DRAFADNP--GPMVVFA 222

Query: 325 SMASLEAGFSHDIFVEWASDVKNLVLF 351
           +   L AG S  IF +WA + KN+V+ 
Sbjct: 223 TPGMLHAGQSLQIFRKWAGNEKNMVIM 249


>gi|374253826|ref|NP_001243391.1| integrator complex subunit 11 isoform 4 [Homo sapiens]
          Length = 502

 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 72/214 (33%), Positives = 115/214 (53%), Gaps = 7/214 (3%)

Query: 139 GKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLI 198
           G  + + +  + AGH+LG  +++I    E V+Y  DYN   ++HL    ++   RP +LI
Sbjct: 45  GVNDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CRPNLLI 103

Query: 199 TDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL 257
           T++  A   +  ++ RE  F   + +T+  GG VL+PV + GR  EL ++LE +W   +L
Sbjct: 104 TESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNL 163

Query: 258 NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPD 317
             PIYF T ++     Y K F+ W    I K+F   + N F  KH+    +++  DN   
Sbjct: 164 KVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQRNMFEFKHIKAF-DRAFADNP-- 218

Query: 318 GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 219 GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 252


>gi|302846726|ref|XP_002954899.1| hypothetical protein VOLCADRAFT_65253 [Volvox carteri f.
           nagariensis]
 gi|300259874|gb|EFJ44098.1| hypothetical protein VOLCADRAFT_65253 [Volvox carteri f.
           nagariensis]
          Length = 477

 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 99/378 (26%), Positives = 163/378 (43%), Gaps = 45/378 (11%)

Query: 2   GTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPS-------LLQPLSKVAST 54
           G   Q  P     +      +V + G   + DCG +  F  +       LL    +    
Sbjct: 10  GAERQTVPTGAGQDVGRSCCIVRMAGRTVMFDCGAHFGFRDARRFPEFGLLSRAGRFTEI 69

Query: 55  IDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQY-LSRRQVSEFDL 113
           IDAV+++H  T HLGALPY  +  G   P+  T P + +  + + D   ++  +  E   
Sbjct: 70  IDAVVITHFHTDHLGALPYFTEICGYRGPILMTYPTFAIAPIMLADYVKVNADRPGERLP 129

Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAP------HVAGHLLGGTVWKITKDGE 167
           +    +    + VT +   Q          +VVAP      H AGH+LG  +  +T    
Sbjct: 130 YNEQHVRDCLRRVTAVDLHQV---------VVVAPGLSFTFHYAGHVLGAAMVHMTAGHL 180

Query: 168 DVIYAVDYNRRKEKHLN-----------GTVLESFVRPAVLITDA-YNALHNQPPRQQRE 215
             +Y  D+N   ++HL            G    S   P VLI++A Y A      R +  
Sbjct: 181 TALYTGDFNSSPDRHLGPAEAPLALLQGGPSGASVRHPDVLISEATYAATLRDSKRARER 240

Query: 216 MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYV 275
               A+ +T+ AGG VL+P  + GR  ELL+++ D W  + L  PIYF + +++  + Y 
Sbjct: 241 DLLGAVVETVAAGGKVLIPTFAMGRAQELLMLITDCWERNGLQVPIYFSSAMAARALVYY 300

Query: 276 KSFLEWMGDSITKSFETSRDNAFLLKHVTLL--INKSELDNAPDGPKLVLASMASLEAGF 333
           +  L W   +            F+  H+ +   I+ + +  AP GP L+ AS  ++ +G 
Sbjct: 301 QLLLNWTNANHIHC-------VFVNVHICVCTHIHTTWMMLAP-GPALLFASPGNIASGV 352

Query: 334 SHDIFVEWASDVKNLVLF 351
           + + F  WA   KNL++ 
Sbjct: 353 ALEAFRSWAGSSKNLLVL 370


>gi|327356883|gb|EGE85740.1| endoribonuclease ysh1 [Ajellomyces dermatitidis ATCC 18188]
          Length = 887

 Score =  124 bits (310), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 91/347 (26%), Positives = 165/347 (47%), Gaps = 27/347 (7%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H  +LPY + +      VF T     +    + D        S  D
Sbjct: 75  STVDILLISHFHLDHSASLPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVSNTSSSSD 134

Query: 113 ----LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
               L+T  D  S    +  + ++  + ++     I + P  AGH+LG  ++ I+  G +
Sbjct: 135 QRTTLYTEQDHLSTLSQIEAIDFNTTHTINS----IRITPFPAGHVLGAAMFLISIAGLN 190

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRA 227
           +++  DY+R +++HL        ++  VLIT++   + + PPR +RE     +I+  L  
Sbjct: 191 ILFTGDYSREEDRHLISAEAPKGIKIDVLITESTFGVSSNPPRLEREAALMKSITGVLNR 250

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG VL+PV + GR  ELLLIL++YW+ H      PIY++  ++   +   ++++  M ++
Sbjct: 251 GGRVLMPVFALGRAQELLLILDEYWSRHPELQKIPIYYIGNIARRCMVVYQTYIGAMNEN 310

Query: 286 ITKSF-------ETSRDNA-----FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGF 333
           I + F       E S D +     +  + V  + +    D+   G  ++LAS   L+ G 
Sbjct: 311 IKRLFRQRMAEAEASGDKSVSAGPWDFRFVRSVRSIERFDDV--GGCVMLASPGMLQTGT 368

Query: 334 SHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRR 380
           S ++   WA   +N V+ T     GT+ + +  +  P+ +   MS R
Sbjct: 369 SRELLERWAPSERNGVIMTGYSVEGTMGKQILNE--PEQIPAVMSGR 413


>gi|397476280|ref|XP_003809535.1| PREDICTED: integrator complex subunit 11 isoform 3 [Pan paniscus]
          Length = 499

 Score =  123 bits (309), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 71/207 (34%), Positives = 112/207 (54%), Gaps = 7/207 (3%)

Query: 146 VAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNAL 205
           +  + AGH+LG  +++I    E V+Y  DYN   ++HL    ++   RP +LIT++  A 
Sbjct: 49  IKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CRPNLLITESTYAT 107

Query: 206 HNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFL 264
             +  ++ RE  F   + +T+  GG VL+PV + GR  EL ++LE +W   +L  PIYF 
Sbjct: 108 TIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFS 167

Query: 265 TYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLA 324
           T ++     Y K F+ W    I K+F   + N F  KH+    +++  DN   GP +V A
Sbjct: 168 TGLTEKANHYYKLFIPWTNQKIRKTF--VQRNMFEFKHIKAF-DRAFADNP--GPMVVFA 222

Query: 325 SMASLEAGFSHDIFVEWASDVKNLVLF 351
           +   L AG S  IF +WA + KN+V+ 
Sbjct: 223 TPGMLHAGQSLQIFRKWAGNEKNMVIM 249


>gi|239612611|gb|EEQ89598.1| endoribonuclease ysh1 [Ajellomyces dermatitidis ER-3]
          Length = 904

 Score =  123 bits (309), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 91/347 (26%), Positives = 165/347 (47%), Gaps = 27/347 (7%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H  +LPY + +      VF T     +    + D        S  D
Sbjct: 75  STVDILLISHFHLDHSASLPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVSNTSSSSD 134

Query: 113 ----LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
               L+T  D  S    +  + ++  + ++     I + P  AGH+LG  ++ I+  G +
Sbjct: 135 QRTTLYTEQDHLSTLSQIEAIDFNTTHTINS----IRITPFPAGHVLGAAMFLISIAGLN 190

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRA 227
           +++  DY+R +++HL        ++  VLIT++   + + PPR +RE     +I+  L  
Sbjct: 191 ILFTGDYSREEDRHLISAEAPKGIKIDVLITESTFGVSSNPPRLEREAALMKSITGVLNR 250

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG VL+PV + GR  ELLLIL++YW+ H      PIY++  ++   +   ++++  M ++
Sbjct: 251 GGRVLMPVFALGRAQELLLILDEYWSRHPELQKIPIYYIGNIARRCMVVYQTYIGAMNEN 310

Query: 286 ITKSF-------ETSRDNA-----FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGF 333
           I + F       E S D +     +  + V  + +    D+   G  ++LAS   L+ G 
Sbjct: 311 IKRLFRQRMAEAEASGDKSVSAGPWDFRFVRSVRSIERFDDV--GGCVMLASPGMLQTGT 368

Query: 334 SHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRR 380
           S ++   WA   +N V+ T     GT+ + +  +  P+ +   MS R
Sbjct: 369 SRELLERWAPSERNGVIMTGYSVEGTMGKQILNE--PEQIPAVMSGR 413


>gi|256077072|ref|XP_002574832.1| cleavage and polyadenylation specificity factor [Schistosoma
           mansoni]
          Length = 1063

 Score =  123 bits (309), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 62/159 (38%), Positives = 92/159 (57%), Gaps = 5/159 (3%)

Query: 200 DAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS-- 256
           D  N L+ QP R+ R E  +  + K+LR GGNVL+ VD+AGR LE+   LE  W      
Sbjct: 2   DGSNTLYTQPRRKDRDENLRQTVLKSLRRGGNVLIAVDTAGRCLEVAHFLEQCWLNQESG 61

Query: 257 -LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNA 315
            + Y +  L YV+ + +D+ KS +EWM + + +SFE  R N F  +H+ L     +LD A
Sbjct: 62  LMAYGLAMLNYVALNVVDFAKSMVEWMSEKVMRSFEDQRSNPFHFRHMQLCHTLEQLD-A 120

Query: 316 PDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
              PK+VL+S++ L  GFS  +F EWA +  N ++ T +
Sbjct: 121 VSEPKVVLSSLSDLSCGFSRQLFAEWADNDLNTIILTSQ 159



 Score = 65.5 bits (158), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 58/243 (23%), Positives = 103/243 (42%), Gaps = 67/243 (27%)

Query: 505 GGDDGKLDEGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQHCLKHVC---PHVYTP 561
           G  DG   E    +++  +P +++      LV  +A A +HL  +C   +     +++ P
Sbjct: 499 GRSDG---EAMKRILIGLRPQEII------LVGNNAPAIDHLANYCRGVMLLDPNYIHIP 549

Query: 562 QIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVG--------------- 606
              E ++ T +   Y+ ++ + L+S++ F K+ DYE+AWV+A V                
Sbjct: 550 HPREIVNCTKEGDIYQARMKDSLVSSLKFTKIRDYELAWVEATVSLDDKFDYHIKEKRNN 609

Query: 607 -----------------KTENGM---------LSLLPI-STPAPP---HKSVLVGDLKMA 636
                             T N +            LP+ S P  P   HK+V V + K++
Sbjct: 610 NNTGNNDNDDDNGDVEMSTGNNLELRSRTPLAADQLPVLSLPTGPIGQHKTVFVNEPKLS 669

Query: 637 DLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIR 696
           DLK  L S+G+  EF  G L     V I++          S   ++++EG LC  Y++  
Sbjct: 670 DLKQLLLSQGLMAEFVSGILVVDNCVAIKR----------SEAGKLLLEGLLCGTYFETF 719

Query: 697 AYL 699
            ++
Sbjct: 720 DFM 722


>gi|296206479|ref|XP_002750226.1| PREDICTED: integrator complex subunit 11 isoform 2 [Callithrix
           jacchus]
          Length = 499

 Score =  123 bits (309), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 71/207 (34%), Positives = 112/207 (54%), Gaps = 7/207 (3%)

Query: 146 VAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNAL 205
           +  + AGH+LG  +++I    E V+Y  DYN   ++HL    ++   RP +LIT++  A 
Sbjct: 49  IKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CRPNLLITESTYAT 107

Query: 206 HNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFL 264
             +  ++ RE  F   + +T+  GG VL+PV + GR  EL ++LE +W   +L  PIYF 
Sbjct: 108 TIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFS 167

Query: 265 TYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLA 324
           T ++     Y K F+ W    I K+F   + N F  KH+    +++  DN   GP +V A
Sbjct: 168 TGLTEKANHYYKLFIPWTNQKIRKTF--VQRNMFEFKHIKAF-DRAFADNP--GPMVVFA 222

Query: 325 SMASLEAGFSHDIFVEWASDVKNLVLF 351
           +   L AG S  IF +WA + KN+V+ 
Sbjct: 223 TPGMLHAGQSLQIFRKWAGNEKNMVIM 249


>gi|374253828|ref|NP_001243392.1| integrator complex subunit 11 isoform 5 [Homo sapiens]
 gi|119576639|gb|EAW56235.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_c
           [Homo sapiens]
 gi|119576644|gb|EAW56240.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_c
           [Homo sapiens]
          Length = 499

 Score =  123 bits (309), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 71/207 (34%), Positives = 112/207 (54%), Gaps = 7/207 (3%)

Query: 146 VAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNAL 205
           +  + AGH+LG  +++I    E V+Y  DYN   ++HL    ++   RP +LIT++  A 
Sbjct: 49  IKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CRPNLLITESTYAT 107

Query: 206 HNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFL 264
             +  ++ RE  F   + +T+  GG VL+PV + GR  EL ++LE +W   +L  PIYF 
Sbjct: 108 TIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFS 167

Query: 265 TYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLA 324
           T ++     Y K F+ W    I K+F   + N F  KH+    +++  DN   GP +V A
Sbjct: 168 TGLTEKANHYYKLFIPWTNQKIRKTF--VQRNMFEFKHIKAF-DRAFADNP--GPMVVFA 222

Query: 325 SMASLEAGFSHDIFVEWASDVKNLVLF 351
           +   L AG S  IF +WA + KN+V+ 
Sbjct: 223 TPGMLHAGQSLQIFRKWAGNEKNMVIM 249


>gi|119576647|gb|EAW56243.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_i
           [Homo sapiens]
          Length = 502

 Score =  123 bits (309), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 71/207 (34%), Positives = 112/207 (54%), Gaps = 7/207 (3%)

Query: 146 VAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNAL 205
           +  + AGH+LG  +++I    E V+Y  DYN   ++HL    ++   RP +LIT++  A 
Sbjct: 52  IKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CRPNLLITESTYAT 110

Query: 206 HNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFL 264
             +  ++ RE  F   + +T+  GG VL+PV + GR  EL ++LE +W   +L  PIYF 
Sbjct: 111 TIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFS 170

Query: 265 TYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLA 324
           T ++     Y K F+ W    I K+F   + N F  KH+    +++  DN   GP +V A
Sbjct: 171 TGLTEKANHYYKLFIPWTNQKIRKTF--VQRNMFEFKHIKAF-DRAFADNP--GPMVVFA 225

Query: 325 SMASLEAGFSHDIFVEWASDVKNLVLF 351
           +   L AG S  IF +WA + KN+V+ 
Sbjct: 226 TPGMLHAGQSLQIFRKWAGNEKNMVIM 252


>gi|312080023|ref|XP_003142424.1| cpsf3-prov protein [Loa loa]
          Length = 715

 Score =  123 bits (309), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 103/409 (25%), Positives = 183/409 (44%), Gaps = 68/409 (16%)

Query: 4   SVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVAST--IDAVLLS 61
           S+ +TPL          + ++  G   L+DCG +         P         +D +L++
Sbjct: 12  SLVITPLGSGQEVGRSCHYLTFKGKKILLDCGIHPGMSGVDALPFVDFVDCEELDLLLVT 71

Query: 62  HPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD------ 112
           H    H GALP+ +++       F   +T+ +YR+ +      YL   +VS++       
Sbjct: 72  HFHLDHCGALPWLLEKTAFRGRCFMTHATKAIYRMSI----GDYL---KVSKYGGSSDNR 124

Query: 113 -LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIY 171
            L+  +D++ + + +  +    ++H   +  GI    HVAGH+LG  ++ I   G  ++Y
Sbjct: 125 MLYNEEDLEKSMEKIEVI----DFHEQKEVNGIKFWCHVAGHVLGACMFMIEIAGVRILY 180

Query: 172 AVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNV 231
             D++R +++HL    L + V P VLI ++         R +RE       K +  GG  
Sbjct: 181 TGDFSRLEDRHLCAAELPT-VSPDVLICESTYGTQVHESRDERE-------KVVGRGGRC 232

Query: 232 LLPVDSAGRVLELLLILEDYWAEH-----------------------------SLNYPIY 262
           L+P  + GR  ELLLIL++YW  H                              ++  I+
Sbjct: 233 LIPAFALGRAQELLLILDEYWEAHPELQDIPNNPVCCNADEMTVVEPNRSVIVGIDLLIF 292

Query: 263 F--LTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPD-GP 319
           F   + ++   +   ++F+  M   I K  + + +N F+ KHV+   N   +D+  D GP
Sbjct: 293 FDHASSLAKKCMAVYQTFVSGMNSRIQK--QIALNNPFVFKHVS---NLKSIDHFEDVGP 347

Query: 320 KLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
            +VLAS   L+ G S ++F  W +D KN  +       GTLA+ + ++P
Sbjct: 348 CVVLASPGMLQNGLSRELFENWCTDSKNGCIIAGYCVEGTLAKHILSEP 396


>gi|119576648|gb|EAW56244.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_j
           [Homo sapiens]
          Length = 476

 Score =  123 bits (309), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 71/207 (34%), Positives = 112/207 (54%), Gaps = 7/207 (3%)

Query: 146 VAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNAL 205
           +  + AGH+LG  +++I    E V+Y  DYN   ++HL    ++   RP +LIT++  A 
Sbjct: 26  IKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CRPNLLITESTYAT 84

Query: 206 HNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFL 264
             +  ++ RE  F   + +T+  GG VL+PV + GR  EL ++LE +W   +L  PIYF 
Sbjct: 85  TIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFS 144

Query: 265 TYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLA 324
           T ++     Y K F+ W    I K+F   + N F  KH+    +++  DN   GP +V A
Sbjct: 145 TGLTEKANHYYKLFIPWTNQKIRKTF--VQRNMFEFKHIKAF-DRAFADNP--GPMVVFA 199

Query: 325 SMASLEAGFSHDIFVEWASDVKNLVLF 351
           +   L AG S  IF +WA + KN+V+ 
Sbjct: 200 TPGMLHAGQSLQIFRKWAGNEKNMVIM 226


>gi|34783058|gb|AAH00675.2| CPSF3L protein, partial [Homo sapiens]
          Length = 473

 Score =  123 bits (309), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 71/207 (34%), Positives = 112/207 (54%), Gaps = 7/207 (3%)

Query: 146 VAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNAL 205
           +  + AGH+LG  +++I    E V+Y  DYN   ++HL    ++   RP +LIT++  A 
Sbjct: 23  IKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CRPNLLITESTYAT 81

Query: 206 HNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFL 264
             +  ++ RE  F   + +T+  GG VL+PV + GR  EL ++LE +W   +L  PIYF 
Sbjct: 82  TIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFS 141

Query: 265 TYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLA 324
           T ++     Y K F+ W    I K+F   + N F  KH+    +++  DN   GP +V A
Sbjct: 142 TGLTEKANHYYKLFIPWTNQKIRKTF--VQRNMFEFKHIKAF-DRAFADNP--GPMVVFA 196

Query: 325 SMASLEAGFSHDIFVEWASDVKNLVLF 351
           +   L AG S  IF +WA + KN+V+ 
Sbjct: 197 TPGMLHAGQSLQIFRKWAGNEKNMVIM 223


>gi|315043764|ref|XP_003171258.1| endoribonuclease ysh1 [Arthroderma gypseum CBS 118893]
 gi|311345047|gb|EFR04250.1| endoribonuclease ysh1 [Arthroderma gypseum CBS 118893]
          Length = 853

 Score =  123 bits (308), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 85/335 (25%), Positives = 159/335 (47%), Gaps = 25/335 (7%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H G+LPY + +      VF T     +    + D        S  D
Sbjct: 74  STVDILLISHFHLDHSGSLPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVSNTSSSSD 133

Query: 113 ----LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
               L+   D  S    +  + ++  + ++     I + P  AGH+LG  ++ I+  G +
Sbjct: 134 QRTSLYNEHDHLSTLPIIETIDFNTTHTINS----IRITPFPAGHVLGAAMFLISIAGLN 189

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRA 227
           +++  DY+R +++HL    +   V+  V+IT++   + + PPR +RE     +++  +  
Sbjct: 190 ILFTGDYSREEDRHLISAEVPKSVKIDVMITESTFGISSNPPRLEREAALMKSVTSVINR 249

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG VL+PV + GR  ELLLIL++YW+ H      PIY++  ++   +   ++++  M ++
Sbjct: 250 GGRVLMPVFALGRAQELLLILDEYWSRHPELQKVPIYYIGNMARRCMVVYQTYIGAMNEN 309

Query: 286 ITKSFETSRDNA------------FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGF 333
           I + F      A            +  + V  L N    ++   G  ++LAS   L+ G 
Sbjct: 310 IKRLFRQRMAEAEARGDKSVTAGPWDFRFVRSLRNLDRFEDV--GGCVMLASPGMLQTGT 367

Query: 334 SHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           S ++   WA + +N V+ T     GT+ + +  +P
Sbjct: 368 SRELLERWAPNERNGVIMTGYSVEGTMGKQIINEP 402


>gi|367015916|ref|XP_003682457.1| hypothetical protein TDEL_0F04350 [Torulaspora delbrueckii]
 gi|359750119|emb|CCE93246.1| hypothetical protein TDEL_0F04350 [Torulaspora delbrueckii]
          Length = 835

 Score =  123 bits (308), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 136/541 (25%), Positives = 236/541 (43%), Gaps = 72/541 (13%)

Query: 22  LVSIDGFNFLIDCGWNDH---FDPSLLQPLSKVASTIDAVLLSHPDTLHLGA-------- 70
           ++  D    L+D  W+     ++ S+ +  S++   +D +LLS P    LGA        
Sbjct: 19  IIRFDNVTILVDPSWHSSKISYENSV-RFWSEIIPEVDIILLSQPSVETLGAYGSLYHNF 77

Query: 71  LPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLD--DIDSAFQSVTR 128
           L + + ++     V++T PV  LG +T  D Y S+  +  F    +D  D++ AF  +  
Sbjct: 78  LSHFISRI----EVYATLPVSNLGRVTTIDYYTSKGLIGPFKANQIDLRDVEFAFDHIQT 133

Query: 129 LTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTV- 187
           L YSQ   L  K +G+ +  + AG   GG VW I+   E ++YA  +N  +   LNG+  
Sbjct: 134 LKYSQLADLRSKYDGLTLIAYSAGVSPGGCVWCISTYFEKLVYAFRWNHTRNTILNGSSL 193

Query: 188 -------LESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGR 240
                  L +  RP+ +IT       ++P  ++ ++F+DA+ + L + G+VL+P +  G 
Sbjct: 194 LDKTGKPLATLARPSAVITKLDKFGSSKPHGKRVKVFKDALKRVLSSSGSVLIPAEIGGN 253

Query: 241 VLELLLILEDYWAEHS-----LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            L+L +++ D+  E S        P+  + Y     + Y +S LEW+  S+ K +E SRD
Sbjct: 254 FLDLFVLVHDFLYESSKSRLFAQVPVLLVAYSRGRVLTYARSMLEWLSSSLLKIWE-SRD 312

Query: 296 NA--FLLKHVTLLINKSELDNAPDGPKLVLASMAS--LEAGFSHDIFVEWAS-------- 343
           N   F L     +I  ++L     GPK+   S     ++   S     E  +        
Sbjct: 313 NRSPFDLGSRFHVIAPTDLTKY-SGPKICFVSQVETLVDEVISRLCQTERTTIILTSSDN 371

Query: 344 -DVKNLVLFTERGQFGTLARML---QADPPPKAVKVTMSRRVPLVGEELIAYEEEQT-RL 398
            D + L +  +        R     Q+    +++ +   +  P+ GEEL  Y    T R 
Sbjct: 372 DDTRTLSVLHKNWDLAQKQRGAEEGQSISYSESLTLKTVQTKPMTGEELEQYVAGITERK 431

Query: 399 KKEEALKASLVKEEE-----SKASLGPDNNLSGDPMVIDANNANASA--------DVVEP 445
            K + L+ SL K+ +     S+   G D+  SG+      +              D+++ 
Sbjct: 432 TKRKELEESLHKDVKLAGKISRRLDGKDD--SGNMREDGQDPEEDDDEDEDENLLDILKE 489

Query: 446 H-----GGRYRDILIDGFVPPSTSVA-PMFPFYENNSEWDDFGEVINPDDYIIK-DEDMD 498
                 G    DI +D  + P++     MFPF     + DD+G  ++    I   DE+MD
Sbjct: 490 KSSTSTGQTAIDIPVDYLIQPTSQPKHKMFPFQPAKIKSDDYGTFVDFSSLIQNDDEEMD 549

Query: 499 Q 499
           Q
Sbjct: 550 Q 550


>gi|350646480|emb|CCD58879.1| cleavage and polyadenylation specificity factor,putative
           [Schistosoma mansoni]
          Length = 729

 Score =  123 bits (308), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 62/159 (38%), Positives = 92/159 (57%), Gaps = 5/159 (3%)

Query: 200 DAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS-- 256
           D  N L+ QP R+ R E  +  + K+LR GGNVL+ VD+AGR LE+   LE  W      
Sbjct: 2   DGSNTLYTQPRRKDRDENLRQTVLKSLRRGGNVLIAVDTAGRCLEVAHFLEQCWLNQESG 61

Query: 257 -LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNA 315
            + Y +  L YV+ + +D+ KS +EWM + + +SFE  R N F  +H+ L     +LD A
Sbjct: 62  LMAYGLAMLNYVALNVVDFAKSMVEWMSEKVMRSFEDQRSNPFHFRHMQLCHTLEQLD-A 120

Query: 316 PDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
              PK+VL+S++ L  GFS  +F EWA +  N ++ T +
Sbjct: 121 VSEPKVVLSSLSDLSCGFSRQLFAEWADNDLNTIILTSQ 159



 Score = 79.0 bits (193), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 63/250 (25%), Positives = 109/250 (43%), Gaps = 67/250 (26%)

Query: 505 GGDDGKLDEGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQHCLKHVC---PHVYTP 561
           G  DG   E    +++  +P +++      LV  +A A +HL  +C   +     +++ P
Sbjct: 499 GRSDG---EAMKRILIGLRPQEII------LVGNNAPAIDHLANYCRGVMLLDPNYIHIP 549

Query: 562 QIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVG--------------- 606
              E ++ T +   Y+ ++ + L+S++ F K+ DYE+AWV+A V                
Sbjct: 550 HPREIVNCTKEGDIYQARMKDSLVSSLKFTKIRDYELAWVEATVSLDDKFDYHIKEKRNN 609

Query: 607 -----------------KTENGM------------LSLLPIST-PAPPHKSVLVGDLKMA 636
                             T N +            L +L + T P   HK+V V + K++
Sbjct: 610 NNTGNNDNDDDNGDVEMSTGNNLELRSRTPLAADQLPVLSLPTGPIGQHKTVFVNEPKLS 669

Query: 637 DLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIR 696
           DLK  L S+G+  EF  G L     V I++          S   ++++EG LC  Y+++R
Sbjct: 670 DLKQLLLSQGLMAEFVSGILVVDNCVAIKR----------SEAGKLLLEGLLCGTYFEVR 719

Query: 697 AYLYSQFYLL 706
             LY QF +L
Sbjct: 720 RILYQQFAIL 729


>gi|403297740|ref|XP_003939710.1| PREDICTED: integrator complex subunit 11 isoform 2 [Saimiri
           boliviensis boliviensis]
          Length = 499

 Score =  123 bits (308), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 71/207 (34%), Positives = 112/207 (54%), Gaps = 7/207 (3%)

Query: 146 VAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNAL 205
           +  + AGH+LG  +++I    E V+Y  DYN   ++HL    ++   RP +LIT++  A 
Sbjct: 49  IKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CRPNLLITESTYAT 107

Query: 206 HNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFL 264
             +  ++ RE  F   + +T+  GG VL+PV + GR  EL ++LE +W   +L  PIYF 
Sbjct: 108 TIRDSKRCRERDFLKKVHETVEHGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFS 167

Query: 265 TYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLA 324
           T ++     Y K F+ W    I K+F   + N F  KH+    +++  DN   GP +V A
Sbjct: 168 TGLTEKANHYYKLFIPWTNQKIRKTF--VQRNMFEFKHIKAF-DRAFADNP--GPMVVFA 222

Query: 325 SMASLEAGFSHDIFVEWASDVKNLVLF 351
           +   L AG S  IF +WA + KN+V+ 
Sbjct: 223 TPGMLHAGQSLQIFRKWAGNEKNMVIM 249


>gi|326482980|gb|EGE06990.1| endoribonuclease ysh1 [Trichophyton equinum CBS 127.97]
          Length = 818

 Score =  123 bits (308), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 85/335 (25%), Positives = 159/335 (47%), Gaps = 25/335 (7%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H G+LPY + +      VF T     +    + D        S  D
Sbjct: 74  STVDILLISHFHLDHSGSLPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVSNTSSSSD 133

Query: 113 ----LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
               L+   D  S    +  + ++  + ++     I + P  AGH+LG  ++ I+  G +
Sbjct: 134 QRTSLYNEHDHLSTLPIIETIDFNTTHTINS----IRITPFPAGHVLGAAMFLISIAGLN 189

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRA 227
           +++  DY+R +++HL    +   V+  V+IT++   + + PPR +RE     +++  +  
Sbjct: 190 ILFTGDYSREEDRHLISAEVPKGVKIDVMITESTFGISSNPPRLEREAALMKSVTSIINR 249

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG VL+PV + GR  ELLLIL++YW+ H      PIY++  ++   +   ++++  M ++
Sbjct: 250 GGRVLMPVFALGRAQELLLILDEYWSRHPELQKVPIYYIGNMARRCMVVYQTYIGAMNEN 309

Query: 286 ITKSFETSRDNA------------FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGF 333
           I + F      A            +  + V  L N    ++   G  ++LAS   L+ G 
Sbjct: 310 IKRLFRQRMAEAEARGDKSVTAGPWDFRFVRSLRNLDRFEDV--GGCVMLASPGMLQTGT 367

Query: 334 SHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           S ++   WA + +N V+ T     GT+ + +  +P
Sbjct: 368 SRELLERWAPNERNGVIMTGYSVEGTMGKQIINEP 402


>gi|326475916|gb|EGD99925.1| endoribonuclease ysh1 [Trichophyton tonsurans CBS 112818]
          Length = 855

 Score =  123 bits (308), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 85/335 (25%), Positives = 159/335 (47%), Gaps = 25/335 (7%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H G+LPY + +      VF T     +    + D        S  D
Sbjct: 74  STVDILLISHFHLDHSGSLPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVSNTSSSSD 133

Query: 113 ----LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
               L+   D  S    +  + ++  + ++     I + P  AGH+LG  ++ I+  G +
Sbjct: 134 QRTSLYNEHDHLSTLPIIETIDFNTTHTINS----IRITPFPAGHVLGAAMFLISIAGLN 189

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRA 227
           +++  DY+R +++HL    +   V+  V+IT++   + + PPR +RE     +++  +  
Sbjct: 190 ILFTGDYSREEDRHLISAEVPKGVKIDVMITESTFGISSNPPRLEREAALMKSVTSIINR 249

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG VL+PV + GR  ELLLIL++YW+ H      PIY++  ++   +   ++++  M ++
Sbjct: 250 GGRVLMPVFALGRAQELLLILDEYWSRHPELQKVPIYYIGNMARRCMVVYQTYIGAMNEN 309

Query: 286 ITKSFETSRDNA------------FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGF 333
           I + F      A            +  + V  L N    ++   G  ++LAS   L+ G 
Sbjct: 310 IKRLFRQRMAEAEARGDKSVTAGPWDFRFVRSLRNLDRFEDV--GGCVMLASPGMLQTGT 367

Query: 334 SHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           S ++   WA + +N V+ T     GT+ + +  +P
Sbjct: 368 SRELLERWAPNERNGVIMTGYSVEGTMGKQIINEP 402


>gi|327293421|ref|XP_003231407.1| endoribonuclease ysh1 [Trichophyton rubrum CBS 118892]
 gi|326466523|gb|EGD91976.1| endoribonuclease ysh1 [Trichophyton rubrum CBS 118892]
          Length = 855

 Score =  123 bits (308), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 88/333 (26%), Positives = 160/333 (48%), Gaps = 21/333 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H G+LPY + +      VF T     +    + D        S  D
Sbjct: 74  STVDILLISHFHLDHSGSLPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVSNTSSSSD 133

Query: 113 ----LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
               L+   D  S    +  + ++  + ++     I + P  AGH+LG  ++ I+  G +
Sbjct: 134 QRTSLYNEHDHLSTLPIIETIDFNTTHAINS----IRITPFPAGHVLGAAMFLISIAGLN 189

Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRA 227
           +++  DY+R +++HL    +   V+  V+IT++   + + PPR +RE     +++  +  
Sbjct: 190 ILFTGDYSREEDRHLISAEVPKGVKIDVMITESTFGISSNPPRLEREAALMKSVTSIINR 249

Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
           GG VL+PV + GR  ELLLIL++YW+ H      PIY++  ++   +   ++++  M ++
Sbjct: 250 GGRVLMPVFALGRAQELLLILDEYWSRHPELQKVPIYYIGNMARRCMVVYQTYIGAMNEN 309

Query: 286 ITKSFETSRDNAFLL--KHVT-------LLINKSELDNAPD-GPKLVLASMASLEAGFSH 335
           I + F      A     K VT        + +   LD   D G  ++LAS   L+ G S 
Sbjct: 310 IKRLFRQRMAEAEARGDKSVTAGPWDFRFVRSLRNLDRFEDVGGCVMLASPGMLQTGTSR 369

Query: 336 DIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           ++   WA + +N V+ T     GT+ + +  +P
Sbjct: 370 ELLERWAPNERNGVIMTGYSVEGTMGKQIINEP 402


>gi|70999860|ref|XP_754647.1| cleavage and polyadenylylation specificity factor [Aspergillus
           fumigatus Af293]
 gi|66852284|gb|EAL92609.1| cleavage and polyadenylylation specificity factor, putative
           [Aspergillus fumigatus Af293]
          Length = 1013

 Score =  122 bits (307), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 106/413 (25%), Positives = 166/413 (40%), Gaps = 103/413 (24%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G   L+D GW+D FD   L  L K   T+  +LL+H    HLGA  +  +   L    PV
Sbjct: 26  GIKILVDVGWDDTFDTLDLLELEKHIPTLSLILLTHATPAHLGAFVHCCRTFPLFTQIPV 85

Query: 85  FSTEPVYRLGLLTMYDQYLS---------RRQVSE------------------------- 110
           ++T PV  LG   + D Y S         +  +SE                         
Sbjct: 86  YATSPVIALGRTLLQDLYASSPLAATFLPKASISEPGASTSAASAAASVTDGEGNTPASS 145

Query: 111 -----FDLFTLDDIDSAFQSVTRLTYSQNYH-----LSGKGEGIVVAPHVAGHLLGGTVW 160
                    T ++I   F  +  L YSQ +       S    G+ +  + AGH +GGT+W
Sbjct: 146 AGRILLQPPTAEEIARYFSLIHPLKYSQPHQPLSSPFSPPLNGLTLTAYNAGHTVGGTIW 205

Query: 161 KITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAYNALHNQ 208
            I    E ++YAVD+N+ +E  + G             V+E   +P  L+          
Sbjct: 206 HIQHGMESIVYAVDWNQARESVVAGAAWFGGSGASGAEVIEQLRKPTALVCSTRGGDKFA 265

Query: 209 PP--RQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--------- 256
            P  R++R+ +  D I  +L  GG VL+P D++ RVLEL   LE  W + +         
Sbjct: 266 LPGGRKKRDDLLLDMIRSSLAKGGTVLIPTDTSARVLELAYALEHAWRDVAGGNNESDMA 325

Query: 257 -LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET----------SRDNA-------- 297
             N  +Y     + +T+   +S LEWM ++I + FE           ++ NA        
Sbjct: 326 LKNAGLYLAGRKAHTTMRLARSMLEWMDENIVREFEAAEGVDAVTGQTQSNADGQRSGGQ 385

Query: 298 ------------FLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHD 336
                       F  KH+  +  +  L+       PK+++AS  SL+ GF+ +
Sbjct: 386 GQGKGGSKGLGPFTFKHLRTVERRKRLEKILTDQKPKVIIASDTSLDWGFAKE 438


>gi|68077031|ref|XP_680435.1| cleavage and polyadenylation specificity factor protein [Plasmodium
           berghei strain ANKA]
 gi|56501360|emb|CAH96636.1| cleavage and polyadenylation specificity factor protein, putative
           [Plasmodium berghei]
          Length = 967

 Score =  122 bits (307), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 91/351 (25%), Positives = 160/351 (45%), Gaps = 41/351 (11%)

Query: 44  LLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLG------LSAPVFSTEPVYRLGLLT 97
           L+  L K+   ID V++SH    H+GALP+  + L       +S P  +  PV  L    
Sbjct: 94  LINNLKKINEMIDCVIISHFHMDHIGALPFFTEILQYKGTIIMSYPTKALSPVLLLDGCK 153

Query: 98  MYDQYLSRRQVSEF---------DLF--------------TLDDIDSAFQSVTRLTYSQN 134
           + D    ++ + +          DL               T ++I +    V  L  ++ 
Sbjct: 154 ISDMKWEKKNLEKQIKMLNEKSDDLLNYNINCLKKDPWNITEENIYNCINKVVGLQVNET 213

Query: 135 YHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRP 194
           Y L      I + P+ AGH+LG  ++++  +   VIY  DYN   +KHL  T +   + P
Sbjct: 214 YELGD----ISITPYYAGHVLGACMYRLEVNNISVIYTGDYNTIPDKHLGSTKI-PVLTP 268

Query: 195 AVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 253
            + I+++  A + +P R+  E+   + +++ +  GG VL+PV + GR  EL ++LE+YW 
Sbjct: 269 EIFISESTYASYVRPTRKSSELELCNLVNECVHKGGKVLIPVFAIGRAQELSILLEEYWE 328

Query: 254 EHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELD 313
           +  +N PIYF   ++ +   Y K +  W+ ++      T   N F   +++   N    +
Sbjct: 329 KMKINCPIYFGCGLTENANKYYKIYSSWISNNCV---STEVKNLFDFSNISQFSNNYLNE 385

Query: 314 NAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML 364
           N    P ++ A+   L  G +   F  WAS+  NL++       GT+   L
Sbjct: 386 NR---PMVLFATPGMLHTGLALKAFKAWASNPNNLIILPGYCVQGTIGHKL 433


>gi|156082980|ref|XP_001608974.1| RNA-metabolising metallo-beta-lactamase and metallo-beta-lactamase
           superfamily domain containing protein [Babesia bovis
           T2Bo]
 gi|154796224|gb|EDO05406.1| RNA-metabolising metallo-beta-lactamase and  metallo-beta-lactamase
           superfamily domain containing protein [Babesia bovis]
          Length = 760

 Score =  122 bits (307), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 93/341 (27%), Positives = 153/341 (44%), Gaps = 42/341 (12%)

Query: 43  SLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-- 100
           +L + L+ + S ID  ++SH    H+GALP+  + LG   PVF T P   LG + + D  
Sbjct: 109 ALKKSLNDITSNIDCAIISHFHLDHIGALPFLTEHLGYKGPVFMTYPTRGLGPIMLRDSA 168

Query: 101 ------------------------------QYLSRRQVSEFDLFTLDDIDSAFQSVTRLT 130
                                         + L+  Q+  FD +    +D    S++R  
Sbjct: 169 QVVTSRFRDAIETESSTRGASILLNRNKKRKPLTAEQLDRFDPWGYT-VDCVADSLSRAH 227

Query: 131 YSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLES 190
             Q       G  + + P+ AGH+LG  ++ +  DG  V+Y  D+N   +KHL    + S
Sbjct: 228 VMQLKSSQTLG-NMRITPYYAGHVLGAAMFLVECDGISVLYTGDFNMTPDKHLGPARVPS 286

Query: 191 FVRPAVLITDA-YNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILE 249
            + P ++I ++ Y ++  Q  R         +   L AGG VL+PV + GR  EL +IL+
Sbjct: 287 -LNPDIMICESTYASIIRQARRSTEMELCTVVHDCLLAGGKVLIPVFAVGRAQELAIILD 345

Query: 250 DYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINK 309
            YW++  L +PIYF   +S     Y K    W   + +++     DN F L+H+    N 
Sbjct: 346 TYWSKLQLRFPIYFGGGLSERATSYYKLHSLW---TDSRNIPNMGDNCFSLEHMLPFENS 402

Query: 310 SELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
              +   D P ++ A+   + +G S      WA + KNL++
Sbjct: 403 FLTE---DRPMVLFATPGMVHSGLSLKACKLWAPNPKNLIV 440


>gi|159127661|gb|EDP52776.1| cleavage and polyadenylylation specificity factor, putative
           [Aspergillus fumigatus A1163]
          Length = 1013

 Score =  122 bits (307), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 106/413 (25%), Positives = 166/413 (40%), Gaps = 103/413 (24%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G   L+D GW+D FD   L  L K   T+  +LL+H    HLGA  +  +   L    PV
Sbjct: 26  GIKILVDVGWDDTFDTLDLLELEKHIPTLSLILLTHATPAHLGAFVHCCRTFPLFTQIPV 85

Query: 85  FSTEPVYRLGLLTMYDQYLS---------RRQVSE------------------------- 110
           ++T PV  LG   + D Y S         +  +SE                         
Sbjct: 86  YATSPVIALGRTLLQDLYASSPLAATFLPKASISEPGASTSAASAAASVTDGEGNTPASS 145

Query: 111 -----FDLFTLDDIDSAFQSVTRLTYSQNYH-----LSGKGEGIVVAPHVAGHLLGGTVW 160
                    T ++I   F  +  L YSQ +       S    G+ +  + AGH +GGT+W
Sbjct: 146 AGRILLQPPTAEEIARYFSLIHPLKYSQPHQPLSSPFSPPLNGLTLTAYNAGHTVGGTIW 205

Query: 161 KITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAYNALHNQ 208
            I    E ++YAVD+N+ +E  + G             V+E   +P  L+          
Sbjct: 206 HIQHGMESIVYAVDWNQARESVVAGAAWFGGSGASGAEVIEQLRKPTALVCSTRGGDKFA 265

Query: 209 PP--RQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--------- 256
            P  R++R+ +  D I  +L  GG VL+P D++ RVLEL   LE  W + +         
Sbjct: 266 LPGGRKKRDDLLLDMIRSSLAKGGTVLIPTDTSARVLELAYALEHAWRDVAGGNNESDMA 325

Query: 257 -LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET----------SRDNA-------- 297
             N  +Y     + +T+   +S LEWM ++I + FE           ++ NA        
Sbjct: 326 LKNAGLYLAGRKAHTTMRLARSMLEWMDENIVREFEAAEGVDAVTGQTQSNADGQRSGGQ 385

Query: 298 ------------FLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHD 336
                       F  KH+  +  +  L+       PK+++AS  SL+ GF+ +
Sbjct: 386 GQGKGGSKGLGPFTFKHLRTVERRKRLEKILTDQKPKVIIASDTSLDWGFAKE 438


>gi|119491987|ref|XP_001263488.1| cleavage and polyadenylylation specificity factor, putative
           [Neosartorya fischeri NRRL 181]
 gi|119411648|gb|EAW21591.1| cleavage and polyadenylylation specificity factor, putative
           [Neosartorya fischeri NRRL 181]
          Length = 1013

 Score =  122 bits (306), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 105/413 (25%), Positives = 166/413 (40%), Gaps = 103/413 (24%)

Query: 27  GFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL--SAPV 84
           G   L+D GW+D FD   L  L K   T+  +LL+H    H+GA  +  K   L    PV
Sbjct: 26  GIKILVDVGWDDTFDTLDLLELEKHIPTLSLILLTHATPAHIGAFVHCCKTFPLFTQIPV 85

Query: 85  FSTEPVYRLGLLTMYDQYLS---------RRQVSE------------------------- 110
           ++T P+  LG   + D Y S         +  +SE                         
Sbjct: 86  YATSPIIALGRTLLQDLYASSPLAATFLPKASISEPGASTSAASAAASVTDGEGNTAASS 145

Query: 111 -----FDLFTLDDIDSAFQSVTRLTYSQNYH-----LSGKGEGIVVAPHVAGHLLGGTVW 160
                    T ++I   F  +  L YSQ +       S    G+ +  + AGH +GGT+W
Sbjct: 146 AGRILLQPPTAEEIARYFSLIHPLKYSQPHQPLSSPFSPPLNGLTLTAYNAGHTVGGTIW 205

Query: 161 KITKDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLITDAYNALHNQ 208
            I    E ++YAVD+N+ +E  + G             V+E   +P  L+          
Sbjct: 206 HIQHGMESIVYAVDWNQARESVVAGAAWFGGSGASGAEVIEQLRKPTALVCSTRGGDKFA 265

Query: 209 PP--RQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--------- 256
            P  R++R+ +  D I  +L  GG VL+P D++ RVLEL   LE  W + +         
Sbjct: 266 LPGGRKKRDDLLLDMIRSSLAKGGTVLIPTDTSARVLELAYALEHAWRDVAGGNNESDIA 325

Query: 257 -LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET----------SRDNA-------- 297
             N  +Y     + +T+   +S LEWM ++I + FE           ++ NA        
Sbjct: 326 LKNAGLYLAGRKAHTTMRLARSMLEWMDENIVREFEAAEGVDAVTGQTQSNADGQRSGGQ 385

Query: 298 ------------FLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHD 336
                       F  KH+  +  +  L+       PK+++AS  SL+ GF+ +
Sbjct: 386 GQGKGGSKGLGPFTFKHLRTVERRKRLEKILTDQKPKVIIASDTSLDWGFAKE 438


>gi|71027091|ref|XP_763189.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68350142|gb|EAN30906.1| hypothetical protein TP03_0171 [Theileria parva]
          Length = 678

 Score =  122 bits (305), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 101/361 (27%), Positives = 170/361 (47%), Gaps = 45/361 (12%)

Query: 43  SLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-- 100
           +L + L  V +++D  ++SH    H+GALP+  + +G S P++ T P   L  L + D  
Sbjct: 105 ALKKALKNVTNSVDCSVISHFHLDHVGALPFLTEHIGYSGPIYLTYPTRALCPLLLRDSV 164

Query: 101 QYLSRRQVSEFDLFTLDDIDSAFQSV----TRLTYSQN-------------YHLSGKGE- 142
           Q  S R V + D  T+  I+++ +S+    T  TY+ +             Y L+   E 
Sbjct: 165 QVTSTRTVPD-DPNTISSINASVKSLLNCHTNTTYNTDKRRKIEERTDPWGYSLNSVAEC 223

Query: 143 ----------------GIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT 186
                            + + P+ AGH+LG +++    DG  V+Y  D+N   +KHL G 
Sbjct: 224 MKRSIPLQLRATETVGNLNLVPYYAGHVLGASMFLSECDGFKVLYTGDFNTIPDKHL-GP 282

Query: 187 VLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELL 245
                + P VLI ++  A   +  ++  EM     +  TL  GG VL+PV + GR  EL 
Sbjct: 283 AKVPTLEPDVLICESTYATFVRQSKRATEMELCTTVHDTLINGGKVLIPVFAVGRAQELA 342

Query: 246 LILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTL 305
           +IL +YW   S+++PIYF   +S    +Y K    W  ++   S    R+N F L+++ L
Sbjct: 343 IILNNYWNNLSISFPIYFGGGLSEKATNYYKLHSSWTNNN---SITNLRENPFSLRNL-L 398

Query: 306 LINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQ 365
             ++S L++  + P ++ A+   +  G S      W+ +  NL+L       GT+   L 
Sbjct: 399 QFDQSFLND--NRPMVLFATPGMVHTGLSLKACKLWSQNPNNLILIPGYCVQGTVGNKLI 456

Query: 366 A 366
           A
Sbjct: 457 A 457


>gi|82704800|ref|XP_726704.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23482224|gb|EAA18269.1| hypothetical protein [Plasmodium yoelii yoelii]
          Length = 954

 Score =  122 bits (305), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 91/351 (25%), Positives = 160/351 (45%), Gaps = 41/351 (11%)

Query: 44  LLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLG------LSAPVFSTEPVYRLGLLT 97
           L+  L K+   ID V++SH    H+GALP+  + L       +S P  +  PV  L    
Sbjct: 94  LINNLKKINEIIDCVIISHFHMDHIGALPFFTEILQYKGTIIMSYPTKALSPVLLLDGCK 153

Query: 98  MYDQYLSRRQVSEF---------DLF--------------TLDDIDSAFQSVTRLTYSQN 134
           + D    ++ + +          DL               T ++I +    V  L  ++ 
Sbjct: 154 ISDIKWEKKNLEKQIKMLNEKSDDLLNYNINCIKKDPWNITEENIYNCINKVVGLQVNET 213

Query: 135 YHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRP 194
           Y L      I + P+ AGH+LG  ++++  +   VIY  DYN   +KHL  T +   + P
Sbjct: 214 YELGD----ISITPYYAGHVLGACMYRLEVNNISVIYTGDYNTIPDKHLGSTKI-PVLTP 268

Query: 195 AVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 253
            + I+++  A + +P R+  E+   + +++ +  GG VL+PV + GR  EL ++LE+YW 
Sbjct: 269 EIFISESTYASYVRPTRKSSELELCNLVNECVHKGGKVLIPVFAIGRAQELSILLEEYWE 328

Query: 254 EHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELD 313
           +  +N PIYF   ++ +   Y K +  W+ ++      T   N F   +++   N    +
Sbjct: 329 KMKINCPIYFGCGLTENANKYYKIYSSWISNNCV---STEVKNLFDFSNISQFSNNYLNE 385

Query: 314 NAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML 364
           N    P ++ A+   L  G +   F  WAS+  NL++       GT+   L
Sbjct: 386 NR---PMVLFATPGMLHTGLALKAFKAWASNPNNLIILPGYCVQGTIGHKL 433


>gi|10433243|dbj|BAB13943.1| unnamed protein product [Homo sapiens]
          Length = 499

 Score =  122 bits (305), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 70/207 (33%), Positives = 112/207 (54%), Gaps = 7/207 (3%)

Query: 146 VAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNAL 205
           +  + AGH+LG  +++I    E V+Y  DYN   ++HL    ++   RP +LIT++  A 
Sbjct: 49  IKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CRPNLLITESTYAT 107

Query: 206 HNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFL 264
             +  ++ RE  F   + +T+  GG VL+PV + GR  EL ++L+ +W   +L  PIYF 
Sbjct: 108 TIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLKTFWERMNLKVPIYFS 167

Query: 265 TYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLA 324
           T ++     Y K F+ W    I K+F   + N F  KH+    +++  DN   GP +V A
Sbjct: 168 TGLTEKANHYYKLFIPWTNQKIRKTF--VQRNMFEFKHIKAF-DRAFADNP--GPMVVFA 222

Query: 325 SMASLEAGFSHDIFVEWASDVKNLVLF 351
           +   L AG S  IF +WA + KN+V+ 
Sbjct: 223 TPGMLHAGQSLQIFRKWAGNEKNMVIM 249


>gi|358333242|dbj|GAA51791.1| cleavage and polyadenylation specificity factor subunit 3
           [Clonorchis sinensis]
          Length = 697

 Score =  121 bits (303), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 92/348 (26%), Positives = 163/348 (46%), Gaps = 54/348 (15%)

Query: 67  HLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAF 123
           H G LPY + + G+ A  +   +T+ +YR  LL  + +  +   V +  L+T  DI ++ 
Sbjct: 18  HCGGLPYLLLKTGVRAKCYMTHATKAIYRY-LLADFVRVSNSSGVPDQSLYTDRDIIASL 76

Query: 124 QSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHL 183
             +  L + Q   ++G    I      AGH+LG  ++ I   G  V+Y  D++R++++HL
Sbjct: 77  DRIDTLDFHQELEVNG----IKFTAFHAGHVLGAAMFLIEIAGVKVLYTGDFSRQEDRHL 132

Query: 184 NGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVL 242
               +   VRP VLIT+A   +H    R+ RE  F   +   +  GG  L+P  + GR  
Sbjct: 133 MCAEIPH-VRPDVLITEATYGIHIHDKREDREARFTRLVHDIVGRGGRCLIPAFALGRAQ 191

Query: 243 ELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLL 300
           EL+LIL++YWA H    + PIY+ + ++   +   ++++  M + I    + + +N F  
Sbjct: 192 ELMLILDEYWANHPELHDIPIYYASQLARKCMAVYQTYIHAMNEKIRN--QLANNNPFCF 249

Query: 301 KHVT----------------LLINKSEL----------------DNAP--------DGPK 320
           +H++                 L +K+ L                 N P         GP 
Sbjct: 250 RHISNLKAMRSYSISEQTEHALASKAWLYVAYSRFPVIGTVAAGTNVPTSIEHFDDSGPC 309

Query: 321 LVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           +V+AS   +++G S ++F  W +D +N V+       GTLA+ + + P
Sbjct: 310 VVMASPGMMQSGMSRELFENWCTDRRNGVIIAGYCVEGTLAKQILSLP 357


>gi|323451639|gb|EGB07515.1| hypothetical protein AURANDRAFT_27422, partial [Aureococcus
           anophagefferens]
          Length = 178

 Score =  121 bits (303), Expect = 1e-24,   Method: Composition-based stats.
 Identities = 59/154 (38%), Positives = 89/154 (57%), Gaps = 4/154 (2%)

Query: 31  LIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPV 90
           L+DCG +  F+ +  + +  VA  +D VL+SH +  HLGAL  A  + GL AP+++T PV
Sbjct: 25  LLDCGCDVGFEEACFERIGAVAKDVDLVLISHHELRHLGALAAAKARYGLRAPIYATLPV 84

Query: 91  YRLGLLTMYDQYLSRRQVSEFDL----FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVV 146
            +LG +TMY+ +   R     D     FTLDD+D+AF  +  L + Q   L GKG G+V+
Sbjct: 85  TKLGFVTMYEAWAGYRASFGRDAARSKFTLDDVDAAFGKMRPLKFDQPLSLRGKGAGVVI 144

Query: 147 APHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
             H  GH +GG  W++    +D++Y VD +   E
Sbjct: 145 TAHRCGHSVGGAYWRVRLGADDIVYCVDAHHADE 178


>gi|407847992|gb|EKG03521.1| cleavage and polyadenylation specificity factor, putative
           [Trypanosoma cruzi]
          Length = 883

 Score =  120 bits (302), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 114/410 (27%), Positives = 180/410 (43%), Gaps = 34/410 (8%)

Query: 17  NPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMK 76
            P + L+ IDG   L+DCGWND FD S L  L      + AVL S P+ +  GALP+ ++
Sbjct: 108 TPFANLIEIDGVRILLDCGWNDEFDVSFLDTLMPYLGDVHAVLFSTPELVSCGALPFVVE 167

Query: 77  QLGLSAPVFSTEPVYRLGL-------LTMYDQYLSRRQVSEFDL-FTLDDIDSAFQSVTR 128
            +     V +     ++GL       L ++    + R  +  D   T+D + SAF+SVT 
Sbjct: 168 HISTGTCVAAAGSTAKMGLHGVLHPFLYLFPNVKTWRLENGLDFEMTVDKVYSAFRSVTE 227

Query: 129 LTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVL 188
             Y     +  +   +   P  +G +LGG  W I    +++ Y  D++ +         L
Sbjct: 228 -PYGGKVTIRHRDAEVECYPIFSGRMLGGHGWLIKYKIDELFYCPDFSLKPS-----YAL 281

Query: 189 ESFVRPA----VLITDAYNALHNQPPRQQREMFQDAISK---TLRAGGNVLLPVDSAGRV 241
           + F+ P     + I  +   L     R+  E     I +   TLR G +VL+PV  AGR 
Sbjct: 282 KRFLPPTTSTLLFIDGSPFHLSGNTGRKYEEQLNALIREILGTLRNGKDVLIPVSVAGRG 341

Query: 242 LELLLILEDYWAEHSL-NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLL 300
           LE+L I+     E    NY + F +  ++  +    +  E + D I  S     +     
Sbjct: 342 LEILTIVTHLLTEKGGDNYTVVFASIQAAELVAKASTMTEALLDEIILS-----ERQLFA 396

Query: 301 KHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEW----ASDVKNLVLFTERGQ 356
             VT    +  L  A  GPK+ +A   +L+ G S ++   +    A + +NLV+ T   +
Sbjct: 397 NVVTCKTAEEVLSVA--GPKICIADGETLDYGVSAELLGHFLQADADERENLVVLTGAPK 454

Query: 357 FGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKA 406
             T A  + A     A+ +  + R PL  EEL  Y   Q  L+ EE  KA
Sbjct: 455 PHTNAFTMAAAKKGDAIDLRYTIRSPLGKEELEEY-YLQIELEMEEQRKA 503


>gi|388498176|gb|AFK37154.1| unknown [Lotus japonicus]
          Length = 315

 Score =  120 bits (302), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 91/314 (28%), Positives = 147/314 (46%), Gaps = 42/314 (13%)

Query: 7   VTPLSGVFNENPLSYL-VSIDGFNFLIDCG------------WNDHFDPSLLQPLSKVAS 53
           VTPL G  NE   S + +S  G   L DCG            + D  DPS          
Sbjct: 23  VTPL-GAGNEVGRSCVYMSYKGKTVLFDCGIHPAYSGMAALPYFDEIDPS---------- 71

Query: 54  TIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSE 110
           T+D +L++H    H  +LPY +++      VF   +T+ +Y+L L      ++   +VS 
Sbjct: 72  TVDVLLITHFHLDHAASLPYFLEKTTFRGRVFMTYATKAIYKLLL----SDFVKVSKVSV 127

Query: 111 FD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
            D LF   DI+ +   +  +    ++H + +  GI    + AGH+LG  ++ +   G  V
Sbjct: 128 EDMLFDEQDINRSMDKIEVI----DFHQTLEVNGIRFWCYTAGHVLGAAMFMVDIAGVRV 183

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGG 229
           +Y  DY+R +++HL       F     +I   Y   H+QP   + + F D I  T+  GG
Sbjct: 184 LYTGDYSREEDRHLRAAETPQFSPDVCIIESTYGVQHHQPRHTREKRFTDVIHSTISQGG 243

Query: 230 NVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSIT 287
            VL+P  + GR  ELLLIL++YW  H    N PIY+ + ++   +   +++   M D I 
Sbjct: 244 RVLIPAFALGRAQELLLILDEYWTNHPELQNIPIYYASPLAKKCLTVYETYTLSMNDRI- 302

Query: 288 KSFETSRDNAFLLK 301
              + ++ N F  K
Sbjct: 303 ---QNAKSNPFSFK 313


>gi|440298403|gb|ELP91039.1| Cleavage and polyadenylation specificity factor subunit, putative
           [Entamoeba invadens IP1]
          Length = 788

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 99/383 (25%), Positives = 175/383 (45%), Gaps = 40/383 (10%)

Query: 2   GTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWN---DHFDPSLLQPLSKVAS--TID 56
           G+ +++ PL          +++   G N ++DCG +    H + +L  PL +     +I+
Sbjct: 18  GSVLEIKPLGAGREVGRSCFVLKYMGHNIMLDCGVHPAKKHGEDAL--PLFEYGDVDSIE 75

Query: 57  AVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRL--GLLTMYDQYLSRRQ----VSE 110
            + ++H    H  ALPY + +      +  T P   +   L   + Q  S  Q    VS 
Sbjct: 76  LLCVTHFHVDHCAALPYLVLERNYKGKILMTPPTKEIFGELFKEFHQMSSTIQPPKPVSP 135

Query: 111 FDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVI 170
            ++  L+ ID+             +H   +  G+ +    AGH+LG  ++ +  +G  ++
Sbjct: 136 KEV--LERIDTI-----------KFHEMQEFNGMKIWCFNAGHILGAAMFCLEINGVKIL 182

Query: 171 YAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGG 229
           Y  D++   ++H++   +  F    V+I ++   + +Q PR  RE  F   I + L+ GG
Sbjct: 183 YTGDFSGESDRHMHSAEVPPF-EIDVMICESTYGIMDQEPRVDRENRFVKQIVEILKRGG 241

Query: 230 NVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSIT 287
             L+PV S GR  E  LILE+YW  H     Y I+F + ++   + Y + +  +M   + 
Sbjct: 242 KCLIPVFSLGRAQEFELILEEYWQSHKELWAYSIFFFSSIAKKCMTYFEKYTSFMNQELR 301

Query: 288 KSFETSRDNAFLLKHV---TLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASD 344
           K     +  AF  K +   +  ++ S +DN    P +VLAS   L+ GFS  +F  W +D
Sbjct: 302 K----RKRQAFNFKFIRDGSSSVDDSTIDNH---PCVVLASPGMLQDGFSRTLFERWCTD 354

Query: 345 VKNLVLFTERGQFGTLARMLQAD 367
             N V+       GTLA+ +  D
Sbjct: 355 KNNGVIIPGYCVEGTLAKQIIND 377


>gi|167394445|ref|XP_001733538.1| cleavage and polyadenylation specificity factor [Entamoeba dispar
           SAW760]
 gi|165894673|gb|EDR22582.1| cleavage and polyadenylation specificity factor, putative
           [Entamoeba dispar SAW760]
          Length = 688

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 82/275 (29%), Positives = 142/275 (51%), Gaps = 22/275 (8%)

Query: 18  PLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQ 77
           P+S L+ I+    L+DCG + +F   +++    + S ID VL+SH D  H+GALPY   +
Sbjct: 16  PVSALLEINSTKILLDCGVDCNFTREIIEKYDSI-SDIDIVLISHSDLRHMGALPYIANK 74

Query: 78  LGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHL 137
              +  +++T+PV ++G L M  + +  +Q+  +  + L D++  ++ +  L Y   Y L
Sbjct: 75  -NPNCSIYTTDPVGKMGYLCM-KEAIKTQQLIGYPCYRLKDVEQTYKRIFLLEY---YKL 129

Query: 138 SGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVL 197
              GE + V+ H +G  LGGT WKI    +++IYAV  +      + G+ +  F RP VL
Sbjct: 130 QKCGE-VEVSAHPSGRTLGGTNWKICNGCDEIIYAVGNDLNNGFVIEGSKIMKFNRPMVL 188

Query: 198 ITDAYNALHNQPPRQQREMFQDA---ISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +TD    +  Q   Q  EM  +    I K +   G  LLPV+  GR++E + ++     +
Sbjct: 189 LTD----IGGQGKCQ--EMLNNVMMEIRKIVLRKGCCLLPVECGGRIMEYMEMVY-ISCD 241

Query: 255 HSLNYPI-----YFLTYVSSSTIDYVKSFLEWMGD 284
             +N  I     Y ++ V+    +  K+ +EW+ D
Sbjct: 242 VDINRVIKDASFYCISSVADQIKEMNKTIMEWVRD 276


>gi|428671580|gb|EKX72498.1| cleavage and polyadenylation specificity factor, putative [Babesia
           equi]
          Length = 656

 Score =  120 bits (301), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 95/342 (27%), Positives = 157/342 (45%), Gaps = 32/342 (9%)

Query: 48  LSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD--QYLSR 105
           L+ + +T+D  ++SH    H+GALP+  +QL  + PV+ T P   L  + + D  Q  ++
Sbjct: 99  LNDLTNTLDCAIISHFHLDHVGALPFLTEQLKFNGPVYMTWPTKALSPILLRDSAQVTAQ 158

Query: 106 RQVSE--FDLFTLDDIDSAFQSVTRLTYSQN---YHLSGKGE-----------------G 143
           R V +   +L  L ++ +  +S  R   + +   Y+L    E                  
Sbjct: 159 RTVKQDKENLRNLLNMRTDSESHKRRKGADDPWGYNLGPATESVKKAIALQLQETRHIGN 218

Query: 144 IVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYN 203
           I + P+ AGH+LG  ++ +  DG  V+Y  D+N   +KHL    +     P VLI ++  
Sbjct: 219 IKITPYYAGHVLGAAMFHVECDGFSVLYTGDFNTVPDKHLGPAKVPRLC-PDVLICESTY 277

Query: 204 ALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIY 262
           A   + PR+  EM     +  TL  GG VL+PV + GR  EL +IL+ YW++  L YPIY
Sbjct: 278 ATVVRQPRKATEMELCTVVHDTLLKGGKVLIPVFAVGRAQELAIILDSYWSKLELKYPIY 337

Query: 263 FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLV 322
           F   +S    +Y K    W  +    +     +N F + ++    N    +N    P ++
Sbjct: 338 FGGGLSEKATNYYKLHSCWTNE---HNIPGLNENTFSMSYIQPFDNGYLNENR---PMVL 391

Query: 323 LASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML 364
            A+   + AG S      WA +  NL++       GT+   L
Sbjct: 392 FATPGMVHAGLSLRACKLWAPNPNNLIVIPGYCVQGTVGNKL 433


>gi|402465801|gb|EJW01455.1| hypothetical protein EDEG_00447 [Edhazardia aedis USNM 41457]
          Length = 774

 Score =  120 bits (301), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 102/371 (27%), Positives = 170/371 (45%), Gaps = 30/371 (8%)

Query: 5   VQVTPLSGVFNENPLSYL-VSIDGFNFLIDCGWNDHFDPSLLQPLSKVAS--TIDAVLLS 61
           +++TPL G  NE   S + +       L+D G +  F      P   V     IDA+ ++
Sbjct: 7   LKITPL-GAGNEVGRSCIHIEYKQTQLLLDIGIHPAFTGPCALPFLDVIDLHKIDALFVT 65

Query: 62  HPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDS 121
           H    H GALPY  ++      +F T P   +    + D        S  D++T  D+ +
Sbjct: 66  HFHLDHAGALPYLTEKTNFKGKIFMTHPTKSILKYLLNDYTKVVNASSNEDMYTEADLKN 125

Query: 122 AFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEK 181
            +  +  + Y Q      K + I V    AGH+LG  ++ +    + ++Y  DY+   ++
Sbjct: 126 CYNKIFAIDYFQEI----KIKDIKVVSLNAGHVLGAAMFLLKIGSKKLLYTGDYSTEPDR 181

Query: 182 HLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGR 240
           HL        +    LIT++   +    PR++RE  F +A+   ++  G VLLPV + GR
Sbjct: 182 HLKEAKCPGKIN--FLITESTYGVQCHLPREEREKRFLNAVRDIIKRRGKVLLPVFALGR 239

Query: 241 VLELLLILEDYW--AEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA- 297
             E+LLILE+YW   E   N PIY+ + ++   I             I + +  S  N  
Sbjct: 240 AQEILLILEEYWDNNEDLQNVPIYYASALARRCI------------GIYQQYSQSDKNVD 287

Query: 298 FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQF 357
           F  K++    N +  D+  + P +V+AS   L++G S D+F +W  D +N V+       
Sbjct: 288 FKFKYIR---NINTFDDR-NLPCVVMASPGMLQSGLSRDLFEKWCEDKRNGVIIAGYCVQ 343

Query: 358 GTLARMLQADP 368
           GTLA+ +  +P
Sbjct: 344 GTLAKEILNEP 354


>gi|402217247|gb|EJT97328.1| Metallo-hydrolase/oxidoreductase [Dacryopinax sp. DJM-731 SS1]
          Length = 780

 Score =  120 bits (300), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 87/331 (26%), Positives = 166/331 (50%), Gaps = 24/331 (7%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLS---APVFSTEP---VYRLGLLTMYDQYLSRR 106
           ST+DA+L++H    H  +L Y M++         V+ T P   VYRL ++  Y +  + +
Sbjct: 60  STVDALLITHFHLDHAASLTYIMEKTNFKDGKGKVYMTHPTKAVYRL-MMQDYVRMSAAQ 118

Query: 107 QVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG 166
             S   LFT  D+      +  ++++    +     G+   P+ AGH+LG +++ I    
Sbjct: 119 STSAPPLFTPLDLSITLPLINAVSFATTTTVI---PGLSFTPYPAGHVLGASMFLIQLAD 175

Query: 167 EDVIYAVDYNRRKEKHLNGTVLESFVRPA-----VLITDAYNALHNQPPRQQREMFQDAI 221
             ++Y  DY+R + +HL    + + V P      ++I   +     +  R++ E F   I
Sbjct: 176 LRILYTGDYSREESRHL----VRAEVPPGAGIDVLIIESTFGVQSTEGRREKEERFTSLI 231

Query: 222 SKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFL 279
            + L  GG+VL+PV + G   ELLLIL+D++ +H     +PIY+ + ++   +   + ++
Sbjct: 232 HRILMRGGHVLMPVFAVGGAQELLLILDDFFEKHPELHKFPIYYASALARKCMAVYQGYV 291

Query: 280 EWMGDSITKSFETSRDNAFLLKHVTLLINKS--ELDNAPDGPKLVLASMASLEAGFSHDI 337
             M ++I + F  ++ N F+ +HV+ +   S  E       P ++LAS   +++G S ++
Sbjct: 292 HVMNNNIRQRFANNQ-NPFVFRHVSHIPRSSGWEKKIGEGPPCVILASPGMMQSGASREL 350

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADP 368
              WA D +N ++ T     G++AR +  +P
Sbjct: 351 LEMWAPDRRNGIVLTGYSVEGSMARNIMNEP 381


>gi|71656590|ref|XP_816840.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
           strain CL Brener]
 gi|50363263|gb|AAT75334.1| cleavage polyadenylation specificity factor CPSF100 [Trypanosoma
           cruzi]
 gi|70881994|gb|EAN94989.1| cleavage and polyadenylation specificity factor, putative
           [Trypanosoma cruzi]
          Length = 802

 Score =  120 bits (300), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 115/431 (26%), Positives = 187/431 (43%), Gaps = 40/431 (9%)

Query: 2   GTSVQVTPLSGV------FNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTI 55
            +S+++T L G           P + L+ IDG   L+DCGWND FD + L  L      +
Sbjct: 6   ASSIKLTNLYGAPTGDTYHPSTPFANLIEIDGVRILLDCGWNDEFDVNFLDALMPYLGDV 65

Query: 56  DAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGL-------LTMYDQYLSRRQV 108
            AVL S P+ +  GALP+ M+ +     V +     ++GL       L ++    + R  
Sbjct: 66  HAVLFSTPELVSCGALPFVMEHIPTGTCVAAAGSTAKMGLHGVLHPFLYLFPNVKTWRLE 125

Query: 109 SEFDL-FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGE 167
           +  D   T+D + SAF+SVT   Y     +  +   +   P  +G +LGG  W I    +
Sbjct: 126 NGLDFEMTVDKVYSAFRSVTE-PYGGKVTIRHRDAEVECYPIFSGRMLGGHGWLIKYKID 184

Query: 168 DVIYAVDYNRRKEKHLNGTVLESFVRPA----VLITDAYNALHNQPPRQQREMFQDAISK 223
           ++ Y  D++ +         L+ F+ P     + I  +   L     R+  E     I +
Sbjct: 185 ELFYCPDFSLKP-----SYALKRFLPPTTSTLLFIDGSPFHLSGNTGRKYEEQLNALIRE 239

Query: 224 ---TLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL-NYPIYFLTYVSSSTIDYVKSFL 279
              TLR G +VL+PV   GR LE+L I+     E    NY + F +  ++  +    +  
Sbjct: 240 ILGTLRNGKDVLIPVSVVGRGLEILTIVTHLLTEKGGDNYTVVFASIQAAELVAKASTMT 299

Query: 280 EWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFV 339
           E + D I  S      N    K    +++ +       GPK+ +A   +L+ G S ++  
Sbjct: 300 EALLDEIILSERQLFANVVTCKTAEEVLSVA-------GPKICIADGETLDYGVSAELLG 352

Query: 340 EW----ASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQ 395
            +    A + +NLV+ T   +  T A  + A     A+ +  + R PL  EEL  Y   Q
Sbjct: 353 HFLQADADERENLVVLTGAPKPHTNAFTMAAAKKGDAIDLRYTIRSPLGKEELEEY-YLQ 411

Query: 396 TRLKKEEALKA 406
             L+ EE  KA
Sbjct: 412 IELEMEEQRKA 422


>gi|281206064|gb|EFA80253.1| beta-lactamase domain-containing protein [Polysphondylium pallidum
           PN500]
          Length = 656

 Score =  120 bits (300), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 74/238 (31%), Positives = 125/238 (52%), Gaps = 10/238 (4%)

Query: 135 YHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRP 194
           YH   + +GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL G      V  
Sbjct: 45  YHEKLEHKGIKFCCYNAGHVLGAAMFMIEIAGVKILYTGDFSRQEDRHLMGAETPP-VNV 103

Query: 195 AVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 253
            +LI ++   +    PR +RE  F  +I + ++ GG  L+PV + GR  ELLLIL++YW 
Sbjct: 104 DILIIESTYGVQVHEPRLEREKRFTSSIHEVVKRGGRCLIPVFALGRAQELLLILDEYWI 163

Query: 254 EHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSE 311
            H      PIY+ + ++   +   ++++  M + I   F+ S  N F  KH+    N S 
Sbjct: 164 AHPELQKIPIYYASALARKCMSVYQTYINMMNERIRAQFDLS--NPFSFKHIE---NISG 218

Query: 312 LDN-APDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           ++    DGP + +AS   L++G S  +F  W SD  N V+       GTLA+ + ++P
Sbjct: 219 IERFTDDGPCVFMASPGMLQSGLSRQLFERWCSDKMNGVVIPGYNVEGTLAKHIMSEP 276


>gi|399216276|emb|CCF72964.1| unnamed protein product [Babesia microti strain RI]
          Length = 916

 Score =  119 bits (299), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 100/381 (26%), Positives = 175/381 (45%), Gaps = 26/381 (6%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNF----LIDCGWNDHFDPSLLQPLSKVASTID 56
           MG  V + P   +  ++  + LVSI   N+    L+DCG +D F+   ++ L   +  I 
Sbjct: 1   MGMYVTIQP---ILTDSEWATLVSIKLSNYRIKLLVDCGLSDGFNCHSIKKLLMQSIGIK 57

Query: 57  AVLLSHPDTLHLGALPYAMKQ---LGLSAPVFSTEPVYRLG---LLTMYDQYLSRRQVSE 110
            + L+H    H+G LP+ M++   L     +  T+P Y+L    LL + D        S+
Sbjct: 58  YIFLTHSTLEHVGGLPFLMRKYTKLRNKPQIICTDPTYKLAKANLLDLVDNMSLNLPKSK 117

Query: 111 FDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVI 170
              ++ D+I+SA  +   L Y ++  L    +G+ +    +GH +GG+ + +T   + ++
Sbjct: 118 LH-YSADEINSALSNSKLLRYDEHITLDSAIDGLSLHVINSGHSVGGSAYVLTMGTKQIL 176

Query: 171 YAVDYNRRKEKHLNGTVLESFVRPAVLITD----AYNA--LHNQPPRQQREMFQDAISKT 224
            A   +   + HLN   L +   P +LITD    + NA  LH+       +M       T
Sbjct: 177 IARKISLISKWHLNSLSLSTVNNPYLLITDFPKLSINACLLHS-----SLDMVIHKTINT 231

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL-NYPIYFLTYVSSSTIDYVKSFLEWMG 283
           L+ G  VLLP+D   R++ELL   E  W  H +  +P+   + + S       + +E+M 
Sbjct: 232 LKNGNCVLLPIDIDSRMVELLHHFEMCWKSHYVAKWPLIIASPIVSKMSLIFSTSIEYMS 291

Query: 284 DSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWAS 343
             +   F     N  +  +V  L    +L    + P ++ ++  SL  GFS+ +F    S
Sbjct: 292 SKVKSEFSRDLKNPLIFDNVIYLDKLEQLKPFTNVPCVIFSTPGSLNWGFSNALFAAIGS 351

Query: 344 DVKNLVLFTERGQFGTLARML 364
              NL++ ++     TLAR L
Sbjct: 352 KKGNLIILSKEPTTKTLARKL 372


>gi|449435476|ref|XP_004135521.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           3-I-like [Cucumis sativus]
          Length = 392

 Score =  119 bits (299), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 93/337 (27%), Positives = 159/337 (47%), Gaps = 44/337 (13%)

Query: 7   VTPLSGVFNENPLSYL-VSIDGFNFLIDCG------------WNDHFDPSLLQPLSKVAS 53
           +TPL G  NE   S + +S      L DCG            + D  DPS          
Sbjct: 26  ITPL-GAGNEVGRSCVYMSYKSKIVLFDCGIHPAYSGMAALPYFDEIDPS---------- 74

Query: 54  TIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSE 110
           TID +L++H    H  +LPY +++      VF   +T+ +Y+L L      ++   +VS 
Sbjct: 75  TIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTYATKAIYKLLL----SDFVKVSKVSV 130

Query: 111 FD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
            D L+   DI+ +   +  +    ++H + +  GI    + AGH+LG  ++ +   G  V
Sbjct: 131 EDMLYDEQDINRSMDKIEVI----DFHQTVEVNGIRFWCYTAGHVLGAAMFMVDIAGVRV 186

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGG 229
           +Y  DY+R +++HL    +  F     +I   Y    +QP   + + F D +  T+  GG
Sbjct: 187 LYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFTDVVHSTISQGG 246

Query: 230 NVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSIT 287
            VL+P  + GR  ELLLIL++YWA H    N PIY+ + ++   +   +++   M D I 
Sbjct: 247 RVLIPAFALGRAQELLLILDEYWANHPELHNIPIYYASPLAKRCLTVYETYTLSMNDRI- 305

Query: 288 KSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLA 324
              + ++ N F  K+++ L +     +   GP +V+A
Sbjct: 306 ---QNAKSNPFRFKYISPLKSIEVFKDV--GPSVVMA 337


>gi|302412663|ref|XP_003004164.1| endoribonuclease YSH1 [Verticillium albo-atrum VaMs.102]
 gi|261356740|gb|EEY19168.1| endoribonuclease YSH1 [Verticillium albo-atrum VaMs.102]
          Length = 730

 Score =  119 bits (297), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 78/261 (29%), Positives = 134/261 (51%), Gaps = 19/261 (7%)

Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 193
           +YH +     I + P+ AGH+LG  ++ I   G  + +  DY+R +++HL    +   V+
Sbjct: 48  DYHTTHTISSIRITPYPAGHVLGAAMFLIEIAGLKIFFTGDYSREQDRHLVSAEVPKGVK 107

Query: 194 PAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 252
             VLIT++   + +  PR +RE     +I+  L  GG VL+PV + GR  ELLLIL++YW
Sbjct: 108 IDVLITESTYGIASHVPRVEREQALMKSITSILNRGGRVLMPVFALGRAQELLLILDEYW 167

Query: 253 AEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF-------ETSRDNA-----F 298
            +H     YPIY+ + ++   +   ++++  M D+I + F       E S D +     +
Sbjct: 168 GKHPDFQKYPIYYASNLARKCMVVYQTYVGAMNDNIKRLFREGMAQAEASGDGSGKGGPW 227

Query: 299 LLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFG 358
              ++  L N    D+   G  ++LAS   L+ G S ++   WA + KN V+ T     G
Sbjct: 228 DFNYIRSLKNLDRFDDL--GGCVMLASPGMLQNGVSRELLERWAPNDKNGVIITGYSVEG 285

Query: 359 TLARMLQADPPPKAVKVTMSR 379
           T+A+ +  +  P  ++  MSR
Sbjct: 286 TMAKQIMQE--PDQIQAVMSR 304


>gi|410076302|ref|XP_003955733.1| hypothetical protein KAFR_0B03020 [Kazachstania africana CBS 2517]
 gi|372462316|emb|CCF56598.1| hypothetical protein KAFR_0B03020 [Kazachstania africana CBS 2517]
          Length = 817

 Score =  119 bits (297), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 130/526 (24%), Positives = 238/526 (45%), Gaps = 68/526 (12%)

Query: 30  FLIDCGWNDHFDP--SLLQPLSKVASTIDAVLLSHPDTLHLGA---LPYAMKQLGLSA-P 83
            LID GWN+        ++  S +   +D VLLS P    +GA   L Y      +S   
Sbjct: 27  ILIDPGWNNKKVSYEECVRYWSNIIPEVDIVLLSQPTIECIGAYTLLHYNFLSHFISRIE 86

Query: 84  VFSTEPVYRLGLLTMYDQYLSRRQVSEF--DLFTLDDIDSAFQSVTRLTYSQNYHLSGKG 141
           V++T PV  LG ++  D Y S+  +  +  +   ++DI+ ++  V  L +SQ   L    
Sbjct: 87  VYATLPVTNLGRVSTIDLYASKGVIGPYTTNQMNVEDIEKSYDHVKALKFSQMVDLKSTF 146

Query: 142 EGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVL--------ESFVR 193
           +G+ +  + +G+  GG++W I    E ++YA  +N  K   L+ + L         + +R
Sbjct: 147 DGLSLVAYNSGYTTGGSIWCIMTHSEKLLYARRWNHTKNNILDASALLGPGGKPSSALMR 206

Query: 194 PAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 253
           P+ +IT        +P +++ +MF+D + K++ +GG+ ++PV+     L+LL+++ D+  
Sbjct: 207 PSAIITTLDRFGSPKPYKKRSKMFKDLLRKSVTSGGSAVIPVEIGENFLDLLVLVHDFLY 266

Query: 254 EHS-------LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA--FLLKHVT 304
           E+S       LN  I  ++Y     + Y KS LEW+  S  K++E SRD++  F L    
Sbjct: 267 ENSKSGLISQLN--ILLVSYSKGRIVTYAKSMLEWLSSSAIKTWE-SRDSSSPFELGKNF 323

Query: 305 LLINKSELDNAPDGPKLVLAS---------MASLEAGFSHDIFVEWASDVKNLV--LFTE 353
            +I  SE+   P G K+   S         + +L    +  I +    +   +V  ++ E
Sbjct: 324 NVILPSEISKYP-GSKICFVSQLEPMMDEVIENLGQNETSTILLTSKVNRSEIVSEIYKE 382

Query: 354 RGQFGTLARMLQADPPPKAVKVTMSRR--VPLVGEELIAYEEEQTRLKKEEALKASLVKE 411
             Q      + +    P +  V + +    PL G +L  + ++    +KE+  K+ L+  
Sbjct: 383 WTQLCKKPSVEEGQILPYSSSVLLKKVNIEPLRGHDLDEF-KKSIEERKEKRSKSELLLR 441

Query: 412 EESK---ASLGPDNNLSGDPMVIDANNANA-------------SADVVEPHGGRYRDIL- 454
           +E+K    SL  D  ++G  M  D + + A               +++    G+  D L 
Sbjct: 442 KEAKNPAKSLNTD-RVNGGSMDGDTSQSKAIDEDDDEEEEEEEEDNLLRILKGQSGDKLS 500

Query: 455 ------IDGFV-PPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIK 493
                 +D +V   ST    MF F     + DD+G +++   +I K
Sbjct: 501 GVIEYPVDTYVQTTSTPKNKMFQFNPRKEKRDDYGTIVDYSMFISK 546



 Score = 41.6 bits (96), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 36/121 (29%), Positives = 65/121 (53%), Gaps = 10/121 (8%)

Query: 557 HVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLG-DYEIAWVDAEVGK--TENGM- 612
            ++ PQ+ E I+ ++ + A  + L  +L   + ++K+G D+ +A V   V +    N M 
Sbjct: 668 EMFAPQLNEYIEFSTTIKALDISLDPELDKLLKWQKIGDDHTVAHVVGRVVRDTIHNSMR 727

Query: 613 --LSLLPISTPAPPH-KSVL--VGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRK 666
             L L PIS+    H KS L  +G++++A++K  L+ +G   EF G G L     V +R+
Sbjct: 728 NKLVLKPISSGTKMHTKSGLLSIGEVRLAEVKRKLTEQGHVAEFQGEGTLVVNNEVMVRR 787

Query: 667 V 667
           +
Sbjct: 788 I 788


>gi|350638481|gb|EHA26837.1| hypothetical protein ASPNIDRAFT_35736 [Aspergillus niger ATCC 1015]
          Length = 915

 Score =  119 bits (297), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 93/339 (27%), Positives = 159/339 (46%), Gaps = 11/339 (3%)

Query: 67  HLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSV 126
           H  ALPY + +      VF T     +    + D        S  D  T    +    S 
Sbjct: 135 HSSALPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVSSTASSSDQRTTLYTEQDHLST 194

Query: 127 TRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT 186
             L  + +++ +     I + P  AGH+LG  ++ I+  G ++++  DY+R +++HL   
Sbjct: 195 LPLIETIDFNTTHTINSIRITPFPAGHVLGAAMFLISIAGLNILFTGDYSREEDRHLIPA 254

Query: 187 VLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELL 245
            +   V+  VLIT++   + + PPR +RE     AI+  L  GG VL+PV + GR  ELL
Sbjct: 255 EVPKGVKIDVLITESTFGISSNPPRLEREAALMKAITGVLNRGGRVLMPVFALGRAQELL 314

Query: 246 LILEDYWAEHS--LNYPIYFL--TYVSSSTIDYVKS-FLEWMGDSITKSFETSRDNAFLL 300
           LIL++YW  H      PIY++  T    +  D +K  F + M ++     ++     +  
Sbjct: 315 LILDEYWETHPELQKIPIYYIGNTARRCAMNDNIKRLFRQRMAEAEASGDKSVSAGPWDF 374

Query: 301 KHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTL 360
           + V  L +    D+   G  ++LAS   L+ G S ++   WA + +N V+ T     GT+
Sbjct: 375 RFVRSLRSLERFDDV--GGCVMLASPGMLQTGTSRELLERWAPNERNGVVMTGYSVEGTM 432

Query: 361 ARMLQADPPPKAVKVTMSRRVP-LVGEELIAYEEEQTRL 398
           A+ +  +  P+ +   MSR    LV   + A  EE+ ++
Sbjct: 433 AKQILNE--PEQIPAVMSRATTGLVRRGMAAGNEEEQKV 469


>gi|124505029|ref|XP_001351256.1| cleavage and polyadenylation specificity factor protein, putative
           [Plasmodium falciparum 3D7]
 gi|3758842|emb|CAB11127.1| cleavage and polyadenylation specificity factor protein, putative
           [Plasmodium falciparum 3D7]
          Length = 1017

 Score =  119 bits (297), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 87/351 (24%), Positives = 159/351 (45%), Gaps = 41/351 (11%)

Query: 44  LLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLG------LSAPVFSTEPVYRLGLLT 97
           L+  L ++   ID V++SH    H+GALP+  + L       +S P  +  P+  L    
Sbjct: 159 LINNLKRINEIIDCVIISHFHMDHIGALPFFTEILKYRGIILMSYPTKALSPILLLDSCR 218

Query: 98  MYDQYLSR----RQVS----------EFDLFTL---------DDIDSAFQSVTRLTYSQN 134
           + D    +    RQ+            +++  +         D+I +    V  L  ++ 
Sbjct: 219 VTDMKWEKKNFERQIKMLNEKSDELLNYNINCIKKDPWNINEDNIYNCIDKVIGLQINET 278

Query: 135 YHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRP 194
           + L      + + P+ AGH+LG  ++KI      VIY  DYN   +KHL    + S + P
Sbjct: 279 FELGD----MSITPYYAGHVLGACIYKIEVRNFSVIYTGDYNTIPDKHLGSANIPS-LNP 333

Query: 195 AVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 253
            + I+++  A + +P ++  E+   + + + +  GG VL+PV + GR  EL ++L+DYW 
Sbjct: 334 EIFISESTYATYVRPTKKASELELCNLVHECVHKGGKVLIPVFAIGRAQELSILLDDYWK 393

Query: 254 EHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELD 313
           +  ++YPIYF   ++ +   Y K +  W+  S        ++N F   +++  +N     
Sbjct: 394 KMKIHYPIYFGCGLTENANKYYKIYSSWINSS---CMSNEKENLFDFANISPFLNNYL-- 448

Query: 314 NAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML 364
                P ++ A+   L  G S   F  WA + +NL++       GT+   L
Sbjct: 449 -NEKRPMVLFATPGMLHTGLSLKAFKAWAGNPQNLIVLPGYCVQGTVGHKL 498


>gi|85000301|ref|XP_954869.1| hypothetical protein [Theileria annulata strain Ankara]
 gi|65303015|emb|CAI75393.1| hypothetical protein, conserved [Theileria annulata]
          Length = 663

 Score =  118 bits (296), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 101/376 (26%), Positives = 174/376 (46%), Gaps = 51/376 (13%)

Query: 34  CGWNDHFDP------SLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFST 87
           C     FD       +L + L  V +++D  ++SH    H+GALP+  + +G S P++ +
Sbjct: 90  CAVKQEFDKDIYMKNALQKALRNVTNSVDCSIISHFHLDHVGALPFLTEHIGYSGPIYLS 149

Query: 88  EPVYRLGLLTMYD--QYLSRRQVSEFDLFTLDDIDSAFQSV----TRLTYSQN------- 134
            P   L  L + D  Q  S R V + D  ++  I+++ +S+    T  T++ +       
Sbjct: 150 YPTRALCPLLLRDSVQVTSTRTVPD-DPNSISSINASVKSLLNSHTNATFTPDKRRKIEE 208

Query: 135 ------YHLSGKGE-----------------GIVVAPHVAGHLLGGTVWKITKDGEDVIY 171
                 Y L+   E                  + + P+ AGH+LG +++    DG  V+Y
Sbjct: 209 KADPWGYTLNSVAECMKRSIPLQLRATETVGNLNLVPYYAGHVLGASMFLSECDGFKVLY 268

Query: 172 AVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGN 230
             D+N   +KHL G      + P VLI ++  A   +  ++  EM     + +TL  GG 
Sbjct: 269 TGDFNTIPDKHL-GPAKVPTLEPDVLICESTYATFVRQSKRATEMELCTTVHETLINGGK 327

Query: 231 VLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF 290
           VL+PV + GR  EL +IL +YW   SL++PIYF   +S    +Y K    W  ++   + 
Sbjct: 328 VLIPVFAVGRAQELAIILNNYWNNLSLSFPIYFGGGLSEKATNYYKLHSSWTNNN---NI 384

Query: 291 ETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
              R+N F L+++ L  ++S L++  + P ++ A+   +  G S      W+ +  NL+L
Sbjct: 385 TNLRENPFSLRNL-LQFDQSFLND--NRPMVLFATPGMVHTGLSLKACKLWSQNPSNLIL 441

Query: 351 FTERGQFGTLARMLQA 366
                  GT+   L A
Sbjct: 442 IPGYCVQGTVGNKLIA 457


>gi|340058172|emb|CCC52525.1| cleavage and polyadenylation specificity factor,putative,
           (fragment), partial [Trypanosoma vivax Y486]
          Length = 411

 Score =  118 bits (296), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 112/396 (28%), Positives = 175/396 (44%), Gaps = 35/396 (8%)

Query: 17  NPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMK 76
            P++YL+ IDG   L+DCGW D F  S L  L      + AVL S P+    GALP+ M 
Sbjct: 27  TPMAYLIEIDGVRILMDCGWTDEFRVSHLDALMPHIKDVHAVLFSTPEMCSCGALPFVMD 86

Query: 77  QLGLSAPVFSTEPVYRLGLLTMYDQYL---SRRQV------SEFDLFTLDDIDSAFQSVT 127
            +     V +     ++GL  +   +L   S RQ       +EF+L T+D I SAF+SV 
Sbjct: 87  HVPPGTHVAAAGATTKMGLHGVLHPFLYQFSNRQTWQLESGTEFEL-TVDKIYSAFRSV- 144

Query: 128 RLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTV 187
           +  Y     +S K   +   P   G +LGG  W I    +++ Y  D++ +        V
Sbjct: 145 KEPYGGKVTISHKDVAVECFPVFTGRMLGGYGWLIKYQIDELFYCPDFSLKPSY-----V 199

Query: 188 LESFVRP---AVLITDAYNALHNQPPRQQREMFQDAISK----TLRAGGNVLLPVDSAGR 240
           L  FV P    VL  D     H     ++ E   +A  +    TLR G +VL+PV  AGR
Sbjct: 200 LNRFVPPTTATVLFIDGSPLRHGGGGGRRYEEHLNAFIRDVLGTLRNGKDVLIPVSVAGR 259

Query: 241 VLELLLILEDYWAEH-SLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
            LE+L I+     E  S +Y +      ++  I    +  E + D +  S +       L
Sbjct: 260 GLEVLAIVTHLLTEKGSDSYTVVLAALQAAEIISKAGTMTEALRDEVILSEQQ------L 313

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASD----VKNLVLFTERG 355
             +V       E+   P GPK+ +A   +L  G + ++   +  D     +NLV+     
Sbjct: 314 FANVVTCKTAQEVLTVP-GPKVCVADGETLGYGIAAELLEYFLQDDQEGRENLVVLPWAP 372

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAY 391
           +  + A ++ A      +++  ++R PL  EEL  Y
Sbjct: 373 RQESNASIIAAASKGDMMQLRYTKRSPLNKEELEEY 408


>gi|154278321|ref|XP_001539974.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150413559|gb|EDN08942.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 977

 Score =  117 bits (294), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 94/356 (26%), Positives = 143/356 (40%), Gaps = 72/356 (20%)

Query: 8   TPLSGVFNE--NPLSYLVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPD 64
           TPL G  +     +  ++ +DG    L+D GW++ FD S L  L +   T+  VLL+H  
Sbjct: 5   TPLLGAQSSGSRAVQSILELDGGVKILVDVGWDESFDVSALAELERQIPTLSLVLLTHAT 64

Query: 65  TLHLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLSRRQVSEF----------- 111
             H+GA  +  K   L    P+++T PV  LG   + D Y S    + F           
Sbjct: 65  PSHIGAFAHCCKTFPLFNQIPIYATSPVIALGRTLLQDLYSSAPLAATFLPKATSADSSP 124

Query: 112 ------------DLFTLDDIDSA---------------FQSVTRLTYSQNYHLSGKG--- 141
                       D   +D  DS                F  +  L YSQ +         
Sbjct: 125 SSPISSRAENVADTANIDHNDSPRILLPPPTSEEIARYFSLIHPLKYSQPHQPLPSPFSP 184

Query: 142 --EGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT------------V 187
              G+ +  + AGH +GGT+W I    E +IYAVD+N+ +E  + G             V
Sbjct: 185 PLNGLTLTAYNAGHTVGGTIWHIQHGMESIIYAVDWNQARENVIAGAAWFGGSGASGTEV 244

Query: 188 LESFVRPAVLI--TDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLEL 244
           +E   +P   +  T   +       R++R ++  D I      GG VL+P D++ R LEL
Sbjct: 245 VEQLRKPTAFVCSTRGGDKFSLSGGRKKRDDLLMDMIRNCFSKGGTVLIPSDTSARALEL 304

Query: 245 LLILEDYWAEHSLNY---------PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFE 291
             +LE  W E +             +Y        T+   +S LEWM + I + FE
Sbjct: 305 AYVLEHAWRESAETVDGEDPLKSGELYLAGKKGYGTMRLARSMLEWMDEGIVREFE 360



 Score = 47.0 bits (110), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 50/211 (23%), Positives = 80/211 (37%), Gaps = 68/211 (32%)

Query: 534 VLVHGSAEATEHLKQHCLKHVCPH--------------VYTPQIEETIDVTSDLCAYKVQ 579
           +L  G  E TE L   C   +                 ++TP I ET+D + D  A+ V+
Sbjct: 741 ILTAGLKEETEALAAECRNLLTAKAGLELGSSSQSVVDIFTPVIGETVDASVDTNAWMVK 800

Query: 580 LSEKLMSNVLFKKLGDYEIAWVDAE------VGKTENG---------------------- 611
           LS  L+  + ++ +    +  +  E      +   E+G                      
Sbjct: 801 LSSTLVKRLKWQSVRSLGVVALTGELRGPEPMAADEDGPGMSQKKQRTFSENASSSEGNE 860

Query: 612 ------------MLSLLPISTPAPPH---KSVLVGDLKMADLKPFLSSKGIQVEFAG-GA 655
                       +L +LP++  A      + + VGDL++ADL+  + S G   EF G G 
Sbjct: 861 KKQLVPRKHSFPLLDVLPVNMAAATRSVTRPLHVGDLRLADLRKLMQSSGHTAEFRGEGT 920

Query: 656 LRCGEYVTIRKVGPAGQKGGGSGTQQIVIEG 686
           L    +V +RK          SGT +I IEG
Sbjct: 921 LLIDGFVAVRK----------SGTGKIEIEG 941


>gi|366991851|ref|XP_003675691.1| hypothetical protein NCAS_0C03360 [Naumovozyma castellii CBS 4309]
 gi|342301556|emb|CCC69326.1| hypothetical protein NCAS_0C03360 [Naumovozyma castellii CBS 4309]
          Length = 814

 Score =  116 bits (290), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 84/301 (27%), Positives = 150/301 (49%), Gaps = 31/301 (10%)

Query: 22  LVSIDGFNFLIDCGWND----HFDPSLLQPLSKVASTIDAVLLSHPDTLHLGA--LPYA- 74
           ++  D    LID GW      + D   ++  S +   ID +L+S P    LGA  L Y  
Sbjct: 19  ILKFDNVTILIDPGWTSTEVSYVD--CVKYWSNLIPEIDVILISQPTIECLGAYTLLYEN 76

Query: 75  -MKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEF---DLFTLDDIDSAFQSVTRLT 130
            +        V++T PV  LG ++  + Y S+  +  F   +   ++DI++AF  +  L 
Sbjct: 77  FLSHFLSRIAVYATLPVANLGRVSTIEWYASQGIIGPFLDSNKMEVEDIEAAFDHIQILK 136

Query: 131 YSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN------ 184
           YSQ   L  K +G+      +G   GG++W I+   E ++YA  +N  ++  LN      
Sbjct: 137 YSQMIDLRSKFDGLTFFALNSGVNPGGSIWCISTYSEKLVYAPRWNHTRDTILNAASLLD 196

Query: 185 --GTVLESFVRPAVLIT--DAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGR 240
             G  L + +RP+ +IT  D + ++  +P +++  +F+D++ K L   G  L+P+D  G+
Sbjct: 197 NMGKPLSTLMRPSGIITSFDKFGSV--KPYKKRARIFKDSLKKALSNNGTALIPIDIGGK 254

Query: 241 VLELLLILEDYWAEHSLN-----YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            L++ +++ D+  E+  N      PI  ++Y     + Y KS LEW+  ++ K++E SR 
Sbjct: 255 FLDVFVLVHDFLYENLKNGMFNRLPILLVSYSRGRALTYAKSMLEWLSSTLLKTWE-SRS 313

Query: 296 N 296
           N
Sbjct: 314 N 314


>gi|149245028|ref|XP_001527048.1| hypothetical protein LELG_01877 [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146449442|gb|EDK43698.1| hypothetical protein LELG_01877 [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 812

 Score =  115 bits (289), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 95/349 (27%), Positives = 164/349 (46%), Gaps = 40/349 (11%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVS 109
           S +D +L+SH    H  +LPY M+Q      VF   +T+ +YR  +             S
Sbjct: 63  SKVDILLISHFHVDHSASLPYVMQQSNFKGKVFMTHATKAIYRWLMQDFVRVTSIGNSRS 122

Query: 110 EF-----------------DLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAG 152
           E                  +L+T DDI  +F  +  +    +YH + + +GI    + AG
Sbjct: 123 EGGGTSATGASGSLNEEGGNLYTDDDIFKSFDRIETI----DYHSTMEIDGIKFTAYHAG 178

Query: 153 HLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQ 212
           H+LG  ++ I   G  V++  DY+R + +HL    +    RP +LIT++         + 
Sbjct: 179 HVLGACMYFIEIGGLKVLFTGDYSREENRHLQAAEVPP-TRPDILITESTFGTGTLESKA 237

Query: 213 QREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSS 269
           + E      I  T+  GG VLLPV + G   E+LLILE+YW ++    N  +Y+ + ++ 
Sbjct: 238 ELEKKLTSHIHATITRGGRVLLPVFALGNAQEILLILEEYWEKNEDLHNVNVYYCSDLAR 297

Query: 270 STIDYVKSFLEWMGDSI----------TKSFETSRDNAFLLKHVTLLINKSELDNAPDGP 319
             +   +++   M D I          + S  +++ N F  K++  + N S+  +   GP
Sbjct: 298 KCMAVYETYTGIMNDKIRLSSSSSSSTSSSNNSTKSNPFDFKYIKSIKNLSKFSDL--GP 355

Query: 320 KLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
            +V+A+   L+AG S  +  +WA + KNLV+ T     GT+A+ +  +P
Sbjct: 356 SVVVATPGMLQAGVSRQLLEKWAPEQKNLVILTGYSVEGTMAKDIMKEP 404


>gi|84995678|ref|XP_952561.1| hypothetical protein [Theileria annulata]
 gi|65302722|emb|CAI74829.1| hypothetical protein TA11620 [Theileria annulata]
          Length = 830

 Score =  115 bits (289), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 96/391 (24%), Positives = 180/391 (46%), Gaps = 21/391 (5%)

Query: 26  DGF-NFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQL-----G 79
           D F N L++CGW+  F    L    K A  +D +L++  D LH GAL +   +      G
Sbjct: 33  DNFLNVLLNCGWSLDFSEEKLNLYKKYAQNVDVILITDGDFLHSGALLWLTSRFLTELKG 92

Query: 80  LSAP-VFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLS 138
            S P +  TE  Y+    ++ D   +    ++F  +++DD++    +  +L YS+ Y   
Sbjct: 93  KSIPKILCTEGTYKFMRASLIDVLENVTFSTDFGYYSMDDLELLDSNCVKLRYSETYCHM 152

Query: 139 GKGEGIVVAPHVA----GHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRP 194
            K + + V         G+ +GG +WKI+     VI            LNG  +   + P
Sbjct: 153 KKLQNLDVKSSFCALNNGYSVGGAIWKISVGYNTVICGDKIRIYTGTLLNGANINDILNP 212

Query: 195 AVLI---TDAYNALHNQPPR-----QQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLL 246
            +L+    D     H   P+     +      D +  TL  GGN+L P+D    +L LL+
Sbjct: 213 DLLVLSHEDVETPKHVTDPKGVKVCEDLNSLTDKLFTTLTKGGNILFPMDVDYTLLNLLI 272

Query: 247 ILEDYWAEHSLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL-LKHVT 304
            L   W+   L+ + I   + ++   + ++ + LE+M  SI  +F  +  N F+ L H+ 
Sbjct: 273 HLNMIWSTSQLSQFKIVLASPIADKLMLFIGTCLEYMKTSIFHNFIKTLWNPFMDLNHIE 332

Query: 305 LLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML 364
           ++ +  +L      P + +++ ++++ GFS+ +F+  +S  KNLV+ T+  Q  T     
Sbjct: 333 IITSLGQLSRYRFRPTVFISTTSNMDFGFSNFLFLAISSYYKNLVVLTKPNQSVTKYVYN 392

Query: 365 QADPPPKAVKVTMSRRVPLVGEELIAYEEEQ 395
           + +   +A +   +R + ++ +E    E E+
Sbjct: 393 RNNSGVQAPQYKETRLINVLDDEPEEQENEK 423


>gi|349603401|gb|AEP99246.1| Cleavage and polyadenylation specificity factor subunit 2-like
           protein, partial [Equus caballus]
          Length = 327

 Score =  115 bits (288), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 89/339 (26%), Positives = 146/339 (43%), Gaps = 117/339 (34%)

Query: 467 PMFPFYENNSEWDDFGEVINPDDYIIK-------------------DEDMDQ-------- 499
           PMFP  E   +WD++GE+I P+D+++                    DE MDQ        
Sbjct: 7   PMFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDVPTK 66

Query: 500 -----AAMHIGGD------DGKLDEGSASLILDA-KPSKVVSNELTVLVHGSAEATEHLK 547
                 ++ I         +G+ D  S   I++  KP +++      +VHG  EA++ L 
Sbjct: 67  CISTTESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLI------IVHGPPEASQDLA 120

Query: 548 QHCL----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA 603
           + C     K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D 
Sbjct: 121 ECCRAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDG 178

Query: 604 ----EVGKTENGML---------------------------------------------- 613
                V K + G++                                              
Sbjct: 179 VLDMRVSKVDTGVILEEGELKDDGEDSEMQVDAPSDSSVLAQQKAMKSLFGDDEKDTGEE 238

Query: 614 -SLLPISTPAPPH-----KSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKV 667
             ++P   P PPH     +SV + + +++D K  L  +GIQ EF GG L C   V +R+ 
Sbjct: 239 SEIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR- 297

Query: 668 GPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
                    + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 298 ---------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 327


>gi|429966183|gb|ELA48180.1| hypothetical protein VCUG_00418 [Vavraia culicis 'floridensis']
          Length = 647

 Score =  115 bits (288), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 95/367 (25%), Positives = 173/367 (47%), Gaps = 19/367 (5%)

Query: 17  NPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMK 76
           N  S L+ ID +  LI+ G +       L  L ++   ID +L+ H +  ++G LP    
Sbjct: 18  NVFSQLLEIDTYKILINIGSDPFLKVDYLAELERIIDDIDCILICHAELKYIGGLP---- 73

Query: 77  QLG--LSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQN 134
            LG      ++ + PV+ LG L M  +     +V     +  DDI+  F  ++ + YSQ 
Sbjct: 74  SLGERFKGKLYCSVPVHTLGRL-MVSEVNRNMEVFGAKRYEEDDIEEWFARISVVKYSQP 132

Query: 135 YHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRP 194
             L      + +  H +GH LGG +W+I+KD E+V+ A D N RKE H++G  + +  + 
Sbjct: 133 IELGA----LRLTAHNSGHSLGGCLWQISKDNENVVVAFDINHRKENHVDGLEINNLRKN 188

Query: 195 AVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
            + + +       + P Q++    + +S   +  GN ++ + +  R LE+  IL+++   
Sbjct: 189 FIFLMNC--EFVGEVPVQRKSRDSEFMSFLAQNHGNKIVILCTFSRYLEICSILDEFLER 246

Query: 255 HSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDN 314
              N    FL++ S++  +  K  LEW GD   K F  ++ N F  K++      SE+D 
Sbjct: 247 K--NKRCTFLSFNSNTLYESFKIMLEWAGDIALKKFTNTKVNPFAFKNIRFKDLYSEVDK 304

Query: 315 APDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVK 374
             D   + +    +L + F++ I  +   +   LV+F +  +  T+ R+   D P   V+
Sbjct: 305 KTD---IFVILDENLCSPFTNRIVYDLNDERNVLVVFNDEHE-RTITRLDYMDVPEFKVE 360

Query: 375 VTMSRRV 381
               ++V
Sbjct: 361 KESDKQV 367


>gi|403222958|dbj|BAM41089.1| cleavage and polyadenylation specificty factor subunit [Theileria
           orientalis strain Shintoku]
          Length = 700

 Score =  115 bits (288), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 95/372 (25%), Positives = 170/372 (45%), Gaps = 40/372 (10%)

Query: 31  LIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTE 88
           + DCG +         P+ +    + ++  L++H    H GA+PY + +      +F T 
Sbjct: 39  MFDCGLHPALSGVGALPVFEAVDITKVEVCLVTHFHLDHCGAIPYLLSKTKFRGRIFMTS 98

Query: 89  PVYRLGLLTMYD-----QYLSRRQV---------------SEFD-------LFTLDDIDS 121
               +  L   D     Q  S +++               +E D       L+T DD++ 
Sbjct: 99  ATKAICHLLWTDYARMEQLHSVKKIFDQPDALNDEGQNEDTEMDELVCGSGLYTFDDVEF 158

Query: 122 AFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEK 181
           A   +  +    ++H       I V+ + AGH+LG  ++ +  DG  ++Y  DY+  K+K
Sbjct: 159 ALDKIETI----DFHEELTVNNIKVSCYRAGHVLGACMFLVEIDGVRILYTGDYSVEKDK 214

Query: 182 HLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGR 240
           HL    +   +   +LI+++   +     R QRE  F   +   +   G  LLPV + GR
Sbjct: 215 HLPSAEI-PLINVHLLISESTYGIRVHEERGQRESRFMHVVLDIIMREGKCLLPVFALGR 273

Query: 241 VLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAF 298
             E+LLIL++YWA +    N PI++++ ++S ++   ++F+   GD I +S      N F
Sbjct: 274 SQEILLILDEYWANNRQLQNVPIFYISPLASKSLKVYETFVGLCGDYIKESIYNGH-NPF 332

Query: 299 LLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQ 356
             K V    +  ++ N    +GP +++ S   L+ G S ++F   + D +N V+ T    
Sbjct: 333 NFKFVKYARSVRQIRNYLLREGPCIIMTSPGMLQGGPSLEVFELISPDNRNGVVLTGYTV 392

Query: 357 FGTLARMLQADP 368
            GTLA  L+ DP
Sbjct: 393 KGTLADELKKDP 404


>gi|67968624|dbj|BAE00671.1| unnamed protein product [Macaca fascicularis]
          Length = 341

 Score =  115 bits (288), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 90/348 (25%), Positives = 148/348 (42%), Gaps = 117/348 (33%)

Query: 458 FVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIK-------------------DEDMD 498
           F   +    PMFP  E   +WD++GE+I P+D+++                    DE MD
Sbjct: 12  FFKQAKKSYPMFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMD 71

Query: 499 Q-------------AAMHIGGD------DGKLDEGSASLILDA-KPSKVVSNELTVLVHG 538
           Q              ++ I         +G+ D  S   I++  KP +++      +VHG
Sbjct: 72  QDLSDVPTKCISTTESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLI------IVHG 125

Query: 539 SAEATEHLKQHCL----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLG 594
             EA++ L + C     K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  
Sbjct: 126 PPEASQDLAECCRAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAK 183

Query: 595 DYEIAWVDA----EVGKTENGML------------------------------------- 613
           D E+AW+D      V K + G++                                     
Sbjct: 184 DAELAWIDGVLDMRVSKVDTGVILEEGELKDDGEDSEMQVDAPSDSSVIAQQKAMKSLFG 243

Query: 614 ----------SLLPISTPAPPH-----KSVLVGDLKMADLKPFLSSKGIQVEFAGGALRC 658
                      ++P   P PPH     +SV + + +++D K  L  +GIQ EF GG L C
Sbjct: 244 DDEKETGEESEIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVC 303

Query: 659 GEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
              V +R+          + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 304 NNQVAVRR----------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 341


>gi|156089433|ref|XP_001612123.1| hypothetical protein [Babesia bovis T2Bo]
 gi|154799377|gb|EDO08555.1| hypothetical protein BBOV_III009990 [Babesia bovis]
          Length = 943

 Score =  114 bits (286), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 86/343 (25%), Positives = 151/343 (44%), Gaps = 18/343 (5%)

Query: 28  FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQL-----GLSA 82
            N L++CGW+  F+P  +  L +  S +D ++L+  D  H+GALP     L     GL  
Sbjct: 95  INILVNCGWSLDFEPESIDLLKQCCSDVDVIILTDGDFGHVGALPVIYSWLHVVRDGLGL 154

Query: 83  P-VFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKG 141
           P +  TE  Y+     + D   +     +F+ +   D+D  +     L Y +++     G
Sbjct: 155 PSILCTEGCYKFARACLVDVLDNATLSYKFEGYNFSDLDLFYSGCVTLRYRESFPFVKSG 214

Query: 142 EG----IVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVL 197
           EG    I + P   G  +GG VW++      ++ A  Y       LNG   +      V+
Sbjct: 215 EGWRIHISLLPLNNGVSIGGAVWRLELGTRTIVCAPTYRVESVWFLNGCEFDGIRNADVV 274

Query: 198 ITDAYNALHNQPPR------QQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDY 251
           +T     L  +P                 I  TLR+ G+VL+P+D   ++++LL  L   
Sbjct: 275 VTYDQPRLPPEPVNPYVTECNSMSSILSVIGGTLRSHGSVLIPLDVGSQLIDLLFHLNAV 334

Query: 252 WAEHSL-NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAF-LLKHVTLLINK 309
           W+   L  YPI  ++ ++   I    + LE+M  +I  +F  +  N    +K +  +   
Sbjct: 335 WSNSDLQQYPIVLVSPIAVKLILLFGTCLEYMRTTICHNFLRTLWNPISSMKFIHAVSRL 394

Query: 310 SELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFT 352
            EL    + P + +++ +SL+ G S  +F   +   KN ++FT
Sbjct: 395 DELRRFANRPCVFISTCSSLDFGLSSYLFAALSCYKKNSIIFT 437


>gi|383859338|ref|XP_003705152.1| PREDICTED: integrator complex subunit 11-like isoform 2 [Megachile
           rotundata]
          Length = 494

 Score =  114 bits (286), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 68/217 (31%), Positives = 113/217 (52%), Gaps = 10/217 (4%)

Query: 146 VAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNAL 205
           +  + AGH+LG  ++ I    + ++Y  DYN   ++HL    ++   RP +LI+++  A 
Sbjct: 49  IKAYYAGHVLGAAMFWIRVGSQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLISESTYAT 107

Query: 206 HNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFL 264
             +  ++ RE  F   + + +  GG VL+PV + GR  EL ++LE YW   +L  P+YF 
Sbjct: 108 TIRDSKRCRERDFLKKVHECIDRGGKVLIPVFALGRAQELCILLETYWERMNLKVPVYFA 167

Query: 265 TYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLA 324
             ++    +Y K F+ W    I K+F   + N F  KH+    +K+ +DN   G  +V A
Sbjct: 168 LGLTEKANNYYKMFITWTNQKIKKTF--VQRNMFDFKHIKPF-DKAYIDNP--GAMVVFA 222

Query: 325 SMASLEAGFSHDIFVEWASDVKNLVL---FTERGQFG 358
           +   L AG S  IF +WA +  N+V+   F  +G  G
Sbjct: 223 TPGMLHAGLSLQIFKKWAPNEANMVIMPGFCVQGTVG 259


>gi|323453344|gb|EGB09216.1| hypothetical protein AURANDRAFT_71470 [Aureococcus anophagefferens]
          Length = 1101

 Score =  114 bits (285), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 79/277 (28%), Positives = 140/277 (50%), Gaps = 12/277 (4%)

Query: 95  LLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHL 154
           LL+ Y + L +    E  L+  +D+      V  +    ++H   + EGI    + AGH+
Sbjct: 2   LLSDYIRLLPQDDRGEGGLYDEEDLARCCDRVELV----DFHQVVEHEGIRFWSYNAGHV 57

Query: 155 LGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR 214
           LG  ++ I   G  ++Y  DY+  +++HL    + + + P VLI ++         R  R
Sbjct: 58  LGAAMFMIEIGGVRLLYTGDYSLEEDRHLVPAEVPT-LEPHVLIMESTYGTQKHESRDVR 116

Query: 215 E-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSST 271
           E +F   I + ++ GG  L+PV + GR  ELLLIL++YW E       P+++ + ++S  
Sbjct: 117 EALFTSTIERIVQRGGRCLIPVFALGRAQELLLILDEYWKEREDLQRVPVFYASKMASRA 176

Query: 272 IDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEA 331
           +   ++++  M   +    + S  N F   HV  L +  +LD++  GP +VLA+   L++
Sbjct: 177 LRVYQTYINMMNMHVRDQMDIS--NPFKFDHVQNLASIDDLDDS--GPVVVLAAPGMLQS 232

Query: 332 GFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           G S  +F  WAS  +N V+       GTLA+ + ++P
Sbjct: 233 GVSRQLFDRWASSERNGVVIAGYSVEGTLAKQILSEP 269


>gi|402696937|gb|AFQ90657.1| 73kDa cleavage and polyadenylation specific factor 3, partial
           [Dibamus sp. JJF-2012]
          Length = 220

 Score =  114 bits (285), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 63/213 (29%), Positives = 118/213 (55%), Gaps = 8/213 (3%)

Query: 143 GIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAY 202
           GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P +LI ++ 
Sbjct: 6   GIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPDILIIEST 64

Query: 203 NALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL--NY 259
              H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  H    + 
Sbjct: 65  YGTHIHEKREEREARFCNXVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHXXLHDI 124

Query: 260 PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGP 319
           PIY+ + ++   +   ++++  M D I K      +N F+JKH++ L +    D+   GP
Sbjct: 125 PIYYASSLAKKCMAVYQTYVNAMNDKIRKXXXI--NNPFVJKHISNLKSMDHFDDI--GP 180

Query: 320 KLVLASMASLEAGFSHDIFVEWASDVKNLVLFT 352
            +V+AS   +++G S ++F  W +D +N V+  
Sbjct: 181 SVVMASPGMMQSGLSRELFESWCTDKRNGVIIA 213


>gi|294883712|ref|XP_002771037.1| cleavage and polyadenylation specificity factor, putative
           [Perkinsus marinus ATCC 50983]
 gi|239874243|gb|EER02853.1| cleavage and polyadenylation specificity factor, putative
           [Perkinsus marinus ATCC 50983]
          Length = 1050

 Score =  114 bits (284), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 124/400 (31%), Positives = 169/400 (42%), Gaps = 94/400 (23%)

Query: 47  PLSKVAS----TIDAVLLSHPDTLHLGALPYAMKQL------------------------ 78
           P+SK  S     ID  LLS  D  H GA PY    L                        
Sbjct: 19  PISKDTSQYQMAIDVCLLSFADLQHCGAWPYVYCHLRPKKLQYAVAPPPVGEADAAASSS 78

Query: 79  --------GLSAPVFSTEPVYRLGLLTM------YDQYLSRRQVSEFDLFTLDDIDSAFQ 124
                      A V +TEPV RLG LT+       D+       +   L T+DD   AF 
Sbjct: 79  SSKNSNQPSNGAMVLATEPVRRLGELTLTALHEDIDKMRDAVTTTNDWLLTIDDTIMAFN 138

Query: 125 -SVTRLTYSQNYHLS--------GKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDY 175
            +VT L Y +    +         KG  +   P  AG +LGG  W+I    + ++YAVDY
Sbjct: 139 GAVTPLQYGEGVMFTMRGDAGANAKGPTVRFTPLPAGRMLGGAYWRIDVGSQSMVYAVDY 198

Query: 176 NRRKEKHLNGTVLE--SFVRPAVLITDA---------------------------YNA-- 204
               ++HLNG  L       P+VLIT+                            Y+A  
Sbjct: 199 QMAGDRHLNGMELPPPEQAPPSVLITNTMPPAVEGAVTCAGQGATSNVATESRRTYDAGI 258

Query: 205 ---LHNQPPRQQREMFQDAISKTLRAGGNVLLPVD--SAGRVLELLLILEDYWAEHS--L 257
                N+   Q  E     + ++LR  G VLLPVD  S GRVLELLL+LE  WA  +   
Sbjct: 259 TASRSNRRYAQAEEALLGMVLRSLRKDGTVLLPVDCCSTGRVLELLLLLEAAWAADAGLQ 318

Query: 258 NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD---NAFLLKHVTLLINKSEL-D 313
            YP+ +++ +    +D +K  +EWM   +   F+TS     + FL +HV L  +  +   
Sbjct: 319 VYPVVYVSPLGDVVLDQIKIRMEWMSRVVHNDFDTSMGFMYHPFLFQHVQLCSSFQDFAQ 378

Query: 314 NAP-DGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFT 352
           N P   PK+VLAS ASLE G + +IF     D  + V+FT
Sbjct: 379 NYPARKPKVVLASSASLEIGDAREIFCRMCGDPNSTVIFT 418


>gi|167395302|ref|XP_001733549.1| Cleavage and polyadenylation specificity factor subunit [Entamoeba
           dispar SAW760]
 gi|165894214|gb|EDR22276.1| Cleavage and polyadenylation specificity factor subunit, putative
           [Entamoeba dispar SAW760]
          Length = 736

 Score =  114 bits (284), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 96/378 (25%), Positives = 163/378 (43%), Gaps = 30/378 (7%)

Query: 2   GTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWND---HFDPSLLQPLSKVAS--TID 56
           G  +++ PL          +++   G N ++DCG +    H + +L  PL + A   +I+
Sbjct: 18  GNYLEIRPLGAGREVGRSCFILKYMGHNIMLDCGVHPAKPHGEAAL--PLFEHADIDSIE 75

Query: 57  AVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRL--GLLTMYDQYLSRRQ--VSEFD 112
            + ++H    H  +LPY + +      V  T P   +   L   + Q  S  Q   S   
Sbjct: 76  LLCVTHYHVDHCASLPYLILERQFKGKVLMTPPTKEIFGELFKEFHQMSSTIQPPKSVNP 135

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
              +D ID+             +H   +  G+ +    AGH+LG  ++ I  +G  ++Y 
Sbjct: 136 KEVMDRIDTI-----------KFHELQEYNGMKIWCFNAGHILGAAMFCIEINGVKILYT 184

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVL 232
            D++   ++HL    +  F    ++    Y  +  +    +   F   I + L+ GG  L
Sbjct: 185 GDFSGETDRHLQAAEVPPFQIDVMMCESTYGIIEQESRIDRENAFIRQIMEILKRGGKCL 244

Query: 233 LPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF 290
           +PV S GR  E  LILE+YW  H    +  I+F + ++     Y + F  +M   + K  
Sbjct: 245 IPVFSLGRAQEFELILEEYWQNHKDLWSVSIFFFSSIAKKCTTYFEKFTSFMNQDLRKKT 304

Query: 291 ETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLV 349
           + + D  F+ +  +     S  D A D  P +V+AS   L+ G S  IF  W +D KN V
Sbjct: 305 KQAFDFKFIREGSS-----SVDDGAIDYKPCVVMASPGMLQDGISRKIFERWCTDKKNGV 359

Query: 350 LFTERGQFGTLARMLQAD 367
           +       GTLA+ L  D
Sbjct: 360 IIPGYCVEGTLAKDLILD 377


>gi|396081352|gb|AFN82969.1| putative cleavage and polyadenylation [Encephalitozoon romaleae
           SJ-2008]
          Length = 639

 Score =  114 bits (284), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 84/317 (26%), Positives = 156/317 (49%), Gaps = 28/317 (8%)

Query: 5   VQVTPLS----GVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           V +TPL     G++      +L+ ID    L++CG     D S+  P+     + DA+LL
Sbjct: 6   VSLTPLIRTEIGIY-----CHLLEIDNVKILVNCGAPYTMDMSIYTPILPQILSCDAILL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           +     + GALPY + Q      VFS+ P+  LG + + D++L +    E ++ T     
Sbjct: 61  TSFGVNYAGALPYIL-QNNYYNKVFSSVPIKTLGKICL-DEHL-KGMGKELEVDT----- 112

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
             F+ ++ + YSQ   ++     + V  + +G+ +GG ++KI+K  E ++   + N RKE
Sbjct: 113 GLFERISEIKYSQPTVINN----VEVCAYNSGNSIGGCLYKISKGAEKIVVGFNMNHRKE 168

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAG 239
            HL+G         ++ + +  + L       +R+ MF++ +   L +GG V+LPV  + 
Sbjct: 169 NHLDGIGFSGIGDCSLCVVNGNHVLAENVSIAKRDNMFREMVGNVLDSGGKVILPVKYS- 227

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
           R+LE+ LIL +  ++ S    +  L+Y     ++  +S +EW G+ ++  F   + N F 
Sbjct: 228 RLLEVALILNNMMSQRS--EKVVCLSYFGQRFVERARSMIEWAGEKVSSMFSEEKVNPFE 285

Query: 300 LKHVTLL---INKSELD 313
            + +  +    N SE D
Sbjct: 286 FEKIEFIEHYQNISEFD 302


>gi|269860949|ref|XP_002650191.1| cleavage and polyadenylation specificity factor subunit
           [Enterocytozoon bieneusi H348]
 gi|220066365|gb|EED43849.1| cleavage and polyadenylation specificity factor subunit
           [Enterocytozoon bieneusi H348]
          Length = 501

 Score =  113 bits (283), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 87/355 (24%), Positives = 166/355 (46%), Gaps = 23/355 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVAST-------IDAVLLSHPDTLHLGALPYA 74
           +V+I     + DCG +  ++ S   P     +        +D +++SH    H G+LPY 
Sbjct: 18  VVTIKNKTIMFDCGIHLGYNDSRKLPNFDYFNENHHGRRPVDIIVISHFHIDHCGSLPYF 77

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTMYD--QYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYS 132
           ++    +  +F T P      + + D  +    +   E  L+T + I++    V  L   
Sbjct: 78  VETTQFNGLIFMTHPTKAALPIVLEDCKKIFENKNQMEKPLYTTEQINNCLSKVIALNME 137

Query: 133 QNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFV 192
           + Y +    +  ++ P+ AGH++G  ++ +    E V+Y  D++   +++L    ++  +
Sbjct: 138 ETYEIE---QEFIIRPYYAGHVIGAAMFFVRYLDETVVYTGDFSTIPDRYLRAATIDC-L 193

Query: 193 RPAVLITDAY--NALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILED 250
            P +LIT++   N + +    ++REM   A+ KT+  GG VL+P+ + GR  E+ L+L++
Sbjct: 194 YPDLLITESTYGNIVRDLRKSKEREMIM-AVHKTIDIGGKVLIPIFALGRAQEICLLLKN 252

Query: 251 YWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET-SRDNAFLLKHVTLLINK 309
           Y     L+ PIYF T +     D    F  +  +S+ +  +  S  N+  +K       +
Sbjct: 253 YCERIQLSVPIYFTTGLIDKINDIYLKFASYTNESLEQPLKIRSILNSKFVKPF-----E 307

Query: 310 SELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML 364
            E  N+P GP ++ A+ A L  G S +IF     D KN ++       GT+   +
Sbjct: 308 KEYLNSP-GPMIIFATPAMLINGPSLNIFKSICHDSKNTIILPGYCSKGTIGEKI 361


>gi|71661559|ref|XP_817799.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
           strain CL Brener]
 gi|70883012|gb|EAN95948.1| cleavage and polyadenylation specificity factor, putative
           [Trypanosoma cruzi]
          Length = 625

 Score =  113 bits (283), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 73/249 (29%), Positives = 126/249 (50%), Gaps = 7/249 (2%)

Query: 123 FQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKH 182
            QS      +  YH      GI   P  AGH+LG  ++ +   G   +Y  D++R  ++H
Sbjct: 15  LQSTIEKIETVEYHEEVTVNGIRFQPFNAGHVLGAALFMVDIAGMKTLYTGDFSRVPDRH 74

Query: 183 LNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRV 241
           L G  + S+  P +LI ++ N +     R++RE +F   +   ++ GG  L+PV + GR 
Sbjct: 75  LLGAEVPSY-SPDILIAESTNGIRELESREERETLFTTWVHDVVKGGGRCLVPVFALGRA 133

Query: 242 LELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
            ELLLILE+YW  H    + PIY+ + ++   +   ++F+  M D + +     R N F+
Sbjct: 134 QELLLILEEYWEAHKELQHIPIYYASSLAQRCMKLYQTFVSAMNDRVKQQHANHR-NPFV 192

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
            K++  L+     ++   GP +VLAS   L++G S ++F  W  D +N ++       GT
Sbjct: 193 FKYIHSLMETRSFEDT--GPCVVLASPGMLQSGISLELFERWCGDRRNGIIIAGYCVDGT 250

Query: 360 LARMLQADP 368
           +A+ +   P
Sbjct: 251 IAKDILTKP 259


>gi|340545979|gb|AEK51788.1| cleavage and polyadenylation specific factor 3 [Heteronotia binoei]
 gi|402696941|gb|AFQ90659.1| 73kDa cleavage and polyadenylation specific factor 3, partial
           [Malaclemys terrapin]
 gi|402696943|gb|AFQ90660.1| 73kDa cleavage and polyadenylation specific factor 3, partial
           [Testudo hermanni]
          Length = 220

 Score =  113 bits (283), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 63/213 (29%), Positives = 117/213 (54%), Gaps = 8/213 (3%)

Query: 143 GIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAY 202
           GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P +LI ++ 
Sbjct: 6   GIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPDILIIEST 64

Query: 203 NALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNY 259
              H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  H    + 
Sbjct: 65  YGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDI 124

Query: 260 PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGP 319
           PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    D+   GP
Sbjct: 125 PIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHFDDI--GP 180

Query: 320 KLVLASMASLEAGFSHDIFVEWASDVKNLVLFT 352
            +V+AS   +++G S ++F  W +D +N V+  
Sbjct: 181 SVVMASPGMMQSGLSRELFESWCTDKRNGVIIA 213


>gi|303389227|ref|XP_003072846.1| putative cleavage and polyadenylation specificity factor
           [Encephalitozoon intestinalis ATCC 50506]
 gi|303301989|gb|ADM11486.1| putative cleavage and polyadenylation specificity factor
           [Encephalitozoon intestinalis ATCC 50506]
          Length = 639

 Score =  113 bits (283), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 80/307 (26%), Positives = 146/307 (47%), Gaps = 25/307 (8%)

Query: 5   VQVTPL----SGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           V +TPL    +G++      +L+ +D    LI+CG +   D S+  P+     + DA+LL
Sbjct: 6   VSLTPLIRTETGIY-----CHLLEVDNVKILINCGASYTMDMSIYAPILPQILSCDAILL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           +      +G LPY + Q      VFS+ PV  LG + + +        +E D+       
Sbjct: 61  TSFGINCIGGLPYIL-QNNYYNKVFSSVPVKVLGKICLDEHLRGMGLEAEVDI------- 112

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
             F+ ++ + YSQ   ++     + +  + +G+ +GG ++KI+K  E ++   + N RKE
Sbjct: 113 GCFERISEIKYSQPTMVND----VEICAYNSGNSIGGCLYKISKGAEKIVVGFNANHRKE 168

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAG 239
            HL+G         ++ + +  + L       +R+ MF++AI   L  G  V+LPV  + 
Sbjct: 169 NHLDGMGFAGVGDCSLCVFNGNHVLAENISIAKRDNMFREAIGSALDLGRKVILPVKYS- 227

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
           R LE+ LIL  +  + S    I  L+Y     ++  KS +EW G+ ++  F   + N F 
Sbjct: 228 RFLEVALILNSFMGQRS--EKIACLSYFGQRFVERAKSMIEWAGEKVSSMFSEEKINPFE 285

Query: 300 LKHVTLL 306
            + +  +
Sbjct: 286 FEKIEFI 292


>gi|401826283|ref|XP_003887235.1| beta-CASP domain-containing protein [Encephalitozoon hellem ATCC
           50504]
 gi|392998394|gb|AFM98254.1| beta-CASP domain-containing protein [Encephalitozoon hellem ATCC
           50504]
          Length = 639

 Score =  113 bits (283), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 79/307 (25%), Positives = 152/307 (49%), Gaps = 25/307 (8%)

Query: 5   VQVTPL----SGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           V +TPL    +G++      +L+ ID    L++CG     D S+   +     + DA+LL
Sbjct: 6   VSLTPLIRTDTGIY-----CHLLEIDNVRILVNCGAPYTMDMSIYTSVLPQILSCDAILL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           +     ++GALPY + Q      +FS+ P+  LG + + D++L    + E + +T     
Sbjct: 61  TSFGVNYVGALPYIL-QNNYYNKIFSSVPIKVLGKICL-DEHLKGMGM-EVEGYT----- 112

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           + F+ ++ + YSQ   +      + +  + +G+ +GG ++KI+K  E ++  ++ N RKE
Sbjct: 113 ACFERISEIKYSQPTVIGN----VEICTYNSGNSIGGCIYKISKGAERIVIGLNMNHRKE 168

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAG 239
            HL+G         ++ + +  + L       +R+ MF++ +   L +GG V+LPV  + 
Sbjct: 169 NHLDGIGFSGIGDCSLCVVNGNHVLAENISVAKRDNMFREIVGSVLSSGGKVILPVKYS- 227

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
           R LE+ LIL    A+   N  I  L+Y     ++  +S +EW G+ ++  F   + N F 
Sbjct: 228 RFLEIALILNSMMAQR--NERIVCLSYFGQRFVERARSMIEWAGEKVSSMFSEEKVNPFE 285

Query: 300 LKHVTLL 306
            + +  +
Sbjct: 286 FEKIEFV 292


>gi|302667649|ref|XP_003025406.1| hypothetical protein TRV_00467 [Trichophyton verrucosum HKI 0517]
 gi|291189514|gb|EFE44795.1| hypothetical protein TRV_00467 [Trichophyton verrucosum HKI 0517]
          Length = 865

 Score =  113 bits (283), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 79/321 (24%), Positives = 150/321 (46%), Gaps = 25/321 (7%)

Query: 67  HLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD----LFTLDDIDSA 122
           H G+LPY + +      VF T     +    + D        S  D    L+   D  S 
Sbjct: 86  HSGSLPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVSNTSSSSDQRTSLYNEHDHLST 145

Query: 123 FQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKH 182
              +  + ++  + ++     I + P  AGH+LG  ++ I+  G ++++  DY+R +++H
Sbjct: 146 LPIIETIDFNTTHTINS----IRITPFPAGHVLGAAMFLISIAGLNILFTGDYSREEDRH 201

Query: 183 LNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRV 241
           L    +   V+  V+IT++   + + PPR +RE     +++  +  GG VL+PV + GR 
Sbjct: 202 LISAEVPKGVKIDVMITESTFGISSNPPRLEREAALMKSVTSIINRGGRVLMPVFALGRA 261

Query: 242 LELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA-- 297
            ELLLIL++YW+ H      PIY++  ++   +   ++++  M ++I + F      A  
Sbjct: 262 QELLLILDEYWSRHPELQKVPIYYIGNMARRCMVVYQTYIGAMNENIKRLFRQRMAEAEA 321

Query: 298 ----------FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKN 347
                     +  + V  L N    ++   G  ++LAS   L+ G S ++   WA + +N
Sbjct: 322 RGDKSVTAGPWDFRFVRSLRNLDRFEDV--GGCVMLASPGMLQTGTSRELLERWAPNERN 379

Query: 348 LVLFTERGQFGTLARMLQADP 368
            V+ T     GT+ + +  +P
Sbjct: 380 GVIMTGYSVEGTMGKQIINEP 400


>gi|407041778|gb|EKE40943.1| cleavage and polyadenylation specificity factor 73 kDa subunit,
           putative [Entamoeba nuttalli P19]
          Length = 751

 Score =  113 bits (282), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 96/378 (25%), Positives = 163/378 (43%), Gaps = 30/378 (7%)

Query: 2   GTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWND---HFDPSLLQPLSKVAS--TID 56
           G  +++ PL          +++   G N ++DCG +    H + +L  PL + A   +I+
Sbjct: 18  GNYLEIRPLGAGREVGRSCFILKYMGHNIMLDCGVHPAKPHGEAAL--PLFEHADIDSIE 75

Query: 57  AVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRL--GLLTMYDQYLSRRQ--VSEFD 112
            + ++H    H  +LPY + +      V  T P   +   L   + Q  S  Q   S   
Sbjct: 76  LLCVTHYHVDHCASLPYLILERQFKGKVLMTPPTKEIFGELFKEFHQMSSTIQPPKSVNP 135

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
              +D ID+             +H   +  G+ +    AGH+LG  ++ I  +G  ++Y 
Sbjct: 136 KEVMDRIDTI-----------KFHELQEYNGMKIWCFNAGHILGAAMFCIEINGVKILYT 184

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVL 232
            D++   ++HL    +  F    ++    Y  +  +    +   F   I + L+ GG  L
Sbjct: 185 GDFSGETDRHLQAAEVPPFQIDVMMCESTYGIIEQESRIDRENAFIRQIIEILKRGGKCL 244

Query: 233 LPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF 290
           +PV S GR  E  LILE+YW  H    +  I+F + ++     Y + F  +M   + K  
Sbjct: 245 IPVFSLGRAQEFELILEEYWQNHKDLWSVSIFFFSSIAKKCTTYFEKFTSFMNQELRKKT 304

Query: 291 ETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLV 349
           + + D  F+ +      + S  D A D  P +V+AS   L+ G S  IF  W +D KN V
Sbjct: 305 KQAFDFKFIREG-----SSSVDDGAIDYKPCVVMASPGMLQDGISRKIFERWCTDKKNGV 359

Query: 350 LFTERGQFGTLARMLQAD 367
           +       GTLA+ L  D
Sbjct: 360 IIPGYCVEGTLAKDLILD 377


>gi|67479721|ref|XP_655242.1| cleavage and polyadenylation specificity factor 73 kDa subunit
           [Entamoeba histolytica HM-1:IMSS]
 gi|56472366|gb|EAL49856.1| cleavage and polyadenylation specificity factor 73 kDa subunit,
           putative [Entamoeba histolytica HM-1:IMSS]
 gi|449703858|gb|EMD44220.1| cleavage and polyadenylation specificity factor 73 kDa subunit,
           putative [Entamoeba histolytica KU27]
          Length = 755

 Score =  113 bits (282), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 96/378 (25%), Positives = 163/378 (43%), Gaps = 30/378 (7%)

Query: 2   GTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWND---HFDPSLLQPLSKVAS--TID 56
           G  +++ PL          +++   G N ++DCG +    H + +L  PL + A   +I+
Sbjct: 18  GNYLEIRPLGAGREVGRSCFILKYMGHNIMLDCGVHPAKPHGEAAL--PLFEHADIDSIE 75

Query: 57  AVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRL--GLLTMYDQYLSRRQ--VSEFD 112
            + ++H    H  +LPY + +      V  T P   +   L   + Q  S  Q   S   
Sbjct: 76  LLCVTHYHVDHCASLPYLILERQFKGKVLMTPPTKEIFGELFKEFHQMSSTIQPPKSVNP 135

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
              +D ID+             +H   +  G+ +    AGH+LG  ++ I  +G  ++Y 
Sbjct: 136 KEVIDRIDTI-----------KFHELQEYNGMKIWCFNAGHILGAAMFCIEINGVKILYT 184

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVL 232
            D++   ++HL    +  F    ++    Y  +  +    +   F   I + L+ GG  L
Sbjct: 185 GDFSGETDRHLQAAEVPPFQIDVMMCESTYGIIEQESRIDRENAFIRQIIEILKRGGKCL 244

Query: 233 LPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF 290
           +PV S GR  E  LILE+YW  H    +  I+F + ++     Y + F  +M   + K  
Sbjct: 245 IPVFSLGRAQEFELILEEYWQNHKDLWSVSIFFFSSIAKKCTTYFEKFTSFMNQELRKKT 304

Query: 291 ETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLV 349
           + + D  F+ +  +     S  D A D  P +V+AS   L+ G S  IF  W +D KN V
Sbjct: 305 KQAFDFKFIREGSS-----SVDDGAIDYKPCVVMASPGMLQDGISRKIFERWCTDKKNGV 359

Query: 350 LFTERGQFGTLARMLQAD 367
           +       GTLA+ L  D
Sbjct: 360 IIPGYCVEGTLAKDLILD 377


>gi|366999893|ref|XP_003684682.1| hypothetical protein TPHA_0C00920 [Tetrapisispora phaffii CBS 4417]
 gi|357522979|emb|CCE62248.1| hypothetical protein TPHA_0C00920 [Tetrapisispora phaffii CBS 4417]
          Length = 822

 Score =  113 bits (282), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 119/472 (25%), Positives = 214/472 (45%), Gaps = 71/472 (15%)

Query: 22  LVSIDGFNFLIDCGW--NDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALP---YAMK 76
           L+  D    LID  W  N       ++  S +   +D +LLS P    LGA     Y   
Sbjct: 19  LLKFDNVTILIDPAWYSNSVSYSDSVKYWSTIIPEVDLILLSQPTVRSLGAFALIYYNFY 78

Query: 77  QLGLSA-PVFSTEPVYRLGLLTMYDQYLSRRQVSEFDL--FTLDDIDSAFQSVTRLTYSQ 133
              +S   V+ST PV  LG  +  + Y++R     +D     L+DI+ AF  +  + YSQ
Sbjct: 79  SHFISQIEVYSTLPVSNLGRTSTIELYVARGITGPYDSNEIDLEDIEKAFDMIQTIKYSQ 138

Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNG-TVLESFV 192
              L  K +G+    H +G  +GG+++ +    E +IYA  +N  ++  L+G ++L+S  
Sbjct: 139 LVDLKSKFDGLTFVAHNSGVNVGGSIFCLMTYTEKLIYAPKWNHTRDMILSGASLLDSAG 198

Query: 193 RP-------AVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELL 245
           +P         LITD  N    +  +++ + F+D + + L   G++++PV+ + + ++LL
Sbjct: 199 KPISALLGATALITDFSNFASTKSFKRKSKAFKDMLREGLYLNGSIVIPVEISSKFIDLL 258

Query: 246 LILEDYW----AEHSLNYP-IYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA--F 298
           + +++Y     ++     P I  ++Y     + Y KS LEW   ++TKS+E S+D A  F
Sbjct: 259 VQVQNYILDAKSQGQKTEPHILLVSYSRGRILTYAKSMLEWFSSTLTKSWE-SKDTASPF 317

Query: 299 LLKHVTLLINKSELDNAPDGPKL------------VLASMASLE-------------AGF 333
            L ++  ++   EL N P G K+            V+  ++ LE             +  
Sbjct: 318 DLGNLLHVVTPKELKNYP-GAKICFVSEVDLLINDVICRLSKLERTSVFLTSTNFEDSSV 376

Query: 334 SHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEE 393
             D++ +W  + +N  +  E GQ    +  +      + V         L  ++L A+ +
Sbjct: 377 VSDMYSKWKLEKQNKKV--EEGQSIIYSESISIRTSEEKV---------LKKKDLEAFTK 425

Query: 394 E-QTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVE 444
           E +TR +K + L  +LV E +    L           +   N+A A+ D+VE
Sbjct: 426 EIETRREKRKDLIVALVNESKKNKGLTD---------MFRKNSALANTDIVE 468


>gi|85001073|ref|XP_955255.1| cleavage and polyadenylation specificty factor, subunit [Theileria
           annulata strain Ankara]
 gi|65303401|emb|CAI75779.1| cleavage and polyadenylation specificty factor, subunit, putative
           [Theileria annulata]
          Length = 1282

 Score =  112 bits (281), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 99/391 (25%), Positives = 177/391 (45%), Gaps = 29/391 (7%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAV 58
           M   V++T L            V  D    + DCG +         P+ +    S ++  
Sbjct: 1   MDDRVRITVLGAGCEVGRSCVYVERDNSCLMFDCGLHPALSGVGALPVFEAVDISKVEVC 60

Query: 59  LLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-----QYLSRRQVSEFD- 112
           L++H    H GA+PY + +   +  +  T     +  L   D     Q L+ + + + D 
Sbjct: 61  LVTHFHLDHCGAVPYLLSKTKFNGRILMTPATKSICHLLWTDYARMEQLLTVKTIFDDDD 120

Query: 113 ----------LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKI 162
                     L++ +D++ A   +  + + Q   ++     I ++ + AGH+LG  ++ +
Sbjct: 121 GMDELVCGSGLYSFEDVEYALDRIETIDFHQEITVND----IKISCYRAGHVLGACMFLV 176

Query: 163 TKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAI 221
             DG  ++Y  DY+  K+KHL    + S     +LI+++   +     R QREM F   +
Sbjct: 177 EIDGVRILYTGDYSVEKDKHLPSAEIPS-TNVHLLISESTYGIRVHEERSQREMRFLHVV 235

Query: 222 SKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL--NYPIYFLTYVSSSTIDYVKSFL 279
              +   G  LLPV + GR  E+LLIL++YW  +    N PI++++ ++S ++   ++F+
Sbjct: 236 MDIIMREGKCLLPVFALGRSQEILLILDNYWENNRQLHNVPIFYISPLASKSLRVYETFV 295

Query: 280 EWMGDSITKSFETSRDNAFLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHDI 337
              GD I +S      N F  K V    +  ++ N    DGP +++ S   L+ G S ++
Sbjct: 296 GQCGDYIKQSVYNGF-NPFDFKFVKYARSIKQIRNYLLRDGPCIIMTSPGMLQGGPSLEV 354

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           F     D +N V+ T     GTLA  L+ DP
Sbjct: 355 FELICPDNRNGVVLTGYTVKGTLADELKKDP 385


>gi|342185150|emb|CCC94633.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
          Length = 308

 Score =  112 bits (280), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 90/301 (29%), Positives = 136/301 (45%), Gaps = 23/301 (7%)

Query: 3   TSVQVTPLSGVFNEN-PLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLS 61
           T++   P S  F+ N P+SYL+ IDG   L+DCGW+D F  S L  LS     + AVL S
Sbjct: 12  TNIYGAPSSDAFHPNTPMSYLLEIDGVRILMDCGWDDKFSVSYLDALSPYLGNLHAVLFS 71

Query: 62  HPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLS--------RRQVSEFDL 113
            P+    GALP+ M+++     V +     ++GL  +   +L         R +  E   
Sbjct: 72  SPELRSCGALPFVMERIPPGTYVSAAGATSKMGLHGVLHPFLYLYPNANVWRLETGEEFE 131

Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAV 173
            T+D + SAF+SV R  Y     ++ +G  +       G +LGG  W I    +++ Y  
Sbjct: 132 MTVDKVYSAFRSV-RQPYGSKVTVAHRGVEVECFSVFCGRMLGGCGWLIKYQIDELFYCP 190

Query: 174 DYNRRKEKHLNGTVLESFVRPA----VLITDAYNALHNQPPRQQREMFQDAISK---TLR 226
           D++ +    LN      FV P     + I      L     R+  E     I +   TLR
Sbjct: 191 DFSLKPSYALN-----RFVPPTTATLLFIDGTPFHLSGNAGRKYEEQLNVPIREVLNTLR 245

Query: 227 AGGNVLLPVDSAGRVLELLLILEDYWAEHSL-NYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
            G +VL+PV  AGR LE+L I+    AE    NY +   +  +S  I    +  E + D 
Sbjct: 246 YGKDVLIPVSVAGRGLEVLTIISHLLAEKGGDNYSVVLASLQASEIIAKASTMTESLKDE 305

Query: 286 I 286
           +
Sbjct: 306 V 306


>gi|156083689|ref|XP_001609328.1| cleavage and polyadenylation specifity factor [Babesia bovis T2Bo]
 gi|154796579|gb|EDO05760.1| cleavage and polyadenylation specifity factor [Babesia bovis]
          Length = 709

 Score =  112 bits (280), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 96/377 (25%), Positives = 171/377 (45%), Gaps = 44/377 (11%)

Query: 29  NFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFS 86
           N + DCG +         P+ +    S +D  L++H    H GA+PY + +      +F 
Sbjct: 42  NVMFDCGLHPALSGVGALPVFEAIDLSKVDLCLITHFHLDHCGAVPYLLSKTSFKGRIFM 101

Query: 87  TEPVYRLGLLTMYDQYLSRRQV----SEFD--------------------------LFTL 116
           T     +  L ++  Y    Q+    S FD                          L++ 
Sbjct: 102 TYATKAICHL-LWTDYARMEQLQTVKSIFDRTAPRDLQDGSDSKEGLMDELICGSGLYSF 160

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
           DD++ A   +  +    ++H      GI  + + AGH+LG +++ +  DG  ++Y  DY+
Sbjct: 161 DDVEYALSKIETI----DFHEEKDVGGIKFSCYRAGHVLGASMFLVEMDGVRILYTGDYS 216

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++H+    +   +   +LI ++   +     R QRE  F  ++ + +  GG  LLPV
Sbjct: 217 TEVDRHVPCAEIPP-INAHLLICESTYGIRIHEERVQRERRFLRSVIEIVTRGGKCLLPV 275

Query: 236 DSAGRVLELLLILEDYW-AEHSLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
            + GR  E+LLIL++YW A  +L   PI++++ ++  ++   ++F+   GD I +     
Sbjct: 276 FALGRAQEILLILDEYWQANRNLQPIPIFYISPLAQKSLRVYETFVGLCGDYIKECVYNG 335

Query: 294 RD--NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
            +  N   +K+   +   S+   A DGP +V+ S   L+ G S  IF + A D +N V+ 
Sbjct: 336 FNPFNFTFVKYARSVAEISQYLQA-DGPCIVMTSPGMLQGGPSLQIFEKIAPDSRNGVVL 394

Query: 352 TERGQFGTLARMLQADP 368
           T     GTLA  L+ DP
Sbjct: 395 TGYTVKGTLADELRRDP 411


>gi|378756419|gb|EHY66443.1| hypothetical protein NERG_00083 [Nematocida sp. 1 ERTm2]
          Length = 730

 Score =  112 bits (279), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 84/300 (28%), Positives = 144/300 (48%), Gaps = 18/300 (6%)

Query: 55  IDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSE-FDL 113
           I  ++L   D   LG L + ++ LG++AP++ T P+  LG +    + L R +V E F  
Sbjct: 54  ITHIILCSSDISSLGGLIH-LESLGINAPIYGTVPIKILGRI----EILERLKVLEKFHG 108

Query: 114 FTLDDI--DSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIY 171
            +  D+  D  F  +  L Y+Q   L    +GIVV P  +G  +GG +WKI K+ ++ I 
Sbjct: 109 NSSLDMKQDKIFDRIIPLKYTQTVELE---DGIVVGPLNSGSSVGGAIWKIRKNEQEWII 165

Query: 172 AVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGN 230
               N RKE HL+G  + +  +P  +I ++   +  Q  R+ R+    D++ K +   G 
Sbjct: 166 CDKINHRKEAHLDGLDISNISKPLGVIVNSTQVVKEQSTRRMRDKELVDSVVKCINGNGK 225

Query: 231 VLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF 290
           V +P     ++LE+ + L  Y  + +   P+   ++  +   D VK+ LEW G SI   F
Sbjct: 226 VFIPT-GYSQLLEIAMTL--YNHKETQEMPMALYSFYGNKYFDMVKTILEWTGSSILHKF 282

Query: 291 ETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
              ++N F L ++      +E  ++     ++        +GFS  I    A   KNL+L
Sbjct: 283 NQEKENPFNLLNLKFY---NECPDSEISENIIFVIDKHGNSGFSPVILPHIAKSSKNLIL 339


>gi|449670960|ref|XP_004207395.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           2-like [Hydra magnipapillata]
          Length = 105

 Score =  112 bits (279), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 48/102 (47%), Positives = 73/102 (71%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + ++ TPLSG  +E PL YL+ +D F FL+DCGW+++    +++ + + A +IDAVLL
Sbjct: 1   MTSIIRFTPLSGAQDEGPLCYLLQVDEFKFLLDCGWDENLSQDVIENIKRHAHSIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQY 102
           SHPD  HLGALPY + +  L+ PV++T PVY++G + +YD Y
Sbjct: 61  SHPDIYHLGALPYLIGKCNLNCPVYATIPVYKMGQMFLYDFY 102


>gi|209420822|gb|ACI46951.1| cyclin B [Fenneropenaeus penicillatus]
          Length = 475

 Score =  111 bits (278), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 85/268 (31%), Positives = 131/268 (48%), Gaps = 39/268 (14%)

Query: 278 FLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
            +EWM + +TK+F++ R N F  KH+    N ++L   P  PK+VLAS   L  G++ ++
Sbjct: 1   MIEWMSEKLTKAFDSLRTNPFSFKHLKFCHNLTDLSRLP-SPKVVLASFPDLGCGYAREL 59

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTR 397
           FV+WA++ KN ++ T R    TLAR L  +P  +  K+   RR+ L G EL    +E  R
Sbjct: 60  FVQWATNPKNTIILTSRTGPDTLARRLIDNPQIRTFKLLEKRRMKLEGSEL----DEHYR 115

Query: 398 LKKEEALKASLVKEEESKASLGPDNNL-----SGDPMVIDANNANASADVVEPHGGRYRD 452
           +K+EE  +   +K EE ++S   +N         D +V+     N S      H      
Sbjct: 116 MKREEEQQQQRIKMEEVESSSDSENEDGLEAGKHDIIVLHEKAGNQSMFRSRKHH----- 170

Query: 453 ILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYII---KDEDMDQ-AAMHIGGDD 508
                         PMFPF+E     DD+GE IN +D+ I   KD++ +    + I  +D
Sbjct: 171 --------------PMFPFHEEKIRGDDYGEYINLEDFDISSMKDDNKENLENLQIPYED 216

Query: 509 GKLDEGSASLILDAKPSKVVSNELTVLV 536
             L      + ++  PSK VS  +TV V
Sbjct: 217 DDL------MDIEEPPSKCVSQTVTVRV 238


>gi|387594701|gb|EIJ89725.1| hypothetical protein NEQG_00495 [Nematocida parisii ERTm3]
 gi|387596451|gb|EIJ94072.1| hypothetical protein NEPG_00738 [Nematocida parisii ERTm1]
          Length = 744

 Score =  111 bits (278), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 91/333 (27%), Positives = 156/333 (46%), Gaps = 19/333 (5%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLS 81
           +V ID    L++ G        +   L  + S I  ++L   D   LG L + ++ LG+ 
Sbjct: 22  IVEIDNLRILVNFGTEYDLSLDIYSDLEYLKS-ITHIILCSSDISSLGGLIH-LESLGID 79

Query: 82  APVFSTEPVYRLGLLTMYDQYLSRRQVSE-FDLFTLDDI--DSAFQSVTRLTYSQNYHLS 138
            P++ T P+  LG +    + L R +V E F      +   D  F  +  L Y+Q   LS
Sbjct: 80  VPIYGTVPIKILGRI----EILERIKVLEKFHSIGSSEAKQDKVFDKIIPLKYTQTVELS 135

Query: 139 GKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLI 198
              +GI V P  +G  +GG+VWKI K+ ++ +     N RKE HL+G    +  +P  ++
Sbjct: 136 ---DGIFVGPLNSGSSVGGSVWKIRKNEQEWLICDKVNHRKEAHLDGLDTSNISKPLGIV 192

Query: 199 TDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL 257
            ++ + +  Q  R+ R+    D I K +   G V +P     ++LE+++ L ++     L
Sbjct: 193 VNSTHVIKEQNTRRMRDKELVDCIVKCINNKGKVFIPT-GYSQLLEIVMTLYNHKDTQEL 251

Query: 258 NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPD 317
              +Y  ++  S   D VK+ LEW G SI + F   ++N F L ++    N+      P+
Sbjct: 252 TMALY--SFYGSKYFDMVKTILEWTGSSILQKFNQEKENPFNLLNLKFY-NECADCEIPE 308

Query: 318 GPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
               V+    +  +GFS  I    A + +NL+L
Sbjct: 309 DIIFVIDRHGN--SGFSPVILPGIAKNPQNLIL 339


>gi|71027889|ref|XP_763588.1| cleavage and polyadenylation specificity factor protein [Theileria
           parva strain Muguga]
 gi|68350541|gb|EAN31305.1| cleavage and polyadenylation specificity factor protein, putative
           [Theileria parva]
          Length = 708

 Score =  111 bits (277), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 90/363 (24%), Positives = 165/363 (45%), Gaps = 30/363 (8%)

Query: 30  FLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFST 87
            + DCG +         P+ +    S +   L++H    H GA+PY + +   +  +  T
Sbjct: 30  LMFDCGLHPALSGVGALPVFEAVDISKVQVCLVTHFHLDHCGAVPYLLSKTKFNGRILMT 89

Query: 88  EPVYRLGLLTMYD-----QYLSRRQVSEFD------------LFTLDDIDSAFQSVTRLT 130
                +  L   D     Q L+ + +   D            L++ +D++ A   +  + 
Sbjct: 90  PATKSICHLLWTDYARMEQLLTVKTIFNDDDESMDELVCGSGLYSFEDVEHALDRIETID 149

Query: 131 YSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLES 190
           + Q   ++     + ++ + AGH+LG  ++ I   G  ++Y  DY+  K++HL    +  
Sbjct: 150 FHQEITVND----MKISCYRAGHVLGACMFLIEIGGVRILYTGDYSMEKDRHLPSAEI-P 204

Query: 191 FVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILE 249
                +LI+++   +     R QREM F   +   +   G  LLPV + GR  E+LLIL+
Sbjct: 205 LTNVHLLISESTYGIRVHEERSQREMRFLHVVMDIIMRNGKCLLPVFALGRSQEILLILD 264

Query: 250 DYWAEHSL--NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLI 307
           DYW  +    N PI++++ ++S ++   ++F+   G+ I +S      N F  K V    
Sbjct: 265 DYWENNKQLHNVPIFYISPLASKSLKVYETFVGQCGEYIKQSVYNGF-NPFNFKFVRYAR 323

Query: 308 NKSELDN--APDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQ 365
           +  ++ N    DGP +++ S   L+ G S ++F     D +N V+ T     GTLA  L+
Sbjct: 324 SIKQIRNYLLRDGPCIIMTSPGMLQGGPSLEVFELLCPDNRNGVVLTGYAVKGTLADELK 383

Query: 366 ADP 368
            DP
Sbjct: 384 KDP 386


>gi|157870438|ref|XP_001683769.1| putative cleavage and polyadenylation specificity factor
           [Leishmania major strain Friedlin]
 gi|68126836|emb|CAJ04467.1| putative cleavage and polyadenylation specificity factor
           [Leishmania major strain Friedlin]
          Length = 828

 Score =  111 bits (277), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 122/449 (27%), Positives = 198/449 (44%), Gaps = 55/449 (12%)

Query: 4   SVQVTPLSGVFNEN-PLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSH 62
           S+Q TP+      N P +YLV IDG   L DCGWND FD S L  L     T+ AV+LS 
Sbjct: 8   SIQFTPVYECTTPNAPYAYLVDIDGVRILFDCGWNDEFDTSFLNKLKPHLPTVHAVVLSS 67

Query: 63  PDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDD---- 118
           P     GALP+ +  +     V +     ++G+ ++   +L   Q      FTL D    
Sbjct: 68  PHITACGALPFVLSHISPGTFVAAAGGTSKIGVHSVLHSFLY--QYPNSHTFTLADGEAF 125

Query: 119 ---IDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHV--AGHLLGGTVWKITKDGEDVIYAV 173
              +DS + S   L       ++ K + + V      AG +LGG  W I    +++ Y  
Sbjct: 126 TMTVDSIYHSFRSLREPYGGKVTVKNDDVEVNCFAVFAGRMLGGYSWTIKYQIDELFYCP 185

Query: 174 DYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPP-------------RQQREMFQDA 220
           D++ +    L         +P  + T A   L +  P              Q + +F++ 
Sbjct: 186 DFSVKPSYAL---------KPFDVPTTANIVLASSFPFHMTGANRTTKYEEQLKSLFKE- 235

Query: 221 ISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLN-YPIYFLTYVSSSTIDYVKSFL 279
              TLR G +VL+PV+ AGR LE+L I+    AE   + Y +  +   +   +D   +  
Sbjct: 236 FQHTLRGGSDVLVPVNVAGRGLEVLNIIVHLLAEQGGDKYKVVLVAAQAQELLDKAGTMT 295

Query: 280 EWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAP-DGPKLVLASMASLEAGFSHDI- 337
           E + D +        D+  L   V  L  +S  +  P  GPK+ +A  ASL+ G S ++ 
Sbjct: 296 EALQDYLIL------DDKRLFASV--LTCRSAEEVLPIQGPKICVADGASLDFGPSAELL 347

Query: 338 --FVEWASD-VKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVG------EEL 388
             FV+   D   +L++ TE    GT A ++ A    + + + ++RR  L G         
Sbjct: 348 EYFVKGNRDGADHLIVLTEPPLPGTNAAVVTAAADGERLHMQITRRSRLSGEELEEYYIE 407

Query: 389 IAYEEEQTRLKKEEALKASLVKEEESKAS 417
           + +E EQ R + E      +V++++  A+
Sbjct: 408 LEHEMEQRRRELEAQSAFQIVQDDDEAAT 436


>gi|328854195|gb|EGG03329.1| hypothetical protein MELLADRAFT_90299 [Melampsora larici-populina
           98AG31]
          Length = 695

 Score =  110 bits (276), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 131/607 (21%), Positives = 241/607 (39%), Gaps = 147/607 (24%)

Query: 193 RPAVLITDAYNALHNQPPRQQREM-----------FQDAISKTLRAGGNVLLPVDSAGRV 241
           RP V++     +L     ++ R+              D I+ TLR+  +V +P D++ R+
Sbjct: 9   RPLVMMIGTERSLTKSIRKKDRDQVLFMTYITSFDLTDTIASTLRSSHSVFIPTDASARL 68

Query: 242 LELLLILEDYWAEHSLN-YPIYFLTYVSSSTIDYVKSFLEWMG-----DSITKSFETSRD 295
           +EL+++L+  W    L  +P+  ++      I +++S  EWM      +S  KS   +RD
Sbjct: 69  IELIIMLDTLWTTSRLEPFPLCLVSQTGKDMITFLRSLTEWMSPLTPTESQLKS--RARD 126

Query: 296 N-----AFLLKHVTLL--INKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNL 348
                 A  L+++     I   E   A   PK +LA   ++  GFS  +F        NL
Sbjct: 127 EGPGGIALRLRNLKFFNSIEALESQTAAIQPKCILAVPLTMAYGFSRRMFTRHVGKPGNL 186

Query: 349 VLFTERGQFGTLARMLQAD---------------PPP----KAVKVTMSRRVPLVGEELI 389
           V+ T  G+  +L R L AD               P P     +V V + R+V L GEEL 
Sbjct: 187 VVLTSMGEKESLTRWL-ADQVNEKSEAKYGSGTIPEPIDLNTSVSVELKRKVVLEGEELE 245

Query: 390 AYEEEQTRLKKEEAL-KASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGG 448
            Y E++ R K+     +A LV+       +  + + S      D   +N+  +  E    
Sbjct: 246 QYLEDKQRAKERRTKHEAMLVRSRR----MIDEEDDSDRMSSSDDQESNSETETQEKPAS 301

Query: 449 RYRDILI---------DGFVPPSTSVA-----------PMFPFYENNSEWDDFGEVINPD 488
           R +             D FV  + ++A            MFPF +   + D +GE++N D
Sbjct: 302 RKKPFTKLTQAKVATWDEFVDETETIAFDIYVKGSHRIKMFPFVDRRRKVDAYGEMLNVD 361

Query: 489 DYIIKDEDMDQAAM---HIGGDDGKLDEGSASLILDAKPSKVVSN--------------- 530
           +++ + + + ++ +   ++G      +       ++  P K VS                
Sbjct: 362 EWLRRGDSVQESTIKNENVGKKRKWEEGEEGEDGVEEPPHKFVSETEEVKVVCKVLLIDL 421

Query: 531 ------------------ELTVLVHGSAEATEHLKQH--CLKHVCPHVYTPQIEETIDVT 570
                             +  VL++G++E  +    +   +      +++P+I E   + 
Sbjct: 422 EGKADGRALQTIIPHINPKTVVLINGTSETHQEFISNVSAIPSFTTQIFSPKIGECSVIG 481

Query: 571 SDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----------------EVGKTENGMLS 614
            D  ++ V+LS+ LMS++   K+  +E+ ++                   +G   +  L+
Sbjct: 482 HDTKSFSVRLSDDLMSSIKLSKVEGFEVGYLTGILQVLDESSIPTLERLPIGLNNSTQLT 541

Query: 615 LLPISTPAPP------------HK---------SVLVGDLKMADLKPFLSSKGIQVEFAG 653
                T  P             H+         ++ +G++K+  LK +L+S GIQ EF G
Sbjct: 542 RYNQRTSKPKDTENEESKLDISHRLDALPITSSTIFIGEIKLIGLKSYLNSIGIQAEFTG 601

Query: 654 -GALRCG 659
            G L CG
Sbjct: 602 EGVLICG 608


>gi|428671767|gb|EKX72682.1| cleavage and polyadenylation specificity factor protein, putative
           [Babesia equi]
          Length = 732

 Score =  110 bits (276), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 91/376 (24%), Positives = 168/376 (44%), Gaps = 45/376 (11%)

Query: 31  LIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTE 88
           + DCG +         P+ +    + +   L++H    H GA+PY + + G    +  T 
Sbjct: 40  MFDCGLHPALSGVGALPVFEAVDITKVKVCLVTHFHLDHCGAIPYLLSKTGFKGKILMTC 99

Query: 89  PVYRLGLLTMYDQYLSRRQVSE----FD---------------------------LFTLD 117
               +  L ++  Y    Q+      FD                           L++ +
Sbjct: 100 ATKAICHL-LWTDYARMEQLCSVKKIFDHTDKLNPDGTSNEEDEDVVDELVCGSGLYSFE 158

Query: 118 DIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNR 177
           D++ A   +  +    ++H     +GI ++ + AGH+LG  ++ +  DG  ++Y  DY+ 
Sbjct: 159 DVEYALNHIETI----DFHEERSFDGIKISCYRAGHVLGACMFLVEMDGVRILYTGDYST 214

Query: 178 RKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVD 236
             ++HL    + + +   +LI+++   +     R QRE  F   +   L   G  LLPV 
Sbjct: 215 EYDRHLPSAEIPN-INVHLLISESTYGIRIHEERTQREARFLHVVLDILMRDGKCLLPVF 273

Query: 237 SAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSR 294
           + GR  E+LLILE+YWA +    + PI++++ ++S ++   ++F+   G+ + +S     
Sbjct: 274 ALGRAQEILLILEEYWAANKQLQSIPIFYISPLASKSLRVYETFIGLCGEYVKESVYNGH 333

Query: 295 DNAFLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFT 352
            N F  K V    +   +      DGP +V+ S   L+ G S ++F  +A D +N V+ T
Sbjct: 334 -NPFNFKFVKYAKSVESIRTYLLRDGPCVVMTSPGMLQGGPSLEVFEIFAPDNRNGVILT 392

Query: 353 ERGQFGTLARMLQADP 368
                GTLA  L+ DP
Sbjct: 393 GYTVKGTLADALKKDP 408


>gi|402696939|gb|AFQ90658.1| 73kDa cleavage and polyadenylation specific factor 3, partial
           [Draco beccarii]
          Length = 220

 Score =  110 bits (275), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 62/213 (29%), Positives = 115/213 (53%), Gaps = 8/213 (3%)

Query: 143 GIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAY 202
           GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P +LI ++ 
Sbjct: 6   GIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPDILIIEST 64

Query: 203 NALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW--AEHSLNY 259
              H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW         
Sbjct: 65  YGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQXXXXXXEI 124

Query: 260 PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGP 319
           PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    D+   GP
Sbjct: 125 PIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHFDDI--GP 180

Query: 320 KLVLASMASLEAGFSHDIFVEWASDVKNLVLFT 352
            +V+AS   +++G S ++F  W +D +N V+  
Sbjct: 181 SVVMASPGMMQSGLSRELFESWCTDKRNGVIIA 213


>gi|399216074|emb|CCF72762.1| unnamed protein product [Babesia microti strain RI]
          Length = 725

 Score =  110 bits (275), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 93/369 (25%), Positives = 167/369 (45%), Gaps = 34/369 (9%)

Query: 26  DGFNFLIDCGWNDHFDPSLLQPLSKVAST--IDAVLLSHPDTLHLGALPYAMKQLGLSAP 83
           +G   + DCG +         P+ +  S   ++  L++H    H GA+PY + +      
Sbjct: 22  EGKQVMFDCGLHPALSGVGALPVFEAISIEKVNLCLVTHFHLDHCGAVPYLVGKTSFKGT 81

Query: 84  VFSTEPVYRLGLLTMYDQ-----------------YLSRRQVSEFDLFTLDDIDSAFQSV 126
           +  TEP   +  L   D                  Y     ++   LF  +D+  AF+ +
Sbjct: 82  IVMTEPTRVICRLMWADYEKMGKTLQGQTKIGEEGYAMDELITGSGLFNSEDVKKAFEMI 141

Query: 127 TRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT 186
             + + +   +    +GI +  + AGH+LG  ++ +   G  V+Y  DY+  +++H+   
Sbjct: 142 RTIDFHEEIEI----DGIKLTCYGAGHVLGACMFMVEIGGIRVLYTGDYSSEQDRHVPKA 197

Query: 187 VLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELL 245
            +   +   +LI ++         R QRE     +I   +  GG  LLPV + GR  E+L
Sbjct: 198 EIPP-IDVHLLICESTYGTRIHDERTQRETRLIRSILNAVDNGGKCLLPVFALGRAQEIL 256

Query: 246 LILEDYW-AEHSLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHV 303
           LILE+YW A   L+  PI++++ +SS  +   ++F+   G+ I +  +   +N +   H+
Sbjct: 257 LILEEYWKANRRLHRVPIFYISPLSSKALKVYETFIGVCGEHIKRRVQQG-ENPYHFTHI 315

Query: 304 ----TLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
               T+   +S L    D P +++ S   L+ G S D+F   A D +N V+ T     GT
Sbjct: 316 KYAPTVDSVRSHL--LRDAPCVIMTSPGMLQGGPSRDVFEIIAPDNRNGVILTGYTVKGT 373

Query: 360 LARMLQADP 368
           LA  L+ +P
Sbjct: 374 LADELKKEP 382


>gi|327408312|emb|CCA30123.1| unnamed protein product [Neospora caninum Liverpool]
          Length = 1183

 Score =  109 bits (273), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 82/288 (28%), Positives = 131/288 (45%), Gaps = 33/288 (11%)

Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGE-GIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
           F   D+ ++ +  T L   + +   G  E  + + P  AGH+LG  ++++      V+Y 
Sbjct: 391 FEQSDVAASAERATALRLREAWREGGASEDALQLTPFYAGHVLGAAMFELKIGNTSVVYT 450

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNV 231
            D+N   ++HL    L   +RP VLI++   A   +P ++  E  F   +  TL  GG V
Sbjct: 451 GDFNTIPDRHLGSASLPC-LRPDVLISECTYASFVRPSKRTVERDFCAVVHDTLTKGGKV 509

Query: 232 LLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEW---------- 281
           L+PV + GR  EL ++LE+YW    L++PIYF   ++     Y + ++ W          
Sbjct: 510 LIPVFAVGRAQELCMLLENYWERMHLHFPIYFAGGMTERANVYYRLYVHWSKANGSVDAG 569

Query: 282 MGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEW 341
            GD +  S       AF   H+  L  +S L +AP  P ++LA+   L  G +      W
Sbjct: 570 AGDELPTS-------AFSFPHI--LPFQSSLLSAPT-PLVLLATPGMLHGGLALKALKAW 619

Query: 342 ASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELI 389
           A D  NLVL       GT+  ML          +   R++PL G   +
Sbjct: 620 AGDQANLVLLPGYCVRGTVGAML----------IAGQRQIPLDGHATL 657


>gi|401423165|ref|XP_003876069.1| cleavage and polyadenylation specificity factor,putative
           [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|322492310|emb|CBZ27584.1| cleavage and polyadenylation specificity factor,putative
           [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 822

 Score =  109 bits (272), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 119/450 (26%), Positives = 198/450 (44%), Gaps = 45/450 (10%)

Query: 4   SVQVTPLSGVFNEN-PLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSH 62
           S+Q T +      N P +YLV IDG   L DCGWND FD S L  L     T+ AV+LS 
Sbjct: 8   SIQFTSVYECTTPNAPYAYLVEIDGVRILFDCGWNDEFDTSFLDKLKPYLPTVHAVILSS 67

Query: 63  PDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDD---- 118
           P     GALP+ +  +     V +     ++G+ ++   +L   Q      FTL D    
Sbjct: 68  PHITACGALPFVLSHISPGTFVAAAGGTSKIGVHSVLHSFLY--QYPNSHTFTLADGESF 125

Query: 119 ---IDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHV--AGHLLGGTVWKITKDGEDVIYAV 173
              +DS + S   L       ++ K + + V      AG +LGG  W +    +++ Y  
Sbjct: 126 TMTVDSIYHSFRSLREPYGGKVTVKNDDVEVNCFAVFAGRMLGGYSWTVKYQIDELFYCP 185

Query: 174 DYNRRKEKHL---------NGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
           D++ +    L         N  +  SF  P  +      A + +   Q + +F++    T
Sbjct: 186 DFSVKPSYALKPFDVPTTANIVLASSF--PFHMTGANRTAKYEE---QLKSLFKE-FQHT 239

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLN-YPIYFLTYVSSSTIDYVKSFLEWMG 283
           LR G +VL+PV+ AGR LE+L I+    AE   + Y +  +   +   +D   +  E + 
Sbjct: 240 LRGGSDVLVPVNVAGRGLEVLNIIVHLLAEQGGDKYKVVLVAAQAQELLDKAGTMTEALQ 299

Query: 284 DSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI---FVE 340
           D +        D+  L  +V L    +E      GPK+ +   ASL+ G S ++   FV+
Sbjct: 300 DYLIL------DDKRLFANV-LTCRSAEEALTIQGPKICVTDGASLDFGPSAELLEYFVK 352

Query: 341 WASD-VKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVG------EELIAYEE 393
              D   +L++ TE    GT A ++ A    + + + ++RR  L G         + +E 
Sbjct: 353 GNRDGADHLIVLTEPPLPGTNAAVVTAAADGERLHMQITRRSRLSGEELEEYYIELEHEM 412

Query: 394 EQTRLKKEEALKASLVKEEESKASLGPDNN 423
           EQ R + E      +V++++  A+   + N
Sbjct: 413 EQRRRELEAQSAFQIVQDDDEAAAAKREEN 442


>gi|357618299|gb|EHJ71335.1| hypothetical protein KGM_14386 [Danaus plexippus]
          Length = 324

 Score =  108 bits (271), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 79/294 (26%), Positives = 145/294 (49%), Gaps = 30/294 (10%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +         P   +  A  +D +L+SH    H GALP+ + +  
Sbjct: 37  MLEFKGKKIMLDCGIHPGLSGMDALPFVDLIEADEVDLLLISHFHLDHSGALPWFLTKTS 96

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVS-EFDLFTLDDIDSAFQSVTRLTYSQNY 135
               VF   +T+ +YR     +   Y+    +S E  L+T  D++ +   +  +    N+
Sbjct: 97  FKGRVFMTHATKAIYRW----LVSDYIKVSNISTEQMLYTESDLEGSMDRIETI----NF 148

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H      G+    + AGH+LG  ++ I   G  V+Y  D++R++++HL    + + V P 
Sbjct: 149 HEEKDVRGVRFWAYNAGHVLGAAMFMIEIAGVKVLYTGDFSRQEDRHLMAAEIPT-VHPD 207

Query: 196 VLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT           R++RE  F   +S  +  GG  L+PV + GR  ELLLIL++YW+ 
Sbjct: 208 VLITK----------REERESRFTTLVSDVVGRGGRCLIPVFALGRAQELLLILDEYWSL 257

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLL 306
           H    + PIY+ + ++   +   ++++  M D I +  + + +N F+ +H++ L
Sbjct: 258 HPELQDIPIYYASSLAKKCMAVYQTYVNAMNDRIRR--QIAVNNPFVFRHISNL 309


>gi|19173576|ref|NP_597379.1| CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR 100kDa SUBUNIT
           [Encephalitozoon cuniculi GB-M1]
 gi|19170782|emb|CAD26556.1| CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR 100kDa SUBUNIT
           [Encephalitozoon cuniculi GB-M1]
          Length = 639

 Score =  108 bits (271), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 78/307 (25%), Positives = 145/307 (47%), Gaps = 25/307 (8%)

Query: 5   VQVTPL----SGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           V +TPL    +GV+      +++ ID    L++CG     D S+  P+     + DA+LL
Sbjct: 6   VSLTPLIKTETGVY-----CHMLEIDNTKILVNCGAPYAMDMSMYTPVLPQILSCDAILL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           +  +   +G LPY ++       VFS+ P+  LG + + +        S  D        
Sbjct: 61  TSFNINCIGGLPYVLRN-NYYNKVFSSVPIKVLGKICLDEHLRGMGLESSVDT------- 112

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
             F+ ++ + YSQ   ++     + +  + +G+ +GG ++KI+K  E +I   + N RKE
Sbjct: 113 GCFERISEIKYSQPTAVNN----VEICAYNSGNSIGGCLYKISKGPERIIVGFNVNHRKE 168

Query: 181 KHLNGTVLESFVRPAVLITDAYNAL-HNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAG 239
            HL+G         ++ + +  + L  N    ++ ++F+D +   L +G  V+LPV  + 
Sbjct: 169 NHLDGMSFSGIGDCSLCVFNGNHVLAENISIAKRDDVFRDMVGGALDSGRKVVLPVKYS- 227

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
           R LE+ LIL    A+   N  I  L+Y     ++  KS +EW G+ ++  F   + N F 
Sbjct: 228 RFLEVALILNGLMAQR--NGKIACLSYFGQRFVERAKSMIEWAGEKVSSMFSEEKVNPFE 285

Query: 300 LKHVTLL 306
            + +  +
Sbjct: 286 FERIEFM 292


>gi|449329090|gb|AGE95364.1| cleavage and polyadenylation specificity factor 100kDa subunit
           [Encephalitozoon cuniculi]
          Length = 639

 Score =  108 bits (271), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 78/307 (25%), Positives = 145/307 (47%), Gaps = 25/307 (8%)

Query: 5   VQVTPL----SGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           V +TPL    +GV+      +++ ID    L++CG     D S+  P+     + DA+LL
Sbjct: 6   VSLTPLIKTETGVY-----CHMLEIDNTKILVNCGAPYAMDMSMYTPVLPQILSCDAILL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           +  +   +G LPY ++       VFS+ P+  LG + + +        S  D        
Sbjct: 61  TSFNINCIGGLPYVLRN-NYYNKVFSSVPIKVLGKICLDEHLRGMGLESSVDT------- 112

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
             F+ ++ + YSQ   ++     + +  + +G+ +GG ++KI+K  E +I   + N RKE
Sbjct: 113 GCFERISEIKYSQPTAVNN----VEICAYNSGNSIGGCLYKISKGPERIIVGFNVNHRKE 168

Query: 181 KHLNGTVLESFVRPAVLITDAYNAL-HNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAG 239
            HL+G         ++ + +  + L  N    ++ ++F+D +   L +G  V+LPV  + 
Sbjct: 169 NHLDGMSFSGIGDCSLCVFNGNHVLAENISIAKRDDVFRDMVGGALDSGRKVVLPVKYS- 227

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
           R LE+ LIL    A+   N  I  L+Y     ++  KS +EW G+ ++  F   + N F 
Sbjct: 228 RFLEVALILNGLMAQR--NGKIACLSYFGQRFVERAKSMIEWAGEKVSSMFSEEKVNPFE 285

Query: 300 LKHVTLL 306
            + +  +
Sbjct: 286 FERIEFM 292


>gi|47229058|emb|CAG03810.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 698

 Score =  108 bits (270), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 68/194 (35%), Positives = 102/194 (52%), Gaps = 8/194 (4%)

Query: 162 ITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDA 220
           I  DGE +    DYN   ++HL    ++   RP +LI+++  A   +  ++ RE  F   
Sbjct: 235 IRVDGE-LSQQGDYNMTPDRHLGAAWIDK-CRPDILISESTYATTIRDSKRCRERDFLKK 292

Query: 221 ISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLE 280
           + +T+  GG VL+PV + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ 
Sbjct: 293 VHETIERGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFIT 352

Query: 281 WMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVE 340
           W    I K+F   + N F  KH+    ++S  DN   GP +V A+   L AG S  IF +
Sbjct: 353 WTNQKIRKTF--VQRNMFEFKHIKAF-DRSYADNP--GPMVVFATPGMLHAGQSLQIFKK 407

Query: 341 WASDVKNLVLFTER 354
           WA + KN+V F  R
Sbjct: 408 WAGNEKNMVQFLRR 421


>gi|39645207|gb|AAH13904.2| CPSF3L protein, partial [Homo sapiens]
          Length = 429

 Score =  108 bits (270), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 64/185 (34%), Positives = 99/185 (53%), Gaps = 7/185 (3%)

Query: 168 DVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLR 226
            V+Y  DYN   ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+ 
Sbjct: 1   SVVYTGDYNMTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVE 59

Query: 227 AGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSI 286
            GG VL+PV + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I
Sbjct: 60  RGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKI 119

Query: 287 TKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVK 346
            K+F   + N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + K
Sbjct: 120 RKTF--VQRNMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEK 174

Query: 347 NLVLF 351
           N+V+ 
Sbjct: 175 NMVIM 179


>gi|261191614|ref|XP_002622215.1| endoribonuclease ysh1 [Ajellomyces dermatitidis SLH14081]
 gi|239589981|gb|EEQ72624.1| endoribonuclease ysh1 [Ajellomyces dermatitidis SLH14081]
          Length = 894

 Score =  108 bits (269), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 75/283 (26%), Positives = 141/283 (49%), Gaps = 23/283 (8%)

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
           L+T  D  S    +  + ++  + ++     I + P  AGH+LG  ++ I+  G ++++ 
Sbjct: 146 LYTEQDHLSTLSQIEAIDFNTTHTINS----IRITPFPAGHVLGAAMFLISIAGLNILFT 201

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNV 231
            DY+R +++HL        ++  VLIT++   + + PPR +RE     +I+  L  GG V
Sbjct: 202 GDYSREEDRHLISAEAPKGIKIDVLITESTFGVSSNPPRLEREAALMKSITGVLNRGGRV 261

Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           L+PV + GR  ELLLIL++YW+ H      PIY++  ++   +   ++++  M ++I + 
Sbjct: 262 LMPVFALGRAQELLLILDEYWSRHPELQKIPIYYIGNIARRCMVVYQTYIGAMNENIKRL 321

Query: 290 F-------ETSRDNA-----FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
           F       E S D +     +  + V  + +    D+   G  ++LAS   L+ G S ++
Sbjct: 322 FRQRMAEAEASGDKSVSAGPWDFRFVRSVRSIERFDDV--GGCVMLASPGMLQTGTSREL 379

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRR 380
              WA   +N V+ T     GT+ + +  +  P+ +   MS R
Sbjct: 380 LERWAPSERNGVIMTGYSVEGTMGKQILNE--PEQIPAVMSGR 420


>gi|146088435|ref|XP_001466050.1| putative cleavage and polyadenylation specificity factor
           [Leishmania infantum JPCM5]
 gi|134070152|emb|CAM68485.1| putative cleavage and polyadenylation specificity factor
           [Leishmania infantum JPCM5]
          Length = 819

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 121/455 (26%), Positives = 200/455 (43%), Gaps = 55/455 (12%)

Query: 4   SVQVTPLSGVFNEN-PLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSH 62
           S+Q T +      N P +YL+ IDG   L DCGWND FD S L  L     T+ AV+LS 
Sbjct: 8   SIQFTSVYECTTPNAPYAYLIEIDGVRILFDCGWNDEFDTSFLSKLKPHLPTVHAVVLSS 67

Query: 63  PDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDD---- 118
           P     GALP+ +  +     V +     ++G+ ++   +L   Q      FTL D    
Sbjct: 68  PHITACGALPFVLSHISPGTFVAAAGGTSKVGVHSVLHSFLY--QYPNSHTFTLADGEAF 125

Query: 119 ---IDSAFQSVTRLTYSQNYHLSGKGEGIVVA--PHVAGHLLGGTVWKITKDGEDVIYAV 173
              +DS + S   L       ++ K   + V      AG +LGG  W I    +++ Y  
Sbjct: 126 TMTVDSIYHSFRSLREPYGGKVTVKNGDVEVNCLAVFAGRMLGGYSWIIKYQIDELFYCP 185

Query: 174 DYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPP-------------RQQREMFQDA 220
           D++ +    L         +P  + T A   L +  P              Q + +F++ 
Sbjct: 186 DFSVKPSYAL---------KPFDVPTTANIVLASSFPFHMTGSNRTTKYEEQLKNLFKE- 235

Query: 221 ISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLN-YPIYFLTYVSSSTIDYVKSFL 279
              TLR G +VL+PV+ AGR LE+L I+    AE   + Y +  +   +   +D   +  
Sbjct: 236 FQHTLRGGSDVLVPVNVAGRGLEVLNIIVHLLAEQGGDKYKVVLVAAQAQELLDKAGTMT 295

Query: 280 EWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAP-DGPKLVLASMASLEAGFSHDI- 337
           E + D +        D+  L  +V  L  +S  +  P  GPK+ +A  ASL+ G S ++ 
Sbjct: 296 EALQDYLIL------DDKRLFANV--LTCRSAEEVLPIQGPKICVADGASLDFGPSAELL 347

Query: 338 --FVEWASD-VKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVG------EEL 388
             FV+   D   +L++ TE    GT A ++ A    + + + ++RR  L G         
Sbjct: 348 EYFVKGNRDGADHLIVLTEPPLPGTNAAVVTAAADGERLHMQITRRSRLSGEELEEYYIE 407

Query: 389 IAYEEEQTRLKKEEALKASLVKEEESKASLGPDNN 423
           + +E EQ R + E      +V++++  A++  + N
Sbjct: 408 LEHEMEQRRRELEARSAFQIVQDDDEAATVKGEEN 442


>gi|398016320|ref|XP_003861348.1| cleavage and polyadenylation specificity factor, putative
           [Leishmania donovani]
 gi|322499574|emb|CBZ34647.1| cleavage and polyadenylation specificity factor, putative
           [Leishmania donovani]
          Length = 818

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 121/455 (26%), Positives = 200/455 (43%), Gaps = 55/455 (12%)

Query: 4   SVQVTPLSGVFNEN-PLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSH 62
           S+Q T +      N P +YL+ IDG   L DCGWND FD S L  L     T+ AV+LS 
Sbjct: 8   SIQFTSVYECTTPNAPYAYLIEIDGVRILFDCGWNDEFDTSFLSKLKPHLPTVHAVVLSS 67

Query: 63  PDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDD---- 118
           P     GALP+ +  +     V +     ++G+ ++   +L   Q      FTL D    
Sbjct: 68  PHITACGALPFVLSHISPGTFVAAAGGTSKVGVHSVLHSFLY--QYPNSHTFTLADGEAF 125

Query: 119 ---IDSAFQSVTRLTYSQNYHLSGKGEGIVVA--PHVAGHLLGGTVWKITKDGEDVIYAV 173
              +DS + S   L       ++ K   + V      AG +LGG  W I    +++ Y  
Sbjct: 126 TMTVDSIYHSFRSLREPYGGKVTVKNGDVEVNCLAVFAGRMLGGYSWIIKYQIDELFYCP 185

Query: 174 DYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPP-------------RQQREMFQDA 220
           D++ +    L         +P  + T A   L +  P              Q + +F++ 
Sbjct: 186 DFSVKPSYAL---------KPFDVPTTANIVLASSFPFHMTGSNRTTKYEEQLKNLFKE- 235

Query: 221 ISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLN-YPIYFLTYVSSSTIDYVKSFL 279
              TLR G +VL+PV+ AGR LE+L I+    AE   + Y +  +   +   +D   +  
Sbjct: 236 FQHTLRGGSDVLVPVNVAGRGLEVLNIIVHLLAEQGGDKYKVVLVAAQAQELLDKAGTMT 295

Query: 280 EWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAP-DGPKLVLASMASLEAGFSHDI- 337
           E + D +        D+  L  +V  L  +S  +  P  GPK+ +A  ASL+ G S ++ 
Sbjct: 296 EALQDYLIL------DDKRLFANV--LTCRSAEEVLPIQGPKICVADGASLDFGPSAELL 347

Query: 338 --FVEWASD-VKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVG------EEL 388
             FV+   D   +L++ TE    GT A ++ A    + + + ++RR  L G         
Sbjct: 348 EYFVKGNRDGADHLIVLTEPPLPGTNAAVVTAAADGERLHMQITRRSRLSGEELEEYYIE 407

Query: 389 IAYEEEQTRLKKEEALKASLVKEEESKASLGPDNN 423
           + +E EQ R + E      +V++++  A++  + N
Sbjct: 408 LEHEMEQRRRELEARSAFQIVQDDDEAATVKGEEN 442


>gi|327351648|gb|EGE80505.1| cleavage and polyadenylation specificity factor [Ajellomyces
           dermatitidis ATCC 18188]
          Length = 983

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 107/431 (24%), Positives = 173/431 (40%), Gaps = 101/431 (23%)

Query: 8   TPLSGV--FNENPLSYLVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPD 64
           TPL G    +   +  ++ +DG    L+D GW++ FD S L  L      +   LL    
Sbjct: 5   TPLLGAQSSSSRAVQSILELDGGVKILVDVGWDESFDVSALAELENPVIALGRTLL---- 60

Query: 65  TLHLGALPYAMKQLGLSAPVFST-EPVYRLGLLTMYDQYLSRRQVSEFDLFTLD------ 117
                      ++L  SAP+ +T  P    G L+     + +R     D   +D      
Sbjct: 61  -----------QELYASAPLAATFLPKATSGDLSPPSP-VPKRATRSADTTNVDHDDPPG 108

Query: 118 ---------DIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGHLLGGTVWKIT 163
                    +I   F  +  L YSQ +            G+ +  + AGH +GGT+W I 
Sbjct: 109 ILLPPPTSEEIARYFSLIHPLKYSQPHQPLPSPFSPPLNGLTLTAYNAGHTVGGTIWHIQ 168

Query: 164 KDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLI--TDAYNALHNQP 209
              E +IYAVD+N+ +E  + G             V+E   +P   +  T   + L    
Sbjct: 169 HGMESIIYAVDWNQARENVIAGAAWFGGSGASGTEVVEQLRKPTAFVCSTRGGDKLSLLG 228

Query: 210 PRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS---------LNY 259
            R++R ++  D I  +   GG VL+P D++ RVLEL  +LE  W E +          + 
Sbjct: 229 GRKRRDDLLLDMIRSSFSKGGTVLIPTDTSARVLELAYVLEHAWRESAETADGADPLKSG 288

Query: 260 PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSR------------------------- 294
            +Y     +  T+   +S LEWM + I + FE                            
Sbjct: 289 ALYLAGKKAHGTMRLTRSMLEWMDEGIVREFEAGHGDPVAVSGKGRQDGPSQRNPLTGMP 348

Query: 295 ----DNA------FLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHDIFVEWA 342
               D A      F  K++ ++  K++LD     + PK++L S  SL+ G+S  +    A
Sbjct: 349 DKRGDGAFKALGPFTFKYLKIVERKAKLDKILGSNTPKVILTSDTSLDWGYSKHVLQNIA 408

Query: 343 SDVKNLVLFTE 353
           +  +NLV+ TE
Sbjct: 409 TGSENLVILTE 419



 Score = 47.8 bits (112), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 47/173 (27%), Positives = 69/173 (39%), Gaps = 54/173 (31%)

Query: 558 VYTPQIEETIDVTSDLCAYKVQLSEKLMSNV------------LFKKLGDYEIAWVDAEV 605
           ++TP I ET+D + D  A+ V+LS  L+  +            L  +L   E+   D + 
Sbjct: 786 IFTPVIGETVDASVDTNAWMVKLSSALVKRLKWQNVRSLGVVALTGELRAPELTAADEDA 845

Query: 606 GKTENGMLSLLPISTPAP---------PHKSVL----------------------VGDLK 634
            +       LLP + P+          P K+ L                      VGDL+
Sbjct: 846 PEVSQKKQRLLPDNAPSTGGNEQKQLVPSKNALPLLDVLPVKMAAATRSVTRALHVGDLR 905

Query: 635 MADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEG 686
           +ADL+  + S G   EF G G L    +V +RK          SGT +I IEG
Sbjct: 906 LADLRKLMQSSGHTAEFRGEGTLLIDGFVAVRK----------SGTGKIEIEG 948


>gi|302499334|ref|XP_003011663.1| hypothetical protein ARB_02217 [Arthroderma benhamiae CBS 112371]
 gi|291175215|gb|EFE31023.1| hypothetical protein ARB_02217 [Arthroderma benhamiae CBS 112371]
          Length = 749

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 68/238 (28%), Positives = 124/238 (52%), Gaps = 13/238 (5%)

Query: 144 IVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYN 203
           I + P  AGH+LG  ++ I+  G ++++  DY+R +++HL    +   V+  V+IT++  
Sbjct: 59  IRITPFPAGHVLGAAMFLISIAGLNILFTGDYSREEDRHLISAEVPKGVKIDVMITESTF 118

Query: 204 ALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYP 260
            + + PPR +RE     +++  +  GG VL+PV + GR  ELLLIL++YW+ H      P
Sbjct: 119 GISSNPPRLEREAALMKSVTSIINRGGRVLMPVFALGRAQELLLILDEYWSRHPELQKVP 178

Query: 261 IYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLL--KHVT-------LLINKSE 311
           IY++  ++   +   ++++  M ++I + F      A     K VT        + +   
Sbjct: 179 IYYIGNMARRCMVVYQTYIGAMNENIKRLFRQRMAEAEARGDKSVTAGPWDFRFVRSLRN 238

Query: 312 LDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           LD   D G  ++LAS   L+ G S ++   WA + +N V+ T     GT+ + +  +P
Sbjct: 239 LDRFEDVGGCVMLASPGMLQTGTSRELLERWAPNERNGVIMTGYSVEGTMGKQIINEP 296


>gi|239610975|gb|EEQ87962.1| cleavage and polyadenylation specificity factor [Ajellomyces
           dermatitidis ER-3]
          Length = 983

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 107/431 (24%), Positives = 173/431 (40%), Gaps = 101/431 (23%)

Query: 8   TPLSGV--FNENPLSYLVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPD 64
           TPL G    +   +  ++ +DG    L+D GW++ FD S L  L      +   LL    
Sbjct: 5   TPLLGAQSSSSRAVQSILELDGGVKILVDVGWDESFDVSALAELENPVIALGRTLL---- 60

Query: 65  TLHLGALPYAMKQLGLSAPVFST-EPVYRLGLLTMYDQYLSRRQVSEFDLFTLD------ 117
                      ++L  SAP+ +T  P    G L+     + +R     D   +D      
Sbjct: 61  -----------QELYASAPLAATFLPKATSGDLSPPSP-VPKRATRSADTTNVDHDEPPG 108

Query: 118 ---------DIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGHLLGGTVWKIT 163
                    +I   F  +  L YSQ +            G+ +  + AGH +GGT+W I 
Sbjct: 109 ILLPPPTSEEIARYFSLIHPLKYSQPHQPLPSPFSPPLNGLTLTAYNAGHTVGGTIWHIQ 168

Query: 164 KDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLI--TDAYNALHNQP 209
              E +IYAVD+N+ +E  + G             V+E   +P   +  T   + L    
Sbjct: 169 HGMESIIYAVDWNQARENVIAGAAWFGGSGASGTEVVEQLRKPTAFVCSTRGGDKLSLLG 228

Query: 210 PRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS---------LNY 259
            R++R ++  D I  +   GG VL+P D++ RVLEL  +LE  W E +          + 
Sbjct: 229 GRKRRDDLLLDMIRSSFSKGGTVLIPTDTSARVLELAYVLEHAWRESAETADGADPLKSG 288

Query: 260 PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSR------------------------- 294
            +Y     +  T+   +S LEWM + I + FE                            
Sbjct: 289 ALYLAGKKAHGTMRLTRSMLEWMDEGIVREFEAGHGDPVAVSGKGRQDGPSQRNPLTGMP 348

Query: 295 ----DNA------FLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHDIFVEWA 342
               D A      F  K++ ++  K++LD     + PK++L S  SL+ G+S  +    A
Sbjct: 349 DKRGDGAFKALGPFTFKYLKIVERKAKLDKILGSNTPKVILTSDTSLDWGYSKHVLQNIA 408

Query: 343 SDVKNLVLFTE 353
           +  +NLV+ TE
Sbjct: 409 TGSENLVILTE 419



 Score = 47.8 bits (112), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 47/173 (27%), Positives = 69/173 (39%), Gaps = 54/173 (31%)

Query: 558 VYTPQIEETIDVTSDLCAYKVQLSEKLMSNV------------LFKKLGDYEIAWVDAEV 605
           ++TP I ET+D + D  A+ V+LS  L+  +            L  +L   E+   D + 
Sbjct: 786 IFTPVIGETVDASVDTNAWMVKLSSALVKRLKWQNVRSLGVVALTGELRAPELTAADEDA 845

Query: 606 GKTENGMLSLLPISTPAP---------PHKSVL----------------------VGDLK 634
            +       LLP + P+          P K+ L                      VGDL+
Sbjct: 846 PEVSQKKQRLLPDNAPSTGGNEQKQLVPSKNALPLLDVLPVKMAAATRSVTRALHVGDLR 905

Query: 635 MADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEG 686
           +ADL+  + S G   EF G G L    +V +RK          SGT +I IEG
Sbjct: 906 LADLRKLMQSSGHTAEFRGEGTLLIDGFVAVRK----------SGTGKIEIEG 948


>gi|14520957|ref|NP_126432.1| mRNA 3'-end processing factor, [Pyrococcus abyssi GE5]
 gi|5458174|emb|CAB49663.1| Cleavage and polyadenylation specficity factor [Pyrococcus abyssi
           GE5]
          Length = 651

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 103/405 (25%), Positives = 175/405 (43%), Gaps = 42/405 (10%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWN-----------DHFDPSLLQPLSKVAS 53
           +++T L G       + LV  D    L+D G N            HFD    Q + K   
Sbjct: 189 IRITGLGGFREVGRSALLVQTDESYVLVDFGVNVAALNDPYKAFPHFDAPEFQYVLK-EG 247

Query: 54  TIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDL 113
            +DA++++H    H G LPY  +      P+++T P   L +L   D    ++   +  L
Sbjct: 248 LLDAIIITHAHLDHSGMLPYLFRYNLFDGPIYTTPPTRDLMVLLQKDFIEIQQSNGQDPL 307

Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTV--WKITKDGEDVIY 171
           +   DI    +    L Y +   +S     I +  H AGH+LG  +    I     ++  
Sbjct: 308 YRPRDIKEVIKHTITLDYGEVRDIS---PDIRLTLHNAGHILGSAIVHLHIGNGLHNIAI 364

Query: 172 AVDYNRRKEKHLNGTVLE----SFVRPAVLITDAYNALHN--QPPRQQRE-MFQDAISKT 224
             D+     K +   +LE     F R   L+ ++     N  Q PR++ E    + I +T
Sbjct: 365 TGDF-----KFIPTRLLEPANAKFPRLETLVMESTYGGANDIQMPREEAEKRLIEVIHQT 419

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGD 284
           L+ GG VL+P  + GR  E++++LEDY    +++ PIY    +  +T  +  ++ E++  
Sbjct: 420 LKRGGKVLIPAMAVGRAQEVMMVLEDYARIGAIDAPIYLDGMIWEATAIHT-AYPEYLSR 478

Query: 285 SITKSFETSRDNAFLLKHVTLLINKSELDNAPDG--PKLVLASMASLEAGFSHDIFVEWA 342
            + +       N FL +    + N  E  +  D   P +++AS   L  G S + F + A
Sbjct: 479 RLREQIFKEGYNPFLSEIFHPVANSKERQDIIDSNEPAIIIASSGMLVGGPSVEYFKQLA 538

Query: 343 SDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEE 387
            D +N ++F      GTL R +Q+            R +P+VGEE
Sbjct: 539 PDPRNSIIFVSYQAEGTLGRQVQSG----------VREIPMVGEE 573


>gi|380741511|tpe|CCE70145.1| TPA: mRNA 3'-end processing factor, putative [Pyrococcus abyssi
           GE5]
          Length = 648

 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 103/405 (25%), Positives = 175/405 (43%), Gaps = 42/405 (10%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWN-----------DHFDPSLLQPLSKVAS 53
           +++T L G       + LV  D    L+D G N            HFD    Q + K   
Sbjct: 186 IRITGLGGFREVGRSALLVQTDESYVLVDFGVNVAALNDPYKAFPHFDAPEFQYVLK-EG 244

Query: 54  TIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDL 113
            +DA++++H    H G LPY  +      P+++T P   L +L   D    ++   +  L
Sbjct: 245 LLDAIIITHAHLDHSGMLPYLFRYNLFDGPIYTTPPTRDLMVLLQKDFIEIQQSNGQDPL 304

Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTV--WKITKDGEDVIY 171
           +   DI    +    L Y +   +S     I +  H AGH+LG  +    I     ++  
Sbjct: 305 YRPRDIKEVIKHTITLDYGEVRDIS---PDIRLTLHNAGHILGSAIVHLHIGNGLHNIAI 361

Query: 172 AVDYNRRKEKHLNGTVLE----SFVRPAVLITDAYNALHN--QPPRQQRE-MFQDAISKT 224
             D+     K +   +LE     F R   L+ ++     N  Q PR++ E    + I +T
Sbjct: 362 TGDF-----KFIPTRLLEPANAKFPRLETLVMESTYGGANDIQMPREEAEKRLIEVIHQT 416

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGD 284
           L+ GG VL+P  + GR  E++++LEDY    +++ PIY    +  +T  +  ++ E++  
Sbjct: 417 LKRGGKVLIPAMAVGRAQEVMMVLEDYARIGAIDAPIYLDGMIWEATAIHT-AYPEYLSR 475

Query: 285 SITKSFETSRDNAFLLKHVTLLINKSELDNAPDG--PKLVLASMASLEAGFSHDIFVEWA 342
            + +       N FL +    + N  E  +  D   P +++AS   L  G S + F + A
Sbjct: 476 RLREQIFKEGYNPFLSEIFHPVANSKERQDIIDSNEPAIIIASSGMLVGGPSVEYFKQLA 535

Query: 343 SDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEE 387
            D +N ++F      GTL R +Q+            R +P+VGEE
Sbjct: 536 PDPRNSIIFVSYQAEGTLGRQVQSG----------VREIPMVGEE 570


>gi|452825586|gb|EME32582.1| RNA-metabolising metallo-beta-lactamase family protein [Galdieria
           sulphuraria]
          Length = 370

 Score =  106 bits (265), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 93/353 (26%), Positives = 155/353 (43%), Gaps = 28/353 (7%)

Query: 31  LIDCGWNDHFDPSLLQP---LSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFST 87
           ++DCG +  +      P   L+     + AV ++H    H+GALP   ++ G   P++ +
Sbjct: 1   MLDCGLHPSYQDDRRYPNFGLAFSYGPLKAVFITHCHADHVGALPILTERWGYDGPIYMS 60

Query: 88  EPVYRLGLLTMY--------DQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSG 139
           EP  +L    +         D   +    SE+  +T  +++S    VT +   Q+  +  
Sbjct: 61  EPTRKLSYYILEECVGSWGGDDEWTDSSRSEWS-YTQREVESCLTKVTIMEPGQSISV-- 117

Query: 140 KGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA-VLI 198
            GE + V   +AGH+LG  ++ I  D   ++Y  D+      HL    ++    P  V++
Sbjct: 118 -GENVQVHSWMAGHVLGAYMFSIVVDNHRILYTGDFTSCPTFHLPPARVDDIPYPPDVIL 176

Query: 199 TDAYNALHNQPPR--QQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS 256
           ++A  A   +  R   Q E  Q+ +   L  GG VL+PV + GR  ELLL+LE YW    
Sbjct: 177 SEATYATSFKDGRLNNQVEFIQNVLD-CLLDGGKVLVPVFAIGRAQELLLLLEMYWQRFH 235

Query: 257 LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITK-----SFETSRDNAFLLKHVTLLINKSE 311
           L++PI F T  +   +     F  W     T+     S++T      ++    LL    E
Sbjct: 236 LSFPILFSTKNAHQVLQIYTEFAHWTRTPSTRDEQMMSYQTWWSRVQVVDPEQLLDAVEE 295

Query: 312 LDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML 364
            D     P + L +  +L  G S  +F   A D KNL++       GT+ + L
Sbjct: 296 WDR----PLVALTTPGTLARGLSLQVFRRIAPDEKNLLIIPHFCISGTIEKRL 344


>gi|399216826|emb|CCF73513.1| unnamed protein product [Babesia microti strain RI]
          Length = 646

 Score =  106 bits (265), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 98/405 (24%), Positives = 164/405 (40%), Gaps = 78/405 (19%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVAST--------------------IDAVLLS 61
           +V+I G   + DCG +  ++ +   PL  +  +                    ID ++L+
Sbjct: 22  IVTIGGRKVMFDCGAHSGYNDNRRYPLFSLLESKESPITVNSSNKTEKISNFDIDCIILT 81

Query: 62  HPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-------QYLSRRQVSEFDL- 113
           H    H GALPY  + LG   P+  + P   L  + + D       ++  +  + + D  
Sbjct: 82  HFHIDHCGALPYFTENLGYDGPILMSYPTKALTPILLKDSCRVQSLKHTKKNPIMDSDKS 141

Query: 114 ------------------FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLL 155
                             FT   ++ +      L    + H+      + + P+ AGH+L
Sbjct: 142 FMALLNENPAASYEESLNFTEQSVEKSLSRAIPLQLHSDTHIGD----LTIRPYYAGHVL 197

Query: 156 GGTVWKITKDGEDVIYAV---------------DYNRRKEKHLNGTVLESFVRPAVLITD 200
           G +++ +    + V+Y                 D+N   +KHL    +   + P VLI +
Sbjct: 198 GASIFAVRYKSQLVVYTGTNSFNAIRQKTIQLGDFNTMSDKHLGPAKIPK-LEPDVLICE 256

Query: 201 AYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNY 259
           +  A   +P R+  E+    A+  TL  GG VL+PV + GR  EL +ILE +W   +LNY
Sbjct: 257 STYATIVRPSRRSAEVELCKAVKDTLDHGGKVLIPVFAVGRAQELAIILECFWKRVNLNY 316

Query: 260 PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGP 319
           PIYF   +S     Y K     + D   + FE++  +AF   H    IN+         P
Sbjct: 317 PIYFAGGMSERASTYYKLHSYALMDLDGQLFESTLISAF--DHD--FINEKR-------P 365

Query: 320 KLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML 364
            ++ A+   L  G S  +   WA D  NL++       GT+   L
Sbjct: 366 MVLFATPGMLNGGLSLSVCKAWAPDPHNLIIIPGYCIQGTVGNRL 410


>gi|397651897|ref|YP_006492478.1| cleavage and polyadenylation specifity factor protein [Pyrococcus
           furiosus COM1]
 gi|393189488|gb|AFN04186.1| cleavage and polyadenylation specifity factor protein [Pyrococcus
           furiosus COM1]
          Length = 648

 Score =  106 bits (264), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 100/404 (24%), Positives = 175/404 (43%), Gaps = 42/404 (10%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWN-----------DHFDPSLLQPLSKVAS 53
           +++T L G       + LV  D    L+D G N            HFD    Q + K   
Sbjct: 186 IRITGLGGFREVGRSALLVQTDESFVLVDFGINVAALNDPYKAFPHFDAPEFQYVLK-EG 244

Query: 54  TIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDL 113
            +DA++++H    H G LPY  +      P+++T P   L +L   D    ++   +  L
Sbjct: 245 LLDAIVITHAHLDHSGMLPYLFRYNLFDGPIYTTPPTRDLMVLLQKDFVEIQQSNGQEPL 304

Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTV--WKITKDGEDVIY 171
           +   DI    +    L Y +   +S     + +  H AGH+LG  +    I     ++  
Sbjct: 305 YKPKDIKEVIKHTITLDYGEVRDIS---PDVRLTLHNAGHILGSAIVHLHIGNGLHNIAV 361

Query: 172 AVDYNRRKEKHLNGTVLE----SFVRPAVLITDAYNALHN--QPPRQQRE-MFQDAISKT 224
             D+     K +   +LE     F R   L+ ++     N  Q PR++ E    + I +T
Sbjct: 362 TGDF-----KFIPTRLLEPASYRFPRLETLVMESTYGGANDIQMPREEAEKRLIEVIHQT 416

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGD 284
           +R GG VL+P  + GR  E++++LE+Y     ++ PIY    +  +T  +  ++ E++  
Sbjct: 417 IRRGGKVLIPAMAVGRAQEIMMVLEEYARVGGIDVPIYLDGMIWEATAIHT-AYPEYLSK 475

Query: 285 SITKSFETSRDNAFLLKHVTLLINKSELDNAPDG--PKLVLASMASLEAGFSHDIFVEWA 342
           ++ +       N FL +    + N  E  +  D   P +++AS   L  G S + F + A
Sbjct: 476 TLREQIFKEDYNPFLSEIFHPVANSKERQDIIDSNEPAIIIASSGMLVGGPSVEYFKQLA 535

Query: 343 SDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGE 386
           SD +N ++F      GTL R +Q+            R +P++GE
Sbjct: 536 SDKRNSIIFVSYQAEGTLGRQVQSG----------VREIPMIGE 569


>gi|225679068|gb|EEH17352.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
          Length = 984

 Score =  106 bits (264), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 106/431 (24%), Positives = 170/431 (39%), Gaps = 100/431 (23%)

Query: 8   TPLSGV--FNENPLSYLVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPD 64
           TPL G    +   +  ++ +DG    L+D GW+  FD S L  L          LL    
Sbjct: 5   TPLLGAQSSSSRAVQSILELDGGVKILVDVGWDHSFDTSALAELESPVIAFGRSLL---- 60

Query: 65  TLHLGALPYAMKQLGLSAPVFST-EPVYRLGLLTMYDQYLSRRQVSEFDLFT-------- 115
                      + L  SAP+ +T  P    G  +      SR  +S     T        
Sbjct: 61  -----------QDLYASAPLAATFWPPATAGASSPTSAAASRTAISPESADTDQNERPRI 109

Query: 116 ------LDDIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGHLLGGTVWKITK 164
                  ++I   F  +  L YSQ +            G+ +  + AGH +GGT+W I  
Sbjct: 110 LLPPPSTEEIARYFSLIQPLKYSQPHQPLPSPFSPPLNGLTLTAYNAGHTVGGTIWHIQH 169

Query: 165 DGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLI--TDAYNALHNQPP 210
             E +IYAVD+N+ +E  + G             V+E   +P  L+  T   + L     
Sbjct: 170 GMESIIYAVDWNQARENVIAGAAWFGGSGGSGTEVVEQLRKPTALVCSTRGGDKLALSGG 229

Query: 211 RQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS---------LNYP 260
           R++R ++  D +      GG VL+P+D++ RVLEL  +LE  W E +             
Sbjct: 230 RKRRDDLLLDMLRSCFSKGGTVLIPMDTSARVLELAYVLEHAWRESAETADGEDPLKGAG 289

Query: 261 IYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSR-------------------------- 294
           +Y     +  T+   +S LEWM + I + FE                             
Sbjct: 290 LYLAGRKAHGTMRLARSMLEWMDEGIVREFEAGHGRDPVTGGGKGRSDGPSQRNAPASVP 349

Query: 295 ----DNA------FLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHDIFVEWA 342
               DNA      F  +H+ ++  K++LD     + P+++L    SLE G+S  +  + A
Sbjct: 350 DKKSDNASKGLGPFTFRHLKIVERKTKLDKILGSNAPQVILTPDTSLEWGYSKHVLQKIA 409

Query: 343 SDVKNLVLFTE 353
           +  +NL++ TE
Sbjct: 410 AGSENLIILTE 420


>gi|18977777|ref|NP_579134.1| cleavage and polyadenylation specifity factor protein [Pyrococcus
           furiosus DSM 3638]
 gi|18893520|gb|AAL81529.1| cleavage and polyadenylation specifity factor protein [Pyrococcus
           furiosus DSM 3638]
          Length = 651

 Score =  106 bits (264), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 100/404 (24%), Positives = 175/404 (43%), Gaps = 42/404 (10%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWN-----------DHFDPSLLQPLSKVAS 53
           +++T L G       + LV  D    L+D G N            HFD    Q + K   
Sbjct: 189 IRITGLGGFREVGRSALLVQTDESFVLVDFGINVAALNDPYKAFPHFDAPEFQYVLK-EG 247

Query: 54  TIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDL 113
            +DA++++H    H G LPY  +      P+++T P   L +L   D    ++   +  L
Sbjct: 248 LLDAIVITHAHLDHSGMLPYLFRYNLFDGPIYTTPPTRDLMVLLQKDFVEIQQSNGQEPL 307

Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTV--WKITKDGEDVIY 171
           +   DI    +    L Y +   +S     + +  H AGH+LG  +    I     ++  
Sbjct: 308 YKPKDIKEVIKHTITLDYGEVRDIS---PDVRLTLHNAGHILGSAIVHLHIGNGLHNIAV 364

Query: 172 AVDYNRRKEKHLNGTVLE----SFVRPAVLITDAYNALHN--QPPRQQRE-MFQDAISKT 224
             D+     K +   +LE     F R   L+ ++     N  Q PR++ E    + I +T
Sbjct: 365 TGDF-----KFIPTRLLEPASYRFPRLETLVMESTYGGANDIQMPREEAEKRLIEVIHQT 419

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGD 284
           +R GG VL+P  + GR  E++++LE+Y     ++ PIY    +  +T  +  ++ E++  
Sbjct: 420 IRRGGKVLIPAMAVGRAQEIMMVLEEYARVGGIDVPIYLDGMIWEATAIHT-AYPEYLSK 478

Query: 285 SITKSFETSRDNAFLLKHVTLLINKSELDNAPDG--PKLVLASMASLEAGFSHDIFVEWA 342
           ++ +       N FL +    + N  E  +  D   P +++AS   L  G S + F + A
Sbjct: 479 TLREQIFKEDYNPFLSEIFHPVANSKERQDIIDSNEPAIIIASSGMLVGGPSVEYFKQLA 538

Query: 343 SDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGE 386
           SD +N ++F      GTL R +Q+            R +P++GE
Sbjct: 539 SDKRNSIIFVSYQAEGTLGRQVQSG----------VREIPMIGE 572


>gi|331212217|ref|XP_003307378.1| hypothetical protein PGTG_00328 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|309297781|gb|EFP74372.1| hypothetical protein PGTG_00328 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 950

 Score =  106 bits (264), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 73/273 (26%), Positives = 129/273 (47%), Gaps = 35/273 (12%)

Query: 115 TLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKI--------TKDG 166
           +  ++  AF SV  + YSQ  HL  K   + +  H +GH +GGT+W +        +   
Sbjct: 169 SFKELRDAFDSVIAVRYSQPIHLGRKLRPLTLTAHKSGHTIGGTIWSLRSPLHTVSSASS 228

Query: 167 EDVIYAVDYNRRKEKHLNGTVLES------------FVRPAVLITDAYNALHNQPPRQQR 214
             +IYA  +N  +E HL+   L                RP V++     +L     ++ R
Sbjct: 229 STLIYAPIFNHVRESHLDSAALVQATGDGSMRIGLGMSRPMVMVVGTERSLIKGIRKKDR 288

Query: 215 E-MFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLN-YPIYFLTYVSSSTI 272
           + +  D+I++TLRA   VL+P D + R++ELLL+L+ +W +  L+ +P+  ++      +
Sbjct: 289 DRILLDSITQTLRASRTVLIPTDPSARLIELLLLLDSHWTQSRLDSFPLCLVSQTGKDVV 348

Query: 273 DYVKSFLEWMGDSITKSF-------ETSRDN----AFLLKHVTLL--INKSELDNAPDGP 319
            +++S  EWM  ++ +S          +RD        L+H+     +   E +     P
Sbjct: 349 TFIRSLTEWMSPALARSSFDQNHHKRGNRDQNDQGPLRLRHIRFFNSVEALEAELPIRQP 408

Query: 320 KLVLASMASLEAGFSHDIFVEWASDVKNLVLFT 352
           K++LA   S+E GFS  +F   A    NL++ T
Sbjct: 409 KVILAVPLSMEYGFSRAMFTRIAGVEGNLIILT 441



 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 50/192 (26%), Positives = 92/192 (47%), Gaps = 24/192 (12%)

Query: 4   SVQVTPLSGVFNENP-LSYLVSIDGFNFLIDCGWNDHFDP----SLLQPLSKVASTIDAV 58
           ++++TPL G  +    LSYL+ ID    L+DCG  D   P      L  L+++  ++D V
Sbjct: 2   AIKLTPLIGAHDSTGILSYLLEIDEGRILLDCGCPDRPTPGEIDGYLNKLAELTPSLDLV 61

Query: 59  LLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDD 118
           LLSHP    LG +P    +LGL  P+++T P   +G     ++++ +R + E      + 
Sbjct: 62  LLSHPLLSSLGLVPLLRARLGLRCPIYATLPTKEMGRWAA-EEWIGQRALEES-----NG 115

Query: 119 IDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGT---------VWKIT----KD 165
           I+++ QS   L+   +     +   ++V P      +  +         +WK++    +D
Sbjct: 116 IENSTQSAENLSLQLSSDQPAQNIPVIVEPENLSKSVPPSHSNSNNSDHIWKVSFKELRD 175

Query: 166 GEDVIYAVDYNR 177
             D + AV Y++
Sbjct: 176 AFDSVIAVRYSQ 187


>gi|408404164|ref|YP_006862147.1| beta-lactamase [Candidatus Nitrososphaera gargensis Ga9.2]
 gi|408364760|gb|AFU58490.1| beta-lactamase domain protein [Candidatus Nitrososphaera gargensis
           Ga9.2]
          Length = 700

 Score =  106 bits (264), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 165/371 (44%), Gaps = 27/371 (7%)

Query: 10  LSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVAST---------IDAVLL 60
           L GV       ++V       ++DCG N    P  +  L+              +DAV++
Sbjct: 251 LGGVKQVGRSCFIVVTPESKVMLDCGIN----PGEMSGLNAYPRLDWFNFDLDDLDAVII 306

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
            H    H G LP A+ + G   PV+ TEP   L  L   D          +  +   D++
Sbjct: 307 GHAHIDHQGFLP-ALFKYGYKGPVYCTEPTLPLMTLLQMDSVKIANSNGTYLPYEARDVN 365

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
              +    L Y +   +S     I +    AGH++G     +   G  +++Y+ DY   +
Sbjct: 366 EVIKHCITLPYGKPTDIS---PDITITLQNAGHIMGSATVHLNISGAHNILYSGDYKYAR 422

Query: 180 EKHLNGTVLESFVRPAVLITDA-YNALHNQPPRQQ--REMFQDAISKTLRAGGNVLLPVD 236
            + L+  V   + R   LIT++ Y    +  P QQ     F ++I+KTL  GG VL+PV 
Sbjct: 423 TQLLDSAV-SMYPRVETLITESTYGNTTDVMPDQQVVYRSFTESINKTLIEGGKVLIPVP 481

Query: 237 SAGRVLELLLILEDYWAEHSL-NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           + GR  E++L++     E  L   PIY    +S ++  ++ S+  ++G  + KS  +   
Sbjct: 482 AVGRAQEIMLVMAKEMREGRLVESPIYIEGMISEASAIHM-SYAHYLGSEVRKSV-SQGI 539

Query: 296 NAFLLKHVTLLINKSELDNA--PDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
           N F  ++ T++    + D+    + P +V+A+   LE G S + F E A + KN ++F  
Sbjct: 540 NPFQSEYFTVISGHGKRDDVLNDENPAIVMATSGMLEGGPSVEYFKELAPNPKNKIMFVS 599

Query: 354 RGQFGTLARML 364
               GTL R +
Sbjct: 600 YQINGTLGRRV 610


>gi|308162204|gb|EFO64613.1| Cleavage and polyadenylation specificity factor, 73 kDa subunit
           [Giardia lamblia P15]
          Length = 737

 Score =  105 bits (263), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 107/404 (26%), Positives = 181/404 (44%), Gaps = 62/404 (15%)

Query: 5   VQVTPLSGVFNENP-----LSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA------- 52
           V++TPL G  NE       LSY  S    + ++DCG +    P+L +    VA       
Sbjct: 7   VKLTPL-GAGNEVGRSCFILSYQRSGCSGSIMLDCGLH----PALSETRDYVAIQALPFF 61

Query: 53  ------STIDAVLLSHPDTLHLGALPYAMKQLGLSA------------PVFSTEPVYRLG 94
                 ST+  +L++H    H+ ALPY ++ L   A            P++ T P  ++ 
Sbjct: 62  DLEDYVSTLSLILITHFHNDHIAALPYLLRCLRDRAVKEGKPELHYIPPIYMTAPTLKIF 121

Query: 95  LLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHL 154
             ++ D       +S+  L+T +D+D   ++   LT   +++ + +  GI      AGH+
Sbjct: 122 KESVTDV------ISQTKLYTHEDVDFMAKNTKLLT---SFYQTERVSGISFTAMPAGHV 172

Query: 155 LGGTVWKITKDGEDVIYAVDYNRRKE-KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQ 213
           +G  ++ I+ D    +Y  D++   E +HL            ++I   Y  +  Q  R  
Sbjct: 173 IGAAMFHISIDNFHALYTGDFSCEPEDRHLQPATFPQVKLDLLIIESTYGTI-RQKERMT 231

Query: 214 REM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS---LNYPIYFLTYVSS 269
           RE  F D I  T++  G VLLPV S GRV ELL IL++YW EH        IY+++ ++ 
Sbjct: 232 RERDFIDLIVSTVKKDGCVLLPVFSIGRVQELLCILQEYWREHEQEMARITIYYVSAIAD 291

Query: 270 STIDYV---KSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASM 326
           +        K FL   GD+     +T +      +   ++  K+   N P  P ++  + 
Sbjct: 292 NARQLYSKDKGFLRH-GDTGLSDIQTGK------RKDKIIYTKTRPKN-PKKPYVMFCTP 343

Query: 327 ASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLA-RMLQADPP 369
             L++G S +++ E      NL+L T      TL  ++L+  PP
Sbjct: 344 GMLQSGVSKEMYNELCGSPDNLLLVTGYATQDTLLYKLLEGKPP 387


>gi|332159620|ref|YP_004424899.1| mRNA 3'-end processing factor [Pyrococcus sp. NA2]
 gi|331035083|gb|AEC52895.1| mRNA 3'-end processing factor, putative [Pyrococcus sp. NA2]
          Length = 651

 Score =  105 bits (263), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 101/405 (24%), Positives = 175/405 (43%), Gaps = 42/405 (10%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWN-----------DHFDPSLLQPLSKVAS 53
           +++T L G       + LV  D    L+D G N            HFD    Q + +   
Sbjct: 189 IRITGLGGFREVGRSALLVQTDESFVLVDFGINVAALNDPYKAFPHFDAPEFQYVLR-EG 247

Query: 54  TIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDL 113
            +DA++++H    H G LPY  +      P+++T P   L +L   D    ++   +  L
Sbjct: 248 LLDAIIITHAHLDHSGMLPYLFRYNLFDGPIYTTPPTRDLMVLLQKDFIEIQQSNGQDPL 307

Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTV--WKITKDGEDVIY 171
           +   DI    +    L Y +   +S     I +  H AGH+LG  +    I     ++  
Sbjct: 308 YRPRDIKEVIKHTITLDYGEVRDIS---PDIRLTLHNAGHILGSAIVHLHIGNGLHNIAI 364

Query: 172 AVDYNRRKEKHLNGTVLE----SFVRPAVLITDAYNALHN--QPPRQQRE-MFQDAISKT 224
             D+     K +   +LE     F R   L+ ++     N  Q PR++ E    + I KT
Sbjct: 365 TGDF-----KFIPTRLLEPANARFPRLETLVMESTYGGSNDIQMPREEAEKRLIEVIHKT 419

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGD 284
           ++ GG VL+P  + GR  E++++LE+Y     ++ PIY    +  +T  +  ++ E++  
Sbjct: 420 IKRGGKVLIPAMAVGRAQEVMMVLEEYARIGGIDVPIYLDGMIWEATAIHT-AYPEYLSR 478

Query: 285 SITKSFETSRDNAFLLKHVTLLINKSELDNAPDG--PKLVLASMASLEAGFSHDIFVEWA 342
            + +       N FL +    + N  E  +  D   P +++AS   L  G S + F + A
Sbjct: 479 RLREQIFKEGYNPFLSEIFHPVANSRERQDIIDSNEPAIIIASSGMLVGGPSVEYFKQLA 538

Query: 343 SDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEE 387
            D KN ++F      GTL R +Q+           +R +P++GEE
Sbjct: 539 PDPKNSIIFVSYQAEGTLGRQVQSG----------AREIPMIGEE 573


>gi|124809291|ref|XP_001348538.1| cleavage and polyadenylation specificity factor protein, putative
           [Plasmodium falciparum 3D7]
 gi|23497434|gb|AAN36977.1| cleavage and polyadenylation specificity factor protein, putative
           [Plasmodium falciparum 3D7]
          Length = 876

 Score =  105 bits (262), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 97/439 (22%), Positives = 185/439 (42%), Gaps = 67/439 (15%)

Query: 3   TSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLL 60
           +++ +  L G         ++  D  + ++DCG +  F      P+      S +D  L+
Sbjct: 2   SNINIVCLGGASEVGRSCVIIECDKTSVMLDCGIHPAFMGIGCLPIYDAYDISKVDLCLI 61

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRL--------------------------- 93
           +H    H GALPY + +      +F TE    +                           
Sbjct: 62  THFHMDHSGALPYLINKTRFKGRIFMTEATKSICYLLWNDYARIEKYMNVVNKNKLSKNK 121

Query: 94  -----------GLLTMYDQYLSRRQVSEFD---------------LFTLDDIDSAFQSVT 127
                      G + + ++Y S   + +                 L+  +DID     + 
Sbjct: 122 KGGEDDNGLNNGNMLLSNEYSSDENIDDNGDVYENNDNGDGNSNVLYDENDIDKTMDLIE 181

Query: 128 RLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTV 187
            L + QN+        +    + AGH++G  ++ +  +    +Y  DY+R  ++H+    
Sbjct: 182 TLNFHQNFEFPN----VKFTAYRAGHVIGACMFLVEINNIRFLYTGDYSREIDRHIPIAE 237

Query: 188 LESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLL 246
           + + +   VLI +    +     R++RE+ F + ++  +   G VLLPV + GR  ELLL
Sbjct: 238 IPN-IDVHVLICEGTYGIKVHDDRKKREIRFLNILTSMINNKGKVLLPVFALGRAQELLL 296

Query: 247 ILEDYW--AEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVT 304
           ILE++W   +H  N PI++++ +++ ++   ++F+   G+ + K     + N F  K+V 
Sbjct: 297 ILEEHWDKNKHLQNIPIFYISSMATKSLCIYETFINLCGEFVKKVVNEGK-NPFNFKYVK 355

Query: 305 L---LINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLA 361
               L + S      + P +++AS   L+ G S +IF   ASD K+ V+ T     GTLA
Sbjct: 356 YAKSLESISSYLYQDNNPCVIMASPGMLQNGISKNIFNIIASDKKSGVILTGYTVKGTLA 415

Query: 362 RMLQADPPPKAVKVTMSRR 380
             L+ +P    +   + +R
Sbjct: 416 DELKTEPEFVTINDKVVKR 434


>gi|261206112|ref|XP_002627793.1| cleavage and polyadenylylation specificity factor [Ajellomyces
           dermatitidis SLH14081]
 gi|239592852|gb|EEQ75433.1| cleavage and polyadenylylation specificity factor [Ajellomyces
           dermatitidis SLH14081]
          Length = 983

 Score =  105 bits (262), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 106/431 (24%), Positives = 172/431 (39%), Gaps = 101/431 (23%)

Query: 8   TPLSGV--FNENPLSYLVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPD 64
           TPL G    +   +  ++ +DG    L+D GW++ FD S L  L      +   LL    
Sbjct: 5   TPLLGAQSSSSRAVQSILELDGGVKILVDVGWDESFDVSALAELENPVIALGRTLL---- 60

Query: 65  TLHLGALPYAMKQLGLSAPVFST-EPVYRLGLLTMYDQYLSRRQVSEFDLFTLD------ 117
                      ++L  SAP+ +T  P    G L+     + +R     D   +D      
Sbjct: 61  -----------QELYASAPLAATFLPKATSGDLSPPSP-VPKRATRSADTTNVDHDEPPG 108

Query: 118 ---------DIDSAFQSVTRLTYSQNYHLSGKG-----EGIVVAPHVAGHLLGGTVWKIT 163
                    +I   F  +  L YSQ +            G+ +  + AGH +GGT+W I 
Sbjct: 109 ILLPPPTSEEIARYFSLIHPLKYSQPHQPLPSPFSPPLNGLTLTAYNAGHTVGGTIWHIQ 168

Query: 164 KDGEDVIYAVDYNRRKEKHLNGT------------VLESFVRPAVLI--TDAYNALHNQP 209
              E +IYAVD+N+ +E  + G             V+E   +P   +  T   + L    
Sbjct: 169 HGMESIIYAVDWNQARENVIAGAAWFGGSGASGTEVVEQLRKPTAFVCSTRGGDKLSLLG 228

Query: 210 PRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS---------LNY 259
            R++R ++  D I  +   GG VL+P D++ R LEL  +LE  W E +          + 
Sbjct: 229 GRKRRDDLLLDMIRSSFSKGGTVLIPTDTSARALELAYVLEHAWRESAETADGADPLKSG 288

Query: 260 PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSR------------------------- 294
            +Y     +  T+   +S LEWM + I + FE                            
Sbjct: 289 ALYLAGKKAHGTMRLTRSMLEWMDEGIVREFEAGHGDPVAVSGKGRQDGPSQRNPLTGMP 348

Query: 295 ----DNA------FLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHDIFVEWA 342
               D A      F  K++ ++  K++LD     + PK++L S  SL+ G+S  +    A
Sbjct: 349 DKRGDGAFKALGPFTFKYLKIVERKAKLDKILGSNTPKVILTSDTSLDWGYSKHVLQNIA 408

Query: 343 SDVKNLVLFTE 353
           +  +NLV+ TE
Sbjct: 409 TGSENLVILTE 419



 Score = 47.8 bits (112), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 47/173 (27%), Positives = 69/173 (39%), Gaps = 54/173 (31%)

Query: 558 VYTPQIEETIDVTSDLCAYKVQLSEKLMSNV------------LFKKLGDYEIAWVDAEV 605
           ++TP I ET+D + D  A+ V+LS  L+  +            L  +L   E+   D + 
Sbjct: 786 IFTPVIGETVDASVDTNAWMVKLSSALVKRLKWQNVRSLGVVALTGELRAPELTAADEDA 845

Query: 606 GKTENGMLSLLPISTPAP---------PHKSVL----------------------VGDLK 634
            +       LLP + P+          P K+ L                      VGDL+
Sbjct: 846 PEVSQKKQRLLPDNAPSTGGNEQKQLVPSKNALPLLDVLPVKMAAATRSVTRALHVGDLR 905

Query: 635 MADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEG 686
           +ADL+  + S G   EF G G L    +V +RK          SGT +I IEG
Sbjct: 906 LADLRKLMQSSGHTAEFRGEGTLLIDGFVAVRK----------SGTGKIEIEG 948


>gi|159111399|ref|XP_001705931.1| Cleavage and polyadenylation specificity factor, 73 kDa subunit
           [Giardia lamblia ATCC 50803]
 gi|157434022|gb|EDO78257.1| Cleavage and polyadenylation specificity factor, 73 kDa subunit
           [Giardia lamblia ATCC 50803]
          Length = 757

 Score =  105 bits (261), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 107/404 (26%), Positives = 181/404 (44%), Gaps = 62/404 (15%)

Query: 5   VQVTPLSGVFNENP-----LSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA------- 52
           V++TPL G  NE       LSY  S    + ++DCG +    P+L +    VA       
Sbjct: 29  VKLTPL-GAGNEVGRSCFILSYQRSGCSGSIMLDCGLH----PALSETRDYVAIQALPFF 83

Query: 53  ------STIDAVLLSHPDTLHLGALPYAMKQLGLSA------------PVFSTEPVYRLG 94
                 ST+  +L++H    H+ ALPY ++ L   A            PV+ T P  ++ 
Sbjct: 84  DLEDYVSTLSLILITHFHNDHIAALPYLLRCLRDRAVKEGKPELHYIPPVYMTAPTLKIF 143

Query: 95  LLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHL 154
             ++ D       +S+  L+T +D++   ++   LT   +++ + +  GI      AGH+
Sbjct: 144 KESVTDV------ISQTKLYTHEDVEFMAKNTKLLT---SFYQTERVNGISFTAMPAGHV 194

Query: 155 LGGTVWKITKDGEDVIYAVDYNRRKE-KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQ 213
           +G  ++ I+ D    +Y  D++   E +HL            ++I   Y  +  Q  R  
Sbjct: 195 IGAAMFHISIDNFHALYTGDFSCEPEDRHLQPATFPQVKLDLLIIESTYGTIR-QKERMT 253

Query: 214 REM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS---LNYPIYFLTYVSS 269
           RE  F D I  T++  G VLLPV S GRV ELL IL++YW EH        IY+++ ++ 
Sbjct: 254 RERDFIDLIVSTVKKDGCVLLPVFSIGRVQELLCILQEYWREHEQEMARVTIYYVSAIAD 313

Query: 270 STIDYV---KSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASM 326
           +        K FL   GD+     +T +      +   ++  K+   N P  P ++  + 
Sbjct: 314 NARQLYSKDKGFLRH-GDTGLSDIQTGK------RKDRIIYTKTRPKN-PKKPYVMFCTP 365

Query: 327 ASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLA-RMLQADPP 369
             L++G S +++ E      NL+L T      TL  ++L+  PP
Sbjct: 366 GMLQSGVSKEMYNELCGSPDNLLLVTGYATQDTLLYKLLEGKPP 409


>gi|337284211|ref|YP_004623685.1| mRNA 3'-end processing factor [Pyrococcus yayanosii CH1]
 gi|334900145|gb|AEH24413.1| mRNA 3'-end processing factor, putative [Pyrococcus yayanosii CH1]
          Length = 648

 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 102/404 (25%), Positives = 173/404 (42%), Gaps = 42/404 (10%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWN-----------DHFDPSLLQPLSKVAS 53
           +++T L G       + LV  D    L+D G N            HFD    Q + K   
Sbjct: 186 IRITGLGGFREVGRSALLVQTDESYVLVDFGVNVAAMNDPYKAFPHFDAPEFQYVLK-EG 244

Query: 54  TIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDL 113
            +DA++++H    H G LPY  +      P+++T P   L +L   D    ++   +  L
Sbjct: 245 LLDAIIITHAHLDHSGMLPYLFRYNLFDGPIYTTPPTRDLMVLLQKDFIEIQQSNGQEPL 304

Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTV--WKITKDGEDVIY 171
           +   DI    +    L Y +   +S     I +  H AGH+LG  +    I     ++  
Sbjct: 305 YRPRDIKEVIKHTITLDYGEVRDIS---PDIRLTLHNAGHILGSAIVHLHIGNGLHNIAV 361

Query: 172 AVDYNRRKEKHLNGTVLE----SFVRPAVLITDAYNALHN--QPPRQQRE-MFQDAISKT 224
             D+     K +   +LE     F R   L+ +A     N  Q PR++ E    + I +T
Sbjct: 362 TGDF-----KFIPTRLLEPANARFPRLETLVMEATYGGSNDIQMPREEAEKRLIEVIHRT 416

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGD 284
           ++ GG VL+P  + GR  E++++LE+Y     ++ PIY    +  +T  +  ++ E++  
Sbjct: 417 IKRGGKVLIPAMAVGRAQEVMMVLEEYARIGGIDVPIYLDGMIWEATAIHT-AYPEYLSK 475

Query: 285 SITKSFETSRDNAFLLKHVTLLINKSELDNAPDG--PKLVLASMASLEAGFSHDIFVEWA 342
            + +       N FL +    + N  E  +  D   P +++AS   L  G S + F + A
Sbjct: 476 RLREQIFHEGYNPFLNEVFKPVANSRERQDIIDSNEPAIIIASSGMLVGGPSVEYFKQLA 535

Query: 343 SDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGE 386
            D KN ++F      GTL R +Q            +R +P+VGE
Sbjct: 536 PDPKNSMIFVSYQAEGTLGRQVQNG----------AREIPMVGE 569


>gi|253742053|gb|EES98907.1| Cleavage and polyadenylation specificity factor, 73 kDa subunit
           [Giardia intestinalis ATCC 50581]
          Length = 757

 Score =  103 bits (258), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 107/404 (26%), Positives = 182/404 (45%), Gaps = 62/404 (15%)

Query: 5   VQVTPLSGVFNENP-----LSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA------- 52
           V+VTPL G  NE       LSY  S    + ++DCG +    P+L +    VA       
Sbjct: 29  VKVTPL-GAGNEVGRSCFILSYQRSGCSGSIMLDCGLH----PALSETRDYVAIQALPFF 83

Query: 53  ------STIDAVLLSHPDTLHLGALPYAMKQLGLSA------------PVFSTEPVYRLG 94
                 + +  +L++H    H+ ALPY ++ L   A            PV+ T P  ++ 
Sbjct: 84  DLEDYVANLSLILITHFHNDHIAALPYLLRCLRDRAVKEGKPELHYIPPVYMTAPTLKIF 143

Query: 95  LLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHL 154
             ++ D       +S+  L+T +D++   ++   LT   +++ + +  G+      AGH+
Sbjct: 144 KESVADV------ISQTKLYTHEDVEFMAKNTRLLT---SFYQTERVSGVSFTAMPAGHV 194

Query: 155 LGGTVWKITKDGEDVIYAVDYNRRKE-KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQ 213
           +G  ++ I+ D    +Y  D++   E +HL        VR  +LI ++      Q  R  
Sbjct: 195 IGAAMFHISIDNFHALYTGDFSCEPEDRHLQPATFPQ-VRLDLLIIESTYGTIRQKERMT 253

Query: 214 REM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS---LNYPIYFLTYVSS 269
           RE  F D I  T++  G VLLPV S GRV ELL IL++YW EH        IY+++ ++ 
Sbjct: 254 RERDFIDLIVSTVKKDGCVLLPVFSIGRVQELLCILQEYWREHEQEMARVTIYYVSAIAD 313

Query: 270 STIDYV---KSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASM 326
           +        K FL   GD+     +T +      +   ++  K+   N P  P ++  + 
Sbjct: 314 NARQLYSKDKGFLRH-GDTGLSDIQTGK------RKDRIIYTKTRPKN-PKKPYVMFCTP 365

Query: 327 ASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLA-RMLQADPP 369
             L++G S +++ E      NL+L T      TL  ++L+  PP
Sbjct: 366 GMLQSGVSKEMYNELCGSPDNLLLVTGYATQDTLLYKLLEGKPP 409


>gi|430813249|emb|CCJ29377.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 574

 Score =  103 bits (258), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 68/239 (28%), Positives = 123/239 (51%), Gaps = 20/239 (8%)

Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 193
           +YH + +  G+   P+ AGH+LG  ++ I   G  +++  DY+R +++HL    +   ++
Sbjct: 31  DYHSTIEVNGVKFTPYHAGHVLGAAMFFIEVAGIKILFTGDYSREEDRHLIPAEVPP-IQ 89

Query: 194 PAVLITDA-YNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 252
           P +LIT++ Y    +QP  ++       I   +R GG VL+PV + GR  EL+LI+++YW
Sbjct: 90  PDILITESTYGTASHQPISEKESRLTSIIHSIIRRGGRVLIPVFALGRTQELMLIIDEYW 149

Query: 253 AEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKS 310
             H    + P+Y+   ++   +   +          TK FE    N F+ ++++ L    
Sbjct: 150 HNHPELHSIPVYYACSLAKKCMTVYQ----------TKIFE--ERNPFIFRYISSL---K 194

Query: 311 ELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
            LD   D GP ++LAS   L++G S  +  +W  D KN ++       GT+A+ +  +P
Sbjct: 195 SLDRFEDIGPCVMLASPGMLQSGVSRALLEKWCPDPKNGLIVAGYCVEGTMAKHILNEP 253


>gi|358060736|dbj|GAA93507.1| hypothetical protein E5Q_00148 [Mixia osmundae IAM 14324]
          Length = 1378

 Score =  103 bits (257), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 84/332 (25%), Positives = 157/332 (47%), Gaps = 18/332 (5%)

Query: 54  TIDAVLLSHPDTLHLGALPYAMKQL------GLSAPVFSTEPVYRLGLLTMYDQYLSRRQ 107
           T+DA+L++H    H   LPY M++       G      +T+ VY L +       +    
Sbjct: 88  TVDAILVTHFHLDHAAGLPYIMEKTNFKDGGGRVYMTHATKDVYELLMQDFVRISIIEGT 147

Query: 108 VSEFDLFTLDDIDSAFQSVTRLTYSQNYHL--SGKGEGIVV--APHVAGHLLGGTVWKIT 163
            +   +   ++++++ +++  + + +   +  S K     V    + AGH+LG +++ I 
Sbjct: 148 DTSQRIMDAENLEASLETIQGIRFYEEVTIPISSKRSTTSVRFTSYPAGHVLGASMFLIE 207

Query: 164 KDGEDVIYAVDYNRRKEKHLNGTVLESF--VRPAVLITDAYNALHNQPPRQQRE-MFQDA 220
             G  V+Y  DY+   + HL    + ++   RP V+I ++   + +  P+  RE  F + 
Sbjct: 208 IGGARVLYTGDYSTEADMHLIPASVPTWGGKRPDVMICESTFGVQSFEPKAIREAQFTNK 267

Query: 221 ISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSF 278
           I   L+ GG VLLP  S+G   ELLL+L+D+W ++     +PIY++T ++S  +   +  
Sbjct: 268 IKTILKRGGKVLLPAFSSGVSQELLLVLDDFWEKNPDLHEFPIYYVTSLASRVLKVYRQH 327

Query: 279 LEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDN--APDGPKLVLASMASLEAGFSHD 336
           +      I +    S DN +       +     +    A   P +V+A+   L+ G S +
Sbjct: 328 ISSQSQKIQQR-AASGDNPYDFGKGRFVKELRSIRRGVADKSPCVVVATPGMLQPGTSRE 386

Query: 337 IFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
           +   WA D +N ++       G+LAR LQA+P
Sbjct: 387 LLERWAGDRRNGLILCGYSVEGSLARDLQAEP 418


>gi|237842097|ref|XP_002370346.1| RNA-metabolising metallo-beta-lactamase domain-containing protein
           [Toxoplasma gondii ME49]
 gi|211968010|gb|EEB03206.1| RNA-metabolising metallo-beta-lactamase domain-containing protein
           [Toxoplasma gondii ME49]
          Length = 1089

 Score =  103 bits (257), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 72/247 (29%), Positives = 115/247 (46%), Gaps = 17/247 (6%)

Query: 146 VAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNAL 205
           + P  AGH+LG  ++++      V+Y  D+N   ++HL    L   +RP VLI++   A 
Sbjct: 295 LTPFYAGHVLGAAMFELKLGKASVVYTGDFNTIPDRHLGSAALPC-LRPDVLISECTYAS 353

Query: 206 HNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFL 264
             +P ++  E  F   +  TL  GG VL+PV + GR  EL ++LE+YW    L +PIYF 
Sbjct: 354 FVRPSKRTVERDFCAVVHDTLTKGGKVLIPVFAVGRAQELCMLLENYWERMHLRFPIYFA 413

Query: 265 TYVSSSTIDYVKSFLEW--MGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLV 322
             ++     Y + ++ W     ++    E +   AF   H+  L  +S L +AP  P ++
Sbjct: 414 GGMTERANAYYRLYVHWSKADANVDADPEDALRTAFSFPHI--LPFQSSLLSAPT-PLVL 470

Query: 323 LASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVP 382
           LA+   L  G +      W  D   LVL       GT+  ML          +   R++P
Sbjct: 471 LATPGMLHGGLALKALKAWGGDPATLVLLPGYCVRGTVGAML----------IAGQRQIP 520

Query: 383 LVGEELI 389
           L G   +
Sbjct: 521 LDGHATL 527


>gi|221482308|gb|EEE20663.1| RNA-metabolising metallo-beta-lactamase domain-containing protein,
           putative [Toxoplasma gondii GT1]
          Length = 1090

 Score =  103 bits (257), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 72/247 (29%), Positives = 115/247 (46%), Gaps = 17/247 (6%)

Query: 146 VAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNAL 205
           + P  AGH+LG  ++++      V+Y  D+N   ++HL    L   +RP VLI++   A 
Sbjct: 303 LTPFYAGHVLGAAMFELKLGKASVVYTGDFNTIPDRHLGSAALPC-LRPDVLISECTYAS 361

Query: 206 HNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFL 264
             +P ++  E  F   +  TL  GG VL+PV + GR  EL ++LE+YW    L +PIYF 
Sbjct: 362 FVRPSKRTVERDFCAVVHDTLTKGGKVLIPVFAVGRAQELCMLLENYWERMHLRFPIYFA 421

Query: 265 TYVSSSTIDYVKSFLEW--MGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLV 322
             ++     Y + ++ W     ++    E +   AF   H+  L  +S L +AP  P ++
Sbjct: 422 GGMTERANAYYRLYVHWSKADANVDADPEDALRTAFSFPHI--LPFQSSLLSAPT-PLVL 478

Query: 323 LASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVP 382
           LA+   L  G +      W  D   LVL       GT+  ML          +   R++P
Sbjct: 479 LATPGMLHGGLALKALKAWGGDPATLVLLPGYCVRGTVGAML----------IAGQRQIP 528

Query: 383 LVGEELI 389
           L G   +
Sbjct: 529 LDGHATL 535


>gi|14591202|ref|NP_143278.1| mRNA 3'-end processing factor [Pyrococcus horikoshii OT3]
 gi|294979445|pdb|3AF5|A Chain A, The Crystal Structure Of An Archaeal Cpsf Subunit, Ph1404
           From Pyrococcus Horikoshii
 gi|294979446|pdb|3AF6|A Chain A, The Crystal Structure Of An Archaeal Cpsf Subunit, Ph1404
           From Pyrococcus Horikoshii Complexed With Rna-Analog
 gi|3257827|dbj|BAA30510.1| 651aa long hypothetical protein [Pyrococcus horikoshii OT3]
          Length = 651

 Score =  103 bits (256), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 101/405 (24%), Positives = 172/405 (42%), Gaps = 42/405 (10%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWN-----------DHFDPSLLQPLSKVAS 53
           +++T L G       + LV  D    L+D G N            HFD    Q + +   
Sbjct: 189 IRITGLGGFREVGRSALLVQTDESFVLVDFGVNVAMLNDPYKAFPHFDAPEFQYVLR-EG 247

Query: 54  TIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDL 113
            +DA++++H    H G LPY  +      P+++T P   L +L   D    ++   +  L
Sbjct: 248 LLDAIIITHAHLDHCGMLPYLFRYNLFDGPIYTTPPTRDLMVLLQKDFIEIQQSNGQDPL 307

Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTV--WKITKDGEDVIY 171
           +   DI    +    L Y +   +S     I +  H AGH+LG  +    I     ++  
Sbjct: 308 YRPRDIKEVIKHTITLDYGEVRDIS---PDIRLTLHNAGHILGSAIVHLHIGNGLHNIAI 364

Query: 172 AVDYNRRKEKHLNGTVLE----SFVRPAVLITDAYNALHN--QPPRQQRE-MFQDAISKT 224
             D+     K +   +LE     F R   L+ ++     N  Q PR++ E    + I  T
Sbjct: 365 TGDF-----KFIPTRLLEPANAKFPRLETLVMESTYGGANDIQMPREEAEKRLIEVIHNT 419

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGD 284
           ++ GG VL+P  + GR  E++++LE+Y     +  PIY    +  +T  +  ++ E++  
Sbjct: 420 IKRGGKVLIPAMAVGRAQEVMMVLEEYARIGGIEVPIYLDGMIWEATAIHT-AYPEYLSR 478

Query: 285 SITKSFETSRDNAFLLKHVTLLINKSELDNAPDG--PKLVLASMASLEAGFSHDIFVEWA 342
            + +       N FL +    + N  E  +  D   P +++AS   L  G S + F + A
Sbjct: 479 RLREQIFKEGYNPFLSEIFHPVANSRERQDIIDSNEPAIIIASSGMLVGGPSVEYFKQLA 538

Query: 343 SDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEE 387
            D KN ++F      GTL R +Q+            R +P+VGEE
Sbjct: 539 PDPKNSIIFVSYQAEGTLGRQVQSG----------IREIPMVGEE 573


>gi|221502797|gb|EEE28511.1| RNA-metabolising metallo-beta-lactamase domain-containing protein,
           putative [Toxoplasma gondii VEG]
          Length = 1072

 Score =  103 bits (256), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 72/247 (29%), Positives = 115/247 (46%), Gaps = 17/247 (6%)

Query: 146 VAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNAL 205
           + P  AGH+LG  ++++      V+Y  D+N   ++HL    L   +RP VLI++   A 
Sbjct: 295 LTPFYAGHVLGAAMFELKLGKASVVYTGDFNTIPDRHLGSAALPC-LRPDVLISECTYAS 353

Query: 206 HNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFL 264
             +P ++  E  F   +  TL  GG VL+PV + GR  EL ++LE+YW    L +PIYF 
Sbjct: 354 FVRPSKRTVERDFCAVVHDTLTKGGKVLIPVFAVGRAQELCMLLENYWERMHLRFPIYFA 413

Query: 265 TYVSSSTIDYVKSFLEW--MGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLV 322
             ++     Y + ++ W     ++    E +   AF   H+  L  +S L +AP  P ++
Sbjct: 414 GGMTERANAYYRLYVHWSKADANVDADPEDALRTAFSFPHI--LPFQSSLLSAPT-PLVL 470

Query: 323 LASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVP 382
           LA+   L  G +      W  D   LVL       GT+  ML          +   R++P
Sbjct: 471 LATPGMLHGGLALKALKAWGGDPATLVLLPGYCVRGTVGAML----------IAGQRQIP 520

Query: 383 LVGEELI 389
           L G   +
Sbjct: 521 LDGHATL 527


>gi|389852761|ref|YP_006354995.1| mRNA 3'-end processing factor [Pyrococcus sp. ST04]
 gi|388250067|gb|AFK22920.1| putative mRNA 3'-end processing factor [Pyrococcus sp. ST04]
          Length = 651

 Score =  102 bits (254), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 100/404 (24%), Positives = 172/404 (42%), Gaps = 42/404 (10%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWN-----------DHFDPSLLQPLSKVAS 53
           +++T L G       + LV  D    L+D G N            HFD    Q + K   
Sbjct: 189 IRITGLGGFREVGRSALLVQTDESFVLVDFGVNVAAMNDPYKAFPHFDAPEFQYVLK-EG 247

Query: 54  TIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDL 113
            +DA++++H    H G LPY  +      P+++T P   L +L   D    ++   +  L
Sbjct: 248 LLDAIIITHAHLDHSGMLPYLFRYNLFDGPIYTTPPTRDLMVLLQKDFIEIQQSNGQDPL 307

Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTV--WKITKDGEDVIY 171
           +   DI    +    L Y +   +S     I +  H AGH+LG  +    I     ++  
Sbjct: 308 YRPKDIKEVIKHTITLDYGEVRDIS---PDIRLTLHNAGHILGSAIVHLHIGNGLHNIAI 364

Query: 172 AVDYNRRKEKHLNGTVLE----SFVRPAVLITDAYNALHN--QPPRQQRE-MFQDAISKT 224
             D+     K +   +LE     F R   L+ ++     N  Q PR++ E    + I  T
Sbjct: 365 TGDF-----KFIPTKLLEPANAKFPRLETLVMESTYGGSNDIQMPREEAEKRLIEVIHHT 419

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGD 284
           ++ GG VL+P  + GR  E++++LE+Y     ++ PIY    +  +T  +  ++ E++  
Sbjct: 420 IKRGGKVLIPAMAVGRAQEVMMVLEEYARIGGIDAPIYLDGMIWEATAIHT-AYPEYLSR 478

Query: 285 SITKSFETSRDNAFLLKHVTLLINKSELDNAPDG--PKLVLASMASLEAGFSHDIFVEWA 342
            + +       N FL +    + N  E  +  D   P +++AS   L  G S + F + A
Sbjct: 479 RLREQIFKEGYNPFLSEIFHPVANSRERQDIIDSKEPAIIIASSGMLVGGPSVEYFKQLA 538

Query: 343 SDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGE 386
            D KN ++F      GTL R +Q            +R +P++GE
Sbjct: 539 PDPKNAIIFVSYQAEGTLGRQVQNG----------AREIPMIGE 572


>gi|297737628|emb|CBI26829.3| unnamed protein product [Vitis vinifera]
          Length = 686

 Score =  102 bits (253), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 67/227 (29%), Positives = 109/227 (48%), Gaps = 7/227 (3%)

Query: 146 VAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNAL 205
           +  + AGH+LG  ++        ++Y  DYN   ++HL    ++  ++  +LIT++  A 
Sbjct: 204 IRAYYAGHVLGAAMFYAKVGDAAMVYTGDYNMTPDRHLGAAQIDR-LQLDLLITESTYAT 262

Query: 206 HNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFL 264
             +  +  RE  F  A+ K +  GG VL+P  + GR  EL ++L++YW   +L  PIYF 
Sbjct: 263 TVRDSKYAREREFLKAVHKCVADGGKVLIPTFALGRAQELCILLDNYWERMNLKVPIYFS 322

Query: 265 TYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLA 324
             ++     Y K  + W    + +++ T   NAF  K+V        L NAP GP ++ A
Sbjct: 323 AGLTIQANMYYKMLISWTNQRVKETYATH--NAFDFKNVRSF--DRSLINAP-GPCVLFA 377

Query: 325 SMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPK 371
           +   +  GFS ++F  WA    NLV        GT+   L    P K
Sbjct: 378 TPGMISGGFSLEVFKLWAPSEMNLVTLPGYCLAGTIGHKLTTGKPTK 424



 Score = 44.3 bits (103), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 40/73 (54%), Gaps = 7/73 (9%)

Query: 22 LVSIDGFNFLIDCGWN----DHF---DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
          +V+I+G   + DCG +    DH    D SL+   +   + ID ++++H    H+GALPY 
Sbjct: 20 VVTINGKRIMFDCGMHMGYLDHRRFPDFSLISKSADFNTAIDCIVITHFHLDHVGALPYF 79

Query: 75 MKQLGLSAPVFST 87
           +  G S P++ T
Sbjct: 80 TEVCGYSGPIYMT 92


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.317    0.135    0.397 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 11,671,394,138
Number of Sequences: 23463169
Number of extensions: 510087625
Number of successful extensions: 1343620
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1170
Number of HSP's successfully gapped in prelim test: 2034
Number of HSP's that attempted gapping in prelim test: 1334060
Number of HSP's gapped (non-prelim): 4972
length of query: 706
length of database: 8,064,228,071
effective HSP length: 150
effective length of query: 556
effective length of database: 8,839,720,017
effective search space: 4914884329452
effective search space used: 4914884329452
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 81 (35.8 bits)