BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 004656
         (739 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q9LKF9|CPSF2_ARATH Cleavage and polyadenylation specificity factor subunit 2
           OS=Arabidopsis thaliana GN=CPSF100 PE=1 SV=2
          Length = 739

 Score = 1231 bits (3185), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 592/743 (79%), Positives = 669/743 (90%), Gaps = 8/743 (1%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQVTPL GV+NENPLSYLVSIDGFNFLIDCGWND FD SLL+PLS+VASTIDAVLL
Sbjct: 1   MGTSVQVTPLCGVYNENPLSYLVSIDGFNFLIDCGWNDLFDTSLLEPLSRVASTIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPDTLH+GALPYAMKQLGLSAPV++TEPV+RLGLLTMYDQ+LSR+QVS+FDLFTLDDID
Sbjct: 61  SHPDTLHIGALPYAMKQLGLSAPVYATEPVHRLGLLTMYDQFLSRKQVSDFDLFTLDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           SAFQ+V RLTYSQNYHLSGKGEGIV+APHVAGH+LGG++W+ITKDGEDVIYAVDYN RKE
Sbjct: 121 SAFQNVIRLTYSQNYHLSGKGEGIVIAPHVAGHMLGGSIWRITKDGEDVIYAVDYNHRKE 180

Query: 181 KHLNGTVLESFVRPAVLITDAYNALH-NQPPRQQREM-FQDAISKTLRAGGNVLLPVDSA 238
           +HLNGTVL+SFVRPAVLITDAY+AL+ NQ  RQQR+  F D ISK L  GGNVLLPVD+A
Sbjct: 181 RHLNGTVLQSFVRPAVLITDAYHALYTNQTARQQRDKEFLDTISKHLEVGGNVLLPVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAF 298
           GRVLELLLILE +W++   ++PIYFLTYVSSSTIDYVKSFLEWM DSI+KSFETSRDNAF
Sbjct: 241 GRVLELLLILEQHWSQRGFSFPIYFLTYVSSSTIDYVKSFLEWMSDSISKSFETSRDNAF 300

Query: 299 LLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFG 358
           LL+HVTLLINK++LDNAP GPK+VLASMASLEAGF+ +IFVEWA+D +NLVLFTE GQFG
Sbjct: 301 LLRHVTLLINKTDLDNAPPGPKVVLASMASLEAGFAREIFVEWANDPRNLVLFTETGQFG 360

Query: 359 TLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASL 418
           TLARMLQ+ PPPK VKVTMS+RVPL GEELIAYEEEQ RLK+EEAL+ASLVKEEE+KAS 
Sbjct: 361 TLARMLQSAPPPKFVKVTMSKRVPLAGEELIAYEEEQNRLKREEALRASLVKEEETKASH 420

Query: 419 GPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEW 478
           G D+N S +PM+ID    +   DV+  HG  Y+DILIDGFVPPS+SVAPMFP+Y+N SEW
Sbjct: 421 GSDDN-SSEPMIIDTKTTH---DVIGSHGPAYKDILIDGFVPPSSSVAPMFPYYDNTSEW 476

Query: 479 DDFGEVINPDDYIIKDEDMDQAAMHIGGD-DGKLDEGSASLILDAKPSKVVSNELTVQVK 537
           DDFGE+INPDDY+IKDEDMD+ AMH GGD DG+LDE +ASL+LD +PSKV+SNEL V V 
Sbjct: 477 DDFGEIINPDDYVIKDEDMDRGAMHNGGDVDGRLDEATASLMLDTRPSKVMSNELIVTVS 536

Query: 538 CLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCLKHVCPHVYTPQIE 597
           C L+ +DYEGR+DGRSIK++++HV+PLKLVLVH  AEATEHLKQHCL ++CPHVY PQIE
Sbjct: 537 CSLVKMDYEGRSDGRSIKSMIAHVSPLKLVLVHAIAEATEHLKQHCLNNICPHVYAPQIE 596

Query: 598 ETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPP 657
           ET+DVTSDLCAYKVQLSEKLMSNV+FKKLGD E+AWVD+EVGKTE  M SLLP+   A P
Sbjct: 597 ETVDVTSDLCAYKVQLSEKLMSNVIFKKLGDSEVAWVDSEVGKTERDMRSLLPMPGAASP 656

Query: 658 HKSVLVGDLKMADLKPFLSSKGIQVEFA-GGALRCGEYVTIRKVGPAGQKGGGSGTQQIV 716
           HK VLVGDLK+AD K FLSSKG+QVEFA GGALRCGEYVT+RKVGP GQKGG SG QQI+
Sbjct: 657 HKPVLVGDLKIADFKQFLSSKGVQVEFAGGGALRCGEYVTLRKVGPTGQKGGASGPQQIL 716

Query: 717 IEGPLCEDYYKIRAYLYSQFYLL 739
           IEGPLCEDYYKIR YLYSQFYLL
Sbjct: 717 IEGPLCEDYYKIRDYLYSQFYLL 739


>sp|Q652P4|CPSF2_ORYSJ Cleavage and polyadenylation specificity factor subunit 2 OS=Oryza
           sativa subsp. japonica GN=Os09g0569400 PE=2 SV=1
          Length = 738

 Score = 1118 bits (2893), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 548/742 (73%), Positives = 634/742 (85%), Gaps = 7/742 (0%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           MGTSVQVTPLSG + E PL YL+++DGF FL+DCGW D  DPS LQPL+KVA TIDAVLL
Sbjct: 1   MGTSVQVTPLSGAYGEGPLCYLLAVDGFRFLLDCGWTDLCDPSHLQPLAKVAPTIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SH DT+HLGALPYAMK LGLSAPV++TEPV+RLG+LT+YD ++SRRQVS+FDLFTLDDID
Sbjct: 61  SHADTMHLGALPYAMKHLGLSAPVYATEPVFRLGILTLYDYFISRRQVSDFDLFTLDDID 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
           +AFQ+V RL YSQN+ L+ KGEGIV+APHVAGH LGGTVWKITKDGEDV+YAVD+N RKE
Sbjct: 121 AAFQNVVRLKYSQNHLLNDKGEGIVIAPHVAGHDLGGTVWKITKDGEDVVYAVDFNHRKE 180

Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQP-PRQQREMFQDAISKTLRAGGNVLLPVDSAG 239
           +HLNGT L SFVRPAVLITDAYNAL+N    RQQ + F DA+ K L  GG+VLLP+D+AG
Sbjct: 181 RHLNGTALGSFVRPAVLITDAYNALNNHVYKRQQDQDFIDALVKVLTGGGSVLLPIDTAG 240

Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
           RVLE+LLILE YWA+  L YPIYFLT VS+ST+DYVKSFLEWM DSI+KSFE +RDNAFL
Sbjct: 241 RVLEILLILEQYWAQRHLIYPIYFLTNVSTSTVDYVKSFLEWMNDSISKSFEHTRDNAFL 300

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
           LK VT +INK EL+   D PK+VLASMASLE GFSHDIFV+ A++ KNLVLFTE+GQFGT
Sbjct: 301 LKCVTQIINKDELEKLGDAPKVVLASMASLEVGFSHDIFVDMANEAKNLVLFTEKGQFGT 360

Query: 360 LARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLG 419
           LARMLQ DPPPKAVKVTMS+R+PLVG+EL AYEEEQ R+KKEEALKASL KEEE KASLG
Sbjct: 361 LARMLQVDPPPKAVKVTMSKRIPLVGDELKAYEEEQERIKKEEALKASLNKEEEKKASLG 420

Query: 420 PDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWD 479
             N  + DPMVIDA+ +   ++     GG   DILIDGFVPPS+SVAPMFPF+EN SEWD
Sbjct: 421 -SNAKASDPMVIDASTSRKPSNAGSKFGGNV-DILIDGFVPPSSSVAPMFPFFENTSEWD 478

Query: 480 DFGEVINPDDYIIKDEDMDQAAMHIGGD--DGKLDEGSASLILDAKPSKVVSNELTVQVK 537
           DFGEVINP+DY++K E+MD   M   GD  D  LDEGSA L+LD+ PSKV+SNE+TVQVK
Sbjct: 479 DFGEVINPEDYLMKQEEMDNTLMPGAGDGMDSMLDEGSARLLLDSTPSKVISNEMTVQVK 538

Query: 538 CLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCLKHVCPHVYTPQIE 597
           C L ++D+EGR+DGRS+K++++HVAPLKLVLVHGSAEATEHLK HC K+   HVY PQIE
Sbjct: 539 CSLAYMDFEGRSDGRSVKSVIAHVAPLKLVLVHGSAEATEHLKMHCSKNSDLHVYAPQIE 598

Query: 598 ETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPP 657
           ETIDVTSDLCAYKVQLSEKLMSNV+ KKLG++EIAWVDAEVGKT++ +  L P STPA  
Sbjct: 599 ETIDVTSDLCAYKVQLSEKLMSNVISKKLGEHEIAWVDAEVGKTDDKLTLLPPSSTPA-A 657

Query: 658 HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVI 717
           HKSVLVGDLK+AD K FL++KG+QVEFAGGALRCGEY+T+RK+G AGQK G +G+QQIVI
Sbjct: 658 HKSVLVGDLKLADFKQFLANKGLQVEFAGGALRCGEYITLRKIGDAGQK-GSTGSQQIVI 716

Query: 718 EGPLCEDYYKIRAYLYSQFYLL 739
           EGPLCEDYYKIR  LYSQFYLL
Sbjct: 717 EGPLCEDYYKIRELLYSQFYLL 738


>sp|Q9V3D6|CPSF2_DROME Probable cleavage and polyadenylation specificity factor subunit 2
           OS=Drosophila melanogaster GN=Cpsf100 PE=1 SV=1
          Length = 756

 Score =  529 bits (1362), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 295/786 (37%), Positives = 453/786 (57%), Gaps = 77/786 (9%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++  +SG  +E+P  Y++ ID    L+DCGW++ FD + ++ L +   T+DAVLL
Sbjct: 1   MTSIIKLHTISGAMDESPPCYILQIDDVRILLDCGWDEKFDANFIKELKRQVHTLDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD  HLGALPY + +LGL+ P+++T PV+++G + MYD Y+S   + +FDLF+LDD+D
Sbjct: 61  SHPDAYHLGALPYLVGKLGLNCPIYATIPVFKMGQMFMYDLYMSHFNMGDFDLFSLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF+ +T+L Y+Q   L  KG GI + P  AGH++GGT+WKI K G ED++YA D+N +K
Sbjct: 121 TAFEKITQLKYNQTVSLKDKGYGISITPLNAGHMIGGTIWKIVKVGEEDIVYATDFNHKK 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HL+G  L+   RP++LITDAYNA + Q  R+ R E     I +T+R  GNVL+ VD+A
Sbjct: 181 ERHLSGCELDRLQRPSLLITDAYNAQYQQARRRARDEKLMTNILQTVRNNGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W       + Y +  L  VS + I++ KS +EWM D +TK+FE +R+
Sbjct: 241 GRVLELAHMLDQLWKNKESGLMAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+ L  + +++   P GPK+VLAS   LE+GF+ D+FV+WAS+  N ++ T R 
Sbjct: 301 NPFQFKHIQLCHSLADVYKLPAGPKVVLASTPDLESGFTRDLFVQWASNANNSIILTTRT 360

Query: 356 QFGTLA-RMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEES 414
             GTLA  +++   P K +++ + RRV L G EL  Y   Q      E L   +VK    
Sbjct: 361 SPGTLAMELVENCAPGKQIELDVRRRVDLEGAELEEYLRTQG-----EKLNPLIVK---- 411

Query: 415 KASLGPDNNLSGDPMV---IDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPF 471
                PD            I+ +      D+V    GR+      GF   +     MFP+
Sbjct: 412 -----PDVEEESSSESEDDIEMSVITGKHDIVVRPEGRHH----SGFFKSNKRHHVMFPY 462

Query: 472 YENNSEWDDFGEVINPDDYIIKD--------------EDMDQAAMHIGGD---DGKLDEG 514
           +E   + D++GE+IN DDY I D              E++ +    IG +   +G + + 
Sbjct: 463 HEEKVKCDEYGEIINLDDYRIADATGYEFVPMEEQNKENVKKEEPGIGAEQQANGGIVDN 522

Query: 515 SASLILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAE 574
              L+   KP+K++S   T++V   +  ID+EGR+DG S+  ILS + P +++++HG+AE
Sbjct: 523 DVQLL--EKPTKLISQRKTIEVNAQVQRIDFEGRSDGESMLKILSQLRPRRVIVIHGTAE 580

Query: 575 ATEHLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWV 634
            T+ + +HC ++V   V+TPQ  E IDVTS++  Y+V+L+E L+S + F+K  D E+AWV
Sbjct: 581 GTQVVARHCEQNVGARVFTPQKGEIIDVTSEIHIYQVRLTEGLVSQLQFQKGKDAEVAWV 640

Query: 635 DAEVGK-------------------TENGMLSLLPIS-TPAPPHKSVLVGDLKMADLKPF 674
           D  +G                     E   L+L  ++    P H SVL+ +LK++D K  
Sbjct: 641 DGRLGMRVKAIEAPMDVTVEQDASVQEGKTLTLETLADDEIPIHNSVLINELKLSDFKQT 700

Query: 675 LSSKGIQVEFAGGALRCGE-YVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLY 733
           L    I  EF+GG L C    + +R+V             ++ +EG L E+YYKIR  LY
Sbjct: 701 LMRNNINSEFSGGVLWCSNGTLALRRVDAG----------KVAMEGCLSEEYYKIRELLY 750

Query: 734 SQFYLL 739
            Q+ ++
Sbjct: 751 EQYAIV 756


>sp|O35218|CPSF2_MOUSE Cleavage and polyadenylation specificity factor subunit 2 OS=Mus
           musculus GN=Cpsf2 PE=1 SV=1
          Length = 782

 Score =  522 bits (1345), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 293/815 (35%), Positives = 454/815 (55%), Gaps = 109/815 (13%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSVDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALP+A+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPFAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPTEKVTEIELRKRVKLEGKELEEYVEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
                    S +  +  ++ ++   DV +P   + + D+++ G       F   +    P
Sbjct: 412 ---------SKEADIDSSDESDVEEDVDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYP 462

Query: 468 MFPFYENNSEWDDFGEVINPDDYIIKD---EDMDQAAMHIGGDDGKLDEGSASLILDAKP 524
           MFP  E   +WD++GE+I P+D+++ +    + +++ +  G  +G   E      L   P
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNG---EEPMDQDLSDVP 519

Query: 525 SKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCL 584
           +K VS   ++++K  + +IDYEGR+DG SIK I++ + P +L++VHG  EA++ L + C 
Sbjct: 520 TKCVSATESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAECCR 579

Query: 585 ----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA---- 636
               K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D     
Sbjct: 580 AFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDM 637

Query: 637 EVGKTENGML-----------------------------------------------SLL 649
            V K + G++                                                ++
Sbjct: 638 RVSKVDTGVILEEGELKDDGEDSEMQVDAPSDSSAMAQQKAMKSLFGEDEKELGEETEII 697

Query: 650 PISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAG 704
           P   P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+     
Sbjct: 698 PTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR----- 752

Query: 705 QKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 739
                + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 -----TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>sp|Q10568|CPSF2_BOVIN Cleavage and polyadenylation specificity factor subunit 2 OS=Bos
           taurus GN=CPSF2 PE=1 SV=1
          Length = 782

 Score =  521 bits (1341), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 292/817 (35%), Positives = 455/817 (55%), Gaps = 113/817 (13%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPSEKVTEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
                    S +  +  ++ ++A  D+ +P   + + D+++ G       F   +    P
Sbjct: 412 ---------SKEADIDSSDESDAEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYP 462

Query: 468 MFPFYENNSEWDDFGEVINPDDYII-----KDEDMDQAAMHIGGDDGKLDEGSASLILDA 522
           MFP  E   +WD++GE+I P+D+++      +E+  +    +   D  +D+  + +    
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDV---- 518

Query: 523 KPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQH 582
            P+K +S   ++++K  + +IDYEGR+DG SIK I++ + P +L++VHG  EA++ L + 
Sbjct: 519 -PTKCISTTESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAEC 577

Query: 583 CL----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA-- 636
           C     K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D   
Sbjct: 578 CRAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVL 635

Query: 637 --EVGKTENGML-----------------------------------------------S 647
              V K + G++                                                
Sbjct: 636 DMRVSKVDTGVILEEGELKDDGEDSEMQVDAPSDSSVIAQQKAMKSLFGDDEKETGEESE 695

Query: 648 LLPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGP 702
           ++P   P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+   
Sbjct: 696 IIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR--- 752

Query: 703 AGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 739
                  + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 -------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>sp|Q9P2I0|CPSF2_HUMAN Cleavage and polyadenylation specificity factor subunit 2 OS=Homo
           sapiens GN=CPSF2 PE=1 SV=2
          Length = 782

 Score =  518 bits (1334), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 291/817 (35%), Positives = 454/817 (55%), Gaps = 113/817 (13%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T LSGV  E+ L YL+ +D F FL+DCGW++HF   ++  L K    IDAVLL
Sbjct: 1   MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LFTLDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
           +AF  + +L +SQ  +L GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H++L    S+L   P  PK+VLAS   LE GFS D+F++W  D KN ++ T R 
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L  +P  K  ++ + +RV L G+EL  Y E++   K+               
Sbjct: 360 TPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------- 411

Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
                    S +  +  ++ ++   D+ +P   + + D+++ G       F   +    P
Sbjct: 412 ---------SKEADIDSSDESDIEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYP 462

Query: 468 MFPFYENNSEWDDFGEVINPDDYII-----KDEDMDQAAMHIGGDDGKLDEGSASLILDA 522
           MFP  E   +WD++GE+I P+D+++      +E+  +    +   D  +D+  + +    
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDV---- 518

Query: 523 KPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQH 582
            P+K +S   ++++K  + +IDYEGR+DG SIK I++ + P +L++VHG  EA++ L + 
Sbjct: 519 -PTKCISTTESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAEC 577

Query: 583 CL----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA-- 636
           C     K +   VY P++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D   
Sbjct: 578 CRAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVL 635

Query: 637 --EVGKTENGML-----------------------------------------------S 647
              V K + G++                                                
Sbjct: 636 DMRVSKVDTGVILEEGELKDDGEDSEMQVEAPSDSSVIAQQKAMKSLFGDDEKETGEESE 695

Query: 648 LLPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGP 702
           ++P   P PP     H+SV + + +++D K  L  +GIQ EF GG L C   V +R+   
Sbjct: 696 IIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR--- 752

Query: 703 AGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 739
                  + T +I +EG LC+D+Y+IR  LY Q+ ++
Sbjct: 753 -------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>sp|Q9W799|CPSF2_XENLA Cleavage and polyadenylation specificity factor subunit 2
           OS=Xenopus laevis GN=cpsf2 PE=1 SV=1
          Length = 783

 Score =  509 bits (1310), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 294/812 (36%), Positives = 453/812 (55%), Gaps = 102/812 (12%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++T L G   E+ + YL+ +D F FL+DCGW+++F   ++  + K    +DAVLL
Sbjct: 1   MTSIIKLTTLVGAQEESAVCYLLQVDEFRFLLDCGWDENFSMDIIDSVKKYVHQVDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLGALPYA+ +LGL+  +++T PVY++G + MYD Y SR    +F LF+LDD+D
Sbjct: 61  SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFSLFSLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
            AF  + +L Y+Q  HL GKG G+ + P  AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 CAFDKIQQLKYNQIVHLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E HLNG  LE   RP++LITD++NA + QP R+QR E     + +TLR  GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMINRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
           GRVLEL  +L+  W        +Y    L  VS + +++ KS +EWM D + + FE  R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  +H+TL    S+L   P  PK+VLAS   LE GFS ++F++W  D KN V+ T R 
Sbjct: 301 NPFQFRHLTLCHGYSDLARVP-SPKVVLASQPDLECGFSRELFIQWCQDPKNSVILTYRT 359

Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
             GTLAR L   P  + + + + +RV L G+EL  Y E++         K +  K E+SK
Sbjct: 360 TPGTLARFLIDHPSERIIDIELRKRVKLEGKELEEYVEKEK------LKKEAAKKLEQSK 413

Query: 416 ASLGPDNNLSGDPMVIDA-NNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYEN 474
            +    ++ S     ID   +  A  D++  + G  +      F   +    PMFP  E+
Sbjct: 414 EADLDSSDDSDVEEDIDQITSHKAKHDLMMKNEGSRK----GSFFKQAKKSYPMFPAPED 469

Query: 475 NSEWDDFGEVINPDDYII------KDEDMDQAAMHIGGDDGKLDEGSASLILDAKPSKVV 528
             +WD++GE+I P+D+++      +DE     +    GD+  +D+  + +     P+K V
Sbjct: 470 RIKWDEYGEIIKPEDFLVPELQVTEDEKTKLESGLTNGDE-PMDQDLSDV-----PTKCV 523

Query: 529 SNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCL---- 584
           S   ++++K  + +IDYEGR+DG SIK I++ + P +L++VHG  +AT+ L + C     
Sbjct: 524 STTESMEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPDATQDLAEACRAFGG 583

Query: 585 KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVGK 640
           K +   VYTP++ ET+D TS+   Y+V+L + L+S++ F K  D E+AW+D      V K
Sbjct: 584 KDI--KVYTPKLHETVDATSETHIYQVRLKDSLVSSLKFCKAKDTELAWIDGVLDMRVSK 641

Query: 641 TENGML----------------------------------------------------SL 648
            + G++                                                    +L
Sbjct: 642 VDTGVILEERELKDEGEDMEMQVDTQVMDASTIAQQKVIKSLFGDDDKEFSEESEIIPTL 701

Query: 649 LPI-STPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKG 707
            P+ S   P H+SV + + +++D K  L  +GI  EF GG L C   V +R+        
Sbjct: 702 EPLPSNEVPGHQSVFMNEPRLSDFKQVLLREGIHAEFVGGVLVCNNMVAVRR-------- 753

Query: 708 GGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 739
             + T +I +EG LCED++KIR  LY Q+ ++
Sbjct: 754 --TETGRIGLEGCLCEDFFKIRELLYEQYAIV 783


>sp|Q55BS1|CPSF2_DICDI Cleavage and polyadenylation specificity factor subunit 2
           OS=Dictyostelium discoideum GN=cpsf2 PE=3 SV=1
          Length = 784

 Score =  451 bits (1161), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 270/807 (33%), Positives = 441/807 (54%), Gaps = 91/807 (11%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + ++ T LSG  +E+P  YL+ ID F  L+DCG + + D SLL+PL KVA  IDAVLL
Sbjct: 1   MASIIKFTALSGAKDESPPCYLLEIDDFCILLDCGLSYNLDFSLLEPLEKVAKKIDAVLL 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SH DT H+G LPY + + GL+  ++ T PV ++G + +YD Y ++    EF  ++LD+ID
Sbjct: 61  SHSDTTHIGGLPYVVGKYGLTGTIYGTTPVLKMGTMFLYDLYENKMSQEEFQQYSLDNID 120

Query: 121 SAF--QSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRR 178
           S F       L++SQ+Y LSGKG+GI + P++AGH +G +VWKITK    ++YA+DYN R
Sbjct: 121 SCFGEDRFKELSFSQHYSLSGKGKGISITPYLAGHTIGASVWKITKGTYSIVYAIDYNHR 180

Query: 179 KEKHLNGTVLES-FVRPAVLITDAYN-----ALHNQPPRQQREMFQDAISKTLRAGGNVL 232
            E HL+   L S  ++P++LITD+       A      R Q  +F+  I++ LR GGNVL
Sbjct: 181 NEGHLDSLQLTSDILKPSLLITDSKGVDKTLAFKKTITRDQ-SLFE-QINRNLRDGGNVL 238

Query: 233 LPVDSAGRVLELLLILEDYWAEH-SLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF 290
           +PVD+AGRVLELLL +E+YW+++ SL  Y + FL   S S   + +S LE+M  + +  F
Sbjct: 239 IPVDTAGRVLELLLCIENYWSKNKSLALYSVVFLGRFSFSVCQFARSQLEFMSSTASVKF 298

Query: 291 ETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
           E + +N F  KH+ +L +  EL   PD  K++L S   LE GFS ++F++W SD K L+L
Sbjct: 299 EQNIENPFSFKHIKILSSLEELQELPDTNKVILTSSQDLETGFSRELFIQWCSDPKTLIL 358

Query: 351 FTERGQFGTLA-RMLQADPPP----KAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALK 405
           FT++    +LA ++++    P    K +++    RVPL G+EL+ YE EQ + ++E+ L+
Sbjct: 359 FTQKIPKDSLADKLIKQYSTPNGRGKCIEIVQGSRVPLTGDELLQYEMEQAKQREEKRLE 418

Query: 406 ASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVP----- 460
              +++E+ +              +++A N +    +++    + R I+ D  V      
Sbjct: 419 Q--LRKEQEEREERERLEEEEREQLLNATNQDQLQQLLQLQQQKERGIIDDSMVHMKNPF 476

Query: 461 ------------PSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKDED--MDQAAMHIGG 506
                          S+  MFP++E + +W ++GE    DD I++++D  +++  M    
Sbjct: 477 ENDRFDLLDSEFKKQSMITMFPYFEKHLKWGEYGE--EDDDLILRNQDKKVEEVTME--- 531

Query: 507 DDGKLDEGSASLILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKL 566
                      +     P K+++  L + + C +  IDYEG +DGRSIK I+  +AP KL
Sbjct: 532 --------EDEIQEQEIPKKIITQTLRLPINCKIQTIDYEGCSDGRSIKAIIQQIAPTKL 583

Query: 567 VLVHGSAEATEHLKQHCLKHV-CPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKK 625
           VL+ GS + ++ ++ +  +++    +Y P I E +D+TSD   Y++ L + L++ +   K
Sbjct: 584 VLIRGSEQQSQSIENYVKENIRTKGIYIPSIGEQLDLTSDTNVYELLLKDSLVNTLKTSK 643

Query: 626 LGDYEIAWVDAEVGKTENGMLSLLPISTPAP----------------------------- 656
           + DYE++++  +V   +   + +L +    P                             
Sbjct: 644 ILDYEVSYIQGKVDILDGSNVPVLDLIQSIPINNNNNNNNNNNNNNNNNNNNTTMMTTTT 703

Query: 657 ----PHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGT 712
                H    +GD+K++DLK  L + GIQV+F  G L CG  V I +    G      G 
Sbjct: 704 TTTNGHDESFIGDIKLSDLKQVLVNAGIQVQFDQGILNCGGLVYIWRDEDHG------GN 757

Query: 713 QQIVIEGPLCEDYYKIRAYLYSQFYLL 739
             I ++G + ++YY I+  LY QF ++
Sbjct: 758 SIINVDGIISDEYYLIKELLYKQFQIV 784


>sp|O17403|CPSF2_CAEEL Probable cleavage and polyadenylation specificity factor subunit 2
           OS=Caenorhabditis elegans GN=cpsf-2 PE=3 SV=1
          Length = 843

 Score =  374 bits (960), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 229/689 (33%), Positives = 379/689 (55%), Gaps = 48/689 (6%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++   SG  +E PL YL+ +DG   L+DCGW++ F     + L      I AVL+
Sbjct: 1   MTSIIKLKVFSGAKDEGPLCYLLQVDGDYILLDCGWDERFGLQYFEELKPFIPKISAVLI 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLG LPY + + GL+APV++T PVY++G + +YD   S   V EF+ +TLDD+D
Sbjct: 61  SHPDPLHLGGLPYLVSKCGLTAPVYATVPVYKMGQMFIYDMVYSHLDVEEFEHYTLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK-DGEDVIYAVDYNRRK 179
           +AF+ V ++ Y+Q   L G   G+      AGH+LGG++W+I +  GED++Y VD+N +K
Sbjct: 121 TAFEKVEQVKYNQTVVLKGDS-GVHFTALPAGHMLGGSIWRICRVTGEDIVYCVDFNHKK 179

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           E+HLNG   ++F RP +LIT A++    Q  R+ R E     I +T+R  G+ ++ +D+A
Sbjct: 180 ERHLNGCSFDNFNRPHLLITGAHHISLPQMRRKDRDEQLVTKILRTVRQKGDCMIVIDTA 239

Query: 239 GRVLELLLILEDYWAEHSL---NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS-R 294
           GRVLEL  +L+  W+        Y +  +++V+SS + + KS LEWM + + K   +S R
Sbjct: 240 GRVLELAHLLDQLWSNADAGLSTYNLVMMSHVASSVVQFAKSQLEWMNEKLFKYDSSSAR 299

Query: 295 DNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
            N F LKHVTL  +  EL      PK+VL S   +E+GFS ++F++W SD +N V+ T R
Sbjct: 300 YNPFTLKHVTLCHSHQELMRVR-SPKVVLCSSQDMESGFSRELFLDWCSDPRNGVILTAR 358

Query: 355 GQFGTLARML-----QADP-----PPKAVKVTMSRRVPLVGEELIAYEE-------EQTR 397
               TLA  L     +A+        + + + + +RV L GEEL+ Y+        E+TR
Sbjct: 359 PASFTLAAKLVNMAERANDGVLKHEDRLISLVVKKRVALEGEELLEYKRRKAERDAEETR 418

Query: 398 LKKEEALKASLVKEEESK------ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR 451
           L+ E A + +   E +        A + P ++         + N   + D++     ++ 
Sbjct: 419 LRMERARRQAQANESDDSDDDDIAAPIVPRHSEKDFRSFDGSENDAHTFDIM----AKWD 474

Query: 452 DILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYII-------KDEDMDQAAMHI 504
           +     F   +    PMFP+ E   +WDD+GEVI P+DY +       K ++ D+  +  
Sbjct: 475 NQQKASFFKTTKKSFPMFPYIEEKVKWDDYGEVIKPEDYTVISKIDLRKGQNKDEPVVVK 534

Query: 505 GGDDGKLDEGSASLILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPL 564
             ++ +        + +  P+K V  +  V+V C + FI+YEG +DG S K +L+ + P 
Sbjct: 535 KREEEEEVYNPNDHV-EEMPTKCVEFKNRVEVSCRIEFIEYEGISDGESTKKLLAGLLPR 593

Query: 565 KLVLVHGSAEATEHLKQHCLKHV--CPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVL 622
           ++++VHGS + T  L  +          +  P+    +D + +   Y+V LS+ L++++ 
Sbjct: 594 QIIVVHGSRDDTRDLVAYFADSGFDTTMLKAPEAGALVDASVESFIYQVALSDALLADIQ 653

Query: 623 FKKLGD-YEIAWVDAEVGKTE--NGMLSL 648
           FK++ +   +AW+DA V + E  + ML++
Sbjct: 654 FKEVSEGNSLAWIDARVMEKEAIDNMLAV 682



 Score = 52.4 bits (124), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 43/85 (50%), Gaps = 11/85 (12%)

Query: 656 PPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRC-GEYVTIRKVGPAGQKGGGSGTQQ 714
           P H++V V D K++D K  L+ KG + EF  G L   G   +IR+          + T  
Sbjct: 769 PIHQAVFVNDPKLSDFKNLLTDKGYKAEFLSGTLLINGGNCSIRR----------NDTGV 818

Query: 715 IVIEGPLCEDYYKIRAYLYSQFYLL 739
             +EG   +DYYK+R   Y QF +L
Sbjct: 819 FQMEGAFTKDYYKLRRLFYDQFAVL 843


>sp|A8XUS3|CPSF2_CAEBR Probable cleavage and polyadenylation specificity factor subunit 2
           OS=Caenorhabditis briggsae GN=cpsf-2 PE=3 SV=2
          Length = 842

 Score =  367 bits (941), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 223/687 (32%), Positives = 372/687 (54%), Gaps = 56/687 (8%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
           M + +++   SG  +E PL YL+ +D    L+DCGW++ F+    + L      I AVL+
Sbjct: 1   MTSIIKLKVFSGAKDEGPLCYLLQVDNDYILLDCGWDERFELKYFEELRPYIPKISAVLI 60

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           SHPD LHLG LPY + + GL+APV+ T PVY++G + +YD   S   V EF  ++LDD+D
Sbjct: 61  SHPDPLHLGGLPYLVAKCGLTAPVYCTVPVYKMGQMFIYDLVYSHLDVEEFQHYSLDDVD 120

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK-DGEDVIYAVDYNRRK 179
            AF+ V ++ Y+Q   L G   G+      AGH++GG++W+I +  GED+IY VD+N RK
Sbjct: 121 MAFEKVEQVKYNQTVVLKGDS-GVNFTAMPAGHMIGGSMWRICRITGEDIIYCVDFNHRK 179

Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
           ++HL+G   ++F RP +LIT A++    Q  R+ R E     I +T+R  G+ ++ +D+A
Sbjct: 180 DRHLSGCSFDNFNRPHLLITGAHHISLPQMKRKDRDEQLVTKILRTVRQKGDCMIVIDTA 239

Query: 239 GRVLELLLILEDYWAEHSL---NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS-R 294
           GRVLEL  +L+  WA        Y +  +++V+SS + + KS LEWM + + +   +S R
Sbjct: 240 GRVLELAYLLDQLWANQDAGLSTYNLVMMSHVASSVVQFAKSQLEWMDEKLFRYDSSSAR 299

Query: 295 DNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
            N F LK+V L+ +  EL      PK+VL S   +E GFS ++F++W +D +N V+ T R
Sbjct: 300 YNPFTLKNVNLVHSHLELIKIR-SPKVVLCSSQDMETGFSRELFLDWCADQRNGVILTAR 358

Query: 355 -GQFGTLARMLQADPPP---------KAVKVTMSRRVPLVGEELIAYEE-------EQTR 397
              F   AR+++              K + + + +RVPL GEEL+ Y+        E+TR
Sbjct: 359 PASFTLAARLVELAERANDGVLRNEDKHLSLLVRKRVPLEGEELLEYKRRKAERDAEETR 418

Query: 398 LKKEEALKASLVKEEESKA----------SLGPDNNLSGDPMVIDANNANASADVVEPHG 447
           ++ E A + +   E +              L   ++ S D +  D++  +  A       
Sbjct: 419 IRMERARRQAQANESDDSDDDDIAAPIVPRLSEKDHRSFDAIENDSHCFDIMA------- 471

Query: 448 GRYRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDY-IIKDEDMDQA------ 500
            ++ +     F   +    PM+P+ E   +WDD+GEVI P+DY +I   DM +       
Sbjct: 472 -KWDNQQKASFFKSTKKSFPMYPYIEEKVKWDDYGEVIKPEDYTVISKIDMRKGKNKDEP 530

Query: 501 -AMHIGGDDGKLDEGSASLILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILS 559
             +H   D+ ++   +     +  P+K V     +++ C + FI+YEG +DG S K +L+
Sbjct: 531 VVVHKREDEEEVYNPNDH--DEEMPTKCVEFRNRIEISCRVEFIEYEGISDGESTKKMLA 588

Query: 560 HVAPLKLVLVHGSAEATEHLKQHCLKHVCP--HVYTPQIEETIDVTSDLCAYKVQLSEKL 617
            + P ++++VHGS + T  L  +   +      + TP   E ID + +   Y+V LS+ L
Sbjct: 589 GLMPRQIIIVHGSRDDTRDLYAYFTDNGFKKDQLNTPVANELIDASVESFIYQVSLSDAL 648

Query: 618 MSNVLFKKLGD-YEIAWVDAEVGKTEN 643
           ++ + FK++ +   +AW+DA + + E+
Sbjct: 649 LAEIQFKEVSEGNSLAWIDARIQEKES 675



 Score = 34.3 bits (77), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 17/47 (36%), Positives = 25/47 (53%), Gaps = 1/47 (2%)

Query: 644 GMLSLLPI-STPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGAL 689
           G L L P+     P H+++ V D K+++ K  L  KG + EF  G L
Sbjct: 755 GTLILTPLPKKQIPVHQAIFVNDPKLSEFKNLLVDKGYKAEFFSGTL 801


>sp|O74740|CFT2_SCHPO Cleavage factor two protein 2 OS=Schizosaccharomyces pombe (strain
           972 / ATCC 24843) GN=cft2 PE=1 SV=1
          Length = 797

 Score =  343 bits (880), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 254/804 (31%), Positives = 403/804 (50%), Gaps = 123/804 (15%)

Query: 23  VSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL-S 81
           + +DG +  ID G +D    SL  P  +V    D +LLSH D  H+G L YA  +    +
Sbjct: 18  IELDGIHIYIDPGSDD----SLKHP--EVPEQPDLILLSHSDLAHIGGLVYAYYKYDWKN 71

Query: 82  APVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKG 141
           A +++T P   +G +TM D  +    +S+    +  D+D+ F S+  L Y Q   L GK 
Sbjct: 72  AYIYATLPTINMGRMTMLDA-IKSNYISDM---SKADVDAVFDSIIPLRYQQPTLLLGKC 127

Query: 142 EGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT-------VLESFVRP 194
            G+ +  + AGH LGGT+W + K+ E V+YAVD+N  K+KHLNG        +LE+  RP
Sbjct: 128 SGLTITAYNAGHTLGGTLWSLIKESESVLYAVDWNHSKDKHLNGAALYSNGHILEALNRP 187

Query: 195 AVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 253
             LITDA N+L + P R++R E F +++  +L  GG VLLPVD+A RVLEL  IL+++W+
Sbjct: 188 NTLITDANNSLVSIPSRKKRDEAFIESVMSSLLKGGTVLLPVDAASRVLELCCILDNHWS 247

Query: 254 EHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSE 311
                L +PI FL+  S+ TIDY KS +EWMGD+I + F  + +N    +++  + + S+
Sbjct: 248 ASQPPLPFPILFLSPTSTKTIDYAKSMIEWMGDNIVRDFGIN-ENLLEFRNINTITDFSQ 306

Query: 312 LDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKN-LVLFTERG------------QFG 358
           + +   GPK++LA+  +LE GFS  I ++  S+  N L+LFT+R             ++ 
Sbjct: 307 ISHIGPGPKVILATALTLECGFSQRILLDLMSENSNDLILFTQRSRCPQNSLANQFIRYW 366

Query: 359 TLARMLQADPP-------PKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKE 411
             A   + D P        +AVK+    + PL GEEL +Y+E +   + ++A   +L   
Sbjct: 367 ERASKKKRDIPHPVGLYAEQAVKIKT--KEPLEGEELRSYQELEFSKRNKDAEDTAL--- 421

Query: 412 EESKASLGPDNNLSGDPMVIDANNANASADVVEPH----------GGRYRDILIDGFVPP 461
           E    ++  ++  S      D  + N       PH          G  +   L D  V  
Sbjct: 422 EFRNRTILDEDLSSSSSSEDDDLDLNTEV----PHVALGSSAFLMGKSFDLNLRDPAVQA 477

Query: 462 STSVAPMFPFYENNSEWDDFGEVINPDDYIIKDEDMDQAAMHIGGDDGKLDEGSA----S 517
             +   MFP+ E     D++GE+I   D+ + +E  +   +    DD  L   +     S
Sbjct: 478 LHTKYKMFPYIEKRRRIDEYGEIIKHQDFSMINEPANTLELENDSDDNALSNSNGKRKWS 537

Query: 518 LILDA------------KPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLK 565
            I D              PSK++++E T++V C + FID EG  DGRS+KTI+  V P +
Sbjct: 538 EINDGLQQKKEEEDEDEVPSKIITDEKTIRVSCQVQFIDIEGLHDGRSLKTIIPQVNPRR 597

Query: 566 LVLVHGSAEATEHLKQHC--LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLF 623
           LVL+H S E  E +K+ C  L      VY P   E I+V+ D+ A+ ++L++ L+ N+++
Sbjct: 598 LVLIHASTEEKEDMKKTCASLSAFTKDVYIPNYGEIINVSIDVNAFSLKLADDLIKNLIW 657

Query: 624 KKLGDYEIAWVDAEVGKTENGM---------------------------------LSLLP 650
            K+G+ E++ + A+V  ++                                    L+L  
Sbjct: 658 TKVGNCEVSHMLAKVEISKPSEEEDKKEEVEKKDGDKERNEEKKEEKETLPVLNALTLRS 717

Query: 651 ISTPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKVGPAGQKGGG 709
               AP    +LVG++++A L+  L  +GI  E  G G L CG  V +RK+      GG 
Sbjct: 718 DLARAPRAAPLLVGNIRLAYLRKALLDQGISAELKGEGVLLCGGAVAVRKLS-----GG- 771

Query: 710 SGTQQIVIEGPLCEDYYKIRAYLY 733
               +I +EG L   +++IR  +Y
Sbjct: 772 ----KISVEGSLSNRFFEIRKLVY 791


>sp|Q503E1|INT11_DANRE Integrator complex subunit 11 OS=Danio rerio GN=cpsf3l PE=2 SV=1
          Length = 598

 Score =  169 bits (428), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 110/356 (30%), Positives = 177/356 (49%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IKVTPLGAGQDVGRSCILVSIGGKNIMLDCGMHMGFNDDRRFPDFSYITQNGRLTEFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYMSEMVGYDGPIYMTHPTKAICPILLEDFRKITVDKKGETNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  L   Q   +  + E   +  + AGH+LG  + +I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVPLNLHQTVQVDDELE---IKAYYAGHVLGAAMVQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LI+++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDILISESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    ++S  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRSYADNP--GPMVVFATPGMLHAGQSLQIFKKWAGNEKNMVIM 350


>sp|Q3MHC2|INT11_RAT Integrator complex subunit 11 OS=Rattus norvegicus GN=Cpsf3l PE=2
           SV=1
          Length = 600

 Score =  168 bits (426), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 111/356 (31%), Positives = 180/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG    +ND   F D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSISGKNVMLDCGMHMGYNDDRRFPDFSYITQSGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRTFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350



 Score = 35.8 bits (81), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 20/82 (24%), Positives = 37/82 (45%)

Query: 519 ILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEH 578
           IL  +    +     ++VK  + ++ +   AD + I  ++    P  ++LVHG A+  E 
Sbjct: 363 ILSGQRKLEMEGRQMLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKMEF 422

Query: 579 LKQHCLKHVCPHVYTPQIEETI 600
           L+Q   +      Y P   ET+
Sbjct: 423 LRQKIEQEFRVSCYMPANGETV 444


>sp|Q9CWS4|INT11_MOUSE Integrator complex subunit 11 OS=Mus musculus GN=Cpsf3l PE=2 SV=1
          Length = 600

 Score =  168 bits (426), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 111/356 (31%), Positives = 180/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG    +ND   F D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSISGKNVMLDCGMHMGYNDDRRFPDFSYITQSGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRTFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350



 Score = 35.8 bits (81), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 20/84 (23%), Positives = 38/84 (45%)

Query: 519 ILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEH 578
           IL  +    +     ++VK  + ++ +   AD + I  ++    P  ++LVHG A+  E 
Sbjct: 363 ILSGQRKLEMEGRQMLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKMEF 422

Query: 579 LKQHCLKHVCPHVYTPQIEETIDV 602
           L+Q   +      Y P   ET+ +
Sbjct: 423 LRQKIEQEFRVSCYMPANGETVTL 446


>sp|Q5NVE6|INT11_PONAB Integrator complex subunit 11 OS=Pongo abelii GN=CPSF3L PE=2 SV=2
          Length = 600

 Score =  168 bits (425), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350



 Score = 37.4 bits (85), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 21/82 (25%), Positives = 38/82 (46%)

Query: 519 ILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEH 578
           IL  +    +     ++VK  + ++ +   AD + I  ++    P  ++LVHG A+  E 
Sbjct: 363 ILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKMEF 422

Query: 579 LKQHCLKHVCPHVYTPQIEETI 600
           LKQ   + +    Y P   ET+
Sbjct: 423 LKQKIEQELRVSCYMPANGETV 444


>sp|Q5TA45|INT11_HUMAN Integrator complex subunit 11 OS=Homo sapiens GN=CPSF3L PE=1 SV=2
          Length = 600

 Score =  168 bits (425), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S +    ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350



 Score = 38.1 bits (87), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 21/82 (25%), Positives = 39/82 (47%)

Query: 519 ILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEH 578
           IL  +    +     ++VK  + ++ +   AD + I  ++    P  ++LVHG A+  E 
Sbjct: 363 ILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKMEF 422

Query: 579 LKQHCLKHVCPHVYTPQIEETI 600
           LKQ   + +  + Y P   ET+
Sbjct: 423 LKQKIEQELRVNCYMPANGETV 444


>sp|Q5ZIH0|INT11_CHICK Integrator complex subunit 11 OS=Gallus gallus GN=CPSF3L PE=2 SV=1
          Length = 600

 Score =  167 bits (423), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 111/356 (31%), Positives = 180/356 (50%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG    +ND   F D S +    ++   +D 
Sbjct: 4   IKVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGYNDDRRFPDFSYITQNGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTKAICPILLEDYRKITVDKKGETNFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +    E + +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVD---EELEIKAYYAGHVLGAAMFQIKVGCESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP +LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W   +L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+    +++  DN   GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350



 Score = 38.1 bits (87), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 21/87 (24%), Positives = 40/87 (45%)

Query: 519 ILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEH 578
           IL  +    +     ++VK  + ++ +   AD + I  ++    P  ++LVHG A+  E 
Sbjct: 363 ILSGQRKLEMEGRQILEVKMQVEYMSFSAHADAKGIMQLIRQAEPRNVLLVHGEAKKMEF 422

Query: 579 LKQHCLKHVCPHVYTPQIEETIDVTSD 605
           LKQ   +    + Y P   ET  + ++
Sbjct: 423 LKQKIEQEFHVNCYMPANGETTTIFTN 449


>sp|Q2YDM2|INT11_BOVIN Integrator complex subunit 11 OS=Bos taurus GN=CPSF3L PE=2 SV=2
          Length = 599

 Score =  165 bits (417), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 176/356 (49%), Gaps = 18/356 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
           ++VTPL    +      LVSI G N ++DCG +  F       D S      ++   +D 
Sbjct: 4   IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFSDDRRFPDFSYNTRSGRLTDFLDC 63

Query: 58  VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
           V++SH    H GALPY  + +G   P++ T+P   +  + + D + ++  +  E + FT 
Sbjct: 64  VIISHFHLDHCGALPYFSEMVGYDGPIYMTQPTQAICPILLEDYRKIAVDKKGEANFFTS 123

Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
             I    + V  +   Q   +  + E   +  + AGH+LG  +++I    E V+Y  DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180

Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
              ++HL    ++   RP++LIT++  A   +  ++ RE  F   + +T+  GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPSLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239

Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            + GR  EL ++LE +W    L  PIYF T ++     Y K F+ W    I K+F   + 
Sbjct: 240 FALGRAQELCILLETFWERMDLKAPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQR 297

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
           N F  KH+          ++P GP +V A+   L AG S  IF +WA + KN+V+ 
Sbjct: 298 NMFEFKHIKAF--DRAFADSP-GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350



 Score = 37.7 bits (86), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 21/82 (25%), Positives = 38/82 (46%)

Query: 519 ILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEH 578
           IL  +    +     ++VK  + ++ +   AD + I  ++    P  ++LVHG A+  E 
Sbjct: 363 ILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPENVLLVHGEAKKMEF 422

Query: 579 LKQHCLKHVCPHVYTPQIEETI 600
           LKQ   +    + Y P   ET+
Sbjct: 423 LKQKIEQEFRVNCYMPANGETV 444


>sp|Q12102|CFT2_YEAST Cleavage factor two protein 2 OS=Saccharomyces cerevisiae (strain
           ATCC 204508 / S288c) GN=CFT2 PE=1 SV=1
          Length = 859

 Score =  165 bits (417), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 198/838 (23%), Positives = 328/838 (39%), Gaps = 179/838 (21%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPS------LLQPLSKVASTIDAVLLSHPDTLHLGA---LP 72
           +V  D    LID GWN    PS       ++   KV   ID ++LS P    LGA   L 
Sbjct: 19  VVRFDNVTLLIDPGWN----PSKVSYEQCIKYWEKVIPEIDVIILSQPTIECLGAHSLLY 74

Query: 73  YAMKQLGLSA-PVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLD--DIDSAFQSVTRL 129
           Y      +S   V++T PV  LG ++  D Y S   +  +D   LD  DI+ +F  +  L
Sbjct: 75  YNFTSHFISRIQVYATLPVINLGRVSTIDSYASAGVIGPYDTNKLDLEDIEISFDHIVPL 134

Query: 130 TYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN----- 184
            YSQ   L  + +G+ +  + AG   GG++W I+   E ++YA  +N  ++  LN     
Sbjct: 135 KYSQLVDLRSRYDGLTLLAYNAGVCPGGSIWCISTYSEKLVYAKRWNHTRDNILNAASIL 194

Query: 185 ---GTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRV 241
              G  L + +RP+ +IT       +QP +++ ++F+D + K L + G+V++PVD +G+ 
Sbjct: 195 DATGKPLSTLMRPSAIITTLDRFGSSQPFKKRSKIFKDTLKKGLSSDGSVIIPVDMSGKF 254

Query: 242 LELL-----LILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN 296
           L+L      L+ E          P+  L+Y    T+ Y KS LEW+  S+ K++E +R+N
Sbjct: 255 LDLFTQVHELLFESTKINAHTQVPVLILSYARGRTLTYAKSMLEWLSPSLLKTWE-NRNN 313

Query: 297 A--FLLKHVTLLINKSELDNAPDGPKLVLASMA------------------------SLE 330
              F +     +I  +EL   P G K+   S                          S E
Sbjct: 314 TSPFEIGSRIKIIAPNELSKYP-GSKICFVSEVGALINEVIIKVGNSEKTTLILTKPSFE 372

Query: 331 AGFSHDIFVEWA-SDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELI 389
              S D  +E    D +N   F E G+       +  D           +  PL  EE  
Sbjct: 373 CASSLDKILEIVEQDERNWKTFPEDGKSFLCDNYISID---------TIKEEPLSKEETE 423

Query: 390 AYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDAN-------------NA 436
           A++ +    K++   K  LVK E  K +       +G+ ++ D N             N 
Sbjct: 424 AFKVQLKEKKRDRNKKILLVKRESKKLA-------NGNAIIDDTNGERAMRNQDILVENV 476

Query: 437 NASADVVEPHGG---------------------------RYRDILIDGFVPPST-SVAPM 468
           N    +    GG                           +  ++ +D  + PS  S   M
Sbjct: 477 NGVPPIDHIMGGDEDDDEEEENDNLLNLLKDNSEKSAAKKNTEVPVDIIIQPSAASKHKM 536

Query: 469 FPFYENNSEWDDFGEVIN-----PDD---------------------------------- 489
           FPF     + DD+G V++     PDD                                  
Sbjct: 537 FPFNPAKIKKDDYGTVVDFTMFLPDDSDNVNQNSRKRPLKDGAKTTSPVNEEDNKNEEED 596

Query: 490 -YIIKDEDMDQAAMHIGGDDGKLDEGSAS-------LILDAKPSKVVSNELTVQVKCLLI 541
            Y + D    ++        G    G A        L +D   SK   + + VQ+KC ++
Sbjct: 597 GYNMSDPISKRSKHRASRYSGFSGTGEAENFDNLDYLKIDKTLSKRTISTVNVQLKCSVV 656

Query: 542 FIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETID 601
            ++ +   D RS   I   +   K+VL        E +    +K     V  P + + ++
Sbjct: 657 ILNLQSLVDQRSASIIWPSLKSRKIVLSAPKQIQNEEITAKLIKKNIEVVNMP-LNKIVE 715

Query: 602 VTSDLCAYKVQLSEKLMSNVLFKKLGD-YEIAWVDAEVGK------------TENGMLSL 648
            ++ +    + +   L + + ++++ D Y +A V   + K                 L L
Sbjct: 716 FSTTIKTLDISIDSNLDNLLKWQRISDSYTVATVVGRLVKESLPQVNNHQKTASRSKLVL 775

Query: 649 LPISTPAPPHKS--VLVGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKVGPA 703
            P+   +  HK+  + +GD+++A LK  L+ K    EF G G L   E V +RK+  A
Sbjct: 776 KPLHGSSRSHKTGALSIGDVRLAQLKKLLTEKNYIAEFKGEGTLVINEKVAVRKINDA 833


>sp|O13794|YSH1_SCHPO Endoribonuclease ysh1 OS=Schizosaccharomyces pombe (strain 972 /
           ATCC 24843) GN=ysh1 PE=3 SV=2
          Length = 757

 Score =  160 bits (404), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 101/320 (31%), Positives = 171/320 (53%), Gaps = 14/320 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVS-EF 111
           ST+D +L+SH    H+ +LPY M++      VF T P   +    + D Y+    V  E 
Sbjct: 69  STVDVLLISHFHLDHVASLPYVMQKTNFRGRVFMTHPTKAVCKWLLSD-YVKVSNVGMED 127

Query: 112 DLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIY 171
            L+   D+ +AF  +  +    +YH + + EGI   P+ AGH+LG  ++ +   G ++++
Sbjct: 128 QLYDEKDLLAAFDRIEAV----DYHSTIEVEGIKFTPYHAGHVLGACMYFVEMAGVNILF 183

Query: 172 AVDYNRRKEKHLNGTVLESFVRPAVLITDA-YNALHNQPPRQQREMFQDAISKTLRAGGN 230
             DY+R +++HL+   +    RP VLIT++ Y    +QP  ++     + I  T+R GG 
Sbjct: 184 TGDYSREEDRHLHVAEVPP-KRPDVLITESTYGTASHQPRLEKEARLLNIIHSTIRNGGR 242

Query: 231 VLLPVDSAGRVLELLLILEDYWAEH--SLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITK 288
           VL+PV + GR  ELLLIL++YW  H    + PIY+ + ++   +   ++++  M D+I K
Sbjct: 243 VLMPVFALGRAQELLLILDEYWNNHLDLRSVPIYYASSLARKCMAIFQTYVNMMNDNIRK 302

Query: 289 SFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNL 348
            F  +  N F+ + V  L N  + D+   GP ++LAS   L+ G S  +   WA D +N 
Sbjct: 303 IF--AERNPFIFRFVKSLRNLEKFDDI--GPSVILASPGMLQNGVSRTLLERWAPDPRNT 358

Query: 349 VLFTERGQFGTLARMLQADP 368
           +L T     GT+A+ +  +P
Sbjct: 359 LLLTGYSVEGTMAKQITNEP 378


>sp|Q74ZC0|YSH1_ASHGO Endoribonuclease YSH1 OS=Ashbya gossypii (strain ATCC 10895 / CBS
           109.51 / FGSC 9923 / NRRL Y-1056) GN=YSH1 PE=3 SV=2
          Length = 771

 Score =  159 bits (402), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 94/329 (28%), Positives = 171/329 (51%), Gaps = 20/329 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLSRRQ-- 107
           S ++ +L+SH    H  +LPY M++      VF T P   +YR  LL+ + +  +     
Sbjct: 61  SQVEVLLISHFHLDHAASLPYVMQRTNFQGRVFMTHPTKAIYRW-LLSDFVKVTNIGNDN 119

Query: 108 ---VSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
              VS+ +L+T +D+  +F  +  +    +YH +    GI    + AGH+LG  ++++  
Sbjct: 120 AGGVSDENLYTDEDLAESFDRIETV----DYHSTIDVNGIKFTAYHAGHVLGAAMFQVEI 175

Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
            G  +++  DY+R  ++HLN   + +     +++   +    ++P   + +     I  T
Sbjct: 176 AGLRILFTGDYSRELDRHLNSAEIPTLPSDILIVESTFGTATHEPRTSKEKKLTQLIHTT 235

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNY-----PIYFLTYVSSSTIDYVKSFL 279
           +  GG VLLPV + GR  E++LIL++YW++H+        PI++ + ++   +   ++++
Sbjct: 236 VSKGGRVLLPVFALGRAQEIMLILDEYWSQHAEQLGNGQVPIFYASNLARKCMSVFQTYV 295

Query: 280 EWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFV 339
             M D I K F  S+ N F+ K+++ L N  E  +   GP ++LAS   L+ G S D+  
Sbjct: 296 NMMNDKIRKKFRDSQTNPFIFKNISYLKNLDEFQDF--GPSVMLASPGMLQNGLSRDLLE 353

Query: 340 EWASDVKNLVLFTERGQFGTLARMLQADP 368
           +W  D KNLVL T     GT+A+ L  +P
Sbjct: 354 KWCPDEKNLVLITGYSVEGTMAKFLMLEP 382


>sp|Q9C952|CPSF3_ARATH Cleavage and polyadenylation specificity factor subunit 3-I
           OS=Arabidopsis thaliana GN=CPSF73-I PE=1 SV=1
          Length = 693

 Score =  159 bits (402), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 114/385 (29%), Positives = 188/385 (48%), Gaps = 40/385 (10%)

Query: 2   GTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCG------------WNDHFDPSLLQPLS 49
           G  + VTPL            +S  G N L DCG            + D  DPS      
Sbjct: 19  GDQLIVTPLGAGSEVGRSCVYMSFRGKNILFDCGIHPAYSGMAALPYFDEIDPS------ 72

Query: 50  KVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRR 106
               +ID +L++H    H  +LPY +++   +  VF   +T+ +Y+L LLT Y + +S+ 
Sbjct: 73  ----SIDVLLITHFHIDHAASLPYFLEKTTFNGRVFMTHATKAIYKL-LLTDYVK-VSKV 126

Query: 107 QVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG 166
            V +  LF   DI+ +   +  + + Q   ++G    I    + AGH+LG  ++ +   G
Sbjct: 127 SVEDM-LFDEQDINKSMDKIEVIDFHQTVEVNG----IKFWCYTAGHVLGAAMFMVDIAG 181

Query: 167 EDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTL 225
             ++Y  DY+R +++HL    L  F  P + I ++ + +     R  RE  F D I  T+
Sbjct: 182 VRILYTGDYSREEDRHLRAAELPQF-SPDICIIESTSGVQLHQSRHIREKRFTDVIHSTV 240

Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMG 283
             GG VL+P  + GR  ELLLIL++YWA H    N PIY+ + ++   +   ++++  M 
Sbjct: 241 AQGGRVLIPAFALGRAQELLLILDEYWANHPDLHNIPIYYASPLAKKCMAVYQTYILSMN 300

Query: 284 DSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWAS 343
           D I   F  S  N F+ KH++ L +  + ++   GP +V+A+   L++G S  +F  W S
Sbjct: 301 DRIRNQFANS--NPFVFKHISPLNSIDDFNDV--GPSVVMATPGGLQSGLSRQLFDSWCS 356

Query: 344 DVKNLVLFTERGQFGTLARMLQADP 368
           D KN  +       GTLA+ +  +P
Sbjct: 357 DKKNACIIPGYMVEGTLAKTIINEP 381



 Score = 38.9 bits (89), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 37/140 (26%), Positives = 63/140 (45%), Gaps = 13/140 (9%)

Query: 509 GKLDEGSASLILDAKPSKV-VSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLV 567
           G + EG+ +  +  +P +V + N LT  +   + +I +   AD     T L  + P  ++
Sbjct: 366 GYMVEGTLAKTIINEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNII 425

Query: 568 LVHGSAEATEHLKQHCLKHV---CPHVYTPQIEETIDV---TSDLCAYKVQLSEK----- 616
           LVHG A     LKQ  L         + TP+  E++++   +  L     +L+EK     
Sbjct: 426 LVHGEANEMMRLKQKLLTEFPDGNTKIMTPKNCESVEMYFNSEKLAKTIGRLAEKTPDVG 485

Query: 617 -LMSNVLFKKLGDYEIAWVD 635
             +S +L KK   Y+I   D
Sbjct: 486 DTVSGILVKKGFTYQIMAPD 505


>sp|Q6FUA5|YSH1_CANGA Endoribonuclease YSH1 OS=Candida glabrata (strain ATCC 2001 / CBS
           138 / JCM 3761 / NBRC 0622 / NRRL Y-65) GN=YSH1 PE=3
           SV=1
          Length = 771

 Score =  159 bits (401), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 106/371 (28%), Positives = 183/371 (49%), Gaps = 23/371 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLS----R 105
           S +D +L+SH    H  +LPY M++      VF T P   +YR  LL  + +  S     
Sbjct: 60  SIVDVLLISHFHLDHAASLPYVMQKTNFKGRVFMTHPTKAIYRW-LLRDFVRVTSIGSQS 118

Query: 106 RQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKD 165
               + +L++ +D+  +F  +  +    +YH      GI      AGH+LG  +++I   
Sbjct: 119 SNAEDDNLYSNEDLIESFDKIETI----DYHSMIDVNGIKFTAFHAGHVLGAAMFQIEIA 174

Query: 166 GEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTL 225
           G  V++  DY+R  ++HLN   +       +++   +    ++P   + +     I  T+
Sbjct: 175 GLRVLFTGDYSREIDRHLNSAEVPPLPSDILIVESTFGTATHEPRLHREKKLTQLIHSTV 234

Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWAEH-----SLNYPIYFLTYVSSSTIDYVKSFLE 280
             GG VL+PV + GR  EL+LIL++YW++H     S   PI++ + ++   +   ++++ 
Sbjct: 235 NKGGRVLMPVFALGRAQELMLILDEYWSQHKEELGSNQIPIFYASNLARKCLSVFQTYVN 294

Query: 281 WMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVE 340
            M D+I K F  S+ N F+ K++  + N  E  +   GP ++LAS   L+ G S D+   
Sbjct: 295 MMNDNIRKKFRDSQTNPFIFKNIAYIKNLDEFQDF--GPSVMLASPGMLQNGLSRDLLER 352

Query: 341 WASDVKNLVLFTERGQFGTLAR--MLQADPPPKAV--KVTMSRRVPLVGEELIAYEEEQT 396
           W  D KNLVL T     GT+A+  +L+ D  P     +VT+ RR  +      A+ + Q 
Sbjct: 353 WCPDEKNLVLITGYSVEGTMAKYLLLEPDTIPSVSNPEVTIPRRCRVEELSFAAHVDFQE 412

Query: 397 RLKKEEALKAS 407
            L+  E + AS
Sbjct: 413 NLEFIEQINAS 423


>sp|Q6CUI5|YSH1_KLULA Endoribonuclease YSH1 OS=Kluyveromyces lactis (strain ATCC 8585 /
           CBS 2359 / DSM 70799 / NBRC 1267 / NRRL Y-1140 / WM37)
           GN=YSH1 PE=3 SV=1
          Length = 764

 Score =  158 bits (399), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 101/348 (29%), Positives = 175/348 (50%), Gaps = 24/348 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLS----- 104
           STID +L+SH    H  +LPY M++      VF T P   +YR  LL  + +  S     
Sbjct: 64  STIDLLLISHFHLDHAASLPYVMQRTNFRGRVFMTHPTKAIYRW-LLNDFVKVTSIGDSP 122

Query: 105 RRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
            +  S  +L++ +D+  +F  +  +    +YH + +  GI      AGH+LG  +++I  
Sbjct: 123 GQDSSNDNLYSDEDLAESFDRIETI----DYHSTMEVNGIKFTAFHAGHVLGAAMFQIEI 178

Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
            G  V++  DY+R  ++HLN   +       +++   +    ++P + +       I   
Sbjct: 179 AGVRVLFTGDYSREVDRHLNSAEVPPQSSDVIIVESTFGTATHEPRQNRERKLTQLIHTV 238

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL-----NYPIYFLTYVSSSTIDYVKSFL 279
           +  GG VLLPV + GR  E++LIL++YW  H         PI++ + ++   +   ++++
Sbjct: 239 VSKGGRVLLPVFALGRAQEIMLILDEYWQNHKEELGNGQVPIFYASNLAKKCMSVFQTYV 298

Query: 280 EWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFV 339
             M D I K F+ S+ N F+ K+++ L N  E ++   GP ++LAS   L+ G S DI  
Sbjct: 299 NMMNDDIRKKFKDSQTNPFIFKNISYLKNLDEFEDF--GPSVMLASPGMLQNGLSRDILE 356

Query: 340 EWASDVKNLVLFTERGQFGTLARML----QADPPPKAVKVTMSRRVPL 383
           +W  + KNLVL T     GT+A+ L    +A P     ++T+ RR  +
Sbjct: 357 KWCPEEKNLVLVTGYSVEGTMAKYLLLEPEAIPSVHNPEITIPRRCQV 404


>sp|Q6C2Z7|YSH1_YARLI Endoribonuclease YSH1 OS=Yarrowia lipolytica (strain CLIB 122 / E
           150) GN=YSH1 PE=3 SV=2
          Length = 827

 Score =  156 bits (395), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 98/322 (30%), Positives = 169/322 (52%), Gaps = 15/322 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLSRRQVS 109
           STID +L+SH    H  +LPY M++      VF T P   +YR  LL+ + +  S  + S
Sbjct: 87  STIDILLISHFHLDHAASLPYVMQKTNFKGRVFMTHPTKGIYRW-LLSDFVRVTSGAE-S 144

Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
           + DL++  D+ ++F  +  +    +YH + +  G+    + AGH+LG  ++ I   G  V
Sbjct: 145 DPDLYSEADLTASFNKIETI----DYHSTMEVNGVKFTAYHAGHVLGAAMYTIEVGGVKV 200

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAG 228
           ++  DY+R +++HLN   +   ++P +LI ++        PR +RE      I  TL  G
Sbjct: 201 LFTGDYSREEDRHLNQAEVPP-MKPDILICESTYGTGTHLPRLEREQRLTGLIHSTLDKG 259

Query: 229 GNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSI 286
           G  LLPV + GR  E+LLIL++YW  H     + IY+ + ++   I   ++++  M D+I
Sbjct: 260 GKCLLPVFALGRAQEILLILDEYWEAHPDLQEFSIYYASALAKKCIAVYQTYINMMNDNI 319

Query: 287 TKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVK 346
            + F   + N F  K++  + N    D+   GP +++AS   L++G S  +   WA D K
Sbjct: 320 RRRFRDQKTNPFRFKYIKNIKNLDRFDDM--GPCVMVASPGMLQSGVSRSLLERWAPDPK 377

Query: 347 NLVLFTERGQFGTLARMLQADP 368
           N ++ T     GT+A+ +  +P
Sbjct: 378 NTLILTGYSVEGTMAKQIINEP 399


>sp|Q06224|YSH1_YEAST Endoribonuclease YSH1 OS=Saccharomyces cerevisiae (strain ATCC
           204508 / S288c) GN=YSH1 PE=1 SV=1
          Length = 779

 Score =  154 bits (389), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 104/371 (28%), Positives = 181/371 (48%), Gaps = 23/371 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGL-----LTMYDQYLS 104
           S +D +L+SH    H  +LPY M++      VF T P   +YR  L     +T      S
Sbjct: 59  SKVDILLISHFHLDHAASLPYVMQRTNFQGRVFMTHPTKAIYRWLLRDFVRVTSIGSSSS 118

Query: 105 RRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
                +  LF+ +D+  +F  +  +    +YH +    GI      AGH+LG  +++I  
Sbjct: 119 SMGTKDEGLFSDEDLVDSFDKIETV----DYHSTVDVNGIKFTAFHAGHVLGAAMFQIEI 174

Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
            G  V++  DY+R  ++HLN   +       +++   +    ++P   +       I  T
Sbjct: 175 AGLRVLFTGDYSREVDRHLNSAEVPPLSSNVLIVESTFGTATHEPRLNRERKLTQLIHST 234

Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHS-----LNYPIYFLTYVSSSTIDYVKSFL 279
           +  GG VLLPV + GR  E++LIL++YW++H+        PI++ + ++   +   ++++
Sbjct: 235 VMRGGRVLLPVFALGRAQEIMLILDEYWSQHADELGGGQVPIFYASNLAKKCMSVFQTYV 294

Query: 280 EWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFV 339
             M D I K F  S+ N F+ K+++ L N  +  +   GP ++LAS   L++G S D+  
Sbjct: 295 NMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF--GPSVMLASPGMLQSGLSRDLLE 352

Query: 340 EWASDVKNLVLFTERGQFGTLAR--MLQAD--PPPKAVKVTMSRRVPLVGEELIAYEEEQ 395
            W  + KNLVL T     GT+A+  ML+ D  P     ++T+ RR  +      A+ + Q
Sbjct: 353 RWCPEDKNLVLITGYSIEGTMAKFIMLEPDTIPSINNPEITIPRRCQVEEISFAAHVDFQ 412

Query: 396 TRLKKEEALKA 406
             L+  E + A
Sbjct: 413 ENLEFIEKISA 423


>sp|P0CM88|YSH1_CRYNJ Endoribonuclease YSH1 OS=Cryptococcus neoformans var. neoformans
           serotype D (strain JEC21 / ATCC MYA-565) GN=YSH1 PE=3
           SV=1
          Length = 773

 Score =  152 bits (385), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 100/324 (30%), Positives = 169/324 (52%), Gaps = 14/324 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGL---SAPVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+DA+L++H    H  ALPY M++      +  V+ T     +  LTM D      Q  
Sbjct: 79  STVDAMLITHFHVDHAAALPYIMEKTNFKDGNGKVYMTHATKAIYGLTMMDTVRLNDQNP 138

Query: 110 EFD--LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGE 167
           +    L+   D+ S++QS   + Y Q+  ++G   G+   P+ AGH+LG +++ I   G 
Sbjct: 139 DTSGRLYDEADVQSSWQSTIAVDYHQDIVIAG---GLRFTPYHAGHVLGASMFLIEIAGL 195

Query: 168 DVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLR 226
            ++Y  DY+R +++HL    +   V+P V+I ++   +H  P R+++E  F   ++  +R
Sbjct: 196 KILYTGDYSREEDRHLVMAEIPP-VKPDVMICESTFGVHTLPDRKEKEEQFTTLVANIVR 254

Query: 227 AGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGD 284
            GG  L+P+ S G   EL L+L++YW +H    N P+YF + +    +   K+++  M  
Sbjct: 255 RGGRCLMPIPSFGNGQELALLLDEYWNDHPELQNIPVYFASSLFQRGMRVYKTYVHTMNA 314

Query: 285 SITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASD 344
           +I   F   RDN F  + V  L +  +L     GP ++++S   +  G S D+  EWA D
Sbjct: 315 NIRSRF-ARRDNPFDFRFVKWLKDPQKLREN-KGPCVIMSSPQFMSFGLSRDLLEEWAPD 372

Query: 345 VKNLVLFTERGQFGTLARMLQADP 368
            KN V+ T     GT+AR L ++P
Sbjct: 373 SKNGVIVTGYSIEGTMARTLLSEP 396


>sp|P0CM89|YSH1_CRYNB Endoribonuclease YSH1 OS=Cryptococcus neoformans var. neoformans
           serotype D (strain B-3501A) GN=YSH1 PE=3 SV=1
          Length = 773

 Score =  152 bits (385), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 100/324 (30%), Positives = 169/324 (52%), Gaps = 14/324 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGL---SAPVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+DA+L++H    H  ALPY M++      +  V+ T     +  LTM D      Q  
Sbjct: 79  STVDAMLITHFHVDHAAALPYIMEKTNFKDGNGKVYMTHATKAIYGLTMMDTVRLNDQNP 138

Query: 110 EFD--LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGE 167
           +    L+   D+ S++QS   + Y Q+  ++G   G+   P+ AGH+LG +++ I   G 
Sbjct: 139 DTSGRLYDEADVQSSWQSTIAVDYHQDIVIAG---GLRFTPYHAGHVLGASMFLIEIAGL 195

Query: 168 DVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLR 226
            ++Y  DY+R +++HL    +   V+P V+I ++   +H  P R+++E  F   ++  +R
Sbjct: 196 KILYTGDYSREEDRHLVMAEIPP-VKPDVMICESTFGVHTLPDRKEKEEQFTTLVANIVR 254

Query: 227 AGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGD 284
            GG  L+P+ S G   EL L+L++YW +H    N P+YF + +    +   K+++  M  
Sbjct: 255 RGGRCLMPIPSFGNGQELALLLDEYWNDHPELQNIPVYFASSLFQRGMRVYKTYVHTMNA 314

Query: 285 SITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASD 344
           +I   F   RDN F  + V  L +  +L     GP ++++S   +  G S D+  EWA D
Sbjct: 315 NIRSRF-ARRDNPFDFRFVKWLKDPQKLREN-KGPCVIMSSPQFMSFGLSRDLLEEWAPD 372

Query: 345 VKNLVLFTERGQFGTLARMLQADP 368
            KN V+ T     GT+AR L ++P
Sbjct: 373 SKNGVIVTGYSIEGTMARTLLSEP 396


>sp|Q9UKF6|CPSF3_HUMAN Cleavage and polyadenylation specificity factor subunit 3 OS=Homo
           sapiens GN=CPSF3 PE=1 SV=1
          Length = 684

 Score =  147 bits (372), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>sp|P79101|CPSF3_BOVIN Cleavage and polyadenylation specificity factor subunit 3 OS=Bos
           taurus GN=CPSF3 PE=2 SV=1
          Length = 684

 Score =  147 bits (372), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   +++G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>sp|Q9QXK7|CPSF3_MOUSE Cleavage and polyadenylation specificity factor subunit 3 OS=Mus
           musculus GN=Cpsf3 PE=1 SV=2
          Length = 684

 Score =  146 bits (369), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 187/371 (50%), Gaps = 24/371 (6%)

Query: 22  LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
           ++   G   ++DCG +   +     P   +   + ID +L+SH    H GALP+ +++  
Sbjct: 29  ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88

Query: 80  LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
                F   +T+ +YR     +   Y+    +S  D L+T  D++ +   +  +    N+
Sbjct: 89  FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
           H   +  GI    + AGH+LG  ++ I   G  ++Y  D++R++++HL    + + ++P 
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           +LI ++    H    R++RE  F + +   +  GG  L+PV + GR  ELLLIL++YW  
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
           H    + PIY+ + ++   +   ++++  M D I K      +N F+ KH++ L +    
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317

Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
           D+   GP +V+AS   ++ G S ++F  W +D +N V+       GTLA+ + ++  P+ 
Sbjct: 318 DDI--GPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373

Query: 373 VKVTMSRRVPL 383
           +     +++PL
Sbjct: 374 ITTMSGQKLPL 384


>sp|Q4PEJ3|YSH1_USTMA Endoribonuclease YSH1 OS=Ustilago maydis (strain 521 / FGSC 9021)
           GN=YSH1 PE=3 SV=1
          Length = 880

 Score =  143 bits (361), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 91/322 (28%), Positives = 168/322 (52%), Gaps = 13/322 (4%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLS---APVFSTEPVYRLGLLTMYDQYLSRRQVS 109
           ST+DA+L++H    H  AL Y M++         V+ T P   +    M D        +
Sbjct: 74  STVDAILITHFHLDHAAALTYIMEKTNFRDGHGKVYMTHPTKAVYRFLMSDFVRISNAGN 133

Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
           + +LF  +++ ++++ +  + + Q+  ++G   G+    + AGH+LG  ++ I   G  +
Sbjct: 134 DDNLFDENEMLASWRQIEAVDFHQDVSIAG---GLRFTSYHAGHVLGACMFLIEIAGLRI 190

Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAG 228
           +Y  D++R +++HL    +   V+P VLI ++        PR  +E  F   I   ++ G
Sbjct: 191 LYTGDFSREEDRHLVQAEIPP-VKPDVLICESTYGTQTHEPRLDKEHRFTSQIHHIIKRG 249

Query: 229 GNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSI 286
           G VLLPV   GR  ELLL+L++YWA H    + PIY+ + ++   I   ++++  M D I
Sbjct: 250 GRVLLPVFVLGRAQELLLLLDEYWAAHPELHSVPIYYASALAKKCISVYQTYIHTMNDHI 309

Query: 287 TKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVK 346
              F   RDN F+ KH++ L +  + ++   GP +++AS   +++G S ++   WA D +
Sbjct: 310 RTRF-NRRDNPFVFKHISNLRSLEKFEDR--GPCVMMASPGFMQSGVSRELLERWAPDKR 366

Query: 347 NLVLFTERGQFGTLARMLQADP 368
           N ++ +     GT+AR +  +P
Sbjct: 367 NGLIVSGYSVEGTMARNILNEP 388


>sp|Q54YL3|INT11_DICDI Integrator complex subunit 11 homolog OS=Dictyostelium discoideum
           GN=ints11 PE=3 SV=1
          Length = 744

 Score =  142 bits (358), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 104/371 (28%), Positives = 177/371 (47%), Gaps = 19/371 (5%)

Query: 4   SVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGW----ND--HF-DPSLLQPLSKVASTID 56
           +++V PL    +      +V+I   N + DCG     ND   F D S +    +    ID
Sbjct: 2   TIKVVPLGAGQDVGRSCVIVTIGNKNIMFDCGMHMGMNDARRFPDFSYISKNGQFTKVID 61

Query: 57  AVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFT 115
            V+++H    H GALP+  +  G   P++ T P   +  + + D + ++  +  E + FT
Sbjct: 62  CVIITHFHLDHCGALPFFTEMCGYDGPIYMTLPTKAICPILLEDYRKITVEKKGETNFFT 121

Query: 116 LDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDY 175
              I    + V  +   Q   +    E + +  + AGH+LG  ++      E V+Y  DY
Sbjct: 122 AQMIKDCMKKVIPVNLHQTIKVD---EELSIKAYYAGHVLGAAMFYAKVGDESVVYTGDY 178

Query: 176 NRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLP 234
           N   ++HL    ++  V+P VLIT+   A   +  ++ RE  F   I + +  GG VL+P
Sbjct: 179 NMTPDRHLGSAWIDQ-VKPDVLITETTYATTIRDSKRGRERDFLKRIHECVEKGGKVLIP 237

Query: 235 VDSAGRVLELLLILEDYWAEHSLNY-PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
           V + GRV EL ++++ YW + +L + PIYF   ++     Y K F+ W    I ++F   
Sbjct: 238 VFALGRVQELCILIDSYWEQMNLGHIPIYFSAGLAEKANLYYKLFINWTNQKIKQTF--V 295

Query: 294 RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
           + N F  KH+     +S L +AP G  ++ A+   L AG S ++F +WA +  N+ +   
Sbjct: 296 KRNMFDFKHIKPF--QSHLVDAP-GAMVLFATPGMLHAGASLEVFKKWAPNELNMTIIPG 352

Query: 354 RGQFGTLARML 364
               GT+   L
Sbjct: 353 YCVVGTVGNKL 363



 Score = 39.7 bits (91), Expect = 0.099,   Method: Compositional matrix adjust.
 Identities = 18/67 (26%), Positives = 33/67 (49%)

Query: 528 VSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCLKHV 587
           +  + T++VKC +  + +   AD + I  ++    P  ++LVHG  E    L Q  +K +
Sbjct: 383 IDKKTTIEVKCKIHNLSFSAHADAKGILQLIKMSNPRNVILVHGEKEKMGFLSQKIIKEM 442

Query: 588 CPHVYTP 594
             + Y P
Sbjct: 443 GVNCYYP 449


>sp|Q86A79|CPSF3_DICDI Cleavage and polyadenylation specificity factor subunit 3
           OS=Dictyostelium discoideum GN=cpsf3 PE=3 SV=1
          Length = 774

 Score =  142 bits (358), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 100/373 (26%), Positives = 180/373 (48%), Gaps = 19/373 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVAST----IDAVLL 60
           +++TP+           L+   G   + DCG +  +   +  P      +    ID +L+
Sbjct: 36  LEITPIGSGSEVGRSCVLLKYKGKKVMFDCGVHPAYSGLVSLPFFDSIESDIPDIDLLLV 95

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD--LFTLDD 118
           SH    H  A+PY + +      VF T P   +  + + D Y+    ++  D  LF   D
Sbjct: 96  SHFHLDHAAAVPYFVGKTKFKGRVFMTHPTKAIYGMLLSD-YVKVSNITRDDDMLFDKSD 154

Query: 119 IDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRR 178
           +D + + + ++ Y Q      +  GI V    AGH+LG  ++ I   G  ++Y  D++R+
Sbjct: 155 LDRSLEKIEKVRYRQKV----EHNGIKVTCFNAGHVLGAAMFMIEIAGVKILYTGDFSRQ 210

Query: 179 KEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDS 237
           +++HL G      V+  VLI ++   +    PR +RE  F  ++ + +   G  L+PV +
Sbjct: 211 EDRHLMGAETPP-VKVDVLIIESTYGVQVHEPRLEREKRFTSSVHQVVERNGKCLIPVFA 269

Query: 238 AGRVLELLLILEDYW-AEHSLNY-PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
            GR  ELLLIL++YW A   L++ PIY+ + ++   +   ++++  M D +   F+ S  
Sbjct: 270 LGRAQELLLILDEYWIANPQLHHVPIYYASALAKKCMGVYRTYINMMNDRVRAQFDVS-- 327

Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
           N F  KH+  +      D+   GP + +AS   L++G S  +F  W SD +N ++     
Sbjct: 328 NPFEFKHIKNIKGIESFDDR--GPCVFMASPGMLQSGLSRQLFERWCSDKRNGIVIPGYS 385

Query: 356 QFGTLARMLQADP 368
             GTLA+ + ++P
Sbjct: 386 VEGTLAKHIMSEP 398


>sp|Q4IPN9|YSH1_GIBZE Endoribonuclease YSH1 OS=Gibberella zeae (strain PH-1 / ATCC
           MYA-4620 / FGSC 9075 / NRRL 31084) GN=YSH1 PE=3 SV=2
          Length = 833

 Score =  142 bits (357), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 103/379 (27%), Positives = 181/379 (47%), Gaps = 28/379 (7%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           +++   G   ++D G +  +D     P       ST+D +L+SH    H  +LPY + + 
Sbjct: 41  HIIQYKGKTVMLDAGQHPAYDGLAALPFYDDFDLSTVDVLLISHFHIDHAASLPYVLAKT 100

Query: 79  GLSAPVFSTEPVYRLGLLTMYDQYL---SRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNY 135
                VF T P   +    + D      +    +   ++T  D  + F  +  + Y   +
Sbjct: 101 NFRGRVFMTHPTKAIYKWLIQDSVRVGNTSSNPTTQPVYTEQDHLNTFPQIEAIDYHTTH 160

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
            +S     I + P+ AGH+LG  ++ I   G ++ +  DY+R +++HL    +   V+  
Sbjct: 161 TISS----IRITPYPAGHVLGAAMFLIEIAGLNIFFTGDYSREQDRHLVSAEVPKGVKID 216

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT++   + +  PR +RE     +I+  L  GG VL+PV + GR  ELLLIL++YW +
Sbjct: 217 VLITESTYGIASHVPRLEREQALMKSITSILNRGGRVLMPVFALGRAQELLLILDEYWGK 276

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF-------ETSRDNA-----FLL 300
           H+    YPIY+ + ++   +   ++++  M D+I + F       E S D A     +  
Sbjct: 277 HADFQKYPIYYASNLARKCMLIYQTYVGAMNDNIKRLFRERMAEAEASGDGAGKGGPWDF 336

Query: 301 KHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTL 360
           K++  L N    D+   G  ++LAS   L+ G S ++   WA   KN V+ T     GT+
Sbjct: 337 KYIRSLKNLDRFDDV--GGCVMLASPGMLQNGVSRELLERWAPSEKNGVIITGYSVEGTM 394

Query: 361 ARMLQADPPPKAVKVTMSR 379
           A+ +  +  P  ++  MSR
Sbjct: 395 AKQIMQE--PDQIQAVMSR 411


>sp|Q5BEP0|YSH1_EMENI Endoribonuclease ysh1 OS=Emericella nidulans (strain FGSC A4 / ATCC
           38163 / CBS 112.46 / NRRL 194 / M139) GN=ysh1 PE=3 SV=1
          Length = 884

 Score =  136 bits (343), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 100/361 (27%), Positives = 172/361 (47%), Gaps = 19/361 (5%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H  ALPY + +      VF T     +    + D        S  D
Sbjct: 74  STVDILLISHFHVDHSSALPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVNNTASSSD 133

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
             T    +    S   L  + +++ +     I + P+ AGH+LG  ++ I+  G ++++ 
Sbjct: 134 QRTTLYTEHDHLSTLPLIETIDFNTTHTINSIRITPYPAGHVLGAAMFLISIAGLNILFT 193

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNV 231
            DY+R +++HL    +   V+  VLIT++   + + PPR +RE     +I+  L  GG V
Sbjct: 194 GDYSREEDRHLIPATVPRGVKIDVLITESTFGISSNPPRLEREAALMKSITGVLNRGGRV 253

Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           L+PV + GR  ELLLILE+YW  H      PIY++   +   +   ++++  M D+I + 
Sbjct: 254 LMPVFALGRAQELLLILEEYWETHPELQKIPIYYIGNTARRCMVVYQTYIGAMNDNIKRL 313

Query: 290 F-------ETSRDNA-----FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
           F       E S D +     +  K+V  L +    D+   G  ++LAS   L+ G S ++
Sbjct: 314 FRQRMAEAEASGDKSVSAGPWDFKYVRSLRSLERFDDV--GGCVMLASPGMLQTGTSREL 371

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTR 397
              WA + +N V+ T     GT+A+ L  +  P  +   MSR    +G   +   +E+ +
Sbjct: 372 LERWAPNERNGVVMTGYSVEGTMAKQLLNE--PDQIHAVMSRAATGMGRTRMNGNDEEQK 429

Query: 398 L 398
           +
Sbjct: 430 I 430


>sp|Q8WZS6|YSH1_NEUCR Endoribonuclease ysh-1 OS=Neurospora crassa (strain ATCC 24698 /
           74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) GN=ysh-1
           PE=3 SV=1
          Length = 850

 Score =  134 bits (338), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 103/382 (26%), Positives = 184/382 (48%), Gaps = 30/382 (7%)

Query: 21  YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
           +++   G   ++D G +  +D     P       ST+D +L+SH    H  +LPY + + 
Sbjct: 40  HIIQYKGKTVMLDAGQHPAYDGLAALPFFDDFDLSTVDVLLISHFHIDHAASLPYVLAKT 99

Query: 79  GLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNY 135
                VF   +T+ +Y+  +        +        ++T +D    F  +  + Y+  +
Sbjct: 100 NFRGRVFMTHATKAIYKWLIQDSVRVGNTSSNPQSSLVYTEEDHLKTFPMIEAIDYNTTH 159

Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
            +S     I + P+ AGH+LG  ++ I   G  + +  DY+R +++HL    +   V+  
Sbjct: 160 TISS----IRITPYPAGHVLGAAMFLIEIAGLKIFFTGDYSREEDRHLISAKVPKGVKID 215

Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
           VLIT++   + +  PR +RE     +I+  L  GG VL+PV + GR  ELLLIL++YW +
Sbjct: 216 VLITESTYGIASHIPRPEREQALMKSITGILNRGGRVLMPVFALGRAQELLLILDEYWGK 275

Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF-------ETSRDNA-----FLL 300
           H+    YPIY+ + ++   +   ++++  M D+I + F       E+S D A     +  
Sbjct: 276 HAEYQKYPIYYASNLARKCMLVYQTYVGSMNDNIKRLFRERLAESESSGDGAGKGGPWDF 335

Query: 301 KHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
           + +  L     LD   D G  ++LAS   L+ G S ++   WA   KN V+ T     GT
Sbjct: 336 RFIRSL---KSLDRFEDVGGCVMLASPGMLQNGVSRELLERWAPSEKNGVIITGYSVEGT 392

Query: 360 LARMLQADPPPKAVKVTMSRRV 381
           +A+ L  +  P+ ++  MSR +
Sbjct: 393 MAKQLLQE--PEQIQAVMSRNI 412


>sp|Q6BMW3|YSH1_DEBHA Endoribonuclease YSH1 OS=Debaryomyces hansenii (strain ATCC 36239 /
           CBS 767 / JCM 1990 / NBRC 0083 / IGC 2968) GN=YSH1 PE=3
           SV=2
          Length = 815

 Score =  133 bits (334), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 99/341 (29%), Positives = 169/341 (49%), Gaps = 34/341 (9%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLS----- 104
           S +D +L+SH    H  +LPY M+    +  VF   +T+ +YR  LL+ + +  S     
Sbjct: 64  SKVDILLVSHFHLDHAASLPYVMQHTNFNGRVFMTHATKAIYRW-LLSDFVKVTSIGGGS 122

Query: 105 ---------RRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLL 155
                           +L+T DD+  +F  +  +    +YH + + +GI    + AGH+L
Sbjct: 123 DARLNNSDPNANTGSSNLYTDDDLMRSFDRIETI----DYHSTIELDGIRFTAYHAGHVL 178

Query: 156 GGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE 215
           G  ++ I   G  V++  DY+  +++HL    +   ++P +LIT++        PR ++E
Sbjct: 179 GACMYFIEIGGLKVLFTGDYSSEEDRHLQVAEVPP-IKPDILITESTFGTATHEPRLEKE 237

Query: 216 M-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA--EHSLNYPIYFLTYVSSSTI 272
               + I  TL  GG +L+PV + GR  ELLLILE+YW+  +   N  IY+ + ++   +
Sbjct: 238 TRMTNIIHSTLLKGGRILMPVFALGRAQELLLILEEYWSLNDDLQNINIYYASSLARKCM 297

Query: 273 DYVKSFLEWMGDSI----TKSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMA 327
              +++   M DSI    + +  + + N F  K +  + N   LD   D GP +V+AS  
Sbjct: 298 AVYQTYTNIMNDSIRLTTSATNSSKKQNPFQFKFIKSIKN---LDKFQDFGPCVVVASPG 354

Query: 328 SLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
            L+ G S ++   WA D KN V+ T     GT+A+ L  +P
Sbjct: 355 MLQNGVSRELLERWAPDPKNAVIMTGYSVEGTMAKDLLTEP 395


>sp|Q59P50|YSH1_CANAL Endoribonuclease YSH1 OS=Candida albicans (strain SC5314 / ATCC
           MYA-2876) GN=YSH1 PE=3 SV=1
          Length = 870

 Score =  133 bits (334), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 97/331 (29%), Positives = 169/331 (51%), Gaps = 23/331 (6%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVS 109
           S +D +L+SH    H  +LPY M+Q      VF   +T+ +YR  L+  + +  S     
Sbjct: 150 SKVDILLISHFHVDHSASLPYVMQQSNFRGKVFMTHATKAIYRW-LMQDFVRVTSIGNSR 208

Query: 110 EFD--------LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWK 161
             D        L+T DDI  +F  +  +    +YH + + +GI    + AGH+LG  ++ 
Sbjct: 209 SEDGGGGEGSNLYTDDDIMKSFDRIETI----DYHSTMEIDGIRFTAYHAGHVLGACMYF 264

Query: 162 ITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDA 220
           I   G  V++  DY+R + +HL+   +   ++P +LI+++        PR + E      
Sbjct: 265 IEIGGLKVLFTGDYSREENRHLHAAEVPP-LKPDILISESTFGTGTLEPRIELERKLTTH 323

Query: 221 ISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSF 278
           I  T+  GG VLLPV + G   ELLLIL++YW+++    N  +++ + ++   +   +++
Sbjct: 324 IHATIAKGGRVLLPVFALGNAQELLLILDEYWSQNEDLQNVNVFYASNLAKKCMAVYETY 383

Query: 279 LEWMGDSITKSFETS-RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
              M D I  S  +S + N F  K++  + + S+  +   GP +V+A+   L+AG S  +
Sbjct: 384 TGIMNDKIRLSSASSEKSNPFDFKYIKSIKDLSKFQDM--GPSVVVATPGMLQAGVSRQL 441

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADP 368
             +WA D KNLV+ T     GT+A+ L  +P
Sbjct: 442 LEKWAPDGKNLVILTGYSVEGTMAKELLKEP 472


>sp|Q8GUU3|CPS3B_ARATH Cleavage and polyadenylation specificity factor subunit 3-II
           OS=Arabidopsis thaliana GN=CPSF73-II PE=1 SV=2
          Length = 613

 Score =  132 bits (333), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 97/358 (27%), Positives = 167/358 (46%), Gaps = 20/358 (5%)

Query: 22  LVSIDGFNFLIDCGW-------NDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
           +V+I+G   + DCG        N + + SL+       + I  ++++H    H+GALPY 
Sbjct: 20  VVTINGKKIMFDCGMHMGCDDHNRYPNFSLISKSGDFDNAISCIIITHFHMDHVGALPYF 79

Query: 75  MKQLGLSAPVFSTEPVYRLGLLTM--YDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYS 132
            +  G + P++ + P   L  L +  Y + +  R+  E +LFT   I +  + V  +   
Sbjct: 80  TEVCGYNGPIYMSYPTKALSPLMLEDYRRVMVDRRGEE-ELFTTTHIANCMKKVIAIDLK 138

Query: 133 QNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFV 192
           Q   +    E + +  + AGH+LG  +         ++Y  DYN   ++HL    ++  +
Sbjct: 139 QTIQVD---EDLQIRAYYAGHVLGAVMVYAKMGDAAIVYTGDYNMTTDRHLGAAKIDR-L 194

Query: 193 RPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDY 251
           +  +LI+++  A   +  +  RE  F  A+ K +  GG  L+P  + GR  EL ++L+DY
Sbjct: 195 QLDLLISESTYATTIRGSKYPREREFLQAVHKCVAGGGKALIPSFALGRAQELCMLLDDY 254

Query: 252 WAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSE 311
           W   ++  PIYF + ++     Y K  + W   ++ +   T   N F  K+V        
Sbjct: 255 WERMNIKVPIYFSSGLTIQANMYYKMLISWTSQNVKEKHNTH--NPFDFKNVKDF--DRS 310

Query: 312 LDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPP 369
           L +AP GP ++ A+   L AGFS ++F  WA    NLV        GT+   L A  P
Sbjct: 311 LIHAP-GPCVLFATPGMLCAGFSLEVFKHWAPSPLNLVALPGYSVAGTVGHKLMAGKP 367


>sp|Q4WRC2|YSH1_ASPFU Endoribonuclease ysh1 OS=Neosartorya fumigata (strain ATCC MYA-4609
           / Af293 / CBS 101355 / FGSC A1100) GN=ysh1 PE=3 SV=1
          Length = 872

 Score =  131 bits (329), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 99/361 (27%), Positives = 170/361 (47%), Gaps = 19/361 (5%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+SH    H  ALPY + +      VF T     +    + D        S  D
Sbjct: 75  STVDILLISHFHVDHSSALPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVSNTASSSD 134

Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
             T    +    S   L  + +++ +     I + P  AGH+LG  ++ I+  G ++++ 
Sbjct: 135 QRTTLYTEHDHLSTLPLIETIDFNTTHTVNSIRITPFPAGHVLGAAMFLISIAGLNILFT 194

Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNV 231
            DY+R +++HL    +   ++  VLIT++   +   PPR +RE     +I+  L  GG V
Sbjct: 195 GDYSREEDRHLIPAEVPKGIKIDVLITESTFGISTNPPRLEREAALMKSITGILNRGGRV 254

Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
           L+PV + GR  ELLLIL++YW  H      PIY++   +   +   ++++  M D+I + 
Sbjct: 255 LMPVFALGRAQELLLILDEYWETHPELQKIPIYYIGNTARRCMVVYQTYIGAMNDNIKRL 314

Query: 290 F-------ETSRDNA-----FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
           F       E S D +     +  K V  L +    D+   G  ++LAS   L+ G S ++
Sbjct: 315 FRQRMAEAEASGDKSASAGPWDFKFVRSLRSLERFDDV--GGCVMLASPGMLQTGTSREL 372

Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTR 397
              WA + +N V+ T     GT+A+ L  +  P+ +   MSR    V    +A  +E+ +
Sbjct: 373 LERWAPNERNGVVMTGYSVEGTMAKQLLNE--PEQIPAVMSRSAGGVSRRGLAGTDEEQK 430

Query: 398 L 398
           +
Sbjct: 431 I 431


>sp|Q54SH0|INT9_DICDI Integrator complex subunit 9 homolog OS=Dictyostelium discoideum
           GN=ints9 PE=3 SV=1
          Length = 712

 Score = 88.6 bits (218), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 75/279 (26%), Positives = 114/279 (40%), Gaps = 52/279 (18%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLG------LLTMYDQY---- 102
           STID +L+S+   ++  ALP+  +       +++TEP  ++G      L+ M  QY    
Sbjct: 115 STIDMILISNYTNIY--ALPFITEYTNFQGKIYATEPTVQIGKLLLEELVQMDKQYSNSS 172

Query: 103 ----------------------------------LSRRQVSEFDLFTLDDIDSAFQSVTR 128
                                             L R      DL+   DI+ +F+ +  
Sbjct: 173 INNNNNNNNLSDCWQNIEILEKLNVHNVGMENENLYRDSYRWKDLYKKIDIEKSFEKIQS 232

Query: 129 LTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRKEKHLNGTV 187
           + +++    S K  G    P  +G+ LG   W I   G E V+Y  D +    ++     
Sbjct: 233 IRFNE----SIKHYGFECIPSSSGYGLGSANWVIESKGFERVVYISDSSLSLSRYPTPFQ 288

Query: 188 LESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLI 247
           L     P VLI    N   N PP Q        I  TL+ GG VL+P  S G +L+L   
Sbjct: 289 LSPIDNPDVLILSKINHYPNNPPDQMLSELCSNIGSTLQQGGTVLIPSYSCGIILDLFEH 348

Query: 248 LEDYWAEHSLNY-PIYFLTYVSSSTIDYVKSFLEWMGDS 285
           L DY  +  L Y PIYF++ VS + + Y   + EW+  S
Sbjct: 349 LADYLNKVGLPYVPIYFVSSVSKAVLSYADIYSEWLNKS 387


>sp|Q58633|Y1236_METJA Uncharacterized protein MJ1236 OS=Methanocaldococcus jannaschii
           (strain ATCC 43067 / DSM 2661 / JAL-1 / JCM 10045 / NBRC
           100440) GN=MJ1236 PE=4 SV=1
          Length = 634

 Score = 87.8 bits (216), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 91/373 (24%), Positives = 154/373 (41%), Gaps = 18/373 (4%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWN----DHFDPSLLQPLSKVASTIDAVLL 60
           ++V+ L G          V       LIDCG N    D   P    P   +   +DAV++
Sbjct: 180 IRVSFLGGAREVGRSCLYVQTPDTRVLIDCGINVACEDKAFPHFDAPEFSIED-LDAVIV 238

Query: 61  SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
           +H    H G +P  + + G   PV+ T P   L  L   D     ++  +   +T  DI 
Sbjct: 239 THAHLDHCGFIP-GLFRYGYDGPVYCTRPTRDLMTLLQKDYLEIAKKEGKEVPYTSKDIK 297

Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTV--WKITKDGEDVIYAVDYNRR 178
           +  +    + Y     +S     I +  H AGH+LG  +    I +   ++ Y  D    
Sbjct: 298 TCVKHTIPIDYGVTTDIS---PTIKLTLHNAGHVLGSAIAHLHIGEGLYNLAYTGDIKFE 354

Query: 179 KEKHLNGTVLESFVRPAVLITDAYNALHN--QPPRQQREMFQDAISKTLRAGGNVLLPVD 236
             + L   V +      ++I   Y A  +      +        +S+T   GG VL+PV 
Sbjct: 355 TSRLLEPAVCQFPRLETLIIESTYGAYDDVLPEREEAERELLRVVSETTDRGGKVLIPVF 414

Query: 237 SAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN 296
             GR  EL+L+LE+ + +   N P+Y    +  +T  +  ++ E++   + +      DN
Sbjct: 415 GVGRAQELMLVLEEGYNQGIFNAPVYLDGMIWEATAIHT-AYPEYLSKEMRQKIFHEGDN 473

Query: 297 AFL---LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
            FL    K V     + ++ ++ D P ++LA+   L  G S +     A D KN ++F  
Sbjct: 474 PFLSEVFKRVGSTNERRKVIDS-DEPCVILATSGMLTGGPSVEYLKHLAPDEKNAIIFVG 532

Query: 354 RGQFGTLARMLQA 366
               GTL R +Q+
Sbjct: 533 YQAEGTLGRKVQS 545


>sp|Q5SLP1|RNSE_THET8 Ribonuclease TTHA0252 OS=Thermus thermophilus (strain HB8 / ATCC
           27634 / DSM 579) GN=TTHA0252 PE=1 SV=1
          Length = 431

 Score = 87.8 bits (216), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 96/384 (25%), Positives = 157/384 (40%), Gaps = 22/384 (5%)

Query: 5   VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG-WNDHFDPSLLQPLSKVASTIDAVLLSHP 63
           +++ P          ++L+   G   L+DCG +    +     P       +DAVLL+H 
Sbjct: 1   MRIVPFGAAREVTGSAHLLLAGGRRVLLDCGMFQGKEEARNHAPFGFDPKEVDAVLLTHA 60

Query: 64  DTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAF 123
              H+G LP   ++ G   PV++T     L  + + D      +V +   F  +D++ A 
Sbjct: 61  HLDHVGRLPKLFRE-GYRGPVYATRATVLLMEIVLEDAL----KVMDEPFFGPEDVEEAL 115

Query: 124 QSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHL 183
             +  L Y +   L      + +A   AGHL G        +G  ++Y+ D   R++  L
Sbjct: 116 GHLRPLEYGEWLRLGA----LSLAFGQAGHLPGSAFVVAQGEGRTLVYSGDLGNREKDVL 171

Query: 184 NGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLE 243
               L       VL    Y    ++P R+    F + + KTL  GG VL+P  +  R  E
Sbjct: 172 PDPSLPPLAD-LVLAEGTYGDRPHRPYRETVREFLEILEKTLSQGGKVLIPTFAVERAQE 230

Query: 244 LLLILEDYWAEHSL-NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL--- 299
           +L +L  Y   H L   PIY  + ++   +      + +  + +   F   + N F    
Sbjct: 231 ILYVL--YTHGHRLPRAPIYLDSPMAGRVLSLYPRLVRYFSEEVQAHFLQGK-NPFRPAG 287

Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
           L+ V        L+ AP GP +VLA    L  G          SD +N ++F      G 
Sbjct: 288 LEVVEHTEASKALNRAP-GPMVVLAGSGMLAGGRILHHLKHGLSDPRNALVFVGYQPQGG 346

Query: 360 LARMLQADPPPKAVKVTMSRRVPL 383
           L   + A PP  AV++ +   VPL
Sbjct: 347 LGAEIIARPP--AVRI-LGEEVPL 367


>sp|A7SBF0|INT9_NEMVE Integrator complex subunit 9 homolog OS=Nematostella vectensis
           GN=ints9 PE=3 SV=1
          Length = 660

 Score = 87.4 bits (215), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 111/408 (27%), Positives = 176/408 (43%), Gaps = 66/408 (16%)

Query: 1   MGTSVQVTPLSGVFNENPLSYLVS--------IDGF---NFLIDCGWNDHFD--PSLLQP 47
           M T  Q TPLS V NE   S L S        I+GF   N L + G     D  P +  P
Sbjct: 32  MSTVNQFTPLSLVNNEK-FSQLKSWSSRELQEIEGFTAQNNLKEAGGRLFIDAEPEVCPP 90

Query: 48  LSKVA--STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLG------LLTMY 99
            + +   S +D +L+S  +  H+ ALP+  +  G +  +++TEP  ++G      L+T  
Sbjct: 91  ETGLIDFSMVDVILIS--NYHHMLALPFITEYSGFNGKIYATEPTIQIGRDLMLELVTFA 148

Query: 100 DQYLSRRQVSEFD-----------------------LFTLDDIDSAFQSVTRLTYSQNYH 136
           ++   RR  + +                        L++  D+ +    +  ++YS+   
Sbjct: 149 ERVPKRRNGNMWKNDNVIRCLPAPLNELANVKSWRVLYSKHDVKACISKIQAVSYSEKLD 208

Query: 137 LSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKH---LNGTVLESFVR 193
           L G    + ++ H +G  LG + W +  + E + Y +  +     H   LN TVL++   
Sbjct: 209 LCGI---LQLSAHSSGFCLGSSNWMLESEYEKISY-LSPSSSFTTHPLPLNQTVLKN--S 262

Query: 194 PAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 253
             ++IT    A  + P     E F   ++ TLRAGGNVL+P   +G + +L   L  Y  
Sbjct: 263 DVLIITGVTEAPIDNPDAMLGE-FCTHLASTLRAGGNVLVPCYPSGVLYDLFECLYTYLD 321

Query: 254 EHSLNY-PIYFLTYVSSSTIDYVKSFLEWMGDSI-TKSF--ETSRDNAFLLKHVTLLINK 309
              L   PIYF++ V+ S++ Y   + EW+  S  TK +  E    +A LLK   L +  
Sbjct: 322 NAKLGMVPIYFISPVADSSLAYSNIYGEWLCQSKQTKVYLPEPPFPHAELLKEARLKV-F 380

Query: 310 SELDNAPDG----PKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
           S L N        P +V     SL  G +      W     N V+FTE
Sbjct: 381 SNLHNGFSSSFKTPCVVFTGHPSLRYGDAVHFMEIWGKSGNNTVIFTE 428


>sp|Q57626|Y162_METJA Uncharacterized protein MJ0162 OS=Methanocaldococcus jannaschii
           (strain ATCC 43067 / DSM 2661 / JAL-1 / JCM 10045 / NBRC
           100440) GN=MJ0162 PE=3 SV=1
          Length = 421

 Score = 84.0 bits (206), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 89/343 (25%), Positives = 158/343 (46%), Gaps = 39/343 (11%)

Query: 31  LIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALP-YAMKQLGLSAPVFSTEP 89
           L+DCG      P   +        +DAV++SH    H GA+P Y  K+      ++ T P
Sbjct: 28  LLDCG----MSPDTGEIPKVDDKAVDAVIVSHAHLDHCGAIPFYKFKK------IYCTHP 77

Query: 90  VYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPH 149
              L  +T  D     +   E      +DI  A +++  L Y +   ++   E I    +
Sbjct: 78  TADLMFITWRDTLNLTKAYKE------EDIQHAMENIECLNYYEERQIT---ENIKFKFY 128

Query: 150 VAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNA-LHNQ 208
            AGH+LG     +  DG+ ++Y  D N    + L     +      ++I   Y + L  +
Sbjct: 129 NAGHILGSASIYLEVDGKKILYTGDINEGVSRTLLPADTDIDEIDVLIIESTYGSPLDIK 188

Query: 209 PPRQ--QREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL-NYPIYFLT 265
           P R+  +R++ ++ IS+T+  GG V++PV + GR  E+LLI+ +Y     L + PIY   
Sbjct: 189 PARKTLERQLIEE-ISETIENGGKVIIPVFAIGRAQEILLIINNYIRSGKLRDVPIYTDG 247

Query: 266 YVSSSTIDYVKSFLEWMGDSITKSFETSRDNAF-LLKHV--TLLINKSELDNAPDGPKLV 322
            +  +T  Y+ S++ W+   I K+   +R N F  +K    +L+ NK         P ++
Sbjct: 248 SLIHATAVYM-SYINWLNPKI-KNMVENRINPFGEIKKADESLVFNKE--------PCII 297

Query: 323 LASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQ 365
           +++   ++ G     +++   D KN ++ T     GTL R L+
Sbjct: 298 VSTSGMVQGGPVLK-YLKLLKDPKNKLILTGYQAEGTLGRELE 339


>sp|Q2KJA6|INT9_BOVIN Integrator complex subunit 9 OS=Bos taurus GN=INTS9 PE=2 SV=1
          Length = 658

 Score = 82.4 bits (202), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 88/339 (25%), Positives = 142/339 (41%), Gaps = 46/339 (13%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD--QYLSRR---- 106
           ST+D +L+S+   +   ALPY  +  G +  V++TEP  ++G L M +   ++ R     
Sbjct: 94  STVDVILISNYHCMM--ALPYITEHTGFTGTVYATEPTVQIGRLLMEELVNFIERVPKAQ 151

Query: 107 ----------------------QVSEFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEG 143
                                 +VS +   +T+ +++SA   +  + YSQ   L G    
Sbjct: 152 SASLWKNKDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQKIELFG---A 208

Query: 144 IVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYN 203
           + V P  +G+ LG + W I    E V Y V  +     H       S     VLI     
Sbjct: 209 VQVTPLSSGYALGSSNWIIQSHYEKVSY-VSGSSLLTTHPQPMDQASLKNSDVLILTGLT 267

Query: 204 ALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL-NYPIY 262
            +    P      F   ++ T+R GGNVL+P   +G + +LL  L  Y     L + P Y
Sbjct: 268 QIPTANPDSMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYIDSAGLSSIPFY 327

Query: 263 FLTYVSSSTIDYVKSFLEWMG-DSITKSF--ETSRDNAFL-----LKHVTLLINKSELDN 314
           F++ V++S++++ + F EW+  +  TK +  E    +A L     LKH   +    +  N
Sbjct: 328 FISPVANSSLEFSQIFAEWLCHNKQTKVYLPEPPFPHAELIQTNKLKHYPSI--HGDFSN 385

Query: 315 APDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
               P +V     SL  G        W     N V+FTE
Sbjct: 386 DFRQPCVVFTGHPSLRFGDVVHFMELWGKSSLNTVIFTE 424


>sp|Q5ZKK2|INT9_CHICK Integrator complex subunit 9 OS=Gallus gallus GN=INTS9 PE=2 SV=1
          Length = 658

 Score = 82.4 bits (202), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 86/339 (25%), Positives = 139/339 (41%), Gaps = 46/339 (13%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
           ST+D +L+S+   +   ALPY  +  G +  V++TEP  ++G L M +   S  +V +  
Sbjct: 94  STVDVILISNYHCMM--ALPYITEYTGFTGTVYATEPTVQIGRLLMEELVNSIERVPKAQ 151

Query: 113 -----------------------------LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEG 143
                                         +T+ ++++A   +  + YSQ   L G    
Sbjct: 152 SASTWKNKEVQRLLPAPLKDAVEVSMWRKCYTMPEVNAALSKIQLVGYSQKIELFG---A 208

Query: 144 IVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYN 203
           + V P  +G+ LG + W I    E V Y V  +     H       S     VLI     
Sbjct: 209 VQVTPLSSGYALGSSNWIIQSHYEKVSY-VSGSSLLTTHPQPMDQASLKNSDVLILTGLT 267

Query: 204 ALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL-NYPIY 262
            +    P      F   ++ T+R GGNVL+P   +G + +LL  L  Y     L N P Y
Sbjct: 268 QIPTANPDGMVGEFCSNLAMTVRNGGNVLVPCYPSGVIYDLLECLYQYIDSAGLSNVPFY 327

Query: 263 FLTYVSSSTIDYVKSFLEWMG-DSITKSF--ETSRDNAFL-----LKHVTLLINKSELDN 314
           F++ V++S++++ + F EW+  +  TK +  E    +A L     LKH   +    +  N
Sbjct: 328 FISPVANSSLEFSQIFAEWLCHNKQTKVYLPEPPFPHAELIQTNKLKHYPSI--HGDFSN 385

Query: 315 APDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
               P ++     SL  G        W     N V+FTE
Sbjct: 386 DFKQPCVIFTGHPSLRFGDVVHFMELWGKSSLNTVIFTE 424


>sp|Q8K114|INT9_MOUSE Integrator complex subunit 9 OS=Mus musculus GN=Ints9 PE=2 SV=1
          Length = 658

 Score = 82.0 bits (201), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 85/337 (25%), Positives = 143/337 (42%), Gaps = 42/337 (12%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD--QYLSRR---- 106
           ST+D +L+S+   +   ALPY  +  G +  V++TEP  ++G L M +   ++ R     
Sbjct: 94  STVDVILISNYHCMM--ALPYITEHTGFTGTVYATEPTMQIGRLLMEELVNFIERVPKAQ 151

Query: 107 ----------------------QVSEFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEG 143
                                 +VS +   +T+ +++SA   +  + YSQ   L G    
Sbjct: 152 SASLWKNKDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQKIELFG---A 208

Query: 144 IVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYN 203
           + V P  +G+ LG + W I    E V Y V  +     H       S     VLI     
Sbjct: 209 VQVTPLSSGYALGSSNWIIQSHYEKVSY-VSGSSLLTTHPQPMDQASLKNSDVLILTGLT 267

Query: 204 ALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL-NYPIY 262
            +    P      F   ++ T+R GGNVL+P   +G + +LL  L  Y     L N P Y
Sbjct: 268 QIPTANPDGMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYIDSAGLSNIPFY 327

Query: 263 FLTYVSSSTIDYVKSFLEWMG-DSITKSF--ETSRDNAFLLKHVTLLINKS---ELDNAP 316
           F++ V++S++++ + F EW+  +  +K +  E    +A L++   L   +S   +  N  
Sbjct: 328 FISPVANSSLEFSQIFAEWLCHNKQSKVYLPEPPFPHAELIQTNKLKHYRSIHGDFSNDF 387

Query: 317 DGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
             P ++     SL  G        W     N ++FTE
Sbjct: 388 RQPCVLFTGHPSLRFGDVVHFMELWGKSSLNTIIFTE 424


>sp|Q6DFF4|INT9_XENLA Integrator complex subunit 9 OS=Xenopus laevis GN=ints9 PE=2 SV=1
          Length = 658

 Score = 81.3 bits (199), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 83/345 (24%), Positives = 142/345 (41%), Gaps = 58/345 (16%)

Query: 53  STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD--QYLSRRQVSE 110
           ST+D +L+S+   +   ALPY  ++ G +  V++TEP  ++G L M +   ++ R   ++
Sbjct: 94  STVDVILISNYHCMM--ALPYITERTGFTGTVYATEPTVQIGRLLMEELVNFIERVPKAQ 151

Query: 111 ---------------------FDLFT------LDDIDSAFQSVTRLTYSQNYHLSGKGEG 143
                                 ++FT      + ++++A   +  + YSQ   L G    
Sbjct: 152 SATVWKHKDVQRLLPAPLKDAVEVFTWKKCYSMQEVNAALSKIQLVGYSQKIELFGV--- 208

Query: 144 IVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYN 203
           + V P  +G+ LG + W I    E V Y V  +     H       S     VLI     
Sbjct: 209 VQVTPLSSGYALGSSNWVIQSHYEKVSY-VSGSSLLTTHPQPMDQTSLKNSDVLILTGLT 267

Query: 204 ALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL-NYPIY 262
            +    P      F   ++ T+R+GGNVL+P   +G + +LL  L  Y     L N P Y
Sbjct: 268 QIPTANPDGMVGEFCSNLAMTIRSGGNVLVPCYPSGVIYDLLECLYQYIDSAGLSNVPFY 327

Query: 263 FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTL----LINKSELDNAPD- 317
           F++ V++S++++ + F EW+          ++ N   L         LI  ++L + P+ 
Sbjct: 328 FISPVANSSLEFSQIFAEWLCH--------NKQNKVYLPEPPFPHAELIQSNKLKHYPNI 379

Query: 318 ---------GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
                     P +V     +L  G        W     N V+FTE
Sbjct: 380 HGDFSNDFKQPCVVFTGHPTLRFGDVVHFMELWGKSSLNTVIFTE 424


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.317    0.136    0.398 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 287,914,480
Number of Sequences: 539616
Number of extensions: 12635123
Number of successful extensions: 35233
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 50
Number of HSP's successfully gapped in prelim test: 25
Number of HSP's that attempted gapping in prelim test: 34915
Number of HSP's gapped (non-prelim): 129
length of query: 739
length of database: 191,569,459
effective HSP length: 125
effective length of query: 614
effective length of database: 124,117,459
effective search space: 76208119826
effective search space used: 76208119826
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 65 (29.6 bits)