BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 004656
(739 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9LKF9|CPSF2_ARATH Cleavage and polyadenylation specificity factor subunit 2
OS=Arabidopsis thaliana GN=CPSF100 PE=1 SV=2
Length = 739
Score = 1231 bits (3185), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 592/743 (79%), Positives = 669/743 (90%), Gaps = 8/743 (1%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
MGTSVQVTPL GV+NENPLSYLVSIDGFNFLIDCGWND FD SLL+PLS+VASTIDAVLL
Sbjct: 1 MGTSVQVTPLCGVYNENPLSYLVSIDGFNFLIDCGWNDLFDTSLLEPLSRVASTIDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
SHPDTLH+GALPYAMKQLGLSAPV++TEPV+RLGLLTMYDQ+LSR+QVS+FDLFTLDDID
Sbjct: 61 SHPDTLHIGALPYAMKQLGLSAPVYATEPVHRLGLLTMYDQFLSRKQVSDFDLFTLDDID 120
Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
SAFQ+V RLTYSQNYHLSGKGEGIV+APHVAGH+LGG++W+ITKDGEDVIYAVDYN RKE
Sbjct: 121 SAFQNVIRLTYSQNYHLSGKGEGIVIAPHVAGHMLGGSIWRITKDGEDVIYAVDYNHRKE 180
Query: 181 KHLNGTVLESFVRPAVLITDAYNALH-NQPPRQQREM-FQDAISKTLRAGGNVLLPVDSA 238
+HLNGTVL+SFVRPAVLITDAY+AL+ NQ RQQR+ F D ISK L GGNVLLPVD+A
Sbjct: 181 RHLNGTVLQSFVRPAVLITDAYHALYTNQTARQQRDKEFLDTISKHLEVGGNVLLPVDTA 240
Query: 239 GRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAF 298
GRVLELLLILE +W++ ++PIYFLTYVSSSTIDYVKSFLEWM DSI+KSFETSRDNAF
Sbjct: 241 GRVLELLLILEQHWSQRGFSFPIYFLTYVSSSTIDYVKSFLEWMSDSISKSFETSRDNAF 300
Query: 299 LLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFG 358
LL+HVTLLINK++LDNAP GPK+VLASMASLEAGF+ +IFVEWA+D +NLVLFTE GQFG
Sbjct: 301 LLRHVTLLINKTDLDNAPPGPKVVLASMASLEAGFAREIFVEWANDPRNLVLFTETGQFG 360
Query: 359 TLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASL 418
TLARMLQ+ PPPK VKVTMS+RVPL GEELIAYEEEQ RLK+EEAL+ASLVKEEE+KAS
Sbjct: 361 TLARMLQSAPPPKFVKVTMSKRVPLAGEELIAYEEEQNRLKREEALRASLVKEEETKASH 420
Query: 419 GPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEW 478
G D+N S +PM+ID + DV+ HG Y+DILIDGFVPPS+SVAPMFP+Y+N SEW
Sbjct: 421 GSDDN-SSEPMIIDTKTTH---DVIGSHGPAYKDILIDGFVPPSSSVAPMFPYYDNTSEW 476
Query: 479 DDFGEVINPDDYIIKDEDMDQAAMHIGGD-DGKLDEGSASLILDAKPSKVVSNELTVQVK 537
DDFGE+INPDDY+IKDEDMD+ AMH GGD DG+LDE +ASL+LD +PSKV+SNEL V V
Sbjct: 477 DDFGEIINPDDYVIKDEDMDRGAMHNGGDVDGRLDEATASLMLDTRPSKVMSNELIVTVS 536
Query: 538 CLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCLKHVCPHVYTPQIE 597
C L+ +DYEGR+DGRSIK++++HV+PLKLVLVH AEATEHLKQHCL ++CPHVY PQIE
Sbjct: 537 CSLVKMDYEGRSDGRSIKSMIAHVSPLKLVLVHAIAEATEHLKQHCLNNICPHVYAPQIE 596
Query: 598 ETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPP 657
ET+DVTSDLCAYKVQLSEKLMSNV+FKKLGD E+AWVD+EVGKTE M SLLP+ A P
Sbjct: 597 ETVDVTSDLCAYKVQLSEKLMSNVIFKKLGDSEVAWVDSEVGKTERDMRSLLPMPGAASP 656
Query: 658 HKSVLVGDLKMADLKPFLSSKGIQVEFA-GGALRCGEYVTIRKVGPAGQKGGGSGTQQIV 716
HK VLVGDLK+AD K FLSSKG+QVEFA GGALRCGEYVT+RKVGP GQKGG SG QQI+
Sbjct: 657 HKPVLVGDLKIADFKQFLSSKGVQVEFAGGGALRCGEYVTLRKVGPTGQKGGASGPQQIL 716
Query: 717 IEGPLCEDYYKIRAYLYSQFYLL 739
IEGPLCEDYYKIR YLYSQFYLL
Sbjct: 717 IEGPLCEDYYKIRDYLYSQFYLL 739
>sp|Q652P4|CPSF2_ORYSJ Cleavage and polyadenylation specificity factor subunit 2 OS=Oryza
sativa subsp. japonica GN=Os09g0569400 PE=2 SV=1
Length = 738
Score = 1118 bits (2893), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 548/742 (73%), Positives = 634/742 (85%), Gaps = 7/742 (0%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
MGTSVQVTPLSG + E PL YL+++DGF FL+DCGW D DPS LQPL+KVA TIDAVLL
Sbjct: 1 MGTSVQVTPLSGAYGEGPLCYLLAVDGFRFLLDCGWTDLCDPSHLQPLAKVAPTIDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
SH DT+HLGALPYAMK LGLSAPV++TEPV+RLG+LT+YD ++SRRQVS+FDLFTLDDID
Sbjct: 61 SHADTMHLGALPYAMKHLGLSAPVYATEPVFRLGILTLYDYFISRRQVSDFDLFTLDDID 120
Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
+AFQ+V RL YSQN+ L+ KGEGIV+APHVAGH LGGTVWKITKDGEDV+YAVD+N RKE
Sbjct: 121 AAFQNVVRLKYSQNHLLNDKGEGIVIAPHVAGHDLGGTVWKITKDGEDVVYAVDFNHRKE 180
Query: 181 KHLNGTVLESFVRPAVLITDAYNALHNQP-PRQQREMFQDAISKTLRAGGNVLLPVDSAG 239
+HLNGT L SFVRPAVLITDAYNAL+N RQQ + F DA+ K L GG+VLLP+D+AG
Sbjct: 181 RHLNGTALGSFVRPAVLITDAYNALNNHVYKRQQDQDFIDALVKVLTGGGSVLLPIDTAG 240
Query: 240 RVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL 299
RVLE+LLILE YWA+ L YPIYFLT VS+ST+DYVKSFLEWM DSI+KSFE +RDNAFL
Sbjct: 241 RVLEILLILEQYWAQRHLIYPIYFLTNVSTSTVDYVKSFLEWMNDSISKSFEHTRDNAFL 300
Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
LK VT +INK EL+ D PK+VLASMASLE GFSHDIFV+ A++ KNLVLFTE+GQFGT
Sbjct: 301 LKCVTQIINKDELEKLGDAPKVVLASMASLEVGFSHDIFVDMANEAKNLVLFTEKGQFGT 360
Query: 360 LARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLG 419
LARMLQ DPPPKAVKVTMS+R+PLVG+EL AYEEEQ R+KKEEALKASL KEEE KASLG
Sbjct: 361 LARMLQVDPPPKAVKVTMSKRIPLVGDELKAYEEEQERIKKEEALKASLNKEEEKKASLG 420
Query: 420 PDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWD 479
N + DPMVIDA+ + ++ GG DILIDGFVPPS+SVAPMFPF+EN SEWD
Sbjct: 421 -SNAKASDPMVIDASTSRKPSNAGSKFGGNV-DILIDGFVPPSSSVAPMFPFFENTSEWD 478
Query: 480 DFGEVINPDDYIIKDEDMDQAAMHIGGD--DGKLDEGSASLILDAKPSKVVSNELTVQVK 537
DFGEVINP+DY++K E+MD M GD D LDEGSA L+LD+ PSKV+SNE+TVQVK
Sbjct: 479 DFGEVINPEDYLMKQEEMDNTLMPGAGDGMDSMLDEGSARLLLDSTPSKVISNEMTVQVK 538
Query: 538 CLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCLKHVCPHVYTPQIE 597
C L ++D+EGR+DGRS+K++++HVAPLKLVLVHGSAEATEHLK HC K+ HVY PQIE
Sbjct: 539 CSLAYMDFEGRSDGRSVKSVIAHVAPLKLVLVHGSAEATEHLKMHCSKNSDLHVYAPQIE 598
Query: 598 ETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPP 657
ETIDVTSDLCAYKVQLSEKLMSNV+ KKLG++EIAWVDAEVGKT++ + L P STPA
Sbjct: 599 ETIDVTSDLCAYKVQLSEKLMSNVISKKLGEHEIAWVDAEVGKTDDKLTLLPPSSTPA-A 657
Query: 658 HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVI 717
HKSVLVGDLK+AD K FL++KG+QVEFAGGALRCGEY+T+RK+G AGQK G +G+QQIVI
Sbjct: 658 HKSVLVGDLKLADFKQFLANKGLQVEFAGGALRCGEYITLRKIGDAGQK-GSTGSQQIVI 716
Query: 718 EGPLCEDYYKIRAYLYSQFYLL 739
EGPLCEDYYKIR LYSQFYLL
Sbjct: 717 EGPLCEDYYKIRELLYSQFYLL 738
>sp|Q9V3D6|CPSF2_DROME Probable cleavage and polyadenylation specificity factor subunit 2
OS=Drosophila melanogaster GN=Cpsf100 PE=1 SV=1
Length = 756
Score = 529 bits (1362), Expect = e-149, Method: Compositional matrix adjust.
Identities = 295/786 (37%), Positives = 453/786 (57%), Gaps = 77/786 (9%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++ +SG +E+P Y++ ID L+DCGW++ FD + ++ L + T+DAVLL
Sbjct: 1 MTSIIKLHTISGAMDESPPCYILQIDDVRILLDCGWDEKFDANFIKELKRQVHTLDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
SHPD HLGALPY + +LGL+ P+++T PV+++G + MYD Y+S + +FDLF+LDD+D
Sbjct: 61 SHPDAYHLGALPYLVGKLGLNCPIYATIPVFKMGQMFMYDLYMSHFNMGDFDLFSLDDVD 120
Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
+AF+ +T+L Y+Q L KG GI + P AGH++GGT+WKI K G ED++YA D+N +K
Sbjct: 121 TAFEKITQLKYNQTVSLKDKGYGISITPLNAGHMIGGTIWKIVKVGEEDIVYATDFNHKK 180
Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
E+HL+G L+ RP++LITDAYNA + Q R+ R E I +T+R GNVL+ VD+A
Sbjct: 181 ERHLSGCELDRLQRPSLLITDAYNAQYQQARRRARDEKLMTNILQTVRNNGNVLIAVDTA 240
Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
GRVLEL +L+ W + Y + L VS + I++ KS +EWM D +TK+FE +R+
Sbjct: 241 GRVLELAHMLDQLWKNKESGLMAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARN 300
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
N F KH+ L + +++ P GPK+VLAS LE+GF+ D+FV+WAS+ N ++ T R
Sbjct: 301 NPFQFKHIQLCHSLADVYKLPAGPKVVLASTPDLESGFTRDLFVQWASNANNSIILTTRT 360
Query: 356 QFGTLA-RMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEES 414
GTLA +++ P K +++ + RRV L G EL Y Q E L +VK
Sbjct: 361 SPGTLAMELVENCAPGKQIELDVRRRVDLEGAELEEYLRTQG-----EKLNPLIVK---- 411
Query: 415 KASLGPDNNLSGDPMV---IDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPF 471
PD I+ + D+V GR+ GF + MFP+
Sbjct: 412 -----PDVEEESSSESEDDIEMSVITGKHDIVVRPEGRHH----SGFFKSNKRHHVMFPY 462
Query: 472 YENNSEWDDFGEVINPDDYIIKD--------------EDMDQAAMHIGGD---DGKLDEG 514
+E + D++GE+IN DDY I D E++ + IG + +G + +
Sbjct: 463 HEEKVKCDEYGEIINLDDYRIADATGYEFVPMEEQNKENVKKEEPGIGAEQQANGGIVDN 522
Query: 515 SASLILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAE 574
L+ KP+K++S T++V + ID+EGR+DG S+ ILS + P +++++HG+AE
Sbjct: 523 DVQLL--EKPTKLISQRKTIEVNAQVQRIDFEGRSDGESMLKILSQLRPRRVIVIHGTAE 580
Query: 575 ATEHLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWV 634
T+ + +HC ++V V+TPQ E IDVTS++ Y+V+L+E L+S + F+K D E+AWV
Sbjct: 581 GTQVVARHCEQNVGARVFTPQKGEIIDVTSEIHIYQVRLTEGLVSQLQFQKGKDAEVAWV 640
Query: 635 DAEVGK-------------------TENGMLSLLPIS-TPAPPHKSVLVGDLKMADLKPF 674
D +G E L+L ++ P H SVL+ +LK++D K
Sbjct: 641 DGRLGMRVKAIEAPMDVTVEQDASVQEGKTLTLETLADDEIPIHNSVLINELKLSDFKQT 700
Query: 675 LSSKGIQVEFAGGALRCGE-YVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLY 733
L I EF+GG L C + +R+V ++ +EG L E+YYKIR LY
Sbjct: 701 LMRNNINSEFSGGVLWCSNGTLALRRVDAG----------KVAMEGCLSEEYYKIRELLY 750
Query: 734 SQFYLL 739
Q+ ++
Sbjct: 751 EQYAIV 756
>sp|O35218|CPSF2_MOUSE Cleavage and polyadenylation specificity factor subunit 2 OS=Mus
musculus GN=Cpsf2 PE=1 SV=1
Length = 782
Score = 522 bits (1345), Expect = e-147, Method: Compositional matrix adjust.
Identities = 293/815 (35%), Positives = 454/815 (55%), Gaps = 109/815 (13%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++T LSGV E+ L YL+ +D F FL+DCGW++HF ++ L K IDAVLL
Sbjct: 1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSVDIIDSLRKHVHQIDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
SHPD LHLGALP+A+ +LGL+ +++T PVY++G + MYD Y SR +F LFTLDD+D
Sbjct: 61 SHPDPLHLGALPFAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120
Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
+AF + +L +SQ +L GKG G+ + P AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180
Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
E HLNG LE RP++LITD++NA + QP R+QR E + +TLR GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240
Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
GRVLEL +L+ W +Y L VS + +++ KS +EWM D + + FE R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
N F +H++L S+L P PK+VLAS LE GFS D+F++W D KN ++ T R
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
GTLAR L +P K ++ + +RV L G+EL Y E++ K+
Sbjct: 360 TPGTLARFLIDNPTEKVTEIELRKRVKLEGKELEEYVEKEKLKKEAAKKLEQ-------- 411
Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
S + + ++ ++ DV +P + + D+++ G F + P
Sbjct: 412 ---------SKEADIDSSDESDVEEDVDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYP 462
Query: 468 MFPFYENNSEWDDFGEVINPDDYIIKD---EDMDQAAMHIGGDDGKLDEGSASLILDAKP 524
MFP E +WD++GE+I P+D+++ + + +++ + G +G E L P
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNG---EEPMDQDLSDVP 519
Query: 525 SKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCL 584
+K VS ++++K + +IDYEGR+DG SIK I++ + P +L++VHG EA++ L + C
Sbjct: 520 TKCVSATESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAECCR 579
Query: 585 ----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA---- 636
K + VY P++ ET+D TS+ Y+V+L + L+S++ F K D E+AW+D
Sbjct: 580 AFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDM 637
Query: 637 EVGKTENGML-----------------------------------------------SLL 649
V K + G++ ++
Sbjct: 638 RVSKVDTGVILEEGELKDDGEDSEMQVDAPSDSSAMAQQKAMKSLFGEDEKELGEETEII 697
Query: 650 PISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAG 704
P P PP H+SV + + +++D K L +GIQ EF GG L C V +R+
Sbjct: 698 PTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR----- 752
Query: 705 QKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 739
+ T +I +EG LC+D+Y+IR LY Q+ ++
Sbjct: 753 -----TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
>sp|Q10568|CPSF2_BOVIN Cleavage and polyadenylation specificity factor subunit 2 OS=Bos
taurus GN=CPSF2 PE=1 SV=1
Length = 782
Score = 521 bits (1341), Expect = e-146, Method: Compositional matrix adjust.
Identities = 292/817 (35%), Positives = 455/817 (55%), Gaps = 113/817 (13%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++T LSGV E+ L YL+ +D F FL+DCGW++HF ++ L K IDAVLL
Sbjct: 1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
SHPD LHLGALPYA+ +LGL+ +++T PVY++G + MYD Y SR +F LFTLDD+D
Sbjct: 61 SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120
Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
+AF + +L +SQ +L GKG G+ + P AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180
Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
E HLNG LE RP++LITD++NA + QP R+QR E + +TLR GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240
Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
GRVLEL +L+ W +Y L VS + +++ KS +EWM D + + FE R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
N F +H++L S+L P PK+VLAS LE GFS D+F++W D KN ++ T R
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
GTLAR L +P K ++ + +RV L G+EL Y E++ K+
Sbjct: 360 TPGTLARFLIDNPSEKVTEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------- 411
Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
S + + ++ ++A D+ +P + + D+++ G F + P
Sbjct: 412 ---------SKEADIDSSDESDAEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYP 462
Query: 468 MFPFYENNSEWDDFGEVINPDDYII-----KDEDMDQAAMHIGGDDGKLDEGSASLILDA 522
MFP E +WD++GE+I P+D+++ +E+ + + D +D+ + +
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDV---- 518
Query: 523 KPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQH 582
P+K +S ++++K + +IDYEGR+DG SIK I++ + P +L++VHG EA++ L +
Sbjct: 519 -PTKCISTTESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAEC 577
Query: 583 CL----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA-- 636
C K + VY P++ ET+D TS+ Y+V+L + L+S++ F K D E+AW+D
Sbjct: 578 CRAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVL 635
Query: 637 --EVGKTENGML-----------------------------------------------S 647
V K + G++
Sbjct: 636 DMRVSKVDTGVILEEGELKDDGEDSEMQVDAPSDSSVIAQQKAMKSLFGDDEKETGEESE 695
Query: 648 LLPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGP 702
++P P PP H+SV + + +++D K L +GIQ EF GG L C V +R+
Sbjct: 696 IIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR--- 752
Query: 703 AGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 739
+ T +I +EG LC+D+Y+IR LY Q+ ++
Sbjct: 753 -------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
>sp|Q9P2I0|CPSF2_HUMAN Cleavage and polyadenylation specificity factor subunit 2 OS=Homo
sapiens GN=CPSF2 PE=1 SV=2
Length = 782
Score = 518 bits (1334), Expect = e-146, Method: Compositional matrix adjust.
Identities = 291/817 (35%), Positives = 454/817 (55%), Gaps = 113/817 (13%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++T LSGV E+ L YL+ +D F FL+DCGW++HF ++ L K IDAVLL
Sbjct: 1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
SHPD LHLGALPYA+ +LGL+ +++T PVY++G + MYD Y SR +F LFTLDD+D
Sbjct: 61 SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120
Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
+AF + +L +SQ +L GKG G+ + P AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180
Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
E HLNG LE RP++LITD++NA + QP R+QR E + +TLR GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240
Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
GRVLEL +L+ W +Y L VS + +++ KS +EWM D + + FE R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
N F +H++L S+L P PK+VLAS LE GFS D+F++W D KN ++ T R
Sbjct: 301 NPFQFRHLSLCHGLSDLARVP-SPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
GTLAR L +P K ++ + +RV L G+EL Y E++ K+
Sbjct: 360 TPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKEKLKKEAAKKLEQ-------- 411
Query: 416 ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR-DILIDG-------FVPPSTSVAP 467
S + + ++ ++ D+ +P + + D+++ G F + P
Sbjct: 412 ---------SKEADIDSSDESDIEEDIDQPSAHKTKHDLMMKGEGSRKGSFFKQAKKSYP 462
Query: 468 MFPFYENNSEWDDFGEVINPDDYII-----KDEDMDQAAMHIGGDDGKLDEGSASLILDA 522
MFP E +WD++GE+I P+D+++ +E+ + + D +D+ + +
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGDEPMDQDLSDV---- 518
Query: 523 KPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQH 582
P+K +S ++++K + +IDYEGR+DG SIK I++ + P +L++VHG EA++ L +
Sbjct: 519 -PTKCISTTESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAEC 577
Query: 583 CL----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA-- 636
C K + VY P++ ET+D TS+ Y+V+L + L+S++ F K D E+AW+D
Sbjct: 578 CRAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVL 635
Query: 637 --EVGKTENGML-----------------------------------------------S 647
V K + G++
Sbjct: 636 DMRVSKVDTGVILEEGELKDDGEDSEMQVEAPSDSSVIAQQKAMKSLFGDDEKETGEESE 695
Query: 648 LLPISTPAPP-----HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGP 702
++P P PP H+SV + + +++D K L +GIQ EF GG L C V +R+
Sbjct: 696 IIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR--- 752
Query: 703 AGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 739
+ T +I +EG LC+D+Y+IR LY Q+ ++
Sbjct: 753 -------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
>sp|Q9W799|CPSF2_XENLA Cleavage and polyadenylation specificity factor subunit 2
OS=Xenopus laevis GN=cpsf2 PE=1 SV=1
Length = 783
Score = 509 bits (1310), Expect = e-143, Method: Compositional matrix adjust.
Identities = 294/812 (36%), Positives = 453/812 (55%), Gaps = 102/812 (12%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++T L G E+ + YL+ +D F FL+DCGW+++F ++ + K +DAVLL
Sbjct: 1 MTSIIKLTTLVGAQEESAVCYLLQVDEFRFLLDCGWDENFSMDIIDSVKKYVHQVDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
SHPD LHLGALPYA+ +LGL+ +++T PVY++G + MYD Y SR +F LF+LDD+D
Sbjct: 61 SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFSLFSLDDVD 120
Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRK 179
AF + +L Y+Q HL GKG G+ + P AGH++GGT+WKI KDG E+++YAVD+N ++
Sbjct: 121 CAFDKIQQLKYNQIVHLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180
Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
E HLNG LE RP++LITD++NA + QP R+QR E + +TLR GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMINRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240
Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
GRVLEL +L+ W +Y L VS + +++ KS +EWM D + + FE R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
N F +H+TL S+L P PK+VLAS LE GFS ++F++W D KN V+ T R
Sbjct: 301 NPFQFRHLTLCHGYSDLARVP-SPKVVLASQPDLECGFSRELFIQWCQDPKNSVILTYRT 359
Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
GTLAR L P + + + + +RV L G+EL Y E++ K + K E+SK
Sbjct: 360 TPGTLARFLIDHPSERIIDIELRKRVKLEGKELEEYVEKEK------LKKEAAKKLEQSK 413
Query: 416 ASLGPDNNLSGDPMVIDA-NNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYEN 474
+ ++ S ID + A D++ + G + F + PMFP E+
Sbjct: 414 EADLDSSDDSDVEEDIDQITSHKAKHDLMMKNEGSRK----GSFFKQAKKSYPMFPAPED 469
Query: 475 NSEWDDFGEVINPDDYII------KDEDMDQAAMHIGGDDGKLDEGSASLILDAKPSKVV 528
+WD++GE+I P+D+++ +DE + GD+ +D+ + + P+K V
Sbjct: 470 RIKWDEYGEIIKPEDFLVPELQVTEDEKTKLESGLTNGDE-PMDQDLSDV-----PTKCV 523
Query: 529 SNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCL---- 584
S ++++K + +IDYEGR+DG SIK I++ + P +L++VHG +AT+ L + C
Sbjct: 524 STTESMEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPDATQDLAEACRAFGG 583
Query: 585 KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVGK 640
K + VYTP++ ET+D TS+ Y+V+L + L+S++ F K D E+AW+D V K
Sbjct: 584 KDI--KVYTPKLHETVDATSETHIYQVRLKDSLVSSLKFCKAKDTELAWIDGVLDMRVSK 641
Query: 641 TENGML----------------------------------------------------SL 648
+ G++ +L
Sbjct: 642 VDTGVILEERELKDEGEDMEMQVDTQVMDASTIAQQKVIKSLFGDDDKEFSEESEIIPTL 701
Query: 649 LPI-STPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKG 707
P+ S P H+SV + + +++D K L +GI EF GG L C V +R+
Sbjct: 702 EPLPSNEVPGHQSVFMNEPRLSDFKQVLLREGIHAEFVGGVLVCNNMVAVRR-------- 753
Query: 708 GGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 739
+ T +I +EG LCED++KIR LY Q+ ++
Sbjct: 754 --TETGRIGLEGCLCEDFFKIRELLYEQYAIV 783
>sp|Q55BS1|CPSF2_DICDI Cleavage and polyadenylation specificity factor subunit 2
OS=Dictyostelium discoideum GN=cpsf2 PE=3 SV=1
Length = 784
Score = 451 bits (1161), Expect = e-126, Method: Compositional matrix adjust.
Identities = 270/807 (33%), Positives = 441/807 (54%), Gaps = 91/807 (11%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + ++ T LSG +E+P YL+ ID F L+DCG + + D SLL+PL KVA IDAVLL
Sbjct: 1 MASIIKFTALSGAKDESPPCYLLEIDDFCILLDCGLSYNLDFSLLEPLEKVAKKIDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
SH DT H+G LPY + + GL+ ++ T PV ++G + +YD Y ++ EF ++LD+ID
Sbjct: 61 SHSDTTHIGGLPYVVGKYGLTGTIYGTTPVLKMGTMFLYDLYENKMSQEEFQQYSLDNID 120
Query: 121 SAF--QSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRR 178
S F L++SQ+Y LSGKG+GI + P++AGH +G +VWKITK ++YA+DYN R
Sbjct: 121 SCFGEDRFKELSFSQHYSLSGKGKGISITPYLAGHTIGASVWKITKGTYSIVYAIDYNHR 180
Query: 179 KEKHLNGTVLES-FVRPAVLITDAYN-----ALHNQPPRQQREMFQDAISKTLRAGGNVL 232
E HL+ L S ++P++LITD+ A R Q +F+ I++ LR GGNVL
Sbjct: 181 NEGHLDSLQLTSDILKPSLLITDSKGVDKTLAFKKTITRDQ-SLFE-QINRNLRDGGNVL 238
Query: 233 LPVDSAGRVLELLLILEDYWAEH-SLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF 290
+PVD+AGRVLELLL +E+YW+++ SL Y + FL S S + +S LE+M + + F
Sbjct: 239 IPVDTAGRVLELLLCIENYWSKNKSLALYSVVFLGRFSFSVCQFARSQLEFMSSTASVKF 298
Query: 291 ETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
E + +N F KH+ +L + EL PD K++L S LE GFS ++F++W SD K L+L
Sbjct: 299 EQNIENPFSFKHIKILSSLEELQELPDTNKVILTSSQDLETGFSRELFIQWCSDPKTLIL 358
Query: 351 FTERGQFGTLA-RMLQADPPP----KAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALK 405
FT++ +LA ++++ P K +++ RVPL G+EL+ YE EQ + ++E+ L+
Sbjct: 359 FTQKIPKDSLADKLIKQYSTPNGRGKCIEIVQGSRVPLTGDELLQYEMEQAKQREEKRLE 418
Query: 406 ASLVKEEESKASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVP----- 460
+++E+ + +++A N + +++ + R I+ D V
Sbjct: 419 Q--LRKEQEEREERERLEEEEREQLLNATNQDQLQQLLQLQQQKERGIIDDSMVHMKNPF 476
Query: 461 ------------PSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKDED--MDQAAMHIGG 506
S+ MFP++E + +W ++GE DD I++++D +++ M
Sbjct: 477 ENDRFDLLDSEFKKQSMITMFPYFEKHLKWGEYGE--EDDDLILRNQDKKVEEVTME--- 531
Query: 507 DDGKLDEGSASLILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKL 566
+ P K+++ L + + C + IDYEG +DGRSIK I+ +AP KL
Sbjct: 532 --------EDEIQEQEIPKKIITQTLRLPINCKIQTIDYEGCSDGRSIKAIIQQIAPTKL 583
Query: 567 VLVHGSAEATEHLKQHCLKHV-CPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKK 625
VL+ GS + ++ ++ + +++ +Y P I E +D+TSD Y++ L + L++ + K
Sbjct: 584 VLIRGSEQQSQSIENYVKENIRTKGIYIPSIGEQLDLTSDTNVYELLLKDSLVNTLKTSK 643
Query: 626 LGDYEIAWVDAEVGKTENGMLSLLPISTPAP----------------------------- 656
+ DYE++++ +V + + +L + P
Sbjct: 644 ILDYEVSYIQGKVDILDGSNVPVLDLIQSIPINNNNNNNNNNNNNNNNNNNNTTMMTTTT 703
Query: 657 ----PHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGT 712
H +GD+K++DLK L + GIQV+F G L CG V I + G G
Sbjct: 704 TTTNGHDESFIGDIKLSDLKQVLVNAGIQVQFDQGILNCGGLVYIWRDEDHG------GN 757
Query: 713 QQIVIEGPLCEDYYKIRAYLYSQFYLL 739
I ++G + ++YY I+ LY QF ++
Sbjct: 758 SIINVDGIISDEYYLIKELLYKQFQIV 784
>sp|O17403|CPSF2_CAEEL Probable cleavage and polyadenylation specificity factor subunit 2
OS=Caenorhabditis elegans GN=cpsf-2 PE=3 SV=1
Length = 843
Score = 374 bits (960), Expect = e-102, Method: Compositional matrix adjust.
Identities = 229/689 (33%), Positives = 379/689 (55%), Gaps = 48/689 (6%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++ SG +E PL YL+ +DG L+DCGW++ F + L I AVL+
Sbjct: 1 MTSIIKLKVFSGAKDEGPLCYLLQVDGDYILLDCGWDERFGLQYFEELKPFIPKISAVLI 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
SHPD LHLG LPY + + GL+APV++T PVY++G + +YD S V EF+ +TLDD+D
Sbjct: 61 SHPDPLHLGGLPYLVSKCGLTAPVYATVPVYKMGQMFIYDMVYSHLDVEEFEHYTLDDVD 120
Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK-DGEDVIYAVDYNRRK 179
+AF+ V ++ Y+Q L G G+ AGH+LGG++W+I + GED++Y VD+N +K
Sbjct: 121 TAFEKVEQVKYNQTVVLKGDS-GVHFTALPAGHMLGGSIWRICRVTGEDIVYCVDFNHKK 179
Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
E+HLNG ++F RP +LIT A++ Q R+ R E I +T+R G+ ++ +D+A
Sbjct: 180 ERHLNGCSFDNFNRPHLLITGAHHISLPQMRRKDRDEQLVTKILRTVRQKGDCMIVIDTA 239
Query: 239 GRVLELLLILEDYWAEHSL---NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS-R 294
GRVLEL +L+ W+ Y + +++V+SS + + KS LEWM + + K +S R
Sbjct: 240 GRVLELAHLLDQLWSNADAGLSTYNLVMMSHVASSVVQFAKSQLEWMNEKLFKYDSSSAR 299
Query: 295 DNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
N F LKHVTL + EL PK+VL S +E+GFS ++F++W SD +N V+ T R
Sbjct: 300 YNPFTLKHVTLCHSHQELMRVR-SPKVVLCSSQDMESGFSRELFLDWCSDPRNGVILTAR 358
Query: 355 GQFGTLARML-----QADP-----PPKAVKVTMSRRVPLVGEELIAYEE-------EQTR 397
TLA L +A+ + + + + +RV L GEEL+ Y+ E+TR
Sbjct: 359 PASFTLAAKLVNMAERANDGVLKHEDRLISLVVKKRVALEGEELLEYKRRKAERDAEETR 418
Query: 398 LKKEEALKASLVKEEESK------ASLGPDNNLSGDPMVIDANNANASADVVEPHGGRYR 451
L+ E A + + E + A + P ++ + N + D++ ++
Sbjct: 419 LRMERARRQAQANESDDSDDDDIAAPIVPRHSEKDFRSFDGSENDAHTFDIM----AKWD 474
Query: 452 DILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYII-------KDEDMDQAAMHI 504
+ F + PMFP+ E +WDD+GEVI P+DY + K ++ D+ +
Sbjct: 475 NQQKASFFKTTKKSFPMFPYIEEKVKWDDYGEVIKPEDYTVISKIDLRKGQNKDEPVVVK 534
Query: 505 GGDDGKLDEGSASLILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPL 564
++ + + + P+K V + V+V C + FI+YEG +DG S K +L+ + P
Sbjct: 535 KREEEEEVYNPNDHV-EEMPTKCVEFKNRVEVSCRIEFIEYEGISDGESTKKLLAGLLPR 593
Query: 565 KLVLVHGSAEATEHLKQHCLKHV--CPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVL 622
++++VHGS + T L + + P+ +D + + Y+V LS+ L++++
Sbjct: 594 QIIVVHGSRDDTRDLVAYFADSGFDTTMLKAPEAGALVDASVESFIYQVALSDALLADIQ 653
Query: 623 FKKLGD-YEIAWVDAEVGKTE--NGMLSL 648
FK++ + +AW+DA V + E + ML++
Sbjct: 654 FKEVSEGNSLAWIDARVMEKEAIDNMLAV 682
Score = 52.4 bits (124), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 43/85 (50%), Gaps = 11/85 (12%)
Query: 656 PPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRC-GEYVTIRKVGPAGQKGGGSGTQQ 714
P H++V V D K++D K L+ KG + EF G L G +IR+ + T
Sbjct: 769 PIHQAVFVNDPKLSDFKNLLTDKGYKAEFLSGTLLINGGNCSIRR----------NDTGV 818
Query: 715 IVIEGPLCEDYYKIRAYLYSQFYLL 739
+EG +DYYK+R Y QF +L
Sbjct: 819 FQMEGAFTKDYYKLRRLFYDQFAVL 843
>sp|A8XUS3|CPSF2_CAEBR Probable cleavage and polyadenylation specificity factor subunit 2
OS=Caenorhabditis briggsae GN=cpsf-2 PE=3 SV=2
Length = 842
Score = 367 bits (941), Expect = e-100, Method: Compositional matrix adjust.
Identities = 223/687 (32%), Positives = 372/687 (54%), Gaps = 56/687 (8%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++ SG +E PL YL+ +D L+DCGW++ F+ + L I AVL+
Sbjct: 1 MTSIIKLKVFSGAKDEGPLCYLLQVDNDYILLDCGWDERFELKYFEELRPYIPKISAVLI 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
SHPD LHLG LPY + + GL+APV+ T PVY++G + +YD S V EF ++LDD+D
Sbjct: 61 SHPDPLHLGGLPYLVAKCGLTAPVYCTVPVYKMGQMFIYDLVYSHLDVEEFQHYSLDDVD 120
Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK-DGEDVIYAVDYNRRK 179
AF+ V ++ Y+Q L G G+ AGH++GG++W+I + GED+IY VD+N RK
Sbjct: 121 MAFEKVEQVKYNQTVVLKGDS-GVNFTAMPAGHMIGGSMWRICRITGEDIIYCVDFNHRK 179
Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
++HL+G ++F RP +LIT A++ Q R+ R E I +T+R G+ ++ +D+A
Sbjct: 180 DRHLSGCSFDNFNRPHLLITGAHHISLPQMKRKDRDEQLVTKILRTVRQKGDCMIVIDTA 239
Query: 239 GRVLELLLILEDYWAEHSL---NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS-R 294
GRVLEL +L+ WA Y + +++V+SS + + KS LEWM + + + +S R
Sbjct: 240 GRVLELAYLLDQLWANQDAGLSTYNLVMMSHVASSVVQFAKSQLEWMDEKLFRYDSSSAR 299
Query: 295 DNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
N F LK+V L+ + EL PK+VL S +E GFS ++F++W +D +N V+ T R
Sbjct: 300 YNPFTLKNVNLVHSHLELIKIR-SPKVVLCSSQDMETGFSRELFLDWCADQRNGVILTAR 358
Query: 355 -GQFGTLARMLQADPPP---------KAVKVTMSRRVPLVGEELIAYEE-------EQTR 397
F AR+++ K + + + +RVPL GEEL+ Y+ E+TR
Sbjct: 359 PASFTLAARLVELAERANDGVLRNEDKHLSLLVRKRVPLEGEELLEYKRRKAERDAEETR 418
Query: 398 LKKEEALKASLVKEEESKA----------SLGPDNNLSGDPMVIDANNANASADVVEPHG 447
++ E A + + E + L ++ S D + D++ + A
Sbjct: 419 IRMERARRQAQANESDDSDDDDIAAPIVPRLSEKDHRSFDAIENDSHCFDIMA------- 471
Query: 448 GRYRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDY-IIKDEDMDQA------ 500
++ + F + PM+P+ E +WDD+GEVI P+DY +I DM +
Sbjct: 472 -KWDNQQKASFFKSTKKSFPMYPYIEEKVKWDDYGEVIKPEDYTVISKIDMRKGKNKDEP 530
Query: 501 -AMHIGGDDGKLDEGSASLILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILS 559
+H D+ ++ + + P+K V +++ C + FI+YEG +DG S K +L+
Sbjct: 531 VVVHKREDEEEVYNPNDH--DEEMPTKCVEFRNRIEISCRVEFIEYEGISDGESTKKMLA 588
Query: 560 HVAPLKLVLVHGSAEATEHLKQHCLKHVCP--HVYTPQIEETIDVTSDLCAYKVQLSEKL 617
+ P ++++VHGS + T L + + + TP E ID + + Y+V LS+ L
Sbjct: 589 GLMPRQIIIVHGSRDDTRDLYAYFTDNGFKKDQLNTPVANELIDASVESFIYQVSLSDAL 648
Query: 618 MSNVLFKKLGD-YEIAWVDAEVGKTEN 643
++ + FK++ + +AW+DA + + E+
Sbjct: 649 LAEIQFKEVSEGNSLAWIDARIQEKES 675
Score = 34.3 bits (77), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 17/47 (36%), Positives = 25/47 (53%), Gaps = 1/47 (2%)
Query: 644 GMLSLLPI-STPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGAL 689
G L L P+ P H+++ V D K+++ K L KG + EF G L
Sbjct: 755 GTLILTPLPKKQIPVHQAIFVNDPKLSEFKNLLVDKGYKAEFFSGTL 801
>sp|O74740|CFT2_SCHPO Cleavage factor two protein 2 OS=Schizosaccharomyces pombe (strain
972 / ATCC 24843) GN=cft2 PE=1 SV=1
Length = 797
Score = 343 bits (880), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 254/804 (31%), Positives = 403/804 (50%), Gaps = 123/804 (15%)
Query: 23 VSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL-S 81
+ +DG + ID G +D SL P +V D +LLSH D H+G L YA + +
Sbjct: 18 IELDGIHIYIDPGSDD----SLKHP--EVPEQPDLILLSHSDLAHIGGLVYAYYKYDWKN 71
Query: 82 APVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKG 141
A +++T P +G +TM D + +S+ + D+D+ F S+ L Y Q L GK
Sbjct: 72 AYIYATLPTINMGRMTMLDA-IKSNYISDM---SKADVDAVFDSIIPLRYQQPTLLLGKC 127
Query: 142 EGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT-------VLESFVRP 194
G+ + + AGH LGGT+W + K+ E V+YAVD+N K+KHLNG +LE+ RP
Sbjct: 128 SGLTITAYNAGHTLGGTLWSLIKESESVLYAVDWNHSKDKHLNGAALYSNGHILEALNRP 187
Query: 195 AVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 253
LITDA N+L + P R++R E F +++ +L GG VLLPVD+A RVLEL IL+++W+
Sbjct: 188 NTLITDANNSLVSIPSRKKRDEAFIESVMSSLLKGGTVLLPVDAASRVLELCCILDNHWS 247
Query: 254 EHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSE 311
L +PI FL+ S+ TIDY KS +EWMGD+I + F + +N +++ + + S+
Sbjct: 248 ASQPPLPFPILFLSPTSTKTIDYAKSMIEWMGDNIVRDFGIN-ENLLEFRNINTITDFSQ 306
Query: 312 LDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKN-LVLFTERG------------QFG 358
+ + GPK++LA+ +LE GFS I ++ S+ N L+LFT+R ++
Sbjct: 307 ISHIGPGPKVILATALTLECGFSQRILLDLMSENSNDLILFTQRSRCPQNSLANQFIRYW 366
Query: 359 TLARMLQADPP-------PKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKE 411
A + D P +AVK+ + PL GEEL +Y+E + + ++A +L
Sbjct: 367 ERASKKKRDIPHPVGLYAEQAVKIKT--KEPLEGEELRSYQELEFSKRNKDAEDTAL--- 421
Query: 412 EESKASLGPDNNLSGDPMVIDANNANASADVVEPH----------GGRYRDILIDGFVPP 461
E ++ ++ S D + N PH G + L D V
Sbjct: 422 EFRNRTILDEDLSSSSSSEDDDLDLNTEV----PHVALGSSAFLMGKSFDLNLRDPAVQA 477
Query: 462 STSVAPMFPFYENNSEWDDFGEVINPDDYIIKDEDMDQAAMHIGGDDGKLDEGSA----S 517
+ MFP+ E D++GE+I D+ + +E + + DD L + S
Sbjct: 478 LHTKYKMFPYIEKRRRIDEYGEIIKHQDFSMINEPANTLELENDSDDNALSNSNGKRKWS 537
Query: 518 LILDA------------KPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLK 565
I D PSK++++E T++V C + FID EG DGRS+KTI+ V P +
Sbjct: 538 EINDGLQQKKEEEDEDEVPSKIITDEKTIRVSCQVQFIDIEGLHDGRSLKTIIPQVNPRR 597
Query: 566 LVLVHGSAEATEHLKQHC--LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLF 623
LVL+H S E E +K+ C L VY P E I+V+ D+ A+ ++L++ L+ N+++
Sbjct: 598 LVLIHASTEEKEDMKKTCASLSAFTKDVYIPNYGEIINVSIDVNAFSLKLADDLIKNLIW 657
Query: 624 KKLGDYEIAWVDAEVGKTENGM---------------------------------LSLLP 650
K+G+ E++ + A+V ++ L+L
Sbjct: 658 TKVGNCEVSHMLAKVEISKPSEEEDKKEEVEKKDGDKERNEEKKEEKETLPVLNALTLRS 717
Query: 651 ISTPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKVGPAGQKGGG 709
AP +LVG++++A L+ L +GI E G G L CG V +RK+ GG
Sbjct: 718 DLARAPRAAPLLVGNIRLAYLRKALLDQGISAELKGEGVLLCGGAVAVRKLS-----GG- 771
Query: 710 SGTQQIVIEGPLCEDYYKIRAYLY 733
+I +EG L +++IR +Y
Sbjct: 772 ----KISVEGSLSNRFFEIRKLVY 791
>sp|Q503E1|INT11_DANRE Integrator complex subunit 11 OS=Danio rerio GN=cpsf3l PE=2 SV=1
Length = 598
Score = 169 bits (428), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 110/356 (30%), Positives = 177/356 (49%), Gaps = 18/356 (5%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG + F D S + ++ +D
Sbjct: 4 IKVTPLGAGQDVGRSCILVSIGGKNIMLDCGMHMGFNDDRRFPDFSYITQNGRLTEFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
V++SH H GALPY + +G P++ T P + + + D + ++ + E + FT
Sbjct: 64 VIISHFHLDHCGALPYMSEMVGYDGPIYMTHPTKAICPILLEDFRKITVDKKGETNFFTS 123
Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
I + V L Q + + E + + AGH+LG + +I E V+Y DYN
Sbjct: 124 QMIKDCMKKVVPLNLHQTVQVDDELE---IKAYYAGHVLGAAMVQIKVGSESVVYTGDYN 180
Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
++HL ++ RP +LI+++ A + ++ RE F + +T+ GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDILISESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239
Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
+ GR EL ++LE +W +L PIYF T ++ Y K F+ W I K+F +
Sbjct: 240 FALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
N F KH+ ++S DN GP +V A+ L AG S IF +WA + KN+V+
Sbjct: 298 NMFEFKHIKAF-DRSYADNP--GPMVVFATPGMLHAGQSLQIFKKWAGNEKNMVIM 350
>sp|Q3MHC2|INT11_RAT Integrator complex subunit 11 OS=Rattus norvegicus GN=Cpsf3l PE=2
SV=1
Length = 600
Score = 168 bits (426), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 111/356 (31%), Positives = 180/356 (50%), Gaps = 18/356 (5%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG +ND F D S + ++ +D
Sbjct: 4 IRVTPLGAGQDVGRSCILVSISGKNVMLDCGMHMGYNDDRRFPDFSYITQSGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
V++SH H GALPY + +G P++ T P + + + D + ++ + E + FT
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123
Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
I + V + Q + + E + + AGH+LG +++I E V+Y DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180
Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
++HL ++ RP +LIT++ A + ++ RE F + +T+ GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239
Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
+ GR EL ++LE +W +L PIYF T ++ Y K F+ W I K+F +
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
N F KH+ +++ DN GP +V A+ L AG S IF +WA + KN+V+
Sbjct: 298 NMFEFKHIKAF-DRTFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350
Score = 35.8 bits (81), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 20/82 (24%), Positives = 37/82 (45%)
Query: 519 ILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEH 578
IL + + ++VK + ++ + AD + I ++ P ++LVHG A+ E
Sbjct: 363 ILSGQRKLEMEGRQMLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKMEF 422
Query: 579 LKQHCLKHVCPHVYTPQIEETI 600
L+Q + Y P ET+
Sbjct: 423 LRQKIEQEFRVSCYMPANGETV 444
>sp|Q9CWS4|INT11_MOUSE Integrator complex subunit 11 OS=Mus musculus GN=Cpsf3l PE=2 SV=1
Length = 600
Score = 168 bits (426), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 111/356 (31%), Positives = 180/356 (50%), Gaps = 18/356 (5%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG +ND F D S + ++ +D
Sbjct: 4 IRVTPLGAGQDVGRSCILVSISGKNVMLDCGMHMGYNDDRRFPDFSYITQSGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
V++SH H GALPY + +G P++ T P + + + D + ++ + E + FT
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123
Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
I + V + Q + + E + + AGH+LG +++I E V+Y DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180
Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
++HL ++ RP +LIT++ A + ++ RE F + +T+ GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239
Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
+ GR EL ++LE +W +L PIYF T ++ Y K F+ W I K+F +
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
N F KH+ +++ DN GP +V A+ L AG S IF +WA + KN+V+
Sbjct: 298 NMFEFKHIKAF-DRTFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350
Score = 35.8 bits (81), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 20/84 (23%), Positives = 38/84 (45%)
Query: 519 ILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEH 578
IL + + ++VK + ++ + AD + I ++ P ++LVHG A+ E
Sbjct: 363 ILSGQRKLEMEGRQMLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKMEF 422
Query: 579 LKQHCLKHVCPHVYTPQIEETIDV 602
L+Q + Y P ET+ +
Sbjct: 423 LRQKIEQEFRVSCYMPANGETVTL 446
>sp|Q5NVE6|INT11_PONAB Integrator complex subunit 11 OS=Pongo abelii GN=CPSF3L PE=2 SV=2
Length = 600
Score = 168 bits (425), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 109/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG + F D S + ++ +D
Sbjct: 4 IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
V++SH H GALPY + +G P++ T P + + + D + ++ + E + FT
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123
Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
I + V + Q + + E + + AGH+LG +++I E V+Y DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180
Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
++HL ++ RP +LIT++ A + ++ RE F + +T+ GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239
Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
+ GR EL ++LE +W +L PIYF T ++ Y K F+ W I K+F +
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQR 297
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
N F KH+ +++ DN GP +V A+ L AG S IF +WA + KN+V+
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350
Score = 37.4 bits (85), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 21/82 (25%), Positives = 38/82 (46%)
Query: 519 ILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEH 578
IL + + ++VK + ++ + AD + I ++ P ++LVHG A+ E
Sbjct: 363 ILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKMEF 422
Query: 579 LKQHCLKHVCPHVYTPQIEETI 600
LKQ + + Y P ET+
Sbjct: 423 LKQKIEQELRVSCYMPANGETV 444
>sp|Q5TA45|INT11_HUMAN Integrator complex subunit 11 OS=Homo sapiens GN=CPSF3L PE=1 SV=2
Length = 600
Score = 168 bits (425), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 109/356 (30%), Positives = 178/356 (50%), Gaps = 18/356 (5%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG + F D S + ++ +D
Sbjct: 4 IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
V++SH H GALPY + +G P++ T P + + + D + ++ + E + FT
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123
Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
I + V + Q + + E + + AGH+LG +++I E V+Y DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180
Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
++HL ++ RP +LIT++ A + ++ RE F + +T+ GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239
Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
+ GR EL ++LE +W +L PIYF T ++ Y K F+ W I K+F +
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQR 297
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
N F KH+ +++ DN GP +V A+ L AG S IF +WA + KN+V+
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350
Score = 38.1 bits (87), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 21/82 (25%), Positives = 39/82 (47%)
Query: 519 ILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEH 578
IL + + ++VK + ++ + AD + I ++ P ++LVHG A+ E
Sbjct: 363 ILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKMEF 422
Query: 579 LKQHCLKHVCPHVYTPQIEETI 600
LKQ + + + Y P ET+
Sbjct: 423 LKQKIEQELRVNCYMPANGETV 444
>sp|Q5ZIH0|INT11_CHICK Integrator complex subunit 11 OS=Gallus gallus GN=CPSF3L PE=2 SV=1
Length = 600
Score = 167 bits (423), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 111/356 (31%), Positives = 180/356 (50%), Gaps = 18/356 (5%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG +ND F D S + ++ +D
Sbjct: 4 IKVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGYNDDRRFPDFSYITQNGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
V++SH H GALPY + +G P++ T P + + + D + ++ + E + FT
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTKAICPILLEDYRKITVDKKGETNFFTS 123
Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
I + V + Q + E + + + AGH+LG +++I E V+Y DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVD---EELEIKAYYAGHVLGAAMFQIKVGCESVVYTGDYN 180
Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
++HL ++ RP +LIT++ A + ++ RE F + +T+ GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239
Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
+ GR EL ++LE +W +L PIYF T ++ Y K F+ W I K+F +
Sbjct: 240 FALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF--VQR 297
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
N F KH+ +++ DN GP +V A+ L AG S IF +WA + KN+V+
Sbjct: 298 NMFEFKHIKAF-DRAFADNP--GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350
Score = 38.1 bits (87), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 21/87 (24%), Positives = 40/87 (45%)
Query: 519 ILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEH 578
IL + + ++VK + ++ + AD + I ++ P ++LVHG A+ E
Sbjct: 363 ILSGQRKLEMEGRQILEVKMQVEYMSFSAHADAKGIMQLIRQAEPRNVLLVHGEAKKMEF 422
Query: 579 LKQHCLKHVCPHVYTPQIEETIDVTSD 605
LKQ + + Y P ET + ++
Sbjct: 423 LKQKIEQEFHVNCYMPANGETTTIFTN 449
>sp|Q2YDM2|INT11_BOVIN Integrator complex subunit 11 OS=Bos taurus GN=CPSF3L PE=2 SV=2
Length = 599
Score = 165 bits (417), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 108/356 (30%), Positives = 176/356 (49%), Gaps = 18/356 (5%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG + F D S ++ +D
Sbjct: 4 IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFSDDRRFPDFSYNTRSGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
V++SH H GALPY + +G P++ T+P + + + D + ++ + E + FT
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTQPTQAICPILLEDYRKIAVDKKGEANFFTS 123
Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
I + V + Q + + E + + AGH+LG +++I E V+Y DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180
Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
++HL ++ RP++LIT++ A + ++ RE F + +T+ GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPSLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239
Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
+ GR EL ++LE +W L PIYF T ++ Y K F+ W I K+F +
Sbjct: 240 FALGRAQELCILLETFWERMDLKAPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF--VQR 297
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
N F KH+ ++P GP +V A+ L AG S IF +WA + KN+V+
Sbjct: 298 NMFEFKHIKAF--DRAFADSP-GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIM 350
Score = 37.7 bits (86), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 21/82 (25%), Positives = 38/82 (46%)
Query: 519 ILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEH 578
IL + + ++VK + ++ + AD + I ++ P ++LVHG A+ E
Sbjct: 363 ILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPENVLLVHGEAKKMEF 422
Query: 579 LKQHCLKHVCPHVYTPQIEETI 600
LKQ + + Y P ET+
Sbjct: 423 LKQKIEQEFRVNCYMPANGETV 444
>sp|Q12102|CFT2_YEAST Cleavage factor two protein 2 OS=Saccharomyces cerevisiae (strain
ATCC 204508 / S288c) GN=CFT2 PE=1 SV=1
Length = 859
Score = 165 bits (417), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 198/838 (23%), Positives = 328/838 (39%), Gaps = 179/838 (21%)
Query: 22 LVSIDGFNFLIDCGWNDHFDPS------LLQPLSKVASTIDAVLLSHPDTLHLGA---LP 72
+V D LID GWN PS ++ KV ID ++LS P LGA L
Sbjct: 19 VVRFDNVTLLIDPGWN----PSKVSYEQCIKYWEKVIPEIDVIILSQPTIECLGAHSLLY 74
Query: 73 YAMKQLGLSA-PVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLD--DIDSAFQSVTRL 129
Y +S V++T PV LG ++ D Y S + +D LD DI+ +F + L
Sbjct: 75 YNFTSHFISRIQVYATLPVINLGRVSTIDSYASAGVIGPYDTNKLDLEDIEISFDHIVPL 134
Query: 130 TYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN----- 184
YSQ L + +G+ + + AG GG++W I+ E ++YA +N ++ LN
Sbjct: 135 KYSQLVDLRSRYDGLTLLAYNAGVCPGGSIWCISTYSEKLVYAKRWNHTRDNILNAASIL 194
Query: 185 ---GTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRV 241
G L + +RP+ +IT +QP +++ ++F+D + K L + G+V++PVD +G+
Sbjct: 195 DATGKPLSTLMRPSAIITTLDRFGSSQPFKKRSKIFKDTLKKGLSSDGSVIIPVDMSGKF 254
Query: 242 LELL-----LILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN 296
L+L L+ E P+ L+Y T+ Y KS LEW+ S+ K++E +R+N
Sbjct: 255 LDLFTQVHELLFESTKINAHTQVPVLILSYARGRTLTYAKSMLEWLSPSLLKTWE-NRNN 313
Query: 297 A--FLLKHVTLLINKSELDNAPDGPKLVLASMA------------------------SLE 330
F + +I +EL P G K+ S S E
Sbjct: 314 TSPFEIGSRIKIIAPNELSKYP-GSKICFVSEVGALINEVIIKVGNSEKTTLILTKPSFE 372
Query: 331 AGFSHDIFVEWA-SDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELI 389
S D +E D +N F E G+ + D + PL EE
Sbjct: 373 CASSLDKILEIVEQDERNWKTFPEDGKSFLCDNYISID---------TIKEEPLSKEETE 423
Query: 390 AYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDAN-------------NA 436
A++ + K++ K LVK E K + +G+ ++ D N N
Sbjct: 424 AFKVQLKEKKRDRNKKILLVKRESKKLA-------NGNAIIDDTNGERAMRNQDILVENV 476
Query: 437 NASADVVEPHGG---------------------------RYRDILIDGFVPPST-SVAPM 468
N + GG + ++ +D + PS S M
Sbjct: 477 NGVPPIDHIMGGDEDDDEEEENDNLLNLLKDNSEKSAAKKNTEVPVDIIIQPSAASKHKM 536
Query: 469 FPFYENNSEWDDFGEVIN-----PDD---------------------------------- 489
FPF + DD+G V++ PDD
Sbjct: 537 FPFNPAKIKKDDYGTVVDFTMFLPDDSDNVNQNSRKRPLKDGAKTTSPVNEEDNKNEEED 596
Query: 490 -YIIKDEDMDQAAMHIGGDDGKLDEGSAS-------LILDAKPSKVVSNELTVQVKCLLI 541
Y + D ++ G G A L +D SK + + VQ+KC ++
Sbjct: 597 GYNMSDPISKRSKHRASRYSGFSGTGEAENFDNLDYLKIDKTLSKRTISTVNVQLKCSVV 656
Query: 542 FIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETID 601
++ + D RS I + K+VL E + +K V P + + ++
Sbjct: 657 ILNLQSLVDQRSASIIWPSLKSRKIVLSAPKQIQNEEITAKLIKKNIEVVNMP-LNKIVE 715
Query: 602 VTSDLCAYKVQLSEKLMSNVLFKKLGD-YEIAWVDAEVGK------------TENGMLSL 648
++ + + + L + + ++++ D Y +A V + K L L
Sbjct: 716 FSTTIKTLDISIDSNLDNLLKWQRISDSYTVATVVGRLVKESLPQVNNHQKTASRSKLVL 775
Query: 649 LPISTPAPPHKS--VLVGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKVGPA 703
P+ + HK+ + +GD+++A LK L+ K EF G G L E V +RK+ A
Sbjct: 776 KPLHGSSRSHKTGALSIGDVRLAQLKKLLTEKNYIAEFKGEGTLVINEKVAVRKINDA 833
>sp|O13794|YSH1_SCHPO Endoribonuclease ysh1 OS=Schizosaccharomyces pombe (strain 972 /
ATCC 24843) GN=ysh1 PE=3 SV=2
Length = 757
Score = 160 bits (404), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 101/320 (31%), Positives = 171/320 (53%), Gaps = 14/320 (4%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVS-EF 111
ST+D +L+SH H+ +LPY M++ VF T P + + D Y+ V E
Sbjct: 69 STVDVLLISHFHLDHVASLPYVMQKTNFRGRVFMTHPTKAVCKWLLSD-YVKVSNVGMED 127
Query: 112 DLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIY 171
L+ D+ +AF + + +YH + + EGI P+ AGH+LG ++ + G ++++
Sbjct: 128 QLYDEKDLLAAFDRIEAV----DYHSTIEVEGIKFTPYHAGHVLGACMYFVEMAGVNILF 183
Query: 172 AVDYNRRKEKHLNGTVLESFVRPAVLITDA-YNALHNQPPRQQREMFQDAISKTLRAGGN 230
DY+R +++HL+ + RP VLIT++ Y +QP ++ + I T+R GG
Sbjct: 184 TGDYSREEDRHLHVAEVPP-KRPDVLITESTYGTASHQPRLEKEARLLNIIHSTIRNGGR 242
Query: 231 VLLPVDSAGRVLELLLILEDYWAEH--SLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITK 288
VL+PV + GR ELLLIL++YW H + PIY+ + ++ + ++++ M D+I K
Sbjct: 243 VLMPVFALGRAQELLLILDEYWNNHLDLRSVPIYYASSLARKCMAIFQTYVNMMNDNIRK 302
Query: 289 SFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNL 348
F + N F+ + V L N + D+ GP ++LAS L+ G S + WA D +N
Sbjct: 303 IF--AERNPFIFRFVKSLRNLEKFDDI--GPSVILASPGMLQNGVSRTLLERWAPDPRNT 358
Query: 349 VLFTERGQFGTLARMLQADP 368
+L T GT+A+ + +P
Sbjct: 359 LLLTGYSVEGTMAKQITNEP 378
>sp|Q74ZC0|YSH1_ASHGO Endoribonuclease YSH1 OS=Ashbya gossypii (strain ATCC 10895 / CBS
109.51 / FGSC 9923 / NRRL Y-1056) GN=YSH1 PE=3 SV=2
Length = 771
Score = 159 bits (402), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 94/329 (28%), Positives = 171/329 (51%), Gaps = 20/329 (6%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLSRRQ-- 107
S ++ +L+SH H +LPY M++ VF T P +YR LL+ + + +
Sbjct: 61 SQVEVLLISHFHLDHAASLPYVMQRTNFQGRVFMTHPTKAIYRW-LLSDFVKVTNIGNDN 119
Query: 108 ---VSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
VS+ +L+T +D+ +F + + +YH + GI + AGH+LG ++++
Sbjct: 120 AGGVSDENLYTDEDLAESFDRIETV----DYHSTIDVNGIKFTAYHAGHVLGAAMFQVEI 175
Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
G +++ DY+R ++HLN + + +++ + ++P + + I T
Sbjct: 176 AGLRILFTGDYSRELDRHLNSAEIPTLPSDILIVESTFGTATHEPRTSKEKKLTQLIHTT 235
Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNY-----PIYFLTYVSSSTIDYVKSFL 279
+ GG VLLPV + GR E++LIL++YW++H+ PI++ + ++ + ++++
Sbjct: 236 VSKGGRVLLPVFALGRAQEIMLILDEYWSQHAEQLGNGQVPIFYASNLARKCMSVFQTYV 295
Query: 280 EWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFV 339
M D I K F S+ N F+ K+++ L N E + GP ++LAS L+ G S D+
Sbjct: 296 NMMNDKIRKKFRDSQTNPFIFKNISYLKNLDEFQDF--GPSVMLASPGMLQNGLSRDLLE 353
Query: 340 EWASDVKNLVLFTERGQFGTLARMLQADP 368
+W D KNLVL T GT+A+ L +P
Sbjct: 354 KWCPDEKNLVLITGYSVEGTMAKFLMLEP 382
>sp|Q9C952|CPSF3_ARATH Cleavage and polyadenylation specificity factor subunit 3-I
OS=Arabidopsis thaliana GN=CPSF73-I PE=1 SV=1
Length = 693
Score = 159 bits (402), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 114/385 (29%), Positives = 188/385 (48%), Gaps = 40/385 (10%)
Query: 2 GTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCG------------WNDHFDPSLLQPLS 49
G + VTPL +S G N L DCG + D DPS
Sbjct: 19 GDQLIVTPLGAGSEVGRSCVYMSFRGKNILFDCGIHPAYSGMAALPYFDEIDPS------ 72
Query: 50 KVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRR 106
+ID +L++H H +LPY +++ + VF +T+ +Y+L LLT Y + +S+
Sbjct: 73 ----SIDVLLITHFHIDHAASLPYFLEKTTFNGRVFMTHATKAIYKL-LLTDYVK-VSKV 126
Query: 107 QVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG 166
V + LF DI+ + + + + Q ++G I + AGH+LG ++ + G
Sbjct: 127 SVEDM-LFDEQDINKSMDKIEVIDFHQTVEVNG----IKFWCYTAGHVLGAAMFMVDIAG 181
Query: 167 EDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTL 225
++Y DY+R +++HL L F P + I ++ + + R RE F D I T+
Sbjct: 182 VRILYTGDYSREEDRHLRAAELPQF-SPDICIIESTSGVQLHQSRHIREKRFTDVIHSTV 240
Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMG 283
GG VL+P + GR ELLLIL++YWA H N PIY+ + ++ + ++++ M
Sbjct: 241 AQGGRVLIPAFALGRAQELLLILDEYWANHPDLHNIPIYYASPLAKKCMAVYQTYILSMN 300
Query: 284 DSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWAS 343
D I F S N F+ KH++ L + + ++ GP +V+A+ L++G S +F W S
Sbjct: 301 DRIRNQFANS--NPFVFKHISPLNSIDDFNDV--GPSVVMATPGGLQSGLSRQLFDSWCS 356
Query: 344 DVKNLVLFTERGQFGTLARMLQADP 368
D KN + GTLA+ + +P
Sbjct: 357 DKKNACIIPGYMVEGTLAKTIINEP 381
Score = 38.9 bits (89), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 37/140 (26%), Positives = 63/140 (45%), Gaps = 13/140 (9%)
Query: 509 GKLDEGSASLILDAKPSKV-VSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLV 567
G + EG+ + + +P +V + N LT + + +I + AD T L + P ++
Sbjct: 366 GYMVEGTLAKTIINEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNII 425
Query: 568 LVHGSAEATEHLKQHCLKHV---CPHVYTPQIEETIDV---TSDLCAYKVQLSEK----- 616
LVHG A LKQ L + TP+ E++++ + L +L+EK
Sbjct: 426 LVHGEANEMMRLKQKLLTEFPDGNTKIMTPKNCESVEMYFNSEKLAKTIGRLAEKTPDVG 485
Query: 617 -LMSNVLFKKLGDYEIAWVD 635
+S +L KK Y+I D
Sbjct: 486 DTVSGILVKKGFTYQIMAPD 505
>sp|Q6FUA5|YSH1_CANGA Endoribonuclease YSH1 OS=Candida glabrata (strain ATCC 2001 / CBS
138 / JCM 3761 / NBRC 0622 / NRRL Y-65) GN=YSH1 PE=3
SV=1
Length = 771
Score = 159 bits (401), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 106/371 (28%), Positives = 183/371 (49%), Gaps = 23/371 (6%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLS----R 105
S +D +L+SH H +LPY M++ VF T P +YR LL + + S
Sbjct: 60 SIVDVLLISHFHLDHAASLPYVMQKTNFKGRVFMTHPTKAIYRW-LLRDFVRVTSIGSQS 118
Query: 106 RQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKD 165
+ +L++ +D+ +F + + +YH GI AGH+LG +++I
Sbjct: 119 SNAEDDNLYSNEDLIESFDKIETI----DYHSMIDVNGIKFTAFHAGHVLGAAMFQIEIA 174
Query: 166 GEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTL 225
G V++ DY+R ++HLN + +++ + ++P + + I T+
Sbjct: 175 GLRVLFTGDYSREIDRHLNSAEVPPLPSDILIVESTFGTATHEPRLHREKKLTQLIHSTV 234
Query: 226 RAGGNVLLPVDSAGRVLELLLILEDYWAEH-----SLNYPIYFLTYVSSSTIDYVKSFLE 280
GG VL+PV + GR EL+LIL++YW++H S PI++ + ++ + ++++
Sbjct: 235 NKGGRVLMPVFALGRAQELMLILDEYWSQHKEELGSNQIPIFYASNLARKCLSVFQTYVN 294
Query: 281 WMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVE 340
M D+I K F S+ N F+ K++ + N E + GP ++LAS L+ G S D+
Sbjct: 295 MMNDNIRKKFRDSQTNPFIFKNIAYIKNLDEFQDF--GPSVMLASPGMLQNGLSRDLLER 352
Query: 341 WASDVKNLVLFTERGQFGTLAR--MLQADPPPKAV--KVTMSRRVPLVGEELIAYEEEQT 396
W D KNLVL T GT+A+ +L+ D P +VT+ RR + A+ + Q
Sbjct: 353 WCPDEKNLVLITGYSVEGTMAKYLLLEPDTIPSVSNPEVTIPRRCRVEELSFAAHVDFQE 412
Query: 397 RLKKEEALKAS 407
L+ E + AS
Sbjct: 413 NLEFIEQINAS 423
>sp|Q6CUI5|YSH1_KLULA Endoribonuclease YSH1 OS=Kluyveromyces lactis (strain ATCC 8585 /
CBS 2359 / DSM 70799 / NBRC 1267 / NRRL Y-1140 / WM37)
GN=YSH1 PE=3 SV=1
Length = 764
Score = 158 bits (399), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 101/348 (29%), Positives = 175/348 (50%), Gaps = 24/348 (6%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLS----- 104
STID +L+SH H +LPY M++ VF T P +YR LL + + S
Sbjct: 64 STIDLLLISHFHLDHAASLPYVMQRTNFRGRVFMTHPTKAIYRW-LLNDFVKVTSIGDSP 122
Query: 105 RRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
+ S +L++ +D+ +F + + +YH + + GI AGH+LG +++I
Sbjct: 123 GQDSSNDNLYSDEDLAESFDRIETI----DYHSTMEVNGIKFTAFHAGHVLGAAMFQIEI 178
Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
G V++ DY+R ++HLN + +++ + ++P + + I
Sbjct: 179 AGVRVLFTGDYSREVDRHLNSAEVPPQSSDVIIVESTFGTATHEPRQNRERKLTQLIHTV 238
Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL-----NYPIYFLTYVSSSTIDYVKSFL 279
+ GG VLLPV + GR E++LIL++YW H PI++ + ++ + ++++
Sbjct: 239 VSKGGRVLLPVFALGRAQEIMLILDEYWQNHKEELGNGQVPIFYASNLAKKCMSVFQTYV 298
Query: 280 EWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFV 339
M D I K F+ S+ N F+ K+++ L N E ++ GP ++LAS L+ G S DI
Sbjct: 299 NMMNDDIRKKFKDSQTNPFIFKNISYLKNLDEFEDF--GPSVMLASPGMLQNGLSRDILE 356
Query: 340 EWASDVKNLVLFTERGQFGTLARML----QADPPPKAVKVTMSRRVPL 383
+W + KNLVL T GT+A+ L +A P ++T+ RR +
Sbjct: 357 KWCPEEKNLVLVTGYSVEGTMAKYLLLEPEAIPSVHNPEITIPRRCQV 404
>sp|Q6C2Z7|YSH1_YARLI Endoribonuclease YSH1 OS=Yarrowia lipolytica (strain CLIB 122 / E
150) GN=YSH1 PE=3 SV=2
Length = 827
Score = 156 bits (395), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 98/322 (30%), Positives = 169/322 (52%), Gaps = 15/322 (4%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGLLTMYDQYLSRRQVS 109
STID +L+SH H +LPY M++ VF T P +YR LL+ + + S + S
Sbjct: 87 STIDILLISHFHLDHAASLPYVMQKTNFKGRVFMTHPTKGIYRW-LLSDFVRVTSGAE-S 144
Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
+ DL++ D+ ++F + + +YH + + G+ + AGH+LG ++ I G V
Sbjct: 145 DPDLYSEADLTASFNKIETI----DYHSTMEVNGVKFTAYHAGHVLGAAMYTIEVGGVKV 200
Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAG 228
++ DY+R +++HLN + ++P +LI ++ PR +RE I TL G
Sbjct: 201 LFTGDYSREEDRHLNQAEVPP-MKPDILICESTYGTGTHLPRLEREQRLTGLIHSTLDKG 259
Query: 229 GNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSI 286
G LLPV + GR E+LLIL++YW H + IY+ + ++ I ++++ M D+I
Sbjct: 260 GKCLLPVFALGRAQEILLILDEYWEAHPDLQEFSIYYASALAKKCIAVYQTYINMMNDNI 319
Query: 287 TKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVK 346
+ F + N F K++ + N D+ GP +++AS L++G S + WA D K
Sbjct: 320 RRRFRDQKTNPFRFKYIKNIKNLDRFDDM--GPCVMVASPGMLQSGVSRSLLERWAPDPK 377
Query: 347 NLVLFTERGQFGTLARMLQADP 368
N ++ T GT+A+ + +P
Sbjct: 378 NTLILTGYSVEGTMAKQIINEP 399
>sp|Q06224|YSH1_YEAST Endoribonuclease YSH1 OS=Saccharomyces cerevisiae (strain ATCC
204508 / S288c) GN=YSH1 PE=1 SV=1
Length = 779
Score = 154 bits (389), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 104/371 (28%), Positives = 181/371 (48%), Gaps = 23/371 (6%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGL-----LTMYDQYLS 104
S +D +L+SH H +LPY M++ VF T P +YR L +T S
Sbjct: 59 SKVDILLISHFHLDHAASLPYVMQRTNFQGRVFMTHPTKAIYRWLLRDFVRVTSIGSSSS 118
Query: 105 RRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
+ LF+ +D+ +F + + +YH + GI AGH+LG +++I
Sbjct: 119 SMGTKDEGLFSDEDLVDSFDKIETV----DYHSTVDVNGIKFTAFHAGHVLGAAMFQIEI 174
Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
G V++ DY+R ++HLN + +++ + ++P + I T
Sbjct: 175 AGLRVLFTGDYSREVDRHLNSAEVPPLSSNVLIVESTFGTATHEPRLNRERKLTQLIHST 234
Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHS-----LNYPIYFLTYVSSSTIDYVKSFL 279
+ GG VLLPV + GR E++LIL++YW++H+ PI++ + ++ + ++++
Sbjct: 235 VMRGGRVLLPVFALGRAQEIMLILDEYWSQHADELGGGQVPIFYASNLAKKCMSVFQTYV 294
Query: 280 EWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFV 339
M D I K F S+ N F+ K+++ L N + + GP ++LAS L++G S D+
Sbjct: 295 NMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF--GPSVMLASPGMLQSGLSRDLLE 352
Query: 340 EWASDVKNLVLFTERGQFGTLAR--MLQAD--PPPKAVKVTMSRRVPLVGEELIAYEEEQ 395
W + KNLVL T GT+A+ ML+ D P ++T+ RR + A+ + Q
Sbjct: 353 RWCPEDKNLVLITGYSIEGTMAKFIMLEPDTIPSINNPEITIPRRCQVEEISFAAHVDFQ 412
Query: 396 TRLKKEEALKA 406
L+ E + A
Sbjct: 413 ENLEFIEKISA 423
>sp|P0CM88|YSH1_CRYNJ Endoribonuclease YSH1 OS=Cryptococcus neoformans var. neoformans
serotype D (strain JEC21 / ATCC MYA-565) GN=YSH1 PE=3
SV=1
Length = 773
Score = 152 bits (385), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 100/324 (30%), Positives = 169/324 (52%), Gaps = 14/324 (4%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGL---SAPVFSTEPVYRLGLLTMYDQYLSRRQVS 109
ST+DA+L++H H ALPY M++ + V+ T + LTM D Q
Sbjct: 79 STVDAMLITHFHVDHAAALPYIMEKTNFKDGNGKVYMTHATKAIYGLTMMDTVRLNDQNP 138
Query: 110 EFD--LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGE 167
+ L+ D+ S++QS + Y Q+ ++G G+ P+ AGH+LG +++ I G
Sbjct: 139 DTSGRLYDEADVQSSWQSTIAVDYHQDIVIAG---GLRFTPYHAGHVLGASMFLIEIAGL 195
Query: 168 DVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLR 226
++Y DY+R +++HL + V+P V+I ++ +H P R+++E F ++ +R
Sbjct: 196 KILYTGDYSREEDRHLVMAEIPP-VKPDVMICESTFGVHTLPDRKEKEEQFTTLVANIVR 254
Query: 227 AGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGD 284
GG L+P+ S G EL L+L++YW +H N P+YF + + + K+++ M
Sbjct: 255 RGGRCLMPIPSFGNGQELALLLDEYWNDHPELQNIPVYFASSLFQRGMRVYKTYVHTMNA 314
Query: 285 SITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASD 344
+I F RDN F + V L + +L GP ++++S + G S D+ EWA D
Sbjct: 315 NIRSRF-ARRDNPFDFRFVKWLKDPQKLREN-KGPCVIMSSPQFMSFGLSRDLLEEWAPD 372
Query: 345 VKNLVLFTERGQFGTLARMLQADP 368
KN V+ T GT+AR L ++P
Sbjct: 373 SKNGVIVTGYSIEGTMARTLLSEP 396
>sp|P0CM89|YSH1_CRYNB Endoribonuclease YSH1 OS=Cryptococcus neoformans var. neoformans
serotype D (strain B-3501A) GN=YSH1 PE=3 SV=1
Length = 773
Score = 152 bits (385), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 100/324 (30%), Positives = 169/324 (52%), Gaps = 14/324 (4%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGL---SAPVFSTEPVYRLGLLTMYDQYLSRRQVS 109
ST+DA+L++H H ALPY M++ + V+ T + LTM D Q
Sbjct: 79 STVDAMLITHFHVDHAAALPYIMEKTNFKDGNGKVYMTHATKAIYGLTMMDTVRLNDQNP 138
Query: 110 EFD--LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGE 167
+ L+ D+ S++QS + Y Q+ ++G G+ P+ AGH+LG +++ I G
Sbjct: 139 DTSGRLYDEADVQSSWQSTIAVDYHQDIVIAG---GLRFTPYHAGHVLGASMFLIEIAGL 195
Query: 168 DVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLR 226
++Y DY+R +++HL + V+P V+I ++ +H P R+++E F ++ +R
Sbjct: 196 KILYTGDYSREEDRHLVMAEIPP-VKPDVMICESTFGVHTLPDRKEKEEQFTTLVANIVR 254
Query: 227 AGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGD 284
GG L+P+ S G EL L+L++YW +H N P+YF + + + K+++ M
Sbjct: 255 RGGRCLMPIPSFGNGQELALLLDEYWNDHPELQNIPVYFASSLFQRGMRVYKTYVHTMNA 314
Query: 285 SITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASD 344
+I F RDN F + V L + +L GP ++++S + G S D+ EWA D
Sbjct: 315 NIRSRF-ARRDNPFDFRFVKWLKDPQKLREN-KGPCVIMSSPQFMSFGLSRDLLEEWAPD 372
Query: 345 VKNLVLFTERGQFGTLARMLQADP 368
KN V+ T GT+AR L ++P
Sbjct: 373 SKNGVIVTGYSIEGTMARTLLSEP 396
>sp|Q9UKF6|CPSF3_HUMAN Cleavage and polyadenylation specificity factor subunit 3 OS=Homo
sapiens GN=CPSF3 PE=1 SV=1
Length = 684
Score = 147 bits (372), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 97/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)
Query: 22 LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
++ G ++DCG + + P + + ID +L+SH H GALP+ +++
Sbjct: 29 ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88
Query: 80 LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
F +T+ +YR + Y+ +S D L+T D++ + + + N+
Sbjct: 89 FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140
Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
H + GI + AGH+LG ++ I G ++Y D++R++++HL + + ++P
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199
Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
+LI ++ H R++RE F + + + GG L+PV + GR ELLLIL++YW
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259
Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
H + PIY+ + ++ + ++++ M D I K +N F+ KH++ L +
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317
Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
D+ GP +V+AS +++G S ++F W +D +N V+ GTLA+ + ++ P+
Sbjct: 318 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373
Query: 373 VKVTMSRRVPL 383
+ +++PL
Sbjct: 374 ITTMSGQKLPL 384
>sp|P79101|CPSF3_BOVIN Cleavage and polyadenylation specificity factor subunit 3 OS=Bos
taurus GN=CPSF3 PE=2 SV=1
Length = 684
Score = 147 bits (372), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 97/371 (26%), Positives = 188/371 (50%), Gaps = 24/371 (6%)
Query: 22 LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
++ G ++DCG + + P + + ID +L+SH H GALP+ +++
Sbjct: 29 ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88
Query: 80 LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
F +T+ +YR + Y+ +S D L+T D++ + + + N+
Sbjct: 89 FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140
Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
H + GI + AGH+LG ++ I G ++Y D++R++++HL + + ++P
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199
Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
+LI ++ H R++RE F + + + GG L+PV + GR ELLLIL++YW
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259
Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
H + PIY+ + ++ + ++++ M D I K +N F+ KH++ L +
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317
Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
D+ GP +V+AS +++G S ++F W +D +N V+ GTLA+ + ++ P+
Sbjct: 318 DDI--GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373
Query: 373 VKVTMSRRVPL 383
+ +++PL
Sbjct: 374 ITTMSGQKLPL 384
>sp|Q9QXK7|CPSF3_MOUSE Cleavage and polyadenylation specificity factor subunit 3 OS=Mus
musculus GN=Cpsf3 PE=1 SV=2
Length = 684
Score = 146 bits (369), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 97/371 (26%), Positives = 187/371 (50%), Gaps = 24/371 (6%)
Query: 22 LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
++ G ++DCG + + P + + ID +L+SH H GALP+ +++
Sbjct: 29 ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88
Query: 80 LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
F +T+ +YR + Y+ +S D L+T D++ + + + N+
Sbjct: 89 FKGRTFMTHATKAIYRW----LLSDYVKVSNISADDMLYTETDLEESMDKIETI----NF 140
Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
H + GI + AGH+LG ++ I G ++Y D++R++++HL + + ++P
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199
Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
+LI ++ H R++RE F + + + GG L+PV + GR ELLLIL++YW
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259
Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
H + PIY+ + ++ + ++++ M D I K +N F+ KH++ L +
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQINI--NNPFVFKHISNLKSMDHF 317
Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 372
D+ GP +V+AS ++ G S ++F W +D +N V+ GTLA+ + ++ P+
Sbjct: 318 DDI--GPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE--PEE 373
Query: 373 VKVTMSRRVPL 383
+ +++PL
Sbjct: 374 ITTMSGQKLPL 384
>sp|Q4PEJ3|YSH1_USTMA Endoribonuclease YSH1 OS=Ustilago maydis (strain 521 / FGSC 9021)
GN=YSH1 PE=3 SV=1
Length = 880
Score = 143 bits (361), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 91/322 (28%), Positives = 168/322 (52%), Gaps = 13/322 (4%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLS---APVFSTEPVYRLGLLTMYDQYLSRRQVS 109
ST+DA+L++H H AL Y M++ V+ T P + M D +
Sbjct: 74 STVDAILITHFHLDHAAALTYIMEKTNFRDGHGKVYMTHPTKAVYRFLMSDFVRISNAGN 133
Query: 110 EFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
+ +LF +++ ++++ + + + Q+ ++G G+ + AGH+LG ++ I G +
Sbjct: 134 DDNLFDENEMLASWRQIEAVDFHQDVSIAG---GLRFTSYHAGHVLGACMFLIEIAGLRI 190
Query: 170 IYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAG 228
+Y D++R +++HL + V+P VLI ++ PR +E F I ++ G
Sbjct: 191 LYTGDFSREEDRHLVQAEIPP-VKPDVLICESTYGTQTHEPRLDKEHRFTSQIHHIIKRG 249
Query: 229 GNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSI 286
G VLLPV GR ELLL+L++YWA H + PIY+ + ++ I ++++ M D I
Sbjct: 250 GRVLLPVFVLGRAQELLLLLDEYWAAHPELHSVPIYYASALAKKCISVYQTYIHTMNDHI 309
Query: 287 TKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVK 346
F RDN F+ KH++ L + + ++ GP +++AS +++G S ++ WA D +
Sbjct: 310 RTRF-NRRDNPFVFKHISNLRSLEKFEDR--GPCVMMASPGFMQSGVSRELLERWAPDKR 366
Query: 347 NLVLFTERGQFGTLARMLQADP 368
N ++ + GT+AR + +P
Sbjct: 367 NGLIVSGYSVEGTMARNILNEP 388
>sp|Q54YL3|INT11_DICDI Integrator complex subunit 11 homolog OS=Dictyostelium discoideum
GN=ints11 PE=3 SV=1
Length = 744
Score = 142 bits (358), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 104/371 (28%), Positives = 177/371 (47%), Gaps = 19/371 (5%)
Query: 4 SVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGW----ND--HF-DPSLLQPLSKVASTID 56
+++V PL + +V+I N + DCG ND F D S + + ID
Sbjct: 2 TIKVVPLGAGQDVGRSCVIVTIGNKNIMFDCGMHMGMNDARRFPDFSYISKNGQFTKVID 61
Query: 57 AVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFT 115
V+++H H GALP+ + G P++ T P + + + D + ++ + E + FT
Sbjct: 62 CVIITHFHLDHCGALPFFTEMCGYDGPIYMTLPTKAICPILLEDYRKITVEKKGETNFFT 121
Query: 116 LDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDY 175
I + V + Q + E + + + AGH+LG ++ E V+Y DY
Sbjct: 122 AQMIKDCMKKVIPVNLHQTIKVD---EELSIKAYYAGHVLGAAMFYAKVGDESVVYTGDY 178
Query: 176 NRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLP 234
N ++HL ++ V+P VLIT+ A + ++ RE F I + + GG VL+P
Sbjct: 179 NMTPDRHLGSAWIDQ-VKPDVLITETTYATTIRDSKRGRERDFLKRIHECVEKGGKVLIP 237
Query: 235 VDSAGRVLELLLILEDYWAEHSLNY-PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
V + GRV EL ++++ YW + +L + PIYF ++ Y K F+ W I ++F
Sbjct: 238 VFALGRVQELCILIDSYWEQMNLGHIPIYFSAGLAEKANLYYKLFINWTNQKIKQTF--V 295
Query: 294 RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
+ N F KH+ +S L +AP G ++ A+ L AG S ++F +WA + N+ +
Sbjct: 296 KRNMFDFKHIKPF--QSHLVDAP-GAMVLFATPGMLHAGASLEVFKKWAPNELNMTIIPG 352
Query: 354 RGQFGTLARML 364
GT+ L
Sbjct: 353 YCVVGTVGNKL 363
Score = 39.7 bits (91), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 18/67 (26%), Positives = 33/67 (49%)
Query: 528 VSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCLKHV 587
+ + T++VKC + + + AD + I ++ P ++LVHG E L Q +K +
Sbjct: 383 IDKKTTIEVKCKIHNLSFSAHADAKGILQLIKMSNPRNVILVHGEKEKMGFLSQKIIKEM 442
Query: 588 CPHVYTP 594
+ Y P
Sbjct: 443 GVNCYYP 449
>sp|Q86A79|CPSF3_DICDI Cleavage and polyadenylation specificity factor subunit 3
OS=Dictyostelium discoideum GN=cpsf3 PE=3 SV=1
Length = 774
Score = 142 bits (358), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 100/373 (26%), Positives = 180/373 (48%), Gaps = 19/373 (5%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVAST----IDAVLL 60
+++TP+ L+ G + DCG + + + P + ID +L+
Sbjct: 36 LEITPIGSGSEVGRSCVLLKYKGKKVMFDCGVHPAYSGLVSLPFFDSIESDIPDIDLLLV 95
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD--LFTLDD 118
SH H A+PY + + VF T P + + + D Y+ ++ D LF D
Sbjct: 96 SHFHLDHAAAVPYFVGKTKFKGRVFMTHPTKAIYGMLLSD-YVKVSNITRDDDMLFDKSD 154
Query: 119 IDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRR 178
+D + + + ++ Y Q + GI V AGH+LG ++ I G ++Y D++R+
Sbjct: 155 LDRSLEKIEKVRYRQKV----EHNGIKVTCFNAGHVLGAAMFMIEIAGVKILYTGDFSRQ 210
Query: 179 KEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDS 237
+++HL G V+ VLI ++ + PR +RE F ++ + + G L+PV +
Sbjct: 211 EDRHLMGAETPP-VKVDVLIIESTYGVQVHEPRLEREKRFTSSVHQVVERNGKCLIPVFA 269
Query: 238 AGRVLELLLILEDYW-AEHSLNY-PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
GR ELLLIL++YW A L++ PIY+ + ++ + ++++ M D + F+ S
Sbjct: 270 LGRAQELLLILDEYWIANPQLHHVPIYYASALAKKCMGVYRTYINMMNDRVRAQFDVS-- 327
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
N F KH+ + D+ GP + +AS L++G S +F W SD +N ++
Sbjct: 328 NPFEFKHIKNIKGIESFDDR--GPCVFMASPGMLQSGLSRQLFERWCSDKRNGIVIPGYS 385
Query: 356 QFGTLARMLQADP 368
GTLA+ + ++P
Sbjct: 386 VEGTLAKHIMSEP 398
>sp|Q4IPN9|YSH1_GIBZE Endoribonuclease YSH1 OS=Gibberella zeae (strain PH-1 / ATCC
MYA-4620 / FGSC 9075 / NRRL 31084) GN=YSH1 PE=3 SV=2
Length = 833
Score = 142 bits (357), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 103/379 (27%), Positives = 181/379 (47%), Gaps = 28/379 (7%)
Query: 21 YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
+++ G ++D G + +D P ST+D +L+SH H +LPY + +
Sbjct: 41 HIIQYKGKTVMLDAGQHPAYDGLAALPFYDDFDLSTVDVLLISHFHIDHAASLPYVLAKT 100
Query: 79 GLSAPVFSTEPVYRLGLLTMYDQYL---SRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNY 135
VF T P + + D + + ++T D + F + + Y +
Sbjct: 101 NFRGRVFMTHPTKAIYKWLIQDSVRVGNTSSNPTTQPVYTEQDHLNTFPQIEAIDYHTTH 160
Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
+S I + P+ AGH+LG ++ I G ++ + DY+R +++HL + V+
Sbjct: 161 TISS----IRITPYPAGHVLGAAMFLIEIAGLNIFFTGDYSREQDRHLVSAEVPKGVKID 216
Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
VLIT++ + + PR +RE +I+ L GG VL+PV + GR ELLLIL++YW +
Sbjct: 217 VLITESTYGIASHVPRLEREQALMKSITSILNRGGRVLMPVFALGRAQELLLILDEYWGK 276
Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF-------ETSRDNA-----FLL 300
H+ YPIY+ + ++ + ++++ M D+I + F E S D A +
Sbjct: 277 HADFQKYPIYYASNLARKCMLIYQTYVGAMNDNIKRLFRERMAEAEASGDGAGKGGPWDF 336
Query: 301 KHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTL 360
K++ L N D+ G ++LAS L+ G S ++ WA KN V+ T GT+
Sbjct: 337 KYIRSLKNLDRFDDV--GGCVMLASPGMLQNGVSRELLERWAPSEKNGVIITGYSVEGTM 394
Query: 361 ARMLQADPPPKAVKVTMSR 379
A+ + + P ++ MSR
Sbjct: 395 AKQIMQE--PDQIQAVMSR 411
>sp|Q5BEP0|YSH1_EMENI Endoribonuclease ysh1 OS=Emericella nidulans (strain FGSC A4 / ATCC
38163 / CBS 112.46 / NRRL 194 / M139) GN=ysh1 PE=3 SV=1
Length = 884
Score = 136 bits (343), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 100/361 (27%), Positives = 172/361 (47%), Gaps = 19/361 (5%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
ST+D +L+SH H ALPY + + VF T + + D S D
Sbjct: 74 STVDILLISHFHVDHSSALPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVNNTASSSD 133
Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
T + S L + +++ + I + P+ AGH+LG ++ I+ G ++++
Sbjct: 134 QRTTLYTEHDHLSTLPLIETIDFNTTHTINSIRITPYPAGHVLGAAMFLISIAGLNILFT 193
Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNV 231
DY+R +++HL + V+ VLIT++ + + PPR +RE +I+ L GG V
Sbjct: 194 GDYSREEDRHLIPATVPRGVKIDVLITESTFGISSNPPRLEREAALMKSITGVLNRGGRV 253
Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
L+PV + GR ELLLILE+YW H PIY++ + + ++++ M D+I +
Sbjct: 254 LMPVFALGRAQELLLILEEYWETHPELQKIPIYYIGNTARRCMVVYQTYIGAMNDNIKRL 313
Query: 290 F-------ETSRDNA-----FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
F E S D + + K+V L + D+ G ++LAS L+ G S ++
Sbjct: 314 FRQRMAEAEASGDKSVSAGPWDFKYVRSLRSLERFDDV--GGCVMLASPGMLQTGTSREL 371
Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTR 397
WA + +N V+ T GT+A+ L + P + MSR +G + +E+ +
Sbjct: 372 LERWAPNERNGVVMTGYSVEGTMAKQLLNE--PDQIHAVMSRAATGMGRTRMNGNDEEQK 429
Query: 398 L 398
+
Sbjct: 430 I 430
>sp|Q8WZS6|YSH1_NEUCR Endoribonuclease ysh-1 OS=Neurospora crassa (strain ATCC 24698 /
74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) GN=ysh-1
PE=3 SV=1
Length = 850
Score = 134 bits (338), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 103/382 (26%), Positives = 184/382 (48%), Gaps = 30/382 (7%)
Query: 21 YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHLGALPYAMKQL 78
+++ G ++D G + +D P ST+D +L+SH H +LPY + +
Sbjct: 40 HIIQYKGKTVMLDAGQHPAYDGLAALPFFDDFDLSTVDVLLISHFHIDHAASLPYVLAKT 99
Query: 79 GLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNY 135
VF +T+ +Y+ + + ++T +D F + + Y+ +
Sbjct: 100 NFRGRVFMTHATKAIYKWLIQDSVRVGNTSSNPQSSLVYTEEDHLKTFPMIEAIDYNTTH 159
Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
+S I + P+ AGH+LG ++ I G + + DY+R +++HL + V+
Sbjct: 160 TISS----IRITPYPAGHVLGAAMFLIEIAGLKIFFTGDYSREEDRHLISAKVPKGVKID 215
Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
VLIT++ + + PR +RE +I+ L GG VL+PV + GR ELLLIL++YW +
Sbjct: 216 VLITESTYGIASHIPRPEREQALMKSITGILNRGGRVLMPVFALGRAQELLLILDEYWGK 275
Query: 255 HS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF-------ETSRDNA-----FLL 300
H+ YPIY+ + ++ + ++++ M D+I + F E+S D A +
Sbjct: 276 HAEYQKYPIYYASNLARKCMLVYQTYVGSMNDNIKRLFRERLAESESSGDGAGKGGPWDF 335
Query: 301 KHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
+ + L LD D G ++LAS L+ G S ++ WA KN V+ T GT
Sbjct: 336 RFIRSL---KSLDRFEDVGGCVMLASPGMLQNGVSRELLERWAPSEKNGVIITGYSVEGT 392
Query: 360 LARMLQADPPPKAVKVTMSRRV 381
+A+ L + P+ ++ MSR +
Sbjct: 393 MAKQLLQE--PEQIQAVMSRNI 412
>sp|Q6BMW3|YSH1_DEBHA Endoribonuclease YSH1 OS=Debaryomyces hansenii (strain ATCC 36239 /
CBS 767 / JCM 1990 / NBRC 0083 / IGC 2968) GN=YSH1 PE=3
SV=2
Length = 815
Score = 133 bits (334), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 99/341 (29%), Positives = 169/341 (49%), Gaps = 34/341 (9%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLS----- 104
S +D +L+SH H +LPY M+ + VF +T+ +YR LL+ + + S
Sbjct: 64 SKVDILLVSHFHLDHAASLPYVMQHTNFNGRVFMTHATKAIYRW-LLSDFVKVTSIGGGS 122
Query: 105 ---------RRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLL 155
+L+T DD+ +F + + +YH + + +GI + AGH+L
Sbjct: 123 DARLNNSDPNANTGSSNLYTDDDLMRSFDRIETI----DYHSTIELDGIRFTAYHAGHVL 178
Query: 156 GGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE 215
G ++ I G V++ DY+ +++HL + ++P +LIT++ PR ++E
Sbjct: 179 GACMYFIEIGGLKVLFTGDYSSEEDRHLQVAEVPP-IKPDILITESTFGTATHEPRLEKE 237
Query: 216 M-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA--EHSLNYPIYFLTYVSSSTI 272
+ I TL GG +L+PV + GR ELLLILE+YW+ + N IY+ + ++ +
Sbjct: 238 TRMTNIIHSTLLKGGRILMPVFALGRAQELLLILEEYWSLNDDLQNINIYYASSLARKCM 297
Query: 273 DYVKSFLEWMGDSI----TKSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMA 327
+++ M DSI + + + + N F K + + N LD D GP +V+AS
Sbjct: 298 AVYQTYTNIMNDSIRLTTSATNSSKKQNPFQFKFIKSIKN---LDKFQDFGPCVVVASPG 354
Query: 328 SLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADP 368
L+ G S ++ WA D KN V+ T GT+A+ L +P
Sbjct: 355 MLQNGVSRELLERWAPDPKNAVIMTGYSVEGTMAKDLLTEP 395
>sp|Q59P50|YSH1_CANAL Endoribonuclease YSH1 OS=Candida albicans (strain SC5314 / ATCC
MYA-2876) GN=YSH1 PE=3 SV=1
Length = 870
Score = 133 bits (334), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 97/331 (29%), Positives = 169/331 (51%), Gaps = 23/331 (6%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVS 109
S +D +L+SH H +LPY M+Q VF +T+ +YR L+ + + S
Sbjct: 150 SKVDILLISHFHVDHSASLPYVMQQSNFRGKVFMTHATKAIYRW-LMQDFVRVTSIGNSR 208
Query: 110 EFD--------LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWK 161
D L+T DDI +F + + +YH + + +GI + AGH+LG ++
Sbjct: 209 SEDGGGGEGSNLYTDDDIMKSFDRIETI----DYHSTMEIDGIRFTAYHAGHVLGACMYF 264
Query: 162 ITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDA 220
I G V++ DY+R + +HL+ + ++P +LI+++ PR + E
Sbjct: 265 IEIGGLKVLFTGDYSREENRHLHAAEVPP-LKPDILISESTFGTGTLEPRIELERKLTTH 323
Query: 221 ISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSF 278
I T+ GG VLLPV + G ELLLIL++YW+++ N +++ + ++ + +++
Sbjct: 324 IHATIAKGGRVLLPVFALGNAQELLLILDEYWSQNEDLQNVNVFYASNLAKKCMAVYETY 383
Query: 279 LEWMGDSITKSFETS-RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
M D I S +S + N F K++ + + S+ + GP +V+A+ L+AG S +
Sbjct: 384 TGIMNDKIRLSSASSEKSNPFDFKYIKSIKDLSKFQDM--GPSVVVATPGMLQAGVSRQL 441
Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADP 368
+WA D KNLV+ T GT+A+ L +P
Sbjct: 442 LEKWAPDGKNLVILTGYSVEGTMAKELLKEP 472
>sp|Q8GUU3|CPS3B_ARATH Cleavage and polyadenylation specificity factor subunit 3-II
OS=Arabidopsis thaliana GN=CPSF73-II PE=1 SV=2
Length = 613
Score = 132 bits (333), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 97/358 (27%), Positives = 167/358 (46%), Gaps = 20/358 (5%)
Query: 22 LVSIDGFNFLIDCGW-------NDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
+V+I+G + DCG N + + SL+ + I ++++H H+GALPY
Sbjct: 20 VVTINGKKIMFDCGMHMGCDDHNRYPNFSLISKSGDFDNAISCIIITHFHMDHVGALPYF 79
Query: 75 MKQLGLSAPVFSTEPVYRLGLLTM--YDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYS 132
+ G + P++ + P L L + Y + + R+ E +LFT I + + V +
Sbjct: 80 TEVCGYNGPIYMSYPTKALSPLMLEDYRRVMVDRRGEE-ELFTTTHIANCMKKVIAIDLK 138
Query: 133 QNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFV 192
Q + E + + + AGH+LG + ++Y DYN ++HL ++ +
Sbjct: 139 QTIQVD---EDLQIRAYYAGHVLGAVMVYAKMGDAAIVYTGDYNMTTDRHLGAAKIDR-L 194
Query: 193 RPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDY 251
+ +LI+++ A + + RE F A+ K + GG L+P + GR EL ++L+DY
Sbjct: 195 QLDLLISESTYATTIRGSKYPREREFLQAVHKCVAGGGKALIPSFALGRAQELCMLLDDY 254
Query: 252 WAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSE 311
W ++ PIYF + ++ Y K + W ++ + T N F K+V
Sbjct: 255 WERMNIKVPIYFSSGLTIQANMYYKMLISWTSQNVKEKHNTH--NPFDFKNVKDF--DRS 310
Query: 312 LDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPP 369
L +AP GP ++ A+ L AGFS ++F WA NLV GT+ L A P
Sbjct: 311 LIHAP-GPCVLFATPGMLCAGFSLEVFKHWAPSPLNLVALPGYSVAGTVGHKLMAGKP 367
>sp|Q4WRC2|YSH1_ASPFU Endoribonuclease ysh1 OS=Neosartorya fumigata (strain ATCC MYA-4609
/ Af293 / CBS 101355 / FGSC A1100) GN=ysh1 PE=3 SV=1
Length = 872
Score = 131 bits (329), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 99/361 (27%), Positives = 170/361 (47%), Gaps = 19/361 (5%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
ST+D +L+SH H ALPY + + VF T + + D S D
Sbjct: 75 STVDILLISHFHVDHSSALPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVSNTASSSD 134
Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
T + S L + +++ + I + P AGH+LG ++ I+ G ++++
Sbjct: 135 QRTTLYTEHDHLSTLPLIETIDFNTTHTVNSIRITPFPAGHVLGAAMFLISIAGLNILFT 194
Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNV 231
DY+R +++HL + ++ VLIT++ + PPR +RE +I+ L GG V
Sbjct: 195 GDYSREEDRHLIPAEVPKGIKIDVLITESTFGISTNPPRLEREAALMKSITGILNRGGRV 254
Query: 232 LLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
L+PV + GR ELLLIL++YW H PIY++ + + ++++ M D+I +
Sbjct: 255 LMPVFALGRAQELLLILDEYWETHPELQKIPIYYIGNTARRCMVVYQTYIGAMNDNIKRL 314
Query: 290 F-------ETSRDNA-----FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
F E S D + + K V L + D+ G ++LAS L+ G S ++
Sbjct: 315 FRQRMAEAEASGDKSASAGPWDFKFVRSLRSLERFDDV--GGCVMLASPGMLQTGTSREL 372
Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTR 397
WA + +N V+ T GT+A+ L + P+ + MSR V +A +E+ +
Sbjct: 373 LERWAPNERNGVVMTGYSVEGTMAKQLLNE--PEQIPAVMSRSAGGVSRRGLAGTDEEQK 430
Query: 398 L 398
+
Sbjct: 431 I 431
>sp|Q54SH0|INT9_DICDI Integrator complex subunit 9 homolog OS=Dictyostelium discoideum
GN=ints9 PE=3 SV=1
Length = 712
Score = 88.6 bits (218), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 75/279 (26%), Positives = 114/279 (40%), Gaps = 52/279 (18%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLG------LLTMYDQY---- 102
STID +L+S+ ++ ALP+ + +++TEP ++G L+ M QY
Sbjct: 115 STIDMILISNYTNIY--ALPFITEYTNFQGKIYATEPTVQIGKLLLEELVQMDKQYSNSS 172
Query: 103 ----------------------------------LSRRQVSEFDLFTLDDIDSAFQSVTR 128
L R DL+ DI+ +F+ +
Sbjct: 173 INNNNNNNNLSDCWQNIEILEKLNVHNVGMENENLYRDSYRWKDLYKKIDIEKSFEKIQS 232
Query: 129 LTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRKEKHLNGTV 187
+ +++ S K G P +G+ LG W I G E V+Y D + ++
Sbjct: 233 IRFNE----SIKHYGFECIPSSSGYGLGSANWVIESKGFERVVYISDSSLSLSRYPTPFQ 288
Query: 188 LESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLI 247
L P VLI N N PP Q I TL+ GG VL+P S G +L+L
Sbjct: 289 LSPIDNPDVLILSKINHYPNNPPDQMLSELCSNIGSTLQQGGTVLIPSYSCGIILDLFEH 348
Query: 248 LEDYWAEHSLNY-PIYFLTYVSSSTIDYVKSFLEWMGDS 285
L DY + L Y PIYF++ VS + + Y + EW+ S
Sbjct: 349 LADYLNKVGLPYVPIYFVSSVSKAVLSYADIYSEWLNKS 387
>sp|Q58633|Y1236_METJA Uncharacterized protein MJ1236 OS=Methanocaldococcus jannaschii
(strain ATCC 43067 / DSM 2661 / JAL-1 / JCM 10045 / NBRC
100440) GN=MJ1236 PE=4 SV=1
Length = 634
Score = 87.8 bits (216), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 91/373 (24%), Positives = 154/373 (41%), Gaps = 18/373 (4%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWN----DHFDPSLLQPLSKVASTIDAVLL 60
++V+ L G V LIDCG N D P P + +DAV++
Sbjct: 180 IRVSFLGGAREVGRSCLYVQTPDTRVLIDCGINVACEDKAFPHFDAPEFSIED-LDAVIV 238
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
+H H G +P + + G PV+ T P L L D ++ + +T DI
Sbjct: 239 THAHLDHCGFIP-GLFRYGYDGPVYCTRPTRDLMTLLQKDYLEIAKKEGKEVPYTSKDIK 297
Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTV--WKITKDGEDVIYAVDYNRR 178
+ + + Y +S I + H AGH+LG + I + ++ Y D
Sbjct: 298 TCVKHTIPIDYGVTTDIS---PTIKLTLHNAGHVLGSAIAHLHIGEGLYNLAYTGDIKFE 354
Query: 179 KEKHLNGTVLESFVRPAVLITDAYNALHN--QPPRQQREMFQDAISKTLRAGGNVLLPVD 236
+ L V + ++I Y A + + +S+T GG VL+PV
Sbjct: 355 TSRLLEPAVCQFPRLETLIIESTYGAYDDVLPEREEAERELLRVVSETTDRGGKVLIPVF 414
Query: 237 SAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN 296
GR EL+L+LE+ + + N P+Y + +T + ++ E++ + + DN
Sbjct: 415 GVGRAQELMLVLEEGYNQGIFNAPVYLDGMIWEATAIHT-AYPEYLSKEMRQKIFHEGDN 473
Query: 297 AFL---LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
FL K V + ++ ++ D P ++LA+ L G S + A D KN ++F
Sbjct: 474 PFLSEVFKRVGSTNERRKVIDS-DEPCVILATSGMLTGGPSVEYLKHLAPDEKNAIIFVG 532
Query: 354 RGQFGTLARMLQA 366
GTL R +Q+
Sbjct: 533 YQAEGTLGRKVQS 545
>sp|Q5SLP1|RNSE_THET8 Ribonuclease TTHA0252 OS=Thermus thermophilus (strain HB8 / ATCC
27634 / DSM 579) GN=TTHA0252 PE=1 SV=1
Length = 431
Score = 87.8 bits (216), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 96/384 (25%), Positives = 157/384 (40%), Gaps = 22/384 (5%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG-WNDHFDPSLLQPLSKVASTIDAVLLSHP 63
+++ P ++L+ G L+DCG + + P +DAVLL+H
Sbjct: 1 MRIVPFGAAREVTGSAHLLLAGGRRVLLDCGMFQGKEEARNHAPFGFDPKEVDAVLLTHA 60
Query: 64 DTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAF 123
H+G LP ++ G PV++T L + + D +V + F +D++ A
Sbjct: 61 HLDHVGRLPKLFRE-GYRGPVYATRATVLLMEIVLEDAL----KVMDEPFFGPEDVEEAL 115
Query: 124 QSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHL 183
+ L Y + L + +A AGHL G +G ++Y+ D R++ L
Sbjct: 116 GHLRPLEYGEWLRLGA----LSLAFGQAGHLPGSAFVVAQGEGRTLVYSGDLGNREKDVL 171
Query: 184 NGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLE 243
L VL Y ++P R+ F + + KTL GG VL+P + R E
Sbjct: 172 PDPSLPPLAD-LVLAEGTYGDRPHRPYRETVREFLEILEKTLSQGGKVLIPTFAVERAQE 230
Query: 244 LLLILEDYWAEHSL-NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFL--- 299
+L +L Y H L PIY + ++ + + + + + F + N F
Sbjct: 231 ILYVL--YTHGHRLPRAPIYLDSPMAGRVLSLYPRLVRYFSEEVQAHFLQGK-NPFRPAG 287
Query: 300 LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGT 359
L+ V L+ AP GP +VLA L G SD +N ++F G
Sbjct: 288 LEVVEHTEASKALNRAP-GPMVVLAGSGMLAGGRILHHLKHGLSDPRNALVFVGYQPQGG 346
Query: 360 LARMLQADPPPKAVKVTMSRRVPL 383
L + A PP AV++ + VPL
Sbjct: 347 LGAEIIARPP--AVRI-LGEEVPL 367
>sp|A7SBF0|INT9_NEMVE Integrator complex subunit 9 homolog OS=Nematostella vectensis
GN=ints9 PE=3 SV=1
Length = 660
Score = 87.4 bits (215), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 111/408 (27%), Positives = 176/408 (43%), Gaps = 66/408 (16%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVS--------IDGF---NFLIDCGWNDHFD--PSLLQP 47
M T Q TPLS V NE S L S I+GF N L + G D P + P
Sbjct: 32 MSTVNQFTPLSLVNNEK-FSQLKSWSSRELQEIEGFTAQNNLKEAGGRLFIDAEPEVCPP 90
Query: 48 LSKVA--STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLG------LLTMY 99
+ + S +D +L+S + H+ ALP+ + G + +++TEP ++G L+T
Sbjct: 91 ETGLIDFSMVDVILIS--NYHHMLALPFITEYSGFNGKIYATEPTIQIGRDLMLELVTFA 148
Query: 100 DQYLSRRQVSEFD-----------------------LFTLDDIDSAFQSVTRLTYSQNYH 136
++ RR + + L++ D+ + + ++YS+
Sbjct: 149 ERVPKRRNGNMWKNDNVIRCLPAPLNELANVKSWRVLYSKHDVKACISKIQAVSYSEKLD 208
Query: 137 LSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKH---LNGTVLESFVR 193
L G + ++ H +G LG + W + + E + Y + + H LN TVL++
Sbjct: 209 LCGI---LQLSAHSSGFCLGSSNWMLESEYEKISY-LSPSSSFTTHPLPLNQTVLKN--S 262
Query: 194 PAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 253
++IT A + P E F ++ TLRAGGNVL+P +G + +L L Y
Sbjct: 263 DVLIITGVTEAPIDNPDAMLGE-FCTHLASTLRAGGNVLVPCYPSGVLYDLFECLYTYLD 321
Query: 254 EHSLNY-PIYFLTYVSSSTIDYVKSFLEWMGDSI-TKSF--ETSRDNAFLLKHVTLLINK 309
L PIYF++ V+ S++ Y + EW+ S TK + E +A LLK L +
Sbjct: 322 NAKLGMVPIYFISPVADSSLAYSNIYGEWLCQSKQTKVYLPEPPFPHAELLKEARLKV-F 380
Query: 310 SELDNAPDG----PKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
S L N P +V SL G + W N V+FTE
Sbjct: 381 SNLHNGFSSSFKTPCVVFTGHPSLRYGDAVHFMEIWGKSGNNTVIFTE 428
>sp|Q57626|Y162_METJA Uncharacterized protein MJ0162 OS=Methanocaldococcus jannaschii
(strain ATCC 43067 / DSM 2661 / JAL-1 / JCM 10045 / NBRC
100440) GN=MJ0162 PE=3 SV=1
Length = 421
Score = 84.0 bits (206), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 89/343 (25%), Positives = 158/343 (46%), Gaps = 39/343 (11%)
Query: 31 LIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALP-YAMKQLGLSAPVFSTEP 89
L+DCG P + +DAV++SH H GA+P Y K+ ++ T P
Sbjct: 28 LLDCG----MSPDTGEIPKVDDKAVDAVIVSHAHLDHCGAIPFYKFKK------IYCTHP 77
Query: 90 VYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPH 149
L +T D + E +DI A +++ L Y + ++ E I +
Sbjct: 78 TADLMFITWRDTLNLTKAYKE------EDIQHAMENIECLNYYEERQIT---ENIKFKFY 128
Query: 150 VAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNA-LHNQ 208
AGH+LG + DG+ ++Y D N + L + ++I Y + L +
Sbjct: 129 NAGHILGSASIYLEVDGKKILYTGDINEGVSRTLLPADTDIDEIDVLIIESTYGSPLDIK 188
Query: 209 PPRQ--QREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL-NYPIYFLT 265
P R+ +R++ ++ IS+T+ GG V++PV + GR E+LLI+ +Y L + PIY
Sbjct: 189 PARKTLERQLIEE-ISETIENGGKVIIPVFAIGRAQEILLIINNYIRSGKLRDVPIYTDG 247
Query: 266 YVSSSTIDYVKSFLEWMGDSITKSFETSRDNAF-LLKHV--TLLINKSELDNAPDGPKLV 322
+ +T Y+ S++ W+ I K+ +R N F +K +L+ NK P ++
Sbjct: 248 SLIHATAVYM-SYINWLNPKI-KNMVENRINPFGEIKKADESLVFNKE--------PCII 297
Query: 323 LASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQ 365
+++ ++ G +++ D KN ++ T GTL R L+
Sbjct: 298 VSTSGMVQGGPVLK-YLKLLKDPKNKLILTGYQAEGTLGRELE 339
>sp|Q2KJA6|INT9_BOVIN Integrator complex subunit 9 OS=Bos taurus GN=INTS9 PE=2 SV=1
Length = 658
Score = 82.4 bits (202), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 88/339 (25%), Positives = 142/339 (41%), Gaps = 46/339 (13%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD--QYLSRR---- 106
ST+D +L+S+ + ALPY + G + V++TEP ++G L M + ++ R
Sbjct: 94 STVDVILISNYHCMM--ALPYITEHTGFTGTVYATEPTVQIGRLLMEELVNFIERVPKAQ 151
Query: 107 ----------------------QVSEFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEG 143
+VS + +T+ +++SA + + YSQ L G
Sbjct: 152 SASLWKNKDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQKIELFG---A 208
Query: 144 IVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYN 203
+ V P +G+ LG + W I E V Y V + H S VLI
Sbjct: 209 VQVTPLSSGYALGSSNWIIQSHYEKVSY-VSGSSLLTTHPQPMDQASLKNSDVLILTGLT 267
Query: 204 ALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL-NYPIY 262
+ P F ++ T+R GGNVL+P +G + +LL L Y L + P Y
Sbjct: 268 QIPTANPDSMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYIDSAGLSSIPFY 327
Query: 263 FLTYVSSSTIDYVKSFLEWMG-DSITKSF--ETSRDNAFL-----LKHVTLLINKSELDN 314
F++ V++S++++ + F EW+ + TK + E +A L LKH + + N
Sbjct: 328 FISPVANSSLEFSQIFAEWLCHNKQTKVYLPEPPFPHAELIQTNKLKHYPSI--HGDFSN 385
Query: 315 APDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
P +V SL G W N V+FTE
Sbjct: 386 DFRQPCVVFTGHPSLRFGDVVHFMELWGKSSLNTVIFTE 424
>sp|Q5ZKK2|INT9_CHICK Integrator complex subunit 9 OS=Gallus gallus GN=INTS9 PE=2 SV=1
Length = 658
Score = 82.4 bits (202), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 86/339 (25%), Positives = 139/339 (41%), Gaps = 46/339 (13%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
ST+D +L+S+ + ALPY + G + V++TEP ++G L M + S +V +
Sbjct: 94 STVDVILISNYHCMM--ALPYITEYTGFTGTVYATEPTVQIGRLLMEELVNSIERVPKAQ 151
Query: 113 -----------------------------LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEG 143
+T+ ++++A + + YSQ L G
Sbjct: 152 SASTWKNKEVQRLLPAPLKDAVEVSMWRKCYTMPEVNAALSKIQLVGYSQKIELFG---A 208
Query: 144 IVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYN 203
+ V P +G+ LG + W I E V Y V + H S VLI
Sbjct: 209 VQVTPLSSGYALGSSNWIIQSHYEKVSY-VSGSSLLTTHPQPMDQASLKNSDVLILTGLT 267
Query: 204 ALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL-NYPIY 262
+ P F ++ T+R GGNVL+P +G + +LL L Y L N P Y
Sbjct: 268 QIPTANPDGMVGEFCSNLAMTVRNGGNVLVPCYPSGVIYDLLECLYQYIDSAGLSNVPFY 327
Query: 263 FLTYVSSSTIDYVKSFLEWMG-DSITKSF--ETSRDNAFL-----LKHVTLLINKSELDN 314
F++ V++S++++ + F EW+ + TK + E +A L LKH + + N
Sbjct: 328 FISPVANSSLEFSQIFAEWLCHNKQTKVYLPEPPFPHAELIQTNKLKHYPSI--HGDFSN 385
Query: 315 APDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
P ++ SL G W N V+FTE
Sbjct: 386 DFKQPCVIFTGHPSLRFGDVVHFMELWGKSSLNTVIFTE 424
>sp|Q8K114|INT9_MOUSE Integrator complex subunit 9 OS=Mus musculus GN=Ints9 PE=2 SV=1
Length = 658
Score = 82.0 bits (201), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 85/337 (25%), Positives = 143/337 (42%), Gaps = 42/337 (12%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD--QYLSRR---- 106
ST+D +L+S+ + ALPY + G + V++TEP ++G L M + ++ R
Sbjct: 94 STVDVILISNYHCMM--ALPYITEHTGFTGTVYATEPTMQIGRLLMEELVNFIERVPKAQ 151
Query: 107 ----------------------QVSEFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEG 143
+VS + +T+ +++SA + + YSQ L G
Sbjct: 152 SASLWKNKDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQKIELFG---A 208
Query: 144 IVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYN 203
+ V P +G+ LG + W I E V Y V + H S VLI
Sbjct: 209 VQVTPLSSGYALGSSNWIIQSHYEKVSY-VSGSSLLTTHPQPMDQASLKNSDVLILTGLT 267
Query: 204 ALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL-NYPIY 262
+ P F ++ T+R GGNVL+P +G + +LL L Y L N P Y
Sbjct: 268 QIPTANPDGMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYIDSAGLSNIPFY 327
Query: 263 FLTYVSSSTIDYVKSFLEWMG-DSITKSF--ETSRDNAFLLKHVTLLINKS---ELDNAP 316
F++ V++S++++ + F EW+ + +K + E +A L++ L +S + N
Sbjct: 328 FISPVANSSLEFSQIFAEWLCHNKQSKVYLPEPPFPHAELIQTNKLKHYRSIHGDFSNDF 387
Query: 317 DGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
P ++ SL G W N ++FTE
Sbjct: 388 RQPCVLFTGHPSLRFGDVVHFMELWGKSSLNTIIFTE 424
>sp|Q6DFF4|INT9_XENLA Integrator complex subunit 9 OS=Xenopus laevis GN=ints9 PE=2 SV=1
Length = 658
Score = 81.3 bits (199), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 83/345 (24%), Positives = 142/345 (41%), Gaps = 58/345 (16%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD--QYLSRRQVSE 110
ST+D +L+S+ + ALPY ++ G + V++TEP ++G L M + ++ R ++
Sbjct: 94 STVDVILISNYHCMM--ALPYITERTGFTGTVYATEPTVQIGRLLMEELVNFIERVPKAQ 151
Query: 111 ---------------------FDLFT------LDDIDSAFQSVTRLTYSQNYHLSGKGEG 143
++FT + ++++A + + YSQ L G
Sbjct: 152 SATVWKHKDVQRLLPAPLKDAVEVFTWKKCYSMQEVNAALSKIQLVGYSQKIELFGV--- 208
Query: 144 IVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYN 203
+ V P +G+ LG + W I E V Y V + H S VLI
Sbjct: 209 VQVTPLSSGYALGSSNWVIQSHYEKVSY-VSGSSLLTTHPQPMDQTSLKNSDVLILTGLT 267
Query: 204 ALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL-NYPIY 262
+ P F ++ T+R+GGNVL+P +G + +LL L Y L N P Y
Sbjct: 268 QIPTANPDGMVGEFCSNLAMTIRSGGNVLVPCYPSGVIYDLLECLYQYIDSAGLSNVPFY 327
Query: 263 FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTL----LINKSELDNAPD- 317
F++ V++S++++ + F EW+ ++ N L LI ++L + P+
Sbjct: 328 FISPVANSSLEFSQIFAEWLCH--------NKQNKVYLPEPPFPHAELIQSNKLKHYPNI 379
Query: 318 ---------GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
P +V +L G W N V+FTE
Sbjct: 380 HGDFSNDFKQPCVVFTGHPTLRFGDVVHFMELWGKSSLNTVIFTE 424
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.317 0.136 0.398
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 287,914,480
Number of Sequences: 539616
Number of extensions: 12635123
Number of successful extensions: 35233
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 50
Number of HSP's successfully gapped in prelim test: 25
Number of HSP's that attempted gapping in prelim test: 34915
Number of HSP's gapped (non-prelim): 129
length of query: 739
length of database: 191,569,459
effective HSP length: 125
effective length of query: 614
effective length of database: 124,117,459
effective search space: 76208119826
effective search space used: 76208119826
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 65 (29.6 bits)