BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 005761
(678 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|A7SBF0|INT9_NEMVE Integrator complex subunit 9 homolog OS=Nematostella vectensis
GN=ints9 PE=3 SV=1
Length = 660
Score = 229 bits (584), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 185/706 (26%), Positives = 330/706 (46%), Gaps = 88/706 (12%)
Query: 1 MKFTCLCQGGGFNFPPCHILNVSGFHVLFDCPLDLSALTVFSPLP---NDFYKAICKENS 57
MK C+ G PC +L +++ DC LD+S + F+PL N+ + + ++
Sbjct: 1 MKLYCV---GHSVSSPCLVLQFKQTNIMLDCGLDMSTVNQFTPLSLVNNEKFSQL--KSW 55
Query: 58 DSQNRQKVEKPLDANDL--------IFAEPWYKTVNNLHLWNVSFIDVVLISSPMGMLGL 109
S+ Q++E N+L I AEP L + S +DV+LIS+ ML L
Sbjct: 56 SSRELQEIEGFTAQNNLKEAGGRLFIDAEPEVCPPET-GLIDFSMVDVILISNYHHMLAL 114
Query: 110 PFLTRMEGFSAKIYITEAAARIGQLMMEELICMNMEYRQFYGAEESSGPQWMKWEELELL 169
PF+T GF+ KIY TE +IG+ +M EL+ + +G W + L
Sbjct: 115 PFITEYSGFNGKIYATEPTIQIGRDLMLELVTFAERV-----PKRRNGNMWKNDNVIRCL 169
Query: 170 PSALRKIALGEDGSELGGGCPCIAHVKDCISKVQTLRFGEEACYNGILIIKAFSSGLDIG 229
P+ L ++A + L VK CISK+Q + + E+ GIL + A SSG +G
Sbjct: 170 PAPLNELANVKSWRVLYSK----HDVKACISKIQAVSYSEKLDLCGILQLSAHSSGFCLG 225
Query: 230 ACNWIISGAKGNIAYISGSNFASGHAMDFDYRAIQGSDLILYSDLSSLDSTEDIDQSSFS 289
+ NW++ I+Y+S S+ + H + + ++ SD+++ + ++
Sbjct: 226 SSNWMLESEYEKISYLSPSSSFTTHPLPLNQTVLKNSDVLIITGVTEA------------ 273
Query: 290 DDNNNWEELMNSLSNYDESVEEMEKLAFICSCAIDSVKAGGSVLIPINRVGVFLQLLEQI 349
+ N D + E C+ +++AGG+VL+P GV L E +
Sbjct: 274 -----------PIDNPDAMLGEF------CTHLASTLRAGGNVLVPCYPSGVLYDLFECL 316
Query: 350 AIFMECSSLK-IPIYIISSVAEELLAYTNTIPEWLCKQRQEKLFSGDPLFAHVKLIKEKK 408
+++ + L +PIY IS VA+ LAY+N EWLC+ +Q K++ +P F H +L+KE +
Sbjct: 317 YTYLDNAKLGMVPIYFISPVADSSLAYSNIYGEWLCQSKQTKVYLPEPPFPHAELLKEAR 376
Query: 409 IHVFPAVHSPKLLMNWQEPCIVFSPHWSLRLGPTIHLLRRWS-GDHNSLLVLENEVDAEL 467
+ VF +H+ +++ PC+VF+ H SLR G +H + W +N+++ E +
Sbjct: 377 LKVFSNLHN-GFSSSFKTPCVVFTGHPSLRYGDAVHFMEIWGKSGNNTVIFTEPDFPYLE 435
Query: 468 AVLPFKPISMKVLQCSFLSGKKLQKVQPLLKILQPKLVLFPEEW---------RTHVSFS 518
A+ P++P++MK C + LLK LQP+ ++ PE + RT ++
Sbjct: 436 ALAPYQPLAMKTCYCPIDPRLNFAQANKLLKELQPRHLVMPESYSRPPVIHPHRTDLTIE 495
Query: 519 DVTSFSVSHYSENETIHIPSLKESAELEIAADIASKFQWRMLKQKKLNITRLKGRLFVNH 578
D S++ ++ + +P + ++ IA +++S + ++ + + L G L
Sbjct: 496 D-PGCSLTTFNHLDVAALPISRSFEKVVIANELSSCLHPQHVR-PGVAVATLTGTLVTKD 553
Query: 579 GKHQLLP---------ENEPGGSSQTRPFLH---WGSPDPENLLAELSKMGINGSVERCM 626
K+ L P +E G SS + L WG+ ++ + L K GI
Sbjct: 554 NKYTLQPLEFLVEPKAGSEGGDSSTNKGQLSRHLWGTVQLDDFVRSLKKRGITD------ 607
Query: 627 TDAESEDG-FTVKVQDPEKSMIEVRAAVTVISAADKNLASRIVKAM 671
+ ES G T+ + + + ++ R + +I+ ++ L RI A+
Sbjct: 608 VNVESSGGEHTIHLPNDDAMILLDRGSTHIITHGNEELRIRIRDAL 653
>sp|Q4R5Z4|INT9_MACFA Integrator complex subunit 9 OS=Macaca fascicularis GN=INTS9 PE=2
SV=1
Length = 637
Score = 217 bits (552), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 175/663 (26%), Positives = 305/663 (46%), Gaps = 84/663 (12%)
Query: 1 MKFTCLCQGGGFNFPPCHILNVSGFHVLFDCPLDLSALTVFSPLPNDFYKAICKENSDSQ 60
MK CL G PC++L ++ DC LD+++ F PLP ++ +
Sbjct: 1 MKLYCL---SGHPTLPCNVLKFKSTTIMLDCGLDMTSTLNFLPLP--LVQSPRLSSLPGW 55
Query: 61 NRQKVEKPLDANDLIFAEPWYKTVNNLHLWNVSFIDVVLISSPMGMLGLPFLTRMEGFSA 120
+ + LD +LI ++S +DV+LIS+ M+ LP++T GF+
Sbjct: 56 SLKDGNAFLDKTELI---------------DLSTVDVILISNYHCMMALPYITEHTGFTG 100
Query: 121 KIYITEAAARIGQLMMEELICMNMEYRQFYGAEESSGPQWMKWEELELLPSALRKIALGE 180
+Y TE +IG+L+MEEL+ N R + S W + LLPS L+
Sbjct: 101 TVYATEPTVQIGRLLMEELV--NFIERV---PKAQSASLWKNKDIQRLLPSPLK------ 149
Query: 181 DGSELGG--GCPCIAHVKDCISKVQTLRFGEEACYNGILIIKAFSSGLDIGACNWIISGA 238
D E+ C + V +SK+Q + F ++ G + + SSG +G+ NWII
Sbjct: 150 DAVEVSTWRRCYTMQEVNSALSKIQLVGFSQKIELFGAVQVTPLSSGYALGSSNWIIQSH 209
Query: 239 KGNIAYISGSNFASGHAMDFDYRAIQGSDLILYSDLSSLDSTEDIDQSSFSDDNNNWEEL 298
++Y+SGS+ + H D +++ SD+++ + L+ + +
Sbjct: 210 YEKVSYVSGSSLLTTHPQPMDQASLKNSDVLVLTGLTQIPT------------------- 250
Query: 299 MNSLSNYDESVEEMEKLAFICSCAIDSVKAGGSVLIPINRVGVFLQLLEQIAIFMECSSL 358
+N D V E CS +V+ GG+VL+P GV LLE + +++ + L
Sbjct: 251 ----ANPDGMVGEF------CSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYIDSAGL 300
Query: 359 K-IPIYIISSVAEELLAYTNTIPEWLCKQRQEKLFSGDPLFAHVKLIKEKKIHVFPAVHS 417
+P+Y IS VA L ++ EWLC +Q K++ +P F H +LI+ K+ +P++H
Sbjct: 301 SSVPLYFISPVANSSLEFSQIFAEWLCHNKQSKVYLPEPPFPHAELIQTNKLKHYPSIHG 360
Query: 418 PKLLMNWQEPCIVFSPHWSLRLGPTIHLLRRWSGDH-NSLLVLENEVDAELAVLPFKPIS 476
++++PC+VF+ H SLR G +H + W N+++ E + A+ P++P++
Sbjct: 361 -DFSNDFRQPCVVFTGHPSLRFGDVVHFMELWGKSSLNTVIFTEPDFSYLEALAPYQPLA 419
Query: 477 MKVLQCSFLSGKKLQKVQPLLKILQPKLVLFPEEW------RTHVS--FSDVTSFSVSHY 528
MK + C + +V LLK +QP V+ PE++ ++H D ++S Y
Sbjct: 420 MKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCPEQYTQPPPAQSHRMDLMIDCQPPAMS-Y 478
Query: 529 SENETIHIPSLKESAELEIAADIASKFQWRMLKQKKLNITRLKGRLFVNHGKHQLLPENE 588
E + +P + ++EI ++A +K +++ + L KH L P
Sbjct: 479 RRAEVLALPFKRRYEKIEIMPELADSLVPMEIK-PGISLATVSAVLHTKDNKHLLQPPPR 537
Query: 589 PG--GSSQTRPFLHWGSPDPENLLAELSKMGINGS--VERCMTDAESEDGFTVKVQDPEK 644
P S + R + PD ++ K ++GS VE+ + E +KV+D K
Sbjct: 538 PAQPTSGKKRKRVSDDVPD-----CKVLKPLLSGSIPVEQFVQTLEKHGFSDIKVEDTAK 592
Query: 645 SMI 647
I
Sbjct: 593 GHI 595
>sp|Q9NV88|INT9_HUMAN Integrator complex subunit 9 OS=Homo sapiens GN=INTS9 PE=1 SV=2
Length = 658
Score = 214 bits (545), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 174/670 (25%), Positives = 311/670 (46%), Gaps = 77/670 (11%)
Query: 1 MKFTCLCQGGGFNFPPCHILNVSGFHVLFDCPLDLSALTVFSPLP-------NDFYKAIC 53
MK CL G PC++L ++ DC LD+++ F PLP ++
Sbjct: 1 MKLYCL---SGHPTLPCNVLKFKSTTIMLDCGLDMTSTLNFLPLPLVQSPRLSNLPGWSL 57
Query: 54 KENSDSQNRQKVEKPLDANDLIFAEPWYKTVNNLHLWNVSFIDVVLISSPMGMLGLPFLT 113
K+ + +++ E + + + P + + L ++S +DV+LIS+ M+ LP++T
Sbjct: 58 KDGNAFLDKELKE--CSGHVFVDSVPEF-CLPETELIDLSTVDVILISNYHCMMALPYIT 114
Query: 114 RMEGFSAKIYITEAAARIGQLMMEELICMNMEYRQFYGAEESSGPQWMKWEELELLPSAL 173
GF+ +Y TE +IG+L+MEEL+ N R + S W + LLPS L
Sbjct: 115 EHTGFTGTVYATEPTVQIGRLLMEELV--NFIERV---PKAQSASLWKNKDIQRLLPSPL 169
Query: 174 RKIALGEDGSELGG--GCPCIAHVKDCISKVQTLRFGEEACYNGILIIKAFSSGLDIGAC 231
+ D E+ C + V +SK+Q + + ++ G + + SSG +G+
Sbjct: 170 K------DAVEVSTWRRCYTMQEVNSALSKIQLVGYSQKIELFGAVQVTPLSSGYALGSS 223
Query: 232 NWIISGAKGNIAYISGSNFASGHAMDFDYRAIQGSDLILYSDLSSLDSTEDIDQSSFSDD 291
NWII ++Y+SGS+ + H D +++ SD+++ + L+ + +
Sbjct: 224 NWIIQSHYEKVSYVSGSSLLTTHPQPMDQASLKNSDVLVLTGLTQIPT------------ 271
Query: 292 NNNWEELMNSLSNYDESVEEMEKLAFICSCAIDSVKAGGSVLIPINRVGVFLQLLEQIAI 351
+N D V E CS +V+ GG+VL+P GV LLE +
Sbjct: 272 -----------ANPDGMVGEF------CSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQ 314
Query: 352 FMECSSLK-IPIYIISSVAEELLAYTNTIPEWLCKQRQEKLFSGDPLFAHVKLIKEKKIH 410
+++ + L +P+Y IS VA L ++ EWLC +Q K++ +P F H +LI+ K+
Sbjct: 315 YIDSAGLSSVPLYFISPVANSSLEFSQIFAEWLCHNKQSKVYLPEPPFPHAELIQTNKLK 374
Query: 411 VFPAVHSPKLLMNWQEPCIVFSPHWSLRLGPTIHLLRRWSGDH-NSLLVLENEVDAELAV 469
+P++H ++++PC+VF+ H SLR G +H + W N+++ E + A+
Sbjct: 375 HYPSIHG-DFSNDFRQPCVVFTGHPSLRFGDVVHFMELWGKSSLNTVIFTEPDFSYLEAL 433
Query: 470 LPFKPISMKVLQCSFLSGKKLQKVQPLLKILQPKLVLFPEEW------RTHVS--FSDVT 521
P++P++MK + C + +V LLK +QP V+ PE++ ++H D
Sbjct: 434 APYQPLAMKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCPEQYTQPPPAQSHRMDLMIDCQ 493
Query: 522 SFSVSHYSENETIHIPSLKESAELEIAADIASKFQWRMLKQKKLNITRLKGRLFVNHGKH 581
++S Y E + +P + ++EI ++A +K +++ + L KH
Sbjct: 494 PPAMS-YRRAEVLALPFKRRYEKIEIMPELADSLVPMEIK-PGISLATVSAVLHTKDNKH 551
Query: 582 QLLPENEPG--GSSQTRPFLHWGSPDPENLLAELSKMGINGS--VERCMTDAESEDGFTV 637
L P P S + R + PD ++ K ++GS VE+ + E +
Sbjct: 552 LLQPPPRPAQPTSGKKRKRVSDDVPD-----CKVLKPLLSGSIPVEQFVQTLEKHGFSDI 606
Query: 638 KVQDPEKSMI 647
KV+D K I
Sbjct: 607 KVEDTAKGHI 616
>sp|Q5ZKK2|INT9_CHICK Integrator complex subunit 9 OS=Gallus gallus GN=INTS9 PE=2 SV=1
Length = 658
Score = 213 bits (543), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 156/599 (26%), Positives = 280/599 (46%), Gaps = 66/599 (11%)
Query: 1 MKFTCLCQGGGFNFPPCHILNVSGFHVLFDCPLDLSALTVFSPLP-------NDFYKAIC 53
MK CL G PC++L ++ DC LD+++ F PLP + +
Sbjct: 1 MKLYCL---SGHPTLPCNVLKFKSTTIMLDCGLDMTSTLNFLPLPLVQSPRLSKLPGLVL 57
Query: 54 KENSDSQNRQKVEKPLDANDLIFAEPWYKTVNNLHLWNVSFIDVVLISSPMGMLGLPFLT 113
K+ S +++ E + + + P + + L ++S +DV+LIS+ M+ LP++T
Sbjct: 58 KDGSTFLDKELKE--CSGHVFVDSVPEF-CLPETELLDLSTVDVILISNYHCMMALPYIT 114
Query: 114 RMEGFSAKIYITEAAARIGQLMMEELICMNMEYRQFYGAEESSGPQWMKWEELELLPSAL 173
GF+ +Y TE +IG+L+MEEL+ N R + S W E LLP+ L
Sbjct: 115 EYTGFTGTVYATEPTVQIGRLLMEELV--NSIERV---PKAQSASTWKNKEVQRLLPAPL 169
Query: 174 RKIALGEDGSELG--GGCPCIAHVKDCISKVQTLRFGEEACYNGILIIKAFSSGLDIGAC 231
+ D E+ C + V +SK+Q + + ++ G + + SSG +G+
Sbjct: 170 K------DAVEVSMWRKCYTMPEVNAALSKIQLVGYSQKIELFGAVQVTPLSSGYALGSS 223
Query: 232 NWIISGAKGNIAYISGSNFASGHAMDFDYRAIQGSDLILYSDLSSLDSTEDIDQSSFSDD 291
NWII ++Y+SGS+ + H D +++ SD+++ + L+ + +
Sbjct: 224 NWIIQSHYEKVSYVSGSSLLTTHPQPMDQASLKNSDVLILTGLTQIPT------------ 271
Query: 292 NNNWEELMNSLSNYDESVEEMEKLAFICSCAIDSVKAGGSVLIPINRVGVFLQLLEQIAI 351
+N D V E CS +V+ GG+VL+P GV LLE +
Sbjct: 272 -----------ANPDGMVGEF------CSNLAMTVRNGGNVLVPCYPSGVIYDLLECLYQ 314
Query: 352 FMECSSL-KIPIYIISSVAEELLAYTNTIPEWLCKQRQEKLFSGDPLFAHVKLIKEKKIH 410
+++ + L +P Y IS VA L ++ EWLC +Q K++ +P F H +LI+ K+
Sbjct: 315 YIDSAGLSNVPFYFISPVANSSLEFSQIFAEWLCHNKQTKVYLPEPPFPHAELIQTNKLK 374
Query: 411 VFPAVHSPKLLMNWQEPCIVFSPHWSLRLGPTIHLLRRWSGDH-NSLLVLENEVDAELAV 469
+P++H ++++PC++F+ H SLR G +H + W N+++ E + A+
Sbjct: 375 HYPSIHG-DFSNDFKQPCVIFTGHPSLRFGDVVHFMELWGKSSLNTVIFTEPDFSYLDAL 433
Query: 470 LPFKPISMKVLQCSFLSGKKLQKVQPLLKILQPKLVLFPEEW-RTHVSFSDVTSFSVS-- 526
P++P++MK + C + +V LLK +QP V+ PE++ + S S T +
Sbjct: 434 APYQPLAMKCVYCPIDTRLNFIQVSKLLKEVQPLHVVCPEQYTQPPPSQSHRTDLMIDCQ 493
Query: 527 ----HYSENETIHIPSLKESAELEIAADIASKFQWRMLKQKKLNITRLKGRLFVNHGKH 581
Y E + +P + ++EI ++A +K +++ + L KH
Sbjct: 494 PPPMSYRRAEVLTLPYKRRYEKIEIMPELADSLVPLEIK-PGISLATVSAMLHTKDNKH 551
>sp|Q6DFF4|INT9_XENLA Integrator complex subunit 9 OS=Xenopus laevis GN=ints9 PE=2 SV=1
Length = 658
Score = 213 bits (542), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 182/705 (25%), Positives = 317/705 (44%), Gaps = 96/705 (13%)
Query: 1 MKFTCLCQGGGFNFPPCHILNVSGFHVLFDCPLDLSALTVFSPLPNDFYKAICK----EN 56
MK CL G PC+IL ++ DC LD+++ F PLP + K
Sbjct: 1 MKLYCL---SGHPTLPCNILKFKSSTIMLDCGLDMTSTLSFLPLPLVHSTRLSKLPGWVT 57
Query: 57 SDSQNR-QKVEKPLDANDLIFAEPWYKTVNNLHLWNVSFIDVVLISSPMGMLGLPFLTRM 115
D N+ +K K + + P + + L ++S +DV+LIS+ M+ LP++T
Sbjct: 58 KDGNNQFEKELKECSGRVFVDSVPEF-CLPETELIDLSTVDVILISNYHCMMALPYITER 116
Query: 116 EGFSAKIYITEAAARIGQLMMEELICMNMEYRQFYGAEESSGPQWMKWEELELLPSALRK 175
GF+ +Y TE +IG+L+MEEL+ N R + S W + LLP+ L+
Sbjct: 117 TGFTGTVYATEPTVQIGRLLMEELV--NFIERV---PKAQSATVWKHKDVQRLLPAPLK- 170
Query: 176 IALGEDGSELGG--GCPCIAHVKDCISKVQTLRFGEEACYNGILIIKAFSSGLDIGACNW 233
D E+ C + V +SK+Q + + ++ G++ + SSG +G+ NW
Sbjct: 171 -----DAVEVFTWKKCYSMQEVNAALSKIQLVGYSQKIELFGVVQVTPLSSGYALGSSNW 225
Query: 234 IISGAKGNIAYISGSNFASGHAMDFDYRAIQGSDLILYSDLSSLDSTEDIDQSSFSDDNN 293
+I ++Y+SGS+ + H D +++ SD+++ + L+ + +
Sbjct: 226 VIQSHYEKVSYVSGSSLLTTHPQPMDQTSLKNSDVLILTGLTQIPT-------------- 271
Query: 294 NWEELMNSLSNYDESVEEMEKLAFICSCAIDSVKAGGSVLIPINRVGVFLQLLEQIAIFM 353
+N D V E CS ++++GG+VL+P GV LLE + ++
Sbjct: 272 ---------ANPDGMVGEF------CSNLAMTIRSGGNVLVPCYPSGVIYDLLECLYQYI 316
Query: 354 ECSSL-KIPIYIISSVAEELLAYTNTIPEWLCKQRQEKLFSGDPLFAHVKLIKEKKIHVF 412
+ + L +P Y IS VA L ++ EWLC +Q K++ +P F H +LI+ K+ +
Sbjct: 317 DSAGLSNVPFYFISPVANSSLEFSQIFAEWLCHNKQNKVYLPEPPFPHAELIQSNKLKHY 376
Query: 413 PAVHSPKLLMNWQEPCIVFSPHWSLRLGPTIHLLRRWSGDH-NSLLVLENEVDAELAVLP 471
P +H ++++PC+VF+ H +LR G +H + W N+++ E + A+ P
Sbjct: 377 PNIHG-DFSNDFKQPCVVFTGHPTLRFGDVVHFMELWGKSSLNTVIFTEPDFSYLDALAP 435
Query: 472 FKPISMKVLQCSFLSGKKLQKVQPLLKILQPKLVLFPEEW------RTHVS--FSDVTSF 523
++P++MK + C + +V LLK +QP V+ PE++ ++H S D
Sbjct: 436 YQPLAMKCVYCPIDTRLNFIQVTKLLKEVQPLHVVCPEQYTQPPATQSHRSDLMIDCQPP 495
Query: 524 SVSHYSENETIHIPSLKESAELEIAADIASKFQWRMLKQKKLNITRLKGRLFVNHGKHQL 583
+S Y E + +P + ++EI ++A +K +++ + L KH L
Sbjct: 496 PMS-YHRAEVLTLPFKRRYEKIEIMPELAQSLVPFEMK-PGVSLATVSAVLHSKDNKHVL 553
Query: 584 ----------------LPENEPGGSSQTRPF--LHWGSPDPENLLAELSKMGINGSVERC 625
P E S +T PF L GS E + L K G
Sbjct: 554 QPPPKPVAPPGSKKRKRPAEE---SPETPPFKPLLSGSIPVEQFVQTLEKNG-------- 602
Query: 626 MTDAESED---GFTVKVQDPEKSMIEVRAAVTVISAADKNLASRI 667
+D + ED G V +Q+ E + + +I D+ L R+
Sbjct: 603 FSDVKIEDTAKGHIVHLQEAETLIQFEEDSTHIICEHDERLRVRL 647
>sp|Q2KJA6|INT9_BOVIN Integrator complex subunit 9 OS=Bos taurus GN=INTS9 PE=2 SV=1
Length = 658
Score = 209 bits (532), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 157/602 (26%), Positives = 280/602 (46%), Gaps = 68/602 (11%)
Query: 1 MKFTCLCQGGGFNFPPCHILNVSGFHVLFDCPLDLSALTVFSPLP-------NDFYKAIC 53
MK CL G PC++L ++ DC LD+++ F PLP ++
Sbjct: 1 MKLYCL---SGHPTLPCNVLKFKSTTIMLDCGLDMTSTLNFLPLPLVQSPRLSNLPGWSL 57
Query: 54 KENSDSQNRQKVEKPLDANDLIFAEPWYKTVNNLHLWNVSFIDVVLISSPMGMLGLPFLT 113
K+ + +++ E + + + P + + L ++S +DV+LIS+ M+ LP++T
Sbjct: 58 KDGNAFLDKELKE--CSGHVFVDSVPEF-CLPETELIDLSTVDVILISNYHCMMALPYIT 114
Query: 114 RMEGFSAKIYITEAAARIGQLMMEELICMNMEYRQFYGAEESSGPQWMKWEELELLPSAL 173
GF+ +Y TE +IG+L+MEEL+ N R + S W + LLPS L
Sbjct: 115 EHTGFTGTVYATEPTVQIGRLLMEELV--NFIERV---PKAQSASLWKNKDIQRLLPSPL 169
Query: 174 RKIALGEDGSELGG--GCPCIAHVKDCISKVQTLRFGEEACYNGILIIKAFSSGLDIGAC 231
+ D E+ C + V +SK+Q + + ++ G + + SSG +G+
Sbjct: 170 K------DAVEVSTWRRCYTMQEVNSALSKIQLVGYSQKIELFGAVQVTPLSSGYALGSS 223
Query: 232 NWIISGAKGNIAYISGSNFASGHAMDFDYRAIQGSDLILYSDLSSLDSTEDIDQSSFSDD 291
NWII ++Y+SGS+ + H D +++ SD+++ + L+ + +
Sbjct: 224 NWIIQSHYEKVSYVSGSSLLTTHPQPMDQASLKNSDVLILTGLTQIPT------------ 271
Query: 292 NNNWEELMNSLSNYDESVEEMEKLAFICSCAIDSVKAGGSVLIPINRVGVFLQLLEQIAI 351
+N D V E CS +V+ GG+VL+P GV LLE +
Sbjct: 272 -----------ANPDSMVGEF------CSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQ 314
Query: 352 FMECSSLK-IPIYIISSVAEELLAYTNTIPEWLCKQRQEKLFSGDPLFAHVKLIKEKKIH 410
+++ + L IP Y IS VA L ++ EWLC +Q K++ +P F H +LI+ K+
Sbjct: 315 YIDSAGLSSIPFYFISPVANSSLEFSQIFAEWLCHNKQTKVYLPEPPFPHAELIQTNKLK 374
Query: 411 VFPAVHSPKLLMNWQEPCIVFSPHWSLRLGPTIHLLRRWSGDH-NSLLVLENEVDAELAV 469
+P++H ++++PC+VF+ H SLR G +H + W N+++ E + A+
Sbjct: 375 HYPSIHG-DFSNDFRQPCVVFTGHPSLRFGDVVHFMELWGKSSLNTVIFTEPDFSYLEAL 433
Query: 470 LPFKPISMKVLQCSFLSGKKLQKVQPLLKILQPKLVLFPEEWRTHVS--------FSDVT 521
P++P++MK + C + +V LLK +QP V+ PE++ D
Sbjct: 434 APYQPLAMKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCPEQYTQPTPAQSHRMDLMVDCQ 493
Query: 522 SFSVSHYSENETIHIPSLKESAELEIAADIASKFQWRMLKQKKLNITRLKGRLFVNHGKH 581
++S Y E + +P + ++EI ++A +K +++ + L KH
Sbjct: 494 PPAMS-YRRAEVLALPFKRRYEKIEIMPELADSLVPMEIK-PGISLATVSAVLHTKDNKH 551
Query: 582 QL 583
L
Sbjct: 552 VL 553
>sp|Q8K114|INT9_MOUSE Integrator complex subunit 9 OS=Mus musculus GN=Ints9 PE=2 SV=1
Length = 658
Score = 203 bits (517), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 173/670 (25%), Positives = 310/670 (46%), Gaps = 77/670 (11%)
Query: 1 MKFTCLCQGGGFNFPPCHILNVSGFHVLFDCPLDLSALTVFSPLP-------NDFYKAIC 53
MK CL G PC++L ++ DC LD+++ F PLP ++
Sbjct: 1 MKLYCL---SGHPTLPCNVLKFKSTTIMLDCGLDMTSTLNFLPLPLVQSPRLSNLPGWSL 57
Query: 54 KENSDSQNRQKVEKPLDANDLIFAEPWYKTVNNLHLWNVSFIDVVLISSPMGMLGLPFLT 113
K+ + +++ E + + + P + + L ++S +DV+LIS+ M+ LP++T
Sbjct: 58 KDGNAFLDKELKE--CSGHVFVDSVPEF-CLPETELIDLSTVDVILISNYHCMMALPYIT 114
Query: 114 RMEGFSAKIYITEAAARIGQLMMEELICMNMEYRQFYGAEESSGPQWMKWEELELLPSAL 173
GF+ +Y TE +IG+L+MEEL+ N R + S W + LLPS L
Sbjct: 115 EHTGFTGTVYATEPTMQIGRLLMEELV--NFIERV---PKAQSASLWKNKDIQRLLPSPL 169
Query: 174 RKIALGEDGSELGG--GCPCIAHVKDCISKVQTLRFGEEACYNGILIIKAFSSGLDIGAC 231
+ D E+ C + V +SK+Q + + ++ G + + SSG +G+
Sbjct: 170 K------DAVEVSTWRRCYTMQEVNSALSKIQLVGYSQKIELFGAVQVTPLSSGYALGSS 223
Query: 232 NWIISGAKGNIAYISGSNFASGHAMDFDYRAIQGSDLILYSDLSSLDSTEDIDQSSFSDD 291
NWII ++Y+SGS+ + H D +++ SD+++ + L+ + +
Sbjct: 224 NWIIQSHYEKVSYVSGSSLLTTHPQPMDQASLKNSDVLILTGLTQIPT------------ 271
Query: 292 NNNWEELMNSLSNYDESVEEMEKLAFICSCAIDSVKAGGSVLIPINRVGVFLQLLEQIAI 351
+N D V E CS +V+ GG+VL+P GV LLE +
Sbjct: 272 -----------ANPDGMVGEF------CSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQ 314
Query: 352 FMECSSL-KIPIYIISSVAEELLAYTNTIPEWLCKQRQEKLFSGDPLFAHVKLIKEKKIH 410
+++ + L IP Y IS VA L ++ EWLC +Q K++ +P F H +LI+ K+
Sbjct: 315 YIDSAGLSNIPFYFISPVANSSLEFSQIFAEWLCHNKQSKVYLPEPPFPHAELIQTNKLK 374
Query: 411 VFPAVHSPKLLMNWQEPCIVFSPHWSLRLGPTIHLLRRWSGDH-NSLLVLENEVDAELAV 469
+ ++H ++++PC++F+ H SLR G +H + W N+++ E + A+
Sbjct: 375 HYRSIHG-DFSNDFRQPCVLFTGHPSLRFGDVVHFMELWGKSSLNTIIFTEPDFSYLEAL 433
Query: 470 LPFKPISMKVLQCSFLSGKKLQKVQPLLKILQPKLVLFPEEW------RTHVS--FSDVT 521
P++P++MK + C + +V LLK +QP V+ PE++ + H D
Sbjct: 434 APYQPLAMKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCPEQYTQPPPAQAHRMDLMIDCQ 493
Query: 522 SFSVSHYSENETIHIPSLKESAELEIAADIASKFQWRMLKQKKLNITRLKGRLFVNHGKH 581
++S Y E + +P + ++EI ++A +K +++ + L KH
Sbjct: 494 PPAMS-YRRAEVLALPFKRRYEKIEIMPELADSLVPMEIK-PGISLATVSAVLHTKDNKH 551
Query: 582 QL--LPENEPGGSSQTRPFLHWGSPDPENLLAELSKMGINGS--VERCMTDAESEDGFTV 637
L P+ SS+ R ++ PD ++ K ++GS VE+ + E +
Sbjct: 552 VLQPPPKPTQPTSSKKRKRVNEDIPD-----CKVLKPLLSGSIPVEQFVQTLEKHGFSDI 606
Query: 638 KVQDPEKSMI 647
KV+D K I
Sbjct: 607 KVEDTAKGHI 616
>sp|Q54SH0|INT9_DICDI Integrator complex subunit 9 homolog OS=Dictyostelium discoideum
GN=ints9 PE=3 SV=1
Length = 712
Score = 164 bits (415), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 143/561 (25%), Positives = 252/561 (44%), Gaps = 94/561 (16%)
Query: 1 MKFTCLCQGGGFNFPPCHILNVSGFHVLFDCPLDLSALTVFSP----------------- 43
MK CL Q PC +L +L DC L++S++ F P
Sbjct: 1 MKVHCLSQSAQ---SPCFLLEYKNVKILLDCALEISSILHFLPKNLNYNNNNNNNNNNNN 57
Query: 44 ---------LPNDFYKAICKENSDSQNRQKVEKPL----DANDLIFAEPWYKTVNNLHLW 90
N+ Y K+ +Q + + L +++ + P ++ +++
Sbjct: 58 NNNNNNNNNNNNNSYSFKEKDKELNQFFKNINGTLYIDNGCSNIKYNCPQFEMIDDF--- 114
Query: 91 NVSFIDVVLISSPMGMLGLPFLTRMEGFSAKIYITEAAARIGQLMMEELICMNMEYRQFY 150
S ID++LIS+ + LPF+T F KIY TE +IG+L++EEL+ M+ +Y
Sbjct: 115 --STIDMILISNYTNIYALPFITEYTNFQGKIYATEPTVQIGKLLLEELVQMDKQYSNSS 172
Query: 151 GAEESSGPQWMK-WEELELLPS------ALRKIALGEDGSELGGGCPCIAHVKDCISKVQ 203
++ W+ +E+L + L D I ++ K+Q
Sbjct: 173 INNNNNNNNLSDCWQNIEILEKLNVHNVGMENENLYRDSYRWKDLYKKID-IEKSFEKIQ 231
Query: 204 TLRFGEEACYNGILIIKAFSSGLDIGACNWIISGAKG--NIAYISGSNFA-SGHAMDFDY 260
++RF E + G I + SSG +G+ NW+I +KG + YIS S+ + S + F
Sbjct: 232 SIRFNESIKHYGFECIPS-SSGYGLGSANWVIE-SKGFERVVYISDSSLSLSRYPTPFQL 289
Query: 261 RAIQGSDLILYSDLSSLDSTEDIDQSSFSDDNNNWEELMNSLSNYDESVEEMEKLAFICS 320
I D+++ S ++ NN +++++ L CS
Sbjct: 290 SPIDNPDVLILSKINHY-------------PNNPPDQMLSEL----------------CS 320
Query: 321 CAIDSVKAGGSVLIPINRVGVFLQLLEQIAIFMECSSLK-IPIYIISSVAEELLAYTNTI 379
+++ GG+VLIP G+ L L E +A ++ L +PIY +SSV++ +L+Y +
Sbjct: 321 NIGSTLQQGGTVLIPSYSCGIILDLFEHLADYLNKVGLPYVPIYFVSSVSKAVLSYADIY 380
Query: 380 PEWLCKQRQEKLFSGDPLFAHVKLIKEKKIHVFPAVHSPKLLMNWQ--EPCIVFSPHWSL 437
EWL K +QE+ F + F H L+++ + + VHS N+Q +PCI+F+ H S
Sbjct: 381 SEWLNKSKQERAFMPETPFLHQDLMRKGQFQAYQHVHS-----NFQANDPCIIFTGHPSC 435
Query: 438 RLGPTIHLLRRWSGDHNSLLVLENEVDAELAVLPFKPISMKVLQCSFLSGK---KLQKVQ 494
R+G L++ + NS+L++E + D + VLPF S ++ + FL +
Sbjct: 436 RIGDITTLIKLYDNPKNSILLIEPDFDFKSTVLPF---SKQISRIQFLPIDPRINFNEAN 492
Query: 495 PLLKILQPKLVLFPEEWRTHV 515
L+ L PK ++ P ++ +V
Sbjct: 493 LLISKLSPKHLIIPRIYKNYV 513
>sp|Q3MHC2|INT11_RAT Integrator complex subunit 11 OS=Rattus norvegicus GN=Cpsf3l PE=2
SV=1
Length = 600
Score = 61.6 bits (148), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 77/368 (20%), Positives = 136/368 (36%), Gaps = 79/368 (21%)
Query: 94 FIDVVLISS-PMGMLG-LPFLTRMEGFSAKIYITEAAARIGQLMMEELICMNMEYRQFYG 151
F+D V+IS + G LP+ + M G+ IY+T I +++E+
Sbjct: 60 FLDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDY------------ 107
Query: 152 AEESSGPQWMKWEELELLPSALRKIALGEDGSELGGGCPCIAHVKDCISKVQTLRFGEEA 211
RKIA+ + G +KDC+ KV + +
Sbjct: 108 ----------------------RKIAVDKKGE---ANFFTSQMIKDCMKKVVAVHLHQTV 142
Query: 212 CYNGILIIKAFSSGLDIGACNWIISGAKGNIAYISGSNFASGHAMDFDYRAIQGSDLILY 271
+ L IKA+ +G +GA + I ++ Y N + + +L++
Sbjct: 143 QVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLI- 201
Query: 272 SDLSSLDSTEDIDQSSFSDDNNNWEELMNSLSNYDESVEEMEKLAFICSCAIDSVKAGGS 331
TE ++ D E + L E+VE GG
Sbjct: 202 --------TESTYATTIRDSKRCRER--DFLKKVHETVER-----------------GGK 234
Query: 332 VLIPINRVGVFLQLLEQIAIFMECSSLKIPIYIISSVAEELLAYTNTIPEWLCKQRQEKL 391
VLIP+ +G +L + F E +LK+PIY + + E+ Y W Q+ K
Sbjct: 235 VLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFITW-TNQKIRKT 293
Query: 392 FSGDPLFAHVKLIKEKKIHVFPAVHSPKLLMNWQEPCIVFSPHWSLRLGPTIHLLRRWSG 451
F +F K I F + + P +VF+ L G ++ + R+W+G
Sbjct: 294 FVQRNMFEF------KHIKAF-----DRTFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 452 DHNSLLVL 459
+ +++++
Sbjct: 343 NEKNMVIM 350
>sp|Q9CWS4|INT11_MOUSE Integrator complex subunit 11 OS=Mus musculus GN=Cpsf3l PE=2 SV=1
Length = 600
Score = 61.6 bits (148), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 77/368 (20%), Positives = 136/368 (36%), Gaps = 79/368 (21%)
Query: 94 FIDVVLISS-PMGMLG-LPFLTRMEGFSAKIYITEAAARIGQLMMEELICMNMEYRQFYG 151
F+D V+IS + G LP+ + M G+ IY+T I +++E+
Sbjct: 60 FLDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDY------------ 107
Query: 152 AEESSGPQWMKWEELELLPSALRKIALGEDGSELGGGCPCIAHVKDCISKVQTLRFGEEA 211
RKIA+ + G +KDC+ KV + +
Sbjct: 108 ----------------------RKIAVDKKGE---ANFFTSQMIKDCMKKVVAVHLHQTV 142
Query: 212 CYNGILIIKAFSSGLDIGACNWIISGAKGNIAYISGSNFASGHAMDFDYRAIQGSDLILY 271
+ L IKA+ +G +GA + I ++ Y N + + +L++
Sbjct: 143 QVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLI- 201
Query: 272 SDLSSLDSTEDIDQSSFSDDNNNWEELMNSLSNYDESVEEMEKLAFICSCAIDSVKAGGS 331
TE ++ D E + L E+VE GG
Sbjct: 202 --------TESTYATTIRDSKRCRER--DFLKKVHETVER-----------------GGK 234
Query: 332 VLIPINRVGVFLQLLEQIAIFMECSSLKIPIYIISSVAEELLAYTNTIPEWLCKQRQEKL 391
VLIP+ +G +L + F E +LK+PIY + + E+ Y W Q+ K
Sbjct: 235 VLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFITW-TNQKIRKT 293
Query: 392 FSGDPLFAHVKLIKEKKIHVFPAVHSPKLLMNWQEPCIVFSPHWSLRLGPTIHLLRRWSG 451
F +F K I F + + P +VF+ L G ++ + R+W+G
Sbjct: 294 FVQRNMFEF------KHIKAF-----DRTFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 452 DHNSLLVL 459
+ +++++
Sbjct: 343 NEKNMVIM 350
>sp|Q5TA45|INT11_HUMAN Integrator complex subunit 11 OS=Homo sapiens GN=CPSF3L PE=1 SV=2
Length = 600
Score = 61.6 bits (148), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 77/370 (20%), Positives = 136/370 (36%), Gaps = 79/370 (21%)
Query: 92 VSFIDVVLISS-PMGMLG-LPFLTRMEGFSAKIYITEAAARIGQLMMEELICMNMEYRQF 149
F+D V+IS + G LP+ + M G+ IY+T I +++E+
Sbjct: 58 TDFLDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDY---------- 107
Query: 150 YGAEESSGPQWMKWEELELLPSALRKIALGEDGSELGGGCPCIAHVKDCISKVQTLRFGE 209
RKIA+ + G +KDC+ KV + +
Sbjct: 108 ------------------------RKIAVDKKGE---ANFFTSQMIKDCMKKVVAVHLHQ 140
Query: 210 EACYNGILIIKAFSSGLDIGACNWIISGAKGNIAYISGSNFASGHAMDFDYRAIQGSDLI 269
+ L IKA+ +G +GA + I ++ Y N + + +L+
Sbjct: 141 TVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLL 200
Query: 270 LYSDLSSLDSTEDIDQSSFSDDNNNWEELMNSLSNYDESVEEMEKLAFICSCAIDSVKAG 329
+ TE ++ D E + L E+VE G
Sbjct: 201 I---------TESTYATTIRDSKRCRER--DFLKKVHETVER-----------------G 232
Query: 330 GSVLIPINRVGVFLQLLEQIAIFMECSSLKIPIYIISSVAEELLAYTNTIPEWLCKQRQE 389
G VLIP+ +G +L + F E +LK+PIY + + E+ Y W Q+
Sbjct: 233 GKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPW-TNQKIR 291
Query: 390 KLFSGDPLFAHVKLIKEKKIHVFPAVHSPKLLMNWQEPCIVFSPHWSLRLGPTIHLLRRW 449
K F +F K I F + + P +VF+ L G ++ + R+W
Sbjct: 292 KTFVQRNMFEF------KHIKAF-----DRAFADNPGPMVVFATPGMLHAGQSLQIFRKW 340
Query: 450 SGDHNSLLVL 459
+G+ +++++
Sbjct: 341 AGNEKNMVIM 350
>sp|Q5NVE6|INT11_PONAB Integrator complex subunit 11 OS=Pongo abelii GN=CPSF3L PE=2 SV=2
Length = 600
Score = 61.6 bits (148), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 77/370 (20%), Positives = 136/370 (36%), Gaps = 79/370 (21%)
Query: 92 VSFIDVVLISS-PMGMLG-LPFLTRMEGFSAKIYITEAAARIGQLMMEELICMNMEYRQF 149
F+D V+IS + G LP+ + M G+ IY+T I +++E+
Sbjct: 58 TDFLDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDY---------- 107
Query: 150 YGAEESSGPQWMKWEELELLPSALRKIALGEDGSELGGGCPCIAHVKDCISKVQTLRFGE 209
RKIA+ + G +KDC+ KV + +
Sbjct: 108 ------------------------RKIAVDKKGE---ANFFTSQMIKDCMKKVVAVHLHQ 140
Query: 210 EACYNGILIIKAFSSGLDIGACNWIISGAKGNIAYISGSNFASGHAMDFDYRAIQGSDLI 269
+ L IKA+ +G +GA + I ++ Y N + + +L+
Sbjct: 141 TVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLL 200
Query: 270 LYSDLSSLDSTEDIDQSSFSDDNNNWEELMNSLSNYDESVEEMEKLAFICSCAIDSVKAG 329
+ TE ++ D E + L E+VE G
Sbjct: 201 I---------TESTYATTIRDSKRCRER--DFLKKVHETVER-----------------G 232
Query: 330 GSVLIPINRVGVFLQLLEQIAIFMECSSLKIPIYIISSVAEELLAYTNTIPEWLCKQRQE 389
G VLIP+ +G +L + F E +LK+PIY + + E+ Y W Q+
Sbjct: 233 GKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPW-TNQKIR 291
Query: 390 KLFSGDPLFAHVKLIKEKKIHVFPAVHSPKLLMNWQEPCIVFSPHWSLRLGPTIHLLRRW 449
K F +F K I F + + P +VF+ L G ++ + R+W
Sbjct: 292 KTFVQRNMFEF------KHIKAF-----DRAFADNPGPMVVFATPGMLHAGQSLQIFRKW 340
Query: 450 SGDHNSLLVL 459
+G+ +++++
Sbjct: 341 AGNEKNMVIM 350
>sp|Q2YDM2|INT11_BOVIN Integrator complex subunit 11 OS=Bos taurus GN=CPSF3L PE=2 SV=2
Length = 599
Score = 60.1 bits (144), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 78/368 (21%), Positives = 134/368 (36%), Gaps = 79/368 (21%)
Query: 94 FIDVVLISS-PMGMLG-LPFLTRMEGFSAKIYITEAAARIGQLMMEELICMNMEYRQFYG 151
F+D V+IS + G LP+ + M G+ IY+T+ I +++E+
Sbjct: 60 FLDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTQPTQAICPILLEDY------------ 107
Query: 152 AEESSGPQWMKWEELELLPSALRKIALGEDGSELGGGCPCIAHVKDCISKVQTLRFGEEA 211
RKIA+ + G +KDC+ KV + +
Sbjct: 108 ----------------------RKIAVDKKGE---ANFFTSQMIKDCMKKVVAVHLHQTV 142
Query: 212 CYNGILIIKAFSSGLDIGACNWIISGAKGNIAYISGSNFASGHAMDFDYRAIQGSDLILY 271
+ L IKA+ +G +GA + I ++ Y N + + +
Sbjct: 143 QVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAW---------ID 193
Query: 272 SDLSSLDSTEDIDQSSFSDDNNNWEELMNSLSNYDESVEEMEKLAFICSCAIDSVKAGGS 331
SL TE ++ D E + L E+VE GG
Sbjct: 194 KCRPSLLITESTYATTIRDSKRCRER--DFLKKVHETVER-----------------GGK 234
Query: 332 VLIPINRVGVFLQLLEQIAIFMECSSLKIPIYIISSVAEELLAYTNTIPEWLCKQRQEKL 391
VLIP+ +G +L + F E LK PIY + + E+ Y W Q+ K
Sbjct: 235 VLIPVFALGRAQELCILLETFWERMDLKAPIYFSTGLTEKANHYYKLFIPW-TNQKIRKT 293
Query: 392 FSGDPLFAHVKLIKEKKIHVFPAVHSPKLLMNWQEPCIVFSPHWSLRLGPTIHLLRRWSG 451
F +F K I F + + P +VF+ L G ++ + R+W+G
Sbjct: 294 FVQRNMFEF------KHIKAF-----DRAFADSPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 452 DHNSLLVL 459
+ +++++
Sbjct: 343 NEKNMVIM 350
>sp|Q5ZIH0|INT11_CHICK Integrator complex subunit 11 OS=Gallus gallus GN=CPSF3L PE=2 SV=1
Length = 600
Score = 59.3 bits (142), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 87/432 (20%), Positives = 156/432 (36%), Gaps = 93/432 (21%)
Query: 92 VSFIDVVLISS-PMGMLG-LPFLTRMEGFSAKIYITEAAARIGQLMMEELICMNMEYRQF 149
F+D V+IS + G LP+ + M G+ IY+T I +++E+ YR+
Sbjct: 58 TDFLDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTKAICPILLED-------YRKI 110
Query: 150 YGAEESSGPQWMKWEELELLPSALRKIALGEDGSELGGGCPCIAHVKDCISKVQTLRFGE 209
++ E S + +KDC+ KV + +
Sbjct: 111 TVDKKG---------ETNFFTSQM---------------------IKDCMKKVVAVHLHQ 140
Query: 210 EACYNGILIIKAFSSGLDIGACNWIISGAKGNIAYISGSNFASGHAMDFDYRAIQGSDLI 269
+ L IKA+ +G +GA + I ++ Y N + + DL+
Sbjct: 141 TVQVDEELEIKAYYAGHVLGAAMFQIKVGCESVVYTGDYNMTPDRHLGAAWIDKCRPDLL 200
Query: 270 LYSDLSSLDSTEDIDQSSFSDDNNNWEELMNSLSNYDESVEEMEKLAFICSCAIDSVKAG 329
+ TE ++ D E + L E+VE G
Sbjct: 201 I---------TESTYATTIRDSKRCRER--DFLKKVHETVER-----------------G 232
Query: 330 GSVLIPINRVGVFLQLLEQIAIFMECSSLKIPIYIISSVAEELLAYTNTIPEWLCKQRQE 389
G VLIP+ +G +L + F E +LK PIY + + E+ Y W Q+
Sbjct: 233 GKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITW-TNQKIR 291
Query: 390 KLFSGDPLFAHVKLIKEKKIHVFPAVHSPKLLMNWQEPCIVFSPHWSLRLGPTIHLLRRW 449
K F +F K I F + + P +VF+ L G ++ + R+W
Sbjct: 292 KTFVQRNMFEF------KHIKAF-----DRAFADNPGPMVVFATPGMLHAGQSLQIFRKW 340
Query: 450 SGDHNSLL--------------VLENEVDAELAVLPFKPISMKVLQCSFLSGKKLQKVQP 495
+G+ +++ +L + E+ + M+V SF + + +
Sbjct: 341 AGNEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQILEVKMQVEYMSFSAHADAKGIMQ 400
Query: 496 LLKILQPKLVLF 507
L++ +P+ VL
Sbjct: 401 LIRQAEPRNVLL 412
>sp|Q503E1|INT11_DANRE Integrator complex subunit 11 OS=Danio rerio GN=cpsf3l PE=2 SV=1
Length = 598
Score = 56.2 bits (134), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 75/368 (20%), Positives = 135/368 (36%), Gaps = 79/368 (21%)
Query: 94 FIDVVLISS-PMGMLG-LPFLTRMEGFSAKIYITEAAARIGQLMMEELICMNMEYRQFYG 151
F+D V+IS + G LP+++ M G+ IY+T I +++E+
Sbjct: 60 FLDCVIISHFHLDHCGALPYMSEMVGYDGPIYMTHPTKAICPILLEDF------------ 107
Query: 152 AEESSGPQWMKWEELELLPSALRKIALGEDGSELGGGCPCIAHVKDCISKVQTLRFGEEA 211
RKI + + G +KDC+ KV L +
Sbjct: 108 ----------------------RKITVDKKGET---NFFTSQMIKDCMKKVVPLNLHQTV 142
Query: 212 CYNGILIIKAFSSGLDIGACNWIISGAKGNIAYISGSNFASGHAMDFDYRAIQGSDLILY 271
+ L IKA+ +G +GA I ++ Y N + + D+++
Sbjct: 143 QVDDELEIKAYYAGHVLGAAMVQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPDILI- 201
Query: 272 SDLSSLDSTEDIDQSSFSDDNNNWEELMNSLSNYDESVEEMEKLAFICSCAIDSVKAGGS 331
+E ++ D E + L E+VE GG
Sbjct: 202 --------SESTYATTIRDSKRCRER--DFLKKVHETVER-----------------GGK 234
Query: 332 VLIPINRVGVFLQLLEQIAIFMECSSLKIPIYIISSVAEELLAYTNTIPEWLCKQRQEKL 391
VLIP+ +G +L + F E +LK PIY + + E+ Y W Q+ K
Sbjct: 235 VLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITW-TNQKIRKT 293
Query: 392 FSGDPLFAHVKLIKEKKIHVFPAVHSPKLLMNWQEPCIVFSPHWSLRLGPTIHLLRRWSG 451
F +F K I F ++ + P +VF+ L G ++ + ++W+G
Sbjct: 294 FVQRNMFEF------KHIKAFDRSYA-----DNPGPMVVFATPGMLHAGQSLQIFKKWAG 342
Query: 452 DHNSLLVL 459
+ +++++
Sbjct: 343 NEKNMVIM 350
>sp|Q8GUU3|CPS3B_ARATH Cleavage and polyadenylation specificity factor subunit 3-II
OS=Arabidopsis thaliana GN=CPSF73-II PE=1 SV=2
Length = 613
Score = 50.8 bits (120), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 100/530 (18%), Positives = 180/530 (33%), Gaps = 141/530 (26%)
Query: 1 MKFTCLCQGGGFNF-PPCHILNVSGFHVLFDCPLDLSALTVFSPLPNDFYKAICKENSDS 59
M CL G G C ++ ++G ++FDC + + C +++
Sbjct: 1 MAIDCLVLGAGQEIGKSCVVVTINGKKIMFDCGMHMG----------------CDDHNRY 44
Query: 60 QNRQKVEKPLDANDLIFAEPWYKTVNNLHLWNVSFIDVVLISSPMGMLGLPFLTRMEGFS 119
N + K D ++ I + + H+ +V LP+ T + G++
Sbjct: 45 PNFSLISKSGDFDNAISC----IIITHFHMDHVG--------------ALPYFTEVCGYN 86
Query: 120 AKIYITEAAARIGQLMMEELICMNMEYRQFYGAEESSGPQWMKWEELELLPSALRKIALG 179
IY++ + LM+E+ YR+ E EL +
Sbjct: 87 GPIYMSYPTKALSPLMLED-------YRRVMVDRRG---------EEELFTTT------- 123
Query: 180 EDGSELGGGCPCIAHVKDCISKVQTLRFGEEACYNGILIIKAFSSGLDIGACNWIISGAK 239
H+ +C+ KV + + + L I+A+ +G +GA
Sbjct: 124 --------------HIANCMKKVIAIDLKQTIQVDEDLQIRAYYAGHVLGAVMVYAKMGD 169
Query: 240 GNIAYISGSNFASGHAMDFDYRAIQGSDLILYSDLSSLDSTEDIDQSSFSDDNNNWEELM 299
I Y N + + ID+ +L+
Sbjct: 170 AAIVYTGDYNMTTDRHL----------------------GAAKIDRLQL--------DLL 199
Query: 300 NSLSNYDESVE------EMEKLAFICSCAIDSVKAGGSVLIPINRVGVFLQLLEQIAIFM 353
S S Y ++ E E L + C V GG LIP +G +L + +
Sbjct: 200 ISESTYATTIRGSKYPREREFLQAVHKC----VAGGGKALIPSFALGRAQELCMLLDDYW 255
Query: 354 ECSSLKIPIYIISSVAEELLAYTNTIPEWLCKQRQEKLFSGDPL-FAHVKLIKEKKIHVF 412
E ++K+PIY S + + Y + W + +EK + +P F +VK IH
Sbjct: 256 ERMNIKVPIYFSSGLTIQANMYYKMLISWTSQNVKEKHNTHNPFDFKNVKDFDRSLIHA- 314
Query: 413 PAVHSPKLLMNWQEPCIVFSPHWSLRLGPTIHLLRRWSGDHNSLLVLENEVDAELA---V 469
P PC++F+ L G ++ + + W+ +L+ L A +
Sbjct: 315 PG------------PCVLFATPGMLCAGFSLEVFKHWAPSPLNLVALPGYSVAGTVGHKL 362
Query: 470 LPFKPISM------------KVLQCSFLSGKKLQKVQPLLKILQPKLVLF 507
+ KP ++ KV Q +F + + L K L PK V+
Sbjct: 363 MAGKPTTVDLYNGTKVDVRCKVHQVAFSPHTDAKGIMDLTKFLSPKNVVL 412
>sp|A8XUS3|CPSF2_CAEBR Probable cleavage and polyadenylation specificity factor subunit 2
OS=Caenorhabditis briggsae GN=cpsf-2 PE=3 SV=2
Length = 842
Score = 50.8 bits (120), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 75/374 (20%), Positives = 150/374 (40%), Gaps = 82/374 (21%)
Query: 95 IDVVLIS--SPMGMLGLPFLTRMEGFSAKIYITEAAARIGQLMMEELICMNMEYRQFYGA 152
I VLIS P+ + GLP+L G +A +Y T ++GQ+ + +L+ +++ +F
Sbjct: 55 ISAVLISHPDPLHLGGLPYLVAKCGLTAPVYCTVPVYKMGQMFIYDLVYSHLDVEEF--- 111
Query: 153 EESSGPQWMKWEELELLPSALRKIALGEDGSELGGGCPCIAHVKDCISKVQTLRFGEEAC 212
Q +++++ A K+ E+
Sbjct: 112 ------QHYSLDDVDM---AFEKV--------------------------------EQVK 130
Query: 213 YNGILIIKAFSSGLDIGA--CNWIISGAKGNIAYISGSNFASGHAMDFDYRAIQGSDLIL 270
YN +++K SG++ A +I G+ I I+G + + +DF++R
Sbjct: 131 YNQTVVLKG-DSGVNFTAMPAGHMIGGSMWRICRITGEDII--YCVDFNHR--------- 178
Query: 271 YSDLSSLDSTEDIDQSSFSDDNNNWEELMNSLSNYDE--SVEEMEKLAFICSCAIDSVKA 328
+D S S DN N L+ + +++ ++ ++ + + + +V+
Sbjct: 179 ----------KDRHLSGCSFDNFNRPHLLITGAHHISLPQMKRKDRDEQLVTKILRTVRQ 228
Query: 329 GGSVLIPINRVGVFLQ---LLEQIAIFMECSSLKIPIYIISSVAEELLAYTNTIPEWLCK 385
G +I I+ G L+ LL+Q+ + + ++S VA ++ + + EW+
Sbjct: 229 KGDCMIVIDTAGRVLELAYLLDQLWANQDAGLSTYNLVMMSHVASSVVQFAKSQLEWM-- 286
Query: 386 QRQEKLFSGDPLFAHVKLIKEKKIHVFPAVHSPKLLMNWQEPCIVFSPHWSLRLGPTIHL 445
EKLF D A K +++ VHS L+ + P +V + G + L
Sbjct: 287 --DEKLFRYDSSSARYNPFTLKNVNL---VHSHLELIKIRSPKVVLCSSQDMETGFSREL 341
Query: 446 LRRWSGDHNSLLVL 459
W D + ++L
Sbjct: 342 FLDWCADQRNGVIL 355
>sp|Q652P4|CPSF2_ORYSJ Cleavage and polyadenylation specificity factor subunit 2 OS=Oryza
sativa subsp. japonica GN=Os09g0569400 PE=2 SV=1
Length = 738
Score = 47.8 bits (112), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 60/297 (20%), Positives = 109/297 (36%), Gaps = 76/297 (25%)
Query: 95 IDVVLIS--SPMGMLGLPFLTRMEGFSAKIYITEAAARIGQLMMEELICMNMEYRQFYGA 152
ID VL+S M + LP+ + G SA +Y TE R+G L + Y F
Sbjct: 55 IDAVLLSHADTMHLGALPYAMKHLGLSAPVYATEPVFRLGILTL---------YDYFISR 105
Query: 153 EESSGPQWMKWEELELLPSALRKIALGEDGSELGGGCPCIAHVKDCISKVQTLRFGEEAC 212
+ S ++++ V L++ +
Sbjct: 106 RQVSDFDLFTLDDIDA-----------------------------AFQNVVRLKYSQNHL 136
Query: 213 YNGI---LIIKAFSSGLDIGACNWIISGAKGNIAYISGSNFASGHAMDFDYRA---IQGS 266
N ++I +G D+G W I+ ++ Y A+DF++R + G+
Sbjct: 137 LNDKGEGIVIAPHVAGHDLGGTVWKITKDGEDVVY----------AVDFNHRKERHLNGT 186
Query: 267 DLILYSDLSSLDSTEDIDQSSFSDDNNNWEELMNSLSNYDESVEEMEKLAFICSCAIDSV 326
L SF + N+L+N+ V + ++ + +
Sbjct: 187 AL-----------------GSFVRPAVLITDAYNALNNH---VYKRQQDQDFIDALVKVL 226
Query: 327 KAGGSVLIPINRVGVFLQLLEQIAIFMECSSLKIPIYIISSVAEELLAYTNTIPEWL 383
GGSVL+PI+ G L++L + + L PIY +++V+ + Y + EW+
Sbjct: 227 TGGGSVLLPIDTAGRVLEILLILEQYWAQRHLIYPIYFLTNVSTSTVDYVKSFLEWM 283
>sp|Q57626|Y162_METJA Uncharacterized protein MJ0162 OS=Methanocaldococcus jannaschii
(strain ATCC 43067 / DSM 2661 / JAL-1 / JCM 10045 / NBRC
100440) GN=MJ0162 PE=3 SV=1
Length = 421
Score = 44.7 bits (104), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 51/213 (23%), Positives = 98/213 (46%), Gaps = 35/213 (16%)
Query: 324 DSVKAGGSVLIPINRVGVFLQLLEQIAIFMECSSLK-IPIYIISSVAEELLAYTNTIPEW 382
++++ GG V+IP+ +G ++L I ++ L+ +PIY S+ Y + I W
Sbjct: 204 ETIENGGKVIIPVFAIGRAQEILLIINNYIRSGKLRDVPIYTDGSLIHATAVYMSYI-NW 262
Query: 383 LCKQRQEKLFSGDPLFAHVKLIKEKKIHVFPAVHSP--KLLMNWQEPCIVFSPHWSLRLG 440
L + +K + E +I+ F + L+ N +EPCI+ S ++ G
Sbjct: 263 LNPK--------------IKNMVENRINPFGEIKKADESLVFN-KEPCIIVSTSGMVQGG 307
Query: 441 PTIHLLRRWSGDHNSLLVLENEVDAELA---------VLPFK---PISMKVLQCSFLS-G 487
P + L+ N L++ + + L + PFK PI KV++ F + G
Sbjct: 308 PVLKYLKLLKDPKNKLILTGYQAEGTLGRELEEGAKEIQPFKNKIPIRGKVVKIEFSAHG 367
Query: 488 KKLQKVQPLLKILQPK--LVLFPEEWRTHVSFS 518
V+ + KI +P+ +V+ E +++ +SF+
Sbjct: 368 DYNSLVRYIKKIPKPEKAIVMHGERYQS-LSFA 399
>sp|Q9LKF9|CPSF2_ARATH Cleavage and polyadenylation specificity factor subunit 2
OS=Arabidopsis thaliana GN=CPSF100 PE=1 SV=2
Length = 739
Score = 44.3 bits (103), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 74/376 (19%), Positives = 143/376 (38%), Gaps = 84/376 (22%)
Query: 93 SFIDVVLISSP--MGMLGLPFLTRMEGFSAKIYITEAAARIGQLMMEELICMNMEYRQFY 150
S ID VL+S P + + LP+ + G SA +Y TE R+G L M Y QF
Sbjct: 53 STIDAVLLSHPDTLHIGALPYAMKQLGLSAPVYATEPVHRLGLLTM---------YDQFL 103
Query: 151 GAEESSGPQWMKWEELELLPSALRKIALGEDGSELGGGCPCIAHVKDCISKVQTLRFGEE 210
++ S ++++ SA + + ++ S+ L E
Sbjct: 104 SRKQVSDFDLFTLDDID---SAFQNV------------------IRLTYSQNYHLSGKGE 142
Query: 211 ACYNGILIIKAFSSGLDIGACNWIISGAKGNIAYISGSNFASGHAMDFDYRA---IQGSD 267
++I +G +G W I+ ++ Y A+D+++R + G+
Sbjct: 143 G-----IVIAPHVAGHMLGGSIWRITKDGEDVIY----------AVDYNHRKERHLNGT- 186
Query: 268 LILYSDLSSLDSTEDIDQSSFSDDNNNWE---ELMNSLSNYDESVEEMEKLAFICSCAID 324
+L S + D + +++ + E ++++S +
Sbjct: 187 -VLQSFVRPAVLITDAYHALYTNQTARQQRDKEFLDTISKH------------------- 226
Query: 325 SVKAGGSVLIPINRVGVFLQLLEQIAIFMECSSLKIPIYIISSVAEELLAYTNTIPEWLC 384
++ GG+VL+P++ G L+LL + PIY ++ V+ + Y + EW+
Sbjct: 227 -LEVGGNVLLPVDTAGRVLELLLILEQHWSQRGFSFPIYFLTYVSSSTIDYVKSFLEWMS 285
Query: 385 KQRQEKL-FSGDPLFAHVKLIKEKKIHVFPAVHSPKLLMNWQEPCIVFSPHWSLRLGPTI 443
+ S D F L++ HV ++ L P +V + SL G
Sbjct: 286 DSISKSFETSRDNAF----LLR----HVTLLINKTDLDNAPPGPKVVLASMASLEAGFAR 337
Query: 444 HLLRRWSGDHNSLLVL 459
+ W+ D +L++
Sbjct: 338 EIFVEWANDPRNLVLF 353
>sp|Q58633|Y1236_METJA Uncharacterized protein MJ1236 OS=Methanocaldococcus jannaschii
(strain ATCC 43067 / DSM 2661 / JAL-1 / JCM 10045 / NBRC
100440) GN=MJ1236 PE=4 SV=1
Length = 634
Score = 43.9 bits (102), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 34/140 (24%), Positives = 63/140 (45%), Gaps = 14/140 (10%)
Query: 324 DSVKAGGSVLIPINRVGVFLQLLEQIAIFMECSSLKIPIYIISSVAEELLAYTNTIPEWL 383
++ GG VLIP+ VG +L+ + P+Y+ + E +T PE+L
Sbjct: 401 ETTDRGGKVLIPVFGVGRAQELMLVLEEGYNQGIFNAPVYLDGMIWEATAIHT-AYPEYL 459
Query: 384 CKQRQEKLF-SGDPLFAHVKLIKEKKIHVFPAVHSP---KLLMNWQEPCIVFSPHWSLRL 439
K+ ++K+F GD F VF V S + +++ EPC++ + L
Sbjct: 460 SKEMRQKIFHEGDNPFLS---------EVFKRVGSTNERRKVIDSDEPCVILATSGMLTG 510
Query: 440 GPTIHLLRRWSGDHNSLLVL 459
GP++ L+ + D + ++
Sbjct: 511 GPSVEYLKHLAPDEKNAIIF 530
>sp|O17403|CPSF2_CAEEL Probable cleavage and polyadenylation specificity factor subunit 2
OS=Caenorhabditis elegans GN=cpsf-2 PE=3 SV=1
Length = 843
Score = 43.1 bits (100), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 70/374 (18%), Positives = 146/374 (39%), Gaps = 82/374 (21%)
Query: 95 IDVVLIS--SPMGMLGLPFLTRMEGFSAKIYITEAAARIGQLMMEELICMNMEYRQFYGA 152
I VLIS P+ + GLP+L G +A +Y T ++GQ+ + +++ +++ +F
Sbjct: 55 ISAVLISHPDPLHLGGLPYLVSKCGLTAPVYATVPVYKMGQMFIYDMVYSHLDVEEF--- 111
Query: 153 EESSGPQWMKWEELELLPSALRKIALGEDGSELGGGCPCIAHVKDCISKVQTLRFGEEAC 212
+ L+ + +A K+ E+
Sbjct: 112 ---------EHYTLDDVDTAFEKV--------------------------------EQVK 130
Query: 213 YNGILIIKAFSSGLDIGA--CNWIISGAKGNIAYISGSNFASGHAMDFDYRAIQGSDLIL 270
YN +++K SG+ A ++ G+ I ++G + + +DF+++
Sbjct: 131 YNQTVVLKG-DSGVHFTALPAGHMLGGSIWRICRVTGEDIV--YCVDFNHK--------- 178
Query: 271 YSDLSSLDSTEDIDQSSFSDDNNNWEELMNSLSNYDE--SVEEMEKLAFICSCAIDSVKA 328
++ SF DN N L+ + +++ + ++ + + + +V+
Sbjct: 179 --------KERHLNGCSF--DNFNRPHLLITGAHHISLPQMRRKDRDEQLVTKILRTVRQ 228
Query: 329 GGSVLIPINRVGVFLQ---LLEQIAIFMECSSLKIPIYIISSVAEELLAYTNTIPEWLCK 385
G +I I+ G L+ LL+Q+ + + ++S VA ++ + + EW+
Sbjct: 229 KGDCMIVIDTAGRVLELAHLLDQLWSNADAGLSTYNLVMMSHVASSVVQFAKSQLEWM-- 286
Query: 386 QRQEKLFSGDPLFAHVKLIKEKKIHVFPAVHSPKLLMNWQEPCIVFSPHWSLRLGPTIHL 445
EKLF D A K + + HS + LM + P +V + G + L
Sbjct: 287 --NEKLFKYDSSSARYNPFTLKHVTL---CHSHQELMRVRSPKVVLCSSQDMESGFSREL 341
Query: 446 LRRWSGDHNSLLVL 459
W D + ++L
Sbjct: 342 FLDWCSDPRNGVIL 355
>sp|O13794|YSH1_SCHPO Endoribonuclease ysh1 OS=Schizosaccharomyces pombe (strain 972 /
ATCC 24843) GN=ysh1 PE=3 SV=2
Length = 757
Score = 42.0 bits (97), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 46/188 (24%), Positives = 87/188 (46%), Gaps = 26/188 (13%)
Query: 312 MEKLAFICSCAIDSVKAGGSVLIPINRVGVFLQLL----EQIAIFMECSSLKIPIYIISS 367
+EK A + + +++ GG VL+P+ +G +LL E ++ S +PIY SS
Sbjct: 223 LEKEARLLNIIHSTIRNGGRVLMPVFALGRAQELLLILDEYWNNHLDLRS--VPIYYASS 280
Query: 368 VAEELLAYTNTIPEWLCKQRQEKLFSGDPLFAHVKLIKEKKIHVFPAVHSPKLLMNWQE- 426
+A + +A T + + K+ E+ +F V S + L + +
Sbjct: 281 LARKCMAIFQTYVNMMNDNIR-------------KIFAERNPFIFRFVKSLRNLEKFDDI 327
Query: 427 -PCIVFSPHWSLRLGPTIHLLRRWSGD-HNSLLVLENEVDAELAVLPFKPISMKVLQCSF 484
P ++ + L+ G + LL RW+ D N+LL+ V+ +A K I+ + ++
Sbjct: 328 GPSVILASPGMLQNGVSRTLLERWAPDPRNTLLLTGYSVEGTMA----KQITNEPIEIVS 383
Query: 485 LSGKKLQK 492
LSG+K+ +
Sbjct: 384 LSGQKIPR 391
>sp|Q54YL3|INT11_DICDI Integrator complex subunit 11 homolog OS=Dictyostelium discoideum
GN=ints11 PE=3 SV=1
Length = 744
Score = 41.2 bits (95), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 74/359 (20%), Positives = 131/359 (36%), Gaps = 90/359 (25%)
Query: 108 GLPFLTRMEGFSAKIYITEAAARIGQLMMEELICMNMEYRQFYGAEESSGPQWMKWEELE 167
LPF T M G+ IY+T I +++E+
Sbjct: 75 ALPFFTEMCGYDGPIYMTLPTKAICPILLEDY---------------------------- 106
Query: 168 LLPSALRKIALGEDGSELGGGCPCIAHVKDCISKV------QTLRFGEEACYNGILIIKA 221
RKI + + G +KDC+ KV QT++ EE L IKA
Sbjct: 107 ------RKITVEKKGET---NFFTAQMIKDCMKKVIPVNLHQTIKVDEE------LSIKA 151
Query: 222 FSSGLDIGACNWIISGAKGNIAYISGSNFASGHAMDFDYRAIQGSDLILYSDLSSLDSTE 281
+ +G +GA + ++ Y N + + D+++ TE
Sbjct: 152 YYAGHVLGAAMFYAKVGDESVVYTGDYNMTPDRHLGSAWIDQVKPDVLI---------TE 202
Query: 282 DIDQSSFSDDNNNWEELMNSLSNYDESVEEMEKLAFICSCAIDSVKAGGSVLIPINRVGV 341
++ D E + L E VE+ GG VLIP+ +G
Sbjct: 203 TTYATTIRDSKRGRER--DFLKRIHECVEK-----------------GGKVLIPVFALGR 243
Query: 342 FLQLLEQIAIFMECSSL-KIPIYIISSVAEELLAYTNTIPEWLCKQRQEKLFSGDPLFAH 400
+L I + E +L IPIY + +AE+ Y W Q+ ++ F +F
Sbjct: 244 VQELCILIDSYWEQMNLGHIPIYFSAGLAEKANLYYKLFINW-TNQKIKQTFVKRNMF-D 301
Query: 401 VKLIKEKKIHVFPAVHSPKLLMNWQEPCIVFSPHWSLRLGPTIHLLRRWSGDHNSLLVL 459
K IK + H+ V +P + ++F+ L G ++ + ++W+ + ++ ++
Sbjct: 302 FKHIKPFQSHL---VDAPGAM-------VLFATPGMLHAGASLEVFKKWAPNELNMTII 350
>sp|P79101|CPSF3_BOVIN Cleavage and polyadenylation specificity factor subunit 3 OS=Bos
taurus GN=CPSF3 PE=2 SV=1
Length = 684
Score = 41.2 bits (95), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 61/306 (19%), Positives = 120/306 (39%), Gaps = 39/306 (12%)
Query: 303 SNYDESVEEM--EKLAFICSCAIDSVKAGGSVLIPINRVGVFLQLLEQIAIFMECSS--L 358
S Y + E E+ A C+ D V GG LIP+ +G +LL + + +
Sbjct: 205 STYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELH 264
Query: 359 KIPIYIISSVAEELLAYTNTIPEWLCKQRQEKLFSGDPLFAHVKLIKEKKIHVFPAVHSP 418
IPIY SS+A++ +A T + + ++++ +P VF + +
Sbjct: 265 DIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPF-------------VFKHISNL 311
Query: 419 KLLMNWQE--PCIVFSPHWSLRLGPTIHLLRRWSGD-HNSLLVLENEVDAELAVLPFK-- 473
K + ++ + P +V + ++ G + L W D N +++ V+ LA
Sbjct: 312 KSMDHFDDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP 371
Query: 474 -----------PISMKVLQCSFLSGKKLQKVQPLLKILQPKLVLFPEEWRTHVSFSDVTS 522
P+ M V SF + Q+ ++ L+P V+ + + + +
Sbjct: 372 EEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQN--EMARLKA 429
Query: 523 FSVSHYSENETIHI----PSLKESAELEIAADIASKFQWRMLKQKKLNITRLKGRLFVNH 578
+ Y +N+ +HI P E+ L + +K + +K R+ G L +
Sbjct: 430 ALIREYEDNDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRN 489
Query: 579 GKHQLL 584
+ +L
Sbjct: 490 FNYHIL 495
>sp|Q9UKF6|CPSF3_HUMAN Cleavage and polyadenylation specificity factor subunit 3 OS=Homo
sapiens GN=CPSF3 PE=1 SV=1
Length = 684
Score = 40.8 bits (94), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 61/306 (19%), Positives = 120/306 (39%), Gaps = 39/306 (12%)
Query: 303 SNYDESVEEM--EKLAFICSCAIDSVKAGGSVLIPINRVGVFLQLLEQIAIFMECSS--L 358
S Y + E E+ A C+ D V GG LIP+ +G +LL + + +
Sbjct: 205 STYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELH 264
Query: 359 KIPIYIISSVAEELLAYTNTIPEWLCKQRQEKLFSGDPLFAHVKLIKEKKIHVFPAVHSP 418
IPIY SS+A++ +A T + + ++++ +P VF + +
Sbjct: 265 DIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPF-------------VFKHISNL 311
Query: 419 KLLMNWQE--PCIVFSPHWSLRLGPTIHLLRRWSGD-HNSLLVLENEVDAELAVLPFK-- 473
K + ++ + P +V + ++ G + L W D N +++ V+ LA
Sbjct: 312 KSMDHFDDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP 371
Query: 474 -----------PISMKVLQCSFLSGKKLQKVQPLLKILQPKLVLFPEEWRTHVSFSDVTS 522
P+ M V SF + Q+ ++ L+P V+ + + + +
Sbjct: 372 EEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQN--EMARLKA 429
Query: 523 FSVSHYSENETIHI----PSLKESAELEIAADIASKFQWRMLKQKKLNITRLKGRLFVNH 578
+ Y +N+ +HI P E+ L + +K + +K R+ G L +
Sbjct: 430 ALIREYEDNDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRN 489
Query: 579 GKHQLL 584
+ +L
Sbjct: 490 FNYHIL 495
>sp|Q9QXK7|CPSF3_MOUSE Cleavage and polyadenylation specificity factor subunit 3 OS=Mus
musculus GN=Cpsf3 PE=1 SV=2
Length = 684
Score = 40.4 bits (93), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 61/306 (19%), Positives = 120/306 (39%), Gaps = 39/306 (12%)
Query: 303 SNYDESVEEM--EKLAFICSCAIDSVKAGGSVLIPINRVGVFLQLLEQIAIFMECSS--L 358
S Y + E E+ A C+ D V GG LIP+ +G +LL + + +
Sbjct: 205 STYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELH 264
Query: 359 KIPIYIISSVAEELLAYTNTIPEWLCKQRQEKLFSGDPLFAHVKLIKEKKIHVFPAVHSP 418
IPIY SS+A++ +A T + + ++++ +P VF + +
Sbjct: 265 DIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPF-------------VFKHISNL 311
Query: 419 KLLMNWQE--PCIVFSPHWSLRLGPTIHLLRRWSGD-HNSLLVLENEVDAELAVLPFK-- 473
K + ++ + P +V + ++ G + L W D N +++ V+ LA
Sbjct: 312 KSMDHFDDIGPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP 371
Query: 474 -----------PISMKVLQCSFLSGKKLQKVQPLLKILQPKLVLFPEEWRTHVSFSDVTS 522
P+ M V SF + Q+ ++ L+P V+ + + + +
Sbjct: 372 EEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQN--EMARLKA 429
Query: 523 FSVSHYSENETIHI----PSLKESAELEIAADIASKFQWRMLKQKKLNITRLKGRLFVNH 578
+ Y +N+ +HI P E+ L + +K + +K R+ G L +
Sbjct: 430 ALIREYEDNDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRN 489
Query: 579 GKHQLL 584
+ +L
Sbjct: 490 FNYHIL 495
>sp|Q12102|CFT2_YEAST Cleavage factor two protein 2 OS=Saccharomyces cerevisiae (strain
ATCC 204508 / S288c) GN=CFT2 PE=1 SV=1
Length = 859
Score = 38.1 bits (87), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 37/69 (53%), Gaps = 9/69 (13%)
Query: 324 DSVKAG----GSVLIPINRVGVFLQLLEQI-AIFMECSSL----KIPIYIISSVAEELLA 374
D++K G GSV+IP++ G FL L Q+ + E + + ++P+ I+S L
Sbjct: 232 DTLKKGLSSDGSVIIPVDMSGKFLDLFTQVHELLFESTKINAHTQVPVLILSYARGRTLT 291
Query: 375 YTNTIPEWL 383
Y ++ EWL
Sbjct: 292 YAKSMLEWL 300
>sp|Q4PEJ3|YSH1_USTMA Endoribonuclease YSH1 OS=Ustilago maydis (strain 521 / FGSC 9021)
GN=YSH1 PE=3 SV=1
Length = 880
Score = 36.2 bits (82), Expect = 0.95, Method: Compositional matrix adjust.
Identities = 37/150 (24%), Positives = 68/150 (45%), Gaps = 21/150 (14%)
Query: 326 VKAGGSVLIPINRVGVFLQLL----EQIAIFMECSSLKIPIYIISSVAEELLAYTNTIPE 381
+K GG VL+P+ +G +LL E A E S +PIY S++A++ ++ T
Sbjct: 246 IKRGGRVLLPVFVLGRAQELLLLLDEYWAAHPELHS--VPIYYASALAKKCISVYQTYIH 303
Query: 382 WLCKQRQEKLFSGDPLFAHVKLIKEKKIHVFPAVHSPKLLMNWQE--PCIVFSPHWSLRL 439
+ + + D F VF + + + L +++ PC++ + ++
Sbjct: 304 TMNDHIRTRFNRRDNPF------------VFKHISNLRSLEKFEDRGPCVMMASPGFMQS 351
Query: 440 GPTIHLLRRWSGD-HNSLLVLENEVDAELA 468
G + LL RW+ D N L+V V+ +A
Sbjct: 352 GVSRELLERWAPDKRNGLIVSGYSVEGTMA 381
>sp|Q86A79|CPSF3_DICDI Cleavage and polyadenylation specificity factor subunit 3
OS=Dictyostelium discoideum GN=cpsf3 PE=3 SV=1
Length = 774
Score = 34.3 bits (77), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 57/277 (20%), Positives = 107/277 (38%), Gaps = 51/277 (18%)
Query: 199 ISKVQTLRFGEEACYNGILIIKAFSSGLDIGACNWIISGAKGNIAYISG-SNFASGHAMD 257
+ K++ +R+ ++ +NGI + F++G +GA ++I A I Y S H M
Sbjct: 159 LEKIEKVRYRQKVEHNGIKV-TCFNAGHVLGAAMFMIEIAGVKILYTGDFSRQEDRHLMG 217
Query: 258 FDYRAIQGSDLILYSDLSSLDSTEDIDQSSFSDDNNNWEELMNSLSNYDESVEE--MEKL 315
+ ++ LI+ S Y V E +E+
Sbjct: 218 AETPPVKVDVLIIES-------------------------------TYGVQVHEPRLERE 246
Query: 316 AFICSCAIDSVKAGGSVLIPINRVGVFLQLLEQIAIFMECSSL--KIPIYIISSVAEELL 373
S V+ G LIP+ +G +LL + + + +PIY S++A++ +
Sbjct: 247 KRFTSSVHQVVERNGKCLIPVFALGRAQELLLILDEYWIANPQLHHVPIYYASALAKKCM 306
Query: 374 AYTNTIPEWLCKQRQEKLFSGDPL-FAHVKLIKEKKIHVFPAVHSPKLLMNWQEPCIVFS 432
T + + + + +P F H+K IK + S + + PC+ +
Sbjct: 307 GVYRTYINMMNDRVRAQFDVSNPFEFKHIKNIK--------GIES----FDDRGPCVFMA 354
Query: 433 PHWSLRLGPTIHLLRRWSGD-HNSLLVLENEVDAELA 468
L+ G + L RW D N +++ V+ LA
Sbjct: 355 SPGMLQSGLSRQLFERWCSDKRNGIVIPGYSVEGTLA 391
>sp|P09257|GB_VZVD Envelope glycoprotein B OS=Varicella-zoster virus (strain Dumas)
GN=gB PE=1 SV=2
Length = 931
Score = 33.5 bits (75), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 17/53 (32%), Positives = 28/53 (52%)
Query: 9 GGGFNFPPCHILNVSGFHVLFDCPLDLSALTVFSPLPNDFYKAICKENSDSQN 61
G GFN P I +V+G + F C + V S P+ FY+++ E + S++
Sbjct: 38 GSGFNGPGVFITSVTGVWLCFLCIFSMFVTAVVSVSPSSFYESLQVEPTQSED 90
>sp|Q4JR05|GB_VZVO Envelope glycoprotein B OS=Varicella-zoster virus (strain Oka
vaccine) GN=gB PE=1 SV=2
Length = 931
Score = 33.5 bits (75), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 17/53 (32%), Positives = 28/53 (52%)
Query: 9 GGGFNFPPCHILNVSGFHVLFDCPLDLSALTVFSPLPNDFYKAICKENSDSQN 61
G GFN P I +V+G + F C + V S P+ FY+++ E + S++
Sbjct: 38 GSGFNGPGVFITSVTGVWLCFLCIFSMFVTAVVSVSPSSFYESLQVEPTQSED 90
>sp|Q6BMW3|YSH1_DEBHA Endoribonuclease YSH1 OS=Debaryomyces hansenii (strain ATCC 36239 /
CBS 767 / JCM 1990 / NBRC 0083 / IGC 2968) GN=YSH1 PE=3
SV=2
Length = 815
Score = 33.5 bits (75), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 43/169 (25%), Positives = 78/169 (46%), Gaps = 26/169 (15%)
Query: 312 MEKLAFICSCAIDSVKAGGSVLIPINRVGVFLQLLEQIAIFMECSSLK-----IPIYIIS 366
+EK + + ++ GG +L+P+ +G +LL I E SL I IY S
Sbjct: 234 LEKETRMTNIIHSTLLKGGRILMPVFALGRAQELL---LILEEYWSLNDDLQNINIYYAS 290
Query: 367 SVAEELLA----YTNTIPEWLCKQRQEKLFSGDPLFAHVKLIKEKKIHVFPAVHSPKLLM 422
S+A + +A YTN + + + +L + + K++ F + S K L
Sbjct: 291 SLARKCMAVYQTYTNIMNDSI------RLTT-----SATNSSKKQNPFQFKFIKSIKNLD 339
Query: 423 NWQE--PCIVFSPHWSLRLGPTIHLLRRWSGD-HNSLLVLENEVDAELA 468
+Q+ PC+V + L+ G + LL RW+ D N++++ V+ +A
Sbjct: 340 KFQDFGPCVVVASPGMLQNGVSRELLERWAPDPKNAVIMTGYSVEGTMA 388
>sp|Q9SKC2|ARI11_ARATH Probable E3 ubiquitin-protein ligase ARI11 OS=Arabidopsis thaliana
GN=ARI11 PE=2 SV=1
Length = 542
Score = 33.1 bits (74), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 36/67 (53%), Gaps = 2/67 (2%)
Query: 610 LAELSKMGINGSVERCMTDAESE-DGFTVKVQDPEKSMIEVRAAVTVISAADKNLASRIV 668
L+ S NG +ER AE E F K++DP K+ E+RA + ++ A K +V
Sbjct: 447 LSCCSAEAENG-LERLHHCAEEELKQFIGKIEDPSKNFGELRAKLIDLTKATKTYFENLV 505
Query: 669 KAMENIL 675
KA+EN L
Sbjct: 506 KALENGL 512
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.320 0.136 0.411
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 253,761,682
Number of Sequences: 539616
Number of extensions: 10922046
Number of successful extensions: 26380
Number of sequences better than 100.0: 41
Number of HSP's better than 100.0 without gapping: 24
Number of HSP's successfully gapped in prelim test: 17
Number of HSP's that attempted gapping in prelim test: 26268
Number of HSP's gapped (non-prelim): 72
length of query: 678
length of database: 191,569,459
effective HSP length: 124
effective length of query: 554
effective length of database: 124,657,075
effective search space: 69060019550
effective search space used: 69060019550
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 65 (29.6 bits)
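
Note on the statistics above: the effective lengths and the effective search space follow from the query length, the database size, and the effective HSP length by simple arithmetic. The sketch below reproduces that arithmetic and the standard Karlin-Altschul conversion from raw score to bit score. It is only an illustration: the function names are hypothetical and are not part of BLAST. The second Lambda/K pair in the footer (0.267, 0.0410) is assumed here to be the gapped pair. Because of compositional matrix adjustment and rounding in the report, the computed bit scores and E-values should be expected to agree with the reported ones only approximately.

```python
import math

# Values copied from the report footer.
QUERY_LEN = 678            # "length of query"
DB_LETTERS = 191_569_459   # "length of database"
DB_SEQS = 539_616          # "Number of Sequences"
HSP_LEN = 124              # "effective HSP length"
LAMBDA_GAPPED = 0.267      # assumed to be the gapped Lambda
K_GAPPED = 0.0410          # assumed to be the gapped K


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    # The HSP length is subtracted once per database sequence.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Karlin-Altschul bit score: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits with at least this bit score."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN
    )
    print(eff_q, eff_db, space)
    # Raw score 120 is the CPSF2_CAEBR hit listed in the report.
    bits = bit_score(120)
    print(round(bits, 1), expect_value(bits, space))
```

The first function returns 554, 124657075 and 69060019550, matching the "effective length of query", "effective length of database" and "effective search space" lines of the footer.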