Your job contains 1 sequence.
>005253
MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL
SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID
SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE
KHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGR
VLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLL
KHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTL
ARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLGP
DNNLSGDPMVIDANNANASADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWDD
FGEVINPDDYIIKDEDMDQAAMHIGGDDGKLDEGSASLILDAKPSKVVSNELTVLVHGSA
EATEHLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAW
VDAEVGKTENGMLSLLPISTPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGE
YVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL
The BLAST search returned 1 gene product which did not match your query constraints. Please see the full BLAST report below for the details.
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 005253
(706 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Sequences producing High-scoring Segment Pairs (columns: High Score, Smallest Sum Probability P(N), N):
TAIR|locus:2172843 - symbol:CPSF100 "cleavage and polyade... 2242 3.4e-310 2
UNIPROTKB|Q9P2I0 - symbol:CPSF2 "Cleavage and polyadenyla... 1054 8.2e-134 3
UNIPROTKB|Q10568 - symbol:CPSF2 "Cleavage and polyadenyla... 1052 1.3e-133 3
UNIPROTKB|E2R496 - symbol:CPSF2 "Uncharacterized protein"... 1051 1.7e-133 3
RGD|1309687 - symbol:Cpsf2 "cleavage and polyadenylation ... 1048 2.7e-133 3
UNIPROTKB|Q9W799 - symbol:cpsf2 "Cleavage and polyadenyla... 1041 2.7e-133 3
ZFIN|ZDB-GENE-040718-79 - symbol:cpsf2 "cleavage and poly... 1045 3.5e-133 3
UNIPROTKB|F1NMN0 - symbol:CPSF2 "Uncharacterized protein"... 1044 5.7e-133 3
FB|FBgn0027873 - symbol:Cpsf100 "Cleavage and polyadenyla... 1003 1.1e-120 2
DICTYBASE|DDB_G0270392 - symbol:cpsf2 "cleavage and polya... 869 5.7e-120 4
MGI|MGI:1861601 - symbol:Cpsf2 "cleavage and polyadenylat... 1048 9.5e-120 2
WB|WBGene00017313 - symbol:cpsf-2 species:6239 "Caenorhab... 768 1.8e-94 3
UNIPROTKB|O17403 - symbol:cpsf-2 "Probable cleavage and p... 768 1.8e-94 3
UNIPROTKB|F1SD85 - symbol:CPSF2 "Uncharacterized protein"... 928 3.4e-93 1
POMBASE|SPBC1709.15c - symbol:cft2 "cleavage factor two C... 600 7.1e-78 3
MGI|MGI:1919207 - symbol:Cpsf3l "cleavage and polyadenyla... 438 1.4e-42 2
RGD|1306841 - symbol:Cpsf3l "cleavage and polyadenylation... 438 1.4e-42 2
UNIPROTKB|Q5TA45 - symbol:CPSF3L "Integrator complex subu... 434 2.5e-42 2
UNIPROTKB|F1NV30 - symbol:CPSF3L "Integrator complex subu... 432 8.9e-42 2
UNIPROTKB|Q5ZIH0 - symbol:CPSF3L "Integrator complex subu... 432 1.1e-41 2
UNIPROTKB|E1B7Q9 - symbol:CPSF3L "Integrator complex subu... 428 4.8e-41 2
UNIPROTKB|E2QY53 - symbol:CPSF3L "Uncharacterized protein... 427 9.5e-41 2
UNIPROTKB|Q2YDM2 - symbol:CPSF3L "Integrator complex subu... 423 3.8e-40 2
UNIPROTKB|G3V1S5 - symbol:CPSF3L "Integrator complex subu... 423 3.9e-40 2
UNIPROTKB|F1RJE8 - symbol:CPSF3L "Uncharacterized protein... 421 1.7e-39 2
FB|FBgn0039691 - symbol:IntS11 "Integrator 11" species:72... 429 2.8e-39 2
TAIR|locus:2206076 - symbol:CPSF73-I "cleavage and polyad... 428 1.4e-38 2
CGD|CAL0004705 - symbol:orf19.325 species:5476 "Candida a... 369 1.8e-38 5
UNIPROTKB|Q5AEE3 - symbol:CFT2 "Putative uncharacterized ... 369 1.8e-38 5
POMBASE|SPAC17G6.16c - symbol:ysh1 "mRNA cleavage and pol... 422 9.9e-37 1
WB|WBGene00008642 - symbol:F10B5.8 species:6239 "Caenorha... 404 2.7e-36 2
SGD|S000004267 - symbol:YSH1 "Putative endoribonuclease" ... 406 3.0e-36 3
UNIPROTKB|G4N6C6 - symbol:MGG_06570 "Uncharacterized prot... 213 1.7e-35 6
SGD|S000004105 - symbol:CFT2 "Subunit of the mRNA cleavag... 351 4.5e-35 3
ZFIN|ZDB-GENE-030131-3275 - symbol:cpsf3 "cleavage and po... 396 6.0e-34 1
UNIPROTKB|I3LKR1 - symbol:CPSF3 "Uncharacterized protein"... 394 9.8e-34 1
FB|FBgn0261065 - symbol:Cpsf73 "Cleavage and polyadenylat... 393 1.3e-33 1
UNIPROTKB|P79101 - symbol:CPSF3 "Cleavage and polyadenyla... 390 2.8e-33 1
UNIPROTKB|Q9UKF6 - symbol:CPSF3 "Cleavage and polyadenyla... 390 2.8e-33 1
UNIPROTKB|F1NKW5 - symbol:CPSF3 "Uncharacterized protein"... 390 2.8e-33 1
UNIPROTKB|E2R7R2 - symbol:CPSF3 "Uncharacterized protein"... 390 3.3e-33 1
MGI|MGI:1859328 - symbol:Cpsf3 "cleavage and polyadenylat... 387 6.0e-33 1
RGD|1305767 - symbol:Cpsf3 "cleavage and polyadenylation ... 387 6.1e-33 1
UNIPROTKB|G3V6W7 - symbol:Cpsf3 "Protein Cpsf3" species:1... 387 6.1e-33 1
UNIPROTKB|G5E9W3 - symbol:CPSF3 "Cleavage and polyadenyla... 385 7.8e-33 1
DICTYBASE|DDB_G0278189 - symbol:ints11 "integrator comple... 377 2.0e-32 2
DICTYBASE|DDB_G0274799 - symbol:cpsf3 "cleavage and polya... 384 2.7e-32 2
ZFIN|ZDB-GENE-050522-13 - symbol:cpsf3l "cleavage and pol... 373 5.2e-32 2
ASPGD|ASPL0000040420 - symbol:AN3082 species:162425 "Emer... 181 2.5e-31 6
WB|WBGene00013460 - symbol:cpsf-3 species:6239 "Caenorhab... 366 1.6e-30 1
TAIR|locus:2065368 - symbol:CPSF73-II "AT2G01730" species... 354 6.1e-30 2
CGD|CAL0005344 - symbol:orf19.5486 species:5476 "Candida ... 346 4.4e-28 1
UNIPROTKB|Q59P50 - symbol:YSH1 "Endoribonuclease YSH1" sp... 346 4.4e-28 1
ASPGD|ASPL0000060573 - symbol:AN0990 species:162425 "Emer... 348 6.8e-28 2
GENEDB_PFALCIPARUM|PFC0825c - symbol:PFC0825c "cleavage a... 280 7.7e-23 2
UNIPROTKB|O77371 - symbol:PFC0825c "Cleavage and polyaden... 280 7.7e-23 2
GENEDB_PFALCIPARUM|PF14_0364 - symbol:PF14_0364 "cleavage... 256 1.5e-21 2
UNIPROTKB|Q8IL83 - symbol:PF14_0364 "Cleavage and polyade... 256 1.5e-21 2
UNIPROTKB|C9J979 - symbol:CPSF3L "Integrator complex subu... 178 3.9e-20 2
UNIPROTKB|E9PNS4 - symbol:CPSF3L "Integrator complex subu... 236 8.2e-19 1
UNIPROTKB|G3V3T7 - symbol:CPSF2 "Cleavage and polyadenyla... 236 8.2e-19 1
UNIPROTKB|F1SD84 - symbol:LOC100625560 "Uncharacterized p... 151 4.1e-18 2
UNIPROTKB|H0YJF4 - symbol:CPSF2 "Cleavage and polyadenyla... 172 2.2e-17 2
UNIPROTKB|E9PI75 - symbol:CPSF3L "Integrator complex subu... 209 6.6e-16 1
DICTYBASE|DDB_G0282473 - symbol:ints9 "integrator complex... 209 9.2e-16 2
UNIPROTKB|E9PIG1 - symbol:CPSF3L "Integrator complex subu... 207 1.1e-15 1
UNIPROTKB|Q5ZKK2 - symbol:INTS9 "Integrator complex subun... 183 3.3e-14 3
UNIPROTKB|F6XI08 - symbol:INTS9 "Uncharacterized protein"... 184 4.5e-14 2
UNIPROTKB|F1RJQ5 - symbol:INTS9 "Uncharacterized protein"... 182 4.8e-14 2
UNIPROTKB|F1MMA6 - symbol:INTS9 "Integrator complex subun... 183 5.7e-14 2
UNIPROTKB|Q2KJA6 - symbol:INTS9 "Integrator complex subun... 183 5.7e-14 2
ZFIN|ZDB-GENE-061013-129 - symbol:ints9 "integrator compl... 182 6.8e-14 3
UNIPROTKB|G3XAN1 - symbol:INTS9 "Integrator complex subun... 178 9.5e-14 2
MGI|MGI:1098533 - symbol:Ints9 "integrator complex subuni... 179 1.5e-13 3
UNIPROTKB|Q9NV88 - symbol:INTS9 "Integrator complex subun... 178 2.0e-13 2
RGD|1311539 - symbol:Ints9 "integrator complex subunit 9"... 177 3.1e-13 3
UNIPROTKB|H7BYQ6 - symbol:INTS9 "Integrator complex subun... 178 5.1e-12 2
WB|WBGene00017608 - symbol:F19F10.12 species:6239 "Caenor... 160 4.5e-11 2
FB|FBgn0036570 - symbol:IntS9 "Integrator 9" species:7227... 148 4.5e-10 2
UNIPROTKB|Q9KV92 - symbol:VC_0264 "Putative uncharacteriz... 160 2.8e-08 1
TIGR_CMR|VC_0264 - symbol:VC_0264 "conserved hypothetical... 160 2.8e-08 1
UNIPROTKB|E9PIL7 - symbol:CPSF3L "Integrator complex subu... 135 5.7e-08 1
UNIPROTKB|G3V5T3 - symbol:CPSF2 "Cleavage and polyadenyla... 132 1.2e-07 1
TAIR|locus:2079696 - symbol:AT3G07530 "AT3G07530" species... 107 1.4e-06 4
TIGR_CMR|CHY_2049 - symbol:CHY_2049 "metallo-beta-lactama... 134 1.6e-06 2
UNIPROTKB|H0YBH8 - symbol:INTS9 "Integrator complex subun... 133 3.9e-06 1
UNIPROTKB|Q81SC3 - symbol:BA_1737 "Metallo-beta-lactamase... 140 5.3e-06 2
TIGR_CMR|BA_1737 - symbol:BA_1737 "metallo-beta-lactamase... 140 5.3e-06 2
UNIPROTKB|E5RG70 - symbol:INTS9 "Integrator complex subun... 96 1.2e-05 3
UNIPROTKB|Q8EJC6 - symbol:SO_0541 "RNA-metabolizing metal... 141 2.7e-05 2
TIGR_CMR|SO_0541 - symbol:SO_0541 "metallo-beta-lactamase... 141 2.7e-05 2
UNIPROTKB|E9PQF0 - symbol:CPSF3L "Integrator complex subu... 116 5.5e-05 1
TIGR_CMR|DET_1061 - symbol:DET_1061 "metallo-beta-lactama... 129 7.3e-05 1
UNIPROTKB|E2QVB2 - symbol:INTS9 "Uncharacterized protein"... 127 9.6e-05 1
TIGR_CMR|CPS_2623 - symbol:CPS_2623 "metallo-beta-lactama... 110 0.00018 2
UNIPROTKB|C9JZH6 - symbol:CPSF3 "Cleavage and polyadenyla... 102 0.00019 1
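The summary lines above can be read programmatically. The following is a minimal sketch, assuming the whitespace-collapsed layout seen in this report (database prefix, accession, `symbol:` field, truncated description, then High Score, P(N), and N as the last three fields). The names `HIT_LINE` and `parse_hit` are hypothetical and not part of any BLAST tool.

```python
import re

# Hypothetical pattern: assumes each summary line ends with "<score> <P(N)> <N>"
# and starts with "<database>|<accession> - symbol:<symbol>".
HIT_LINE = re.compile(
    r'^(?P<db>[^|\s]+)\|(?P<acc>\S+) - symbol:(?P<symbol>\S+) .* '
    r'(?P<score>\d+) (?P<p>\S+) (?P<n>\d+)$'
)

def parse_hit(line):
    """Return a dict for one summary line, or None if the line does not match."""
    m = HIT_LINE.match(line.strip())
    if m is None:
        return None
    return {
        "db": m.group("db"),
        "accession": m.group("acc"),
        "symbol": m.group("symbol"),
        "score": int(m.group("score")),
        # Very small P(N) values (e.g. 3.4e-310) become subnormal floats.
        "p_value": float(m.group("p")),
        "n": int(m.group("n")),
    }

hit = parse_hit('TAIR|locus:2172843 - symbol:CPSF100 "cleavage and polyade... 2242 3.4e-310 2')
print(hit["db"], hit["accession"], hit["symbol"], hit["score"], hit["n"])
```

Because the description field is truncated with `...` in the report, the sketch deliberately does not try to recover it.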
>TAIR|locus:2172843
symbol:CPSF100 "cleavage and polyadenylation specificity
factor 100" species:3702 "Arabidopsis thaliana" [GO:0005634
"nucleus" evidence=ISM;IDA] [GO:0009793 "embryo development ending
in seed dormancy" evidence=NAS] [GO:0016787 "hydrolase activity"
evidence=IEA] [GO:0003677 "DNA binding" evidence=IPI] [GO:0005515
"protein binding" evidence=IPI] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=ISS;NAS]
[GO:0006378 "mRNA polyadenylation" evidence=ISS] [GO:0006379 "mRNA
cleavage" evidence=ISS] [GO:0035194 "posttranscriptional gene
silencing by RNA" evidence=IMP] [GO:0009506 "plasmodesma"
evidence=IDA] [GO:0000278 "mitotic cell cycle" evidence=RCA]
[GO:0006306 "DNA methylation" evidence=RCA] [GO:0006342 "chromatin
silencing" evidence=RCA] [GO:0006396 "RNA processing" evidence=RCA]
[GO:0006397 "mRNA processing" evidence=RCA] [GO:0007267 "cell-cell
signaling" evidence=RCA] [GO:0009220 "pyrimidine ribonucleotide
biosynthetic process" evidence=RCA] [GO:0009616 "virus induced gene
silencing" evidence=RCA] [GO:0009640 "photomorphogenesis"
evidence=RCA] [GO:0010267 "production of ta-siRNAs involved in RNA
interference" evidence=RCA] [GO:0010388 "cullin deneddylation"
evidence=RCA] [GO:0016569 "covalent chromatin modification"
evidence=RCA] [GO:0031047 "gene silencing by RNA" evidence=RCA]
[GO:0035196 "production of miRNAs involved in gene silencing by
miRNA" evidence=RCA] InterPro:IPR001279 InterPro:IPR027075
Pfam:PF00753 SMART:SM00849 GO:GO:0009506 EMBL:CP002688
GenomeReviews:BA000015_GR GO:GO:0006378 EMBL:AB005244 GO:GO:0003723
GO:GO:0016787 GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236 KO:K14402
InterPro:IPR022712 InterPro:IPR025069 PANTHER:PTHR11203:SF5
Pfam:PF10996 Pfam:PF13299 SMART:SM01027 EMBL:AF283277 EMBL:AY034982
EMBL:BT004374 IPI:IPI00521104 RefSeq:NP_197776.1 UniGene:At.25191
ProteinModelPortal:Q9LKF9 SMR:Q9LKF9 IntAct:Q9LKF9 STRING:Q9LKF9
PaxDb:Q9LKF9 PRIDE:Q9LKF9 EnsemblPlants:AT5G23880.1 GeneID:832453
KEGG:ath:AT5G23880 TAIR:At5g23880 HOGENOM:HOG000264343
InParanoid:Q9LKF9 OMA:NNPFQFK PhylomeDB:Q9LKF9
ProtClustDB:CLSN2686300 Genevestigator:Q9LKF9 GermOnline:AT5G23880
GO:GO:0035194 Uniprot:Q9LKF9
Length = 739
Score = 2242 (794.3 bits), Expect = 3.4e-310, Sum P(2) = 3.4e-310
Identities = 430/539 (79%), Positives = 487/539 (90%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
MGTSVQVTPL GV+NENPLSYLVSIDGFNFLIDCGWND FD SLL+PLS+VASTIDAVLL
Sbjct: 1 MGTSVQVTPLCGVYNENPLSYLVSIDGFNFLIDCGWNDLFDTSLLEPLSRVASTIDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
SHPDTLH+GALPYAMKQLGLSAPV++TEPV+RLGLLTMYDQ+LSR+QVS+FDLFTLDDID
Sbjct: 61 SHPDTLHIGALPYAMKQLGLSAPVYATEPVHRLGLLTMYDQFLSRKQVSDFDLFTLDDID 120
Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKE 180
SAFQ+V RLTYSQNYHLSGKGEGIV+APHVAGH+LGG++W+ITKDGEDVIYAVDYN RKE
Sbjct: 121 SAFQNVIRLTYSQNYHLSGKGEGIVIAPHVAGHMLGGSIWRITKDGEDVIYAVDYNHRKE 180
Query: 181 KHLNGTVLESFVRPAVLITDAYNALH-NQPPRQQREM-FQDAISKTLRAGGNVLLPVDSA 238
+HLNGTVL+SFVRPAVLITDAY+AL+ NQ RQQR+ F D ISK L GGNVLLPVD+A
Sbjct: 181 RHLNGTVLQSFVRPAVLITDAYHALYTNQTARQQRDKEFLDTISKHLEVGGNVLLPVDTA 240
Query: 239 GRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAF 298
GRVLELLLILE +W++ ++PIYFLTYVSSSTIDYVKSFLEWM DSI+KSFETSRDNAF
Sbjct: 241 GRVLELLLILEQHWSQRGFSFPIYFLTYVSSSTIDYVKSFLEWMSDSISKSFETSRDNAF 300
Query: 299 LLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFG 358
LL+HVTLLINK++LDNAP GPK+VLASMASLEAGF+ +IFVEWA+D +NLVLFTE GQFG
Sbjct: 301 LLRHVTLLINKTDLDNAPPGPKVVLASMASLEAGFAREIFVEWANDPRNLVLFTETGQFG 360
Query: 359 TLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASL 418
TLARMLQ+ PPPK VKVTMS+RVPL GEELIAYEEEQ RLK+EEAL+ASLVKEEE+KAS
Sbjct: 361 TLARMLQSAPPPKFVKVTMSKRVPLAGEELIAYEEEQNRLKREEALRASLVKEEETKASH 420
Query: 419 GPDNNLSGDPMVIDXXXXXXSADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEW 478
G D+N S +PM+ID DV+ HG Y+DILIDGFVPPS+SVAPMFP+Y+N SEW
Sbjct: 421 GSDDN-SSEPMIIDTKTTH---DVIGSHGPAYKDILIDGFVPPSSSVAPMFPYYDNTSEW 476
Query: 479 DDFGEVINPDDYIIKDEDMDQAAMHIGGD-DGKLDEGSASLILDAKPSKVVSNELTVLV 536
DDFGE+INPDDY+IKDEDMD+ AMH GGD DG+LDE +ASL+LD +PSKV+SNEL V V
Sbjct: 477 DDFGEIINPDDYVIKDEDMDRGAMHNGGDVDGRLDEATASLMLDTRPSKVMSNELIVTV 535
Score = 758 (271.9 bits), Expect = 3.4e-310, Sum P(2) = 3.4e-310
Identities = 150/201 (74%), Positives = 165/201 (82%)
Query: 508 DGKLDEGSA-SLILDAKPSKVVSNELTVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEET 566
+G+ D S S+I P K+V LVH AEATEHLKQHCL ++CPHVY PQIEET
Sbjct: 545 EGRSDGRSIKSMIAHVSPLKLV------LVHAIAEATEHLKQHCLNNICPHVYAPQIEET 598
Query: 567 IDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPPHK 626
+DVTSDLCAYKVQLSEKLMSNV+FKKLGD E+AWVD+EVGKTE M SLLP+ A PHK
Sbjct: 599 VDVTSDLCAYKVQLSEKLMSNVIFKKLGDSEVAWVDSEVGKTERDMRSLLPMPGAASPHK 658
Query: 627 SVLVGDLKMADLKPFLSSKGIQVEFAGG-ALRCGEYVTIRKVGPAGQKGGGSGTQQIVIE 685
VLVGDLK+AD K FLSSKG+QVEFAGG ALRCGEYVT+RKVGP GQKGG SG QQI+IE
Sbjct: 659 PVLVGDLKIADFKQFLSSKGVQVEFAGGGALRCGEYVTLRKVGPTGQKGGASGPQQILIE 718
Query: 686 GPLCEDYYKIRAYLYSQFYLL 706
GPLCEDYYKIR YLYSQFYLL
Sbjct: 719 GPLCEDYYKIRDYLYSQFYLL 739
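The `Identities` and `Positives` lines of each segment pair give raw counts alongside a rounded percentage. The following is a minimal sketch, using a hypothetical helper `check_identities`, that recomputes the fractions from the counts. It also tests the assumption that the printed percentage is the integer-truncated value, which is an observation about the lines in this report rather than documented behaviour.

```python
import re

# Matches lines such as "Identities = 430/539 (79%), Positives = 487/539 (90%)".
IDENT = re.compile(
    r'Identities = (\d+)/(\d+) \((\d+)%\), Positives = (\d+)/(\d+) \((\d+)%\)'
)

def check_identities(line):
    """Return exact fractions and whether the printed percentages equal the truncated values."""
    m = IDENT.search(line)
    if m is None:
        return None
    ident, length, ident_pct, pos, pos_len, pos_pct = map(int, m.groups())
    return {
        "identity": ident / length,
        "positives": pos / pos_len,
        # Assumption: the report truncates rather than rounds.
        "matches_truncated": (
            100 * ident // length == ident_pct
            and 100 * pos // pos_len == pos_pct
        ),
    }

result = check_identities("Identities = 430/539 (79%), Positives = 487/539 (90%)")
print(round(result["identity"], 4), round(result["positives"], 4))
```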
>UNIPROTKB|Q9P2I0
symbol:CPSF2 "Cleavage and polyadenylation specificity
factor subunit 2" species:9606 "Homo sapiens" [GO:0006378 "mRNA
polyadenylation" evidence=IEA] [GO:0006379 "mRNA cleavage"
evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
[GO:0003723 "RNA binding" evidence=IEA] [GO:0005515 "protein
binding" evidence=IPI] [GO:0006398 "histone mRNA 3'-end processing"
evidence=IDA] [GO:0005847 "mRNA cleavage and polyadenylation
specificity factor complex" evidence=IDA] [GO:0000398 "mRNA
splicing, via spliceosome" evidence=TAS] [GO:0005654 "nucleoplasm"
evidence=TAS] [GO:0006366 "transcription from RNA polymerase II
promoter" evidence=TAS] [GO:0006369 "termination of RNA polymerase
II transcription" evidence=TAS] [GO:0006397 "mRNA processing"
evidence=TAS] [GO:0006406 "mRNA export from nucleus" evidence=TAS]
[GO:0008380 "RNA splicing" evidence=TAS] [GO:0010467 "gene
expression" evidence=TAS] [GO:0031124 "mRNA 3'-end processing"
evidence=TAS] Reactome:REACT_71 InterPro:IPR001279
InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
EMBL:CH471061 Reactome:REACT_1675 GO:GO:0003723 GO:GO:0016787
GO:GO:0006406 GO:GO:0000398 Reactome:REACT_1788 GO:GO:0005847
GO:GO:0006369 eggNOG:COG1236 KO:K14402 InterPro:IPR022712
InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
Pfam:PF10996 Pfam:PF13299 SMART:SM01027 Reactome:REACT_78
HOGENOM:HOG000264343 OMA:NNPFQFK CTD:53981 HOVERGEN:HBG051106
OrthoDB:EOG4MCWZQ GO:GO:0006398 EMBL:AK001627 EMBL:BC070095
EMBL:AB037788 EMBL:AL442079 IPI:IPI00419531 RefSeq:NP_059133.1
UniGene:Hs.657632 UniGene:Hs.736541 ProteinModelPortal:Q9P2I0
SMR:Q9P2I0 DIP:DIP-42500N IntAct:Q9P2I0 MINT:MINT-1697677
STRING:Q9P2I0 PhosphoSite:Q9P2I0 DMDM:51338827 PaxDb:Q9P2I0
PeptideAtlas:Q9P2I0 PRIDE:Q9P2I0 Ensembl:ENST00000298875
GeneID:53981 KEGG:hsa:53981 UCSC:uc001yah.2 GeneCards:GC14P092588
HGNC:HGNC:2325 HPA:HPA024238 MIM:606028 neXtProt:NX_Q9P2I0
PharmGKB:PA26842 InParanoid:Q9P2I0 PhylomeDB:Q9P2I0 ChiTaRS:CPSF2
GenomeRNAi:53981 NextBio:56268 ArrayExpress:Q9P2I0 Bgee:Q9P2I0
CleanEx:HS_CPSF2 Genevestigator:Q9P2I0 GermOnline:ENSG00000165934
Uniprot:Q9P2I0
Length = 782
Score = 1054 (376.1 bits), Expect = 8.2e-134, Sum P(3) = 8.2e-134
Identities = 221/537 (41%), Positives = 327/537 (60%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++T LSGV E+ L YL+ +D F FL+DCGW++HF ++ L K IDAVLL
Sbjct: 1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
SHPD LHLGALPYA+ +LGL+ +++T PVY++G + MYD Y SR +F LFTLDD+D
Sbjct: 61 SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120
Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED-VIYAVDYNRRK 179
+AF + +L +SQ +L GKG G+ + P AGH++GGT+WKI KDGE+ ++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180
Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
E HLNG LE RP++LITD++NA + QP R+QR E + +TLR GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240
Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
GRVLEL +L+ W +Y L VS + +++ KS +EWM D + + FE R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
N F +H++L S+L P PK+VLAS LE GFS D+F++W D KN ++ T R
Sbjct: 301 NPFQFRHLSLCHGLSDLARVPS-PKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
GTLAR L +P K ++ + +RV L G+EL Y E++ +LKKE A K KE +
Sbjct: 360 TPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKE-KLKKEAAKKLEQSKEADID 418
Query: 416 ASLGPDNNLSGDPMVIDXXXXXXSADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENN 475
+S ++++ D ID + + G R F + PMFP E
Sbjct: 419 SS--DESDIEED---IDQPSAHKTKHDLMMKGEGSRK---GSFFKQAKKSYPMFPAPEER 470
Query: 476 SEWDDFGEVINPDDYIIKD---EDMDQAAMHIGGDDGKLDEGSASLILDAKPSKVVS 529
+WD++GE+I P+D+++ + + +++ + G +G DE + D P+K +S
Sbjct: 471 IKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNG--DEPMDQDLSDV-PTKCIS 524
Score = 151 (58.2 bits), Expect = 8.2e-134, Sum P(3) = 8.2e-134
Identities = 37/104 (35%), Positives = 57/104 (54%)
Query: 609 ENGMLS-LLPISTPAPPHK-----SVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYV 662
E G S ++P P PPH+ SV + + +++D K L +GIQ EF GG L C V
Sbjct: 689 ETGEESEIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQV 748
Query: 663 TIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
+R+ + T +I +EG LC+D+Y+IR LY Q+ ++
Sbjct: 749 AVRR----------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
Score = 142 (55.0 bits), Expect = 8.2e-134, Sum P(3) = 8.2e-134
Identities = 37/115 (32%), Positives = 64/115 (55%)
Query: 508 DGKLDEGSASLILDA-KPSKVVSNELTVLVHGSAEATEHLKQHCL----KHVCPHVYTPQ 562
+G+ D S I++ KP +++ +VHG EA++ L + C K + VY P+
Sbjct: 541 EGRSDGDSIKKIINQMKPRQLI------IVHGPPEASQDLAECCRAFGGKDI--KVYMPK 592
Query: 563 IEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVGKTENGML 613
+ ET+D TS+ Y+V+L + L+S++ F K D E+AW+D V K + G++
Sbjct: 593 LHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDMRVSKVDTGVI 647
>UNIPROTKB|Q10568
symbol:CPSF2 "Cleavage and polyadenylation specificity
factor subunit 2" species:9913 "Bos taurus" [GO:0006398 "histone
mRNA 3'-end processing" evidence=ISS] [GO:0005847 "mRNA cleavage
and polyadenylation specificity factor complex" evidence=ISS]
[GO:0003723 "RNA binding" evidence=IEA] [GO:0016787 "hydrolase
activity" evidence=IEA] [GO:0006379 "mRNA cleavage" evidence=IEA]
[GO:0006378 "mRNA polyadenylation" evidence=IEA] InterPro:IPR001279
InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0003723 GO:GO:0016787 GO:GO:0005847 eggNOG:COG1236 KO:K14402
InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108
PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
GeneTree:ENSGT00700000104551 HOGENOM:HOG000264343 OMA:NNPFQFK
EMBL:X75931 IPI:IPI00688446 PIR:A56351 RefSeq:NP_787002.1
UniGene:Bt.4077 ProteinModelPortal:Q10568 STRING:Q10568
PRIDE:Q10568 Ensembl:ENSBTAT00000013500 GeneID:327689
KEGG:bta:327689 CTD:53981 HOVERGEN:HBG051106 InParanoid:Q10568
OrthoDB:EOG4MCWZQ NextBio:20810154 GO:GO:0006398 Uniprot:Q10568
Length = 782
Score = 1052 (375.4 bits), Expect = 1.3e-133, Sum P(3) = 1.3e-133
Identities = 221/537 (41%), Positives = 326/537 (60%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++T LSGV E+ L YL+ +D F FL+DCGW++HF ++ L K IDAVLL
Sbjct: 1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
SHPD LHLGALPYA+ +LGL+ +++T PVY++G + MYD Y SR +F LFTLDD+D
Sbjct: 61 SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120
Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED-VIYAVDYNRRK 179
+AF + +L +SQ +L GKG G+ + P AGH++GGT+WKI KDGE+ ++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180
Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
E HLNG LE RP++LITD++NA + QP R+QR E + +TLR GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240
Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
GRVLEL +L+ W +Y L VS + +++ KS +EWM D + + FE R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
N F +H++L S+L P PK+VLAS LE GFS D+F++W D KN ++ T R
Sbjct: 301 NPFQFRHLSLCHGLSDLARVPS-PKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
GTLAR L +P K ++ + +RV L G+EL Y E++ +LKKE A K KE +
Sbjct: 360 TPGTLARFLIDNPSEKVTEIELRKRVKLEGKELEEYLEKE-KLKKEAAKKLEQSKEADID 418
Query: 416 ASLGPDNNLSGDPMVIDXXXXXXSADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENN 475
+S +++ D ID + + G R F + PMFP E
Sbjct: 419 SS--DESDAEED---IDQPSAHKTKHDLMMKGEGSRK---GSFFKQAKKSYPMFPAPEER 470
Query: 476 SEWDDFGEVINPDDYIIKD---EDMDQAAMHIGGDDGKLDEGSASLILDAKPSKVVS 529
+WD++GE+I P+D+++ + + +++ + G +G DE + D P+K +S
Sbjct: 471 IKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNG--DEPMDQDLSDV-PTKCIS 524
Score = 151 (58.2 bits), Expect = 1.3e-133, Sum P(3) = 1.3e-133
Identities = 37/104 (35%), Positives = 57/104 (54%)
Query: 609 ENGMLS-LLPISTPAPPHK-----SVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYV 662
E G S ++P P PPH+ SV + + +++D K L +GIQ EF GG L C V
Sbjct: 689 ETGEESEIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQV 748
Query: 663 TIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
+R+ + T +I +EG LC+D+Y+IR LY Q+ ++
Sbjct: 749 AVRR----------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
Score = 142 (55.0 bits), Expect = 1.3e-133, Sum P(3) = 1.3e-133
Identities = 37/115 (32%), Positives = 64/115 (55%)
Query: 508 DGKLDEGSASLILDA-KPSKVVSNELTVLVHGSAEATEHLKQHCL----KHVCPHVYTPQ 562
+G+ D S I++ KP +++ +VHG EA++ L + C K + VY P+
Sbjct: 541 EGRSDGDSIKKIINQMKPRQLI------IVHGPPEASQDLAECCRAFGGKDI--KVYMPK 592
Query: 563 IEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVGKTENGML 613
+ ET+D TS+ Y+V+L + L+S++ F K D E+AW+D V K + G++
Sbjct: 593 LHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDMRVSKVDTGVI 647
>UNIPROTKB|E2R496
symbol:CPSF2 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0006398 "histone mRNA 3'-end processing"
evidence=IEA] [GO:0005847 "mRNA cleavage and polyadenylation
specificity factor complex" evidence=IEA] [GO:0016787 "hydrolase
activity" evidence=IEA] [GO:0006379 "mRNA cleavage" evidence=IEA]
[GO:0006378 "mRNA polyadenylation" evidence=IEA] InterPro:IPR001279
InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0016787 GO:GO:0005847 KO:K14402 InterPro:IPR022712
InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
Pfam:PF10996 Pfam:PF13299 SMART:SM01027
GeneTree:ENSGT00700000104551 OMA:NNPFQFK CTD:53981 GO:GO:0006398
EMBL:AAEX03005582 RefSeq:XP_537353.2 ProteinModelPortal:E2R496
Ensembl:ENSCAFT00000017381 GeneID:480230 KEGG:cfa:480230
NextBio:20855279 Uniprot:E2R496
Length = 782
Score = 1051 (375.0 bits), Expect = 1.7e-133, Sum P(3) = 1.7e-133
Identities = 219/537 (40%), Positives = 327/537 (60%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++T LSGV E+ L YL+ +D F FL+DCGW++HF ++ L K IDAVLL
Sbjct: 1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
SHPD LHLGALPYA+ +LGL+ +++T PVY++G + MYD Y SR +F LFTLDD+D
Sbjct: 61 SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120
Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED-VIYAVDYNRRK 179
+AF + +L +SQ +L GKG G+ + P AGH++GGT+WKI KDGE+ ++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180
Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
E HLNG LE RP++LITD++NA + QP R+QR E + +TLR GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240
Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
GRVLEL +L+ W +Y L VS + +++ KS +EWM D + + FE R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
N F +H++L S+L P PK+VLAS LE GFS D+F++W D KN ++ T R
Sbjct: 301 NPFQFRHLSLCHGLSDLARVPS-PKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
GTLAR L +P K ++ + +RV L G+EL Y E++ +LKKE A K KE +
Sbjct: 360 TPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKE-KLKKEAAKKLEQSKEADID 418
Query: 416 ASLGPDNNLSGDPMVIDXXXXXXSADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENN 475
+S ++++ D + D++ G + F + PMFP E
Sbjct: 419 SS--DESDVEED--IDQPSAHKMKHDLMMKGEGSRKG----SFFKQAKKSYPMFPAPEER 470
Query: 476 SEWDDFGEVINPDDYIIKD---EDMDQAAMHIGGDDGKLDEGSASLILDAKPSKVVS 529
+WD++GE+I P+D+++ + + +++ + G +G DE + D P+K +S
Sbjct: 471 IKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNG--DEPMDQDLSDV-PTKCIS 524
Score = 151 (58.2 bits), Expect = 1.7e-133, Sum P(3) = 1.7e-133
Identities = 37/104 (35%), Positives = 57/104 (54%)
Query: 609 ENGMLS-LLPISTPAPPHK-----SVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYV 662
E G S ++P P PPH+ SV + + +++D K L +GIQ EF GG L C V
Sbjct: 689 ETGEESEIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQV 748
Query: 663 TIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
+R+ + T +I +EG LC+D+Y+IR LY Q+ ++
Sbjct: 749 AVRR----------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
Score = 142 (55.0 bits), Expect = 1.7e-133, Sum P(3) = 1.7e-133
Identities = 37/115 (32%), Positives = 64/115 (55%)
Query: 508 DGKLDEGSASLILDA-KPSKVVSNELTVLVHGSAEATEHLKQHCL----KHVCPHVYTPQ 562
+G+ D S I++ KP +++ +VHG EA++ L + C K + VY P+
Sbjct: 541 EGRSDGDSIKKIINQMKPRQLI------IVHGPPEASQDLAECCRAFGGKDI--KVYMPK 592
Query: 563 IEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVGKTENGML 613
+ ET+D TS+ Y+V+L + L+S++ F K D E+AW+D V K + G++
Sbjct: 593 LHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDMRVSKVDTGVI 647
>RGD|1309687
symbol:Cpsf2 "cleavage and polyadenylation specific factor 2,
100kDa" species:10116 "Rattus norvegicus" [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=IEA;ISO] [GO:0006378 "mRNA polyadenylation" evidence=IEA]
[GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006398 "histone mRNA
3'-end processing" evidence=IEA;ISO] [GO:0016787 "hydrolase
activity" evidence=IEA] InterPro:IPR001279 InterPro:IPR027075
Pfam:PF00753 SMART:SM00849 Pfam:PF07521 RGD:1309687 GO:GO:0016787
EMBL:CH473982 GO:GO:0005847 KO:K14402 InterPro:IPR022712
InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
Pfam:PF10996 Pfam:PF13299 SMART:SM01027
GeneTree:ENSGT00700000104551 OMA:NNPFQFK CTD:53981
OrthoDB:EOG4MCWZQ GO:GO:0006398 IPI:IPI00189534
RefSeq:NP_001100223.1 UniGene:Rn.8038 Ensembl:ENSRNOT00000008612
GeneID:299256 KEGG:rno:299256 UCSC:RGD:1309687 NextBio:645098
Uniprot:D3Z9E6
Length = 782
Score = 1048 (374.0 bits), Expect = 2.7e-133, Sum P(3) = 2.7e-133
Identities = 219/537 (40%), Positives = 327/537 (60%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++T LSGV E+ L YL+ +D F FL+DCGW++HF ++ L K IDAVLL
Sbjct: 1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSVDIIDSLRKHVHQIDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
SHPD LHLGALP+A+ +LGL+ +++T PVY++G + MYD Y SR +F LFTLDD+D
Sbjct: 61 SHPDPLHLGALPFAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120
Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED-VIYAVDYNRRK 179
+AF + +L +SQ +L GKG G+ + P AGH++GGT+WKI KDGE+ ++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180
Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
E HLNG LE RP++LITD++NA + QP R+QR E + +TLR GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240
Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
GRVLEL +L+ W +Y L VS + +++ KS +EWM D + + FE R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
N F +H++L S+L P PK+VLAS LE GFS D+F++W D KN ++ T R
Sbjct: 301 NPFQFRHLSLCHGLSDLARVPS-PKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
GTLAR L +P K ++ + +RV L G+EL Y E++ +LKKE A K KE +
Sbjct: 360 TPGTLARFLIDNPSEKVTEIELRKRVKLEGKELEEYVEKE-KLKKEAAKKLEQSKEADID 418
Query: 416 ASLGPDNNLSGDPMVIDXXXXXXSADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENN 475
+S ++++ D V D++ G + F + PMFP E
Sbjct: 419 SS--DESDVEED--VDQPTAHKTKHDLMMKGEGSRKG----SFFKQAKKSYPMFPAPEER 470
Query: 476 SEWDDFGEVINPDDYIIKD---EDMDQAAMHIGGDDGKLDEGSASLILDAKPSKVVS 529
+WD++GE+I P+D+++ + + +++ + G +G +E + D P+K VS
Sbjct: 471 IKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNG--EEPMDQDLSDV-PTKCVS 524
Score = 152 (58.6 bits), Expect = 2.7e-133, Sum P(3) = 2.7e-133
Identities = 35/106 (33%), Positives = 59/106 (55%)
Query: 602 DAEVGKTENGMLSLLPIST-PAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGE 660
+ E+G+ + +L P+ P H+SV + + +++D K L +GIQ EF GG L C
Sbjct: 687 EKELGEESEVIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNN 746
Query: 661 YVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
V +R+ + T +I +EG LC+D+Y+IR LY Q+ ++
Sbjct: 747 QVAVRR----------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
Score = 142 (55.0 bits), Expect = 2.7e-133, Sum P(3) = 2.7e-133
Identities = 37/115 (32%), Positives = 64/115 (55%)
Query: 508 DGKLDEGSASLILDA-KPSKVVSNELTVLVHGSAEATEHLKQHCL----KHVCPHVYTPQ 562
+G+ D S I++ KP +++ +VHG EA++ L + C K + VY P+
Sbjct: 541 EGRSDGDSIKKIINQMKPRQLI------IVHGPPEASQDLAECCRAFGGKDI--KVYMPK 592
Query: 563 IEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVGKTENGML 613
+ ET+D TS+ Y+V+L + L+S++ F K D E+AW+D V K + G++
Sbjct: 593 LHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDMRVSKVDTGVI 647
Score = 42 (19.8 bits), Expect = 2.2e-06, Sum P(2) = 2.2e-06
Identities = 35/135 (25%), Positives = 50/135 (37%)
Query: 386 EELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDXXXXXXSADVVEP 445
EE I ++E +K E+ L L EE K+ L +PM D +DV
Sbjct: 468 EERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGEEPMDQDL------SDVPTK 521
Query: 446 HGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVIN---PDDYII------KDED 496
I I V T + YE S+ D ++IN P II +D
Sbjct: 522 CVSATESIEIKARV---TYID-----YEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQD 573
Query: 497 MDQAAMHIGGDDGKL 511
+ + GG D K+
Sbjct: 574 LAECCRAFGGKDIKV 588
>UNIPROTKB|Q9W799
symbol:cpsf2 "Cleavage and polyadenylation specificity
factor subunit 2" species:8355 "Xenopus laevis" [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=ISS] InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 GO:GO:0005737 GO:GO:0006397
GO:GO:0003723 GO:GO:0016787 GO:GO:0005847 KO:K14402
InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108
PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
CTD:53981 HOVERGEN:HBG051106 EMBL:AF139986 RefSeq:NP_001081123.1
UniGene:Xl.3876 ProteinModelPortal:Q9W799 GeneID:394394
KEGG:xla:394394 Xenbase:XB-GENE-950598 Uniprot:Q9W799
Length = 783
Score = 1041 (371.5 bits), Expect = 2.7e-133, Sum P(3) = 2.7e-133
Identities = 217/538 (40%), Positives = 325/538 (60%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++T L G E+ + YL+ +D F FL+DCGW+++F ++ + K +DAVLL
Sbjct: 1 MTSIIKLTTLVGAQEESAVCYLLQVDEFRFLLDCGWDENFSMDIIDSVKKYVHQVDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
SHPD LHLGALPYA+ +LGL+ +++T PVY++G + MYD Y SR +F LF+LDD+D
Sbjct: 61 SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFSLFSLDDVD 120
Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED-VIYAVDYNRRK 179
AF + +L Y+Q HL GKG G+ + P AGH++GGT+WKI KDGE+ ++YAVD+N ++
Sbjct: 121 CAFDKIQQLKYNQIVHLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180
Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
E HLNG LE RP++LITD++NA + QP R+QR E + +TLR GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMINRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240
Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
GRVLEL +L+ W +Y L VS + +++ KS +EWM D + + FE R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
N F +H+TL S+L P PK+VLAS LE GFS ++F++W D KN V+ T R
Sbjct: 301 NPFQFRHLTLCHGYSDLARVPS-PKVVLASQPDLECGFSRELFIQWCQDPKNSVILTYRT 359
Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
GTLAR L P + + + + +RV L G+EL Y E++ +LKKE A K KE +
Sbjct: 360 TPGTLARFLIDHPSERIIDIELRKRVKLEGKELEEYVEKE-KLKKEAAKKLEQSKEADLD 418
Query: 416 ASLGPDNNLSGDPMVIDXXXXXXSADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENN 475
+S D+++ D + D++ + G + F + PMFP E+
Sbjct: 419 SS--DDSDVEED--IDQITSHKAKHDLMMKNEGSRKG----SFFKQAKKSYPMFPAPEDR 470
Query: 476 SEWDDFGEVINPDDYIIKD----EDMDQAAMHIGGDDGKLDEGSASLILDAKPSKVVS 529
+WD++GE+I P+D+++ + ED ++ + G +G DE + D P+K VS
Sbjct: 471 IKWDEYGEIIKPEDFLVPELQVTED-EKTKLESGLTNG--DEPMDQDLSDV-PTKCVS 524
Score = 151 (58.2 bits), Expect = 2.7e-133, Sum P(3) = 2.7e-133
Identities = 36/106 (33%), Positives = 57/106 (53%)
Query: 602 DAEVGKTENGMLSLLPI-STPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGE 660
D E + + +L P+ S P H+SV + + +++D K L +GI EF GG L C
Sbjct: 688 DKEFSEESEIIPTLEPLPSNEVPGHQSVFMNEPRLSDFKQVLLREGIHAEFVGGVLVCNN 747
Query: 661 YVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
V +R+ + T +I +EG LCED++KIR LY Q+ ++
Sbjct: 748 MVAVRR----------TETGRIGLEGCLCEDFFKIRELLYEQYAIV 783
Score = 150 (57.9 bits), Expect = 2.7e-133, Sum P(3) = 2.7e-133
Identities = 38/115 (33%), Positives = 65/115 (56%)
Query: 508 DGKLDEGSASLILDA-KPSKVVSNELTVLVHGSAEATEHLKQHCL----KHVCPHVYTPQ 562
+G+ D S I++ KP +++ +VHG +AT+ L + C K + VYTP+
Sbjct: 541 EGRSDGDSIKKIINQMKPRQLI------IVHGPPDATQDLAEACRAFGGKDI--KVYTPK 592
Query: 563 IEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVGKTENGML 613
+ ET+D TS+ Y+V+L + L+S++ F K D E+AW+D V K + G++
Sbjct: 593 LHETVDATSETHIYQVRLKDSLVSSLKFCKAKDTELAWIDGVLDMRVSKVDTGVI 647
>ZFIN|ZDB-GENE-040718-79
symbol:cpsf2 "cleavage and polyadenylation specific
factor 2" species:7955 "Danio rerio" [GO:0006378 "mRNA
polyadenylation" evidence=IEA] [GO:0016787 "hydrolase activity"
evidence=IEA] [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0005847
"mRNA cleavage and polyadenylation specificity factor complex"
evidence=IEA] InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 ZFIN:ZDB-GENE-040718-79 GO:GO:0016787
eggNOG:COG1236 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
SMART:SM01027 HOGENOM:HOG000264343 CTD:53981 HOVERGEN:HBG051106
OrthoDB:EOG4MCWZQ EMBL:BC076029 IPI:IPI00512505
RefSeq:NP_001002384.1 UniGene:Dr.121547 ProteinModelPortal:Q6DHE5
STRING:Q6DHE5 PRIDE:Q6DHE5 GeneID:436657 KEGG:dre:436657
InParanoid:Q6DHE5 NextBio:20831102 ArrayExpress:Q6DHE5 Bgee:Q6DHE5
Uniprot:Q6DHE5
Length = 790
Score = 1045 (372.9 bits), Expect = 3.5e-133, Sum P(3) = 3.5e-133
Identities = 219/545 (40%), Positives = 331/545 (60%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++T LSGV E+ L YL+ +D F FL+DCGW++ F ++ L + +DAVLL
Sbjct: 1 MTSIIKLTALSGVQEESALCYLLQVDEFRFLLDCGWDETFSMDIIDSLKRYVHQVDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
SHPD +HLGALPYA+ +LGL+ +++T PVY++G + MYD Y SR +F LFTLDD+D
Sbjct: 61 SHPDHVHLGALPYAVGKLGLNCTIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120
Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED-VIYAVDYNRRK 179
SAF + +L YSQ +L GKG G+ + P AGH++GGT+WKI KDGE+ +IY VD+N ++
Sbjct: 121 SAFDKIQQLKYSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIIYGVDFNHKR 180
Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
E HLNG LES RP++LITD++NA + QP R+QR E + +TLR GNVL+ VD+A
Sbjct: 181 EIHLNGCSLESLSRPSLLITDSFNASYVQPRRKQRDEQLLTNVMETLRGDGNVLIAVDTA 240
Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
GRVLEL +L+ W +Y L VS + +++ KS +EWM D + + FE R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
N F +H++L + S+L P PK+VL S LE+GFS ++F++W D KN V+ T R
Sbjct: 301 NPFQFRHLSLCHSLSDLARVPS-PKVVLCSQPDLESGFSRELFIQWCQDAKNSVILTYRT 359
Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
GTLAR L +P K +++ + +R L G EL Y E++ R+KKE A K KE +
Sbjct: 360 TPGTLARYLIDNPGEKRIELEIRKRCRLEGRELEEYMEKE-RMKKEAAKKLEQAKEVDLD 418
Query: 416 ASLGPDNNLSGD---PMVIDXXXXXXSADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFY 472
+S ++++ D P V+ +++ GGR GF + MFP +
Sbjct: 419 SS--DESDMEDDLEQPAVVKTKHHDL---MMKGEGGRK-----GGFFKQAKKSYSMFPTH 468
Query: 473 ENNSEWDDFGEVINPDDYIIKD---EDMDQAAMHIGGDDGKLDEGSASLILDAKPSKVVS 529
E +WD++GE+I P+D+++ + + +++ + G +G +E + D P+K S
Sbjct: 469 EERIKWDEYGEIIRPEDFLVPELQATEEEKSKLESGLTNG--EEPMEQDLSDV-PTKCTS 525
Query: 530 NELTV 534
T+
Sbjct: 526 TTQTL 530
Score = 151 (58.2 bits), Expect = 3.5e-133, Sum P(3) = 3.5e-133
Identities = 35/103 (33%), Positives = 56/103 (54%)
Query: 602 DAEVGKTENGMLSLLPIST-PAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGE 660
+ E+ + + + +L P+ P H+SV + + +++D K L +GIQ EF GG L C
Sbjct: 695 EKEISEESDVIPTLEPLPAHEVPGHQSVFINEPRLSDFKQVLLREGIQAEFVGGVLVCNN 754
Query: 661 YVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQF 703
V +R+ AG+ I +EG C+DYY+IR LY Q+
Sbjct: 755 LVAVRRT-EAGR---------ICLEGCHCDDYYRIRELLYEQY 787
Score = 145 (56.1 bits), Expect = 3.5e-133, Sum P(3) = 3.5e-133
Identities = 38/125 (30%), Positives = 69/125 (55%)
Query: 496 DMDQAAMHIGGDDGKLDEGSASLILDA-KPSKVVSNELTVLVHGSAEATEHLKQHCLKHV 554
D+ M+I + G+ D S I++ KP +++ +VHG +A++ L + C +
Sbjct: 531 DIRARVMYIDYE-GRSDGDSIKKIINQMKPRQLI------IVHGPPDASQDLAESCKAYS 583
Query: 555 CPH--VYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVGKT 608
VY P+++ET+D TS+ Y+V+L + L+S++ F K D E+AW+D V K
Sbjct: 584 GKDIKVYIPKLQETVDATSETHIYQVRLKDSLVSSLQFCKARDTELAWIDGVLDMRVEKV 643
Query: 609 ENGML 613
+ G++
Sbjct: 644 DTGVI 648
Score = 43 (20.2 bits), Expect = 2.3e-06, Sum P(2) = 2.3e-06
Identities = 40/167 (23%), Positives = 62/167 (37%)
Query: 386 EELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDXXXXXXSADVVEP 445
EE I ++E ++ E+ L L EE K+ L +PM D +DV
Sbjct: 469 EERIKWDEYGEIIRPEDFLVPELQATEEEKSKLESGLTNGEEPMEQDL------SDVPTK 522
Query: 446 HGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKDEDMDQAAMHIG 505
+ + I V M+ YE S+ D ++IN +K + +H G
Sbjct: 523 CTSTTQTLDIRARV--------MYIDYEGRSDGDSIKKIINQ----MKPRQL--IIVH-G 567
Query: 506 GDDGKLDEGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQHCLK 552
D D + K KV +L V ++E H+ Q LK
Sbjct: 568 PPDASQDLAESCKAYSGKDIKVYIPKLQETVDATSET--HIYQVRLK 612
>UNIPROTKB|F1NMN0
symbol:CPSF2 "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0006378 "mRNA polyadenylation" evidence=IEA]
[GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0016787 "hydrolase
activity" evidence=IEA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IEA]
[GO:0006398 "histone mRNA 3'-end processing" evidence=IEA]
InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 GO:GO:0016787 GO:GO:0005847 InterPro:IPR022712
InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
Pfam:PF10996 Pfam:PF13299 SMART:SM01027
GeneTree:ENSGT00700000104551 OMA:NNPFQFK GO:GO:0006398
EMBL:AADN02003653 IPI:IPI00651282 Ensembl:ENSGALT00000017538
Uniprot:F1NMN0
Length = 782
Score = 1044 (372.6 bits), Expect = 5.7e-133, Sum P(3) = 5.7e-133
Identities = 210/499 (42%), Positives = 308/499 (61%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++T LSGV E+ L YL+ +D F FL+DCGW+++F ++ L K +DAVLL
Sbjct: 1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDENFSMDIIDSLKKHVHQVDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
SHPD LHLGALPYA+ ++GL+ +++T PVY++G + MYD Y SR +F LFTLDD+D
Sbjct: 61 SHPDPLHLGALPYAVGKMGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120
Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED-VIYAVDYNRRK 179
+AF + +L +SQ +L GKG G+ + P AGH++GGT+WKI KDGE+ ++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180
Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
E HLNG LE RP++LITD++NA + QP R+QR E + +TLR GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240
Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
GRVLEL +L+ W +Y L VS + +++ KS +EWM D + + FE R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
N F +H++L + S+L P PK+VLAS LE GFS D+F++W D KN ++ T R
Sbjct: 301 NPFQFRHLSLCHSLSDLARVPS-PKVVLASQPDLECGFSRDLFIQWCQDSKNSIILTYRT 359
Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
GTLAR L +P K + + + RRV L G+EL Y E++ +LKKE A K KE +
Sbjct: 360 TPGTLARFLIDNPSEKVIDIELRRRVKLEGKELEEYLEKE-KLKKEAAKKLEQSKEADID 418
Query: 416 ASLGPDNNLSGDPMVIDXXXXXXSADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENN 475
+S D D + +++ G R F + PMFP E
Sbjct: 419 SSDESDAEEDIDQPTVHKTKHDL---MMKGEGSRK-----GSFFKQAKKSYPMFPAPEER 470
Query: 476 SEWDDFGEVINPDDYIIKD 494
+WD++GE+I P+D+++ +
Sbjct: 471 IKWDEYGEIIKPEDFLVPE 489
Score = 151 (58.2 bits), Expect = 5.7e-133, Sum P(3) = 5.7e-133
Identities = 34/97 (35%), Positives = 54/97 (55%)
Query: 615 LLPISTPAPPHK-----SVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGP 669
++P P PPH+ SV + + +++D K L +GIQ EF GG L C V +R+
Sbjct: 696 IIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNMVAVRR--- 752
Query: 670 AGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
+ T +I +EG LC+D+Y+IR LY Q+ ++
Sbjct: 753 -------TETGRIGLEGCLCQDFYRIRELLYKQYAIV 782
Score = 144 (55.7 bits), Expect = 5.7e-133, Sum P(3) = 5.7e-133
Identities = 46/144 (31%), Positives = 74/144 (51%)
Query: 480 DFGEVINPDDYIIKDEDMDQAAMHIGGD-DGKLDEGSASLILDA-KPSKVVSNELTVLVH 537
D +V P I E M+ A D +G+ D S I++ KP ++V +VH
Sbjct: 514 DLSDV--PTKCISATESMEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLV------IVH 565
Query: 538 GSAEATEHLKQHCL----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKL 593
G EA++ L + C K + VY P++ ET+D TS+ Y+V+L + L+S++ F K
Sbjct: 566 GPPEASQDLAECCRAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKA 623
Query: 594 GDYEIAWVDA----EVGKTENGML 613
D E+AW+D V K + G++
Sbjct: 624 KDAELAWIDGVLDMRVSKVDTGVI 647
Score = 46 (21.3 bits), Expect = 6.3e-17, Sum P(3) = 6.3e-17
Identities = 14/44 (31%), Positives = 20/44 (45%)
Query: 386 EELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPM 429
EE I ++E +K E+ L L EE K+ L +PM
Sbjct: 468 EERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGEEPM 511
Score = 38 (18.4 bits), Expect = 7.3e-06, Sum P(2) = 7.3e-06
Identities = 15/44 (34%), Positives = 20/44 (45%)
Query: 485 INPDDYIIKDEDMDQAAMHIGGDDGKLD-EGS--ASLILDAKPS 525
I+ D +ED+DQ +H D + EGS S AK S
Sbjct: 417 IDSSDESDAEEDIDQPTVHKTKHDLMMKGEGSRKGSFFKQAKKS 460
>FB|FBgn0027873
symbol:Cpsf100 "Cleavage and polyadenylation specificity
factor 100" species:7227 "Drosophila melanogaster" [GO:0006379
"mRNA cleavage" evidence=ISS;NAS] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=ISS]
[GO:0006378 "mRNA polyadenylation" evidence=ISS;IMP;NAS]
[GO:0003730 "mRNA 3'-UTR binding" evidence=NAS] [GO:0016787
"hydrolase activity" evidence=IEA] [GO:0006398 "histone mRNA 3'-end
processing" evidence=IMP] [GO:0022008 "neurogenesis" evidence=IMP]
InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 EMBL:AE014297 GO:GO:0022008 GO:GO:0006378
GO:GO:0003723 GO:GO:0016787 GO:GO:0005847 GO:GO:0006379
eggNOG:COG1236 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
SMART:SM01027 GeneTree:ENSGT00700000104551 OMA:NNPFQFK
GO:GO:0006398 EMBL:AF160933 RefSeq:NP_651658.1 RefSeq:NP_733264.1
UniGene:Dm.1362 ProteinModelPortal:Q9V3D6 SMR:Q9V3D6 IntAct:Q9V3D6
STRING:Q9V3D6 PaxDb:Q9V3D6 PRIDE:Q9V3D6 EnsemblMetazoa:FBtr0085357
GeneID:43426 KEGG:dme:Dmel_CG1957 UCSC:CG1957-RA CTD:43426
FlyBase:FBgn0027873 InParanoid:Q8IML7 OrthoDB:EOG4XD261
PhylomeDB:Q9V3D6 GenomeRNAi:43426 NextBio:833860 Bgee:Q9V3D6
GermOnline:CG1957 Uniprot:Q9V3D6
Length = 756
Score = 1003 (358.1 bits), Expect = 1.1e-120, Sum P(2) = 1.1e-120
Identities = 222/567 (39%), Positives = 337/567 (59%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++ +SG +E+P Y++ ID L+DCGW++ FD + ++ L + T+DAVLL
Sbjct: 1 MTSIIKLHTISGAMDESPPCYILQIDDVRILLDCGWDEKFDANFIKELKRQVHTLDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
SHPD HLGALPY + +LGL+ P+++T PV+++G + MYD Y+S + +FDLF+LDD+D
Sbjct: 61 SHPDAYHLGALPYLVGKLGLNCPIYATIPVFKMGQMFMYDLYMSHFNMGDFDLFSLDDVD 120
Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGE-DVIYAVDYNRRK 179
+AF+ +T+L Y+Q L KG GI + P AGH++GGT+WKI K GE D++YA D+N +K
Sbjct: 121 TAFEKITQLKYNQTVSLKDKGYGISITPLNAGHMIGGTIWKIVKVGEEDIVYATDFNHKK 180
Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
E+HL+G L+ RP++LITDAYNA + Q R+ R E I +T+R GNVL+ VD+A
Sbjct: 181 ERHLSGCELDRLQRPSLLITDAYNAQYQQARRRARDEKLMTNILQTVRNNGNVLIAVDTA 240
Query: 239 GRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
GRVLEL +L+ W + Y + L VS + I++ KS +EWM D +TK+FE +R+
Sbjct: 241 GRVLELAHMLDQLWKNKESGLMAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARN 300
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
N F KH+ L + +++ P GPK+VLAS LE+GF+ D+FV+WAS+ N ++ T R
Sbjct: 301 NPFQFKHIQLCHSLADVYKLPAGPKVVLASTPDLESGFTRDLFVQWASNANNSIILTTRT 360
Query: 356 QFGTLA-RMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVK---E 411
GTLA +++ P K +++ + RRV L G EL EE R + E+ L +VK E
Sbjct: 361 SPGTLAMELVENCAPGKQIELDVRRRVDLEGAEL----EEYLRTQGEK-LNPLIVKPDVE 415
Query: 412 EESKASLGPDNNLSGDPMVIDXXXXXXSADVVEPHGGRYRDILIDGFVPPSTSVAPMFPF 471
EES + D +S VI VV P G + GF + MFP+
Sbjct: 416 EESSSESEDDIEMS----VITGKHDI----VVRPEGRHH-----SGFFKSNKRHHVMFPY 462
Query: 472 YENNSEWDDFGEVINPDDYIIKD--------------EDMDQAAMHIGGD---DGKLDEG 514
+E + D++GE+IN DDY I D E++ + IG + +G + +
Sbjct: 463 HEEKVKCDEYGEIINLDDYRIADATGYEFVPMEEQNKENVKKEEPGIGAEQQANGGIVDN 522
Query: 515 SASLILDAKPSKVVSNELTVLVHGSAE 541
L+ KP+K++S T+ V+ +
Sbjct: 523 DVQLL--EKPTKLISQRKTIEVNAQVQ 547
Score = 205 (77.2 bits), Expect = 1.1e-120, Sum P(2) = 1.1e-120
Identities = 63/215 (29%), Positives = 108/215 (50%)
Query: 508 DGKLD-EGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEET 566
+G+ D E ++ +P +V+ ++HG+AE T+ + +HC ++V V+TPQ E
Sbjct: 552 EGRSDGESMLKILSQLRPRRVI------VIHGTAEGTQVVARHCEQNVGARVFTPQKGEI 605
Query: 567 IDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPPHK 626
IDVTS++ Y+V+L+E L+S + F+K D E+AWVD +G + + P+
Sbjct: 606 IDVTSEIHIYQVRLTEGLVSQLQFQKGKDAEVAWVDGRLGMRVKAIEA--PMDVTVEQDA 663
Query: 627 SVLVGDLKMADLKPFLSSK-GIQVEFAGGALRCGEYV-TIRKVGPAGQKGGG-----SGT 679
SV G K L+ + I L+ ++ T+ + + GG +GT
Sbjct: 664 SVQEG--KTLTLETLADDEIPIHNSVLINELKLSDFKQTLMRNNINSEFSGGVLWCSNGT 721
Query: 680 Q--------QIVIEGPLCEDYYKIRAYLYSQFYLL 706
++ +EG L E+YYKIR LY Q+ ++
Sbjct: 722 LALRRVDAGKVAMEGCLSEEYYKIRELLYEQYAIV 756
>DICTYBASE|DDB_G0270392
symbol:cpsf2 "cleavage and polyadenylation
specificity factor 100 kDa subunit" species:44689 "Dictyostelium
discoideum" [GO:0016787 "hydrolase activity" evidence=IEA]
[GO:0006379 "mRNA cleavage" evidence=IEA;ISS] [GO:0006378 "mRNA
polyadenylation" evidence=IEA;ISS] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IEA;ISS]
[GO:0003723 "RNA binding" evidence=IEA;ISS] [GO:0006397 "mRNA
processing" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753 SMART:SM00849
dictyBase:DDB_G0270392 Pfam:PF07521 EMBL:AAFI02000005
GenomeReviews:CM000150_GR GO:GO:0006378 GO:GO:0003723 GO:GO:0016787
GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236 KO:K14402
InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108
PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
OMA:NNPFQFK RefSeq:XP_646760.1 ProteinModelPortal:Q55BS1
STRING:Q55BS1 EnsemblProtists:DDB0233700 GeneID:8617733
KEGG:ddi:DDB_G0270392 ProtClustDB:CLSZ2431463 Uniprot:Q55BS1
Length = 784
Score = 869 (311.0 bits), Expect = 5.7e-120, Sum P(4) = 5.7e-120
Identities = 184/430 (42%), Positives = 271/430 (63%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + ++ T LSG +E+P YL+ ID F L+DCG + + D SLL+PL KVA IDAVLL
Sbjct: 1 MASIIKFTALSGAKDESPPCYLLEIDDFCILLDCGLSYNLDFSLLEPLEKVAKKIDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
SH DT H+G LPY + + GL+ ++ T PV ++G + +YD Y ++ EF ++LD+ID
Sbjct: 61 SHSDTTHIGGLPYVVGKYGLTGTIYGTTPVLKMGTMFLYDLYENKMSQEEFQQYSLDNID 120
Query: 121 SAF--QSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRR 178
S F L++SQ+Y LSGKG+GI + P++AGH +G +VWKITK ++YA+DYN R
Sbjct: 121 SCFGEDRFKELSFSQHYSLSGKGKGISITPYLAGHTIGASVWKITKGTYSIVYAIDYNHR 180
Query: 179 KEKHLNGTVLES-FVRPAVLITDAYN-----ALHNQPPRQQREMFQDAISKTLRAGGNVL 232
E HL+ L S ++P++LITD+ A R Q +F+ I++ LR GGNVL
Sbjct: 181 NEGHLDSLQLTSDILKPSLLITDSKGVDKTLAFKKTITRDQ-SLFEQ-INRNLRDGGNVL 238
Query: 233 LPVDSAGRVLELLLILEDYWAEH-SLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF 290
+PVD+AGRVLELLL +E+YW+++ SL Y + FL S S + +S LE+M + + F
Sbjct: 239 IPVDTAGRVLELLLCIENYWSKNKSLALYSVVFLGRFSFSVCQFARSQLEFMSSTASVKF 298
Query: 291 ETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
E + +N F KH+ +L + EL PD K++L S LE GFS ++F++W SD K L+L
Sbjct: 299 EQNIENPFSFKHIKILSSLEELQELPDTNKVILTSSQDLETGFSRELFIQWCSDPKTLIL 358
Query: 351 FTERGQFGTLARML--QADPPP---KAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALK 405
FT++ +LA L Q P K +++ RVPL G+EL+ YE EQ + ++E+ L+
Sbjct: 359 FTQKIPKDSLADKLIKQYSTPNGRGKCIEIVQGSRVPLTGDELLQYEMEQAKQREEKRLE 418
Query: 406 ASLVKEEESK 415
L KE+E +
Sbjct: 419 -QLRKEQEER 427
Score = 135 (52.6 bits), Expect = 5.7e-120, Sum P(4) = 5.7e-120
Identities = 32/97 (32%), Positives = 51/97 (52%)
Query: 610 NGMLSLLPISTPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGP 669
N + +T H +GD+K++DLK L + GIQV+F G L CG V I +
Sbjct: 694 NNTTMMTTTTTTTNGHDESFIGDIKLSDLKQVLVNAGIQVQFDQGILNCGGLVYIWR--- 750
Query: 670 AGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
+ GG+ I ++G + ++YY I+ LY QF ++
Sbjct: 751 -DEDHGGNSI--INVDGIISDEYYLIKELLYKQFQIV 784
Score = 113 (44.8 bits), Expect = 5.7e-120, Sum P(4) = 5.7e-120
Identities = 22/91 (24%), Positives = 51/91 (56%)
Query: 534 VLVHGSAEATEHLKQHCLKHV-CPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKK 592
VL+ GS + ++ ++ + +++ +Y P I E +D+TSD Y++ L + L++ + K
Sbjct: 584 VLIRGSEQQSQSIENYVKENIRTKGIYIPSIGEQLDLTSDTNVYELLLKDSLVNTLKTSK 643
Query: 593 LGDYEIAWVDAEVGKTENGMLSLLPISTPAP 623
+ DYE++++ +V + + +L + P
Sbjct: 644 ILDYEVSYIQGKVDILDGSNVPVLDLIQSIP 674
Score = 109 (43.4 bits), Expect = 5.7e-120, Sum P(4) = 5.7e-120
Identities = 36/143 (25%), Positives = 65/143 (45%)
Query: 371 KAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKA-----------SLVKEEESKASLG 419
K +++ RVPL G+EL+ YE EQ + ++E+ L+ ++EEE + L
Sbjct: 384 KCIEIVQGSRVPLTGDELLQYEMEQAKQREEKRLEQLRKEQEEREERERLEEEEREQLLN 443
Query: 420 PDNNLSGDPMV-IDXXXXXXSAD-----VVEPHGGRYRDILIDGFVPPSTSVAPMFPFYE 473
N ++ + D + P D+L F S+ MFP++E
Sbjct: 444 ATNQDQLQQLLQLQQQKERGIIDDSMVHMKNPFENDRFDLLDSEF--KKQSMITMFPYFE 501
Query: 474 NNSEWDDFGEVINPDDYIIKDED 496
+ +W ++GE DD I++++D
Sbjct: 502 KHLKWGEYGE--EDDDLILRNQD 522
>MGI|MGI:1861601
symbol:Cpsf2 "cleavage and polyadenylation specific factor
2" species:10090 "Mus musculus" [GO:0003674 "molecular_function"
evidence=ND] [GO:0003723 "RNA binding" evidence=IEA] [GO:0005634
"nucleus" evidence=IEA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=ISO;IDA]
[GO:0006397 "mRNA processing" evidence=IEA] [GO:0006398 "histone
mRNA 3'-end processing" evidence=ISO] [GO:0016787 "hydrolase
activity" evidence=IEA] InterPro:IPR001279 InterPro:IPR027075
Pfam:PF00753 SMART:SM00849 Pfam:PF07521 MGI:MGI:1861601
GO:GO:0003723 GO:GO:0016787 GO:GO:0005847 eggNOG:COG1236 KO:K14402
InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108
PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
GeneTree:ENSGT00700000104551 HOGENOM:HOG000264343 OMA:NNPFQFK
CTD:53981 HOVERGEN:HBG051106 OrthoDB:EOG4MCWZQ GO:GO:0006398
EMBL:AF012822 EMBL:BC013628 EMBL:BC007163 IPI:IPI00314302
RefSeq:NP_058552.1 UniGene:Mm.716 ProteinModelPortal:O35218
SMR:O35218 STRING:O35218 PhosphoSite:O35218 PaxDb:O35218
PRIDE:O35218 Ensembl:ENSMUST00000047357 GeneID:51786 KEGG:mmu:51786
UCSC:uc007otx.2 InParanoid:O35218 NextBio:308008 Bgee:O35218
CleanEx:MM_CPSF2 Genevestigator:O35218
GermOnline:ENSMUSG00000041781 Uniprot:O35218
Length = 782
Score = 1048 (374.0 bits), Expect = 9.5e-120, Sum P(2) = 9.5e-120
Identities = 219/537 (40%), Positives = 327/537 (60%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++T LSGV E+ L YL+ +D F FL+DCGW++HF ++ L K IDAVLL
Sbjct: 1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSVDIIDSLRKHVHQIDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
SHPD LHLGALP+A+ +LGL+ +++T PVY++G + MYD Y SR +F LFTLDD+D
Sbjct: 61 SHPDPLHLGALPFAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120
Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED-VIYAVDYNRRK 179
+AF + +L +SQ +L GKG G+ + P AGH++GGT+WKI KDGE+ ++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180
Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
E HLNG LE RP++LITD++NA + QP R+QR E + +TLR GNVL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240
Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
GRVLEL +L+ W +Y L VS + +++ KS +EWM D + + FE R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
N F +H++L S+L P PK+VLAS LE GFS D+F++W D KN ++ T R
Sbjct: 301 NPFQFRHLSLCHGLSDLARVPS-PKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
Query: 356 QFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 415
GTLAR L +P K ++ + +RV L G+EL Y E++ +LKKE A K KE +
Sbjct: 360 TPGTLARFLIDNPTEKVTEIELRKRVKLEGKELEEYVEKE-KLKKEAAKKLEQSKEADID 418
Query: 416 ASLGPDNNLSGDPMVIDXXXXXXSADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENN 475
+S ++++ D V D++ G + F + PMFP E
Sbjct: 419 SS--DESDVEED--VDQPSAHKTKHDLMMKGEGSRKG----SFFKQAKKSYPMFPAPEER 470
Query: 476 SEWDDFGEVINPDDYIIKD---EDMDQAAMHIGGDDGKLDEGSASLILDAKPSKVVS 529
+WD++GE+I P+D+++ + + +++ + G +G +E + D P+K VS
Sbjct: 471 IKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNG--EEPMDQDLSDV-PTKCVS 524
Score = 151 (58.2 bits), Expect = 9.5e-120, Sum P(2) = 9.5e-120
Identities = 46/143 (32%), Positives = 71/143 (49%)
Query: 564 EETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAP 623
E +D SD A Q + K + K+LG+ + E+ T L LP P
Sbjct: 661 EMQVDAPSDSSAMAQQKAMKSLFGEDEKELGE------ETEIIPT----LEPLP-PHEVP 709
Query: 624 PHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIV 683
H+SV + + +++D K L +GIQ EF GG L C V +R+ + T +I
Sbjct: 710 GHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR----------TETGRIG 759
Query: 684 IEGPLCEDYYKIRAYLYSQFYLL 706
+EG LC+D+Y+IR LY Q+ ++
Sbjct: 760 LEGCLCQDFYRIRDLLYEQYAIV 782
Score = 142 (55.0 bits), Expect = 8.5e-119, Sum P(2) = 8.5e-119
Identities = 37/115 (32%), Positives = 64/115 (55%)
Query: 508 DGKLDEGSASLILDA-KPSKVVSNELTVLVHGSAEATEHLKQHCL----KHVCPHVYTPQ 562
+G+ D S I++ KP +++ +VHG EA++ L + C K + VY P+
Sbjct: 541 EGRSDGDSIKKIINQMKPRQLI------IVHGPPEASQDLAECCRAFGGKDI--KVYMPK 592
Query: 563 IEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVGKTENGML 613
+ ET+D TS+ Y+V+L + L+S++ F K D E+AW+D V K + G++
Sbjct: 593 LHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDMRVSKVDTGVI 647
Score = 42 (19.8 bits), Expect = 2.8e-06, Sum P(2) = 2.8e-06
Identities = 35/135 (25%), Positives = 50/135 (37%)
Query: 386 EELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDXXXXXXSADVVEP 445
EE I ++E +K E+ L L EE K+ L +PM D +DV
Sbjct: 468 EERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNGEEPMDQDL------SDVPTK 521
Query: 446 HGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVIN---PDDYII------KDED 496
I I V T + YE S+ D ++IN P II +D
Sbjct: 522 CVSATESIEIKARV---TYID-----YEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQD 573
Query: 497 MDQAAMHIGGDDGKL 511
+ + GG D K+
Sbjct: 574 LAECCRAFGGKDIKV 588
>WB|WBGene00017313
symbol:cpsf-2 species:6239 "Caenorhabditis elegans"
[GO:0016787 "hydrolase activity" evidence=IEA] [GO:0009792 "embryo
development ending in birth or egg hatching" evidence=IMP]
[GO:0051301 "cell division" evidence=IMP] [GO:0000910 "cytokinesis"
evidence=IMP] [GO:0000003 "reproduction" evidence=IMP] [GO:0040035
"hermaphrodite genitalia development" evidence=IMP] [GO:0016246
"RNA interference" evidence=IMP] [GO:0040027 "negative regulation
of vulval development" evidence=IMP] InterPro:IPR001279
InterPro:IPR027075 SMART:SM00849 Pfam:PF07521 GO:GO:0005634
GO:GO:0009792 GO:GO:0016246 GO:GO:0006397 GO:GO:0003723
GO:GO:0016787 GO:GO:0000910 GO:GO:0040035 GO:GO:0040027
eggNOG:COG1236 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
SMART:SM01027 GeneTree:ENSGT00700000104551 HOGENOM:HOG000264343
OMA:NNPFQFK EMBL:FO080529 PIR:T32487 RefSeq:NP_504822.1
ProteinModelPortal:O17403 SMR:O17403 STRING:O17403 PaxDb:O17403
EnsemblMetazoa:F09G2.4 GeneID:179103 KEGG:cel:CELE_F09G2.4
CTD:179103 WormBase:F09G2.4 InParanoid:O17403 NextBio:903938
Uniprot:O17403
Length = 843
Score = 768 (275.4 bits), Expect = 1.8e-94, Sum P(3) = 1.8e-94
Identities = 169/448 (37%), Positives = 264/448 (58%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++ SG +E PL YL+ +DG L+DCGW++ F + L I AVL+
Sbjct: 1 MTSIIKLKVFSGAKDEGPLCYLLQVDGDYILLDCGWDERFGLQYFEELKPFIPKISAVLI 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
SHPD LHLG LPY + + GL+APV++T PVY++G + +YD S V EF+ +TLDD+D
Sbjct: 61 SHPDPLHLGGLPYLVSKCGLTAPVYATVPVYKMGQMFIYDMVYSHLDVEEFEHYTLDDVD 120
Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK-DGEDVIYAVDYNRRK 179
+AF+ V ++ Y+Q L G G+ AGH+LGG++W+I + GED++Y VD+N +K
Sbjct: 121 TAFEKVEQVKYNQTVVLKGDS-GVHFTALPAGHMLGGSIWRICRVTGEDIVYCVDFNHKK 179
Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
E+HLNG ++F RP +LIT A++ Q R+ R E I +T+R G+ ++ +D+A
Sbjct: 180 ERHLNGCSFDNFNRPHLLITGAHHISLPQMRRKDRDEQLVTKILRTVRQKGDCMIVIDTA 239
Query: 239 GRVLELLLILEDYW--AEHSLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS-R 294
GRVLEL +L+ W A+ L+ Y + +++V+SS + + KS LEWM + + K +S R
Sbjct: 240 GRVLELAHLLDQLWSNADAGLSTYNLVMMSHVASSVVQFAKSQLEWMNEKLFKYDSSSAR 299
Query: 295 DNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
N F LKHVTL + EL PK+VL S +E+GFS ++F++W SD +N V+ T R
Sbjct: 300 YNPFTLKHVTLCHSHQELMRVRS-PKVVLCSSQDMESGFSRELFLDWCSDPRNGVILTAR 358
Query: 355 GQFGTLARML-----QADP-----PPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEA- 403
TLA L +A+ + + + + +RV L GEEL+ Y+ + EE
Sbjct: 359 PASFTLAAKLVNMAERANDGVLKHEDRLISLVVKKRVALEGEELLEYKRRKAERDAEETR 418
Query: 404 LKASLVKEE-ESKASLGPDNNLSGDPMV 430
L+ + + ++ S D++ P+V
Sbjct: 419 LRMERARRQAQANESDDSDDDDIAAPIV 446
Score = 127 (49.8 bits), Expect = 9.7e-16, Sum P(3) = 9.7e-16
Identities = 37/136 (27%), Positives = 62/136 (45%)
Query: 371 KAVKVTMSRRVPLVGEELIAYEE-------EQTRLKKEEALKASLVKEEESKASLGPDNN 423
+ + + + +RV L GEEL+ Y+ E+TRL+ E A + + E + D++
Sbjct: 385 RLISLVVKKRVALEGEELLEYKRRKAERDAEETRLRMERARRQAQANESDDS----DDDD 440
Query: 424 LSGDPMVIDXXXXXXSADVVEPHGGRYRDILID-------GFVPPSTSVAPMFPFYENNS 476
++ + S D E + DI+ F + PMFP+ E
Sbjct: 441 IAAPIVPRHSEKDFRSFDGSENDAHTF-DIMAKWDNQQKASFFKTTKKSFPMFPYIEEKV 499
Query: 477 EWDDFGEVINPDDYII 492
+WDD+GEVI P+DY +
Sbjct: 500 KWDDYGEVIKPEDYTV 515
Score = 117 (46.2 bits), Expect = 1.8e-94, Sum P(3) = 1.8e-94
Identities = 37/103 (35%), Positives = 51/103 (49%)
Query: 606 GKTENGMLSLLPISTPAPP-HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRC-GEYVT 663
GK G L L P+ P H++V V D K++D K L+ KG + EF G L G +
Sbjct: 752 GKIR-GNLILDPLPKRLIPIHQAVFVNDPKLSDFKNLLTDKGYKAEFLSGTLLINGGNCS 810
Query: 664 IRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
IR+ +G Q+ EG +DYYK+R Y QF +L
Sbjct: 811 IRR--------NDTGVFQM--EGAFTKDYYKLRRLFYDQFAVL 843
Score = 88 (36.0 bits), Expect = 1.8e-94, Sum P(3) = 1.8e-94
Identities = 23/90 (25%), Positives = 48/90 (53%)
Query: 534 VLVHGSAEATEHLKQHCLKHV--CPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFK 591
++VHGS + T L + + P+ +D + + Y+V LS+ L++++ FK
Sbjct: 596 IVVHGSRDDTRDLVAYFADSGFDTTMLKAPEAGALVDASVESFIYQVALSDALLADIQFK 655
Query: 592 KLGD-YEIAWVDAEVGKTENGMLSLLPIST 620
++ + +AW+DA V + E + ++L + T
Sbjct: 656 EVSEGNSLAWIDARVMEKE-AIDNMLAVGT 684
>UNIPROTKB|O17403
symbol:cpsf-2 "Probable cleavage and polyadenylation
specificity factor subunit 2" species:6239 "Caenorhabditis elegans"
[GO:0005575 "cellular_component" evidence=ND] [GO:0003674
"molecular_function" evidence=ND] InterPro:IPR001279
InterPro:IPR027075 SMART:SM00849 Pfam:PF07521 GO:GO:0005634
GO:GO:0009792 GO:GO:0016246 GO:GO:0006397 GO:GO:0003723
GO:GO:0016787 GO:GO:0000910 GO:GO:0040035 GO:GO:0040027
eggNOG:COG1236 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
SMART:SM01027 GeneTree:ENSGT00700000104551 HOGENOM:HOG000264343
OMA:NNPFQFK EMBL:FO080529 PIR:T32487 RefSeq:NP_504822.1
ProteinModelPortal:O17403 SMR:O17403 STRING:O17403 PaxDb:O17403
EnsemblMetazoa:F09G2.4 GeneID:179103 KEGG:cel:CELE_F09G2.4
CTD:179103 WormBase:F09G2.4 InParanoid:O17403 NextBio:903938
Uniprot:O17403
Length = 843
Score = 768 (275.4 bits), Expect = 1.8e-94, Sum P(3) = 1.8e-94
Identities = 169/448 (37%), Positives = 264/448 (58%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++ SG +E PL YL+ +DG L+DCGW++ F + L I AVL+
Sbjct: 1 MTSIIKLKVFSGAKDEGPLCYLLQVDGDYILLDCGWDERFGLQYFEELKPFIPKISAVLI 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
SHPD LHLG LPY + + GL+APV++T PVY++G + +YD S V EF+ +TLDD+D
Sbjct: 61 SHPDPLHLGGLPYLVSKCGLTAPVYATVPVYKMGQMFIYDMVYSHLDVEEFEHYTLDDVD 120
Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK-DGEDVIYAVDYNRRK 179
+AF+ V ++ Y+Q L G G+ AGH+LGG++W+I + GED++Y VD+N +K
Sbjct: 121 TAFEKVEQVKYNQTVVLKGDS-GVHFTALPAGHMLGGSIWRICRVTGEDIVYCVDFNHKK 179
Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
E+HLNG ++F RP +LIT A++ Q R+ R E I +T+R G+ ++ +D+A
Sbjct: 180 ERHLNGCSFDNFNRPHLLITGAHHISLPQMRRKDRDEQLVTKILRTVRQKGDCMIVIDTA 239
Query: 239 GRVLELLLILEDYW--AEHSLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS-R 294
GRVLEL +L+ W A+ L+ Y + +++V+SS + + KS LEWM + + K +S R
Sbjct: 240 GRVLELAHLLDQLWSNADAGLSTYNLVMMSHVASSVVQFAKSQLEWMNEKLFKYDSSSAR 299
Query: 295 DNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
N F LKHVTL + EL PK+VL S +E+GFS ++F++W SD +N V+ T R
Sbjct: 300 YNPFTLKHVTLCHSHQELMRVRS-PKVVLCSSQDMESGFSRELFLDWCSDPRNGVILTAR 358
Query: 355 GQFGTLARML-----QADP-----PPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEA- 403
TLA L +A+ + + + + +RV L GEEL+ Y+ + EE
Sbjct: 359 PASFTLAAKLVNMAERANDGVLKHEDRLISLVVKKRVALEGEELLEYKRRKAERDAEETR 418
Query: 404 LKASLVKEE-ESKASLGPDNNLSGDPMV 430
L+ + + ++ S D++ P+V
Sbjct: 419 LRMERARRQAQANESDDSDDDDIAAPIV 446
Score = 127 (49.8 bits), Expect = 9.7e-16, Sum P(3) = 9.7e-16
Identities = 37/136 (27%), Positives = 62/136 (45%)
Query: 371 KAVKVTMSRRVPLVGEELIAYEE-------EQTRLKKEEALKASLVKEEESKASLGPDNN 423
+ + + + +RV L GEEL+ Y+ E+TRL+ E A + + E + D++
Sbjct: 385 RLISLVVKKRVALEGEELLEYKRRKAERDAEETRLRMERARRQAQANESDDS----DDDD 440
Query: 424 LSGDPMVIDXXXXXXSADVVEPHGGRYRDILID-------GFVPPSTSVAPMFPFYENNS 476
++ + S D E + DI+ F + PMFP+ E
Sbjct: 441 IAAPIVPRHSEKDFRSFDGSENDAHTF-DIMAKWDNQQKASFFKTTKKSFPMFPYIEEKV 499
Query: 477 EWDDFGEVINPDDYII 492
+WDD+GEVI P+DY +
Sbjct: 500 KWDDYGEVIKPEDYTV 515
Score = 117 (46.2 bits), Expect = 1.8e-94, Sum P(3) = 1.8e-94
Identities = 37/103 (35%), Positives = 51/103 (49%)
Query: 606 GKTENGMLSLLPISTPAPP-HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRC-GEYVT 663
GK G L L P+ P H++V V D K++D K L+ KG + EF G L G +
Sbjct: 752 GKIR-GNLILDPLPKRLIPIHQAVFVNDPKLSDFKNLLTDKGYKAEFLSGTLLINGGNCS 810
Query: 664 IRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
IR+ +G Q+ EG +DYYK+R Y QF +L
Sbjct: 811 IRR--------NDTGVFQM--EGAFTKDYYKLRRLFYDQFAVL 843
Score = 88 (36.0 bits), Expect = 1.8e-94, Sum P(3) = 1.8e-94
Identities = 23/90 (25%), Positives = 48/90 (53%)
Query: 534 VLVHGSAEATEHLKQHCLKHV--CPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFK 591
++VHGS + T L + + P+ +D + + Y+V LS+ L++++ FK
Sbjct: 596 IVVHGSRDDTRDLVAYFADSGFDTTMLKAPEAGALVDASVESFIYQVALSDALLADIQFK 655
Query: 592 KLGD-YEIAWVDAEVGKTENGMLSLLPIST 620
++ + +AW+DA V + E + ++L + T
Sbjct: 656 EVSEGNSLAWIDARVMEKE-AIDNMLAVGT 684
>UNIPROTKB|F1SD85
symbol:CPSF2 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0006379
"mRNA cleavage" evidence=IEA] [GO:0006378 "mRNA polyadenylation"
evidence=IEA] [GO:0005847 "mRNA cleavage and polyadenylation
specificity factor complex" evidence=IEA] InterPro:IPR001279
InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 GO:GO:0016787
InterPro:IPR022712 PANTHER:PTHR11203:SF5 Pfam:PF10996 SMART:SM01027
GeneTree:ENSGT00700000104551 EMBL:CU468363
Ensembl:ENSSSCT00000002717 OMA:GANDESP Uniprot:F1SD85
Length = 385
Score = 928 (331.7 bits), Expect = 3.4e-93, P = 3.4e-93
Identities = 178/383 (46%), Positives = 253/383 (66%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++T LSGV E+ L YL+ +D F FL+DCGW++HF ++ L K IDAVLL
Sbjct: 1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDID 120
SHPD LHLGALPYA+ +LGL+ +++T PVY++G + MYD Y SR +F LFTLDD+D
Sbjct: 61 SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLFTLDDVD 120
Query: 121 SAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED-VIYAVDYNRRK 179
+AF + +L +SQ +L GKG G+ + P AGH++GGT+WKI KDGE+ ++YAVD+N ++
Sbjct: 121 AAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180
Query: 180 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSA 238
E HLNG LE RP++LITD++NA + QP R+QR E + +TLR G+VL+ VD+A
Sbjct: 181 EIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGSVLIAVDTA 240
Query: 239 GRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
GRVLEL +L+ W +Y L VS + +++ KS +EWM D + + FE R+
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
N F +H++L S+L P PK+VLAS LE GFS D+F++W D KN ++ T R
Sbjct: 301 NPFQFRHLSLCHGLSDLARVPS-PKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRT 359
Query: 356 QFGTLARMLQADPPPKAVKVTMS 378
GTLAR L +P K ++ +S
Sbjct: 360 TPGTLARFLIDNPSEKITEIEVS 382
>POMBASE|SPBC1709.15c
symbol:cft2 "cleavage factor two Cft2/polyadenylation
factor CPSF-73 (predicted)" species:4896 "Schizosaccharomyces
pombe" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
"nucleus" evidence=IDA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IDA]
[GO:0005849 "mRNA cleavage factor complex" evidence=ISO]
[GO:0006378 "mRNA polyadenylation" evidence=ISO] [GO:0006379 "mRNA
cleavage" evidence=ISO] InterPro:IPR027075 PomBase:SPBC1709.15c
Pfam:PF07521 EMBL:CU329671 GO:GO:0006378 GenomeReviews:CU329671_GR
GO:GO:0005847 GO:GO:0006379 PIR:T39643 RefSeq:NP_595448.1
ProteinModelPortal:O74740 STRING:O74740 EnsemblFungi:SPBC1709.15c.1
GeneID:2539954 KEGG:spo:SPBC1709.15c eggNOG:COG1236 KO:K14402
OMA:ISSIATP OrthoDB:EOG4WWVSN NextBio:20801097 InterPro:IPR022712
InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
Pfam:PF10996 Pfam:PF13299 SMART:SM01027 Uniprot:O74740
Length = 797
Score = 600 (216.3 bits), Expect = 7.1e-78, Sum P(3) = 7.1e-78
Identities = 137/346 (39%), Positives = 207/346 (59%)
Query: 23 VSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAM-KQLGLS 81
+ +DG + ID G +D SL P +V D +LLSH D H+G L YA K +
Sbjct: 18 IELDGIHIYIDPGSDD----SLKHP--EVPEQPDLILLSHSDLAHIGGLVYAYYKYDWKN 71
Query: 82 APVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKG 141
A +++T P +G +TM D S +S+ + D+D+ F S+ L Y Q L GK
Sbjct: 72 AYIYATLPTINMGRMTMLDAIKSN-YISDM---SKADVDAVFDSIIPLRYQQPTLLLGKC 127
Query: 142 EGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT-------VLESFVRP 194
G+ + + AGH LGGT+W + K+ E V+YAVD+N K+KHLNG +LE+ RP
Sbjct: 128 SGLTITAYNAGHTLGGTLWSLIKESESVLYAVDWNHSKDKHLNGAALYSNGHILEALNRP 187
Query: 195 AVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 253
LITDA N+L + P R++R E F +++ +L GG VLLPVD+A RVLEL IL+++W+
Sbjct: 188 NTLITDANNSLVSIPSRKKRDEAFIESVMSSLLKGGTVLLPVDAASRVLELCCILDNHWS 247
Query: 254 --EHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSE 311
+ L +PI FL+ S+ TIDY KS +EWMGD+I + F + +N +++ + + S+
Sbjct: 248 ASQPPLPFPILFLSPTSTKTIDYAKSMIEWMGDNIVRDFGIN-ENLLEFRNINTITDFSQ 306
Query: 312 LDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKN-LVLFTERGQ 356
+ + GPK++LA+ +LE GFS I ++ S+ N L+LFT+R +
Sbjct: 307 ISHIGPGPKVILATALTLECGFSQRILLDLMSENSNDLILFTQRSR 352
Score = 116 (45.9 bits), Expect = 7.1e-78, Sum P(3) = 7.1e-78
Identities = 41/137 (29%), Positives = 69/137 (50%)
Query: 473 ENNSEWDDFGEVINPDDYIIKDEDMDQAAMHIGGDD--GKLDEGSASLILDAKPSKVVSN 530
+ E +D EV P II DE + + + D G D S I+ P V+
Sbjct: 544 QQKKEEEDEDEV--PSK-IITDEKTIRVSCQVQFIDIEGLHDGRSLKTII---PQ--VNP 595
Query: 531 ELTVLVHGSAEATEHLKQHC--LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNV 588
VL+H S E E +K+ C L VY P E I+V+ D+ A+ ++L++ L+ N+
Sbjct: 596 RRLVLIHASTEEKEDMKKTCASLSAFTKDVYIPNYGEIINVSIDVNAFSLKLADDLIKNL 655
Query: 589 LFKKLGDYEIAWVDAEV 605
++ K+G+ E++ + A+V
Sbjct: 656 IWTKVGNCEVSHMLAKV 672
Score = 99 (39.9 bits), Expect = 7.1e-78, Sum P(3) = 7.1e-78
Identities = 28/80 (35%), Positives = 43/80 (53%)
Query: 622 APPHKSVLVGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKVGPAGQKGGGSGTQ 680
AP +LVG++++A L+ L +GI E G G L CG V +RK+ GG
Sbjct: 722 APRAAPLLVGNIRLAYLRKALLDQGISAELKGEGVLLCGGAVAVRKLS------GG---- 771
Query: 681 QIVIEGPLCEDYYKIRAYLY 700
+I +EG L +++IR +Y
Sbjct: 772 KISVEGSLSNRFFEIRKLVY 791
Score = 97 (39.2 bits), Expect = 7.0e-76, Sum P(3) = 7.0e-76
Identities = 41/153 (26%), Positives = 73/153 (47%)
Query: 371 KAVKVTMSRRVPLVGEELIAYEE-EQTRLKKEE---ALK---ASLVKEEESKASLGPDNN 423
+AVK+ + PL GEEL +Y+E E ++ K+ AL+ +++ E+ S +S D++
Sbjct: 386 QAVKI--KTKEPLEGEELRSYQELEFSKRNKDAEDTALEFRNRTILDEDLSSSSSSEDDD 443
Query: 424 LSGDPMVIDXXXXXXSADVVEPHGGRYRDI-LIDGFVPPSTSVAPMFPFYENNSEWDDFG 482
L + V SA ++ G+ D+ L D V + MFP+ E D++G
Sbjct: 444 LDLNTEV-PHVALGSSAFLM----GKSFDLNLRDPAVQALHTKYKMFPYIEKRRRIDEYG 498
Query: 483 EVINPDDYIIKDEDMDQAAMHIGGDDGKLDEGS 515
E+I D+ + +E + + DD L +
Sbjct: 499 EIIKHQDFSMINEPANTLELENDSDDNALSNSN 531
>MGI|MGI:1919207
symbol:Cpsf3l "cleavage and polyadenylation specific factor
3-like" species:10090 "Mus musculus" [GO:0003674
"molecular_function" evidence=ND] [GO:0005634 "nucleus"
evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0008150
"biological_process" evidence=ND] [GO:0016787 "hydrolase activity"
evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 MGI:MGI:1919207 GO:GO:0005737 GO:GO:0016787
eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 HOGENOM:HOG000231294 KO:K13148 OMA:MAVEYMS
GeneTree:ENSGT00700000104485 GO:GO:0032039 GO:GO:0016180
HOVERGEN:HBG080215 OrthoDB:EOG4GXFMB CTD:54973 EMBL:AK010425
EMBL:AK090206 EMBL:AK150436 EMBL:AK152740 EMBL:AK167607
EMBL:AK172533 EMBL:BC008240 EMBL:BC011155 IPI:IPI00467084
RefSeq:NP_082296.1 UniGene:Mm.259270 UniGene:Mm.475640
ProteinModelPortal:Q9CWS4 SMR:Q9CWS4 STRING:Q9CWS4
PhosphoSite:Q9CWS4 PaxDb:Q9CWS4 PRIDE:Q9CWS4
Ensembl:ENSMUST00000030901 GeneID:71957 KEGG:mmu:71957
InParanoid:Q9CWS4 NextBio:335052 Bgee:Q9CWS4 Genevestigator:Q9CWS4
GermOnline:ENSMUSG00000029034 Uniprot:Q9CWS4
Length = 600
Score = 438 (159.2 bits), Expect = 1.4e-42, Sum P(2) = 1.4e-42
Identities = 113/355 (31%), Positives = 181/355 (50%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG +ND F D S + ++ +D
Sbjct: 4 IRVTPLGAGQDVGRSCILVSISGKNVMLDCGMHMGYNDDRRFPDFSYITQSGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
V++SH H GALPY + +G P++ T P + + + D + ++ + E + FT
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123
Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
I + V + Q + + E + + AGH+LG +++I E V+Y DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180
Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
++HL ++ RP +LIT++ A + ++ RE F + +T+ GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239
Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
+ GR EL ++LE +W +L PIYF T ++ Y K F+ W I K+F R
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFITWTNQKIRKTF-VQR- 297
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
N F KH+ +++ DN P GP +V A+ L AG S IF +WA + KN+V+
Sbjct: 298 NMFEFKHIKAF-DRTFADN-P-GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVI 349
Score = 53 (23.7 bits), Expect = 1.4e-42, Sum P(2) = 1.4e-42
Identities = 17/55 (30%), Positives = 25/55 (45%)
Query: 513 EGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETI 567
+G L+ A+P V+ LVHG A+ E L+Q + Y P ET+
Sbjct: 396 KGIMQLVGQAEPESVL------LVHGEAKKMEFLRQKIEQEFRVSCYMPANGETV 444
Score = 40 (19.1 bits), Expect = 3.3e-41, Sum P(2) = 3.3e-41
Identities = 8/24 (33%), Positives = 15/24 (62%)
Query: 531 ELTVLVHGSAEATEHLKQHCLKHV 554
E + V+ ++T LK HC++H+
Sbjct: 525 ETALRVYSHLKST--LKDHCVQHL 546
>RGD|1306841
symbol:Cpsf3l "cleavage and polyadenylation specific factor
3-like" species:10116 "Rattus norvegicus" [GO:0005634 "nucleus"
evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0016787
"hydrolase activity" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 RGD:1306841 GO:GO:0005634 GO:GO:0005737
GO:GO:0016787 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
Pfam:PF10996 SMART:SM01027 HOGENOM:HOG000231294 KO:K13148
OMA:MAVEYMS GeneTree:ENSGT00700000104485 HOVERGEN:HBG080215
OrthoDB:EOG4GXFMB CTD:54973 EMBL:BC105303 IPI:IPI00365477
RefSeq:NP_001029064.1 UniGene:Rn.98615 ProteinModelPortal:Q3MHC2
STRING:Q3MHC2 Ensembl:ENSRNOT00000026725 GeneID:298688
KEGG:rno:298688 InParanoid:Q3MHC2 NextBio:644186
Genevestigator:Q3MHC2 GermOnline:ENSRNOG00000019712 Uniprot:Q3MHC2
Length = 600
Score = 438 (159.2 bits), Expect = 1.4e-42, Sum P(2) = 1.4e-42
Identities = 113/355 (31%), Positives = 181/355 (50%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG +ND F D S + ++ +D
Sbjct: 4 IRVTPLGAGQDVGRSCILVSISGKNVMLDCGMHMGYNDDRRFPDFSYITQSGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
V++SH H GALPY + +G P++ T P + + + D + ++ + E + FT
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123
Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
I + V + Q + + E + + AGH+LG +++I E V+Y DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180
Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
++HL ++ RP +LIT++ A + ++ RE F + +T+ GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239
Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
+ GR EL ++LE +W +L PIYF T ++ Y K F+ W I K+F R
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFITWTNQKIRKTF-VQR- 297
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
N F KH+ +++ DN P GP +V A+ L AG S IF +WA + KN+V+
Sbjct: 298 NMFEFKHIKAF-DRTFADN-P-GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVI 349
Score = 53 (23.7 bits), Expect = 1.4e-42, Sum P(2) = 1.4e-42
Identities = 17/55 (30%), Positives = 25/55 (45%)
Query: 513 EGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETI 567
+G L+ A+P V+ LVHG A+ E L+Q + Y P ET+
Sbjct: 396 KGIMQLVGQAEPESVL------LVHGEAKKMEFLRQKIEQEFRVSCYMPANGETV 444
Score = 40 (19.1 bits), Expect = 3.3e-41, Sum P(2) = 3.3e-41
Identities = 8/24 (33%), Positives = 15/24 (62%)
Query: 531 ELTVLVHGSAEATEHLKQHCLKHV 554
E + V+ ++T LK HC++H+
Sbjct: 525 ETALRVYSHLKST--LKDHCVQHL 546
>UNIPROTKB|Q5TA45
symbol:CPSF3L "Integrator complex subunit 11" species:9606
"Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] [GO:0005737 "cytoplasm"
evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 GO:GO:0005634 GO:GO:0005737 GO:GO:0016787
EMBL:AL139287 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
Pfam:PF10996 SMART:SM01027 KO:K13148 HOVERGEN:HBG080215
OrthoDB:EOG4GXFMB CTD:54973 EMBL:AL136813 EMBL:AK000549
EMBL:AK021939 EMBL:AK023356 EMBL:AK297350 EMBL:CR533557
EMBL:BC000675 EMBL:BC007978 EMBL:BC013904 EMBL:BK005728
EMBL:BK005673 IPI:IPI00063404 IPI:IPI00306882 IPI:IPI00514973
RefSeq:NP_001243392.1 RefSeq:NP_060341.2 UniGene:Hs.6449
ProteinModelPortal:Q5TA45 SMR:Q5TA45 IntAct:Q5TA45
MINT:MINT-1482228 STRING:Q5TA45 PhosphoSite:Q5TA45 DMDM:118572557
PaxDb:Q5TA45 PRIDE:Q5TA45 DNASU:54973 Ensembl:ENST00000419704
Ensembl:ENST00000435064 Ensembl:ENST00000450926
Ensembl:ENST00000545578 GeneID:54973 KEGG:hsa:54973 UCSC:uc001aee.1
UCSC:uc001aeh.1 UCSC:uc009vjz.1 GeneCards:GC01M001236
HGNC:HGNC:26052 HPA:HPA028379 HPA:HPA029025 MIM:611354
neXtProt:NX_Q5TA45 PharmGKB:PA142672080 InParanoid:Q5TA45
PhylomeDB:Q5TA45 ChiTaRS:CPSF3L GenomeRNAi:54973 NextBio:58222
ArrayExpress:Q5TA45 Bgee:Q5TA45 Genevestigator:Q5TA45
GermOnline:ENSG00000127054 Uniprot:Q5TA45
Length = 600
Score = 434 (157.8 bits), Expect = 2.5e-42, Sum P(2) = 2.5e-42
Identities = 113/355 (31%), Positives = 181/355 (50%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG +ND F D S + ++ +D
Sbjct: 4 IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
V++SH H GALPY + +G P++ T P + + + D + ++ + E + FT
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123
Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
I + V + Q + + E + + AGH+LG +++I E V+Y DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180
Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
++HL ++ RP +LIT++ A + ++ RE F + +T+ GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239
Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
+ GR EL ++LE +W +L PIYF T ++ Y K F+ W I K+F R
Sbjct: 240 FALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF-VQR- 297
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
N F KH+ +++ DN P GP +V A+ L AG S IF +WA + KN+V+
Sbjct: 298 NMFEFKHIKAF-DRAFADN-P-GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVI 349
Score = 60 (26.2 bits), Expect = 2.5e-42, Sum P(2) = 2.5e-42
Identities = 18/55 (32%), Positives = 27/55 (49%)
Query: 513 EGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETI 567
+G L+ A+P V+ LVHG A+ E LKQ + + + Y P ET+
Sbjct: 396 KGIMQLVGQAEPESVL------LVHGEAKKMEFLKQKIEQELRVNCYMPANGETV 444
Score = 37 (18.1 bits), Expect = 6.6e-40, Sum P(2) = 6.6e-40
Identities = 5/9 (55%), Positives = 8/9 (88%)
Query: 546 LKQHCLKHV 554
LK HC++H+
Sbjct: 538 LKDHCVQHL 546
>UNIPROTKB|F1NV30
symbol:CPSF3L "Integrator complex subunit 11" species:9031
"Gallus gallus" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 OMA:MAVEYMS GeneTree:ENSGT00700000104485
IPI:IPI00571913 EMBL:AADN02040858 Ensembl:ENSGALT00000002586
Uniprot:F1NV30
Length = 600
Score = 432 (157.1 bits), Expect = 8.9e-42, Sum P(2) = 8.9e-42
Identities = 113/355 (31%), Positives = 181/355 (50%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG +ND F D S + ++ +D
Sbjct: 4 IKVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGYNDDRRFPDFSYITQNGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
V++SH H GALPY + +G P++ T P + + + D + ++ + E + FT
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTKAICPILLEDYRKITVDKKGETNFFTS 123
Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
I + V + Q + E + + + AGH+LG +++I E V+Y DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVD---EELEIKAYYAGHVLGAAMFQIKVGCESVVYTGDYN 180
Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
++HL ++ RP +LIT++ A + ++ RE F + +T+ GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239
Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
+ GR EL ++LE +W +L PIYF T ++ Y K F+ W I K+F R
Sbjct: 240 FALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF-VQR- 297
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
N F KH+ +++ DN P GP +V A+ L AG S IF +WA + KN+V+
Sbjct: 298 NMFEFKHIKAF-DRAFADN-P-GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVI 349
Score = 59 (25.8 bits), Expect = 8.9e-42, Sum P(2) = 8.9e-42
Identities = 19/57 (33%), Positives = 26/57 (45%)
Query: 513 EGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDV 569
+G LI A+P V+ LVHG A+ E LKQ + + Y P ET +
Sbjct: 396 KGIMQLIRQAEPRNVL------LVHGEAKKMEFLKQKIEQEFHVNCYMPANGETTSI 446
>UNIPROTKB|Q5ZIH0
symbol:CPSF3L "Integrator complex subunit 11" species:9031
"Gallus gallus" [GO:0016787 "hydrolase activity" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 GO:GO:0016787 eggNOG:COG1236
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
HOGENOM:HOG000231294 KO:K13148 GO:GO:0032039 GO:GO:0016180
HOVERGEN:HBG080215 OrthoDB:EOG4GXFMB EMBL:AJ720814 IPI:IPI00571913
RefSeq:NP_001012854.1 UniGene:Gga.13403 ProteinModelPortal:Q5ZIH0
STRING:Q5ZIH0 GeneID:419418 KEGG:gga:419418 CTD:54973
InParanoid:Q5ZIH0 NextBio:20822477 Uniprot:Q5ZIH0
Length = 600
Score = 432 (157.1 bits), Expect = 1.1e-41, Sum P(2) = 1.1e-41
Identities = 113/355 (31%), Positives = 181/355 (50%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG +ND F D S + ++ +D
Sbjct: 4 IKVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGYNDDRRFPDFSYITQNGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
V++SH H GALPY + +G P++ T P + + + D + ++ + E + FT
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTKAICPILLEDYRKITVDKKGETNFFTS 123
Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
I + V + Q + E + + + AGH+LG +++I E V+Y DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVD---EELEIKAYYAGHVLGAAMFQIKVGCESVVYTGDYN 180
Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
++HL ++ RP +LIT++ A + ++ RE F + +T+ GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239
Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
+ GR EL ++LE +W +L PIYF T ++ Y K F+ W I K+F R
Sbjct: 240 FALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF-VQR- 297
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
N F KH+ +++ DN P GP +V A+ L AG S IF +WA + KN+V+
Sbjct: 298 NMFEFKHIKAF-DRAFADN-P-GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVI 349
Score = 58 (25.5 bits), Expect = 1.1e-41, Sum P(2) = 1.1e-41
Identities = 19/57 (33%), Positives = 26/57 (45%)
Query: 513 EGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDV 569
+G LI A+P V+ LVHG A+ E LKQ + + Y P ET +
Sbjct: 396 KGIMQLIRQAEPRNVL------LVHGEAKKMEFLKQKIEQEFHVNCYMPANGETTTI 446
>UNIPROTKB|E1B7Q9
symbol:CPSF3L "Integrator complex subunit 11" species:9913
"Bos taurus" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 OMA:MAVEYMS GeneTree:ENSGT00700000104485
EMBL:DAAA02043243 IPI:IPI00971575 Ensembl:ENSBTAT00000010020
Uniprot:E1B7Q9
Length = 598
Score = 428 (155.7 bits), Expect = 4.8e-41, Sum P(2) = 4.8e-41
Identities = 110/354 (31%), Positives = 176/354 (49%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG + F D S + ++ +D
Sbjct: 4 IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFSDDRRFPDFSYITRSGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLD 117
V++SH H GALPY + +G P++ T+P + + + D E + FT
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTQPTQAICPILLEDYRKIAVDKGEANFFTSQ 123
Query: 118 DIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNR 177
I + V + Q + + E + + AGH+LG +++I E V+Y DYN
Sbjct: 124 MIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYNM 180
Query: 178 RKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVD 236
++HL ++ RP++LIT++ A + ++ RE F + +T+ GG VL+PV
Sbjct: 181 TPDRHLGAAWIDK-CRPSLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVF 239
Query: 237 SAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN 296
+ GR EL ++LE +W L PIYF T ++ Y K F+ W I K+F R N
Sbjct: 240 ALGRAQELCILLETFWERMDLKAPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF-VQR-N 297
Query: 297 AFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
F KH+ +++ D+ P GP +V A+ L AG S IF +WA + KN+V+
Sbjct: 298 MFEFKHIKAF-DRAFADS-P-GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVI 348
Score = 58 (25.5 bits), Expect = 4.8e-41, Sum P(2) = 4.8e-41
Identities = 18/55 (32%), Positives = 26/55 (47%)
Query: 513 EGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETI 567
+G L+ A+P V+ LVHG A+ E LKQ + + Y P ET+
Sbjct: 395 KGIMQLVGQAEPENVL------LVHGEAKKMEFLKQKIEQEFRVNCYMPANGETV 443
Score = 38 (18.4 bits), Expect = 6.0e-39, Sum P(2) = 6.0e-39
Identities = 7/24 (29%), Positives = 15/24 (62%)
Query: 531 ELTVLVHGSAEATEHLKQHCLKHV 554
E+ + V+ ++ LK HC++H+
Sbjct: 524 EMAMRVYSHLKSV--LKDHCVQHL 545
>UNIPROTKB|E2QY53
symbol:CPSF3L "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 KO:K13148 OMA:MAVEYMS GeneTree:ENSGT00700000104485
EMBL:AAEX03003844 RefSeq:XP_003639102.1 Ensembl:ENSCAFT00000030626
GeneID:100855777 KEGG:cfa:100855777 Uniprot:E2QY53
Length = 600
Score = 427 (155.4 bits), Expect = 9.5e-41, Sum P(2) = 9.5e-41
Identities = 112/355 (31%), Positives = 180/355 (50%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG +ND F D S + ++ +D
Sbjct: 4 IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITRNGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
V++SH H GALPY + +G P++ T P + + + D + ++ + E + FT
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123
Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
I + V + Q + + E + + AGH+LG +++I E V+Y DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180
Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
++HL ++ RP +LIT++ A + ++ RE F + + + GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHEAVERGGKVLIPV 239
Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
+ GR EL ++LE +W +L PIYF T ++ Y K F+ W I K+F R
Sbjct: 240 FALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTF-VQR- 297
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
N F KH+ +++ DN P GP +V A+ L AG S IF +WA + KN+V+
Sbjct: 298 NMFEFKHIKAF-DRAFADN-P-GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVI 349
Score = 58 (25.5 bits), Expect = 9.5e-41, Sum P(2) = 9.5e-41
Identities = 18/55 (32%), Positives = 26/55 (47%)
Query: 513 EGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETI 567
+G L+ A+P V+ LVHG A+ E LKQ + + Y P ET+
Sbjct: 396 KGIMQLVGQAEPESVL------LVHGEAKKMEFLKQKIEQEFRVNCYMPANGETV 444
Score = 41 (19.5 bits), Expect = 5.8e-39, Sum P(2) = 5.8e-39
Identities = 8/24 (33%), Positives = 15/24 (62%)
Query: 531 ELTVLVHGSAEATEHLKQHCLKHV 554
E+ V V+ ++ LK HC++H+
Sbjct: 525 EMAVRVYSHLKSV--LKDHCVQHL 546
>UNIPROTKB|Q2YDM2
symbol:CPSF3L "Integrator complex subunit 11" species:9913
"Bos taurus" [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005634
"nucleus" evidence=IEA] [GO:0016787 "hydrolase activity"
evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 GO:GO:0005634 GO:GO:0005737 GO:GO:0016787
eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 HOGENOM:HOG000231294 EMBL:BC110155 IPI:IPI00725178
UniGene:Bt.4894 ProteinModelPortal:Q2YDM2 STRING:Q2YDM2
PRIDE:Q2YDM2 HOVERGEN:HBG080215 InParanoid:Q2YDM2 OrthoDB:EOG4GXFMB
Uniprot:Q2YDM2
Length = 599
Score = 423 (154.0 bits), Expect = 3.8e-40, Sum P(2) = 3.8e-40
Identities = 109/355 (30%), Positives = 178/355 (50%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQP-------LSKVASTIDA 57
++VTPL + LVSI G N ++DCG + F P ++ +D
Sbjct: 4 IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFSDDRRFPDFSYNTRSGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
V++SH H GALPY + +G P++ T+P + + + D + ++ + E + FT
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTQPTQAICPILLEDYRKIAVDKKGEANFFTS 123
Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
I + V + Q + + E + + AGH+LG +++I E V+Y DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180
Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
++HL ++ RP++LIT++ A + ++ RE F + +T+ GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPSLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239
Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
+ GR EL ++LE +W L PIYF T ++ Y K F+ W I K+F R
Sbjct: 240 FALGRAQELCILLETFWERMDLKAPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF-VQR- 297
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
N F KH+ +++ D+ P GP +V A+ L AG S IF +WA + KN+V+
Sbjct: 298 NMFEFKHIKAF-DRAFADS-P-GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVI 349
Score = 58 (25.5 bits), Expect = 3.8e-40, Sum P(2) = 3.8e-40
Identities = 18/55 (32%), Positives = 26/55 (47%)
Query: 513 EGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETI 567
+G L+ A+P V+ LVHG A+ E LKQ + + Y P ET+
Sbjct: 396 KGIMQLVGQAEPENVL------LVHGEAKKMEFLKQKIEQEFRVNCYMPANGETV 444
Score = 38 (18.4 bits), Expect = 4.8e-38, Sum P(2) = 4.8e-38
Identities = 7/24 (29%), Positives = 15/24 (62%)
Query: 531 ELTVLVHGSAEATEHLKQHCLKHV 554
E+ + V+ ++ LK HC++H+
Sbjct: 525 EMAMRVYSHLKSV--LKDHCVQHL 546
>UNIPROTKB|G3V1S5
symbol:CPSF3L "Integrator complex subunit 11" species:9606
"Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0016787 EMBL:AL139287 EMBL:CH471183 InterPro:IPR022712
InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 KO:K13148 OMA:MAVEYMS
CTD:54973 UniGene:Hs.6449 GeneID:54973 KEGG:hsa:54973
HGNC:HGNC:26052 ChiTaRS:CPSF3L GenomeRNAi:54973
RefSeq:NP_001243385.1 ProteinModelPortal:G3V1S5 SMR:G3V1S5
Ensembl:ENST00000540437 ArrayExpress:G3V1S5 Bgee:G3V1S5
Uniprot:G3V1S5
Length = 606
Score = 423 (154.0 bits), Expect = 3.9e-40, Sum P(2) = 3.9e-40
Identities = 109/338 (32%), Positives = 174/338 (51%)
Query: 22 LVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
LVSI G N ++DCG +ND F D S + ++ +D V++SH H GALPY
Sbjct: 27 LVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHFHLDHCGALPYF 86
Query: 75 MKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQ 133
+ +G P++ T P + + + D + ++ + E + FT I + V + Q
Sbjct: 87 SEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDCMKKVVAVHLHQ 146
Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 193
+ + E + + AGH+LG +++I E V+Y DYN ++HL ++ R
Sbjct: 147 TVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CR 202
Query: 194 PAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 252
P +LIT++ A + ++ RE F + +T+ GG VL+PV + GR EL ++LE +W
Sbjct: 203 PNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLETFW 262
Query: 253 AEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
+L PIYF T ++ Y K F+ W I K+F R N F KH+ +++
Sbjct: 263 ERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF-VQR-NMFEFKHIKAF-DRAFA 319
Query: 313 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
DN P GP +V A+ L AG S IF +WA + KN+V+
Sbjct: 320 DN-P-GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVI 355
Score = 60 (26.2 bits), Expect = 3.9e-40, Sum P(2) = 3.9e-40
Identities = 18/55 (32%), Positives = 27/55 (49%)
Query: 513 EGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETI 567
+G L+ A+P V+ LVHG A+ E LKQ + + + Y P ET+
Sbjct: 402 KGIMQLVGQAEPESVL------LVHGEAKKMEFLKQKIEQELRVNCYMPANGETV 450
Score = 37 (18.1 bits), Expect = 1.0e-37, Sum P(2) = 1.0e-37
Identities = 5/9 (55%), Positives = 8/9 (88%)
Query: 546 LKQHCLKHV 554
LK HC++H+
Sbjct: 544 LKDHCVQHL 552
>UNIPROTKB|F1RJE8
symbol:CPSF3L "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 KO:K13148 OMA:MAVEYMS GeneTree:ENSGT00700000104485
EMBL:FP102596 RefSeq:XP_003127541.3 Ensembl:ENSSSCT00000003708
GeneID:100523908 KEGG:ssc:100523908 Uniprot:F1RJE8
Length = 599
Score = 421 (153.3 bits), Expect = 1.7e-39, Sum P(2) = 1.7e-39
Identities = 109/355 (30%), Positives = 178/355 (50%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG + F D S + ++ +D
Sbjct: 4 IRVTPLGAGQDVGRSCILVSIGGKNVMLDCGMHMGFSDDRRFPDFSYITRHGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
V++SH H GALPY + +G P++ T+P + + + D + ++ + E + FT
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTQPTQAICPILLEDYRKIAVDKKGEANFFTS 123
Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
I + + Q + + E + + AGH+LG +++I E V+Y DYN
Sbjct: 124 QMIKDCMKKAVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180
Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
++HL ++ RP +LIT++ A + ++ RE F + +T+ GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPV 239
Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
+ GR EL ++LE +W L PIYF T ++ Y K F+ W I K+F R
Sbjct: 240 FALGRAQELCILLETFWERMDLKAPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF-VQR- 297
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
N F KH+ +++ D+ P GP +V A+ L AG S IF +WA + KN+V+
Sbjct: 298 NMFEFKHIKAF-DRAFADS-P-GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVI 349
Score = 55 (24.4 bits), Expect = 1.7e-39, Sum P(2) = 1.7e-39
Identities = 18/55 (32%), Positives = 25/55 (45%)
Query: 513 EGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETI 567
+G L+ A+P V+ LVHG A+ E LKQ + Y P ET+
Sbjct: 396 KGIMQLVGQAEPENVL------LVHGEAKKMEFLKQKIEQEFRLSCYMPANGETV 444
Score = 37 (18.1 bits), Expect = 1.3e-37, Sum P(2) = 1.3e-37
Identities = 5/9 (55%), Positives = 8/9 (88%)
Query: 546 LKQHCLKHV 554
LK HC++H+
Sbjct: 538 LKDHCVQHL 546
>FB|FBgn0039691
symbol:IntS11 "Integrator 11" species:7227 "Drosophila
melanogaster" [GO:0006379 "mRNA cleavage" evidence=ISS] [GO:0005847
"mRNA cleavage and polyadenylation specificity factor complex"
evidence=ISS] [GO:0006378 "mRNA polyadenylation" evidence=ISS]
[GO:0016787 "hydrolase activity" evidence=IEA] [GO:0034472 "snRNA
3'-end processing" evidence=IDA] [GO:0016180 "snRNA processing"
evidence=ISS] [GO:0032039 "integrator complex" evidence=ISS]
[GO:0022008 "neurogenesis" evidence=IMP] InterPro:IPR001279
SMART:SM00849 Pfam:PF07521 EMBL:AE014297 GO:GO:0022008
GO:GO:0006378 GO:GO:0016787 GO:GO:0005847 GO:GO:0006379
eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 KO:K13148 OMA:MAVEYMS GeneTree:ENSGT00700000104485
GO:GO:0034472 EMBL:AY061097 RefSeq:NP_651721.1 UniGene:Dm.3722
SMR:Q9VAH9 STRING:Q9VAH9 EnsemblMetazoa:FBtr0085476 GeneID:43506
KEGG:dme:Dmel_CG1972 UCSC:CG1972-RA CTD:43506 FlyBase:FBgn0039691
InParanoid:Q9VAH9 OrthoDB:EOG47D7X3 GenomeRNAi:43506 NextBio:834295
Uniprot:Q9VAH9
Length = 597
Score = 429 (156.1 bits), Expect = 2.8e-39, Sum P(2) = 2.8e-39
Identities = 111/355 (31%), Positives = 181/355 (50%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDH--F-DPSLLQPLSKVASTIDA 57
+++TPL + L+S+ G N ++DCG +ND F D S + P + S ID
Sbjct: 4 IKITPLGAGQDVGRSCLLLSMGGKNIMLDCGMHMGYNDERRFPDFSYIVPEGPITSHIDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
V++SH H GALPY + +G + P++ T P + + + D + ++ + E + FT
Sbjct: 64 VIISHFHLDHCGALPYMSEIVGYTGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTT 123
Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
I + V +T Q+ + E + + AGH+LG ++ I + V+Y DYN
Sbjct: 124 QMIKDCMKKVIPVTLHQSMMVDTDLE---IKAYYAGHVLGAAMFWIKVGSQSVVYTGDYN 180
Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
++HL ++ RP +LI+++ A + ++ RE F + + + GG VL+PV
Sbjct: 181 MTPDRHLGAAWIDK-CRPDLLISESTYATTIRDSKRCRERDFLKKVHECVAKGGKVLIPV 239
Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
+ GR EL ++LE YW +L YPIYF ++ Y K F+ W I K+F R
Sbjct: 240 FALGRAQELCILLETYWERMNLKYPIYFALGLTEKANTYYKMFITWTNQKIRKTF-VHR- 297
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 350
N F KH+ +K+ +DN P G +V A+ L AG S IF +WA + N+V+
Sbjct: 298 NMFDFKHIKPF-DKAYIDN-P-GAMVVFATPGMLHAGLSLQIFKKWAPNENNMVI 349
Score = 39 (18.8 bits), Expect = 2.8e-39, Sum P(2) = 2.8e-39
Identities = 15/59 (25%), Positives = 24/59 (40%)
Query: 513 EGSASLILDAKPSKVVSNELTVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDVTS 571
+G LI + +P V+ LVHG A + L+ Y P ET +++
Sbjct: 396 KGIMQLIQNCEPKNVM------LVHGEAGKMKFLRSKIKDEFNLETYMPANGETCVIST 448
>TAIR|locus:2206076 [details] [associations]
symbol:CPSF73-I "cleavage and polyadenylation specificity
factor 73-I" species:3702 "Arabidopsis thaliana" [GO:0005634
"nucleus" evidence=ISM;IDA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=ISS]
[GO:0006378 "mRNA polyadenylation" evidence=ISS] [GO:0016787
"hydrolase activity" evidence=IEA] [GO:0005515 "protein binding"
evidence=IPI] [GO:0000278 "mitotic cell cycle" evidence=RCA]
[GO:0006346 "methylation-dependent chromatin silencing"
evidence=RCA] [GO:0006396 "RNA processing" evidence=RCA]
[GO:0006397 "mRNA processing" evidence=RCA] [GO:0009855
"determination of bilateral symmetry" evidence=RCA] [GO:0010014
"meristem initiation" evidence=RCA] [GO:0010073 "meristem
maintenance" evidence=RCA] [GO:0016246 "RNA interference"
evidence=RCA] [GO:0031507 "heterochromatin assembly" evidence=RCA]
[GO:0045787 "positive regulation of cell cycle" evidence=RCA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0005634 GO:GO:0006397
GO:GO:0090305 EMBL:AC018908 GO:GO:0004518 eggNOG:COG1236
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
EMBL:AY140900 EMBL:AY150478 EMBL:AY074280 EMBL:AK316692
EMBL:AK316794 IPI:IPI00533462 PIR:G96635 RefSeq:NP_001031215.1
RefSeq:NP_176297.1 RefSeq:NP_849835.1 UniGene:At.23510
ProteinModelPortal:Q9C952 SMR:Q9C952 IntAct:Q9C952 STRING:Q9C952
PaxDb:Q9C952 PRIDE:Q9C952 EnsemblPlants:AT1G61010.1
EnsemblPlants:AT1G61010.2 EnsemblPlants:AT1G61010.3 GeneID:842393
KEGG:ath:AT1G61010 TAIR:At1g61010 HOGENOM:HOG000203394
InParanoid:Q9C952 KO:K14403 OMA:YVSFSAH PhylomeDB:Q9C952
ProtClustDB:CLSN2681829 Genevestigator:Q9C952 InterPro:IPR021718
Pfam:PF11718 SMART:SM01098 Uniprot:Q9C952
Length = 693
Score = 428 (155.7 bits), Expect = 1.4e-38, Sum P(2) = 1.4e-38
Identities = 122/392 (31%), Positives = 201/392 (51%)
Query: 2 GTSVQVTPLSGVFNENPLSYL-VSIDGFNFLIDCGWNDHFDPSLLQP-LSKV-ASTIDAV 58
G + VTPL G +E S + +S G N L DCG + + P ++ S+ID +
Sbjct: 19 GDQLIVTPL-GAGSEVGRSCVYMSFRGKNILFDCGIHPAYSGMAALPYFDEIDPSSIDVL 77
Query: 59 LLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFDLFT 115
L++H H +LPY +++ + VF +T+ +Y+L LLT Y + +S+ V + LF
Sbjct: 78 LITHFHIDHAASLPYFLEKTTFNGRVFMTHATKAIYKL-LLTDYVK-VSKVSVEDM-LFD 134
Query: 116 LDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDY 175
DI+ + + + + Q ++G I + AGH+LG ++ + G ++Y DY
Sbjct: 135 EQDINKSMDKIEVIDFHQTVEVNG----IKFWCYTAGHVLGAAMFMVDIAGVRILYTGDY 190
Query: 176 NRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLP 234
+R +++HL L F P + I ++ + + R RE F D I T+ GG VL+P
Sbjct: 191 SREEDRHLRAAELPQF-SPDICIIESTSGVQLHQSRHIREKRFTDVIHSTVAQGGRVLIP 249
Query: 235 VDSAGRVLELLLILEDYWAEH-SL-NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET 292
+ GR ELLLIL++YWA H L N PIY+ + ++ + ++++ M D I F
Sbjct: 250 AFALGRAQELLLILDEYWANHPDLHNIPIYYASPLAKKCMAVYQTYILSMNDRIRNQFAN 309
Query: 293 SRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
S N F+ KH++ L + +D+ D GP +V+A+ L++G S +F W SD KN +
Sbjct: 310 S--NPFVFKHISPL---NSIDDFNDVGPSVVMATPGGLQSGLSRQLFDSWCSDKKNACII 364
Query: 352 TERGQFGTLARMLQADPPPKAVKVTMSRRVPL 383
GTLA+ + +P K V + PL
Sbjct: 365 PGYMVEGTLAKTIINEP--KEVTLMNGLTAPL 394
Score = 50 (22.7 bits), Expect = 1.4e-38, Sum P(2) = 1.4e-38
Identities = 23/77 (29%), Positives = 37/77 (48%)
Query: 534 VLVHGSAEATEHLKQHCLKHVCP---HVYTPQIEETIDV--TSDLCAYKV-QLSEKL--- 584
+LVHG A LKQ L + TP+ E++++ S+ A + +L+EK
Sbjct: 425 ILVHGEANEMMRLKQKLLTEFPDGNTKIMTPKNCESVEMYFNSEKLAKTIGRLAEKTPDV 484
Query: 585 ---MSNVLFKKLGDYEI 598
+S +L KK Y+I
Sbjct: 485 GDTVSGILVKKGFTYQI 501
>CGD|CAL0004705 [details] [associations]
symbol:orf19.325 species:5476 "Candida albicans" [GO:0042493
"response to drug" evidence=IMP] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IEA]
[GO:0003723 "RNA binding" evidence=IEA] [GO:0006379 "mRNA cleavage"
evidence=IEA] [GO:0006378 "mRNA polyadenylation" evidence=IEA]
InterPro:IPR027075 CGD:CAL0004705 GO:GO:0042493 EMBL:AACQ01000027
EMBL:AACQ01000026 eggNOG:COG1236 KO:K14402 InterPro:IPR022712
InterPro:IPR025069 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
SMART:SM01027 RefSeq:XP_720020.1 RefSeq:XP_720152.1
ProteinModelPortal:Q5AEE3 STRING:Q5AEE3 GeneID:3638181
GeneID:3638320 KEGG:cal:CaO19.325 KEGG:cal:CaO19.7957
Uniprot:Q5AEE3
Length = 931
Score = 369 (135.0 bits), Expect = 1.8e-38, Sum P(5) = 1.8e-38
Identities = 110/349 (31%), Positives = 167/349 (47%)
Query: 22 LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHL-GALPYAMKQ-- 77
L+ D F + D WN D + + + +A+LLSH + G + +K
Sbjct: 20 LLEFDNEFKLIADPSWNG-VDVNAAMFMEEHLKETNAILLSHSTAEFISGFILLCIKFPI 78
Query: 78 LGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD--LFTLDDIDSAFQSVTRLTYSQNY 135
L S PV+ST PV +LG ++ + Y + + D + LD++D+ F V L Y Q+
Sbjct: 79 LMSSIPVYSTLPVNQLGRVSTVEYYRAMGFLGPVDSAILELDEVDNWFDKVNLLKYQQSL 138
Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN---------GT 186
+L +VV P+ AGH LGGT W ITK + VIYA +N K+ LN G
Sbjct: 139 NLFDNK--VVVTPYNAGHSLGGTFWLITKRIDRVIYAPAWNHSKDSFLNSASFISPSTGN 196
Query: 187 VLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLL 246
S +RP IT A + R++ E F + TL GG +LP +GR LEL
Sbjct: 197 PHLSLLRPTAFIT-ATDMGSVMSHRKRTEKFLQLVDATLANGGAAVLPTSLSGRFLELFH 255
Query: 247 ILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLL 306
+++++ + P+YFL+Y + + Y + L+WM S TK +E F V LL
Sbjct: 256 LIDEHLKGAPI--PVYFLSYSGTKILTYASNLLDWMSKSFTKEWEELSSVPFNPSKVDLL 313
Query: 307 INKSELDNAPDGPKLVLASMASLEAG-FSHDIFVEWASDVKNLVLFTER 354
++ SEL GPK+V S L +G S + F +D ++ TE+
Sbjct: 314 LDPSELLKL-SGPKIVFCSGIDLRSGDISAEAFQYLCNDEHTTIILTEK 361
Score = 77 (32.2 bits), Expect = 1.8e-38, Sum P(5) = 1.8e-38
Identities = 20/68 (29%), Positives = 36/68 (52%)
Query: 630 VGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPL 688
+G++++ DLK L + + EF G L + + +RK+ + SG IVI+G +
Sbjct: 856 IGNIRLPDLKKKLQNLNMTAEFKSEGTLVVNDILAVRKIAYGLVESDESG--DIVIDGNV 913
Query: 689 CEDYYKIR 696
YYK++
Sbjct: 914 GPLYYKVK 921
Score = 64 (27.6 bits), Expect = 1.8e-38, Sum P(5) = 1.8e-38
Identities = 21/68 (30%), Positives = 38/68 (55%)
Query: 469 FPFYE--NNSEWDDFGEVINPDDYIIKDEDMDQAAMHIGGDDGKLDEG-SASLILDAKPS 525
FP++ + ++DD+GEVI +DY DE + + + + G K DE +A+ + +
Sbjct: 537 FPYFATAHKQKFDDYGEVIKIEDYQRHDE-VSHSKIIMEGKR-KFDEKRTANNRRNKNQN 594
Query: 526 KVVSNELT 533
K +N+LT
Sbjct: 595 KQQANKLT 602
Score = 54 (24.1 bits), Expect = 1.8e-38, Sum P(5) = 1.8e-38
Identities = 17/63 (26%), Positives = 34/63 (53%)
Query: 366 ADPPPKAVKV-TMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNL 424
A P K + + ++ V L G EL ++E+ + +KE+ L + V++++++ L D
Sbjct: 395 AVPIDKNISLKNWTKEVELTGTELTEFQEKVAQKRKEKLL--AKVRDQKNQNILSADTVD 452
Query: 425 SGD 427
S D
Sbjct: 453 SED 455
Score = 47 (21.6 bits), Expect = 9.7e-37, Sum P(5) = 9.7e-37
Identities = 11/35 (31%), Positives = 18/35 (51%)
Query: 520 LDAKPSKVVSNELTVLVHGSAEATEHLKQHCLKHV 554
++ S V NE+ L A T+H+KQ K++
Sbjct: 486 INVADSNVAPNEVNPLATHEAFITDHIKQSLEKNL 520
Score = 45 (20.9 bits), Expect = 1.8e-38, Sum P(5) = 1.8e-38
Identities = 8/31 (25%), Positives = 22/31 (70%)
Query: 576 YKVQLSEKLMSNVLFKKLGD-YEIAWVDAEV 605
++V L + ++ ++ ++K+GD Y++A + E+
Sbjct: 763 FEVNLDDSIVKDLKWQKIGDDYKVAKLYGEL 793
>UNIPROTKB|Q5AEE3 [details] [associations]
symbol:CFT2 "Putative uncharacterized protein CFT2"
species:237561 "Candida albicans SC5314" [GO:0042493 "response to
drug" evidence=IMP] InterPro:IPR027075 CGD:CAL0004705 GO:GO:0042493
EMBL:AACQ01000027 EMBL:AACQ01000026 eggNOG:COG1236 KO:K14402
InterPro:IPR022712 InterPro:IPR025069 PANTHER:PTHR11203:SF5
Pfam:PF10996 Pfam:PF13299 SMART:SM01027 RefSeq:XP_720020.1
RefSeq:XP_720152.1 ProteinModelPortal:Q5AEE3 STRING:Q5AEE3
GeneID:3638181 GeneID:3638320 KEGG:cal:CaO19.325
KEGG:cal:CaO19.7957 Uniprot:Q5AEE3
Length = 931
Score = 369 (135.0 bits), Expect = 1.8e-38, Sum P(5) = 1.8e-38
Identities = 110/349 (31%), Positives = 167/349 (47%)
Query: 22 LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHL-GALPYAMKQ-- 77
L+ D F + D WN D + + + +A+LLSH + G + +K
Sbjct: 20 LLEFDNEFKLIADPSWNG-VDVNAAMFMEEHLKETNAILLSHSTAEFISGFILLCIKFPI 78
Query: 78 LGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD--LFTLDDIDSAFQSVTRLTYSQNY 135
L S PV+ST PV +LG ++ + Y + + D + LD++D+ F V L Y Q+
Sbjct: 79 LMSSIPVYSTLPVNQLGRVSTVEYYRAMGFLGPVDSAILELDEVDNWFDKVNLLKYQQSL 138
Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN---------GT 186
+L +VV P+ AGH LGGT W ITK + VIYA +N K+ LN G
Sbjct: 139 NLFDNK--VVVTPYNAGHSLGGTFWLITKRIDRVIYAPAWNHSKDSFLNSASFISPSTGN 196
Query: 187 VLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLL 246
S +RP IT A + R++ E F + TL GG +LP +GR LEL
Sbjct: 197 PHLSLLRPTAFIT-ATDMGSVMSHRKRTEKFLQLVDATLANGGAAVLPTSLSGRFLELFH 255
Query: 247 ILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLL 306
+++++ + P+YFL+Y + + Y + L+WM S TK +E F V LL
Sbjct: 256 LIDEHLKGAPI--PVYFLSYSGTKILTYASNLLDWMSKSFTKEWEELSSVPFNPSKVDLL 313
Query: 307 INKSELDNAPDGPKLVLASMASLEAG-FSHDIFVEWASDVKNLVLFTER 354
++ SEL GPK+V S L +G S + F +D ++ TE+
Sbjct: 314 LDPSELLKL-SGPKIVFCSGIDLRSGDISAEAFQYLCNDEHTTIILTEK 361
Score = 77 (32.2 bits), Expect = 1.8e-38, Sum P(5) = 1.8e-38
Identities = 20/68 (29%), Positives = 36/68 (52%)
Query: 630 VGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPL 688
+G++++ DLK L + + EF G L + + +RK+ + SG IVI+G +
Sbjct: 856 IGNIRLPDLKKKLQNLNMTAEFKSEGTLVVNDILAVRKIAYGLVESDESG--DIVIDGNV 913
Query: 689 CEDYYKIR 696
YYK++
Sbjct: 914 GPLYYKVK 921
Score = 64 (27.6 bits), Expect = 1.8e-38, Sum P(5) = 1.8e-38
Identities = 21/68 (30%), Positives = 38/68 (55%)
Query: 469 FPFYE--NNSEWDDFGEVINPDDYIIKDEDMDQAAMHIGGDDGKLDEG-SASLILDAKPS 525
FP++ + ++DD+GEVI +DY DE + + + + G K DE +A+ + +
Sbjct: 537 FPYFATAHKQKFDDYGEVIKIEDYQRHDE-VSHSKIIMEGKR-KFDEKRTANNRRNKNQN 594
Query: 526 KVVSNELT 533
K +N+LT
Sbjct: 595 KQQANKLT 602
Score = 54 (24.1 bits), Expect = 1.8e-38, Sum P(5) = 1.8e-38
Identities = 17/63 (26%), Positives = 34/63 (53%)
Query: 366 ADPPPKAVKV-TMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNL 424
A P K + + ++ V L G EL ++E+ + +KE+ L + V++++++ L D
Sbjct: 395 AVPIDKNISLKNWTKEVELTGTELTEFQEKVAQKRKEKLL--AKVRDQKNQNILSADTVD 452
Query: 425 SGD 427
S D
Sbjct: 453 SED 455
Score = 47 (21.6 bits), Expect = 9.7e-37, Sum P(5) = 9.7e-37
Identities = 11/35 (31%), Positives = 18/35 (51%)
Query: 520 LDAKPSKVVSNELTVLVHGSAEATEHLKQHCLKHV 554
++ S V NE+ L A T+H+KQ K++
Sbjct: 486 INVADSNVAPNEVNPLATHEAFITDHIKQSLEKNL 520
Score = 45 (20.9 bits), Expect = 1.8e-38, Sum P(5) = 1.8e-38
Identities = 8/31 (25%), Positives = 22/31 (70%)
Query: 576 YKVQLSEKLMSNVLFKKLGD-YEIAWVDAEV 605
++V L + ++ ++ ++K+GD Y++A + E+
Sbjct: 763 FEVNLDDSIVKDLKWQKIGDDYKVAKLYGEL 793
>POMBASE|SPAC17G6.16c [details] [associations]
symbol:ysh1 "mRNA cleavage and polyadenylation
specificity factor complex endoribonuclease subunit Ysh1"
species:4896 "Schizosaccharomyces pombe" [GO:0004521
"endoribonuclease activity" evidence=ISO] [GO:0005634 "nucleus"
evidence=IDA] [GO:0005829 "cytosol" evidence=IDA] [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=IDA] [GO:0006378 "mRNA polyadenylation" evidence=IC]
[GO:0006379 "mRNA cleavage" evidence=IC] [GO:0046872 "metal ion
binding" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 PomBase:SPAC17G6.16c Pfam:PF07521 GO:GO:0005829
EMBL:CU329670 GenomeReviews:CU329670_GR GO:GO:0046872 GO:GO:0006378
GO:GO:0004521 GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
HOGENOM:HOG000203394 InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
OrthoDB:EOG41ZJK7 PIR:T37848 RefSeq:NP_594263.2 STRING:O13794
EnsemblFungi:SPAC17G6.16c.1 GeneID:2542258 NextBio:20803322
Uniprot:O13794
Length = 757
Score = 422 (153.6 bits), Expect = 9.9e-37, P = 9.9e-37
Identities = 115/386 (29%), Positives = 199/386 (51%)
Query: 12 GVFNENPLS-YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSHPDTLHL 68
G NE S +++ G ++D G + + P ST+D +L+SH H+
Sbjct: 25 GAGNEVGRSCHVIQYKGKTVMLDAGVHPAYTGLSALPFFDEFDLSTVDVLLISHFHLDHV 84
Query: 69 GALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVS-EFDLFTLDDIDSAFQSVT 127
+LPY M++ VF T P + + D Y+ V E L+ D+ +AF +
Sbjct: 85 ASLPYVMQKTNFRGRVFMTHPTKAVCKWLLSD-YVKVSNVGMEDQLYDEKDLLAAFDRIE 143
Query: 128 RLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTV 187
+ +YH + + EGI P+ AGH+LG ++ + G ++++ DY+R +++HL+
Sbjct: 144 AV----DYHSTIEVEGIKFTPYHAGHVLGACMYFVEMAGVNILFTGDYSREEDRHLHVAE 199
Query: 188 LESFVRPAVLITDA-YNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLL 246
+ RP VLIT++ Y +QP ++ + I T+R GG VL+PV + GR ELLL
Sbjct: 200 VPP-KRPDVLITESTYGTASHQPRLEKEARLLNIIHSTIRNGGRVLMPVFALGRAQELLL 258
Query: 247 ILEDYWAEH-SL-NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVT 304
IL++YW H L + PIY+ + ++ + ++++ M D+I K F + N F+ + V
Sbjct: 259 ILDEYWNNHLDLRSVPIYYASSLARKCMAIFQTYVNMMNDNIRKIF--AERNPFIFRFVK 316
Query: 305 LLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML 364
L N + D+ GP ++LAS L+ G S + WA D +N +L T GT+A+ +
Sbjct: 317 SLRNLEKFDDI--GPSVILASPGMLQNGVSRTLLERWAPDPRNTLLLTGYSVEGTMAKQI 374
Query: 365 QADPPPKAVKVTMSRRVP--LVGEEL 388
+ P + V ++ +++P + EEL
Sbjct: 375 -TNEPIEIVSLS-GQKIPRRMAVEEL 398
>WB|WBGene00008642 [details] [associations]
symbol:F10B5.8 species:6239 "Caenorhabditis elegans"
[GO:0016787 "hydrolase activity" evidence=IEA] [GO:0009792 "embryo
development ending in birth or egg hatching" evidence=IMP]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0009792 EMBL:Z48334 GO:GO:0016787 eggNOG:COG1236
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
HOGENOM:HOG000231294 KO:K13148 OMA:MAVEYMS
GeneTree:ENSGT00700000104485 PIR:T20694 RefSeq:NP_495706.2
ProteinModelPortal:Q9U3K2 SMR:Q9U3K2 STRING:Q9U3K2 PaxDb:Q9U3K2
EnsemblMetazoa:F10B5.8 GeneID:174310 KEGG:cel:CELE_F10B5.8
UCSC:F10B5.8 CTD:174310 WormBase:F10B5.8 InParanoid:Q9U3K2
NextBio:883468 Uniprot:Q9U3K2
Length = 608
Score = 404 (147.3 bits), Expect = 2.7e-36, Sum P(2) = 2.7e-36
Identities = 105/397 (26%), Positives = 195/397 (49%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
+++ PL + L++I G N ++DCG + D F D S + ++ +D
Sbjct: 8 IKIVPLGAGQDVGRSCILITIGGKNIMVDCGMHMGYQDDRRFPDFSYIGGGGRLTDYLDC 67
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVS-EFDLFTL 116
V++SH H G+LP+ + +G P++ T P + + + D + + E + FT
Sbjct: 68 VIISHFHLDHCGSLPHMSEIVGYDGPIYMTYPTKAICPVLLEDYRKVQCDIKGETNFFTS 127
Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
DDI + + V + H+ + + + AGH+LG +++I V+Y DYN
Sbjct: 128 DDIKNCMKKVVGCALHEIIHVDNE---LSIRAFYAGHVLGAAMFEIRLGDHSVLYTGDYN 184
Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
++HL + VRP VLI+++ A + ++ RE F + + + GG V++PV
Sbjct: 185 MTPDRHLGAARVLPGVRPTVLISESTYATTIRDSKRARERDFLRKVHECVMKGGKVIIPV 244
Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
+ GR EL ++LE YW +LN PIYF ++ Y + F+ W ++I K+F R
Sbjct: 245 FALGRAQELCILLESYWERMALNVPIYFSQGLAERANQYYRLFISWTNENIKKTF-VER- 302
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
N F KH+ + + ++ P GP+++ ++ L G S +F +W SD N+++
Sbjct: 303 NMFEFKHIKPM--EKGCEDQP-GPQVLFSTPGMLHGGQSLKVFKKWCSDPLNMIIMPGYC 359
Query: 356 QFGTL-ARMLQADPPPKAVKVTMSRRVPLVGEELIAY 391
GT+ AR++ + K +++ +G E +++
Sbjct: 360 VAGTVGARVINGE---KKIEIDQKMHEIRLGVEYMSF 393
Score = 49 (22.3 bits), Expect = 2.7e-36, Sum P(2) = 2.7e-36
Identities = 24/91 (26%), Positives = 37/91 (40%)
Query: 484 VINPDDYIIKDEDMDQAAMHIG--GDDGKLD-EGSASLILDAKPSKVVSNELTVLVHGSA 540
VIN + I D+ M + + + D +G LI +P V+ VHG A
Sbjct: 368 VINGEKKIEIDQKMHEIRLGVEYMSFSAHADAKGIMQLIRQCEPQHVM------FVHGEA 421
Query: 541 EATEHLKQHCLKHVCPHVYTPQIEETIDVTS 571
E LK K V+ P ET+ +++
Sbjct: 422 SKMEFLKGKVEKEYKVPVHMPANGETVVISA 452
>SGD|S000004267 [details] [associations]
symbol:YSH1 "Putative endoribonuclease" species:4932
"Saccharomyces cerevisiae" [GO:0004518 "nuclease activity"
evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0016787
"hydrolase activity" evidence=IEA] [GO:0046872 "metal ion binding"
evidence=IEA] [GO:0006397 "mRNA processing" evidence=IEA]
[GO:0006378 "mRNA polyadenylation" evidence=IMP] [GO:0006379 "mRNA
cleavage" evidence=IMP] [GO:0006369 "termination of RNA polymerase
II transcription" evidence=IMP] [GO:0031126 "snoRNA 3'-end
processing" evidence=IMP] [GO:0008380 "RNA splicing" evidence=IMP]
[GO:0034247 "snoRNA splicing" evidence=IMP] [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=IDA;IPI] [GO:0005849 "mRNA cleavage factor complex"
evidence=IPI] [GO:0004521 "endoribonuclease activity"
evidence=ISS;IMP] [GO:0003723 "RNA binding" evidence=IC]
[GO:0004519 "endonuclease activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 SGD:S000004267
Pfam:PF07521 GO:GO:0046872 GO:GO:0006378 EMBL:BK006945
GO:GO:0004521 GO:GO:0005847 GO:GO:0006379 GO:GO:0006369
eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 EMBL:U17245 HOGENOM:HOG000203394 KO:K14403
InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
GeneTree:ENSGT00700000104485 GO:GO:0031126 GO:GO:0034247
OMA:EISFAAH OrthoDB:EOG41ZJK7 PIR:S51413 RefSeq:NP_013379.1
ProteinModelPortal:Q06224 SMR:Q06224 DIP:DIP-2470N IntAct:Q06224
MINT:MINT-375457 STRING:Q06224 PaxDb:Q06224 PeptideAtlas:Q06224
EnsemblFungi:YLR277C GeneID:850983 KEGG:sce:YLR277C CYGD:YLR277c
NextBio:967501 Genevestigator:Q06224 GermOnline:YLR277C
Uniprot:Q06224
Length = 779
Score = 406 (148.0 bits), Expect = 3.0e-36, Sum P(3) = 3.0e-36
Identities = 105/371 (28%), Positives = 182/371 (49%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYRLGL-----LTMYDQYLS 104
S +D +L+SH H +LPY M++ VF T P +YR L +T S
Sbjct: 59 SKVDILLISHFHLDHAASLPYVMQRTNFQGRVFMTHPTKAIYRWLLRDFVRVTSIGSSSS 118
Query: 105 RRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITK 164
+ LF+ +D+ +F + + +YH + GI AGH+LG +++I
Sbjct: 119 SMGTKDEGLFSDEDLVDSFDKIETV----DYHSTVDVNGIKFTAFHAGHVLGAAMFQIEI 174
Query: 165 DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKT 224
G V++ DY+R ++HLN + +++ + ++P + I T
Sbjct: 175 AGLRVLFTGDYSREVDRHLNSAEVPPLSSNVLIVESTFGTATHEPRLNRERKLTQLIHST 234
Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LN---YPIYFLTYVSSSTIDYVKSFL 279
+ GG VLLPV + GR E++LIL++YW++H+ L PI++ + ++ + ++++
Sbjct: 235 VMRGGRVLLPVFALGRAQEIMLILDEYWSQHADELGGGQVPIFYASNLAKKCMSVFQTYV 294
Query: 280 EWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFV 339
M D I K F S+ N F+ K+++ L N + + GP ++LAS L++G S D+
Sbjct: 295 NMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF--GPSVMLASPGMLQSGLSRDLLE 352
Query: 340 EWASDVKNLVLFTERGQFGTLAR--MLQADPPPKA--VKVTMSRRVPLVGEELIAYEEEQ 395
W + KNLVL T GT+A+ ML+ D P ++T+ RR + A+ + Q
Sbjct: 353 RWCPEDKNLVLITGYSIEGTMAKFIMLEPDTIPSINNPEITIPRRCQVEEISFAAHVDFQ 412
Query: 396 TRLKKEEALKA 406
L+ E + A
Sbjct: 413 ENLEFIEKISA 423
Score = 51 (23.0 bits), Expect = 3.0e-36, Sum P(3) = 3.0e-36
Identities = 13/38 (34%), Positives = 22/38 (57%)
Query: 375 VTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEE 412
V +++ V + E+ Y+EE +K+E A K +KEE
Sbjct: 475 VKVAKAVGNIVNEI--YKEENVEIKEEIAAKIEPIKEE 510
Score = 45 (20.9 bits), Expect = 3.0e-36, Sum P(3) = 3.0e-36
Identities = 12/49 (24%), Positives = 22/49 (44%)
Query: 479 DDFGEVINPDDYIIKDEDMDQAAMHIGGDDGKLDEGSASLI-LDAKPSK 526
D F +N D+Y E+ + IG K+D + ++ ++ P K
Sbjct: 713 DCFTLFLNKDEYASNKEETITGVVTIGKSTAKIDFNNMKILECNSNPLK 761
Score = 41 (19.5 bits), Expect = 1.6e-34, Sum P(2) = 1.6e-34
Identities = 27/121 (22%), Positives = 49/121 (40%)
Query: 534 VLVHGSAEATEHLKQHCLKHVCP--------HVYTPQIEETIDVTSDLCAYKVQLSEKLM 585
+LVHG A LK L + HV+ P+ ++V + KV + +
Sbjct: 427 ILVHGEANPMGRLKSALLSNFASLKGTDNEVHVFNPR--NCVEVDLEFQGVKVAKAVGNI 484
Query: 586 SNVLFKKLG---DYEIAW-VDAEVGKTENGMLSLLPIS-TPAPPHKSVLVGDLKMADLKP 640
N ++K+ EIA ++ + E+ + S HK ++V + ++D K
Sbjct: 485 VNEIYKEENVEIKEEIAAKIEPIKEENEDNLDSQAEKGLVDEEEHKDIVVSGILVSDDKN 544
Query: 641 F 641
F
Sbjct: 545 F 545
>UNIPROTKB|G4N6C6 [details] [associations]
symbol:MGG_06570 "Uncharacterized protein" species:242507
"Magnaporthe oryzae 70-15" [GO:0003674 "molecular_function"
evidence=ND] [GO:0005634 "nucleus" evidence=ISS] [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=ISS] [GO:0005849 "mRNA cleavage factor complex"
evidence=ISS] [GO:0006378 "mRNA polyadenylation" evidence=ISS]
[GO:0006379 "mRNA cleavage" evidence=ISS] InterPro:IPR027075
Pfam:PF07521 GO:GO:0006378 EMBL:CM001234 GO:GO:0005847
GO:GO:0006379 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
SMART:SM01027 RefSeq:XP_003716967.1 EnsemblFungi:MGG_06570T0
GeneID:2684725 KEGG:mgr:MGG_06570 Uniprot:G4N6C6
Length = 962
Score = 213 (80.0 bits), Expect = 1.7e-35, Sum P(6) = 1.7e-35
Identities = 57/176 (32%), Positives = 80/176 (45%)
Query: 143 GIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEK-----------HLNG--TVLE 189
G+ + + AGH LGGT+W I E ++YAVD+N ++ H G V+E
Sbjct: 174 GLTITAYNAGHSLGGTIWHIQHGMESIVYAVDWNLARDNVYAGAAWMGGGHGGGGAEVIE 233
Query: 190 SFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILE 249
+P L+ A + + D + + GG VL+PVDS+ RVLEL +LE
Sbjct: 234 QLRKPTALVCSTRTAEGGLTRAARDKQLLDTMRMAISRGGTVLIPVDSSARVLELAYLLE 293
Query: 250 DYW-AEHSLN------YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAF 298
W +E S +Y STI KS EWM +SI + FE D F
Sbjct: 294 HAWRSEASTEGGGLSTAKLYLAGRSVHSTIKLAKSMFEWMDNSIVQEFEAGADQGF 349
Score = 150 (57.9 bits), Expect = 1.7e-35, Sum P(6) = 1.7e-35
Identities = 36/101 (35%), Positives = 53/101 (52%)
Query: 8 TPLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDT 65
+PL G +E S L+ +DG LID GW++ FD L+ + K T+ +LL+H
Sbjct: 5 SPLQGALSEATASQSLLELDGGVKVLIDIGWDETFDVEKLKEVEKQVPTLSLILLTHATV 64
Query: 66 LHLGALPYAMKQLGLSA--PVFSTEPVYRLGLLTMYDQYLS 104
HL AL + K L A P+++T+P LG + D Y S
Sbjct: 65 PHLSALVHCCKNFPLFARIPIYATQPAIDLGRTLIQDLYSS 105
Score = 93 (37.8 bits), Expect = 1.7e-35, Sum P(6) = 1.7e-35
Identities = 22/85 (25%), Positives = 46/85 (54%)
Query: 534 VLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKL 593
+LV GSA+ TE + C ++ V+TP + +D + D A+ V+L++ L+ + ++++
Sbjct: 740 ILVAGSADETEAVADDCRRNAI-EVFTPPVGAVVDASVDTNAWVVKLADPLVKRLKWQQV 798
Query: 594 GDYEIAWVDAEVGKTENGMLSLLPI 618
I V A++ T + +P+
Sbjct: 799 RGLGIVTVTAQLTATPAAQKNGIPL 823
Score = 77 (32.2 bits), Expect = 1.7e-35, Sum P(6) = 1.7e-35
Identities = 23/63 (36%), Positives = 37/63 (58%)
Query: 298 FLLKHVTLLINKSE----LDNAPDG--PKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 351
F K++ LL K++ L+ + D K++LA+ SLE GFS DI A+D +N+V+
Sbjct: 369 FDFKYLRLLDRKAQVLKLLEPSTDELRGKVILATDTSLEWGFSKDIISAIANDSRNMVIL 428
Query: 352 TER 354
E+
Sbjct: 429 PEK 431
Score = 70 (29.7 bits), Expect = 3.5e-33, Sum P(6) = 3.5e-33
Identities = 26/82 (31%), Positives = 41/82 (50%)
Query: 595 DYEIAWVDAEVGKTENGMLSLLPISTPAPPHKS--VL-VGDLKMADLKPFLSSKGIQVEF 651
D E D +VG L +LP++ + + VL VG+L++ADL+ + + G +F
Sbjct: 844 DQEPTAEDEDVGVMPT--LDVLPVAMVSASRSAAQVLHVGELRLADLRRTMQNLGHSADF 901
Query: 652 AG-GALRCGEYVTIRKVGPAGQ 672
G G L V +RK AG+
Sbjct: 902 RGEGTLLIDGTVVVRKTA-AGR 922
Score = 67 (28.6 bits), Expect = 1.7e-35, Sum P(6) = 1.7e-35
Identities = 12/28 (42%), Positives = 17/28 (60%)
Query: 468 MFPFYENNSEWDDFGEVINPDDYIIKDE 495
MFP D+FGE+I P+DY+ +E
Sbjct: 592 MFPLAVRRKRNDEFGELIRPEDYLRAEE 619
Score = 42 (19.8 bits), Expect = 1.7e-35, Sum P(6) = 1.7e-35
Identities = 7/23 (30%), Positives = 15/23 (65%)
Query: 371 KAVKVTMSRRVPLVGEELIAYEE 393
+ +++ S++VPL EL Y++
Sbjct: 476 RELQIRESKKVPLADSELSIYQQ 498
>SGD|S000004105 [details] [associations]
symbol:CFT2 "Subunit of the mRNA cleavage and
polyadenylation factor (CPF)" species:4932 "Saccharomyces
cerevisiae" [GO:0005849 "mRNA cleavage factor complex"
evidence=IPI] [GO:0006378 "mRNA polyadenylation" evidence=IDA;TAS]
[GO:0005847 "mRNA cleavage and polyadenylation specificity factor
complex" evidence=IDA;IPI] [GO:0005634 "nucleus" evidence=IEA]
[GO:0006397 "mRNA processing" evidence=IEA] [GO:0006379 "mRNA
cleavage" evidence=IDA;TAS] [GO:0003723 "RNA binding" evidence=IPI]
SGD:S000004105 GO:GO:0006378 EMBL:BK006945 GO:GO:0003723
EMBL:X89514 EMBL:U53878 EMBL:U53877 EMBL:Z73288 GO:GO:0005847
GO:GO:0006379 eggNOG:COG1236 KO:K14402 InterPro:IPR022712
InterPro:IPR025069 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
EMBL:Z73287 PIR:S64952 RefSeq:NP_013216.1 PDB:2I7X PDBsum:2I7X
ProteinModelPortal:Q12102 SMR:Q12102 DIP:DIP-2468N IntAct:Q12102
MINT:MINT-375505 STRING:Q12102 PaxDb:Q12102 PeptideAtlas:Q12102
EnsemblFungi:YLR115W GeneID:850806 KEGG:sce:YLR115W CYGD:YLR115w
GeneTree:ENSGT00700000104551 HOGENOM:HOG000001120 OMA:YSQPHQP
OrthoDB:EOG4W11N8 EvolutionaryTrace:Q12102 NextBio:967034
Genevestigator:Q12102 GermOnline:YLR115W Uniprot:Q12102
Length = 859
Score = 351 (128.6 bits), Expect = 4.5e-35, Sum P(3) = 4.5e-35
Identities = 103/356 (28%), Positives = 173/356 (48%)
Query: 22 LVSIDGFNFLIDCGWNDH--FDPSLLQPLSKVASTIDAVLLSHPDTLHLGA---LPYAMK 76
+V D LID GWN ++ KV ID ++LS P LGA L Y
Sbjct: 19 VVRFDNVTLLIDPGWNPSKVSYEQCIKYWEKVIPEIDVIILSQPTIECLGAHSLLYYNFT 78
Query: 77 QLGLSA-PVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLD--DIDSAFQSVTRLTYSQ 133
+S V++T PV LG ++ D Y S + +D LD DI+ +F + L YSQ
Sbjct: 79 SHFISRIQVYATLPVINLGRVSTIDSYASAGVIGPYDTNKLDLEDIEISFDHIVPLKYSQ 138
Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN--------G 185
L + +G+ + + AG GG++W I+ E ++YA +N ++ LN G
Sbjct: 139 LVDLRSRYDGLTLLAYNAGVCPGGSIWCISTYSEKLVYAKRWNHTRDNILNAASILDATG 198
Query: 186 TVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELL 245
L + +RP+ +IT +QP +++ ++F+D + K L + G+V++PVD +G+ L+L
Sbjct: 199 KPLSTLMRPSAIITTLDRFGSSQPFKKRSKIFKDTLKKGLSSDGSVIIPVDMSGKFLDLF 258
Query: 246 -----LILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA--F 298
L+ E P+ L+Y T+ Y KS LEW+ S+ K++E +R+N F
Sbjct: 259 TQVHELLFESTKINAHTQVPVLILSYARGRTLTYAKSMLEWLSPSLLKTWE-NRNNTSPF 317
Query: 299 LLKHVTLLINKSELDNAPDGPKLVLASMASLEAG-FSHDIFVEWASDVKNLVLFTE 353
+ +I +EL P G K+ S E G +++ ++ + K ++ T+
Sbjct: 318 EIGSRIKIIAPNELSKYP-GSKICFVS----EVGALINEVIIKVGNSEKTTLILTK 368
Score = 98 (39.6 bits), Expect = 3.7e-07, Sum P(3) = 3.7e-07
Identities = 41/177 (23%), Positives = 74/177 (41%)
Query: 260 PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA-FLLKHVTLLINKSELDNAPDG 318
P+ L+Y T+ Y KS LEW+ S+ K++E + + F + +I +EL P G
Sbjct: 278 PVLILSYARGRTLTYAKSMLEWLSPSLLKTWENRNNTSPFEIGSRIKIIAPNELSKYP-G 336
Query: 319 PKLVLASMAS-------LEAGFSHDIFV-------EWASDVKNLVLFTERGQ--FGTLAR 362
K+ S ++ G S + E AS + ++ E+ + + T
Sbjct: 337 SKICFVSEVGALINEVIIKVGNSEKTTLILTKPSFECASSLDKILEIVEQDERNWKTFPE 396
Query: 363 MLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLG 419
++ + + + PL EE A++ + K++ K LVK E K + G
Sbjct: 397 DGKSFLCDNYISIDTIKEEPLSKEETEAFKVQLKEKKRDRNKKILLVKRESKKLANG 453
Score = 93 (37.8 bits), Expect = 4.5e-35, Sum P(3) = 4.5e-35
Identities = 38/160 (23%), Positives = 70/160 (43%)
Query: 515 SASLILDAKPSKVVSNELTV-LVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDVTSDL 573
S ++L A P ++ + E+T L+ + E ++ + + + T I +ID D
Sbjct: 678 SRKIVLSA-PKQIQNEEITAKLIKKNIEVV-NMPLNKIVEFSTTIKTLDI--SIDSNLDN 733
Query: 574 CAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPPHKS--VLVG 631
++S+ + +L + V+ L L P+ + HK+ + +G
Sbjct: 734 LLKWQRISDSYTVATVVGRLVKESLPQVNNHQKTASRSKLVLKPLHGSSRSHKTGALSIG 793
Query: 632 DLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKVGPA 670
D+++A LK L+ K EF G G L E V +RK+ A
Sbjct: 794 DVRLAQLKKLLTEKNYIAEFKGEGTLVINEKVAVRKINDA 833
Score = 58 (25.5 bits), Expect = 4.5e-35, Sum P(3) = 4.5e-35
Identities = 14/46 (30%), Positives = 24/46 (52%)
Query: 452 DILIDGFVPPST-SVAPMFPFYENNSEWDDFGEVINPDDYIIKDED 496
++ +D + PS S MFPF + DD+G V++ ++ D D
Sbjct: 519 EVPVDIIIQPSAASKHKMFPFNPAKIKKDDYGTVVDFTMFLPDDSD 564
>ZFIN|ZDB-GENE-030131-3275 [details] [associations]
symbol:cpsf3 "cleavage and polyadenylation
specific factor 3" species:7955 "Danio rerio" [GO:0016787
"hydrolase activity" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 ZFIN:ZDB-GENE-030131-3275 GO:GO:0016787
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
KO:K14403 InterPro:IPR021718 Pfam:PF11718 SMART:SM01098 CTD:51692
HOVERGEN:HBG051107 EMBL:AY648793 IPI:IPI00509063
RefSeq:NP_001003836.1 UniGene:Dr.77231 ProteinModelPortal:Q6DRG6
SMR:Q6DRG6 STRING:Q6DRG6 GeneID:324554 KEGG:dre:324554
NextBio:20808833 ArrayExpress:Q6DRG6 Uniprot:Q6DRG6
Length = 690
Score = 396 (144.5 bits), Expect = 6.0e-34, P = 6.0e-34
Identities = 106/396 (26%), Positives = 203/396 (51%)
Query: 22 LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
++ G ++DCG + + P + + ID +L+SH H GALP+ +++
Sbjct: 36 ILEFKGRKIMVDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 95
Query: 80 LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
F +T+ +YR LL+ Y+ +S D L+T D++ + + + N+
Sbjct: 96 FKGRTFMTHATKAIYRW-LLS---DYVKVSNISADDMLYTETDLEESMDKIETI----NF 147
Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
H + GI + AGH+LG ++ I G ++Y D++R++++HL + S V+P
Sbjct: 148 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPS-VKPD 206
Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
+LIT++ H R++RE F + + + G L+PV + GR ELLLIL++YW
Sbjct: 207 ILITESTYGTHIHEKREEREARFCNTVHDIVNREGRCLIPVFALGRAQELLLILDEYWQN 266
Query: 255 H-SLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
H L+ PIY+ + ++ + ++++ M D I K+ + N F+ KH++ N +
Sbjct: 267 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKAININ--NPFVFKHIS---NLKSM 321
Query: 313 DNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPK 371
D+ D GP +V+AS +++G S ++F W +D +N V+ GTLA+ + ++P +
Sbjct: 322 DHFDDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP--E 379
Query: 372 AVKVTMSRRVPL-VGEELIAYEEEQTRLKKEEALKA 406
+ +++PL + + I++ + E ++A
Sbjct: 380 EITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRA 415
>UNIPROTKB|I3LKR1 [details] [associations]
symbol:CPSF3 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0008409 "5'-3' exonuclease activity" evidence=IEA]
[GO:0006398 "histone mRNA 3'-end processing" evidence=IEA]
[GO:0005847 "mRNA cleavage and polyadenylation specificity factor
complex" evidence=IEA] [GO:0004521 "endoribonuclease activity"
evidence=IEA] [GO:0003723 "RNA binding" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0003723 GO:GO:0004521 GO:GO:0005847 InterPro:IPR022712
InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 OMA:YVSFSAH
InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
GeneTree:ENSGT00700000104485 GO:GO:0008409 EMBL:FP312696
Ensembl:ENSSSCT00000027309 Uniprot:I3LKR1
Length = 687
Score = 394 (143.8 bits), Expect = 9.8e-34, P = 9.8e-34
Identities = 104/396 (26%), Positives = 202/396 (51%)
Query: 22 LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
++ G ++DCG + + P + + ID +L+SH H GALP+ +++
Sbjct: 29 ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88
Query: 80 LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
F +T+ +YR LL+ Y + +S D L+T D++ + + + N+
Sbjct: 89 FKGRTFMTHATKAIYRW-LLSDYVKVRKCSNISADDMLYTETDLEESMDKIETI----NF 143
Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
H + GI + AGH+LG ++ I G ++Y D++R++++HL + + ++P
Sbjct: 144 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 202
Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
+LI ++ H R++RE F + + + GG L+PV + GR ELLLIL++YW
Sbjct: 203 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 262
Query: 255 H-SLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
H L+ PIY+ + ++ + ++++ M D I K + N F+ KH++ N +
Sbjct: 263 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININ--NPFVFKHIS---NLKSM 317
Query: 313 DNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPK 371
D+ D GP +V+AS +++G S ++F W +D +N V+ GTLA+ + ++P +
Sbjct: 318 DHFDDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP--E 375
Query: 372 AVKVTMSRRVPL-VGEELIAYEEEQTRLKKEEALKA 406
+ +++PL + + I++ + E ++A
Sbjct: 376 EITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRA 411
>FB|FBgn0261065
symbol:Cpsf73 "Cleavage and polyadenylation specificity
factor 73" species:7227 "Drosophila melanogaster" [GO:0006379 "mRNA
cleavage" evidence=ISS] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=ISS;NAS]
[GO:0006378 "mRNA polyadenylation" evidence=ISS;IMP] [GO:0016787
"hydrolase activity" evidence=IEA] [GO:0006398 "histone mRNA 3'-end
processing" evidence=IMP] [GO:0003677 "DNA binding" evidence=IDA]
[GO:0022008 "neurogenesis" evidence=IMP] InterPro:IPR001279
Pfam:PF00753 SMART:SM00849 Pfam:PF07521 EMBL:AE014297 GO:GO:0022008
GO:GO:0003677 GO:GO:0006378 GO:GO:0016787 GO:GO:0005847
GO:GO:0006379 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
Pfam:PF10996 SMART:SM01027 GO:GO:0006398 KO:K14403 OMA:YVSFSAH
InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
GeneTree:ENSGT00700000104485 EMBL:AY119128 RefSeq:NP_650738.1
UniGene:Dm.13714 SMR:Q9VE51 IntAct:Q9VE51 MINT:MINT-804945
STRING:Q9VE51 EnsemblMetazoa:FBtr0083690 GeneID:42240
KEGG:dme:Dmel_CG7698 UCSC:CG7698-RA CTD:42240 FlyBase:FBgn0261065
InParanoid:Q9VE51 OrthoDB:EOG4P5HR4 GenomeRNAi:42240 NextBio:827838
Uniprot:Q9VE51
Length = 684
Score = 393 (143.4 bits), Expect = 1.3e-33, P = 1.3e-33
Identities = 108/432 (25%), Positives = 212/432 (49%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSH 62
+Q+ PL ++ G ++DCG + P + A ID + +SH
Sbjct: 18 LQIKPLGAGQEVGRSCIMLEFKGKKIMLDCGIHPGLSGMDALPYVDLIEADEIDLLFISH 77
Query: 63 PDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVS-EFDLFTLDD 118
H GALP+ + + F +T+ +YR M Y+ +S E L+T D
Sbjct: 78 FHLDHCGALPWFLMKTSFKGRCFMTHATKAIYRW----MLSDYIKISNISTEQMLYTEAD 133
Query: 119 IDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRR 178
++++ + + + N+H G+ ++AGH+LG ++ I G ++Y D++R+
Sbjct: 134 LEASMEKIETI----NFHEERDVMGVRFCAYIAGHVLGAAMFMIEIAGIKILYTGDFSRQ 189
Query: 179 KEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDS 237
+++HL + ++P VLIT++ H R+ RE F + K ++ GG L+PV +
Sbjct: 190 EDRHLMAAEVPP-MKPDVLITESTYGTHIHEKREDRENRFTSLVQKIVQQGGRCLIPVFA 248
Query: 238 AGRVLELLLILEDYWAEH-SLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
GR ELLLIL+++W+++ L+ PIY+ + ++ + ++++ M D I + +
Sbjct: 249 LGRAQELLLILDEFWSQNPDLHEIPIYYASSLAKKCMAVYQTYINAMNDRIRRQIAVN-- 306
Query: 296 NAFLLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTER 354
N F+ +H++ N +D+ D GP +++AS +++G S ++F W +D KN V+
Sbjct: 307 NPFVFRHIS---NLKGIDHFEDIGPCVIMASPGMMQSGLSRELFESWCTDPKNGVIIAGY 363
Query: 355 GQFGTLARMLQADPPPKAVKVTMSRRVPL-VGEELIAYEEEQTRLKKEEALKASLVKEEE 413
GTLA+ + ++P + + +++PL + + I++ + E ++ L+K
Sbjct: 364 CVEGTLAKAVLSEP--EEITTLSGQKLPLNMSVDYISFSAHTDYQQTSEFIR--LLKPTH 419
Query: 414 SKASLGPDNNLS 425
G N +S
Sbjct: 420 VVLVHGEQNEMS 431
>UNIPROTKB|P79101
symbol:CPSF3 "Cleavage and polyadenylation specificity
factor subunit 3" species:9913 "Bos taurus" [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=ISS] [GO:0006398 "histone mRNA 3'-end processing"
evidence=ISS] [GO:0003723 "RNA binding" evidence=ISS] [GO:0004521
"endoribonuclease activity" evidence=ISS] [GO:0008409 "5'-3'
exonuclease activity" evidence=ISS] [GO:0046872 "metal ion binding"
evidence=IEA] [GO:0030529 "ribonucleoprotein complex" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0046872 GO:GO:0003723 GO:GO:0030529 GO:GO:0004521
GO:GO:0005847 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
Pfam:PF10996 SMART:SM01027 GO:GO:0006398 HOGENOM:HOG000203394
KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
EMBL:X95906 EMBL:BC104553 IPI:IPI00708839 RefSeq:NP_776709.1
UniGene:Bt.5045 ProteinModelPortal:P79101 SMR:P79101 STRING:P79101
PRIDE:P79101 Ensembl:ENSBTAT00000026303 GeneID:281712
KEGG:bta:281712 CTD:51692 GeneTree:ENSGT00700000104485
HOVERGEN:HBG051107 InParanoid:P79101 OrthoDB:EOG4FN4H6
NextBio:20805634 ArrayExpress:P79101 GO:GO:0008409 Uniprot:P79101
Length = 684
Score = 390 (142.3 bits), Expect = 2.8e-33, P = 2.8e-33
Identities = 104/396 (26%), Positives = 202/396 (51%)
Query: 22 LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
++ G ++DCG + + P + + ID +L+SH H GALP+ +++
Sbjct: 29 ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88
Query: 80 LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
F +T+ +YR LL+ Y+ +S D L+T D++ + + + N+
Sbjct: 89 FKGRTFMTHATKAIYRW-LLS---DYVKVSNISADDMLYTETDLEESMDKIETI----NF 140
Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
H + GI + AGH+LG ++ I G ++Y D++R++++HL + + ++P
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199
Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
+LI ++ H R++RE F + + + GG L+PV + GR ELLLIL++YW
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259
Query: 255 H-SLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
H L+ PIY+ + ++ + ++++ M D I K + N F+ KH++ N +
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININ--NPFVFKHIS---NLKSM 314
Query: 313 DNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPK 371
D+ D GP +V+AS +++G S ++F W +D +N V+ GTLA+ + ++P +
Sbjct: 315 DHFDDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP--E 372
Query: 372 AVKVTMSRRVPL-VGEELIAYEEEQTRLKKEEALKA 406
+ +++PL + + I++ + E ++A
Sbjct: 373 EITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRA 408
>UNIPROTKB|Q9UKF6
symbol:CPSF3 "Cleavage and polyadenylation specificity
factor subunit 3" species:9606 "Homo sapiens" [GO:0030529
"ribonucleoprotein complex" evidence=IEA] [GO:0046872 "metal ion
binding" evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
[GO:0008409 "5'-3' exonuclease activity" evidence=ISS] [GO:0004521
"endoribonuclease activity" evidence=ISS] [GO:0003723 "RNA binding"
evidence=ISS] [GO:0006398 "histone mRNA 3'-end processing"
evidence=IDA] [GO:0005847 "mRNA cleavage and polyadenylation
specificity factor complex" evidence=IDA] [GO:0006378 "mRNA
polyadenylation" evidence=TAS] [GO:0006379 "mRNA cleavage"
evidence=TAS] [GO:0000398 "mRNA splicing, via spliceosome"
evidence=TAS] [GO:0005654 "nucleoplasm" evidence=TAS] [GO:0006366
"transcription from RNA polymerase II promoter" evidence=TAS]
[GO:0006369 "termination of RNA polymerase II transcription"
evidence=TAS] [GO:0006397 "mRNA processing" evidence=TAS]
[GO:0006406 "mRNA export from nucleus" evidence=TAS] [GO:0008380
"RNA splicing" evidence=TAS] [GO:0010467 "gene expression"
evidence=TAS] [GO:0031124 "mRNA 3'-end processing" evidence=TAS]
Reactome:REACT_71 InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
EMBL:AF017269 Pfam:PF07521 EMBL:AF171877 EMBL:CH471053
GO:GO:0046872 Reactome:REACT_1675 GO:GO:0006378 GO:GO:0003723
GO:GO:0030529 GO:GO:0006406 GO:GO:0004521 GO:GO:0000398
Reactome:REACT_1788 GO:GO:0005847 GO:GO:0006379 GO:GO:0006369
eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 Reactome:REACT_78 GO:GO:0006398 HOGENOM:HOG000203394
KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
CTD:51692 HOVERGEN:HBG051107 OrthoDB:EOG4FN4H6 GO:GO:0008409
EMBL:AC080162 EMBL:BC011654 EMBL:BC020211 IPI:IPI00007818
RefSeq:NP_057291.1 UniGene:Hs.515972 PDB:2I7T PDB:2I7V PDBsum:2I7T
PDBsum:2I7V ProteinModelPortal:Q9UKF6 SMR:Q9UKF6 DIP:DIP-42501N
MINT:MINT-1742891 STRING:Q9UKF6 PhosphoSite:Q9UKF6 DMDM:18203503
PaxDb:Q9UKF6 PeptideAtlas:Q9UKF6 PRIDE:Q9UKF6 DNASU:51692
Ensembl:ENST00000238112 GeneID:51692 KEGG:hsa:51692 UCSC:uc002qzo.1
GeneCards:GC02P009514 HGNC:HGNC:2326 HPA:HPA034657 MIM:606029
neXtProt:NX_Q9UKF6 PharmGKB:PA26843 InParanoid:Q9UKF6
PhylomeDB:Q9UKF6 ChiTaRS:CPSF3 EvolutionaryTrace:Q9UKF6
GenomeRNAi:51692 NextBio:55702 ArrayExpress:Q9UKF6 Bgee:Q9UKF6
CleanEx:HS_CPSF3 Genevestigator:Q9UKF6 GermOnline:ENSG00000119203
Uniprot:Q9UKF6
Length = 684
Score = 390 (142.3 bits), Expect = 2.8e-33, P = 2.8e-33
Identities = 104/396 (26%), Positives = 202/396 (51%)
Query: 22 LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
++ G ++DCG + + P + + ID +L+SH H GALP+ +++
Sbjct: 29 ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88
Query: 80 LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
F +T+ +YR LL+ Y+ +S D L+T D++ + + + N+
Sbjct: 89 FKGRTFMTHATKAIYRW-LLS---DYVKVSNISADDMLYTETDLEESMDKIETI----NF 140
Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
H + GI + AGH+LG ++ I G ++Y D++R++++HL + + ++P
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199
Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
+LI ++ H R++RE F + + + GG L+PV + GR ELLLIL++YW
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259
Query: 255 H-SLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
H L+ PIY+ + ++ + ++++ M D I K + N F+ KH++ N +
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININ--NPFVFKHIS---NLKSM 314
Query: 313 DNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPK 371
D+ D GP +V+AS +++G S ++F W +D +N V+ GTLA+ + ++P +
Sbjct: 315 DHFDDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP--E 372
Query: 372 AVKVTMSRRVPL-VGEELIAYEEEQTRLKKEEALKA 406
+ +++PL + + I++ + E ++A
Sbjct: 373 EITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRA 408
>UNIPROTKB|F1NKW5
symbol:CPSF3 "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0003723 "RNA binding" evidence=IEA] [GO:0004521
"endoribonuclease activity" evidence=IEA] [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=IEA] [GO:0006398 "histone mRNA 3'-end processing"
evidence=IEA] [GO:0008409 "5'-3' exonuclease activity"
evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 GO:GO:0003723 GO:GO:0004521 GO:GO:0005847
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
GeneTree:ENSGT00700000104485 GO:GO:0008409 EMBL:AADN02018718
IPI:IPI00600642 Ensembl:ENSGALT00000026493 Uniprot:F1NKW5
Length = 685
Score = 390 (142.3 bits), Expect = 2.8e-33, P = 2.8e-33
Identities = 104/396 (26%), Positives = 202/396 (51%)
Query: 22 LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
++ G ++DCG + + P + + ID +L+SH H GALP+ +++
Sbjct: 29 ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88
Query: 80 LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
F +T+ +YR LL+ Y+ +S D L+T D++ + + + N+
Sbjct: 89 FKGRTFMTHATKAIYRW-LLS---DYVKVSNISADDMLYTETDLEESMDKIETI----NF 140
Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
H + GI + AGH+LG ++ I G ++Y D++R++++HL + + ++P
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199
Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
+LI ++ H R++RE F + + + GG L+PV + GR ELLLIL++YW
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259
Query: 255 H-SLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
H L+ PIY+ + ++ + ++++ M D I K + N F+ KH++ N +
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININ--NPFVFKHIS---NLKSM 314
Query: 313 DNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPK 371
D+ D GP +V+AS +++G S ++F W +D +N V+ GTLA+ + ++P +
Sbjct: 315 DHFDDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP--E 372
Query: 372 AVKVTMSRRVPL-VGEELIAYEEEQTRLKKEEALKA 406
+ +++PL + + I++ + E ++A
Sbjct: 373 EITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRA 408
>UNIPROTKB|E2R7R2
symbol:CPSF3 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718
SMART:SM01098 GeneTree:ENSGT00700000104485 EMBL:AAEX03010701
RefSeq:XP_003639652.1 Ensembl:ENSCAFT00000005417 GeneID:100856414
KEGG:cfa:100856414 Uniprot:E2R7R2
Length = 717
Score = 390 (142.3 bits), Expect = 3.3e-33, P = 3.3e-33
Identities = 104/396 (26%), Positives = 202/396 (51%)
Query: 22 LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
++ G ++DCG + + P + + ID +L+SH H GALP+ +++
Sbjct: 62 ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 121
Query: 80 LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
F +T+ +YR LL+ Y+ +S D L+T D++ + + + N+
Sbjct: 122 FKGRTFMTHATKAIYRW-LLS---DYVKVSNISADDMLYTETDLEESMDKIETI----NF 173
Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
H + GI + AGH+LG ++ I G ++Y D++R++++HL + + ++P
Sbjct: 174 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 232
Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
+LI ++ H R++RE F + + + GG L+PV + GR ELLLIL++YW
Sbjct: 233 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 292
Query: 255 H-SLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
H L+ PIY+ + ++ + ++++ M D I K + N F+ KH++ N +
Sbjct: 293 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININ--NPFVFKHIS---NLKSM 347
Query: 313 DNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPK 371
D+ D GP +V+AS +++G S ++F W +D +N V+ GTLA+ + ++P +
Sbjct: 348 DHFDDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP--E 405
Query: 372 AVKVTMSRRVPL-VGEELIAYEEEQTRLKKEEALKA 406
+ +++PL + + I++ + E ++A
Sbjct: 406 EITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRA 441
>MGI|MGI:1859328
symbol:Cpsf3 "cleavage and polyadenylation specificity
factor 3" species:10090 "Mus musculus" [GO:0003723 "RNA binding"
evidence=IDA] [GO:0003729 "mRNA binding" evidence=ISO] [GO:0004518
"nuclease activity" evidence=IEA] [GO:0004519 "endonuclease
activity" evidence=IEA] [GO:0004521 "endoribonuclease activity"
evidence=IDA] [GO:0005634 "nucleus" evidence=IEA] [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=ISO;IDA] [GO:0006378 "mRNA polyadenylation" evidence=ISO]
[GO:0006397 "mRNA processing" evidence=IEA] [GO:0006398 "histone
mRNA 3'-end processing" evidence=ISO;IDA] [GO:0008409 "5'-3'
exonuclease activity" evidence=IDA] [GO:0016787 "hydrolase
activity" evidence=IEA] [GO:0030529 "ribonucleoprotein complex"
evidence=IEA] [GO:0046872 "metal ion binding" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
MGI:MGI:1859328 GO:GO:0046872 GO:GO:0003723 GO:GO:0030529
GO:GO:0004521 GO:GO:0005847 eggNOG:COG1236 InterPro:IPR022712
InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 GO:GO:0006398
HOGENOM:HOG000203394 KO:K14403 OMA:YVSFSAH InterPro:IPR021718
Pfam:PF11718 SMART:SM01098 CTD:51692 GeneTree:ENSGT00700000104485
HOVERGEN:HBG051107 OrthoDB:EOG4FN4H6 GO:GO:0008409 ChiTaRS:CPSF3
EMBL:AF203969 EMBL:BC023297 IPI:IPI00135099 RefSeq:NP_061283.2
UniGene:Mm.356778 ProteinModelPortal:Q9QXK7 SMR:Q9QXK7
STRING:Q9QXK7 PhosphoSite:Q9QXK7 PaxDb:Q9QXK7 PRIDE:Q9QXK7
Ensembl:ENSMUST00000067284 GeneID:54451 KEGG:mmu:54451
InParanoid:Q8CIM0 NextBio:311332 Bgee:Q9QXK7 CleanEx:MM_CPSF3
Genevestigator:Q9QXK7 GermOnline:ENSMUSG00000054309 Uniprot:Q9QXK7
Length = 684
Score = 387 (141.3 bits), Expect = 6.0e-33, P = 6.0e-33
Identities = 104/396 (26%), Positives = 201/396 (50%)
Query: 22 LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
++ G ++DCG + + P + + ID +L+SH H GALP+ +++
Sbjct: 29 ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88
Query: 80 LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
F +T+ +YR LL+ Y+ +S D L+T D++ + + + N+
Sbjct: 89 FKGRTFMTHATKAIYRW-LLS---DYVKVSNISADDMLYTETDLEESMDKIETI----NF 140
Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
H + GI + AGH+LG ++ I G ++Y D++R++++HL + + ++P
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199
Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
+LI ++ H R++RE F + + + GG L+PV + GR ELLLIL++YW
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259
Query: 255 H-SLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
H L+ PIY+ + ++ + ++++ M D I K + N F+ KH++ N +
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININ--NPFVFKHIS---NLKSM 314
Query: 313 DNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPK 371
D+ D GP +V+AS ++ G S ++F W +D +N V+ GTLA+ + ++P +
Sbjct: 315 DHFDDIGPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP--E 372
Query: 372 AVKVTMSRRVPL-VGEELIAYEEEQTRLKKEEALKA 406
+ +++PL + + I++ + E ++A
Sbjct: 373 EITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRA 408
>RGD|1305767
symbol:Cpsf3 "cleavage and polyadenylation specific factor 3,
73kDa" species:10116 "Rattus norvegicus" [GO:0003723 "RNA binding"
evidence=ISO] [GO:0004521 "endoribonuclease activity" evidence=ISO]
[GO:0005847 "mRNA cleavage and polyadenylation specificity factor
complex" evidence=ISO] [GO:0006398 "histone mRNA 3'-end processing"
evidence=ISO] [GO:0008409 "5'-3' exonuclease activity"
evidence=ISO] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 RGD:1305767 GO:GO:0016787 eggNOG:COG1236
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
HOGENOM:HOG000203394 KO:K14403 InterPro:IPR021718 Pfam:PF11718
SMART:SM01098 CTD:51692 HOVERGEN:HBG051107 OrthoDB:EOG4FN4H6
UniGene:Rn.100522 EMBL:BC099817 IPI:IPI00365532
RefSeq:NP_001025201.1 ProteinModelPortal:Q499P4 SMR:Q499P4
STRING:Q499P4 GeneID:298916 KEGG:rno:298916 InParanoid:Q499P4
NextBio:644507 Genevestigator:Q499P4 Uniprot:Q499P4
Length = 685
Score = 387 (141.3 bits), Expect = 6.1e-33, P = 6.1e-33
Identities = 104/396 (26%), Positives = 201/396 (50%)
Query: 22 LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
++ G ++DCG + + P + + ID +L+SH H GALP+ +++
Sbjct: 29 ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88
Query: 80 LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
F +T+ +YR LL+ Y+ +S D L+T D++ + + + N+
Sbjct: 89 FKGRTFMTHATKAIYRW-LLS---DYVKVSNISADDMLYTETDLEESMDKIETI----NF 140
Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
H + GI + AGH+LG ++ I G ++Y D++R++++HL + + ++P
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGMKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199
Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
+LI ++ H R++RE F + + + GG L+PV + GR ELLLIL++YW
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259
Query: 255 H-SLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
H L+ PIY+ + ++ + ++++ M D I K + N F+ KH++ N +
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININ--NPFVFKHIS---NLKSM 314
Query: 313 DNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPK 371
D+ D GP +V+AS ++ G S ++F W +D +N V+ GTLA+ + ++P +
Sbjct: 315 DHFDDIGPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP--E 372
Query: 372 AVKVTMSRRVPL-VGEELIAYEEEQTRLKKEEALKA 406
+ +++PL + + I++ + E ++A
Sbjct: 373 EITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRA 408
>UNIPROTKB|G3V6W7
symbol:Cpsf3 "Protein Cpsf3" species:10116 "Rattus
norvegicus" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
RGD:1305767 GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108
Pfam:PF10996 SMART:SM01027 EMBL:CH473947 OMA:YVSFSAH
InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
GeneTree:ENSGT00700000104485 UniGene:Rn.100522
Ensembl:ENSRNOT00000009652 Uniprot:G3V6W7
Length = 685
Score = 387 (141.3 bits), Expect = 6.1e-33, P = 6.1e-33
Identities = 104/396 (26%), Positives = 201/396 (50%)
Query: 22 LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
++ G ++DCG + + P + + ID +L+SH H GALP+ +++
Sbjct: 29 ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88
Query: 80 LSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNY 135
F +T+ +YR LL+ Y+ +S D L+T D++ + + + N+
Sbjct: 89 FKGRTFMTHATKAIYRW-LLS---DYVKVSNISADDMLYTETDLEESMDKIETI----NF 140
Query: 136 HLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPA 195
H + GI + AGH+LG ++ I G ++Y D++R++++HL + + ++P
Sbjct: 141 HEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPD 199
Query: 196 VLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
+LI ++ H R++RE F + + + GG L+PV + GR ELLLIL++YW
Sbjct: 200 ILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQN 259
Query: 255 H-SLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 312
H L+ PIY+ + ++ + ++++ M D I K + N F+ KH++ N +
Sbjct: 260 HPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININ--NPFVFKHIS---NLKSM 314
Query: 313 DNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPK 371
D+ D GP +V+AS ++ G S ++F W +D +N V+ GTLA+ + ++P +
Sbjct: 315 DHFDDIGPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP--E 372
Query: 372 AVKVTMSRRVPL-VGEELIAYEEEQTRLKKEEALKA 406
+ +++PL + + I++ + E ++A
Sbjct: 373 EITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRA 408
>UNIPROTKB|G5E9W3
symbol:CPSF3 "Cleavage and polyadenylation-specificity
factor subunit 3" species:9606 "Homo sapiens" [GO:0003723 "RNA
binding" evidence=IEA] [GO:0004521 "endoribonuclease activity"
evidence=IEA] [GO:0008409 "5'-3' exonuclease activity"
evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 EMBL:CH471053 GO:GO:0003723 GO:GO:0004521
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
InterPro:IPR021718 Pfam:PF11718 SMART:SM01098 GO:GO:0008409
EMBL:AC080162 UniGene:Hs.515972 HGNC:HGNC:2326 ChiTaRS:CPSF3
ProteinModelPortal:G5E9W3 SMR:G5E9W3 PRIDE:G5E9W3
Ensembl:ENST00000460593 ArrayExpress:G5E9W3 Bgee:G5E9W3
Uniprot:G5E9W3
Length = 647
Score = 385 (140.6 bits), Expect = 7.8e-33, P = 7.8e-33
Identities = 103/387 (26%), Positives = 199/387 (51%)
Query: 31 LIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF--- 85
++DCG + + P + + ID +L+SH H GALP+ +++ F
Sbjct: 1 MLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTSFKGRTFMTH 60
Query: 86 STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGI 144
+T+ +YR LL+ Y+ +S D L+T D++ + + + N+H + GI
Sbjct: 61 ATKAIYRW-LLS---DYVKVSNISADDMLYTETDLEESMDKIETI----NFHEVKEVAGI 112
Query: 145 VVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNA 204
+ AGH+LG ++ I G ++Y D++R++++HL + + ++P +LI ++
Sbjct: 113 KFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPDILIIESTYG 171
Query: 205 LHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEH-SLN-YPI 261
H R++RE F + + + GG L+PV + GR ELLLIL++YW H L+ PI
Sbjct: 172 THIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPI 231
Query: 262 YFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPD-GPK 320
Y+ + ++ + ++++ M D I K + N F+ KH++ N +D+ D GP
Sbjct: 232 YYASSLAKKCMAVYQTYVNAMNDKIRKQININ--NPFVFKHIS---NLKSMDHFDDIGPS 286
Query: 321 LVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRR 380
+V+AS +++G S ++F W +D +N V+ GTLA+ + ++P + + ++
Sbjct: 287 VVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP--EEITTMSGQK 344
Query: 381 VPL-VGEELIAYEEEQTRLKKEEALKA 406
+PL + + I++ + E ++A
Sbjct: 345 LPLKMSVDYISFSAHTDYQQTSEFIRA 371
>DICTYBASE|DDB_G0278189
symbol:ints11 "integrator complex subunit 11"
species:44689 "Dictyostelium discoideum" [GO:0016787 "hydrolase
activity" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] [GO:0044351 "macropinocytosis"
evidence=RCA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
dictyBase:DDB_G0278189 Pfam:PF07521 GO:GO:0005634 GO:GO:0005737
GenomeReviews:CM000152_GR EMBL:AAFI02000023 GO:GO:0016787
eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 KO:K13148 RefSeq:XP_642189.1
ProteinModelPortal:Q54YL3 PRIDE:Q54YL3 EnsemblProtists:DDB0234100
GeneID:8621396 KEGG:ddi:DDB_G0278189 OMA:RTIANET
ProtClustDB:CLSZ2729107 Uniprot:Q54YL3
Length = 744
Score = 377 (137.8 bits), Expect = 2.0e-32, Sum P(2) = 2.0e-32
Identities = 104/371 (28%), Positives = 177/371 (47%)
Query: 4 SVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGW----ND--HF-DPSLLQPLSKVASTID 56
+++V PL + +V+I N + DCG ND F D S + + ID
Sbjct: 2 TIKVVPLGAGQDVGRSCVIVTIGNKNIMFDCGMHMGMNDARRFPDFSYISKNGQFTKVID 61
Query: 57 AVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFT 115
V+++H H GALP+ + G P++ T P + + + D + ++ + E + FT
Sbjct: 62 CVIITHFHLDHCGALPFFTEMCGYDGPIYMTLPTKAICPILLEDYRKITVEKKGETNFFT 121
Query: 116 LDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDY 175
I + V + Q + E + + + AGH+LG ++ E V+Y DY
Sbjct: 122 AQMIKDCMKKVIPVNLHQTIKVD---EELSIKAYYAGHVLGAAMFYAKVGDESVVYTGDY 178
Query: 176 NRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLP 234
N ++HL ++ V+P VLIT+ A + ++ RE F I + + GG VL+P
Sbjct: 179 NMTPDRHLGSAWIDQ-VKPDVLITETTYATTIRDSKRGRERDFLKRIHECVEKGGKVLIP 237
Query: 235 VDSAGRVLELLLILEDYWAEHSLNY-PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 293
V + GRV EL ++++ YW + +L + PIYF ++ Y K F+ W I ++F
Sbjct: 238 VFALGRVQELCILIDSYWEQMNLGHIPIYFSAGLAEKANLYYKLFINWTNQKIKQTFV-- 295
Query: 294 RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
+ N F KH+ +S L +AP G ++ A+ L AG S ++F +WA + N+ +
Sbjct: 296 KRNMFDFKHIKPF--QSHLVDAP-GAMVLFATPGMLHAGASLEVFKKWAPNELNMTIIPG 352
Query: 354 RGQFGTLARML 364
GT+ L
Sbjct: 353 YCVVGTVGNKL 363
Score = 52 (23.4 bits), Expect = 2.0e-32, Sum P(2) = 2.0e-32
Identities = 18/77 (23%), Positives = 34/77 (44%)
Query: 534 VLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKL 593
+LVHG E L Q +K + + Y P TI + + + + +S N+L +++
Sbjct: 422 ILVHGEKEKMGFLSQKIIKEMGVNCYYPANGVTI-IIDTMKSIPIDIS----LNLLKRQI 476
Query: 594 GDYEIAWVDAEVGKTEN 610
DY + + + N
Sbjct: 477 LDYSYQYNNNNLNNFNN 493
>DICTYBASE|DDB_G0274799
symbol:cpsf3 "cleavage and polyadenylation
specificity factor 73 kDa subunit" species:44689 "Dictyostelium
discoideum" [GO:0016787 "hydrolase activity" evidence=IEA]
[GO:0003723 "RNA binding" evidence=IEA;IC] [GO:0006379 "mRNA
cleavage" evidence=ISS] [GO:0006378 "mRNA polyadenylation"
evidence=ISS] [GO:0005847 "mRNA cleavage and polyadenylation
specificity factor complex" evidence=ISS] [GO:0003730 "mRNA 3'-UTR
binding" evidence=ISS] [GO:0046872 "metal ion binding"
evidence=IEA] [GO:0006397 "mRNA processing" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] [GO:0004519 "endonuclease
activity" evidence=IEA] [GO:0004518 "nuclease activity"
evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
dictyBase:DDB_G0274799 Pfam:PF07521 GO:GO:0046872 GO:GO:0006378
GenomeReviews:CM000151_GR EMBL:AAFI02000012 GO:GO:0003730
GO:GO:0004519 GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
RefSeq:XP_643926.1 ProteinModelPortal:Q86A79 SMR:Q86A79
STRING:Q86A79 EnsemblProtists:DDB0233696 GeneID:8619353
KEGG:ddi:DDB_G0274799 ProtClustDB:CLSZ2431003 Uniprot:Q86A79
Length = 774
Score = 384 (140.2 bits), Expect = 2.7e-32, Sum P(2) = 2.7e-32
Identities = 101/373 (27%), Positives = 181/373 (48%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPL-SKVASTI---DAVLL 60
+++TP+ L+ G + DCG + + + P + S I D +L+
Sbjct: 36 LEITPIGSGSEVGRSCVLLKYKGKKVMFDCGVHPAYSGLVSLPFFDSIESDIPDIDLLLV 95
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD--LFTLDD 118
SH H A+PY + + VF T P + + + D Y+ ++ D LF D
Sbjct: 96 SHFHLDHAAAVPYFVGKTKFKGRVFMTHPTKAIYGMLLSD-YVKVSNITRDDDMLFDKSD 154
Query: 119 IDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRR 178
+D + + + ++ Y Q + GI V AGH+LG ++ I G ++Y D++R+
Sbjct: 155 LDRSLEKIEKVRYRQKV----EHNGIKVTCFNAGHVLGAAMFMIEIAGVKILYTGDFSRQ 210
Query: 179 KEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDS 237
+++HL G V+ VLI ++ + PR +RE F ++ + + G L+PV +
Sbjct: 211 EDRHLMGAETPP-VKVDVLIIESTYGVQVHEPRLEREKRFTSSVHQVVERNGKCLIPVFA 269
Query: 238 AGRVLELLLILEDYW-AEHSLNY-PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
GR ELLLIL++YW A L++ PIY+ + ++ + ++++ M D + F+ S
Sbjct: 270 LGRAQELLLILDEYWIANPQLHHVPIYYASALAKKCMGVYRTYINMMNDRVRAQFDVS-- 327
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
N F KH+ + D+ GP + +AS L++G S +F W SD +N ++
Sbjct: 328 NPFEFKHIKNIKGIESFDDR--GPCVFMASPGMLQSGLSRQLFERWCSDKRNGIVIPGYS 385
Query: 356 QFGTLARMLQADP 368
GTLA+ + ++P
Sbjct: 386 VEGTLAKHIMSEP 398
Score = 44 (20.5 bits), Expect = 2.7e-32, Sum P(2) = 2.7e-32
Identities = 12/30 (40%), Positives = 16/30 (53%)
Query: 534 VLVHGSAEATEHLKQHCL-KHVCPHVYTPQ 562
VLVHG A L+Q + K +V TP+
Sbjct: 442 VLVHGDANEMSRLRQSLVAKFKTINVLTPK 471
>ZFIN|ZDB-GENE-050522-13
symbol:cpsf3l "cleavage and polyadenylation specific
factor 3-like" species:7955 "Danio rerio" [GO:0016787 "hydrolase
activity" evidence=IEA] [GO:0043484 "regulation of RNA splicing"
evidence=IMP] [GO:0005634 "nucleus" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
ZFIN:ZDB-GENE-050522-13 GO:GO:0016787 GO:GO:0043484
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
OMA:MAVEYMS GeneTree:ENSGT00700000104485 EMBL:CABZ01054885
EMBL:CR846089 IPI:IPI00865509 Ensembl:ENSDART00000102902
Uniprot:E7EXW1
Length = 601
Score = 373 (136.4 bits), Expect = 5.2e-32, Sum P(2) = 5.2e-32
Identities = 110/361 (30%), Positives = 175/361 (48%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG +ND F D S + ++ +D
Sbjct: 4 IKVTPLGAGQDVGRSCILVSIGGKNIMLDCGMHMGFNDDRRFPDFSYITQNGRLTEFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
V++SH H GALPY + +G P++ T P + + + D + ++ + E + FT
Sbjct: 64 VIISHFHLDHCGALPYMSEMVGYDGPIYMTHPTKAICPILLEDFRKITVDKKGETNFFTS 123
Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVD-- 174
I + V L Q + + E + + AGH+LG + + V+Y V
Sbjct: 124 QMIKDCMKKVVPLNLHQTVQVDDELE---IKAYYAGHVLGAAM---VQSRFRVVYTVSVS 177
Query: 175 --YNR--RKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGG 229
Y+ L ++ RP +LI+++ A + ++ RE F + +T+ GG
Sbjct: 178 YTYSNLMTPASDLRAAWIDK-CRPDILISESTYATTIRDSKRCRERDFLKKVHETVERGG 236
Query: 230 NVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
VL+PV + GR EL ++LE +W +L PIYF T ++ Y K F+ W I K+
Sbjct: 237 KVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKT 296
Query: 290 FETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLV 349
F R N F KH+ ++S DN P GP +V A+ L AG S IF +WA + KN+V
Sbjct: 297 F-VQR-NMFEFKHIKAF-DRSYADN-P-GPMVVFATPGMLHAGQSLQIFKKWAGNEKNMV 351
Query: 350 L 350
+
Sbjct: 352 I 352
Score = 45 (20.9 bits), Expect = 5.2e-32, Sum P(2) = 5.2e-32
Identities = 21/80 (26%), Positives = 35/80 (43%)
Query: 534 VLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKL 593
+LVHG A+ E LK + + P ET + ++ + V +S L+ + L
Sbjct: 414 LLVHGEAKKMEFLKDKIEQEFSISCFMPANGETTTIVTNP-SVPVDISLNLLKREM--AL 470
Query: 594 GDYEIAWVDAEVGKTENGML 613
G DA+ +T +G L
Sbjct: 471 GG---PLPDAKKPRTMHGTL 487
>ASPGD|ASPL0000040420
symbol:AN3082 species:162425 "Emericella nidulans"
[GO:0008150 "biological_process" evidence=ND] [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR027075 EMBL:BN001306 EMBL:AACD01000051 eggNOG:COG1236
KO:K14402 OrthoDB:EOG4WWVSN InterPro:IPR022712 InterPro:IPR025069
PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
OMA:YSQPHQP RefSeq:XP_660686.1 EnsemblFungi:CADANIAT00009996
GeneID:2874210 KEGG:ani:AN3082.2 HOGENOM:HOG000196366
Uniprot:Q5B8P8
Length = 1005
Score = 181 (68.8 bits), Expect = 2.5e-31, Sum P(6) = 2.5e-31
Identities = 53/160 (33%), Positives = 78/160 (48%)
Query: 115 TLDDIDSAFQSVTRLTYSQNYH-----LSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDV 169
T ++I F + L YSQ + S G+ + + AGH +GGT+W I E +
Sbjct: 155 TTEEIARYFALIQPLKYSQPHQPIPSPFSPPLNGLTLTAYNAGHTVGGTIWHIQHGMESI 214
Query: 170 IYAVDYNRRKEKHL-----------NGT-VLESFVRPAVLITDAYNALHNQPP--RQQR- 214
+YAVD+N+ +E + +GT V+E +P LI P R++R
Sbjct: 215 VYAVDWNQARESVVAGAAWFGGSGASGTEVIEQLRKPTALICSTRGGDKFALPGGRKKRD 274
Query: 215 EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAE 254
E+ D I TL GG VL+P D++ RVLEL LE W +
Sbjct: 275 EILLDMIRSTLVKGGTVLIPTDTSARVLELAYALEHAWRD 314
Score = 149 (57.5 bits), Expect = 2.5e-31, Sum P(6) = 2.5e-31
Identities = 39/109 (35%), Positives = 55/109 (50%)
Query: 8 TPLSGVFNE-NPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPD 64
TPL G + + S ++ +DG L+D GW+D FDP L L K ST+ +LL+H
Sbjct: 5 TPLLGAQSSASKASQSILELDGGVKILVDVGWDDTFDPLDLVELEKHVSTLSLILLTHAT 64
Query: 65 TLHLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLSRRQVSEF 111
H+GA + K L PV++T PV LG + D Y S + F
Sbjct: 65 PSHIGAYVHCCKTFPLFTQIPVYATSPVIALGRTLLQDVYESAPLAATF 113
Score = 134 (52.2 bits), Expect = 3.0e-26, Sum P(6) = 3.0e-26
Identities = 40/122 (32%), Positives = 60/122 (49%)
Query: 184 NGT-VLESFVRPAVLITDAYNALHNQPP--RQQR-EMFQDAISKTLRAGGNVLLPVDSAG 239
+GT V+E +P LI P R++R E+ D I TL GG VL+P D++
Sbjct: 240 SGTEVIEQLRKPTALICSTRGGDKFALPGGRKKRDEILLDMIRSTLVKGGTVLIPTDTSA 299
Query: 240 RVLELLLILEDYWAEHSLNYP--------IYFLTYVSSSTIDYVKSFLEWMGDSITKSFE 291
RVLEL LE W + + + +Y ++T+ +S LEWM +SI + FE
Sbjct: 300 RVLELAYALEHAWRDAARDTQDDVLKRGGLYLAGRKVNTTMRLARSMLEWMDESIVREFE 359
Query: 292 TS 293
+
Sbjct: 360 AA 361
Score = 80 (33.2 bits), Expect = 2.5e-31, Sum P(6) = 2.5e-31
Identities = 17/40 (42%), Positives = 25/40 (62%)
Query: 630 VGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKVG 668
VGDL++ADL+ + + G + EF G G L +V +RK G
Sbjct: 923 VGDLRLADLRKIMQNAGHKAEFRGEGTLLIDGFVAVRKSG 962
Score = 75 (31.5 bits), Expect = 2.5e-31, Sum P(6) = 2.5e-31
Identities = 21/59 (35%), Positives = 33/59 (55%)
Query: 298 FLLKHVTLLINKSELD---NAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 353
F KH+ + K +L+ N P PK++LAS +SL+ GF+ + A NL+L T+
Sbjct: 391 FTFKHLKTVERKKKLEQLLNDPT-PKVILASDSSLDWGFAKESLRLLAGGENNLLLLTD 448
Score = 69 (29.3 bits), Expect = 2.5e-31, Sum P(6) = 2.5e-31
Identities = 13/36 (36%), Positives = 22/36 (61%)
Query: 468 MFPFYENNSEWDDFGEVINPDDYIIKDE----DMDQ 499
MFP+ + D++GE+I P++Y+ +E DM Q
Sbjct: 616 MFPYVAPRKKGDEYGEIIRPEEYLRAEEREEIDMQQ 651
Score = 52 (23.4 bits), Expect = 2.5e-31, Sum P(6) = 2.5e-31
Identities = 11/28 (39%), Positives = 17/28 (60%)
Query: 558 VYTPQIEETIDVTSDLCAYKVQLSEKLM 585
++TP E ID + D A+ V+LS L+
Sbjct: 800 IFTPTNGEIIDASVDTSAWTVKLSNNLV 827
Score = 37 (18.1 bits), Expect = 5.9e-13, Sum P(5) = 5.9e-13
Identities = 13/44 (29%), Positives = 19/44 (43%)
Query: 199 TDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVL 242
TD++ Q E QD ++ + G +L V S GR L
Sbjct: 460 TDSHRRTLGSMIWQWYEERQDGVALEKGSDGEMLEQVHSGGREL 503
>WB|WBGene00013460 [details] [associations]
symbol:cpsf-3 species:6239 "Caenorhabditis elegans"
[GO:0016787 "hydrolase activity" evidence=IEA] [GO:0009792 "embryo
development ending in birth or egg hatching" evidence=IMP]
[GO:0040007 "growth" evidence=IMP] [GO:0002119 "nematode larval
development" evidence=IMP] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 GO:GO:0009792 GO:GO:0040007
GO:GO:0002119 GO:GO:0016787 eggNOG:COG1236 InterPro:IPR022712
InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 HOGENOM:HOG000203394
KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
GeneTree:ENSGT00700000104485 EMBL:AL132951 RefSeq:NP_502553.2
ProteinModelPortal:Q95PY8 SMR:Q95PY8 STRING:Q95PY8 PaxDb:Q95PY8
EnsemblMetazoa:Y67H2A.1.1 EnsemblMetazoa:Y67H2A.1.2 GeneID:178285
KEGG:cel:CELE_Y67H2A.1 UCSC:Y67H2A.1 CTD:178285 WormBase:Y67H2A.1
InParanoid:Q95PY8 NextBio:900506 Uniprot:Q95PY8
Length = 707
Score = 366 (133.9 bits), Expect = 1.6e-30, P = 1.6e-30
Identities = 100/373 (26%), Positives = 174/373 (46%)
Query: 4 SVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVAS--TIDAVLLS 61
S+ TPL +L+ G ++DCG + P ID +L++
Sbjct: 10 SLCFTPLGSGQEVGRSCHLLEYKGKRVMLDCGVHPGLHGVDALPFVDFVEIENIDLLLIT 69
Query: 62 HPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDD 118
H H GALP+ +++ F +T+ +YR+ LL Y + L+T DD
Sbjct: 70 HFHLDHCGALPWLLQKTAFQGKCFMTHATKAIYRM-LLGDYVRISKYGGPDRNQLYTEDD 128
Query: 119 IDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRR 178
++ + + + + + ++G I P+VAGH+LG + I G V+Y D++
Sbjct: 129 LEKSMAKIETIDFREQKEVNG----IRFWPYVAGHVLGACQFMIEIAGVRVLYTGDFSCL 184
Query: 179 KEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDS 237
+++HL + + P VLIT++ R RE F + + GG L+P +
Sbjct: 185 EDRHLCAAEIPP-ITPQVLITESTYGTQTHEDRAVREKRFTQMVHDIVTRGGRCLIPAFA 243
Query: 238 AGRVLELLLILEDYWAEHSL--NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
G EL+LIL++YW H + P+Y+ + ++ + ++F+ M I K
Sbjct: 244 IGPAQELMLILDEYWESHQELHDIPVYYASSLAKKCMSVYQTFVNGMNSRIQKQIAVK-- 301
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
N F+ KHV+ L + ++A GP +VLA+ L++GFS ++F W D KN +
Sbjct: 302 NPFIFKHVSTLRGMDQFEDA--GPCVVLATPGMLQSGFSRELFESWCPDTKNGCIIAGYC 359
Query: 356 QFGTLARMLQADP 368
GTLA+ + ++P
Sbjct: 360 VEGTLAKHILSEP 372
>TAIR|locus:2065368 [details] [associations]
symbol:CPSF73-II "AT2G01730" species:3702 "Arabidopsis
thaliana" [GO:0003824 "catalytic activity" evidence=ISS]
[GO:0008152 "metabolic process" evidence=ISS] [GO:0016787
"hydrolase activity" evidence=IEA] [GO:0010197 "polar nucleus
fusion" evidence=IMP] [GO:0005515 "protein binding" evidence=IPI]
[GO:0005634 "nucleus" evidence=IDA] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 GO:GO:0005634 EMBL:CP002685
GenomeReviews:CT485783_GR GO:GO:0006397 GO:GO:0090305 EMBL:AC006069
GO:GO:0004518 GO:GO:0010197 eggNOG:COG1236 InterPro:IPR022712
InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 EMBL:AY168923
EMBL:AK221561 IPI:IPI00536069 PIR:D84428 RefSeq:NP_178282.2
UniGene:At.42473 ProteinModelPortal:Q8GUU3 SMR:Q8GUU3 IntAct:Q8GUU3
STRING:Q8GUU3 PaxDb:Q8GUU3 PRIDE:Q8GUU3 EnsemblPlants:AT2G01730.1
GeneID:814702 KEGG:ath:AT2G01730 TAIR:At2g01730
HOGENOM:HOG000231294 InParanoid:Q56XW2 KO:K13148 OMA:MAVEYMS
Genevestigator:Q8GUU3 Uniprot:Q8GUU3
Length = 613
Score = 354 (129.7 bits), Expect = 6.1e-30, Sum P(2) = 6.1e-30
Identities = 102/360 (28%), Positives = 168/360 (46%)
Query: 22 LVSIDGFNFLIDCGWN----DHFD-P--SLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
+V+I+G + DCG + DH P SL+ + I ++++H H+GALPY
Sbjct: 20 VVTINGKKIMFDCGMHMGCDDHNRYPNFSLISKSGDFDNAISCIIITHFHMDHVGALPYF 79
Query: 75 MKQLGLSAPVFSTEPVYRLGLLTMYDQ---YLSRRQVSEFDLFTLDDIDSAFQSVTRLTY 131
+ G + P++ + P L L + D + RR E +LFT I + + V +
Sbjct: 80 TEVCGYNGPIYMSYPTKALSPLMLEDYRRVMVDRR--GEEELFTTTHIANCMKKVIAIDL 137
Query: 132 SQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED-VIYAVDYNRRKEKHLNGTVLES 190
Q + E + + + AGH+LG V K G+ ++Y DYN ++HL ++
Sbjct: 138 KQTIQVD---EDLQIRAYYAGHVLGA-VMVYAKMGDAAIVYTGDYNMTTDRHLGAAKIDR 193
Query: 191 FVRPAVLITDAY-NALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILE 249
++ Y + ++RE Q A+ K + GG L+P + GR EL ++L+
Sbjct: 194 LQLDLLISESTYATTIRGSKYPREREFLQ-AVHKCVAGGGKALIPSFALGRAQELCMLLD 252
Query: 250 DYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINK 309
DYW ++ PIYF + ++ Y K + W ++ + T N F K+V ++
Sbjct: 253 DYWERMNIKVPIYFSSGLTIQANMYYKMLISWTSQNVKEKHNTH--NPFDFKNVKDF-DR 309
Query: 310 SELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPP 369
S L +AP GP ++ A+ L AGFS ++F WA NLV GT+ L A P
Sbjct: 310 S-LIHAP-GPCVLFATPGMLCAGFSLEVFKHWAPSPLNLVALPGYSVAGTVGHKLMAGKP 367
Score = 48 (22.0 bits), Expect = 6.1e-30, Sum P(2) = 6.1e-30
Identities = 24/92 (26%), Positives = 38/92 (41%)
Query: 525 SKVVSNELTVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSE-- 582
+K +S + VLVHG + LK+ + + P ET+ S K S+
Sbjct: 402 TKFLSPKNVVLVHGEKPSMMILKEKITSELDIPCFVPANGETVSFASTTYI-KANASDMF 460
Query: 583 -KLMSNVLFKKLGDYEIAWVDAEVGKTENGML 613
K SN FK ++ D +T +G+L
Sbjct: 461 LKSCSNPNFKFSNSTQLRVTDH---RTADGVL 489
>CGD|CAL0005344 [details] [associations]
symbol:orf19.5486 species:5476 "Candida albicans" [GO:0042493
"response to drug" evidence=IMP] [GO:0004521 "endoribonuclease
activity" evidence=IEA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IEA]
[GO:0005829 "cytosol" evidence=IEA] [GO:0034247 "snoRNA splicing"
evidence=IEA] [GO:0031126 "snoRNA 3'-end processing" evidence=IEA]
[GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006369 "termination
of RNA polymerase II transcription" evidence=IEA] [GO:0006378 "mRNA
polyadenylation" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 CGD:CAL0005344 Pfam:PF07521 GO:GO:0005634
GO:GO:0042493 GO:GO:0046872 GO:GO:0006397 GO:GO:0090305
GO:GO:0004519 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
Pfam:PF10996 SMART:SM01027 KO:K14403 InterPro:IPR021718
Pfam:PF11718 SMART:SM01098 EMBL:AACQ01000196 EMBL:AACQ01000195
RefSeq:XP_711478.1 RefSeq:XP_711502.1 ProteinModelPortal:Q59P50
STRING:Q59P50 GeneID:3646887 GeneID:3646911 KEGG:cal:CaO19.12941
KEGG:cal:CaO19.5486 Uniprot:Q59P50
Length = 870
Score = 346 (126.9 bits), Expect = 4.4e-28, P = 4.4e-28
Identities = 102/355 (28%), Positives = 179/355 (50%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLS-RRQV 108
S +D +L+SH H +LPY M+Q VF +T+ +YR L+ + + S
Sbjct: 150 SKVDILLISHFHVDHSASLPYVMQQSNFRGKVFMTHATKAIYRW-LMQDFVRVTSIGNSR 208
Query: 109 SE-------FDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWK 161
SE +L+T DDI +F + + +YH + + +GI + AGH+LG ++
Sbjct: 209 SEDGGGGEGSNLYTDDDIMKSFDRIETI----DYHSTMEIDGIRFTAYHAGHVLGACMYF 264
Query: 162 ITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDA 220
I G V++ DY+R + +HL+ + ++P +LI+++ PR + E
Sbjct: 265 IEIGGLKVLFTGDYSREENRHLHAAEVPP-LKPDILISESTFGTGTLEPRIELERKLTTH 323
Query: 221 ISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSF 278
I T+ GG VLLPV + G ELLLIL++YW+++ N +++ + ++ + +++
Sbjct: 324 IHATIAKGGRVLLPVFALGNAQELLLILDEYWSQNEDLQNVNVFYASNLAKKCMAVYETY 383
Query: 279 LEWMGDSITKSFETS-RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
M D I S +S + N F K++ + + S+ + GP +V+A+ L+AG S +
Sbjct: 384 TGIMNDKIRLSSASSEKSNPFDFKYIKSIKDLSKFQDM--GPSVVVATPGMLQAGVSRQL 441
Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPL-VGEELIAY 391
+WA D KNLV+ T GT+A+ L +P +P +G E I++
Sbjct: 442 LEKWAPDGKNLVILTGYSVEGTMAKELLKEPTMIQSATNPDMTIPRRIGIEEISF 496
>UNIPROTKB|Q59P50 [details] [associations]
symbol:YSH1 "Endoribonuclease YSH1" species:237561 "Candida
albicans SC5314" [GO:0042493 "response to drug" evidence=IMP]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 CGD:CAL0005344
Pfam:PF07521 GO:GO:0005634 GO:GO:0042493 GO:GO:0046872
GO:GO:0006397 GO:GO:0090305 GO:GO:0004519 eggNOG:COG1236
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
KO:K14403 InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
EMBL:AACQ01000196 EMBL:AACQ01000195 RefSeq:XP_711478.1
RefSeq:XP_711502.1 ProteinModelPortal:Q59P50 STRING:Q59P50
GeneID:3646887 GeneID:3646911 KEGG:cal:CaO19.12941
KEGG:cal:CaO19.5486 Uniprot:Q59P50
Length = 870
Score = 346 (126.9 bits), Expect = 4.4e-28, P = 4.4e-28
Identities = 102/355 (28%), Positives = 179/355 (50%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLS-RRQV 108
S +D +L+SH H +LPY M+Q VF +T+ +YR L+ + + S
Sbjct: 150 SKVDILLISHFHVDHSASLPYVMQQSNFRGKVFMTHATKAIYRW-LMQDFVRVTSIGNSR 208
Query: 109 SE-------FDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWK 161
SE +L+T DDI +F + + +YH + + +GI + AGH+LG ++
Sbjct: 209 SEDGGGGEGSNLYTDDDIMKSFDRIETI----DYHSTMEIDGIRFTAYHAGHVLGACMYF 264
Query: 162 ITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDA 220
I G V++ DY+R + +HL+ + ++P +LI+++ PR + E
Sbjct: 265 IEIGGLKVLFTGDYSREENRHLHAAEVPP-LKPDILISESTFGTGTLEPRIELERKLTTH 323
Query: 221 ISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSF 278
I T+ GG VLLPV + G ELLLIL++YW+++ N +++ + ++ + +++
Sbjct: 324 IHATIAKGGRVLLPVFALGNAQELLLILDEYWSQNEDLQNVNVFYASNLAKKCMAVYETY 383
Query: 279 LEWMGDSITKSFETS-RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI 337
M D I S +S + N F K++ + + S+ + GP +V+A+ L+AG S +
Sbjct: 384 TGIMNDKIRLSSASSEKSNPFDFKYIKSIKDLSKFQDM--GPSVVVATPGMLQAGVSRQL 441
Query: 338 FVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPL-VGEELIAY 391
+WA D KNLV+ T GT+A+ L +P +P +G E I++
Sbjct: 442 LEKWAPDGKNLVILTGYSVEGTMAKELLKEPTMIQSATNPDMTIPRRIGIEEISF 496
>ASPGD|ASPL0000060573 [details] [associations]
symbol:AN0990 species:162425 "Emericella nidulans"
[GO:0008150 "biological_process" evidence=ND] [GO:0016787
"hydrolase activity" evidence=IEA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IEA]
[GO:0005829 "cytosol" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 GO:GO:0005634 EMBL:BN001308
GO:GO:0046872 GO:GO:0006397 GO:GO:0090305 GO:GO:0004519
eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 HOGENOM:HOG000203394 KO:K14403 InterPro:IPR021718
Pfam:PF11718 SMART:SM01098 EMBL:AACD01000015 RefSeq:XP_658594.1
ProteinModelPortal:Q5BEP0 STRING:Q5BEP0
EnsemblFungi:CADANIAT00001661 GeneID:2876766 KEGG:ani:AN0990.2
OMA:EISFAAH OrthoDB:EOG41ZJK7 Uniprot:Q5BEP0
Length = 884
Score = 348 (127.6 bits), Expect = 6.8e-28, Sum P(2) = 6.8e-28
Identities = 103/363 (28%), Positives = 173/363 (47%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
ST+D +L+SH H ALPY + + VF T + + D S D
Sbjct: 74 STVDILLISHFHVDHSSALPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVNNTASSSD 133
Query: 113 ----LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 168
L+T D S + + ++ + ++ I + P+ AGH+LG ++ I+ G +
Sbjct: 134 QRTTLYTEHDHLSTLPLIETIDFNTTHTINS----IRITPYPAGHVLGAAMFLISIAGLN 189
Query: 169 VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRA 227
+++ DY+R +++HL + V+ VLIT++ + + PPR +RE +I+ L
Sbjct: 190 ILFTGDYSREEDRHLIPATVPRGVKIDVLITESTFGISSNPPRLEREAALMKSITGVLNR 249
Query: 228 GGNVLLPVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDS 285
GG VL+PV + GR ELLLILE+YW H PIY++ + + ++++ M D+
Sbjct: 250 GGRVLMPVFALGRAQELLLILEEYWETHPELQKIPIYYIGNTARRCMVVYQTYIGAMNDN 309
Query: 286 ITKSF-------ETSRDNAFL-----LKHVTLLINKSELDNAPDGPKLVLASMASLEAGF 333
I + F E S D + K+V L + D+ G ++LAS L+ G
Sbjct: 310 IKRLFRQRMAEAEASGDKSVSAGPWDFKYVRSLRSLERFDDV--GGCVMLASPGMLQTGT 367
Query: 334 SHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGE-ELIAYE 392
S ++ WA + +N V+ T GT+A+ L +P + MSR +G + +
Sbjct: 368 SRELLERWAPNERNGVVMTGYSVEGTMAKQLLNEPDQ--IHAVMSRAATGMGRTRMNGND 425
Query: 393 EEQ 395
EEQ
Sbjct: 426 EEQ 428
Score = 44 (20.5 bits), Expect = 6.8e-28, Sum P(2) = 6.8e-28
Identities = 15/39 (38%), Positives = 17/39 (43%)
Query: 528 VSNELTVLVHGSAEATEHLKQHCL-----KHVCPHVYTP 561
VS + +LVHG LK L K V VYTP
Sbjct: 459 VSAPVVILVHGEKHQMMRLKSKLLSLNAEKTVKVKVYTP 497
>GENEDB_PFALCIPARUM|PFC0825c [details] [associations]
symbol:PFC0825c "cleavage and polyadenylation
specificity factor protein, putative" species:5833 "Plasmodium
falciparum" [GO:0006379 "mRNA cleavage" evidence=ISS] [GO:0006378
"mRNA polyadenylation" evidence=ISS] [GO:0003729 "mRNA binding"
evidence=ISS] [GO:0005847 "mRNA cleavage and polyadenylation
specificity factor complex" evidence=ISS] InterPro:IPR001279
SMART:SM00849 Pfam:PF07521 GO:GO:0003729 GO:GO:0016787
EMBL:AL844502 GO:GO:0005847 InterPro:IPR022712 InterPro:IPR011108
Pfam:PF10996 SMART:SM01027 KO:K13148 PIR:T18488
RefSeq:XP_001351256.1 ProteinModelPortal:O77371 PRIDE:O77371
EnsemblProtists:PFC0825c:mRNA GeneID:814500 KEGG:pfa:PFC0825c
EuPathDB:PlasmoDB:PF3D7_0318600 HOGENOM:HOG000283200
ProtClustDB:CLSZ2433497 Uniprot:O77371
Length = 1017
Score = 280 (103.6 bits), Expect = 7.7e-23, Sum P(2) = 7.7e-23
Identities = 69/249 (27%), Positives = 127/249 (51%)
Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
D+I + V L ++ + L G+ + + P+ AGH+LG ++KI VIY DYN
Sbjct: 261 DNIYNCIDKVIGLQINETFEL---GD-MSITPYYAGHVLGACIYKIEVRNFSVIYTGDYN 316
Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
+KHL + S + P + I+++ A + +P ++ E+ + + + + GG VL+PV
Sbjct: 317 TIPDKHLGSANIPS-LNPEIFISESTYATYVRPTKKASELELCNLVHECVHKGGKVLIPV 375
Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
+ GR EL ++L+DYW + ++YPIYF ++ + Y K + W+ S + ++
Sbjct: 376 FAIGRAQELSILLDDYWKKMKIHYPIYFGCGLTENANKYYKIYSSWINSSCMSN---EKE 432
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
N F +++ +N + L+ P ++ A+ L G S F WA + +NL++
Sbjct: 433 NLFDFANISPFLN-NYLNEKR--PMVLFATPGMLHTGLSLKAFKAWAGNPQNLIVLPGYC 489
Query: 356 QFGTLARML 364
GT+ L
Sbjct: 490 VQGTVGHKL 498
Score = 70 (29.7 bits), Expect = 7.7e-23, Sum P(2) = 7.7e-23
Identities = 16/57 (28%), Positives = 28/57 (49%)
Query: 44 LLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD 100
L+ L ++ ID V++SH H+GALP+ + L + + P L + + D
Sbjct: 159 LINNLKRINEIIDCVIISHFHMDHIGALPFFTEILKYRGIILMSYPTKALSPILLLD 215
>UNIPROTKB|O77371 [details] [associations]
symbol:PFC0825c "Cleavage and polyadenylation specificity
factor protein, putative" species:36329 "Plasmodium falciparum 3D7"
[GO:0003729 "mRNA binding" evidence=ISS] [GO:0005847 "mRNA cleavage
and polyadenylation specificity factor complex" evidence=ISS]
InterPro:IPR001279 SMART:SM00849 Pfam:PF07521 GO:GO:0003729
GO:GO:0016787 EMBL:AL844502 GO:GO:0005847 InterPro:IPR022712
InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 KO:K13148 PIR:T18488
RefSeq:XP_001351256.1 ProteinModelPortal:O77371 PRIDE:O77371
EnsemblProtists:PFC0825c:mRNA GeneID:814500 KEGG:pfa:PFC0825c
EuPathDB:PlasmoDB:PF3D7_0318600 HOGENOM:HOG000283200
ProtClustDB:CLSZ2433497 Uniprot:O77371
Length = 1017
Score = 280 (103.6 bits), Expect = 7.7e-23, Sum P(2) = 7.7e-23
Identities = 69/249 (27%), Positives = 127/249 (51%)
Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
D+I + V L ++ + L G+ + + P+ AGH+LG ++KI VIY DYN
Sbjct: 261 DNIYNCIDKVIGLQINETFEL---GD-MSITPYYAGHVLGACIYKIEVRNFSVIYTGDYN 316
Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 235
+KHL + S + P + I+++ A + +P ++ E+ + + + + GG VL+PV
Sbjct: 317 TIPDKHLGSANIPS-LNPEIFISESTYATYVRPTKKASELELCNLVHECVHKGGKVLIPV 375
Query: 236 DSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRD 295
+ GR EL ++L+DYW + ++YPIYF ++ + Y K + W+ S + ++
Sbjct: 376 FAIGRAQELSILLDDYWKKMKIHYPIYFGCGLTENANKYYKIYSSWINSSCMSN---EKE 432
Query: 296 NAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERG 355
N F +++ +N + L+ P ++ A+ L G S F WA + +NL++
Sbjct: 433 NLFDFANISPFLN-NYLNEKR--PMVLFATPGMLHTGLSLKAFKAWAGNPQNLIVLPGYC 489
Query: 356 QFGTLARML 364
GT+ L
Sbjct: 490 VQGTVGHKL 498
Score = 70 (29.7 bits), Expect = 7.7e-23, Sum P(2) = 7.7e-23
Identities = 16/57 (28%), Positives = 28/57 (49%)
Query: 44 LLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD 100
L+ L ++ ID V++SH H+GALP+ + L + + P L + + D
Sbjct: 159 LINNLKRINEIIDCVIISHFHMDHIGALPFFTEILKYRGIILMSYPTKALSPILLLD 215
>GENEDB_PFALCIPARUM|PF14_0364 [details] [associations]
symbol:PF14_0364 "cleavage and polyadenylation
specificity factor protein, putative" species:5833 "Plasmodium
falciparum" [GO:0005847 "mRNA cleavage and polyadenylation
specificity factor complex" evidence=ISS] [GO:0006378 "mRNA
polyadenylation" evidence=ISS] [GO:0006379 "mRNA cleavage"
evidence=ISS] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 GO:GO:0016787 EMBL:AE014187 GO:GO:0005847
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
HOGENOM:HOG000203394 KO:K14403 InterPro:IPR021718 Pfam:PF11718
SMART:SM01098 RefSeq:XP_001348538.1 ProteinModelPortal:Q8IL83
PRIDE:Q8IL83 EnsemblProtists:PF14_0364:mRNA GeneID:811946
KEGG:pfa:PF14_0364 EuPathDB:PlasmoDB:PF3D7_1438500 OMA:CLITHFH
ProtClustDB:CLSZ2457730 Uniprot:Q8IL83
Length = 876
Score = 256 (95.2 bits), Expect = 1.5e-21, Sum P(2) = 1.5e-21
Identities = 70/262 (26%), Positives = 133/262 (50%)
Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
L+ +DID + L + QN+ + + AGH++G ++ + + +Y
Sbjct: 167 LYDENDIDKTMDLIETLNFHQNFEFPN----VKFTAYRAGHVIGACMFLVEINNIRFLYT 222
Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNV 231
DY+R ++H+ + + + VLI + + R++RE+ F + ++ + G V
Sbjct: 223 GDYSREIDRHIPIAEIPN-IDVHVLICEGTYGIKVHDDRKKREIRFLNILTSMINNKGKV 281
Query: 232 LLPVDSAGRVLELLLILEDYWAE--HSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
LLPV + GR ELLLILE++W + H N PI++++ +++ ++ ++F+ G+ + K
Sbjct: 282 LLPVFALGRAQELLLILEEHWDKNKHLQNIPIFYISSMATKSLCIYETFINLCGEFVKKV 341
Query: 290 FETSRDNAFLLKHVTLLINKSELDN---APDGPKLVLASMASLEAGFSHDIFVEWASDVK 346
+ N F K+V + + + + P +++AS L+ G S +IF ASD K
Sbjct: 342 VNEGK-NPFNFKYVKYAKSLESISSYLYQDNNPCVIMASPGMLQNGISKNIFNIIASDKK 400
Query: 347 NLVLFTERGQFGTLARMLQADP 368
+ V+ T GTLA L+ +P
Sbjct: 401 SGVILTGYTVKGTLADELKTEP 422
Score = 81 (33.6 bits), Expect = 1.5e-21, Sum P(2) = 1.5e-21
Identities = 23/102 (22%), Positives = 44/102 (43%)
Query: 3 TSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLL 60
+++ + L G ++ D + ++DCG + F P+ S +D L+
Sbjct: 2 SNINIVCLGGASEVGRSCVIIECDKTSVMLDCGIHPAFMGIGCLPIYDAYDISKVDLCLI 61
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQY 102
+H H GALPY + + +F TE + L +++ Y
Sbjct: 62 THFHMDHSGALPYLINKTRFKGRIFMTEATKSICYL-LWNDY 102
>UNIPROTKB|Q8IL83 [details] [associations]
symbol:PF14_0364 "Cleavage and polyadenylation specificity
factor protein, putative" species:36329 "Plasmodium falciparum 3D7"
[GO:0005847 "mRNA cleavage and polyadenylation specificity factor
complex" evidence=ISS] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 GO:GO:0016787 EMBL:AE014187
GO:GO:0005847 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 HOGENOM:HOG000203394 KO:K14403 InterPro:IPR021718
Pfam:PF11718 SMART:SM01098 RefSeq:XP_001348538.1
ProteinModelPortal:Q8IL83 PRIDE:Q8IL83
EnsemblProtists:PF14_0364:mRNA GeneID:811946 KEGG:pfa:PF14_0364
EuPathDB:PlasmoDB:PF3D7_1438500 OMA:CLITHFH ProtClustDB:CLSZ2457730
Uniprot:Q8IL83
Length = 876
Score = 256 (95.2 bits), Expect = 1.5e-21, Sum P(2) = 1.5e-21
Identities = 70/262 (26%), Positives = 133/262 (50%)
Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
L+ +DID + L + QN+ + + AGH++G ++ + + +Y
Sbjct: 167 LYDENDIDKTMDLIETLNFHQNFEFPN----VKFTAYRAGHVIGACMFLVEINNIRFLYT 222
Query: 173 VDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNV 231
DY+R ++H+ + + + VLI + + R++RE+ F + ++ + G V
Sbjct: 223 GDYSREIDRHIPIAEIPN-IDVHVLICEGTYGIKVHDDRKKREIRFLNILTSMINNKGKV 281
Query: 232 LLPVDSAGRVLELLLILEDYWAE--HSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
LLPV + GR ELLLILE++W + H N PI++++ +++ ++ ++F+ G+ + K
Sbjct: 282 LLPVFALGRAQELLLILEEHWDKNKHLQNIPIFYISSMATKSLCIYETFINLCGEFVKKV 341
Query: 290 FETSRDNAFLLKHVTLLINKSELDN---APDGPKLVLASMASLEAGFSHDIFVEWASDVK 346
+ N F K+V + + + + P +++AS L+ G S +IF ASD K
Sbjct: 342 VNEGK-NPFNFKYVKYAKSLESISSYLYQDNNPCVIMASPGMLQNGISKNIFNIIASDKK 400
Query: 347 NLVLFTERGQFGTLARMLQADP 368
+ V+ T GTLA L+ +P
Sbjct: 401 SGVILTGYTVKGTLADELKTEP 422
Score = 81 (33.6 bits), Expect = 1.5e-21, Sum P(2) = 1.5e-21
Identities = 23/102 (22%), Positives = 44/102 (43%)
Query: 3 TSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLL 60
+++ + L G ++ D + ++DCG + F P+ S +D L+
Sbjct: 2 SNINIVCLGGASEVGRSCVIIECDKTSVMLDCGIHPAFMGIGCLPIYDAYDISKVDLCLI 61
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQY 102
+H H GALPY + + +F TE + L +++ Y
Sbjct: 62 THFHMDHSGALPYLINKTRFKGRIFMTEATKSICYL-LWNDY 102
>UNIPROTKB|C9J979 [details] [associations]
symbol:CPSF3L "Integrator complex subunit 11" species:9606
"Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 GO:GO:0016787
EMBL:AL139287 InterPro:IPR022712 Pfam:PF10996 HOGENOM:HOG000231294
HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI00514808
ProteinModelPortal:C9J979 SMR:C9J979 STRING:C9J979
Ensembl:ENST00000434694 ArrayExpress:C9J979 Bgee:C9J979
Uniprot:C9J979
Length = 344
Score = 178 (67.7 bits), Expect = 3.9e-20, Sum P(2) = 3.9e-20
Identities = 41/112 (36%), Positives = 61/112 (54%)
Query: 193 RPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDY 251
RP +LIT++ A + ++ RE F + +T+ GG VL+PV + GR EL ++LE +
Sbjct: 226 RPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLETF 285
Query: 252 WAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHV 303
W +L PIYF T ++ Y K F+ W I K+F R N F KH+
Sbjct: 286 WERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF-VQR-NMFEFKHI 335
Score = 134 (52.2 bits), Expect = 3.9e-20, Sum P(2) = 3.9e-20
Identities = 40/145 (27%), Positives = 67/145 (46%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG +ND F D S + ++ +D
Sbjct: 4 IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
V++SH H GALPY + +G P++ T P + + + D + ++ + E + FT
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123
Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKG 141
I + V + Q + G
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVRFPG 148
>UNIPROTKB|E9PNS4 [details] [associations]
symbol:CPSF3L "Integrator complex subunit 11" species:9606
"Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 GO:GO:0016787
EMBL:AL139287 HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI00984775
ProteinModelPortal:E9PNS4 SMR:E9PNS4 Ensembl:ENST00000528879
ArrayExpress:E9PNS4 Bgee:E9PNS4 Uniprot:E9PNS4
Length = 278
Score = 236 (88.1 bits), Expect = 8.2e-19, P = 8.2e-19
Identities = 66/234 (28%), Positives = 114/234 (48%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG +ND F D S + ++ +D
Sbjct: 4 IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTL 116
V++SH H GALPY + +G P++ T P + + + D + ++ + E + FT
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTS 123
Query: 117 DDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 176
I + V + Q + + E + + AGH+LG +++I E V+Y DYN
Sbjct: 124 QMIKDCMKKVVAVHLHQTVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYN 180
Query: 177 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGG 229
++HL ++ RP +LIT++ A + ++ RE F + +T+ GG
Sbjct: 181 MTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGG 233
>UNIPROTKB|G3V3T7 [details] [associations]
symbol:CPSF2 "Cleavage and polyadenylation-specificity
factor subunit 2" species:9606 "Homo sapiens" [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=IEA] [GO:0006378 "mRNA polyadenylation" evidence=IEA]
[GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0016787 "hydrolase
activity" evidence=IEA] InterPro:IPR001279 InterPro:IPR027075
Pfam:PF00753 GO:GO:0016787 PANTHER:PTHR11203:SF5 HGNC:HGNC:2325
ChiTaRS:CPSF2 EMBL:AL121773 ProteinModelPortal:G3V3T7 SMR:G3V3T7
Ensembl:ENST00000553427 ArrayExpress:G3V3T7 Bgee:G3V3T7
Uniprot:G3V3T7
Length = 80
Score = 236 (88.1 bits), Expect = 8.2e-19, P = 8.2e-19
Identities = 44/80 (55%), Positives = 58/80 (72%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++T LSGV E+ L YL+ +D F FL+DCGW++HF ++ L K IDAVLL
Sbjct: 1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGL 80
SHPD LHLGALPYA+ +LGL
Sbjct: 61 SHPDPLHLGALPYAVGKLGL 80
>UNIPROTKB|F1SD84 [details] [associations]
symbol:LOC100625560 "Uncharacterized protein" species:9823
"Sus scrofa" [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006378
"mRNA polyadenylation" evidence=IEA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IEA]
InterPro:IPR027075 Pfam:PF07521 InterPro:IPR025069
InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF13299
GeneTree:ENSGT00700000104551 EMBL:CU468363
Ensembl:ENSSSCT00000002718 OMA:VEGCASE Uniprot:F1SD84
Length = 304
Score = 151 (58.2 bits), Expect = 4.1e-18, Sum P(2) = 4.1e-18
Identities = 37/104 (35%), Positives = 57/104 (54%)
Query: 609 ENGMLS-LLPISTPAPPHK-----SVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYV 662
E G S ++P P PPH+ SV + + +++D K L +GIQ EF GG L C V
Sbjct: 211 ETGEESEIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQV 270
Query: 663 TIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 706
+R+ + T +I +EG LC+D+Y+IR LY Q+ ++
Sbjct: 271 AVRR----------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 304
Score = 142 (55.0 bits), Expect = 4.1e-18, Sum P(2) = 4.1e-18
Identities = 37/115 (32%), Positives = 64/115 (55%)
Query: 508 DGKLDEGSASLILDA-KPSKVVSNELTVLVHGSAEATEHLKQHCL----KHVCPHVYTPQ 562
+G+ D S I++ KP +++ +VHG EA++ L + C K + VY P+
Sbjct: 63 EGRSDGDSIKKIINQMKPRQLI------IVHGPPEASQDLAECCRAFGGKDI--KVYMPK 114
Query: 563 IEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVGKTENGML 613
+ ET+D TS+ Y+V+L + L+S++ F K D E+AW+D V K + G++
Sbjct: 115 LHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDMRVSKVDTGVI 169
>UNIPROTKB|H0YJF4 [details] [associations]
symbol:CPSF2 "Cleavage and polyadenylation-specificity
factor subunit 2" species:9606 "Homo sapiens" [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=IEA] [GO:0006378 "mRNA polyadenylation" evidence=IEA]
[GO:0006379 "mRNA cleavage" evidence=IEA] InterPro:IPR027075
Pfam:PF07521 InterPro:IPR025069 InterPro:IPR011108
PANTHER:PTHR11203:SF5 Pfam:PF13299 HGNC:HGNC:2325 ChiTaRS:CPSF2
EMBL:AL121773 Ensembl:ENST00000555244 Uniprot:H0YJF4
Length = 269
Score = 172 (65.6 bits), Expect = 2.2e-17, Sum P(2) = 2.2e-17
Identities = 49/155 (31%), Positives = 78/155 (50%)
Query: 467 PMFPFYENNSEWDDFGEVINPDDYIIKDEDMDQAAMHIGGDDGKLDEGSASLILDAKPSK 526
PMFP E +WD++GE+I I E G DG + +I KP +
Sbjct: 30 PMFPAPEERIKWDEYGEIIKARVTYIDYE---------GRSDG---DSIKKIINQMKPRQ 77
Query: 527 VVSNELTVLVHGSAEATEHLKQHCL----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSE 582
++ +VHG EA++ L + C K + VY P++ ET+D TS+ Y+V+L +
Sbjct: 78 LI------IVHGPPEASQDLAECCRAFGGKDI--KVYMPKLHETVDATSETHIYQVRLKD 129
Query: 583 KLMSNVLFKKLGDYEIAWVDA----EVGKTENGML 613
L+S++ F K D E+AW+D V K + G++
Sbjct: 130 SLVSSLQFCKAKDAELAWIDGVLDMRVSKVDTGVI 164
Score = 105 (42.0 bits), Expect = 2.2e-17, Sum P(2) = 2.2e-17
Identities = 24/64 (37%), Positives = 35/64 (54%)
Query: 609 ENGMLS-LLPISTPAPPHK-----SVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYV 662
E G S ++P P PPH+ SV + + +++D K L +GIQ EF GG L C V
Sbjct: 206 ETGEESEIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQV 265
Query: 663 TIRK 666
+R+
Sbjct: 266 AVRR 269
>UNIPROTKB|E9PI75 [details] [associations]
symbol:CPSF3L "Integrator complex subunit 11" species:9606
"Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 GO:GO:0016787
EMBL:AL139287 HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI01011963
ProteinModelPortal:E9PI75 SMR:E9PI75 Ensembl:ENST00000527719
ArrayExpress:E9PI75 Bgee:E9PI75 Uniprot:E9PI75
Length = 209
Score = 209 (78.6 bits), Expect = 6.6e-16, P = 6.6e-16
Identities = 55/187 (29%), Positives = 93/187 (49%)
Query: 22 LVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
LVSI G N ++DCG +ND F D S + ++ +D V++SH H GALPY
Sbjct: 27 LVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHFHLDHCGALPYF 86
Query: 75 MKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQ 133
+ +G P++ T P + + + D + ++ + E + FT I + V + Q
Sbjct: 87 SEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDCMKKVVAVHLHQ 146
Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 193
+ + E + + AGH+LG +++I E V+Y DYN ++HL ++ R
Sbjct: 147 TVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CR 202
Query: 194 PAVLITD 200
P +LIT+
Sbjct: 203 PNLLITE 209
>DICTYBASE|DDB_G0282473 [details] [associations]
symbol:ints9 "integrator complex subunit 9"
species:44689 "Dictyostelium discoideum" [GO:0032039 "integrator
complex" evidence=IEA] [GO:0016180 "snRNA processing" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] [GO:0044351 "macropinocytosis"
evidence=RCA] InterPro:IPR027074 dictyBase:DDB_G0282473
GO:GO:0005634 EMBL:AAFI02000047 GenomeReviews:CM000152_GR
eggNOG:COG1236 InterPro:IPR022712 Pfam:PF10996 SMART:SM01027
KO:K13146 PANTHER:PTHR11203:SF2 RefSeq:XP_640069.1
ProteinModelPortal:Q54SH0 EnsemblProtists:DDB0234099 GeneID:8623598
KEGG:ddi:DDB_G0282473 OMA:DDFSTID ProtClustDB:CLSZ2729002
Uniprot:Q54SH0
Length = 712
Score = 209 (78.6 bits), Expect = 9.2e-16, Sum P(2) = 9.2e-16
Identities = 58/190 (30%), Positives = 87/190 (45%)
Query: 98 MYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGG 157
M ++ L R DL+ DI+ +F+ + + ++++ K G P +G+ LG
Sbjct: 202 MENENLYRDSYRWKDLYKKIDIEKSFEKIQSIRFNESI----KHYGFECIPSSSGYGLGS 257
Query: 158 TVWKITKDG-EDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM 216
W I G E V+Y D + ++ L P VLI N N PP Q
Sbjct: 258 ANWVIESKGFERVVYISDSSLSLSRYPTPFQLSPIDNPDVLILSKINHYPNNPPDQMLSE 317
Query: 217 FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNY-PIYFLTYVSSSTIDYV 275
I TL+ GG VL+P S G +L+L L DY + L Y PIYF++ VS + + Y
Sbjct: 318 LCSNIGSTLQQGGTVLIPSYSCGIILDLFEHLADYLNKVGLPYVPIYFVSSVSKAVLSYA 377
Query: 276 KSFLEWMGDS 285
+ EW+ S
Sbjct: 378 DIYSEWLNKS 387
Score = 72 (30.4 bits), Expect = 9.2e-16, Sum P(2) = 9.2e-16
Identities = 16/57 (28%), Positives = 31/57 (54%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVS 109
STID +L+S+ ++ ALP+ + +++TEP ++G L + + +Q S
Sbjct: 115 STIDMILISNYTNIY--ALPFITEYTNFQGKIYATEPTVQIGKLLLEELVQMDKQYS 169
>UNIPROTKB|E9PIG1
symbol:CPSF3L "Integrator complex subunit 11" species:9606
"Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 GO:GO:0016787
EMBL:AL139287 HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI00981641
ProteinModelPortal:E9PIG1 SMR:E9PIG1 Ensembl:ENST00000530031
ArrayExpress:E9PIG1 Bgee:E9PIG1 Uniprot:E9PIG1
Length = 249
Score = 207 (77.9 bits), Expect = 1.1e-15, P = 1.1e-15
Identities = 55/186 (29%), Positives = 92/186 (49%)
Query: 22 LVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
LVSI G N ++DCG +ND F D S + ++ +D V++SH H GALPY
Sbjct: 68 LVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHFHLDHCGALPYF 127
Query: 75 MKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQ 133
+ +G P++ T P + + + D + ++ + E + FT I + V + Q
Sbjct: 128 SEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDCMKKVVAVHLHQ 187
Query: 134 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 193
+ + E + + AGH+LG +++I E V+Y DYN ++HL ++ R
Sbjct: 188 TVQVDDELE---IKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CR 243
Query: 194 PAVLIT 199
P +LIT
Sbjct: 244 PNLLIT 249
>UNIPROTKB|Q5ZKK2
symbol:INTS9 "Integrator complex subunit 9" species:9031
"Gallus gallus" [GO:0016180 "snRNA processing" evidence=IEA]
[GO:0032039 "integrator complex" evidence=IEA] InterPro:IPR027074
eggNOG:COG1236 InterPro:IPR022712 Pfam:PF10996 SMART:SM01027
GO:GO:0032039 GO:GO:0016180 CTD:55756 HOGENOM:HOG000045359
HOVERGEN:HBG081802 KO:K13146 OrthoDB:EOG415GCW
PANTHER:PTHR11203:SF2 EMBL:AJ720082 IPI:IPI00651516
RefSeq:NP_001026271.1 UniGene:Gga.21113 ProteinModelPortal:Q5ZKK2
STRING:Q5ZKK2 Ensembl:ENSGALT00000026848 GeneID:422023
KEGG:gga:422023 GeneTree:ENSGT00390000001445 InParanoid:Q5ZKK2
OMA:PLAMKCV NextBio:20824712 Uniprot:Q5ZKK2
Length = 658
Score = 183 (69.5 bits), Expect = 3.3e-14, Sum P(3) = 3.3e-14
Identities = 70/252 (27%), Positives = 111/252 (44%)
Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAV 173
+T+ ++++A + + YSQ L G + V P +G+ LG + W I E V Y V
Sbjct: 182 YTMPEVNAALSKIQLVGYSQKIELFG---AVQVTPLSSGYALGSSNWIIQSHYEKVSY-V 237
Query: 174 DYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLL 233
+ H S VLI + P F ++ T+R GGNVL+
Sbjct: 238 SGSSLLTTHPQPMDQASLKNSDVLILTGLTQIPTANPDGMVGEFCSNLAMTVRNGGNVLV 297
Query: 234 PVDSAGRVLELLLILEDYWAEHSL-NYPIYFLTYVSSSTIDYVKSFLEWM-GDSITKSF- 290
P +G + +LL L Y L N P YF++ V++S++++ + F EW+ + TK +
Sbjct: 298 PCYPSGVIYDLLECLYQYIDSAGLSNVPFYFISPVANSSLEFSQIFAEWLCHNKQTKVYL 357
Query: 291 -ETSRDNAFL-----LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI--FVE-W 341
E +A L LKH + + N P ++ SL G D+ F+E W
Sbjct: 358 PEPPFPHAELIQTNKLKHYPSI--HGDFSNDFKQPCVIFTGHPSLRFG---DVVHFMELW 412
Query: 342 ASDVKNLVLFTE 353
N V+FTE
Sbjct: 413 GKSSLNTVIFTE 424
Score = 84 (34.6 bits), Expect = 3.3e-14, Sum P(3) = 3.3e-14
Identities = 27/85 (31%), Positives = 43/85 (50%)
Query: 22 LVSIDGFNFLI----DCGWNDHFD--PSLLQPLSKVA--STIDAVLLSHPDTLHLGALPY 73
LV DG FL +C + D P P +++ ST+D +L+S+ + ALPY
Sbjct: 55 LVLKDGSTFLDKELKECSGHVFVDSVPEFCLPETELLDLSTVDVILISNYHCMM--ALPY 112
Query: 74 AMKQLGLSAPVFSTEPVYRLGLLTM 98
+ G + V++TEP ++G L M
Sbjct: 113 ITEYTGFTGTVYATEPTVQIGRLLM 137
Score = 42 (19.8 bits), Expect = 3.3e-14, Sum P(3) = 3.3e-14
Identities = 19/72 (26%), Positives = 33/72 (45%)
Query: 577 KVQLSEKLMSNVLFKKLGD-YEIAWVDAEVGKTENGMLSLLPISTPAPP--HKSVLVGD- 632
K+++ +L +++ ++ +A V A + +N + LP P PP K V D
Sbjct: 515 KIEIMPELADSLVPLEIKPGISLATVSAMLHTKDNKHVLQLPPKPPQPPTSKKRKRVSDD 574
Query: 633 -LKMADLKPFLS 643
+ LKP LS
Sbjct: 575 VPECKPLKPLLS 586
>UNIPROTKB|F6XI08
symbol:INTS9 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0032039 "integrator complex" evidence=IEA]
[GO:0016180 "snRNA processing" evidence=IEA] InterPro:IPR027074
InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 CTD:55756 KO:K13146
PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445 OMA:PLAMKCV
Ensembl:ENSCAFT00000013124 EMBL:AAEX03014336 RefSeq:XP_543216.2
GeneID:486090 KEGG:cfa:486090 Uniprot:F6XI08
Length = 658
Score = 184 (69.8 bits), Expect = 4.5e-14, Sum P(2) = 4.5e-14
Identities = 73/252 (28%), Positives = 110/252 (43%)
Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAV 173
+T+ +++SA + + YSQ L G + V P +G+ LG + W I E V Y V
Sbjct: 182 YTMQEVNSALSKIQLVGYSQKIELFG---AVQVTPLSSGYALGSSNWIIQSHYEKVSY-V 237
Query: 174 DYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLL 233
+ H S VLI + P F ++ T+R GGNVL+
Sbjct: 238 SGSSLLTTHPQPMDQASLKNSDVLILTGLTQIPTANPDGMVGEFCSNLALTVRNGGNVLV 297
Query: 234 PVDSAGRVLELLLILEDYWAEHSL-NYPIYFLTYVSSSTIDYVKSFLEWM-GDSITKSF- 290
P +G + +LL L Y L N P YF++ V++S++++ + F EW+ + TK +
Sbjct: 298 PCYPSGVIYDLLECLYQYIDSAGLSNIPFYFISPVANSSLEFSQIFAEWLCHNKQTKVYL 357
Query: 291 -ETSRDNAFL-----LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI--FVE-W 341
E +A L LKH L D P +V SL G D+ F+E W
Sbjct: 358 PEPPFPHAELIQTNKLKHYPSLHGDFSSDFRQ--PCVVFTGHPSLRFG---DVVHFMELW 412
Query: 342 ASDVKNLVLFTE 353
N V+FTE
Sbjct: 413 GKSSLNTVIFTE 424
Score = 81 (33.6 bits), Expect = 4.5e-14, Sum P(2) = 4.5e-14
Identities = 17/46 (36%), Positives = 28/46 (60%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTM 98
ST+D +L+S+ + ALPY + G + V++TEP ++G L M
Sbjct: 94 STVDVILISNYHCMM--ALPYITEHTGFTGTVYATEPTVQIGRLLM 137
>UNIPROTKB|F1RJQ5
symbol:INTS9 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0032039 "integrator complex" evidence=IEA] [GO:0016180
"snRNA processing" evidence=IEA] InterPro:IPR027074
InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 PANTHER:PTHR11203:SF2
GeneTree:ENSGT00390000001445 OMA:PLAMKCV EMBL:CU407017
Ensembl:ENSSSCT00000010615 Uniprot:F1RJQ5
Length = 576
Score = 182 (69.1 bits), Expect = 4.8e-14, Sum P(2) = 4.8e-14
Identities = 71/252 (28%), Positives = 111/252 (44%)
Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAV 173
+T+ +++SA + + YSQ L G + V P +G+ LG + W I E V Y V
Sbjct: 100 YTMQEVNSALSKIQMVGYSQKIELFG---AVQVTPLSSGYALGSSNWIIQSHYEKVSY-V 155
Query: 174 DYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLL 233
+ H S VLI + P F ++ T+R GGNVL+
Sbjct: 156 SGSSLLTTHPQPMDQASLKNSDVLILTGLTQIPTANPDGMVGEFCSNLALTVRNGGNVLV 215
Query: 234 PVDSAGRVLELLLILEDYWAEHSLN-YPIYFLTYVSSSTIDYVKSFLEWM-GDSITKSF- 290
P +G + +LL L Y L+ P YF++ V++S++++ + F EW+ + TK +
Sbjct: 216 PCYPSGVIYDLLECLYQYIDSAGLSSIPFYFISPVANSSLEFSQIFAEWLCHNKQTKVYL 275
Query: 291 -ETSRDNAFL-----LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI--FVE-W 341
E +A L LKH + + N P +V SL G D+ F+E W
Sbjct: 276 PEPPFPHAELIQTNKLKHYPSI--HGDFSNDFRQPCVVFTGHPSLRFG---DVVHFMELW 330
Query: 342 ASDVKNLVLFTE 353
N V+FTE
Sbjct: 331 GKSSLNTVIFTE 342
Score = 81 (33.6 bits), Expect = 4.8e-14, Sum P(2) = 4.8e-14
Identities = 17/46 (36%), Positives = 28/46 (60%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTM 98
ST+D +L+S+ + ALPY + G + V++TEP ++G L M
Sbjct: 12 STVDVILISNYHCMM--ALPYITEHTGFTGTVYATEPTVQIGRLLM 55
>UNIPROTKB|F1MMA6
symbol:INTS9 "Integrator complex subunit 9" species:9913
"Bos taurus" [GO:0032039 "integrator complex" evidence=IEA]
[GO:0016180 "snRNA processing" evidence=IEA] InterPro:IPR027074
InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 GO:GO:0032039
GO:GO:0016180 IPI:IPI00701634 UniGene:Bt.91042
PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445 OMA:PLAMKCV
EMBL:DAAA02021965 EMBL:DAAA02021964 Ensembl:ENSBTAT00000049079
ArrayExpress:F1MMA6 Uniprot:F1MMA6
Length = 658
Score = 183 (69.5 bits), Expect = 5.7e-14, Sum P(2) = 5.7e-14
Identities = 71/252 (28%), Positives = 111/252 (44%)
Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAV 173
+T+ +++SA + + YSQ L G + V P +G+ LG + W I E V Y V
Sbjct: 182 YTMQEVNSALSKIQLVGYSQKIELFG---AVQVTPLSSGYALGSSNWIIQSHYEKVSY-V 237
Query: 174 DYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLL 233
+ H S VLI + P F ++ T+R GGNVL+
Sbjct: 238 SGSSLLTTHPQPMDQASLKNSDVLILTGLTQIPTANPDSMVGEFCSNLALTVRNGGNVLV 297
Query: 234 PVDSAGRVLELLLILEDYWAEHSLN-YPIYFLTYVSSSTIDYVKSFLEWM-GDSITKSF- 290
P +G + +LL L Y L+ P YF++ V++S++++ + F EW+ + TK +
Sbjct: 298 PCYPSGVIYDLLECLYQYIDSAGLSSIPFYFISPVANSSLEFSQIFAEWLCHNKQTKVYL 357
Query: 291 -ETSRDNAFL-----LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI--FVE-W 341
E +A L LKH + + N P +V SL G D+ F+E W
Sbjct: 358 PEPPFPHAELIQTNKLKHYPSI--HGDFSNDFRQPCVVFTGHPSLRFG---DVVHFMELW 412
Query: 342 ASDVKNLVLFTE 353
N V+FTE
Sbjct: 413 GKSSLNTVIFTE 424
Score = 81 (33.6 bits), Expect = 5.7e-14, Sum P(2) = 5.7e-14
Identities = 17/46 (36%), Positives = 28/46 (60%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTM 98
ST+D +L+S+ + ALPY + G + V++TEP ++G L M
Sbjct: 94 STVDVILISNYHCMM--ALPYITEHTGFTGTVYATEPTVQIGRLLM 137
>UNIPROTKB|Q2KJA6
symbol:INTS9 "Integrator complex subunit 9" species:9913
"Bos taurus" [GO:0032039 "integrator complex" evidence=IEA]
[GO:0016180 "snRNA processing" evidence=IEA] InterPro:IPR027074
GO:GO:0005634 eggNOG:COG1236 InterPro:IPR022712 Pfam:PF10996
SMART:SM01027 EMBL:BC105437 IPI:IPI00701634 RefSeq:NP_001039828.1
UniGene:Bt.91042 ProteinModelPortal:Q2KJA6 STRING:Q2KJA6
GeneID:533964 KEGG:bta:533964 CTD:55756 HOGENOM:HOG000045359
HOVERGEN:HBG081802 InParanoid:Q2KJA6 KO:K13146 OrthoDB:EOG415GCW
NextBio:20876211 PANTHER:PTHR11203:SF2 Uniprot:Q2KJA6
Length = 658
Score = 183 (69.5 bits), Expect = 5.7e-14, Sum P(2) = 5.7e-14
Identities = 71/252 (28%), Positives = 111/252 (44%)
Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAV 173
+T+ +++SA + + YSQ L G + V P +G+ LG + W I E V Y V
Sbjct: 182 YTMQEVNSALSKIQLVGYSQKIELFG---AVQVTPLSSGYALGSSNWIIQSHYEKVSY-V 237
Query: 174 DYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLL 233
+ H S VLI + P F ++ T+R GGNVL+
Sbjct: 238 SGSSLLTTHPQPMDQASLKNSDVLILTGLTQIPTANPDSMVGEFCSNLALTVRNGGNVLV 297
Query: 234 PVDSAGRVLELLLILEDYWAEHSLN-YPIYFLTYVSSSTIDYVKSFLEWM-GDSITKSF- 290
P +G + +LL L Y L+ P YF++ V++S++++ + F EW+ + TK +
Sbjct: 298 PCYPSGVIYDLLECLYQYIDSAGLSSIPFYFISPVANSSLEFSQIFAEWLCHNKQTKVYL 357
Query: 291 -ETSRDNAFL-----LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI--FVE-W 341
E +A L LKH + + N P +V SL G D+ F+E W
Sbjct: 358 PEPPFPHAELIQTNKLKHYPSI--HGDFSNDFRQPCVVFTGHPSLRFG---DVVHFMELW 412
Query: 342 ASDVKNLVLFTE 353
N V+FTE
Sbjct: 413 GKSSLNTVIFTE 424
Score = 81 (33.6 bits), Expect = 5.7e-14, Sum P(2) = 5.7e-14
Identities = 17/46 (36%), Positives = 28/46 (60%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTM 98
ST+D +L+S+ + ALPY + G + V++TEP ++G L M
Sbjct: 94 STVDVILISNYHCMM--ALPYITEHTGFTGTVYATEPTVQIGRLLM 137
>ZFIN|ZDB-GENE-061013-129
symbol:ints9 "integrator complex subunit 9"
species:7955 "Danio rerio" [GO:0016180 "snRNA processing"
evidence=IEA] [GO:0032039 "integrator complex" evidence=IEA]
InterPro:IPR027074 ZFIN:ZDB-GENE-061013-129 eggNOG:COG1236
InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 CTD:55756
HOGENOM:HOG000045359 HOVERGEN:HBG081802 KO:K13146 OrthoDB:EOG415GCW
PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445
EMBL:CABZ01076623 EMBL:CABZ01078246 EMBL:CABZ01078247
EMBL:CABZ01078248 EMBL:CABZ01078249 EMBL:BC124793 IPI:IPI00800641
RefSeq:NP_001070738.1 UniGene:Dr.116109 Ensembl:ENSDART00000097865
GeneID:768124 KEGG:dre:768124 InParanoid:Q08BB6 NextBio:20918446
Uniprot:Q08BB6
Length = 658
Score = 182 (69.1 bits), Expect = 6.8e-14, Sum P(3) = 6.8e-14
Identities = 70/252 (27%), Positives = 113/252 (44%)
Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAV 173
++L +++SA V + YSQ L G + V P +G+ LG + W I E V Y V
Sbjct: 182 YSLQEVNSALSKVQLVGYSQKVELFG---AVQVTPLSSGYSLGSSNWIIQSHYEKVSY-V 237
Query: 174 DYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLL 233
+ H S VLI + P F ++ T+RAGGNVL+
Sbjct: 238 SGSSLLTTHPQPMEQSSLKNSDVLILTGLTQIPTANPDGMLGEFCSNLAMTVRAGGNVLV 297
Query: 234 PVDSAGRVLELLLILEDYWAEHSLNY-PIYFLTYVSSSTIDYVKSFLEWM-GDSITKSF- 290
P S+G + +LL L + +L P YF++ V++S++++ + F EW+ + +K +
Sbjct: 298 PCYSSGVIYDLLECLYQFMDSANLGTTPFYFISPVANSSLEFSQIFAEWLCQNKQSKVYL 357
Query: 291 -ETSRDNAFL-----LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI--FVE-W 341
E +A L LKH + + + P +V SL G D+ F+E W
Sbjct: 358 PEPPFPHAELIQTNKLKHYPSI--HGDFSSEFRQPCVVFTGHPSLRFG---DVVHFMELW 412
Query: 342 ASDVKNLVLFTE 353
N ++FTE
Sbjct: 413 GKSSLNTIIFTE 424
Score = 82 (33.9 bits), Expect = 6.8e-14, Sum P(3) = 6.8e-14
Identities = 18/46 (39%), Positives = 28/46 (60%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTM 98
STID +L+S+ + ALPY + G + V++TEP ++G L M
Sbjct: 94 STIDVILISNYHCMM--ALPYITEHTGFTGTVYATEPTLQIGRLLM 137
Score = 42 (19.8 bits), Expect = 6.8e-14, Sum P(3) = 6.8e-14
Identities = 38/156 (24%), Positives = 58/156 (37%)
Query: 363 MLQADPPPKAVKVTMSRRVPLVGE-ELIAYEEEQTRLKKEEALKASLVKEEESKASLGPD 421
ML+ PPP A + R+P E I E + +KA + S D
Sbjct: 489 MLELQPPPMAYRRCSVLRLPFRRRYERIHLLPELAKSLVPSEVKAGVSVATVSAVLQSKD 548
Query: 422 NN--LSGDPMVIDXXXXXXSADVVEPHGGRYRD-ILIDGFVPPSTSVAPMFP--FYENNS 476
N L P V V+E + + L+ G VP +A + E
Sbjct: 549 NKHVLQPVPKVAPVAPSKKRKRVLEEPPEQLKPKTLLSGAVPLEPFLATLHKNGIMEVKV 608
Query: 477 EWDDFGEVIN--PDDYIIKDEDMDQAAMHIGGDDGK 510
E G +++ +D +I+ ED D A HI D+ +
Sbjct: 609 EETADGHILHLQAEDVLIQLED-D--ATHIICDNNE 641
>UNIPROTKB|G3XAN1
symbol:INTS9 "Integrator complex subunit 9" species:9606
"Homo sapiens" [GO:0016180 "snRNA processing" evidence=IEA]
[GO:0032039 "integrator complex" evidence=IEA] InterPro:IPR027074
EMBL:CH471080 InterPro:IPR022712 Pfam:PF10996 SMART:SM01027
PANTHER:PTHR11203:SF2 EMBL:AC040975 EMBL:AC131969 UniGene:Hs.162397
HGNC:HGNC:25592 ChiTaRS:INTS9 ProteinModelPortal:G3XAN1
Ensembl:ENST00000523303 ArrayExpress:G3XAN1 Bgee:G3XAN1
Uniprot:G3XAN1
Length = 525
Score = 178 (67.7 bits), Expect = 9.5e-14, Sum P(2) = 9.5e-14
Identities = 69/252 (27%), Positives = 112/252 (44%)
Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAV 173
+T+ +++SA + + YSQ L G + V P +G+ LG + W I E V Y V
Sbjct: 182 YTMQEVNSALSKIQLVGYSQKIELFG---AVQVTPLSSGYALGSSNWIIQSHYEKVSY-V 237
Query: 174 DYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLL 233
+ H S VL+ + P F ++ T+R GGNVL+
Sbjct: 238 SGSSLLTTHPQPMDQASLKNSDVLVLTGLTQIPTANPDGMVGEFCSNLALTVRNGGNVLV 297
Query: 234 PVDSAGRVLELLLILEDYWAEHSLN-YPIYFLTYVSSSTIDYVKSFLEWM-GDSITKSF- 290
P +G + +LL L Y L+ P+YF++ V++S++++ + F EW+ + +K +
Sbjct: 298 PCYPSGVIYDLLECLYQYIDSAGLSSVPLYFISPVANSSLEFSQIFAEWLCHNKQSKVYL 357
Query: 291 -ETSRDNAFL-----LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI--FVE-W 341
E +A L LKH + + N P +V SL G D+ F+E W
Sbjct: 358 PEPPFPHAELIQTNKLKHYPSI--HGDFSNDFRQPCVVFTGHPSLRFG---DVVHFMELW 412
Query: 342 ASDVKNLVLFTE 353
N V+FTE
Sbjct: 413 GKSSLNTVIFTE 424
Score = 81 (33.6 bits), Expect = 9.5e-14, Sum P(2) = 9.5e-14
Identities = 17/46 (36%), Positives = 28/46 (60%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTM 98
ST+D +L+S+ + ALPY + G + V++TEP ++G L M
Sbjct: 94 STVDVILISNYHCMM--ALPYITEHTGFTGTVYATEPTVQIGRLLM 137
>MGI|MGI:1098533
symbol:Ints9 "integrator complex subunit 9" species:10090
"Mus musculus" [GO:0003674 "molecular_function" evidence=ND]
[GO:0005634 "nucleus" evidence=IEA] [GO:0016180 "snRNA processing"
evidence=ISO] [GO:0032039 "integrator complex" evidence=ISO]
InterPro:IPR027074 MGI:MGI:1098533 eggNOG:COG1236
InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 GO:GO:0032039
GO:GO:0016180 CTD:55756 HOGENOM:HOG000045359 HOVERGEN:HBG081802
KO:K13146 OrthoDB:EOG415GCW PANTHER:PTHR11203:SF2
GeneTree:ENSGT00390000001445 EMBL:AK038979 EMBL:AK077634
EMBL:AK136992 EMBL:AK150195 EMBL:BC028953 EMBL:BC055700
IPI:IPI00223422 IPI:IPI00406798 RefSeq:NP_001240660.1
RefSeq:NP_700463.2 UniGene:Mm.71332 ProteinModelPortal:Q8K114
SMR:Q8K114 STRING:Q8K114 PhosphoSite:Q8K114 PaxDb:Q8K114
PRIDE:Q8K114 Ensembl:ENSMUST00000043914 GeneID:210925
KEGG:mmu:210925 UCSC:uc007uiv.1 UCSC:uc007uiw.1 InParanoid:Q8K114
NextBio:373083 Bgee:Q8K114 CleanEx:MM_INTS9 Genevestigator:Q8K114
Uniprot:Q8K114
Length = 658
Score = 179 (68.1 bits), Expect = 1.5e-13, Sum P(3) = 1.5e-13
Identities = 68/250 (27%), Positives = 112/250 (44%)
Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAV 173
+T+ +++SA + + YSQ L G + V P +G+ LG + W I E V Y V
Sbjct: 182 YTMQEVNSALSKIQLVGYSQKIELFG---AVQVTPLSSGYALGSSNWIIQSHYEKVSY-V 237
Query: 174 DYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLL 233
+ H S VLI + P F ++ T+R GGNVL+
Sbjct: 238 SGSSLLTTHPQPMDQASLKNSDVLILTGLTQIPTANPDGMVGEFCSNLALTVRNGGNVLV 297
Query: 234 PVDSAGRVLELLLILEDYWAEHSL-NYPIYFLTYVSSSTIDYVKSFLEWM-GDSITKSF- 290
P +G + +LL L Y L N P YF++ V++S++++ + F EW+ + +K +
Sbjct: 298 PCYPSGVIYDLLECLYQYIDSAGLSNIPFYFISPVANSSLEFSQIFAEWLCHNKQSKVYL 357
Query: 291 -ETSRDNAFLLKHVTLLINKS---ELDNAPDGPKLVLASMASLEAGFSHDI--FVE-WAS 343
E +A L++ L +S + N P ++ SL G D+ F+E W
Sbjct: 358 PEPPFPHAELIQTNKLKHYRSIHGDFSNDFRQPCVLFTGHPSLRFG---DVVHFMELWGK 414
Query: 344 DVKNLVLFTE 353
N ++FTE
Sbjct: 415 SSLNTIIFTE 424
Score = 81 (33.6 bits), Expect = 1.5e-13, Sum P(3) = 1.5e-13
Identities = 17/46 (36%), Positives = 28/46 (60%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTM 98
ST+D +L+S+ + ALPY + G + V++TEP ++G L M
Sbjct: 94 STVDVILISNYHCMM--ALPYITEHTGFTGTVYATEPTMQIGRLLM 137
Score = 43 (20.2 bits), Expect = 1.5e-13, Sum P(3) = 1.5e-13
Identities = 8/21 (38%), Positives = 13/21 (61%)
Query: 368 PPPKAVKVTMSRRVPLVGEEL 388
PPPK + T S++ V E++
Sbjct: 555 PPPKPTQPTSSKKRKRVNEDI 575
>UNIPROTKB|Q9NV88
symbol:INTS9 "Integrator complex subunit 9" species:9606
"Homo sapiens" [GO:0005515 "protein binding" evidence=IPI]
[GO:0016180 "snRNA processing" evidence=IDA] [GO:0032039
"integrator complex" evidence=IDA] InterPro:IPR027074
eggNOG:COG1236 InterPro:IPR022712 Pfam:PF10996 SMART:SM01027
GO:GO:0032039 EMBL:U96629 GO:GO:0016180 CTD:55756
HOGENOM:HOG000045359 HOVERGEN:HBG081802 KO:K13146 OrthoDB:EOG415GCW
PANTHER:PTHR11203:SF2 OMA:PLAMKCV EMBL:AK001733 EMBL:AK298468
EMBL:AK300593 EMBL:AC040975 EMBL:AC131969 EMBL:BC025267
EMBL:BK005726 EMBL:BK005674 IPI:IPI00290514 IPI:IPI00871167
RefSeq:NP_001138631.1 RefSeq:NP_001166033.1 RefSeq:NP_060720.2
UniGene:Hs.162397 ProteinModelPortal:Q9NV88 SMR:Q9NV88
IntAct:Q9NV88 STRING:Q9NV88 PhosphoSite:Q9NV88 DMDM:119371246
PaxDb:Q9NV88 PRIDE:Q9NV88 DNASU:55756 Ensembl:ENST00000416984
Ensembl:ENST00000521022 Ensembl:ENST00000521777 GeneID:55756
KEGG:hsa:55756 UCSC:uc003xha.3 GeneCards:GC08M028625
HGNC:HGNC:25592 MIM:611352 neXtProt:NX_Q9NV88 PharmGKB:PA162392192
InParanoid:Q9NV88 PhylomeDB:Q9NV88 ChiTaRS:INTS9 GenomeRNAi:55756
NextBio:60763 ArrayExpress:Q9NV88 Bgee:Q9NV88 CleanEx:HS_INTS9
Genevestigator:Q9NV88 GermOnline:ENSG00000104299 Uniprot:Q9NV88
Length = 658
Score = 178 (67.7 bits), Expect = 2.0e-13, Sum P(2) = 2.0e-13
Identities = 69/252 (27%), Positives = 112/252 (44%)
Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAV 173
+T+ +++SA + + YSQ L G + V P +G+ LG + W I E V Y V
Sbjct: 182 YTMQEVNSALSKIQLVGYSQKIELFG---AVQVTPLSSGYALGSSNWIIQSHYEKVSY-V 237
Query: 174 DYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLL 233
+ H S VL+ + P F ++ T+R GGNVL+
Sbjct: 238 SGSSLLTTHPQPMDQASLKNSDVLVLTGLTQIPTANPDGMVGEFCSNLALTVRNGGNVLV 297
Query: 234 PVDSAGRVLELLLILEDYWAEHSLN-YPIYFLTYVSSSTIDYVKSFLEWM-GDSITKSF- 290
P +G + +LL L Y L+ P+YF++ V++S++++ + F EW+ + +K +
Sbjct: 298 PCYPSGVIYDLLECLYQYIDSAGLSSVPLYFISPVANSSLEFSQIFAEWLCHNKQSKVYL 357
Query: 291 -ETSRDNAFL-----LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI--FVE-W 341
E +A L LKH + + N P +V SL G D+ F+E W
Sbjct: 358 PEPPFPHAELIQTNKLKHYPSI--HGDFSNDFRQPCVVFTGHPSLRFG---DVVHFMELW 412
Query: 342 ASDVKNLVLFTE 353
N V+FTE
Sbjct: 413 GKSSLNTVIFTE 424
Score = 81 (33.6 bits), Expect = 2.0e-13, Sum P(2) = 2.0e-13
Identities = 17/46 (36%), Positives = 28/46 (60%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTM 98
ST+D +L+S+ + ALPY + G + V++TEP ++G L M
Sbjct: 94 STVDVILISNYHCMM--ALPYITEHTGFTGTVYATEPTVQIGRLLM 137
>RGD|1311539
symbol:Ints9 "integrator complex subunit 9" species:10116
"Rattus norvegicus" [GO:0016180 "snRNA processing"
evidence=IEA;ISO] [GO:0032039 "integrator complex"
evidence=IEA;ISO] InterPro:IPR027074 RGD:1311539 InterPro:IPR022712
Pfam:PF10996 SMART:SM01027 GO:GO:0032039 GO:GO:0016180
PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445 IPI:IPI00362364
Ensembl:ENSRNOT00000018071 Uniprot:F1M365
Length = 659
Score = 177 (67.4 bits), Expect = 3.1e-13, Sum P(3) = 3.1e-13
Identities = 70/250 (28%), Positives = 113/250 (45%)
Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAV 173
+T+ +++SA + + YSQ L G + V P +G+ LG + W I E V Y V
Sbjct: 183 YTMQEVNSALSKIQLVGYSQKIELFG---AVQVTPLSSGYALGSSNWIIQSHYEKVSY-V 238
Query: 174 DYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLL 233
+ H S VLI + P F ++ T+R GGNVL+
Sbjct: 239 SGSSLLTTHPQPMDQASLKNSDVLILTGLTQIPTANPDGMVGEFCSNLALTVRNGGNVLV 298
Query: 234 PVDSAGRVLELLLILEDYWAEHSL-NYPIYFLTYVSSSTIDYVKSFLEWM-GDSITKSF- 290
P +G + +LL L Y L N P YF++ V++S++++ + F EW+ + +K +
Sbjct: 299 PCYPSGVIYDLLECLYQYIDSAGLSNIPFYFISPVANSSLEFSQIFAEWLCHNKQSKVYL 358
Query: 291 -ETSRDNAFLLKHVTLLINKS-ELDNAPD--GPKLVLASMASLEAGFSHDI--FVE-WAS 343
E +A L++ L +S D + D P ++ SL G D+ F+E W
Sbjct: 359 PEPPFPHAELIQTNKLKHYRSIHGDFSHDFRQPCVLFTGHPSLRFG---DVVHFMELWGK 415
Query: 344 DVKNLVLFTE 353
N V+FTE
Sbjct: 416 SSLNTVIFTE 425
Score = 81 (33.6 bits), Expect = 3.1e-13, Sum P(3) = 3.1e-13
Identities = 17/46 (36%), Positives = 28/46 (60%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTM 98
ST+D +L+S+ + ALPY + G + V++TEP ++G L M
Sbjct: 95 STVDVILISNYHCMM--ALPYITEHTGFTGTVYATEPTMQIGRLLM 138
Score = 42 (19.8 bits), Expect = 3.1e-13, Sum P(3) = 3.1e-13
Identities = 8/21 (38%), Positives = 13/21 (61%)
Query: 368 PPPKAVKVTMSRRVPLVGEEL 388
PPPK + T S++ V E++
Sbjct: 556 PPPKPTQPTSSKKRKRVSEDV 576
>UNIPROTKB|H7BYQ6
symbol:INTS9 "Integrator complex subunit 9" species:9606
"Homo sapiens" [GO:0016180 "snRNA processing" evidence=IEA]
[GO:0032039 "integrator complex" evidence=IEA] InterPro:IPR027074
InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 PANTHER:PTHR11203:SF2
EMBL:AC040975 EMBL:AC131969 HGNC:HGNC:25592 ChiTaRS:INTS9
ProteinModelPortal:H7BYQ6 Ensembl:ENST00000397363 Bgee:H7BYQ6
Uniprot:H7BYQ6
Length = 552
Score = 178 (67.7 bits), Expect = 5.1e-12, Sum P(2) = 5.1e-12
Identities = 69/252 (27%), Positives = 112/252 (44%)
Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAV 173
+T+ +++SA + + YSQ L G + V P +G+ LG + W I E V Y V
Sbjct: 76 YTMQEVNSALSKIQLVGYSQKIELFG---AVQVTPLSSGYALGSSNWIIQSHYEKVSY-V 131
Query: 174 DYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLL 233
+ H S VL+ + P F ++ T+R GGNVL+
Sbjct: 132 SGSSLLTTHPQPMDQASLKNSDVLVLTGLTQIPTANPDGMVGEFCSNLALTVRNGGNVLV 191
Query: 234 PVDSAGRVLELLLILEDYWAEHSLN-YPIYFLTYVSSSTIDYVKSFLEWM-GDSITKSF- 290
P +G + +LL L Y L+ P+YF++ V++S++++ + F EW+ + +K +
Sbjct: 192 PCYPSGVIYDLLECLYQYIDSAGLSSVPLYFISPVANSSLEFSQIFAEWLCHNKQSKVYL 251
Query: 291 -ETSRDNAFL-----LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI--FVE-W 341
E +A L LKH + + N P +V SL G D+ F+E W
Sbjct: 252 PEPPFPHAELIQTNKLKHYPSI--HGDFSNDFRQPCVVFTGHPSLRFG---DVVHFMELW 306
Query: 342 ASDVKNLVLFTE 353
N V+FTE
Sbjct: 307 GKSSLNTVIFTE 318
Score = 65 (27.9 bits), Expect = 5.1e-12, Sum P(2) = 5.1e-12
Identities = 12/29 (41%), Positives = 18/29 (62%)
Query: 70 ALPYAMKQLGLSAPVFSTEPVYRLGLLTM 98
ALPY + G + V++TEP ++G L M
Sbjct: 3 ALPYITEHTGFTGTVYATEPTVQIGRLLM 31
>WB|WBGene00017608
symbol:F19F10.12 species:6239 "Caenorhabditis elegans"
[GO:0009792 "embryo development ending in birth or egg hatching"
evidence=IMP] InterPro:IPR027074 GO:GO:0009792 eggNOG:COG1236
InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 KO:K13146
PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445 EMBL:FO080914
RefSeq:NP_504953.1 ProteinModelPortal:Q95ZM2 PaxDb:Q95ZM2
EnsemblMetazoa:F19F10.12 GeneID:179142 KEGG:cel:CELE_F19F10.12
UCSC:F19F10.12 CTD:179142 WormBase:F19F10.12 HOGENOM:HOG000199610
InParanoid:Q95ZM2 OMA:EFMERIE NextBio:904092 Uniprot:Q95ZM2
Length = 646
Score = 160 (61.4 bits), Expect = 4.5e-11, Sum P(2) = 4.5e-11
Identities = 72/289 (24%), Positives = 120/289 (41%)
Query: 114 FTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIY-- 171
+T D+ S V L+++Q L I V P V+GH G W I + E Y
Sbjct: 174 YTTTDMHSCLAKVITLSFNQTIDLFR----IKVTPVVSGHTYGSAYWTIKTENEQFAYLS 229
Query: 172 AVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNV 231
A + + K + L + +L+T + + L + ++ I+ L+ G+V
Sbjct: 230 ASNPSATDVKLMETAPLRAVDH--ILVT-SLSRLVDTTAKEMGYSLIKTITDVLKKHGSV 286
Query: 232 LLPVDSAGRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITK 288
LLP+ G + E++ + D + L+ PIYF++ V+ S I EWM +S
Sbjct: 287 LLPICPVGPIFEMIEAVSDIITTTNGIPLDTPIYFISPVAKSAIAMASISAEWMSESRQN 346
Query: 289 SF---ETSRDNAFLLKHVTLLINKS---ELDNAPDGPKLVLASMASLEAGFSHDIFVEWA 342
+ E ++ L+K + I S P ++ AS ASL G + +
Sbjct: 347 AVYLPEEPYSHSNLIKSGRVKIYDSLYGSFSKEFKTPCVIFASHASLRIGDAAHMVEVLG 406
Query: 343 SDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVG-EELIA 390
SD KN V+ T+ R + P K + + M R+ E L+A
Sbjct: 407 SDPKNAVIVTDPDLPCEDVREPFRNLPIKFINIPMDFRMDFASLERLLA 455
Score = 77 (32.2 bits), Expect = 4.5e-11, Sum P(2) = 4.5e-11
Identities = 21/61 (34%), Positives = 37/61 (60%)
Query: 54 TIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD--QYLSRRQVSEF 111
TIDA+L+S+ ++ +G LP+ + G S ++ TE Y+ G L M + +++SR +V
Sbjct: 89 TIDAILVSNYESF-VG-LPFYTEGSGFSGKIYVTEIAYQYGKLLMEEMLEFISRIEVLPS 146
Query: 112 D 112
D
Sbjct: 147 D 147
>FB|FBgn0036570
symbol:IntS9 "Integrator 9" species:7227 "Drosophila
melanogaster" [GO:0005847 "mRNA cleavage and polyadenylation
specificity factor complex" evidence=ISS] [GO:0006379 "mRNA
cleavage" evidence=ISS] [GO:0006378 "mRNA polyadenylation"
evidence=ISS] [GO:0034472 "snRNA 3'-end processing" evidence=IDA]
[GO:0032039 "integrator complex" evidence=ISS] [GO:0016180 "snRNA
processing" evidence=ISS] InterPro:IPR027074 EMBL:AE014296
GO:GO:0006378 GO:GO:0005847 GO:GO:0006379 InterPro:IPR022712
Pfam:PF10996 SMART:SM01027 CTD:55756 KO:K13146
PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445 OMA:PLAMKCV
GO:GO:0034472 EMBL:AY058574 RefSeq:NP_648838.3 UniGene:Dm.977
SMR:Q95TS5 IntAct:Q95TS5 MINT:MINT-1734573
EnsemblMetazoa:FBtr0075495 GeneID:39763 KEGG:dme:Dmel_CG5222
UCSC:CG5222-RA FlyBase:FBgn0036570 InParanoid:Q95TS5
OrthoDB:EOG4FJ6QV GenomeRNAi:39763 NextBio:815254 Uniprot:Q95TS5
Length = 654
Score = 148 (57.2 bits), Expect = 4.5e-10, Sum P(2) = 4.5e-10
Identities = 62/254 (24%), Positives = 112/254 (44%)
Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
+F+L D+ + VT + Y + + G + P +G+ LG + W ++ E + Y
Sbjct: 180 IFSLKDVQGSLSKVTIMGYDEKLDILG---AFIATPVSSGYCLGSSNWVLSTAHEKICY- 235
Query: 173 VDYNRRKEKHLNGTVLESFVRPA-VLI-TDAYNALHNQPPRQQREMFQDAISKTLRAGGN 230
V + H + +S ++ A VLI T A P + E+ + ++ T+R G+
Sbjct: 236 VSGSSTLTTHPR-PINQSALKHADVLIMTGLTQAPTVNPDTKLGELCMN-VALTIRNNGS 293
Query: 231 VLLPVDSAGRVLELLLILEDYWAEHSLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 289
L+P +G V +L L LN P++F++ V+ S++ Y EW+ +
Sbjct: 294 ALIPCYPSGVVYDLFECLTQNLENAGLNNVPMFFISPVADSSLAYSNILAEWLSSAKQNK 353
Query: 290 FETSRD---NAFLL-----KHVTLLINKSELDNAPDGPKLVLASMASLEAGFS-HDIFVE 340
D +AF L KH + ++ + P +V SL G + H F+E
Sbjct: 354 VYLPDDPFPHAFYLRNNKLKHYNHVFSEGFSKDFRQ-PCVVFCGHPSLRFGDAVH--FIE 410
Query: 341 -WASDVKNLVLFTE 353
W ++ N ++FTE
Sbjct: 411 MWGNNPNNSIIFTE 424
Score = 80 (33.2 bits), Expect = 4.5e-10, Sum P(2) = 4.5e-10
Identities = 22/68 (32%), Positives = 35/68 (51%)
Query: 31 LIDCGWNDHFD--PSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFS 86
L DC D P P+ K+ S +D +L+S+ L++ ALPY + G V++
Sbjct: 69 LKDCCGRVFVDSTPEFNLPMDKMLDFSEVDVILISN--YLNMLALPYITENTGFKGKVYA 126
Query: 87 TEPVYRLG 94
TEP ++G
Sbjct: 127 TEPTLQIG 134
Score = 45 (20.9 bits), Expect = 1.8e-06, Sum P(2) = 1.8e-06
Identities = 10/33 (30%), Positives = 17/33 (51%)
Query: 21 YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVAS 53
Y+++ G ++DCG + + L PL V S
Sbjct: 15 YIITFKGLRIMLDCGLTEQTVLNFL-PLPFVQS 46
>UNIPROTKB|Q9KV92
symbol:VC_0264 "Putative uncharacterized protein"
species:243277 "Vibrio cholerae O1 biovar El Tor str. N16961"
[GO:0003674 "molecular_function" evidence=ND] [GO:0005575
"cellular_component" evidence=ND] [GO:0008150 "biological_process"
evidence=ND] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 EMBL:AE003852 GenomeReviews:AE003852_GR GO:GO:0016787
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
KO:K07576 OMA:CHIDHVG PIR:F82345 RefSeq:NP_229920.1
ProteinModelPortal:Q9KV92 DNASU:2614470 GeneID:2614470
KEGG:vch:VC0264 PATRIC:20079570 ProtClustDB:CLSK2517501
Uniprot:Q9KV92
Length = 455
Score = 160 (61.4 bits), Expect = 2.8e-08, P = 2.8e-08
Identities = 85/359 (23%), Positives = 147/359 (40%)
Query: 26 DGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF 85
DG LIDCG D L + +DA++L+H H+G LP+ + GL P++
Sbjct: 39 DGQALLIDCGLFQGADERPLA-VEFALGHVDALILTHAHIDHIGRLPWLLAA-GLKQPIY 96
Query: 86 STEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHL-----SGK 140
ST L L + D + +S + V RL Q+Y +
Sbjct: 97 STAATAELVPLMLEDGLKLQLGMSP------KQSERVLTEVRRLLRVQDYQKWFAVQPKR 150
Query: 141 GEGIVVAPHVAGHLLGGTVWKITK-DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVL-I 198
+ + V AGH+LG +I + +GE V+++ D L +S R L I
Sbjct: 151 ADSLWVRFQPAGHILGSAYVEIRRPNGEVVVFSGDLGPSHTPLLPDP--QSPERADYLFI 208
Query: 199 TDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILED--YWAEHS 256
Y ++ + + + + I ++L GG +L+P S GR ELL +E + +
Sbjct: 209 ETTYGDKQHEDVQSRGQRLRAMIERSLTDGGAILIPAFSVGRTQELLFDIEQLIFSQQID 268
Query: 257 LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSE--LDN 314
N PI + ++ + F + G + R + +T+ +++ L N
Sbjct: 269 ANLPIILDSPMAQRVTRSYRRFKQLWGREAKARLQMHRHPLAFEQCITVEDHRTHERLVN 328
Query: 315 --APDGPK-LVLASMASLEAGFSHDIFVEWASDVK-NLVLFTERGQFGTLARMLQADPP 369
A G +V+A+ + G D D + +L+L + + GTL R +Q+ P
Sbjct: 329 RLASTGEAAIVVAASGMCQGGRIMDYLKALLPDKRTDLILAGFQAE-GTLGRSIQSGQP 386
>TIGR_CMR|VC_0264
symbol:VC_0264 "conserved hypothetical protein" species:686
"Vibrio cholerae O1 biovar El Tor" [GO:0003674 "molecular_function"
evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
[GO:0008150 "biological_process" evidence=ND] InterPro:IPR001279
Pfam:PF00753 SMART:SM00849 Pfam:PF07521 EMBL:AE003852
GenomeReviews:AE003852_GR GO:GO:0016787 InterPro:IPR022712
InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 KO:K07576 OMA:CHIDHVG
PIR:F82345 RefSeq:NP_229920.1 ProteinModelPortal:Q9KV92
DNASU:2614470 GeneID:2614470 KEGG:vch:VC0264 PATRIC:20079570
ProtClustDB:CLSK2517501 Uniprot:Q9KV92
Length = 455
Score = 160 (61.4 bits), Expect = 2.8e-08, P = 2.8e-08
Identities = 85/359 (23%), Positives = 147/359 (40%)
Query: 26 DGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF 85
DG LIDCG D L + +DA++L+H H+G LP+ + GL P++
Sbjct: 39 DGQALLIDCGLFQGADERPLA-VEFALGHVDALILTHAHIDHIGRLPWLLAA-GLKQPIY 96
Query: 86 STEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHL-----SGK 140
ST L L + D + +S + V RL Q+Y +
Sbjct: 97 STAATAELVPLMLEDGLKLQLGMSP------KQSERVLTEVRRLLRVQDYQKWFAVQPKR 150
Query: 141 GEGIVVAPHVAGHLLGGTVWKITK-DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVL-I 198
+ + V AGH+LG +I + +GE V+++ D L +S R L I
Sbjct: 151 ADSLWVRFQPAGHILGSAYVEIRRPNGEVVVFSGDLGPSHTPLLPDP--QSPERADYLFI 208
Query: 199 TDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILED--YWAEHS 256
Y ++ + + + + I ++L GG +L+P S GR ELL +E + +
Sbjct: 209 ETTYGDKQHEDVQSRGQRLRAMIERSLTDGGAILIPAFSVGRTQELLFDIEQLIFSQQID 268
Query: 257 LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSE--LDN 314
N PI + ++ + F + G + R + +T+ +++ L N
Sbjct: 269 ANLPIILDSPMAQRVTRSYRRFKQLWGREAKARLQMHRHPLAFEQCITVEDHRTHERLVN 328
Query: 315 --APDGPK-LVLASMASLEAGFSHDIFVEWASDVK-NLVLFTERGQFGTLARMLQADPP 369
A G +V+A+ + G D D + +L+L + + GTL R +Q+ P
Sbjct: 329 RLASTGEAAIVVAASGMCQGGRIMDYLKALLPDKRTDLILAGFQAE-GTLGRSIQSGQP 386
>UNIPROTKB|E9PIL7 [details] [associations]
symbol:CPSF3L "Integrator complex subunit 11" species:9606
"Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 GO:GO:0016787 EMBL:AL139287
HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI00977321
ProteinModelPortal:E9PIL7 SMR:E9PIL7 Ensembl:ENST00000534345
ArrayExpress:E9PIL7 Bgee:E9PIL7 Uniprot:E9PIL7
Length = 140
Score = 135 (52.6 bits), Expect = 5.7e-08, P = 5.7e-08
Identities = 40/131 (30%), Positives = 65/131 (49%)
Query: 5 VQVTPLSGVFNENPLS-YLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTID 56
++VTPL G + S LVSI G N ++DCG +ND F D S + ++ +D
Sbjct: 4 IRVTPLVGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLD 63
Query: 57 AVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD-QYLSRRQVSEFDLFT 115
V++SH H GALPY + +G P++ T P + + + D + ++ + E + FT
Sbjct: 64 CVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFT 123
Query: 116 LDDIDSAFQSV 126
I + V
Sbjct: 124 SQMIKDCMKKV 134
>UNIPROTKB|G3V5T3 [details] [associations]
symbol:CPSF2 "Cleavage and polyadenylation-specificity
factor subunit 2" species:9606 "Homo sapiens" [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=IEA] [GO:0006378 "mRNA polyadenylation" evidence=IEA]
[GO:0006379 "mRNA cleavage" evidence=IEA] InterPro:IPR027075
PANTHER:PTHR11203:SF5 HGNC:HGNC:2325 ChiTaRS:CPSF2 EMBL:AL121773
ProteinModelPortal:G3V5T3 SMR:G3V5T3 Ensembl:ENST00000554290
ArrayExpress:G3V5T3 Bgee:G3V5T3 Uniprot:G3V5T3
Length = 62
Score = 132 (51.5 bits), Expect = 1.2e-07, P = 1.2e-07
Identities = 25/61 (40%), Positives = 39/61 (63%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++T LSGV E+ L YL+ +D F FL+DCGW++HF ++ L + TI +L
Sbjct: 1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRNL-DTIQKILH 59
Query: 61 S 61
S
Sbjct: 60 S 60
>TAIR|locus:2079696 [details] [associations]
symbol:AT3G07530 "AT3G07530" species:3702 "Arabidopsis
thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
"nucleus" evidence=ISM] [GO:0008150 "biological_process"
evidence=ND] InterPro:IPR027074 EMBL:CP002686 InterPro:IPR022712
Pfam:PF10996 SMART:SM01027 KO:K13146 PANTHER:PTHR11203:SF2
IPI:IPI00520313 RefSeq:NP_187409.2 UniGene:At.53215
ProteinModelPortal:F4JEH2 PRIDE:F4JEH2 EnsemblPlants:AT3G07530.1
GeneID:819942 KEGG:ath:AT3G07530 OMA:CYNGTLI Uniprot:F4JEH2
Length = 699
Score = 107 (42.7 bits), Expect = 1.4e-06, Sum P(4) = 1.4e-06
Identities = 38/138 (27%), Positives = 63/138 (45%)
Query: 227 AGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSI 286
AGG+ L+ + G VL+LL +L + SL PI+ ++ V+ + Y + EW+ +
Sbjct: 343 AGGSTLITITRIGIVLQLLELLSNSLESSSLKVPIFVISSVAEELLAYTNTIPEWLCEQR 402
Query: 287 TK---SFETSRDNAFLLK----HVTLLINKSELDNAP----DGPKLVLASMASLEAGFSH 335
+ S E S + +K H+ I+ L A P +V AS SL G S
Sbjct: 403 QEKLISGEPSFGHLKFIKNKKIHLFPAIHSPNLIYANRTSWQEPCIVFASHWSLRLGPSV 462
Query: 336 DIFVEWASDVKNLVLFTE 353
+ W D K+L++ +
Sbjct: 463 QLLQRWRGDPKSLLVLED 480
Score = 76 (31.8 bits), Expect = 1.4e-06, Sum P(4) = 1.4e-06
Identities = 21/49 (42%), Positives = 29/49 (59%)
Query: 52 ASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD 100
AS ID VL+S+P L LG LP+ + G A ++ TE ++G L M D
Sbjct: 100 ASFIDIVLISNPMGL-LG-LPFLTQNPGFFAKIYMTEVTAKIGQLMMED 146
Score = 53 (23.7 bits), Expect = 1.4e-06, Sum P(4) = 1.4e-06
Identities = 14/62 (22%), Positives = 29/62 (46%)
Query: 113 LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 172
L++LDDI+S + V + +++ +G +++ +G +G W I + Y
Sbjct: 199 LYSLDDIESCMKKVQGVKFAEEVCYNGT---LIIKALSSGLDIGACNWLINGPNGSLSYV 255
Query: 173 VD 174
D
Sbjct: 256 SD 257
Score = 43 (20.2 bits), Expect = 1.4e-06, Sum P(4) = 1.4e-06
Identities = 7/17 (41%), Positives = 12/17 (70%)
Query: 18 PLSYLVSIDGFNFLIDC 34
P +++++ GF LIDC
Sbjct: 15 PPCHMLNLCGFRILIDC 31
>TIGR_CMR|CHY_2049 [details] [associations]
symbol:CHY_2049 "metallo-beta-lactamase family protein"
species:246194 "Carboxydothermus hydrogenoformans Z-2901"
[GO:0003824 "catalytic activity" evidence=ISS] [GO:0008152
"metabolic process" evidence=ISS] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 EMBL:CP000141 GenomeReviews:CP000141_GR
GO:GO:0016787 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
Pfam:PF10996 SMART:SM01027 RefSeq:YP_360868.1
ProteinModelPortal:Q3AAG6 STRING:Q3AAG6 GeneID:3728507
KEGG:chy:CHY_2049 PATRIC:21277179 HOGENOM:HOG000244774 KO:K07576
OMA:GGRIVHH BioCyc:CHYD246194:GJCN-2048-MONOMER Uniprot:Q3AAG6
Length = 504
Score = 134 (52.2 bits), Expect = 1.6e-06, Sum P(2) = 1.6e-06
Identities = 64/281 (22%), Positives = 113/281 (40%)
Query: 21 YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVAST---IDAVLLSHPDTLHLGALPYAMKQ 77
YL ++ G FL+DCG + + I+ +LL+H H G +P +K+
Sbjct: 17 YLFNVAGHKFLVDCGLFQGPKAIKERNYGEFPFNPREIEFILLTHAHIDHSGLIPKLVKK 76
Query: 78 LGLSAPVFSTEPVYRLGLLTMYD----QYLS----RRQVSEFDLFTLDDIDSAFQSVTRL 129
G +++TEP L + + D Q + R++ L I +A + L
Sbjct: 77 -GFKGTIYATEPTVDLAAVMLPDSGHVQEMEVERKNRKLRRAGKPELQPIYTADDAFNAL 135
Query: 130 TYSQNYHLSGKGE---GIVVAPHVAGHLLGGTVWKITKDGED----VIYAVDYNRRKEKH 182
Y Q L G+ V AGH+LG + KI G+D +++ D R
Sbjct: 136 AYFQKIPLETPITPLPGLEVTFFDAGHILGSAMIKIAYKGQDATRTILFTGDLGRNGRPF 195
Query: 183 LNGTVLESFVRPAVLITDA-YNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRV 241
+ + +L+ ++ Y + + + I K R GN+++P + R
Sbjct: 196 MKEP--QKVPLTDILVLESTYGDRVRSEEGDLKTLLKSLIEKVYRRNGNLIIPAFAMERT 253
Query: 242 LELLLILEDYWAEHSLNYPIYFLTYVSSS-TIDYVKSFLEW 281
+L+ IL D E+ PI Y+ S ++ K F ++
Sbjct: 254 QDLIYILNDL-VENKEVPPID--VYIDSPLAVEITKLFKKY 291
Score = 57 (25.1 bits), Expect = 1.6e-06, Sum P(2) = 1.6e-06
Identities = 20/61 (32%), Positives = 35/61 (57%)
Query: 535 LVHGSAEATEHLKQHCL-KHVCPHVYTPQIEETIDVTSDLCAYKVQ-LSEKLMSNVLFKK 592
LVHG EA +LK+ K+ P Y P+ +ETI + ++L + L +K+++ + K+
Sbjct: 428 LVHGEDEARLNLKKLIEEKYRIP-CYLPRYQETISLLANLPGKSEEVLIDKVITLLKAKQ 486
Query: 593 L 593
L
Sbjct: 487 L 487
>UNIPROTKB|H0YBH8 [details] [associations]
symbol:INTS9 "Integrator complex subunit 9" species:9606
"Homo sapiens" [GO:0016180 "snRNA processing" evidence=IEA]
[GO:0032039 "integrator complex" evidence=IEA] InterPro:IPR027074
PANTHER:PTHR11203:SF2 EMBL:AC040975 EMBL:AC131969 HGNC:HGNC:25592
ChiTaRS:INTS9 Ensembl:ENST00000524081 Uniprot:H0YBH8
Length = 223
Score = 133 (51.9 bits), Expect = 3.9e-06, P = 3.9e-06
Identities = 36/120 (30%), Positives = 61/120 (50%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFD 112
ST+D +L+S+ + ALPY + G + V++TEP ++G L + +VS +
Sbjct: 86 STVDVILISNYHCMM--ALPYITEHTGFTGTVYATEPTVQIGRL-LPSPLKDAVEVSTWR 142
Query: 113 -LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIY 171
+T+ +++SA + + YSQ L G + V P +G+ LG + W I E V Y
Sbjct: 143 RCYTMQEVNSALSKIQLVGYSQKIELFG---AVQVTPLSSGYALGSSNWIIQSHYEKVSY 199
>UNIPROTKB|Q81SC3 [details] [associations]
symbol:BA_1737 "Metallo-beta-lactamase family protein"
species:1392 "Bacillus anthracis" [GO:0008150 "biological_process"
evidence=ND] InterPro:IPR001279 SMART:SM00849 Pfam:PF07521
EMBL:AE016879 EMBL:AE017334 GenomeReviews:AE016879_GR
GenomeReviews:AE017334_GR GO:GO:0016787 InterPro:IPR022712
InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 RefSeq:NP_844172.1
RefSeq:YP_018378.1 ProteinModelPortal:Q81SC3 IntAct:Q81SC3
DNASU:1086535 EnsemblBacteria:EBBACT00000009201
EnsemblBacteria:EBBACT00000014472 GeneID:1086535 GeneID:2817971
KEGG:ban:BA_1737 KEGG:bar:GBAA_1737 PATRIC:18781074
HOGENOM:HOG000087450 OMA:SQHERVN ProtClustDB:CLSK2516952
BioCyc:BANT261594:GJ7F-1754-MONOMER Uniprot:Q81SC3
Length = 419
Score = 140 (54.3 bits), Expect = 5.3e-06, Sum P(2) = 5.3e-06
Identities = 97/420 (23%), Positives = 172/420 (40%)
Query: 21 YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL 80
Y V L DCG N ++ S + +V ++AV LSH H LP K G
Sbjct: 17 YFVKNKETKILFDCGINRSYEDSYPKIEREVVPFLEAVFLSHIHEDHTMGLPLLAKY-GY 75
Query: 81 SAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGK 140
+++T Y L Y + V++ +D Q+V L Y +S
Sbjct: 76 KKKIWTTR--YTKEQLPAYYEKWRNYNVTQGWNVPYND-----QNVKDLNYIYVDEISNP 128
Query: 141 GEGIVVAPHV------AGHLLGGTVWKITKDGED-VIYAVDYNRRKEKHLNGTVLESFVR 193
E I + P + +GH+LG +VW + V Y+ DY+ E ++ L +R
Sbjct: 129 NEWIQITPTLRFQWGYSGHVLG-SVWFLVDMSHTYVFYSGDYSA--ESNILRANLPEKLR 185
Query: 194 PAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGN---VLLPVDSAGRVLELLLILED 250
+ + A H QRE + ++ RA GN LLP+ GR +++L L +
Sbjct: 186 GDIKVAIVDAAYHTDDV-SQRERVNELCTEIERAAGNKGIALLPLPPLGRAQDIVLYLYE 244
Query: 251 YWAEHSLNYPIYFLTYVSSSTID-YVKSFL--EWMGDSITKSFETSRDNAFLLKHVTLLI 307
+ E +PI V +D + + FL +W+ ++ K E ++ LK +++
Sbjct: 245 KYKE----FPII----VDQEILDGFDEMFLYKDWIKNN--KELEELMES---LKRNIIVM 291
Query: 308 NKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFT---ERGQFG--TLAR 362
+ G +V+ S A+++ + + + + +N ++FT +G F L
Sbjct: 292 DDDGGTQHSCG--IVVMSDANMQTKRAQLYYEQIRHEERNSIIFTGHVAKGSFAEKVLKE 349
Query: 363 MLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDN 422
+ + K V + + + V E L E T L ALK + ++ ++ G +N
Sbjct: 350 RIGKECRVKRVPYKVHQSIRDVKEMLNTLLPEHTVLV--HALKEDTDRLQKKLSTAGYEN 407
Score = 43 (20.2 bits), Expect = 5.3e-06, Sum P(2) = 5.3e-06
Identities = 14/39 (35%), Positives = 21/39 (53%)
Query: 531 ELTVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDV 569
E TVLVH E T+ L++ +VY+ +E I+V
Sbjct: 381 EHTVLVHALKEDTDRLQKKLSTAGYENVYSLTMER-IEV 418
>TIGR_CMR|BA_1737 [details] [associations]
symbol:BA_1737 "metallo-beta-lactamase family protein"
species:198094 "Bacillus anthracis str. Ames" [GO:0003824
"catalytic activity" evidence=ISS] [GO:0008150 "biological_process"
evidence=ND] InterPro:IPR001279 SMART:SM00849 Pfam:PF07521
EMBL:AE016879 EMBL:AE017334 GenomeReviews:AE016879_GR
GenomeReviews:AE017334_GR GO:GO:0016787 InterPro:IPR022712
InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 RefSeq:NP_844172.1
RefSeq:YP_018378.1 ProteinModelPortal:Q81SC3 IntAct:Q81SC3
DNASU:1086535 EnsemblBacteria:EBBACT00000009201
EnsemblBacteria:EBBACT00000014472 GeneID:1086535 GeneID:2817971
KEGG:ban:BA_1737 KEGG:bar:GBAA_1737 PATRIC:18781074
HOGENOM:HOG000087450 OMA:SQHERVN ProtClustDB:CLSK2516952
BioCyc:BANT261594:GJ7F-1754-MONOMER Uniprot:Q81SC3
Length = 419
Score = 140 (54.3 bits), Expect = 5.3e-06, Sum P(2) = 5.3e-06
Identities = 97/420 (23%), Positives = 172/420 (40%)
Query: 21 YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL 80
Y V L DCG N ++ S + +V ++AV LSH H LP K G
Sbjct: 17 YFVKNKETKILFDCGINRSYEDSYPKIEREVVPFLEAVFLSHIHEDHTMGLPLLAKY-GY 75
Query: 81 SAPVFSTEPVYRLGLLTMYDQYLSRRQVSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGK 140
+++T Y L Y + V++ +D Q+V L Y +S
Sbjct: 76 KKKIWTTR--YTKEQLPAYYEKWRNYNVTQGWNVPYND-----QNVKDLNYIYVDEISNP 128
Query: 141 GEGIVVAPHV------AGHLLGGTVWKITKDGED-VIYAVDYNRRKEKHLNGTVLESFVR 193
E I + P + +GH+LG +VW + V Y+ DY+ E ++ L +R
Sbjct: 129 NEWIQITPTLRFQWGYSGHVLG-SVWFLVDMSHTYVFYSGDYSA--ESNILRANLPEKLR 185
Query: 194 PAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGN---VLLPVDSAGRVLELLLILED 250
+ + A H QRE + ++ RA GN LLP+ GR +++L L +
Sbjct: 186 GDIKVAIVDAAYHTDDV-SQRERVNELCTEIERAAGNKGIALLPLPPLGRAQDIVLYLYE 244
Query: 251 YWAEHSLNYPIYFLTYVSSSTID-YVKSFL--EWMGDSITKSFETSRDNAFLLKHVTLLI 307
+ E +PI V +D + + FL +W+ ++ K E ++ LK +++
Sbjct: 245 KYKE----FPII----VDQEILDGFDEMFLYKDWIKNN--KELEELMES---LKRNIIVM 291
Query: 308 NKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFT---ERGQFG--TLAR 362
+ G +V+ S A+++ + + + + +N ++FT +G F L
Sbjct: 292 DDDGGTQHSCG--IVVMSDANMQTKRAQLYYEQIRHEERNSIIFTGHVAKGSFAEKVLKE 349
Query: 363 MLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDN 422
+ + K V + + + V E L E T L ALK + ++ ++ G +N
Sbjct: 350 RIGKECRVKRVPYKVHQSIRDVKEMLNTLLPEHTVLV--HALKEDTDRLQKKLSTAGYEN 407
Score = 43 (20.2 bits), Expect = 5.3e-06, Sum P(2) = 5.3e-06
Identities = 14/39 (35%), Positives = 21/39 (53%)
Query: 531 ELTVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDV 569
E TVLVH E T+ L++ +VY+ +E I+V
Sbjct: 381 EHTVLVHALKEDTDRLQKKLSTAGYENVYSLTMER-IEV 418
>UNIPROTKB|E5RG70 [details] [associations]
symbol:INTS9 "Integrator complex subunit 9" species:9606
"Homo sapiens" [GO:0016180 "snRNA processing" evidence=IEA]
[GO:0032039 "integrator complex" evidence=IEA] InterPro:IPR027074
PANTHER:PTHR11203:SF2 EMBL:AC040975 EMBL:AC131969 HGNC:HGNC:25592
ChiTaRS:INTS9 IPI:IPI00974179 ProteinModelPortal:E5RG70 SMR:E5RG70
Ensembl:ENST00000523436 ArrayExpress:E5RG70 Bgee:E5RG70
Uniprot:E5RG70
Length = 300
Score = 96 (38.9 bits), Expect = 1.2e-05, Sum P(3) = 1.2e-05
Identities = 22/65 (33%), Positives = 40/65 (61%)
Query: 217 FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLN-YPIYFLTYVSSSTIDYV 275
F ++ T+R GGNVL+P +G + +LL L Y L+ P+YF++ V++S++++
Sbjct: 236 FCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYIDSAGLSSVPLYFISPVANSSLEFS 295
Query: 276 KSFLE 280
+ F E
Sbjct: 296 QIFAE 300
Score = 81 (33.6 bits), Expect = 1.2e-05, Sum P(3) = 1.2e-05
Identities = 17/46 (36%), Positives = 28/46 (60%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTM 98
ST+D +L+S+ + ALPY + G + V++TEP ++G L M
Sbjct: 94 STVDVILISNYHCMM--ALPYITEHTGFTGTVYATEPTVQIGRLLM 137
Score = 39 (18.8 bits), Expect = 1.2e-05, Sum P(3) = 1.2e-05
Identities = 6/20 (30%), Positives = 13/20 (65%)
Query: 114 FTLDDIDSAFQSVTRLTYSQ 133
+T+ +++SA + + YSQ
Sbjct: 182 YTMQEVNSALSKIQLVGYSQ 201
>UNIPROTKB|Q8EJC6 [details] [associations]
symbol:SO_0541 "RNA-metabolizing metallo-beta-lactamase
family protein" species:211586 "Shewanella oneidensis MR-1"
[GO:0008150 "biological_process" evidence=ND] InterPro:IPR001279
Pfam:PF00753 SMART:SM00849 Pfam:PF07521 GO:GO:0016787 EMBL:AE014299
GenomeReviews:AE014299_GR InterPro:IPR022712 InterPro:IPR011108
Pfam:PF10996 SMART:SM01027 OMA:MAVEYMS HOGENOM:HOG000244774
KO:K07576 RefSeq:NP_716177.2 ProteinModelPortal:Q8EJC6
DNASU:1168409 GeneID:1168409 KEGG:son:SO_0541 PATRIC:23520762
ProtClustDB:CLSK2516780 Uniprot:Q8EJC6
Length = 480
Score = 141 (54.7 bits), Expect = 2.7e-05, Sum P(2) = 2.7e-05
Identities = 63/228 (27%), Positives = 104/228 (45%)
Query: 54 TIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRL--------GLLTMYDQYLSR 105
TI AV+LSH H G LP +K G P+++ + L +L + D +
Sbjct: 55 TIVAVVLSHAHIDHSGRLPLLVKA-GFDGPIYTHKATAELCAIMLKDAAMLQVRDTERTN 113
Query: 106 RQVSEFDLFTLDD---IDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHV------AGHLLG 156
++ ++ DL L+ ++ A Q++++ S Y G+ V PHV AGH+LG
Sbjct: 114 KKRAKHDLAPLEPLFTVEDAEQAISQFV-SLEY-----GQVTRVIPHVDICLSDAGHILG 167
Query: 157 GTVWKIT----KDGEDVIYAVDYNRRKEKHL-NGTVLESFVRPAVLITDAY-NALHNQPP 210
+ ++ K + ++++ D R L N T++++ VL+ Y N H
Sbjct: 168 SALVELWLGEGKSQKKIVFSGDLGRAGMPILQNPTLVDT--ADLVLMESTYGNRFHRSWT 225
Query: 211 RQQREMFQDAISKTLRAG-GNVLLPVDSAGRVLELLLILEDYWAEHSL 257
E+ +D +KT+ GN+LLP S GR ELL + Y E L
Sbjct: 226 DTLAEL-KDIFAKTVNESQGNILLPAFSVGRAQELLYLFHLYAKEWDL 272
Score = 37 (18.1 bits), Expect = 2.7e-05, Sum P(2) = 2.7e-05
Identities = 8/13 (61%), Positives = 9/13 (69%)
Query: 534 VLVHGSAEATEHL 546
VLVHG EA + L
Sbjct: 431 VLVHGEPEAQQGL 443
>TIGR_CMR|SO_0541 [details] [associations]
symbol:SO_0541 "metallo-beta-lactamase family protein"
species:211586 "Shewanella oneidensis MR-1" [GO:0008150
"biological_process" evidence=ND] [GO:0003824 "catalytic activity"
evidence=ISS] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 GO:GO:0016787 EMBL:AE014299 GenomeReviews:AE014299_GR
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
OMA:MAVEYMS HOGENOM:HOG000244774 KO:K07576 RefSeq:NP_716177.2
ProteinModelPortal:Q8EJC6 DNASU:1168409 GeneID:1168409
KEGG:son:SO_0541 PATRIC:23520762 ProtClustDB:CLSK2516780
Uniprot:Q8EJC6
Length = 480
Score = 141 (54.7 bits), Expect = 2.7e-05, Sum P(2) = 2.7e-05
Identities = 63/228 (27%), Positives = 104/228 (45%)
Query: 54 TIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRL--------GLLTMYDQYLSR 105
TI AV+LSH H G LP +K G P+++ + L +L + D +
Sbjct: 55 TIVAVVLSHAHIDHSGRLPLLVKA-GFDGPIYTHKATAELCAIMLKDAAMLQVRDTERTN 113
Query: 106 RQVSEFDLFTLDD---IDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHV------AGHLLG 156
++ ++ DL L+ ++ A Q++++ S Y G+ V PHV AGH+LG
Sbjct: 114 KKRAKHDLAPLEPLFTVEDAEQAISQFV-SLEY-----GQVTRVIPHVDICLSDAGHILG 167
Query: 157 GTVWKIT----KDGEDVIYAVDYNRRKEKHL-NGTVLESFVRPAVLITDAY-NALHNQPP 210
+ ++ K + ++++ D R L N T++++ VL+ Y N H
Sbjct: 168 SALVELWLGEGKSQKKIVFSGDLGRAGMPILQNPTLVDT--ADLVLMESTYGNRFHRSWT 225
Query: 211 RQQREMFQDAISKTLRAG-GNVLLPVDSAGRVLELLLILEDYWAEHSL 257
E+ +D +KT+ GN+LLP S GR ELL + Y E L
Sbjct: 226 DTLAEL-KDIFAKTVNESQGNILLPAFSVGRAQELLYLFHLYAKEWDL 272
Score = 37 (18.1 bits), Expect = 2.7e-05, Sum P(2) = 2.7e-05
Identities = 8/13 (61%), Positives = 9/13 (69%)
Query: 534 VLVHGSAEATEHL 546
VLVHG EA + L
Sbjct: 431 VLVHGEPEAQQGL 443
>UNIPROTKB|E9PQF0 [details] [associations]
symbol:CPSF3L "Integrator complex subunit 11" species:9606
"Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 GO:GO:0016787 EMBL:AL139287
HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI00982774
ProteinModelPortal:E9PQF0 SMR:E9PQF0 Ensembl:ENST00000498476
ArrayExpress:E9PQF0 Bgee:E9PQF0 Uniprot:E9PQF0
Length = 167
Score = 116 (45.9 bits), Expect = 5.5e-05, P = 5.5e-05
Identities = 29/86 (33%), Positives = 45/86 (52%)
Query: 22 LVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
LVSI G N ++DCG +ND F D S + ++ +D V++SH H GALPY
Sbjct: 81 LVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHFHLDHCGALPYF 140
Query: 75 MKQLGLSAPVFSTEPVYRLGLLTMYD 100
+ +G P++ T P + + + D
Sbjct: 141 SEMVGYDGPIYMTHPTQAICPILLED 166
>TIGR_CMR|DET_1061 [details] [associations]
symbol:DET_1061 "metallo-beta-lactamase family protein"
species:243164 "Dehalococcoides ethenogenes 195" [GO:0003824
"catalytic activity" evidence=ISS] [GO:0008152 "metabolic process"
evidence=ISS] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 GO:GO:0016787 EMBL:CP000027 GenomeReviews:CP000027_GR
eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 OMA:MAVEYMS HOGENOM:HOG000244774 KO:K07576
RefSeq:YP_181776.1 ProteinModelPortal:Q3Z7M3 STRING:Q3Z7M3
GeneID:3229629 KEGG:det:DET1061 PATRIC:21609167
ProtClustDB:CLSK2516599 BioCyc:DETH243164:GJNF-1062-MONOMER
Uniprot:Q3Z7M3
Length = 468
Score = 129 (50.5 bits), Expect = 7.3e-05, P = 7.3e-05
Identities = 83/373 (22%), Positives = 148/373 (39%)
Query: 46 QPLSKVASTIDAVLLSHPDTLHLGALPYAMKQ--LGLSAPVFSTEPVYRLGL-----LTM 98
QP ++ AV++SH H G LP +K+ G +T + R+ L L
Sbjct: 46 QPFEIPPQSLSAVIISHAHIDHCGLLPKLVKEGFAGPVFATEATAEIARISLTDAGKLQE 105
Query: 99 YDQYLSRRQ---------VSEFDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPH 149
D +++ E L+T +D + + YS+ ++ E I H
Sbjct: 106 EDAAFKKKRHEREGRKTKYPEIPLYTAEDARAVSPLFKTVEYSREIAVT---EDITATFH 162
Query: 150 VAGHLLGGTV--WKITKDGED--VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNAL 205
AGH+ G KI ++ ++++ D L L + V+I Y
Sbjct: 163 NAGHVFGSASIELKIQENHRQKVIVFSGDLGNWDRPILKNPDLVNQA-DYVVIESTYGDR 221
Query: 206 HNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLT 265
+Q + + I++T++ GGN+++P + R +LL L + +E + P +
Sbjct: 222 THQDINEASLKLAEIINQTVKLGGNIVIPSFALERTQDLLFFLNRFMSEGKI--PSLKVF 279
Query: 266 YVSSSTIDYVKSFLEW--MGDSITKSFETSRDNAFLLK--HVTLLINKSELDNAPDGPKL 321
S I K F E + D T + + + F + H T S+ A P +
Sbjct: 280 VDSPMAISITKIFKEHPELYDRETSGWVNNGSSPFEFEGLHFTNKAADSKAILAEKDPCI 339
Query: 322 VLASMASLEAG-FSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRR 380
++A G H + V S ++ +LF GTL R++ D K V++ + +
Sbjct: 340 IIAGSGMCTGGRIKHHL-VNNISRPESTILFVGFQATGTLGRLI-TDGA-KEVRI-LGQH 395
Query: 381 VPLVG--EELIAY 391
P+ EEL A+
Sbjct: 396 YPVQARIEELRAF 408
>UNIPROTKB|E2QVB2 [details] [associations]
symbol:INTS9 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0032039 "integrator complex" evidence=IEA]
[GO:0016180 "snRNA processing" evidence=IEA] InterPro:IPR027074
InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 GO:GO:0032039
GO:GO:0016180 PANTHER:PTHR11203:SF2 Ensembl:ENSCAFT00000013124
Uniprot:E2QVB2
Length = 409
Score = 127 (49.8 bits), Expect = 9.6e-05, P = 9.6e-05
Identities = 52/170 (30%), Positives = 77/170 (45%)
Query: 196 VLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEH 255
VLI + P F ++ T+R GGNVL+P +G + +LL L Y
Sbjct: 11 VLILTGLTQIPTANPDGMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYIDSA 70
Query: 256 SL-NYPIYFLTYVSSSTIDYVKSFLEWM-GDSITKSF--ETSRDNAFL-----LKHVTLL 306
L N P YF++ V++S++++ + F EW+ + TK + E +A L LKH L
Sbjct: 71 GLSNIPFYFISPVANSSLEFSQIFAEWLCHNKQTKVYLPEPPFPHAELIQTNKLKHYPSL 130
Query: 307 INKSELDNAPDGPKLVLASMASLEAGFSHDI--FVE-WASDVKNLVLFTE 353
D P +V SL G D+ F+E W N V+FTE
Sbjct: 131 HGDFSSDFRQ--PCVVFTGHPSLRFG---DVVHFMELWGKSSLNTVIFTE 175
>TIGR_CMR|CPS_2623 [details] [associations]
symbol:CPS_2623 "metallo-beta-lactamase family protein"
species:167879 "Colwellia psychrerythraea 34H" [GO:0003824
"catalytic activity" evidence=ISS] [GO:0008152 "metabolic process"
evidence=ISS] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 GO:GO:0016787 EMBL:CP000083 GenomeReviews:CP000083_GR
eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 HOGENOM:HOG000244774 KO:K07576 RefSeq:YP_269337.1
ProteinModelPortal:Q481D2 STRING:Q481D2 GeneID:3521490
KEGG:cps:CPS_2623 PATRIC:21468305 OMA:HGPMVII
ProtClustDB:CLSK2524370 BioCyc:CPSY167879:GI48-2685-MONOMER
Uniprot:Q481D2
Length = 451
Score = 110 (43.8 bits), Expect = 0.00018, Sum P(2) = 0.00018
Identities = 62/279 (22%), Positives = 114/279 (40%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFD---PSLLQPLSKVASTIDAVLLS 61
+ +T L G Y V L+DCG + +PL ++DA++L+
Sbjct: 1 MNITFLGGTGTVTGSKYFVETSTTKILVDCGLYQGYKWLRARNREPLPLDLKSLDAIVLT 60
Query: 62 HPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQ----------Y----LSRRQ 107
H H G +P KQ G V++ + L + + D Y +SR +
Sbjct: 61 HAHLDHSGFIPALYKQ-GFRGHVYAHQATISLCSILLPDSGHIQEDDAKFYGKHKISRHE 119
Query: 108 VSE--FDLFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKD 165
E +D T + S F++V +++ + + G+ I + AGH+LG + D
Sbjct: 120 NPEPLYDKATAEACLSLFKAVD---FNEEFKI---GD-IEIELQSAGHILGAASVILKAD 172
Query: 166 GEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAY-NALHNQPPRQQREMFQDAISKT 224
G+ V ++ D R + + V +L+ Y N LH++ E + ++ T
Sbjct: 173 GKRVGFSGDVGRPDDIIMYPPKPLPPV-DLLLLESTYGNRLHDK--EDAFEQLAEIVNST 229
Query: 225 LRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL-NYPIY 262
+ GG +L+P + GR + +L + + P+Y
Sbjct: 230 AKKGGALLIPSFAVGRTEAVQHMLASLMKKELIPKLPVY 268
Score = 61 (26.5 bits), Expect = 0.00018, Sum P(2) = 0.00018
Identities = 11/29 (37%), Positives = 18/29 (62%)
Query: 525 SKVVSNELTVLVHGSAEATEHLKQHCLKH 553
SK+ +LVHG EA+E ++ H ++H
Sbjct: 407 SKLHPKTKVLLVHGEPEASESMRDHLMQH 435
>UNIPROTKB|C9JZH6 [details] [associations]
symbol:CPSF3 "Cleavage and polyadenylation-specificity
factor subunit 3" species:9606 "Homo sapiens" [GO:0003723 "RNA
binding" evidence=IEA] [GO:0004521 "endoribonuclease activity"
evidence=IEA] [GO:0008409 "5'-3' exonuclease activity"
evidence=IEA] InterPro:IPR001279 Pfam:PF00753 GO:GO:0003723
GO:GO:0004521 GO:GO:0008409 EMBL:AC080162 HGNC:HGNC:2326
ChiTaRS:CPSF3 IPI:IPI00807384 ProteinModelPortal:C9JZH6 SMR:C9JZH6
STRING:C9JZH6 Ensembl:ENST00000475482 HOGENOM:HOG000191757
ArrayExpress:C9JZH6 Bgee:C9JZH6 Uniprot:C9JZH6
Length = 136
Score = 102 (41.0 bits), Expect = 0.00019, P = 0.00019
Identities = 36/138 (26%), Positives = 66/138 (47%)
Query: 31 LIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF--- 85
++DCG + + P + + ID +L+SH H GALP+ +++ F
Sbjct: 1 MLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTSFKGRTFMTH 60
Query: 86 STEPVYRLGLLTMYDQYLSRRQVSEFD-LFTLDDIDSAFQSVTRLTYSQNYHLSGKGEGI 144
+T+ +YR LL+ Y+ +S D L+T D++ + + + N+H + GI
Sbjct: 61 ATKAIYRW-LLS---DYVKVSNISADDMLYTETDLEESMDKIETI----NFHEVKEVAGI 112
Query: 145 VVAPHVAGHLLGGTVWKI 162
+ AGH+LG ++ I
Sbjct: 113 KFWCYHAGHVLGAAMFMI 130
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.317 0.136 0.398 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 706 700 0.00082 121 3 11 22 0.42 34
36 0.45 37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 96
No. of states in DFA: 621 (66 KB)
Total size of DFA: 365 KB (2181 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 65.79u 0.12s 65.91t Elapsed: 00:00:03
Total cpu time: 65.82u 0.12s 65.94t Elapsed: 00:00:03
Start: Tue May 21 05:13:52 2013 End: Tue May 21 05:13:55 2013
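
The bit values shown in parentheses on each "Score =" line can be reproduced from the raw scores with the usual Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2. The sketch below assumes that the Lambda and K on the "Q=9,R=2" row of the Parameters table (0.244 and 0.0300) are the values the program applied. The report does not say so explicitly, but the arithmetic agrees with the reported bits for the hits checked here. The function and constant names are illustrative only and are not part of any BLAST tool.

```python
import math

# Lambda and K copied from the "Q=9,R=2" row of the Parameters table.
# Assumption: these are the values used for the bit scores in this report.
LAMBDA_GAPPED = 0.244
K_GAPPED = 0.0300


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    # Raw scores taken from "Score =" lines in the report above.
    for raw in (160, 135, 132, 43):
        print(f"Score = {raw} ({bit_score(raw):.1f} bits)")
```

For example, the VC_0264 hit with raw score 160 comes out at about 61.4 bits and the E9PIL7 hit with raw score 135 at about 52.6 bits, matching the values printed in the report.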