Your job contains 1 sequence.
>004964
MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL
SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRSVTRLTYSQNYHLS
GKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLI
TDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLN
YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDG
PKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMS
RRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDANNANA
SADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKDEDMD
QAAMHIGGDDGKLDEGSASLILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTIL
SHVAPLKLVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLM
SNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPPHKSVLVGDLKMADLKPFLSSK
GIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYL
L
The BLAST search returned 1 gene product which did not match your query constraints. Please see the full BLAST report below for the details.
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 004964
(721 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Sequences producing High-scoring Segment Pairs:   High Score   Smallest Sum Probability P(N)   N
TAIR|locus:2172843 - symbol:CPSF100 "cleavage and polyade... 2527 1.4e-313 2
UNIPROTKB|Q9P2I0 - symbol:CPSF2 "Cleavage and polyadenyla... 922 5.6e-137 3
UNIPROTKB|Q10568 - symbol:CPSF2 "Cleavage and polyadenyla... 920 9.2e-137 3
UNIPROTKB|E2R496 - symbol:CPSF2 "Uncharacterized protein"... 919 1.2e-136 3
UNIPROTKB|Q9W799 - symbol:cpsf2 "Cleavage and polyadenyla... 938 3.1e-136 3
RGD|1309687 - symbol:Cpsf2 "cleavage and polyadenylation ... 918 3.1e-136 3
MGI|MGI:1861601 - symbol:Cpsf2 "cleavage and polyadenylat... 918 3.9e-136 3
UNIPROTKB|F1NMN0 - symbol:CPSF2 "Uncharacterized protein"... 918 1.3e-135 3
ZFIN|ZDB-GENE-040718-79 - symbol:cpsf2 "cleavage and poly... 923 1.7e-135 3
FB|FBgn0027873 - symbol:Cpsf100 "Cleavage and polyadenyla... 929 2.8e-120 2
DICTYBASE|DDB_G0270392 - symbol:cpsf2 "cleavage and polya... 800 6.5e-118 3
WB|WBGene00017313 - symbol:cpsf-2 species:6239 "Caenorhab... 474 1.1e-97 4
UNIPROTKB|O17403 - symbol:cpsf-2 "Probable cleavage and p... 474 1.1e-97 4
POMBASE|SPBC1709.15c - symbol:cft2 "cleavage factor two C... 563 2.6e-89 3
UNIPROTKB|F1SD85 - symbol:CPSF2 "Uncharacterized protein"... 573 9.2e-86 2
UNIPROTKB|G4N6C6 - symbol:MGG_06570 "Uncharacterized prot... 213 9.6e-44 6
TAIR|locus:2206076 - symbol:CPSF73-I "cleavage and polyad... 403 8.6e-41 2
ASPGD|ASPL0000040420 - symbol:AN3082 species:162425 "Emer... 172 3.2e-38 6
UNIPROTKB|F1NV30 - symbol:CPSF3L "Integrator complex subu... 358 7.3e-35 2
UNIPROTKB|Q5TA45 - symbol:CPSF3L "Integrator complex subu... 355 8.1e-35 2
UNIPROTKB|G3V1S5 - symbol:CPSF3L "Integrator complex subu... 355 9.0e-35 2
UNIPROTKB|Q5ZIH0 - symbol:CPSF3L "Integrator complex subu... 358 9.4e-35 2
UNIPROTKB|E1B7Q9 - symbol:CPSF3L "Integrator complex subu... 354 1.7e-34 2
UNIPROTKB|Q2YDM2 - symbol:CPSF3L "Integrator complex subu... 351 4.0e-34 2
MGI|MGI:1919207 - symbol:Cpsf3l "cleavage and polyadenyla... 356 4.3e-34 2
RGD|1306841 - symbol:Cpsf3l "cleavage and polyadenylation... 356 4.3e-34 2
UNIPROTKB|F1SD84 - symbol:LOC100625560 "Uncharacterized p... 252 8.6e-34 2
UNIPROTKB|E2QY53 - symbol:CPSF3L "Uncharacterized protein... 348 9.1e-34 2
UNIPROTKB|F1RJE8 - symbol:CPSF3L "Uncharacterized protein... 349 1.4e-33 2
POMBASE|SPAC17G6.16c - symbol:ysh1 "mRNA cleavage and pol... 394 1.6e-33 1
FB|FBgn0039691 - symbol:IntS11 "Integrator 11" species:72... 351 5.5e-33 2
CGD|CAL0004705 - symbol:orf19.325 species:5476 "Candida a... 285 1.9e-32 6
UNIPROTKB|Q5AEE3 - symbol:CFT2 "Putative uncharacterized ... 285 1.9e-32 6
SGD|S000004105 - symbol:CFT2 "Subunit of the mRNA cleavag... 253 2.5e-32 4
SGD|S000004267 - symbol:YSH1 "Putative endoribonuclease" ... 347 2.1e-31 3
DICTYBASE|DDB_G0278189 - symbol:ints11 "integrator comple... 324 3.1e-31 2
ZFIN|ZDB-GENE-030131-3275 - symbol:cpsf3 "cleavage and po... 372 3.3e-31 1
FB|FBgn0261065 - symbol:Cpsf73 "Cleavage and polyadenylat... 369 6.8e-31 1
ZFIN|ZDB-GENE-050522-13 - symbol:cpsf3l "cleavage and pol... 246 8.3e-31 3
UNIPROTKB|P79101 - symbol:CPSF3 "Cleavage and polyadenyla... 366 1.5e-30 1
UNIPROTKB|Q9UKF6 - symbol:CPSF3 "Cleavage and polyadenyla... 366 1.5e-30 1
UNIPROTKB|F1NKW5 - symbol:CPSF3 "Uncharacterized protein"... 366 1.5e-30 1
UNIPROTKB|E2R7R2 - symbol:CPSF3 "Uncharacterized protein"... 366 1.7e-30 1
UNIPROTKB|H0YJF4 - symbol:CPSF2 "Cleavage and polyadenyla... 221 3.0e-30 3
MGI|MGI:1859328 - symbol:Cpsf3 "cleavage and polyadenylat... 363 3.2e-30 1
RGD|1305767 - symbol:Cpsf3 "cleavage and polyadenylation ... 363 3.2e-30 1
UNIPROTKB|G3V6W7 - symbol:Cpsf3 "Protein Cpsf3" species:1... 363 3.2e-30 1
UNIPROTKB|G5E9W3 - symbol:CPSF3 "Cleavage and polyadenyla... 361 4.4e-30 1
DICTYBASE|DDB_G0274799 - symbol:cpsf3 "cleavage and polya... 326 9.2e-29 2
UNIPROTKB|I3LKR1 - symbol:CPSF3 "Uncharacterized protein"... 324 1.4e-28 2
GENEDB_PFALCIPARUM|PFC0825c - symbol:PFC0825c "cleavage a... 273 2.0e-27 3
UNIPROTKB|O77371 - symbol:PFC0825c "Cleavage and polyaden... 273 2.0e-27 3
TAIR|locus:2065368 - symbol:CPSF73-II "AT2G01730" species... 296 6.1e-27 2
ASPGD|ASPL0000060573 - symbol:AN0990 species:162425 "Emer... 299 6.4e-27 3
WB|WBGene00013460 - symbol:cpsf-3 species:6239 "Caenorhab... 316 2.3e-26 2
WB|WBGene00008642 - symbol:F10B5.8 species:6239 "Caenorha... 298 4.9e-26 2
CGD|CAL0005344 - symbol:orf19.5486 species:5476 "Candida ... 293 1.4e-24 2
UNIPROTKB|Q59P50 - symbol:YSH1 "Endoribonuclease YSH1" sp... 293 1.4e-24 2
GENEDB_PFALCIPARUM|PF14_0364 - symbol:PF14_0364 "cleavage... 244 6.8e-24 3
UNIPROTKB|Q8IL83 - symbol:PF14_0364 "Cleavage and polyade... 244 6.8e-24 3
UNIPROTKB|C9J979 - symbol:CPSF3L "Integrator complex subu... 178 2.2e-19 2
UNIPROTKB|G3V3T7 - symbol:CPSF2 "Cleavage and polyadenyla... 236 8.6e-19 1
UNIPROTKB|Q5ZKK2 - symbol:INTS9 "Integrator complex subun... 165 2.7e-12 3
UNIPROTKB|E9PI75 - symbol:CPSF3L "Integrator complex subu... 172 6.3e-12 1
UNIPROTKB|F6XI08 - symbol:INTS9 "Uncharacterized protein"... 163 9.2e-12 2
UNIPROTKB|F1RJQ5 - symbol:INTS9 "Uncharacterized protein"... 161 1.0e-11 2
UNIPROTKB|F1MMA6 - symbol:INTS9 "Integrator complex subun... 162 1.2e-11 2
UNIPROTKB|Q2KJA6 - symbol:INTS9 "Integrator complex subun... 162 1.2e-11 2
ZFIN|ZDB-GENE-061013-129 - symbol:ints9 "integrator compl... 160 1.9e-11 3
UNIPROTKB|G3XAN1 - symbol:INTS9 "Integrator complex subun... 157 2.1e-11 2
MGI|MGI:1098533 - symbol:Ints9 "integrator complex subuni... 158 3.1e-11 3
UNIPROTKB|Q9NV88 - symbol:INTS9 "Integrator complex subun... 157 4.1e-11 2
DICTYBASE|DDB_G0282473 - symbol:ints9 "integrator complex... 189 4.3e-11 1
RGD|1311539 - symbol:Ints9 "integrator complex subunit 9"... 156 6.5e-11 3
UNIPROTKB|E9PIG1 - symbol:CPSF3L "Integrator complex subu... 170 1.9e-10 1
WB|WBGene00017608 - symbol:F19F10.12 species:6239 "Caenor... 151 9.2e-10 2
UNIPROTKB|H7BYQ6 - symbol:INTS9 "Integrator complex subun... 157 1.1e-09 2
TIGR_CMR|DET_1061 - symbol:DET_1061 "metallo-beta-lactama... 115 3.7e-09 2
FB|FBgn0036570 - symbol:IntS9 "Integrator 9" species:7227... 129 1.6e-08 2
UNIPROTKB|E9PNS4 - symbol:CPSF3L "Integrator complex subu... 157 1.7e-08 1
TIGR_CMR|CHY_2049 - symbol:CHY_2049 "metallo-beta-lactama... 86 1.8e-08 3
UNIPROTKB|Q81SC3 - symbol:BA_1737 "Metallo-beta-lactamase... 142 7.7e-08 2
TIGR_CMR|BA_1737 - symbol:BA_1737 "metallo-beta-lactamase... 142 7.7e-08 2
UNIPROTKB|G3V5T3 - symbol:CPSF2 "Cleavage and polyadenyla... 132 1.2e-07 1
UNIPROTKB|E9PIL7 - symbol:CPSF3L "Integrator complex subu... 130 2.0e-07 1
UNIPROTKB|E5RG70 - symbol:INTS9 "Integrator complex subun... 138 3.1e-06 1
UNIPROTKB|Q8EJC6 - symbol:SO_0541 "RNA-metabolizing metal... 98 1.9e-05 3
TIGR_CMR|SO_0541 - symbol:SO_0541 "metallo-beta-lactamase... 98 1.9e-05 3
UNIPROTKB|Q9KV92 - symbol:VC_0264 "Putative uncharacteriz... 134 2.0e-05 1
TIGR_CMR|VC_0264 - symbol:VC_0264 "conserved hypothetical... 134 2.0e-05 1
TAIR|locus:2079696 - symbol:AT3G07530 "AT3G07530" species... 107 3.4e-05 3
UNIPROTKB|E9PQF0 - symbol:CPSF3L "Integrator complex subu... 116 5.7e-05 1
UNIPROTKB|E2QVB2 - symbol:INTS9 "Uncharacterized protein"... 127 9.9e-05 1
TIGR_CMR|CPS_2623 - symbol:CPS_2623 "metallo-beta-lactama... 74 0.00086 3
>TAIR|locus:2172843
symbol:CPSF100 "cleavage and polyadenylation specificity
factor 100" species:3702 "Arabidopsis thaliana" [GO:0005634
"nucleus" evidence=ISM;IDA] [GO:0009793 "embryo development ending
in seed dormancy" evidence=NAS] [GO:0016787 "hydrolase activity"
evidence=IEA] [GO:0003677 "DNA binding" evidence=IPI] [GO:0005515
"protein binding" evidence=IPI] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=ISS;NAS]
[GO:0006378 "mRNA polyadenylation" evidence=ISS] [GO:0006379 "mRNA
cleavage" evidence=ISS] [GO:0035194 "posttranscriptional gene
silencing by RNA" evidence=IMP] [GO:0009506 "plasmodesma"
evidence=IDA] [GO:0000278 "mitotic cell cycle" evidence=RCA]
[GO:0006306 "DNA methylation" evidence=RCA] [GO:0006342 "chromatin
silencing" evidence=RCA] [GO:0006396 "RNA processing" evidence=RCA]
[GO:0006397 "mRNA processing" evidence=RCA] [GO:0007267 "cell-cell
signaling" evidence=RCA] [GO:0009220 "pyrimidine ribonucleotide
biosynthetic process" evidence=RCA] [GO:0009616 "virus induced gene
silencing" evidence=RCA] [GO:0009640 "photomorphogenesis"
evidence=RCA] [GO:0010267 "production of ta-siRNAs involved in RNA
interference" evidence=RCA] [GO:0010388 "cullin deneddylation"
evidence=RCA] [GO:0016569 "covalent chromatin modification"
evidence=RCA] [GO:0031047 "gene silencing by RNA" evidence=RCA]
[GO:0035196 "production of miRNAs involved in gene silencing by
miRNA" evidence=RCA] InterPro:IPR001279 InterPro:IPR027075
Pfam:PF00753 SMART:SM00849 GO:GO:0009506 EMBL:CP002688
GenomeReviews:BA000015_GR GO:GO:0006378 EMBL:AB005244 GO:GO:0003723
GO:GO:0016787 GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236 KO:K14402
InterPro:IPR022712 InterPro:IPR025069 PANTHER:PTHR11203:SF5
Pfam:PF10996 Pfam:PF13299 SMART:SM01027 EMBL:AF283277 EMBL:AY034982
EMBL:BT004374 IPI:IPI00521104 RefSeq:NP_197776.1 UniGene:At.25191
ProteinModelPortal:Q9LKF9 SMR:Q9LKF9 IntAct:Q9LKF9 STRING:Q9LKF9
PaxDb:Q9LKF9 PRIDE:Q9LKF9 EnsemblPlants:AT5G23880.1 GeneID:832453
KEGG:ath:AT5G23880 TAIR:At5g23880 HOGENOM:HOG000264343
InParanoid:Q9LKF9 OMA:NNPFQFK PhylomeDB:Q9LKF9
ProtClustDB:CLSN2686300 Genevestigator:Q9LKF9 GermOnline:AT5G23880
GO:GO:0035194 Uniprot:Q9LKF9
Length = 739
Score = 2527 (894.6 bits), Expect = 1.4e-313, Sum P(2) = 1.4e-313
Identities = 494/666 (74%), Positives = 566/666 (84%)
Query: 66 LHLGALPYAMK---QLGLSA---PVFSTEPVYRLGLLTMYDQYLSRRSVTRLTYSQNYHL 119
L L A YA + +LGL S + V L T+ D + ++V RLTYSQNYHL
Sbjct: 78 LGLSAPVYATEPVHRLGLLTMYDQFLSRKQVSDFDLFTLDDIDSAFQNVIRLTYSQNYHL 137
Query: 120 SGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVL 179
SGKGEGIV+APHVAGH+LGG++W+ITKDGEDVIYAVDYN RKE+HLNGTVL+SFVRPAVL
Sbjct: 138 SGKGEGIVIAPHVAGHMLGGSIWRITKDGEDVIYAVDYNHRKERHLNGTVLQSFVRPAVL 197
Query: 180 ITDAYNALH-NQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEH 237
ITDAY+AL+ NQ RQQR+ F D ISK L GGNVLLPVD+AGRVLELLLILE +W++
Sbjct: 198 ITDAYHALYTNQTARQQRDKEFLDTISKHLEVGGNVLLPVDTAGRVLELLLILEQHWSQR 257
Query: 238 SLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNA 297
++PIYFLTYVSSSTIDYVKSFLEWM DSI+KSFETSRDNAFLL+HVTLLINK++LDNA
Sbjct: 258 GFSFPIYFLTYVSSSTIDYVKSFLEWMSDSISKSFETSRDNAFLLRHVTLLINKTDLDNA 317
Query: 298 PDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKV 357
P GPK+VLASMASLEAGF+ +IFVEWA+D +NLVLFTE GQFGTLARMLQ+ PPPK VKV
Sbjct: 318 PPGPKVVLASMASLEAGFAREIFVEWANDPRNLVLFTETGQFGTLARMLQSAPPPKFVKV 377
Query: 358 TMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDXXX 417
TMS+RVPL GEELIAYEEEQ RLK+EEAL+ASLVKEEE+KAS G D+N S +PM+ID
Sbjct: 378 TMSKRVPLAGEELIAYEEEQNRLKREEALRASLVKEEETKASHGSDDN-SSEPMIIDTKT 436
Query: 418 XXXSADVVEPHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKDE 477
DV+ HG Y+DILIDGFVPPS+SVAPMFP+Y+N SEWDDFGE+INPDDY+IKDE
Sbjct: 437 TH---DVIGSHGPAYKDILIDGFVPPSSSVAPMFPYYDNTSEWDDFGEIINPDDYVIKDE 493
Query: 478 DMDQAAMHIGGD-DGKLDEGSASLILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSI 536
DMD+ AMH GGD DG+LDE +ASL+LD +PSKV+SNEL V V C L+ +DYEGR+DGRSI
Sbjct: 494 DMDRGAMHNGGDVDGRLDEATASLMLDTRPSKVMSNELIVTVSCSLVKMDYEGRSDGRSI 553
Query: 537 KTILSHVAPLKLVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLS 596
K++++HV+PLKLVLVH AEATEHLKQHCL ++CPHVY PQIEET+DVTSDLCAYKVQLS
Sbjct: 554 KSMIAHVSPLKLVLVHAIAEATEHLKQHCLNNICPHVYAPQIEETVDVTSDLCAYKVQLS 613
Query: 597 EKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAPPHKSVLVGDLKMADLKPF 656
EKLMSNV+FKKLGD E+AWVD+EVGKTE M SLLP+ A PHK VLVGDLK+AD K F
Sbjct: 614 EKLMSNVIFKKLGDSEVAWVDSEVGKTERDMRSLLPMPGAASPHKPVLVGDLKIADFKQF 673
Query: 657 LSSKGIQVEFAGG-ALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLY 715
LSSKG+QVEFAGG ALRCGEYVT+RKVGP GQKGG SG QQI+IEGPLCEDYYKIR YLY
Sbjct: 674 LSSKGVQVEFAGGGALRCGEYVTLRKVGPTGQKGGASGPQQILIEGPLCEDYYKIRDYLY 733
Query: 716 SQFYLL 721
SQFYLL
Sbjct: 734 SQFYLL 739
Score = 505 (182.8 bits), Expect = 1.4e-313, Sum P(2) = 1.4e-313
Identities = 95/109 (87%), Positives = 105/109 (96%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
MGTSVQVTPL GV+NENPLSYLVSIDGFNFLIDCGWND FD SLL+PLS+VASTIDAVLL
Sbjct: 1 MGTSVQVTPLCGVYNENPLSYLVSIDGFNFLIDCGWNDLFDTSLLEPLSRVASTIDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRSVT 109
SHPDTLH+GALPYAMKQLGLSAPV++TEPV+RLGLLTMYDQ+LSR+ V+
Sbjct: 61 SHPDTLHIGALPYAMKQLGLSAPVYATEPVHRLGLLTMYDQFLSRKQVS 109
>UNIPROTKB|Q9P2I0
symbol:CPSF2 "Cleavage and polyadenylation specificity
factor subunit 2" species:9606 "Homo sapiens" [GO:0006378 "mRNA
polyadenylation" evidence=IEA] [GO:0006379 "mRNA cleavage"
evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
[GO:0003723 "RNA binding" evidence=IEA] [GO:0005515 "protein
binding" evidence=IPI] [GO:0006398 "histone mRNA 3'-end processing"
evidence=IDA] [GO:0005847 "mRNA cleavage and polyadenylation
specificity factor complex" evidence=IDA] [GO:0000398 "mRNA
splicing, via spliceosome" evidence=TAS] [GO:0005654 "nucleoplasm"
evidence=TAS] [GO:0006366 "transcription from RNA polymerase II
promoter" evidence=TAS] [GO:0006369 "termination of RNA polymerase
II transcription" evidence=TAS] [GO:0006397 "mRNA processing"
evidence=TAS] [GO:0006406 "mRNA export from nucleus" evidence=TAS]
[GO:0008380 "RNA splicing" evidence=TAS] [GO:0010467 "gene
expression" evidence=TAS] [GO:0031124 "mRNA 3'-end processing"
evidence=TAS] Reactome:REACT_71 InterPro:IPR001279
InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
EMBL:CH471061 Reactome:REACT_1675 GO:GO:0003723 GO:GO:0016787
GO:GO:0006406 GO:GO:0000398 Reactome:REACT_1788 GO:GO:0005847
GO:GO:0006369 eggNOG:COG1236 KO:K14402 InterPro:IPR022712
InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
Pfam:PF10996 Pfam:PF13299 SMART:SM01027 Reactome:REACT_78
HOGENOM:HOG000264343 OMA:NNPFQFK CTD:53981 HOVERGEN:HBG051106
OrthoDB:EOG4MCWZQ GO:GO:0006398 EMBL:AK001627 EMBL:BC070095
EMBL:AB037788 EMBL:AL442079 IPI:IPI00419531 RefSeq:NP_059133.1
UniGene:Hs.657632 UniGene:Hs.736541 ProteinModelPortal:Q9P2I0
SMR:Q9P2I0 DIP:DIP-42500N IntAct:Q9P2I0 MINT:MINT-1697677
STRING:Q9P2I0 PhosphoSite:Q9P2I0 DMDM:51338827 PaxDb:Q9P2I0
PeptideAtlas:Q9P2I0 PRIDE:Q9P2I0 Ensembl:ENST00000298875
GeneID:53981 KEGG:hsa:53981 UCSC:uc001yah.2 GeneCards:GC14P092588
HGNC:HGNC:2325 HPA:HPA024238 MIM:606028 neXtProt:NX_Q9P2I0
PharmGKB:PA26842 InParanoid:Q9P2I0 PhylomeDB:Q9P2I0 ChiTaRS:CPSF2
GenomeRNAi:53981 NextBio:56268 ArrayExpress:Q9P2I0 Bgee:Q9P2I0
CleanEx:HS_CPSF2 Genevestigator:Q9P2I0 GermOnline:ENSG00000165934
Uniprot:Q9P2I0
Length = 782
Score = 922 (329.6 bits), Expect = 5.6e-137, Sum P(3) = 5.6e-137
Identities = 205/550 (37%), Positives = 321/550 (58%)
Query: 95 LLTMYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED-VIY 153
L T+ D + + +L +SQ +L GKG G+ + P AGH++GGT+WKI KDGE+ ++Y
Sbjct: 113 LFTLDDVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVY 172
Query: 154 AVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGN 212
AVD+N ++E HLNG LE RP++LITD++NA + QP R+QR E + +TLR GN
Sbjct: 173 AVDFNHKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGN 232
Query: 213 VLLPVDSAGRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSIT 269
VL+ VD+AGRVLEL +L+ W +Y L VS + +++ KS +EWM D +
Sbjct: 233 VLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLM 292
Query: 270 KSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKN 329
+ FE R+N F +H++L S+L P PK+VLAS LE GFS D+F++W D KN
Sbjct: 293 RCFEDKRNNPFQFRHLSLCHGLSDLARVPS-PKVVLASQPDLECGFSRDLFIQWCQDPKN 351
Query: 330 LVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKAS 389
++ T R GTLAR L +P K ++ + +RV L G+EL Y E++ +LKKE A K
Sbjct: 352 SIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKE-KLKKEAAKKLE 410
Query: 390 LVKEEESKASLGPDNNLSGDPMVIDXXXXXXSADVVEPHGGRYRDILIDGFVPPSTSVAP 449
KE + +S ++++ D ID + + G R F + P
Sbjct: 411 QSKEADIDSS--DESDIEED---IDQPSAHKTKHDLMMKGEGSRK---GSFFKQAKKSYP 462
Query: 450 MFPFYENNSEWDDFGEVINPDDYIIKD---EDMDQAAMHIGGDDGKLDEGSASLILDAKP 506
MFP E +WD++GE+I P+D+++ + + +++ + G +G DE + D P
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNG--DEPMDQDLSDV-P 519
Query: 507 SKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCL 566
+K +S ++++K + +IDYEGR+DG SIK I++ + P +L++VHG EA++ L + C
Sbjct: 520 TKCISTTESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAECCR 579
Query: 567 ----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA---- 618
K + VY P++ ET+D TS+ Y+V+L + L+S++ F K D E+AW+D
Sbjct: 580 AFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDM 637
Query: 619 EVGKTENGML 628
V K + G++
Sbjct: 638 RVSKVDTGVI 647
Score = 304 (112.1 bits), Expect = 5.6e-137, Sum P(3) = 5.6e-137
Identities = 56/112 (50%), Positives = 78/112 (69%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++T LSGV E+ L YL+ +D F FL+DCGW++HF ++ L K IDAVLL
Sbjct: 1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRSVTRLT 112
SHPD LHLGALPYA+ +LGL+ +++T PVY++G + MYD Y SR + T
Sbjct: 61 SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFT 112
Score = 151 (58.2 bits), Expect = 5.6e-137, Sum P(3) = 5.6e-137
Identities = 37/104 (35%), Positives = 57/104 (54%)
Query: 624 ENGMLS-LLPISTPAPPHK-----SVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYV 677
E G S ++P P PPH+ SV + + +++D K L +GIQ EF GG L C V
Sbjct: 689 ETGEESEIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQV 748
Query: 678 TIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 721
+R+ + T +I +EG LC+D+Y+IR LY Q+ ++
Sbjct: 749 AVRR----------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
>UNIPROTKB|Q10568
symbol:CPSF2 "Cleavage and polyadenylation specificity
factor subunit 2" species:9913 "Bos taurus" [GO:0006398 "histone
mRNA 3'-end processing" evidence=ISS] [GO:0005847 "mRNA cleavage
and polyadenylation specificity factor complex" evidence=ISS]
[GO:0003723 "RNA binding" evidence=IEA] [GO:0016787 "hydrolase
activity" evidence=IEA] [GO:0006379 "mRNA cleavage" evidence=IEA]
[GO:0006378 "mRNA polyadenylation" evidence=IEA] InterPro:IPR001279
InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0003723 GO:GO:0016787 GO:GO:0005847 eggNOG:COG1236 KO:K14402
InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108
PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
GeneTree:ENSGT00700000104551 HOGENOM:HOG000264343 OMA:NNPFQFK
EMBL:X75931 IPI:IPI00688446 PIR:A56351 RefSeq:NP_787002.1
UniGene:Bt.4077 ProteinModelPortal:Q10568 STRING:Q10568
PRIDE:Q10568 Ensembl:ENSBTAT00000013500 GeneID:327689
KEGG:bta:327689 CTD:53981 HOVERGEN:HBG051106 InParanoid:Q10568
OrthoDB:EOG4MCWZQ NextBio:20810154 GO:GO:0006398 Uniprot:Q10568
Length = 782
Score = 920 (328.9 bits), Expect = 9.2e-137, Sum P(3) = 9.2e-137
Identities = 205/550 (37%), Positives = 320/550 (58%)
Query: 95 LLTMYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED-VIY 153
L T+ D + + +L +SQ +L GKG G+ + P AGH++GGT+WKI KDGE+ ++Y
Sbjct: 113 LFTLDDVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVY 172
Query: 154 AVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGN 212
AVD+N ++E HLNG LE RP++LITD++NA + QP R+QR E + +TLR GN
Sbjct: 173 AVDFNHKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGN 232
Query: 213 VLLPVDSAGRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSIT 269
VL+ VD+AGRVLEL +L+ W +Y L VS + +++ KS +EWM D +
Sbjct: 233 VLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLM 292
Query: 270 KSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKN 329
+ FE R+N F +H++L S+L P PK+VLAS LE GFS D+F++W D KN
Sbjct: 293 RCFEDKRNNPFQFRHLSLCHGLSDLARVPS-PKVVLASQPDLECGFSRDLFIQWCQDPKN 351
Query: 330 LVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKAS 389
++ T R GTLAR L +P K ++ + +RV L G+EL Y E++ +LKKE A K
Sbjct: 352 SIILTYRTTPGTLARFLIDNPSEKVTEIELRKRVKLEGKELEEYLEKE-KLKKEAAKKLE 410
Query: 390 LVKEEESKASLGPDNNLSGDPMVIDXXXXXXSADVVEPHGGRYRDILIDGFVPPSTSVAP 449
KE + +S +++ D ID + + G R F + P
Sbjct: 411 QSKEADIDSS--DESDAEED---IDQPSAHKTKHDLMMKGEGSRK---GSFFKQAKKSYP 462
Query: 450 MFPFYENNSEWDDFGEVINPDDYIIKD---EDMDQAAMHIGGDDGKLDEGSASLILDAKP 506
MFP E +WD++GE+I P+D+++ + + +++ + G +G DE + D P
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNG--DEPMDQDLSDV-P 519
Query: 507 SKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCL 566
+K +S ++++K + +IDYEGR+DG SIK I++ + P +L++VHG EA++ L + C
Sbjct: 520 TKCISTTESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAECCR 579
Query: 567 ----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA---- 618
K + VY P++ ET+D TS+ Y+V+L + L+S++ F K D E+AW+D
Sbjct: 580 AFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDM 637
Query: 619 EVGKTENGML 628
V K + G++
Sbjct: 638 RVSKVDTGVI 647
Score = 304 (112.1 bits), Expect = 9.2e-137, Sum P(3) = 9.2e-137
Identities = 56/112 (50%), Positives = 78/112 (69%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++T LSGV E+ L YL+ +D F FL+DCGW++HF ++ L K IDAVLL
Sbjct: 1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRSVTRLT 112
SHPD LHLGALPYA+ +LGL+ +++T PVY++G + MYD Y SR + T
Sbjct: 61 SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFT 112
Score = 151 (58.2 bits), Expect = 9.2e-137, Sum P(3) = 9.2e-137
Identities = 37/104 (35%), Positives = 57/104 (54%)
Query: 624 ENGMLS-LLPISTPAPPHK-----SVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYV 677
E G S ++P P PPH+ SV + + +++D K L +GIQ EF GG L C V
Sbjct: 689 ETGEESEIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQV 748
Query: 678 TIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 721
+R+ + T +I +EG LC+D+Y+IR LY Q+ ++
Sbjct: 749 AVRR----------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
>UNIPROTKB|E2R496
symbol:CPSF2 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0006398 "histone mRNA 3'-end processing"
evidence=IEA] [GO:0005847 "mRNA cleavage and polyadenylation
specificity factor complex" evidence=IEA] [GO:0016787 "hydrolase
activity" evidence=IEA] [GO:0006379 "mRNA cleavage" evidence=IEA]
[GO:0006378 "mRNA polyadenylation" evidence=IEA] InterPro:IPR001279
InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0016787 GO:GO:0005847 KO:K14402 InterPro:IPR022712
InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
Pfam:PF10996 Pfam:PF13299 SMART:SM01027
GeneTree:ENSGT00700000104551 OMA:NNPFQFK CTD:53981 GO:GO:0006398
EMBL:AAEX03005582 RefSeq:XP_537353.2 ProteinModelPortal:E2R496
Ensembl:ENSCAFT00000017381 GeneID:480230 KEGG:cfa:480230
NextBio:20855279 Uniprot:E2R496
Length = 782
Score = 919 (328.6 bits), Expect = 1.2e-136, Sum P(3) = 1.2e-136
Identities = 205/550 (37%), Positives = 320/550 (58%)
Query: 95 LLTMYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED-VIY 153
L T+ D + + +L +SQ +L GKG G+ + P AGH++GGT+WKI KDGE+ ++Y
Sbjct: 113 LFTLDDVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVY 172
Query: 154 AVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGN 212
AVD+N ++E HLNG LE RP++LITD++NA + QP R+QR E + +TLR GN
Sbjct: 173 AVDFNHKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGN 232
Query: 213 VLLPVDSAGRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSIT 269
VL+ VD+AGRVLEL +L+ W +Y L VS + +++ KS +EWM D +
Sbjct: 233 VLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLM 292
Query: 270 KSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKN 329
+ FE R+N F +H++L S+L P PK+VLAS LE GFS D+F++W D KN
Sbjct: 293 RCFEDKRNNPFQFRHLSLCHGLSDLARVPS-PKVVLASQPDLECGFSRDLFIQWCQDPKN 351
Query: 330 LVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKAS 389
++ T R GTLAR L +P K ++ + +RV L G+EL Y E++ +LKKE A K
Sbjct: 352 SIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEGKELEEYLEKE-KLKKEAAKKLE 410
Query: 390 LVKEEESKASLGPDNNLSGDPMVIDXXXXXXSADVVEPHGGRYRDILIDGFVPPSTSVAP 449
KE + +S ++++ D ID + G R F + P
Sbjct: 411 QSKEADIDSS--DESDVEED---IDQPSAHKMKHDLMMKGEGSRK---GSFFKQAKKSYP 462
Query: 450 MFPFYENNSEWDDFGEVINPDDYIIKD---EDMDQAAMHIGGDDGKLDEGSASLILDAKP 506
MFP E +WD++GE+I P+D+++ + + +++ + G +G DE + D P
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNG--DEPMDQDLSDV-P 519
Query: 507 SKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCL 566
+K +S ++++K + +IDYEGR+DG SIK I++ + P +L++VHG EA++ L + C
Sbjct: 520 TKCISTTESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAECCR 579
Query: 567 ----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA---- 618
K + VY P++ ET+D TS+ Y+V+L + L+S++ F K D E+AW+D
Sbjct: 580 AFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDM 637
Query: 619 EVGKTENGML 628
V K + G++
Sbjct: 638 RVSKVDTGVI 647
Score = 304 (112.1 bits), Expect = 1.2e-136, Sum P(3) = 1.2e-136
Identities = 56/112 (50%), Positives = 78/112 (69%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++T LSGV E+ L YL+ +D F FL+DCGW++HF ++ L K IDAVLL
Sbjct: 1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRSVTRLT 112
SHPD LHLGALPYA+ +LGL+ +++T PVY++G + MYD Y SR + T
Sbjct: 61 SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFT 112
Score = 151 (58.2 bits), Expect = 1.2e-136, Sum P(3) = 1.2e-136
Identities = 37/104 (35%), Positives = 57/104 (54%)
Query: 624 ENGMLS-LLPISTPAPPHK-----SVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYV 677
E G S ++P P PPH+ SV + + +++D K L +GIQ EF GG L C V
Sbjct: 689 ETGEESEIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQV 748
Query: 678 TIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 721
+R+ + T +I +EG LC+D+Y+IR LY Q+ ++
Sbjct: 749 AVRR----------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
>UNIPROTKB|Q9W799
symbol:cpsf2 "Cleavage and polyadenylation specificity
factor subunit 2" species:8355 "Xenopus laevis" [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=ISS] InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 GO:GO:0005737 GO:GO:0006397
GO:GO:0003723 GO:GO:0016787 GO:GO:0005847 KO:K14402
InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108
PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
CTD:53981 HOVERGEN:HBG051106 EMBL:AF139986 RefSeq:NP_001081123.1
UniGene:Xl.3876 ProteinModelPortal:Q9W799 GeneID:394394
KEGG:xla:394394 Xenbase:XB-GENE-950598 Uniprot:Q9W799
Length = 783
Score = 938 (335.3 bits), Expect = 3.1e-136, Sum P(3) = 3.1e-136
Identities = 214/574 (37%), Positives = 331/574 (57%)
Query: 73 YAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHV 132
Y M Q+ + S L ++ D + + +L Y+Q HL GKG G+ + P
Sbjct: 91 YKMGQMFMYDLYQSRHNTEDFSLFSLDDVDCAFDKIQQLKYNQIVHLKGKGHGLSITPLP 150
Query: 133 AGHLLGGTVWKITKDGED-VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQP 191
AGH++GGT+WKI KDGE+ ++YAVD+N ++E HLNG LE RP++LITD++NA + QP
Sbjct: 151 AGHMIGGTIWKIVKDGEEEIVYAVDFNHKREIHLNGCSLEMINRPSLLITDSFNATYVQP 210
Query: 192 PRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIY---FLT 247
R+QR E + +TLR GNVL+ VD+AGRVLEL +L+ W +Y L
Sbjct: 211 RRKQRDEQLLTNVLETLRGDGNVLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLN 270
Query: 248 YVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLAS 307
VS + +++ KS +EWM D + + FE R+N F +H+TL S+L P PK+VLAS
Sbjct: 271 NVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLTLCHGYSDLARVPS-PKVVLAS 329
Query: 308 MASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVG 367
LE GFS ++F++W D KN V+ T R GTLAR L P + + + + +RV L G
Sbjct: 330 QPDLECGFSRELFIQWCQDPKNSVILTYRTTPGTLARFLIDHPSERIIDIELRKRVKLEG 389
Query: 368 EELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNLSGDPMVIDXXXXXXSA-DVVE 426
+EL Y E++ +LKKE A K KE + +S D+++ D ID + D++
Sbjct: 390 KELEEYVEKE-KLKKEAAKKLEQSKEADLDSS--DDSDVEED---IDQITSHKAKHDLMM 443
Query: 427 PHGGRYRDILIDGFVPPSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKD----EDMDQA 482
+ G + F + PMFP E+ +WD++GE+I P+D+++ + ED ++
Sbjct: 444 KNEGSRKG----SFFKQAKKSYPMFPAPEDRIKWDEYGEIIKPEDFLVPELQVTED-EKT 498
Query: 483 AMHIGGDDGKLDEGSASLILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSH 542
+ G +G DE + D P+K VS ++++K + +IDYEGR+DG SIK I++
Sbjct: 499 KLESGLTNG--DEPMDQDLSDV-PTKCVSTTESMEIKARVTYIDYEGRSDGDSIKKIINQ 555
Query: 543 VAPLKLVLVHGSAEATEHLKQHCL----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEK 598
+ P +L++VHG +AT+ L + C K + VYTP++ ET+D TS+ Y+V+L +
Sbjct: 556 MKPRQLIIVHGPPDATQDLAEACRAFGGKDI--KVYTPKLHETVDATSETHIYQVRLKDS 613
Query: 599 LMSNVLFKKLGDYEIAWVDA----EVGKTENGML 628
L+S++ F K D E+AW+D V K + G++
Sbjct: 614 LVSSLKFCKAKDTELAWIDGVLDMRVSKVDTGVI 647
Score = 281 (104.0 bits), Expect = 3.1e-136, Sum P(3) = 3.1e-136
Identities = 49/107 (45%), Positives = 75/107 (70%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++T L G E+ + YL+ +D F FL+DCGW+++F ++ + K +DAVLL
Sbjct: 1 MTSIIKLTTLVGAQEESAVCYLLQVDEFRFLLDCGWDENFSMDIIDSVKKYVHQVDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRS 107
SHPD LHLGALPYA+ +LGL+ +++T PVY++G + MYD Y SR +
Sbjct: 61 SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHN 107
Score = 151 (58.2 bits), Expect = 3.1e-136, Sum P(3) = 3.1e-136
Identities = 36/106 (33%), Positives = 57/106 (53%)
Query: 617 DAEVGKTENGMLSLLPI-STPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGE 675
D E + + +L P+ S P H+SV + + +++D K L +GI EF GG L C
Sbjct: 688 DKEFSEESEIIPTLEPLPSNEVPGHQSVFMNEPRLSDFKQVLLREGIHAEFVGGVLVCNN 747
Query: 676 YVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 721
V +R+ + T +I +EG LCED++KIR LY Q+ ++
Sbjct: 748 MVAVRR----------TETGRIGLEGCLCEDFFKIRELLYEQYAIV 783
>RGD|1309687
symbol:Cpsf2 "cleavage and polyadenylation specific factor 2,
100kDa" species:10116 "Rattus norvegicus" [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=IEA;ISO] [GO:0006378 "mRNA polyadenylation" evidence=IEA]
[GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006398 "histone mRNA
3'-end processing" evidence=IEA;ISO] [GO:0016787 "hydrolase
activity" evidence=IEA] InterPro:IPR001279 InterPro:IPR027075
Pfam:PF00753 SMART:SM00849 Pfam:PF07521 RGD:1309687 GO:GO:0016787
EMBL:CH473982 GO:GO:0005847 KO:K14402 InterPro:IPR022712
InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
Pfam:PF10996 Pfam:PF13299 SMART:SM01027
GeneTree:ENSGT00700000104551 OMA:NNPFQFK CTD:53981
OrthoDB:EOG4MCWZQ GO:GO:0006398 IPI:IPI00189534
RefSeq:NP_001100223.1 UniGene:Rn.8038 Ensembl:ENSRNOT00000008612
GeneID:299256 KEGG:rno:299256 UCSC:RGD:1309687 NextBio:645098
Uniprot:D3Z9E6
Length = 782
Score = 918 (328.2 bits), Expect = 3.1e-136, Sum P(3) = 3.1e-136
Identities = 204/550 (37%), Positives = 321/550 (58%)
Query: 95 LLTMYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED-VIY 153
L T+ D + + +L +SQ +L GKG G+ + P AGH++GGT+WKI KDGE+ ++Y
Sbjct: 113 LFTLDDVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVY 172
Query: 154 AVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGN 212
AVD+N ++E HLNG LE RP++LITD++NA + QP R+QR E + +TLR GN
Sbjct: 173 AVDFNHKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGN 232
Query: 213 VLLPVDSAGRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSIT 269
VL+ VD+AGRVLEL +L+ W +Y L VS + +++ KS +EWM D +
Sbjct: 233 VLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLM 292
Query: 270 KSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKN 329
+ FE R+N F +H++L S+L P PK+VLAS LE GFS D+F++W D KN
Sbjct: 293 RCFEDKRNNPFQFRHLSLCHGLSDLARVPS-PKVVLASQPDLECGFSRDLFIQWCQDPKN 351
Query: 330 LVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKAS 389
++ T R GTLAR L +P K ++ + +RV L G+EL Y E++ +LKKE A K
Sbjct: 352 SIILTYRTTPGTLARFLIDNPSEKVTEIELRKRVKLEGKELEEYVEKE-KLKKEAAKKLE 410
Query: 390 LVKEEESKASLGPDNNLSGDPMVIDXXXXXXSADVVEPHGGRYRDILIDGFVPPSTSVAP 449
KE + +S ++++ D V D++ G + F + P
Sbjct: 411 QSKEADIDSS--DESDVEED--VDQPTAHKTKHDLMMKGEGSRKG----SFFKQAKKSYP 462
Query: 450 MFPFYENNSEWDDFGEVINPDDYIIKD---EDMDQAAMHIGGDDGKLDEGSASLILDAKP 506
MFP E +WD++GE+I P+D+++ + + +++ + G +G +E + D P
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNG--EEPMDQDLSDV-P 519
Query: 507 SKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCL 566
+K VS ++++K + +IDYEGR+DG SIK I++ + P +L++VHG EA++ L + C
Sbjct: 520 TKCVSATESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAECCR 579
Query: 567 ----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA---- 618
K + VY P++ ET+D TS+ Y+V+L + L+S++ F K D E+AW+D
Sbjct: 580 AFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDM 637
Query: 619 EVGKTENGML 628
V K + G++
Sbjct: 638 RVSKVDTGVI 647
Score = 300 (110.7 bits), Expect = 3.1e-136, Sum P(3) = 3.1e-136
Identities = 55/112 (49%), Positives = 78/112 (69%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++T LSGV E+ L YL+ +D F FL+DCGW++HF ++ L K IDAVLL
Sbjct: 1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSVDIIDSLRKHVHQIDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRSVTRLT 112
SHPD LHLGALP+A+ +LGL+ +++T PVY++G + MYD Y SR + T
Sbjct: 61 SHPDPLHLGALPFAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFT 112
Score = 152 (58.6 bits), Expect = 3.1e-136, Sum P(3) = 3.1e-136
Identities = 35/106 (33%), Positives = 59/106 (55%)
Query: 617 DAEVGKTENGMLSLLPIST-PAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGE 675
+ E+G+ + +L P+ P H+SV + + +++D K L +GIQ EF GG L C
Sbjct: 687 EKELGEESEVIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNN 746
Query: 676 YVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 721
V +R+ + T +I +EG LC+D+Y+IR LY Q+ ++
Sbjct: 747 QVAVRR----------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782
>MGI|MGI:1861601 [details] [associations]
symbol:Cpsf2 "cleavage and polyadenylation specific factor
2" species:10090 "Mus musculus" [GO:0003674 "molecular_function"
evidence=ND] [GO:0003723 "RNA binding" evidence=IEA] [GO:0005634
"nucleus" evidence=IEA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=ISO;IDA]
[GO:0006397 "mRNA processing" evidence=IEA] [GO:0006398 "histone
mRNA 3'-end processing" evidence=ISO] [GO:0016787 "hydrolase
activity" evidence=IEA] InterPro:IPR001279 InterPro:IPR027075
Pfam:PF00753 SMART:SM00849 Pfam:PF07521 MGI:MGI:1861601
GO:GO:0003723 GO:GO:0016787 GO:GO:0005847 eggNOG:COG1236 KO:K14402
InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108
PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
GeneTree:ENSGT00700000104551 HOGENOM:HOG000264343 OMA:NNPFQFK
CTD:53981 HOVERGEN:HBG051106 OrthoDB:EOG4MCWZQ GO:GO:0006398
EMBL:AF012822 EMBL:BC013628 EMBL:BC007163 IPI:IPI00314302
RefSeq:NP_058552.1 UniGene:Mm.716 ProteinModelPortal:O35218
SMR:O35218 STRING:O35218 PhosphoSite:O35218 PaxDb:O35218
PRIDE:O35218 Ensembl:ENSMUST00000047357 GeneID:51786 KEGG:mmu:51786
UCSC:uc007otx.2 InParanoid:O35218 NextBio:308008 Bgee:O35218
CleanEx:MM_CPSF2 Genevestigator:O35218
GermOnline:ENSMUSG00000041781 Uniprot:O35218
Length = 782
Score = 918 (328.2 bits), Expect = 3.9e-136, Sum P(3) = 3.9e-136
Identities = 204/550 (37%), Positives = 321/550 (58%)
Query: 95 LLTMYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED-VIY 153
L T+ D + + +L +SQ +L GKG G+ + P AGH++GGT+WKI KDGE+ ++Y
Sbjct: 113 LFTLDDVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVY 172
Query: 154 AVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGN 212
AVD+N ++E HLNG LE RP++LITD++NA + QP R+QR E + +TLR GN
Sbjct: 173 AVDFNHKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGN 232
Query: 213 VLLPVDSAGRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSIT 269
VL+ VD+AGRVLEL +L+ W +Y L VS + +++ KS +EWM D +
Sbjct: 233 VLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLM 292
Query: 270 KSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKN 329
+ FE R+N F +H++L S+L P PK+VLAS LE GFS D+F++W D KN
Sbjct: 293 RCFEDKRNNPFQFRHLSLCHGLSDLARVPS-PKVVLASQPDLECGFSRDLFIQWCQDPKN 351
Query: 330 LVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKAS 389
++ T R GTLAR L +P K ++ + +RV L G+EL Y E++ +LKKE A K
Sbjct: 352 SIILTYRTTPGTLARFLIDNPTEKVTEIELRKRVKLEGKELEEYVEKE-KLKKEAAKKLE 410
Query: 390 LVKEEESKASLGPDNNLSGDPMVIDXXXXXXSADVVEPHGGRYRDILIDGFVPPSTSVAP 449
KE + +S ++++ D V D++ G + F + P
Sbjct: 411 QSKEADIDSS--DESDVEED--VDQPSAHKTKHDLMMKGEGSRKG----SFFKQAKKSYP 462
Query: 450 MFPFYENNSEWDDFGEVINPDDYIIKD---EDMDQAAMHIGGDDGKLDEGSASLILDAKP 506
MFP E +WD++GE+I P+D+++ + + +++ + G +G +E + D P
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNG--EEPMDQDLSDV-P 519
Query: 507 SKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCL 566
+K VS ++++K + +IDYEGR+DG SIK I++ + P +L++VHG EA++ L + C
Sbjct: 520 TKCVSATESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAECCR 579
Query: 567 ----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA---- 618
K + VY P++ ET+D TS+ Y+V+L + L+S++ F K D E+AW+D
Sbjct: 580 AFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDM 637
Query: 619 EVGKTENGML 628
V K + G++
Sbjct: 638 RVSKVDTGVI 647
Score = 300 (110.7 bits), Expect = 3.9e-136, Sum P(3) = 3.9e-136
Identities = 55/112 (49%), Positives = 78/112 (69%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++T LSGV E+ L YL+ +D F FL+DCGW++HF ++ L K IDAVLL
Sbjct: 1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSVDIIDSLRKHVHQIDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRSVTRLT 112
SHPD LHLGALP+A+ +LGL+ +++T PVY++G + MYD Y SR + T
Sbjct: 61 SHPDPLHLGALPFAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFT 112
Score = 151 (58.2 bits), Expect = 3.9e-136, Sum P(3) = 3.9e-136
Identities = 46/143 (32%), Positives = 71/143 (49%)
Query: 579 EETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPISTPAP 638
E +D SD A Q + K + K+LG+ + E+ T L LP P
Sbjct: 661 EMQVDAPSDSSAMAQQKAMKSLFGEDEKELGE------ETEIIPT----LEPLP-PHEVP 709
Query: 639 PHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGPAGQKGGGSGTQQIV 698
H+SV + + +++D K L +GIQ EF GG L C V +R+ + T +I
Sbjct: 710 GHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQVAVRR----------TETGRIG 759
Query: 699 IEGPLCEDYYKIRAYLYSQFYLL 721
+EG LC+D+Y+IR LY Q+ ++
Sbjct: 760 LEGCLCQDFYRIRDLLYEQYAIV 782
>UNIPROTKB|F1NMN0 [details] [associations]
symbol:CPSF2 "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0006378 "mRNA polyadenylation" evidence=IEA]
[GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0016787 "hydrolase
activity" evidence=IEA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IEA]
[GO:0006398 "histone mRNA 3'-end processing" evidence=IEA]
InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 GO:GO:0016787 GO:GO:0005847 InterPro:IPR022712
InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
Pfam:PF10996 Pfam:PF13299 SMART:SM01027
GeneTree:ENSGT00700000104551 OMA:NNPFQFK GO:GO:0006398
EMBL:AADN02003653 IPI:IPI00651282 Ensembl:ENSGALT00000017538
Uniprot:F1NMN0
Length = 782
Score = 918 (328.2 bits), Expect = 1.3e-135, Sum P(3) = 1.3e-135
Identities = 205/550 (37%), Positives = 319/550 (58%)
Query: 95 LLTMYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED-VIY 153
L T+ D + + +L +SQ +L GKG G+ + P AGH++GGT+WKI KDGE+ ++Y
Sbjct: 113 LFTLDDVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVY 172
Query: 154 AVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGN 212
AVD+N ++E HLNG LE RP++LITD++NA + QP R+QR E + +TLR GN
Sbjct: 173 AVDFNHKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGN 232
Query: 213 VLLPVDSAGRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSIT 269
VL+ VD+AGRVLEL +L+ W +Y L VS + +++ KS +EWM D +
Sbjct: 233 VLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLM 292
Query: 270 KSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKN 329
+ FE R+N F +H++L + S+L P PK+VLAS LE GFS D+F++W D KN
Sbjct: 293 RCFEDKRNNPFQFRHLSLCHSLSDLARVPS-PKVVLASQPDLECGFSRDLFIQWCQDSKN 351
Query: 330 LVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKAS 389
++ T R GTLAR L +P K + + + RRV L G+EL Y E++ +LKKE A K
Sbjct: 352 SIILTYRTTPGTLARFLIDNPSEKVIDIELRRRVKLEGKELEEYLEKE-KLKKEAAKKLE 410
Query: 390 LVKEEESKASLGPDNNLSGDPMVIDXXXXXXSADVVEPHGGRYRDILIDGFVPPSTSVAP 449
KE + +S D D + +++ G R F + P
Sbjct: 411 QSKEADIDSSDESDAEEDIDQPTVHKTKHDL---MMKGEGSRK-----GSFFKQAKKSYP 462
Query: 450 MFPFYENNSEWDDFGEVINPDDYIIKD---EDMDQAAMHIGGDDGKLDEGSASLILDAKP 506
MFP E +WD++GE+I P+D+++ + + +++ + G +G +E + D P
Sbjct: 463 MFPAPEERIKWDEYGEIIKPEDFLVPELQATEEEKSKLESGLTNG--EEPMDQDLSDV-P 519
Query: 507 SKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCL 566
+K +S ++++K + +IDYEGR+DG SIK I++ + P +LV+VHG EA++ L + C
Sbjct: 520 TKCISATESMEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLVIVHGPPEASQDLAECCR 579
Query: 567 ----KHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA---- 618
K + VY P++ ET+D TS+ Y+V+L + L+S++ F K D E+AW+D
Sbjct: 580 AFGGKDI--KVYMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDM 637
Query: 619 EVGKTENGML 628
V K + G++
Sbjct: 638 RVSKVDTGVI 647
Score = 295 (108.9 bits), Expect = 1.3e-135, Sum P(3) = 1.3e-135
Identities = 53/112 (47%), Positives = 78/112 (69%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++T LSGV E+ L YL+ +D F FL+DCGW+++F ++ L K +DAVLL
Sbjct: 1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDENFSMDIIDSLKKHVHQVDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRSVTRLT 112
SHPD LHLGALPYA+ ++GL+ +++T PVY++G + MYD Y SR + T
Sbjct: 61 SHPDPLHLGALPYAVGKMGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFT 112
Score = 151 (58.2 bits), Expect = 1.3e-135, Sum P(3) = 1.3e-135
Identities = 34/97 (35%), Positives = 54/97 (55%)
Query: 630 LLPISTPAPPHK-----SVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGP 684
++P P PPH+ SV + + +++D K L +GIQ EF GG L C V +R+
Sbjct: 696 IIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNMVAVRR--- 752
Query: 685 AGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 721
+ T +I +EG LC+D+Y+IR LY Q+ ++
Sbjct: 753 -------TETGRIGLEGCLCQDFYRIRELLYKQYAIV 782
>ZFIN|ZDB-GENE-040718-79 [details] [associations]
symbol:cpsf2 "cleavage and polyadenylation specific
factor 2" species:7955 "Danio rerio" [GO:0006378 "mRNA
polyadenylation" evidence=IEA] [GO:0016787 "hydrolase activity"
evidence=IEA] [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0005847
"mRNA cleavage and polyadenylation specificity factor complex"
evidence=IEA] InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 ZFIN:ZDB-GENE-040718-79 GO:GO:0016787
eggNOG:COG1236 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
SMART:SM01027 HOGENOM:HOG000264343 CTD:53981 HOVERGEN:HBG051106
OrthoDB:EOG4MCWZQ EMBL:BC076029 IPI:IPI00512505
RefSeq:NP_001002384.1 UniGene:Dr.121547 ProteinModelPortal:Q6DHE5
STRING:Q6DHE5 PRIDE:Q6DHE5 GeneID:436657 KEGG:dre:436657
InParanoid:Q6DHE5 NextBio:20831102 ArrayExpress:Q6DHE5 Bgee:Q6DHE5
Uniprot:Q6DHE5
Length = 790
Score = 923 (330.0 bits), Expect = 1.7e-135, Sum P(3) = 1.7e-135
Identities = 203/551 (36%), Positives = 324/551 (58%)
Query: 95 LLTMYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED-VIY 153
L T+ D + + +L YSQ +L GKG G+ + P AGH++GGT+WKI KDGE+ +IY
Sbjct: 113 LFTLDDVDSAFDKIQQLKYSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIIY 172
Query: 154 AVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGN 212
VD+N ++E HLNG LES RP++LITD++NA + QP R+QR E + +TLR GN
Sbjct: 173 GVDFNHKREIHLNGCSLESLSRPSLLITDSFNASYVQPRRKQRDEQLLTNVMETLRGDGN 232
Query: 213 VLLPVDSAGRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSIT 269
VL+ VD+AGRVLEL +L+ W +Y L VS + +++ KS +EWM D +
Sbjct: 233 VLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLM 292
Query: 270 KSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKN 329
+ FE R+N F +H++L + S+L P PK+VL S LE+GFS ++F++W D KN
Sbjct: 293 RCFEDKRNNPFQFRHLSLCHSLSDLARVPS-PKVVLCSQPDLESGFSRELFIQWCQDAKN 351
Query: 330 LVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKAS 389
V+ T R GTLAR L +P K +++ + +R L G EL Y E++ R+KKE A K
Sbjct: 352 SVILTYRTTPGTLARYLIDNPGEKRIELEIRKRCRLEGRELEEYMEKE-RMKKEAAKKLE 410
Query: 390 LVKEEESKASLGPDNNLSGD---PMVIDXXXXXXSADVVEPHGGRYRDILIDGFVPPSTS 446
KE + +S ++++ D P V+ +++ GGR GF +
Sbjct: 411 QAKEVDLDSS--DESDMEDDLEQPAVVKTKHHDL---MMKGEGGRK-----GGFFKQAKK 460
Query: 447 VAPMFPFYENNSEWDDFGEVINPDDYIIKD---EDMDQAAMHIGGDDGKLDEGSASLILD 503
MFP +E +WD++GE+I P+D+++ + + +++ + G +G +E + D
Sbjct: 461 SYSMFPTHEERIKWDEYGEIIRPEDFLVPELQATEEEKSKLESGLTNG--EEPMEQDLSD 518
Query: 504 AKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQ 563
P+K S T+ ++ +++IDYEGR+DG SIK I++ + P +L++VHG +A++ L +
Sbjct: 519 V-PTKCTSTTQTLDIRARVMYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPDASQDLAE 577
Query: 564 HCLKHVCPH--VYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA--- 618
C + VY P+++ET+D TS+ Y+V+L + L+S++ F K D E+AW+D
Sbjct: 578 SCKAYSGKDIKVYIPKLQETVDATSETHIYQVRLKDSLVSSLQFCKARDTELAWIDGVLD 637
Query: 619 -EVGKTENGML 628
V K + G++
Sbjct: 638 MRVEKVDTGVI 648
Score = 289 (106.8 bits), Expect = 1.7e-135, Sum P(3) = 1.7e-135
Identities = 52/112 (46%), Positives = 77/112 (68%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++T LSGV E+ L YL+ +D F FL+DCGW++ F ++ L + +DAVLL
Sbjct: 1 MTSIIKLTALSGVQEESALCYLLQVDEFRFLLDCGWDETFSMDIIDSLKRYVHQVDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRSVTRLT 112
SHPD +HLGALPYA+ +LGL+ +++T PVY++G + MYD Y SR + T
Sbjct: 61 SHPDHVHLGALPYAVGKLGLNCTIYATIPVYKMGQMFMYDLYQSRHNTEDFT 112
Score = 151 (58.2 bits), Expect = 1.7e-135, Sum P(3) = 1.7e-135
Identities = 35/103 (33%), Positives = 56/103 (54%)
Query: 617 DAEVGKTENGMLSLLPIST-PAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGE 675
+ E+ + + + +L P+ P H+SV + + +++D K L +GIQ EF GG L C
Sbjct: 695 EKEISEESDVIPTLEPLPAHEVPGHQSVFINEPRLSDFKQVLLREGIQAEFVGGVLVCNN 754
Query: 676 YVTIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQF 718
V +R+ AG+ I +EG C+DYY+IR LY Q+
Sbjct: 755 LVAVRRT-EAGR---------ICLEGCHCDDYYRIRELLYEQY 787
>FB|FBgn0027873 [details] [associations]
symbol:Cpsf100 "Cleavage and polyadenylation specificity
factor 100" species:7227 "Drosophila melanogaster" [GO:0006379
"mRNA cleavage" evidence=ISS;NAS] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=ISS]
[GO:0006378 "mRNA polyadenylation" evidence=ISS;IMP;NAS]
[GO:0003730 "mRNA 3'-UTR binding" evidence=NAS] [GO:0016787
"hydrolase activity" evidence=IEA] [GO:0006398 "histone mRNA 3'-end
processing" evidence=IMP] [GO:0022008 "neurogenesis" evidence=IMP]
InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 EMBL:AE014297 GO:GO:0022008 GO:GO:0006378
GO:GO:0003723 GO:GO:0016787 GO:GO:0005847 GO:GO:0006379
eggNOG:COG1236 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
SMART:SM01027 GeneTree:ENSGT00700000104551 OMA:NNPFQFK
GO:GO:0006398 EMBL:AF160933 RefSeq:NP_651658.1 RefSeq:NP_733264.1
UniGene:Dm.1362 ProteinModelPortal:Q9V3D6 SMR:Q9V3D6 IntAct:Q9V3D6
STRING:Q9V3D6 PaxDb:Q9V3D6 PRIDE:Q9V3D6 EnsemblMetazoa:FBtr0085357
GeneID:43426 KEGG:dme:Dmel_CG1957 UCSC:CG1957-RA CTD:43426
FlyBase:FBgn0027873 InParanoid:Q8IML7 OrthoDB:EOG4XD261
PhylomeDB:Q9V3D6 GenomeRNAi:43426 NextBio:833860 Bgee:Q9V3D6
GermOnline:CG1957 Uniprot:Q9V3D6
Length = 756
Score = 929 (332.1 bits), Expect = 2.8e-120, Sum P(2) = 2.8e-120
Identities = 238/668 (35%), Positives = 367/668 (54%)
Query: 95 LLTMYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGE-DVIY 153
L ++ D + +T+L Y+Q L KG GI + P AGH++GGT+WKI K GE D++Y
Sbjct: 113 LFSLDDVDTAFEKITQLKYNQTVSLKDKGYGISITPLNAGHMIGGTIWKIVKVGEEDIVY 172
Query: 154 AVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGN 212
A D+N +KE+HL+G L+ RP++LITDAYNA + Q R+ R E I +T+R GN
Sbjct: 173 ATDFNHKKERHLSGCELDRLQRPSLLITDAYNAQYQQARRRARDEKLMTNILQTVRNNGN 232
Query: 213 VLLPVDSAGRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYVKSFLEWMGDSIT 269
VL+ VD+AGRVLEL +L+ W + Y + L VS + I++ KS +EWM D +T
Sbjct: 233 VLIAVDTAGRVLELAHMLDQLWKNKESGLMAYSLALLNNVSYNVIEFAKSQIEWMSDKLT 292
Query: 270 KSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKN 329
K+FE +R+N F KH+ L + +++ P GPK+VLAS LE+GF+ D+FV+WAS+ N
Sbjct: 293 KAFEGARNNPFQFKHIQLCHSLADVYKLPAGPKVVLASTPDLESGFTRDLFVQWASNANN 352
Query: 330 LVLFTERGQFGTLA-RMLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKA 388
++ T R GTLA +++ P K +++ + RRV L G EL EE R + E+ L
Sbjct: 353 SIILTTRTSPGTLAMELVENCAPGKQIELDVRRRVDLEGAEL----EEYLRTQGEK-LNP 407
Query: 389 SLVK---EEESKASLGPDNNLSGDPMVIDXXXXXXSADVVEPHGGRYRDILIDGFVPPST 445
+VK EEES + D +S VI VV P G + GF +
Sbjct: 408 LIVKPDVEEESSSESEDDIEMS----VITGKHDI----VVRPEGRHH-----SGFFKSNK 454
Query: 446 SVAPMFPFYENNSEWDDFGEVINPDDYIIKD--------------EDMDQAAMHIGGD-- 489
MFP++E + D++GE+IN DDY I D E++ + IG +
Sbjct: 455 RHHVMFPYHEEKVKCDEYGEIINLDDYRIADATGYEFVPMEEQNKENVKKEEPGIGAEQQ 514
Query: 490 -DGKLDEGSASLILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKL 548
+G + + L+ KP+K++S T++V + ID+EGR+DG S+ ILS + P ++
Sbjct: 515 ANGGIVDNDVQLL--EKPTKLISQRKTIEVNAQVQRIDFEGRSDGESMLKILSQLRPRRV 572
Query: 549 VLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKL 608
+++HG+AE T+ + +HC ++V V+TPQ E IDVTS++ Y+V+L+E L+S + F+K
Sbjct: 573 IVIHGTAEGTQVVARHCEQNVGARVFTPQKGEIIDVTSEIHIYQVRLTEGLVSQLQFQKG 632
Query: 609 GDYEIAWVDAEVGKTENGMLSLLPISTPAPPHKSVLVGDLKMADLKPFLSSK-GIQVEFA 667
D E+AWVD +G + + P+ SV G K L+ + I
Sbjct: 633 KDAEVAWVDGRLGMRVKAIEA--PMDVTVEQDASVQEG--KTLTLETLADDEIPIHNSVL 688
Query: 668 GGALRCGEYV-TIRKVGPAGQKGGG-----SGTQ--------QIVIEGPLCEDYYKIRAY 713
L+ ++ T+ + + GG +GT ++ +EG L E+YYKIR
Sbjct: 689 INELKLSDFKQTLMRNNINSEFSGGVLWCSNGTLALRRVDAGKVAMEGCLSEEYYKIREL 748
Query: 714 LYSQFYLL 721
LY Q+ ++
Sbjct: 749 LYEQYAIV 756
Score = 275 (101.9 bits), Expect = 2.8e-120, Sum P(2) = 2.8e-120
Identities = 46/104 (44%), Positives = 74/104 (71%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++ +SG +E+P Y++ ID L+DCGW++ FD + ++ L + T+DAVLL
Sbjct: 1 MTSIIKLHTISGAMDESPPCYILQIDDVRILLDCGWDEKFDANFIKELKRQVHTLDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLS 104
SHPD HLGALPY + +LGL+ P+++T PV+++G + MYD Y+S
Sbjct: 61 SHPDAYHLGALPYLVGKLGLNCPIYATIPVFKMGQMFMYDLYMS 104
>DICTYBASE|DDB_G0270392 [details] [associations]
symbol:cpsf2 "cleavage and polyadenylation
specificity factor 100 kDa subunit" species:44689 "Dictyostelium
discoideum" [GO:0016787 "hydrolase activity" evidence=IEA]
[GO:0006379 "mRNA cleavage" evidence=IEA;ISS] [GO:0006378 "mRNA
polyadenylation" evidence=IEA;ISS] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IEA;ISS]
[GO:0003723 "RNA binding" evidence=IEA;ISS] [GO:0006397 "mRNA
processing" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753 SMART:SM00849
dictyBase:DDB_G0270392 Pfam:PF07521 EMBL:AAFI02000005
GenomeReviews:CM000150_GR GO:GO:0006378 GO:GO:0003723 GO:GO:0016787
GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236 KO:K14402
InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108
PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
OMA:NNPFQFK RefSeq:XP_646760.1 ProteinModelPortal:Q55BS1
STRING:Q55BS1 EnsemblProtists:DDB0233700 GeneID:8617733
KEGG:ddi:DDB_G0270392 ProtClustDB:CLSZ2431463 Uniprot:Q55BS1
Length = 784
Score = 800 (286.7 bits), Expect = 6.5e-118, Sum P(3) = 6.5e-118
Identities = 187/559 (33%), Positives = 313/559 (55%)
Query: 111 LTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVL 170
L++SQ+Y LSGKG+GI + P++AGH +G +VWKITK ++YA+DYN R E HL+ L
Sbjct: 131 LSFSQHYSLSGKGKGISITPYLAGHTIGASVWKITKGTYSIVYAIDYNHRNEGHLDSLQL 190
Query: 171 ES-FVRPAVLITDAYN-----ALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVL 224
S ++P++LITD+ A R Q +F+ I++ LR GGNVL+PVD+AGRVL
Sbjct: 191 TSDILKPSLLITDSKGVDKTLAFKKTITRDQ-SLFEQ-INRNLRDGGNVLIPVDTAGRVL 248
Query: 225 ELLLILEDYWAEH-SLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLL 282
ELLL +E+YW+++ SL Y + FL S S + +S LE+M + + FE + +N F
Sbjct: 249 ELLLCIENYWSKNKSLALYSVVFLGRFSFSVCQFARSQLEFMSSTASVKFEQNIENPFSF 308
Query: 283 KHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTL 342
KH+ +L + EL PD K++L S LE GFS ++F++W SD K L+LFT++ +L
Sbjct: 309 KHIKILSSLEELQELPDTNKVILTSSQDLETGFSRELFIQWCSDPKTLILFTQKIPKDSL 368
Query: 343 ARML--QADPPP---KAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESK 397
A L Q P K +++ RVPL G+EL+ YE EQ + ++E+ L+ L KE+E +
Sbjct: 369 ADKLIKQYSTPNGRGKCIEIVQGSRVPLTGDELLQYEMEQAKQREEKRLE-QLRKEQEER 427
Query: 398 ASLGPDNNLSGDPMVIDXXXXXXSADVVEPHGGRYRDILIDGFV----P----------- 442
+ + ++ +++ + R I+ D V P
Sbjct: 428 EERERLEEEEREQL-LNATNQDQLQQLLQLQQQKERGIIDDSMVHMKNPFENDRFDLLDS 486
Query: 443 --PSTSVAPMFPFYENNSEWDDFGEVINPDDYIIKDEDMDQAAMHIGGDDGKLDEGSASL 500
S+ MFP++E + +W ++GE DD I++++D + + ++ ++ E
Sbjct: 487 EFKKQSMITMFPYFEKHLKWGEYGE--EDDDLILRNQD--KKVEEVTMEEDEIQEQEI-- 540
Query: 501 ILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEH 560
P K+++ L + + C + IDYEG +DGRSIK I+ +AP KLVL+ GS + ++
Sbjct: 541 -----PKKIITQTLRLPINCKIQTIDYEGCSDGRSIKAIIQQIAPTKLVLIRGSEQQSQS 595
Query: 561 LKQHCLKHV-CPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAE 619
++ + +++ +Y P I E +D+TSD Y++ L + L++ + K+ DYE++++ +
Sbjct: 596 IENYVKENIRTKGIYIPSIGEQLDLTSDTNVYELLLKDSLVNTLKTSKILDYEVSYIQGK 655
Query: 620 VGKTENGMLSLLPISTPAP 638
V + + +L + P
Sbjct: 656 VDILDGSNVPVLDLIQSIP 674
Score = 261 (96.9 bits), Expect = 6.5e-118, Sum P(3) = 6.5e-118
Identities = 50/107 (46%), Positives = 72/107 (67%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + ++ T LSG +E+P YL+ ID F L+DCG + + D SLL+PL KVA IDAVLL
Sbjct: 1 MASIIKFTALSGAKDESPPCYLLEIDDFCILLDCGLSYNLDFSLLEPLEKVAKKIDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRS 107
SH DT H+G LPY + + GL+ ++ T PV ++G + +YD Y ++ S
Sbjct: 61 SHSDTTHIGGLPYVVGKYGLTGTIYGTTPVLKMGTMFLYDLYENKMS 107
Score = 135 (52.6 bits), Expect = 6.5e-118, Sum P(3) = 6.5e-118
Identities = 32/97 (32%), Positives = 51/97 (52%)
Query: 625 NGMLSLLPISTPAPPHKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYVTIRKVGP 684
N + +T H +GD+K++DLK L + GIQV+F G L CG V I +
Sbjct: 694 NNTTMMTTTTTTTNGHDESFIGDIKLSDLKQVLVNAGIQVQFDQGILNCGGLVYIWR--- 750
Query: 685 AGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 721
+ GG+ I ++G + ++YY I+ LY QF ++
Sbjct: 751 -DEDHGGNSI--INVDGIISDEYYLIKELLYKQFQIV 784
>WB|WBGene00017313 [details] [associations]
symbol:cpsf-2 species:6239 "Caenorhabditis elegans"
[GO:0016787 "hydrolase activity" evidence=IEA] [GO:0009792 "embryo
development ending in birth or egg hatching" evidence=IMP]
[GO:0051301 "cell division" evidence=IMP] [GO:0000910 "cytokinesis"
evidence=IMP] [GO:0000003 "reproduction" evidence=IMP] [GO:0040035
"hermaphrodite genitalia development" evidence=IMP] [GO:0016246
"RNA interference" evidence=IMP] [GO:0040027 "negative regulation
of vulval development" evidence=IMP] InterPro:IPR001279
InterPro:IPR027075 SMART:SM00849 Pfam:PF07521 GO:GO:0005634
GO:GO:0009792 GO:GO:0016246 GO:GO:0006397 GO:GO:0003723
GO:GO:0016787 GO:GO:0000910 GO:GO:0040035 GO:GO:0040027
eggNOG:COG1236 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
SMART:SM01027 GeneTree:ENSGT00700000104551 HOGENOM:HOG000264343
OMA:NNPFQFK EMBL:FO080529 PIR:T32487 RefSeq:NP_504822.1
ProteinModelPortal:O17403 SMR:O17403 STRING:O17403 PaxDb:O17403
EnsemblMetazoa:F09G2.4 GeneID:179103 KEGG:cel:CELE_F09G2.4
CTD:179103 WormBase:F09G2.4 InParanoid:O17403 NextBio:903938
Uniprot:O17403
Length = 843
Score = 474 (171.9 bits), Expect = 1.1e-97, Sum P(4) = 1.1e-97
Identities = 122/358 (34%), Positives = 196/358 (54%)
Query: 73 YAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHV 132
Y M Q+ + V+S V T+ D + V ++ Y+Q L G G+
Sbjct: 91 YKMGQMFIYDMVYSHLDVEEFEHYTLDDVDTAFEKVEQVKYNQTVVLKGDS-GVHFTALP 149
Query: 133 AGHLLGGTVWKITK-DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQP 191
AGH+LGG++W+I + GED++Y VD+N +KE+HLNG ++F RP +LIT A++ Q
Sbjct: 150 AGHMLGGSIWRICRVTGEDIVYCVDFNHKKERHLNGCSFDNFNRPHLLITGAHHISLPQM 209
Query: 192 PRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW--AEHSLN-YPIYFLT 247
R+ R E I +T+R G+ ++ +D+AGRVLEL +L+ W A+ L+ Y + ++
Sbjct: 210 RRKDRDEQLVTKILRTVRQKGDCMIVIDTAGRVLELAHLLDQLWSNADAGLSTYNLVMMS 269
Query: 248 YVSSSTIDYVKSFLEWMGDSITKSFETS-RDNAFLLKHVTLLINKSELDNAPDGPKLVLA 306
+V+SS + + KS LEWM + + K +S R N F LKHVTL + EL PK+VL
Sbjct: 270 HVASSVVQFAKSQLEWMNEKLFKYDSSSARYNPFTLKHVTLCHSHQELMRVRS-PKVVLC 328
Query: 307 SMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML-----QADP-----PPKAVK 356
S +E+GFS ++F++W SD +N V+ T R TLA L +A+ + +
Sbjct: 329 SSQDMESGFSRELFLDWCSDPRNGVILTARPASFTLAAKLVNMAERANDGVLKHEDRLIS 388
Query: 357 VTMSRRVPLVGEELIAYEEEQTRLKKEEA-LKASLVKEE-ESKASLGPDNNLSGDPMV 412
+ + +RV L GEEL+ Y+ + EE L+ + + ++ S D++ P+V
Sbjct: 389 LVVKKRVALEGEELLEYKRRKAERDAEETRLRMERARRQAQANESDDSDDDDIAAPIV 446
Score = 272 (100.8 bits), Expect = 1.1e-97, Sum P(4) = 1.1e-97
Identities = 79/307 (25%), Positives = 152/307 (49%)
Query: 353 KAVKVTMSRRVPLVGEELIAYEE-------EQTRLKKEEALKASLVKEEESKASLGPDNN 405
+ + + + +RV L GEEL+ Y+ E+TRL+ E A + + E + D++
Sbjct: 385 RLISLVVKKRVALEGEELLEYKRRKAERDAEETRLRMERARRQAQANESDDS----DDDD 440
Query: 406 LSGDPMVIDXXXXXXSADVVEPHGGRYRDILID-------GFVPPSTSVAPMFPFYENNS 458
++ + S D E + DI+ F + PMFP+ E
Sbjct: 441 IAAPIVPRHSEKDFRSFDGSENDAHTF-DIMAKWDNQQKASFFKTTKKSFPMFPYIEEKV 499
Query: 459 EWDDFGEVINPDDYII-------KDEDMDQAAMHIGGDDGKLDEGSASLILDAKPSKVVS 511
+WDD+GEVI P+DY + K ++ D+ + + + + + + + ++ P+K V
Sbjct: 500 KWDDYGEVIKPEDYTVISKIDLRKGQNKDEPVV-VKKREEEEEVYNPNDHVEEMPTKCVE 558
Query: 512 NELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCLKHV-- 569
+ V+V C + FI+YEG +DG S K +L+ + P ++++VHGS + T L +
Sbjct: 559 FKNRVEVSCRIEFIEYEGISDGESTKKLLAGLLPRQIIVVHGSRDDTRDLVAYFADSGFD 618
Query: 570 CPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGD-YEIAWVDAEVGKTENGML 628
+ P+ +D + + Y+V LS+ L++++ FK++ + +AW+DA V + E +
Sbjct: 619 TTMLKAPEAGALVDASVESFIYQVALSDALLADIQFKEVSEGNSLAWIDARVMEKE-AID 677
Query: 629 SLLPIST 635
++L + T
Sbjct: 678 NMLAVGT 684
Score = 250 (93.1 bits), Expect = 1.1e-97, Sum P(4) = 1.1e-97
Identities = 46/108 (42%), Positives = 67/108 (62%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++ SG +E PL YL+ +DG L+DCGW++ F + L I AVL+
Sbjct: 1 MTSIIKLKVFSGAKDEGPLCYLLQVDGDYILLDCGWDERFGLQYFEELKPFIPKISAVLI 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRSV 108
SHPD LHLG LPY + + GL+APV++T PVY++G + +YD S V
Sbjct: 61 SHPDPLHLGGLPYLVSKCGLTAPVYATVPVYKMGQMFIYDMVYSHLDV 108
Score = 117 (46.2 bits), Expect = 1.1e-97, Sum P(4) = 1.1e-97
Identities = 37/103 (35%), Positives = 51/103 (49%)
Query: 621 GKTENGMLSLLPISTPAPP-HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRC-GEYVT 678
GK G L L P+ P H++V V D K++D K L+ KG + EF G L G +
Sbjct: 752 GKIR-GNLILDPLPKRLIPIHQAVFVNDPKLSDFKNLLTDKGYKAEFLSGTLLINGGNCS 810
Query: 679 IRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 721
IR+ +G Q+ EG +DYYK+R Y QF +L
Sbjct: 811 IRR--------NDTGVFQM--EGAFTKDYYKLRRLFYDQFAVL 843
>UNIPROTKB|O17403 [details] [associations]
symbol:cpsf-2 "Probable cleavage and polyadenylation
specificity factor subunit 2" species:6239 "Caenorhabditis elegans"
[GO:0005575 "cellular_component" evidence=ND] [GO:0003674
"molecular_function" evidence=ND] InterPro:IPR001279
InterPro:IPR027075 SMART:SM00849 Pfam:PF07521 GO:GO:0005634
GO:GO:0009792 GO:GO:0016246 GO:GO:0006397 GO:GO:0003723
GO:GO:0016787 GO:GO:0000910 GO:GO:0040035 GO:GO:0040027
eggNOG:COG1236 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
SMART:SM01027 GeneTree:ENSGT00700000104551 HOGENOM:HOG000264343
OMA:NNPFQFK EMBL:FO080529 PIR:T32487 RefSeq:NP_504822.1
ProteinModelPortal:O17403 SMR:O17403 STRING:O17403 PaxDb:O17403
EnsemblMetazoa:F09G2.4 GeneID:179103 KEGG:cel:CELE_F09G2.4
CTD:179103 WormBase:F09G2.4 InParanoid:O17403 NextBio:903938
Uniprot:O17403
Length = 843
Score = 474 (171.9 bits), Expect = 1.1e-97, Sum P(4) = 1.1e-97
Identities = 122/358 (34%), Positives = 196/358 (54%)
Query: 73 YAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHV 132
Y M Q+ + V+S V T+ D + V ++ Y+Q L G G+
Sbjct: 91 YKMGQMFIYDMVYSHLDVEEFEHYTLDDVDTAFEKVEQVKYNQTVVLKGDS-GVHFTALP 149
Query: 133 AGHLLGGTVWKITK-DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQP 191
AGH+LGG++W+I + GED++Y VD+N +KE+HLNG ++F RP +LIT A++ Q
Sbjct: 150 AGHMLGGSIWRICRVTGEDIVYCVDFNHKKERHLNGCSFDNFNRPHLLITGAHHISLPQM 209
Query: 192 PRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW--AEHSLN-YPIYFLT 247
R+ R E I +T+R G+ ++ +D+AGRVLEL +L+ W A+ L+ Y + ++
Sbjct: 210 RRKDRDEQLVTKILRTVRQKGDCMIVIDTAGRVLELAHLLDQLWSNADAGLSTYNLVMMS 269
Query: 248 YVSSSTIDYVKSFLEWMGDSITKSFETS-RDNAFLLKHVTLLINKSELDNAPDGPKLVLA 306
+V+SS + + KS LEWM + + K +S R N F LKHVTL + EL PK+VL
Sbjct: 270 HVASSVVQFAKSQLEWMNEKLFKYDSSSARYNPFTLKHVTLCHSHQELMRVRS-PKVVLC 328
Query: 307 SMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML-----QADP-----PPKAVK 356
S +E+GFS ++F++W SD +N V+ T R TLA L +A+ + +
Sbjct: 329 SSQDMESGFSRELFLDWCSDPRNGVILTARPASFTLAAKLVNMAERANDGVLKHEDRLIS 388
Query: 357 VTMSRRVPLVGEELIAYEEEQTRLKKEEA-LKASLVKEE-ESKASLGPDNNLSGDPMV 412
+ + +RV L GEEL+ Y+ + EE L+ + + ++ S D++ P+V
Sbjct: 389 LVVKKRVALEGEELLEYKRRKAERDAEETRLRMERARRQAQANESDDSDDDDIAAPIV 446
Score = 272 (100.8 bits), Expect = 1.1e-97, Sum P(4) = 1.1e-97
Identities = 79/307 (25%), Positives = 152/307 (49%)
Query: 353 KAVKVTMSRRVPLVGEELIAYEE-------EQTRLKKEEALKASLVKEEESKASLGPDNN 405
+ + + + +RV L GEEL+ Y+ E+TRL+ E A + + E + D++
Sbjct: 385 RLISLVVKKRVALEGEELLEYKRRKAERDAEETRLRMERARRQAQANESDDS----DDDD 440
Query: 406 LSGDPMVIDXXXXXXSADVVEPHGGRYRDILID-------GFVPPSTSVAPMFPFYENNS 458
++ + S D E + DI+ F + PMFP+ E
Sbjct: 441 IAAPIVPRHSEKDFRSFDGSENDAHTF-DIMAKWDNQQKASFFKTTKKSFPMFPYIEEKV 499
Query: 459 EWDDFGEVINPDDYII-------KDEDMDQAAMHIGGDDGKLDEGSASLILDAKPSKVVS 511
+WDD+GEVI P+DY + K ++ D+ + + + + + + + ++ P+K V
Sbjct: 500 KWDDYGEVIKPEDYTVISKIDLRKGQNKDEPVV-VKKREEEEEVYNPNDHVEEMPTKCVE 558
Query: 512 NELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCLKHV-- 569
+ V+V C + FI+YEG +DG S K +L+ + P ++++VHGS + T L +
Sbjct: 559 FKNRVEVSCRIEFIEYEGISDGESTKKLLAGLLPRQIIVVHGSRDDTRDLVAYFADSGFD 618
Query: 570 CPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGD-YEIAWVDAEVGKTENGML 628
+ P+ +D + + Y+V LS+ L++++ FK++ + +AW+DA V + E +
Sbjct: 619 TTMLKAPEAGALVDASVESFIYQVALSDALLADIQFKEVSEGNSLAWIDARVMEKE-AID 677
Query: 629 SLLPIST 635
++L + T
Sbjct: 678 NMLAVGT 684
Score = 250 (93.1 bits), Expect = 1.1e-97, Sum P(4) = 1.1e-97
Identities = 46/108 (42%), Positives = 67/108 (62%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++ SG +E PL YL+ +DG L+DCGW++ F + L I AVL+
Sbjct: 1 MTSIIKLKVFSGAKDEGPLCYLLQVDGDYILLDCGWDERFGLQYFEELKPFIPKISAVLI 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRSV 108
SHPD LHLG LPY + + GL+APV++T PVY++G + +YD S V
Sbjct: 61 SHPDPLHLGGLPYLVSKCGLTAPVYATVPVYKMGQMFIYDMVYSHLDV 108
Score = 117 (46.2 bits), Expect = 1.1e-97, Sum P(4) = 1.1e-97
Identities = 37/103 (35%), Positives = 51/103 (49%)
Query: 621 GKTENGMLSLLPISTPAPP-HKSVLVGDLKMADLKPFLSSKGIQVEFAGGALRC-GEYVT 678
GK G L L P+ P H++V V D K++D K L+ KG + EF G L G +
Sbjct: 752 GKIR-GNLILDPLPKRLIPIHQAVFVNDPKLSDFKNLLTDKGYKAEFLSGTLLINGGNCS 810
Query: 679 IRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 721
IR+ +G Q+ EG +DYYK+R Y QF +L
Sbjct: 811 IRR--------NDTGVFQM--EGAFTKDYYKLRRLFYDQFAVL 843
>POMBASE|SPBC1709.15c [details] [associations]
symbol:cft2 "cleavage factor two Cft2/polyadenylation
factor CPSF-73 (predicted)" species:4896 "Schizosaccharomyces
pombe" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
"nucleus" evidence=IDA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IDA]
[GO:0005849 "mRNA cleavage factor complex" evidence=ISO]
[GO:0006378 "mRNA polyadenylation" evidence=ISO] [GO:0006379 "mRNA
cleavage" evidence=ISO] InterPro:IPR027075 PomBase:SPBC1709.15c
Pfam:PF07521 EMBL:CU329671 GO:GO:0006378 GenomeReviews:CU329671_GR
GO:GO:0005847 GO:GO:0006379 PIR:T39643 RefSeq:NP_595448.1
ProteinModelPortal:O74740 STRING:O74740 EnsemblFungi:SPBC1709.15c.1
GeneID:2539954 KEGG:spo:SPBC1709.15c eggNOG:COG1236 KO:K14402
OMA:ISSIATP OrthoDB:EOG4WWVSN NextBio:20801097 InterPro:IPR022712
InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
Pfam:PF10996 Pfam:PF13299 SMART:SM01027 Uniprot:O74740
Length = 797
Score = 563 (203.2 bits), Expect = 2.6e-89, Sum P(3) = 2.6e-89
Identities = 134/342 (39%), Positives = 200/342 (58%)
Query: 23 VSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAM-KQLGLS 81
+ +DG + ID G +D SL P +V D +LLSH D H+G L YA K +
Sbjct: 18 IELDGIHIYIDPGSDD----SLKHP--EVPEQPDLILLSHSDLAHIGGLVYAYYKYDWKN 71
Query: 82 APVFSTEPVYRLGLLTMYD----QYLSRRS----------VTRLTYSQNYHLSGKGEGIV 127
A +++T P +G +TM D Y+S S + L Y Q L GK G+
Sbjct: 72 AYIYATLPTINMGRMTMLDAIKSNYISDMSKADVDAVFDSIIPLRYQQPTLLLGKCSGLT 131
Query: 128 VAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGT-------VLESFVRPAVLI 180
+ + AGH LGGT+W + K+ E V+YAVD+N K+KHLNG +LE+ RP LI
Sbjct: 132 ITAYNAGHTLGGTLWSLIKESESVLYAVDWNHSKDKHLNGAALYSNGHILEALNRPNTLI 191
Query: 181 TDAYNALHNQPPRQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA--EH 237
TDA N+L + P R++R E F +++ +L GG VLLPVD+A RVLEL IL+++W+ +
Sbjct: 192 TDANNSLVSIPSRKKRDEAFIESVMSSLLKGGTVLLPVDAASRVLELCCILDNHWSASQP 251
Query: 238 SLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNA 297
L +PI FL+ S+ TIDY KS +EWMGD+I + F + +N +++ + + S++ +
Sbjct: 252 PLPFPILFLSPTSTKTIDYAKSMIEWMGDNIVRDFGIN-ENLLEFRNINTITDFSQISHI 310
Query: 298 PDGPKLVLASMASLEAGFSHDIFVEWASDVKN-LVLFTERGQ 338
GPK++LA+ +LE GFS I ++ S+ N L+LFT+R +
Sbjct: 311 GPGPKVILATALTLECGFSQRILLDLMSENSNDLILFTQRSR 352
Score = 262 (97.3 bits), Expect = 2.6e-89, Sum P(3) = 2.6e-89
Identities = 63/189 (33%), Positives = 104/189 (55%)
Query: 450 MFPFYENNSEWDDFGEVINPDDYIIKDEDMDQAAMHIGGDDGKLDEGSA----SLILDAK 505
MFP+ E D++GE+I D+ + +E + + DD L + S I D
Sbjct: 484 MFPYIEKRRRIDEYGEIIKHQDFSMINEPANTLELENDSDDNALSNSNGKRKWSEINDGL 543
Query: 506 ------------PSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHG 553
PSK++++E T++V C + FID EG DGRS+KTI+ V P +LVL+H
Sbjct: 544 QQKKEEEDEDEVPSKIITDEKTIRVSCQVQFIDIEGLHDGRSLKTIIPQVNPRRLVLIHA 603
Query: 554 SAEATEHLKQHC--LKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDY 611
S E E +K+ C L VY P E I+V+ D+ A+ ++L++ L+ N+++ K+G+
Sbjct: 604 STEEKEDMKKTCASLSAFTKDVYIPNYGEIINVSIDVNAFSLKLADDLIKNLIWTKVGNC 663
Query: 612 EIAWVDAEV 620
E++ + A+V
Sbjct: 664 EVSHMLAKV 672
Score = 99 (39.9 bits), Expect = 2.6e-89, Sum P(3) = 2.6e-89
Identities = 28/80 (35%), Positives = 43/80 (53%)
Query: 637 APPHKSVLVGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKVGPAGQKGGGSGTQ 695
AP +LVG++++A L+ L +GI E G G L CG V +RK+ GG
Sbjct: 722 APRAAPLLVGNIRLAYLRKALLDQGISAELKGEGVLLCGGAVAVRKLS------GG---- 771
Query: 696 QIVIEGPLCEDYYKIRAYLY 715
+I +EG L +++IR +Y
Sbjct: 772 KISVEGSLSNRFFEIRKLVY 791
Score = 97 (39.2 bits), Expect = 5.3e-72, Sum P(3) = 5.3e-72
Identities = 41/153 (26%), Positives = 73/153 (47%)
Query: 353 KAVKVTMSRRVPLVGEELIAYEE-EQTRLKKEE---ALK---ASLVKEEESKASLGPDNN 405
+AVK+ + PL GEEL +Y+E E ++ K+ AL+ +++ E+ S +S D++
Sbjct: 386 QAVKI--KTKEPLEGEELRSYQELEFSKRNKDAEDTALEFRNRTILDEDLSSSSSSEDDD 443
Query: 406 LSGDPMVIDXXXXXXSADVVEPHGGRYRDI-LIDGFVPPSTSVAPMFPFYENNSEWDDFG 464
L + V SA ++ G+ D+ L D V + MFP+ E D++G
Sbjct: 444 LDLNTEV-PHVALGSSAFLM----GKSFDLNLRDPAVQALHTKYKMFPYIEKRRRIDEYG 498
Query: 465 EVINPDDYIIKDEDMDQAAMHIGGDDGKLDEGS 497
E+I D+ + +E + + DD L +
Sbjct: 499 EIIKHQDFSMINEPANTLELENDSDDNALSNSN 531
>UNIPROTKB|F1SD85
symbol:CPSF2 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0006379
"mRNA cleavage" evidence=IEA] [GO:0006378 "mRNA polyadenylation"
evidence=IEA] [GO:0005847 "mRNA cleavage and polyadenylation
specificity factor complex" evidence=IEA] InterPro:IPR001279
InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 GO:GO:0016787
InterPro:IPR022712 PANTHER:PTHR11203:SF5 Pfam:PF10996 SMART:SM01027
GeneTree:ENSGT00700000104551 EMBL:CU468363
Ensembl:ENSSSCT00000002717 OMA:GANDESP Uniprot:F1SD85
Length = 385
Score = 573 (206.8 bits), Expect = 9.2e-86, Sum P(2) = 9.2e-86
Identities = 116/271 (42%), Positives = 169/271 (62%)
Query: 95 LLTMYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED-VIY 153
L T+ D + + +L +SQ +L GKG G+ + P AGH++GGT+WKI KDGE+ ++Y
Sbjct: 113 LFTLDDVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVY 172
Query: 154 AVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQR-EMFQDAISKTLRAGGN 212
AVD+N ++E HLNG LE RP++LITD++NA + QP R+QR E + +TLR G+
Sbjct: 173 AVDFNHKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGS 232
Query: 213 VLLPVDSAGRVLELLLILEDYWAEHSLNYPIY---FLTYVSSSTIDYVKSFLEWMGDSIT 269
VL+ VD+AGRVLEL +L+ W +Y L VS + +++ KS +EWM D +
Sbjct: 233 VLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLM 292
Query: 270 KSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKN 329
+ FE R+N F +H++L S+L P PK+VLAS LE GFS D+F++W D KN
Sbjct: 293 RCFEDKRNNPFQFRHLSLCHGLSDLARVPS-PKVVLASQPDLECGFSRDLFIQWCQDPKN 351
Query: 330 LVLFTERGQFGTLARMLQADPPPKAVKVTMS 360
++ T R GTLAR L +P K ++ +S
Sbjct: 352 SIILTYRTTPGTLARFLIDNPSEKITEIEVS 382
Score = 304 (112.1 bits), Expect = 9.2e-86, Sum P(2) = 9.2e-86
Identities = 56/112 (50%), Positives = 78/112 (69%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++T LSGV E+ L YL+ +D F FL+DCGW++HF ++ L K IDAVLL
Sbjct: 1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRSVTRLT 112
SHPD LHLGALPYA+ +LGL+ +++T PVY++G + MYD Y SR + T
Sbjct: 61 SHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFT 112
>UNIPROTKB|G4N6C6
symbol:MGG_06570 "Uncharacterized protein" species:242507
"Magnaporthe oryzae 70-15" [GO:0003674 "molecular_function"
evidence=ND] [GO:0005634 "nucleus" evidence=ISS] [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=ISS] [GO:0005849 "mRNA cleavage factor complex"
evidence=ISS] [GO:0006378 "mRNA polyadenylation" evidence=ISS]
[GO:0006379 "mRNA cleavage" evidence=ISS] InterPro:IPR027075
Pfam:PF07521 GO:GO:0006378 EMBL:CM001234 GO:GO:0005847
GO:GO:0006379 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
SMART:SM01027 RefSeq:XP_003716967.1 EnsemblFungi:MGG_06570T0
GeneID:2684725 KEGG:mgr:MGG_06570 Uniprot:G4N6C6
Length = 962
Score = 213 (80.0 bits), Expect = 9.6e-44, Sum P(6) = 9.6e-44
Identities = 57/176 (32%), Positives = 80/176 (45%)
Query: 125 GIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEK-----------HLNG--TVLE 171
G+ + + AGH LGGT+W I E ++YAVD+N ++ H G V+E
Sbjct: 174 GLTITAYNAGHSLGGTIWHIQHGMESIVYAVDWNLARDNVYAGAAWMGGGHGGGGAEVIE 233
Query: 172 SFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILE 231
+P L+ A + + D + + GG VL+PVDS+ RVLEL +LE
Sbjct: 234 QLRKPTALVCSTRTAEGGLTRAARDKQLLDTMRMAISRGGTVLIPVDSSARVLELAYLLE 293
Query: 232 DYW-AEHSLN------YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAF 280
W +E S +Y STI KS EWM +SI + FE D F
Sbjct: 294 HAWRSEASTEGGGLSTAKLYLAGRSVHSTIKLAKSMFEWMDNSIVQEFEAGADQGF 349
Score = 175 (66.7 bits), Expect = 9.6e-44, Sum P(6) = 9.6e-44
Identities = 46/158 (29%), Positives = 80/158 (50%)
Query: 476 DEDMDQAAMHIGGDDGKLDEGSASLILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRS 535
D D QAA D+ L E ++ P+K+V TV V L ID+ G D RS
Sbjct: 668 DADAAQAASGPAPDELDLVEDVEEEVVTG-PAKLVHTSTTVSVNLRLALIDFSGLHDRRS 726
Query: 536 IKTILSHVAPLKLVLVHGSAEATEHLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQL 595
+ ++ + P KL+LV GSA+ TE + C ++ V+TP + +D + D A+ V+L
Sbjct: 727 LAMLIPLIQPRKLILVAGSADETEAVADDCRRNAI-EVFTPPVGAVVDASVDTNAWVVKL 785
Query: 596 SEKLMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLLPI 633
++ L+ + ++++ I V A++ T + +P+
Sbjct: 786 ADPLVKRLKWQQVRGLGIVTVTAQLTATPAAQKNGIPL 823
Score = 150 (57.9 bits), Expect = 9.6e-44, Sum P(6) = 9.6e-44
Identities = 36/101 (35%), Positives = 53/101 (52%)
Query: 8 TPLSGVFNENPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDT 65
+PL G +E S L+ +DG LID GW++ FD L+ + K T+ +LL+H
Sbjct: 5 SPLQGALSEATASQSLLELDGGVKVLIDIGWDETFDVEKLKEVEKQVPTLSLILLTHATV 64
Query: 66 LHLGALPYAMKQLGLSA--PVFSTEPVYRLGLLTMYDQYLS 104
HL AL + K L A P+++T+P LG + D Y S
Sbjct: 65 PHLSALVHCCKNFPLFARIPIYATQPAIDLGRTLIQDLYSS 105
Score = 77 (32.2 bits), Expect = 9.6e-44, Sum P(6) = 9.6e-44
Identities = 23/63 (36%), Positives = 37/63 (58%)
Query: 280 FLLKHVTLLINKSE----LDNAPDG--PKLVLASMASLEAGFSHDIFVEWASDVKNLVLF 333
F K++ LL K++ L+ + D K++LA+ SLE GFS DI A+D +N+V+
Sbjct: 369 FDFKYLRLLDRKAQVLKLLEPSTDELRGKVILATDTSLEWGFSKDIISAIANDSRNMVIL 428
Query: 334 TER 336
E+
Sbjct: 429 PEK 431
Score = 70 (29.7 bits), Expect = 4.3e-33, Sum P(6) = 4.3e-33
Identities = 26/82 (31%), Positives = 41/82 (50%)
Query: 610 DYEIAWVDAEVGKTENGMLSLLPISTPAPPHKS--VL-VGDLKMADLKPFLSSKGIQVEF 666
D E D +VG L +LP++ + + VL VG+L++ADL+ + + G +F
Sbjct: 844 DQEPTAEDEDVGVMPT--LDVLPVAMVSASRSAAQVLHVGELRLADLRRTMQNLGHSADF 901
Query: 667 AG-GALRCGEYVTIRKVGPAGQ 687
G G L V +RK AG+
Sbjct: 902 RGEGTLLIDGTVVVRKTA-AGR 922
Score = 67 (28.6 bits), Expect = 9.6e-44, Sum P(6) = 9.6e-44
Identities = 12/28 (42%), Positives = 17/28 (60%)
Query: 450 MFPFYENNSEWDDFGEVINPDDYIIKDE 477
MFP D+FGE+I P+DY+ +E
Sbjct: 592 MFPLAVRRKRNDEFGELIRPEDYLRAEE 619
Score = 42 (19.8 bits), Expect = 9.6e-44, Sum P(6) = 9.6e-44
Identities = 7/23 (30%), Positives = 15/23 (65%)
Query: 353 KAVKVTMSRRVPLVGEELIAYEE 375
+ +++ S++VPL EL Y++
Sbjct: 476 RELQIRESKKVPLADSELSIYQQ 498
>TAIR|locus:2206076
symbol:CPSF73-I "cleavage and polyadenylation specificity
factor 73-I" species:3702 "Arabidopsis thaliana" [GO:0005634
"nucleus" evidence=ISM;IDA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=ISS]
[GO:0006378 "mRNA polyadenylation" evidence=ISS] [GO:0016787
"hydrolase activity" evidence=IEA] [GO:0005515 "protein binding"
evidence=IPI] [GO:0000278 "mitotic cell cycle" evidence=RCA]
[GO:0006346 "methylation-dependent chromatin silencing"
evidence=RCA] [GO:0006396 "RNA processing" evidence=RCA]
[GO:0006397 "mRNA processing" evidence=RCA] [GO:0009855
"determination of bilateral symmetry" evidence=RCA] [GO:0010014
"meristem initiation" evidence=RCA] [GO:0010073 "meristem
maintenance" evidence=RCA] [GO:0016246 "RNA interference"
evidence=RCA] [GO:0031507 "heterochromatin assembly" evidence=RCA]
[GO:0045787 "positive regulation of cell cycle" evidence=RCA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0005634 GO:GO:0006397
GO:GO:0090305 EMBL:AC018908 GO:GO:0004518 eggNOG:COG1236
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
EMBL:AY140900 EMBL:AY150478 EMBL:AY074280 EMBL:AK316692
EMBL:AK316794 IPI:IPI00533462 PIR:G96635 RefSeq:NP_001031215.1
RefSeq:NP_176297.1 RefSeq:NP_849835.1 UniGene:At.23510
ProteinModelPortal:Q9C952 SMR:Q9C952 IntAct:Q9C952 STRING:Q9C952
PaxDb:Q9C952 PRIDE:Q9C952 EnsemblPlants:AT1G61010.1
EnsemblPlants:AT1G61010.2 EnsemblPlants:AT1G61010.3 GeneID:842393
KEGG:ath:AT1G61010 TAIR:At1g61010 HOGENOM:HOG000203394
InParanoid:Q9C952 KO:K14403 OMA:YVSFSAH PhylomeDB:Q9C952
ProtClustDB:CLSN2681829 Genevestigator:Q9C952 InterPro:IPR021718
Pfam:PF11718 SMART:SM01098 Uniprot:Q9C952
Length = 693
Score = 403 (146.9 bits), Expect = 8.6e-41, Sum P(2) = 8.6e-41
Identities = 116/386 (30%), Positives = 192/386 (49%)
Query: 2 GTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQP-LSKV-ASTIDAVL 59
G + VTPL +S G N L DCG + + P ++ S+ID +L
Sbjct: 19 GDQLIVTPLGAGSEVGRSCVYMSFRGKNILFDCGIHPAYSGMAALPYFDEIDPSSIDVLL 78
Query: 60 LSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRSVTRLTYSQ- 115
++H H +LPY +++ + VF +T+ +Y+L LLT Y + +S+ SV + + +
Sbjct: 79 ITHFHIDHAASLPYFLEKTTFNGRVFMTHATKAIYKL-LLTDYVK-VSKVSVEDMLFDEQ 136
Query: 116 ------------NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEK 163
++H + + GI + AGH+LG ++ + G ++Y DY+R +++
Sbjct: 137 DINKSMDKIEVIDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRILYTGDYSREEDR 196
Query: 164 HLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGR 222
HL L F P + I ++ + + R RE F D I T+ GG VL+P + GR
Sbjct: 197 HLRAAELPQF-SPDICIIESTSGVQLHQSRHIREKRFTDVIHSTVAQGGRVLIPAFALGR 255
Query: 223 VLELLLILEDYWAEH-SL-NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAF 280
ELLLIL++YWA H L N PIY+ + ++ + ++++ M D I F S N F
Sbjct: 256 AQELLLILDEYWANHPDLHNIPIYYASPLAKKCMAVYQTYILSMNDRIRNQFANS--NPF 313
Query: 281 LLKHVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQF 339
+ KH++ L + +D+ D GP +V+A+ L++G S +F W SD KN +
Sbjct: 314 VFKHISPL---NSIDDFNDVGPSVVMATPGGLQSGLSRQLFDSWCSDKKNACIIPGYMVE 370
Query: 340 GTLARMLQADPPPKAVKVTMSRRVPL 365
GTLA+ + +P K V + PL
Sbjct: 371 GTLAKTIINEP--KEVTLMNGLTAPL 394
Score = 101 (40.6 bits), Expect = 8.6e-41, Sum P(2) = 8.6e-41
Identities = 37/136 (27%), Positives = 64/136 (47%)
Query: 491 GKLDEGSASLILDAKPSKV-VSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLV 549
G + EG+ + + +P +V + N LT + + +I + AD T L + P ++
Sbjct: 366 GYMVEGTLAKTIINEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNII 425
Query: 550 LVHGSAEATEHLKQHCLKHVCP---HVYTPQIEETIDV--TSDLCAYKV-QLSEKL---- 599
LVHG A LKQ L + TP+ E++++ S+ A + +L+EK
Sbjct: 426 LVHGEANEMMRLKQKLLTEFPDGNTKIMTPKNCESVEMYFNSEKLAKTIGRLAEKTPDVG 485
Query: 600 --MSNVLFKKLGDYEI 613
+S +L KK Y+I
Sbjct: 486 DTVSGILVKKGFTYQI 501
>ASPGD|ASPL0000040420
symbol:AN3082 species:162425 "Emericella nidulans"
[GO:0008150 "biological_process" evidence=ND] [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR027075 EMBL:BN001306 EMBL:AACD01000051 eggNOG:COG1236
KO:K14402 OrthoDB:EOG4WWVSN InterPro:IPR022712 InterPro:IPR025069
PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
OMA:YSQPHQP RefSeq:XP_660686.1 EnsemblFungi:CADANIAT00009996
GeneID:2874210 KEGG:ani:AN3082.2 HOGENOM:HOG000196366
Uniprot:Q5B8P8
Length = 1005
Score = 172 (65.6 bits), Expect = 3.2e-38, Sum P(6) = 3.2e-38
Identities = 45/127 (35%), Positives = 66/127 (51%)
Query: 125 GIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHL-----------NGT-VLES 172
G+ + + AGH +GGT+W I E ++YAVD+N+ +E + +GT V+E
Sbjct: 188 GLTLTAYNAGHTVGGTIWHIQHGMESIVYAVDWNQARESVVAGAAWFGGSGASGTEVIEQ 247
Query: 173 FVRPAVLITDAYNALHNQPP--RQQR-EMFQDAISKTLRAGGNVLLPVDSAGRVLELLLI 229
+P LI P R++R E+ D I TL GG VL+P D++ RVLEL
Sbjct: 248 LRKPTALICSTRGGDKFALPGGRKKRDEILLDMIRSTLVKGGTVLIPTDTSARVLELAYA 307
Query: 230 LEDYWAE 236
LE W +
Sbjct: 308 LEHAWRD 314
Score = 148 (57.2 bits), Expect = 3.2e-38, Sum P(6) = 3.2e-38
Identities = 38/102 (37%), Positives = 53/102 (51%)
Query: 8 TPLSGVFNE-NPLSY-LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPD 64
TPL G + + S ++ +DG L+D GW+D FDP L L K ST+ +LL+H
Sbjct: 5 TPLLGAQSSASKASQSILELDGGVKILVDVGWDDTFDPLDLVELEKHVSTLSLILLTHAT 64
Query: 65 TLHLGALPYAMKQLGL--SAPVFSTEPVYRLGLLTMYDQYLS 104
H+GA + K L PV++T PV LG + D Y S
Sbjct: 65 PSHIGAYVHCCKTFPLFTQIPVYATSPVIALGRTLLQDVYES 106
Score = 134 (52.2 bits), Expect = 4.2e-34, Sum P(6) = 4.2e-34
Identities = 40/122 (32%), Positives = 60/122 (49%)
Query: 166 NGT-VLESFVRPAVLITDAYNALHNQPP--RQQR-EMFQDAISKTLRAGGNVLLPVDSAG 221
+GT V+E +P LI P R++R E+ D I TL GG VL+P D++
Sbjct: 240 SGTEVIEQLRKPTALICSTRGGDKFALPGGRKKRDEILLDMIRSTLVKGGTVLIPTDTSA 299
Query: 222 RVLELLLILEDYWAEHSLNYP--------IYFLTYVSSSTIDYVKSFLEWMGDSITKSFE 273
RVLEL LE W + + + +Y ++T+ +S LEWM +SI + FE
Sbjct: 300 RVLELAYALEHAWRDAARDTQDDVLKRGGLYLAGRKVNTTMRLARSMLEWMDESIVREFE 359
Query: 274 TS 275
+
Sbjct: 360 AA 361
Score = 132 (51.5 bits), Expect = 3.2e-38, Sum P(6) = 3.2e-38
Identities = 45/143 (31%), Positives = 68/143 (47%)
Query: 475 KDEDM-DQAAMHIGGDDGKLDEGSASLILDAK----PSKVVSNELTVQVKCLLIFIDYEG 529
KD DM D +M GDD D +A D + P+K + + T+ + L F+D+ G
Sbjct: 687 KDTDMLDNLSMTDIGDD--TDTAAAPGEEDDQAFEGPAKAIYEKATLTINARLAFVDFTG 744
Query: 530 RADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCLKHV-------CPH-----VYTPQ 577
D RS++ ++ + P KL+LV G E T L C K + P ++TP
Sbjct: 745 LHDKRSLEMLIPLIQPRKLILVGGMKEETMALATECQKLLGVKTGADAPSPTAAVIFTPT 804
Query: 578 IEETIDVTSDLCAYKVQLSEKLM 600
E ID + D A+ V+LS L+
Sbjct: 805 NGEIIDASVDTSAWTVKLSNNLV 827
Score = 80 (33.2 bits), Expect = 3.2e-38, Sum P(6) = 3.2e-38
Identities = 17/40 (42%), Positives = 25/40 (62%)
Query: 645 VGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKVG 683
VGDL++ADL+ + + G + EF G G L +V +RK G
Sbjct: 923 VGDLRLADLRKIMQNAGHKAEFRGEGTLLIDGFVAVRKSG 962
Score = 75 (31.5 bits), Expect = 3.2e-38, Sum P(6) = 3.2e-38
Identities = 21/59 (35%), Positives = 33/59 (55%)
Query: 280 FLLKHVTLLINKSELD---NAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 335
F KH+ + K +L+ N P PK++LAS +SL+ GF+ + A NL+L T+
Sbjct: 391 FTFKHLKTVERKKKLEQLLNDPT-PKVILASDSSLDWGFAKESLRLLAGGENNLLLLTD 448
Score = 69 (29.3 bits), Expect = 3.2e-38, Sum P(6) = 3.2e-38
Identities = 13/36 (36%), Positives = 22/36 (61%)
Query: 450 MFPFYENNSEWDDFGEVINPDDYIIKDE----DMDQ 481
MFP+ + D++GE+I P++Y+ +E DM Q
Sbjct: 616 MFPYVAPRKKGDEYGEIIRPEEYLRAEEREEIDMQQ 651
Score = 37 (18.1 bits), Expect = 1.0e-20, Sum P(5) = 1.0e-20
Identities = 13/44 (29%), Positives = 19/44 (43%)
Query: 181 TDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVL 224
TD++ Q E QD ++ + G +L V S GR L
Sbjct: 460 TDSHRRTLGSMIWQWYEERQDGVALEKGSDGEMLEQVHSGGREL 503
>UNIPROTKB|F1NV30
symbol:CPSF3L "Integrator complex subunit 11" species:9031
"Gallus gallus" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 OMA:MAVEYMS GeneTree:ENSGT00700000104485
IPI:IPI00571913 EMBL:AADN02040858 Ensembl:ENSGALT00000002586
Uniprot:F1NV30
Length = 600
Score = 358 (131.1 bits), Expect = 7.3e-35, Sum P(2) = 7.3e-35
Identities = 98/309 (31%), Positives = 154/309 (49%)
Query: 41 DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLG--LLTM 98
D S + ++ +D V++SH H GALPY + +G P++ T P + LL
Sbjct: 47 DFSYITQNGRLTDFLDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTKAICPILLED 106
Query: 99 YDQY-LSRRSVTRLTYSQNY----------HLSGK---GEGIVVAPHVAGHLLGGTVWKI 144
Y + + ++ T SQ HL E + + + AGH+LG +++I
Sbjct: 107 YRKITVDKKGETNFFTSQMIKDCMKKVVAVHLHQTVQVDEELEIKAYYAGHVLGAAMFQI 166
Query: 145 TKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAI 203
E V+Y DYN ++HL ++ RP +LIT++ A + ++ RE F +
Sbjct: 167 KVGCESVVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDFLKKV 225
Query: 204 SKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEW 263
+T+ GG VL+PV + GR EL ++LE +W +L PIYF T ++ Y K F+ W
Sbjct: 226 HETVERGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITW 285
Query: 264 MGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEW 323
I K+F R N F KH+ +++ DN P GP +V A+ L AG S IF +W
Sbjct: 286 TNQKIRKTF-VQR-NMFEFKHIKAF-DRAFADN-P-GPMVVFATPGMLHAGQSLQIFRKW 340
Query: 324 ASDVKNLVL 332
A + KN+V+
Sbjct: 341 AGNEKNMVI 349
Score = 127 (49.8 bits), Expect = 6.5e-09, Sum P(2) = 6.5e-09
Identities = 32/92 (34%), Positives = 48/92 (52%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG +ND F D S + ++ +D
Sbjct: 4 IKVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGYNDDRRFPDFSYITQNGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP 89
V++SH H GALPY + +G P++ T P
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTHP 95
Score = 90 (36.7 bits), Expect = 7.3e-35, Sum P(2) = 7.3e-35
Identities = 21/84 (25%), Positives = 38/84 (45%)
Query: 501 ILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEH 560
IL + + ++VK + ++ + AD + I ++ P ++LVHG A+ E
Sbjct: 363 ILSGQRKLEMEGRQILEVKMQVEYMSFSAHADAKGIMQLIRQAEPRNVLLVHGEAKKMEF 422
Query: 561 LKQHCLKHVCPHVYTPQIEETIDV 584
LKQ + + Y P ET +
Sbjct: 423 LKQKIEQEFHVNCYMPANGETTSI 446
Score = 40 (19.1 bits), Expect = 0.00085, Sum P(2) = 0.00085
Identities = 13/57 (22%), Positives = 25/57 (43%)
Query: 564 HCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEV 620
H K +CP + + T+D + + Q+ + M V+ L ++ VD E+
Sbjct: 94 HPTKAICPILLEDYRKITVDKKGETNFFTSQMIKDCMKKVVAVHL--HQTVQVDEEL 148
>UNIPROTKB|Q5TA45
symbol:CPSF3L "Integrator complex subunit 11" species:9606
"Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] [GO:0005737 "cytoplasm"
evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 GO:GO:0005634 GO:GO:0005737 GO:GO:0016787
EMBL:AL139287 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
Pfam:PF10996 SMART:SM01027 KO:K13148 HOVERGEN:HBG080215
OrthoDB:EOG4GXFMB CTD:54973 EMBL:AL136813 EMBL:AK000549
EMBL:AK021939 EMBL:AK023356 EMBL:AK297350 EMBL:CR533557
EMBL:BC000675 EMBL:BC007978 EMBL:BC013904 EMBL:BK005728
EMBL:BK005673 IPI:IPI00063404 IPI:IPI00306882 IPI:IPI00514973
RefSeq:NP_001243392.1 RefSeq:NP_060341.2 UniGene:Hs.6449
ProteinModelPortal:Q5TA45 SMR:Q5TA45 IntAct:Q5TA45
MINT:MINT-1482228 STRING:Q5TA45 PhosphoSite:Q5TA45 DMDM:118572557
PaxDb:Q5TA45 PRIDE:Q5TA45 DNASU:54973 Ensembl:ENST00000419704
Ensembl:ENST00000435064 Ensembl:ENST00000450926
Ensembl:ENST00000545578 GeneID:54973 KEGG:hsa:54973 UCSC:uc001aee.1
UCSC:uc001aeh.1 UCSC:uc009vjz.1 GeneCards:GC01M001236
HGNC:HGNC:26052 HPA:HPA028379 HPA:HPA029025 MIM:611354
neXtProt:NX_Q5TA45 PharmGKB:PA142672080 InParanoid:Q5TA45
PhylomeDB:Q5TA45 ChiTaRS:CPSF3L GenomeRNAi:54973 NextBio:58222
ArrayExpress:Q5TA45 Bgee:Q5TA45 Genevestigator:Q5TA45
GermOnline:ENSG00000127054 Uniprot:Q5TA45
Length = 600
Score = 355 (130.0 bits), Expect = 8.1e-35, Sum P(2) = 8.1e-35
Identities = 96/309 (31%), Positives = 153/309 (49%)
Query: 41 DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLG--LLTM 98
D S + ++ +D V++SH H GALPY + +G P++ T P + LL
Sbjct: 47 DFSYITQNGRLTDFLDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLED 106
Query: 99 YDQY-LSRRSVTRLTYSQNY----------HLSGK---GEGIVVAPHVAGHLLGGTVWKI 144
Y + + ++ SQ HL + + + + AGH+LG +++I
Sbjct: 107 YRKIAVDKKGEANFFTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQI 166
Query: 145 TKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAI 203
E V+Y DYN ++HL ++ RP +LIT++ A + ++ RE F +
Sbjct: 167 KVGSESVVYTGDYNMTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKV 225
Query: 204 SKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEW 263
+T+ GG VL+PV + GR EL ++LE +W +L PIYF T ++ Y K F+ W
Sbjct: 226 HETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPW 285
Query: 264 MGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEW 323
I K+F R N F KH+ +++ DN P GP +V A+ L AG S IF +W
Sbjct: 286 TNQKIRKTF-VQR-NMFEFKHIKAF-DRAFADN-P-GPMVVFATPGMLHAGQSLQIFRKW 340
Query: 324 ASDVKNLVL 332
A + KN+V+
Sbjct: 341 AGNEKNMVI 349
Score = 127 (49.8 bits), Expect = 3.2e-09, Sum P(2) = 3.2e-09
Identities = 33/103 (32%), Positives = 52/103 (50%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG +ND F D S + ++ +D
Sbjct: 4 IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD 100
V++SH H GALPY + +G P++ T P + + + D
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLED 106
Score = 93 (37.8 bits), Expect = 8.1e-35, Sum P(2) = 8.1e-35
Identities = 21/82 (25%), Positives = 39/82 (47%)
Query: 501 ILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEH 560
IL + + ++VK + ++ + AD + I ++ P ++LVHG A+ E
Sbjct: 363 ILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKMEF 422
Query: 561 LKQHCLKHVCPHVYTPQIEETI 582
LKQ + + + Y P ET+
Sbjct: 423 LKQKIEQELRVNCYMPANGETV 444
Score = 37 (18.1 bits), Expect = 6.0e-29, Sum P(2) = 6.0e-29
Identities = 5/9 (55%), Positives = 8/9 (88%)
Query: 561 LKQHCLKHV 569
LK HC++H+
Sbjct: 538 LKDHCVQHL 546
>UNIPROTKB|G3V1S5
symbol:CPSF3L "Integrator complex subunit 11" species:9606
"Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0016787 EMBL:AL139287 EMBL:CH471183 InterPro:IPR022712
InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 KO:K13148 OMA:MAVEYMS
CTD:54973 UniGene:Hs.6449 GeneID:54973 KEGG:hsa:54973
HGNC:HGNC:26052 ChiTaRS:CPSF3L GenomeRNAi:54973
RefSeq:NP_001243385.1 ProteinModelPortal:G3V1S5 SMR:G3V1S5
Ensembl:ENST00000540437 ArrayExpress:G3V1S5 Bgee:G3V1S5
Uniprot:G3V1S5
Length = 606
Score = 355 (130.0 bits), Expect = 9.0e-35, Sum P(2) = 9.0e-35
Identities = 96/309 (31%), Positives = 153/309 (49%)
Query: 41 DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLG--LLTM 98
D S + ++ +D V++SH H GALPY + +G P++ T P + LL
Sbjct: 53 DFSYITQNGRLTDFLDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLED 112
Query: 99 YDQY-LSRRSVTRLTYSQNY----------HLSGK---GEGIVVAPHVAGHLLGGTVWKI 144
Y + + ++ SQ HL + + + + AGH+LG +++I
Sbjct: 113 YRKIAVDKKGEANFFTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQI 172
Query: 145 TKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAI 203
E V+Y DYN ++HL ++ RP +LIT++ A + ++ RE F +
Sbjct: 173 KVGSESVVYTGDYNMTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKV 231
Query: 204 SKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEW 263
+T+ GG VL+PV + GR EL ++LE +W +L PIYF T ++ Y K F+ W
Sbjct: 232 HETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPW 291
Query: 264 MGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEW 323
I K+F R N F KH+ +++ DN P GP +V A+ L AG S IF +W
Sbjct: 292 TNQKIRKTF-VQR-NMFEFKHIKAF-DRAFADN-P-GPMVVFATPGMLHAGQSLQIFRKW 346
Query: 324 ASDVKNLVL 332
A + KN+V+
Sbjct: 347 AGNEKNMVI 355
Score = 116 (45.9 bits), Expect = 5.0e-08, Sum P(2) = 5.0e-08
Identities = 29/86 (33%), Positives = 45/86 (52%)
Query: 22 LVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
LVSI G N ++DCG +ND F D S + ++ +D V++SH H GALPY
Sbjct: 27 LVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHFHLDHCGALPYF 86
Query: 75 MKQLGLSAPVFSTEPVYRLGLLTMYD 100
+ +G P++ T P + + + D
Sbjct: 87 SEMVGYDGPIYMTHPTQAICPILLED 112
Score = 93 (37.8 bits), Expect = 9.0e-35, Sum P(2) = 9.0e-35
Identities = 21/82 (25%), Positives = 39/82 (47%)
Query: 501 ILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEH 560
IL + + ++VK + ++ + AD + I ++ P ++LVHG A+ E
Sbjct: 369 ILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKMEF 428
Query: 561 LKQHCLKHVCPHVYTPQIEETI 582
LKQ + + + Y P ET+
Sbjct: 429 LKQKIEQELRVNCYMPANGETV 450
Score = 37 (18.1 bits), Expect = 6.6e-29, Sum P(2) = 6.6e-29
Identities = 5/9 (55%), Positives = 8/9 (88%)
Query: 561 LKQHCLKHV 569
LK HC++H+
Sbjct: 544 LKDHCVQHL 552
>UNIPROTKB|Q5ZIH0
symbol:CPSF3L "Integrator complex subunit 11" species:9031
"Gallus gallus" [GO:0016787 "hydrolase activity" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 GO:GO:0016787 eggNOG:COG1236
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
HOGENOM:HOG000231294 KO:K13148 GO:GO:0032039 GO:GO:0016180
HOVERGEN:HBG080215 OrthoDB:EOG4GXFMB EMBL:AJ720814 IPI:IPI00571913
RefSeq:NP_001012854.1 UniGene:Gga.13403 ProteinModelPortal:Q5ZIH0
STRING:Q5ZIH0 GeneID:419418 KEGG:gga:419418 CTD:54973
InParanoid:Q5ZIH0 NextBio:20822477 Uniprot:Q5ZIH0
Length = 600
Score = 358 (131.1 bits), Expect = 9.4e-35, Sum P(2) = 9.4e-35
Identities = 98/309 (31%), Positives = 154/309 (49%)
Query: 41 DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLG--LLTM 98
D S + ++ +D V++SH H GALPY + +G P++ T P + LL
Sbjct: 47 DFSYITQNGRLTDFLDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTKAICPILLED 106
Query: 99 YDQY-LSRRSVTRLTYSQNY----------HLSGK---GEGIVVAPHVAGHLLGGTVWKI 144
Y + + ++ T SQ HL E + + + AGH+LG +++I
Sbjct: 107 YRKITVDKKGETNFFTSQMIKDCMKKVVAVHLHQTVQVDEELEIKAYYAGHVLGAAMFQI 166
Query: 145 TKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAI 203
E V+Y DYN ++HL ++ RP +LIT++ A + ++ RE F +
Sbjct: 167 KVGCESVVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDFLKKV 225
Query: 204 SKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEW 263
+T+ GG VL+PV + GR EL ++LE +W +L PIYF T ++ Y K F+ W
Sbjct: 226 HETVERGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITW 285
Query: 264 MGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEW 323
I K+F R N F KH+ +++ DN P GP +V A+ L AG S IF +W
Sbjct: 286 TNQKIRKTF-VQR-NMFEFKHIKAF-DRAFADN-P-GPMVVFATPGMLHAGQSLQIFRKW 340
Query: 324 ASDVKNLVL 332
A + KN+V+
Sbjct: 341 AGNEKNMVI 349
Score = 127 (49.8 bits), Expect = 8.2e-09, Sum P(2) = 8.2e-09
Identities = 32/92 (34%), Positives = 48/92 (52%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG +ND F D S + ++ +D
Sbjct: 4 IKVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGYNDDRRFPDFSYITQNGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP 89
V++SH H GALPY + +G P++ T P
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTHP 95
Score = 89 (36.4 bits), Expect = 9.4e-35, Sum P(2) = 9.4e-35
Identities = 21/84 (25%), Positives = 38/84 (45%)
Query: 501 ILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEH 560
IL + + ++VK + ++ + AD + I ++ P ++LVHG A+ E
Sbjct: 363 ILSGQRKLEMEGRQILEVKMQVEYMSFSAHADAKGIMQLIRQAEPRNVLLVHGEAKKMEF 422
Query: 561 LKQHCLKHVCPHVYTPQIEETIDV 584
LKQ + + Y P ET +
Sbjct: 423 LKQKIEQEFHVNCYMPANGETTTI 446
Score = 40 (19.1 bits), Expect = 0.00085, Sum P(2) = 0.00085
Identities = 13/57 (22%), Positives = 25/57 (43%)
Query: 564 HCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEV 620
H K +CP + + T+D + + Q+ + M V+ L ++ VD E+
Sbjct: 94 HPTKAICPILLEDYRKITVDKKGETNFFTSQMIKDCMKKVVAVHL--HQTVQVDEEL 148
>UNIPROTKB|E1B7Q9
symbol:CPSF3L "Integrator complex subunit 11" species:9913
"Bos taurus" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 OMA:MAVEYMS GeneTree:ENSGT00700000104485
EMBL:DAAA02043243 IPI:IPI00971575 Ensembl:ENSBTAT00000010020
Uniprot:E1B7Q9
Length = 598
Score = 354 (129.7 bits), Expect = 1.7e-34, Sum P(2) = 1.7e-34
Identities = 95/308 (30%), Positives = 152/308 (49%)
Query: 41 DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLG--LLTM 98
D S + ++ +D V++SH H GALPY + +G P++ T+P + LL
Sbjct: 47 DFSYITRSGRLTDFLDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTQPTQAICPILLED 106
Query: 99 YDQYLSRRSVTRLTYSQNY----------HLSGK---GEGIVVAPHVAGHLLGGTVWKIT 145
Y + + SQ HL + + + + AGH+LG +++I
Sbjct: 107 YRKIAVDKGEANFFTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIK 166
Query: 146 KDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAIS 204
E V+Y DYN ++HL ++ RP++LIT++ A + ++ RE F +
Sbjct: 167 VGSESVVYTGDYNMTPDRHLGAAWIDK-CRPSLLITESTYATTIRDSKRCRERDFLKKVH 225
Query: 205 KTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWM 264
+T+ GG VL+PV + GR EL ++LE +W L PIYF T ++ Y K F+ W
Sbjct: 226 ETVERGGKVLIPVFALGRAQELCILLETFWERMDLKAPIYFSTGLTEKANHYYKLFIPWT 285
Query: 265 GDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWA 324
I K+F R N F KH+ +++ D+ P GP +V A+ L AG S IF +WA
Sbjct: 286 NQKIRKTF-VQR-NMFEFKHIKAF-DRAFADS-P-GPMVVFATPGMLHAGQSLQIFRKWA 340
Query: 325 SDVKNLVL 332
+ KN+V+
Sbjct: 341 GNEKNMVI 348
Score = 125 (49.1 bits), Expect = 8.3e-09, Sum P(2) = 8.3e-09
Identities = 31/103 (30%), Positives = 51/103 (49%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG + F D S + ++ +D
Sbjct: 4 IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFSDDRRFPDFSYITRSGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD 100
V++SH H GALPY + +G P++ T+P + + + D
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTQPTQAICPILLED 106
Score = 91 (37.1 bits), Expect = 1.7e-34, Sum P(2) = 1.7e-34
Identities = 21/82 (25%), Positives = 38/82 (46%)
Query: 501 ILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEH 560
IL + + ++VK + ++ + AD + I ++ P ++LVHG A+ E
Sbjct: 362 ILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPENVLLVHGEAKKMEF 421
Query: 561 LKQHCLKHVCPHVYTPQIEETI 582
LKQ + + Y P ET+
Sbjct: 422 LKQKIEQEFRVNCYMPANGETV 443
Score = 37 (18.1 bits), Expect = 7.8e-29, Sum P(2) = 7.8e-29
Identities = 5/9 (55%), Positives = 8/9 (88%)
Query: 561 LKQHCLKHV 569
LK HC++H+
Sbjct: 537 LKDHCVQHL 545
>UNIPROTKB|Q2YDM2
symbol:CPSF3L "Integrator complex subunit 11" species:9913
"Bos taurus" [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005634
"nucleus" evidence=IEA] [GO:0016787 "hydrolase activity"
evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 GO:GO:0005634 GO:GO:0005737 GO:GO:0016787
eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 HOGENOM:HOG000231294 EMBL:BC110155 IPI:IPI00725178
UniGene:Bt.4894 ProteinModelPortal:Q2YDM2 STRING:Q2YDM2
PRIDE:Q2YDM2 HOVERGEN:HBG080215 InParanoid:Q2YDM2 OrthoDB:EOG4GXFMB
Uniprot:Q2YDM2
Length = 599
Score = 351 (128.6 bits), Expect = 4.0e-34, Sum P(2) = 4.0e-34
Identities = 93/300 (31%), Positives = 151/300 (50%)
Query: 50 KVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLG--LLTMYDQY-LSRR 106
++ +D V++SH H GALPY + +G P++ T+P + LL Y + + ++
Sbjct: 56 RLTDFLDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTQPTQAICPILLEDYRKIAVDKK 115
Query: 107 SVTRLTYSQNY----------HLSGK---GEGIVVAPHVAGHLLGGTVWKITKDGEDVIY 153
SQ HL + + + + AGH+LG +++I E V+Y
Sbjct: 116 GEANFFTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVY 175
Query: 154 AVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGN 212
DYN ++HL ++ RP++LIT++ A + ++ RE F + +T+ GG
Sbjct: 176 TGDYNMTPDRHLGAAWIDK-CRPSLLITESTYATTIRDSKRCRERDFLKKVHETVERGGK 234
Query: 213 VLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF 272
VL+PV + GR EL ++LE +W L PIYF T ++ Y K F+ W I K+F
Sbjct: 235 VLIPVFALGRAQELCILLETFWERMDLKAPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF 294
Query: 273 ETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVL 332
R N F KH+ +++ D+ P GP +V A+ L AG S IF +WA + KN+V+
Sbjct: 295 -VQR-NMFEFKHIKAF-DRAFADS-P-GPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVI 349
Score = 120 (47.3 bits), Expect = 2.9e-08, Sum P(2) = 2.9e-08
Identities = 30/103 (29%), Positives = 49/103 (47%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQP-------LSKVASTIDA 57
++VTPL + LVSI G N ++DCG + F P ++ +D
Sbjct: 4 IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFSDDRRFPDFSYNTRSGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD 100
V++SH H GALPY + +G P++ T+P + + + D
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTQPTQAICPILLED 106
Score = 91 (37.1 bits), Expect = 4.0e-34, Sum P(2) = 4.0e-34
Identities = 21/82 (25%), Positives = 38/82 (46%)
Query: 501 ILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEH 560
IL + + ++VK + ++ + AD + I ++ P ++LVHG A+ E
Sbjct: 363 ILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPENVLLVHGEAKKMEF 422
Query: 561 LKQHCLKHVCPHVYTPQIEETI 582
LKQ + + Y P ET+
Sbjct: 423 LKQKIEQEFRVNCYMPANGETV 444
Score = 37 (18.1 bits), Expect = 1.8e-28, Sum P(2) = 1.8e-28
Identities = 5/9 (55%), Positives = 8/9 (88%)
Query: 561 LKQHCLKHV 569
LK HC++H+
Sbjct: 538 LKDHCVQHL 546
>MGI|MGI:1919207 [details] [associations]
symbol:Cpsf3l "cleavage and polyadenylation specific factor
3-like" species:10090 "Mus musculus" [GO:0003674
"molecular_function" evidence=ND] [GO:0005634 "nucleus"
evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0008150
"biological_process" evidence=ND] [GO:0016787 "hydrolase activity"
evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 MGI:MGI:1919207 GO:GO:0005737 GO:GO:0016787
eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 HOGENOM:HOG000231294 KO:K13148 OMA:MAVEYMS
GeneTree:ENSGT00700000104485 GO:GO:0032039 GO:GO:0016180
HOVERGEN:HBG080215 OrthoDB:EOG4GXFMB CTD:54973 EMBL:AK010425
EMBL:AK090206 EMBL:AK150436 EMBL:AK152740 EMBL:AK167607
EMBL:AK172533 EMBL:BC008240 EMBL:BC011155 IPI:IPI00467084
RefSeq:NP_082296.1 UniGene:Mm.259270 UniGene:Mm.475640
ProteinModelPortal:Q9CWS4 SMR:Q9CWS4 STRING:Q9CWS4
PhosphoSite:Q9CWS4 PaxDb:Q9CWS4 PRIDE:Q9CWS4
Ensembl:ENSMUST00000030901 GeneID:71957 KEGG:mmu:71957
InParanoid:Q9CWS4 NextBio:335052 Bgee:Q9CWS4 Genevestigator:Q9CWS4
GermOnline:ENSMUSG00000029034 Uniprot:Q9CWS4
Length = 600
Score = 356 (130.4 bits), Expect = 4.3e-34, Sum P(2) = 4.3e-34
Identities = 96/309 (31%), Positives = 153/309 (49%)
Query: 41 DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLG--LLTM 98
D S + ++ +D V++SH H GALPY + +G P++ T P + LL
Sbjct: 47 DFSYITQSGRLTDFLDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLED 106
Query: 99 YDQY-LSRRSVTRLTYSQNY----------HLSGK---GEGIVVAPHVAGHLLGGTVWKI 144
Y + + ++ SQ HL + + + + AGH+LG +++I
Sbjct: 107 YRKIAVDKKGEANFFTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQI 166
Query: 145 TKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAI 203
E V+Y DYN ++HL ++ RP +LIT++ A + ++ RE F +
Sbjct: 167 KVGSESVVYTGDYNMTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKV 225
Query: 204 SKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEW 263
+T+ GG VL+PV + GR EL ++LE +W +L PIYF T ++ Y K F+ W
Sbjct: 226 HETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFITW 285
Query: 264 MGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEW 323
I K+F R N F KH+ +++ DN P GP +V A+ L AG S IF +W
Sbjct: 286 TNQKIRKTF-VQR-NMFEFKHIKAF-DRTFADN-P-GPMVVFATPGMLHAGQSLQIFRKW 340
Query: 324 ASDVKNLVL 332
A + KN+V+
Sbjct: 341 AGNEKNMVI 349
Score = 131 (51.2 bits), Expect = 7.9e-09, Sum P(2) = 7.9e-09
Identities = 33/103 (32%), Positives = 52/103 (50%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG +ND F D S + ++ +D
Sbjct: 4 IRVTPLGAGQDVGRSCILVSISGKNVMLDCGMHMGYNDDRRFPDFSYITQSGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD 100
V++SH H GALPY + +G P++ T P + + + D
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLED 106
Score = 85 (35.0 bits), Expect = 4.3e-34, Sum P(2) = 4.3e-34
Identities = 20/82 (24%), Positives = 37/82 (45%)
Query: 501 ILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEH 560
IL + + ++VK + ++ + AD + I ++ P ++LVHG A+ E
Sbjct: 363 ILSGQRKLEMEGRQMLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKMEF 422
Query: 561 LKQHCLKHVCPHVYTPQIEETI 582
L+Q + Y P ET+
Sbjct: 423 LRQKIEQEFRVSCYMPANGETV 444
Score = 37 (18.1 bits), Expect = 4.6e-29, Sum P(2) = 4.6e-29
Identities = 5/9 (55%), Positives = 8/9 (88%)
Query: 561 LKQHCLKHV 569
LK HC++H+
Sbjct: 538 LKDHCVQHL 546
>RGD|1306841 [details] [associations]
symbol:Cpsf3l "cleavage and polyadenylation specific factor
3-like" species:10116 "Rattus norvegicus" [GO:0005634 "nucleus"
evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0016787
"hydrolase activity" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 RGD:1306841 GO:GO:0005634 GO:GO:0005737
GO:GO:0016787 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
Pfam:PF10996 SMART:SM01027 HOGENOM:HOG000231294 KO:K13148
OMA:MAVEYMS GeneTree:ENSGT00700000104485 HOVERGEN:HBG080215
OrthoDB:EOG4GXFMB CTD:54973 EMBL:BC105303 IPI:IPI00365477
RefSeq:NP_001029064.1 UniGene:Rn.98615 ProteinModelPortal:Q3MHC2
STRING:Q3MHC2 Ensembl:ENSRNOT00000026725 GeneID:298688
KEGG:rno:298688 InParanoid:Q3MHC2 NextBio:644186
Genevestigator:Q3MHC2 GermOnline:ENSRNOG00000019712 Uniprot:Q3MHC2
Length = 600
Score = 356 (130.4 bits), Expect = 4.3e-34, Sum P(2) = 4.3e-34
Identities = 96/309 (31%), Positives = 153/309 (49%)
Query: 41 DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLG--LLTM 98
D S + ++ +D V++SH H GALPY + +G P++ T P + LL
Sbjct: 47 DFSYITQSGRLTDFLDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLED 106
Query: 99 YDQY-LSRRSVTRLTYSQNY----------HLSGK---GEGIVVAPHVAGHLLGGTVWKI 144
Y + + ++ SQ HL + + + + AGH+LG +++I
Sbjct: 107 YRKIAVDKKGEANFFTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQI 166
Query: 145 TKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAI 203
E V+Y DYN ++HL ++ RP +LIT++ A + ++ RE F +
Sbjct: 167 KVGSESVVYTGDYNMTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKV 225
Query: 204 SKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEW 263
+T+ GG VL+PV + GR EL ++LE +W +L PIYF T ++ Y K F+ W
Sbjct: 226 HETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFITW 285
Query: 264 MGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEW 323
I K+F R N F KH+ +++ DN P GP +V A+ L AG S IF +W
Sbjct: 286 TNQKIRKTF-VQR-NMFEFKHIKAF-DRTFADN-P-GPMVVFATPGMLHAGQSLQIFRKW 340
Query: 324 ASDVKNLVL 332
A + KN+V+
Sbjct: 341 AGNEKNMVI 349
Score = 131 (51.2 bits), Expect = 7.9e-09, Sum P(2) = 7.9e-09
Identities = 33/103 (32%), Positives = 52/103 (50%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG +ND F D S + ++ +D
Sbjct: 4 IRVTPLGAGQDVGRSCILVSISGKNVMLDCGMHMGYNDDRRFPDFSYITQSGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD 100
V++SH H GALPY + +G P++ T P + + + D
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLED 106
Score = 85 (35.0 bits), Expect = 4.3e-34, Sum P(2) = 4.3e-34
Identities = 20/82 (24%), Positives = 37/82 (45%)
Query: 501 ILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEH 560
IL + + ++VK + ++ + AD + I ++ P ++LVHG A+ E
Sbjct: 363 ILSGQRKLEMEGRQMLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKMEF 422
Query: 561 LKQHCLKHVCPHVYTPQIEETI 582
L+Q + Y P ET+
Sbjct: 423 LRQKIEQEFRVSCYMPANGETV 444
Score = 37 (18.1 bits), Expect = 4.6e-29, Sum P(2) = 4.6e-29
Identities = 5/9 (55%), Positives = 8/9 (88%)
Query: 561 LKQHCLKHV 569
LK HC++H+
Sbjct: 538 LKDHCVQHL 546
>UNIPROTKB|F1SD84 [details] [associations]
symbol:LOC100625560 "Uncharacterized protein" species:9823
"Sus scrofa" [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006378
"mRNA polyadenylation" evidence=IEA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IEA]
InterPro:IPR027075 Pfam:PF07521 InterPro:IPR025069
InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF13299
GeneTree:ENSGT00700000104551 EMBL:CU468363
Ensembl:ENSSSCT00000002718 OMA:VEGCASE Uniprot:F1SD84
Length = 304
Score = 252 (93.8 bits), Expect = 8.6e-34, Sum P(2) = 8.6e-34
Identities = 56/174 (32%), Positives = 103/174 (59%)
Query: 466 VINPDDYIIKD---EDMDQAAMHIGGDDGKLDEGSASLILDAKPSKVVSNELTVQVKCLL 522
+ P+D+++ + + +++ + G +G DE + D P+K +S ++++K +
Sbjct: 1 LFRPEDFLVPELQATEEEKSKLESGLTNG--DEPMDQDLSDV-PTKCISTTESIEIKARV 57
Query: 523 IFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCL----KHVCPHVYTPQI 578
+IDYEGR+DG SIK I++ + P +L++VHG EA++ L + C K + VY P++
Sbjct: 58 TYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAECCRAFGGKDI--KVYMPKL 115
Query: 579 EETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVGKTENGML 628
ET+D TS+ Y+V+L + L+S++ F K D E+AW+D V K + G++
Sbjct: 116 HETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDMRVSKVDTGVI 169
Score = 151 (58.2 bits), Expect = 8.6e-34, Sum P(2) = 8.6e-34
Identities = 37/104 (35%), Positives = 57/104 (54%)
Query: 624 ENGMLS-LLPISTPAPPHK-----SVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYV 677
E G S ++P P PPH+ SV + + +++D K L +GIQ EF GG L C V
Sbjct: 211 ETGEESEIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQV 270
Query: 678 TIRKVGPAGQKGGGSGTQQIVIEGPLCEDYYKIRAYLYSQFYLL 721
+R+ + T +I +EG LC+D+Y+IR LY Q+ ++
Sbjct: 271 AVRR----------TETGRIGLEGCLCQDFYRIRDLLYEQYAIV 304
Score = 39 (18.8 bits), Expect = 2.0e-07, Sum P(2) = 2.0e-07
Identities = 14/49 (28%), Positives = 21/49 (42%)
Query: 454 YENNSEWDDFGEVIN---PDDYII------KDEDMDQAAMHIGGDDGKL 493
YE S+ D ++IN P II +D+ + GG D K+
Sbjct: 62 YEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAECCRAFGGKDIKV 110
>UNIPROTKB|E2QY53 [details] [associations]
symbol:CPSF3L "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 KO:K13148 OMA:MAVEYMS GeneTree:ENSGT00700000104485
EMBL:AAEX03003844 RefSeq:XP_003639102.1 Ensembl:ENSCAFT00000030626
GeneID:100855777 KEGG:cfa:100855777 Uniprot:E2QY53
Length = 600
Score = 348 (127.6 bits), Expect = 9.1e-34, Sum P(2) = 9.1e-34
Identities = 95/309 (30%), Positives = 152/309 (49%)
Query: 41 DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLG--LLTM 98
D S + ++ +D V++SH H GALPY + +G P++ T P + LL
Sbjct: 47 DFSYITRNGRLTDFLDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLED 106
Query: 99 YDQY-LSRRSVTRLTYSQNY----------HLSGK---GEGIVVAPHVAGHLLGGTVWKI 144
Y + + ++ SQ HL + + + + AGH+LG +++I
Sbjct: 107 YRKIAVDKKGEANFFTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQI 166
Query: 145 TKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAI 203
E V+Y DYN ++HL ++ RP +LIT++ A + ++ RE F +
Sbjct: 167 KVGSESVVYTGDYNMTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKV 225
Query: 204 SKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEW 263
+ + GG VL+PV + GR EL ++LE +W +L PIYF T ++ Y K F+ W
Sbjct: 226 HEAVERGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITW 285
Query: 264 MGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEW 323
I K+F R N F KH+ +++ DN P GP +V A+ L AG S IF +W
Sbjct: 286 TNQKIRKTF-VQR-NMFEFKHIKAF-DRAFADN-P-GPMVVFATPGMLHAGQSLQIFRKW 340
Query: 324 ASDVKNLVL 332
A + KN+V+
Sbjct: 341 AGNEKNMVI 349
Score = 126 (49.4 bits), Expect = 6.6e-09, Sum P(2) = 6.6e-09
Identities = 33/103 (32%), Positives = 52/103 (50%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG +ND F D S + ++ +D
Sbjct: 4 IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITRNGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD 100
V++SH H GALPY + +G P++ T P + + + D
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLED 106
Score = 91 (37.1 bits), Expect = 9.1e-34, Sum P(2) = 9.1e-34
Identities = 21/82 (25%), Positives = 38/82 (46%)
Query: 501 ILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEH 560
IL + + ++VK + ++ + AD + I ++ P ++LVHG A+ E
Sbjct: 363 ILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKMEF 422
Query: 561 LKQHCLKHVCPHVYTPQIEETI 582
LKQ + + Y P ET+
Sbjct: 423 LKQKIEQEFRVNCYMPANGETV 444
Score = 37 (18.1 bits), Expect = 4.1e-28, Sum P(2) = 4.1e-28
Identities = 5/9 (55%), Positives = 8/9 (88%)
Query: 561 LKQHCLKHV 569
LK HC++H+
Sbjct: 538 LKDHCVQHL 546
>UNIPROTKB|F1RJE8 [details] [associations]
symbol:CPSF3L "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 KO:K13148 OMA:MAVEYMS GeneTree:ENSGT00700000104485
EMBL:FP102596 RefSeq:XP_003127541.3 Ensembl:ENSSSCT00000003708
GeneID:100523908 KEGG:ssc:100523908 Uniprot:F1RJE8
Length = 599
Score = 349 (127.9 bits), Expect = 1.4e-33, Sum P(2) = 1.4e-33
Identities = 95/309 (30%), Positives = 153/309 (49%)
Query: 41 DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLG--LLTM 98
D S + ++ +D V++SH H GALPY + +G P++ T+P + LL
Sbjct: 47 DFSYITRHGRLTDFLDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTQPTQAICPILLED 106
Query: 99 YDQY-LSRRSVTRLTYSQNY----------HLSGK---GEGIVVAPHVAGHLLGGTVWKI 144
Y + + ++ SQ HL + + + + AGH+LG +++I
Sbjct: 107 YRKIAVDKKGEANFFTSQMIKDCMKKAVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQI 166
Query: 145 TKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAI 203
E V+Y DYN ++HL ++ RP +LIT++ A + ++ RE F +
Sbjct: 167 KVGSESVVYTGDYNMTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKV 225
Query: 204 SKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEW 263
+T+ GG VL+PV + GR EL ++LE +W L PIYF T ++ Y K F+ W
Sbjct: 226 HETVERGGKVLIPVFALGRAQELCILLETFWERMDLKAPIYFSTGLTEKANHYYKLFIPW 285
Query: 264 MGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEW 323
I K+F R N F KH+ +++ D+ P GP +V A+ L AG S IF +W
Sbjct: 286 TNQKIRKTF-VQR-NMFEFKHIKAF-DRAFADS-P-GPMVVFATPGMLHAGQSLQIFRKW 340
Query: 324 ASDVKNLVL 332
A + KN+V+
Sbjct: 341 AGNEKNMVI 349
Score = 125 (49.1 bits), Expect = 1.7e-08, Sum P(2) = 1.7e-08
Identities = 31/103 (30%), Positives = 51/103 (49%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHF-------DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG + F D S + ++ +D
Sbjct: 4 IRVTPLGAGQDVGRSCILVSIGGKNVMLDCGMHMGFSDDRRFPDFSYITRHGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD 100
V++SH H GALPY + +G P++ T+P + + + D
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTQPTQAICPILLED 106
Score = 88 (36.0 bits), Expect = 1.4e-33, Sum P(2) = 1.4e-33
Identities = 21/82 (25%), Positives = 37/82 (45%)
Query: 501 ILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEH 560
IL + + ++VK + ++ + AD + I ++ P ++LVHG A+ E
Sbjct: 363 ILSGQRKLELEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPENVLLVHGEAKKMEF 422
Query: 561 LKQHCLKHVCPHVYTPQIEETI 582
LKQ + Y P ET+
Sbjct: 423 LKQKIEQEFRLSCYMPANGETV 444
Score = 37 (18.1 bits), Expect = 3.1e-28, Sum P(2) = 3.1e-28
Identities = 5/9 (55%), Positives = 8/9 (88%)
Query: 561 LKQHCLKHV 569
LK HC++H+
Sbjct: 538 LKDHCVQHL 546
>POMBASE|SPAC17G6.16c [details] [associations]
symbol:ysh1 "mRNA cleavage and polyadenylation
specificity factor complex endoribonuclease subunit Ysh1"
species:4896 "Schizosaccharomyces pombe" [GO:0004521
"endoribonuclease activity" evidence=ISO] [GO:0005634 "nucleus"
evidence=IDA] [GO:0005829 "cytosol" evidence=IDA] [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=IDA] [GO:0006378 "mRNA polyadenylation" evidence=IC]
[GO:0006379 "mRNA cleavage" evidence=IC] [GO:0046872 "metal ion
binding" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 PomBase:SPAC17G6.16c Pfam:PF07521 GO:GO:0005829
EMBL:CU329670 GenomeReviews:CU329670_GR GO:GO:0046872 GO:GO:0006378
GO:GO:0004521 GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
HOGENOM:HOG000203394 InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
OrthoDB:EOG41ZJK7 PIR:T37848 RefSeq:NP_594263.2 STRING:O13794
EnsemblFungi:SPAC17G6.16c.1 GeneID:2542258 NextBio:20803322
Uniprot:O13794
Length = 757
Score = 394 (143.8 bits), Expect = 1.6e-33, P = 1.6e-33
Identities = 104/337 (30%), Positives = 178/337 (52%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRL--GLLTMY---------DQ 101
ST+D +L+SH H+ +LPY M++ VF T P + LL+ Y DQ
Sbjct: 69 STVDVLLISHFHLDHVASLPYVMQKTNFRGRVFMTHPTKAVCKWLLSDYVKVSNVGMEDQ 128
Query: 102 YLSRRSVTRL---TYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 158
+ + + +YH + + EGI P+ AGH+LG ++ + G ++++ DY+
Sbjct: 129 LYDEKDLLAAFDRIEAVDYHSTIEVEGIKFTPYHAGHVLGACMYFVEMAGVNILFTGDYS 188
Query: 159 RRKEKHLNGTVLESFVRPAVLITDA-YNALHNQPPRQQREMFQDAISKTLRAGGNVLLPV 217
R +++HL+ + RP VLIT++ Y +QP ++ + I T+R GG VL+PV
Sbjct: 189 REEDRHLHVAEVPP-KRPDVLITESTYGTASHQPRLEKEARLLNIIHSTIRNGGRVLMPV 247
Query: 218 DSAGRVLELLLILEDYWAEH-SL-NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 275
+ GR ELLLIL++YW H L + PIY+ + ++ + ++++ M D+I K F +
Sbjct: 248 FALGRAQELLLILDEYWNNHLDLRSVPIYYASSLARKCMAIFQTYVNMMNDNIRKIF--A 305
Query: 276 RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 335
N F+ + V L N + D+ GP ++LAS L+ G S + WA D +N +L T
Sbjct: 306 ERNPFIFRFVKSLRNLEKFDDI--GPSVILASPGMLQNGVSRTLLERWAPDPRNTLLLTG 363
Query: 336 RGQFGTLARMLQADPPPKAVKVTMSRRVP--LVGEEL 370
GT+A+ + + P + V ++ +++P + EEL
Sbjct: 364 YSVEGTMAKQI-TNEPIEIVSLS-GQKIPRRMAVEEL 398
>FB|FBgn0039691 [details] [associations]
symbol:IntS11 "Integrator 11" species:7227 "Drosophila
melanogaster" [GO:0006379 "mRNA cleavage" evidence=ISS] [GO:0005847
"mRNA cleavage and polyadenylation specificity factor complex"
evidence=ISS] [GO:0006378 "mRNA polyadenylation" evidence=ISS]
[GO:0016787 "hydrolase activity" evidence=IEA] [GO:0034472 "snRNA
3'-end processing" evidence=IDA] [GO:0016180 "snRNA processing"
evidence=ISS] [GO:0032039 "integrator complex" evidence=ISS]
[GO:0022008 "neurogenesis" evidence=IMP] InterPro:IPR001279
SMART:SM00849 Pfam:PF07521 EMBL:AE014297 GO:GO:0022008
GO:GO:0006378 GO:GO:0016787 GO:GO:0005847 GO:GO:0006379
eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 KO:K13148 OMA:MAVEYMS GeneTree:ENSGT00700000104485
GO:GO:0034472 EMBL:AY061097 RefSeq:NP_651721.1 UniGene:Dm.3722
SMR:Q9VAH9 STRING:Q9VAH9 EnsemblMetazoa:FBtr0085476 GeneID:43506
KEGG:dme:Dmel_CG1972 UCSC:CG1972-RA CTD:43506 FlyBase:FBgn0039691
InParanoid:Q9VAH9 OrthoDB:EOG47D7X3 GenomeRNAi:43506 NextBio:834295
Uniprot:Q9VAH9
Length = 597
Score = 351 (128.6 bits), Expect = 5.5e-33, Sum P(2) = 5.5e-33
Identities = 94/309 (30%), Positives = 152/309 (49%)
Query: 41 DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD 100
D S + P + S ID V++SH H GALPY + +G + P++ T P + + + D
Sbjct: 47 DFSYIVPEGPITSHIDCVIISHFHLDHCGALPYMSEIVGYTGPIYMTHPTKAIAPILLED 106
Query: 101 QY---LSRRSVTRLTYSQ------------NYHLSGKGE-GIVVAPHVAGHLLGGTVWKI 144
+ R+ + +Q H S + + + + AGH+LG ++ I
Sbjct: 107 MRKVAVERKGESNFFTTQMIKDCMKKVIPVTLHQSMMVDTDLEIKAYYAGHVLGAAMFWI 166
Query: 145 TKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAI 203
+ V+Y DYN ++HL ++ RP +LI+++ A + ++ RE F +
Sbjct: 167 KVGSQSVVYTGDYNMTPDRHLGAAWIDK-CRPDLLISESTYATTIRDSKRCRERDFLKKV 225
Query: 204 SKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEW 263
+ + GG VL+PV + GR EL ++LE YW +L YPIYF ++ Y K F+ W
Sbjct: 226 HECVAKGGKVLIPVFALGRAQELCILLETYWERMNLKYPIYFALGLTEKANTYYKMFITW 285
Query: 264 MGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEW 323
I K+F R N F KH+ +K+ +DN P G +V A+ L AG S IF +W
Sbjct: 286 TNQKIRKTF-VHR-NMFDFKHIKPF-DKAYIDN-P-GAMVVFATPGMLHAGLSLQIFKKW 340
Query: 324 ASDVKNLVL 332
A + N+V+
Sbjct: 341 APNENNMVI 349
Score = 136 (52.9 bits), Expect = 7.3e-09, Sum P(2) = 7.3e-09
Identities = 33/103 (32%), Positives = 54/103 (52%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WNDH--F-DPSLLQPLSKVASTIDA 57
+++TPL + L+S+ G N ++DCG +ND F D S + P + S ID
Sbjct: 4 IKITPLGAGQDVGRSCLLLSMGGKNIMLDCGMHMGYNDERRFPDFSYIVPEGPITSHIDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD 100
V++SH H GALPY + +G + P++ T P + + + D
Sbjct: 64 VIISHFHLDHCGALPYMSEIVGYTGPIYMTHPTKAIAPILLED 106
Score = 80 (33.2 bits), Expect = 5.5e-33, Sum P(2) = 5.5e-33
Identities = 18/75 (24%), Positives = 34/75 (45%)
Query: 512 NELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCLKHVCP 571
N V+VK + ++ + AD + I ++ + P ++LVHG A + L+
Sbjct: 374 NRQVVEVKMAVEYMSFSAHADAKGIMQLIQNCEPKNVMLVHGEAGKMKFLRSKIKDEFNL 433
Query: 572 HVYTPQIEETIDVTS 586
Y P ET +++
Sbjct: 434 ETYMPANGETCVIST 448
>CGD|CAL0004705 [details] [associations]
symbol:orf19.325 species:5476 "Candida albicans" [GO:0042493
"response to drug" evidence=IMP] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IEA]
[GO:0003723 "RNA binding" evidence=IEA] [GO:0006379 "mRNA cleavage"
evidence=IEA] [GO:0006378 "mRNA polyadenylation" evidence=IEA]
InterPro:IPR027075 CGD:CAL0004705 GO:GO:0042493 EMBL:AACQ01000027
EMBL:AACQ01000026 eggNOG:COG1236 KO:K14402 InterPro:IPR022712
InterPro:IPR025069 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
SMART:SM01027 RefSeq:XP_720020.1 RefSeq:XP_720152.1
ProteinModelPortal:Q5AEE3 STRING:Q5AEE3 GeneID:3638181
GeneID:3638320 KEGG:cal:CaO19.325 KEGG:cal:CaO19.7957
Uniprot:Q5AEE3
Length = 931
Score = 285 (105.4 bits), Expect = 1.9e-32, Sum P(6) = 1.9e-32
Identities = 80/239 (33%), Positives = 116/239 (48%)
Query: 108 VTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN- 166
V L Y Q+ +L +VV P+ AGH LGGT W ITK + VIYA +N K+ LN
Sbjct: 129 VNLLKYQQSLNLFDNK--VVVTPYNAGHSLGGTFWLITKRIDRVIYAPAWNHSKDSFLNS 186
Query: 167 --------GTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVD 218
G S +RP IT A + R++ E F + TL GG +LP
Sbjct: 187 ASFISPSTGNPHLSLLRPTAFIT-ATDMGSVMSHRKRTEKFLQLVDATLANGGAAVLPTS 245
Query: 219 SAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN 278
+GR LEL +++++ + P+YFL+Y + + Y + L+WM S TK +E
Sbjct: 246 LSGRFLELFHLIDEHLKGAPI--PVYFLSYSGTKILTYASNLLDWMSKSFTKEWEELSSV 303
Query: 279 AFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAG-FSHDIFVEWASDVKNLVLFTER 336
F V LL++ SEL GPK+V S L +G S + F +D ++ TE+
Sbjct: 304 PFNPSKVDLLLDPSELLKL-SGPKIVFCSGIDLRSGDISAEAFQYLCNDEHTTIILTEK 361
Score = 77 (32.2 bits), Expect = 1.9e-32, Sum P(6) = 1.9e-32
Identities = 20/68 (29%), Positives = 36/68 (52%)
Query: 645 VGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPL 703
+G++++ DLK L + + EF G L + + +RK+ + SG IVI+G +
Sbjct: 856 IGNIRLPDLKKKLQNLNMTAEFKSEGTLVVNDILAVRKIAYGLVESDESG--DIVIDGNV 913
Query: 704 CEDYYKIR 711
YYK++
Sbjct: 914 GPLYYKVK 921
Score = 71 (30.1 bits), Expect = 1.9e-32, Sum P(6) = 1.9e-32
Identities = 25/85 (29%), Positives = 40/85 (47%)
Query: 22 LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHL-GALPYAMKQ-- 77
L+ D F + D WN D + + + +A+LLSH + G + +K
Sbjct: 20 LLEFDNEFKLIADPSWNG-VDVNAAMFMEEHLKETNAILLSHSTAEFISGFILLCIKFPI 78
Query: 78 LGLSAPVFSTEPVYRLGLLTMYDQY 102
L S PV+ST PV +LG ++ + Y
Sbjct: 79 LMSSIPVYSTLPVNQLGRVSTVEYY 103
Score = 69 (29.3 bits), Expect = 1.9e-32, Sum P(6) = 1.9e-32
Identities = 15/45 (33%), Positives = 26/45 (57%)
Query: 507 SKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLV 551
+K S ++V+C L F+D G+ D RS+ I+ + P L+L+
Sbjct: 632 TKKKSETQKLKVRCGLSFVDLSGQVDLRSLGIIVQALKPYNLILL 676
Score = 67 (28.6 bits), Expect = 3.0e-32, Sum P(6) = 3.0e-32
Identities = 22/70 (31%), Positives = 39/70 (55%)
Query: 451 FPFYE--NNSEWDDFGEVINPDDYIIKDEDMDQAAMHIGGDDGKLDEG-SASLILDAKPS 507
FP++ + ++DD+GEVI +DY DE + + + + G K DE +A+ + +
Sbjct: 537 FPYFATAHKQKFDDYGEVIKIEDYQRHDE-VSHSKIIMEGKR-KFDEKRTANNRRNKNQN 594
Query: 508 KVVSNELTVQ 517
K +N+LT Q
Sbjct: 595 KQQANKLTPQ 604
Score = 54 (24.1 bits), Expect = 1.9e-32, Sum P(6) = 1.9e-32
Identities = 17/63 (26%), Positives = 34/63 (53%)
Query: 348 ADPPPKAVKV-TMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNL 406
A P K + + ++ V L G EL ++E+ + +KE+ L + V++++++ L D
Sbjct: 395 AVPIDKNISLKNWTKEVELTGTELTEFQEKVAQKRKEKLL--AKVRDQKNQNILSADTVD 452
Query: 407 SGD 409
S D
Sbjct: 453 SED 455
Score = 45 (20.9 bits), Expect = 1.9e-32, Sum P(6) = 1.9e-32
Identities = 8/31 (25%), Positives = 22/31 (70%)
Query: 591 YKVQLSEKLMSNVLFKKLGD-YEIAWVDAEV 620
++V L + ++ ++ ++K+GD Y++A + E+
Sbjct: 763 FEVNLDDSIVKDLKWQKIGDDYKVAKLYGEL 793
>UNIPROTKB|Q5AEE3 [details] [associations]
symbol:CFT2 "Putative uncharacterized protein CFT2"
species:237561 "Candida albicans SC5314" [GO:0042493 "response to
drug" evidence=IMP] InterPro:IPR027075 CGD:CAL0004705 GO:GO:0042493
EMBL:AACQ01000027 EMBL:AACQ01000026 eggNOG:COG1236 KO:K14402
InterPro:IPR022712 InterPro:IPR025069 PANTHER:PTHR11203:SF5
Pfam:PF10996 Pfam:PF13299 SMART:SM01027 RefSeq:XP_720020.1
RefSeq:XP_720152.1 ProteinModelPortal:Q5AEE3 STRING:Q5AEE3
GeneID:3638181 GeneID:3638320 KEGG:cal:CaO19.325
KEGG:cal:CaO19.7957 Uniprot:Q5AEE3
Length = 931
Score = 285 (105.4 bits), Expect = 1.9e-32, Sum P(6) = 1.9e-32
Identities = 80/239 (33%), Positives = 116/239 (48%)
Query: 108 VTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN- 166
V L Y Q+ +L +VV P+ AGH LGGT W ITK + VIYA +N K+ LN
Sbjct: 129 VNLLKYQQSLNLFDNK--VVVTPYNAGHSLGGTFWLITKRIDRVIYAPAWNHSKDSFLNS 186
Query: 167 --------GTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVD 218
G S +RP IT A + R++ E F + TL GG +LP
Sbjct: 187 ASFISPSTGNPHLSLLRPTAFIT-ATDMGSVMSHRKRTEKFLQLVDATLANGGAAVLPTS 245
Query: 219 SAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDN 278
+GR LEL +++++ + P+YFL+Y + + Y + L+WM S TK +E
Sbjct: 246 LSGRFLELFHLIDEHLKGAPI--PVYFLSYSGTKILTYASNLLDWMSKSFTKEWEELSSV 303
Query: 279 AFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAG-FSHDIFVEWASDVKNLVLFTER 336
F V LL++ SEL GPK+V S L +G S + F +D ++ TE+
Sbjct: 304 PFNPSKVDLLLDPSELLKL-SGPKIVFCSGIDLRSGDISAEAFQYLCNDEHTTIILTEK 361
Score = 77 (32.2 bits), Expect = 1.9e-32, Sum P(6) = 1.9e-32
Identities = 20/68 (29%), Positives = 36/68 (52%)
Query: 645 VGDLKMADLKPFLSSKGIQVEFAG-GALRCGEYVTIRKVGPAGQKGGGSGTQQIVIEGPL 703
+G++++ DLK L + + EF G L + + +RK+ + SG IVI+G +
Sbjct: 856 IGNIRLPDLKKKLQNLNMTAEFKSEGTLVVNDILAVRKIAYGLVESDESG--DIVIDGNV 913
Query: 704 CEDYYKIR 711
YYK++
Sbjct: 914 GPLYYKVK 921
Score = 71 (30.1 bits), Expect = 1.9e-32, Sum P(6) = 1.9e-32
Identities = 25/85 (29%), Positives = 40/85 (47%)
Query: 22 LVSIDG-FNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHL-GALPYAMKQ-- 77
L+ D F + D WN D + + + +A+LLSH + G + +K
Sbjct: 20 LLEFDNEFKLIADPSWNG-VDVNAAMFMEEHLKETNAILLSHSTAEFISGFILLCIKFPI 78
Query: 78 LGLSAPVFSTEPVYRLGLLTMYDQY 102
L S PV+ST PV +LG ++ + Y
Sbjct: 79 LMSSIPVYSTLPVNQLGRVSTVEYY 103
Score = 69 (29.3 bits), Expect = 1.9e-32, Sum P(6) = 1.9e-32
Identities = 15/45 (33%), Positives = 26/45 (57%)
Query: 507 SKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLV 551
+K S ++V+C L F+D G+ D RS+ I+ + P L+L+
Sbjct: 632 TKKKSETQKLKVRCGLSFVDLSGQVDLRSLGIIVQALKPYNLILL 676
Score = 67 (28.6 bits), Expect = 3.0e-32, Sum P(6) = 3.0e-32
Identities = 22/70 (31%), Positives = 39/70 (55%)
Query: 451 FPFYE--NNSEWDDFGEVINPDDYIIKDEDMDQAAMHIGGDDGKLDEG-SASLILDAKPS 507
FP++ + ++DD+GEVI +DY DE + + + + G K DE +A+ + +
Sbjct: 537 FPYFATAHKQKFDDYGEVIKIEDYQRHDE-VSHSKIIMEGKR-KFDEKRTANNRRNKNQN 594
Query: 508 KVVSNELTVQ 517
K +N+LT Q
Sbjct: 595 KQQANKLTPQ 604
Score = 54 (24.1 bits), Expect = 1.9e-32, Sum P(6) = 1.9e-32
Identities = 17/63 (26%), Positives = 34/63 (53%)
Query: 348 ADPPPKAVKV-TMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLGPDNNL 406
A P K + + ++ V L G EL ++E+ + +KE+ L + V++++++ L D
Sbjct: 395 AVPIDKNISLKNWTKEVELTGTELTEFQEKVAQKRKEKLL--AKVRDQKNQNILSADTVD 452
Query: 407 SGD 409
S D
Sbjct: 453 SED 455
Score = 45 (20.9 bits), Expect = 1.9e-32, Sum P(6) = 1.9e-32
Identities = 8/31 (25%), Positives = 22/31 (70%)
Query: 591 YKVQLSEKLMSNVLFKKLGD-YEIAWVDAEV 620
++V L + ++ ++ ++K+GD Y++A + E+
Sbjct: 763 FEVNLDDSIVKDLKWQKIGDDYKVAKLYGEL 793
>SGD|S000004105 [details] [associations]
symbol:CFT2 "Subunit of the mRNA cleavage and
polyadenylation factor (CPF)" species:4932 "Saccharomyces
cerevisiae" [GO:0005849 "mRNA cleavage factor complex"
evidence=IPI] [GO:0006378 "mRNA polyadenylation" evidence=IDA;TAS]
[GO:0005847 "mRNA cleavage and polyadenylation specificity factor
complex" evidence=IDA;IPI] [GO:0005634 "nucleus" evidence=IEA]
[GO:0006397 "mRNA processing" evidence=IEA] [GO:0006379 "mRNA
cleavage" evidence=IDA;TAS] [GO:0003723 "RNA binding" evidence=IPI]
SGD:S000004105 GO:GO:0006378 EMBL:BK006945 GO:GO:0003723
EMBL:X89514 EMBL:U53878 EMBL:U53877 EMBL:Z73288 GO:GO:0005847
GO:GO:0006379 eggNOG:COG1236 KO:K14402 InterPro:IPR022712
InterPro:IPR025069 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
EMBL:Z73287 PIR:S64952 RefSeq:NP_013216.1 PDB:2I7X PDBsum:2I7X
ProteinModelPortal:Q12102 SMR:Q12102 DIP:DIP-2468N IntAct:Q12102
MINT:MINT-375505 STRING:Q12102 PaxDb:Q12102 PeptideAtlas:Q12102
EnsemblFungi:YLR115W GeneID:850806 KEGG:sce:YLR115W CYGD:YLR115w
GeneTree:ENSGT00700000104551 HOGENOM:HOG000001120 OMA:YSQPHQP
OrthoDB:EOG4W11N8 EvolutionaryTrace:Q12102 NextBio:967034
Genevestigator:Q12102 GermOnline:YLR115W Uniprot:Q12102
Length = 859
Score = 253 (94.1 bits), Expect = 2.5e-32, Sum P(4) = 2.5e-32
Identities = 71/261 (27%), Positives = 129/261 (49%)
Query: 91 YRLGLLTMYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGED 150
Y L + D +S + L YSQ L + +G+ + + AG GG++W I+ E
Sbjct: 114 YDTNKLDLEDIEISFDHIVPLKYSQLVDLRSRYDGLTLLAYNAGVCPGGSIWCISTYSEK 173
Query: 151 VIYAVDYNRRKEKHLN--------GTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDA 202
++YA +N ++ LN G L + +RP+ +IT +QP +++ ++F+D
Sbjct: 174 LVYAKRWNHTRDNILNAASILDATGKPLSTLMRPSAIITTLDRFGSSQPFKKRSKIFKDT 233
Query: 203 ISKTLRAGGNVLLPVDSAGRVLELL-----LILEDYWAEHSLNYPIYFLTYVSSSTIDYV 257
+ K L + G+V++PVD +G+ L+L L+ E P+ L+Y T+ Y
Sbjct: 234 LKKGLSSDGSVIIPVDMSGKFLDLFTQVHELLFESTKINAHTQVPVLILSYARGRTLTYA 293
Query: 258 KSFLEWMGDSITKSFETSRDNA--FLLKHVTLLINKSELDNAPDGPKLVLASMASLEAG- 314
KS LEW+ S+ K++E +R+N F + +I +EL P G K+ S E G
Sbjct: 294 KSMLEWLSPSLLKTWE-NRNNTSPFEIGSRIKIIAPNELSKYP-GSKICFVS----EVGA 347
Query: 315 FSHDIFVEWASDVKNLVLFTE 335
+++ ++ + K ++ T+
Sbjct: 348 LINEVIIKVGNSEKTTLILTK 368
Score = 128 (50.1 bits), Expect = 2.5e-32, Sum P(4) = 2.5e-32
Identities = 47/202 (23%), Positives = 88/202 (43%)
Query: 500 LILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATE 559
L +D SK + + VQ+KC ++ ++ + D RS I + K+VL E
Sbjct: 633 LKIDKTLSKRTISTVNVQLKCSVVILNLQSLVDQRSASIIWPSLKSRKIVLSAPKQIQNE 692
Query: 560 HLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGD-YEIAWVDA 618
+ +K V P + + ++ ++ + + + L + + ++++ D Y +A V
Sbjct: 693 EITAKLIKKNIEVVNMP-LNKIVEFSTTIKTLDISIDSNLDNLLKWQRISDSYTVATVVG 751
Query: 619 EVGK------------TENGMLSLLPISTPAPPHKS--VLVGDLKMADLKPFLSSKGIQV 664
+ K L L P+ + HK+ + +GD+++A LK L+ K
Sbjct: 752 RLVKESLPQVNNHQKTASRSKLVLKPLHGSSRSHKTGALSIGDVRLAQLKKLLTEKNYIA 811
Query: 665 EFAG-GALRCGEYVTIRKVGPA 685
EF G G L E V +RK+ A
Sbjct: 812 EFKGEGTLVINEKVAVRKINDA 833
Score = 98 (39.6 bits), Expect = 1.8e-15, Sum P(4) = 1.8e-15
Identities = 41/177 (23%), Positives = 74/177 (41%)
Query: 242 PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNA-FLLKHVTLLINKSELDNAPDG 300
P+ L+Y T+ Y KS LEW+ S+ K++E + + F + +I +EL P G
Sbjct: 278 PVLILSYARGRTLTYAKSMLEWLSPSLLKTWENRNNTSPFEIGSRIKIIAPNELSKYP-G 336
Query: 301 PKLVLASMAS-------LEAGFSHDIFV-------EWASDVKNLVLFTERGQ--FGTLAR 344
K+ S ++ G S + E AS + ++ E+ + + T
Sbjct: 337 SKICFVSEVGALINEVIIKVGNSEKTTLILTKPSFECASSLDKILEIVEQDERNWKTFPE 396
Query: 345 MLQADPPPKAVKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKASLVKEEESKASLG 401
++ + + + PL EE A++ + K++ K LVK E K + G
Sbjct: 397 DGKSFLCDNYISIDTIKEEPLSKEETEAFKVQLKEKKRDRNKKILLVKRESKKLANG 453
Score = 88 (36.0 bits), Expect = 2.5e-32, Sum P(4) = 2.5e-32
Identities = 30/89 (33%), Positives = 40/89 (44%)
Query: 22 LVSIDGFNFLIDCGWNDH--FDPSLLQPLSKVASTIDAVLLSHPDTLHLGA---LPYAMK 76
+V D LID GWN ++ KV ID ++LS P LGA L Y
Sbjct: 19 VVRFDNVTLLIDPGWNPSKVSYEQCIKYWEKVIPEIDVIILSQPTIECLGAHSLLYYNFT 78
Query: 77 QLGLSA-PVFSTEPVYRLGLLTMYDQYLS 104
+S V++T PV LG ++ D Y S
Sbjct: 79 SHFISRIQVYATLPVINLGRVSTIDSYAS 107
Score = 58 (25.5 bits), Expect = 2.5e-32, Sum P(4) = 2.5e-32
Identities = 14/46 (30%), Positives = 24/46 (52%)
Query: 434 DILIDGFVPPST-SVAPMFPFYENNSEWDDFGEVINPDDYIIKDED 478
++ +D + PS S MFPF + DD+G V++ ++ D D
Sbjct: 519 EVPVDIIIQPSAASKHKMFPFNPAKIKKDDYGTVVDFTMFLPDDSD 564
>SGD|S000004267
symbol:YSH1 "Putative endoribonuclease" species:4932
"Saccharomyces cerevisiae" [GO:0004518 "nuclease activity"
evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0016787
"hydrolase activity" evidence=IEA] [GO:0046872 "metal ion binding"
evidence=IEA] [GO:0006397 "mRNA processing" evidence=IEA]
[GO:0006378 "mRNA polyadenylation" evidence=IMP] [GO:0006379 "mRNA
cleavage" evidence=IMP] [GO:0006369 "termination of RNA polymerase
II transcription" evidence=IMP] [GO:0031126 "snoRNA 3'-end
processing" evidence=IMP] [GO:0008380 "RNA splicing" evidence=IMP]
[GO:0034247 "snoRNA splicing" evidence=IMP] [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=IDA;IPI] [GO:0005849 "mRNA cleavage factor complex"
evidence=IPI] [GO:0004521 "endoribonuclease activity"
evidence=ISS;IMP] [GO:0003723 "RNA binding" evidence=IC]
[GO:0004519 "endonuclease activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 SGD:S000004267
Pfam:PF07521 GO:GO:0046872 GO:GO:0006378 EMBL:BK006945
GO:GO:0004521 GO:GO:0005847 GO:GO:0006379 GO:GO:0006369
eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 EMBL:U17245 HOGENOM:HOG000203394 KO:K14403
InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
GeneTree:ENSGT00700000104485 GO:GO:0031126 GO:GO:0034247
OMA:EISFAAH OrthoDB:EOG41ZJK7 PIR:S51413 RefSeq:NP_013379.1
ProteinModelPortal:Q06224 SMR:Q06224 DIP:DIP-2470N IntAct:Q06224
MINT:MINT-375457 STRING:Q06224 PaxDb:Q06224 PeptideAtlas:Q06224
EnsemblFungi:YLR277C GeneID:850983 KEGG:sce:YLR277C CYGD:YLR277c
NextBio:967501 Genevestigator:Q06224 GermOnline:YLR277C
Uniprot:Q06224
Length = 779
Score = 347 (127.2 bits), Expect = 2.1e-31, Sum P(3) = 2.1e-31
Identities = 82/282 (29%), Positives = 144/282 (51%)
Query: 116 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 175
+YH + GI AGH+LG +++I G V++ DY+R ++HLN +
Sbjct: 144 DYHSTVDVNGIKFTAFHAGHVLGAAMFQIEIAGLRVLFTGDYSREVDRHLNSAEVPPLSS 203
Query: 176 PAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWA 235
+++ + ++P + I T+ GG VLLPV + GR E++LIL++YW+
Sbjct: 204 NVLIVESTFGTATHEPRLNRERKLTQLIHSTVMRGGRVLLPVFALGRAQEIMLILDEYWS 263
Query: 236 EHS--LN---YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLIN 290
+H+ L PI++ + ++ + ++++ M D I K F S+ N F+ K+++ L N
Sbjct: 264 QHADELGGGQVPIFYASNLAKKCMSVFQTYVNMMNDDIRKKFRDSQTNPFIFKNISYLRN 323
Query: 291 KSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLAR--MLQA 348
+ + GP ++LAS L++G S D+ W + KNLVL T GT+A+ ML+
Sbjct: 324 LEDFQDF--GPSVMLASPGMLQSGLSRDLLERWCPEDKNLVLITGYSIEGTMAKFIMLEP 381
Query: 349 DPPPKA--VKVTMSRRVPLVGEELIAYEEEQTRLKKEEALKA 388
D P ++T+ RR + A+ + Q L+ E + A
Sbjct: 382 DTIPSINNPEITIPRRCQVEEISFAAHVDFQENLEFIEKISA 423
Score = 73 (30.8 bits), Expect = 2.1e-31, Sum P(3) = 2.1e-31
Identities = 16/43 (37%), Positives = 23/43 (53%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP---VYR 92
S +D +L+SH H +LPY M++ VF T P +YR
Sbjct: 59 SKVDILLISHFHLDHAASLPYVMQRTNFQGRVFMTHPTKAIYR 101
Score = 45 (20.9 bits), Expect = 2.1e-31, Sum P(3) = 2.1e-31
Identities = 12/49 (24%), Positives = 22/49 (44%)
Query: 461 DDFGEVINPDDYIIKDEDMDQAAMHIGGDDGKLDEGSASLI-LDAKPSK 508
D F +N D+Y E+ + IG K+D + ++ ++ P K
Sbjct: 713 DCFTLFLNKDEYASNKEETITGVVTIGKSTAKIDFNNMKILECNSNPLK 761
>DICTYBASE|DDB_G0278189
symbol:ints11 "integrator complex subunit 11"
species:44689 "Dictyostelium discoideum" [GO:0016787 "hydrolase
activity" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] [GO:0044351 "macropinocytosis"
evidence=RCA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
dictyBase:DDB_G0278189 Pfam:PF07521 GO:GO:0005634 GO:GO:0005737
GenomeReviews:CM000152_GR EMBL:AAFI02000023 GO:GO:0016787
eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 KO:K13148 RefSeq:XP_642189.1
ProteinModelPortal:Q54YL3 PRIDE:Q54YL3 EnsemblProtists:DDB0234100
GeneID:8621396 KEGG:ddi:DDB_G0278189 OMA:RTIANET
ProtClustDB:CLSZ2729107 Uniprot:Q54YL3
Length = 744
Score = 324 (119.1 bits), Expect = 3.1e-31, Sum P(2) = 3.1e-31
Identities = 93/324 (28%), Positives = 155/324 (47%)
Query: 41 DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLG--LLTM 98
D S + + ID V+++H H GALP+ + G P++ T P + LL
Sbjct: 46 DFSYISKNGQFTKVIDCVIITHFHLDHCGALPFFTEMCGYDGPIYMTLPTKAICPILLED 105
Query: 99 YDQY-LSRRSVTRLTYSQ------------NYHLSGK-GEGIVVAPHVAGHLLGGTVWKI 144
Y + + ++ T +Q N H + K E + + + AGH+LG ++
Sbjct: 106 YRKITVEKKGETNFFTAQMIKDCMKKVIPVNLHQTIKVDEELSIKAYYAGHVLGAAMFYA 165
Query: 145 TKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAI 203
E V+Y DYN ++HL ++ V+P VLIT+ A + ++ RE F I
Sbjct: 166 KVGDESVVYTGDYNMTPDRHLGSAWIDQ-VKPDVLITETTYATTIRDSKRGRERDFLKRI 224
Query: 204 SKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNY-PIYFLTYVSSSTIDYVKSFLE 262
+ + GG VL+PV + GRV EL ++++ YW + +L + PIYF ++ Y K F+
Sbjct: 225 HECVEKGGKVLIPVFALGRVQELCILIDSYWEQMNLGHIPIYFSAGLAEKANLYYKLFIN 284
Query: 263 WMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVE 322
W I ++F + N F KH+ +S L +AP G ++ A+ L AG S ++F +
Sbjct: 285 WTNQKIKQTFV--KRNMFDFKHIKPF--QSHLVDAP-GAMVLFATPGMLHAGASLEVFKK 339
Query: 323 WASDVKNLVLFTERGQFGTLARML 346
WA + N+ + GT+ L
Sbjct: 340 WAPNELNMTIIPGYCVVGTVGNKL 363
Score = 99 (39.9 bits), Expect = 3.1e-31, Sum P(2) = 3.1e-31
Identities = 26/116 (22%), Positives = 53/116 (45%)
Query: 510 VSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCLKHV 569
+ + T++VKC + + + AD + I ++ P ++LVHG E L Q +K +
Sbjct: 383 IDKKTTIEVKCKIHNLSFSAHADAKGILQLIKMSNPRNVILVHGEKEKMGFLSQKIIKEM 442
Query: 570 CPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEVGKTEN 625
+ Y P TI + + + + +S N+L +++ DY + + + N
Sbjct: 443 GVNCYYPANGVTI-IIDTMKSIPIDIS----LNLLKRQILDYSYQYNNNNLNNFNN 493
Score = 99 (39.9 bits), Expect = 1.3e-06, Sum P(2) = 1.3e-06
Identities = 27/93 (29%), Positives = 43/93 (46%)
Query: 4 SVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGW----ND--HF-DPSLLQPLSKVASTID 56
+++V PL + +V+I N + DCG ND F D S + + ID
Sbjct: 2 TIKVVPLGAGQDVGRSCVIVTIGNKNIMFDCGMHMGMNDARRFPDFSYISKNGQFTKVID 61
Query: 57 AVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP 89
V+++H H GALP+ + G P++ T P
Sbjct: 62 CVIITHFHLDHCGALPFFTEMCGYDGPIYMTLP 94
>ZFIN|ZDB-GENE-030131-3275
symbol:cpsf3 "cleavage and polyadenylation
specific factor 3" species:7955 "Danio rerio" [GO:0016787
"hydrolase activity" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 ZFIN:ZDB-GENE-030131-3275 GO:GO:0016787
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
KO:K14403 InterPro:IPR021718 Pfam:PF11718 SMART:SM01098 CTD:51692
HOVERGEN:HBG051107 EMBL:AY648793 IPI:IPI00509063
RefSeq:NP_001003836.1 UniGene:Dr.77231 ProteinModelPortal:Q6DRG6
SMR:Q6DRG6 STRING:Q6DRG6 GeneID:324554 KEGG:dre:324554
NextBio:20808833 ArrayExpress:Q6DRG6 Uniprot:Q6DRG6
Length = 690
Score = 372 (136.0 bits), Expect = 3.3e-31, P = 3.3e-31
Identities = 104/390 (26%), Positives = 198/390 (50%)
Query: 22 LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
++ G ++DCG + + P + + ID +L+SH H GALP+ +++
Sbjct: 36 ILEFKGRKIMVDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 95
Query: 80 LSAPVF---STEPVYRLGLLTMYDQYLSRRSVTRLTYSQ-------------NYHLSGKG 123
F +T+ +YR LL+ Y + +S S + Y++ N+H +
Sbjct: 96 FKGRTFMTHATKAIYRW-LLSDYVK-VSNISADDMLYTETDLEESMDKIETINFHEVKEV 153
Query: 124 EGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDA 183
GI + AGH+LG ++ I G ++Y D++R++++HL + S V+P +LIT++
Sbjct: 154 AGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPS-VKPDILITES 212
Query: 184 YNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEH-SLN- 240
H R++RE F + + + G L+PV + GR ELLLIL++YW H L+
Sbjct: 213 TYGTHIHEKREEREARFCNTVHDIVNREGRCLIPVFALGRAQELLLILDEYWQNHPELHD 272
Query: 241 YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPD- 299
PIY+ + ++ + ++++ M D I K+ + N F+ KH++ N +D+ D
Sbjct: 273 IPIYYASSLAKKCMAVYQTYVNAMNDKIRKAININ--NPFVFKHIS---NLKSMDHFDDI 327
Query: 300 GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTM 359
GP +V+AS +++G S ++F W +D +N V+ GTLA+ + ++P + +
Sbjct: 328 GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP--EEITTMS 385
Query: 360 SRRVPL-VGEELIAYEEEQTRLKKEEALKA 388
+++PL + + I++ + E ++A
Sbjct: 386 GQKLPLKMSVDYISFSAHTDYQQTSEFIRA 415
>FB|FBgn0261065
symbol:Cpsf73 "Cleavage and polyadenylation specificity
factor 73" species:7227 "Drosophila melanogaster" [GO:0006379 "mRNA
cleavage" evidence=ISS] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=ISS;NAS]
[GO:0006378 "mRNA polyadenylation" evidence=ISS;IMP] [GO:0016787
"hydrolase activity" evidence=IEA] [GO:0006398 "histone mRNA 3'-end
processing" evidence=IMP] [GO:0003677 "DNA binding" evidence=IDA]
[GO:0022008 "neurogenesis" evidence=IMP] InterPro:IPR001279
Pfam:PF00753 SMART:SM00849 Pfam:PF07521 EMBL:AE014297 GO:GO:0022008
GO:GO:0003677 GO:GO:0006378 GO:GO:0016787 GO:GO:0005847
GO:GO:0006379 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
Pfam:PF10996 SMART:SM01027 GO:GO:0006398 KO:K14403 OMA:YVSFSAH
InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
GeneTree:ENSGT00700000104485 EMBL:AY119128 RefSeq:NP_650738.1
UniGene:Dm.13714 SMR:Q9VE51 IntAct:Q9VE51 MINT:MINT-804945
STRING:Q9VE51 EnsemblMetazoa:FBtr0083690 GeneID:42240
KEGG:dme:Dmel_CG7698 UCSC:CG7698-RA CTD:42240 FlyBase:FBgn0261065
InParanoid:Q9VE51 OrthoDB:EOG4P5HR4 GenomeRNAi:42240 NextBio:827838
Uniprot:Q9VE51
Length = 684
Score = 369 (135.0 bits), Expect = 6.8e-31, P = 6.8e-31
Identities = 106/426 (24%), Positives = 208/426 (48%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLLSH 62
+Q+ PL ++ G ++DCG + P + A ID + +SH
Sbjct: 18 LQIKPLGAGQEVGRSCIMLEFKGKKIMLDCGIHPGLSGMDALPYVDLIEADEIDLLFISH 77
Query: 63 PDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGLLTMYDQYLSRRSVTRLTYSQ---- 115
H GALP+ + + F +T+ +YR +L+ Y + +S S ++ Y++
Sbjct: 78 FHLDHCGALPWFLMKTSFKGRCFMTHATKAIYRW-MLSDYIK-ISNISTEQMLYTEADLE 135
Query: 116 ---------NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLN 166
N+H G+ ++AGH+LG ++ I G ++Y D++R++++HL
Sbjct: 136 ASMEKIETINFHEERDVMGVRFCAYIAGHVLGAAMFMIEIAGIKILYTGDFSRQEDRHLM 195
Query: 167 GTVLESFVRPAVLITDAYNALHNQPPRQQRE-MFQDAISKTLRAGGNVLLPVDSAGRVLE 225
+ ++P VLIT++ H R+ RE F + K ++ GG L+PV + GR E
Sbjct: 196 AAEVPP-MKPDVLITESTYGTHIHEKREDRENRFTSLVQKIVQQGGRCLIPVFALGRAQE 254
Query: 226 LLLILEDYWAEH-SLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLK 283
LLLIL+++W+++ L+ PIY+ + ++ + ++++ M D I + + N F+ +
Sbjct: 255 LLLILDEFWSQNPDLHEIPIYYASSLAKKCMAVYQTYINAMNDRIRRQIAVN--NPFVFR 312
Query: 284 HVTLLINKSELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTL 342
H++ N +D+ D GP +++AS +++G S ++F W +D KN V+ GTL
Sbjct: 313 HIS---NLKGIDHFEDIGPCVIMASPGMMQSGLSRELFESWCTDPKNGVIIAGYCVEGTL 369
Query: 343 ARMLQADPPPKAVKVTMSRRVPL-VGEELIAYEEEQTRLKKEEALKASLVKEEESKASLG 401
A+ + ++P + + +++PL + + I++ + E ++ L+K G
Sbjct: 370 AKAVLSEP--EEITTLSGQKLPLNMSVDYISFSAHTDYQQTSEFIR--LLKPTHVVLVHG 425
Query: 402 PDNNLS 407
N +S
Sbjct: 426 EQNEMS 431
>ZFIN|ZDB-GENE-050522-13
symbol:cpsf3l "cleavage and polyadenylation specific
factor 3-like" species:7955 "Danio rerio" [GO:0016787 "hydrolase
activity" evidence=IEA] [GO:0043484 "regulation of RNA splicing"
evidence=IMP] [GO:0005634 "nucleus" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
ZFIN:ZDB-GENE-050522-13 GO:GO:0016787 GO:GO:0043484
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
OMA:MAVEYMS GeneTree:ENSGT00700000104485 EMBL:CABZ01054885
EMBL:CR846089 IPI:IPI00865509 Ensembl:ENSDART00000102902
Uniprot:E7EXW1
Length = 601
Score = 246 (91.7 bits), Expect = 8.3e-31, Sum P(3) = 8.3e-31
Identities = 69/212 (32%), Positives = 107/212 (50%)
Query: 128 VAPHVAGHLLGGTVWKITKDGEDVIYAVD----YNR--RKEKHLNGTVLESFVRPAVLIT 181
+ + AGH+LG + + V+Y V Y+ L ++ RP +LI+
Sbjct: 150 IKAYYAGHVLGAAM---VQSRFRVVYTVSVSYTYSNLMTPASDLRAAWIDK-CRPDILIS 205
Query: 182 DAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLN 240
++ A + ++ RE F + +T+ GG VL+PV + GR EL ++LE +W +L
Sbjct: 206 ESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLK 265
Query: 241 YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDG 300
PIYF T ++ Y K F+ W I K+F R N F KH+ ++S DN P G
Sbjct: 266 APIYFSTGLTEKANHYYKLFITWTNQKIRKTF-VQR-NMFEFKHIKAF-DRSYADN-P-G 320
Query: 301 PKLVLASMASLEAGFSHDIFVEWASDVKNLVL 332
P +V A+ L AG S IF +WA + KN+V+
Sbjct: 321 PMVVFATPGMLHAGQSLQIFKKWAGNEKNMVI 352
Score = 129 (50.5 bits), Expect = 8.3e-31, Sum P(3) = 8.3e-31
Identities = 32/92 (34%), Positives = 48/92 (52%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG +ND F D S + ++ +D
Sbjct: 4 IKVTPLGAGQDVGRSCILVSIGGKNIMLDCGMHMGFNDDRRFPDFSYITQNGRLTEFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEP 89
V++SH H GALPY + +G P++ T P
Sbjct: 64 VIISHFHLDHCGALPYMSEMVGYDGPIYMTHP 95
Score = 88 (36.0 bits), Expect = 8.3e-31, Sum P(3) = 8.3e-31
Identities = 30/128 (23%), Positives = 56/128 (43%)
Query: 501 ILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEH 560
IL+ + + T+ VK + ++ + AD + I ++ P ++LVHG A+ E
Sbjct: 366 ILNGQKKLEMEGRATLDVKLQVEYMSFSAHADAKGIMQLIRMAEPRNMLLVHGEAKKMEF 425
Query: 561 LKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDAEV 620
LK + + P ET + ++ + V +S L+ + LG DA+
Sbjct: 426 LKDKIEQEFSISCFMPANGETTTIVTNP-SVPVDISLNLLKREM--ALGG---PLPDAKK 479
Query: 621 GKTENGML 628
+T +G L
Sbjct: 480 PRTMHGTL 487
Score = 39 (18.8 bits), Expect = 9.7e-26, Sum P(3) = 9.7e-26
Identities = 11/47 (23%), Positives = 23/47 (48%)
Query: 499 SLILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAP 545
++I+++ KV S+ +K +L+ Y+ G + T+L P
Sbjct: 554 TVIVESIVIKVTSSAEEPNLKVILLSWSYQDEELGSFLSTLLKKGLP 600
>UNIPROTKB|P79101
symbol:CPSF3 "Cleavage and polyadenylation specificity
factor subunit 3" species:9913 "Bos taurus" [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=ISS] [GO:0006398 "histone mRNA 3'-end processing"
evidence=ISS] [GO:0003723 "RNA binding" evidence=ISS] [GO:0004521
"endoribonuclease activity" evidence=ISS] [GO:0008409 "5'-3'
exonuclease activity" evidence=ISS] [GO:0046872 "metal ion binding"
evidence=IEA] [GO:0030529 "ribonucleoprotein complex" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0046872 GO:GO:0003723 GO:GO:0030529 GO:GO:0004521
GO:GO:0005847 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
Pfam:PF10996 SMART:SM01027 GO:GO:0006398 HOGENOM:HOG000203394
KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
EMBL:X95906 EMBL:BC104553 IPI:IPI00708839 RefSeq:NP_776709.1
UniGene:Bt.5045 ProteinModelPortal:P79101 SMR:P79101 STRING:P79101
PRIDE:P79101 Ensembl:ENSBTAT00000026303 GeneID:281712
KEGG:bta:281712 CTD:51692 GeneTree:ENSGT00700000104485
HOVERGEN:HBG051107 InParanoid:P79101 OrthoDB:EOG4FN4H6
NextBio:20805634 ArrayExpress:P79101 GO:GO:0008409 Uniprot:P79101
Length = 684
Score = 366 (133.9 bits), Expect = 1.5e-30, P = 1.5e-30
Identities = 102/390 (26%), Positives = 197/390 (50%)
Query: 22 LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
++ G ++DCG + + P + + ID +L+SH H GALP+ +++
Sbjct: 29 ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88
Query: 80 LSAPVF---STEPVYRLGLLTMYDQYLSRRSVTRLTYSQ-------------NYHLSGKG 123
F +T+ +YR LL+ Y + +S S + Y++ N+H +
Sbjct: 89 FKGRTFMTHATKAIYRW-LLSDYVK-VSNISADDMLYTETDLEESMDKIETINFHEVKEV 146
Query: 124 EGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDA 183
GI + AGH+LG ++ I G ++Y D++R++++HL + + ++P +LI ++
Sbjct: 147 AGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPDILIIES 205
Query: 184 YNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEH-SLN- 240
H R++RE F + + + GG L+PV + GR ELLLIL++YW H L+
Sbjct: 206 TYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHD 265
Query: 241 YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPD- 299
PIY+ + ++ + ++++ M D I K + N F+ KH++ N +D+ D
Sbjct: 266 IPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININ--NPFVFKHIS---NLKSMDHFDDI 320
Query: 300 GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTM 359
GP +V+AS +++G S ++F W +D +N V+ GTLA+ + ++P + +
Sbjct: 321 GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP--EEITTMS 378
Query: 360 SRRVPL-VGEELIAYEEEQTRLKKEEALKA 388
+++PL + + I++ + E ++A
Sbjct: 379 GQKLPLKMSVDYISFSAHTDYQQTSEFIRA 408
>UNIPROTKB|Q9UKF6
symbol:CPSF3 "Cleavage and polyadenylation specificity
factor subunit 3" species:9606 "Homo sapiens" [GO:0030529
"ribonucleoprotein complex" evidence=IEA] [GO:0046872 "metal ion
binding" evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
[GO:0008409 "5'-3' exonuclease activity" evidence=ISS] [GO:0004521
"endoribonuclease activity" evidence=ISS] [GO:0003723 "RNA binding"
evidence=ISS] [GO:0006398 "histone mRNA 3'-end processing"
evidence=IDA] [GO:0005847 "mRNA cleavage and polyadenylation
specificity factor complex" evidence=IDA] [GO:0006378 "mRNA
polyadenylation" evidence=TAS] [GO:0006379 "mRNA cleavage"
evidence=TAS] [GO:0000398 "mRNA splicing, via spliceosome"
evidence=TAS] [GO:0005654 "nucleoplasm" evidence=TAS] [GO:0006366
"transcription from RNA polymerase II promoter" evidence=TAS]
[GO:0006369 "termination of RNA polymerase II transcription"
evidence=TAS] [GO:0006397 "mRNA processing" evidence=TAS]
[GO:0006406 "mRNA export from nucleus" evidence=TAS] [GO:0008380
"RNA splicing" evidence=TAS] [GO:0010467 "gene expression"
evidence=TAS] [GO:0031124 "mRNA 3'-end processing" evidence=TAS]
Reactome:REACT_71 InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
EMBL:AF017269 Pfam:PF07521 EMBL:AF171877 EMBL:CH471053
GO:GO:0046872 Reactome:REACT_1675 GO:GO:0006378 GO:GO:0003723
GO:GO:0030529 GO:GO:0006406 GO:GO:0004521 GO:GO:0000398
Reactome:REACT_1788 GO:GO:0005847 GO:GO:0006379 GO:GO:0006369
eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 Reactome:REACT_78 GO:GO:0006398 HOGENOM:HOG000203394
KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
CTD:51692 HOVERGEN:HBG051107 OrthoDB:EOG4FN4H6 GO:GO:0008409
EMBL:AC080162 EMBL:BC011654 EMBL:BC020211 IPI:IPI00007818
RefSeq:NP_057291.1 UniGene:Hs.515972 PDB:2I7T PDB:2I7V PDBsum:2I7T
PDBsum:2I7V ProteinModelPortal:Q9UKF6 SMR:Q9UKF6 DIP:DIP-42501N
MINT:MINT-1742891 STRING:Q9UKF6 PhosphoSite:Q9UKF6 DMDM:18203503
PaxDb:Q9UKF6 PeptideAtlas:Q9UKF6 PRIDE:Q9UKF6 DNASU:51692
Ensembl:ENST00000238112 GeneID:51692 KEGG:hsa:51692 UCSC:uc002qzo.1
GeneCards:GC02P009514 HGNC:HGNC:2326 HPA:HPA034657 MIM:606029
neXtProt:NX_Q9UKF6 PharmGKB:PA26843 InParanoid:Q9UKF6
PhylomeDB:Q9UKF6 ChiTaRS:CPSF3 EvolutionaryTrace:Q9UKF6
GenomeRNAi:51692 NextBio:55702 ArrayExpress:Q9UKF6 Bgee:Q9UKF6
CleanEx:HS_CPSF3 Genevestigator:Q9UKF6 GermOnline:ENSG00000119203
Uniprot:Q9UKF6
Length = 684
Score = 366 (133.9 bits), Expect = 1.5e-30, P = 1.5e-30
Identities = 102/390 (26%), Positives = 197/390 (50%)
Query: 22 LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
++ G ++DCG + + P + + ID +L+SH H GALP+ +++
Sbjct: 29 ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88
Query: 80 LSAPVF---STEPVYRLGLLTMYDQYLSRRSVTRLTYSQ-------------NYHLSGKG 123
F +T+ +YR LL+ Y + +S S + Y++ N+H +
Sbjct: 89 FKGRTFMTHATKAIYRW-LLSDYVK-VSNISADDMLYTETDLEESMDKIETINFHEVKEV 146
Query: 124 EGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDA 183
GI + AGH+LG ++ I G ++Y D++R++++HL + + ++P +LI ++
Sbjct: 147 AGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPDILIIES 205
Query: 184 YNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEH-SLN- 240
H R++RE F + + + GG L+PV + GR ELLLIL++YW H L+
Sbjct: 206 TYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHD 265
Query: 241 YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPD- 299
PIY+ + ++ + ++++ M D I K + N F+ KH++ N +D+ D
Sbjct: 266 IPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININ--NPFVFKHIS---NLKSMDHFDDI 320
Query: 300 GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTM 359
GP +V+AS +++G S ++F W +D +N V+ GTLA+ + ++P + +
Sbjct: 321 GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP--EEITTMS 378
Query: 360 SRRVPL-VGEELIAYEEEQTRLKKEEALKA 388
+++PL + + I++ + E ++A
Sbjct: 379 GQKLPLKMSVDYISFSAHTDYQQTSEFIRA 408
>UNIPROTKB|F1NKW5
symbol:CPSF3 "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0003723 "RNA binding" evidence=IEA] [GO:0004521
"endoribonuclease activity" evidence=IEA] [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=IEA] [GO:0006398 "histone mRNA 3'-end processing"
evidence=IEA] [GO:0008409 "5'-3' exonuclease activity"
evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 GO:GO:0003723 GO:GO:0004521 GO:GO:0005847
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
GeneTree:ENSGT00700000104485 GO:GO:0008409 EMBL:AADN02018718
IPI:IPI00600642 Ensembl:ENSGALT00000026493 Uniprot:F1NKW5
Length = 685
Score = 366 (133.9 bits), Expect = 1.5e-30, P = 1.5e-30
Identities = 102/390 (26%), Positives = 197/390 (50%)
Query: 22 LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
++ G ++DCG + + P + + ID +L+SH H GALP+ +++
Sbjct: 29 ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88
Query: 80 LSAPVF---STEPVYRLGLLTMYDQYLSRRSVTRLTYSQ-------------NYHLSGKG 123
F +T+ +YR LL+ Y + +S S + Y++ N+H +
Sbjct: 89 FKGRTFMTHATKAIYRW-LLSDYVK-VSNISADDMLYTETDLEESMDKIETINFHEVKEV 146
Query: 124 EGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDA 183
GI + AGH+LG ++ I G ++Y D++R++++HL + + ++P +LI ++
Sbjct: 147 AGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPDILIIES 205
Query: 184 YNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEH-SLN- 240
H R++RE F + + + GG L+PV + GR ELLLIL++YW H L+
Sbjct: 206 TYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHD 265
Query: 241 YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPD- 299
PIY+ + ++ + ++++ M D I K + N F+ KH++ N +D+ D
Sbjct: 266 IPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININ--NPFVFKHIS---NLKSMDHFDDI 320
Query: 300 GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTM 359
GP +V+AS +++G S ++F W +D +N V+ GTLA+ + ++P + +
Sbjct: 321 GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP--EEITTMS 378
Query: 360 SRRVPL-VGEELIAYEEEQTRLKKEEALKA 388
+++PL + + I++ + E ++A
Sbjct: 379 GQKLPLKMSVDYISFSAHTDYQQTSEFIRA 408
>UNIPROTKB|E2R7R2
symbol:CPSF3 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718
SMART:SM01098 GeneTree:ENSGT00700000104485 EMBL:AAEX03010701
RefSeq:XP_003639652.1 Ensembl:ENSCAFT00000005417 GeneID:100856414
KEGG:cfa:100856414 Uniprot:E2R7R2
Length = 717
Score = 366 (133.9 bits), Expect = 1.7e-30, P = 1.7e-30
Identities = 102/390 (26%), Positives = 197/390 (50%)
Query: 22 LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
++ G ++DCG + + P + + ID +L+SH H GALP+ +++
Sbjct: 62 ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 121
Query: 80 LSAPVF---STEPVYRLGLLTMYDQYLSRRSVTRLTYSQ-------------NYHLSGKG 123
F +T+ +YR LL+ Y + +S S + Y++ N+H +
Sbjct: 122 FKGRTFMTHATKAIYRW-LLSDYVK-VSNISADDMLYTETDLEESMDKIETINFHEVKEV 179
Query: 124 EGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDA 183
GI + AGH+LG ++ I G ++Y D++R++++HL + + ++P +LI ++
Sbjct: 180 AGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPDILIIES 238
Query: 184 YNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEH-SLN- 240
H R++RE F + + + GG L+PV + GR ELLLIL++YW H L+
Sbjct: 239 TYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHD 298
Query: 241 YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPD- 299
PIY+ + ++ + ++++ M D I K + N F+ KH++ N +D+ D
Sbjct: 299 IPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININ--NPFVFKHIS---NLKSMDHFDDI 353
Query: 300 GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTM 359
GP +V+AS +++G S ++F W +D +N V+ GTLA+ + ++P + +
Sbjct: 354 GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP--EEITTMS 411
Query: 360 SRRVPL-VGEELIAYEEEQTRLKKEEALKA 388
+++PL + + I++ + E ++A
Sbjct: 412 GQKLPLKMSVDYISFSAHTDYQQTSEFIRA 441
>UNIPROTKB|H0YJF4
symbol:CPSF2 "Cleavage and polyadenylation-specificity
factor subunit 2" species:9606 "Homo sapiens" [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=IEA] [GO:0006378 "mRNA polyadenylation" evidence=IEA]
[GO:0006379 "mRNA cleavage" evidence=IEA] InterPro:IPR027075
Pfam:PF07521 InterPro:IPR025069 InterPro:IPR011108
PANTHER:PTHR11203:SF5 Pfam:PF13299 HGNC:HGNC:2325 ChiTaRS:CPSF2
EMBL:AL121773 Ensembl:ENST00000555244 Uniprot:H0YJF4
Length = 269
Score = 221 (82.9 bits), Expect = 3.0e-30, Sum P(3) = 3.0e-30
Identities = 46/119 (38%), Positives = 75/119 (63%)
Query: 518 VKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCL----KHVCPHV 573
+K + +IDYEGR+DG SIK I++ + P +L++VHG EA++ L + C K + V
Sbjct: 48 IKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPEASQDLAECCRAFGGKDI--KV 105
Query: 574 YTPQIEETIDVTSDLCAYKVQLSEKLMSNVLFKKLGDYEIAWVDA----EVGKTENGML 628
Y P++ ET+D TS+ Y+V+L + L+S++ F K D E+AW+D V K + G++
Sbjct: 106 YMPKLHETVDATSETHIYQVRLKDSLVSSLQFCKAKDAELAWIDGVLDMRVSKVDTGVI 164
Score = 105 (42.0 bits), Expect = 3.0e-30, Sum P(3) = 3.0e-30
Identities = 24/64 (37%), Positives = 35/64 (54%)
Query: 624 ENGMLS-LLPISTPAPPHK-----SVLVGDLKMADLKPFLSSKGIQVEFAGGALRCGEYV 677
E G S ++P P PPH+ SV + + +++D K L +GIQ EF GG L C V
Sbjct: 206 ETGEESEIIPTLEPLPPHEVPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVCNNQV 265
Query: 678 TIRK 681
+R+
Sbjct: 266 AVRR 269
Score = 64 (27.6 bits), Expect = 3.0e-30, Sum P(3) = 3.0e-30
Identities = 10/19 (52%), Positives = 14/19 (73%)
Query: 449 PMFPFYENNSEWDDFGEVI 467
PMFP E +WD++GE+I
Sbjct: 30 PMFPAPEERIKWDEYGEII 48
>MGI|MGI:1859328
symbol:Cpsf3 "cleavage and polyadenylation specificity
factor 3" species:10090 "Mus musculus" [GO:0003723 "RNA binding"
evidence=IDA] [GO:0003729 "mRNA binding" evidence=ISO] [GO:0004518
"nuclease activity" evidence=IEA] [GO:0004519 "endonuclease
activity" evidence=IEA] [GO:0004521 "endoribonuclease activity"
evidence=IDA] [GO:0005634 "nucleus" evidence=IEA] [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=ISO;IDA] [GO:0006378 "mRNA polyadenylation" evidence=ISO]
[GO:0006397 "mRNA processing" evidence=IEA] [GO:0006398 "histone
mRNA 3'-end processing" evidence=ISO;IDA] [GO:0008409 "5'-3'
exonuclease activity" evidence=IDA] [GO:0016787 "hydrolase
activity" evidence=IEA] [GO:0030529 "ribonucleoprotein complex"
evidence=IEA] [GO:0046872 "metal ion binding" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
MGI:MGI:1859328 GO:GO:0046872 GO:GO:0003723 GO:GO:0030529
GO:GO:0004521 GO:GO:0005847 eggNOG:COG1236 InterPro:IPR022712
InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 GO:GO:0006398
HOGENOM:HOG000203394 KO:K14403 OMA:YVSFSAH InterPro:IPR021718
Pfam:PF11718 SMART:SM01098 CTD:51692 GeneTree:ENSGT00700000104485
HOVERGEN:HBG051107 OrthoDB:EOG4FN4H6 GO:GO:0008409 ChiTaRS:CPSF3
EMBL:AF203969 EMBL:BC023297 IPI:IPI00135099 RefSeq:NP_061283.2
UniGene:Mm.356778 ProteinModelPortal:Q9QXK7 SMR:Q9QXK7
STRING:Q9QXK7 PhosphoSite:Q9QXK7 PaxDb:Q9QXK7 PRIDE:Q9QXK7
Ensembl:ENSMUST00000067284 GeneID:54451 KEGG:mmu:54451
InParanoid:Q8CIM0 NextBio:311332 Bgee:Q9QXK7 CleanEx:MM_CPSF3
Genevestigator:Q9QXK7 GermOnline:ENSMUSG00000054309 Uniprot:Q9QXK7
Length = 684
Score = 363 (132.8 bits), Expect = 3.2e-30, P = 3.2e-30
Identities = 102/390 (26%), Positives = 196/390 (50%)
Query: 22 LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
++ G ++DCG + + P + + ID +L+SH H GALP+ +++
Sbjct: 29 ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88
Query: 80 LSAPVF---STEPVYRLGLLTMYDQYLSRRSVTRLTYSQ-------------NYHLSGKG 123
F +T+ +YR LL+ Y + +S S + Y++ N+H +
Sbjct: 89 FKGRTFMTHATKAIYRW-LLSDYVK-VSNISADDMLYTETDLEESMDKIETINFHEVKEV 146
Query: 124 EGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDA 183
GI + AGH+LG ++ I G ++Y D++R++++HL + + ++P +LI ++
Sbjct: 147 AGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPDILIIES 205
Query: 184 YNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEH-SLN- 240
H R++RE F + + + GG L+PV + GR ELLLIL++YW H L+
Sbjct: 206 TYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHD 265
Query: 241 YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPD- 299
PIY+ + ++ + ++++ M D I K + N F+ KH++ N +D+ D
Sbjct: 266 IPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININ--NPFVFKHIS---NLKSMDHFDDI 320
Query: 300 GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTM 359
GP +V+AS ++ G S ++F W +D +N V+ GTLA+ + ++P + +
Sbjct: 321 GPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP--EEITTMS 378
Query: 360 SRRVPL-VGEELIAYEEEQTRLKKEEALKA 388
+++PL + + I++ + E ++A
Sbjct: 379 GQKLPLKMSVDYISFSAHTDYQQTSEFIRA 408
>RGD|1305767
symbol:Cpsf3 "cleavage and polyadenylation specific factor 3,
73kDa" species:10116 "Rattus norvegicus" [GO:0003723 "RNA binding"
evidence=ISO] [GO:0004521 "endoribonuclease activity" evidence=ISO]
[GO:0005847 "mRNA cleavage and polyadenylation specificity factor
complex" evidence=ISO] [GO:0006398 "histone mRNA 3'-end processing"
evidence=ISO] [GO:0008409 "5'-3' exonuclease activity"
evidence=ISO] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 RGD:1305767 GO:GO:0016787 eggNOG:COG1236
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
HOGENOM:HOG000203394 KO:K14403 InterPro:IPR021718 Pfam:PF11718
SMART:SM01098 CTD:51692 HOVERGEN:HBG051107 OrthoDB:EOG4FN4H6
UniGene:Rn.100522 EMBL:BC099817 IPI:IPI00365532
RefSeq:NP_001025201.1 ProteinModelPortal:Q499P4 SMR:Q499P4
STRING:Q499P4 GeneID:298916 KEGG:rno:298916 InParanoid:Q499P4
NextBio:644507 Genevestigator:Q499P4 Uniprot:Q499P4
Length = 685
Score = 363 (132.8 bits), Expect = 3.2e-30, P = 3.2e-30
Identities = 102/390 (26%), Positives = 196/390 (50%)
Query: 22 LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
++ G ++DCG + + P + + ID +L+SH H GALP+ +++
Sbjct: 29 ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88
Query: 80 LSAPVF---STEPVYRLGLLTMYDQYLSRRSVTRLTYSQ-------------NYHLSGKG 123
F +T+ +YR LL+ Y + +S S + Y++ N+H +
Sbjct: 89 FKGRTFMTHATKAIYRW-LLSDYVK-VSNISADDMLYTETDLEESMDKIETINFHEVKEV 146
Query: 124 EGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDA 183
GI + AGH+LG ++ I G ++Y D++R++++HL + + ++P +LI ++
Sbjct: 147 AGIKFWCYHAGHVLGAAMFMIEIAGMKLLYTGDFSRQEDRHLMAAEIPN-IKPDILIIES 205
Query: 184 YNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEH-SLN- 240
H R++RE F + + + GG L+PV + GR ELLLIL++YW H L+
Sbjct: 206 TYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHD 265
Query: 241 YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPD- 299
PIY+ + ++ + ++++ M D I K + N F+ KH++ N +D+ D
Sbjct: 266 IPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININ--NPFVFKHIS---NLKSMDHFDDI 320
Query: 300 GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTM 359
GP +V+AS ++ G S ++F W +D +N V+ GTLA+ + ++P + +
Sbjct: 321 GPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP--EEITTMS 378
Query: 360 SRRVPL-VGEELIAYEEEQTRLKKEEALKA 388
+++PL + + I++ + E ++A
Sbjct: 379 GQKLPLKMSVDYISFSAHTDYQQTSEFIRA 408
>UNIPROTKB|G3V6W7
symbol:Cpsf3 "Protein Cpsf3" species:10116 "Rattus
norvegicus" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
RGD:1305767 GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108
Pfam:PF10996 SMART:SM01027 EMBL:CH473947 OMA:YVSFSAH
InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
GeneTree:ENSGT00700000104485 UniGene:Rn.100522
Ensembl:ENSRNOT00000009652 Uniprot:G3V6W7
Length = 685
Score = 363 (132.8 bits), Expect = 3.2e-30, P = 3.2e-30
Identities = 102/390 (26%), Positives = 196/390 (50%)
Query: 22 LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
++ G ++DCG + + P + + ID +L+SH H GALP+ +++
Sbjct: 29 ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88
Query: 80 LSAPVF---STEPVYRLGLLTMYDQYLSRRSVTRLTYSQ-------------NYHLSGKG 123
F +T+ +YR LL+ Y + +S S + Y++ N+H +
Sbjct: 89 FKGRTFMTHATKAIYRW-LLSDYVK-VSNISADDMLYTETDLEESMDKIETINFHEVKEV 146
Query: 124 EGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDA 183
GI + AGH+LG ++ I G ++Y D++R++++HL + + ++P +LI ++
Sbjct: 147 AGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPDILIIES 205
Query: 184 YNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEH-SLN- 240
H R++RE F + + + GG L+PV + GR ELLLIL++YW H L+
Sbjct: 206 TYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHD 265
Query: 241 YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPD- 299
PIY+ + ++ + ++++ M D I K + N F+ KH++ N +D+ D
Sbjct: 266 IPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININ--NPFVFKHIS---NLKSMDHFDDI 320
Query: 300 GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTM 359
GP +V+AS ++ G S ++F W +D +N V+ GTLA+ + ++P + +
Sbjct: 321 GPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP--EEITTMS 378
Query: 360 SRRVPL-VGEELIAYEEEQTRLKKEEALKA 388
+++PL + + I++ + E ++A
Sbjct: 379 GQKLPLKMSVDYISFSAHTDYQQTSEFIRA 408
>UNIPROTKB|G5E9W3
symbol:CPSF3 "Cleavage and polyadenylation-specificity
factor subunit 3" species:9606 "Homo sapiens" [GO:0003723 "RNA
binding" evidence=IEA] [GO:0004521 "endoribonuclease activity"
evidence=IEA] [GO:0008409 "5'-3' exonuclease activity"
evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 EMBL:CH471053 GO:GO:0003723 GO:GO:0004521
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
InterPro:IPR021718 Pfam:PF11718 SMART:SM01098 GO:GO:0008409
EMBL:AC080162 UniGene:Hs.515972 HGNC:HGNC:2326 ChiTaRS:CPSF3
ProteinModelPortal:G5E9W3 SMR:G5E9W3 PRIDE:G5E9W3
Ensembl:ENST00000460593 ArrayExpress:G5E9W3 Bgee:G5E9W3
Uniprot:G5E9W3
Length = 647
Score = 361 (132.1 bits), Expect = 4.4e-30, P = 4.4e-30
Identities = 101/381 (26%), Positives = 194/381 (50%)
Query: 31 LIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF--- 85
++DCG + + P + + ID +L+SH H GALP+ +++ F
Sbjct: 1 MLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTSFKGRTFMTH 60
Query: 86 STEPVYRLGLLTMYDQYLSRRSVTRLTYSQ-------------NYHLSGKGEGIVVAPHV 132
+T+ +YR LL+ Y + +S S + Y++ N+H + GI +
Sbjct: 61 ATKAIYRW-LLSDYVK-VSNISADDMLYTETDLEESMDKIETINFHEVKEVAGIKFWCYH 118
Query: 133 AGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPP 192
AGH+LG ++ I G ++Y D++R++++HL + + ++P +LI ++ H
Sbjct: 119 AGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IKPDILIIESTYGTHIHEK 177
Query: 193 RQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEH-SLN-YPIYFLTYV 249
R++RE F + + + GG L+PV + GR ELLLIL++YW H L+ PIY+ + +
Sbjct: 178 REEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSL 237
Query: 250 SSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPD-GPKLVLASM 308
+ + ++++ M D I K + N F+ KH++ N +D+ D GP +V+AS
Sbjct: 238 AKKCMAVYQTYVNAMNDKIRKQININ--NPFVFKHIS---NLKSMDHFDDIGPSVVMASP 292
Query: 309 ASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPL-VG 367
+++G S ++F W +D +N V+ GTLA+ + ++P + + +++PL +
Sbjct: 293 GMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP--EEITTMSGQKLPLKMS 350
Query: 368 EELIAYEEEQTRLKKEEALKA 388
+ I++ + E ++A
Sbjct: 351 VDYISFSAHTDYQQTSEFIRA 371
>DICTYBASE|DDB_G0274799
symbol:cpsf3 "cleavage and polyadenylation
specificity factor 73 kDa subunit" species:44689 "Dictyostelium
discoideum" [GO:0016787 "hydrolase activity" evidence=IEA]
[GO:0003723 "RNA binding" evidence=IEA;IC] [GO:0006379 "mRNA
cleavage" evidence=ISS] [GO:0006378 "mRNA polyadenylation"
evidence=ISS] [GO:0005847 "mRNA cleavage and polyadenylation
specificity factor complex" evidence=ISS] [GO:0003730 "mRNA 3'-UTR
binding" evidence=ISS] [GO:0046872 "metal ion binding"
evidence=IEA] [GO:0006397 "mRNA processing" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] [GO:0004519 "endonuclease
activity" evidence=IEA] [GO:0004518 "nuclease activity"
evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
dictyBase:DDB_G0274799 Pfam:PF07521 GO:GO:0046872 GO:GO:0006378
GenomeReviews:CM000151_GR EMBL:AAFI02000012 GO:GO:0003730
GO:GO:0004519 GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
RefSeq:XP_643926.1 ProteinModelPortal:Q86A79 SMR:Q86A79
STRING:Q86A79 EnsemblProtists:DDB0233696 GeneID:8619353
KEGG:ddi:DDB_G0274799 ProtClustDB:CLSZ2431003 Uniprot:Q86A79
Length = 774
Score = 326 (119.8 bits), Expect = 9.2e-29, Sum P(2) = 9.2e-29
Identities = 88/315 (27%), Positives = 156/315 (49%)
Query: 55 IDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRSVTR---L 111
ID +L+SH H A+PY + + VF T P + + + D Y+ ++TR +
Sbjct: 90 IDLLLVSHFHLDHAAAVPYFVGKTKFKGRVFMTHPTKAIYGMLLSD-YVKVSNITRDDDM 148
Query: 112 TYSQN-------------YHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYN 158
+ ++ Y + GI V AGH+LG ++ I G ++Y D++
Sbjct: 149 LFDKSDLDRSLEKIEKVRYRQKVEHNGIKVTCFNAGHVLGAAMFMIEIAGVKILYTGDFS 208
Query: 159 RRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPV 217
R++++HL G V+ VLI ++ + PR +RE F ++ + + G L+PV
Sbjct: 209 RQEDRHLMGAETPP-VKVDVLIIESTYGVQVHEPRLEREKRFTSSVHQVVERNGKCLIPV 267
Query: 218 DSAGRVLELLLILEDYW-AEHSLNY-PIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS 275
+ GR ELLLIL++YW A L++ PIY+ + ++ + ++++ M D + F+ S
Sbjct: 268 FALGRAQELLLILDEYWIANPQLHHVPIYYASALAKKCMGVYRTYINMMNDRVRAQFDVS 327
Query: 276 RDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTE 335
N F KH+ + D+ GP + +AS L++G S +F W SD +N ++
Sbjct: 328 --NPFEFKHIKNIKGIESFDDR--GPCVFMASPGMLQSGLSRQLFERWCSDKRNGIVIPG 383
Query: 336 RGQFGTLARMLQADP 350
GTLA+ + ++P
Sbjct: 384 YSVEGTLAKHIMSEP 398
Score = 74 (31.1 bits), Expect = 9.2e-29, Sum P(2) = 9.2e-29
Identities = 18/85 (21%), Positives = 41/85 (48%)
Query: 495 EGSASLILDAKPSKVVS-NELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHG 553
EG+ + + ++P+++ + + V + + ++ + +D + + P +VLVHG
Sbjct: 387 EGTLAKHIMSEPAEITRLDNVNVPLNLTVSYVSFSAHSDFLQTSEFIQEIQPPHVVLVHG 446
Query: 554 SAEATEHLKQHCL-KHVCPHVYTPQ 577
A L+Q + K +V TP+
Sbjct: 447 DANEMSRLRQSLVAKFKTINVLTPK 471
>UNIPROTKB|I3LKR1
symbol:CPSF3 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0008409 "5'-3' exonuclease activity" evidence=IEA]
[GO:0006398 "histone mRNA 3'-end processing" evidence=IEA]
[GO:0005847 "mRNA cleavage and polyadenylation specificity factor
complex" evidence=IEA] [GO:0004521 "endoribonuclease activity"
evidence=IEA] [GO:0003723 "RNA binding" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0003723 GO:GO:0004521 GO:GO:0005847 InterPro:IPR022712
InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 OMA:YVSFSAH
InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
GeneTree:ENSGT00700000104485 GO:GO:0008409 EMBL:FP312696
Ensembl:ENSSSCT00000027309 Uniprot:I3LKR1
Length = 687
Score = 324 (119.1 bits), Expect = 1.4e-28, Sum P(2) = 1.4e-28
Identities = 77/278 (27%), Positives = 149/278 (53%)
Query: 116 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 175
N+H + GI + AGH+LG ++ I G ++Y D++R++++HL + + ++
Sbjct: 142 NFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPN-IK 200
Query: 176 PAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 234
P +LI ++ H R++RE F + + + GG L+PV + GR ELLLIL++YW
Sbjct: 201 PDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYW 260
Query: 235 AEH-SLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKS 292
H L+ PIY+ + ++ + ++++ M D I K + N F+ KH++ N
Sbjct: 261 QNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININ--NPFVFKHIS---NLK 315
Query: 293 ELDNAPD-GPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPP 351
+D+ D GP +V+AS +++G S ++F W +D +N V+ GTLA+ + ++P
Sbjct: 316 SMDHFDDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEP- 374
Query: 352 PKAVKVTMSRRVPL-VGEELIAYEEEQTRLKKEEALKA 388
+ + +++PL + + I++ + E ++A
Sbjct: 375 -EEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRA 411
Score = 72 (30.4 bits), Expect = 1.4e-28, Sum P(2) = 1.4e-28
Identities = 22/83 (26%), Positives = 40/83 (48%)
Query: 22 LVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLG 79
++ G ++DCG + + P + + ID +L+SH H GALP+ +++
Sbjct: 29 ILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTS 88
Query: 80 LSAPVF---STEPVYRLGLLTMY 99
F +T+ +YR LL+ Y
Sbjct: 89 FKGRTFMTHATKAIYRW-LLSDY 110
>GENEDB_PFALCIPARUM|PFC0825c
symbol:PFC0825c "cleavage and polyadenylation
specificity factor protein, putative" species:5833 "Plasmodium
falciparum" [GO:0006379 "mRNA cleavage" evidence=ISS] [GO:0006378
"mRNA polyadenylation" evidence=ISS] [GO:0003729 "mRNA binding"
evidence=ISS] [GO:0005847 "mRNA cleavage and polyadenylation
specificity factor complex" evidence=ISS] InterPro:IPR001279
SMART:SM00849 Pfam:PF07521 GO:GO:0003729 GO:GO:0016787
EMBL:AL844502 GO:GO:0005847 InterPro:IPR022712 InterPro:IPR011108
Pfam:PF10996 SMART:SM01027 KO:K13148 PIR:T18488
RefSeq:XP_001351256.1 ProteinModelPortal:O77371 PRIDE:O77371
EnsemblProtists:PFC0825c:mRNA GeneID:814500 KEGG:pfa:PFC0825c
EuPathDB:PlasmoDB:PF3D7_0318600 HOGENOM:HOG000283200
ProtClustDB:CLSZ2433497 Uniprot:O77371
Length = 1017
Score = 273 (101.2 bits), Expect = 2.0e-27, Sum P(3) = 2.0e-27
Identities = 63/220 (28%), Positives = 114/220 (51%)
Query: 128 VAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNAL 187
+ P+ AGH+LG ++KI VIY DYN +KHL + S + P + I+++ A
Sbjct: 286 ITPYYAGHVLGACIYKIEVRNFSVIYTGDYNTIPDKHLGSANIPS-LNPEIFISESTYAT 344
Query: 188 HNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFL 246
+ +P ++ E+ + + + + GG VL+PV + GR EL ++L+DYW + ++YPIYF
Sbjct: 345 YVRPTKKASELELCNLVHECVHKGGKVLIPVFAIGRAQELSILLDDYWKKMKIHYPIYFG 404
Query: 247 TYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLA 306
++ + Y K + W+ S + ++N F +++ +N + L+ P ++ A
Sbjct: 405 CGLTENANKYYKIYSSWINSSCMSN---EKENLFDFANISPFLN-NYLNEKR--PMVLFA 458
Query: 307 SMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML 346
+ L G S F WA + +NL++ GT+ L
Sbjct: 459 TPGMLHTGLSLKAFKAWAGNPQNLIVLPGYCVQGTVGHKL 498
Score = 97 (39.2 bits), Expect = 2.0e-27, Sum P(3) = 2.0e-27
Identities = 16/61 (26%), Positives = 32/61 (52%)
Query: 516 VQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHL-----KQHCLKHVC 570
++V C +I++ + AD I+ ++ HV+P ++ VHG + L +H + +C
Sbjct: 513 IKVLCKIIYLSFSAHADSNGIQQLIKHVSPKNVIFVHGEKNGMQKLAKYISNKHMINSMC 572
Query: 571 P 571
P
Sbjct: 573 P 573
Score = 70 (29.7 bits), Expect = 2.0e-27, Sum P(3) = 2.0e-27
Identities = 16/57 (28%), Positives = 28/57 (49%)
Query: 44 LLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD 100
L+ L ++ ID V++SH H+GALP+ + L + + P L + + D
Sbjct: 159 LINNLKRINEIIDCVIISHFHMDHIGALPFFTEILKYRGIILMSYPTKALSPILLLD 215
>UNIPROTKB|O77371
symbol:PFC0825c "Cleavage and polyadenylation specificity
factor protein, putative" species:36329 "Plasmodium falciparum 3D7"
[GO:0003729 "mRNA binding" evidence=ISS] [GO:0005847 "mRNA cleavage
and polyadenylation specificity factor complex" evidence=ISS]
InterPro:IPR001279 SMART:SM00849 Pfam:PF07521 GO:GO:0003729
GO:GO:0016787 EMBL:AL844502 GO:GO:0005847 InterPro:IPR022712
InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 KO:K13148 PIR:T18488
RefSeq:XP_001351256.1 ProteinModelPortal:O77371 PRIDE:O77371
EnsemblProtists:PFC0825c:mRNA GeneID:814500 KEGG:pfa:PFC0825c
EuPathDB:PlasmoDB:PF3D7_0318600 HOGENOM:HOG000283200
ProtClustDB:CLSZ2433497 Uniprot:O77371
Length = 1017
Score = 273 (101.2 bits), Expect = 2.0e-27, Sum P(3) = 2.0e-27
Identities = 63/220 (28%), Positives = 114/220 (51%)
Query: 128 VAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNAL 187
+ P+ AGH+LG ++KI VIY DYN +KHL + S + P + I+++ A
Sbjct: 286 ITPYYAGHVLGACIYKIEVRNFSVIYTGDYNTIPDKHLGSANIPS-LNPEIFISESTYAT 344
Query: 188 HNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFL 246
+ +P ++ E+ + + + + GG VL+PV + GR EL ++L+DYW + ++YPIYF
Sbjct: 345 YVRPTKKASELELCNLVHECVHKGGKVLIPVFAIGRAQELSILLDDYWKKMKIHYPIYFG 404
Query: 247 TYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLA 306
++ + Y K + W+ S + ++N F +++ +N + L+ P ++ A
Sbjct: 405 CGLTENANKYYKIYSSWINSSCMSN---EKENLFDFANISPFLN-NYLNEKR--PMVLFA 458
Query: 307 SMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARML 346
+ L G S F WA + +NL++ GT+ L
Sbjct: 459 TPGMLHTGLSLKAFKAWAGNPQNLIVLPGYCVQGTVGHKL 498
Score = 97 (39.2 bits), Expect = 2.0e-27, Sum P(3) = 2.0e-27
Identities = 16/61 (26%), Positives = 32/61 (52%)
Query: 516 VQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHL-----KQHCLKHVC 570
++V C +I++ + AD I+ ++ HV+P ++ VHG + L +H + +C
Sbjct: 513 IKVLCKIIYLSFSAHADSNGIQQLIKHVSPKNVIFVHGEKNGMQKLAKYISNKHMINSMC 572
Query: 571 P 571
P
Sbjct: 573 P 573
Score = 70 (29.7 bits), Expect = 2.0e-27, Sum P(3) = 2.0e-27
Identities = 16/57 (28%), Positives = 28/57 (49%)
Query: 44 LLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD 100
L+ L ++ ID V++SH H+GALP+ + L + + P L + + D
Sbjct: 159 LINNLKRINEIIDCVIISHFHMDHIGALPFFTEILKYRGIILMSYPTKALSPILLLD 215
>TAIR|locus:2065368
symbol:CPSF73-II "AT2G01730" species:3702 "Arabidopsis
thaliana" [GO:0003824 "catalytic activity" evidence=ISS]
[GO:0008152 "metabolic process" evidence=ISS] [GO:0016787
"hydrolase activity" evidence=IEA] [GO:0010197 "polar nucleus
fusion" evidence=IMP] [GO:0005515 "protein binding" evidence=IPI]
[GO:0005634 "nucleus" evidence=IDA] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 GO:GO:0005634 EMBL:CP002685
GenomeReviews:CT485783_GR GO:GO:0006397 GO:GO:0090305 EMBL:AC006069
GO:GO:0004518 GO:GO:0010197 eggNOG:COG1236 InterPro:IPR022712
InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 EMBL:AY168923
EMBL:AK221561 IPI:IPI00536069 PIR:D84428 RefSeq:NP_178282.2
UniGene:At.42473 ProteinModelPortal:Q8GUU3 SMR:Q8GUU3 IntAct:Q8GUU3
STRING:Q8GUU3 PaxDb:Q8GUU3 PRIDE:Q8GUU3 EnsemblPlants:AT2G01730.1
GeneID:814702 KEGG:ath:AT2G01730 TAIR:At2g01730
HOGENOM:HOG000231294 InParanoid:Q56XW2 KO:K13148 OMA:MAVEYMS
Genevestigator:Q8GUU3 Uniprot:Q8GUU3
Length = 613
Score = 296 (109.3 bits), Expect = 6.1e-27, Sum P(2) = 6.1e-27
Identities = 90/327 (27%), Positives = 148/327 (45%)
Query: 43 SLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQ- 101
SL+ + I ++++H H+GALPY + G + P++ + P L L + D
Sbjct: 48 SLISKSGDFDNAISCIIITHFHMDHVGALPYFTEVCGYNGPIYMSYPTKALSPLMLEDYR 107
Query: 102 --YLSRRSVTRL---TYSQN-----YHLSGK-----GEGIVVAPHVAGHLLGGTVWKITK 146
+ RR L T+ N + K E + + + AGH+LG V K
Sbjct: 108 RVMVDRRGEEELFTTTHIANCMKKVIAIDLKQTIQVDEDLQIRAYYAGHVLGA-VMVYAK 166
Query: 147 DGED-VIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAY-NALHNQPPRQQREMFQDAIS 204
G+ ++Y DYN ++HL ++ ++ Y + ++RE Q A+
Sbjct: 167 MGDAAIVYTGDYNMTTDRHLGAAKIDRLQLDLLISESTYATTIRGSKYPREREFLQ-AVH 225
Query: 205 KTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWM 264
K + GG L+P + GR EL ++L+DYW ++ PIYF + ++ Y K + W
Sbjct: 226 KCVAGGGKALIPSFALGRAQELCMLLDDYWERMNIKVPIYFSSGLTIQANMYYKMLISWT 285
Query: 265 GDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWA 324
++ + T N F K+V ++S L +AP GP ++ A+ L AGFS ++F WA
Sbjct: 286 SQNVKEKHNTH--NPFDFKNVKDF-DRS-LIHAP-GPCVLFATPGMLCAGFSLEVFKHWA 340
Query: 325 SDVKNLVLFTERGQFGTLARMLQADPP 351
NLV GT+ L A P
Sbjct: 341 PSPLNLVALPGYSVAGTVGHKLMAGKP 367
Score = 95 (38.5 bits), Expect = 7.5e-05, Sum P(2) = 7.5e-05
Identities = 25/86 (29%), Positives = 43/86 (50%)
Query: 22 LVSIDGFNFLIDCGWN----DHFD-P--SLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
+V+I+G + DCG + DH P SL+ + I ++++H H+GALPY
Sbjct: 20 VVTINGKKIMFDCGMHMGCDDHNRYPNFSLISKSGDFDNAISCIIITHFHMDHVGALPYF 79
Query: 75 MKQLGLSAPVFSTEPVYRLGLLTMYD 100
+ G + P++ + P L L + D
Sbjct: 80 TEVCGYNGPIYMSYPTKALSPLMLED 105
Score = 84 (34.6 bits), Expect = 6.1e-27, Sum P(2) = 6.1e-27
Identities = 32/132 (24%), Positives = 56/132 (42%)
Query: 501 ILDAKPSKV-VSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATE 559
++ KP+ V + N V V+C + + + D + I + ++P +VLVHG +
Sbjct: 362 LMAGKPTTVDLYNGTKVDVRCKVHQVAFSPHTDAKGIMDLTKFLSPKNVVLVHGEKPSMM 421
Query: 560 HLKQHCLKHVCPHVYTPQIEETIDVTSDLCAYKVQLSE---KLMSNVLFKKLGDYEIAWV 616
LK+ + + P ET+ S K S+ K SN FK ++
Sbjct: 422 ILKEKITSELDIPCFVPANGETVSFASTTYI-KANASDMFLKSCSNPNFKFSNSTQLRVT 480
Query: 617 DAEVGKTENGML 628
D +T +G+L
Sbjct: 481 DH---RTADGVL 489
>ASPGD|ASPL0000060573
symbol:AN0990 species:162425 "Emericella nidulans"
[GO:0008150 "biological_process" evidence=ND] [GO:0016787
"hydrolase activity" evidence=IEA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IEA]
[GO:0005829 "cytosol" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 GO:GO:0005634 EMBL:BN001308
GO:GO:0046872 GO:GO:0006397 GO:GO:0090305 GO:GO:0004519
eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 HOGENOM:HOG000203394 KO:K14403 InterPro:IPR021718
Pfam:PF11718 SMART:SM01098 EMBL:AACD01000015 RefSeq:XP_658594.1
ProteinModelPortal:Q5BEP0 STRING:Q5BEP0
EnsemblFungi:CADANIAT00001661 GeneID:2876766 KEGG:ani:AN0990.2
OMA:EISFAAH OrthoDB:EOG41ZJK7 Uniprot:Q5BEP0
Length = 884
Score = 299 (110.3 bits), Expect = 6.4e-27, Sum P(3) = 6.4e-27
Identities = 86/297 (28%), Positives = 149/297 (50%)
Query: 97 TMYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVD 156
T+Y ++ S L + +++ + I + P+ AGH+LG ++ I+ G ++++ D
Sbjct: 137 TLYTEH-DHLSTLPLIETIDFNTTHTINSIRITPYPAGHVLGAAMFLISIAGLNILFTGD 195
Query: 157 YNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLL 215
Y+R +++HL + V+ VLIT++ + + PPR +RE +I+ L GG VL+
Sbjct: 196 YSREEDRHLIPATVPRGVKIDVLITESTFGISSNPPRLEREAALMKSITGVLNRGGRVLM 255
Query: 216 PVDSAGRVLELLLILEDYWAEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSF- 272
PV + GR ELLLILE+YW H PIY++ + + ++++ M D+I + F
Sbjct: 256 PVFALGRAQELLLILEEYWETHPELQKIPIYYIGNTARRCMVVYQTYIGAMNDNIKRLFR 315
Query: 273 ------ETSRDNAFL-----LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFV 321
E S D + K+V L + D+ G ++LAS L+ G S ++
Sbjct: 316 QRMAEAEASGDKSVSAGPWDFKYVRSLRSLERFDDV--GGCVMLASPGMLQTGTSRELLE 373
Query: 322 EWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVGE-ELIAYEEEQ 377
WA + +N V+ T GT+A+ L +P + MSR +G + +EEQ
Sbjct: 374 RWAPNERNGVVMTGYSVEGTMAKQLLNEPDQ--IHAVMSRAATGMGRTRMNGNDEEQ 428
Score = 71 (30.1 bits), Expect = 6.4e-27, Sum P(3) = 6.4e-27
Identities = 21/73 (28%), Positives = 33/73 (45%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRSVT--- 109
ST+D +L+SH H ALPY + + VF T + + D + +
Sbjct: 74 STVDILLISHFHVDHSSALPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVNNTASSSD 133
Query: 110 -RLT-YSQNYHLS 120
R T Y+++ HLS
Sbjct: 134 QRTTLYTEHDHLS 146
Score = 60 (26.2 bits), Expect = 6.4e-27, Sum P(3) = 6.4e-27
Identities = 18/69 (26%), Positives = 29/69 (42%)
Query: 513 ELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCL-----K 567
++ + +C + I + DG + + V+ ++LVHG LK L K
Sbjct: 429 KIMIPRRCTVDEISFAAHVDGVENRNFIEEVSAPVVILVHGEKHQMMRLKSKLLSLNAEK 488
Query: 568 HVCPHVYTP 576
V VYTP
Sbjct: 489 TVKVKVYTP 497
>WB|WBGene00013460
symbol:cpsf-3 species:6239 "Caenorhabditis elegans"
[GO:0016787 "hydrolase activity" evidence=IEA] [GO:0009792 "embryo
development ending in birth or egg hatching" evidence=IMP]
[GO:0040007 "growth" evidence=IMP] [GO:0002119 "nematode larval
development" evidence=IMP] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 GO:GO:0009792 GO:GO:0040007
GO:GO:0002119 GO:GO:0016787 eggNOG:COG1236 InterPro:IPR022712
InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 HOGENOM:HOG000203394
KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
GeneTree:ENSGT00700000104485 EMBL:AL132951 RefSeq:NP_502553.2
ProteinModelPortal:Q95PY8 SMR:Q95PY8 STRING:Q95PY8 PaxDb:Q95PY8
EnsemblMetazoa:Y67H2A.1.1 EnsemblMetazoa:Y67H2A.1.2 GeneID:178285
KEGG:cel:CELE_Y67H2A.1 UCSC:Y67H2A.1 CTD:178285 WormBase:Y67H2A.1
InParanoid:Q95PY8 NextBio:900506 Uniprot:Q95PY8
Length = 707
Score = 316 (116.3 bits), Expect = 2.3e-26, Sum P(2) = 2.3e-26
Identities = 88/316 (27%), Positives = 156/316 (49%)
Query: 55 IDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYRLGL-----LTMY---DQ-- 101
ID +L++H H GALP+ +++ F +T+ +YR+ L ++ Y D+
Sbjct: 63 IDLLLITHFHLDHCGALPWLLQKTAFQGKCFMTHATKAIYRMLLGDYVRISKYGGPDRNQ 122
Query: 102 -YLS---RRSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDY 157
Y +S+ ++ + ++ + GI P+VAGH+LG + I G V+Y D+
Sbjct: 123 LYTEDDLEKSMAKIE-TIDFREQKEVNGIRFWPYVAGHVLGACQFMIEIAGVRVLYTGDF 181
Query: 158 NRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLP 216
+ +++HL + + P VLIT++ R RE F + + GG L+P
Sbjct: 182 SCLEDRHLCAAEIPP-ITPQVLITESTYGTQTHEDRAVREKRFTQMVHDIVTRGGRCLIP 240
Query: 217 VDSAGRVLELLLILEDYWAEHSL--NYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET 274
+ G EL+LIL++YW H + P+Y+ + ++ + ++F+ M I K
Sbjct: 241 AFAIGPAQELMLILDEYWESHQELHDIPVYYASSLAKKCMSVYQTFVNGMNSRIQKQIAV 300
Query: 275 SRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFT 334
N F+ KHV+ L + ++A GP +VLA+ L++GFS ++F W D KN +
Sbjct: 301 K--NPFIFKHVSTLRGMDQFEDA--GPCVVLATPGMLQSGFSRELFESWCPDTKNGCIIA 356
Query: 335 ERGQFGTLARMLQADP 350
GTLA+ + ++P
Sbjct: 357 GYCVEGTLAKHILSEP 372
Score = 60 (26.2 bits), Expect = 2.3e-26, Sum P(2) = 2.3e-26
Identities = 36/153 (23%), Positives = 64/153 (41%)
Query: 495 EGSASLILDAKPSKVVS-NELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHG 553
EG+ + + ++P ++VS + + ++ + ++ + D + + P LVLVHG
Sbjct: 361 EGTLAKHILSEPEEIVSLSGEKLPMRMQVGYVSFSAHTDYHQTSNFVKALKPPHLVLVHG 420
Query: 554 SAEATEHLKQHCLKHV----CP-HVYTP------QI----EETIDVTSDLCAYKVQLSEK 598
LK + P V+ P Q+ E+T V L A +V + +
Sbjct: 421 ELHEMSRLKSGIERQFQDDNIPIEVHNPRNTERLQLQFRGEKTAKVIGKL-AQRVPENNE 479
Query: 599 LMSNVLFKKLGDYEIAWVDAEVGKTENGMLSLL 631
+S VL K Y I V E+G + +S L
Sbjct: 480 TISGVLVKNNFSYSIM-VPEELGSYTSLRISSL 511
>WB|WBGene00008642
symbol:F10B5.8 species:6239 "Caenorhabditis elegans"
[GO:0016787 "hydrolase activity" evidence=IEA] [GO:0009792 "embryo
development ending in birth or egg hatching" evidence=IMP]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0009792 EMBL:Z48334 GO:GO:0016787 eggNOG:COG1236
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
HOGENOM:HOG000231294 KO:K13148 OMA:MAVEYMS
GeneTree:ENSGT00700000104485 PIR:T20694 RefSeq:NP_495706.2
ProteinModelPortal:Q9U3K2 SMR:Q9U3K2 STRING:Q9U3K2 PaxDb:Q9U3K2
EnsemblMetazoa:F10B5.8 GeneID:174310 KEGG:cel:CELE_F10B5.8
UCSC:F10B5.8 CTD:174310 WormBase:F10B5.8 InParanoid:Q9U3K2
NextBio:883468 Uniprot:Q9U3K2
Length = 608
Score = 298 (110.0 bits), Expect = 4.9e-26, Sum P(2) = 4.9e-26
Identities = 71/243 (29%), Positives = 127/243 (52%)
Query: 133 AGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPP 192
AGH+LG +++I V+Y DYN ++HL + VRP VLI+++ A +
Sbjct: 159 AGHVLGAAMFEIRLGDHSVLYTGDYNMTPDRHLGAARVLPGVRPTVLISESTYATTIRDS 218
Query: 193 RQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSS 251
++ RE F + + + GG V++PV + GR EL ++LE YW +LN PIYF ++
Sbjct: 219 KRARERDFLRKVHECVMKGGKVIIPVFALGRAQELCILLESYWERMALNVPIYFSQGLAE 278
Query: 252 STIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSELDNAPDGPKLVLASMASL 311
Y + F+ W ++I K+F R N F KH+ + + ++ P GP+++ ++ L
Sbjct: 279 RANQYYRLFISWTNENIKKTF-VER-NMFEFKHIKPM--EKGCEDQP-GPQVLFSTPGML 333
Query: 312 EAGFSHDIFVEWASDVKNLVLFTERGQFGTL-ARMLQADPPPKAVKVTMSRRVPLVGEEL 370
G S +F +W SD N+++ GT+ AR++ + K +++ +G E
Sbjct: 334 HGGQSLKVFKKWCSDPLNMIIMPGYCVAGTVGARVINGE---KKIEIDQKMHEIRLGVEY 390
Query: 371 IAY 373
+++
Sbjct: 391 MSF 393
Score = 151 (58.2 bits), Expect = 9.7e-10, Sum P(2) = 9.7e-10
Identities = 56/216 (25%), Positives = 97/216 (44%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
+++ PL + L++I G N ++DCG + D F D S + ++ +D
Sbjct: 8 IKIVPLGAGQDVGRSCILITIGGKNIMVDCGMHMGYQDDRRFPDFSYIGGGGRLTDYLDC 67
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLG--LLTMYDQY-LSRRSVTRLTYS 114
V++SH H G+LP+ + +G P++ T P + LL Y + + T S
Sbjct: 68 VIISHFHLDHCGSLPHMSEIVGYDGPIYMTYPTKAICPVLLEDYRKVQCDIKGETNFFTS 127
Query: 115 QNY-HLSGKGEGIVVAP--HV----------AGHLLGGTVWKITKDGEDVIYAVDYNRRK 161
+ + K G + HV AGH+LG +++I V+Y DYN
Sbjct: 128 DDIKNCMKKVVGCALHEIIHVDNELSIRAFYAGHVLGAAMFEIRLGDHSVLYTGDYNMTP 187
Query: 162 EKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQRE 197
++HL + VRP VLI+++ A + ++ RE
Sbjct: 188 DRHLGAARVLPGVRPTVLISESTYATTIRDSKRARE 223
Score = 73 (30.8 bits), Expect = 4.9e-26, Sum P(2) = 4.9e-26
Identities = 17/79 (21%), Positives = 37/79 (46%)
Query: 508 KVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCLK 567
K+ ++ +++ + ++ + AD + I ++ P ++ VHG A E LK K
Sbjct: 374 KIEIDQKMHEIRLGVEYMSFSAHADAKGIMQLIRQCEPQHVMFVHGEASKMEFLKGKVEK 433
Query: 568 HVCPHVYTPQIEETIDVTS 586
V+ P ET+ +++
Sbjct: 434 EYKVPVHMPANGETVVISA 452
>CGD|CAL0005344
symbol:orf19.5486 species:5476 "Candida albicans" [GO:0042493
"response to drug" evidence=IMP] [GO:0004521 "endoribonuclease
activity" evidence=IEA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IEA]
[GO:0005829 "cytosol" evidence=IEA] [GO:0034247 "snoRNA splicing"
evidence=IEA] [GO:0031126 "snoRNA 3'-end processing" evidence=IEA]
[GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006369 "termination
of RNA polymerase II transcription" evidence=IEA] [GO:0006378 "mRNA
polyadenylation" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 CGD:CAL0005344 Pfam:PF07521 GO:GO:0005634
GO:GO:0042493 GO:GO:0046872 GO:GO:0006397 GO:GO:0090305
GO:GO:0004519 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
Pfam:PF10996 SMART:SM01027 KO:K14403 InterPro:IPR021718
Pfam:PF11718 SMART:SM01098 EMBL:AACQ01000196 EMBL:AACQ01000195
RefSeq:XP_711478.1 RefSeq:XP_711502.1 ProteinModelPortal:Q59P50
STRING:Q59P50 GeneID:3646887 GeneID:3646911 KEGG:cal:CaO19.12941
KEGG:cal:CaO19.5486 Uniprot:Q59P50
Length = 870
Score = 293 (108.2 bits), Expect = 1.4e-24, Sum P(2) = 1.4e-24
Identities = 76/263 (28%), Positives = 137/263 (52%)
Query: 116 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 175
+YH + + +GI + AGH+LG ++ I G V++ DY+R + +HL+ + ++
Sbjct: 237 DYHSTMEIDGIRFTAYHAGHVLGACMYFIEIGGLKVLFTGDYSREENRHLHAAEVPP-LK 295
Query: 176 PAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 234
P +LI+++ PR + E I T+ GG VLLPV + G ELLLIL++YW
Sbjct: 296 PDILISESTFGTGTLEPRIELERKLTTHIHATIAKGGRVLLPVFALGNAQELLLILDEYW 355
Query: 235 AEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS-RDNAFLLKHVTLLINK 291
+++ N +++ + ++ + +++ M D I S +S + N F K++ + +
Sbjct: 356 SQNEDLQNVNVFYASNLAKKCMAVYETYTGIMNDKIRLSSASSEKSNPFDFKYIKSIKDL 415
Query: 292 SELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPP 351
S+ + GP +V+A+ L+AG S + +WA D KNLV+ T GT+A+ L +P
Sbjct: 416 SKFQDM--GPSVVVATPGMLQAGVSRQLLEKWAPDGKNLVILTGYSVEGTMAKELLKEPT 473
Query: 352 PKAVKVTMSRRVPL-VGEELIAY 373
+P +G E I++
Sbjct: 474 MIQSATNPDMTIPRRIGIEEISF 496
Score = 71 (30.1 bits), Expect = 1.4e-24, Sum P(2) = 1.4e-24
Identities = 16/43 (37%), Positives = 24/43 (55%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYR 92
S +D +L+SH H +LPY M+Q VF +T+ +YR
Sbjct: 150 SKVDILLISHFHVDHSASLPYVMQQSNFRGKVFMTHATKAIYR 192
>UNIPROTKB|Q59P50
symbol:YSH1 "Endoribonuclease YSH1" species:237561 "Candida
albicans SC5314" [GO:0042493 "response to drug" evidence=IMP]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 CGD:CAL0005344
Pfam:PF07521 GO:GO:0005634 GO:GO:0042493 GO:GO:0046872
GO:GO:0006397 GO:GO:0090305 GO:GO:0004519 eggNOG:COG1236
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
KO:K14403 InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
EMBL:AACQ01000196 EMBL:AACQ01000195 RefSeq:XP_711478.1
RefSeq:XP_711502.1 ProteinModelPortal:Q59P50 STRING:Q59P50
GeneID:3646887 GeneID:3646911 KEGG:cal:CaO19.12941
KEGG:cal:CaO19.5486 Uniprot:Q59P50
Length = 870
Score = 293 (108.2 bits), Expect = 1.4e-24, Sum P(2) = 1.4e-24
Identities = 76/263 (28%), Positives = 137/263 (52%)
Query: 116 NYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVR 175
+YH + + +GI + AGH+LG ++ I G V++ DY+R + +HL+ + ++
Sbjct: 237 DYHSTMEIDGIRFTAYHAGHVLGACMYFIEIGGLKVLFTGDYSREENRHLHAAEVPP-LK 295
Query: 176 PAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYW 234
P +LI+++ PR + E I T+ GG VLLPV + G ELLLIL++YW
Sbjct: 296 PDILISESTFGTGTLEPRIELERKLTTHIHATIAKGGRVLLPVFALGNAQELLLILDEYW 355
Query: 235 AEHS--LNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETS-RDNAFLLKHVTLLINK 291
+++ N +++ + ++ + +++ M D I S +S + N F K++ + +
Sbjct: 356 SQNEDLQNVNVFYASNLAKKCMAVYETYTGIMNDKIRLSSASSEKSNPFDFKYIKSIKDL 415
Query: 292 SELDNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPP 351
S+ + GP +V+A+ L+AG S + +WA D KNLV+ T GT+A+ L +P
Sbjct: 416 SKFQDM--GPSVVVATPGMLQAGVSRQLLEKWAPDGKNLVILTGYSVEGTMAKELLKEPT 473
Query: 352 PKAVKVTMSRRVPL-VGEELIAY 373
+P +G E I++
Sbjct: 474 MIQSATNPDMTIPRRIGIEEISF 496
Score = 71 (30.1 bits), Expect = 1.4e-24, Sum P(2) = 1.4e-24
Identities = 16/43 (37%), Positives = 24/43 (55%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVF---STEPVYR 92
S +D +L+SH H +LPY M+Q VF +T+ +YR
Sbjct: 150 SKVDILLISHFHVDHSASLPYVMQQSNFRGKVFMTHATKAIYR 192
>GENEDB_PFALCIPARUM|PF14_0364
symbol:PF14_0364 "cleavage and polyadenylation
specificity factor protein, putative" species:5833 "Plasmodium
falciparum" [GO:0005847 "mRNA cleavage and polyadenylation
specificity factor complex" evidence=ISS] [GO:0006378 "mRNA
polyadenylation" evidence=ISS] [GO:0006379 "mRNA cleavage"
evidence=ISS] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 GO:GO:0016787 EMBL:AE014187 GO:GO:0005847
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
HOGENOM:HOG000203394 KO:K14403 InterPro:IPR021718 Pfam:PF11718
SMART:SM01098 RefSeq:XP_001348538.1 ProteinModelPortal:Q8IL83
PRIDE:Q8IL83 EnsemblProtists:PF14_0364:mRNA GeneID:811946
KEGG:pfa:PF14_0364 EuPathDB:PlasmoDB:PF3D7_1438500 OMA:CLITHFH
ProtClustDB:CLSZ2457730 Uniprot:Q8IL83
Length = 876
Score = 244 (91.0 bits), Expect = 6.8e-24, Sum P(3) = 6.8e-24
Identities = 68/259 (26%), Positives = 135/259 (52%)
Query: 98 MYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDY 157
+YD+ +++ L + N+H + + + + AGH++G ++ + + +Y DY
Sbjct: 167 LYDENDIDKTMD-LIETLNFHQNFEFPNVKFTAYRAGHVIGACMFLVEINNIRFLYTGDY 225
Query: 158 NRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLP 216
+R ++H+ + + + VLI + + R++RE+ F + ++ + G VLLP
Sbjct: 226 SREIDRHIPIAEIPN-IDVHVLICEGTYGIKVHDDRKKREIRFLNILTSMINNKGKVLLP 284
Query: 217 VDSAGRVLELLLILEDYWAE--HSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET 274
V + GR ELLLILE++W + H N PI++++ +++ ++ ++F+ G+ + K
Sbjct: 285 VFALGRAQELLLILEEHWDKNKHLQNIPIFYISSMATKSLCIYETFINLCGEFVKKVVNE 344
Query: 275 SRDNAFLLKHVTLLINKSELDN---APDGPKLVLASMASLEAGFSHDIFVEWASDVKNLV 331
+ N F K+V + + + + P +++AS L+ G S +IF ASD K+ V
Sbjct: 345 GK-NPFNFKYVKYAKSLESISSYLYQDNNPCVIMASPGMLQNGISKNIFNIIASDKKSGV 403
Query: 332 LFTERGQFGTLARMLQADP 350
+ T GTLA L+ +P
Sbjct: 404 ILTGYTVKGTLADELKTEP 422
Score = 81 (33.6 bits), Expect = 6.8e-24, Sum P(3) = 6.8e-24
Identities = 23/102 (22%), Positives = 44/102 (43%)
Query: 3 TSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLL 60
+++ + L G ++ D + ++DCG + F P+ S +D L+
Sbjct: 2 SNINIVCLGGASEVGRSCVIIECDKTSVMLDCGIHPAFMGIGCLPIYDAYDISKVDLCLI 61
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQY 102
+H H GALPY + + +F TE + L +++ Y
Sbjct: 62 THFHMDHSGALPYLINKTRFKGRIFMTEATKSICYL-LWNDY 102
Score = 80 (33.2 bits), Expect = 6.8e-24, Sum P(3) = 6.8e-24
Identities = 22/85 (25%), Positives = 38/85 (44%)
Query: 495 EGSASLILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGS 554
+G+ + L +P V N+ V+ KC I + +D KT + + +VLVHG
Sbjct: 411 KGTLADELKTEPEFVTINDKVVKRKCRFEQISFSAHSDFNQTKTFIEKLKCPNVVLVHGD 470
Query: 555 AEATEHLKQHCLKHV-CPHVYTPQI 578
LK ++ V+TP++
Sbjct: 471 KNELNRLKNKLIEEKQYLSVFTPEL 495
>UNIPROTKB|Q8IL83
symbol:PF14_0364 "Cleavage and polyadenylation specificity
factor protein, putative" species:36329 "Plasmodium falciparum 3D7"
[GO:0005847 "mRNA cleavage and polyadenylation specificity factor
complex" evidence=ISS] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 GO:GO:0016787 EMBL:AE014187
GO:GO:0005847 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 HOGENOM:HOG000203394 KO:K14403 InterPro:IPR021718
Pfam:PF11718 SMART:SM01098 RefSeq:XP_001348538.1
ProteinModelPortal:Q8IL83 PRIDE:Q8IL83
EnsemblProtists:PF14_0364:mRNA GeneID:811946 KEGG:pfa:PF14_0364
EuPathDB:PlasmoDB:PF3D7_1438500 OMA:CLITHFH ProtClustDB:CLSZ2457730
Uniprot:Q8IL83
Length = 876
Score = 244 (91.0 bits), Expect = 6.8e-24, Sum P(3) = 6.8e-24
Identities = 68/259 (26%), Positives = 135/259 (52%)
Query: 98 MYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDY 157
+YD+ +++ L + N+H + + + + AGH++G ++ + + +Y DY
Sbjct: 167 LYDENDIDKTMD-LIETLNFHQNFEFPNVKFTAYRAGHVIGACMFLVEINNIRFLYTGDY 225
Query: 158 NRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLP 216
+R ++H+ + + + VLI + + R++RE+ F + ++ + G VLLP
Sbjct: 226 SREIDRHIPIAEIPN-IDVHVLICEGTYGIKVHDDRKKREIRFLNILTSMINNKGKVLLP 284
Query: 217 VDSAGRVLELLLILEDYWAE--HSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFET 274
V + GR ELLLILE++W + H N PI++++ +++ ++ ++F+ G+ + K
Sbjct: 285 VFALGRAQELLLILEEHWDKNKHLQNIPIFYISSMATKSLCIYETFINLCGEFVKKVVNE 344
Query: 275 SRDNAFLLKHVTLLINKSELDN---APDGPKLVLASMASLEAGFSHDIFVEWASDVKNLV 331
+ N F K+V + + + + P +++AS L+ G S +IF ASD K+ V
Sbjct: 345 GK-NPFNFKYVKYAKSLESISSYLYQDNNPCVIMASPGMLQNGISKNIFNIIASDKKSGV 403
Query: 332 LFTERGQFGTLARMLQADP 350
+ T GTLA L+ +P
Sbjct: 404 ILTGYTVKGTLADELKTEP 422
Score = 81 (33.6 bits), Expect = 6.8e-24, Sum P(3) = 6.8e-24
Identities = 23/102 (22%), Positives = 44/102 (43%)
Query: 3 TSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKV--ASTIDAVLL 60
+++ + L G ++ D + ++DCG + F P+ S +D L+
Sbjct: 2 SNINIVCLGGASEVGRSCVIIECDKTSVMLDCGIHPAFMGIGCLPIYDAYDISKVDLCLI 61
Query: 61 SHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQY 102
+H H GALPY + + +F TE + L +++ Y
Sbjct: 62 THFHMDHSGALPYLINKTRFKGRIFMTEATKSICYL-LWNDY 102
Score = 80 (33.2 bits), Expect = 6.8e-24, Sum P(3) = 6.8e-24
Identities = 22/85 (25%), Positives = 38/85 (44%)
Query: 495 EGSASLILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGS 554
+G+ + L +P V N+ V+ KC I + +D KT + + +VLVHG
Sbjct: 411 KGTLADELKTEPEFVTINDKVVKRKCRFEQISFSAHSDFNQTKTFIEKLKCPNVVLVHGD 470
Query: 555 AEATEHLKQHCLKHV-CPHVYTPQI 578
LK ++ V+TP++
Sbjct: 471 KNELNRLKNKLIEEKQYLSVFTPEL 495
>UNIPROTKB|C9J979
symbol:CPSF3L "Integrator complex subunit 11" species:9606
"Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 GO:GO:0016787
EMBL:AL139287 InterPro:IPR022712 Pfam:PF10996 HOGENOM:HOG000231294
HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI00514808
ProteinModelPortal:C9J979 SMR:C9J979 STRING:C9J979
Ensembl:ENST00000434694 ArrayExpress:C9J979 Bgee:C9J979
Uniprot:C9J979
Length = 344
Score = 178 (67.7 bits), Expect = 2.2e-19, Sum P(2) = 2.2e-19
Identities = 41/112 (36%), Positives = 61/112 (54%)
Query: 175 RPAVLITDAYNALHNQPPRQQREM-FQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDY 233
RP +LIT++ A + ++ RE F + +T+ GG VL+PV + GR EL ++LE +
Sbjct: 226 RPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCILLETF 285
Query: 234 WAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHV 285
W +L PIYF T ++ Y K F+ W I K+F R N F KH+
Sbjct: 286 WERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTF-VQR-NMFEFKHI 335
Score = 127 (49.8 bits), Expect = 2.2e-19, Sum P(2) = 2.2e-19
Identities = 33/103 (32%), Positives = 52/103 (50%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG +ND F D S + ++ +D
Sbjct: 4 IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD 100
V++SH H GALPY + +G P++ T P + + + D
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLED 106
>UNIPROTKB|G3V3T7
symbol:CPSF2 "Cleavage and polyadenylation-specificity
factor subunit 2" species:9606 "Homo sapiens" [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=IEA] [GO:0006378 "mRNA polyadenylation" evidence=IEA]
[GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0016787 "hydrolase
activity" evidence=IEA] InterPro:IPR001279 InterPro:IPR027075
Pfam:PF00753 GO:GO:0016787 PANTHER:PTHR11203:SF5 HGNC:HGNC:2325
ChiTaRS:CPSF2 EMBL:AL121773 ProteinModelPortal:G3V3T7 SMR:G3V3T7
Ensembl:ENST00000553427 ArrayExpress:G3V3T7 Bgee:G3V3T7
Uniprot:G3V3T7
Length = 80
Score = 236 (88.1 bits), Expect = 8.6e-19, P = 8.6e-19
Identities = 44/80 (55%), Positives = 58/80 (72%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++T LSGV E+ L YL+ +D F FL+DCGW++HF ++ L K IDAVLL
Sbjct: 1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRKHVHQIDAVLL 60
Query: 61 SHPDTLHLGALPYAMKQLGL 80
SHPD LHLGALPYA+ +LGL
Sbjct: 61 SHPDPLHLGALPYAVGKLGL 80
>UNIPROTKB|Q5ZKK2
symbol:INTS9 "Integrator complex subunit 9" species:9031
"Gallus gallus" [GO:0016180 "snRNA processing" evidence=IEA]
[GO:0032039 "integrator complex" evidence=IEA] InterPro:IPR027074
eggNOG:COG1236 InterPro:IPR022712 Pfam:PF10996 SMART:SM01027
GO:GO:0032039 GO:GO:0016180 CTD:55756 HOGENOM:HOG000045359
HOVERGEN:HBG081802 KO:K13146 OrthoDB:EOG415GCW
PANTHER:PTHR11203:SF2 EMBL:AJ720082 IPI:IPI00651516
RefSeq:NP_001026271.1 UniGene:Gga.21113 ProteinModelPortal:Q5ZKK2
STRING:Q5ZKK2 Ensembl:ENSGALT00000026848 GeneID:422023
KEGG:gga:422023 GeneTree:ENSGT00390000001445 InParanoid:Q5ZKK2
OMA:PLAMKCV NextBio:20824712 Uniprot:Q5ZKK2
Length = 658
Score = 165 (63.1 bits), Expect = 2.7e-12, Sum P(3) = 2.7e-12
Identities = 70/251 (27%), Positives = 107/251 (42%)
Query: 97 TMYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVD 156
TM + + + + YSQ L G + V P +G+ LG + W I E V Y V
Sbjct: 183 TMPEVNAALSKIQLVGYSQKIELFG---AVQVTPLSSGYALGSSNWIIQSHYEKVSY-VS 238
Query: 157 YNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLP 216
+ H S VLI + P F ++ T+R GGNVL+P
Sbjct: 239 GSSLLTTHPQPMDQASLKNSDVLILTGLTQIPTANPDGMVGEFCSNLAMTVRNGGNVLVP 298
Query: 217 VDSAGRVLELLLILEDYWAEHSL-NYPIYFLTYVSSSTIDYVKSFLEWM-GDSITKSF-- 272
+G + +LL L Y L N P YF++ V++S++++ + F EW+ + TK +
Sbjct: 299 CYPSGVIYDLLECLYQYIDSAGLSNVPFYFISPVANSSLEFSQIFAEWLCHNKQTKVYLP 358
Query: 273 ETSRDNAFL-----LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI--FVE-WA 324
E +A L LKH + + N P ++ SL G D+ F+E W
Sbjct: 359 EPPFPHAELIQTNKLKHYPSI--HGDFSNDFKQPCVIFTGHPSLRFG---DVVHFMELWG 413
Query: 325 SDVKNLVLFTE 335
N V+FTE
Sbjct: 414 KSSLNTVIFTE 424
Score = 85 (35.0 bits), Expect = 2.7e-12, Sum P(3) = 2.7e-12
Identities = 30/103 (29%), Positives = 54/103 (52%)
Query: 22 LVSIDGFNFLI----DCGWNDHFD--PSLLQPLSKVA--STIDAVLLSHPDTLHLGALPY 73
LV DG FL +C + D P P +++ ST+D +L+S+ + ALPY
Sbjct: 55 LVLKDGSTFLDKELKECSGHVFVDSVPEFCLPETELLDLSTVDVILISNYHCMM--ALPY 112
Query: 74 AMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRSVTRLTYSQN 116
+ G + V++TEP ++G L M ++ ++ S+ R+ +Q+
Sbjct: 113 ITEYTGFTGTVYATEPTVQIGRLLM-EELVN--SIERVPKAQS 152
Score = 42 (19.8 bits), Expect = 2.7e-12, Sum P(3) = 2.7e-12
Identities = 19/72 (26%), Positives = 33/72 (45%)
Query: 592 KVQLSEKLMSNVLFKKLGD-YEIAWVDAEVGKTENGMLSLLPISTPAPP--HKSVLVGD- 647
K+++ +L +++ ++ +A V A + +N + LP P PP K V D
Sbjct: 515 KIEIMPELADSLVPLEIKPGISLATVSAMLHTKDNKHVLQLPPKPPQPPTSKKRKRVSDD 574
Query: 648 -LKMADLKPFLS 658
+ LKP LS
Sbjct: 575 VPECKPLKPLLS 586
>UNIPROTKB|E9PI75
symbol:CPSF3L "Integrator complex subunit 11" species:9606
"Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 GO:GO:0016787
EMBL:AL139287 HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI01011963
ProteinModelPortal:E9PI75 SMR:E9PI75 Ensembl:ENST00000527719
ArrayExpress:E9PI75 Bgee:E9PI75 Uniprot:E9PI75
Length = 209
Score = 172 (65.6 bits), Expect = 6.3e-12, P = 6.3e-12
Identities = 54/184 (29%), Positives = 87/184 (47%)
Query: 22 LVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
LVSI G N ++DCG +ND F D S + ++ +D V++SH H GALPY
Sbjct: 27 LVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHFHLDHCGALPYF 86
Query: 75 MKQLGLSAPVFSTEPVYRLG--LLTMYDQY-LSRRSVTRLTYSQNY----------HLSG 121
+ +G P++ T P + LL Y + + ++ SQ HL
Sbjct: 87 SEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDCMKKVVAVHLHQ 146
Query: 122 K---GEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAV 178
+ + + + AGH+LG +++I E V+Y DYN ++HL ++ RP +
Sbjct: 147 TVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CRPNL 205
Query: 179 LITD 182
LIT+
Sbjct: 206 LITE 209
>UNIPROTKB|F6XI08
symbol:INTS9 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0032039 "integrator complex" evidence=IEA]
[GO:0016180 "snRNA processing" evidence=IEA] InterPro:IPR027074
InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 CTD:55756 KO:K13146
PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445 OMA:PLAMKCV
Ensembl:ENSCAFT00000013124 EMBL:AAEX03014336 RefSeq:XP_543216.2
GeneID:486090 KEGG:cfa:486090 Uniprot:F6XI08
Length = 658
Score = 163 (62.4 bits), Expect = 9.2e-12, Sum P(2) = 9.2e-12
Identities = 72/251 (28%), Positives = 106/251 (42%)
Query: 97 TMYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVD 156
TM + + + + YSQ L G + V P +G+ LG + W I E V Y V
Sbjct: 183 TMQEVNSALSKIQLVGYSQKIELFG---AVQVTPLSSGYALGSSNWIIQSHYEKVSY-VS 238
Query: 157 YNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLP 216
+ H S VLI + P F ++ T+R GGNVL+P
Sbjct: 239 GSSLLTTHPQPMDQASLKNSDVLILTGLTQIPTANPDGMVGEFCSNLALTVRNGGNVLVP 298
Query: 217 VDSAGRVLELLLILEDYWAEHSL-NYPIYFLTYVSSSTIDYVKSFLEWM-GDSITKSF-- 272
+G + +LL L Y L N P YF++ V++S++++ + F EW+ + TK +
Sbjct: 299 CYPSGVIYDLLECLYQYIDSAGLSNIPFYFISPVANSSLEFSQIFAEWLCHNKQTKVYLP 358
Query: 273 ETSRDNAFL-----LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI--FVE-WA 324
E +A L LKH L D P +V SL G D+ F+E W
Sbjct: 359 EPPFPHAELIQTNKLKHYPSLHGDFSSDFRQ--PCVVFTGHPSLRFG---DVVHFMELWG 413
Query: 325 SDVKNLVLFTE 335
N V+FTE
Sbjct: 414 KSSLNTVIFTE 424
Score = 81 (33.6 bits), Expect = 9.2e-12, Sum P(2) = 9.2e-12
Identities = 17/46 (36%), Positives = 28/46 (60%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTM 98
ST+D +L+S+ + ALPY + G + V++TEP ++G L M
Sbjct: 94 STVDVILISNYHCMM--ALPYITEHTGFTGTVYATEPTVQIGRLLM 137
>UNIPROTKB|F1RJQ5
symbol:INTS9 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0032039 "integrator complex" evidence=IEA] [GO:0016180
"snRNA processing" evidence=IEA] InterPro:IPR027074
InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 PANTHER:PTHR11203:SF2
GeneTree:ENSGT00390000001445 OMA:PLAMKCV EMBL:CU407017
Ensembl:ENSSSCT00000010615 Uniprot:F1RJQ5
Length = 576
Score = 161 (61.7 bits), Expect = 1.0e-11, Sum P(2) = 1.0e-11
Identities = 70/251 (27%), Positives = 107/251 (42%)
Query: 97 TMYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVD 156
TM + + + + YSQ L G + V P +G+ LG + W I E V Y V
Sbjct: 101 TMQEVNSALSKIQMVGYSQKIELFG---AVQVTPLSSGYALGSSNWIIQSHYEKVSY-VS 156
Query: 157 YNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLP 216
+ H S VLI + P F ++ T+R GGNVL+P
Sbjct: 157 GSSLLTTHPQPMDQASLKNSDVLILTGLTQIPTANPDGMVGEFCSNLALTVRNGGNVLVP 216
Query: 217 VDSAGRVLELLLILEDYWAEHSLN-YPIYFLTYVSSSTIDYVKSFLEWM-GDSITKSF-- 272
+G + +LL L Y L+ P YF++ V++S++++ + F EW+ + TK +
Sbjct: 217 CYPSGVIYDLLECLYQYIDSAGLSSIPFYFISPVANSSLEFSQIFAEWLCHNKQTKVYLP 276
Query: 273 ETSRDNAFL-----LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI--FVE-WA 324
E +A L LKH + + N P +V SL G D+ F+E W
Sbjct: 277 EPPFPHAELIQTNKLKHYPSI--HGDFSNDFRQPCVVFTGHPSLRFG---DVVHFMELWG 331
Query: 325 SDVKNLVLFTE 335
N V+FTE
Sbjct: 332 KSSLNTVIFTE 342
Score = 81 (33.6 bits), Expect = 1.0e-11, Sum P(2) = 1.0e-11
Identities = 17/46 (36%), Positives = 28/46 (60%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTM 98
ST+D +L+S+ + ALPY + G + V++TEP ++G L M
Sbjct: 12 STVDVILISNYHCMM--ALPYITEHTGFTGTVYATEPTVQIGRLLM 55
>UNIPROTKB|F1MMA6
symbol:INTS9 "Integrator complex subunit 9" species:9913
"Bos taurus" [GO:0032039 "integrator complex" evidence=IEA]
[GO:0016180 "snRNA processing" evidence=IEA] InterPro:IPR027074
InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 GO:GO:0032039
GO:GO:0016180 IPI:IPI00701634 UniGene:Bt.91042
PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445 OMA:PLAMKCV
EMBL:DAAA02021965 EMBL:DAAA02021964 Ensembl:ENSBTAT00000049079
ArrayExpress:F1MMA6 Uniprot:F1MMA6
Length = 658
Score = 162 (62.1 bits), Expect = 1.2e-11, Sum P(2) = 1.2e-11
Identities = 70/251 (27%), Positives = 107/251 (42%)
Query: 97 TMYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVD 156
TM + + + + YSQ L G + V P +G+ LG + W I E V Y V
Sbjct: 183 TMQEVNSALSKIQLVGYSQKIELFG---AVQVTPLSSGYALGSSNWIIQSHYEKVSY-VS 238
Query: 157 YNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLP 216
+ H S VLI + P F ++ T+R GGNVL+P
Sbjct: 239 GSSLLTTHPQPMDQASLKNSDVLILTGLTQIPTANPDSMVGEFCSNLALTVRNGGNVLVP 298
Query: 217 VDSAGRVLELLLILEDYWAEHSLN-YPIYFLTYVSSSTIDYVKSFLEWM-GDSITKSF-- 272
+G + +LL L Y L+ P YF++ V++S++++ + F EW+ + TK +
Sbjct: 299 CYPSGVIYDLLECLYQYIDSAGLSSIPFYFISPVANSSLEFSQIFAEWLCHNKQTKVYLP 358
Query: 273 ETSRDNAFL-----LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI--FVE-WA 324
E +A L LKH + + N P +V SL G D+ F+E W
Sbjct: 359 EPPFPHAELIQTNKLKHYPSI--HGDFSNDFRQPCVVFTGHPSLRFG---DVVHFMELWG 413
Query: 325 SDVKNLVLFTE 335
N V+FTE
Sbjct: 414 KSSLNTVIFTE 424
Score = 81 (33.6 bits), Expect = 1.2e-11, Sum P(2) = 1.2e-11
Identities = 17/46 (36%), Positives = 28/46 (60%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTM 98
ST+D +L+S+ + ALPY + G + V++TEP ++G L M
Sbjct: 94 STVDVILISNYHCMM--ALPYITEHTGFTGTVYATEPTVQIGRLLM 137
>UNIPROTKB|Q2KJA6
symbol:INTS9 "Integrator complex subunit 9" species:9913
"Bos taurus" [GO:0032039 "integrator complex" evidence=IEA]
[GO:0016180 "snRNA processing" evidence=IEA] InterPro:IPR027074
GO:GO:0005634 eggNOG:COG1236 InterPro:IPR022712 Pfam:PF10996
SMART:SM01027 EMBL:BC105437 IPI:IPI00701634 RefSeq:NP_001039828.1
UniGene:Bt.91042 ProteinModelPortal:Q2KJA6 STRING:Q2KJA6
GeneID:533964 KEGG:bta:533964 CTD:55756 HOGENOM:HOG000045359
HOVERGEN:HBG081802 InParanoid:Q2KJA6 KO:K13146 OrthoDB:EOG415GCW
NextBio:20876211 PANTHER:PTHR11203:SF2 Uniprot:Q2KJA6
Length = 658
Score = 162 (62.1 bits), Expect = 1.2e-11, Sum P(2) = 1.2e-11
Identities = 70/251 (27%), Positives = 107/251 (42%)
Query: 97 TMYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVD 156
TM + + + + YSQ L G + V P +G+ LG + W I E V Y V
Sbjct: 183 TMQEVNSALSKIQLVGYSQKIELFG---AVQVTPLSSGYALGSSNWIIQSHYEKVSY-VS 238
Query: 157 YNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLP 216
+ H S VLI + P F ++ T+R GGNVL+P
Sbjct: 239 GSSLLTTHPQPMDQASLKNSDVLILTGLTQIPTANPDSMVGEFCSNLALTVRNGGNVLVP 298
Query: 217 VDSAGRVLELLLILEDYWAEHSLN-YPIYFLTYVSSSTIDYVKSFLEWM-GDSITKSF-- 272
+G + +LL L Y L+ P YF++ V++S++++ + F EW+ + TK +
Sbjct: 299 CYPSGVIYDLLECLYQYIDSAGLSSIPFYFISPVANSSLEFSQIFAEWLCHNKQTKVYLP 358
Query: 273 ETSRDNAFL-----LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI--FVE-WA 324
E +A L LKH + + N P +V SL G D+ F+E W
Sbjct: 359 EPPFPHAELIQTNKLKHYPSI--HGDFSNDFRQPCVVFTGHPSLRFG---DVVHFMELWG 413
Query: 325 SDVKNLVLFTE 335
N V+FTE
Sbjct: 414 KSSLNTVIFTE 424
Score = 81 (33.6 bits), Expect = 1.2e-11, Sum P(2) = 1.2e-11
Identities = 17/46 (36%), Positives = 28/46 (60%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTM 98
ST+D +L+S+ + ALPY + G + V++TEP ++G L M
Sbjct: 94 STVDVILISNYHCMM--ALPYITEHTGFTGTVYATEPTVQIGRLLM 137
>ZFIN|ZDB-GENE-061013-129
symbol:ints9 "integrator complex subunit 9"
species:7955 "Danio rerio" [GO:0016180 "snRNA processing"
evidence=IEA] [GO:0032039 "integrator complex" evidence=IEA]
InterPro:IPR027074 ZFIN:ZDB-GENE-061013-129 eggNOG:COG1236
InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 CTD:55756
HOGENOM:HOG000045359 HOVERGEN:HBG081802 KO:K13146 OrthoDB:EOG415GCW
PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445
EMBL:CABZ01076623 EMBL:CABZ01078246 EMBL:CABZ01078247
EMBL:CABZ01078248 EMBL:CABZ01078249 EMBL:BC124793 IPI:IPI00800641
RefSeq:NP_001070738.1 UniGene:Dr.116109 Ensembl:ENSDART00000097865
GeneID:768124 KEGG:dre:768124 InParanoid:Q08BB6 NextBio:20918446
Uniprot:Q08BB6
Length = 658
Score = 160 (61.4 bits), Expect = 1.9e-11, Sum P(3) = 1.9e-11
Identities = 66/235 (28%), Positives = 103/235 (43%)
Query: 113 YSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLES 172
YSQ L G + V P +G+ LG + W I E V Y V + H S
Sbjct: 199 YSQKVELFG---AVQVTPLSSGYSLGSSNWIIQSHYEKVSY-VSGSSLLTTHPQPMEQSS 254
Query: 173 FVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILED 232
VLI + P F ++ T+RAGGNVL+P S+G + +LL L
Sbjct: 255 LKNSDVLILTGLTQIPTANPDGMLGEFCSNLAMTVRAGGNVLVPCYSSGVIYDLLECLYQ 314
Query: 233 YWAEHSLNY-PIYFLTYVSSSTIDYVKSFLEWM-GDSITKSF--ETSRDNAFL-----LK 283
+ +L P YF++ V++S++++ + F EW+ + +K + E +A L LK
Sbjct: 315 FMDSANLGTTPFYFISPVANSSLEFSQIFAEWLCQNKQSKVYLPEPPFPHAELIQTNKLK 374
Query: 284 HVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI--FVE-WASDVKNLVLFTE 335
H + + + P +V SL G D+ F+E W N ++FTE
Sbjct: 375 HYPSI--HGDFSSEFRQPCVVFTGHPSLRFG---DVVHFMELWGKSSLNTIIFTE 424
Score = 82 (33.9 bits), Expect = 1.9e-11, Sum P(3) = 1.9e-11
Identities = 18/46 (39%), Positives = 28/46 (60%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTM 98
STID +L+S+ + ALPY + G + V++TEP ++G L M
Sbjct: 94 STIDVILISNYHCMM--ALPYITEHTGFTGTVYATEPTLQIGRLLM 137
Score = 42 (19.8 bits), Expect = 1.9e-11, Sum P(3) = 1.9e-11
Identities = 38/156 (24%), Positives = 58/156 (37%)
Query: 345 MLQADPPPKAVKVTMSRRVPLVGE-ELIAYEEEQTRLKKEEALKASLVKEEESKASLGPD 403
ML+ PPP A + R+P E I E + +KA + S D
Sbjct: 489 MLELQPPPMAYRRCSVLRLPFRRRYERIHLLPELAKSLVPSEVKAGVSVATVSAVLQSKD 548
Query: 404 NN--LSGDPMVIDXXXXXXSADVVEPHGGRYRD-ILIDGFVPPSTSVAPMFP--FYENNS 458
N L P V V+E + + L+ G VP +A + E
Sbjct: 549 NKHVLQPVPKVAPVAPSKKRKRVLEEPPEQLKPKTLLSGAVPLEPFLATLHKNGIMEVKV 608
Query: 459 EWDDFGEVIN--PDDYIIKDEDMDQAAMHIGGDDGK 492
E G +++ +D +I+ ED D A HI D+ +
Sbjct: 609 EETADGHILHLQAEDVLIQLED-D--ATHIICDNNE 641
>UNIPROTKB|G3XAN1
symbol:INTS9 "Integrator complex subunit 9" species:9606
"Homo sapiens" [GO:0016180 "snRNA processing" evidence=IEA]
[GO:0032039 "integrator complex" evidence=IEA] InterPro:IPR027074
EMBL:CH471080 InterPro:IPR022712 Pfam:PF10996 SMART:SM01027
PANTHER:PTHR11203:SF2 EMBL:AC040975 EMBL:AC131969 UniGene:Hs.162397
HGNC:HGNC:25592 ChiTaRS:INTS9 ProteinModelPortal:G3XAN1
Ensembl:ENST00000523303 ArrayExpress:G3XAN1 Bgee:G3XAN1
Uniprot:G3XAN1
Length = 525
Score = 157 (60.3 bits), Expect = 2.1e-11, Sum P(2) = 2.1e-11
Identities = 68/251 (27%), Positives = 108/251 (43%)
Query: 97 TMYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVD 156
TM + + + + YSQ L G + V P +G+ LG + W I E V Y V
Sbjct: 183 TMQEVNSALSKIQLVGYSQKIELFG---AVQVTPLSSGYALGSSNWIIQSHYEKVSY-VS 238
Query: 157 YNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLP 216
+ H S VL+ + P F ++ T+R GGNVL+P
Sbjct: 239 GSSLLTTHPQPMDQASLKNSDVLVLTGLTQIPTANPDGMVGEFCSNLALTVRNGGNVLVP 298
Query: 217 VDSAGRVLELLLILEDYWAEHSLN-YPIYFLTYVSSSTIDYVKSFLEWM-GDSITKSF-- 272
+G + +LL L Y L+ P+YF++ V++S++++ + F EW+ + +K +
Sbjct: 299 CYPSGVIYDLLECLYQYIDSAGLSSVPLYFISPVANSSLEFSQIFAEWLCHNKQSKVYLP 358
Query: 273 ETSRDNAFL-----LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI--FVE-WA 324
E +A L LKH + + N P +V SL G D+ F+E W
Sbjct: 359 EPPFPHAELIQTNKLKHYPSI--HGDFSNDFRQPCVVFTGHPSLRFG---DVVHFMELWG 413
Query: 325 SDVKNLVLFTE 335
N V+FTE
Sbjct: 414 KSSLNTVIFTE 424
Score = 81 (33.6 bits), Expect = 2.1e-11, Sum P(2) = 2.1e-11
Identities = 17/46 (36%), Positives = 28/46 (60%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTM 98
ST+D +L+S+ + ALPY + G + V++TEP ++G L M
Sbjct: 94 STVDVILISNYHCMM--ALPYITEHTGFTGTVYATEPTVQIGRLLM 137
>MGI|MGI:1098533
symbol:Ints9 "integrator complex subunit 9" species:10090
"Mus musculus" [GO:0003674 "molecular_function" evidence=ND]
[GO:0005634 "nucleus" evidence=IEA] [GO:0016180 "snRNA processing"
evidence=ISO] [GO:0032039 "integrator complex" evidence=ISO]
InterPro:IPR027074 MGI:MGI:1098533 eggNOG:COG1236
InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 GO:GO:0032039
GO:GO:0016180 CTD:55756 HOGENOM:HOG000045359 HOVERGEN:HBG081802
KO:K13146 OrthoDB:EOG415GCW PANTHER:PTHR11203:SF2
GeneTree:ENSGT00390000001445 EMBL:AK038979 EMBL:AK077634
EMBL:AK136992 EMBL:AK150195 EMBL:BC028953 EMBL:BC055700
IPI:IPI00223422 IPI:IPI00406798 RefSeq:NP_001240660.1
RefSeq:NP_700463.2 UniGene:Mm.71332 ProteinModelPortal:Q8K114
SMR:Q8K114 STRING:Q8K114 PhosphoSite:Q8K114 PaxDb:Q8K114
PRIDE:Q8K114 Ensembl:ENSMUST00000043914 GeneID:210925
KEGG:mmu:210925 UCSC:uc007uiv.1 UCSC:uc007uiw.1 InParanoid:Q8K114
NextBio:373083 Bgee:Q8K114 CleanEx:MM_INTS9 Genevestigator:Q8K114
Uniprot:Q8K114
Length = 658
Score = 158 (60.7 bits), Expect = 3.1e-11, Sum P(3) = 3.1e-11
Identities = 67/249 (26%), Positives = 108/249 (43%)
Query: 97 TMYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVD 156
TM + + + + YSQ L G + V P +G+ LG + W I E V Y V
Sbjct: 183 TMQEVNSALSKIQLVGYSQKIELFG---AVQVTPLSSGYALGSSNWIIQSHYEKVSY-VS 238
Query: 157 YNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLP 216
+ H S VLI + P F ++ T+R GGNVL+P
Sbjct: 239 GSSLLTTHPQPMDQASLKNSDVLILTGLTQIPTANPDGMVGEFCSNLALTVRNGGNVLVP 298
Query: 217 VDSAGRVLELLLILEDYWAEHSL-NYPIYFLTYVSSSTIDYVKSFLEWM-GDSITKSF-- 272
+G + +LL L Y L N P YF++ V++S++++ + F EW+ + +K +
Sbjct: 299 CYPSGVIYDLLECLYQYIDSAGLSNIPFYFISPVANSSLEFSQIFAEWLCHNKQSKVYLP 358
Query: 273 ETSRDNAFLLKHVTLLINKS---ELDNAPDGPKLVLASMASLEAGFSHDI--FVE-WASD 326
E +A L++ L +S + N P ++ SL G D+ F+E W
Sbjct: 359 EPPFPHAELIQTNKLKHYRSIHGDFSNDFRQPCVLFTGHPSLRFG---DVVHFMELWGKS 415
Query: 327 VKNLVLFTE 335
N ++FTE
Sbjct: 416 SLNTIIFTE 424
Score = 81 (33.6 bits), Expect = 3.1e-11, Sum P(3) = 3.1e-11
Identities = 17/46 (36%), Positives = 28/46 (60%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTM 98
ST+D +L+S+ + ALPY + G + V++TEP ++G L M
Sbjct: 94 STVDVILISNYHCMM--ALPYITEHTGFTGTVYATEPTMQIGRLLM 137
Score = 43 (20.2 bits), Expect = 3.1e-11, Sum P(3) = 3.1e-11
Identities = 8/21 (38%), Positives = 13/21 (61%)
Query: 350 PPPKAVKVTMSRRVPLVGEEL 370
PPPK + T S++ V E++
Sbjct: 555 PPPKPTQPTSSKKRKRVNEDI 575
>UNIPROTKB|Q9NV88
symbol:INTS9 "Integrator complex subunit 9" species:9606
"Homo sapiens" [GO:0005515 "protein binding" evidence=IPI]
[GO:0016180 "snRNA processing" evidence=IDA] [GO:0032039
"integrator complex" evidence=IDA] InterPro:IPR027074
eggNOG:COG1236 InterPro:IPR022712 Pfam:PF10996 SMART:SM01027
GO:GO:0032039 EMBL:U96629 GO:GO:0016180 CTD:55756
HOGENOM:HOG000045359 HOVERGEN:HBG081802 KO:K13146 OrthoDB:EOG415GCW
PANTHER:PTHR11203:SF2 OMA:PLAMKCV EMBL:AK001733 EMBL:AK298468
EMBL:AK300593 EMBL:AC040975 EMBL:AC131969 EMBL:BC025267
EMBL:BK005726 EMBL:BK005674 IPI:IPI00290514 IPI:IPI00871167
RefSeq:NP_001138631.1 RefSeq:NP_001166033.1 RefSeq:NP_060720.2
UniGene:Hs.162397 ProteinModelPortal:Q9NV88 SMR:Q9NV88
IntAct:Q9NV88 STRING:Q9NV88 PhosphoSite:Q9NV88 DMDM:119371246
PaxDb:Q9NV88 PRIDE:Q9NV88 DNASU:55756 Ensembl:ENST00000416984
Ensembl:ENST00000521022 Ensembl:ENST00000521777 GeneID:55756
KEGG:hsa:55756 UCSC:uc003xha.3 GeneCards:GC08M028625
HGNC:HGNC:25592 MIM:611352 neXtProt:NX_Q9NV88 PharmGKB:PA162392192
InParanoid:Q9NV88 PhylomeDB:Q9NV88 ChiTaRS:INTS9 GenomeRNAi:55756
NextBio:60763 ArrayExpress:Q9NV88 Bgee:Q9NV88 CleanEx:HS_INTS9
Genevestigator:Q9NV88 GermOnline:ENSG00000104299 Uniprot:Q9NV88
Length = 658
Score = 157 (60.3 bits), Expect = 4.1e-11, Sum P(2) = 4.1e-11
Identities = 68/251 (27%), Positives = 108/251 (43%)
Query: 97 TMYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVD 156
TM + + + + YSQ L G + V P +G+ LG + W I E V Y V
Sbjct: 183 TMQEVNSALSKIQLVGYSQKIELFG---AVQVTPLSSGYALGSSNWIIQSHYEKVSY-VS 238
Query: 157 YNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLP 216
+ H S VL+ + P F ++ T+R GGNVL+P
Sbjct: 239 GSSLLTTHPQPMDQASLKNSDVLVLTGLTQIPTANPDGMVGEFCSNLALTVRNGGNVLVP 298
Query: 217 VDSAGRVLELLLILEDYWAEHSLN-YPIYFLTYVSSSTIDYVKSFLEWM-GDSITKSF-- 272
+G + +LL L Y L+ P+YF++ V++S++++ + F EW+ + +K +
Sbjct: 299 CYPSGVIYDLLECLYQYIDSAGLSSVPLYFISPVANSSLEFSQIFAEWLCHNKQSKVYLP 358
Query: 273 ETSRDNAFL-----LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI--FVE-WA 324
E +A L LKH + + N P +V SL G D+ F+E W
Sbjct: 359 EPPFPHAELIQTNKLKHYPSI--HGDFSNDFRQPCVVFTGHPSLRFG---DVVHFMELWG 413
Query: 325 SDVKNLVLFTE 335
N V+FTE
Sbjct: 414 KSSLNTVIFTE 424
Score = 81 (33.6 bits), Expect = 4.1e-11, Sum P(2) = 4.1e-11
Identities = 17/46 (36%), Positives = 28/46 (60%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTM 98
ST+D +L+S+ + ALPY + G + V++TEP ++G L M
Sbjct: 94 STVDVILISNYHCMM--ALPYITEHTGFTGTVYATEPTVQIGRLLM 137
>DICTYBASE|DDB_G0282473
symbol:ints9 "integrator complex subunit 9"
species:44689 "Dictyostelium discoideum" [GO:0032039 "integrator
complex" evidence=IEA] [GO:0016180 "snRNA processing" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] [GO:0044351 "macropinocytosis"
evidence=RCA] InterPro:IPR027074 dictyBase:DDB_G0282473
GO:GO:0005634 EMBL:AAFI02000047 GenomeReviews:CM000152_GR
eggNOG:COG1236 InterPro:IPR022712 Pfam:PF10996 SMART:SM01027
KO:K13146 PANTHER:PTHR11203:SF2 RefSeq:XP_640069.1
ProteinModelPortal:Q54SH0 EnsemblProtists:DDB0234099 GeneID:8623598
KEGG:ddi:DDB_G0282473 OMA:DDFSTID ProtClustDB:CLSZ2729002
Uniprot:Q54SH0
Length = 712
Score = 189 (71.6 bits), Expect = 4.3e-11, P = 4.3e-11
Identities = 52/156 (33%), Positives = 71/156 (45%)
Query: 114 SQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDG-EDVIYAVDYNRRKEKHLNGTVLES 172
S ++ S K G P +G+ LG W I G E V+Y D + ++ L
Sbjct: 232 SIRFNESIKHYGFECIPSSSGYGLGSANWVIESKGFERVVYISDSSLSLSRYPTPFQLSP 291
Query: 173 FVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILED 232
P VLI N N PP Q I TL+ GG VL+P S G +L+L L D
Sbjct: 292 IDNPDVLILSKINHYPNNPPDQMLSELCSNIGSTLQQGGTVLIPSYSCGIILDLFEHLAD 351
Query: 233 YWAEHSLNY-PIYFLTYVSSSTIDYVKSFLEWMGDS 267
Y + L Y PIYF++ VS + + Y + EW+ S
Sbjct: 352 YLNKVGLPYVPIYFVSSVSKAVLSYADIYSEWLNKS 387
>RGD|1311539
symbol:Ints9 "integrator complex subunit 9" species:10116
"Rattus norvegicus" [GO:0016180 "snRNA processing"
evidence=IEA;ISO] [GO:0032039 "integrator complex"
evidence=IEA;ISO] InterPro:IPR027074 RGD:1311539 InterPro:IPR022712
Pfam:PF10996 SMART:SM01027 GO:GO:0032039 GO:GO:0016180
PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445 IPI:IPI00362364
Ensembl:ENSRNOT00000018071 Uniprot:F1M365
Length = 659
Score = 156 (60.0 bits), Expect = 6.5e-11, Sum P(3) = 6.5e-11
Identities = 69/249 (27%), Positives = 109/249 (43%)
Query: 97 TMYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVD 156
TM + + + + YSQ L G + V P +G+ LG + W I E V Y V
Sbjct: 184 TMQEVNSALSKIQLVGYSQKIELFG---AVQVTPLSSGYALGSSNWIIQSHYEKVSY-VS 239
Query: 157 YNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLP 216
+ H S VLI + P F ++ T+R GGNVL+P
Sbjct: 240 GSSLLTTHPQPMDQASLKNSDVLILTGLTQIPTANPDGMVGEFCSNLALTVRNGGNVLVP 299
Query: 217 VDSAGRVLELLLILEDYWAEHSL-NYPIYFLTYVSSSTIDYVKSFLEWM-GDSITKSF-- 272
+G + +LL L Y L N P YF++ V++S++++ + F EW+ + +K +
Sbjct: 300 CYPSGVIYDLLECLYQYIDSAGLSNIPFYFISPVANSSLEFSQIFAEWLCHNKQSKVYLP 359
Query: 273 ETSRDNAFLLKHVTLLINKS-ELDNAPD--GPKLVLASMASLEAGFSHDI--FVE-WASD 326
E +A L++ L +S D + D P ++ SL G D+ F+E W
Sbjct: 360 EPPFPHAELIQTNKLKHYRSIHGDFSHDFRQPCVLFTGHPSLRFG---DVVHFMELWGKS 416
Query: 327 VKNLVLFTE 335
N V+FTE
Sbjct: 417 SLNTVIFTE 425
Score = 81 (33.6 bits), Expect = 6.5e-11, Sum P(3) = 6.5e-11
Identities = 17/46 (36%), Positives = 28/46 (60%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTM 98
ST+D +L+S+ + ALPY + G + V++TEP ++G L M
Sbjct: 95 STVDVILISNYHCMM--ALPYITEHTGFTGTVYATEPTMQIGRLLM 138
Score = 42 (19.8 bits), Expect = 6.5e-11, Sum P(3) = 6.5e-11
Identities = 8/21 (38%), Positives = 13/21 (61%)
Query: 350 PPPKAVKVTMSRRVPLVGEEL 370
PPPK + T S++ V E++
Sbjct: 556 PPPKPTQPTSSKKRKRVSEDV 576
>UNIPROTKB|E9PIG1
symbol:CPSF3L "Integrator complex subunit 11" species:9606
"Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 GO:GO:0016787
EMBL:AL139287 HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI00981641
ProteinModelPortal:E9PIG1 SMR:E9PIG1 Ensembl:ENST00000530031
ArrayExpress:E9PIG1 Bgee:E9PIG1 Uniprot:E9PIG1
Length = 249
Score = 170 (64.9 bits), Expect = 1.9e-10, P = 1.9e-10
Identities = 54/183 (29%), Positives = 86/183 (46%)
Query: 22 LVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
LVSI G N ++DCG +ND F D S + ++ +D V++SH H GALPY
Sbjct: 68 LVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHFHLDHCGALPYF 127
Query: 75 MKQLGLSAPVFSTEPVYRLG--LLTMYDQY-LSRRSVTRLTYSQNY----------HLSG 121
+ +G P++ T P + LL Y + + ++ SQ HL
Sbjct: 128 SEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDCMKKVVAVHLHQ 187
Query: 122 K---GEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAV 178
+ + + + AGH+LG +++I E V+Y DYN ++HL ++ RP +
Sbjct: 188 TVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAAWIDK-CRPNL 246
Query: 179 LIT 181
LIT
Sbjct: 247 LIT 249
>WB|WBGene00017608
symbol:F19F10.12 species:6239 "Caenorhabditis elegans"
[GO:0009792 "embryo development ending in birth or egg hatching"
evidence=IMP] InterPro:IPR027074 GO:GO:0009792 eggNOG:COG1236
InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 KO:K13146
PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445 EMBL:FO080914
RefSeq:NP_504953.1 ProteinModelPortal:Q95ZM2 PaxDb:Q95ZM2
EnsemblMetazoa:F19F10.12 GeneID:179142 KEGG:cel:CELE_F19F10.12
UCSC:F19F10.12 CTD:179142 WormBase:F19F10.12 HOGENOM:HOG000199610
InParanoid:Q95ZM2 OMA:EFMERIE NextBio:904092 Uniprot:Q95ZM2
Length = 646
Score = 151 (58.2 bits), Expect = 9.2e-10, Sum P(2) = 9.2e-10
Identities = 75/302 (24%), Positives = 122/302 (40%)
Query: 83 PVFSTEPVYRLGLLTMYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVW 142
P F PV T D + V L+++Q L I V P V+GH G W
Sbjct: 162 PPFQN-PVEWRPYYTTTDMHSCLAKVITLSFNQTIDLFR----IKVTPVVSGHTYGSAYW 216
Query: 143 KITKDGEDVIY--AVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQ 200
I + E Y A + + K + L + +L+T + + L + ++
Sbjct: 217 TIKTENEQFAYLSASNPSATDVKLMETAPLRAVDH--ILVT-SLSRLVDTTAKEMGYSLI 273
Query: 201 DAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHS---LNYPIYFLTYVSSSTIDYV 257
I+ L+ G+VLLP+ G + E++ + D + L+ PIYF++ V+ S I
Sbjct: 274 KTITDVLKKHGSVLLPICPVGPIFEMIEAVSDIITTTNGIPLDTPIYFISPVAKSAIAMA 333
Query: 258 KSFLEWMGDSITKSF---ETSRDNAFLLKHVTLLINKS---ELDNAPDGPKLVLASMASL 311
EWM +S + E ++ L+K + I S P ++ AS ASL
Sbjct: 334 SISAEWMSESRQNAVYLPEEPYSHSNLIKSGRVKIYDSLYGSFSKEFKTPCVIFASHASL 393
Query: 312 EAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKAVKVTMSRRVPLVG-EEL 370
G + + SD KN V+ T+ R + P K + + M R+ E L
Sbjct: 394 RIGDAAHMVEVLGSDPKNAVIVTDPDLPCEDVREPFRNLPIKFINIPMDFRMDFASLERL 453
Query: 371 IA 372
+A
Sbjct: 454 LA 455
Score = 74 (31.1 bits), Expect = 9.2e-10, Sum P(2) = 9.2e-10
Identities = 20/57 (35%), Positives = 35/57 (61%)
Query: 54 TIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD--QYLSRRSV 108
TIDA+L+S+ ++ +G LP+ + G S ++ TE Y+ G L M + +++SR V
Sbjct: 89 TIDAILVSNYESF-VG-LPFYTEGSGFSGKIYVTEIAYQYGKLLMEEMLEFISRIEV 143
>UNIPROTKB|H7BYQ6
symbol:INTS9 "Integrator complex subunit 9" species:9606
"Homo sapiens" [GO:0016180 "snRNA processing" evidence=IEA]
[GO:0032039 "integrator complex" evidence=IEA] InterPro:IPR027074
InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 PANTHER:PTHR11203:SF2
EMBL:AC040975 EMBL:AC131969 HGNC:HGNC:25592 ChiTaRS:INTS9
ProteinModelPortal:H7BYQ6 Ensembl:ENST00000397363 Bgee:H7BYQ6
Uniprot:H7BYQ6
Length = 552
Score = 157 (60.3 bits), Expect = 1.1e-09, Sum P(2) = 1.1e-09
Identities = 68/251 (27%), Positives = 108/251 (43%)
Query: 97 TMYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYAVD 156
TM + + + + YSQ L G + V P +G+ LG + W I E V Y V
Sbjct: 77 TMQEVNSALSKIQLVGYSQKIELFG---AVQVTPLSSGYALGSSNWIIQSHYEKVSY-VS 132
Query: 157 YNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLP 216
+ H S VL+ + P F ++ T+R GGNVL+P
Sbjct: 133 GSSLLTTHPQPMDQASLKNSDVLVLTGLTQIPTANPDGMVGEFCSNLALTVRNGGNVLVP 192
Query: 217 VDSAGRVLELLLILEDYWAEHSLN-YPIYFLTYVSSSTIDYVKSFLEWM-GDSITKSF-- 272
+G + +LL L Y L+ P+YF++ V++S++++ + F EW+ + +K +
Sbjct: 193 CYPSGVIYDLLECLYQYIDSAGLSSVPLYFISPVANSSLEFSQIFAEWLCHNKQSKVYLP 252
Query: 273 ETSRDNAFL-----LKHVTLLINKSELDNAPDGPKLVLASMASLEAGFSHDI--FVE-WA 324
E +A L LKH + + N P +V SL G D+ F+E W
Sbjct: 253 EPPFPHAELIQTNKLKHYPSI--HGDFSNDFRQPCVVFTGHPSLRFG---DVVHFMELWG 307
Query: 325 SDVKNLVLFTE 335
N V+FTE
Sbjct: 308 KSSLNTVIFTE 318
Score = 65 (27.9 bits), Expect = 1.1e-09, Sum P(2) = 1.1e-09
Identities = 12/29 (41%), Positives = 18/29 (62%)
Query: 70 ALPYAMKQLGLSAPVFSTEPVYRLGLLTM 98
ALPY + G + V++TEP ++G L M
Sbjct: 3 ALPYITEHTGFTGTVYATEPTVQIGRLLM 31
>TIGR_CMR|DET_1061
symbol:DET_1061 "metallo-beta-lactamase family protein"
species:243164 "Dehalococcoides ethenogenes 195" [GO:0003824
"catalytic activity" evidence=ISS] [GO:0008152 "metabolic process"
evidence=ISS] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 GO:GO:0016787 EMBL:CP000027 GenomeReviews:CP000027_GR
eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 OMA:MAVEYMS HOGENOM:HOG000244774 KO:K07576
RefSeq:YP_181776.1 ProteinModelPortal:Q3Z7M3 STRING:Q3Z7M3
GeneID:3229629 KEGG:det:DET1061 PATRIC:21609167
ProtClustDB:CLSK2516599 BioCyc:DETH243164:GJNF-1062-MONOMER
Uniprot:Q3Z7M3
Length = 468
Score = 115 (45.5 bits), Expect = 3.7e-09, Sum P(2) = 3.7e-09
Identities = 28/99 (28%), Positives = 47/99 (47%)
Query: 4 SVQVTPLSGVFNENPLSYLVSIDGFNFLIDCG-WNDH-FDPSLLQPLSKVASTIDAVLLS 61
S+++ L N YL+ D L+DCG + + QP ++ AV++S
Sbjct: 2 SIEIQFLGAARNVTGSRYLIKTDHTQLLVDCGLYQERRLQDRNWQPFEIPPQSLSAVIIS 61
Query: 62 HPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD 100
H H G LP +K+ G + PVF+TE + +++ D
Sbjct: 62 HAHIDHCGLLPKLVKE-GFAGPVFATEATAEIARISLTD 99
Score = 102 (41.0 bits), Expect = 3.7e-09, Sum P(2) = 3.7e-09
Identities = 60/261 (22%), Positives = 106/261 (40%)
Query: 124 EGIVVAPHVAGHLLGGTV--WKITKDGED--VIYAVDYNRRKEKHLNGTVLESFVRPAVL 179
E I H AGH+ G KI ++ ++++ D L L + V+
Sbjct: 155 EDITATFHNAGHVFGSASIELKIQENHRQKVIVFSGDLGNWDRPILKNPDLVNQA-DYVV 213
Query: 180 ITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL 239
I Y +Q + + I++T++ GGN+++P + R +LL L + +E +
Sbjct: 214 IESTYGDRTHQDINEASLKLAEIINQTVKLGGNIVIPSFALERTQDLLFFLNRFMSEGKI 273
Query: 240 NYPIYFLTYVSSSTIDYVKSFLEW--MGDSITKSFETSRDNAFLLK--HVTLLINKSELD 295
P + S I K F E + D T + + + F + H T S+
Sbjct: 274 --PSLKVFVDSPMAISITKIFKEHPELYDRETSGWVNNGSSPFEFEGLHFTNKAADSKAI 331
Query: 296 NAPDGPKLVLASMASLEAG-FSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 354
A P +++A G H + V S ++ +LF GTL R++ D K
Sbjct: 332 LAEKDPCIIIAGSGMCTGGRIKHHL-VNNISRPESTILFVGFQATGTLGRLI-TDGA-KE 388
Query: 355 VKVTMSRRVPLVG--EELIAY 373
V++ + + P+ EEL A+
Sbjct: 389 VRI-LGQHYPVQARIEELRAF 408
>FB|FBgn0036570
symbol:IntS9 "Integrator 9" species:7227 "Drosophila
melanogaster" [GO:0005847 "mRNA cleavage and polyadenylation
specificity factor complex" evidence=ISS] [GO:0006379 "mRNA
cleavage" evidence=ISS] [GO:0006378 "mRNA polyadenylation"
evidence=ISS] [GO:0034472 "snRNA 3'-end processing" evidence=IDA]
[GO:0032039 "integrator complex" evidence=ISS] [GO:0016180 "snRNA
processing" evidence=ISS] InterPro:IPR027074 EMBL:AE014296
GO:GO:0006378 GO:GO:0005847 GO:GO:0006379 InterPro:IPR022712
Pfam:PF10996 SMART:SM01027 CTD:55756 KO:K13146
PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445 OMA:PLAMKCV
GO:GO:0034472 EMBL:AY058574 RefSeq:NP_648838.3 UniGene:Dm.977
SMR:Q95TS5 IntAct:Q95TS5 MINT:MINT-1734573
EnsemblMetazoa:FBtr0075495 GeneID:39763 KEGG:dme:Dmel_CG5222
UCSC:CG5222-RA FlyBase:FBgn0036570 InParanoid:Q95TS5
OrthoDB:EOG4FJ6QV GenomeRNAi:39763 NextBio:815254 Uniprot:Q95TS5
Length = 654
Score = 129 (50.5 bits), Expect = 1.6e-08, Sum P(2) = 1.6e-08
Identities = 61/254 (24%), Positives = 110/254 (43%)
Query: 95 LLTMYDQYLSRRSVTRLTYSQNYHLSGKGEGIVVAPHVAGHLLGGTVWKITKDGEDVIYA 154
+ ++ D S VT + Y + + G + P +G+ LG + W ++ E + Y
Sbjct: 180 IFSLKDVQGSLSKVTIMGYDEKLDILG---AFIATPVSSGYCLGSSNWVLSTAHEKICY- 235
Query: 155 VDYNRRKEKHLNGTVLESFVRPA-VLI-TDAYNALHNQPPRQQREMFQDAISKTLRAGGN 212
V + H + +S ++ A VLI T A P + E+ + ++ T+R G+
Sbjct: 236 VSGSSTLTTHPR-PINQSALKHADVLIMTGLTQAPTVNPDTKLGELCMN-VALTIRNNGS 293
Query: 213 VLLPVDSAGRVLELLLILEDYWAEHSLN-YPIYFLTYVSSSTIDYVKSFLEWMGDSITKS 271
L+P +G V +L L LN P++F++ V+ S++ Y EW+ +
Sbjct: 294 ALIPCYPSGVVYDLFECLTQNLENAGLNNVPMFFISPVADSSLAYSNILAEWLSSAKQNK 353
Query: 272 FETSRD---NAFLL-----KHVTLLINKSELDNAPDGPKLVLASMASLEAGFS-HDIFVE 322
D +AF L KH + ++ + P +V SL G + H F+E
Sbjct: 354 VYLPDDPFPHAFYLRNNKLKHYNHVFSEGFSKDFRQ-PCVVFCGHPSLRFGDAVH--FIE 410
Query: 323 -WASDVKNLVLFTE 335
W ++ N ++FTE
Sbjct: 411 MWGNNPNNSIIFTE 424
Score = 85 (35.0 bits), Expect = 1.6e-08, Sum P(2) = 1.6e-08
Identities = 27/97 (27%), Positives = 47/97 (48%)
Query: 31 LIDCGWNDHFD--PSLLQPLSKVA--STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFS 86
L DC D P P+ K+ S +D +L+S+ L++ ALPY + G V++
Sbjct: 69 LKDCCGRVFVDSTPEFNLPMDKMLDFSEVDVILISN--YLNMLALPYITENTGFKGKVYA 126
Query: 87 TEPVYRLGLLTMYD--QYL--SRRSVTRLTYSQNYHL 119
TEP ++G + + Y+ S ++ T + + HL
Sbjct: 127 TEPTLQIGRFFLEELVDYIEVSPKACTARLWKEKLHL 163
Score = 45 (20.9 bits), Expect = 0.00020, Sum P(2) = 0.00020
Identities = 10/33 (30%), Positives = 17/33 (51%)
Query: 21 YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVAS 53
Y+++ G ++DCG + + L PL V S
Sbjct: 15 YIITFKGLRIMLDCGLTEQTVLNFL-PLPFVQS 46
>UNIPROTKB|E9PNS4
symbol:CPSF3L "Integrator complex subunit 11" species:9606
"Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 GO:GO:0016787
EMBL:AL139287 HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI00984775
ProteinModelPortal:E9PNS4 SMR:E9PNS4 Ensembl:ENST00000528879
ArrayExpress:E9PNS4 Bgee:E9PNS4 Uniprot:E9PNS4
Length = 278
Score = 157 (60.3 bits), Expect = 1.7e-08, P = 1.7e-08
Identities = 49/188 (26%), Positives = 86/188 (45%)
Query: 41 DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLG--LLTM 98
D S + ++ +D V++SH H GALPY + +G P++ T P + LL
Sbjct: 47 DFSYITQNGRLTDFLDCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLED 106
Query: 99 YDQY-LSRRSVTRLTYSQNY----------HLSGK---GEGIVVAPHVAGHLLGGTVWKI 144
Y + + ++ SQ HL + + + + AGH+LG +++I
Sbjct: 107 YRKIAVDKKGEANFFTSQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQI 166
Query: 145 TKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAYNALHNQPPRQQREM-FQDAI 203
E V+Y DYN ++HL ++ RP +LIT++ A + ++ RE F +
Sbjct: 167 KVGSESVVYTGDYNMTPDRHLGAAWIDK-CRPNLLITESTYATTIRDSKRCRERDFLKKV 225
Query: 204 SKTLRAGG 211
+T+ GG
Sbjct: 226 HETVERGG 233
Score = 127 (49.8 bits), Expect = 4.4e-05, P = 4.4e-05
Identities = 33/103 (32%), Positives = 52/103 (50%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDA 57
++VTPL + LVSI G N ++DCG +ND F D S + ++ +D
Sbjct: 4 IRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDC 63
Query: 58 VLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD 100
V++SH H GALPY + +G P++ T P + + + D
Sbjct: 64 VIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLED 106
>TIGR_CMR|CHY_2049
symbol:CHY_2049 "metallo-beta-lactamase family protein"
species:246194 "Carboxydothermus hydrogenoformans Z-2901"
[GO:0003824 "catalytic activity" evidence=ISS] [GO:0008152
"metabolic process" evidence=ISS] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 EMBL:CP000141 GenomeReviews:CP000141_GR
GO:GO:0016787 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
Pfam:PF10996 SMART:SM01027 RefSeq:YP_360868.1
ProteinModelPortal:Q3AAG6 STRING:Q3AAG6 GeneID:3728507
KEGG:chy:CHY_2049 PATRIC:21277179 HOGENOM:HOG000244774 KO:K07576
OMA:GGRIVHH BioCyc:CHYD246194:GJCN-2048-MONOMER Uniprot:Q3AAG6
Length = 504
Score = 86 (35.3 bits), Expect = 1.8e-08, Sum P(3) = 1.8e-08
Identities = 31/113 (27%), Positives = 59/113 (52%)
Query: 501 ILD-AKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVA--PLKLVLVHGSAEA 557
+LD AK K++ E+ V+ + + + AD R + + + P ++ LVHG EA
Sbjct: 377 LLDGAKEVKIMGEEIAVKAE-VYHYDGLSAHADQRELLAFIGRFSQKPAQIYLVHGEDEA 435
Query: 558 TEHLKQHCL-KHVCPHVYTPQIEETIDVTSDLCAYKVQ-LSEKLMSNVLFKKL 608
+LK+ K+ P Y P+ +ETI + ++L + L +K+++ + K+L
Sbjct: 436 RLNLKKLIEEKYRIP-CYLPRYQETISLLANLPGKSEEVLIDKVITLLKAKQL 487
Score = 85 (35.0 bits), Expect = 1.8e-08, Sum P(3) = 1.8e-08
Identities = 33/145 (22%), Positives = 61/145 (42%)
Query: 125 GIVVAPHVAGHLLGGTVWKITKDGED----VIYAVDYNRRKEKHLNGTVLESFVRPAVLI 180
G+ V AGH+LG + KI G+D +++ D R + + +L+
Sbjct: 152 GLEVTFFDAGHILGSAMIKIAYKGQDATRTILFTGDLGRNGRPFMKEP--QKVPLTDILV 209
Query: 181 TDA-YNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL 239
++ Y + + + I K R GN+++P + R +L+ IL D E+
Sbjct: 210 LESTYGDRVRSEEGDLKTLLKSLIEKVYRRNGNLIIPAFAMERTQDLIYILNDL-VENKE 268
Query: 240 NYPIYFLTYVSSS-TIDYVKSFLEW 263
PI Y+ S ++ K F ++
Sbjct: 269 VPPID--VYIDSPLAVEITKLFKKY 291
Score = 84 (34.6 bits), Expect = 1.8e-08, Sum P(3) = 1.8e-08
Identities = 28/110 (25%), Positives = 50/110 (45%)
Query: 21 YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVA-----STIDAVLLSHPDTLHLGALPYAM 75
YL ++ G FL+DCG P ++ + I+ +LL+H H G +P +
Sbjct: 17 YLFNVAGHKFLVDCGLFQ--GPKAIKERNYGEFPFNPREIEFILLTHAHIDHSGLIPKLV 74
Query: 76 KQLGLSAPVFSTEPVYRLGLLTMYDQ-YLSRRSVTRLTYSQNYHLSGKGE 124
K+ G +++TEP L + + D ++ V R ++ +GK E
Sbjct: 75 KK-GFKGTIYATEPTVDLAAVMLPDSGHVQEMEVERK--NRKLRRAGKPE 121
>UNIPROTKB|Q81SC3
symbol:BA_1737 "Metallo-beta-lactamase family protein"
species:1392 "Bacillus anthracis" [GO:0008150 "biological_process"
evidence=ND] InterPro:IPR001279 SMART:SM00849 Pfam:PF07521
EMBL:AE016879 EMBL:AE017334 GenomeReviews:AE016879_GR
GenomeReviews:AE017334_GR GO:GO:0016787 InterPro:IPR022712
InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 RefSeq:NP_844172.1
RefSeq:YP_018378.1 ProteinModelPortal:Q81SC3 IntAct:Q81SC3
DNASU:1086535 EnsemblBacteria:EBBACT00000009201
EnsemblBacteria:EBBACT00000014472 GeneID:1086535 GeneID:2817971
KEGG:ban:BA_1737 KEGG:bar:GBAA_1737 PATRIC:18781074
HOGENOM:HOG000087450 OMA:SQHERVN ProtClustDB:CLSK2516952
BioCyc:BANT261594:GJ7F-1754-MONOMER Uniprot:Q81SC3
Length = 419
Score = 142 (55.0 bits), Expect = 7.7e-08, Sum P(2) = 7.7e-08
Identities = 91/404 (22%), Positives = 170/404 (42%)
Query: 21 YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL 80
Y V L DCG N ++ S + +V ++AV LSH H LP K G
Sbjct: 17 YFVKNKETKILFDCGINRSYEDSYPKIEREVVPFLEAVFLSHIHEDHTMGLPLLAKY-GY 75
Query: 81 SAPVFSTEPVYRLGLLTMYDQYLSRRSVTR---LTYSQ------NY----HLSGKGEGIV 127
+++T Y L Y + +VT+ + Y+ NY +S E I
Sbjct: 76 KKKIWTTR--YTKEQLPAYYEKWRNYNVTQGWNVPYNDQNVKDLNYIYVDEISNPNEWIQ 133
Query: 128 VAPHV------AGHLLGGTVWKITKDGED-VIYAVDYNRRKEKHLNGTVLESFVRPAVLI 180
+ P + +GH+LG +VW + V Y+ DY+ E ++ L +R + +
Sbjct: 134 ITPTLRFQWGYSGHVLG-SVWFLVDMSHTYVFYSGDYSA--ESNILRANLPEKLRGDIKV 190
Query: 181 TDAYNALHNQPPRQQREMFQDAISKTLRAGGN---VLLPVDSAGRVLELLLILEDYWAEH 237
A H QRE + ++ RA GN LLP+ GR +++L L + + E
Sbjct: 191 AIVDAAYHTDDV-SQRERVNELCTEIERAAGNKGIALLPLPPLGRAQDIVLYLYEKYKE- 248
Query: 238 SLNYPIYFLTYVSSSTID-YVKSFL--EWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 294
+PI V +D + + FL +W+ ++ K E ++ L +++ ++ +
Sbjct: 249 ---FPII----VDQEILDGFDEMFLYKDWIKNN--KELEELMES--LKRNIIVMDDDGGT 297
Query: 295 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 354
++ +V+ S A+++ + + + + +N ++FT G+ A + + K
Sbjct: 298 QHSCG---IVVMSDANMQTKRAQLYYEQIRHEERNSIIFTGHVAKGSFAEKVLKERIGKE 354
Query: 355 VKVTMSRRVPL-VGEELIAYEEEQTRLKKEEALKASLVKEEESK 397
+V +RVP V + + +E L E + +KE+ +
Sbjct: 355 CRV---KRVPYKVHQSIRDVKEMLNTLLPEHTVLVHALKEDTDR 395
Score = 59 (25.8 bits), Expect = 7.7e-08, Sum P(2) = 7.7e-08
Identities = 18/66 (27%), Positives = 33/66 (50%)
Query: 519 KCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCLKHVCPHVYTPQI 578
+C + + Y+ R +K +L+ + P VLVH E T+ L++ +VY+ +
Sbjct: 354 ECRVKRVPYKVHQSIRDVKEMLNTLLPEHTVLVHALKEDTDRLQKKLSTAGYENVYSLTM 413
Query: 579 EETIDV 584
E I+V
Sbjct: 414 ER-IEV 418
>TIGR_CMR|BA_1737
symbol:BA_1737 "metallo-beta-lactamase family protein"
species:198094 "Bacillus anthracis str. Ames" [GO:0003824
"catalytic activity" evidence=ISS] [GO:0008150 "biological_process"
evidence=ND] InterPro:IPR001279 SMART:SM00849 Pfam:PF07521
EMBL:AE016879 EMBL:AE017334 GenomeReviews:AE016879_GR
GenomeReviews:AE017334_GR GO:GO:0016787 InterPro:IPR022712
InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 RefSeq:NP_844172.1
RefSeq:YP_018378.1 ProteinModelPortal:Q81SC3 IntAct:Q81SC3
DNASU:1086535 EnsemblBacteria:EBBACT00000009201
EnsemblBacteria:EBBACT00000014472 GeneID:1086535 GeneID:2817971
KEGG:ban:BA_1737 KEGG:bar:GBAA_1737 PATRIC:18781074
HOGENOM:HOG000087450 OMA:SQHERVN ProtClustDB:CLSK2516952
BioCyc:BANT261594:GJ7F-1754-MONOMER Uniprot:Q81SC3
Length = 419
Score = 142 (55.0 bits), Expect = 7.7e-08, Sum P(2) = 7.7e-08
Identities = 91/404 (22%), Positives = 170/404 (42%)
Query: 21 YLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYAMKQLGL 80
Y V L DCG N ++ S + +V ++AV LSH H LP K G
Sbjct: 17 YFVKNKETKILFDCGINRSYEDSYPKIEREVVPFLEAVFLSHIHEDHTMGLPLLAKY-GY 75
Query: 81 SAPVFSTEPVYRLGLLTMYDQYLSRRSVTR---LTYSQ------NY----HLSGKGEGIV 127
+++T Y L Y + +VT+ + Y+ NY +S E I
Sbjct: 76 KKKIWTTR--YTKEQLPAYYEKWRNYNVTQGWNVPYNDQNVKDLNYIYVDEISNPNEWIQ 133
Query: 128 VAPHV------AGHLLGGTVWKITKDGED-VIYAVDYNRRKEKHLNGTVLESFVRPAVLI 180
+ P + +GH+LG +VW + V Y+ DY+ E ++ L +R + +
Sbjct: 134 ITPTLRFQWGYSGHVLG-SVWFLVDMSHTYVFYSGDYSA--ESNILRANLPEKLRGDIKV 190
Query: 181 TDAYNALHNQPPRQQREMFQDAISKTLRAGGN---VLLPVDSAGRVLELLLILEDYWAEH 237
A H QRE + ++ RA GN LLP+ GR +++L L + + E
Sbjct: 191 AIVDAAYHTDDV-SQRERVNELCTEIERAAGNKGIALLPLPPLGRAQDIVLYLYEKYKE- 248
Query: 238 SLNYPIYFLTYVSSSTID-YVKSFL--EWMGDSITKSFETSRDNAFLLKHVTLLINKSEL 294
+PI V +D + + FL +W+ ++ K E ++ L +++ ++ +
Sbjct: 249 ---FPII----VDQEILDGFDEMFLYKDWIKNN--KELEELMES--LKRNIIVMDDDGGT 297
Query: 295 DNAPDGPKLVLASMASLEAGFSHDIFVEWASDVKNLVLFTERGQFGTLARMLQADPPPKA 354
++ +V+ S A+++ + + + + +N ++FT G+ A + + K
Sbjct: 298 QHSCG---IVVMSDANMQTKRAQLYYEQIRHEERNSIIFTGHVAKGSFAEKVLKERIGKE 354
Query: 355 VKVTMSRRVPL-VGEELIAYEEEQTRLKKEEALKASLVKEEESK 397
+V +RVP V + + +E L E + +KE+ +
Sbjct: 355 CRV---KRVPYKVHQSIRDVKEMLNTLLPEHTVLVHALKEDTDR 395
Score = 59 (25.8 bits), Expect = 7.7e-08, Sum P(2) = 7.7e-08
Identities = 18/66 (27%), Positives = 33/66 (50%)
Query: 519 KCLLIFIDYEGRADGRSIKTILSHVAPLKLVLVHGSAEATEHLKQHCLKHVCPHVYTPQI 578
+C + + Y+ R +K +L+ + P VLVH E T+ L++ +VY+ +
Sbjct: 354 ECRVKRVPYKVHQSIRDVKEMLNTLLPEHTVLVHALKEDTDRLQKKLSTAGYENVYSLTM 413
Query: 579 EETIDV 584
E I+V
Sbjct: 414 ER-IEV 418
>UNIPROTKB|G3V5T3
symbol:CPSF2 "Cleavage and polyadenylation-specificity
factor subunit 2" species:9606 "Homo sapiens" [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=IEA] [GO:0006378 "mRNA polyadenylation" evidence=IEA]
[GO:0006379 "mRNA cleavage" evidence=IEA] InterPro:IPR027075
PANTHER:PTHR11203:SF5 HGNC:HGNC:2325 ChiTaRS:CPSF2 EMBL:AL121773
ProteinModelPortal:G3V5T3 SMR:G3V5T3 Ensembl:ENST00000554290
ArrayExpress:G3V5T3 Bgee:G3V5T3 Uniprot:G3V5T3
Length = 62
Score = 132 (51.5 bits), Expect = 1.2e-07, P = 1.2e-07
Identities = 25/61 (40%), Positives = 39/61 (63%)
Query: 1 MGTSVQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLL 60
M + +++T LSGV E+ L YL+ +D F FL+DCGW++HF ++ L + TI +L
Sbjct: 1 MTSIIKLTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFSMDIIDSLRNL-DTIQKILH 59
Query: 61 S 61
S
Sbjct: 60 S 60
>UNIPROTKB|E9PIL7
symbol:CPSF3L "Integrator complex subunit 11" species:9606
"Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 GO:GO:0016787 EMBL:AL139287
HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI00977321
ProteinModelPortal:E9PIL7 SMR:E9PIL7 Ensembl:ENST00000534345
ArrayExpress:E9PIL7 Bgee:E9PIL7 Uniprot:E9PIL7
Length = 140
Score = 130 (50.8 bits), Expect = 2.0e-07, P = 2.0e-07
Identities = 35/104 (33%), Positives = 54/104 (51%)
Query: 5 VQVTPLSGVFNENPLS-YLVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTID 56
++VTPL G + S LVSI G N ++DCG +ND F D S + ++ +D
Sbjct: 4 IRVTPLVGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLD 63
Query: 57 AVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD 100
V++SH H GALPY + +G P++ T P + + + D
Sbjct: 64 CVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLED 107
>UNIPROTKB|E5RG70
symbol:INTS9 "Integrator complex subunit 9" species:9606
"Homo sapiens" [GO:0016180 "snRNA processing" evidence=IEA]
[GO:0032039 "integrator complex" evidence=IEA] InterPro:IPR027074
PANTHER:PTHR11203:SF2 EMBL:AC040975 EMBL:AC131969 HGNC:HGNC:25592
ChiTaRS:INTS9 IPI:IPI00974179 ProteinModelPortal:E5RG70 SMR:E5RG70
Ensembl:ENST00000523436 ArrayExpress:E5RG70 Bgee:E5RG70
Uniprot:E5RG70
Length = 300
Score = 138 (53.6 bits), Expect = 3.1e-06, P = 3.1e-06
Identities = 51/213 (23%), Positives = 105/213 (49%)
Query: 53 STIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYDQYLSRRSVTRLT 112
ST+D +L+S+ + ALPY + G + V++TEP ++G L M ++ ++ + R+
Sbjct: 94 STVDVILISNYHCMM--ALPYITEHTGFTGTVYATEPTVQIGRLLM-EELVN--FIERVP 148
Query: 113 YSQNYHLSGKGEGIV-VAPHVAGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLE 171
+Q+ L K + I + P + + W+ ++V A+ + + +
Sbjct: 149 KAQSASL-WKNKDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQKIPMDQ 207
Query: 172 SFVRPA-VLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLIL 230
+ ++ + VL+ + P F ++ T+R GGNVL+P +G + +LL L
Sbjct: 208 ASLKNSDVLVLTGLTQIPTANPDGMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECL 267
Query: 231 EDYWAEHSLN-YPIYFLTYVSSSTIDYVKSFLE 262
Y L+ P+YF++ V++S++++ + F E
Sbjct: 268 YQYIDSAGLSSVPLYFISPVANSSLEFSQIFAE 300
>UNIPROTKB|Q8EJC6
symbol:SO_0541 "RNA-metabolizing metallo-beta-lactamase
family protein" species:211586 "Shewanella oneidensis MR-1"
[GO:0008150 "biological_process" evidence=ND] InterPro:IPR001279
Pfam:PF00753 SMART:SM00849 Pfam:PF07521 GO:GO:0016787 EMBL:AE014299
GenomeReviews:AE014299_GR InterPro:IPR022712 InterPro:IPR011108
Pfam:PF10996 SMART:SM01027 OMA:MAVEYMS HOGENOM:HOG000244774
KO:K07576 RefSeq:NP_716177.2 ProteinModelPortal:Q8EJC6
DNASU:1168409 GeneID:1168409 KEGG:son:SO_0541 PATRIC:23520762
ProtClustDB:CLSK2516780 Uniprot:Q8EJC6
Length = 480
Score = 98 (39.6 bits), Expect = 1.9e-05, Sum P(3) = 1.9e-05
Identities = 39/130 (30%), Positives = 60/130 (46%)
Query: 123 GEGIVVAPHV------AGHLLGGTVWKIT----KDGEDVIYAVDYNRRKEKHL-NGTVLE 171
G+ V PHV AGH+LG + ++ K + ++++ D R L N T+++
Sbjct: 146 GQVTRVIPHVDICLSDAGHILGSALVELWLGEGKSQKKIVFSGDLGRAGMPILQNPTLVD 205
Query: 172 SFVRPAVLITDAY-NALHNQPPRQQREMFQDAISKTLRAG-GNVLLPVDSAGRVLELLLI 229
+ VL+ Y N H E+ +D +KT+ GN+LLP S GR ELL +
Sbjct: 206 T--ADLVLMESTYGNRFHRSWTDTLAEL-KDIFAKTVNESQGNILLPAFSVGRAQELLYL 262
Query: 230 LEDYWAEHSL 239
Y E L
Sbjct: 263 FHLYAKEWDL 272
Score = 83 (34.3 bits), Expect = 1.9e-05, Sum P(3) = 1.9e-05
Identities = 29/102 (28%), Positives = 44/102 (43%)
Query: 21 YLVSIDGFNFLIDCGWNDHFDPSLL---QPLSKVASTIDAVLLSHPDTLHLGALPYAMKQ 77
+LV++ G + L+DCG L +P TI AV+LSH H G LP +K
Sbjct: 19 HLVTVAGKHLLLDCGLIQGGKADELRNHEPFVFDPQTIVAVVLSHAHIDHSGRLPLLVKA 78
Query: 78 LGLSAPVFSTEPVYRLGLLTMYDQ-YLSRRSVTRLTYSQNYH 118
G P+++ + L + + D L R R + H
Sbjct: 79 -GFDGPIYTHKATAELCAIMLKDAAMLQVRDTERTNKKRAKH 119
Score = 42 (19.8 bits), Expect = 1.9e-05, Sum P(3) = 1.9e-05
Identities = 18/64 (28%), Positives = 25/64 (39%)
Query: 499 SLILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLK-LVLVHGSAEA 557
+L+ AK + N + V K + AD + H LVLVHG EA
Sbjct: 381 ALVDGAKELTIHGNSVNVAAKLHTVG-GLSAHADQAELLRWYRHFEEQPPLVLVHGEPEA 439
Query: 558 TEHL 561
+ L
Sbjct: 440 QQGL 443
>TIGR_CMR|SO_0541
symbol:SO_0541 "metallo-beta-lactamase family protein"
species:211586 "Shewanella oneidensis MR-1" [GO:0008150
"biological_process" evidence=ND] [GO:0003824 "catalytic activity"
evidence=ISS] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 GO:GO:0016787 EMBL:AE014299 GenomeReviews:AE014299_GR
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
OMA:MAVEYMS HOGENOM:HOG000244774 KO:K07576 RefSeq:NP_716177.2
ProteinModelPortal:Q8EJC6 DNASU:1168409 GeneID:1168409
KEGG:son:SO_0541 PATRIC:23520762 ProtClustDB:CLSK2516780
Uniprot:Q8EJC6
Length = 480
Score = 98 (39.6 bits), Expect = 1.9e-05, Sum P(3) = 1.9e-05
Identities = 39/130 (30%), Positives = 60/130 (46%)
Query: 123 GEGIVVAPHV------AGHLLGGTVWKIT----KDGEDVIYAVDYNRRKEKHL-NGTVLE 171
G+ V PHV AGH+LG + ++ K + ++++ D R L N T+++
Sbjct: 146 GQVTRVIPHVDICLSDAGHILGSALVELWLGEGKSQKKIVFSGDLGRAGMPILQNPTLVD 205
Query: 172 SFVRPAVLITDAY-NALHNQPPRQQREMFQDAISKTLRAG-GNVLLPVDSAGRVLELLLI 229
+ VL+ Y N H E+ +D +KT+ GN+LLP S GR ELL +
Sbjct: 206 T--ADLVLMESTYGNRFHRSWTDTLAEL-KDIFAKTVNESQGNILLPAFSVGRAQELLYL 262
Query: 230 LEDYWAEHSL 239
Y E L
Sbjct: 263 FHLYAKEWDL 272
Score = 83 (34.3 bits), Expect = 1.9e-05, Sum P(3) = 1.9e-05
Identities = 29/102 (28%), Positives = 44/102 (43%)
Query: 21 YLVSIDGFNFLIDCGWNDHFDPSLL---QPLSKVASTIDAVLLSHPDTLHLGALPYAMKQ 77
+LV++ G + L+DCG L +P TI AV+LSH H G LP +K
Sbjct: 19 HLVTVAGKHLLLDCGLIQGGKADELRNHEPFVFDPQTIVAVVLSHAHIDHSGRLPLLVKA 78
Query: 78 LGLSAPVFSTEPVYRLGLLTMYDQ-YLSRRSVTRLTYSQNYH 118
G P+++ + L + + D L R R + H
Sbjct: 79 -GFDGPIYTHKATAELCAIMLKDAAMLQVRDTERTNKKRAKH 119
Score = 42 (19.8 bits), Expect = 1.9e-05, Sum P(3) = 1.9e-05
Identities = 18/64 (28%), Positives = 25/64 (39%)
Query: 499 SLILDAKPSKVVSNELTVQVKCLLIFIDYEGRADGRSIKTILSHVAPLK-LVLVHGSAEA 557
+L+ AK + N + V K + AD + H LVLVHG EA
Sbjct: 381 ALVDGAKELTIHGNSVNVAAKLHTVG-GLSAHADQAELLRWYRHFEEQPPLVLVHGEPEA 439
Query: 558 TEHL 561
+ L
Sbjct: 440 QQGL 443
>UNIPROTKB|Q9KV92
symbol:VC_0264 "Putative uncharacterized protein"
species:243277 "Vibrio cholerae O1 biovar El Tor str. N16961"
[GO:0003674 "molecular_function" evidence=ND] [GO:0005575
"cellular_component" evidence=ND] [GO:0008150 "biological_process"
evidence=ND] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 EMBL:AE003852 GenomeReviews:AE003852_GR GO:GO:0016787
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
KO:K07576 OMA:CHIDHVG PIR:F82345 RefSeq:NP_229920.1
ProteinModelPortal:Q9KV92 DNASU:2614470 GeneID:2614470
KEGG:vch:VC0264 PATRIC:20079570 ProtClustDB:CLSK2517501
Uniprot:Q9KV92
Length = 455
Score = 134 (52.2 bits), Expect = 2.0e-05, P = 2.0e-05
Identities = 85/352 (24%), Positives = 145/352 (41%)
Query: 26 DGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPY----AMKQ-LGL 80
DG LIDCG D L + +DA++L+H H+G LP+ +KQ +
Sbjct: 39 DGQALLIDCGLFQGADERPLA-VEFALGHVDALILTHAHIDHIGRLPWLLAAGLKQPIYS 97
Query: 81 SAPVFSTEPVY-RLGL---LTMYDQYLSR--RSVTRLTYSQNYHL-----SGKGEGIVVA 129
+A P+ GL L M + R V RL Q+Y + + + V
Sbjct: 98 TAATAELVPLMLEDGLKLQLGMSPKQSERVLTEVRRLLRVQDYQKWFAVQPKRADSLWVR 157
Query: 130 PHVAGHLLGGTVWKITK-DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVL-ITDAYNAL 187
AGH+LG +I + +GE V+++ D L +S R L I Y
Sbjct: 158 FQPAGHILGSAYVEIRRPNGEVVVFSGDLGPSHTPLLPDP--QSPERADYLFIETTYGDK 215
Query: 188 HNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILED--YWAEHSLNYPIYF 245
++ + + + + I ++L GG +L+P S GR ELL +E + + N PI
Sbjct: 216 QHEDVQSRGQRLRAMIERSLTDGGAILIPAFSVGRTQELLFDIEQLIFSQQIDANLPIIL 275
Query: 246 LTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSE--LDN--APDGP 301
+ ++ + F + G + R + +T+ +++ L N A G
Sbjct: 276 DSPMAQRVTRSYRRFKQLWGREAKARLQMHRHPLAFEQCITVEDHRTHERLVNRLASTGE 335
Query: 302 K-LVLASMASLEAGFSHDIFVEWASDVK-NLVLFTERGQFGTLARMLQADPP 351
+V+A+ + G D D + +L+L + + GTL R +Q+ P
Sbjct: 336 AAIVVAASGMCQGGRIMDYLKALLPDKRTDLILAGFQAE-GTLGRSIQSGQP 386
>TIGR_CMR|VC_0264
symbol:VC_0264 "conserved hypothetical protein" species:686
"Vibrio cholerae O1 biovar El Tor" [GO:0003674 "molecular_function"
evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
[GO:0008150 "biological_process" evidence=ND] InterPro:IPR001279
Pfam:PF00753 SMART:SM00849 Pfam:PF07521 EMBL:AE003852
GenomeReviews:AE003852_GR GO:GO:0016787 InterPro:IPR022712
InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 KO:K07576 OMA:CHIDHVG
PIR:F82345 RefSeq:NP_229920.1 ProteinModelPortal:Q9KV92
DNASU:2614470 GeneID:2614470 KEGG:vch:VC0264 PATRIC:20079570
ProtClustDB:CLSK2517501 Uniprot:Q9KV92
Length = 455
Score = 134 (52.2 bits), Expect = 2.0e-05, P = 2.0e-05
Identities = 85/352 (24%), Positives = 145/352 (41%)
Query: 26 DGFNFLIDCGWNDHFDPSLLQPLSKVASTIDAVLLSHPDTLHLGALPY----AMKQ-LGL 80
DG LIDCG D L + +DA++L+H H+G LP+ +KQ +
Sbjct: 39 DGQALLIDCGLFQGADERPLA-VEFALGHVDALILTHAHIDHIGRLPWLLAAGLKQPIYS 97
Query: 81 SAPVFSTEPVY-RLGL---LTMYDQYLSR--RSVTRLTYSQNYHL-----SGKGEGIVVA 129
+A P+ GL L M + R V RL Q+Y + + + V
Sbjct: 98 TAATAELVPLMLEDGLKLQLGMSPKQSERVLTEVRRLLRVQDYQKWFAVQPKRADSLWVR 157
Query: 130 PHVAGHLLGGTVWKITK-DGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVL-ITDAYNAL 187
AGH+LG +I + +GE V+++ D L +S R L I Y
Sbjct: 158 FQPAGHILGSAYVEIRRPNGEVVVFSGDLGPSHTPLLPDP--QSPERADYLFIETTYGDK 215
Query: 188 HNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILED--YWAEHSLNYPIYF 245
++ + + + + I ++L GG +L+P S GR ELL +E + + N PI
Sbjct: 216 QHEDVQSRGQRLRAMIERSLTDGGAILIPAFSVGRTQELLFDIEQLIFSQQIDANLPIIL 275
Query: 246 LTYVSSSTIDYVKSFLEWMGDSITKSFETSRDNAFLLKHVTLLINKSE--LDN--APDGP 301
+ ++ + F + G + R + +T+ +++ L N A G
Sbjct: 276 DSPMAQRVTRSYRRFKQLWGREAKARLQMHRHPLAFEQCITVEDHRTHERLVNRLASTGE 335
Query: 302 K-LVLASMASLEAGFSHDIFVEWASDVK-NLVLFTERGQFGTLARMLQADPP 351
+V+A+ + G D D + +L+L + + GTL R +Q+ P
Sbjct: 336 AAIVVAASGMCQGGRIMDYLKALLPDKRTDLILAGFQAE-GTLGRSIQSGQP 386
>TAIR|locus:2079696
symbol:AT3G07530 "AT3G07530" species:3702 "Arabidopsis
thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
"nucleus" evidence=ISM] [GO:0008150 "biological_process"
evidence=ND] InterPro:IPR027074 EMBL:CP002686 InterPro:IPR022712
Pfam:PF10996 SMART:SM01027 KO:K13146 PANTHER:PTHR11203:SF2
IPI:IPI00520313 RefSeq:NP_187409.2 UniGene:At.53215
ProteinModelPortal:F4JEH2 PRIDE:F4JEH2 EnsemblPlants:AT3G07530.1
GeneID:819942 KEGG:ath:AT3G07530 OMA:CYNGTLI Uniprot:F4JEH2
Length = 699
Score = 107 (42.7 bits), Expect = 3.4e-05, Sum P(3) = 3.4e-05
Identities = 38/138 (27%), Positives = 63/138 (45%)
Query: 209 AGGNVLLPVDSAGRVLELLLILEDYWAEHSLNYPIYFLTYVSSSTIDYVKSFLEWMGDSI 268
AGG+ L+ + G VL+LL +L + SL PI+ ++ V+ + Y + EW+ +
Sbjct: 343 AGGSTLITITRIGIVLQLLELLSNSLESSSLKVPIFVISSVAEELLAYTNTIPEWLCEQR 402
Query: 269 TK---SFETSRDNAFLLK----HVTLLINKSELDNAP----DGPKLVLASMASLEAGFSH 317
+ S E S + +K H+ I+ L A P +V AS SL G S
Sbjct: 403 QEKLISGEPSFGHLKFIKNKKIHLFPAIHSPNLIYANRTSWQEPCIVFASHWSLRLGPSV 462
Query: 318 DIFVEWASDVKNLVLFTE 335
+ W D K+L++ +
Sbjct: 463 QLLQRWRGDPKSLLVLED 480
Score = 76 (31.8 bits), Expect = 3.4e-05, Sum P(3) = 3.4e-05
Identities = 21/49 (42%), Positives = 29/49 (59%)
Query: 52 ASTIDAVLLSHPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD 100
AS ID VL+S+P L LG LP+ + G A ++ TE ++G L M D
Sbjct: 100 ASFIDIVLISNPMGL-LG-LPFLTQNPGFFAKIYMTEVTAKIGQLMMED 146
Score = 43 (20.2 bits), Expect = 3.4e-05, Sum P(3) = 3.4e-05
Identities = 7/17 (41%), Positives = 12/17 (70%)
Query: 18 PLSYLVSIDGFNFLIDC 34
P +++++ GF LIDC
Sbjct: 15 PPCHMLNLCGFRILIDC 31
>UNIPROTKB|E9PQF0
symbol:CPSF3L "Integrator complex subunit 11" species:9606
"Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 GO:GO:0016787 EMBL:AL139287
HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI00982774
ProteinModelPortal:E9PQF0 SMR:E9PQF0 Ensembl:ENST00000498476
ArrayExpress:E9PQF0 Bgee:E9PQF0 Uniprot:E9PQF0
Length = 167
Score = 116 (45.9 bits), Expect = 5.7e-05, P = 5.7e-05
Identities = 29/86 (33%), Positives = 45/86 (52%)
Query: 22 LVSIDGFNFLIDCG----WND--HF-DPSLLQPLSKVASTIDAVLLSHPDTLHLGALPYA 74
LVSI G N ++DCG +ND F D S + ++ +D V++SH H GALPY
Sbjct: 81 LVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHFHLDHCGALPYF 140
Query: 75 MKQLGLSAPVFSTEPVYRLGLLTMYD 100
+ +G P++ T P + + + D
Sbjct: 141 SEMVGYDGPIYMTHPTQAICPILLED 166
>UNIPROTKB|E2QVB2
symbol:INTS9 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0032039 "integrator complex" evidence=IEA]
[GO:0016180 "snRNA processing" evidence=IEA] InterPro:IPR027074
InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 GO:GO:0032039
GO:GO:0016180 PANTHER:PTHR11203:SF2 Ensembl:ENSCAFT00000013124
Uniprot:E2QVB2
Length = 409
Score = 127 (49.8 bits), Expect = 9.9e-05, P = 9.9e-05
Identities = 52/170 (30%), Positives = 77/170 (45%)
Query: 178 VLITDAYNALHNQPPRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEH 237
VLI + P F ++ T+R GGNVL+P +G + +LL L Y
Sbjct: 11 VLILTGLTQIPTANPDGMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYIDSA 70
Query: 238 SL-NYPIYFLTYVSSSTIDYVKSFLEWM-GDSITKSF--ETSRDNAFL-----LKHVTLL 288
L N P YF++ V++S++++ + F EW+ + TK + E +A L LKH L
Sbjct: 71 GLSNIPFYFISPVANSSLEFSQIFAEWLCHNKQTKVYLPEPPFPHAELIQTNKLKHYPSL 130
Query: 289 INKSELDNAPDGPKLVLASMASLEAGFSHDI--FVE-WASDVKNLVLFTE 335
D P +V SL G D+ F+E W N V+FTE
Sbjct: 131 HGDFSSDFRQ--PCVVFTGHPSLRFG---DVVHFMELWGKSSLNTVIFTE 175
>TIGR_CMR|CPS_2623
symbol:CPS_2623 "metallo-beta-lactamase family protein"
species:167879 "Colwellia psychrerythraea 34H" [GO:0003824
"catalytic activity" evidence=ISS] [GO:0008152 "metabolic process"
evidence=ISS] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 GO:GO:0016787 EMBL:CP000083 GenomeReviews:CP000083_GR
eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 HOGENOM:HOG000244774 KO:K07576 RefSeq:YP_269337.1
ProteinModelPortal:Q481D2 STRING:Q481D2 GeneID:3521490
KEGG:cps:CPS_2623 PATRIC:21468305 OMA:HGPMVII
ProtClustDB:CLSK2524370 BioCyc:CPSY167879:GI48-2685-MONOMER
Uniprot:Q481D2
Length = 451
Score = 74 (31.1 bits), Expect = 0.00086, Sum P(3) = 0.00086
Identities = 24/99 (24%), Positives = 40/99 (40%)
Query: 5 VQVTPLSGVFNENPLSYLVSIDGFNFLIDCGWNDHFD---PSLLQPLSKVASTIDAVLLS 61
+ +T L G Y V L+DCG + +PL ++DA++L+
Sbjct: 1 MNITFLGGTGTVTGSKYFVETSTTKILVDCGLYQGYKWLRARNREPLPLDLKSLDAIVLT 60
Query: 62 HPDTLHLGALPYAMKQLGLSAPVFSTEPVYRLGLLTMYD 100
H H G +P KQ G V++ + L + + D
Sbjct: 61 HAHLDHSGFIPALYKQ-GFRGHVYAHQATISLCSILLPD 98
Score = 68 (29.0 bits), Expect = 0.00086, Sum P(3) = 0.00086
Identities = 27/114 (23%), Positives = 50/114 (43%)
Query: 133 AGHLLGGTVWKITKDGEDVIYAVDYNRRKEKHLNGTVLESFVRPAVLITDAY-NALHNQP 191
AGH+LG + DG+ V ++ D R + + V +L+ Y N LH++
Sbjct: 158 AGHILGAASVILKADGKRVGFSGDVGRPDDIIMYPPKPLPPV-DLLLLESTYGNRLHDK- 215
Query: 192 PRQQREMFQDAISKTLRAGGNVLLPVDSAGRVLELLLILEDYWAEHSL-NYPIY 244
E + ++ T + GG +L+P + GR + +L + + P+Y
Sbjct: 216 -EDAFEQLAEIVNSTAKKGGALLIPSFAVGRTEAVQHMLASLMKKELIPKLPVY 268
Score = 65 (27.9 bits), Expect = 0.00086, Sum P(3) = 0.00086
Identities = 12/30 (40%), Positives = 21/30 (70%)
Query: 540 LSHVAP-LKLVLVHGSAEATEHLKQHCLKH 568
+S + P K++LVHG EA+E ++ H ++H
Sbjct: 406 ISKLHPKTKVLLVHGEPEASESMRDHLMQH 435
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
  Query                       -----  As Used  -----    -----  Computed  ----
  Frame  MatID  Matrix name   Lambda    K       H      Lambda    K       H
   +0      0    BLOSUM62      0.317   0.136   0.400    same    same    same
                Q=9,R=2       0.244   0.0300  0.180    n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E      S   W   T   X    E2    S2
   +0      0      721      715      0.00084  121  3  11  22   0.42   34
                                                         36   0.45   37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 94
No. of states in DFA: 621 (66 KB)
Total size of DFA: 370 KB (2183 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 65.02u 0.11s 65.13t Elapsed: 00:00:02
Total cpu time: 65.05u 0.11s 65.16t Elapsed: 00:00:02
Start: Fri May 10 19:23:08 2013 End: Fri May 10 19:23:10 2013
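
Note on the bit scores: the value in parentheses after each raw Score appears to follow the usual Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2, using the Lambda and K from the Q=9,R=2 row of the Parameters table. That choice of row is an assumption inferred from the numbers in this report, not something the report states. A minimal Python sketch of the arithmetic (the function and constant names are illustrative only):

```python
import math

# Values copied from the "As Used" columns of the Parameters table.
# Assumption: the Q=9,R=2 (gapped) row is the one applied to reported scores.
GAPPED_LAMBDA = 0.244
GAPPED_K = 0.0300


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    # Raw scores taken from HSPs listed in this report.
    for raw in (157, 65, 142):
        print(raw, round(bit_score(raw), 1))
```

With these inputs the sketch gives 60.3, 27.9, and 55.0 bits for raw scores 157, 65, and 142, matching the values printed in the alignments above. Because Lambda and K are shown rounded in the report, small last-digit differences are possible for other scores.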