BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>psy13345
MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKL
CHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE
LGGNRTLTLQVKKRIRLEGEELEEYQKKKDKEAKDKQEKEKIPPHDTSFINELQLSDFKQ
TLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFSRDLFFQWCSSPENSIIITN
RNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL

High Scoring Gene Products

Symbol        Full name                                                             Information                            P value
cpsf2         cleavage and polyadenylation specific factor 2                        gene_product from Danio rerio          2.1e-54
Cpsf100       Cleavage and polyadenylation specificity factor 100                   protein from Drosophila melanogaster   1.3e-47
CPSF2         Cleavage and polyadenylation specificity factor subunit 2             protein from Bos taurus                2.1e-47
Cpsf2         cleavage and polyadenylation specific factor 2, 100kDa                gene from Rattus norvegicus            2.1e-47
CPSF2         Uncharacterized protein                                               protein from Canis lupus familiaris    2.6e-47
CPSF2         Cleavage and polyadenylation specificity factor subunit 2             protein from Homo sapiens              2.6e-47
Cpsf2         cleavage and polyadenylation specific factor 2                        protein from Mus musculus              3.4e-47
CPSF2         Uncharacterized protein                                               protein from Gallus gallus             5.6e-47
cpsf2         Cleavage and polyadenylation specificity factor subunit 2             protein from Xenopus laevis            1.2e-46
CPSF2         Uncharacterized protein                                               protein from Sus scrofa                3.2e-33
cpsf-2                                                                              gene from Caenorhabditis elegans       3.7e-32
cpsf-2        Probable cleavage and polyadenylation specificity factor subunit 2    protein from Caenorhabditis elegans    3.7e-32
cpsf2         cleavage and polyadenylation specificity factor 100 kDa subunit       gene from Dictyostelium discoideum     5.5e-29
CPSF100       cleavage and polyadenylation specificity factor 100                   protein from Arabidopsis thaliana      8.6e-28
CPSF73-I      cleavage and polyadenylation specificity factor 73-I                  protein from Arabidopsis thaliana      2.2e-07
YSH1          Putative endoribonuclease                                             gene from Saccharomyces cerevisiae     3.3e-07
CPSF3         Cleavage and polyadenylation specific factor 3, 73kDa, isoform CRA_b  protein from Homo sapiens              4.4e-07
CPSF3         Cleavage and polyadenylation specificity factor subunit 3             protein from Bos taurus                4.8e-07
CPSF3         Cleavage and polyadenylation specificity factor subunit 3             protein from Homo sapiens              4.8e-07
CPSF3         Uncharacterized protein                                               protein from Gallus gallus             4.8e-07
LOC100622181  Uncharacterized protein                                               protein from Sus scrofa                4.8e-07
CPSF3         Uncharacterized protein                                               protein from Canis lupus familiaris    5.1e-07
cpsf3         cleavage and polyadenylation specific factor 3                        gene_product from Danio rerio          7.9e-07
Cpsf73        Cleavage and polyadenylation specificity factor 73                    protein from Drosophila melanogaster   1.1e-06
Cpsf3         cleavage and polyadenylation specificity factor 3                     protein from Mus musculus              2.4e-06
Cpsf3         cleavage and polyadenylation specific factor 3, 73kDa                 gene from Rattus norvegicus            2.4e-06
orf19.5486                                                                          gene_product from Candida albicans     2.5e-06
YSH1          Endoribonuclease YSH1                                                 protein from Candida albicans SC5314   2.5e-06
cpsf-3                                                                              gene from Caenorhabditis elegans       2.5e-06
CPSF2         Cleavage and polyadenylation-specificity factor subunit 2             protein from Homo sapiens              6.2e-06
LOC100625560  Uncharacterized protein                                               protein from Sus scrofa                8.5e-06
cpsf3         cleavage and polyadenylation specificity factor 73 kDa subunit        gene from Dictyostelium discoideum     1.4e-05
F10B5.8                                                                             gene from Caenorhabditis elegans       8.1e-05
orf19.325                                                                           gene_product from Candida albicans     0.00021
CFT2          Putative uncharacterized protein CFT2                                 protein from Candida albicans SC5314   0.00021
MGG_06570     Uncharacterized protein                                               protein from Magnaporthe oryzae 70-15  0.00026
LOC100523908  Uncharacterized protein                                               protein from Sus scrofa                0.00030
Cpsf3l        cleavage and polyadenylation specific factor 3-like                   protein from Mus musculus              0.00038
Cpsf3l        cleavage and polyadenylation specific factor 3-like                   gene from Rattus norvegicus            0.00038
CPSF3L        Integrator complex subunit 11                                         protein from Bos taurus                0.00049
CPSF3L        Integrator complex subunit 11                                         protein from Bos taurus                0.00049
CPSF3L        Integrator complex subunit 11                                         protein from Gallus gallus             0.00065
CPSF3L        Integrator complex subunit 11                                         protein from Gallus gallus             0.00065
CPSF3L        Uncharacterized protein                                               protein from Canis lupus familiaris    0.00065
CPSF3L        Integrator complex subunit 11                                         protein from Homo sapiens              0.00065

The BLAST search returned 2 gene products which did not match your query constraints. Please see the full BLAST report below for the details.


Raw BLAST Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  psy13345
        (276 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

ZFIN|ZDB-GENE-040718-79 - symbol:cpsf2 "cleavage and poly...   392  2.1e-54   3
FB|FBgn0027873 - symbol:Cpsf100 "Cleavage and polyadenyla...   382  1.3e-47   2
UNIPROTKB|Q10568 - symbol:CPSF2 "Cleavage and polyadenyla...   402  2.1e-47   2
RGD|1309687 - symbol:Cpsf2 "cleavage and polyadenylation ...   402  2.1e-47   2
UNIPROTKB|E2R496 - symbol:CPSF2 "Uncharacterized protein"...   401  2.6e-47   2
UNIPROTKB|Q9P2I0 - symbol:CPSF2 "Cleavage and polyadenyla...   401  2.6e-47   2
MGI|MGI:1861601 - symbol:Cpsf2 "cleavage and polyadenylat...   400  3.4e-47   2
UNIPROTKB|F1NMN0 - symbol:CPSF2 "Uncharacterized protein"...   398  5.6e-47   2
UNIPROTKB|Q9W799 - symbol:cpsf2 "Cleavage and polyadenyla...   396  1.2e-46   2
UNIPROTKB|F1SD85 - symbol:CPSF2 "Uncharacterized protein"...   362  3.2e-33   1
WB|WBGene00017313 - symbol:cpsf-2 species:6239 "Caenorhab...   288  3.7e-32   2
UNIPROTKB|O17403 - symbol:cpsf-2 "Probable cleavage and p...   288  3.7e-32   2
DICTYBASE|DDB_G0270392 - symbol:cpsf2 "cleavage and polya...   210  5.5e-29   3
TAIR|locus:2172843 - symbol:CPSF100 "cleavage and polyade...   219  8.6e-28   3
POMBASE|SPBC1709.15c - symbol:cft2 "cleavage factor two C...   124  1.3e-08   3
TAIR|locus:2206076 - symbol:CPSF73-I "cleavage and polyad...   146  2.2e-07   1
SGD|S000004267 - symbol:YSH1 "Putative endoribonuclease" ...   145  3.3e-07   1
UNIPROTKB|G5E9W3 - symbol:CPSF3 "Cleavage and polyadenyla...   143  4.4e-07   1
UNIPROTKB|P79101 - symbol:CPSF3 "Cleavage and polyadenyla...   143  4.8e-07   1
UNIPROTKB|Q9UKF6 - symbol:CPSF3 "Cleavage and polyadenyla...   143  4.8e-07   1
UNIPROTKB|F1NKW5 - symbol:CPSF3 "Uncharacterized protein"...   143  4.8e-07   1
UNIPROTKB|I3LKR1 - symbol:CPSF3 "Uncharacterized protein"...   143  4.8e-07   1
UNIPROTKB|E2R7R2 - symbol:CPSF3 "Uncharacterized protein"...   143  5.1e-07   1
ZFIN|ZDB-GENE-030131-3275 - symbol:cpsf3 "cleavage and po...   144  7.9e-07   2
FB|FBgn0261065 - symbol:Cpsf73 "Cleavage and polyadenylat...   140  1.1e-06   1
MGI|MGI:1859328 - symbol:Cpsf3 "cleavage and polyadenylat...   137  2.4e-06   1
RGD|1305767 - symbol:Cpsf3 "cleavage and polyadenylation ...   137  2.4e-06   1
UNIPROTKB|G3V6W7 - symbol:Cpsf3 "Protein Cpsf3" species:1...   137  2.4e-06   1
CGD|CAL0005344 - symbol:orf19.5486 species:5476 "Candida ...   138  2.5e-06   1
UNIPROTKB|Q59P50 - symbol:YSH1 "Endoribonuclease YSH1" sp...   138  2.5e-06   1
WB|WBGene00013460 - symbol:cpsf-3 species:6239 "Caenorhab...   137  2.5e-06   1
UNIPROTKB|H0YJF4 - symbol:CPSF2 "Cleavage and polyadenyla...   127  6.2e-06   1
UNIPROTKB|F1SD84 - symbol:LOC100625560 "Uncharacterized p...   127  8.5e-06   1
DICTYBASE|DDB_G0274799 - symbol:cpsf3 "cleavage and polya...   131  1.4e-05   1
POMBASE|SPAC17G6.16c - symbol:ysh1 "mRNA cleavage and pol...   127  3.8e-05   1
WB|WBGene00008642 - symbol:F10B5.8 species:6239 "Caenorha...   123  8.1e-05   1
CGD|CAL0004705 - symbol:orf19.325 species:5476 "Candida a...   108  0.00021   3
UNIPROTKB|Q5AEE3 - symbol:CFT2 "Putative uncharacterized ...   108  0.00021   3
UNIPROTKB|G4N6C6 - symbol:MGG_06570 "Uncharacterized prot...    86  0.00026   3
UNIPROTKB|F1RJE8 - symbol:CPSF3L "Uncharacterized protein...   118  0.00030   1
MGI|MGI:1919207 - symbol:Cpsf3l "cleavage and polyadenyla...   117  0.00038   1
RGD|1306841 - symbol:Cpsf3l "cleavage and polyadenylation...   117  0.00038   1
UNIPROTKB|E1B7Q9 - symbol:CPSF3L "Integrator complex subu...   116  0.00049   1
UNIPROTKB|Q2YDM2 - symbol:CPSF3L "Integrator complex subu...   116  0.00049   1
UNIPROTKB|F1NV30 - symbol:CPSF3L "Integrator complex subu...   115  0.00065   1
UNIPROTKB|Q5ZIH0 - symbol:CPSF3L "Integrator complex subu...   115  0.00065   1
UNIPROTKB|E2QY53 - symbol:CPSF3L "Uncharacterized protein...   115  0.00065   1
UNIPROTKB|Q5TA45 - symbol:CPSF3L "Integrator complex subu...   115  0.00065   1
UNIPROTKB|G3V1S5 - symbol:CPSF3L "Integrator complex subu...   115  0.00065   1
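
The hit table above has a regular layout, so its rows can be read programmatically. The following is a minimal Python sketch and is not part of the BLAST output. It assumes that each hit line starts with "<database>|<identifier> - " and ends with three whitespace-separated fields (High Score, P(N), N), as in the rows shown. The names Hit, HIT_LINE, SYMBOL and parse_hit_line are hypothetical and chosen only for illustration.

```python
import re
from typing import NamedTuple, Optional


class Hit(NamedTuple):
    database: str          # e.g. "ZFIN", "UNIPROTKB"
    accession: str         # identifier after the first "|"
    symbol: Optional[str]  # value of "symbol:...", if present
    score: int             # "High Score" column
    p_value: float         # "Smallest Sum Probability P(N)" column
    n: int                 # "N" column


# Assumed row layout: "<DB>|<ID> - <description>   <score>  <P(N)>  <N>"
HIT_LINE = re.compile(
    r"^(?P<db>[^|\s]+)\|(?P<acc>\S+) - (?P<desc>.*?)\s+"
    r"(?P<score>\d+)\s+(?P<p>\d\S*)\s+(?P<n>\d+)\s*$"
)
SYMBOL = re.compile(r"symbol:(\S+)")


def parse_hit_line(line: str) -> Optional[Hit]:
    """Parse one row of the hit table; return None for non-matching lines."""
    match = HIT_LINE.match(line)
    if match is None:
        return None
    symbol = SYMBOL.search(match.group("desc"))
    return Hit(
        database=match.group("db"),
        accession=match.group("acc"),
        symbol=symbol.group(1) if symbol else None,
        score=int(match.group("score")),
        p_value=float(match.group("p")),
        n=int(match.group("n")),
    )


if __name__ == "__main__":
    rows = [
        'ZFIN|ZDB-GENE-040718-79 - symbol:cpsf2 "cleavage and poly...   392  2.1e-54   3',
        'MGI|MGI:1861601 - symbol:Cpsf2 "cleavage and polyadenylat...   400  3.4e-47   2',
        "Searching....10....20....30 done",  # not a hit row, so it is skipped
    ]
    for row in rows:
        hit = parse_hit_line(row)
        if hit is not None:
            print(hit.database, hit.accession, hit.symbol, hit.score, hit.p_value, hit.n)
```

Anchoring the pattern on the last three fields rather than on fixed column positions keeps the sketch tolerant of the truncated descriptions ("...") in the table.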


>ZFIN|ZDB-GENE-040718-79
            symbol:cpsf2 "cleavage and polyadenylation specific
            factor 2" species:7955 "Danio rerio" [GO:0006378 "mRNA
            polyadenylation" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0005847
            "mRNA cleavage and polyadenylation specificity factor complex"
            evidence=IEA] InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 ZFIN:ZDB-GENE-040718-79 GO:GO:0016787
            eggNOG:COG1236 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
            InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
            SMART:SM01027 HOGENOM:HOG000264343 CTD:53981 HOVERGEN:HBG051106
            OrthoDB:EOG4MCWZQ EMBL:BC076029 IPI:IPI00512505
            RefSeq:NP_001002384.1 UniGene:Dr.121547 ProteinModelPortal:Q6DHE5
            STRING:Q6DHE5 PRIDE:Q6DHE5 GeneID:436657 KEGG:dre:436657
            InParanoid:Q6DHE5 NextBio:20831102 ArrayExpress:Q6DHE5 Bgee:Q6DHE5
            Uniprot:Q6DHE5
        Length = 790

 Score = 392 (143.0 bits), Expect = 2.1e-54, Sum P(3) = 2.1e-54
 Identities = 71/119 (59%), Positives = 94/119 (78%)

Query:    26 HSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM 83
             +++ E +K  +EWMSDKLM+ FE  RNNPF F+H+ LCHSL++LA+VPSPKVVL S PD+
Sbjct:   274 YNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHSLSDLARVPSPKVVLCSQPDL 333

Query:    84 ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
             E GFSR+LF QWC   +NS+I+T RT+PGTLAR LI+  G + + L+++KR RLEG EL
Sbjct:   334 ESGFSRELFIQWCQDAKNSVILTYRTTPGTLARYLIDNPGEKRIELEIRKRCRLEGREL 392

 Score = 131 (51.2 bits), Expect = 1.3e-20, Sum P(3) = 1.3e-20
 Identities = 22/34 (64%), Positives = 29/34 (85%)

Query:     1 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKI 34
             MSDKLM+ FE  RNNPF F+H+ LCHSL++LA++
Sbjct:   287 MSDKLMRCFEDKRNNPFQFRHLSLCHSLSDLARV 320

 Score = 130 (50.8 bits), Expect = 2.1e-54, Sum P(3) = 2.1e-54
 Identities = 27/46 (58%), Positives = 32/46 (69%)

Query:   162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRR 207
             +P H + FINE +LSDFKQ L R GI  EF+ GVL+C    VAVRR
Sbjct:   716 VPGHQSVFINEPRLSDFKQVLLREGIQAEFVGGVLVC-NNLVAVRR 760

 Score = 120 (47.3 bits), Expect = 1.4e-19, Sum P(3) = 1.4e-19
 Identities = 21/35 (60%), Positives = 27/35 (77%)

Query:   207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
             +VVL S PD+E GFSR+LF QWC   +NS+I+T R
Sbjct:   324 KVVLCSQPDLESGFSRELFIQWCQDAKNSVILTYR 358

 Score = 94 (38.1 bits), Expect = 2.1e-54, Sum P(3) = 2.1e-54
 Identities = 15/30 (50%), Positives = 25/30 (83%)

Query:   247 SDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
             +++  I LEGC  D+YYR+++LLY+QYA++
Sbjct:   761 TEAGRICLEGCHCDDYYRIRELLYEQYAVV 790


>FB|FBgn0027873
            symbol:Cpsf100 "Cleavage and polyadenylation specificity
            factor 100" species:7227 "Drosophila melanogaster" [GO:0006379
            "mRNA cleavage" evidence=ISS;NAS] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=ISS]
            [GO:0006378 "mRNA polyadenylation" evidence=ISS;IMP;NAS]
            [GO:0003730 "mRNA 3'-UTR binding" evidence=NAS] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0006398 "histone mRNA 3'-end
            processing" evidence=IMP] [GO:0022008 "neurogenesis" evidence=IMP]
            InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 EMBL:AE014297 GO:GO:0022008 GO:GO:0006378
            GO:GO:0003723 GO:GO:0016787 GO:GO:0005847 GO:GO:0006379
            eggNOG:COG1236 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
            InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
            SMART:SM01027 GeneTree:ENSGT00700000104551 OMA:NNPFQFK
            GO:GO:0006398 EMBL:AF160933 RefSeq:NP_651658.1 RefSeq:NP_733264.1
            UniGene:Dm.1362 ProteinModelPortal:Q9V3D6 SMR:Q9V3D6 IntAct:Q9V3D6
            STRING:Q9V3D6 PaxDb:Q9V3D6 PRIDE:Q9V3D6 EnsemblMetazoa:FBtr0085357
            GeneID:43426 KEGG:dme:Dmel_CG1957 UCSC:CG1957-RA CTD:43426
            FlyBase:FBgn0027873 InParanoid:Q8IML7 OrthoDB:EOG4XD261
            PhylomeDB:Q9V3D6 GenomeRNAi:43426 NextBio:833860 Bgee:Q9V3D6
            GermOnline:CG1957 Uniprot:Q9V3D6
        Length = 756

 Score = 382 (139.5 bits), Expect = 1.3e-47, Sum P(2) = 1.3e-47
 Identities = 74/121 (61%), Positives = 95/121 (78%)

Query:    26 HSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPS-PKVVLVSTPD 82
             +++ E AK  IEWMSDKL K+FEGARNNPF FKH++LCHSLA++ K+P+ PKVVL STPD
Sbjct:   274 YNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKHIQLCHSLADVYKLPAGPKVVLASTPD 333

Query:    83 MECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIE-LGGNRTLTLQVKKRIRLEGEE 141
             +E GF+RDLF QW S+  NSII+T RTSPGTLA +L+E     + + L V++R+ LEG E
Sbjct:   334 LESGFTRDLFVQWASNANNSIILTTRTSPGTLAMELVENCAPGKQIELDVRRRVDLEGAE 393

Query:   142 L 142
             L
Sbjct:   394 L 394

 Score = 149 (57.5 bits), Expect = 1.3e-47, Sum P(2) = 1.3e-47
 Identities = 30/47 (63%), Positives = 36/47 (76%)

Query:   162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRV 208
             IP H++  INEL+LSDFKQTL RN I+ EF  GVL C  GT+A+RRV
Sbjct:   681 IPIHNSVLINELKLSDFKQTLMRNNINSEFSGGVLWCSNGTLALRRV 727

 Score = 144 (55.7 bits), Expect = 1.6e-18, Sum P(2) = 1.6e-18
 Identities = 25/34 (73%), Positives = 31/34 (91%)

Query:     1 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKI 34
             MSDKL K+FEGARNNPF FKH++LCHSLA++ K+
Sbjct:   287 MSDKLTKAFEGARNNPFQFKHIQLCHSLADVYKL 320

 Score = 125 (49.1 bits), Expect = 6.3e-16, Sum P(2) = 6.3e-16
 Identities = 23/35 (65%), Positives = 28/35 (80%)

Query:   207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
             +VVL STPD+E GF+RDLF QW S+  NSII+T R
Sbjct:   325 KVVLASTPDLESGFTRDLFVQWASNANNSIILTTR 359

 Score = 107 (42.7 bits), Expect = 3.3e-43, Sum P(2) = 3.3e-43
 Identities = 21/49 (42%), Positives = 32/49 (65%)

Query:   228 WCSSPENSIIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
             WCS   N  +   R      D+  + +EGCLS+EYY++++LLY+QYAI+
Sbjct:   716 WCS---NGTLALRR-----VDAGKVAMEGCLSEEYYKIRELLYEQYAIV 756


>UNIPROTKB|Q10568
            symbol:CPSF2 "Cleavage and polyadenylation specificity
            factor subunit 2" species:9913 "Bos taurus" [GO:0006398 "histone
            mRNA 3'-end processing" evidence=ISS] [GO:0005847 "mRNA cleavage
            and polyadenylation specificity factor complex" evidence=ISS]
            [GO:0003723 "RNA binding" evidence=IEA] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0006379 "mRNA cleavage" evidence=IEA]
            [GO:0006378 "mRNA polyadenylation" evidence=IEA] InterPro:IPR001279
            InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0003723 GO:GO:0016787 GO:GO:0005847 eggNOG:COG1236 KO:K14402
            InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108
            PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
            GeneTree:ENSGT00700000104551 HOGENOM:HOG000264343 OMA:NNPFQFK
            EMBL:X75931 IPI:IPI00688446 PIR:A56351 RefSeq:NP_787002.1
            UniGene:Bt.4077 ProteinModelPortal:Q10568 STRING:Q10568
            PRIDE:Q10568 Ensembl:ENSBTAT00000013500 GeneID:327689
            KEGG:bta:327689 CTD:53981 HOVERGEN:HBG051106 InParanoid:Q10568
            OrthoDB:EOG4MCWZQ NextBio:20810154 GO:GO:0006398 Uniprot:Q10568
        Length = 782

 Score = 402 (146.6 bits), Expect = 2.1e-47, Sum P(2) = 2.1e-47
 Identities = 71/119 (59%), Positives = 95/119 (79%)

Query:    26 HSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM 83
             +++ E +K  +EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+
Sbjct:   274 YNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDL 333

Query:    84 ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
             ECGFSRDLF QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG+EL
Sbjct:   334 ECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKVTEIELRKRVKLEGKEL 392

 Score = 144 (55.7 bits), Expect = 1.1e-21, Sum P(3) = 1.1e-21
 Identities = 25/35 (71%), Positives = 29/35 (82%)

Query:   207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
             +VVL S PD+ECGFSRDLF QWC  P+NSII+T R
Sbjct:   324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYR 358

 Score = 127 (49.8 bits), Expect = 2.1e-47, Sum P(2) = 2.1e-47
 Identities = 26/46 (56%), Positives = 32/46 (69%)

Query:   162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRR 207
             +P H + F+NE +LSDFKQ L R GI  EF+ GVL+C    VAVRR
Sbjct:   708 VPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVC-NNQVAVRR 752

 Score = 127 (49.8 bits), Expect = 1.1e-21, Sum P(3) = 1.1e-21
 Identities = 21/34 (61%), Positives = 28/34 (82%)

Query:     1 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKI 34
             MSDKLM+ FE  RNNPF F+H+ LCH L++LA++
Sbjct:   287 MSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARV 320

 Score = 91 (37.1 bits), Expect = 1.2e-43, Sum P(2) = 1.2e-43
 Identities = 16/41 (39%), Positives = 28/41 (68%)

Query:   236 IIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
             ++  N+     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct:   742 LVCNNQVAVRRTETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>RGD|1309687
            symbol:Cpsf2 "cleavage and polyadenylation specific factor 2,
            100kDa" species:10116 "Rattus norvegicus" [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=IEA;ISO] [GO:0006378 "mRNA polyadenylation" evidence=IEA]
            [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006398 "histone mRNA
            3'-end processing" evidence=IEA;ISO] [GO:0016787 "hydrolase
            activity" evidence=IEA] InterPro:IPR001279 InterPro:IPR027075
            Pfam:PF00753 SMART:SM00849 Pfam:PF07521 RGD:1309687 GO:GO:0016787
            EMBL:CH473982 GO:GO:0005847 KO:K14402 InterPro:IPR022712
            InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
            Pfam:PF10996 Pfam:PF13299 SMART:SM01027
            GeneTree:ENSGT00700000104551 OMA:NNPFQFK CTD:53981
            OrthoDB:EOG4MCWZQ GO:GO:0006398 IPI:IPI00189534
            RefSeq:NP_001100223.1 UniGene:Rn.8038 Ensembl:ENSRNOT00000008612
            GeneID:299256 KEGG:rno:299256 UCSC:RGD:1309687 NextBio:645098
            Uniprot:D3Z9E6
        Length = 782

 Score = 402 (146.6 bits), Expect = 2.1e-47, Sum P(2) = 2.1e-47
 Identities = 71/119 (59%), Positives = 95/119 (79%)

Query:    26 HSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM 83
             +++ E +K  +EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+
Sbjct:   274 YNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDL 333

Query:    84 ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
             ECGFSRDLF QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG+EL
Sbjct:   334 ECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKVTEIELRKRVKLEGKEL 392

 Score = 144 (55.7 bits), Expect = 1.1e-21, Sum P(3) = 1.1e-21
 Identities = 25/35 (71%), Positives = 29/35 (82%)

Query:   207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
             +VVL S PD+ECGFSRDLF QWC  P+NSII+T R
Sbjct:   324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYR 358

 Score = 127 (49.8 bits), Expect = 2.1e-47, Sum P(2) = 2.1e-47
 Identities = 26/46 (56%), Positives = 32/46 (69%)

Query:   162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRR 207
             +P H + F+NE +LSDFKQ L R GI  EF+ GVL+C    VAVRR
Sbjct:   708 VPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVC-NNQVAVRR 752

 Score = 127 (49.8 bits), Expect = 1.1e-21, Sum P(3) = 1.1e-21
 Identities = 21/34 (61%), Positives = 28/34 (82%)

Query:     1 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKI 34
             MSDKLM+ FE  RNNPF F+H+ LCH L++LA++
Sbjct:   287 MSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARV 320

 Score = 91 (37.1 bits), Expect = 1.2e-43, Sum P(2) = 1.2e-43
 Identities = 16/41 (39%), Positives = 28/41 (68%)

Query:   236 IIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
             ++  N+     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct:   742 LVCNNQVAVRRTETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>UNIPROTKB|E2R496
            symbol:CPSF2 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0006398 "histone mRNA 3'-end processing"
            evidence=IEA] [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=IEA] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0006379 "mRNA cleavage" evidence=IEA]
            [GO:0006378 "mRNA polyadenylation" evidence=IEA] InterPro:IPR001279
            InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0016787 GO:GO:0005847 KO:K14402 InterPro:IPR022712
            InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
            Pfam:PF10996 Pfam:PF13299 SMART:SM01027
            GeneTree:ENSGT00700000104551 OMA:NNPFQFK CTD:53981 GO:GO:0006398
            EMBL:AAEX03005582 RefSeq:XP_537353.2 ProteinModelPortal:E2R496
            Ensembl:ENSCAFT00000017381 GeneID:480230 KEGG:cfa:480230
            NextBio:20855279 Uniprot:E2R496
        Length = 782

 Score = 401 (146.2 bits), Expect = 2.6e-47, Sum P(2) = 2.6e-47
 Identities = 71/119 (59%), Positives = 95/119 (79%)

Query:    26 HSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM 83
             +++ E +K  +EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+
Sbjct:   274 YNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDL 333

Query:    84 ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
             ECGFSRDLF QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG+EL
Sbjct:   334 ECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEGKEL 392

 Score = 144 (55.7 bits), Expect = 1.1e-21, Sum P(3) = 1.1e-21
 Identities = 25/35 (71%), Positives = 29/35 (82%)

Query:   207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
             +VVL S PD+ECGFSRDLF QWC  P+NSII+T R
Sbjct:   324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYR 358

 Score = 127 (49.8 bits), Expect = 2.6e-47, Sum P(2) = 2.6e-47
 Identities = 26/46 (56%), Positives = 32/46 (69%)

Query:   162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRR 207
             +P H + F+NE +LSDFKQ L R GI  EF+ GVL+C    VAVRR
Sbjct:   708 VPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVC-NNQVAVRR 752

 Score = 127 (49.8 bits), Expect = 1.1e-21, Sum P(3) = 1.1e-21
 Identities = 21/34 (61%), Positives = 28/34 (82%)

Query:     1 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKI 34
             MSDKLM+ FE  RNNPF F+H+ LCH L++LA++
Sbjct:   287 MSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARV 320

 Score = 91 (37.1 bits), Expect = 1.6e-43, Sum P(2) = 1.6e-43
 Identities = 16/41 (39%), Positives = 28/41 (68%)

Query:   236 IIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
             ++  N+     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct:   742 LVCNNQVAVRRTETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>UNIPROTKB|Q9P2I0
            symbol:CPSF2 "Cleavage and polyadenylation specificity
            factor subunit 2" species:9606 "Homo sapiens" [GO:0006378 "mRNA
            polyadenylation" evidence=IEA] [GO:0006379 "mRNA cleavage"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            [GO:0003723 "RNA binding" evidence=IEA] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0006398 "histone mRNA 3'-end processing"
            evidence=IDA] [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=IDA] [GO:0000398 "mRNA
            splicing, via spliceosome" evidence=TAS] [GO:0005654 "nucleoplasm"
            evidence=TAS] [GO:0006366 "transcription from RNA polymerase II
            promoter" evidence=TAS] [GO:0006369 "termination of RNA polymerase
            II transcription" evidence=TAS] [GO:0006397 "mRNA processing"
            evidence=TAS] [GO:0006406 "mRNA export from nucleus" evidence=TAS]
            [GO:0008380 "RNA splicing" evidence=TAS] [GO:0010467 "gene
            expression" evidence=TAS] [GO:0031124 "mRNA 3'-end processing"
            evidence=TAS] Reactome:REACT_71 InterPro:IPR001279
            InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            EMBL:CH471061 Reactome:REACT_1675 GO:GO:0003723 GO:GO:0016787
            GO:GO:0006406 GO:GO:0000398 Reactome:REACT_1788 GO:GO:0005847
            GO:GO:0006369 eggNOG:COG1236 KO:K14402 InterPro:IPR022712
            InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
            Pfam:PF10996 Pfam:PF13299 SMART:SM01027 Reactome:REACT_78
            HOGENOM:HOG000264343 OMA:NNPFQFK CTD:53981 HOVERGEN:HBG051106
            OrthoDB:EOG4MCWZQ GO:GO:0006398 EMBL:AK001627 EMBL:BC070095
            EMBL:AB037788 EMBL:AL442079 IPI:IPI00419531 RefSeq:NP_059133.1
            UniGene:Hs.657632 UniGene:Hs.736541 ProteinModelPortal:Q9P2I0
            SMR:Q9P2I0 DIP:DIP-42500N IntAct:Q9P2I0 MINT:MINT-1697677
            STRING:Q9P2I0 PhosphoSite:Q9P2I0 DMDM:51338827 PaxDb:Q9P2I0
            PeptideAtlas:Q9P2I0 PRIDE:Q9P2I0 Ensembl:ENST00000298875
            GeneID:53981 KEGG:hsa:53981 UCSC:uc001yah.2 GeneCards:GC14P092588
            HGNC:HGNC:2325 HPA:HPA024238 MIM:606028 neXtProt:NX_Q9P2I0
            PharmGKB:PA26842 InParanoid:Q9P2I0 PhylomeDB:Q9P2I0 ChiTaRS:CPSF2
            GenomeRNAi:53981 NextBio:56268 ArrayExpress:Q9P2I0 Bgee:Q9P2I0
            CleanEx:HS_CPSF2 Genevestigator:Q9P2I0 GermOnline:ENSG00000165934
            Uniprot:Q9P2I0
        Length = 782

 Score = 401 (146.2 bits), Expect = 2.6e-47, Sum P(2) = 2.6e-47
 Identities = 71/119 (59%), Positives = 95/119 (79%)

Query:    26 HSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM 83
             +++ E +K  +EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+
Sbjct:   274 YNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDL 333

Query:    84 ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
             ECGFSRDLF QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG+EL
Sbjct:   334 ECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIELRKRVKLEGKEL 392

 Score = 144 (55.7 bits), Expect = 1.1e-21, Sum P(3) = 1.1e-21
 Identities = 25/35 (71%), Positives = 29/35 (82%)

Query:   207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
             +VVL S PD+ECGFSRDLF QWC  P+NSII+T R
Sbjct:   324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYR 358

 Score = 127 (49.8 bits), Expect = 2.6e-47, Sum P(2) = 2.6e-47
 Identities = 26/46 (56%), Positives = 32/46 (69%)

Query:   162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRR 207
             +P H + F+NE +LSDFKQ L R GI  EF+ GVL+C    VAVRR
Sbjct:   708 VPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVC-NNQVAVRR 752

 Score = 127 (49.8 bits), Expect = 1.1e-21, Sum P(3) = 1.1e-21
 Identities = 21/34 (61%), Positives = 28/34 (82%)

Query:     1 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKI 34
             MSDKLM+ FE  RNNPF F+H+ LCH L++LA++
Sbjct:   287 MSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARV 320

 Score = 91 (37.1 bits), Expect = 1.6e-43, Sum P(2) = 1.6e-43
 Identities = 16/41 (39%), Positives = 28/41 (68%)

Query:   236 IIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
             ++  N+     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct:   742 LVCNNQVAVRRTETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>MGI|MGI:1861601
            symbol:Cpsf2 "cleavage and polyadenylation specific factor
            2" species:10090 "Mus musculus" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0003723 "RNA binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=ISO;IDA]
            [GO:0006397 "mRNA processing" evidence=IEA] [GO:0006398 "histone
            mRNA 3'-end processing" evidence=ISO] [GO:0016787 "hydrolase
            activity" evidence=IEA] InterPro:IPR001279 InterPro:IPR027075
            Pfam:PF00753 SMART:SM00849 Pfam:PF07521 MGI:MGI:1861601
            GO:GO:0003723 GO:GO:0016787 GO:GO:0005847 eggNOG:COG1236 KO:K14402
            InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108
            PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
            GeneTree:ENSGT00700000104551 HOGENOM:HOG000264343 OMA:NNPFQFK
            CTD:53981 HOVERGEN:HBG051106 OrthoDB:EOG4MCWZQ GO:GO:0006398
            EMBL:AF012822 EMBL:BC013628 EMBL:BC007163 IPI:IPI00314302
            RefSeq:NP_058552.1 UniGene:Mm.716 ProteinModelPortal:O35218
            SMR:O35218 STRING:O35218 PhosphoSite:O35218 PaxDb:O35218
            PRIDE:O35218 Ensembl:ENSMUST00000047357 GeneID:51786 KEGG:mmu:51786
            UCSC:uc007otx.2 InParanoid:O35218 NextBio:308008 Bgee:O35218
            CleanEx:MM_CPSF2 Genevestigator:O35218
            GermOnline:ENSMUSG00000041781 Uniprot:O35218
        Length = 782

 Score = 400 (145.9 bits), Expect = 3.4e-47, Sum P(2) = 3.4e-47
 Identities = 71/119 (59%), Positives = 95/119 (79%)

Query:    26 HSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM 83
             +++ E +K  +EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+
Sbjct:   274 YNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDL 333

Query:    84 ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
             ECGFSRDLF QWC  P+NSII+T RT+PGTLAR LI+    +   ++++KR++LEG+EL
Sbjct:   334 ECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPTEKVTEIELRKRVKLEGKEL 392

 Score = 144 (55.7 bits), Expect = 1.1e-21, Sum P(3) = 1.1e-21
 Identities = 25/35 (71%), Positives = 29/35 (82%)

Query:   207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
             +VVL S PD+ECGFSRDLF QWC  P+NSII+T R
Sbjct:   324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYR 358

 Score = 127 (49.8 bits), Expect = 3.4e-47, Sum P(2) = 3.4e-47
 Identities = 26/46 (56%), Positives = 32/46 (69%)

Query:   162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRR 207
             +P H + F+NE +LSDFKQ L R GI  EF+ GVL+C    VAVRR
Sbjct:   708 VPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVC-NNQVAVRR 752

 Score = 127 (49.8 bits), Expect = 1.1e-21, Sum P(3) = 1.1e-21
 Identities = 21/34 (61%), Positives = 28/34 (82%)

Query:     1 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKI 34
             MSDKLM+ FE  RNNPF F+H+ LCH L++LA++
Sbjct:   287 MSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARV 320

 Score = 91 (37.1 bits), Expect = 2.1e-43, Sum P(2) = 2.1e-43
 Identities = 16/41 (39%), Positives = 28/41 (68%)

Query:   236 IIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
             ++  N+     +++  I LEGCL  ++YR++ LLY+QYAI+
Sbjct:   742 LVCNNQVAVRRTETGRIGLEGCLCQDFYRIRDLLYEQYAIV 782


>UNIPROTKB|F1NMN0
            symbol:CPSF2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006378 "mRNA polyadenylation" evidence=IEA]
            [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA]
            [GO:0006398 "histone mRNA 3'-end processing" evidence=IEA]
            InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 GO:GO:0016787 GO:GO:0005847 InterPro:IPR022712
            InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
            Pfam:PF10996 Pfam:PF13299 SMART:SM01027
            GeneTree:ENSGT00700000104551 OMA:NNPFQFK GO:GO:0006398
            EMBL:AADN02003653 IPI:IPI00651282 Ensembl:ENSGALT00000017538
            Uniprot:F1NMN0
        Length = 782

 Score = 398 (145.2 bits), Expect = 5.6e-47, Sum P(2) = 5.6e-47
 Identities = 70/119 (58%), Positives = 96/119 (80%)

Query:    26 HSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM 83
             +++ E +K  +EWMSDKLM+ FE  RNNPF F+H+ LCHSL++LA+VPSPKVVL S PD+
Sbjct:   274 YNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHSLSDLARVPSPKVVLASQPDL 333

Query:    84 ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
             ECGFSRDLF QWC   +NSII+T RT+PGTLAR LI+    + + +++++R++LEG+EL
Sbjct:   334 ECGFSRDLFIQWCQDSKNSIILTYRTTPGTLARFLIDNPSEKVIDIELRRRVKLEGKEL 392

 Score = 136 (52.9 bits), Expect = 9.5e-21, Sum P(3) = 9.5e-21
 Identities = 24/35 (68%), Positives = 28/35 (80%)

Query:   207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
             +VVL S PD+ECGFSRDLF QWC   +NSII+T R
Sbjct:   324 KVVLASQPDLECGFSRDLFIQWCQDSKNSIILTYR 358

 Score = 131 (51.2 bits), Expect = 9.5e-21, Sum P(3) = 9.5e-21
 Identities = 22/34 (64%), Positives = 29/34 (85%)

Query:     1 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKI 34
             MSDKLM+ FE  RNNPF F+H+ LCHSL++LA++
Sbjct:   287 MSDKLMRCFEDKRNNPFQFRHLSLCHSLSDLARV 320

 Score = 127 (49.8 bits), Expect = 5.6e-47, Sum P(2) = 5.6e-47
 Identities = 26/46 (56%), Positives = 32/46 (69%)

Query:   162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRR 207
             +P H + F+NE +LSDFKQ L R GI  EF+ GVL+C    VAVRR
Sbjct:   708 VPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVC-NNMVAVRR 752

 Score = 88 (36.0 bits), Expect = 7.1e-43, Sum P(2) = 7.1e-43
 Identities = 16/41 (39%), Positives = 27/41 (65%)

Query:   236 IIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
             ++  N      +++  I LEGCL  ++YR+++LLY QYAI+
Sbjct:   742 LVCNNMVAVRRTETGRIGLEGCLCQDFYRIRELLYKQYAIV 782


>UNIPROTKB|Q9W799 [details] [associations]
            symbol:cpsf2 "Cleavage and polyadenylation specificity
            factor subunit 2" species:8355 "Xenopus laevis" [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=ISS] InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 GO:GO:0005737 GO:GO:0006397
            GO:GO:0003723 GO:GO:0016787 GO:GO:0005847 KO:K14402
            InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108
            PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
            CTD:53981 HOVERGEN:HBG051106 EMBL:AF139986 RefSeq:NP_001081123.1
            UniGene:Xl.3876 ProteinModelPortal:Q9W799 GeneID:394394
            KEGG:xla:394394 Xenbase:XB-GENE-950598 Uniprot:Q9W799
        Length = 783

 Score = 396 (144.5 bits), Expect = 1.2e-46, Sum P(2) = 1.2e-46
 Identities = 69/119 (57%), Positives = 95/119 (79%)

Query:    26 HSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM 83
             +++ E +K  +EWMSDKLM+ FE  RNNPF F+H+ LCH  ++LA+VPSPKVVL S PD+
Sbjct:   274 YNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLTLCHGYSDLARVPSPKVVLASQPDL 333

Query:    84 ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
             ECGFSR+LF QWC  P+NS+I+T RT+PGTLAR LI+    R + ++++KR++LEG+EL
Sbjct:   334 ECGFSRELFIQWCQDPKNSVILTYRTTPGTLARFLIDHPSERIIDIELRKRVKLEGKEL 392

 Score = 139 (54.0 bits), Expect = 6.6e-20, Sum P(3) = 6.6e-20
 Identities = 23/35 (65%), Positives = 29/35 (82%)

Query:   207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
             +VVL S PD+ECGFSR+LF QWC  P+NS+I+T R
Sbjct:   324 KVVLASQPDLECGFSRELFIQWCQDPKNSVILTYR 358

 Score = 126 (49.4 bits), Expect = 1.2e-46, Sum P(2) = 1.2e-46
 Identities = 26/46 (56%), Positives = 32/46 (69%)

Query:   162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRR 207
             +P H + F+NE +LSDFKQ L R GI  EF+ GVL+C    VAVRR
Sbjct:   709 VPGHQSVFMNEPRLSDFKQVLLREGIHAEFVGGVLVC-NNMVAVRR 753

 Score = 121 (47.7 bits), Expect = 6.6e-20, Sum P(3) = 6.6e-20
 Identities = 20/34 (58%), Positives = 27/34 (79%)

Query:     1 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKI 34
             MSDKLM+ FE  RNNPF F+H+ LCH  ++LA++
Sbjct:   287 MSDKLMRCFEDKRNNPFQFRHLTLCHGYSDLARV 320

 Score = 86 (35.3 bits), Expect = 1.9e-42, Sum P(2) = 1.9e-42
 Identities = 14/41 (34%), Positives = 29/41 (70%)

Query:   236 IIITNRNRADDSDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
             ++  N      +++  I LEGCL ++++++++LLY+QYAI+
Sbjct:   743 LVCNNMVAVRRTETGRIGLEGCLCEDFFKIRELLYEQYAIV 783


>UNIPROTKB|F1SD85 [details] [associations]
            symbol:CPSF2 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0006379
            "mRNA cleavage" evidence=IEA] [GO:0006378 "mRNA polyadenylation"
            evidence=IEA] [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=IEA] InterPro:IPR001279
            InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 GO:GO:0016787
            InterPro:IPR022712 PANTHER:PTHR11203:SF5 Pfam:PF10996 SMART:SM01027
            GeneTree:ENSGT00700000104551 EMBL:CU468363
            Ensembl:ENSSSCT00000002717 OMA:GANDESP Uniprot:F1SD85
        Length = 385

 Score = 362 (132.5 bits), Expect = 3.2e-33, P = 3.2e-33
 Identities = 65/108 (60%), Positives = 84/108 (77%)

Query:    26 HSLAELAK--IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM 83
             +++ E +K  +EWMSDKLM+ FE  RNNPF F+H+ LCH L++LA+VPSPKVVL S PD+
Sbjct:   274 YNVVEFSKSQVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDL 333

Query:    84 ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQV 131
             ECGFSRDLF QWC  P+NSII+T RT+PGTLAR LI+    +   ++V
Sbjct:   334 ECGFSRDLFIQWCQDPKNSIILTYRTTPGTLARFLIDNPSEKITEIEV 381

 Score = 144 (55.7 bits), Expect = 4.7e-17, Sum P(2) = 4.7e-17
 Identities = 25/35 (71%), Positives = 29/35 (82%)

Query:   207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
             +VVL S PD+ECGFSRDLF QWC  P+NSII+T R
Sbjct:   324 KVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYR 358

 Score = 127 (49.8 bits), Expect = 4.7e-17, Sum P(2) = 4.7e-17
 Identities = 21/34 (61%), Positives = 28/34 (82%)

Query:     1 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKI 34
             MSDKLM+ FE  RNNPF F+H+ LCH L++LA++
Sbjct:   287 MSDKLMRCFEDKRNNPFQFRHLSLCHGLSDLARV 320


>WB|WBGene00017313 [details] [associations]
            symbol:cpsf-2 species:6239 "Caenorhabditis elegans"
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0009792 "embryo
            development ending in birth or egg hatching" evidence=IMP]
            [GO:0051301 "cell division" evidence=IMP] [GO:0000910 "cytokinesis"
            evidence=IMP] [GO:0000003 "reproduction" evidence=IMP] [GO:0040035
            "hermaphrodite genitalia development" evidence=IMP] [GO:0016246
            "RNA interference" evidence=IMP] [GO:0040027 "negative regulation
            of vulval development" evidence=IMP] InterPro:IPR001279
            InterPro:IPR027075 SMART:SM00849 Pfam:PF07521 GO:GO:0005634
            GO:GO:0009792 GO:GO:0016246 GO:GO:0006397 GO:GO:0003723
            GO:GO:0016787 GO:GO:0000910 GO:GO:0040035 GO:GO:0040027
            eggNOG:COG1236 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
            InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
            SMART:SM01027 GeneTree:ENSGT00700000104551 HOGENOM:HOG000264343
            OMA:NNPFQFK EMBL:FO080529 PIR:T32487 RefSeq:NP_504822.1
            ProteinModelPortal:O17403 SMR:O17403 STRING:O17403 PaxDb:O17403
            EnsemblMetazoa:F09G2.4 GeneID:179103 KEGG:cel:CELE_F09G2.4
            CTD:179103 WormBase:F09G2.4 InParanoid:O17403 NextBio:903938
            Uniprot:O17403
        Length = 843

 Score = 288 (106.4 bits), Expect = 3.7e-32, Sum P(2) = 3.7e-32
 Identities = 65/129 (50%), Positives = 82/129 (63%)

Query:    27 SLAELAK--IEWMSDKLMK-SFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM 83
             S+ + AK  +EWM++KL K     AR NPF  KHV LCHS  EL +V SPKVVL S+ DM
Sbjct:   274 SVVQFAKSQLEWMNEKLFKYDSSSARYNPFTLKHVTLCHSHQELMRVRSPKVVLCSSQDM 333

Query:    84 ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELG-----G-----NRTLTLQVKK 133
             E GFSR+LF  WCS P N +I+T R +  TLA  L+ +      G     +R ++L VKK
Sbjct:   334 ESGFSRELFLDWCSDPRNGVILTARPASFTLAAKLVNMAERANDGVLKHEDRLISLVVKK 393

Query:   134 RIRLEGEEL 142
             R+ LEGEEL
Sbjct:   394 RVALEGEEL 402

 Score = 119 (46.9 bits), Expect = 5.6e-13, Sum P(3) = 5.6e-13
 Identities = 21/35 (60%), Positives = 26/35 (74%)

Query:   207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
             +VVL S+ DME GFSR+LF  WCS P N +I+T R
Sbjct:   324 KVVLCSSQDMESGFSRELFLDWCSDPRNGVILTAR 358

 Score = 101 (40.6 bits), Expect = 3.7e-32, Sum P(2) = 3.7e-32
 Identities = 23/64 (35%), Positives = 33/64 (51%)

Query:   162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFS 221
             IP H   F+N+ +LSDFK  L   G   EF+ G L+   G  ++RR        ME  F+
Sbjct:   768 IPIHQAVFVNDPKLSDFKNLLTDKGYKAEFLSGTLLINGGNCSIRRND-TGVFQMEGAFT 826

Query:   222 RDLF 225
             +D +
Sbjct:   827 KDYY 830

 Score = 85 (35.0 bits), Expect = 5.6e-13, Sum P(3) = 5.6e-13
 Identities = 18/35 (51%), Positives = 22/35 (62%)

Query:     1 MSDKLMK-SFEGARNNPFHFKHVKLCHSLAELAKI 34
             M++KL K     AR NPF  KHV LCHS  EL ++
Sbjct:   286 MNEKLFKYDSSSARYNPFTLKHVTLCHSHQELMRV 320

 Score = 80 (33.2 bits), Expect = 5.8e-30, Sum P(2) = 5.8e-30
 Identities = 12/30 (40%), Positives = 23/30 (76%)

Query:   247 SDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
             +D+ V  +EG  + +YY++++L YDQ+A+L
Sbjct:   814 NDTGVFQMEGAFTKDYYKLRRLFYDQFAVL 843

 Score = 37 (18.1 bits), Expect = 1.7e-06, Sum P(3) = 1.7e-06
 Identities = 7/18 (38%), Positives = 12/18 (66%)

Query:    38 SDKLMKSFEGARNNPFHF 55
             S+K  +SF+G+ N+   F
Sbjct:   450 SEKDFRSFDGSENDAHTF 467


>UNIPROTKB|O17403 [details] [associations]
            symbol:cpsf-2 "Probable cleavage and polyadenylation
            specificity factor subunit 2" species:6239 "Caenorhabditis elegans"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0003674
            "molecular_function" evidence=ND] InterPro:IPR001279
            InterPro:IPR027075 SMART:SM00849 Pfam:PF07521 GO:GO:0005634
            GO:GO:0009792 GO:GO:0016246 GO:GO:0006397 GO:GO:0003723
            GO:GO:0016787 GO:GO:0000910 GO:GO:0040035 GO:GO:0040027
            eggNOG:COG1236 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
            InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
            SMART:SM01027 GeneTree:ENSGT00700000104551 HOGENOM:HOG000264343
            OMA:NNPFQFK EMBL:FO080529 PIR:T32487 RefSeq:NP_504822.1
            ProteinModelPortal:O17403 SMR:O17403 STRING:O17403 PaxDb:O17403
            EnsemblMetazoa:F09G2.4 GeneID:179103 KEGG:cel:CELE_F09G2.4
            CTD:179103 WormBase:F09G2.4 InParanoid:O17403 NextBio:903938
            Uniprot:O17403
        Length = 843

 Score = 288 (106.4 bits), Expect = 3.7e-32, Sum P(2) = 3.7e-32
 Identities = 65/129 (50%), Positives = 82/129 (63%)

Query:    27 SLAELAK--IEWMSDKLMK-SFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM 83
             S+ + AK  +EWM++KL K     AR NPF  KHV LCHS  EL +V SPKVVL S+ DM
Sbjct:   274 SVVQFAKSQLEWMNEKLFKYDSSSARYNPFTLKHVTLCHSHQELMRVRSPKVVLCSSQDM 333

Query:    84 ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLIELG-----G-----NRTLTLQVKK 133
             E GFSR+LF  WCS P N +I+T R +  TLA  L+ +      G     +R ++L VKK
Sbjct:   334 ESGFSRELFLDWCSDPRNGVILTARPASFTLAAKLVNMAERANDGVLKHEDRLISLVVKK 393

Query:   134 RIRLEGEEL 142
             R+ LEGEEL
Sbjct:   394 RVALEGEEL 402

 Score = 119 (46.9 bits), Expect = 5.6e-13, Sum P(3) = 5.6e-13
 Identities = 21/35 (60%), Positives = 26/35 (74%)

Query:   207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNR 241
             +VVL S+ DME GFSR+LF  WCS P N +I+T R
Sbjct:   324 KVVLCSSQDMESGFSRELFLDWCSDPRNGVILTAR 358

 Score = 101 (40.6 bits), Expect = 3.7e-32, Sum P(2) = 3.7e-32
 Identities = 23/64 (35%), Positives = 33/64 (51%)

Query:   162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRVVLVSTPDMECGFS 221
             IP H   F+N+ +LSDFK  L   G   EF+ G L+   G  ++RR        ME  F+
Sbjct:   768 IPIHQAVFVNDPKLSDFKNLLTDKGYKAEFLSGTLLINGGNCSIRRND-TGVFQMEGAFT 826

Query:   222 RDLF 225
             +D +
Sbjct:   827 KDYY 830

 Score = 85 (35.0 bits), Expect = 5.6e-13, Sum P(3) = 5.6e-13
 Identities = 18/35 (51%), Positives = 22/35 (62%)

Query:     1 MSDKLMK-SFEGARNNPFHFKHVKLCHSLAELAKI 34
             M++KL K     AR NPF  KHV LCHS  EL ++
Sbjct:   286 MNEKLFKYDSSSARYNPFTLKHVTLCHSHQELMRV 320

 Score = 80 (33.2 bits), Expect = 5.8e-30, Sum P(2) = 5.8e-30
 Identities = 12/30 (40%), Positives = 23/30 (76%)

Query:   247 SDSNVIVLEGCLSDEYYRVQQLLYDQYAIL 276
             +D+ V  +EG  + +YY++++L YDQ+A+L
Sbjct:   814 NDTGVFQMEGAFTKDYYKLRRLFYDQFAVL 843

 Score = 37 (18.1 bits), Expect = 1.7e-06, Sum P(3) = 1.7e-06
 Identities = 7/18 (38%), Positives = 12/18 (66%)

Query:    38 SDKLMKSFEGARNNPFHF 55
             S+K  +SF+G+ N+   F
Sbjct:   450 SEKDFRSFDGSENDAHTF 467


>DICTYBASE|DDB_G0270392 [details] [associations]
            symbol:cpsf2 "cleavage and polyadenylation
            specificity factor 100 kDa subunit" species:44689 "Dictyostelium
            discoideum" [GO:0016787 "hydrolase activity" evidence=IEA]
            [GO:0006379 "mRNA cleavage" evidence=IEA;ISS] [GO:0006378 "mRNA
            polyadenylation" evidence=IEA;ISS] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA;ISS]
            [GO:0003723 "RNA binding" evidence=IEA;ISS] [GO:0006397 "mRNA
            processing" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753 SMART:SM00849
            dictyBase:DDB_G0270392 Pfam:PF07521 EMBL:AAFI02000005
            GenomeReviews:CM000150_GR GO:GO:0006378 GO:GO:0003723 GO:GO:0016787
            GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236 KO:K14402
            InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108
            PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
            OMA:NNPFQFK RefSeq:XP_646760.1 ProteinModelPortal:Q55BS1
            STRING:Q55BS1 EnsemblProtists:DDB0233700 GeneID:8617733
            KEGG:ddi:DDB_G0270392 ProtClustDB:CLSZ2431463 Uniprot:Q55BS1
        Length = 784

 Score = 210 (79.0 bits), Expect = 5.5e-29, Sum P(3) = 5.5e-29
 Identities = 46/117 (39%), Positives = 68/117 (58%)

Query:    32 AKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSP-KVVLVSTPDMECGFSRD 90
             +++E+MS      FE    NPF FKH+K+  SL EL ++P   KV+L S+ D+E GFSR+
Sbjct:   285 SQLEFMSSTASVKFEQNIENPFSFKHIKILSSLEELQELPDTNKVILTSSQDLETGFSRE 344

Query:    91 LFFQWCSSPENSIIITNRTSPGTLARDLIEL-----GGNRTLTLQVKKRIRLEGEEL 142
             LF QWCS P+  I+ T +    +LA  LI+      G  + + +    R+ L G+EL
Sbjct:   345 LFIQWCSDPKTLILFTQKIPKDSLADKLIKQYSTPNGRGKCIEIVQGSRVPLTGDEL 401

 Score = 111 (44.1 bits), Expect = 2.3e-06, Sum P(2) = 2.3e-06
 Identities = 20/46 (43%), Positives = 29/46 (63%)

Query:   207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRNRADDSDSNVI 252
             +V+L S+ D+E GFSR+LF QWCS P+  I+ T +   D     +I
Sbjct:   328 KVILTSSQDLETGFSRELFIQWCSDPKTLILFTQKIPKDSLADKLI 373

 Score = 90 (36.7 bits), Expect = 5.5e-29, Sum P(3) = 5.5e-29
 Identities = 20/43 (46%), Positives = 26/43 (60%)

Query:   165 HDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRR 207
             HD SFI +++LSD KQ L   GI  +F  G+L C  G V + R
Sbjct:   709 HDESFIGDIKLSDLKQVLVNAGIQVQFDQGILNC-GGLVYIWR 750

 Score = 80 (33.2 bits), Expect = 5.5e-29, Sum P(3) = 5.5e-29
 Identities = 15/35 (42%), Positives = 25/35 (71%)

Query:   243 RADDSDSNVIV-LEGCLSDEYYRVQQLLYDQYAIL 276
             R +D   N I+ ++G +SDEYY +++LLY Q+ I+
Sbjct:   750 RDEDHGGNSIINVDGIISDEYYLIKELLYKQFQIV 784

 Score = 71 (30.1 bits), Expect = 3.1e-08, Sum P(3) = 3.1e-08
 Identities = 15/34 (44%), Positives = 19/34 (55%)

Query:     1 MSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKI 34
             MS      FE    NPF FKH+K+  SL EL ++
Sbjct:   290 MSSTASVKFEQNIENPFSFKHIKILSSLEELQEL 323


>TAIR|locus:2172843 [details] [associations]
            symbol:CPSF100 "cleavage and polyadenylation specificity
            factor 100" species:3702 "Arabidopsis thaliana" [GO:0005634
            "nucleus" evidence=ISM;IDA] [GO:0009793 "embryo development ending
            in seed dormancy" evidence=NAS] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0003677 "DNA binding" evidence=IPI] [GO:0005515
            "protein binding" evidence=IPI] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=ISS;NAS]
            [GO:0006378 "mRNA polyadenylation" evidence=ISS] [GO:0006379 "mRNA
            cleavage" evidence=ISS] [GO:0035194 "posttranscriptional gene
            silencing by RNA" evidence=IMP] [GO:0009506 "plasmodesma"
            evidence=IDA] [GO:0000278 "mitotic cell cycle" evidence=RCA]
            [GO:0006306 "DNA methylation" evidence=RCA] [GO:0006342 "chromatin
            silencing" evidence=RCA] [GO:0006396 "RNA processing" evidence=RCA]
            [GO:0006397 "mRNA processing" evidence=RCA] [GO:0007267 "cell-cell
            signaling" evidence=RCA] [GO:0009220 "pyrimidine ribonucleotide
            biosynthetic process" evidence=RCA] [GO:0009616 "virus induced gene
            silencing" evidence=RCA] [GO:0009640 "photomorphogenesis"
            evidence=RCA] [GO:0010267 "production of ta-siRNAs involved in RNA
            interference" evidence=RCA] [GO:0010388 "cullin deneddylation"
            evidence=RCA] [GO:0016569 "covalent chromatin modification"
            evidence=RCA] [GO:0031047 "gene silencing by RNA" evidence=RCA]
            [GO:0035196 "production of miRNAs involved in gene silencing by
            miRNA" evidence=RCA] InterPro:IPR001279 InterPro:IPR027075
            Pfam:PF00753 SMART:SM00849 GO:GO:0009506 EMBL:CP002688
            GenomeReviews:BA000015_GR GO:GO:0006378 EMBL:AB005244 GO:GO:0003723
            GO:GO:0016787 GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236 KO:K14402
            InterPro:IPR022712 InterPro:IPR025069 PANTHER:PTHR11203:SF5
            Pfam:PF10996 Pfam:PF13299 SMART:SM01027 EMBL:AF283277 EMBL:AY034982
            EMBL:BT004374 IPI:IPI00521104 RefSeq:NP_197776.1 UniGene:At.25191
            ProteinModelPortal:Q9LKF9 SMR:Q9LKF9 IntAct:Q9LKF9 STRING:Q9LKF9
            PaxDb:Q9LKF9 PRIDE:Q9LKF9 EnsemblPlants:AT5G23880.1 GeneID:832453
            KEGG:ath:AT5G23880 TAIR:At5g23880 HOGENOM:HOG000264343
            InParanoid:Q9LKF9 OMA:NNPFQFK PhylomeDB:Q9LKF9
            ProtClustDB:CLSN2686300 Genevestigator:Q9LKF9 GermOnline:AT5G23880
            GO:GO:0035194 Uniprot:Q9LKF9
        Length = 739

 Score = 219 (82.2 bits), Expect = 8.6e-28, Sum P(3) = 8.6e-28
 Identities = 45/110 (40%), Positives = 65/110 (59%)

Query:    34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVP-SPKVVLVSTPDMECGFSRDLF 92
             +EWMSD + KSFE +R+N F  +HV L  +  +L   P  PKVVL S   +E GF+R++F
Sbjct:   281 LEWMSDSISKSFETSRDNAFLLRHVTLLINKTDLDNAPPGPKVVLASMASLEAGFAREIF 340

Query:    93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLEGEEL 142
              +W + P N ++ T     GTLAR L      + + + + KR+ L GEEL
Sbjct:   341 VEWANDPRNLVLFTETGQFGTLARMLQSAPPPKFVKVTMSKRVPLAGEEL 390

 Score = 85 (35.0 bits), Expect = 9.2e-06, Sum P(3) = 9.2e-06
 Identities = 13/33 (39%), Positives = 22/33 (66%)

Query:   207 RVVLVSTPDMECGFSRDLFFQWCSSPENSIIIT 239
             +VVL S   +E GF+R++F +W + P N ++ T
Sbjct:   322 KVVLASMASLEAGFAREIFVEWANDPRNLVLFT 354

 Score = 84 (34.6 bits), Expect = 8.6e-28, Sum P(3) = 8.6e-28
 Identities = 16/45 (35%), Positives = 25/45 (55%)

Query:   164 PHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRRV 208
             PH    + +L+++DFKQ L   G+  EF  G  + C   V +R+V
Sbjct:   656 PHKPVLVGDLKIADFKQFLSSKGVQVEFAGGGALRCGEYVTLRKV 700

 Score = 67 (28.6 bits), Expect = 9.2e-06, Sum P(3) = 9.2e-06
 Identities = 14/31 (45%), Positives = 20/31 (64%)

Query:     1 MSDKLMKSFEGARNNPFHFKHVKLCHSLAEL 31
             MSD + KSFE +R+N F  +HV L  +  +L
Sbjct:   284 MSDSISKSFETSRDNAFLLRHVTLLINKTDL 314

 Score = 64 (27.6 bits), Expect = 8.6e-28, Sum P(3) = 8.6e-28
 Identities = 10/25 (40%), Positives = 19/25 (76%)

Query:   252 IVLEGCLSDEYYRVQQLLYDQYAIL 276
             I++EG L ++YY+++  LY Q+ +L
Sbjct:   715 ILIEGPLCEDYYKIRDYLYSQFYLL 739


>POMBASE|SPBC1709.15c [details] [associations]
            symbol:cft2 "cleavage factor two Cft2/polyadenylation
            factor CPSF-73 (predicted)" species:4896 "Schizosaccharomyces
            pombe" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=IDA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IDA]
            [GO:0005849 "mRNA cleavage factor complex" evidence=ISO]
            [GO:0006378 "mRNA polyadenylation" evidence=ISO] [GO:0006379 "mRNA
            cleavage" evidence=ISO] InterPro:IPR027075 PomBase:SPBC1709.15c
            Pfam:PF07521 EMBL:CU329671 GO:GO:0006378 GenomeReviews:CU329671_GR
            GO:GO:0005847 GO:GO:0006379 PIR:T39643 RefSeq:NP_595448.1
            ProteinModelPortal:O74740 STRING:O74740 EnsemblFungi:SPBC1709.15c.1
            GeneID:2539954 KEGG:spo:SPBC1709.15c eggNOG:COG1236 KO:K14402
            OMA:ISSIATP OrthoDB:EOG4WWVSN NextBio:20801097 InterPro:IPR022712
            InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
            Pfam:PF10996 Pfam:PF13299 SMART:SM01027 Uniprot:O74740
        Length = 797

 Score = 124 (48.7 bits), Expect = 1.3e-08, Sum P(3) = 1.3e-08
 Identities = 28/90 (31%), Positives = 48/90 (53%)

Query:    34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV-PSPKVVLVSTPDMECGFSRDLF 92
             IEWM D +++ F G   N   F+++      ++++ + P PKV+L +   +ECGFS+ + 
Sbjct:   275 IEWMGDNIVRDF-GINENLLEFRNINTITDFSQISHIGPGPKVILATALTLECGFSQRIL 333

Query:    93 FQWCSSPENSIII-TNRTS-P-GTLARDLI 119
                 S   N +I+ T R+  P  +LA   I
Sbjct:   334 LDLMSENSNDLILFTQRSRCPQNSLANQFI 363

 Score = 58 (25.5 bits), Expect = 1.3e-08, Sum P(3) = 1.3e-08
 Identities = 14/40 (35%), Positives = 24/40 (60%)

Query:   170 INELQLSDFKQTLQRNGIDCEFM-DGVLICCRGTVAVRRV 208
             +  ++L+  ++ L   GI  E   +GVL+C  G VAVR++
Sbjct:   730 VGNIRLAYLRKALLDQGISAELKGEGVLLC-GGAVAVRKL 768

 Score = 57 (25.1 bits), Expect = 1.3e-08, Sum P(3) = 1.3e-08
 Identities = 9/25 (36%), Positives = 19/25 (76%)

Query:   252 IVLEGCLSDEYYRVQQLLYDQYAIL 276
             I +EG LS+ ++ +++L+YD  A++
Sbjct:   773 ISVEGSLSNRFFEIRKLVYDALAVV 797


>TAIR|locus:2206076 [details] [associations]
            symbol:CPSF73-I "cleavage and polyadenylation specificity
            factor 73-I" species:3702 "Arabidopsis thaliana" [GO:0005634
            "nucleus" evidence=ISM;IDA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=ISS]
            [GO:0006378 "mRNA polyadenylation" evidence=ISS] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0005515 "protein binding"
            evidence=IPI] [GO:0000278 "mitotic cell cycle" evidence=RCA]
            [GO:0006346 "methylation-dependent chromatin silencing"
            evidence=RCA] [GO:0006396 "RNA processing" evidence=RCA]
            [GO:0006397 "mRNA processing" evidence=RCA] [GO:0009855
            "determination of bilateral symmetry" evidence=RCA] [GO:0010014
            "meristem initiation" evidence=RCA] [GO:0010073 "meristem
            maintenance" evidence=RCA] [GO:0016246 "RNA interference"
            evidence=RCA] [GO:0031507 "heterochromatin assembly" evidence=RCA]
            [GO:0045787 "positive regulation of cell cycle" evidence=RCA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0005634 GO:GO:0006397
            GO:GO:0090305 EMBL:AC018908 GO:GO:0004518 eggNOG:COG1236
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            EMBL:AY140900 EMBL:AY150478 EMBL:AY074280 EMBL:AK316692
            EMBL:AK316794 IPI:IPI00533462 PIR:G96635 RefSeq:NP_001031215.1
            RefSeq:NP_176297.1 RefSeq:NP_849835.1 UniGene:At.23510
            ProteinModelPortal:Q9C952 SMR:Q9C952 IntAct:Q9C952 STRING:Q9C952
            PaxDb:Q9C952 PRIDE:Q9C952 EnsemblPlants:AT1G61010.1
            EnsemblPlants:AT1G61010.2 EnsemblPlants:AT1G61010.3 GeneID:842393
            KEGG:ath:AT1G61010 TAIR:At1g61010 HOGENOM:HOG000203394
            InParanoid:Q9C952 KO:K14403 OMA:YVSFSAH PhylomeDB:Q9C952
            ProtClustDB:CLSN2681829 Genevestigator:Q9C952 InterPro:IPR021718
            Pfam:PF11718 SMART:SM01098 Uniprot:Q9C952
        Length = 693

 Score = 146 (56.5 bits), Expect = 2.2e-07, P = 2.2e-07
 Identities = 36/119 (30%), Positives = 60/119 (50%)

Query:     3 DKLMKSFEGARNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKL 60
             D+   +     N P ++     K C ++ +   +  M+D++   F  A +NPF FKH+  
Sbjct:   264 DEYWANHPDLHNIPIYYASPLAKKCMAVYQTYILS-MNDRIRNQF--ANSNPFVFKHISP 320

Query:    61 CHSLAELAKVPSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
              +S+ +   V  P VV+ +   ++ G SR LF  WCS  +N+ II      GTLA+ +I
Sbjct:   321 LNSIDDFNDV-GPSVVMATPGGLQSGLSRQLFDSWCSDKKNACIIPGYMVEGTLAKTII 378


>SGD|S000004267 [details] [associations]
            symbol:YSH1 "Putative endoribonuclease" species:4932
            "Saccharomyces cerevisiae" [GO:0004518 "nuclease activity"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0006397 "mRNA processing" evidence=IEA]
            [GO:0006378 "mRNA polyadenylation" evidence=IMP] [GO:0006379 "mRNA
            cleavage" evidence=IMP] [GO:0006369 "termination of RNA polymerase
            II transcription" evidence=IMP] [GO:0031126 "snoRNA 3'-end
            processing" evidence=IMP] [GO:0008380 "RNA splicing" evidence=IMP]
            [GO:0034247 "snoRNA splicing" evidence=IMP] [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=IDA;IPI] [GO:0005849 "mRNA cleavage factor complex"
            evidence=IPI] [GO:0004521 "endoribonuclease activity"
            evidence=ISS;IMP] [GO:0003723 "RNA binding" evidence=IC]
            [GO:0004519 "endonuclease activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 SGD:S000004267
            Pfam:PF07521 GO:GO:0046872 GO:GO:0006378 EMBL:BK006945
            GO:GO:0004521 GO:GO:0005847 GO:GO:0006379 GO:GO:0006369
            eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 EMBL:U17245 HOGENOM:HOG000203394 KO:K14403
            InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            GeneTree:ENSGT00700000104485 GO:GO:0031126 GO:GO:0034247
            OMA:EISFAAH OrthoDB:EOG41ZJK7 PIR:S51413 RefSeq:NP_013379.1
            ProteinModelPortal:Q06224 SMR:Q06224 DIP:DIP-2470N IntAct:Q06224
            MINT:MINT-375457 STRING:Q06224 PaxDb:Q06224 PeptideAtlas:Q06224
            EnsemblFungi:YLR277C GeneID:850983 KEGG:sce:YLR277C CYGD:YLR277c
            NextBio:967501 Genevestigator:Q06224 GermOnline:YLR277C
            Uniprot:Q06224
        Length = 779

 Score = 145 (56.1 bits), Expect = 3.3e-07, P = 3.3e-07
 Identities = 30/94 (31%), Positives = 51/94 (54%)

Query:    23 KLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD 82
             K C S+ +   +  M+D + K F  ++ NPF FK++    +L +      P V+L S   
Sbjct:   284 KKCMSVFQ-TYVNMMNDDIRKKFRDSQTNPFIFKNISYLRNLEDFQDF-GPSVMLASPGM 341

Query:    83 MECGFSRDLFFQWCSSPENSIIITNRTSPGTLAR 116
             ++ G SRDL  +WC   +N ++IT  +  GT+A+
Sbjct:   342 LQSGLSRDLLERWCPEDKNLVLITGYSIEGTMAK 375


>UNIPROTKB|G5E9W3 [details] [associations]
            symbol:CPSF3 "Cleavage and polyadenylation-specificity
            factor subunit 3" species:9606 "Homo sapiens" [GO:0003723 "RNA
            binding" evidence=IEA] [GO:0004521 "endoribonuclease activity"
            evidence=IEA] [GO:0008409 "5'-3' exonuclease activity"
            evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 EMBL:CH471053 GO:GO:0003723 GO:GO:0004521
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            InterPro:IPR021718 Pfam:PF11718 SMART:SM01098 GO:GO:0008409
            EMBL:AC080162 UniGene:Hs.515972 HGNC:HGNC:2326 ChiTaRS:CPSF3
            ProteinModelPortal:G5E9W3 SMR:G5E9W3 PRIDE:G5E9W3
            Ensembl:ENST00000460593 ArrayExpress:G5E9W3 Bgee:G5E9W3
            Uniprot:G5E9W3
        Length = 647

 Score = 143 (55.4 bits), Expect = 4.4e-07, P = 4.4e-07
 Identities = 33/97 (34%), Positives = 50/97 (51%)

Query:    23 KLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD 82
             K C ++ +   +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   
Sbjct:   239 KKCMAVYQ-TYVNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGM 294

Query:    83 MECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
             M+ G SR+LF  WC+   N +II      GTLA+ ++
Sbjct:   295 MQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIM 331


>UNIPROTKB|P79101 [details] [associations]
            symbol:CPSF3 "Cleavage and polyadenylation specificity
            factor subunit 3" species:9913 "Bos taurus" [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=ISS] [GO:0006398 "histone mRNA 3'-end processing"
            evidence=ISS] [GO:0003723 "RNA binding" evidence=ISS] [GO:0004521
            "endoribonuclease activity" evidence=ISS] [GO:0008409 "5'-3'
            exonuclease activity" evidence=ISS] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0030529 "ribonucleoprotein complex" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0046872 GO:GO:0003723 GO:GO:0030529 GO:GO:0004521
            GO:GO:0005847 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
            Pfam:PF10996 SMART:SM01027 GO:GO:0006398 HOGENOM:HOG000203394
            KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            EMBL:X95906 EMBL:BC104553 IPI:IPI00708839 RefSeq:NP_776709.1
            UniGene:Bt.5045 ProteinModelPortal:P79101 SMR:P79101 STRING:P79101
            PRIDE:P79101 Ensembl:ENSBTAT00000026303 GeneID:281712
            KEGG:bta:281712 CTD:51692 GeneTree:ENSGT00700000104485
            HOVERGEN:HBG051107 InParanoid:P79101 OrthoDB:EOG4FN4H6
            NextBio:20805634 ArrayExpress:P79101 GO:GO:0008409 Uniprot:P79101
        Length = 684

 Score = 143 (55.4 bits), Expect = 4.8e-07, P = 4.8e-07
 Identities = 33/97 (34%), Positives = 50/97 (51%)

Query:    23 KLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD 82
             K C ++ +   +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   
Sbjct:   276 KKCMAVYQ-TYVNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGM 331

Query:    83 MECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
             M+ G SR+LF  WC+   N +II      GTLA+ ++
Sbjct:   332 MQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIM 368


>UNIPROTKB|Q9UKF6 [details] [associations]
            symbol:CPSF3 "Cleavage and polyadenylation specificity
            factor subunit 3" species:9606 "Homo sapiens" [GO:0030529
            "ribonucleoprotein complex" evidence=IEA] [GO:0046872 "metal ion
            binding" evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0008409 "5'-3' exonuclease activity" evidence=ISS] [GO:0004521
            "endoribonuclease activity" evidence=ISS] [GO:0003723 "RNA binding"
            evidence=ISS] [GO:0006398 "histone mRNA 3'-end processing"
            evidence=IDA] [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=IDA] [GO:0006378 "mRNA
            polyadenylation" evidence=TAS] [GO:0006379 "mRNA cleavage"
            evidence=TAS] [GO:0000398 "mRNA splicing, via spliceosome"
            evidence=TAS] [GO:0005654 "nucleoplasm" evidence=TAS] [GO:0006366
            "transcription from RNA polymerase II promoter" evidence=TAS]
            [GO:0006369 "termination of RNA polymerase II transcription"
            evidence=TAS] [GO:0006397 "mRNA processing" evidence=TAS]
            [GO:0006406 "mRNA export from nucleus" evidence=TAS] [GO:0008380
            "RNA splicing" evidence=TAS] [GO:0010467 "gene expression"
            evidence=TAS] [GO:0031124 "mRNA 3'-end processing" evidence=TAS]
            Reactome:REACT_71 InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            EMBL:AF017269 Pfam:PF07521 EMBL:AF171877 EMBL:CH471053
            GO:GO:0046872 Reactome:REACT_1675 GO:GO:0006378 GO:GO:0003723
            GO:GO:0030529 GO:GO:0006406 GO:GO:0004521 GO:GO:0000398
            Reactome:REACT_1788 GO:GO:0005847 GO:GO:0006379 GO:GO:0006369
            eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 Reactome:REACT_78 GO:GO:0006398 HOGENOM:HOG000203394
            KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            CTD:51692 HOVERGEN:HBG051107 OrthoDB:EOG4FN4H6 GO:GO:0008409
            EMBL:AC080162 EMBL:BC011654 EMBL:BC020211 IPI:IPI00007818
            RefSeq:NP_057291.1 UniGene:Hs.515972 PDB:2I7T PDB:2I7V PDBsum:2I7T
            PDBsum:2I7V ProteinModelPortal:Q9UKF6 SMR:Q9UKF6 DIP:DIP-42501N
            MINT:MINT-1742891 STRING:Q9UKF6 PhosphoSite:Q9UKF6 DMDM:18203503
            PaxDb:Q9UKF6 PeptideAtlas:Q9UKF6 PRIDE:Q9UKF6 DNASU:51692
            Ensembl:ENST00000238112 GeneID:51692 KEGG:hsa:51692 UCSC:uc002qzo.1
            GeneCards:GC02P009514 HGNC:HGNC:2326 HPA:HPA034657 MIM:606029
            neXtProt:NX_Q9UKF6 PharmGKB:PA26843 InParanoid:Q9UKF6
            PhylomeDB:Q9UKF6 ChiTaRS:CPSF3 EvolutionaryTrace:Q9UKF6
            GenomeRNAi:51692 NextBio:55702 ArrayExpress:Q9UKF6 Bgee:Q9UKF6
            CleanEx:HS_CPSF3 Genevestigator:Q9UKF6 GermOnline:ENSG00000119203
            Uniprot:Q9UKF6
        Length = 684

 Score = 143 (55.4 bits), Expect = 4.8e-07, P = 4.8e-07
 Identities = 33/97 (34%), Positives = 50/97 (51%)

Query:    23 KLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD 82
             K C ++ +   +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   
Sbjct:   276 KKCMAVYQ-TYVNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGM 331

Query:    83 MECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
             M+ G SR+LF  WC+   N +II      GTLA+ ++
Sbjct:   332 MQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIM 368


>UNIPROTKB|F1NKW5
            symbol:CPSF3 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0003723 "RNA binding" evidence=IEA] [GO:0004521
            "endoribonuclease activity" evidence=IEA] [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=IEA] [GO:0006398 "histone mRNA 3'-end processing"
            evidence=IEA] [GO:0008409 "5'-3' exonuclease activity"
            evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 GO:GO:0003723 GO:GO:0004521 GO:GO:0005847
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            GeneTree:ENSGT00700000104485 GO:GO:0008409 EMBL:AADN02018718
            IPI:IPI00600642 Ensembl:ENSGALT00000026493 Uniprot:F1NKW5
        Length = 685

 Score = 143 (55.4 bits), Expect = 4.8e-07, P = 4.8e-07
 Identities = 33/97 (34%), Positives = 50/97 (51%)

Query:    23 KLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD 82
             K C ++ +   +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   
Sbjct:   276 KKCMAVYQ-TYVNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGM 331

Query:    83 MECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
             M+ G SR+LF  WC+   N +II      GTLA+ ++
Sbjct:   332 MQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIM 368


>UNIPROTKB|I3LKR1
            symbol:CPSF3 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008409 "5'-3' exonuclease activity" evidence=IEA]
            [GO:0006398 "histone mRNA 3'-end processing" evidence=IEA]
            [GO:0005847 "mRNA cleavage and polyadenylation specificity factor
            complex" evidence=IEA] [GO:0004521 "endoribonuclease activity"
            evidence=IEA] [GO:0003723 "RNA binding" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0003723 GO:GO:0004521 GO:GO:0005847 InterPro:IPR022712
            InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 OMA:YVSFSAH
            InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            GeneTree:ENSGT00700000104485 GO:GO:0008409 EMBL:FP312696
            Ensembl:ENSSSCT00000027309 Uniprot:I3LKR1
        Length = 687

 Score = 143 (55.4 bits), Expect = 4.8e-07, P = 4.8e-07
 Identities = 33/97 (34%), Positives = 50/97 (51%)

Query:    23 KLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD 82
             K C ++ +   +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   
Sbjct:   279 KKCMAVYQ-TYVNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGM 334

Query:    83 MECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
             M+ G SR+LF  WC+   N +II      GTLA+ ++
Sbjct:   335 MQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIM 371


>UNIPROTKB|E2R7R2
            symbol:CPSF3 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718
            SMART:SM01098 GeneTree:ENSGT00700000104485 EMBL:AAEX03010701
            RefSeq:XP_003639652.1 Ensembl:ENSCAFT00000005417 GeneID:100856414
            KEGG:cfa:100856414 Uniprot:E2R7R2
        Length = 717

 Score = 143 (55.4 bits), Expect = 5.1e-07, P = 5.1e-07
 Identities = 33/97 (34%), Positives = 50/97 (51%)

Query:    23 KLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD 82
             K C ++ +   +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   
Sbjct:   309 KKCMAVYQ-TYVNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGM 364

Query:    83 MECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
             M+ G SR+LF  WC+   N +II      GTLA+ ++
Sbjct:   365 MQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIM 401


>ZFIN|ZDB-GENE-030131-3275
            symbol:cpsf3 "cleavage and polyadenylation
            specific factor 3" species:7955 "Danio rerio" [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 ZFIN:ZDB-GENE-030131-3275 GO:GO:0016787
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            KO:K14403 InterPro:IPR021718 Pfam:PF11718 SMART:SM01098 CTD:51692
            HOVERGEN:HBG051107 EMBL:AY648793 IPI:IPI00509063
            RefSeq:NP_001003836.1 UniGene:Dr.77231 ProteinModelPortal:Q6DRG6
            SMR:Q6DRG6 STRING:Q6DRG6 GeneID:324554 KEGG:dre:324554
            NextBio:20808833 ArrayExpress:Q6DRG6 Uniprot:Q6DRG6
        Length = 690

 Score = 144 (55.7 bits), Expect = 7.9e-07, Sum P(2) = 7.9e-07
 Identities = 33/97 (34%), Positives = 51/97 (52%)

Query:    23 KLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD 82
             K C ++ +   +  M+DK+ K+     NNPF FKH+    S+     +  P VV+ S   
Sbjct:   283 KKCMAVYQ-TYVNAMNDKIRKAIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGM 338

Query:    83 MECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
             M+ G SR+LF  WC+   N +II      GTLA+ ++
Sbjct:   339 MQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIM 375

 Score = 36 (17.7 bits), Expect = 7.9e-07, Sum P(2) = 7.9e-07
 Identities = 8/20 (40%), Positives = 11/20 (55%)

Query:   162 IPPHDTSFINELQLSDFKQT 181
             + P D S   +L +S  KQT
Sbjct:   502 LSPSDLSNYTDLAMSTVKQT 521


>FB|FBgn0261065
            symbol:Cpsf73 "Cleavage and polyadenylation specificity
            factor 73" species:7227 "Drosophila melanogaster" [GO:0006379 "mRNA
            cleavage" evidence=ISS] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=ISS;NAS]
            [GO:0006378 "mRNA polyadenylation" evidence=ISS;IMP] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0006398 "histone mRNA 3'-end
            processing" evidence=IMP] [GO:0003677 "DNA binding" evidence=IDA]
            [GO:0022008 "neurogenesis" evidence=IMP] InterPro:IPR001279
            Pfam:PF00753 SMART:SM00849 Pfam:PF07521 EMBL:AE014297 GO:GO:0022008
            GO:GO:0003677 GO:GO:0006378 GO:GO:0016787 GO:GO:0005847
            GO:GO:0006379 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
            Pfam:PF10996 SMART:SM01027 GO:GO:0006398 KO:K14403 OMA:YVSFSAH
            InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            GeneTree:ENSGT00700000104485 EMBL:AY119128 RefSeq:NP_650738.1
            UniGene:Dm.13714 SMR:Q9VE51 IntAct:Q9VE51 MINT:MINT-804945
            STRING:Q9VE51 EnsemblMetazoa:FBtr0083690 GeneID:42240
            KEGG:dme:Dmel_CG7698 UCSC:CG7698-RA CTD:42240 FlyBase:FBgn0261065
            InParanoid:Q9VE51 OrthoDB:EOG4P5HR4 GenomeRNAi:42240 NextBio:827838
            Uniprot:Q9VE51
        Length = 684

 Score = 140 (54.3 bits), Expect = 1.1e-06, P = 1.1e-06
 Identities = 31/97 (31%), Positives = 52/97 (53%)

Query:    23 KLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD 82
             K C ++ +   I  M+D++ +    A NNPF F+H+     +     +  P V++ S   
Sbjct:   282 KKCMAVYQ-TYINAMNDRIRRQI--AVNNPFVFRHISNLKGIDHFEDI-GPCVIMASPGM 337

Query:    83 MECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
             M+ G SR+LF  WC+ P+N +II      GTLA+ ++
Sbjct:   338 MQSGLSRELFESWCTDPKNGVIIAGYCVEGTLAKAVL 374


>MGI|MGI:1859328
            symbol:Cpsf3 "cleavage and polyadenylation specificity
            factor 3" species:10090 "Mus musculus" [GO:0003723 "RNA binding"
            evidence=IDA] [GO:0003729 "mRNA binding" evidence=ISO] [GO:0004518
            "nuclease activity" evidence=IEA] [GO:0004519 "endonuclease
            activity" evidence=IEA] [GO:0004521 "endoribonuclease activity"
            evidence=IDA] [GO:0005634 "nucleus" evidence=IEA] [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=ISO;IDA] [GO:0006378 "mRNA polyadenylation" evidence=ISO]
            [GO:0006397 "mRNA processing" evidence=IEA] [GO:0006398 "histone
            mRNA 3'-end processing" evidence=ISO;IDA] [GO:0008409 "5'-3'
            exonuclease activity" evidence=IDA] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0030529 "ribonucleoprotein complex"
            evidence=IEA] [GO:0046872 "metal ion binding" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            MGI:MGI:1859328 GO:GO:0046872 GO:GO:0003723 GO:GO:0030529
            GO:GO:0004521 GO:GO:0005847 eggNOG:COG1236 InterPro:IPR022712
            InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 GO:GO:0006398
            HOGENOM:HOG000203394 KO:K14403 OMA:YVSFSAH InterPro:IPR021718
            Pfam:PF11718 SMART:SM01098 CTD:51692 GeneTree:ENSGT00700000104485
            HOVERGEN:HBG051107 OrthoDB:EOG4FN4H6 GO:GO:0008409 ChiTaRS:CPSF3
            EMBL:AF203969 EMBL:BC023297 IPI:IPI00135099 RefSeq:NP_061283.2
            UniGene:Mm.356778 ProteinModelPortal:Q9QXK7 SMR:Q9QXK7
            STRING:Q9QXK7 PhosphoSite:Q9QXK7 PaxDb:Q9QXK7 PRIDE:Q9QXK7
            Ensembl:ENSMUST00000067284 GeneID:54451 KEGG:mmu:54451
            InParanoid:Q8CIM0 NextBio:311332 Bgee:Q9QXK7 CleanEx:MM_CPSF3
            Genevestigator:Q9QXK7 GermOnline:ENSMUSG00000054309 Uniprot:Q9QXK7
        Length = 684

 Score = 137 (53.3 bits), Expect = 2.4e-06, P = 2.4e-06
 Identities = 32/97 (32%), Positives = 50/97 (51%)

Query:    23 KLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD 82
             K C ++ +   +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   
Sbjct:   276 KKCMAVYQ-TYVNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGM 331

Query:    83 MECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
             ++ G SR+LF  WC+   N +II      GTLA+ ++
Sbjct:   332 IQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIM 368


>RGD|1305767
            symbol:Cpsf3 "cleavage and polyadenylation specific factor 3,
            73kDa" species:10116 "Rattus norvegicus" [GO:0003723 "RNA binding"
            evidence=ISO] [GO:0004521 "endoribonuclease activity" evidence=ISO]
            [GO:0005847 "mRNA cleavage and polyadenylation specificity factor
            complex" evidence=ISO] [GO:0006398 "histone mRNA 3'-end processing"
            evidence=ISO] [GO:0008409 "5'-3' exonuclease activity"
            evidence=ISO] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 RGD:1305767 GO:GO:0016787 eggNOG:COG1236
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            HOGENOM:HOG000203394 KO:K14403 InterPro:IPR021718 Pfam:PF11718
            SMART:SM01098 CTD:51692 HOVERGEN:HBG051107 OrthoDB:EOG4FN4H6
            UniGene:Rn.100522 EMBL:BC099817 IPI:IPI00365532
            RefSeq:NP_001025201.1 ProteinModelPortal:Q499P4 SMR:Q499P4
            STRING:Q499P4 GeneID:298916 KEGG:rno:298916 InParanoid:Q499P4
            NextBio:644507 Genevestigator:Q499P4 Uniprot:Q499P4
        Length = 685

 Score = 137 (53.3 bits), Expect = 2.4e-06, P = 2.4e-06
 Identities = 32/97 (32%), Positives = 50/97 (51%)

Query:    23 KLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD 82
             K C ++ +   +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   
Sbjct:   276 KKCMAVYQ-TYVNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGM 331

Query:    83 MECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
             ++ G SR+LF  WC+   N +II      GTLA+ ++
Sbjct:   332 IQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIM 368


>UNIPROTKB|G3V6W7
            symbol:Cpsf3 "Protein Cpsf3" species:10116 "Rattus
            norvegicus" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            RGD:1305767 GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108
            Pfam:PF10996 SMART:SM01027 EMBL:CH473947 OMA:YVSFSAH
            InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            GeneTree:ENSGT00700000104485 UniGene:Rn.100522
            Ensembl:ENSRNOT00000009652 Uniprot:G3V6W7
        Length = 685

 Score = 137 (53.3 bits), Expect = 2.4e-06, P = 2.4e-06
 Identities = 32/97 (32%), Positives = 50/97 (51%)

Query:    23 KLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPD 82
             K C ++ +   +  M+DK+ K      NNPF FKH+    S+     +  P VV+ S   
Sbjct:   276 KKCMAVYQ-TYVNAMNDKIRKQIN--INNPFVFKHISNLKSMDHFDDI-GPSVVMASPGM 331

Query:    83 MECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
             ++ G SR+LF  WC+   N +II      GTLA+ ++
Sbjct:   332 IQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIM 368


>CGD|CAL0005344
            symbol:orf19.5486 species:5476 "Candida albicans" [GO:0042493
            "response to drug" evidence=IMP] [GO:0004521 "endoribonuclease
            activity" evidence=IEA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA]
            [GO:0005829 "cytosol" evidence=IEA] [GO:0034247 "snoRNA splicing"
            evidence=IEA] [GO:0031126 "snoRNA 3'-end processing" evidence=IEA]
            [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006369 "termination
            of RNA polymerase II transcription" evidence=IEA] [GO:0006378 "mRNA
            polyadenylation" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 CGD:CAL0005344 Pfam:PF07521 GO:GO:0005634
            GO:GO:0042493 GO:GO:0046872 GO:GO:0006397 GO:GO:0090305
            GO:GO:0004519 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
            Pfam:PF10996 SMART:SM01027 KO:K14403 InterPro:IPR021718
            Pfam:PF11718 SMART:SM01098 EMBL:AACQ01000196 EMBL:AACQ01000195
            RefSeq:XP_711478.1 RefSeq:XP_711502.1 ProteinModelPortal:Q59P50
            STRING:Q59P50 GeneID:3646887 GeneID:3646911 KEGG:cal:CaO19.12941
            KEGG:cal:CaO19.5486 Uniprot:Q59P50
        Length = 870

 Score = 138 (53.6 bits), Expect = 2.5e-06, P = 2.5e-06
 Identities = 38/122 (31%), Positives = 67/122 (54%)

Query:     3 DKLMKSFEGARN-NPFHFKHV-KLCHSLAELAKIEWMSDKL-MKSFEGARNNPFHFKHVK 59
             D+     E  +N N F+  ++ K C ++ E      M+DK+ + S    ++NPF FK++K
Sbjct:   352 DEYWSQNEDLQNVNVFYASNLAKKCMAVYE-TYTGIMNDKIRLSSASSEKSNPFDFKYIK 410

Query:    60 LCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDL 118
                 L++   +  P VV V+TP M + G SR L  +W    +N +I+T  +  GT+A++L
Sbjct:   411 SIKDLSKFQDM-GPSVV-VATPGMLQAGVSRQLLEKWAPDGKNLVILTGYSVEGTMAKEL 468

Query:   119 IE 120
             ++
Sbjct:   469 LK 470


>UNIPROTKB|Q59P50
            symbol:YSH1 "Endoribonuclease YSH1" species:237561 "Candida
            albicans SC5314" [GO:0042493 "response to drug" evidence=IMP]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 CGD:CAL0005344
            Pfam:PF07521 GO:GO:0005634 GO:GO:0042493 GO:GO:0046872
            GO:GO:0006397 GO:GO:0090305 GO:GO:0004519 eggNOG:COG1236
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            KO:K14403 InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            EMBL:AACQ01000196 EMBL:AACQ01000195 RefSeq:XP_711478.1
            RefSeq:XP_711502.1 ProteinModelPortal:Q59P50 STRING:Q59P50
            GeneID:3646887 GeneID:3646911 KEGG:cal:CaO19.12941
            KEGG:cal:CaO19.5486 Uniprot:Q59P50
        Length = 870

 Score = 138 (53.6 bits), Expect = 2.5e-06, P = 2.5e-06
 Identities = 38/122 (31%), Positives = 67/122 (54%)

Query:     3 DKLMKSFEGARN-NPFHFKHV-KLCHSLAELAKIEWMSDKL-MKSFEGARNNPFHFKHVK 59
             D+     E  +N N F+  ++ K C ++ E      M+DK+ + S    ++NPF FK++K
Sbjct:   352 DEYWSQNEDLQNVNVFYASNLAKKCMAVYE-TYTGIMNDKIRLSSASSEKSNPFDFKYIK 410

Query:    60 LCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDL 118
                 L++   +  P VV V+TP M + G SR L  +W    +N +I+T  +  GT+A++L
Sbjct:   411 SIKDLSKFQDM-GPSVV-VATPGMLQAGVSRQLLEKWAPDGKNLVILTGYSVEGTMAKEL 468

Query:   119 IE 120
             ++
Sbjct:   469 LK 470


>WB|WBGene00013460
            symbol:cpsf-3 species:6239 "Caenorhabditis elegans"
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0009792 "embryo
            development ending in birth or egg hatching" evidence=IMP]
            [GO:0040007 "growth" evidence=IMP] [GO:0002119 "nematode larval
            development" evidence=IMP] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 GO:GO:0009792 GO:GO:0040007
            GO:GO:0002119 GO:GO:0016787 eggNOG:COG1236 InterPro:IPR022712
            InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 HOGENOM:HOG000203394
            KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            GeneTree:ENSGT00700000104485 EMBL:AL132951 RefSeq:NP_502553.2
            ProteinModelPortal:Q95PY8 SMR:Q95PY8 STRING:Q95PY8 PaxDb:Q95PY8
            EnsemblMetazoa:Y67H2A.1.1 EnsemblMetazoa:Y67H2A.1.2 GeneID:178285
            KEGG:cel:CELE_Y67H2A.1 UCSC:Y67H2A.1 CTD:178285 WormBase:Y67H2A.1
            InParanoid:Q95PY8 NextBio:900506 Uniprot:Q95PY8
        Length = 707

 Score = 137 (53.3 bits), Expect = 2.5e-06, P = 2.5e-06
 Identities = 38/120 (31%), Positives = 59/120 (49%)

Query:     3 DKLMKSFEGARNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKL 60
             D+  +S +   + P ++     K C S+ +   +  M+ ++ K    A  NPF FKHV  
Sbjct:   255 DEYWESHQELHDIPVYYASSLAKKCMSVYQTF-VNGMNSRIQKQI--AVKNPFIFKHVST 311

Query:    61 CHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDLI 119
                + +      P VVL +TP M + GFSR+LF  WC   +N  II      GTLA+ ++
Sbjct:   312 LRGMDQFEDA-GPCVVL-ATPGMLQSGFSRELFESWCPDTKNGCIIAGYCVEGTLAKHIL 369


>UNIPROTKB|H0YJF4
            symbol:CPSF2 "Cleavage and polyadenylation-specificity
            factor subunit 2" species:9606 "Homo sapiens" [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=IEA] [GO:0006378 "mRNA polyadenylation" evidence=IEA]
            [GO:0006379 "mRNA cleavage" evidence=IEA] InterPro:IPR027075
            Pfam:PF07521 InterPro:IPR025069 InterPro:IPR011108
            PANTHER:PTHR11203:SF5 Pfam:PF13299 HGNC:HGNC:2325 ChiTaRS:CPSF2
            EMBL:AL121773 Ensembl:ENST00000555244 Uniprot:H0YJF4
        Length = 269

 Score = 127 (49.8 bits), Expect = 6.2e-06, P = 6.2e-06
 Identities = 26/46 (56%), Positives = 32/46 (69%)

Query:   162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRR 207
             +P H + F+NE +LSDFKQ L R GI  EF+ GVL+C    VAVRR
Sbjct:   225 VPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVC-NNQVAVRR 269


>UNIPROTKB|F1SD84
            symbol:LOC100625560 "Uncharacterized protein" species:9823
            "Sus scrofa" [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006378
            "mRNA polyadenylation" evidence=IEA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA]
            InterPro:IPR027075 Pfam:PF07521 InterPro:IPR025069
            InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF13299
            GeneTree:ENSGT00700000104551 EMBL:CU468363
            Ensembl:ENSSSCT00000002718 OMA:VEGCASE Uniprot:F1SD84
        Length = 304

 Score = 127 (49.8 bits), Expect = 8.5e-06, P = 8.5e-06
 Identities = 26/46 (56%), Positives = 32/46 (69%)

Query:   162 IPPHDTSFINELQLSDFKQTLQRNGIDCEFMDGVLICCRGTVAVRR 207
             +P H + F+NE +LSDFKQ L R GI  EF+ GVL+C    VAVRR
Sbjct:   230 VPGHQSVFMNEPRLSDFKQVLLREGIQAEFVGGVLVC-NNQVAVRR 274


>DICTYBASE|DDB_G0274799
            symbol:cpsf3 "cleavage and polyadenylation
            specificity factor 73 kDa subunit" species:44689 "Dictyostelium
            discoideum" [GO:0016787 "hydrolase activity" evidence=IEA]
            [GO:0003723 "RNA binding" evidence=IEA;IC] [GO:0006379 "mRNA
            cleavage" evidence=ISS] [GO:0006378 "mRNA polyadenylation"
            evidence=ISS] [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=ISS] [GO:0003730 "mRNA 3'-UTR
            binding" evidence=ISS] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0006397 "mRNA processing" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0004519 "endonuclease
            activity" evidence=IEA] [GO:0004518 "nuclease activity"
            evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            dictyBase:DDB_G0274799 Pfam:PF07521 GO:GO:0046872 GO:GO:0006378
            GenomeReviews:CM000151_GR EMBL:AAFI02000012 GO:GO:0003730
            GO:GO:0004519 GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            RefSeq:XP_643926.1 ProteinModelPortal:Q86A79 SMR:Q86A79
            STRING:Q86A79 EnsemblProtists:DDB0233696 GeneID:8619353
            KEGG:ddi:DDB_G0274799 ProtClustDB:CLSZ2431003 Uniprot:Q86A79
        Length = 774

 Score = 131 (51.2 bits), Expect = 1.4e-05, P = 1.4e-05
 Identities = 30/86 (34%), Positives = 46/86 (53%)

Query:    34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECGFSRDLFF 93
             I  M+D++   F+ +  NPF FKH+K    + E      P V + S   ++ G SR LF 
Sbjct:   313 INMMNDRVRAQFDVS--NPFEFKHIKNIKGI-ESFDDRGPCVFMASPGMLQSGLSRQLFE 369

Query:    94 QWCSSPENSIIITNRTSPGTLARDLI 119
             +WCS   N I+I   +  GTLA+ ++
Sbjct:   370 RWCSDKRNGIVIPGYSVEGTLAKHIM 395


>POMBASE|SPAC17G6.16c
            symbol:ysh1 "mRNA cleavage and polyadenylation
            specificity factor complex endoribonuclease subunit Ysh1"
            species:4896 "Schizosaccharomyces pombe" [GO:0004521
            "endoribonuclease activity" evidence=ISO] [GO:0005634 "nucleus"
            evidence=IDA] [GO:0005829 "cytosol" evidence=IDA] [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=IDA] [GO:0006378 "mRNA polyadenylation" evidence=IC]
            [GO:0006379 "mRNA cleavage" evidence=IC] [GO:0046872 "metal ion
            binding" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 PomBase:SPAC17G6.16c Pfam:PF07521 GO:GO:0005829
            EMBL:CU329670 GenomeReviews:CU329670_GR GO:GO:0046872 GO:GO:0006378
            GO:GO:0004521 GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            HOGENOM:HOG000203394 InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            OrthoDB:EOG41ZJK7 PIR:T37848 RefSeq:NP_594263.2 STRING:O13794
            EnsemblFungi:SPAC17G6.16c.1 GeneID:2542258 NextBio:20803322
            Uniprot:O13794
        Length = 757

 Score = 127 (49.8 bits), Expect = 3.8e-05, P = 3.8e-05
 Identities = 30/108 (27%), Positives = 56/108 (51%)

Query:    13 RNNPFHFKH--VKLCHSLAELAKIEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKV 70
             R+ P ++     + C ++ +   +  M+D + K F  A  NPF F+ VK   +L +   +
Sbjct:   271 RSVPIYYASSLARKCMAIFQ-TYVNMMNDNIRKIF--AERNPFIFRFVKSLRNLEKFDDI 327

Query:    71 PSPKVVLVSTPDMECGFSRDLFFQWCSSPENSIIITNRTSPGTLARDL 118
               P V+L S   ++ G SR L  +W   P N++++T  +  GT+A+ +
Sbjct:   328 -GPSVILASPGMLQNGVSRTLLERWAPDPRNTLLLTGYSVEGTMAKQI 374


>WB|WBGene00008642
            symbol:F10B5.8 species:6239 "Caenorhabditis elegans"
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0009792 "embryo
            development ending in birth or egg hatching" evidence=IMP]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0009792 EMBL:Z48334 GO:GO:0016787 eggNOG:COG1236
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            HOGENOM:HOG000231294 KO:K13148 OMA:MAVEYMS
            GeneTree:ENSGT00700000104485 PIR:T20694 RefSeq:NP_495706.2
            ProteinModelPortal:Q9U3K2 SMR:Q9U3K2 STRING:Q9U3K2 PaxDb:Q9U3K2
            EnsemblMetazoa:F10B5.8 GeneID:174310 KEGG:cel:CELE_F10B5.8
            UCSC:F10B5.8 CTD:174310 WormBase:F10B5.8 InParanoid:Q9U3K2
            NextBio:883468 Uniprot:Q9U3K2
        Length = 608

 Score = 123 (48.4 bits), Expect = 8.1e-05, P = 8.1e-05
 Identities = 37/106 (34%), Positives = 51/106 (48%)

Query:    34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
             I W ++ + K+F     N F FKH+K      E    P P+V L STP M   G S  +F
Sbjct:   288 ISWTNENIKKTF--VERNMFEFKHIKPMEKGCE--DQPGPQV-LFSTPGMLHGGQSLKVF 342

Query:    93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKR-IRL 137
              +WCS P N II+      GT+   +I   G + + +  K   IRL
Sbjct:   343 KKWCSDPLNMIIMPGYCVAGTVGARVIN--GEKKIEIDQKMHEIRL 386


>CGD|CAL0004705
            symbol:orf19.325 species:5476 "Candida albicans" [GO:0042493
            "response to drug" evidence=IMP] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA]
            [GO:0003723 "RNA binding" evidence=IEA] [GO:0006379 "mRNA cleavage"
            evidence=IEA] [GO:0006378 "mRNA polyadenylation" evidence=IEA]
            InterPro:IPR027075 CGD:CAL0004705 GO:GO:0042493 EMBL:AACQ01000027
            EMBL:AACQ01000026 eggNOG:COG1236 KO:K14402 InterPro:IPR022712
            InterPro:IPR025069 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
            SMART:SM01027 RefSeq:XP_720020.1 RefSeq:XP_720152.1
            ProteinModelPortal:Q5AEE3 STRING:Q5AEE3 GeneID:3638181
            GeneID:3638320 KEGG:cal:CaO19.325 KEGG:cal:CaO19.7957
            Uniprot:Q5AEE3
        Length = 931

 Score = 108 (43.1 bits), Expect = 0.00021, Sum P(3) = 0.00021
 Identities = 25/78 (32%), Positives = 40/78 (51%)

Query:    34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECG-FSRDLF 92
             ++WMS    K +E   + PF+   V L    +EL K+  PK+V  S  D+  G  S + F
Sbjct:   286 LDWMSKSFTKEWEELSSVPFNPSKVDLLLDPSELLKLSGPKIVFCSGIDLRSGDISAEAF 345

Query:    93 FQWCSSPENSIIITNRTS 110
                C+    +II+T +T+
Sbjct:   346 QYLCNDEHTTIILTEKTT 363

 Score = 50 (22.7 bits), Expect = 0.00021, Sum P(3) = 0.00021
 Identities = 13/40 (32%), Positives = 22/40 (55%)

Query:   170 INELQLSDFKQTLQRNGIDCEFM-DGVLICCRGTVAVRRV 208
             I  ++L D K+ LQ   +  EF  +G L+     +AVR++
Sbjct:   856 IGNIRLPDLKKKLQNLNMTAEFKSEGTLVV-NDILAVRKI 894

 Score = 44 (20.5 bits), Expect = 0.00021, Sum P(3) = 0.00021
 Identities = 8/30 (26%), Positives = 18/30 (60%)

Query:   245 DDSDSNVIVLEGCLSDEYYRVQQLLYDQYA 274
             +  +S  IV++G +   YY+V++ + +  A
Sbjct:   900 ESDESGDIVIDGNVGPLYYKVKECIREMLA 929


>UNIPROTKB|Q5AEE3
            symbol:CFT2 "Putative uncharacterized protein CFT2"
            species:237561 "Candida albicans SC5314" [GO:0042493 "response to
            drug" evidence=IMP] InterPro:IPR027075 CGD:CAL0004705 GO:GO:0042493
            EMBL:AACQ01000027 EMBL:AACQ01000026 eggNOG:COG1236 KO:K14402
            InterPro:IPR022712 InterPro:IPR025069 PANTHER:PTHR11203:SF5
            Pfam:PF10996 Pfam:PF13299 SMART:SM01027 RefSeq:XP_720020.1
            RefSeq:XP_720152.1 ProteinModelPortal:Q5AEE3 STRING:Q5AEE3
            GeneID:3638181 GeneID:3638320 KEGG:cal:CaO19.325
            KEGG:cal:CaO19.7957 Uniprot:Q5AEE3
        Length = 931

 Score = 108 (43.1 bits), Expect = 0.00021, Sum P(3) = 0.00021
 Identities = 25/78 (32%), Positives = 40/78 (51%)

Query:    34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDMECG-FSRDLF 92
             ++WMS    K +E   + PF+   V L    +EL K+  PK+V  S  D+  G  S + F
Sbjct:   286 LDWMSKSFTKEWEELSSVPFNPSKVDLLLDPSELLKLSGPKIVFCSGIDLRSGDISAEAF 345

Query:    93 FQWCSSPENSIIITNRTS 110
                C+    +II+T +T+
Sbjct:   346 QYLCNDEHTTIILTEKTT 363

 Score = 50 (22.7 bits), Expect = 0.00021, Sum P(3) = 0.00021
 Identities = 13/40 (32%), Positives = 22/40 (55%)

Query:   170 INELQLSDFKQTLQRNGIDCEFM-DGVLICCRGTVAVRRV 208
             I  ++L D K+ LQ   +  EF  +G L+     +AVR++
Sbjct:   856 IGNIRLPDLKKKLQNLNMTAEFKSEGTLVV-NDILAVRKI 894

 Score = 44 (20.5 bits), Expect = 0.00021, Sum P(3) = 0.00021
 Identities = 8/30 (26%), Positives = 18/30 (60%)

Query:   245 DDSDSNVIVLEGCLSDEYYRVQQLLYDQYA 274
             +  +S  IV++G +   YY+V++ + +  A
Sbjct:   900 ESDESGDIVIDGNVGPLYYKVKECIREMLA 929


>UNIPROTKB|G4N6C6
            symbol:MGG_06570 "Uncharacterized protein" species:242507
            "Magnaporthe oryzae 70-15" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005634 "nucleus" evidence=ISS] [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=ISS] [GO:0005849 "mRNA cleavage factor complex"
            evidence=ISS] [GO:0006378 "mRNA polyadenylation" evidence=ISS]
            [GO:0006379 "mRNA cleavage" evidence=ISS] InterPro:IPR027075
            Pfam:PF07521 GO:GO:0006378 EMBL:CM001234 GO:GO:0005847
            GO:GO:0006379 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
            InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
            SMART:SM01027 RefSeq:XP_003716967.1 EnsemblFungi:MGG_06570T0
            GeneID:2684725 KEGG:mgr:MGG_06570 Uniprot:G4N6C6
        Length = 962

 Score = 86 (35.3 bits), Expect = 0.00026, Sum P(3) = 0.00026
 Identities = 22/79 (27%), Positives = 39/79 (49%)

Query:    46 EGARNNPFHFKHVKLCHSLAELAKVPSP-------KVVLVSTPDMECGFSRDLFFQWCSS 98
             +G    PF FK+++L    A++ K+  P       KV+L +   +E GFS+D+     + 
Sbjct:   362 KGKDGGPFDFKYLRLLDRKAQVLKLLEPSTDELRGKVILATDTSLEWGFSKDIISAIAND 421

Query:    99 PENSIIITNRTSPGTLARD 117
               N +I+  +  P   +RD
Sbjct:   422 SRNMVILPEK--PAESSRD 438

 Score = 65 (27.9 bits), Expect = 0.00026, Sum P(3) = 0.00026
 Identities = 15/39 (38%), Positives = 25/39 (64%)

Query:   170 INELQLSDFKQTLQRNGIDCEFM-DGVLICCRGTVAVRR 207
             + EL+L+D ++T+Q  G   +F  +G L+   GTV VR+
Sbjct:   880 VGELRLADLRRTMQNLGHSADFRGEGTLLI-DGTVVVRK 917

 Score = 54 (24.1 bits), Expect = 0.00026, Sum P(3) = 0.00026
 Identities = 11/30 (36%), Positives = 17/30 (56%)

Query:    26 HSLAELAK--IEWMSDKLMKSFEGARNNPF 53
             HS  +LAK   EWM + +++ FE   +  F
Sbjct:   320 HSTIKLAKSMFEWMDNSIVQEFEAGADQGF 349


>UNIPROTKB|F1RJE8 [details] [associations]
            symbol:CPSF3L "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 KO:K13148 OMA:MAVEYMS GeneTree:ENSGT00700000104485
            EMBL:FP102596 RefSeq:XP_003127541.3 Ensembl:ENSSSCT00000003708
            GeneID:100523908 KEGG:ssc:100523908 Uniprot:F1RJE8
        Length = 599

 Score = 118 (46.6 bits), Expect = 0.00030, P = 0.00030
 Identities = 32/106 (30%), Positives = 52/106 (49%)

Query:    34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
             I W + K+ K+F   + N F FKH+K        A  P P VV  +TP M   G S  +F
Sbjct:   283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDRA--FADSPGPMVVF-ATPGMLHAGQSLQIF 337

Query:    93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
              +W  + +N +I+      GT+   +  L G R L L+ ++ + ++
Sbjct:   338 RKWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLELEGRQVLEVK 381


>MGI|MGI:1919207 [details] [associations]
            symbol:Cpsf3l "cleavage and polyadenylation specific factor
            3-like" species:10090 "Mus musculus" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0008150
            "biological_process" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 MGI:MGI:1919207 GO:GO:0005737 GO:GO:0016787
            eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 HOGENOM:HOG000231294 KO:K13148 OMA:MAVEYMS
            GeneTree:ENSGT00700000104485 GO:GO:0032039 GO:GO:0016180
            HOVERGEN:HBG080215 OrthoDB:EOG4GXFMB CTD:54973 EMBL:AK010425
            EMBL:AK090206 EMBL:AK150436 EMBL:AK152740 EMBL:AK167607
            EMBL:AK172533 EMBL:BC008240 EMBL:BC011155 IPI:IPI00467084
            RefSeq:NP_082296.1 UniGene:Mm.259270 UniGene:Mm.475640
            ProteinModelPortal:Q9CWS4 SMR:Q9CWS4 STRING:Q9CWS4
            PhosphoSite:Q9CWS4 PaxDb:Q9CWS4 PRIDE:Q9CWS4
            Ensembl:ENSMUST00000030901 GeneID:71957 KEGG:mmu:71957
            InParanoid:Q9CWS4 NextBio:335052 Bgee:Q9CWS4 Genevestigator:Q9CWS4
            GermOnline:ENSMUSG00000029034 Uniprot:Q9CWS4
        Length = 600

 Score = 117 (46.2 bits), Expect = 0.00038, P = 0.00038
 Identities = 31/106 (29%), Positives = 52/106 (49%)

Query:    34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
             I W + K+ K+F   + N F FKH+K        A  P P VV  +TP M   G S  +F
Sbjct:   283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDRT--FADNPGPMVVF-ATPGMLHAGQSLQIF 337

Query:    93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
              +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct:   338 RKWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQMLEVK 381


>RGD|1306841 [details] [associations]
            symbol:Cpsf3l "cleavage and polyadenylation specific factor
            3-like" species:10116 "Rattus norvegicus" [GO:0005634 "nucleus"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 RGD:1306841 GO:GO:0005634 GO:GO:0005737
            GO:GO:0016787 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
            Pfam:PF10996 SMART:SM01027 HOGENOM:HOG000231294 KO:K13148
            OMA:MAVEYMS GeneTree:ENSGT00700000104485 HOVERGEN:HBG080215
            OrthoDB:EOG4GXFMB CTD:54973 EMBL:BC105303 IPI:IPI00365477
            RefSeq:NP_001029064.1 UniGene:Rn.98615 ProteinModelPortal:Q3MHC2
            STRING:Q3MHC2 Ensembl:ENSRNOT00000026725 GeneID:298688
            KEGG:rno:298688 InParanoid:Q3MHC2 NextBio:644186
            Genevestigator:Q3MHC2 GermOnline:ENSRNOG00000019712 Uniprot:Q3MHC2
        Length = 600

 Score = 117 (46.2 bits), Expect = 0.00038, P = 0.00038
 Identities = 31/106 (29%), Positives = 52/106 (49%)

Query:    34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
             I W + K+ K+F   + N F FKH+K        A  P P VV  +TP M   G S  +F
Sbjct:   283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDRT--FADNPGPMVVF-ATPGMLHAGQSLQIF 337

Query:    93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
              +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct:   338 RKWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQMLEVK 381


>UNIPROTKB|E1B7Q9 [details] [associations]
            symbol:CPSF3L "Integrator complex subunit 11" species:9913
            "Bos taurus" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 OMA:MAVEYMS GeneTree:ENSGT00700000104485
            EMBL:DAAA02043243 IPI:IPI00971575 Ensembl:ENSBTAT00000010020
            Uniprot:E1B7Q9
        Length = 598

 Score = 116 (45.9 bits), Expect = 0.00049, P = 0.00049
 Identities = 31/106 (29%), Positives = 52/106 (49%)

Query:    34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
             I W + K+ K+F   + N F FKH+K        A  P P VV  +TP M   G S  +F
Sbjct:   282 IPWTNQKIRKTF--VQRNMFEFKHIKAFDRA--FADSPGPMVVF-ATPGMLHAGQSLQIF 336

Query:    93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
              +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct:   337 RKWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 380


>UNIPROTKB|Q2YDM2 [details] [associations]
            symbol:CPSF3L "Integrator complex subunit 11" species:9913
            "Bos taurus" [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 GO:GO:0005634 GO:GO:0005737 GO:GO:0016787
            eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 HOGENOM:HOG000231294 EMBL:BC110155 IPI:IPI00725178
            UniGene:Bt.4894 ProteinModelPortal:Q2YDM2 STRING:Q2YDM2
            PRIDE:Q2YDM2 HOVERGEN:HBG080215 InParanoid:Q2YDM2 OrthoDB:EOG4GXFMB
            Uniprot:Q2YDM2
        Length = 599

 Score = 116 (45.9 bits), Expect = 0.00049, P = 0.00049
 Identities = 31/106 (29%), Positives = 52/106 (49%)

Query:    34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
             I W + K+ K+F   + N F FKH+K        A  P P VV  +TP M   G S  +F
Sbjct:   283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDRA--FADSPGPMVVF-ATPGMLHAGQSLQIF 337

Query:    93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
              +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct:   338 RKWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381


>UNIPROTKB|F1NV30 [details] [associations]
            symbol:CPSF3L "Integrator complex subunit 11" species:9031
            "Gallus gallus" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 OMA:MAVEYMS GeneTree:ENSGT00700000104485
            IPI:IPI00571913 EMBL:AADN02040858 Ensembl:ENSGALT00000002586
            Uniprot:F1NV30
        Length = 600

 Score = 115 (45.5 bits), Expect = 0.00065, P = 0.00065
 Identities = 31/106 (29%), Positives = 52/106 (49%)

Query:    34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
             I W + K+ K+F   + N F FKH+K        A  P P VV  +TP M   G S  +F
Sbjct:   283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDRA--FADNPGPMVVF-ATPGMLHAGQSLQIF 337

Query:    93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
              +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct:   338 RKWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQILEVK 381


>UNIPROTKB|Q5ZIH0 [details] [associations]
            symbol:CPSF3L "Integrator complex subunit 11" species:9031
            "Gallus gallus" [GO:0016787 "hydrolase activity" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 GO:GO:0016787 eggNOG:COG1236
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            HOGENOM:HOG000231294 KO:K13148 GO:GO:0032039 GO:GO:0016180
            HOVERGEN:HBG080215 OrthoDB:EOG4GXFMB EMBL:AJ720814 IPI:IPI00571913
            RefSeq:NP_001012854.1 UniGene:Gga.13403 ProteinModelPortal:Q5ZIH0
            STRING:Q5ZIH0 GeneID:419418 KEGG:gga:419418 CTD:54973
            InParanoid:Q5ZIH0 NextBio:20822477 Uniprot:Q5ZIH0
        Length = 600

 Score = 115 (45.5 bits), Expect = 0.00065, P = 0.00065
 Identities = 31/106 (29%), Positives = 52/106 (49%)

Query:    34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
             I W + K+ K+F   + N F FKH+K        A  P P VV  +TP M   G S  +F
Sbjct:   283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDRA--FADNPGPMVVF-ATPGMLHAGQSLQIF 337

Query:    93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
              +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct:   338 RKWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQILEVK 381


>UNIPROTKB|E2QY53 [details] [associations]
            symbol:CPSF3L "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 KO:K13148 OMA:MAVEYMS GeneTree:ENSGT00700000104485
            EMBL:AAEX03003844 RefSeq:XP_003639102.1 Ensembl:ENSCAFT00000030626
            GeneID:100855777 KEGG:cfa:100855777 Uniprot:E2QY53
        Length = 600

 Score = 115 (45.5 bits), Expect = 0.00065, P = 0.00065
 Identities = 31/106 (29%), Positives = 52/106 (49%)

Query:    34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
             I W + K+ K+F   + N F FKH+K        A  P P VV  +TP M   G S  +F
Sbjct:   283 ITWTNQKIRKTF--VQRNMFEFKHIKAFDRA--FADNPGPMVVF-ATPGMLHAGQSLQIF 337

Query:    93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
              +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct:   338 RKWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381


>UNIPROTKB|Q5TA45 [details] [associations]
            symbol:CPSF3L "Integrator complex subunit 11" species:9606
            "Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0005737 "cytoplasm"
            evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 GO:GO:0005634 GO:GO:0005737 GO:GO:0016787
            EMBL:AL139287 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
            Pfam:PF10996 SMART:SM01027 KO:K13148 HOVERGEN:HBG080215
            OrthoDB:EOG4GXFMB CTD:54973 EMBL:AL136813 EMBL:AK000549
            EMBL:AK021939 EMBL:AK023356 EMBL:AK297350 EMBL:CR533557
            EMBL:BC000675 EMBL:BC007978 EMBL:BC013904 EMBL:BK005728
            EMBL:BK005673 IPI:IPI00063404 IPI:IPI00306882 IPI:IPI00514973
            RefSeq:NP_001243392.1 RefSeq:NP_060341.2 UniGene:Hs.6449
            ProteinModelPortal:Q5TA45 SMR:Q5TA45 IntAct:Q5TA45
            MINT:MINT-1482228 STRING:Q5TA45 PhosphoSite:Q5TA45 DMDM:118572557
            PaxDb:Q5TA45 PRIDE:Q5TA45 DNASU:54973 Ensembl:ENST00000419704
            Ensembl:ENST00000435064 Ensembl:ENST00000450926
            Ensembl:ENST00000545578 GeneID:54973 KEGG:hsa:54973 UCSC:uc001aee.1
            UCSC:uc001aeh.1 UCSC:uc009vjz.1 GeneCards:GC01M001236
            HGNC:HGNC:26052 HPA:HPA028379 HPA:HPA029025 MIM:611354
            neXtProt:NX_Q5TA45 PharmGKB:PA142672080 InParanoid:Q5TA45
            PhylomeDB:Q5TA45 ChiTaRS:CPSF3L GenomeRNAi:54973 NextBio:58222
            ArrayExpress:Q5TA45 Bgee:Q5TA45 Genevestigator:Q5TA45
            GermOnline:ENSG00000127054 Uniprot:Q5TA45
        Length = 600

 Score = 115 (45.5 bits), Expect = 0.00065, P = 0.00065
 Identities = 31/106 (29%), Positives = 52/106 (49%)

Query:    34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
             I W + K+ K+F   + N F FKH+K        A  P P VV  +TP M   G S  +F
Sbjct:   283 IPWTNQKIRKTF--VQRNMFEFKHIKAFDRA--FADNPGPMVVF-ATPGMLHAGQSLQIF 337

Query:    93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
              +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct:   338 RKWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 381


>UNIPROTKB|G3V1S5 [details] [associations]
            symbol:CPSF3L "Integrator complex subunit 11" species:9606
            "Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0016787 EMBL:AL139287 EMBL:CH471183 InterPro:IPR022712
            InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 KO:K13148 OMA:MAVEYMS
            CTD:54973 UniGene:Hs.6449 GeneID:54973 KEGG:hsa:54973
            HGNC:HGNC:26052 ChiTaRS:CPSF3L GenomeRNAi:54973
            RefSeq:NP_001243385.1 ProteinModelPortal:G3V1S5 SMR:G3V1S5
            Ensembl:ENST00000540437 ArrayExpress:G3V1S5 Bgee:G3V1S5
            Uniprot:G3V1S5
        Length = 606

 Score = 115 (45.5 bits), Expect = 0.00065, P = 0.00065
 Identities = 31/106 (29%), Positives = 52/106 (49%)

Query:    34 IEWMSDKLMKSFEGARNNPFHFKHVKLCHSLAELAKVPSPKVVLVSTPDM-ECGFSRDLF 92
             I W + K+ K+F   + N F FKH+K        A  P P VV  +TP M   G S  +F
Sbjct:   289 IPWTNQKIRKTF--VQRNMFEFKHIKAFDRA--FADNPGPMVVF-ATPGMLHAGQSLQIF 343

Query:    93 FQWCSSPENSIIITNRTSPGTLARDLIELGGNRTLTLQVKKRIRLE 138
              +W  + +N +I+      GT+   +  L G R L ++ ++ + ++
Sbjct:   344 RKWAGNEKNMVIMPGYCVQGTVGHKI--LSGQRKLEMEGRQVLEVK 387
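
Note on the "Expect" and "P" columns: for the single-HSP hits above the two values print identically (for example Expect = 0.00065, P = 0.00065). The sketch below assumes the standard Poisson relation P = 1 - exp(-E), under which P and E agree to the displayed precision when E is small. The function name p_from_e is hypothetical, and the sketch does not model the "Sum P(N)" statistics reported for hits with several HSPs.

```python
import math

def p_from_e(e_value: float) -> float:
    """Probability of at least one chance hit, assuming P = 1 - exp(-E)."""
    # -expm1(-E) equals 1 - exp(-E) but stays accurate for very small E.
    return -math.expm1(-e_value)

# Expect values copied from the single-HSP hits in this report.
for e in (0.00030, 0.00038, 0.00049, 0.00065):
    print(f"E = {e:.5f}  ->  P = {p_from_e(e):.5f}")
```

The two quantities only diverge noticeably once E approaches 1, well above the E=0.001 cutoff used for this search.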


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.323   0.137   0.420    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      276       257   0.00086  114 3  11 22  0.41    33
                                                     32  0.48    36
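
Note on bit scores: the bit values printed next to each raw score (for example "Score = 118 (46.6 bits)") appear to follow the Karlin-Altschul normalization bits = (Lambda * S - ln K) / ln 2, using the gapped Lambda and K from the "Q=9,R=2" row above. The sketch below is an illustration under that assumption, not the search program's own code, and the function name bit_score is hypothetical.

```python
import math

# Gapped parameters copied from the "Q=9,R=2" row of this report.
LAMBDA = 0.244
K = 0.0300

def bit_score(raw_score: int, lam: float = LAMBDA, k: float = K) -> float:
    """Normalize a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)

# Raw scores taken from hits listed in this report.
for raw in (118, 117, 116, 115, 86, 50):
    print(f"Score = {raw} ({bit_score(raw):.1f} bits)")
```

With these inputs the rounded results match the bit values shown for the corresponding hits. The ungapped parameters (Lambda 0.323, K 0.137) do not reproduce them.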


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  49
  No. of states in DFA:  601 (64 KB)
  Total size of DFA:  198 KB (2110 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  19.81u 0.07s 19.88t   Elapsed:  00:00:17
  Total cpu time:  19.81u 0.07s 19.88t   Elapsed:  00:00:17
  Start:  Thu Aug 15 12:46:15 2013   End:  Thu Aug 15 12:46:32 2013
