BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>psy8348
MKTLKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHF
HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSM
DKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAE
IPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLI
LDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLK
GIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPE
EVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALT
REYEDDPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNY
HLLAPSDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLDEKRLRAFACIE
ITLEKCIVVLEWASNPISDMYADSLISECLIEILVEMYGEAAVPKMFKGEKITITVDKKK
ACIDLVDLSVQCEDSKLKSTVQ
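
As a quick sanity check, the query record above can be read and its length compared with the "622 letters" stated in the raw report. This is a minimal sketch only; `read_fasta` and `QUERY_FASTA` are hypothetical names introduced for illustration and are not part of the BLAST service.

```python
def read_fasta(text):
    """Return (header, sequence) for a single-record FASTA string."""
    lines = [ln.strip() for ln in text.strip().splitlines() if ln.strip()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("expected a FASTA header line starting with '>'")
    header = lines[0][1:]            # drop the leading '>'
    sequence = "".join(lines[1:])    # join the wrapped sequence lines
    return header, sequence


# The query record exactly as listed in this report.
QUERY_FASTA = """>psy8348
MKTLKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHF
HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSM
DKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAE
IPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLI
LDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLK
GIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPE
EVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALT
REYEDDPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNY
HLLAPSDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLDEKRLRAFACIE
ITLEKCIVVLEWASNPISDMYADSLISECLIEILVEMYGEAAVPKMFKGEKITITVDKKK
ACIDLVDLSVQCEDSKLKSTVQ
"""

header, sequence = read_fasta(QUERY_FASTA)
print(header, len(sequence))
```

The printed length should agree with the letter count given on the `Query=` line of the raw report.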

High Scoring Gene Products

Symbol | Full name | Information | P value
Cpsf73 | Cleavage and polyadenylation specificity factor 73 | protein from Drosophila melanogaster | 1.2e-256
CPSF3 | Uncharacterized protein | protein from Canis lupus familiaris | 1.9e-236
CPSF3 | Cleavage and polyadenylation specificity factor subunit 3 | protein from Bos taurus | 2.4e-236
LOC100622181 | Uncharacterized protein | protein from Sus scrofa | 3.5e-235
CPSF3 | Uncharacterized protein | protein from Gallus gallus | 1.2e-234
cpsf3 | cleavage and polyadenylation specific factor 3 | gene_product from Danio rerio | 7.3e-233
CPSF3 | Cleavage and polyadenylation specificity factor subunit 3 | protein from Homo sapiens | 1.1e-231
Cpsf3 | cleavage and polyadenylation specificity factor 3 | protein from Mus musculus | 4.6e-231
Cpsf3 | cleavage and polyadenylation specific factor 3, 73kDa | gene from Rattus norvegicus | 9.6e-231
CPSF3 | Cleavage and polyadenylation specific factor 3, 73kDa, isoform CRA_b | protein from Homo sapiens | 1.1e-222
cpsf-3 |  | gene from Caenorhabditis elegans | 1.3e-196
cpsf3 | cleavage and polyadenylation specificity factor 73 kDa subunit | gene from Dictyostelium discoideum | 1.1e-172
CPSF73-I | cleavage and polyadenylation specificity factor 73-I | protein from Arabidopsis thaliana | 6.2e-172
YSH1 | Putative endoribonuclease | gene from Saccharomyces cerevisiae | 1.2e-144
orf19.5486 |  | gene_product from Candida albicans | 1.7e-143
YSH1 | Endoribonuclease YSH1 | protein from Candida albicans SC5314 | 1.7e-143
PF14_0364 | cleavage and polyadenylation specifity factor protein, putative | gene from Plasmodium falciparum | 1.0e-111
PF14_0364 | Cleavage and polyadenylation specificity factor protein, putative | protein from Plasmodium falciparum 3D7 | 1.0e-111
CPSF3L | Integrator complex subunit 11 | protein from Gallus gallus | 9.6e-89
CPSF3L | Integrator complex subunit 11 | protein from Gallus gallus | 9.6e-89
Cpsf3l | cleavage and polyadenylation specific factor 3-like | protein from Mus musculus | 1.4e-87
CPSF3L | Integrator complex subunit 11 | protein from Bos taurus | 1.8e-87
Cpsf3l | cleavage and polyadenylation specific factor 3-like | gene from Rattus norvegicus | 1.8e-87
CPSF3L | Uncharacterized protein | protein from Canis lupus familiaris | 1.3e-86
CPSF3L | Integrator complex subunit 11 | protein from Homo sapiens | 1.6e-86
CPSF3L | Integrator complex subunit 11 | protein from Bos taurus | 2.0e-86
F10B5.8 |  | gene from Caenorhabditis elegans | 2.1e-86
LOC100523908 | Uncharacterized protein | protein from Sus scrofa | 1.8e-85
IntS11 | Integrator 11 | protein from Drosophila melanogaster | 3.4e-84
ints11 | integrator complex subunit 11 | gene from Dictyostelium discoideum | 1.1e-80
cpsf3l | cleavage and polyadenylation specific factor 3-like | gene_product from Danio rerio | 1.1e-78
CPSF73-II | AT2G01730 | protein from Arabidopsis thaliana | 1.4e-73
PFC0825c | cleavage and polyadenylation specificity factor protein, putative | gene from Plasmodium falciparum | 4.0e-62
PFC0825c | Cleavage and polyadenylation specificity factor protein, putative | protein from Plasmodium falciparum 3D7 | 4.0e-62
CPSF3 | Cleavage and polyadenylation-specificity factor subunit 3 | protein from Homo sapiens | 2.2e-59
CPSF3L | Integrator complex subunit 11 | protein from Homo sapiens | 2.2e-46
CPSF3L | Integrator complex subunit 11 | protein from Homo sapiens | 2.1e-45
CPSF3L | Integrator complex subunit 11 | protein from Homo sapiens | 4.0e-38
CPS_2623 | metallo-beta-lactamase family protein | protein from Colwellia psychrerythraea 34H | 5.2e-38
CPSF3L | Integrator complex subunit 11 | protein from Homo sapiens | 1.5e-37
CPSF2 | Uncharacterized protein | protein from Canis lupus familiaris | 1.3e-36
CPSF2 | Cleavage and polyadenylation specificity factor subunit 2 | protein from Homo sapiens | 1.3e-36
CPSF2 | Uncharacterized protein | protein from Gallus gallus | 1.4e-36
CPSF2 | Cleavage and polyadenylation specificity factor subunit 2 | protein from Bos taurus | 1.6e-36
cpsf2 | Cleavage and polyadenylation specificity factor subunit 2 | protein from Xenopus laevis | 2.1e-36
Cpsf2 | cleavage and polyadenylation specific factor 2, 100kDa | gene from Rattus norvegicus | 5.1e-36
Cpsf2 | cleavage and polyadenylation specific factor 2 | protein from Mus musculus | 6.7e-36
CPSF2 | Uncharacterized protein | protein from Sus scrofa | 1.2e-35
cpsf2 | cleavage and polyadenylation specific factor 2 | gene_product from Danio rerio | 3.5e-35
cpsf-2 |  | gene from Caenorhabditis elegans | 1.2e-34
cpsf-2 | Probable cleavage and polyadenylation specificity factor subunit 2 | protein from Caenorhabditis elegans | 1.2e-34
VC_0264 | Putative uncharacterized protein | protein from Vibrio cholerae O1 biovar El Tor str. N16961 | 3.5e-34
VC_0264 | conserved hypothetical protein | protein from Vibrio cholerae O1 biovar El Tor | 3.5e-34
Cpsf100 | Cleavage and polyadenylation specificity factor 100 | protein from Drosophila melanogaster | 6.9e-32
CPSF100 | cleavage and polyadenylation specificity factor 100 | protein from Arabidopsis thaliana | 1.8e-31
DET_1061 | metallo-beta-lactamase family protein | protein from Dehalococcoides ethenogenes 195 | 4.3e-30
cpsf2 | cleavage and polyadenylation specificity factor 100 kDa subunit | gene from Dictyostelium discoideum | 2.8e-29
CHY_2049 | metallo-beta-lactamase family protein | protein from Carboxydothermus hydrogenoformans Z-2901 | 7.0e-27
CPSF3L | Integrator complex subunit 11 | protein from Homo sapiens | 2.9e-22
BA_1737 | Metallo-beta-lactamase family protein | protein from Bacillus anthracis | 4.2e-21
BA_1737 | metallo-beta-lactamase family protein | protein from Bacillus anthracis str. Ames | 4.2e-21
GSU1843 | RNA exonuclease, beta-lactamase fold protein | protein from Geobacter sulfurreducens PCA | 1.4e-19
GSU_1843 | metallo-beta-lactamase family protein | protein from Geobacter sulfurreducens PCA | 1.4e-19
CPSF3L | Integrator complex subunit 11 | protein from Homo sapiens | 1.5e-19
ints9 | integrator complex subunit 9 | gene from Dictyostelium discoideum | 5.8e-17
Ints9 | integrator complex subunit 9 | gene from Rattus norvegicus | 3.0e-16
Ints9 | integrator complex subunit 9 | protein from Mus musculus | 4.0e-16
INTS9 | Integrator complex subunit 9 | protein from Homo sapiens | 1.4e-15
INTS9 | Uncharacterized protein | protein from Canis lupus familiaris | 2.4e-15
INTS9 | Integrator complex subunit 9 | protein from Bos taurus | 3.9e-15
INTS9 | Integrator complex subunit 9 | protein from Bos taurus | 5.0e-15
HNE_1669 | Uncharacterized protein | protein from Hyphomonas neptunium ATCC 15444 | 1.1e-14
INTS9 | Uncharacterized protein | protein from Sus scrofa | 1.8e-14
INTS9 | Integrator complex subunit 9, isoform CRA_a | protein from Homo sapiens | 2.3e-14
SO_0541 | RNA-metabolizing metallo-beta-lactamase family protein | protein from Shewanella oneidensis MR-1 | 3.8e-14
SO_0541 | metallo-beta-lactamase family protein | protein from Shewanella oneidensis MR-1 | 3.8e-14
INTS9 | Integrator complex subunit 9 | protein from Gallus gallus | 9.2e-13
INTS9 | Integrator complex subunit 9 | protein from Homo sapiens | 6.9e-12
IntS9 | Integrator 9 | protein from Drosophila melanogaster | 7.5e-12
ints9 | integrator complex subunit 9 | gene_product from Danio rerio | 6.3e-11
F19F10.12 |  | gene from Caenorhabditis elegans | 8.9e-10
orf19.325 |  | gene_product from Candida albicans | 1.0e-08
CFT2 | Putative uncharacterized protein CFT2 | protein from Candida albicans SC5314 | 1.0e-08
INTS9 | Integrator complex subunit 9 | protein from Homo sapiens | 1.7e-08
PSPTO_4134 | Uncharacterized protein | protein from Pseudomonas syringae pv. tomato str. DC3000 | 6.1e-05
INTS9 | Uncharacterized protein | protein from Canis lupus familiaris | 0.00014
AT3G07530 |  | protein from Arabidopsis thaliana | 0.00019
BA_1640 | Ribonuclease J | protein from Bacillus anthracis | 0.00022
BA_1640 | metallo-beta-lactamase family protein | protein from Bacillus anthracis str. Ames | 0.00022
CHY_1157 | metallo-beta-lactamase family protein | protein from Carboxydothermus hydrogenoformans Z-2901 | 0.00022
CBU_0596 | Metal-dependent hydrolase | protein from Coxiella burnetii RSA 493 | 0.00081
CBU_0596 | conserved hypothetical protein | protein from Coxiella burnetii RSA 493 | 0.00081

The BLAST search returned 2 gene products that did not match your query constraints. See the full BLAST report below for details.

Raw BLAST Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  psy8348
        (622 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

FB|FBgn0261065 - symbol:Cpsf73 "Cleavage and polyadenylat...  2368  1.2e-256  2
UNIPROTKB|E2R7R2 - symbol:CPSF3 "Uncharacterized protein"...  2233  1.9e-236  2
UNIPROTKB|P79101 - symbol:CPSF3 "Cleavage and polyadenyla...  2233  2.4e-236  2
UNIPROTKB|I3LKR1 - symbol:CPSF3 "Uncharacterized protein"...  2221  3.5e-235  2
UNIPROTKB|F1NKW5 - symbol:CPSF3 "Uncharacterized protein"...  2229  1.2e-234  2
ZFIN|ZDB-GENE-030131-3275 - symbol:cpsf3 "cleavage and po...  2246  7.3e-233  1
UNIPROTKB|Q9UKF6 - symbol:CPSF3 "Cleavage and polyadenyla...  2235  1.1e-231  1
MGI|MGI:1859328 - symbol:Cpsf3 "cleavage and polyadenylat...  2229  4.6e-231  1
UNIPROTKB|G3V6W7 - symbol:Cpsf3 "Protein Cpsf3" species:1...  2229  4.6e-231  1
RGD|1305767 - symbol:Cpsf3 "cleavage and polyadenylation ...  2226  9.6e-231  1
UNIPROTKB|G5E9W3 - symbol:CPSF3 "Cleavage and polyadenyla...  2150  1.1e-222  1
WB|WBGene00013460 - symbol:cpsf-3 species:6239 "Caenorhab...  1904  1.3e-196  1
DICTYBASE|DDB_G0274799 - symbol:cpsf3 "cleavage and polya...  1678  1.1e-172  1
TAIR|locus:2206076 - symbol:CPSF73-I "cleavage and polyad...  1671  6.2e-172  1
POMBASE|SPAC17G6.16c - symbol:ysh1 "mRNA cleavage and pol...  1516  1.7e-155  1
SGD|S000004267 - symbol:YSH1 "Putative endoribonuclease" ...  1325  1.2e-144  3
CGD|CAL0005344 - symbol:orf19.5486 species:5476 "Candida ...  1281  1.7e-143  3
UNIPROTKB|Q59P50 - symbol:YSH1 "Endoribonuclease YSH1" sp...  1281  1.7e-143  3
ASPGD|ASPL0000060573 - symbol:AN0990 species:162425 "Emer...   849  6.9e-114  4
GENEDB_PFALCIPARUM|PF14_0364 - symbol:PF14_0364 "cleavage...   812  1.0e-111  3
UNIPROTKB|Q8IL83 - symbol:PF14_0364 "Cleavage and polyade...   812  1.0e-111  3
UNIPROTKB|F1NV30 - symbol:CPSF3L "Integrator complex subu...   886  9.6e-89   1
UNIPROTKB|Q5ZIH0 - symbol:CPSF3L "Integrator complex subu...   886  9.6e-89   1
MGI|MGI:1919207 - symbol:Cpsf3l "cleavage and polyadenyla...   875  1.4e-87   1
UNIPROTKB|E1B7Q9 - symbol:CPSF3L "Integrator complex subu...   874  1.8e-87   1
RGD|1306841 - symbol:Cpsf3l "cleavage and polyadenylation...   874  1.8e-87   1
UNIPROTKB|E2QY53 - symbol:CPSF3L "Uncharacterized protein...   866  1.3e-86   1
UNIPROTKB|G3V1S5 - symbol:CPSF3L "Integrator complex subu...   865  1.6e-86   1
UNIPROTKB|Q5TA45 - symbol:CPSF3L "Integrator complex subu...   865  1.6e-86   1
UNIPROTKB|Q2YDM2 - symbol:CPSF3L "Integrator complex subu...   864  2.0e-86   1
WB|WBGene00008642 - symbol:F10B5.8 species:6239 "Caenorha...   845  2.1e-86   2
UNIPROTKB|F1RJE8 - symbol:CPSF3L "Uncharacterized protein...   855  1.8e-85   1
FB|FBgn0039691 - symbol:IntS11 "Integrator 11" species:72...   843  3.4e-84   1
DICTYBASE|DDB_G0278189 - symbol:ints11 "integrator comple...   810  1.1e-80   1
ZFIN|ZDB-GENE-050522-13 - symbol:cpsf3l "cleavage and pol...   791  1.1e-78   1
TAIR|locus:2065368 - symbol:CPSF73-II "AT2G01730" species...   743  1.4e-73   1
GENEDB_PFALCIPARUM|PFC0825c - symbol:PFC0825c "cleavage a...   542  4.0e-62   3
UNIPROTKB|O77371 - symbol:PFC0825c "Cleavage and polyaden...   542  4.0e-62   3
UNIPROTKB|C9JZH6 - symbol:CPSF3 "Cleavage and polyadenyla...   609  2.2e-59   1
UNIPROTKB|C9J979 - symbol:CPSF3L "Integrator complex subu...   268  2.2e-46   2
UNIPROTKB|E9PNS4 - symbol:CPSF3L "Integrator complex subu...   477  2.1e-45   1
UNIPROTKB|E9PI75 - symbol:CPSF3L "Integrator complex subu...   411  4.0e-38   1
TIGR_CMR|CPS_2623 - symbol:CPS_2623 "metallo-beta-lactama...   410  5.2e-38   1
UNIPROTKB|E9PIG1 - symbol:CPSF3L "Integrator complex subu...   406  1.5e-37   1
UNIPROTKB|E2R496 - symbol:CPSF2 "Uncharacterized protein"...   390  1.3e-36   2
UNIPROTKB|Q9P2I0 - symbol:CPSF2 "Cleavage and polyadenyla...   390  1.3e-36   2
UNIPROTKB|F1NMN0 - symbol:CPSF2 "Uncharacterized protein"...   388  1.4e-36   2
UNIPROTKB|Q10568 - symbol:CPSF2 "Cleavage and polyadenyla...   389  1.6e-36   2
UNIPROTKB|Q9W799 - symbol:cpsf2 "Cleavage and polyadenyla...   389  2.1e-36   2
RGD|1309687 - symbol:Cpsf2 "cleavage and polyadenylation ...   385  5.1e-36   2
MGI|MGI:1861601 - symbol:Cpsf2 "cleavage and polyadenylat...   384  6.7e-36   2
UNIPROTKB|F1SD85 - symbol:CPSF2 "Uncharacterized protein"...   389  1.2e-35   1
ZFIN|ZDB-GENE-040718-79 - symbol:cpsf2 "cleavage and poly...   380  3.5e-35   2
WB|WBGene00017313 - symbol:cpsf-2 species:6239 "Caenorhab...   383  1.2e-34   2
UNIPROTKB|O17403 - symbol:cpsf-2 "Probable cleavage and p...   383  1.2e-34   2
UNIPROTKB|Q9KV92 - symbol:VC_0264 "Putative uncharacteriz...   376  3.5e-34   1
TIGR_CMR|VC_0264 - symbol:VC_0264 "conserved hypothetical...   376  3.5e-34   1
FB|FBgn0027873 - symbol:Cpsf100 "Cleavage and polyadenyla...   354  6.9e-32   2
TAIR|locus:2172843 - symbol:CPSF100 "cleavage and polyade...   373  1.8e-31   1
TIGR_CMR|DET_1061 - symbol:DET_1061 "metallo-beta-lactama...   267  4.3e-30   2
DICTYBASE|DDB_G0270392 - symbol:cpsf2 "cleavage and polya...   352  2.8e-29   2
TIGR_CMR|CHY_2049 - symbol:CHY_2049 "metallo-beta-lactama...   326  7.0e-27   1
UNIPROTKB|E9PIL7 - symbol:CPSF3L "Integrator complex subu...   267  2.9e-22   1
POMBASE|SPBC1709.15c - symbol:cft2 "cleavage factor two C...   288  6.3e-22   1
UNIPROTKB|Q81SC3 - symbol:BA_1737 "Metallo-beta-lactamase...   272  4.2e-21   1
TIGR_CMR|BA_1737 - symbol:BA_1737 "metallo-beta-lactamase...   272  4.2e-21   1
UNIPROTKB|Q74C32 - symbol:GSU1843 "RNA exonuclease, beta-...   162  1.4e-19   2
TIGR_CMR|GSU_1843 - symbol:GSU_1843 "metallo-beta-lactama...   162  1.4e-19   2
UNIPROTKB|E9PQF0 - symbol:CPSF3L "Integrator complex subu...   242  1.5e-19   1
DICTYBASE|DDB_G0282473 - symbol:ints9 "integrator complex...   197  5.8e-17   3
RGD|1311539 - symbol:Ints9 "integrator complex subunit 9"...   191  3.0e-16   2
MGI|MGI:1098533 - symbol:Ints9 "integrator complex subuni...   186  4.0e-16   2
UNIPROTKB|Q9NV88 - symbol:INTS9 "Integrator complex subun...   182  1.4e-15   2
UNIPROTKB|F6XI08 - symbol:INTS9 "Uncharacterized protein"...   180  2.4e-15   2
UNIPROTKB|Q2KJA6 - symbol:INTS9 "Integrator complex subun...   178  3.9e-15   2
UNIPROTKB|F1MMA6 - symbol:INTS9 "Integrator complex subun...   177  5.0e-15   2
UNIPROTKB|Q0C1L6 - symbol:HNE_1669 "Putative uncharacteri...   183  1.1e-14   3
UNIPROTKB|F1RJQ5 - symbol:INTS9 "Uncharacterized protein"...   176  1.8e-14   2
UNIPROTKB|G3XAN1 - symbol:INTS9 "Integrator complex subun...   168  2.3e-14   2
UNIPROTKB|Q8EJC6 - symbol:SO_0541 "RNA-metabolizing metal...   213  3.8e-14   1
TIGR_CMR|SO_0541 - symbol:SO_0541 "metallo-beta-lactamase...   213  3.8e-14   1
UNIPROTKB|Q5ZKK2 - symbol:INTS9 "Integrator complex subun...   162  9.2e-13   2
UNIPROTKB|H7BYQ6 - symbol:INTS9 "Integrator complex subun...   182  6.9e-12   2
FB|FBgn0036570 - symbol:IntS9 "Integrator 9" species:7227...   144  7.5e-12   2
ZFIN|ZDB-GENE-061013-129 - symbol:ints9 "integrator compl...   148  6.3e-11   2
WB|WBGene00017608 - symbol:F19F10.12 species:6239 "Caenor...   154  8.9e-10   3
CGD|CAL0004705 - symbol:orf19.325 species:5476 "Candida a...   164  1.0e-08   3
UNIPROTKB|Q5AEE3 - symbol:CFT2 "Putative uncharacterized ...   164  1.0e-08   3
UNIPROTKB|H0YBH8 - symbol:INTS9 "Integrator complex subun...   151  1.7e-08   1
UNIPROTKB|Q87XP2 - symbol:PSPTO_4134 "Uncharacterized pro...   127  6.1e-05   1
UNIPROTKB|E2QVB2 - symbol:INTS9 "Uncharacterized protein"...   125  0.00014   1
TAIR|locus:2079696 - symbol:AT3G07530 "AT3G07530" species...    80  0.00019   3
UNIPROTKB|Q81SK8 - symbol:BA_1640 "Ribonuclease J" specie...   130  0.00022   2
TIGR_CMR|BA_1640 - symbol:BA_1640 "metallo-beta-lactamase...   130  0.00022   2
TIGR_CMR|CHY_1157 - symbol:CHY_1157 "metallo-beta-lactama...   113  0.00022   2
UNIPROTKB|Q83DU6 - symbol:CBU_0596 "Metal-dependent hydro...   114  0.00081   1
TIGR_CMR|CBU_0596 - symbol:CBU_0596 "conserved hypothetic...   114  0.00081   1
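
The one-line summaries above have a regular layout (identifier, truncated description, score, P(N), N), so they can be pulled apart with a regular expression. The following is a hedged sketch, not a parser shipped with WU-BLAST; `HIT_LINE` and `parse_hit_line` are hypothetical names, and the pattern assumes every summary line follows the layout shown in this report.

```python
import re

# identifier, " - ", description, then three whitespace-separated columns:
# High Score (integer), Smallest Sum Probability P(N), and N.
HIT_LINE = re.compile(
    r"^(?P<id>\S+) - (?P<desc>.*?)\s+(?P<score>\d+)\s+(?P<p>\S+)\s+(?P<n>\d+)$"
)


def parse_hit_line(line):
    """Parse one summary line; return a dict, or None if it does not match."""
    m = HIT_LINE.match(line.rstrip())
    if m is None:
        return None
    return {
        "id": m["id"],
        "description": m["desc"],
        "score": int(m["score"]),
        "p": float(m["p"]),
        "n": int(m["n"]),
    }


# Three lines copied from the table above.
sample = [
    'FB|FBgn0261065 - symbol:Cpsf73 "Cleavage and polyadenylat...  2368  1.2e-256  2',
    'UNIPROTKB|E2QVB2 - symbol:INTS9 "Uncharacterized protein"...   125  0.00014   1',
    'TIGR_CMR|CBU_0596 - symbol:CBU_0596 "conserved hypothetic...   114  0.00081   1',
]

hits = [h for h in map(parse_hit_line, sample) if h is not None]

# Example filter: keep only hits with P(N) below 1e-100.
strong = [h["id"] for h in hits if h["p"] < 1e-100]
print(strong)
```

Because descriptions are truncated with "..." in this table, the full names still have to be taken from the per-hit records further down.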


>FB|FBgn0261065
            symbol:Cpsf73 "Cleavage and polyadenylation specificity
            factor 73" species:7227 "Drosophila melanogaster" [GO:0006379 "mRNA
            cleavage" evidence=ISS] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=ISS;NAS]
            [GO:0006378 "mRNA polyadenylation" evidence=ISS;IMP] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0006398 "histone mRNA 3'-end
            processing" evidence=IMP] [GO:0003677 "DNA binding" evidence=IDA]
            [GO:0022008 "neurogenesis" evidence=IMP] InterPro:IPR001279
            Pfam:PF00753 SMART:SM00849 Pfam:PF07521 EMBL:AE014297 GO:GO:0022008
            GO:GO:0003677 GO:GO:0006378 GO:GO:0016787 GO:GO:0005847
            GO:GO:0006379 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
            Pfam:PF10996 SMART:SM01027 GO:GO:0006398 KO:K14403 OMA:YVSFSAH
            InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            GeneTree:ENSGT00700000104485 EMBL:AY119128 RefSeq:NP_650738.1
            UniGene:Dm.13714 SMR:Q9VE51 IntAct:Q9VE51 MINT:MINT-804945
            STRING:Q9VE51 EnsemblMetazoa:FBtr0083690 GeneID:42240
            KEGG:dme:Dmel_CG7698 UCSC:CG7698-RA CTD:42240 FlyBase:FBgn0261065
            InParanoid:Q9VE51 OrthoDB:EOG4P5HR4 GenomeRNAi:42240 NextBio:827838
            Uniprot:Q9VE51
        Length = 684

 Score = 2368 (838.6 bits), Expect = 1.2e-256, Sum P(2) = 1.2e-256
 Identities = 434/569 (76%), Positives = 510/569 (89%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
             GAGQEVGRSCIMLEFK K IM+DCGIHPGLSGMDALP+VDL+E+D+IDLL ISHFHLDHC
Sbjct:    24 GAGQEVGRSCIMLEFKGKKIMLDCGIHPGLSGMDALPYVDLIEADEIDLLFISHFHLDHC 83

Query:    66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIET 125
             GALPWFL+KT FKGRCFMTHATKAIYRW+LSDYIK+SNISTEQMLYTE+DLE SM+KIET
Sbjct:    84 GALPWFLMKTSFKGRCFMTHATKAIYRWMLSDYIKISNISTEQMLYTEADLEASMEKIET 143

Query:   126 INFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK 185
             INFHEE+DV G++F AY AGHVLGAAMF+IEIAG+KILYTGDFSRQEDRHLMAAE+PP+K
Sbjct:   144 INFHEERDVMGVRFCAYIAGHVLGAAMFMIEIAGIKILYTGDFSRQEDRHLMAAEVPPMK 203

Query:   186 PDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYW 245
             PD+LITESTYGTH+HE+RE+RE RFTSL+  IV +GGRCLIPVFALGRAQELLLILDE+W
Sbjct:   204 PDVLITESTYGTHIHEKREDRENRFTSLVQKIVQQGGRCLIPVFALGRAQELLLILDEFW 263

Query:   246 SLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGIDHF 305
             S +P+LH+IPIYYASSLAKKCM+VYQTYINAMNDRIRRQI++NNPFVF+HISNLKGIDHF
Sbjct:   264 SQNPDLHEIPIYYASSLAKKCMAVYQTYINAMNDRIRRQIAVNNPFVFRHISNLKGIDHF 323

Query:   306 EDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEVIGM 365
             EDIGPCV+MASPGMMQSGLSRELFE WCTD KNGVIIAGYCVEGTLAK +LSEPEE+  +
Sbjct:   324 EDIGPCVIMASPGMMQSGLSRELFESWCTDPKNGVIIAGYCVEGTLAKAVLSEPEEITTL 383

Query:   366 SGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYED 425
             SGQ+LPL MSVDYISFSAHTDYQQTSEF+R L+P HVVLVHGEQNEMSRLK AL REYE 
Sbjct:   384 SGQKLPLNMSVDYISFSAHTDYQQTSEFIRLLKPTHVVLVHGEQNEMSRLKLALQREYEA 443

Query:   426 DPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHLLAP 485
             D +T ++ YNPRNT +VDLYF+GEKTAKVMG LA +N +  + LSG++VKR+F YHLLAP
Sbjct:   444 DASTDIKFYNPRNTHAVDLYFRGEKTAKVMGSLAAKNSEVGSKLSGVLVKRDFKYHLLAP 503

Query:   486 SDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHL-AGPVETLD-EKRLRAFACIEITL 543
             SDL KYTD+  S + Q+QS+ +  S+S L  L+  + AG VE L+ E++LR F CIE+T+
Sbjct:   504 SDLGKYTDMSMSVVTQRQSIPWGSSLSTLELLLDRIGAGCVEVLEAERKLRVFGCIELTV 563

Query:   544 EKCIVVLEWASNPISDMYADSLISECLIE 572
             E+ I+V+EW +  ++D+YAD++++ C+++
Sbjct:   564 EQKIIVMEWQATHVNDVYADAVLA-CIMQ 591

 Score = 126 (49.4 bits), Expect = 1.2e-256, Sum P(2) = 1.2e-256
 Identities = 26/57 (45%), Positives = 38/57 (66%)

Query:   563 DSLISECLIEILVEMYGEAAVPKMFKGEKITITVDKKKACIDLVDLSVQC-EDSKLK 618
             DS   ECLIE L + +G+  VPKMF+G+ + +TV  K+A I+L  L++ C ED  L+
Sbjct:   610 DSRFRECLIETLQDTFGDNCVPKMFEGDLLPVTVSGKRAEINLETLAISCAEDDVLR 666

 Score = 45 (20.9 bits), Expect = 0.00033, Sum P(2) = 0.00033
 Identities = 12/34 (35%), Positives = 18/34 (52%)

Query:   130 EEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKIL 163
             EE D+  IK     AG  +G +  ++E  G KI+
Sbjct:    13 EESDLLQIK--PLGAGQEVGRSCIMLEFKGKKIM 44
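
The "Identities" figure of an HSP can be recomputed from its aligned Query and Sbjct strings by counting columns where both residues are equal. The sketch below uses the third HSP above as input; `identity_stats` is a hypothetical helper. It assumes gaps are written as `-` and that the percentage is truncated rather than rounded, which matches the figures in this report (for example 510/569 is printed as 89%).

```python
def identity_stats(query, sbjct, gap="-"):
    """Return (identical columns, alignment length, truncated percent)."""
    if len(query) != len(sbjct):
        raise ValueError("aligned strings must have the same length")
    # A column counts as an identity only if both residues match and
    # neither side is a gap character.
    matches = sum(1 for q, s in zip(query, sbjct) if q == s and q != gap)
    length = len(query)
    return matches, length, 100 * matches // length


# Aligned strings of the third HSP against FB|FBgn0261065.
query = "EEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKIL"
sbjct = "EESDLLQIK--PLGAGQEVGRSCIMLEFKGKKIM"

print(identity_stats(query, sbjct))
```

For this HSP the result agrees with the reported "Identities = 12/34 (35%)".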


>UNIPROTKB|E2R7R2
            symbol:CPSF3 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718
            SMART:SM01098 GeneTree:ENSGT00700000104485 EMBL:AAEX03010701
            RefSeq:XP_003639652.1 Ensembl:ENSCAFT00000005417 GeneID:100856414
            KEGG:cfa:100856414 Uniprot:E2R7R2
        Length = 717

 Score = 2233 (791.1 bits), Expect = 1.9e-236, Sum P(2) = 1.9e-236
 Identities = 430/607 (70%), Positives = 503/607 (82%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
             GAGQEVGRSCI+LEFK + IM+DCGIHPGL GMDALP++DL++  +IDLLLISHFHLDHC
Sbjct:    51 GAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHC 110

Query:    66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIET 125
             GALPWFL KT FKGR FMTHATKAIYRWLLSDY+KVSNIS + MLYTE+DLE+SMDKIET
Sbjct:   111 GALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLYTETDLEESMDKIET 170

Query:   126 INFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK 185
             INFHE K+V GIKF  Y+AGHVLGAAMF+IEIAGVK+LYTGDFSRQEDRHLMAAEIP +K
Sbjct:   171 INFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIK 230

Query:   186 PDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYW 245
             PDILI ESTYGTH+HE+REERE RF + +HDIVNRGGR LIPVFALGRAQELLLILDEYW
Sbjct:   231 PDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYW 290

Query:   246 SLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGIDHF 305
               HPELHDIPIYYASSLAKKCM+VYQTY+NAMND+IR+QI+INNPFVFKHISNLK +DHF
Sbjct:   291 QNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPFVFKHISNLKSMDHF 350

Query:   306 EDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEVIGM 365
             +DIGP VVMASPGMMQSGLSRELFE WCTD +NGVIIAGYCVEGTLAK I+SEPEE+  M
Sbjct:   351 DDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTM 410

Query:   366 SGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYED 425
             SGQ+LPLKMSVDYISFSAHTDYQQTSEF+R L+P HV+LVHGEQNEM+RLKAAL REYED
Sbjct:   411 SGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMARLKAALIREYED 470

Query:   426 DPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHLLAP 485
             +    +E++NPRNT +V L F+GEK AKVMG LA +  +    +SGI+VKRNFNYH+L+P
Sbjct:   471 NDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILSP 530

Query:   486 SDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLD--EK-RLRAFACIEIT 542
              DL  YTDL  S + Q Q++ Y+G  ++L   +  L G VE L+  EK  L+ F  I + 
Sbjct:   531 CDLSNYTDLAMSTVKQTQAIPYTGPFNLLYYQLQKLTGDVEELEIQEKPALKVFKNITVI 590

Query:   543 LEKCIVVLEWASNPISDMYADSLISECLIEILVE-MYGEAAVPKMFKGEKITITVDKKKA 601
              E  +VVLEW +NP +DMYAD+ ++  ++E+       + AV K+ K  K+ + V  K+ 
Sbjct:   591 QEPGMVVLEWLANPSNDMYADT-VTTVILEVQSNPKIRKGAVQKVSK--KLEMHVYSKRL 647

Query:   602 CIDLVDL 608
              I L D+
Sbjct:   648 EIMLQDI 654

 Score = 70 (29.7 bits), Expect = 1.9e-236, Sum P(2) = 1.9e-236
 Identities = 25/76 (32%), Positives = 42/76 (55%)

Query:   548 VVLEWASNPISDMYADSLISECL--------IEILVE-MYGEAAVPKMFKGEKITITVDK 598
             V+LE  SNP     A   +S+ L        +EI+++ ++GE  V  +  G  +++TVD 
Sbjct:   616 VILEVQSNPKIRKGAVQKVSKKLEMHVYSKRLEIMLQDIFGEDCV-SVKDGSVLSVTVDG 674

Query:   599 KKACIDLVDLSVQCED 614
             K A I+L   +V+CE+
Sbjct:   675 KTANINLETRTVECEE 690
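
Each hit header carries its Gene Ontology annotations as bracketed `[GO:… "term" evidence=…]` entries that wrap across lines. A minimal sketch for extracting them follows; `GO_TERM` and `go_terms` are hypothetical names, and the pattern assumes the bracket layout seen in this report (seven-digit accession, quoted term name, evidence codes separated by `;`).

```python
import re

# One annotation: [GO:0000000 "term name" evidence=CODE;CODE]
GO_TERM = re.compile(r'\[GO:(\d{7}) "([^"]+)" evidence=([A-Z;]+)\]')


def go_terms(header):
    """Return a list of (accession, term name, [evidence codes])."""
    # Collapse line wraps and indentation to single spaces first, so a
    # term name split across two lines is matched as one string.
    flat = " ".join(header.split())
    return [
        (f"GO:{acc}", name, codes.split(";"))
        for acc, name, codes in GO_TERM.findall(flat)
    ]


# Fragment of the FB|FBgn0261065 header from this report.
sample = '''symbol:Cpsf73 "Cleavage and polyadenylation specificity
            factor 73" species:7227 "Drosophila melanogaster" [GO:0006379 "mRNA
            cleavage" evidence=ISS] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=ISS;NAS]'''

for term in go_terms(sample):
    print(term)
```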


>UNIPROTKB|P79101
            symbol:CPSF3 "Cleavage and polyadenylation specificity
            factor subunit 3" species:9913 "Bos taurus" [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=ISS] [GO:0006398 "histone mRNA 3'-end processing"
            evidence=ISS] [GO:0003723 "RNA binding" evidence=ISS] [GO:0004521
            "endoribonuclease activity" evidence=ISS] [GO:0008409 "5'-3'
            exonuclease activity" evidence=ISS] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0030529 "ribonucleoprotein complex" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0046872 GO:GO:0003723 GO:GO:0030529 GO:GO:0004521
            GO:GO:0005847 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
            Pfam:PF10996 SMART:SM01027 GO:GO:0006398 HOGENOM:HOG000203394
            KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            EMBL:X95906 EMBL:BC104553 IPI:IPI00708839 RefSeq:NP_776709.1
            UniGene:Bt.5045 ProteinModelPortal:P79101 SMR:P79101 STRING:P79101
            PRIDE:P79101 Ensembl:ENSBTAT00000026303 GeneID:281712
            KEGG:bta:281712 CTD:51692 GeneTree:ENSGT00700000104485
            HOVERGEN:HBG051107 InParanoid:P79101 OrthoDB:EOG4FN4H6
            NextBio:20805634 ArrayExpress:P79101 GO:GO:0008409 Uniprot:P79101
        Length = 684

 Score = 2233 (791.1 bits), Expect = 2.4e-236, Sum P(2) = 2.4e-236
 Identities = 430/607 (70%), Positives = 503/607 (82%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
             GAGQEVGRSCI+LEFK + IM+DCGIHPGL GMDALP++DL++  +IDLLLISHFHLDHC
Sbjct:    18 GAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHC 77

Query:    66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIET 125
             GALPWFL KT FKGR FMTHATKAIYRWLLSDY+KVSNIS + MLYTE+DLE+SMDKIET
Sbjct:    78 GALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLYTETDLEESMDKIET 137

Query:   126 INFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK 185
             INFHE K+V GIKF  Y+AGHVLGAAMF+IEIAGVK+LYTGDFSRQEDRHLMAAEIP +K
Sbjct:   138 INFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIK 197

Query:   186 PDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYW 245
             PDILI ESTYGTH+HE+REERE RF + +HDIVNRGGR LIPVFALGRAQELLLILDEYW
Sbjct:   198 PDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYW 257

Query:   246 SLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGIDHF 305
               HPELHDIPIYYASSLAKKCM+VYQTY+NAMND+IR+QI+INNPFVFKHISNLK +DHF
Sbjct:   258 QNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPFVFKHISNLKSMDHF 317

Query:   306 EDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEVIGM 365
             +DIGP VVMASPGMMQSGLSRELFE WCTD +NGVIIAGYCVEGTLAK I+SEPEE+  M
Sbjct:   318 DDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTM 377

Query:   366 SGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYED 425
             SGQ+LPLKMSVDYISFSAHTDYQQTSEF+R L+P HV+LVHGEQNEM+RLKAAL REYED
Sbjct:   378 SGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMARLKAALIREYED 437

Query:   426 DPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHLLAP 485
             +    +E++NPRNT +V L F+GEK AKVMG LA +  +    +SGI+VKRNFNYH+L+P
Sbjct:   438 NDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILSP 497

Query:   486 SDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLD--EK-RLRAFACIEIT 542
              DL  YTDL  S + Q Q++ Y+G  ++L   +  L G VE L+  EK  L+ F  I + 
Sbjct:   498 CDLSNYTDLAMSTVKQTQAIPYTGPFNLLYYQLQKLTGDVEELEIQEKPALKVFKNITVI 557

Query:   543 LEKCIVVLEWASNPISDMYADSLISECLIEILVE-MYGEAAVPKMFKGEKITITVDKKKA 601
              E  +VVLEW +NP +DMYAD+ ++  ++E+       + AV K+ K  K+ + V  K+ 
Sbjct:   558 QEPGMVVLEWLANPSNDMYADT-VTTVILEVQSNPKIRKGAVQKVSK--KLEMHVYSKRL 614

Query:   602 CIDLVDL 608
              I L D+
Sbjct:   615 EIMLQDI 621

 Score = 69 (29.3 bits), Expect = 2.4e-236, Sum P(2) = 2.4e-236
 Identities = 25/76 (32%), Positives = 42/76 (55%)

Query:   548 VVLEWASNPISDMYADSLISECL--------IEILVE-MYGEAAVPKMFKGEKITITVDK 598
             V+LE  SNP     A   +S+ L        +EI+++ ++GE  V  +  G  +++TVD 
Sbjct:   583 VILEVQSNPKIRKGAVQKVSKKLEMHVYSKRLEIMLQDIFGEDCV-SVKDGSILSVTVDG 641

Query:   599 KKACIDLVDLSVQCED 614
             K A I+L   +V+CE+
Sbjct:   642 KTANINLETRTVECEE 657
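
Every HSP block prints both an Expect value and a Sum P(N) value, and in this report the two are numerically the same. That is what the usual Poisson relationship of BLAST statistics, P = 1 - exp(-E), predicts for small E. The sketch below assumes that relationship applies here; `p_from_expect` is a hypothetical helper name.

```python
import math


def p_from_expect(expect):
    """Probability of at least one chance hit, given the expected count."""
    # -expm1(-E) equals 1 - exp(-E) but stays accurate for very small E.
    return -math.expm1(-expect)


# Expect value of the weakest HSP shown for FB|FBgn0261065.
p = p_from_expect(0.00033)
print(f"{p:.2g}")

# For larger E the two quantities diverge: P saturates towards 1.
print(f"{p_from_expect(3.0):.3f}")
```

For values as small as those in this report, the difference between E and P is far below the printed precision.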


>UNIPROTKB|I3LKR1
            symbol:CPSF3 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008409 "5'-3' exonuclease activity" evidence=IEA]
            [GO:0006398 "histone mRNA 3'-end processing" evidence=IEA]
            [GO:0005847 "mRNA cleavage and polyadenylation specificity factor
            complex" evidence=IEA] [GO:0004521 "endoribonuclease activity"
            evidence=IEA] [GO:0003723 "RNA binding" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0003723 GO:GO:0004521 GO:GO:0005847 InterPro:IPR022712
            InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 OMA:YVSFSAH
            InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            GeneTree:ENSGT00700000104485 GO:GO:0008409 EMBL:FP312696
            Ensembl:ENSSSCT00000027309 Uniprot:I3LKR1
        Length = 687

 Score = 2221 (786.9 bits), Expect = 3.5e-235, Sum P(2) = 3.5e-235
 Identities = 430/610 (70%), Positives = 503/610 (82%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
             GAGQEVGRSCI+LEFK + IM+DCGIHPGL GMDALP++DL++  +IDLLLISHFHLDHC
Sbjct:    18 GAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHC 77

Query:    66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKV---SNISTEQMLYTESDLEKSMDK 122
             GALPWFL KT FKGR FMTHATKAIYRWLLSDY+KV   SNIS + MLYTE+DLE+SMDK
Sbjct:    78 GALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVRKCSNISADDMLYTETDLEESMDK 137

Query:   123 IETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIP 182
             IETINFHE K+V GIKF  Y+AGHVLGAAMF+IEIAGVK+LYTGDFSRQEDRHLMAAEIP
Sbjct:   138 IETINFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIP 197

Query:   183 PVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILD 242
              +KPDILI ESTYGTH+HE+REERE RF + +HDIVNRGGR LIPVFALGRAQELLLILD
Sbjct:   198 NIKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILD 257

Query:   243 EYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGI 302
             EYW  HPELHDIPIYYASSLAKKCM+VYQTY+NAMND+IR+QI+INNPFVFKHISNLK +
Sbjct:   258 EYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPFVFKHISNLKSM 317

Query:   303 DHFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEV 362
             DHF+DIGP VVMASPGMMQSGLSRELFE WCTD +NGVIIAGYCVEGTLAK I+SEPEE+
Sbjct:   318 DHFDDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEI 377

Query:   363 IGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTRE 422
               MSGQ+LPLKMSVDYISFSAHTDYQQTSEF+R L+P HV+LVHGEQNEM+RLKAAL RE
Sbjct:   378 TTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMARLKAALIRE 437

Query:   423 YEDDPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHL 482
             YED+    +E++NPRNT +V L F+GEK AKVMG LA +  +    +SGI+VKRNFNYH+
Sbjct:   438 YEDNDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHI 497

Query:   483 LAPSDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLD--EK-RLRAFACI 539
             L+P DL  YTDL  S + Q Q++ Y+G  ++L   +  L G VE L+  EK  L+ F  I
Sbjct:   498 LSPCDLSNYTDLAMSTVKQTQAIPYTGPFNLLSYQLQKLTGDVEELEIQEKPALKVFKNI 557

Query:   540 EITLEKCIVVLEWASNPISDMYADSLISECLIEILVE-MYGEAAVPKMFKGEKITITVDK 598
              +  E  +VVLEW +NP +DMYAD+ ++  ++E+       + AV K+ K  K+ + V  
Sbjct:   558 TVIQEPGMVVLEWLANPSNDMYADT-VTTVILEVQSNPKIRKGAVQKVSK--KLEMHVYS 614

Query:   599 KKACIDLVDL 608
             K+  I L D+
Sbjct:   615 KRLEIMLQDI 624

 Score = 70 (29.7 bits), Expect = 3.5e-235, Sum P(2) = 3.5e-235
 Identities = 25/76 (32%), Positives = 42/76 (55%)

Query:   548 VVLEWASNPISDMYADSLISECL--------IEILVE-MYGEAAVPKMFKGEKITITVDK 598
             V+LE  SNP     A   +S+ L        +EI+++ ++GE  V  +  G  +++TVD 
Sbjct:   586 VILEVQSNPKIRKGAVQKVSKKLEMHVYSKRLEIMLQDIFGEDCV-SVKDGSVLSVTVDG 644

Query:   599 KKACIDLVDLSVQCED 614
             K A I+L   +V+CE+
Sbjct:   645 KTANINLETRTVECEE 660


>UNIPROTKB|F1NKW5
            symbol:CPSF3 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0003723 "RNA binding" evidence=IEA] [GO:0004521
            "endoribonuclease activity" evidence=IEA] [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=IEA] [GO:0006398 "histone mRNA 3'-end processing"
            evidence=IEA] [GO:0008409 "5'-3' exonuclease activity"
            evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 GO:GO:0003723 GO:GO:0004521 GO:GO:0005847
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            GeneTree:ENSGT00700000104485 GO:GO:0008409 EMBL:AADN02018718
            IPI:IPI00600642 Ensembl:ENSGALT00000026493 Uniprot:F1NKW5
        Length = 685

 Score = 2229 (789.7 bits), Expect = 1.2e-234, Sum P(2) = 1.2e-234
 Identities = 427/616 (69%), Positives = 505/616 (81%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
             GAGQEVGRSCI+LEFK + IM+DCGIHPGL GMDALP++DL++  +IDLLLISHFHLDHC
Sbjct:    18 GAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHC 77

Query:    66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIET 125
             GALPWFL KT FKGR FMTHATKAIYRWLLSDY+KVSNIS + MLYTE+DLE+SMDKIET
Sbjct:    78 GALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLYTETDLEESMDKIET 137

Query:   126 INFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK 185
             INFHE K+V GIKF  Y+AGHVLGAAMF+IEIAGVK+LYTGDFSRQEDRHLMAAEIP +K
Sbjct:   138 INFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIK 197

Query:   186 PDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYW 245
             PDILI ESTYGTH+HE+REERE RF + +HDIVNRGGR LIPVFALGRAQELLLILDEYW
Sbjct:   198 PDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYW 257

Query:   246 SLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGIDHF 305
               HPELHDIPIYYASSLAKKCM+VYQTY+NAMND+IR+QI+INNPFVFKHISNLK +DHF
Sbjct:   258 QNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPFVFKHISNLKSMDHF 317

Query:   306 EDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEVIGM 365
             +DIGP VVMASPGMMQSGLSRELFE WCTD +NGVIIAGYCVEGTLAK I+SEPEE+  M
Sbjct:   318 DDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTM 377

Query:   366 SGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYED 425
             SGQ+LPLKMSVDYISFSAHTDYQQTSEF+R L+P HV+LVHGEQNEM+RLKAAL REYED
Sbjct:   378 SGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMARLKAALIREYED 437

Query:   426 DPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHLLAP 485
             +    +E++NPRNT +V L F+GEK AKVMG LA +  +    +SGI+VKRNFNYH+L+P
Sbjct:   438 NDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGSLADKKPEQGQRISGILVKRNFNYHILSP 497

Query:   486 SDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLDEKR---LRAFACIEIT 542
              DL  YTDL  S + Q  ++ Y+G  ++L   +  L G VE ++ ++   L+ F  I + 
Sbjct:   498 CDLSNYTDLAMSTVTQTLAIPYTGPFNLLFYQLQKLTGDVEEIEIQQKPALKVFKSITVI 557

Query:   543 LEKCIVVLEWASNPISDMYADSLISECLIEILVE-MYGEAAVPKMFKGEKITITVDKKKA 601
              E  +VVLEW +NP +DMYAD+ ++  ++E+       +AAV K+    K+ +   +K+ 
Sbjct:   558 QEPGMVVLEWVANPANDMYADT-VTTVILEVQSNPKIQKAAVHKV--STKVDMEEYRKRM 614

Query:   602 CIDLVDL-SVQCEDSK 616
              + L D+    C  SK
Sbjct:   615 EMMLQDMFGEDCVSSK 630

 Score = 57 (25.1 bits), Expect = 1.2e-234, Sum P(2) = 1.2e-234
 Identities = 14/41 (34%), Positives = 22/41 (53%)

Query:   573 ILVEMYGEAAVPKMFKGEKITITVDKKKACIDLVDLSVQCE 613
             +L +M+GE  V    +G  + +TVD K A + L   +  CE
Sbjct:   617 MLQDMFGEDCVSSK-EGSILCVTVDGKTANLSLETRTADCE 656


>ZFIN|ZDB-GENE-030131-3275
            symbol:cpsf3 "cleavage and polyadenylation
            specific factor 3" species:7955 "Danio rerio" [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 ZFIN:ZDB-GENE-030131-3275 GO:GO:0016787
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            KO:K14403 InterPro:IPR021718 Pfam:PF11718 SMART:SM01098 CTD:51692
            HOVERGEN:HBG051107 EMBL:AY648793 IPI:IPI00509063
            RefSeq:NP_001003836.1 UniGene:Dr.77231 ProteinModelPortal:Q6DRG6
            SMR:Q6DRG6 STRING:Q6DRG6 GeneID:324554 KEGG:dre:324554
            NextBio:20808833 ArrayExpress:Q6DRG6 Uniprot:Q6DRG6
        Length = 690

 Score = 2246 (795.7 bits), Expect = 7.3e-233, P = 7.3e-233
 Identities = 428/618 (69%), Positives = 507/618 (82%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
             GAGQEVGRSCI+LEFK + IM+DCGIHPGL GMDALP++DL++  +IDLLLISHFHLDHC
Sbjct:    25 GAGQEVGRSCIILEFKGRKIMVDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHC 84

Query:    66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIET 125
             GALPWFL KT FKGR FMTHATKAIYRWLLSDY+KVSNIS + MLYTE+DLE+SMDKIET
Sbjct:    85 GALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLYTETDLEESMDKIET 144

Query:   126 INFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK 185
             INFHE K+V GIKF  Y+AGHVLGAAMF+IEIAGVK+LYTGDFSRQEDRHLMAAEIP VK
Sbjct:   145 INFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPSVK 204

Query:   186 PDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYW 245
             PDILITESTYGTH+HE+REERE RF + +HDIVNR GRCLIPVFALGRAQELLLILDEYW
Sbjct:   205 PDILITESTYGTHIHEKREEREARFCNTVHDIVNREGRCLIPVFALGRAQELLLILDEYW 264

Query:   246 SLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGIDHF 305
               HPELHDIPIYYASSLAKKCM+VYQTY+NAMND+IR+ I+INNPFVFKHISNLK +DHF
Sbjct:   265 QNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKAININNPFVFKHISNLKSMDHF 324

Query:   306 EDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEVIGM 365
             +DIGP VVMASPGMMQSGLSRELFE WCTD +NGVIIAGYCVEGTLAK I+SEPEE+  M
Sbjct:   325 DDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTM 384

Query:   366 SGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYED 425
             SGQ+LPLKMSVDYISFSAHTDYQQTSEF+R L+P HV+LVHGEQNEM+RLKAAL REYED
Sbjct:   385 SGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMARLKAALIREYED 444

Query:   426 DPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHLLAP 485
             +    +E++NPRNT +V L F+GEK AKVMG LA +       +SGI+VK+NF+YH+L+P
Sbjct:   445 NDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGSLADKKCSQGQRVSGILVKKNFSYHILSP 504

Query:   486 SDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLD--EKR-LRAFACIEIT 542
             SDL  YTDL  S + Q Q++ ++G   +L S + HL G VE ++  EK  ++ F  I + 
Sbjct:   505 SDLSNYTDLAMSTVKQTQAIPFTGPFPLLLSQLRHLTGDVEEIEMSEKSTVKVFNSITVI 564

Query:   543 LEKCIVVLEWASNPISDMYADSLISECLIEILVEMYGEAAVPKMFKGEKITITVDKKKAC 602
              E  +VVLEW +NP++DMYAD+ ++  ++E+      + A+    K  K+ + V + +  
Sbjct:   565 HENNLVVLEWFANPLNDMYADA-VTTVVLEVQSNPKAQKALQPQEK--KVDVNVFQNRLL 621

Query:   603 IDLVDL-SVQCEDSKLKS 619
                 D+   +C D K K+
Sbjct:   622 KMFQDMFGEECVDFKDKN 639


>UNIPROTKB|Q9UKF6
            symbol:CPSF3 "Cleavage and polyadenylation specificity
            factor subunit 3" species:9606 "Homo sapiens" [GO:0030529
            "ribonucleoprotein complex" evidence=IEA] [GO:0046872 "metal ion
            binding" evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0008409 "5'-3' exonuclease activity" evidence=ISS] [GO:0004521
            "endoribonuclease activity" evidence=ISS] [GO:0003723 "RNA binding"
            evidence=ISS] [GO:0006398 "histone mRNA 3'-end processing"
            evidence=IDA] [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=IDA] [GO:0006378 "mRNA
            polyadenylation" evidence=TAS] [GO:0006379 "mRNA cleavage"
            evidence=TAS] [GO:0000398 "mRNA splicing, via spliceosome"
            evidence=TAS] [GO:0005654 "nucleoplasm" evidence=TAS] [GO:0006366
            "transcription from RNA polymerase II promoter" evidence=TAS]
            [GO:0006369 "termination of RNA polymerase II transcription"
            evidence=TAS] [GO:0006397 "mRNA processing" evidence=TAS]
            [GO:0006406 "mRNA export from nucleus" evidence=TAS] [GO:0008380
            "RNA splicing" evidence=TAS] [GO:0010467 "gene expression"
            evidence=TAS] [GO:0031124 "mRNA 3'-end processing" evidence=TAS]
            Reactome:REACT_71 InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            EMBL:AF017269 Pfam:PF07521 EMBL:AF171877 EMBL:CH471053
            GO:GO:0046872 Reactome:REACT_1675 GO:GO:0006378 GO:GO:0003723
            GO:GO:0030529 GO:GO:0006406 GO:GO:0004521 GO:GO:0000398
            Reactome:REACT_1788 GO:GO:0005847 GO:GO:0006379 GO:GO:0006369
            eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 Reactome:REACT_78 GO:GO:0006398 HOGENOM:HOG000203394
            KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            CTD:51692 HOVERGEN:HBG051107 OrthoDB:EOG4FN4H6 GO:GO:0008409
            EMBL:AC080162 EMBL:BC011654 EMBL:BC020211 IPI:IPI00007818
            RefSeq:NP_057291.1 UniGene:Hs.515972 PDB:2I7T PDB:2I7V PDBsum:2I7T
            PDBsum:2I7V ProteinModelPortal:Q9UKF6 SMR:Q9UKF6 DIP:DIP-42501N
            MINT:MINT-1742891 STRING:Q9UKF6 PhosphoSite:Q9UKF6 DMDM:18203503
            PaxDb:Q9UKF6 PeptideAtlas:Q9UKF6 PRIDE:Q9UKF6 DNASU:51692
            Ensembl:ENST00000238112 GeneID:51692 KEGG:hsa:51692 UCSC:uc002qzo.1
            GeneCards:GC02P009514 HGNC:HGNC:2326 HPA:HPA034657 MIM:606029
            neXtProt:NX_Q9UKF6 PharmGKB:PA26843 InParanoid:Q9UKF6
            PhylomeDB:Q9UKF6 ChiTaRS:CPSF3 EvolutionaryTrace:Q9UKF6
            GenomeRNAi:51692 NextBio:55702 ArrayExpress:Q9UKF6 Bgee:Q9UKF6
            CleanEx:HS_CPSF3 Genevestigator:Q9UKF6 GermOnline:ENSG00000119203
            Uniprot:Q9UKF6
        Length = 684

 Score = 2235 (791.8 bits), Expect = 1.1e-231, P = 1.1e-231
 Identities = 436/625 (69%), Positives = 510/625 (81%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
             GAGQEVGRSCI+LEFK + IM+DCGIHPGL GMDALP++DL++  +IDLLLISHFHLDHC
Sbjct:    18 GAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHC 77

Query:    66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIET 125
             GALPWFL KT FKGR FMTHATKAIYRWLLSDY+KVSNIS + MLYTE+DLE+SMDKIET
Sbjct:    78 GALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLYTETDLEESMDKIET 137

Query:   126 INFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK 185
             INFHE K+V GIKF  Y+AGHVLGAAMF+IEIAGVK+LYTGDFSRQEDRHLMAAEIP +K
Sbjct:   138 INFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIK 197

Query:   186 PDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYW 245
             PDILI ESTYGTH+HE+REERE RF + +HDIVNRGGR LIPVFALGRAQELLLILDEYW
Sbjct:   198 PDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYW 257

Query:   246 SLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGIDHF 305
               HPELHDIPIYYASSLAKKCM+VYQTY+NAMND+IR+QI+INNPFVFKHISNLK +DHF
Sbjct:   258 QNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPFVFKHISNLKSMDHF 317

Query:   306 EDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEVIGM 365
             +DIGP VVMASPGMMQSGLSRELFE WCTD +NGVIIAGYCVEGTLAK I+SEPEE+  M
Sbjct:   318 DDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTM 377

Query:   366 SGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYED 425
             SGQ+LPLKMSVDYISFSAHTDYQQTSEF+R L+P HV+LVHGEQNEM+RLKAAL REYED
Sbjct:   378 SGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMARLKAALIREYED 437

Query:   426 DPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHLLAP 485
             +    +E++NPRNT +V L F+GEK AKVMG LA +  +    +SGI+VKRNFNYH+L+P
Sbjct:   438 NDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILSP 497

Query:   486 SDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLD--EK-RLRAFACIEIT 542
              DL  YTDL  S + Q Q++ Y+G  ++L   +  L G VE L+  EK  L+ F  I + 
Sbjct:   498 CDLSNYTDLAMSTVKQTQAIPYTGPFNLLCYQLQKLTGDVEELEIQEKPALKVFKNITVI 557

Query:   543 LEKCIVVLEWASNPISDMYADSLISECLIEILVE-MYGEAAVPKMFKGEKITITVDKKKA 601
              E  +VVLEW +NP +DMYAD+ ++  ++E+       + AV K+ K  K+ + V  K+ 
Sbjct:   558 QEPGMVVLEWLANPSNDMYADT-VTTVILEVQSNPKIRKGAVQKVSK--KLEMHVYSKRL 614

Query:   602 CIDLVDL-SVQC----EDSKLKSTV 621
              I L D+    C    +DS L  TV
Sbjct:   615 EIMLQDIFGEDCVSVKDDSILSVTV 639


>MGI|MGI:1859328
            symbol:Cpsf3 "cleavage and polyadenylation specificity
            factor 3" species:10090 "Mus musculus" [GO:0003723 "RNA binding"
            evidence=IDA] [GO:0003729 "mRNA binding" evidence=ISO] [GO:0004518
            "nuclease activity" evidence=IEA] [GO:0004519 "endonuclease
            activity" evidence=IEA] [GO:0004521 "endoribonuclease activity"
            evidence=IDA] [GO:0005634 "nucleus" evidence=IEA] [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=ISO;IDA] [GO:0006378 "mRNA polyadenylation" evidence=ISO]
            [GO:0006397 "mRNA processing" evidence=IEA] [GO:0006398 "histone
            mRNA 3'-end processing" evidence=ISO;IDA] [GO:0008409 "5'-3'
            exonuclease activity" evidence=IDA] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0030529 "ribonucleoprotein complex"
            evidence=IEA] [GO:0046872 "metal ion binding" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            MGI:MGI:1859328 GO:GO:0046872 GO:GO:0003723 GO:GO:0030529
            GO:GO:0004521 GO:GO:0005847 eggNOG:COG1236 InterPro:IPR022712
            InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 GO:GO:0006398
            HOGENOM:HOG000203394 KO:K14403 OMA:YVSFSAH InterPro:IPR021718
            Pfam:PF11718 SMART:SM01098 CTD:51692 GeneTree:ENSGT00700000104485
            HOVERGEN:HBG051107 OrthoDB:EOG4FN4H6 GO:GO:0008409 ChiTaRS:CPSF3
            EMBL:AF203969 EMBL:BC023297 IPI:IPI00135099 RefSeq:NP_061283.2
            UniGene:Mm.356778 ProteinModelPortal:Q9QXK7 SMR:Q9QXK7
            STRING:Q9QXK7 PhosphoSite:Q9QXK7 PaxDb:Q9QXK7 PRIDE:Q9QXK7
            Ensembl:ENSMUST00000067284 GeneID:54451 KEGG:mmu:54451
            InParanoid:Q8CIM0 NextBio:311332 Bgee:Q9QXK7 CleanEx:MM_CPSF3
            Genevestigator:Q9QXK7 GermOnline:ENSMUSG00000054309 Uniprot:Q9QXK7
        Length = 684

 Score = 2229 (789.7 bits), Expect = 4.6e-231, P = 4.6e-231
 Identities = 433/625 (69%), Positives = 509/625 (81%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
             GAGQEVGRSCI+LEFK + IM+DCGIHPGL GMDALP++DL++  +IDLLLISHFHLDHC
Sbjct:    18 GAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHC 77

Query:    66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIET 125
             GALPWFL KT FKGR FMTHATKAIYRWLLSDY+KVSNIS + MLYTE+DLE+SMDKIET
Sbjct:    78 GALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLYTETDLEESMDKIET 137

Query:   126 INFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK 185
             INFHE K+V GIKF  Y+AGHVLGAAMF+IEIAGVK+LYTGDFSRQEDRHLMAAEIP +K
Sbjct:   138 INFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIK 197

Query:   186 PDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYW 245
             PDILI ESTYGTH+HE+REERE RF + +HDIVNRGGR LIPVFALGRAQELLLILDEYW
Sbjct:   198 PDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYW 257

Query:   246 SLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGIDHF 305
               HPELHDIPIYYASSLAKKCM+VYQTY+NAMND+IR+QI+INNPFVFKHISNLK +DHF
Sbjct:   258 QNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPFVFKHISNLKSMDHF 317

Query:   306 EDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEVIGM 365
             +DIGP VVMASPGM+Q+GLSRELFE WCTD +NGVIIAGYCVEGTLAK I+SEPEE+  M
Sbjct:   318 DDIGPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTM 377

Query:   366 SGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYED 425
             SGQ+LPLKMSVDYISFSAHTDYQQTSEF+R L+P HV+LVHGEQNEM+RLKAAL REYED
Sbjct:   378 SGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMARLKAALIREYED 437

Query:   426 DPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHLLAP 485
             +    +E++NPRNT +V L F+GEK AKVMG LA +  +    +SGI+VKRNFNYH+L+P
Sbjct:   438 NDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILSP 497

Query:   486 SDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLD--EK-RLRAFACIEIT 542
              DL  YTDL  S + Q Q++ Y+G   +L   +  L G VE L+  EK  L+ F  I + 
Sbjct:   498 CDLSNYTDLAMSTVKQTQAIPYTGPFYLLYYQLQKLTGDVEELEIQEKPALKVFKSITVV 557

Query:   543 LEKCIVVLEWASNPISDMYADSLISECLIEILVE-MYGEAAVPKMFKGEKITITVDKKKA 601
              E  +VVLEW +NP +DMYAD+ ++  ++E+       + AV K+ K  K+ + V  K+ 
Sbjct:   558 QEPGMVVLEWLANPSNDMYADT-VTTVILEVQSNPKIRKGAVQKVSK--KLEMHVYSKRL 614

Query:   602 CIDLVDL-SVQC----EDSKLKSTV 621
              + L D+    C    +DS L  TV
Sbjct:   615 EVMLQDIFGEDCVSVKDDSVLSVTV 639


>UNIPROTKB|G3V6W7
            symbol:Cpsf3 "Protein Cpsf3" species:10116 "Rattus
            norvegicus" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            RGD:1305767 GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108
            Pfam:PF10996 SMART:SM01027 EMBL:CH473947 OMA:YVSFSAH
            InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            GeneTree:ENSGT00700000104485 UniGene:Rn.100522
            Ensembl:ENSRNOT00000009652 Uniprot:G3V6W7
        Length = 685

 Score = 2229 (789.7 bits), Expect = 4.6e-231, P = 4.6e-231
 Identities = 433/625 (69%), Positives = 509/625 (81%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
             GAGQEVGRSCI+LEFK + IM+DCGIHPGL GMDALP++DL++  +IDLLLISHFHLDHC
Sbjct:    18 GAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHC 77

Query:    66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIET 125
             GALPWFL KT FKGR FMTHATKAIYRWLLSDY+KVSNIS + MLYTE+DLE+SMDKIET
Sbjct:    78 GALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLYTETDLEESMDKIET 137

Query:   126 INFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK 185
             INFHE K+V GIKF  Y+AGHVLGAAMF+IEIAGVK+LYTGDFSRQEDRHLMAAEIP +K
Sbjct:   138 INFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIK 197

Query:   186 PDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYW 245
             PDILI ESTYGTH+HE+REERE RF + +HDIVNRGGR LIPVFALGRAQELLLILDEYW
Sbjct:   198 PDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYW 257

Query:   246 SLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGIDHF 305
               HPELHDIPIYYASSLAKKCM+VYQTY+NAMND+IR+QI+INNPFVFKHISNLK +DHF
Sbjct:   258 QNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPFVFKHISNLKSMDHF 317

Query:   306 EDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEVIGM 365
             +DIGP VVMASPGM+Q+GLSRELFE WCTD +NGVIIAGYCVEGTLAK I+SEPEE+  M
Sbjct:   318 DDIGPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTM 377

Query:   366 SGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYED 425
             SGQ+LPLKMSVDYISFSAHTDYQQTSEF+R L+P HV+LVHGEQNEM+RLKAAL REYED
Sbjct:   378 SGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMARLKAALIREYED 437

Query:   426 DPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHLLAP 485
             +    +E++NPRNT +V L F+GEK AKVMG LA +  +    +SGI+VKRNFNYH+L+P
Sbjct:   438 NDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILSP 497

Query:   486 SDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLD--EK-RLRAFACIEIT 542
              DL  YTDL  S + Q Q++ Y+G   +L   +  L G VE L+  EK  L+ F  I + 
Sbjct:   498 CDLSNYTDLAMSTVKQTQAIPYTGPFYLLYYQLQKLTGDVEELEIQEKPALKVFKSITVV 557

Query:   543 LEKCIVVLEWASNPISDMYADSLISECLIEILVE-MYGEAAVPKMFKGEKITITVDKKKA 601
              E  +VVLEW +NP +DMYAD+ ++  ++E+       + AV K+ K  K+ + V  K+ 
Sbjct:   558 QEPGMVVLEWLANPSNDMYADT-VTTVILEVQSNPKIRKGAVQKVSK--KLEMHVYSKRL 614

Query:   602 CIDLVDL-SVQC----EDSKLKSTV 621
              + L D+    C    +DS L  TV
Sbjct:   615 EVMLQDIFGEDCVSVKDDSVLSVTV 639


>RGD|1305767
            symbol:Cpsf3 "cleavage and polyadenylation specific factor 3,
            73kDa" species:10116 "Rattus norvegicus" [GO:0003723 "RNA binding"
            evidence=ISO] [GO:0004521 "endoribonuclease activity" evidence=ISO]
            [GO:0005847 "mRNA cleavage and polyadenylation specificity factor
            complex" evidence=ISO] [GO:0006398 "histone mRNA 3'-end processing"
            evidence=ISO] [GO:0008409 "5'-3' exonuclease activity"
            evidence=ISO] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 RGD:1305767 GO:GO:0016787 eggNOG:COG1236
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            HOGENOM:HOG000203394 KO:K14403 InterPro:IPR021718 Pfam:PF11718
            SMART:SM01098 CTD:51692 HOVERGEN:HBG051107 OrthoDB:EOG4FN4H6
            UniGene:Rn.100522 EMBL:BC099817 IPI:IPI00365532
            RefSeq:NP_001025201.1 ProteinModelPortal:Q499P4 SMR:Q499P4
            STRING:Q499P4 GeneID:298916 KEGG:rno:298916 InParanoid:Q499P4
            NextBio:644507 Genevestigator:Q499P4 Uniprot:Q499P4
        Length = 685

 Score = 2226 (788.7 bits), Expect = 9.6e-231, P = 9.6e-231
 Identities = 432/625 (69%), Positives = 509/625 (81%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
             GAGQEVGRSCI+LEFK + IM+DCGIHPGL GMDALP++DL++  +IDLLLISHFHLDHC
Sbjct:    18 GAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHC 77

Query:    66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIET 125
             GALPWFL KT FKGR FMTHATKAIYRWLLSDY+KVSNIS + MLYTE+DLE+SMDKIET
Sbjct:    78 GALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLYTETDLEESMDKIET 137

Query:   126 INFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK 185
             INFHE K+V GIKF  Y+AGHVLGAAMF+IEIAG+K+LYTGDFSRQEDRHLMAAEIP +K
Sbjct:   138 INFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGMKLLYTGDFSRQEDRHLMAAEIPNIK 197

Query:   186 PDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYW 245
             PDILI ESTYGTH+HE+REERE RF + +HDIVNRGGR LIPVFALGRAQELLLILDEYW
Sbjct:   198 PDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYW 257

Query:   246 SLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGIDHF 305
               HPELHDIPIYYASSLAKKCM+VYQTY+NAMND+IR+QI+INNPFVFKHISNLK +DHF
Sbjct:   258 QNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPFVFKHISNLKSMDHF 317

Query:   306 EDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEVIGM 365
             +DIGP VVMASPGM+Q+GLSRELFE WCTD +NGVIIAGYCVEGTLAK I+SEPEE+  M
Sbjct:   318 DDIGPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTM 377

Query:   366 SGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYED 425
             SGQ+LPLKMSVDYISFSAHTDYQQTSEF+R L+P HV+LVHGEQNEM+RLKAAL REYED
Sbjct:   378 SGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMARLKAALIREYED 437

Query:   426 DPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHLLAP 485
             +    +E++NPRNT +V L F+GEK AKVMG LA +  +    +SGI+VKRNFNYH+L+P
Sbjct:   438 NDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILSP 497

Query:   486 SDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLD--EK-RLRAFACIEIT 542
              DL  YTDL  S + Q Q++ Y+G   +L   +  L G VE L+  EK  L+ F  I + 
Sbjct:   498 CDLSNYTDLAMSTVKQTQAIPYTGPFYLLYYQLQKLTGDVEELEIQEKPALKVFKSITVV 557

Query:   543 LEKCIVVLEWASNPISDMYADSLISECLIEILVE-MYGEAAVPKMFKGEKITITVDKKKA 601
              E  +VVLEW +NP +DMYAD+ ++  ++E+       + AV K+ K  K+ + V  K+ 
Sbjct:   558 QEPGMVVLEWLANPSNDMYADT-VTTVILEVQSNPKIRKGAVQKVSK--KLEMHVYSKRL 614

Query:   602 CIDLVDL-SVQC----EDSKLKSTV 621
              + L D+    C    +DS L  TV
Sbjct:   615 EVMLQDIFGEDCVSVKDDSVLSVTV 639


>UNIPROTKB|G5E9W3
            symbol:CPSF3 "Cleavage and polyadenylation-specificity
            factor subunit 3" species:9606 "Homo sapiens" [GO:0003723 "RNA
            binding" evidence=IEA] [GO:0004521 "endoribonuclease activity"
            evidence=IEA] [GO:0008409 "5'-3' exonuclease activity"
            evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 EMBL:CH471053 GO:GO:0003723 GO:GO:0004521
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            InterPro:IPR021718 Pfam:PF11718 SMART:SM01098 GO:GO:0008409
            EMBL:AC080162 UniGene:Hs.515972 HGNC:HGNC:2326 ChiTaRS:CPSF3
            ProteinModelPortal:G5E9W3 SMR:G5E9W3 PRIDE:G5E9W3
            Ensembl:ENST00000460593 ArrayExpress:G5E9W3 Bgee:G5E9W3
            Uniprot:G5E9W3
        Length = 647

 Score = 2150 (761.9 bits), Expect = 1.1e-222, P = 1.1e-222
 Identities = 420/605 (69%), Positives = 492/605 (81%)

Query:    26 MMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGFKGRCFMTH 85
             M+DCGIHPGL GMDALP++DL++  +IDLLLISHFHLDHCGALPWFL KT FKGR FMTH
Sbjct:     1 MLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTSFKGRTFMTH 60

Query:    86 ATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNGIKFSAYNAG 145
             ATKAIYRWLLSDY+KVSNIS + MLYTE+DLE+SMDKIETINFHE K+V GIKF  Y+AG
Sbjct:    61 ATKAIYRWLLSDYVKVSNISADDMLYTETDLEESMDKIETINFHEVKEVAGIKFWCYHAG 120

Query:   146 HVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVKPDILITESTYGTHVHEQREE 205
             HVLGAAMF+IEIAGVK+LYTGDFSRQEDRHLMAAEIP +KPDILI ESTYGTH+HE+REE
Sbjct:   121 HVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREE 180

Query:   206 REGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAKK 265
             RE RF + +HDIVNRGGR LIPVFALGRAQELLLILDEYW  HPELHDIPIYYASSLAKK
Sbjct:   181 REARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKK 240

Query:   266 CMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGIDHFEDIGPCVVMASPGMMQSGLS 325
             CM+VYQTY+NAMND+IR+QI+INNPFVFKHISNLK +DHF+DIGP VVMASPGMMQSGLS
Sbjct:   241 CMAVYQTYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLS 300

Query:   326 RELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEVIGMSGQRLPLKMSVDYISFSAHT 385
             RELFE WCTD +NGVIIAGYCVEGTLAK I+SEPEE+  MSGQ+LPLKMSVDYISFSAHT
Sbjct:   301 RELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHT 360

Query:   386 DYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMELYNPRNTVSVDLY 445
             DYQQTSEF+R L+P HV+LVHGEQNEM+RLKAAL REYED+    +E++NPRNT +V L 
Sbjct:   361 DYQQTSEFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEAVTLN 420

Query:   446 FKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHLLAPSDLPKYTDLKASKIIQQQSV 505
             F+GEK AKVMG LA +  +    +SGI+VKRNFNYH+L+P DL  YTDL  S + Q Q++
Sbjct:   421 FRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILSPCDLSNYTDLAMSTVKQTQAI 480

Query:   506 YYSGSISVLRSLISHLAGPVETLD--EK-RLRAFACIEITLEKCIVVLEWASNPISDMYA 562
              Y+G  ++L   +  L G VE L+  EK  L+ F  I +  E  +VVLEW +NP +DMYA
Sbjct:   481 PYTGPFNLLCYQLQKLTGDVEELEIQEKPALKVFKNITVIQEPGMVVLEWLANPSNDMYA 540

Query:   563 DSLISECLIEILVE-MYGEAAVPKMFKGEKITITVDKKKACIDLVDL-SVQC----EDSK 616
             D+ ++  ++E+       + AV K+ K  K+ + V  K+  I L D+    C    +DS 
Sbjct:   541 DT-VTTVILEVQSNPKIRKGAVQKVSK--KLEMHVYSKRLEIMLQDIFGEDCVSVKDDSI 597

Query:   617 LKSTV 621
             L  TV
Sbjct:   598 LSVTV 602


>WB|WBGene00013460
            symbol:cpsf-3 species:6239 "Caenorhabditis elegans"
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0009792 "embryo
            development ending in birth or egg hatching" evidence=IMP]
            [GO:0040007 "growth" evidence=IMP] [GO:0002119 "nematode larval
            development" evidence=IMP] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 GO:GO:0009792 GO:GO:0040007
            GO:GO:0002119 GO:GO:0016787 eggNOG:COG1236 InterPro:IPR022712
            InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 HOGENOM:HOG000203394
            KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            GeneTree:ENSGT00700000104485 EMBL:AL132951 RefSeq:NP_502553.2
            ProteinModelPortal:Q95PY8 SMR:Q95PY8 STRING:Q95PY8 PaxDb:Q95PY8
            EnsemblMetazoa:Y67H2A.1.1 EnsemblMetazoa:Y67H2A.1.2 GeneID:178285
            KEGG:cel:CELE_Y67H2A.1 UCSC:Y67H2A.1 CTD:178285 WormBase:Y67H2A.1
            InParanoid:Q95PY8 NextBio:900506 Uniprot:Q95PY8
        Length = 707

 Score = 1904 (675.3 bits), Expect = 1.3e-196, P = 1.3e-196
 Identities = 354/583 (60%), Positives = 449/583 (77%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
             G+GQEVGRSC +LE+K K +M+DCG+HPGL G+DALPFVD VE + IDLLLI+HFHLDHC
Sbjct:    17 GSGQEVGRSCHLLEYKGKRVMLDCGVHPGLHGVDALPFVDFVEIENIDLLLITHFHLDHC 76

Query:    66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNIS--TEQMLYTESDLEKSMDKI 123
             GALPW L KT F+G+CFMTHATKAIYR LL DY+++S         LYTE DLEKSM KI
Sbjct:    77 GALPWLLQKTAFQGKCFMTHATKAIYRMLLGDYVRISKYGGPDRNQLYTEDDLEKSMAKI 136

Query:   124 ETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPP 183
             ETI+F E+K+VNGI+F  Y AGHVLGA  F+IEIAGV++LYTGDFS  EDRHL AAEIPP
Sbjct:   137 ETIDFREQKEVNGIRFWPYVAGHVLGACQFMIEIAGVRVLYTGDFSCLEDRHLCAAEIPP 196

Query:   184 VKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDE 243
             + P +LITESTYGT  HE R  RE RFT ++HDIV RGGRCLIP FA+G AQEL+LILDE
Sbjct:   197 ITPQVLITESTYGTQTHEDRAVREKRFTQMVHDIVTRGGRCLIPAFAIGPAQELMLILDE 256

Query:   244 YWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGID 303
             YW  H ELHDIP+YYASSLAKKCMSVYQT++N MN RI++QI++ NPF+FKH+S L+G+D
Sbjct:   257 YWESHQELHDIPVYYASSLAKKCMSVYQTFVNGMNSRIQKQIAVKNPFIFKHVSTLRGMD 316

Query:   304 HFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEVI 363
              FED GPCVV+A+PGM+QSG SRELFE WC D KNG IIAGYCVEGTLAK ILSEPEE++
Sbjct:   317 QFEDAGPCVVLATPGMLQSGFSRELFESWCPDTKNGCIIAGYCVEGTLAKHILSEPEEIV 376

Query:   364 GMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREY 423
              +SG++LP++M V Y+SFSAHTDY QTS FV+ L+P H+VLVHGE +EMSRLK+ + R++
Sbjct:   377 SLSGEKLPMRMQVGYVSFSAHTDYHQTSNFVKALKPPHLVLVHGELHEMSRLKSGIERQF 436

Query:   424 EDDPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHLL 483
             +DD N  +E++NPRNT  + L F+GEKTAKV+G+LA    + +  +SG++VK NF+Y ++
Sbjct:   437 QDD-NIPIEVHNPRNTERLQLQFRGEKTAKVIGKLAQRVPENNETISGVLVKNNFSYSIM 495

Query:   484 APSDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHL---AGPVETLDEKRL------- 533
              P +L  YT L+ S + Q+ SV+YSGS+ +L   +  L   A  ++ +  K +       
Sbjct:   496 VPEELGSYTSLRISSLEQRMSVHYSGSLKLLIFNLQQLNDDACLIQNIKLKEISKKGSVT 555

Query:   534 RAFAC----IEITL--EKCIVVLEWASNPISDMYADSLISECL 570
             +A       + +T+     +VV+ W SNP+ DMYADS+++  L
Sbjct:   556 QAITVFQGKVNVTVYGNDHVVVVRWDSNPVYDMYADSVVAAIL 598


>DICTYBASE|DDB_G0274799
            symbol:cpsf3 "cleavage and polyadenylation
            specificity factor 73 kDa subunit" species:44689 "Dictyostelium
            discoideum" [GO:0016787 "hydrolase activity" evidence=IEA]
            [GO:0003723 "RNA binding" evidence=IEA;IC] [GO:0006379 "mRNA
            cleavage" evidence=ISS] [GO:0006378 "mRNA polyadenylation"
            evidence=ISS] [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=ISS] [GO:0003730 "mRNA 3'-UTR
            binding" evidence=ISS] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0006397 "mRNA processing" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0004519 "endonuclease
            activity" evidence=IEA] [GO:0004518 "nuclease activity"
            evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            dictyBase:DDB_G0274799 Pfam:PF07521 GO:GO:0046872 GO:GO:0006378
            GenomeReviews:CM000151_GR EMBL:AAFI02000012 GO:GO:0003730
            GO:GO:0004519 GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            RefSeq:XP_643926.1 ProteinModelPortal:Q86A79 SMR:Q86A79
            STRING:Q86A79 EnsemblProtists:DDB0233696 GeneID:8619353
            KEGG:ddi:DDB_G0274799 ProtClustDB:CLSZ2431003 Uniprot:Q86A79
        Length = 774

 Score = 1678 (595.7 bits), Expect = 1.1e-172, P = 1.1e-172
 Identities = 317/575 (55%), Positives = 428/575 (74%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESD--QIDLLLISHFHLD 63
             G+G EVGRSC++L++K K +M DCG+HP  SG+ +LPF D +ESD   IDLLL+SHFHLD
Sbjct:    42 GSGSEVGRSCVLLKYKGKKVMFDCGVHPAYSGLVSLPFFDSIESDIPDIDLLLVSHFHLD 101

Query:    64 HCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQ-MLYTESDLEKSMDK 122
             H  A+P+F+ KT FKGR FMTH TKAIY  LLSDY+KVSNI+ +  ML+ +SDL++S++K
Sbjct:   102 HAAAVPYFVGKTKFKGRVFMTHPTKAIYGMLLSDYVKVSNITRDDDMLFDKSDLDRSLEK 161

Query:   123 IETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIP 182
             IE + + ++ + NGIK + +NAGHVLGAAMF+IEIAGVKILYTGDFSRQEDRHLM AE P
Sbjct:   162 IEKVRYRQKVEHNGIKVTCFNAGHVLGAAMFMIEIAGVKILYTGDFSRQEDRHLMGAETP 221

Query:   183 PVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILD 242
             PVK D+LI ESTYG  VHE R ERE RFTS +H +V R G+CLIPVFALGRAQELLLILD
Sbjct:   222 PVKVDVLIIESTYGVQVHEPRLEREKRFTSSVHQVVERNGKCLIPVFALGRAQELLLILD 281

Query:   243 EYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGI 302
             EYW  +P+LH +PIYYAS+LAKKCM VY+TYIN MNDR+R Q  ++NPF FKHI N+KGI
Sbjct:   282 EYWIANPQLHHVPIYYASALAKKCMGVYRTYINMMNDRVRAQFDVSNPFEFKHIKNIKGI 341

Query:   303 DHFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEV 362
             + F+D GPCV MASPGM+QSGLSR+LFE WC+D +NG++I GY VEGTLAK I+SEP E+
Sbjct:   342 ESFDDRGPCVFMASPGMLQSGLSRQLFERWCSDKRNGIVIPGYSVEGTLAKHIMSEPAEI 401

Query:   363 IGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTRE 422
               +    +PL ++V Y+SFSAH+D+ QTSEF++E++P HVVLVHG+ NEMSRL+ +L  +
Sbjct:   402 TRLDNVNVPLNLTVSYVSFSAHSDFLQTSEFIQEIQPPHVVLVHGDANEMSRLRQSLVAK 461

Query:   423 YEDDPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHL 482
             ++     ++ +  P+N +SV L F+ EK AK +G +     K +  + GI+V ++F +H+
Sbjct:   462 FK-----TINVLTPKNAMSVALEFRPEKVAKTLGSIITNPPKQNDIIQGILVTKDFTHHI 516

Query:   483 LAPSDLPKYTDLKASKIIQQQSVYYSGS----ISVLRSLISHLAGPVETL----DEK-RL 533
             L+ SD+  YT+LK + I Q+ ++ ++ +    IS L  +   +    E+     +EK  +
Sbjct:   517 LSASDIHNYTNLKTNIIKQKLTLPFAQTYHILISTLEQIYEQIIESTESTGGGGNEKPTI 576

Query:   534 RAFACIEITLEKCI-VVLEWASNPISDMYADSLIS 567
               +  I++     + ++LEW SN ++DM  DS+I+
Sbjct:   577 TIYNEIKLIYNIGVSIILEWNSNTVNDMICDSIIA 611


>TAIR|locus:2206076
            symbol:CPSF73-I "cleavage and polyadenylation specificity
            factor 73-I" species:3702 "Arabidopsis thaliana" [GO:0005634
            "nucleus" evidence=ISM;IDA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=ISS]
            [GO:0006378 "mRNA polyadenylation" evidence=ISS] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0005515 "protein binding"
            evidence=IPI] [GO:0000278 "mitotic cell cycle" evidence=RCA]
            [GO:0006346 "methylation-dependent chromatin silencing"
            evidence=RCA] [GO:0006396 "RNA processing" evidence=RCA]
            [GO:0006397 "mRNA processing" evidence=RCA] [GO:0009855
            "determination of bilateral symmetry" evidence=RCA] [GO:0010014
            "meristem initiation" evidence=RCA] [GO:0010073 "meristem
            maintenance" evidence=RCA] [GO:0016246 "RNA interference"
            evidence=RCA] [GO:0031507 "heterochromatin assembly" evidence=RCA]
            [GO:0045787 "positive regulation of cell cycle" evidence=RCA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0005634 GO:GO:0006397
            GO:GO:0090305 EMBL:AC018908 GO:GO:0004518 eggNOG:COG1236
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            EMBL:AY140900 EMBL:AY150478 EMBL:AY074280 EMBL:AK316692
            EMBL:AK316794 IPI:IPI00533462 PIR:G96635 RefSeq:NP_001031215.1
            RefSeq:NP_176297.1 RefSeq:NP_849835.1 UniGene:At.23510
            ProteinModelPortal:Q9C952 SMR:Q9C952 IntAct:Q9C952 STRING:Q9C952
            PaxDb:Q9C952 PRIDE:Q9C952 EnsemblPlants:AT1G61010.1
            EnsemblPlants:AT1G61010.2 EnsemblPlants:AT1G61010.3 GeneID:842393
            KEGG:ath:AT1G61010 TAIR:At1g61010 HOGENOM:HOG000203394
            InParanoid:Q9C952 KO:K14403 OMA:YVSFSAH PhylomeDB:Q9C952
            ProtClustDB:CLSN2681829 Genevestigator:Q9C952 InterPro:IPR021718
            Pfam:PF11718 SMART:SM01098 Uniprot:Q9C952
        Length = 693

 Score = 1671 (593.3 bits), Expect = 6.2e-172, P = 6.2e-172
 Identities = 324/569 (56%), Positives = 416/569 (73%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
             GAG EVGRSC+ + F+ K+I+ DCGIHP  SGM ALP+ D ++   ID+LLI+HFH+DH 
Sbjct:    28 GAGSEVGRSCVYMSFRGKNILFDCGIHPAYSGMAALPYFDEIDPSSIDVLLITHFHIDHA 87

Query:    66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIET 125
              +LP+FL KT F GR FMTHATKAIY+ LL+DY+KVS +S E ML+ E D+ KSMDKIE 
Sbjct:    88 ASLPYFLEKTTFNGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLFDEQDINKSMDKIEV 147

Query:   126 INFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK 185
             I+FH+  +VNGIKF  Y AGHVLGAAMF+++IAGV+ILYTGD+SR+EDRHL AAE+P   
Sbjct:   148 IDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRILYTGDYSREEDRHLRAAELPQFS 207

Query:   186 PDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYW 245
             PDI I EST G  +H+ R  RE RFT +IH  V +GGR LIP FALGRAQELLLILDEYW
Sbjct:   208 PDICIIESTSGVQLHQSRHIREKRFTDVIHSTVAQGGRVLIPAFALGRAQELLLILDEYW 267

Query:   246 SLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGIDHF 305
             + HP+LH+IPIYYAS LAKKCM+VYQTYI +MNDRIR Q + +NPFVFKHIS L  ID F
Sbjct:   268 ANHPDLHNIPIYYASPLAKKCMAVYQTYILSMNDRIRNQFANSNPFVFKHISPLNSIDDF 327

Query:   306 EDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEVIGM 365
              D+GP VVMA+PG +QSGLSR+LF+ WC+D KN  II GY VEGTLAKTI++EP+EV  M
Sbjct:   328 NDVGPSVVMATPGGLQSGLSRQLFDSWCSDKKNACIIPGYMVEGTLAKTIINEPKEVTLM 387

Query:   366 SGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYED 425
             +G   PL M V YISFSAH DY QTS F++EL P +++LVHGE NEM RLK  L  E+ D
Sbjct:   388 NGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGEANEMMRLKQKLLTEFPD 447

Query:   426 DPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHLLAP 485
               NT  ++  P+N  SV++YF  EK AK +G LA +       +SGI+VK+ F Y ++AP
Sbjct:   448 G-NT--KIMTPKNCESVEMYFNSEKLAKTIGRLAEKTPDVGDTVSGILVKKGFTYQIMAP 504

Query:   486 SDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVE--TLDEKRLRAFACIE-IT 542
              +L  ++ L  + + Q+ ++ + G+  V++  +  +   VE  T +E  L A    E +T
Sbjct:   505 DELHVFSQLSTATVTQRITIPFVGAFGVIKHRLEKIFESVEFSTDEESGLPALKVHERVT 564

Query:   543 L----EKCIVVLEWASNPISDMYADSLIS 567
             +    EK I  L+W+S+PISDM +DS+++
Sbjct:   565 VKQESEKHIS-LQWSSDPISDMVSDSIVA 592


>POMBASE|SPAC17G6.16c
            symbol:ysh1 "mRNA cleavage and polyadenylation
            specificity factor complex endoribonuclease subunit Ysh1"
            species:4896 "Schizosaccharomyces pombe" [GO:0004521
            "endoribonuclease activity" evidence=ISO] [GO:0005634 "nucleus"
            evidence=IDA] [GO:0005829 "cytosol" evidence=IDA] [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=IDA] [GO:0006378 "mRNA polyadenylation" evidence=IC]
            [GO:0006379 "mRNA cleavage" evidence=IC] [GO:0046872 "metal ion
            binding" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 PomBase:SPAC17G6.16c Pfam:PF07521 GO:GO:0005829
            EMBL:CU329670 GenomeReviews:CU329670_GR GO:GO:0046872 GO:GO:0006378
            GO:GO:0004521 GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            HOGENOM:HOG000203394 InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            OrthoDB:EOG41ZJK7 PIR:T37848 RefSeq:NP_594263.2 STRING:O13794
            EnsemblFungi:SPAC17G6.16c.1 GeneID:2542258 NextBio:20803322
            Uniprot:O13794
        Length = 757

 Score = 1516 (538.7 bits), Expect = 1.7e-155, P = 1.7e-155
 Identities = 273/567 (48%), Positives = 402/567 (70%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
             GAG EVGRSC ++++K K++M+D G+HP  +G+ ALPF D  +   +D+LLISHFHLDH 
Sbjct:    25 GAGNEVGRSCHVIQYKGKTVMLDAGVHPAYTGLSALPFFDEFDLSTVDVLLISHFHLDHV 84

Query:    66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIET 125
              +LP+ + KT F+GR FMTH TKA+ +WLLSDY+KVSN+  E  LY E DL  + D+IE 
Sbjct:    85 ASLPYVMQKTNFRGRVFMTHPTKAVCKWLLSDYVKVSNVGMEDQLYDEKDLLAAFDRIEA 144

Query:   126 INFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK 185
             +++H   +V GIKF+ Y+AGHVLGA M+ +E+AGV IL+TGD+SR+EDRHL  AE+PP +
Sbjct:   145 VDYHSTIEVEGIKFTPYHAGHVLGACMYFVEMAGVNILFTGDYSREEDRHLHVAEVPPKR 204

Query:   186 PDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYW 245
             PD+LITESTYGT  H+ R E+E R  ++IH  +  GGR L+PVFALGRAQELLLILDEYW
Sbjct:   205 PDVLITESTYGTASHQPRLEKEARLLNIIHSTIRNGGRVLMPVFALGRAQELLLILDEYW 264

Query:   246 SLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGIDHF 305
             + H +L  +PIYYASSLA+KCM+++QTY+N MND IR+  +  NPF+F+ + +L+ ++ F
Sbjct:   265 NNHLDLRSVPIYYASSLARKCMAIFQTYVNMMNDNIRKIFAERNPFIFRFVKSLRNLEKF 324

Query:   306 EDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEVIGM 365
             +DIGP V++ASPGM+Q+G+SR L E W  D +N +++ GY VEGT+AK I +EP E++ +
Sbjct:   325 DDIGPSVILASPGMLQNGVSRTLLERWAPDPRNTLLLTGYSVEGTMAKQITNEPIEIVSL 384

Query:   366 SGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYED 425
             SGQ++P +M+V+ +SF+AH DY Q SEF+  +   H++LVHGEQ  M RLK+AL  ++ +
Sbjct:   385 SGQKIPRRMAVEELSFAAHVDYLQNSEFIDLVNADHIILVHGEQTNMGRLKSALASKFHN 444

Query:   426 DPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHLLAP 485
                  +++Y PRN V + L FKGE+  + +G++AV   K    +SGI+++++ NY L++ 
Sbjct:   445 R-KVDVKVYTPRNCVPLYLPFKGERLVRALGKVAVHKPKEGDIMSGILIQKDANYKLMSA 503

Query:   486 SDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLDEK----RLRAFACIEI 541
              DL  ++DL  + + Q+Q + +  S+ +    +  + G V+    K    +      I +
Sbjct:   504 EDLRDFSDLTTTVLTQKQVIPFFSSMELANFHLKQMFGYVKQSKTKAGQPQYTVMDAITL 563

Query:   542 TL-EKCIVVLEWASNPISDMYADSLIS 567
             TL ++  + LEW  N ++D  ADS+I+
Sbjct:   564 TLIQEHKLALEWVGNIMNDTIADSVIT 590


>SGD|S000004267
            symbol:YSH1 "Putative endoribonuclease" species:4932
            "Saccharomyces cerevisiae" [GO:0004518 "nuclease activity"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0006397 "mRNA processing" evidence=IEA]
            [GO:0006378 "mRNA polyadenylation" evidence=IMP] [GO:0006379 "mRNA
            cleavage" evidence=IMP] [GO:0006369 "termination of RNA polymerase
            II transcription" evidence=IMP] [GO:0031126 "snoRNA 3'-end
            processing" evidence=IMP] [GO:0008380 "RNA splicing" evidence=IMP]
            [GO:0034247 "snoRNA splicing" evidence=IMP] [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=IDA;IPI] [GO:0005849 "mRNA cleavage factor complex"
            evidence=IPI] [GO:0004521 "endoribonuclease activity"
            evidence=ISS;IMP] [GO:0003723 "RNA binding" evidence=IC]
            [GO:0004519 "endonuclease activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 SGD:S000004267
            Pfam:PF07521 GO:GO:0046872 GO:GO:0006378 EMBL:BK006945
            GO:GO:0004521 GO:GO:0005847 GO:GO:0006379 GO:GO:0006369
            eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 EMBL:U17245 HOGENOM:HOG000203394 KO:K14403
            InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            GeneTree:ENSGT00700000104485 GO:GO:0031126 GO:GO:0034247
            OMA:EISFAAH OrthoDB:EOG41ZJK7 PIR:S51413 RefSeq:NP_013379.1
            ProteinModelPortal:Q06224 SMR:Q06224 DIP:DIP-2470N IntAct:Q06224
            MINT:MINT-375457 STRING:Q06224 PaxDb:Q06224 PeptideAtlas:Q06224
            EnsemblFungi:YLR277C GeneID:850983 KEGG:sce:YLR277C CYGD:YLR277c
            NextBio:967501 Genevestigator:Q06224 GermOnline:YLR277C
            Uniprot:Q06224
        Length = 779

 Score = 1325 (471.5 bits), Expect = 1.2e-144, Sum P(3) = 1.2e-144
 Identities = 252/478 (52%), Positives = 346/478 (72%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
             G   EVGRSC +L++K K++M+D GIHP   G+ +LPF D  +  ++D+LLISHFHLDH 
Sbjct:    15 GGSNEVGRSCHILQYKGKTVMLDAGIHPAYQGLASLPFYDEFDLSKVDILLISHFHLDHA 74

Query:    66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNI--STEQM------LYTESDLE 117
              +LP+ + +T F+GR FMTH TKAIYRWLL D+++V++I  S+  M      L+++ DL 
Sbjct:    75 ASLPYVMQRTNFQGRVFMTHPTKAIYRWLLRDFVRVTSIGSSSSSMGTKDEGLFSDEDLV 134

Query:   118 KSMDKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLM 177
              S DKIET+++H   DVNGIKF+A++AGHVLGAAMF IEIAG+++L+TGD+SR+ DRHL 
Sbjct:   135 DSFDKIETVDYHSTVDVNGIKFTAFHAGHVLGAAMFQIEIAGLRVLFTGDYSREVDRHLN 194

Query:   178 AAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQEL 237
             +AE+PP+  ++LI EST+GT  HE R  RE + T LIH  V RGGR L+PVFALGRAQE+
Sbjct:   195 SAEVPPLSSNVLIVESTFGTATHEPRLNRERKLTQLIHSTVMRGGRVLLPVFALGRAQEI 254

Query:   238 LLILDEYWSLHP-ELH--DIPIYYASSLAKKCMSVYQTYINAMNDRIRRQI--SINNPFV 292
             +LILDEYWS H  EL    +PI+YAS+LAKKCMSV+QTY+N MND IR++   S  NPF+
Sbjct:   255 MLILDEYWSQHADELGGGQVPIFYASNLAKKCMSVFQTYVNMMNDDIRKKFRDSQTNPFI 314

Query:   293 FKHISNLKGIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLA 352
             FK+IS L+ ++ F+D GP V++ASPGM+QSGLSR+L E WC + KN V+I GY +EGT+A
Sbjct:   315 FKNISYLRNLEDFQDFGPSVMLASPGMLQSGLSRDLLERWCPEDKNLVLITGYSIEGTMA 374

Query:   353 KTILSEPEEVIGMSGQRL--PLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQN 410
             K I+ EP+ +  ++   +  P +  V+ ISF+AH D+Q+  EF+ ++   +++LVHGE N
Sbjct:   375 KFIMLEPDTIPSINNPEITIPRRCQVEEISFAAHVDFQENLEFIEKISAPNIILVHGEAN 434

Query:   411 EMSRLKAALTREYEDDPNTSMEL--YNPRNTVSVDLYFKGEKTAKVMGELAVENLKPD 466
              M RLK+AL   +     T  E+  +NPRN V VDL F+G K AK +G +  E  K +
Sbjct:   435 PMGRLKSALLSNFASLKGTDNEVHVFNPRNCVEVDLEFQGVKVAKAVGNIVNEIYKEE 492

 Score = 65 (27.9 bits), Expect = 1.2e-144, Sum P(3) = 1.2e-144
 Identities = 24/80 (30%), Positives = 40/80 (50%)

Query:   458 LAVENLKPDAALSGIIVK--RNFNYHLLAPSDLPKY-TDLKASKIIQQQSVYYSGSISVL 514
             L  E    D  +SGI+V   +NF    L+ SDL ++  DL  + + ++QSV  +    ++
Sbjct:   523 LVDEEEHKDIVVSGILVSDDKNFELDFLSLSDLREHHPDLSTTILRERQSVRVNCKKELI 582

Query:   515 RSLISHLAGPVETL-DEKRL 533
                I  + G  E L D+ R+
Sbjct:   583 YWHILQMFGEAEVLQDDDRV 602

 Score = 60 (26.2 bits), Expect = 1.2e-144, Sum P(3) = 1.2e-144
 Identities = 11/35 (31%), Positives = 22/35 (62%)

Query:   533 LRAFACIEITLEKCIVVLEWASNPISDMYADSLIS 567
             L+    I++T+   + V+EW  + ++D  ADS+I+
Sbjct:   625 LQIMGDIKLTIVNTLAVVEWTQDLMNDTVADSIIA 659


>CGD|CAL0005344
            symbol:orf19.5486 species:5476 "Candida albicans" [GO:0042493
            "response to drug" evidence=IMP] [GO:0004521 "endoribonuclease
            activity" evidence=IEA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA]
            [GO:0005829 "cytosol" evidence=IEA] [GO:0034247 "snoRNA splicing"
            evidence=IEA] [GO:0031126 "snoRNA 3'-end processing" evidence=IEA]
            [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006369 "termination
            of RNA polymerase II transcription" evidence=IEA] [GO:0006378 "mRNA
            polyadenylation" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 CGD:CAL0005344 Pfam:PF07521 GO:GO:0005634
            GO:GO:0042493 GO:GO:0046872 GO:GO:0006397 GO:GO:0090305
            GO:GO:0004519 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
            Pfam:PF10996 SMART:SM01027 KO:K14403 InterPro:IPR021718
            Pfam:PF11718 SMART:SM01098 EMBL:AACQ01000196 EMBL:AACQ01000195
            RefSeq:XP_711478.1 RefSeq:XP_711502.1 ProteinModelPortal:Q59P50
            STRING:Q59P50 GeneID:3646887 GeneID:3646911 KEGG:cal:CaO19.12941
            KEGG:cal:CaO19.5486 Uniprot:Q59P50
        Length = 870

 Score = 1281 (456.0 bits), Expect = 1.7e-143, Sum P(3) = 1.7e-143
 Identities = 240/476 (50%), Positives = 343/476 (72%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
             G   EVGRSC ++E+KNK IM+D G+HP LSG  + P+ D  +  ++D+LLISHFH+DH 
Sbjct:   106 GGCNEVGRSCHIIEYKNKVIMLDSGMHPALSGHASFPYFDEYDISKVDILLISHFHVDHS 165

Query:    66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM----------LYTESD 115
              +LP+ + ++ F+G+ FMTHATKAIYRWL+ D+++V++I   +           LYT+ D
Sbjct:   166 ASLPYVMQQSNFRGKVFMTHATKAIYRWLMQDFVRVTSIGNSRSEDGGGGEGSNLYTDDD 225

Query:   116 LEKSMDKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRH 175
             + KS D+IETI++H   +++GI+F+AY+AGHVLGA M+ IEI G+K+L+TGD+SR+E+RH
Sbjct:   226 IMKSFDRIETIDYHSTMEIDGIRFTAYHAGHVLGACMYFIEIGGLKVLFTGDYSREENRH 285

Query:   176 LMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQ 235
             L AAE+PP+KPDILI+EST+GT   E R E E + T+ IH  + +GGR L+PVFALG AQ
Sbjct:   286 LHAAEVPPLKPDILISESTFGTGTLEPRIELERKLTTHIHATIAKGGRVLLPVFALGNAQ 345

Query:   236 ELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISIN---NPFV 292
             ELLLILDEYWS + +L ++ ++YAS+LAKKCM+VY+TY   MND+IR   + +   NPF 
Sbjct:   346 ELLLILDEYWSQNEDLQNVNVFYASNLAKKCMAVYETYTGIMNDKIRLSSASSEKSNPFD 405

Query:   293 FKHISNLKGIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLA 352
             FK+I ++K +  F+D+GP VV+A+PGM+Q+G+SR+L E W  D KN VI+ GY VEGT+A
Sbjct:   406 FKYIKSIKDLSKFQDMGPSVVVATPGMLQAGVSRQLLEKWAPDGKNLVILTGYSVEGTMA 465

Query:   353 KTILSEPEEVIGMSG--QRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQN 410
             K +L EP  +   +     +P ++ ++ ISF+AH D+QQ SEF+ ++ P+ V+LVHG+  
Sbjct:   466 KELLKEPTMIQSATNPDMTIPRRIGIEEISFAAHVDFQQNSEFIEKVSPSKVILVHGDSV 525

Query:   411 EMSRLKAALTREYEDDPNTSMEL--YNPRNTVSVDLYFKGEKTAKVMGELAVENLK 464
              M RLK+AL  +Y     T  E+  YNP+N   + + FKG K AKV+G LA E L+
Sbjct:   526 PMGRLKSALLSKYASRKGTDQEVKVYNPKNCEELIIGFKGLKIAKVLGSLAEEQLQ 581

 Score = 104 (41.7 bits), Expect = 1.7e-143, Sum P(3) = 1.7e-143
 Identities = 35/171 (20%), Positives = 79/171 (46%)

Query:   409 QNEMSRLKAALTREYEDDPNTSMELYNPRNTVSVDLYFKGEK-TAKVMGELAVENLKPDA 467
             + ++  LK  +  E   + +   EL   +          GE  T +   E ++  LK   
Sbjct:   577 EEQLQVLKKIIQDEVSAENSKITELTEEKEEADEIKEDNGETDTTQKPNESSINVLKTGQ 636

Query:   468 ALSGIIVKRNFNYHLLAPSDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVET 527
              +SG++V ++FN +LL   DL ++T L  S +  +  +  +  IS++   +  + G +  
Sbjct:   637 VVSGVLVSKDFNLNLLQLQDLHEFTQLSTSIVKSKMHLKINADISLMVWHLEQMFGYINV 696

Query:   528 LDEKRLRAFACI-----EITLEKC-----IVVLEWAS-NPISDMYADSLIS 567
             +++     + C+     ++ +++       + +EW + N ++D  ADS+I+
Sbjct:   697 INDDD-EEWECVIMDVVDVFIDRSKGPGLFITVEWINDNLMADSLADSVIA 746

 Score = 54 (24.1 bits), Expect = 1.7e-143, Sum P(3) = 1.7e-143
 Identities = 15/50 (30%), Positives = 23/50 (46%)

Query:   573 ILVEMYGEAAVPKMFKGEKITITVDKKKACIDLVDLSVQCEDSKLKSTVQ 622
             +L   +G++   K    EK  I + K  A +D   L V+C    LK  V+
Sbjct:   803 LLKAQFGDSL--KELPEEKAIIQIGKTVANVDYKRLEVECSSKVLKDRVE 850


>UNIPROTKB|Q59P50
            symbol:YSH1 "Endoribonuclease YSH1" species:237561 "Candida
            albicans SC5314" [GO:0042493 "response to drug" evidence=IMP]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 CGD:CAL0005344
            Pfam:PF07521 GO:GO:0005634 GO:GO:0042493 GO:GO:0046872
            GO:GO:0006397 GO:GO:0090305 GO:GO:0004519 eggNOG:COG1236
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            KO:K14403 InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            EMBL:AACQ01000196 EMBL:AACQ01000195 RefSeq:XP_711478.1
            RefSeq:XP_711502.1 ProteinModelPortal:Q59P50 STRING:Q59P50
            GeneID:3646887 GeneID:3646911 KEGG:cal:CaO19.12941
            KEGG:cal:CaO19.5486 Uniprot:Q59P50
        Length = 870

 Score = 1281 (456.0 bits), Expect = 1.7e-143, Sum P(3) = 1.7e-143
 Identities = 240/476 (50%), Positives = 343/476 (72%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
             G   EVGRSC ++E+KNK IM+D G+HP LSG  + P+ D  +  ++D+LLISHFH+DH 
Sbjct:   106 GGCNEVGRSCHIIEYKNKVIMLDSGMHPALSGHASFPYFDEYDISKVDILLISHFHVDHS 165

Query:    66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM----------LYTESD 115
              +LP+ + ++ F+G+ FMTHATKAIYRWL+ D+++V++I   +           LYT+ D
Sbjct:   166 ASLPYVMQQSNFRGKVFMTHATKAIYRWLMQDFVRVTSIGNSRSEDGGGGEGSNLYTDDD 225

Query:   116 LEKSMDKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRH 175
             + KS D+IETI++H   +++GI+F+AY+AGHVLGA M+ IEI G+K+L+TGD+SR+E+RH
Sbjct:   226 IMKSFDRIETIDYHSTMEIDGIRFTAYHAGHVLGACMYFIEIGGLKVLFTGDYSREENRH 285

Query:   176 LMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQ 235
             L AAE+PP+KPDILI+EST+GT   E R E E + T+ IH  + +GGR L+PVFALG AQ
Sbjct:   286 LHAAEVPPLKPDILISESTFGTGTLEPRIELERKLTTHIHATIAKGGRVLLPVFALGNAQ 345

Query:   236 ELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISIN---NPFV 292
             ELLLILDEYWS + +L ++ ++YAS+LAKKCM+VY+TY   MND+IR   + +   NPF 
Sbjct:   346 ELLLILDEYWSQNEDLQNVNVFYASNLAKKCMAVYETYTGIMNDKIRLSSASSEKSNPFD 405

Query:   293 FKHISNLKGIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLA 352
             FK+I ++K +  F+D+GP VV+A+PGM+Q+G+SR+L E W  D KN VI+ GY VEGT+A
Sbjct:   406 FKYIKSIKDLSKFQDMGPSVVVATPGMLQAGVSRQLLEKWAPDGKNLVILTGYSVEGTMA 465

Query:   353 KTILSEPEEVIGMSG--QRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQN 410
             K +L EP  +   +     +P ++ ++ ISF+AH D+QQ SEF+ ++ P+ V+LVHG+  
Sbjct:   466 KELLKEPTMIQSATNPDMTIPRRIGIEEISFAAHVDFQQNSEFIEKVSPSKVILVHGDSV 525

Query:   411 EMSRLKAALTREYEDDPNTSMEL--YNPRNTVSVDLYFKGEKTAKVMGELAVENLK 464
              M RLK+AL  +Y     T  E+  YNP+N   + + FKG K AKV+G LA E L+
Sbjct:   526 PMGRLKSALLSKYASRKGTDQEVKVYNPKNCEELIIGFKGLKIAKVLGSLAEEQLQ 581

 Score = 104 (41.7 bits), Expect = 1.7e-143, Sum P(3) = 1.7e-143
 Identities = 35/171 (20%), Positives = 79/171 (46%)

Query:   409 QNEMSRLKAALTREYEDDPNTSMELYNPRNTVSVDLYFKGEK-TAKVMGELAVENLKPDA 467
             + ++  LK  +  E   + +   EL   +          GE  T +   E ++  LK   
Sbjct:   577 EEQLQVLKKIIQDEVSAENSKITELTEEKEEADEIKEDNGETDTTQKPNESSINVLKTGQ 636

Query:   468 ALSGIIVKRNFNYHLLAPSDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVET 527
              +SG++V ++FN +LL   DL ++T L  S +  +  +  +  IS++   +  + G +  
Sbjct:   637 VVSGVLVSKDFNLNLLQLQDLHEFTQLSTSIVKSKMHLKINADISLMVWHLEQMFGYINV 696

Query:   528 LDEKRLRAFACI-----EITLEKC-----IVVLEWAS-NPISDMYADSLIS 567
             +++     + C+     ++ +++       + +EW + N ++D  ADS+I+
Sbjct:   697 INDDD-EEWECVIMDVVDVFIDRSKGPGLFITVEWINDNLMADSLADSVIA 746

 Score = 54 (24.1 bits), Expect = 1.7e-143, Sum P(3) = 1.7e-143
 Identities = 15/50 (30%), Positives = 23/50 (46%)

Query:   573 ILVEMYGEAAVPKMFKGEKITITVDKKKACIDLVDLSVQCEDSKLKSTVQ 622
             +L   +G++   K    EK  I + K  A +D   L V+C    LK  V+
Sbjct:   803 LLKAQFGDSL--KELPEEKAIIQIGKTVANVDYKRLEVECSSKVLKDRVE 850


>ASPGD|ASPL0000060573
            symbol:AN0990 species:162425 "Emericella nidulans"
            [GO:0008150 "biological_process" evidence=ND] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA]
            [GO:0005829 "cytosol" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 GO:GO:0005634 EMBL:BN001308
            GO:GO:0046872 GO:GO:0006397 GO:GO:0090305 GO:GO:0004519
            eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 HOGENOM:HOG000203394 KO:K14403 InterPro:IPR021718
            Pfam:PF11718 SMART:SM01098 EMBL:AACD01000015 RefSeq:XP_658594.1
            ProteinModelPortal:Q5BEP0 STRING:Q5BEP0
            EnsemblFungi:CADANIAT00001661 GeneID:2876766 KEGG:ani:AN0990.2
            OMA:EISFAAH OrthoDB:EOG41ZJK7 Uniprot:Q5BEP0
        Length = 884

 Score = 849 (303.9 bits), Expect = 6.9e-114, Sum P(4) = 6.9e-114
 Identities = 163/283 (57%), Positives = 207/283 (73%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
             G G EVGRSC ++++K K++M+D G+HP   G  ALPF D  +   +D+LLISHFH+DH 
Sbjct:    30 GGGNEVGRSCHIIQYKGKTVMLDAGMHPAKEGFSALPFFDEFDLSTVDILLISHFHVDHS 89

Query:    66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNI--STEQM--LYTESDLEKSMD 121
              ALP+ L KT FKGR FMTHATKAIY+WL+ D ++V+N   S++Q   LYTE D   ++ 
Sbjct:    90 SALPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVNNTASSSDQRTTLYTEHDHLSTLP 149

Query:   122 KIETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEI 181
              IETI+F+    +N I+ + Y AGHVLGAAMFLI IAG+ IL+TGD+SR+EDRHL+ A +
Sbjct:   150 LIETIDFNTTHTINSIRITPYPAGHVLGAAMFLISIAGLNILFTGDYSREEDRHLIPATV 209

Query:   182 PP-VKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLI 240
             P  VK D+LITEST+G   +  R ERE      I  ++NRGGR L+PVFALGRAQELLLI
Sbjct:   210 PRGVKIDVLITESTFGISSNPPRLEREAALMKSITGVLNRGGRVLMPVFALGRAQELLLI 269

Query:   241 LDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRR 283
             L+EYW  HPEL  IPIYY  + A++CM VYQTYI AMND I+R
Sbjct:   270 LEEYWETHPELQKIPIYYIGNTARRCMVVYQTYIGAMNDNIKR 312

 Score = 726 (260.6 bits), Expect = 5.4e-101, Sum P(4) = 5.4e-101
 Identities = 142/268 (52%), Positives = 191/268 (71%)

Query:   110 LYTESDLEKSMDKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFS 169
             LYTE D   ++  IETI+F+    +N I+ + Y AGHVLGAAMFLI IAG+ IL+TGD+S
Sbjct:   138 LYTEHDHLSTLPLIETIDFNTTHTINSIRITPYPAGHVLGAAMFLISIAGLNILFTGDYS 197

Query:   170 RQEDRHLMAAEIPP-VKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPV 228
             R+EDRHL+ A +P  VK D+LITEST+G   +  R ERE      I  ++NRGGR L+PV
Sbjct:   198 REEDRHLIPATVPRGVKIDVLITESTFGISSNPPRLEREAALMKSITGVLNRGGRVLMPV 257

Query:   229 FALGRAQELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQI--- 285
             FALGRAQELLLIL+EYW  HPEL  IPIYY  + A++CM VYQTYI AMND I+R     
Sbjct:   258 FALGRAQELLLILEEYWETHPELQKIPIYYIGNTARRCMVVYQTYIGAMNDNIKRLFRQR 317

Query:   286 ----------SIN-NPFVFKHISNLKGIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCT 334
                       S++  P+ FK++ +L+ ++ F+D+G CV++ASPGM+Q+G SREL E W  
Sbjct:   318 MAEAEASGDKSVSAGPWDFKYVRSLRSLERFDDVGGCVMLASPGMLQTGTSRELLERWAP 377

Query:   335 DAKNGVIIAGYCVEGTLAKTILSEPEEV 362
             + +NGV++ GY VEGT+AK +L+EP+++
Sbjct:   378 NERNGVVMTGYSVEGTMAKQLLNEPDQI 405

 Score = 224 (83.9 bits), Expect = 6.9e-114, Sum P(4) = 6.9e-114
 Identities = 52/166 (31%), Positives = 91/166 (54%)

Query:   370 LPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNT 429
             +P + +VD ISF+AH D  +   F+ E+    V+LVHGE+++M RLK+ L      +   
Sbjct:   432 IPRRCTVDEISFAAHVDGVENRNFIEEVSAPVVILVHGEKHQMMRLKSKLL-SLNAEKTV 490

Query:   430 SMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPD------AALSGIIVKRNFNYHLL 483
              +++Y P N   V + F+ +K AKV+G+LA   L  D        ++G++V+  F+  L+
Sbjct:   491 KVKVYTPANCEEVRIPFRKDKIAKVVGKLAQTTLPTDNEDGDGPLMAGVLVQNGFDLSLM 550

Query:   484 APSDLPKYTDLKASKIIQQQSVYYSG-SISVLRSLISHLAGPVETL 528
             AP DL +Y  L  + I  +Q +  S  S+ +++  +    G +E +
Sbjct:   551 APDDLREYAGLATTTITCKQHITLSSASMDLIKWALEGTFGAIEEI 596

 Score = 54 (24.1 bits), Expect = 6.9e-114, Sum P(4) = 6.9e-114
 Identities = 13/31 (41%), Positives = 19/31 (61%)

Query:   592 ITITVDKKKACIDLVDLSVQCEDSKLKSTVQ 622
             I I VDK  A + L DL V+C ++ L+  V+
Sbjct:   803 IEIKVDKHVARVWLEDLEVECANAVLRDRVR 833

 Score = 41 (19.5 bits), Expect = 6.9e-114, Sum P(4) = 6.9e-114
 Identities = 8/23 (34%), Positives = 15/23 (65%)

Query:   548 VVLEWASNPISDMYADSLISECL 570
             V L+W  N ++D  AD++++  L
Sbjct:   650 VELQWEGNMMNDGIADAVMAVLL 672


>GENEDB_PFALCIPARUM|PF14_0364
            symbol:PF14_0364 "cleavage and polyadenylation
            specificity factor protein, putative" species:5833 "Plasmodium
            falciparum" [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=ISS] [GO:0006378 "mRNA
            polyadenylation" evidence=ISS] [GO:0006379 "mRNA cleavage"
            evidence=ISS] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 GO:GO:0016787 EMBL:AE014187 GO:GO:0005847
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            HOGENOM:HOG000203394 KO:K14403 InterPro:IPR021718 Pfam:PF11718
            SMART:SM01098 RefSeq:XP_001348538.1 ProteinModelPortal:Q8IL83
            PRIDE:Q8IL83 EnsemblProtists:PF14_0364:mRNA GeneID:811946
            KEGG:pfa:PF14_0364 EuPathDB:PlasmoDB:PF3D7_1438500 OMA:CLITHFH
            ProtClustDB:CLSZ2457730 Uniprot:Q8IL83
        Length = 876

 Score = 812 (290.9 bits), Expect = 1.0e-111, Sum P(3) = 1.0e-111
 Identities = 156/376 (41%), Positives = 252/376 (67%)

Query:   109 MLYTESDLEKSMDKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDF 168
             +LY E+D++K+MD IET+NFH+  +   +KF+AY AGHV+GA MFL+EI  ++ LYTGD+
Sbjct:   166 VLYDENDIDKTMDLIETLNFHQNFEFPNVKFTAYRAGHVIGACMFLVEINNIRFLYTGDY 225

Query:   169 SRQEDRHLMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPV 228
             SR+ DRH+  AEIP +   +LI E TYG  VH+ R++RE RF +++  ++N  G+ L+PV
Sbjct:   226 SREIDRHIPIAEIPNIDVHVLICEGTYGIKVHDDRKKREIRFLNILTSMINNKGKVLLPV 285

Query:   229 FALGRAQELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISIN 288
             FALGRAQELLLIL+E+W  +  L +IPI+Y SS+A K + +Y+T+IN   + +++ ++  
Sbjct:   286 FALGRAQELLLILEEHWDKNKHLQNIPIFYISSMATKSLCIYETFINLCGEFVKKVVNEG 345

Query:   289 -NPFVFKHISNLKGIDH-----FEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVII 342
              NPF FK++   K ++      ++D  PCV+MASPGM+Q+G+S+ +F +  +D K+GVI+
Sbjct:   346 KNPFNFKYVKYAKSLESISSYLYQDNNPCVIMASPGMLQNGISKNIFNIIASDKKSGVIL 405

Query:   343 AGYCVEGTLAKTILSEPEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHV 402
              GY V+GTLA  + +EPE V  ++ + +  K   + ISFSAH+D+ QT  F+ +L+  +V
Sbjct:   406 TGYTVKGTLADELKTEPEFVT-INDKVVKRKCRFEQISFSAHSDFNQTKTFIEKLKCPNV 464

Query:   403 VLVHGEQNEMSRLKAALTREYEDDPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELA--V 460
             VLVHG++NE++RLK  L  E +      + ++ P     +  +F+   +   +G+L+  +
Sbjct:   465 VLVHGDKNELNRLKNKLIEEKQ-----YLSVFTPELLQKLSFHFEQNDSLISLGKLSEHI 519

Query:   461 ENLKPDAALSGIIVKR 476
             + +     L G+ +K+
Sbjct:   520 KKINKKIKLEGLKMKK 535

 Score = 278 (102.9 bits), Expect = 1.0e-111, Sum P(3) = 1.0e-111
 Identities = 49/96 (51%), Positives = 66/96 (68%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
             G   EVGRSC+++E    S+M+DCGIHP   G+  LP  D  +  ++DL LI+HFH+DH 
Sbjct:    10 GGASEVGRSCVIIECDKTSVMLDCGIHPAFMGIGCLPIYDAYDISKVDLCLITHFHMDHS 69

Query:    66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKV 101
             GALP+ + KT FKGR FMT ATK+I   L +DY ++
Sbjct:    70 GALPYLINKTRFKGRIFMTEATKSICYLLWNDYARI 105

 Score = 98 (39.6 bits), Expect = 2.0e-26, Sum P(3) = 2.0e-26
 Identities = 53/247 (21%), Positives = 108/247 (43%)

Query:   338 NGVIIAGYCVEGTLAKTILSEPEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVREL 397
             N V++ G   E    K  L E ++ + +    L  K+S  +    +     + SE ++++
Sbjct:   463 NVVLVHGDKNELNRLKNKLIEEKQYLSVFTPELLQKLSFHFEQNDSLISLGKLSEHIKKI 522

Query:   398 RPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMELYNPRNTVSVDLYFKGEKTA----K 453
                 + L   E  +M + K     E+    N   ++ N    + +    K +        
Sbjct:   523 NKK-IKL---EGLKMKKEKMIANDEHISVKNEMGDINNDEENLQISDKKKNKVDEHDKHN 578

Query:   454 VMGELAVENLKPDAALSGIIVKRNFNYHLLA-PSDLPKYTDLKASKIIQQQSVYYSGSIS 512
             +   ++ E    +  + GII+    N  +L  P+D+ +YT+LK + I Q  ++ +     
Sbjct:   579 INNNISNEKHNVNNQIEGIIITEPQNVPILIYPNDIYEYTNLKTAMIDQTINISFPYRFD 638

Query:   513 VLRSLISHLAGPVETLDEKRLRAFACIEITLEKC--IVVLEWASNPISDMYADSLISECL 570
             +L ++I ++    ET  +  L     I+I   K   ++ + W S+P++D+ ADS I+  +
Sbjct:   639 LLYNVIINVYE--ETHMDDNLIIVKDIKIIYCKDDKMIKINWLSSPLNDLIADS-INFLI 695

Query:   571 IEILVEM 577
             +E L  M
Sbjct:   696 LEFLETM 702

 Score = 47 (21.6 bits), Expect = 1.0e-111, Sum P(3) = 1.0e-111
 Identities = 17/53 (32%), Positives = 26/53 (49%)

Query:   554 SNPISDMYADSLISECLIEILVEMYGEAA-VPKMFKGEKITITVDKKKACIDL 605
             S PI+D+  D  I E +I  + E +     + K    EK T+ + KKK  + L
Sbjct:   708 SIPIADVLTDHNIYEMIISYVEENFTNVERISKEILKEK-TLQMIKKKEQLKL 759


>UNIPROTKB|Q8IL83
            symbol:PF14_0364 "Cleavage and polyadenylation specificity
            factor protein, putative" species:36329 "Plasmodium falciparum 3D7"
            [GO:0005847 "mRNA cleavage and polyadenylation specificity factor
            complex" evidence=ISS] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 GO:GO:0016787 EMBL:AE014187
            GO:GO:0005847 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 HOGENOM:HOG000203394 KO:K14403 InterPro:IPR021718
            Pfam:PF11718 SMART:SM01098 RefSeq:XP_001348538.1
            ProteinModelPortal:Q8IL83 PRIDE:Q8IL83
            EnsemblProtists:PF14_0364:mRNA GeneID:811946 KEGG:pfa:PF14_0364
            EuPathDB:PlasmoDB:PF3D7_1438500 OMA:CLITHFH ProtClustDB:CLSZ2457730
            Uniprot:Q8IL83
        Length = 876

 Score = 812 (290.9 bits), Expect = 1.0e-111, Sum P(3) = 1.0e-111
 Identities = 156/376 (41%), Positives = 252/376 (67%)

Query:   109 MLYTESDLEKSMDKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDF 168
             +LY E+D++K+MD IET+NFH+  +   +KF+AY AGHV+GA MFL+EI  ++ LYTGD+
Sbjct:   166 VLYDENDIDKTMDLIETLNFHQNFEFPNVKFTAYRAGHVIGACMFLVEINNIRFLYTGDY 225

Query:   169 SRQEDRHLMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPV 228
             SR+ DRH+  AEIP +   +LI E TYG  VH+ R++RE RF +++  ++N  G+ L+PV
Sbjct:   226 SREIDRHIPIAEIPNIDVHVLICEGTYGIKVHDDRKKREIRFLNILTSMINNKGKVLLPV 285

Query:   229 FALGRAQELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISIN 288
             FALGRAQELLLIL+E+W  +  L +IPI+Y SS+A K + +Y+T+IN   + +++ ++  
Sbjct:   286 FALGRAQELLLILEEHWDKNKHLQNIPIFYISSMATKSLCIYETFINLCGEFVKKVVNEG 345

Query:   289 -NPFVFKHISNLKGIDH-----FEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVII 342
              NPF FK++   K ++      ++D  PCV+MASPGM+Q+G+S+ +F +  +D K+GVI+
Sbjct:   346 KNPFNFKYVKYAKSLESISSYLYQDNNPCVIMASPGMLQNGISKNIFNIIASDKKSGVIL 405

Query:   343 AGYCVEGTLAKTILSEPEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHV 402
              GY V+GTLA  + +EPE V  ++ + +  K   + ISFSAH+D+ QT  F+ +L+  +V
Sbjct:   406 TGYTVKGTLADELKTEPEFVT-INDKVVKRKCRFEQISFSAHSDFNQTKTFIEKLKCPNV 464

Query:   403 VLVHGEQNEMSRLKAALTREYEDDPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELA--V 460
             VLVHG++NE++RLK  L  E +      + ++ P     +  +F+   +   +G+L+  +
Sbjct:   465 VLVHGDKNELNRLKNKLIEEKQ-----YLSVFTPELLQKLSFHFEQNDSLISLGKLSEHI 519

Query:   461 ENLKPDAALSGIIVKR 476
             + +     L G+ +K+
Sbjct:   520 KKINKKIKLEGLKMKK 535

 Score = 278 (102.9 bits), Expect = 1.0e-111, Sum P(3) = 1.0e-111
 Identities = 49/96 (51%), Positives = 66/96 (68%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
             G   EVGRSC+++E    S+M+DCGIHP   G+  LP  D  +  ++DL LI+HFH+DH 
Sbjct:    10 GGASEVGRSCVIIECDKTSVMLDCGIHPAFMGIGCLPIYDAYDISKVDLCLITHFHMDHS 69

Query:    66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKV 101
             GALP+ + KT FKGR FMT ATK+I   L +DY ++
Sbjct:    70 GALPYLINKTRFKGRIFMTEATKSICYLLWNDYARI 105

 Score = 98 (39.6 bits), Expect = 2.0e-26, Sum P(3) = 2.0e-26
 Identities = 53/247 (21%), Positives = 108/247 (43%)

Query:   338 NGVIIAGYCVEGTLAKTILSEPEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVREL 397
             N V++ G   E    K  L E ++ + +    L  K+S  +    +     + SE ++++
Sbjct:   463 NVVLVHGDKNELNRLKNKLIEEKQYLSVFTPELLQKLSFHFEQNDSLISLGKLSEHIKKI 522

Query:   398 RPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMELYNPRNTVSVDLYFKGEKTA----K 453
                 + L   E  +M + K     E+    N   ++ N    + +    K +        
Sbjct:   523 NKK-IKL---EGLKMKKEKMIANDEHISVKNEMGDINNDEENLQISDKKKNKVDEHDKHN 578

Query:   454 VMGELAVENLKPDAALSGIIVKRNFNYHLLA-PSDLPKYTDLKASKIIQQQSVYYSGSIS 512
             +   ++ E    +  + GII+    N  +L  P+D+ +YT+LK + I Q  ++ +     
Sbjct:   579 INNNISNEKHNVNNQIEGIIITEPQNVPILIYPNDIYEYTNLKTAMIDQTINISFPYRFD 638

Query:   513 VLRSLISHLAGPVETLDEKRLRAFACIEITLEKC--IVVLEWASNPISDMYADSLISECL 570
             +L ++I ++    ET  +  L     I+I   K   ++ + W S+P++D+ ADS I+  +
Sbjct:   639 LLYNVIINVYE--ETHMDDNLIIVKDIKIIYCKDDKMIKINWLSSPLNDLIADS-INFLI 695

Query:   571 IEILVEM 577
             +E L  M
Sbjct:   696 LEFLETM 702

 Score = 47 (21.6 bits), Expect = 1.0e-111, Sum P(3) = 1.0e-111
 Identities = 17/53 (32%), Positives = 26/53 (49%)

Query:   554 SNPISDMYADSLISECLIEILVEMYGEAA-VPKMFKGEKITITVDKKKACIDL 605
             S PI+D+  D  I E +I  + E +     + K    EK T+ + KKK  + L
Sbjct:   708 SIPIADVLTDHNIYEMIISYVEENFTNVERISKEILKEK-TLQMIKKKEQLKL 759


>UNIPROTKB|F1NV30
            symbol:CPSF3L "Integrator complex subunit 11" species:9031
            "Gallus gallus" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 OMA:MAVEYMS GeneTree:ENSGT00700000104485
            IPI:IPI00571913 EMBL:AADN02040858 Ensembl:ENSGALT00000002586
            Uniprot:F1NV30
        Length = 600

 Score = 886 (316.9 bits), Expect = 9.6e-89, P = 9.6e-89
 Identities = 201/501 (40%), Positives = 295/501 (58%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLISHF 60
             GAGQ+VGRSCI++    K++M+DCG+H G +     P F  + ++    D +D ++ISHF
Sbjct:    10 GAGQDVGRSCILVSIAGKNVMLDCGMHMGYNDDRRFPDFSYITQNGRLTDFLDCVIISHF 69

Query:    61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
             HLDHCGALP+F    G+ G  +MTH TKAI   LL DY K++ +   E   +T   ++  
Sbjct:    70 HLDHCGALPYFSEMVGYDGPIYMTHPTKAICPILLEDYRKITVDKKGETNFFTSQMIKDC 129

Query:   120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
             M K+  ++ H+   V+  ++  AY AGHVLGAAMF I++    ++YTGD++   DRHL A
Sbjct:   130 MKKVVAVHLHQTVQVDEELEIKAYYAGHVLGAAMFQIKVGCESVVYTGDYNMTPDRHLGA 189

Query:   179 AEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELL 238
             A I   +PD+LITESTY T + + +  RE  F   +H+ V RGG+ LIPVFALGRAQEL 
Sbjct:   190 AWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELC 249

Query:   239 LILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISN 298
             ++L+ +W     L   PIY+++ L +K    Y+ +I   N +IR+     N F FKHI  
Sbjct:   250 ILLETFWE-RMNLK-APIYFSTGLTEKANHYYKLFITWTNQKIRKTFVQRNMFEFKHI-- 305

Query:   299 LKGIDH-FEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTIL 356
              K  D  F D  GP VV A+PGM+ +G S ++F  W  + KN VI+ GYCV+GT+   IL
Sbjct:   306 -KAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVGHKIL 364

Query:   357 SEPEEVIGMSGQR-LPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRL 415
             S   + + M G++ L +KM V+Y+SFSAH D +   + +R+  P +V+LVHGE  +M  L
Sbjct:   365 SGQRK-LEMEGRQILEVKMQVEYMSFSAHADAKGIMQLIRQAEPRNVLLVHGEAKKMEFL 423

Query:   416 KAALTREYEDD---P---NTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAAL 469
             K  + +E+  +   P    T+    NP   V + L     +TA  +G L  +  KP    
Sbjct:   424 KQKIEQEFHVNCYMPANGETTSIFTNPSIPVDISLGLLKRETA--IG-LLPDAKKPKLMH 480

Query:   470 SGIIVKRNFNYHLLAPSDLPK 490
               +I+K N ++ L++P    K
Sbjct:   481 GTLIMKDN-SFRLVSPEQALK 500


>UNIPROTKB|Q5ZIH0
            symbol:CPSF3L "Integrator complex subunit 11" species:9031
            "Gallus gallus" [GO:0016787 "hydrolase activity" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 GO:GO:0016787 eggNOG:COG1236
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            HOGENOM:HOG000231294 KO:K13148 GO:GO:0032039 GO:GO:0016180
            HOVERGEN:HBG080215 OrthoDB:EOG4GXFMB EMBL:AJ720814 IPI:IPI00571913
            RefSeq:NP_001012854.1 UniGene:Gga.13403 ProteinModelPortal:Q5ZIH0
            STRING:Q5ZIH0 GeneID:419418 KEGG:gga:419418 CTD:54973
            InParanoid:Q5ZIH0 NextBio:20822477 Uniprot:Q5ZIH0
        Length = 600

 Score = 886 (316.9 bits), Expect = 9.6e-89, P = 9.6e-89
 Identities = 201/501 (40%), Positives = 295/501 (58%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLISHF 60
             GAGQ+VGRSCI++    K++M+DCG+H G +     P F  + ++    D +D ++ISHF
Sbjct:    10 GAGQDVGRSCILVSIAGKNVMLDCGMHMGYNDDRRFPDFSYITQNGRLTDFLDCVIISHF 69

Query:    61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
             HLDHCGALP+F    G+ G  +MTH TKAI   LL DY K++ +   E   +T   ++  
Sbjct:    70 HLDHCGALPYFSEMVGYDGPIYMTHPTKAICPILLEDYRKITVDKKGETNFFTSQMIKDC 129

Query:   120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
             M K+  ++ H+   V+  ++  AY AGHVLGAAMF I++    ++YTGD++   DRHL A
Sbjct:   130 MKKVVAVHLHQTVQVDEELEIKAYYAGHVLGAAMFQIKVGCESVVYTGDYNMTPDRHLGA 189

Query:   179 AEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELL 238
             A I   +PD+LITESTY T + + +  RE  F   +H+ V RGG+ LIPVFALGRAQEL 
Sbjct:   190 AWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELC 249

Query:   239 LILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISN 298
             ++L+ +W     L   PIY+++ L +K    Y+ +I   N +IR+     N F FKHI  
Sbjct:   250 ILLETFWE-RMNLK-APIYFSTGLTEKANHYYKLFITWTNQKIRKTFVQRNMFEFKHI-- 305

Query:   299 LKGIDH-FEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTIL 356
              K  D  F D  GP VV A+PGM+ +G S ++F  W  + KN VI+ GYCV+GT+   IL
Sbjct:   306 -KAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVGHKIL 364

Query:   357 SEPEEVIGMSGQR-LPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRL 415
             S   + + M G++ L +KM V+Y+SFSAH D +   + +R+  P +V+LVHGE  +M  L
Sbjct:   365 SGQRK-LEMEGRQILEVKMQVEYMSFSAHADAKGIMQLIRQAEPRNVLLVHGEAKKMEFL 423

Query:   416 KAALTREYEDD---P---NTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAAL 469
             K  + +E+  +   P    T+    NP   V + L     +TA  +G L  +  KP    
Sbjct:   424 KQKIEQEFHVNCYMPANGETTTIFTNPSIPVDISLGLLKRETA--IG-LLPDAKKPKLMH 480

Query:   470 SGIIVKRNFNYHLLAPSDLPK 490
               +I+K N ++ L++P    K
Sbjct:   481 GTLIMKDN-SFRLVSPEQALK 500


>MGI|MGI:1919207
            symbol:Cpsf3l "cleavage and polyadenylation specific factor
            3-like" species:10090 "Mus musculus" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0008150
            "biological_process" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 MGI:MGI:1919207 GO:GO:0005737 GO:GO:0016787
            eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 HOGENOM:HOG000231294 KO:K13148 OMA:MAVEYMS
            GeneTree:ENSGT00700000104485 GO:GO:0032039 GO:GO:0016180
            HOVERGEN:HBG080215 OrthoDB:EOG4GXFMB CTD:54973 EMBL:AK010425
            EMBL:AK090206 EMBL:AK150436 EMBL:AK152740 EMBL:AK167607
            EMBL:AK172533 EMBL:BC008240 EMBL:BC011155 IPI:IPI00467084
            RefSeq:NP_082296.1 UniGene:Mm.259270 UniGene:Mm.475640
            ProteinModelPortal:Q9CWS4 SMR:Q9CWS4 STRING:Q9CWS4
            PhosphoSite:Q9CWS4 PaxDb:Q9CWS4 PRIDE:Q9CWS4
            Ensembl:ENSMUST00000030901 GeneID:71957 KEGG:mmu:71957
            InParanoid:Q9CWS4 NextBio:335052 Bgee:Q9CWS4 Genevestigator:Q9CWS4
            GermOnline:ENSMUSG00000029034 Uniprot:Q9CWS4
        Length = 600

 Score = 875 (313.1 bits), Expect = 1.4e-87, P = 1.4e-87
 Identities = 196/493 (39%), Positives = 289/493 (58%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLISHF 60
             GAGQ+VGRSCI++    K++M+DCG+H G +     P F  + +S    D +D ++ISHF
Sbjct:    10 GAGQDVGRSCILVSISGKNVMLDCGMHMGYNDDRRFPDFSYITQSGRLTDFLDCVIISHF 69

Query:    61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
             HLDHCGALP+F    G+ G  +MTH T+AI   LL DY K++ +   E   +T   ++  
Sbjct:    70 HLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDC 129

Query:   120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
             M K+  ++ H+   V+  ++  AY AGHVLGAAMF I++    ++YTGD++   DRHL A
Sbjct:   130 MKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGA 189

Query:   179 AEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELL 238
             A I   +P++LITESTY T + + +  RE  F   +H+ V RGG+ LIPVFALGRAQEL 
Sbjct:   190 AWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELC 249

Query:   239 LILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISN 298
             ++L+ +W     L  +PIY+++ L +K    Y+ +I   N +IR+     N F FKHI  
Sbjct:   250 ILLETFWE-RMNLK-VPIYFSTGLTEKANHYYKLFITWTNQKIRKTFVQRNMFEFKHI-- 305

Query:   299 LKGIDH-FEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTIL 356
              K  D  F D  GP VV A+PGM+ +G S ++F  W  + KN VI+ GYCV+GT+   IL
Sbjct:   306 -KAFDRTFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVGHKIL 364

Query:   357 SEPEEVIGMSG-QRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRL 415
             S   + + M G Q L +KM V+Y+SFSAH D +   + V +  P  V+LVHGE  +M  L
Sbjct:   365 SGQRK-LEMEGRQMLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKMEFL 423

Query:   416 KAALTREYEDD---PNTSMELYNPRN-TVSVDLYFKGEKTAKVMGELAVENLKPDAALSG 471
             +  + +E+      P     +  P + ++ V +     K   V G L  E  KP   L G
Sbjct:   424 RQKIEQEFRVSCYMPANGETVTLPTSPSIPVGISLGLLKREMVQG-LLPEAKKP-RLLHG 481

Query:   472 IIVKRNFNYHLLA 484
              ++ ++ N+ L++
Sbjct:   482 TLIMKDSNFRLVS 494


>UNIPROTKB|E1B7Q9
            symbol:CPSF3L "Integrator complex subunit 11" species:9913
            "Bos taurus" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 OMA:MAVEYMS GeneTree:ENSGT00700000104485
            EMBL:DAAA02043243 IPI:IPI00971575 Ensembl:ENSBTAT00000010020
            Uniprot:E1B7Q9
        Length = 598

 Score = 874 (312.7 bits), Expect = 1.8e-87, P = 1.8e-87
 Identities = 198/497 (39%), Positives = 289/497 (58%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLISHF 60
             GAGQ+VGRSCI++    K++M+DCG+H G S     P F  +  S    D +D ++ISHF
Sbjct:    10 GAGQDVGRSCILVSIAGKNVMLDCGMHMGFSDDRRFPDFSYITRSGRLTDFLDCVIISHF 69

Query:    61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSM 120
             HLDHCGALP+F    G+ G  +MT  T+AI   LL DY K++    E   +T   ++  M
Sbjct:    70 HLDHCGALPYFSEMVGYDGPIYMTQPTQAICPILLEDYRKIAVDKGEANFFTSQMIKDCM 129

Query:   121 DKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAA 179
              K+  ++ H+   V+  ++  AY AGHVLGAAMF I++    ++YTGD++   DRHL AA
Sbjct:   130 KKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAA 189

Query:   180 EIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLL 239
              I   +P +LITESTY T + + +  RE  F   +H+ V RGG+ LIPVFALGRAQEL +
Sbjct:   190 WIDKCRPSLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCI 249

Query:   240 ILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNL 299
             +L+ +W    +L   PIY+++ L +K    Y+ +I   N +IR+     N F FKHI   
Sbjct:   250 LLETFWE-RMDLK-APIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMFEFKHI--- 304

Query:   300 KGIDH-FEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILS 357
             K  D  F D  GP VV A+PGM+ +G S ++F  W  + KN VI+ GYCV+GT+   ILS
Sbjct:   305 KAFDRAFADSPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVGHKILS 364

Query:   358 EPEEVIGMSGQR-LPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLK 416
                + + M G++ L +KM V+Y+SFSAH D +   + V +  P +V+LVHGE  +M  LK
Sbjct:   365 GQRK-LEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPENVLLVHGEAKKMEFLK 423

Query:   417 AALTREYEDDPNTSMELYNPRNTVSVDLYFK-----GEKTAKVMGELAVENLKPDAA--- 468
               + +E+       +  Y P N  +V L        G     +  E+A + L PDA    
Sbjct:   424 QKIEQEFR------VNCYMPANGETVTLPTSPSIPVGISLGLLKREMA-QGLLPDAKKPR 476

Query:   469 -LSGIIVKRNFNYHLLA 484
              L G ++ ++ N+ L++
Sbjct:   477 LLHGTLIMKDSNFRLVS 493


>RGD|1306841
            symbol:Cpsf3l "cleavage and polyadenylation specific factor
            3-like" species:10116 "Rattus norvegicus" [GO:0005634 "nucleus"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 RGD:1306841 GO:GO:0005634 GO:GO:0005737
            GO:GO:0016787 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
            Pfam:PF10996 SMART:SM01027 HOGENOM:HOG000231294 KO:K13148
            OMA:MAVEYMS GeneTree:ENSGT00700000104485 HOVERGEN:HBG080215
            OrthoDB:EOG4GXFMB CTD:54973 EMBL:BC105303 IPI:IPI00365477
            RefSeq:NP_001029064.1 UniGene:Rn.98615 ProteinModelPortal:Q3MHC2
            STRING:Q3MHC2 Ensembl:ENSRNOT00000026725 GeneID:298688
            KEGG:rno:298688 InParanoid:Q3MHC2 NextBio:644186
            Genevestigator:Q3MHC2 GermOnline:ENSRNOG00000019712 Uniprot:Q3MHC2
        Length = 600

 Score = 874 (312.7 bits), Expect = 1.8e-87, P = 1.8e-87
 Identities = 196/493 (39%), Positives = 289/493 (58%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLISHF 60
             GAGQ+VGRSCI++    K++M+DCG+H G +     P F  + +S    D +D ++ISHF
Sbjct:    10 GAGQDVGRSCILVSISGKNVMLDCGMHMGYNDDRRFPDFSYITQSGRLTDFLDCVIISHF 69

Query:    61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
             HLDHCGALP+F    G+ G  +MTH T+AI   LL DY K++ +   E   +T   ++  
Sbjct:    70 HLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDC 129

Query:   120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
             M K+  ++ H+   V+  ++  AY AGHVLGAAMF I++    ++YTGD++   DRHL A
Sbjct:   130 MKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGA 189

Query:   179 AEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELL 238
             A I   +P++LITESTY T + + +  RE  F   +H+ V RGG+ LIPVFALGRAQEL 
Sbjct:   190 AWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELC 249

Query:   239 LILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISN 298
             ++L+ +W     L  +PIY+++ L +K    Y+ +I   N +IR+     N F FKHI  
Sbjct:   250 ILLETFWE-RMNLK-VPIYFSTGLTEKANHYYKLFITWTNQKIRKTFVQRNMFEFKHI-- 305

Query:   299 LKGIDH-FEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTIL 356
              K  D  F D  GP VV A+PGM+ +G S ++F  W  + KN VI+ GYCV+GT+   IL
Sbjct:   306 -KAFDRTFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVGHKIL 364

Query:   357 SEPEEVIGMSG-QRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRL 415
             S   + + M G Q L +KM V+Y+SFSAH D +   + V +  P  V+LVHGE  +M  L
Sbjct:   365 SGQRK-LEMEGRQMLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKMEFL 423

Query:   416 KAALTREYEDD---PNTSMELYNPRN-TVSVDLYFKGEKTAKVMGELAVENLKPDAALSG 471
             +  + +E+      P     +  P + ++ V +     K   V G L  E  KP   L G
Sbjct:   424 RQKIEQEFRVSCYMPANGETVTLPTSPSIPVGISLGLLKREMVQG-LLPEAKKP-RLLHG 481

Query:   472 IIVKRNFNYHLLA 484
              ++ ++ N+ L++
Sbjct:   482 TLIMKDNNFRLVS 494


>UNIPROTKB|E2QY53
            symbol:CPSF3L "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 KO:K13148 OMA:MAVEYMS GeneTree:ENSGT00700000104485
            EMBL:AAEX03003844 RefSeq:XP_003639102.1 Ensembl:ENSCAFT00000030626
            GeneID:100855777 KEGG:cfa:100855777 Uniprot:E2QY53
        Length = 600

 Score = 866 (309.9 bits), Expect = 1.3e-86, P = 1.3e-86
 Identities = 195/498 (39%), Positives = 288/498 (57%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVE-----SDQIDLLLISHF 60
             GAGQ+VGRSCI++    K++M+DCG+H G +     P    +      +D +D ++ISHF
Sbjct:    10 GAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITRNGRLTDFLDCVIISHF 69

Query:    61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
             HLDHCGALP+F    G+ G  +MTH T+AI   LL DY K++ +   E   +T   ++  
Sbjct:    70 HLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDC 129

Query:   120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
             M K+  ++ H+   V+  ++  AY AGHVLGAAMF I++    ++YTGD++   DRHL A
Sbjct:   130 MKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGA 189

Query:   179 AEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELL 238
             A I   +P++LITESTY T + + +  RE  F   +H+ V RGG+ LIPVFALGRAQEL 
Sbjct:   190 AWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHEAVERGGKVLIPVFALGRAQELC 249

Query:   239 LILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISN 298
             ++L+ +W     L   PIY+++ L +K    Y+ +I   N +IR+     N F FKHI  
Sbjct:   250 ILLETFWE-RMNLK-APIYFSTGLTEKANHYYKLFITWTNQKIRKTFVQRNMFEFKHI-- 305

Query:   299 LKGIDH-FEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTIL 356
              K  D  F D  GP VV A+PGM+ +G S ++F  W  + KN VI+ GYCV+GT+   IL
Sbjct:   306 -KAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVGHKIL 364

Query:   357 SEPEEVIGMSGQR-LPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRL 415
             S   + + M G++ L +KM V+Y+SFSAH D +   + V +  P  V+LVHGE  +M  L
Sbjct:   365 SGQRK-LEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKMEFL 423

Query:   416 KAALTREYEDDPNTSMELYNPRNTVSVDLYFK-----GEKTAKVMGELAVENLKPDAA-- 468
             K  + +E+       +  Y P N  +V L        G     +  E+A + L PD    
Sbjct:   424 KQKIEQEFR------VNCYMPANGETVTLPTSPSIPVGISLGLLKREMA-QGLLPDVKKP 476

Query:   469 --LSGIIVKRNFNYHLLA 484
               L G ++ ++ N+ L++
Sbjct:   477 RLLHGTLIMKDSNFRLVS 494


>UNIPROTKB|G3V1S5
            symbol:CPSF3L "Integrator complex subunit 11" species:9606
            "Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0016787 EMBL:AL139287 EMBL:CH471183 InterPro:IPR022712
            InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 KO:K13148 OMA:MAVEYMS
            CTD:54973 UniGene:Hs.6449 GeneID:54973 KEGG:hsa:54973
            HGNC:HGNC:26052 ChiTaRS:CPSF3L GenomeRNAi:54973
            RefSeq:NP_001243385.1 ProteinModelPortal:G3V1S5 SMR:G3V1S5
            Ensembl:ENST00000540437 ArrayExpress:G3V1S5 Bgee:G3V1S5
            Uniprot:G3V1S5
        Length = 606

 Score = 865 (309.6 bits), Expect = 1.6e-86, P = 1.6e-86
 Identities = 196/498 (39%), Positives = 291/498 (58%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLISHF 60
             GAGQ+VGRSCI++    K++M+DCG+H G +     P F  + ++    D +D ++ISHF
Sbjct:    16 GAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHF 75

Query:    61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
             HLDHCGALP+F    G+ G  +MTH T+AI   LL DY K++ +   E   +T   ++  
Sbjct:    76 HLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDC 135

Query:   120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
             M K+  ++ H+   V+  ++  AY AGHVLGAAMF I++    ++YTGD++   DRHL A
Sbjct:   136 MKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGA 195

Query:   179 AEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELL 238
             A I   +P++LITESTY T + + +  RE  F   +H+ V RGG+ LIPVFALGRAQEL 
Sbjct:   196 AWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELC 255

Query:   239 LILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISN 298
             ++L+ +W     L  +PIY+++ L +K    Y+ +I   N +IR+     N F FKHI  
Sbjct:   256 ILLETFWE-RMNLK-VPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMFEFKHI-- 311

Query:   299 LKGIDH-FEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTIL 356
              K  D  F D  GP VV A+PGM+ +G S ++F  W  + KN VI+ GYCV+GT+   IL
Sbjct:   312 -KAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVGHKIL 370

Query:   357 SEPEEVIGMSGQR-LPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRL 415
             S   + + M G++ L +KM V+Y+SFSAH D +   + V +  P  V+LVHGE  +M  L
Sbjct:   371 SGQRK-LEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKMEFL 429

Query:   416 KAALTREYEDDPNTSMELYNPRNTVSVDLYFK-----GEKTAKVMGELAVENLKPDAA-- 468
             K  + +E        +  Y P N  +V L        G     +  E+A + L P+A   
Sbjct:   430 KQKIEQELR------VNCYMPANGETVTLPTSPSIPVGISLGLLKREMA-QGLLPEAKKP 482

Query:   469 --LSGIIVKRNFNYHLLA 484
               L G ++ ++ N+ L++
Sbjct:   483 RLLHGTLIMKDSNFRLVS 500


>UNIPROTKB|Q5TA45
            symbol:CPSF3L "Integrator complex subunit 11" species:9606
            "Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0005737 "cytoplasm"
            evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 GO:GO:0005634 GO:GO:0005737 GO:GO:0016787
            EMBL:AL139287 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
            Pfam:PF10996 SMART:SM01027 KO:K13148 HOVERGEN:HBG080215
            OrthoDB:EOG4GXFMB CTD:54973 EMBL:AL136813 EMBL:AK000549
            EMBL:AK021939 EMBL:AK023356 EMBL:AK297350 EMBL:CR533557
            EMBL:BC000675 EMBL:BC007978 EMBL:BC013904 EMBL:BK005728
            EMBL:BK005673 IPI:IPI00063404 IPI:IPI00306882 IPI:IPI00514973
            RefSeq:NP_001243392.1 RefSeq:NP_060341.2 UniGene:Hs.6449
            ProteinModelPortal:Q5TA45 SMR:Q5TA45 IntAct:Q5TA45
            MINT:MINT-1482228 STRING:Q5TA45 PhosphoSite:Q5TA45 DMDM:118572557
            PaxDb:Q5TA45 PRIDE:Q5TA45 DNASU:54973 Ensembl:ENST00000419704
            Ensembl:ENST00000435064 Ensembl:ENST00000450926
            Ensembl:ENST00000545578 GeneID:54973 KEGG:hsa:54973 UCSC:uc001aee.1
            UCSC:uc001aeh.1 UCSC:uc009vjz.1 GeneCards:GC01M001236
            HGNC:HGNC:26052 HPA:HPA028379 HPA:HPA029025 MIM:611354
            neXtProt:NX_Q5TA45 PharmGKB:PA142672080 InParanoid:Q5TA45
            PhylomeDB:Q5TA45 ChiTaRS:CPSF3L GenomeRNAi:54973 NextBio:58222
            ArrayExpress:Q5TA45 Bgee:Q5TA45 Genevestigator:Q5TA45
            GermOnline:ENSG00000127054 Uniprot:Q5TA45
        Length = 600

 Score = 865 (309.6 bits), Expect = 1.6e-86, P = 1.6e-86
 Identities = 196/498 (39%), Positives = 291/498 (58%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLISHF 60
             GAGQ+VGRSCI++    K++M+DCG+H G +     P F  + ++    D +D ++ISHF
Sbjct:    10 GAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHF 69

Query:    61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
             HLDHCGALP+F    G+ G  +MTH T+AI   LL DY K++ +   E   +T   ++  
Sbjct:    70 HLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDC 129

Query:   120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
             M K+  ++ H+   V+  ++  AY AGHVLGAAMF I++    ++YTGD++   DRHL A
Sbjct:   130 MKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGA 189

Query:   179 AEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELL 238
             A I   +P++LITESTY T + + +  RE  F   +H+ V RGG+ LIPVFALGRAQEL 
Sbjct:   190 AWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELC 249

Query:   239 LILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISN 298
             ++L+ +W     L  +PIY+++ L +K    Y+ +I   N +IR+     N F FKHI  
Sbjct:   250 ILLETFWE-RMNLK-VPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMFEFKHI-- 305

Query:   299 LKGIDH-FEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTIL 356
              K  D  F D  GP VV A+PGM+ +G S ++F  W  + KN VI+ GYCV+GT+   IL
Sbjct:   306 -KAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVGHKIL 364

Query:   357 SEPEEVIGMSGQR-LPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRL 415
             S   + + M G++ L +KM V+Y+SFSAH D +   + V +  P  V+LVHGE  +M  L
Sbjct:   365 SGQRK-LEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKMEFL 423

Query:   416 KAALTREYEDDPNTSMELYNPRNTVSVDLYFK-----GEKTAKVMGELAVENLKPDAA-- 468
             K  + +E        +  Y P N  +V L        G     +  E+A + L P+A   
Sbjct:   424 KQKIEQELR------VNCYMPANGETVTLPTSPSIPVGISLGLLKREMA-QGLLPEAKKP 476

Query:   469 --LSGIIVKRNFNYHLLA 484
               L G ++ ++ N+ L++
Sbjct:   477 RLLHGTLIMKDSNFRLVS 494


>UNIPROTKB|Q2YDM2
            symbol:CPSF3L "Integrator complex subunit 11" species:9913
            "Bos taurus" [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 GO:GO:0005634 GO:GO:0005737 GO:GO:0016787
            eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 HOGENOM:HOG000231294 EMBL:BC110155 IPI:IPI00725178
            UniGene:Bt.4894 ProteinModelPortal:Q2YDM2 STRING:Q2YDM2
            PRIDE:Q2YDM2 HOVERGEN:HBG080215 InParanoid:Q2YDM2 OrthoDB:EOG4GXFMB
            Uniprot:Q2YDM2
        Length = 599

 Score = 864 (309.2 bits), Expect = 2.0e-86, P = 2.0e-86
 Identities = 198/498 (39%), Positives = 289/498 (58%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLISHF 60
             GAGQ+VGRSCI++    K++M+DCG+H G S     P F     S    D +D ++ISHF
Sbjct:    10 GAGQDVGRSCILVSIAGKNVMLDCGMHMGFSDDRRFPDFSYNTRSGRLTDFLDCVIISHF 69

Query:    61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
             HLDHCGALP+F    G+ G  +MT  T+AI   LL DY K++ +   E   +T   ++  
Sbjct:    70 HLDHCGALPYFSEMVGYDGPIYMTQPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDC 129

Query:   120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
             M K+  ++ H+   V+  ++  AY AGHVLGAAMF I++    ++YTGD++   DRHL A
Sbjct:   130 MKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGA 189

Query:   179 AEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELL 238
             A I   +P +LITESTY T + + +  RE  F   +H+ V RGG+ LIPVFALGRAQEL 
Sbjct:   190 AWIDKCRPSLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELC 249

Query:   239 LILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISN 298
             ++L+ +W    +L   PIY+++ L +K    Y+ +I   N +IR+     N F FKHI  
Sbjct:   250 ILLETFWE-RMDLK-APIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMFEFKHI-- 305

Query:   299 LKGIDH-FEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTIL 356
              K  D  F D  GP VV A+PGM+ +G S ++F  W  + KN VI+ GYCV+GT+   IL
Sbjct:   306 -KAFDRAFADSPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVGHKIL 364

Query:   357 SEPEEVIGMSGQR-LPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRL 415
             S   + + M G++ L +KM V+Y+SFSAH D +   + V +  P +V+LVHGE  +M  L
Sbjct:   365 SGQRK-LEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPENVLLVHGEAKKMEFL 423

Query:   416 KAALTREYEDDPNTSMELYNPRNTVSVDLYFK-----GEKTAKVMGELAVENLKPDAA-- 468
             K  + +E+       +  Y P N  +V L        G     +  E+A + L PDA   
Sbjct:   424 KQKIEQEFR------VNCYMPANGETVTLPTSPSIPVGISLGLLKREMA-QGLLPDAKKP 476

Query:   469 --LSGIIVKRNFNYHLLA 484
               L G ++ ++ N+ L++
Sbjct:   477 RLLHGTLIMKDSNFRLVS 494


>WB|WBGene00008642
            symbol:F10B5.8 species:6239 "Caenorhabditis elegans"
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0009792 "embryo
            development ending in birth or egg hatching" evidence=IMP]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0009792 EMBL:Z48334 GO:GO:0016787 eggNOG:COG1236
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            HOGENOM:HOG000231294 KO:K13148 OMA:MAVEYMS
            GeneTree:ENSGT00700000104485 PIR:T20694 RefSeq:NP_495706.2
            ProteinModelPortal:Q9U3K2 SMR:Q9U3K2 STRING:Q9U3K2 PaxDb:Q9U3K2
            EnsemblMetazoa:F10B5.8 GeneID:174310 KEGG:cel:CELE_F10B5.8
            UCSC:F10B5.8 CTD:174310 WormBase:F10B5.8 InParanoid:Q9U3K2
            NextBio:883468 Uniprot:Q9U3K2
        Length = 608

 Score = 845 (302.5 bits), Expect = 2.1e-86, Sum P(2) = 2.1e-86
 Identities = 173/428 (40%), Positives = 261/428 (60%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVE-----SDQIDLLLISHF 60
             GAGQ+VGRSCI++    K+IM+DCG+H G       P    +      +D +D ++ISHF
Sbjct:    14 GAGQDVGRSCILITIGGKNIMVDCGMHMGYQDDRRFPDFSYIGGGGRLTDYLDCVIISHF 73

Query:    61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
             HLDHCG+LP      G+ G  +MT+ TKAI   LL DY KV  +I  E   +T  D++  
Sbjct:    74 HLDHCGSLPHMSEIVGYDGPIYMTYPTKAICPVLLEDYRKVQCDIKGETNFFTSDDIKNC 133

Query:   120 MDKIETINFHEEKDV-NGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
             M K+     HE   V N +   A+ AGHVLGAAMF I +    +LYTGD++   DRHL A
Sbjct:   134 MKKVVGCALHEIIHVDNELSIRAFYAGHVLGAAMFEIRLGDHSVLYTGDYNMTPDRHLGA 193

Query:   179 AEI-PPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQEL 237
             A + P V+P +LI+ESTY T + + +  RE  F   +H+ V +GG+ +IPVFALGRAQEL
Sbjct:   194 ARVLPGVRPTVLISESTYATTIRDSKRARERDFLRKVHECVMKGGKVIIPVFALGRAQEL 253

Query:   238 LLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHIS 297
              ++L+ YW     L+ +PIY++  LA++    Y+ +I+  N+ I++     N F FKHI 
Sbjct:   254 CILLESYWE-RMALN-VPIYFSQGLAERANQYYRLFISWTNENIKKTFVERNMFEFKHIK 311

Query:   298 NL-KGIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTIL 356
              + KG +  +  GP V+ ++PGM+  G S ++F+ WC+D  N +I+ GYCV GT+   ++
Sbjct:   312 PMEKGCE--DQPGPQVLFSTPGMLHGGQSLKVFKKWCSDPLNMIIMPGYCVAGTVGARVI 369

Query:   357 SEPEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLK 416
             +  E+ I +  +   +++ V+Y+SFSAH D +   + +R+  P HV+ VHGE ++M  LK
Sbjct:   370 NG-EKKIEIDQKMHEIRLGVEYMSFSAHADAKGIMQLIRQCEPQHVMFVHGEASKMEFLK 428

Query:   417 AALTREYE 424
               + +EY+
Sbjct:   429 GKVEKEYK 436

 Score = 38 (18.4 bits), Expect = 2.1e-86, Sum P(2) = 2.1e-86
 Identities = 8/26 (30%), Positives = 17/26 (65%)

Query:   565 LISECLIEILVEMYGEAAVPKMFKGE 590
             LI +C  + ++ ++GEA+  +  KG+
Sbjct:   405 LIRQCEPQHVMFVHGEASKMEFLKGK 430


>UNIPROTKB|F1RJE8
            symbol:CPSF3L "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 KO:K13148 OMA:MAVEYMS GeneTree:ENSGT00700000104485
            EMBL:FP102596 RefSeq:XP_003127541.3 Ensembl:ENSSSCT00000003708
            GeneID:100523908 KEGG:ssc:100523908 Uniprot:F1RJE8
        Length = 599

 Score = 855 (306.0 bits), Expect = 1.8e-85, P = 1.8e-85
 Identities = 194/497 (39%), Positives = 285/497 (57%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVE-----SDQIDLLLISHF 60
             GAGQ+VGRSCI++    K++M+DCG+H G S     P    +      +D +D ++ISHF
Sbjct:    10 GAGQDVGRSCILVSIGGKNVMLDCGMHMGFSDDRRFPDFSYITRHGRLTDFLDCVIISHF 69

Query:    61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
             HLDHCGALP+F    G+ G  +MT  T+AI   LL DY K++ +   E   +T   ++  
Sbjct:    70 HLDHCGALPYFSEMVGYDGPIYMTQPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDC 129

Query:   120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
             M K   ++ H+   V+  ++  AY AGHVLGAAMF I++    ++YTGD++   DRHL A
Sbjct:   130 MKKAVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGA 189

Query:   179 AEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELL 238
             A I   +P++LITESTY T + + +  RE  F   +H+ V RGG+ LIPVFALGRAQEL 
Sbjct:   190 AWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELC 249

Query:   239 LILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISN 298
             ++L+ +W    +L   PIY+++ L +K    Y+ +I   N +IR+     N F FKHI  
Sbjct:   250 ILLETFWE-RMDLK-APIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMFEFKHI-- 305

Query:   299 LKGIDH-FEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTIL 356
              K  D  F D  GP VV A+PGM+ +G S ++F  W  + KN VI+ GYCV+GT+   IL
Sbjct:   306 -KAFDRAFADSPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVGHKIL 364

Query:   357 SEPEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLK 416
             S   ++     Q L +KM V+Y+SFSAH D +   + V +  P +V+LVHGE  +M  LK
Sbjct:   365 SGQRKLELEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPENVLLVHGEAKKMEFLK 424

Query:   417 AALTREYEDDPNTSMELYNPRNTVSVDLYFK-----GEKTAKVMGELAVENLKPDAA--- 468
               + +E+       +  Y P N  +V L        G     +  E+A + L PDA    
Sbjct:   425 QKIEQEFR------LSCYMPANGETVTLPTSPSIPVGISLGLLKREMA-QGLLPDAKKAR 477

Query:   469 -LSGIIVKRNFNYHLLA 484
              L G ++ ++  + L++
Sbjct:   478 LLHGTLIMKDSTFRLVS 494


>FB|FBgn0039691
            symbol:IntS11 "Integrator 11" species:7227 "Drosophila
            melanogaster" [GO:0006379 "mRNA cleavage" evidence=ISS] [GO:0005847
            "mRNA cleavage and polyadenylation specificity factor complex"
            evidence=ISS] [GO:0006378 "mRNA polyadenylation" evidence=ISS]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0034472 "snRNA
            3'-end processing" evidence=IDA] [GO:0016180 "snRNA processing"
            evidence=ISS] [GO:0032039 "integrator complex" evidence=ISS]
            [GO:0022008 "neurogenesis" evidence=IMP] InterPro:IPR001279
            SMART:SM00849 Pfam:PF07521 EMBL:AE014297 GO:GO:0022008
            GO:GO:0006378 GO:GO:0016787 GO:GO:0005847 GO:GO:0006379
            eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 KO:K13148 OMA:MAVEYMS GeneTree:ENSGT00700000104485
            GO:GO:0034472 EMBL:AY061097 RefSeq:NP_651721.1 UniGene:Dm.3722
            SMR:Q9VAH9 STRING:Q9VAH9 EnsemblMetazoa:FBtr0085476 GeneID:43506
            KEGG:dme:Dmel_CG1972 UCSC:CG1972-RA CTD:43506 FlyBase:FBgn0039691
            InParanoid:Q9VAH9 OrthoDB:EOG47D7X3 GenomeRNAi:43506 NextBio:834295
            Uniprot:Q9VAH9
        Length = 597

 Score = 843 (301.8 bits), Expect = 3.4e-84, P = 3.4e-84
 Identities = 201/564 (35%), Positives = 309/564 (54%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVE----SDQIDLLLISHF 60
             GAGQ+VGRSC++L    K+IM+DCG+H G +     P F  +V     +  ID ++ISHF
Sbjct:    10 GAGQDVGRSCLLLSMGGKNIMLDCGMHMGYNDERRFPDFSYIVPEGPITSHIDCVIISHF 69

Query:    61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
             HLDHCGALP+     G+ G  +MTH TKAI   LL D  KV+     E   +T   ++  
Sbjct:    70 HLDHCGALPYMSEIVGYTGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTTQMIKDC 129

Query:   120 MDKIETINFHEEKDVN-GIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
             M K+  +  H+   V+  ++  AY AGHVLGAAMF I++    ++YTGD++   DRHL A
Sbjct:   130 MKKVIPVTLHQSMMVDTDLEIKAYYAGHVLGAAMFWIKVGSQSVVYTGDYNMTPDRHLGA 189

Query:   179 AEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELL 238
             A I   +PD+LI+ESTY T + + +  RE  F   +H+ V +GG+ LIPVFALGRAQEL 
Sbjct:   190 AWIDKCRPDLLISESTYATTIRDSKRCRERDFLKKVHECVAKGGKVLIPVFALGRAQELC 249

Query:   239 LILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISN 298
             ++L+ YW     L   PIY+A  L +K  + Y+ +I   N +IR+     N F FKHI  
Sbjct:   250 ILLETYWE-RMNLK-YPIYFALGLTEKANTYYKMFITWTNQKIRKTFVHRNMFDFKHIKP 307

Query:   299 LKGIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSE 358
                  + ++ G  VV A+PGM+ +GLS ++F+ W  +  N VI+ GYCV+GT+   IL  
Sbjct:   308 FDKA-YIDNPGAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIMPGYCVQGTVGNKILGG 366

Query:   359 PEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAA 418
              ++V   + Q + +KM+V+Y+SFSAH D +   + ++   P +V+LVHGE  +M  L++ 
Sbjct:   367 AKKVEFENRQVVEVKMAVEYMSFSAHADAKGIMQLIQNCEPKNVMLVHGEAGKMKFLRSK 426

Query:   419 LTREYEDDPNTSMELYNPRN----TVSVDLYFKGEKTAKVM-GELAVENLKPD-----AA 468
             +  E+      ++E Y P N     +S  +    + +  ++  E    N +P        
Sbjct:   427 IKDEF------NLETYMPANGETCVISTPVKIPVDASVSLLKAEARSYNAQPPDPKRRRL 480

Query:   469 LSGIIVKRNFNYHLLAPSDLPKYTDLK------ASKIIQQQSVYYSGSISVLRSLISH-L 521
             + G++V ++    L   +D  K   +        SK+    S     +   L++L+   L
Sbjct:   481 IHGVLVMKDNRIMLQNLTDALKEIGINRHVMRFTSKVKMDDSGPVIRTSERLKTLLEEKL 540

Query:   522 AGPVETLDEKRLRAFACIEITLEK 545
             AG   T+ E    A   +E+ +E+
Sbjct:   541 AGWTVTMQENGSIAIESVEVKVEE 564


>DICTYBASE|DDB_G0278189
            symbol:ints11 "integrator complex subunit 11"
            species:44689 "Dictyostelium discoideum" [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0044351 "macropinocytosis"
            evidence=RCA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            dictyBase:DDB_G0278189 Pfam:PF07521 GO:GO:0005634 GO:GO:0005737
            GenomeReviews:CM000152_GR EMBL:AAFI02000023 GO:GO:0016787
            eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 KO:K13148 RefSeq:XP_642189.1
            ProteinModelPortal:Q54YL3 PRIDE:Q54YL3 EnsemblProtists:DDB0234100
            GeneID:8621396 KEGG:ddi:DDB_G0278189 OMA:RTIANET
            ProtClustDB:CLSZ2729107 Uniprot:Q54YL3
        Length = 744

 Score = 810 (290.2 bits), Expect = 1.1e-80, P = 1.1e-80
 Identities = 177/456 (38%), Positives = 265/456 (58%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVESDQ----IDLLLISHF 60
             GAGQ+VGRSC+++   NK+IM DCG+H G++     P F  + ++ Q    ID ++I+HF
Sbjct:     9 GAGQDVGRSCVIVTIGNKNIMFDCGMHMGMNDARRFPDFSYISKNGQFTKVIDCVIITHF 68

Query:    61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
             HLDHCGALP+F    G+ G  +MT  TKAI   LL DY K++     E   +T   ++  
Sbjct:    69 HLDHCGALPFFTEMCGYDGPIYMTLPTKAICPILLEDYRKITVEKKGETNFFTAQMIKDC 128

Query:   120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
             M K+  +N H+   V+  +   AY AGHVLGAAMF  ++    ++YTGD++   DRHL +
Sbjct:   129 MKKVIPVNLHQTIKVDEELSIKAYYAGHVLGAAMFYAKVGDESVVYTGDYNMTPDRHLGS 188

Query:   179 AEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELL 238
             A I  VKPD+LITE+TY T + + +  RE  F   IH+ V +GG+ LIPVFALGR QEL 
Sbjct:   189 AWIDQVKPDVLITETTYATTIRDSKRGRERDFLKRIHECVEKGGKVLIPVFALGRVQELC 248

Query:   239 LILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISN 298
             +++D YW     L  IPIY+++ LA+K    Y+ +IN  N +I++     N F FKHI  
Sbjct:   249 ILIDSYWE-QMNLGHIPIYFSAGLAEKANLYYKLFINWTNQKIKQTFVKRNMFDFKHIKP 307

Query:   299 LKGIDHFEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTIL- 356
              +   H  D  G  V+ A+PGM+ +G S E+F+ W  +  N  II GYCV GT+   +L 
Sbjct:   308 FQS--HLVDAPGAMVLFATPGMLHAGASLEVFKKWAPNELNMTIIPGYCVVGTVGNKLLT 365

Query:   357 --------SEPE-EVIGMSGQR-LPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVH 406
                     S+P+ +++ +  +  + +K  +  +SFSAH D +   + ++   P +V+LVH
Sbjct:   366 TGSDQQQQSKPQSQMVEIDKKTTIEVKCKIHNLSFSAHADAKGILQLIKMSNPRNVILVH 425

Query:   407 GEQNEMSRLKAALTREYEDDPNTSMELYNPRNTVSV 442
             GE+ +M  L   + +E        +  Y P N V++
Sbjct:   426 GEKEKMGFLSQKIIKEM------GVNCYYPANGVTI 455


>ZFIN|ZDB-GENE-050522-13
            symbol:cpsf3l "cleavage and polyadenylation specific
            factor 3-like" species:7955 "Danio rerio" [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0043484 "regulation of RNA splicing"
            evidence=IMP] [GO:0005634 "nucleus" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            ZFIN:ZDB-GENE-050522-13 GO:GO:0016787 GO:GO:0043484
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            OMA:MAVEYMS GeneTree:ENSGT00700000104485 EMBL:CABZ01054885
            EMBL:CR846089 IPI:IPI00865509 Ensembl:ENSDART00000102902
            Uniprot:E7EXW1
        Length = 601

 Score = 791 (283.5 bits), Expect = 1.1e-78, P = 1.1e-78
 Identities = 188/514 (36%), Positives = 289/514 (56%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVESDQI----DLLLISHF 60
             GAGQ+VGRSCI++    K+IM+DCG+H G +     P F  + ++ ++    D ++ISHF
Sbjct:    10 GAGQDVGRSCILVSIGGKNIMLDCGMHMGFNDDRRFPDFSYITQNGRLTEFLDCVIISHF 69

Query:    61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
             HLDHCGALP+     G+ G  +MTH TKAI   LL D+ K++ +   E   +T   ++  
Sbjct:    70 HLDHCGALPYMSEMVGYDGPIYMTHPTKAICPILLEDFRKITVDKKGETNFFTSQMIKDC 129

Query:   120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAM----FLIEIAGVKILYTGDFSRQEDR 174
             M K+  +N H+   V+  ++  AY AGHVLGAAM    F + +  V + YT         
Sbjct:   130 MKKVVPLNLHQTVQVDDELEIKAYYAGHVLGAAMVQSRFRV-VYTVSVSYTYSNLMTPAS 188

Query:   175 HLMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRA 234
              L AA I   +PDILI+ESTY T + + +  RE  F   +H+ V RGG+ LIPVFALGRA
Sbjct:   189 DLRAAWIDKCRPDILISESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRA 248

Query:   235 QELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFK 294
             QEL ++L+ +W     L   PIY+++ L +K    Y+ +I   N +IR+     N F FK
Sbjct:   249 QELCILLETFWE-RMNLK-APIYFSTGLTEKANHYYKLFITWTNQKIRKTFVQRNMFEFK 306

Query:   295 HISNLKGID--HFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLA 352
             HI   K  D  + ++ GP VV A+PGM+ +G S ++F+ W  + KN VI+ GYCV+GT+ 
Sbjct:   307 HI---KAFDRSYADNPGPMVVFATPGMLHAGQSLQIFKKWAGNEKNMVIMPGYCVQGTVG 363

Query:   353 KTILSEPEEVIGMSGQR-LPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNE 411
               IL+  ++ + M G+  L +K+ V+Y+SFSAH D +   + +R   P +++LVHGE  +
Sbjct:   364 HKILNGQKK-LEMEGRATLDVKLQVEYMSFSAHADAKGIMQLIRMAEPRNMLLVHGEAKK 422

Query:   412 MSRLKAALTREYEDD---P---NTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKP 465
             M  LK  + +E+      P    T+  + NP  +V VD+     K    +G    +  KP
Sbjct:   423 MEFLKDKIEQEFSISCFMPANGETTTIVTNP--SVPVDISLNLLKREMALGGPLPDAKKP 480

Query:   466 DAALSGIIVKRNFNYHLLAPSDLPKYTDLKASKI 499
                + G ++ ++ +  L++P    K   L   ++
Sbjct:   481 -RTMHGTLIMKDNSLRLVSPEQALKELGLNEHQL 513


>TAIR|locus:2065368
            symbol:CPSF73-II "AT2G01730" species:3702 "Arabidopsis
            thaliana" [GO:0003824 "catalytic activity" evidence=ISS]
            [GO:0008152 "metabolic process" evidence=ISS] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0010197 "polar nucleus
            fusion" evidence=IMP] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005634 "nucleus" evidence=IDA] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 GO:GO:0005634 EMBL:CP002685
            GenomeReviews:CT485783_GR GO:GO:0006397 GO:GO:0090305 EMBL:AC006069
            GO:GO:0004518 GO:GO:0010197 eggNOG:COG1236 InterPro:IPR022712
            InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 EMBL:AY168923
            EMBL:AK221561 IPI:IPI00536069 PIR:D84428 RefSeq:NP_178282.2
            UniGene:At.42473 ProteinModelPortal:Q8GUU3 SMR:Q8GUU3 IntAct:Q8GUU3
            STRING:Q8GUU3 PaxDb:Q8GUU3 PRIDE:Q8GUU3 EnsemblPlants:AT2G01730.1
            GeneID:814702 KEGG:ath:AT2G01730 TAIR:At2g01730
            HOGENOM:HOG000231294 InParanoid:Q56XW2 KO:K13148 OMA:MAVEYMS
            Genevestigator:Q8GUU3 Uniprot:Q8GUU3
        Length = 613

 Score = 743 (266.6 bits), Expect = 1.4e-73, P = 1.4e-73
 Identities = 157/428 (36%), Positives = 248/428 (57%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVES-----DQIDLLLISHF 60
             GAGQE+G+SC+++    K IM DCG+H G    +  P   L+       + I  ++I+HF
Sbjct:     9 GAGQEIGKSCVVVTINGKKIMFDCGMHMGCDDHNRYPNFSLISKSGDFDNAISCIIITHF 68

Query:    61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
             H+DH GALP+F    G+ G  +M++ TKA+   +L DY +V  +   E+ L+T + +   
Sbjct:    69 HMDHVGALPYFTEVCGYNGPIYMSYPTKALSPLMLEDYRRVMVDRRGEEELFTTTHIANC 128

Query:   120 MDKIETINFHEEKDVN-GIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
             M K+  I+  +   V+  ++  AY AGHVLGA M   ++    I+YTGD++   DRHL A
Sbjct:   129 MKKVIAIDLKQTIQVDEDLQIRAYYAGHVLGAVMVYAKMGDAAIVYTGDYNMTTDRHLGA 188

Query:   179 AEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELL 238
             A+I  ++ D+LI+ESTY T +   +  RE  F   +H  V  GG+ LIP FALGRAQEL 
Sbjct:   189 AKIDRLQLDLLISESTYATTIRGSKYPREREFLQAVHKCVAGGGKALIPSFALGRAQELC 248

Query:   239 LILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISN 298
             ++LD+YW     +  +PIY++S L  +    Y+  I+  +  ++ + + +NPF FK++ +
Sbjct:   249 MLLDDYWE-RMNIK-VPIYFSSGLTIQANMYYKMLISWTSQNVKEKHNTHNPFDFKNVKD 306

Query:   299 L-KGIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLA-KTIL 356
               + + H    GPCV+ A+PGM+ +G S E+F+ W     N V + GY V GT+  K + 
Sbjct:   307 FDRSLIHAP--GPCVLFATPGMLCAGFSLEVFKHWAPSPLNLVALPGYSVAGTVGHKLMA 364

Query:   357 SEPEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLK 416
              +P  V   +G ++ ++  V  ++FS HTD +   +  + L P +VVLVHGE+  M  LK
Sbjct:   365 GKPTTVDLYNGTKVDVRCKVHQVAFSPHTDAKGIMDLTKFLSPKNVVLVHGEKPSMMILK 424

Query:   417 AALTREYE 424
               +T E +
Sbjct:   425 EKITSELD 432


>GENEDB_PFALCIPARUM|PFC0825c
            symbol:PFC0825c "cleavage and polyadenylation
            specificity factor protein, putative" species:5833 "Plasmodium
            falciparum" [GO:0006379 "mRNA cleavage" evidence=ISS] [GO:0006378
            "mRNA polyadenylation" evidence=ISS] [GO:0003729 "mRNA binding"
            evidence=ISS] [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=ISS] InterPro:IPR001279
            SMART:SM00849 Pfam:PF07521 GO:GO:0003729 GO:GO:0016787
            EMBL:AL844502 GO:GO:0005847 InterPro:IPR022712 InterPro:IPR011108
            Pfam:PF10996 SMART:SM01027 KO:K13148 PIR:T18488
            RefSeq:XP_001351256.1 ProteinModelPortal:O77371 PRIDE:O77371
            EnsemblProtists:PFC0825c:mRNA GeneID:814500 KEGG:pfa:PFC0825c
            EuPathDB:PlasmoDB:PF3D7_0318600 HOGENOM:HOG000283200
            ProtClustDB:CLSZ2433497 Uniprot:O77371
        Length = 1017

 Score = 542 (195.9 bits), Expect = 4.0e-62, Sum P(3) = 4.0e-62
 Identities = 110/331 (33%), Positives = 192/331 (58%)

Query:    95 LSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFL 154
             L +Y  ++ I  +     E ++   +DK+  +  +E  ++  +  + Y AGHVLGA ++ 
Sbjct:   243 LLNY-NINCIKKDPWNINEDNIYNCIDKVIGLQINETFELGDMSITPYYAGHVLGACIYK 301

Query:   155 IEIAGVKILYTGDFSRQEDRHLMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLI 214
             IE+    ++YTGD++   D+HL +A IP + P+I I+ESTY T+V   ++  E    +L+
Sbjct:   302 IEVRNFSVIYTGDYNTIPDKHLGSANIPSLNPEIFISESTYATYVRPTKKASELELCNLV 361

Query:   215 HDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYI 274
             H+ V++GG+ LIPVFA+GRAQEL ++LD+YW    ++H  PIY+   L +     Y+ Y 
Sbjct:   362 HECVHKGGKVLIPVFAIGRAQELSILLDDYWK-KMKIH-YPIYFGCGLTENANKYYKIYS 419

Query:   275 NAMNDRIRRQISINNPFVFKHISNLKGIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCT 334
             + +N          N F F +IS     ++  +  P V+ A+PGM+ +GLS + F+ W  
Sbjct:   420 SWINSSCMSNEK-ENLFDFANISPFLN-NYLNEKRPMVLFATPGMLHTGLSLKAFKAWAG 477

Query:   335 DAKNGVIIAGYCVEGTLA-KTILSEPEEVIGMSGQR-LPLKMSVDYISFSAHTDYQQTSE 392
             + +N +++ GYCV+GT+  K I+ E +  I + G   + +   + Y+SFSAH D     +
Sbjct:   478 NPQNLIVLPGYCVQGTVGHKLIMGEKQ--ISLDGTTYIKVLCKIIYLSFSAHADSNGIQQ 535

Query:   393 FVRELRPAHVVLVHGEQNEMSRLKAALTREY 423
              ++ + P +V+ VHGE+N M +L   ++ ++
Sbjct:   536 LIKHVSPKNVIFVHGEKNGMQKLAKYISNKH 566

 Score = 128 (50.1 bits), Expect = 4.5e-10, Sum P(3) = 4.5e-10
 Identities = 32/93 (34%), Positives = 55/93 (59%)

Query:    52 IDLLLISHFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLY 111
             ID ++ISHFH+DH GALP+F     ++G   M++ TKA+   LL D  +V+++  E+  +
Sbjct:   170 IDCVIISHFHMDHIGALPFFTEILKYRGIILMSYPTKALSPILLLDSCRVTDMKWEKKNF 229

Query:   112 TESDLEKSMDKI-ETINFHEEKDVNGIKFSAYN 143
              E  ++   +K  E +N++    +N IK   +N
Sbjct:   230 -ERQIKMLNEKSDELLNYN----INCIKKDPWN 257

 Score = 107 (42.7 bits), Expect = 4.0e-62, Sum P(3) = 4.0e-62
 Identities = 19/42 (45%), Positives = 27/42 (64%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLV 47
             GAGQ VGRSC+++E +N+ +M DCG H G       P  +L+
Sbjct:    15 GAGQTVGRSCVIVELENRKVMFDCGCHLGYKDERKYPNFNLL 56

 Score = 42 (19.8 bits), Expect = 4.0e-62, Sum P(3) = 4.0e-62
 Identities = 14/39 (35%), Positives = 21/39 (53%)

Query:   586 MFKGEKITI--TVDKKKACIDLVDLSVQCEDSKLKSTVQ 622
             ++K EKI+     DKKK  ID   L V+ +  + K  +Q
Sbjct:   702 LYKNEKISNYHKKDKKKKAIDEHKLKVRNKLIQKKINIQ 740


>UNIPROTKB|O77371
            symbol:PFC0825c "Cleavage and polyadenylation specificity
            factor protein, putative" species:36329 "Plasmodium falciparum 3D7"
            [GO:0003729 "mRNA binding" evidence=ISS] [GO:0005847 "mRNA cleavage
            and polyadenylation specificity factor complex" evidence=ISS]
            InterPro:IPR001279 SMART:SM00849 Pfam:PF07521 GO:GO:0003729
            GO:GO:0016787 EMBL:AL844502 GO:GO:0005847 InterPro:IPR022712
            InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 KO:K13148 PIR:T18488
            RefSeq:XP_001351256.1 ProteinModelPortal:O77371 PRIDE:O77371
            EnsemblProtists:PFC0825c:mRNA GeneID:814500 KEGG:pfa:PFC0825c
            EuPathDB:PlasmoDB:PF3D7_0318600 HOGENOM:HOG000283200
            ProtClustDB:CLSZ2433497 Uniprot:O77371
        Length = 1017

 Score = 542 (195.9 bits), Expect = 4.0e-62, Sum P(3) = 4.0e-62
 Identities = 110/331 (33%), Positives = 192/331 (58%)

Query:    95 LSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFL 154
             L +Y  ++ I  +     E ++   +DK+  +  +E  ++  +  + Y AGHVLGA ++ 
Sbjct:   243 LLNY-NINCIKKDPWNINEDNIYNCIDKVIGLQINETFELGDMSITPYYAGHVLGACIYK 301

Query:   155 IEIAGVKILYTGDFSRQEDRHLMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLI 214
             IE+    ++YTGD++   D+HL +A IP + P+I I+ESTY T+V   ++  E    +L+
Sbjct:   302 IEVRNFSVIYTGDYNTIPDKHLGSANIPSLNPEIFISESTYATYVRPTKKASELELCNLV 361

Query:   215 HDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYI 274
             H+ V++GG+ LIPVFA+GRAQEL ++LD+YW    ++H  PIY+   L +     Y+ Y 
Sbjct:   362 HECVHKGGKVLIPVFAIGRAQELSILLDDYWK-KMKIH-YPIYFGCGLTENANKYYKIYS 419

Query:   275 NAMNDRIRRQISINNPFVFKHISNLKGIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCT 334
             + +N          N F F +IS     ++  +  P V+ A+PGM+ +GLS + F+ W  
Sbjct:   420 SWINSSCMSNEK-ENLFDFANISPFLN-NYLNEKRPMVLFATPGMLHTGLSLKAFKAWAG 477

Query:   335 DAKNGVIIAGYCVEGTLA-KTILSEPEEVIGMSGQR-LPLKMSVDYISFSAHTDYQQTSE 392
             + +N +++ GYCV+GT+  K I+ E +  I + G   + +   + Y+SFSAH D     +
Sbjct:   478 NPQNLIVLPGYCVQGTVGHKLIMGEKQ--ISLDGTTYIKVLCKIIYLSFSAHADSNGIQQ 535

Query:   393 FVRELRPAHVVLVHGEQNEMSRLKAALTREY 423
              ++ + P +V+ VHGE+N M +L   ++ ++
Sbjct:   536 LIKHVSPKNVIFVHGEKNGMQKLAKYISNKH 566

 Score = 128 (50.1 bits), Expect = 4.5e-10, Sum P(3) = 4.5e-10
 Identities = 32/93 (34%), Positives = 55/93 (59%)

Query:    52 IDLLLISHFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLY 111
             ID ++ISHFH+DH GALP+F     ++G   M++ TKA+   LL D  +V+++  E+  +
Sbjct:   170 IDCVIISHFHMDHIGALPFFTEILKYRGIILMSYPTKALSPILLLDSCRVTDMKWEKKNF 229

Query:   112 TESDLEKSMDKI-ETINFHEEKDVNGIKFSAYN 143
              E  ++   +K  E +N++    +N IK   +N
Sbjct:   230 -ERQIKMLNEKSDELLNYN----INCIKKDPWN 257

 Score = 107 (42.7 bits), Expect = 4.0e-62, Sum P(3) = 4.0e-62
 Identities = 19/42 (45%), Positives = 27/42 (64%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLV 47
             GAGQ VGRSC+++E +N+ +M DCG H G       P  +L+
Sbjct:    15 GAGQTVGRSCVIVELENRKVMFDCGCHLGYKDERKYPNFNLL 56

 Score = 42 (19.8 bits), Expect = 4.0e-62, Sum P(3) = 4.0e-62
 Identities = 14/39 (35%), Positives = 21/39 (53%)

Query:   586 MFKGEKITI--TVDKKKACIDLVDLSVQCEDSKLKSTVQ 622
             ++K EKI+     DKKK  ID   L V+ +  + K  +Q
Sbjct:   702 LYKNEKISNYHKKDKKKKAIDEHKLKVRNKLIQKKINIQ 740


>UNIPROTKB|C9JZH6
            symbol:CPSF3 "Cleavage and polyadenylation-specificity
            factor subunit 3" species:9606 "Homo sapiens" [GO:0003723 "RNA
            binding" evidence=IEA] [GO:0004521 "endoribonuclease activity"
            evidence=IEA] [GO:0008409 "5'-3' exonuclease activity"
            evidence=IEA] InterPro:IPR001279 Pfam:PF00753 GO:GO:0003723
            GO:GO:0004521 GO:GO:0008409 EMBL:AC080162 HGNC:HGNC:2326
            ChiTaRS:CPSF3 IPI:IPI00807384 ProteinModelPortal:C9JZH6 SMR:C9JZH6
            STRING:C9JZH6 Ensembl:ENST00000475482 HOGENOM:HOG000191757
            ArrayExpress:C9JZH6 Bgee:C9JZH6 Uniprot:C9JZH6
        Length = 136

 Score = 609 (219.4 bits), Expect = 2.2e-59, P = 2.2e-59
 Identities = 111/136 (81%), Positives = 124/136 (91%)

Query:    26 MMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGFKGRCFMTH 85
             M+DCGIHPGL GMDALP++DL++  +IDLLLISHFHLDHCGALPWFL KT FKGR FMTH
Sbjct:     1 MLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTSFKGRTFMTH 60

Query:    86 ATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNGIKFSAYNAG 145
             ATKAIYRWLLSDY+KVSNIS + MLYTE+DLE+SMDKIETINFHE K+V GIKF  Y+AG
Sbjct:    61 ATKAIYRWLLSDYVKVSNISADDMLYTETDLEESMDKIETINFHEVKEVAGIKFWCYHAG 120

Query:   146 HVLGAAMFLIEIAGVK 161
             HVLGAAMF+IEIAGVK
Sbjct:   121 HVLGAAMFMIEIAGVK 136


>UNIPROTKB|C9J979
            symbol:CPSF3L "Integrator complex subunit 11" species:9606
            "Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 GO:GO:0016787
            EMBL:AL139287 InterPro:IPR022712 Pfam:PF10996 HOGENOM:HOG000231294
            HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI00514808
            ProteinModelPortal:C9J979 SMR:C9J979 STRING:C9J979
            Ensembl:ENST00000434694 ArrayExpress:C9J979 Bgee:C9J979
            Uniprot:C9J979
        Length = 344

 Score = 268 (99.4 bits), Expect = 2.2e-46, Sum P(2) = 2.2e-46
 Identities = 54/135 (40%), Positives = 82/135 (60%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLISHF 60
             GAGQ+VGRSCI++    K++M+DCG+H G +     P F  + ++    D +D ++ISHF
Sbjct:    10 GAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHF 69

Query:    61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
             HLDHCGALP+F    G+ G  +MTH T+AI   LL DY K++ +   E   +T   ++  
Sbjct:    70 HLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDC 129

Query:   120 MDKIETINFHEEKDV 134
             M K+  ++ H+   V
Sbjct:   130 MKKVVAVHLHQTVQV 144

 Score = 252 (93.8 bits), Expect = 2.2e-46, Sum P(2) = 2.2e-46
 Identities = 52/119 (43%), Positives = 74/119 (62%)

Query:   178 AAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQEL 237
             AA I   +P++LITESTY T + + +  RE  F   +H+ V RGG+ LIPVFALGRAQEL
Sbjct:   219 AAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQEL 278

Query:   238 LLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHI 296
              ++L+ +W     L  +PIY+++ L +K    Y+ +I   N +IR+     N F FKHI
Sbjct:   279 CILLETFWE-RMNLK-VPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMFEFKHI 335

 Score = 41 (19.5 bits), Expect = 6.2e-22, Sum P(2) = 6.2e-22
 Identities = 8/27 (29%), Positives = 15/27 (55%)

Query:   137 IKFSAYNAGHVLGAAMFLIEIAGVKIL 163
             I+ +   AG  +G +  L+ IAG  ++
Sbjct:     4 IRVTPLGAGQDVGRSCILVSIAGKNVM 30


>UNIPROTKB|E9PNS4
            symbol:CPSF3L "Integrator complex subunit 11" species:9606
            "Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 GO:GO:0016787
            EMBL:AL139287 HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI00984775
            ProteinModelPortal:E9PNS4 SMR:E9PNS4 Ensembl:ENST00000528879
            ArrayExpress:E9PNS4 Bgee:E9PNS4 Uniprot:E9PNS4
        Length = 278

 Score = 477 (173.0 bits), Expect = 2.1e-45, P = 2.1e-45
 Identities = 95/225 (42%), Positives = 141/225 (62%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLISHF 60
             GAGQ+VGRSCI++    K++M+DCG+H G +     P F  + ++    D +D ++ISHF
Sbjct:    10 GAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHF 69

Query:    61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
             HLDHCGALP+F    G+ G  +MTH T+AI   LL DY K++ +   E   +T   ++  
Sbjct:    70 HLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDC 129

Query:   120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
             M K+  ++ H+   V+  ++  AY AGHVLGAAMF I++    ++YTGD++   DRHL A
Sbjct:   130 MKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGA 189

Query:   179 AEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGR 223
             A I   +P++LITESTY T + + +  RE  F   +H+ V RGG+
Sbjct:   190 AWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGK 234


>UNIPROTKB|E9PI75
            symbol:CPSF3L "Integrator complex subunit 11" species:9606
            "Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 GO:GO:0016787
            EMBL:AL139287 HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI01011963
            ProteinModelPortal:E9PI75 SMR:E9PI75 Ensembl:ENST00000527719
            ArrayExpress:E9PI75 Bgee:E9PI75 Uniprot:E9PI75
        Length = 209

 Score = 411 (149.7 bits), Expect = 4.0e-38, P = 4.0e-38
 Identities = 83/194 (42%), Positives = 123/194 (63%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLISHF 60
             GAGQ+VGRSCI++    K++M+DCG+H G +     P F  + ++    D +D ++ISHF
Sbjct:    16 GAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHF 75

Query:    61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
             HLDHCGALP+F    G+ G  +MTH T+AI   LL DY K++ +   E   +T   ++  
Sbjct:    76 HLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDC 135

Query:   120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
             M K+  ++ H+   V+  ++  AY AGHVLGAAMF I++    ++YTGD++   DRHL A
Sbjct:   136 MKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGA 195

Query:   179 AEIPPVKPDILITE 192
             A I   +P++LITE
Sbjct:   196 AWIDKCRPNLLITE 209


>TIGR_CMR|CPS_2623
            symbol:CPS_2623 "metallo-beta-lactamase family protein"
            species:167879 "Colwellia psychrerythraea 34H" [GO:0003824
            "catalytic activity" evidence=ISS] [GO:0008152 "metabolic process"
            evidence=ISS] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 GO:GO:0016787 EMBL:CP000083 GenomeReviews:CP000083_GR
            eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 HOGENOM:HOG000244774 KO:K07576 RefSeq:YP_269337.1
            ProteinModelPortal:Q481D2 STRING:Q481D2 GeneID:3521490
            KEGG:cps:CPS_2623 PATRIC:21468305 OMA:HGPMVII
            ProtClustDB:CLSK2524370 BioCyc:CPSY167879:GI48-2685-MONOMER
            Uniprot:Q481D2
        Length = 451

 Score = 410 (149.4 bits), Expect = 5.2e-38, P = 5.2e-38
 Identities = 122/447 (27%), Positives = 222/447 (49%)

Query:     4 LKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDL-VESDQIDLLLISHFHL 62
             L G G   G S   +E     I++DCG++ G   + A     L ++   +D ++++H HL
Sbjct:     6 LGGTGTVTG-SKYFVETSTTKILVDCGLYQGYKWLRARNREPLPLDLKSLDAIVLTHAHL 64

Query:    63 DHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSD--YI-----------KVSNISTEQM 109
             DH G +P  L K GF+G  +   AT ++   LL D  +I           K+S     + 
Sbjct:    65 DHSGFIP-ALYKQGFRGHVYAHQATISLCSILLPDSGHIQEDDAKFYGKHKISRHENPEP 123

Query:   110 LYTESDLEKSMDKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFS 169
             LY ++  E  +   + ++F+EE  +  I+    +AGH+LGAA  +++  G ++ ++GD  
Sbjct:   124 LYDKATAEACLSLFKAVDFNEEFKIGDIEIELQSAGHILGAASVILKADGKRVGFSGDVG 183

Query:   170 RQEDRHLMAAE-IPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPV 228
             R +D  +   + +PPV  D+L+ ESTYG  +H++ +  E +   +++    +GG  LIP 
Sbjct:   184 RPDDIIMYPPKPLPPV--DLLLLESTYGNRLHDKEDAFE-QLAEIVNSTAKKGGALLIPS 240

Query:   229 FALGRAQELLLILDEYWS--LHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQ-- 284
             FA+GR + +  +L       L P+L   P+Y  S +A    ++Y  + + +N R+  +  
Sbjct:   241 FAVGRTEAVQHMLASLMKKELIPKL---PVYLDSPMAINVFNIYCEHFD-LN-RLSNEEC 295

Query:   285 ISINNPFVF-KHISNLKGIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIA 343
             + + N   F + +   K +   E I P +++A  GM   G      +    D +  V+  
Sbjct:   296 LEMCNVATFTRTVDESKALS--ELIMPHIIIAGSGMATGGRILHHLKRLLGDYRTTVLFT 353

Query:   344 GYCVEGTLAKTILSEPEEVIGMSGQRLPLKMSVDYIS-FSAHTDYQQTSEFVR--ELRP- 399
             GY   GT    +L+  + V  + G+ LP+K  V+ ++  S H DY+  +++++  +L P 
Sbjct:   354 GYLSGGTRGAKMLAGKDNV-KIHGKWLPVKARVEVLNGLSGHGDYEDITQWLQISKLHPK 412

Query:   400 AHVVLVHGEQNEMSRLKAALTREYEDD 426
               V+LVHGE      ++  L +  + D
Sbjct:   413 TKVLLVHGEPEASESMRDHLMQHTQFD 439


>UNIPROTKB|E9PIG1
            symbol:CPSF3L "Integrator complex subunit 11" species:9606
            "Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 GO:GO:0016787
            EMBL:AL139287 HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI00981641
            ProteinModelPortal:E9PIG1 SMR:E9PIG1 Ensembl:ENST00000530031
            ArrayExpress:E9PIG1 Bgee:E9PIG1 Uniprot:E9PIG1
        Length = 249

 Score = 406 (148.0 bits), Expect = 1.5e-37, P = 1.5e-37
 Identities = 82/193 (42%), Positives = 122/193 (63%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLISHF 60
             GAGQ+VGRSCI++    K++M+DCG+H G +     P F  + ++    D +D ++ISHF
Sbjct:    57 GAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHF 116

Query:    61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
             HLDHCGALP+F    G+ G  +MTH T+AI   LL DY K++ +   E   +T   ++  
Sbjct:   117 HLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDC 176

Query:   120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
             M K+  ++ H+   V+  ++  AY AGHVLGAAMF I++    ++YTGD++   DRHL A
Sbjct:   177 MKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGA 236

Query:   179 AEIPPVKPDILIT 191
             A I   +P++LIT
Sbjct:   237 AWIDKCRPNLLIT 249


>UNIPROTKB|E2R496
            symbol:CPSF2 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0006398 "histone mRNA 3'-end processing"
            evidence=IEA] [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=IEA] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0006379 "mRNA cleavage" evidence=IEA]
            [GO:0006378 "mRNA polyadenylation" evidence=IEA] InterPro:IPR001279
            InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0016787 GO:GO:0005847 KO:K14402 InterPro:IPR022712
            InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
            Pfam:PF10996 Pfam:PF13299 SMART:SM01027
            GeneTree:ENSGT00700000104551 OMA:NNPFQFK CTD:53981 GO:GO:0006398
            EMBL:AAEX03005582 RefSeq:XP_537353.2 ProteinModelPortal:E2R496
            Ensembl:ENSCAFT00000017381 GeneID:480230 KEGG:cfa:480230
            NextBio:20855279 Uniprot:E2R496
        Length = 782

 Score = 390 (142.3 bits), Expect = 1.3e-36, Sum P(2) = 1.3e-36
 Identities = 106/374 (28%), Positives = 187/374 (50%)

Query:     1 MKTLKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHF 60
             + TL G  QE    C +L+      ++DCG     S MD +  +      QID +L+SH 
Sbjct:     7 LTTLSGV-QEESALCYLLQVDEFRFLLDCGWDEHFS-MDIIDSLRK-HVHQIDAVLLSHP 63

Query:    61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM-LYTESDLEKS 119
                H GALP+ + K G     + T     + +  + D  + S  +TE   L+T  D++ +
Sbjct:    64 DPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQ-SRHNTEDFTLFTLDDVDAA 122

Query:   120 MDKIETINFHEEKDV----NGIKFSAYNAGHVLGAAMFLIEIAGVK-ILYTGDFSRQEDR 174
              DKI+ + F +  ++    +G+  +   AGH++G  ++ I   G + I+Y  DF+ + + 
Sbjct:   123 FDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKREI 182

Query:   175 HLMAAEIPPV-KPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGR 233
             HL    +  + +P +LIT+S   T+V  +R++R+ +  + + + +   G  LI V   GR
Sbjct:   183 HLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTAGR 242

Query:   234 AQELLLILDEYWSLHPE-LHDIPIYYASSLAKKCMSVYQTYINAMNDRIRR--QISINNP 290
               EL  +LD+ W      L    +   ++++   +   ++ +  M+D++ R  +   NNP
Sbjct:   243 VLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNP 302

Query:   291 FVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEG 349
             F F+H+S   G+     +  P VV+AS   ++ G SR+LF  WC D KN +I+      G
Sbjct:   303 FQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTTPG 362

Query:   350 TLAKTILSEPEEVI 363
             TLA+ ++  P E I
Sbjct:   363 TLARFLIDNPSEKI 376

 Score = 74 (31.1 bits), Expect = 1.3e-36, Sum P(2) = 1.3e-36
 Identities = 21/89 (23%), Positives = 44/89 (49%)

Query:   356 LSE-PEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSR 414
             LS+ P + I  + + + +K  V YI +   +D     + + +++P  +++VHG   E S+
Sbjct:   515 LSDVPTKCISTT-ESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPP-EASQ 572

Query:   415 LKAALTREYEDDPNTSMELYNPRNTVSVD 443
               A   R +       +++Y P+   +VD
Sbjct:   573 DLAECCRAFG---GKDIKVYMPKLHETVD 598


>UNIPROTKB|Q9P2I0
            symbol:CPSF2 "Cleavage and polyadenylation specificity
            factor subunit 2" species:9606 "Homo sapiens" [GO:0006378 "mRNA
            polyadenylation" evidence=IEA] [GO:0006379 "mRNA cleavage"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            [GO:0003723 "RNA binding" evidence=IEA] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0006398 "histone mRNA 3'-end processing"
            evidence=IDA] [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=IDA] [GO:0000398 "mRNA
            splicing, via spliceosome" evidence=TAS] [GO:0005654 "nucleoplasm"
            evidence=TAS] [GO:0006366 "transcription from RNA polymerase II
            promoter" evidence=TAS] [GO:0006369 "termination of RNA polymerase
            II transcription" evidence=TAS] [GO:0006397 "mRNA processing"
            evidence=TAS] [GO:0006406 "mRNA export from nucleus" evidence=TAS]
            [GO:0008380 "RNA splicing" evidence=TAS] [GO:0010467 "gene
            expression" evidence=TAS] [GO:0031124 "mRNA 3'-end processing"
            evidence=TAS] Reactome:REACT_71 InterPro:IPR001279
            InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            EMBL:CH471061 Reactome:REACT_1675 GO:GO:0003723 GO:GO:0016787
            GO:GO:0006406 GO:GO:0000398 Reactome:REACT_1788 GO:GO:0005847
            GO:GO:0006369 eggNOG:COG1236 KO:K14402 InterPro:IPR022712
            InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
            Pfam:PF10996 Pfam:PF13299 SMART:SM01027 Reactome:REACT_78
            HOGENOM:HOG000264343 OMA:NNPFQFK CTD:53981 HOVERGEN:HBG051106
            OrthoDB:EOG4MCWZQ GO:GO:0006398 EMBL:AK001627 EMBL:BC070095
            EMBL:AB037788 EMBL:AL442079 IPI:IPI00419531 RefSeq:NP_059133.1
            UniGene:Hs.657632 UniGene:Hs.736541 ProteinModelPortal:Q9P2I0
            SMR:Q9P2I0 DIP:DIP-42500N IntAct:Q9P2I0 MINT:MINT-1697677
            STRING:Q9P2I0 PhosphoSite:Q9P2I0 DMDM:51338827 PaxDb:Q9P2I0
            PeptideAtlas:Q9P2I0 PRIDE:Q9P2I0 Ensembl:ENST00000298875
            GeneID:53981 KEGG:hsa:53981 UCSC:uc001yah.2 GeneCards:GC14P092588
            HGNC:HGNC:2325 HPA:HPA024238 MIM:606028 neXtProt:NX_Q9P2I0
            PharmGKB:PA26842 InParanoid:Q9P2I0 PhylomeDB:Q9P2I0 ChiTaRS:CPSF2
            GenomeRNAi:53981 NextBio:56268 ArrayExpress:Q9P2I0 Bgee:Q9P2I0
            CleanEx:HS_CPSF2 Genevestigator:Q9P2I0 GermOnline:ENSG00000165934
            Uniprot:Q9P2I0
        Length = 782

 Score = 390 (142.3 bits), Expect = 1.3e-36, Sum P(2) = 1.3e-36
 Identities = 106/374 (28%), Positives = 187/374 (50%)

Query:     1 MKTLKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHF 60
             + TL G  QE    C +L+      ++DCG     S MD +  +      QID +L+SH 
Sbjct:     7 LTTLSGV-QEESALCYLLQVDEFRFLLDCGWDEHFS-MDIIDSLRK-HVHQIDAVLLSHP 63

Query:    61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM-LYTESDLEKS 119
                H GALP+ + K G     + T     + +  + D  + S  +TE   L+T  D++ +
Sbjct:    64 DPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQ-SRHNTEDFTLFTLDDVDAA 122

Query:   120 MDKIETINFHEEKDV----NGIKFSAYNAGHVLGAAMFLIEIAGVK-ILYTGDFSRQEDR 174
              DKI+ + F +  ++    +G+  +   AGH++G  ++ I   G + I+Y  DF+ + + 
Sbjct:   123 FDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKREI 182

Query:   175 HLMAAEIPPV-KPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGR 233
             HL    +  + +P +LIT+S   T+V  +R++R+ +  + + + +   G  LI V   GR
Sbjct:   183 HLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTAGR 242

Query:   234 AQELLLILDEYWSLHPE-LHDIPIYYASSLAKKCMSVYQTYINAMNDRIRR--QISINNP 290
               EL  +LD+ W      L    +   ++++   +   ++ +  M+D++ R  +   NNP
Sbjct:   243 VLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNP 302

Query:   291 FVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEG 349
             F F+H+S   G+     +  P VV+AS   ++ G SR+LF  WC D KN +I+      G
Sbjct:   303 FQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTTPG 362

Query:   350 TLAKTILSEPEEVI 363
             TLA+ ++  P E I
Sbjct:   363 TLARFLIDNPSEKI 376

 Score = 74 (31.1 bits), Expect = 1.3e-36, Sum P(2) = 1.3e-36
 Identities = 21/89 (23%), Positives = 44/89 (49%)

Query:   356 LSE-PEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSR 414
             LS+ P + I  + + + +K  V YI +   +D     + + +++P  +++VHG   E S+
Sbjct:   515 LSDVPTKCISTT-ESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPP-EASQ 572

Query:   415 LKAALTREYEDDPNTSMELYNPRNTVSVD 443
               A   R +       +++Y P+   +VD
Sbjct:   573 DLAECCRAFG---GKDIKVYMPKLHETVD 598


>UNIPROTKB|F1NMN0
            symbol:CPSF2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006378 "mRNA polyadenylation" evidence=IEA]
            [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA]
            [GO:0006398 "histone mRNA 3'-end processing" evidence=IEA]
            InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 GO:GO:0016787 GO:GO:0005847 InterPro:IPR022712
            InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
            Pfam:PF10996 Pfam:PF13299 SMART:SM01027
            GeneTree:ENSGT00700000104551 OMA:NNPFQFK GO:GO:0006398
            EMBL:AADN02003653 IPI:IPI00651282 Ensembl:ENSGALT00000017538
            Uniprot:F1NMN0
        Length = 782

 Score = 388 (141.6 bits), Expect = 1.4e-36, Sum P(2) = 1.4e-36
 Identities = 106/381 (27%), Positives = 192/381 (50%)

Query:     1 MKTLKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHF 60
             + TL G  QE    C +L+      ++DCG     S MD +  +      Q+D +L+SH 
Sbjct:     7 LTTLSGV-QEESALCYLLQVDEFRFLLDCGWDENFS-MDIIDSLKK-HVHQVDAVLLSHP 63

Query:    61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM-LYTESDLEKS 119
                H GALP+ + K G     + T     + +  + D  + S  +TE   L+T  D++ +
Sbjct:    64 DPLHLGALPYAVGKMGLNCAIYATIPVYKMGQMFMYDLYQ-SRHNTEDFTLFTLDDVDAA 122

Query:   120 MDKIETINFHEEKDV----NGIKFSAYNAGHVLGAAMFLIEIAGVK-ILYTGDFSRQEDR 174
              DKI+ + F +  ++    +G+  +   AGH++G  ++ I   G + I+Y  DF+ + + 
Sbjct:   123 FDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKREI 182

Query:   175 HLMAAEIPPV-KPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGR 233
             HL    +  + +P +LIT+S   T+V  +R++R+ +  + + + +   G  LI V   GR
Sbjct:   183 HLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTAGR 242

Query:   234 AQELLLILDEYWSLHPE-LHDIPIYYASSLAKKCMSVYQTYINAMNDRIRR--QISINNP 290
               EL  +LD+ W      L    +   ++++   +   ++ +  M+D++ R  +   NNP
Sbjct:   243 VLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNP 302

Query:   291 FVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEG 349
             F F+H+S    +     +  P VV+AS   ++ G SR+LF  WC D+KN +I+      G
Sbjct:   303 FQFRHLSLCHSLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDSKNSIILTYRTTPG 362

Query:   350 TLAKTILSEP-EEVIGMSGQR 369
             TLA+ ++  P E+VI +  +R
Sbjct:   363 TLARFLIDNPSEKVIDIELRR 383

 Score = 76 (31.8 bits), Expect = 1.4e-36, Sum P(2) = 1.4e-36
 Identities = 24/115 (20%), Positives = 53/115 (46%)

Query:   330 EMWCTDAKNGVIIAGYCV-EGTLAKTILSEPEEVIGMSGQRLPLKMSVDYISFSAHTDYQ 388
             E+  T+ +   + +G    E  + + +   P + I  + + + +K  V YI +   +D  
Sbjct:   489 ELQATEEEKSKLESGLTNGEEPMDQDLSDVPTKCISAT-ESMEIKARVTYIDYEGRSDGD 547

Query:   389 QTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMELYNPRNTVSVD 443
                + + +++P  +V+VHG   E S+  A   R +       +++Y P+   +VD
Sbjct:   548 SIKKIINQMKPRQLVIVHGPP-EASQDLAECCRAFG---GKDIKVYMPKLHETVD 598


>UNIPROTKB|Q10568
            symbol:CPSF2 "Cleavage and polyadenylation specificity
            factor subunit 2" species:9913 "Bos taurus" [GO:0006398 "histone
            mRNA 3'-end processing" evidence=ISS] [GO:0005847 "mRNA cleavage
            and polyadenylation specificity factor complex" evidence=ISS]
            [GO:0003723 "RNA binding" evidence=IEA] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0006379 "mRNA cleavage" evidence=IEA]
            [GO:0006378 "mRNA polyadenylation" evidence=IEA] InterPro:IPR001279
            InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0003723 GO:GO:0016787 GO:GO:0005847 eggNOG:COG1236 KO:K14402
            InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108
            PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
            GeneTree:ENSGT00700000104551 HOGENOM:HOG000264343 OMA:NNPFQFK
            EMBL:X75931 IPI:IPI00688446 PIR:A56351 RefSeq:NP_787002.1
            UniGene:Bt.4077 ProteinModelPortal:Q10568 STRING:Q10568
            PRIDE:Q10568 Ensembl:ENSBTAT00000013500 GeneID:327689
            KEGG:bta:327689 CTD:53981 HOVERGEN:HBG051106 InParanoid:Q10568
            OrthoDB:EOG4MCWZQ NextBio:20810154 GO:GO:0006398 Uniprot:Q10568
        Length = 782

 Score = 389 (142.0 bits), Expect = 1.6e-36, Sum P(2) = 1.6e-36
 Identities = 105/374 (28%), Positives = 187/374 (50%)

Query:     1 MKTLKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHF 60
             + TL G  QE    C +L+      ++DCG     S MD +  +      QID +L+SH 
Sbjct:     7 LTTLSGV-QEESALCYLLQVDEFRFLLDCGWDEHFS-MDIIDSLRK-HVHQIDAVLLSHP 63

Query:    61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM-LYTESDLEKS 119
                H GALP+ + K G     + T     + +  + D  + S  +TE   L+T  D++ +
Sbjct:    64 DPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQ-SRHNTEDFTLFTLDDVDAA 122

Query:   120 MDKIETINFHEEKDV----NGIKFSAYNAGHVLGAAMFLIEIAGVK-ILYTGDFSRQEDR 174
              DKI+ + F +  ++    +G+  +   AGH++G  ++ I   G + I+Y  DF+ + + 
Sbjct:   123 FDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKREI 182

Query:   175 HLMAAEIPPV-KPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGR 233
             HL    +  + +P +LIT+S   T+V  +R++R+ +  + + + +   G  LI V   GR
Sbjct:   183 HLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTAGR 242

Query:   234 AQELLLILDEYWSLHPE-LHDIPIYYASSLAKKCMSVYQTYINAMNDRIRR--QISINNP 290
               EL  +LD+ W      L    +   ++++   +   ++ +  M+D++ R  +   NNP
Sbjct:   243 VLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNP 302

Query:   291 FVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEG 349
             F F+H+S   G+     +  P VV+AS   ++ G SR+LF  WC D KN +I+      G
Sbjct:   303 FQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTTPG 362

Query:   350 TLAKTILSEPEEVI 363
             TLA+ ++  P E +
Sbjct:   363 TLARFLIDNPSEKV 376

 Score = 74 (31.1 bits), Expect = 1.6e-36, Sum P(2) = 1.6e-36
 Identities = 21/89 (23%), Positives = 44/89 (49%)

Query:   356 LSE-PEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSR 414
             LS+ P + I  + + + +K  V YI +   +D     + + +++P  +++VHG   E S+
Sbjct:   515 LSDVPTKCISTT-ESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPP-EASQ 572

Query:   415 LKAALTREYEDDPNTSMELYNPRNTVSVD 443
               A   R +       +++Y P+   +VD
Sbjct:   573 DLAECCRAFG---GKDIKVYMPKLHETVD 598


>UNIPROTKB|Q9W799
            symbol:cpsf2 "Cleavage and polyadenylation specificity
            factor subunit 2" species:8355 "Xenopus laevis" [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=ISS] InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 GO:GO:0005737 GO:GO:0006397
            GO:GO:0003723 GO:GO:0016787 GO:GO:0005847 KO:K14402
            InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108
            PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
            CTD:53981 HOVERGEN:HBG051106 EMBL:AF139986 RefSeq:NP_001081123.1
            UniGene:Xl.3876 ProteinModelPortal:Q9W799 GeneID:394394
            KEGG:xla:394394 Xenbase:XB-GENE-950598 Uniprot:Q9W799
        Length = 783

 Score = 389 (142.0 bits), Expect = 2.1e-36, Sum P(2) = 2.1e-36
 Identities = 108/376 (28%), Positives = 188/376 (50%)

Query:     1 MKTLKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVES--DQIDLLLIS 58
             + TL GA QE    C +L+      ++DCG     S MD    +D V+    Q+D +L+S
Sbjct:     7 LTTLVGA-QEESAVCYLLQVDEFRFLLDCGWDENFS-MD---IIDSVKKYVHQVDAVLLS 61

Query:    59 HFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM-LYTESDLE 117
             H    H GALP+ + K G     + T     + +  + D  + S  +TE   L++  D++
Sbjct:    62 HPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQ-SRHNTEDFSLFSLDDVD 120

Query:   118 KSMDKIETINF----HEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVK-ILYTGDFSRQE 172
              + DKI+ + +    H +   +G+  +   AGH++G  ++ I   G + I+Y  DF+ + 
Sbjct:   121 CAFDKIQQLKYNQIVHLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180

Query:   173 DRHLMAAEIPPV-KPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFAL 231
             + HL    +  + +P +LIT+S   T+V  +R++R+ +  + + + +   G  LI V   
Sbjct:   181 EIHLNGCSLEMINRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240

Query:   232 GRAQELLLILDEYWSLHPE-LHDIPIYYASSLAKKCMSVYQTYINAMNDRIRR--QISIN 288
             GR  EL  +LD+ W      L    +   ++++   +   ++ +  M+D++ R  +   N
Sbjct:   241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300

Query:   289 NPFVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCV 347
             NPF F+H++   G      +  P VV+AS   ++ G SRELF  WC D KN VI+     
Sbjct:   301 NPFQFRHLTLCHGYSDLARVPSPKVVLASQPDLECGFSRELFIQWCQDPKNSVILTYRTT 360

Query:   348 EGTLAKTILSEPEEVI 363
              GTLA+ ++  P E I
Sbjct:   361 PGTLARFLIDHPSERI 376

 Score = 73 (30.8 bits), Expect = 2.1e-36, Sum P(2) = 2.1e-36
 Identities = 19/89 (21%), Positives = 43/89 (48%)

Query:   356 LSE-PEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSR 414
             LS+ P + +  + + + +K  V YI +   +D     + + +++P  +++VHG  +    
Sbjct:   515 LSDVPTKCVSTT-ESMEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPDATQD 573

Query:   415 LKAALTREYEDDPNTSMELYNPRNTVSVD 443
             L  A  R +       +++Y P+   +VD
Sbjct:   574 LAEAC-RAFG---GKDIKVYTPKLHETVD 598

 Score = 39 (18.8 bits), Expect = 7.8e-33, Sum P(2) = 7.8e-33
 Identities = 8/22 (36%), Positives = 13/22 (59%)

Query:   578 YGEAAVPKMFKGEKITITVDKK 599
             YGE   P+ F   ++ +T D+K
Sbjct:   476 YGEIIKPEDFLVPELQVTEDEK 497


>RGD|1309687
            symbol:Cpsf2 "cleavage and polyadenylation specific factor 2,
            100kDa" species:10116 "Rattus norvegicus" [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=IEA;ISO] [GO:0006378 "mRNA polyadenylation" evidence=IEA]
            [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006398 "histone mRNA
            3'-end processing" evidence=IEA;ISO] [GO:0016787 "hydrolase
            activity" evidence=IEA] InterPro:IPR001279 InterPro:IPR027075
            Pfam:PF00753 SMART:SM00849 Pfam:PF07521 RGD:1309687 GO:GO:0016787
            EMBL:CH473982 GO:GO:0005847 KO:K14402 InterPro:IPR022712
            InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
            Pfam:PF10996 Pfam:PF13299 SMART:SM01027
            GeneTree:ENSGT00700000104551 OMA:NNPFQFK CTD:53981
            OrthoDB:EOG4MCWZQ GO:GO:0006398 IPI:IPI00189534
            RefSeq:NP_001100223.1 UniGene:Rn.8038 Ensembl:ENSRNOT00000008612
            GeneID:299256 KEGG:rno:299256 UCSC:RGD:1309687 NextBio:645098
            Uniprot:D3Z9E6
        Length = 782

 Score = 385 (140.6 bits), Expect = 5.1e-36, Sum P(2) = 5.1e-36
 Identities = 106/379 (27%), Positives = 189/379 (49%)

Query:     1 MKTLKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVES-----DQIDLL 55
             + TL G  QE    C +L+      ++DCG     S       VD+++S      QID +
Sbjct:     7 LTTLSGV-QEESALCYLLQVDEFRFLLDCGWDEHFS-------VDIIDSLRKHVHQIDAV 58

Query:    56 LISHFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM-LYTES 114
             L+SH    H GALP+ + K G     + T     + +  + D  + S  +TE   L+T  
Sbjct:    59 LLSHPDPLHLGALPFAVGKLGLNCAIYATIPVYKMGQMFMYDLYQ-SRHNTEDFTLFTLD 117

Query:   115 DLEKSMDKIETINFHEEKDV----NGIKFSAYNAGHVLGAAMFLIEIAGVK-ILYTGDFS 169
             D++ + DKI+ + F +  ++    +G+  +   AGH++G  ++ I   G + I+Y  DF+
Sbjct:   118 DVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFN 177

Query:   170 RQEDRHLMAAEIPPV-KPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPV 228
              + + HL    +  + +P +LIT+S   T+V  +R++R+ +  + + + +   G  LI V
Sbjct:   178 HKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAV 237

Query:   229 FALGRAQELLLILDEYWSLHPE-LHDIPIYYASSLAKKCMSVYQTYINAMNDRIRR--QI 285
                GR  EL  +LD+ W      L    +   ++++   +   ++ +  M+D++ R  + 
Sbjct:   238 DTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFED 297

Query:   286 SINNPFVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAG 344
               NNPF F+H+S   G+     +  P VV+AS   ++ G SR+LF  WC D KN +I+  
Sbjct:   298 KRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTY 357

Query:   345 YCVEGTLAKTILSEPEEVI 363
                 GTLA+ ++  P E +
Sbjct:   358 RTTPGTLARFLIDNPSEKV 376

 Score = 74 (31.1 bits), Expect = 5.1e-36, Sum P(2) = 5.1e-36
 Identities = 22/115 (19%), Positives = 53/115 (46%)

Query:   330 EMWCTDAKNGVIIAGYCV-EGTLAKTILSEPEEVIGMSGQRLPLKMSVDYISFSAHTDYQ 388
             E+  T+ +   + +G    E  + + +   P + +  + + + +K  V YI +   +D  
Sbjct:   489 ELQATEEEKSKLESGLTNGEEPMDQDLSDVPTKCVSAT-ESIEIKARVTYIDYEGRSDGD 547

Query:   389 QTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMELYNPRNTVSVD 443
                + + +++P  +++VHG   E S+  A   R +       +++Y P+   +VD
Sbjct:   548 SIKKIINQMKPRQLIIVHGPP-EASQDLAECCRAFG---GKDIKVYMPKLHETVD 598


>MGI|MGI:1861601
            symbol:Cpsf2 "cleavage and polyadenylation specific factor
            2" species:10090 "Mus musculus" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0003723 "RNA binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=ISO;IDA]
            [GO:0006397 "mRNA processing" evidence=IEA] [GO:0006398 "histone
            mRNA 3'-end processing" evidence=ISO] [GO:0016787 "hydrolase
            activity" evidence=IEA] InterPro:IPR001279 InterPro:IPR027075
            Pfam:PF00753 SMART:SM00849 Pfam:PF07521 MGI:MGI:1861601
            GO:GO:0003723 GO:GO:0016787 GO:GO:0005847 eggNOG:COG1236 KO:K14402
            InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108
            PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
            GeneTree:ENSGT00700000104551 HOGENOM:HOG000264343 OMA:NNPFQFK
            CTD:53981 HOVERGEN:HBG051106 OrthoDB:EOG4MCWZQ GO:GO:0006398
            EMBL:AF012822 EMBL:BC013628 EMBL:BC007163 IPI:IPI00314302
            RefSeq:NP_058552.1 UniGene:Mm.716 ProteinModelPortal:O35218
            SMR:O35218 STRING:O35218 PhosphoSite:O35218 PaxDb:O35218
            PRIDE:O35218 Ensembl:ENSMUST00000047357 GeneID:51786 KEGG:mmu:51786
            UCSC:uc007otx.2 InParanoid:O35218 NextBio:308008 Bgee:O35218
            CleanEx:MM_CPSF2 Genevestigator:O35218
            GermOnline:ENSMUSG00000041781 Uniprot:O35218
        Length = 782

 Score = 384 (140.2 bits), Expect = 6.7e-36, Sum P(2) = 6.7e-36
 Identities = 106/379 (27%), Positives = 189/379 (49%)

Query:     1 MKTLKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVES-----DQIDLL 55
             + TL G  QE    C +L+      ++DCG     S       VD+++S      QID +
Sbjct:     7 LTTLSGV-QEESALCYLLQVDEFRFLLDCGWDEHFS-------VDIIDSLRKHVHQIDAV 58

Query:    56 LISHFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM-LYTES 114
             L+SH    H GALP+ + K G     + T     + +  + D  + S  +TE   L+T  
Sbjct:    59 LLSHPDPLHLGALPFAVGKLGLNCAIYATIPVYKMGQMFMYDLYQ-SRHNTEDFTLFTLD 117

Query:   115 DLEKSMDKIETINFHEEKDV----NGIKFSAYNAGHVLGAAMFLIEIAGVK-ILYTGDFS 169
             D++ + DKI+ + F +  ++    +G+  +   AGH++G  ++ I   G + I+Y  DF+
Sbjct:   118 DVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFN 177

Query:   170 RQEDRHLMAAEIPPV-KPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPV 228
              + + HL    +  + +P +LIT+S   T+V  +R++R+ +  + + + +   G  LI V
Sbjct:   178 HKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAV 237

Query:   229 FALGRAQELLLILDEYWSLHPE-LHDIPIYYASSLAKKCMSVYQTYINAMNDRIRR--QI 285
                GR  EL  +LD+ W      L    +   ++++   +   ++ +  M+D++ R  + 
Sbjct:   238 DTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFED 297

Query:   286 SINNPFVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAG 344
               NNPF F+H+S   G+     +  P VV+AS   ++ G SR+LF  WC D KN +I+  
Sbjct:   298 KRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTY 357

Query:   345 YCVEGTLAKTILSEPEEVI 363
                 GTLA+ ++  P E +
Sbjct:   358 RTTPGTLARFLIDNPTEKV 376

 Score = 74 (31.1 bits), Expect = 6.7e-36, Sum P(2) = 6.7e-36
 Identities = 22/115 (19%), Positives = 53/115 (46%)

Query:   330 EMWCTDAKNGVIIAGYCV-EGTLAKTILSEPEEVIGMSGQRLPLKMSVDYISFSAHTDYQ 388
             E+  T+ +   + +G    E  + + +   P + +  + + + +K  V YI +   +D  
Sbjct:   489 ELQATEEEKSKLESGLTNGEEPMDQDLSDVPTKCVSAT-ESIEIKARVTYIDYEGRSDGD 547

Query:   389 QTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMELYNPRNTVSVD 443
                + + +++P  +++VHG   E S+  A   R +       +++Y P+   +VD
Sbjct:   548 SIKKIINQMKPRQLIIVHGPP-EASQDLAECCRAFG---GKDIKVYMPKLHETVD 598


>UNIPROTKB|F1SD85
            symbol:CPSF2 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0006379
            "mRNA cleavage" evidence=IEA] [GO:0006378 "mRNA polyadenylation"
            evidence=IEA] [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=IEA] InterPro:IPR001279
            InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 GO:GO:0016787
            InterPro:IPR022712 PANTHER:PTHR11203:SF5 Pfam:PF10996 SMART:SM01027
            GeneTree:ENSGT00700000104551 EMBL:CU468363
            Ensembl:ENSSSCT00000002717 OMA:GANDESP Uniprot:F1SD85
        Length = 385

 Score = 389 (142.0 bits), Expect = 1.2e-35, P = 1.2e-35
 Identities = 106/374 (28%), Positives = 187/374 (50%)

Query:     1 MKTLKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHF 60
             + TL G  QE    C +L+      ++DCG     S MD +  +      QID +L+SH 
Sbjct:     7 LTTLSGV-QEESALCYLLQVDEFRFLLDCGWDEHFS-MDIIDSLRK-HVHQIDAVLLSHP 63

Query:    61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM-LYTESDLEKS 119
                H GALP+ + K G     + T     + +  + D  + S  +TE   L+T  D++ +
Sbjct:    64 DPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQ-SRHNTEDFTLFTLDDVDAA 122

Query:   120 MDKIETINFHEEKDV----NGIKFSAYNAGHVLGAAMFLIEIAGVK-ILYTGDFSRQEDR 174
              DKI+ + F +  ++    +G+  +   AGH++G  ++ I   G + I+Y  DF+ + + 
Sbjct:   123 FDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKREI 182

Query:   175 HLMAAEIPPV-KPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGR 233
             HL    +  + +P +LIT+S   T+V  +R++R+ +  + + + +   G  LI V   GR
Sbjct:   183 HLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGSVLIAVDTAGR 242

Query:   234 AQELLLILDEYWSLHPE-LHDIPIYYASSLAKKCMSVYQTYINAMNDRIRR--QISINNP 290
               EL  +LD+ W      L    +   ++++   +   ++ +  M+D++ R  +   NNP
Sbjct:   243 VLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNP 302

Query:   291 FVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEG 349
             F F+H+S   G+     +  P VV+AS   ++ G SR+LF  WC D KN +I+      G
Sbjct:   303 FQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTTPG 362

Query:   350 TLAKTILSEPEEVI 363
             TLA+ ++  P E I
Sbjct:   363 TLARFLIDNPSEKI 376


>ZFIN|ZDB-GENE-040718-79
            symbol:cpsf2 "cleavage and polyadenylation specific
            factor 2" species:7955 "Danio rerio" [GO:0006378 "mRNA
            polyadenylation" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0005847
            "mRNA cleavage and polyadenylation specificity factor complex"
            evidence=IEA] InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 ZFIN:ZDB-GENE-040718-79 GO:GO:0016787
            eggNOG:COG1236 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
            InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
            SMART:SM01027 HOGENOM:HOG000264343 CTD:53981 HOVERGEN:HBG051106
            OrthoDB:EOG4MCWZQ EMBL:BC076029 IPI:IPI00512505
            RefSeq:NP_001002384.1 UniGene:Dr.121547 ProteinModelPortal:Q6DHE5
            STRING:Q6DHE5 PRIDE:Q6DHE5 GeneID:436657 KEGG:dre:436657
            InParanoid:Q6DHE5 NextBio:20831102 ArrayExpress:Q6DHE5 Bgee:Q6DHE5
            Uniprot:Q6DHE5
        Length = 790

 Score = 380 (138.8 bits), Expect = 3.5e-35, Sum P(2) = 3.5e-35
 Identities = 103/372 (27%), Positives = 185/372 (49%)

Query:     1 MKTLKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHF 60
             +  L G  QE    C +L+      ++DCG     S MD +  +      Q+D +L+SH 
Sbjct:     7 LTALSGV-QEESALCYLLQVDEFRFLLDCGWDETFS-MDIIDSLKRYVH-QVDAVLLSHP 63

Query:    61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM-LYTESDLEKS 119
                H GALP+ + K G     + T     + +  + D  + S  +TE   L+T  D++ +
Sbjct:    64 DHVHLGALPYAVGKLGLNCTIYATIPVYKMGQMFMYDLYQ-SRHNTEDFTLFTLDDVDSA 122

Query:   120 MDKIETINFHEEKDV----NGIKFSAYNAGHVLGAAMFLIEIAGVK-ILYTGDFSRQEDR 174
              DKI+ + + +  ++    +G+  +   AGH++G  ++ I   G + I+Y  DF+ + + 
Sbjct:   123 FDKIQQLKYSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIIYGVDFNHKREI 182

Query:   175 HLMAAEIPPV-KPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGR 233
             HL    +  + +P +LIT+S   ++V  +R++R+ +  + + + +   G  LI V   GR
Sbjct:   183 HLNGCSLESLSRPSLLITDSFNASYVQPRRKQRDEQLLTNVMETLRGDGNVLIAVDTAGR 242

Query:   234 AQELLLILDEYWSLHPE-LHDIPIYYASSLAKKCMSVYQTYINAMNDRIRR--QISINNP 290
               EL  +LD+ W      L    +   ++++   +   ++ +  M+D++ R  +   NNP
Sbjct:   243 VLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNP 302

Query:   291 FVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEG 349
             F F+H+S    +     +  P VV+ S   ++SG SRELF  WC DAKN VI+      G
Sbjct:   303 FQFRHLSLCHSLSDLARVPSPKVVLCSQPDLESGFSRELFIQWCQDAKNSVILTYRTTPG 362

Query:   350 TLAKTILSEPEE 361
             TLA+ ++  P E
Sbjct:   363 TLARYLIDNPGE 374

 Score = 72 (30.4 bits), Expect = 3.5e-35, Sum P(2) = 3.5e-35
 Identities = 17/76 (22%), Positives = 36/76 (47%)

Query:   368 QRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDP 427
             Q L ++  V YI +   +D     + + +++P  +++VHG  +    L A   + Y    
Sbjct:   528 QTLDIRARVMYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPDASQDL-AESCKAYS--- 583

Query:   428 NTSMELYNPRNTVSVD 443
                +++Y P+   +VD
Sbjct:   584 GKDIKVYIPKLQETVD 599


>WB|WBGene00017313 [details] [associations]
            symbol:cpsf-2 species:6239 "Caenorhabditis elegans"
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0009792 "embryo
            development ending in birth or egg hatching" evidence=IMP]
            [GO:0051301 "cell division" evidence=IMP] [GO:0000910 "cytokinesis"
            evidence=IMP] [GO:0000003 "reproduction" evidence=IMP] [GO:0040035
            "hermaphrodite genitalia development" evidence=IMP] [GO:0016246
            "RNA interference" evidence=IMP] [GO:0040027 "negative regulation
            of vulval development" evidence=IMP] InterPro:IPR001279
            InterPro:IPR027075 SMART:SM00849 Pfam:PF07521 GO:GO:0005634
            GO:GO:0009792 GO:GO:0016246 GO:GO:0006397 GO:GO:0003723
            GO:GO:0016787 GO:GO:0000910 GO:GO:0040035 GO:GO:0040027
            eggNOG:COG1236 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
            InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
            SMART:SM01027 GeneTree:ENSGT00700000104551 HOGENOM:HOG000264343
            OMA:NNPFQFK EMBL:FO080529 PIR:T32487 RefSeq:NP_504822.1
            ProteinModelPortal:O17403 SMR:O17403 STRING:O17403 PaxDb:O17403
            EnsemblMetazoa:F09G2.4 GeneID:179103 KEGG:cel:CELE_F09G2.4
            CTD:179103 WormBase:F09G2.4 InParanoid:O17403 NextBio:903938
            Uniprot:O17403
        Length = 843

 Score = 383 (139.9 bits), Expect = 1.2e-34, Sum P(2) = 1.2e-34
 Identities = 107/374 (28%), Positives = 186/374 (49%)

Query:     1 MKTLKGAGQEVGRSCIMLEFKNKSIMMDCGIHP--GLSGMDAL-PFVDLVESDQIDLLLI 57
             +K   GA  E G  C +L+     I++DCG     GL   + L PF+      +I  +LI
Sbjct:     7 LKVFSGAKDE-GPLCYLLQVDGDYILLDCGWDERFGLQYFEELKPFIP-----KISAVLI 60

Query:    58 SHFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQML-YTESDL 116
             SH    H G LP+ + K G     + T     + +  + D +  S++  E+   YT  D+
Sbjct:    61 SHPDPLHLGGLPYLVSKCGLTAPVYATVPVYKMGQMFIYDMV-YSHLDVEEFEHYTLDDV 119

Query:   117 EKSMDKIETINFHEE---KDVNGIKFSAYNAGHVLGAAMFLI-EIAGVKILYTGDFSRQE 172
             + + +K+E + +++    K  +G+ F+A  AGH+LG +++ I  + G  I+Y  DF+ ++
Sbjct:   120 DTAFEKVEQVKYNQTVVLKGDSGVHFTALPAGHMLGGSIWRICRVTGEDIVYCVDFNHKK 179

Query:   173 DRHLMAAEIPPV-KPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFAL 231
             +RHL         +P +LIT + + +    +R++R+ +  + I   V + G C+I +   
Sbjct:   180 ERHLNGCSFDNFNRPHLLITGAHHISLPQMRRKDRDEQLVTKILRTVRQKGDCMIVIDTA 239

Query:   232 GRAQELLLILDEYWS-LHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISIN-- 288
             GR  EL  +LD+ WS     L    +   S +A   +   ++ +  MN+++ +  S +  
Sbjct:   240 GRVLELAHLLDQLWSNADAGLSTYNLVMMSHVASSVVQFAKSQLEWMNEKLFKYDSSSAR 299

Query:   289 -NPFVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYC 346
              NPF  KH++          +  P VV+ S   M+SG SRELF  WC+D +NGVI+    
Sbjct:   300 YNPFTLKHVTLCHSHQELMRVRSPKVVLCSSQDMESGFSRELFLDWCSDPRNGVILTARP 359

Query:   347 VEGTLAKTILSEPE 360
                TLA  +++  E
Sbjct:   360 ASFTLAAKLVNMAE 373

 Score = 65 (27.9 bits), Expect = 1.2e-34, Sum P(2) = 1.2e-34
 Identities = 11/49 (22%), Positives = 27/49 (55%)

Query:   369 RLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKA 417
             R+ +   +++I +   +D + T + +  L P  +++VHG +++   L A
Sbjct:   562 RVEVSCRIEFIEYEGISDGESTKKLLAGLLPRQIIVVHGSRDDTRDLVA 610


>UNIPROTKB|O17403 [details] [associations]
            symbol:cpsf-2 "Probable cleavage and polyadenylation
            specificity factor subunit 2" species:6239 "Caenorhabditis elegans"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0003674
            "molecular_function" evidence=ND] InterPro:IPR001279
            InterPro:IPR027075 SMART:SM00849 Pfam:PF07521 GO:GO:0005634
            GO:GO:0009792 GO:GO:0016246 GO:GO:0006397 GO:GO:0003723
            GO:GO:0016787 GO:GO:0000910 GO:GO:0040035 GO:GO:0040027
            eggNOG:COG1236 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
            InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
            SMART:SM01027 GeneTree:ENSGT00700000104551 HOGENOM:HOG000264343
            OMA:NNPFQFK EMBL:FO080529 PIR:T32487 RefSeq:NP_504822.1
            ProteinModelPortal:O17403 SMR:O17403 STRING:O17403 PaxDb:O17403
            EnsemblMetazoa:F09G2.4 GeneID:179103 KEGG:cel:CELE_F09G2.4
            CTD:179103 WormBase:F09G2.4 InParanoid:O17403 NextBio:903938
            Uniprot:O17403
        Length = 843

 Score = 383 (139.9 bits), Expect = 1.2e-34, Sum P(2) = 1.2e-34
 Identities = 107/374 (28%), Positives = 186/374 (49%)

Query:     1 MKTLKGAGQEVGRSCIMLEFKNKSIMMDCGIHP--GLSGMDAL-PFVDLVESDQIDLLLI 57
             +K   GA  E G  C +L+     I++DCG     GL   + L PF+      +I  +LI
Sbjct:     7 LKVFSGAKDE-GPLCYLLQVDGDYILLDCGWDERFGLQYFEELKPFIP-----KISAVLI 60

Query:    58 SHFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQML-YTESDL 116
             SH    H G LP+ + K G     + T     + +  + D +  S++  E+   YT  D+
Sbjct:    61 SHPDPLHLGGLPYLVSKCGLTAPVYATVPVYKMGQMFIYDMV-YSHLDVEEFEHYTLDDV 119

Query:   117 EKSMDKIETINFHEE---KDVNGIKFSAYNAGHVLGAAMFLI-EIAGVKILYTGDFSRQE 172
             + + +K+E + +++    K  +G+ F+A  AGH+LG +++ I  + G  I+Y  DF+ ++
Sbjct:   120 DTAFEKVEQVKYNQTVVLKGDSGVHFTALPAGHMLGGSIWRICRVTGEDIVYCVDFNHKK 179

Query:   173 DRHLMAAEIPPV-KPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFAL 231
             +RHL         +P +LIT + + +    +R++R+ +  + I   V + G C+I +   
Sbjct:   180 ERHLNGCSFDNFNRPHLLITGAHHISLPQMRRKDRDEQLVTKILRTVRQKGDCMIVIDTA 239

Query:   232 GRAQELLLILDEYWS-LHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISIN-- 288
             GR  EL  +LD+ WS     L    +   S +A   +   ++ +  MN+++ +  S +  
Sbjct:   240 GRVLELAHLLDQLWSNADAGLSTYNLVMMSHVASSVVQFAKSQLEWMNEKLFKYDSSSAR 299

Query:   289 -NPFVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYC 346
              NPF  KH++          +  P VV+ S   M+SG SRELF  WC+D +NGVI+    
Sbjct:   300 YNPFTLKHVTLCHSHQELMRVRSPKVVLCSSQDMESGFSRELFLDWCSDPRNGVILTARP 359

Query:   347 VEGTLAKTILSEPE 360
                TLA  +++  E
Sbjct:   360 ASFTLAAKLVNMAE 373

 Score = 65 (27.9 bits), Expect = 1.2e-34, Sum P(2) = 1.2e-34
 Identities = 11/49 (22%), Positives = 27/49 (55%)

Query:   369 RLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKA 417
             R+ +   +++I +   +D + T + +  L P  +++VHG +++   L A
Sbjct:   562 RVEVSCRIEFIEYEGISDGESTKKLLAGLLPRQIIVVHGSRDDTRDLVA 610


>UNIPROTKB|Q9KV92 [details] [associations]
            symbol:VC_0264 "Putative uncharacterized protein"
            species:243277 "Vibrio cholerae O1 biovar El Tor str. N16961"
            [GO:0003674 "molecular_function" evidence=ND] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 EMBL:AE003852 GenomeReviews:AE003852_GR GO:GO:0016787
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            KO:K07576 OMA:CHIDHVG PIR:F82345 RefSeq:NP_229920.1
            ProteinModelPortal:Q9KV92 DNASU:2614470 GeneID:2614470
            KEGG:vch:VC0264 PATRIC:20079570 ProtClustDB:CLSK2517501
            Uniprot:Q9KV92
        Length = 455

 Score = 376 (137.4 bits), Expect = 3.5e-34, P = 3.5e-34
 Identities = 115/435 (26%), Positives = 201/435 (46%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
             G    V  SC  L    +++++DCG+     G D  P         +D L+++H H+DH 
Sbjct:    24 GGKASVTGSCHELRADGQALLIDCGL---FQGADERPLAVEFALGHVDALILTHAHIDHI 80

Query:    66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKV----SNISTEQMLYTESDLEKSMD 121
             G LPW LL  G K   + T AT  +   +L D +K+    S   +E++L     L +  D
Sbjct:    81 GRLPW-LLAAGLKQPIYSTAATAELVPLMLEDGLKLQLGMSPKQSERVLTEVRRLLRVQD 139

Query:   122 KIETINFHEEK-DVNGIKFSAYNAGHVLGAAMFLIEIA-GVKILYTGDFSRQEDRHLMAA 179
               +      ++ D   ++F    AGH+LG+A   I    G  ++++GD        L+  
Sbjct:   140 YQKWFAVQPKRADSLWVRFQP--AGHILGSAYVEIRRPNGEVVVFSGDLGPSHTP-LLPD 196

Query:   180 EIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLL 239
                P + D L  E+TYG   HE  + R  R  ++I   +  GG  LIP F++GR QELL 
Sbjct:   197 PQSPERADYLFIETTYGDKQHEDVQSRGQRLRAMIERSLTDGGAILIPAFSVGRTQELLF 256

Query:   240 ILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISIN-NPFVFK---- 294
              +++         ++PI   S +A++    Y+ +        + ++ ++ +P  F+    
Sbjct:   257 DIEQLIFSQQIDANLPIILDSPMAQRVTRSYRRFKQLWGREAKARLQMHRHPLAFEQCIT 316

Query:   295 ---HISNLKGIDHFEDIGPC-VVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGT 350
                H ++ + ++     G   +V+A+ GM Q G   +  +    D +  +I+AG+  EGT
Sbjct:   317 VEDHRTHERLVNRLASTGEAAIVVAASGMCQGGRIMDYLKALLPDKRTDLILAGFQAEGT 376

Query:   351 LAKTILS-EPEEVIGMSGQRLPLKMSVDYIS-FSAHTDYQQTSEFVREL--RPAHVVLVH 406
             L ++I S +P   + + G  + +   +  +S +SAH D      F+  +  +P  V L+H
Sbjct:   377 LGRSIQSGQPS--VWIEGTEVEVNAHIHTMSGYSAHADKADLLRFITGIPEKPKQVHLIH 434

Query:   407 GEQNEMSRLKAALTR 421
             GE        A LT+
Sbjct:   435 GEAPAKQAFAAELTQ 449


>TIGR_CMR|VC_0264 [details] [associations]
            symbol:VC_0264 "conserved hypothetical protein" species:686
            "Vibrio cholerae O1 biovar El Tor" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0008150 "biological_process" evidence=ND] InterPro:IPR001279
            Pfam:PF00753 SMART:SM00849 Pfam:PF07521 EMBL:AE003852
            GenomeReviews:AE003852_GR GO:GO:0016787 InterPro:IPR022712
            InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 KO:K07576 OMA:CHIDHVG
            PIR:F82345 RefSeq:NP_229920.1 ProteinModelPortal:Q9KV92
            DNASU:2614470 GeneID:2614470 KEGG:vch:VC0264 PATRIC:20079570
            ProtClustDB:CLSK2517501 Uniprot:Q9KV92
        Length = 455

 Score = 376 (137.4 bits), Expect = 3.5e-34, P = 3.5e-34
 Identities = 115/435 (26%), Positives = 201/435 (46%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
             G    V  SC  L    +++++DCG+     G D  P         +D L+++H H+DH 
Sbjct:    24 GGKASVTGSCHELRADGQALLIDCGL---FQGADERPLAVEFALGHVDALILTHAHIDHI 80

Query:    66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKV----SNISTEQMLYTESDLEKSMD 121
             G LPW LL  G K   + T AT  +   +L D +K+    S   +E++L     L +  D
Sbjct:    81 GRLPW-LLAAGLKQPIYSTAATAELVPLMLEDGLKLQLGMSPKQSERVLTEVRRLLRVQD 139

Query:   122 KIETINFHEEK-DVNGIKFSAYNAGHVLGAAMFLIEIA-GVKILYTGDFSRQEDRHLMAA 179
               +      ++ D   ++F    AGH+LG+A   I    G  ++++GD        L+  
Sbjct:   140 YQKWFAVQPKRADSLWVRFQP--AGHILGSAYVEIRRPNGEVVVFSGDLGPSHTP-LLPD 196

Query:   180 EIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLL 239
                P + D L  E+TYG   HE  + R  R  ++I   +  GG  LIP F++GR QELL 
Sbjct:   197 PQSPERADYLFIETTYGDKQHEDVQSRGQRLRAMIERSLTDGGAILIPAFSVGRTQELLF 256

Query:   240 ILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISIN-NPFVFK---- 294
              +++         ++PI   S +A++    Y+ +        + ++ ++ +P  F+    
Sbjct:   257 DIEQLIFSQQIDANLPIILDSPMAQRVTRSYRRFKQLWGREAKARLQMHRHPLAFEQCIT 316

Query:   295 ---HISNLKGIDHFEDIGPC-VVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGT 350
                H ++ + ++     G   +V+A+ GM Q G   +  +    D +  +I+AG+  EGT
Sbjct:   317 VEDHRTHERLVNRLASTGEAAIVVAASGMCQGGRIMDYLKALLPDKRTDLILAGFQAEGT 376

Query:   351 LAKTILS-EPEEVIGMSGQRLPLKMSVDYIS-FSAHTDYQQTSEFVREL--RPAHVVLVH 406
             L ++I S +P   + + G  + +   +  +S +SAH D      F+  +  +P  V L+H
Sbjct:   377 LGRSIQSGQPS--VWIEGTEVEVNAHIHTMSGYSAHADKADLLRFITGIPEKPKQVHLIH 434

Query:   407 GEQNEMSRLKAALTR 421
             GE        A LT+
Sbjct:   435 GEAPAKQAFAAELTQ 449


>FB|FBgn0027873 [details] [associations]
            symbol:Cpsf100 "Cleavage and polyadenylation specificity
            factor 100" species:7227 "Drosophila melanogaster" [GO:0006379
            "mRNA cleavage" evidence=ISS;NAS] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=ISS]
            [GO:0006378 "mRNA polyadenylation" evidence=ISS;IMP;NAS]
            [GO:0003730 "mRNA 3'-UTR binding" evidence=NAS] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0006398 "histone mRNA 3'-end
            processing" evidence=IMP] [GO:0022008 "neurogenesis" evidence=IMP]
            InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 EMBL:AE014297 GO:GO:0022008 GO:GO:0006378
            GO:GO:0003723 GO:GO:0016787 GO:GO:0005847 GO:GO:0006379
            eggNOG:COG1236 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
            InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
            SMART:SM01027 GeneTree:ENSGT00700000104551 OMA:NNPFQFK
            GO:GO:0006398 EMBL:AF160933 RefSeq:NP_651658.1 RefSeq:NP_733264.1
            UniGene:Dm.1362 ProteinModelPortal:Q9V3D6 SMR:Q9V3D6 IntAct:Q9V3D6
            STRING:Q9V3D6 PaxDb:Q9V3D6 PRIDE:Q9V3D6 EnsemblMetazoa:FBtr0085357
            GeneID:43426 KEGG:dme:Dmel_CG1957 UCSC:CG1957-RA CTD:43426
            FlyBase:FBgn0027873 InParanoid:Q8IML7 OrthoDB:EOG4XD261
            PhylomeDB:Q9V3D6 GenomeRNAi:43426 NextBio:833860 Bgee:Q9V3D6
            GermOnline:CG1957 Uniprot:Q9V3D6
        Length = 756

 Score = 354 (129.7 bits), Expect = 6.9e-32, Sum P(2) = 6.9e-32
 Identities = 100/369 (27%), Positives = 184/369 (49%)

Query:     1 MKTLKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVES-DQIDLLLISH 59
             + T+ GA  E    C +L+  +  I++DCG        DA    +L      +D +L+SH
Sbjct:     7 LHTISGAMDE-SPPCYILQIDDVRILLDCGWD---EKFDANFIKELKRQVHTLDAVLLSH 62

Query:    60 FHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSD-YIKVSNISTEQMLYTESDLEK 118
                 H GALP+ + K G     + T     + +  + D Y+   N+     L++  D++ 
Sbjct:    63 PDAYHLGALPYLVGKLGLNCPIYATIPVFKMGQMFMYDLYMSHFNMGDFD-LFSLDDVDT 121

Query:   119 SMDKIETINFHEE---KDVN-GIKFSAYNAGHVLGAAMF-LIEIAGVKILYTGDFSRQED 173
             + +KI  + +++    KD   GI  +  NAGH++G  ++ ++++    I+Y  DF+ +++
Sbjct:   122 AFEKITQLKYNQTVSLKDKGYGISITPLNAGHMIGGTIWKIVKVGEEDIVYATDFNHKKE 181

Query:   174 RHLMAAEIPPV-KPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALG 232
             RHL   E+  + +P +LIT++    +   +R  R+ +  + I   V   G  LI V   G
Sbjct:   182 RHLSGCELDRLQRPSLLITDAYNAQYQQARRRARDEKLMTNILQTVRNNGNVLIAVDTAG 241

Query:   233 RAQELLLILDEYW-SLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQI--SINN 289
             R  EL  +LD+ W +    L    +   ++++   +   ++ I  M+D++ +    + NN
Sbjct:   242 RVLELAHMLDQLWKNKESGLMAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARNN 301

Query:   290 PFVFKHISNLKGI-DHFE-DIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCV 347
             PF FKHI     + D ++   GP VV+AS   ++SG +R+LF  W ++A N +I+     
Sbjct:   302 PFQFKHIQLCHSLADVYKLPAGPKVVLASTPDLESGFTRDLFVQWASNANNSIILTTRTS 361

Query:   348 EGTLAKTIL 356
              GTLA  ++
Sbjct:   362 PGTLAMELV 370

 Score = 69 (29.3 bits), Expect = 6.9e-32, Sum P(2) = 6.9e-32
 Identities = 20/90 (22%), Positives = 43/90 (47%)

Query:   355 ILSEPEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSR 414
             +L +P ++I    + + +   V  I F   +D +   + + +LRP  V+++HG   E ++
Sbjct:   526 LLEKPTKLISQR-KTIEVNAQVQRIDFEGRSDGESMLKILSQLRPRRVIVIHGTA-EGTQ 583

Query:   415 LKAALTREYEDDPNTSMELYNPRNTVSVDL 444
             + A   R  E   N    ++ P+    +D+
Sbjct:   584 VVA---RHCEQ--NVGARVFTPQKGEIIDV 608


>TAIR|locus:2172843 [details] [associations]
            symbol:CPSF100 "cleavage and polyadenylation specificity
            factor 100" species:3702 "Arabidopsis thaliana" [GO:0005634
            "nucleus" evidence=ISM;IDA] [GO:0009793 "embryo development ending
            in seed dormancy" evidence=NAS] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0003677 "DNA binding" evidence=IPI] [GO:0005515
            "protein binding" evidence=IPI] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=ISS;NAS]
            [GO:0006378 "mRNA polyadenylation" evidence=ISS] [GO:0006379 "mRNA
            cleavage" evidence=ISS] [GO:0035194 "posttranscriptional gene
            silencing by RNA" evidence=IMP] [GO:0009506 "plasmodesma"
            evidence=IDA] [GO:0000278 "mitotic cell cycle" evidence=RCA]
            [GO:0006306 "DNA methylation" evidence=RCA] [GO:0006342 "chromatin
            silencing" evidence=RCA] [GO:0006396 "RNA processing" evidence=RCA]
            [GO:0006397 "mRNA processing" evidence=RCA] [GO:0007267 "cell-cell
            signaling" evidence=RCA] [GO:0009220 "pyrimidine ribonucleotide
            biosynthetic process" evidence=RCA] [GO:0009616 "virus induced gene
            silencing" evidence=RCA] [GO:0009640 "photomorphogenesis"
            evidence=RCA] [GO:0010267 "production of ta-siRNAs involved in RNA
            interference" evidence=RCA] [GO:0010388 "cullin deneddylation"
            evidence=RCA] [GO:0016569 "covalent chromatin modification"
            evidence=RCA] [GO:0031047 "gene silencing by RNA" evidence=RCA]
            [GO:0035196 "production of miRNAs involved in gene silencing by
            miRNA" evidence=RCA] InterPro:IPR001279 InterPro:IPR027075
            Pfam:PF00753 SMART:SM00849 GO:GO:0009506 EMBL:CP002688
            GenomeReviews:BA000015_GR GO:GO:0006378 EMBL:AB005244 GO:GO:0003723
            GO:GO:0016787 GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236 KO:K14402
            InterPro:IPR022712 InterPro:IPR025069 PANTHER:PTHR11203:SF5
            Pfam:PF10996 Pfam:PF13299 SMART:SM01027 EMBL:AF283277 EMBL:AY034982
            EMBL:BT004374 IPI:IPI00521104 RefSeq:NP_197776.1 UniGene:At.25191
            ProteinModelPortal:Q9LKF9 SMR:Q9LKF9 IntAct:Q9LKF9 STRING:Q9LKF9
            PaxDb:Q9LKF9 PRIDE:Q9LKF9 EnsemblPlants:AT5G23880.1 GeneID:832453
            KEGG:ath:AT5G23880 TAIR:At5g23880 HOGENOM:HOG000264343
            InParanoid:Q9LKF9 OMA:NNPFQFK PhylomeDB:Q9LKF9
            ProtClustDB:CLSN2686300 Genevestigator:Q9LKF9 GermOnline:AT5G23880
            GO:GO:0035194 Uniprot:Q9LKF9
        Length = 739

 Score = 373 (136.4 bits), Expect = 1.8e-31, P = 1.8e-31
 Identities = 112/411 (27%), Positives = 209/411 (50%)

Query:    24 SIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGFKGRCFM 83
             + ++DCG +  L     L  +  V S  ID +L+SH    H GALP+ + + G     + 
Sbjct:    29 NFLIDCGWND-LFDTSLLEPLSRVAST-IDAVLLSHPDTLHIGALPYAMKQLGLSAPVY- 85

Query:    84 THATKAIYRW-LLSDYIKVSNISTEQM----LYTESDLEKSMDKIETI----NFHEEKDV 134
               AT+ ++R  LL+ Y +   +S +Q+    L+T  D++ +   +  +    N+H     
Sbjct:    86 --ATEPVHRLGLLTMYDQF--LSRKQVSDFDLFTLDDIDSAFQNVIRLTYSQNYHLSGKG 141

Query:   135 NGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPP-VKPDILITES 193
              GI  + + AGH+LG +++ I   G  ++Y  D++ +++RHL    +   V+P +LIT++
Sbjct:   142 EGIVIAPHVAGHMLGGSIWRITKDGEDVIYAVDYNHRKERHLNGTVLQSFVRPAVLITDA 201

Query:   194 TYGTHVHEQ-REEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELH 252
              +  + ++  R++R+  F   I   +  GG  L+PV   GR  ELLLIL+++WS      
Sbjct:   202 YHALYTNQTARQQRDKEFLDTISKHLEVGGNVLLPVDTAGRVLELLLILEQHWSQRG--F 259

Query:   253 DIPIYYASSLAKKCMSVYQTYINAMNDRIRR--QISINNPFVFKHIS---NLKGIDHFED 307
               PIY+ + ++   +   ++++  M+D I +  + S +N F+ +H++   N   +D+   
Sbjct:   260 SFPIYFLTYVSSSTIDYVKSFLEWMSDSISKSFETSRDNAFLLRHVTLLINKTDLDNAPP 319

Query:   308 IGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEV---IG 364
              GP VV+AS   +++G +RE+F  W  D +N V+       GTLA+ + S P      + 
Sbjct:   320 -GPKVVLASMASLEAGFAREIFVEWANDPRNLVLFTETGQFGTLARMLQSAPPPKFVKVT 378

Query:   365 MSGQRLPLKMSVDYISFSAHTDYQQTSEFVRE--LRPAHVVLVHGEQNEMS 413
             MS +R+PL    + I++    +  +  E +R   ++       HG  +  S
Sbjct:   379 MS-KRVPLA-GEELIAYEEEQNRLKREEALRASLVKEEETKASHGSDDNSS 427


>TIGR_CMR|DET_1061 [details] [associations]
            symbol:DET_1061 "metallo-beta-lactamase family protein"
            species:243164 "Dehalococcoides ethenogenes 195" [GO:0003824
            "catalytic activity" evidence=ISS] [GO:0008152 "metabolic process"
            evidence=ISS] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 GO:GO:0016787 EMBL:CP000027 GenomeReviews:CP000027_GR
            eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 OMA:MAVEYMS HOGENOM:HOG000244774 KO:K07576
            RefSeq:YP_181776.1 ProteinModelPortal:Q3Z7M3 STRING:Q3Z7M3
            GeneID:3229629 KEGG:det:DET1061 PATRIC:21609167
            ProtClustDB:CLSK2516599 BioCyc:DETH243164:GJNF-1062-MONOMER
            Uniprot:Q3Z7M3
        Length = 468

 Score = 267 (99.0 bits), Expect = 4.3e-30, Sum P(2) = 4.3e-30
 Identities = 84/321 (26%), Positives = 148/321 (46%)

Query:   110 LYTESDLEKSMDKIETINFHEEKDVN-GIKFSAYNAGHVLGAAMFLIEIAGVK----ILY 164
             LYT  D        +T+ +  E  V   I  + +NAGHV G+A   ++I        I++
Sbjct:   129 LYTAEDARAVSPLFKTVEYSREIAVTEDITATFHNAGHVFGSASIELKIQENHRQKVIVF 188

Query:   165 TGDFSRQEDRHLMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRC 224
             +GD     DR ++       + D ++ ESTYG   H+   E   +   +I+  V  GG  
Sbjct:   189 SGDLGNW-DRPILKNPDLVNQADYVVIESTYGDRTHQDINEASLKLAEIINQTVKLGGNI 247

Query:   225 LIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQ 284
             +IP FAL R Q+LL  L+ + S   ++  + ++  S +A     +++ +   + DR    
Sbjct:   248 VIPSFALERTQDLLFFLNRFMS-EGKIPSLKVFVDSPMAISITKIFKEHPE-LYDR-ETS 304

Query:   285 ISINN---PFVFK--HISNLKGIDH---FEDIGPCVVMASPGMMQSGLSRELFEMWCTDA 336
               +NN   PF F+  H +N K  D      +  PC+++A  GM   G  +       +  
Sbjct:   305 GWVNNGSSPFEFEGLHFTN-KAADSKAILAEKDPCIIIAGSGMCTGGRIKHHLVNNISRP 363

Query:   337 KNGVIIAGYCVEGTLAKTILSEPEEVIGMSGQRLPLKMSVDYI-SFSAHTDYQQTSEFVR 395
             ++ ++  G+   GTL + I    +EV  + GQ  P++  ++ + +FSAH D      +++
Sbjct:   364 ESTILFVGFQATGTLGRLITDGAKEV-RILGQHYPVQARIEELRAFSAHADQPTLLRWLK 422

Query:   396 ELR--PAHVVLVHGEQNEMSR 414
               +  P  V + HGE    +R
Sbjct:   423 GFKNKPEMVFVTHGEPETSAR 443

 Score = 136 (52.9 bits), Expect = 4.3e-30, Sum P(2) = 4.3e-30
 Identities = 33/94 (35%), Positives = 51/94 (54%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPG--LSGMDALPFVDLVESDQIDLLLISHFHLD 63
             GA + V  S  +++  +  +++DCG++    L   +  PF   +    +  ++ISH H+D
Sbjct:     9 GAARNVTGSRYLIKTDHTQLLVDCGLYQERRLQDRNWQPFE--IPPQSLSAVIISHAHID 66

Query:    64 HCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSD 97
             HCG LP  L+K GF G  F T AT  I R  L+D
Sbjct:    67 HCGLLPK-LVKEGFAGPVFATEATAEIARISLTD 99


>DICTYBASE|DDB_G0270392 [details] [associations]
            symbol:cpsf2 "cleavage and polyadenylation
            specificity factor 100 kDa subunit" species:44689 "Dictyostelium
            discoideum" [GO:0016787 "hydrolase activity" evidence=IEA]
            [GO:0006379 "mRNA cleavage" evidence=IEA;ISS] [GO:0006378 "mRNA
            polyadenylation" evidence=IEA;ISS] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA;ISS]
            [GO:0003723 "RNA binding" evidence=IEA;ISS] [GO:0006397 "mRNA
            processing" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753 SMART:SM00849
            dictyBase:DDB_G0270392 Pfam:PF07521 EMBL:AAFI02000005
            GenomeReviews:CM000150_GR GO:GO:0006378 GO:GO:0003723 GO:GO:0016787
            GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236 KO:K14402
            InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108
            PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
            OMA:NNPFQFK RefSeq:XP_646760.1 ProteinModelPortal:Q55BS1
            STRING:Q55BS1 EnsemblProtists:DDB0233700 GeneID:8617733
            KEGG:ddi:DDB_G0270392 ProtClustDB:CLSZ2431463 Uniprot:Q55BS1
        Length = 784

 Score = 352 (129.0 bits), Expect = 2.8e-29, Sum P(2) = 2.8e-29
 Identities = 110/461 (23%), Positives = 206/461 (44%)

Query:     4 LKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLD 63
             L GA  E    C +LE  +  I++DCG+   L     L  ++ V + +ID +L+SH    
Sbjct:    10 LSGAKDE-SPPCYLLEIDDFCILLDCGLSYNLD-FSLLEPLEKV-AKKIDAVLLSHSDTT 66

Query:    64 HCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSM--D 121
             H G LP+ + K G  G  + T     +    L D  +      E   Y+  +++     D
Sbjct:    67 HIGGLPYVVGKYGLTGTIYGTTPVLKMGTMFLYDLYENKMSQEEFQQYSLDNIDSCFGED 126

Query:   122 KIETINFHEEKDVNG----IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLM 177
             + + ++F +   ++G    I  + Y AGH +GA+++ I      I+Y  D++ + + HL 
Sbjct:   127 RFKELSFSQHYSLSGKGKGISITPYLAGHTIGASVWKITKGTYSIVYAIDYNHRNEGHLD 186

Query:   178 AAEIPP--VKPDILITES--TYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGR 233
             + ++    +KP +LIT+S     T   ++   R+      I+  +  GG  LIPV   GR
Sbjct:   187 SLQLTSDILKPSLLITDSKGVDKTLAFKKTITRDQSLFEQINRNLRDGGNVLIPVDTAGR 246

Query:   234 AQELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDR--IRRQISINNPF 291
               ELLL ++ YWS +  L    + +    +       ++ +  M+    ++ + +I NPF
Sbjct:   247 VLELLLCIENYWSKNKSLALYSVVFLGRFSFSVCQFARSQLEFMSSTASVKFEQNIENPF 306

Query:   292 VFKHISNLKGIDHFEDIGPC--VVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEG 349
              FKHI  L  ++  +++     V++ S   +++G SRELF  WC+D K  ++      + 
Sbjct:   307 SFKHIKILSSLEELQELPDTNKVILTSSQDLETGFSRELFIQWCSDPKTLILFTQKIPKD 366

Query:   350 TLAKTILSEPEEVIG-------MSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHV 402
             +LA  ++ +     G       + G R+PL    + + +      Q+  + + +LR    
Sbjct:   367 SLADKLIKQYSTPNGRGKCIEIVQGSRVPLTGD-ELLQYEMEQAKQREEKRLEQLRKEQE 425

Query:   403 VLVHGEQNEMSRLKAALTREYEDDPNTSMELYNPRNTVSVD 443
                  E+ E    +  L    +D     ++L   +    +D
Sbjct:   426 EREERERLEEEEREQLLNATNQDQLQQLLQLQQQKERGIID 466

 Score = 47 (21.6 bits), Expect = 2.8e-29, Sum P(2) = 2.8e-29
 Identities = 12/35 (34%), Positives = 16/35 (45%)

Query:   480 YHLLAPSDLPKYTDLKASKIIQQQSVYYSGSISVL 514
             Y LL    L     LK SKI+  +  Y  G + +L
Sbjct:   627 YELLLKDSL--VNTLKTSKILDYEVSYIQGKVDIL 659


>TIGR_CMR|CHY_2049 [details] [associations]
            symbol:CHY_2049 "metallo-beta-lactamase family protein"
            species:246194 "Carboxydothermus hydrogenoformans Z-2901"
            [GO:0003824 "catalytic activity" evidence=ISS] [GO:0008152
            "metabolic process" evidence=ISS] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 EMBL:CP000141 GenomeReviews:CP000141_GR
            GO:GO:0016787 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
            Pfam:PF10996 SMART:SM01027 RefSeq:YP_360868.1
            ProteinModelPortal:Q3AAG6 STRING:Q3AAG6 GeneID:3728507
            KEGG:chy:CHY_2049 PATRIC:21277179 HOGENOM:HOG000244774 KO:K07576
            OMA:GGRIVHH BioCyc:CHYD246194:GJCN-2048-MONOMER Uniprot:Q3AAG6
        Length = 504

 Score = 326 (119.8 bits), Expect = 7.0e-27, P = 7.0e-27
 Identities = 100/371 (26%), Positives = 177/371 (47%)

Query:   108 QMLYTESDLEKSMDKIETINFHEE-KDVNGIKFSAYNAGHVLGAAMFLIEIAGVK----I 162
             Q +YT  D   ++   + I        + G++ + ++AGH+LG+AM  I   G      I
Sbjct:   123 QPIYTADDAFNALAYFQKIPLETPITPLPGLEVTFFDAGHILGSAMIKIAYKGQDATRTI 182

Query:   163 LYTGDFSRQEDRHLMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGG 222
             L+TGD  R     +   +  P+  DIL+ ESTYG  V  +  + +    SLI  +  R G
Sbjct:   183 LFTGDLGRNGRPFMKEPQKVPLT-DILVLESTYGDRVRSEEGDLKTLLKSLIEKVYRRNG 241

Query:   223 RCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIR 282
               +IP FA+ R Q+L+ IL++    + E+  I +Y  S LA +   +++ Y    N+  +
Sbjct:   242 NLIIPAFAMERTQDLIYILNDLVE-NKEVPPIDVYIDSPLAVEITKLFKKYPMFFNEEYK 300

Query:   283 RQISI-NNPFVFK--HIS-NLKGIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKN 338
              +++  ++P  F   H S + +      +I   +++++ GM  +G  R   +      ++
Sbjct:   301 EKLNRGDDPLAFPGLHFSVSQEDSVKLNNISRAIIISASGMADAGRIRHHLKHNLWRPES 360

Query:   339 GVIIAGYCVEGTLAKTILSEPEEVIGMSGQRLPLKMSV-DYISFSAHTDYQQTSEFVREL 397
              V++ GY  + TL + +L   +EV  M G+ + +K  V  Y   SAH D ++   F+   
Sbjct:   361 AVLLVGYQAQDTLGRKLLDGAKEVKIM-GEEIAVKAEVYHYDGLSAHADQRELLAFIGRF 419

Query:   398 --RPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMELYNPR--NTVSVDLYFKGEKTAK 453
               +PA + LVHGE      LK  +  +Y       +  Y PR   T+S+     G K+ +
Sbjct:   420 SQKPAQIYLVHGEDEARLNLKKLIEEKYR------IPCYLPRYQETISLLANLPG-KSEE 472

Query:   454 VMGELAVENLK 464
             V+ +  +  LK
Sbjct:   473 VLIDKVITLLK 483

 Score = 164 (62.8 bits), Expect = 1.0e-08, P = 1.0e-08
 Identities = 68/303 (22%), Positives = 123/303 (40%)

Query:     3 TLKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDL-VESDQIDLLLISHFH 61
             T  GA   V  SC +        ++DCG+  G   +    + +      +I+ +L++H H
Sbjct:     4 TFFGAADTVTGSCYLFNVAGHKFLVDCGLFQGPKAIKERNYGEFPFNPREIEFILLTHAH 63

Query:    62 LDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTE-------------- 107
             +DH G +P  L+K GFKG  + T  T  +   +L D   V  +  E              
Sbjct:    64 IDHSGLIPK-LVKKGFKGTIYATEPTVDLAAVMLPDSGHVQEMEVERKNRKLRRAGKPEL 122

Query:   108 QMLYTESDLEKSMDKIETINFHEE-KDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTG 166
             Q +YT  D   ++   + I        + G++ + ++AGH+LG+AM  I   G     T 
Sbjct:   123 QPIYTADDAFNALAYFQKIPLETPITPLPGLEVTFFDAGHILGSAMIKIAYKGQDATRTI 182

Query:   167 DFSRQEDRHLMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLI 226
              F+    R+       P K  +        T+    R E EG   +L+  ++ +  R   
Sbjct:   183 LFTGDLGRNGRPFMKEPQKVPLTDILVLESTYGDRVRSE-EGDLKTLLKSLIEKVYRRNG 241

Query:   227 PVFALGRAQEL---LLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRR 283
              +     A E    L+ +      + E+  I +Y  S LA +   +++ Y    N+  + 
Sbjct:   242 NLIIPAFAMERTQDLIYILNDLVENKEVPPIDVYIDSPLAVEITKLFKKYPMFFNEEYKE 301

Query:   284 QIS 286
             +++
Sbjct:   302 KLN 304


>UNIPROTKB|E9PIL7 [details] [associations]
            symbol:CPSF3L "Integrator complex subunit 11" species:9606
            "Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 GO:GO:0016787 EMBL:AL139287
            HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI00977321
            ProteinModelPortal:E9PIL7 SMR:E9PIL7 Ensembl:ENST00000534345
            ArrayExpress:E9PIL7 Bgee:E9PIL7 Uniprot:E9PIL7
        Length = 140

 Score = 267 (99.0 bits), Expect = 2.9e-22, P = 2.9e-22
 Identities = 54/132 (40%), Positives = 81/132 (61%)

Query:     4 LKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLIS 58
             L GAGQ+VGRSCI++    K++M+DCG+H G +     P F  + ++    D +D ++IS
Sbjct:     9 LVGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIIS 68

Query:    59 HFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLE 117
             HFHLDHCGALP+F    G+ G  +MTH T+AI   LL DY K++ +   E   +T   ++
Sbjct:    69 HFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIK 128

Query:   118 KSMDKIETINFH 129
               M K+  ++ H
Sbjct:   129 DCMKKVVAVHLH 140


>POMBASE|SPBC1709.15c
            symbol:cft2 "cleavage factor two Cft2/polyadenylation
            factor CPSF-73 (predicted)" species:4896 "Schizosaccharomyces
            pombe" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=IDA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IDA]
            [GO:0005849 "mRNA cleavage factor complex" evidence=ISO]
            [GO:0006378 "mRNA polyadenylation" evidence=ISO] [GO:0006379 "mRNA
            cleavage" evidence=ISO] InterPro:IPR027075 PomBase:SPBC1709.15c
            Pfam:PF07521 EMBL:CU329671 GO:GO:0006378 GenomeReviews:CU329671_GR
            GO:GO:0005847 GO:GO:0006379 PIR:T39643 RefSeq:NP_595448.1
            ProteinModelPortal:O74740 STRING:O74740 EnsemblFungi:SPBC1709.15c.1
            GeneID:2539954 KEGG:spo:SPBC1709.15c eggNOG:COG1236 KO:K14402
            OMA:ISSIATP OrthoDB:EOG4WWVSN NextBio:20801097 InterPro:IPR022712
            InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
            Pfam:PF10996 Pfam:PF13299 SMART:SM01027 Uniprot:O74740
        Length = 797

 Score = 288 (106.4 bits), Expect = 6.3e-22, P = 6.3e-22
 Identities = 129/532 (24%), Positives = 233/532 (43%)

Query:    25 IMMDCGIH----PGLSGMDALPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGFKGR 80
             I +D GIH    PG    D+L   ++ E  Q DL+L+SH  L H G L +   K  +K  
Sbjct:    18 IELD-GIHIYIDPGSD--DSLKHPEVPE--QPDLILLSHSDLAHIGGLVYAYYKYDWKNA 72

Query:    81 -CFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEK----DVN 135
               + T  T  + R  + D IK + IS      +++D++   D I  + + +        +
Sbjct:    73 YIYATLPTINMGRMTMLDAIKSNYISD----MSKADVDAVFDSIIPLRYQQPTLLLGKCS 128

Query:   136 GIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPV--------KPD 187
             G+  +AYNAGH LG  ++ +      +LY  D++  +D+HL  A +           +P+
Sbjct:   129 GLTITAYNAGHTLGGTLWSLIKESESVLYAVDWNHSKDKHLNGAALYSNGHILEALNRPN 188

Query:   188 ILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSL 247
              LIT++         R++R+  F   +   + +GG  L+PV A  R  EL  ILD +WS 
Sbjct:   189 TLITDANNSLVSIPSRKKRDEAFIESVMSSLLKGGTVLLPVDAASRVLELCCILDNHWSA 248

Query:   248 -HPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISIN-NPFVFKHISNLKGIDHF 305
               P L   PI + S  + K +   ++ I  M D I R   IN N   F++I+ +      
Sbjct:   249 SQPPL-PFPILFLSPTSTKTIDYAKSMIEWMGDNIVRDFGINENLLEFRNINTITDFSQI 307

Query:   306 EDIGPC--VVMASPGMMQSGLSRELFEMWCTDAKNGVII---AGYCVEGTLAKTILSEPE 360
               IGP   V++A+   ++ G S+ +     ++  N +I+      C + +LA   +   E
Sbjct:   308 SHIGPGPKVILATALTLECGFSQRILLDLMSENSNDLILFTQRSRCPQNSLANQFIRYWE 367

Query:   361 EVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALT 420
                    + +P  + +          Y + +  ++   P     + GE+   S  +   +
Sbjct:   368 RA-SKKKRDIPHPVGL----------YAEQAVKIKTKEP-----LEGEELR-SYQELEFS 410

Query:   421 REYEDDPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAAL--SGIIVKRNF 478
             +  +D  +T++E  N R  +  DL      ++    +L +    P  AL  S  ++ ++F
Sbjct:   411 KRNKDAEDTALEFRN-RTILDEDL---SSSSSSEDDDLDLNTEVPHVALGSSAFLMGKSF 466

Query:   479 NYHLLAPSDLPKYTDLKASKIIQQQS-VYYSGSISVLRSLISHLAGPVETLD 529
             + +L  P+    +T  K    I+++  +   G I +     S +  P  TL+
Sbjct:   467 DLNLRDPAVQALHTKYKMFPYIEKRRRIDEYGEI-IKHQDFSMINEPANTLE 517


>UNIPROTKB|Q81SC3
            symbol:BA_1737 "Metallo-beta-lactamase family protein"
            species:1392 "Bacillus anthracis" [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR001279 SMART:SM00849 Pfam:PF07521
            EMBL:AE016879 EMBL:AE017334 GenomeReviews:AE016879_GR
            GenomeReviews:AE017334_GR GO:GO:0016787 InterPro:IPR022712
            InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 RefSeq:NP_844172.1
            RefSeq:YP_018378.1 ProteinModelPortal:Q81SC3 IntAct:Q81SC3
            DNASU:1086535 EnsemblBacteria:EBBACT00000009201
            EnsemblBacteria:EBBACT00000014472 GeneID:1086535 GeneID:2817971
            KEGG:ban:BA_1737 KEGG:bar:GBAA_1737 PATRIC:18781074
            HOGENOM:HOG000087450 OMA:SQHERVN ProtClustDB:CLSK2516952
            BioCyc:BANT261594:GJ7F-1754-MONOMER Uniprot:Q81SC3
        Length = 419

 Score = 272 (100.8 bits), Expect = 4.2e-21, P = 4.2e-21
 Identities = 98/363 (26%), Positives = 172/363 (47%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
             GAG E GRSC  ++ K   I+ DCGI+      D+ P ++      ++ + +SH H DH 
Sbjct:     8 GAG-EYGRSCYFVKNKETKILFDCGINRSYE--DSYPKIEREVVPFLEAVFLSHIHEDHT 64

Query:    66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQ---MLYTESDLEKSMDK 122
               LP  L K G+K + + T  TK     L + Y K  N +  Q   + Y + ++ K ++ 
Sbjct:    65 MGLP-LLAKYGYKKKIWTTRYTK---EQLPAYYEKWRNYNVTQGWNVPYNDQNV-KDLNY 119

Query:   123 I---ETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
             I   E  N +E   +   ++F    +GHVLG+  FL++++   + Y+GD+S + +  ++ 
Sbjct:   120 IYVDEISNPNEWIQITPTLRFQWGYSGHVLGSVWFLVDMSHTYVFYSGDYSAESN--ILR 177

Query:   179 AEIPP-VKPDI--LITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQ 235
             A +P  ++ DI   I ++ Y T    QRE R     + I       G  L+P+  LGRAQ
Sbjct:   178 ANLPEKLRGDIKVAIVDAAYHTDDVSQRE-RVNELCTEIERAAGNKGIALLPLPPLGRAQ 236

Query:   236 ELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKH 295
             +++L L E +   P + D  I          M +Y+ +I    +      S+    +   
Sbjct:   237 DIVLYLYEKYKEFPIIVDQEILDGFDE----MFLYKDWIKNNKELEELMESLKRNIIV-- 290

Query:   296 ISNLKGIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTI 355
             + +  G  H    G  +V+ S   MQ+  ++  +E    + +N +I  G+  +G+ A+ +
Sbjct:   291 MDDDGGTQH--SCG--IVVMSDANMQTKRAQLYYEQIRHEERNSIIFTGHVAKGSFAEKV 346

Query:   356 LSE 358
             L E
Sbjct:   347 LKE 349


>TIGR_CMR|BA_1737
            symbol:BA_1737 "metallo-beta-lactamase family protein"
            species:198094 "Bacillus anthracis str. Ames" [GO:0003824
            "catalytic activity" evidence=ISS] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR001279 SMART:SM00849 Pfam:PF07521
            EMBL:AE016879 EMBL:AE017334 GenomeReviews:AE016879_GR
            GenomeReviews:AE017334_GR GO:GO:0016787 InterPro:IPR022712
            InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 RefSeq:NP_844172.1
            RefSeq:YP_018378.1 ProteinModelPortal:Q81SC3 IntAct:Q81SC3
            DNASU:1086535 EnsemblBacteria:EBBACT00000009201
            EnsemblBacteria:EBBACT00000014472 GeneID:1086535 GeneID:2817971
            KEGG:ban:BA_1737 KEGG:bar:GBAA_1737 PATRIC:18781074
            HOGENOM:HOG000087450 OMA:SQHERVN ProtClustDB:CLSK2516952
            BioCyc:BANT261594:GJ7F-1754-MONOMER Uniprot:Q81SC3
        Length = 419

 Score = 272 (100.8 bits), Expect = 4.2e-21, P = 4.2e-21
 Identities = 98/363 (26%), Positives = 172/363 (47%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
             GAG E GRSC  ++ K   I+ DCGI+      D+ P ++      ++ + +SH H DH 
Sbjct:     8 GAG-EYGRSCYFVKNKETKILFDCGINRSYE--DSYPKIEREVVPFLEAVFLSHIHEDHT 64

Query:    66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQ---MLYTESDLEKSMDK 122
               LP  L K G+K + + T  TK     L + Y K  N +  Q   + Y + ++ K ++ 
Sbjct:    65 MGLP-LLAKYGYKKKIWTTRYTK---EQLPAYYEKWRNYNVTQGWNVPYNDQNV-KDLNY 119

Query:   123 I---ETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
             I   E  N +E   +   ++F    +GHVLG+  FL++++   + Y+GD+S + +  ++ 
Sbjct:   120 IYVDEISNPNEWIQITPTLRFQWGYSGHVLGSVWFLVDMSHTYVFYSGDYSAESN--ILR 177

Query:   179 AEIPP-VKPDI--LITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQ 235
             A +P  ++ DI   I ++ Y T    QRE R     + I       G  L+P+  LGRAQ
Sbjct:   178 ANLPEKLRGDIKVAIVDAAYHTDDVSQRE-RVNELCTEIERAAGNKGIALLPLPPLGRAQ 236

Query:   236 ELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKH 295
             +++L L E +   P + D  I          M +Y+ +I    +      S+    +   
Sbjct:   237 DIVLYLYEKYKEFPIIVDQEILDGFDE----MFLYKDWIKNNKELEELMESLKRNIIV-- 290

Query:   296 ISNLKGIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTI 355
             + +  G  H    G  +V+ S   MQ+  ++  +E    + +N +I  G+  +G+ A+ +
Sbjct:   291 MDDDGGTQH--SCG--IVVMSDANMQTKRAQLYYEQIRHEERNSIIFTGHVAKGSFAEKV 346

Query:   356 LSE 358
             L E
Sbjct:   347 LKE 349


>UNIPROTKB|Q74C32
            symbol:GSU1843 "RNA exonuclease, beta-lactamase fold
            protein" species:243231 "Geobacter sulfurreducens PCA" [GO:0008150
            "biological_process" evidence=ND] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 EMBL:AE017180 GenomeReviews:AE017180_GR
            GO:GO:0004527 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 HOGENOM:HOG000244774 KO:K07576 RefSeq:NP_952893.1
            ProteinModelPortal:Q74C32 GeneID:2688625 KEGG:gsu:GSU1843
            PATRIC:22026545 OMA:CHIDHVG ProtClustDB:CLSK2516562
            BioCyc:GSUL243231:GH27-1786-MONOMER Uniprot:Q74C32
        Length = 475

 Score = 162 (62.1 bits), Expect = 1.4e-19, Sum P(2) = 1.4e-19
 Identities = 44/152 (28%), Positives = 79/152 (51%)

Query:    14 SCIMLEFK-NKSIMMDCGIHPGLSGMDA--LPFVDLVESDQIDLLLISHFHLDHCGALPW 70
             SC  L    N +I++DCG+  G  G      PF+D    D++  L+++H H+DHCG +P 
Sbjct:    15 SCHELVISDNAAILIDCGLLQGNDGAGGKRFPFIDF-PLDRVKGLVLTHVHIDHCGRIP- 72

Query:    71 FLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTE--SDLEKSMDKIETINF 128
              LL  GF+G  + + A+  +   +L D +KV  I+ ++ L     + ++K +  +    +
Sbjct:    73 HLLGAGFQGPIWCSEASALLLPLVLEDAVKVG-ITRDEHLIARFLNAVKKRLVPLPYDRW 131

Query:   129 HEEKDVNGIKFSA--YNAGHVLGAAMFLIEIA 158
             H+    +G   S     AGH+LG+A   + ++
Sbjct:   132 HQLGSWDGRSASLRLQQAGHILGSAYVEVSVS 163

 Score = 151 (58.2 bits), Expect = 1.4e-19, Sum P(2) = 1.4e-19
 Identities = 45/153 (29%), Positives = 72/153 (47%)

Query:   162 ILYTGDFSRQEDRHLMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRG 221
             ++++GD        L+    PP + DIL+ ESTYG   HE RE+R  R   +I   +   
Sbjct:   183 VVFSGDLGAPFTP-LLPDPKPPERADILVLESTYGDRQHEGREQRRERLCRVIVRALENR 241

Query:   222 GRCLIPVFALGRAQELLLILDEYWSLH--PEL------HDIPIYYASSLAKKCMSVYQTY 273
             G  L+P F++GR QELL  +++  S H   E        D+ I   S LA     VY   
Sbjct:   242 GALLVPAFSIGRTQELLYEIEDLISRHRTEEAAAGLPWDDLEIIVDSPLALSVTRVYDRL 301

Query:   274 INAMNDRIRRQISIN-NPFVFKHISNLKG-IDH 304
                 ++     ++ N +P  F+ ++ ++   DH
Sbjct:   302 RRLWDEEALETVAQNRHPLSFEQMTVIESHADH 334

 Score = 135 (52.6 bits), Expect = 6.7e-18, Sum P(2) = 6.7e-18
 Identities = 48/190 (25%), Positives = 80/190 (42%)

Query:   253 DIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISIN-NPFVFKHISNLKG-IDHFEDIG- 309
             D+ I   S LA     VY       ++     ++ N +P  F+ ++ ++   DH   +  
Sbjct:   281 DLEIIVDSPLALSVTRVYDRLRRLWDEEALETVAQNRHPLSFEQMTVIESHADHRATVEY 340

Query:   310 ------PCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEV- 362
                   PC+V+A+ GM   G      +    D +  ++  GY   GT  + IL   ++  
Sbjct:   341 LRKTARPCIVIAAGGMCAGGRIVNYLKALMPDPRTDILFVGYQAAGTPGREILEAAKQKW 400

Query:   363 -------IGMSGQRLPLKMSVDYIS-FSAHTDYQQTSEFVRELR--PAHVVLVHGEQNEM 412
                    I + G   PL+ +V  IS +SAH D +   EFV  +   P  + LVHGE+   
Sbjct:   401 ETGGRPSIDLDGGTYPLRAAVHTISGYSAHADQRDLVEFVEGITVPPKTIRLVHGEEEAR 460

Query:   413 SRLKAALTRE 422
             + L   L  +
Sbjct:   461 TALAGVLAEK 470


>TIGR_CMR|GSU_1843
            symbol:GSU_1843 "metallo-beta-lactamase family protein"
            species:243231 "Geobacter sulfurreducens PCA" [GO:0003824
            "catalytic activity" evidence=ISS] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 EMBL:AE017180 GenomeReviews:AE017180_GR GO:GO:0004527
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            HOGENOM:HOG000244774 KO:K07576 RefSeq:NP_952893.1
            ProteinModelPortal:Q74C32 GeneID:2688625 KEGG:gsu:GSU1843
            PATRIC:22026545 OMA:CHIDHVG ProtClustDB:CLSK2516562
            BioCyc:GSUL243231:GH27-1786-MONOMER Uniprot:Q74C32
        Length = 475

 Score = 162 (62.1 bits), Expect = 1.4e-19, Sum P(2) = 1.4e-19
 Identities = 44/152 (28%), Positives = 79/152 (51%)

Query:    14 SCIMLEFK-NKSIMMDCGIHPGLSGMDA--LPFVDLVESDQIDLLLISHFHLDHCGALPW 70
             SC  L    N +I++DCG+  G  G      PF+D    D++  L+++H H+DHCG +P 
Sbjct:    15 SCHELVISDNAAILIDCGLLQGNDGAGGKRFPFIDF-PLDRVKGLVLTHVHIDHCGRIP- 72

Query:    71 FLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTE--SDLEKSMDKIETINF 128
              LL  GF+G  + + A+  +   +L D +KV  I+ ++ L     + ++K +  +    +
Sbjct:    73 HLLGAGFQGPIWCSEASALLLPLVLEDAVKVG-ITRDEHLIARFLNAVKKRLVPLPYDRW 131

Query:   129 HEEKDVNGIKFSA--YNAGHVLGAAMFLIEIA 158
             H+    +G   S     AGH+LG+A   + ++
Sbjct:   132 HQLGSWDGRSASLRLQQAGHILGSAYVEVSVS 163

 Score = 151 (58.2 bits), Expect = 1.4e-19, Sum P(2) = 1.4e-19
 Identities = 45/153 (29%), Positives = 72/153 (47%)

Query:   162 ILYTGDFSRQEDRHLMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRG 221
             ++++GD        L+    PP + DIL+ ESTYG   HE RE+R  R   +I   +   
Sbjct:   183 VVFSGDLGAPFTP-LLPDPKPPERADILVLESTYGDRQHEGREQRRERLCRVIVRALENR 241

Query:   222 GRCLIPVFALGRAQELLLILDEYWSLH--PEL------HDIPIYYASSLAKKCMSVYQTY 273
             G  L+P F++GR QELL  +++  S H   E        D+ I   S LA     VY   
Sbjct:   242 GALLVPAFSIGRTQELLYEIEDLISRHRTEEAAAGLPWDDLEIIVDSPLALSVTRVYDRL 301

Query:   274 INAMNDRIRRQISIN-NPFVFKHISNLKG-IDH 304
                 ++     ++ N +P  F+ ++ ++   DH
Sbjct:   302 RRLWDEEALETVAQNRHPLSFEQMTVIESHADH 334

 Score = 135 (52.6 bits), Expect = 6.7e-18, Sum P(2) = 6.7e-18
 Identities = 48/190 (25%), Positives = 80/190 (42%)

Query:   253 DIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISIN-NPFVFKHISNLKG-IDHFEDIG- 309
             D+ I   S LA     VY       ++     ++ N +P  F+ ++ ++   DH   +  
Sbjct:   281 DLEIIVDSPLALSVTRVYDRLRRLWDEEALETVAQNRHPLSFEQMTVIESHADHRATVEY 340

Query:   310 ------PCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEV- 362
                   PC+V+A+ GM   G      +    D +  ++  GY   GT  + IL   ++  
Sbjct:   341 LRKTARPCIVIAAGGMCAGGRIVNYLKALMPDPRTDILFVGYQAAGTPGREILEAAKQKW 400

Query:   363 -------IGMSGQRLPLKMSVDYIS-FSAHTDYQQTSEFVRELR--PAHVVLVHGEQNEM 412
                    I + G   PL+ +V  IS +SAH D +   EFV  +   P  + LVHGE+   
Sbjct:   401 ETGGRPSIDLDGGTYPLRAAVHTISGYSAHADQRDLVEFVEGITVPPKTIRLVHGEEEAR 460

Query:   413 SRLKAALTRE 422
             + L   L  +
Sbjct:   461 TALAGVLAEK 470


>UNIPROTKB|E9PQF0
            symbol:CPSF3L "Integrator complex subunit 11" species:9606
            "Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 GO:GO:0016787 EMBL:AL139287
            HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI00982774
            ProteinModelPortal:E9PQF0 SMR:E9PQF0 Ensembl:ENST00000498476
            ArrayExpress:E9PQF0 Bgee:E9PQF0 Uniprot:E9PQF0
        Length = 167

 Score = 242 (90.2 bits), Expect = 1.5e-19, P = 1.5e-19
 Identities = 47/98 (47%), Positives = 65/98 (66%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLISHF 60
             GAGQ+VGRSCI++    K++M+DCG+H G +     P F  + ++    D +D ++ISHF
Sbjct:    70 GAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHF 129

Query:    61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDY 98
             HLDHCGALP+F    G+ G  +MTH T+AI   LL DY
Sbjct:   130 HLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDY 167


>DICTYBASE|DDB_G0282473
            symbol:ints9 "integrator complex subunit 9"
            species:44689 "Dictyostelium discoideum" [GO:0032039 "integrator
            complex" evidence=IEA] [GO:0016180 "snRNA processing" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0044351 "macropinocytosis"
            evidence=RCA] InterPro:IPR027074 dictyBase:DDB_G0282473
            GO:GO:0005634 EMBL:AAFI02000047 GenomeReviews:CM000152_GR
            eggNOG:COG1236 InterPro:IPR022712 Pfam:PF10996 SMART:SM01027
            KO:K13146 PANTHER:PTHR11203:SF2 RefSeq:XP_640069.1
            ProteinModelPortal:Q54SH0 EnsemblProtists:DDB0234099 GeneID:8623598
            KEGG:ddi:DDB_G0282473 OMA:DDFSTID ProtClustDB:CLSZ2729002
            Uniprot:Q54SH0
        Length = 712

 Score = 197 (74.4 bits), Expect = 5.8e-17, Sum P(3) = 5.8e-17
 Identities = 61/243 (25%), Positives = 115/243 (47%)

Query:   110 LYTESDLEKSMDKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGV-KILYTGDF 168
             LY + D+EKS +KI++I F+E     G +    ++G+ LG+A ++IE  G  +++Y  D 
Sbjct:   217 LYKKIDIEKSFEKIQSIRFNESIKHYGFECIPSSSGYGLGSANWVIESKGFERVVYISDS 276

Query:   169 SRQEDRHLMAAEIPPV-KPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIP 227
             S    R+    ++ P+  PD+LI  S    + +   ++      S I   + +GG  LIP
Sbjct:   277 SLSLSRYPTPFQLSPIDNPDVLIL-SKINHYPNNPPDQMLSELCSNIGSTLQQGGTVLIP 335

Query:   228 VFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMN-DRIRRQIS 286
              ++ G   +L   L +Y +    L  +PIY+ SS++K  +S    Y   +N  +  R   
Sbjct:   336 SYSCGIILDLFEHLADYLN-KVGLPYVPIYFVSSVSKAVLSYADIYSEWLNKSKQERAFM 394

Query:   287 INNPFVFKHI---SNLKGIDH----FEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNG 339
                PF+ + +      +   H    F+   PC++       + G    L +++  + KN 
Sbjct:   395 PETPFLHQDLMRKGQFQAYQHVHSNFQANDPCIIFTGHPSCRIGDITTLIKLY-DNPKNS 453

Query:   340 VII 342
             +++
Sbjct:   454 ILL 456

 Score = 79 (32.9 bits), Expect = 5.8e-17, Sum P(3) = 5.8e-17
 Identities = 24/95 (25%), Positives = 50/95 (52%)

Query:    42 PFVDLVES-DQIDLLLISHFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIK 100
             P  ++++    ID++LIS++   +  ALP+    T F+G+ + T  T  I + LL + ++
Sbjct:   106 PQFEMIDDFSTIDMILISNY--TNIYALPFITEYTNFQGKIYATEPTVQIGKLLLEELVQ 163

Query:   101 V------SNISTEQMLYTESDLEKSMDKIETINFH 129
             +      S+I+        SD  ++++ +E +N H
Sbjct:   164 MDKQYSNSSINNNNNNNNLSDCWQNIEILEKLNVH 198

 Score = 58 (25.5 bits), Expect = 5.8e-17, Sum P(3) = 5.8e-17
 Identities = 9/23 (39%), Positives = 14/23 (60%)

Query:     9 QEVGRSCIMLEFKNKSIMMDCGI 31
             Q     C +LE+KN  I++DC +
Sbjct:     8 QSAQSPCFLLEYKNVKILLDCAL 30


>RGD|1311539
            symbol:Ints9 "integrator complex subunit 9" species:10116
            "Rattus norvegicus" [GO:0016180 "snRNA processing"
            evidence=IEA;ISO] [GO:0032039 "integrator complex"
            evidence=IEA;ISO] InterPro:IPR027074 RGD:1311539 InterPro:IPR022712
            Pfam:PF10996 SMART:SM01027 GO:GO:0032039 GO:GO:0016180
            PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445 IPI:IPI00362364
            Ensembl:ENSRNOT00000018071 Uniprot:F1M365
        Length = 659

 Score = 191 (72.3 bits), Expect = 3.0e-16, Sum P(2) = 3.0e-16
 Identities = 111/493 (22%), Positives = 201/493 (40%)

Query:    88 KAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNG-IKFSAYNAGH 146
             K I R L S       +ST +  YT  ++  ++ KI+ + + ++ ++ G ++ +  ++G+
Sbjct:   160 KEIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQKIELFGAVQVTPLSSGY 219

Query:   147 VLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK-PDILITESTYGTHVHEQREE 205
              LG++ ++I+    K+ Y    S     H    +   +K  D+LI      T +     +
Sbjct:   220 ALGSSNWIIQSHYEKVSYVSGSSLLTT-HPQPMDQASLKNSDVLILTGL--TQIPTANPD 276

Query:   206 -REGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAK 264
                G F S +   V  GG  L+P +  G   +LL  L +Y      L +IP Y+ S +A 
Sbjct:   277 GMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYID-SAGLSNIPFYFISPVAN 335

Query:   265 KCMSVYQTYINAMNDRIRRQISINNP-FVFKHISNLKGIDHFEDI-G--------PCVVM 314
               +   Q +   +    + ++ +  P F    +     + H+  I G        PCV+ 
Sbjct:   336 SSLEFSQIFAEWLCHNKQSKVYLPEPPFPHAELIQTNKLKHYRSIHGDFSHDFRQPCVLF 395

Query:   315 ASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPE-EVIGMSGQRLPLK 373
                  ++ G      E+W   + N VI               +EP+   +       PL 
Sbjct:   396 TGHPSLRFGDVVHFMELWGKSSLNTVIF--------------TEPDFSYLEALAPYQPLA 441

Query:   374 MSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMEL 433
             M   Y       ++ Q S+ ++E++P HVV    EQ        +   +   D       
Sbjct:   442 MKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCP--EQYTQPPPAQSHRMDLMIDCQPPAMS 499

Query:   434 YNPRNTVSVDLYFKGEKTAKVMGELAVE----NLKPDAALSGI--IVKRNFNYHLLAPSD 487
             Y     +++    + EK  ++M ELA       +KP  +L+ +  ++    N H+L P  
Sbjct:   500 YRRAEVLALPFKRRYEKI-EIMPELADSLVPMEIKPGISLATVSAVLHTKDNKHVLQPP- 557

Query:   488 LPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLDEK-RLRAFACIEI--TLE 544
              PK T   +SK  ++ S     S  VL+ L+S    PVE   +      F+ I++  T +
Sbjct:   558 -PKPTQPTSSKKRKRVSEDVPDS-KVLKPLLSGSI-PVEQFVQTLEKHGFSDIKVEDTAK 614

Query:   545 KCIVVLEWASNPI 557
               IV+L+ A   I
Sbjct:   615 GHIVLLQEAETLI 627

 Score = 93 (37.8 bits), Expect = 3.0e-16, Sum P(2) = 3.0e-16
 Identities = 24/82 (29%), Positives = 42/82 (51%)

Query:    20 FKNKSIMMDCGIHPGLSGMD--ALPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGF 77
             F +K +  +C  H  +  +    LP  +L++   +D++LIS++H     ALP+    TGF
Sbjct:    64 FLDKELK-ECSGHVFVDSVPEFCLPETELIDLSTVDVILISNYHC--MMALPYITEHTGF 120

Query:    78 KGRCFMTHATKAIYRWLLSDYI 99
              G  + T  T  I R L+ + +
Sbjct:   121 TGTVYATEPTMQIGRLLMEELV 142

 Score = 67 (28.6 bits), Expect = 1.5e-13, Sum P(2) = 1.5e-13
 Identities = 16/41 (39%), Positives = 28/41 (68%)

Query:    15 CIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLL 55
             C +L+FK+ +IM+DCG+    S ++ LP + LV+S ++  L
Sbjct:    15 CNVLKFKSTTIMLDCGLDM-TSTLNFLP-LPLVQSPRLSNL 53


>MGI|MGI:1098533
            symbol:Ints9 "integrator complex subunit 9" species:10090
            "Mus musculus" [GO:0003674 "molecular_function" evidence=ND]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0016180 "snRNA processing"
            evidence=ISO] [GO:0032039 "integrator complex" evidence=ISO]
            InterPro:IPR027074 MGI:MGI:1098533 eggNOG:COG1236
            InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 GO:GO:0032039
            GO:GO:0016180 CTD:55756 HOGENOM:HOG000045359 HOVERGEN:HBG081802
            KO:K13146 OrthoDB:EOG415GCW PANTHER:PTHR11203:SF2
            GeneTree:ENSGT00390000001445 EMBL:AK038979 EMBL:AK077634
            EMBL:AK136992 EMBL:AK150195 EMBL:BC028953 EMBL:BC055700
            IPI:IPI00223422 IPI:IPI00406798 RefSeq:NP_001240660.1
            RefSeq:NP_700463.2 UniGene:Mm.71332 ProteinModelPortal:Q8K114
            SMR:Q8K114 STRING:Q8K114 PhosphoSite:Q8K114 PaxDb:Q8K114
            PRIDE:Q8K114 Ensembl:ENSMUST00000043914 GeneID:210925
            KEGG:mmu:210925 UCSC:uc007uiv.1 UCSC:uc007uiw.1 InParanoid:Q8K114
            NextBio:373083 Bgee:Q8K114 CleanEx:MM_INTS9 Genevestigator:Q8K114
            Uniprot:Q8K114
        Length = 658

 Score = 186 (70.5 bits), Expect = 4.0e-16, Sum P(2) = 4.0e-16
 Identities = 110/494 (22%), Positives = 201/494 (40%)

Query:    88 KAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNG-IKFSAYNAGH 146
             K I R L S       +ST +  YT  ++  ++ KI+ + + ++ ++ G ++ +  ++G+
Sbjct:   159 KDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQKIELFGAVQVTPLSSGY 218

Query:   147 VLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK-PDILITESTYGTHVHEQREE 205
              LG++ ++I+    K+ Y    S     H    +   +K  D+LI      T +     +
Sbjct:   219 ALGSSNWIIQSHYEKVSYVSGSSLLTT-HPQPMDQASLKNSDVLILTGL--TQIPTANPD 275

Query:   206 -REGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAK 264
                G F S +   V  GG  L+P +  G   +LL  L +Y      L +IP Y+ S +A 
Sbjct:   276 GMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYID-SAGLSNIPFYFISPVAN 334

Query:   265 KCMSVYQTYINAMNDRIRRQISINNP-FVFKHISNLKGIDHFEDI-G--------PCVVM 314
               +   Q +   +    + ++ +  P F    +     + H+  I G        PCV+ 
Sbjct:   335 SSLEFSQIFAEWLCHNKQSKVYLPEPPFPHAELIQTNKLKHYRSIHGDFSNDFRQPCVLF 394

Query:   315 ASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPE-EVIGMSGQRLPLK 373
                  ++ G      E+W   + N +I               +EP+   +       PL 
Sbjct:   395 TGHPSLRFGDVVHFMELWGKSSLNTIIF--------------TEPDFSYLEALAPYQPLA 440

Query:   374 MSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMEL 433
             M   Y       ++ Q S+ ++E++P HVV    EQ        A   +   D       
Sbjct:   441 MKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCP--EQYTQPPPAQAHRMDLMIDCQPPAMS 498

Query:   434 YNPRNTVSVDLYFKGEKTAKVMGELAVE----NLKPDAALSGI--IVKRNFNYHLLAPSD 487
             Y     +++    + EK  ++M ELA       +KP  +L+ +  ++    N H+L P  
Sbjct:   499 YRRAEVLALPFKRRYEKI-EIMPELADSLVPMEIKPGISLATVSAVLHTKDNKHVLQPP- 556

Query:   488 LPKYTDLKASKIIQQQSVYYS-GSISVLRSLISHLAGPVETLDEK-RLRAFACIEI--TL 543
              PK T   +SK  +++ V        VL+ L+S    PVE   +      F+ I++  T 
Sbjct:   557 -PKPTQPTSSK--KRKRVNEDIPDCKVLKPLLSGSI-PVEQFVQTLEKHGFSDIKVEDTA 612

Query:   544 EKCIVVLEWASNPI 557
             +  IV+L+ A   I
Sbjct:   613 KGHIVLLQEAETLI 626

 Score = 97 (39.2 bits), Expect = 4.0e-16, Sum P(2) = 4.0e-16
 Identities = 27/104 (25%), Positives = 51/104 (49%)

Query:    20 FKNKSIMMDCGIHPGLSGMD--ALPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGF 77
             F +K +  +C  H  +  +    LP  +L++   +D++LIS++H     ALP+    TGF
Sbjct:    63 FLDKELK-ECSGHVFVDSVPEFCLPETELIDLSTVDVILISNYHC--MMALPYITEHTGF 119

Query:    78 KGRCFMTHATKAIYRWLLSDYIK-VSNISTEQM--LYTESDLEK 118
              G  + T  T  I R L+ + +  +  +   Q   L+   D+++
Sbjct:   120 TGTVYATEPTMQIGRLLMEELVNFIERVPKAQSASLWKNKDIQR 163

 Score = 67 (28.6 bits), Expect = 5.2e-13, Sum P(2) = 5.2e-13
 Identities = 16/41 (39%), Positives = 28/41 (68%)

Query:    15 CIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLL 55
             C +L+FK+ +IM+DCG+    S ++ LP + LV+S ++  L
Sbjct:    14 CNVLKFKSTTIMLDCGLDM-TSTLNFLP-LPLVQSPRLSNL 52


>UNIPROTKB|Q9NV88
            symbol:INTS9 "Integrator complex subunit 9" species:9606
            "Homo sapiens" [GO:0005515 "protein binding" evidence=IPI]
            [GO:0016180 "snRNA processing" evidence=IDA] [GO:0032039
            "integrator complex" evidence=IDA] InterPro:IPR027074
            eggNOG:COG1236 InterPro:IPR022712 Pfam:PF10996 SMART:SM01027
            GO:GO:0032039 EMBL:U96629 GO:GO:0016180 CTD:55756
            HOGENOM:HOG000045359 HOVERGEN:HBG081802 KO:K13146 OrthoDB:EOG415GCW
            PANTHER:PTHR11203:SF2 OMA:PLAMKCV EMBL:AK001733 EMBL:AK298468
            EMBL:AK300593 EMBL:AC040975 EMBL:AC131969 EMBL:BC025267
            EMBL:BK005726 EMBL:BK005674 IPI:IPI00290514 IPI:IPI00871167
            RefSeq:NP_001138631.1 RefSeq:NP_001166033.1 RefSeq:NP_060720.2
            UniGene:Hs.162397 ProteinModelPortal:Q9NV88 SMR:Q9NV88
            IntAct:Q9NV88 STRING:Q9NV88 PhosphoSite:Q9NV88 DMDM:119371246
            PaxDb:Q9NV88 PRIDE:Q9NV88 DNASU:55756 Ensembl:ENST00000416984
            Ensembl:ENST00000521022 Ensembl:ENST00000521777 GeneID:55756
            KEGG:hsa:55756 UCSC:uc003xha.3 GeneCards:GC08M028625
            HGNC:HGNC:25592 MIM:611352 neXtProt:NX_Q9NV88 PharmGKB:PA162392192
            InParanoid:Q9NV88 PhylomeDB:Q9NV88 ChiTaRS:INTS9 GenomeRNAi:55756
            NextBio:60763 ArrayExpress:Q9NV88 Bgee:Q9NV88 CleanEx:HS_INTS9
            Genevestigator:Q9NV88 GermOnline:ENSG00000104299 Uniprot:Q9NV88
        Length = 658

 Score = 182 (69.1 bits), Expect = 1.4e-15, Sum P(2) = 1.4e-15
 Identities = 108/494 (21%), Positives = 196/494 (39%)

Query:    88 KAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNG-IKFSAYNAGH 146
             K I R L S       +ST +  YT  ++  ++ KI+ + + ++ ++ G ++ +  ++G+
Sbjct:   159 KDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQKIELFGAVQVTPLSSGY 218

Query:   147 VLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK-PDILITESTYGTHVHEQREE 205
              LG++ ++I+    K+ Y    S     H    +   +K  D+L+      T +     +
Sbjct:   219 ALGSSNWIIQSHYEKVSYVSGSSLLTT-HPQPMDQASLKNSDVLVLTGL--TQIPTANPD 275

Query:   206 -REGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAK 264
                G F S +   V  GG  L+P +  G   +LL  L +Y      L  +P+Y+ S +A 
Sbjct:   276 GMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYID-SAGLSSVPLYFISPVAN 334

Query:   265 KCMSVYQTYINAMNDRIRRQISINNP-FVFKHISNLKGIDHFEDI-G--------PCVVM 314
               +   Q +   +    + ++ +  P F    +     + H+  I G        PCVV 
Sbjct:   335 SSLEFSQIFAEWLCHNKQSKVYLPEPPFPHAELIQTNKLKHYPSIHGDFSNDFRQPCVVF 394

Query:   315 ASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPE-EVIGMSGQRLPLK 373
                  ++ G      E+W   + N VI               +EP+   +       PL 
Sbjct:   395 TGHPSLRFGDVVHFMELWGKSSLNTVIF--------------TEPDFSYLEALAPYQPLA 440

Query:   374 MSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMEL 433
             M   Y       ++ Q S+ ++E++P HVV    EQ        +   +   D       
Sbjct:   441 MKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCP--EQYTQPPPAQSHRMDLMIDCQPPAMS 498

Query:   434 YNPRNTVSVDLYFKGEKTAKVMGELAVE----NLKPDAALSGI--IVKRNFNYHLLAPSD 487
             Y     +++    + EK  ++M ELA       +KP  +L+ +  ++    N HLL P  
Sbjct:   499 YRRAEVLALPFKRRYEKI-EIMPELADSLVPMEIKPGISLATVSAVLHTKDNKHLLQPPP 557

Query:   488 LPKY-TDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLDEK-RLRAFACIEI--TL 543
              P   T  K  K +            VL+ L+S    PVE   +      F+ I++  T 
Sbjct:   558 RPAQPTSGKKRKRVSDDVP----DCKVLKPLLSGSI-PVEQFVQTLEKHGFSDIKVEDTA 612

Query:   544 EKCIVVLEWASNPI 557
             +  IV+L+ A   I
Sbjct:   613 KGHIVLLQEAETLI 626

 Score = 96 (38.9 bits), Expect = 1.4e-15, Sum P(2) = 1.4e-15
 Identities = 27/104 (25%), Positives = 51/104 (49%)

Query:    20 FKNKSIMMDCGIHPGLSGMD--ALPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGF 77
             F +K +  +C  H  +  +    LP  +L++   +D++LIS++H     ALP+    TGF
Sbjct:    63 FLDKELK-ECSGHVFVDSVPEFCLPETELIDLSTVDVILISNYHC--MMALPYITEHTGF 119

Query:    78 KGRCFMTHATKAIYRWLLSDYIK-VSNISTEQM--LYTESDLEK 118
              G  + T  T  I R L+ + +  +  +   Q   L+   D+++
Sbjct:   120 TGTVYATEPTVQIGRLLMEELVNFIERVPKAQSASLWKNKDIQR 163

 Score = 67 (28.6 bits), Expect = 1.4e-12, Sum P(2) = 1.4e-12
 Identities = 16/41 (39%), Positives = 28/41 (68%)

Query:    15 CIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLL 55
             C +L+FK+ +IM+DCG+    S ++ LP + LV+S ++  L
Sbjct:    14 CNVLKFKSTTIMLDCGLDM-TSTLNFLP-LPLVQSPRLSNL 52


>UNIPROTKB|F6XI08
            symbol:INTS9 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0032039 "integrator complex" evidence=IEA]
            [GO:0016180 "snRNA processing" evidence=IEA] InterPro:IPR027074
            InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 CTD:55756 KO:K13146
            PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445 OMA:PLAMKCV
            Ensembl:ENSCAFT00000013124 EMBL:AAEX03014336 RefSeq:XP_543216.2
            GeneID:486090 KEGG:cfa:486090 Uniprot:F6XI08
        Length = 658

 Score = 180 (68.4 bits), Expect = 2.4e-15, Sum P(2) = 2.4e-15
 Identities = 112/494 (22%), Positives = 199/494 (40%)

Query:    88 KAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNG-IKFSAYNAGH 146
             K I R L S       +ST +  YT  ++  ++ KI+ + + ++ ++ G ++ +  ++G+
Sbjct:   159 KDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQKIELFGAVQVTPLSSGY 218

Query:   147 VLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK-PDILITESTYGTHVHEQREE 205
              LG++ ++I+    K+ Y    S     H    +   +K  D+LI      T +     +
Sbjct:   219 ALGSSNWIIQSHYEKVSYVSGSSLLTT-HPQPMDQASLKNSDVLILTGL--TQIPTANPD 275

Query:   206 -REGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAK 264
                G F S +   V  GG  L+P +  G   +LL  L +Y      L +IP Y+ S +A 
Sbjct:   276 GMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYID-SAGLSNIPFYFISPVAN 334

Query:   265 KCMSVYQTYINAMNDRIRRQISINNP-FV---------FKHISNLKGIDHFEDIG-PCVV 313
               +   Q +   +    + ++ +  P F           KH  +L G D   D   PCVV
Sbjct:   335 SSLEFSQIFAEWLCHNKQTKVYLPEPPFPHAELIQTNKLKHYPSLHG-DFSSDFRQPCVV 393

Query:   314 MASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPE-EVIGMSGQRLPL 372
                   ++ G      E+W   + N VI               +EP+   +       PL
Sbjct:   394 FTGHPSLRFGDVVHFMELWGKSSLNTVIF--------------TEPDFSYLEALAPYQPL 439

Query:   373 KMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSME 432
              M   Y       ++ Q S+ ++E++P HVV    EQ        +   +   D      
Sbjct:   440 AMKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCP--EQYTQPPPAQSHRMDLMIDCQPPAM 497

Query:   433 LYNPRNTVSVDLYFKGEKTAKVMGELAVE----NLKPDAALSGI--IVKRNFNYHLLAPS 486
              Y     +++    + EK  ++M ELA       +KP  +L+ +  ++    N H+L P 
Sbjct:   498 SYRRAEVLALPFKRRYEKI-EIMPELADALVPMEIKPGISLATVSAVLHTKDNKHVLQPP 556

Query:   487 DLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLDEK-RLRAFACIEI--TL 543
               P+ T     K  ++ S        VL+ L+S    PVE   +      F+ I++  T 
Sbjct:   557 --PRPTQPTGGKKRKRASDDIP-DCKVLKPLLSGSI-PVEQFVQTLEKHGFSDIKVEDTA 612

Query:   544 EKCIVVLEWASNPI 557
             +  IV+L+ A   I
Sbjct:   613 KGHIVLLQEAETLI 626

 Score = 96 (38.9 bits), Expect = 2.4e-15, Sum P(2) = 2.4e-15
 Identities = 27/104 (25%), Positives = 51/104 (49%)

Query:    20 FKNKSIMMDCGIHPGLSGMD--ALPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGF 77
             F +K +  +C  H  +  +    LP  +L++   +D++LIS++H     ALP+    TGF
Sbjct:    63 FLDKELK-ECSGHVFVDSVPEFCLPETELIDLSTVDVILISNYHC--MMALPYITEHTGF 119

Query:    78 KGRCFMTHATKAIYRWLLSDYIK-VSNISTEQM--LYTESDLEK 118
              G  + T  T  I R L+ + +  +  +   Q   L+   D+++
Sbjct:   120 TGTVYATEPTVQIGRLLMEELVNFIERVPKAQSASLWKNKDIQR 163

 Score = 67 (28.6 bits), Expect = 2.4e-12, Sum P(2) = 2.4e-12
 Identities = 16/41 (39%), Positives = 28/41 (68%)

Query:    15 CIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLL 55
             C +L+FK+ +IM+DCG+    S ++ LP + LV+S ++  L
Sbjct:    14 CNVLKFKSTTIMLDCGLDM-TSTLNFLP-LPLVQSPRLSNL 52


>UNIPROTKB|Q2KJA6
            symbol:INTS9 "Integrator complex subunit 9" species:9913
            "Bos taurus" [GO:0032039 "integrator complex" evidence=IEA]
            [GO:0016180 "snRNA processing" evidence=IEA] InterPro:IPR027074
            GO:GO:0005634 eggNOG:COG1236 InterPro:IPR022712 Pfam:PF10996
            SMART:SM01027 EMBL:BC105437 IPI:IPI00701634 RefSeq:NP_001039828.1
            UniGene:Bt.91042 ProteinModelPortal:Q2KJA6 STRING:Q2KJA6
            GeneID:533964 KEGG:bta:533964 CTD:55756 HOGENOM:HOG000045359
            HOVERGEN:HBG081802 InParanoid:Q2KJA6 KO:K13146 OrthoDB:EOG415GCW
            NextBio:20876211 PANTHER:PTHR11203:SF2 Uniprot:Q2KJA6
        Length = 658

 Score = 178 (67.7 bits), Expect = 3.9e-15, Sum P(2) = 3.9e-15
 Identities = 91/422 (21%), Positives = 167/422 (39%)

Query:    88 KAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNG-IKFSAYNAGH 146
             K I R L S       +ST +  YT  ++  ++ KI+ + + ++ ++ G ++ +  ++G+
Sbjct:   159 KDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQKIELFGAVQVTPLSSGY 218

Query:   147 VLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK-PDILITESTYGTHVHEQR-E 204
              LG++ ++I+    K+ Y    S     H    +   +K  D+LI      T +     +
Sbjct:   219 ALGSSNWIIQSHYEKVSYVSGSSLLTT-HPQPMDQASLKNSDVLILTGL--TQIPTANPD 275

Query:   205 EREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAK 264
                G F S +   V  GG  L+P +  G   +LL  L +Y      L  IP Y+ S +A 
Sbjct:   276 SMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYID-SAGLSSIPFYFISPVAN 334

Query:   265 KCMSVYQTYINAMNDRIRRQISINNP-FVFKHISNLKGIDHFEDI-G--------PCVVM 314
               +   Q +   +    + ++ +  P F    +     + H+  I G        PCVV 
Sbjct:   335 SSLEFSQIFAEWLCHNKQTKVYLPEPPFPHAELIQTNKLKHYPSIHGDFSNDFRQPCVVF 394

Query:   315 ASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPE-EVIGMSGQRLPLK 373
                  ++ G      E+W   + N VI               +EP+   +       PL 
Sbjct:   395 TGHPSLRFGDVVHFMELWGKSSLNTVIF--------------TEPDFSYLEALAPYQPLA 440

Query:   374 MSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMEL 433
             M   Y       ++ Q S+ ++E++P HVV    EQ        +   +   D       
Sbjct:   441 MKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCP--EQYTQPTPAQSHRMDLMVDCQPPAMS 498

Query:   434 YNPRNTVSVDLYFKGEKTAKVMGELAVE----NLKPDAALSGI--IVKRNFNYHLLAPSD 487
             Y     +++    + EK  ++M ELA       +KP  +L+ +  ++    N H+L P  
Sbjct:   499 YRRAEVLALPFKRRYEKI-EIMPELADSLVPMEIKPGISLATVSAVLHTKDNKHVLQPPP 557

Query:   488 LP 489
              P
Sbjct:   558 RP 559

 Score = 96 (38.9 bits), Expect = 3.9e-15, Sum P(2) = 3.9e-15
 Identities = 27/104 (25%), Positives = 51/104 (49%)

Query:    20 FKNKSIMMDCGIHPGLSGMD--ALPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGF 77
             F +K +  +C  H  +  +    LP  +L++   +D++LIS++H     ALP+    TGF
Sbjct:    63 FLDKELK-ECSGHVFVDSVPEFCLPETELIDLSTVDVILISNYHC--MMALPYITEHTGF 119

Query:    78 KGRCFMTHATKAIYRWLLSDYIK-VSNISTEQM--LYTESDLEK 118
              G  + T  T  I R L+ + +  +  +   Q   L+   D+++
Sbjct:   120 TGTVYATEPTVQIGRLLMEELVNFIERVPKAQSASLWKNKDIQR 163

 Score = 67 (28.6 bits), Expect = 3.9e-12, Sum P(2) = 3.9e-12
 Identities = 16/41 (39%), Positives = 28/41 (68%)

Query:    15 CIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLL 55
             C +L+FK+ +IM+DCG+    S ++ LP + LV+S ++  L
Sbjct:    14 CNVLKFKSTTIMLDCGLDM-TSTLNFLP-LPLVQSPRLSNL 52


>UNIPROTKB|F1MMA6
            symbol:INTS9 "Integrator complex subunit 9" species:9913
            "Bos taurus" [GO:0032039 "integrator complex" evidence=IEA]
            [GO:0016180 "snRNA processing" evidence=IEA] InterPro:IPR027074
            InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 GO:GO:0032039
            GO:GO:0016180 IPI:IPI00701634 UniGene:Bt.91042
            PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445 OMA:PLAMKCV
            EMBL:DAAA02021965 EMBL:DAAA02021964 Ensembl:ENSBTAT00000049079
            ArrayExpress:F1MMA6 Uniprot:F1MMA6
        Length = 658

 Score = 177 (67.4 bits), Expect = 5.0e-15, Sum P(2) = 5.0e-15
 Identities = 91/422 (21%), Positives = 167/422 (39%)

Query:    88 KAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNG-IKFSAYNAGH 146
             K I R L S       +ST +  YT  ++  ++ KI+ + + ++ ++ G ++ +  ++G+
Sbjct:   159 KDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQKIELFGAVQVTPLSSGY 218

Query:   147 VLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK-PDILITESTYGTHVHEQR-E 204
              LG++ ++I+    K+ Y    S     H    +   +K  D+LI      T +     +
Sbjct:   219 ALGSSNWIIQSHYEKVSYVSGSSLLTT-HPQPMDQASLKNSDVLILTGL--TQIPTANPD 275

Query:   205 EREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAK 264
                G F S +   V  GG  L+P +  G   +LL  L +Y      L  IP Y+ S +A 
Sbjct:   276 SMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYID-SAGLSSIPFYFISPVAN 334

Query:   265 KCMSVYQTYINAMNDRIRRQISINNP-FVFKHISNLKGIDHFEDI-G--------PCVVM 314
               +   Q +   +    + ++ +  P F    +     + H+  I G        PCVV 
Sbjct:   335 SSLEFSQIFAEWLCHNKQTKVYLPEPPFPHAELIQTNKLKHYPSIHGDFSNDFRQPCVVF 394

Query:   315 ASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPE-EVIGMSGQRLPLK 373
                  ++ G      E+W   + N VI               +EP+   +       PL 
Sbjct:   395 TGHPSLRFGDVVHFMELWGKSSLNTVIF--------------TEPDFSYLEALAPYQPLA 440

Query:   374 MSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMEL 433
             M   Y       ++ Q S+ ++E++P HVV    EQ        +   +   D       
Sbjct:   441 MKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCP--EQYTQPPPAQSHRMDLMVDCQPPAMS 498

Query:   434 YNPRNTVSVDLYFKGEKTAKVMGELAVE----NLKPDAALSGI--IVKRNFNYHLLAPSD 487
             Y     +++    + EK  ++M ELA       +KP  +L+ +  ++    N H+L P  
Sbjct:   499 YRRAEVLALPFKRRYEKI-EIMPELADSLVPMEIKPGISLATVSAVLHTKDNKHVLQPPP 557

Query:   488 LP 489
              P
Sbjct:   558 RP 559

 Score = 96 (38.9 bits), Expect = 5.0e-15, Sum P(2) = 5.0e-15
 Identities = 27/104 (25%), Positives = 51/104 (49%)

Query:    20 FKNKSIMMDCGIHPGLSGMD--ALPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGF 77
             F +K +  +C  H  +  +    LP  +L++   +D++LIS++H     ALP+    TGF
Sbjct:    63 FLDKELK-ECSGHVFVDSVPEFCLPETELIDLSTVDVILISNYHC--MMALPYITEHTGF 119

Query:    78 KGRCFMTHATKAIYRWLLSDYIK-VSNISTEQM--LYTESDLEK 118
              G  + T  T  I R L+ + +  +  +   Q   L+   D+++
Sbjct:   120 TGTVYATEPTVQIGRLLMEELVNFIERVPKAQSASLWKNKDIQR 163

 Score = 67 (28.6 bits), Expect = 5.1e-12, Sum P(2) = 5.1e-12
 Identities = 16/41 (39%), Positives = 28/41 (68%)

Query:    15 CIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLL 55
             C +L+FK+ +IM+DCG+    S ++ LP + LV+S ++  L
Sbjct:    14 CNVLKFKSTTIMLDCGLDM-TSTLNFLP-LPLVQSPRLSNL 52


>UNIPROTKB|Q0C1L6
            symbol:HNE_1669 "Putative uncharacterized protein"
            species:228405 "Hyphomonas neptunium ATCC 15444" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR001279 SMART:SM00849 GO:GO:0016787 EMBL:CP000158
            GenomeReviews:CP000158_GR eggNOG:COG1236 RefSeq:YP_760377.1
            ProteinModelPortal:Q0C1L6 STRING:Q0C1L6 GeneID:4288204
            KEGG:hne:HNE_1669 PATRIC:32216161 HOGENOM:HOG000035995 OMA:STFGLPI
            ProtClustDB:CLSK2517173 BioCyc:HNEP228405:GI69-1701-MONOMER
            InterPro:IPR026360 TIGRFAMs:TIGR04122 Uniprot:Q0C1L6
        Length = 333

 Score = 183 (69.5 bits), Expect = 1.1e-14, Sum P(3) = 1.1e-14
 Identities = 47/155 (30%), Positives = 80/155 (51%)

Query:   126 INFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK 185
             + + E  +V  ++ + Y AGHVLG+A  L+E AG +++ TGDF R  D         P+ 
Sbjct:    72 VAYGETVEVGDVRVTLYPAGHVLGSAQVLLERAGERVIVTGDFKRAADP--TCPPFVPIA 129

Query:   186 PDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRC-LIPVFALGRAQELLLILDEY 244
              D+LITE+T+G  V       +     ++  +     RC L+  +ALG+AQ ++  L E 
Sbjct:   130 CDVLITEATFGLPVFRHPPASD-EIAKVMERLAESPERCVLVGAYALGKAQRVICHLREA 188

Query:   245 WSLHPELHDIPIYYASSLAKKCMSVYQTYINAMND 279
                    +D PIY   ++ K C ++Y+ +  A+ +
Sbjct:   189 G------YDKPIYLHGAMEKLC-ALYEAHGVALGE 216

 Score = 64 (27.6 bits), Expect = 1.1e-14, Sum P(3) = 1.1e-14
 Identities = 15/62 (24%), Positives = 33/62 (53%)

Query:   359 PEEVIGMSGQRLPLKM-----SVDY-ISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEM 412
             P+ V+ M+   L ++      ++D  +  S H D+++ +  +RE+ P+ V + HG +  +
Sbjct:   249 PDPVLAMASGWLQVRQRVRQNNIDLPLVISDHADWEELTRTIREVAPSEVWVTHGSEAGL 308

Query:   413 SR 414
              R
Sbjct:   309 LR 310

 Score = 45 (20.9 bits), Expect = 1.1e-14, Sum P(3) = 1.1e-14
 Identities = 12/38 (31%), Positives = 18/38 (47%)

Query:    31 IHPGLSGMDALPFVDLVE-SDQIDLLLISHFHLDHCGA 67
             I PG  G++       V+ S    L +++H H DH  A
Sbjct:     8 IKPGAGGIEVAGGAAFVDPSLPKPLAIVTHGHADHARA 45


>UNIPROTKB|F1RJQ5
            symbol:INTS9 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0032039 "integrator complex" evidence=IEA] [GO:0016180
            "snRNA processing" evidence=IEA] InterPro:IPR027074
            InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 PANTHER:PTHR11203:SF2
            GeneTree:ENSGT00390000001445 OMA:PLAMKCV EMBL:CU407017
            Ensembl:ENSSSCT00000010615 Uniprot:F1RJQ5
        Length = 576

 Score = 176 (67.0 bits), Expect = 1.8e-14, Sum P(2) = 1.8e-14
 Identities = 91/422 (21%), Positives = 167/422 (39%)

Query:    88 KAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNG-IKFSAYNAGH 146
             K I R L S       +ST +  YT  ++  ++ KI+ + + ++ ++ G ++ +  ++G+
Sbjct:    77 KDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQMVGYSQKIELFGAVQVTPLSSGY 136

Query:   147 VLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK-PDILITESTYGTHVHEQREE 205
              LG++ ++I+    K+ Y    S     H    +   +K  D+LI      T +     +
Sbjct:   137 ALGSSNWIIQSHYEKVSYVSGSSLLTT-HPQPMDQASLKNSDVLILTGL--TQIPTANPD 193

Query:   206 -REGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAK 264
                G F S +   V  GG  L+P +  G   +LL  L +Y      L  IP Y+ S +A 
Sbjct:   194 GMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYID-SAGLSSIPFYFISPVAN 252

Query:   265 KCMSVYQTYINAMNDRIRRQISINNP-FVFKHISNLKGIDHFEDI-G--------PCVVM 314
               +   Q +   +    + ++ +  P F    +     + H+  I G        PCVV 
Sbjct:   253 SSLEFSQIFAEWLCHNKQTKVYLPEPPFPHAELIQTNKLKHYPSIHGDFSNDFRQPCVVF 312

Query:   315 ASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPE-EVIGMSGQRLPLK 373
                  ++ G      E+W   + N VI               +EP+   +       PL 
Sbjct:   313 TGHPSLRFGDVVHFMELWGKSSLNTVIF--------------TEPDFSYLEALAPYQPLA 358

Query:   374 MSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMEL 433
             M   Y       ++ Q S+ ++E++P HVV    EQ        +   +   D       
Sbjct:   359 MKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCP--EQYTQPPPAQSHRMDLMIDCQPPAMS 416

Query:   434 YNPRNTVSVDLYFKGEKTAKVMGELAVE----NLKPDAALSGI--IVKRNFNYHLLAPSD 487
             Y     +++    + EK  ++M ELA       +KP  +L+ +  ++    N H+L P  
Sbjct:   417 YRRAEVLALPFKRRYEKI-EIMPELADSLVPMEIKPGISLATVSAVLHTKDNKHVLQPPP 475

Query:   488 LP 489
              P
Sbjct:   476 RP 477

 Score = 90 (36.7 bits), Expect = 1.8e-14, Sum P(2) = 1.8e-14
 Identities = 22/80 (27%), Positives = 41/80 (51%)

Query:    42 PFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIK- 100
             P  +L++   +D++LIS++H     ALP+    TGF G  + T  T  I R L+ + +  
Sbjct:     4 PQTELIDLSTVDVILISNYHC--MMALPYITEHTGFTGTVYATEPTVQIGRLLMEELVNF 61

Query:   101 VSNISTEQM--LYTESDLEK 118
             +  +   Q   L+   D+++
Sbjct:    62 IERVPKAQSASLWKNKDIQR 81


>UNIPROTKB|G3XAN1
            symbol:INTS9 "Integrator complex subunit 9" species:9606
            "Homo sapiens" [GO:0016180 "snRNA processing" evidence=IEA]
            [GO:0032039 "integrator complex" evidence=IEA] InterPro:IPR027074
            EMBL:CH471080 InterPro:IPR022712 Pfam:PF10996 SMART:SM01027
            PANTHER:PTHR11203:SF2 EMBL:AC040975 EMBL:AC131969 UniGene:Hs.162397
            HGNC:HGNC:25592 ChiTaRS:INTS9 ProteinModelPortal:G3XAN1
            Ensembl:ENST00000523303 ArrayExpress:G3XAN1 Bgee:G3XAN1
            Uniprot:G3XAN1
        Length = 525

 Score = 168 (64.2 bits), Expect = 2.3e-14, Sum P(2) = 2.3e-14
 Identities = 71/330 (21%), Positives = 135/330 (40%)

Query:    88 KAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNG-IKFSAYNAGH 146
             K I R L S       +ST +  YT  ++  ++ KI+ + + ++ ++ G ++ +  ++G+
Sbjct:   159 KDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQKIELFGAVQVTPLSSGY 218

Query:   147 VLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK-PDILITESTYGTHVHEQREE 205
              LG++ ++I+    K+ Y    S     H    +   +K  D+L+      T +     +
Sbjct:   219 ALGSSNWIIQSHYEKVSYVSGSSLLTT-HPQPMDQASLKNSDVLVLTGL--TQIPTANPD 275

Query:   206 -REGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAK 264
                G F S +   V  GG  L+P +  G   +LL  L +Y      L  +P+Y+ S +A 
Sbjct:   276 GMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYID-SAGLSSVPLYFISPVAN 334

Query:   265 KCMSVYQTYINAMNDRIRRQISINNP-FVFKHISNLKGIDHFEDI-G--------PCVVM 314
               +   Q +   +    + ++ +  P F    +     + H+  I G        PCVV 
Sbjct:   335 SSLEFSQIFAEWLCHNKQSKVYLPEPPFPHAELIQTNKLKHYPSIHGDFSNDFRQPCVVF 394

Query:   315 ASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPE-EVIGMSGQRLPLK 373
                  ++ G      E+W   + N VI               +EP+   +       PL 
Sbjct:   395 TGHPSLRFGDVVHFMELWGKSSLNTVIF--------------TEPDFSYLEALAPYQPLA 440

Query:   374 MSVDYISFSAHTDYQQTSEFVRELRPAHVV 403
             M   Y       ++ Q S+ ++E++P HVV
Sbjct:   441 MKCIYCPIDTRLNFIQVSKLLKEVQPLHVV 470

 Score = 96 (38.9 bits), Expect = 2.3e-14, Sum P(2) = 2.3e-14
 Identities = 27/104 (25%), Positives = 51/104 (49%)

Query:    20 FKNKSIMMDCGIHPGLSGMD--ALPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGF 77
             F +K +  +C  H  +  +    LP  +L++   +D++LIS++H     ALP+    TGF
Sbjct:    63 FLDKELK-ECSGHVFVDSVPEFCLPETELIDLSTVDVILISNYHC--MMALPYITEHTGF 119

Query:    78 KGRCFMTHATKAIYRWLLSDYIK-VSNISTEQM--LYTESDLEK 118
              G  + T  T  I R L+ + +  +  +   Q   L+   D+++
Sbjct:   120 TGTVYATEPTVQIGRLLMEELVNFIERVPKAQSASLWKNKDIQR 163

 Score = 67 (28.6 bits), Expect = 2.4e-11, Sum P(2) = 2.4e-11
 Identities = 16/41 (39%), Positives = 28/41 (68%)

Query:    15 CIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLL 55
             C +L+FK+ +IM+DCG+    S ++ LP + LV+S ++  L
Sbjct:    14 CNVLKFKSTTIMLDCGLDM-TSTLNFLP-LPLVQSPRLSNL 52


>UNIPROTKB|Q8EJC6
            symbol:SO_0541 "RNA-metabolizing metallo-beta-lactamase
            family protein" species:211586 "Shewanella oneidensis MR-1"
            [GO:0008150 "biological_process" evidence=ND] InterPro:IPR001279
            Pfam:PF00753 SMART:SM00849 Pfam:PF07521 GO:GO:0016787 EMBL:AE014299
            GenomeReviews:AE014299_GR InterPro:IPR022712 InterPro:IPR011108
            Pfam:PF10996 SMART:SM01027 OMA:MAVEYMS HOGENOM:HOG000244774
            KO:K07576 RefSeq:NP_716177.2 ProteinModelPortal:Q8EJC6
            DNASU:1168409 GeneID:1168409 KEGG:son:SO_0541 PATRIC:23520762
            ProtClustDB:CLSK2516780 Uniprot:Q8EJC6
        Length = 480

 Score = 213 (80.0 bits), Expect = 3.8e-14, P = 3.8e-14
 Identities = 83/336 (24%), Positives = 152/336 (45%)

Query:   110 LYTESDLEKSMDKIETINFHE-EKDVNGIKFSAYNAGHVLGAAMFLIEIA-GV---KILY 164
             L+T  D E+++ +  ++ + +  + +  +     +AGH+LG+A+  + +  G    KI++
Sbjct:   127 LFTVEDAEQAISQFVSLEYGQVTRVIPHVDICLSDAGHILGSALVELWLGEGKSQKKIVF 186

Query:   165 TGDFSRQEDRHLMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRG-GR 223
             +GD  R     L    +     D+++ ESTYG   H    +       +    VN   G 
Sbjct:   187 SGDLGRAGMPILQNPTLVDTA-DLVLMESTYGNRFHRSWTDTLAELKDIFAKTVNESQGN 245

Query:   224 CLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRR 283
              L+P F++GRAQELL +   Y +   +L    I   S +A +   VY      M++  +R
Sbjct:   246 ILLPAFSVGRAQELLYLFHLY-AKEWDLGRWKICLDSPMAIEATRVYVNNYPLMDEDFKR 304

Query:   284 QISINNPFVFKHISNLKGIDHFED-IG------PCVVMASPGMMQSGLSRELFE--MWCT 334
               +  +P     +SN++ I   E+ I         +++A  GM   G  R   E  +W +
Sbjct:   305 -FTRQHPGQHPLLSNVEFIQTTEESIALNEVHKGLIIIAGSGMCNGGRIRSHLEHNLWRS 363

Query:   335 DAKNGVIIAGYCVEGTLAKTILSEPEEVIGMSGQRLPLKMSVDYIS-FSAHTDYQQTSEF 393
             +    VII G+   GT  + ++   +E+  + G  + +   +  +   SAH D  +   +
Sbjct:   364 ECD--VIICGFQALGTPGRALVDGAKELT-IHGNSVNVAAKLHTVGGLSAHADQAELLRW 420

Query:   394 VR--ELRPAHVVLVHGEQNEMSRLKAALTREYEDDP 427
              R  E +P  +VLVHGE      L A + ++ +  P
Sbjct:   421 YRHFEEQPP-LVLVHGEPEAQQGLVAVMNQDPKTKP 455

 Score = 150 (57.9 bits), Expect = 3.2e-07, P = 3.2e-07
 Identities = 48/171 (28%), Positives = 83/171 (48%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDAL----PFVDLVESDQIDLLLISHFH 61
             GA +EV  SC ++    K +++DCG+  G    D L    PFV   +   I  +++SH H
Sbjct:     9 GAAREVTGSCHLVTVAGKHLLLDCGLIQG-GKADELRNHEPFV--FDPQTIVAVVLSHAH 65

Query:    62 LDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM------------ 109
             +DH G LP  L+K GF G  +   AT  +   +L D   +    TE+             
Sbjct:    66 IDHSGRLP-LLVKAGFDGPIYTHKATAELCAIMLKDAAMLQVRDTERTNKKRAKHDLAPL 124

Query:   110 --LYTESDLEKSMDKIETINFHE-EKDVNGIKFSAYNAGHVLGAAMFLIEI 157
               L+T  D E+++ +  ++ + +  + +  +     +AGH+LG+A  L+E+
Sbjct:   125 EPLFTVEDAEQAISQFVSLEYGQVTRVIPHVDICLSDAGHILGSA--LVEL 173


>TIGR_CMR|SO_0541
            symbol:SO_0541 "metallo-beta-lactamase family protein"
            species:211586 "Shewanella oneidensis MR-1" [GO:0008150
            "biological_process" evidence=ND] [GO:0003824 "catalytic activity"
            evidence=ISS] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 GO:GO:0016787 EMBL:AE014299 GenomeReviews:AE014299_GR
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            OMA:MAVEYMS HOGENOM:HOG000244774 KO:K07576 RefSeq:NP_716177.2
            ProteinModelPortal:Q8EJC6 DNASU:1168409 GeneID:1168409
            KEGG:son:SO_0541 PATRIC:23520762 ProtClustDB:CLSK2516780
            Uniprot:Q8EJC6
        Length = 480

 Score = 213 (80.0 bits), Expect = 3.8e-14, P = 3.8e-14
 Identities = 83/336 (24%), Positives = 152/336 (45%)

Query:   110 LYTESDLEKSMDKIETINFHE-EKDVNGIKFSAYNAGHVLGAAMFLIEIA-GV---KILY 164
             L+T  D E+++ +  ++ + +  + +  +     +AGH+LG+A+  + +  G    KI++
Sbjct:   127 LFTVEDAEQAISQFVSLEYGQVTRVIPHVDICLSDAGHILGSALVELWLGEGKSQKKIVF 186

Query:   165 TGDFSRQEDRHLMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRG-GR 223
             +GD  R     L    +     D+++ ESTYG   H    +       +    VN   G 
Sbjct:   187 SGDLGRAGMPILQNPTLVDTA-DLVLMESTYGNRFHRSWTDTLAELKDIFAKTVNESQGN 245

Query:   224 CLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRR 283
              L+P F++GRAQELL +   Y +   +L    I   S +A +   VY      M++  +R
Sbjct:   246 ILLPAFSVGRAQELLYLFHLY-AKEWDLGRWKICLDSPMAIEATRVYVNNYPLMDEDFKR 304

Query:   284 QISINNPFVFKHISNLKGIDHFED-IG------PCVVMASPGMMQSGLSRELFE--MWCT 334
               +  +P     +SN++ I   E+ I         +++A  GM   G  R   E  +W +
Sbjct:   305 -FTRQHPGQHPLLSNVEFIQTTEESIALNEVHKGLIIIAGSGMCNGGRIRSHLEHNLWRS 363

Query:   335 DAKNGVIIAGYCVEGTLAKTILSEPEEVIGMSGQRLPLKMSVDYIS-FSAHTDYQQTSEF 393
             +    VII G+   GT  + ++   +E+  + G  + +   +  +   SAH D  +   +
Sbjct:   364 ECD--VIICGFQALGTPGRALVDGAKELT-IHGNSVNVAAKLHTVGGLSAHADQAELLRW 420

Query:   394 VR--ELRPAHVVLVHGEQNEMSRLKAALTREYEDDP 427
              R  E +P  +VLVHGE      L A + ++ +  P
Sbjct:   421 YRHFEEQPP-LVLVHGEPEAQQGLVAVMNQDPKTKP 455

 Score = 150 (57.9 bits), Expect = 3.2e-07, P = 3.2e-07
 Identities = 48/171 (28%), Positives = 83/171 (48%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDAL----PFVDLVESDQIDLLLISHFH 61
             GA +EV  SC ++    K +++DCG+  G    D L    PFV   +   I  +++SH H
Sbjct:     9 GAAREVTGSCHLVTVAGKHLLLDCGLIQG-GKADELRNHEPFV--FDPQTIVAVVLSHAH 65

Query:    62 LDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM------------ 109
             +DH G LP  L+K GF G  +   AT  +   +L D   +    TE+             
Sbjct:    66 IDHSGRLP-LLVKAGFDGPIYTHKATAELCAIMLKDAAMLQVRDTERTNKKRAKHDLAPL 124

Query:   110 --LYTESDLEKSMDKIETINFHE-EKDVNGIKFSAYNAGHVLGAAMFLIEI 157
               L+T  D E+++ +  ++ + +  + +  +     +AGH+LG+A  L+E+
Sbjct:   125 EPLFTVEDAEQAISQFVSLEYGQVTRVIPHVDICLSDAGHILGSA--LVEL 173


>UNIPROTKB|Q5ZKK2
            symbol:INTS9 "Integrator complex subunit 9" species:9031
            "Gallus gallus" [GO:0016180 "snRNA processing" evidence=IEA]
            [GO:0032039 "integrator complex" evidence=IEA] InterPro:IPR027074
            eggNOG:COG1236 InterPro:IPR022712 Pfam:PF10996 SMART:SM01027
            GO:GO:0032039 GO:GO:0016180 CTD:55756 HOGENOM:HOG000045359
            HOVERGEN:HBG081802 KO:K13146 OrthoDB:EOG415GCW
            PANTHER:PTHR11203:SF2 EMBL:AJ720082 IPI:IPI00651516
            RefSeq:NP_001026271.1 UniGene:Gga.21113 ProteinModelPortal:Q5ZKK2
            STRING:Q5ZKK2 Ensembl:ENSGALT00000026848 GeneID:422023
            KEGG:gga:422023 GeneTree:ENSGT00390000001445 InParanoid:Q5ZKK2
            OMA:PLAMKCV NextBio:20824712 Uniprot:Q5ZKK2
        Length = 658

 Score = 162 (62.1 bits), Expect = 9.2e-13, Sum P(2) = 9.2e-13
 Identities = 70/340 (20%), Positives = 137/340 (40%)

Query:    78 KGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNG- 136
             K +   T   K + R L +       +S  +  YT  ++  ++ KI+ + + ++ ++ G 
Sbjct:   149 KAQSASTWKNKEVQRLLPAPLKDAVEVSMWRKCYTMPEVNAALSKIQLVGYSQKIELFGA 208

Query:   137 IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK-PDILITESTY 195
             ++ +  ++G+ LG++ ++I+    K+ Y    S     H    +   +K  D+LI     
Sbjct:   209 VQVTPLSSGYALGSSNWIIQSHYEKVSYVSGSSLLTT-HPQPMDQASLKNSDVLILTGL- 266

Query:   196 GTHVHEQREE-REGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDI 254
              T +     +   G F S +   V  GG  L+P +  G   +LL  L +Y      L ++
Sbjct:   267 -TQIPTANPDGMVGEFCSNLAMTVRNGGNVLVPCYPSGVIYDLLECLYQYID-SAGLSNV 324

Query:   255 PIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNP-FVFKHISNLKGIDHFEDI-G--- 309
             P Y+ S +A   +   Q +   +    + ++ +  P F    +     + H+  I G   
Sbjct:   325 PFYFISPVANSSLEFSQIFAEWLCHNKQTKVYLPEPPFPHAELIQTNKLKHYPSIHGDFS 384

Query:   310 -----PCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPE-EVI 363
                  PCV+      ++ G      E+W   + N VI               +EP+   +
Sbjct:   385 NDFKQPCVIFTGHPSLRFGDVVHFMELWGKSSLNTVIF--------------TEPDFSYL 430

Query:   364 GMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVV 403
                    PL M   Y       ++ Q S+ ++E++P HVV
Sbjct:   431 DALAPYQPLAMKCVYCPIDTRLNFIQVSKLLKEVQPLHVV 470

 Score = 90 (36.7 bits), Expect = 9.2e-13, Sum P(2) = 9.2e-13
 Identities = 27/101 (26%), Positives = 48/101 (47%)

Query:    20 FKNKSIMMDCGIHPGLSGMD--ALPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGF 77
             F +K +  +C  H  +  +    LP  +L++   +D++LIS++H     ALP+    TGF
Sbjct:    63 FLDKELK-ECSGHVFVDSVPEFCLPETELLDLSTVDVILISNYHC--MMALPYITEYTGF 119

Query:    78 KGRCFMTHATKAIYRWLLSDYIK-VSNISTEQMLYTESDLE 117
              G  + T  T  I R L+ + +  +  +   Q   T  + E
Sbjct:   120 TGTVYATEPTVQIGRLLMEELVNSIERVPKAQSASTWKNKE 160

 Score = 68 (29.0 bits), Expect = 1.7e-10, Sum P(2) = 1.7e-10
 Identities = 16/41 (39%), Positives = 28/41 (68%)

Query:    15 CIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLL 55
             C +L+FK+ +IM+DCG+    S ++ LP + LV+S ++  L
Sbjct:    14 CNVLKFKSTTIMLDCGLDM-TSTLNFLP-LPLVQSPRLSKL 52


>UNIPROTKB|H7BYQ6
            symbol:INTS9 "Integrator complex subunit 9" species:9606
            "Homo sapiens" [GO:0016180 "snRNA processing" evidence=IEA]
            [GO:0032039 "integrator complex" evidence=IEA] InterPro:IPR027074
            InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 PANTHER:PTHR11203:SF2
            EMBL:AC040975 EMBL:AC131969 HGNC:HGNC:25592 ChiTaRS:INTS9
            ProteinModelPortal:H7BYQ6 Ensembl:ENST00000397363 Bgee:H7BYQ6
            Uniprot:H7BYQ6
        Length = 552

 Score = 182 (69.1 bits), Expect = 6.9e-12, Sum P(2) = 6.9e-12
 Identities = 108/494 (21%), Positives = 196/494 (39%)

Query:    88 KAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNG-IKFSAYNAGH 146
             K I R L S       +ST +  YT  ++  ++ KI+ + + ++ ++ G ++ +  ++G+
Sbjct:    53 KDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQKIELFGAVQVTPLSSGY 112

Query:   147 VLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK-PDILITESTYGTHVHEQREE 205
              LG++ ++I+    K+ Y    S     H    +   +K  D+L+      T +     +
Sbjct:   113 ALGSSNWIIQSHYEKVSYVSGSSLLTT-HPQPMDQASLKNSDVLVLTGL--TQIPTANPD 169

Query:   206 -REGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAK 264
                G F S +   V  GG  L+P +  G   +LL  L +Y      L  +P+Y+ S +A 
Sbjct:   170 GMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYID-SAGLSSVPLYFISPVAN 228

Query:   265 KCMSVYQTYINAMNDRIRRQISINNP-FVFKHISNLKGIDHFEDI-G--------PCVVM 314
               +   Q +   +    + ++ +  P F    +     + H+  I G        PCVV 
Sbjct:   229 SSLEFSQIFAEWLCHNKQSKVYLPEPPFPHAELIQTNKLKHYPSIHGDFSNDFRQPCVVF 288

Query:   315 ASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPE-EVIGMSGQRLPLK 373
                  ++ G      E+W   + N VI               +EP+   +       PL 
Sbjct:   289 TGHPSLRFGDVVHFMELWGKSSLNTVIF--------------TEPDFSYLEALAPYQPLA 334

Query:   374 MSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMEL 433
             M   Y       ++ Q S+ ++E++P HVV    EQ        +   +   D       
Sbjct:   335 MKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCP--EQYTQPPPAQSHRMDLMIDCQPPAMS 392

Query:   434 YNPRNTVSVDLYFKGEKTAKVMGELAVE----NLKPDAALSGI--IVKRNFNYHLLAPSD 487
             Y     +++    + EK  ++M ELA       +KP  +L+ +  ++    N HLL P  
Sbjct:   393 YRRAEVLALPFKRRYEKI-EIMPELADSLVPMEIKPGISLATVSAVLHTKDNKHLLQPPP 451

Query:   488 LPKY-TDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLDEK-RLRAFACIEI--TL 543
              P   T  K  K +            VL+ L+S    PVE   +      F+ I++  T 
Sbjct:   452 RPAQPTSGKKRKRVSDDVP----DCKVLKPLLSGSI-PVEQFVQTLEKHGFSDIKVEDTA 506

Query:   544 EKCIVVLEWASNPI 557
             +  IV+L+ A   I
Sbjct:   507 KGHIVLLQEAETLI 520

 Score = 58 (25.5 bits), Expect = 6.9e-12, Sum P(2) = 6.9e-12
 Identities = 15/55 (27%), Positives = 26/55 (47%)

Query:    67 ALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIK-VSNISTEQM--LYTESDLEK 118
             ALP+    TGF G  + T  T  I R L+ + +  +  +   Q   L+   D+++
Sbjct:     3 ALPYITEHTGFTGTVYATEPTVQIGRLLMEELVNFIERVPKAQSASLWKNKDIQR 57


>FB|FBgn0036570
            symbol:IntS9 "Integrator 9" species:7227 "Drosophila
            melanogaster" [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=ISS] [GO:0006379 "mRNA
            cleavage" evidence=ISS] [GO:0006378 "mRNA polyadenylation"
            evidence=ISS] [GO:0034472 "snRNA 3'-end processing" evidence=IDA]
            [GO:0032039 "integrator complex" evidence=ISS] [GO:0016180 "snRNA
            processing" evidence=ISS] InterPro:IPR027074 EMBL:AE014296
            GO:GO:0006378 GO:GO:0005847 GO:GO:0006379 InterPro:IPR022712
            Pfam:PF10996 SMART:SM01027 CTD:55756 KO:K13146
            PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445 OMA:PLAMKCV
            GO:GO:0034472 EMBL:AY058574 RefSeq:NP_648838.3 UniGene:Dm.977
            SMR:Q95TS5 IntAct:Q95TS5 MINT:MINT-1734573
            EnsemblMetazoa:FBtr0075495 GeneID:39763 KEGG:dme:Dmel_CG5222
            UCSC:CG5222-RA FlyBase:FBgn0036570 InParanoid:Q95TS5
            OrthoDB:EOG4FJ6QV GenomeRNAi:39763 NextBio:815254 Uniprot:Q95TS5
        Length = 654

 Score = 144 (55.7 bits), Expect = 7.5e-12, Sum P(2) = 7.5e-12
 Identities = 64/310 (20%), Positives = 132/310 (42%)

Query:   110 LYTESDLEKSMDKIETINFHEEKDVNGIKFSA-YNAGHVLGAAMFLIEIAGVKILYTGDF 168
             +++  D++ S+ K+  + + E+ D+ G   +   ++G+ LG++ +++  A  KI Y    
Sbjct:   180 IFSLKDVQGSLSKVTIMGYDEKLDILGAFIATPVSSGYCLGSSNWVLSTAHEKICYVSGS 239

Query:   169 SRQEDRHLMAAEIPPVK-PDILI-TESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLI 226
             S     H        +K  D+LI T  T    V+   + + G     +   +   G  LI
Sbjct:   240 STLTT-HPRPINQSALKHADVLIMTGLTQAPTVNP--DTKLGELCMNVALTIRNNGSALI 296

Query:   227 PVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQIS 286
             P +  G   +L   L +    +  L+++P+++ S +A   ++        ++   + ++ 
Sbjct:   297 PCYPSGVVYDLFECLTQNLE-NAGLNNVPMFFISPVADSSLAYSNILAEWLSSAKQNKVY 355

Query:   287 I-NNPFVFK-HISN--LKGIDHFEDIG-------PCVVMASPGMMQSGLSRELFEMWCTD 335
             + ++PF    ++ N  LK  +H    G       PCVV      ++ G +    EMW  +
Sbjct:   356 LPDDPFPHAFYLRNNKLKHYNHVFSEGFSKDFRQPCVVFCGHPSLRFGDAVHFIEMWGNN 415

Query:   336 AKNGVIIAGYCVEGTLAKTILSEPE-EVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFV 394
               N +I               +EP+   + +     PL M   Y       +YQQ ++ +
Sbjct:   416 PNNSIIF--------------TEPDFPYLQVLAPFQPLAMKAFYCPIDTSLNYQQANKLI 461

Query:   395 RELRPAHVVL 404
             +EL+P  +V+
Sbjct:   462 KELKPNVLVI 471

 Score = 100 (40.3 bits), Expect = 7.5e-12, Sum P(2) = 7.5e-12
 Identities = 27/76 (35%), Positives = 44/76 (57%)

Query:    41 LPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLS---D 97
             LP   +++  ++D++LIS++ L+   ALP+    TGFKG+ + T  T  I R+ L    D
Sbjct:    86 LPMDKMLDFSEVDVILISNY-LNML-ALPYITENTGFKGKVYATEPTLQIGRFFLEELVD 143

Query:    98 YIKVSNISTEQMLYTE 113
             YI+VS  +    L+ E
Sbjct:   144 YIEVSPKACTARLWKE 159

 Score = 61 (26.5 bits), Expect = 8.0e-08, Sum P(2) = 8.0e-08
 Identities = 14/53 (26%), Positives = 26/53 (49%)

Query:    10 EVGRSCIMLEFKNKSIMMDCGI-HPGLSGMDALPFVDLVESDQIDLLLISHFH 61
             ++ + C ++ FK   IM+DCG+    +     LPFV  ++   +   + S  H
Sbjct:     9 DLAKPCYIITFKGLRIMLDCGLTEQTVLNFLPLPFVQSLKWSNLPNFVPSRDH 61


>ZFIN|ZDB-GENE-061013-129
            symbol:ints9 "integrator complex subunit 9"
            species:7955 "Danio rerio" [GO:0016180 "snRNA processing"
            evidence=IEA] [GO:0032039 "integrator complex" evidence=IEA]
            InterPro:IPR027074 ZFIN:ZDB-GENE-061013-129 eggNOG:COG1236
            InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 CTD:55756
            HOGENOM:HOG000045359 HOVERGEN:HBG081802 KO:K13146 OrthoDB:EOG415GCW
            PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445
            EMBL:CABZ01076623 EMBL:CABZ01078246 EMBL:CABZ01078247
            EMBL:CABZ01078248 EMBL:CABZ01078249 EMBL:BC124793 IPI:IPI00800641
            RefSeq:NP_001070738.1 UniGene:Dr.116109 Ensembl:ENSDART00000097865
            GeneID:768124 KEGG:dre:768124 InParanoid:Q08BB6 NextBio:20918446
            Uniprot:Q08BB6
        Length = 658

 Score = 148 (57.2 bits), Expect = 6.3e-11, Sum P(2) = 6.3e-11
 Identities = 90/437 (20%), Positives = 178/437 (40%)

Query:    88 KAIYRWL---LSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNG-IKFSAYN 143
             K I R L   L D ++V + S     Y+  ++  ++ K++ + + ++ ++ G ++ +  +
Sbjct:   159 KEIQRLLPGPLKDAVEVWSWSK---CYSLQEVNSALSKVQLVGYSQKVELFGAVQVTPLS 215

Query:   144 AGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK-PDILITESTYGTHVHEQ 202
             +G+ LG++ ++I+    K+ Y    S     H    E   +K  D+LI      T +   
Sbjct:   216 SGYSLGSSNWIIQSHYEKVSYVSGSSLLTT-HPQPMEQSSLKNSDVLILTGL--TQIPTA 272

Query:   203 REERE-GRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASS 261
               +   G F S +   V  GG  L+P ++ G   +LL  L ++      L   P Y+ S 
Sbjct:   273 NPDGMLGEFCSNLAMTVRAGGNVLVPCYSSGVIYDLLECLYQFMD-SANLGTTPFYFISP 331

Query:   262 LAKKCMSVYQTYINAMNDRIRRQISINNP-FVFKHISNLKGIDHFEDI-G--------PC 311
             +A   +   Q +   +    + ++ +  P F    +     + H+  I G        PC
Sbjct:   332 VANSSLEFSQIFAEWLCQNKQSKVYLPEPPFPHAELIQTNKLKHYPSIHGDFSSEFRQPC 391

Query:   312 VVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPE-EVIGMSGQRL 370
             VV      ++ G      E+W   + N +I               +EP+   +       
Sbjct:   392 VVFTGHPSLRFGDVVHFMELWGKSSLNTIIF--------------TEPDFSYLDALAPYQ 437

Query:   371 PLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHG-EQNEMSRL-KAALTREYEDDPN 428
             PL M   Y       ++ Q S+ +++++P HVV      Q   S+  ++ L  E +  P 
Sbjct:   438 PLAMKCVYCPIDTRLNFHQVSKLLKDIQPLHVVCPEPYTQPPPSQPHRSDLMLELQPPPM 497

Query:   429 TSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGI-------IVKRNFNYH 481
                  Y   + + +    + E+   ++ ELA ++L P    +G+       +++   N H
Sbjct:   498 A----YRRCSVLRLPFRRRYERI-HLLPELA-KSLVPSEVKAGVSVATVSAVLQSKDNKH 551

Query:   482 LLAPSDLPKYTDLKASK 498
             +L P  +PK   +  SK
Sbjct:   552 VLQP--VPKVAPVAPSK 566

 Score = 87 (35.7 bits), Expect = 6.3e-11, Sum P(2) = 6.3e-11
 Identities = 21/59 (35%), Positives = 33/59 (55%)

Query:    41 LPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYI 99
             LP  +L++   ID++LIS++H     ALP+    TGF G  + T  T  I R L+ + +
Sbjct:    85 LPEKELLDLSTIDVILISNYHC--MMALPYITEHTGFTGTVYATEPTLQIGRLLMEELV 141

 Score = 62 (26.9 bits), Expect = 2.4e-08, Sum P(2) = 2.4e-08
 Identities = 15/41 (36%), Positives = 26/41 (63%)

Query:    15 CIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLL 55
             C +L+FK+ +IM+DCG+    + +  LP + LV S ++  L
Sbjct:    14 CNVLKFKSTTIMLDCGLDT-TAALYFLP-LPLVHSPRLSKL 52


>WB|WBGene00017608
            symbol:F19F10.12 species:6239 "Caenorhabditis elegans"
            [GO:0009792 "embryo development ending in birth or egg hatching"
            evidence=IMP] InterPro:IPR027074 GO:GO:0009792 eggNOG:COG1236
            InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 KO:K13146
            PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445 EMBL:FO080914
            RefSeq:NP_504953.1 ProteinModelPortal:Q95ZM2 PaxDb:Q95ZM2
            EnsemblMetazoa:F19F10.12 GeneID:179142 KEGG:cel:CELE_F19F10.12
            UCSC:F19F10.12 CTD:179142 WormBase:F19F10.12 HOGENOM:HOG000199610
            InParanoid:Q95ZM2 OMA:EFMERIE NextBio:904092 Uniprot:Q95ZM2
        Length = 646

 Score = 154 (59.3 bits), Expect = 8.9e-10, Sum P(3) = 8.9e-10
 Identities = 57/281 (20%), Positives = 122/281 (43%)

Query:   111 YTESDLEKSMDKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSR 170
             YT +D+   + K+ T++F++  D+  IK +   +GH  G+A + I+    +  Y    S 
Sbjct:   174 YTTTDMHSCLAKVITLSFNQTIDLFRIKVTPVVSGHTYGSAYWTIKTENEQFAYLSA-SN 232

Query:   171 QEDRHLMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFA 230
                  +   E  P++    I  ++    V    +E        I D++ + G  L+P+  
Sbjct:   233 PSATDVKLMETAPLRAVDHILVTSLSRLVDTTAKEMGYSLIKTITDVLKKHGSVLLPICP 292

Query:   231 LGRAQELL-LILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISI-N 288
             +G   E++  + D   + +    D PIY+ S +AK  +++       M++  +  + +  
Sbjct:   293 VGPIFEMIEAVSDIITTTNGIPLDTPIYFISPVAKSAIAMASISAEWMSESRQNAVYLPE 352

Query:   289 NPFVFKHISNLKGIDHFEDI-G--------PCVVMASPGMMQSGLSRELFEMWCTDAKNG 339
              P+   ++     +  ++ + G        PCV+ AS   ++ G +  + E+  +D KN 
Sbjct:   353 EPYSHSNLIKSGRVKIYDSLYGSFSKEFKTPCVIFASHASLRIGDAAHMVEVLGSDPKNA 412

Query:   340 VIIAGYCVEGTLAKTILSEPEEVIGMSGQRLPLKMSVDYIS 380
             VI+     +  L    + EP   + +    +P+   +D+ S
Sbjct:   413 VIVT----DPDLPCEDVREPFRNLPIKFINIPMDFRMDFAS 449

 Score = 68 (29.0 bits), Expect = 8.9e-10, Sum P(3) = 8.9e-10
 Identities = 16/61 (26%), Positives = 35/61 (57%)

Query:    45 DLVESDQIDLLLISHFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIK-VSN 103
             D+++ D ID +L+S++     G LP++   +GF G+ ++T       + L+ + ++ +S 
Sbjct:    83 DMLKMDTIDAILVSNYE-SFVG-LPFYTEGSGFSGKIYVTEIAYQYGKLLMEEMLEFISR 140

Query:   104 I 104
             I
Sbjct:   141 I 141

 Score = 43 (20.2 bits), Expect = 8.9e-10, Sum P(3) = 8.9e-10
 Identities = 8/19 (42%), Positives = 12/19 (63%)

Query:    13 RSCIMLEFKNKSIMMDCGI 31
             + C +LE+ N  I+MD  I
Sbjct:    12 KPCFLLEWPNARILMDTPI 30


>CGD|CAL0004705
            symbol:orf19.325 species:5476 "Candida albicans" [GO:0042493
            "response to drug" evidence=IMP] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA]
            [GO:0003723 "RNA binding" evidence=IEA] [GO:0006379 "mRNA cleavage"
            evidence=IEA] [GO:0006378 "mRNA polyadenylation" evidence=IEA]
            InterPro:IPR027075 CGD:CAL0004705 GO:GO:0042493 EMBL:AACQ01000027
            EMBL:AACQ01000026 eggNOG:COG1236 KO:K14402 InterPro:IPR022712
            InterPro:IPR025069 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
            SMART:SM01027 RefSeq:XP_720020.1 RefSeq:XP_720152.1
            ProteinModelPortal:Q5AEE3 STRING:Q5AEE3 GeneID:3638181
            GeneID:3638320 KEGG:cal:CaO19.325 KEGG:cal:CaO19.7957
            Uniprot:Q5AEE3
        Length = 931

 Score = 164 (62.8 bits), Expect = 1.0e-08, Sum P(3) = 1.0e-08
 Identities = 74/350 (21%), Positives = 149/350 (42%)

Query:    17 MLEFKNK-SIMMDCGIHPGLSGMDALPFVDLVES-DQIDLLLISHFHLDHCGALPWFLLK 74
             +LEF N+  ++ D    P  +G+D    + + E   + + +L+SH   +         +K
Sbjct:    20 LLEFDNEFKLIAD----PSWNGVDVNAAMFMEEHLKETNAILLSHSTAEFISGFILLCIK 75

Query:    75 TGFKGRCFMTHATKAIY---RWLLSDYIKVSNI--STEQMLYTESDLEKSMDKIETINFH 129
                       ++T  +    R    +Y +        +  +    +++   DK+  + + 
Sbjct:    76 FPILMSSIPVYSTLPVNQLGRVSTVEYYRAMGFLGPVDSAILELDEVDNWFDKVNLLKYQ 135

Query:   130 EEKDV--NGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAE-IPP--- 183
             +  ++  N +  + YNAGH LG   +LI     +++Y   ++  +D  L +A  I P   
Sbjct:   136 QSLNLFDNKVVVTPYNAGHSLGGTFWLITKRIDRVIYAPAWNHSKDSFLNSASFISPSTG 195

Query:   184 ------VKPDILITESTYGTHV-HEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQE 236
                   ++P   IT +  G+ + H +R E+   F  L+   +  GG  ++P    GR  E
Sbjct:   196 NPHLSLLRPTAFITATDMGSVMSHRKRTEK---FLQLVDATLANGGAAVLPTSLSGRFLE 252

Query:   237 LLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQIS-INN-PFVFK 294
             L  ++DE+    P    IP+Y+ S    K ++     ++ M+    ++   +++ PF   
Sbjct:   253 LFHLIDEHLKGAP----IPVYFLSYSGTKILTYASNLLDWMSKSFTKEWEELSSVPFNPS 308

Query:   295 HISNLKGIDHFEDI-GPCVVMASPGMMQSG-LSRELFEMWCTDAKNGVII 342
              +  L        + GP +V  S   ++SG +S E F+  C D    +I+
Sbjct:   309 KVDLLLDPSELLKLSGPKIVFCSGIDLRSGDISAEAFQYLCNDEHTTIIL 358

 Score = 52 (23.4 bits), Expect = 1.0e-08, Sum P(3) = 1.0e-08
 Identities = 13/58 (22%), Positives = 31/58 (53%)

Query:   356 LSEPEEVIGMS-G-------QRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLV 405
             LS P++ +G++ G       Q+L ++  + ++  S   D +     V+ L+P +++L+
Sbjct:   619 LSNPKKRVGLNYGTKKKSETQKLKVRCGLSFVDLSGQVDLRSLGIIVQALKPYNLILL 676

 Score = 44 (20.5 bits), Expect = 1.0e-08, Sum P(3) = 1.0e-08
 Identities = 24/85 (28%), Positives = 38/85 (44%)

Query:   450 KTAKVMGELAVENLKPDAALSGIIVKR-NFNYHLLAPSDLPKYTDLKASKIIQQQSVYYS 508
             K AK+ GEL ++N  P A  +  +    N N H      L K     A K  +Q+++   
Sbjct:   785 KVAKLYGELELQNQFPAAKKTRTLQDYINSNTHF----SLRKLDGTTAVK--RQETIANQ 838

Query:   509 GSISVLRSLISHLAGPVETLDEKRL 533
                  +R+LI++  GP   +   RL
Sbjct:   839 VQDPKIRALITN--GPKLAIGNIRL 861


>UNIPROTKB|Q5AEE3
            symbol:CFT2 "Putative uncharacterized protein CFT2"
            species:237561 "Candida albicans SC5314" [GO:0042493 "response to
            drug" evidence=IMP] InterPro:IPR027075 CGD:CAL0004705 GO:GO:0042493
            EMBL:AACQ01000027 EMBL:AACQ01000026 eggNOG:COG1236 KO:K14402
            InterPro:IPR022712 InterPro:IPR025069 PANTHER:PTHR11203:SF5
            Pfam:PF10996 Pfam:PF13299 SMART:SM01027 RefSeq:XP_720020.1
            RefSeq:XP_720152.1 ProteinModelPortal:Q5AEE3 STRING:Q5AEE3
            GeneID:3638181 GeneID:3638320 KEGG:cal:CaO19.325
            KEGG:cal:CaO19.7957 Uniprot:Q5AEE3
        Length = 931

 Score = 164 (62.8 bits), Expect = 1.0e-08, Sum P(3) = 1.0e-08
 Identities = 74/350 (21%), Positives = 149/350 (42%)

Query:    17 MLEFKNK-SIMMDCGIHPGLSGMDALPFVDLVES-DQIDLLLISHFHLDHCGALPWFLLK 74
             +LEF N+  ++ D    P  +G+D    + + E   + + +L+SH   +         +K
Sbjct:    20 LLEFDNEFKLIAD----PSWNGVDVNAAMFMEEHLKETNAILLSHSTAEFISGFILLCIK 75

Query:    75 TGFKGRCFMTHATKAIY---RWLLSDYIKVSNI--STEQMLYTESDLEKSMDKIETINFH 129
                       ++T  +    R    +Y +        +  +    +++   DK+  + + 
Sbjct:    76 FPILMSSIPVYSTLPVNQLGRVSTVEYYRAMGFLGPVDSAILELDEVDNWFDKVNLLKYQ 135

Query:   130 EEKDV--NGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAE-IPP--- 183
             +  ++  N +  + YNAGH LG   +LI     +++Y   ++  +D  L +A  I P   
Sbjct:   136 QSLNLFDNKVVVTPYNAGHSLGGTFWLITKRIDRVIYAPAWNHSKDSFLNSASFISPSTG 195

Query:   184 ------VKPDILITESTYGTHV-HEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQE 236
                   ++P   IT +  G+ + H +R E+   F  L+   +  GG  ++P    GR  E
Sbjct:   196 NPHLSLLRPTAFITATDMGSVMSHRKRTEK---FLQLVDATLANGGAAVLPTSLSGRFLE 252

Query:   237 LLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQIS-INN-PFVFK 294
             L  ++DE+    P    IP+Y+ S    K ++     ++ M+    ++   +++ PF   
Sbjct:   253 LFHLIDEHLKGAP----IPVYFLSYSGTKILTYASNLLDWMSKSFTKEWEELSSVPFNPS 308

Query:   295 HISNLKGIDHFEDI-GPCVVMASPGMMQSG-LSRELFEMWCTDAKNGVII 342
              +  L        + GP +V  S   ++SG +S E F+  C D    +I+
Sbjct:   309 KVDLLLDPSELLKLSGPKIVFCSGIDLRSGDISAEAFQYLCNDEHTTIIL 358

 Score = 52 (23.4 bits), Expect = 1.0e-08, Sum P(3) = 1.0e-08
 Identities = 13/58 (22%), Positives = 31/58 (53%)

Query:   356 LSEPEEVIGMS-G-------QRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLV 405
             LS P++ +G++ G       Q+L ++  + ++  S   D +     V+ L+P +++L+
Sbjct:   619 LSNPKKRVGLNYGTKKKSETQKLKVRCGLSFVDLSGQVDLRSLGIIVQALKPYNLILL 676

 Score = 44 (20.5 bits), Expect = 1.0e-08, Sum P(3) = 1.0e-08
 Identities = 24/85 (28%), Positives = 38/85 (44%)

Query:   450 KTAKVMGELAVENLKPDAALSGIIVKR-NFNYHLLAPSDLPKYTDLKASKIIQQQSVYYS 508
             K AK+ GEL ++N  P A  +  +    N N H      L K     A K  +Q+++   
Sbjct:   785 KVAKLYGELELQNQFPAAKKTRTLQDYINSNTHF----SLRKLDGTTAVK--RQETIANQ 838

Query:   509 GSISVLRSLISHLAGPVETLDEKRL 533
                  +R+LI++  GP   +   RL
Sbjct:   839 VQDPKIRALITN--GPKLAIGNIRL 861


>UNIPROTKB|H0YBH8
            symbol:INTS9 "Integrator complex subunit 9" species:9606
            "Homo sapiens" [GO:0016180 "snRNA processing" evidence=IEA]
            [GO:0032039 "integrator complex" evidence=IEA] InterPro:IPR027074
            PANTHER:PTHR11203:SF2 EMBL:AC040975 EMBL:AC131969 HGNC:HGNC:25592
            ChiTaRS:INTS9 Ensembl:ENST00000524081 Uniprot:H0YBH8
        Length = 223

 Score = 151 (58.2 bits), Expect = 1.7e-08, P = 1.7e-08
 Identities = 39/153 (25%), Positives = 79/153 (51%)

Query:    20 FKNKSIMMDCGIHPGLSGMD--ALPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGF 77
             F +K +  +C  H  +  +    LP  +L++   +D++LIS++H     ALP+    TGF
Sbjct:    55 FLDKELK-ECSGHVFVDSVPEFCLPETELIDLSTVDVILISNYHC--MMALPYITEHTGF 111

Query:    78 KGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNG- 136
              G  + T  T  I R L S       +ST +  YT  ++  ++ KI+ + + ++ ++ G 
Sbjct:   112 TGTVYATEPTVQIGRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQKIELFGA 171

Query:   137 IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFS 169
             ++ +  ++G+ LG++ ++I+    K+ Y    S
Sbjct:   172 VQVTPLSSGYALGSSNWIIQSHYEKVSYVSGSS 204


>UNIPROTKB|Q87XP2
            symbol:PSPTO_4134 "Uncharacterized protein" species:223283
            "Pseudomonas syringae pv. tomato str. DC3000" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            EMBL:AE016853 GenomeReviews:AE016853_GR eggNOG:COG1236
            HOGENOM:HOG000035995 OMA:STFGLPI InterPro:IPR026360
            TIGRFAMs:TIGR04122 RefSeq:NP_793895.1 ProteinModelPortal:Q87XP2
            GeneID:1185814 KEGG:pst:PSPTO_4134 PATRIC:19999765 KO:K07577
            ProtClustDB:CLSK2517054 BioCyc:PSYR223283:GJIX-4198-MONOMER
            Uniprot:Q87XP2
        Length = 348

 Score = 127 (49.8 bits), Expect = 6.1e-05, P = 6.1e-05
 Identities = 47/166 (28%), Positives = 80/166 (48%)

Query:    80 RCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNGIKF 139
             R  +THA     R     Y+  +  S E +L   S L + ++ ++T+ + E    +G+K 
Sbjct:    28 RAVITHAHGDHARTGNQHYLSAA--SGEGIL--RSRLGQDIN-LQTLEYGETITHHGVKL 82

Query:   140 SAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVKPDILITESTYGTHV 199
             S + AGHVLG+A   +E  G   + +GD+  + D    A E  PV+    ITEST+G  +
Sbjct:    83 SLHPAGHVLGSAQVRLEYEGEVWVASGDYKVEPDGTCAAFE--PVRCQTFITESTFGLPI 140

Query:   200 HE---QREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILD 242
             +    Q +  EG           +G   ++  ++ G+AQ +L  +D
Sbjct:   141 YRWAPQSQIFEG-INEWWRGNAAQGKASVLFAYSFGKAQRILHGID 185


>UNIPROTKB|E2QVB2
            symbol:INTS9 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0032039 "integrator complex" evidence=IEA]
            [GO:0016180 "snRNA processing" evidence=IEA] InterPro:IPR027074
            InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 GO:GO:0032039
            GO:GO:0016180 PANTHER:PTHR11203:SF2 Ensembl:ENSCAFT00000013124
            Uniprot:E2QVB2
        Length = 409

 Score = 125 (49.1 bits), Expect = 0.00014, P = 0.00014
 Identities = 87/371 (23%), Positives = 144/371 (38%)

Query:   208 GRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAKKCM 267
             G F S +   V  GG  L+P +  G   +LL  L +Y      L +IP Y+ S +A   +
Sbjct:    30 GEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYID-SAGLSNIPFYFISPVANSSL 88

Query:   268 SVYQTYINAMNDRIRRQISINNP-FV---------FKHISNLKGIDHFEDIG-PCVVMAS 316
                Q +   +    + ++ +  P F           KH  +L G D   D   PCVV   
Sbjct:    89 EFSQIFAEWLCHNKQTKVYLPEPPFPHAELIQTNKLKHYPSLHG-DFSSDFRQPCVVFTG 147

Query:   317 PGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPE-EVIGMSGQRLPLKMS 375
                ++ G      E+W   + N VI               +EP+   +       PL M 
Sbjct:   148 HPSLRFGDVVHFMELWGKSSLNTVIF--------------TEPDFSYLEALAPYQPLAMK 193

Query:   376 VDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMELYN 435
               Y       ++ Q S+ ++E++P HVV    EQ        +   +   D       Y 
Sbjct:   194 CIYCPIDTRLNFIQVSKLLKEVQPLHVVCP--EQYTQPPPAQSHRMDLMIDCQPPAMSYR 251

Query:   436 PRNTVSVDLYFKGEKTAKVMGELAVE----NLKPDAALSGI--IVKRNFNYHLLAPSDLP 489
                 +++    + EK  ++M ELA       +KP  +L+ +  ++    N H+L P   P
Sbjct:   252 RAEVLALPFKRRYEKI-EIMPELADALVPMEIKPGISLATVSAVLHTKDNKHVLQPP--P 308

Query:   490 KYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLDEK-RLRAFACIEI--TLEKC 546
             + T     K  ++ S        VL+ L+S    PVE   +      F+ I++  T +  
Sbjct:   309 RPTQPTGGKKRKRASDDIP-DCKVLKPLLSGSI-PVEQFVQTLEKHGFSDIKVEDTAKGH 366

Query:   547 IVVLEWASNPI 557
             IV+L+ A   I
Sbjct:   367 IVLLQEAETLI 377


>TAIR|locus:2079696
            symbol:AT3G07530 "AT3G07530" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR027074 EMBL:CP002686 InterPro:IPR022712
            Pfam:PF10996 SMART:SM01027 KO:K13146 PANTHER:PTHR11203:SF2
            IPI:IPI00520313 RefSeq:NP_187409.2 UniGene:At.53215
            ProteinModelPortal:F4JEH2 PRIDE:F4JEH2 EnsemblPlants:AT3G07530.1
            GeneID:819942 KEGG:ath:AT3G07530 OMA:CYNGTLI Uniprot:F4JEH2
        Length = 699

 Score = 80 (33.2 bits), Expect = 0.00019, Sum P(3) = 0.00019
 Identities = 38/167 (22%), Positives = 71/167 (42%)

Query:   192 ESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPEL 251
             +S   T    +  E+     S   +  + GG  LI +  +G   +LL +L    SL    
Sbjct:   315 DSLLNTEDSLEEMEKLAFVCSCAAESADAGGSTLITITRIGIVLQLLELLSN--SLESSS 372

Query:   252 HDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKG--IDHFEDIG 309
               +PI+  SS+A++ ++   T    + ++ + ++    P  F H+  +K   I  F  I 
Sbjct:   373 LKVPIFVISSVAEELLAYTNTIPEWLCEQRQEKLISGEPS-FGHLKFIKNKKIHLFPAIH 431

Query:   310 --------------PCVVMASPGMMQSGLSRELFEMWCTDAKNGVII 342
                           PC+V AS   ++ G S +L + W  D K+ +++
Sbjct:   432 SPNLIYANRTSWQEPCIVFASHWSLRLGPSVQLLQRWRGDPKSLLVL 478

 Score = 78 (32.5 bits), Expect = 0.00019, Sum P(3) = 0.00019
 Identities = 25/83 (30%), Positives = 40/83 (48%)

Query:   110 LYTESDLEKSMDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDF 168
             LY+  D+E  M K++ + F EE   NG +   A ++G  +GA  +LI      + Y  D 
Sbjct:   199 LYSLDDIESCMKKVQGVKFAEEVCYNGTLIIKALSSGLDIGACNWLINGPNGSLSYVSD- 257

Query:   169 SRQEDRHLMAAEIPPVKP-DILI 190
             S     H  + +   +K  D+LI
Sbjct:   258 SIFVSHHARSFDFHGLKETDVLI 280

 Score = 60 (26.2 bits), Expect = 0.00019, Sum P(3) = 0.00019
 Identities = 17/56 (30%), Positives = 30/56 (53%)

Query:    46 LVESDQIDLLLISHFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKV 101
             L E+  ID++LIS+  +   G LP+     GF  + +MT  T  I + ++ D + +
Sbjct:    97 LWEASFIDIVLISN-PMGLLG-LPFLTQNPGFFAKIYMTEVTAKIGQLMMEDIVSM 150


>UNIPROTKB|Q81SK8
            symbol:BA_1640 "Ribonuclease J" species:1392 "Bacillus
            anthracis" [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR001279 InterPro:IPR001587 InterPro:IPR004613
            Pfam:PF00753 PIRSF:PIRSF004803 PROSITE:PS01292 SMART:SM00849
            Pfam:PF07521 GO:GO:0046872 EMBL:AE016879 GenomeReviews:AE016879_GR
            GO:GO:0003723 GO:GO:0016788 InterPro:IPR011108 HOGENOM:HOG000280201
            KO:K12574 PANTHER:PTHR11203:SF22 TIGRFAMs:TIGR00649
            RefSeq:NP_844087.1 ProteinModelPortal:Q81SK8 DNASU:1086943
            EnsemblBacteria:EBBACT00000008733 GeneID:1086943 KEGG:ban:BA_1640
            PATRIC:18780866 ProtClustDB:CLSK916310 Uniprot:Q81SK8
        Length = 549

 Score = 130 (50.8 bits), Expect = 0.00022, Sum P(2) = 0.00022
 Identities = 61/245 (24%), Positives = 112/245 (45%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIH-P--GLSGMDAL-PFVDLVES--DQIDLLLISH 59
             G   E+G++   ++++N  +++DCG   P   L G+D + P V  ++   ++I  L+++H
Sbjct:    14 GGVNEIGKNMYAIQYENDIVVIDCGSKFPDESLLGIDLIIPDVTYLQENKEKIRGLVVTH 73

Query:    60 FHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKS 119
              H DH G +P+FL +       + T  T  +    L ++  + N +   ++++ES+++  
Sbjct:    74 GHEDHIGGIPYFLKQLNVP--IYATRLTLGLIEIKLKEH-NLQNDTELIVIHSESEID-- 128

Query:   120 MDKIETINF---HEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHL 176
                I+T  F   H   D  GI F     G V+    F  ++  V        ++Q D H 
Sbjct:   129 FGSIKTTFFKTNHSIPDCLGIAFHTPE-GTVVHTGDFKFDLTPVN-------NQQPDIHK 180

Query:   177 MAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGR-CLIPVFA--LGR 233
             MA +I       L++EST          ER       I +I  +  R  +I  FA  + R
Sbjct:   181 MA-KIGSEGVLALLSESTNAERPGFTPSERS--VGERIEEIFMKANRKVIISTFASNVNR 237

Query:   234 AQELL 238
              Q+++
Sbjct:   238 VQQIV 242

 Score = 40 (19.1 bits), Expect = 0.00022, Sum P(2) = 0.00022
 Identities = 9/31 (29%), Positives = 17/31 (54%)

Query:   379 ISFSAHTDYQQTSEFVREL-RPAHVVLVHGE 408
             +  S H  YQ+  + +  L +P + + +HGE
Sbjct:   365 VHVSGHA-YQEELKLMLALMKPKYFIPIHGE 394

 Score = 39 (18.8 bits), Expect = 0.00028, Sum P(2) = 0.00028
 Identities = 9/38 (23%), Positives = 18/38 (47%)

Query:   388 QQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYED 425
             + + EF++EL    V+ ++  + E       L RE  +
Sbjct:   497 RDSEEFLKELNKLAVITINNLKKEKVNSWGILKREVRE 534


>TIGR_CMR|BA_1640
            symbol:BA_1640 "metallo-beta-lactamase family protein"
            species:198094 "Bacillus anthracis str. Ames" [GO:0003824
            "catalytic activity" evidence=ISS] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR001279 InterPro:IPR001587
            InterPro:IPR004613 Pfam:PF00753 PIRSF:PIRSF004803 PROSITE:PS01292
            SMART:SM00849 Pfam:PF07521 GO:GO:0046872 EMBL:AE016879
            GenomeReviews:AE016879_GR GO:GO:0003723 GO:GO:0016788
            InterPro:IPR011108 HOGENOM:HOG000280201 KO:K12574
            PANTHER:PTHR11203:SF22 TIGRFAMs:TIGR00649 RefSeq:NP_844087.1
            ProteinModelPortal:Q81SK8 DNASU:1086943
            EnsemblBacteria:EBBACT00000008733 GeneID:1086943 KEGG:ban:BA_1640
            PATRIC:18780866 ProtClustDB:CLSK916310 Uniprot:Q81SK8
        Length = 549

 Score = 130 (50.8 bits), Expect = 0.00022, Sum P(2) = 0.00022
 Identities = 61/245 (24%), Positives = 112/245 (45%)

Query:     6 GAGQEVGRSCIMLEFKNKSIMMDCGIH-P--GLSGMDAL-PFVDLVES--DQIDLLLISH 59
             G   E+G++   ++++N  +++DCG   P   L G+D + P V  ++   ++I  L+++H
Sbjct:    14 GGVNEIGKNMYAIQYENDIVVIDCGSKFPDESLLGIDLIIPDVTYLQENKEKIRGLVVTH 73

Query:    60 FHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKS 119
              H DH G +P+FL +       + T  T  +    L ++  + N +   ++++ES+++  
Sbjct:    74 GHEDHIGGIPYFLKQLNVP--IYATRLTLGLIEIKLKEH-NLQNDTELIVIHSESEID-- 128

Query:   120 MDKIETINF---HEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHL 176
                I+T  F   H   D  GI F     G V+    F  ++  V        ++Q D H 
Sbjct:   129 FGSIKTTFFKTNHSIPDCLGIAFHTPE-GTVVHTGDFKFDLTPVN-------NQQPDIHK 180

Query:   177 MAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGR-CLIPVFA--LGR 233
             MA +I       L++EST          ER       I +I  +  R  +I  FA  + R
Sbjct:   181 MA-KIGSEGVLALLSESTNAERPGFTPSERS--VGERIEEIFMKANRKVIISTFASNVNR 237

Query:   234 AQELL 238
              Q+++
Sbjct:   238 VQQIV 242

 Score = 40 (19.1 bits), Expect = 0.00022, Sum P(2) = 0.00022
 Identities = 9/31 (29%), Positives = 17/31 (54%)

Query:   379 ISFSAHTDYQQTSEFVREL-RPAHVVLVHGE 408
             +  S H  YQ+  + +  L +P + + +HGE
Sbjct:   365 VHVSGHA-YQEELKLMLALMKPKYFIPIHGE 394

 Score = 39 (18.8 bits), Expect = 0.00028, Sum P(2) = 0.00028
 Identities = 9/38 (23%), Positives = 18/38 (47%)

Query:   388 QQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYED 425
             + + EF++EL    V+ ++  + E       L RE  +
Sbjct:   497 RDSEEFLKELNKLAVITINNLKKEKVNSWGILKREVRE 534


>TIGR_CMR|CHY_1157
            symbol:CHY_1157 "metallo-beta-lactamase family protein"
            species:246194 "Carboxydothermus hydrogenoformans Z-2901"
            [GO:0003824 "catalytic activity" evidence=ISS] [GO:0008152
            "metabolic process" evidence=ISS] InterPro:IPR001279
            InterPro:IPR004613 Pfam:PF00753 PIRSF:PIRSF004803 SMART:SM00849
            Pfam:PF07521 GO:GO:0046872 EMBL:CP000141 GenomeReviews:CP000141_GR
            GO:GO:0003723 GO:GO:0016788 InterPro:IPR011108 eggNOG:COG0595
            HOGENOM:HOG000280201 KO:K12574 PANTHER:PTHR11203:SF22
            TIGRFAMs:TIGR00649 RefSeq:YP_360002.1 ProteinModelPortal:Q3ACY2
            STRING:Q3ACY2 GeneID:3726430 KEGG:chy:CHY_1157 PATRIC:21275454
            OMA:FLVDSTN BioCyc:CHYD246194:GJCN-1156-MONOMER Uniprot:Q3ACY2
        Length = 554

 Score = 113 (44.8 bits), Expect = 0.00022, Sum P(2) = 0.00022
 Identities = 60/231 (25%), Positives = 102/231 (44%)

Query:     4 LKGAGQEVGRSCIMLEFKNKSIMMDCGI---HPGLSGMD-ALPFVD-LVES-DQIDLLLI 57
             L G G E+G++ +++++ +  I++D G+      L G+D  +P +  L+E+ +++  +L+
Sbjct:    13 LGGLG-EIGKNMMVIKYNDAIIVIDAGLMFPEEELLGIDMVIPDMSYLIENKEKVKAVLL 71

Query:    58 SHFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM-LYTESD- 115
             +H H DH G +P+FL +  F    + T  T      LLS  +K + I    + +    D 
Sbjct:    72 THGHEDHIGGMPYFLKQ--FDVPVYGTRLTLG----LLSAKLKEAGIPRASLNVVAPRDV 125

Query:   116 LEKSMDKIETINF-HEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQE-- 172
             L     KIE I   H   D  GI       G V+    F ++   V    T  +   E  
Sbjct:   126 LNIGPFKIEFIKVSHSIPDTVGIAVHT-PVGTVVHTGDFKLDPTPVDGKVTDFYKLAELG 184

Query:   173 DRH---LMAAEIPPVKPDILITESTYGTHVHEQREEREGR-----FTSLIH 215
             ++    LM+      +P   ++E T G    E     EGR     F S +H
Sbjct:   185 EKGVLVLMSDSTNAERPGFTLSEKTVGNTFEETFRVAEGRIIIATFASNVH 235

 Score = 58 (25.5 bits), Expect = 0.00022, Sum P(2) = 0.00022
 Identities = 22/100 (22%), Positives = 44/100 (44%)

Query:   336 AKNGVIIAGYCVEGTLAKTILSEPEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVR 395
             A + VII+   + G   + ++S   + +   G ++ +  +V  I  S H   ++    + 
Sbjct:   322 AGDTVIISAMPIPGN--EKLVSRIIDQLFKLGAKV-IYEAVSGIHVSGHPSQEELKLMIN 378

Query:   396 ELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMELYN 435
              L+P + V +HGE   + +  A + RE    P     + N
Sbjct:   379 LLKPKYFVPIHGEYRHLIK-HAEIARELGIKPQNIFVVEN 417


>UNIPROTKB|Q83DU6
            symbol:CBU_0596 "Metal-dependent hydrolase" species:227377
            "Coxiella burnetii RSA 493" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0008150 "biological_process" evidence=ND] InterPro:IPR001279
            SMART:SM00849 GO:GO:0016787 EMBL:AE016828 GenomeReviews:AE016828_GR
            RefSeq:NP_819626.1 ProteinModelPortal:Q83DU6 GeneID:1208481
            KEGG:cbu:CBU_0596 PATRIC:17929885 HOGENOM:HOG000279110 OMA:IFHDCET
            ProtClustDB:CLSK892609 BioCyc:CBUR227377:GJ7S-597-MONOMER
            Uniprot:Q83DU6
        Length = 249

 Score = 114 (45.2 bits), Expect = 0.00081, P = 0.00081
 Identities = 54/204 (26%), Positives = 88/204 (43%)

Query:    13 RSCIMLEFKN-KSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHCGALPWF 71
             +S I+LE  N K +++DCG         +L  V+L  +D ID + +SHFH DH G L W 
Sbjct:    15 QSNILLENSNHKRLLIDCGT----DAHHSLKNVNLRYAD-IDSVYVSHFHFDHVGGLEWL 69

Query:    72 LLKTGF-----KGRCFMTHATKAIYRW--LLS---DYIKVSNISTEQMLYTESDL-EKSM 120
                  F     K + F+ H +     W  +LS     +K  + +T +  +T + + E+  
Sbjct:    70 AFSAYFDPAVKKPKLFI-HPSMLNILWDHVLSGGLQSLKGESPATLETYFTLAPIREEKY 128

Query:   121 DKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHL-MAA 179
                E+INF   K ++      +N   +L +      +   KI  T D      R+     
Sbjct:   129 FTWESINFEMVKTIH-----VHNGKLLLPSYGLFFSLEKTKIFITTDTQFFPHRYADYYR 183

Query:   180 EIPPVKPDILITESTYGTHVHEQR 203
             E   +  D  I ++  G H H Q+
Sbjct:   184 EADLIFHDCEIDKTKTGVHAHFQQ 207


>TIGR_CMR|CBU_0596
            symbol:CBU_0596 "conserved hypothetical protein"
            species:227377 "Coxiella burnetii RSA 493" [GO:0008150
            "biological_process" evidence=ND] [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            InterPro:IPR001279 SMART:SM00849 GO:GO:0016787 EMBL:AE016828
            GenomeReviews:AE016828_GR RefSeq:NP_819626.1
            ProteinModelPortal:Q83DU6 GeneID:1208481 KEGG:cbu:CBU_0596
            PATRIC:17929885 HOGENOM:HOG000279110 OMA:IFHDCET
            ProtClustDB:CLSK892609 BioCyc:CBUR227377:GJ7S-597-MONOMER
            Uniprot:Q83DU6
        Length = 249

 Score = 114 (45.2 bits), Expect = 0.00081, P = 0.00081
 Identities = 54/204 (26%), Positives = 88/204 (43%)

Query:    13 RSCIMLEFKN-KSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHCGALPWF 71
             +S I+LE  N K +++DCG         +L  V+L  +D ID + +SHFH DH G L W 
Sbjct:    15 QSNILLENSNHKRLLIDCGT----DAHHSLKNVNLRYAD-IDSVYVSHFHFDHVGGLEWL 69

Query:    72 LLKTGF-----KGRCFMTHATKAIYRW--LLS---DYIKVSNISTEQMLYTESDL-EKSM 120
                  F     K + F+ H +     W  +LS     +K  + +T +  +T + + E+  
Sbjct:    70 AFSAYFDPAVKKPKLFI-HPSMLNILWDHVLSGGLQSLKGESPATLETYFTLAPIREEKY 128

Query:   121 DKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHL-MAA 179
                E+INF   K ++      +N   +L +      +   KI  T D      R+     
Sbjct:   129 FTWESINFEMVKTIH-----VHNGKLLLPSYGLFFSLEKTKIFITTDTQFFPHRYADYYR 183

Query:   180 EIPPVKPDILITESTYGTHVHEQR 203
             E   +  D  I ++  G H H Q+
Sbjct:   184 EADLIFHDCEIDKTKTGVHAHFQQ 207


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.320   0.136   0.400    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      622       622   0.00090  120 3  11 22  0.38    34
                                                     36  0.45    37
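
Note on scores: the bit scores shown in parentheses in the alignments above (for example "Score = 148 (57.2 bits)") appear to follow from the raw score and the gapped Lambda and K values in the Q=9,R=2 row of the table, using the standard Karlin-Altschul conversion bits = (Lambda * S - ln K) / ln 2. The sketch below is only an illustration of that arithmetic; the function name bit_score is hypothetical, and the choice of the gapped row rather than the BLOSUM62 row is an assumption, made because it is the row that reproduces the reported values.

```python
import math

# Gapped statistical parameters copied from the Q=9,R=2 row of the table.
# Assumption: this row, not the ungapped BLOSUM62 row, is the one used for
# the bit scores printed in the report.
GAPPED_LAMBDA = 0.244
GAPPED_K = 0.0300


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    # Raw scores taken from alignments in this report.
    for raw in (148, 100, 61, 164):
        print(f"Score = {raw} ({bit_score(raw):.1f} bits)")
    # Score = 148 (57.2 bits)
    # Score = 100 (40.3 bits)
    # Score = 61 (26.5 bits)
    # Score = 164 (62.8 bits)
```

The Expect and Sum P values are not reproduced here, since they also depend on the effective search space and on how the program combines several HSPs.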


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  97
  No. of states in DFA:  619 (66 KB)
  Total size of DFA:  340 KB (2172 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  51.56u 0.09s 51.65t   Elapsed:  00:00:07
  Total cpu time:  51.61u 0.09s 51.70t   Elapsed:  00:00:07
  Start:  Thu Aug 15 17:02:32 2013   End:  Thu Aug 15 17:02:39 2013
