BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>044504
MASVGQPPSLKRRDAPVSREGDQLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYS
GMAALPYFDEIDPSAIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLT
DYVKVSKVSVEDMLFDEQDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVD
IAGVRVLYTGDYSREEDRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHS
TISQGGRVLIPAFALGRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILS
MNERIRNQFANSNPFKFKHISPLNSIDDFSDVGPSVVMASPGGLQSGLSRQLFDIWCSDK
KNACVIPGYVVEGTLAKTIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKE
LMPPNIILVHGESHEMGRLKTKLMTELADCNTKIITPKNCQSVEMYFNSEKMAKTIGRLA
EKTPEVGETVSGILVKKGFTYQIMAPDDLHIFSQLSTANITQRIT

High Scoring Gene Products

Symbol | Full name | Information | P value
CPSF73-I | cleavage and polyadenylation specificity factor 73-I | protein from Arabidopsis thaliana | 3.3e-260
CPSF3 | Uncharacterized protein | protein from Gallus gallus | 4.5e-178
CPSF3 | Uncharacterized protein | protein from Canis lupus familiaris | 5.7e-178
CPSF3 | Cleavage and polyadenylation specificity factor subunit 3 | protein from Bos taurus | 3.1e-177
CPSF3 | Cleavage and polyadenylation specificity factor subunit 3 | protein from Homo sapiens | 3.1e-177
Cpsf3 | cleavage and polyadenylation specificity factor 3 | protein from Mus musculus | 6.5e-177
Cpsf3 | cleavage and polyadenylation specific factor 3, 73kDa | gene from Rattus norvegicus | 1.4e-176
CPSF3 | Uncharacterized protein | protein from Sus scrofa | 7.5e-176
cpsf3 | cleavage and polyadenylation specific factor 3 | gene_product from Danio rerio | 9.6e-176
Cpsf73 | Cleavage and polyadenylation specificity factor 73 | protein from Drosophila melanogaster | 3.2e-168
cpsf3 | cleavage and polyadenylation specificity factor 73 kDa subunit | gene from Dictyostelium discoideum | 2.9e-167
CPSF3 | Cleavage and polyadenylation-specificity factor subunit 3 | protein from Homo sapiens | 1.1e-165
cpsf-3 | | gene from Caenorhabditis elegans | 5.7e-162
YSH1 | Putative endoribonuclease | gene from Saccharomyces cerevisiae | 1.5e-136
orf19.5486 | | gene_product from Candida albicans | 1.1e-133
YSH1 | Endoribonuclease YSH1 | protein from Candida albicans SC5314 | 1.1e-133
PF14_0364 | cleavage and polyadenylation specifity factor protein, putative | gene from Plasmodium falciparum | 1.2e-114
PF14_0364 | Cleavage and polyadenylation specificity factor protein, putative | protein from Plasmodium falciparum 3D7 | 1.2e-114
CPSF3L | Integrator complex subunit 11 | protein from Gallus gallus | 2.0e-88
CPSF3L | Integrator complex subunit 11 | protein from Gallus gallus | 2.5e-88
Cpsf3l | cleavage and polyadenylation specific factor 3-like | protein from Mus musculus | 4.3e-86
CPSF3L | Integrator complex subunit 11 | protein from Homo sapiens | 5.4e-86
Cpsf3l | cleavage and polyadenylation specific factor 3-like | gene from Rattus norvegicus | 5.4e-86
CPSF3L | Integrator complex subunit 11 | protein from Bos taurus | 8.9e-86
IntS11 | Integrator 11 | protein from Drosophila melanogaster | 1.3e-84
CPSF3L | Uncharacterized protein | protein from Sus scrofa | 1.3e-84
CPSF3L | Uncharacterized protein | protein from Canis lupus familiaris | 1.7e-84
CPSF3L | Integrator complex subunit 11 | protein from Bos taurus | 2.7e-84
CPSF3L | Integrator complex subunit 11 | protein from Homo sapiens | 7.2e-84
F10B5.8 | | gene from Caenorhabditis elegans | 2.9e-80
ints11 | integrator complex subunit 11 | gene from Dictyostelium discoideum | 6.0e-80
cpsf3l | cleavage and polyadenylation specific factor 3-like | gene_product from Danio rerio | 9.7e-80
CPSF73-II | AT2G01730 | protein from Arabidopsis thaliana | 3.5e-75
PFC0825c | cleavage and polyadenylation specificity factor protein, putative | gene from Plasmodium falciparum | 2.6e-62
PFC0825c | Cleavage and polyadenylation specificity factor protein, putative | protein from Plasmodium falciparum 3D7 | 2.6e-62
CPSF3 | Cleavage and polyadenylation-specificity factor subunit 3 | protein from Homo sapiens | 1.7e-50
CPSF3L | Integrator complex subunit 11 | protein from Homo sapiens | 5.3e-48
CPSF3L | Integrator complex subunit 11 | protein from Homo sapiens | 3.4e-45
CPSF100 | cleavage and polyadenylation specificity factor 100 | protein from Arabidopsis thaliana | 5.2e-41
CPSF3L | Integrator complex subunit 11 | protein from Homo sapiens | 2.1e-36
CPSF3L | Integrator complex subunit 11 | protein from Homo sapiens | 5.7e-36
CHY_2049 | metallo-beta-lactamase family protein | protein from Carboxydothermus hydrogenoformans Z-2901 | 1.2e-35
CPS_2623 | metallo-beta-lactamase family protein | protein from Colwellia psychrerythraea 34H | 8.3e-35
VC_0264 | Putative uncharacterized protein | protein from Vibrio cholerae O1 biovar El Tor str. N16961 | 2.2e-34
VC_0264 | conserved hypothetical protein | protein from Vibrio cholerae O1 biovar El Tor | 2.2e-34
cpsf-2 | | gene from Caenorhabditis elegans | 3.9e-34
cpsf-2 | Probable cleavage and polyadenylation specificity factor subunit 2 | protein from Caenorhabditis elegans | 3.9e-34
Cpsf100 | Cleavage and polyadenylation specificity factor 100 | protein from Drosophila melanogaster | 4.2e-31
CPSF2 | Uncharacterized protein | protein from Sus scrofa | 1.0e-30
cpsf2 | cleavage and polyadenylation specificity factor 100 kDa subunit | gene from Dictyostelium discoideum | 1.5e-30
CPSF2 | Uncharacterized protein | protein from Gallus gallus | 2.0e-30
cpsf2 | cleavage and polyadenylation specific factor 2 | gene_product from Danio rerio | 3.2e-30
CPSF2 | Cleavage and polyadenylation specificity factor subunit 2 | protein from Bos taurus | 7.1e-30
CPSF2 | Uncharacterized protein | protein from Canis lupus familiaris | 7.1e-30
CPSF2 | Cleavage and polyadenylation specificity factor subunit 2 | protein from Homo sapiens | 7.1e-30
Cpsf2 | cleavage and polyadenylation specific factor 2, 100kDa | gene from Rattus norvegicus | 3.1e-29
DET_1061 | metallo-beta-lactamase family protein | protein from Dehalococcoides ethenogenes 195 | 3.4e-29
cpsf2 | Cleavage and polyadenylation specificity factor subunit 2 | protein from Xenopus laevis | 3.7e-29
Cpsf2 | cleavage and polyadenylation specific factor 2 | protein from Mus musculus | 4.0e-29
SO_0541 | RNA-metabolizing metallo-beta-lactamase family protein | protein from Shewanella oneidensis MR-1 | 7.4e-28
SO_0541 | metallo-beta-lactamase family protein | protein from Shewanella oneidensis MR-1 | 7.4e-28
CPSF3L | Integrator complex subunit 11 | protein from Homo sapiens | 1.8e-21
BA_1737 | Metallo-beta-lactamase family protein | protein from Bacillus anthracis | 2.6e-21
BA_1737 | metallo-beta-lactamase family protein | protein from Bacillus anthracis str. Ames | 2.6e-21
GSU1843 | RNA exonuclease, beta-lactamase fold protein | protein from Geobacter sulfurreducens PCA | 7.9e-19
GSU_1843 | metallo-beta-lactamase family protein | protein from Geobacter sulfurreducens PCA | 7.9e-19
CPSF3L | Integrator complex subunit 11 | protein from Homo sapiens | 2.6e-18
ints9 | integrator complex subunit 9 | gene from Dictyostelium discoideum | 2.6e-18
HNE_1669 | Putative uncharacterized protein | protein from Hyphomonas neptunium ATCC 15444 | 4.3e-13
INTS9 | Integrator complex subunit 9 | protein from Homo sapiens | 1.0e-10
INTS9 | Uncharacterized protein | protein from Canis lupus familiaris | 7.3e-10
orf19.325 | | gene_product from Candida albicans | 7.8e-10
CFT2 | Putative uncharacterized protein CFT2 | protein from Candida albicans SC5314 | 7.8e-10
Ints9 | integrator complex subunit 9 | gene from Rattus norvegicus | 1.6e-09
INTS9 | Integrator complex subunit 9 | protein from Bos taurus | 2.0e-09
INTS9 | Integrator complex subunit 9 | protein from Bos taurus | 2.0e-09
Ints9 | integrator complex subunit 9 | protein from Mus musculus | 3.3e-09
INTS9 | Integrator complex subunit 9 | protein from Homo sapiens | 4.2e-09
INTS9 | Uncharacterized protein | protein from Sus scrofa | 4.6e-09
INTS9 | Integrator complex subunit 9 | protein from Homo sapiens | 5.5e-09
INTS9 | Integrator complex subunit 9 | protein from Homo sapiens | 5.8e-09
INTS9 | Integrator complex subunit 9 | protein from Gallus gallus | 7.0e-09
ints9 | integrator complex subunit 9 | gene_product from Danio rerio | 4.1e-08
F19F10.12 | | gene from Caenorhabditis elegans | 9.3e-08
INTS9 | Integrator complex subunit 9 | protein from Homo sapiens | 1.8e-07
INTS9 | Integrator complex subunit 9 | protein from Homo sapiens | 4.8e-06
IntS9 | Integrator 9 | protein from Drosophila melanogaster | 8.3e-06
PSPTO_4134 | Uncharacterized protein | protein from Pseudomonas syringae pv. tomato str. DC3000 | 1.2e-05
NSE_0829 | metallo-beta-lactamase family, beta-CASP subfamily | protein from Neorickettsia sennetsu str. Miyayama | 5.7e-05
CHY_1157 | metallo-beta-lactamase family protein | protein from Carboxydothermus hydrogenoformans Z-2901 | 0.00014
MGG_06570 | Uncharacterized protein | protein from Magnaporthe oryzae 70-15 | 0.00061
INTS9 | Uncharacterized protein | protein from Canis lupus familiaris | 0.00063

The BLAST search returned 1 gene product which did not match your query constraints. Please see the full BLAST report below for the details.

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  044504
        (525 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2206076 - symbol:CPSF73-I "cleavage and polyad...  2504  3.3e-260  1
UNIPROTKB|F1NKW5 - symbol:CPSF3 "Uncharacterized protein"...  1729  4.5e-178  1
UNIPROTKB|E2R7R2 - symbol:CPSF3 "Uncharacterized protein"...  1728  5.7e-178  1
UNIPROTKB|P79101 - symbol:CPSF3 "Cleavage and polyadenyla...  1721  3.1e-177  1
UNIPROTKB|Q9UKF6 - symbol:CPSF3 "Cleavage and polyadenyla...  1721  3.1e-177  1
MGI|MGI:1859328 - symbol:Cpsf3 "cleavage and polyadenylat...  1718  6.5e-177  1
UNIPROTKB|G3V6W7 - symbol:Cpsf3 "Protein Cpsf3" species:1...  1718  6.5e-177  1
RGD|1305767 - symbol:Cpsf3 "cleavage and polyadenylation ...  1715  1.4e-176  1
UNIPROTKB|I3LKR1 - symbol:CPSF3 "Uncharacterized protein"...  1708  7.5e-176  1
ZFIN|ZDB-GENE-030131-3275 - symbol:cpsf3 "cleavage and po...  1707  9.6e-176  1
FB|FBgn0261065 - symbol:Cpsf73 "Cleavage and polyadenylat...  1636  3.2e-168  1
DICTYBASE|DDB_G0274799 - symbol:cpsf3 "cleavage and polya...  1627  2.9e-167  1
UNIPROTKB|G5E9W3 - symbol:CPSF3 "Cleavage and polyadenyla...  1612  1.1e-165  1
WB|WBGene00013460 - symbol:cpsf-3 species:6239 "Caenorhab...  1577  5.7e-162  1
POMBASE|SPAC17G6.16c - symbol:ysh1 "mRNA cleavage and pol...  1492  5.8e-153  1
SGD|S000004267 - symbol:YSH1 "Putative endoribonuclease" ...  1337  1.5e-136  1
CGD|CAL0005344 - symbol:orf19.5486 species:5476 "Candida ...  1245  1.1e-133  2
UNIPROTKB|Q59P50 - symbol:YSH1 "Endoribonuclease YSH1" sp...  1245  1.1e-133  2
GENEDB_PFALCIPARUM|PF14_0364 - symbol:PF14_0364 "cleavage...   812  1.2e-114  3
UNIPROTKB|Q8IL83 - symbol:PF14_0364 "Cleavage and polyade...   812  1.2e-114  3
ASPGD|ASPL0000060573 - symbol:AN0990 species:162425 "Emer...   839  6.3e-110  2
UNIPROTKB|F1NV30 - symbol:CPSF3L "Integrator complex subu...   883  2.0e-88   1
UNIPROTKB|Q5ZIH0 - symbol:CPSF3L "Integrator complex subu...   882  2.5e-88   1
MGI|MGI:1919207 - symbol:Cpsf3l "cleavage and polyadenyla...   861  4.3e-86   1
UNIPROTKB|Q5TA45 - symbol:CPSF3L "Integrator complex subu...   860  5.4e-86   1
RGD|1306841 - symbol:Cpsf3l "cleavage and polyadenylation...   860  5.4e-86   1
UNIPROTKB|E1B7Q9 - symbol:CPSF3L "Integrator complex subu...   858  8.9e-86   1
FB|FBgn0039691 - symbol:IntS11 "Integrator 11" species:72...   847  1.3e-84   1
UNIPROTKB|F1RJE8 - symbol:CPSF3L "Uncharacterized protein...   847  1.3e-84   1
UNIPROTKB|E2QY53 - symbol:CPSF3L "Uncharacterized protein...   846  1.7e-84   1
UNIPROTKB|Q2YDM2 - symbol:CPSF3L "Integrator complex subu...   844  2.7e-84   1
UNIPROTKB|G3V1S5 - symbol:CPSF3L "Integrator complex subu...   840  7.2e-84   1
WB|WBGene00008642 - symbol:F10B5.8 species:6239 "Caenorha...   806  2.9e-80   1
DICTYBASE|DDB_G0278189 - symbol:ints11 "integrator comple...   803  6.0e-80   1
ZFIN|ZDB-GENE-050522-13 - symbol:cpsf3l "cleavage and pol...   801  9.7e-80   1
TAIR|locus:2065368 - symbol:CPSF73-II "AT2G01730" species...   758  3.5e-75   1
GENEDB_PFALCIPARUM|PFC0825c - symbol:PFC0825c "cleavage a...   537  2.6e-62   3
UNIPROTKB|O77371 - symbol:PFC0825c "Cleavage and polyaden...   537  2.6e-62   3
UNIPROTKB|C9JZH6 - symbol:CPSF3 "Cleavage and polyadenyla...   525  1.7e-50   1
UNIPROTKB|C9J979 - symbol:CPSF3L "Integrator complex subu...   287  5.3e-48   2
UNIPROTKB|E9PNS4 - symbol:CPSF3L "Integrator complex subu...   475  3.4e-45   1
TAIR|locus:2172843 - symbol:CPSF100 "cleavage and polyade...   408  5.2e-41   2
UNIPROTKB|E9PI75 - symbol:CPSF3L "Integrator complex subu...   392  2.1e-36   1
UNIPROTKB|E9PIG1 - symbol:CPSF3L "Integrator complex subu...   388  5.7e-36   1
TIGR_CMR|CHY_2049 - symbol:CHY_2049 "metallo-beta-lactama...   293  1.2e-35   2
TIGR_CMR|CPS_2623 - symbol:CPS_2623 "metallo-beta-lactama...   377  8.3e-35   1
UNIPROTKB|Q9KV92 - symbol:VC_0264 "Putative uncharacteriz...   373  2.2e-34   1
TIGR_CMR|VC_0264 - symbol:VC_0264 "conserved hypothetical...   373  2.2e-34   1
WB|WBGene00017313 - symbol:cpsf-2 species:6239 "Caenorhab...   372  3.9e-34   2
UNIPROTKB|O17403 - symbol:cpsf-2 "Probable cleavage and p...   372  3.9e-34   2
FB|FBgn0027873 - symbol:Cpsf100 "Cleavage and polyadenyla...   342  4.2e-31   2
UNIPROTKB|F1SD85 - symbol:CPSF2 "Uncharacterized protein"...   341  1.0e-30   1
DICTYBASE|DDB_G0270392 - symbol:cpsf2 "cleavage and polya...   340  1.5e-30   2
UNIPROTKB|F1NMN0 - symbol:CPSF2 "Uncharacterized protein"...   347  2.0e-30   2
ZFIN|ZDB-GENE-040718-79 - symbol:cpsf2 "cleavage and poly...   344  3.2e-30   2
UNIPROTKB|Q10568 - symbol:CPSF2 "Cleavage and polyadenyla...   342  7.1e-30   2
UNIPROTKB|E2R496 - symbol:CPSF2 "Uncharacterized protein"...   342  7.1e-30   2
UNIPROTKB|Q9P2I0 - symbol:CPSF2 "Cleavage and polyadenyla...   342  7.1e-30   2
RGD|1309687 - symbol:Cpsf2 "cleavage and polyadenylation ...   337  3.1e-29   2
TIGR_CMR|DET_1061 - symbol:DET_1061 "metallo-beta-lactama...   264  3.4e-29   2
UNIPROTKB|Q9W799 - symbol:cpsf2 "Cleavage and polyadenyla...   333  3.7e-29   2
MGI|MGI:1861601 - symbol:Cpsf2 "cleavage and polyadenylat...   336  4.0e-29   2
UNIPROTKB|Q8EJC6 - symbol:SO_0541 "RNA-metabolizing metal...   236  7.4e-28   2
TIGR_CMR|SO_0541 - symbol:SO_0541 "metallo-beta-lactamase...   236  7.4e-28   2
POMBASE|SPBC1709.15c - symbol:cft2 "cleavage factor two C...   279  2.6e-22   2
UNIPROTKB|E9PIL7 - symbol:CPSF3L "Integrator complex subu...   258  1.8e-21   1
UNIPROTKB|Q81SC3 - symbol:BA_1737 "Metallo-beta-lactamase...   272  2.6e-21   1
TIGR_CMR|BA_1737 - symbol:BA_1737 "metallo-beta-lactamase...   272  2.6e-21   1
UNIPROTKB|Q74C32 - symbol:GSU1843 "RNA exonuclease, beta-...   154  7.9e-19   4
TIGR_CMR|GSU_1843 - symbol:GSU_1843 "metallo-beta-lactama...   154  7.9e-19   4
UNIPROTKB|E9PQF0 - symbol:CPSF3L "Integrator complex subu...   229  2.6e-18   1
DICTYBASE|DDB_G0282473 - symbol:ints9 "integrator complex...   190  2.6e-18   3
UNIPROTKB|Q0C1L6 - symbol:HNE_1669 "Putative uncharacteri...   173  4.3e-13   3
UNIPROTKB|H0YBH8 - symbol:INTS9 "Integrator complex subun...   154  1.0e-10   2
UNIPROTKB|F6XI08 - symbol:INTS9 "Uncharacterized protein"...   173  7.3e-10   2
CGD|CAL0004705 - symbol:orf19.325 species:5476 "Candida a...   187  7.8e-10   2
UNIPROTKB|Q5AEE3 - symbol:CFT2 "Putative uncharacterized ...   187  7.8e-10   2
RGD|1311539 - symbol:Ints9 "integrator complex subunit 9"...   170  1.6e-09   2
UNIPROTKB|F1MMA6 - symbol:INTS9 "Integrator complex subun...   169  2.0e-09   2
UNIPROTKB|Q2KJA6 - symbol:INTS9 "Integrator complex subun...   169  2.0e-09   2
MGI|MGI:1098533 - symbol:Ints9 "integrator complex subuni...   167  3.3e-09   2
UNIPROTKB|Q9NV88 - symbol:INTS9 "Integrator complex subun...   166  4.2e-09   2
UNIPROTKB|F1RJQ5 - symbol:INTS9 "Uncharacterized protein"...   167  4.6e-09   1
UNIPROTKB|H7BYQ6 - symbol:INTS9 "Integrator complex subun...   166  5.5e-09   1
UNIPROTKB|G3XAN1 - symbol:INTS9 "Integrator complex subun...   162  5.8e-09   2
UNIPROTKB|Q5ZKK2 - symbol:INTS9 "Integrator complex subun...   164  7.0e-09   2
ZFIN|ZDB-GENE-061013-129 - symbol:ints9 "integrator compl...   157  4.1e-08   2
WB|WBGene00017608 - symbol:F19F10.12 species:6239 "Caenor...   128  9.3e-08   2
UNIPROTKB|E5RG70 - symbol:INTS9 "Integrator complex subun...   140  1.8e-07   2
UNIPROTKB|E5RK47 - symbol:INTS9 "Integrator complex subun...   112  4.8e-06   2
FB|FBgn0036570 - symbol:IntS9 "Integrator 9" species:7227...   138  8.3e-06   1
UNIPROTKB|Q87XP2 - symbol:PSPTO_4134 "Uncharacterized pro...   129  1.2e-05   2
TIGR_CMR|NSE_0829 - symbol:NSE_0829 "metallo-beta-lactama...   124  5.7e-05   2
TIGR_CMR|CHY_1157 - symbol:CHY_1157 "metallo-beta-lactama...   114  0.00014   2
UNIPROTKB|G4N6C6 - symbol:MGG_06570 "Uncharacterized prot...   107  0.00061   3
UNIPROTKB|E2QVB2 - symbol:INTS9 "Uncharacterized protein"...   118  0.00063   1
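
The summary rows above share a fixed shape: an identifier, a separator " - ", a truncated description, then the high score, the smallest sum probability P(N), and N. A minimal Python sketch for reading one such row into a record follows. The names `HspSummary`, `ROW_RE`, and `parse_summary_row` are hypothetical helpers introduced here for illustration; they are not part of WU-BLAST or of any library.

```python
import re
from typing import NamedTuple, Optional


class HspSummary(NamedTuple):
    """One row of the 'Sequences producing High-scoring Segment Pairs' table."""
    identifier: str   # e.g. "TAIR|locus:2206076"
    description: str  # truncated description, kept exactly as printed
    score: int        # High Score column
    p_value: float    # Smallest Sum Probability P(N) column
    n: int            # N column


# Assumed row layout: "<id> - <description>  <score>  <P(N)>  <N>"
ROW_RE = re.compile(r"^(\S+) - (.*?)\s+(\d+)\s+(\S+)\s+(\d+)\s*$")


def parse_summary_row(line: str) -> Optional[HspSummary]:
    """Return the parsed row, or None if the line does not look like a hit row."""
    match = ROW_RE.match(line)
    if match is None:
        return None
    ident, desc, score, p_value, n = match.groups()
    return HspSummary(ident, desc, int(score), float(p_value), int(n))


if __name__ == "__main__":
    row = 'TAIR|locus:2206076 - symbol:CPSF73-I "cleavage and polyad...  2504  3.3e-260  1'
    print(parse_summary_row(row))
```

Rows parsed this way can be compared against the Threshold of 0.001 given under Parameters, for example `hit.p_value <= 0.001`. Header and blank lines return `None` and can simply be skipped.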


>TAIR|locus:2206076
            symbol:CPSF73-I "cleavage and polyadenylation specificity
            factor 73-I" species:3702 "Arabidopsis thaliana" [GO:0005634
            "nucleus" evidence=ISM;IDA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=ISS]
            [GO:0006378 "mRNA polyadenylation" evidence=ISS] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0005515 "protein binding"
            evidence=IPI] [GO:0000278 "mitotic cell cycle" evidence=RCA]
            [GO:0006346 "methylation-dependent chromatin silencing"
            evidence=RCA] [GO:0006396 "RNA processing" evidence=RCA]
            [GO:0006397 "mRNA processing" evidence=RCA] [GO:0009855
            "determination of bilateral symmetry" evidence=RCA] [GO:0010014
            "meristem initiation" evidence=RCA] [GO:0010073 "meristem
            maintenance" evidence=RCA] [GO:0016246 "RNA interference"
            evidence=RCA] [GO:0031507 "heterochromatin assembly" evidence=RCA]
            [GO:0045787 "positive regulation of cell cycle" evidence=RCA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0005634 GO:GO:0006397
            GO:GO:0090305 EMBL:AC018908 GO:GO:0004518 eggNOG:COG1236
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            EMBL:AY140900 EMBL:AY150478 EMBL:AY074280 EMBL:AK316692
            EMBL:AK316794 IPI:IPI00533462 PIR:G96635 RefSeq:NP_001031215.1
            RefSeq:NP_176297.1 RefSeq:NP_849835.1 UniGene:At.23510
            ProteinModelPortal:Q9C952 SMR:Q9C952 IntAct:Q9C952 STRING:Q9C952
            PaxDb:Q9C952 PRIDE:Q9C952 EnsemblPlants:AT1G61010.1
            EnsemblPlants:AT1G61010.2 EnsemblPlants:AT1G61010.3 GeneID:842393
            KEGG:ath:AT1G61010 TAIR:At1g61010 HOGENOM:HOG000203394
            InParanoid:Q9C952 KO:K14403 OMA:YVSFSAH PhylomeDB:Q9C952
            ProtClustDB:CLSN2681829 Genevestigator:Q9C952 InterPro:IPR021718
            Pfam:PF11718 SMART:SM01098 Uniprot:Q9C952
        Length = 693

 Score = 2504 (886.5 bits), Expect = 3.3e-260, P = 3.3e-260
 Identities = 469/517 (90%), Positives = 503/517 (97%)

Query:     9 SLKRRDAPVSREGDQLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYF 68
             SLKRR+ P+SR+GDQLI+TPLGAG+EVGRSCVYMS++GK ILFDCGIHPAYSGMAALPYF
Sbjct:     7 SLKRREQPISRDGDQLIVTPLGAGSEVGRSCVYMSFRGKNILFDCGIHPAYSGMAALPYF 66

Query:    69 DEIDPSAIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKV 128
             DEIDPS+IDVLLITHFH+DHAASLPYFLEKTTF GRVFMTHATKAIYKLLLTDYVKVSKV
Sbjct:    67 DEIDPSSIDVLLITHFHIDHAASLPYFLEKTTFNGRVFMTHATKAIYKLLLTDYVKVSKV 126

Query:   129 SVEDMLFDEQDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRVLY 188
             SVEDMLFDEQDIN+SMDKIEV+DFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVR+LY
Sbjct:   127 SVEDMLFDEQDINKSMDKIEVIDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRILY 186

Query:   189 TGDYSREEDRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRV 248
             TGDYSREEDRHLRAAELPQFSPDICIIEST GVQLHQ R+IREKRFTDVIHST++QGGRV
Sbjct:   187 TGDYSREEDRHLRAAELPQFSPDICIIESTSGVQLHQSRHIREKRFTDVIHSTVAQGGRV 246

Query:   249 LIPAFALGRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQ 308
             LIPAFALGRAQELLLILDEYW+NHP+ HNIPIYYASPLAKKCMAVYQTYILSMN+RIRNQ
Sbjct:   247 LIPAFALGRAQELLLILDEYWANHPDLHNIPIYYASPLAKKCMAVYQTYILSMNDRIRNQ 306

Query:   309 FANSNPFKFKHISPLNSIDDFSDVGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPG 368
             FANSNPF FKHISPLNSIDDF+DVGPSVVMA+PGGLQSGLSRQLFD WCSDKKNAC+IPG
Sbjct:   307 FANSNPFVFKHISPLNSIDDFNDVGPSVVMATPGGLQSGLSRQLFDSWCSDKKNACIIPG 366

Query:   369 YVVEGTLAKTIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIIL 428
             Y+VEGTLAKTII+EPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIIL
Sbjct:   367 YMVEGTLAKTIINEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIIL 426

Query:   429 VHGESHEMGRLKTKLMTELADCNTKIITPKNCQSVEMYFNSEKMAKTIGRLAEKTPEVGE 488
             VHGE++EM RLK KL+TE  D NTKI+TPKNC+SVEMYFNSEK+AKTIGRLAEKTP+VG+
Sbjct:   427 VHGEANEMMRLKQKLLTEFPDGNTKIMTPKNCESVEMYFNSEKLAKTIGRLAEKTPDVGD 486

Query:   489 TVSGILVKKGFTYQIMAPDDLHIFSQLSTANITQRIT 525
             TVSGILVKKGFTYQIMAPD+LH+FSQLSTA +TQRIT
Sbjct:   487 TVSGILVKKGFTYQIMAPDELHVFSQLSTATVTQRIT 523
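
Each alignment header pairs an Expect value with a P value and gives identity and positive counts with percentages. The Python sketch below shows how those figures relate, under two assumptions: that P follows the usual Poisson relation P = 1 - exp(-E), which makes P and E indistinguishable when E is very small, and that the percentages are truncated rather than rounded, which is consistent with the values printed in this report (for example, 315/510 is shown as 61%). The function names are hypothetical and only illustrate the arithmetic.

```python
import math
import re


def expect_to_p(expect: float) -> float:
    """Probability of at least one such hit, assuming P = 1 - exp(-E)."""
    # expm1 keeps precision for very small E, where 1 - exp(-E) would round to 0.
    return -math.expm1(-expect)


def truncated_percent(matches: int, length: int) -> int:
    """Integer percentage with the fractional part dropped (assumed convention)."""
    return matches * 100 // length


# Matches lines such as "Identities = 469/517 (90%), Positives = 503/517 (97%)".
IDENT_RE = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\), Positives = (\d+)/(\d+) \((\d+)%\)"
)


def percentages_consistent(line: str) -> bool:
    """Check that the printed percentages agree with the printed counts."""
    match = IDENT_RE.search(line)
    if match is None:
        raise ValueError("not an Identities/Positives line")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = map(int, match.groups())
    return (
        truncated_percent(ident, ident_len) == ident_pct
        and truncated_percent(pos, pos_len) == pos_pct
    )


if __name__ == "__main__":
    print(percentages_consistent(" Identities = 469/517 (90%), Positives = 503/517 (97%)"))
    print(expect_to_p(3.3e-260))
```

Under the assumed relation, P differs visibly from E only when E is large relative to the precision shown; near the reporting threshold of this job the two already agree to the digits printed.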


>UNIPROTKB|F1NKW5
            symbol:CPSF3 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0003723 "RNA binding" evidence=IEA] [GO:0004521
            "endoribonuclease activity" evidence=IEA] [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=IEA] [GO:0006398 "histone mRNA 3'-end processing"
            evidence=IEA] [GO:0008409 "5'-3' exonuclease activity"
            evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 GO:GO:0003723 GO:GO:0004521 GO:GO:0005847
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            GeneTree:ENSGT00700000104485 GO:GO:0008409 EMBL:AADN02018718
            IPI:IPI00600642 Ensembl:ENSGALT00000026493 Uniprot:F1NKW5
        Length = 685

 Score = 1729 (613.7 bits), Expect = 4.5e-178, P = 4.5e-178
 Identities = 315/510 (61%), Positives = 397/510 (77%)

Query:    18 SREGDQLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSAID 77
             + E DQL+I PLGAG EVGRSC+ + +KG+ I+ DCGIHP   GM ALPY D IDP+ ID
Sbjct:     6 AEESDQLLIRPLGAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEID 65

Query:    78 VLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLFDE 137
             +LLI+HFHLDH  +LP+FL+KT+FKGR FMTHATKAIY+ LL+DYVKVS +S +DML+ E
Sbjct:    66 LLLISHFHLDHCGALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLYTE 125

Query:   138 QDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREED 197
              D+  SMDKIE ++FH+  EV GIKFWCY AGHVLGAAMFM++IAGV++LYTGD+SR+ED
Sbjct:   126 TDLEESMDKIETINFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQED 185

Query:   198 RHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGR 257
             RHL AAE+P   PDI IIESTYG  +H+ R  RE RF + +H  +++GGR LIP FALGR
Sbjct:   186 RHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGR 245

Query:   258 AQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFANSNPFKF 317
             AQELLLILDEYW NHPE H+IPIYYAS LAKKCMAVYQTY+ +MN++IR Q   +NPF F
Sbjct:   246 AQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPFVF 305

Query:   318 KHISPLNSIDDFSDVGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAK 377
             KHIS L S+D F D+GPSVVMASPG +QSGLSR+LF+ WC+DK+N  +I GY VEGTLAK
Sbjct:   306 KHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAK 365

Query:   378 TIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEMG 437
              I+SEP+E+T M+G   PL M V YISFSAH DY QTS F++ L PP++ILVHGE +EM 
Sbjct:   366 HIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMA 425

Query:   438 RLKTKLMTELAD---CNTKIITPKNCQSVEMYFNSEKMAKTIGRLAEKTPEVGETVSGIL 494
             RLK  L+ E  D    + ++  P+N ++V + F  EK+AK +G LA+K PE G+ +SGIL
Sbjct:   426 RLKAALIREYEDNDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGSLADKKPEQGQRISGIL 485

Query:   495 VKKGFTYQIMAPDDLHIFSQLSTANITQRI 524
             VK+ F Y I++P DL  ++ L+ + +TQ +
Sbjct:   486 VKRNFNYHILSPCDLSNYTDLAMSTVTQTL 515


>UNIPROTKB|E2R7R2
            symbol:CPSF3 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718
            SMART:SM01098 GeneTree:ENSGT00700000104485 EMBL:AAEX03010701
            RefSeq:XP_003639652.1 Ensembl:ENSCAFT00000005417 GeneID:100856414
            KEGG:cfa:100856414 Uniprot:E2R7R2
        Length = 717

 Score = 1728 (613.3 bits), Expect = 5.7e-178, P = 5.7e-178
 Identities = 321/528 (60%), Positives = 405/528 (76%)

Query:     2 ASVGQPPSLKRRDAPVS----REGDQLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHP 57
             A+   PP L+R+ + +S     E DQL+I PLGAG EVGRSC+ + +KG+ I+ DCGIHP
Sbjct:    20 AACSSPP-LRRQISEMSAIPAEESDQLLIRPLGAGQEVGRSCIILEFKGRKIMLDCGIHP 78

Query:    58 AYSGMAALPYFDEIDPSAIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKL 117
                GM ALPY D IDP+ ID+LLI+HFHLDH  +LP+FL+KT+FKGR FMTHATKAIY+ 
Sbjct:    79 GLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTSFKGRTFMTHATKAIYRW 138

Query:   118 LLTDYVKVSKVSVEDMLFDEQDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMF 177
             LL+DYVKVS +S +DML+ E D+  SMDKIE ++FH+  EV GIKFWCY AGHVLGAAMF
Sbjct:   139 LLSDYVKVSNISADDMLYTETDLEESMDKIETINFHEVKEVAGIKFWCYHAGHVLGAAMF 198

Query:   178 MVDIAGVRVLYTGDYSREEDRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDV 237
             M++IAGV++LYTGD+SR+EDRHL AAE+P   PDI IIESTYG  +H+ R  RE RF + 
Sbjct:   199 MIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFCNT 258

Query:   238 IHSTISQGGRVLIPAFALGRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTY 297
             +H  +++GGR LIP FALGRAQELLLILDEYW NHPE H+IPIYYAS LAKKCMAVYQTY
Sbjct:   259 VHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTY 318

Query:   298 ILSMNERIRNQFANSNPFKFKHISPLNSIDDFSDVGPSVVMASPGGLQSGLSRQLFDIWC 357
             + +MN++IR Q   +NPF FKHIS L S+D F D+GPSVVMASPG +QSGLSR+LF+ WC
Sbjct:   319 VNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFESWC 378

Query:   358 SDKKNACVIPGYVVEGTLAKTIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTF 417
             +DK+N  +I GY VEGTLAK I+SEP+E+T M+G   PL M V YISFSAH DY QTS F
Sbjct:   379 TDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEF 438

Query:   418 LKELMPPNIILVHGESHEMGRLKTKLMTELAD---CNTKIITPKNCQSVEMYFNSEKMAK 474
             ++ L PP++ILVHGE +EM RLK  L+ E  D    + ++  P+N ++V + F  EK+AK
Sbjct:   439 IRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEAVTLNFRGEKLAK 498

Query:   475 TIGRLAEKTPEVGETVSGILVKKGFTYQIMAPDDLHIFSQLSTANITQ 522
              +G LA+K PE G+ VSGILVK+ F Y I++P DL  ++ L+ + + Q
Sbjct:   499 VMGFLADKKPEQGQRVSGILVKRNFNYHILSPCDLSNYTDLAMSTVKQ 546


>UNIPROTKB|P79101
            symbol:CPSF3 "Cleavage and polyadenylation specificity
            factor subunit 3" species:9913 "Bos taurus" [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=ISS] [GO:0006398 "histone mRNA 3'-end processing"
            evidence=ISS] [GO:0003723 "RNA binding" evidence=ISS] [GO:0004521
            "endoribonuclease activity" evidence=ISS] [GO:0008409 "5'-3'
            exonuclease activity" evidence=ISS] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0030529 "ribonucleoprotein complex" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0046872 GO:GO:0003723 GO:GO:0030529 GO:GO:0004521
            GO:GO:0005847 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
            Pfam:PF10996 SMART:SM01027 GO:GO:0006398 HOGENOM:HOG000203394
            KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            EMBL:X95906 EMBL:BC104553 IPI:IPI00708839 RefSeq:NP_776709.1
            UniGene:Bt.5045 ProteinModelPortal:P79101 SMR:P79101 STRING:P79101
            PRIDE:P79101 Ensembl:ENSBTAT00000026303 GeneID:281712
            KEGG:bta:281712 CTD:51692 GeneTree:ENSGT00700000104485
            HOVERGEN:HBG051107 InParanoid:P79101 OrthoDB:EOG4FN4H6
            NextBio:20805634 ArrayExpress:P79101 GO:GO:0008409 Uniprot:P79101
        Length = 684

 Score = 1721 (610.9 bits), Expect = 3.1e-177, P = 3.1e-177
 Identities = 315/508 (62%), Positives = 395/508 (77%)

Query:    18 SREGDQLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSAID 77
             + E DQL+I PLGAG EVGRSC+ + +KG+ I+ DCGIHP   GM ALPY D IDP+ ID
Sbjct:     6 AEESDQLLIRPLGAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEID 65

Query:    78 VLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLFDE 137
             +LLI+HFHLDH  +LP+FL+KT+FKGR FMTHATKAIY+ LL+DYVKVS +S +DML+ E
Sbjct:    66 LLLISHFHLDHCGALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLYTE 125

Query:   138 QDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREED 197
              D+  SMDKIE ++FH+  EV GIKFWCY AGHVLGAAMFM++IAGV++LYTGD+SR+ED
Sbjct:   126 TDLEESMDKIETINFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQED 185

Query:   198 RHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGR 257
             RHL AAE+P   PDI IIESTYG  +H+ R  RE RF + +H  +++GGR LIP FALGR
Sbjct:   186 RHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGR 245

Query:   258 AQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFANSNPFKF 317
             AQELLLILDEYW NHPE H+IPIYYAS LAKKCMAVYQTY+ +MN++IR Q   +NPF F
Sbjct:   246 AQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPFVF 305

Query:   318 KHISPLNSIDDFSDVGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAK 377
             KHIS L S+D F D+GPSVVMASPG +QSGLSR+LF+ WC+DK+N  +I GY VEGTLAK
Sbjct:   306 KHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAK 365

Query:   378 TIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEMG 437
              I+SEP+E+T M+G   PL M V YISFSAH DY QTS F++ L PP++ILVHGE +EM 
Sbjct:   366 HIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMA 425

Query:   438 RLKTKLMTELAD---CNTKIITPKNCQSVEMYFNSEKMAKTIGRLAEKTPEVGETVSGIL 494
             RLK  L+ E  D    + ++  P+N ++V + F  EK+AK +G LA+K PE G+ VSGIL
Sbjct:   426 RLKAALIREYEDNDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSGIL 485

Query:   495 VKKGFTYQIMAPDDLHIFSQLSTANITQ 522
             VK+ F Y I++P DL  ++ L+ + + Q
Sbjct:   486 VKRNFNYHILSPCDLSNYTDLAMSTVKQ 513


>UNIPROTKB|Q9UKF6
            symbol:CPSF3 "Cleavage and polyadenylation specificity
            factor subunit 3" species:9606 "Homo sapiens" [GO:0030529
            "ribonucleoprotein complex" evidence=IEA] [GO:0046872 "metal ion
            binding" evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0008409 "5'-3' exonuclease activity" evidence=ISS] [GO:0004521
            "endoribonuclease activity" evidence=ISS] [GO:0003723 "RNA binding"
            evidence=ISS] [GO:0006398 "histone mRNA 3'-end processing"
            evidence=IDA] [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=IDA] [GO:0006378 "mRNA
            polyadenylation" evidence=TAS] [GO:0006379 "mRNA cleavage"
            evidence=TAS] [GO:0000398 "mRNA splicing, via spliceosome"
            evidence=TAS] [GO:0005654 "nucleoplasm" evidence=TAS] [GO:0006366
            "transcription from RNA polymerase II promoter" evidence=TAS]
            [GO:0006369 "termination of RNA polymerase II transcription"
            evidence=TAS] [GO:0006397 "mRNA processing" evidence=TAS]
            [GO:0006406 "mRNA export from nucleus" evidence=TAS] [GO:0008380
            "RNA splicing" evidence=TAS] [GO:0010467 "gene expression"
            evidence=TAS] [GO:0031124 "mRNA 3'-end processing" evidence=TAS]
            Reactome:REACT_71 InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            EMBL:AF017269 Pfam:PF07521 EMBL:AF171877 EMBL:CH471053
            GO:GO:0046872 Reactome:REACT_1675 GO:GO:0006378 GO:GO:0003723
            GO:GO:0030529 GO:GO:0006406 GO:GO:0004521 GO:GO:0000398
            Reactome:REACT_1788 GO:GO:0005847 GO:GO:0006379 GO:GO:0006369
            eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 Reactome:REACT_78 GO:GO:0006398 HOGENOM:HOG000203394
            KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            CTD:51692 HOVERGEN:HBG051107 OrthoDB:EOG4FN4H6 GO:GO:0008409
            EMBL:AC080162 EMBL:BC011654 EMBL:BC020211 IPI:IPI00007818
            RefSeq:NP_057291.1 UniGene:Hs.515972 PDB:2I7T PDB:2I7V PDBsum:2I7T
            PDBsum:2I7V ProteinModelPortal:Q9UKF6 SMR:Q9UKF6 DIP:DIP-42501N
            MINT:MINT-1742891 STRING:Q9UKF6 PhosphoSite:Q9UKF6 DMDM:18203503
            PaxDb:Q9UKF6 PeptideAtlas:Q9UKF6 PRIDE:Q9UKF6 DNASU:51692
            Ensembl:ENST00000238112 GeneID:51692 KEGG:hsa:51692 UCSC:uc002qzo.1
            GeneCards:GC02P009514 HGNC:HGNC:2326 HPA:HPA034657 MIM:606029
            neXtProt:NX_Q9UKF6 PharmGKB:PA26843 InParanoid:Q9UKF6
            PhylomeDB:Q9UKF6 ChiTaRS:CPSF3 EvolutionaryTrace:Q9UKF6
            GenomeRNAi:51692 NextBio:55702 ArrayExpress:Q9UKF6 Bgee:Q9UKF6
            CleanEx:HS_CPSF3 Genevestigator:Q9UKF6 GermOnline:ENSG00000119203
            Uniprot:Q9UKF6
        Length = 684

 Score = 1721 (610.9 bits), Expect = 3.1e-177, P = 3.1e-177
 Identities = 315/508 (62%), Positives = 395/508 (77%)

Query:    18 SREGDQLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSAID 77
             + E DQL+I PLGAG EVGRSC+ + +KG+ I+ DCGIHP   GM ALPY D IDP+ ID
Sbjct:     6 AEESDQLLIRPLGAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEID 65

Query:    78 VLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLFDE 137
             +LLI+HFHLDH  +LP+FL+KT+FKGR FMTHATKAIY+ LL+DYVKVS +S +DML+ E
Sbjct:    66 LLLISHFHLDHCGALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLYTE 125

Query:   138 QDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREED 197
              D+  SMDKIE ++FH+  EV GIKFWCY AGHVLGAAMFM++IAGV++LYTGD+SR+ED
Sbjct:   126 TDLEESMDKIETINFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQED 185

Query:   198 RHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGR 257
             RHL AAE+P   PDI IIESTYG  +H+ R  RE RF + +H  +++GGR LIP FALGR
Sbjct:   186 RHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGR 245

Query:   258 AQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFANSNPFKF 317
             AQELLLILDEYW NHPE H+IPIYYAS LAKKCMAVYQTY+ +MN++IR Q   +NPF F
Sbjct:   246 AQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPFVF 305

Query:   318 KHISPLNSIDDFSDVGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAK 377
             KHIS L S+D F D+GPSVVMASPG +QSGLSR+LF+ WC+DK+N  +I GY VEGTLAK
Sbjct:   306 KHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAK 365

Query:   378 TIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEMG 437
              I+SEP+E+T M+G   PL M V YISFSAH DY QTS F++ L PP++ILVHGE +EM 
Sbjct:   366 HIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMA 425

Query:   438 RLKTKLMTELAD---CNTKIITPKNCQSVEMYFNSEKMAKTIGRLAEKTPEVGETVSGIL 494
             RLK  L+ E  D    + ++  P+N ++V + F  EK+AK +G LA+K PE G+ VSGIL
Sbjct:   426 RLKAALIREYEDNDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSGIL 485

Query:   495 VKKGFTYQIMAPDDLHIFSQLSTANITQ 522
             VK+ F Y I++P DL  ++ L+ + + Q
Sbjct:   486 VKRNFNYHILSPCDLSNYTDLAMSTVKQ 513


>MGI|MGI:1859328
            symbol:Cpsf3 "cleavage and polyadenylation specificity
            factor 3" species:10090 "Mus musculus" [GO:0003723 "RNA binding"
            evidence=IDA] [GO:0003729 "mRNA binding" evidence=ISO] [GO:0004518
            "nuclease activity" evidence=IEA] [GO:0004519 "endonuclease
            activity" evidence=IEA] [GO:0004521 "endoribonuclease activity"
            evidence=IDA] [GO:0005634 "nucleus" evidence=IEA] [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=ISO;IDA] [GO:0006378 "mRNA polyadenylation" evidence=ISO]
            [GO:0006397 "mRNA processing" evidence=IEA] [GO:0006398 "histone
            mRNA 3'-end processing" evidence=ISO;IDA] [GO:0008409 "5'-3'
            exonuclease activity" evidence=IDA] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0030529 "ribonucleoprotein complex"
            evidence=IEA] [GO:0046872 "metal ion binding" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            MGI:MGI:1859328 GO:GO:0046872 GO:GO:0003723 GO:GO:0030529
            GO:GO:0004521 GO:GO:0005847 eggNOG:COG1236 InterPro:IPR022712
            InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 GO:GO:0006398
            HOGENOM:HOG000203394 KO:K14403 OMA:YVSFSAH InterPro:IPR021718
            Pfam:PF11718 SMART:SM01098 CTD:51692 GeneTree:ENSGT00700000104485
            HOVERGEN:HBG051107 OrthoDB:EOG4FN4H6 GO:GO:0008409 ChiTaRS:CPSF3
            EMBL:AF203969 EMBL:BC023297 IPI:IPI00135099 RefSeq:NP_061283.2
            UniGene:Mm.356778 ProteinModelPortal:Q9QXK7 SMR:Q9QXK7
            STRING:Q9QXK7 PhosphoSite:Q9QXK7 PaxDb:Q9QXK7 PRIDE:Q9QXK7
            Ensembl:ENSMUST00000067284 GeneID:54451 KEGG:mmu:54451
            InParanoid:Q8CIM0 NextBio:311332 Bgee:Q9QXK7 CleanEx:MM_CPSF3
            Genevestigator:Q9QXK7 GermOnline:ENSMUSG00000054309 Uniprot:Q9QXK7
        Length = 684

 Score = 1718 (609.8 bits), Expect = 6.5e-177, P = 6.5e-177
 Identities = 314/508 (61%), Positives = 395/508 (77%)

Query:    18 SREGDQLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSAID 77
             + E DQL+I PLGAG EVGRSC+ + +KG+ I+ DCGIHP   GM ALPY D IDP+ ID
Sbjct:     6 AEESDQLLIRPLGAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEID 65

Query:    78 VLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLFDE 137
             +LLI+HFHLDH  +LP+FL+KT+FKGR FMTHATKAIY+ LL+DYVKVS +S +DML+ E
Sbjct:    66 LLLISHFHLDHCGALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLYTE 125

Query:   138 QDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREED 197
              D+  SMDKIE ++FH+  EV GIKFWCY AGHVLGAAMFM++IAGV++LYTGD+SR+ED
Sbjct:   126 TDLEESMDKIETINFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQED 185

Query:   198 RHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGR 257
             RHL AAE+P   PDI IIESTYG  +H+ R  RE RF + +H  +++GGR LIP FALGR
Sbjct:   186 RHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGR 245

Query:   258 AQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFANSNPFKF 317
             AQELLLILDEYW NHPE H+IPIYYAS LAKKCMAVYQTY+ +MN++IR Q   +NPF F
Sbjct:   246 AQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPFVF 305

Query:   318 KHISPLNSIDDFSDVGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAK 377
             KHIS L S+D F D+GPSVVMASPG +Q+GLSR+LF+ WC+DK+N  +I GY VEGTLAK
Sbjct:   306 KHISNLKSMDHFDDIGPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAK 365

Query:   378 TIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEMG 437
              I+SEP+E+T M+G   PL M V YISFSAH DY QTS F++ L PP++ILVHGE +EM 
Sbjct:   366 HIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMA 425

Query:   438 RLKTKLMTELAD---CNTKIITPKNCQSVEMYFNSEKMAKTIGRLAEKTPEVGETVSGIL 494
             RLK  L+ E  D    + ++  P+N ++V + F  EK+AK +G LA+K PE G+ VSGIL
Sbjct:   426 RLKAALIREYEDNDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSGIL 485

Query:   495 VKKGFTYQIMAPDDLHIFSQLSTANITQ 522
             VK+ F Y I++P DL  ++ L+ + + Q
Sbjct:   486 VKRNFNYHILSPCDLSNYTDLAMSTVKQ 513


>UNIPROTKB|G3V6W7
            symbol:Cpsf3 "Protein Cpsf3" species:10116 "Rattus
            norvegicus" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            RGD:1305767 GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108
            Pfam:PF10996 SMART:SM01027 EMBL:CH473947 OMA:YVSFSAH
            InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            GeneTree:ENSGT00700000104485 UniGene:Rn.100522
            Ensembl:ENSRNOT00000009652 Uniprot:G3V6W7
        Length = 685

 Score = 1718 (609.8 bits), Expect = 6.5e-177, P = 6.5e-177
 Identities = 314/508 (61%), Positives = 395/508 (77%)

Query:    18 SREGDQLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSAID 77
             + E DQL+I PLGAG EVGRSC+ + +KG+ I+ DCGIHP   GM ALPY D IDP+ ID
Sbjct:     6 AEESDQLLIRPLGAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEID 65

Query:    78 VLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLFDE 137
             +LLI+HFHLDH  +LP+FL+KT+FKGR FMTHATKAIY+ LL+DYVKVS +S +DML+ E
Sbjct:    66 LLLISHFHLDHCGALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLYTE 125

Query:   138 QDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREED 197
              D+  SMDKIE ++FH+  EV GIKFWCY AGHVLGAAMFM++IAGV++LYTGD+SR+ED
Sbjct:   126 TDLEESMDKIETINFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQED 185

Query:   198 RHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGR 257
             RHL AAE+P   PDI IIESTYG  +H+ R  RE RF + +H  +++GGR LIP FALGR
Sbjct:   186 RHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGR 245

Query:   258 AQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFANSNPFKF 317
             AQELLLILDEYW NHPE H+IPIYYAS LAKKCMAVYQTY+ +MN++IR Q   +NPF F
Sbjct:   246 AQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPFVF 305

Query:   318 KHISPLNSIDDFSDVGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAK 377
             KHIS L S+D F D+GPSVVMASPG +Q+GLSR+LF+ WC+DK+N  +I GY VEGTLAK
Sbjct:   306 KHISNLKSMDHFDDIGPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAK 365

Query:   378 TIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEMG 437
              I+SEP+E+T M+G   PL M V YISFSAH DY QTS F++ L PP++ILVHGE +EM 
Sbjct:   366 HIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMA 425

Query:   438 RLKTKLMTELAD---CNTKIITPKNCQSVEMYFNSEKMAKTIGRLAEKTPEVGETVSGIL 494
             RLK  L+ E  D    + ++  P+N ++V + F  EK+AK +G LA+K PE G+ VSGIL
Sbjct:   426 RLKAALIREYEDNDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSGIL 485

Query:   495 VKKGFTYQIMAPDDLHIFSQLSTANITQ 522
             VK+ F Y I++P DL  ++ L+ + + Q
Sbjct:   486 VKRNFNYHILSPCDLSNYTDLAMSTVKQ 513


>RGD|1305767
            symbol:Cpsf3 "cleavage and polyadenylation specific factor 3,
            73kDa" species:10116 "Rattus norvegicus" [GO:0003723 "RNA binding"
            evidence=ISO] [GO:0004521 "endoribonuclease activity" evidence=ISO]
            [GO:0005847 "mRNA cleavage and polyadenylation specificity factor
            complex" evidence=ISO] [GO:0006398 "histone mRNA 3'-end processing"
            evidence=ISO] [GO:0008409 "5'-3' exonuclease activity"
            evidence=ISO] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 RGD:1305767 GO:GO:0016787 eggNOG:COG1236
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            HOGENOM:HOG000203394 KO:K14403 InterPro:IPR021718 Pfam:PF11718
            SMART:SM01098 CTD:51692 HOVERGEN:HBG051107 OrthoDB:EOG4FN4H6
            UniGene:Rn.100522 EMBL:BC099817 IPI:IPI00365532
            RefSeq:NP_001025201.1 ProteinModelPortal:Q499P4 SMR:Q499P4
            STRING:Q499P4 GeneID:298916 KEGG:rno:298916 InParanoid:Q499P4
            NextBio:644507 Genevestigator:Q499P4 Uniprot:Q499P4
        Length = 685

 Score = 1715 (608.8 bits), Expect = 1.4e-176, P = 1.4e-176
 Identities = 313/508 (61%), Positives = 395/508 (77%)

Query:    18 SREGDQLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSAID 77
             + E DQL+I PLGAG EVGRSC+ + +KG+ I+ DCGIHP   GM ALPY D IDP+ ID
Sbjct:     6 AEESDQLLIRPLGAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEID 65

Query:    78 VLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLFDE 137
             +LLI+HFHLDH  +LP+FL+KT+FKGR FMTHATKAIY+ LL+DYVKVS +S +DML+ E
Sbjct:    66 LLLISHFHLDHCGALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLYTE 125

Query:   138 QDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREED 197
              D+  SMDKIE ++FH+  EV GIKFWCY AGHVLGAAMFM++IAG+++LYTGD+SR+ED
Sbjct:   126 TDLEESMDKIETINFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGMKLLYTGDFSRQED 185

Query:   198 RHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGR 257
             RHL AAE+P   PDI IIESTYG  +H+ R  RE RF + +H  +++GGR LIP FALGR
Sbjct:   186 RHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGR 245

Query:   258 AQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFANSNPFKF 317
             AQELLLILDEYW NHPE H+IPIYYAS LAKKCMAVYQTY+ +MN++IR Q   +NPF F
Sbjct:   246 AQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPFVF 305

Query:   318 KHISPLNSIDDFSDVGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAK 377
             KHIS L S+D F D+GPSVVMASPG +Q+GLSR+LF+ WC+DK+N  +I GY VEGTLAK
Sbjct:   306 KHISNLKSMDHFDDIGPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAK 365

Query:   378 TIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEMG 437
              I+SEP+E+T M+G   PL M V YISFSAH DY QTS F++ L PP++ILVHGE +EM 
Sbjct:   366 HIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMA 425

Query:   438 RLKTKLMTELAD---CNTKIITPKNCQSVEMYFNSEKMAKTIGRLAEKTPEVGETVSGIL 494
             RLK  L+ E  D    + ++  P+N ++V + F  EK+AK +G LA+K PE G+ VSGIL
Sbjct:   426 RLKAALIREYEDNDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSGIL 485

Query:   495 VKKGFTYQIMAPDDLHIFSQLSTANITQ 522
             VK+ F Y I++P DL  ++ L+ + + Q
Sbjct:   486 VKRNFNYHILSPCDLSNYTDLAMSTVKQ 513


>UNIPROTKB|I3LKR1
            symbol:CPSF3 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008409 "5'-3' exonuclease activity" evidence=IEA]
            [GO:0006398 "histone mRNA 3'-end processing" evidence=IEA]
            [GO:0005847 "mRNA cleavage and polyadenylation specificity factor
            complex" evidence=IEA] [GO:0004521 "endoribonuclease activity"
            evidence=IEA] [GO:0003723 "RNA binding" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0003723 GO:GO:0004521 GO:GO:0005847 InterPro:IPR022712
            InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 OMA:YVSFSAH
            InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            GeneTree:ENSGT00700000104485 GO:GO:0008409 EMBL:FP312696
            Ensembl:ENSSSCT00000027309 Uniprot:I3LKR1
        Length = 687

 Score = 1708 (606.3 bits), Expect = 7.5e-176, P = 7.5e-176
 Identities = 315/511 (61%), Positives = 395/511 (77%)

Query:    18 SREGDQLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSAID 77
             + E DQL+I PLGAG EVGRSC+ + +KG+ I+ DCGIHP   GM ALPY D IDP+ ID
Sbjct:     6 AEESDQLLIRPLGAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEID 65

Query:    78 VLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSK---VSVEDML 134
             +LLI+HFHLDH  +LP+FL+KT+FKGR FMTHATKAIY+ LL+DYVKV K   +S +DML
Sbjct:    66 LLLISHFHLDHCGALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVRKCSNISADDML 125

Query:   135 FDEQDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSR 194
             + E D+  SMDKIE ++FH+  EV GIKFWCY AGHVLGAAMFM++IAGV++LYTGD+SR
Sbjct:   126 YTETDLEESMDKIETINFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSR 185

Query:   195 EEDRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFA 254
             +EDRHL AAE+P   PDI IIESTYG  +H+ R  RE RF + +H  +++GGR LIP FA
Sbjct:   186 QEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFA 245

Query:   255 LGRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFANSNP 314
             LGRAQELLLILDEYW NHPE H+IPIYYAS LAKKCMAVYQTY+ +MN++IR Q   +NP
Sbjct:   246 LGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNP 305

Query:   315 FKFKHISPLNSIDDFSDVGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGT 374
             F FKHIS L S+D F D+GPSVVMASPG +QSGLSR+LF+ WC+DK+N  +I GY VEGT
Sbjct:   306 FVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGT 365

Query:   375 LAKTIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESH 434
             LAK I+SEP+E+T M+G   PL M V YISFSAH DY QTS F++ L PP++ILVHGE +
Sbjct:   366 LAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQN 425

Query:   435 EMGRLKTKLMTELAD---CNTKIITPKNCQSVEMYFNSEKMAKTIGRLAEKTPEVGETVS 491
             EM RLK  L+ E  D    + ++  P+N ++V + F  EK+AK +G LA+K PE G+ VS
Sbjct:   426 EMARLKAALIREYEDNDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVS 485

Query:   492 GILVKKGFTYQIMAPDDLHIFSQLSTANITQ 522
             GILVK+ F Y I++P DL  ++ L+ + + Q
Sbjct:   486 GILVKRNFNYHILSPCDLSNYTDLAMSTVKQ 516


>ZFIN|ZDB-GENE-030131-3275
            symbol:cpsf3 "cleavage and polyadenylation
            specific factor 3" species:7955 "Danio rerio" [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 ZFIN:ZDB-GENE-030131-3275 GO:GO:0016787
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            KO:K14403 InterPro:IPR021718 Pfam:PF11718 SMART:SM01098 CTD:51692
            HOVERGEN:HBG051107 EMBL:AY648793 IPI:IPI00509063
            RefSeq:NP_001003836.1 UniGene:Dr.77231 ProteinModelPortal:Q6DRG6
            SMR:Q6DRG6 STRING:Q6DRG6 GeneID:324554 KEGG:dre:324554
            NextBio:20808833 ArrayExpress:Q6DRG6 Uniprot:Q6DRG6
        Length = 690

 Score = 1707 (606.0 bits), Expect = 9.6e-176, P = 9.6e-176
 Identities = 314/516 (60%), Positives = 396/516 (76%)

Query:    11 KRRDAPV-SREGDQLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFD 69
             ++ D PV + E DQL+I PLGAG EVGRSC+ + +KG+ I+ DCGIHP   GM ALPY D
Sbjct:     5 RKADVPVPAEESDQLLIRPLGAGQEVGRSCIILEFKGRKIMVDCGIHPGLEGMDALPYID 64

Query:    70 EIDPSAIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVS 129
              IDP+ ID+LLI+HFHLDH  +LP+FL+KT+FKGR FMTHATKAIY+ LL+DYVKVS +S
Sbjct:    65 LIDPAEIDLLLISHFHLDHCGALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNIS 124

Query:   130 VEDMLFDEQDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRVLYT 189
              +DML+ E D+  SMDKIE ++FH+  EV GIKFWCY AGHVLGAAMFM++IAGV++LYT
Sbjct:   125 ADDMLYTETDLEESMDKIETINFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYT 184

Query:   190 GDYSREEDRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVL 249
             GD+SR+EDRHL AAE+P   PDI I ESTYG  +H+ R  RE RF + +H  +++ GR L
Sbjct:   185 GDFSRQEDRHLMAAEIPSVKPDILITESTYGTHIHEKREEREARFCNTVHDIVNREGRCL 244

Query:   250 IPAFALGRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQF 309
             IP FALGRAQELLLILDEYW NHPE H+IPIYYAS LAKKCMAVYQTY+ +MN++IR   
Sbjct:   245 IPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKAI 304

Query:   310 ANSNPFKFKHISPLNSIDDFSDVGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGY 369
               +NPF FKHIS L S+D F D+GPSVVMASPG +QSGLSR+LF+ WC+DK+N  +I GY
Sbjct:   305 NINNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGY 364

Query:   370 VVEGTLAKTIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILV 429
              VEGTLAK I+SEP+E+T M+G   PL M V YISFSAH DY QTS F++ L PP++ILV
Sbjct:   365 CVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILV 424

Query:   430 HGESHEMGRLKTKLMTELAD---CNTKIITPKNCQSVEMYFNSEKMAKTIGRLAEKTPEV 486
             HGE +EM RLK  L+ E  D    + ++  P+N ++V + F  EK+AK +G LA+K    
Sbjct:   425 HGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGSLADKKCSQ 484

Query:   487 GETVSGILVKKGFTYQIMAPDDLHIFSQLSTANITQ 522
             G+ VSGILVKK F+Y I++P DL  ++ L+ + + Q
Sbjct:   485 GQRVSGILVKKNFSYHILSPSDLSNYTDLAMSTVKQ 520


>FB|FBgn0261065
            symbol:Cpsf73 "Cleavage and polyadenylation specificity
            factor 73" species:7227 "Drosophila melanogaster" [GO:0006379 "mRNA
            cleavage" evidence=ISS] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=ISS;NAS]
            [GO:0006378 "mRNA polyadenylation" evidence=ISS;IMP] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0006398 "histone mRNA 3'-end
            processing" evidence=IMP] [GO:0003677 "DNA binding" evidence=IDA]
            [GO:0022008 "neurogenesis" evidence=IMP] InterPro:IPR001279
            Pfam:PF00753 SMART:SM00849 Pfam:PF07521 EMBL:AE014297 GO:GO:0022008
            GO:GO:0003677 GO:GO:0006378 GO:GO:0016787 GO:GO:0005847
            GO:GO:0006379 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
            Pfam:PF10996 SMART:SM01027 GO:GO:0006398 KO:K14403 OMA:YVSFSAH
            InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            GeneTree:ENSGT00700000104485 EMBL:AY119128 RefSeq:NP_650738.1
            UniGene:Dm.13714 SMR:Q9VE51 IntAct:Q9VE51 MINT:MINT-804945
            STRING:Q9VE51 EnsemblMetazoa:FBtr0083690 GeneID:42240
            KEGG:dme:Dmel_CG7698 UCSC:CG7698-RA CTD:42240 FlyBase:FBgn0261065
            InParanoid:Q9VE51 OrthoDB:EOG4P5HR4 GenomeRNAi:42240 NextBio:827838
            Uniprot:Q9VE51
        Length = 684

 Score = 1636 (581.0 bits), Expect = 3.2e-168, P = 3.2e-168
 Identities = 300/507 (59%), Positives = 381/507 (75%)

Query:    20 EGDQLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSAIDVL 79
             E D L I PLGAG EVGRSC+ + +KGK I+ DCGIHP  SGM ALPY D I+   ID+L
Sbjct:    14 ESDLLQIKPLGAGQEVGRSCIMLEFKGKKIMLDCGIHPGLSGMDALPYVDLIEADEIDLL 73

Query:    80 LITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLFDEQD 139
              I+HFHLDH  +LP+FL KT+FKGR FMTHATKAIY+ +L+DY+K+S +S E ML+ E D
Sbjct:    74 FISHFHLDHCGALPWFLMKTSFKGRCFMTHATKAIYRWMLSDYIKISNISTEQMLYTEAD 133

Query:   140 INRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDRH 199
             +  SM+KIE ++FH+  +V G++F  Y AGHVLGAAMFM++IAG+++LYTGD+SR+EDRH
Sbjct:   134 LEASMEKIETINFHEERDVMGVRFCAYIAGHVLGAAMFMIEIAGIKILYTGDFSRQEDRH 193

Query:   200 LRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRAQ 259
             L AAE+P   PD+ I ESTYG  +H+ R  RE RFT ++   + QGGR LIP FALGRAQ
Sbjct:   194 LMAAEVPPMKPDVLITESTYGTHIHEKREDRENRFTSLVQKIVQQGGRCLIPVFALGRAQ 253

Query:   260 ELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFANSNPFKFKH 319
             ELLLILDE+WS +P+ H IPIYYAS LAKKCMAVYQTYI +MN+RIR Q A +NPF F+H
Sbjct:   254 ELLLILDEFWSQNPDLHEIPIYYASSLAKKCMAVYQTYINAMNDRIRRQIAVNNPFVFRH 313

Query:   320 ISPLNSIDDFSDVGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAKTI 379
             IS L  ID F D+GP V+MASPG +QSGLSR+LF+ WC+D KN  +I GY VEGTLAK +
Sbjct:   314 ISNLKGIDHFEDIGPCVIMASPGMMQSGLSRELFESWCTDPKNGVIIAGYCVEGTLAKAV 373

Query:   380 ISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEMGRL 439
             +SEP+E+T ++G   PLNM V YISFSAH DY QTS F++ L P +++LVHGE +EM RL
Sbjct:   374 LSEPEEITTLSGQKLPLNMSVDYISFSAHTDYQQTSEFIRLLKPTHVVLVHGEQNEMSRL 433

Query:   440 KTKLMTEL-ADCNT--KIITPKNCQSVEMYFNSEKMAKTIGRLAEKTPEVGETVSGILVK 496
             K  L  E  AD +T  K   P+N  +V++YF  EK AK +G LA K  EVG  +SG+LVK
Sbjct:   434 KLALQREYEADASTDIKFYNPRNTHAVDLYFRGEKTAKVMGSLAAKNSEVGSKLSGVLVK 493

Query:   497 KGFTYQIMAPDDLHIFSQLSTANITQR 523
             + F Y ++AP DL  ++ +S + +TQR
Sbjct:   494 RDFKYHLLAPSDLGKYTDMSMSVVTQR 520


>DICTYBASE|DDB_G0274799
            symbol:cpsf3 "cleavage and polyadenylation
            specificity factor 73 kDa subunit" species:44689 "Dictyostelium
            discoideum" [GO:0016787 "hydrolase activity" evidence=IEA]
            [GO:0003723 "RNA binding" evidence=IEA;IC] [GO:0006379 "mRNA
            cleavage" evidence=ISS] [GO:0006378 "mRNA polyadenylation"
            evidence=ISS] [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=ISS] [GO:0003730 "mRNA 3'-UTR
            binding" evidence=ISS] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0006397 "mRNA processing" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0004519 "endonuclease
            activity" evidence=IEA] [GO:0004518 "nuclease activity"
            evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            dictyBase:DDB_G0274799 Pfam:PF07521 GO:GO:0046872 GO:GO:0006378
            GenomeReviews:CM000151_GR EMBL:AAFI02000012 GO:GO:0003730
            GO:GO:0004519 GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            RefSeq:XP_643926.1 ProteinModelPortal:Q86A79 SMR:Q86A79
            STRING:Q86A79 EnsemblProtists:DDB0233696 GeneID:8619353
            KEGG:ddi:DDB_G0274799 ProtClustDB:CLSZ2431003 Uniprot:Q86A79
        Length = 774

 Score = 1627 (577.8 bits), Expect = 2.9e-167, P = 2.9e-167
 Identities = 300/519 (57%), Positives = 395/519 (76%)

Query:    10 LKRRDAPVSREGDQLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFD 69
             LKR     + + D L ITP+G+G+EVGRSCV + YKGK ++FDCG+HPAYSG+ +LP+FD
Sbjct:    22 LKRPLKGGTEDDDILEITPIGSGSEVGRSCVLLKYKGKKVMFDCGVHPAYSGLVSLPFFD 81

Query:    70 EIDPSA--IDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSK 127
              I+     ID+LL++HFHLDHAA++PYF+ KT FKGRVFMTH TKAIY +LL+DYVKVS 
Sbjct:    82 SIESDIPDIDLLLVSHFHLDHAAAVPYFVGKTKFKGRVFMTHPTKAIYGMLLSDYVKVSN 141

Query:   128 VSVED-MLFDEQDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRV 186
             ++ +D MLFD+ D++RS++KIE + + Q VE NGIK  C+ AGHVLGAAMFM++IAGV++
Sbjct:   142 ITRDDDMLFDKSDLDRSLEKIEKVRYRQKVEHNGIKVTCFNAGHVLGAAMFMIEIAGVKI 201

Query:   187 LYTGDYSREEDRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGG 246
             LYTGD+SR+EDRHL  AE P    D+ IIESTYGVQ+H+PR  REKRFT  +H  + + G
Sbjct:   202 LYTGDFSRQEDRHLMGAETPPVKVDVLIIESTYGVQVHEPRLEREKRFTSSVHQVVERNG 261

Query:   247 RVLIPAFALGRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIR 306
             + LIP FALGRAQELLLILDEYW  +P+ H++PIYYAS LAKKCM VY+TYI  MN+R+R
Sbjct:   262 KCLIPVFALGRAQELLLILDEYWIANPQLHHVPIYYASALAKKCMGVYRTYINMMNDRVR 321

Query:   307 NQFANSNPFKFKHISPLNSIDDFSDVGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVI 366
              QF  SNPF+FKHI  +  I+ F D GP V MASPG LQSGLSRQLF+ WCSDK+N  VI
Sbjct:   322 AQFDVSNPFEFKHIKNIKGIESFDDRGPCVFMASPGMLQSGLSRQLFERWCSDKRNGIVI 381

Query:   367 PGYVVEGTLAKTIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNI 426
             PGY VEGTLAK I+SEP E+T ++ +  PLN+ V Y+SFSAH+D+ QTS F++E+ PP++
Sbjct:   382 PGYSVEGTLAKHIMSEPAEITRLDNVNVPLNLTVSYVSFSAHSDFLQTSEFIQEIQPPHV 441

Query:   427 ILVHGESHEMGRLKTKLMTELADCNTKIITPKNCQSVEMYFNSEKMAKTIGRLAEKTPEV 486
             +LVHG+++EM RL+  L+ +    N  ++TPKN  SV + F  EK+AKT+G +    P+ 
Sbjct:   442 VLVHGDANEMSRLRQSLVAKFKTIN--VLTPKNAMSVALEFRPEKVAKTLGSIITNPPKQ 499

Query:   487 GETVSGILVKKGFTYQIMAPDDLHIFSQLSTANITQRIT 525
              + + GILV K FT+ I++  D+H ++ L T  I Q++T
Sbjct:   500 NDIIQGILVTKDFTHHILSASDIHNYTNLKTNIIKQKLT 538


>UNIPROTKB|G5E9W3
            symbol:CPSF3 "Cleavage and polyadenylation-specificity
            factor subunit 3" species:9606 "Homo sapiens" [GO:0003723 "RNA
            binding" evidence=IEA] [GO:0004521 "endoribonuclease activity"
            evidence=IEA] [GO:0008409 "5'-3' exonuclease activity"
            evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 EMBL:CH471053 GO:GO:0003723 GO:GO:0004521
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            InterPro:IPR021718 Pfam:PF11718 SMART:SM01098 GO:GO:0008409
            EMBL:AC080162 UniGene:Hs.515972 HGNC:HGNC:2326 ChiTaRS:CPSF3
            ProteinModelPortal:G5E9W3 SMR:G5E9W3 PRIDE:G5E9W3
            Ensembl:ENST00000460593 ArrayExpress:G5E9W3 Bgee:G5E9W3
            Uniprot:G5E9W3
        Length = 647

 Score = 1612 (572.5 bits), Expect = 1.1e-165, P = 1.1e-165
 Identities = 296/476 (62%), Positives = 370/476 (77%)

Query:    50 LFDCGIHPAYSGMAALPYFDEIDPSAIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTH 109
             + DCGIHP   GM ALPY D IDP+ ID+LLI+HFHLDH  +LP+FL+KT+FKGR FMTH
Sbjct:     1 MLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTSFKGRTFMTH 60

Query:   110 ATKAIYKLLLTDYVKVSKVSVEDMLFDEQDINRSMDKIEVLDFHQTVEVNGIKFWCYTAG 169
             ATKAIY+ LL+DYVKVS +S +DML+ E D+  SMDKIE ++FH+  EV GIKFWCY AG
Sbjct:    61 ATKAIYRWLLSDYVKVSNISADDMLYTETDLEESMDKIETINFHEVKEVAGIKFWCYHAG 120

Query:   170 HVLGAAMFMVDIAGVRVLYTGDYSREEDRHLRAAELPQFSPDICIIESTYGVQLHQPRNI 229
             HVLGAAMFM++IAGV++LYTGD+SR+EDRHL AAE+P   PDI IIESTYG  +H+ R  
Sbjct:   121 HVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREE 180

Query:   230 REKRFTDVIHSTISQGGRVLIPAFALGRAQELLLILDEYWSNHPEFHNIPIYYASPLAKK 289
             RE RF + +H  +++GGR LIP FALGRAQELLLILDEYW NHPE H+IPIYYAS LAKK
Sbjct:   181 REARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKK 240

Query:   290 CMAVYQTYILSMNERIRNQFANSNPFKFKHISPLNSIDDFSDVGPSVVMASPGGLQSGLS 349
             CMAVYQTY+ +MN++IR Q   +NPF FKHIS L S+D F D+GPSVVMASPG +QSGLS
Sbjct:   241 CMAVYQTYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLS 300

Query:   350 RQLFDIWCSDKKNACVIPGYVVEGTLAKTIISEPKEVTLMNGLTAPLNMQVHYISFSAHA 409
             R+LF+ WC+DK+N  +I GY VEGTLAK I+SEP+E+T M+G   PL M V YISFSAH 
Sbjct:   301 RELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHT 360

Query:   410 DYAQTSTFLKELMPPNIILVHGESHEMGRLKTKLMTELAD---CNTKIITPKNCQSVEMY 466
             DY QTS F++ L PP++ILVHGE +EM RLK  L+ E  D    + ++  P+N ++V + 
Sbjct:   361 DYQQTSEFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEAVTLN 420

Query:   467 FNSEKMAKTIGRLAEKTPEVGETVSGILVKKGFTYQIMAPDDLHIFSQLSTANITQ 522
             F  EK+AK +G LA+K PE G+ VSGILVK+ F Y I++P DL  ++ L+ + + Q
Sbjct:   421 FRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILSPCDLSNYTDLAMSTVKQ 476


>WB|WBGene00013460
            symbol:cpsf-3 species:6239 "Caenorhabditis elegans"
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0009792 "embryo
            development ending in birth or egg hatching" evidence=IMP]
            [GO:0040007 "growth" evidence=IMP] [GO:0002119 "nematode larval
            development" evidence=IMP] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 GO:GO:0009792 GO:GO:0040007
            GO:GO:0002119 GO:GO:0016787 eggNOG:COG1236 InterPro:IPR022712
            InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 HOGENOM:HOG000203394
            KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            GeneTree:ENSGT00700000104485 EMBL:AL132951 RefSeq:NP_502553.2
            ProteinModelPortal:Q95PY8 SMR:Q95PY8 STRING:Q95PY8 PaxDb:Q95PY8
            EnsemblMetazoa:Y67H2A.1.1 EnsemblMetazoa:Y67H2A.1.2 GeneID:178285
            KEGG:cel:CELE_Y67H2A.1 UCSC:Y67H2A.1 CTD:178285 WormBase:Y67H2A.1
            InParanoid:Q95PY8 NextBio:900506 Uniprot:Q95PY8
        Length = 707

 Score = 1577 (560.2 bits), Expect = 5.7e-162, P = 5.7e-162
 Identities = 285/508 (56%), Positives = 375/508 (73%)

Query:    22 DQLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSAIDVLLI 81
             D L  TPLG+G EVGRSC  + YKGK ++ DCG+HP   G+ ALP+ D ++   ID+LLI
Sbjct:     9 DSLCFTPLGSGQEVGRSCHLLEYKGKRVMLDCGVHPGLHGVDALPFVDFVEIENIDLLLI 68

Query:    82 THFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVED--MLFDEQD 139
             THFHLDH  +LP+ L+KT F+G+ FMTHATKAIY++LL DYV++SK    D   L+ E D
Sbjct:    69 THFHLDHCGALPWLLQKTAFQGKCFMTHATKAIYRMLLGDYVRISKYGGPDRNQLYTEDD 128

Query:   140 INRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDRH 199
             + +SM KIE +DF +  EVNGI+FW Y AGHVLGA  FM++IAGVRVLYTGD+S  EDRH
Sbjct:   129 LEKSMAKIETIDFREQKEVNGIRFWPYVAGHVLGACQFMIEIAGVRVLYTGDFSCLEDRH 188

Query:   200 LRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRAQ 259
             L AAE+P  +P + I ESTYG Q H+ R +REKRFT ++H  +++GGR LIPAFA+G AQ
Sbjct:   189 LCAAEIPPITPQVLITESTYGTQTHEDRAVREKRFTQMVHDIVTRGGRCLIPAFAIGPAQ 248

Query:   260 ELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFANSNPFKFKH 319
             EL+LILDEYW +H E H+IP+YYAS LAKKCM+VYQT++  MN RI+ Q A  NPF FKH
Sbjct:   249 ELMLILDEYWESHQELHDIPVYYASSLAKKCMSVYQTFVNGMNSRIQKQIAVKNPFIFKH 308

Query:   320 ISPLNSIDDFSDVGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAKTI 379
             +S L  +D F D GP VV+A+PG LQSG SR+LF+ WC D KN C+I GY VEGTLAK I
Sbjct:   309 VSTLRGMDQFEDAGPCVVLATPGMLQSGFSRELFESWCPDTKNGCIIAGYCVEGTLAKHI 368

Query:   380 ISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEMGRL 439
             +SEP+E+  ++G   P+ MQV Y+SFSAH DY QTS F+K L PP+++LVHGE HEM RL
Sbjct:   369 LSEPEEIVSLSGEKLPMRMQVGYVSFSAHTDYHQTSNFVKALKPPHLVLVHGELHEMSRL 428

Query:   440 KTKLMTELADCNT--KIITPKNCQSVEMYFNSEKMAKTIGRLAEKTPEVGETVSGILVKK 497
             K+ +  +  D N   ++  P+N + +++ F  EK AK IG+LA++ PE  ET+SG+LVK 
Sbjct:   429 KSGIERQFQDDNIPIEVHNPRNTERLQLQFRGEKTAKVIGKLAQRVPENNETISGVLVKN 488

Query:   498 GFTYQIMAPDDLHIFSQLSTANITQRIT 525
              F+Y IM P++L  ++ L  +++ QR++
Sbjct:   489 NFSYSIMVPEELGSYTSLRISSLEQRMS 516


>POMBASE|SPAC17G6.16c
            symbol:ysh1 "mRNA cleavage and polyadenylation
            specificity factor complex endoribonuclease subunit Ysh1"
            species:4896 "Schizosaccharomyces pombe" [GO:0004521
            "endoribonuclease activity" evidence=ISO] [GO:0005634 "nucleus"
            evidence=IDA] [GO:0005829 "cytosol" evidence=IDA] [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=IDA] [GO:0006378 "mRNA polyadenylation" evidence=IC]
            [GO:0006379 "mRNA cleavage" evidence=IC] [GO:0046872 "metal ion
            binding" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 PomBase:SPAC17G6.16c Pfam:PF07521 GO:GO:0005829
            EMBL:CU329670 GenomeReviews:CU329670_GR GO:GO:0046872 GO:GO:0006378
            GO:GO:0004521 GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            HOGENOM:HOG000203394 InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            OrthoDB:EOG41ZJK7 PIR:T37848 RefSeq:NP_594263.2 STRING:O13794
            EnsemblFungi:SPAC17G6.16c.1 GeneID:2542258 NextBio:20803322
            Uniprot:O13794
        Length = 757

 Score = 1492 (530.3 bits), Expect = 5.8e-153, P = 5.8e-153
 Identities = 276/512 (53%), Positives = 371/512 (72%)

Query:    14 DAPVSREGDQLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDP 73
             DAPV    D L    LGAGNEVGRSC  + YKGKT++ D G+HPAY+G++ALP+FDE D 
Sbjct:    10 DAPVD-PSDLLEFINLGAGNEVGRSCHVIQYKGKTVMLDAGVHPAYTGLSALPFFDEFDL 68

Query:    74 SAIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDM 133
             S +DVLLI+HFHLDH ASLPY ++KT F+GRVFMTH TKA+ K LL+DYVKVS V +ED 
Sbjct:    69 STVDVLLISHFHLDHVASLPYVMQKTNFRGRVFMTHPTKAVCKWLLSDYVKVSNVGMEDQ 128

Query:   134 LFDEQDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYS 193
             L+DE+D+  + D+IE +D+H T+EV GIKF  Y AGHVLGA M+ V++AGV +L+TGDYS
Sbjct:   129 LYDEKDLLAAFDRIEAVDYHSTIEVEGIKFTPYHAGHVLGACMYFVEMAGVNILFTGDYS 188

Query:   194 REEDRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAF 253
             REEDRHL  AE+P   PD+ I ESTYG   HQPR  +E R  ++IHSTI  GGRVL+P F
Sbjct:   189 REEDRHLHVAEVPPKRPDVLITESTYGTASHQPRLEKEARLLNIIHSTIRNGGRVLMPVF 248

Query:   254 ALGRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFANSN 313
             ALGRAQELLLILDEYW+NH +  ++PIYYAS LA+KCMA++QTY+  MN+ IR  FA  N
Sbjct:   249 ALGRAQELLLILDEYWNNHLDLRSVPIYYASSLARKCMAIFQTYVNMMNDNIRKIFAERN 308

Query:   314 PFKFKHISPLNSIDDFSDVGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEG 373
             PF F+ +  L +++ F D+GPSV++ASPG LQ+G+SR L + W  D +N  ++ GY VEG
Sbjct:   309 PFIFRFVKSLRNLEKFDDIGPSVILASPGMLQNGVSRTLLERWAPDPRNTLLLTGYSVEG 368

Query:   374 TLAKTIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGES 433
             T+AK I +EP E+  ++G   P  M V  +SF+AH DY Q S F+  +   +IILVHGE 
Sbjct:   369 TMAKQITNEPIEIVSLSGQKIPRRMAVEELSFAAHVDYLQNSEFIDLVNADHIILVHGEQ 428

Query:   434 HEMGRLKTKLMTELAD--CNTKIITPKNCQSVEMYFNSEKMAKTIGRLAEKTPEVGETVS 491
               MGRLK+ L ++  +   + K+ TP+NC  + + F  E++ + +G++A   P+ G+ +S
Sbjct:   429 TNMGRLKSALASKFHNRKVDVKVYTPRNCVPLYLPFKGERLVRALGKVAVHKPKEGDIMS 488

Query:   492 GILVKKGFTYQIMAPDDLHIFSQLSTANITQR 523
             GIL++K   Y++M+ +DL  FS L+T  +TQ+
Sbjct:   489 GILIQKDANYKLMSAEDLRDFSDLTTTVLTQK 520


>SGD|S000004267
            symbol:YSH1 "Putative endoribonuclease" species:4932
            "Saccharomyces cerevisiae" [GO:0004518 "nuclease activity"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0006397 "mRNA processing" evidence=IEA]
            [GO:0006378 "mRNA polyadenylation" evidence=IMP] [GO:0006379 "mRNA
            cleavage" evidence=IMP] [GO:0006369 "termination of RNA polymerase
            II transcription" evidence=IMP] [GO:0031126 "snoRNA 3'-end
            processing" evidence=IMP] [GO:0008380 "RNA splicing" evidence=IMP]
            [GO:0034247 "snoRNA splicing" evidence=IMP] [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=IDA;IPI] [GO:0005849 "mRNA cleavage factor complex"
            evidence=IPI] [GO:0004521 "endoribonuclease activity"
            evidence=ISS;IMP] [GO:0003723 "RNA binding" evidence=IC]
            [GO:0004519 "endonuclease activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 SGD:S000004267
            Pfam:PF07521 GO:GO:0046872 GO:GO:0006378 EMBL:BK006945
            GO:GO:0004521 GO:GO:0005847 GO:GO:0006379 GO:GO:0006369
            eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 EMBL:U17245 HOGENOM:HOG000203394 KO:K14403
            InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            GeneTree:ENSGT00700000104485 GO:GO:0031126 GO:GO:0034247
            OMA:EISFAAH OrthoDB:EOG41ZJK7 PIR:S51413 RefSeq:NP_013379.1
            ProteinModelPortal:Q06224 SMR:Q06224 DIP:DIP-2470N IntAct:Q06224
            MINT:MINT-375457 STRING:Q06224 PaxDb:Q06224 PeptideAtlas:Q06224
            EnsemblFungi:YLR277C GeneID:850983 KEGG:sce:YLR277C CYGD:YLR277c
            NextBio:967501 Genevestigator:Q06224 GermOnline:YLR277C
            Uniprot:Q06224
        Length = 779

 Score = 1337 (475.7 bits), Expect = 1.5e-136, P = 1.5e-136
 Identities = 254/474 (53%), Positives = 343/474 (72%)

Query:    29 LGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSAIDVLLITHFHLDH 88
             LG  NEVGRSC  + YKGKT++ D GIHPAY G+A+LP++DE D S +D+LLI+HFHLDH
Sbjct:    14 LGGSNEVGRSCHILQYKGKTVMLDAGIHPAYQGLASLPFYDEFDLSKVDILLISHFHLDH 73

Query:    89 AASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKV--SVEDM------LFDEQDI 140
             AASLPY +++T F+GRVFMTH TKAIY+ LL D+V+V+ +  S   M      LF ++D+
Sbjct:    74 AASLPYVMQRTNFQGRVFMTHPTKAIYRWLLRDFVRVTSIGSSSSSMGTKDEGLFSDEDL 133

Query:   141 NRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDRHL 200
               S DKIE +D+H TV+VNGIKF  + AGHVLGAAMF ++IAG+RVL+TGDYSRE DRHL
Sbjct:   134 VDSFDKIETVDYHSTVDVNGIKFTAFHAGHVLGAAMFQIEIAGLRVLFTGDYSREVDRHL 193

Query:   201 RAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRAQE 260
              +AE+P  S ++ I+EST+G   H+PR  RE++ T +IHST+ +GGRVL+P FALGRAQE
Sbjct:   194 NSAEVPPLSSNVLIVESTFGTATHEPRLNRERKLTQLIHSTVMRGGRVLLPVFALGRAQE 253

Query:   261 LLLILDEYWSNHPEF---HNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFANS--NPF 315
             ++LILDEYWS H +      +PI+YAS LAKKCM+V+QTY+  MN+ IR +F +S  NPF
Sbjct:   254 IMLILDEYWSQHADELGGGQVPIFYASNLAKKCMSVFQTYVNMMNDDIRKKFRDSQTNPF 313

Query:   316 KFKHISPLNSIDDFSDVGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTL 375
              FK+IS L +++DF D GPSV++ASPG LQSGLSR L + WC + KN  +I GY +EGT+
Sbjct:   314 IFKNISYLRNLEDFQDFGPSVMLASPGMLQSGLSRDLLERWCPEDKNLVLITGYSIEGTM 373

Query:   376 AKTIISEPKEVTLMNG--LTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGES 433
             AK I+ EP  +  +N   +T P   QV  ISF+AH D+ +   F++++  PNIILVHGE+
Sbjct:   374 AKFIMLEPDTIPSINNPEITIPRRCQVEEISFAAHVDFQENLEFIEKISAPNIILVHGEA 433

Query:   434 HEMGRLKTKLMTELA-----DCNTKIITPKNCQSVEMYFNSEKMAKTIGRLAEK 482
             + MGRLK+ L++  A     D    +  P+NC  V++ F   K+AK +G +  +
Sbjct:   434 NPMGRLKSALLSNFASLKGTDNEVHVFNPRNCVEVDLEFQGVKVAKAVGNIVNE 487


>CGD|CAL0005344
            symbol:orf19.5486 species:5476 "Candida albicans" [GO:0042493
            "response to drug" evidence=IMP] [GO:0004521 "endoribonuclease
            activity" evidence=IEA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA]
            [GO:0005829 "cytosol" evidence=IEA] [GO:0034247 "snoRNA splicing"
            evidence=IEA] [GO:0031126 "snoRNA 3'-end processing" evidence=IEA]
            [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006369 "termination
            of RNA polymerase II transcription" evidence=IEA] [GO:0006378 "mRNA
            polyadenylation" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 CGD:CAL0005344 Pfam:PF07521 GO:GO:0005634
            GO:GO:0042493 GO:GO:0046872 GO:GO:0006397 GO:GO:0090305
            GO:GO:0004519 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
            Pfam:PF10996 SMART:SM01027 KO:K14403 InterPro:IPR021718
            Pfam:PF11718 SMART:SM01098 EMBL:AACQ01000196 EMBL:AACQ01000195
            RefSeq:XP_711478.1 RefSeq:XP_711502.1 ProteinModelPortal:Q59P50
            STRING:Q59P50 GeneID:3646887 GeneID:3646911 KEGG:cal:CaO19.12941
            KEGG:cal:CaO19.5486 Uniprot:Q59P50
        Length = 870

 Score = 1245 (443.3 bits), Expect = 1.1e-133, Sum P(2) = 1.1e-133
 Identities = 241/478 (50%), Positives = 336/478 (70%)

Query:    29 LGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSAIDVLLITHFHLDH 88
             LG  NEVGRSC  + YK K I+ D G+HPA SG A+ PYFDE D S +D+LLI+HFH+DH
Sbjct:   105 LGGCNEVGRSCHIIEYKNKVIMLDSGMHPALSGHASFPYFDEYDISKVDILLISHFHVDH 164

Query:    89 AASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVS---VED-------MLFDEQ 138
             +ASLPY ++++ F+G+VFMTHATKAIY+ L+ D+V+V+ +     ED        L+ + 
Sbjct:   165 SASLPYVMQQSNFRGKVFMTHATKAIYRWLMQDFVRVTSIGNSRSEDGGGGEGSNLYTDD 224

Query:   139 DINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDR 198
             DI +S D+IE +D+H T+E++GI+F  Y AGHVLGA M+ ++I G++VL+TGDYSREE+R
Sbjct:   225 DIMKSFDRIETIDYHSTMEIDGIRFTAYHAGHVLGACMYFIEIGGLKVLFTGDYSREENR 284

Query:   199 HLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRA 258
             HL AAE+P   PDI I EST+G    +PR   E++ T  IH+TI++GGRVL+P FALG A
Sbjct:   285 HLHAAEVPPLKPDILISESTFGTGTLEPRIELERKLTTHIHATIAKGGRVLLPVFALGNA 344

Query:   259 QELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFANS---NPF 315
             QELLLILDEYWS + +  N+ ++YAS LAKKCMAVY+TY   MN++IR   A+S   NPF
Sbjct:   345 QELLLILDEYWSQNEDLQNVNVFYASNLAKKCMAVYETYTGIMNDKIRLSSASSEKSNPF 404

Query:   316 KFKHISPLNSIDDFSDVGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTL 375
              FK+I  +  +  F D+GPSVV+A+PG LQ+G+SRQL + W  D KN  ++ GY VEGT+
Sbjct:   405 DFKYIKSIKDLSKFQDMGPSVVVATPGMLQAGVSRQLLEKWAPDGKNLVILTGYSVEGTM 464

Query:   376 AKTIISEPKEV-TLMN-GLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGES 433
             AK ++ EP  + +  N  +T P  + +  ISF+AH D+ Q S F++++ P  +ILVHG+S
Sbjct:   465 AKELLKEPTMIQSATNPDMTIPRRIGIEEISFAAHVDFQQNSEFIEKVSPSKVILVHGDS 524

Query:   434 HEMGRLKTKLMTELA-----DCNTKIITPKNCQSVEMYFNSEKMAKTIGRLAEKTPEV 486
               MGRLK+ L+++ A     D   K+  PKNC+ + + F   K+AK +G LAE+  +V
Sbjct:   525 VPMGRLKSALLSKYASRKGTDQEVKVYNPKNCEELIIGFKGLKIAKVLGSLAEEQLQV 582

 Score = 86 (35.3 bits), Expect = 1.1e-133, Sum P(2) = 1.1e-133
 Identities = 16/40 (40%), Positives = 26/40 (65%)

Query:   485 EVGETVSGILVKKGFTYQIMAPDDLHIFSQLSTANITQRI 524
             + G+ VSG+LV K F   ++   DLH F+QLST+ +  ++
Sbjct:   633 KTGQVVSGVLVSKDFNLNLLQLQDLHEFTQLSTSIVKSKM 672


>UNIPROTKB|Q59P50
            symbol:YSH1 "Endoribonuclease YSH1" species:237561 "Candida
            albicans SC5314" [GO:0042493 "response to drug" evidence=IMP]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 CGD:CAL0005344
            Pfam:PF07521 GO:GO:0005634 GO:GO:0042493 GO:GO:0046872
            GO:GO:0006397 GO:GO:0090305 GO:GO:0004519 eggNOG:COG1236
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            KO:K14403 InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
            EMBL:AACQ01000196 EMBL:AACQ01000195 RefSeq:XP_711478.1
            RefSeq:XP_711502.1 ProteinModelPortal:Q59P50 STRING:Q59P50
            GeneID:3646887 GeneID:3646911 KEGG:cal:CaO19.12941
            KEGG:cal:CaO19.5486 Uniprot:Q59P50
        Length = 870

 Score = 1245 (443.3 bits), Expect = 1.1e-133, Sum P(2) = 1.1e-133
 Identities = 241/478 (50%), Positives = 336/478 (70%)

Query:    29 LGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSAIDVLLITHFHLDH 88
             LG  NEVGRSC  + YK K I+ D G+HPA SG A+ PYFDE D S +D+LLI+HFH+DH
Sbjct:   105 LGGCNEVGRSCHIIEYKNKVIMLDSGMHPALSGHASFPYFDEYDISKVDILLISHFHVDH 164

Query:    89 AASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVS---VED-------MLFDEQ 138
             +ASLPY ++++ F+G+VFMTHATKAIY+ L+ D+V+V+ +     ED        L+ + 
Sbjct:   165 SASLPYVMQQSNFRGKVFMTHATKAIYRWLMQDFVRVTSIGNSRSEDGGGGEGSNLYTDD 224

Query:   139 DINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDR 198
             DI +S D+IE +D+H T+E++GI+F  Y AGHVLGA M+ ++I G++VL+TGDYSREE+R
Sbjct:   225 DIMKSFDRIETIDYHSTMEIDGIRFTAYHAGHVLGACMYFIEIGGLKVLFTGDYSREENR 284

Query:   199 HLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRA 258
             HL AAE+P   PDI I EST+G    +PR   E++ T  IH+TI++GGRVL+P FALG A
Sbjct:   285 HLHAAEVPPLKPDILISESTFGTGTLEPRIELERKLTTHIHATIAKGGRVLLPVFALGNA 344

Query:   259 QELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFANS---NPF 315
             QELLLILDEYWS + +  N+ ++YAS LAKKCMAVY+TY   MN++IR   A+S   NPF
Sbjct:   345 QELLLILDEYWSQNEDLQNVNVFYASNLAKKCMAVYETYTGIMNDKIRLSSASSEKSNPF 404

Query:   316 KFKHISPLNSIDDFSDVGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTL 375
              FK+I  +  +  F D+GPSVV+A+PG LQ+G+SRQL + W  D KN  ++ GY VEGT+
Sbjct:   405 DFKYIKSIKDLSKFQDMGPSVVVATPGMLQAGVSRQLLEKWAPDGKNLVILTGYSVEGTM 464

Query:   376 AKTIISEPKEV-TLMN-GLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGES 433
             AK ++ EP  + +  N  +T P  + +  ISF+AH D+ Q S F++++ P  +ILVHG+S
Sbjct:   465 AKELLKEPTMIQSATNPDMTIPRRIGIEEISFAAHVDFQQNSEFIEKVSPSKVILVHGDS 524

Query:   434 HEMGRLKTKLMTELA-----DCNTKIITPKNCQSVEMYFNSEKMAKTIGRLAEKTPEV 486
               MGRLK+ L+++ A     D   K+  PKNC+ + + F   K+AK +G LAE+  +V
Sbjct:   525 VPMGRLKSALLSKYASRKGTDQEVKVYNPKNCEELIIGFKGLKIAKVLGSLAEEQLQV 582

 Score = 86 (35.3 bits), Expect = 1.1e-133, Sum P(2) = 1.1e-133
 Identities = 16/40 (40%), Positives = 26/40 (65%)

Query:   485 EVGETVSGILVKKGFTYQIMAPDDLHIFSQLSTANITQRI 524
             + G+ VSG+LV K F   ++   DLH F+QLST+ +  ++
Sbjct:   633 KTGQVVSGVLVSKDFNLNLLQLQDLHEFTQLSTSIVKSKM 672


>GENEDB_PFALCIPARUM|PF14_0364
            symbol:PF14_0364 "cleavage and polyadenylation
            specificity factor protein, putative" species:5833 "Plasmodium
            falciparum" [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=ISS] [GO:0006378 "mRNA
            polyadenylation" evidence=ISS] [GO:0006379 "mRNA cleavage"
            evidence=ISS] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 GO:GO:0016787 EMBL:AE014187 GO:GO:0005847
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            HOGENOM:HOG000203394 KO:K14403 InterPro:IPR021718 Pfam:PF11718
            SMART:SM01098 RefSeq:XP_001348538.1 ProteinModelPortal:Q8IL83
            PRIDE:Q8IL83 EnsemblProtists:PF14_0364:mRNA GeneID:811946
            KEGG:pfa:PF14_0364 EuPathDB:PlasmoDB:PF3D7_1438500 OMA:CLITHFH
            ProtClustDB:CLSZ2457730 Uniprot:Q8IL83
        Length = 876

 Score = 812 (290.9 bits), Expect = 1.2e-114, Sum P(3) = 1.2e-114
 Identities = 163/388 (42%), Positives = 245/388 (63%)

Query:   132 DMLFDEQDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRVLYTGD 191
             ++L+DE DI+++MD IE L+FHQ  E   +KF  Y AGHV+GA MF+V+I  +R LYTGD
Sbjct:   165 NVLYDENDIDKTMDLIETLNFHQNFEFPNVKFTAYRAGHVIGACMFLVEINNIRFLYTGD 224

Query:   192 YSREEDRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIP 251
             YSRE DRH+  AE+P     + I E TYG+++H  R  RE RF +++ S I+  G+VL+P
Sbjct:   225 YSREIDRHIPIAEIPNIDVHVLICEGTYGIKVHDDRKKREIRFLNILTSMINNKGKVLLP 284

Query:   252 AFALGRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQF-A 310
              FALGRAQELLLIL+E+W  +    NIPI+Y S +A K + +Y+T+I    E ++     
Sbjct:   285 VFALGRAQELLLILEEHWDKNKHLQNIPIFYISSMATKSLCIYETFINLCGEFVKKVVNE 344

Query:   311 NSNPFKFKHISPLNSIDDFS-----DVGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACV 365
               NPF FK++    S++  S     D  P V+MASPG LQ+G+S+ +F+I  SDKK+  +
Sbjct:   345 GKNPFNFKYVKYAKSLESISSYLYQDNNPCVIMASPGMLQNGISKNIFNIIASDKKSGVI 404

Query:   366 IPGYVVEGTLAKTIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPN 425
             + GY V+GTLA  + +EP+ VT+ N        +   ISFSAH+D+ QT TF+++L  PN
Sbjct:   405 LTGYTVKGTLADELKTEPEFVTI-NDKVVKRKCRFEQISFSAHSDFNQTKTFIEKLKCPN 463

Query:   426 IILVHGESHEMGRLKTKLMTELADCNTKIITPKNCQSVEMYFNSEKMAKTIGRLAEKTPE 485
             ++LVHG+ +E+ RLK KL+ E    +  + TP+  Q +  +F       ++G+L+E   +
Sbjct:   464 VVLVHGDKNELNRLKNKLIEEKQYLS--VFTPELLQKLSFHFEQNDSLISLGKLSEHIKK 521

Query:   486 VGETVS--GILVKKGFTYQIMAPDDLHI 511
             + + +   G+ +KK    + M  +D HI
Sbjct:   522 INKKIKLEGLKMKK----EKMIANDEHI 545

 Score = 301 (111.0 bits), Expect = 1.2e-114, Sum P(3) = 1.2e-114
 Identities = 53/102 (51%), Positives = 71/102 (69%)

Query:    26 ITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSAIDVLLITHFH 85
             I  LG  +EVGRSCV +     +++ DCGIHPA+ G+  LP +D  D S +D+ LITHFH
Sbjct:     6 IVCLGGASEVGRSCVIIECDKTSVMLDCGIHPAFMGIGCLPIYDAYDISKVDLCLITHFH 65

Query:    86 LDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSK 127
             +DH+ +LPY + KT FKGR+FMT ATK+I  LL  DY ++ K
Sbjct:    66 MDHSGALPYLINKTRFKGRIFMTEATKSICYLLWNDYARIEK 107

 Score = 52 (23.4 bits), Expect = 1.2e-114, Sum P(3) = 1.2e-114
 Identities = 12/47 (25%), Positives = 25/47 (53%)

Query:   479 LAEKTPEVGETVSGILVKKGFTYQIMA-PDDLHIFSQLSTANITQRI 524
             ++ +   V   + GI++ +     I+  P+D++ ++ L TA I Q I
Sbjct:   583 ISNEKHNVNNQIEGIIITEPQNVPILIYPNDIYEYTNLKTAMIDQTI 629

 Score = 37 (18.1 bits), Expect = 4.6e-24, Sum P(3) = 4.6e-24
 Identities = 12/45 (26%), Positives = 23/45 (51%)

Query:   116 KLLLTD-YVKVSKVSVEDMLFDEQDINRSMDKIEVLDFHQTVEVN 159
             K++  D ++ V K  + D+  DE+++  S  K   +D H    +N
Sbjct:   537 KMIANDEHISV-KNEMGDINNDEENLQISDKKKNKVDEHDKHNIN 580


>UNIPROTKB|Q8IL83
            symbol:PF14_0364 "Cleavage and polyadenylation specificity
            factor protein, putative" species:36329 "Plasmodium falciparum 3D7"
            [GO:0005847 "mRNA cleavage and polyadenylation specificity factor
            complex" evidence=ISS] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 GO:GO:0016787 EMBL:AE014187
            GO:GO:0005847 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 HOGENOM:HOG000203394 KO:K14403 InterPro:IPR021718
            Pfam:PF11718 SMART:SM01098 RefSeq:XP_001348538.1
            ProteinModelPortal:Q8IL83 PRIDE:Q8IL83
            EnsemblProtists:PF14_0364:mRNA GeneID:811946 KEGG:pfa:PF14_0364
            EuPathDB:PlasmoDB:PF3D7_1438500 OMA:CLITHFH ProtClustDB:CLSZ2457730
            Uniprot:Q8IL83
        Length = 876

 Score = 812 (290.9 bits), Expect = 1.2e-114, Sum P(3) = 1.2e-114
 Identities = 163/388 (42%), Positives = 245/388 (63%)

Query:   132 DMLFDEQDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRVLYTGD 191
             ++L+DE DI+++MD IE L+FHQ  E   +KF  Y AGHV+GA MF+V+I  +R LYTGD
Sbjct:   165 NVLYDENDIDKTMDLIETLNFHQNFEFPNVKFTAYRAGHVIGACMFLVEINNIRFLYTGD 224

Query:   192 YSREEDRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIP 251
             YSRE DRH+  AE+P     + I E TYG+++H  R  RE RF +++ S I+  G+VL+P
Sbjct:   225 YSREIDRHIPIAEIPNIDVHVLICEGTYGIKVHDDRKKREIRFLNILTSMINNKGKVLLP 284

Query:   252 AFALGRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQF-A 310
              FALGRAQELLLIL+E+W  +    NIPI+Y S +A K + +Y+T+I    E ++     
Sbjct:   285 VFALGRAQELLLILEEHWDKNKHLQNIPIFYISSMATKSLCIYETFINLCGEFVKKVVNE 344

Query:   311 NSNPFKFKHISPLNSIDDFS-----DVGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACV 365
               NPF FK++    S++  S     D  P V+MASPG LQ+G+S+ +F+I  SDKK+  +
Sbjct:   345 GKNPFNFKYVKYAKSLESISSYLYQDNNPCVIMASPGMLQNGISKNIFNIIASDKKSGVI 404

Query:   366 IPGYVVEGTLAKTIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPN 425
             + GY V+GTLA  + +EP+ VT+ N        +   ISFSAH+D+ QT TF+++L  PN
Sbjct:   405 LTGYTVKGTLADELKTEPEFVTI-NDKVVKRKCRFEQISFSAHSDFNQTKTFIEKLKCPN 463

Query:   426 IILVHGESHEMGRLKTKLMTELADCNTKIITPKNCQSVEMYFNSEKMAKTIGRLAEKTPE 485
             ++LVHG+ +E+ RLK KL+ E    +  + TP+  Q +  +F       ++G+L+E   +
Sbjct:   464 VVLVHGDKNELNRLKNKLIEEKQYLS--VFTPELLQKLSFHFEQNDSLISLGKLSEHIKK 521

Query:   486 VGETVS--GILVKKGFTYQIMAPDDLHI 511
             + + +   G+ +KK    + M  +D HI
Sbjct:   522 INKKIKLEGLKMKK----EKMIANDEHI 545

 Score = 301 (111.0 bits), Expect = 1.2e-114, Sum P(3) = 1.2e-114
 Identities = 53/102 (51%), Positives = 71/102 (69%)

Query:    26 ITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSAIDVLLITHFH 85
             I  LG  +EVGRSCV +     +++ DCGIHPA+ G+  LP +D  D S +D+ LITHFH
Sbjct:     6 IVCLGGASEVGRSCVIIECDKTSVMLDCGIHPAFMGIGCLPIYDAYDISKVDLCLITHFH 65

Query:    86 LDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSK 127
             +DH+ +LPY + KT FKGR+FMT ATK+I  LL  DY ++ K
Sbjct:    66 MDHSGALPYLINKTRFKGRIFMTEATKSICYLLWNDYARIEK 107

 Score = 52 (23.4 bits), Expect = 1.2e-114, Sum P(3) = 1.2e-114
 Identities = 12/47 (25%), Positives = 25/47 (53%)

Query:   479 LAEKTPEVGETVSGILVKKGFTYQIMA-PDDLHIFSQLSTANITQRI 524
             ++ +   V   + GI++ +     I+  P+D++ ++ L TA I Q I
Sbjct:   583 ISNEKHNVNNQIEGIIITEPQNVPILIYPNDIYEYTNLKTAMIDQTI 629

 Score = 37 (18.1 bits), Expect = 4.6e-24, Sum P(3) = 4.6e-24
 Identities = 12/45 (26%), Positives = 23/45 (51%)

Query:   116 KLLLTD-YVKVSKVSVEDMLFDEQDINRSMDKIEVLDFHQTVEVN 159
             K++  D ++ V K  + D+  DE+++  S  K   +D H    +N
Sbjct:   537 KMIANDEHISV-KNEMGDINNDEENLQISDKKKNKVDEHDKHNIN 580


>ASPGD|ASPL0000060573
            symbol:AN0990 species:162425 "Emericella nidulans"
            [GO:0008150 "biological_process" evidence=ND] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA]
            [GO:0005829 "cytosol" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 GO:GO:0005634 EMBL:BN001308
            GO:GO:0046872 GO:GO:0006397 GO:GO:0090305 GO:GO:0004519
            eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 HOGENOM:HOG000203394 KO:K14403 InterPro:IPR021718
            Pfam:PF11718 SMART:SM01098 EMBL:AACD01000015 RefSeq:XP_658594.1
            ProteinModelPortal:Q5BEP0 STRING:Q5BEP0
            EnsemblFungi:CADANIAT00001661 GeneID:2876766 KEGG:ani:AN0990.2
            OMA:EISFAAH OrthoDB:EOG41ZJK7 Uniprot:Q5BEP0
        Length = 884

 Score = 839 (300.4 bits), Expect = 6.3e-110, Sum P(2) = 6.3e-110
 Identities = 164/301 (54%), Positives = 209/301 (69%)

Query:    14 DAPVSREGDQLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDP 73
             D PV    D+L    LG GNEVGRSC  + YKGKT++ D G+HPA  G +ALP+FDE D 
Sbjct:    15 DEPVD-PSDELAFYCLGGGNEVGRSCHIIQYKGKTVMLDAGMHPAKEGFSALPFFDEFDL 73

Query:    74 SAIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKV-SVED 132
             S +D+LLI+HFH+DH+++LPY L KT FKGRVFMTHATKAIYK L+ D V+V+   S  D
Sbjct:    74 STVDILLISHFHVDHSSALPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVNNTASSSD 133

Query:   133 M---LFDEQDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRVLYT 189
                 L+ E D   ++  IE +DF+ T  +N I+   Y AGHVLGAAMF++ IAG+ +L+T
Sbjct:   134 QRTTLYTEHDHLSTLPLIETIDFNTTHTINSIRITPYPAGHVLGAAMFLISIAGLNILFT 193

Query:   190 GDYSREEDRHLRAAELPQ-FSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRV 248
             GDYSREEDRHL  A +P+    D+ I EST+G+  + PR  RE      I   +++GGRV
Sbjct:   194 GDYSREEDRHLIPATVPRGVKIDVLITESTFGISSNPPRLEREAALMKSITGVLNRGGRV 253

Query:   249 LIPAFALGRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQ 308
             L+P FALGRAQELLLIL+EYW  HPE   IPIYY    A++CM VYQTYI +MN+ I+  
Sbjct:   254 LMPVFALGRAQELLLILEEYWETHPELQKIPIYYIGNTARRCMVVYQTYIGAMNDNIKRL 313

Query:   309 F 309
             F
Sbjct:   314 F 314

 Score = 635 (228.6 bits), Expect = 2.1e-88, Sum P(2) = 2.1e-88
 Identities = 126/268 (47%), Positives = 176/268 (65%)

Query:   134 LFDEQDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYS 193
             L+ E D   ++  IE +DF+ T  +N I+   Y AGHVLGAAMF++ IAG+ +L+TGDYS
Sbjct:   138 LYTEHDHLSTLPLIETIDFNTTHTINSIRITPYPAGHVLGAAMFLISIAGLNILFTGDYS 197

Query:   194 REEDRHLRAAELPQ-FSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPA 252
             REEDRHL  A +P+    D+ I EST+G+  + PR  RE      I   +++GGRVL+P 
Sbjct:   198 REEDRHLIPATVPRGVKIDVLITESTFGISSNPPRLEREAALMKSITGVLNRGGRVLMPV 257

Query:   253 FALGRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQF--- 309
             FALGRAQELLLIL+EYW  HPE   IPIYY    A++CM VYQTYI +MN+ I+  F   
Sbjct:   258 FALGRAQELLLILEEYWETHPELQKIPIYYIGNTARRCMVVYQTYIGAMNDNIKRLFRQR 317

Query:   310 -----------ANSNPFKFKHISPLNSIDDFSDVGPSVVMASPGGLQSGLSRQLFDIWCS 358
                         ++ P+ FK++  L S++ F DVG  V++ASPG LQ+G SR+L + W  
Sbjct:   318 MAEAEASGDKSVSAGPWDFKYVRSLRSLERFDDVGGCVMLASPGMLQTGTSRELLERWAP 377

Query:   359 DKKNACVIPGYVVEGTLAKTIISEPKEV 386
             +++N  V+ GY VEGT+AK +++EP ++
Sbjct:   378 NERNGVVMTGYSVEGTMAKQLLNEPDQI 405

 Score = 267 (99.0 bits), Expect = 6.3e-110, Sum P(2) = 6.3e-110
 Identities = 64/155 (41%), Positives = 93/155 (60%)

Query:   387 TLMNG------LTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEMGRLK 440
             T MNG      +  P    V  ISF+AH D  +   F++E+  P +ILVHGE H+M RLK
Sbjct:   419 TRMNGNDEEQKIMIPRRCTVDEISFAAHVDGVENRNFIEEVSAPVVILVHGEKHQMMRLK 478

Query:   441 TKLMTELAD--CNTKIITPKNCQSVEMYFNSEKMAKTIGRLAEKT-P---EVGE--TVSG 492
             +KL++  A+     K+ TP NC+ V + F  +K+AK +G+LA+ T P   E G+   ++G
Sbjct:   479 SKLLSLNAEKTVKVKVYTPANCEEVRIPFRKDKIAKVVGKLAQTTLPTDNEDGDGPLMAG 538

Query:   493 ILVKKGFTYQIMAPDDLHIFSQLSTANIT--QRIT 525
             +LV+ GF   +MAPDDL  ++ L+T  IT  Q IT
Sbjct:   539 VLVQNGFDLSLMAPDDLREYAGLATTTITCKQHIT 573

 Score = 49 (22.3 bits), Expect = 4.9e-20, Sum P(2) = 4.9e-20
 Identities = 13/44 (29%), Positives = 22/44 (50%)

Query:     8 PSLKRRDAPVSREGDQLIITPLGAGNEVGRSCVYMSYKGKTILF 51
             P ++  D   +   + + ITP  AG+ +G +   +S  G  ILF
Sbjct:   149 PLIETIDFNTTHTINSIRITPYPAGHVLGAAMFLISIAGLNILF 192


>UNIPROTKB|F1NV30
            symbol:CPSF3L "Integrator complex subunit 11" species:9031
            "Gallus gallus" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 OMA:MAVEYMS GeneTree:ENSGT00700000104485
            IPI:IPI00571913 EMBL:AADN02040858 Ensembl:ENSGALT00000002586
            Uniprot:F1NV30
        Length = 600

 Score = 883 (315.9 bits), Expect = 2.0e-88, P = 2.0e-88
 Identities = 194/521 (37%), Positives = 297/521 (57%)

Query:    23 QLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA-----ID 77
             ++ +TPLGAG +VGRSC+ +S  GK ++ DCG+H  Y+     P F  I  +      +D
Sbjct:     3 EIKVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGYNDDRRFPDFSYITQNGRLTDFLD 62

Query:    78 VLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVS-KVSVEDMLFD 136
              ++I+HFHLDH  +LPYF E   + G ++MTH TKAI  +LL DY K++     E   F 
Sbjct:    63 CVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTKAICPILLEDYRKITVDKKGETNFFT 122

Query:   137 EQDINRSMDKIEVLDFHQTVEVNG-IKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSRE 195
              Q I   M K+  +  HQTV+V+  ++   Y AGHVLGAAMF + +    V+YTGDY+  
Sbjct:   123 SQMIKDCMKKVVAVHLHQTVQVDEELEIKAYYAGHVLGAAMFQIKVGCESVVYTGDYNMT 182

Query:   196 EDRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFAL 255
              DRHL AA + +  PD+ I ESTY   +   +  RE+ F   +H T+ +GG+VLIP FAL
Sbjct:   183 PDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFAL 242

Query:   256 GRAQELLLILDEYWSNHPEFHNI--PIYYASPLAKKCMAVYQTYILSMNERIRNQFANSN 313
             GRAQEL ++L+ +W    E  N+  PIY+++ L +K    Y+ +I   N++IR  F   N
Sbjct:   243 GRAQELCILLETFW----ERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTFVQRN 298

Query:   314 PFKFKHISPLNSIDDFSD-VGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVE 372
              F+FKHI   +    F+D  GP VV A+PG L +G S Q+F  W  ++KN  ++PGY V+
Sbjct:   299 MFEFKHIKAFDRA--FADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQ 356

Query:   373 GTLAKTIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGE 432
             GT+   I+S  +++ +       + MQV Y+SFSAHAD       +++  P N++LVHGE
Sbjct:   357 GTVGHKILSGQRKLEMEGRQILEVKMQVEYMSFSAHADAKGIMQLIRQAEPRNVLLVHGE 416

Query:   433 SHEMGRLKTKLMTELADCNTKIITPKNCQSVEMYFN-SEKMAKTIGRLAEKT-----PEV 486
             + +M  LK K+  E    +     P N ++  ++ N S  +  ++G L  +T     P+ 
Sbjct:   417 AKKMEFLKQKIEQEF---HVNCYMPANGETTSIFTNPSIPVDISLGLLKRETAIGLLPDA 473

Query:   487 GET--VSGILVKKGFTYQIMAPDDLHIFSQLSTANITQRIT 525
              +   + G L+ K  ++++++P+      +L  A    R T
Sbjct:   474 KKPKLMHGTLIMKDNSFRLVSPEQA--LKELGLAEHQLRFT 512


>UNIPROTKB|Q5ZIH0
            symbol:CPSF3L "Integrator complex subunit 11" species:9031
            "Gallus gallus" [GO:0016787 "hydrolase activity" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 GO:GO:0016787 eggNOG:COG1236
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            HOGENOM:HOG000231294 KO:K13148 GO:GO:0032039 GO:GO:0016180
            HOVERGEN:HBG080215 OrthoDB:EOG4GXFMB EMBL:AJ720814 IPI:IPI00571913
            RefSeq:NP_001012854.1 UniGene:Gga.13403 ProteinModelPortal:Q5ZIH0
            STRING:Q5ZIH0 GeneID:419418 KEGG:gga:419418 CTD:54973
            InParanoid:Q5ZIH0 NextBio:20822477 Uniprot:Q5ZIH0
        Length = 600

 Score = 882 (315.5 bits), Expect = 2.5e-88, P = 2.5e-88
 Identities = 194/521 (37%), Positives = 297/521 (57%)

Query:    23 QLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA-----ID 77
             ++ +TPLGAG +VGRSC+ +S  GK ++ DCG+H  Y+     P F  I  +      +D
Sbjct:     3 EIKVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGYNDDRRFPDFSYITQNGRLTDFLD 62

Query:    78 VLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVS-KVSVEDMLFD 136
              ++I+HFHLDH  +LPYF E   + G ++MTH TKAI  +LL DY K++     E   F 
Sbjct:    63 CVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTKAICPILLEDYRKITVDKKGETNFFT 122

Query:   137 EQDINRSMDKIEVLDFHQTVEVNG-IKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSRE 195
              Q I   M K+  +  HQTV+V+  ++   Y AGHVLGAAMF + +    V+YTGDY+  
Sbjct:   123 SQMIKDCMKKVVAVHLHQTVQVDEELEIKAYYAGHVLGAAMFQIKVGCESVVYTGDYNMT 182

Query:   196 EDRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFAL 255
              DRHL AA + +  PD+ I ESTY   +   +  RE+ F   +H T+ +GG+VLIP FAL
Sbjct:   183 PDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFAL 242

Query:   256 GRAQELLLILDEYWSNHPEFHNI--PIYYASPLAKKCMAVYQTYILSMNERIRNQFANSN 313
             GRAQEL ++L+ +W    E  N+  PIY+++ L +K    Y+ +I   N++IR  F   N
Sbjct:   243 GRAQELCILLETFW----ERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTFVQRN 298

Query:   314 PFKFKHISPLNSIDDFSD-VGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVE 372
              F+FKHI   +    F+D  GP VV A+PG L +G S Q+F  W  ++KN  ++PGY V+
Sbjct:   299 MFEFKHIKAFDRA--FADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQ 356

Query:   373 GTLAKTIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGE 432
             GT+   I+S  +++ +       + MQV Y+SFSAHAD       +++  P N++LVHGE
Sbjct:   357 GTVGHKILSGQRKLEMEGRQILEVKMQVEYMSFSAHADAKGIMQLIRQAEPRNVLLVHGE 416

Query:   433 SHEMGRLKTKLMTELADCNTKIITPKNCQSVEMYFN-SEKMAKTIGRLAEKT-----PEV 486
             + +M  LK K+  E    +     P N ++  ++ N S  +  ++G L  +T     P+ 
Sbjct:   417 AKKMEFLKQKIEQEF---HVNCYMPANGETTTIFTNPSIPVDISLGLLKRETAIGLLPDA 473

Query:   487 GET--VSGILVKKGFTYQIMAPDDLHIFSQLSTANITQRIT 525
              +   + G L+ K  ++++++P+      +L  A    R T
Sbjct:   474 KKPKLMHGTLIMKDNSFRLVSPEQA--LKELGLAEHQLRFT 512


>MGI|MGI:1919207
            symbol:Cpsf3l "cleavage and polyadenylation specific factor
            3-like" species:10090 "Mus musculus" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0008150
            "biological_process" evidence=ND] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 MGI:MGI:1919207 GO:GO:0005737 GO:GO:0016787
            eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 HOGENOM:HOG000231294 KO:K13148 OMA:MAVEYMS
            GeneTree:ENSGT00700000104485 GO:GO:0032039 GO:GO:0016180
            HOVERGEN:HBG080215 OrthoDB:EOG4GXFMB CTD:54973 EMBL:AK010425
            EMBL:AK090206 EMBL:AK150436 EMBL:AK152740 EMBL:AK167607
            EMBL:AK172533 EMBL:BC008240 EMBL:BC011155 IPI:IPI00467084
            RefSeq:NP_082296.1 UniGene:Mm.259270 UniGene:Mm.475640
            ProteinModelPortal:Q9CWS4 SMR:Q9CWS4 STRING:Q9CWS4
            PhosphoSite:Q9CWS4 PaxDb:Q9CWS4 PRIDE:Q9CWS4
            Ensembl:ENSMUST00000030901 GeneID:71957 KEGG:mmu:71957
            InParanoid:Q9CWS4 NextBio:335052 Bgee:Q9CWS4 Genevestigator:Q9CWS4
            GermOnline:ENSMUSG00000029034 Uniprot:Q9CWS4
        Length = 600

 Score = 861 (308.1 bits), Expect = 4.3e-86, P = 4.3e-86
 Identities = 189/519 (36%), Positives = 291/519 (56%)

Query:    23 QLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA-----ID 77
             ++ +TPLGAG +VGRSC+ +S  GK ++ DCG+H  Y+     P F  I  S      +D
Sbjct:     3 EIRVTPLGAGQDVGRSCILVSISGKNVMLDCGMHMGYNDDRRFPDFSYITQSGRLTDFLD 62

Query:    78 VLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVS-KVSVEDMLFD 136
              ++I+HFHLDH  +LPYF E   + G ++MTH T+AI  +LL DY K++     E   F 
Sbjct:    63 CVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFT 122

Query:   137 EQDINRSMDKIEVLDFHQTVEVNG-IKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSRE 195
              Q I   M K+  +  HQTV+V+  ++   Y AGHVLGAAMF + +    V+YTGDY+  
Sbjct:   123 SQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMT 182

Query:   196 EDRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFAL 255
              DRHL AA + +  P++ I ESTY   +   +  RE+ F   +H T+ +GG+VLIP FAL
Sbjct:   183 PDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFAL 242

Query:   256 GRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFANSNPF 315
             GRAQEL ++L+ +W        +PIY+++ L +K    Y+ +I   N++IR  F   N F
Sbjct:   243 GRAQELCILLETFWERMNL--KVPIYFSTGLTEKANHYYKLFITWTNQKIRKTFVQRNMF 300

Query:   316 KFKHISPLNSIDDFSD-VGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGT 374
             +FKHI   +    F+D  GP VV A+PG L +G S Q+F  W  ++KN  ++PGY V+GT
Sbjct:   301 EFKHIKAFDRT--FADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGT 358

Query:   375 LAKTIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESH 434
             +   I+S  +++ +       + MQV Y+SFSAHAD       + +  P +++LVHGE+ 
Sbjct:   359 VGHKILSGQRKLEMEGRQMLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAK 418

Query:   435 EMGRLKTKLMTEL-ADC----NTKIITPKNCQSVEMYFNSEKMAKTI--GRLAE-KTPEV 486
             +M  L+ K+  E    C    N + +T     S+ +  +   + + +  G L E K P +
Sbjct:   419 KMEFLRQKIEQEFRVSCYMPANGETVTLPTSPSIPVGISLGLLKREMVQGLLPEAKKPRL 478

Query:   487 GETVSGILVKKGFTYQIMAPDDLHIFSQLSTANITQRIT 525
                + G L+ K   +++++ +      +L  A    R T
Sbjct:   479 ---LHGTLIMKDSNFRLVSSEQA--LKELGLAEHQLRFT 512


>UNIPROTKB|Q5TA45
            symbol:CPSF3L "Integrator complex subunit 11" species:9606
            "Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0005737 "cytoplasm"
            evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 GO:GO:0005634 GO:GO:0005737 GO:GO:0016787
            EMBL:AL139287 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
            Pfam:PF10996 SMART:SM01027 KO:K13148 HOVERGEN:HBG080215
            OrthoDB:EOG4GXFMB CTD:54973 EMBL:AL136813 EMBL:AK000549
            EMBL:AK021939 EMBL:AK023356 EMBL:AK297350 EMBL:CR533557
            EMBL:BC000675 EMBL:BC007978 EMBL:BC013904 EMBL:BK005728
            EMBL:BK005673 IPI:IPI00063404 IPI:IPI00306882 IPI:IPI00514973
            RefSeq:NP_001243392.1 RefSeq:NP_060341.2 UniGene:Hs.6449
            ProteinModelPortal:Q5TA45 SMR:Q5TA45 IntAct:Q5TA45
            MINT:MINT-1482228 STRING:Q5TA45 PhosphoSite:Q5TA45 DMDM:118572557
            PaxDb:Q5TA45 PRIDE:Q5TA45 DNASU:54973 Ensembl:ENST00000419704
            Ensembl:ENST00000435064 Ensembl:ENST00000450926
            Ensembl:ENST00000545578 GeneID:54973 KEGG:hsa:54973 UCSC:uc001aee.1
            UCSC:uc001aeh.1 UCSC:uc009vjz.1 GeneCards:GC01M001236
            HGNC:HGNC:26052 HPA:HPA028379 HPA:HPA029025 MIM:611354
            neXtProt:NX_Q5TA45 PharmGKB:PA142672080 InParanoid:Q5TA45
            PhylomeDB:Q5TA45 ChiTaRS:CPSF3L GenomeRNAi:54973 NextBio:58222
            ArrayExpress:Q5TA45 Bgee:Q5TA45 Genevestigator:Q5TA45
            GermOnline:ENSG00000127054 Uniprot:Q5TA45
        Length = 600

 Score = 860 (307.8 bits), Expect = 5.4e-86, P = 5.4e-86
 Identities = 189/519 (36%), Positives = 293/519 (56%)

Query:    23 QLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA-----ID 77
             ++ +TPLGAG +VGRSC+ +S  GK ++ DCG+H  ++     P F  I  +      +D
Sbjct:     3 EIRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLD 62

Query:    78 VLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVS-KVSVEDMLFD 136
              ++I+HFHLDH  +LPYF E   + G ++MTH T+AI  +LL DY K++     E   F 
Sbjct:    63 CVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFT 122

Query:   137 EQDINRSMDKIEVLDFHQTVEVNG-IKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSRE 195
              Q I   M K+  +  HQTV+V+  ++   Y AGHVLGAAMF + +    V+YTGDY+  
Sbjct:   123 SQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMT 182

Query:   196 EDRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFAL 255
              DRHL AA + +  P++ I ESTY   +   +  RE+ F   +H T+ +GG+VLIP FAL
Sbjct:   183 PDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFAL 242

Query:   256 GRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFANSNPF 315
             GRAQEL ++L+ +W        +PIY+++ L +K    Y+ +I   N++IR  F   N F
Sbjct:   243 GRAQELCILLETFWERMNL--KVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMF 300

Query:   316 KFKHISPLNSIDDFSD-VGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGT 374
             +FKHI   +    F+D  GP VV A+PG L +G S Q+F  W  ++KN  ++PGY V+GT
Sbjct:   301 EFKHIKAFDRA--FADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGT 358

Query:   375 LAKTIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESH 434
             +   I+S  +++ +       + MQV Y+SFSAHAD       + +  P +++LVHGE+ 
Sbjct:   359 VGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAK 418

Query:   435 EMGRLKTKLMTEL-ADC----NTKIITPKNCQSVEMYFNSEKMAKTI--GRLAE-KTPEV 486
             +M  LK K+  EL  +C    N + +T     S+ +  +   + + +  G L E K P +
Sbjct:   419 KMEFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGLLPEAKKPRL 478

Query:   487 GETVSGILVKKGFTYQIMAPDDLHIFSQLSTANITQRIT 525
                + G L+ K   +++++ +      +L  A    R T
Sbjct:   479 ---LHGTLIMKDSNFRLVSSEQA--LKELGLAEHQLRFT 512


>RGD|1306841
            symbol:Cpsf3l "cleavage and polyadenylation specific factor
            3-like" species:10116 "Rattus norvegicus" [GO:0005634 "nucleus"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 RGD:1306841 GO:GO:0005634 GO:GO:0005737
            GO:GO:0016787 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
            Pfam:PF10996 SMART:SM01027 HOGENOM:HOG000231294 KO:K13148
            OMA:MAVEYMS GeneTree:ENSGT00700000104485 HOVERGEN:HBG080215
            OrthoDB:EOG4GXFMB CTD:54973 EMBL:BC105303 IPI:IPI00365477
            RefSeq:NP_001029064.1 UniGene:Rn.98615 ProteinModelPortal:Q3MHC2
            STRING:Q3MHC2 Ensembl:ENSRNOT00000026725 GeneID:298688
            KEGG:rno:298688 InParanoid:Q3MHC2 NextBio:644186
            Genevestigator:Q3MHC2 GermOnline:ENSRNOG00000019712 Uniprot:Q3MHC2
        Length = 600

 Score = 860 (307.8 bits), Expect = 5.4e-86, P = 5.4e-86
 Identities = 189/519 (36%), Positives = 291/519 (56%)

Query:    23 QLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA-----ID 77
             ++ +TPLGAG +VGRSC+ +S  GK ++ DCG+H  Y+     P F  I  S      +D
Sbjct:     3 EIRVTPLGAGQDVGRSCILVSISGKNVMLDCGMHMGYNDDRRFPDFSYITQSGRLTDFLD 62

Query:    78 VLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVS-KVSVEDMLFD 136
              ++I+HFHLDH  +LPYF E   + G ++MTH T+AI  +LL DY K++     E   F 
Sbjct:    63 CVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFT 122

Query:   137 EQDINRSMDKIEVLDFHQTVEVNG-IKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSRE 195
              Q I   M K+  +  HQTV+V+  ++   Y AGHVLGAAMF + +    V+YTGDY+  
Sbjct:   123 SQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMT 182

Query:   196 EDRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFAL 255
              DRHL AA + +  P++ I ESTY   +   +  RE+ F   +H T+ +GG+VLIP FAL
Sbjct:   183 PDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFAL 242

Query:   256 GRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFANSNPF 315
             GRAQEL ++L+ +W        +PIY+++ L +K    Y+ +I   N++IR  F   N F
Sbjct:   243 GRAQELCILLETFWERMNL--KVPIYFSTGLTEKANHYYKLFITWTNQKIRKTFVQRNMF 300

Query:   316 KFKHISPLNSIDDFSD-VGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGT 374
             +FKHI   +    F+D  GP VV A+PG L +G S Q+F  W  ++KN  ++PGY V+GT
Sbjct:   301 EFKHIKAFDRT--FADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGT 358

Query:   375 LAKTIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESH 434
             +   I+S  +++ +       + MQV Y+SFSAHAD       + +  P +++LVHGE+ 
Sbjct:   359 VGHKILSGQRKLEMEGRQMLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAK 418

Query:   435 EMGRLKTKLMTEL-ADC----NTKIITPKNCQSVEMYFNSEKMAKTI--GRLAE-KTPEV 486
             +M  L+ K+  E    C    N + +T     S+ +  +   + + +  G L E K P +
Sbjct:   419 KMEFLRQKIEQEFRVSCYMPANGETVTLPTSPSIPVGISLGLLKREMVQGLLPEAKKPRL 478

Query:   487 GETVSGILVKKGFTYQIMAPDDLHIFSQLSTANITQRIT 525
                + G L+ K   +++++ +      +L  A    R T
Sbjct:   479 ---LHGTLIMKDNNFRLVSSEQA--LKELGLAEHQLRFT 512


>UNIPROTKB|E1B7Q9
            symbol:CPSF3L "Integrator complex subunit 11" species:9913
            "Bos taurus" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 OMA:MAVEYMS GeneTree:ENSGT00700000104485
            EMBL:DAAA02043243 IPI:IPI00971575 Ensembl:ENSBTAT00000010020
            Uniprot:E1B7Q9
        Length = 598

 Score = 858 (307.1 bits), Expect = 8.9e-86, P = 8.9e-86
 Identities = 189/518 (36%), Positives = 289/518 (55%)

Query:    23 QLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA-----ID 77
             ++ +TPLGAG +VGRSC+ +S  GK ++ DCG+H  +S     P F  I  S      +D
Sbjct:     3 EIRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFSDDRRFPDFSYITRSGRLTDFLD 62

Query:    78 VLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLFDE 137
              ++I+HFHLDH  +LPYF E   + G ++MT  T+AI  +LL DY K++    E   F  
Sbjct:    63 CVIISHFHLDHCGALPYFSEMVGYDGPIYMTQPTQAICPILLEDYRKIAVDKGEANFFTS 122

Query:   138 QDINRSMDKIEVLDFHQTVEVNG-IKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREE 196
             Q I   M K+  +  HQTV+V+  ++   Y AGHVLGAAMF + +    V+YTGDY+   
Sbjct:   123 QMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTP 182

Query:   197 DRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALG 256
             DRHL AA + +  P + I ESTY   +   +  RE+ F   +H T+ +GG+VLIP FALG
Sbjct:   183 DRHLGAAWIDKCRPSLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALG 242

Query:   257 RAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFANSNPFK 316
             RAQEL ++L+ +W         PIY+++ L +K    Y+ +I   N++IR  F   N F+
Sbjct:   243 RAQELCILLETFWERMDL--KAPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMFE 300

Query:   317 FKHISPLNSIDDFSDV-GPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTL 375
             FKHI   +    F+D  GP VV A+PG L +G S Q+F  W  ++KN  ++PGY V+GT+
Sbjct:   301 FKHIKAFDRA--FADSPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTV 358

Query:   376 AKTIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHE 435
                I+S  +++ +       + MQV Y+SFSAHAD       + +  P N++LVHGE+ +
Sbjct:   359 GHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPENVLLVHGEAKK 418

Query:   436 MGRLKTKLMTEL-ADC----NTKIITPKNCQSVEMYFNSEKMAKTI--GRLAE-KTPEVG 487
             M  LK K+  E   +C    N + +T     S+ +  +   + + +  G L + K P + 
Sbjct:   419 MEFLKQKIEQEFRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGLLPDAKKPRL- 477

Query:   488 ETVSGILVKKGFTYQIMAPDDLHIFSQLSTANITQRIT 525
               + G L+ K   +++++ +      +L  A    R T
Sbjct:   478 --LHGTLIMKDSNFRLVSSEQA--LKELGLAEHQLRFT 511


>FB|FBgn0039691
            symbol:IntS11 "Integrator 11" species:7227 "Drosophila
            melanogaster" [GO:0006379 "mRNA cleavage" evidence=ISS] [GO:0005847
            "mRNA cleavage and polyadenylation specificity factor complex"
            evidence=ISS] [GO:0006378 "mRNA polyadenylation" evidence=ISS]
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0034472 "snRNA
            3'-end processing" evidence=IDA] [GO:0016180 "snRNA processing"
            evidence=ISS] [GO:0032039 "integrator complex" evidence=ISS]
            [GO:0022008 "neurogenesis" evidence=IMP] InterPro:IPR001279
            SMART:SM00849 Pfam:PF07521 EMBL:AE014297 GO:GO:0022008
            GO:GO:0006378 GO:GO:0016787 GO:GO:0005847 GO:GO:0006379
            eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 KO:K13148 OMA:MAVEYMS GeneTree:ENSGT00700000104485
            GO:GO:0034472 EMBL:AY061097 RefSeq:NP_651721.1 UniGene:Dm.3722
            SMR:Q9VAH9 STRING:Q9VAH9 EnsemblMetazoa:FBtr0085476 GeneID:43506
            KEGG:dme:Dmel_CG1972 UCSC:CG1972-RA CTD:43506 FlyBase:FBgn0039691
            InParanoid:Q9VAH9 OrthoDB:EOG47D7X3 GenomeRNAi:43506 NextBio:834295
            Uniprot:Q9VAH9
        Length = 597

 Score = 847 (303.2 bits), Expect = 1.3e-84, P = 1.3e-84
 Identities = 182/443 (41%), Positives = 256/443 (57%)

Query:    26 ITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDP-----SAIDVLL 80
             ITPLGAG +VGRSC+ +S  GK I+ DCG+H  Y+     P F  I P     S ID ++
Sbjct:     6 ITPLGAGQDVGRSCLLLSMGGKNIMLDCGMHMGYNDERRFPDFSYIVPEGPITSHIDCVI 65

Query:    81 ITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVS-KVSVEDMLFDEQD 139
             I+HFHLDH  +LPY  E   + G ++MTH TKAI  +LL D  KV+ +   E   F  Q 
Sbjct:    66 ISHFHLDHCGALPYMSEIVGYTGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTTQM 125

Query:   140 INRSMDKIEVLDFHQTVEVN-GIKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDR 198
             I   M K+  +  HQ++ V+  ++   Y AGHVLGAAMF + +    V+YTGDY+   DR
Sbjct:   126 IKDCMKKVIPVTLHQSMMVDTDLEIKAYYAGHVLGAAMFWIKVGSQSVVYTGDYNMTPDR 185

Query:   199 HLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRA 258
             HL AA + +  PD+ I ESTY   +   +  RE+ F   +H  +++GG+VLIP FALGRA
Sbjct:   186 HLGAAWIDKCRPDLLISESTYATTIRDSKRCRERDFLKKVHECVAKGGKVLIPVFALGRA 245

Query:   259 QELLLILDEYWSNHPEFHNI--PIYYASPLAKKCMAVYQTYILSMNERIRNQFANSNPFK 316
             QEL ++L+ YW    E  N+  PIY+A  L +K    Y+ +I   N++IR  F + N F 
Sbjct:   246 QELCILLETYW----ERMNLKYPIYFALGLTEKANTYYKMFITWTNQKIRKTFVHRNMFD 301

Query:   317 FKHISPLNSIDDFSDVGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLA 376
             FKHI P +      + G  VV A+PG L +GLS Q+F  W  ++ N  ++PGY V+GT+ 
Sbjct:   302 FKHIKPFDKAY-IDNPGAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIMPGYCVQGTVG 360

Query:   377 KTIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEM 436
               I+   K+V   N     + M V Y+SFSAHAD       ++   P N++LVHGE+ +M
Sbjct:   361 NKILGGAKKVEFENRQVVEVKMAVEYMSFSAHADAKGIMQLIQNCEPKNVMLVHGEAGKM 420

Query:   437 GRLKTKLMTELADCNTKIITPKN 459
               L++K+  E    N +   P N
Sbjct:   421 KFLRSKIKDEF---NLETYMPAN 440


>UNIPROTKB|F1RJE8
            symbol:CPSF3L "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 KO:K13148 OMA:MAVEYMS GeneTree:ENSGT00700000104485
            EMBL:FP102596 RefSeq:XP_003127541.3 Ensembl:ENSSSCT00000003708
            GeneID:100523908 KEGG:ssc:100523908 Uniprot:F1RJE8
        Length = 599

 Score = 847 (303.2 bits), Expect = 1.3e-84, P = 1.3e-84
 Identities = 186/516 (36%), Positives = 284/516 (55%)

Query:    23 QLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA-----ID 77
             ++ +TPLGAG +VGRSC+ +S  GK ++ DCG+H  +S     P F  I         +D
Sbjct:     3 EIRVTPLGAGQDVGRSCILVSIGGKNVMLDCGMHMGFSDDRRFPDFSYITRHGRLTDFLD 62

Query:    78 VLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVS-KVSVEDMLFD 136
              ++I+HFHLDH  +LPYF E   + G ++MT  T+AI  +LL DY K++     E   F 
Sbjct:    63 CVIISHFHLDHCGALPYFSEMVGYDGPIYMTQPTQAICPILLEDYRKIAVDKKGEANFFT 122

Query:   137 EQDINRSMDKIEVLDFHQTVEVNG-IKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSRE 195
              Q I   M K   +  HQTV+V+  ++   Y AGHVLGAAMF + +    V+YTGDY+  
Sbjct:   123 SQMIKDCMKKAVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMT 182

Query:   196 EDRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFAL 255
              DRHL AA + +  P++ I ESTY   +   +  RE+ F   +H T+ +GG+VLIP FAL
Sbjct:   183 PDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFAL 242

Query:   256 GRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFANSNPF 315
             GRAQEL ++L+ +W         PIY+++ L +K    Y+ +I   N++IR  F   N F
Sbjct:   243 GRAQELCILLETFWERMDL--KAPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMF 300

Query:   316 KFKHISPLNSIDDFSDV-GPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGT 374
             +FKHI   +    F+D  GP VV A+PG L +G S Q+F  W  ++KN  ++PGY V+GT
Sbjct:   301 EFKHIKAFDRA--FADSPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGT 358

Query:   375 LAKTIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESH 434
             +   I+S  +++ L       + MQV Y+SFSAHAD       + +  P N++LVHGE+ 
Sbjct:   359 VGHKILSGQRKLELEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPENVLLVHGEAK 418

Query:   435 EMGRLKTKLMTELA-DC----NTKIITPKNCQSVEMYFNSEKMAKTIGRLAEKTPEVGET 489
             +M  LK K+  E    C    N + +T     S+ +  +   + + + +      +    
Sbjct:   419 KMEFLKQKIEQEFRLSCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGLLPDAKKARL 478

Query:   490 VSGILVKKGFTYQIMAPDDLHIFSQLSTANITQRIT 525
             + G L+ K  T+++++ +      +L  A    R T
Sbjct:   479 LHGTLIMKDSTFRLVSSEQA--LKELGLAEHQLRFT 512


>UNIPROTKB|E2QY53
            symbol:CPSF3L "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 KO:K13148 OMA:MAVEYMS GeneTree:ENSGT00700000104485
            EMBL:AAEX03003844 RefSeq:XP_003639102.1 Ensembl:ENSCAFT00000030626
            GeneID:100855777 KEGG:cfa:100855777 Uniprot:E2QY53
        Length = 600

 Score = 846 (302.9 bits), Expect = 1.7e-84, P = 1.7e-84
 Identities = 188/521 (36%), Positives = 293/521 (56%)

Query:    23 QLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA-----ID 77
             ++ +TPLGAG +VGRSC+ +S  GK ++ DCG+H  ++     P F  I  +      +D
Sbjct:     3 EIRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITRNGRLTDFLD 62

Query:    78 VLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVS-KVSVEDMLFD 136
              ++I+HFHLDH  +LPYF E   + G ++MTH T+AI  +LL DY K++     E   F 
Sbjct:    63 CVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFT 122

Query:   137 EQDINRSMDKIEVLDFHQTVEVNG-IKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSRE 195
              Q I   M K+  +  HQTV+V+  ++   Y AGHVLGAAMF + +    V+YTGDY+  
Sbjct:   123 SQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMT 182

Query:   196 EDRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFAL 255
              DRHL AA + +  P++ I ESTY   +   +  RE+ F   +H  + +GG+VLIP FAL
Sbjct:   183 PDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHEAVERGGKVLIPVFAL 242

Query:   256 GRAQELLLILDEYWSNHPEFHNI--PIYYASPLAKKCMAVYQTYILSMNERIRNQFANSN 313
             GRAQEL ++L+ +W    E  N+  PIY+++ L +K    Y+ +I   N++IR  F   N
Sbjct:   243 GRAQELCILLETFW----ERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTFVQRN 298

Query:   314 PFKFKHISPLNSIDDFSD-VGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVE 372
              F+FKHI   +    F+D  GP VV A+PG L +G S Q+F  W  ++KN  ++PGY V+
Sbjct:   299 MFEFKHIKAFDRA--FADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQ 356

Query:   373 GTLAKTIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGE 432
             GT+   I+S  +++ +       + MQV Y+SFSAHAD       + +  P +++LVHGE
Sbjct:   357 GTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGE 416

Query:   433 SHEMGRLKTKLMTEL-ADC----NTKIITPKNCQSVEMYFNSEKMAKTI--GRLAE-KTP 484
             + +M  LK K+  E   +C    N + +T     S+ +  +   + + +  G L + K P
Sbjct:   417 AKKMEFLKQKIEQEFRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGLLPDVKKP 476

Query:   485 EVGETVSGILVKKGFTYQIMAPDDLHIFSQLSTANITQRIT 525
              +   + G L+ K   +++++ +      +L  A    R T
Sbjct:   477 RL---LHGTLIMKDSNFRLVSSEQA--LKELGLAEHQLRFT 512


>UNIPROTKB|Q2YDM2
            symbol:CPSF3L "Integrator complex subunit 11" species:9913
            "Bos taurus" [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 GO:GO:0005634 GO:GO:0005737 GO:GO:0016787
            eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 HOGENOM:HOG000231294 EMBL:BC110155 IPI:IPI00725178
            UniGene:Bt.4894 ProteinModelPortal:Q2YDM2 STRING:Q2YDM2
            PRIDE:Q2YDM2 HOVERGEN:HBG080215 InParanoid:Q2YDM2 OrthoDB:EOG4GXFMB
            Uniprot:Q2YDM2
        Length = 599

 Score = 844 (302.2 bits), Expect = 2.7e-84, P = 2.7e-84
 Identities = 188/519 (36%), Positives = 288/519 (55%)

Query:    23 QLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA-----ID 77
             ++ +TPLGAG +VGRSC+ +S  GK ++ DCG+H  +S     P F     S      +D
Sbjct:     3 EIRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFSDDRRFPDFSYNTRSGRLTDFLD 62

Query:    78 VLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVS-KVSVEDMLFD 136
              ++I+HFHLDH  +LPYF E   + G ++MT  T+AI  +LL DY K++     E   F 
Sbjct:    63 CVIISHFHLDHCGALPYFSEMVGYDGPIYMTQPTQAICPILLEDYRKIAVDKKGEANFFT 122

Query:   137 EQDINRSMDKIEVLDFHQTVEVNG-IKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSRE 195
              Q I   M K+  +  HQTV+V+  ++   Y AGHVLGAAMF + +    V+YTGDY+  
Sbjct:   123 SQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMT 182

Query:   196 EDRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFAL 255
              DRHL AA + +  P + I ESTY   +   +  RE+ F   +H T+ +GG+VLIP FAL
Sbjct:   183 PDRHLGAAWIDKCRPSLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFAL 242

Query:   256 GRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFANSNPF 315
             GRAQEL ++L+ +W         PIY+++ L +K    Y+ +I   N++IR  F   N F
Sbjct:   243 GRAQELCILLETFWERMDL--KAPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMF 300

Query:   316 KFKHISPLNSIDDFSDV-GPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGT 374
             +FKHI   +    F+D  GP VV A+PG L +G S Q+F  W  ++KN  ++PGY V+GT
Sbjct:   301 EFKHIKAFDRA--FADSPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGT 358

Query:   375 LAKTIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESH 434
             +   I+S  +++ +       + MQV Y+SFSAHAD       + +  P N++LVHGE+ 
Sbjct:   359 VGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPENVLLVHGEAK 418

Query:   435 EMGRLKTKLMTEL-ADC----NTKIITPKNCQSVEMYFNSEKMAKTI--GRLAE-KTPEV 486
             +M  LK K+  E   +C    N + +T     S+ +  +   + + +  G L + K P +
Sbjct:   419 KMEFLKQKIEQEFRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGLLPDAKKPRL 478

Query:   487 GETVSGILVKKGFTYQIMAPDDLHIFSQLSTANITQRIT 525
                + G L+ K   +++++ +      +L  A    R T
Sbjct:   479 ---LHGTLIMKDSNFRLVSSEQA--LKELGLAEHQLRFT 512


>UNIPROTKB|G3V1S5
            symbol:CPSF3L "Integrator complex subunit 11" species:9606
            "Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0016787 EMBL:AL139287 EMBL:CH471183 InterPro:IPR022712
            InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 KO:K13148 OMA:MAVEYMS
            CTD:54973 UniGene:Hs.6449 GeneID:54973 KEGG:hsa:54973
            HGNC:HGNC:26052 ChiTaRS:CPSF3L GenomeRNAi:54973
            RefSeq:NP_001243385.1 ProteinModelPortal:G3V1S5 SMR:G3V1S5
            Ensembl:ENST00000540437 ArrayExpress:G3V1S5 Bgee:G3V1S5
            Uniprot:G3V1S5
        Length = 606

 Score = 840 (300.8 bits), Expect = 7.2e-84, P = 7.2e-84
 Identities = 186/512 (36%), Positives = 287/512 (56%)

Query:    30 GAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA-----IDVLLITHF 84
             GAG +VGRSC+ +S  GK ++ DCG+H  ++     P F  I  +      +D ++I+HF
Sbjct:    16 GAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHF 75

Query:    85 HLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVS-KVSVEDMLFDEQDINRS 143
             HLDH  +LPYF E   + G ++MTH T+AI  +LL DY K++     E   F  Q I   
Sbjct:    76 HLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDC 135

Query:   144 MDKIEVLDFHQTVEVNG-IKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDRHLRA 202
             M K+  +  HQTV+V+  ++   Y AGHVLGAAMF + +    V+YTGDY+   DRHL A
Sbjct:   136 MKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGA 195

Query:   203 AELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRAQELL 262
             A + +  P++ I ESTY   +   +  RE+ F   +H T+ +GG+VLIP FALGRAQEL 
Sbjct:   196 AWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELC 255

Query:   263 LILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFANSNPFKFKHISP 322
             ++L+ +W        +PIY+++ L +K    Y+ +I   N++IR  F   N F+FKHI  
Sbjct:   256 ILLETFWERMNL--KVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMFEFKHIKA 313

Query:   323 LNSIDDFSD-VGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAKTIIS 381
              +    F+D  GP VV A+PG L +G S Q+F  W  ++KN  ++PGY V+GT+   I+S
Sbjct:   314 FDRA--FADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVGHKILS 371

Query:   382 EPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEMGRLKT 441
               +++ +       + MQV Y+SFSAHAD       + +  P +++LVHGE+ +M  LK 
Sbjct:   372 GQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKMEFLKQ 431

Query:   442 KLMTEL-ADC----NTKIITPKNCQSVEMYFNSEKMAKTI--GRLAE-KTPEVGETVSGI 493
             K+  EL  +C    N + +T     S+ +  +   + + +  G L E K P +   + G 
Sbjct:   432 KIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQGLLPEAKKPRL---LHGT 488

Query:   494 LVKKGFTYQIMAPDDLHIFSQLSTANITQRIT 525
             L+ K   +++++ +      +L  A    R T
Sbjct:   489 LIMKDSNFRLVSSEQA--LKELGLAEHQLRFT 518


>WB|WBGene00008642
            symbol:F10B5.8 species:6239 "Caenorhabditis elegans"
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0009792 "embryo
            development ending in birth or egg hatching" evidence=IMP]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0009792 EMBL:Z48334 GO:GO:0016787 eggNOG:COG1236
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            HOGENOM:HOG000231294 KO:K13148 OMA:MAVEYMS
            GeneTree:ENSGT00700000104485 PIR:T20694 RefSeq:NP_495706.2
            ProteinModelPortal:Q9U3K2 SMR:Q9U3K2 STRING:Q9U3K2 PaxDb:Q9U3K2
            EnsemblMetazoa:F10B5.8 GeneID:174310 KEGG:cel:CELE_F10B5.8
            UCSC:F10B5.8 CTD:174310 WormBase:F10B5.8 InParanoid:Q9U3K2
            NextBio:883468 Uniprot:Q9U3K2
        Length = 608

 Score = 806 (288.8 bits), Expect = 2.9e-80, P = 2.9e-80
 Identities = 169/433 (39%), Positives = 245/433 (56%)

Query:    23 QLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA-----ID 77
             ++ I PLGAG +VGRSC+ ++  GK I+ DCG+H  Y      P F  I         +D
Sbjct:     7 EIKIVPLGAGQDVGRSCILITIGGKNIMVDCGMHMGYQDDRRFPDFSYIGGGGRLTDYLD 66

Query:    78 VLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVS-KVSVEDMLFD 136
              ++I+HFHLDH  SLP+  E   + G ++MT+ TKAI  +LL DY KV   +  E   F 
Sbjct:    67 CVIISHFHLDHCGSLPHMSEIVGYDGPIYMTYPTKAICPVLLEDYRKVQCDIKGETNFFT 126

Query:   137 EQDINRSMDKIEVLDFHQTVEV-NGIKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSRE 195
               DI   M K+     H+ + V N +    + AGHVLGAAMF + +    VLYTGDY+  
Sbjct:   127 SDDIKNCMKKVVGCALHEIIHVDNELSIRAFYAGHVLGAAMFEIRLGDHSVLYTGDYNMT 186

Query:   196 EDRHLRAAE-LPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFA 254
              DRHL AA  LP   P + I ESTY   +   +  RE+ F   +H  + +GG+V+IP FA
Sbjct:   187 PDRHLGAARVLPGVRPTVLISESTYATTIRDSKRARERDFLRKVHECVMKGGKVIIPVFA 246

Query:   255 LGRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFANSNP 314
             LGRAQEL ++L+ YW       N+PIY++  LA++    Y+ +I   NE I+  F   N 
Sbjct:   247 LGRAQELCILLESYWERMAL--NVPIYFSQGLAERANQYYRLFISWTNENIKKTFVERNM 304

Query:   315 FKFKHISPLNS-IDDFSDVGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEG 373
             F+FKHI P+    +D    GP V+ ++PG L  G S ++F  WCSD  N  ++PGY V G
Sbjct:   305 FEFKHIKPMEKGCED--QPGPQVLFSTPGMLHGGQSLKVFKKWCSDPLNMIIMPGYCVAG 362

Query:   374 TLAKTIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGES 433
             T+   +I+  K++ +   +   + + V Y+SFSAHAD       +++  P +++ VHGE+
Sbjct:   363 TVGARVINGEKKIEIDQKMHE-IRLGVEYMSFSAHADAKGIMQLIRQCEPQHVMFVHGEA 421

Query:   434 HEMGRLKTKLMTE 446
              +M  LK K+  E
Sbjct:   422 SKMEFLKGKVEKE 434


>DICTYBASE|DDB_G0278189
            symbol:ints11 "integrator complex subunit 11"
            species:44689 "Dictyostelium discoideum" [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0044351 "macropinocytosis"
            evidence=RCA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            dictyBase:DDB_G0278189 Pfam:PF07521 GO:GO:0005634 GO:GO:0005737
            GenomeReviews:CM000152_GR EMBL:AAFI02000023 GO:GO:0016787
            eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 KO:K13148 RefSeq:XP_642189.1
            ProteinModelPortal:Q54YL3 PRIDE:Q54YL3 EnsemblProtists:DDB0234100
            GeneID:8621396 KEGG:ddi:DDB_G0278189 OMA:RTIANET
            ProtClustDB:CLSZ2729107 Uniprot:Q54YL3
        Length = 744

 Score = 803 (287.7 bits), Expect = 6.0e-80, P = 6.0e-80
 Identities = 175/445 (39%), Positives = 251/445 (56%)

Query:    26 ITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA-----IDVLL 80
             + PLGAG +VGRSCV ++   K I+FDCG+H   +     P F  I  +      ID ++
Sbjct:     5 VVPLGAGQDVGRSCVIVTIGNKNIMFDCGMHMGMNDARRFPDFSYISKNGQFTKVIDCVI 64

Query:    81 ITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVS-KVSVEDMLFDEQD 139
             ITHFHLDH  +LP+F E   + G ++MT  TKAI  +LL DY K++ +   E   F  Q 
Sbjct:    65 ITHFHLDHCGALPFFTEMCGYDGPIYMTLPTKAICPILLEDYRKITVEKKGETNFFTAQM 124

Query:   140 INRSMDKIEVLDFHQTVEVNG-IKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDR 198
             I   M K+  ++ HQT++V+  +    Y AGHVLGAAMF   +    V+YTGDY+   DR
Sbjct:   125 IKDCMKKVIPVNLHQTIKVDEELSIKAYYAGHVLGAAMFYAKVGDESVVYTGDYNMTPDR 184

Query:   199 HLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRA 258
             HL +A + Q  PD+ I E+TY   +   +  RE+ F   IH  + +GG+VLIP FALGR 
Sbjct:   185 HLGSAWIDQVKPDVLITETTYATTIRDSKRGRERDFLKRIHECVEKGGKVLIPVFALGRV 244

Query:   259 QELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFANSNPFKFK 318
             QEL +++D YW      H IPIY+++ LA+K    Y+ +I   N++I+  F   N F FK
Sbjct:   245 QELCILIDSYWEQMNLGH-IPIYFSAGLAEKANLYYKLFINWTNQKIKQTFVKRNMFDFK 303

Query:   319 HISPLNSIDDFSDV-GPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAK 377
             HI P  S     D  G  V+ A+PG L +G S ++F  W  ++ N  +IPGY V GT+  
Sbjct:   304 HIKPFQS--HLVDAPGAMVLFATPGMLHAGASLEVFKKWAPNELNMTIIPGYCVVGTVGN 361

Query:   378 TII---------SEPKE--VTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNI 426
              ++         S+P+   V +    T  +  ++H +SFSAHAD       +K   P N+
Sbjct:   362 KLLTTGSDQQQQSKPQSQMVEIDKKTTIEVKCKIHNLSFSAHADAKGILQLIKMSNPRNV 421

Query:   427 ILVHGESHEMGRLKTKLMTELA-DC 450
             ILVHGE  +MG L  K++ E+  +C
Sbjct:   422 ILVHGEKEKMGFLSQKIIKEMGVNC 446


>ZFIN|ZDB-GENE-050522-13
            symbol:cpsf3l "cleavage and polyadenylation specific
            factor 3-like" species:7955 "Danio rerio" [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0043484 "regulation of RNA splicing"
            evidence=IMP] [GO:0005634 "nucleus" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            ZFIN:ZDB-GENE-050522-13 GO:GO:0016787 GO:GO:0043484
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            OMA:MAVEYMS GeneTree:ENSGT00700000104485 EMBL:CABZ01054885
            EMBL:CR846089 IPI:IPI00865509 Ensembl:ENSDART00000102902
            Uniprot:E7EXW1
        Length = 601

 Score = 801 (287.0 bits), Expect = 9.7e-80, P = 9.7e-80
 Identities = 190/506 (37%), Positives = 283/506 (55%)

Query:    26 ITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA-----IDVLL 80
             +TPLGAG +VGRSC+ +S  GK I+ DCG+H  ++     P F  I  +      +D ++
Sbjct:     6 VTPLGAGQDVGRSCILVSIGGKNIMLDCGMHMGFNDDRRFPDFSYITQNGRLTEFLDCVI 65

Query:    81 ITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVS-KVSVEDMLFDEQD 139
             I+HFHLDH  +LPY  E   + G ++MTH TKAI  +LL D+ K++     E   F  Q 
Sbjct:    66 ISHFHLDHCGALPYMSEMVGYDGPIYMTHPTKAICPILLEDFRKITVDKKGETNFFTSQM 125

Query:   140 INRSMDKIEVLDFHQTVEVNG-IKFWCYTAGHVLGAAM----FMVDIAGVRVLYTGDYSR 194
             I   M K+  L+ HQTV+V+  ++   Y AGHVLGAAM    F V +  V V YT     
Sbjct:   126 IKDCMKKVVPLNLHQTVQVDDELEIKAYYAGHVLGAAMVQSRFRV-VYTVSVSYTYSNLM 184

Query:   195 EEDRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFA 254
                  LRAA + +  PDI I ESTY   +   +  RE+ F   +H T+ +GG+VLIP FA
Sbjct:   185 TPASDLRAAWIDKCRPDILISESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFA 244

Query:   255 LGRAQELLLILDEYWSNHPEFHNI--PIYYASPLAKKCMAVYQTYILSMNERIRNQFANS 312
             LGRAQEL ++L+ +W    E  N+  PIY+++ L +K    Y+ +I   N++IR  F   
Sbjct:   245 LGRAQELCILLETFW----ERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTFVQR 300

Query:   313 NPFKFKHISPLNSIDDFSD-VGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVV 371
             N F+FKHI   +    ++D  GP VV A+PG L +G S Q+F  W  ++KN  ++PGY V
Sbjct:   301 NMFEFKHIKAFDR--SYADNPGPMVVFATPGMLHAGQSLQIFKKWAGNEKNMVIMPGYCV 358

Query:   372 EGTLAKTIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHG 431
             +GT+   I++  K++ +    T  + +QV Y+SFSAHAD       ++   P N++LVHG
Sbjct:   359 QGTVGHKILNGQKKLEMEGRATLDVKLQVEYMSFSAHADAKGIMQLIRMAEPRNMLLVHG 418

Query:   432 ESHEMGRLKTKLMTELA-DC-------NTKIITPKNCQSVEMYFNSEKMAKTIGR-LAE- 481
             E+ +M  LK K+  E +  C        T I+T  +   V++  N  K    +G  L + 
Sbjct:   419 EAKKMEFLKDKIEQEFSISCFMPANGETTTIVTNPSVP-VDISLNLLKREMALGGPLPDA 477

Query:   482 KTPEVGETVSGILVKKGFTYQIMAPD 507
             K P    T+ G L+ K  + ++++P+
Sbjct:   478 KKPR---TMHGTLIMKDNSLRLVSPE 500


>TAIR|locus:2065368
            symbol:CPSF73-II "AT2G01730" species:3702 "Arabidopsis
            thaliana" [GO:0003824 "catalytic activity" evidence=ISS]
            [GO:0008152 "metabolic process" evidence=ISS] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0010197 "polar nucleus
            fusion" evidence=IMP] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005634 "nucleus" evidence=IDA] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 GO:GO:0005634 EMBL:CP002685
            GenomeReviews:CT485783_GR GO:GO:0006397 GO:GO:0090305 EMBL:AC006069
            GO:GO:0004518 GO:GO:0010197 eggNOG:COG1236 InterPro:IPR022712
            InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 EMBL:AY168923
            EMBL:AK221561 IPI:IPI00536069 PIR:D84428 RefSeq:NP_178282.2
            UniGene:At.42473 ProteinModelPortal:Q8GUU3 SMR:Q8GUU3 IntAct:Q8GUU3
            STRING:Q8GUU3 PaxDb:Q8GUU3 PRIDE:Q8GUU3 EnsemblPlants:AT2G01730.1
            GeneID:814702 KEGG:ath:AT2G01730 TAIR:At2g01730
            HOGENOM:HOG000231294 InParanoid:Q56XW2 KO:K13148 OMA:MAVEYMS
            Genevestigator:Q8GUU3 Uniprot:Q8GUU3
        Length = 613

 Score = 758 (271.9 bits), Expect = 3.5e-75, P = 3.5e-75
 Identities = 164/444 (36%), Positives = 242/444 (54%)

Query:    29 LGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPS-----AIDVLLITH 83
             LGAG E+G+SCV ++  GK I+FDCG+H         P F  I  S     AI  ++ITH
Sbjct:     8 LGAGQEIGKSCVVVTINGKKIMFDCGMHMGCDDHNRYPNFSLISKSGDFDNAISCIIITH 67

Query:    84 FHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDY--VKVSKVSVEDMLFDEQDIN 141
             FH+DH  +LPYF E   + G ++M++ TKA+  L+L DY  V V +   E+ LF    I 
Sbjct:    68 FHMDHVGALPYFTEVCGYNGPIYMSYPTKALSPLMLEDYRRVMVDRRGEEE-LFTTTHIA 126

Query:   142 RSMDKIEVLDFHQTVEVN-GIKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDRHL 200
               M K+  +D  QT++V+  ++   Y AGHVLGA M    +    ++YTGDY+   DRHL
Sbjct:   127 NCMKKVIAIDLKQTIQVDEDLQIRAYYAGHVLGAVMVYAKMGDAAIVYTGDYNMTTDRHL 186

Query:   201 RAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRAQE 260
              AA++ +   D+ I ESTY   +   +  RE+ F   +H  ++ GG+ LIP+FALGRAQE
Sbjct:   187 GAAKIDRLQLDLLISESTYATTIRGSKYPREREFLQAVHKCVAGGGKALIPSFALGRAQE 246

Query:   261 LLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFANSNPFKFKHI 320
             L ++LD+YW        +PIY++S L  +    Y+  I   ++ ++ +    NPF FK++
Sbjct:   247 LCMLLDDYWERMNI--KVPIYFSSGLTIQANMYYKMLISWTSQNVKEKHNTHNPFDFKNV 304

Query:   321 SPLNSIDDFSDVGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLA-KTI 379
                +        GP V+ A+PG L +G S ++F  W     N   +PGY V GT+  K +
Sbjct:   305 KDFDR-SLIHAPGPCVLFATPGMLCAGFSLEVFKHWAPSPLNLVALPGYSVAGTVGHKLM 363

Query:   380 ISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEMGRL 439
               +P  V L NG    +  +VH ++FS H D        K L P N++LVHGE   M  L
Sbjct:   364 AGKPTTVDLYNGTKVDVRCKVHQVAFSPHTDAKGIMDLTKFLSPKNVVLVHGEKPSMMIL 423

Query:   440 KTKLMTELADCNTKIITPKNCQSV 463
             K K+ +EL   +     P N ++V
Sbjct:   424 KEKITSEL---DIPCFVPANGETV 444


>GENEDB_PFALCIPARUM|PFC0825c
            symbol:PFC0825c "cleavage and polyadenylation
            specificity factor protein, putative" species:5833 "Plasmodium
            falciparum" [GO:0006379 "mRNA cleavage" evidence=ISS] [GO:0006378
            "mRNA polyadenylation" evidence=ISS] [GO:0003729 "mRNA binding"
            evidence=ISS] [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=ISS] InterPro:IPR001279
            SMART:SM00849 Pfam:PF07521 GO:GO:0003729 GO:GO:0016787
            EMBL:AL844502 GO:GO:0005847 InterPro:IPR022712 InterPro:IPR011108
            Pfam:PF10996 SMART:SM01027 KO:K13148 PIR:T18488
            RefSeq:XP_001351256.1 ProteinModelPortal:O77371 PRIDE:O77371
            EnsemblProtists:PFC0825c:mRNA GeneID:814500 KEGG:pfa:PFC0825c
            EuPathDB:PlasmoDB:PF3D7_0318600 HOGENOM:HOG000283200
            ProtClustDB:CLSZ2433497 Uniprot:O77371
        Length = 1017

 Score = 537 (194.1 bits), Expect = 2.6e-62, Sum P(3) = 2.6e-62
 Identities = 131/402 (32%), Positives = 214/402 (53%)

Query:    50 LFDCGI--HPAYSGMAALPYFDEIDPSAIDVLLITH--------FHLDHAASLPYFLEKT 99
             + DC I  H     + ALP+F EI      ++L+++          LD         EK 
Sbjct:   169 IIDCVIISHFHMDHIGALPFFTEILKYR-GIILMSYPTKALSPILLLDSCRVTDMKWEKK 227

Query:   100 TFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLFDEQDINRSMDKIEVLDFHQTVEVN 159
              F+ ++ M +      +LL  +Y  ++ +  +    +E +I   +DK+  L  ++T E+ 
Sbjct:   228 NFERQIKMLNEKSD--ELL--NY-NINCIKKDPWNINEDNIYNCIDKVIGLQINETFELG 282

Query:   160 GIKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDRHLRAAELPQFSPDICIIESTY 219
              +    Y AGHVLGA ++ +++    V+YTGDY+   D+HL +A +P  +P+I I ESTY
Sbjct:   283 DMSITPYYAGHVLGACIYKIEVRNFSVIYTGDYNTIPDKHLGSANIPSLNPEIFISESTY 342

Query:   220 GVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRAQELLLILDEYWSNHPEFHNIP 279
                +   +   E    +++H  + +GG+VLIP FA+GRAQEL ++LD+YW    + H  P
Sbjct:   343 ATYVRPTKKASELELCNLVHECVHKGGKVLIPVFAIGRAQELSILLDDYWKKM-KIH-YP 400

Query:   280 IYYASPLAKKCMAVYQTYILSMNERIRNQFANSNPFKFKHISP-LNSIDDFSDVGPSVVM 338
             IY+   L +     Y+ Y   +N    +     N F F +ISP LN+    ++  P V+ 
Sbjct:   401 IYFGCGLTENANKYYKIYSSWINSSCMSN-EKENLFDFANISPFLNNY--LNEKRPMVLF 457

Query:   339 ASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAKTIISEPKEVTLMNGLT-APLN 397
             A+PG L +GLS + F  W  + +N  V+PGY V+GT+   +I   K+++L +G T   + 
Sbjct:   458 ATPGMLHTGLSLKAFKAWAGNPQNLIVLPGYCVQGTVGHKLIMGEKQISL-DGTTYIKVL 516

Query:   398 MQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEMGRL 439
              ++ Y+SFSAHAD       +K + P N+I VHGE + M +L
Sbjct:   517 CKIIYLSFSAHADSNGIQQLIKHVSPKNVIFVHGEKNGMQKL 558

 Score = 109 (43.4 bits), Expect = 2.6e-62, Sum P(3) = 2.6e-62
 Identities = 20/47 (42%), Positives = 28/47 (59%)

Query:    23 QLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFD 69
             ++II  LGAG  VGRSCV +  + + ++FDCG H  Y      P F+
Sbjct:     8 KIIIQVLGAGQTVGRSCVIVELENRKVMFDCGCHLGYKDERKYPNFN 54

 Score = 47 (21.6 bits), Expect = 2.6e-62, Sum P(3) = 2.6e-62
 Identities = 15/51 (29%), Positives = 22/51 (43%)

Query:   425 NIILVHGESHE-MGRLKTKLMTELADCNTKIITPKNCQSVEMYFNSEKMAK 474
             N I +H   H+ + +LK K+         K I   N Q  ++Y N  K  K
Sbjct:   589 NYIYIHKNIHKHILQLKKKIT------KNKHINTTNIQKTDLYINENKKKK 633


>UNIPROTKB|O77371
            symbol:PFC0825c "Cleavage and polyadenylation specificity
            factor protein, putative" species:36329 "Plasmodium falciparum 3D7"
            [GO:0003729 "mRNA binding" evidence=ISS] [GO:0005847 "mRNA cleavage
            and polyadenylation specificity factor complex" evidence=ISS]
            InterPro:IPR001279 SMART:SM00849 Pfam:PF07521 GO:GO:0003729
            GO:GO:0016787 EMBL:AL844502 GO:GO:0005847 InterPro:IPR022712
            InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 KO:K13148 PIR:T18488
            RefSeq:XP_001351256.1 ProteinModelPortal:O77371 PRIDE:O77371
            EnsemblProtists:PFC0825c:mRNA GeneID:814500 KEGG:pfa:PFC0825c
            EuPathDB:PlasmoDB:PF3D7_0318600 HOGENOM:HOG000283200
            ProtClustDB:CLSZ2433497 Uniprot:O77371
        Length = 1017

 Score = 537 (194.1 bits), Expect = 2.6e-62, Sum P(3) = 2.6e-62
 Identities = 131/402 (32%), Positives = 214/402 (53%)

Query:    50 LFDCGI--HPAYSGMAALPYFDEIDPSAIDVLLITH--------FHLDHAASLPYFLEKT 99
             + DC I  H     + ALP+F EI      ++L+++          LD         EK 
Sbjct:   169 IIDCVIISHFHMDHIGALPFFTEILKYR-GIILMSYPTKALSPILLLDSCRVTDMKWEKK 227

Query:   100 TFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLFDEQDINRSMDKIEVLDFHQTVEVN 159
              F+ ++ M +      +LL  +Y  ++ +  +    +E +I   +DK+  L  ++T E+ 
Sbjct:   228 NFERQIKMLNEKSD--ELL--NY-NINCIKKDPWNINEDNIYNCIDKVIGLQINETFELG 282

Query:   160 GIKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDRHLRAAELPQFSPDICIIESTY 219
              +    Y AGHVLGA ++ +++    V+YTGDY+   D+HL +A +P  +P+I I ESTY
Sbjct:   283 DMSITPYYAGHVLGACIYKIEVRNFSVIYTGDYNTIPDKHLGSANIPSLNPEIFISESTY 342

Query:   220 GVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRAQELLLILDEYWSNHPEFHNIP 279
                +   +   E    +++H  + +GG+VLIP FA+GRAQEL ++LD+YW    + H  P
Sbjct:   343 ATYVRPTKKASELELCNLVHECVHKGGKVLIPVFAIGRAQELSILLDDYWKKM-KIH-YP 400

Query:   280 IYYASPLAKKCMAVYQTYILSMNERIRNQFANSNPFKFKHISP-LNSIDDFSDVGPSVVM 338
             IY+   L +     Y+ Y   +N    +     N F F +ISP LN+    ++  P V+ 
Sbjct:   401 IYFGCGLTENANKYYKIYSSWINSSCMSN-EKENLFDFANISPFLNNY--LNEKRPMVLF 457

Query:   339 ASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAKTIISEPKEVTLMNGLT-APLN 397
             A+PG L +GLS + F  W  + +N  V+PGY V+GT+   +I   K+++L +G T   + 
Sbjct:   458 ATPGMLHTGLSLKAFKAWAGNPQNLIVLPGYCVQGTVGHKLIMGEKQISL-DGTTYIKVL 516

Query:   398 MQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEMGRL 439
              ++ Y+SFSAHAD       +K + P N+I VHGE + M +L
Sbjct:   517 CKIIYLSFSAHADSNGIQQLIKHVSPKNVIFVHGEKNGMQKL 558

 Score = 109 (43.4 bits), Expect = 2.6e-62, Sum P(3) = 2.6e-62
 Identities = 20/47 (42%), Positives = 28/47 (59%)

Query:    23 QLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFD 69
             ++II  LGAG  VGRSCV +  + + ++FDCG H  Y      P F+
Sbjct:     8 KIIIQVLGAGQTVGRSCVIVELENRKVMFDCGCHLGYKDERKYPNFN 54

 Score = 47 (21.6 bits), Expect = 2.6e-62, Sum P(3) = 2.6e-62
 Identities = 15/51 (29%), Positives = 22/51 (43%)

Query:   425 NIILVHGESHE-MGRLKTKLMTELADCNTKIITPKNCQSVEMYFNSEKMAK 474
             N I +H   H+ + +LK K+         K I   N Q  ++Y N  K  K
Sbjct:   589 NYIYIHKNIHKHILQLKKKIT------KNKHINTTNIQKTDLYINENKKKK 633


>UNIPROTKB|C9JZH6
            symbol:CPSF3 "Cleavage and polyadenylation-specificity
            factor subunit 3" species:9606 "Homo sapiens" [GO:0003723 "RNA
            binding" evidence=IEA] [GO:0004521 "endoribonuclease activity"
            evidence=IEA] [GO:0008409 "5'-3' exonuclease activity"
            evidence=IEA] InterPro:IPR001279 Pfam:PF00753 GO:GO:0003723
            GO:GO:0004521 GO:GO:0008409 EMBL:AC080162 HGNC:HGNC:2326
            ChiTaRS:CPSF3 IPI:IPI00807384 ProteinModelPortal:C9JZH6 SMR:C9JZH6
            STRING:C9JZH6 Ensembl:ENST00000475482 HOGENOM:HOG000191757
            ArrayExpress:C9JZH6 Bgee:C9JZH6 Uniprot:C9JZH6
        Length = 136

 Score = 525 (189.9 bits), Expect = 1.7e-50, P = 1.7e-50
 Identities = 93/136 (68%), Positives = 113/136 (83%)

Query:    50 LFDCGIHPAYSGMAALPYFDEIDPSAIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTH 109
             + DCGIHP   GM ALPY D IDP+ ID+LLI+HFHLDH  +LP+FL+KT+FKGR FMTH
Sbjct:     1 MLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTSFKGRTFMTH 60

Query:   110 ATKAIYKLLLTDYVKVSKVSVEDMLFDEQDINRSMDKIEVLDFHQTVEVNGIKFWCYTAG 169
             ATKAIY+ LL+DYVKVS +S +DML+ E D+  SMDKIE ++FH+  EV GIKFWCY AG
Sbjct:    61 ATKAIYRWLLSDYVKVSNISADDMLYTETDLEESMDKIETINFHEVKEVAGIKFWCYHAG 120

Query:   170 HVLGAAMFMVDIAGVR 185
             HVLGAAMFM++IAGV+
Sbjct:   121 HVLGAAMFMIEIAGVK 136


>UNIPROTKB|C9J979
            symbol:CPSF3L "Integrator complex subunit 11" species:9606
            "Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 GO:GO:0016787
            EMBL:AL139287 InterPro:IPR022712 Pfam:PF10996 HOGENOM:HOG000231294
            HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI00514808
            ProteinModelPortal:C9J979 SMR:C9J979 STRING:C9J979
            Ensembl:ENST00000434694 ArrayExpress:C9J979 Bgee:C9J979
            Uniprot:C9J979
        Length = 344

 Score = 287 (106.1 bits), Expect = 5.3e-48, Sum P(2) = 5.3e-48
 Identities = 57/142 (40%), Positives = 84/142 (59%)

Query:    23 QLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA-----ID 77
             ++ +TPLGAG +VGRSC+ +S  GK ++ DCG+H  ++     P F  I  +      +D
Sbjct:     3 EIRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLD 62

Query:    78 VLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVS-KVSVEDMLFD 136
              ++I+HFHLDH  +LPYF E   + G ++MTH T+AI  +LL DY K++     E   F 
Sbjct:    63 CVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFT 122

Query:   137 EQDINRSMDKIEVLDFHQTVEV 158
              Q I   M K+  +  HQTV+V
Sbjct:   123 SQMIKDCMKKVVAVHLHQTVQV 144

 Score = 243 (90.6 bits), Expect = 5.3e-48, Sum P(2) = 5.3e-48
 Identities = 47/119 (39%), Positives = 72/119 (60%)

Query:   202 AAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRAQEL 261
             AA + +  P++ I ESTY   +   +  RE+ F   +H T+ +GG+VLIP FALGRAQEL
Sbjct:   219 AAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQEL 278

Query:   262 LLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFANSNPFKFKHI 320
              ++L+ +W        +PIY+++ L +K    Y+ +I   N++IR  F   N F+FKHI
Sbjct:   279 CILLETFWERMNL--KVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMFEFKHI 335

 Score = 37 (18.1 bits), Expect = 3.2e-19, Sum P(2) = 3.2e-19
 Identities = 8/20 (40%), Positives = 12/20 (60%)

Query:   168 AGHVLGAAMFMVDIAGVRVL 187
             AG  +G +  +V IAG  V+
Sbjct:    11 AGQDVGRSCILVSIAGKNVM 30


>UNIPROTKB|E9PNS4
            symbol:CPSF3L "Integrator complex subunit 11" species:9606
            "Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 GO:GO:0016787
            EMBL:AL139287 HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI00984775
            ProteinModelPortal:E9PNS4 SMR:E9PNS4 Ensembl:ENST00000528879
            ArrayExpress:E9PNS4 Bgee:E9PNS4 Uniprot:E9PNS4
        Length = 278

 Score = 475 (172.3 bits), Expect = 3.4e-45, P = 3.4e-45
 Identities = 93/232 (40%), Positives = 138/232 (59%)

Query:    23 QLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA-----ID 77
             ++ +TPLGAG +VGRSC+ +S  GK ++ DCG+H  ++     P F  I  +      +D
Sbjct:     3 EIRVTPLGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLD 62

Query:    78 VLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVS-KVSVEDMLFD 136
              ++I+HFHLDH  +LPYF E   + G ++MTH T+AI  +LL DY K++     E   F 
Sbjct:    63 CVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFT 122

Query:   137 EQDINRSMDKIEVLDFHQTVEVNG-IKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSRE 195
              Q I   M K+  +  HQTV+V+  ++   Y AGHVLGAAMF + +    V+YTGDY+  
Sbjct:   123 SQMIKDCMKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMT 182

Query:   196 EDRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGR 247
              DRHL AA + +  P++ I ESTY   +   +  RE+ F   +H T+ +GG+
Sbjct:   183 PDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGK 234


>TAIR|locus:2172843
            symbol:CPSF100 "cleavage and polyadenylation specificity
            factor 100" species:3702 "Arabidopsis thaliana" [GO:0005634
            "nucleus" evidence=ISM;IDA] [GO:0009793 "embryo development ending
            in seed dormancy" evidence=NAS] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0003677 "DNA binding" evidence=IPI] [GO:0005515
            "protein binding" evidence=IPI] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=ISS;NAS]
            [GO:0006378 "mRNA polyadenylation" evidence=ISS] [GO:0006379 "mRNA
            cleavage" evidence=ISS] [GO:0035194 "posttranscriptional gene
            silencing by RNA" evidence=IMP] [GO:0009506 "plasmodesma"
            evidence=IDA] [GO:0000278 "mitotic cell cycle" evidence=RCA]
            [GO:0006306 "DNA methylation" evidence=RCA] [GO:0006342 "chromatin
            silencing" evidence=RCA] [GO:0006396 "RNA processing" evidence=RCA]
            [GO:0006397 "mRNA processing" evidence=RCA] [GO:0007267 "cell-cell
            signaling" evidence=RCA] [GO:0009220 "pyrimidine ribonucleotide
            biosynthetic process" evidence=RCA] [GO:0009616 "virus induced gene
            silencing" evidence=RCA] [GO:0009640 "photomorphogenesis"
            evidence=RCA] [GO:0010267 "production of ta-siRNAs involved in RNA
            interference" evidence=RCA] [GO:0010388 "cullin deneddylation"
            evidence=RCA] [GO:0016569 "covalent chromatin modification"
            evidence=RCA] [GO:0031047 "gene silencing by RNA" evidence=RCA]
            [GO:0035196 "production of miRNAs involved in gene silencing by
            miRNA" evidence=RCA] InterPro:IPR001279 InterPro:IPR027075
            Pfam:PF00753 SMART:SM00849 GO:GO:0009506 EMBL:CP002688
            GenomeReviews:BA000015_GR GO:GO:0006378 EMBL:AB005244 GO:GO:0003723
            GO:GO:0016787 GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236 KO:K14402
            InterPro:IPR022712 InterPro:IPR025069 PANTHER:PTHR11203:SF5
            Pfam:PF10996 Pfam:PF13299 SMART:SM01027 EMBL:AF283277 EMBL:AY034982
            EMBL:BT004374 IPI:IPI00521104 RefSeq:NP_197776.1 UniGene:At.25191
            ProteinModelPortal:Q9LKF9 SMR:Q9LKF9 IntAct:Q9LKF9 STRING:Q9LKF9
            PaxDb:Q9LKF9 PRIDE:Q9LKF9 EnsemblPlants:AT5G23880.1 GeneID:832453
            KEGG:ath:AT5G23880 TAIR:At5g23880 HOGENOM:HOG000264343
            InParanoid:Q9LKF9 OMA:NNPFQFK PhylomeDB:Q9LKF9
            ProtClustDB:CLSN2686300 Genevestigator:Q9LKF9 GermOnline:AT5G23880
            GO:GO:0035194 Uniprot:Q9LKF9
        Length = 739

 Score = 408 (148.7 bits), Expect = 5.2e-41, Sum P(2) = 5.2e-41
 Identities = 116/377 (30%), Positives = 193/377 (51%)

Query:    21 GDQLIITPL-GAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSAIDVL 79
             G  + +TPL G  NE   S + +S  G   L DCG +  +      P    +  S ID +
Sbjct:     2 GTSVQVTPLCGVYNENPLSYL-VSIDGFNFLIDCGWNDLFDTSLLEP-LSRV-ASTIDAV 58

Query:    80 LITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKL-LLTDYVK-VSKVSVEDM-LFD 136
             L++H    H  +LPY +++      V+   AT+ +++L LLT Y + +S+  V D  LF 
Sbjct:    59 LLSHPDTLHIGALPYAMKQLGLSAPVY---ATEPVHRLGLLTMYDQFLSRKQVSDFDLFT 115

Query:   137 EQDINRSMDKIEVLDFHQTVEVNG----IKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDY 192
               DI+ +   +  L + Q   ++G    I    + AGH+LG +++ +   G  V+Y  DY
Sbjct:   116 LDDIDSAFQNVIRLTYSQNYHLSGKGEGIVIAPHVAGHMLGGSIWRITKDGEDVIYAVDY 175

Query:   193 SREEDRHLRAAELPQF-SPDICIIESTYGVQLHQP-RNIREKRFTDVIHSTISQGGRVLI 250
             +  ++RHL    L  F  P + I ++ + +  +Q  R  R+K F D I   +  GG VL+
Sbjct:   176 NHRKERHLNGTVLQSFVRPAVLITDAYHALYTNQTARQQRDKEFLDTISKHLEVGGNVLL 235

Query:   251 PAFALGRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFA 310
             P    GR  ELLLIL+++WS    F + PIY+ + ++   +   ++++  M++ I   F 
Sbjct:   236 PVDTAGRVLELLLILEQHWSQRG-F-SFPIYFLTYVSSSTIDYVKSFLEWMSDSISKSFE 293

Query:   311 NS--NPFKFKHISPL-NSID-DFSDVGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVI 366
              S  N F  +H++ L N  D D +  GP VV+AS   L++G +R++F  W +D +N  + 
Sbjct:   294 TSRDNAFLLRHVTLLINKTDLDNAPPGPKVVLASMASLEAGFAREIFVEWANDPRNLVLF 353

Query:   367 PGYVVEGTLAKTIISEP 383
                   GTLA+ + S P
Sbjct:   354 TETGQFGTLARMLQSAP 370

 Score = 58 (25.5 bits), Expect = 5.2e-41, Sum P(2) = 5.2e-41
 Identities = 22/117 (18%), Positives = 53/117 (45%)

Query:   366 IPGYVVEGTLAKTIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPN 425
             + G + E T +  + + P +V + N L   ++  +  + +   +D     + +  + P  
Sbjct:   506 VDGRLDEATASLMLDTRPSKV-MSNELIVTVSCSLVKMDYEGRSDGRSIKSMIAHVSPLK 564

Query:   426 IILVHGESHEMGRLKTKLMTELADCNTKIITPKNCQSVEMYFNSEKMAKTIGRLAEK 482
             ++LVH  +     LK   +  +  C   +  P+  ++V++   S+  A  + +L+EK
Sbjct:   565 LVLVHAIAEATEHLKQHCLNNI--C-PHVYAPQIEETVDV--TSDLCAYKV-QLSEK 615


>UNIPROTKB|E9PI75
            symbol:CPSF3L "Integrator complex subunit 11" species:9606
            "Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 GO:GO:0016787
            EMBL:AL139287 HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI01011963
            ProteinModelPortal:E9PI75 SMR:E9PI75 Ensembl:ENST00000527719
            ArrayExpress:E9PI75 Bgee:E9PI75 Uniprot:E9PI75
        Length = 209

 Score = 392 (143.0 bits), Expect = 2.1e-36, P = 2.1e-36
 Identities = 80/194 (41%), Positives = 115/194 (59%)

Query:    30 GAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA-----IDVLLITHF 84
             GAG +VGRSC+ +S  GK ++ DCG+H  ++     P F  I  +      +D ++I+HF
Sbjct:    16 GAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHF 75

Query:    85 HLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVS-KVSVEDMLFDEQDINRS 143
             HLDH  +LPYF E   + G ++MTH T+AI  +LL DY K++     E   F  Q I   
Sbjct:    76 HLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDC 135

Query:   144 MDKIEVLDFHQTVEVNG-IKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDRHLRA 202
             M K+  +  HQTV+V+  ++   Y AGHVLGAAMF + +    V+YTGDY+   DRHL A
Sbjct:   136 MKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGA 195

Query:   203 AELPQFSPDICIIE 216
             A + +  P++ I E
Sbjct:   196 AWIDKCRPNLLITE 209


>UNIPROTKB|E9PIG1
            symbol:CPSF3L "Integrator complex subunit 11" species:9606
            "Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 GO:GO:0016787
            EMBL:AL139287 HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI00981641
            ProteinModelPortal:E9PIG1 SMR:E9PIG1 Ensembl:ENST00000530031
            ArrayExpress:E9PIG1 Bgee:E9PIG1 Uniprot:E9PIG1
        Length = 249

 Score = 388 (141.6 bits), Expect = 5.7e-36, P = 5.7e-36
 Identities = 79/192 (41%), Positives = 114/192 (59%)

Query:    30 GAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA-----IDVLLITHF 84
             GAG +VGRSC+ +S  GK ++ DCG+H  ++     P F  I  +      +D ++I+HF
Sbjct:    57 GAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHF 116

Query:    85 HLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVS-KVSVEDMLFDEQDINRS 143
             HLDH  +LPYF E   + G ++MTH T+AI  +LL DY K++     E   F  Q I   
Sbjct:   117 HLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDC 176

Query:   144 MDKIEVLDFHQTVEVNG-IKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDRHLRA 202
             M K+  +  HQTV+V+  ++   Y AGHVLGAAMF + +    V+YTGDY+   DRHL A
Sbjct:   177 MKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGA 236

Query:   203 AELPQFSPDICI 214
             A + +  P++ I
Sbjct:   237 AWIDKCRPNLLI 248


>TIGR_CMR|CHY_2049
            symbol:CHY_2049 "metallo-beta-lactamase family protein"
            species:246194 "Carboxydothermus hydrogenoformans Z-2901"
            [GO:0003824 "catalytic activity" evidence=ISS] [GO:0008152
            "metabolic process" evidence=ISS] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 EMBL:CP000141 GenomeReviews:CP000141_GR
            GO:GO:0016787 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
            Pfam:PF10996 SMART:SM01027 RefSeq:YP_360868.1
            ProteinModelPortal:Q3AAG6 STRING:Q3AAG6 GeneID:3728507
            KEGG:chy:CHY_2049 PATRIC:21277179 HOGENOM:HOG000244774 KO:K07576
            OMA:GGRIVHH BioCyc:CHYD246194:GJCN-2048-MONOMER Uniprot:Q3AAG6
        Length = 504

 Score = 293 (108.2 bits), Expect = 1.2e-35, Sum P(2) = 1.2e-35
 Identities = 88/303 (29%), Positives = 150/303 (49%)

Query:   160 GIKFWCYTAGHVLGAAMFMVDIAG---VR-VLYTGDYSREEDRHLRAAELPQFSP--DIC 213
             G++   + AGH+LG+AM  +   G    R +L+TGD  R     ++    PQ  P  DI 
Sbjct:   152 GLEVTFFDAGHILGSAMIKIAYKGQDATRTILFTGDLGRNGRPFMKE---PQKVPLTDIL 208

Query:   214 IIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRAQELLLILDEYWSNHP 273
             ++ESTYG ++       +     +I     + G ++IPAFA+ R Q+L+ IL++   N  
Sbjct:   209 VLESTYGDRVRSEEGDLKTLLKSLIEKVYRRNGNLIIPAFAMERTQDLIYILNDLVENK- 267

Query:   274 EFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQF-ANSNPFKFK--H--ISPLNSIDD 328
             E   I +Y  SPLA +   +++ Y +  NE  + +     +P  F   H  +S  +S+  
Sbjct:   268 EVPPIDVYIDSPLAVEITKLFKKYPMFFNEEYKEKLNRGDDPLAFPGLHFSVSQEDSVK- 326

Query:   329 FSDVGPSVVMASPGGLQSGLSRQLF--DIWCSDKKNACVIPGYVVEGTLAKTIISEPKEV 386
              +++  ++++++ G   +G  R     ++W  +  +A ++ GY  + TL + ++   KEV
Sbjct:   327 LNNISRAIIISASGMADAGRIRHHLKHNLWRPE--SAVLLVGYQAQDTLGRKLLDGAKEV 384

Query:   387 TLMNGLTAPLNMQV-HYISFSAHADYAQTSTFLKELM--PPNIILVHGESHEMGRLKTKL 443
              +M G    +  +V HY   SAHAD  +   F+      P  I LVHGE      LK KL
Sbjct:   385 KIM-GEEIAVKAEVYHYDGLSAHADQRELLAFIGRFSQKPAQIYLVHGEDEARLNLK-KL 442

Query:   444 MTE 446
             + E
Sbjct:   443 IEE 445

 Score = 157 (60.3 bits), Expect = 1.2e-35, Sum P(2) = 1.2e-35
 Identities = 46/174 (26%), Positives = 79/174 (45%)

Query:    26 ITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFD-EIDPSAIDVLLITHF 84
             +T  GA + V  SC   +  G   L DCG+      +    Y +   +P  I+ +L+TH 
Sbjct:     3 LTFFGAADTVTGSCYLFNVAGHKFLVDCGLFQGPKAIKERNYGEFPFNPREIEFILLTHA 62

Query:    85 HLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLF--------D 136
             H+DH+  +P  ++K  FKG ++ T  T  +  ++L D   V ++ VE            +
Sbjct:    63 HIDHSGLIPKLVKKG-FKGTIYATEPTVDLAAVMLPDSGHVQEMEVERKNRKLRRAGKPE 121

Query:   137 EQDINRSMDKIEVLDFHQTVEVN-------GIKFWCYTAGHVLGAAMFMVDIAG 183
              Q I  + D    L + Q + +        G++   + AGH+LG+AM  +   G
Sbjct:   122 LQPIYTADDAFNALAYFQKIPLETPITPLPGLEVTFFDAGHILGSAMIKIAYKG 175


>TIGR_CMR|CPS_2623
            symbol:CPS_2623 "metallo-beta-lactamase family protein"
            species:167879 "Colwellia psychrerythraea 34H" [GO:0003824
            "catalytic activity" evidence=ISS] [GO:0008152 "metabolic process"
            evidence=ISS] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 GO:GO:0016787 EMBL:CP000083 GenomeReviews:CP000083_GR
            eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 HOGENOM:HOG000244774 KO:K07576 RefSeq:YP_269337.1
            ProteinModelPortal:Q481D2 STRING:Q481D2 GeneID:3521490
            KEGG:cps:CPS_2623 PATRIC:21468305 OMA:HGPMVII
            ProtClustDB:CLSK2524370 BioCyc:CPSY167879:GI48-2685-MONOMER
            Uniprot:Q481D2
        Length = 451

 Score = 377 (137.8 bits), Expect = 8.3e-35, P = 8.3e-35
 Identities = 113/440 (25%), Positives = 209/440 (47%)

Query:    26 ITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFD-EIDPSAIDVLLITHF 84
             IT LG    V  S  ++      IL DCG++  Y  + A       +D  ++D +++TH 
Sbjct:     3 ITFLGGTGTVTGSKYFVETSTTKILVDCGLYQGYKWLRARNREPLPLDLKSLDAIVLTHA 62

Query:    85 HLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTD--YV-----------KVSKVSVE 131
             HLDH+  +P  L K  F+G V+   AT ++  +LL D  ++           K+S+    
Sbjct:    63 HLDHSGFIPA-LYKQGFRGHVYAHQATISLCSILLPDSGHIQEDDAKFYGKHKISRHENP 121

Query:   132 DMLFDEQDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRVLYTGD 191
             + L+D+      +   + +DF++  ++  I+    +AGH+LGAA  ++   G RV ++GD
Sbjct:   122 EPLYDKATAEACLSLFKAVDFNEEFKIGDIEIELQSAGHILGAASVILKADGKRVGFSGD 181

Query:   192 YSREEDRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIP 251
               R +D  +   + P    D+ ++ESTYG +LH   +  E+   ++++ST  +GG +LIP
Sbjct:   182 VGRPDDIIMYPPK-PLPPVDLLLLESTYGNRLHDKEDAFEQ-LAEIVNSTAKKGGALLIP 239

Query:   252 AFALGRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQ--F 309
             +FA+GR + +  +L            +P+Y  SP+A     +Y  +   +N R+ N+   
Sbjct:   240 SFAVGRTEAVQHMLASLMKKEL-IPKLPVYLDSPMAINVFNIYCEHF-DLN-RLSNEECL 296

Query:   310 ANSNPFKF-KHISPLNSIDDFSDVGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPG 368
                N   F + +    ++ +   + P +++A  G    G           D +   +  G
Sbjct:   297 EMCNVATFTRTVDESKALSEL--IMPHIIIAGSGMATGGRILHHLKRLLGDYRTTVLFTG 354

Query:   369 YVVEGTLAKTIISEPKEVTLMNGLTAPLNMQVHYIS-FSAHADYAQTSTFLK--ELMPPN 425
             Y+  GT    +++    V + +G   P+  +V  ++  S H DY   + +L+  +L P  
Sbjct:   355 YLSGGTRGAKMLAGKDNVKI-HGKWLPVKARVEVLNGLSGHGDYEDITQWLQISKLHPKT 413

Query:   426 -IILVHGESHEMGRLKTKLM 444
              ++LVHGE      ++  LM
Sbjct:   414 KVLLVHGEPEASESMRDHLM 433


>UNIPROTKB|Q9KV92
            symbol:VC_0264 "Putative uncharacterized protein"
            species:243277 "Vibrio cholerae O1 biovar El Tor str. N16961"
            [GO:0003674 "molecular_function" evidence=ND] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 EMBL:AE003852 GenomeReviews:AE003852_GR GO:GO:0016787
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            KO:K07576 OMA:CHIDHVG PIR:F82345 RefSeq:NP_229920.1
            ProteinModelPortal:Q9KV92 DNASU:2614470 GeneID:2614470
            KEGG:vch:VC0264 PATRIC:20079570 ProtClustDB:CLSK2517501
            Uniprot:Q9KV92
        Length = 455

 Score = 373 (136.4 bits), Expect = 2.2e-34, P = 2.2e-34
 Identities = 115/435 (26%), Positives = 204/435 (46%)

Query:    18 SREGDQLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSAID 77
             +R  +  ++   G  +  G SC  +   G+ +L DCG+   + G    P   E     +D
Sbjct:    13 NRRNNMEVVHHGGKASVTG-SCHELRADGQALLIDCGL---FQGADERPLAVEFALGHVD 68

Query:    78 VLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLFDE 137
              L++TH H+DH   LP+ L     K  ++ T AT  +  L+L D +K+ ++ +      E
Sbjct:    69 ALILTHAHIDHIGRLPWLLA-AGLKQPIYSTAATAELVPLMLEDGLKL-QLGMSPKQ-SE 125

Query:   138 QDINRSMDKIEVLDFHQTVEVNGIK---FWC--YTAGHVLGAAMFMVDIA-GVRVLYTGD 191
             + +      + V D+ +   V   +    W     AGH+LG+A   +    G  V+++GD
Sbjct:   126 RVLTEVRRLLRVQDYQKWFAVQPKRADSLWVRFQPAGHILGSAYVEIRRPNGEVVVFSGD 185

Query:   192 YSREEDRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIP 251
                     L   + P+ + D   IE+TYG + H+    R +R   +I  +++ GG +LIP
Sbjct:   186 LGPSHTPLLPDPQSPERA-DYLFIETTYGDKQHEDVQSRGQRLRAMIERSLTDGGAILIP 244

Query:   252 AFALGRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTY--ILSMNERIRNQF 309
             AF++GR QELL  +++   +     N+PI   SP+A++    Y+ +  +     + R Q 
Sbjct:   245 AFSVGRTQELLFDIEQLIFSQQIDANLPIILDSPMAQRVTRSYRRFKQLWGREAKARLQM 304

Query:   310 ANSNPFKFK-------HISPLNSIDDFSDVGPS-VVMASPGGLQSGLSRQLFDIWCSDKK 361
              + +P  F+       H +    ++  +  G + +V+A+ G  Q G           DK+
Sbjct:   305 -HRHPLAFEQCITVEDHRTHERLVNRLASTGEAAIVVAASGMCQGGRIMDYLKALLPDKR 363

Query:   362 NACVIPGYVVEGTLAKTIISEPKEVTLMNGLTAPLNMQVHYIS-FSAHADYAQTSTFLKE 420
                ++ G+  EGTL ++I S    V +  G    +N  +H +S +SAHAD A    F+  
Sbjct:   364 TDLILAGFQAEGTLGRSIQSGQPSVWI-EGTEVEVNAHIHTMSGYSAHADKADLLRFITG 422

Query:   421 L--MPPNIILVHGES 433
             +   P  + L+HGE+
Sbjct:   423 IPEKPKQVHLIHGEA 437


>TIGR_CMR|VC_0264 [details] [associations]
            symbol:VC_0264 "conserved hypothetical protein" species:686
            "Vibrio cholerae O1 biovar El Tor" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0008150 "biological_process" evidence=ND] InterPro:IPR001279
            Pfam:PF00753 SMART:SM00849 Pfam:PF07521 EMBL:AE003852
            GenomeReviews:AE003852_GR GO:GO:0016787 InterPro:IPR022712
            InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 KO:K07576 OMA:CHIDHVG
            PIR:F82345 RefSeq:NP_229920.1 ProteinModelPortal:Q9KV92
            DNASU:2614470 GeneID:2614470 KEGG:vch:VC0264 PATRIC:20079570
            ProtClustDB:CLSK2517501 Uniprot:Q9KV92
        Length = 455

 Score = 373 (136.4 bits), Expect = 2.2e-34, P = 2.2e-34
 Identities = 115/435 (26%), Positives = 204/435 (46%)

Query:    18 SREGDQLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSAID 77
             +R  +  ++   G  +  G SC  +   G+ +L DCG+   + G    P   E     +D
Sbjct:    13 NRRNNMEVVHHGGKASVTG-SCHELRADGQALLIDCGL---FQGADERPLAVEFALGHVD 68

Query:    78 VLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLFDE 137
              L++TH H+DH   LP+ L     K  ++ T AT  +  L+L D +K+ ++ +      E
Sbjct:    69 ALILTHAHIDHIGRLPWLLA-AGLKQPIYSTAATAELVPLMLEDGLKL-QLGMSPKQ-SE 125

Query:   138 QDINRSMDKIEVLDFHQTVEVNGIK---FWC--YTAGHVLGAAMFMVDIA-GVRVLYTGD 191
             + +      + V D+ +   V   +    W     AGH+LG+A   +    G  V+++GD
Sbjct:   126 RVLTEVRRLLRVQDYQKWFAVQPKRADSLWVRFQPAGHILGSAYVEIRRPNGEVVVFSGD 185

Query:   192 YSREEDRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIP 251
                     L   + P+ + D   IE+TYG + H+    R +R   +I  +++ GG +LIP
Sbjct:   186 LGPSHTPLLPDPQSPERA-DYLFIETTYGDKQHEDVQSRGQRLRAMIERSLTDGGAILIP 244

Query:   252 AFALGRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTY--ILSMNERIRNQF 309
             AF++GR QELL  +++   +     N+PI   SP+A++    Y+ +  +     + R Q 
Sbjct:   245 AFSVGRTQELLFDIEQLIFSQQIDANLPIILDSPMAQRVTRSYRRFKQLWGREAKARLQM 304

Query:   310 ANSNPFKFK-------HISPLNSIDDFSDVGPS-VVMASPGGLQSGLSRQLFDIWCSDKK 361
              + +P  F+       H +    ++  +  G + +V+A+ G  Q G           DK+
Sbjct:   305 -HRHPLAFEQCITVEDHRTHERLVNRLASTGEAAIVVAASGMCQGGRIMDYLKALLPDKR 363

Query:   362 NACVIPGYVVEGTLAKTIISEPKEVTLMNGLTAPLNMQVHYIS-FSAHADYAQTSTFLKE 420
                ++ G+  EGTL ++I S    V +  G    +N  +H +S +SAHAD A    F+  
Sbjct:   364 TDLILAGFQAEGTLGRSIQSGQPSVWI-EGTEVEVNAHIHTMSGYSAHADKADLLRFITG 422

Query:   421 L--MPPNIILVHGES 433
             +   P  + L+HGE+
Sbjct:   423 IPEKPKQVHLIHGEA 437


>WB|WBGene00017313 [details] [associations]
            symbol:cpsf-2 species:6239 "Caenorhabditis elegans"
            [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0009792 "embryo
            development ending in birth or egg hatching" evidence=IMP]
            [GO:0051301 "cell division" evidence=IMP] [GO:0000910 "cytokinesis"
            evidence=IMP] [GO:0000003 "reproduction" evidence=IMP] [GO:0040035
            "hermaphrodite genitalia development" evidence=IMP] [GO:0016246
            "RNA interference" evidence=IMP] [GO:0040027 "negative regulation
            of vulval development" evidence=IMP] InterPro:IPR001279
            InterPro:IPR027075 SMART:SM00849 Pfam:PF07521 GO:GO:0005634
            GO:GO:0009792 GO:GO:0016246 GO:GO:0006397 GO:GO:0003723
            GO:GO:0016787 GO:GO:0000910 GO:GO:0040035 GO:GO:0040027
            eggNOG:COG1236 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
            InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
            SMART:SM01027 GeneTree:ENSGT00700000104551 HOGENOM:HOG000264343
            OMA:NNPFQFK EMBL:FO080529 PIR:T32487 RefSeq:NP_504822.1
            ProteinModelPortal:O17403 SMR:O17403 STRING:O17403 PaxDb:O17403
            EnsemblMetazoa:F09G2.4 GeneID:179103 KEGG:cel:CELE_F09G2.4
            CTD:179103 WormBase:F09G2.4 InParanoid:O17403 NextBio:903938
            Uniprot:O17403
        Length = 843

 Score = 372 (136.0 bits), Expect = 3.9e-34, Sum P(2) = 3.9e-34
 Identities = 102/365 (27%), Positives = 177/365 (48%)

Query:    30 GAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDP--SAIDVLLITHFHLD 87
             GA +E G  C  +   G  IL DCG    +     L YF+E+ P    I  +LI+H    
Sbjct:    12 GAKDE-GPLCYLLQVDGDYILLDCGWDERFG----LQYFEELKPFIPKISAVLISHPDPL 66

Query:    88 HAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDML-FDEQDINRSMDK 146
             H   LPY + K      V+ T     + ++ + D V  S + VE+   +   D++ + +K
Sbjct:    67 HLGGLPYLVSKCGLTAPVYATVPVYKMGQMFIYDMV-YSHLDVEEFEHYTLDDVDTAFEK 125

Query:   147 IEVLDFHQTVEV---NGIKFWCYTAGHVLGAAMFMV-DIAGVRVLYTGDYSREEDRHLRA 202
             +E + ++QTV +   +G+ F    AGH+LG +++ +  + G  ++Y  D++ +++RHL  
Sbjct:   126 VEQVKYNQTVVLKGDSGVHFTALPAGHMLGGSIWRICRVTGEDIVYCVDFNHKKERHLNG 185

Query:   203 AELPQFSPDICIIESTYGVQLHQPRNI-REKRFTDVIHSTISQGGRVLIPAFALGRAQEL 261
                  F+    +I   + + L Q R   R+++    I  T+ Q G  +I     GR  EL
Sbjct:   186 CSFDNFNRPHLLITGAHHISLPQMRRKDRDEQLVTKILRTVRQKGDCMIVIDTAGRVLEL 245

Query:   262 LLILDEYWSNHPE-FHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFANS---NPFKF 317
               +LD+ WSN         +   S +A   +   ++ +  MNE++    ++S   NPF  
Sbjct:   246 AHLLDQLWSNADAGLSTYNLVMMSHVASSVVQFAKSQLEWMNEKLFKYDSSSARYNPFTL 305

Query:   318 KHISPLNSIDDFSDV-GPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLA 376
             KH++  +S  +   V  P VV+ S   ++SG SR+LF  WCSD +N  ++       TLA
Sbjct:   306 KHVTLCHSHQELMRVRSPKVVLCSSQDMESGFSRELFLDWCSDPRNGVILTARPASFTLA 365

Query:   377 KTIIS 381
               +++
Sbjct:   366 AKLVN 370

 Score = 58 (25.5 bits), Expect = 3.9e-34, Sum P(2) = 3.9e-34
 Identities = 11/36 (30%), Positives = 20/36 (55%)

Query:   396 LNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHG 431
             ++ ++ +I +   +D   T   L  L+P  II+VHG
Sbjct:   565 VSCRIEFIEYEGISDGESTKKLLAGLLPRQIIVVHG 600


>UNIPROTKB|O17403 [details] [associations]
            symbol:cpsf-2 "Probable cleavage and polyadenylation
            specificity factor subunit 2" species:6239 "Caenorhabditis elegans"
            [GO:0005575 "cellular_component" evidence=ND] [GO:0003674
            "molecular_function" evidence=ND] InterPro:IPR001279
            InterPro:IPR027075 SMART:SM00849 Pfam:PF07521 GO:GO:0005634
            GO:GO:0009792 GO:GO:0016246 GO:GO:0006397 GO:GO:0003723
            GO:GO:0016787 GO:GO:0000910 GO:GO:0040035 GO:GO:0040027
            eggNOG:COG1236 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
            InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
            SMART:SM01027 GeneTree:ENSGT00700000104551 HOGENOM:HOG000264343
            OMA:NNPFQFK EMBL:FO080529 PIR:T32487 RefSeq:NP_504822.1
            ProteinModelPortal:O17403 SMR:O17403 STRING:O17403 PaxDb:O17403
            EnsemblMetazoa:F09G2.4 GeneID:179103 KEGG:cel:CELE_F09G2.4
            CTD:179103 WormBase:F09G2.4 InParanoid:O17403 NextBio:903938
            Uniprot:O17403
        Length = 843

 Score = 372 (136.0 bits), Expect = 3.9e-34, Sum P(2) = 3.9e-34
 Identities = 102/365 (27%), Positives = 177/365 (48%)

Query:    30 GAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDP--SAIDVLLITHFHLD 87
             GA +E G  C  +   G  IL DCG    +     L YF+E+ P    I  +LI+H    
Sbjct:    12 GAKDE-GPLCYLLQVDGDYILLDCGWDERFG----LQYFEELKPFIPKISAVLISHPDPL 66

Query:    88 HAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDML-FDEQDINRSMDK 146
             H   LPY + K      V+ T     + ++ + D V  S + VE+   +   D++ + +K
Sbjct:    67 HLGGLPYLVSKCGLTAPVYATVPVYKMGQMFIYDMV-YSHLDVEEFEHYTLDDVDTAFEK 125

Query:   147 IEVLDFHQTVEV---NGIKFWCYTAGHVLGAAMFMV-DIAGVRVLYTGDYSREEDRHLRA 202
             +E + ++QTV +   +G+ F    AGH+LG +++ +  + G  ++Y  D++ +++RHL  
Sbjct:   126 VEQVKYNQTVVLKGDSGVHFTALPAGHMLGGSIWRICRVTGEDIVYCVDFNHKKERHLNG 185

Query:   203 AELPQFSPDICIIESTYGVQLHQPRNI-REKRFTDVIHSTISQGGRVLIPAFALGRAQEL 261
                  F+    +I   + + L Q R   R+++    I  T+ Q G  +I     GR  EL
Sbjct:   186 CSFDNFNRPHLLITGAHHISLPQMRRKDRDEQLVTKILRTVRQKGDCMIVIDTAGRVLEL 245

Query:   262 LLILDEYWSNHPE-FHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFANS---NPFKF 317
               +LD+ WSN         +   S +A   +   ++ +  MNE++    ++S   NPF  
Sbjct:   246 AHLLDQLWSNADAGLSTYNLVMMSHVASSVVQFAKSQLEWMNEKLFKYDSSSARYNPFTL 305

Query:   318 KHISPLNSIDDFSDV-GPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLA 376
             KH++  +S  +   V  P VV+ S   ++SG SR+LF  WCSD +N  ++       TLA
Sbjct:   306 KHVTLCHSHQELMRVRSPKVVLCSSQDMESGFSRELFLDWCSDPRNGVILTARPASFTLA 365

Query:   377 KTIIS 381
               +++
Sbjct:   366 AKLVN 370

 Score = 58 (25.5 bits), Expect = 3.9e-34, Sum P(2) = 3.9e-34
 Identities = 11/36 (30%), Positives = 20/36 (55%)

Query:   396 LNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHG 431
             ++ ++ +I +   +D   T   L  L+P  II+VHG
Sbjct:   565 VSCRIEFIEYEGISDGESTKKLLAGLLPRQIIVVHG 600


>FB|FBgn0027873 [details] [associations]
            symbol:Cpsf100 "Cleavage and polyadenylation specificity
            factor 100" species:7227 "Drosophila melanogaster" [GO:0006379
            "mRNA cleavage" evidence=ISS;NAS] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=ISS]
            [GO:0006378 "mRNA polyadenylation" evidence=ISS;IMP;NAS]
            [GO:0003730 "mRNA 3'-UTR binding" evidence=NAS] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0006398 "histone mRNA 3'-end
            processing" evidence=IMP] [GO:0022008 "neurogenesis" evidence=IMP]
            InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 EMBL:AE014297 GO:GO:0022008 GO:GO:0006378
            GO:GO:0003723 GO:GO:0016787 GO:GO:0005847 GO:GO:0006379
            eggNOG:COG1236 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
            InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
            SMART:SM01027 GeneTree:ENSGT00700000104551 OMA:NNPFQFK
            GO:GO:0006398 EMBL:AF160933 RefSeq:NP_651658.1 RefSeq:NP_733264.1
            UniGene:Dm.1362 ProteinModelPortal:Q9V3D6 SMR:Q9V3D6 IntAct:Q9V3D6
            STRING:Q9V3D6 PaxDb:Q9V3D6 PRIDE:Q9V3D6 EnsemblMetazoa:FBtr0085357
            GeneID:43426 KEGG:dme:Dmel_CG1957 UCSC:CG1957-RA CTD:43426
            FlyBase:FBgn0027873 InParanoid:Q8IML7 OrthoDB:EOG4XD261
            PhylomeDB:Q9V3D6 GenomeRNAi:43426 NextBio:833860 Bgee:Q9V3D6
            GermOnline:CG1957 Uniprot:Q9V3D6
        Length = 756

 Score = 342 (125.4 bits), Expect = 4.2e-31, Sum P(2) = 4.2e-31
 Identities = 96/363 (26%), Positives = 170/363 (46%)

Query:    30 GAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSAIDVLLITHFHLDHA 89
             GA +E    C  +      IL DCG    +          ++    +D +L++H    H 
Sbjct:    12 GAMDE-SPPCYILQIDDVRILLDCGWDEKFDANFIKELKRQVH--TLDAVLLSHPDAYHL 68

Query:    90 ASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDM-LFDEQDINRSMDKIE 148
              +LPY + K      ++ T     + ++ + D + +S  ++ D  LF   D++ + +KI 
Sbjct:    69 GALPYLVGKLGLNCPIYATIPVFKMGQMFMYD-LYMSHFNMGDFDLFSLDDVDTAFEKIT 127

Query:   149 VLDFHQTVEVN----GIKFWCYTAGHVLGAAMF-MVDIAGVRVLYTGDYSREEDRHLRAA 203
              L ++QTV +     GI      AGH++G  ++ +V +    ++Y  D++ +++RHL   
Sbjct:   128 QLKYNQTVSLKDKGYGISITPLNAGHMIGGTIWKIVKVGEEDIVYATDFNHKKERHLSGC 187

Query:   204 ELPQFSPDICIIESTYGVQLHQPRN-IREKRFTDVIHSTISQGGRVLIPAFALGRAQELL 262
             EL +      +I   Y  Q  Q R   R+++    I  T+   G VLI     GR  EL 
Sbjct:   188 ELDRLQRPSLLITDAYNAQYQQARRRARDEKLMTNILQTVRNNGNVLIAVDTAGRVLELA 247

Query:   263 LILDEYWSNHPE-FHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQF--ANSNPFKFKH 319
              +LD+ W N         +   + ++   +   ++ I  M++++   F  A +NPF+FKH
Sbjct:   248 HMLDQLWKNKESGLMAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARNNPFQFKH 307

Query:   320 ISPLNSIDDFSDV--GPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAK 377
             I   +S+ D   +  GP VV+AS   L+SG +R LF  W S+  N+ ++      GTLA 
Sbjct:   308 IQLCHSLADVYKLPAGPKVVLASTPDLESGFTRDLFVQWASNANNSIILTTRTSPGTLAM 367

Query:   378 TII 380
              ++
Sbjct:   368 ELV 370

 Score = 67 (28.6 bits), Expect = 4.2e-31, Sum P(2) = 4.2e-31
 Identities = 19/89 (21%), Positives = 43/89 (48%)

Query:   379 IISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEMGR 438
             ++ +P ++ +    T  +N QV  I F   +D       L +L P  +I++HG +   G 
Sbjct:   526 LLEKPTKL-ISQRKTIEVNAQVQRIDFEGRSDGESMLKILSQLRPRRVIVIHGTAE--G- 581

Query:   439 LKTKLMTELADCNT--KIITPKNCQSVEM 465
               T+++    + N   ++ TP+  + +++
Sbjct:   582 --TQVVARHCEQNVGARVFTPQKGEIIDV 608


>UNIPROTKB|F1SD85 [details] [associations]
            symbol:CPSF2 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0006379
            "mRNA cleavage" evidence=IEA] [GO:0006378 "mRNA polyadenylation"
            evidence=IEA] [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=IEA] InterPro:IPR001279
            InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 GO:GO:0016787
            InterPro:IPR022712 PANTHER:PTHR11203:SF5 Pfam:PF10996 SMART:SM01027
            GeneTree:ENSGT00700000104551 EMBL:CU468363
            Ensembl:ENSSSCT00000002717 OMA:GANDESP Uniprot:F1SD85
        Length = 385

 Score = 341 (125.1 bits), Expect = 1.0e-30, P = 1.0e-30
 Identities = 101/376 (26%), Positives = 173/376 (46%)

Query:    26 ITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA--IDVLLITH 83
             +T L    E    C  +       L DCG    +S    +   D +      ID +L++H
Sbjct:     7 LTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFS----MDIIDSLRKHVHQIDAVLLSH 62

Query:    84 FHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDM-LFDEQDINR 142
                 H  +LPY + K      ++ T     + ++ + D  + S+ + ED  LF   D++ 
Sbjct:    63 PDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQ-SRHNTEDFTLFTLDDVDA 121

Query:   143 SMDKIEVLDFHQTVEV----NGIKFWCYTAGHVLGAAMFMVDIAGVR-VLYTGDYSREED 197
             + DKI+ L F Q V +    +G+      AGH++G  ++ +   G   ++Y  D++ + +
Sbjct:   122 AFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKRE 181

Query:   198 RHLRAAELPQFSPDICIIESTYGVQLHQPRNIR--EKRFTDVIHSTISQGGRVLIPAFAL 255
              HL    L   S    +I  ++     QPR  +  E+  T+V+  T+   G VLI     
Sbjct:   182 IHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLE-TLRGDGSVLIAVDTA 240

Query:   256 GRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQ---TYILSMNERIRNQFAN- 311
             GR  EL  +LD+ W        + +Y  + L      V +   + +  M++++   F + 
Sbjct:   241 GRVLELAQLLDQIWRTKDA--GLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDK 298

Query:   312 -SNPFKFKHISPLNSIDDFSDV-GPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGY 369
              +NPF+F+H+S  + + D + V  P VV+AS   L+ G SR LF  WC D KN+ ++   
Sbjct:   299 RNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYR 358

Query:   370 VVEGTLAKTIISEPKE 385
                GTLA+ +I  P E
Sbjct:   359 TTPGTLARFLIDNPSE 374


>DICTYBASE|DDB_G0270392 [details] [associations]
            symbol:cpsf2 "cleavage and polyadenylation
            specificity factor 100 kDa subunit" species:44689 "Dictyostelium
            discoideum" [GO:0016787 "hydrolase activity" evidence=IEA]
            [GO:0006379 "mRNA cleavage" evidence=IEA;ISS] [GO:0006378 "mRNA
            polyadenylation" evidence=IEA;ISS] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA;ISS]
            [GO:0003723 "RNA binding" evidence=IEA;ISS] [GO:0006397 "mRNA
            processing" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753 SMART:SM00849
            dictyBase:DDB_G0270392 Pfam:PF07521 EMBL:AAFI02000005
            GenomeReviews:CM000150_GR GO:GO:0006378 GO:GO:0003723 GO:GO:0016787
            GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236 KO:K14402
            InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108
            PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
            OMA:NNPFQFK RefSeq:XP_646760.1 ProteinModelPortal:Q55BS1
            STRING:Q55BS1 EnsemblProtists:DDB0233700 GeneID:8617733
            KEGG:ddi:DDB_G0270392 ProtClustDB:CLSZ2431463 Uniprot:Q55BS1
        Length = 784

 Score = 340 (124.7 bits), Expect = 1.5e-30, Sum P(2) = 1.5e-30
 Identities = 101/381 (26%), Positives = 173/381 (45%)

Query:    27 TPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYS-GMAALPYFDEIDPSAIDVLLITHFH 85
             T L    +    C  +      IL DCG+  +Y+   + L   +++    ID +L++H  
Sbjct:     8 TALSGAKDESPPCYLLEIDDFCILLDCGL--SYNLDFSLLEPLEKV-AKKIDAVLLSHSD 64

Query:    86 LDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYV--KVSKVSVEDMLFDEQDINRS 143
               H   LPY + K    G ++ T     +  + L D    K+S+   +    D  D    
Sbjct:    65 TTHIGGLPYVVGKYGLTGTIYGTTPVLKMGTMFLYDLYENKMSQEEFQQYSLDNIDSCFG 124

Query:   144 MDKIEVLDFHQTVEVNG----IKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDRH 199
              D+ + L F Q   ++G    I    Y AGH +GA+++ +      ++Y  DY+   + H
Sbjct:   125 EDRFKELSFSQHYSLSGKGKGISITPYLAGHTIGASVWKITKGTYSIVYAIDYNHRNEGH 184

Query:   200 LRAAELPQ--FSPDICIIESTYGVQ--LHQPRNI-REKRFTDVIHSTISQGGRVLIPAFA 254
             L + +L      P + I +S  GV   L   + I R++   + I+  +  GG VLIP   
Sbjct:   185 LDSLQLTSDILKPSLLITDSK-GVDKTLAFKKTITRDQSLFEQINRNLRDGGNVLIPVDT 243

Query:   255 LGRAQELLLILDEYWSNHPEFHNIPIYYASPLA-KKCM-AVYQTYILSMNERIRNQFANS 312
              GR  ELLL ++ YWS +       + +    +   C  A  Q   +S    ++ +    
Sbjct:   244 AGRVLELLLCIENYWSKNKSLALYSVVFLGRFSFSVCQFARSQLEFMSSTASVKFEQNIE 303

Query:   313 NPFKFKHISPLNSIDDFSDVGPS--VVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYV 370
             NPF FKHI  L+S+++  ++  +  V++ S   L++G SR+LF  WCSD K   +    +
Sbjct:   304 NPFSFKHIKILSSLEELQELPDTNKVILTSSQDLETGFSRELFIQWCSDPKTLILFTQKI 363

Query:   371 VEGTLAKTIISEPKEVTLMNG 391
              + +LA  +I   K+ +  NG
Sbjct:   364 PKDSLADKLI---KQYSTPNG 381

 Score = 65 (27.9 bits), Expect = 1.5e-30, Sum P(2) = 1.5e-30
 Identities = 15/69 (21%), Positives = 33/69 (47%)

Query:   370 VVEGTLAKTIISE---PKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNI 426
             V E T+ +  I E   PK++ +   L  P+N ++  I +   +D       ++++ P  +
Sbjct:   525 VEEVTMEEDEIQEQEIPKKI-ITQTLRLPINCKIQTIDYEGCSDGRSIKAIIQQIAPTKL 583

Query:   427 ILVHGESHE 435
             +L+ G   +
Sbjct:   584 VLIRGSEQQ 592


>UNIPROTKB|F1NMN0 [details] [associations]
            symbol:CPSF2 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006378 "mRNA polyadenylation" evidence=IEA]
            [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA]
            [GO:0006398 "histone mRNA 3'-end processing" evidence=IEA]
            InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 GO:GO:0016787 GO:GO:0005847 InterPro:IPR022712
            InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
            Pfam:PF10996 Pfam:PF13299 SMART:SM01027
            GeneTree:ENSGT00700000104551 OMA:NNPFQFK GO:GO:0006398
            EMBL:AADN02003653 IPI:IPI00651282 Ensembl:ENSGALT00000017538
            Uniprot:F1NMN0
        Length = 782

 Score = 347 (127.2 bits), Expect = 2.0e-30, Sum P(2) = 2.0e-30
 Identities = 101/376 (26%), Positives = 174/376 (46%)

Query:    26 ITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA--IDVLLITH 83
             +T L    E    C  +       L DCG    +S    +   D +      +D +L++H
Sbjct:     7 LTTLSGVQEESALCYLLQVDEFRFLLDCGWDENFS----MDIIDSLKKHVHQVDAVLLSH 62

Query:    84 FHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDM-LFDEQDINR 142
                 H  +LPY + K      ++ T     + ++ + D  + S+ + ED  LF   D++ 
Sbjct:    63 PDPLHLGALPYAVGKMGLNCAIYATIPVYKMGQMFMYDLYQ-SRHNTEDFTLFTLDDVDA 121

Query:   143 SMDKIEVLDFHQTVEV----NGIKFWCYTAGHVLGAAMFMVDIAGVR-VLYTGDYSREED 197
             + DKI+ L F Q V +    +G+      AGH++G  ++ +   G   ++Y  D++ + +
Sbjct:   122 AFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKRE 181

Query:   198 RHLRAAELPQFSPDICIIESTYGVQLHQPRNIR--EKRFTDVIHSTISQGGRVLIPAFAL 255
              HL    L   S    +I  ++     QPR  +  E+  T+V+  T+   G VLI     
Sbjct:   182 IHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLE-TLRGDGNVLIAVDTA 240

Query:   256 GRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQ---TYILSMNERIRNQFAN- 311
             GR  EL  +LD+ W        + +Y  + L      V +   + +  M++++   F + 
Sbjct:   241 GRVLELAQLLDQIWRTKDA--GLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDK 298

Query:   312 -SNPFKFKHISPLNSIDDFSDV-GPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGY 369
              +NPF+F+H+S  +S+ D + V  P VV+AS   L+ G SR LF  WC D KN+ ++   
Sbjct:   299 RNNPFQFRHLSLCHSLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDSKNSIILTYR 358

Query:   370 VVEGTLAKTIISEPKE 385
                GTLA+ +I  P E
Sbjct:   359 TTPGTLARFLIDNPSE 374

 Score = 55 (24.4 bits), Expect = 2.0e-30, Sum P(2) = 2.0e-30
 Identities = 13/69 (18%), Positives = 30/69 (43%)

Query:   396 LNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEMGRLKTKLMTELADCNTKII 455
             +  +V YI +   +D       + ++ P  +++VHG   E  +   +        + K+ 
Sbjct:   531 IKARVTYIDYEGRSDGDSIKKIINQMKPRQLVIVHGPP-EASQDLAECCRAFGGKDIKVY 589

Query:   456 TPKNCQSVE 464
              PK  ++V+
Sbjct:   590 MPKLHETVD 598


>ZFIN|ZDB-GENE-040718-79 [details] [associations]
            symbol:cpsf2 "cleavage and polyadenylation specific
            factor 2" species:7955 "Danio rerio" [GO:0006378 "mRNA
            polyadenylation" evidence=IEA] [GO:0016787 "hydrolase activity"
            evidence=IEA] [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0005847
            "mRNA cleavage and polyadenylation specificity factor complex"
            evidence=IEA] InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 ZFIN:ZDB-GENE-040718-79 GO:GO:0016787
            eggNOG:COG1236 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
            InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
            SMART:SM01027 HOGENOM:HOG000264343 CTD:53981 HOVERGEN:HBG051106
            OrthoDB:EOG4MCWZQ EMBL:BC076029 IPI:IPI00512505
            RefSeq:NP_001002384.1 UniGene:Dr.121547 ProteinModelPortal:Q6DHE5
            STRING:Q6DHE5 PRIDE:Q6DHE5 GeneID:436657 KEGG:dre:436657
            InParanoid:Q6DHE5 NextBio:20831102 ArrayExpress:Q6DHE5 Bgee:Q6DHE5
            Uniprot:Q6DHE5
        Length = 790

 Score = 344 (126.2 bits), Expect = 3.2e-30, Sum P(2) = 3.2e-30
 Identities = 100/376 (26%), Positives = 175/376 (46%)

Query:    26 ITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA--IDVLLITH 83
             +T L    E    C  +       L DCG    +S    +   D +      +D +L++H
Sbjct:     7 LTALSGVQEESALCYLLQVDEFRFLLDCGWDETFS----MDIIDSLKRYVHQVDAVLLSH 62

Query:    84 FHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDM-LFDEQDINR 142
                 H  +LPY + K      ++ T     + ++ + D  + S+ + ED  LF   D++ 
Sbjct:    63 PDHVHLGALPYAVGKLGLNCTIYATIPVYKMGQMFMYDLYQ-SRHNTEDFTLFTLDDVDS 121

Query:   143 SMDKIEVLDFHQTVEV----NGIKFWCYTAGHVLGAAMFMVDIAGVR-VLYTGDYSREED 197
             + DKI+ L + Q V +    +G+      AGH++G  ++ +   G   ++Y  D++ + +
Sbjct:   122 AFDKIQQLKYSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIIYGVDFNHKRE 181

Query:   198 RHLRAAELPQFSPDICIIESTYGVQLHQPRNIR--EKRFTDVIHSTISQGGRVLIPAFAL 255
              HL    L   S    +I  ++     QPR  +  E+  T+V+  T+   G VLI     
Sbjct:   182 IHLNGCSLESLSRPSLLITDSFNASYVQPRRKQRDEQLLTNVME-TLRGDGNVLIAVDTA 240

Query:   256 GRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQ---TYILSMNERIRNQFAN- 311
             GR  EL  +LD+ W        + +Y  + L      V +   + +  M++++   F + 
Sbjct:   241 GRVLELAQLLDQIWRTKDA--GLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDK 298

Query:   312 -SNPFKFKHISPLNSIDDFSDV-GPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGY 369
              +NPF+F+H+S  +S+ D + V  P VV+ S   L+SG SR+LF  WC D KN+ ++   
Sbjct:   299 RNNPFQFRHLSLCHSLSDLARVPSPKVVLCSQPDLESGFSRELFIQWCQDAKNSVILTYR 358

Query:   370 VVEGTLAKTIISEPKE 385
                GTLA+ +I  P E
Sbjct:   359 TTPGTLARYLIDNPGE 374

 Score = 57 (25.1 bits), Expect = 3.2e-30, Sum P(2) = 3.2e-30
 Identities = 10/39 (25%), Positives = 19/39 (48%)

Query:   393 TAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHG 431
             T  +  +V YI +   +D       + ++ P  +I+VHG
Sbjct:   529 TLDIRARVMYIDYEGRSDGDSIKKIINQMKPRQLIIVHG 567

 Score = 39 (18.8 bits), Expect = 2.5e-28, Sum P(2) = 2.5e-28
 Identities = 8/26 (30%), Positives = 13/26 (50%)

Query:   461 QSVEMYFNSEKMAKTIGRLAEKTPEV 486
             + +E Y   E+M K   +  E+  EV
Sbjct:   390 RELEEYMEKERMKKEAAKKLEQAKEV 415


>UNIPROTKB|Q10568 [details] [associations]
            symbol:CPSF2 "Cleavage and polyadenylation specificity
            factor subunit 2" species:9913 "Bos taurus" [GO:0006398 "histone
            mRNA 3'-end processing" evidence=ISS] [GO:0005847 "mRNA cleavage
            and polyadenylation specificity factor complex" evidence=ISS]
            [GO:0003723 "RNA binding" evidence=IEA] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0006379 "mRNA cleavage" evidence=IEA]
            [GO:0006378 "mRNA polyadenylation" evidence=IEA] InterPro:IPR001279
            InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0003723 GO:GO:0016787 GO:GO:0005847 eggNOG:COG1236 KO:K14402
            InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108
            PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
            GeneTree:ENSGT00700000104551 HOGENOM:HOG000264343 OMA:NNPFQFK
            EMBL:X75931 IPI:IPI00688446 PIR:A56351 RefSeq:NP_787002.1
            UniGene:Bt.4077 ProteinModelPortal:Q10568 STRING:Q10568
            PRIDE:Q10568 Ensembl:ENSBTAT00000013500 GeneID:327689
            KEGG:bta:327689 CTD:53981 HOVERGEN:HBG051106 InParanoid:Q10568
            OrthoDB:EOG4MCWZQ NextBio:20810154 GO:GO:0006398 Uniprot:Q10568
        Length = 782

 Score = 342 (125.4 bits), Expect = 7.1e-30, Sum P(2) = 7.1e-30
 Identities = 101/376 (26%), Positives = 173/376 (46%)

Query:    26 ITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA--IDVLLITH 83
             +T L    E    C  +       L DCG    +S    +   D +      ID +L++H
Sbjct:     7 LTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFS----MDIIDSLRKHVHQIDAVLLSH 62

Query:    84 FHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDM-LFDEQDINR 142
                 H  +LPY + K      ++ T     + ++ + D  + S+ + ED  LF   D++ 
Sbjct:    63 PDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQ-SRHNTEDFTLFTLDDVDA 121

Query:   143 SMDKIEVLDFHQTVEV----NGIKFWCYTAGHVLGAAMFMVDIAGVR-VLYTGDYSREED 197
             + DKI+ L F Q V +    +G+      AGH++G  ++ +   G   ++Y  D++ + +
Sbjct:   122 AFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKRE 181

Query:   198 RHLRAAELPQFSPDICIIESTYGVQLHQPRNIR--EKRFTDVIHSTISQGGRVLIPAFAL 255
              HL    L   S    +I  ++     QPR  +  E+  T+V+  T+   G VLI     
Sbjct:   182 IHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLE-TLRGDGNVLIAVDTA 240

Query:   256 GRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQ---TYILSMNERIRNQFAN- 311
             GR  EL  +LD+ W        + +Y  + L      V +   + +  M++++   F + 
Sbjct:   241 GRVLELAQLLDQIWRTKDA--GLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDK 298

Query:   312 -SNPFKFKHISPLNSIDDFSDV-GPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGY 369
              +NPF+F+H+S  + + D + V  P VV+AS   L+ G SR LF  WC D KN+ ++   
Sbjct:   299 RNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYR 358

Query:   370 VVEGTLAKTIISEPKE 385
                GTLA+ +I  P E
Sbjct:   359 TTPGTLARFLIDNPSE 374

 Score = 56 (24.8 bits), Expect = 7.1e-30, Sum P(2) = 7.1e-30
 Identities = 14/69 (20%), Positives = 30/69 (43%)

Query:   396 LNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEMGRLKTKLMTELADCNTKII 455
             +  +V YI +   +D       + ++ P  +I+VHG   E  +   +        + K+ 
Sbjct:   531 IKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPP-EASQDLAECCRAFGGKDIKVY 589

Query:   456 TPKNCQSVE 464
              PK  ++V+
Sbjct:   590 MPKLHETVD 598


>UNIPROTKB|E2R496 [details] [associations]
            symbol:CPSF2 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0006398 "histone mRNA 3'-end processing"
            evidence=IEA] [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=IEA] [GO:0016787 "hydrolase
            activity" evidence=IEA] [GO:0006379 "mRNA cleavage" evidence=IEA]
            [GO:0006378 "mRNA polyadenylation" evidence=IEA] InterPro:IPR001279
            InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0016787 GO:GO:0005847 KO:K14402 InterPro:IPR022712
            InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
            Pfam:PF10996 Pfam:PF13299 SMART:SM01027
            GeneTree:ENSGT00700000104551 OMA:NNPFQFK CTD:53981 GO:GO:0006398
            EMBL:AAEX03005582 RefSeq:XP_537353.2 ProteinModelPortal:E2R496
            Ensembl:ENSCAFT00000017381 GeneID:480230 KEGG:cfa:480230
            NextBio:20855279 Uniprot:E2R496
        Length = 782

 Score = 342 (125.4 bits), Expect = 7.1e-30, Sum P(2) = 7.1e-30
 Identities = 101/376 (26%), Positives = 173/376 (46%)

Query:    26 ITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA--IDVLLITH 83
             +T L    E    C  +       L DCG    +S    +   D +      ID +L++H
Sbjct:     7 LTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFS----MDIIDSLRKHVHQIDAVLLSH 62

Query:    84 FHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDM-LFDEQDINR 142
                 H  +LPY + K      ++ T     + ++ + D  + S+ + ED  LF   D++ 
Sbjct:    63 PDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQ-SRHNTEDFTLFTLDDVDA 121

Query:   143 SMDKIEVLDFHQTVEV----NGIKFWCYTAGHVLGAAMFMVDIAGVR-VLYTGDYSREED 197
             + DKI+ L F Q V +    +G+      AGH++G  ++ +   G   ++Y  D++ + +
Sbjct:   122 AFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKRE 181

Query:   198 RHLRAAELPQFSPDICIIESTYGVQLHQPRNIR--EKRFTDVIHSTISQGGRVLIPAFAL 255
              HL    L   S    +I  ++     QPR  +  E+  T+V+  T+   G VLI     
Sbjct:   182 IHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLE-TLRGDGNVLIAVDTA 240

Query:   256 GRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQ---TYILSMNERIRNQFAN- 311
             GR  EL  +LD+ W        + +Y  + L      V +   + +  M++++   F + 
Sbjct:   241 GRVLELAQLLDQIWRTKDA--GLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDK 298

Query:   312 -SNPFKFKHISPLNSIDDFSDV-GPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGY 369
              +NPF+F+H+S  + + D + V  P VV+AS   L+ G SR LF  WC D KN+ ++   
Sbjct:   299 RNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYR 358

Query:   370 VVEGTLAKTIISEPKE 385
                GTLA+ +I  P E
Sbjct:   359 TTPGTLARFLIDNPSE 374

 Score = 56 (24.8 bits), Expect = 7.1e-30, Sum P(2) = 7.1e-30
 Identities = 14/69 (20%), Positives = 30/69 (43%)

Query:   396 LNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEMGRLKTKLMTELADCNTKII 455
             +  +V YI +   +D       + ++ P  +I+VHG   E  +   +        + K+ 
Sbjct:   531 IKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPP-EASQDLAECCRAFGGKDIKVY 589

Query:   456 TPKNCQSVE 464
              PK  ++V+
Sbjct:   590 MPKLHETVD 598


>UNIPROTKB|Q9P2I0
            symbol:CPSF2 "Cleavage and polyadenylation specificity
            factor subunit 2" species:9606 "Homo sapiens" [GO:0006378 "mRNA
            polyadenylation" evidence=IEA] [GO:0006379 "mRNA cleavage"
            evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
            [GO:0003723 "RNA binding" evidence=IEA] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0006398 "histone mRNA 3'-end processing"
            evidence=IDA] [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=IDA] [GO:0000398 "mRNA
            splicing, via spliceosome" evidence=TAS] [GO:0005654 "nucleoplasm"
            evidence=TAS] [GO:0006366 "transcription from RNA polymerase II
            promoter" evidence=TAS] [GO:0006369 "termination of RNA polymerase
            II transcription" evidence=TAS] [GO:0006397 "mRNA processing"
            evidence=TAS] [GO:0006406 "mRNA export from nucleus" evidence=TAS]
            [GO:0008380 "RNA splicing" evidence=TAS] [GO:0010467 "gene
            expression" evidence=TAS] [GO:0031124 "mRNA 3'-end processing"
            evidence=TAS] Reactome:REACT_71 InterPro:IPR001279
            InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            EMBL:CH471061 Reactome:REACT_1675 GO:GO:0003723 GO:GO:0016787
            GO:GO:0006406 GO:GO:0000398 Reactome:REACT_1788 GO:GO:0005847
            GO:GO:0006369 eggNOG:COG1236 KO:K14402 InterPro:IPR022712
            InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
            Pfam:PF10996 Pfam:PF13299 SMART:SM01027 Reactome:REACT_78
            HOGENOM:HOG000264343 OMA:NNPFQFK CTD:53981 HOVERGEN:HBG051106
            OrthoDB:EOG4MCWZQ GO:GO:0006398 EMBL:AK001627 EMBL:BC070095
            EMBL:AB037788 EMBL:AL442079 IPI:IPI00419531 RefSeq:NP_059133.1
            UniGene:Hs.657632 UniGene:Hs.736541 ProteinModelPortal:Q9P2I0
            SMR:Q9P2I0 DIP:DIP-42500N IntAct:Q9P2I0 MINT:MINT-1697677
            STRING:Q9P2I0 PhosphoSite:Q9P2I0 DMDM:51338827 PaxDb:Q9P2I0
            PeptideAtlas:Q9P2I0 PRIDE:Q9P2I0 Ensembl:ENST00000298875
            GeneID:53981 KEGG:hsa:53981 UCSC:uc001yah.2 GeneCards:GC14P092588
            HGNC:HGNC:2325 HPA:HPA024238 MIM:606028 neXtProt:NX_Q9P2I0
            PharmGKB:PA26842 InParanoid:Q9P2I0 PhylomeDB:Q9P2I0 ChiTaRS:CPSF2
            GenomeRNAi:53981 NextBio:56268 ArrayExpress:Q9P2I0 Bgee:Q9P2I0
            CleanEx:HS_CPSF2 Genevestigator:Q9P2I0 GermOnline:ENSG00000165934
            Uniprot:Q9P2I0
        Length = 782

 Score = 342 (125.4 bits), Expect = 7.1e-30, Sum P(2) = 7.1e-30
 Identities = 101/376 (26%), Positives = 173/376 (46%)

Query:    26 ITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA--IDVLLITH 83
             +T L    E    C  +       L DCG    +S    +   D +      ID +L++H
Sbjct:     7 LTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFS----MDIIDSLRKHVHQIDAVLLSH 62

Query:    84 FHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDM-LFDEQDINR 142
                 H  +LPY + K      ++ T     + ++ + D  + S+ + ED  LF   D++ 
Sbjct:    63 PDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQ-SRHNTEDFTLFTLDDVDA 121

Query:   143 SMDKIEVLDFHQTVEV----NGIKFWCYTAGHVLGAAMFMVDIAGVR-VLYTGDYSREED 197
             + DKI+ L F Q V +    +G+      AGH++G  ++ +   G   ++Y  D++ + +
Sbjct:   122 AFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKRE 181

Query:   198 RHLRAAELPQFSPDICIIESTYGVQLHQPRNIR--EKRFTDVIHSTISQGGRVLIPAFAL 255
              HL    L   S    +I  ++     QPR  +  E+  T+V+  T+   G VLI     
Sbjct:   182 IHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLE-TLRGDGNVLIAVDTA 240

Query:   256 GRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQ---TYILSMNERIRNQFAN- 311
             GR  EL  +LD+ W        + +Y  + L      V +   + +  M++++   F + 
Sbjct:   241 GRVLELAQLLDQIWRTKDA--GLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDK 298

Query:   312 -SNPFKFKHISPLNSIDDFSDV-GPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGY 369
              +NPF+F+H+S  + + D + V  P VV+AS   L+ G SR LF  WC D KN+ ++   
Sbjct:   299 RNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYR 358

Query:   370 VVEGTLAKTIISEPKE 385
                GTLA+ +I  P E
Sbjct:   359 TTPGTLARFLIDNPSE 374

 Score = 56 (24.8 bits), Expect = 7.1e-30, Sum P(2) = 7.1e-30
 Identities = 14/69 (20%), Positives = 30/69 (43%)

Query:   396 LNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEMGRLKTKLMTELADCNTKII 455
             +  +V YI +   +D       + ++ P  +I+VHG   E  +   +        + K+ 
Sbjct:   531 IKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPP-EASQDLAECCRAFGGKDIKVY 589

Query:   456 TPKNCQSVE 464
              PK  ++V+
Sbjct:   590 MPKLHETVD 598


>RGD|1309687
            symbol:Cpsf2 "cleavage and polyadenylation specific factor 2,
            100kDa" species:10116 "Rattus norvegicus" [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=IEA;ISO] [GO:0006378 "mRNA polyadenylation" evidence=IEA]
            [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006398 "histone mRNA
            3'-end processing" evidence=IEA;ISO] [GO:0016787 "hydrolase
            activity" evidence=IEA] InterPro:IPR001279 InterPro:IPR027075
            Pfam:PF00753 SMART:SM00849 Pfam:PF07521 RGD:1309687 GO:GO:0016787
            EMBL:CH473982 GO:GO:0005847 KO:K14402 InterPro:IPR022712
            InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
            Pfam:PF10996 Pfam:PF13299 SMART:SM01027
            GeneTree:ENSGT00700000104551 OMA:NNPFQFK CTD:53981
            OrthoDB:EOG4MCWZQ GO:GO:0006398 IPI:IPI00189534
            RefSeq:NP_001100223.1 UniGene:Rn.8038 Ensembl:ENSRNOT00000008612
            GeneID:299256 KEGG:rno:299256 UCSC:RGD:1309687 NextBio:645098
            Uniprot:D3Z9E6
        Length = 782

 Score = 337 (123.7 bits), Expect = 3.1e-29, Sum P(2) = 3.1e-29
 Identities = 100/376 (26%), Positives = 173/376 (46%)

Query:    26 ITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA--IDVLLITH 83
             +T L    E    C  +       L DCG    +S    +   D +      ID +L++H
Sbjct:     7 LTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFS----VDIIDSLRKHVHQIDAVLLSH 62

Query:    84 FHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDM-LFDEQDINR 142
                 H  +LP+ + K      ++ T     + ++ + D  + S+ + ED  LF   D++ 
Sbjct:    63 PDPLHLGALPFAVGKLGLNCAIYATIPVYKMGQMFMYDLYQ-SRHNTEDFTLFTLDDVDA 121

Query:   143 SMDKIEVLDFHQTVEV----NGIKFWCYTAGHVLGAAMFMVDIAGVR-VLYTGDYSREED 197
             + DKI+ L F Q V +    +G+      AGH++G  ++ +   G   ++Y  D++ + +
Sbjct:   122 AFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKRE 181

Query:   198 RHLRAAELPQFSPDICIIESTYGVQLHQPRNIR--EKRFTDVIHSTISQGGRVLIPAFAL 255
              HL    L   S    +I  ++     QPR  +  E+  T+V+  T+   G VLI     
Sbjct:   182 IHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLE-TLRGDGNVLIAVDTA 240

Query:   256 GRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQ---TYILSMNERIRNQFAN- 311
             GR  EL  +LD+ W        + +Y  + L      V +   + +  M++++   F + 
Sbjct:   241 GRVLELAQLLDQIWRTKDA--GLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDK 298

Query:   312 -SNPFKFKHISPLNSIDDFSDV-GPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGY 369
              +NPF+F+H+S  + + D + V  P VV+AS   L+ G SR LF  WC D KN+ ++   
Sbjct:   299 RNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYR 358

Query:   370 VVEGTLAKTIISEPKE 385
                GTLA+ +I  P E
Sbjct:   359 TTPGTLARFLIDNPSE 374

 Score = 56 (24.8 bits), Expect = 3.1e-29, Sum P(2) = 3.1e-29
 Identities = 14/69 (20%), Positives = 30/69 (43%)

Query:   396 LNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEMGRLKTKLMTELADCNTKII 455
             +  +V YI +   +D       + ++ P  +I+VHG   E  +   +        + K+ 
Sbjct:   531 IKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPP-EASQDLAECCRAFGGKDIKVY 589

Query:   456 TPKNCQSVE 464
              PK  ++V+
Sbjct:   590 MPKLHETVD 598


>TIGR_CMR|DET_1061
            symbol:DET_1061 "metallo-beta-lactamase family protein"
            species:243164 "Dehalococcoides ethenogenes 195" [GO:0003824
            "catalytic activity" evidence=ISS] [GO:0008152 "metabolic process"
            evidence=ISS] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 GO:GO:0016787 EMBL:CP000027 GenomeReviews:CP000027_GR
            eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 OMA:MAVEYMS HOGENOM:HOG000244774 KO:K07576
            RefSeq:YP_181776.1 ProteinModelPortal:Q3Z7M3 STRING:Q3Z7M3
            GeneID:3229629 KEGG:det:DET1061 PATRIC:21609167
            ProtClustDB:CLSK2516599 BioCyc:DETH243164:GJNF-1062-MONOMER
            Uniprot:Q3Z7M3
        Length = 468

 Score = 264 (98.0 bits), Expect = 3.4e-29, Sum P(2) = 3.4e-29
 Identities = 80/328 (24%), Positives = 151/328 (46%)

Query:   134 LFDEQDINRSMDKIEVLDFHQTVEVN-GIKFWCYTAGHVLGAAMFMVDIAGVR----VLY 188
             L+  +D        + +++ + + V   I    + AGHV G+A   + I        +++
Sbjct:   129 LYTAEDARAVSPLFKTVEYSREIAVTEDITATFHNAGHVFGSASIELKIQENHRQKVIVF 188

Query:   189 TGDYSREEDRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRV 248
             +GD    +   L+  +L     D  +IESTYG + HQ  N    +  ++I+ T+  GG +
Sbjct:   189 SGDLGNWDRPILKNPDLVN-QADYVVIESTYGDRTHQDINEASLKLAEIINQTVKLGGNI 247

Query:   249 LIPAFALGRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQ 308
             +IP+FAL R Q+LL  L+ + S   +  ++ ++  SP+A     +++ +   + +R  + 
Sbjct:   248 VIPSFALERTQDLLFFLNRFMSEG-KIPSLKVFVDSPMAISITKIFKEHP-ELYDRETSG 305

Query:   309 FAN--SNPFKFKHISPLNSIDD----FSDVGPSVVMASPGGLQSGLSRQLFDIWCSDKKN 362
             + N  S+PF+F+ +   N   D     ++  P +++A  G    G  +       S  ++
Sbjct:   306 WVNNGSSPFEFEGLHFTNKAADSKAILAEKDPCIIIAGSGMCTGGRIKHHLVNNISRPES 365

Query:   363 ACVIPGYVVEGTLAKTIISEPKEVTLMNGLTAPLNMQVHYI-SFSAHADYAQTSTFLKEL 421
               +  G+   GTL + I    KEV ++ G   P+  ++  + +FSAHAD      +LK  
Sbjct:   366 TILFVGFQATGTLGRLITDGAKEVRIL-GQHYPVQARIEELRAFSAHADQPTLLRWLKGF 424

Query:   422 M--PPNIILVHGESHEMGRLKTKLMTEL 447
                P  + + HGE     R    +   L
Sbjct:   425 KNKPEMVFVTHGEPETSARFTETIKNTL 452

 Score = 127 (49.8 bits), Expect = 3.4e-29, Sum P(2) = 3.4e-29
 Identities = 37/112 (33%), Positives = 53/112 (47%)

Query:    29 LGAGNEVGRSCVYMSYKGKTILFDCGIHPA--YSGMAALPYFDEIDPSAIDVLLITHFHL 86
             LGA   V  S   +      +L DCG++           P+  EI P ++  ++I+H H+
Sbjct:     8 LGAARNVTGSRYLIKTDHTQLLVDCGLYQERRLQDRNWQPF--EIPPQSLSAVIISHAHI 65

Query:    87 DHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLFDEQ 138
             DH   LP  L K  F G VF T AT  I ++ LTD     K+  ED  F ++
Sbjct:    66 DHCGLLPK-LVKEGFAGPVFATEATAEIARISLTD---AGKLQEEDAAFKKK 113


>UNIPROTKB|Q9W799
            symbol:cpsf2 "Cleavage and polyadenylation specificity
            factor subunit 2" species:8355 "Xenopus laevis" [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=ISS] InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 GO:GO:0005737 GO:GO:0006397
            GO:GO:0003723 GO:GO:0016787 GO:GO:0005847 KO:K14402
            InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108
            PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
            CTD:53981 HOVERGEN:HBG051106 EMBL:AF139986 RefSeq:NP_001081123.1
            UniGene:Xl.3876 ProteinModelPortal:Q9W799 GeneID:394394
            KEGG:xla:394394 Xenbase:XB-GENE-950598 Uniprot:Q9W799
        Length = 783

 Score = 333 (122.3 bits), Expect = 3.7e-29, Sum P(2) = 3.7e-29
 Identities = 97/376 (25%), Positives = 174/376 (46%)

Query:    26 ITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA--IDVLLITH 83
             +T L    E    C  +       L DCG    +S    +   D +      +D +L++H
Sbjct:     7 LTTLVGAQEESAVCYLLQVDEFRFLLDCGWDENFS----MDIIDSVKKYVHQVDAVLLSH 62

Query:    84 FHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDM-LFDEQDINR 142
                 H  +LPY + K      ++ T     + ++ + D  + S+ + ED  LF   D++ 
Sbjct:    63 PDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQ-SRHNTEDFSLFSLDDVDC 121

Query:   143 SMDKIEVLDFHQTVEV----NGIKFWCYTAGHVLGAAMFMVDIAGVR-VLYTGDYSREED 197
             + DKI+ L ++Q V +    +G+      AGH++G  ++ +   G   ++Y  D++ + +
Sbjct:   122 AFDKIQQLKYNQIVHLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKRE 181

Query:   198 RHLRAAELPQFSPDICIIESTYGVQLHQPRNIR--EKRFTDVIHSTISQGGRVLIPAFAL 255
              HL    L   +    +I  ++     QPR  +  E+  T+V+  T+   G VLI     
Sbjct:   182 IHLNGCSLEMINRPSLLITDSFNATYVQPRRKQRDEQLLTNVLE-TLRGDGNVLIAVDTA 240

Query:   256 GRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQ---TYILSMNERIRNQFAN- 311
             GR  EL  +LD+ W        + +Y  + L      V +   + +  M++++   F + 
Sbjct:   241 GRVLELAQLLDQIWRTKDA--GLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDK 298

Query:   312 -SNPFKFKHISPLNSIDDFSDV-GPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGY 369
              +NPF+F+H++  +   D + V  P VV+AS   L+ G SR+LF  WC D KN+ ++   
Sbjct:   299 RNNPFQFRHLTLCHGYSDLARVPSPKVVLASQPDLECGFSRELFIQWCQDPKNSVILTYR 358

Query:   370 VVEGTLAKTIISEPKE 385
                GTLA+ +I  P E
Sbjct:   359 TTPGTLARFLIDHPSE 374

 Score = 60 (26.2 bits), Expect = 3.7e-29, Sum P(2) = 3.7e-29
 Identities = 15/69 (21%), Positives = 30/69 (43%)

Query:   396 LNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEMGRLKTKLMTELADCNTKII 455
             +  +V YI +   +D       + ++ P  +I+VHG       L  +        + K+ 
Sbjct:   531 IKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPDATQDL-AEACRAFGGKDIKVY 589

Query:   456 TPKNCQSVE 464
             TPK  ++V+
Sbjct:   590 TPKLHETVD 598


>MGI|MGI:1861601
            symbol:Cpsf2 "cleavage and polyadenylation specific factor
            2" species:10090 "Mus musculus" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0003723 "RNA binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=ISO;IDA]
            [GO:0006397 "mRNA processing" evidence=IEA] [GO:0006398 "histone
            mRNA 3'-end processing" evidence=ISO] [GO:0016787 "hydrolase
            activity" evidence=IEA] InterPro:IPR001279 InterPro:IPR027075
            Pfam:PF00753 SMART:SM00849 Pfam:PF07521 MGI:MGI:1861601
            GO:GO:0003723 GO:GO:0016787 GO:GO:0005847 eggNOG:COG1236 KO:K14402
            InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108
            PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
            GeneTree:ENSGT00700000104551 HOGENOM:HOG000264343 OMA:NNPFQFK
            CTD:53981 HOVERGEN:HBG051106 OrthoDB:EOG4MCWZQ GO:GO:0006398
            EMBL:AF012822 EMBL:BC013628 EMBL:BC007163 IPI:IPI00314302
            RefSeq:NP_058552.1 UniGene:Mm.716 ProteinModelPortal:O35218
            SMR:O35218 STRING:O35218 PhosphoSite:O35218 PaxDb:O35218
            PRIDE:O35218 Ensembl:ENSMUST00000047357 GeneID:51786 KEGG:mmu:51786
            UCSC:uc007otx.2 InParanoid:O35218 NextBio:308008 Bgee:O35218
            CleanEx:MM_CPSF2 Genevestigator:O35218
            GermOnline:ENSMUSG00000041781 Uniprot:O35218
        Length = 782

 Score = 336 (123.3 bits), Expect = 4.0e-29, Sum P(2) = 4.0e-29
 Identities = 100/376 (26%), Positives = 173/376 (46%)

Query:    26 ITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA--IDVLLITH 83
             +T L    E    C  +       L DCG    +S    +   D +      ID +L++H
Sbjct:     7 LTTLSGVQEESALCYLLQVDEFRFLLDCGWDEHFS----VDIIDSLRKHVHQIDAVLLSH 62

Query:    84 FHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDM-LFDEQDINR 142
                 H  +LP+ + K      ++ T     + ++ + D  + S+ + ED  LF   D++ 
Sbjct:    63 PDPLHLGALPFAVGKLGLNCAIYATIPVYKMGQMFMYDLYQ-SRHNTEDFTLFTLDDVDA 121

Query:   143 SMDKIEVLDFHQTVEV----NGIKFWCYTAGHVLGAAMFMVDIAGVR-VLYTGDYSREED 197
             + DKI+ L F Q V +    +G+      AGH++G  ++ +   G   ++Y  D++ + +
Sbjct:   122 AFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKRE 181

Query:   198 RHLRAAELPQFSPDICIIESTYGVQLHQPRNIR--EKRFTDVIHSTISQGGRVLIPAFAL 255
              HL    L   S    +I  ++     QPR  +  E+  T+V+  T+   G VLI     
Sbjct:   182 IHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLE-TLRGDGNVLIAVDTA 240

Query:   256 GRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQ---TYILSMNERIRNQFAN- 311
             GR  EL  +LD+ W        + +Y  + L      V +   + +  M++++   F + 
Sbjct:   241 GRVLELAQLLDQIWRTKDA--GLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDK 298

Query:   312 -SNPFKFKHISPLNSIDDFSDV-GPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGY 369
              +NPF+F+H+S  + + D + V  P VV+AS   L+ G SR LF  WC D KN+ ++   
Sbjct:   299 RNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYR 358

Query:   370 VVEGTLAKTIISEPKE 385
                GTLA+ +I  P E
Sbjct:   359 TTPGTLARFLIDNPTE 374

 Score = 56 (24.8 bits), Expect = 4.0e-29, Sum P(2) = 4.0e-29
 Identities = 14/69 (20%), Positives = 30/69 (43%)

Query:   396 LNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEMGRLKTKLMTELADCNTKII 455
             +  +V YI +   +D       + ++ P  +I+VHG   E  +   +        + K+ 
Sbjct:   531 IKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPP-EASQDLAECCRAFGGKDIKVY 589

Query:   456 TPKNCQSVE 464
              PK  ++V+
Sbjct:   590 MPKLHETVD 598


>UNIPROTKB|Q8EJC6
            symbol:SO_0541 "RNA-metabolizing metallo-beta-lactamase
            family protein" species:211586 "Shewanella oneidensis MR-1"
            [GO:0008150 "biological_process" evidence=ND] InterPro:IPR001279
            Pfam:PF00753 SMART:SM00849 Pfam:PF07521 GO:GO:0016787 EMBL:AE014299
            GenomeReviews:AE014299_GR InterPro:IPR022712 InterPro:IPR011108
            Pfam:PF10996 SMART:SM01027 OMA:MAVEYMS HOGENOM:HOG000244774
            KO:K07576 RefSeq:NP_716177.2 ProteinModelPortal:Q8EJC6
            DNASU:1168409 GeneID:1168409 KEGG:son:SO_0541 PATRIC:23520762
            ProtClustDB:CLSK2516780 Uniprot:Q8EJC6
        Length = 480

 Score = 236 (88.1 bits), Expect = 7.4e-28, Sum P(2) = 7.4e-28
 Identities = 82/317 (25%), Positives = 156/317 (49%)

Query:   134 LFDEQDINRSMDKIEVLDFHQTVEVNGIKFWCYT-AGHVLGAAMFMVDIA-GV---RVLY 188
             LF  +D  +++ +   L++ Q   V      C + AGH+LG+A+  + +  G    ++++
Sbjct:   127 LFTVEDAEQAISQFVSLEYGQVTRVIPHVDICLSDAGHILGSALVELWLGEGKSQKKIVF 186

Query:   189 TGDYSREEDRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQG-GR 247
             +GD  R     L+   L   + D+ ++ESTYG + H+          D+   T+++  G 
Sbjct:   187 SGDLGRAGMPILQNPTLVD-TADLVLMESTYGNRFHRSWTDTLAELKDIFAKTVNESQGN 245

Query:   248 VLIPAFALGRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRN 307
             +L+PAF++GRAQELL +   Y +   +     I   SP+A +   VY      M+E  + 
Sbjct:   246 ILLPAFSVGRAQELLYLFHLY-AKEWDLGRWKICLDSPMAIEATRVYVNNYPLMDEDFK- 303

Query:   308 QFANSNPFKFKHISPLNSIDD------FSDVGPSVVMASPGGLQSG---LSRQLFDIWCS 358
             +F   +P +   +S +  I         ++V   +++ +  G+ +G    S    ++W S
Sbjct:   304 RFTRQHPGQHPLLSNVEFIQTTEESIALNEVHKGLIIIAGSGMCNGGRIRSHLEHNLWRS 363

Query:   359 DKKNACVIPGYVVEGTLAKTIISEPKEVTLMNGLTAPLNMQVHYIS-FSAHADYAQTSTF 417
             +     +I G+   GT  + ++   KE+T+ +G +  +  ++H +   SAHAD A+   +
Sbjct:   364 ECD--VIICGFQALGTPGRALVDGAKELTI-HGNSVNVAAKLHTVGGLSAHADQAELLRW 420

Query:   418 LK--ELMPPNIILVHGE 432
              +  E  PP ++LVHGE
Sbjct:   421 YRHFEEQPP-LVLVHGE 436

 Score = 147 (56.8 bits), Expect = 7.4e-28, Sum P(2) = 7.4e-28
 Identities = 49/172 (28%), Positives = 81/172 (47%)

Query:    23 QLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMA-ALPYFDEI--DPSAIDVL 79
             Q+ +  LGA  EV  SC  ++  GK +L DCG+     G A  L   +    DP  I  +
Sbjct:     2 QMTLQFLGAAREVTGSCHLVTVAGKHLLLDCGL--IQGGKADELRNHEPFVFDPQTIVAV 59

Query:    80 LITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVS---------KVSV 130
             +++H H+DH+  LP  L K  F G ++   AT  +  ++L D   +          K + 
Sbjct:    60 VLSHAHIDHSGRLP-LLVKAGFDGPIYTHKATAELCAIMLKDAAMLQVRDTERTNKKRAK 118

Query:   131 EDM-----LFDEQDINRSMDKIEVLDFHQTVEVNGIKFWCYT-AGHVLGAAM 176
              D+     LF  +D  +++ +   L++ Q   V      C + AGH+LG+A+
Sbjct:   119 HDLAPLEPLFTVEDAEQAISQFVSLEYGQVTRVIPHVDICLSDAGHILGSAL 170


>TIGR_CMR|SO_0541
            symbol:SO_0541 "metallo-beta-lactamase family protein"
            species:211586 "Shewanella oneidensis MR-1" [GO:0008150
            "biological_process" evidence=ND] [GO:0003824 "catalytic activity"
            evidence=ISS] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 GO:GO:0016787 EMBL:AE014299 GenomeReviews:AE014299_GR
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            OMA:MAVEYMS HOGENOM:HOG000244774 KO:K07576 RefSeq:NP_716177.2
            ProteinModelPortal:Q8EJC6 DNASU:1168409 GeneID:1168409
            KEGG:son:SO_0541 PATRIC:23520762 ProtClustDB:CLSK2516780
            Uniprot:Q8EJC6
        Length = 480

 Score = 236 (88.1 bits), Expect = 7.4e-28, Sum P(2) = 7.4e-28
 Identities = 82/317 (25%), Positives = 156/317 (49%)

Query:   134 LFDEQDINRSMDKIEVLDFHQTVEVNGIKFWCYT-AGHVLGAAMFMVDIA-GV---RVLY 188
             LF  +D  +++ +   L++ Q   V      C + AGH+LG+A+  + +  G    ++++
Sbjct:   127 LFTVEDAEQAISQFVSLEYGQVTRVIPHVDICLSDAGHILGSALVELWLGEGKSQKKIVF 186

Query:   189 TGDYSREEDRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQG-GR 247
             +GD  R     L+   L   + D+ ++ESTYG + H+          D+   T+++  G 
Sbjct:   187 SGDLGRAGMPILQNPTLVD-TADLVLMESTYGNRFHRSWTDTLAELKDIFAKTVNESQGN 245

Query:   248 VLIPAFALGRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRN 307
             +L+PAF++GRAQELL +   Y +   +     I   SP+A +   VY      M+E  + 
Sbjct:   246 ILLPAFSVGRAQELLYLFHLY-AKEWDLGRWKICLDSPMAIEATRVYVNNYPLMDEDFK- 303

Query:   308 QFANSNPFKFKHISPLNSIDD------FSDVGPSVVMASPGGLQSG---LSRQLFDIWCS 358
             +F   +P +   +S +  I         ++V   +++ +  G+ +G    S    ++W S
Sbjct:   304 RFTRQHPGQHPLLSNVEFIQTTEESIALNEVHKGLIIIAGSGMCNGGRIRSHLEHNLWRS 363

Query:   359 DKKNACVIPGYVVEGTLAKTIISEPKEVTLMNGLTAPLNMQVHYIS-FSAHADYAQTSTF 417
             +     +I G+   GT  + ++   KE+T+ +G +  +  ++H +   SAHAD A+   +
Sbjct:   364 ECD--VIICGFQALGTPGRALVDGAKELTI-HGNSVNVAAKLHTVGGLSAHADQAELLRW 420

Query:   418 LK--ELMPPNIILVHGE 432
              +  E  PP ++LVHGE
Sbjct:   421 YRHFEEQPP-LVLVHGE 436

 Score = 147 (56.8 bits), Expect = 7.4e-28, Sum P(2) = 7.4e-28
 Identities = 49/172 (28%), Positives = 81/172 (47%)

Query:    23 QLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMA-ALPYFDEI--DPSAIDVL 79
             Q+ +  LGA  EV  SC  ++  GK +L DCG+     G A  L   +    DP  I  +
Sbjct:     2 QMTLQFLGAAREVTGSCHLVTVAGKHLLLDCGL--IQGGKADELRNHEPFVFDPQTIVAV 59

Query:    80 LITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVS---------KVSV 130
             +++H H+DH+  LP  L K  F G ++   AT  +  ++L D   +          K + 
Sbjct:    60 VLSHAHIDHSGRLP-LLVKAGFDGPIYTHKATAELCAIMLKDAAMLQVRDTERTNKKRAK 118

Query:   131 EDM-----LFDEQDINRSMDKIEVLDFHQTVEVNGIKFWCYT-AGHVLGAAM 176
              D+     LF  +D  +++ +   L++ Q   V      C + AGH+LG+A+
Sbjct:   119 HDLAPLEPLFTVEDAEQAISQFVSLEYGQVTRVIPHVDICLSDAGHILGSAL 170


>POMBASE|SPBC1709.15c
            symbol:cft2 "cleavage factor two Cft2/polyadenylation
            factor CPSF-73 (predicted)" species:4896 "Schizosaccharomyces
            pombe" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=IDA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IDA]
            [GO:0005849 "mRNA cleavage factor complex" evidence=ISO]
            [GO:0006378 "mRNA polyadenylation" evidence=ISO] [GO:0006379 "mRNA
            cleavage" evidence=ISO] InterPro:IPR027075 PomBase:SPBC1709.15c
            Pfam:PF07521 EMBL:CU329671 GO:GO:0006378 GenomeReviews:CU329671_GR
            GO:GO:0005847 GO:GO:0006379 PIR:T39643 RefSeq:NP_595448.1
            ProteinModelPortal:O74740 STRING:O74740 EnsemblFungi:SPBC1709.15c.1
            GeneID:2539954 KEGG:spo:SPBC1709.15c eggNOG:COG1236 KO:K14402
            OMA:ISSIATP OrthoDB:EOG4WWVSN NextBio:20801097 InterPro:IPR022712
            InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
            Pfam:PF10996 Pfam:PF13299 SMART:SM01027 Uniprot:O74740
        Length = 797

 Score = 279 (103.3 bits), Expect = 2.6e-22, Sum P(2) = 2.6e-22
 Identities = 88/315 (27%), Positives = 151/315 (47%)

Query:    73 PSAIDVLLITHFHLDHAASLPYFLEKTTFKGR-VFMTHATKAIYKLLLTDYVKVSKVSVE 131
             P   D++L++H  L H   L Y   K  +K   ++ T  T  + ++ + D +K + +S  
Sbjct:    41 PEQPDLILLSHSDLAHIGGLVYAYYKYDWKNAYIYATLPTINMGRMTMLDAIKSNYIS-- 98

Query:   132 DMLFDEQDINRSMDKIEVLDFHQTV----EVNGIKFWCYTAGHVLGAAMFMVDIAGVRVL 187
             DM   + D++   D I  L + Q      + +G+    Y AGH LG  ++ +      VL
Sbjct:    99 DM--SKADVDAVFDSIIPLRYQQPTLLLGKCSGLTITAYNAGHTLGGTLWSLIKESESVL 156

Query:   188 YTGDYSREEDRHLRAAELPQFS--------PDICIIESTYGVQLHQPRNIREKRFTDVIH 239
             Y  D++  +D+HL  A L            P+  I ++   +     R  R++ F + + 
Sbjct:   157 YAVDWNHSKDKHLNGAALYSNGHILEALNRPNTLITDANNSLVSIPSRKKRDEAFIESVM 216

Query:   240 STISQGGRVLIPAFALGRAQELLLILDEYWS-NHPEFHNIPIYYASPLAKKCMAVYQTYI 298
             S++ +GG VL+P  A  R  EL  ILD +WS + P     PI + SP + K +   ++ I
Sbjct:   217 SSLLKGGTVLLPVDAASRVLELCCILDNHWSASQPPLP-FPILFLSPTSTKTIDYAKSMI 275

Query:   299 LSMNERIRNQFA-NSNPFKFKHISPLNSIDDFSDV-----GPSVVMASPGGLQSGLSRQ- 351
               M + I   F  N N  +F++I   N+I DFS +     GP V++A+   L+ G S++ 
Sbjct:   276 EWMGDNIVRDFGINENLLEFRNI---NTITDFSQISHIGPGPKVILATALTLECGFSQRI 332

Query:   352 LFDIWCSDKKNACVI 366
             L D+  S+  N  ++
Sbjct:   333 LLDLM-SENSNDLIL 346

 Score = 56 (24.8 bits), Expect = 2.6e-22, Sum P(2) = 2.6e-22
 Identities = 14/65 (21%), Positives = 27/65 (41%)

Query:   393 TAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEMGRLKTKLMTELADCNT 452
             T  ++ QV +I      D     T + ++ P  ++L+H  + E   +K K    L+    
Sbjct:   565 TIRVSCQVQFIDIEGLHDGRSLKTIIPQVNPRRLVLIHASTEEKEDMK-KTCASLSAFTK 623

Query:   453 KIITP 457
              +  P
Sbjct:   624 DVYIP 628


>UNIPROTKB|E9PIL7
            symbol:CPSF3L "Integrator complex subunit 11" species:9606
            "Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 GO:GO:0016787 EMBL:AL139287
            HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI00977321
            ProteinModelPortal:E9PIL7 SMR:E9PIL7 Ensembl:ENST00000534345
            ArrayExpress:E9PIL7 Bgee:E9PIL7 Uniprot:E9PIL7
        Length = 140

 Score = 258 (95.9 bits), Expect = 1.8e-21, P = 1.8e-21
 Identities = 53/138 (38%), Positives = 79/138 (57%)

Query:    23 QLIITPL-GAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA-----I 76
             ++ +TPL GAG +VGRSC+ +S  GK ++ DCG+H  ++     P F  I  +      +
Sbjct:     3 EIRVTPLVGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFL 62

Query:    77 DVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVS-KVSVEDMLF 135
             D ++I+HFHLDH  +LPYF E   + G ++MTH T+AI  +LL DY K++     E   F
Sbjct:    63 DCVIISHFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFF 122

Query:   136 DEQDINRSMDKIEVLDFH 153
               Q I   M K+  +  H
Sbjct:   123 TSQMIKDCMKKVVAVHLH 140


>UNIPROTKB|Q81SC3
            symbol:BA_1737 "Metallo-beta-lactamase family protein"
            species:1392 "Bacillus anthracis" [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR001279 SMART:SM00849 Pfam:PF07521
            EMBL:AE016879 EMBL:AE017334 GenomeReviews:AE016879_GR
            GenomeReviews:AE017334_GR GO:GO:0016787 InterPro:IPR022712
            InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 RefSeq:NP_844172.1
            RefSeq:YP_018378.1 ProteinModelPortal:Q81SC3 IntAct:Q81SC3
            DNASU:1086535 EnsemblBacteria:EBBACT00000009201
            EnsemblBacteria:EBBACT00000014472 GeneID:1086535 GeneID:2817971
            KEGG:ban:BA_1737 KEGG:bar:GBAA_1737 PATRIC:18781074
            HOGENOM:HOG000087450 OMA:SQHERVN ProtClustDB:CLSK2516952
            BioCyc:BANT261594:GJ7F-1754-MONOMER Uniprot:Q81SC3
        Length = 419

 Score = 272 (100.8 bits), Expect = 2.6e-21, P = 2.6e-21
 Identities = 97/364 (26%), Positives = 180/364 (49%)

Query:    30 GAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFD-EIDPSAIDVLLITHFHLDH 88
             GAG E GRSC ++  K   ILFDCGI+ +Y    + P  + E+ P  ++ + ++H H DH
Sbjct:     8 GAG-EYGRSCYFVKNKETKILFDCGINRSYED--SYPKIEREVVPF-LEAVFLSHIHEDH 63

Query:    89 AASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLFDEQDINRSMDKI- 147
                LP  L K  +K +++ T  TK         +   +     ++ +++Q++ + ++ I 
Sbjct:    64 TMGLP-LLAKYGYKKKIWTTRYTKEQLPAYYEKWRNYNVTQGWNVPYNDQNV-KDLNYIY 121

Query:   148 --EVLDFHQTVEVNG-IKF-WCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDRHLRAA 203
               E+ + ++ +++   ++F W Y+ GHVLG+  F+VD++   V Y+GDYS E +  LRA 
Sbjct:   122 VDEISNPNEWIQITPTLRFQWGYS-GHVLGSVWFLVDMSHTYVFYSGDYSAESNI-LRA- 178

Query:   204 ELPQ-FSPDI--CIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRAQE 260
              LP+    DI   I+++ Y       R  R       I       G  L+P   LGRAQ+
Sbjct:   179 NLPEKLRGDIKVAIVDAAYHTDDVSQRE-RVNELCTEIERAAGNKGIALLPLPPLGRAQD 237

Query:   261 LLLILDEYWSNHPEFHNIPIYYASPLAKKC--MAVYQTYILSMNERIRNQFANSNPFKFK 318
             ++L L E    + EF   PI     +      M +Y+ +I   N +   +   S   K +
Sbjct:   238 IVLYLYE---KYKEF---PIIVDQEILDGFDEMFLYKDWI--KNNKELEELMES--LK-R 286

Query:   319 HISPLNSIDDFSDVGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAKT 378
             +I  ++  D  +     +V+ S   +Q+  ++  ++    +++N+ +  G+V +G+ A+ 
Sbjct:   287 NIIVMDD-DGGTQHSCGIVVMSDANMQTKRAQLYYEQIRHEERNSIIFTGHVAKGSFAEK 345

Query:   379 IISE 382
             ++ E
Sbjct:   346 VLKE 349


>TIGR_CMR|BA_1737
            symbol:BA_1737 "metallo-beta-lactamase family protein"
            species:198094 "Bacillus anthracis str. Ames" [GO:0003824
            "catalytic activity" evidence=ISS] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR001279 SMART:SM00849 Pfam:PF07521
            EMBL:AE016879 EMBL:AE017334 GenomeReviews:AE016879_GR
            GenomeReviews:AE017334_GR GO:GO:0016787 InterPro:IPR022712
            InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 RefSeq:NP_844172.1
            RefSeq:YP_018378.1 ProteinModelPortal:Q81SC3 IntAct:Q81SC3
            DNASU:1086535 EnsemblBacteria:EBBACT00000009201
            EnsemblBacteria:EBBACT00000014472 GeneID:1086535 GeneID:2817971
            KEGG:ban:BA_1737 KEGG:bar:GBAA_1737 PATRIC:18781074
            HOGENOM:HOG000087450 OMA:SQHERVN ProtClustDB:CLSK2516952
            BioCyc:BANT261594:GJ7F-1754-MONOMER Uniprot:Q81SC3
        Length = 419

 Score = 272 (100.8 bits), Expect = 2.6e-21, P = 2.6e-21
 Identities = 97/364 (26%), Positives = 180/364 (49%)

Query:    30 GAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFD-EIDPSAIDVLLITHFHLDH 88
             GAG E GRSC ++  K   ILFDCGI+ +Y    + P  + E+ P  ++ + ++H H DH
Sbjct:     8 GAG-EYGRSCYFVKNKETKILFDCGINRSYED--SYPKIEREVVPF-LEAVFLSHIHEDH 63

Query:    89 AASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLFDEQDINRSMDKI- 147
                LP  L K  +K +++ T  TK         +   +     ++ +++Q++ + ++ I 
Sbjct:    64 TMGLP-LLAKYGYKKKIWTTRYTKEQLPAYYEKWRNYNVTQGWNVPYNDQNV-KDLNYIY 121

Query:   148 --EVLDFHQTVEVNG-IKF-WCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDRHLRAA 203
               E+ + ++ +++   ++F W Y+ GHVLG+  F+VD++   V Y+GDYS E +  LRA 
Sbjct:   122 VDEISNPNEWIQITPTLRFQWGYS-GHVLGSVWFLVDMSHTYVFYSGDYSAESNI-LRA- 178

Query:   204 ELPQ-FSPDI--CIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRAQE 260
              LP+    DI   I+++ Y       R  R       I       G  L+P   LGRAQ+
Sbjct:   179 NLPEKLRGDIKVAIVDAAYHTDDVSQRE-RVNELCTEIERAAGNKGIALLPLPPLGRAQD 237

Query:   261 LLLILDEYWSNHPEFHNIPIYYASPLAKKC--MAVYQTYILSMNERIRNQFANSNPFKFK 318
             ++L L E    + EF   PI     +      M +Y+ +I   N +   +   S   K +
Sbjct:   238 IVLYLYE---KYKEF---PIIVDQEILDGFDEMFLYKDWI--KNNKELEELMES--LK-R 286

Query:   319 HISPLNSIDDFSDVGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAKT 378
             +I  ++  D  +     +V+ S   +Q+  ++  ++    +++N+ +  G+V +G+ A+ 
Sbjct:   287 NIIVMDD-DGGTQHSCGIVVMSDANMQTKRAQLYYEQIRHEERNSIIFTGHVAKGSFAEK 345

Query:   379 IISE 382
             ++ E
Sbjct:   346 VLKE 349


>UNIPROTKB|Q74C32
            symbol:GSU1843 "RNA exonuclease, beta-lactamase fold
            protein" species:243231 "Geobacter sulfurreducens PCA" [GO:0008150
            "biological_process" evidence=ND] InterPro:IPR001279 Pfam:PF00753
            SMART:SM00849 Pfam:PF07521 EMBL:AE017180 GenomeReviews:AE017180_GR
            GO:GO:0004527 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
            SMART:SM01027 HOGENOM:HOG000244774 KO:K07576 RefSeq:NP_952893.1
            ProteinModelPortal:Q74C32 GeneID:2688625 KEGG:gsu:GSU1843
            PATRIC:22026545 OMA:CHIDHVG ProtClustDB:CLSK2516562
            BioCyc:GSUL243231:GH27-1786-MONOMER Uniprot:Q74C32
        Length = 475

 Score = 154 (59.3 bits), Expect = 7.9e-19, Sum P(4) = 7.9e-19
 Identities = 54/186 (29%), Positives = 86/186 (46%)

Query:   168 AGHVLGAAMFMVDIA------------GVR----VLYTGDYSREEDRHLRAAELPQFSPD 211
             AGH+LG+A   V ++            G R    V+++GD        L   + P+ + D
Sbjct:   149 AGHILGSAYVEVSVSPASQAEQTGTVNGTRGDTVVVFSGDLGAPFTPLLPDPKPPERA-D 207

Query:   212 ICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRAQELLLILDEYWSN 271
             I ++ESTYG + H+ R  R +R   VI   +   G +L+PAF++GR QELL  +++  S 
Sbjct:   208 ILVLESTYGDRQHEGREQRRERLCRVIVRALENRGALLVPAFSIGRTQELLYEIEDLISR 267

Query:   272 HPE--------FHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFA-NSNPFKFKHISP 322
             H          + ++ I   SPLA     VY       +E      A N +P  F+ ++ 
Sbjct:   268 HRTEEAAAGLPWDDLEIIVDSPLALSVTRVYDRLRRLWDEEALETVAQNRHPLSFEQMTV 327

Query:   323 LNSIDD 328
             + S  D
Sbjct:   328 IESHAD 333

 Score = 125 (49.1 bits), Expect = 7.9e-19, Sum P(4) = 7.9e-19
 Identities = 38/141 (26%), Positives = 65/141 (46%)

Query:    49 ILFDCGIHPAYSGMAA--LPYFDEIDPSAIDVLLITHFHLDHAASLPYFLEKTTFKGRVF 106
             IL DCG+     G      P+ D      +  L++TH H+DH   +P+ L    F+G ++
Sbjct:    27 ILIDCGLLQGNDGAGGKRFPFID-FPLDRVKGLVLTHVHIDHCGRIPHLLG-AGFQGPIW 84

Query:   107 MTHATKAIYKLLLTDYVKVSKVSVEDMLFDEQDINRSMDKIEVLDF---HQTVEVNG--I 161
              + A+  +  L+L D VKV     E ++   + +N    ++  L +   HQ    +G   
Sbjct:    85 CSEASALLLPLVLEDAVKVGITRDEHLI--ARFLNAVKKRLVPLPYDRWHQLGSWDGRSA 142

Query:   162 KFWCYTAGHVLGAAMFMVDIA 182
                   AGH+LG+A   V ++
Sbjct:   143 SLRLQQAGHILGSAYVEVSVS 163

 Score = 111 (44.1 bits), Expect = 8.0e-12, Sum P(3) = 8.0e-12
 Identities = 48/176 (27%), Positives = 73/176 (41%)

Query:   277 NIPIYYASPLAKKCMAVYQTYILSMNERIRNQFA-NSNPFKFKHISPLNSIDDF------ 329
             ++ I   SPLA     VY       +E      A N +P  F+ ++ + S  D       
Sbjct:   281 DLEIIVDSPLALSVTRVYDRLRRLWDEEALETVAQNRHPLSFEQMTVIESHADHRATVEY 340

Query:   330 --SDVGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAKTIISEPKEV- 386
                   P +V+A+ G    G           D +   +  GY   GT  + I+   K+  
Sbjct:   341 LRKTARPCIVIAAGGMCAGGRIVNYLKALMPDPRTDILFVGYQAAGTPGREILEAAKQKW 400

Query:   387 ------TL-MNGLTAPLNMQVHYIS-FSAHADYAQTSTFLKEL-MPPNII-LVHGE 432
                   ++ ++G T PL   VH IS +SAHAD      F++ + +PP  I LVHGE
Sbjct:   401 ETGGRPSIDLDGGTYPLRAAVHTISGYSAHADQRDLVEFVEGITVPPKTIRLVHGE 456

 Score = 59 (25.8 bits), Expect = 7.9e-19, Sum P(4) = 7.9e-19
 Identities = 16/61 (26%), Positives = 29/61 (47%)

Query:   395 PLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEMGRLKTKLMTELADCNTKI 454
             PL+ +   +   +HAD+  T  +L++   P I++  G     GR+   L   + D  T I
Sbjct:   319 PLSFEQMTV-IESHADHRATVEYLRKTARPCIVIAAGGMCAGGRIVNYLKALMPDPRTDI 377

Query:   455 I 455
             +
Sbjct:   378 L 378

 Score = 43 (20.2 bits), Expect = 7.9e-19, Sum P(4) = 7.9e-19
 Identities = 12/30 (40%), Positives = 17/30 (56%)

Query:   474 KTIGRLAEKTPEVGETVSGILVKKGFTYQI 503
             KTI RL     E    ++G+L +KG  YQ+
Sbjct:   448 KTI-RLVHGEEEARTALAGVLAEKG--YQV 474


>TIGR_CMR|GSU_1843
            symbol:GSU_1843 "metallo-beta-lactamase family protein"
            species:243231 "Geobacter sulfurreducens PCA" [GO:0003824
            "catalytic activity" evidence=ISS] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
            Pfam:PF07521 EMBL:AE017180 GenomeReviews:AE017180_GR GO:GO:0004527
            InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
            HOGENOM:HOG000244774 KO:K07576 RefSeq:NP_952893.1
            ProteinModelPortal:Q74C32 GeneID:2688625 KEGG:gsu:GSU1843
            PATRIC:22026545 OMA:CHIDHVG ProtClustDB:CLSK2516562
            BioCyc:GSUL243231:GH27-1786-MONOMER Uniprot:Q74C32
        Length = 475

 Score = 154 (59.3 bits), Expect = 7.9e-19, Sum P(4) = 7.9e-19
 Identities = 54/186 (29%), Positives = 86/186 (46%)

Query:   168 AGHVLGAAMFMVDIA------------GVR----VLYTGDYSREEDRHLRAAELPQFSPD 211
             AGH+LG+A   V ++            G R    V+++GD        L   + P+ + D
Sbjct:   149 AGHILGSAYVEVSVSPASQAEQTGTVNGTRGDTVVVFSGDLGAPFTPLLPDPKPPERA-D 207

Query:   212 ICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRAQELLLILDEYWSN 271
             I ++ESTYG + H+ R  R +R   VI   +   G +L+PAF++GR QELL  +++  S 
Sbjct:   208 ILVLESTYGDRQHEGREQRRERLCRVIVRALENRGALLVPAFSIGRTQELLYEIEDLISR 267

Query:   272 HPE--------FHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQFA-NSNPFKFKHISP 322
             H          + ++ I   SPLA     VY       +E      A N +P  F+ ++ 
Sbjct:   268 HRTEEAAAGLPWDDLEIIVDSPLALSVTRVYDRLRRLWDEEALETVAQNRHPLSFEQMTV 327

Query:   323 LNSIDD 328
             + S  D
Sbjct:   328 IESHAD 333

 Score = 125 (49.1 bits), Expect = 7.9e-19, Sum P(4) = 7.9e-19
 Identities = 38/141 (26%), Positives = 65/141 (46%)

Query:    49 ILFDCGIHPAYSGMAA--LPYFDEIDPSAIDVLLITHFHLDHAASLPYFLEKTTFKGRVF 106
             IL DCG+     G      P+ D      +  L++TH H+DH   +P+ L    F+G ++
Sbjct:    27 ILIDCGLLQGNDGAGGKRFPFID-FPLDRVKGLVLTHVHIDHCGRIPHLLG-AGFQGPIW 84

Query:   107 MTHATKAIYKLLLTDYVKVSKVSVEDMLFDEQDINRSMDKIEVLDF---HQTVEVNG--I 161
              + A+  +  L+L D VKV     E ++   + +N    ++  L +   HQ    +G   
Sbjct:    85 CSEASALLLPLVLEDAVKVGITRDEHLI--ARFLNAVKKRLVPLPYDRWHQLGSWDGRSA 142

Query:   162 KFWCYTAGHVLGAAMFMVDIA 182
                   AGH+LG+A   V ++
Sbjct:   143 SLRLQQAGHILGSAYVEVSVS 163

 Score = 111 (44.1 bits), Expect = 8.0e-12, Sum P(3) = 8.0e-12
 Identities = 48/176 (27%), Positives = 73/176 (41%)

Query:   277 NIPIYYASPLAKKCMAVYQTYILSMNERIRNQFA-NSNPFKFKHISPLNSIDDF------ 329
             ++ I   SPLA     VY       +E      A N +P  F+ ++ + S  D       
Sbjct:   281 DLEIIVDSPLALSVTRVYDRLRRLWDEEALETVAQNRHPLSFEQMTVIESHADHRATVEY 340

Query:   330 --SDVGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAKTIISEPKEV- 386
                   P +V+A+ G    G           D +   +  GY   GT  + I+   K+  
Sbjct:   341 LRKTARPCIVIAAGGMCAGGRIVNYLKALMPDPRTDILFVGYQAAGTPGREILEAAKQKW 400

Query:   387 ------TL-MNGLTAPLNMQVHYIS-FSAHADYAQTSTFLKEL-MPPNII-LVHGE 432
                   ++ ++G T PL   VH IS +SAHAD      F++ + +PP  I LVHGE
Sbjct:   401 ETGGRPSIDLDGGTYPLRAAVHTISGYSAHADQRDLVEFVEGITVPPKTIRLVHGE 456

 Score = 59 (25.8 bits), Expect = 7.9e-19, Sum P(4) = 7.9e-19
 Identities = 16/61 (26%), Positives = 29/61 (47%)

Query:   395 PLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEMGRLKTKLMTELADCNTKI 454
             PL+ +   +   +HAD+  T  +L++   P I++  G     GR+   L   + D  T I
Sbjct:   319 PLSFEQMTV-IESHADHRATVEYLRKTARPCIVIAAGGMCAGGRIVNYLKALMPDPRTDI 377

Query:   455 I 455
             +
Sbjct:   378 L 378

 Score = 43 (20.2 bits), Expect = 7.9e-19, Sum P(4) = 7.9e-19
 Identities = 12/30 (40%), Positives = 17/30 (56%)

Query:   474 KTIGRLAEKTPEVGETVSGILVKKGFTYQI 503
             KTI RL     E    ++G+L +KG  YQ+
Sbjct:   448 KTI-RLVHGEEEARTALAGVLAEKG--YQV 474


>UNIPROTKB|E9PQF0
            symbol:CPSF3L "Integrator complex subunit 11" species:9606
            "Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
            InterPro:IPR001279 Pfam:PF00753 GO:GO:0016787 EMBL:AL139287
            HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI00982774
            ProteinModelPortal:E9PQF0 SMR:E9PQF0 Ensembl:ENST00000498476
            ArrayExpress:E9PQF0 Bgee:E9PQF0 Uniprot:E9PQF0
        Length = 167

 Score = 229 (85.7 bits), Expect = 2.6e-18, P = 2.6e-18
 Identities = 42/98 (42%), Positives = 61/98 (62%)

Query:    30 GAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEIDPSA-----IDVLLITHF 84
             GAG +VGRSC+ +S  GK ++ DCG+H  ++     P F  I  +      +D ++I+HF
Sbjct:    70 GAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHF 129

Query:    85 HLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDY 122
             HLDH  +LPYF E   + G ++MTH T+AI  +LL DY
Sbjct:   130 HLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDY 167


>DICTYBASE|DDB_G0282473
            symbol:ints9 "integrator complex subunit 9"
            species:44689 "Dictyostelium discoideum" [GO:0032039 "integrator
            complex" evidence=IEA] [GO:0016180 "snRNA processing" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0044351 "macropinocytosis"
            evidence=RCA] InterPro:IPR027074 dictyBase:DDB_G0282473
            GO:GO:0005634 EMBL:AAFI02000047 GenomeReviews:CM000152_GR
            eggNOG:COG1236 InterPro:IPR022712 Pfam:PF10996 SMART:SM01027
            KO:K13146 PANTHER:PTHR11203:SF2 RefSeq:XP_640069.1
            ProteinModelPortal:Q54SH0 EnsemblProtists:DDB0234099 GeneID:8623598
            KEGG:ddi:DDB_G0282473 OMA:DDFSTID ProtClustDB:CLSZ2729002
            Uniprot:Q54SH0
        Length = 712

 Score = 190 (71.9 bits), Expect = 2.6e-18, Sum P(3) = 2.6e-18
 Identities = 82/369 (22%), Positives = 176/369 (47%)

Query:   134 LFDEQDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGV-RVLYTGDY 192
             L+ + DI +S +KI+ + F+++++  G +    ++G+ LG+A ++++  G  RV+Y  D 
Sbjct:   217 LYKKIDIEKSFEKIQSIRFNESIKHYGFECIPSSSGYGLGSANWVIESKGFERVVYISDS 276

Query:   193 SREEDRHLRAAEL-PQFSPDICIIESTYGVQLHQPRNIREKRFTDV---IHSTISQGGRV 248
             S    R+    +L P  +PD+ I+        H P N  ++  +++   I ST+ QGG V
Sbjct:   277 SLSLSRYPTPFQLSPIDNPDVLILSKIN----HYPNNPPDQMLSELCSNIGSTLQQGGTV 332

Query:   249 LIPAFALGRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQ 308
             LIP+++ G   +L   L +Y  N      +PIY+ S ++K  ++    Y   +N+  + +
Sbjct:   333 LIPSYSCGIILDLFEHLADYL-NKVGLPYVPIYFVSSVSKAVLSYADIYSEWLNKSKQER 391

Query:   309 -FANSNPFKFKHISPLNSIDDFSDV-------GPSVVMASPGGLQSGLSRQLFDIWCSDK 360
              F    PF  + +        +  V        P ++       + G    L  ++ + K
Sbjct:   392 AFMPETPFLHQDLMRKGQFQAYQHVHSNFQANDPCIIFTGHPSCRIGDITTLIKLYDNPK 451

Query:   361 KNACVI-PGYVVEGTLAKTIISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLK 419
              +  +I P +        T++   K+++ +  L  P++ ++++    A+   ++ S   K
Sbjct:   452 NSILLIEPDF----DFKSTVLPFSKQISRIQFL--PIDPRINFNE--ANLLISKLSP--K 501

Query:   420 ELMPPNIILVHGES-HEMGRLKTKLMTELADCNTKIITPKNCQSVEMYFNSEKMAKTIG- 477
              L+ P I   + ++ H  G     ++T +   +T  I  +N Q+ E  F  +++A+TI  
Sbjct:   502 HLIIPRIYKNYVKNKHSNGNFG--IVTTILPLDT--IKIQNNQNFESGFIDKELAQTIQT 557

Query:   478 RLAEKTPEV 486
             ++ +K+ ++
Sbjct:   558 KVLDKSSQL 566

 Score = 111 (44.1 bits), Expect = 2.6e-18, Sum P(3) = 2.6e-18
 Identities = 30/94 (31%), Positives = 51/94 (54%)

Query:    66 PYFDEIDP-SAIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVK 124
             P F+ ID  S ID++LI+++   +A  LP+  E T F+G+++ T  T  I KLLL + V+
Sbjct:   106 PQFEMIDDFSTIDMILISNYTNIYA--LPFITEYTNFQGKIYATEPTVQIGKLLLEELVQ 163

Query:   125 VSKVSVEDMLFDEQDINRSMDKIEVLDFHQTVEV 158
             + K       +    IN + +   + D  Q +E+
Sbjct:   164 MDKQ------YSNSSINNNNNNNNLSDCWQNIEI 191

 Score = 43 (20.2 bits), Expect = 2.6e-18, Sum P(3) = 2.6e-18
 Identities = 7/17 (41%), Positives = 9/17 (52%)

Query:    39 CVYMSYKGKTILFDCGI 55
             C  + YK   IL DC +
Sbjct:    14 CFLLEYKNVKILLDCAL 30


>UNIPROTKB|Q0C1L6
            symbol:HNE_1669 "Putative uncharacterized protein"
            species:228405 "Hyphomonas neptunium ATCC 15444" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR001279 SMART:SM00849 GO:GO:0016787 EMBL:CP000158
            GenomeReviews:CP000158_GR eggNOG:COG1236 RefSeq:YP_760377.1
            ProteinModelPortal:Q0C1L6 STRING:Q0C1L6 GeneID:4288204
            KEGG:hne:HNE_1669 PATRIC:32216161 HOGENOM:HOG000035995 OMA:STFGLPI
            ProtClustDB:CLSK2517173 BioCyc:HNEP228405:GI69-1701-MONOMER
            InterPro:IPR026360 TIGRFAMs:TIGR04122 Uniprot:Q0C1L6
        Length = 333

 Score = 173 (66.0 bits), Expect = 4.3e-13, Sum P(3) = 4.3e-13
 Identities = 51/158 (32%), Positives = 82/158 (51%)

Query:   152 FHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDRHLRAAELPQFSP- 210
             + +TVEV  ++   Y AGHVLG+A  +++ AG RV+ TGD+ R  D        P F P 
Sbjct:    74 YGETVEVGDVRVTLYPAGHVLGSAQVLLERAGERVIVTGDFKRAAD-----PTCPPFVPI 128

Query:   211 --DICIIESTYGVQL--HQPRNIREKRFTDVIHSTISQGGR-VLIPAFALGRAQELLLIL 265
               D+ I E+T+G+ +  H P +        V+        R VL+ A+ALG+AQ ++  L
Sbjct:   129 ACDVLITEATFGLPVFRHPPAS---DEIAKVMERLAESPERCVLVGAYALGKAQRVICHL 185

Query:   266 DEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNE 303
              E        ++ PIY    + K C A+Y+ + +++ E
Sbjct:   186 REAG------YDKPIYLHGAMEKLC-ALYEAHGVALGE 216

 Score = 53 (23.7 bits), Expect = 4.3e-13, Sum P(3) = 4.3e-13
 Identities = 11/35 (31%), Positives = 20/35 (57%)

Query:   406 SAHADYAQTSTFLKELMPPNIILVHGESHEMGRLK 440
             S HAD+ + +  ++E+ P  + + HG   E G L+
Sbjct:   278 SDHADWEELTRTIREVAPSEVWVTHGS--EAGLLR 310

 Score = 49 (22.3 bits), Expect = 4.3e-13, Sum P(3) = 4.3e-13
 Identities = 13/36 (36%), Positives = 17/36 (47%)

Query:    55 IHPAYSGMAALPYFDEIDPSAIDVL-LITHFHLDHA 89
             I P   G+        +DPS    L ++TH H DHA
Sbjct:     8 IKPGAGGIEVAGGAAFVDPSLPKPLAIVTHGHADHA 43


>UNIPROTKB|H0YBH8
            symbol:INTS9 "Integrator complex subunit 9" species:9606
            "Homo sapiens" [GO:0016180 "snRNA processing" evidence=IEA]
            [GO:0032039 "integrator complex" evidence=IEA] InterPro:IPR027074
            PANTHER:PTHR11203:SF2 EMBL:AC040975 EMBL:AC131969 HGNC:HGNC:25592
            ChiTaRS:INTS9 Ensembl:ENST00000524081 Uniprot:H0YBH8
        Length = 223

 Score = 154 (59.3 bits), Expect = 1.0e-10, Sum P(2) = 1.0e-10
 Identities = 39/130 (30%), Positives = 71/130 (54%)

Query:    65 LPYFDEIDPSAIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVK 124
             LP  + ID S +DV+LI+++H   A  LPY  E T F G V+ T  T  I +LL +    
Sbjct:    77 LPETELIDLSTVDVILISNYHCMMA--LPYITEHTGFTGTVYATEPTVQIGRLLPSPLKD 134

Query:   125 VSKVSVEDMLFDEQDINRSMDKIEVLDFHQTVEVNG-IKFWCYTAGHVLGAAMFMVDIAG 183
               +VS     +  Q++N ++ KI+++ + Q +E+ G ++    ++G+ LG++ +++    
Sbjct:   135 AVEVSTWRRCYTMQEVNSALSKIQLVGYSQKIELFGAVQVTPLSSGYALGSSNWIIQSHY 194

Query:   184 VRVLYTGDYS 193
              +V Y    S
Sbjct:   195 EKVSYVSGSS 204

 Score = 48 (22.0 bits), Expect = 1.0e-10, Sum P(2) = 1.0e-10
 Identities = 7/17 (41%), Positives = 11/17 (64%)

Query:    39 CVYMSYKGKTILFDCGI 55
             C  + +K  TI+ DCG+
Sbjct:     6 CNVLKFKSTTIMLDCGL 22


>UNIPROTKB|F6XI08
            symbol:INTS9 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0032039 "integrator complex" evidence=IEA]
            [GO:0016180 "snRNA processing" evidence=IEA] InterPro:IPR027074
            InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 CTD:55756 KO:K13146
            PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445 OMA:PLAMKCV
            Ensembl:ENSCAFT00000013124 EMBL:AAEX03014336 RefSeq:XP_543216.2
            GeneID:486090 KEGG:cfa:486090 Uniprot:F6XI08
        Length = 658

 Score = 173 (66.0 bits), Expect = 7.3e-10, Sum P(2) = 7.3e-10
 Identities = 91/424 (21%), Positives = 177/424 (41%)

Query:    95 FLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLFDEQDINRSMDKIEVLDFHQ 154
             F+E+   K +       K I +LL +      +VS     +  Q++N ++ KI+++ + Q
Sbjct:   143 FIERVP-KAQSASLWKNKDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQ 201

Query:   155 TVEVNG-IKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYS--REEDRHLRAAELPQFSPD 211
              +E+ G ++    ++G+ LG++ +++     +V Y    S      + +  A L   + D
Sbjct:   202 KIELFGAVQVTPLSSGYALGSSNWIIQSHYEKVSYVSGSSLLTTHPQPMDQASLK--NSD 259

Query:   212 ICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRAQELLLILDEYWSN 271
             + I+     +    P  +  + F   +  T+  GG VL+P +  G   +LL  L +Y  +
Sbjct:   260 VLILTGLTQIPTANPDGMVGE-FCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYIDS 318

Query:   272 HPEFHNIPIYYASPLAKKCMAVYQTYI--LSMNERIRNQFANSNPF---------KFKHI 320
                  NIP Y+ SP+A   +   Q +   L  N++ +  +    PF         K KH 
Sbjct:   319 -AGLSNIPFYFISPVANSSLEFSQIFAEWLCHNKQTK-VYLPEPPFPHAELIQTNKLKHY 376

Query:   321 SPLNSIDDFS-DVG-PSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAKT 378
               L+   DFS D   P VV      L+ G      ++W           G   + +L   
Sbjct:   377 PSLHG--DFSSDFRQPCVVFTGHPSLRFGDVVHFMELW-----------G---KSSLNTV 420

Query:   379 IISEPKEVTLMNGLTA--PLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEM 436
             I +EP + + +  L    PL M+  Y       ++ Q S  LKE+ P +++    +  + 
Sbjct:   421 IFTEP-DFSYLEALAPYQPLAMKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCPE-QYTQP 478

Query:   437 GRLKTKLMTELADCNTKIITPKNCQSVEMYFNSEKMAKTIGRLAEKTPEVGETVSGILVK 496
                ++  M  + DC    ++ +  + + + F   +  K      E  PE+ + +  + +K
Sbjct:   479 PPAQSHRMDLMIDCQPPAMSYRRAEVLALPFK-RRYEKI-----EIMPELADALVPMEIK 532

Query:   497 KGFT 500
              G +
Sbjct:   533 PGIS 536

 Score = 117 (46.2 bits), Expect = 0.00086, Sum P(2) = 0.00086
 Identities = 39/122 (31%), Positives = 61/122 (50%)

Query:    65 LPYFDEIDPSAIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVK 124
             LP  + ID S +DV+LI+++H   A  LPY  E T F G V+ T  T  I +LL+ + V 
Sbjct:    85 LPETELIDLSTVDVILISNYHCMMA--LPYITEHTGFTGTVYATEPTVQIGRLLMEELVN 142

Query:   125 -VSKV--SVEDMLFDEQDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDI 181
              + +V  +    L+  +DI R +           VEV+  +  CYT   V  +A+  + +
Sbjct:   143 FIERVPKAQSASLWKNKDIQRLLPS----PLKDAVEVSTWRR-CYTMQEV-NSALSKIQL 196

Query:   182 AG 183
              G
Sbjct:   197 VG 198

 Score = 48 (22.0 bits), Expect = 7.3e-10, Sum P(2) = 7.3e-10
 Identities = 7/17 (41%), Positives = 11/17 (64%)

Query:    39 CVYMSYKGKTILFDCGI 55
             C  + +K  TI+ DCG+
Sbjct:    14 CNVLKFKSTTIMLDCGL 30


>CGD|CAL0004705
            symbol:orf19.325 species:5476 "Candida albicans" [GO:0042493
            "response to drug" evidence=IMP] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA]
            [GO:0003723 "RNA binding" evidence=IEA] [GO:0006379 "mRNA cleavage"
            evidence=IEA] [GO:0006378 "mRNA polyadenylation" evidence=IEA]
            InterPro:IPR027075 CGD:CAL0004705 GO:GO:0042493 EMBL:AACQ01000027
            EMBL:AACQ01000026 eggNOG:COG1236 KO:K14402 InterPro:IPR022712
            InterPro:IPR025069 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
            SMART:SM01027 RefSeq:XP_720020.1 RefSeq:XP_720152.1
            ProteinModelPortal:Q5AEE3 STRING:Q5AEE3 GeneID:3638181
            GeneID:3638320 KEGG:cal:CaO19.325 KEGG:cal:CaO19.7957
            Uniprot:Q5AEE3
        Length = 931

 Score = 187 (70.9 bits), Expect = 7.8e-10, Sum P(2) = 7.8e-10
 Identities = 57/252 (22%), Positives = 118/252 (46%)

Query:   130 VEDMLFDEQDINRSMDKIEVLDFHQTVEV--NGIKFWCYTAGHVLGAAMFMVDIAGVRVL 187
             V+  + +  +++   DK+ +L + Q++ +  N +    Y AGH LG   +++     RV+
Sbjct:   112 VDSAILELDEVDNWFDKVNLLKYQQSLNLFDNKVVVTPYNAGHSLGGTFWLITKRIDRVI 171

Query:   188 YTGDYSREEDRHLRAAEL--PQF-SPDICIIESTYGVQLHQPRNI-----REKRFTDVIH 239
             Y   ++  +D  L +A    P   +P + ++  T  +      ++     R ++F  ++ 
Sbjct:   172 YAPAWNHSKDSFLNSASFISPSTGNPHLSLLRPTAFITATDMGSVMSHRKRTEKFLQLVD 231

Query:   240 STISQGGRVLIPAFALGRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYIL 299
             +T++ GG  ++P    GR  EL  ++DE+    P    IP+Y+ S    K +  Y + +L
Sbjct:   232 ATLANGGAAVLPTSLSGRFLELFHLIDEHLKGAP----IPVYFLSYSGTKILT-YASNLL 286

Query:   300 S-MNERIRNQFA--NSNPFKFKHISPLNSIDDFSDV-GPSVVMASPGGLQSG-LSRQLFD 354
               M++    ++   +S PF    +  L    +   + GP +V  S   L+SG +S + F 
Sbjct:   287 DWMSKSFTKEWEELSSVPFNPSKVDLLLDPSELLKLSGPKIVFCSGIDLRSGDISAEAFQ 346

Query:   355 IWCSDKKNACVI 366
               C+D+    ++
Sbjct:   347 YLCNDEHTTIIL 358

 Score = 37 (18.1 bits), Expect = 7.8e-10, Sum P(2) = 7.8e-10
 Identities = 7/28 (25%), Positives = 13/28 (46%)

Query:   402 YISFSAHADYAQTSTFLKELMPPNIILV 429
             ++  S   D       ++ L P N+IL+
Sbjct:   649 FVDLSGQVDLRSLGIIVQALKPYNLILL 676


>UNIPROTKB|Q5AEE3
            symbol:CFT2 "Putative uncharacterized protein CFT2"
            species:237561 "Candida albicans SC5314" [GO:0042493 "response to
            drug" evidence=IMP] InterPro:IPR027075 CGD:CAL0004705 GO:GO:0042493
            EMBL:AACQ01000027 EMBL:AACQ01000026 eggNOG:COG1236 KO:K14402
            InterPro:IPR022712 InterPro:IPR025069 PANTHER:PTHR11203:SF5
            Pfam:PF10996 Pfam:PF13299 SMART:SM01027 RefSeq:XP_720020.1
            RefSeq:XP_720152.1 ProteinModelPortal:Q5AEE3 STRING:Q5AEE3
            GeneID:3638181 GeneID:3638320 KEGG:cal:CaO19.325
            KEGG:cal:CaO19.7957 Uniprot:Q5AEE3
        Length = 931

 Score = 187 (70.9 bits), Expect = 7.8e-10, Sum P(2) = 7.8e-10
 Identities = 57/252 (22%), Positives = 118/252 (46%)

Query:   130 VEDMLFDEQDINRSMDKIEVLDFHQTVEV--NGIKFWCYTAGHVLGAAMFMVDIAGVRVL 187
             V+  + +  +++   DK+ +L + Q++ +  N +    Y AGH LG   +++     RV+
Sbjct:   112 VDSAILELDEVDNWFDKVNLLKYQQSLNLFDNKVVVTPYNAGHSLGGTFWLITKRIDRVI 171

Query:   188 YTGDYSREEDRHLRAAEL--PQF-SPDICIIESTYGVQLHQPRNI-----REKRFTDVIH 239
             Y   ++  +D  L +A    P   +P + ++  T  +      ++     R ++F  ++ 
Sbjct:   172 YAPAWNHSKDSFLNSASFISPSTGNPHLSLLRPTAFITATDMGSVMSHRKRTEKFLQLVD 231

Query:   240 STISQGGRVLIPAFALGRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYIL 299
             +T++ GG  ++P    GR  EL  ++DE+    P    IP+Y+ S    K +  Y + +L
Sbjct:   232 ATLANGGAAVLPTSLSGRFLELFHLIDEHLKGAP----IPVYFLSYSGTKILT-YASNLL 286

Query:   300 S-MNERIRNQFA--NSNPFKFKHISPLNSIDDFSDV-GPSVVMASPGGLQSG-LSRQLFD 354
               M++    ++   +S PF    +  L    +   + GP +V  S   L+SG +S + F 
Sbjct:   287 DWMSKSFTKEWEELSSVPFNPSKVDLLLDPSELLKLSGPKIVFCSGIDLRSGDISAEAFQ 346

Query:   355 IWCSDKKNACVI 366
               C+D+    ++
Sbjct:   347 YLCNDEHTTIIL 358

 Score = 37 (18.1 bits), Expect = 7.8e-10, Sum P(2) = 7.8e-10
 Identities = 7/28 (25%), Positives = 13/28 (46%)

Query:   402 YISFSAHADYAQTSTFLKELMPPNIILV 429
             ++  S   D       ++ L P N+IL+
Sbjct:   649 FVDLSGQVDLRSLGIIVQALKPYNLILL 676


>RGD|1311539
            symbol:Ints9 "integrator complex subunit 9" species:10116
            "Rattus norvegicus" [GO:0016180 "snRNA processing"
            evidence=IEA;ISO] [GO:0032039 "integrator complex"
            evidence=IEA;ISO] InterPro:IPR027074 RGD:1311539 InterPro:IPR022712
            Pfam:PF10996 SMART:SM01027 GO:GO:0032039 GO:GO:0016180
            PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445 IPI:IPI00362364
            Ensembl:ENSRNOT00000018071 Uniprot:F1M365
        Length = 659

 Score = 170 (64.9 bits), Expect = 1.6e-09, Sum P(2) = 1.6e-09
 Identities = 85/415 (20%), Positives = 175/415 (42%)

Query:    95 FLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLFDEQDINRSMDKIEVLDFHQ 154
             F+E+   K +       K I +LL +      +VS     +  Q++N ++ KI+++ + Q
Sbjct:   144 FIERVP-KAQSASLWKNKEIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQ 202

Query:   155 TVEVNG-IKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYS--REEDRHLRAAELPQFSPD 211
              +E+ G ++    ++G+ LG++ +++     +V Y    S      + +  A L   + D
Sbjct:   203 KIELFGAVQVTPLSSGYALGSSNWIIQSHYEKVSYVSGSSLLTTHPQPMDQASLK--NSD 260

Query:   212 ICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRAQELLLILDEYWSN 271
             + I+     +    P  +  + F   +  T+  GG VL+P +  G   +LL  L +Y  +
Sbjct:   261 VLILTGLTQIPTANPDGMVGE-FCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYIDS 319

Query:   272 HPEFHNIPIYYASPLAKKCMAVYQTYI--LSMNERIRNQFANSNPFKFKHISPLNSIDDF 329
                  NIP Y+ SP+A   +   Q +   L  N++ +  +    PF    +   N +  +
Sbjct:   320 -AGLSNIPFYFISPVANSSLEFSQIFAEWLCHNKQSK-VYLPEPPFPHAELIQTNKLKHY 377

Query:   330 SDV-GP-SVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAKTIISEPKEVT 387
               + G  S     P  L +G     F     D  +   + G   + +L   I +EP + +
Sbjct:   378 RSIHGDFSHDFRQPCVLFTGHPSLRF----GDVVHFMELWG---KSSLNTVIFTEP-DFS 429

Query:   388 LMNGLTA--PLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEMGRLKTKLMT 445
              +  L    PL M+  Y       ++ Q S  LKE+ P +++    +  +    ++  M 
Sbjct:   430 YLEALAPYQPLAMKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCPE-QYTQPPPAQSHRMD 488

Query:   446 ELADCNTKIITPKNCQSVEMYFNSEKMAKTIGRLAEKTPEVGETVSGILVKKGFT 500
              + DC    ++ +  + + + F   +  K      E  PE+ +++  + +K G +
Sbjct:   489 LMIDCQPPAMSYRRAEVLALPFK-RRYEKI-----EIMPELADSLVPMEIKPGIS 537

 Score = 48 (22.0 bits), Expect = 1.6e-09, Sum P(2) = 1.6e-09
 Identities = 7/17 (41%), Positives = 11/17 (64%)

Query:    39 CVYMSYKGKTILFDCGI 55
             C  + +K  TI+ DCG+
Sbjct:    15 CNVLKFKSTTIMLDCGL 31


>UNIPROTKB|F1MMA6
            symbol:INTS9 "Integrator complex subunit 9" species:9913
            "Bos taurus" [GO:0032039 "integrator complex" evidence=IEA]
            [GO:0016180 "snRNA processing" evidence=IEA] InterPro:IPR027074
            InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 GO:GO:0032039
            GO:GO:0016180 IPI:IPI00701634 UniGene:Bt.91042
            PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445 OMA:PLAMKCV
            EMBL:DAAA02021965 EMBL:DAAA02021964 Ensembl:ENSBTAT00000049079
            ArrayExpress:F1MMA6 Uniprot:F1MMA6
        Length = 658

 Score = 169 (64.5 bits), Expect = 2.0e-09, Sum P(2) = 2.0e-09
 Identities = 88/424 (20%), Positives = 179/424 (42%)

Query:    95 FLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLFDEQDINRSMDKIEVLDFHQ 154
             F+E+   K +       K I +LL +      +VS     +  Q++N ++ KI+++ + Q
Sbjct:   143 FIERVP-KAQSASLWKNKDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQ 201

Query:   155 TVEVNG-IKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYS--REEDRHLRAAELPQFSPD 211
              +E+ G ++    ++G+ LG++ +++     +V Y    S      + +  A L   + D
Sbjct:   202 KIELFGAVQVTPLSSGYALGSSNWIIQSHYEKVSYVSGSSLLTTHPQPMDQASLK--NSD 259

Query:   212 ICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRAQELLLILDEYWSN 271
             + I+     +    P ++  + F   +  T+  GG VL+P +  G   +LL  L +Y  +
Sbjct:   260 VLILTGLTQIPTANPDSMVGE-FCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYIDS 318

Query:   272 HPEFHNIPIYYASPLAKKCMAVYQTYI--LSMNERIRNQFANSNPF---------KFKHI 320
                  +IP Y+ SP+A   +   Q +   L  N++ +  +    PF         K KH 
Sbjct:   319 -AGLSSIPFYFISPVANSSLEFSQIFAEWLCHNKQTK-VYLPEPPFPHAELIQTNKLKHY 376

Query:   321 SPLNSIDDFSDV--GPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAKT 378
               ++   DFS+    P VV      L+ G      ++W           G   + +L   
Sbjct:   377 PSIHG--DFSNDFRQPCVVFTGHPSLRFGDVVHFMELW-----------G---KSSLNTV 420

Query:   379 IISEPKEVTLMNGLTA--PLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEM 436
             I +EP + + +  L    PL M+  Y       ++ Q S  LKE+ P +++    +  + 
Sbjct:   421 IFTEP-DFSYLEALAPYQPLAMKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCPE-QYTQP 478

Query:   437 GRLKTKLMTELADCNTKIITPKNCQSVEMYFNSEKMAKTIGRLAEKTPEVGETVSGILVK 496
                ++  M  + DC    ++ +  + + + F   +  K      E  PE+ +++  + +K
Sbjct:   479 PPAQSHRMDLMVDCQPPAMSYRRAEVLALPFK-RRYEKI-----EIMPELADSLVPMEIK 532

Query:   497 KGFT 500
              G +
Sbjct:   533 PGIS 536

 Score = 117 (46.2 bits), Expect = 0.00086, Sum P(2) = 0.00086
 Identities = 39/122 (31%), Positives = 61/122 (50%)

Query:    65 LPYFDEIDPSAIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVK 124
             LP  + ID S +DV+LI+++H   A  LPY  E T F G V+ T  T  I +LL+ + V 
Sbjct:    85 LPETELIDLSTVDVILISNYHCMMA--LPYITEHTGFTGTVYATEPTVQIGRLLMEELVN 142

Query:   125 -VSKV--SVEDMLFDEQDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDI 181
              + +V  +    L+  +DI R +           VEV+  +  CYT   V  +A+  + +
Sbjct:   143 FIERVPKAQSASLWKNKDIQRLLPS----PLKDAVEVSTWRR-CYTMQEV-NSALSKIQL 196

Query:   182 AG 183
              G
Sbjct:   197 VG 198

 Score = 48 (22.0 bits), Expect = 2.0e-09, Sum P(2) = 2.0e-09
 Identities = 7/17 (41%), Positives = 11/17 (64%)

Query:    39 CVYMSYKGKTILFDCGI 55
             C  + +K  TI+ DCG+
Sbjct:    14 CNVLKFKSTTIMLDCGL 30


>UNIPROTKB|Q2KJA6
            symbol:INTS9 "Integrator complex subunit 9" species:9913
            "Bos taurus" [GO:0032039 "integrator complex" evidence=IEA]
            [GO:0016180 "snRNA processing" evidence=IEA] InterPro:IPR027074
            GO:GO:0005634 eggNOG:COG1236 InterPro:IPR022712 Pfam:PF10996
            SMART:SM01027 EMBL:BC105437 IPI:IPI00701634 RefSeq:NP_001039828.1
            UniGene:Bt.91042 ProteinModelPortal:Q2KJA6 STRING:Q2KJA6
            GeneID:533964 KEGG:bta:533964 CTD:55756 HOGENOM:HOG000045359
            HOVERGEN:HBG081802 InParanoid:Q2KJA6 KO:K13146 OrthoDB:EOG415GCW
            NextBio:20876211 PANTHER:PTHR11203:SF2 Uniprot:Q2KJA6
        Length = 658

 Score = 169 (64.5 bits), Expect = 2.0e-09, Sum P(2) = 2.0e-09
 Identities = 88/424 (20%), Positives = 179/424 (42%)

Query:    95 FLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLFDEQDINRSMDKIEVLDFHQ 154
             F+E+   K +       K I +LL +      +VS     +  Q++N ++ KI+++ + Q
Sbjct:   143 FIERVP-KAQSASLWKNKDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQ 201

Query:   155 TVEVNG-IKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYS--REEDRHLRAAELPQFSPD 211
              +E+ G ++    ++G+ LG++ +++     +V Y    S      + +  A L   + D
Sbjct:   202 KIELFGAVQVTPLSSGYALGSSNWIIQSHYEKVSYVSGSSLLTTHPQPMDQASLK--NSD 259

Query:   212 ICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRAQELLLILDEYWSN 271
             + I+     +    P ++  + F   +  T+  GG VL+P +  G   +LL  L +Y  +
Sbjct:   260 VLILTGLTQIPTANPDSMVGE-FCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYIDS 318

Query:   272 HPEFHNIPIYYASPLAKKCMAVYQTYI--LSMNERIRNQFANSNPF---------KFKHI 320
                  +IP Y+ SP+A   +   Q +   L  N++ +  +    PF         K KH 
Sbjct:   319 -AGLSSIPFYFISPVANSSLEFSQIFAEWLCHNKQTK-VYLPEPPFPHAELIQTNKLKHY 376

Query:   321 SPLNSIDDFSDV--GPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAKT 378
               ++   DFS+    P VV      L+ G      ++W           G   + +L   
Sbjct:   377 PSIHG--DFSNDFRQPCVVFTGHPSLRFGDVVHFMELW-----------G---KSSLNTV 420

Query:   379 IISEPKEVTLMNGLTA--PLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEM 436
             I +EP + + +  L    PL M+  Y       ++ Q S  LKE+ P +++    +  + 
Sbjct:   421 IFTEP-DFSYLEALAPYQPLAMKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCPE-QYTQP 478

Query:   437 GRLKTKLMTELADCNTKIITPKNCQSVEMYFNSEKMAKTIGRLAEKTPEVGETVSGILVK 496
                ++  M  + DC    ++ +  + + + F   +  K      E  PE+ +++  + +K
Sbjct:   479 TPAQSHRMDLMVDCQPPAMSYRRAEVLALPFK-RRYEKI-----EIMPELADSLVPMEIK 532

Query:   497 KGFT 500
              G +
Sbjct:   533 PGIS 536

 Score = 117 (46.2 bits), Expect = 0.00086, Sum P(2) = 0.00086
 Identities = 39/122 (31%), Positives = 61/122 (50%)

Query:    65 LPYFDEIDPSAIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVK 124
             LP  + ID S +DV+LI+++H   A  LPY  E T F G V+ T  T  I +LL+ + V 
Sbjct:    85 LPETELIDLSTVDVILISNYHCMMA--LPYITEHTGFTGTVYATEPTVQIGRLLMEELVN 142

Query:   125 -VSKV--SVEDMLFDEQDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDI 181
              + +V  +    L+  +DI R +           VEV+  +  CYT   V  +A+  + +
Sbjct:   143 FIERVPKAQSASLWKNKDIQRLLPS----PLKDAVEVSTWRR-CYTMQEV-NSALSKIQL 196

Query:   182 AG 183
              G
Sbjct:   197 VG 198

 Score = 48 (22.0 bits), Expect = 2.0e-09, Sum P(2) = 2.0e-09
 Identities = 7/17 (41%), Positives = 11/17 (64%)

Query:    39 CVYMSYKGKTILFDCGI 55
             C  + +K  TI+ DCG+
Sbjct:    14 CNVLKFKSTTIMLDCGL 30


>MGI|MGI:1098533
            symbol:Ints9 "integrator complex subunit 9" species:10090
            "Mus musculus" [GO:0003674 "molecular_function" evidence=ND]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0016180 "snRNA processing"
            evidence=ISO] [GO:0032039 "integrator complex" evidence=ISO]
            InterPro:IPR027074 MGI:MGI:1098533 eggNOG:COG1236
            InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 GO:GO:0032039
            GO:GO:0016180 CTD:55756 HOGENOM:HOG000045359 HOVERGEN:HBG081802
            KO:K13146 OrthoDB:EOG415GCW PANTHER:PTHR11203:SF2
            GeneTree:ENSGT00390000001445 EMBL:AK038979 EMBL:AK077634
            EMBL:AK136992 EMBL:AK150195 EMBL:BC028953 EMBL:BC055700
            IPI:IPI00223422 IPI:IPI00406798 RefSeq:NP_001240660.1
            RefSeq:NP_700463.2 UniGene:Mm.71332 ProteinModelPortal:Q8K114
            SMR:Q8K114 STRING:Q8K114 PhosphoSite:Q8K114 PaxDb:Q8K114
            PRIDE:Q8K114 Ensembl:ENSMUST00000043914 GeneID:210925
            KEGG:mmu:210925 UCSC:uc007uiv.1 UCSC:uc007uiw.1 InParanoid:Q8K114
            NextBio:373083 Bgee:Q8K114 CleanEx:MM_INTS9 Genevestigator:Q8K114
            Uniprot:Q8K114
        Length = 658

 Score = 167 (63.8 bits), Expect = 3.3e-09, Sum P(2) = 3.3e-09
 Identities = 85/415 (20%), Positives = 174/415 (41%)

Query:    95 FLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLFDEQDINRSMDKIEVLDFHQ 154
             F+E+   K +       K I +LL +      +VS     +  Q++N ++ KI+++ + Q
Sbjct:   143 FIERVP-KAQSASLWKNKDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQ 201

Query:   155 TVEVNG-IKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYS--REEDRHLRAAELPQFSPD 211
              +E+ G ++    ++G+ LG++ +++     +V Y    S      + +  A L   + D
Sbjct:   202 KIELFGAVQVTPLSSGYALGSSNWIIQSHYEKVSYVSGSSLLTTHPQPMDQASLK--NSD 259

Query:   212 ICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRAQELLLILDEYWSN 271
             + I+     +    P  +  + F   +  T+  GG VL+P +  G   +LL  L +Y  +
Sbjct:   260 VLILTGLTQIPTANPDGMVGE-FCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYIDS 318

Query:   272 HPEFHNIPIYYASPLAKKCMAVYQTYI--LSMNERIRNQFANSNPFKFKHISPLNSIDDF 329
                  NIP Y+ SP+A   +   Q +   L  N++ +  +    PF    +   N +  +
Sbjct:   319 -AGLSNIPFYFISPVANSSLEFSQIFAEWLCHNKQSK-VYLPEPPFPHAELIQTNKLKHY 376

Query:   330 SDV-GP-SVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAKTIISEPKEVT 387
               + G  S     P  L +G     F     D  +   + G   + +L   I +EP + +
Sbjct:   377 RSIHGDFSNDFRQPCVLFTGHPSLRF----GDVVHFMELWG---KSSLNTIIFTEP-DFS 428

Query:   388 LMNGLTA--PLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEMGRLKTKLMT 445
              +  L    PL M+  Y       ++ Q S  LKE+ P +++    +  +    +   M 
Sbjct:   429 YLEALAPYQPLAMKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCPE-QYTQPPPAQAHRMD 487

Query:   446 ELADCNTKIITPKNCQSVEMYFNSEKMAKTIGRLAEKTPEVGETVSGILVKKGFT 500
              + DC    ++ +  + + + F   +  K      E  PE+ +++  + +K G +
Sbjct:   488 LMIDCQPPAMSYRRAEVLALPFK-RRYEKI-----EIMPELADSLVPMEIKPGIS 536

 Score = 118 (46.6 bits), Expect = 0.00068, Sum P(2) = 0.00067
 Identities = 39/122 (31%), Positives = 61/122 (50%)

Query:    65 LPYFDEIDPSAIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVK 124
             LP  + ID S +DV+LI+++H   A  LPY  E T F G V+ T  T  I +LL+ + V 
Sbjct:    85 LPETELIDLSTVDVILISNYHCMMA--LPYITEHTGFTGTVYATEPTMQIGRLLMEELVN 142

Query:   125 -VSKV--SVEDMLFDEQDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDI 181
              + +V  +    L+  +DI R +           VEV+  +  CYT   V  +A+  + +
Sbjct:   143 FIERVPKAQSASLWKNKDIQRLLPS----PLKDAVEVSTWRR-CYTMQEV-NSALSKIQL 196

Query:   182 AG 183
              G
Sbjct:   197 VG 198

 Score = 48 (22.0 bits), Expect = 3.3e-09, Sum P(2) = 3.3e-09
 Identities = 7/17 (41%), Positives = 11/17 (64%)

Query:    39 CVYMSYKGKTILFDCGI 55
             C  + +K  TI+ DCG+
Sbjct:    14 CNVLKFKSTTIMLDCGL 30


>UNIPROTKB|Q9NV88
            symbol:INTS9 "Integrator complex subunit 9" species:9606
            "Homo sapiens" [GO:0005515 "protein binding" evidence=IPI]
            [GO:0016180 "snRNA processing" evidence=IDA] [GO:0032039
            "integrator complex" evidence=IDA] InterPro:IPR027074
            eggNOG:COG1236 InterPro:IPR022712 Pfam:PF10996 SMART:SM01027
            GO:GO:0032039 EMBL:U96629 GO:GO:0016180 CTD:55756
            HOGENOM:HOG000045359 HOVERGEN:HBG081802 KO:K13146 OrthoDB:EOG415GCW
            PANTHER:PTHR11203:SF2 OMA:PLAMKCV EMBL:AK001733 EMBL:AK298468
            EMBL:AK300593 EMBL:AC040975 EMBL:AC131969 EMBL:BC025267
            EMBL:BK005726 EMBL:BK005674 IPI:IPI00290514 IPI:IPI00871167
            RefSeq:NP_001138631.1 RefSeq:NP_001166033.1 RefSeq:NP_060720.2
            UniGene:Hs.162397 ProteinModelPortal:Q9NV88 SMR:Q9NV88
            IntAct:Q9NV88 STRING:Q9NV88 PhosphoSite:Q9NV88 DMDM:119371246
            PaxDb:Q9NV88 PRIDE:Q9NV88 DNASU:55756 Ensembl:ENST00000416984
            Ensembl:ENST00000521022 Ensembl:ENST00000521777 GeneID:55756
            KEGG:hsa:55756 UCSC:uc003xha.3 GeneCards:GC08M028625
            HGNC:HGNC:25592 MIM:611352 neXtProt:NX_Q9NV88 PharmGKB:PA162392192
            InParanoid:Q9NV88 PhylomeDB:Q9NV88 ChiTaRS:INTS9 GenomeRNAi:55756
            NextBio:60763 ArrayExpress:Q9NV88 Bgee:Q9NV88 CleanEx:HS_INTS9
            Genevestigator:Q9NV88 GermOnline:ENSG00000104299 Uniprot:Q9NV88
        Length = 658

 Score = 166 (63.5 bits), Expect = 4.2e-09, Sum P(2) = 4.2e-09
 Identities = 86/424 (20%), Positives = 179/424 (42%)

Query:    95 FLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLFDEQDINRSMDKIEVLDFHQ 154
             F+E+   K +       K I +LL +      +VS     +  Q++N ++ KI+++ + Q
Sbjct:   143 FIERVP-KAQSASLWKNKDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQ 201

Query:   155 TVEVNG-IKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYS--REEDRHLRAAELPQFSPD 211
              +E+ G ++    ++G+ LG++ +++     +V Y    S      + +  A L   + D
Sbjct:   202 KIELFGAVQVTPLSSGYALGSSNWIIQSHYEKVSYVSGSSLLTTHPQPMDQASLK--NSD 259

Query:   212 ICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRAQELLLILDEYWSN 271
             + ++     +    P  +  + F   +  T+  GG VL+P +  G   +LL  L +Y  +
Sbjct:   260 VLVLTGLTQIPTANPDGMVGE-FCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYIDS 318

Query:   272 HPEFHNIPIYYASPLAKKCMAVYQTYI--LSMNERIRNQFANSNPF---------KFKHI 320
                  ++P+Y+ SP+A   +   Q +   L  N++ +  +    PF         K KH 
Sbjct:   319 -AGLSSVPLYFISPVANSSLEFSQIFAEWLCHNKQSK-VYLPEPPFPHAELIQTNKLKHY 376

Query:   321 SPLNSIDDFSDV--GPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAKT 378
               ++   DFS+    P VV      L+ G      ++W           G   + +L   
Sbjct:   377 PSIHG--DFSNDFRQPCVVFTGHPSLRFGDVVHFMELW-----------G---KSSLNTV 420

Query:   379 IISEPKEVTLMNGLTA--PLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEM 436
             I +EP + + +  L    PL M+  Y       ++ Q S  LKE+ P +++    +  + 
Sbjct:   421 IFTEP-DFSYLEALAPYQPLAMKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCPE-QYTQP 478

Query:   437 GRLKTKLMTELADCNTKIITPKNCQSVEMYFNSEKMAKTIGRLAEKTPEVGETVSGILVK 496
                ++  M  + DC    ++ +  + + + F   +  K      E  PE+ +++  + +K
Sbjct:   479 PPAQSHRMDLMIDCQPPAMSYRRAEVLALPFK-RRYEKI-----EIMPELADSLVPMEIK 532

Query:   497 KGFT 500
              G +
Sbjct:   533 PGIS 536

 Score = 117 (46.2 bits), Expect = 0.00086, Sum P(2) = 0.00086
 Identities = 39/122 (31%), Positives = 61/122 (50%)

Query:    65 LPYFDEIDPSAIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVK 124
             LP  + ID S +DV+LI+++H   A  LPY  E T F G V+ T  T  I +LL+ + V 
Sbjct:    85 LPETELIDLSTVDVILISNYHCMMA--LPYITEHTGFTGTVYATEPTVQIGRLLMEELVN 142

Query:   125 -VSKV--SVEDMLFDEQDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDI 181
              + +V  +    L+  +DI R +           VEV+  +  CYT   V  +A+  + +
Sbjct:   143 FIERVPKAQSASLWKNKDIQRLLPS----PLKDAVEVSTWRR-CYTMQEV-NSALSKIQL 196

Query:   182 AG 183
              G
Sbjct:   197 VG 198

 Score = 48 (22.0 bits), Expect = 4.2e-09, Sum P(2) = 4.2e-09
 Identities = 7/17 (41%), Positives = 11/17 (64%)

Query:    39 CVYMSYKGKTILFDCGI 55
             C  + +K  TI+ DCG+
Sbjct:    14 CNVLKFKSTTIMLDCGL 30


>UNIPROTKB|F1RJQ5
            symbol:INTS9 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0032039 "integrator complex" evidence=IEA] [GO:0016180
            "snRNA processing" evidence=IEA] InterPro:IPR027074
            InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 PANTHER:PTHR11203:SF2
            GeneTree:ENSGT00390000001445 OMA:PLAMKCV EMBL:CU407017
            Ensembl:ENSSSCT00000010615 Uniprot:F1RJQ5
        Length = 576

 Score = 167 (63.8 bits), Expect = 4.6e-09, P = 4.6e-09
 Identities = 88/424 (20%), Positives = 178/424 (41%)

Query:    95 FLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLFDEQDINRSMDKIEVLDFHQ 154
             F+E+   K +       K I +LL +      +VS     +  Q++N ++ KI+++ + Q
Sbjct:    61 FIERVP-KAQSASLWKNKDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQMVGYSQ 119

Query:   155 TVEVNG-IKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYS--REEDRHLRAAELPQFSPD 211
              +E+ G ++    ++G+ LG++ +++     +V Y    S      + +  A L   + D
Sbjct:   120 KIELFGAVQVTPLSSGYALGSSNWIIQSHYEKVSYVSGSSLLTTHPQPMDQASLK--NSD 177

Query:   212 ICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRAQELLLILDEYWSN 271
             + I+     +    P  +  + F   +  T+  GG VL+P +  G   +LL  L +Y  +
Sbjct:   178 VLILTGLTQIPTANPDGMVGE-FCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYIDS 236

Query:   272 HPEFHNIPIYYASPLAKKCMAVYQTYI--LSMNERIRNQFANSNPF---------KFKHI 320
                  +IP Y+ SP+A   +   Q +   L  N++ +  +    PF         K KH 
Sbjct:   237 -AGLSSIPFYFISPVANSSLEFSQIFAEWLCHNKQTK-VYLPEPPFPHAELIQTNKLKHY 294

Query:   321 SPLNSIDDFSDV--GPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAKT 378
               ++   DFS+    P VV      L+ G      ++W           G   + +L   
Sbjct:   295 PSIHG--DFSNDFRQPCVVFTGHPSLRFGDVVHFMELW-----------G---KSSLNTV 338

Query:   379 IISEPKEVTLMNGLTA--PLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEM 436
             I +EP + + +  L    PL M+  Y       ++ Q S  LKE+ P +++    +  + 
Sbjct:   339 IFTEP-DFSYLEALAPYQPLAMKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCPE-QYTQP 396

Query:   437 GRLKTKLMTELADCNTKIITPKNCQSVEMYFNSEKMAKTIGRLAEKTPEVGETVSGILVK 496
                ++  M  + DC    ++ +  + + + F   +  K      E  PE+ +++  + +K
Sbjct:   397 PPAQSHRMDLMIDCQPPAMSYRRAEVLALPFK-RRYEKI-----EIMPELADSLVPMEIK 450

Query:   497 KGFT 500
              G +
Sbjct:   451 PGIS 454


>UNIPROTKB|H7BYQ6
            symbol:INTS9 "Integrator complex subunit 9" species:9606
            "Homo sapiens" [GO:0016180 "snRNA processing" evidence=IEA]
            [GO:0032039 "integrator complex" evidence=IEA] InterPro:IPR027074
            InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 PANTHER:PTHR11203:SF2
            EMBL:AC040975 EMBL:AC131969 HGNC:HGNC:25592 ChiTaRS:INTS9
            ProteinModelPortal:H7BYQ6 Ensembl:ENST00000397363 Bgee:H7BYQ6
            Uniprot:H7BYQ6
        Length = 552

 Score = 166 (63.5 bits), Expect = 5.5e-09, P = 5.5e-09
 Identities = 86/424 (20%), Positives = 179/424 (42%)

Query:    95 FLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLFDEQDINRSMDKIEVLDFHQ 154
             F+E+   K +       K I +LL +      +VS     +  Q++N ++ KI+++ + Q
Sbjct:    37 FIERVP-KAQSASLWKNKDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQ 95

Query:   155 TVEVNG-IKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYS--REEDRHLRAAELPQFSPD 211
              +E+ G ++    ++G+ LG++ +++     +V Y    S      + +  A L   + D
Sbjct:    96 KIELFGAVQVTPLSSGYALGSSNWIIQSHYEKVSYVSGSSLLTTHPQPMDQASLK--NSD 153

Query:   212 ICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRAQELLLILDEYWSN 271
             + ++     +    P  +  + F   +  T+  GG VL+P +  G   +LL  L +Y  +
Sbjct:   154 VLVLTGLTQIPTANPDGMVGE-FCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYIDS 212

Query:   272 HPEFHNIPIYYASPLAKKCMAVYQTYI--LSMNERIRNQFANSNPF---------KFKHI 320
                  ++P+Y+ SP+A   +   Q +   L  N++ +  +    PF         K KH 
Sbjct:   213 -AGLSSVPLYFISPVANSSLEFSQIFAEWLCHNKQSK-VYLPEPPFPHAELIQTNKLKHY 270

Query:   321 SPLNSIDDFSDV--GPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAKT 378
               ++   DFS+    P VV      L+ G      ++W           G   + +L   
Sbjct:   271 PSIHG--DFSNDFRQPCVVFTGHPSLRFGDVVHFMELW-----------G---KSSLNTV 314

Query:   379 IISEPKEVTLMNGLTA--PLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEM 436
             I +EP + + +  L    PL M+  Y       ++ Q S  LKE+ P +++    +  + 
Sbjct:   315 IFTEP-DFSYLEALAPYQPLAMKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCPE-QYTQP 372

Query:   437 GRLKTKLMTELADCNTKIITPKNCQSVEMYFNSEKMAKTIGRLAEKTPEVGETVSGILVK 496
                ++  M  + DC    ++ +  + + + F   +  K      E  PE+ +++  + +K
Sbjct:   373 PPAQSHRMDLMIDCQPPAMSYRRAEVLALPFK-RRYEKI-----EIMPELADSLVPMEIK 426

Query:   497 KGFT 500
              G +
Sbjct:   427 PGIS 430


>UNIPROTKB|G3XAN1
            symbol:INTS9 "Integrator complex subunit 9" species:9606
            "Homo sapiens" [GO:0016180 "snRNA processing" evidence=IEA]
            [GO:0032039 "integrator complex" evidence=IEA] InterPro:IPR027074
            EMBL:CH471080 InterPro:IPR022712 Pfam:PF10996 SMART:SM01027
            PANTHER:PTHR11203:SF2 EMBL:AC040975 EMBL:AC131969 UniGene:Hs.162397
            HGNC:HGNC:25592 ChiTaRS:INTS9 ProteinModelPortal:G3XAN1
            Ensembl:ENST00000523303 ArrayExpress:G3XAN1 Bgee:G3XAN1
            Uniprot:G3XAN1
        Length = 525

 Score = 162 (62.1 bits), Expect = 5.8e-09, Sum P(2) = 5.8e-09
 Identities = 76/351 (21%), Positives = 150/351 (42%)

Query:    95 FLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLFDEQDINRSMDKIEVLDFHQ 154
             F+E+   K +       K I +LL +      +VS     +  Q++N ++ KI+++ + Q
Sbjct:   143 FIERVP-KAQSASLWKNKDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQ 201

Query:   155 TVEVNG-IKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYS--REEDRHLRAAELPQFSPD 211
              +E+ G ++    ++G+ LG++ +++     +V Y    S      + +  A L   + D
Sbjct:   202 KIELFGAVQVTPLSSGYALGSSNWIIQSHYEKVSYVSGSSLLTTHPQPMDQASLK--NSD 259

Query:   212 ICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRAQELLLILDEYWSN 271
             + ++     +    P  +  + F   +  T+  GG VL+P +  G   +LL  L +Y  +
Sbjct:   260 VLVLTGLTQIPTANPDGMVGE-FCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYIDS 318

Query:   272 HPEFHNIPIYYASPLAKKCMAVYQTYI--LSMNERIRNQFANSNPF---------KFKHI 320
                  ++P+Y+ SP+A   +   Q +   L  N++ +  +    PF         K KH 
Sbjct:   319 -AGLSSVPLYFISPVANSSLEFSQIFAEWLCHNKQSK-VYLPEPPFPHAELIQTNKLKHY 376

Query:   321 SPLNSIDDFSDV--GPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAKT 378
               ++   DFS+    P VV      L+ G      ++W           G   + +L   
Sbjct:   377 PSIHG--DFSNDFRQPCVVFTGHPSLRFGDVVHFMELW-----------G---KSSLNTV 420

Query:   379 IISEPKEVTLMNGLTA--PLNMQVHYISFSAHADYAQTSTFLKELMPPNII 427
             I +EP + + +  L    PL M+  Y       ++ Q S  LKE+ P +++
Sbjct:   421 IFTEP-DFSYLEALAPYQPLAMKCIYCPIDTRLNFIQVSKLLKEVQPLHVV 470

 Score = 117 (46.2 bits), Expect = 0.00048, Sum P(2) = 0.00048
 Identities = 39/122 (31%), Positives = 61/122 (50%)

Query:    65 LPYFDEIDPSAIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVK 124
             LP  + ID S +DV+LI+++H   A  LPY  E T F G V+ T  T  I +LL+ + V 
Sbjct:    85 LPETELIDLSTVDVILISNYHCMMA--LPYITEHTGFTGTVYATEPTVQIGRLLMEELVN 142

Query:   125 -VSKV--SVEDMLFDEQDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDI 181
              + +V  +    L+  +DI R +           VEV+  +  CYT   V  +A+  + +
Sbjct:   143 FIERVPKAQSASLWKNKDIQRLLPS----PLKDAVEVSTWRR-CYTMQEV-NSALSKIQL 196

Query:   182 AG 183
              G
Sbjct:   197 VG 198

 Score = 48 (22.0 bits), Expect = 5.8e-09, Sum P(2) = 5.8e-09
 Identities = 7/17 (41%), Positives = 11/17 (64%)

Query:    39 CVYMSYKGKTILFDCGI 55
             C  + +K  TI+ DCG+
Sbjct:    14 CNVLKFKSTTIMLDCGL 30


>UNIPROTKB|Q5ZKK2
            symbol:INTS9 "Integrator complex subunit 9" species:9031
            "Gallus gallus" [GO:0016180 "snRNA processing" evidence=IEA]
            [GO:0032039 "integrator complex" evidence=IEA] InterPro:IPR027074
            eggNOG:COG1236 InterPro:IPR022712 Pfam:PF10996 SMART:SM01027
            GO:GO:0032039 GO:GO:0016180 CTD:55756 HOGENOM:HOG000045359
            HOVERGEN:HBG081802 KO:K13146 OrthoDB:EOG415GCW
            PANTHER:PTHR11203:SF2 EMBL:AJ720082 IPI:IPI00651516
            RefSeq:NP_001026271.1 UniGene:Gga.21113 ProteinModelPortal:Q5ZKK2
            STRING:Q5ZKK2 Ensembl:ENSGALT00000026848 GeneID:422023
            KEGG:gga:422023 GeneTree:ENSGT00390000001445 InParanoid:Q5ZKK2
            OMA:PLAMKCV NextBio:20824712 Uniprot:Q5ZKK2
        Length = 658

 Score = 164 (62.8 bits), Expect = 7.0e-09, Sum P(2) = 7.0e-09
 Identities = 74/344 (21%), Positives = 146/344 (42%)

Query:   102 KGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLFDEQDINRSMDKIEVLDFHQTVEVNG- 160
             K +   T   K + +LL        +VS+    +   ++N ++ KI+++ + Q +E+ G 
Sbjct:   149 KAQSASTWKNKEVQRLLPAPLKDAVEVSMWRKCYTMPEVNAALSKIQLVGYSQKIELFGA 208

Query:   161 IKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYS--REEDRHLRAAELPQFSPDICIIEST 218
             ++    ++G+ LG++ +++     +V Y    S      + +  A L   + D+ I+   
Sbjct:   209 VQVTPLSSGYALGSSNWIIQSHYEKVSYVSGSSLLTTHPQPMDQASLK--NSDVLILTGL 266

Query:   219 YGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRAQELLLILDEYWSNHPEFHNI 278
               +    P  +  + F   +  T+  GG VL+P +  G   +LL  L +Y  +     N+
Sbjct:   267 TQIPTANPDGMVGE-FCSNLAMTVRNGGNVLVPCYPSGVIYDLLECLYQYIDS-AGLSNV 324

Query:   279 PIYYASPLAKKCMAVYQTYI--LSMNERIRNQFANSNPF---------KFKHISPLNSID 327
             P Y+ SP+A   +   Q +   L  N++ +  +    PF         K KH   ++   
Sbjct:   325 PFYFISPVANSSLEFSQIFAEWLCHNKQTK-VYLPEPPFPHAELIQTNKLKHYPSIHG-- 381

Query:   328 DFSD--VGPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAKTIISEPKE 385
             DFS+    P V+      L+ G      ++W           G   + +L   I +EP +
Sbjct:   382 DFSNDFKQPCVIFTGHPSLRFGDVVHFMELW-----------G---KSSLNTVIFTEP-D 426

Query:   386 VTLMNGLTA--PLNMQVHYISFSAHADYAQTSTFLKELMPPNII 427
              + ++ L    PL M+  Y       ++ Q S  LKE+ P +++
Sbjct:   427 FSYLDALAPYQPLAMKCVYCPIDTRLNFIQVSKLLKEVQPLHVV 470

 Score = 48 (22.0 bits), Expect = 7.0e-09, Sum P(2) = 7.0e-09
 Identities = 7/17 (41%), Positives = 11/17 (64%)

Query:    39 CVYMSYKGKTILFDCGI 55
             C  + +K  TI+ DCG+
Sbjct:    14 CNVLKFKSTTIMLDCGL 30


>ZFIN|ZDB-GENE-061013-129
            symbol:ints9 "integrator complex subunit 9"
            species:7955 "Danio rerio" [GO:0016180 "snRNA processing"
            evidence=IEA] [GO:0032039 "integrator complex" evidence=IEA]
            InterPro:IPR027074 ZFIN:ZDB-GENE-061013-129 eggNOG:COG1236
            InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 CTD:55756
            HOGENOM:HOG000045359 HOVERGEN:HBG081802 KO:K13146 OrthoDB:EOG415GCW
            PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445
            EMBL:CABZ01076623 EMBL:CABZ01078246 EMBL:CABZ01078247
            EMBL:CABZ01078248 EMBL:CABZ01078249 EMBL:BC124793 IPI:IPI00800641
            RefSeq:NP_001070738.1 UniGene:Dr.116109 Ensembl:ENSDART00000097865
            GeneID:768124 KEGG:dre:768124 InParanoid:Q08BB6 NextBio:20918446
            Uniprot:Q08BB6
        Length = 658

 Score = 157 (60.3 bits), Expect = 4.1e-08, Sum P(2) = 4.1e-08
 Identities = 91/394 (23%), Positives = 159/394 (40%)

Query:    59 YSGMAALPYFDEIDPSAIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHAT-----KA 113
             Y  M ALPY  E        +  T   L     L    E   F  RV   HA      K 
Sbjct:   104 YHCMMALPYITE-HTGFTGTVYATEPTLQIGRLL--MEELVAFMERVPKAHAASCWKNKE 160

Query:   114 IYKLL---LTDYVKVSKVSVEDMLFDEQDINRSMDKIEVLDFHQTVEVNG-IKFWCYTAG 169
             I +LL   L D V+V   S     +  Q++N ++ K++++ + Q VE+ G ++    ++G
Sbjct:   161 IQRLLPGPLKDAVEVWSWS---KCYSLQEVNSALSKVQLVGYSQKVELFGAVQVTPLSSG 217

Query:   170 HVLGAAMFMVDIAGVRVLYTGDYSREEDRHLRAAELPQF-SPDICIIESTYGVQLHQPRN 228
             + LG++ +++     +V Y    S     H +  E     + D+ I+     +    P  
Sbjct:   218 YSLGSSNWIIQSHYEKVSYVSGSSLLTT-HPQPMEQSSLKNSDVLILTGLTQIPTANPDG 276

Query:   229 IREKRFTDVIHSTISQGGRVLIPAFALGRAQELLLILDEYWSNHPEFHNIPIYYASPLAK 288
             +  + F   +  T+  GG VL+P ++ G   +LL  L ++  +       P Y+ SP+A 
Sbjct:   277 MLGE-FCSNLAMTVRAGGNVLVPCYSSGVIYDLLECLYQFMDS-ANLGTTPFYFISPVAN 334

Query:   289 KCMAVYQTYI--LSMNERIRNQFANSNPF---------KFKHISPLNSIDDFSDV--GPS 335
               +   Q +   L  N++ +  +    PF         K KH   ++   DFS     P 
Sbjct:   335 SSLEFSQIFAEWLCQNKQSK-VYLPEPPFPHAELIQTNKLKHYPSIHG--DFSSEFRQPC 391

Query:   336 VVMASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAKTIISEPKEVTLMNGLTA- 394
             VV      L+ G      ++W     N           T+   I +EP + + ++ L   
Sbjct:   392 VVFTGHPSLRFGDVVHFMELWGKSSLN-----------TI---IFTEP-DFSYLDALAPY 436

Query:   395 -PLNMQVHYISFSAHADYAQTSTFLKELMPPNII 427
              PL M+  Y       ++ Q S  LK++ P +++
Sbjct:   437 QPLAMKCVYCPIDTRLNFHQVSKLLKDIQPLHVV 470

 Score = 48 (22.0 bits), Expect = 4.1e-08, Sum P(2) = 4.1e-08
 Identities = 7/17 (41%), Positives = 11/17 (64%)

Query:    39 CVYMSYKGKTILFDCGI 55
             C  + +K  TI+ DCG+
Sbjct:    14 CNVLKFKSTTIMLDCGL 30


>WB|WBGene00017608
            symbol:F19F10.12 species:6239 "Caenorhabditis elegans"
            [GO:0009792 "embryo development ending in birth or egg hatching"
            evidence=IMP] InterPro:IPR027074 GO:GO:0009792 eggNOG:COG1236
            InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 KO:K13146
            PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445 EMBL:FO080914
            RefSeq:NP_504953.1 ProteinModelPortal:Q95ZM2 PaxDb:Q95ZM2
            EnsemblMetazoa:F19F10.12 GeneID:179142 KEGG:cel:CELE_F19F10.12
            UCSC:F19F10.12 CTD:179142 WormBase:F19F10.12 HOGENOM:HOG000199610
            InParanoid:Q95ZM2 OMA:EFMERIE NextBio:904092 Uniprot:Q95ZM2
        Length = 646

 Score = 128 (50.1 bits), Expect = 9.3e-08, Sum P(2) = 9.3e-08
 Identities = 60/278 (21%), Positives = 119/278 (42%)

Query:   139 DINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDR 198
             D++  + K+  L F+QT+++  IK     +GH  G+A + +     +  Y    S     
Sbjct:   178 DMHSCLAKVITLSFNQTIDLFRIKVTPVVSGHTYGSAYWTIKTENEQFAYLSA-SNPSAT 236

Query:   199 HLRAAEL-PQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGR 257
              ++  E  P  + D  ++ S   +     + +        I   + + G VL+P   +G 
Sbjct:   237 DVKLMETAPLRAVDHILVTSLSRLVDTTAKEMGYS-LIKTITDVLKKHGSVLLPICPVGP 295

Query:   258 AQELL-LILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYILSMNERIRNQ-------F 309
               E++  + D   + +    + PIY+ SP+AK  +A+       M+E  +N        +
Sbjct:   296 IFEMIEAVSDIITTTNGIPLDTPIYFISPVAKSAIAMASISAEWMSESRQNAVYLPEEPY 355

Query:   310 ANSNPFKFKHISPLNSI-DDFSDV--GPSVVMASPGGLQSGLSRQLFDIWCSDKKNACVI 366
             ++SN  K   +   +S+   FS     P V+ AS   L+ G +  + ++  SD KNA + 
Sbjct:   356 SHSNLIKSGRVKIYDSLYGSFSKEFKTPCVIFASHASLRIGDAAHMVEVLGSDPKNAVI- 414

Query:   367 PGYVVEGTLAKTIISEPKEVTLMNGLTAPLNMQVHYIS 404
                V +  L    + EP     +  +  P++ ++ + S
Sbjct:   415 ---VTDPDLPCEDVREPFRNLPIKFINIPMDFRMDFAS 449

 Score = 75 (31.5 bits), Expect = 9.3e-08, Sum P(2) = 9.3e-08
 Identities = 15/63 (23%), Positives = 36/63 (57%)

Query:    69 DEIDPSAIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVK-VSK 127
             D +    ID +L++++  +    LP++ E + F G++++T       KLL+ + ++ +S+
Sbjct:    83 DMLKMDTIDAILVSNY--ESFVGLPFYTEGSGFSGKIYVTEIAYQYGKLLMEEMLEFISR 140

Query:   128 VSV 130
             + V
Sbjct:   141 IEV 143


>UNIPROTKB|E5RG70
            symbol:INTS9 "Integrator complex subunit 9" species:9606
            "Homo sapiens" [GO:0016180 "snRNA processing" evidence=IEA]
            [GO:0032039 "integrator complex" evidence=IEA] InterPro:IPR027074
            PANTHER:PTHR11203:SF2 EMBL:AC040975 EMBL:AC131969 HGNC:HGNC:25592
            ChiTaRS:INTS9 IPI:IPI00974179 ProteinModelPortal:E5RG70 SMR:E5RG70
            Ensembl:ENST00000523436 ArrayExpress:E5RG70 Bgee:E5RG70
            Uniprot:E5RG70
        Length = 300

 Score = 140 (54.3 bits), Expect = 1.8e-07, Sum P(2) = 1.8e-07
 Identities = 57/207 (27%), Positives = 95/207 (45%)

Query:    65 LPYFDEIDPSAIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVK 124
             LP  + ID S +DV+LI+++H   A  LPY  E T F G V+ T  T  I +LL+ + V 
Sbjct:    85 LPETELIDLSTVDVILISNYHCMMA--LPYITEHTGFTGTVYATEPTVQIGRLLMEELVN 142

Query:   125 -VSKV--SVEDMLFDEQDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDI 181
              + +V  +    L+  +DI R +           VEV+  +  CYT   V  +A+  + +
Sbjct:   143 FIERVPKAQSASLWKNKDIQRLLPS----PLKDAVEVSTWRR-CYTMQEV-NSALSKIQL 196

Query:   182 AGVRVLYTGDYSREEDRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHST 241
              G        YS++    +  A L   + D+ ++     +    P  +  + F   +  T
Sbjct:   197 VG--------YSQKIP--MDQASLK--NSDVLVLTGLTQIPTANPDGMVGE-FCSNLALT 243

Query:   242 ISQGGRVLIPAFALGRAQELLLILDEY 268
             +  GG VL+P +  G   +LL  L +Y
Sbjct:   244 VRNGGNVLVPCYPSGVIYDLLECLYQY 270

 Score = 48 (22.0 bits), Expect = 1.8e-07, Sum P(2) = 1.8e-07
 Identities = 7/17 (41%), Positives = 11/17 (64%)

Query:    39 CVYMSYKGKTILFDCGI 55
             C  + +K  TI+ DCG+
Sbjct:    14 CNVLKFKSTTIMLDCGL 30


>UNIPROTKB|E5RK47
            symbol:INTS9 "Integrator complex subunit 9" species:9606
            "Homo sapiens" [GO:0016180 "snRNA processing" evidence=IEA]
            [GO:0032039 "integrator complex" evidence=IEA] InterPro:IPR027074
            PANTHER:PTHR11203:SF2 EMBL:AC040975 EMBL:AC131969 HGNC:HGNC:25592
            ChiTaRS:INTS9 IPI:IPI00976000 ProteinModelPortal:E5RK47 SMR:E5RK47
            Ensembl:ENST00000518510 ArrayExpress:E5RK47 Bgee:E5RK47
            Uniprot:E5RK47
        Length = 170

 Score = 112 (44.5 bits), Expect = 4.8e-06, Sum P(2) = 4.8e-06
 Identities = 31/82 (37%), Positives = 46/82 (56%)

Query:    65 LPYFDEIDPSAIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVK 124
             LP  + ID S +DV+LI+++H   A  LPY  E T F G V+ T  T  I +LL+ + V 
Sbjct:    85 LPETELIDLSTVDVILISNYHCMMA--LPYITEHTGFTGTVYATEPTVQIGRLLMEELVN 142

Query:   125 -VSKV--SVEDMLFDEQDINRS 143
              + +V  +    L+  +DI RS
Sbjct:   143 FIERVPKAQSASLWKNKDIQRS 164

 Score = 48 (22.0 bits), Expect = 4.8e-06, Sum P(2) = 4.8e-06
 Identities = 7/17 (41%), Positives = 11/17 (64%)

Query:    39 CVYMSYKGKTILFDCGI 55
             C  + +K  TI+ DCG+
Sbjct:    14 CNVLKFKSTTIMLDCGL 30


>FB|FBgn0036570
            symbol:IntS9 "Integrator 9" species:7227 "Drosophila
            melanogaster" [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=ISS] [GO:0006379 "mRNA
            cleavage" evidence=ISS] [GO:0006378 "mRNA polyadenylation"
            evidence=ISS] [GO:0034472 "snRNA 3'-end processing" evidence=IDA]
            [GO:0032039 "integrator complex" evidence=ISS] [GO:0016180 "snRNA
            processing" evidence=ISS] InterPro:IPR027074 EMBL:AE014296
            GO:GO:0006378 GO:GO:0005847 GO:GO:0006379 InterPro:IPR022712
            Pfam:PF10996 SMART:SM01027 CTD:55756 KO:K13146
            PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445 OMA:PLAMKCV
            GO:GO:0034472 EMBL:AY058574 RefSeq:NP_648838.3 UniGene:Dm.977
            SMR:Q95TS5 IntAct:Q95TS5 MINT:MINT-1734573
            EnsemblMetazoa:FBtr0075495 GeneID:39763 KEGG:dme:Dmel_CG5222
            UCSC:CG5222-RA FlyBase:FBgn0036570 InParanoid:Q95TS5
            OrthoDB:EOG4FJ6QV GenomeRNAi:39763 NextBio:815254 Uniprot:Q95TS5
        Length = 654

 Score = 138 (53.6 bits), Expect = 8.3e-06, P = 8.3e-06
 Identities = 100/476 (21%), Positives = 188/476 (39%)

Query:    59 YSGMAALPYFDEIDPSAIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYK-- 116
             Y  M ALPY  E +      +  T   L       +FLE+      V     T  ++K  
Sbjct:   105 YLNMLALPYITE-NTGFKGKVYATEPTLQIGR---FFLEELVDYIEVSPKACTARLWKEK 160

Query:   117 --LLLTDYVKVSKVSVEDMLFDEQDINRSMDKIEVLDFHQTVEVNGIKFWC-YTAGHVLG 173
               LL +   +  +      +F  +D+  S+ K+ ++ + + +++ G       ++G+ LG
Sbjct:   161 LHLLPSPLSEAFRAKKWRTIFSLKDVQGSLSKVTIMGYDEKLDILGAFIATPVSSGYCLG 220

Query:   174 AAMFMVDIAGVRVLYTGDYSR--EEDRHLRAAELPQFSPDICIIES-TYGVQLHQPRNIR 230
             ++ +++  A  ++ Y    S      R +  + L     D+ I+   T    ++    + 
Sbjct:   221 SSNWVLSTAHEKICYVSGSSTLTTHPRPINQSALKH--ADVLIMTGLTQAPTVNPDTKLG 278

Query:   231 EKRFTDVIHSTISQGGRVLIPAFALGRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKC 290
             E      +  TI   G  LIP +  G   +L   L +   N    +N+P+++ SP+A   
Sbjct:   279 ELCMNVAL--TIRNNGSALIPCYPSGVVYDLFECLTQNLEN-AGLNNVPMFFISPVADSS 335

Query:   291 MAVYQTYILS-MNERIRNQ-FANSNPF---------KFKHISPLNSIDDFS-DVG-PSVV 337
             +A Y   +   ++   +N+ +   +PF         K KH + + S + FS D   P VV
Sbjct:   336 LA-YSNILAEWLSSAKQNKVYLPDDPFPHAFYLRNNKLKHYNHVFS-EGFSKDFRQPCVV 393

Query:   338 MASPGGLQSGLSRQLFDIWCSDKKNACVIPGYVVEGTLAKTIISEPKEVTLMNGLTAPLN 397
                   L+ G +    ++W ++  N+ +      E       +  P +         PL 
Sbjct:   394 FCGHPSLRFGDAVHFIEMWGNNPNNSIIF----TEPDFPYLQVLAPFQ---------PLA 440

Query:   398 MQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHEMGRLKTKLMTELADCNTKIITP 457
             M+  Y       +Y Q +  +KEL P N++++     +       L  E  D   KIIT 
Sbjct:   441 MKAFYCPIDTSLNYQQANKLIKELKP-NVLVIPEAYTKPHPSAPNLFIEQPD--KKIITF 497

Query:   458 KNCQSVEMYFNSEKMAKTI--GRLAEK-TPEVGETVSGILVKKGFTYQIMAPDDLH 510
             K C  +       K+ +      LA+K +P+  E  +G+      T  +   D +H
Sbjct:   498 K-CGEIIRLPLKRKLDRIYITSELAQKISPK--EVAAGVTFST-LTGVLQVKDKVH 549


>UNIPROTKB|Q87XP2
            symbol:PSPTO_4134 "Uncharacterized protein" species:223283
            "Pseudomonas syringae pv. tomato str. DC3000" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            EMBL:AE016853 GenomeReviews:AE016853_GR eggNOG:COG1236
            HOGENOM:HOG000035995 OMA:STFGLPI InterPro:IPR026360
            TIGRFAMs:TIGR04122 RefSeq:NP_793895.1 ProteinModelPortal:Q87XP2
            GeneID:1185814 KEGG:pst:PSPTO_4134 PATRIC:19999765 KO:K07577
            ProtClustDB:CLSK2517054 BioCyc:PSYR223283:GJIX-4198-MONOMER
            Uniprot:Q87XP2
        Length = 348

 Score = 129 (50.5 bits), Expect = 1.2e-05, Sum P(2) = 1.2e-05
 Identities = 39/134 (29%), Positives = 70/134 (52%)

Query:   138 QDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREED 197
             QDIN     ++ L++ +T+  +G+K   + AGHVLG+A   ++  G   + +GDY  E D
Sbjct:    62 QDIN-----LQTLEYGETITHHGVKLSLHPAGHVLGSAQVRLEYEGEVWVASGDYKVEPD 116

Query:   198 RHLRAAELPQFSPDIC---IIESTYGVQLHQ--PRNIREKRFTDVIHSTISQGGRVLIPA 252
                 A     F P  C   I EST+G+ +++  P++   +   +      +QG   ++ A
Sbjct:   117 GTCAA-----FEPVRCQTFITESTFGLPIYRWAPQSQIFEGINEWWRGNAAQGKASVLFA 171

Query:   253 FALGRAQELLLILD 266
             ++ G+AQ +L  +D
Sbjct:   172 YSFGKAQRILHGID 185

 Score = 45 (20.9 bits), Expect = 1.2e-05, Sum P(2) = 1.2e-05
 Identities = 10/20 (50%), Positives = 13/20 (65%)

Query:    71 IDP-SAIDVLLITHFHLDHA 89
             IDP   ++  +ITH H DHA
Sbjct:    20 IDPWRPVERAVITHAHGDHA 39


>TIGR_CMR|NSE_0829
            symbol:NSE_0829 "metallo-beta-lactamase family, beta-CASP
            subfamily" species:222891 "Neorickettsia sennetsu str. Miyayama"
            [GO:0003824 "catalytic activity" evidence=ISS] [GO:0016070 "RNA
            metabolic process" evidence=ISS] InterPro:IPR001279
            InterPro:IPR004613 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
            GO:GO:0016787 EMBL:CP000237 GenomeReviews:CP000237_GR
            InterPro:IPR011108 eggNOG:COG0595 PANTHER:PTHR11203:SF22
            HOGENOM:HOG000280200 RefSeq:YP_506699.1 ProteinModelPortal:Q2GCU7
            STRING:Q2GCU7 GeneID:3931644 KEGG:nse:NSE_0829 PATRIC:22681653
            KO:K07021 OMA:MRDDDKL ProtClustDB:CLSK2528128
            BioCyc:NSEN222891:GHFU-835-MONOMER Uniprot:Q2GCU7
        Length = 542

 Score = 124 (48.7 bits), Expect = 5.7e-05, Sum P(2) = 5.7e-05
 Identities = 72/262 (27%), Positives = 112/262 (42%)

Query:    22 DQLIITPLGAGNEVGRSCVYMSYKGKTILFDCGIHPAYSGMAALPYFDEI--DPSAID-- 77
             + LI  PLG   E+G +     YKG+ I+ DCG   A    A LP  D +  D S I+  
Sbjct:     4 NNLIFLPLGGTGEIGMNVTLYGYKGRWIMIDCG---AGFADAELPGIDIVVADISFIEER 60

Query:    78 -----VLLITHFHLDHAASLPYFLEKTTFKGRVFMTHAT-KAIYKLLLTDYVKVSKVSVE 131
                   ++ITH H DH  +LPY  ++      V+ T  T   + K +  D V+     VE
Sbjct:    61 KDDLLAIIITHIHEDHCGALPYLWDRLAVP--VYTTQFTANFLLKKIGRDKVQFPIHVVE 118

Query:   132 D-MLFDEQDINRSMDKIEVLDFHQTVEVNGIKFWCYTAGHV-LGAAMFMVDIAGVRVLYT 189
                L    D   +++ I +   H   E+N I    +TA  V + +  + +D   V V  T
Sbjct:   119 PGKLLHLGDF--TLEFINMT--HSVPEMNAIAI--HTADKVVIHSGDWKIDDDPV-VGKT 171

Query:   190 GDYSREEDRHLRAAELPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQG-GRV 248
              D+ R E       EL +      + +ST  V  H  R+  E    + +   IS   G  
Sbjct:   172 CDFKRLE-------ELSEKGVLAMVCDST-NVFSHG-RSDSESSLREPLMEVISSASGAC 222

Query:   249 LIPAFA--LGRAQELLLILDEY 268
             ++  F+  + R + +  I +EY
Sbjct:   223 VVTLFSSNIARIETVTRIANEY 244

 Score = 50 (22.7 bits), Expect = 5.7e-05, Sum P(2) = 5.7e-05
 Identities = 16/54 (29%), Positives = 25/54 (46%)

Query:   399 QVHYISFSAHADYAQTSTFLKELMPPNIIL-VHGES---HEMGRLKTKLMTELA 448
             + H +  S H  Y      L +L+ P I++ VHGE+   HE  +   +   E A
Sbjct:   355 RTHSVHVSGHP-YRDELKQLYQLLKPKILIPVHGENLHLHEHAKFALECGVESA 407


>TIGR_CMR|CHY_1157
            symbol:CHY_1157 "metallo-beta-lactamase family protein"
            species:246194 "Carboxydothermus hydrogenoformans Z-2901"
            [GO:0003824 "catalytic activity" evidence=ISS] [GO:0008152
            "metabolic process" evidence=ISS] InterPro:IPR001279
            InterPro:IPR004613 Pfam:PF00753 PIRSF:PIRSF004803 SMART:SM00849
            Pfam:PF07521 GO:GO:0046872 EMBL:CP000141 GenomeReviews:CP000141_GR
            GO:GO:0003723 GO:GO:0016788 InterPro:IPR011108 eggNOG:COG0595
            HOGENOM:HOG000280201 KO:K12574 PANTHER:PTHR11203:SF22
            TIGRFAMs:TIGR00649 RefSeq:YP_360002.1 ProteinModelPortal:Q3ACY2
            STRING:Q3ACY2 GeneID:3726430 KEGG:chy:CHY_1157 PATRIC:21275454
            OMA:FLVDSTN BioCyc:CHYD246194:GJCN-1156-MONOMER Uniprot:Q3ACY2
        Length = 554

 Score = 114 (45.2 bits), Expect = 0.00014, Sum P(2) = 0.00014
 Identities = 62/249 (24%), Positives = 106/249 (42%)

Query:    16 PVSREGDQLIITPLGAGNEVGRSCVYMSYKGKTILFDCGI-HPAYS--GM-AALP---YF 68
             P+ ++G +L I PLG   E+G++ + + Y    I+ D G+  P     G+   +P   Y 
Sbjct:     2 PI-KDG-RLQIIPLGGLGEIGKNMMVIKYNDAIIVIDAGLMFPEEELLGIDMVIPDMSYL 59

Query:    69 DEIDPSAIDVLLITHFHLDHAASLPYFLEKTTFKGRVFMTHATKAIYKLLLTDYVKVSKV 128
              E +   +  +L+TH H DH   +PYFL++  F   V+ T  T  +    L +   + + 
Sbjct:    60 IE-NKEKVKAVLLTHGHEDHIGGMPYFLKQ--FDVPVYGTRLTLGLLSAKLKE-AGIPRA 115

Query:   129 SVEDMLFDEQDINRSMDKIEVLDF-HQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRVL 187
             S+ +++     +N    KIE +   H   +  GI       G V+    F +D   V   
Sbjct:   116 SL-NVVAPRDVLNIGPFKIEFIKVSHSIPDTVGIAVHT-PVGTVVHTGDFKLDPTPVDGK 173

Query:   188 YTGDYSREEDRHLRAAELPQFSPDICIIESTYGVQLHQPR-NIREKRFTDVIHSTISQG- 245
              T  Y        + AEL +    + + +ST      +P   + EK   +    T     
Sbjct:   174 VTDFY--------KLAELGEKGVLVLMSDST---NAERPGFTLSEKTVGNTFEETFRVAE 222

Query:   246 GRVLIPAFA 254
             GR++I  FA
Sbjct:   223 GRIIIATFA 231

 Score = 57 (25.1 bits), Expect = 0.00014, Sum P(2) = 0.00014
 Identities = 25/93 (26%), Positives = 37/93 (39%)

Query:   403 ISFSAHADYAQTSTFLKELMPPNIILVHGESHEMGRLKTKLMTELADCNTKIITPKNCQS 462
             I  S H    +    +  L P   + +HGE   + +   ++  EL       I P+N   
Sbjct:   362 IHVSGHPSQEELKLMINLLKPKYFVPIHGEYRHLIK-HAEIARELG------IKPQNIFV 414

Query:   463 VEMYFNSE--KMAKTIGRLAEKTPEVGETVSGI 493
             VE   N +  +  K  GRLA K P     V G+
Sbjct:   415 VE---NGQVLEFTKKSGRLAGKVPAGRVLVDGL 444


>UNIPROTKB|G4N6C6
            symbol:MGG_06570 "Uncharacterized protein" species:242507
            "Magnaporthe oryzae 70-15" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005634 "nucleus" evidence=ISS] [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=ISS] [GO:0005849 "mRNA cleavage factor complex"
            evidence=ISS] [GO:0006378 "mRNA polyadenylation" evidence=ISS]
            [GO:0006379 "mRNA cleavage" evidence=ISS] InterPro:IPR027075
            Pfam:PF07521 GO:GO:0006378 EMBL:CM001234 GO:GO:0005847
            GO:GO:0006379 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
            InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
            SMART:SM01027 RefSeq:XP_003716967.1 EnsemblFungi:MGG_06570T0
            GeneID:2684725 KEGG:mgr:MGG_06570 Uniprot:G4N6C6
        Length = 962

 Score = 107 (42.7 bits), Expect = 0.00061, Sum P(3) = 0.00061
 Identities = 32/125 (25%), Positives = 52/125 (41%)

Query:   158 VNGIKFWCYTAGHVLGAAMFMVDIAGVRVLYTGDYSREEDRHLRAAE------------- 204
             +NG+    Y AGH LG  ++ +      ++Y  D++   D     A              
Sbjct:   172 LNGLTITAYNAGHSLGGTIWHIQHGMESIVYAVDWNLARDNVYAGAAWMGGGHGGGGAEV 231

Query:   205 LPQFSPDICIIESTYGVQLHQPRNIREKRFTDVIHSTISQGGRVLIPAFALGRAQELLLI 264
             + Q      ++ ST   +    R  R+K+  D +   IS+GG VLIP  +  R  EL  +
Sbjct:   232 IEQLRKPTALVCSTRTAEGGLTRAARDKQLLDTMRMAISRGGTVLIPVDSSARVLELAYL 291

Query:   265 LDEYW 269
             L+  W
Sbjct:   292 LEHAW 296

 Score = 53 (23.7 bits), Expect = 0.00061, Sum P(3) = 0.00061
 Identities = 13/57 (22%), Positives = 28/57 (49%)

Query:   379 IISEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGESHE 435
             +++ P ++ +    T  +N+++  I FS   D    +  +  + P  +ILV G + E
Sbjct:   693 VVTGPAKL-VHTSTTVSVNLRLALIDFSGLHDRRSLAMLIPLIQPRKLILVAGSADE 748

 Score = 53 (23.7 bits), Expect = 0.00061, Sum P(3) = 0.00061
 Identities = 17/79 (21%), Positives = 36/79 (45%)

Query:   311 NSNPFKFKHISPLNS-------IDDFSD-VGPSVVMASPGGLQSGLSRQLFDIWCSDKKN 362
             +  PF FK++  L+        ++  +D +   V++A+   L+ G S+ +     +D +N
Sbjct:   365 DGGPFDFKYLRLLDRKAQVLKLLEPSTDELRGKVILATDTSLEWGFSKDIISAIANDSRN 424

Query:   363 ACVIPGYVVEGTLAKTIIS 381
               ++P    E +     IS
Sbjct:   425 MVILPEKPAESSRDNPSIS 443


>UNIPROTKB|E2QVB2
            symbol:INTS9 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0032039 "integrator complex" evidence=IEA]
            [GO:0016180 "snRNA processing" evidence=IEA] InterPro:IPR027074
            InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 GO:GO:0032039
            GO:GO:0016180 PANTHER:PTHR11203:SF2 Ensembl:ENSCAFT00000013124
            Uniprot:E2QVB2
        Length = 409

 Score = 118 (46.6 bits), Expect = 0.00063, P = 0.00063
 Identities = 63/275 (22%), Positives = 112/275 (40%)

Query:   241 TISQGGRVLIPAFALGRAQELLLILDEYWSNHPEFHNIPIYYASPLAKKCMAVYQTYI-- 298
             T+  GG VL+P +  G   +LL  L +Y  +     NIP Y+ SP+A   +   Q +   
Sbjct:    39 TVRNGGNVLVPCYPSGVIYDLLECLYQYIDS-AGLSNIPFYFISPVANSSLEFSQIFAEW 97

Query:   299 LSMNERIRNQFANSNPF---------KFKHISPLNSIDDFS-DVG-PSVVMASPGGLQSG 347
             L  N++ +  +    PF         K KH   L+   DFS D   P VV      L+ G
Sbjct:    98 LCHNKQTK-VYLPEPPFPHAELIQTNKLKHYPSLHG--DFSSDFRQPCVVFTGHPSLRFG 154

Query:   348 LSRQLFDIWCSDKKNACVIPGYVVEGTLAKTIISEPKEVTLMNGLTA--PLNMQVHYISF 405
                   ++W           G   + +L   I +EP + + +  L    PL M+  Y   
Sbjct:   155 DVVHFMELW-----------G---KSSLNTVIFTEP-DFSYLEALAPYQPLAMKCIYCPI 199

Query:   406 SAHADYAQTSTFLKELMPPNIILVHGESHEMGRLKTKLMTELADCNTKIITPKNCQSVEM 465
                 ++ Q S  LKE+ P +++    +  +    ++  M  + DC    ++ +  + + +
Sbjct:   200 DTRLNFIQVSKLLKEVQPLHVVCPE-QYTQPPPAQSHRMDLMIDCQPPAMSYRRAEVLAL 258

Query:   466 YFNSEKMAKTIGRLAEKTPEVGETVSGILVKKGFT 500
              F   +  K      E  PE+ + +  + +K G +
Sbjct:   259 PFK-RRYEKI-----EIMPELADALVPMEIKPGIS 287


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.321   0.136   0.404    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      525       525   0.00091  119 3  11 22  0.37    34
                                                     35  0.45    37
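
The bit scores printed in parentheses next to each raw score in this report can be reproduced from the parameter table above. The sketch below assumes the standard Karlin-Altschul normalization, bits = (Lambda * S - ln K) / ln 2, and assumes that the "Q=9,R=2" row (Lambda = 0.244, K = 0.0300) holds the gapped parameters actually applied; the function name `bit_score` is illustrative and is not part of any BLAST tool.

```python
import math

# Values read from the "Q=9,R=2" row of the parameter table in this report.
# Treating them as the gapped Karlin-Altschul parameters is an assumption.
LAMBDA_GAPPED = 0.244
K_GAPPED = 0.0300


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    # (raw score, bits as printed in the report) for several HSPs above.
    reported = [(124, 48.7), (45, 20.9), (114, 45.2), (57, 25.1), (118, 46.6)]
    for raw, bits in reported:
        print(f"Score = {raw}: computed {bit_score(raw):.1f} bits, reported {bits} bits")
```

Under these assumptions the computed values agree with the reported ones to one decimal place, which suggests the gapped parameters, rather than the BLOSUM62 row (Lambda = 0.321, K = 0.136), were used for normalization.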


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  96
  No. of states in DFA:  616 (65 KB)
  Total size of DFA:  299 KB (2155 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  42.14u 0.12s 42.26t   Elapsed:  00:00:02
  Total cpu time:  42.17u 0.12s 42.29t   Elapsed:  00:00:02
  Start:  Mon May 20 16:05:06 2013   End:  Mon May 20 16:05:08 2013
