Your job contains 1 sequence.
>psy8348
MKTLKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHF
HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSM
DKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAE
IPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLI
LDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLK
GIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPE
EVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALT
REYEDDPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNY
HLLAPSDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLDEKRLRAFACIE
ITLEKCIVVLEWASNPISDMYADSLISECLIEILVEMYGEAAVPKMFKGEKITITVDKKK
ACIDLVDLSVQCEDSKLKSTVQ
The BLAST search returned 2 gene products which did not match your query constraints. Please see the full BLAST report below for the details.
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= psy8348
(622 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
Smallest
Sum
High Probability
Sequences producing High-scoring Segment Pairs: Score P(N) N
FB|FBgn0261065 - symbol:Cpsf73 "Cleavage and polyadenylat... 2368 1.2e-256 2
UNIPROTKB|E2R7R2 - symbol:CPSF3 "Uncharacterized protein"... 2233 1.9e-236 2
UNIPROTKB|P79101 - symbol:CPSF3 "Cleavage and polyadenyla... 2233 2.4e-236 2
UNIPROTKB|I3LKR1 - symbol:CPSF3 "Uncharacterized protein"... 2221 3.5e-235 2
UNIPROTKB|F1NKW5 - symbol:CPSF3 "Uncharacterized protein"... 2229 1.2e-234 2
ZFIN|ZDB-GENE-030131-3275 - symbol:cpsf3 "cleavage and po... 2246 7.3e-233 1
UNIPROTKB|Q9UKF6 - symbol:CPSF3 "Cleavage and polyadenyla... 2235 1.1e-231 1
MGI|MGI:1859328 - symbol:Cpsf3 "cleavage and polyadenylat... 2229 4.6e-231 1
UNIPROTKB|G3V6W7 - symbol:Cpsf3 "Protein Cpsf3" species:1... 2229 4.6e-231 1
RGD|1305767 - symbol:Cpsf3 "cleavage and polyadenylation ... 2226 9.6e-231 1
UNIPROTKB|G5E9W3 - symbol:CPSF3 "Cleavage and polyadenyla... 2150 1.1e-222 1
WB|WBGene00013460 - symbol:cpsf-3 species:6239 "Caenorhab... 1904 1.3e-196 1
DICTYBASE|DDB_G0274799 - symbol:cpsf3 "cleavage and polya... 1678 1.1e-172 1
TAIR|locus:2206076 - symbol:CPSF73-I "cleavage and polyad... 1671 6.2e-172 1
POMBASE|SPAC17G6.16c - symbol:ysh1 "mRNA cleavage and pol... 1516 1.7e-155 1
SGD|S000004267 - symbol:YSH1 "Putative endoribonuclease" ... 1325 1.2e-144 3
CGD|CAL0005344 - symbol:orf19.5486 species:5476 "Candida ... 1281 1.7e-143 3
UNIPROTKB|Q59P50 - symbol:YSH1 "Endoribonuclease YSH1" sp... 1281 1.7e-143 3
ASPGD|ASPL0000060573 - symbol:AN0990 species:162425 "Emer... 849 6.9e-114 4
GENEDB_PFALCIPARUM|PF14_0364 - symbol:PF14_0364 "cleavage... 812 1.0e-111 3
UNIPROTKB|Q8IL83 - symbol:PF14_0364 "Cleavage and polyade... 812 1.0e-111 3
UNIPROTKB|F1NV30 - symbol:CPSF3L "Integrator complex subu... 886 9.6e-89 1
UNIPROTKB|Q5ZIH0 - symbol:CPSF3L "Integrator complex subu... 886 9.6e-89 1
MGI|MGI:1919207 - symbol:Cpsf3l "cleavage and polyadenyla... 875 1.4e-87 1
UNIPROTKB|E1B7Q9 - symbol:CPSF3L "Integrator complex subu... 874 1.8e-87 1
RGD|1306841 - symbol:Cpsf3l "cleavage and polyadenylation... 874 1.8e-87 1
UNIPROTKB|E2QY53 - symbol:CPSF3L "Uncharacterized protein... 866 1.3e-86 1
UNIPROTKB|G3V1S5 - symbol:CPSF3L "Integrator complex subu... 865 1.6e-86 1
UNIPROTKB|Q5TA45 - symbol:CPSF3L "Integrator complex subu... 865 1.6e-86 1
UNIPROTKB|Q2YDM2 - symbol:CPSF3L "Integrator complex subu... 864 2.0e-86 1
WB|WBGene00008642 - symbol:F10B5.8 species:6239 "Caenorha... 845 2.1e-86 2
UNIPROTKB|F1RJE8 - symbol:CPSF3L "Uncharacterized protein... 855 1.8e-85 1
FB|FBgn0039691 - symbol:IntS11 "Integrator 11" species:72... 843 3.4e-84 1
DICTYBASE|DDB_G0278189 - symbol:ints11 "integrator comple... 810 1.1e-80 1
ZFIN|ZDB-GENE-050522-13 - symbol:cpsf3l "cleavage and pol... 791 1.1e-78 1
TAIR|locus:2065368 - symbol:CPSF73-II "AT2G01730" species... 743 1.4e-73 1
GENEDB_PFALCIPARUM|PFC0825c - symbol:PFC0825c "cleavage a... 542 4.0e-62 3
UNIPROTKB|O77371 - symbol:PFC0825c "Cleavage and polyaden... 542 4.0e-62 3
UNIPROTKB|C9JZH6 - symbol:CPSF3 "Cleavage and polyadenyla... 609 2.2e-59 1
UNIPROTKB|C9J979 - symbol:CPSF3L "Integrator complex subu... 268 2.2e-46 2
UNIPROTKB|E9PNS4 - symbol:CPSF3L "Integrator complex subu... 477 2.1e-45 1
UNIPROTKB|E9PI75 - symbol:CPSF3L "Integrator complex subu... 411 4.0e-38 1
TIGR_CMR|CPS_2623 - symbol:CPS_2623 "metallo-beta-lactama... 410 5.2e-38 1
UNIPROTKB|E9PIG1 - symbol:CPSF3L "Integrator complex subu... 406 1.5e-37 1
UNIPROTKB|E2R496 - symbol:CPSF2 "Uncharacterized protein"... 390 1.3e-36 2
UNIPROTKB|Q9P2I0 - symbol:CPSF2 "Cleavage and polyadenyla... 390 1.3e-36 2
UNIPROTKB|F1NMN0 - symbol:CPSF2 "Uncharacterized protein"... 388 1.4e-36 2
UNIPROTKB|Q10568 - symbol:CPSF2 "Cleavage and polyadenyla... 389 1.6e-36 2
UNIPROTKB|Q9W799 - symbol:cpsf2 "Cleavage and polyadenyla... 389 2.1e-36 2
RGD|1309687 - symbol:Cpsf2 "cleavage and polyadenylation ... 385 5.1e-36 2
MGI|MGI:1861601 - symbol:Cpsf2 "cleavage and polyadenylat... 384 6.7e-36 2
UNIPROTKB|F1SD85 - symbol:CPSF2 "Uncharacterized protein"... 389 1.2e-35 1
ZFIN|ZDB-GENE-040718-79 - symbol:cpsf2 "cleavage and poly... 380 3.5e-35 2
WB|WBGene00017313 - symbol:cpsf-2 species:6239 "Caenorhab... 383 1.2e-34 2
UNIPROTKB|O17403 - symbol:cpsf-2 "Probable cleavage and p... 383 1.2e-34 2
UNIPROTKB|Q9KV92 - symbol:VC_0264 "Putative uncharacteriz... 376 3.5e-34 1
TIGR_CMR|VC_0264 - symbol:VC_0264 "conserved hypothetical... 376 3.5e-34 1
FB|FBgn0027873 - symbol:Cpsf100 "Cleavage and polyadenyla... 354 6.9e-32 2
TAIR|locus:2172843 - symbol:CPSF100 "cleavage and polyade... 373 1.8e-31 1
TIGR_CMR|DET_1061 - symbol:DET_1061 "metallo-beta-lactama... 267 4.3e-30 2
DICTYBASE|DDB_G0270392 - symbol:cpsf2 "cleavage and polya... 352 2.8e-29 2
TIGR_CMR|CHY_2049 - symbol:CHY_2049 "metallo-beta-lactama... 326 7.0e-27 1
UNIPROTKB|E9PIL7 - symbol:CPSF3L "Integrator complex subu... 267 2.9e-22 1
POMBASE|SPBC1709.15c - symbol:cft2 "cleavage factor two C... 288 6.3e-22 1
UNIPROTKB|Q81SC3 - symbol:BA_1737 "Metallo-beta-lactamase... 272 4.2e-21 1
TIGR_CMR|BA_1737 - symbol:BA_1737 "metallo-beta-lactamase... 272 4.2e-21 1
UNIPROTKB|Q74C32 - symbol:GSU1843 "RNA exonuclease, beta-... 162 1.4e-19 2
TIGR_CMR|GSU_1843 - symbol:GSU_1843 "metallo-beta-lactama... 162 1.4e-19 2
UNIPROTKB|E9PQF0 - symbol:CPSF3L "Integrator complex subu... 242 1.5e-19 1
DICTYBASE|DDB_G0282473 - symbol:ints9 "integrator complex... 197 5.8e-17 3
RGD|1311539 - symbol:Ints9 "integrator complex subunit 9"... 191 3.0e-16 2
MGI|MGI:1098533 - symbol:Ints9 "integrator complex subuni... 186 4.0e-16 2
UNIPROTKB|Q9NV88 - symbol:INTS9 "Integrator complex subun... 182 1.4e-15 2
UNIPROTKB|F6XI08 - symbol:INTS9 "Uncharacterized protein"... 180 2.4e-15 2
UNIPROTKB|Q2KJA6 - symbol:INTS9 "Integrator complex subun... 178 3.9e-15 2
UNIPROTKB|F1MMA6 - symbol:INTS9 "Integrator complex subun... 177 5.0e-15 2
UNIPROTKB|Q0C1L6 - symbol:HNE_1669 "Putative uncharacteri... 183 1.1e-14 3
UNIPROTKB|F1RJQ5 - symbol:INTS9 "Uncharacterized protein"... 176 1.8e-14 2
UNIPROTKB|G3XAN1 - symbol:INTS9 "Integrator complex subun... 168 2.3e-14 2
UNIPROTKB|Q8EJC6 - symbol:SO_0541 "RNA-metabolizing metal... 213 3.8e-14 1
TIGR_CMR|SO_0541 - symbol:SO_0541 "metallo-beta-lactamase... 213 3.8e-14 1
UNIPROTKB|Q5ZKK2 - symbol:INTS9 "Integrator complex subun... 162 9.2e-13 2
UNIPROTKB|H7BYQ6 - symbol:INTS9 "Integrator complex subun... 182 6.9e-12 2
FB|FBgn0036570 - symbol:IntS9 "Integrator 9" species:7227... 144 7.5e-12 2
ZFIN|ZDB-GENE-061013-129 - symbol:ints9 "integrator compl... 148 6.3e-11 2
WB|WBGene00017608 - symbol:F19F10.12 species:6239 "Caenor... 154 8.9e-10 3
CGD|CAL0004705 - symbol:orf19.325 species:5476 "Candida a... 164 1.0e-08 3
UNIPROTKB|Q5AEE3 - symbol:CFT2 "Putative uncharacterized ... 164 1.0e-08 3
UNIPROTKB|H0YBH8 - symbol:INTS9 "Integrator complex subun... 151 1.7e-08 1
UNIPROTKB|Q87XP2 - symbol:PSPTO_4134 "Uncharacterized pro... 127 6.1e-05 1
UNIPROTKB|E2QVB2 - symbol:INTS9 "Uncharacterized protein"... 125 0.00014 1
TAIR|locus:2079696 - symbol:AT3G07530 "AT3G07530" species... 80 0.00019 3
UNIPROTKB|Q81SK8 - symbol:BA_1640 "Ribonuclease J" specie... 130 0.00022 2
TIGR_CMR|BA_1640 - symbol:BA_1640 "metallo-beta-lactamase... 130 0.00022 2
TIGR_CMR|CHY_1157 - symbol:CHY_1157 "metallo-beta-lactama... 113 0.00022 2
UNIPROTKB|Q83DU6 - symbol:CBU_0596 "Metal-dependent hydro... 114 0.00081 1
TIGR_CMR|CBU_0596 - symbol:CBU_0596 "conserved hypothetic... 114 0.00081 1
>FB|FBgn0261065 [details] [associations]
symbol:Cpsf73 "Cleavage and polyadenylation specificity
factor 73" species:7227 "Drosophila melanogaster" [GO:0006379 "mRNA
cleavage" evidence=ISS] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=ISS;NAS]
[GO:0006378 "mRNA polyadenylation" evidence=ISS;IMP] [GO:0016787
"hydrolase activity" evidence=IEA] [GO:0006398 "histone mRNA 3'-end
processing" evidence=IMP] [GO:0003677 "DNA binding" evidence=IDA]
[GO:0022008 "neurogenesis" evidence=IMP] InterPro:IPR001279
Pfam:PF00753 SMART:SM00849 Pfam:PF07521 EMBL:AE014297 GO:GO:0022008
GO:GO:0003677 GO:GO:0006378 GO:GO:0016787 GO:GO:0005847
GO:GO:0006379 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
Pfam:PF10996 SMART:SM01027 GO:GO:0006398 KO:K14403 OMA:YVSFSAH
InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
GeneTree:ENSGT00700000104485 EMBL:AY119128 RefSeq:NP_650738.1
UniGene:Dm.13714 SMR:Q9VE51 IntAct:Q9VE51 MINT:MINT-804945
STRING:Q9VE51 EnsemblMetazoa:FBtr0083690 GeneID:42240
KEGG:dme:Dmel_CG7698 UCSC:CG7698-RA CTD:42240 FlyBase:FBgn0261065
InParanoid:Q9VE51 OrthoDB:EOG4P5HR4 GenomeRNAi:42240 NextBio:827838
Uniprot:Q9VE51
Length = 684
Score = 2368 (838.6 bits), Expect = 1.2e-256, Sum P(2) = 1.2e-256
Identities = 434/569 (76%), Positives = 510/569 (89%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
GAGQEVGRSCIMLEFK K IM+DCGIHPGLSGMDALP+VDL+E+D+IDLL ISHFHLDHC
Sbjct: 24 GAGQEVGRSCIMLEFKGKKIMLDCGIHPGLSGMDALPYVDLIEADEIDLLFISHFHLDHC 83
Query: 66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIET 125
GALPWFL+KT FKGRCFMTHATKAIYRW+LSDYIK+SNISTEQMLYTE+DLE SM+KIET
Sbjct: 84 GALPWFLMKTSFKGRCFMTHATKAIYRWMLSDYIKISNISTEQMLYTEADLEASMEKIET 143
Query: 126 INFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK 185
INFHEE+DV G++F AY AGHVLGAAMF+IEIAG+KILYTGDFSRQEDRHLMAAE+PP+K
Sbjct: 144 INFHEERDVMGVRFCAYIAGHVLGAAMFMIEIAGIKILYTGDFSRQEDRHLMAAEVPPMK 203
Query: 186 PDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYW 245
PD+LITESTYGTH+HE+RE+RE RFTSL+ IV +GGRCLIPVFALGRAQELLLILDE+W
Sbjct: 204 PDVLITESTYGTHIHEKREDRENRFTSLVQKIVQQGGRCLIPVFALGRAQELLLILDEFW 263
Query: 246 SLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGIDHF 305
S +P+LH+IPIYYASSLAKKCM+VYQTYINAMNDRIRRQI++NNPFVF+HISNLKGIDHF
Sbjct: 264 SQNPDLHEIPIYYASSLAKKCMAVYQTYINAMNDRIRRQIAVNNPFVFRHISNLKGIDHF 323
Query: 306 EDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEVIGM 365
EDIGPCV+MASPGMMQSGLSRELFE WCTD KNGVIIAGYCVEGTLAK +LSEPEE+ +
Sbjct: 324 EDIGPCVIMASPGMMQSGLSRELFESWCTDPKNGVIIAGYCVEGTLAKAVLSEPEEITTL 383
Query: 366 SGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYED 425
SGQ+LPL MSVDYISFSAHTDYQQTSEF+R L+P HVVLVHGEQNEMSRLK AL REYE
Sbjct: 384 SGQKLPLNMSVDYISFSAHTDYQQTSEFIRLLKPTHVVLVHGEQNEMSRLKLALQREYEA 443
Query: 426 DPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHLLAP 485
D +T ++ YNPRNT +VDLYF+GEKTAKVMG LA +N + + LSG++VKR+F YHLLAP
Sbjct: 444 DASTDIKFYNPRNTHAVDLYFRGEKTAKVMGSLAAKNSEVGSKLSGVLVKRDFKYHLLAP 503
Query: 486 SDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHL-AGPVETLD-EKRLRAFACIEITL 543
SDL KYTD+ S + Q+QS+ + S+S L L+ + AG VE L+ E++LR F CIE+T+
Sbjct: 504 SDLGKYTDMSMSVVTQRQSIPWGSSLSTLELLLDRIGAGCVEVLEAERKLRVFGCIELTV 563
Query: 544 EKCIVVLEWASNPISDMYADSLISECLIE 572
E+ I+V+EW + ++D+YAD++++ C+++
Sbjct: 564 EQKIIVMEWQATHVNDVYADAVLA-CIMQ 591
Score = 126 (49.4 bits), Expect = 1.2e-256, Sum P(2) = 1.2e-256
Identities = 26/57 (45%), Positives = 38/57 (66%)
Query: 563 DSLISECLIEILVEMYGEAAVPKMFKGEKITITVDKKKACIDLVDLSVQC-EDSKLK 618
DS ECLIE L + +G+ VPKMF+G+ + +TV K+A I+L L++ C ED L+
Sbjct: 610 DSRFRECLIETLQDTFGDNCVPKMFEGDLLPVTVSGKRAEINLETLAISCAEDDVLR 666
Score = 45 (20.9 bits), Expect = 0.00033, Sum P(2) = 0.00033
Identities = 12/34 (35%), Positives = 18/34 (52%)
Query: 130 EEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKIL 163
EE D+ IK AG +G + ++E G KI+
Sbjct: 13 EESDLLQIK--PLGAGQEVGRSCIMLEFKGKKIM 44
>UNIPROTKB|E2R7R2 [details] [associations]
symbol:CPSF3 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718
SMART:SM01098 GeneTree:ENSGT00700000104485 EMBL:AAEX03010701
RefSeq:XP_003639652.1 Ensembl:ENSCAFT00000005417 GeneID:100856414
KEGG:cfa:100856414 Uniprot:E2R7R2
Length = 717
Score = 2233 (791.1 bits), Expect = 1.9e-236, Sum P(2) = 1.9e-236
Identities = 430/607 (70%), Positives = 503/607 (82%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
GAGQEVGRSCI+LEFK + IM+DCGIHPGL GMDALP++DL++ +IDLLLISHFHLDHC
Sbjct: 51 GAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHC 110
Query: 66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIET 125
GALPWFL KT FKGR FMTHATKAIYRWLLSDY+KVSNIS + MLYTE+DLE+SMDKIET
Sbjct: 111 GALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLYTETDLEESMDKIET 170
Query: 126 INFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK 185
INFHE K+V GIKF Y+AGHVLGAAMF+IEIAGVK+LYTGDFSRQEDRHLMAAEIP +K
Sbjct: 171 INFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIK 230
Query: 186 PDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYW 245
PDILI ESTYGTH+HE+REERE RF + +HDIVNRGGR LIPVFALGRAQELLLILDEYW
Sbjct: 231 PDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYW 290
Query: 246 SLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGIDHF 305
HPELHDIPIYYASSLAKKCM+VYQTY+NAMND+IR+QI+INNPFVFKHISNLK +DHF
Sbjct: 291 QNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPFVFKHISNLKSMDHF 350
Query: 306 EDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEVIGM 365
+DIGP VVMASPGMMQSGLSRELFE WCTD +NGVIIAGYCVEGTLAK I+SEPEE+ M
Sbjct: 351 DDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTM 410
Query: 366 SGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYED 425
SGQ+LPLKMSVDYISFSAHTDYQQTSEF+R L+P HV+LVHGEQNEM+RLKAAL REYED
Sbjct: 411 SGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMARLKAALIREYED 470
Query: 426 DPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHLLAP 485
+ +E++NPRNT +V L F+GEK AKVMG LA + + +SGI+VKRNFNYH+L+P
Sbjct: 471 NDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILSP 530
Query: 486 SDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLD--EK-RLRAFACIEIT 542
DL YTDL S + Q Q++ Y+G ++L + L G VE L+ EK L+ F I +
Sbjct: 531 CDLSNYTDLAMSTVKQTQAIPYTGPFNLLYYQLQKLTGDVEELEIQEKPALKVFKNITVI 590
Query: 543 LEKCIVVLEWASNPISDMYADSLISECLIEILVE-MYGEAAVPKMFKGEKITITVDKKKA 601
E +VVLEW +NP +DMYAD+ ++ ++E+ + AV K+ K K+ + V K+
Sbjct: 591 QEPGMVVLEWLANPSNDMYADT-VTTVILEVQSNPKIRKGAVQKVSK--KLEMHVYSKRL 647
Query: 602 CIDLVDL 608
I L D+
Sbjct: 648 EIMLQDI 654
Score = 70 (29.7 bits), Expect = 1.9e-236, Sum P(2) = 1.9e-236
Identities = 25/76 (32%), Positives = 42/76 (55%)
Query: 548 VVLEWASNPISDMYADSLISECL--------IEILVE-MYGEAAVPKMFKGEKITITVDK 598
V+LE SNP A +S+ L +EI+++ ++GE V + G +++TVD
Sbjct: 616 VILEVQSNPKIRKGAVQKVSKKLEMHVYSKRLEIMLQDIFGEDCV-SVKDGSVLSVTVDG 674
Query: 599 KKACIDLVDLSVQCED 614
K A I+L +V+CE+
Sbjct: 675 KTANINLETRTVECEE 690
>UNIPROTKB|P79101 [details] [associations]
symbol:CPSF3 "Cleavage and polyadenylation specificity
factor subunit 3" species:9913 "Bos taurus" [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=ISS] [GO:0006398 "histone mRNA 3'-end processing"
evidence=ISS] [GO:0003723 "RNA binding" evidence=ISS] [GO:0004521
"endoribonuclease activity" evidence=ISS] [GO:0008409 "5'-3'
exonuclease activity" evidence=ISS] [GO:0046872 "metal ion binding"
evidence=IEA] [GO:0030529 "ribonucleoprotein complex" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0046872 GO:GO:0003723 GO:GO:0030529 GO:GO:0004521
GO:GO:0005847 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
Pfam:PF10996 SMART:SM01027 GO:GO:0006398 HOGENOM:HOG000203394
KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
EMBL:X95906 EMBL:BC104553 IPI:IPI00708839 RefSeq:NP_776709.1
UniGene:Bt.5045 ProteinModelPortal:P79101 SMR:P79101 STRING:P79101
PRIDE:P79101 Ensembl:ENSBTAT00000026303 GeneID:281712
KEGG:bta:281712 CTD:51692 GeneTree:ENSGT00700000104485
HOVERGEN:HBG051107 InParanoid:P79101 OrthoDB:EOG4FN4H6
NextBio:20805634 ArrayExpress:P79101 GO:GO:0008409 Uniprot:P79101
Length = 684
Score = 2233 (791.1 bits), Expect = 2.4e-236, Sum P(2) = 2.4e-236
Identities = 430/607 (70%), Positives = 503/607 (82%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
GAGQEVGRSCI+LEFK + IM+DCGIHPGL GMDALP++DL++ +IDLLLISHFHLDHC
Sbjct: 18 GAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHC 77
Query: 66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIET 125
GALPWFL KT FKGR FMTHATKAIYRWLLSDY+KVSNIS + MLYTE+DLE+SMDKIET
Sbjct: 78 GALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLYTETDLEESMDKIET 137
Query: 126 INFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK 185
INFHE K+V GIKF Y+AGHVLGAAMF+IEIAGVK+LYTGDFSRQEDRHLMAAEIP +K
Sbjct: 138 INFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIK 197
Query: 186 PDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYW 245
PDILI ESTYGTH+HE+REERE RF + +HDIVNRGGR LIPVFALGRAQELLLILDEYW
Sbjct: 198 PDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYW 257
Query: 246 SLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGIDHF 305
HPELHDIPIYYASSLAKKCM+VYQTY+NAMND+IR+QI+INNPFVFKHISNLK +DHF
Sbjct: 258 QNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPFVFKHISNLKSMDHF 317
Query: 306 EDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEVIGM 365
+DIGP VVMASPGMMQSGLSRELFE WCTD +NGVIIAGYCVEGTLAK I+SEPEE+ M
Sbjct: 318 DDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTM 377
Query: 366 SGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYED 425
SGQ+LPLKMSVDYISFSAHTDYQQTSEF+R L+P HV+LVHGEQNEM+RLKAAL REYED
Sbjct: 378 SGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMARLKAALIREYED 437
Query: 426 DPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHLLAP 485
+ +E++NPRNT +V L F+GEK AKVMG LA + + +SGI+VKRNFNYH+L+P
Sbjct: 438 NDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILSP 497
Query: 486 SDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLD--EK-RLRAFACIEIT 542
DL YTDL S + Q Q++ Y+G ++L + L G VE L+ EK L+ F I +
Sbjct: 498 CDLSNYTDLAMSTVKQTQAIPYTGPFNLLYYQLQKLTGDVEELEIQEKPALKVFKNITVI 557
Query: 543 LEKCIVVLEWASNPISDMYADSLISECLIEILVE-MYGEAAVPKMFKGEKITITVDKKKA 601
E +VVLEW +NP +DMYAD+ ++ ++E+ + AV K+ K K+ + V K+
Sbjct: 558 QEPGMVVLEWLANPSNDMYADT-VTTVILEVQSNPKIRKGAVQKVSK--KLEMHVYSKRL 614
Query: 602 CIDLVDL 608
I L D+
Sbjct: 615 EIMLQDI 621
Score = 69 (29.3 bits), Expect = 2.4e-236, Sum P(2) = 2.4e-236
Identities = 25/76 (32%), Positives = 42/76 (55%)
Query: 548 VVLEWASNPISDMYADSLISECL--------IEILVE-MYGEAAVPKMFKGEKITITVDK 598
V+LE SNP A +S+ L +EI+++ ++GE V + G +++TVD
Sbjct: 583 VILEVQSNPKIRKGAVQKVSKKLEMHVYSKRLEIMLQDIFGEDCV-SVKDGSILSVTVDG 641
Query: 599 KKACIDLVDLSVQCED 614
K A I+L +V+CE+
Sbjct: 642 KTANINLETRTVECEE 657
>UNIPROTKB|I3LKR1 [details] [associations]
symbol:CPSF3 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0008409 "5'-3' exonuclease activity" evidence=IEA]
[GO:0006398 "histone mRNA 3'-end processing" evidence=IEA]
[GO:0005847 "mRNA cleavage and polyadenylation specificity factor
complex" evidence=IEA] [GO:0004521 "endoribonuclease activity"
evidence=IEA] [GO:0003723 "RNA binding" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0003723 GO:GO:0004521 GO:GO:0005847 InterPro:IPR022712
InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 OMA:YVSFSAH
InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
GeneTree:ENSGT00700000104485 GO:GO:0008409 EMBL:FP312696
Ensembl:ENSSSCT00000027309 Uniprot:I3LKR1
Length = 687
Score = 2221 (786.9 bits), Expect = 3.5e-235, Sum P(2) = 3.5e-235
Identities = 430/610 (70%), Positives = 503/610 (82%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
GAGQEVGRSCI+LEFK + IM+DCGIHPGL GMDALP++DL++ +IDLLLISHFHLDHC
Sbjct: 18 GAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHC 77
Query: 66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKV---SNISTEQMLYTESDLEKSMDK 122
GALPWFL KT FKGR FMTHATKAIYRWLLSDY+KV SNIS + MLYTE+DLE+SMDK
Sbjct: 78 GALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVRKCSNISADDMLYTETDLEESMDK 137
Query: 123 IETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIP 182
IETINFHE K+V GIKF Y+AGHVLGAAMF+IEIAGVK+LYTGDFSRQEDRHLMAAEIP
Sbjct: 138 IETINFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIP 197
Query: 183 PVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILD 242
+KPDILI ESTYGTH+HE+REERE RF + +HDIVNRGGR LIPVFALGRAQELLLILD
Sbjct: 198 NIKPDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILD 257
Query: 243 EYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGI 302
EYW HPELHDIPIYYASSLAKKCM+VYQTY+NAMND+IR+QI+INNPFVFKHISNLK +
Sbjct: 258 EYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPFVFKHISNLKSM 317
Query: 303 DHFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEV 362
DHF+DIGP VVMASPGMMQSGLSRELFE WCTD +NGVIIAGYCVEGTLAK I+SEPEE+
Sbjct: 318 DHFDDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEI 377
Query: 363 IGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTRE 422
MSGQ+LPLKMSVDYISFSAHTDYQQTSEF+R L+P HV+LVHGEQNEM+RLKAAL RE
Sbjct: 378 TTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMARLKAALIRE 437
Query: 423 YEDDPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHL 482
YED+ +E++NPRNT +V L F+GEK AKVMG LA + + +SGI+VKRNFNYH+
Sbjct: 438 YEDNDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHI 497
Query: 483 LAPSDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLD--EK-RLRAFACI 539
L+P DL YTDL S + Q Q++ Y+G ++L + L G VE L+ EK L+ F I
Sbjct: 498 LSPCDLSNYTDLAMSTVKQTQAIPYTGPFNLLSYQLQKLTGDVEELEIQEKPALKVFKNI 557
Query: 540 EITLEKCIVVLEWASNPISDMYADSLISECLIEILVE-MYGEAAVPKMFKGEKITITVDK 598
+ E +VVLEW +NP +DMYAD+ ++ ++E+ + AV K+ K K+ + V
Sbjct: 558 TVIQEPGMVVLEWLANPSNDMYADT-VTTVILEVQSNPKIRKGAVQKVSK--KLEMHVYS 614
Query: 599 KKACIDLVDL 608
K+ I L D+
Sbjct: 615 KRLEIMLQDI 624
Score = 70 (29.7 bits), Expect = 3.5e-235, Sum P(2) = 3.5e-235
Identities = 25/76 (32%), Positives = 42/76 (55%)
Query: 548 VVLEWASNPISDMYADSLISECL--------IEILVE-MYGEAAVPKMFKGEKITITVDK 598
V+LE SNP A +S+ L +EI+++ ++GE V + G +++TVD
Sbjct: 586 VILEVQSNPKIRKGAVQKVSKKLEMHVYSKRLEIMLQDIFGEDCV-SVKDGSVLSVTVDG 644
Query: 599 KKACIDLVDLSVQCED 614
K A I+L +V+CE+
Sbjct: 645 KTANINLETRTVECEE 660
>UNIPROTKB|F1NKW5 [details] [associations]
symbol:CPSF3 "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0003723 "RNA binding" evidence=IEA] [GO:0004521
"endoribonuclease activity" evidence=IEA] [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=IEA] [GO:0006398 "histone mRNA 3'-end processing"
evidence=IEA] [GO:0008409 "5'-3' exonuclease activity"
evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 GO:GO:0003723 GO:GO:0004521 GO:GO:0005847
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
GeneTree:ENSGT00700000104485 GO:GO:0008409 EMBL:AADN02018718
IPI:IPI00600642 Ensembl:ENSGALT00000026493 Uniprot:F1NKW5
Length = 685
Score = 2229 (789.7 bits), Expect = 1.2e-234, Sum P(2) = 1.2e-234
Identities = 427/616 (69%), Positives = 505/616 (81%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
GAGQEVGRSCI+LEFK + IM+DCGIHPGL GMDALP++DL++ +IDLLLISHFHLDHC
Sbjct: 18 GAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHC 77
Query: 66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIET 125
GALPWFL KT FKGR FMTHATKAIYRWLLSDY+KVSNIS + MLYTE+DLE+SMDKIET
Sbjct: 78 GALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLYTETDLEESMDKIET 137
Query: 126 INFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK 185
INFHE K+V GIKF Y+AGHVLGAAMF+IEIAGVK+LYTGDFSRQEDRHLMAAEIP +K
Sbjct: 138 INFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIK 197
Query: 186 PDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYW 245
PDILI ESTYGTH+HE+REERE RF + +HDIVNRGGR LIPVFALGRAQELLLILDEYW
Sbjct: 198 PDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYW 257
Query: 246 SLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGIDHF 305
HPELHDIPIYYASSLAKKCM+VYQTY+NAMND+IR+QI+INNPFVFKHISNLK +DHF
Sbjct: 258 QNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPFVFKHISNLKSMDHF 317
Query: 306 EDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEVIGM 365
+DIGP VVMASPGMMQSGLSRELFE WCTD +NGVIIAGYCVEGTLAK I+SEPEE+ M
Sbjct: 318 DDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTM 377
Query: 366 SGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYED 425
SGQ+LPLKMSVDYISFSAHTDYQQTSEF+R L+P HV+LVHGEQNEM+RLKAAL REYED
Sbjct: 378 SGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMARLKAALIREYED 437
Query: 426 DPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHLLAP 485
+ +E++NPRNT +V L F+GEK AKVMG LA + + +SGI+VKRNFNYH+L+P
Sbjct: 438 NDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGSLADKKPEQGQRISGILVKRNFNYHILSP 497
Query: 486 SDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLDEKR---LRAFACIEIT 542
DL YTDL S + Q ++ Y+G ++L + L G VE ++ ++ L+ F I +
Sbjct: 498 CDLSNYTDLAMSTVTQTLAIPYTGPFNLLFYQLQKLTGDVEEIEIQQKPALKVFKSITVI 557
Query: 543 LEKCIVVLEWASNPISDMYADSLISECLIEILVE-MYGEAAVPKMFKGEKITITVDKKKA 601
E +VVLEW +NP +DMYAD+ ++ ++E+ +AAV K+ K+ + +K+
Sbjct: 558 QEPGMVVLEWVANPANDMYADT-VTTVILEVQSNPKIQKAAVHKV--STKVDMEEYRKRM 614
Query: 602 CIDLVDL-SVQCEDSK 616
+ L D+ C SK
Sbjct: 615 EMMLQDMFGEDCVSSK 630
Score = 57 (25.1 bits), Expect = 1.2e-234, Sum P(2) = 1.2e-234
Identities = 14/41 (34%), Positives = 22/41 (53%)
Query: 573 ILVEMYGEAAVPKMFKGEKITITVDKKKACIDLVDLSVQCE 613
+L +M+GE V +G + +TVD K A + L + CE
Sbjct: 617 MLQDMFGEDCVSSK-EGSILCVTVDGKTANLSLETRTADCE 656
>ZFIN|ZDB-GENE-030131-3275 [details] [associations]
symbol:cpsf3 "cleavage and polyadenylation
specific factor 3" species:7955 "Danio rerio" [GO:0016787
"hydrolase activity" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 ZFIN:ZDB-GENE-030131-3275 GO:GO:0016787
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
KO:K14403 InterPro:IPR021718 Pfam:PF11718 SMART:SM01098 CTD:51692
HOVERGEN:HBG051107 EMBL:AY648793 IPI:IPI00509063
RefSeq:NP_001003836.1 UniGene:Dr.77231 ProteinModelPortal:Q6DRG6
SMR:Q6DRG6 STRING:Q6DRG6 GeneID:324554 KEGG:dre:324554
NextBio:20808833 ArrayExpress:Q6DRG6 Uniprot:Q6DRG6
Length = 690
Score = 2246 (795.7 bits), Expect = 7.3e-233, P = 7.3e-233
Identities = 428/618 (69%), Positives = 507/618 (82%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
GAGQEVGRSCI+LEFK + IM+DCGIHPGL GMDALP++DL++ +IDLLLISHFHLDHC
Sbjct: 25 GAGQEVGRSCIILEFKGRKIMVDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHC 84
Query: 66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIET 125
GALPWFL KT FKGR FMTHATKAIYRWLLSDY+KVSNIS + MLYTE+DLE+SMDKIET
Sbjct: 85 GALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLYTETDLEESMDKIET 144
Query: 126 INFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK 185
INFHE K+V GIKF Y+AGHVLGAAMF+IEIAGVK+LYTGDFSRQEDRHLMAAEIP VK
Sbjct: 145 INFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPSVK 204
Query: 186 PDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYW 245
PDILITESTYGTH+HE+REERE RF + +HDIVNR GRCLIPVFALGRAQELLLILDEYW
Sbjct: 205 PDILITESTYGTHIHEKREEREARFCNTVHDIVNREGRCLIPVFALGRAQELLLILDEYW 264
Query: 246 SLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGIDHF 305
HPELHDIPIYYASSLAKKCM+VYQTY+NAMND+IR+ I+INNPFVFKHISNLK +DHF
Sbjct: 265 QNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKAININNPFVFKHISNLKSMDHF 324
Query: 306 EDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEVIGM 365
+DIGP VVMASPGMMQSGLSRELFE WCTD +NGVIIAGYCVEGTLAK I+SEPEE+ M
Sbjct: 325 DDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTM 384
Query: 366 SGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYED 425
SGQ+LPLKMSVDYISFSAHTDYQQTSEF+R L+P HV+LVHGEQNEM+RLKAAL REYED
Sbjct: 385 SGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMARLKAALIREYED 444
Query: 426 DPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHLLAP 485
+ +E++NPRNT +V L F+GEK AKVMG LA + +SGI+VK+NF+YH+L+P
Sbjct: 445 NDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGSLADKKCSQGQRVSGILVKKNFSYHILSP 504
Query: 486 SDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLD--EKR-LRAFACIEIT 542
SDL YTDL S + Q Q++ ++G +L S + HL G VE ++ EK ++ F I +
Sbjct: 505 SDLSNYTDLAMSTVKQTQAIPFTGPFPLLLSQLRHLTGDVEEIEMSEKSTVKVFNSITVI 564
Query: 543 LEKCIVVLEWASNPISDMYADSLISECLIEILVEMYGEAAVPKMFKGEKITITVDKKKAC 602
E +VVLEW +NP++DMYAD+ ++ ++E+ + A+ K K+ + V + +
Sbjct: 565 HENNLVVLEWFANPLNDMYADA-VTTVVLEVQSNPKAQKALQPQEK--KVDVNVFQNRLL 621
Query: 603 IDLVDL-SVQCEDSKLKS 619
D+ +C D K K+
Sbjct: 622 KMFQDMFGEECVDFKDKN 639
>UNIPROTKB|Q9UKF6 [details] [associations]
symbol:CPSF3 "Cleavage and polyadenylation specificity
factor subunit 3" species:9606 "Homo sapiens" [GO:0030529
"ribonucleoprotein complex" evidence=IEA] [GO:0046872 "metal ion
binding" evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
[GO:0008409 "5'-3' exonuclease activity" evidence=ISS] [GO:0004521
"endoribonuclease activity" evidence=ISS] [GO:0003723 "RNA binding"
evidence=ISS] [GO:0006398 "histone mRNA 3'-end processing"
evidence=IDA] [GO:0005847 "mRNA cleavage and polyadenylation
specificity factor complex" evidence=IDA] [GO:0006378 "mRNA
polyadenylation" evidence=TAS] [GO:0006379 "mRNA cleavage"
evidence=TAS] [GO:0000398 "mRNA splicing, via spliceosome"
evidence=TAS] [GO:0005654 "nucleoplasm" evidence=TAS] [GO:0006366
"transcription from RNA polymerase II promoter" evidence=TAS]
[GO:0006369 "termination of RNA polymerase II transcription"
evidence=TAS] [GO:0006397 "mRNA processing" evidence=TAS]
[GO:0006406 "mRNA export from nucleus" evidence=TAS] [GO:0008380
"RNA splicing" evidence=TAS] [GO:0010467 "gene expression"
evidence=TAS] [GO:0031124 "mRNA 3'-end processing" evidence=TAS]
Reactome:REACT_71 InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
EMBL:AF017269 Pfam:PF07521 EMBL:AF171877 EMBL:CH471053
GO:GO:0046872 Reactome:REACT_1675 GO:GO:0006378 GO:GO:0003723
GO:GO:0030529 GO:GO:0006406 GO:GO:0004521 GO:GO:0000398
Reactome:REACT_1788 GO:GO:0005847 GO:GO:0006379 GO:GO:0006369
eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 Reactome:REACT_78 GO:GO:0006398 HOGENOM:HOG000203394
KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
CTD:51692 HOVERGEN:HBG051107 OrthoDB:EOG4FN4H6 GO:GO:0008409
EMBL:AC080162 EMBL:BC011654 EMBL:BC020211 IPI:IPI00007818
RefSeq:NP_057291.1 UniGene:Hs.515972 PDB:2I7T PDB:2I7V PDBsum:2I7T
PDBsum:2I7V ProteinModelPortal:Q9UKF6 SMR:Q9UKF6 DIP:DIP-42501N
MINT:MINT-1742891 STRING:Q9UKF6 PhosphoSite:Q9UKF6 DMDM:18203503
PaxDb:Q9UKF6 PeptideAtlas:Q9UKF6 PRIDE:Q9UKF6 DNASU:51692
Ensembl:ENST00000238112 GeneID:51692 KEGG:hsa:51692 UCSC:uc002qzo.1
GeneCards:GC02P009514 HGNC:HGNC:2326 HPA:HPA034657 MIM:606029
neXtProt:NX_Q9UKF6 PharmGKB:PA26843 InParanoid:Q9UKF6
PhylomeDB:Q9UKF6 ChiTaRS:CPSF3 EvolutionaryTrace:Q9UKF6
GenomeRNAi:51692 NextBio:55702 ArrayExpress:Q9UKF6 Bgee:Q9UKF6
CleanEx:HS_CPSF3 Genevestigator:Q9UKF6 GermOnline:ENSG00000119203
Uniprot:Q9UKF6
Length = 684
Score = 2235 (791.8 bits), Expect = 1.1e-231, P = 1.1e-231
Identities = 436/625 (69%), Positives = 510/625 (81%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
GAGQEVGRSCI+LEFK + IM+DCGIHPGL GMDALP++DL++ +IDLLLISHFHLDHC
Sbjct: 18 GAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHC 77
Query: 66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIET 125
GALPWFL KT FKGR FMTHATKAIYRWLLSDY+KVSNIS + MLYTE+DLE+SMDKIET
Sbjct: 78 GALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLYTETDLEESMDKIET 137
Query: 126 INFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK 185
INFHE K+V GIKF Y+AGHVLGAAMF+IEIAGVK+LYTGDFSRQEDRHLMAAEIP +K
Sbjct: 138 INFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIK 197
Query: 186 PDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYW 245
PDILI ESTYGTH+HE+REERE RF + +HDIVNRGGR LIPVFALGRAQELLLILDEYW
Sbjct: 198 PDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYW 257
Query: 246 SLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGIDHF 305
HPELHDIPIYYASSLAKKCM+VYQTY+NAMND+IR+QI+INNPFVFKHISNLK +DHF
Sbjct: 258 QNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPFVFKHISNLKSMDHF 317
Query: 306 EDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEVIGM 365
+DIGP VVMASPGMMQSGLSRELFE WCTD +NGVIIAGYCVEGTLAK I+SEPEE+ M
Sbjct: 318 DDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTM 377
Query: 366 SGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYED 425
SGQ+LPLKMSVDYISFSAHTDYQQTSEF+R L+P HV+LVHGEQNEM+RLKAAL REYED
Sbjct: 378 SGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMARLKAALIREYED 437
Query: 426 DPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHLLAP 485
+ +E++NPRNT +V L F+GEK AKVMG LA + + +SGI+VKRNFNYH+L+P
Sbjct: 438 NDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILSP 497
Query: 486 SDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLD--EK-RLRAFACIEIT 542
DL YTDL S + Q Q++ Y+G ++L + L G VE L+ EK L+ F I +
Sbjct: 498 CDLSNYTDLAMSTVKQTQAIPYTGPFNLLCYQLQKLTGDVEELEIQEKPALKVFKNITVI 557
Query: 543 LEKCIVVLEWASNPISDMYADSLISECLIEILVE-MYGEAAVPKMFKGEKITITVDKKKA 601
E +VVLEW +NP +DMYAD+ ++ ++E+ + AV K+ K K+ + V K+
Sbjct: 558 QEPGMVVLEWLANPSNDMYADT-VTTVILEVQSNPKIRKGAVQKVSK--KLEMHVYSKRL 614
Query: 602 CIDLVDL-SVQC----EDSKLKSTV 621
I L D+ C +DS L TV
Sbjct: 615 EIMLQDIFGEDCVSVKDDSILSVTV 639
>MGI|MGI:1859328 [details] [associations]
symbol:Cpsf3 "cleavage and polyadenylation specificity
factor 3" species:10090 "Mus musculus" [GO:0003723 "RNA binding"
evidence=IDA] [GO:0003729 "mRNA binding" evidence=ISO] [GO:0004518
"nuclease activity" evidence=IEA] [GO:0004519 "endonuclease
activity" evidence=IEA] [GO:0004521 "endoribonuclease activity"
evidence=IDA] [GO:0005634 "nucleus" evidence=IEA] [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=ISO;IDA] [GO:0006378 "mRNA polyadenylation" evidence=ISO]
[GO:0006397 "mRNA processing" evidence=IEA] [GO:0006398 "histone
mRNA 3'-end processing" evidence=ISO;IDA] [GO:0008409 "5'-3'
exonuclease activity" evidence=IDA] [GO:0016787 "hydrolase
activity" evidence=IEA] [GO:0030529 "ribonucleoprotein complex"
evidence=IEA] [GO:0046872 "metal ion binding" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
MGI:MGI:1859328 GO:GO:0046872 GO:GO:0003723 GO:GO:0030529
GO:GO:0004521 GO:GO:0005847 eggNOG:COG1236 InterPro:IPR022712
InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 GO:GO:0006398
HOGENOM:HOG000203394 KO:K14403 OMA:YVSFSAH InterPro:IPR021718
Pfam:PF11718 SMART:SM01098 CTD:51692 GeneTree:ENSGT00700000104485
HOVERGEN:HBG051107 OrthoDB:EOG4FN4H6 GO:GO:0008409 ChiTaRS:CPSF3
EMBL:AF203969 EMBL:BC023297 IPI:IPI00135099 RefSeq:NP_061283.2
UniGene:Mm.356778 ProteinModelPortal:Q9QXK7 SMR:Q9QXK7
STRING:Q9QXK7 PhosphoSite:Q9QXK7 PaxDb:Q9QXK7 PRIDE:Q9QXK7
Ensembl:ENSMUST00000067284 GeneID:54451 KEGG:mmu:54451
InParanoid:Q8CIM0 NextBio:311332 Bgee:Q9QXK7 CleanEx:MM_CPSF3
Genevestigator:Q9QXK7 GermOnline:ENSMUSG00000054309 Uniprot:Q9QXK7
Length = 684
Score = 2229 (789.7 bits), Expect = 4.6e-231, P = 4.6e-231
Identities = 433/625 (69%), Positives = 509/625 (81%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
GAGQEVGRSCI+LEFK + IM+DCGIHPGL GMDALP++DL++ +IDLLLISHFHLDHC
Sbjct: 18 GAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHC 77
Query: 66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIET 125
GALPWFL KT FKGR FMTHATKAIYRWLLSDY+KVSNIS + MLYTE+DLE+SMDKIET
Sbjct: 78 GALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLYTETDLEESMDKIET 137
Query: 126 INFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK 185
INFHE K+V GIKF Y+AGHVLGAAMF+IEIAGVK+LYTGDFSRQEDRHLMAAEIP +K
Sbjct: 138 INFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIK 197
Query: 186 PDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYW 245
PDILI ESTYGTH+HE+REERE RF + +HDIVNRGGR LIPVFALGRAQELLLILDEYW
Sbjct: 198 PDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYW 257
Query: 246 SLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGIDHF 305
HPELHDIPIYYASSLAKKCM+VYQTY+NAMND+IR+QI+INNPFVFKHISNLK +DHF
Sbjct: 258 QNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPFVFKHISNLKSMDHF 317
Query: 306 EDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEVIGM 365
+DIGP VVMASPGM+Q+GLSRELFE WCTD +NGVIIAGYCVEGTLAK I+SEPEE+ M
Sbjct: 318 DDIGPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTM 377
Query: 366 SGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYED 425
SGQ+LPLKMSVDYISFSAHTDYQQTSEF+R L+P HV+LVHGEQNEM+RLKAAL REYED
Sbjct: 378 SGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMARLKAALIREYED 437
Query: 426 DPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHLLAP 485
+ +E++NPRNT +V L F+GEK AKVMG LA + + +SGI+VKRNFNYH+L+P
Sbjct: 438 NDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILSP 497
Query: 486 SDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLD--EK-RLRAFACIEIT 542
DL YTDL S + Q Q++ Y+G +L + L G VE L+ EK L+ F I +
Sbjct: 498 CDLSNYTDLAMSTVKQTQAIPYTGPFYLLYYQLQKLTGDVEELEIQEKPALKVFKSITVV 557
Query: 543 LEKCIVVLEWASNPISDMYADSLISECLIEILVE-MYGEAAVPKMFKGEKITITVDKKKA 601
E +VVLEW +NP +DMYAD+ ++ ++E+ + AV K+ K K+ + V K+
Sbjct: 558 QEPGMVVLEWLANPSNDMYADT-VTTVILEVQSNPKIRKGAVQKVSK--KLEMHVYSKRL 614
Query: 602 CIDLVDL-SVQC----EDSKLKSTV 621
+ L D+ C +DS L TV
Sbjct: 615 EVMLQDIFGEDCVSVKDDSVLSVTV 639
>UNIPROTKB|G3V6W7 [details] [associations]
symbol:Cpsf3 "Protein Cpsf3" species:10116 "Rattus
norvegicus" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
RGD:1305767 GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108
Pfam:PF10996 SMART:SM01027 EMBL:CH473947 OMA:YVSFSAH
InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
GeneTree:ENSGT00700000104485 UniGene:Rn.100522
Ensembl:ENSRNOT00000009652 Uniprot:G3V6W7
Length = 685
Score = 2229 (789.7 bits), Expect = 4.6e-231, P = 4.6e-231
Identities = 433/625 (69%), Positives = 509/625 (81%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
GAGQEVGRSCI+LEFK + IM+DCGIHPGL GMDALP++DL++ +IDLLLISHFHLDHC
Sbjct: 18 GAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHC 77
Query: 66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIET 125
GALPWFL KT FKGR FMTHATKAIYRWLLSDY+KVSNIS + MLYTE+DLE+SMDKIET
Sbjct: 78 GALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLYTETDLEESMDKIET 137
Query: 126 INFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK 185
INFHE K+V GIKF Y+AGHVLGAAMF+IEIAGVK+LYTGDFSRQEDRHLMAAEIP +K
Sbjct: 138 INFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIK 197
Query: 186 PDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYW 245
PDILI ESTYGTH+HE+REERE RF + +HDIVNRGGR LIPVFALGRAQELLLILDEYW
Sbjct: 198 PDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYW 257
Query: 246 SLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGIDHF 305
HPELHDIPIYYASSLAKKCM+VYQTY+NAMND+IR+QI+INNPFVFKHISNLK +DHF
Sbjct: 258 QNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPFVFKHISNLKSMDHF 317
Query: 306 EDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEVIGM 365
+DIGP VVMASPGM+Q+GLSRELFE WCTD +NGVIIAGYCVEGTLAK I+SEPEE+ M
Sbjct: 318 DDIGPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTM 377
Query: 366 SGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYED 425
SGQ+LPLKMSVDYISFSAHTDYQQTSEF+R L+P HV+LVHGEQNEM+RLKAAL REYED
Sbjct: 378 SGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMARLKAALIREYED 437
Query: 426 DPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHLLAP 485
+ +E++NPRNT +V L F+GEK AKVMG LA + + +SGI+VKRNFNYH+L+P
Sbjct: 438 NDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILSP 497
Query: 486 SDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLD--EK-RLRAFACIEIT 542
DL YTDL S + Q Q++ Y+G +L + L G VE L+ EK L+ F I +
Sbjct: 498 CDLSNYTDLAMSTVKQTQAIPYTGPFYLLYYQLQKLTGDVEELEIQEKPALKVFKSITVV 557
Query: 543 LEKCIVVLEWASNPISDMYADSLISECLIEILVE-MYGEAAVPKMFKGEKITITVDKKKA 601
E +VVLEW +NP +DMYAD+ ++ ++E+ + AV K+ K K+ + V K+
Sbjct: 558 QEPGMVVLEWLANPSNDMYADT-VTTVILEVQSNPKIRKGAVQKVSK--KLEMHVYSKRL 614
Query: 602 CIDLVDL-SVQC----EDSKLKSTV 621
+ L D+ C +DS L TV
Sbjct: 615 EVMLQDIFGEDCVSVKDDSVLSVTV 639
>RGD|1305767 [details] [associations]
symbol:Cpsf3 "cleavage and polyadenylation specific factor 3,
73kDa" species:10116 "Rattus norvegicus" [GO:0003723 "RNA binding"
evidence=ISO] [GO:0004521 "endoribonuclease activity" evidence=ISO]
[GO:0005847 "mRNA cleavage and polyadenylation specificity factor
complex" evidence=ISO] [GO:0006398 "histone mRNA 3'-end processing"
evidence=ISO] [GO:0008409 "5'-3' exonuclease activity"
evidence=ISO] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 RGD:1305767 GO:GO:0016787 eggNOG:COG1236
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
HOGENOM:HOG000203394 KO:K14403 InterPro:IPR021718 Pfam:PF11718
SMART:SM01098 CTD:51692 HOVERGEN:HBG051107 OrthoDB:EOG4FN4H6
UniGene:Rn.100522 EMBL:BC099817 IPI:IPI00365532
RefSeq:NP_001025201.1 ProteinModelPortal:Q499P4 SMR:Q499P4
STRING:Q499P4 GeneID:298916 KEGG:rno:298916 InParanoid:Q499P4
NextBio:644507 Genevestigator:Q499P4 Uniprot:Q499P4
Length = 685
Score = 2226 (788.7 bits), Expect = 9.6e-231, P = 9.6e-231
Identities = 432/625 (69%), Positives = 509/625 (81%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
GAGQEVGRSCI+LEFK + IM+DCGIHPGL GMDALP++DL++ +IDLLLISHFHLDHC
Sbjct: 18 GAGQEVGRSCIILEFKGRKIMLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHC 77
Query: 66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIET 125
GALPWFL KT FKGR FMTHATKAIYRWLLSDY+KVSNIS + MLYTE+DLE+SMDKIET
Sbjct: 78 GALPWFLQKTSFKGRTFMTHATKAIYRWLLSDYVKVSNISADDMLYTETDLEESMDKIET 137
Query: 126 INFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK 185
INFHE K+V GIKF Y+AGHVLGAAMF+IEIAG+K+LYTGDFSRQEDRHLMAAEIP +K
Sbjct: 138 INFHEVKEVAGIKFWCYHAGHVLGAAMFMIEIAGMKLLYTGDFSRQEDRHLMAAEIPNIK 197
Query: 186 PDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYW 245
PDILI ESTYGTH+HE+REERE RF + +HDIVNRGGR LIPVFALGRAQELLLILDEYW
Sbjct: 198 PDILIIESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYW 257
Query: 246 SLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGIDHF 305
HPELHDIPIYYASSLAKKCM+VYQTY+NAMND+IR+QI+INNPFVFKHISNLK +DHF
Sbjct: 258 QNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPFVFKHISNLKSMDHF 317
Query: 306 EDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEVIGM 365
+DIGP VVMASPGM+Q+GLSRELFE WCTD +NGVIIAGYCVEGTLAK I+SEPEE+ M
Sbjct: 318 DDIGPSVVMASPGMIQNGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTM 377
Query: 366 SGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYED 425
SGQ+LPLKMSVDYISFSAHTDYQQTSEF+R L+P HV+LVHGEQNEM+RLKAAL REYED
Sbjct: 378 SGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMARLKAALIREYED 437
Query: 426 DPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHLLAP 485
+ +E++NPRNT +V L F+GEK AKVMG LA + + +SGI+VKRNFNYH+L+P
Sbjct: 438 NDEVHIEVHNPRNTEAVTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILSP 497
Query: 486 SDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLD--EK-RLRAFACIEIT 542
DL YTDL S + Q Q++ Y+G +L + L G VE L+ EK L+ F I +
Sbjct: 498 CDLSNYTDLAMSTVKQTQAIPYTGPFYLLYYQLQKLTGDVEELEIQEKPALKVFKSITVV 557
Query: 543 LEKCIVVLEWASNPISDMYADSLISECLIEILVE-MYGEAAVPKMFKGEKITITVDKKKA 601
E +VVLEW +NP +DMYAD+ ++ ++E+ + AV K+ K K+ + V K+
Sbjct: 558 QEPGMVVLEWLANPSNDMYADT-VTTVILEVQSNPKIRKGAVQKVSK--KLEMHVYSKRL 614
Query: 602 CIDLVDL-SVQC----EDSKLKSTV 621
+ L D+ C +DS L TV
Sbjct: 615 EVMLQDIFGEDCVSVKDDSVLSVTV 639
>UNIPROTKB|G5E9W3 [details] [associations]
symbol:CPSF3 "Cleavage and polyadenylation-specificity
factor subunit 3" species:9606 "Homo sapiens" [GO:0003723 "RNA
binding" evidence=IEA] [GO:0004521 "endoribonuclease activity"
evidence=IEA] [GO:0008409 "5'-3' exonuclease activity"
evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 EMBL:CH471053 GO:GO:0003723 GO:GO:0004521
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
InterPro:IPR021718 Pfam:PF11718 SMART:SM01098 GO:GO:0008409
EMBL:AC080162 UniGene:Hs.515972 HGNC:HGNC:2326 ChiTaRS:CPSF3
ProteinModelPortal:G5E9W3 SMR:G5E9W3 PRIDE:G5E9W3
Ensembl:ENST00000460593 ArrayExpress:G5E9W3 Bgee:G5E9W3
Uniprot:G5E9W3
Length = 647
Score = 2150 (761.9 bits), Expect = 1.1e-222, P = 1.1e-222
Identities = 420/605 (69%), Positives = 492/605 (81%)
Query: 26 MMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGFKGRCFMTH 85
M+DCGIHPGL GMDALP++DL++ +IDLLLISHFHLDHCGALPWFL KT FKGR FMTH
Sbjct: 1 MLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTSFKGRTFMTH 60
Query: 86 ATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNGIKFSAYNAG 145
ATKAIYRWLLSDY+KVSNIS + MLYTE+DLE+SMDKIETINFHE K+V GIKF Y+AG
Sbjct: 61 ATKAIYRWLLSDYVKVSNISADDMLYTETDLEESMDKIETINFHEVKEVAGIKFWCYHAG 120
Query: 146 HVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVKPDILITESTYGTHVHEQREE 205
HVLGAAMF+IEIAGVK+LYTGDFSRQEDRHLMAAEIP +KPDILI ESTYGTH+HE+REE
Sbjct: 121 HVLGAAMFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREE 180
Query: 206 REGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAKK 265
RE RF + +HDIVNRGGR LIPVFALGRAQELLLILDEYW HPELHDIPIYYASSLAKK
Sbjct: 181 REARFCNTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKK 240
Query: 266 CMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGIDHFEDIGPCVVMASPGMMQSGLS 325
CM+VYQTY+NAMND+IR+QI+INNPFVFKHISNLK +DHF+DIGP VVMASPGMMQSGLS
Sbjct: 241 CMAVYQTYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLS 300
Query: 326 RELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEVIGMSGQRLPLKMSVDYISFSAHT 385
RELFE WCTD +NGVIIAGYCVEGTLAK I+SEPEE+ MSGQ+LPLKMSVDYISFSAHT
Sbjct: 301 RELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHT 360
Query: 386 DYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMELYNPRNTVSVDLY 445
DYQQTSEF+R L+P HV+LVHGEQNEM+RLKAAL REYED+ +E++NPRNT +V L
Sbjct: 361 DYQQTSEFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEAVTLN 420
Query: 446 FKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHLLAPSDLPKYTDLKASKIIQQQSV 505
F+GEK AKVMG LA + + +SGI+VKRNFNYH+L+P DL YTDL S + Q Q++
Sbjct: 421 FRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILSPCDLSNYTDLAMSTVKQTQAI 480
Query: 506 YYSGSISVLRSLISHLAGPVETLD--EK-RLRAFACIEITLEKCIVVLEWASNPISDMYA 562
Y+G ++L + L G VE L+ EK L+ F I + E +VVLEW +NP +DMYA
Sbjct: 481 PYTGPFNLLCYQLQKLTGDVEELEIQEKPALKVFKNITVIQEPGMVVLEWLANPSNDMYA 540
Query: 563 DSLISECLIEILVE-MYGEAAVPKMFKGEKITITVDKKKACIDLVDL-SVQC----EDSK 616
D+ ++ ++E+ + AV K+ K K+ + V K+ I L D+ C +DS
Sbjct: 541 DT-VTTVILEVQSNPKIRKGAVQKVSK--KLEMHVYSKRLEIMLQDIFGEDCVSVKDDSI 597
Query: 617 LKSTV 621
L TV
Sbjct: 598 LSVTV 602
>WB|WBGene00013460 [details] [associations]
symbol:cpsf-3 species:6239 "Caenorhabditis elegans"
[GO:0016787 "hydrolase activity" evidence=IEA] [GO:0009792 "embryo
development ending in birth or egg hatching" evidence=IMP]
[GO:0040007 "growth" evidence=IMP] [GO:0002119 "nematode larval
development" evidence=IMP] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 GO:GO:0009792 GO:GO:0040007
GO:GO:0002119 GO:GO:0016787 eggNOG:COG1236 InterPro:IPR022712
InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 HOGENOM:HOG000203394
KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
GeneTree:ENSGT00700000104485 EMBL:AL132951 RefSeq:NP_502553.2
ProteinModelPortal:Q95PY8 SMR:Q95PY8 STRING:Q95PY8 PaxDb:Q95PY8
EnsemblMetazoa:Y67H2A.1.1 EnsemblMetazoa:Y67H2A.1.2 GeneID:178285
KEGG:cel:CELE_Y67H2A.1 UCSC:Y67H2A.1 CTD:178285 WormBase:Y67H2A.1
InParanoid:Q95PY8 NextBio:900506 Uniprot:Q95PY8
Length = 707
Score = 1904 (675.3 bits), Expect = 1.3e-196, P = 1.3e-196
Identities = 354/583 (60%), Positives = 449/583 (77%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
G+GQEVGRSC +LE+K K +M+DCG+HPGL G+DALPFVD VE + IDLLLI+HFHLDHC
Sbjct: 17 GSGQEVGRSCHLLEYKGKRVMLDCGVHPGLHGVDALPFVDFVEIENIDLLLITHFHLDHC 76
Query: 66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNIS--TEQMLYTESDLEKSMDKI 123
GALPW L KT F+G+CFMTHATKAIYR LL DY+++S LYTE DLEKSM KI
Sbjct: 77 GALPWLLQKTAFQGKCFMTHATKAIYRMLLGDYVRISKYGGPDRNQLYTEDDLEKSMAKI 136
Query: 124 ETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPP 183
ETI+F E+K+VNGI+F Y AGHVLGA F+IEIAGV++LYTGDFS EDRHL AAEIPP
Sbjct: 137 ETIDFREQKEVNGIRFWPYVAGHVLGACQFMIEIAGVRVLYTGDFSCLEDRHLCAAEIPP 196
Query: 184 VKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDE 243
+ P +LITESTYGT HE R RE RFT ++HDIV RGGRCLIP FA+G AQEL+LILDE
Sbjct: 197 ITPQVLITESTYGTQTHEDRAVREKRFTQMVHDIVTRGGRCLIPAFAIGPAQELMLILDE 256
Query: 244 YWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGID 303
YW H ELHDIP+YYASSLAKKCMSVYQT++N MN RI++QI++ NPF+FKH+S L+G+D
Sbjct: 257 YWESHQELHDIPVYYASSLAKKCMSVYQTFVNGMNSRIQKQIAVKNPFIFKHVSTLRGMD 316
Query: 304 HFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEVI 363
FED GPCVV+A+PGM+QSG SRELFE WC D KNG IIAGYCVEGTLAK ILSEPEE++
Sbjct: 317 QFEDAGPCVVLATPGMLQSGFSRELFESWCPDTKNGCIIAGYCVEGTLAKHILSEPEEIV 376
Query: 364 GMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREY 423
+SG++LP++M V Y+SFSAHTDY QTS FV+ L+P H+VLVHGE +EMSRLK+ + R++
Sbjct: 377 SLSGEKLPMRMQVGYVSFSAHTDYHQTSNFVKALKPPHLVLVHGELHEMSRLKSGIERQF 436
Query: 424 EDDPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHLL 483
+DD N +E++NPRNT + L F+GEKTAKV+G+LA + + +SG++VK NF+Y ++
Sbjct: 437 QDD-NIPIEVHNPRNTERLQLQFRGEKTAKVIGKLAQRVPENNETISGVLVKNNFSYSIM 495
Query: 484 APSDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHL---AGPVETLDEKRL------- 533
P +L YT L+ S + Q+ SV+YSGS+ +L + L A ++ + K +
Sbjct: 496 VPEELGSYTSLRISSLEQRMSVHYSGSLKLLIFNLQQLNDDACLIQNIKLKEISKKGSVT 555
Query: 534 RAFAC----IEITL--EKCIVVLEWASNPISDMYADSLISECL 570
+A + +T+ +VV+ W SNP+ DMYADS+++ L
Sbjct: 556 QAITVFQGKVNVTVYGNDHVVVVRWDSNPVYDMYADSVVAAIL 598
>DICTYBASE|DDB_G0274799 [details] [associations]
symbol:cpsf3 "cleavage and polyadenylation
specificity factor 73 kDa subunit" species:44689 "Dictyostelium
discoideum" [GO:0016787 "hydrolase activity" evidence=IEA]
[GO:0003723 "RNA binding" evidence=IEA;IC] [GO:0006379 "mRNA
cleavage" evidence=ISS] [GO:0006378 "mRNA polyadenylation"
evidence=ISS] [GO:0005847 "mRNA cleavage and polyadenylation
specificity factor complex" evidence=ISS] [GO:0003730 "mRNA 3'-UTR
binding" evidence=ISS] [GO:0046872 "metal ion binding"
evidence=IEA] [GO:0006397 "mRNA processing" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] [GO:0004519 "endonuclease
activity" evidence=IEA] [GO:0004518 "nuclease activity"
evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
dictyBase:DDB_G0274799 Pfam:PF07521 GO:GO:0046872 GO:GO:0006378
GenomeReviews:CM000151_GR EMBL:AAFI02000012 GO:GO:0003730
GO:GO:0004519 GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
KO:K14403 OMA:YVSFSAH InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
RefSeq:XP_643926.1 ProteinModelPortal:Q86A79 SMR:Q86A79
STRING:Q86A79 EnsemblProtists:DDB0233696 GeneID:8619353
KEGG:ddi:DDB_G0274799 ProtClustDB:CLSZ2431003 Uniprot:Q86A79
Length = 774
Score = 1678 (595.7 bits), Expect = 1.1e-172, P = 1.1e-172
Identities = 317/575 (55%), Positives = 428/575 (74%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESD--QIDLLLISHFHLD 63
G+G EVGRSC++L++K K +M DCG+HP SG+ +LPF D +ESD IDLLL+SHFHLD
Sbjct: 42 GSGSEVGRSCVLLKYKGKKVMFDCGVHPAYSGLVSLPFFDSIESDIPDIDLLLVSHFHLD 101
Query: 64 HCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQ-MLYTESDLEKSMDK 122
H A+P+F+ KT FKGR FMTH TKAIY LLSDY+KVSNI+ + ML+ +SDL++S++K
Sbjct: 102 HAAAVPYFVGKTKFKGRVFMTHPTKAIYGMLLSDYVKVSNITRDDDMLFDKSDLDRSLEK 161
Query: 123 IETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIP 182
IE + + ++ + NGIK + +NAGHVLGAAMF+IEIAGVKILYTGDFSRQEDRHLM AE P
Sbjct: 162 IEKVRYRQKVEHNGIKVTCFNAGHVLGAAMFMIEIAGVKILYTGDFSRQEDRHLMGAETP 221
Query: 183 PVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILD 242
PVK D+LI ESTYG VHE R ERE RFTS +H +V R G+CLIPVFALGRAQELLLILD
Sbjct: 222 PVKVDVLIIESTYGVQVHEPRLEREKRFTSSVHQVVERNGKCLIPVFALGRAQELLLILD 281
Query: 243 EYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGI 302
EYW +P+LH +PIYYAS+LAKKCM VY+TYIN MNDR+R Q ++NPF FKHI N+KGI
Sbjct: 282 EYWIANPQLHHVPIYYASALAKKCMGVYRTYINMMNDRVRAQFDVSNPFEFKHIKNIKGI 341
Query: 303 DHFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEV 362
+ F+D GPCV MASPGM+QSGLSR+LFE WC+D +NG++I GY VEGTLAK I+SEP E+
Sbjct: 342 ESFDDRGPCVFMASPGMLQSGLSRQLFERWCSDKRNGIVIPGYSVEGTLAKHIMSEPAEI 401
Query: 363 IGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTRE 422
+ +PL ++V Y+SFSAH+D+ QTSEF++E++P HVVLVHG+ NEMSRL+ +L +
Sbjct: 402 TRLDNVNVPLNLTVSYVSFSAHSDFLQTSEFIQEIQPPHVVLVHGDANEMSRLRQSLVAK 461
Query: 423 YEDDPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHL 482
++ ++ + P+N +SV L F+ EK AK +G + K + + GI+V ++F +H+
Sbjct: 462 FK-----TINVLTPKNAMSVALEFRPEKVAKTLGSIITNPPKQNDIIQGILVTKDFTHHI 516
Query: 483 LAPSDLPKYTDLKASKIIQQQSVYYSGS----ISVLRSLISHLAGPVETL----DEK-RL 533
L+ SD+ YT+LK + I Q+ ++ ++ + IS L + + E+ +EK +
Sbjct: 517 LSASDIHNYTNLKTNIIKQKLTLPFAQTYHILISTLEQIYEQIIESTESTGGGGNEKPTI 576
Query: 534 RAFACIEITLEKCI-VVLEWASNPISDMYADSLIS 567
+ I++ + ++LEW SN ++DM DS+I+
Sbjct: 577 TIYNEIKLIYNIGVSIILEWNSNTVNDMICDSIIA 611
>TAIR|locus:2206076 [details] [associations]
symbol:CPSF73-I "cleavage and polyadenylation specificity
factor 73-I" species:3702 "Arabidopsis thaliana" [GO:0005634
"nucleus" evidence=ISM;IDA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=ISS]
[GO:0006378 "mRNA polyadenylation" evidence=ISS] [GO:0016787
"hydrolase activity" evidence=IEA] [GO:0005515 "protein binding"
evidence=IPI] [GO:0000278 "mitotic cell cycle" evidence=RCA]
[GO:0006346 "methylation-dependent chromatin silencing"
evidence=RCA] [GO:0006396 "RNA processing" evidence=RCA]
[GO:0006397 "mRNA processing" evidence=RCA] [GO:0009855
"determination of bilateral symmetry" evidence=RCA] [GO:0010014
"meristem initiation" evidence=RCA] [GO:0010073 "meristem
maintenance" evidence=RCA] [GO:0016246 "RNA interference"
evidence=RCA] [GO:0031507 "heterochromatin assembly" evidence=RCA]
[GO:0045787 "positive regulation of cell cycle" evidence=RCA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0005634 GO:GO:0006397
GO:GO:0090305 EMBL:AC018908 GO:GO:0004518 eggNOG:COG1236
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
EMBL:AY140900 EMBL:AY150478 EMBL:AY074280 EMBL:AK316692
EMBL:AK316794 IPI:IPI00533462 PIR:G96635 RefSeq:NP_001031215.1
RefSeq:NP_176297.1 RefSeq:NP_849835.1 UniGene:At.23510
ProteinModelPortal:Q9C952 SMR:Q9C952 IntAct:Q9C952 STRING:Q9C952
PaxDb:Q9C952 PRIDE:Q9C952 EnsemblPlants:AT1G61010.1
EnsemblPlants:AT1G61010.2 EnsemblPlants:AT1G61010.3 GeneID:842393
KEGG:ath:AT1G61010 TAIR:At1g61010 HOGENOM:HOG000203394
InParanoid:Q9C952 KO:K14403 OMA:YVSFSAH PhylomeDB:Q9C952
ProtClustDB:CLSN2681829 Genevestigator:Q9C952 InterPro:IPR021718
Pfam:PF11718 SMART:SM01098 Uniprot:Q9C952
Length = 693
Score = 1671 (593.3 bits), Expect = 6.2e-172, P = 6.2e-172
Identities = 324/569 (56%), Positives = 416/569 (73%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
GAG EVGRSC+ + F+ K+I+ DCGIHP SGM ALP+ D ++ ID+LLI+HFH+DH
Sbjct: 28 GAGSEVGRSCVYMSFRGKNILFDCGIHPAYSGMAALPYFDEIDPSSIDVLLITHFHIDHA 87
Query: 66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIET 125
+LP+FL KT F GR FMTHATKAIY+ LL+DY+KVS +S E ML+ E D+ KSMDKIE
Sbjct: 88 ASLPYFLEKTTFNGRVFMTHATKAIYKLLLTDYVKVSKVSVEDMLFDEQDINKSMDKIEV 147
Query: 126 INFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK 185
I+FH+ +VNGIKF Y AGHVLGAAMF+++IAGV+ILYTGD+SR+EDRHL AAE+P
Sbjct: 148 IDFHQTVEVNGIKFWCYTAGHVLGAAMFMVDIAGVRILYTGDYSREEDRHLRAAELPQFS 207
Query: 186 PDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYW 245
PDI I EST G +H+ R RE RFT +IH V +GGR LIP FALGRAQELLLILDEYW
Sbjct: 208 PDICIIESTSGVQLHQSRHIREKRFTDVIHSTVAQGGRVLIPAFALGRAQELLLILDEYW 267
Query: 246 SLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGIDHF 305
+ HP+LH+IPIYYAS LAKKCM+VYQTYI +MNDRIR Q + +NPFVFKHIS L ID F
Sbjct: 268 ANHPDLHNIPIYYASPLAKKCMAVYQTYILSMNDRIRNQFANSNPFVFKHISPLNSIDDF 327
Query: 306 EDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEVIGM 365
D+GP VVMA+PG +QSGLSR+LF+ WC+D KN II GY VEGTLAKTI++EP+EV M
Sbjct: 328 NDVGPSVVMATPGGLQSGLSRQLFDSWCSDKKNACIIPGYMVEGTLAKTIINEPKEVTLM 387
Query: 366 SGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYED 425
+G PL M V YISFSAH DY QTS F++EL P +++LVHGE NEM RLK L E+ D
Sbjct: 388 NGLTAPLNMQVHYISFSAHADYAQTSTFLKELMPPNIILVHGEANEMMRLKQKLLTEFPD 447
Query: 426 DPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHLLAP 485
NT ++ P+N SV++YF EK AK +G LA + +SGI+VK+ F Y ++AP
Sbjct: 448 G-NT--KIMTPKNCESVEMYFNSEKLAKTIGRLAEKTPDVGDTVSGILVKKGFTYQIMAP 504
Query: 486 SDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVE--TLDEKRLRAFACIE-IT 542
+L ++ L + + Q+ ++ + G+ V++ + + VE T +E L A E +T
Sbjct: 505 DELHVFSQLSTATVTQRITIPFVGAFGVIKHRLEKIFESVEFSTDEESGLPALKVHERVT 564
Query: 543 L----EKCIVVLEWASNPISDMYADSLIS 567
+ EK I L+W+S+PISDM +DS+++
Sbjct: 565 VKQESEKHIS-LQWSSDPISDMVSDSIVA 592
>POMBASE|SPAC17G6.16c [details] [associations]
symbol:ysh1 "mRNA cleavage and polyadenylation
specificity factor complex endoribonuclease subunit Ysh1"
species:4896 "Schizosaccharomyces pombe" [GO:0004521
"endoribonuclease activity" evidence=ISO] [GO:0005634 "nucleus"
evidence=IDA] [GO:0005829 "cytosol" evidence=IDA] [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=IDA] [GO:0006378 "mRNA polyadenylation" evidence=IC]
[GO:0006379 "mRNA cleavage" evidence=IC] [GO:0046872 "metal ion
binding" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 PomBase:SPAC17G6.16c Pfam:PF07521 GO:GO:0005829
EMBL:CU329670 GenomeReviews:CU329670_GR GO:GO:0046872 GO:GO:0006378
GO:GO:0004521 GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
HOGENOM:HOG000203394 InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
OrthoDB:EOG41ZJK7 PIR:T37848 RefSeq:NP_594263.2 STRING:O13794
EnsemblFungi:SPAC17G6.16c.1 GeneID:2542258 NextBio:20803322
Uniprot:O13794
Length = 757
Score = 1516 (538.7 bits), Expect = 1.7e-155, P = 1.7e-155
Identities = 273/567 (48%), Positives = 402/567 (70%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
GAG EVGRSC ++++K K++M+D G+HP +G+ ALPF D + +D+LLISHFHLDH
Sbjct: 25 GAGNEVGRSCHVIQYKGKTVMLDAGVHPAYTGLSALPFFDEFDLSTVDVLLISHFHLDHV 84
Query: 66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIET 125
+LP+ + KT F+GR FMTH TKA+ +WLLSDY+KVSN+ E LY E DL + D+IE
Sbjct: 85 ASLPYVMQKTNFRGRVFMTHPTKAVCKWLLSDYVKVSNVGMEDQLYDEKDLLAAFDRIEA 144
Query: 126 INFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK 185
+++H +V GIKF+ Y+AGHVLGA M+ +E+AGV IL+TGD+SR+EDRHL AE+PP +
Sbjct: 145 VDYHSTIEVEGIKFTPYHAGHVLGACMYFVEMAGVNILFTGDYSREEDRHLHVAEVPPKR 204
Query: 186 PDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYW 245
PD+LITESTYGT H+ R E+E R ++IH + GGR L+PVFALGRAQELLLILDEYW
Sbjct: 205 PDVLITESTYGTASHQPRLEKEARLLNIIHSTIRNGGRVLMPVFALGRAQELLLILDEYW 264
Query: 246 SLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKGIDHF 305
+ H +L +PIYYASSLA+KCM+++QTY+N MND IR+ + NPF+F+ + +L+ ++ F
Sbjct: 265 NNHLDLRSVPIYYASSLARKCMAIFQTYVNMMNDNIRKIFAERNPFIFRFVKSLRNLEKF 324
Query: 306 EDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEVIGM 365
+DIGP V++ASPGM+Q+G+SR L E W D +N +++ GY VEGT+AK I +EP E++ +
Sbjct: 325 DDIGPSVILASPGMLQNGVSRTLLERWAPDPRNTLLLTGYSVEGTMAKQITNEPIEIVSL 384
Query: 366 SGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYED 425
SGQ++P +M+V+ +SF+AH DY Q SEF+ + H++LVHGEQ M RLK+AL ++ +
Sbjct: 385 SGQKIPRRMAVEELSFAAHVDYLQNSEFIDLVNADHIILVHGEQTNMGRLKSALASKFHN 444
Query: 426 DPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGIIVKRNFNYHLLAP 485
+++Y PRN V + L FKGE+ + +G++AV K +SGI+++++ NY L++
Sbjct: 445 R-KVDVKVYTPRNCVPLYLPFKGERLVRALGKVAVHKPKEGDIMSGILIQKDANYKLMSA 503
Query: 486 SDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLDEK----RLRAFACIEI 541
DL ++DL + + Q+Q + + S+ + + + G V+ K + I +
Sbjct: 504 EDLRDFSDLTTTVLTQKQVIPFFSSMELANFHLKQMFGYVKQSKTKAGQPQYTVMDAITL 563
Query: 542 TL-EKCIVVLEWASNPISDMYADSLIS 567
TL ++ + LEW N ++D ADS+I+
Sbjct: 564 TLIQEHKLALEWVGNIMNDTIADSVIT 590
>SGD|S000004267 [details] [associations]
symbol:YSH1 "Putative endoribonuclease" species:4932
"Saccharomyces cerevisiae" [GO:0004518 "nuclease activity"
evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0016787
"hydrolase activity" evidence=IEA] [GO:0046872 "metal ion binding"
evidence=IEA] [GO:0006397 "mRNA processing" evidence=IEA]
[GO:0006378 "mRNA polyadenylation" evidence=IMP] [GO:0006379 "mRNA
cleavage" evidence=IMP] [GO:0006369 "termination of RNA polymerase
II transcription" evidence=IMP] [GO:0031126 "snoRNA 3'-end
processing" evidence=IMP] [GO:0008380 "RNA splicing" evidence=IMP]
[GO:0034247 "snoRNA splicing" evidence=IMP] [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=IDA;IPI] [GO:0005849 "mRNA cleavage factor complex"
evidence=IPI] [GO:0004521 "endoribonuclease activity"
evidence=ISS;IMP] [GO:0003723 "RNA binding" evidence=IC]
[GO:0004519 "endonuclease activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 SGD:S000004267
Pfam:PF07521 GO:GO:0046872 GO:GO:0006378 EMBL:BK006945
GO:GO:0004521 GO:GO:0005847 GO:GO:0006379 GO:GO:0006369
eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 EMBL:U17245 HOGENOM:HOG000203394 KO:K14403
InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
GeneTree:ENSGT00700000104485 GO:GO:0031126 GO:GO:0034247
OMA:EISFAAH OrthoDB:EOG41ZJK7 PIR:S51413 RefSeq:NP_013379.1
ProteinModelPortal:Q06224 SMR:Q06224 DIP:DIP-2470N IntAct:Q06224
MINT:MINT-375457 STRING:Q06224 PaxDb:Q06224 PeptideAtlas:Q06224
EnsemblFungi:YLR277C GeneID:850983 KEGG:sce:YLR277C CYGD:YLR277c
NextBio:967501 Genevestigator:Q06224 GermOnline:YLR277C
Uniprot:Q06224
Length = 779
Score = 1325 (471.5 bits), Expect = 1.2e-144, Sum P(3) = 1.2e-144
Identities = 252/478 (52%), Positives = 346/478 (72%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
G EVGRSC +L++K K++M+D GIHP G+ +LPF D + ++D+LLISHFHLDH
Sbjct: 15 GGSNEVGRSCHILQYKGKTVMLDAGIHPAYQGLASLPFYDEFDLSKVDILLISHFHLDHA 74
Query: 66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNI--STEQM------LYTESDLE 117
+LP+ + +T F+GR FMTH TKAIYRWLL D+++V++I S+ M L+++ DL
Sbjct: 75 ASLPYVMQRTNFQGRVFMTHPTKAIYRWLLRDFVRVTSIGSSSSSMGTKDEGLFSDEDLV 134
Query: 118 KSMDKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLM 177
S DKIET+++H DVNGIKF+A++AGHVLGAAMF IEIAG+++L+TGD+SR+ DRHL
Sbjct: 135 DSFDKIETVDYHSTVDVNGIKFTAFHAGHVLGAAMFQIEIAGLRVLFTGDYSREVDRHLN 194
Query: 178 AAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQEL 237
+AE+PP+ ++LI EST+GT HE R RE + T LIH V RGGR L+PVFALGRAQE+
Sbjct: 195 SAEVPPLSSNVLIVESTFGTATHEPRLNRERKLTQLIHSTVMRGGRVLLPVFALGRAQEI 254
Query: 238 LLILDEYWSLHP-ELH--DIPIYYASSLAKKCMSVYQTYINAMNDRIRRQI--SINNPFV 292
+LILDEYWS H EL +PI+YAS+LAKKCMSV+QTY+N MND IR++ S NPF+
Sbjct: 255 MLILDEYWSQHADELGGGQVPIFYASNLAKKCMSVFQTYVNMMNDDIRKKFRDSQTNPFI 314
Query: 293 FKHISNLKGIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLA 352
FK+IS L+ ++ F+D GP V++ASPGM+QSGLSR+L E WC + KN V+I GY +EGT+A
Sbjct: 315 FKNISYLRNLEDFQDFGPSVMLASPGMLQSGLSRDLLERWCPEDKNLVLITGYSIEGTMA 374
Query: 353 KTILSEPEEVIGMSGQRL--PLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQN 410
K I+ EP+ + ++ + P + V+ ISF+AH D+Q+ EF+ ++ +++LVHGE N
Sbjct: 375 KFIMLEPDTIPSINNPEITIPRRCQVEEISFAAHVDFQENLEFIEKISAPNIILVHGEAN 434
Query: 411 EMSRLKAALTREYEDDPNTSMEL--YNPRNTVSVDLYFKGEKTAKVMGELAVENLKPD 466
M RLK+AL + T E+ +NPRN V VDL F+G K AK +G + E K +
Sbjct: 435 PMGRLKSALLSNFASLKGTDNEVHVFNPRNCVEVDLEFQGVKVAKAVGNIVNEIYKEE 492
Score = 65 (27.9 bits), Expect = 1.2e-144, Sum P(3) = 1.2e-144
Identities = 24/80 (30%), Positives = 40/80 (50%)
Query: 458 LAVENLKPDAALSGIIVK--RNFNYHLLAPSDLPKY-TDLKASKIIQQQSVYYSGSISVL 514
L E D +SGI+V +NF L+ SDL ++ DL + + ++QSV + ++
Sbjct: 523 LVDEEEHKDIVVSGILVSDDKNFELDFLSLSDLREHHPDLSTTILRERQSVRVNCKKELI 582
Query: 515 RSLISHLAGPVETL-DEKRL 533
I + G E L D+ R+
Sbjct: 583 YWHILQMFGEAEVLQDDDRV 602
Score = 60 (26.2 bits), Expect = 1.2e-144, Sum P(3) = 1.2e-144
Identities = 11/35 (31%), Positives = 22/35 (62%)
Query: 533 LRAFACIEITLEKCIVVLEWASNPISDMYADSLIS 567
L+ I++T+ + V+EW + ++D ADS+I+
Sbjct: 625 LQIMGDIKLTIVNTLAVVEWTQDLMNDTVADSIIA 659
>CGD|CAL0005344 [details] [associations]
symbol:orf19.5486 species:5476 "Candida albicans" [GO:0042493
"response to drug" evidence=IMP] [GO:0004521 "endoribonuclease
activity" evidence=IEA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IEA]
[GO:0005829 "cytosol" evidence=IEA] [GO:0034247 "snoRNA splicing"
evidence=IEA] [GO:0031126 "snoRNA 3'-end processing" evidence=IEA]
[GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006369 "termination
of RNA polymerase II transcription" evidence=IEA] [GO:0006378 "mRNA
polyadenylation" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 CGD:CAL0005344 Pfam:PF07521 GO:GO:0005634
GO:GO:0042493 GO:GO:0046872 GO:GO:0006397 GO:GO:0090305
GO:GO:0004519 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
Pfam:PF10996 SMART:SM01027 KO:K14403 InterPro:IPR021718
Pfam:PF11718 SMART:SM01098 EMBL:AACQ01000196 EMBL:AACQ01000195
RefSeq:XP_711478.1 RefSeq:XP_711502.1 ProteinModelPortal:Q59P50
STRING:Q59P50 GeneID:3646887 GeneID:3646911 KEGG:cal:CaO19.12941
KEGG:cal:CaO19.5486 Uniprot:Q59P50
Length = 870
Score = 1281 (456.0 bits), Expect = 1.7e-143, Sum P(3) = 1.7e-143
Identities = 240/476 (50%), Positives = 343/476 (72%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
G EVGRSC ++E+KNK IM+D G+HP LSG + P+ D + ++D+LLISHFH+DH
Sbjct: 106 GGCNEVGRSCHIIEYKNKVIMLDSGMHPALSGHASFPYFDEYDISKVDILLISHFHVDHS 165
Query: 66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM----------LYTESD 115
+LP+ + ++ F+G+ FMTHATKAIYRWL+ D+++V++I + LYT+ D
Sbjct: 166 ASLPYVMQQSNFRGKVFMTHATKAIYRWLMQDFVRVTSIGNSRSEDGGGGEGSNLYTDDD 225
Query: 116 LEKSMDKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRH 175
+ KS D+IETI++H +++GI+F+AY+AGHVLGA M+ IEI G+K+L+TGD+SR+E+RH
Sbjct: 226 IMKSFDRIETIDYHSTMEIDGIRFTAYHAGHVLGACMYFIEIGGLKVLFTGDYSREENRH 285
Query: 176 LMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQ 235
L AAE+PP+KPDILI+EST+GT E R E E + T+ IH + +GGR L+PVFALG AQ
Sbjct: 286 LHAAEVPPLKPDILISESTFGTGTLEPRIELERKLTTHIHATIAKGGRVLLPVFALGNAQ 345
Query: 236 ELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISIN---NPFV 292
ELLLILDEYWS + +L ++ ++YAS+LAKKCM+VY+TY MND+IR + + NPF
Sbjct: 346 ELLLILDEYWSQNEDLQNVNVFYASNLAKKCMAVYETYTGIMNDKIRLSSASSEKSNPFD 405
Query: 293 FKHISNLKGIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLA 352
FK+I ++K + F+D+GP VV+A+PGM+Q+G+SR+L E W D KN VI+ GY VEGT+A
Sbjct: 406 FKYIKSIKDLSKFQDMGPSVVVATPGMLQAGVSRQLLEKWAPDGKNLVILTGYSVEGTMA 465
Query: 353 KTILSEPEEVIGMSG--QRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQN 410
K +L EP + + +P ++ ++ ISF+AH D+QQ SEF+ ++ P+ V+LVHG+
Sbjct: 466 KELLKEPTMIQSATNPDMTIPRRIGIEEISFAAHVDFQQNSEFIEKVSPSKVILVHGDSV 525
Query: 411 EMSRLKAALTREYEDDPNTSMEL--YNPRNTVSVDLYFKGEKTAKVMGELAVENLK 464
M RLK+AL +Y T E+ YNP+N + + FKG K AKV+G LA E L+
Sbjct: 526 PMGRLKSALLSKYASRKGTDQEVKVYNPKNCEELIIGFKGLKIAKVLGSLAEEQLQ 581
Score = 104 (41.7 bits), Expect = 1.7e-143, Sum P(3) = 1.7e-143
Identities = 35/171 (20%), Positives = 79/171 (46%)
Query: 409 QNEMSRLKAALTREYEDDPNTSMELYNPRNTVSVDLYFKGEK-TAKVMGELAVENLKPDA 467
+ ++ LK + E + + EL + GE T + E ++ LK
Sbjct: 577 EEQLQVLKKIIQDEVSAENSKITELTEEKEEADEIKEDNGETDTTQKPNESSINVLKTGQ 636
Query: 468 ALSGIIVKRNFNYHLLAPSDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVET 527
+SG++V ++FN +LL DL ++T L S + + + + IS++ + + G +
Sbjct: 637 VVSGVLVSKDFNLNLLQLQDLHEFTQLSTSIVKSKMHLKINADISLMVWHLEQMFGYINV 696
Query: 528 LDEKRLRAFACI-----EITLEKC-----IVVLEWAS-NPISDMYADSLIS 567
+++ + C+ ++ +++ + +EW + N ++D ADS+I+
Sbjct: 697 INDDD-EEWECVIMDVVDVFIDRSKGPGLFITVEWINDNLMADSLADSVIA 746
Score = 54 (24.1 bits), Expect = 1.7e-143, Sum P(3) = 1.7e-143
Identities = 15/50 (30%), Positives = 23/50 (46%)
Query: 573 ILVEMYGEAAVPKMFKGEKITITVDKKKACIDLVDLSVQCEDSKLKSTVQ 622
+L +G++ K EK I + K A +D L V+C LK V+
Sbjct: 803 LLKAQFGDSL--KELPEEKAIIQIGKTVANVDYKRLEVECSSKVLKDRVE 850
>UNIPROTKB|Q59P50 [details] [associations]
symbol:YSH1 "Endoribonuclease YSH1" species:237561 "Candida
albicans SC5314" [GO:0042493 "response to drug" evidence=IMP]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 CGD:CAL0005344
Pfam:PF07521 GO:GO:0005634 GO:GO:0042493 GO:GO:0046872
GO:GO:0006397 GO:GO:0090305 GO:GO:0004519 eggNOG:COG1236
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
KO:K14403 InterPro:IPR021718 Pfam:PF11718 SMART:SM01098
EMBL:AACQ01000196 EMBL:AACQ01000195 RefSeq:XP_711478.1
RefSeq:XP_711502.1 ProteinModelPortal:Q59P50 STRING:Q59P50
GeneID:3646887 GeneID:3646911 KEGG:cal:CaO19.12941
KEGG:cal:CaO19.5486 Uniprot:Q59P50
Length = 870
Score = 1281 (456.0 bits), Expect = 1.7e-143, Sum P(3) = 1.7e-143
Identities = 240/476 (50%), Positives = 343/476 (72%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
G EVGRSC ++E+KNK IM+D G+HP LSG + P+ D + ++D+LLISHFH+DH
Sbjct: 106 GGCNEVGRSCHIIEYKNKVIMLDSGMHPALSGHASFPYFDEYDISKVDILLISHFHVDHS 165
Query: 66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM----------LYTESD 115
+LP+ + ++ F+G+ FMTHATKAIYRWL+ D+++V++I + LYT+ D
Sbjct: 166 ASLPYVMQQSNFRGKVFMTHATKAIYRWLMQDFVRVTSIGNSRSEDGGGGEGSNLYTDDD 225
Query: 116 LEKSMDKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRH 175
+ KS D+IETI++H +++GI+F+AY+AGHVLGA M+ IEI G+K+L+TGD+SR+E+RH
Sbjct: 226 IMKSFDRIETIDYHSTMEIDGIRFTAYHAGHVLGACMYFIEIGGLKVLFTGDYSREENRH 285
Query: 176 LMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQ 235
L AAE+PP+KPDILI+EST+GT E R E E + T+ IH + +GGR L+PVFALG AQ
Sbjct: 286 LHAAEVPPLKPDILISESTFGTGTLEPRIELERKLTTHIHATIAKGGRVLLPVFALGNAQ 345
Query: 236 ELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISIN---NPFV 292
ELLLILDEYWS + +L ++ ++YAS+LAKKCM+VY+TY MND+IR + + NPF
Sbjct: 346 ELLLILDEYWSQNEDLQNVNVFYASNLAKKCMAVYETYTGIMNDKIRLSSASSEKSNPFD 405
Query: 293 FKHISNLKGIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLA 352
FK+I ++K + F+D+GP VV+A+PGM+Q+G+SR+L E W D KN VI+ GY VEGT+A
Sbjct: 406 FKYIKSIKDLSKFQDMGPSVVVATPGMLQAGVSRQLLEKWAPDGKNLVILTGYSVEGTMA 465
Query: 353 KTILSEPEEVIGMSG--QRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQN 410
K +L EP + + +P ++ ++ ISF+AH D+QQ SEF+ ++ P+ V+LVHG+
Sbjct: 466 KELLKEPTMIQSATNPDMTIPRRIGIEEISFAAHVDFQQNSEFIEKVSPSKVILVHGDSV 525
Query: 411 EMSRLKAALTREYEDDPNTSMEL--YNPRNTVSVDLYFKGEKTAKVMGELAVENLK 464
M RLK+AL +Y T E+ YNP+N + + FKG K AKV+G LA E L+
Sbjct: 526 PMGRLKSALLSKYASRKGTDQEVKVYNPKNCEELIIGFKGLKIAKVLGSLAEEQLQ 581
Score = 104 (41.7 bits), Expect = 1.7e-143, Sum P(3) = 1.7e-143
Identities = 35/171 (20%), Positives = 79/171 (46%)
Query: 409 QNEMSRLKAALTREYEDDPNTSMELYNPRNTVSVDLYFKGEK-TAKVMGELAVENLKPDA 467
+ ++ LK + E + + EL + GE T + E ++ LK
Sbjct: 577 EEQLQVLKKIIQDEVSAENSKITELTEEKEEADEIKEDNGETDTTQKPNESSINVLKTGQ 636
Query: 468 ALSGIIVKRNFNYHLLAPSDLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVET 527
+SG++V ++FN +LL DL ++T L S + + + + IS++ + + G +
Sbjct: 637 VVSGVLVSKDFNLNLLQLQDLHEFTQLSTSIVKSKMHLKINADISLMVWHLEQMFGYINV 696
Query: 528 LDEKRLRAFACI-----EITLEKC-----IVVLEWAS-NPISDMYADSLIS 567
+++ + C+ ++ +++ + +EW + N ++D ADS+I+
Sbjct: 697 INDDD-EEWECVIMDVVDVFIDRSKGPGLFITVEWINDNLMADSLADSVIA 746
Score = 54 (24.1 bits), Expect = 1.7e-143, Sum P(3) = 1.7e-143
Identities = 15/50 (30%), Positives = 23/50 (46%)
Query: 573 ILVEMYGEAAVPKMFKGEKITITVDKKKACIDLVDLSVQCEDSKLKSTVQ 622
+L +G++ K EK I + K A +D L V+C LK V+
Sbjct: 803 LLKAQFGDSL--KELPEEKAIIQIGKTVANVDYKRLEVECSSKVLKDRVE 850
>ASPGD|ASPL0000060573 [details] [associations]
symbol:AN0990 species:162425 "Emericella nidulans"
[GO:0008150 "biological_process" evidence=ND] [GO:0016787
"hydrolase activity" evidence=IEA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IEA]
[GO:0005829 "cytosol" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 GO:GO:0005634 EMBL:BN001308
GO:GO:0046872 GO:GO:0006397 GO:GO:0090305 GO:GO:0004519
eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 HOGENOM:HOG000203394 KO:K14403 InterPro:IPR021718
Pfam:PF11718 SMART:SM01098 EMBL:AACD01000015 RefSeq:XP_658594.1
ProteinModelPortal:Q5BEP0 STRING:Q5BEP0
EnsemblFungi:CADANIAT00001661 GeneID:2876766 KEGG:ani:AN0990.2
OMA:EISFAAH OrthoDB:EOG41ZJK7 Uniprot:Q5BEP0
Length = 884
Score = 849 (303.9 bits), Expect = 6.9e-114, Sum P(4) = 6.9e-114
Identities = 163/283 (57%), Positives = 207/283 (73%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
G G EVGRSC ++++K K++M+D G+HP G ALPF D + +D+LLISHFH+DH
Sbjct: 30 GGGNEVGRSCHIIQYKGKTVMLDAGMHPAKEGFSALPFFDEFDLSTVDILLISHFHVDHS 89
Query: 66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNI--STEQM--LYTESDLEKSMD 121
ALP+ L KT FKGR FMTHATKAIY+WL+ D ++V+N S++Q LYTE D ++
Sbjct: 90 SALPYVLSKTNFKGRVFMTHATKAIYKWLIQDNVRVNNTASSSDQRTTLYTEHDHLSTLP 149
Query: 122 KIETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEI 181
IETI+F+ +N I+ + Y AGHVLGAAMFLI IAG+ IL+TGD+SR+EDRHL+ A +
Sbjct: 150 LIETIDFNTTHTINSIRITPYPAGHVLGAAMFLISIAGLNILFTGDYSREEDRHLIPATV 209
Query: 182 PP-VKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLI 240
P VK D+LITEST+G + R ERE I ++NRGGR L+PVFALGRAQELLLI
Sbjct: 210 PRGVKIDVLITESTFGISSNPPRLEREAALMKSITGVLNRGGRVLMPVFALGRAQELLLI 269
Query: 241 LDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRR 283
L+EYW HPEL IPIYY + A++CM VYQTYI AMND I+R
Sbjct: 270 LEEYWETHPELQKIPIYYIGNTARRCMVVYQTYIGAMNDNIKR 312
Score = 726 (260.6 bits), Expect = 5.4e-101, Sum P(4) = 5.4e-101
Identities = 142/268 (52%), Positives = 191/268 (71%)
Query: 110 LYTESDLEKSMDKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFS 169
LYTE D ++ IETI+F+ +N I+ + Y AGHVLGAAMFLI IAG+ IL+TGD+S
Sbjct: 138 LYTEHDHLSTLPLIETIDFNTTHTINSIRITPYPAGHVLGAAMFLISIAGLNILFTGDYS 197
Query: 170 RQEDRHLMAAEIPP-VKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPV 228
R+EDRHL+ A +P VK D+LITEST+G + R ERE I ++NRGGR L+PV
Sbjct: 198 REEDRHLIPATVPRGVKIDVLITESTFGISSNPPRLEREAALMKSITGVLNRGGRVLMPV 257
Query: 229 FALGRAQELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQI--- 285
FALGRAQELLLIL+EYW HPEL IPIYY + A++CM VYQTYI AMND I+R
Sbjct: 258 FALGRAQELLLILEEYWETHPELQKIPIYYIGNTARRCMVVYQTYIGAMNDNIKRLFRQR 317
Query: 286 ----------SIN-NPFVFKHISNLKGIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCT 334
S++ P+ FK++ +L+ ++ F+D+G CV++ASPGM+Q+G SREL E W
Sbjct: 318 MAEAEASGDKSVSAGPWDFKYVRSLRSLERFDDVGGCVMLASPGMLQTGTSRELLERWAP 377
Query: 335 DAKNGVIIAGYCVEGTLAKTILSEPEEV 362
+ +NGV++ GY VEGT+AK +L+EP+++
Sbjct: 378 NERNGVVMTGYSVEGTMAKQLLNEPDQI 405
Score = 224 (83.9 bits), Expect = 6.9e-114, Sum P(4) = 6.9e-114
Identities = 52/166 (31%), Positives = 91/166 (54%)
Query: 370 LPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNT 429
+P + +VD ISF+AH D + F+ E+ V+LVHGE+++M RLK+ L +
Sbjct: 432 IPRRCTVDEISFAAHVDGVENRNFIEEVSAPVVILVHGEKHQMMRLKSKLL-SLNAEKTV 490
Query: 430 SMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPD------AALSGIIVKRNFNYHLL 483
+++Y P N V + F+ +K AKV+G+LA L D ++G++V+ F+ L+
Sbjct: 491 KVKVYTPANCEEVRIPFRKDKIAKVVGKLAQTTLPTDNEDGDGPLMAGVLVQNGFDLSLM 550
Query: 484 APSDLPKYTDLKASKIIQQQSVYYSG-SISVLRSLISHLAGPVETL 528
AP DL +Y L + I +Q + S S+ +++ + G +E +
Sbjct: 551 APDDLREYAGLATTTITCKQHITLSSASMDLIKWALEGTFGAIEEI 596
Score = 54 (24.1 bits), Expect = 6.9e-114, Sum P(4) = 6.9e-114
Identities = 13/31 (41%), Positives = 19/31 (61%)
Query: 592 ITITVDKKKACIDLVDLSVQCEDSKLKSTVQ 622
I I VDK A + L DL V+C ++ L+ V+
Sbjct: 803 IEIKVDKHVARVWLEDLEVECANAVLRDRVR 833
Score = 41 (19.5 bits), Expect = 6.9e-114, Sum P(4) = 6.9e-114
Identities = 8/23 (34%), Positives = 15/23 (65%)
Query: 548 VVLEWASNPISDMYADSLISECL 570
V L+W N ++D AD++++ L
Sbjct: 650 VELQWEGNMMNDGIADAVMAVLL 672
>GENEDB_PFALCIPARUM|PF14_0364 [details] [associations]
symbol:PF14_0364 "cleavage and polyadenylation
specifity factor protein, putative" species:5833 "Plasmodium
falciparum" [GO:0005847 "mRNA cleavage and polyadenylation
specificity factor complex" evidence=ISS] [GO:0006378 "mRNA
polyadenylation" evidence=ISS] [GO:0006379 "mRNA cleavage"
evidence=ISS] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 GO:GO:0016787 EMBL:AE014187 GO:GO:0005847
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
HOGENOM:HOG000203394 KO:K14403 InterPro:IPR021718 Pfam:PF11718
SMART:SM01098 RefSeq:XP_001348538.1 ProteinModelPortal:Q8IL83
PRIDE:Q8IL83 EnsemblProtists:PF14_0364:mRNA GeneID:811946
KEGG:pfa:PF14_0364 EuPathDB:PlasmoDB:PF3D7_1438500 OMA:CLITHFH
ProtClustDB:CLSZ2457730 Uniprot:Q8IL83
Length = 876
Score = 812 (290.9 bits), Expect = 1.0e-111, Sum P(3) = 1.0e-111
Identities = 156/376 (41%), Positives = 252/376 (67%)
Query: 109 MLYTESDLEKSMDKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDF 168
+LY E+D++K+MD IET+NFH+ + +KF+AY AGHV+GA MFL+EI ++ LYTGD+
Sbjct: 166 VLYDENDIDKTMDLIETLNFHQNFEFPNVKFTAYRAGHVIGACMFLVEINNIRFLYTGDY 225
Query: 169 SRQEDRHLMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPV 228
SR+ DRH+ AEIP + +LI E TYG VH+ R++RE RF +++ ++N G+ L+PV
Sbjct: 226 SREIDRHIPIAEIPNIDVHVLICEGTYGIKVHDDRKKREIRFLNILTSMINNKGKVLLPV 285
Query: 229 FALGRAQELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISIN 288
FALGRAQELLLIL+E+W + L +IPI+Y SS+A K + +Y+T+IN + +++ ++
Sbjct: 286 FALGRAQELLLILEEHWDKNKHLQNIPIFYISSMATKSLCIYETFINLCGEFVKKVVNEG 345
Query: 289 -NPFVFKHISNLKGIDH-----FEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVII 342
NPF FK++ K ++ ++D PCV+MASPGM+Q+G+S+ +F + +D K+GVI+
Sbjct: 346 KNPFNFKYVKYAKSLESISSYLYQDNNPCVIMASPGMLQNGISKNIFNIIASDKKSGVIL 405
Query: 343 AGYCVEGTLAKTILSEPEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHV 402
GY V+GTLA + +EPE V ++ + + K + ISFSAH+D+ QT F+ +L+ +V
Sbjct: 406 TGYTVKGTLADELKTEPEFVT-INDKVVKRKCRFEQISFSAHSDFNQTKTFIEKLKCPNV 464
Query: 403 VLVHGEQNEMSRLKAALTREYEDDPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELA--V 460
VLVHG++NE++RLK L E + + ++ P + +F+ + +G+L+ +
Sbjct: 465 VLVHGDKNELNRLKNKLIEEKQ-----YLSVFTPELLQKLSFHFEQNDSLISLGKLSEHI 519
Query: 461 ENLKPDAALSGIIVKR 476
+ + L G+ +K+
Sbjct: 520 KKINKKIKLEGLKMKK 535
Score = 278 (102.9 bits), Expect = 1.0e-111, Sum P(3) = 1.0e-111
Identities = 49/96 (51%), Positives = 66/96 (68%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
G EVGRSC+++E S+M+DCGIHP G+ LP D + ++DL LI+HFH+DH
Sbjct: 10 GGASEVGRSCVIIECDKTSVMLDCGIHPAFMGIGCLPIYDAYDISKVDLCLITHFHMDHS 69
Query: 66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKV 101
GALP+ + KT FKGR FMT ATK+I L +DY ++
Sbjct: 70 GALPYLINKTRFKGRIFMTEATKSICYLLWNDYARI 105
Score = 98 (39.6 bits), Expect = 2.0e-26, Sum P(3) = 2.0e-26
Identities = 53/247 (21%), Positives = 108/247 (43%)
Query: 338 NGVIIAGYCVEGTLAKTILSEPEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVREL 397
N V++ G E K L E ++ + + L K+S + + + SE ++++
Sbjct: 463 NVVLVHGDKNELNRLKNKLIEEKQYLSVFTPELLQKLSFHFEQNDSLISLGKLSEHIKKI 522
Query: 398 RPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMELYNPRNTVSVDLYFKGEKTA----K 453
+ L E +M + K E+ N ++ N + + K +
Sbjct: 523 NKK-IKL---EGLKMKKEKMIANDEHISVKNEMGDINNDEENLQISDKKKNKVDEHDKHN 578
Query: 454 VMGELAVENLKPDAALSGIIVKRNFNYHLLA-PSDLPKYTDLKASKIIQQQSVYYSGSIS 512
+ ++ E + + GII+ N +L P+D+ +YT+LK + I Q ++ +
Sbjct: 579 INNNISNEKHNVNNQIEGIIITEPQNVPILIYPNDIYEYTNLKTAMIDQTINISFPYRFD 638
Query: 513 VLRSLISHLAGPVETLDEKRLRAFACIEITLEKC--IVVLEWASNPISDMYADSLISECL 570
+L ++I ++ ET + L I+I K ++ + W S+P++D+ ADS I+ +
Sbjct: 639 LLYNVIINVYE--ETHMDDNLIIVKDIKIIYCKDDKMIKINWLSSPLNDLIADS-INFLI 695
Query: 571 IEILVEM 577
+E L M
Sbjct: 696 LEFLETM 702
Score = 47 (21.6 bits), Expect = 1.0e-111, Sum P(3) = 1.0e-111
Identities = 17/53 (32%), Positives = 26/53 (49%)
Query: 554 SNPISDMYADSLISECLIEILVEMYGEAA-VPKMFKGEKITITVDKKKACIDL 605
S PI+D+ D I E +I + E + + K EK T+ + KKK + L
Sbjct: 708 SIPIADVLTDHNIYEMIISYVEENFTNVERISKEILKEK-TLQMIKKKEQLKL 759
>UNIPROTKB|Q8IL83 [details] [associations]
symbol:PF14_0364 "Cleavage and polyadenylation specificity
factor protein, putative" species:36329 "Plasmodium falciparum 3D7"
[GO:0005847 "mRNA cleavage and polyadenylation specificity factor
complex" evidence=ISS] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 GO:GO:0016787 EMBL:AE014187
GO:GO:0005847 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 HOGENOM:HOG000203394 KO:K14403 InterPro:IPR021718
Pfam:PF11718 SMART:SM01098 RefSeq:XP_001348538.1
ProteinModelPortal:Q8IL83 PRIDE:Q8IL83
EnsemblProtists:PF14_0364:mRNA GeneID:811946 KEGG:pfa:PF14_0364
EuPathDB:PlasmoDB:PF3D7_1438500 OMA:CLITHFH ProtClustDB:CLSZ2457730
Uniprot:Q8IL83
Length = 876
Score = 812 (290.9 bits), Expect = 1.0e-111, Sum P(3) = 1.0e-111
Identities = 156/376 (41%), Positives = 252/376 (67%)
Query: 109 MLYTESDLEKSMDKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDF 168
+LY E+D++K+MD IET+NFH+ + +KF+AY AGHV+GA MFL+EI ++ LYTGD+
Sbjct: 166 VLYDENDIDKTMDLIETLNFHQNFEFPNVKFTAYRAGHVIGACMFLVEINNIRFLYTGDY 225
Query: 169 SRQEDRHLMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPV 228
SR+ DRH+ AEIP + +LI E TYG VH+ R++RE RF +++ ++N G+ L+PV
Sbjct: 226 SREIDRHIPIAEIPNIDVHVLICEGTYGIKVHDDRKKREIRFLNILTSMINNKGKVLLPV 285
Query: 229 FALGRAQELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISIN 288
FALGRAQELLLIL+E+W + L +IPI+Y SS+A K + +Y+T+IN + +++ ++
Sbjct: 286 FALGRAQELLLILEEHWDKNKHLQNIPIFYISSMATKSLCIYETFINLCGEFVKKVVNEG 345
Query: 289 -NPFVFKHISNLKGIDH-----FEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVII 342
NPF FK++ K ++ ++D PCV+MASPGM+Q+G+S+ +F + +D K+GVI+
Sbjct: 346 KNPFNFKYVKYAKSLESISSYLYQDNNPCVIMASPGMLQNGISKNIFNIIASDKKSGVIL 405
Query: 343 AGYCVEGTLAKTILSEPEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHV 402
GY V+GTLA + +EPE V ++ + + K + ISFSAH+D+ QT F+ +L+ +V
Sbjct: 406 TGYTVKGTLADELKTEPEFVT-INDKVVKRKCRFEQISFSAHSDFNQTKTFIEKLKCPNV 464
Query: 403 VLVHGEQNEMSRLKAALTREYEDDPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELA--V 460
VLVHG++NE++RLK L E + + ++ P + +F+ + +G+L+ +
Sbjct: 465 VLVHGDKNELNRLKNKLIEEKQ-----YLSVFTPELLQKLSFHFEQNDSLISLGKLSEHI 519
Query: 461 ENLKPDAALSGIIVKR 476
+ + L G+ +K+
Sbjct: 520 KKINKKIKLEGLKMKK 535
Score = 278 (102.9 bits), Expect = 1.0e-111, Sum P(3) = 1.0e-111
Identities = 49/96 (51%), Positives = 66/96 (68%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
G EVGRSC+++E S+M+DCGIHP G+ LP D + ++DL LI+HFH+DH
Sbjct: 10 GGASEVGRSCVIIECDKTSVMLDCGIHPAFMGIGCLPIYDAYDISKVDLCLITHFHMDHS 69
Query: 66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKV 101
GALP+ + KT FKGR FMT ATK+I L +DY ++
Sbjct: 70 GALPYLINKTRFKGRIFMTEATKSICYLLWNDYARI 105
Score = 98 (39.6 bits), Expect = 2.0e-26, Sum P(3) = 2.0e-26
Identities = 53/247 (21%), Positives = 108/247 (43%)
Query: 338 NGVIIAGYCVEGTLAKTILSEPEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVREL 397
N V++ G E K L E ++ + + L K+S + + + SE ++++
Sbjct: 463 NVVLVHGDKNELNRLKNKLIEEKQYLSVFTPELLQKLSFHFEQNDSLISLGKLSEHIKKI 522
Query: 398 RPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMELYNPRNTVSVDLYFKGEKTA----K 453
+ L E +M + K E+ N ++ N + + K +
Sbjct: 523 NKK-IKL---EGLKMKKEKMIANDEHISVKNEMGDINNDEENLQISDKKKNKVDEHDKHN 578
Query: 454 VMGELAVENLKPDAALSGIIVKRNFNYHLLA-PSDLPKYTDLKASKIIQQQSVYYSGSIS 512
+ ++ E + + GII+ N +L P+D+ +YT+LK + I Q ++ +
Sbjct: 579 INNNISNEKHNVNNQIEGIIITEPQNVPILIYPNDIYEYTNLKTAMIDQTINISFPYRFD 638
Query: 513 VLRSLISHLAGPVETLDEKRLRAFACIEITLEKC--IVVLEWASNPISDMYADSLISECL 570
+L ++I ++ ET + L I+I K ++ + W S+P++D+ ADS I+ +
Sbjct: 639 LLYNVIINVYE--ETHMDDNLIIVKDIKIIYCKDDKMIKINWLSSPLNDLIADS-INFLI 695
Query: 571 IEILVEM 577
+E L M
Sbjct: 696 LEFLETM 702
Score = 47 (21.6 bits), Expect = 1.0e-111, Sum P(3) = 1.0e-111
Identities = 17/53 (32%), Positives = 26/53 (49%)
Query: 554 SNPISDMYADSLISECLIEILVEMYGEAA-VPKMFKGEKITITVDKKKACIDL 605
S PI+D+ D I E +I + E + + K EK T+ + KKK + L
Sbjct: 708 SIPIADVLTDHNIYEMIISYVEENFTNVERISKEILKEK-TLQMIKKKEQLKL 759
>UNIPROTKB|F1NV30 [details] [associations]
symbol:CPSF3L "Integrator complex subunit 11" species:9031
"Gallus gallus" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 OMA:MAVEYMS GeneTree:ENSGT00700000104485
IPI:IPI00571913 EMBL:AADN02040858 Ensembl:ENSGALT00000002586
Uniprot:F1NV30
Length = 600
Score = 886 (316.9 bits), Expect = 9.6e-89, P = 9.6e-89
Identities = 201/501 (40%), Positives = 295/501 (58%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLISHF 60
GAGQ+VGRSCI++ K++M+DCG+H G + P F + ++ D +D ++ISHF
Sbjct: 10 GAGQDVGRSCILVSIAGKNVMLDCGMHMGYNDDRRFPDFSYITQNGRLTDFLDCVIISHF 69
Query: 61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
HLDHCGALP+F G+ G +MTH TKAI LL DY K++ + E +T ++
Sbjct: 70 HLDHCGALPYFSEMVGYDGPIYMTHPTKAICPILLEDYRKITVDKKGETNFFTSQMIKDC 129
Query: 120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
M K+ ++ H+ V+ ++ AY AGHVLGAAMF I++ ++YTGD++ DRHL A
Sbjct: 130 MKKVVAVHLHQTVQVDEELEIKAYYAGHVLGAAMFQIKVGCESVVYTGDYNMTPDRHLGA 189
Query: 179 AEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELL 238
A I +PD+LITESTY T + + + RE F +H+ V RGG+ LIPVFALGRAQEL
Sbjct: 190 AWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELC 249
Query: 239 LILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISN 298
++L+ +W L PIY+++ L +K Y+ +I N +IR+ N F FKHI
Sbjct: 250 ILLETFWE-RMNLK-APIYFSTGLTEKANHYYKLFITWTNQKIRKTFVQRNMFEFKHI-- 305
Query: 299 LKGIDH-FEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTIL 356
K D F D GP VV A+PGM+ +G S ++F W + KN VI+ GYCV+GT+ IL
Sbjct: 306 -KAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVGHKIL 364
Query: 357 SEPEEVIGMSGQR-LPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRL 415
S + + M G++ L +KM V+Y+SFSAH D + + +R+ P +V+LVHGE +M L
Sbjct: 365 SGQRK-LEMEGRQILEVKMQVEYMSFSAHADAKGIMQLIRQAEPRNVLLVHGEAKKMEFL 423
Query: 416 KAALTREYEDD---P---NTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAAL 469
K + +E+ + P T+ NP V + L +TA +G L + KP
Sbjct: 424 KQKIEQEFHVNCYMPANGETTSIFTNPSIPVDISLGLLKRETA--IG-LLPDAKKPKLMH 480
Query: 470 SGIIVKRNFNYHLLAPSDLPK 490
+I+K N ++ L++P K
Sbjct: 481 GTLIMKDN-SFRLVSPEQALK 500
>UNIPROTKB|Q5ZIH0 [details] [associations]
symbol:CPSF3L "Integrator complex subunit 11" species:9031
"Gallus gallus" [GO:0016787 "hydrolase activity" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 GO:GO:0016787 eggNOG:COG1236
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
HOGENOM:HOG000231294 KO:K13148 GO:GO:0032039 GO:GO:0016180
HOVERGEN:HBG080215 OrthoDB:EOG4GXFMB EMBL:AJ720814 IPI:IPI00571913
RefSeq:NP_001012854.1 UniGene:Gga.13403 ProteinModelPortal:Q5ZIH0
STRING:Q5ZIH0 GeneID:419418 KEGG:gga:419418 CTD:54973
InParanoid:Q5ZIH0 NextBio:20822477 Uniprot:Q5ZIH0
Length = 600
Score = 886 (316.9 bits), Expect = 9.6e-89, P = 9.6e-89
Identities = 201/501 (40%), Positives = 295/501 (58%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLISHF 60
GAGQ+VGRSCI++ K++M+DCG+H G + P F + ++ D +D ++ISHF
Sbjct: 10 GAGQDVGRSCILVSIAGKNVMLDCGMHMGYNDDRRFPDFSYITQNGRLTDFLDCVIISHF 69
Query: 61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
HLDHCGALP+F G+ G +MTH TKAI LL DY K++ + E +T ++
Sbjct: 70 HLDHCGALPYFSEMVGYDGPIYMTHPTKAICPILLEDYRKITVDKKGETNFFTSQMIKDC 129
Query: 120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
M K+ ++ H+ V+ ++ AY AGHVLGAAMF I++ ++YTGD++ DRHL A
Sbjct: 130 MKKVVAVHLHQTVQVDEELEIKAYYAGHVLGAAMFQIKVGCESVVYTGDYNMTPDRHLGA 189
Query: 179 AEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELL 238
A I +PD+LITESTY T + + + RE F +H+ V RGG+ LIPVFALGRAQEL
Sbjct: 190 AWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELC 249
Query: 239 LILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISN 298
++L+ +W L PIY+++ L +K Y+ +I N +IR+ N F FKHI
Sbjct: 250 ILLETFWE-RMNLK-APIYFSTGLTEKANHYYKLFITWTNQKIRKTFVQRNMFEFKHI-- 305
Query: 299 LKGIDH-FEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTIL 356
K D F D GP VV A+PGM+ +G S ++F W + KN VI+ GYCV+GT+ IL
Sbjct: 306 -KAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVGHKIL 364
Query: 357 SEPEEVIGMSGQR-LPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRL 415
S + + M G++ L +KM V+Y+SFSAH D + + +R+ P +V+LVHGE +M L
Sbjct: 365 SGQRK-LEMEGRQILEVKMQVEYMSFSAHADAKGIMQLIRQAEPRNVLLVHGEAKKMEFL 423
Query: 416 KAALTREYEDD---P---NTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAAL 469
K + +E+ + P T+ NP V + L +TA +G L + KP
Sbjct: 424 KQKIEQEFHVNCYMPANGETTTIFTNPSIPVDISLGLLKRETA--IG-LLPDAKKPKLMH 480
Query: 470 SGIIVKRNFNYHLLAPSDLPK 490
+I+K N ++ L++P K
Sbjct: 481 GTLIMKDN-SFRLVSPEQALK 500
>MGI|MGI:1919207 [details] [associations]
symbol:Cpsf3l "cleavage and polyadenylation specific factor
3-like" species:10090 "Mus musculus" [GO:0003674
"molecular_function" evidence=ND] [GO:0005634 "nucleus"
evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0008150
"biological_process" evidence=ND] [GO:0016787 "hydrolase activity"
evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 MGI:MGI:1919207 GO:GO:0005737 GO:GO:0016787
eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 HOGENOM:HOG000231294 KO:K13148 OMA:MAVEYMS
GeneTree:ENSGT00700000104485 GO:GO:0032039 GO:GO:0016180
HOVERGEN:HBG080215 OrthoDB:EOG4GXFMB CTD:54973 EMBL:AK010425
EMBL:AK090206 EMBL:AK150436 EMBL:AK152740 EMBL:AK167607
EMBL:AK172533 EMBL:BC008240 EMBL:BC011155 IPI:IPI00467084
RefSeq:NP_082296.1 UniGene:Mm.259270 UniGene:Mm.475640
ProteinModelPortal:Q9CWS4 SMR:Q9CWS4 STRING:Q9CWS4
PhosphoSite:Q9CWS4 PaxDb:Q9CWS4 PRIDE:Q9CWS4
Ensembl:ENSMUST00000030901 GeneID:71957 KEGG:mmu:71957
InParanoid:Q9CWS4 NextBio:335052 Bgee:Q9CWS4 Genevestigator:Q9CWS4
GermOnline:ENSMUSG00000029034 Uniprot:Q9CWS4
Length = 600
Score = 875 (313.1 bits), Expect = 1.4e-87, P = 1.4e-87
Identities = 196/493 (39%), Positives = 289/493 (58%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLISHF 60
GAGQ+VGRSCI++ K++M+DCG+H G + P F + +S D +D ++ISHF
Sbjct: 10 GAGQDVGRSCILVSISGKNVMLDCGMHMGYNDDRRFPDFSYITQSGRLTDFLDCVIISHF 69
Query: 61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
HLDHCGALP+F G+ G +MTH T+AI LL DY K++ + E +T ++
Sbjct: 70 HLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDC 129
Query: 120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
M K+ ++ H+ V+ ++ AY AGHVLGAAMF I++ ++YTGD++ DRHL A
Sbjct: 130 MKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGA 189
Query: 179 AEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELL 238
A I +P++LITESTY T + + + RE F +H+ V RGG+ LIPVFALGRAQEL
Sbjct: 190 AWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELC 249
Query: 239 LILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISN 298
++L+ +W L +PIY+++ L +K Y+ +I N +IR+ N F FKHI
Sbjct: 250 ILLETFWE-RMNLK-VPIYFSTGLTEKANHYYKLFITWTNQKIRKTFVQRNMFEFKHI-- 305
Query: 299 LKGIDH-FEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTIL 356
K D F D GP VV A+PGM+ +G S ++F W + KN VI+ GYCV+GT+ IL
Sbjct: 306 -KAFDRTFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVGHKIL 364
Query: 357 SEPEEVIGMSG-QRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRL 415
S + + M G Q L +KM V+Y+SFSAH D + + V + P V+LVHGE +M L
Sbjct: 365 SGQRK-LEMEGRQMLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKMEFL 423
Query: 416 KAALTREYEDD---PNTSMELYNPRN-TVSVDLYFKGEKTAKVMGELAVENLKPDAALSG 471
+ + +E+ P + P + ++ V + K V G L E KP L G
Sbjct: 424 RQKIEQEFRVSCYMPANGETVTLPTSPSIPVGISLGLLKREMVQG-LLPEAKKP-RLLHG 481
Query: 472 IIVKRNFNYHLLA 484
++ ++ N+ L++
Sbjct: 482 TLIMKDSNFRLVS 494
>UNIPROTKB|E1B7Q9 [details] [associations]
symbol:CPSF3L "Integrator complex subunit 11" species:9913
"Bos taurus" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 OMA:MAVEYMS GeneTree:ENSGT00700000104485
EMBL:DAAA02043243 IPI:IPI00971575 Ensembl:ENSBTAT00000010020
Uniprot:E1B7Q9
Length = 598
Score = 874 (312.7 bits), Expect = 1.8e-87, P = 1.8e-87
Identities = 198/497 (39%), Positives = 289/497 (58%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLISHF 60
GAGQ+VGRSCI++ K++M+DCG+H G S P F + S D +D ++ISHF
Sbjct: 10 GAGQDVGRSCILVSIAGKNVMLDCGMHMGFSDDRRFPDFSYITRSGRLTDFLDCVIISHF 69
Query: 61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSM 120
HLDHCGALP+F G+ G +MT T+AI LL DY K++ E +T ++ M
Sbjct: 70 HLDHCGALPYFSEMVGYDGPIYMTQPTQAICPILLEDYRKIAVDKGEANFFTSQMIKDCM 129
Query: 121 DKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAA 179
K+ ++ H+ V+ ++ AY AGHVLGAAMF I++ ++YTGD++ DRHL AA
Sbjct: 130 KKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGAA 189
Query: 180 EIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLL 239
I +P +LITESTY T + + + RE F +H+ V RGG+ LIPVFALGRAQEL +
Sbjct: 190 WIDKCRPSLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELCI 249
Query: 240 ILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNL 299
+L+ +W +L PIY+++ L +K Y+ +I N +IR+ N F FKHI
Sbjct: 250 LLETFWE-RMDLK-APIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMFEFKHI--- 304
Query: 300 KGIDH-FEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILS 357
K D F D GP VV A+PGM+ +G S ++F W + KN VI+ GYCV+GT+ ILS
Sbjct: 305 KAFDRAFADSPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVGHKILS 364
Query: 358 EPEEVIGMSGQR-LPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLK 416
+ + M G++ L +KM V+Y+SFSAH D + + V + P +V+LVHGE +M LK
Sbjct: 365 GQRK-LEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPENVLLVHGEAKKMEFLK 423
Query: 417 AALTREYEDDPNTSMELYNPRNTVSVDLYFK-----GEKTAKVMGELAVENLKPDAA--- 468
+ +E+ + Y P N +V L G + E+A + L PDA
Sbjct: 424 QKIEQEFR------VNCYMPANGETVTLPTSPSIPVGISLGLLKREMA-QGLLPDAKKPR 476
Query: 469 -LSGIIVKRNFNYHLLA 484
L G ++ ++ N+ L++
Sbjct: 477 LLHGTLIMKDSNFRLVS 493
>RGD|1306841 [details] [associations]
symbol:Cpsf3l "cleavage and polyadenylation specific factor
3-like" species:10116 "Rattus norvegicus" [GO:0005634 "nucleus"
evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0016787
"hydrolase activity" evidence=IEA] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 RGD:1306841 GO:GO:0005634 GO:GO:0005737
GO:GO:0016787 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
Pfam:PF10996 SMART:SM01027 HOGENOM:HOG000231294 KO:K13148
OMA:MAVEYMS GeneTree:ENSGT00700000104485 HOVERGEN:HBG080215
OrthoDB:EOG4GXFMB CTD:54973 EMBL:BC105303 IPI:IPI00365477
RefSeq:NP_001029064.1 UniGene:Rn.98615 ProteinModelPortal:Q3MHC2
STRING:Q3MHC2 Ensembl:ENSRNOT00000026725 GeneID:298688
KEGG:rno:298688 InParanoid:Q3MHC2 NextBio:644186
Genevestigator:Q3MHC2 GermOnline:ENSRNOG00000019712 Uniprot:Q3MHC2
Length = 600
Score = 874 (312.7 bits), Expect = 1.8e-87, P = 1.8e-87
Identities = 196/493 (39%), Positives = 289/493 (58%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLISHF 60
GAGQ+VGRSCI++ K++M+DCG+H G + P F + +S D +D ++ISHF
Sbjct: 10 GAGQDVGRSCILVSISGKNVMLDCGMHMGYNDDRRFPDFSYITQSGRLTDFLDCVIISHF 69
Query: 61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
HLDHCGALP+F G+ G +MTH T+AI LL DY K++ + E +T ++
Sbjct: 70 HLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDC 129
Query: 120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
M K+ ++ H+ V+ ++ AY AGHVLGAAMF I++ ++YTGD++ DRHL A
Sbjct: 130 MKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGA 189
Query: 179 AEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELL 238
A I +P++LITESTY T + + + RE F +H+ V RGG+ LIPVFALGRAQEL
Sbjct: 190 AWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELC 249
Query: 239 LILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISN 298
++L+ +W L +PIY+++ L +K Y+ +I N +IR+ N F FKHI
Sbjct: 250 ILLETFWE-RMNLK-VPIYFSTGLTEKANHYYKLFITWTNQKIRKTFVQRNMFEFKHI-- 305
Query: 299 LKGIDH-FEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTIL 356
K D F D GP VV A+PGM+ +G S ++F W + KN VI+ GYCV+GT+ IL
Sbjct: 306 -KAFDRTFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVGHKIL 364
Query: 357 SEPEEVIGMSG-QRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRL 415
S + + M G Q L +KM V+Y+SFSAH D + + V + P V+LVHGE +M L
Sbjct: 365 SGQRK-LEMEGRQMLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKMEFL 423
Query: 416 KAALTREYEDD---PNTSMELYNPRN-TVSVDLYFKGEKTAKVMGELAVENLKPDAALSG 471
+ + +E+ P + P + ++ V + K V G L E KP L G
Sbjct: 424 RQKIEQEFRVSCYMPANGETVTLPTSPSIPVGISLGLLKREMVQG-LLPEAKKP-RLLHG 481
Query: 472 IIVKRNFNYHLLA 484
++ ++ N+ L++
Sbjct: 482 TLIMKDNNFRLVS 494
>UNIPROTKB|E2QY53 [details] [associations]
symbol:CPSF3L "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 KO:K13148 OMA:MAVEYMS GeneTree:ENSGT00700000104485
EMBL:AAEX03003844 RefSeq:XP_003639102.1 Ensembl:ENSCAFT00000030626
GeneID:100855777 KEGG:cfa:100855777 Uniprot:E2QY53
Length = 600
Score = 866 (309.9 bits), Expect = 1.3e-86, P = 1.3e-86
Identities = 195/498 (39%), Positives = 288/498 (57%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVE-----SDQIDLLLISHF 60
GAGQ+VGRSCI++ K++M+DCG+H G + P + +D +D ++ISHF
Sbjct: 10 GAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITRNGRLTDFLDCVIISHF 69
Query: 61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
HLDHCGALP+F G+ G +MTH T+AI LL DY K++ + E +T ++
Sbjct: 70 HLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDC 129
Query: 120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
M K+ ++ H+ V+ ++ AY AGHVLGAAMF I++ ++YTGD++ DRHL A
Sbjct: 130 MKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGA 189
Query: 179 AEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELL 238
A I +P++LITESTY T + + + RE F +H+ V RGG+ LIPVFALGRAQEL
Sbjct: 190 AWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHEAVERGGKVLIPVFALGRAQELC 249
Query: 239 LILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISN 298
++L+ +W L PIY+++ L +K Y+ +I N +IR+ N F FKHI
Sbjct: 250 ILLETFWE-RMNLK-APIYFSTGLTEKANHYYKLFITWTNQKIRKTFVQRNMFEFKHI-- 305
Query: 299 LKGIDH-FEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTIL 356
K D F D GP VV A+PGM+ +G S ++F W + KN VI+ GYCV+GT+ IL
Sbjct: 306 -KAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVGHKIL 364
Query: 357 SEPEEVIGMSGQR-LPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRL 415
S + + M G++ L +KM V+Y+SFSAH D + + V + P V+LVHGE +M L
Sbjct: 365 SGQRK-LEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKMEFL 423
Query: 416 KAALTREYEDDPNTSMELYNPRNTVSVDLYFK-----GEKTAKVMGELAVENLKPDAA-- 468
K + +E+ + Y P N +V L G + E+A + L PD
Sbjct: 424 KQKIEQEFR------VNCYMPANGETVTLPTSPSIPVGISLGLLKREMA-QGLLPDVKKP 476
Query: 469 --LSGIIVKRNFNYHLLA 484
L G ++ ++ N+ L++
Sbjct: 477 RLLHGTLIMKDSNFRLVS 494
>UNIPROTKB|G3V1S5 [details] [associations]
symbol:CPSF3L "Integrator complex subunit 11" species:9606
"Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0016787 EMBL:AL139287 EMBL:CH471183 InterPro:IPR022712
InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 KO:K13148 OMA:MAVEYMS
CTD:54973 UniGene:Hs.6449 GeneID:54973 KEGG:hsa:54973
HGNC:HGNC:26052 ChiTaRS:CPSF3L GenomeRNAi:54973
RefSeq:NP_001243385.1 ProteinModelPortal:G3V1S5 SMR:G3V1S5
Ensembl:ENST00000540437 ArrayExpress:G3V1S5 Bgee:G3V1S5
Uniprot:G3V1S5
Length = 606
Score = 865 (309.6 bits), Expect = 1.6e-86, P = 1.6e-86
Identities = 196/498 (39%), Positives = 291/498 (58%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLISHF 60
GAGQ+VGRSCI++ K++M+DCG+H G + P F + ++ D +D ++ISHF
Sbjct: 16 GAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHF 75
Query: 61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
HLDHCGALP+F G+ G +MTH T+AI LL DY K++ + E +T ++
Sbjct: 76 HLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDC 135
Query: 120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
M K+ ++ H+ V+ ++ AY AGHVLGAAMF I++ ++YTGD++ DRHL A
Sbjct: 136 MKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGA 195
Query: 179 AEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELL 238
A I +P++LITESTY T + + + RE F +H+ V RGG+ LIPVFALGRAQEL
Sbjct: 196 AWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELC 255
Query: 239 LILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISN 298
++L+ +W L +PIY+++ L +K Y+ +I N +IR+ N F FKHI
Sbjct: 256 ILLETFWE-RMNLK-VPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMFEFKHI-- 311
Query: 299 LKGIDH-FEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTIL 356
K D F D GP VV A+PGM+ +G S ++F W + KN VI+ GYCV+GT+ IL
Sbjct: 312 -KAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVGHKIL 370
Query: 357 SEPEEVIGMSGQR-LPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRL 415
S + + M G++ L +KM V+Y+SFSAH D + + V + P V+LVHGE +M L
Sbjct: 371 SGQRK-LEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKMEFL 429
Query: 416 KAALTREYEDDPNTSMELYNPRNTVSVDLYFK-----GEKTAKVMGELAVENLKPDAA-- 468
K + +E + Y P N +V L G + E+A + L P+A
Sbjct: 430 KQKIEQELR------VNCYMPANGETVTLPTSPSIPVGISLGLLKREMA-QGLLPEAKKP 482
Query: 469 --LSGIIVKRNFNYHLLA 484
L G ++ ++ N+ L++
Sbjct: 483 RLLHGTLIMKDSNFRLVS 500
>UNIPROTKB|Q5TA45 [details] [associations]
symbol:CPSF3L "Integrator complex subunit 11" species:9606
"Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] [GO:0005737 "cytoplasm"
evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 GO:GO:0005634 GO:GO:0005737 GO:GO:0016787
EMBL:AL139287 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
Pfam:PF10996 SMART:SM01027 KO:K13148 HOVERGEN:HBG080215
OrthoDB:EOG4GXFMB CTD:54973 EMBL:AL136813 EMBL:AK000549
EMBL:AK021939 EMBL:AK023356 EMBL:AK297350 EMBL:CR533557
EMBL:BC000675 EMBL:BC007978 EMBL:BC013904 EMBL:BK005728
EMBL:BK005673 IPI:IPI00063404 IPI:IPI00306882 IPI:IPI00514973
RefSeq:NP_001243392.1 RefSeq:NP_060341.2 UniGene:Hs.6449
ProteinModelPortal:Q5TA45 SMR:Q5TA45 IntAct:Q5TA45
MINT:MINT-1482228 STRING:Q5TA45 PhosphoSite:Q5TA45 DMDM:118572557
PaxDb:Q5TA45 PRIDE:Q5TA45 DNASU:54973 Ensembl:ENST00000419704
Ensembl:ENST00000435064 Ensembl:ENST00000450926
Ensembl:ENST00000545578 GeneID:54973 KEGG:hsa:54973 UCSC:uc001aee.1
UCSC:uc001aeh.1 UCSC:uc009vjz.1 GeneCards:GC01M001236
HGNC:HGNC:26052 HPA:HPA028379 HPA:HPA029025 MIM:611354
neXtProt:NX_Q5TA45 PharmGKB:PA142672080 InParanoid:Q5TA45
PhylomeDB:Q5TA45 ChiTaRS:CPSF3L GenomeRNAi:54973 NextBio:58222
ArrayExpress:Q5TA45 Bgee:Q5TA45 Genevestigator:Q5TA45
GermOnline:ENSG00000127054 Uniprot:Q5TA45
Length = 600
Score = 865 (309.6 bits), Expect = 1.6e-86, P = 1.6e-86
Identities = 196/498 (39%), Positives = 291/498 (58%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLISHF 60
GAGQ+VGRSCI++ K++M+DCG+H G + P F + ++ D +D ++ISHF
Sbjct: 10 GAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHF 69
Query: 61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
HLDHCGALP+F G+ G +MTH T+AI LL DY K++ + E +T ++
Sbjct: 70 HLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDC 129
Query: 120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
M K+ ++ H+ V+ ++ AY AGHVLGAAMF I++ ++YTGD++ DRHL A
Sbjct: 130 MKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGA 189
Query: 179 AEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELL 238
A I +P++LITESTY T + + + RE F +H+ V RGG+ LIPVFALGRAQEL
Sbjct: 190 AWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELC 249
Query: 239 LILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISN 298
++L+ +W L +PIY+++ L +K Y+ +I N +IR+ N F FKHI
Sbjct: 250 ILLETFWE-RMNLK-VPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMFEFKHI-- 305
Query: 299 LKGIDH-FEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTIL 356
K D F D GP VV A+PGM+ +G S ++F W + KN VI+ GYCV+GT+ IL
Sbjct: 306 -KAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVGHKIL 364
Query: 357 SEPEEVIGMSGQR-LPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRL 415
S + + M G++ L +KM V+Y+SFSAH D + + V + P V+LVHGE +M L
Sbjct: 365 SGQRK-LEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKMEFL 423
Query: 416 KAALTREYEDDPNTSMELYNPRNTVSVDLYFK-----GEKTAKVMGELAVENLKPDAA-- 468
K + +E + Y P N +V L G + E+A + L P+A
Sbjct: 424 KQKIEQELR------VNCYMPANGETVTLPTSPSIPVGISLGLLKREMA-QGLLPEAKKP 476
Query: 469 --LSGIIVKRNFNYHLLA 484
L G ++ ++ N+ L++
Sbjct: 477 RLLHGTLIMKDSNFRLVS 494
>UNIPROTKB|Q2YDM2 [details] [associations]
symbol:CPSF3L "Integrator complex subunit 11" species:9913
"Bos taurus" [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005634
"nucleus" evidence=IEA] [GO:0016787 "hydrolase activity"
evidence=IEA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 GO:GO:0005634 GO:GO:0005737 GO:GO:0016787
eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 HOGENOM:HOG000231294 EMBL:BC110155 IPI:IPI00725178
UniGene:Bt.4894 ProteinModelPortal:Q2YDM2 STRING:Q2YDM2
PRIDE:Q2YDM2 HOVERGEN:HBG080215 InParanoid:Q2YDM2 OrthoDB:EOG4GXFMB
Uniprot:Q2YDM2
Length = 599
Score = 864 (309.2 bits), Expect = 2.0e-86, P = 2.0e-86
Identities = 198/498 (39%), Positives = 289/498 (58%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLISHF 60
GAGQ+VGRSCI++ K++M+DCG+H G S P F S D +D ++ISHF
Sbjct: 10 GAGQDVGRSCILVSIAGKNVMLDCGMHMGFSDDRRFPDFSYNTRSGRLTDFLDCVIISHF 69
Query: 61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
HLDHCGALP+F G+ G +MT T+AI LL DY K++ + E +T ++
Sbjct: 70 HLDHCGALPYFSEMVGYDGPIYMTQPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDC 129
Query: 120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
M K+ ++ H+ V+ ++ AY AGHVLGAAMF I++ ++YTGD++ DRHL A
Sbjct: 130 MKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGA 189
Query: 179 AEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELL 238
A I +P +LITESTY T + + + RE F +H+ V RGG+ LIPVFALGRAQEL
Sbjct: 190 AWIDKCRPSLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELC 249
Query: 239 LILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISN 298
++L+ +W +L PIY+++ L +K Y+ +I N +IR+ N F FKHI
Sbjct: 250 ILLETFWE-RMDLK-APIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMFEFKHI-- 305
Query: 299 LKGIDH-FEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTIL 356
K D F D GP VV A+PGM+ +G S ++F W + KN VI+ GYCV+GT+ IL
Sbjct: 306 -KAFDRAFADSPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVGHKIL 364
Query: 357 SEPEEVIGMSGQR-LPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRL 415
S + + M G++ L +KM V+Y+SFSAH D + + V + P +V+LVHGE +M L
Sbjct: 365 SGQRK-LEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPENVLLVHGEAKKMEFL 423
Query: 416 KAALTREYEDDPNTSMELYNPRNTVSVDLYFK-----GEKTAKVMGELAVENLKPDAA-- 468
K + +E+ + Y P N +V L G + E+A + L PDA
Sbjct: 424 KQKIEQEFR------VNCYMPANGETVTLPTSPSIPVGISLGLLKREMA-QGLLPDAKKP 476
Query: 469 --LSGIIVKRNFNYHLLA 484
L G ++ ++ N+ L++
Sbjct: 477 RLLHGTLIMKDSNFRLVS 494
>WB|WBGene00008642 [details] [associations]
symbol:F10B5.8 species:6239 "Caenorhabditis elegans"
[GO:0016787 "hydrolase activity" evidence=IEA] [GO:0009792 "embryo
development ending in birth or egg hatching" evidence=IMP]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0009792 EMBL:Z48334 GO:GO:0016787 eggNOG:COG1236
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
HOGENOM:HOG000231294 KO:K13148 OMA:MAVEYMS
GeneTree:ENSGT00700000104485 PIR:T20694 RefSeq:NP_495706.2
ProteinModelPortal:Q9U3K2 SMR:Q9U3K2 STRING:Q9U3K2 PaxDb:Q9U3K2
EnsemblMetazoa:F10B5.8 GeneID:174310 KEGG:cel:CELE_F10B5.8
UCSC:F10B5.8 CTD:174310 WormBase:F10B5.8 InParanoid:Q9U3K2
NextBio:883468 Uniprot:Q9U3K2
Length = 608
Score = 845 (302.5 bits), Expect = 2.1e-86, Sum P(2) = 2.1e-86
Identities = 173/428 (40%), Positives = 261/428 (60%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVE-----SDQIDLLLISHF 60
GAGQ+VGRSCI++ K+IM+DCG+H G P + +D +D ++ISHF
Sbjct: 14 GAGQDVGRSCILITIGGKNIMVDCGMHMGYQDDRRFPDFSYIGGGGRLTDYLDCVIISHF 73
Query: 61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
HLDHCG+LP G+ G +MT+ TKAI LL DY KV +I E +T D++
Sbjct: 74 HLDHCGSLPHMSEIVGYDGPIYMTYPTKAICPVLLEDYRKVQCDIKGETNFFTSDDIKNC 133
Query: 120 MDKIETINFHEEKDV-NGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
M K+ HE V N + A+ AGHVLGAAMF I + +LYTGD++ DRHL A
Sbjct: 134 MKKVVGCALHEIIHVDNELSIRAFYAGHVLGAAMFEIRLGDHSVLYTGDYNMTPDRHLGA 193
Query: 179 AEI-PPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQEL 237
A + P V+P +LI+ESTY T + + + RE F +H+ V +GG+ +IPVFALGRAQEL
Sbjct: 194 ARVLPGVRPTVLISESTYATTIRDSKRARERDFLRKVHECVMKGGKVIIPVFALGRAQEL 253
Query: 238 LLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHIS 297
++L+ YW L+ +PIY++ LA++ Y+ +I+ N+ I++ N F FKHI
Sbjct: 254 CILLESYWE-RMALN-VPIYFSQGLAERANQYYRLFISWTNENIKKTFVERNMFEFKHIK 311
Query: 298 NL-KGIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTIL 356
+ KG + + GP V+ ++PGM+ G S ++F+ WC+D N +I+ GYCV GT+ ++
Sbjct: 312 PMEKGCE--DQPGPQVLFSTPGMLHGGQSLKVFKKWCSDPLNMIIMPGYCVAGTVGARVI 369
Query: 357 SEPEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLK 416
+ E+ I + + +++ V+Y+SFSAH D + + +R+ P HV+ VHGE ++M LK
Sbjct: 370 NG-EKKIEIDQKMHEIRLGVEYMSFSAHADAKGIMQLIRQCEPQHVMFVHGEASKMEFLK 428
Query: 417 AALTREYE 424
+ +EY+
Sbjct: 429 GKVEKEYK 436
Score = 38 (18.4 bits), Expect = 2.1e-86, Sum P(2) = 2.1e-86
Identities = 8/26 (30%), Positives = 17/26 (65%)
Query: 565 LISECLIEILVEMYGEAAVPKMFKGE 590
LI +C + ++ ++GEA+ + KG+
Sbjct: 405 LIRQCEPQHVMFVHGEASKMEFLKGK 430
>UNIPROTKB|F1RJE8 [details] [associations]
symbol:CPSF3L "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0016787 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 KO:K13148 OMA:MAVEYMS GeneTree:ENSGT00700000104485
EMBL:FP102596 RefSeq:XP_003127541.3 Ensembl:ENSSSCT00000003708
GeneID:100523908 KEGG:ssc:100523908 Uniprot:F1RJE8
Length = 599
Score = 855 (306.0 bits), Expect = 1.8e-85, P = 1.8e-85
Identities = 194/497 (39%), Positives = 285/497 (57%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVE-----SDQIDLLLISHF 60
GAGQ+VGRSCI++ K++M+DCG+H G S P + +D +D ++ISHF
Sbjct: 10 GAGQDVGRSCILVSIGGKNVMLDCGMHMGFSDDRRFPDFSYITRHGRLTDFLDCVIISHF 69
Query: 61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
HLDHCGALP+F G+ G +MT T+AI LL DY K++ + E +T ++
Sbjct: 70 HLDHCGALPYFSEMVGYDGPIYMTQPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDC 129
Query: 120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
M K ++ H+ V+ ++ AY AGHVLGAAMF I++ ++YTGD++ DRHL A
Sbjct: 130 MKKAVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGA 189
Query: 179 AEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELL 238
A I +P++LITESTY T + + + RE F +H+ V RGG+ LIPVFALGRAQEL
Sbjct: 190 AWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQELC 249
Query: 239 LILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISN 298
++L+ +W +L PIY+++ L +K Y+ +I N +IR+ N F FKHI
Sbjct: 250 ILLETFWE-RMDLK-APIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMFEFKHI-- 305
Query: 299 LKGIDH-FEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTIL 356
K D F D GP VV A+PGM+ +G S ++F W + KN VI+ GYCV+GT+ IL
Sbjct: 306 -KAFDRAFADSPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVGHKIL 364
Query: 357 SEPEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLK 416
S ++ Q L +KM V+Y+SFSAH D + + V + P +V+LVHGE +M LK
Sbjct: 365 SGQRKLELEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPENVLLVHGEAKKMEFLK 424
Query: 417 AALTREYEDDPNTSMELYNPRNTVSVDLYFK-----GEKTAKVMGELAVENLKPDAA--- 468
+ +E+ + Y P N +V L G + E+A + L PDA
Sbjct: 425 QKIEQEFR------LSCYMPANGETVTLPTSPSIPVGISLGLLKREMA-QGLLPDAKKAR 477
Query: 469 -LSGIIVKRNFNYHLLA 484
L G ++ ++ + L++
Sbjct: 478 LLHGTLIMKDSTFRLVS 494
>FB|FBgn0039691 [details] [associations]
symbol:IntS11 "Integrator 11" species:7227 "Drosophila
melanogaster" [GO:0006379 "mRNA cleavage" evidence=ISS] [GO:0005847
"mRNA cleavage and polyadenylation specificity factor complex"
evidence=ISS] [GO:0006378 "mRNA polyadenylation" evidence=ISS]
[GO:0016787 "hydrolase activity" evidence=IEA] [GO:0034472 "snRNA
3'-end processing" evidence=IDA] [GO:0016180 "snRNA processing"
evidence=ISS] [GO:0032039 "integrator complex" evidence=ISS]
[GO:0022008 "neurogenesis" evidence=IMP] InterPro:IPR001279
SMART:SM00849 Pfam:PF07521 EMBL:AE014297 GO:GO:0022008
GO:GO:0006378 GO:GO:0016787 GO:GO:0005847 GO:GO:0006379
eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 KO:K13148 OMA:MAVEYMS GeneTree:ENSGT00700000104485
GO:GO:0034472 EMBL:AY061097 RefSeq:NP_651721.1 UniGene:Dm.3722
SMR:Q9VAH9 STRING:Q9VAH9 EnsemblMetazoa:FBtr0085476 GeneID:43506
KEGG:dme:Dmel_CG1972 UCSC:CG1972-RA CTD:43506 FlyBase:FBgn0039691
InParanoid:Q9VAH9 OrthoDB:EOG47D7X3 GenomeRNAi:43506 NextBio:834295
Uniprot:Q9VAH9
Length = 597
Score = 843 (301.8 bits), Expect = 3.4e-84, P = 3.4e-84
Identities = 201/564 (35%), Positives = 309/564 (54%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVE----SDQIDLLLISHF 60
GAGQ+VGRSC++L K+IM+DCG+H G + P F +V + ID ++ISHF
Sbjct: 10 GAGQDVGRSCLLLSMGGKNIMLDCGMHMGYNDERRFPDFSYIVPEGPITSHIDCVIISHF 69
Query: 61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
HLDHCGALP+ G+ G +MTH TKAI LL D KV+ E +T ++
Sbjct: 70 HLDHCGALPYMSEIVGYTGPIYMTHPTKAIAPILLEDMRKVAVERKGESNFFTTQMIKDC 129
Query: 120 MDKIETINFHEEKDVN-GIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
M K+ + H+ V+ ++ AY AGHVLGAAMF I++ ++YTGD++ DRHL A
Sbjct: 130 MKKVIPVTLHQSMMVDTDLEIKAYYAGHVLGAAMFWIKVGSQSVVYTGDYNMTPDRHLGA 189
Query: 179 AEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELL 238
A I +PD+LI+ESTY T + + + RE F +H+ V +GG+ LIPVFALGRAQEL
Sbjct: 190 AWIDKCRPDLLISESTYATTIRDSKRCRERDFLKKVHECVAKGGKVLIPVFALGRAQELC 249
Query: 239 LILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISN 298
++L+ YW L PIY+A L +K + Y+ +I N +IR+ N F FKHI
Sbjct: 250 ILLETYWE-RMNLK-YPIYFALGLTEKANTYYKMFITWTNQKIRKTFVHRNMFDFKHIKP 307
Query: 299 LKGIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSE 358
+ ++ G VV A+PGM+ +GLS ++F+ W + N VI+ GYCV+GT+ IL
Sbjct: 308 FDKA-YIDNPGAMVVFATPGMLHAGLSLQIFKKWAPNENNMVIMPGYCVQGTVGNKILGG 366
Query: 359 PEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAA 418
++V + Q + +KM+V+Y+SFSAH D + + ++ P +V+LVHGE +M L++
Sbjct: 367 AKKVEFENRQVVEVKMAVEYMSFSAHADAKGIMQLIQNCEPKNVMLVHGEAGKMKFLRSK 426
Query: 419 LTREYEDDPNTSMELYNPRN----TVSVDLYFKGEKTAKVM-GELAVENLKPD-----AA 468
+ E+ ++E Y P N +S + + + ++ E N +P
Sbjct: 427 IKDEF------NLETYMPANGETCVISTPVKIPVDASVSLLKAEARSYNAQPPDPKRRRL 480
Query: 469 LSGIIVKRNFNYHLLAPSDLPKYTDLK------ASKIIQQQSVYYSGSISVLRSLISH-L 521
+ G++V ++ L +D K + SK+ S + L++L+ L
Sbjct: 481 IHGVLVMKDNRIMLQNLTDALKEIGINRHVMRFTSKVKMDDSGPVIRTSERLKTLLEEKL 540
Query: 522 AGPVETLDEKRLRAFACIEITLEK 545
AG T+ E A +E+ +E+
Sbjct: 541 AGWTVTMQENGSIAIESVEVKVEE 564
>DICTYBASE|DDB_G0278189 [details] [associations]
symbol:ints11 "integrator complex subunit 11"
species:44689 "Dictyostelium discoideum" [GO:0016787 "hydrolase
activity" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] [GO:0044351 "macropinocytosis"
evidence=RCA] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
dictyBase:DDB_G0278189 Pfam:PF07521 GO:GO:0005634 GO:GO:0005737
GenomeReviews:CM000152_GR EMBL:AAFI02000023 GO:GO:0016787
eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 KO:K13148 RefSeq:XP_642189.1
ProteinModelPortal:Q54YL3 PRIDE:Q54YL3 EnsemblProtists:DDB0234100
GeneID:8621396 KEGG:ddi:DDB_G0278189 OMA:RTIANET
ProtClustDB:CLSZ2729107 Uniprot:Q54YL3
Length = 744
Score = 810 (290.2 bits), Expect = 1.1e-80, P = 1.1e-80
Identities = 177/456 (38%), Positives = 265/456 (58%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVESDQ----IDLLLISHF 60
GAGQ+VGRSC+++ NK+IM DCG+H G++ P F + ++ Q ID ++I+HF
Sbjct: 9 GAGQDVGRSCVIVTIGNKNIMFDCGMHMGMNDARRFPDFSYISKNGQFTKVIDCVIITHF 68
Query: 61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
HLDHCGALP+F G+ G +MT TKAI LL DY K++ E +T ++
Sbjct: 69 HLDHCGALPFFTEMCGYDGPIYMTLPTKAICPILLEDYRKITVEKKGETNFFTAQMIKDC 128
Query: 120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
M K+ +N H+ V+ + AY AGHVLGAAMF ++ ++YTGD++ DRHL +
Sbjct: 129 MKKVIPVNLHQTIKVDEELSIKAYYAGHVLGAAMFYAKVGDESVVYTGDYNMTPDRHLGS 188
Query: 179 AEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELL 238
A I VKPD+LITE+TY T + + + RE F IH+ V +GG+ LIPVFALGR QEL
Sbjct: 189 AWIDQVKPDVLITETTYATTIRDSKRGRERDFLKRIHECVEKGGKVLIPVFALGRVQELC 248
Query: 239 LILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISN 298
+++D YW L IPIY+++ LA+K Y+ +IN N +I++ N F FKHI
Sbjct: 249 ILIDSYWE-QMNLGHIPIYFSAGLAEKANLYYKLFINWTNQKIKQTFVKRNMFDFKHIKP 307
Query: 299 LKGIDHFEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTIL- 356
+ H D G V+ A+PGM+ +G S E+F+ W + N II GYCV GT+ +L
Sbjct: 308 FQS--HLVDAPGAMVLFATPGMLHAGASLEVFKKWAPNELNMTIIPGYCVVGTVGNKLLT 365
Query: 357 --------SEPE-EVIGMSGQR-LPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVH 406
S+P+ +++ + + + +K + +SFSAH D + + ++ P +V+LVH
Sbjct: 366 TGSDQQQQSKPQSQMVEIDKKTTIEVKCKIHNLSFSAHADAKGILQLIKMSNPRNVILVH 425
Query: 407 GEQNEMSRLKAALTREYEDDPNTSMELYNPRNTVSV 442
GE+ +M L + +E + Y P N V++
Sbjct: 426 GEKEKMGFLSQKIIKEM------GVNCYYPANGVTI 455
>ZFIN|ZDB-GENE-050522-13 [details] [associations]
symbol:cpsf3l "cleavage and polyadenylation specific
factor 3-like" species:7955 "Danio rerio" [GO:0016787 "hydrolase
activity" evidence=IEA] [GO:0043484 "regulation of RNA splicing"
evidence=IMP] [GO:0005634 "nucleus" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
ZFIN:ZDB-GENE-050522-13 GO:GO:0016787 GO:GO:0043484
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
OMA:MAVEYMS GeneTree:ENSGT00700000104485 EMBL:CABZ01054885
EMBL:CR846089 IPI:IPI00865509 Ensembl:ENSDART00000102902
Uniprot:E7EXW1
Length = 601
Score = 791 (283.5 bits), Expect = 1.1e-78, P = 1.1e-78
Identities = 188/514 (36%), Positives = 289/514 (56%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVESDQI----DLLLISHF 60
GAGQ+VGRSCI++ K+IM+DCG+H G + P F + ++ ++ D ++ISHF
Sbjct: 10 GAGQDVGRSCILVSIGGKNIMLDCGMHMGFNDDRRFPDFSYITQNGRLTEFLDCVIISHF 69
Query: 61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
HLDHCGALP+ G+ G +MTH TKAI LL D+ K++ + E +T ++
Sbjct: 70 HLDHCGALPYMSEMVGYDGPIYMTHPTKAICPILLEDFRKITVDKKGETNFFTSQMIKDC 129
Query: 120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAM----FLIEIAGVKILYTGDFSRQEDR 174
M K+ +N H+ V+ ++ AY AGHVLGAAM F + + V + YT
Sbjct: 130 MKKVVPLNLHQTVQVDDELEIKAYYAGHVLGAAMVQSRFRV-VYTVSVSYTYSNLMTPAS 188
Query: 175 HLMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRA 234
L AA I +PDILI+ESTY T + + + RE F +H+ V RGG+ LIPVFALGRA
Sbjct: 189 DLRAAWIDKCRPDILISESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRA 248
Query: 235 QELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFK 294
QEL ++L+ +W L PIY+++ L +K Y+ +I N +IR+ N F FK
Sbjct: 249 QELCILLETFWE-RMNLK-APIYFSTGLTEKANHYYKLFITWTNQKIRKTFVQRNMFEFK 306
Query: 295 HISNLKGID--HFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLA 352
HI K D + ++ GP VV A+PGM+ +G S ++F+ W + KN VI+ GYCV+GT+
Sbjct: 307 HI---KAFDRSYADNPGPMVVFATPGMLHAGQSLQIFKKWAGNEKNMVIMPGYCVQGTVG 363
Query: 353 KTILSEPEEVIGMSGQR-LPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNE 411
IL+ ++ + M G+ L +K+ V+Y+SFSAH D + + +R P +++LVHGE +
Sbjct: 364 HKILNGQKK-LEMEGRATLDVKLQVEYMSFSAHADAKGIMQLIRMAEPRNMLLVHGEAKK 422
Query: 412 MSRLKAALTREYEDD---P---NTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKP 465
M LK + +E+ P T+ + NP +V VD+ K +G + KP
Sbjct: 423 MEFLKDKIEQEFSISCFMPANGETTTIVTNP--SVPVDISLNLLKREMALGGPLPDAKKP 480
Query: 466 DAALSGIIVKRNFNYHLLAPSDLPKYTDLKASKI 499
+ G ++ ++ + L++P K L ++
Sbjct: 481 -RTMHGTLIMKDNSLRLVSPEQALKELGLNEHQL 513
>TAIR|locus:2065368 [details] [associations]
symbol:CPSF73-II "AT2G01730" species:3702 "Arabidopsis
thaliana" [GO:0003824 "catalytic activity" evidence=ISS]
[GO:0008152 "metabolic process" evidence=ISS] [GO:0016787
"hydrolase activity" evidence=IEA] [GO:0010197 "polar nucleus
fusion" evidence=IMP] [GO:0005515 "protein binding" evidence=IPI]
[GO:0005634 "nucleus" evidence=IDA] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 GO:GO:0005634 EMBL:CP002685
GenomeReviews:CT485783_GR GO:GO:0006397 GO:GO:0090305 EMBL:AC006069
GO:GO:0004518 GO:GO:0010197 eggNOG:COG1236 InterPro:IPR022712
InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 EMBL:AY168923
EMBL:AK221561 IPI:IPI00536069 PIR:D84428 RefSeq:NP_178282.2
UniGene:At.42473 ProteinModelPortal:Q8GUU3 SMR:Q8GUU3 IntAct:Q8GUU3
STRING:Q8GUU3 PaxDb:Q8GUU3 PRIDE:Q8GUU3 EnsemblPlants:AT2G01730.1
GeneID:814702 KEGG:ath:AT2G01730 TAIR:At2g01730
HOGENOM:HOG000231294 InParanoid:Q56XW2 KO:K13148 OMA:MAVEYMS
Genevestigator:Q8GUU3 Uniprot:Q8GUU3
Length = 613
Score = 743 (266.6 bits), Expect = 1.4e-73, P = 1.4e-73
Identities = 157/428 (36%), Positives = 248/428 (57%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVES-----DQIDLLLISHF 60
GAGQE+G+SC+++ K IM DCG+H G + P L+ + I ++I+HF
Sbjct: 9 GAGQEIGKSCVVVTINGKKIMFDCGMHMGCDDHNRYPNFSLISKSGDFDNAISCIIITHF 68
Query: 61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
H+DH GALP+F G+ G +M++ TKA+ +L DY +V + E+ L+T + +
Sbjct: 69 HMDHVGALPYFTEVCGYNGPIYMSYPTKALSPLMLEDYRRVMVDRRGEEELFTTTHIANC 128
Query: 120 MDKIETINFHEEKDVN-GIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
M K+ I+ + V+ ++ AY AGHVLGA M ++ I+YTGD++ DRHL A
Sbjct: 129 MKKVIAIDLKQTIQVDEDLQIRAYYAGHVLGAVMVYAKMGDAAIVYTGDYNMTTDRHLGA 188
Query: 179 AEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELL 238
A+I ++ D+LI+ESTY T + + RE F +H V GG+ LIP FALGRAQEL
Sbjct: 189 AKIDRLQLDLLISESTYATTIRGSKYPREREFLQAVHKCVAGGGKALIPSFALGRAQELC 248
Query: 239 LILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISN 298
++LD+YW + +PIY++S L + Y+ I+ + ++ + + +NPF FK++ +
Sbjct: 249 MLLDDYWE-RMNIK-VPIYFSSGLTIQANMYYKMLISWTSQNVKEKHNTHNPFDFKNVKD 306
Query: 299 L-KGIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLA-KTIL 356
+ + H GPCV+ A+PGM+ +G S E+F+ W N V + GY V GT+ K +
Sbjct: 307 FDRSLIHAP--GPCVLFATPGMLCAGFSLEVFKHWAPSPLNLVALPGYSVAGTVGHKLMA 364
Query: 357 SEPEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLK 416
+P V +G ++ ++ V ++FS HTD + + + L P +VVLVHGE+ M LK
Sbjct: 365 GKPTTVDLYNGTKVDVRCKVHQVAFSPHTDAKGIMDLTKFLSPKNVVLVHGEKPSMMILK 424
Query: 417 AALTREYE 424
+T E +
Sbjct: 425 EKITSELD 432
>GENEDB_PFALCIPARUM|PFC0825c [details] [associations]
symbol:PFC0825c "cleavage and polyadenylation
specificity factor protein, putative" species:5833 "Plasmodium
falciparum" [GO:0006379 "mRNA cleavage" evidence=ISS] [GO:0006378
"mRNA polyadenylation" evidence=ISS] [GO:0003729 "mRNA binding"
evidence=ISS] [GO:0005847 "mRNA cleavage and polyadenylation
specificity factor complex" evidence=ISS] InterPro:IPR001279
SMART:SM00849 Pfam:PF07521 GO:GO:0003729 GO:GO:0016787
EMBL:AL844502 GO:GO:0005847 InterPro:IPR022712 InterPro:IPR011108
Pfam:PF10996 SMART:SM01027 KO:K13148 PIR:T18488
RefSeq:XP_001351256.1 ProteinModelPortal:O77371 PRIDE:O77371
EnsemblProtists:PFC0825c:mRNA GeneID:814500 KEGG:pfa:PFC0825c
EuPathDB:PlasmoDB:PF3D7_0318600 HOGENOM:HOG000283200
ProtClustDB:CLSZ2433497 Uniprot:O77371
Length = 1017
Score = 542 (195.9 bits), Expect = 4.0e-62, Sum P(3) = 4.0e-62
Identities = 110/331 (33%), Positives = 192/331 (58%)
Query: 95 LSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFL 154
L +Y ++ I + E ++ +DK+ + +E ++ + + Y AGHVLGA ++
Sbjct: 243 LLNY-NINCIKKDPWNINEDNIYNCIDKVIGLQINETFELGDMSITPYYAGHVLGACIYK 301
Query: 155 IEIAGVKILYTGDFSRQEDRHLMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLI 214
IE+ ++YTGD++ D+HL +A IP + P+I I+ESTY T+V ++ E +L+
Sbjct: 302 IEVRNFSVIYTGDYNTIPDKHLGSANIPSLNPEIFISESTYATYVRPTKKASELELCNLV 361
Query: 215 HDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYI 274
H+ V++GG+ LIPVFA+GRAQEL ++LD+YW ++H PIY+ L + Y+ Y
Sbjct: 362 HECVHKGGKVLIPVFAIGRAQELSILLDDYWK-KMKIH-YPIYFGCGLTENANKYYKIYS 419
Query: 275 NAMNDRIRRQISINNPFVFKHISNLKGIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCT 334
+ +N N F F +IS ++ + P V+ A+PGM+ +GLS + F+ W
Sbjct: 420 SWINSSCMSNEK-ENLFDFANISPFLN-NYLNEKRPMVLFATPGMLHTGLSLKAFKAWAG 477
Query: 335 DAKNGVIIAGYCVEGTLA-KTILSEPEEVIGMSGQR-LPLKMSVDYISFSAHTDYQQTSE 392
+ +N +++ GYCV+GT+ K I+ E + I + G + + + Y+SFSAH D +
Sbjct: 478 NPQNLIVLPGYCVQGTVGHKLIMGEKQ--ISLDGTTYIKVLCKIIYLSFSAHADSNGIQQ 535
Query: 393 FVRELRPAHVVLVHGEQNEMSRLKAALTREY 423
++ + P +V+ VHGE+N M +L ++ ++
Sbjct: 536 LIKHVSPKNVIFVHGEKNGMQKLAKYISNKH 566
Score = 128 (50.1 bits), Expect = 4.5e-10, Sum P(3) = 4.5e-10
Identities = 32/93 (34%), Positives = 55/93 (59%)
Query: 52 IDLLLISHFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLY 111
ID ++ISHFH+DH GALP+F ++G M++ TKA+ LL D +V+++ E+ +
Sbjct: 170 IDCVIISHFHMDHIGALPFFTEILKYRGIILMSYPTKALSPILLLDSCRVTDMKWEKKNF 229
Query: 112 TESDLEKSMDKI-ETINFHEEKDVNGIKFSAYN 143
E ++ +K E +N++ +N IK +N
Sbjct: 230 -ERQIKMLNEKSDELLNYN----INCIKKDPWN 257
Score = 107 (42.7 bits), Expect = 4.0e-62, Sum P(3) = 4.0e-62
Identities = 19/42 (45%), Positives = 27/42 (64%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLV 47
GAGQ VGRSC+++E +N+ +M DCG H G P +L+
Sbjct: 15 GAGQTVGRSCVIVELENRKVMFDCGCHLGYKDERKYPNFNLL 56
Score = 42 (19.8 bits), Expect = 4.0e-62, Sum P(3) = 4.0e-62
Identities = 14/39 (35%), Positives = 21/39 (53%)
Query: 586 MFKGEKITI--TVDKKKACIDLVDLSVQCEDSKLKSTVQ 622
++K EKI+ DKKK ID L V+ + + K +Q
Sbjct: 702 LYKNEKISNYHKKDKKKKAIDEHKLKVRNKLIQKKINIQ 740
>UNIPROTKB|O77371 [details] [associations]
symbol:PFC0825c "Cleavage and polyadenylation specificity
factor protein, putative" species:36329 "Plasmodium falciparum 3D7"
[GO:0003729 "mRNA binding" evidence=ISS] [GO:0005847 "mRNA cleavage
and polyadenylation specificity factor complex" evidence=ISS]
InterPro:IPR001279 SMART:SM00849 Pfam:PF07521 GO:GO:0003729
GO:GO:0016787 EMBL:AL844502 GO:GO:0005847 InterPro:IPR022712
InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 KO:K13148 PIR:T18488
RefSeq:XP_001351256.1 ProteinModelPortal:O77371 PRIDE:O77371
EnsemblProtists:PFC0825c:mRNA GeneID:814500 KEGG:pfa:PFC0825c
EuPathDB:PlasmoDB:PF3D7_0318600 HOGENOM:HOG000283200
ProtClustDB:CLSZ2433497 Uniprot:O77371
Length = 1017
Score = 542 (195.9 bits), Expect = 4.0e-62, Sum P(3) = 4.0e-62
Identities = 110/331 (33%), Positives = 192/331 (58%)
Query: 95 LSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFL 154
L +Y ++ I + E ++ +DK+ + +E ++ + + Y AGHVLGA ++
Sbjct: 243 LLNY-NINCIKKDPWNINEDNIYNCIDKVIGLQINETFELGDMSITPYYAGHVLGACIYK 301
Query: 155 IEIAGVKILYTGDFSRQEDRHLMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLI 214
IE+ ++YTGD++ D+HL +A IP + P+I I+ESTY T+V ++ E +L+
Sbjct: 302 IEVRNFSVIYTGDYNTIPDKHLGSANIPSLNPEIFISESTYATYVRPTKKASELELCNLV 361
Query: 215 HDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYI 274
H+ V++GG+ LIPVFA+GRAQEL ++LD+YW ++H PIY+ L + Y+ Y
Sbjct: 362 HECVHKGGKVLIPVFAIGRAQELSILLDDYWK-KMKIH-YPIYFGCGLTENANKYYKIYS 419
Query: 275 NAMNDRIRRQISINNPFVFKHISNLKGIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCT 334
+ +N N F F +IS ++ + P V+ A+PGM+ +GLS + F+ W
Sbjct: 420 SWINSSCMSNEK-ENLFDFANISPFLN-NYLNEKRPMVLFATPGMLHTGLSLKAFKAWAG 477
Query: 335 DAKNGVIIAGYCVEGTLA-KTILSEPEEVIGMSGQR-LPLKMSVDYISFSAHTDYQQTSE 392
+ +N +++ GYCV+GT+ K I+ E + I + G + + + Y+SFSAH D +
Sbjct: 478 NPQNLIVLPGYCVQGTVGHKLIMGEKQ--ISLDGTTYIKVLCKIIYLSFSAHADSNGIQQ 535
Query: 393 FVRELRPAHVVLVHGEQNEMSRLKAALTREY 423
++ + P +V+ VHGE+N M +L ++ ++
Sbjct: 536 LIKHVSPKNVIFVHGEKNGMQKLAKYISNKH 566
Score = 128 (50.1 bits), Expect = 4.5e-10, Sum P(3) = 4.5e-10
Identities = 32/93 (34%), Positives = 55/93 (59%)
Query: 52 IDLLLISHFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLY 111
ID ++ISHFH+DH GALP+F ++G M++ TKA+ LL D +V+++ E+ +
Sbjct: 170 IDCVIISHFHMDHIGALPFFTEILKYRGIILMSYPTKALSPILLLDSCRVTDMKWEKKNF 229
Query: 112 TESDLEKSMDKI-ETINFHEEKDVNGIKFSAYN 143
E ++ +K E +N++ +N IK +N
Sbjct: 230 -ERQIKMLNEKSDELLNYN----INCIKKDPWN 257
Score = 107 (42.7 bits), Expect = 4.0e-62, Sum P(3) = 4.0e-62
Identities = 19/42 (45%), Positives = 27/42 (64%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLV 47
GAGQ VGRSC+++E +N+ +M DCG H G P +L+
Sbjct: 15 GAGQTVGRSCVIVELENRKVMFDCGCHLGYKDERKYPNFNLL 56
Score = 42 (19.8 bits), Expect = 4.0e-62, Sum P(3) = 4.0e-62
Identities = 14/39 (35%), Positives = 21/39 (53%)
Query: 586 MFKGEKITI--TVDKKKACIDLVDLSVQCEDSKLKSTVQ 622
++K EKI+ DKKK ID L V+ + + K +Q
Sbjct: 702 LYKNEKISNYHKKDKKKKAIDEHKLKVRNKLIQKKINIQ 740
>UNIPROTKB|C9JZH6 [details] [associations]
symbol:CPSF3 "Cleavage and polyadenylation-specificity
factor subunit 3" species:9606 "Homo sapiens" [GO:0003723 "RNA
binding" evidence=IEA] [GO:0004521 "endoribonuclease activity"
evidence=IEA] [GO:0008409 "5'-3' exonuclease activity"
evidence=IEA] InterPro:IPR001279 Pfam:PF00753 GO:GO:0003723
GO:GO:0004521 GO:GO:0008409 EMBL:AC080162 HGNC:HGNC:2326
ChiTaRS:CPSF3 IPI:IPI00807384 ProteinModelPortal:C9JZH6 SMR:C9JZH6
STRING:C9JZH6 Ensembl:ENST00000475482 HOGENOM:HOG000191757
ArrayExpress:C9JZH6 Bgee:C9JZH6 Uniprot:C9JZH6
Length = 136
Score = 609 (219.4 bits), Expect = 2.2e-59, P = 2.2e-59
Identities = 111/136 (81%), Positives = 124/136 (91%)
Query: 26 MMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGFKGRCFMTH 85
M+DCGIHPGL GMDALP++DL++ +IDLLLISHFHLDHCGALPWFL KT FKGR FMTH
Sbjct: 1 MLDCGIHPGLEGMDALPYIDLIDPAEIDLLLISHFHLDHCGALPWFLQKTSFKGRTFMTH 60
Query: 86 ATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNGIKFSAYNAG 145
ATKAIYRWLLSDY+KVSNIS + MLYTE+DLE+SMDKIETINFHE K+V GIKF Y+AG
Sbjct: 61 ATKAIYRWLLSDYVKVSNISADDMLYTETDLEESMDKIETINFHEVKEVAGIKFWCYHAG 120
Query: 146 HVLGAAMFLIEIAGVK 161
HVLGAAMF+IEIAGVK
Sbjct: 121 HVLGAAMFMIEIAGVK 136
>UNIPROTKB|C9J979 [details] [associations]
symbol:CPSF3L "Integrator complex subunit 11" species:9606
"Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 GO:GO:0016787
EMBL:AL139287 InterPro:IPR022712 Pfam:PF10996 HOGENOM:HOG000231294
HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI00514808
ProteinModelPortal:C9J979 SMR:C9J979 STRING:C9J979
Ensembl:ENST00000434694 ArrayExpress:C9J979 Bgee:C9J979
Uniprot:C9J979
Length = 344
Score = 268 (99.4 bits), Expect = 2.2e-46, Sum P(2) = 2.2e-46
Identities = 54/135 (40%), Positives = 82/135 (60%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLISHF 60
GAGQ+VGRSCI++ K++M+DCG+H G + P F + ++ D +D ++ISHF
Sbjct: 10 GAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHF 69
Query: 61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
HLDHCGALP+F G+ G +MTH T+AI LL DY K++ + E +T ++
Sbjct: 70 HLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDC 129
Query: 120 MDKIETINFHEEKDV 134
M K+ ++ H+ V
Sbjct: 130 MKKVVAVHLHQTVQV 144
Score = 252 (93.8 bits), Expect = 2.2e-46, Sum P(2) = 2.2e-46
Identities = 52/119 (43%), Positives = 74/119 (62%)
Query: 178 AAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQEL 237
AA I +P++LITESTY T + + + RE F +H+ V RGG+ LIPVFALGRAQEL
Sbjct: 219 AAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFALGRAQEL 278
Query: 238 LLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHI 296
++L+ +W L +PIY+++ L +K Y+ +I N +IR+ N F FKHI
Sbjct: 279 CILLETFWE-RMNLK-VPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMFEFKHI 335
Score = 41 (19.5 bits), Expect = 6.2e-22, Sum P(2) = 6.2e-22
Identities = 8/27 (29%), Positives = 15/27 (55%)
Query: 137 IKFSAYNAGHVLGAAMFLIEIAGVKIL 163
I+ + AG +G + L+ IAG ++
Sbjct: 4 IRVTPLGAGQDVGRSCILVSIAGKNVM 30
>UNIPROTKB|E9PNS4 [details] [associations]
symbol:CPSF3L "Integrator complex subunit 11" species:9606
"Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 GO:GO:0016787
EMBL:AL139287 HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI00984775
ProteinModelPortal:E9PNS4 SMR:E9PNS4 Ensembl:ENST00000528879
ArrayExpress:E9PNS4 Bgee:E9PNS4 Uniprot:E9PNS4
Length = 278
Score = 477 (173.0 bits), Expect = 2.1e-45, P = 2.1e-45
Identities = 95/225 (42%), Positives = 141/225 (62%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLISHF 60
GAGQ+VGRSCI++ K++M+DCG+H G + P F + ++ D +D ++ISHF
Sbjct: 10 GAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHF 69
Query: 61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
HLDHCGALP+F G+ G +MTH T+AI LL DY K++ + E +T ++
Sbjct: 70 HLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDC 129
Query: 120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
M K+ ++ H+ V+ ++ AY AGHVLGAAMF I++ ++YTGD++ DRHL A
Sbjct: 130 MKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGA 189
Query: 179 AEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGR 223
A I +P++LITESTY T + + + RE F +H+ V RGG+
Sbjct: 190 AWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGK 234
>UNIPROTKB|E9PI75 [details] [associations]
symbol:CPSF3L "Integrator complex subunit 11" species:9606
"Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 GO:GO:0016787
EMBL:AL139287 HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI01011963
ProteinModelPortal:E9PI75 SMR:E9PI75 Ensembl:ENST00000527719
ArrayExpress:E9PI75 Bgee:E9PI75 Uniprot:E9PI75
Length = 209
Score = 411 (149.7 bits), Expect = 4.0e-38, P = 4.0e-38
Identities = 83/194 (42%), Positives = 123/194 (63%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLISHF 60
GAGQ+VGRSCI++ K++M+DCG+H G + P F + ++ D +D ++ISHF
Sbjct: 16 GAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHF 75
Query: 61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
HLDHCGALP+F G+ G +MTH T+AI LL DY K++ + E +T ++
Sbjct: 76 HLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDC 135
Query: 120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
M K+ ++ H+ V+ ++ AY AGHVLGAAMF I++ ++YTGD++ DRHL A
Sbjct: 136 MKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGA 195
Query: 179 AEIPPVKPDILITE 192
A I +P++LITE
Sbjct: 196 AWIDKCRPNLLITE 209
>TIGR_CMR|CPS_2623 [details] [associations]
symbol:CPS_2623 "metallo-beta-lactamase family protein"
species:167879 "Colwellia psychrerythraea 34H" [GO:0003824
"catalytic activity" evidence=ISS] [GO:0008152 "metabolic process"
evidence=ISS] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 GO:GO:0016787 EMBL:CP000083 GenomeReviews:CP000083_GR
eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 HOGENOM:HOG000244774 KO:K07576 RefSeq:YP_269337.1
ProteinModelPortal:Q481D2 STRING:Q481D2 GeneID:3521490
KEGG:cps:CPS_2623 PATRIC:21468305 OMA:HGPMVII
ProtClustDB:CLSK2524370 BioCyc:CPSY167879:GI48-2685-MONOMER
Uniprot:Q481D2
Length = 451
Score = 410 (149.4 bits), Expect = 5.2e-38, P = 5.2e-38
Identities = 122/447 (27%), Positives = 222/447 (49%)
Query: 4 LKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDL-VESDQIDLLLISHFHL 62
L G G G S +E I++DCG++ G + A L ++ +D ++++H HL
Sbjct: 6 LGGTGTVTG-SKYFVETSTTKILVDCGLYQGYKWLRARNREPLPLDLKSLDAIVLTHAHL 64
Query: 63 DHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSD--YI-----------KVSNISTEQM 109
DH G +P L K GF+G + AT ++ LL D +I K+S +
Sbjct: 65 DHSGFIP-ALYKQGFRGHVYAHQATISLCSILLPDSGHIQEDDAKFYGKHKISRHENPEP 123
Query: 110 LYTESDLEKSMDKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFS 169
LY ++ E + + ++F+EE + I+ +AGH+LGAA +++ G ++ ++GD
Sbjct: 124 LYDKATAEACLSLFKAVDFNEEFKIGDIEIELQSAGHILGAASVILKADGKRVGFSGDVG 183
Query: 170 RQEDRHLMAAE-IPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPV 228
R +D + + +PPV D+L+ ESTYG +H++ + E + +++ +GG LIP
Sbjct: 184 RPDDIIMYPPKPLPPV--DLLLLESTYGNRLHDKEDAFE-QLAEIVNSTAKKGGALLIPS 240
Query: 229 FALGRAQELLLILDEYWS--LHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQ-- 284
FA+GR + + +L L P+L P+Y S +A ++Y + + +N R+ +
Sbjct: 241 FAVGRTEAVQHMLASLMKKELIPKL---PVYLDSPMAINVFNIYCEHFD-LN-RLSNEEC 295
Query: 285 ISINNPFVF-KHISNLKGIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIA 343
+ + N F + + K + E I P +++A GM G + D + V+
Sbjct: 296 LEMCNVATFTRTVDESKALS--ELIMPHIIIAGSGMATGGRILHHLKRLLGDYRTTVLFT 353
Query: 344 GYCVEGTLAKTILSEPEEVIGMSGQRLPLKMSVDYIS-FSAHTDYQQTSEFVR--ELRP- 399
GY GT +L+ + V + G+ LP+K V+ ++ S H DY+ +++++ +L P
Sbjct: 354 GYLSGGTRGAKMLAGKDNV-KIHGKWLPVKARVEVLNGLSGHGDYEDITQWLQISKLHPK 412
Query: 400 AHVVLVHGEQNEMSRLKAALTREYEDD 426
V+LVHGE ++ L + + D
Sbjct: 413 TKVLLVHGEPEASESMRDHLMQHTQFD 439
>UNIPROTKB|E9PIG1 [details] [associations]
symbol:CPSF3L "Integrator complex subunit 11" species:9606
"Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 SMART:SM00849 GO:GO:0016787
EMBL:AL139287 HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI00981641
ProteinModelPortal:E9PIG1 SMR:E9PIG1 Ensembl:ENST00000530031
ArrayExpress:E9PIG1 Bgee:E9PIG1 Uniprot:E9PIG1
Length = 249
Score = 406 (148.0 bits), Expect = 1.5e-37, P = 1.5e-37
Identities = 82/193 (42%), Positives = 122/193 (63%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLISHF 60
GAGQ+VGRSCI++ K++M+DCG+H G + P F + ++ D +D ++ISHF
Sbjct: 57 GAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHF 116
Query: 61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLEKS 119
HLDHCGALP+F G+ G +MTH T+AI LL DY K++ + E +T ++
Sbjct: 117 HLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIKDC 176
Query: 120 MDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
M K+ ++ H+ V+ ++ AY AGHVLGAAMF I++ ++YTGD++ DRHL A
Sbjct: 177 MKKVVAVHLHQTVQVDDELEIKAYYAGHVLGAAMFQIKVGSESVVYTGDYNMTPDRHLGA 236
Query: 179 AEIPPVKPDILIT 191
A I +P++LIT
Sbjct: 237 AWIDKCRPNLLIT 249
>UNIPROTKB|E2R496 [details] [associations]
symbol:CPSF2 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0006398 "histone mRNA 3'-end processing"
evidence=IEA] [GO:0005847 "mRNA cleavage and polyadenylation
specificity factor complex" evidence=IEA] [GO:0016787 "hydrolase
activity" evidence=IEA] [GO:0006379 "mRNA cleavage" evidence=IEA]
[GO:0006378 "mRNA polyadenylation" evidence=IEA] InterPro:IPR001279
InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0016787 GO:GO:0005847 KO:K14402 InterPro:IPR022712
InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
Pfam:PF10996 Pfam:PF13299 SMART:SM01027
GeneTree:ENSGT00700000104551 OMA:NNPFQFK CTD:53981 GO:GO:0006398
EMBL:AAEX03005582 RefSeq:XP_537353.2 ProteinModelPortal:E2R496
Ensembl:ENSCAFT00000017381 GeneID:480230 KEGG:cfa:480230
NextBio:20855279 Uniprot:E2R496
Length = 782
Score = 390 (142.3 bits), Expect = 1.3e-36, Sum P(2) = 1.3e-36
Identities = 106/374 (28%), Positives = 187/374 (50%)
Query: 1 MKTLKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHF 60
+ TL G QE C +L+ ++DCG S MD + + QID +L+SH
Sbjct: 7 LTTLSGV-QEESALCYLLQVDEFRFLLDCGWDEHFS-MDIIDSLRK-HVHQIDAVLLSHP 63
Query: 61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM-LYTESDLEKS 119
H GALP+ + K G + T + + + D + S +TE L+T D++ +
Sbjct: 64 DPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQ-SRHNTEDFTLFTLDDVDAA 122
Query: 120 MDKIETINFHEEKDV----NGIKFSAYNAGHVLGAAMFLIEIAGVK-ILYTGDFSRQEDR 174
DKI+ + F + ++ +G+ + AGH++G ++ I G + I+Y DF+ + +
Sbjct: 123 FDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKREI 182
Query: 175 HLMAAEIPPV-KPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGR 233
HL + + +P +LIT+S T+V +R++R+ + + + + + G LI V GR
Sbjct: 183 HLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTAGR 242
Query: 234 AQELLLILDEYWSLHPE-LHDIPIYYASSLAKKCMSVYQTYINAMNDRIRR--QISINNP 290
EL +LD+ W L + ++++ + ++ + M+D++ R + NNP
Sbjct: 243 VLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNP 302
Query: 291 FVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEG 349
F F+H+S G+ + P VV+AS ++ G SR+LF WC D KN +I+ G
Sbjct: 303 FQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTTPG 362
Query: 350 TLAKTILSEPEEVI 363
TLA+ ++ P E I
Sbjct: 363 TLARFLIDNPSEKI 376
Score = 74 (31.1 bits), Expect = 1.3e-36, Sum P(2) = 1.3e-36
Identities = 21/89 (23%), Positives = 44/89 (49%)
Query: 356 LSE-PEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSR 414
LS+ P + I + + + +K V YI + +D + + +++P +++VHG E S+
Sbjct: 515 LSDVPTKCISTT-ESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPP-EASQ 572
Query: 415 LKAALTREYEDDPNTSMELYNPRNTVSVD 443
A R + +++Y P+ +VD
Sbjct: 573 DLAECCRAFG---GKDIKVYMPKLHETVD 598
>UNIPROTKB|Q9P2I0 [details] [associations]
symbol:CPSF2 "Cleavage and polyadenylation specificity
factor subunit 2" species:9606 "Homo sapiens" [GO:0006378 "mRNA
polyadenylation" evidence=IEA] [GO:0006379 "mRNA cleavage"
evidence=IEA] [GO:0016787 "hydrolase activity" evidence=IEA]
[GO:0003723 "RNA binding" evidence=IEA] [GO:0005515 "protein
binding" evidence=IPI] [GO:0006398 "histone mRNA 3'-end processing"
evidence=IDA] [GO:0005847 "mRNA cleavage and polyadenylation
specificity factor complex" evidence=IDA] [GO:0000398 "mRNA
splicing, via spliceosome" evidence=TAS] [GO:0005654 "nucleoplasm"
evidence=TAS] [GO:0006366 "transcription from RNA polymerase II
promoter" evidence=TAS] [GO:0006369 "termination of RNA polymerase
II transcription" evidence=TAS] [GO:0006397 "mRNA processing"
evidence=TAS] [GO:0006406 "mRNA export from nucleus" evidence=TAS]
[GO:0008380 "RNA splicing" evidence=TAS] [GO:0010467 "gene
expression" evidence=TAS] [GO:0031124 "mRNA 3'-end processing"
evidence=TAS] Reactome:REACT_71 InterPro:IPR001279
InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
EMBL:CH471061 Reactome:REACT_1675 GO:GO:0003723 GO:GO:0016787
GO:GO:0006406 GO:GO:0000398 Reactome:REACT_1788 GO:GO:0005847
GO:GO:0006369 eggNOG:COG1236 KO:K14402 InterPro:IPR022712
InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
Pfam:PF10996 Pfam:PF13299 SMART:SM01027 Reactome:REACT_78
HOGENOM:HOG000264343 OMA:NNPFQFK CTD:53981 HOVERGEN:HBG051106
OrthoDB:EOG4MCWZQ GO:GO:0006398 EMBL:AK001627 EMBL:BC070095
EMBL:AB037788 EMBL:AL442079 IPI:IPI00419531 RefSeq:NP_059133.1
UniGene:Hs.657632 UniGene:Hs.736541 ProteinModelPortal:Q9P2I0
SMR:Q9P2I0 DIP:DIP-42500N IntAct:Q9P2I0 MINT:MINT-1697677
STRING:Q9P2I0 PhosphoSite:Q9P2I0 DMDM:51338827 PaxDb:Q9P2I0
PeptideAtlas:Q9P2I0 PRIDE:Q9P2I0 Ensembl:ENST00000298875
GeneID:53981 KEGG:hsa:53981 UCSC:uc001yah.2 GeneCards:GC14P092588
HGNC:HGNC:2325 HPA:HPA024238 MIM:606028 neXtProt:NX_Q9P2I0
PharmGKB:PA26842 InParanoid:Q9P2I0 PhylomeDB:Q9P2I0 ChiTaRS:CPSF2
GenomeRNAi:53981 NextBio:56268 ArrayExpress:Q9P2I0 Bgee:Q9P2I0
CleanEx:HS_CPSF2 Genevestigator:Q9P2I0 GermOnline:ENSG00000165934
Uniprot:Q9P2I0
Length = 782
Score = 390 (142.3 bits), Expect = 1.3e-36, Sum P(2) = 1.3e-36
Identities = 106/374 (28%), Positives = 187/374 (50%)
Query: 1 MKTLKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHF 60
+ TL G QE C +L+ ++DCG S MD + + QID +L+SH
Sbjct: 7 LTTLSGV-QEESALCYLLQVDEFRFLLDCGWDEHFS-MDIIDSLRK-HVHQIDAVLLSHP 63
Query: 61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM-LYTESDLEKS 119
H GALP+ + K G + T + + + D + S +TE L+T D++ +
Sbjct: 64 DPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQ-SRHNTEDFTLFTLDDVDAA 122
Query: 120 MDKIETINFHEEKDV----NGIKFSAYNAGHVLGAAMFLIEIAGVK-ILYTGDFSRQEDR 174
DKI+ + F + ++ +G+ + AGH++G ++ I G + I+Y DF+ + +
Sbjct: 123 FDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKREI 182
Query: 175 HLMAAEIPPV-KPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGR 233
HL + + +P +LIT+S T+V +R++R+ + + + + + G LI V GR
Sbjct: 183 HLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTAGR 242
Query: 234 AQELLLILDEYWSLHPE-LHDIPIYYASSLAKKCMSVYQTYINAMNDRIRR--QISINNP 290
EL +LD+ W L + ++++ + ++ + M+D++ R + NNP
Sbjct: 243 VLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNP 302
Query: 291 FVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEG 349
F F+H+S G+ + P VV+AS ++ G SR+LF WC D KN +I+ G
Sbjct: 303 FQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTTPG 362
Query: 350 TLAKTILSEPEEVI 363
TLA+ ++ P E I
Sbjct: 363 TLARFLIDNPSEKI 376
Score = 74 (31.1 bits), Expect = 1.3e-36, Sum P(2) = 1.3e-36
Identities = 21/89 (23%), Positives = 44/89 (49%)
Query: 356 LSE-PEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSR 414
LS+ P + I + + + +K V YI + +D + + +++P +++VHG E S+
Sbjct: 515 LSDVPTKCISTT-ESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPP-EASQ 572
Query: 415 LKAALTREYEDDPNTSMELYNPRNTVSVD 443
A R + +++Y P+ +VD
Sbjct: 573 DLAECCRAFG---GKDIKVYMPKLHETVD 598
>UNIPROTKB|F1NMN0 [details] [associations]
symbol:CPSF2 "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0006378 "mRNA polyadenylation" evidence=IEA]
[GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0016787 "hydrolase
activity" evidence=IEA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IEA]
[GO:0006398 "histone mRNA 3'-end processing" evidence=IEA]
InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 GO:GO:0016787 GO:GO:0005847 InterPro:IPR022712
InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
Pfam:PF10996 Pfam:PF13299 SMART:SM01027
GeneTree:ENSGT00700000104551 OMA:NNPFQFK GO:GO:0006398
EMBL:AADN02003653 IPI:IPI00651282 Ensembl:ENSGALT00000017538
Uniprot:F1NMN0
Length = 782
Score = 388 (141.6 bits), Expect = 1.4e-36, Sum P(2) = 1.4e-36
Identities = 106/381 (27%), Positives = 192/381 (50%)
Query: 1 MKTLKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHF 60
+ TL G QE C +L+ ++DCG S MD + + Q+D +L+SH
Sbjct: 7 LTTLSGV-QEESALCYLLQVDEFRFLLDCGWDENFS-MDIIDSLKK-HVHQVDAVLLSHP 63
Query: 61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM-LYTESDLEKS 119
H GALP+ + K G + T + + + D + S +TE L+T D++ +
Sbjct: 64 DPLHLGALPYAVGKMGLNCAIYATIPVYKMGQMFMYDLYQ-SRHNTEDFTLFTLDDVDAA 122
Query: 120 MDKIETINFHEEKDV----NGIKFSAYNAGHVLGAAMFLIEIAGVK-ILYTGDFSRQEDR 174
DKI+ + F + ++ +G+ + AGH++G ++ I G + I+Y DF+ + +
Sbjct: 123 FDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKREI 182
Query: 175 HLMAAEIPPV-KPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGR 233
HL + + +P +LIT+S T+V +R++R+ + + + + + G LI V GR
Sbjct: 183 HLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTAGR 242
Query: 234 AQELLLILDEYWSLHPE-LHDIPIYYASSLAKKCMSVYQTYINAMNDRIRR--QISINNP 290
EL +LD+ W L + ++++ + ++ + M+D++ R + NNP
Sbjct: 243 VLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNP 302
Query: 291 FVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEG 349
F F+H+S + + P VV+AS ++ G SR+LF WC D+KN +I+ G
Sbjct: 303 FQFRHLSLCHSLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDSKNSIILTYRTTPG 362
Query: 350 TLAKTILSEP-EEVIGMSGQR 369
TLA+ ++ P E+VI + +R
Sbjct: 363 TLARFLIDNPSEKVIDIELRR 383
Score = 76 (31.8 bits), Expect = 1.4e-36, Sum P(2) = 1.4e-36
Identities = 24/115 (20%), Positives = 53/115 (46%)
Query: 330 EMWCTDAKNGVIIAGYCV-EGTLAKTILSEPEEVIGMSGQRLPLKMSVDYISFSAHTDYQ 388
E+ T+ + + +G E + + + P + I + + + +K V YI + +D
Sbjct: 489 ELQATEEEKSKLESGLTNGEEPMDQDLSDVPTKCISAT-ESMEIKARVTYIDYEGRSDGD 547
Query: 389 QTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMELYNPRNTVSVD 443
+ + +++P +V+VHG E S+ A R + +++Y P+ +VD
Sbjct: 548 SIKKIINQMKPRQLVIVHGPP-EASQDLAECCRAFG---GKDIKVYMPKLHETVD 598
>UNIPROTKB|Q10568 [details] [associations]
symbol:CPSF2 "Cleavage and polyadenylation specificity
factor subunit 2" species:9913 "Bos taurus" [GO:0006398 "histone
mRNA 3'-end processing" evidence=ISS] [GO:0005847 "mRNA cleavage
and polyadenylation specificity factor complex" evidence=ISS]
[GO:0003723 "RNA binding" evidence=IEA] [GO:0016787 "hydrolase
activity" evidence=IEA] [GO:0006379 "mRNA cleavage" evidence=IEA]
[GO:0006378 "mRNA polyadenylation" evidence=IEA] InterPro:IPR001279
InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 Pfam:PF07521
GO:GO:0003723 GO:GO:0016787 GO:GO:0005847 eggNOG:COG1236 KO:K14402
InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108
PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
GeneTree:ENSGT00700000104551 HOGENOM:HOG000264343 OMA:NNPFQFK
EMBL:X75931 IPI:IPI00688446 PIR:A56351 RefSeq:NP_787002.1
UniGene:Bt.4077 ProteinModelPortal:Q10568 STRING:Q10568
PRIDE:Q10568 Ensembl:ENSBTAT00000013500 GeneID:327689
KEGG:bta:327689 CTD:53981 HOVERGEN:HBG051106 InParanoid:Q10568
OrthoDB:EOG4MCWZQ NextBio:20810154 GO:GO:0006398 Uniprot:Q10568
Length = 782
Score = 389 (142.0 bits), Expect = 1.6e-36, Sum P(2) = 1.6e-36
Identities = 105/374 (28%), Positives = 187/374 (50%)
Query: 1 MKTLKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHF 60
+ TL G QE C +L+ ++DCG S MD + + QID +L+SH
Sbjct: 7 LTTLSGV-QEESALCYLLQVDEFRFLLDCGWDEHFS-MDIIDSLRK-HVHQIDAVLLSHP 63
Query: 61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM-LYTESDLEKS 119
H GALP+ + K G + T + + + D + S +TE L+T D++ +
Sbjct: 64 DPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQ-SRHNTEDFTLFTLDDVDAA 122
Query: 120 MDKIETINFHEEKDV----NGIKFSAYNAGHVLGAAMFLIEIAGVK-ILYTGDFSRQEDR 174
DKI+ + F + ++ +G+ + AGH++G ++ I G + I+Y DF+ + +
Sbjct: 123 FDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKREI 182
Query: 175 HLMAAEIPPV-KPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGR 233
HL + + +P +LIT+S T+V +R++R+ + + + + + G LI V GR
Sbjct: 183 HLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTAGR 242
Query: 234 AQELLLILDEYWSLHPE-LHDIPIYYASSLAKKCMSVYQTYINAMNDRIRR--QISINNP 290
EL +LD+ W L + ++++ + ++ + M+D++ R + NNP
Sbjct: 243 VLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNP 302
Query: 291 FVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEG 349
F F+H+S G+ + P VV+AS ++ G SR+LF WC D KN +I+ G
Sbjct: 303 FQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTTPG 362
Query: 350 TLAKTILSEPEEVI 363
TLA+ ++ P E +
Sbjct: 363 TLARFLIDNPSEKV 376
Score = 74 (31.1 bits), Expect = 1.6e-36, Sum P(2) = 1.6e-36
Identities = 21/89 (23%), Positives = 44/89 (49%)
Query: 356 LSE-PEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSR 414
LS+ P + I + + + +K V YI + +D + + +++P +++VHG E S+
Sbjct: 515 LSDVPTKCISTT-ESIEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPP-EASQ 572
Query: 415 LKAALTREYEDDPNTSMELYNPRNTVSVD 443
A R + +++Y P+ +VD
Sbjct: 573 DLAECCRAFG---GKDIKVYMPKLHETVD 598
>UNIPROTKB|Q9W799 [details] [associations]
symbol:cpsf2 "Cleavage and polyadenylation specificity
factor subunit 2" species:8355 "Xenopus laevis" [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=ISS] InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 GO:GO:0005737 GO:GO:0006397
GO:GO:0003723 GO:GO:0016787 GO:GO:0005847 KO:K14402
InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108
PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
CTD:53981 HOVERGEN:HBG051106 EMBL:AF139986 RefSeq:NP_001081123.1
UniGene:Xl.3876 ProteinModelPortal:Q9W799 GeneID:394394
KEGG:xla:394394 Xenbase:XB-GENE-950598 Uniprot:Q9W799
Length = 783
Score = 389 (142.0 bits), Expect = 2.1e-36, Sum P(2) = 2.1e-36
Identities = 108/376 (28%), Positives = 188/376 (50%)
Query: 1 MKTLKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVES--DQIDLLLIS 58
+ TL GA QE C +L+ ++DCG S MD +D V+ Q+D +L+S
Sbjct: 7 LTTLVGA-QEESAVCYLLQVDEFRFLLDCGWDENFS-MD---IIDSVKKYVHQVDAVLLS 61
Query: 59 HFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM-LYTESDLE 117
H H GALP+ + K G + T + + + D + S +TE L++ D++
Sbjct: 62 HPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQ-SRHNTEDFSLFSLDDVD 120
Query: 118 KSMDKIETINF----HEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVK-ILYTGDFSRQE 172
+ DKI+ + + H + +G+ + AGH++G ++ I G + I+Y DF+ +
Sbjct: 121 CAFDKIQQLKYNQIVHLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKR 180
Query: 173 DRHLMAAEIPPV-KPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFAL 231
+ HL + + +P +LIT+S T+V +R++R+ + + + + + G LI V
Sbjct: 181 EIHLNGCSLEMINRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAVDTA 240
Query: 232 GRAQELLLILDEYWSLHPE-LHDIPIYYASSLAKKCMSVYQTYINAMNDRIRR--QISIN 288
GR EL +LD+ W L + ++++ + ++ + M+D++ R + N
Sbjct: 241 GRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRN 300
Query: 289 NPFVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCV 347
NPF F+H++ G + P VV+AS ++ G SRELF WC D KN VI+
Sbjct: 301 NPFQFRHLTLCHGYSDLARVPSPKVVLASQPDLECGFSRELFIQWCQDPKNSVILTYRTT 360
Query: 348 EGTLAKTILSEPEEVI 363
GTLA+ ++ P E I
Sbjct: 361 PGTLARFLIDHPSERI 376
Score = 73 (30.8 bits), Expect = 2.1e-36, Sum P(2) = 2.1e-36
Identities = 19/89 (21%), Positives = 43/89 (48%)
Query: 356 LSE-PEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSR 414
LS+ P + + + + + +K V YI + +D + + +++P +++VHG +
Sbjct: 515 LSDVPTKCVSTT-ESMEIKARVTYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPDATQD 573
Query: 415 LKAALTREYEDDPNTSMELYNPRNTVSVD 443
L A R + +++Y P+ +VD
Sbjct: 574 LAEAC-RAFG---GKDIKVYTPKLHETVD 598
Score = 39 (18.8 bits), Expect = 7.8e-33, Sum P(2) = 7.8e-33
Identities = 8/22 (36%), Positives = 13/22 (59%)
Query: 578 YGEAAVPKMFKGEKITITVDKK 599
YGE P+ F ++ +T D+K
Sbjct: 476 YGEIIKPEDFLVPELQVTEDEK 497
>RGD|1309687 [details] [associations]
symbol:Cpsf2 "cleavage and polyadenylation specific factor 2,
100kDa" species:10116 "Rattus norvegicus" [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=IEA;ISO] [GO:0006378 "mRNA polyadenylation" evidence=IEA]
[GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006398 "histone mRNA
3'-end processing" evidence=IEA;ISO] [GO:0016787 "hydrolase
activity" evidence=IEA] InterPro:IPR001279 InterPro:IPR027075
Pfam:PF00753 SMART:SM00849 Pfam:PF07521 RGD:1309687 GO:GO:0016787
EMBL:CH473982 GO:GO:0005847 KO:K14402 InterPro:IPR022712
InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
Pfam:PF10996 Pfam:PF13299 SMART:SM01027
GeneTree:ENSGT00700000104551 OMA:NNPFQFK CTD:53981
OrthoDB:EOG4MCWZQ GO:GO:0006398 IPI:IPI00189534
RefSeq:NP_001100223.1 UniGene:Rn.8038 Ensembl:ENSRNOT00000008612
GeneID:299256 KEGG:rno:299256 UCSC:RGD:1309687 NextBio:645098
Uniprot:D3Z9E6
Length = 782
Score = 385 (140.6 bits), Expect = 5.1e-36, Sum P(2) = 5.1e-36
Identities = 106/379 (27%), Positives = 189/379 (49%)
Query: 1 MKTLKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVES-----DQIDLL 55
+ TL G QE C +L+ ++DCG S VD+++S QID +
Sbjct: 7 LTTLSGV-QEESALCYLLQVDEFRFLLDCGWDEHFS-------VDIIDSLRKHVHQIDAV 58
Query: 56 LISHFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM-LYTES 114
L+SH H GALP+ + K G + T + + + D + S +TE L+T
Sbjct: 59 LLSHPDPLHLGALPFAVGKLGLNCAIYATIPVYKMGQMFMYDLYQ-SRHNTEDFTLFTLD 117
Query: 115 DLEKSMDKIETINFHEEKDV----NGIKFSAYNAGHVLGAAMFLIEIAGVK-ILYTGDFS 169
D++ + DKI+ + F + ++ +G+ + AGH++G ++ I G + I+Y DF+
Sbjct: 118 DVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFN 177
Query: 170 RQEDRHLMAAEIPPV-KPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPV 228
+ + HL + + +P +LIT+S T+V +R++R+ + + + + + G LI V
Sbjct: 178 HKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAV 237
Query: 229 FALGRAQELLLILDEYWSLHPE-LHDIPIYYASSLAKKCMSVYQTYINAMNDRIRR--QI 285
GR EL +LD+ W L + ++++ + ++ + M+D++ R +
Sbjct: 238 DTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFED 297
Query: 286 SINNPFVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAG 344
NNPF F+H+S G+ + P VV+AS ++ G SR+LF WC D KN +I+
Sbjct: 298 KRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTY 357
Query: 345 YCVEGTLAKTILSEPEEVI 363
GTLA+ ++ P E +
Sbjct: 358 RTTPGTLARFLIDNPSEKV 376
Score = 74 (31.1 bits), Expect = 5.1e-36, Sum P(2) = 5.1e-36
Identities = 22/115 (19%), Positives = 53/115 (46%)
Query: 330 EMWCTDAKNGVIIAGYCV-EGTLAKTILSEPEEVIGMSGQRLPLKMSVDYISFSAHTDYQ 388
E+ T+ + + +G E + + + P + + + + + +K V YI + +D
Sbjct: 489 ELQATEEEKSKLESGLTNGEEPMDQDLSDVPTKCVSAT-ESIEIKARVTYIDYEGRSDGD 547
Query: 389 QTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMELYNPRNTVSVD 443
+ + +++P +++VHG E S+ A R + +++Y P+ +VD
Sbjct: 548 SIKKIINQMKPRQLIIVHGPP-EASQDLAECCRAFG---GKDIKVYMPKLHETVD 598
>MGI|MGI:1861601 [details] [associations]
symbol:Cpsf2 "cleavage and polyadenylation specific factor
2" species:10090 "Mus musculus" [GO:0003674 "molecular_function"
evidence=ND] [GO:0003723 "RNA binding" evidence=IEA] [GO:0005634
"nucleus" evidence=IEA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=ISO;IDA]
[GO:0006397 "mRNA processing" evidence=IEA] [GO:0006398 "histone
mRNA 3'-end processing" evidence=ISO] [GO:0016787 "hydrolase
activity" evidence=IEA] InterPro:IPR001279 InterPro:IPR027075
Pfam:PF00753 SMART:SM00849 Pfam:PF07521 MGI:MGI:1861601
GO:GO:0003723 GO:GO:0016787 GO:GO:0005847 eggNOG:COG1236 KO:K14402
InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108
PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
GeneTree:ENSGT00700000104551 HOGENOM:HOG000264343 OMA:NNPFQFK
CTD:53981 HOVERGEN:HBG051106 OrthoDB:EOG4MCWZQ GO:GO:0006398
EMBL:AF012822 EMBL:BC013628 EMBL:BC007163 IPI:IPI00314302
RefSeq:NP_058552.1 UniGene:Mm.716 ProteinModelPortal:O35218
SMR:O35218 STRING:O35218 PhosphoSite:O35218 PaxDb:O35218
PRIDE:O35218 Ensembl:ENSMUST00000047357 GeneID:51786 KEGG:mmu:51786
UCSC:uc007otx.2 InParanoid:O35218 NextBio:308008 Bgee:O35218
CleanEx:MM_CPSF2 Genevestigator:O35218
GermOnline:ENSMUSG00000041781 Uniprot:O35218
Length = 782
Score = 384 (140.2 bits), Expect = 6.7e-36, Sum P(2) = 6.7e-36
Identities = 106/379 (27%), Positives = 189/379 (49%)
Query: 1 MKTLKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVES-----DQIDLL 55
+ TL G QE C +L+ ++DCG S VD+++S QID +
Sbjct: 7 LTTLSGV-QEESALCYLLQVDEFRFLLDCGWDEHFS-------VDIIDSLRKHVHQIDAV 58
Query: 56 LISHFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM-LYTES 114
L+SH H GALP+ + K G + T + + + D + S +TE L+T
Sbjct: 59 LLSHPDPLHLGALPFAVGKLGLNCAIYATIPVYKMGQMFMYDLYQ-SRHNTEDFTLFTLD 117
Query: 115 DLEKSMDKIETINFHEEKDV----NGIKFSAYNAGHVLGAAMFLIEIAGVK-ILYTGDFS 169
D++ + DKI+ + F + ++ +G+ + AGH++G ++ I G + I+Y DF+
Sbjct: 118 DVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFN 177
Query: 170 RQEDRHLMAAEIPPV-KPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPV 228
+ + HL + + +P +LIT+S T+V +R++R+ + + + + + G LI V
Sbjct: 178 HKREIHLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGNVLIAV 237
Query: 229 FALGRAQELLLILDEYWSLHPE-LHDIPIYYASSLAKKCMSVYQTYINAMNDRIRR--QI 285
GR EL +LD+ W L + ++++ + ++ + M+D++ R +
Sbjct: 238 DTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFED 297
Query: 286 SINNPFVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAG 344
NNPF F+H+S G+ + P VV+AS ++ G SR+LF WC D KN +I+
Sbjct: 298 KRNNPFQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTY 357
Query: 345 YCVEGTLAKTILSEPEEVI 363
GTLA+ ++ P E +
Sbjct: 358 RTTPGTLARFLIDNPTEKV 376
Score = 74 (31.1 bits), Expect = 6.7e-36, Sum P(2) = 6.7e-36
Identities = 22/115 (19%), Positives = 53/115 (46%)
Query: 330 EMWCTDAKNGVIIAGYCV-EGTLAKTILSEPEEVIGMSGQRLPLKMSVDYISFSAHTDYQ 388
E+ T+ + + +G E + + + P + + + + + +K V YI + +D
Sbjct: 489 ELQATEEEKSKLESGLTNGEEPMDQDLSDVPTKCVSAT-ESIEIKARVTYIDYEGRSDGD 547
Query: 389 QTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMELYNPRNTVSVD 443
+ + +++P +++VHG E S+ A R + +++Y P+ +VD
Sbjct: 548 SIKKIINQMKPRQLIIVHGPP-EASQDLAECCRAFG---GKDIKVYMPKLHETVD 598
>UNIPROTKB|F1SD85 [details] [associations]
symbol:CPSF2 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0016787 "hydrolase activity" evidence=IEA] [GO:0006379
"mRNA cleavage" evidence=IEA] [GO:0006378 "mRNA polyadenylation"
evidence=IEA] [GO:0005847 "mRNA cleavage and polyadenylation
specificity factor complex" evidence=IEA] InterPro:IPR001279
InterPro:IPR027075 Pfam:PF00753 SMART:SM00849 GO:GO:0016787
InterPro:IPR022712 PANTHER:PTHR11203:SF5 Pfam:PF10996 SMART:SM01027
GeneTree:ENSGT00700000104551 EMBL:CU468363
Ensembl:ENSSSCT00000002717 OMA:GANDESP Uniprot:F1SD85
Length = 385
Score = 389 (142.0 bits), Expect = 1.2e-35, P = 1.2e-35
Identities = 106/374 (28%), Positives = 187/374 (50%)
Query: 1 MKTLKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHF 60
+ TL G QE C +L+ ++DCG S MD + + QID +L+SH
Sbjct: 7 LTTLSGV-QEESALCYLLQVDEFRFLLDCGWDEHFS-MDIIDSLRK-HVHQIDAVLLSHP 63
Query: 61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM-LYTESDLEKS 119
H GALP+ + K G + T + + + D + S +TE L+T D++ +
Sbjct: 64 DPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQ-SRHNTEDFTLFTLDDVDAA 122
Query: 120 MDKIETINFHEEKDV----NGIKFSAYNAGHVLGAAMFLIEIAGVK-ILYTGDFSRQEDR 174
DKI+ + F + ++ +G+ + AGH++G ++ I G + I+Y DF+ + +
Sbjct: 123 FDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAVDFNHKREI 182
Query: 175 HLMAAEIPPV-KPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGR 233
HL + + +P +LIT+S T+V +R++R+ + + + + + G LI V GR
Sbjct: 183 HLNGCSLEMLSRPSLLITDSFNATYVQPRRKQRDEQLLTNVLETLRGDGSVLIAVDTAGR 242
Query: 234 AQELLLILDEYWSLHPE-LHDIPIYYASSLAKKCMSVYQTYINAMNDRIRR--QISINNP 290
EL +LD+ W L + ++++ + ++ + M+D++ R + NNP
Sbjct: 243 VLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNP 302
Query: 291 FVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEG 349
F F+H+S G+ + P VV+AS ++ G SR+LF WC D KN +I+ G
Sbjct: 303 FQFRHLSLCHGLSDLARVPSPKVVLASQPDLECGFSRDLFIQWCQDPKNSIILTYRTTPG 362
Query: 350 TLAKTILSEPEEVI 363
TLA+ ++ P E I
Sbjct: 363 TLARFLIDNPSEKI 376
>ZFIN|ZDB-GENE-040718-79 [details] [associations]
symbol:cpsf2 "cleavage and polyadenylation specific
factor 2" species:7955 "Danio rerio" [GO:0006378 "mRNA
polyadenylation" evidence=IEA] [GO:0016787 "hydrolase activity"
evidence=IEA] [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0005847
"mRNA cleavage and polyadenylation specificity factor complex"
evidence=IEA] InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 ZFIN:ZDB-GENE-040718-79 GO:GO:0016787
eggNOG:COG1236 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
SMART:SM01027 HOGENOM:HOG000264343 CTD:53981 HOVERGEN:HBG051106
OrthoDB:EOG4MCWZQ EMBL:BC076029 IPI:IPI00512505
RefSeq:NP_001002384.1 UniGene:Dr.121547 ProteinModelPortal:Q6DHE5
STRING:Q6DHE5 PRIDE:Q6DHE5 GeneID:436657 KEGG:dre:436657
InParanoid:Q6DHE5 NextBio:20831102 ArrayExpress:Q6DHE5 Bgee:Q6DHE5
Uniprot:Q6DHE5
Length = 790
Score = 380 (138.8 bits), Expect = 3.5e-35, Sum P(2) = 3.5e-35
Identities = 103/372 (27%), Positives = 185/372 (49%)
Query: 1 MKTLKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHF 60
+ L G QE C +L+ ++DCG S MD + + Q+D +L+SH
Sbjct: 7 LTALSGV-QEESALCYLLQVDEFRFLLDCGWDETFS-MDIIDSLKRYVH-QVDAVLLSHP 63
Query: 61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM-LYTESDLEKS 119
H GALP+ + K G + T + + + D + S +TE L+T D++ +
Sbjct: 64 DHVHLGALPYAVGKLGLNCTIYATIPVYKMGQMFMYDLYQ-SRHNTEDFTLFTLDDVDSA 122
Query: 120 MDKIETINFHEEKDV----NGIKFSAYNAGHVLGAAMFLIEIAGVK-ILYTGDFSRQEDR 174
DKI+ + + + ++ +G+ + AGH++G ++ I G + I+Y DF+ + +
Sbjct: 123 FDKIQQLKYSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIIYGVDFNHKREI 182
Query: 175 HLMAAEIPPV-KPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGR 233
HL + + +P +LIT+S ++V +R++R+ + + + + + G LI V GR
Sbjct: 183 HLNGCSLESLSRPSLLITDSFNASYVQPRRKQRDEQLLTNVMETLRGDGNVLIAVDTAGR 242
Query: 234 AQELLLILDEYWSLHPE-LHDIPIYYASSLAKKCMSVYQTYINAMNDRIRR--QISINNP 290
EL +LD+ W L + ++++ + ++ + M+D++ R + NNP
Sbjct: 243 VLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKSQVEWMSDKLMRCFEDKRNNP 302
Query: 291 FVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEG 349
F F+H+S + + P VV+ S ++SG SRELF WC DAKN VI+ G
Sbjct: 303 FQFRHLSLCHSLSDLARVPSPKVVLCSQPDLESGFSRELFIQWCQDAKNSVILTYRTTPG 362
Query: 350 TLAKTILSEPEE 361
TLA+ ++ P E
Sbjct: 363 TLARYLIDNPGE 374
Score = 72 (30.4 bits), Expect = 3.5e-35, Sum P(2) = 3.5e-35
Identities = 17/76 (22%), Positives = 36/76 (47%)
Query: 368 QRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDP 427
Q L ++ V YI + +D + + +++P +++VHG + L A + Y
Sbjct: 528 QTLDIRARVMYIDYEGRSDGDSIKKIINQMKPRQLIIVHGPPDASQDL-AESCKAYS--- 583
Query: 428 NTSMELYNPRNTVSVD 443
+++Y P+ +VD
Sbjct: 584 GKDIKVYIPKLQETVD 599
>WB|WBGene00017313 [details] [associations]
symbol:cpsf-2 species:6239 "Caenorhabditis elegans"
[GO:0016787 "hydrolase activity" evidence=IEA] [GO:0009792 "embryo
development ending in birth or egg hatching" evidence=IMP]
[GO:0051301 "cell division" evidence=IMP] [GO:0000910 "cytokinesis"
evidence=IMP] [GO:0000003 "reproduction" evidence=IMP] [GO:0040035
"hermaphrodite genitalia development" evidence=IMP] [GO:0016246
"RNA interference" evidence=IMP] [GO:0040027 "negative regulation
of vulval development" evidence=IMP] InterPro:IPR001279
InterPro:IPR027075 SMART:SM00849 Pfam:PF07521 GO:GO:0005634
GO:GO:0009792 GO:GO:0016246 GO:GO:0006397 GO:GO:0003723
GO:GO:0016787 GO:GO:0000910 GO:GO:0040035 GO:GO:0040027
eggNOG:COG1236 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
SMART:SM01027 GeneTree:ENSGT00700000104551 HOGENOM:HOG000264343
OMA:NNPFQFK EMBL:FO080529 PIR:T32487 RefSeq:NP_504822.1
ProteinModelPortal:O17403 SMR:O17403 STRING:O17403 PaxDb:O17403
EnsemblMetazoa:F09G2.4 GeneID:179103 KEGG:cel:CELE_F09G2.4
CTD:179103 WormBase:F09G2.4 InParanoid:O17403 NextBio:903938
Uniprot:O17403
Length = 843
Score = 383 (139.9 bits), Expect = 1.2e-34, Sum P(2) = 1.2e-34
Identities = 107/374 (28%), Positives = 186/374 (49%)
Query: 1 MKTLKGAGQEVGRSCIMLEFKNKSIMMDCGIHP--GLSGMDAL-PFVDLVESDQIDLLLI 57
+K GA E G C +L+ I++DCG GL + L PF+ +I +LI
Sbjct: 7 LKVFSGAKDE-GPLCYLLQVDGDYILLDCGWDERFGLQYFEELKPFIP-----KISAVLI 60
Query: 58 SHFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQML-YTESDL 116
SH H G LP+ + K G + T + + + D + S++ E+ YT D+
Sbjct: 61 SHPDPLHLGGLPYLVSKCGLTAPVYATVPVYKMGQMFIYDMV-YSHLDVEEFEHYTLDDV 119
Query: 117 EKSMDKIETINFHEE---KDVNGIKFSAYNAGHVLGAAMFLI-EIAGVKILYTGDFSRQE 172
+ + +K+E + +++ K +G+ F+A AGH+LG +++ I + G I+Y DF+ ++
Sbjct: 120 DTAFEKVEQVKYNQTVVLKGDSGVHFTALPAGHMLGGSIWRICRVTGEDIVYCVDFNHKK 179
Query: 173 DRHLMAAEIPPV-KPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFAL 231
+RHL +P +LIT + + + +R++R+ + + I V + G C+I +
Sbjct: 180 ERHLNGCSFDNFNRPHLLITGAHHISLPQMRRKDRDEQLVTKILRTVRQKGDCMIVIDTA 239
Query: 232 GRAQELLLILDEYWS-LHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISIN-- 288
GR EL +LD+ WS L + S +A + ++ + MN+++ + S +
Sbjct: 240 GRVLELAHLLDQLWSNADAGLSTYNLVMMSHVASSVVQFAKSQLEWMNEKLFKYDSSSAR 299
Query: 289 -NPFVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYC 346
NPF KH++ + P VV+ S M+SG SRELF WC+D +NGVI+
Sbjct: 300 YNPFTLKHVTLCHSHQELMRVRSPKVVLCSSQDMESGFSRELFLDWCSDPRNGVILTARP 359
Query: 347 VEGTLAKTILSEPE 360
TLA +++ E
Sbjct: 360 ASFTLAAKLVNMAE 373
Score = 65 (27.9 bits), Expect = 1.2e-34, Sum P(2) = 1.2e-34
Identities = 11/49 (22%), Positives = 27/49 (55%)
Query: 369 RLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKA 417
R+ + +++I + +D + T + + L P +++VHG +++ L A
Sbjct: 562 RVEVSCRIEFIEYEGISDGESTKKLLAGLLPRQIIVVHGSRDDTRDLVA 610
>UNIPROTKB|O17403 [details] [associations]
symbol:cpsf-2 "Probable cleavage and polyadenylation
specificity factor subunit 2" species:6239 "Caenorhabditis elegans"
[GO:0005575 "cellular_component" evidence=ND] [GO:0003674
"molecular_function" evidence=ND] InterPro:IPR001279
InterPro:IPR027075 SMART:SM00849 Pfam:PF07521 GO:GO:0005634
GO:GO:0009792 GO:GO:0016246 GO:GO:0006397 GO:GO:0003723
GO:GO:0016787 GO:GO:0000910 GO:GO:0040035 GO:GO:0040027
eggNOG:COG1236 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
SMART:SM01027 GeneTree:ENSGT00700000104551 HOGENOM:HOG000264343
OMA:NNPFQFK EMBL:FO080529 PIR:T32487 RefSeq:NP_504822.1
ProteinModelPortal:O17403 SMR:O17403 STRING:O17403 PaxDb:O17403
EnsemblMetazoa:F09G2.4 GeneID:179103 KEGG:cel:CELE_F09G2.4
CTD:179103 WormBase:F09G2.4 InParanoid:O17403 NextBio:903938
Uniprot:O17403
Length = 843
Score = 383 (139.9 bits), Expect = 1.2e-34, Sum P(2) = 1.2e-34
Identities = 107/374 (28%), Positives = 186/374 (49%)
Query: 1 MKTLKGAGQEVGRSCIMLEFKNKSIMMDCGIHP--GLSGMDAL-PFVDLVESDQIDLLLI 57
+K GA E G C +L+ I++DCG GL + L PF+ +I +LI
Sbjct: 7 LKVFSGAKDE-GPLCYLLQVDGDYILLDCGWDERFGLQYFEELKPFIP-----KISAVLI 60
Query: 58 SHFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQML-YTESDL 116
SH H G LP+ + K G + T + + + D + S++ E+ YT D+
Sbjct: 61 SHPDPLHLGGLPYLVSKCGLTAPVYATVPVYKMGQMFIYDMV-YSHLDVEEFEHYTLDDV 119
Query: 117 EKSMDKIETINFHEE---KDVNGIKFSAYNAGHVLGAAMFLI-EIAGVKILYTGDFSRQE 172
+ + +K+E + +++ K +G+ F+A AGH+LG +++ I + G I+Y DF+ ++
Sbjct: 120 DTAFEKVEQVKYNQTVVLKGDSGVHFTALPAGHMLGGSIWRICRVTGEDIVYCVDFNHKK 179
Query: 173 DRHLMAAEIPPV-KPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFAL 231
+RHL +P +LIT + + + +R++R+ + + I V + G C+I +
Sbjct: 180 ERHLNGCSFDNFNRPHLLITGAHHISLPQMRRKDRDEQLVTKILRTVRQKGDCMIVIDTA 239
Query: 232 GRAQELLLILDEYWS-LHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISIN-- 288
GR EL +LD+ WS L + S +A + ++ + MN+++ + S +
Sbjct: 240 GRVLELAHLLDQLWSNADAGLSTYNLVMMSHVASSVVQFAKSQLEWMNEKLFKYDSSSAR 299
Query: 289 -NPFVFKHISNLKGIDHFEDI-GPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYC 346
NPF KH++ + P VV+ S M+SG SRELF WC+D +NGVI+
Sbjct: 300 YNPFTLKHVTLCHSHQELMRVRSPKVVLCSSQDMESGFSRELFLDWCSDPRNGVILTARP 359
Query: 347 VEGTLAKTILSEPE 360
TLA +++ E
Sbjct: 360 ASFTLAAKLVNMAE 373
Score = 65 (27.9 bits), Expect = 1.2e-34, Sum P(2) = 1.2e-34
Identities = 11/49 (22%), Positives = 27/49 (55%)
Query: 369 RLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKA 417
R+ + +++I + +D + T + + L P +++VHG +++ L A
Sbjct: 562 RVEVSCRIEFIEYEGISDGESTKKLLAGLLPRQIIVVHGSRDDTRDLVA 610
>UNIPROTKB|Q9KV92 [details] [associations]
symbol:VC_0264 "Putative uncharacterized protein"
species:243277 "Vibrio cholerae O1 biovar El Tor str. N16961"
[GO:0003674 "molecular_function" evidence=ND] [GO:0005575
"cellular_component" evidence=ND] [GO:0008150 "biological_process"
evidence=ND] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 EMBL:AE003852 GenomeReviews:AE003852_GR GO:GO:0016787
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
KO:K07576 OMA:CHIDHVG PIR:F82345 RefSeq:NP_229920.1
ProteinModelPortal:Q9KV92 DNASU:2614470 GeneID:2614470
KEGG:vch:VC0264 PATRIC:20079570 ProtClustDB:CLSK2517501
Uniprot:Q9KV92
Length = 455
Score = 376 (137.4 bits), Expect = 3.5e-34, P = 3.5e-34
Identities = 115/435 (26%), Positives = 201/435 (46%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
G V SC L +++++DCG+ G D P +D L+++H H+DH
Sbjct: 24 GGKASVTGSCHELRADGQALLIDCGL---FQGADERPLAVEFALGHVDALILTHAHIDHI 80
Query: 66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKV----SNISTEQMLYTESDLEKSMD 121
G LPW LL G K + T AT + +L D +K+ S +E++L L + D
Sbjct: 81 GRLPW-LLAAGLKQPIYSTAATAELVPLMLEDGLKLQLGMSPKQSERVLTEVRRLLRVQD 139
Query: 122 KIETINFHEEK-DVNGIKFSAYNAGHVLGAAMFLIEIA-GVKILYTGDFSRQEDRHLMAA 179
+ ++ D ++F AGH+LG+A I G ++++GD L+
Sbjct: 140 YQKWFAVQPKRADSLWVRFQP--AGHILGSAYVEIRRPNGEVVVFSGDLGPSHTP-LLPD 196
Query: 180 EIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLL 239
P + D L E+TYG HE + R R ++I + GG LIP F++GR QELL
Sbjct: 197 PQSPERADYLFIETTYGDKQHEDVQSRGQRLRAMIERSLTDGGAILIPAFSVGRTQELLF 256
Query: 240 ILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISIN-NPFVFK---- 294
+++ ++PI S +A++ Y+ + + ++ ++ +P F+
Sbjct: 257 DIEQLIFSQQIDANLPIILDSPMAQRVTRSYRRFKQLWGREAKARLQMHRHPLAFEQCIT 316
Query: 295 ---HISNLKGIDHFEDIGPC-VVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGT 350
H ++ + ++ G +V+A+ GM Q G + + D + +I+AG+ EGT
Sbjct: 317 VEDHRTHERLVNRLASTGEAAIVVAASGMCQGGRIMDYLKALLPDKRTDLILAGFQAEGT 376
Query: 351 LAKTILS-EPEEVIGMSGQRLPLKMSVDYIS-FSAHTDYQQTSEFVREL--RPAHVVLVH 406
L ++I S +P + + G + + + +S +SAH D F+ + +P V L+H
Sbjct: 377 LGRSIQSGQPS--VWIEGTEVEVNAHIHTMSGYSAHADKADLLRFITGIPEKPKQVHLIH 434
Query: 407 GEQNEMSRLKAALTR 421
GE A LT+
Sbjct: 435 GEAPAKQAFAAELTQ 449
>TIGR_CMR|VC_0264 [details] [associations]
symbol:VC_0264 "conserved hypothetical protein" species:686
"Vibrio cholerae O1 biovar El Tor" [GO:0003674 "molecular_function"
evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
[GO:0008150 "biological_process" evidence=ND] InterPro:IPR001279
Pfam:PF00753 SMART:SM00849 Pfam:PF07521 EMBL:AE003852
GenomeReviews:AE003852_GR GO:GO:0016787 InterPro:IPR022712
InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 KO:K07576 OMA:CHIDHVG
PIR:F82345 RefSeq:NP_229920.1 ProteinModelPortal:Q9KV92
DNASU:2614470 GeneID:2614470 KEGG:vch:VC0264 PATRIC:20079570
ProtClustDB:CLSK2517501 Uniprot:Q9KV92
Length = 455
Score = 376 (137.4 bits), Expect = 3.5e-34, P = 3.5e-34
Identities = 115/435 (26%), Positives = 201/435 (46%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
G V SC L +++++DCG+ G D P +D L+++H H+DH
Sbjct: 24 GGKASVTGSCHELRADGQALLIDCGL---FQGADERPLAVEFALGHVDALILTHAHIDHI 80
Query: 66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKV----SNISTEQMLYTESDLEKSMD 121
G LPW LL G K + T AT + +L D +K+ S +E++L L + D
Sbjct: 81 GRLPW-LLAAGLKQPIYSTAATAELVPLMLEDGLKLQLGMSPKQSERVLTEVRRLLRVQD 139
Query: 122 KIETINFHEEK-DVNGIKFSAYNAGHVLGAAMFLIEIA-GVKILYTGDFSRQEDRHLMAA 179
+ ++ D ++F AGH+LG+A I G ++++GD L+
Sbjct: 140 YQKWFAVQPKRADSLWVRFQP--AGHILGSAYVEIRRPNGEVVVFSGDLGPSHTP-LLPD 196
Query: 180 EIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLL 239
P + D L E+TYG HE + R R ++I + GG LIP F++GR QELL
Sbjct: 197 PQSPERADYLFIETTYGDKQHEDVQSRGQRLRAMIERSLTDGGAILIPAFSVGRTQELLF 256
Query: 240 ILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISIN-NPFVFK---- 294
+++ ++PI S +A++ Y+ + + ++ ++ +P F+
Sbjct: 257 DIEQLIFSQQIDANLPIILDSPMAQRVTRSYRRFKQLWGREAKARLQMHRHPLAFEQCIT 316
Query: 295 ---HISNLKGIDHFEDIGPC-VVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGT 350
H ++ + ++ G +V+A+ GM Q G + + D + +I+AG+ EGT
Sbjct: 317 VEDHRTHERLVNRLASTGEAAIVVAASGMCQGGRIMDYLKALLPDKRTDLILAGFQAEGT 376
Query: 351 LAKTILS-EPEEVIGMSGQRLPLKMSVDYIS-FSAHTDYQQTSEFVREL--RPAHVVLVH 406
L ++I S +P + + G + + + +S +SAH D F+ + +P V L+H
Sbjct: 377 LGRSIQSGQPS--VWIEGTEVEVNAHIHTMSGYSAHADKADLLRFITGIPEKPKQVHLIH 434
Query: 407 GEQNEMSRLKAALTR 421
GE A LT+
Sbjct: 435 GEAPAKQAFAAELTQ 449
>FB|FBgn0027873 [details] [associations]
symbol:Cpsf100 "Cleavage and polyadenylation specificity
factor 100" species:7227 "Drosophila melanogaster" [GO:0006379
"mRNA cleavage" evidence=ISS;NAS] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=ISS]
[GO:0006378 "mRNA polyadenylation" evidence=ISS;IMP;NAS]
[GO:0003730 "mRNA 3'-UTR binding" evidence=NAS] [GO:0016787
"hydrolase activity" evidence=IEA] [GO:0006398 "histone mRNA 3'-end
processing" evidence=IMP] [GO:0022008 "neurogenesis" evidence=IMP]
InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 EMBL:AE014297 GO:GO:0022008 GO:GO:0006378
GO:GO:0003723 GO:GO:0016787 GO:GO:0005847 GO:GO:0006379
eggNOG:COG1236 KO:K14402 InterPro:IPR022712 InterPro:IPR025069
InterPro:IPR011108 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
SMART:SM01027 GeneTree:ENSGT00700000104551 OMA:NNPFQFK
GO:GO:0006398 EMBL:AF160933 RefSeq:NP_651658.1 RefSeq:NP_733264.1
UniGene:Dm.1362 ProteinModelPortal:Q9V3D6 SMR:Q9V3D6 IntAct:Q9V3D6
STRING:Q9V3D6 PaxDb:Q9V3D6 PRIDE:Q9V3D6 EnsemblMetazoa:FBtr0085357
GeneID:43426 KEGG:dme:Dmel_CG1957 UCSC:CG1957-RA CTD:43426
FlyBase:FBgn0027873 InParanoid:Q8IML7 OrthoDB:EOG4XD261
PhylomeDB:Q9V3D6 GenomeRNAi:43426 NextBio:833860 Bgee:Q9V3D6
GermOnline:CG1957 Uniprot:Q9V3D6
Length = 756
Score = 354 (129.7 bits), Expect = 6.9e-32, Sum P(2) = 6.9e-32
Identities = 100/369 (27%), Positives = 184/369 (49%)
Query: 1 MKTLKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVES-DQIDLLLISH 59
+ T+ GA E C +L+ + I++DCG DA +L +D +L+SH
Sbjct: 7 LHTISGAMDE-SPPCYILQIDDVRILLDCGWD---EKFDANFIKELKRQVHTLDAVLLSH 62
Query: 60 FHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSD-YIKVSNISTEQMLYTESDLEK 118
H GALP+ + K G + T + + + D Y+ N+ L++ D++
Sbjct: 63 PDAYHLGALPYLVGKLGLNCPIYATIPVFKMGQMFMYDLYMSHFNMGDFD-LFSLDDVDT 121
Query: 119 SMDKIETINFHEE---KDVN-GIKFSAYNAGHVLGAAMF-LIEIAGVKILYTGDFSRQED 173
+ +KI + +++ KD GI + NAGH++G ++ ++++ I+Y DF+ +++
Sbjct: 122 AFEKITQLKYNQTVSLKDKGYGISITPLNAGHMIGGTIWKIVKVGEEDIVYATDFNHKKE 181
Query: 174 RHLMAAEIPPV-KPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALG 232
RHL E+ + +P +LIT++ + +R R+ + + I V G LI V G
Sbjct: 182 RHLSGCELDRLQRPSLLITDAYNAQYQQARRRARDEKLMTNILQTVRNNGNVLIAVDTAG 241
Query: 233 RAQELLLILDEYW-SLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQI--SINN 289
R EL +LD+ W + L + ++++ + ++ I M+D++ + + NN
Sbjct: 242 RVLELAHMLDQLWKNKESGLMAYSLALLNNVSYNVIEFAKSQIEWMSDKLTKAFEGARNN 301
Query: 290 PFVFKHISNLKGI-DHFE-DIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCV 347
PF FKHI + D ++ GP VV+AS ++SG +R+LF W ++A N +I+
Sbjct: 302 PFQFKHIQLCHSLADVYKLPAGPKVVLASTPDLESGFTRDLFVQWASNANNSIILTTRTS 361
Query: 348 EGTLAKTIL 356
GTLA ++
Sbjct: 362 PGTLAMELV 370
Score = 69 (29.3 bits), Expect = 6.9e-32, Sum P(2) = 6.9e-32
Identities = 20/90 (22%), Positives = 43/90 (47%)
Query: 355 ILSEPEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSR 414
+L +P ++I + + + V I F +D + + + +LRP V+++HG E ++
Sbjct: 526 LLEKPTKLISQR-KTIEVNAQVQRIDFEGRSDGESMLKILSQLRPRRVIVIHGTA-EGTQ 583
Query: 415 LKAALTREYEDDPNTSMELYNPRNTVSVDL 444
+ A R E N ++ P+ +D+
Sbjct: 584 VVA---RHCEQ--NVGARVFTPQKGEIIDV 608
>TAIR|locus:2172843 [details] [associations]
symbol:CPSF100 "cleavage and polyadenylation specificity
factor 100" species:3702 "Arabidopsis thaliana" [GO:0005634
"nucleus" evidence=ISM;IDA] [GO:0009793 "embryo development ending
in seed dormancy" evidence=NAS] [GO:0016787 "hydrolase activity"
evidence=IEA] [GO:0003677 "DNA binding" evidence=IPI] [GO:0005515
"protein binding" evidence=IPI] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=ISS;NAS]
[GO:0006378 "mRNA polyadenylation" evidence=ISS] [GO:0006379 "mRNA
cleavage" evidence=ISS] [GO:0035194 "posttranscriptional gene
silencing by RNA" evidence=IMP] [GO:0009506 "plasmodesma"
evidence=IDA] [GO:0000278 "mitotic cell cycle" evidence=RCA]
[GO:0006306 "DNA methylation" evidence=RCA] [GO:0006342 "chromatin
silencing" evidence=RCA] [GO:0006396 "RNA processing" evidence=RCA]
[GO:0006397 "mRNA processing" evidence=RCA] [GO:0007267 "cell-cell
signaling" evidence=RCA] [GO:0009220 "pyrimidine ribonucleotide
biosynthetic process" evidence=RCA] [GO:0009616 "virus induced gene
silencing" evidence=RCA] [GO:0009640 "photomorphogenesis"
evidence=RCA] [GO:0010267 "production of ta-siRNAs involved in RNA
interference" evidence=RCA] [GO:0010388 "cullin deneddylation"
evidence=RCA] [GO:0016569 "covalent chromatin modification"
evidence=RCA] [GO:0031047 "gene silencing by RNA" evidence=RCA]
[GO:0035196 "production of miRNAs involved in gene silencing by
miRNA" evidence=RCA] InterPro:IPR001279 InterPro:IPR027075
Pfam:PF00753 SMART:SM00849 GO:GO:0009506 EMBL:CP002688
GenomeReviews:BA000015_GR GO:GO:0006378 EMBL:AB005244 GO:GO:0003723
GO:GO:0016787 GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236 KO:K14402
InterPro:IPR022712 InterPro:IPR025069 PANTHER:PTHR11203:SF5
Pfam:PF10996 Pfam:PF13299 SMART:SM01027 EMBL:AF283277 EMBL:AY034982
EMBL:BT004374 IPI:IPI00521104 RefSeq:NP_197776.1 UniGene:At.25191
ProteinModelPortal:Q9LKF9 SMR:Q9LKF9 IntAct:Q9LKF9 STRING:Q9LKF9
PaxDb:Q9LKF9 PRIDE:Q9LKF9 EnsemblPlants:AT5G23880.1 GeneID:832453
KEGG:ath:AT5G23880 TAIR:At5g23880 HOGENOM:HOG000264343
InParanoid:Q9LKF9 OMA:NNPFQFK PhylomeDB:Q9LKF9
ProtClustDB:CLSN2686300 Genevestigator:Q9LKF9 GermOnline:AT5G23880
GO:GO:0035194 Uniprot:Q9LKF9
Length = 739
Score = 373 (136.4 bits), Expect = 1.8e-31, P = 1.8e-31
Identities = 112/411 (27%), Positives = 209/411 (50%)
Query: 24 SIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGFKGRCFM 83
+ ++DCG + L L + V S ID +L+SH H GALP+ + + G +
Sbjct: 29 NFLIDCGWND-LFDTSLLEPLSRVAST-IDAVLLSHPDTLHIGALPYAMKQLGLSAPVY- 85
Query: 84 THATKAIYRW-LLSDYIKVSNISTEQM----LYTESDLEKSMDKIETI----NFHEEKDV 134
AT+ ++R LL+ Y + +S +Q+ L+T D++ + + + N+H
Sbjct: 86 --ATEPVHRLGLLTMYDQF--LSRKQVSDFDLFTLDDIDSAFQNVIRLTYSQNYHLSGKG 141
Query: 135 NGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPP-VKPDILITES 193
GI + + AGH+LG +++ I G ++Y D++ +++RHL + V+P +LIT++
Sbjct: 142 EGIVIAPHVAGHMLGGSIWRITKDGEDVIYAVDYNHRKERHLNGTVLQSFVRPAVLITDA 201
Query: 194 TYGTHVHEQ-REEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELH 252
+ + ++ R++R+ F I + GG L+PV GR ELLLIL+++WS
Sbjct: 202 YHALYTNQTARQQRDKEFLDTISKHLEVGGNVLLPVDTAGRVLELLLILEQHWSQRG--F 259
Query: 253 DIPIYYASSLAKKCMSVYQTYINAMNDRIRR--QISINNPFVFKHIS---NLKGIDHFED 307
PIY+ + ++ + ++++ M+D I + + S +N F+ +H++ N +D+
Sbjct: 260 SFPIYFLTYVSSSTIDYVKSFLEWMSDSISKSFETSRDNAFLLRHVTLLINKTDLDNAPP 319
Query: 308 IGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEV---IG 364
GP VV+AS +++G +RE+F W D +N V+ GTLA+ + S P +
Sbjct: 320 -GPKVVLASMASLEAGFAREIFVEWANDPRNLVLFTETGQFGTLARMLQSAPPPKFVKVT 378
Query: 365 MSGQRLPLKMSVDYISFSAHTDYQQTSEFVRE--LRPAHVVLVHGEQNEMS 413
MS +R+PL + I++ + + E +R ++ HG + S
Sbjct: 379 MS-KRVPLA-GEELIAYEEEQNRLKREEALRASLVKEEETKASHGSDDNSS 427
>TIGR_CMR|DET_1061 [details] [associations]
symbol:DET_1061 "metallo-beta-lactamase family protein"
species:243164 "Dehalococcoides ethenogenes 195" [GO:0003824
"catalytic activity" evidence=ISS] [GO:0008152 "metabolic process"
evidence=ISS] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 GO:GO:0016787 EMBL:CP000027 GenomeReviews:CP000027_GR
eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 OMA:MAVEYMS HOGENOM:HOG000244774 KO:K07576
RefSeq:YP_181776.1 ProteinModelPortal:Q3Z7M3 STRING:Q3Z7M3
GeneID:3229629 KEGG:det:DET1061 PATRIC:21609167
ProtClustDB:CLSK2516599 BioCyc:DETH243164:GJNF-1062-MONOMER
Uniprot:Q3Z7M3
Length = 468
Score = 267 (99.0 bits), Expect = 4.3e-30, Sum P(2) = 4.3e-30
Identities = 84/321 (26%), Positives = 148/321 (46%)
Query: 110 LYTESDLEKSMDKIETINFHEEKDVN-GIKFSAYNAGHVLGAAMFLIEIAGVK----ILY 164
LYT D +T+ + E V I + +NAGHV G+A ++I I++
Sbjct: 129 LYTAEDARAVSPLFKTVEYSREIAVTEDITATFHNAGHVFGSASIELKIQENHRQKVIVF 188
Query: 165 TGDFSRQEDRHLMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRC 224
+GD DR ++ + D ++ ESTYG H+ E + +I+ V GG
Sbjct: 189 SGDLGNW-DRPILKNPDLVNQADYVVIESTYGDRTHQDINEASLKLAEIINQTVKLGGNI 247
Query: 225 LIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQ 284
+IP FAL R Q+LL L+ + S ++ + ++ S +A +++ + + DR
Sbjct: 248 VIPSFALERTQDLLFFLNRFMS-EGKIPSLKVFVDSPMAISITKIFKEHPE-LYDR-ETS 304
Query: 285 ISINN---PFVFK--HISNLKGIDH---FEDIGPCVVMASPGMMQSGLSRELFEMWCTDA 336
+NN PF F+ H +N K D + PC+++A GM G + +
Sbjct: 305 GWVNNGSSPFEFEGLHFTN-KAADSKAILAEKDPCIIIAGSGMCTGGRIKHHLVNNISRP 363
Query: 337 KNGVIIAGYCVEGTLAKTILSEPEEVIGMSGQRLPLKMSVDYI-SFSAHTDYQQTSEFVR 395
++ ++ G+ GTL + I +EV + GQ P++ ++ + +FSAH D +++
Sbjct: 364 ESTILFVGFQATGTLGRLITDGAKEV-RILGQHYPVQARIEELRAFSAHADQPTLLRWLK 422
Query: 396 ELR--PAHVVLVHGEQNEMSR 414
+ P V + HGE +R
Sbjct: 423 GFKNKPEMVFVTHGEPETSAR 443
Score = 136 (52.9 bits), Expect = 4.3e-30, Sum P(2) = 4.3e-30
Identities = 33/94 (35%), Positives = 51/94 (54%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPG--LSGMDALPFVDLVESDQIDLLLISHFHLD 63
GA + V S +++ + +++DCG++ L + PF + + ++ISH H+D
Sbjct: 9 GAARNVTGSRYLIKTDHTQLLVDCGLYQERRLQDRNWQPFE--IPPQSLSAVIISHAHID 66
Query: 64 HCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSD 97
HCG LP L+K GF G F T AT I R L+D
Sbjct: 67 HCGLLPK-LVKEGFAGPVFATEATAEIARISLTD 99
>DICTYBASE|DDB_G0270392 [details] [associations]
symbol:cpsf2 "cleavage and polyadenylation
specificity factor 100 kDa subunit" species:44689 "Dictyostelium
discoideum" [GO:0016787 "hydrolase activity" evidence=IEA]
[GO:0006379 "mRNA cleavage" evidence=IEA;ISS] [GO:0006378 "mRNA
polyadenylation" evidence=IEA;ISS] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IEA;ISS]
[GO:0003723 "RNA binding" evidence=IEA;ISS] [GO:0006397 "mRNA
processing" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
InterPro:IPR001279 InterPro:IPR027075 Pfam:PF00753 SMART:SM00849
dictyBase:DDB_G0270392 Pfam:PF07521 EMBL:AAFI02000005
GenomeReviews:CM000150_GR GO:GO:0006378 GO:GO:0003723 GO:GO:0016787
GO:GO:0005847 GO:GO:0006379 eggNOG:COG1236 KO:K14402
InterPro:IPR022712 InterPro:IPR025069 InterPro:IPR011108
PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299 SMART:SM01027
OMA:NNPFQFK RefSeq:XP_646760.1 ProteinModelPortal:Q55BS1
STRING:Q55BS1 EnsemblProtists:DDB0233700 GeneID:8617733
KEGG:ddi:DDB_G0270392 ProtClustDB:CLSZ2431463 Uniprot:Q55BS1
Length = 784
Score = 352 (129.0 bits), Expect = 2.8e-29, Sum P(2) = 2.8e-29
Identities = 110/461 (23%), Positives = 206/461 (44%)
Query: 4 LKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLD 63
L GA E C +LE + I++DCG+ L L ++ V + +ID +L+SH
Sbjct: 10 LSGAKDE-SPPCYLLEIDDFCILLDCGLSYNLD-FSLLEPLEKV-AKKIDAVLLSHSDTT 66
Query: 64 HCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSM--D 121
H G LP+ + K G G + T + L D + E Y+ +++ D
Sbjct: 67 HIGGLPYVVGKYGLTGTIYGTTPVLKMGTMFLYDLYENKMSQEEFQQYSLDNIDSCFGED 126
Query: 122 KIETINFHEEKDVNG----IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLM 177
+ + ++F + ++G I + Y AGH +GA+++ I I+Y D++ + + HL
Sbjct: 127 RFKELSFSQHYSLSGKGKGISITPYLAGHTIGASVWKITKGTYSIVYAIDYNHRNEGHLD 186
Query: 178 AAEIPP--VKPDILITES--TYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGR 233
+ ++ +KP +LIT+S T ++ R+ I+ + GG LIPV GR
Sbjct: 187 SLQLTSDILKPSLLITDSKGVDKTLAFKKTITRDQSLFEQINRNLRDGGNVLIPVDTAGR 246
Query: 234 AQELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDR--IRRQISINNPF 291
ELLL ++ YWS + L + + + ++ + M+ ++ + +I NPF
Sbjct: 247 VLELLLCIENYWSKNKSLALYSVVFLGRFSFSVCQFARSQLEFMSSTASVKFEQNIENPF 306
Query: 292 VFKHISNLKGIDHFEDIGPC--VVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEG 349
FKHI L ++ +++ V++ S +++G SRELF WC+D K ++ +
Sbjct: 307 SFKHIKILSSLEELQELPDTNKVILTSSQDLETGFSRELFIQWCSDPKTLILFTQKIPKD 366
Query: 350 TLAKTILSEPEEVIG-------MSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHV 402
+LA ++ + G + G R+PL + + + Q+ + + +LR
Sbjct: 367 SLADKLIKQYSTPNGRGKCIEIVQGSRVPLTGD-ELLQYEMEQAKQREEKRLEQLRKEQE 425
Query: 403 VLVHGEQNEMSRLKAALTREYEDDPNTSMELYNPRNTVSVD 443
E+ E + L +D ++L + +D
Sbjct: 426 EREERERLEEEEREQLLNATNQDQLQQLLQLQQQKERGIID 466
Score = 47 (21.6 bits), Expect = 2.8e-29, Sum P(2) = 2.8e-29
Identities = 12/35 (34%), Positives = 16/35 (45%)
Query: 480 YHLLAPSDLPKYTDLKASKIIQQQSVYYSGSISVL 514
Y LL L LK SKI+ + Y G + +L
Sbjct: 627 YELLLKDSL--VNTLKTSKILDYEVSYIQGKVDIL 659
>TIGR_CMR|CHY_2049 [details] [associations]
symbol:CHY_2049 "metallo-beta-lactamase family protein"
species:246194 "Carboxydothermus hydrogenoformans Z-2901"
[GO:0003824 "catalytic activity" evidence=ISS] [GO:0008152
"metabolic process" evidence=ISS] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 EMBL:CP000141 GenomeReviews:CP000141_GR
GO:GO:0016787 eggNOG:COG1236 InterPro:IPR022712 InterPro:IPR011108
Pfam:PF10996 SMART:SM01027 RefSeq:YP_360868.1
ProteinModelPortal:Q3AAG6 STRING:Q3AAG6 GeneID:3728507
KEGG:chy:CHY_2049 PATRIC:21277179 HOGENOM:HOG000244774 KO:K07576
OMA:GGRIVHH BioCyc:CHYD246194:GJCN-2048-MONOMER Uniprot:Q3AAG6
Length = 504
Score = 326 (119.8 bits), Expect = 7.0e-27, P = 7.0e-27
Identities = 100/371 (26%), Positives = 177/371 (47%)
Query: 108 QMLYTESDLEKSMDKIETINFHEE-KDVNGIKFSAYNAGHVLGAAMFLIEIAGVK----I 162
Q +YT D ++ + I + G++ + ++AGH+LG+AM I G I
Sbjct: 123 QPIYTADDAFNALAYFQKIPLETPITPLPGLEVTFFDAGHILGSAMIKIAYKGQDATRTI 182
Query: 163 LYTGDFSRQEDRHLMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGG 222
L+TGD R + + P+ DIL+ ESTYG V + + + SLI + R G
Sbjct: 183 LFTGDLGRNGRPFMKEPQKVPLT-DILVLESTYGDRVRSEEGDLKTLLKSLIEKVYRRNG 241
Query: 223 RCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIR 282
+IP FA+ R Q+L+ IL++ + E+ I +Y S LA + +++ Y N+ +
Sbjct: 242 NLIIPAFAMERTQDLIYILNDLVE-NKEVPPIDVYIDSPLAVEITKLFKKYPMFFNEEYK 300
Query: 283 RQISI-NNPFVFK--HIS-NLKGIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKN 338
+++ ++P F H S + + +I +++++ GM +G R + ++
Sbjct: 301 EKLNRGDDPLAFPGLHFSVSQEDSVKLNNISRAIIISASGMADAGRIRHHLKHNLWRPES 360
Query: 339 GVIIAGYCVEGTLAKTILSEPEEVIGMSGQRLPLKMSV-DYISFSAHTDYQQTSEFVREL 397
V++ GY + TL + +L +EV M G+ + +K V Y SAH D ++ F+
Sbjct: 361 AVLLVGYQAQDTLGRKLLDGAKEVKIM-GEEIAVKAEVYHYDGLSAHADQRELLAFIGRF 419
Query: 398 --RPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMELYNPR--NTVSVDLYFKGEKTAK 453
+PA + LVHGE LK + +Y + Y PR T+S+ G K+ +
Sbjct: 420 SQKPAQIYLVHGEDEARLNLKKLIEEKYR------IPCYLPRYQETISLLANLPG-KSEE 472
Query: 454 VMGELAVENLK 464
V+ + + LK
Sbjct: 473 VLIDKVITLLK 483
Score = 164 (62.8 bits), Expect = 1.0e-08, P = 1.0e-08
Identities = 68/303 (22%), Positives = 123/303 (40%)
Query: 3 TLKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDL-VESDQIDLLLISHFH 61
T GA V SC + ++DCG+ G + + + +I+ +L++H H
Sbjct: 4 TFFGAADTVTGSCYLFNVAGHKFLVDCGLFQGPKAIKERNYGEFPFNPREIEFILLTHAH 63
Query: 62 LDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTE-------------- 107
+DH G +P L+K GFKG + T T + +L D V + E
Sbjct: 64 IDHSGLIPK-LVKKGFKGTIYATEPTVDLAAVMLPDSGHVQEMEVERKNRKLRRAGKPEL 122
Query: 108 QMLYTESDLEKSMDKIETINFHEE-KDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTG 166
Q +YT D ++ + I + G++ + ++AGH+LG+AM I G T
Sbjct: 123 QPIYTADDAFNALAYFQKIPLETPITPLPGLEVTFFDAGHILGSAMIKIAYKGQDATRTI 182
Query: 167 DFSRQEDRHLMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLI 226
F+ R+ P K + T+ R E EG +L+ ++ + R
Sbjct: 183 LFTGDLGRNGRPFMKEPQKVPLTDILVLESTYGDRVRSE-EGDLKTLLKSLIEKVYRRNG 241
Query: 227 PVFALGRAQEL---LLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRR 283
+ A E L+ + + E+ I +Y S LA + +++ Y N+ +
Sbjct: 242 NLIIPAFAMERTQDLIYILNDLVENKEVPPIDVYIDSPLAVEITKLFKKYPMFFNEEYKE 301
Query: 284 QIS 286
+++
Sbjct: 302 KLN 304
>UNIPROTKB|E9PIL7 [details] [associations]
symbol:CPSF3L "Integrator complex subunit 11" species:9606
"Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 GO:GO:0016787 EMBL:AL139287
HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI00977321
ProteinModelPortal:E9PIL7 SMR:E9PIL7 Ensembl:ENST00000534345
ArrayExpress:E9PIL7 Bgee:E9PIL7 Uniprot:E9PIL7
Length = 140
Score = 267 (99.0 bits), Expect = 2.9e-22, P = 2.9e-22
Identities = 54/132 (40%), Positives = 81/132 (61%)
Query: 4 LKGAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLIS 58
L GAGQ+VGRSCI++ K++M+DCG+H G + P F + ++ D +D ++IS
Sbjct: 9 LVGAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIIS 68
Query: 59 HFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVS-NISTEQMLYTESDLE 117
HFHLDHCGALP+F G+ G +MTH T+AI LL DY K++ + E +T ++
Sbjct: 69 HFHLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDYRKIAVDKKGEANFFTSQMIK 128
Query: 118 KSMDKIETINFH 129
M K+ ++ H
Sbjct: 129 DCMKKVVAVHLH 140
>POMBASE|SPBC1709.15c [details] [associations]
symbol:cft2 "cleavage factor two Cft2/polyadenylation
factor CPSF-73 (predicted)" species:4896 "Schizosaccharomyces
pombe" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
"nucleus" evidence=IDA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IDA]
[GO:0005849 "mRNA cleavage factor complex" evidence=ISO]
[GO:0006378 "mRNA polyadenylation" evidence=ISO] [GO:0006379 "mRNA
cleavage" evidence=ISO] InterPro:IPR027075 PomBase:SPBC1709.15c
Pfam:PF07521 EMBL:CU329671 GO:GO:0006378 GenomeReviews:CU329671_GR
GO:GO:0005847 GO:GO:0006379 PIR:T39643 RefSeq:NP_595448.1
ProteinModelPortal:O74740 STRING:O74740 EnsemblFungi:SPBC1709.15c.1
GeneID:2539954 KEGG:spo:SPBC1709.15c eggNOG:COG1236 KO:K14402
OMA:ISSIATP OrthoDB:EOG4WWVSN NextBio:20801097 InterPro:IPR022712
InterPro:IPR025069 InterPro:IPR011108 PANTHER:PTHR11203:SF5
Pfam:PF10996 Pfam:PF13299 SMART:SM01027 Uniprot:O74740
Length = 797
Score = 288 (106.4 bits), Expect = 6.3e-22, P = 6.3e-22
Identities = 129/532 (24%), Positives = 233/532 (43%)
Query: 25 IMMDCGIH----PGLSGMDALPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGFKGR 80
I +D GIH PG D+L ++ E Q DL+L+SH L H G L + K +K
Sbjct: 18 IELD-GIHIYIDPGSD--DSLKHPEVPE--QPDLILLSHSDLAHIGGLVYAYYKYDWKNA 72
Query: 81 -CFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEK----DVN 135
+ T T + R + D IK + IS +++D++ D I + + + +
Sbjct: 73 YIYATLPTINMGRMTMLDAIKSNYISD----MSKADVDAVFDSIIPLRYQQPTLLLGKCS 128
Query: 136 GIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPV--------KPD 187
G+ +AYNAGH LG ++ + +LY D++ +D+HL A + +P+
Sbjct: 129 GLTITAYNAGHTLGGTLWSLIKESESVLYAVDWNHSKDKHLNGAALYSNGHILEALNRPN 188
Query: 188 ILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSL 247
LIT++ R++R+ F + + +GG L+PV A R EL ILD +WS
Sbjct: 189 TLITDANNSLVSIPSRKKRDEAFIESVMSSLLKGGTVLLPVDAASRVLELCCILDNHWSA 248
Query: 248 -HPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISIN-NPFVFKHISNLKGIDHF 305
P L PI + S + K + ++ I M D I R IN N F++I+ +
Sbjct: 249 SQPPL-PFPILFLSPTSTKTIDYAKSMIEWMGDNIVRDFGINENLLEFRNINTITDFSQI 307
Query: 306 EDIGPC--VVMASPGMMQSGLSRELFEMWCTDAKNGVII---AGYCVEGTLAKTILSEPE 360
IGP V++A+ ++ G S+ + ++ N +I+ C + +LA + E
Sbjct: 308 SHIGPGPKVILATALTLECGFSQRILLDLMSENSNDLILFTQRSRCPQNSLANQFIRYWE 367
Query: 361 EVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALT 420
+ +P + + Y + + ++ P + GE+ S + +
Sbjct: 368 RA-SKKKRDIPHPVGL----------YAEQAVKIKTKEP-----LEGEELR-SYQELEFS 410
Query: 421 REYEDDPNTSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAAL--SGIIVKRNF 478
+ +D +T++E N R + DL ++ +L + P AL S ++ ++F
Sbjct: 411 KRNKDAEDTALEFRN-RTILDEDL---SSSSSSEDDDLDLNTEVPHVALGSSAFLMGKSF 466
Query: 479 NYHLLAPSDLPKYTDLKASKIIQQQS-VYYSGSISVLRSLISHLAGPVETLD 529
+ +L P+ +T K I+++ + G I + S + P TL+
Sbjct: 467 DLNLRDPAVQALHTKYKMFPYIEKRRRIDEYGEI-IKHQDFSMINEPANTLE 517
>UNIPROTKB|Q81SC3 [details] [associations]
symbol:BA_1737 "Metallo-beta-lactamase family protein"
species:1392 "Bacillus anthracis" [GO:0008150 "biological_process"
evidence=ND] InterPro:IPR001279 SMART:SM00849 Pfam:PF07521
EMBL:AE016879 EMBL:AE017334 GenomeReviews:AE016879_GR
GenomeReviews:AE017334_GR GO:GO:0016787 InterPro:IPR022712
InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 RefSeq:NP_844172.1
RefSeq:YP_018378.1 ProteinModelPortal:Q81SC3 IntAct:Q81SC3
DNASU:1086535 EnsemblBacteria:EBBACT00000009201
EnsemblBacteria:EBBACT00000014472 GeneID:1086535 GeneID:2817971
KEGG:ban:BA_1737 KEGG:bar:GBAA_1737 PATRIC:18781074
HOGENOM:HOG000087450 OMA:SQHERVN ProtClustDB:CLSK2516952
BioCyc:BANT261594:GJ7F-1754-MONOMER Uniprot:Q81SC3
Length = 419
Score = 272 (100.8 bits), Expect = 4.2e-21, P = 4.2e-21
Identities = 98/363 (26%), Positives = 172/363 (47%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
GAG E GRSC ++ K I+ DCGI+ D+ P ++ ++ + +SH H DH
Sbjct: 8 GAG-EYGRSCYFVKNKETKILFDCGINRSYE--DSYPKIEREVVPFLEAVFLSHIHEDHT 64
Query: 66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQ---MLYTESDLEKSMDK 122
LP L K G+K + + T TK L + Y K N + Q + Y + ++ K ++
Sbjct: 65 MGLP-LLAKYGYKKKIWTTRYTK---EQLPAYYEKWRNYNVTQGWNVPYNDQNV-KDLNY 119
Query: 123 I---ETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
I E N +E + ++F +GHVLG+ FL++++ + Y+GD+S + + ++
Sbjct: 120 IYVDEISNPNEWIQITPTLRFQWGYSGHVLGSVWFLVDMSHTYVFYSGDYSAESN--ILR 177
Query: 179 AEIPP-VKPDI--LITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQ 235
A +P ++ DI I ++ Y T QRE R + I G L+P+ LGRAQ
Sbjct: 178 ANLPEKLRGDIKVAIVDAAYHTDDVSQRE-RVNELCTEIERAAGNKGIALLPLPPLGRAQ 236
Query: 236 ELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKH 295
+++L L E + P + D I M +Y+ +I + S+ +
Sbjct: 237 DIVLYLYEKYKEFPIIVDQEILDGFDE----MFLYKDWIKNNKELEELMESLKRNIIV-- 290
Query: 296 ISNLKGIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTI 355
+ + G H G +V+ S MQ+ ++ +E + +N +I G+ +G+ A+ +
Sbjct: 291 MDDDGGTQH--SCG--IVVMSDANMQTKRAQLYYEQIRHEERNSIIFTGHVAKGSFAEKV 346
Query: 356 LSE 358
L E
Sbjct: 347 LKE 349
>TIGR_CMR|BA_1737 [details] [associations]
symbol:BA_1737 "metallo-beta-lactamase family protein"
species:198094 "Bacillus anthracis str. Ames" [GO:0003824
"catalytic activity" evidence=ISS] [GO:0008150 "biological_process"
evidence=ND] InterPro:IPR001279 SMART:SM00849 Pfam:PF07521
EMBL:AE016879 EMBL:AE017334 GenomeReviews:AE016879_GR
GenomeReviews:AE017334_GR GO:GO:0016787 InterPro:IPR022712
InterPro:IPR011108 Pfam:PF10996 SMART:SM01027 RefSeq:NP_844172.1
RefSeq:YP_018378.1 ProteinModelPortal:Q81SC3 IntAct:Q81SC3
DNASU:1086535 EnsemblBacteria:EBBACT00000009201
EnsemblBacteria:EBBACT00000014472 GeneID:1086535 GeneID:2817971
KEGG:ban:BA_1737 KEGG:bar:GBAA_1737 PATRIC:18781074
HOGENOM:HOG000087450 OMA:SQHERVN ProtClustDB:CLSK2516952
BioCyc:BANT261594:GJ7F-1754-MONOMER Uniprot:Q81SC3
Length = 419
Score = 272 (100.8 bits), Expect = 4.2e-21, P = 4.2e-21
Identities = 98/363 (26%), Positives = 172/363 (47%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHC 65
GAG E GRSC ++ K I+ DCGI+ D+ P ++ ++ + +SH H DH
Sbjct: 8 GAG-EYGRSCYFVKNKETKILFDCGINRSYE--DSYPKIEREVVPFLEAVFLSHIHEDHT 64
Query: 66 GALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQ---MLYTESDLEKSMDK 122
LP L K G+K + + T TK L + Y K N + Q + Y + ++ K ++
Sbjct: 65 MGLP-LLAKYGYKKKIWTTRYTK---EQLPAYYEKWRNYNVTQGWNVPYNDQNV-KDLNY 119
Query: 123 I---ETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMA 178
I E N +E + ++F +GHVLG+ FL++++ + Y+GD+S + + ++
Sbjct: 120 IYVDEISNPNEWIQITPTLRFQWGYSGHVLGSVWFLVDMSHTYVFYSGDYSAESN--ILR 177
Query: 179 AEIPP-VKPDI--LITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQ 235
A +P ++ DI I ++ Y T QRE R + I G L+P+ LGRAQ
Sbjct: 178 ANLPEKLRGDIKVAIVDAAYHTDDVSQRE-RVNELCTEIERAAGNKGIALLPLPPLGRAQ 236
Query: 236 ELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKH 295
+++L L E + P + D I M +Y+ +I + S+ +
Sbjct: 237 DIVLYLYEKYKEFPIIVDQEILDGFDE----MFLYKDWIKNNKELEELMESLKRNIIV-- 290
Query: 296 ISNLKGIDHFEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTI 355
+ + G H G +V+ S MQ+ ++ +E + +N +I G+ +G+ A+ +
Sbjct: 291 MDDDGGTQH--SCG--IVVMSDANMQTKRAQLYYEQIRHEERNSIIFTGHVAKGSFAEKV 346
Query: 356 LSE 358
L E
Sbjct: 347 LKE 349
>UNIPROTKB|Q74C32 [details] [associations]
symbol:GSU1843 "RNA exonuclease, beta-lactamase fold
protein" species:243231 "Geobacter sulfurreducens PCA" [GO:0008150
"biological_process" evidence=ND] InterPro:IPR001279 Pfam:PF00753
SMART:SM00849 Pfam:PF07521 EMBL:AE017180 GenomeReviews:AE017180_GR
GO:GO:0004527 InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996
SMART:SM01027 HOGENOM:HOG000244774 KO:K07576 RefSeq:NP_952893.1
ProteinModelPortal:Q74C32 GeneID:2688625 KEGG:gsu:GSU1843
PATRIC:22026545 OMA:CHIDHVG ProtClustDB:CLSK2516562
BioCyc:GSUL243231:GH27-1786-MONOMER Uniprot:Q74C32
Length = 475
Score = 162 (62.1 bits), Expect = 1.4e-19, Sum P(2) = 1.4e-19
Identities = 44/152 (28%), Positives = 79/152 (51%)
Query: 14 SCIMLEFK-NKSIMMDCGIHPGLSGMDA--LPFVDLVESDQIDLLLISHFHLDHCGALPW 70
SC L N +I++DCG+ G G PF+D D++ L+++H H+DHCG +P
Sbjct: 15 SCHELVISDNAAILIDCGLLQGNDGAGGKRFPFIDF-PLDRVKGLVLTHVHIDHCGRIP- 72
Query: 71 FLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTE--SDLEKSMDKIETINF 128
LL GF+G + + A+ + +L D +KV I+ ++ L + ++K + + +
Sbjct: 73 HLLGAGFQGPIWCSEASALLLPLVLEDAVKVG-ITRDEHLIARFLNAVKKRLVPLPYDRW 131
Query: 129 HEEKDVNGIKFSA--YNAGHVLGAAMFLIEIA 158
H+ +G S AGH+LG+A + ++
Sbjct: 132 HQLGSWDGRSASLRLQQAGHILGSAYVEVSVS 163
Score = 151 (58.2 bits), Expect = 1.4e-19, Sum P(2) = 1.4e-19
Identities = 45/153 (29%), Positives = 72/153 (47%)
Query: 162 ILYTGDFSRQEDRHLMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRG 221
++++GD L+ PP + DIL+ ESTYG HE RE+R R +I +
Sbjct: 183 VVFSGDLGAPFTP-LLPDPKPPERADILVLESTYGDRQHEGREQRRERLCRVIVRALENR 241
Query: 222 GRCLIPVFALGRAQELLLILDEYWSLH--PEL------HDIPIYYASSLAKKCMSVYQTY 273
G L+P F++GR QELL +++ S H E D+ I S LA VY
Sbjct: 242 GALLVPAFSIGRTQELLYEIEDLISRHRTEEAAAGLPWDDLEIIVDSPLALSVTRVYDRL 301
Query: 274 INAMNDRIRRQISIN-NPFVFKHISNLKG-IDH 304
++ ++ N +P F+ ++ ++ DH
Sbjct: 302 RRLWDEEALETVAQNRHPLSFEQMTVIESHADH 334
Score = 135 (52.6 bits), Expect = 6.7e-18, Sum P(2) = 6.7e-18
Identities = 48/190 (25%), Positives = 80/190 (42%)
Query: 253 DIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISIN-NPFVFKHISNLKG-IDHFEDIG- 309
D+ I S LA VY ++ ++ N +P F+ ++ ++ DH +
Sbjct: 281 DLEIIVDSPLALSVTRVYDRLRRLWDEEALETVAQNRHPLSFEQMTVIESHADHRATVEY 340
Query: 310 ------PCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEV- 362
PC+V+A+ GM G + D + ++ GY GT + IL ++
Sbjct: 341 LRKTARPCIVIAAGGMCAGGRIVNYLKALMPDPRTDILFVGYQAAGTPGREILEAAKQKW 400
Query: 363 -------IGMSGQRLPLKMSVDYIS-FSAHTDYQQTSEFVRELR--PAHVVLVHGEQNEM 412
I + G PL+ +V IS +SAH D + EFV + P + LVHGE+
Sbjct: 401 ETGGRPSIDLDGGTYPLRAAVHTISGYSAHADQRDLVEFVEGITVPPKTIRLVHGEEEAR 460
Query: 413 SRLKAALTRE 422
+ L L +
Sbjct: 461 TALAGVLAEK 470
>TIGR_CMR|GSU_1843 [details] [associations]
symbol:GSU_1843 "metallo-beta-lactamase family protein"
species:243231 "Geobacter sulfurreducens PCA" [GO:0003824
"catalytic activity" evidence=ISS] [GO:0008150 "biological_process"
evidence=ND] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 EMBL:AE017180 GenomeReviews:AE017180_GR GO:GO:0004527
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
HOGENOM:HOG000244774 KO:K07576 RefSeq:NP_952893.1
ProteinModelPortal:Q74C32 GeneID:2688625 KEGG:gsu:GSU1843
PATRIC:22026545 OMA:CHIDHVG ProtClustDB:CLSK2516562
BioCyc:GSUL243231:GH27-1786-MONOMER Uniprot:Q74C32
Length = 475
Score = 162 (62.1 bits), Expect = 1.4e-19, Sum P(2) = 1.4e-19
Identities = 44/152 (28%), Positives = 79/152 (51%)
Query: 14 SCIMLEFK-NKSIMMDCGIHPGLSGMDA--LPFVDLVESDQIDLLLISHFHLDHCGALPW 70
SC L N +I++DCG+ G G PF+D D++ L+++H H+DHCG +P
Sbjct: 15 SCHELVISDNAAILIDCGLLQGNDGAGGKRFPFIDF-PLDRVKGLVLTHVHIDHCGRIP- 72
Query: 71 FLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTE--SDLEKSMDKIETINF 128
LL GF+G + + A+ + +L D +KV I+ ++ L + ++K + + +
Sbjct: 73 HLLGAGFQGPIWCSEASALLLPLVLEDAVKVG-ITRDEHLIARFLNAVKKRLVPLPYDRW 131
Query: 129 HEEKDVNGIKFSA--YNAGHVLGAAMFLIEIA 158
H+ +G S AGH+LG+A + ++
Sbjct: 132 HQLGSWDGRSASLRLQQAGHILGSAYVEVSVS 163
Score = 151 (58.2 bits), Expect = 1.4e-19, Sum P(2) = 1.4e-19
Identities = 45/153 (29%), Positives = 72/153 (47%)
Query: 162 ILYTGDFSRQEDRHLMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRG 221
++++GD L+ PP + DIL+ ESTYG HE RE+R R +I +
Sbjct: 183 VVFSGDLGAPFTP-LLPDPKPPERADILVLESTYGDRQHEGREQRRERLCRVIVRALENR 241
Query: 222 GRCLIPVFALGRAQELLLILDEYWSLH--PEL------HDIPIYYASSLAKKCMSVYQTY 273
G L+P F++GR QELL +++ S H E D+ I S LA VY
Sbjct: 242 GALLVPAFSIGRTQELLYEIEDLISRHRTEEAAAGLPWDDLEIIVDSPLALSVTRVYDRL 301
Query: 274 INAMNDRIRRQISIN-NPFVFKHISNLKG-IDH 304
++ ++ N +P F+ ++ ++ DH
Sbjct: 302 RRLWDEEALETVAQNRHPLSFEQMTVIESHADH 334
Score = 135 (52.6 bits), Expect = 6.7e-18, Sum P(2) = 6.7e-18
Identities = 48/190 (25%), Positives = 80/190 (42%)
Query: 253 DIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISIN-NPFVFKHISNLKG-IDHFEDIG- 309
D+ I S LA VY ++ ++ N +P F+ ++ ++ DH +
Sbjct: 281 DLEIIVDSPLALSVTRVYDRLRRLWDEEALETVAQNRHPLSFEQMTVIESHADHRATVEY 340
Query: 310 ------PCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPEEV- 362
PC+V+A+ GM G + D + ++ GY GT + IL ++
Sbjct: 341 LRKTARPCIVIAAGGMCAGGRIVNYLKALMPDPRTDILFVGYQAAGTPGREILEAAKQKW 400
Query: 363 -------IGMSGQRLPLKMSVDYIS-FSAHTDYQQTSEFVRELR--PAHVVLVHGEQNEM 412
I + G PL+ +V IS +SAH D + EFV + P + LVHGE+
Sbjct: 401 ETGGRPSIDLDGGTYPLRAAVHTISGYSAHADQRDLVEFVEGITVPPKTIRLVHGEEEAR 460
Query: 413 SRLKAALTRE 422
+ L L +
Sbjct: 461 TALAGVLAEK 470
>UNIPROTKB|E9PQF0 [details] [associations]
symbol:CPSF3L "Integrator complex subunit 11" species:9606
"Homo sapiens" [GO:0016787 "hydrolase activity" evidence=IEA]
InterPro:IPR001279 Pfam:PF00753 GO:GO:0016787 EMBL:AL139287
HGNC:HGNC:26052 ChiTaRS:CPSF3L IPI:IPI00982774
ProteinModelPortal:E9PQF0 SMR:E9PQF0 Ensembl:ENST00000498476
ArrayExpress:E9PQF0 Bgee:E9PQF0 Uniprot:E9PQF0
Length = 167
Score = 242 (90.2 bits), Expect = 1.5e-19, P = 1.5e-19
Identities = 47/98 (47%), Positives = 65/98 (66%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDALP-FVDLVES----DQIDLLLISHF 60
GAGQ+VGRSCI++ K++M+DCG+H G + P F + ++ D +D ++ISHF
Sbjct: 70 GAGQDVGRSCILVSIAGKNVMLDCGMHMGFNDDRRFPDFSYITQNGRLTDFLDCVIISHF 129
Query: 61 HLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDY 98
HLDHCGALP+F G+ G +MTH T+AI LL DY
Sbjct: 130 HLDHCGALPYFSEMVGYDGPIYMTHPTQAICPILLEDY 167
>DICTYBASE|DDB_G0282473 [details] [associations]
symbol:ints9 "integrator complex subunit 9"
species:44689 "Dictyostelium discoideum" [GO:0032039 "integrator
complex" evidence=IEA] [GO:0016180 "snRNA processing" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] [GO:0044351 "macropinocytosis"
evidence=RCA] InterPro:IPR027074 dictyBase:DDB_G0282473
GO:GO:0005634 EMBL:AAFI02000047 GenomeReviews:CM000152_GR
eggNOG:COG1236 InterPro:IPR022712 Pfam:PF10996 SMART:SM01027
KO:K13146 PANTHER:PTHR11203:SF2 RefSeq:XP_640069.1
ProteinModelPortal:Q54SH0 EnsemblProtists:DDB0234099 GeneID:8623598
KEGG:ddi:DDB_G0282473 OMA:DDFSTID ProtClustDB:CLSZ2729002
Uniprot:Q54SH0
Length = 712
Score = 197 (74.4 bits), Expect = 5.8e-17, Sum P(3) = 5.8e-17
Identities = 61/243 (25%), Positives = 115/243 (47%)
Query: 110 LYTESDLEKSMDKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGV-KILYTGDF 168
LY + D+EKS +KI++I F+E G + ++G+ LG+A ++IE G +++Y D
Sbjct: 217 LYKKIDIEKSFEKIQSIRFNESIKHYGFECIPSSSGYGLGSANWVIESKGFERVVYISDS 276
Query: 169 SRQEDRHLMAAEIPPV-KPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIP 227
S R+ ++ P+ PD+LI S + + ++ S I + +GG LIP
Sbjct: 277 SLSLSRYPTPFQLSPIDNPDVLIL-SKINHYPNNPPDQMLSELCSNIGSTLQQGGTVLIP 335
Query: 228 VFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMN-DRIRRQIS 286
++ G +L L +Y + L +PIY+ SS++K +S Y +N + R
Sbjct: 336 SYSCGIILDLFEHLADYLN-KVGLPYVPIYFVSSVSKAVLSYADIYSEWLNKSKQERAFM 394
Query: 287 INNPFVFKHI---SNLKGIDH----FEDIGPCVVMASPGMMQSGLSRELFEMWCTDAKNG 339
PF+ + + + H F+ PC++ + G L +++ + KN
Sbjct: 395 PETPFLHQDLMRKGQFQAYQHVHSNFQANDPCIIFTGHPSCRIGDITTLIKLY-DNPKNS 453
Query: 340 VII 342
+++
Sbjct: 454 ILL 456
Score = 79 (32.9 bits), Expect = 5.8e-17, Sum P(3) = 5.8e-17
Identities = 24/95 (25%), Positives = 50/95 (52%)
Query: 42 PFVDLVES-DQIDLLLISHFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIK 100
P ++++ ID++LIS++ + ALP+ T F+G+ + T T I + LL + ++
Sbjct: 106 PQFEMIDDFSTIDMILISNY--TNIYALPFITEYTNFQGKIYATEPTVQIGKLLLEELVQ 163
Query: 101 V------SNISTEQMLYTESDLEKSMDKIETINFH 129
+ S+I+ SD ++++ +E +N H
Sbjct: 164 MDKQYSNSSINNNNNNNNLSDCWQNIEILEKLNVH 198
Score = 58 (25.5 bits), Expect = 5.8e-17, Sum P(3) = 5.8e-17
Identities = 9/23 (39%), Positives = 14/23 (60%)
Query: 9 QEVGRSCIMLEFKNKSIMMDCGI 31
Q C +LE+KN I++DC +
Sbjct: 8 QSAQSPCFLLEYKNVKILLDCAL 30
>RGD|1311539 [details] [associations]
symbol:Ints9 "integrator complex subunit 9" species:10116
"Rattus norvegicus" [GO:0016180 "snRNA processing"
evidence=IEA;ISO] [GO:0032039 "integrator complex"
evidence=IEA;ISO] InterPro:IPR027074 RGD:1311539 InterPro:IPR022712
Pfam:PF10996 SMART:SM01027 GO:GO:0032039 GO:GO:0016180
PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445 IPI:IPI00362364
Ensembl:ENSRNOT00000018071 Uniprot:F1M365
Length = 659
Score = 191 (72.3 bits), Expect = 3.0e-16, Sum P(2) = 3.0e-16
Identities = 111/493 (22%), Positives = 201/493 (40%)
Query: 88 KAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNG-IKFSAYNAGH 146
K I R L S +ST + YT ++ ++ KI+ + + ++ ++ G ++ + ++G+
Sbjct: 160 KEIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQKIELFGAVQVTPLSSGY 219
Query: 147 VLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK-PDILITESTYGTHVHEQREE 205
LG++ ++I+ K+ Y S H + +K D+LI T + +
Sbjct: 220 ALGSSNWIIQSHYEKVSYVSGSSLLTT-HPQPMDQASLKNSDVLILTGL--TQIPTANPD 276
Query: 206 -REGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAK 264
G F S + V GG L+P + G +LL L +Y L +IP Y+ S +A
Sbjct: 277 GMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYID-SAGLSNIPFYFISPVAN 335
Query: 265 KCMSVYQTYINAMNDRIRRQISINNP-FVFKHISNLKGIDHFEDI-G--------PCVVM 314
+ Q + + + ++ + P F + + H+ I G PCV+
Sbjct: 336 SSLEFSQIFAEWLCHNKQSKVYLPEPPFPHAELIQTNKLKHYRSIHGDFSHDFRQPCVLF 395
Query: 315 ASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPE-EVIGMSGQRLPLK 373
++ G E+W + N VI +EP+ + PL
Sbjct: 396 TGHPSLRFGDVVHFMELWGKSSLNTVIF--------------TEPDFSYLEALAPYQPLA 441
Query: 374 MSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMEL 433
M Y ++ Q S+ ++E++P HVV EQ + + D
Sbjct: 442 MKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCP--EQYTQPPPAQSHRMDLMIDCQPPAMS 499
Query: 434 YNPRNTVSVDLYFKGEKTAKVMGELAVE----NLKPDAALSGI--IVKRNFNYHLLAPSD 487
Y +++ + EK ++M ELA +KP +L+ + ++ N H+L P
Sbjct: 500 YRRAEVLALPFKRRYEKI-EIMPELADSLVPMEIKPGISLATVSAVLHTKDNKHVLQPP- 557
Query: 488 LPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLDEK-RLRAFACIEI--TLE 544
PK T +SK ++ S S VL+ L+S PVE + F+ I++ T +
Sbjct: 558 -PKPTQPTSSKKRKRVSEDVPDS-KVLKPLLSGSI-PVEQFVQTLEKHGFSDIKVEDTAK 614
Query: 545 KCIVVLEWASNPI 557
IV+L+ A I
Sbjct: 615 GHIVLLQEAETLI 627
Score = 93 (37.8 bits), Expect = 3.0e-16, Sum P(2) = 3.0e-16
Identities = 24/82 (29%), Positives = 42/82 (51%)
Query: 20 FKNKSIMMDCGIHPGLSGMD--ALPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGF 77
F +K + +C H + + LP +L++ +D++LIS++H ALP+ TGF
Sbjct: 64 FLDKELK-ECSGHVFVDSVPEFCLPETELIDLSTVDVILISNYHC--MMALPYITEHTGF 120
Query: 78 KGRCFMTHATKAIYRWLLSDYI 99
G + T T I R L+ + +
Sbjct: 121 TGTVYATEPTMQIGRLLMEELV 142
Score = 67 (28.6 bits), Expect = 1.5e-13, Sum P(2) = 1.5e-13
Identities = 16/41 (39%), Positives = 28/41 (68%)
Query: 15 CIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLL 55
C +L+FK+ +IM+DCG+ S ++ LP + LV+S ++ L
Sbjct: 15 CNVLKFKSTTIMLDCGLDM-TSTLNFLP-LPLVQSPRLSNL 53
>MGI|MGI:1098533 [details] [associations]
symbol:Ints9 "integrator complex subunit 9" species:10090
"Mus musculus" [GO:0003674 "molecular_function" evidence=ND]
[GO:0005634 "nucleus" evidence=IEA] [GO:0016180 "snRNA processing"
evidence=ISO] [GO:0032039 "integrator complex" evidence=ISO]
InterPro:IPR027074 MGI:MGI:1098533 eggNOG:COG1236
InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 GO:GO:0032039
GO:GO:0016180 CTD:55756 HOGENOM:HOG000045359 HOVERGEN:HBG081802
KO:K13146 OrthoDB:EOG415GCW PANTHER:PTHR11203:SF2
GeneTree:ENSGT00390000001445 EMBL:AK038979 EMBL:AK077634
EMBL:AK136992 EMBL:AK150195 EMBL:BC028953 EMBL:BC055700
IPI:IPI00223422 IPI:IPI00406798 RefSeq:NP_001240660.1
RefSeq:NP_700463.2 UniGene:Mm.71332 ProteinModelPortal:Q8K114
SMR:Q8K114 STRING:Q8K114 PhosphoSite:Q8K114 PaxDb:Q8K114
PRIDE:Q8K114 Ensembl:ENSMUST00000043914 GeneID:210925
KEGG:mmu:210925 UCSC:uc007uiv.1 UCSC:uc007uiw.1 InParanoid:Q8K114
NextBio:373083 Bgee:Q8K114 CleanEx:MM_INTS9 Genevestigator:Q8K114
Uniprot:Q8K114
Length = 658
Score = 186 (70.5 bits), Expect = 4.0e-16, Sum P(2) = 4.0e-16
Identities = 110/494 (22%), Positives = 201/494 (40%)
Query: 88 KAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNG-IKFSAYNAGH 146
K I R L S +ST + YT ++ ++ KI+ + + ++ ++ G ++ + ++G+
Sbjct: 159 KDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQKIELFGAVQVTPLSSGY 218
Query: 147 VLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK-PDILITESTYGTHVHEQREE 205
LG++ ++I+ K+ Y S H + +K D+LI T + +
Sbjct: 219 ALGSSNWIIQSHYEKVSYVSGSSLLTT-HPQPMDQASLKNSDVLILTGL--TQIPTANPD 275
Query: 206 -REGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAK 264
G F S + V GG L+P + G +LL L +Y L +IP Y+ S +A
Sbjct: 276 GMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYID-SAGLSNIPFYFISPVAN 334
Query: 265 KCMSVYQTYINAMNDRIRRQISINNP-FVFKHISNLKGIDHFEDI-G--------PCVVM 314
+ Q + + + ++ + P F + + H+ I G PCV+
Sbjct: 335 SSLEFSQIFAEWLCHNKQSKVYLPEPPFPHAELIQTNKLKHYRSIHGDFSNDFRQPCVLF 394
Query: 315 ASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPE-EVIGMSGQRLPLK 373
++ G E+W + N +I +EP+ + PL
Sbjct: 395 TGHPSLRFGDVVHFMELWGKSSLNTIIF--------------TEPDFSYLEALAPYQPLA 440
Query: 374 MSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMEL 433
M Y ++ Q S+ ++E++P HVV EQ A + D
Sbjct: 441 MKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCP--EQYTQPPPAQAHRMDLMIDCQPPAMS 498
Query: 434 YNPRNTVSVDLYFKGEKTAKVMGELAVE----NLKPDAALSGI--IVKRNFNYHLLAPSD 487
Y +++ + EK ++M ELA +KP +L+ + ++ N H+L P
Sbjct: 499 YRRAEVLALPFKRRYEKI-EIMPELADSLVPMEIKPGISLATVSAVLHTKDNKHVLQPP- 556
Query: 488 LPKYTDLKASKIIQQQSVYYS-GSISVLRSLISHLAGPVETLDEK-RLRAFACIEI--TL 543
PK T +SK +++ V VL+ L+S PVE + F+ I++ T
Sbjct: 557 -PKPTQPTSSK--KRKRVNEDIPDCKVLKPLLSGSI-PVEQFVQTLEKHGFSDIKVEDTA 612
Query: 544 EKCIVVLEWASNPI 557
+ IV+L+ A I
Sbjct: 613 KGHIVLLQEAETLI 626
Score = 97 (39.2 bits), Expect = 4.0e-16, Sum P(2) = 4.0e-16
Identities = 27/104 (25%), Positives = 51/104 (49%)
Query: 20 FKNKSIMMDCGIHPGLSGMD--ALPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGF 77
F +K + +C H + + LP +L++ +D++LIS++H ALP+ TGF
Sbjct: 63 FLDKELK-ECSGHVFVDSVPEFCLPETELIDLSTVDVILISNYHC--MMALPYITEHTGF 119
Query: 78 KGRCFMTHATKAIYRWLLSDYIK-VSNISTEQM--LYTESDLEK 118
G + T T I R L+ + + + + Q L+ D+++
Sbjct: 120 TGTVYATEPTMQIGRLLMEELVNFIERVPKAQSASLWKNKDIQR 163
Score = 67 (28.6 bits), Expect = 5.2e-13, Sum P(2) = 5.2e-13
Identities = 16/41 (39%), Positives = 28/41 (68%)
Query: 15 CIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLL 55
C +L+FK+ +IM+DCG+ S ++ LP + LV+S ++ L
Sbjct: 14 CNVLKFKSTTIMLDCGLDM-TSTLNFLP-LPLVQSPRLSNL 52
>UNIPROTKB|Q9NV88 [details] [associations]
symbol:INTS9 "Integrator complex subunit 9" species:9606
"Homo sapiens" [GO:0005515 "protein binding" evidence=IPI]
[GO:0016180 "snRNA processing" evidence=IDA] [GO:0032039
"integrator complex" evidence=IDA] InterPro:IPR027074
eggNOG:COG1236 InterPro:IPR022712 Pfam:PF10996 SMART:SM01027
GO:GO:0032039 EMBL:U96629 GO:GO:0016180 CTD:55756
HOGENOM:HOG000045359 HOVERGEN:HBG081802 KO:K13146 OrthoDB:EOG415GCW
PANTHER:PTHR11203:SF2 OMA:PLAMKCV EMBL:AK001733 EMBL:AK298468
EMBL:AK300593 EMBL:AC040975 EMBL:AC131969 EMBL:BC025267
EMBL:BK005726 EMBL:BK005674 IPI:IPI00290514 IPI:IPI00871167
RefSeq:NP_001138631.1 RefSeq:NP_001166033.1 RefSeq:NP_060720.2
UniGene:Hs.162397 ProteinModelPortal:Q9NV88 SMR:Q9NV88
IntAct:Q9NV88 STRING:Q9NV88 PhosphoSite:Q9NV88 DMDM:119371246
PaxDb:Q9NV88 PRIDE:Q9NV88 DNASU:55756 Ensembl:ENST00000416984
Ensembl:ENST00000521022 Ensembl:ENST00000521777 GeneID:55756
KEGG:hsa:55756 UCSC:uc003xha.3 GeneCards:GC08M028625
HGNC:HGNC:25592 MIM:611352 neXtProt:NX_Q9NV88 PharmGKB:PA162392192
InParanoid:Q9NV88 PhylomeDB:Q9NV88 ChiTaRS:INTS9 GenomeRNAi:55756
NextBio:60763 ArrayExpress:Q9NV88 Bgee:Q9NV88 CleanEx:HS_INTS9
Genevestigator:Q9NV88 GermOnline:ENSG00000104299 Uniprot:Q9NV88
Length = 658
Score = 182 (69.1 bits), Expect = 1.4e-15, Sum P(2) = 1.4e-15
Identities = 108/494 (21%), Positives = 196/494 (39%)
Query: 88 KAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNG-IKFSAYNAGH 146
K I R L S +ST + YT ++ ++ KI+ + + ++ ++ G ++ + ++G+
Sbjct: 159 KDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQKIELFGAVQVTPLSSGY 218
Query: 147 VLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK-PDILITESTYGTHVHEQREE 205
LG++ ++I+ K+ Y S H + +K D+L+ T + +
Sbjct: 219 ALGSSNWIIQSHYEKVSYVSGSSLLTT-HPQPMDQASLKNSDVLVLTGL--TQIPTANPD 275
Query: 206 -REGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAK 264
G F S + V GG L+P + G +LL L +Y L +P+Y+ S +A
Sbjct: 276 GMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYID-SAGLSSVPLYFISPVAN 334
Query: 265 KCMSVYQTYINAMNDRIRRQISINNP-FVFKHISNLKGIDHFEDI-G--------PCVVM 314
+ Q + + + ++ + P F + + H+ I G PCVV
Sbjct: 335 SSLEFSQIFAEWLCHNKQSKVYLPEPPFPHAELIQTNKLKHYPSIHGDFSNDFRQPCVVF 394
Query: 315 ASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPE-EVIGMSGQRLPLK 373
++ G E+W + N VI +EP+ + PL
Sbjct: 395 TGHPSLRFGDVVHFMELWGKSSLNTVIF--------------TEPDFSYLEALAPYQPLA 440
Query: 374 MSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMEL 433
M Y ++ Q S+ ++E++P HVV EQ + + D
Sbjct: 441 MKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCP--EQYTQPPPAQSHRMDLMIDCQPPAMS 498
Query: 434 YNPRNTVSVDLYFKGEKTAKVMGELAVE----NLKPDAALSGI--IVKRNFNYHLLAPSD 487
Y +++ + EK ++M ELA +KP +L+ + ++ N HLL P
Sbjct: 499 YRRAEVLALPFKRRYEKI-EIMPELADSLVPMEIKPGISLATVSAVLHTKDNKHLLQPPP 557
Query: 488 LPKY-TDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLDEK-RLRAFACIEI--TL 543
P T K K + VL+ L+S PVE + F+ I++ T
Sbjct: 558 RPAQPTSGKKRKRVSDDVP----DCKVLKPLLSGSI-PVEQFVQTLEKHGFSDIKVEDTA 612
Query: 544 EKCIVVLEWASNPI 557
+ IV+L+ A I
Sbjct: 613 KGHIVLLQEAETLI 626
Score = 96 (38.9 bits), Expect = 1.4e-15, Sum P(2) = 1.4e-15
Identities = 27/104 (25%), Positives = 51/104 (49%)
Query: 20 FKNKSIMMDCGIHPGLSGMD--ALPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGF 77
F +K + +C H + + LP +L++ +D++LIS++H ALP+ TGF
Sbjct: 63 FLDKELK-ECSGHVFVDSVPEFCLPETELIDLSTVDVILISNYHC--MMALPYITEHTGF 119
Query: 78 KGRCFMTHATKAIYRWLLSDYIK-VSNISTEQM--LYTESDLEK 118
G + T T I R L+ + + + + Q L+ D+++
Sbjct: 120 TGTVYATEPTVQIGRLLMEELVNFIERVPKAQSASLWKNKDIQR 163
Score = 67 (28.6 bits), Expect = 1.4e-12, Sum P(2) = 1.4e-12
Identities = 16/41 (39%), Positives = 28/41 (68%)
Query: 15 CIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLL 55
C +L+FK+ +IM+DCG+ S ++ LP + LV+S ++ L
Sbjct: 14 CNVLKFKSTTIMLDCGLDM-TSTLNFLP-LPLVQSPRLSNL 52
>UNIPROTKB|F6XI08 [details] [associations]
symbol:INTS9 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0032039 "integrator complex" evidence=IEA]
[GO:0016180 "snRNA processing" evidence=IEA] InterPro:IPR027074
InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 CTD:55756 KO:K13146
PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445 OMA:PLAMKCV
Ensembl:ENSCAFT00000013124 EMBL:AAEX03014336 RefSeq:XP_543216.2
GeneID:486090 KEGG:cfa:486090 Uniprot:F6XI08
Length = 658
Score = 180 (68.4 bits), Expect = 2.4e-15, Sum P(2) = 2.4e-15
Identities = 112/494 (22%), Positives = 199/494 (40%)
Query: 88 KAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNG-IKFSAYNAGH 146
K I R L S +ST + YT ++ ++ KI+ + + ++ ++ G ++ + ++G+
Sbjct: 159 KDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQKIELFGAVQVTPLSSGY 218
Query: 147 VLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK-PDILITESTYGTHVHEQREE 205
LG++ ++I+ K+ Y S H + +K D+LI T + +
Sbjct: 219 ALGSSNWIIQSHYEKVSYVSGSSLLTT-HPQPMDQASLKNSDVLILTGL--TQIPTANPD 275
Query: 206 -REGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAK 264
G F S + V GG L+P + G +LL L +Y L +IP Y+ S +A
Sbjct: 276 GMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYID-SAGLSNIPFYFISPVAN 334
Query: 265 KCMSVYQTYINAMNDRIRRQISINNP-FV---------FKHISNLKGIDHFEDIG-PCVV 313
+ Q + + + ++ + P F KH +L G D D PCVV
Sbjct: 335 SSLEFSQIFAEWLCHNKQTKVYLPEPPFPHAELIQTNKLKHYPSLHG-DFSSDFRQPCVV 393
Query: 314 MASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPE-EVIGMSGQRLPL 372
++ G E+W + N VI +EP+ + PL
Sbjct: 394 FTGHPSLRFGDVVHFMELWGKSSLNTVIF--------------TEPDFSYLEALAPYQPL 439
Query: 373 KMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSME 432
M Y ++ Q S+ ++E++P HVV EQ + + D
Sbjct: 440 AMKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCP--EQYTQPPPAQSHRMDLMIDCQPPAM 497
Query: 433 LYNPRNTVSVDLYFKGEKTAKVMGELAVE----NLKPDAALSGI--IVKRNFNYHLLAPS 486
Y +++ + EK ++M ELA +KP +L+ + ++ N H+L P
Sbjct: 498 SYRRAEVLALPFKRRYEKI-EIMPELADALVPMEIKPGISLATVSAVLHTKDNKHVLQPP 556
Query: 487 DLPKYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLDEK-RLRAFACIEI--TL 543
P+ T K ++ S VL+ L+S PVE + F+ I++ T
Sbjct: 557 --PRPTQPTGGKKRKRASDDIP-DCKVLKPLLSGSI-PVEQFVQTLEKHGFSDIKVEDTA 612
Query: 544 EKCIVVLEWASNPI 557
+ IV+L+ A I
Sbjct: 613 KGHIVLLQEAETLI 626
Score = 96 (38.9 bits), Expect = 2.4e-15, Sum P(2) = 2.4e-15
Identities = 27/104 (25%), Positives = 51/104 (49%)
Query: 20 FKNKSIMMDCGIHPGLSGMD--ALPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGF 77
F +K + +C H + + LP +L++ +D++LIS++H ALP+ TGF
Sbjct: 63 FLDKELK-ECSGHVFVDSVPEFCLPETELIDLSTVDVILISNYHC--MMALPYITEHTGF 119
Query: 78 KGRCFMTHATKAIYRWLLSDYIK-VSNISTEQM--LYTESDLEK 118
G + T T I R L+ + + + + Q L+ D+++
Sbjct: 120 TGTVYATEPTVQIGRLLMEELVNFIERVPKAQSASLWKNKDIQR 163
Score = 67 (28.6 bits), Expect = 2.4e-12, Sum P(2) = 2.4e-12
Identities = 16/41 (39%), Positives = 28/41 (68%)
Query: 15 CIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLL 55
C +L+FK+ +IM+DCG+ S ++ LP + LV+S ++ L
Sbjct: 14 CNVLKFKSTTIMLDCGLDM-TSTLNFLP-LPLVQSPRLSNL 52
>UNIPROTKB|Q2KJA6 [details] [associations]
symbol:INTS9 "Integrator complex subunit 9" species:9913
"Bos taurus" [GO:0032039 "integrator complex" evidence=IEA]
[GO:0016180 "snRNA processing" evidence=IEA] InterPro:IPR027074
GO:GO:0005634 eggNOG:COG1236 InterPro:IPR022712 Pfam:PF10996
SMART:SM01027 EMBL:BC105437 IPI:IPI00701634 RefSeq:NP_001039828.1
UniGene:Bt.91042 ProteinModelPortal:Q2KJA6 STRING:Q2KJA6
GeneID:533964 KEGG:bta:533964 CTD:55756 HOGENOM:HOG000045359
HOVERGEN:HBG081802 InParanoid:Q2KJA6 KO:K13146 OrthoDB:EOG415GCW
NextBio:20876211 PANTHER:PTHR11203:SF2 Uniprot:Q2KJA6
Length = 658
Score = 178 (67.7 bits), Expect = 3.9e-15, Sum P(2) = 3.9e-15
Identities = 91/422 (21%), Positives = 167/422 (39%)
Query: 88 KAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNG-IKFSAYNAGH 146
K I R L S +ST + YT ++ ++ KI+ + + ++ ++ G ++ + ++G+
Sbjct: 159 KDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQKIELFGAVQVTPLSSGY 218
Query: 147 VLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK-PDILITESTYGTHVHEQR-E 204
LG++ ++I+ K+ Y S H + +K D+LI T + +
Sbjct: 219 ALGSSNWIIQSHYEKVSYVSGSSLLTT-HPQPMDQASLKNSDVLILTGL--TQIPTANPD 275
Query: 205 EREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAK 264
G F S + V GG L+P + G +LL L +Y L IP Y+ S +A
Sbjct: 276 SMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYID-SAGLSSIPFYFISPVAN 334
Query: 265 KCMSVYQTYINAMNDRIRRQISINNP-FVFKHISNLKGIDHFEDI-G--------PCVVM 314
+ Q + + + ++ + P F + + H+ I G PCVV
Sbjct: 335 SSLEFSQIFAEWLCHNKQTKVYLPEPPFPHAELIQTNKLKHYPSIHGDFSNDFRQPCVVF 394
Query: 315 ASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPE-EVIGMSGQRLPLK 373
++ G E+W + N VI +EP+ + PL
Sbjct: 395 TGHPSLRFGDVVHFMELWGKSSLNTVIF--------------TEPDFSYLEALAPYQPLA 440
Query: 374 MSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMEL 433
M Y ++ Q S+ ++E++P HVV EQ + + D
Sbjct: 441 MKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCP--EQYTQPTPAQSHRMDLMVDCQPPAMS 498
Query: 434 YNPRNTVSVDLYFKGEKTAKVMGELAVE----NLKPDAALSGI--IVKRNFNYHLLAPSD 487
Y +++ + EK ++M ELA +KP +L+ + ++ N H+L P
Sbjct: 499 YRRAEVLALPFKRRYEKI-EIMPELADSLVPMEIKPGISLATVSAVLHTKDNKHVLQPPP 557
Query: 488 LP 489
P
Sbjct: 558 RP 559
Score = 96 (38.9 bits), Expect = 3.9e-15, Sum P(2) = 3.9e-15
Identities = 27/104 (25%), Positives = 51/104 (49%)
Query: 20 FKNKSIMMDCGIHPGLSGMD--ALPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGF 77
F +K + +C H + + LP +L++ +D++LIS++H ALP+ TGF
Sbjct: 63 FLDKELK-ECSGHVFVDSVPEFCLPETELIDLSTVDVILISNYHC--MMALPYITEHTGF 119
Query: 78 KGRCFMTHATKAIYRWLLSDYIK-VSNISTEQM--LYTESDLEK 118
G + T T I R L+ + + + + Q L+ D+++
Sbjct: 120 TGTVYATEPTVQIGRLLMEELVNFIERVPKAQSASLWKNKDIQR 163
Score = 67 (28.6 bits), Expect = 3.9e-12, Sum P(2) = 3.9e-12
Identities = 16/41 (39%), Positives = 28/41 (68%)
Query: 15 CIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLL 55
C +L+FK+ +IM+DCG+ S ++ LP + LV+S ++ L
Sbjct: 14 CNVLKFKSTTIMLDCGLDM-TSTLNFLP-LPLVQSPRLSNL 52
>UNIPROTKB|F1MMA6 [details] [associations]
symbol:INTS9 "Integrator complex subunit 9" species:9913
"Bos taurus" [GO:0032039 "integrator complex" evidence=IEA]
[GO:0016180 "snRNA processing" evidence=IEA] InterPro:IPR027074
InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 GO:GO:0032039
GO:GO:0016180 IPI:IPI00701634 UniGene:Bt.91042
PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445 OMA:PLAMKCV
EMBL:DAAA02021965 EMBL:DAAA02021964 Ensembl:ENSBTAT00000049079
ArrayExpress:F1MMA6 Uniprot:F1MMA6
Length = 658
Score = 177 (67.4 bits), Expect = 5.0e-15, Sum P(2) = 5.0e-15
Identities = 91/422 (21%), Positives = 167/422 (39%)
Query: 88 KAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNG-IKFSAYNAGH 146
K I R L S +ST + YT ++ ++ KI+ + + ++ ++ G ++ + ++G+
Sbjct: 159 KDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQKIELFGAVQVTPLSSGY 218
Query: 147 VLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK-PDILITESTYGTHVHEQR-E 204
LG++ ++I+ K+ Y S H + +K D+LI T + +
Sbjct: 219 ALGSSNWIIQSHYEKVSYVSGSSLLTT-HPQPMDQASLKNSDVLILTGL--TQIPTANPD 275
Query: 205 EREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAK 264
G F S + V GG L+P + G +LL L +Y L IP Y+ S +A
Sbjct: 276 SMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYID-SAGLSSIPFYFISPVAN 334
Query: 265 KCMSVYQTYINAMNDRIRRQISINNP-FVFKHISNLKGIDHFEDI-G--------PCVVM 314
+ Q + + + ++ + P F + + H+ I G PCVV
Sbjct: 335 SSLEFSQIFAEWLCHNKQTKVYLPEPPFPHAELIQTNKLKHYPSIHGDFSNDFRQPCVVF 394
Query: 315 ASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPE-EVIGMSGQRLPLK 373
++ G E+W + N VI +EP+ + PL
Sbjct: 395 TGHPSLRFGDVVHFMELWGKSSLNTVIF--------------TEPDFSYLEALAPYQPLA 440
Query: 374 MSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMEL 433
M Y ++ Q S+ ++E++P HVV EQ + + D
Sbjct: 441 MKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCP--EQYTQPPPAQSHRMDLMVDCQPPAMS 498
Query: 434 YNPRNTVSVDLYFKGEKTAKVMGELAVE----NLKPDAALSGI--IVKRNFNYHLLAPSD 487
Y +++ + EK ++M ELA +KP +L+ + ++ N H+L P
Sbjct: 499 YRRAEVLALPFKRRYEKI-EIMPELADSLVPMEIKPGISLATVSAVLHTKDNKHVLQPPP 557
Query: 488 LP 489
P
Sbjct: 558 RP 559
Score = 96 (38.9 bits), Expect = 5.0e-15, Sum P(2) = 5.0e-15
Identities = 27/104 (25%), Positives = 51/104 (49%)
Query: 20 FKNKSIMMDCGIHPGLSGMD--ALPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGF 77
F +K + +C H + + LP +L++ +D++LIS++H ALP+ TGF
Sbjct: 63 FLDKELK-ECSGHVFVDSVPEFCLPETELIDLSTVDVILISNYHC--MMALPYITEHTGF 119
Query: 78 KGRCFMTHATKAIYRWLLSDYIK-VSNISTEQM--LYTESDLEK 118
G + T T I R L+ + + + + Q L+ D+++
Sbjct: 120 TGTVYATEPTVQIGRLLMEELVNFIERVPKAQSASLWKNKDIQR 163
Score = 67 (28.6 bits), Expect = 5.1e-12, Sum P(2) = 5.1e-12
Identities = 16/41 (39%), Positives = 28/41 (68%)
Query: 15 CIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLL 55
C +L+FK+ +IM+DCG+ S ++ LP + LV+S ++ L
Sbjct: 14 CNVLKFKSTTIMLDCGLDM-TSTLNFLP-LPLVQSPRLSNL 52
>UNIPROTKB|Q0C1L6 [details] [associations]
symbol:HNE_1669 "Putative uncharacterized protein"
species:228405 "Hyphomonas neptunium ATCC 15444" [GO:0003674
"molecular_function" evidence=ND] [GO:0005575 "cellular_component"
evidence=ND] [GO:0008150 "biological_process" evidence=ND]
InterPro:IPR001279 SMART:SM00849 GO:GO:0016787 EMBL:CP000158
GenomeReviews:CP000158_GR eggNOG:COG1236 RefSeq:YP_760377.1
ProteinModelPortal:Q0C1L6 STRING:Q0C1L6 GeneID:4288204
KEGG:hne:HNE_1669 PATRIC:32216161 HOGENOM:HOG000035995 OMA:STFGLPI
ProtClustDB:CLSK2517173 BioCyc:HNEP228405:GI69-1701-MONOMER
InterPro:IPR026360 TIGRFAMs:TIGR04122 Uniprot:Q0C1L6
Length = 333
Score = 183 (69.5 bits), Expect = 1.1e-14, Sum P(3) = 1.1e-14
Identities = 47/155 (30%), Positives = 80/155 (51%)
Query: 126 INFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK 185
+ + E +V ++ + Y AGHVLG+A L+E AG +++ TGDF R D P+
Sbjct: 72 VAYGETVEVGDVRVTLYPAGHVLGSAQVLLERAGERVIVTGDFKRAADP--TCPPFVPIA 129
Query: 186 PDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRC-LIPVFALGRAQELLLILDEY 244
D+LITE+T+G V + ++ + RC L+ +ALG+AQ ++ L E
Sbjct: 130 CDVLITEATFGLPVFRHPPASD-EIAKVMERLAESPERCVLVGAYALGKAQRVICHLREA 188
Query: 245 WSLHPELHDIPIYYASSLAKKCMSVYQTYINAMND 279
+D PIY ++ K C ++Y+ + A+ +
Sbjct: 189 G------YDKPIYLHGAMEKLC-ALYEAHGVALGE 216
Score = 64 (27.6 bits), Expect = 1.1e-14, Sum P(3) = 1.1e-14
Identities = 15/62 (24%), Positives = 33/62 (53%)
Query: 359 PEEVIGMSGQRLPLKM-----SVDY-ISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEM 412
P+ V+ M+ L ++ ++D + S H D+++ + +RE+ P+ V + HG + +
Sbjct: 249 PDPVLAMASGWLQVRQRVRQNNIDLPLVISDHADWEELTRTIREVAPSEVWVTHGSEAGL 308
Query: 413 SR 414
R
Sbjct: 309 LR 310
Score = 45 (20.9 bits), Expect = 1.1e-14, Sum P(3) = 1.1e-14
Identities = 12/38 (31%), Positives = 18/38 (47%)
Query: 31 IHPGLSGMDALPFVDLVE-SDQIDLLLISHFHLDHCGA 67
I PG G++ V+ S L +++H H DH A
Sbjct: 8 IKPGAGGIEVAGGAAFVDPSLPKPLAIVTHGHADHARA 45
>UNIPROTKB|F1RJQ5 [details] [associations]
symbol:INTS9 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0032039 "integrator complex" evidence=IEA] [GO:0016180
"snRNA processing" evidence=IEA] InterPro:IPR027074
InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 PANTHER:PTHR11203:SF2
GeneTree:ENSGT00390000001445 OMA:PLAMKCV EMBL:CU407017
Ensembl:ENSSSCT00000010615 Uniprot:F1RJQ5
Length = 576
Score = 176 (67.0 bits), Expect = 1.8e-14, Sum P(2) = 1.8e-14
Identities = 91/422 (21%), Positives = 167/422 (39%)
Query: 88 KAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNG-IKFSAYNAGH 146
K I R L S +ST + YT ++ ++ KI+ + + ++ ++ G ++ + ++G+
Sbjct: 77 KDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQMVGYSQKIELFGAVQVTPLSSGY 136
Query: 147 VLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK-PDILITESTYGTHVHEQREE 205
LG++ ++I+ K+ Y S H + +K D+LI T + +
Sbjct: 137 ALGSSNWIIQSHYEKVSYVSGSSLLTT-HPQPMDQASLKNSDVLILTGL--TQIPTANPD 193
Query: 206 -REGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAK 264
G F S + V GG L+P + G +LL L +Y L IP Y+ S +A
Sbjct: 194 GMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYID-SAGLSSIPFYFISPVAN 252
Query: 265 KCMSVYQTYINAMNDRIRRQISINNP-FVFKHISNLKGIDHFEDI-G--------PCVVM 314
+ Q + + + ++ + P F + + H+ I G PCVV
Sbjct: 253 SSLEFSQIFAEWLCHNKQTKVYLPEPPFPHAELIQTNKLKHYPSIHGDFSNDFRQPCVVF 312
Query: 315 ASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPE-EVIGMSGQRLPLK 373
++ G E+W + N VI +EP+ + PL
Sbjct: 313 TGHPSLRFGDVVHFMELWGKSSLNTVIF--------------TEPDFSYLEALAPYQPLA 358
Query: 374 MSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMEL 433
M Y ++ Q S+ ++E++P HVV EQ + + D
Sbjct: 359 MKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCP--EQYTQPPPAQSHRMDLMIDCQPPAMS 416
Query: 434 YNPRNTVSVDLYFKGEKTAKVMGELAVE----NLKPDAALSGI--IVKRNFNYHLLAPSD 487
Y +++ + EK ++M ELA +KP +L+ + ++ N H+L P
Sbjct: 417 YRRAEVLALPFKRRYEKI-EIMPELADSLVPMEIKPGISLATVSAVLHTKDNKHVLQPPP 475
Query: 488 LP 489
P
Sbjct: 476 RP 477
Score = 90 (36.7 bits), Expect = 1.8e-14, Sum P(2) = 1.8e-14
Identities = 22/80 (27%), Positives = 41/80 (51%)
Query: 42 PFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIK- 100
P +L++ +D++LIS++H ALP+ TGF G + T T I R L+ + +
Sbjct: 4 PQTELIDLSTVDVILISNYHC--MMALPYITEHTGFTGTVYATEPTVQIGRLLMEELVNF 61
Query: 101 VSNISTEQM--LYTESDLEK 118
+ + Q L+ D+++
Sbjct: 62 IERVPKAQSASLWKNKDIQR 81
>UNIPROTKB|G3XAN1 [details] [associations]
symbol:INTS9 "Integrator complex subunit 9" species:9606
"Homo sapiens" [GO:0016180 "snRNA processing" evidence=IEA]
[GO:0032039 "integrator complex" evidence=IEA] InterPro:IPR027074
EMBL:CH471080 InterPro:IPR022712 Pfam:PF10996 SMART:SM01027
PANTHER:PTHR11203:SF2 EMBL:AC040975 EMBL:AC131969 UniGene:Hs.162397
HGNC:HGNC:25592 ChiTaRS:INTS9 ProteinModelPortal:G3XAN1
Ensembl:ENST00000523303 ArrayExpress:G3XAN1 Bgee:G3XAN1
Uniprot:G3XAN1
Length = 525
Score = 168 (64.2 bits), Expect = 2.3e-14, Sum P(2) = 2.3e-14
Identities = 71/330 (21%), Positives = 135/330 (40%)
Query: 88 KAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNG-IKFSAYNAGH 146
K I R L S +ST + YT ++ ++ KI+ + + ++ ++ G ++ + ++G+
Sbjct: 159 KDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQKIELFGAVQVTPLSSGY 218
Query: 147 VLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK-PDILITESTYGTHVHEQREE 205
LG++ ++I+ K+ Y S H + +K D+L+ T + +
Sbjct: 219 ALGSSNWIIQSHYEKVSYVSGSSLLTT-HPQPMDQASLKNSDVLVLTGL--TQIPTANPD 275
Query: 206 -REGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAK 264
G F S + V GG L+P + G +LL L +Y L +P+Y+ S +A
Sbjct: 276 GMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYID-SAGLSSVPLYFISPVAN 334
Query: 265 KCMSVYQTYINAMNDRIRRQISINNP-FVFKHISNLKGIDHFEDI-G--------PCVVM 314
+ Q + + + ++ + P F + + H+ I G PCVV
Sbjct: 335 SSLEFSQIFAEWLCHNKQSKVYLPEPPFPHAELIQTNKLKHYPSIHGDFSNDFRQPCVVF 394
Query: 315 ASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPE-EVIGMSGQRLPLK 373
++ G E+W + N VI +EP+ + PL
Sbjct: 395 TGHPSLRFGDVVHFMELWGKSSLNTVIF--------------TEPDFSYLEALAPYQPLA 440
Query: 374 MSVDYISFSAHTDYQQTSEFVRELRPAHVV 403
M Y ++ Q S+ ++E++P HVV
Sbjct: 441 MKCIYCPIDTRLNFIQVSKLLKEVQPLHVV 470
Score = 96 (38.9 bits), Expect = 2.3e-14, Sum P(2) = 2.3e-14
Identities = 27/104 (25%), Positives = 51/104 (49%)
Query: 20 FKNKSIMMDCGIHPGLSGMD--ALPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGF 77
F +K + +C H + + LP +L++ +D++LIS++H ALP+ TGF
Sbjct: 63 FLDKELK-ECSGHVFVDSVPEFCLPETELIDLSTVDVILISNYHC--MMALPYITEHTGF 119
Query: 78 KGRCFMTHATKAIYRWLLSDYIK-VSNISTEQM--LYTESDLEK 118
G + T T I R L+ + + + + Q L+ D+++
Sbjct: 120 TGTVYATEPTVQIGRLLMEELVNFIERVPKAQSASLWKNKDIQR 163
Score = 67 (28.6 bits), Expect = 2.4e-11, Sum P(2) = 2.4e-11
Identities = 16/41 (39%), Positives = 28/41 (68%)
Query: 15 CIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLL 55
C +L+FK+ +IM+DCG+ S ++ LP + LV+S ++ L
Sbjct: 14 CNVLKFKSTTIMLDCGLDM-TSTLNFLP-LPLVQSPRLSNL 52
>UNIPROTKB|Q8EJC6 [details] [associations]
symbol:SO_0541 "RNA-metabolizing metallo-beta-lactamase
family protein" species:211586 "Shewanella oneidensis MR-1"
[GO:0008150 "biological_process" evidence=ND] InterPro:IPR001279
Pfam:PF00753 SMART:SM00849 Pfam:PF07521 GO:GO:0016787 EMBL:AE014299
GenomeReviews:AE014299_GR InterPro:IPR022712 InterPro:IPR011108
Pfam:PF10996 SMART:SM01027 OMA:MAVEYMS HOGENOM:HOG000244774
KO:K07576 RefSeq:NP_716177.2 ProteinModelPortal:Q8EJC6
DNASU:1168409 GeneID:1168409 KEGG:son:SO_0541 PATRIC:23520762
ProtClustDB:CLSK2516780 Uniprot:Q8EJC6
Length = 480
Score = 213 (80.0 bits), Expect = 3.8e-14, P = 3.8e-14
Identities = 83/336 (24%), Positives = 152/336 (45%)
Query: 110 LYTESDLEKSMDKIETINFHE-EKDVNGIKFSAYNAGHVLGAAMFLIEIA-GV---KILY 164
L+T D E+++ + ++ + + + + + +AGH+LG+A+ + + G KI++
Sbjct: 127 LFTVEDAEQAISQFVSLEYGQVTRVIPHVDICLSDAGHILGSALVELWLGEGKSQKKIVF 186
Query: 165 TGDFSRQEDRHLMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRG-GR 223
+GD R L + D+++ ESTYG H + + VN G
Sbjct: 187 SGDLGRAGMPILQNPTLVDTA-DLVLMESTYGNRFHRSWTDTLAELKDIFAKTVNESQGN 245
Query: 224 CLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRR 283
L+P F++GRAQELL + Y + +L I S +A + VY M++ +R
Sbjct: 246 ILLPAFSVGRAQELLYLFHLY-AKEWDLGRWKICLDSPMAIEATRVYVNNYPLMDEDFKR 304
Query: 284 QISINNPFVFKHISNLKGIDHFED-IG------PCVVMASPGMMQSGLSRELFE--MWCT 334
+ +P +SN++ I E+ I +++A GM G R E +W +
Sbjct: 305 -FTRQHPGQHPLLSNVEFIQTTEESIALNEVHKGLIIIAGSGMCNGGRIRSHLEHNLWRS 363
Query: 335 DAKNGVIIAGYCVEGTLAKTILSEPEEVIGMSGQRLPLKMSVDYIS-FSAHTDYQQTSEF 393
+ VII G+ GT + ++ +E+ + G + + + + SAH D + +
Sbjct: 364 ECD--VIICGFQALGTPGRALVDGAKELT-IHGNSVNVAAKLHTVGGLSAHADQAELLRW 420
Query: 394 VR--ELRPAHVVLVHGEQNEMSRLKAALTREYEDDP 427
R E +P +VLVHGE L A + ++ + P
Sbjct: 421 YRHFEEQPP-LVLVHGEPEAQQGLVAVMNQDPKTKP 455
Score = 150 (57.9 bits), Expect = 3.2e-07, P = 3.2e-07
Identities = 48/171 (28%), Positives = 83/171 (48%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDAL----PFVDLVESDQIDLLLISHFH 61
GA +EV SC ++ K +++DCG+ G D L PFV + I +++SH H
Sbjct: 9 GAAREVTGSCHLVTVAGKHLLLDCGLIQG-GKADELRNHEPFV--FDPQTIVAVVLSHAH 65
Query: 62 LDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM------------ 109
+DH G LP L+K GF G + AT + +L D + TE+
Sbjct: 66 IDHSGRLP-LLVKAGFDGPIYTHKATAELCAIMLKDAAMLQVRDTERTNKKRAKHDLAPL 124
Query: 110 --LYTESDLEKSMDKIETINFHE-EKDVNGIKFSAYNAGHVLGAAMFLIEI 157
L+T D E+++ + ++ + + + + + +AGH+LG+A L+E+
Sbjct: 125 EPLFTVEDAEQAISQFVSLEYGQVTRVIPHVDICLSDAGHILGSA--LVEL 173
>TIGR_CMR|SO_0541 [details] [associations]
symbol:SO_0541 "metallo-beta-lactamase family protein"
species:211586 "Shewanella oneidensis MR-1" [GO:0008150
"biological_process" evidence=ND] [GO:0003824 "catalytic activity"
evidence=ISS] InterPro:IPR001279 Pfam:PF00753 SMART:SM00849
Pfam:PF07521 GO:GO:0016787 EMBL:AE014299 GenomeReviews:AE014299_GR
InterPro:IPR022712 InterPro:IPR011108 Pfam:PF10996 SMART:SM01027
OMA:MAVEYMS HOGENOM:HOG000244774 KO:K07576 RefSeq:NP_716177.2
ProteinModelPortal:Q8EJC6 DNASU:1168409 GeneID:1168409
KEGG:son:SO_0541 PATRIC:23520762 ProtClustDB:CLSK2516780
Uniprot:Q8EJC6
Length = 480
Score = 213 (80.0 bits), Expect = 3.8e-14, P = 3.8e-14
Identities = 83/336 (24%), Positives = 152/336 (45%)
Query: 110 LYTESDLEKSMDKIETINFHE-EKDVNGIKFSAYNAGHVLGAAMFLIEIA-GV---KILY 164
L+T D E+++ + ++ + + + + + +AGH+LG+A+ + + G KI++
Sbjct: 127 LFTVEDAEQAISQFVSLEYGQVTRVIPHVDICLSDAGHILGSALVELWLGEGKSQKKIVF 186
Query: 165 TGDFSRQEDRHLMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRG-GR 223
+GD R L + D+++ ESTYG H + + VN G
Sbjct: 187 SGDLGRAGMPILQNPTLVDTA-DLVLMESTYGNRFHRSWTDTLAELKDIFAKTVNESQGN 245
Query: 224 CLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRR 283
L+P F++GRAQELL + Y + +L I S +A + VY M++ +R
Sbjct: 246 ILLPAFSVGRAQELLYLFHLY-AKEWDLGRWKICLDSPMAIEATRVYVNNYPLMDEDFKR 304
Query: 284 QISINNPFVFKHISNLKGIDHFED-IG------PCVVMASPGMMQSGLSRELFE--MWCT 334
+ +P +SN++ I E+ I +++A GM G R E +W +
Sbjct: 305 -FTRQHPGQHPLLSNVEFIQTTEESIALNEVHKGLIIIAGSGMCNGGRIRSHLEHNLWRS 363
Query: 335 DAKNGVIIAGYCVEGTLAKTILSEPEEVIGMSGQRLPLKMSVDYIS-FSAHTDYQQTSEF 393
+ VII G+ GT + ++ +E+ + G + + + + SAH D + +
Sbjct: 364 ECD--VIICGFQALGTPGRALVDGAKELT-IHGNSVNVAAKLHTVGGLSAHADQAELLRW 420
Query: 394 VR--ELRPAHVVLVHGEQNEMSRLKAALTREYEDDP 427
R E +P +VLVHGE L A + ++ + P
Sbjct: 421 YRHFEEQPP-LVLVHGEPEAQQGLVAVMNQDPKTKP 455
Score = 150 (57.9 bits), Expect = 3.2e-07, P = 3.2e-07
Identities = 48/171 (28%), Positives = 83/171 (48%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIHPGLSGMDAL----PFVDLVESDQIDLLLISHFH 61
GA +EV SC ++ K +++DCG+ G D L PFV + I +++SH H
Sbjct: 9 GAAREVTGSCHLVTVAGKHLLLDCGLIQG-GKADELRNHEPFV--FDPQTIVAVVLSHAH 65
Query: 62 LDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM------------ 109
+DH G LP L+K GF G + AT + +L D + TE+
Sbjct: 66 IDHSGRLP-LLVKAGFDGPIYTHKATAELCAIMLKDAAMLQVRDTERTNKKRAKHDLAPL 124
Query: 110 --LYTESDLEKSMDKIETINFHE-EKDVNGIKFSAYNAGHVLGAAMFLIEI 157
L+T D E+++ + ++ + + + + + +AGH+LG+A L+E+
Sbjct: 125 EPLFTVEDAEQAISQFVSLEYGQVTRVIPHVDICLSDAGHILGSA--LVEL 173
>UNIPROTKB|Q5ZKK2 [details] [associations]
symbol:INTS9 "Integrator complex subunit 9" species:9031
"Gallus gallus" [GO:0016180 "snRNA processing" evidence=IEA]
[GO:0032039 "integrator complex" evidence=IEA] InterPro:IPR027074
eggNOG:COG1236 InterPro:IPR022712 Pfam:PF10996 SMART:SM01027
GO:GO:0032039 GO:GO:0016180 CTD:55756 HOGENOM:HOG000045359
HOVERGEN:HBG081802 KO:K13146 OrthoDB:EOG415GCW
PANTHER:PTHR11203:SF2 EMBL:AJ720082 IPI:IPI00651516
RefSeq:NP_001026271.1 UniGene:Gga.21113 ProteinModelPortal:Q5ZKK2
STRING:Q5ZKK2 Ensembl:ENSGALT00000026848 GeneID:422023
KEGG:gga:422023 GeneTree:ENSGT00390000001445 InParanoid:Q5ZKK2
OMA:PLAMKCV NextBio:20824712 Uniprot:Q5ZKK2
Length = 658
Score = 162 (62.1 bits), Expect = 9.2e-13, Sum P(2) = 9.2e-13
Identities = 70/340 (20%), Positives = 137/340 (40%)
Query: 78 KGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNG- 136
K + T K + R L + +S + YT ++ ++ KI+ + + ++ ++ G
Sbjct: 149 KAQSASTWKNKEVQRLLPAPLKDAVEVSMWRKCYTMPEVNAALSKIQLVGYSQKIELFGA 208
Query: 137 IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK-PDILITESTY 195
++ + ++G+ LG++ ++I+ K+ Y S H + +K D+LI
Sbjct: 209 VQVTPLSSGYALGSSNWIIQSHYEKVSYVSGSSLLTT-HPQPMDQASLKNSDVLILTGL- 266
Query: 196 GTHVHEQREE-REGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDI 254
T + + G F S + V GG L+P + G +LL L +Y L ++
Sbjct: 267 -TQIPTANPDGMVGEFCSNLAMTVRNGGNVLVPCYPSGVIYDLLECLYQYID-SAGLSNV 324
Query: 255 PIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNP-FVFKHISNLKGIDHFEDI-G--- 309
P Y+ S +A + Q + + + ++ + P F + + H+ I G
Sbjct: 325 PFYFISPVANSSLEFSQIFAEWLCHNKQTKVYLPEPPFPHAELIQTNKLKHYPSIHGDFS 384
Query: 310 -----PCVVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPE-EVI 363
PCV+ ++ G E+W + N VI +EP+ +
Sbjct: 385 NDFKQPCVIFTGHPSLRFGDVVHFMELWGKSSLNTVIF--------------TEPDFSYL 430
Query: 364 GMSGQRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVV 403
PL M Y ++ Q S+ ++E++P HVV
Sbjct: 431 DALAPYQPLAMKCVYCPIDTRLNFIQVSKLLKEVQPLHVV 470
Score = 90 (36.7 bits), Expect = 9.2e-13, Sum P(2) = 9.2e-13
Identities = 27/101 (26%), Positives = 48/101 (47%)
Query: 20 FKNKSIMMDCGIHPGLSGMD--ALPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGF 77
F +K + +C H + + LP +L++ +D++LIS++H ALP+ TGF
Sbjct: 63 FLDKELK-ECSGHVFVDSVPEFCLPETELLDLSTVDVILISNYHC--MMALPYITEYTGF 119
Query: 78 KGRCFMTHATKAIYRWLLSDYIK-VSNISTEQMLYTESDLE 117
G + T T I R L+ + + + + Q T + E
Sbjct: 120 TGTVYATEPTVQIGRLLMEELVNSIERVPKAQSASTWKNKE 160
Score = 68 (29.0 bits), Expect = 1.7e-10, Sum P(2) = 1.7e-10
Identities = 16/41 (39%), Positives = 28/41 (68%)
Query: 15 CIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLL 55
C +L+FK+ +IM+DCG+ S ++ LP + LV+S ++ L
Sbjct: 14 CNVLKFKSTTIMLDCGLDM-TSTLNFLP-LPLVQSPRLSKL 52
>UNIPROTKB|H7BYQ6 [details] [associations]
symbol:INTS9 "Integrator complex subunit 9" species:9606
"Homo sapiens" [GO:0016180 "snRNA processing" evidence=IEA]
[GO:0032039 "integrator complex" evidence=IEA] InterPro:IPR027074
InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 PANTHER:PTHR11203:SF2
EMBL:AC040975 EMBL:AC131969 HGNC:HGNC:25592 ChiTaRS:INTS9
ProteinModelPortal:H7BYQ6 Ensembl:ENST00000397363 Bgee:H7BYQ6
Uniprot:H7BYQ6
Length = 552
Score = 182 (69.1 bits), Expect = 6.9e-12, Sum P(2) = 6.9e-12
Identities = 108/494 (21%), Positives = 196/494 (39%)
Query: 88 KAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNG-IKFSAYNAGH 146
K I R L S +ST + YT ++ ++ KI+ + + ++ ++ G ++ + ++G+
Sbjct: 53 KDIQRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQKIELFGAVQVTPLSSGY 112
Query: 147 VLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK-PDILITESTYGTHVHEQREE 205
LG++ ++I+ K+ Y S H + +K D+L+ T + +
Sbjct: 113 ALGSSNWIIQSHYEKVSYVSGSSLLTT-HPQPMDQASLKNSDVLVLTGL--TQIPTANPD 169
Query: 206 -REGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAK 264
G F S + V GG L+P + G +LL L +Y L +P+Y+ S +A
Sbjct: 170 GMVGEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYID-SAGLSSVPLYFISPVAN 228
Query: 265 KCMSVYQTYINAMNDRIRRQISINNP-FVFKHISNLKGIDHFEDI-G--------PCVVM 314
+ Q + + + ++ + P F + + H+ I G PCVV
Sbjct: 229 SSLEFSQIFAEWLCHNKQSKVYLPEPPFPHAELIQTNKLKHYPSIHGDFSNDFRQPCVVF 288
Query: 315 ASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPE-EVIGMSGQRLPLK 373
++ G E+W + N VI +EP+ + PL
Sbjct: 289 TGHPSLRFGDVVHFMELWGKSSLNTVIF--------------TEPDFSYLEALAPYQPLA 334
Query: 374 MSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMEL 433
M Y ++ Q S+ ++E++P HVV EQ + + D
Sbjct: 335 MKCIYCPIDTRLNFIQVSKLLKEVQPLHVVCP--EQYTQPPPAQSHRMDLMIDCQPPAMS 392
Query: 434 YNPRNTVSVDLYFKGEKTAKVMGELAVE----NLKPDAALSGI--IVKRNFNYHLLAPSD 487
Y +++ + EK ++M ELA +KP +L+ + ++ N HLL P
Sbjct: 393 YRRAEVLALPFKRRYEKI-EIMPELADSLVPMEIKPGISLATVSAVLHTKDNKHLLQPPP 451
Query: 488 LPKY-TDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLDEK-RLRAFACIEI--TL 543
P T K K + VL+ L+S PVE + F+ I++ T
Sbjct: 452 RPAQPTSGKKRKRVSDDVP----DCKVLKPLLSGSI-PVEQFVQTLEKHGFSDIKVEDTA 506
Query: 544 EKCIVVLEWASNPI 557
+ IV+L+ A I
Sbjct: 507 KGHIVLLQEAETLI 520
Score = 58 (25.5 bits), Expect = 6.9e-12, Sum P(2) = 6.9e-12
Identities = 15/55 (27%), Positives = 26/55 (47%)
Query: 67 ALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIK-VSNISTEQM--LYTESDLEK 118
ALP+ TGF G + T T I R L+ + + + + Q L+ D+++
Sbjct: 3 ALPYITEHTGFTGTVYATEPTVQIGRLLMEELVNFIERVPKAQSASLWKNKDIQR 57
>FB|FBgn0036570 [details] [associations]
symbol:IntS9 "Integrator 9" species:7227 "Drosophila
melanogaster" [GO:0005847 "mRNA cleavage and polyadenylation
specificity factor complex" evidence=ISS] [GO:0006379 "mRNA
cleavage" evidence=ISS] [GO:0006378 "mRNA polyadenylation"
evidence=ISS] [GO:0034472 "snRNA 3'-end processing" evidence=IDA]
[GO:0032039 "integrator complex" evidence=ISS] [GO:0016180 "snRNA
processing" evidence=ISS] InterPro:IPR027074 EMBL:AE014296
GO:GO:0006378 GO:GO:0005847 GO:GO:0006379 InterPro:IPR022712
Pfam:PF10996 SMART:SM01027 CTD:55756 KO:K13146
PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445 OMA:PLAMKCV
GO:GO:0034472 EMBL:AY058574 RefSeq:NP_648838.3 UniGene:Dm.977
SMR:Q95TS5 IntAct:Q95TS5 MINT:MINT-1734573
EnsemblMetazoa:FBtr0075495 GeneID:39763 KEGG:dme:Dmel_CG5222
UCSC:CG5222-RA FlyBase:FBgn0036570 InParanoid:Q95TS5
OrthoDB:EOG4FJ6QV GenomeRNAi:39763 NextBio:815254 Uniprot:Q95TS5
Length = 654
Score = 144 (55.7 bits), Expect = 7.5e-12, Sum P(2) = 7.5e-12
Identities = 64/310 (20%), Positives = 132/310 (42%)
Query: 110 LYTESDLEKSMDKIETINFHEEKDVNGIKFSA-YNAGHVLGAAMFLIEIAGVKILYTGDF 168
+++ D++ S+ K+ + + E+ D+ G + ++G+ LG++ +++ A KI Y
Sbjct: 180 IFSLKDVQGSLSKVTIMGYDEKLDILGAFIATPVSSGYCLGSSNWVLSTAHEKICYVSGS 239
Query: 169 SRQEDRHLMAAEIPPVK-PDILI-TESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLI 226
S H +K D+LI T T V+ + + G + + G LI
Sbjct: 240 STLTT-HPRPINQSALKHADVLIMTGLTQAPTVNP--DTKLGELCMNVALTIRNNGSALI 296
Query: 227 PVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQIS 286
P + G +L L + + L+++P+++ S +A ++ ++ + ++
Sbjct: 297 PCYPSGVVYDLFECLTQNLE-NAGLNNVPMFFISPVADSSLAYSNILAEWLSSAKQNKVY 355
Query: 287 I-NNPFVFK-HISN--LKGIDHFEDIG-------PCVVMASPGMMQSGLSRELFEMWCTD 335
+ ++PF ++ N LK +H G PCVV ++ G + EMW +
Sbjct: 356 LPDDPFPHAFYLRNNKLKHYNHVFSEGFSKDFRQPCVVFCGHPSLRFGDAVHFIEMWGNN 415
Query: 336 AKNGVIIAGYCVEGTLAKTILSEPE-EVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFV 394
N +I +EP+ + + PL M Y +YQQ ++ +
Sbjct: 416 PNNSIIF--------------TEPDFPYLQVLAPFQPLAMKAFYCPIDTSLNYQQANKLI 461
Query: 395 RELRPAHVVL 404
+EL+P +V+
Sbjct: 462 KELKPNVLVI 471
Score = 100 (40.3 bits), Expect = 7.5e-12, Sum P(2) = 7.5e-12
Identities = 27/76 (35%), Positives = 44/76 (57%)
Query: 41 LPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLS---D 97
LP +++ ++D++LIS++ L+ ALP+ TGFKG+ + T T I R+ L D
Sbjct: 86 LPMDKMLDFSEVDVILISNY-LNML-ALPYITENTGFKGKVYATEPTLQIGRFFLEELVD 143
Query: 98 YIKVSNISTEQMLYTE 113
YI+VS + L+ E
Sbjct: 144 YIEVSPKACTARLWKE 159
Score = 61 (26.5 bits), Expect = 8.0e-08, Sum P(2) = 8.0e-08
Identities = 14/53 (26%), Positives = 26/53 (49%)
Query: 10 EVGRSCIMLEFKNKSIMMDCGI-HPGLSGMDALPFVDLVESDQIDLLLISHFH 61
++ + C ++ FK IM+DCG+ + LPFV ++ + + S H
Sbjct: 9 DLAKPCYIITFKGLRIMLDCGLTEQTVLNFLPLPFVQSLKWSNLPNFVPSRDH 61
>ZFIN|ZDB-GENE-061013-129 [details] [associations]
symbol:ints9 "integrator complex subunit 9"
species:7955 "Danio rerio" [GO:0016180 "snRNA processing"
evidence=IEA] [GO:0032039 "integrator complex" evidence=IEA]
InterPro:IPR027074 ZFIN:ZDB-GENE-061013-129 eggNOG:COG1236
InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 CTD:55756
HOGENOM:HOG000045359 HOVERGEN:HBG081802 KO:K13146 OrthoDB:EOG415GCW
PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445
EMBL:CABZ01076623 EMBL:CABZ01078246 EMBL:CABZ01078247
EMBL:CABZ01078248 EMBL:CABZ01078249 EMBL:BC124793 IPI:IPI00800641
RefSeq:NP_001070738.1 UniGene:Dr.116109 Ensembl:ENSDART00000097865
GeneID:768124 KEGG:dre:768124 InParanoid:Q08BB6 NextBio:20918446
Uniprot:Q08BB6
Length = 658
Score = 148 (57.2 bits), Expect = 6.3e-11, Sum P(2) = 6.3e-11
Identities = 90/437 (20%), Positives = 178/437 (40%)
Query: 88 KAIYRWL---LSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNG-IKFSAYN 143
K I R L L D ++V + S Y+ ++ ++ K++ + + ++ ++ G ++ + +
Sbjct: 159 KEIQRLLPGPLKDAVEVWSWSK---CYSLQEVNSALSKVQLVGYSQKVELFGAVQVTPLS 215
Query: 144 AGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVK-PDILITESTYGTHVHEQ 202
+G+ LG++ ++I+ K+ Y S H E +K D+LI T +
Sbjct: 216 SGYSLGSSNWIIQSHYEKVSYVSGSSLLTT-HPQPMEQSSLKNSDVLILTGL--TQIPTA 272
Query: 203 REERE-GRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASS 261
+ G F S + V GG L+P ++ G +LL L ++ L P Y+ S
Sbjct: 273 NPDGMLGEFCSNLAMTVRAGGNVLVPCYSSGVIYDLLECLYQFMD-SANLGTTPFYFISP 331
Query: 262 LAKKCMSVYQTYINAMNDRIRRQISINNP-FVFKHISNLKGIDHFEDI-G--------PC 311
+A + Q + + + ++ + P F + + H+ I G PC
Sbjct: 332 VANSSLEFSQIFAEWLCQNKQSKVYLPEPPFPHAELIQTNKLKHYPSIHGDFSSEFRQPC 391
Query: 312 VVMASPGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPE-EVIGMSGQRL 370
VV ++ G E+W + N +I +EP+ +
Sbjct: 392 VVFTGHPSLRFGDVVHFMELWGKSSLNTIIF--------------TEPDFSYLDALAPYQ 437
Query: 371 PLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLVHG-EQNEMSRL-KAALTREYEDDPN 428
PL M Y ++ Q S+ +++++P HVV Q S+ ++ L E + P
Sbjct: 438 PLAMKCVYCPIDTRLNFHQVSKLLKDIQPLHVVCPEPYTQPPPSQPHRSDLMLELQPPPM 497
Query: 429 TSMELYNPRNTVSVDLYFKGEKTAKVMGELAVENLKPDAALSGI-------IVKRNFNYH 481
Y + + + + E+ ++ ELA ++L P +G+ +++ N H
Sbjct: 498 A----YRRCSVLRLPFRRRYERI-HLLPELA-KSLVPSEVKAGVSVATVSAVLQSKDNKH 551
Query: 482 LLAPSDLPKYTDLKASK 498
+L P +PK + SK
Sbjct: 552 VLQP--VPKVAPVAPSK 566
Score = 87 (35.7 bits), Expect = 6.3e-11, Sum P(2) = 6.3e-11
Identities = 21/59 (35%), Positives = 33/59 (55%)
Query: 41 LPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYI 99
LP +L++ ID++LIS++H ALP+ TGF G + T T I R L+ + +
Sbjct: 85 LPEKELLDLSTIDVILISNYHC--MMALPYITEHTGFTGTVYATEPTLQIGRLLMEELV 141
Score = 62 (26.9 bits), Expect = 2.4e-08, Sum P(2) = 2.4e-08
Identities = 15/41 (36%), Positives = 26/41 (63%)
Query: 15 CIMLEFKNKSIMMDCGIHPGLSGMDALPFVDLVESDQIDLL 55
C +L+FK+ +IM+DCG+ + + LP + LV S ++ L
Sbjct: 14 CNVLKFKSTTIMLDCGLDT-TAALYFLP-LPLVHSPRLSKL 52
>WB|WBGene00017608 [details] [associations]
symbol:F19F10.12 species:6239 "Caenorhabditis elegans"
[GO:0009792 "embryo development ending in birth or egg hatching"
evidence=IMP] InterPro:IPR027074 GO:GO:0009792 eggNOG:COG1236
InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 KO:K13146
PANTHER:PTHR11203:SF2 GeneTree:ENSGT00390000001445 EMBL:FO080914
RefSeq:NP_504953.1 ProteinModelPortal:Q95ZM2 PaxDb:Q95ZM2
EnsemblMetazoa:F19F10.12 GeneID:179142 KEGG:cel:CELE_F19F10.12
UCSC:F19F10.12 CTD:179142 WormBase:F19F10.12 HOGENOM:HOG000199610
InParanoid:Q95ZM2 OMA:EFMERIE NextBio:904092 Uniprot:Q95ZM2
Length = 646
Score = 154 (59.3 bits), Expect = 8.9e-10, Sum P(3) = 8.9e-10
Identities = 57/281 (20%), Positives = 122/281 (43%)
Query: 111 YTESDLEKSMDKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSR 170
YT +D+ + K+ T++F++ D+ IK + +GH G+A + I+ + Y S
Sbjct: 174 YTTTDMHSCLAKVITLSFNQTIDLFRIKVTPVVSGHTYGSAYWTIKTENEQFAYLSA-SN 232
Query: 171 QEDRHLMAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFA 230
+ E P++ I ++ V +E I D++ + G L+P+
Sbjct: 233 PSATDVKLMETAPLRAVDHILVTSLSRLVDTTAKEMGYSLIKTITDVLKKHGSVLLPICP 292
Query: 231 LGRAQELL-LILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISI-N 288
+G E++ + D + + D PIY+ S +AK +++ M++ + + +
Sbjct: 293 VGPIFEMIEAVSDIITTTNGIPLDTPIYFISPVAKSAIAMASISAEWMSESRQNAVYLPE 352
Query: 289 NPFVFKHISNLKGIDHFEDI-G--------PCVVMASPGMMQSGLSRELFEMWCTDAKNG 339
P+ ++ + ++ + G PCV+ AS ++ G + + E+ +D KN
Sbjct: 353 EPYSHSNLIKSGRVKIYDSLYGSFSKEFKTPCVIFASHASLRIGDAAHMVEVLGSDPKNA 412
Query: 340 VIIAGYCVEGTLAKTILSEPEEVIGMSGQRLPLKMSVDYIS 380
VI+ + L + EP + + +P+ +D+ S
Sbjct: 413 VIVT----DPDLPCEDVREPFRNLPIKFINIPMDFRMDFAS 449
Score = 68 (29.0 bits), Expect = 8.9e-10, Sum P(3) = 8.9e-10
Identities = 16/61 (26%), Positives = 35/61 (57%)
Query: 45 DLVESDQIDLLLISHFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIK-VSN 103
D+++ D ID +L+S++ G LP++ +GF G+ ++T + L+ + ++ +S
Sbjct: 83 DMLKMDTIDAILVSNYE-SFVG-LPFYTEGSGFSGKIYVTEIAYQYGKLLMEEMLEFISR 140
Query: 104 I 104
I
Sbjct: 141 I 141
Score = 43 (20.2 bits), Expect = 8.9e-10, Sum P(3) = 8.9e-10
Identities = 8/19 (42%), Positives = 12/19 (63%)
Query: 13 RSCIMLEFKNKSIMMDCGI 31
+ C +LE+ N I+MD I
Sbjct: 12 KPCFLLEWPNARILMDTPI 30
>CGD|CAL0004705 [details] [associations]
symbol:orf19.325 species:5476 "Candida albicans" [GO:0042493
"response to drug" evidence=IMP] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IEA]
[GO:0003723 "RNA binding" evidence=IEA] [GO:0006379 "mRNA cleavage"
evidence=IEA] [GO:0006378 "mRNA polyadenylation" evidence=IEA]
InterPro:IPR027075 CGD:CAL0004705 GO:GO:0042493 EMBL:AACQ01000027
EMBL:AACQ01000026 eggNOG:COG1236 KO:K14402 InterPro:IPR022712
InterPro:IPR025069 PANTHER:PTHR11203:SF5 Pfam:PF10996 Pfam:PF13299
SMART:SM01027 RefSeq:XP_720020.1 RefSeq:XP_720152.1
ProteinModelPortal:Q5AEE3 STRING:Q5AEE3 GeneID:3638181
GeneID:3638320 KEGG:cal:CaO19.325 KEGG:cal:CaO19.7957
Uniprot:Q5AEE3
Length = 931
Score = 164 (62.8 bits), Expect = 1.0e-08, Sum P(3) = 1.0e-08
Identities = 74/350 (21%), Positives = 149/350 (42%)
Query: 17 MLEFKNK-SIMMDCGIHPGLSGMDALPFVDLVES-DQIDLLLISHFHLDHCGALPWFLLK 74
+LEF N+ ++ D P +G+D + + E + + +L+SH + +K
Sbjct: 20 LLEFDNEFKLIAD----PSWNGVDVNAAMFMEEHLKETNAILLSHSTAEFISGFILLCIK 75
Query: 75 TGFKGRCFMTHATKAIY---RWLLSDYIKVSNI--STEQMLYTESDLEKSMDKIETINFH 129
++T + R +Y + + + +++ DK+ + +
Sbjct: 76 FPILMSSIPVYSTLPVNQLGRVSTVEYYRAMGFLGPVDSAILELDEVDNWFDKVNLLKYQ 135
Query: 130 EEKDV--NGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAE-IPP--- 183
+ ++ N + + YNAGH LG +LI +++Y ++ +D L +A I P
Sbjct: 136 QSLNLFDNKVVVTPYNAGHSLGGTFWLITKRIDRVIYAPAWNHSKDSFLNSASFISPSTG 195
Query: 184 ------VKPDILITESTYGTHV-HEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQE 236
++P IT + G+ + H +R E+ F L+ + GG ++P GR E
Sbjct: 196 NPHLSLLRPTAFITATDMGSVMSHRKRTEK---FLQLVDATLANGGAAVLPTSLSGRFLE 252
Query: 237 LLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQIS-INN-PFVFK 294
L ++DE+ P IP+Y+ S K ++ ++ M+ ++ +++ PF
Sbjct: 253 LFHLIDEHLKGAP----IPVYFLSYSGTKILTYASNLLDWMSKSFTKEWEELSSVPFNPS 308
Query: 295 HISNLKGIDHFEDI-GPCVVMASPGMMQSG-LSRELFEMWCTDAKNGVII 342
+ L + GP +V S ++SG +S E F+ C D +I+
Sbjct: 309 KVDLLLDPSELLKLSGPKIVFCSGIDLRSGDISAEAFQYLCNDEHTTIIL 358
Score = 52 (23.4 bits), Expect = 1.0e-08, Sum P(3) = 1.0e-08
Identities = 13/58 (22%), Positives = 31/58 (53%)
Query: 356 LSEPEEVIGMS-G-------QRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLV 405
LS P++ +G++ G Q+L ++ + ++ S D + V+ L+P +++L+
Sbjct: 619 LSNPKKRVGLNYGTKKKSETQKLKVRCGLSFVDLSGQVDLRSLGIIVQALKPYNLILL 676
Score = 44 (20.5 bits), Expect = 1.0e-08, Sum P(3) = 1.0e-08
Identities = 24/85 (28%), Positives = 38/85 (44%)
Query: 450 KTAKVMGELAVENLKPDAALSGIIVKR-NFNYHLLAPSDLPKYTDLKASKIIQQQSVYYS 508
K AK+ GEL ++N P A + + N N H L K A K +Q+++
Sbjct: 785 KVAKLYGELELQNQFPAAKKTRTLQDYINSNTHF----SLRKLDGTTAVK--RQETIANQ 838
Query: 509 GSISVLRSLISHLAGPVETLDEKRL 533
+R+LI++ GP + RL
Sbjct: 839 VQDPKIRALITN--GPKLAIGNIRL 861
>UNIPROTKB|Q5AEE3 [details] [associations]
symbol:CFT2 "Putative uncharacterized protein CFT2"
species:237561 "Candida albicans SC5314" [GO:0042493 "response to
drug" evidence=IMP] InterPro:IPR027075 CGD:CAL0004705 GO:GO:0042493
EMBL:AACQ01000027 EMBL:AACQ01000026 eggNOG:COG1236 KO:K14402
InterPro:IPR022712 InterPro:IPR025069 PANTHER:PTHR11203:SF5
Pfam:PF10996 Pfam:PF13299 SMART:SM01027 RefSeq:XP_720020.1
RefSeq:XP_720152.1 ProteinModelPortal:Q5AEE3 STRING:Q5AEE3
GeneID:3638181 GeneID:3638320 KEGG:cal:CaO19.325
KEGG:cal:CaO19.7957 Uniprot:Q5AEE3
Length = 931
Score = 164 (62.8 bits), Expect = 1.0e-08, Sum P(3) = 1.0e-08
Identities = 74/350 (21%), Positives = 149/350 (42%)
Query: 17 MLEFKNK-SIMMDCGIHPGLSGMDALPFVDLVES-DQIDLLLISHFHLDHCGALPWFLLK 74
+LEF N+ ++ D P +G+D + + E + + +L+SH + +K
Sbjct: 20 LLEFDNEFKLIAD----PSWNGVDVNAAMFMEEHLKETNAILLSHSTAEFISGFILLCIK 75
Query: 75 TGFKGRCFMTHATKAIY---RWLLSDYIKVSNI--STEQMLYTESDLEKSMDKIETINFH 129
++T + R +Y + + + +++ DK+ + +
Sbjct: 76 FPILMSSIPVYSTLPVNQLGRVSTVEYYRAMGFLGPVDSAILELDEVDNWFDKVNLLKYQ 135
Query: 130 EEKDV--NGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAE-IPP--- 183
+ ++ N + + YNAGH LG +LI +++Y ++ +D L +A I P
Sbjct: 136 QSLNLFDNKVVVTPYNAGHSLGGTFWLITKRIDRVIYAPAWNHSKDSFLNSASFISPSTG 195
Query: 184 ------VKPDILITESTYGTHV-HEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQE 236
++P IT + G+ + H +R E+ F L+ + GG ++P GR E
Sbjct: 196 NPHLSLLRPTAFITATDMGSVMSHRKRTEK---FLQLVDATLANGGAAVLPTSLSGRFLE 252
Query: 237 LLLILDEYWSLHPELHDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQIS-INN-PFVFK 294
L ++DE+ P IP+Y+ S K ++ ++ M+ ++ +++ PF
Sbjct: 253 LFHLIDEHLKGAP----IPVYFLSYSGTKILTYASNLLDWMSKSFTKEWEELSSVPFNPS 308
Query: 295 HISNLKGIDHFEDI-GPCVVMASPGMMQSG-LSRELFEMWCTDAKNGVII 342
+ L + GP +V S ++SG +S E F+ C D +I+
Sbjct: 309 KVDLLLDPSELLKLSGPKIVFCSGIDLRSGDISAEAFQYLCNDEHTTIIL 358
Score = 52 (23.4 bits), Expect = 1.0e-08, Sum P(3) = 1.0e-08
Identities = 13/58 (22%), Positives = 31/58 (53%)
Query: 356 LSEPEEVIGMS-G-------QRLPLKMSVDYISFSAHTDYQQTSEFVRELRPAHVVLV 405
LS P++ +G++ G Q+L ++ + ++ S D + V+ L+P +++L+
Sbjct: 619 LSNPKKRVGLNYGTKKKSETQKLKVRCGLSFVDLSGQVDLRSLGIIVQALKPYNLILL 676
Score = 44 (20.5 bits), Expect = 1.0e-08, Sum P(3) = 1.0e-08
Identities = 24/85 (28%), Positives = 38/85 (44%)
Query: 450 KTAKVMGELAVENLKPDAALSGIIVKR-NFNYHLLAPSDLPKYTDLKASKIIQQQSVYYS 508
K AK+ GEL ++N P A + + N N H L K A K +Q+++
Sbjct: 785 KVAKLYGELELQNQFPAAKKTRTLQDYINSNTHF----SLRKLDGTTAVK--RQETIANQ 838
Query: 509 GSISVLRSLISHLAGPVETLDEKRL 533
+R+LI++ GP + RL
Sbjct: 839 VQDPKIRALITN--GPKLAIGNIRL 861
>UNIPROTKB|H0YBH8 [details] [associations]
symbol:INTS9 "Integrator complex subunit 9" species:9606
"Homo sapiens" [GO:0016180 "snRNA processing" evidence=IEA]
[GO:0032039 "integrator complex" evidence=IEA] InterPro:IPR027074
PANTHER:PTHR11203:SF2 EMBL:AC040975 EMBL:AC131969 HGNC:HGNC:25592
ChiTaRS:INTS9 Ensembl:ENST00000524081 Uniprot:H0YBH8
Length = 223
Score = 151 (58.2 bits), Expect = 1.7e-08, P = 1.7e-08
Identities = 39/153 (25%), Positives = 79/153 (51%)
Query: 20 FKNKSIMMDCGIHPGLSGMD--ALPFVDLVESDQIDLLLISHFHLDHCGALPWFLLKTGF 77
F +K + +C H + + LP +L++ +D++LIS++H ALP+ TGF
Sbjct: 55 FLDKELK-ECSGHVFVDSVPEFCLPETELIDLSTVDVILISNYHC--MMALPYITEHTGF 111
Query: 78 KGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNG- 136
G + T T I R L S +ST + YT ++ ++ KI+ + + ++ ++ G
Sbjct: 112 TGTVYATEPTVQIGRLLPSPLKDAVEVSTWRRCYTMQEVNSALSKIQLVGYSQKIELFGA 171
Query: 137 IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFS 169
++ + ++G+ LG++ ++I+ K+ Y S
Sbjct: 172 VQVTPLSSGYALGSSNWIIQSHYEKVSYVSGSS 204
>UNIPROTKB|Q87XP2 [details] [associations]
symbol:PSPTO_4134 "Uncharacterized protein" species:223283
"Pseudomonas syringae pv. tomato str. DC3000" [GO:0003674
"molecular_function" evidence=ND] [GO:0005575 "cellular_component"
evidence=ND] [GO:0008150 "biological_process" evidence=ND]
EMBL:AE016853 GenomeReviews:AE016853_GR eggNOG:COG1236
HOGENOM:HOG000035995 OMA:STFGLPI InterPro:IPR026360
TIGRFAMs:TIGR04122 RefSeq:NP_793895.1 ProteinModelPortal:Q87XP2
GeneID:1185814 KEGG:pst:PSPTO_4134 PATRIC:19999765 KO:K07577
ProtClustDB:CLSK2517054 BioCyc:PSYR223283:GJIX-4198-MONOMER
Uniprot:Q87XP2
Length = 348
Score = 127 (49.8 bits), Expect = 6.1e-05, P = 6.1e-05
Identities = 47/166 (28%), Positives = 80/166 (48%)
Query: 80 RCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKSMDKIETINFHEEKDVNGIKF 139
R +THA R Y+ + S E +L S L + ++ ++T+ + E +G+K
Sbjct: 28 RAVITHAHGDHARTGNQHYLSAA--SGEGIL--RSRLGQDIN-LQTLEYGETITHHGVKL 82
Query: 140 SAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHLMAAEIPPVKPDILITESTYGTHV 199
S + AGHVLG+A +E G + +GD+ + D A E PV+ ITEST+G +
Sbjct: 83 SLHPAGHVLGSAQVRLEYEGEVWVASGDYKVEPDGTCAAFE--PVRCQTFITESTFGLPI 140
Query: 200 HE---QREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILD 242
+ Q + EG +G ++ ++ G+AQ +L +D
Sbjct: 141 YRWAPQSQIFEG-INEWWRGNAAQGKASVLFAYSFGKAQRILHGID 185
>UNIPROTKB|E2QVB2 [details] [associations]
symbol:INTS9 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0032039 "integrator complex" evidence=IEA]
[GO:0016180 "snRNA processing" evidence=IEA] InterPro:IPR027074
InterPro:IPR022712 Pfam:PF10996 SMART:SM01027 GO:GO:0032039
GO:GO:0016180 PANTHER:PTHR11203:SF2 Ensembl:ENSCAFT00000013124
Uniprot:E2QVB2
Length = 409
Score = 125 (49.1 bits), Expect = 0.00014, P = 0.00014
Identities = 87/371 (23%), Positives = 144/371 (38%)
Query: 208 GRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHDIPIYYASSLAKKCM 267
G F S + V GG L+P + G +LL L +Y L +IP Y+ S +A +
Sbjct: 30 GEFCSNLALTVRNGGNVLVPCYPSGVIYDLLECLYQYID-SAGLSNIPFYFISPVANSSL 88
Query: 268 SVYQTYINAMNDRIRRQISINNP-FV---------FKHISNLKGIDHFEDIG-PCVVMAS 316
Q + + + ++ + P F KH +L G D D PCVV
Sbjct: 89 EFSQIFAEWLCHNKQTKVYLPEPPFPHAELIQTNKLKHYPSLHG-DFSSDFRQPCVVFTG 147
Query: 317 PGMMQSGLSRELFEMWCTDAKNGVIIAGYCVEGTLAKTILSEPE-EVIGMSGQRLPLKMS 375
++ G E+W + N VI +EP+ + PL M
Sbjct: 148 HPSLRFGDVVHFMELWGKSSLNTVIF--------------TEPDFSYLEALAPYQPLAMK 193
Query: 376 VDYISFSAHTDYQQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMELYN 435
Y ++ Q S+ ++E++P HVV EQ + + D Y
Sbjct: 194 CIYCPIDTRLNFIQVSKLLKEVQPLHVVCP--EQYTQPPPAQSHRMDLMIDCQPPAMSYR 251
Query: 436 PRNTVSVDLYFKGEKTAKVMGELAVE----NLKPDAALSGI--IVKRNFNYHLLAPSDLP 489
+++ + EK ++M ELA +KP +L+ + ++ N H+L P P
Sbjct: 252 RAEVLALPFKRRYEKI-EIMPELADALVPMEIKPGISLATVSAVLHTKDNKHVLQPP--P 308
Query: 490 KYTDLKASKIIQQQSVYYSGSISVLRSLISHLAGPVETLDEK-RLRAFACIEI--TLEKC 546
+ T K ++ S VL+ L+S PVE + F+ I++ T +
Sbjct: 309 RPTQPTGGKKRKRASDDIP-DCKVLKPLLSGSI-PVEQFVQTLEKHGFSDIKVEDTAKGH 366
Query: 547 IVVLEWASNPI 557
IV+L+ A I
Sbjct: 367 IVLLQEAETLI 377
>TAIR|locus:2079696 [details] [associations]
symbol:AT3G07530 "AT3G07530" species:3702 "Arabidopsis
thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
"nucleus" evidence=ISM] [GO:0008150 "biological_process"
evidence=ND] InterPro:IPR027074 EMBL:CP002686 InterPro:IPR022712
Pfam:PF10996 SMART:SM01027 KO:K13146 PANTHER:PTHR11203:SF2
IPI:IPI00520313 RefSeq:NP_187409.2 UniGene:At.53215
ProteinModelPortal:F4JEH2 PRIDE:F4JEH2 EnsemblPlants:AT3G07530.1
GeneID:819942 KEGG:ath:AT3G07530 OMA:CYNGTLI Uniprot:F4JEH2
Length = 699
Score = 80 (33.2 bits), Expect = 0.00019, Sum P(3) = 0.00019
Identities = 38/167 (22%), Positives = 71/167 (42%)
Query: 192 ESTYGTHVHEQREEREGRFTSLIHDIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPEL 251
+S T + E+ S + + GG LI + +G +LL +L SL
Sbjct: 315 DSLLNTEDSLEEMEKLAFVCSCAAESADAGGSTLITITRIGIVLQLLELLSN--SLESSS 372
Query: 252 HDIPIYYASSLAKKCMSVYQTYINAMNDRIRRQISINNPFVFKHISNLKG--IDHFEDIG 309
+PI+ SS+A++ ++ T + ++ + ++ P F H+ +K I F I
Sbjct: 373 LKVPIFVISSVAEELLAYTNTIPEWLCEQRQEKLISGEPS-FGHLKFIKNKKIHLFPAIH 431
Query: 310 --------------PCVVMASPGMMQSGLSRELFEMWCTDAKNGVII 342
PC+V AS ++ G S +L + W D K+ +++
Sbjct: 432 SPNLIYANRTSWQEPCIVFASHWSLRLGPSVQLLQRWRGDPKSLLVL 478
Score = 78 (32.5 bits), Expect = 0.00019, Sum P(3) = 0.00019
Identities = 25/83 (30%), Positives = 40/83 (48%)
Query: 110 LYTESDLEKSMDKIETINFHEEKDVNG-IKFSAYNAGHVLGAAMFLIEIAGVKILYTGDF 168
LY+ D+E M K++ + F EE NG + A ++G +GA +LI + Y D
Sbjct: 199 LYSLDDIESCMKKVQGVKFAEEVCYNGTLIIKALSSGLDIGACNWLINGPNGSLSYVSD- 257
Query: 169 SRQEDRHLMAAEIPPVKP-DILI 190
S H + + +K D+LI
Sbjct: 258 SIFVSHHARSFDFHGLKETDVLI 280
Score = 60 (26.2 bits), Expect = 0.00019, Sum P(3) = 0.00019
Identities = 17/56 (30%), Positives = 30/56 (53%)
Query: 46 LVESDQIDLLLISHFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKV 101
L E+ ID++LIS+ + G LP+ GF + +MT T I + ++ D + +
Sbjct: 97 LWEASFIDIVLISN-PMGLLG-LPFLTQNPGFFAKIYMTEVTAKIGQLMMEDIVSM 150
>UNIPROTKB|Q81SK8 [details] [associations]
symbol:BA_1640 "Ribonuclease J" species:1392 "Bacillus
anthracis" [GO:0008150 "biological_process" evidence=ND]
InterPro:IPR001279 InterPro:IPR001587 InterPro:IPR004613
Pfam:PF00753 PIRSF:PIRSF004803 PROSITE:PS01292 SMART:SM00849
Pfam:PF07521 GO:GO:0046872 EMBL:AE016879 GenomeReviews:AE016879_GR
GO:GO:0003723 GO:GO:0016788 InterPro:IPR011108 HOGENOM:HOG000280201
KO:K12574 PANTHER:PTHR11203:SF22 TIGRFAMs:TIGR00649
RefSeq:NP_844087.1 ProteinModelPortal:Q81SK8 DNASU:1086943
EnsemblBacteria:EBBACT00000008733 GeneID:1086943 KEGG:ban:BA_1640
PATRIC:18780866 ProtClustDB:CLSK916310 Uniprot:Q81SK8
Length = 549
Score = 130 (50.8 bits), Expect = 0.00022, Sum P(2) = 0.00022
Identities = 61/245 (24%), Positives = 112/245 (45%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIH-P--GLSGMDAL-PFVDLVES--DQIDLLLISH 59
G E+G++ ++++N +++DCG P L G+D + P V ++ ++I L+++H
Sbjct: 14 GGVNEIGKNMYAIQYENDIVVIDCGSKFPDESLLGIDLIIPDVTYLQENKEKIRGLVVTH 73
Query: 60 FHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKS 119
H DH G +P+FL + + T T + L ++ + N + ++++ES+++
Sbjct: 74 GHEDHIGGIPYFLKQLNVP--IYATRLTLGLIEIKLKEH-NLQNDTELIVIHSESEID-- 128
Query: 120 MDKIETINF---HEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHL 176
I+T F H D GI F G V+ F ++ V ++Q D H
Sbjct: 129 FGSIKTTFFKTNHSIPDCLGIAFHTPE-GTVVHTGDFKFDLTPVN-------NQQPDIHK 180
Query: 177 MAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGR-CLIPVFA--LGR 233
MA +I L++EST ER I +I + R +I FA + R
Sbjct: 181 MA-KIGSEGVLALLSESTNAERPGFTPSERS--VGERIEEIFMKANRKVIISTFASNVNR 237
Query: 234 AQELL 238
Q+++
Sbjct: 238 VQQIV 242
Score = 40 (19.1 bits), Expect = 0.00022, Sum P(2) = 0.00022
Identities = 9/31 (29%), Positives = 17/31 (54%)
Query: 379 ISFSAHTDYQQTSEFVREL-RPAHVVLVHGE 408
+ S H YQ+ + + L +P + + +HGE
Sbjct: 365 VHVSGHA-YQEELKLMLALMKPKYFIPIHGE 394
Score = 39 (18.8 bits), Expect = 0.00028, Sum P(2) = 0.00028
Identities = 9/38 (23%), Positives = 18/38 (47%)
Query: 388 QQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYED 425
+ + EF++EL V+ ++ + E L RE +
Sbjct: 497 RDSEEFLKELNKLAVITINNLKKEKVNSWGILKREVRE 534
>TIGR_CMR|BA_1640 [details] [associations]
symbol:BA_1640 "metallo-beta-lactamase family protein"
species:198094 "Bacillus anthracis str. Ames" [GO:0003824
"catalytic activity" evidence=ISS] [GO:0008150 "biological_process"
evidence=ND] InterPro:IPR001279 InterPro:IPR001587
InterPro:IPR004613 Pfam:PF00753 PIRSF:PIRSF004803 PROSITE:PS01292
SMART:SM00849 Pfam:PF07521 GO:GO:0046872 EMBL:AE016879
GenomeReviews:AE016879_GR GO:GO:0003723 GO:GO:0016788
InterPro:IPR011108 HOGENOM:HOG000280201 KO:K12574
PANTHER:PTHR11203:SF22 TIGRFAMs:TIGR00649 RefSeq:NP_844087.1
ProteinModelPortal:Q81SK8 DNASU:1086943
EnsemblBacteria:EBBACT00000008733 GeneID:1086943 KEGG:ban:BA_1640
PATRIC:18780866 ProtClustDB:CLSK916310 Uniprot:Q81SK8
Length = 549
Score = 130 (50.8 bits), Expect = 0.00022, Sum P(2) = 0.00022
Identities = 61/245 (24%), Positives = 112/245 (45%)
Query: 6 GAGQEVGRSCIMLEFKNKSIMMDCGIH-P--GLSGMDAL-PFVDLVES--DQIDLLLISH 59
G E+G++ ++++N +++DCG P L G+D + P V ++ ++I L+++H
Sbjct: 14 GGVNEIGKNMYAIQYENDIVVIDCGSKFPDESLLGIDLIIPDVTYLQENKEKIRGLVVTH 73
Query: 60 FHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQMLYTESDLEKS 119
H DH G +P+FL + + T T + L ++ + N + ++++ES+++
Sbjct: 74 GHEDHIGGIPYFLKQLNVP--IYATRLTLGLIEIKLKEH-NLQNDTELIVIHSESEID-- 128
Query: 120 MDKIETINF---HEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHL 176
I+T F H D GI F G V+ F ++ V ++Q D H
Sbjct: 129 FGSIKTTFFKTNHSIPDCLGIAFHTPE-GTVVHTGDFKFDLTPVN-------NQQPDIHK 180
Query: 177 MAAEIPPVKPDILITESTYGTHVHEQREEREGRFTSLIHDIVNRGGR-CLIPVFA--LGR 233
MA +I L++EST ER I +I + R +I FA + R
Sbjct: 181 MA-KIGSEGVLALLSESTNAERPGFTPSERS--VGERIEEIFMKANRKVIISTFASNVNR 237
Query: 234 AQELL 238
Q+++
Sbjct: 238 VQQIV 242
Score = 40 (19.1 bits), Expect = 0.00022, Sum P(2) = 0.00022
Identities = 9/31 (29%), Positives = 17/31 (54%)
Query: 379 ISFSAHTDYQQTSEFVREL-RPAHVVLVHGE 408
+ S H YQ+ + + L +P + + +HGE
Sbjct: 365 VHVSGHA-YQEELKLMLALMKPKYFIPIHGE 394
Score = 39 (18.8 bits), Expect = 0.00028, Sum P(2) = 0.00028
Identities = 9/38 (23%), Positives = 18/38 (47%)
Query: 388 QQTSEFVRELRPAHVVLVHGEQNEMSRLKAALTREYED 425
+ + EF++EL V+ ++ + E L RE +
Sbjct: 497 RDSEEFLKELNKLAVITINNLKKEKVNSWGILKREVRE 534
>TIGR_CMR|CHY_1157 [details] [associations]
symbol:CHY_1157 "metallo-beta-lactamase family protein"
species:246194 "Carboxydothermus hydrogenoformans Z-2901"
[GO:0003824 "catalytic activity" evidence=ISS] [GO:0008152
"metabolic process" evidence=ISS] InterPro:IPR001279
InterPro:IPR004613 Pfam:PF00753 PIRSF:PIRSF004803 SMART:SM00849
Pfam:PF07521 GO:GO:0046872 EMBL:CP000141 GenomeReviews:CP000141_GR
GO:GO:0003723 GO:GO:0016788 InterPro:IPR011108 eggNOG:COG0595
HOGENOM:HOG000280201 KO:K12574 PANTHER:PTHR11203:SF22
TIGRFAMs:TIGR00649 RefSeq:YP_360002.1 ProteinModelPortal:Q3ACY2
STRING:Q3ACY2 GeneID:3726430 KEGG:chy:CHY_1157 PATRIC:21275454
OMA:FLVDSTN BioCyc:CHYD246194:GJCN-1156-MONOMER Uniprot:Q3ACY2
Length = 554
Score = 113 (44.8 bits), Expect = 0.00022, Sum P(2) = 0.00022
Identities = 60/231 (25%), Positives = 102/231 (44%)
Query: 4 LKGAGQEVGRSCIMLEFKNKSIMMDCGI---HPGLSGMD-ALPFVD-LVES-DQIDLLLI 57
L G G E+G++ +++++ + I++D G+ L G+D +P + L+E+ +++ +L+
Sbjct: 13 LGGLG-EIGKNMMVIKYNDAIIVIDAGLMFPEEELLGIDMVIPDMSYLIENKEKVKAVLL 71
Query: 58 SHFHLDHCGALPWFLLKTGFKGRCFMTHATKAIYRWLLSDYIKVSNISTEQM-LYTESD- 115
+H H DH G +P+FL + F + T T LLS +K + I + + D
Sbjct: 72 THGHEDHIGGMPYFLKQ--FDVPVYGTRLTLG----LLSAKLKEAGIPRASLNVVAPRDV 125
Query: 116 LEKSMDKIETINF-HEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQE-- 172
L KIE I H D GI G V+ F ++ V T + E
Sbjct: 126 LNIGPFKIEFIKVSHSIPDTVGIAVHT-PVGTVVHTGDFKLDPTPVDGKVTDFYKLAELG 184
Query: 173 DRH---LMAAEIPPVKPDILITESTYGTHVHEQREEREGR-----FTSLIH 215
++ LM+ +P ++E T G E EGR F S +H
Sbjct: 185 EKGVLVLMSDSTNAERPGFTLSEKTVGNTFEETFRVAEGRIIIATFASNVH 235
Score = 58 (25.5 bits), Expect = 0.00022, Sum P(2) = 0.00022
Identities = 22/100 (22%), Positives = 44/100 (44%)
Query: 336 AKNGVIIAGYCVEGTLAKTILSEPEEVIGMSGQRLPLKMSVDYISFSAHTDYQQTSEFVR 395
A + VII+ + G + ++S + + G ++ + +V I S H ++ +
Sbjct: 322 AGDTVIISAMPIPGN--EKLVSRIIDQLFKLGAKV-IYEAVSGIHVSGHPSQEELKLMIN 378
Query: 396 ELRPAHVVLVHGEQNEMSRLKAALTREYEDDPNTSMELYN 435
L+P + V +HGE + + A + RE P + N
Sbjct: 379 LLKPKYFVPIHGEYRHLIK-HAEIARELGIKPQNIFVVEN 417
>UNIPROTKB|Q83DU6 [details] [associations]
symbol:CBU_0596 "Metal-dependent hydrolase" species:227377
"Coxiella burnetii RSA 493" [GO:0003674 "molecular_function"
evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
[GO:0008150 "biological_process" evidence=ND] InterPro:IPR001279
SMART:SM00849 GO:GO:0016787 EMBL:AE016828 GenomeReviews:AE016828_GR
RefSeq:NP_819626.1 ProteinModelPortal:Q83DU6 GeneID:1208481
KEGG:cbu:CBU_0596 PATRIC:17929885 HOGENOM:HOG000279110 OMA:IFHDCET
ProtClustDB:CLSK892609 BioCyc:CBUR227377:GJ7S-597-MONOMER
Uniprot:Q83DU6
Length = 249
Score = 114 (45.2 bits), Expect = 0.00081, P = 0.00081
Identities = 54/204 (26%), Positives = 88/204 (43%)
Query: 13 RSCIMLEFKN-KSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHCGALPWF 71
+S I+LE N K +++DCG +L V+L +D ID + +SHFH DH G L W
Sbjct: 15 QSNILLENSNHKRLLIDCGT----DAHHSLKNVNLRYAD-IDSVYVSHFHFDHVGGLEWL 69
Query: 72 LLKTGF-----KGRCFMTHATKAIYRW--LLS---DYIKVSNISTEQMLYTESDL-EKSM 120
F K + F+ H + W +LS +K + +T + +T + + E+
Sbjct: 70 AFSAYFDPAVKKPKLFI-HPSMLNILWDHVLSGGLQSLKGESPATLETYFTLAPIREEKY 128
Query: 121 DKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHL-MAA 179
E+INF K ++ +N +L + + KI T D R+
Sbjct: 129 FTWESINFEMVKTIH-----VHNGKLLLPSYGLFFSLEKTKIFITTDTQFFPHRYADYYR 183
Query: 180 EIPPVKPDILITESTYGTHVHEQR 203
E + D I ++ G H H Q+
Sbjct: 184 EADLIFHDCEIDKTKTGVHAHFQQ 207
>TIGR_CMR|CBU_0596 [details] [associations]
symbol:CBU_0596 "conserved hypothetical protein"
species:227377 "Coxiella burnetii RSA 493" [GO:0008150
"biological_process" evidence=ND] [GO:0003674 "molecular_function"
evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
InterPro:IPR001279 SMART:SM00849 GO:GO:0016787 EMBL:AE016828
GenomeReviews:AE016828_GR RefSeq:NP_819626.1
ProteinModelPortal:Q83DU6 GeneID:1208481 KEGG:cbu:CBU_0596
PATRIC:17929885 HOGENOM:HOG000279110 OMA:IFHDCET
ProtClustDB:CLSK892609 BioCyc:CBUR227377:GJ7S-597-MONOMER
Uniprot:Q83DU6
Length = 249
Score = 114 (45.2 bits), Expect = 0.00081, P = 0.00081
Identities = 54/204 (26%), Positives = 88/204 (43%)
Query: 13 RSCIMLEFKN-KSIMMDCGIHPGLSGMDALPFVDLVESDQIDLLLISHFHLDHCGALPWF 71
+S I+LE N K +++DCG +L V+L +D ID + +SHFH DH G L W
Sbjct: 15 QSNILLENSNHKRLLIDCGT----DAHHSLKNVNLRYAD-IDSVYVSHFHFDHVGGLEWL 69
Query: 72 LLKTGF-----KGRCFMTHATKAIYRW--LLS---DYIKVSNISTEQMLYTESDL-EKSM 120
F K + F+ H + W +LS +K + +T + +T + + E+
Sbjct: 70 AFSAYFDPAVKKPKLFI-HPSMLNILWDHVLSGGLQSLKGESPATLETYFTLAPIREEKY 128
Query: 121 DKIETINFHEEKDVNGIKFSAYNAGHVLGAAMFLIEIAGVKILYTGDFSRQEDRHL-MAA 179
E+INF K ++ +N +L + + KI T D R+
Sbjct: 129 FTWESINFEMVKTIH-----VHNGKLLLPSYGLFFSLEKTKIFITTDTQFFPHRYADYYR 183
Query: 180 EIPPVKPDILITESTYGTHVHEQR 203
E + D I ++ G H H Q+
Sbjct: 184 EADLIFHDCEIDKTKTGVHAHFQQ 207
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.320 0.136 0.400 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 622 622 0.00090 120 3 11 22 0.38 34
36 0.45 37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 97
No. of states in DFA: 619 (66 KB)
Total size of DFA: 340 KB (2172 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 51.56u 0.09s 51.65t Elapsed: 00:00:07
Total cpu time: 51.61u 0.09s 51.70t Elapsed: 00:00:07
Start: Thu Aug 15 17:02:32 2013 End: Thu Aug 15 17:02:39 2013