Your job contains 1 sequence.
>001853
MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV
TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS
QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG
PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD
LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS
AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF
SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN
SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN
GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYE
LVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT
ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENST
VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEP
WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF
VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF
LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY
TREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVL
HNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVVFFLYF
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 001853
(1004 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
Sequences producing High-scoring Segment Pairs: High Score, Smallest Sum Probability P(N), N
TAIR|locus:2153122 - symbol:CPSF160 "cleavage and polyade... 2091 0. 2
ZFIN|ZDB-GENE-040709-2 - symbol:cpsf1 "cleavage and polya... 648 7.2e-97 3
UNIPROTKB|Q10569 - symbol:CPSF1 "Cleavage and polyadenyla... 651 6.8e-96 4
RGD|1306406 - symbol:Cpsf1 "cleavage and polyadenylation ... 652 7.5e-95 4
UNIPROTKB|F1PC28 - symbol:CPSF1 "Uncharacterized protein"... 640 8.3e-95 3
MGI|MGI:2679722 - symbol:Cpsf1 "cleavage and polyadenylat... 648 9.3e-95 4
UNIPROTKB|Q10570 - symbol:CPSF1 "Cleavage and polyadenyla... 651 2.5e-94 4
FB|FBgn0024698 - symbol:Cpsf160 "Cleavage and polyadenyla... 566 2.3e-79 4
DICTYBASE|DDB_G0281585 - symbol:cpsf1 "cleavage and polya... 413 8.5e-65 6
UNIPROTKB|F1RSN8 - symbol:CPSF1 "Uncharacterized protein"... 466 3.9e-61 4
WB|WBGene00022301 - symbol:cpsf-1 species:6239 "Caenorhab... 431 4.2e-54 3
UNIPROTKB|Q9N4C2 - symbol:cpsf-1 "Probable cleavage and p... 431 4.2e-54 3
UNIPROTKB|J9P418 - symbol:CPSF1 "Uncharacterized protein"... 250 7.3e-43 3
POMBASE|SPBC1709.08 - symbol:cft1 "cleavage factor one Cf... 268 1.3e-25 2
CGD|CAL0004251 - symbol:orf19.2760 species:5476 "Candida ... 224 8.7e-18 3
UNIPROTKB|Q5AFT3 - symbol:CFT1 "Protein CFT1" species:237... 224 8.7e-18 3
UNIPROTKB|K7GNU1 - symbol:CPSF1 "Uncharacterized protein"... 197 3.0e-16 2
ASPGD|ASPL0000050546 - symbol:AN1413 species:162425 "Emer... 209 1.1e-12 1
FB|FBgn0260962 - symbol:pic "piccolo" species:7227 "Droso... 141 1.1e-05 4
TAIR|locus:2127368 - symbol:DDB1B "damaged DNA binding pr... 100 1.4e-05 3
TAIR|locus:2115909 - symbol:DDB1A "damaged DNA binding pr... 91 4.8e-05 3
SGD|S000002709 - symbol:CFT1 "RNA-binding subunit of the ... 91 4.9e-05 3
ZFIN|ZDB-GENE-040426-1272 - symbol:ddb1 "damage specific ... 116 0.00014 3
>TAIR|locus:2153122
symbol:CPSF160 "cleavage and polyadenylation specificity
factor 160" species:3702 "Arabidopsis thaliana" [GO:0003676
"nucleic acid binding" evidence=IEA] [GO:0005634 "nucleus"
evidence=ISM;IEA;IDA] [GO:0006378 "mRNA polyadenylation"
evidence=ISS] [GO:0006379 "mRNA cleavage" evidence=ISS] [GO:0005515
"protein binding" evidence=IPI] [GO:0005829 "cytosol" evidence=IDA]
[GO:0006397 "mRNA processing" evidence=RCA] [GO:0009909 "regulation
of flower development" evidence=RCA] [GO:0016570 "histone
modification" evidence=RCA] [GO:0048449 "floral organ formation"
evidence=RCA] InterPro:IPR004871 Pfam:PF03178 GO:GO:0005829
GO:GO:0005634 EMBL:CP002688 GenomeReviews:BA000015_GR GO:GO:0006397
GO:GO:0003723 eggNOG:COG5161 KO:K14401 EMBL:AY140902 EMBL:AB025607
EMBL:AB018109 IPI:IPI00533913 RefSeq:NP_199979.2 UniGene:At.43551
IntAct:Q9FGR0 STRING:Q9FGR0 PaxDb:Q9FGR0 PRIDE:Q9FGR0
EnsemblPlants:AT5G51660.1 GeneID:835240 KEGG:ath:AT5G51660
TAIR:At5g51660 HOGENOM:HOG000265012 InParanoid:Q9FGR0 OMA:NIGDNRY
PhylomeDB:Q9FGR0 ProtClustDB:CLSN2680511 Genevestigator:Q9FGR0
GermOnline:AT5G51660 Uniprot:Q9FGR0
Length = 1442
Score = 2091 (741.1 bits), Expect = 0., Sum P(2) = 0.
Identities = 396/545 (72%), Positives = 464/545 (85%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQT-EELDSELPS-KRGIGPVPNL 58
MSFAAYKMMHWPTG+ NC SG+ITHS +D QIP++ +++++E P+ KRGIGP+PN+
Sbjct: 1 MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNV 60
Query: 59 VVTAANVIEIYVVRVQXXXXXXXXXX-XXTKRRVLMDGISAASLELVCHYRLHGNVESLA 117
V+TAAN++E+Y+VR Q KR +MDG+ SLELVCHYRLHGNVES+A
Sbjct: 61 VITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESIA 120
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+L GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121 VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
RGPLVKVDPQGRCGGVLVYGLQMIILK SQ GSGLVGD+D F SGG SAR+ESS++IN
Sbjct: 181 PRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240
Query: 238 LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI++TLKQHP+
Sbjct: 241 LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHPV 300
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP
Sbjct: 301 IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
S+FSVELDAAH TW+ NDVALLSTK+G+L+LLT++YDGR VQRLDLSK+ SVL SDIT+
Sbjct: 361 SNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
+GNSLFFLGSRLGDSLLVQF+C SG + GL++E DIE + KRLR +S D QD
Sbjct: 421 VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRMTS-DTFQD 479
Query: 478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
+ EELSL+GS NN++SAQK+FSFAVRDSLVN+GP+KDF+YGLRINADA+ATG+SKQS
Sbjct: 480 TIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539
Query: 538 NYELV 542
NYELV
Sbjct: 540 NYELV 544
Score = 1725 (612.3 bits), Expect = 0., Sum P(2) = 0.
Identities = 329/457 (71%), Positives = 367/457 (80%)
Query: 542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
VELPGCKGIWTVYHKSSRGHNADSS+MAA +DEYHAYLIISLEARTMVLETADLLTEVTE
Sbjct: 570 VELPGCKGIWTVYHKSSRGHNADSSKMAADEDEYHAYLIISLEARTMVLETADLLTEVTE 629
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXXXXXXXXXXXTV 661
SVDY+VQGRTIAAGNLFGRRRVIQVFE GARILDGS+M Q+LSFG TV
Sbjct: 630 SVDYYVQGRTIAAGNLFGRRRVIQVFEHGARILDGSFMNQELSFGASNSESNSGSESSTV 689
Query: 662 LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPW 721
SVSIADPYVLL M+D SIRLLVGDPSTCTVS+ +P+ +E SK+ +S+CTLYHDKGPEPW
Sbjct: 690 SSVSIADPYVLLRMTDDSIRLLVGDPSTCTVSISSPSVLEGSKRKISACTLYHDKGPEPW 749
Query: 722 LRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFV 781
LRK STDAWLS+GVGEA+D DGGP DQGDIY VVCYESGALEIFDVP+FNCVF+VDKF
Sbjct: 750 LRKASTDAWLSSGVGEAVDSVDGGPQDQGDIYCVVCYESGALEIFDVPSFNCVFSVDKFA 809
Query: 782 SGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFL 841
SGR H+ D + E E E+N +SE+ T KE I + +VVELAMQRWS HH+RPFL
Sbjct: 810 SGRRHLSDMPIHEL----EYELNKNSEDNTSS--KE-IKNTRVVELAMQRWSGHHTRPFL 862
Query: 842 FAILTDGTILCYQAYLFEGPENTSKSDDPXXXXXXXXXXXXXXXXXXXLRFSRTPLDAYT 901
FA+L DGTILCY AYLF+G ++T K+++ L+F R PLD T
Sbjct: 863 FAVLADGTILCYHAYLFDGVDST-KAENSLSSENPAALNSSGSSKLRNLKFLRIPLDTST 921
Query: 902 REETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLH 961
RE T G QRIT+FKNISGHQGFFLSGSRP WCM+FRERLR H QLCDGSI AFTVLH
Sbjct: 922 REGTSDGVASQRITMFKNISGHQGFFLSGSRPGWCMLFRERLRFHSQLCDGSIAAFTVLH 981
Query: 962 NVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
NVNCNHGFIYVT+QG+LKICQLPS S YDNYWPVQK+
Sbjct: 982 NVNCNHGFIYVTAQGVLKICQLPSASIYDNYWPVQKI 1018
>ZFIN|ZDB-GENE-040709-2
symbol:cpsf1 "cleavage and polyadenylation specific
factor 1" species:7955 "Danio rerio" [GO:0003676 "nucleic acid
binding" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
[GO:0006378 "mRNA polyadenylation" evidence=IMP] [GO:0060216
"definitive hemopoiesis" evidence=IMP] InterPro:IPR004871
Pfam:PF03178 ZFIN:ZDB-GENE-040709-2 GO:GO:0005634 GO:GO:0006378
GO:GO:0003676 GeneTree:ENSGT00550000075040 GO:GO:0060216
EMBL:CU467825 IPI:IPI00932321 Ensembl:ENSDART00000110017
ArrayExpress:F1QCJ8 Bgee:F1QCJ8 Uniprot:F1QCJ8
Length = 1451
Score = 648 (233.2 bits), Expect = 7.2e-97, Sum P(3) = 7.2e-97
Identities = 169/478 (35%), Positives = 265/478 (55%)
Query: 57 NLVVTAANVIEIYVVRVQXXXXXXXXXXXXTKRRVLMDGISAAS-LELVCHYRLHGNVES 115
NLVV A ++YV R+ +K DG S LE V + L GNV S
Sbjct: 29 NLVV--AGTSQLYVYRI------IYDVESTSKSEKSSDGKSRKEKLEQVASFSLFGNVMS 80
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
+A + G + RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 81 MASVQLVGTN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFV 133
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHV 235
P+V+VDP+ RC +LVYG +++L + + DE G G + S++
Sbjct: 134 QNVHIPMVRVDPENRCAVMLVYGTCLVVLPFRKD---TLADEQEGIVGEGQKSSFLPSYI 190
Query: 236 INLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
I++R+LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ K
Sbjct: 191 IDVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQK 250
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSS 352
HP+IWS NLP D +++AVP PIGGV+V N++ Y +QS ++LN+ +
Sbjct: 251 VHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLLYLNQSVPPFGVSLNSLTNGTTAF 310
Query: 353 QELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVL 411
P+ + LD + A+++ +D ++S K G++ +LT++ DG R V+ K SVL
Sbjct: 311 PLRPQEEVKITLDCSQASFITSDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVL 370
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS 471
T+ + T+ FLGSRLG+SLL+++T + + G + E + + + P+ K+ R S
Sbjct: 371 TTCMMTMEPGYLFLGSRLGNSLLLRYTEKLQETPMEEGKENEEKEKQEEPPNKKK-RVDS 429
Query: 472 SDA-------LQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ A L D + +E+ +YGS A + T+ A T+SF V DS++NIGP S G
Sbjct: 430 NWAGCPGKGNLPDEL--DEIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCASASMG 483
Score = 275 (101.9 bits), Expect = 7.2e-97, Sum P(3) = 7.2e-97
Identities = 81/289 (28%), Positives = 134/289 (46%)
Query: 712 LYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGP-LDQGDIYSVVCYESGALEIFDVPN 770
LY + P K + S A G + G + + ++ E+G +EI+ +P+
Sbjct: 751 LYGESNPLTSPNKEESSRG-SAAASSAHTGKESGSGRQEPSHWCLLVRENGVMEIYQLPD 809
Query: 771 FNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQ 830
+ VF V F G+ +VD+ + S T+ EE T QG +I +K E+A+
Sbjct: 810 WRLVFLVKNFPVGQRVLVDS----SASQSATQGELKKEEVTRQG---DIPLVK--EVALV 860
Query: 831 RWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPXXXXXXXXXXXXXXXXXXXL 890
+HSRP+L A + + +L Y+A+ ++ + S +
Sbjct: 861 SLGYNHSRPYLLAHV-EQELLIYEAFPYDQQQAQSNLK---VRFKKMPHNINYREKKVKV 916
Query: 891 RFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQL 949
R + P + + R F++ISG+ G F+ G P W +V R +R+HP
Sbjct: 917 RKDKKP-EGQGEDTLGVKGRVARFRYFQDISGYSGVFICGPSPHWMLVTSRGAMRLHPMT 975
Query: 950 CDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
DG+I +F+ HN+NC GF+Y QG L+I LP+ +YD WPV+K+
Sbjct: 976 IDGAIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKI 1024
Score = 162 (62.1 bits), Expect = 7.2e-97, Sum P(3) = 7.2e-97
Identities = 54/190 (28%), Positives = 90/190 (47%)
Query: 543 ELPGCKGIWTVYH-------KSSRGHNA---DSSRMAAYDDEY--HAYLIISLEARTMVL 590
ELPGC +WTV + S+ G + R +D+ H +LI+S E TM+L
Sbjct: 530 ELPGCHDMWTVIYCEEKPEKPSAEGDGESPEEEKREPTIEDDKKKHGFLILSREDSTMIL 589
Query: 591 ETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXX 650
+T + E+ S + QG T+ AGN+ + +IQV G R+L+G L F P
Sbjct: 590 QTGQEIMELDTS-GFATQGPTVYAGNIGDNKYIIQVSPMGIRLLEG---VNQLHFIPVDL 645
Query: 651 XXXXXXXXXTVLSVSIADPYVLLGMSDGSIRLLV--GDP---STCTVSVQTPAAIESSKK 705
++ S+ADPYV++ ++G + + V D + +++Q P I + +
Sbjct: 646 GS-------PIVHCSVADPYVVIMTAEGVVTMFVLKNDSYMGKSHRLALQKPQ-IHTQSR 697
Query: 706 PVSSCTLYHD 715
++ C Y D
Sbjct: 698 VITLCA-YRD 706
Score = 48 (22.0 bits), Expect = 4.7e-19, Sum P(2) = 4.7e-19
Identities = 15/49 (30%), Positives = 26/49 (53%)
Query: 283 ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY 331
I L S Q P +++ N+ + Y ++ V SP+G L+ G N +H+
Sbjct: 595 IMELDTSGFATQGPTVYAG-NIGDNKY-IIQV-SPMGIRLLEGVNQLHF 640
Score = 39 (18.8 bits), Expect = 1.8e-59, Sum P(2) = 1.8e-59
Identities = 9/30 (30%), Positives = 17/30 (56%)
Query: 515 LKDFSYGLRINADASATGISKQSNYELVEL 544
+K+F G R+ D+SA+ + Q + E+
Sbjct: 816 VKNFPVGQRVLVDSSASQSATQGELKKEEV 845
>UNIPROTKB|Q10569
symbol:CPSF1 "Cleavage and polyadenylation specificity
factor subunit 1" species:9913 "Bos taurus" [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=ISS] [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006378
"mRNA polyadenylation" evidence=IEA] [GO:0003730 "mRNA 3'-UTR
binding" evidence=IEA] InterPro:IPR004871 Pfam:PF03178
GO:GO:0006378 GO:GO:0003730 eggNOG:COG5161 KO:K14401 GO:GO:0005847
GO:GO:0006379 GeneTree:ENSGT00550000075040 OMA:NIGDNRY EMBL:X83097
IPI:IPI00713487 PIR:S57335 RefSeq:NP_777145.1 UniGene:Bt.4911
STRING:Q10569 PRIDE:Q10569 Ensembl:ENSBTAT00000011004 GeneID:282703
KEGG:bta:282703 CTD:29894 HOGENOM:HOG000007904 HOVERGEN:HBG051105
InParanoid:Q10569 OrthoDB:EOG4BCDM3 NextBio:20806363
ArrayExpress:Q10569 Uniprot:Q10569
Length = 1444
Score = 651 (234.2 bits), Expect = 6.8e-96, Sum P(4) = 6.8e-96
Identities = 167/475 (35%), Positives = 255/475 (53%)
Query: 57 NLVVTAANVIEIYVVRVQXXXXXXXXXXXXTKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ T + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDSEAPTKNDRSTDGKAHRE--HREKLELVASFSFFGNVMSM 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 85 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLVGEGQRSSFLPSYII 194
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 195 DVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 254
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 255 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 314
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 315 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 374
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T S+ E D E KR+ +
Sbjct: 375 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA--REAADKEEPPSKKKRVDATTG 432
Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
S QD V +E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 433 WSGSKSVPQDEV--DEIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMG 483
Score = 255 (94.8 bits), Expect = 6.8e-96, Sum P(4) = 6.8e-96
Identities = 72/242 (29%), Positives = 109/242 (45%)
Query: 759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
E+GA+EI+ +P++ VF V F G+ +VD+ + E EE T QG
Sbjct: 791 ENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEAR----KEEATRQGELPL 846
Query: 819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPXXXXXXXX 878
+ + +V L ++ RP+L + D +L Y+A+ P ++
Sbjct: 847 VKEVLLVALGSRQ-----RRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLKVRFKKV 896
Query: 879 XXXXXXXXXXXLRFSRTPLDAYTREET-PHGAPCQRITIFKNISGHQGFFLSGSRPCWCM 937
+ T E T P G R F++I G+ G F+ G P W +
Sbjct: 897 PHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVA-RFRYFEDIYGYSGVFICGPSPHWLL 955
Query: 938 VF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQ 996
V R LR+HP DG I +F HN+NC GF+Y QG L+I LP+ +YD WPV+
Sbjct: 956 VTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVR 1015
Query: 997 KV 998
K+
Sbjct: 1016 KI 1017
Score = 160 (61.4 bits), Expect = 6.8e-96, Sum P(4) = 6.8e-96
Identities = 63/223 (28%), Positives = 103/223 (46%)
Query: 543 ELPGCKGIWTVY-------HKSSRGHNADSSRMA--AYDD-EYHAYLIISLEARTMVLET 592
ELPGC +WTV ++ +G + A A DD H +LI+S E TM+L+T
Sbjct: 530 ELPGCYDMWTVIAPVRKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMILQT 589
Query: 593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXXXX 652
+ E+ S + QG T+ AGN+ R ++QV G R+L+G L F P
Sbjct: 590 GQEIMELDAS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDLGS 645
Query: 653 XXXXXXXTVLSVSIADPYVLLGMSDGSIR--LLVGDP---STCTVSVQTPAAIESSKKPV 707
++ ++ADPYV++ ++G + LL D +++ P + K +
Sbjct: 646 -------PIVQCAVADPYVVIMSAEGHVTMFLLKNDSYGGRHHRLALHKPP-LHHQSKVI 697
Query: 708 SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQG 750
+ C +Y D +T++ L GV + + G GGP +G
Sbjct: 698 TLC-VYRDVSG-----MFTTESRLG-GVRDELGGR-GGPEAEG 732
Score = 49 (22.3 bits), Expect = 5.7e-17, Sum P(3) = 5.7e-17
Identities = 21/74 (28%), Positives = 36/74 (48%)
Query: 283 ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY-----HSQSAS 337
I L S Q P +++ N+ + Y ++ V SP+G L+ G N +H+ S
Sbjct: 593 IMELDASGFATQGPTVFAG-NIGDNRY-IVQV-SPLGIRLLEGVNQLHFIPVDLGSPIVQ 649
Query: 338 CALALNNYAVSLDS 351
CA+A + Y V + +
Sbjct: 650 CAVA-DPYVVIMSA 662
Score = 48 (22.0 bits), Expect = 6.8e-96, Sum P(4) = 6.8e-96
Identities = 16/52 (30%), Positives = 25/52 (48%)
Query: 3 FAAYKMMHWPTGIA-NCGSGFITHSRADYV----PQIPLIQTEELDSELPSK 49
+A YK H PTG+ + F +S + V Q+ + + DSE P+K
Sbjct: 2 YAVYKQAHPPTGLEFSMYCNFFNNSERNLVVAGTSQLYVYRLNR-DSEAPTK 52
>RGD|1306406
symbol:Cpsf1 "cleavage and polyadenylation specific factor 1,
160kDa" species:10116 "Rattus norvegicus" [GO:0003730 "mRNA 3'-UTR
binding" evidence=IEA;ISO] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IEA;ISO]
[GO:0006378 "mRNA polyadenylation" evidence=IEA;ISO] [GO:0006379
"mRNA cleavage" evidence=IEA;ISO] InterPro:IPR004871 Pfam:PF03178
RGD:1306406 GO:GO:0005634 GO:GO:0003676 EMBL:CH473950 KO:K14401
GeneTree:ENSGT00550000075040 CTD:29894 IPI:IPI00949657
RefSeq:NP_001124043.1 UniGene:Rn.40455 Ensembl:ENSRNOT00000066244
GeneID:366952 KEGG:rno:366952 UCSC:RGD:1306406 NextBio:690318
Uniprot:D4A0H5
Length = 1386
Score = 652 (234.6 bits), Expect = 7.5e-95, Sum P(4) = 7.5e-95
Identities = 167/473 (35%), Positives = 256/473 (54%)
Query: 57 NLVVTAANVIEIYVVRVQXXXXXXXXXXXXTKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ +
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTVG 429
Query: 471 -SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ QD V +E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTQDEV--DEIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVG 478
Score = 250 (93.1 bits), Expect = 7.5e-95, Sum P(4) = 7.5e-95
Identities = 70/241 (29%), Positives = 107/241 (44%)
Query: 759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
E+G +EI+ +P++ VF V F G+ +VD+ + E EE T QG
Sbjct: 784 ENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATRQGELPL 839
Query: 819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPXXXXXXXX 878
+ + +V L ++ SRP+L + D +L Y+A+ P ++
Sbjct: 840 VKEVLLVALGSRQ-----SRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLKVRFKKV 889
Query: 879 XXXXXXXXXXXLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV 938
+ T E + R F++I G+ G F+ G P W +V
Sbjct: 890 PHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPSPHWLLV 949
Query: 939 F-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQK 997
R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD WPV+K
Sbjct: 950 TGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRK 1009
Query: 998 V 998
+
Sbjct: 1010 I 1010
Score = 158 (60.7 bits), Expect = 7.5e-95, Sum P(4) = 7.5e-95
Identities = 45/152 (29%), Positives = 72/152 (47%)
Query: 543 ELPGCKGIWTVYH----------KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLET 592
ELPGC +WTV K+ S+ A D H +LI+S E TM+L+T
Sbjct: 523 ELPGCYDMWTVIAPVRKEEEETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQT 582
Query: 593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXXXX 652
+ E+ S + QG T+ AGN+ R ++QV G R+L+G L F P
Sbjct: 583 GQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDLGA 638
Query: 653 XXXXXXXTVLSVSIADPYVLLGMSDGSIRLLV 684
++ ++ADPYV++ ++G + + +
Sbjct: 639 -------PIVQCAVADPYVVIMSAEGHVTMFL 663
Score = 47 (21.6 bits), Expect = 1.1e-15, Sum P(3) = 1.1e-15
Identities = 15/49 (30%), Positives = 26/49 (53%)
Query: 283 ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY 331
I L S Q P +++ N+ + Y ++ V SP+G L+ G N +H+
Sbjct: 586 IMELDTSGFATQGPTVFAG-NIGDNRY-IVQV-SPLGIRLLEGVNQLHF 631
Score = 42 (19.8 bits), Expect = 7.5e-95, Sum P(4) = 7.5e-95
Identities = 7/13 (53%), Positives = 9/13 (69%)
Query: 3 FAAYKMMHWPTGI 15
+A YK H PTG+
Sbjct: 2 YAVYKQAHPPTGL 14
>UNIPROTKB|F1PC28
symbol:CPSF1 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0006379 "mRNA cleavage" evidence=IEA]
[GO:0006378 "mRNA polyadenylation" evidence=IEA] [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=IEA] [GO:0003730 "mRNA 3'-UTR binding" evidence=IEA]
InterPro:IPR004871 Pfam:PF03178 GO:GO:0006378 GO:GO:0003730
GO:GO:0005847 GO:GO:0006379 GeneTree:ENSGT00550000075040
OMA:NIGDNRY EMBL:AAEX03008966 ProteinModelPortal:F1PC28
Ensembl:ENSCAFT00000002514 Uniprot:F1PC28
Length = 1398
Score = 640 (230.4 bits), Expect = 8.3e-95, Sum P(3) = 8.3e-95
Identities = 158/431 (36%), Positives = 240/431 (55%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LELV + GNV S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H
Sbjct: 23 LELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 78
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
FE PE L+ G P V+VDP GRC +L+YG ++++L + + +E
Sbjct: 79 YFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEG 132
Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
G G + S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ +
Sbjct: 133 LMGEGQRSSFLPSYIIDVRGLDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQ 192
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS- 337
TC I A+S++ T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS
Sbjct: 193 DTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPP 252
Query: 338 CALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
+ALN + + + LD A A ++ D ++S K G++ +LT++ DG R
Sbjct: 253 YGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMR 312
Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
V+ K SVLT+ + T+ FLGSRLG+SLL+++T S+ E D
Sbjct: 313 SVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAA--REAAD 370
Query: 457 IEADAPSTKRLRRSSS-----DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLV 510
E KR+ ++ QD V +E+ +YGS A + T+ A T+SF V DS++
Sbjct: 371 KEEPPSKKKRVDCAAGWSGGKSVPQDEV--DEIEVYGSEAQSGTQLA--TYSFEVCDSIL 426
Query: 511 NIGPLKDFSYG 521
NIGP + + G
Sbjct: 427 NIGPCANAAMG 437
Score = 250 (93.1 bits), Expect = 8.3e-95, Sum P(3) = 8.3e-95
Identities = 74/243 (30%), Positives = 115/243 (47%)
Query: 759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T QG
Sbjct: 745 ENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQAEARKEEATRQGELPL 800
Query: 819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPXXXXXXXX 878
+ + +V L ++ SRP+L + D +L Y+A+ P + S+
Sbjct: 801 VKEVLLVALGSRQ-----SRPYLL-VHVDQELLIYEAF----PHD-SQLGQGNLKVRFKK 849
Query: 879 XXXXXXXXXXXLRFSRTPLDAYTREETPHGAPCQ--RITIFKNISGHQGFFLSGSRPCWC 936
+ S+ + EE GA + R F++I G+ G F+ G P W
Sbjct: 850 VPHNINFREKKPKPSKKKAEGGGAEEGA-GARGRVARFRYFEDIYGYSGVFICGPSPHWL 908
Query: 937 MVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPV 995
+V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD WPV
Sbjct: 909 LVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPV 968
Query: 996 QKV 998
+K+
Sbjct: 969 RKI 971
Score = 176 (67.0 bits), Expect = 8.3e-95, Sum P(3) = 8.3e-95
Identities = 49/152 (32%), Positives = 79/152 (51%)
Query: 543 ELPGCKGIWTVY-------HKSSRGHNAD--SSRMAAYDD-EYHAYLIISLEARTMVLET 592
ELPGC +WTV ++S+G A+ SS + A DD H +LI+S E TM+L+T
Sbjct: 484 ELPGCYDMWTVIAPVRKEQEETSKGEVAEQESSALEAEDDGRRHGFLILSREDSTMILQT 543
Query: 593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXXXX 652
+ E+ S + QG T+ AGN+ R ++QV G R+L+G L F P
Sbjct: 544 GQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDLGS 599
Query: 653 XXXXXXXTVLSVSIADPYVLLGMSDGSIRLLV 684
++ ++ADPYV++ ++G + + +
Sbjct: 600 -------PIVQCAVADPYVVIMSAEGHVTMFL 624
Score = 49 (22.3 bits), Expect = 1.6e-16, Sum P(2) = 1.6e-16
Identities = 21/74 (28%), Positives = 36/74 (48%)
Query: 283 ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY-----HSQSAS 337
I L S Q P +++ N+ + Y ++ V SP+G L+ G N +H+ S
Sbjct: 547 IMELDTSGFATQGPTVFAG-NIGDNRY-IVQV-SPLGIRLLEGVNQLHFIPVDLGSPIVQ 603
Query: 338 CALALNNYAVSLDS 351
CA+A + Y V + +
Sbjct: 604 CAVA-DPYVVIMSA 616
>MGI|MGI:2679722
symbol:Cpsf1 "cleavage and polyadenylation specific factor
1" species:10090 "Mus musculus" [GO:0003676 "nucleic acid binding"
evidence=IEA] [GO:0003723 "RNA binding" evidence=IEA] [GO:0003730
"mRNA 3'-UTR binding" evidence=ISO] [GO:0005634 "nucleus"
evidence=IEA] [GO:0005847 "mRNA cleavage and polyadenylation
specificity factor complex" evidence=ISO] [GO:0006378 "mRNA
polyadenylation" evidence=ISO] [GO:0006379 "mRNA cleavage"
evidence=ISO] [GO:0006397 "mRNA processing" evidence=IEA]
InterPro:IPR004871 Pfam:PF03178 MGI:MGI:2679722 GO:GO:0006378
GO:GO:0003730 eggNOG:COG5161 KO:K14401 GO:GO:0005847 GO:GO:0006379
GeneTree:ENSGT00550000075040 OMA:NIGDNRY CTD:29894
HOGENOM:HOG000007904 HOVERGEN:HBG051105 OrthoDB:EOG4BCDM3
EMBL:AF322193 EMBL:BC056388 IPI:IPI00110363 RefSeq:NP_001157645.1
RefSeq:NP_444423.1 UniGene:Mm.45141 ProteinModelPortal:Q9EPU4
STRING:Q9EPU4 PhosphoSite:Q9EPU4 PaxDb:Q9EPU4 PRIDE:Q9EPU4
Ensembl:ENSMUST00000071898 GeneID:94230 KEGG:mmu:94230
UCSC:uc007wky.2 InParanoid:Q9EPU4 NextBio:352239 Bgee:Q9EPU4
CleanEx:MM_CPSF1 Genevestigator:Q9EPU4
GermOnline:ENSMUSG00000034022 Uniprot:Q9EPU4
Length = 1441
Score = 648 (233.2 bits), Expect = 9.3e-95, Sum P(4) = 9.3e-95
Identities = 167/475 (35%), Positives = 255/475 (53%)
Query: 57 NLVVTAANVIEIYVVRVQXXXXXXXXXXXXTKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ +
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429
Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
QD V +E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTVPQDEV--DEIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVG 480
Score = 256 (95.2 bits), Expect = 9.3e-95, Sum P(4) = 9.3e-95
Identities = 73/242 (30%), Positives = 112/242 (46%)
Query: 759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
E+G +EI+ +P++ VF V F G+ +VD+ + E EE T QG
Sbjct: 788 ENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATRQGELPL 843
Query: 819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPXXXXXXXX 878
+ + +V L ++ SRP+L + D +L Y+A+ P + S+
Sbjct: 844 VKEVLLVALGSRQ-----SRPYLL-VHVDQELLIYEAF----PHD-SQLGQGNLKVRFKK 892
Query: 879 XXXXXXXXXXXLRFSRTPLDAYTREETPHG-APCQRITIFKNISGHQGFFLSGSRPCWCM 937
+ S+ + + EE G R F++I G+ G F+ G P W +
Sbjct: 893 VPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPSPHWLL 952
Query: 938 VF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQ 996
V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD WPV+
Sbjct: 953 VTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVR 1012
Query: 997 KV 998
K+
Sbjct: 1013 KI 1014
Score = 158 (60.7 bits), Expect = 9.3e-95, Sum P(4) = 9.3e-95
Identities = 45/152 (29%), Positives = 72/152 (47%)
Query: 543 ELPGCKGIWTVYH----------KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLET 592
ELPGC +WTV K+ S+ A D H +LI+S E TM+L+T
Sbjct: 527 ELPGCYDMWTVIAPVRKEEEETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQT 586
Query: 593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXXXX 652
+ E+ S + QG T+ AGN+ R ++QV G R+L+G L F P
Sbjct: 587 GQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDLGA 642
Query: 653 XXXXXXXTVLSVSIADPYVLLGMSDGSIRLLV 684
++ ++ADPYV++ ++G + + +
Sbjct: 643 -------PIVQCAVADPYVVIMSAEGHVTMFL 667
Score = 47 (21.6 bits), Expect = 2.9e-16, Sum P(3) = 2.9e-16
Identities = 15/49 (30%), Positives = 26/49 (53%)
Query: 283 ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY 331
I L S Q P +++ N+ + Y ++ V SP+G L+ G N +H+
Sbjct: 590 IMELDTSGFATQGPTVFAG-NIGDNRY-IVQV-SPLGIRLLEGVNQLHF 635
Score = 42 (19.8 bits), Expect = 9.3e-95, Sum P(4) = 9.3e-95
Identities = 7/13 (53%), Positives = 9/13 (69%)
Query: 3 FAAYKMMHWPTGI 15
+A YK H PTG+
Sbjct: 2 YAVYKQAHPPTGL 14
>UNIPROTKB|Q10570
symbol:CPSF1 "Cleavage and polyadenylation specificity
factor subunit 1" species:9606 "Homo sapiens" [GO:0003730 "mRNA
3'-UTR binding" evidence=IDA] [GO:0006379 "mRNA cleavage"
evidence=IDA] [GO:0006378 "mRNA polyadenylation" evidence=IDA]
[GO:0005847 "mRNA cleavage and polyadenylation specificity factor
complex" evidence=IDA] [GO:0005515 "protein binding" evidence=IPI]
[GO:0000398 "mRNA splicing, via spliceosome" evidence=TAS]
[GO:0005654 "nucleoplasm" evidence=TAS] [GO:0006366 "transcription
from RNA polymerase II promoter" evidence=TAS] [GO:0006369
"termination of RNA polymerase II transcription" evidence=TAS]
[GO:0006397 "mRNA processing" evidence=TAS] [GO:0006406 "mRNA
export from nucleus" evidence=TAS] [GO:0008380 "RNA splicing"
evidence=TAS] [GO:0010467 "gene expression" evidence=TAS]
[GO:0031124 "mRNA 3'-end processing" evidence=TAS]
Reactome:REACT_71 InterPro:IPR004871 Pfam:PF03178
Reactome:REACT_1675 GO:GO:0006378 GO:GO:0003730 GO:GO:0006406
GO:GO:0000398 Reactome:REACT_1788 eggNOG:COG5161 KO:K14401
GO:GO:0005847 GO:GO:0006379 GO:GO:0006369 Reactome:REACT_78
OMA:NIGDNRY CTD:29894 HOGENOM:HOG000007904 HOVERGEN:HBG051105
OrthoDB:EOG4BCDM3 EMBL:U37012 EMBL:BC017232 IPI:IPI00026219
RefSeq:NP_037423.2 UniGene:Hs.493202 ProteinModelPortal:Q10570
DIP:DIP-32694N IntAct:Q10570 MINT:MINT-1601544 STRING:Q10570
PhosphoSite:Q10570 DMDM:23503048 PaxDb:Q10570 PeptideAtlas:Q10570
PRIDE:Q10570 DNASU:29894 Ensembl:ENST00000349769
Ensembl:ENST00000568627 GeneID:29894 KEGG:hsa:29894 UCSC:uc003zcj.3
GeneCards:GC08M145618 HGNC:HGNC:2324 MIM:606027 neXtProt:NX_Q10570
PharmGKB:PA26841 InParanoid:Q10570 PhylomeDB:Q10570 ChiTaRS:CPSF1
GenomeRNAi:29894 NextBio:52452 ArrayExpress:Q10570 Bgee:Q10570
CleanEx:HS_CPSF1 Genevestigator:Q10570 GermOnline:ENSG00000071894
Uniprot:Q10570
Length = 1443
Score = 651 (234.2 bits), Expect = 2.5e-94, Sum P(4) = 2.5e-94
Identities = 166/474 (35%), Positives = 257/474 (54%)
Query: 57 NLVVTAANVIEIYVVRVQXXXXXXXXXXXXTKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +LVYG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRES---LAEEHEGLVGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A AT++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPSTKRLR 468
+ + T+ FLGSRLG+SLL+++T +++ + KEE + +T
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDATAGWS 431
Query: 469 RSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ QD V +E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 432 AAGKSVPQDEV--DEIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAVG 481
Score = 248 (92.4 bits), Expect = 2.5e-94, Sum P(4) = 2.5e-94
Identities = 73/243 (30%), Positives = 113/243 (46%)
Query: 759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
E+G +EI+ +P++ VF V F G+ +VD+ + E EE T QG
Sbjct: 790 ENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEAR----REEATRQGELPL 845
Query: 819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPXXXXXXXX 878
+ + +V L ++ SRP+L + D +L Y+A+ P + S+
Sbjct: 846 VKEVLLVALGSRQ-----SRPYLL-VHVDQELLIYEAF----PHD-SQLGQGNLKVRFKK 894
Query: 879 XXXXXXXXXXXLRFSRTPLDAYTREETPHGAPCQ--RITIFKNISGHQGFFLSGSRPCWC 936
+ S+ + EE GA + R F++I G+ G F+ G P W
Sbjct: 895 VPHNINFREKKPKPSKKKAEGGGAEEGA-GARGRVARFRYFEDIYGYSGVFICGPSPHWL 953
Query: 937 MVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPV 995
+V R LR+HP DG + +F HNVNC GF+Y QG L+I LP+ +YD WPV
Sbjct: 954 LVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPV 1013
Query: 996 QKV 998
+K+
Sbjct: 1014 RKI 1016
Score = 158 (60.7 bits), Expect = 2.5e-94, Sum P(4) = 2.5e-94
Identities = 46/153 (30%), Positives = 75/153 (49%)
Query: 543 ELPGCKGIWTVY-------HKSSRGHNAD---SSRMAAYDD-EYHAYLIISLEARTMVLE 591
ELPGC +WTV + +G + S+ A DD H +LI+S E TM+L+
Sbjct: 528 ELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQ 587
Query: 592 TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXXX 651
T + E+ S + QG T+ AGN+ R ++QV G R+L+G L F P
Sbjct: 588 TGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDLG 643
Query: 652 XXXXXXXXTVLSVSIADPYVLLGMSDGSIRLLV 684
++ ++ADPYV++ ++G + + +
Sbjct: 644 A-------PIVQCAVADPYVVIMSAEGHVTMFL 669
Score = 47 (21.6 bits), Expect = 2.1e-15, Sum P(3) = 2.1e-15
Identities = 15/49 (30%), Positives = 26/49 (53%)
Query: 283 ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY 331
I L S Q P +++ N+ + Y ++ V SP+G L+ G N +H+
Sbjct: 592 IMELDTSGFATQGPTVFAG-NIGDNRY-IVQV-SPLGIRLLEGVNQLHF 637
Score = 42 (19.8 bits), Expect = 2.5e-94, Sum P(4) = 2.5e-94
Identities = 7/13 (53%), Positives = 9/13 (69%)
Query: 3 FAAYKMMHWPTGI 15
+A YK H PTG+
Sbjct: 2 YAVYKQAHPPTGL 14
>FB|FBgn0024698 [details] [associations]
symbol:Cpsf160 "Cleavage and polyadenylation specificity
factor 160" species:7227 "Drosophila melanogaster" [GO:0005847
"mRNA cleavage and polyadenylation specificity factor complex"
evidence=ISS;NAS] [GO:0006378 "mRNA polyadenylation"
evidence=ISS;NAS] [GO:0006379 "mRNA cleavage" evidence=ISS;NAS]
[GO:0003730 "mRNA 3'-UTR binding" evidence=ISS] [GO:0003729 "mRNA
binding" evidence=NAS] [GO:0005515 "protein binding" evidence=IPI]
[GO:0022008 "neurogenesis" evidence=IMP] InterPro:IPR004871
Pfam:PF03178 EMBL:AE013599 GO:GO:0022008 GO:GO:0006378
GO:GO:0003723 eggNOG:COG5161 KO:K14401 GO:GO:0005847 GO:GO:0006379
GeneTree:ENSGT00550000075040 OMA:NIGDNRY EMBL:AF241364
EMBL:AF241365 EMBL:AF241366 EMBL:AY051896 RefSeq:NP_725397.1
RefSeq:NP_995833.1 UniGene:Dm.3414 ProteinModelPortal:Q9V726
STRING:Q9V726 PaxDb:Q9V726 PRIDE:Q9V726 EnsemblMetazoa:FBtr0089258
GeneID:44250 KEGG:dme:Dmel_CG10110 CTD:44250 FlyBase:FBgn0024698
InParanoid:Q9V726 OrthoDB:EOG4ZCRK8 PhylomeDB:Q9V726
GenomeRNAi:44250 NextBio:837008 Bgee:Q9V726 GermOnline:CG10110
Uniprot:Q9V726
Length = 1455
Score = 566 (204.3 bits), Expect = 2.3e-79, Sum P(4) = 2.3e-79
Identities = 158/509 (31%), Positives = 260/509 (51%)
Query: 57 NLVVTAANVIEIYVVRVQXXXXXXXXXXXXTKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV ANV+++Y + R LE + Y L+GNV SL
Sbjct: 29 NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLA-----PKMRLECLATYTLYGNVMSL 83
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+S GA RD+++++F+DAK+SVL+ D L+ S+H FE + GR
Sbjct: 84 QCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDDIRGGWTGRY- 138
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR--I 230
F P V+VDP RC +LVYG ++++L + S L + + +R I
Sbjct: 139 FV--PTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTPI 196
Query: 231 ESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
+S++I LRDLD K +V D F+HGY EP ++IL+E T GR+ + TC++ A+S+
Sbjct: 197 MASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAISL 256
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAV 347
+ + HP+IW+ +LP D ++ + PIGG LV+ N + Y +QS ++LN+ A
Sbjct: 257 NIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNSSAD 316
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
+ + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+ K
Sbjct: 317 NSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVRNFHFHKA 376
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------SGLKEEF 454
SVLTS I + + FLGSRLG+SLL+ FT +++++ L++E
Sbjct: 377 AASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQRNLQDED 436
Query: 455 GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP 514
++E + +L + + A + EEL +YGS + + + F F V DSL+N+ P
Sbjct: 437 QNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCDSLMNVAP 495
Query: 515 LKDFSYGLRINADASATGISKQSNYELVE 543
+ G R+ + G++ + + E ++
Sbjct: 496 INYMCAGERVEFEED--GVTLRPHAESLQ 522
Score = 174 (66.3 bits), Expect = 2.3e-79, Sum P(4) = 2.3e-79
Identities = 35/88 (39%), Positives = 52/88 (59%)
Query: 912 QRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFI 970
Q++ F N+ G G + G PC+ + FR LR+H L +G + +F +NVN +GF+
Sbjct: 942 QKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNIPNGFL 1001
Query: 971 YVTSQGILKICQLPSGSTYDNYWPVQKV 998
Y + LKI LPS +YD+ WPV+KV
Sbjct: 1002 YFDTTYELKISVLPSYLSYDSVWPVRKV 1029
Score = 159 (61.0 bits), Expect = 2.3e-79, Sum P(4) = 2.3e-79
Identities = 58/198 (29%), Positives = 98/198 (49%)
Query: 543 ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES 602
EL GC +WTV+ D+++ ++ +D+ H ++++S T+VL+T + E+ E+
Sbjct: 556 ELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQEINEI-EN 605
Query: 603 VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXXXXXXXXXXXTVL 662
+ V TI GNL +R ++QV R R+L G+ + Q++ V+
Sbjct: 606 TGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPID----------VGSPVV 655
Query: 663 SVSIADPYVLLGMSDGSIRLLV-----GDPSTC----TVSVQTPAAIE-SSKKPVSSCTL 712
VSIADPYV L + +G + L G P T+S +PA + S+ K +S L
Sbjct: 656 QVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTIS-SSPAVVAISAYKDLSG--L 712
Query: 713 YHDKGPEPWLRKTSTDAW 730
+ KG + L +S A+
Sbjct: 713 FTVKGDDINLTGSSNSAF 730
Score = 75 (31.5 bits), Expect = 2.3e-79, Sum P(4) = 2.3e-79
Identities = 28/105 (26%), Positives = 50/105 (47%)
Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
VV +SG LEI+ +P+ V+ V+ +G + D E + S T +S+ G Q
Sbjct: 795 VVARQSGTLEIYSMPDMKLVYLVNDVGNGSMVLTDAM--EFVPISLTT-QENSKAGIVQA 851
Query: 815 -RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLF 858
++ +S +EL++ + RP L + T +L YQ + +
Sbjct: 852 CMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRVELLIYQVFRY 895
Score = 37 (18.1 bits), Expect = 6.3e-10, Sum P(3) = 6.3e-10
Identities = 9/18 (50%), Positives = 11/18 (61%)
Query: 373 QNDVALLSTKTGDLVLLT 390
Q+D LLS + LVL T
Sbjct: 579 QHDFMLLSQRNSTLVLQT 596
>DICTYBASE|DDB_G0281585 [details] [associations]
symbol:cpsf1 "cleavage and polyadenylation
specificity factor 160 kDa subunit" species:44689 "Dictyostelium
discoideum" [GO:0005634 "nucleus" evidence=IEA] [GO:0003676
"nucleic acid binding" evidence=IEA] [GO:0006379 "mRNA cleavage"
evidence=ISS] [GO:0006378 "mRNA polyadenylation" evidence=ISS]
[GO:0005847 "mRNA cleavage and polyadenylation specificity factor
complex" evidence=ISS] [GO:0003730 "mRNA 3'-UTR binding"
evidence=ISS] InterPro:IPR004871 Pfam:PF03178
dictyBase:DDB_G0281585 GenomeReviews:CM000152_GR GO:GO:0006378
EMBL:AAFI02000042 GO:GO:0003730 eggNOG:COG5161 KO:K14401
GO:GO:0005847 GO:GO:0006379 RefSeq:XP_640515.1
EnsemblProtists:DDB0233702 GeneID:8623125 KEGG:ddi:DDB_G0281585
InParanoid:Q54TS6 OMA:TSATIQD Uniprot:Q54TS6
Length = 1628
Score = 413 (150.4 bits), Expect = 8.5e-65, Sum P(6) = 8.5e-65
Identities = 99/283 (34%), Positives = 160/283 (56%)
Query: 239 RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
+++++++VKDF F+HGY EP ++ LHE TW R++ K TC ++A+S++ K I
Sbjct: 281 KNIEIENVKDFCFLHGYYEPTILFLHEPIQTWTSRIAVKKFTCQMTAISLNLLTKAGSFI 340
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W+ N P++ L++VP P+GG LV+ AN + Y +Q++ LA+N YA S+D+S +
Sbjct: 341 WNVSNFPYNCEMLVSVPEPLGGALVITANIMFYVNQTSRYGLAVNEYA-SIDTSTIIGSQ 399
Query: 359 SFSVE----------LDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP 408
F LD ++ +L++D + S K G+L++ ++ DGR VQR+ +SK
Sbjct: 400 PFDFPIDDTLNLVFTLDRSNFVFLESDKFIGSLKGGELLIFHLISDGRSVQRIHVSKAGG 459
Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
SVLTS I + N+L FLGSRLGDSLL+Q+T S+ L+ E P K+
Sbjct: 460 SVLTSCICVLSNNLIFLGSRLGDSLLLQYT---EKSITDDQLEHE----NFSNPYKKQKT 512
Query: 469 RSSSDALQDMVNGEELSLYGSASNNT-ESAQKTFSFAVRDSLV 510
D + N E + S +NN E+ +K+ S ++ L+
Sbjct: 513 SEVFDLFDE--NSETNNNNNSNNNNNKENQEKSSSSSIASKLL 553
Score = 210 (79.0 bits), Expect = 8.5e-65, Sum P(6) = 8.5e-65
Identities = 60/173 (34%), Positives = 88/173 (50%)
Query: 57 NLVVTAANVIEIYVVRVQXXXXXXXXXXXXTKRRVLMD----GISAA-------SLELVC 105
NLV+ NV++IY +R + +++ + I+ SLEL+
Sbjct: 32 NLVLAKTNVLQIYKIRYEKIEKYENVSDSQPQQQQEQEQQQQDITQKKKIELKPSLELII 91
Query: 106 HYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESP 165
+L GN+ES+A + NS R DS+IL F DAKISVL++D + I S+H FE
Sbjct: 92 EKKLFGNIESMASVRY---PNSER-DSLILTFRDAKISVLDYDSDLLDFEIRSLHYFEKD 147
Query: 166 EWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED 218
E+ K GR F PL+KVD Q RC +L+Y + +L + S L D+D
Sbjct: 148 EF---KGGRNHFKHPPLLKVDTQQRCAVMLLYDRNLAVLPFKKTSSILDDDDD 197
Score = 119 (46.9 bits), Expect = 8.5e-65, Sum P(6) = 8.5e-65
Identities = 35/148 (23%), Positives = 72/148 (48%)
Query: 572 DDEYHAYLIISL-EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG 630
D +H YL +SL + T++ ET L EV + +++ GNLFGR+R++ +++ G
Sbjct: 712 DKNWHDYLYLSLKDGTTLIFETGRDLKEVGK-----FNFKSLDIGNLFGRKRIVVIYQGG 766
Query: 631 ARILDG-SYMTQDLSFGPXXXXXXXXXXXXTVLSVSIADPYVLLGMSDGSIRLLVG-DPS 688
++++G + Q++ + S I DP++LL +G+I++ G D
Sbjct: 767 IKLINGFDRVIQEIQINE------------PIKSSYICDPFILLQFHNGTIQIFKGIDEE 814
Query: 689 TCTVSVQTPAAIESSKKPVSSCTLYHDK 716
+ + + + + S +L+ D+
Sbjct: 815 NQLIQFSINSISNNLNQSIFSSSLFFDR 842
Score = 91 (37.1 bits), Expect = 8.5e-65, Sum P(6) = 8.5e-65
Identities = 24/80 (30%), Positives = 37/80 (46%)
Query: 912 QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ-LCDGS---------------IV 955
+RI F +ISG +G F+ G +P W + LR+H D S +
Sbjct: 1122 KRIFEFSSISGKRGLFIGGKKPIWAFCEKGYLRLHSMDSSDNSNSNNSNNNNNNNSNTVE 1181
Query: 956 AFTVLHNVNCNHGFIYVTSQ 975
FT +N++C GFIY + +
Sbjct: 1182 TFTSFNNISCQDGFIYFSKE 1201
Score = 79 (32.9 bits), Expect = 1.5e-63, Sum P(6) = 1.5e-63
Identities = 13/47 (27%), Positives = 29/47 (61%)
Query: 953 SIVAFTVLHNVNCNHGFIYVTSQ-GILKICQLPSGSTYDNYWPVQKV 998
++ FT +N++C GFIY + + ++KIC L + ++N ++++
Sbjct: 1179 TVETFTSFNNISCQDGFIYFSKEKDVIKICTLSTLMNFENDIAIRRI 1225
Score = 73 (30.8 bits), Expect = 2.5e-28, Sum P(6) = 2.5e-28
Identities = 22/82 (26%), Positives = 35/82 (42%)
Query: 470 SSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADAS 529
S + L + + EE L+ N K++ + D ++NIGP+ D G I+
Sbjct: 547 SIASKLLEEIEDEEDQLFKEKKNQL----KSYQLGICDQIINIGPIGDIVVGQSIDPTYD 602
Query: 530 ATGISKQSNY--ELVELPGCKG 549
T Q Y + +EL C G
Sbjct: 603 ETIQPNQPEYVPKTLELVTCSG 624
Score = 64 (27.6 bits), Expect = 6.3e-57, Sum P(4) = 6.3e-57
Identities = 21/89 (23%), Positives = 38/89 (42%)
Query: 766 FDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVV 825
F++P V+TV K HI + K + N+++E+ + + +
Sbjct: 646 FELPGILNVWTVYKEEIEEEHIEKEIKKNTSKKRSRDENNNNEQEDNEQEDNEDNEEEEE 705
Query: 826 ELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
E MQ+ H +L+ L DGT L ++
Sbjct: 706 EEKMQKDKNWHD--YLYLSLKDGTTLIFE 732
Score = 58 (25.5 bits), Expect = 8.5e-65, Sum P(6) = 8.5e-65
Identities = 19/77 (24%), Positives = 38/77 (49%)
Query: 748 DQGDIYSVVCYESGALEIFDVPNFNCVFTVD--KF---VSG-RTHIVDTYMREALKDSET 801
DQ +IY + +G+ EI+ + + C+F V KF + G T++ + E + ++
Sbjct: 932 DQDNIYLNIYTTNGSYEIYRLTSQECIFKVSDIKFEYDILGINTNVSQNQILEQVLTPKS 991
Query: 802 EINSSSEEGTGQGRKEN 818
++ + Q +KEN
Sbjct: 992 SLSKKQLQQHLQKQKEN 1008
Score = 53 (23.7 bits), Expect = 2.8e-64, Sum P(6) = 2.8e-64
Identities = 14/60 (23%), Positives = 31/60 (51%)
Query: 797 KDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
K E INS + + +N + +VE+++ ++ +S P+LF G ++ Y+++
Sbjct: 1004 KQKENGINSKNN----YNQIQNSEILDIVEISLHNFN--NSDPYLFMFNKIGDLIIYKSF 1057
Score = 49 (22.3 bits), Expect = 8.5e-65, Sum P(6) = 8.5e-65
Identities = 8/12 (66%), Positives = 9/12 (75%)
Query: 543 ELPGCKGIWTVY 554
ELPG +WTVY
Sbjct: 647 ELPGILNVWTVY 658
>UNIPROTKB|F1RSN8 [details] [associations]
symbol:CPSF1 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006378 "mRNA
polyadenylation" evidence=IEA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IEA]
[GO:0003730 "mRNA 3'-UTR binding" evidence=IEA] InterPro:IPR004871
Pfam:PF03178 GO:GO:0006378 GO:GO:0003730 GO:GO:0005847
GO:GO:0006379 GeneTree:ENSGT00550000075040 OMA:NIGDNRY
EMBL:CU468594 Ensembl:ENSSSCT00000006486 Uniprot:F1RSN8
Length = 1108
Score = 466 (169.1 bits), Expect = 3.9e-61, Sum P(4) = 3.9e-61
Identities = 114/325 (35%), Positives = 176/325 (54%)
Query: 57 NLVVTAANVIEIYVVRVQXXXXXXXXXXXXTKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ T+ + + LELV + G V S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEAPTKNDRSTEGKAHRE--HREKLELVASFSFFG-VMSM 83
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 84 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 136
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 137 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLVGEGQRSSFLPSYII 193
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 194 DVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 253
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 254 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 313
Query: 354 ELPRSSFSVELDAAHATWLQN-DVA 377
+ + LD A A ++ + DVA
Sbjct: 314 LRTQEGVRITLDCAQAAFISSQDVA 338
Score = 197 (74.4 bits), Expect = 3.9e-61, Sum P(4) = 3.9e-61
Identities = 39/87 (44%), Positives = 52/87 (59%)
Query: 913 RITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIY 971
R F++I G+ G F+ G P W +V R LR+HP DG I +F HNVNC GF+Y
Sbjct: 595 RFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLY 654
Query: 972 VTSQGILKICQLPSGSTYDNYWPVQKV 998
QG L+I LP+ +YD WPV+K+
Sbjct: 655 FNRQGELRISVLPAYLSYDAPWPVRKI 681
Score = 94 (38.1 bits), Expect = 3.9e-61, Sum P(4) = 3.9e-61
Identities = 27/98 (27%), Positives = 47/98 (47%)
Query: 759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
E+G +EI+ +P++ VF V F G+ +VD+ + E EE T QG
Sbjct: 455 ENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEAR----KEEATRQGELPL 510
Query: 819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
+ + +V L ++ RP+L + D +L Y+A+
Sbjct: 511 VKEVLLVALGSRQ-----RRPYLL-VHVDQELLIYEAF 542
Score = 45 (20.9 bits), Expect = 3.9e-61, Sum P(4) = 3.9e-61
Identities = 15/52 (28%), Positives = 25/52 (48%)
Query: 3 FAAYKMMHWPTGIA-NCGSGFITHSRADYV----PQIPLIQTEELDSELPSK 49
+A YK H PTG+ + F +S + V Q+ + + D+E P+K
Sbjct: 2 YAVYKQAHPPTGLEFSMYCNFFNNSERNLVVAGTSQLYVYRLNR-DAEAPTK 52
Score = 40 (19.1 bits), Expect = 2.1e-39, Sum P(3) = 2.1e-39
Identities = 12/34 (35%), Positives = 21/34 (61%)
Query: 468 RRSSSDALQDMVNGEELS--LYGSASNNTESAQK 499
RR +A++++++GE L+ LY S E A+K
Sbjct: 1053 RRVLQNAVRNVLDGELLNRYLYLSTMERGELAKK 1086
>WB|WBGene00022301 [details] [associations]
symbol:cpsf-1 species:6239 "Caenorhabditis elegans"
[GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005634
"nucleus" evidence=IEA] [GO:0009792 "embryo development ending in
birth or egg hatching" evidence=IMP] [GO:0000003 "reproduction"
evidence=IMP] [GO:0040007 "growth" evidence=IMP] [GO:0002119
"nematode larval development" evidence=IMP] [GO:0040018 "positive
regulation of multicellular organism growth" evidence=IMP]
[GO:0010171 "body morphogenesis" evidence=IMP] [GO:0040027
"negative regulation of vulval development" evidence=IMP]
InterPro:IPR004871 Pfam:PF03178 GO:GO:0009792 GO:GO:0040007
GO:GO:0002119 GO:GO:0006378 GO:GO:0010171 GO:GO:0040018
GO:GO:0000003 GO:GO:0003723 GO:GO:0040027 eggNOG:COG5161 KO:K14401
GO:GO:0005847 GO:GO:0006379 GeneTree:ENSGT00550000075040
OMA:NIGDNRY HOGENOM:HOG000007904 EMBL:FO081666 RefSeq:NP_500157.2
ProteinModelPortal:Q9N4C2 MINT:MINT-3384281 STRING:Q9N4C2
PaxDb:Q9N4C2 EnsemblMetazoa:Y76B12C.7.1 EnsemblMetazoa:Y76B12C.7.2
GeneID:177003 KEGG:cel:CELE_Y76B12C.7 CTD:177003 WormBase:Y76B12C.7
InParanoid:Q9N4C2 NextBio:894932 Uniprot:Q9N4C2
Length = 1454
Score = 431 (156.8 bits), Expect = 4.2e-54, Sum P(3) = 4.2e-54
Identities = 158/551 (28%), Positives = 257/551 (46%)
Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA 228
+L+ G + + PLV+ DP RC LVYG + IL + S
Sbjct: 128 YLRDGFINHFQPPLVRSDPSNRCAACLVYGKHIAILPFHEN-----------------SK 170
Query: 229 RIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISAL 286
RI S +VI L+ +D + ++ D +F+ GY EP ++ L+E T GR ++ T I +
Sbjct: 171 RIHS-YVIPLKQIDPRLDNIADMVFLDGYYEPTILFLYEPIQTTPGRACVRYDTMCIMGV 229
Query: 287 SISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNY 345
S++ +Q ++W NLP D +LL +P P+GG LV G+NT+ Y +Q+ C L LN+
Sbjct: 230 SVNIVDRQFAVVWQTANLPMDCSQLLPIPKPLGGALVFGSNTVVYLNQAVPPCGLVLNS- 288
Query: 346 AVSLDSSQELPRSSFS---VELDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQR 400
D + P + LD + + ++++ + ++ GDL LL ++ G V+
Sbjct: 289 --CYDGFTKFPLKDLKHLKMTLDCSTSVYMEDGRIAVGSRDGDLFLLRLMTSSGGGTVKS 346
Query: 401 LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
L+ SK + + +T F+GSRLGDS L+++T LK D
Sbjct: 347 LEFSKVYETSIAYSLTVCAPGHLFVGSRLGDSQLLEYTL----------LKTT-----RD 391
Query: 461 APSTKRLRRSSSD--ALQDMVNGEELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIG 513
+ KRL+ + D A + ++ +++ LYG A +++ E ++ F D L N+G
Sbjct: 392 C-AVKRLKIDNKDPAAAEIELDEDDMELYGGAIEEQQNDDDEQIDESLQFRELDRLRNVG 450
Query: 514 PLKDFSYGLRINADASATGISKQSN--YELVELPGC--KGIWTVYHKSSRGHNADSSRMA 569
P+K G R N ++ +K+ + ++LV G G V+ +S R SS +
Sbjct: 451 PVKSMCVG-RPNYMSNDLVDAKRRDPVFDLVTASGHGKNGALCVHQRSLRPEIITSSLLE 509
Query: 570 AYD---------DEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR-TIAAGNLFG 619
+ +E H YLI+S T++LE + L E+ E + FV G T+AAG L
Sbjct: 510 GAEQLWAVGRKENESHKYLIVSRVRSTLILELGEELVELEEQL--FVTGEPTVAAGELSQ 567
Query: 620 RRRVIQVFERG-ARILDGSYMTQDLSFGPXXXXXXXXXXXXTVLSVSIADPYVLLGMSDG 678
+QV A + DG M Q++ V+ SI DPYV L +G
Sbjct: 568 GALAVQVTSTCIALVTDGQQM-QEVHID----------SNFPVIQASIVDPYVALLTQNG 616
Query: 679 SIRL--LVGDP 687
+ L LV +P
Sbjct: 617 RLLLYELVMEP 627
Score = 136 (52.9 bits), Expect = 4.2e-54, Sum P(3) = 4.2e-54
Identities = 27/75 (36%), Positives = 46/75 (61%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+DSI++ F+DAK+S++ ++ ++ S+H FE+ +L+ G + + PLV+ DP
Sbjct: 92 QDSILMTFDDAKLSIVSINEKERNMQTISLHAFENE---YLRDGFINHFQPPLVRSDPSN 148
Query: 190 RCGGVLVYGLQMIIL 204
RC LVYG + IL
Sbjct: 149 RCAACLVYGKHIAIL 163
Score = 134 (52.2 bits), Expect = 4.2e-54, Sum P(3) = 4.2e-54
Identities = 59/288 (20%), Positives = 117/288 (40%)
Query: 723 RKTSTDAWLSTGVGEAIDGADGG-PLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFV 781
++ DA S+ GE D D + +V +E+G L I +P V+ + +F
Sbjct: 744 KRLGHDAIQSSRGGEQSDAIDPTRTFSSISHWLIVSHENGRLSIHSLPEMEVVYQIGRFS 803
Query: 782 SGRTHIVDTYMREALKDSETEINSSSEEG---TGQGRKENIHSMKVVELAMQR----WSA 834
+ +VD + E K+ + + +++E T + + N ++ E ++
Sbjct: 804 NVPELLVDLTVEEEEKERKAKAQQAAKEASVPTDEAEQLNTEMKQLCERVLEAQIVGMGI 863
Query: 835 HHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPXXXXXXXXXXXXXXXXXXXLRFSR 894
+ + P L AI+ + +L Y+ + +S + P +
Sbjct: 864 NQAHPILMAIVDEQVVL-YEMF-------SSSNPIPGHLGISFRKLPHFICLRTSSHLNS 915
Query: 895 TPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFRE--RLRVHPQLCD 951
A + +G I F+ +S + G + G+ P +V+ ++ H D
Sbjct: 916 DGKRAPFEMKINNGKRFSLIHPFERVSSVNNGVMIVGAVPT-LLVYGAWGGMQTHQMTVD 974
Query: 952 GSIVAFTVLHNVNCNHGFIYVTS-QGILKICQLPSGSTYDNYWPVQKV 998
G I AFT +N N HG +Y+T + L+I ++ Y+ +PV+K+
Sbjct: 975 GPIKAFTPFNNENVLHGIVYMTQHKSELRIARMHPDFDYEMPYPVKKI 1022
Score = 43 (20.2 bits), Expect = 1.6e-12, Sum P(3) = 1.6e-12
Identities = 9/40 (22%), Positives = 19/40 (47%)
Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE 483
+++ E G+ +T++ +R DA+Q GE+
Sbjct: 720 TIMEQNFPVENGEATIKQSNTRKRKRLGHDAIQSSRGGEQ 759
>UNIPROTKB|Q9N4C2 [details] [associations]
symbol:cpsf-1 "Probable cleavage and polyadenylation
specificity factor subunit 1" species:6239 "Caenorhabditis elegans"
[GO:0006378 "mRNA polyadenylation" evidence=NAS] [GO:0006379 "mRNA
cleavage" evidence=NAS] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=NAS]
InterPro:IPR004871 Pfam:PF03178 GO:GO:0009792 GO:GO:0040007
GO:GO:0002119 GO:GO:0006378 GO:GO:0010171 GO:GO:0040018
GO:GO:0000003 GO:GO:0003723 GO:GO:0040027 eggNOG:COG5161 KO:K14401
GO:GO:0005847 GO:GO:0006379 GeneTree:ENSGT00550000075040
OMA:NIGDNRY HOGENOM:HOG000007904 EMBL:FO081666 RefSeq:NP_500157.2
ProteinModelPortal:Q9N4C2 MINT:MINT-3384281 STRING:Q9N4C2
PaxDb:Q9N4C2 EnsemblMetazoa:Y76B12C.7.1 EnsemblMetazoa:Y76B12C.7.2
GeneID:177003 KEGG:cel:CELE_Y76B12C.7 CTD:177003 WormBase:Y76B12C.7
InParanoid:Q9N4C2 NextBio:894932 Uniprot:Q9N4C2
Length = 1454
Score = 431 (156.8 bits), Expect = 4.2e-54, Sum P(3) = 4.2e-54
Identities = 158/551 (28%), Positives = 257/551 (46%)
Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA 228
+L+ G + + PLV+ DP RC LVYG + IL + S
Sbjct: 128 YLRDGFINHFQPPLVRSDPSNRCAACLVYGKHIAILPFHEN-----------------SK 170
Query: 229 RIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISAL 286
RI S +VI L+ +D + ++ D +F+ GY EP ++ L+E T GR ++ T I +
Sbjct: 171 RIHS-YVIPLKQIDPRLDNIADMVFLDGYYEPTILFLYEPIQTTPGRACVRYDTMCIMGV 229
Query: 287 SISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNY 345
S++ +Q ++W NLP D +LL +P P+GG LV G+NT+ Y +Q+ C L LN+
Sbjct: 230 SVNIVDRQFAVVWQTANLPMDCSQLLPIPKPLGGALVFGSNTVVYLNQAVPPCGLVLNS- 288
Query: 346 AVSLDSSQELPRSSFS---VELDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQR 400
D + P + LD + + ++++ + ++ GDL LL ++ G V+
Sbjct: 289 --CYDGFTKFPLKDLKHLKMTLDCSTSVYMEDGRIAVGSRDGDLFLLRLMTSSGGGTVKS 346
Query: 401 LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
L+ SK + + +T F+GSRLGDS L+++T LK D
Sbjct: 347 LEFSKVYETSIAYSLTVCAPGHLFVGSRLGDSQLLEYTL----------LKTT-----RD 391
Query: 461 APSTKRLRRSSSD--ALQDMVNGEELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIG 513
+ KRL+ + D A + ++ +++ LYG A +++ E ++ F D L N+G
Sbjct: 392 C-AVKRLKIDNKDPAAAEIELDEDDMELYGGAIEEQQNDDDEQIDESLQFRELDRLRNVG 450
Query: 514 PLKDFSYGLRINADASATGISKQSN--YELVELPGC--KGIWTVYHKSSRGHNADSSRMA 569
P+K G R N ++ +K+ + ++LV G G V+ +S R SS +
Sbjct: 451 PVKSMCVG-RPNYMSNDLVDAKRRDPVFDLVTASGHGKNGALCVHQRSLRPEIITSSLLE 509
Query: 570 AYD---------DEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR-TIAAGNLFG 619
+ +E H YLI+S T++LE + L E+ E + FV G T+AAG L
Sbjct: 510 GAEQLWAVGRKENESHKYLIVSRVRSTLILELGEELVELEEQL--FVTGEPTVAAGELSQ 567
Query: 620 RRRVIQVFERG-ARILDGSYMTQDLSFGPXXXXXXXXXXXXTVLSVSIADPYVLLGMSDG 678
+QV A + DG M Q++ V+ SI DPYV L +G
Sbjct: 568 GALAVQVTSTCIALVTDGQQM-QEVHID----------SNFPVIQASIVDPYVALLTQNG 616
Query: 679 SIRL--LVGDP 687
+ L LV +P
Sbjct: 617 RLLLYELVMEP 627
Score = 136 (52.9 bits), Expect = 4.2e-54, Sum P(3) = 4.2e-54
Identities = 27/75 (36%), Positives = 46/75 (61%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+DSI++ F+DAK+S++ ++ ++ S+H FE+ +L+ G + + PLV+ DP
Sbjct: 92 QDSILMTFDDAKLSIVSINEKERNMQTISLHAFENE---YLRDGFINHFQPPLVRSDPSN 148
Query: 190 RCGGVLVYGLQMIIL 204
RC LVYG + IL
Sbjct: 149 RCAACLVYGKHIAIL 163
Score = 134 (52.2 bits), Expect = 4.2e-54, Sum P(3) = 4.2e-54
Identities = 59/288 (20%), Positives = 117/288 (40%)
Query: 723 RKTSTDAWLSTGVGEAIDGADGG-PLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFV 781
++ DA S+ GE D D + +V +E+G L I +P V+ + +F
Sbjct: 744 KRLGHDAIQSSRGGEQSDAIDPTRTFSSISHWLIVSHENGRLSIHSLPEMEVVYQIGRFS 803
Query: 782 SGRTHIVDTYMREALKDSETEINSSSEEG---TGQGRKENIHSMKVVELAMQR----WSA 834
+ +VD + E K+ + + +++E T + + N ++ E ++
Sbjct: 804 NVPELLVDLTVEEEEKERKAKAQQAAKEASVPTDEAEQLNTEMKQLCERVLEAQIVGMGI 863
Query: 835 HHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPXXXXXXXXXXXXXXXXXXXLRFSR 894
+ + P L AI+ + +L Y+ + +S + P +
Sbjct: 864 NQAHPILMAIVDEQVVL-YEMF-------SSSNPIPGHLGISFRKLPHFICLRTSSHLNS 915
Query: 895 TPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFRE--RLRVHPQLCD 951
A + +G I F+ +S + G + G+ P +V+ ++ H D
Sbjct: 916 DGKRAPFEMKINNGKRFSLIHPFERVSSVNNGVMIVGAVPT-LLVYGAWGGMQTHQMTVD 974
Query: 952 GSIVAFTVLHNVNCNHGFIYVTS-QGILKICQLPSGSTYDNYWPVQKV 998
G I AFT +N N HG +Y+T + L+I ++ Y+ +PV+K+
Sbjct: 975 GPIKAFTPFNNENVLHGIVYMTQHKSELRIARMHPDFDYEMPYPVKKI 1022
Score = 43 (20.2 bits), Expect = 1.6e-12, Sum P(3) = 1.6e-12
Identities = 9/40 (22%), Positives = 19/40 (47%)
Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE 483
+++ E G+ +T++ +R DA+Q GE+
Sbjct: 720 TIMEQNFPVENGEATIKQSNTRKRKRLGHDAIQSSRGGEQ 759
>UNIPROTKB|J9P418 [details] [associations]
symbol:CPSF1 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0005634 "nucleus" evidence=IEA] [GO:0003676
"nucleic acid binding" evidence=IEA] InterPro:IPR004871
Pfam:PF03178 GO:GO:0005634 GO:GO:0003676
GeneTree:ENSGT00550000075040 EMBL:AAEX03008966
Ensembl:ENSCAFT00000043656 Uniprot:J9P418
Length = 1107
Score = 250 (93.1 bits), Expect = 7.3e-43, Sum P(3) = 7.3e-43
Identities = 74/243 (30%), Positives = 115/243 (47%)
Query: 759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T QG
Sbjct: 454 ENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQAEARKEEATRQGELPL 509
Query: 819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPXXXXXXXX 878
+ + +V L ++ SRP+L + D +L Y+A+ P + S+
Sbjct: 510 VKEVLLVALGSRQ-----SRPYLL-VHVDQELLIYEAF----PHD-SQLGQGNLKVRFKK 558
Query: 879 XXXXXXXXXXXLRFSRTPLDAYTREETPHGAPCQ--RITIFKNISGHQGFFLSGSRPCWC 936
+ S+ + EE GA + R F++I G+ G F+ G P W
Sbjct: 559 VPHNINFREKKPKPSKKKAEGGGAEEGA-GARGRVARFRYFEDIYGYSGVFICGPSPHWL 617
Query: 937 MVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPV 995
+V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD WPV
Sbjct: 618 LVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPV 677
Query: 996 QKV 998
+K+
Sbjct: 678 RKI 680
Score = 176 (67.0 bits), Expect = 7.3e-43, Sum P(3) = 7.3e-43
Identities = 49/152 (32%), Positives = 79/152 (51%)
Query: 543 ELPGCKGIWTVY-------HKSSRGHNAD--SSRMAAYDD-EYHAYLIISLEARTMVLET 592
ELPGC +WTV ++S+G A+ SS + A DD H +LI+S E TM+L+T
Sbjct: 193 ELPGCYDMWTVIAPVRKEQEETSKGEVAEQESSALEAEDDGRRHGFLILSREDSTMILQT 252
Query: 593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXXXX 652
+ E+ S + QG T+ AGN+ R ++QV G R+L+G L F P
Sbjct: 253 GQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDLGS 308
Query: 653 XXXXXXXTVLSVSIADPYVLLGMSDGSIRLLV 684
++ ++ADPYV++ ++G + + +
Sbjct: 309 -------PIVQCAVADPYVVIMSAEGHVTMFL 333
Score = 172 (65.6 bits), Expect = 7.3e-43, Sum P(3) = 7.3e-43
Identities = 53/151 (35%), Positives = 82/151 (54%)
Query: 378 LLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
++S K G++ +LT++ DG R V+ K SVLT+ + T+ FLGSRLG+SLL++
Sbjct: 2 VISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLK 61
Query: 437 FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS-----DALQDMVNGEELSLYGS-A 490
+T S+ E D E KR+ ++ QD V +E+ +YGS A
Sbjct: 62 YTEKLQEPPASAA--REAADKEEPPSKKKRVDCAAGWSGGKSVPQDEV--DEIEVYGSEA 117
Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ T+ A T+SF V DS++NIGP + + G
Sbjct: 118 QSGTQLA--TYSFEVCDSILNIGPCANAAMG 146
Score = 49 (22.3 bits), Expect = 8.5e-17, Sum P(2) = 8.5e-17
Identities = 21/74 (28%), Positives = 36/74 (48%)
Query: 283 ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY-----HSQSAS 337
I L S Q P +++ N+ + Y ++ V SP+G L+ G N +H+ S
Sbjct: 256 IMELDTSGFATQGPTVFAG-NIGDNRY-IVQV-SPLGIRLLEGVNQLHFIPVDLGSPIVQ 312
Query: 338 CALALNNYAVSLDS 351
CA+A + Y V + +
Sbjct: 313 CAVA-DPYVVIMSA 325
>POMBASE|SPBC1709.08 [details] [associations]
symbol:cft1 "cleavage factor one Cft1 (predicted)"
species:4896 "Schizosaccharomyces pombe" [GO:0003723 "RNA binding"
evidence=IEA] [GO:0005634 "nucleus" evidence=IDA] [GO:0005829
"cytosol" evidence=IDA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IDA]
[GO:0005849 "mRNA cleavage factor complex" evidence=ISO]
[GO:0006378 "mRNA polyadenylation" evidence=ISO] [GO:0006379 "mRNA
cleavage" evidence=ISO] InterPro:IPR004871 Pfam:PF03178
PomBase:SPBC1709.08 GO:GO:0005829 EMBL:CU329671 GO:GO:0006378
GenomeReviews:CU329671_GR GO:GO:0003723 eggNOG:COG5161 KO:K14401
OMA:HNDRIFQ OrthoDB:EOG451HZS PIR:T39636 RefSeq:NP_595441.1
STRING:O74733 EnsemblFungi:SPBC1709.08.1 GeneID:2539694
KEGG:spo:SPBC1709.08 NextBio:20800847 GO:GO:0005847 GO:GO:0006379
Uniprot:O74733
Length = 1441
Score = 268 (99.4 bits), Expect = 1.3e-25, Sum P(2) = 1.3e-25
Identities = 117/467 (25%), Positives = 200/467 (42%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV ++ G + ++ L G++ D +I+ + AK+S LE+D S+H
Sbjct: 92 LRLVSQVKVFGTITEISALKGKGSNGC---DLLIMLTDYAKVSTLEWDMQSQSFVTNSLH 148
Query: 161 CFESPEWLHLKRGRESFARGPL-VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDT 219
+E +K + P + VDP C +L + M+ + L +E
Sbjct: 149 YYED-----VKSSNICSSHTPTQLLVDPDSDCC-LLRFLTDMMAIIPYPANEDLDMEEAA 202
Query: 220 F-GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSW 276
S S + S V+ LD + + D F++GY EP + IL+ E T +
Sbjct: 203 IENSKISSSYAYKPSFVLASSQLDASISRILDVKFLYGYREPTLAILYSPEQTSTVTLPL 262
Query: 277 KHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY-HSQS 335
+ T + S +++ + +I + +LP+D Y +++P+P+GG L++G N + Y S
Sbjct: 263 RKDTVLFSLVTLDLEQRASAVITTIQSLPYDIYASVSIPTPLGGSLLLGGNELIYVDSAG 322
Query: 336 ASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQND------VALLSTKTGDLVLL 389
+ + +N+Y +S F++EL+ A L + V L+ T +G L
Sbjct: 323 RTVGIGVNSYYSKCTDFPLQDQSDFNLELEGTIAIPLTSSKTETPFVVLVHT-SGQFFYL 381
Query: 390 TVVYDGRVVQRLDLS----KTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCGSG 442
+ DG+ V+ L L + N L S IT G +L FLGS+ DS L++++
Sbjct: 382 DFLLDGKSVKGLSLQALDLEINDDFLKSGITCAVPAGENLVFLGSQTTDSYLLRWS--RR 439
Query: 443 TSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS 502
T+ L E GD D L ++ + DM++ E +
Sbjct: 440 TTNEEVRLDE--GD---DT-----LYGTNDAEMDDMLDIYETDESVGSKRKIAYENGPLR 489
Query: 503 FAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG 549
+ D L NIGP+ DF+ G A + Q N+ +EL G G
Sbjct: 490 LEICDVLTNIGPITDFAVG-----KAGSYSYFPQDNHGPLELVGTAG 531
Score = 151 (58.2 bits), Expect = 4.1e-13, Sum P(2) = 4.1e-13
Identities = 101/432 (23%), Positives = 181/432 (41%)
Query: 285 ALSISTTLKQHPL-IWSAMNLPHD--------AYKLLAVPSPIGGVLVVGANTIHYHSQS 335
A ++ TT++ P I++++++P +L+ V S G + +G N+ Y+S+
Sbjct: 280 ASAVITTIQSLPYDIYASVSIPTPLGGSLLLGGNELIYVDSA-GRTVGIGVNS--YYSKC 336
Query: 336 ASCALA-LNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD 394
L +++ + L+ + +P +S E L D +L
Sbjct: 337 TDFPLQDQSDFNLELEGTIAIPLTSSKTETPFVVLVHTSGQFFYL-----DFLLDGKSVK 391
Query: 395 GRVVQRLDLSKTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFT---------CGSG 442
G +Q LDL + N L S IT G +L FLGS+ DS L++++ G
Sbjct: 392 GLSLQALDL-EINDDFLKSGITCAVPAGENLVFLGSQTTDSYLLRWSRRTTNEEVRLDEG 450
Query: 443 TSMLSSGLKEEFGDI----EAD-APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
L E D+ E D + +KR + L+ + + L+ G ++
Sbjct: 451 DDTLYGTNDAEMDDMLDIYETDESVGSKRKIAYENGPLRLEIC-DVLTNIGPITDFAVGK 509
Query: 498 QKTFSFAVRDSLVNIGPLKDFSYGLRINAD-ASATGISKQSNYELV----ELPGCKGIWT 552
++S+ +D N GPL+ G AD A + +++ + L+ + GC+ +WT
Sbjct: 510 AGSYSYFPQD---NHGPLE--LVGTA-GADGAGGLVVFRRNIFPLIAGEFQFDGCEALWT 563
Query: 553 VYHKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
V S + N S A Y + E YL++S E + + + EV S D+ +T
Sbjct: 564 V-SISGKLRNMKSRIQAQYSNPELETYLVLSKEKESFIFLAGETFDEVQHS-DFSKDSKT 621
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPXXXXXXXXXXXXTVLSVSIADPY 670
+ G+L R++Q+ R+ D + +TQ +F V+S SI DP
Sbjct: 622 LNVGSLLSGMRMVQICPTSLRVYDSNLRLTQLFNFSKKQI----------VVSTSICDPC 671
Query: 671 VLLGMSDGSIRL 682
+++ G I L
Sbjct: 672 IIVVFLGGGIAL 683
Score = 118 (46.6 bits), Expect = 1.3e-25, Sum P(2) = 1.3e-25
Identities = 46/201 (22%), Positives = 74/201 (36%)
Query: 801 TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
T N E T KE+ S ++VEL + P LF I Y+A+L+
Sbjct: 831 TLFNGMESERT-YFNKES--SQELVELLVADLGDDFKEPHLFLRSRLNEITVYKAFLYS- 886
Query: 861 PENTSKSDDPXXXXXXXXXXXXXXXXXXXLRFSRTPLDAYTREETPHGAPCQ--RITIFK 918
NT K + TP DA + E + ++T +
Sbjct: 887 --NTDKHKNLLAFAKVPQETMTREFQANV----GTPRDAESTMEKKASSSVDHLKMTALE 940
Query: 919 NISGHQGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
+ H F++G +P + + P + I++ H + G+IYV
Sbjct: 941 VVGNHSAVFVTGRKPFLILSTLHSNAKFFPISSNIPILSVAPFHAHHAPQGYIYVDENSF 1000
Query: 978 LKICQLPSGSTYDNYWPVQKV 998
++IC+ YDN WP +KV
Sbjct: 1001 IRICKFQEDFEYDNKWPYKKV 1021
Score = 38 (18.4 bits), Expect = 3.3e-16, Sum P(3) = 3.3e-16
Identities = 7/18 (38%), Positives = 12/18 (66%)
Query: 670 YVLLGMSDGSIRLLVGDP 687
Y ++ + G++RLL DP
Sbjct: 1268 YFVVADTSGNLRLLAYDP 1285
Score = 38 (18.4 bits), Expect = 3.3e-16, Sum P(3) = 3.3e-16
Identities = 8/18 (44%), Positives = 13/18 (72%)
Query: 527 DASATGISKQSNYELVEL 544
++ T +K+S+ ELVEL
Sbjct: 837 ESERTYFNKESSQELVEL 854
>CGD|CAL0004251 [details] [associations]
symbol:orf19.2760 species:5476 "Candida albicans" [GO:0042493
"response to drug" evidence=IMP] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IEA]
[GO:0005829 "cytosol" evidence=IEA] [GO:0005739 "mitochondrion"
evidence=IEA] [GO:0006369 "termination of RNA polymerase II
transcription" evidence=IEA] [GO:0006379 "mRNA cleavage"
evidence=IEA] [GO:0006378 "mRNA polyadenylation" evidence=IEA]
[GO:0003723 "RNA binding" evidence=IEA] InterPro:IPR004871
Pfam:PF03178 CGD:CAL0004251 GO:GO:0005634 GO:GO:0042493
GO:GO:0006397 GO:GO:0003723 EMBL:AACQ01000023 EMBL:AACQ01000025
RefSeq:XP_720278.1 RefSeq:XP_720279.1 RefSeq:XP_720280.1
RefSeq:XP_720510.1 STRING:Q5AFT3 GeneID:3637848 GeneID:3638158
GeneID:3638159 GeneID:3638160 KEGG:cal:CaO19.10274
KEGG:cal:CaO19.10275 KEGG:cal:CaO19.10276 KEGG:cal:CaO19.2760
eggNOG:COG5161 KO:K14401 Uniprot:Q5AFT3
Length = 1420
Score = 224 (83.9 bits), Expect = 8.7e-18, Sum P(3) = 8.7e-18
Identities = 77/312 (24%), Positives = 138/312 (44%)
Query: 231 ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
+SS +I+ LD + V D F+H Y EP + +L ++ WAG + L++
Sbjct: 217 DSSFIIDATTLDSSIDTVVDMQFLHNYREPTIAVLSSKQEVWAGNLIKSKDNIQFQVLTL 276
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA- 346
LK ++ NLP++ +++ +PSP+ G L+VG N IH + +A+N +
Sbjct: 277 DLNLKSTISVFKIDNLPYEIDRIIPLPSPLNGTLLVGCNELIHVDNGGVLKRIAVNKFTR 336
Query: 347 ---VSLDSSQELPRSSFSVELDAAHATWLQNDV-ALLSTKTGDLVLLTVVYDGRVVQRLD 402
S S Q+ +S +++L+ + +D LL +TG+ + DG+ ++R+
Sbjct: 337 LITASFKSFQD--QSDLNLKLENCSVVPIPDDHRVLLILQTGEFYFINFELDGKSIKRIH 394
Query: 403 LS----KTNPSVLTS---DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
+ KT + + ++ + ++ F+ + G+S L+Q +S S + +
Sbjct: 395 IDNVDKKTYDKIQLNHPGEVAILDKNMLFIANSNGNSPLIQVRYRD-SSKTSDTKESKLN 453
Query: 456 DIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS-----FAVRDSLV 510
IE K + D D +E LY E QKT S F D L+
Sbjct: 454 KIEE-----KEDNKDDDDNDDD----DEDDLY--KEEEEEETQKTISKSHIEFLYHDELI 502
Query: 511 NIGPLKDFSYGL 522
N GP F+ G+
Sbjct: 503 NNGPSSTFTLGI 514
Score = 76 (31.8 bits), Expect = 8.7e-18, Sum P(3) = 8.7e-18
Identities = 21/95 (22%), Positives = 45/95 (47%)
Query: 906 PHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVFRERL-RVHPQLCDGSIVAFTVLHNV 963
P+G +R + F N++G F++G P + + R+ Q + ++ + +
Sbjct: 875 PYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKTVHSIPRIF-QFSKIAAMSISAFSDS 933
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
+G I++ +Q +IC+LP Y+ P++ V
Sbjct: 934 KIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHV 968
Score = 61 (26.5 bits), Expect = 8.7e-18, Sum P(3) = 8.7e-18
Identities = 14/63 (22%), Positives = 35/63 (55%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L+L+ ++L G + L + +N D ++++ + AK S++++D ++ + S+H
Sbjct: 57 LKLIDQFKLQGTITDLKSIRT--IENPNL-DYLMVSTKYAKFSIIKWDHHLNTIATVSLH 113
Query: 161 CFE 163
+E
Sbjct: 114 YYE 116
>UNIPROTKB|Q5AFT3 [details] [associations]
symbol:CFT1 "Protein CFT1" species:237561 "Candida albicans
SC5314" [GO:0042493 "response to drug" evidence=IMP]
InterPro:IPR004871 Pfam:PF03178 CGD:CAL0004251 GO:GO:0005634
GO:GO:0042493 GO:GO:0006397 GO:GO:0003723 EMBL:AACQ01000023
EMBL:AACQ01000025 RefSeq:XP_720278.1 RefSeq:XP_720279.1
RefSeq:XP_720280.1 RefSeq:XP_720510.1 STRING:Q5AFT3 GeneID:3637848
GeneID:3638158 GeneID:3638159 GeneID:3638160 KEGG:cal:CaO19.10274
KEGG:cal:CaO19.10275 KEGG:cal:CaO19.10276 KEGG:cal:CaO19.2760
eggNOG:COG5161 KO:K14401 Uniprot:Q5AFT3
Length = 1420
Score = 224 (83.9 bits), Expect = 8.7e-18, Sum P(3) = 8.7e-18
Identities = 77/312 (24%), Positives = 138/312 (44%)
Query: 231 ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
+SS +I+ LD + V D F+H Y EP + +L ++ WAG + L++
Sbjct: 217 DSSFIIDATTLDSSIDTVVDMQFLHNYREPTIAVLSSKQEVWAGNLIKSKDNIQFQVLTL 276
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA- 346
LK ++ NLP++ +++ +PSP+ G L+VG N IH + +A+N +
Sbjct: 277 DLNLKSTISVFKIDNLPYEIDRIIPLPSPLNGTLLVGCNELIHVDNGGVLKRIAVNKFTR 336
Query: 347 ---VSLDSSQELPRSSFSVELDAAHATWLQNDV-ALLSTKTGDLVLLTVVYDGRVVQRLD 402
S S Q+ +S +++L+ + +D LL +TG+ + DG+ ++R+
Sbjct: 337 LITASFKSFQD--QSDLNLKLENCSVVPIPDDHRVLLILQTGEFYFINFELDGKSIKRIH 394
Query: 403 LS----KTNPSVLTS---DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
+ KT + + ++ + ++ F+ + G+S L+Q +S S + +
Sbjct: 395 IDNVDKKTYDKIQLNHPGEVAILDKNMLFIANSNGNSPLIQVRYRD-SSKTSDTKESKLN 453
Query: 456 DIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS-----FAVRDSLV 510
IE K + D D +E LY E QKT S F D L+
Sbjct: 454 KIEE-----KEDNKDDDDNDDD----DEDDLY--KEEEEEETQKTISKSHIEFLYHDELI 502
Query: 511 NIGPLKDFSYGL 522
N GP F+ G+
Sbjct: 503 NNGPSSTFTLGI 514
Score = 76 (31.8 bits), Expect = 8.7e-18, Sum P(3) = 8.7e-18
Identities = 21/95 (22%), Positives = 45/95 (47%)
Query: 906 PHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVFRERL-RVHPQLCDGSIVAFTVLHNV 963
P+G +R + F N++G F++G P + + R+ Q + ++ + +
Sbjct: 875 PYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKTVHSIPRIF-QFSKIAAMSISAFSDS 933
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
+G I++ +Q +IC+LP Y+ P++ V
Sbjct: 934 KIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHV 968
Score = 61 (26.5 bits), Expect = 8.7e-18, Sum P(3) = 8.7e-18
Identities = 14/63 (22%), Positives = 35/63 (55%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L+L+ ++L G + L + +N D ++++ + AK S++++D ++ + S+H
Sbjct: 57 LKLIDQFKLQGTITDLKSIRT--IENPNL-DYLMVSTKYAKFSIIKWDHHLNTIATVSLH 113
Query: 161 CFE 163
+E
Sbjct: 114 YYE 116
>UNIPROTKB|K7GNU1 [details] [associations]
symbol:CPSF1 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0005634 "nucleus" evidence=IEA] [GO:0003676 "nucleic
acid binding" evidence=IEA] InterPro:IPR004871 Pfam:PF03178
GeneTree:ENSGT00550000075040 EMBL:CU468594
Ensembl:ENSSSCT00000033207 Uniprot:K7GNU1
Length = 757
Score = 197 (74.4 bits), Expect = 3.0e-16, Sum P(2) = 3.0e-16
Identities = 39/87 (44%), Positives = 52/87 (59%)
Query: 913 RITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIY 971
R F++I G+ G F+ G P W +V R LR+HP DG I +F HNVNC GF+Y
Sbjct: 244 RFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLY 303
Query: 972 VTSQGILKICQLPSGSTYDNYWPVQKV 998
QG L+I LP+ +YD WPV+K+
Sbjct: 304 FNRQGELRISVLPAYLSYDAPWPVRKI 330
Score = 94 (38.1 bits), Expect = 3.0e-16, Sum P(2) = 3.0e-16
Identities = 27/98 (27%), Positives = 47/98 (47%)
Query: 759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
E+G +EI+ +P++ VF V F G+ +VD+ + E EE T QG
Sbjct: 104 ENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEAR----KEEATRQGELPL 159
Query: 819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
+ + +V L ++ RP+L + D +L Y+A+
Sbjct: 160 VKEVLLVALGSRQ-----RRPYLL-VHVDQELLIYEAF 191
>ASPGD|ASPL0000050546 [details] [associations]
symbol:AN1413 species:162425 "Emericella nidulans"
[GO:0008150 "biological_process" evidence=ND] [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=IEA] [GO:0005829 "cytosol" evidence=IEA] [GO:0003676
"nucleic acid binding" evidence=IEA] InterPro:IPR004871
Pfam:PF03178 GO:GO:0005634 EMBL:BN001307 GO:GO:0006397
GO:GO:0003723 eggNOG:COG5161 KO:K14401 EMBL:AACD01000022
RefSeq:XP_659017.1 EnsemblFungi:CADANIAT00008024 GeneID:2875502
KEGG:ani:AN1413.2 HOGENOM:HOG000048586 OMA:HNDRIFQ
OrthoDB:EOG451HZS Uniprot:Q5BDG7
Length = 1339
Score = 209 (78.6 bits), Expect = 1.1e-12, P = 1.1e-12
Identities = 117/503 (23%), Positives = 202/503 (40%)
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
D + H F++ Y EP IL+ + T + + + +++ + +
Sbjct: 224 DPSVIHPISLAFLYEYREPTFGILYSQVATSHALLHERKDVVFYTVITLDLEQRASTTLL 283
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRS 358
S LP D +K++A+P P+GG L++G+N +H + A+ +N ++ S +S
Sbjct: 284 SVTRLPSDLFKVVALPPPVGGSLLIGSNELVHIDQAGKTNAVGVNEFSRQASSFSMTDQS 343
Query: 359 SFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLD---LSKTNPSVLTS 413
++ L+ +D LL+ TG L++ DGR V + LS + L S
Sbjct: 344 DLALRLENCVVERFSDDNGDLLLALSTGVFALVSFKLDGRSVSGISVRPLSGPSKEFLAS 403
Query: 414 DITT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
++ +GN F GS DS+L+ ++ S + S + E DA L S
Sbjct: 404 TASSSAFLGNGKVFFGSESADSVLLGWSSASSATKKSFSGSTSNDESEDDAYEDD-LYSS 462
Query: 471 SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASA 530
+ A+ D N + SN++ +A + D L + GP++D G A +
Sbjct: 463 APAAMTD--NPQN-----QPSNSSVAAFG--DLRIHDRLSSPGPIRDIVLGRSSEASSRD 513
Query: 531 T--GI----SKQSNYELVELPGCKGIWTVYHKSSRGHN-ADS----SRMAAYDDEYHAYL 579
T G+ + Q + E + K Y +S + A+S S + +D+ Y+
Sbjct: 514 TKDGVLELVAAQGSDEGGTMVIMKREVDPYLVASMAADTANSLWTVSLLPDNNDQKRDYV 573
Query: 580 IISL-------EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
I+S E+ VLE D L +T T+ G L + RVIQV R
Sbjct: 574 ILSKQEKPDKEESEVFVLE--DKLRPITAPEFNPNHELTVEIGTLASKSRVIQVLRNEVR 631
Query: 633 ILDGSYMTQDLSFGPXXXXXXXXXXXXTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTV 692
D + D ++ ++ DPY+ + D ++ LL D S
Sbjct: 632 SYDAVWDEDD-------------SDERVAVNATLVDPYLAIIRDDSTLLLLQADDSGDLD 678
Query: 693 SVQTPAAIESSKKPVSSCTLYHD 715
V + S K +S+C Y D
Sbjct: 679 EVTLSEDVVSQKW-LSAC-FYSD 699
Score = 150 (57.9 bits), Expect = 5.8e-06, Sum P(2) = 5.8e-06
Identities = 135/596 (22%), Positives = 234/596 (39%)
Query: 44 SELPSKRGIG---PVPNLVVTAANVIEIYVVRVQXXXXXXXXXXXX-TKRRVLMDGISAA 99
+EL S G+ VP L TA N+I +Q T+ R
Sbjct: 5 TELISPTGVTHALAVPFLSATANNLIVARTSLLQIFSLRDVSLSALDTEVRPAQHRQETC 64
Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
L L Y+L G V + + D++++AF DAK+S++E+D +GL S+
Sbjct: 65 KLVLEREYQLPGTVTDICRVKI--LKTKSGGDAVLVAFRDAKLSLVEWDPERYGLSTISI 122
Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGDED 218
H +E + + G ++ DP RC + +G + + I+ Q G LV D+
Sbjct: 123 HYYERDDMTRSPWASDLSTCGSILSADPGSRCA-IFQFGARSLAIIPFHQPGDDLVMDD- 180
Query: 219 TFGSGGGFSARIES---SHVINLRDL---DMKHVKDFIFVHGYIEPVMVILHERELTWAG 272
FGS + R+E SH +D + F+ ++P ++H L +
Sbjct: 181 -FGSEPDYENRVEGNSRSHEAKDKDAAEYQTPYASSFVLPLTALDPS--VIHPISLAFLY 237
Query: 273 RVSWKHHTCMISALSISTTL---KQHPLIWSAMNLPHD---AYKLLAV---PSPIGGVLV 323
+ S ++ S L ++ + ++ + L + + LL+V PS + V+
Sbjct: 238 EYREPTFGILYSQVATSHALLHERKDVVFYTVITLDLEQRASTTLLSVTRLPSDLFKVVA 297
Query: 324 ----VGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA-L 378
VG + + ++ A AV ++ SSFS+ + A L+N V
Sbjct: 298 LPPPVGGSLLIGSNELVHIDQAGKTNAVGVNEFSR-QASSFSMTDQSDLALRLENCVVER 356
Query: 379 LSTKTGDLVL-LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
S GDL+L L+ V +LD ++ + ++ G S FL S S +
Sbjct: 357 FSDDNGDLLLALSTGVFALVSFKLD-GRSVSGISVRPLS--GPSKEFLASTASSSAFL-- 411
Query: 438 TCGSGTSMLSSGLKEE--FGDIEADAPSTKRLRRSSS-DALQDMVNGEELSLYGSA-SNN 493
G+G S + G A + + K S+S D +D + E LY SA +
Sbjct: 412 --GNGKVFFGSESADSVLLGWSSASSATKKSFSGSTSNDESED--DAYEDDLYSSAPAAM 467
Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTV 553
T++ Q S S+ G L+ R+++ I + E G+ +
Sbjct: 468 TDNPQNQPS---NSSVAAFGDLRIHD---RLSSPGPIRDIVLGRSSEASSRDTKDGVLEL 521
Query: 554 YHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM-VLETADLLTEVTESV-DYFV 607
+++G + + M E YL+ S+ A T L T LL + + DY +
Sbjct: 522 V--AAQGSD-EGGTMVIMKREVDPYLVASMAADTANSLWTVSLLPDNNDQKRDYVI 574
Score = 49 (22.3 bits), Expect = 5.8e-06, Sum P(2) = 5.8e-06
Identities = 17/43 (39%), Positives = 23/43 (53%)
Query: 588 MVLETADL-LTEVT-ESVDYFVQGRTIAAGNLFGRRRVIQVFE 628
MV++T L ++E T E D V G ++A G R I VFE
Sbjct: 989 MVMKTMSLEVSENTHERRDMIVVGTSLARGEDIPSRGCIYVFE 1031
>FB|FBgn0260962 [details] [associations]
symbol:pic "piccolo" species:7227 "Drosophila melanogaster"
[GO:0003684 "damaged DNA binding" evidence=ISS] [GO:0006289
"nucleotide-excision repair" evidence=ISS;NAS] [GO:0005634
"nucleus" evidence=IEA] [GO:0006974 "response to DNA damage
stimulus" evidence=IMP] [GO:0035220 "wing disc development"
evidence=IMP] [GO:0005515 "protein binding" evidence=IPI]
[GO:0042787 "protein ubiquitination involved in ubiquitin-dependent
protein catabolic process" evidence=ISS] [GO:0007307 "eggshell
chorion gene amplification" evidence=IDA] [GO:0007095 "mitotic G2
DNA damage checkpoint" evidence=IGI] InterPro:IPR004871
Pfam:PF03178 UniPathway:UPA00143 EMBL:AE014297 GO:GO:0005634
GO:GO:0005737 GO:GO:0007095 GO:GO:0043161 GO:GO:0003677
GO:GO:0006281 GO:GO:0035220 GO:GO:0042787 GO:GO:0007307
eggNOG:NOG247734 KO:K10610 OMA:CALGDGS GeneTree:ENSGT00530000063396
HSSP:Q16531 EMBL:AF132145 RefSeq:NP_650257.1 UniGene:Dm.3215
ProteinModelPortal:Q9XYZ5 SMR:Q9XYZ5 STRING:Q9XYZ5 PaxDb:Q9XYZ5
PRIDE:Q9XYZ5 EnsemblMetazoa:FBtr0082709 GeneID:41611
KEGG:dme:Dmel_CG7769 UCSC:CG7769-RA CTD:41611 FlyBase:FBgn0260962
InParanoid:Q9XYZ5 OrthoDB:EOG4S1RP0 PhylomeDB:Q9XYZ5
GenomeRNAi:41611 NextBio:824642 Bgee:Q9XYZ5 Uniprot:Q9XYZ5
Length = 1140
Score = 141 (54.7 bits), Expect = 1.1e-05, Sum P(4) = 1.1e-05
Identities = 59/205 (28%), Positives = 94/205 (45%)
Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
NLR +D +V D F+HG + P ++++H+ GR H I+ L +K
Sbjct: 156 NLR-MDELNVYDVEFLHGCLNPTVIVIHKDS---DGRHVKSHE---IN-LRDKEFMK--- 204
Query: 297 LIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELP 356
+ W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+ P
Sbjct: 205 IAWKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA-------P 250
Query: 357 RSSFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVL 411
+ F +A N + LL G L +L + G V+ + + + +
Sbjct: 251 LT-FRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEISI 309
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQ 436
IT + N ++G+R GDS LV+
Sbjct: 310 PECITYLDNGFLYIGARHGDSQLVR 334
Score = 64 (27.6 bits), Expect = 1.1e-05, Sum P(4) = 1.1e-05
Identities = 31/152 (20%), Positives = 60/152 (39%)
Query: 532 GISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLE 591
GI Q + ++LPG KG+W++ ++ + Y L+++ T +L
Sbjct: 391 GIGIQE-HACIDLPGIKGMWSL-------------KVGVDESPYENTLVLAFVGHTRILT 436
Query: 592 TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXXX 651
+ E TE + +T N+ ++IQV R++ + + P
Sbjct: 437 LSGEEVEETEIPGFASDLQTFLCSNV-DYDQLIQVTSDSVRLVSSATKALVAEWRPTGDR 495
Query: 652 XXXXXXXXT--VLSVSIADPYVLLGMSDGSIR 681
T +L S D + ++ + DGS+R
Sbjct: 496 TIGVVSCNTTQILVASACDIFYIV-IEDGSLR 526
Score = 41 (19.5 bits), Expect = 1.1e-05, Sum P(4) = 1.1e-05
Identities = 10/25 (40%), Positives = 15/25 (60%)
Query: 180 GPLVKVDPQGRCGGVLVY-GLQMII 203
G + +DP+ R G+ +Y GL II
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTII 143
Score = 38 (18.4 bits), Expect = 1.1e-05, Sum P(4) = 1.1e-05
Identities = 7/20 (35%), Positives = 12/20 (60%)
Query: 670 YVLLGMSDGSIRLLVGDPST 689
Y+L + DGS+ + D +T
Sbjct: 600 YLLCALGDGSMYYFIMDQTT 619
>TAIR|locus:2127368 [details] [associations]
symbol:DDB1B "damaged DNA binding protein 1B"
species:3702 "Arabidopsis thaliana" [GO:0003676 "nucleic acid
binding" evidence=IEA] [GO:0005634 "nucleus" evidence=ISM;IEA;IDA]
[GO:0003684 "damaged DNA binding" evidence=ISS] [GO:0009793 "embryo
development ending in seed dormancy" evidence=IMP] [GO:0005515
"protein binding" evidence=IPI] [GO:0005829 "cytosol" evidence=RCA]
[GO:0006281 "DNA repair" evidence=RCA] [GO:0007062 "sister
chromatid cohesion" evidence=RCA] [GO:0009880 "embryonic pattern
specification" evidence=RCA] [GO:0010072 "primary shoot apical
meristem specification" evidence=RCA] [GO:0010100 "negative
regulation of photomorphogenesis" evidence=RCA] [GO:0010162 "seed
dormancy process" evidence=RCA] [GO:0010431 "seed maturation"
evidence=RCA] [GO:0010564 "regulation of cell cycle process"
evidence=RCA] [GO:0045595 "regulation of cell differentiation"
evidence=RCA] [GO:0048366 "leaf development" evidence=RCA]
[GO:0048608 "reproductive structure development" evidence=RCA]
[GO:0048825 "cotyledon development" evidence=RCA] [GO:0051301 "cell
division" evidence=RCA] InterPro:IPR017986 InterPro:IPR004871
InterPro:IPR015943 Pfam:PF03178 UniPathway:UPA00143 GO:GO:0005634
EMBL:CP002687 GenomeReviews:CT486007_GR Gene3D:2.130.10.10
SUPFAM:SSF50978 EMBL:AL161554 GO:GO:0003677 GO:GO:0006281
GO:GO:0009793 GO:GO:0016567 GO:GO:0009585 EMBL:AL021960
UniGene:At.32663 eggNOG:NOG247734 HOGENOM:HOG000007241 KO:K10610
ProtClustDB:CLSN2685347 EMBL:AK220648 EMBL:AK229805 IPI:IPI00536598
PIR:T04941 RefSeq:NP_193842.1 ProteinModelPortal:O49552 SMR:O49552
DIP:DIP-46981N IntAct:O49552 STRING:O49552 PaxDb:O49552
PRIDE:O49552 EnsemblPlants:AT4G21100.1 GeneID:827857
KEGG:ath:AT4G21100 GeneFarm:4661 TAIR:At4g21100 InParanoid:O49552
OMA:DRPAVIY PhylomeDB:O49552 Genevestigator:O49552
GermOnline:AT4G21100 Uniprot:O49552
Length = 1088
Score = 100 (40.3 bits), Expect = 1.4e-05, Sum P(3) = 1.4e-05
Identities = 35/117 (29%), Positives = 56/117 (47%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F + + N+R L+ V D F++G +P + +L++ A V T +S
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCTKPTIAVLYQDNKD-ARHVK----TYEVSL 197
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALAL 342
+ P WS NL + A L+ VPSP+ GVL++G TI Y S +A A+ +
Sbjct: 198 KD--KNFVEGP--WSQNNLDNGADLLIPVPSPLCGVLIIGEETIVYCSANAFKAIPI 250
Score = 73 (30.8 bits), Expect = 1.4e-05, Sum P(3) = 1.4e-05
Identities = 17/59 (28%), Positives = 31/59 (52%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
LL G + LL + ++ V L + + + S I+ + N++ F+GS GDS L++
Sbjct: 269 LLGDHAGLIHLLVITHEKEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIK 327
Score = 68 (29.0 bits), Expect = 1.4e-05, Sum P(3) = 1.4e-05
Identities = 36/133 (27%), Positives = 64/133 (48%)
Query: 513 GPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYD 572
G KD S LRI + GI++Q++ VEL G KG+W++ KSS D
Sbjct: 372 GAYKDGS--LRIVRNG--IGINEQAS---VELQGIKGMWSL--KSS------------ID 410
Query: 573 DEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG 630
+ + +L++S E R + + D L E TE + + +T+ + +++QV
Sbjct: 411 EAFDTFLVVSFISETRILAMNIEDELEE-TEIEGFLSEVQTLFCHDAV-YNQLVQVTSNS 468
Query: 631 ARILDGSYMTQDL 643
R++ + T++L
Sbjct: 469 VRLVSST--TREL 479
>TAIR|locus:2115909 [details] [associations]
symbol:DDB1A "damaged DNA binding protein 1A"
species:3702 "Arabidopsis thaliana" [GO:0003676 "nucleic acid
binding" evidence=IEA] [GO:0003677 "DNA binding" evidence=ISS]
[GO:0005634 "nucleus" evidence=ISM;IEA;IDA;IPI] [GO:0010100
"negative regulation of photomorphogenesis" evidence=IGI;RCA]
[GO:0045892 "negative regulation of transcription, DNA-dependent"
evidence=IPI] [GO:0005515 "protein binding" evidence=IPI]
[GO:0080008 "Cul4-RING ubiquitin ligase complex" evidence=IPI]
[GO:0005829 "cytosol" evidence=IDA] [GO:0000278 "mitotic cell
cycle" evidence=RCA] [GO:0000911 "cytokinesis by cell plate
formation" evidence=RCA] [GO:0003002 "regionalization"
evidence=RCA] [GO:0006281 "DNA repair" evidence=RCA] [GO:0006486
"protein glycosylation" evidence=RCA] [GO:0007155 "cell adhesion"
evidence=RCA] [GO:0008284 "positive regulation of cell
proliferation" evidence=RCA] [GO:0009630 "gravitropism"
evidence=RCA] [GO:0009639 "response to red or far red light"
evidence=RCA] [GO:0010090 "trichome morphogenesis" evidence=RCA]
[GO:0033043 "regulation of organelle organization" evidence=RCA]
[GO:0045010 "actin nucleation" evidence=RCA] [GO:0048449 "floral
organ formation" evidence=RCA] [GO:0048608 "reproductive structure
development" evidence=RCA] InterPro:IPR017986 InterPro:IPR004871
InterPro:IPR015943 Pfam:PF03178 UniPathway:UPA00143 GO:GO:0005829
GO:GO:0005634 GO:GO:0045892 EMBL:CP002687 GenomeReviews:CT486007_GR
Gene3D:2.130.10.10 SUPFAM:SSF50978 GO:GO:0003677 GO:GO:0006281
GO:GO:0016567 GO:GO:0009585 EMBL:AL161503 GO:GO:0080008
GO:GO:0010100 EMBL:AY074257 EMBL:BT001905 EMBL:AK230366
IPI:IPI00548104 PIR:B85068 RefSeq:NP_192451.1 UniGene:At.32663
UniGene:At.47587 ProteinModelPortal:Q9M0V3 DIP:DIP-40455N
IntAct:Q9M0V3 STRING:Q9M0V3 PaxDb:Q9M0V3 PRIDE:Q9M0V3 ProMEX:Q9M0V3
EnsemblPlants:AT4G05420.1 GeneID:825890 KEGG:ath:AT4G05420
GeneFarm:4660 TAIR:At4g05420 eggNOG:NOG247734 HOGENOM:HOG000007241
InParanoid:Q9M0V3 KO:K10610 OMA:CALGDGS PhylomeDB:Q9M0V3
ProtClustDB:CLSN2685347 Genevestigator:Q9M0V3 GermOnline:AT4G05420
Uniprot:Q9M0V3
Length = 1088
Score = 91 (37.1 bits), Expect = 4.8e-05, Sum P(3) = 4.8e-05
Identities = 33/120 (27%), Positives = 55/120 (45%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F + + N+R L+ V D F+ G +P + +L++ +H +
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLFGCAKPTIAVLYQ------DNKDARH----VKT 192
Query: 286 LSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALAL 342
+S LK + WS +L + A L+ VP P+ GVL++G TI Y S SA A+ +
Sbjct: 193 YEVS--LKDKDFVEGPWSQNSLDNGADLLIPVPPPLCGVLIIGEETIVYCSASAFKAIPI 250
Score = 74 (31.1 bits), Expect = 4.8e-05, Sum P(3) = 4.8e-05
Identities = 18/59 (30%), Positives = 31/59 (52%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
LL G + LL + ++ V L + + + S I+ + N++ F+GS GDS LV+
Sbjct: 269 LLGDHAGMIHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVK 327
Score = 71 (30.1 bits), Expect = 4.8e-05, Sum P(3) = 4.8e-05
Identities = 36/133 (27%), Positives = 64/133 (48%)
Query: 513 GPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYD 572
G KD S LR+ + GI++Q++ VEL G KG+W++ KSS D
Sbjct: 372 GAFKDGS--LRVVRNG--IGINEQAS---VELQGIKGMWSL--KSS------------ID 410
Query: 573 DEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG 630
+ + +L++S E R + + D L E TE + Q +T+ + +++QV
Sbjct: 411 EAFDTFLVVSFISETRILAMNLEDELEE-TEIEGFLSQVQTLFCHDAV-YNQLVQVTSNS 468
Query: 631 ARILDGSYMTQDL 643
R++ + T++L
Sbjct: 469 VRLVSST--TREL 479
>SGD|S000002709 [details] [associations]
symbol:CFT1 "RNA-binding subunit of the mRNA cleavage and
polyadenylation factor" species:4932 "Saccharomyces cerevisiae"
[GO:0005634 "nucleus" evidence=IEA;IDA] [GO:0003723 "RNA binding"
evidence=IEA;IDA] [GO:0006397 "mRNA processing" evidence=IEA]
[GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005739
"mitochondrion" evidence=IDA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IDA;IPI]
[GO:0006369 "termination of RNA polymerase II transcription"
evidence=IMP] [GO:0006378 "mRNA polyadenylation" evidence=IDA;TAS]
[GO:0006379 "mRNA cleavage" evidence=IDA;TAS] [GO:0005849 "mRNA
cleavage factor complex" evidence=IPI] InterPro:IPR004871
Pfam:PF03178 SGD:S000002709 GO:GO:0005739 GO:GO:0006378
EMBL:BK006938 GO:GO:0003723 EMBL:U28374 eggNOG:COG5161 KO:K14401
OMA:HNDRIFQ GO:GO:0005847 GO:GO:0006379 PIR:S61187
RefSeq:NP_010587.1 ProteinModelPortal:Q06632 DIP:DIP-2467N
IntAct:Q06632 MINT:MINT-375530 STRING:Q06632 PaxDb:Q06632
PeptideAtlas:Q06632 EnsemblFungi:YDR301W GeneID:851895
KEGG:sce:YDR301W CYGD:YDR301w GeneTree:ENSGT00550000075040
HOGENOM:HOG000246682 OrthoDB:EOG4D29XZ NextBio:969889
Genevestigator:Q06632 GermOnline:YDR301W GO:GO:0006369
Uniprot:Q06632
Length = 1357
Score = 91 (37.1 bits), Expect = 4.9e-05, Sum P(3) = 4.9e-05
Identities = 35/157 (22%), Positives = 69/157 (43%)
Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGR--VSWKHHTCMISALSI----STTLKQHPL 297
K++ D F+ + +P + +L++ +L WAG +S +I L+I S T +
Sbjct: 211 KNIIDIQFLKNFTKPTIALLYQPKLVWAGNTTISKLPTQYVILTLNIQPAESATKIESTT 270
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYA-VSLDSSQE 354
I LP D + ++ V + G ++VG N + + + + LN++A L ++
Sbjct: 271 IAFVKELPWDLHTIVPVSN---GAIIVGTNELAFLDNTGVLQSTVLLNSFADKELQKTKI 327
Query: 355 LPRSSFSVELDAAHAT--WLQNDVALLSTKTGDLVLL 389
+ SS + + T W+ + + D LL
Sbjct: 328 INNSSLEIMFREKNTTSIWIPSSKSKNGGSNNDETLL 364
Score = 85 (35.0 bits), Expect = 4.9e-05, Sum P(3) = 4.9e-05
Identities = 70/331 (21%), Positives = 130/331 (39%)
Query: 374 NDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK---TNPSVLTSDITTIGNSLFFLGSRLG 430
++ LL ++ + + +GR++ + D+ K N + + L S
Sbjct: 360 DETLLLMDLKSNIYYIQMEAEGRLLIKFDIFKLPIVNDLLKENSNPKCITRLNATNSNKN 419
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPS--TKRLRRSSSDALQDM--VNGEELSL 486
L + F G+ + + LK EA PS T L + D ++M + +E
Sbjct: 420 MDLFIGFGSGNALVLRLNNLKSTIETREAHNPSSGTNSLMDINDDDDEEMDDLYADEAPE 479
Query: 487 YGSASNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINADASATGISK--QSNYEL 541
G +N+++ +T F + SL N+GP+ + G + D G+ ++ Y L
Sbjct: 480 NGLTTNDSKGTVETVQPFDIELLSSLRNVGPITSLTVGKVSSIDDVVKGLPNPNKNEYSL 539
Query: 542 VELPGC-KGIW-TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL--- 596
V G G TV S + + + + ++ + ++ R L T D
Sbjct: 540 VATSGNGSGSHLTVIQTSVQPEIELALKFISITQIWN----LKIKGRDRYLITTDSTKSR 595
Query: 597 TEVTESVDYFVQGRTIAAGNLFGRRRV----IQVFERGARILDGSYMTQDLS-FGPXXXX 651
+++ ES + F + G L RR I +F RI+ + T L +
Sbjct: 596 SDIYESDNNF---KLHKGGRL--RRDATTVYISMFGEEKRIIQVT--TNHLYLYDTHFRR 648
Query: 652 XXXXXXXXTVLSVSIADPYVLLGMSDGSIRL 682
V+ VS+ DPY+L+ +S G I++
Sbjct: 649 LTTIKFDYEVIHVSVMDPYILVTVSRGDIKI 679
Score = 63 (27.2 bits), Expect = 4.9e-05, Sum P(3) = 4.9e-05
Identities = 20/91 (21%), Positives = 41/91 (45%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L L ++ HG + + ++ Q + S ++L AKIS+L+F+ + + S+H
Sbjct: 48 LYLTDEFKFHGLITDIGLIPQKDSPLS----CLLLCTGVAKISILKFNTLTNSIDTLSLH 103
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRC 191
+E + A+ +++DP C
Sbjct: 104 YYEGK---FKGKSLVELAKISTLRMDPGSSC 131
>ZFIN|ZDB-GENE-040426-1272 [details] [associations]
symbol:ddb1 "damage specific DNA binding protein
1" species:7955 "Danio rerio" [GO:0005634 "nucleus" evidence=IEA]
[GO:0003676 "nucleic acid binding" evidence=IEA] InterPro:IPR004871
InterPro:IPR015943 Pfam:PF03178 ZFIN:ZDB-GENE-040426-1272
GO:GO:0005634 Gene3D:2.130.10.10 GO:GO:0003676 EMBL:JQ692623
UniGene:Dr.77970 Uniprot:I1XUS8
Length = 1140
Score = 116 (45.9 bits), Expect = 0.00014, Sum P(3) = 0.00014
Identities = 42/164 (25%), Positives = 74/164 (45%)
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A ++ VP P GG +++G +I YH+ A+A + S + +
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAVA----PPIIKQSTIVCHN 262
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQR-LDLSKTNPSVLTS 413
+D + +L D+ G L +L + + DG VV + L + + +
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGAVVLKDLHVELLGETSIAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
+T + N + F+GSRLGDS LV+ S G+ E F ++
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDSNDQGSYVGVMETFTNL 356
Score = 71 (30.1 bits), Expect = 0.00014, Sum P(3) = 0.00014
Identities = 26/93 (27%), Positives = 45/93 (48%)
Query: 542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
++LPG KG+W + +SSR D+ DD L++S +T VL + E TE
Sbjct: 402 IDLPGIKGLWPLRSESSR----DT------DD----MLVLSFVGQTRVLMLSGEEVEETE 447
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
+ +T GN+ +++IQ+ R++
Sbjct: 448 LQGFVDNQQTFFCGNV-AHQQLIQITSVSVRLV 479
Score = 44 (20.5 bits), Expect = 0.00014, Sum P(3) = 0.00014
Identities = 12/50 (24%), Positives = 24/50 (48%)
Query: 792 MREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFL 841
+ ++ S+ +S+S T G + +HS+ VV+ + H+ FL
Sbjct: 761 LSSSVSSSKLFPSSTSPHETSFGEEVEVHSLLVVD--QHTFEVLHAHQFL 808
Score = 40 (19.1 bits), Expect = 0.00035, Sum P(3) = 0.00035
Identities = 11/35 (31%), Positives = 19/35 (54%)
Query: 679 SIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY 713
S R+ + D S T +V+ A+ ++ VSS L+
Sbjct: 737 SSRVEMQDASGTTAAVRPSASTQALSSSVSSSKLF 771
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query                      ----- As Used -----    ----- Computed ----
Frame  MatID  Matrix name  Lambda  K       H      Lambda  K     H
+0     0      BLOSUM62     0.319   0.135   0.404  same    same  same
              Q=9,R=2      0.244   0.0300  0.180  n/a     n/a   n/a
Query
Frame  MatID  Length  Eff.Length  E        S    W  T   X   E2    S2
+0     0      1004    961         0.00095  122  3  11  22  0.39  34
                                                       38  0.45  37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 23
No. of states in DFA: 632 (67 KB)
Total size of DFA: 451 KB (2213 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 75.91u 0.09s 76.00t Elapsed: 00:00:05
Total cpu time: 75.92u 0.09s 76.01t Elapsed: 00:00:05
Start: Tue May 21 05:34:41 2013 End: Tue May 21 05:34:46 2013