Your job contains 1 sequence.
>psy92
MGNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFK
KLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELR
AHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTP
HFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPF
SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLF
DIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTG
IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA
GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM
YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGAL
GFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVW
KFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= psy92
(638 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
Smallest
Sum
High Probability
Sequences producing High-scoring Segment Pairs: Score P(N) N
ZFIN|ZDB-GENE-040709-2 - symbol:cpsf1 "cleavage and polya... 1220 6.4e-166 2
UNIPROTKB|F1RSN8 - symbol:CPSF1 "Uncharacterized protein"... 1198 3.5e-165 2
UNIPROTKB|K7GNU1 - symbol:CPSF1 "Uncharacterized protein"... 1198 3.5e-165 2
UNIPROTKB|Q10569 - symbol:CPSF1 "Cleavage and polyadenyla... 1194 2.5e-164 2
UNIPROTKB|F1PC28 - symbol:CPSF1 "Uncharacterized protein"... 1191 5.1e-164 2
UNIPROTKB|J9P418 - symbol:CPSF1 "Uncharacterized protein"... 1191 5.1e-164 2
UNIPROTKB|Q10570 - symbol:CPSF1 "Cleavage and polyadenyla... 1191 1.4e-163 2
MGI|MGI:2679722 - symbol:Cpsf1 "cleavage and polyadenylat... 1185 1.4e-163 2
FB|FBgn0024698 - symbol:Cpsf160 "Cleavage and polyadenyla... 1126 1.4e-159 2
RGD|1306406 - symbol:Cpsf1 "cleavage and polyadenylation ... 774 3.6e-118 2
WB|WBGene00022301 - symbol:cpsf-1 species:6239 "Caenorhab... 654 2.0e-89 2
UNIPROTKB|Q9N4C2 - symbol:cpsf-1 "Probable cleavage and p... 654 2.0e-89 2
TAIR|locus:2153122 - symbol:CPSF160 "cleavage and polyade... 455 3.6e-65 2
POMBASE|SPBC1709.08 - symbol:cft1 "cleavage factor one Cf... 450 1.6e-60 2
DICTYBASE|DDB_G0281585 - symbol:cpsf1 "cleavage and polya... 449 4.0e-53 2
ASPGD|ASPL0000050546 - symbol:AN1413 species:162425 "Emer... 370 3.6e-46 2
CGD|CAL0004251 - symbol:orf19.2760 species:5476 "Candida ... 326 6.8e-33 2
UNIPROTKB|Q5AFT3 - symbol:CFT1 "Protein CFT1" species:237... 326 6.8e-33 2
SGD|S000002709 - symbol:CFT1 "RNA-binding subunit of the ... 266 3.7e-31 3
TAIR|locus:2115909 - symbol:DDB1A "damaged DNA binding pr... 128 2.0e-11 2
TAIR|locus:2127368 - symbol:DDB1B "damaged DNA binding pr... 120 5.0e-11 3
GENEDB_PFALCIPARUM|PFL1680w - symbol:PFL1680w "splicing f... 106 4.4e-05 2
UNIPROTKB|Q8I574 - symbol:PFL1680w "Splicing factor 3b, s... 106 4.4e-05 2
UNIPROTKB|G4N4E2 - symbol:MGG_16867 "Uncharacterized prot... 120 0.00016 2
ZFIN|ZDB-GENE-040426-1272 - symbol:ddb1 "damage specific ... 127 0.00034 1
WB|WBGene00010890 - symbol:ddb-1 species:6239 "Caenorhabd... 104 0.00085 2
UNIPROTKB|Q21554 - symbol:ddb-1 "DNA damage-binding prote... 104 0.00085 2
DICTYBASE|DDB_G0282569 - symbol:sf3b3 "splicing factor 3B... 93 0.00090 2
>ZFIN|ZDB-GENE-040709-2 [details] [associations]
symbol:cpsf1 "cleavage and polyadenylation specific
factor 1" species:7955 "Danio rerio" [GO:0003676 "nucleic acid
binding" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
[GO:0006378 "mRNA polyadenylation" evidence=IMP] [GO:0060216
"definitive hemopoiesis" evidence=IMP] InterPro:IPR004871
Pfam:PF03178 ZFIN:ZDB-GENE-040709-2 GO:GO:0005634 GO:GO:0006378
GO:GO:0003676 GeneTree:ENSGT00550000075040 GO:GO:0060216
EMBL:CU467825 IPI:IPI00932321 Ensembl:ENSDART00000110017
ArrayExpress:F1QCJ8 Bgee:F1QCJ8 Uniprot:F1QCJ8
Length = 1451
Score = 1220 (434.5 bits), Expect = 6.4e-166, Sum P(2) = 6.4e-166
Identities = 230/429 (53%), Positives = 302/429 (70%)
Query: 13 DETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGAXXXXXXXXXXXXXSD--- 69
D +V+E+ VSLG + +RP LL + ELLIY+AF + + +
Sbjct: 850 DIPLVKEVALVSLGYNHSRPYLLAHVEQELLIYEAFPYDQQQAQSNLKVRFKKMPHNINY 909
Query: 70 RSK----RANEQP-GLPR---GV--RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGEL 119
R K R +++P G GV R+++ RYF +I+GY GVF+CGP P W+ +TSRG +
Sbjct: 910 REKKVKVRKDKKPEGQGEDTLGVKGRVARFRYFQDISGYSGVFICGPSPHWMLVTSRGAM 969
Query: 120 RAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCT 179
R HPMTIDG + + +PFHN+NCP+GFLYFN + ELRISVLPT+LSYDAPWPVRK+PL+CT
Sbjct: 970 RLHPMTIDGAIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCT 1029
Query: 180 PHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSP 239
H+++YH+E+K Y + TS EP T + GE+KE T RD R+I P +F + L SP
Sbjct: 1030 VHYVSYHVESKVYAVCTSVKEPCTRIPRMTGEEKEFETIERDERYIHPQQDKFSIQLISP 1089
Query: 240 FSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILL 299
SWE IP T L EWEHV C+K V+++ + T+SGL+GY+ALGT E+VTCRGRIL+
Sbjct: 1090 VSWEAIPNTRVDLEEWEHVTCMKTVALKSQETVSGLKGYVALGTCLMQGEEVTCRGRILI 1149
Query: 300 FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLT 359
D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH +GFLV+A+GQKI++W LKDNDLT
Sbjct: 1150 LDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCSGFLVSAIGQKIFLWSLKDNDLT 1209
Query: 360 GIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
G+AFIDT++YI M S+KN IL D +SI+LLRYQPE +TLSLV+RD KP + S +
Sbjct: 1210 GMAFIDTQLYIHQMYSIKNFILAADVMKSISLLRYQPESKTLSLVSRDAKPLEVYSIEFM 1269
Query: 420 AGNPSRGII 428
N G +
Sbjct: 1270 VDNNQLGFL 1278
Score = 416 (151.5 bits), Expect = 6.4e-166, Sum P(2) = 6.4e-166
Identities = 80/179 (44%), Positives = 122/179 (68%)
Query: 463 SSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK---PSSI 519
+ +GF++SD+DKN++++MY PEA+ES GG RL+++ DF++G HVN F+++ C+ ++
Sbjct: 1273 NQLGFLVSDRDKNLMVYMYLPEAKESFGGMRLLRRADFNVGSHVNAFWRMPCRGTLDTAN 1332
Query: 520 SDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY 579
A ++ +TW+A+LDG +G LP+ EK YRRLLMLQN + T H GLNP+AFR
Sbjct: 1333 KKALTWDNKHITWFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPKAFRML 1392
Query: 580 KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
N + I+DG L+ K+L LS ER E+ KKIG+ + ILD+L +IE +++HF
Sbjct: 1393 HCDRRTLQNAVKNILDGELLNKYLYLSTMERSELAKKIGTTPDIILDDLLEIERVTAHF 1451
Score = 86 (35.3 bits), Expect = 4.8e-131, Sum P(2) = 4.8e-131
Identities = 19/40 (47%), Positives = 27/40 (67%)
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
N + I+DG L+ K+L LS ER E+ KKIG+ + ILD+
Sbjct: 1401 NAVKNILDGELLNKYLYLSTMERSELAKKIGTTPDIILDD 1440
>UNIPROTKB|F1RSN8 [details] [associations]
symbol:CPSF1 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006378 "mRNA
polyadenylation" evidence=IEA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IEA]
[GO:0003730 "mRNA 3'-UTR binding" evidence=IEA] InterPro:IPR004871
Pfam:PF03178 GO:GO:0006378 GO:GO:0003730 GO:GO:0005847
GO:GO:0006379 GeneTree:ENSGT00550000075040 OMA:NIGDNRY
EMBL:CU468594 Ensembl:ENSSSCT00000006486 Uniprot:F1RSN8
Length = 1108
Score = 1198 (426.8 bits), Expect = 3.5e-165, Sum P(2) = 3.5e-165
Identities = 233/441 (52%), Positives = 291/441 (65%)
Query: 2 GNFRSHSPSAMDET-IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGAXXXXXX 60
G R + E +V+E+L V+LG RP LLV ELLIY+AF H
Sbjct: 495 GEARKEEATRQGELPLVKEVLLVALGSRQRRPYLLVHVDQELLIYEAFPHDSQLGQGNLK 554
Query: 61 XXXXXXXSD---R------SKRANEQPGLPRGV----RISQMRYFSNIAGYQGVFLCGPH 107
+ R SK+ E G GV R+++ RYF +I GY GVF+CGP
Sbjct: 555 VRFKKVPHNINFREKKPKPSKKKTEGGGSEEGVGARGRVARFRYFEDIYGYSGVFICGPS 614
Query: 108 PAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDA 167
P WL +T RG LR HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDA
Sbjct: 615 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 674
Query: 168 PWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPP 227
PWPVRK+PL+CT H++AYH+E+K Y + TST P T + GE+KE T RD R+I P
Sbjct: 675 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFETIDRDDRYIHP 734
Query: 228 LVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY 287
F + L SP SWE IP L EWEHV C+K VS+ E T+SGL+GY+A GT
Sbjct: 735 QQEAFSIQLISPVSWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQ 794
Query: 288 SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQK 347
E+VTCRGRIL+ D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQK
Sbjct: 795 GEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQK 854
Query: 348 IYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD 407
I++W L+ ++LTG+AFIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD
Sbjct: 855 IFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRD 914
Query: 408 YKPTQPNSKGYYAGNPSRGII 428
KP + S + N G +
Sbjct: 915 AKPLEVYSVDFMVDNAQLGFL 935
Score = 431 (156.8 bits), Expect = 3.5e-165, Sum P(2) = 3.5e-165
Identities = 82/187 (43%), Positives = 126/187 (67%)
Query: 457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
D + + + +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF++ C+
Sbjct: 924 DFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRG 983
Query: 517 SSISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
++ D P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T H GL
Sbjct: 984 AT--DGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGL 1041
Query: 572 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
NPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + ILD+L +
Sbjct: 1042 NPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLET 1101
Query: 632 EALSSHF 638
+ +++HF
Sbjct: 1102 DRVTAHF 1108
Score = 83 (34.3 bits), Expect = 2.1e-128, Sum P(2) = 2.1e-128
Identities = 18/40 (45%), Positives = 27/40 (67%)
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
N R ++DG L+ ++L LS ER E+ KKIG+ + ILD+
Sbjct: 1058 NAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDD 1097
>UNIPROTKB|K7GNU1 [details] [associations]
symbol:CPSF1 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0005634 "nucleus" evidence=IEA] [GO:0003676 "nucleic
acid binding" evidence=IEA] InterPro:IPR004871 Pfam:PF03178
GeneTree:ENSGT00550000075040 EMBL:CU468594
Ensembl:ENSSSCT00000033207 Uniprot:K7GNU1
Length = 757
Score = 1198 (426.8 bits), Expect = 3.5e-165, Sum P(2) = 3.5e-165
Identities = 233/441 (52%), Positives = 291/441 (65%)
Query: 2 GNFRSHSPSAMDET-IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGAXXXXXX 60
G R + E +V+E+L V+LG RP LLV ELLIY+AF H
Sbjct: 144 GEARKEEATRQGELPLVKEVLLVALGSRQRRPYLLVHVDQELLIYEAFPHDSQLGQGNLK 203
Query: 61 XXXXXXXSD---R------SKRANEQPGLPRGV----RISQMRYFSNIAGYQGVFLCGPH 107
+ R SK+ E G GV R+++ RYF +I GY GVF+CGP
Sbjct: 204 VRFKKVPHNINFREKKPKPSKKKTEGGGSEEGVGARGRVARFRYFEDIYGYSGVFICGPS 263
Query: 108 PAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDA 167
P WL +T RG LR HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDA
Sbjct: 264 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 323
Query: 168 PWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPP 227
PWPVRK+PL+CT H++AYH+E+K Y + TST P T + GE+KE T RD R+I P
Sbjct: 324 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFETIDRDDRYIHP 383
Query: 228 LVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY 287
F + L SP SWE IP L EWEHV C+K VS+ E T+SGL+GY+A GT
Sbjct: 384 QQEAFSIQLISPVSWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQ 443
Query: 288 SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQK 347
E+VTCRGRIL+ D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQK
Sbjct: 444 GEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQK 503
Query: 348 IYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD 407
I++W L+ ++LTG+AFIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD
Sbjct: 504 IFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRD 563
Query: 408 YKPTQPNSKGYYAGNPSRGII 428
KP + S + N G +
Sbjct: 564 AKPLEVYSVDFMVDNAQLGFL 584
Score = 431 (156.8 bits), Expect = 3.5e-165, Sum P(2) = 3.5e-165
Identities = 82/187 (43%), Positives = 126/187 (67%)
Query: 457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
D + + + +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF++ C+
Sbjct: 573 DFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRG 632
Query: 517 SSISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
++ D P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T H GL
Sbjct: 633 AT--DGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGL 690
Query: 572 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
NPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + ILD+L +
Sbjct: 691 NPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLET 750
Query: 632 EALSSHF 638
+ +++HF
Sbjct: 751 DRVTAHF 757
Score = 83 (34.3 bits), Expect = 2.1e-128, Sum P(2) = 2.1e-128
Identities = 18/40 (45%), Positives = 27/40 (67%)
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
N R ++DG L+ ++L LS ER E+ KKIG+ + ILD+
Sbjct: 707 NAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDD 746
>UNIPROTKB|Q10569 [details] [associations]
symbol:CPSF1 "Cleavage and polyadenylation specificity
factor subunit 1" species:9913 "Bos taurus" [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=ISS] [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006378
"mRNA polyadenylation" evidence=IEA] [GO:0003730 "mRNA 3'-UTR
binding" evidence=IEA] InterPro:IPR004871 Pfam:PF03178
GO:GO:0006378 GO:GO:0003730 eggNOG:COG5161 KO:K14401 GO:GO:0005847
GO:GO:0006379 GeneTree:ENSGT00550000075040 OMA:NIGDNRY EMBL:X83097
IPI:IPI00713487 PIR:S57335 RefSeq:NP_777145.1 UniGene:Bt.4911
STRING:Q10569 PRIDE:Q10569 Ensembl:ENSBTAT00000011004 GeneID:282703
KEGG:bta:282703 CTD:29894 HOGENOM:HOG000007904 HOVERGEN:HBG051105
InParanoid:Q10569 OrthoDB:EOG4BCDM3 NextBio:20806363
ArrayExpress:Q10569 Uniprot:Q10569
Length = 1444
Score = 1194 (425.4 bits), Expect = 2.5e-164, Sum P(2) = 2.5e-164
Identities = 230/442 (52%), Positives = 293/442 (66%)
Query: 2 GNFRSHSPSAMDET-IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGAXXXXXX 60
G R + E +V+E+L V+LG RP LLV ELLIY+AF H
Sbjct: 831 GEARKEEATRQGELPLVKEVLLVALGSRQRRPYLLVHVDQELLIYEAFPHDSQLGQGNLK 890
Query: 61 XXXXXXXSD---RSKR-----------ANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGP 106
+ R K+ + E+ PRG R+++ RYF +I GY GVF+CGP
Sbjct: 891 VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRG-RVARFRYFEDIYGYSGVFICGP 949
Query: 107 HPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYD 166
P WL +T RG LR HPM IDGP+ + APFHN+NCPRGFLYFN + ELRISVLP +LSYD
Sbjct: 950 SPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYD 1009
Query: 167 APWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIP 226
APWPVRK+PL+CT H++AYH+E+K Y + TST+ P T + GE+KE T RD R++
Sbjct: 1010 APWPVRKIPLRCTAHYVAYHVESKVYAVATSTSTPCTRVPRMTGEEKEFETIERDERYVH 1069
Query: 227 PLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYN 286
P F + L SP SWE IP L EWEHV C+K VS+ E T+SGL+GY+A GT
Sbjct: 1070 PQQEAFCIQLISPVSWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLM 1129
Query: 287 YSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ 346
E+VTCRGRIL+ D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQ
Sbjct: 1130 QGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQ 1189
Query: 347 KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVAR 406
KI++W L+ ++LTG+AFIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+R
Sbjct: 1190 KIFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSR 1249
Query: 407 DYKPTQPNSKGYYAGNPSRGII 428
D KP + S + N G +
Sbjct: 1250 DAKPLEVYSVDFMVDNAQLGFL 1271
Score = 427 (155.4 bits), Expect = 2.5e-164, Sum P(2) = 2.5e-164
Identities = 81/187 (43%), Positives = 126/187 (67%)
Query: 457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
D + + + +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF++ C+
Sbjct: 1260 DFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRG 1319
Query: 517 SSISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
++ + P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T H GL
Sbjct: 1320 AA--EGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGL 1377
Query: 572 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
NPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + ILD+L +
Sbjct: 1378 NPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLET 1437
Query: 632 EALSSHF 638
+ +++HF
Sbjct: 1438 DRVTAHF 1444
Score = 83 (34.3 bits), Expect = 5.5e-128, Sum P(2) = 5.5e-128
Identities = 18/40 (45%), Positives = 27/40 (67%)
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
N R ++DG L+ ++L LS ER E+ KKIG+ + ILD+
Sbjct: 1394 NAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDD 1433
>UNIPROTKB|F1PC28 [details] [associations]
symbol:CPSF1 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0006379 "mRNA cleavage" evidence=IEA]
[GO:0006378 "mRNA polyadenylation" evidence=IEA] [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=IEA] [GO:0003730 "mRNA 3'-UTR binding" evidence=IEA]
InterPro:IPR004871 Pfam:PF03178 GO:GO:0006378 GO:GO:0003730
GO:GO:0005847 GO:GO:0006379 GeneTree:ENSGT00550000075040
OMA:NIGDNRY EMBL:AAEX03008966 ProteinModelPortal:F1PC28
Ensembl:ENSCAFT00000002514 Uniprot:F1PC28
Length = 1398
Score = 1191 (424.3 bits), Expect = 5.1e-164, Sum P(2) = 5.1e-164
Identities = 229/426 (53%), Positives = 287/426 (67%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGAXXXXXXXXXXXXXSD---R-- 70
+V+E+L V+LG +RP LLV ELLIY+AF H + R
Sbjct: 800 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 859
Query: 71 ----SKRANEQPGLPRGV----RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
SK+ E G G R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 860 KPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 919
Query: 123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 920 PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 979
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
+AYH+E+K Y + TST P T + GE+KE T RD R+I P F + L SP SW
Sbjct: 980 VAYHVESKVYAVATSTNMPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSW 1039
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
E IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D+
Sbjct: 1040 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1099
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 1100 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1159
Query: 363 FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
FIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S + N
Sbjct: 1160 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1219
Query: 423 PSRGII 428
G +
Sbjct: 1220 AQLGFL 1225
Score = 427 (155.4 bits), Expect = 5.1e-164, Sum P(2) = 5.1e-164
Identities = 81/187 (43%), Positives = 126/187 (67%)
Query: 457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
D + + + +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF++ C+
Sbjct: 1214 DFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRG 1273
Query: 517 SSISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
++ + P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T H GL
Sbjct: 1274 AA--EGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGL 1331
Query: 572 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
NPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + ILD+L +
Sbjct: 1332 NPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLET 1391
Query: 632 EALSSHF 638
+ +++HF
Sbjct: 1392 DRVTAHF 1398
Score = 83 (34.3 bits), Expect = 1.1e-127, Sum P(2) = 1.1e-127
Identities = 18/40 (45%), Positives = 27/40 (67%)
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
N R ++DG L+ ++L LS ER E+ KKIG+ + ILD+
Sbjct: 1348 NAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDD 1387
>UNIPROTKB|J9P418 [details] [associations]
symbol:CPSF1 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0005634 "nucleus" evidence=IEA] [GO:0003676
"nucleic acid binding" evidence=IEA] InterPro:IPR004871
Pfam:PF03178 GO:GO:0005634 GO:GO:0003676
GeneTree:ENSGT00550000075040 EMBL:AAEX03008966
Ensembl:ENSCAFT00000043656 Uniprot:J9P418
Length = 1107
Score = 1191 (424.3 bits), Expect = 5.1e-164, Sum P(2) = 5.1e-164
Identities = 229/426 (53%), Positives = 287/426 (67%)
Query: 16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGAXXXXXXXXXXXXXSD---R-- 70
+V+E+L V+LG +RP LLV ELLIY+AF H + R
Sbjct: 509 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 568
Query: 71 ----SKRANEQPGLPRGV----RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
SK+ E G G R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct: 569 KPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 628
Query: 123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct: 629 PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 688
Query: 183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
+AYH+E+K Y + TST P T + GE+KE T RD R+I P F + L SP SW
Sbjct: 689 VAYHVESKVYAVATSTNMPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSW 748
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
E IP L EWEHV C+K VS+ E T+SGL+GY+A GT E+VTCRGRIL+ D+
Sbjct: 749 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 808
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQKI++W L+ ++LTG+A
Sbjct: 809 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 868
Query: 363 FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
FIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD KP + S + N
Sbjct: 869 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 928
Query: 423 PSRGII 428
G +
Sbjct: 929 AQLGFL 934
Score = 427 (155.4 bits), Expect = 5.1e-164, Sum P(2) = 5.1e-164
Identities = 81/187 (43%), Positives = 126/187 (67%)
Query: 457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
D + + + +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF++ C+
Sbjct: 923 DFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRG 982
Query: 517 SSISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
++ + P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T H GL
Sbjct: 983 AA--EGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGL 1040
Query: 572 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
NPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + ILD+L +
Sbjct: 1041 NPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLET 1100
Query: 632 EALSSHF 638
+ +++HF
Sbjct: 1101 DRVTAHF 1107
Score = 83 (34.3 bits), Expect = 1.1e-127, Sum P(2) = 1.1e-127
Identities = 18/40 (45%), Positives = 27/40 (67%)
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
N R ++DG L+ ++L LS ER E+ KKIG+ + ILD+
Sbjct: 1057 NAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDD 1096
>UNIPROTKB|Q10570 [details] [associations]
symbol:CPSF1 "Cleavage and polyadenylation specificity
factor subunit 1" species:9606 "Homo sapiens" [GO:0003730 "mRNA
3'-UTR binding" evidence=IDA] [GO:0006379 "mRNA cleavage"
evidence=IDA] [GO:0006378 "mRNA polyadenylation" evidence=IDA]
[GO:0005847 "mRNA cleavage and polyadenylation specificity factor
complex" evidence=IDA] [GO:0005515 "protein binding" evidence=IPI]
[GO:0000398 "mRNA splicing, via spliceosome" evidence=TAS]
[GO:0005654 "nucleoplasm" evidence=TAS] [GO:0006366 "transcription
from RNA polymerase II promoter" evidence=TAS] [GO:0006369
"termination of RNA polymerase II transcription" evidence=TAS]
[GO:0006397 "mRNA processing" evidence=TAS] [GO:0006406 "mRNA
export from nucleus" evidence=TAS] [GO:0008380 "RNA splicing"
evidence=TAS] [GO:0010467 "gene expression" evidence=TAS]
[GO:0031124 "mRNA 3'-end processing" evidence=TAS]
Reactome:REACT_71 InterPro:IPR004871 Pfam:PF03178
Reactome:REACT_1675 GO:GO:0006378 GO:GO:0003730 GO:GO:0006406
GO:GO:0000398 Reactome:REACT_1788 eggNOG:COG5161 KO:K14401
GO:GO:0005847 GO:GO:0006379 GO:GO:0006369 Reactome:REACT_78
OMA:NIGDNRY CTD:29894 HOGENOM:HOG000007904 HOVERGEN:HBG051105
OrthoDB:EOG4BCDM3 EMBL:U37012 EMBL:BC017232 IPI:IPI00026219
RefSeq:NP_037423.2 UniGene:Hs.493202 ProteinModelPortal:Q10570
DIP:DIP-32694N IntAct:Q10570 MINT:MINT-1601544 STRING:Q10570
PhosphoSite:Q10570 DMDM:23503048 PaxDb:Q10570 PeptideAtlas:Q10570
PRIDE:Q10570 DNASU:29894 Ensembl:ENST00000349769
Ensembl:ENST00000568627 GeneID:29894 KEGG:hsa:29894 UCSC:uc003zcj.3
GeneCards:GC08M145618 HGNC:HGNC:2324 MIM:606027 neXtProt:NX_Q10570
PharmGKB:PA26841 InParanoid:Q10570 PhylomeDB:Q10570 ChiTaRS:CPSF1
GenomeRNAi:29894 NextBio:52452 ArrayExpress:Q10570 Bgee:Q10570
CleanEx:HS_CPSF1 Genevestigator:Q10570 GermOnline:ENSG00000071894
Uniprot:Q10570
Length = 1443
Score = 1191 (424.3 bits), Expect = 1.4e-163, Sum P(2) = 1.4e-163
Identities = 232/441 (52%), Positives = 290/441 (65%)
Query: 2 GNFRSHSPSAMDET-IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGAXXXXXX 60
G R + E +V+E+L V+LG +RP LLV ELLIY+AF H
Sbjct: 830 GEARREEATRQGELPLVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLK 889
Query: 61 XXXXXXXSD---R------SKRANEQPGLPRGV----RISQMRYFSNIAGYQGVFLCGPH 107
+ R SK+ E G G R+++ RYF +I GY GVF+CGP
Sbjct: 890 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 949
Query: 108 PAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDA 167
P WL +T RG LR HPM IDGPV + APFHNVNCPRGFLYFN + ELRISVLP +LSYDA
Sbjct: 950 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1009
Query: 168 PWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPP 227
PWPVRK+PL+CT H++AYH+E+K Y + TST P + GE+KE T RD R+I P
Sbjct: 1010 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHP 1069
Query: 228 LVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY 287
F + L SP SWE IP L EWEHV C+K VS+ E T+SGL+GY+A GT
Sbjct: 1070 QQEAFSIQLISPVSWEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQ 1129
Query: 288 SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQK 347
E+VTCRGRIL+ D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQK
Sbjct: 1130 GEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQK 1189
Query: 348 IYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD 407
I++W L+ ++LTG+AFIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+RD
Sbjct: 1190 IFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRD 1249
Query: 408 YKPTQPNSKGYYAGNPSRGII 428
KP + S + N G +
Sbjct: 1250 AKPLEVYSVDFMVDNAQLGFL 1270
Score = 423 (154.0 bits), Expect = 1.4e-163, Sum P(2) = 1.4e-163
Identities = 80/185 (43%), Positives = 125/185 (67%)
Query: 457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
D + + + +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF++ C+
Sbjct: 1259 DFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRG 1318
Query: 517 SS--ISDAPGA-RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
++ +S ++ +TW+A+LDG +G LP+ EK YRRLLMLQN + T H GLNP
Sbjct: 1319 ATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNP 1378
Query: 574 RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
RAFR N R ++DG L+ ++L LS ER E+ KKIG+ + ILD+L + +
Sbjct: 1379 RAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDR 1438
Query: 634 LSSHF 638
+++HF
Sbjct: 1439 VTAHF 1443
Score = 85 (35.0 bits), Expect = 7.0e-128, Sum P(2) = 7.0e-128
Identities = 18/40 (45%), Positives = 27/40 (67%)
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
N R ++DG L+ ++L LS ER E+ KKIG+ + ILD+
Sbjct: 1393 NAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDD 1432
>MGI|MGI:2679722 [details] [associations]
symbol:Cpsf1 "cleavage and polyadenylation specific factor
1" species:10090 "Mus musculus" [GO:0003676 "nucleic acid binding"
evidence=IEA] [GO:0003723 "RNA binding" evidence=IEA] [GO:0003730
"mRNA 3'-UTR binding" evidence=ISO] [GO:0005634 "nucleus"
evidence=IEA] [GO:0005847 "mRNA cleavage and polyadenylation
specificity factor complex" evidence=ISO] [GO:0006378 "mRNA
polyadenylation" evidence=ISO] [GO:0006379 "mRNA cleavage"
evidence=ISO] [GO:0006397 "mRNA processing" evidence=IEA]
InterPro:IPR004871 Pfam:PF03178 MGI:MGI:2679722 GO:GO:0006378
GO:GO:0003730 eggNOG:COG5161 KO:K14401 GO:GO:0005847 GO:GO:0006379
GeneTree:ENSGT00550000075040 OMA:NIGDNRY CTD:29894
HOGENOM:HOG000007904 HOVERGEN:HBG051105 OrthoDB:EOG4BCDM3
EMBL:AF322193 EMBL:BC056388 IPI:IPI00110363 RefSeq:NP_001157645.1
RefSeq:NP_444423.1 UniGene:Mm.45141 ProteinModelPortal:Q9EPU4
STRING:Q9EPU4 PhosphoSite:Q9EPU4 PaxDb:Q9EPU4 PRIDE:Q9EPU4
Ensembl:ENSMUST00000071898 GeneID:94230 KEGG:mmu:94230
UCSC:uc007wky.2 InParanoid:Q9EPU4 NextBio:352239 Bgee:Q9EPU4
CleanEx:MM_CPSF1 Genevestigator:Q9EPU4
GermOnline:ENSMUSG00000034022 Uniprot:Q9EPU4
Length = 1441
Score = 1185 (422.2 bits), Expect = 1.4e-163, Sum P(2) = 1.4e-163
Identities = 230/442 (52%), Positives = 291/442 (65%)
Query: 2 GNFRSHSPSAMDET-IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGAXXXXXX 60
G R + E +V+E+L V+LG +RP LLV ELLIY+AF H
Sbjct: 828 GEVRKEEATRQGELPLVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLK 887
Query: 61 XXXXXXXSD---RSKR-----------ANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGP 106
+ R K+ + E+ RG R+++ RYF +I GY GVF+CGP
Sbjct: 888 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRG-RVARFRYFEDIYGYSGVFICGP 946
Query: 107 HPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYD 166
P WL +T RG LR HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYD
Sbjct: 947 SPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYD 1006
Query: 167 APWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIP 226
APWPVRK+PL+CT H++AYH+E+K Y + TST P T + GE+KE RD R+I
Sbjct: 1007 APWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIH 1066
Query: 227 PLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYN 286
P F + L SP SWE IP L EWEHV C+K VS+ E T+SGL+GY+A GT
Sbjct: 1067 PQQEAFSIQLISPVSWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLM 1126
Query: 287 YSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ 346
E+VTCRGRIL+ D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH G LV+A+GQ
Sbjct: 1127 QGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQ 1186
Query: 347 KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVAR 406
KI++W L+ ++LTG+AFIDT++YI M+SVKN IL D +SI+LLRYQ E +TLSLV+R
Sbjct: 1187 KIFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSR 1246
Query: 407 DYKPTQPNSKGYYAGNPSRGII 428
D KP + S + N G +
Sbjct: 1247 DAKPLEVYSVDFMVDNAQLGFL 1268
Score = 429 (156.1 bits), Expect = 1.4e-163, Sum P(2) = 1.4e-163
Identities = 81/187 (43%), Positives = 126/187 (67%)
Query: 457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
D + + + +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF++ C+
Sbjct: 1257 DFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRG 1316
Query: 517 SSISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
++ + P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T H GL
Sbjct: 1317 AA--EGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGL 1374
Query: 572 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
NPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + ILD+L +
Sbjct: 1375 NPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDLLET 1434
Query: 632 EALSSHF 638
+ +++HF
Sbjct: 1435 DRVTAHF 1441
Score = 85 (35.0 bits), Expect = 3.0e-127, Sum P(2) = 3.0e-127
Identities = 18/40 (45%), Positives = 27/40 (67%)
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
N R ++DG L+ ++L LS ER E+ KKIG+ + ILD+
Sbjct: 1391 NAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDD 1430
>FB|FBgn0024698 [details] [associations]
symbol:Cpsf160 "Cleavage and polyadenylation specificity
factor 160" species:7227 "Drosophila melanogaster" [GO:0005847
"mRNA cleavage and polyadenylation specificity factor complex"
evidence=ISS;NAS] [GO:0006378 "mRNA polyadenylation"
evidence=ISS;NAS] [GO:0006379 "mRNA cleavage" evidence=ISS;NAS]
[GO:0003730 "mRNA 3'-UTR binding" evidence=ISS] [GO:0003729 "mRNA
binding" evidence=NAS] [GO:0005515 "protein binding" evidence=IPI]
[GO:0022008 "neurogenesis" evidence=IMP] InterPro:IPR004871
Pfam:PF03178 EMBL:AE013599 GO:GO:0022008 GO:GO:0006378
GO:GO:0003723 eggNOG:COG5161 KO:K14401 GO:GO:0005847 GO:GO:0006379
GeneTree:ENSGT00550000075040 OMA:NIGDNRY EMBL:AF241364
EMBL:AF241365 EMBL:AF241366 EMBL:AY051896 RefSeq:NP_725397.1
RefSeq:NP_995833.1 UniGene:Dm.3414 ProteinModelPortal:Q9V726
STRING:Q9V726 PaxDb:Q9V726 PRIDE:Q9V726 EnsemblMetazoa:FBtr0089258
GeneID:44250 KEGG:dme:Dmel_CG10110 CTD:44250 FlyBase:FBgn0024698
InParanoid:Q9V726 OrthoDB:EOG4ZCRK8 PhylomeDB:Q9V726
GenomeRNAi:44250 NextBio:837008 Bgee:Q9V726 GermOnline:CG10110
Uniprot:Q9V726
Length = 1455
Score = 1126 (401.4 bits), Expect = 1.4e-159, Sum P(2) = 1.4e-159
Identities = 217/430 (50%), Positives = 287/430 (66%)
Query: 9 PSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGAXXXXXXXXXXXXXS 68
P + + EL + LGL+G RPLLLVRT+ ELLIYQ FR+PKG
Sbjct: 854 PQHANSPLPLELSVIGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRKMDQLNLL 913
Query: 69 DRSK------RANEQPGLP----RGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGE 118
D+ +EQ + + + ++R F+N+ G GV +CG +P ++FLT RGE
Sbjct: 914 DQQPTHIDLDENDEQEEIESYQMQPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGE 973
Query: 119 LRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKC 178
LR H + +G V + A F+NVN P GFLYF+ EL+ISVLP++LSYD+ WPVRKVPL+C
Sbjct: 974 LRIHRLLGNGDVRSFAAFNNVNIPNGFLYFDTTYELKISVLPSYLSYDSVWPVRKVPLRC 1033
Query: 179 TPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFS 238
TP L YH E + YC++T T EP T YY+FNGEDKEL + R RFI P+ SQF + L S
Sbjct: 1034 TPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIGSQFEMVLIS 1093
Query: 239 PFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRIL 298
P +WE +P + WEHV K V + YEGT SGL+ Y+ +GTN+NYSED+T RG I
Sbjct: 1094 PETWEIVPDASITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIH 1153
Query: 299 LFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDL 358
++DIIEVVPEPG+P+TK KIK I+ KEQKGPV+AI V GFLVT +GQKIYIWQL+D DL
Sbjct: 1154 IYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDL 1213
Query: 359 TGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY 418
G+AFIDT +Y+ +++VK+LI + D +SI+LLR+Q EYRTLSL +RD+ P + +
Sbjct: 1214 IGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEF 1273
Query: 419 YAGNPSRGII 428
N + G +
Sbjct: 1274 MVDNSNLGFL 1283
Score = 450 (163.5 bits), Expect = 1.4e-159, Sum P(2) = 1.4e-159
Identities = 84/178 (47%), Positives = 124/178 (69%)
Query: 463 SSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDA 522
S++GF+++D ++N++++MYQPEARES GG +L++K D+HLGQ VNT F+++C +
Sbjct: 1278 SNLGFLVTDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQR 1337
Query: 523 -PGA-RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK 580
P ++ Y +LDGALG+ LPLPEK YRR LMLQNV++++ H GLNP+ +RT K
Sbjct: 1338 QPFLYENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLK 1397
Query: 581 GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
NPSR IIDG L+W + ++ ER E+ KKIG++ +IL +L +IE L+S F
Sbjct: 1398 SSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDLLEIERLASVF 1455
Score = 110 (43.8 bits), Expect = 1.2e-123, Sum P(2) = 1.2e-123
Identities = 27/71 (38%), Positives = 42/71 (59%)
Query: 391 LLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 450
LL YQ E+ L ++Y+ + + K NPSR IIDG L+W + ++ ER E+ KK
Sbjct: 1378 LLSYQ-EH-LCGLNPKEYRTLKSSKK--QGINPSRCIIDGDLIWSYRLMANSERNEVAKK 1433
Query: 451 IGSKHNDILDE 461
IG++ +IL +
Sbjct: 1434 IGTRTEEILGD 1444
>RGD|1306406 [details] [associations]
symbol:Cpsf1 "cleavage and polyadenylation specific factor 1,
160kDa" species:10116 "Rattus norvegicus" [GO:0003730 "mRNA 3'-UTR
binding" evidence=IEA;ISO] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IEA;ISO]
[GO:0006378 "mRNA polyadenylation" evidence=IEA;ISO] [GO:0006379
"mRNA cleavage" evidence=IEA;ISO] InterPro:IPR004871 Pfam:PF03178
RGD:1306406 GO:GO:0005634 GO:GO:0003676 EMBL:CH473950 KO:K14401
GeneTree:ENSGT00550000075040 CTD:29894 IPI:IPI00949657
RefSeq:NP_001124043.1 UniGene:Rn.40455 Ensembl:ENSRNOT00000066244
GeneID:366952 KEGG:rno:366952 UCSC:RGD:1306406 NextBio:690318
Uniprot:D4A0H5
Length = 1386
Score = 774 (277.5 bits), Expect = 3.6e-118, Sum P(2) = 3.6e-118
Identities = 173/413 (41%), Positives = 226/413 (54%)
Query: 2 GNFRSHSPSAMDET-IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGAXXXXXX 60
G R + E +V+E+L V+LG +RP LLV ELLIY+AF H
Sbjct: 824 GEVRKEEATRQGELPLVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLK 883
Query: 61 XXXXXXXSD---RSKR-----------ANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGP 106
+ R K+ + E+ RG R+++ RYF +I GY GVF+CGP
Sbjct: 884 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRG-RVARFRYFEDIYGYSGVFICGP 942
Query: 107 HPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYD 166
P WL +T RG LR HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYD
Sbjct: 943 SPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYD 1002
Query: 167 APWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIP 226
APWPVRK+PL+CT H++AYH+E+K Y + TST P T + GE+KE RD R+I
Sbjct: 1003 APWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIH 1062
Query: 227 PLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYN 286
P F + L SP SWE IP L EWEHV C+K VS+ E T+SGL+GY+A GT
Sbjct: 1063 PQQEAFSIQLISPVSWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLM 1122
Query: 287 YSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ 346
E+VTCRGRI L+ + G ++ Y + I +A ++ ++
Sbjct: 1123 QGEEVTCRGRIFLWSL-RASELTGMAFIDTQL---YIHQMISVKNFI--LAADVMKSISL 1176
Query: 347 KIYIWQLKDNDLTGIAFIDTEVY-IASMVSVKNL-ILVGDYARSIALLRYQPE 397
Y + K L EVY + MV L LV D R++ + Y PE
Sbjct: 1177 LRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPE 1229
Score = 429 (156.1 bits), Expect = 3.6e-118, Sum P(2) = 3.6e-118
Identities = 81/187 (43%), Positives = 126/187 (67%)
Query: 457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
D + + + +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF++ C+
Sbjct: 1202 DFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRG 1261
Query: 517 SSISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
++ + P +S + +TW+A+LDG +G LP+ EK YRRLLMLQN + T H GL
Sbjct: 1262 AA--EGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGL 1319
Query: 572 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
NPRAFR N R ++DG L+ ++L LS ER E+ KKIG+ + ILD+L +
Sbjct: 1320 NPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDLLET 1379
Query: 632 EALSSHF 638
+ +++HF
Sbjct: 1380 DRVTAHF 1386
Score = 229 (85.7 bits), Expect = 1.1e-56, Sum P(2) = 1.1e-56
Identities = 50/103 (48%), Positives = 68/103 (66%)
Query: 327 KGPVTA-ICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDY 385
KG V A C + G VT G +I++W L+ ++LTG+AFIDT++YI M+SVKN IL D
Sbjct: 1112 KGYVAAGTCLMQGEEVTCRG-RIFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADV 1170
Query: 386 ARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGII 428
+SI+LLRYQ E +TLSLV+RD KP + S + N G +
Sbjct: 1171 MKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFL 1213
Score = 85 (35.0 bits), Expect = 7.3e-82, Sum P(2) = 7.3e-82
Identities = 18/40 (45%), Positives = 27/40 (67%)
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
N R ++DG L+ ++L LS ER E+ KKIG+ + ILD+
Sbjct: 1336 NAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDD 1375
>WB|WBGene00022301 [details] [associations]
symbol:cpsf-1 species:6239 "Caenorhabditis elegans"
[GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005634
"nucleus" evidence=IEA] [GO:0009792 "embryo development ending in
birth or egg hatching" evidence=IMP] [GO:0000003 "reproduction"
evidence=IMP] [GO:0040007 "growth" evidence=IMP] [GO:0002119
"nematode larval development" evidence=IMP] [GO:0040018 "positive
regulation of multicellular organism growth" evidence=IMP]
[GO:0010171 "body morphogenesis" evidence=IMP] [GO:0040027
"negative regulation of vulval development" evidence=IMP]
InterPro:IPR004871 Pfam:PF03178 GO:GO:0009792 GO:GO:0040007
GO:GO:0002119 GO:GO:0006378 GO:GO:0010171 GO:GO:0040018
GO:GO:0000003 GO:GO:0003723 GO:GO:0040027 eggNOG:COG5161 KO:K14401
GO:GO:0005847 GO:GO:0006379 GeneTree:ENSGT00550000075040
OMA:NIGDNRY HOGENOM:HOG000007904 EMBL:FO081666 RefSeq:NP_500157.2
ProteinModelPortal:Q9N4C2 MINT:MINT-3384281 STRING:Q9N4C2
PaxDb:Q9N4C2 EnsemblMetazoa:Y76B12C.7.1 EnsemblMetazoa:Y76B12C.7.2
GeneID:177003 KEGG:cel:CELE_Y76B12C.7 CTD:177003 WormBase:Y76B12C.7
InParanoid:Q9N4C2 NextBio:894932 Uniprot:Q9N4C2
Length = 1454
Score = 654 (235.3 bits), Expect = 2.0e-89, Sum P(2) = 2.0e-89
Identities = 137/410 (33%), Positives = 224/410 (54%)
Query: 17 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFR--HP----KGAXXXXXXXXXXXXXSDR 70
V E V +G++ P+L+ ++++Y+ F +P G S
Sbjct: 853 VLEAQIVGMGINQAHPILMAIVDEQVVLYEMFSSSNPIPGHLGISFRKLPHFICLRTSSH 912
Query: 71 ----SKRANEQPGLPRGVRISQMRYFSNIAGYQ-GVFLCGPHPAWLFLTSRGELRAHPMT 125
KRA + + G R S + F ++ GV + G P L + G ++ H MT
Sbjct: 913 LNSDGKRAPFEMKINNGKRFSLIHPFERVSSVNNGVMIVGAVPTLLVYGAWGGMQTHQMT 972
Query: 126 IDGPVSTLAPFHNVNCPRGFLYFNA-KSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLA 184
+DGP+ PF+N N G +Y KSELRI+ + Y+ P+PV+K+ + T H +
Sbjct: 973 VDGPIKAFTPFNNENVLHGIVYMTQHKSELRIARMHPDFDYEMPYPVKKIEVGRTIHHVR 1032
Query: 185 YHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEE 244
Y + + Y +V+S +PS + +DK+ +D F+ P ++ ++LFS W
Sbjct: 1033 YLMNSDVYAVVSSIPKPSNKIWVVMNDDKQEEIHEKDENFVLPAPPKYTLNLFSSQDWAA 1092
Query: 245 IPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIE 304
+P T + E V ++V+++ E T+SGL +A+GT NY E+V RGRI+L ++IE
Sbjct: 1093 VPNTEISFEDMEAVTACEDVALKSESTISGLETLLAMGTVNNYGEEVLVRGRIILCEVIE 1152
Query: 305 VVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFI 364
VVPEP QP + KIK+++ KEQKGPVT +C + G L+ +GQK++IWQ KDNDL GI+F+
Sbjct: 1153 VVPEPDQPTSNRKIKVLFDKEQKGPVTGLCAINGLLLCGMGQKVFIWQFKDNDLMGISFL 1212
Query: 365 DTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD-YKPTQP 413
D Y+ + S++ + + D S++L+R+Q + + +S+ +RD K QP
Sbjct: 1213 DMHYYVYQLHSLRTIAIACDARESMSLIRFQEDNKAMSIASRDDRKCAQP 1262
Score = 281 (104.0 bits), Expect = 2.0e-89, Sum P(2) = 2.0e-89
Identities = 60/180 (33%), Positives = 103/180 (57%)
Query: 465 MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS---- 520
+GF++SD+ N+ +F Y PEA ESNGG RL + ++G ++N F ++R S +
Sbjct: 1275 VGFLLSDETGNITMFNYAPEAPESNGGERLTVRAAINIGTNINAFVRLRGHTSLLQLNNE 1334
Query: 521 -DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY 579
+ R T +ASLDG+ GF PL EK+YRRL LQ + + T GL+ + R+
Sbjct: 1335 DEKEAIEQRMTTVFASLDGSFGFVRPLTEKSYRRLHFLQTFIGSVTPQIAGLHIKGSRSA 1394
Query: 580 K-GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
K + G +R +IDG +V ++L LSL ++ ++ +++G I+D+L + ++ ++
Sbjct: 1395 KPSQPIVNGRNARNLIDGDVVEQYLHLSLYDKTDLARRLGVGRYHIIDDLMQLRRMAFYY 1454
Score = 84 (34.6 bits), Expect = 1.2e-68, Sum P(2) = 1.2e-68
Identities = 19/61 (31%), Positives = 35/61 (57%)
Query: 405 ARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSS 464
+R KP+QP G A R +IDG +V ++L LSL ++ ++ +++G I+D+
Sbjct: 1391 SRSAKPSQPIVNGRNA----RNLIDGDVVEQYLHLSLYDKTDLARRLGVGRYHIIDDLMQ 1446
Query: 465 M 465
+
Sbjct: 1447 L 1447
>UNIPROTKB|Q9N4C2 [details] [associations]
symbol:cpsf-1 "Probable cleavage and polyadenylation
specificity factor subunit 1" species:6239 "Caenorhabditis elegans"
[GO:0006378 "mRNA polyadenylation" evidence=NAS] [GO:0006379 "mRNA
cleavage" evidence=NAS] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=NAS]
InterPro:IPR004871 Pfam:PF03178 GO:GO:0009792 GO:GO:0040007
GO:GO:0002119 GO:GO:0006378 GO:GO:0010171 GO:GO:0040018
GO:GO:0000003 GO:GO:0003723 GO:GO:0040027 eggNOG:COG5161 KO:K14401
GO:GO:0005847 GO:GO:0006379 GeneTree:ENSGT00550000075040
OMA:NIGDNRY HOGENOM:HOG000007904 EMBL:FO081666 RefSeq:NP_500157.2
ProteinModelPortal:Q9N4C2 MINT:MINT-3384281 STRING:Q9N4C2
PaxDb:Q9N4C2 EnsemblMetazoa:Y76B12C.7.1 EnsemblMetazoa:Y76B12C.7.2
GeneID:177003 KEGG:cel:CELE_Y76B12C.7 CTD:177003 WormBase:Y76B12C.7
InParanoid:Q9N4C2 NextBio:894932 Uniprot:Q9N4C2
Length = 1454
Score = 654 (235.3 bits), Expect = 2.0e-89, Sum P(2) = 2.0e-89
Identities = 137/410 (33%), Positives = 224/410 (54%)
Query: 17 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFR--HP----KGAXXXXXXXXXXXXXSDR 70
V E V +G++ P+L+ ++++Y+ F +P G S
Sbjct: 853 VLEAQIVGMGINQAHPILMAIVDEQVVLYEMFSSSNPIPGHLGISFRKLPHFICLRTSSH 912
Query: 71 ----SKRANEQPGLPRGVRISQMRYFSNIAGYQ-GVFLCGPHPAWLFLTSRGELRAHPMT 125
KRA + + G R S + F ++ GV + G P L + G ++ H MT
Sbjct: 913 LNSDGKRAPFEMKINNGKRFSLIHPFERVSSVNNGVMIVGAVPTLLVYGAWGGMQTHQMT 972
Query: 126 IDGPVSTLAPFHNVNCPRGFLYFNA-KSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLA 184
+DGP+ PF+N N G +Y KSELRI+ + Y+ P+PV+K+ + T H +
Sbjct: 973 VDGPIKAFTPFNNENVLHGIVYMTQHKSELRIARMHPDFDYEMPYPVKKIEVGRTIHHVR 1032
Query: 185 YHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEE 244
Y + + Y +V+S +PS + +DK+ +D F+ P ++ ++LFS W
Sbjct: 1033 YLMNSDVYAVVSSIPKPSNKIWVVMNDDKQEEIHEKDENFVLPAPPKYTLNLFSSQDWAA 1092
Query: 245 IPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIE 304
+P T + E V ++V+++ E T+SGL +A+GT NY E+V RGRI+L ++IE
Sbjct: 1093 VPNTEISFEDMEAVTACEDVALKSESTISGLETLLAMGTVNNYGEEVLVRGRIILCEVIE 1152
Query: 305 VVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFI 364
VVPEP QP + KIK+++ KEQKGPVT +C + G L+ +GQK++IWQ KDNDL GI+F+
Sbjct: 1153 VVPEPDQPTSNRKIKVLFDKEQKGPVTGLCAINGLLLCGMGQKVFIWQFKDNDLMGISFL 1212
Query: 365 DTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD-YKPTQP 413
D Y+ + S++ + + D S++L+R+Q + + +S+ +RD K QP
Sbjct: 1213 DMHYYVYQLHSLRTIAIACDARESMSLIRFQEDNKAMSIASRDDRKCAQP 1262
Score = 281 (104.0 bits), Expect = 2.0e-89, Sum P(2) = 2.0e-89
Identities = 60/180 (33%), Positives = 103/180 (57%)
Query: 465 MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS---- 520
+GF++SD+ N+ +F Y PEA ESNGG RL + ++G ++N F ++R S +
Sbjct: 1275 VGFLLSDETGNITMFNYAPEAPESNGGERLTVRAAINIGTNINAFVRLRGHTSLLQLNNE 1334
Query: 521 -DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY 579
+ R T +ASLDG+ GF PL EK+YRRL LQ + + T GL+ + R+
Sbjct: 1335 DEKEAIEQRMTTVFASLDGSFGFVRPLTEKSYRRLHFLQTFIGSVTPQIAGLHIKGSRSA 1394
Query: 580 K-GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
K + G +R +IDG +V ++L LSL ++ ++ +++G I+D+L + ++ ++
Sbjct: 1395 KPSQPIVNGRNARNLIDGDVVEQYLHLSLYDKTDLARRLGVGRYHIIDDLMQLRRMAFYY 1454
Score = 84 (34.6 bits), Expect = 1.2e-68, Sum P(2) = 1.2e-68
Identities = 19/61 (31%), Positives = 35/61 (57%)
Query: 405 ARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSS 464
+R KP+QP G A R +IDG +V ++L LSL ++ ++ +++G I+D+
Sbjct: 1391 SRSAKPSQPIVNGRNA----RNLIDGDVVEQYLHLSLYDKTDLARRLGVGRYHIIDDLMQ 1446
Query: 465 M 465
+
Sbjct: 1447 L 1447
>TAIR|locus:2153122 [details] [associations]
symbol:CPSF160 "cleavage and polyadenylation specificity
factor 160" species:3702 "Arabidopsis thaliana" [GO:0003676
"nucleic acid binding" evidence=IEA] [GO:0005634 "nucleus"
evidence=ISM;IEA;IDA] [GO:0006378 "mRNA polyadenylation"
evidence=ISS] [GO:0006379 "mRNA cleavage" evidence=ISS] [GO:0005515
"protein binding" evidence=IPI] [GO:0005829 "cytosol" evidence=IDA]
[GO:0006397 "mRNA processing" evidence=RCA] [GO:0009909 "regulation
of flower development" evidence=RCA] [GO:0016570 "histone
modification" evidence=RCA] [GO:0048449 "floral organ formation"
evidence=RCA] InterPro:IPR004871 Pfam:PF03178 GO:GO:0005829
GO:GO:0005634 EMBL:CP002688 GenomeReviews:BA000015_GR GO:GO:0006397
GO:GO:0003723 eggNOG:COG5161 KO:K14401 EMBL:AY140902 EMBL:AB025607
EMBL:AB018109 IPI:IPI00533913 RefSeq:NP_199979.2 UniGene:At.43551
IntAct:Q9FGR0 STRING:Q9FGR0 PaxDb:Q9FGR0 PRIDE:Q9FGR0
EnsemblPlants:AT5G51660.1 GeneID:835240 KEGG:ath:AT5G51660
TAIR:At5g51660 HOGENOM:HOG000265012 InParanoid:Q9FGR0 OMA:NIGDNRY
PhylomeDB:Q9FGR0 ProtClustDB:CLSN2680511 Genevestigator:Q9FGR0
GermOnline:AT5G51660 Uniprot:Q9FGR0
Length = 1442
Score = 455 (165.2 bits), Expect = 3.6e-65, Sum P(2) = 3.6e-65
Identities = 112/343 (32%), Positives = 175/343 (51%)
Query: 79 GLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHN 138
G GV ++ F NI+G+QG FL G P W L R LR H DG ++ HN
Sbjct: 924 GTSDGVASQRITMFKNISGHQGFFLSGSRPGWCMLF-RERLRFHSQLCDGSIAAFTVLHN 982
Query: 139 VNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTS- 197
VNC GF+Y A+ L+I LP+ YD WPV+K+PLK TPH + Y+ E Y ++ S
Sbjct: 983 VNCNHGFIYVTAQGVLKICQLPSASIYDNYWPVQKIPLKATPHQVTYYAEKNLYPLIVSY 1042
Query: 198 -TAEP-----STDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS----WEEIPQ 247
++P S+ + G+ + D V +F + + P WE +
Sbjct: 1043 PVSKPLNQVLSSLVDQEAGQQLDNHNMSSDDLQRTYTVEEFEIQILEPERSGGPWET--K 1100
Query: 248 TNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVP 307
P+ EH L ++ V++ T +A+GT Y EDV RGR+LLF
Sbjct: 1101 AKIPMQTSEHALTVRVVTLLNASTGEN-ETLLAVGTAYVQGEDVAARGRVLLFSF----G 1155
Query: 308 EPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTE 367
+ G ++N + +Y++E KG ++A+ + G L+ + G KI + + +L G+AF D
Sbjct: 1156 KNGDN-SQNVVTEVYSRELKGAISAVASIQGHLLISSGPKIILHKWNGTELNGVAFFDAP 1214
Query: 368 -VYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
+Y+ SM VK+ IL+GD +SI L ++ + LSL+A+D++
Sbjct: 1215 PLYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLSLLAKDFE 1257
Score = 270 (100.1 bits), Expect = 3.6e-65, Sum P(2) = 3.6e-65
Identities = 59/180 (32%), Positives = 97/180 (53%)
Query: 457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
+ L + S++ +SD+ KN+ +F Y P+ ES G +L+ + +FH+G HV+ F +++
Sbjct: 1265 EFLIDGSTLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKFLRLQM-- 1322
Query: 517 SSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
+S +RF + +LDG+ G PL E +RRL LQ +V H GLNP AF
Sbjct: 1323 --VSSGADKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLNPLAF 1380
Query: 577 RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSS 636
R ++ G + I+D L+ + L L E+LE+ +IG+ IL +L D+ +S
Sbjct: 1381 RQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQIGTTRYSILKDLVDLSVGTS 1440
Score = 54 (24.1 bits), Expect = 5.3e-19, Sum P(3) = 5.3e-19
Identities = 16/65 (24%), Positives = 29/65 (44%)
Query: 140 NCPRGFLYFN-AKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTST 198
NC G++ + + S L+I ++ H +A WP K + P+ + IV +
Sbjct: 17 NCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNVVITAANILEVYIVRAQ 76
Query: 199 AEPST 203
E +T
Sbjct: 77 EEGNT 81
Score = 52 (23.4 bits), Expect = 3.1e-42, Sum P(2) = 3.1e-42
Identities = 15/54 (27%), Positives = 28/54 (51%)
Query: 408 YKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
++ + + K +G S I+D L+ + L L E+LE+ +IG+ IL +
Sbjct: 1380 FRQFRSSGKARRSGPDS--IVDCELLCHYEMLPLEEQLELAHQIGTTRYSILKD 1431
Score = 37 (18.1 bits), Expect = 5.3e-19, Sum P(3) = 5.3e-19
Identities = 11/44 (25%), Positives = 21/44 (47%)
Query: 254 EWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRI 297
E +++ L+++ M++ L GYI E+ T GR+
Sbjct: 234 ESSYIINLRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRV 277
>POMBASE|SPBC1709.08 [details] [associations]
symbol:cft1 "cleavage factor one Cft1 (predicted)"
species:4896 "Schizosaccharomyces pombe" [GO:0003723 "RNA binding"
evidence=IEA] [GO:0005634 "nucleus" evidence=IDA] [GO:0005829
"cytosol" evidence=IDA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IDA]
[GO:0005849 "mRNA cleavage factor complex" evidence=ISO]
[GO:0006378 "mRNA polyadenylation" evidence=ISO] [GO:0006379 "mRNA
cleavage" evidence=ISO] InterPro:IPR004871 Pfam:PF03178
PomBase:SPBC1709.08 GO:GO:0005829 EMBL:CU329671 GO:GO:0006378
GenomeReviews:CU329671_GR GO:GO:0003723 eggNOG:COG5161 KO:K14401
OMA:HNDRIFQ OrthoDB:EOG451HZS PIR:T39636 RefSeq:NP_595441.1
STRING:O74733 EnsemblFungi:SPBC1709.08.1 GeneID:2539694
KEGG:spo:SPBC1709.08 NextBio:20800847 GO:GO:0005847 GO:GO:0006379
Uniprot:O74733
Length = 1441
Score = 450 (163.5 bits), Expect = 1.6e-60, Sum P(2) = 1.6e-60
Identities = 117/414 (28%), Positives = 200/414 (48%)
Query: 1 MGNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQ-HELLIYQAFRHPKGAXXXXX 59
M + R++ + +V ELL LG P L +R++ +E+ +Y+AF +
Sbjct: 836 MESERTYFNKESSQELV-ELLVADLGDDFKEPHLFLRSRLNEITVYKAFLYSNTDKHKNL 894
Query: 60 XXXXXXXXSDRSKRANEQPGLPRGVRISQMRYFSN------------IAGYQGVFLCGPH 107
++ G PR + + S+ + + VF+ G
Sbjct: 895 LAFAKVPQETMTREFQANVGTPRDAESTMEKKASSSVDHLKMTALEVVGNHSAVFVTGRK 954
Query: 108 PAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDA 167
P + T + P++ + P+ ++APFH + P+G++Y + S +RI YD
Sbjct: 955 PFLILSTLHSNAKFFPISSNIPILSVAPFHAHHAPQGYIYVDENSFIRICKFQEDFEYDN 1014
Query: 168 PWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPP 227
WP +KV L + +AYH TK V S +G + +TD D ++P
Sbjct: 1015 KWPYKKVSLGKQINGIAYH-PTKMVYAVGSAVPIEFKVTDEDGNEPYAITDDND--YLP- 1070
Query: 228 LVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY 287
+ + + L SP +W I F ++E L + V++E T + YIA+GT+
Sbjct: 1071 MANTGSLDLVSPLTWTVIDSYEF--QQFEIPLSVALVNLEVSETTKLRKPYIAVGTSITK 1128
Query: 288 SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQK 347
ED+ RG LF+II+VVP+PG+P T++K+K++ +E KG V +C V G+L++ GQK
Sbjct: 1129 GEDIAVRGSTYLFEIIDVVPQPGRPETRHKLKLVTREEIKGTVAVVCEVDGYLLSGQGQK 1188
Query: 348 IYIWQLKDND-LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPE-YR 399
+ + L+D D L G++FID Y S ++NL+L GD +++ + + E YR
Sbjct: 1189 VIVRALEDEDHLVGVSFIDLGSYTLSAKCLRNLLLFGDVRQNVTFVGFAEEPYR 1242
Score = 236 (88.1 bits), Expect = 1.6e-60, Sum P(2) = 1.6e-60
Identities = 59/183 (32%), Positives = 96/183 (52%)
Query: 457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
D L + ++ F+++D N+ L Y PE ES+ G RL+ + DFH+G +V T I K
Sbjct: 1259 DFLVQGENLYFVVADTSGNLRLLAYDPENPESHSGERLVTRGDFHIG-NVITAMTILPKE 1317
Query: 517 SSISDAP---GARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
+A F + DG L +P+ ++ YRRL ++QN + + GGLNP
Sbjct: 1318 KKHQNAEYGYDTGDDFSCVMVNSDGGLQMLVPISDRVYRRLNIIQNYLANRVNTIGGLNP 1377
Query: 574 RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI-E 632
+++R NP+R I+DG L+ F +S+ R E+ K G + I+++L ++ E
Sbjct: 1378 KSYRLITSPSNLT-NPTRRILDGMLIDYFTYMSVAHRHEMAHKCGVPVSTIMNDLVELDE 1436
Query: 633 ALS 635
ALS
Sbjct: 1437 ALS 1439
Score = 67 (28.6 bits), Expect = 2.3e-43, Sum P(3) = 2.3e-43
Identities = 15/49 (30%), Positives = 27/49 (55%)
Query: 422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMIS 470
NP+R I+DG L+ F +S+ R E+ K G + I+++ + +S
Sbjct: 1391 NPTRRILDGMLIDYFTYMSVAHRHEMAHKCGVPVSTIMNDLVELDEALS 1439
Score = 41 (19.5 bits), Expect = 2.3e-43, Sum P(3) = 2.3e-43
Identities = 10/26 (38%), Positives = 16/26 (61%)
Query: 373 MVSVKNL-ILVGDYARSIALLRYQPE 397
+V +NL +V D + ++ LL Y PE
Sbjct: 1261 LVQGENLYFVVADTSGNLRLLAYDPE 1286
Score = 40 (19.1 bits), Expect = 1.1e-14, Sum P(2) = 1.1e-14
Identities = 19/72 (26%), Positives = 30/72 (41%)
Query: 107 HPAWLFLTSRGELRAHPMT-----IDGPVSTLAP--FHNVNCPRGFLYFNAKSELR-ISV 158
HP LT G+L+ + + ++ V L P F+ + R YFN +S + +
Sbjct: 797 HPILFALTDEGKLKVYNLADFSLLMECDVFDLPPTLFNGMESER--TYFNKESSQELVEL 854
Query: 159 LPTHLSYDAPWP 170
L L D P
Sbjct: 855 LVADLGDDFKEP 866
>DICTYBASE|DDB_G0281585 [details] [associations]
symbol:cpsf1 "cleavage and polyadenylation
specificity factor 160 kDa subunit" species:44689 "Dictyostelium
discoideum" [GO:0005634 "nucleus" evidence=IEA] [GO:0003676
"nucleic acid binding" evidence=IEA] [GO:0006379 "mRNA cleavage"
evidence=ISS] [GO:0006378 "mRNA polyadenylation" evidence=ISS]
[GO:0005847 "mRNA cleavage and polyadenylation specificity factor
complex" evidence=ISS] [GO:0003730 "mRNA 3'-UTR binding"
evidence=ISS] InterPro:IPR004871 Pfam:PF03178
dictyBase:DDB_G0281585 GenomeReviews:CM000152_GR GO:GO:0006378
EMBL:AAFI02000042 GO:GO:0003730 eggNOG:COG5161 KO:K14401
GO:GO:0005847 GO:GO:0006379 RefSeq:XP_640515.1
EnsemblProtists:DDB0233702 GeneID:8623125 KEGG:ddi:DDB_G0281585
InParanoid:Q54TS6 OMA:TSATIQD Uniprot:Q54TS6
Length = 1628
Score = 449 (163.1 bits), Expect = 4.0e-53, Sum P(2) = 4.0e-53
Identities = 108/364 (29%), Positives = 201/364 (55%)
Query: 70 RSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPM-TIDG 128
+ K E+ L R RI + FS+I+G +G+F+ G P W F +G LR H M + D
Sbjct: 1108 KKKEEEEEENLNRQKRIFE---FSSISGKRGLFIGGKKPIWAFC-EKGYLRLHSMDSSDN 1163
Query: 129 P---------------VSTLAPFHNVNCPRGFLYFNAKSE-LRISVLPTHLSYDAPWPVR 172
V T F+N++C GF+YF+ + + ++I L T ++++ +R
Sbjct: 1164 SNSNNSNNNNNNNSNTVETFTSFNNISCQDGFIYFSKEKDVIKICTLSTLMNFENDIAIR 1223
Query: 173 KVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQF 232
++P K + H +AYH E K Y ++ S + + + + K ++TD + F
Sbjct: 1224 RIPTKNSCHKIAYHSEAKCYVVIVSFPQVTQELQE--DSKKPILTDDK-----------F 1270
Query: 233 HVSLFSP---FSWEEIPQTNFPLHEWEHVLCLKNVSMEY---EGTLSGLRGYIALGTNYN 286
+ L P ++W+ I +F L + E VL +K VS+++ +G ++ R ++ +GT +
Sbjct: 1271 QIKLIDPTIDWNWKFID--SFSLQDRETVLAMKIVSLKFTEPDG-ITRARPFLVIGTAFT 1327
Query: 287 YSEDVTCRGRILLFDIIEVVPE-PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVG 345
+ ED C+GR+L+F+I+ + + L + ++ ++Y KEQKGPVTA+ V G L+ +G
Sbjct: 1328 FGEDTQCKGRVLVFEIVSHKTQFESEELGEKRLNLLYEKEQKGPVTALSSVNGLLLMTIG 1387
Query: 346 QKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVA 405
K+ + Q L ++F D ++YI S+ ++KN I++GD +S+ L+++ + +TL+L++
Sbjct: 1388 PKLTVNQFYTGSLVTLSFYDAQIYICSICTIKNYIVIGDMYKSVYFLQWK-DNKTLNLLS 1446
Query: 406 RDYK 409
+DY+
Sbjct: 1447 KDYQ 1450
Score = 169 (64.5 bits), Expect = 4.0e-53, Sum P(2) = 4.0e-53
Identities = 58/207 (28%), Positives = 94/207 (45%)
Query: 436 FLQLSLGERLEICKKIGSKHNDILDEF----SSMGFMISDKDKNVVLFMYQPEARESNGG 491
FLQ + L + K N EF ++ ++SD DKN++LF ++P+ S G
Sbjct: 1433 FLQWKDNKTLNLLSKDYQALNIFSTEFIVNQKTLSILVSDLDKNILLFSFEPQDPSSRSG 1492
Query: 492 HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNY 551
+ Q +N K +D + L + +LDG L PL EK Y
Sbjct: 1493 Q---------INQEINGNNK--------NDNRLPKKEQLVIFGTLDGGLNVLRPLDEKIY 1535
Query: 552 RRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY-YAGNPS------RGIIDGSLVWKFLQ 604
+Q+ + + T GLNP+ +R++K + +PS + I+DG L+ KFL
Sbjct: 1536 LLFYHIQSKLY-YLPQTAGLNPKQYRSFKSFSQNFHFSPSTFHQLPKFILDGDLISKFLS 1594
Query: 605 LSLGERLEICKKIGSKHNDILDELYDI 631
LS E+ I I S ++I++ L D+
Sbjct: 1595 LSQSEKRLISNSINSTSDEIIESLKDV 1621
Score = 71 (30.1 bits), Expect = 8.0e-43, Sum P(2) = 8.0e-43
Identities = 23/80 (28%), Positives = 40/80 (50%)
Query: 392 LRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPS------RGIIDGSLVWKFLQLSLGERL 445
L Y P+ T L + Y+ + S+ ++ +PS + I+DG L+ KFL LS E+
Sbjct: 1545 LYYLPQ--TAGLNPKQYRSFKSFSQNFHF-SPSTFHQLPKFILDGDLISKFLSLSQSEKR 1601
Query: 446 EICKKIGSKHNDILDEFSSM 465
I I S ++I++ +
Sbjct: 1602 LISNSINSTSDEIIESLKDV 1621
>ASPGD|ASPL0000050546 [details] [associations]
symbol:AN1413 species:162425 "Emericella nidulans"
[GO:0008150 "biological_process" evidence=ND] [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=IEA] [GO:0005829 "cytosol" evidence=IEA] [GO:0003676
"nucleic acid binding" evidence=IEA] InterPro:IPR004871
Pfam:PF03178 GO:GO:0005634 EMBL:BN001307 GO:GO:0006397
GO:GO:0003723 eggNOG:COG5161 KO:K14401 EMBL:AACD01000022
RefSeq:XP_659017.1 EnsemblFungi:CADANIAT00008024 GeneID:2875502
KEGG:ani:AN1413.2 HOGENOM:HOG000048586 OMA:HNDRIFQ
OrthoDB:EOG451HZS Uniprot:Q5BDG7
Length = 1339
Score = 370 (135.3 bits), Expect = 3.6e-46, Sum P(2) = 3.6e-46
Identities = 119/410 (29%), Positives = 199/410 (48%)
Query: 10 SAMDETIVQELLTVSLG-LHGNRPLLLVRTQHE-LLIYQAFRHPKGAXXXXXXXXXXXXX 67
S E ++Q + V LG + + P L++RT+++ L++Y+ F +
Sbjct: 753 STTRENVLQ-IAVVELGDSYSSLPFLILRTENDDLVVYKPFF--TNSKELTGLRFLKEAN 809
Query: 68 SDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTID 127
K N L ++ +R NIAG +F+ GP ++F S H + +
Sbjct: 810 HTLPKTPNTTDELQSEMK--PLRILPNIAGCSSIFMPGPSAGFIFRASTTS--PHFIRLR 865
Query: 128 GP-VSTLAPFHNVNCPRGFLYFNAKSELRISVLP--THLSYDAPWPVRKVPLKCTPHFLA 184
G + L F + + +GF Y ++ L ++ LP T L Y PW +R VP+ L
Sbjct: 866 GGFIKGLGCFDSPD--KGFAYLDSHG-LHLAKLPEGTQLGY--PWIMRTVPIGQQIDKLT 920
Query: 185 YHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSR--FIPPLVSQFHVSLFSPFSW 242
Y + TY V T + ++ ED EL + R+ F+P V+Q + + SP +W
Sbjct: 921 YVSASDTY--VLGTCQRCE--FRLP-EDDELHPEWRNEEISFLPE-VNQSSLKVVSPKTW 974
Query: 243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
I ++PL EH++ +K +S+E R I +GT+ ED+ RG I +F++
Sbjct: 975 SVID--SYPLEPAEHIMVMKTMSLEVSENTHERRDMIVVGTSLARGEDIPSRGCIYVFEV 1032
Query: 303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAG--FLVTAVGQKIYIWQLK-DNDLT 359
IEVVP+P QP T ++K+I + KG VTA+ + G FL+ A GQK + LK D L
Sbjct: 1033 IEVVPDPEQPETNRRLKLIGKEPVKGAVTALSEIGGQGFLIAAQGQKSMVRGLKEDGSLL 1092
Query: 360 GIAFIDTEVYIASMVSVKN--LILVGDYARSIALLRYQPEYRTLSLVARD 407
+AF+D + +++ + +K + + GD + + Y E +SL A+D
Sbjct: 1093 PVAFMDMQCFVSVIKELKGTGMCIFGDAVKGLWFAGYSEEPYKMSLFAKD 1142
Score = 197 (74.4 bits), Expect = 3.6e-46, Sum P(2) = 3.6e-46
Identities = 55/188 (29%), Positives = 96/188 (51%)
Query: 457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI-RCK 515
D L + + + +++D D N+ + Y PE S+ G +L+ ++ FH G +T + R
Sbjct: 1152 DFLPDGNKLFIVVADSDCNLYVLQYDPEDPNSSNGDKLLNRSKFHTGNFASTVTLLPRTL 1211
Query: 516 PSSISDAPGARSRFLTWYASL--------DGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 567
SS G+ + A L +G++G +PE++YRRL LQ+ + H
Sbjct: 1212 VSSERAMSGSDKMDIDNTAPLHQVLVTSHNGSIGLVTCVPEESYRRLSALQSQLTNTLEH 1271
Query: 568 TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 627
GLNPRA+R + AG RG++D +L+ ++L +S + EI ++G+ +I
Sbjct: 1272 PCGLNPRAYRAVESDAS-AG---RGMLDSNLLLQYLDMSKQRKAEIAGRVGATEWEIRA- 1326
Query: 628 LYDIEALS 635
D+EA+S
Sbjct: 1327 --DLEAIS 1332
Score = 73 (30.8 bits), Expect = 3.7e-33, Sum P(2) = 3.7e-33
Identities = 26/90 (28%), Positives = 44/90 (48%)
Query: 380 ILVGDYARSIALLRYQPE--YRTLSLVARDYKPT--QP---NSKGYYA----GNPSRGII 428
+LV + SI L+ PE YR LS + T P N + Y A + RG++
Sbjct: 1235 VLVTSHNGSIGLVTCVPEESYRRLSALQSQLTNTLEHPCGLNPRAYRAVESDASAGRGML 1294
Query: 429 DGSLVWKFLQLSLGERLEICKKIGSKHNDI 458
D +L+ ++L +S + EI ++G+ +I
Sbjct: 1295 DSNLLLQYLDMSKQRKAEIAGRVGATEWEI 1324
>CGD|CAL0004251 [details] [associations]
symbol:orf19.2760 species:5476 "Candida albicans" [GO:0042493
"response to drug" evidence=IMP] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IEA]
[GO:0005829 "cytosol" evidence=IEA] [GO:0005739 "mitochondrion"
evidence=IEA] [GO:0006369 "termination of RNA polymerase II
transcription" evidence=IEA] [GO:0006379 "mRNA cleavage"
evidence=IEA] [GO:0006378 "mRNA polyadenylation" evidence=IEA]
[GO:0003723 "RNA binding" evidence=IEA] InterPro:IPR004871
Pfam:PF03178 CGD:CAL0004251 GO:GO:0005634 GO:GO:0042493
GO:GO:0006397 GO:GO:0003723 EMBL:AACQ01000023 EMBL:AACQ01000025
RefSeq:XP_720278.1 RefSeq:XP_720279.1 RefSeq:XP_720280.1
RefSeq:XP_720510.1 STRING:Q5AFT3 GeneID:3637848 GeneID:3638158
GeneID:3638159 GeneID:3638160 KEGG:cal:CaO19.10274
KEGG:cal:CaO19.10275 KEGG:cal:CaO19.10276 KEGG:cal:CaO19.2760
eggNOG:COG5161 KO:K14401 Uniprot:Q5AFT3
Length = 1420
Score = 326 (119.8 bits), Expect = 6.8e-33, Sum P(2) = 6.8e-33
Identities = 88/342 (25%), Positives = 163/342 (47%)
Query: 81 PRGVRISQ-MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNV 139
P G I + + YF N+ G+ +F+ G P + T R + +S ++ F +
Sbjct: 875 PYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKTVHSIPRIFQFSKIAAMS-ISAFSDS 933
Query: 140 NCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTA 199
G ++ + + RI LP +Y+ P++ V + + +AYH + T +V ST
Sbjct: 934 KIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHVDIGESIKSIAYHETSDT--VVLSTF 991
Query: 200 EPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVL 259
+ Y + E K + +D + P + + + L SP++W I +E L
Sbjct: 992 K-QIPYDCLDEEGKPIAGIIKDIKDTPAMSFKGSIKLVSPYNWTVIETIELGDNEVGMTL 1050
Query: 260 --CLKNVSMEYEGTLSG------------LRGYIALGTNYNYSEDVTCRGRILLFDIIEV 305
+ +V E TL R YI +G ED+ G +++II++
Sbjct: 1051 KSMILDVGSESGSTLGSDPNSLIKKYNKKKREYIVIGIGKYRMEDLAANGIFKIYEIIDI 1110
Query: 306 VPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFID 365
+PEPG+P T +K K I+ +E +G +T+IC ++G + + GQK+ + L+D+ +AF+D
Sbjct: 1111 IPEPGKPETNHKFKEIFKEETRGAITSICELSGRFLVSQGQKVIVRDLQDDGTVPVAFLD 1170
Query: 366 TEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD 407
T VY++ S NL+++GD + L+ + E + ++ +D
Sbjct: 1171 TPVYVSESKSFGNLLILGDLLKGCWLVGFDAEPFRMIMLGKD 1212
Score = 119 (46.9 bits), Expect = 6.8e-33, Sum P(2) = 6.8e-33
Identities = 45/182 (24%), Positives = 76/182 (41%)
Query: 468 MISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK---IRCKPSSISDA-- 522
+++D + + L Y P+ +S G +L+ K F L ++ I + S +DA
Sbjct: 1233 LVADNNNVLHLLNYDPDDPQSINGTKLLTKASFELNSTISCLRSLPLIDIEESVQTDALT 1292
Query: 523 ---------PGARSRFLTWYASL-DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
P S + S DG+ P+ E YRR+ +LQ ++ H GLN
Sbjct: 1293 NIAVPPPLPPNTTSNYFQVIGSTQDGSFFNVFPINEAAYRRMYILQQQLIDKEFHYCGLN 1352
Query: 573 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK--HNDILDELYD 630
PR R K ++ I+D L+ F +LS + + K+ K + DI ++
Sbjct: 1353 PRLNRIGSIKLQNNETNTKPILDYDLIRSFTKLSDDRKRNLANKVSGKGIYQDIWKDIIR 1412
Query: 631 IE 632
E
Sbjct: 1413 FE 1414
Score = 40 (19.1 bits), Expect = 1.3e-24, Sum P(2) = 1.3e-24
Identities = 11/50 (22%), Positives = 23/50 (46%)
Query: 424 SRGIIDGSLVWKFLQLSLGERLEICKKIGSK--HNDILDEFSSMGFMISD 471
++ I+D L+ F +LS + + K+ K + DI + ++D
Sbjct: 1370 TKPILDYDLIRSFTKLSDDRKRNLANKVSGKGIYQDIWKDIIRFEHTLND 1419
>UNIPROTKB|Q5AFT3 [details] [associations]
symbol:CFT1 "Protein CFT1" species:237561 "Candida albicans
SC5314" [GO:0042493 "response to drug" evidence=IMP]
InterPro:IPR004871 Pfam:PF03178 CGD:CAL0004251 GO:GO:0005634
GO:GO:0042493 GO:GO:0006397 GO:GO:0003723 EMBL:AACQ01000023
EMBL:AACQ01000025 RefSeq:XP_720278.1 RefSeq:XP_720279.1
RefSeq:XP_720280.1 RefSeq:XP_720510.1 STRING:Q5AFT3 GeneID:3637848
GeneID:3638158 GeneID:3638159 GeneID:3638160 KEGG:cal:CaO19.10274
KEGG:cal:CaO19.10275 KEGG:cal:CaO19.10276 KEGG:cal:CaO19.2760
eggNOG:COG5161 KO:K14401 Uniprot:Q5AFT3
Length = 1420
Score = 326 (119.8 bits), Expect = 6.8e-33, Sum P(2) = 6.8e-33
Identities = 88/342 (25%), Positives = 163/342 (47%)
Query: 81 PRGVRISQ-MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNV 139
P G I + + YF N+ G+ +F+ G P + T R + +S ++ F +
Sbjct: 875 PYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKTVHSIPRIFQFSKIAAMS-ISAFSDS 933
Query: 140 NCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTA 199
G ++ + + RI LP +Y+ P++ V + + +AYH + T +V ST
Sbjct: 934 KIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHVDIGESIKSIAYHETSDT--VVLSTF 991
Query: 200 EPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVL 259
+ Y + E K + +D + P + + + L SP++W I +E L
Sbjct: 992 K-QIPYDCLDEEGKPIAGIIKDIKDTPAMSFKGSIKLVSPYNWTVIETIELGDNEVGMTL 1050
Query: 260 --CLKNVSMEYEGTLSG------------LRGYIALGTNYNYSEDVTCRGRILLFDIIEV 305
+ +V E TL R YI +G ED+ G +++II++
Sbjct: 1051 KSMILDVGSESGSTLGSDPNSLIKKYNKKKREYIVIGIGKYRMEDLAANGIFKIYEIIDI 1110
Query: 306 VPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFID 365
+PEPG+P T +K K I+ +E +G +T+IC ++G + + GQK+ + L+D+ +AF+D
Sbjct: 1111 IPEPGKPETNHKFKEIFKEETRGAITSICELSGRFLVSQGQKVIVRDLQDDGTVPVAFLD 1170
Query: 366 TEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD 407
T VY++ S NL+++GD + L+ + E + ++ +D
Sbjct: 1171 TPVYVSESKSFGNLLILGDLLKGCWLVGFDAEPFRMIMLGKD 1212
Score = 119 (46.9 bits), Expect = 6.8e-33, Sum P(2) = 6.8e-33
Identities = 45/182 (24%), Positives = 76/182 (41%)
Query: 468 MISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK---IRCKPSSISDA-- 522
+++D + + L Y P+ +S G +L+ K F L ++ I + S +DA
Sbjct: 1233 LVADNNNVLHLLNYDPDDPQSINGTKLLTKASFELNSTISCLRSLPLIDIEESVQTDALT 1292
Query: 523 ---------PGARSRFLTWYASL-DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
P S + S DG+ P+ E YRR+ +LQ ++ H GLN
Sbjct: 1293 NIAVPPPLPPNTTSNYFQVIGSTQDGSFFNVFPINEAAYRRMYILQQQLIDKEFHYCGLN 1352
Query: 573 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK--HNDILDELYD 630
PR R K ++ I+D L+ F +LS + + K+ K + DI ++
Sbjct: 1353 PRLNRIGSIKLQNNETNTKPILDYDLIRSFTKLSDDRKRNLANKVSGKGIYQDIWKDIIR 1412
Query: 631 IE 632
E
Sbjct: 1413 FE 1414
Score = 40 (19.1 bits), Expect = 1.3e-24, Sum P(2) = 1.3e-24
Identities = 11/50 (22%), Positives = 23/50 (46%)
Query: 424 SRGIIDGSLVWKFLQLSLGERLEICKKIGSK--HNDILDEFSSMGFMISD 471
++ I+D L+ F +LS + + K+ K + DI + ++D
Sbjct: 1370 TKPILDYDLIRSFTKLSDDRKRNLANKVSGKGIYQDIWKDIIRFEHTLND 1419
>SGD|S000002709 [details] [associations]
symbol:CFT1 "RNA-binding subunit of the mRNA cleavage and
polyadenylation factor" species:4932 "Saccharomyces cerevisiae"
[GO:0005634 "nucleus" evidence=IEA;IDA] [GO:0003723 "RNA binding"
evidence=IEA;IDA] [GO:0006397 "mRNA processing" evidence=IEA]
[GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005739
"mitochondrion" evidence=IDA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IDA;IPI]
[GO:0006369 "termination of RNA polymerase II transcription"
evidence=IMP] [GO:0006378 "mRNA polyadenylation" evidence=IDA;TAS]
[GO:0006379 "mRNA cleavage" evidence=IDA;TAS] [GO:0005849 "mRNA
cleavage factor complex" evidence=IPI] InterPro:IPR004871
Pfam:PF03178 SGD:S000002709 GO:GO:0005739 GO:GO:0006378
EMBL:BK006938 GO:GO:0003723 EMBL:U28374 eggNOG:COG5161 KO:K14401
OMA:HNDRIFQ GO:GO:0005847 GO:GO:0006379 PIR:S61187
RefSeq:NP_010587.1 ProteinModelPortal:Q06632 DIP:DIP-2467N
IntAct:Q06632 MINT:MINT-375530 STRING:Q06632 PaxDb:Q06632
PeptideAtlas:Q06632 EnsemblFungi:YDR301W GeneID:851895
KEGG:sce:YDR301W CYGD:YDR301w GeneTree:ENSGT00550000075040
HOGENOM:HOG000246682 OrthoDB:EOG4D29XZ NextBio:969889
Genevestigator:Q06632 GermOnline:YDR301W GO:GO:0006369
Uniprot:Q06632
Length = 1357
Score = 266 (98.7 bits), Expect = 3.7e-31, Sum P(3) = 3.7e-31
Identities = 82/325 (25%), Positives = 149/325 (45%)
Query: 89 MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
M YF + GY +F+ G P L + + P+ ++ P+ R +
Sbjct: 851 MHYFPDYNGYSVIFVTGSVPYILIKEDDSTPKIFKFG-NIPLVSVTPWSE----RSVMCV 905
Query: 149 NAKSELRISVLPT-HLSYDAPWPVRKVPLKC------TPHFLAYHLETKTYCIVTSTAEP 201
+ R+ L T ++ Y P++++ + T L YH + + + P
Sbjct: 906 DDIKNARVYTLTTDNMYYGNKLPLKQIKISNVLDDYKTLQKLVYHERAQLFLVSYCKRVP 965
Query: 202 STDYYKFNGEDKELVTDPRDSRFIPPLVS-QFHVSLFSPFSWEEIPQTNFPLHEWEHVLC 260
Y+ GED E V ++ +P Q + L +P SW+ I + +FP + + +
Sbjct: 966 ----YEALGEDGEKVIGYDEN--VPHAEGFQSGILLINPKSWKVIDKIDFPKNSVVNEM- 1018
Query: 261 LKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKM 320
++ ++ R YI G +ED G ++D+IEVVPEPG+P T K+K
Sbjct: 1019 -RSSMIQINSKTKRKREYIIAGVANATTEDTPPTGAFHIYDVIEVVPEPGKPDTNYKLKE 1077
Query: 321 IYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNL 379
I+ +E G V+ +C V+G + + QK+ + ++ DN + +AF+D V++ S NL
Sbjct: 1078 IFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPVAFLDIPVFVTDSKSFGNL 1137
Query: 380 ILVGDYARSIALLRYQPE-YRTLSL 403
+++GD + + + E YR +SL
Sbjct: 1138 LIIGDAMQGFQFIGFDAEPYRMISL 1162
Score = 171 (65.3 bits), Expect = 3.7e-31, Sum P(3) = 3.7e-31
Identities = 53/198 (26%), Positives = 91/198 (45%)
Query: 436 FLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLI 495
+ +SLG + K + + L M F +D D+NV + Y P+ S G RL+
Sbjct: 1157 YRMISLGRSMS---KFQTMSLEFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLV 1213
Query: 496 KKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLL 555
+ F L N+ + + +P S F +DG++ +PL E+ YRRL
Sbjct: 1214 HCSSFTL-HSTNSCMMLLPRNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLY 1270
Query: 556 MLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 615
++Q ++ GGLNPR R Y G+ R ++D +++ +F L++ R I +
Sbjct: 1271 VIQQQIIDRELQLGGLNPRMERL-ANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQ 1329
Query: 616 KIGSK-HNDILDELYDIE 632
K G H + ++ +IE
Sbjct: 1330 KAGRHAHFEAWRDIINIE 1347
Score = 47 (21.6 bits), Expect = 4.6e-08, Sum P(3) = 4.6e-08
Identities = 18/70 (25%), Positives = 29/70 (41%)
Query: 340 LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARS--IALLRYQPE 397
LVT I I++L++ + + +D + MV LIL + I L + Q E
Sbjct: 669 LVTVSRGDIKIFELEEKNKRKLLKVDLPEILNEMVITSGLILKSNMCNEFLIGLSKSQEE 728
Query: 398 YRTLSLVARD 407
+ V D
Sbjct: 729 QLLFTFVTAD 738
Score = 41 (19.5 bits), Expect = 3.7e-31, Sum P(3) = 3.7e-31
Identities = 13/41 (31%), Positives = 19/41 (46%)
Query: 12 MDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPK 52
+D T+V L LL+VRT + L +Y+ R K
Sbjct: 8 LDATVVSHSLATHFTTSDYEELLVVRT-NILSVYRPTRDGK 47
>TAIR|locus:2115909 [details] [associations]
symbol:DDB1A "damaged DNA binding protein 1A"
species:3702 "Arabidopsis thaliana" [GO:0003676 "nucleic acid
binding" evidence=IEA] [GO:0003677 "DNA binding" evidence=ISS]
[GO:0005634 "nucleus" evidence=ISM;IEA;IDA;IPI] [GO:0010100
"negative regulation of photomorphogenesis" evidence=IGI;RCA]
[GO:0045892 "negative regulation of transcription, DNA-dependent"
evidence=IPI] [GO:0005515 "protein binding" evidence=IPI]
[GO:0080008 "Cul4-RING ubiquitin ligase complex" evidence=IPI]
[GO:0005829 "cytosol" evidence=IDA] [GO:0000278 "mitotic cell
cycle" evidence=RCA] [GO:0000911 "cytokinesis by cell plate
formation" evidence=RCA] [GO:0003002 "regionalization"
evidence=RCA] [GO:0006281 "DNA repair" evidence=RCA] [GO:0006486
"protein glycosylation" evidence=RCA] [GO:0007155 "cell adhesion"
evidence=RCA] [GO:0008284 "positive regulation of cell
proliferation" evidence=RCA] [GO:0009630 "gravitropism"
evidence=RCA] [GO:0009639 "response to red or far red light"
evidence=RCA] [GO:0010090 "trichome morphogenesis" evidence=RCA]
[GO:0033043 "regulation of organelle organization" evidence=RCA]
[GO:0045010 "actin nucleation" evidence=RCA] [GO:0048449 "floral
organ formation" evidence=RCA] [GO:0048608 "reproductive structure
development" evidence=RCA] InterPro:IPR017986 InterPro:IPR004871
InterPro:IPR015943 Pfam:PF03178 UniPathway:UPA00143 GO:GO:0005829
GO:GO:0005634 GO:GO:0045892 EMBL:CP002687 GenomeReviews:CT486007_GR
Gene3D:2.130.10.10 SUPFAM:SSF50978 GO:GO:0003677 GO:GO:0006281
GO:GO:0016567 GO:GO:0009585 EMBL:AL161503 GO:GO:0080008
GO:GO:0010100 EMBL:AY074257 EMBL:BT001905 EMBL:AK230366
IPI:IPI00548104 PIR:B85068 RefSeq:NP_192451.1 UniGene:At.32663
UniGene:At.47587 ProteinModelPortal:Q9M0V3 DIP:DIP-40455N
IntAct:Q9M0V3 STRING:Q9M0V3 PaxDb:Q9M0V3 PRIDE:Q9M0V3 ProMEX:Q9M0V3
EnsemblPlants:AT4G05420.1 GeneID:825890 KEGG:ath:AT4G05420
GeneFarm:4660 TAIR:At4g05420 eggNOG:NOG247734 HOGENOM:HOG000007241
InParanoid:Q9M0V3 KO:K10610 OMA:CALGDGS PhylomeDB:Q9M0V3
ProtClustDB:CLSN2685347 Genevestigator:Q9M0V3 GermOnline:AT4G05420
Uniprot:Q9M0V3
Length = 1088
Score = 128 (50.1 bits), Expect = 2.0e-11, Sum P(2) = 2.0e-11
Identities = 55/189 (29%), Positives = 92/189 (48%)
Query: 230 SQFH-VSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYS 288
S+ H V L ++E + + +PL +E+ + + S + + Y +GT Y
Sbjct: 741 SEMHFVRLLDDQTFEFM--STYPLDSFEYGCSILSCSFTEDKNV-----YYCVGTAYVLP 793
Query: 289 ED-VTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQK 347
E+ +GRIL+F + E G ++++I KE KG V ++ G L+ A+ QK
Sbjct: 794 EENEPTKGRILVF-----IVEDG------RLQLIAEKETKGAVYSLNAFNGKLLAAINQK 842
Query: 348 I--YIWQLKDNDLTGIAFIDTEV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYR 399
I Y W L+D+ G + +E +A V + + I+VGD +SI+LL Y+ E
Sbjct: 843 IQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLLYKHEEG 899
Query: 400 TLSLVARDY 408
+ ARDY
Sbjct: 900 AIEERARDY 908
Score = 118 (46.6 bits), Expect = 2.0e-11, Sum P(2) = 2.0e-11
Identities = 42/182 (23%), Positives = 80/182 (43%)
Query: 457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF----FKI 512
+ILD+ +G ++ + N++ E RL ++HLG+ VN F +
Sbjct: 917 EILDDDIYLG---AENNFNLLTVKKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVM 973
Query: 513 RCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
R S I P + +++G +G LP++ Y L LQ+ + GGL+
Sbjct: 974 RLPDSEIGQIP------TVIFGTVNGVIGVIASLPQEQYTFLEKLQSSLRKVIKGVGGLS 1027
Query: 573 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 632
+R++ + A +R +DG L+ FL LS + +I K + + ++ + ++
Sbjct: 1028 HEQWRSFNNEKRTA--EARNFLDGDLIESFLDLSRNKMEDISKSMNVQVEELCKRVEELT 1085
Query: 633 AL 634
L
Sbjct: 1086 RL 1087
Score = 48 (22.0 bits), Expect = 0.00031, Sum P(2) = 0.00031
Identities = 10/28 (35%), Positives = 16/28 (57%)
Query: 424 SRGIIDGSLVWKFLQLSLGERLEICKKI 451
+R +DG L+ FL LS + +I K +
Sbjct: 1043 ARNFLDGDLIESFLDLSRNKMEDISKSM 1070
>TAIR|locus:2127368 [details] [associations]
symbol:DDB1B "damaged DNA binding protein 1B"
species:3702 "Arabidopsis thaliana" [GO:0003676 "nucleic acid
binding" evidence=IEA] [GO:0005634 "nucleus" evidence=ISM;IEA;IDA]
[GO:0003684 "damaged DNA binding" evidence=ISS] [GO:0009793 "embryo
development ending in seed dormancy" evidence=IMP] [GO:0005515
"protein binding" evidence=IPI] [GO:0005829 "cytosol" evidence=RCA]
[GO:0006281 "DNA repair" evidence=RCA] [GO:0007062 "sister
chromatid cohesion" evidence=RCA] [GO:0009880 "embryonic pattern
specification" evidence=RCA] [GO:0010072 "primary shoot apical
meristem specification" evidence=RCA] [GO:0010100 "negative
regulation of photomorphogenesis" evidence=RCA] [GO:0010162 "seed
dormancy process" evidence=RCA] [GO:0010431 "seed maturation"
evidence=RCA] [GO:0010564 "regulation of cell cycle process"
evidence=RCA] [GO:0045595 "regulation of cell differentiation"
evidence=RCA] [GO:0048366 "leaf development" evidence=RCA]
[GO:0048608 "reproductive structure development" evidence=RCA]
[GO:0048825 "cotyledon development" evidence=RCA] [GO:0051301 "cell
division" evidence=RCA] InterPro:IPR017986 InterPro:IPR004871
InterPro:IPR015943 Pfam:PF03178 UniPathway:UPA00143 GO:GO:0005634
EMBL:CP002687 GenomeReviews:CT486007_GR Gene3D:2.130.10.10
SUPFAM:SSF50978 EMBL:AL161554 GO:GO:0003677 GO:GO:0006281
GO:GO:0009793 GO:GO:0016567 GO:GO:0009585 EMBL:AL021960
UniGene:At.32663 eggNOG:NOG247734 HOGENOM:HOG000007241 KO:K10610
ProtClustDB:CLSN2685347 EMBL:AK220648 EMBL:AK229805 IPI:IPI00536598
PIR:T04941 RefSeq:NP_193842.1 ProteinModelPortal:O49552 SMR:O49552
DIP:DIP-46981N IntAct:O49552 STRING:O49552 PaxDb:O49552
PRIDE:O49552 EnsemblPlants:AT4G21100.1 GeneID:827857
KEGG:ath:AT4G21100 GeneFarm:4661 TAIR:At4g21100 InParanoid:O49552
OMA:DRPAVIY PhylomeDB:O49552 Genevestigator:O49552
GermOnline:AT4G21100 Uniprot:O49552
Length = 1088
Score = 120 (47.3 bits), Expect = 5.0e-11, Sum P(3) = 5.0e-11
Identities = 45/140 (32%), Positives = 71/140 (50%)
Query: 278 YIALGTNYNYSED-VTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
Y +GT Y E+ +GRIL+F I+E + ++++I KE KG V ++
Sbjct: 783 YYCVGTAYVLPEENEPTKGRILVF-IVE----------EGRLQLITEKETKGAVYSLNAF 831
Query: 337 AGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEV-----YIASMVSVK-NLILVGDYARS 388
G L+ ++ QKI Y W L+D+ G + +E +A V + + I VGD +S
Sbjct: 832 NGKLLASINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDFIAVGDLMKS 888
Query: 389 IALLRYQPEYRTLSLVARDY 408
I+LL Y+ E + ARDY
Sbjct: 889 ISLLIYKHEEGAIEERARDY 908
Score = 112 (44.5 bits), Expect = 5.0e-11, Sum P(3) = 5.0e-11
Identities = 34/140 (24%), Positives = 64/140 (45%)
Query: 499 DFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRL 554
++H+G+ VN F ++ S I P + ++ G +G LP++ Y L
Sbjct: 956 EYHIGEFVNRFRHGSLVMKLPDSDIGQIP------TVIFGTVSGMIGVIASLPQEQYAFL 1009
Query: 555 LMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 614
LQ + GGL+ +R++ + A ++G +DG L+ FL LS G+ EI
Sbjct: 1010 EKLQTSLRKVIKGVGGLSHEQWRSFNNEKRTA--EAKGYLDGDLIESFLDLSRGKMEEIS 1067
Query: 615 KKIGSKHNDILDELYDIEAL 634
K + + ++ + ++ L
Sbjct: 1068 KGMDVQVEELCKRVEELTRL 1087
Score = 58 (25.5 bits), Expect = 1.3e-05, Sum P(3) = 1.3e-05
Identities = 12/26 (46%), Positives = 17/26 (65%)
Query: 424 SRGIIDGSLVWKFLQLSLGERLEICK 449
++G +DG L+ FL LS G+ EI K
Sbjct: 1043 AKGYLDGDLIESFLDLSRGKMEEISK 1068
Score = 56 (24.8 bits), Expect = 5.0e-11, Sum P(3) = 5.0e-11
Identities = 27/122 (22%), Positives = 51/122 (41%)
Query: 83 GVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCP 142
G R +R FS+ + VF PA ++ ++ L ++ + VS + PF++ P
Sbjct: 626 GTRPITLRTFSSKSATH-VFAASDRPAVIYSNNKKLLYSNVNLKE--VSHMCPFNSAAFP 682
Query: 143 RGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPS 202
L + EL I + +R +P+ + + +T+T+ I EPS
Sbjct: 683 DS-LAIAREGELTIGTIDDIQKLH----IRTIPIGEHARRICHQEQTRTFAISCLRNEPS 737
Query: 203 TD 204
+
Sbjct: 738 AE 739
>GENEDB_PFALCIPARUM|PFL1680w [details] [associations]
symbol:PFL1680w "splicing factor 3b, subunit 3,
130kD, putative" species:5833 "Plasmodium falciparum" [GO:0005681
"spliceosomal complex" evidence=ISS] [GO:0008380 "RNA splicing"
evidence=ISS] InterPro:IPR017986 InterPro:IPR004871
InterPro:IPR015943 Pfam:PF03178 GO:GO:0008380 Gene3D:2.130.10.10
SUPFAM:SSF50978 GO:GO:0005681 GO:GO:0003676 EMBL:AE014188 KO:K12830
HOGENOM:HOG000216677 RefSeq:XP_001350742.1
ProteinModelPortal:Q8I574 PRIDE:Q8I574
EnsemblProtists:PFL1680w:mRNA GeneID:811388 KEGG:pfa:PFL1680w
EuPathDB:PlasmoDB:PF3D7_1234800 OMA:PVTSSMC ProtClustDB:CLSZ2733835
Uniprot:Q8I574
Length = 1329
Score = 106 (42.4 bits), Expect = 4.4e-05, Sum P(2) = 4.4e-05
Identities = 41/153 (26%), Positives = 69/153 (45%)
Query: 481 YQPEARESNGGHRLIKKT-DFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGA 539
Y E S+ +R ++ FH+G+ V + K+R P+S S + Y+++ G
Sbjct: 1188 YGGEIMNSSTKNRKLEHMMSFHIGEIVTSMQKVRLSPTS--------SECII-YSTIMGT 1238
Query: 540 LGFFLPLPEKNYRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 598
+G F+P K L L+ ++ T G FR+Y Y+ P + ++DG L
Sbjct: 1239 IGAFIPYDNKEELELTQHLEIILRTEKPPLCGREHIFFRSY----YH---PVQNVVDGDL 1291
Query: 599 VWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
+F LS + +I + DIL +L DI
Sbjct: 1292 CEQFSSLSYDAQKKIANDLERTPEDILRKLEDI 1324
Score = 81 (33.6 bits), Expect = 4.4e-05, Sum P(2) = 4.4e-05
Identities = 24/95 (25%), Positives = 40/95 (42%)
Query: 317 KIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIASMVS 375
K+ +++ + C G L+ ++G K+ I+ L K L + D I S+
Sbjct: 1049 KLNLLHITPIEEQPYCFCSYNGKLIASIGNKLRIYALGKKKLLKKCEYKDIPEAIVSIKI 1108
Query: 376 VKNLILVGDYARSIALLRYQPEYRTLSLVARDYKP 410
N I D S+ + Y P TL L++ D P
Sbjct: 1109 SGNRIFACDIRESVLIFFYDPNQNTLRLISDDIIP 1143
>UNIPROTKB|Q8I574 [details] [associations]
symbol:PFL1680w "Splicing factor 3b, subunit 3, 130kD,
putative" species:36329 "Plasmodium falciparum 3D7" [GO:0005681
"spliceosomal complex" evidence=ISS] [GO:0008380 "RNA splicing"
evidence=ISS] InterPro:IPR017986 InterPro:IPR004871
InterPro:IPR015943 Pfam:PF03178 GO:GO:0008380 Gene3D:2.130.10.10
SUPFAM:SSF50978 GO:GO:0005681 GO:GO:0003676 EMBL:AE014188 KO:K12830
HOGENOM:HOG000216677 RefSeq:XP_001350742.1
ProteinModelPortal:Q8I574 PRIDE:Q8I574
EnsemblProtists:PFL1680w:mRNA GeneID:811388 KEGG:pfa:PFL1680w
EuPathDB:PlasmoDB:PF3D7_1234800 OMA:PVTSSMC ProtClustDB:CLSZ2733835
Uniprot:Q8I574
Length = 1329
Score = 106 (42.4 bits), Expect = 4.4e-05, Sum P(2) = 4.4e-05
Identities = 41/153 (26%), Positives = 69/153 (45%)
Query: 481 YQPEARESNGGHRLIKKT-DFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGA 539
Y E S+ +R ++ FH+G+ V + K+R P+S S + Y+++ G
Sbjct: 1188 YGGEIMNSSTKNRKLEHMMSFHIGEIVTSMQKVRLSPTS--------SECII-YSTIMGT 1238
Query: 540 LGFFLPLPEKNYRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 598
+G F+P K L L+ ++ T G FR+Y Y+ P + ++DG L
Sbjct: 1239 IGAFIPYDNKEELELTQHLEIILRTEKPPLCGREHIFFRSY----YH---PVQNVVDGDL 1291
Query: 599 VWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
+F LS + +I + DIL +L DI
Sbjct: 1292 CEQFSSLSYDAQKKIANDLERTPEDILRKLEDI 1324
Score = 81 (33.6 bits), Expect = 4.4e-05, Sum P(2) = 4.4e-05
Identities = 24/95 (25%), Positives = 40/95 (42%)
Query: 317 KIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIASMVS 375
K+ +++ + C G L+ ++G K+ I+ L K L + D I S+
Sbjct: 1049 KLNLLHITPIEEQPYCFCSYNGKLIASIGNKLRIYALGKKKLLKKCEYKDIPEAIVSIKI 1108
Query: 376 VKNLILVGDYARSIALLRYQPEYRTLSLVARDYKP 410
N I D S+ + Y P TL L++ D P
Sbjct: 1109 SGNRIFACDIRESVLIFFYDPNQNTLRLISDDIIP 1143
>UNIPROTKB|G4N4E2 [details] [associations]
symbol:MGG_16867 "Uncharacterized protein" species:242507
"Magnaporthe oryzae 70-15" [GO:0008150 "biological_process"
evidence=ND] InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178
GO:GO:0005634 Gene3D:2.130.10.10 EMBL:CM001233 GO:GO:0003676
RefSeq:XP_003712617.1 EnsemblFungi:MGG_16867T0 GeneID:12985117
KEGG:mgr:MGG_16867 Uniprot:G4N4E2
Length = 1183
Score = 120 (47.3 bits), Expect = 0.00016, Sum P(2) = 0.00016
Identities = 40/182 (21%), Positives = 82/182 (45%)
Query: 445 LEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQ 504
+E+C+ + + + ++++D D N+V+ + R+ ++F LG+
Sbjct: 998 VEVCRDYQAMWSTAVSHLEGDSWIVADGDGNLVVLLRNTAGVTLEDKRRMQMTSEFGLGE 1057
Query: 505 HVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM-LQNVMVT 563
VN K+ + S+ +AP FL+ + +G++ F + K ++ LLM Q M
Sbjct: 1058 CVNKIQKVMVETSA--NAPIVAKAFLS---TTEGSIYLFGTVAPK-FQSLLMDFQANMEA 1111
Query: 564 HTSHT-GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 622
H S G L +R+++ P R +DG + FL + +++IC+ +
Sbjct: 1112 HVSSPLGELQFNQWRSFRNPEREGAGPER-FLDGEFLEMFLDMEENTQIDICQGLSYTAE 1170
Query: 623 DI 624
D+
Sbjct: 1171 DM 1172
Score = 60 (26.2 bits), Expect = 0.00016, Sum P(2) = 0.00016
Identities = 45/182 (24%), Positives = 69/182 (37%)
Query: 222 SRFIPPLVSQFHVSLFSPFSWEEIPQTN-FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIA 280
SR I + F LF +E P+ N F L + E C+ + + +I
Sbjct: 803 SRGIEKVYGTF--KLFDEVIFE--PKGNVFALEDGEVPECVTRAPL-LDSYGEQAERFI- 856
Query: 281 LGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAG-F 339
+GT Y GR+L+F V E P +I+A K I +
Sbjct: 857 VGTRYLSGTGSGHGGRVLVFG----VDESRSPY------LIHAHSTKSGCRRIATMDDDL 906
Query: 340 LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-----NLILVGDYARSIALLRY 394
LV A+ + + + + + T F+ + S +V LI V D +SI LL Y
Sbjct: 907 LVIALTKTVVLVRYSETSTTSAKFLKVAAFQTSSYAVDVTVHGKLIAVADIMKSITLLEY 966
Query: 395 QP 396
P
Sbjct: 967 IP 968
Score = 54 (24.1 bits), Expect = 0.00065, Sum P(2) = 0.00065
Identities = 21/93 (22%), Positives = 39/93 (41%)
Query: 306 VPEPGQPLT--KNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAF 363
VP+PG + +++ + P CH+ G +V + +YI + T A
Sbjct: 229 VPDPGSTMMIPVERVETERRHNFRNPARDECHLGGVIVVGESRMLYIDD-QSWTWTETAL 287
Query: 364 IDTEVYIA-SMVSVKNLILVGDYARSIALLRYQ 395
+ V++A + + +L DY + LL Q
Sbjct: 288 KNAMVFVAWAKFDNTHYLLADDYG-GLHLLTIQ 319
>ZFIN|ZDB-GENE-040426-1272 [details] [associations]
symbol:ddb1 "damage specific DNA binding protein
1" species:7955 "Danio rerio" [GO:0005634 "nucleus" evidence=IEA]
[GO:0003676 "nucleic acid binding" evidence=IEA] InterPro:IPR004871
InterPro:IPR015943 Pfam:PF03178 ZFIN:ZDB-GENE-040426-1272
GO:GO:0005634 Gene3D:2.130.10.10 GO:GO:0003676 EMBL:JQ692623
UniGene:Dr.77970 Uniprot:I1XUS8
Length = 1140
Score = 127 (49.8 bits), Expect = 0.00034, P = 0.00034
Identities = 77/352 (21%), Positives = 145/352 (41%)
Query: 75 NEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLA 134
+E+ + G + + +R F +++ VF C P ++ +S +L + + V+ +
Sbjct: 624 SERKKVTLGTQPTVLRTFRSLST-SNVFACSDRPTVIY-SSNHKLVFSNVNLK-EVNYMC 680
Query: 135 PFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCI 194
P ++ P N S L I + +R VPL +P + Y ++ + +
Sbjct: 681 PLNSEGYPDSLALAN-NSTLTIGTIDEIQKLH----IRTVPLYESPKRICYQEVSQCFGV 735
Query: 195 VTSTAEP-----STDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSL---FSPFSWEEIP 246
++S E +T + + + L + S+ P S S S +
Sbjct: 736 LSSRVEMQDASGTTAAVRPSASTQALSSSVSSSKLFPSSTSPHETSFGEEVEVHSLLVVD 795
Query: 247 QTNFPL---HEW-EHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR-GRILLFD 301
Q F + H++ ++ L VS + G + Y +GT Y E+ + GRI++F
Sbjct: 796 QHTFEVLHAHQFLQNEYALSMVSCKL-GRDPAV--YFIVGTAMVYPEEAEPKQGRIIVFH 852
Query: 302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQLKDNDLT 359
T K++ + KE KG V ++ G L+ ++ ++Y W + T
Sbjct: 853 Y-----------TDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTAEKELRT 901
Query: 360 GIAFIDTEVYIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKP 410
+ +A + K + ILVGD RS+ LL Y+P + +ARD+ P
Sbjct: 902 ECNHYNN--IMALYLKTKGDFILVGDLMRSVLLLAYKPMEGSFEEIARDFNP 951
>WB|WBGene00010890 [details] [associations]
symbol:ddb-1 species:6239 "Caenorhabditis elegans"
[GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005634
"nucleus" evidence=IEA] [GO:0040010 "positive regulation of growth
rate" evidence=IMP] [GO:0002119 "nematode larval development"
evidence=IMP] [GO:0040011 "locomotion" evidence=IMP] [GO:0000003
"reproduction" evidence=IMP] [GO:0009792 "embryo development ending
in birth or egg hatching" evidence=IMP] [GO:0006898
"receptor-mediated endocytosis" evidence=IMP] [GO:0040035
"hermaphrodite genitalia development" evidence=IMP] [GO:0030163
"protein catabolic process" evidence=IMP] [GO:0007276 "gamete
generation" evidence=IMP] [GO:0005515 "protein binding"
evidence=IPI] InterPro:IPR004871 Pfam:PF03178 UniPathway:UPA00143
GO:GO:0005634 GO:GO:0009792 GO:GO:0006898 GO:GO:0005737
GO:GO:0040010 GO:GO:0002119 GO:GO:0003677 GO:GO:0006281
GO:GO:0040011 GO:GO:0016567 GO:GO:0007049 GO:GO:0040035
InterPro:IPR011047 SUPFAM:SSF50998 GO:GO:0030163 GO:GO:0007276
eggNOG:NOG247734 HOGENOM:HOG000007241 KO:K10610 OMA:CALGDGS
GeneTree:ENSGT00530000063396 EMBL:Z68507 PIR:A88855 PIR:T23798
RefSeq:NP_502299.1 HSSP:Q16531 ProteinModelPortal:Q21554
DIP:DIP-25884N IntAct:Q21554 MINT:MINT-1055778 STRING:Q21554
PaxDb:Q21554 EnsemblMetazoa:M18.5.1 EnsemblMetazoa:M18.5.2
GeneID:178156 KEGG:cel:CELE_M18.5 UCSC:M18.5 CTD:178156
WormBase:M18.5 InParanoid:Q21554 NextBio:899950 Uniprot:Q21554
Length = 1134
Score = 104 (41.7 bits), Expect = 0.00085, Sum P(2) = 0.00085
Identities = 53/246 (21%), Positives = 100/246 (40%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
VR +P+ + +AY T TY + ++ E + + LVT +
Sbjct: 714 VRSIPMGESVLRIAYQKSTSTYGVCSNRTESKAE--RVFASKNALVTSQSRPKVASTRAD 771
Query: 231 QFHVSLFSPFSWEEIPQTNFPL---HE---WEHVLCLKNVSMEYEGTLSGLRGYIALGTN 284
+ S+ + Q F + HE WE L +S ++ S Y +GT
Sbjct: 772 MDESPPNTTSSFMVLDQNTFQVLHSHEFGPWETALSC--ISGQFTNDSST---YYVVGTG 826
Query: 285 YNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTA 343
Y ++ + GRI++F++ +V ++K++ ++ +G AI + G LV A
Sbjct: 827 LIYPDETETKIGRIVVFEVDDV--------ERSKLRRVHELVVRGSPLAIRILNGKLVAA 878
Query: 344 VGQKIYIWQ-LKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLS 402
+ I +++ D +L V + + + V D RS++LL Y+
Sbjct: 879 INSSIRLFEWTTDKELRLECSSFNHVIALDLKVMNEEVAVADVMRSVSLLSYRMLEGNFE 938
Query: 403 LVARDY 408
VA+D+
Sbjct: 939 EVAKDW 944
Score = 69 (29.3 bits), Expect = 0.00085, Sum P(2) = 0.00085
Identities = 16/88 (18%), Positives = 38/88 (43%)
Query: 533 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 592
+ + G +G + + +K + L+ ++ + + + ++RT+ + P G
Sbjct: 1025 FGTNQGTIGMIVQIDDKWKKFLIAIEKAIADSVKNCMHIEHSSYRTFVFQK--RAEPPSG 1082
Query: 593 IIDGSLVWKFLQLSLGERLEICKKIGSK 620
+DG LV L + ++I K+ K
Sbjct: 1083 FVDGDLVESILDMDRSVAMDILSKVSDK 1110
>UNIPROTKB|Q21554 [details] [associations]
symbol:ddb-1 "DNA damage-binding protein 1" species:6239
"Caenorhabditis elegans" [GO:0005515 "protein binding"
evidence=IPI] [GO:0005737 "cytoplasm" evidence=ISS] [GO:0005634
"nucleus" evidence=ISS] InterPro:IPR004871 Pfam:PF03178
UniPathway:UPA00143 GO:GO:0005634 GO:GO:0009792 GO:GO:0006898
GO:GO:0005737 GO:GO:0040010 GO:GO:0002119 GO:GO:0003677
GO:GO:0006281 GO:GO:0040011 GO:GO:0016567 GO:GO:0007049
GO:GO:0040035 InterPro:IPR011047 SUPFAM:SSF50998 GO:GO:0030163
GO:GO:0007276 eggNOG:NOG247734 HOGENOM:HOG000007241 KO:K10610
OMA:CALGDGS GeneTree:ENSGT00530000063396 EMBL:Z68507 PIR:A88855
PIR:T23798 RefSeq:NP_502299.1 HSSP:Q16531 ProteinModelPortal:Q21554
DIP:DIP-25884N IntAct:Q21554 MINT:MINT-1055778 STRING:Q21554
PaxDb:Q21554 EnsemblMetazoa:M18.5.1 EnsemblMetazoa:M18.5.2
GeneID:178156 KEGG:cel:CELE_M18.5 UCSC:M18.5 CTD:178156
WormBase:M18.5 InParanoid:Q21554 NextBio:899950 Uniprot:Q21554
Length = 1134
Score = 104 (41.7 bits), Expect = 0.00085, Sum P(2) = 0.00085
Identities = 53/246 (21%), Positives = 100/246 (40%)
Query: 171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
VR +P+ + +AY T TY + ++ E + + LVT +
Sbjct: 714 VRSIPMGESVLRIAYQKSTSTYGVCSNRTESKAE--RVFASKNALVTSQSRPKVASTRAD 771
Query: 231 QFHVSLFSPFSWEEIPQTNFPL---HE---WEHVLCLKNVSMEYEGTLSGLRGYIALGTN 284
+ S+ + Q F + HE WE L +S ++ S Y +GT
Sbjct: 772 MDESPPNTTSSFMVLDQNTFQVLHSHEFGPWETALSC--ISGQFTNDSST---YYVVGTG 826
Query: 285 YNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTA 343
Y ++ + GRI++F++ +V ++K++ ++ +G AI + G LV A
Sbjct: 827 LIYPDETETKIGRIVVFEVDDV--------ERSKLRRVHELVVRGSPLAIRILNGKLVAA 878
Query: 344 VGQKIYIWQ-LKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLS 402
+ I +++ D +L V + + + V D RS++LL Y+
Sbjct: 879 INSSIRLFEWTTDKELRLECSSFNHVIALDLKVMNEEVAVADVMRSVSLLSYRMLEGNFE 938
Query: 403 LVARDY 408
VA+D+
Sbjct: 939 EVAKDW 944
Score = 69 (29.3 bits), Expect = 0.00085, Sum P(2) = 0.00085
Identities = 16/88 (18%), Positives = 38/88 (43%)
Query: 533 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 592
+ + G +G + + +K + L+ ++ + + + ++RT+ + P G
Sbjct: 1025 FGTNQGTIGMIVQIDDKWKKFLIAIEKAIADSVKNCMHIEHSSYRTFVFQK--RAEPPSG 1082
Query: 593 IIDGSLVWKFLQLSLGERLEICKKIGSK 620
+DG LV L + ++I K+ K
Sbjct: 1083 FVDGDLVESILDMDRSVAMDILSKVSDK 1110
>DICTYBASE|DDB_G0282569 [details] [associations]
symbol:sf3b3 "splicing factor 3B subunit 3"
species:44689 "Dictyostelium discoideum" [GO:0005634 "nucleus"
evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
[GO:0030532 "small nuclear ribonucleoprotein complex" evidence=ISS]
[GO:0008380 "RNA splicing" evidence=IEA;ISS] [GO:0006461 "protein
complex assembly" evidence=ISS] [GO:0005681 "spliceosomal complex"
evidence=IEA;ISS] [GO:0006397 "mRNA processing" evidence=IEA]
InterPro:IPR017986 InterPro:IPR004871 InterPro:IPR015943
Pfam:PF03178 dictyBase:DDB_G0282569 GO:GO:0006461 GO:GO:0008380
Gene3D:2.130.10.10 SUPFAM:SSF50978 EMBL:AAFI02000047
GenomeReviews:CM000152_GR GO:GO:0006397 GO:GO:0005681 GO:GO:0003676
GO:GO:0030532 eggNOG:NOG247734 KO:K12830 OMA:FDTIPVA
RefSeq:XP_640132.1 STRING:Q54SA7 EnsemblProtists:DDB0233171
GeneID:8623669 KEGG:ddi:DDB_G0282569 ProtClustDB:CLSZ2729005
Uniprot:Q54SA7
Length = 1256
Score = 93 (37.8 bits), Expect = 0.00091, Sum P(2) = 0.00090
Identities = 27/97 (27%), Positives = 50/97 (51%)
Query: 317 KIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSV 376
K++++Y E + PV A+ G LV VG+ I I+ + L + +T+ ++V++
Sbjct: 975 KLELLYKTEVEEPVYAMAQFQGKLVCGVGKSIRIYDMGKKKL--LRKCETKNLPNTIVNI 1032
Query: 377 KNL---ILVGDYARSIALLRYQPEYRTLSLVARDYKP 410
+L ++VGD SI ++Y+ L + A D P
Sbjct: 1033 HSLGDRLVVGDIQESIHFIKYKRSENMLYVFADDLAP 1069
Score = 81 (33.6 bits), Expect = 0.00091, Sum P(2) = 0.00090
Identities = 36/152 (23%), Positives = 69/152 (45%)
Query: 484 EARESNGG-HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGF 542
E+ NG H+L +F +G V T K S + P + Y ++ GA+G
Sbjct: 1117 ESGTLNGAPHKLDHIANFFVGDTVTTLNKT----SLVVGGPE-----VILYTTISGAIGA 1167
Query: 543 FLPLPEK-NYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWK 601
+P + + L+ M + G + A+R+Y Y+ P + IIDG L +
Sbjct: 1168 LIPFTSREDVDFFSTLEMNMRSDCLPLCGRDHLAYRSY----YF---PVKNIIDGDLCEQ 1220
Query: 602 FLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
F L+ ++L I +++ ++++ +L +I +
Sbjct: 1221 FSTLNYQKQLSISEELSRSPSEVIKKLEEIRS 1252
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.321 0.139 0.428 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 638 625 0.00090 120 3 11 22 0.37 34
36 0.45 37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 28
No. of states in DFA: 620 (66 KB)
Total size of DFA: 362 KB (2179 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 49.02u 0.09s 49.11t Elapsed: 00:00:12
Total cpu time: 49.03u 0.09s 49.12t Elapsed: 00:00:12
Start: Thu Aug 15 11:36:39 2013 End: Thu Aug 15 11:36:51 2013