Your job contains 1 sequence.
>000545
MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV
TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS
QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG
PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD
LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS
AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF
SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN
SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN
GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYE
LVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT
ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENST
VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEP
WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF
VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF
LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY
TREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVL
HNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLI
VSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT
RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNAD
NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV
SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD
EQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT
NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH
RPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL
The BLAST search returned 2 gene products which did not match your query constraints. Please see the full BLAST report below for the details.
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 000545
(1432 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Sequences producing High-scoring Segment Pairs (columns: High Score, Smallest Sum Probability P(N), N):
TAIR|locus:2153122 - symbol:CPSF160 "cleavage and polyade... 5068 0. 1
ZFIN|ZDB-GENE-040709-2 - symbol:cpsf1 "cleavage and polya... 789 6.8e-157 4
UNIPROTKB|F1PC28 - symbol:CPSF1 "Uncharacterized protein"... 786 2.9e-156 4
UNIPROTKB|Q10569 - symbol:CPSF1 "Cleavage and polyadenyla... 780 5.7e-155 5
UNIPROTKB|Q10570 - symbol:CPSF1 "Cleavage and polyadenyla... 778 6.8e-154 5
MGI|MGI:2679722 - symbol:Cpsf1 "cleavage and polyadenylat... 788 6.1e-149 4
FB|FBgn0024698 - symbol:Cpsf160 "Cleavage and polyadenyla... 660 5.3e-129 4
UNIPROTKB|F1RSN8 - symbol:CPSF1 "Uncharacterized protein"... 777 1.7e-125 4
DICTYBASE|DDB_G0281585 - symbol:cpsf1 "cleavage and polya... 488 6.1e-115 7
RGD|1306406 - symbol:Cpsf1 "cleavage and polyadenylation ... 652 2.6e-113 4
UNIPROTKB|J9P418 - symbol:CPSF1 "Uncharacterized protein"... 786 9.9e-111 4
WB|WBGene00022301 - symbol:cpsf-1 species:6239 "Caenorhab... 522 3.3e-94 3
UNIPROTKB|Q9N4C2 - symbol:cpsf-1 "Probable cleavage and p... 522 3.3e-94 3
UNIPROTKB|K7GNU1 - symbol:CPSF1 "Uncharacterized protein"... 777 3.9e-83 2
POMBASE|SPBC1709.08 - symbol:cft1 "cleavage factor one Cf... 509 2.0e-65 3
ASPGD|ASPL0000050546 - symbol:AN1413 species:162425 "Emer... 459 2.4e-55 2
CGD|CAL0004251 - symbol:orf19.2760 species:5476 "Candida ... 321 7.8e-43 3
UNIPROTKB|Q5AFT3 - symbol:CFT1 "Protein CFT1" species:237... 321 7.8e-43 3
SGD|S000002709 - symbol:CFT1 "RNA-binding subunit of the ... 278 1.5e-28 4
TAIR|locus:2115909 - symbol:DDB1A "damaged DNA binding pr... 222 8.9e-23 4
TAIR|locus:2127368 - symbol:DDB1B "damaged DNA binding pr... 209 6.7e-22 4
ZFIN|ZDB-GENE-040426-1272 - symbol:ddb1 "damage specific ... 203 3.8e-20 4
MGI|MGI:1202384 - symbol:Ddb1 "damage specific DNA bindin... 208 2.6e-19 5
UNIPROTKB|A1A4K3 - symbol:DDB1 "DNA damage-binding protei... 210 3.2e-19 5
UNIPROTKB|E2R9E3 - symbol:DDB1 "Uncharacterized protein" ... 210 3.2e-19 5
UNIPROTKB|Q16531 - symbol:DDB1 "DNA damage-binding protei... 210 3.2e-19 5
UNIPROTKB|F1RIE2 - symbol:DDB1 "Uncharacterized protein" ... 210 3.2e-19 5
UNIPROTKB|P33194 - symbol:DDB1 "DNA damage-binding protei... 210 3.2e-19 5
UNIPROTKB|Q6P6Z0 - symbol:ddb1 "DNA damage-binding protei... 208 3.7e-19 4
UNIPROTKB|Q5R649 - symbol:DDB1 "DNA damage-binding protei... 208 5.2e-19 5
UNIPROTKB|F5GY55 - symbol:DDB1 "Uncharacterized protein" ... 197 6.2e-19 3
UNIPROTKB|J9NVR7 - symbol:DDB1 "Uncharacterized protein" ... 193 1.6e-18 3
UNIPROTKB|F1P4I8 - symbol:DDB1 "DNA damage-binding protei... 201 3.0e-18 4
UNIPROTKB|Q805F9 - symbol:DDB1 "DNA damage-binding protei... 200 4.1e-18 4
UNIPROTKB|F1NVV3 - symbol:DDB1 "DNA damage-binding protei... 194 4.7e-18 3
UNIPROTKB|F1NVV2 - symbol:DDB1 "DNA damage-binding protei... 194 4.7e-18 3
FB|FBgn0260962 - symbol:pic "piccolo" species:7227 "Droso... 161 1.7e-17 6
RGD|621889 - symbol:Ddb1 "damage-specific DNA binding pro... 198 2.3e-17 5
TAIR|locus:2100616 - symbol:SAP130a "spliceosome-associat... 176 3.1e-13 5
TAIR|locus:2100646 - symbol:SAP130b "spliceosome-associat... 176 3.1e-13 5
WB|WBGene00010890 - symbol:ddb-1 species:6239 "Caenorhabd... 152 8.5e-13 5
UNIPROTKB|Q21554 - symbol:ddb-1 "DNA damage-binding prote... 152 8.5e-13 5
UNIPROTKB|B4DG00 - symbol:DDB1 "cDNA FLJ52436, highly sim... 210 1.3e-12 2
UNIPROTKB|F1M680 - symbol:Ddb1 "DNA damage-binding protei... 209 1.2e-11 2
DICTYBASE|DDB_G0286013 - symbol:repE "UV-damaged DNA bind... 135 4.7e-11 5
FB|FBgn0035162 - symbol:CG13900 species:7227 "Drosophila ... 125 1.6e-09 5
DICTYBASE|DDB_G0282569 - symbol:sf3b3 "splicing factor 3B... 151 5.4e-09 3
UNIPROTKB|E9PT66 - symbol:Sf3b3 "Protein Sf3b3" species:1... 125 3.8e-08 4
POMBASE|SPAPJ698.03c - symbol:prp12 "U2 snRNP-associated ... 117 4.9e-08 5
UNIPROTKB|A0JN52 - symbol:SF3B3 "Splicing factor 3B subun... 125 5.9e-08 5
UNIPROTKB|Q15393 - symbol:SF3B3 "Splicing factor 3B subun... 125 5.9e-08 5
MGI|MGI:1289341 - symbol:Sf3b3 "splicing factor 3b, subun... 125 5.9e-08 5
UNIPROTKB|E2RR33 - symbol:SF3B3 "Uncharacterized protein"... 123 9.4e-08 5
ASPGD|ASPL0000031473 - symbol:AN5452 species:162425 "Emer... 133 1.0e-07 5
WB|WBGene00019323 - symbol:teg-4 species:6239 "Caenorhabd... 149 1.1e-07 5
UNIPROTKB|F5H0Y5 - symbol:DDB1 "DNA damage-binding protei... 143 1.9e-07 1
UNIPROTKB|F1P529 - symbol:SF3B3 "Uncharacterized protein"... 116 2.4e-07 6
ZFIN|ZDB-GENE-040426-2901 - symbol:sf3b3 "splicing factor... 117 4.7e-07 5
GENEDB_PFALCIPARUM|PFL1680w - symbol:PFL1680w "splicing f... 113 8.0e-07 4
UNIPROTKB|Q8I574 - symbol:PFL1680w "Splicing factor 3b, s... 113 8.0e-07 4
RGD|1311636 - symbol:Sf3b3 "splicing factor 3b, subunit 3... 103 2.5e-06 6
CGD|CAL0004426 - symbol:orf19.5391 species:5476 "Candida ... 94 7.6e-06 4
POMBASE|SPAC17H9.10c - symbol:ddb1 "damaged DNA binding p... 103 1.9e-05 4
UNIPROTKB|F1NZF7 - symbol:SF3B3 "Uncharacterized protein"... 124 0.00065 1
>TAIR|locus:2153122
symbol:CPSF160 "cleavage and polyadenylation specificity
factor 160" species:3702 "Arabidopsis thaliana" [GO:0003676
"nucleic acid binding" evidence=IEA] [GO:0005634 "nucleus"
evidence=ISM;IEA;IDA] [GO:0006378 "mRNA polyadenylation"
evidence=ISS] [GO:0006379 "mRNA cleavage" evidence=ISS] [GO:0005515
"protein binding" evidence=IPI] [GO:0005829 "cytosol" evidence=IDA]
[GO:0006397 "mRNA processing" evidence=RCA] [GO:0009909 "regulation
of flower development" evidence=RCA] [GO:0016570 "histone
modification" evidence=RCA] [GO:0048449 "floral organ formation"
evidence=RCA] InterPro:IPR004871 Pfam:PF03178 GO:GO:0005829
GO:GO:0005634 EMBL:CP002688 GenomeReviews:BA000015_GR GO:GO:0006397
GO:GO:0003723 eggNOG:COG5161 KO:K14401 EMBL:AY140902 EMBL:AB025607
EMBL:AB018109 IPI:IPI00533913 RefSeq:NP_199979.2 UniGene:At.43551
IntAct:Q9FGR0 STRING:Q9FGR0 PaxDb:Q9FGR0 PRIDE:Q9FGR0
EnsemblPlants:AT5G51660.1 GeneID:835240 KEGG:ath:AT5G51660
TAIR:At5g51660 HOGENOM:HOG000265012 InParanoid:Q9FGR0 OMA:NIGDNRY
PhylomeDB:Q9FGR0 ProtClustDB:CLSN2680511 Genevestigator:Q9FGR0
GermOnline:AT5G51660 Uniprot:Q9FGR0
Length = 1442
Score = 5068 (1789.1 bits), Expect = 0., P = 0.
Identities = 991/1370 (72%), Positives = 1128/1370 (82%)
Query: 88 KRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEF 147
KR +MDG+ SLELVCHYRLHGNVES+A+L GG ++S+ RDSIIL F DAKISVLEF
Sbjct: 91 KRGGVMDGVYGVSLELVCHYRLHGNVESIAVLPMGGGNSSKGRDSIILTFRDAKISVLEF 150
Query: 148 DDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKAS 207
DDSIH LR+TSMHCFE P+WLHLKRGRESF RGPLVKVDPQGRCGGVLVYGLQMIILK S
Sbjct: 151 DDSIHSLRMTSMHCFEGPDWLHLKRGRESFPRGPLVKVDPQGRCGGVLVYGLQMIILKTS 210
Query: 208 QGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERE 267
Q GSGLVGD+D F SGG SAR+ESS++INLRDL+MKHVKDF+F+HGYIEPV+VIL E E
Sbjct: 211 QVGSGLVGDDDAFSSGGTVSARVESSYIINLRDLEMKHVKDFVFLHGYIEPVIVILQEEE 270
Query: 268 LTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGAN 327
TWAGRVSWKHHTC++SALSI++TLKQHP+IWSA+NLPHDAYKLLAVPSPIGGVLV+ AN
Sbjct: 271 HTWAGRVSWKHHTCVLSALSINSTLKQHPVIWSAINLPHDAYKLLAVPSPIGGVLVLCAN 330
Query: 328 TIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLV 387
TIHYHSQSASCALALNNYA S DSSQELP S+FSVELDAAH TW+ NDVALLSTK+G+L+
Sbjct: 331 TIHYHSQSASCALALNNYASSADSSQELPASNFSVELDAAHGTWISNDVALLSTKSGELL 390
Query: 388 LLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS 447
LLT++YDGR VQRLDLSK+ SVL SDIT++GNSLFFLGSRLGDSLLVQF+C SG +
Sbjct: 391 LLTLIYDGRAVQRLDLSKSKASVLASDITSVGNSLFFLGSRLGDSLLVQFSCRSGPAASL 450
Query: 448 SGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQ--------- 498
GL++E DIE + KRLR +S + N E + +N+ + +
Sbjct: 451 PGLRDEDEDIEGEGHQAKRLRMTSDTFQDTIGNEELSLFGSTPNNSDSAQKSFSFAVRDS 510
Query: 499 -------KTFSFAVR-DSLVNI-GPLKDFSYGLRINADASATG---ISKQS-NYEL---V 542
K F++ +R ++ N G K +Y L + G + +QS E+ V
Sbjct: 511 LVNVGPVKDFAYGLRINADANATGVSKQSNYELVCCSGHGKNGALCVLRQSIRPEMITEV 570
Query: 543 ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES 602
ELPGCKGIWTVYHKSSRGHNADSS+MAA +DEYHAYLIISLEARTMVLETADLLTEVTES
Sbjct: 571 ELPGCKGIWTVYHKSSRGHNADSSKMAADEDEYHAYLIISLEARTMVLETADLLTEVTES 630
Query: 603 VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXXXXXXXXXXXTVL 662
VDY+VQGRTIAAGNLFGRRRVIQVFE GARILDGS+M Q+LSFG TV
Sbjct: 631 VDYYVQGRTIAAGNLFGRRRVIQVFEHGARILDGSFMNQELSFGASNSESNSGSESSTVS 690
Query: 663 SVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWL 722
SVSIADPYVLL M+D SIRLLVGDPSTCTVS+ +P+ +E SK+ +S+CTLYHDKGPEPWL
Sbjct: 691 SVSIADPYVLLRMTDDSIRLLVGDPSTCTVSISSPSVLEGSKRKISACTLYHDKGPEPWL 750
Query: 723 RKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVS 782
RK STDAWLS+GVGEA+D DGGP DQGDIY VVCYESGALEIFDVP+FNCVF+VDKF S
Sbjct: 751 RKASTDAWLSSGVGEAVDSVDGGPQDQGDIYCVVCYESGALEIFDVPSFNCVFSVDKFAS 810
Query: 783 GRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLF 842
GR H+ D + E E E+N +SE+ T KE I + +VVELAMQRWS HH+RPFLF
Sbjct: 811 GRRHLSDMPIHEL----EYELNKNSEDNTSS--KE-IKNTRVVELAMQRWSGHHTRPFLF 863
Query: 843 AILTDGTILCYQAYLFEGPENTSKSDDPXXXXXXXXXXXXXXXXXXXLRFSRTPLDAYTR 902
A+L DGTILCY AYLF+G ++T K+++ L+F R PLD TR
Sbjct: 864 AVLADGTILCYHAYLFDGVDST-KAENSLSSENPAALNSSGSSKLRNLKFLRIPLDTSTR 922
Query: 903 EETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHN 962
E T G QRIT+FKNISGHQGFFLSGSRP WCM+FRERLR H QLCDGSI AFTVLHN
Sbjct: 923 EGTSDGVASQRITMFKNISGHQGFFLSGSRPGWCMLFRERLRFHSQLCDGSIAAFTVLHN 982
Query: 963 VNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVS 1022
VNCNHGFIYVT+QG+LKICQLPS S YDNYWPVQK IPLKATPHQ+TY+AEKNLYPLIVS
Sbjct: 983 VNCNHGFIYVTAQGVLKICQLPSASIYDNYWPVQK-IPLKATPHQVTYYAEKNLYPLIVS 1041
Query: 1023 VPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRA 1082
PV KPLNQVLS L+DQE G Q+DNHN+SS DL RTYTVEE+E++ILEP+R+GGPW+T+A
Sbjct: 1042 YPVSKPLNQVLSSLVDQEAGQQLDNHNMSSDDLQRTYTVEEFEIQILEPERSGGPWETKA 1101
Query: 1083 TIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNP 1142
IPMQ+SE+ALTVRVVTL N +T ENETLLA+GTAYVQGEDVAARGRVLLFS G+N DN
Sbjct: 1102 KIPMQTSEHALTVRVVTLLNASTGENETLLAVGTAYVQGEDVAARGRVLLFSFGKNGDNS 1161
Query: 1143 QNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSL 1202
QN+VTEVYS+ELKGAISA+AS+QGHLLI+SGPKIILHKW GTELNG+AF+DAPPLYVVS+
Sbjct: 1162 QNVVTEVYSRELKGAISAVASIQGHLLISSGPKIILHKWNGTELNGVAFFDAPPLYVVSM 1221
Query: 1203 NIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQ 1262
N+VK+FILLGD+HKSIYFLSWKEQG+QL+LLAKDF SLDCFATEFLIDGSTLSL VSDEQ
Sbjct: 1222 NVVKSFILLGDVHKSIYFLSWKEQGSQLSLLAKDFESLDCFATEFLIDGSTLSLAVSDEQ 1281
Query: 1263 KNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNR 1322
KNIQ+FYYAPKM ESWKG KLLSRAEFHVGAHV+KFLRLQM+++ G+DK NR
Sbjct: 1282 KNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKFLRLQMVSS---------GADKINR 1332
Query: 1323 FALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1382
FALLFGTLDGS GCIAPLDE+TFRRLQSLQKKLVD+VPHVAGLNP +FRQF S+GKA R
Sbjct: 1333 FALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLNPLAFRQFRSSGKARRS 1392
Query: 1383 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSFL 1432
GPDSIVDCELL HYEMLPLEEQLE+AHQ GTTR IL +L DL++GTSFL
Sbjct: 1393 GPDSIVDCELLCHYEMLPLEEQLELAHQIGTTRYSILKDLVDLSVGTSFL 1442
Score = 2091 (741.1 bits), Expect = 1.9e-216, P = 1.9e-216
Identities = 396/545 (72%), Positives = 464/545 (85%)
Query: 1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQT-EELDSELPS-KRGIGPVPNL 58
MSFAAYKMMHWPTG+ NC SG+ITHS +D QIP++ +++++E P+ KRGIGP+PN+
Sbjct: 1 MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNV 60
Query: 59 VVTAANVIEIYVVRVQXXXXXXXXXX-XXTKRRVLMDGISAASLELVCHYRLHGNVESLA 117
V+TAAN++E+Y+VR Q KR +MDG+ SLELVCHYRLHGNVES+A
Sbjct: 61 VITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESIA 120
Query: 118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
+L GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct: 121 VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180
Query: 178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
RGPLVKVDPQGRCGGVLVYGLQMIILK SQ GSGLVGD+D F SGG SAR+ESS++IN
Sbjct: 181 PRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240
Query: 238 LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI++TLKQHP+
Sbjct: 241 LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHPV 300
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP
Sbjct: 301 IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360
Query: 358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
S+FSVELDAAH TW+ NDVALLSTK+G+L+LLT++YDGR VQRLDLSK+ SVL SDIT+
Sbjct: 361 SNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420
Query: 418 IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
+GNSLFFLGSRLGDSLLVQF+C SG + GL++E DIE + KRLR +S D QD
Sbjct: 421 VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRMTS-DTFQD 479
Query: 478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
+ EELSL+GS NN++SAQK+FSFAVRDSLVN+GP+KDF+YGLRINADA+ATG+SKQS
Sbjct: 480 TIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539
Query: 538 NYELV 542
NYELV
Sbjct: 540 NYELV 544
>ZFIN|ZDB-GENE-040709-2
symbol:cpsf1 "cleavage and polyadenylation specific
factor 1" species:7955 "Danio rerio" [GO:0003676 "nucleic acid
binding" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
[GO:0006378 "mRNA polyadenylation" evidence=IMP] [GO:0060216
"definitive hemopoiesis" evidence=IMP] InterPro:IPR004871
Pfam:PF03178 ZFIN:ZDB-GENE-040709-2 GO:GO:0005634 GO:GO:0006378
GO:GO:0003676 GeneTree:ENSGT00550000075040 GO:GO:0060216
EMBL:CU467825 IPI:IPI00932321 Ensembl:ENSDART00000110017
ArrayExpress:F1QCJ8 Bgee:F1QCJ8 Uniprot:F1QCJ8
Length = 1451
Score = 789 (282.8 bits), Expect = 6.8e-157, Sum P(4) = 6.8e-157
Identities = 197/636 (30%), Positives = 326/636 (51%)
Query: 803 INSSSEEGTGQG--RKENIHSMK----VVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
++SS+ + QG +KE + V E+A+ +HSRP+L A + + +L Y+A+
Sbjct: 827 VDSSASQSATQGELKKEEVTRQGDIPLVKEVALVSLGYNHSRPYLLAHV-EQELLIYEAF 885
Query: 857 LFEGPENTSKSDDPXXXXXXXXXXXXXXXXXXXLRFSRTPLDAYTREETPHGAPCQRITI 916
++ + S +R + P + + R
Sbjct: 886 PYDQQQAQSNLK---VRFKKMPHNINYREKKVKVRKDKKP-EGQGEDTLGVKGRVARFRY 941
Query: 917 FKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ 975
F++ISG+ G F+ G P W +V R +R+HP DG+I +F+ HN+NC GF+Y Q
Sbjct: 942 FQDISGYSGVFICGPSPHWMLVTSRGAMRLHPMTIDGAIESFSPFHNINCPKGFLYFNKQ 1001
Query: 976 GILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSL 1035
G L+I LP+ +YD WPV+K IPL+ T H ++Y E +Y + SV +P ++ +
Sbjct: 1002 GELRISVLPTYLSYDAPWPVRK-IPLRCTVHYVSYHVESKVYAVCTSVK--EPCTRIPRM 1058
Query: 1036 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1095
+++ I+ +H +++ ++++ P TR + ++ E+ +
Sbjct: 1059 TGEEKEFETIERDERY---IHPQQ--DKFSIQLISPVSWEAIPNTR--VDLEEWEHVTCM 1111
Query: 1096 RVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----V 1149
+ V L + T + +A+GT +QGE+V RGR+L+ P +T+ +
Sbjct: 1112 KTVALKSQETVSGLKGYVALGTCLMQGEEVTCRGRILILDVIEVVPEPGQPLTKNKFKVL 1171
Query: 1150 YSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFI 1209
Y KE KG ++AL G L+ A G KI L +L G+AF D LY+ + +KNFI
Sbjct: 1172 YEKEQKGPVTALCHCSGFLVSAIGQKIFLWSLKDNDLTGMAFIDTQ-LYIHQMYSIKNFI 1230
Query: 1210 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
L D+ KSI L ++ + L+L+++D L+ ++ EF++D + L +VSD KN+ ++
Sbjct: 1231 LAADVMKSISLLRYQPESKTLSLVSRDAKPLEVYSIEFMVDNNQLGFLVSDRDKNLMVYM 1290
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1329
Y P+ ES+ G +LL RA+F+VG+HV F R+ T A D N+ F T
Sbjct: 1291 YLPEAKESFGGMRLLRRADFNVGSHVNAFWRMPCRGTLDTANKKALTWD--NKHITWFAT 1348
Query: 1330 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1389
LDG +G + P+ E T+RRL LQ L +PH AGLNP++FR H + + + +I+D
Sbjct: 1349 LDGGVGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPKAFRMLHCDRRTLQNAVKNILD 1408
Query: 1390 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
ELL+ Y L E+ E+A + GTT IL +L ++
Sbjct: 1409 GELLNKYLYLSTMERSELAKKIGTTPDIILDDLLEI 1444
Score = 648 (233.2 bits), Expect = 6.8e-157, Sum P(4) = 6.8e-157
Identities = 169/478 (35%), Positives = 265/478 (55%)
Query: 57 NLVVTAANVIEIYVVRVQXXXXXXXXXXXXTKRRVLMDGISAAS-LELVCHYRLHGNVES 115
NLVV A ++YV R+ +K DG S LE V + L GNV S
Sbjct: 29 NLVV--AGTSQLYVYRI------IYDVESTSKSEKSSDGKSRKEKLEQVASFSLFGNVMS 80
Query: 116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
+A + G + RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 81 MASVQLVGTN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFV 133
Query: 176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHV 235
P+V+VDP+ RC +LVYG +++L + + DE G G + S++
Sbjct: 134 QNVHIPMVRVDPENRCAVMLVYGTCLVVLPFRKD---TLADEQEGIVGEGQKSSFLPSYI 190
Query: 236 INLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
I++R+LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ K
Sbjct: 191 IDVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQK 250
Query: 294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSS 352
HP+IWS NLP D +++AVP PIGGV+V N++ Y +QS ++LN+ +
Sbjct: 251 VHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLLYLNQSVPPFGVSLNSLTNGTTAF 310
Query: 353 QELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVL 411
P+ + LD + A+++ +D ++S K G++ +LT++ DG R V+ K SVL
Sbjct: 311 PLRPQEEVKITLDCSQASFITSDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVL 370
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS 471
T+ + T+ FLGSRLG+SLL+++T + + G + E + + + P+ K+ R S
Sbjct: 371 TTCMMTMEPGYLFLGSRLGNSLLLRYTEKLQETPMEEGKENEEKEKQEEPPNKKK-RVDS 429
Query: 472 SDA-------LQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ A L D + +E+ +YGS A + T+ A T+SF V DS++NIGP S G
Sbjct: 430 NWAGCPGKGNLPDEL--DEIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCASASMG 483
Score = 162 (62.1 bits), Expect = 6.8e-157, Sum P(4) = 6.8e-157
Identities = 54/190 (28%), Positives = 90/190 (47%)
Query: 543 ELPGCKGIWTVYH-------KSSRGHNA---DSSRMAAYDDEY--HAYLIISLEARTMVL 590
ELPGC +WTV + S+ G + R +D+ H +LI+S E TM+L
Sbjct: 530 ELPGCHDMWTVIYCEEKPEKPSAEGDGESPEEEKREPTIEDDKKKHGFLILSREDSTMIL 589
Query: 591 ETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXX 650
+T + E+ S + QG T+ AGN+ + +IQV G R+L+G L F P
Sbjct: 590 QTGQEIMELDTS-GFATQGPTVYAGNIGDNKYIIQVSPMGIRLLEG---VNQLHFIPVDL 645
Query: 651 XXXXXXXXXTVLSVSIADPYVLLGMSDGSIRLLV--GDP---STCTVSVQTPAAIESSKK 705
++ S+ADPYV++ ++G + + V D + +++Q P I + +
Sbjct: 646 GS-------PIVHCSVADPYVVIMTAEGVVTMFVLKNDSYMGKSHRLALQKPQ-IHTQSR 697
Query: 706 PVSSCTLYHD 715
++ C Y D
Sbjct: 698 VITLCA-YRD 706
Score = 113 (44.8 bits), Expect = 6.8e-157, Sum P(4) = 6.8e-157
Identities = 40/149 (26%), Positives = 71/149 (47%)
Query: 712 LYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGP-LDQGDIYSVVCYESGALEIFDVPN 770
LY + P K + S A G + G + + ++ E+G +EI+ +P+
Sbjct: 751 LYGESNPLTSPNKEESSRG-SAAASSAHTGKESGSGRQEPSHWCLLVRENGVMEIYQLPD 809
Query: 771 FNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQ 830
+ VF V F G+ +VD+ + S T+ EE T QG +I +K E+A+
Sbjct: 810 WRLVFLVKNFPVGQRVLVDS----SASQSATQGELKKEEVTRQG---DIPLVK--EVALV 860
Query: 831 RWSAHHSRPFLFAILTDGTILCYQAYLFE 859
+HSRP+L A + + +L Y+A+ ++
Sbjct: 861 SLGYNHSRPYLLAHV-EQELLIYEAFPYD 888
Score = 48 (22.0 bits), Expect = 1.9e-81, Sum P(3) = 1.9e-81
Identities = 15/49 (30%), Positives = 26/49 (53%)
Query: 283 ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY 331
I L S Q P +++ N+ + Y ++ V SP+G L+ G N +H+
Sbjct: 595 IMELDTSGFATQGPTVYAG-NIGDNKY-IIQV-SPMGIRLLEGVNQLHF 640
Score = 39 (18.8 bits), Expect = 8.8e-137, Sum P(3) = 8.8e-137
Identities = 9/30 (30%), Positives = 17/30 (56%)
Query: 515 LKDFSYGLRINADASATGISKQSNYELVEL 544
+K+F G R+ D+SA+ + Q + E+
Sbjct: 816 VKNFPVGQRVLVDSSASQSATQGELKKEEV 845
Score = 39 (18.8 bits), Expect = 9.8e-70, Sum P(3) = 9.8e-70
Identities = 7/19 (36%), Positives = 12/19 (63%)
Query: 966 NHGFIYVTSQGILKICQLP 984
+H + V G+++I QLP
Sbjct: 790 SHWCLLVRENGVMEIYQLP 808
>UNIPROTKB|F1PC28
symbol:CPSF1 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0006379 "mRNA cleavage" evidence=IEA]
[GO:0006378 "mRNA polyadenylation" evidence=IEA] [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=IEA] [GO:0003730 "mRNA 3'-UTR binding" evidence=IEA]
InterPro:IPR004871 Pfam:PF03178 GO:GO:0006378 GO:GO:0003730
GO:GO:0005847 GO:GO:0006379 GeneTree:ENSGT00550000075040
OMA:NIGDNRY EMBL:AAEX03008966 ProteinModelPortal:F1PC28
Ensembl:ENSCAFT00000002514 Uniprot:F1PC28
Length = 1398
Score = 786 (281.7 bits), Expect = 2.9e-156, Sum P(4) = 2.9e-156
Identities = 209/638 (32%), Positives = 320/638 (50%)
Query: 801 TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
T+ + EE T QG + + +V L ++ SRP+L + D +L Y+A+
Sbjct: 783 TQAEARKEEATRQGELPLVKEVLLVALGSRQ-----SRPYLL-VHVDQELLIYEAF---- 832
Query: 861 PENTSKSDDPXXXXXXXXXXXXXXXXXXXLRFSRTPLDAYTREETPHGAPCQ--RITIFK 918
P + S+ + S+ + EE GA + R F+
Sbjct: 833 PHD-SQLGQGNLKVRFKKVPHNINFREKKPKPSKKKAEGGGAEEGA-GARGRVARFRYFE 890
Query: 919 NISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
+I G+ G F+ G P W +V R LR+HP DG I +F HNVNC GF+Y QG
Sbjct: 891 DIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGE 950
Query: 978 LKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1037
L+I LP+ +YD WPV+K IPL+ T H + Y E +Y + S + P + I
Sbjct: 951 LRISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNM--PCTR-----I 1002
Query: 1038 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTV 1095
+ G + + + D + E + ++++ P W+ A I ++ E+ +
Sbjct: 1003 PRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCM 1058
Query: 1096 RVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----V 1149
+ V+L + T + +A GT +QGE+V RGR+L+ P +T+ +
Sbjct: 1059 KTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVL 1118
Query: 1150 YSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFI 1209
Y KE KG ++AL GHL+ A G KI L +EL G+AF D LY+ + VKNFI
Sbjct: 1119 YEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFI 1177
Query: 1210 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
L D+ KSI L ++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++
Sbjct: 1178 LAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYM 1237
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFA 1324
Y P+ ES+ G +LL RA+FHVGAHV F R GAA G K N+
Sbjct: 1238 YLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPC-------RGAAEGPSKKSVVWENKHI 1290
Query: 1325 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1384
F TLDG IG + P+ E T+RRL LQ L +PH AGLNPR+FR H + + +
Sbjct: 1291 TWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAV 1350
Query: 1385 DSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
+++D ELL+ Y L E+ E+A + GTT IL +L
Sbjct: 1351 RNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDL 1388
Score = 640 (230.4 bits), Expect = 2.9e-156, Sum P(4) = 2.9e-156
Identities = 158/431 (36%), Positives = 240/431 (55%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
LELV + GNV S+A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H
Sbjct: 23 LELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 78
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
FE PE L+ G P V+VDP GRC +L+YG ++++L + + +E
Sbjct: 79 YFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEG 132
Query: 221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
G G + S++I++R LD K ++ D F+HGY EP ++IL E TW GRV+ +
Sbjct: 133 LMGEGQRSSFLPSYIIDVRGLDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQ 192
Query: 279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS- 337
TC I A+S++ T K HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS
Sbjct: 193 DTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPP 252
Query: 338 CALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
+ALN + + + LD A A ++ D ++S K G++ +LT++ DG R
Sbjct: 253 YGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMR 312
Query: 397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
V+ K SVLT+ + T+ FLGSRLG+SLL+++T S+ E D
Sbjct: 313 SVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAA--REAAD 370
Query: 457 IEADAPSTKRLRRSSS-----DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLV 510
E KR+ ++ QD V +E+ +YGS A + T+ A T+SF V DS++
Sbjct: 371 KEEPPSKKKRVDCAAGWSGGKSVPQDEV--DEIEVYGSEAQSGTQLA--TYSFEVCDSIL 426
Query: 511 NIGPLKDFSYG 521
NIGP + + G
Sbjct: 427 NIGPCANAAMG 437
Score = 176 (67.0 bits), Expect = 2.9e-156, Sum P(4) = 2.9e-156
Identities = 49/152 (32%), Positives = 79/152 (51%)
Query: 543 ELPGCKGIWTVY-------HKSSRGHNAD--SSRMAAYDD-EYHAYLIISLEARTMVLET 592
ELPGC +WTV ++S+G A+ SS + A DD H +LI+S E TM+L+T
Sbjct: 484 ELPGCYDMWTVIAPVRKEQEETSKGEVAEQESSALEAEDDGRRHGFLILSREDSTMILQT 543
Query: 593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXXXX 652
+ E+ S + QG T+ AGN+ R ++QV G R+L+G L F P
Sbjct: 544 GQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDLGS 599
Query: 653 XXXXXXXTVLSVSIADPYVLLGMSDGSIRLLV 684
++ ++ADPYV++ ++G + + +
Sbjct: 600 -------PIVQCAVADPYVVIMSAEGHVTMFL 624
Score = 102 (41.0 bits), Expect = 2.9e-156, Sum P(4) = 2.9e-156
Identities = 28/98 (28%), Positives = 50/98 (51%)
Query: 759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T QG
Sbjct: 745 ENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQAEARKEEATRQGELPL 800
Query: 819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
+ + +V L ++ SRP+L + D +L Y+A+
Sbjct: 801 VKEVLLVALGSRQ-----SRPYLL-VHVDQELLIYEAF 832
Score = 49 (22.3 bits), Expect = 3.0e-80, Sum P(3) = 3.0e-80
Identities = 21/74 (28%), Positives = 36/74 (48%)
Query: 283 ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY-----HSQSAS 337
I L S Q P +++ N+ + Y ++ V SP+G L+ G N +H+ S
Sbjct: 547 IMELDTSGFATQGPTVFAG-NIGDNRY-IVQV-SPLGIRLLEGVNQLHFIPVDLGSPIVQ 603
Query: 338 CALALNNYAVSLDS 351
CA+A + Y V + +
Sbjct: 604 CAVA-DPYVVIMSA 616
>UNIPROTKB|Q10569
symbol:CPSF1 "Cleavage and polyadenylation specificity
factor subunit 1" species:9913 "Bos taurus" [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=ISS] [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006378
"mRNA polyadenylation" evidence=IEA] [GO:0003730 "mRNA 3'-UTR
binding" evidence=IEA] InterPro:IPR004871 Pfam:PF03178
GO:GO:0006378 GO:GO:0003730 eggNOG:COG5161 KO:K14401 GO:GO:0005847
GO:GO:0006379 GeneTree:ENSGT00550000075040 OMA:NIGDNRY EMBL:X83097
IPI:IPI00713487 PIR:S57335 RefSeq:NP_777145.1 UniGene:Bt.4911
STRING:Q10569 PRIDE:Q10569 Ensembl:ENSBTAT00000011004 GeneID:282703
KEGG:bta:282703 CTD:29894 HOGENOM:HOG000007904 HOVERGEN:HBG051105
InParanoid:Q10569 OrthoDB:EOG4BCDM3 NextBio:20806363
ArrayExpress:Q10569 Uniprot:Q10569
Length = 1444
Score = 780 (279.6 bits), Expect = 5.7e-155, Sum P(5) = 5.7e-155
Identities = 206/637 (32%), Positives = 315/637 (49%)
Query: 801 TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
T+ + EE T QG + + +V L ++ RP+L + D +L Y+A+
Sbjct: 829 TQGEARKEEATRQGELPLVKEVLLVALGSRQ-----RRPYLL-VHVDQELLIYEAF---- 878
Query: 861 PENTSKSDDPXXXXXXXXXXXXXXXXXXXLRFSRTPLDAYTREET-PHGAPCQRITIFKN 919
P ++ + T E T P G R F++
Sbjct: 879 PHDSQLGQGNLKVRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVA-RFRYFED 937
Query: 920 ISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGIL 978
I G+ G F+ G P W +V R LR+HP DG I +F HN+NC GF+Y QG L
Sbjct: 938 IYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGEL 997
Query: 979 KICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLID 1038
+I LP+ +YD WPV+K IPL+ T H + Y E +Y + S P +V + +
Sbjct: 998 RISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATSTST--PCTRVPRMTGE 1054
Query: 1039 QEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVR 1096
++ I+ +H E + ++++ P W+ A I ++ E+ ++
Sbjct: 1055 EKEFETIERDERY---VHPQQ--EAFCIQLISPVS----WEAIPNARIELEEWEHVTCMK 1105
Query: 1097 VVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VY 1150
V+L + T + +A GT +QGE+V RGR+L+ P +T+ +Y
Sbjct: 1106 TVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLY 1165
Query: 1151 SKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFIL 1210
KE KG ++AL GHL+ A G KI L +EL G+AF D LY+ + VKNFIL
Sbjct: 1166 EKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFIL 1224
Query: 1211 LGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYY 1270
D+ KSI L ++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y
Sbjct: 1225 AADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMY 1284
Query: 1271 APKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFAL 1325
P+ ES+ G +LL RA+FHVGAHV F R GAA G K N+
Sbjct: 1285 LPEAKESFGGMRLLRRADFHVGAHVNTFWRTPC-------RGAAEGPSKKSVVWENKHIT 1337
Query: 1326 LFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPD 1385
F TLDG IG + P+ E T+RRL LQ L +PH AGLNPR+FR H + + +
Sbjct: 1338 WFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVR 1397
Query: 1386 SIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
+++D ELL+ Y L E+ E+A + GTT IL +L
Sbjct: 1398 NVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDL 1434
Score = 651 (234.2 bits), Expect = 5.7e-155, Sum P(5) = 5.7e-155
Identities = 167/475 (35%), Positives = 255/475 (53%)
Query: 57 NLVVTAANVIEIYVVRVQXXXXXXXXXXXXTKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ T + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDSEAPTKNDRSTDGKAHRE--HREKLELVASFSFFGNVMSM 84
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 85 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLVGEGQRSSFLPSYII 194
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 195 DVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 254
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 255 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 314
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 315 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 374
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T S+ E D E KR+ +
Sbjct: 375 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA--REAADKEEPPSKKKRVDATTG 432
Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
S QD V +E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 433 WSGSKSVPQDEV--DEIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMG 483
Score = 160 (61.4 bits), Expect = 5.7e-155, Sum P(5) = 5.7e-155
Identities = 63/223 (28%), Positives = 103/223 (46%)
Query: 543 ELPGCKGIWTVY-------HKSSRGHNADSSRMA--AYDD-EYHAYLIISLEARTMVLET 592
ELPGC +WTV ++ +G + A A DD H +LI+S E TM+L+T
Sbjct: 530 ELPGCYDMWTVIAPVRKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMILQT 589
Query: 593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXXXX 652
+ E+ S + QG T+ AGN+ R ++QV G R+L+G L F P
Sbjct: 590 GQEIMELDAS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDLGS 645
Query: 653 XXXXXXXTVLSVSIADPYVLLGMSDGSIR--LLVGDP---STCTVSVQTPAAIESSKKPV 707
++ ++ADPYV++ ++G + LL D +++ P + K +
Sbjct: 646 -------PIVQCAVADPYVVIMSAEGHVTMFLLKNDSYGGRHHRLALHKPP-LHHQSKVI 697
Query: 708 SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQG 750
+ C +Y D +T++ L GV + + G GGP +G
Sbjct: 698 TLC-VYRDVSG-----MFTTESRLG-GVRDELGGR-GGPEAEG 732
Score = 98 (39.6 bits), Expect = 5.7e-155, Sum P(5) = 5.7e-155
Identities = 28/98 (28%), Positives = 48/98 (48%)
Query: 759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
E+GA+EI+ +P++ VF V F G+ +VD+ + E EE T QG
Sbjct: 791 ENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEAR----KEEATRQGELPL 846
Query: 819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
+ + +V L ++ RP+L + D +L Y+A+
Sbjct: 847 VKEVLLVALGSRQ-----RRPYLL-VHVDQELLIYEAF 878
Score = 49 (22.3 bits), Expect = 1.6e-79, Sum P(4) = 1.6e-79
Identities = 21/74 (28%), Positives = 36/74 (48%)
Query: 283 ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY-----HSQSAS 337
I L S Q P +++ N+ + Y ++ V SP+G L+ G N +H+ S
Sbjct: 593 IMELDASGFATQGPTVFAG-NIGDNRY-IVQV-SPLGIRLLEGVNQLHFIPVDLGSPIVQ 649
Query: 338 CALALNNYAVSLDS 351
CA+A + Y V + +
Sbjct: 650 CAVA-DPYVVIMSA 662
Score = 48 (22.0 bits), Expect = 5.7e-155, Sum P(5) = 5.7e-155
Identities = 16/52 (30%), Positives = 25/52 (48%)
Query: 3 FAAYKMMHWPTGIA-NCGSGFITHSRADYV----PQIPLIQTEELDSELPSK 49
+A YK H PTG+ + F +S + V Q+ + + DSE P+K
Sbjct: 2 YAVYKQAHPPTGLEFSMYCNFFNNSERNLVVAGTSQLYVYRLNR-DSEAPTK 52
>UNIPROTKB|Q10570
symbol:CPSF1 "Cleavage and polyadenylation specificity
factor subunit 1" species:9606 "Homo sapiens" [GO:0003730 "mRNA
3'-UTR binding" evidence=IDA] [GO:0006379 "mRNA cleavage"
evidence=IDA] [GO:0006378 "mRNA polyadenylation" evidence=IDA]
[GO:0005847 "mRNA cleavage and polyadenylation specificity factor
complex" evidence=IDA] [GO:0005515 "protein binding" evidence=IPI]
[GO:0000398 "mRNA splicing, via spliceosome" evidence=TAS]
[GO:0005654 "nucleoplasm" evidence=TAS] [GO:0006366 "transcription
from RNA polymerase II promoter" evidence=TAS] [GO:0006369
"termination of RNA polymerase II transcription" evidence=TAS]
[GO:0006397 "mRNA processing" evidence=TAS] [GO:0006406 "mRNA
export from nucleus" evidence=TAS] [GO:0008380 "RNA splicing"
evidence=TAS] [GO:0010467 "gene expression" evidence=TAS]
[GO:0031124 "mRNA 3'-end processing" evidence=TAS]
Reactome:REACT_71 InterPro:IPR004871 Pfam:PF03178
Reactome:REACT_1675 GO:GO:0006378 GO:GO:0003730 GO:GO:0006406
GO:GO:0000398 Reactome:REACT_1788 eggNOG:COG5161 KO:K14401
GO:GO:0005847 GO:GO:0006379 GO:GO:0006369 Reactome:REACT_78
OMA:NIGDNRY CTD:29894 HOGENOM:HOG000007904 HOVERGEN:HBG051105
OrthoDB:EOG4BCDM3 EMBL:U37012 EMBL:BC017232 IPI:IPI00026219
RefSeq:NP_037423.2 UniGene:Hs.493202 ProteinModelPortal:Q10570
DIP:DIP-32694N IntAct:Q10570 MINT:MINT-1601544 STRING:Q10570
PhosphoSite:Q10570 DMDM:23503048 PaxDb:Q10570 PeptideAtlas:Q10570
PRIDE:Q10570 DNASU:29894 Ensembl:ENST00000349769
Ensembl:ENST00000568627 GeneID:29894 KEGG:hsa:29894 UCSC:uc003zcj.3
GeneCards:GC08M145618 HGNC:HGNC:2324 MIM:606027 neXtProt:NX_Q10570
PharmGKB:PA26841 InParanoid:Q10570 PhylomeDB:Q10570 ChiTaRS:CPSF1
GenomeRNAi:29894 NextBio:52452 ArrayExpress:Q10570 Bgee:Q10570
CleanEx:HS_CPSF1 Genevestigator:Q10570 GermOnline:ENSG00000071894
Uniprot:Q10570
Length = 1443
Score = 778 (278.9 bits), Expect = 6.8e-154, Sum P(5) = 6.8e-154
Identities = 203/633 (32%), Positives = 319/633 (50%)
Query: 801 TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
T+ + EE T QG + + +V L ++ SRP+L + D +L Y+A+
Sbjct: 828 TQGEARREEATRQGELPLVKEVLLVALGSRQ-----SRPYLL-VHVDQELLIYEAF---- 877
Query: 861 PENTSKSDDPXXXXXXXXXXXXXXXXXXXLRFSRTPLDAYTREETPHGAPCQ--RITIFK 918
P + S+ + S+ + EE GA + R F+
Sbjct: 878 PHD-SQLGQGNLKVRFKKVPHNINFREKKPKPSKKKAEGGGAEEGA-GARGRVARFRYFE 935
Query: 919 NISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
+I G+ G F+ G P W +V R LR+HP DG + +F HNVNC GF+Y QG
Sbjct: 936 DIYGYSGVFICGPSPHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGE 995
Query: 978 LKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1037
L+I LP+ +YD WPV+K IPL+ T H + Y E +Y + S P ++ +
Sbjct: 996 LRISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCARIPRMTG 1052
Query: 1038 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTV 1095
+++ I+ +H E + ++++ P W+ A I +Q E+ +
Sbjct: 1053 EEKEFETIERDERY---IHPQQ--EAFSIQLISPVS----WEAIPNARIELQEWEHVTCM 1103
Query: 1096 RVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----V 1149
+ V+L + T + +A GT +QGE+V RGR+L+ P +T+ +
Sbjct: 1104 KTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVL 1163
Query: 1150 YSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFI 1209
Y KE KG ++AL GHL+ A G KI L +EL G+AF D LY+ + VKNFI
Sbjct: 1164 YEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFI 1222
Query: 1210 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
L D+ KSI L ++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++
Sbjct: 1223 LAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYM 1282
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGT 1329
Y P+ ES+ G +LL RA+FHVGAHV F R + + + + N+ F T
Sbjct: 1283 YLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSKKSVVWE--NKHITWFAT 1340
Query: 1330 LDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVD 1389
LDG IG + P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D
Sbjct: 1341 LDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLD 1400
Query: 1390 CELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
ELL+ Y L E+ E+A + GTT IL +L
Sbjct: 1401 GELLNRYLYLSTMERSELAKKIGTTPDIILDDL 1433
Score = 651 (234.2 bits), Expect = 6.8e-154, Sum P(5) = 6.8e-154
Identities = 166/474 (35%), Positives = 257/474 (54%)
Query: 57 NLVVTAANVIEIYVVRVQXXXXXXXXXXXXTKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ T+ + + LEL + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +LVYG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRES---LAEEHEGLVGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV+V N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A AT++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPSTKRLR 468
+ + T+ FLGSRLG+SLL+++T +++ + KEE + +T
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDATAGWS 431
Query: 469 RSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ QD V +E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 432 AAGKSVPQDEV--DEIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAVG 481
Score = 158 (60.7 bits), Expect = 6.8e-154, Sum P(5) = 6.8e-154
Identities = 46/153 (30%), Positives = 75/153 (49%)
Query: 543 ELPGCKGIWTVY-------HKSSRGHNAD---SSRMAAYDD-EYHAYLIISLEARTMVLE 591
ELPGC +WTV + +G + S+ A DD H +LI+S E TM+L+
Sbjct: 528 ELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQ 587
Query: 592 TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXXX 651
T + E+ S + QG T+ AGN+ R ++QV G R+L+G L F P
Sbjct: 588 TGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDLG 643
Query: 652 XXXXXXXXTVLSVSIADPYVLLGMSDGSIRLLV 684
++ ++ADPYV++ ++G + + +
Sbjct: 644 A-------PIVQCAVADPYVVIMSAEGHVTMFL 669
Score = 98 (39.6 bits), Expect = 6.8e-154, Sum P(5) = 6.8e-154
Identities = 28/98 (28%), Positives = 48/98 (48%)
Query: 759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
E+G +EI+ +P++ VF V F G+ +VD+ + E EE T QG
Sbjct: 790 ENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEAR----REEATRQGELPL 845
Query: 819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
+ + +V L ++ SRP+L + D +L Y+A+
Sbjct: 846 VKEVLLVALGSRQ-----SRPYLL-VHVDQELLIYEAF 877
Score = 47 (21.6 bits), Expect = 1.9e-78, Sum P(4) = 1.9e-78
Identities = 15/49 (30%), Positives = 26/49 (53%)
Query: 283 ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY 331
I L S Q P +++ N+ + Y ++ V SP+G L+ G N +H+
Sbjct: 592 IMELDTSGFATQGPTVFAG-NIGDNRY-IVQV-SPLGIRLLEGVNQLHF 637
Score = 42 (19.8 bits), Expect = 6.8e-154, Sum P(5) = 6.8e-154
Identities = 7/13 (53%), Positives = 9/13 (69%)
Query: 3 FAAYKMMHWPTGI 15
+A YK H PTG+
Sbjct: 2 YAVYKQAHPPTGL 14
>MGI|MGI:2679722
symbol:Cpsf1 "cleavage and polyadenylation specific factor
1" species:10090 "Mus musculus" [GO:0003676 "nucleic acid binding"
evidence=IEA] [GO:0003723 "RNA binding" evidence=IEA] [GO:0003730
"mRNA 3'-UTR binding" evidence=ISO] [GO:0005634 "nucleus"
evidence=IEA] [GO:0005847 "mRNA cleavage and polyadenylation
specificity factor complex" evidence=ISO] [GO:0006378 "mRNA
polyadenylation" evidence=ISO] [GO:0006379 "mRNA cleavage"
evidence=ISO] [GO:0006397 "mRNA processing" evidence=IEA]
InterPro:IPR004871 Pfam:PF03178 MGI:MGI:2679722 GO:GO:0006378
GO:GO:0003730 eggNOG:COG5161 KO:K14401 GO:GO:0005847 GO:GO:0006379
GeneTree:ENSGT00550000075040 OMA:NIGDNRY CTD:29894
HOGENOM:HOG000007904 HOVERGEN:HBG051105 OrthoDB:EOG4BCDM3
EMBL:AF322193 EMBL:BC056388 IPI:IPI00110363 RefSeq:NP_001157645.1
RefSeq:NP_444423.1 UniGene:Mm.45141 ProteinModelPortal:Q9EPU4
STRING:Q9EPU4 PhosphoSite:Q9EPU4 PaxDb:Q9EPU4 PRIDE:Q9EPU4
Ensembl:ENSMUST00000071898 GeneID:94230 KEGG:mmu:94230
UCSC:uc007wky.2 InParanoid:Q9EPU4 NextBio:352239 Bgee:Q9EPU4
CleanEx:MM_CPSF1 Genevestigator:Q9EPU4
GermOnline:ENSMUSG00000034022 Uniprot:Q9EPU4
Length = 1441
Score = 788 (282.4 bits), Expect = 6.1e-149, Sum P(4) = 6.1e-149
Identities = 207/630 (32%), Positives = 315/630 (50%)
Query: 808 EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKS 867
EE T QG + + +V L ++ SRP+L + D +L Y+A+ P + S+
Sbjct: 833 EEATRQGELPLVKEVLLVALGSRQ-----SRPYLL-VHVDQELLIYEAF----PHD-SQL 881
Query: 868 DDPXXXXXXXXXXXXXXXXXXXLRFSRTPLDAYTREETPHG-APCQRITIFKNISGHQGF 926
+ S+ + + EE G R F++I G+ G
Sbjct: 882 GQGNLKVRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGV 941
Query: 927 FLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPS 985
F+ G P W +V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+
Sbjct: 942 FICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPA 1001
Query: 986 GSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQI 1045
+YD WPV+K IPL+ T H + Y E +Y + S P + I + G +
Sbjct: 1002 YLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEK 1053
Query: 1046 DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNT 1103
+ + D + E + ++++ P W+ A I ++ E+ ++ V+L +
Sbjct: 1054 EFEAIERDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSE 1109
Query: 1104 TTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGA 1157
T + +A GT +QGE+V RGR+L+ P +T+ +Y KE KG
Sbjct: 1110 ETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGP 1169
Query: 1158 ISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKS 1217
++AL GHL+ A G KI L +EL G+AF D LY+ + VKNFIL D+ KS
Sbjct: 1170 VTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAADVMKS 1228
Query: 1218 IYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES 1277
I L ++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P+ ES
Sbjct: 1229 ISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKES 1288
Query: 1278 WKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLFGTLDG 1332
+ G +LL RA+FHVGAHV F R GAA G K N+ F TLDG
Sbjct: 1289 FGGMRLLRRADFHVGAHVNTFWRTPC-------RGAAEGPSKKSVVWENKHITWFATLDG 1341
Query: 1333 SIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1392
IG + P+ E T+RRL LQ L +PH AGLNPR+FR H + + + +++D EL
Sbjct: 1342 GIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGEL 1401
Query: 1393 LSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
L+ Y L E+ E+A + GTT IL +L
Sbjct: 1402 LNRYLYLSTMERSELAKKIGTTPDIILDDL 1431
Score = 648 (233.2 bits), Expect = 6.1e-149, Sum P(4) = 6.1e-149
Identities = 167/475 (35%), Positives = 255/475 (53%)
Query: 57 NLVVTAANVIEIYVVRVQXXXXXXXXXXXXTKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ +
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429
Query: 471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
QD V +E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTVPQDEV--DEIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVG 480
Score = 297 (109.6 bits), Expect = 6.5e-96, Sum P(4) = 6.5e-96
Identities = 82/266 (30%), Positives = 124/266 (46%)
Query: 759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
E+G +EI+ +P++ VF V F G+ +VD+ + E EE T QG
Sbjct: 788 ENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATRQGELPL 843
Query: 819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPXXXXXXXX 878
+ + +V L ++ SRP+L + D +L Y+A+ P + S+
Sbjct: 844 VKEVLLVALGSRQ-----SRPYLL-VHVDQELLIYEAF----PHD-SQLGQGNLKVRFKK 892
Query: 879 XXXXXXXXXXXLRFSRTPLDAYTREETPHG-APCQRITIFKNISGHQGFFLSGSRPCWCM 937
+ S+ + + EE G R F++I G+ G F+ G P W +
Sbjct: 893 VPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPSPHWLL 952
Query: 938 VF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQ 996
V R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD WPV+
Sbjct: 953 VTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVR 1012
Query: 997 KVIPLKATPHQITYFAEKNLYPLIVS 1022
K IPL+ T H + Y E +Y + S
Sbjct: 1013 K-IPLRCTAHYVAYHVESKVYAVATS 1037
Score = 158 (60.7 bits), Expect = 6.1e-149, Sum P(4) = 6.1e-149
Identities = 45/152 (29%), Positives = 72/152 (47%)
Query: 543 ELPGCKGIWTVYH----------KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLET 592
ELPGC +WTV K+ S+ A D H +LI+S E TM+L+T
Sbjct: 527 ELPGCYDMWTVIAPVRKEEEETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQT 586
Query: 593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXXXX 652
+ E+ S + QG T+ AGN+ R ++QV G R+L+G L F P
Sbjct: 587 GQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDLGA 642
Query: 653 XXXXXXXTVLSVSIADPYVLLGMSDGSIRLLV 684
++ ++ADPYV++ ++G + + +
Sbjct: 643 -------PIVQCAVADPYVVIMSAEGHVTMFL 667
Score = 47 (21.6 bits), Expect = 7.9e-74, Sum P(3) = 7.9e-74
Identities = 15/49 (30%), Positives = 26/49 (53%)
Query: 283 ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY 331
I L S Q P +++ N+ + Y ++ V SP+G L+ G N +H+
Sbjct: 590 IMELDTSGFATQGPTVFAG-NIGDNRY-IVQV-SPLGIRLLEGVNQLHF 635
Score = 42 (19.8 bits), Expect = 6.1e-149, Sum P(4) = 6.1e-149
Identities = 7/13 (53%), Positives = 9/13 (69%)
Query: 3 FAAYKMMHWPTGI 15
+A YK H PTG+
Sbjct: 2 YAVYKQAHPPTGL 14
>FB|FBgn0024698
symbol:Cpsf160 "Cleavage and polyadenylation specificity
factor 160" species:7227 "Drosophila melanogaster" [GO:0005847
"mRNA cleavage and polyadenylation specificity factor complex"
evidence=ISS;NAS] [GO:0006378 "mRNA polyadenylation"
evidence=ISS;NAS] [GO:0006379 "mRNA cleavage" evidence=ISS;NAS]
[GO:0003730 "mRNA 3'-UTR binding" evidence=ISS] [GO:0003729 "mRNA
binding" evidence=NAS] [GO:0005515 "protein binding" evidence=IPI]
[GO:0022008 "neurogenesis" evidence=IMP] InterPro:IPR004871
Pfam:PF03178 EMBL:AE013599 GO:GO:0022008 GO:GO:0006378
GO:GO:0003723 eggNOG:COG5161 KO:K14401 GO:GO:0005847 GO:GO:0006379
GeneTree:ENSGT00550000075040 OMA:NIGDNRY EMBL:AF241364
EMBL:AF241365 EMBL:AF241366 EMBL:AY051896 RefSeq:NP_725397.1
RefSeq:NP_995833.1 UniGene:Dm.3414 ProteinModelPortal:Q9V726
STRING:Q9V726 PaxDb:Q9V726 PRIDE:Q9V726 EnsemblMetazoa:FBtr0089258
GeneID:44250 KEGG:dme:Dmel_CG10110 CTD:44250 FlyBase:FBgn0024698
InParanoid:Q9V726 OrthoDB:EOG4ZCRK8 PhylomeDB:Q9V726
GenomeRNAi:44250 NextBio:837008 Bgee:Q9V726 GermOnline:CG10110
Uniprot:Q9V726
Length = 1455
Score = 660 (237.4 bits), Expect = 5.3e-129, Sum P(4) = 5.3e-129
Identities = 161/525 (30%), Positives = 273/525 (52%)
Query: 912 QRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFI 970
Q++ F N+ G G + G PC+ + FR LR+H L +G + +F +NVN +GF+
Sbjct: 942 QKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNIPNGFL 1001
Query: 971 YVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1030
Y + LKI LPS +YD+ WPV+KV PL+ TP Q+ Y E +Y LI +P+
Sbjct: 1002 YFDTTYELKISVLPSYLSYDSVWPVRKV-PLRCTPRQLVYHRENRVYCLITQTE--EPMT 1058
Query: 1031 QVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--RATIPM 1086
+ D+E+ + Y + ++E+ ++ P+ W+ A+I
Sbjct: 1059 KYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----WEIVPDASITF 1107
Query: 1087 QSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL 1145
+ E+ ++V L T+ + L IGT + ED+ +RG + ++ P
Sbjct: 1108 EPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIYDIIEVVPEPGKP 1167
Query: 1146 VT-----EVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVV 1200
+T E++ KE KG +SA++ + G L+ G KI + + +L G+AF D +YV
Sbjct: 1168 MTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGVAFIDTN-IYVH 1226
Query: 1201 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD 1260
+ VK+ I + D++KSI L ++E+ L+L ++DF L+ + EF++D S L +V+D
Sbjct: 1227 QIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDNSNLGFLVTD 1286
Query: 1261 EQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT 1320
++NI ++ Y P+ ES GQKLL +A++H+G V R+Q + P +
Sbjct: 1287 AERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQR--QPFLYEN 1344
Query: 1321 NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1380
F +++GTLDG++G PL E +RR LQ L+ H+ GLNP+ +R S+ K
Sbjct: 1345 KHF-VVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSSKKQG 1403
Query: 1381 RPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
I+D +L+ Y ++ E+ E+A + GT +IL +L ++
Sbjct: 1404 INPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDLLEI 1448
Score = 566 (204.3 bits), Expect = 5.3e-129, Sum P(4) = 5.3e-129
Identities = 158/509 (31%), Positives = 260/509 (51%)
Query: 57 NLVVTAANVIEIYVVRVQXXXXXXXXXXXXTKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV ANV+++Y + R LE + Y L+GNV SL
Sbjct: 29 NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLA-----PKMRLECLATYTLYGNVMSL 83
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
+S GA RD+++++F+DAK+SVL+ D L+ S+H FE + GR
Sbjct: 84 QCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDDIRGGWTGRY- 138
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR--I 230
F P V+VDP RC +LVYG ++++L + S L + + +R I
Sbjct: 139 FV--PTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTPI 196
Query: 231 ESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
+S++I LRDLD K +V D F+HGY EP ++IL+E T GR+ + TC++ A+S+
Sbjct: 197 MASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAISL 256
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAV 347
+ + HP+IW+ +LP D ++ + PIGG LV+ N + Y +QS ++LN+ A
Sbjct: 257 NIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNSSAD 316
Query: 348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
+ + P+ + LD A+ ++ D ++S +TGDL +LT+ D R V+ K
Sbjct: 317 NSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVRNFHFHKA 376
Query: 407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------SGLKEEF 454
SVLTS I + + FLGSRLG+SLL+ FT +++++ L++E
Sbjct: 377 AASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQRNLQDED 436
Query: 455 GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP 514
++E + +L + + A + EEL +YGS + + + F F V DSL+N+ P
Sbjct: 437 QNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCDSLMNVAP 495
Query: 515 LKDFSYGLRINADASATGISKQSNYELVE 543
+ G R+ + G++ + + E ++
Sbjct: 496 INYMCAGERVEFEED--GVTLRPHAESLQ 522
Score = 159 (61.0 bits), Expect = 5.3e-129, Sum P(4) = 5.3e-129
Identities = 58/198 (29%), Positives = 98/198 (49%)
Query: 543 ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES 602
EL GC +WTV+ D+++ ++ +D+ H ++++S T+VL+T + E+ E+
Sbjct: 556 ELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQEINEI-EN 605
Query: 603 VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXXXXXXXXXXXTVL 662
+ V TI GNL +R ++QV R R+L G+ + Q++ V+
Sbjct: 606 TGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPID----------VGSPVV 655
Query: 663 SVSIADPYVLLGMSDGSIRLLV-----GDPSTC----TVSVQTPAAIE-SSKKPVSSCTL 712
VSIADPYV L + +G + L G P T+S +PA + S+ K +S L
Sbjct: 656 QVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTIS-SSPAVVAISAYKDLSG--L 712
Query: 713 YHDKGPEPWLRKTSTDAW 730
+ KG + L +S A+
Sbjct: 713 FTVKGDDINLTGSSNSAF 730
Score = 75 (31.5 bits), Expect = 5.3e-129, Sum P(4) = 5.3e-129
Identities = 28/105 (26%), Positives = 50/105 (47%)
Query: 755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
VV +SG LEI+ +P+ V+ V+ +G + D E + S T +S+ G Q
Sbjct: 795 VVARQSGTLEIYSMPDMKLVYLVNDVGNGSMVLTDAM--EFVPISLTT-QENSKAGIVQA 851
Query: 815 -RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLF 858
++ +S +EL++ + RP L + T +L YQ + +
Sbjct: 852 CMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRVELLIYQVFRY 895
Score = 37 (18.1 bits), Expect = 9.7e-62, Sum P(3) = 9.7e-62
Identities = 9/18 (50%), Positives = 11/18 (61%)
Query: 373 QNDVALLSTKTGDLVLLT 390
Q+D LLS + LVL T
Sbjct: 579 QHDFMLLSQRNSTLVLQT 596
>UNIPROTKB|F1RSN8
symbol:CPSF1 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006378 "mRNA
polyadenylation" evidence=IEA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IEA]
[GO:0003730 "mRNA 3'-UTR binding" evidence=IEA] InterPro:IPR004871
Pfam:PF03178 GO:GO:0006378 GO:GO:0003730 GO:GO:0005847
GO:GO:0006379 GeneTree:ENSGT00550000075040 OMA:NIGDNRY
EMBL:CU468594 Ensembl:ENSSSCT00000006486 Uniprot:F1RSN8
Length = 1108
Score = 777 (278.6 bits), Expect = 1.7e-125, Sum P(4) = 1.7e-125
Identities = 185/524 (35%), Positives = 279/524 (53%)
Query: 913 RITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIY 971
R F++I G+ G F+ G P W +V R LR+HP DG I +F HNVNC GF+Y
Sbjct: 595 RFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLY 654
Query: 972 VTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQ 1031
QG L+I LP+ +YD WPV+K IPL+ T H + Y E +Y + S P +
Sbjct: 655 FNRQGELRISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR 711
Query: 1032 VLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSS 1089
+ + +++ ID + +H E + ++++ P W+ A I ++
Sbjct: 712 IPRMTGEEKEFETIDRDDRY---IHPQQ--EAFSIQLISPVS----WEAIPNARIELEEW 762
Query: 1090 ENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE 1148
E+ ++ V+L + T + +A GT +QGE+V RGR+L+ P +T+
Sbjct: 763 EHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTK 822
Query: 1149 -----VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLN 1203
+Y KE KG ++AL GHL+ A G KI L +EL G+AF D LY+ +
Sbjct: 823 NKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMI 881
Query: 1204 IVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQK 1263
VKNFIL D+ KSI L ++E+ L+L+++D L+ ++ +F++D + L +VSD +
Sbjct: 882 SVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDR 941
Query: 1264 NIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT--- 1320
N+ ++ Y P+ ES+ G +LL RA+FHVGAHV F R GA G K
Sbjct: 942 NLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPC-------RGATDGPSKKSVV 994
Query: 1321 --NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
N+ F TLDG IG + P+ E T+RRL LQ L +PH AGLNPR+FR H + +
Sbjct: 995 WENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRR 1054
Query: 1379 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
+ +++D ELL+ Y L E+ E+A + GTT IL +L
Sbjct: 1055 VLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDL 1098
Score = 466 (169.1 bits), Expect = 1.7e-125, Sum P(4) = 1.7e-125
Identities = 114/325 (35%), Positives = 176/325 (54%)
Query: 57 NLVVTAANVIEIYVVRVQXXXXXXXXXXXXTKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ T+ + + LELV + G V S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEAPTKNDRSTEGKAHRE--HREKLELVASFSFFG-VMSM 83
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 84 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 136
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 137 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLVGEGQRSSFLPSYII 193
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 194 DVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 253
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 254 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 313
Query: 354 ELPRSSFSVELDAAHATWLQN-DVA 377
+ + LD A A ++ + DVA
Sbjct: 314 LRTQEGVRITLDCAQAAFISSQDVA 338
Score = 94 (38.1 bits), Expect = 1.7e-125, Sum P(4) = 1.7e-125
Identities = 27/98 (27%), Positives = 47/98 (47%)
Query: 759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
E+G +EI+ +P++ VF V F G+ +VD+ + E EE T QG
Sbjct: 455 ENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEAR----KEEATRQGELPL 510
Query: 819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
+ + +V L ++ RP+L + D +L Y+A+
Sbjct: 511 VKEVLLVALGSRQ-----RRPYLL-VHVDQELLIYEAF 542
Score = 47 (21.6 bits), Expect = 4.6e-39, Sum P(3) = 4.6e-39
Identities = 15/38 (39%), Positives = 20/38 (52%)
Query: 1071 PDRAGGPWQTRATIPMQSSENALTV-RVVT-LFNTTTK 1106
PD A P + R P QS AL V R V+ +F T ++
Sbjct: 341 PDPAAAPTEPRPPPPQQSKVIALCVYRDVSGMFTTESR 378
Score = 45 (20.9 bits), Expect = 1.7e-125, Sum P(4) = 1.7e-125
Identities = 15/52 (28%), Positives = 25/52 (48%)
Query: 3 FAAYKMMHWPTGIA-NCGSGFITHSRADYV----PQIPLIQTEELDSELPSK 49
+A YK H PTG+ + F +S + V Q+ + + D+E P+K
Sbjct: 2 YAVYKQAHPPTGLEFSMYCNFFNNSERNLVVAGTSQLYVYRLNR-DAEAPTK 52
Score = 40 (19.1 bits), Expect = 2.5e-38, Sum P(3) = 2.5e-38
Identities = 12/34 (35%), Positives = 21/34 (61%)
Query: 468 RRSSSDALQDMVNGEELS--LYGSASNNTESAQK 499
RR +A++++++GE L+ LY S E A+K
Sbjct: 1053 RRVLQNAVRNVLDGELLNRYLYLSTMERGELAKK 1086
>DICTYBASE|DDB_G0281585
symbol:cpsf1 "cleavage and polyadenylation
specificity factor 160 kDa subunit" species:44689 "Dictyostelium
discoideum" [GO:0005634 "nucleus" evidence=IEA] [GO:0003676
"nucleic acid binding" evidence=IEA] [GO:0006379 "mRNA cleavage"
evidence=ISS] [GO:0006378 "mRNA polyadenylation" evidence=ISS]
[GO:0005847 "mRNA cleavage and polyadenylation specificity factor
complex" evidence=ISS] [GO:0003730 "mRNA 3'-UTR binding"
evidence=ISS] InterPro:IPR004871 Pfam:PF03178
dictyBase:DDB_G0281585 GenomeReviews:CM000152_GR GO:GO:0006378
EMBL:AAFI02000042 GO:GO:0003730 eggNOG:COG5161 KO:K14401
GO:GO:0005847 GO:GO:0006379 RefSeq:XP_640515.1
EnsemblProtists:DDB0233702 GeneID:8623125 KEGG:ddi:DDB_G0281585
InParanoid:Q54TS6 OMA:TSATIQD Uniprot:Q54TS6
Length = 1628
Score = 488 (176.8 bits), Expect = 6.1e-115, Sum P(7) = 6.1e-115
Identities = 131/398 (32%), Positives = 220/398 (55%)
Query: 912 QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ-LCDGS---------------IV 955
+RI F +ISG +G F+ G +P W + LR+H D S +
Sbjct: 1122 KRIFEFSSISGKRGLFIGGKKPIWAFCEKGYLRLHSMDSSDNSNSNNSNNNNNNNSNTVE 1181
Query: 956 AFTVLHNVNCNHGFIYVTSQ-GILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEK 1014
FT +N++C GFIY + + ++KIC L + ++N +++ IP K + H+I Y +E
Sbjct: 1182 TFTSFNNISCQDGFIYFSKEKDVIKICTLSTLMNFENDIAIRR-IPTKNSCHKIAYHSEA 1240
Query: 1015 NLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRA 1074
Y +IVS P QV QE+ Q D+ D +++++++++P
Sbjct: 1241 KCYVVIVSFP------QVT-----QEL--QEDSKKPILTD-------DKFQIKLIDPT-I 1279
Query: 1075 GGPWQTRATIPMQSSENALTVRVVTLFNTTTKENET----LLAIGTAYVQGEDVAARGRV 1130
W+ + +Q E L +++V+L T + T L IGTA+ GED +GRV
Sbjct: 1280 DWNWKFIDSFSLQDRETVLAMKIVSL-KFTEPDGITRARPFLVIGTAFTFGEDTQCKGRV 1338
Query: 1131 LLFS--TGRNADNPQNL----VTEVYSKELKGAISALASLQGHLLIASGPKIILHK-WTG 1183
L+F + + + L + +Y KE KG ++AL+S+ G LL+ GPK+ +++ +TG
Sbjct: 1339 LVFEIVSHKTQFESEELGEKRLNLLYEKEQKGPVTALSSVNGLLLMTIGPKLTVNQFYTG 1398
Query: 1184 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1243
+ L ++FYDA +Y+ S+ +KN+I++GD++KS+YFL WK+ LNLL+KD+ +L+ F
Sbjct: 1399 S-LVTLSFYDAQ-IYICSICTIKNYIVIGDMYKSVYFLQWKDNKT-LNLLSKDYQALNIF 1455
Query: 1244 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQ 1281
+TEF+++ TLS++VSD KNI +F + P+ S GQ
Sbjct: 1456 STEFIVNQKTLSILVSDLDKNILLFSFEPQDPSSRSGQ 1493
Score = 413 (150.4 bits), Expect = 6.1e-115, Sum P(7) = 6.1e-115
Identities = 99/283 (34%), Positives = 160/283 (56%)
Query: 239 RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
+++++++VKDF F+HGY EP ++ LHE TW R++ K TC ++A+S++ K I
Sbjct: 281 KNIEIENVKDFCFLHGYYEPTILFLHEPIQTWTSRIAVKKFTCQMTAISLNLLTKAGSFI 340
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W+ N P++ L++VP P+GG LV+ AN + Y +Q++ LA+N YA S+D+S +
Sbjct: 341 WNVSNFPYNCEMLVSVPEPLGGALVITANIMFYVNQTSRYGLAVNEYA-SIDTSTIIGSQ 399
Query: 359 SFSVE----------LDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP 408
F LD ++ +L++D + S K G+L++ ++ DGR VQR+ +SK
Sbjct: 400 PFDFPIDDTLNLVFTLDRSNFVFLESDKFIGSLKGGELLIFHLISDGRSVQRIHVSKAGG 459
Query: 409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
SVLTS I + N+L FLGSRLGDSLL+Q+T S+ L+ E P K+
Sbjct: 460 SVLTSCICVLSNNLIFLGSRLGDSLLLQYT---EKSITDDQLEHE----NFSNPYKKQKT 512
Query: 469 RSSSDALQDMVNGEELSLYGSASNNT-ESAQKTFSFAVRDSLV 510
D + N E + S +NN E+ +K+ S ++ L+
Sbjct: 513 SEVFDLFDE--NSETNNNNNSNNNNNKENQEKSSSSSIASKLL 553
Score = 210 (79.0 bits), Expect = 6.1e-115, Sum P(7) = 6.1e-115
Identities = 60/173 (34%), Positives = 88/173 (50%)
Query: 57 NLVVTAANVIEIYVVRVQXXXXXXXXXXXXTKRRVLMD----GISAA-------SLELVC 105
NLV+ NV++IY +R + +++ + I+ SLEL+
Sbjct: 32 NLVLAKTNVLQIYKIRYEKIEKYENVSDSQPQQQQEQEQQQQDITQKKKIELKPSLELII 91
Query: 106 HYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESP 165
+L GN+ES+A + NS R DS+IL F DAKISVL++D + I S+H FE
Sbjct: 92 EKKLFGNIESMASVRY---PNSER-DSLILTFRDAKISVLDYDSDLLDFEIRSLHYFEKD 147
Query: 166 EWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED 218
E+ K GR F PL+KVD Q RC +L+Y + +L + S L D+D
Sbjct: 148 EF---KGGRNHFKHPPLLKVDTQQRCAVMLLYDRNLAVLPFKKTSSILDDDDD 197
Score = 142 (55.0 bits), Expect = 6.1e-115, Sum P(7) = 6.1e-115
Identities = 36/108 (33%), Positives = 58/108 (53%)
Query: 1325 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHS-NGKAH-RP 1382
++FGTLDG + + PLDE + +Q KL +P AGLNP+ +R F S + H P
Sbjct: 1515 VIFGTLDGGLNVLRPLDEKIYLLFYHIQSKLY-YLPQTAGLNPKQYRSFKSFSQNFHFSP 1573
Query: 1383 G-----PDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
P I+D +L+S + L E+ I++ +T +I+ +L D+
Sbjct: 1574 STFHQLPKFILDGDLISKFLSLSQSEKRLISNSINSTSDEIIESLKDV 1621
Score = 119 (46.9 bits), Expect = 6.1e-115, Sum P(7) = 6.1e-115
Identities = 35/148 (23%), Positives = 72/148 (48%)
Query: 572 DDEYHAYLIISL-EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG 630
D +H YL +SL + T++ ET L EV + +++ GNLFGR+R++ +++ G
Sbjct: 712 DKNWHDYLYLSLKDGTTLIFETGRDLKEVGK-----FNFKSLDIGNLFGRKRIVVIYQGG 766
Query: 631 ARILDG-SYMTQDLSFGPXXXXXXXXXXXXTVLSVSIADPYVLLGMSDGSIRLLVG-DPS 688
++++G + Q++ + S I DP++LL +G+I++ G D
Sbjct: 767 IKLINGFDRVIQEIQINE------------PIKSSYICDPFILLQFHNGTIQIFKGIDEE 814
Query: 689 TCTVSVQTPAAIESSKKPVSSCTLYHDK 716
+ + + + + S +L+ D+
Sbjct: 815 NQLIQFSINSISNNLNQSIFSSSLFFDR 842
Score = 73 (30.8 bits), Expect = 8.8e-80, Sum P(7) = 8.8e-80
Identities = 22/82 (26%), Positives = 35/82 (42%)
Query: 470 SSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADAS 529
S + L + + EE L+ N K++ + D ++NIGP+ D G I+
Sbjct: 547 SIASKLLEEIEDEEDQLFKEKKNQL----KSYQLGICDQIINIGPIGDIVVGQSIDPTYD 602
Query: 530 ATGISKQSNY--ELVELPGCKG 549
T Q Y + +EL C G
Sbjct: 603 ETIQPNQPEYVPKTLELVTCSG 624
Score = 64 (27.6 bits), Expect = 2.3e-107, Sum P(5) = 2.3e-107
Identities = 21/89 (23%), Positives = 38/89 (42%)
Query: 766 FDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVV 825
F++P V+TV K HI + K + N+++E+ + + +
Sbjct: 646 FELPGILNVWTVYKEEIEEEHIEKEIKKNTSKKRSRDENNNNEQEDNEQEDNEDNEEEEE 705
Query: 826 ELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
E MQ+ H +L+ L DGT L ++
Sbjct: 706 EEKMQKDKNWHD--YLYLSLKDGTTLIFE 732
Score = 58 (25.5 bits), Expect = 6.1e-115, Sum P(7) = 6.1e-115
Identities = 19/77 (24%), Positives = 38/77 (49%)
Query: 748 DQGDIYSVVCYESGALEIFDVPNFNCVFTVD--KF---VSG-RTHIVDTYMREALKDSET 801
DQ +IY + +G+ EI+ + + C+F V KF + G T++ + E + ++
Sbjct: 932 DQDNIYLNIYTTNGSYEIYRLTSQECIFKVSDIKFEYDILGINTNVSQNQILEQVLTPKS 991
Query: 802 EINSSSEEGTGQGRKEN 818
++ + Q +KEN
Sbjct: 992 SLSKKQLQQHLQKQKEN 1008
Score = 53 (23.7 bits), Expect = 2.0e-114, Sum P(7) = 2.0e-114
Identities = 14/60 (23%), Positives = 31/60 (51%)
Query: 797 KDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
K E INS + + +N + +VE+++ ++ +S P+LF G ++ Y+++
Sbjct: 1004 KQKENGINSKNN----YNQIQNSEILDIVEISLHNFN--NSDPYLFMFNKIGDLIIYKSF 1057
Score = 49 (22.3 bits), Expect = 6.1e-115, Sum P(7) = 6.1e-115
Identities = 8/12 (66%), Positives = 9/12 (75%)
Query: 543 ELPGCKGIWTVY 554
ELPG +WTVY
Sbjct: 647 ELPGILNVWTVY 658
>RGD|1306406 [details] [associations]
symbol:Cpsf1 "cleavage and polyadenylation specific factor 1,
160kDa" species:10116 "Rattus norvegicus" [GO:0003730 "mRNA 3'-UTR
binding" evidence=IEA;ISO] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IEA;ISO]
[GO:0006378 "mRNA polyadenylation" evidence=IEA;ISO] [GO:0006379
"mRNA cleavage" evidence=IEA;ISO] InterPro:IPR004871 Pfam:PF03178
RGD:1306406 GO:GO:0005634 GO:GO:0003676 EMBL:CH473950 KO:K14401
GeneTree:ENSGT00550000075040 CTD:29894 IPI:IPI00949657
RefSeq:NP_001124043.1 UniGene:Rn.40455 Ensembl:ENSRNOT00000066244
GeneID:366952 KEGG:rno:366952 UCSC:RGD:1306406 NextBio:690318
Uniprot:D4A0H5
Length = 1386
Score = 652 (234.6 bits), Expect = 2.6e-113, Sum P(4) = 2.6e-113
Identities = 167/473 (35%), Positives = 256/473 (54%)
Query: 57 NLVVTAANVIEIYVVRVQXXXXXXXXXXXXTKRRVLMDGISAASLELVCHYRLHGNVESL 116
NLVV A ++YV R+ T+ + + LELV + GNV S+
Sbjct: 29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81
Query: 117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
A + GA +RD+++L+F+DAK+SV+E+D H L+ S+H FE PE L+ G
Sbjct: 82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134
Query: 177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
P V+VDP GRC +L+YG ++++L + + +E G G + S++I
Sbjct: 135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191
Query: 237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
++R LD K ++ D F+HGY EP ++IL E TW GRV+ + TC I A+S++ T K
Sbjct: 192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251
Query: 295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQ 353
HP+IWS +LP D + LAVP PIGGV++ N++ Y +QS +ALN+ +
Sbjct: 252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
+ + LD A A ++ D ++S K G++ +LT++ DG R V+ K SVLT
Sbjct: 312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
+ + T+ FLGSRLG+SLL+++T SS E D E KR+ +
Sbjct: 372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTVG 429
Query: 471 -SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ QD V +E+ +YGS A + T+ A T+SF V DS++NIGP + + G
Sbjct: 430 WTGGKTQDEV--DEIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVG 478
Score = 457 (165.9 bits), Expect = 2.6e-113, Sum P(4) = 2.6e-113
Identities = 107/275 (38%), Positives = 156/275 (56%)
Query: 1154 LKGAISALASL-QGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLG 1212
LKG ++A L QG + G +I L +EL G+AF D LY+ + VKNFIL
Sbjct: 1111 LKGYVAAGTCLMQGEEVTCRG-RIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFILAA 1168
Query: 1213 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP 1272
D+ KSI L ++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++ Y P
Sbjct: 1169 DVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLP 1228
Query: 1273 KMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFALLF 1327
+ ES+ G +LL RA+FHVGAHV F R GAA G K N+ F
Sbjct: 1229 EAKESFGGMRLLRRADFHVGAHVNTFWRTPC-------RGAAEGPSKKSVMWENKHITWF 1281
Query: 1328 GTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSI 1387
TLDG IG + P+ E T+RRL LQ L +PH AGLNPR+FR H + + + ++
Sbjct: 1282 ATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNV 1341
Query: 1388 VDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
+D ELL+ Y L E+ E+A + GTT IL +L
Sbjct: 1342 LDGELLNRYLYLSTMERSELAKKIGTTPDIILDDL 1376
Score = 354 (129.7 bits), Expect = 1.6e-102, Sum P(4) = 1.6e-102
Identities = 121/461 (26%), Positives = 201/461 (43%)
Query: 759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
E+G +EI+ +P++ VF V F G+ +VD+ + E EE T QG
Sbjct: 784 ENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATRQGELPL 839
Query: 819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPXXXXXXXX 878
+ + +V L ++ SRP+L + D +L Y+A+ P ++
Sbjct: 840 VKEVLLVALGSRQ-----SRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLKVRFKKV 889
Query: 879 XXXXXXXXXXXLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV 938
+ T E + R F++I G+ G F+ G P W +V
Sbjct: 890 PHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPSPHWLLV 949
Query: 939 F-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQK 997
R LR+HP DG I +F HNVNC GF+Y QG L+I LP+ +YD WPV+K
Sbjct: 950 TGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRK 1009
Query: 998 VIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHR 1057
IPL+ T H + Y E +Y + S P + I + G + + + D +
Sbjct: 1010 -IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIERDDRYI 1061
Query: 1058 TYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAI 1114
E + ++++ P W+ A I ++ E+ ++ V+L + T + +A
Sbjct: 1062 HPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAA 1117
Query: 1115 GTAYVQGEDVAARGRVLLFSTGRNADNPQNLV-TEVYSKELKGAISALASLQGHLLIASG 1173
GT +QGE+V RGR+ L+S + + T++Y I + S++ +L A
Sbjct: 1118 GTCLMQGEEVTCRGRIFLWSLRASELTGMAFIDTQLY-------IHQMISVKNFILAADV 1170
Query: 1174 PKII--LHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLG 1212
K I L ++ + DA PL V S++ + + LG
Sbjct: 1171 MKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLG 1211
Score = 158 (60.7 bits), Expect = 2.6e-113, Sum P(4) = 2.6e-113
Identities = 45/152 (29%), Positives = 72/152 (47%)
Query: 543 ELPGCKGIWTVYH----------KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLET 592
ELPGC +WTV K+ S+ A D H +LI+S E TM+L+T
Sbjct: 523 ELPGCYDMWTVIAPVRKEEEETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQT 582
Query: 593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXXXX 652
+ E+ S + QG T+ AGN+ R ++QV G R+L+G L F P
Sbjct: 583 GQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDLGA 638
Query: 653 XXXXXXXTVLSVSIADPYVLLGMSDGSIRLLV 684
++ ++ADPYV++ ++G + + +
Sbjct: 639 -------PIVQCAVADPYVVIMSAEGHVTMFL 663
Score = 47 (21.6 bits), Expect = 3.5e-37, Sum P(3) = 3.5e-37
Identities = 15/49 (30%), Positives = 26/49 (53%)
Query: 283 ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY 331
I L S Q P +++ N+ + Y ++ V SP+G L+ G N +H+
Sbjct: 586 IMELDTSGFATQGPTVFAG-NIGDNRY-IVQV-SPLGIRLLEGVNQLHF 631
Score = 42 (19.8 bits), Expect = 2.6e-113, Sum P(4) = 2.6e-113
Identities = 7/13 (53%), Positives = 9/13 (69%)
Query: 3 FAAYKMMHWPTGI 15
+A YK H PTG+
Sbjct: 2 YAVYKQAHPPTGL 14
>UNIPROTKB|J9P418 [details] [associations]
symbol:CPSF1 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0005634 "nucleus" evidence=IEA] [GO:0003676
"nucleic acid binding" evidence=IEA] InterPro:IPR004871
Pfam:PF03178 GO:GO:0005634 GO:GO:0003676
GeneTree:ENSGT00550000075040 EMBL:AAEX03008966
Ensembl:ENSCAFT00000043656 Uniprot:J9P418
Length = 1107
Score = 786 (281.7 bits), Expect = 9.9e-111, Sum P(4) = 9.9e-111
Identities = 209/638 (32%), Positives = 320/638 (50%)
Query: 801 TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
T+ + EE T QG + + +V L ++ SRP+L + D +L Y+A+
Sbjct: 492 TQAEARKEEATRQGELPLVKEVLLVALGSRQ-----SRPYLL-VHVDQELLIYEAF---- 541
Query: 861 PENTSKSDDPXXXXXXXXXXXXXXXXXXXLRFSRTPLDAYTREETPHGAPCQ--RITIFK 918
P + S+ + S+ + EE GA + R F+
Sbjct: 542 PHD-SQLGQGNLKVRFKKVPHNINFREKKPKPSKKKAEGGGAEEGA-GARGRVARFRYFE 599
Query: 919 NISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
+I G+ G F+ G P W +V R LR+HP DG I +F HNVNC GF+Y QG
Sbjct: 600 DIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGE 659
Query: 978 LKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1037
L+I LP+ +YD WPV+K IPL+ T H + Y E +Y + S + P + I
Sbjct: 660 LRISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNM--PCTR-----I 711
Query: 1038 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTV 1095
+ G + + + D + E + ++++ P W+ A I ++ E+ +
Sbjct: 712 PRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCM 767
Query: 1096 RVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----V 1149
+ V+L + T + +A GT +QGE+V RGR+L+ P +T+ +
Sbjct: 768 KTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVL 827
Query: 1150 YSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFI 1209
Y KE KG ++AL GHL+ A G KI L +EL G+AF D LY+ + VKNFI
Sbjct: 828 YEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMISVKNFI 886
Query: 1210 LLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY 1269
L D+ KSI L ++E+ L+L+++D L+ ++ +F++D + L +VSD +N+ ++
Sbjct: 887 LAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYM 946
Query: 1270 YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT-----NRFA 1324
Y P+ ES+ G +LL RA+FHVGAHV F R GAA G K N+
Sbjct: 947 YLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPC-------RGAAEGPSKKSVVWENKHI 999
Query: 1325 LLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGP 1384
F TLDG IG + P+ E T+RRL LQ L +PH AGLNPR+FR H + + +
Sbjct: 1000 TWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAV 1059
Query: 1385 DSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
+++D ELL+ Y L E+ E+A + GTT IL +L
Sbjct: 1060 RNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDL 1097
Score = 176 (67.0 bits), Expect = 9.9e-111, Sum P(4) = 9.9e-111
Identities = 49/152 (32%), Positives = 79/152 (51%)
Query: 543 ELPGCKGIWTVY-------HKSSRGHNAD--SSRMAAYDD-EYHAYLIISLEARTMVLET 592
ELPGC +WTV ++S+G A+ SS + A DD H +LI+S E TM+L+T
Sbjct: 193 ELPGCYDMWTVIAPVRKEQEETSKGEVAEQESSALEAEDDGRRHGFLILSREDSTMILQT 252
Query: 593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXXXX 652
+ E+ S + QG T+ AGN+ R ++QV G R+L+G L F P
Sbjct: 253 GQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDLGS 308
Query: 653 XXXXXXXTVLSVSIADPYVLLGMSDGSIRLLV 684
++ ++ADPYV++ ++G + + +
Sbjct: 309 -------PIVQCAVADPYVVIMSAEGHVTMFL 333
Score = 172 (65.6 bits), Expect = 9.9e-111, Sum P(4) = 9.9e-111
Identities = 53/151 (35%), Positives = 82/151 (54%)
Query: 378 LLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
++S K G++ +LT++ DG R V+ K SVLT+ + T+ FLGSRLG+SLL++
Sbjct: 2 VISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLK 61
Query: 437 FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS-----DALQDMVNGEELSLYGS-A 490
+T S+ E D E KR+ ++ QD V +E+ +YGS A
Sbjct: 62 YTEKLQEPPASAA--REAADKEEPPSKKKRVDCAAGWSGGKSVPQDEV--DEIEVYGSEA 117
Query: 491 SNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
+ T+ A T+SF V DS++NIGP + + G
Sbjct: 118 QSGTQLA--TYSFEVCDSILNIGPCANAAMG 146
Score = 102 (41.0 bits), Expect = 9.9e-111, Sum P(4) = 9.9e-111
Identities = 28/98 (28%), Positives = 50/98 (51%)
Query: 759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
E+G +EI+ +P++ VF V F G+ +VD+ + T+ + EE T QG
Sbjct: 454 ENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQAEARKEEATRQGELPL 509
Query: 819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
+ + +V L ++ SRP+L + D +L Y+A+
Sbjct: 510 VKEVLLVALGSRQ-----SRPYLL-VHVDQELLIYEAF 541
Score = 49 (22.3 bits), Expect = 5.9e-83, Sum P(3) = 5.9e-83
Identities = 21/74 (28%), Positives = 36/74 (48%)
Query: 283 ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY-----HSQSAS 337
I L S Q P +++ N+ + Y ++ V SP+G L+ G N +H+ S
Sbjct: 256 IMELDTSGFATQGPTVFAG-NIGDNRY-IVQV-SPLGIRLLEGVNQLHFIPVDLGSPIVQ 312
Query: 338 CALALNNYAVSLDS 351
CA+A + Y V + +
Sbjct: 313 CAVA-DPYVVIMSA 325
>WB|WBGene00022301 [details] [associations]
symbol:cpsf-1 species:6239 "Caenorhabditis elegans"
[GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005634
"nucleus" evidence=IEA] [GO:0009792 "embryo development ending in
birth or egg hatching" evidence=IMP] [GO:0000003 "reproduction"
evidence=IMP] [GO:0040007 "growth" evidence=IMP] [GO:0002119
"nematode larval development" evidence=IMP] [GO:0040018 "positive
regulation of multicellular organism growth" evidence=IMP]
[GO:0010171 "body morphogenesis" evidence=IMP] [GO:0040027
"negative regulation of vulval development" evidence=IMP]
InterPro:IPR004871 Pfam:PF03178 GO:GO:0009792 GO:GO:0040007
GO:GO:0002119 GO:GO:0006378 GO:GO:0010171 GO:GO:0040018
GO:GO:0000003 GO:GO:0003723 GO:GO:0040027 eggNOG:COG5161 KO:K14401
GO:GO:0005847 GO:GO:0006379 GeneTree:ENSGT00550000075040
OMA:NIGDNRY HOGENOM:HOG000007904 EMBL:FO081666 RefSeq:NP_500157.2
ProteinModelPortal:Q9N4C2 MINT:MINT-3384281 STRING:Q9N4C2
PaxDb:Q9N4C2 EnsemblMetazoa:Y76B12C.7.1 EnsemblMetazoa:Y76B12C.7.2
GeneID:177003 KEGG:cel:CELE_Y76B12C.7 CTD:177003 WormBase:Y76B12C.7
InParanoid:Q9N4C2 NextBio:894932 Uniprot:Q9N4C2
Length = 1454
Score = 522 (188.8 bits), Expect = 3.3e-94, Sum P(3) = 3.3e-94
Identities = 176/728 (24%), Positives = 328/728 (45%)
Query: 723 RKTSTDAWLSTGVGEAIDGADGG-PLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFV 781
++ DA S+ GE D D + +V +E+G L I +P V+ + +F
Sbjct: 744 KRLGHDAIQSSRGGEQSDAIDPTRTFSSISHWLIVSHENGRLSIHSLPEMEVVYQIGRFS 803
Query: 782 SGRTHIVDTYMREALKDSETEINSSSEEG---TGQGRKENIHSMKVVELAMQR----WSA 834
+ +VD + E K+ + + +++E T + + N ++ E ++
Sbjct: 804 NVPELLVDLTVEEEEKERKAKAQQAAKEASVPTDEAEQLNTEMKQLCERVLEAQIVGMGI 863
Query: 835 HHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPXXXXXXXXXXXXXXXXXXXLRFSR 894
+ + P L AI+ + +L Y+ + +S + P +
Sbjct: 864 NQAHPILMAIVDEQVVL-YEMF-------SSSNPIPGHLGISFRKLPHFICLRTSSHLNS 915
Query: 895 TPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFRE--RLRVHPQLCD 951
A + +G I F+ +S + G + G+ P +V+ ++ H D
Sbjct: 916 DGKRAPFEMKINNGKRFSLIHPFERVSSVNNGVMIVGAVPT-LLVYGAWGGMQTHQMTVD 974
Query: 952 GSIVAFTVLHNVNCNHGFIYVTS-QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITY 1010
G I AFT +N N HG +Y+T + L+I ++ Y+ +PV+K I + T H + Y
Sbjct: 975 GPIKAFTPFNNENVLHGIVYMTQHKSELRIARMHPDFDYEMPYPVKK-IEVGRTIHHVRY 1033
Query: 1011 FAEKNLYPLIVSVPVLKPLNQVLSLLID--QEVGHQIDNHNLSSVDLHRTYTVEEYEVRI 1068
++Y ++ S+P KP N++ ++ D QE H+ D + + + YT+ + +
Sbjct: 1034 LMNSDVYAVVSSIP--KPSNKIWVVMNDDKQEEIHEKDENFV--LPAPPKYTLNLFSSQ- 1088
Query: 1069 LEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAAR 1127
D A P I + E V L + +T ETLLA+GT GE+V R
Sbjct: 1089 ---DWAAVP---NTEISFEDMEAVTACEDVALKSESTISGLETLLAMGTVNNYGEEVLVR 1142
Query: 1128 GRVLL---FSTGRNADNPQN--LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT 1182
GR++L D P + + ++ KE KG ++ L ++ G LL G K+ + ++
Sbjct: 1143 GRIILCEVIEVVPEPDQPTSNRKIKVLFDKEQKGPVTGLCAINGLLLCGMGQKVFIWQFK 1202
Query: 1183 GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1242
+L GI+F D YV L+ ++ + D +S+ + ++E +++ ++D C
Sbjct: 1203 DNDLMGISFLDMH-YYVYQLHSLRTIAIACDARESMSLIRFQEDNKAMSIASRD--DRKC 1259
Query: 1243 ----FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1298
A++ ++DG+ + ++SDE NI +F YAP+ ES G++L RA ++G ++ F
Sbjct: 1260 AQPPMASQLVVDGAHVGFLLSDETGNITMFNYAPEAPESNGGERLTVRAAINIGTNINAF 1319
Query: 1299 LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDS 1358
+RL+ + R +F +LDGS G + PL E ++RRL LQ +
Sbjct: 1320 VRLRGHTSLLQLNNEDEKEAIEQRMTTVFASLDGSFGFVRPLTEKSYRRLHFLQTFIGSV 1379
Query: 1359 VPHVAGLNPRSFRQFH-SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQ 1417
P +AGL+ + R S + +++D +++ Y L L ++ ++A + G R
Sbjct: 1380 TPQIAGLHIKGSRSAKPSQPIVNGRNARNLIDGDVVEQYLHLSLYDKTDLARRLGVGRYH 1439
Query: 1418 ILSNLNDL 1425
I+ +L L
Sbjct: 1440 IIDDLMQL 1447
Score = 431 (156.8 bits), Expect = 3.3e-94, Sum P(3) = 3.3e-94
Identities = 158/551 (28%), Positives = 257/551 (46%)
Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA 228
+L+ G + + PLV+ DP RC LVYG + IL + S
Sbjct: 128 YLRDGFINHFQPPLVRSDPSNRCAACLVYGKHIAILPFHEN-----------------SK 170
Query: 229 RIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISAL 286
RI S +VI L+ +D + ++ D +F+ GY EP ++ L+E T GR ++ T I +
Sbjct: 171 RIHS-YVIPLKQIDPRLDNIADMVFLDGYYEPTILFLYEPIQTTPGRACVRYDTMCIMGV 229
Query: 287 SISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNY 345
S++ +Q ++W NLP D +LL +P P+GG LV G+NT+ Y +Q+ C L LN+
Sbjct: 230 SVNIVDRQFAVVWQTANLPMDCSQLLPIPKPLGGALVFGSNTVVYLNQAVPPCGLVLNS- 288
Query: 346 AVSLDSSQELPRSSFS---VELDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQR 400
D + P + LD + + ++++ + ++ GDL LL ++ G V+
Sbjct: 289 --CYDGFTKFPLKDLKHLKMTLDCSTSVYMEDGRIAVGSRDGDLFLLRLMTSSGGGTVKS 346
Query: 401 LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
L+ SK + + +T F+GSRLGDS L+++T LK D
Sbjct: 347 LEFSKVYETSIAYSLTVCAPGHLFVGSRLGDSQLLEYTL----------LKTT-----RD 391
Query: 461 APSTKRLRRSSSD--ALQDMVNGEELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIG 513
+ KRL+ + D A + ++ +++ LYG A +++ E ++ F D L N+G
Sbjct: 392 C-AVKRLKIDNKDPAAAEIELDEDDMELYGGAIEEQQNDDDEQIDESLQFRELDRLRNVG 450
Query: 514 PLKDFSYGLRINADASATGISKQSN--YELVELPGC--KGIWTVYHKSSRGHNADSSRMA 569
P+K G R N ++ +K+ + ++LV G G V+ +S R SS +
Sbjct: 451 PVKSMCVG-RPNYMSNDLVDAKRRDPVFDLVTASGHGKNGALCVHQRSLRPEIITSSLLE 509
Query: 570 AYD---------DEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR-TIAAGNLFG 619
+ +E H YLI+S T++LE + L E+ E + FV G T+AAG L
Sbjct: 510 GAEQLWAVGRKENESHKYLIVSRVRSTLILELGEELVELEEQL--FVTGEPTVAAGELSQ 567
Query: 620 RRRVIQVFERG-ARILDGSYMTQDLSFGPXXXXXXXXXXXXTVLSVSIADPYVLLGMSDG 678
+QV A + DG M Q++ V+ SI DPYV L +G
Sbjct: 568 GALAVQVTSTCIALVTDGQQM-QEVHID----------SNFPVIQASIVDPYVALLTQNG 616
Query: 679 SIRL--LVGDP 687
+ L LV +P
Sbjct: 617 RLLLYELVMEP 627
Score = 136 (52.9 bits), Expect = 3.3e-94, Sum P(3) = 3.3e-94
Identities = 27/75 (36%), Positives = 46/75 (61%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+DSI++ F+DAK+S++ ++ ++ S+H FE+ +L+ G + + PLV+ DP
Sbjct: 92 QDSILMTFDDAKLSIVSINEKERNMQTISLHAFENE---YLRDGFINHFQPPLVRSDPSN 148
Query: 190 RCGGVLVYGLQMIIL 204
RC LVYG + IL
Sbjct: 149 RCAACLVYGKHIAIL 163
Score = 43 (20.2 bits), Expect = 1.6e-53, Sum P(3) = 1.6e-53
Identities = 9/40 (22%), Positives = 19/40 (47%)
Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE 483
+++ E G+ +T++ +R DA+Q GE+
Sbjct: 720 TIMEQNFPVENGEATIKQSNTRKRKRLGHDAIQSSRGGEQ 759
Score = 37 (18.1 bits), Expect = 3.4e-43, Sum P(3) = 3.4e-43
Identities = 8/20 (40%), Positives = 11/20 (55%)
Query: 45 ELPSKRGIGPVPNLVVTAAN 64
EL R +GPV ++ V N
Sbjct: 442 ELDRLRNVGPVKSMCVGRPN 461
>UNIPROTKB|Q9N4C2 [details] [associations]
symbol:cpsf-1 "Probable cleavage and polyadenylation
specificity factor subunit 1" species:6239 "Caenorhabditis elegans"
[GO:0006378 "mRNA polyadenylation" evidence=NAS] [GO:0006379 "mRNA
cleavage" evidence=NAS] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=NAS]
InterPro:IPR004871 Pfam:PF03178 GO:GO:0009792 GO:GO:0040007
GO:GO:0002119 GO:GO:0006378 GO:GO:0010171 GO:GO:0040018
GO:GO:0000003 GO:GO:0003723 GO:GO:0040027 eggNOG:COG5161 KO:K14401
GO:GO:0005847 GO:GO:0006379 GeneTree:ENSGT00550000075040
OMA:NIGDNRY HOGENOM:HOG000007904 EMBL:FO081666 RefSeq:NP_500157.2
ProteinModelPortal:Q9N4C2 MINT:MINT-3384281 STRING:Q9N4C2
PaxDb:Q9N4C2 EnsemblMetazoa:Y76B12C.7.1 EnsemblMetazoa:Y76B12C.7.2
GeneID:177003 KEGG:cel:CELE_Y76B12C.7 CTD:177003 WormBase:Y76B12C.7
InParanoid:Q9N4C2 NextBio:894932 Uniprot:Q9N4C2
Length = 1454
Score = 522 (188.8 bits), Expect = 3.3e-94, Sum P(3) = 3.3e-94
Identities = 176/728 (24%), Positives = 328/728 (45%)
Query: 723 RKTSTDAWLSTGVGEAIDGADGG-PLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFV 781
++ DA S+ GE D D + +V +E+G L I +P V+ + +F
Sbjct: 744 KRLGHDAIQSSRGGEQSDAIDPTRTFSSISHWLIVSHENGRLSIHSLPEMEVVYQIGRFS 803
Query: 782 SGRTHIVDTYMREALKDSETEINSSSEEG---TGQGRKENIHSMKVVELAMQR----WSA 834
+ +VD + E K+ + + +++E T + + N ++ E ++
Sbjct: 804 NVPELLVDLTVEEEEKERKAKAQQAAKEASVPTDEAEQLNTEMKQLCERVLEAQIVGMGI 863
Query: 835 HHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPXXXXXXXXXXXXXXXXXXXLRFSR 894
+ + P L AI+ + +L Y+ + +S + P +
Sbjct: 864 NQAHPILMAIVDEQVVL-YEMF-------SSSNPIPGHLGISFRKLPHFICLRTSSHLNS 915
Query: 895 TPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFRE--RLRVHPQLCD 951
A + +G I F+ +S + G + G+ P +V+ ++ H D
Sbjct: 916 DGKRAPFEMKINNGKRFSLIHPFERVSSVNNGVMIVGAVPT-LLVYGAWGGMQTHQMTVD 974
Query: 952 GSIVAFTVLHNVNCNHGFIYVTS-QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITY 1010
G I AFT +N N HG +Y+T + L+I ++ Y+ +PV+K I + T H + Y
Sbjct: 975 GPIKAFTPFNNENVLHGIVYMTQHKSELRIARMHPDFDYEMPYPVKK-IEVGRTIHHVRY 1033
Query: 1011 FAEKNLYPLIVSVPVLKPLNQVLSLLID--QEVGHQIDNHNLSSVDLHRTYTVEEYEVRI 1068
++Y ++ S+P KP N++ ++ D QE H+ D + + + YT+ + +
Sbjct: 1034 LMNSDVYAVVSSIP--KPSNKIWVVMNDDKQEEIHEKDENFV--LPAPPKYTLNLFSSQ- 1088
Query: 1069 LEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAAR 1127
D A P I + E V L + +T ETLLA+GT GE+V R
Sbjct: 1089 ---DWAAVP---NTEISFEDMEAVTACEDVALKSESTISGLETLLAMGTVNNYGEEVLVR 1142
Query: 1128 GRVLL---FSTGRNADNPQN--LVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT 1182
GR++L D P + + ++ KE KG ++ L ++ G LL G K+ + ++
Sbjct: 1143 GRIILCEVIEVVPEPDQPTSNRKIKVLFDKEQKGPVTGLCAINGLLLCGMGQKVFIWQFK 1202
Query: 1183 GTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDC 1242
+L GI+F D YV L+ ++ + D +S+ + ++E +++ ++D C
Sbjct: 1203 DNDLMGISFLDMH-YYVYQLHSLRTIAIACDARESMSLIRFQEDNKAMSIASRD--DRKC 1259
Query: 1243 ----FATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKF 1298
A++ ++DG+ + ++SDE NI +F YAP+ ES G++L RA ++G ++ F
Sbjct: 1260 AQPPMASQLVVDGAHVGFLLSDETGNITMFNYAPEAPESNGGERLTVRAAINIGTNINAF 1319
Query: 1299 LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDS 1358
+RL+ + R +F +LDGS G + PL E ++RRL LQ +
Sbjct: 1320 VRLRGHTSLLQLNNEDEKEAIEQRMTTVFASLDGSFGFVRPLTEKSYRRLHFLQTFIGSV 1379
Query: 1359 VPHVAGLNPRSFRQFH-SNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQ 1417
P +AGL+ + R S + +++D +++ Y L L ++ ++A + G R
Sbjct: 1380 TPQIAGLHIKGSRSAKPSQPIVNGRNARNLIDGDVVEQYLHLSLYDKTDLARRLGVGRYH 1439
Query: 1418 ILSNLNDL 1425
I+ +L L
Sbjct: 1440 IIDDLMQL 1447
Score = 431 (156.8 bits), Expect = 3.3e-94, Sum P(3) = 3.3e-94
Identities = 158/551 (28%), Positives = 257/551 (46%)
Query: 169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA 228
+L+ G + + PLV+ DP RC LVYG + IL + S
Sbjct: 128 YLRDGFINHFQPPLVRSDPSNRCAACLVYGKHIAILPFHEN-----------------SK 170
Query: 229 RIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISAL 286
RI S +VI L+ +D + ++ D +F+ GY EP ++ L+E T GR ++ T I +
Sbjct: 171 RIHS-YVIPLKQIDPRLDNIADMVFLDGYYEPTILFLYEPIQTTPGRACVRYDTMCIMGV 229
Query: 287 SISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNY 345
S++ +Q ++W NLP D +LL +P P+GG LV G+NT+ Y +Q+ C L LN+
Sbjct: 230 SVNIVDRQFAVVWQTANLPMDCSQLLPIPKPLGGALVFGSNTVVYLNQAVPPCGLVLNS- 288
Query: 346 AVSLDSSQELPRSSFS---VELDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQR 400
D + P + LD + + ++++ + ++ GDL LL ++ G V+
Sbjct: 289 --CYDGFTKFPLKDLKHLKMTLDCSTSVYMEDGRIAVGSRDGDLFLLRLMTSSGGGTVKS 346
Query: 401 LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
L+ SK + + +T F+GSRLGDS L+++T LK D
Sbjct: 347 LEFSKVYETSIAYSLTVCAPGHLFVGSRLGDSQLLEYTL----------LKTT-----RD 391
Query: 461 APSTKRLRRSSSD--ALQDMVNGEELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIG 513
+ KRL+ + D A + ++ +++ LYG A +++ E ++ F D L N+G
Sbjct: 392 C-AVKRLKIDNKDPAAAEIELDEDDMELYGGAIEEQQNDDDEQIDESLQFRELDRLRNVG 450
Query: 514 PLKDFSYGLRINADASATGISKQSN--YELVELPGC--KGIWTVYHKSSRGHNADSSRMA 569
P+K G R N ++ +K+ + ++LV G G V+ +S R SS +
Sbjct: 451 PVKSMCVG-RPNYMSNDLVDAKRRDPVFDLVTASGHGKNGALCVHQRSLRPEIITSSLLE 509
Query: 570 AYD---------DEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR-TIAAGNLFG 619
+ +E H YLI+S T++LE + L E+ E + FV G T+AAG L
Sbjct: 510 GAEQLWAVGRKENESHKYLIVSRVRSTLILELGEELVELEEQL--FVTGEPTVAAGELSQ 567
Query: 620 RRRVIQVFERG-ARILDGSYMTQDLSFGPXXXXXXXXXXXXTVLSVSIADPYVLLGMSDG 678
+QV A + DG M Q++ V+ SI DPYV L +G
Sbjct: 568 GALAVQVTSTCIALVTDGQQM-QEVHID----------SNFPVIQASIVDPYVALLTQNG 616
Query: 679 SIRL--LVGDP 687
+ L LV +P
Sbjct: 617 RLLLYELVMEP 627
Score = 136 (52.9 bits), Expect = 3.3e-94, Sum P(3) = 3.3e-94
Identities = 27/75 (36%), Positives = 46/75 (61%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+DSI++ F+DAK+S++ ++ ++ S+H FE+ +L+ G + + PLV+ DP
Sbjct: 92 QDSILMTFDDAKLSIVSINEKERNMQTISLHAFENE---YLRDGFINHFQPPLVRSDPSN 148
Query: 190 RCGGVLVYGLQMIIL 204
RC LVYG + IL
Sbjct: 149 RCAACLVYGKHIAIL 163
Score = 43 (20.2 bits), Expect = 1.6e-53, Sum P(3) = 1.6e-53
Identities = 9/40 (22%), Positives = 19/40 (47%)
Query: 444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE 483
+++ E G+ +T++ +R DA+Q GE+
Sbjct: 720 TIMEQNFPVENGEATIKQSNTRKRKRLGHDAIQSSRGGEQ 759
Score = 37 (18.1 bits), Expect = 3.4e-43, Sum P(3) = 3.4e-43
Identities = 8/20 (40%), Positives = 11/20 (55%)
Query: 45 ELPSKRGIGPVPNLVVTAAN 64
EL R +GPV ++ V N
Sbjct: 442 ELDRLRNVGPVKSMCVGRPN 461
>UNIPROTKB|K7GNU1 [details] [associations]
symbol:CPSF1 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0005634 "nucleus" evidence=IEA] [GO:0003676 "nucleic
acid binding" evidence=IEA] InterPro:IPR004871 Pfam:PF03178
GeneTree:ENSGT00550000075040 EMBL:CU468594
Ensembl:ENSSSCT00000033207 Uniprot:K7GNU1
Length = 757
Score = 777 (278.6 bits), Expect = 3.9e-83, Sum P(2) = 3.9e-83
Identities = 185/524 (35%), Positives = 279/524 (53%)
Query: 913 RITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIY 971
R F++I G+ G F+ G P W +V R LR+HP DG I +F HNVNC GF+Y
Sbjct: 244 RFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLY 303
Query: 972 VTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQ 1031
QG L+I LP+ +YD WPV+K IPL+ T H + Y E +Y + S P +
Sbjct: 304 FNRQGELRISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR 360
Query: 1032 VLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSS 1089
+ + +++ ID + +H E + ++++ P W+ A I ++
Sbjct: 361 IPRMTGEEKEFETIDRDDRY---IHPQQ--EAFSIQLISPVS----WEAIPNARIELEEW 411
Query: 1090 ENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE 1148
E+ ++ V+L + T + +A GT +QGE+V RGR+L+ P +T+
Sbjct: 412 EHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTK 471
Query: 1149 -----VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLN 1203
+Y KE KG ++AL GHL+ A G KI L +EL G+AF D LY+ +
Sbjct: 472 NKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQ-LYIHQMI 530
Query: 1204 IVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQK 1263
VKNFIL D+ KSI L ++E+ L+L+++D L+ ++ +F++D + L +VSD +
Sbjct: 531 SVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDR 590
Query: 1264 NIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT--- 1320
N+ ++ Y P+ ES+ G +LL RA+FHVGAHV F R GA G K
Sbjct: 591 NLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPC-------RGATDGPSKKSVV 643
Query: 1321 --NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK 1378
N+ F TLDG IG + P+ E T+RRL LQ L +PH AGLNPR+FR H + +
Sbjct: 644 WENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRR 703
Query: 1379 AHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNL 1422
+ +++D ELL+ Y L E+ E+A + GTT IL +L
Sbjct: 704 VLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDL 747
Score = 94 (38.1 bits), Expect = 3.9e-83, Sum P(2) = 3.9e-83
Identities = 27/98 (27%), Positives = 47/98 (47%)
Query: 759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
E+G +EI+ +P++ VF V F G+ +VD+ + E EE T QG
Sbjct: 104 ENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEAR----KEEATRQGELPL 159
Query: 819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
+ + +V L ++ RP+L + D +L Y+A+
Sbjct: 160 VKEVLLVALGSRQ-----RRPYLL-VHVDQELLIYEAF 191
>POMBASE|SPBC1709.08 [details] [associations]
symbol:cft1 "cleavage factor one Cft1 (predicted)"
species:4896 "Schizosaccharomyces pombe" [GO:0003723 "RNA binding"
evidence=IEA] [GO:0005634 "nucleus" evidence=IDA] [GO:0005829
"cytosol" evidence=IDA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IDA]
[GO:0005849 "mRNA cleavage factor complex" evidence=ISO]
[GO:0006378 "mRNA polyadenylation" evidence=ISO] [GO:0006379 "mRNA
cleavage" evidence=ISO] InterPro:IPR004871 Pfam:PF03178
PomBase:SPBC1709.08 GO:GO:0005829 EMBL:CU329671 GO:GO:0006378
GenomeReviews:CU329671_GR GO:GO:0003723 eggNOG:COG5161 KO:K14401
OMA:HNDRIFQ OrthoDB:EOG451HZS PIR:T39636 RefSeq:NP_595441.1
STRING:O74733 EnsemblFungi:SPBC1709.08.1 GeneID:2539694
KEGG:spo:SPBC1709.08 NextBio:20800847 GO:GO:0005847 GO:GO:0006379
Uniprot:O74733
Length = 1441
Score = 509 (184.2 bits), Expect = 2.0e-65, Sum P(3) = 2.0e-65
Identities = 155/623 (24%), Positives = 280/623 (44%)
Query: 821 SMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPXXXXXXXXXX 880
S ++VEL + P LF I Y+A+L+ NT K +
Sbjct: 848 SQELVELLVADLGDDFKEPHLFLRSRLNEITVYKAFLYS---NTDKHKNLLAFAKVPQET 904
Query: 881 XXXXXXXXXLRFSRTPLDAYTREETPHGAPCQ--RITIFKNISGHQGFFLSGSRPCWCM- 937
TP DA + E + ++T + + H F++G +P +
Sbjct: 905 MTREFQANV----GTPRDAESTMEKKASSSVDHLKMTALEVVGNHSAVFVTGRKPFLILS 960
Query: 938 VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQK 997
+ P + I++ H + G+IYV ++IC+ YDN WP +K
Sbjct: 961 TLHSNAKFFPISSNIPILSVAPFHAHHAPQGYIYVDENSFIRICKFQEDFEYDNKWPYKK 1020
Query: 998 VIPLKATPHQITYFAEKNLYPLIVSVPV-LKPLNQVLSLLIDQEVGHQIDNHNLSSVDLH 1056
V L + I Y K +Y + +VP+ K ++ D + I + N + +
Sbjct: 1021 V-SLGKQINGIAYHPTKMVYAVGSAVPIEFKVTDE------DGNEPYAITDDN-DYLPMA 1072
Query: 1057 RTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL-FNTTTKENETLLAIG 1115
T +++ ++ P W + Q E L+V +V L + TTK + +A+G
Sbjct: 1073 NTGSLD-----LVSPLT----WTVIDSYEFQQFEIPLSVALVNLEVSETTKLRKPYIAVG 1123
Query: 1116 TAYVQGEDVAARGRVLLFSTGRNADNPQNLVTE-----VYSKELKGAISALASLQGHLLI 1170
T+ +GED+A RG LF P T V +E+KG ++ + + G+LL
Sbjct: 1124 TSITKGEDIAVRGSTYLFEIIDVVPQPGRPETRHKLKLVTREEIKGTVAVVCEVDGYLLS 1183
Query: 1171 ASGPKIILHKWTGTE-LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQ 1229
G K+I+ + L G++F D Y +S ++N +L GD+ +++ F+ + E+ +
Sbjct: 1184 GQGQKVIVRALEDEDHLVGVSFIDLGS-YTLSAKCLRNLLLFGDVRQNVTFVGFAEEPYR 1242
Query: 1230 LNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEF 1289
+ L +K +L+ A +FL+ G L VV+D N+++ Y P+ ES G++L++R +F
Sbjct: 1243 MTLFSKGQEALNVSAADFLVQGENLYFVVADTSGNLRLLAYDPENPESHSGERLVTRGDF 1302
Query: 1290 HVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQ 1349
H+G +T + +L A G D + F+ + DG + + P+ + +RRL
Sbjct: 1303 HIGNVITA---MTILPKEKKHQNAEYGYDTGDDFSCVMVNSDGGLQMLVPISDRVYRRLN 1359
Query: 1350 SLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAH 1409
+Q L + V + GLNP+S+R S P I+D L+ ++ + + + E+AH
Sbjct: 1360 IIQNYLANRVNTIGGLNPKSYRLITSPSNLTNP-TRRILDGMLIDYFTYMSVAHRHEMAH 1418
Query: 1410 QTGTTRSQILSNLNDLALGTSFL 1432
+ G S I+++L +L S++
Sbjct: 1419 KCGVPVSTIMNDLVELDEALSYM 1441
Score = 268 (99.4 bits), Expect = 2.0e-65, Sum P(3) = 2.0e-65
Identities = 117/467 (25%), Positives = 200/467 (42%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L LV ++ G + ++ L G++ D +I+ + AK+S LE+D S+H
Sbjct: 92 LRLVSQVKVFGTITEISALKGKGSNGC---DLLIMLTDYAKVSTLEWDMQSQSFVTNSLH 148
Query: 161 CFESPEWLHLKRGRESFARGPL-VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDT 219
+E +K + P + VDP C +L + M+ + L +E
Sbjct: 149 YYED-----VKSSNICSSHTPTQLLVDPDSDCC-LLRFLTDMMAIIPYPANEDLDMEEAA 202
Query: 220 F-GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSW 276
S S + S V+ LD + + D F++GY EP + IL+ E T +
Sbjct: 203 IENSKISSSYAYKPSFVLASSQLDASISRILDVKFLYGYREPTLAILYSPEQTSTVTLPL 262
Query: 277 KHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY-HSQS 335
+ T + S +++ + +I + +LP+D Y +++P+P+GG L++G N + Y S
Sbjct: 263 RKDTVLFSLVTLDLEQRASAVITTIQSLPYDIYASVSIPTPLGGSLLLGGNELIYVDSAG 322
Query: 336 ASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQND------VALLSTKTGDLVLL 389
+ + +N+Y +S F++EL+ A L + V L+ T +G L
Sbjct: 323 RTVGIGVNSYYSKCTDFPLQDQSDFNLELEGTIAIPLTSSKTETPFVVLVHT-SGQFFYL 381
Query: 390 TVVYDGRVVQRLDLS----KTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCGSG 442
+ DG+ V+ L L + N L S IT G +L FLGS+ DS L++++
Sbjct: 382 DFLLDGKSVKGLSLQALDLEINDDFLKSGITCAVPAGENLVFLGSQTTDSYLLRWS--RR 439
Query: 443 TSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS 502
T+ L E GD D L ++ + DM++ E +
Sbjct: 440 TTNEEVRLDE--GD---DT-----LYGTNDAEMDDMLDIYETDESVGSKRKIAYENGPLR 489
Query: 503 FAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG 549
+ D L NIGP+ DF+ G A + Q N+ +EL G G
Sbjct: 490 LEICDVLTNIGPITDFAVG-----KAGSYSYFPQDNHGPLELVGTAG 531
Score = 151 (58.2 bits), Expect = 1.5e-54, Sum P(2) = 1.5e-54
Identities = 101/432 (23%), Positives = 181/432 (41%)
Query: 285 ALSISTTLKQHPL-IWSAMNLPHD--------AYKLLAVPSPIGGVLVVGANTIHYHSQS 335
A ++ TT++ P I++++++P +L+ V S G + +G N+ Y+S+
Sbjct: 280 ASAVITTIQSLPYDIYASVSIPTPLGGSLLLGGNELIYVDSA-GRTVGIGVNS--YYSKC 336
Query: 336 ASCALA-LNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD 394
L +++ + L+ + +P +S E L D +L
Sbjct: 337 TDFPLQDQSDFNLELEGTIAIPLTSSKTETPFVVLVHTSGQFFYL-----DFLLDGKSVK 391
Query: 395 GRVVQRLDLSKTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFT---------CGSG 442
G +Q LDL + N L S IT G +L FLGS+ DS L++++ G
Sbjct: 392 GLSLQALDL-EINDDFLKSGITCAVPAGENLVFLGSQTTDSYLLRWSRRTTNEEVRLDEG 450
Query: 443 TSMLSSGLKEEFGDI----EAD-APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
L E D+ E D + +KR + L+ + + L+ G ++
Sbjct: 451 DDTLYGTNDAEMDDMLDIYETDESVGSKRKIAYENGPLRLEIC-DVLTNIGPITDFAVGK 509
Query: 498 QKTFSFAVRDSLVNIGPLKDFSYGLRINAD-ASATGISKQSNYELV----ELPGCKGIWT 552
++S+ +D N GPL+ G AD A + +++ + L+ + GC+ +WT
Sbjct: 510 AGSYSYFPQD---NHGPLE--LVGTA-GADGAGGLVVFRRNIFPLIAGEFQFDGCEALWT 563
Query: 553 VYHKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
V S + N S A Y + E YL++S E + + + EV S D+ +T
Sbjct: 564 V-SISGKLRNMKSRIQAQYSNPELETYLVLSKEKESFIFLAGETFDEVQHS-DFSKDSKT 621
Query: 612 IAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPXXXXXXXXXXXXTVLSVSIADPY 670
+ G+L R++Q+ R+ D + +TQ +F V+S SI DP
Sbjct: 622 LNVGSLLSGMRMVQICPTSLRVYDSNLRLTQLFNFSKKQI----------VVSTSICDPC 671
Query: 671 VLLGMSDGSIRL 682
+++ G I L
Sbjct: 672 IIVVFLGGGIAL 683
Score = 38 (18.4 bits), Expect = 2.0e-65, Sum P(3) = 2.0e-65
Identities = 8/18 (44%), Positives = 13/18 (72%)
Query: 527 DASATGISKQSNYELVEL 544
++ T +K+S+ ELVEL
Sbjct: 837 ESERTYFNKESSQELVEL 854
Score = 38 (18.4 bits), Expect = 1.6e-15, Sum P(3) = 1.6e-15
Identities = 7/18 (38%), Positives = 12/18 (66%)
Query: 670 YVLLGMSDGSIRLLVGDP 687
Y ++ + G++RLL DP
Sbjct: 1268 YFVVADTSGNLRLLAYDP 1285
>ASPGD|ASPL0000050546 [details] [associations]
symbol:AN1413 species:162425 "Emericella nidulans"
[GO:0008150 "biological_process" evidence=ND] [GO:0005847 "mRNA
cleavage and polyadenylation specificity factor complex"
evidence=IEA] [GO:0005829 "cytosol" evidence=IEA] [GO:0003676
"nucleic acid binding" evidence=IEA] InterPro:IPR004871
Pfam:PF03178 GO:GO:0005634 EMBL:BN001307 GO:GO:0006397
GO:GO:0003723 eggNOG:COG5161 KO:K14401 EMBL:AACD01000022
RefSeq:XP_659017.1 EnsemblFungi:CADANIAT00008024 GeneID:2875502
KEGG:ani:AN1413.2 HOGENOM:HOG000048586 OMA:HNDRIFQ
OrthoDB:EOG451HZS Uniprot:Q5BDG7
Length = 1339
Score = 459 (166.6 bits), Expect = 2.4e-55, Sum P(2) = 2.4e-55
Identities = 149/536 (27%), Positives = 261/536 (48%)
Query: 914 ITIFKNISGHQGFFLSGSRPCWCMVFR-ERLRVH-PQLCDGSIVAFTVLHNVNCNHGFIY 971
+ I NI+G F+ G P +FR H +L G I + + GF Y
Sbjct: 829 LRILPNIAGCSSIFMPG--PSAGFIFRASTTSPHFIRLRGGFIKGLGCFDSPD--KGFAY 884
Query: 972 VTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQ 1031
+ S G L + +LP G+ W + + +P+ ++TY + + Y VL +
Sbjct: 885 LDSHG-LHLAKLPEGTQLGYPW-IMRTVPIGQQIDKLTYVSASDTY-------VLGTCQR 935
Query: 1032 V-LSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSE 1090
L D E+ + N +S + V + ++++ P W + P++ +E
Sbjct: 936 CEFRLPEDDELHPEWRNEEISFLP-----EVNQSSLKVVSPKT----WSVIDSYPLEPAE 986
Query: 1091 NALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVT-- 1147
+ + ++ ++L + T E ++ +GT+ +GED+ +RG + +F +P+ T
Sbjct: 987 HIMVMKTMSLEVSENTHERRDMIVVGTSLARGEDIPSRGCIYVFEVIEVVPDPEQPETNR 1046
Query: 1148 --EVYSKE-LKGAISALASL--QGHLLIASGPKIILH--KWTGTELNGIAFYDAPPLYVV 1200
++ KE +KGA++AL+ + QG L+ A G K ++ K G+ L +AF D +V
Sbjct: 1047 RLKLIGKEPVKGAVTALSEIGGQGFLIAAQGQKSMVRGLKEDGSLLP-VAFMDMQ-CFVS 1104
Query: 1201 SLNIVKN--FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 1258
+ +K + GD K ++F + E+ +++L AKD L+ A +FL DG+ L +VV
Sbjct: 1105 VIKELKGTGMCIFGDAVKGLWFAGYSEEPYKMSLFAKDLDYLEVLAADFLPDGNKLFIVV 1164
Query: 1259 SDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSD 1318
+D N+ + Y P+ S G KLL+R++FH G + L SS+R A GSD
Sbjct: 1165 ADSDCNLYVLQYDPEDPNSSNGDKLLNRSKFHTGNFASTVTLLPRTLVSSER--AMSGSD 1222
Query: 1319 KTN--RFALLFGTL----DGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1372
K + A L L +GSIG + + E ++RRL +LQ +L +++ H GLNPR++R
Sbjct: 1223 KMDIDNTAPLHQVLVTSHNGSIGLVTCVPEESYRRLSALQSQLTNTLEHPCGLNPRAYRA 1282
Query: 1373 FHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1428
S+ A R ++D LL Y + + + EIA + G T +I ++L ++ G
Sbjct: 1283 VESDASAGR----GMLDSNLLLQYLDMSKQRKAEIAGRVGATEWEIRADLEAISGG 1334
Score = 209 (78.6 bits), Expect = 2.4e-55, Sum P(2) = 2.4e-55
Identities = 117/503 (23%), Positives = 202/503 (40%)
Query: 240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
D + H F++ Y EP IL+ + T + + + +++ + +
Sbjct: 224 DPSVIHPISLAFLYEYREPTFGILYSQVATSHALLHERKDVVFYTVITLDLEQRASTTLL 283
Query: 300 SAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRS 358
S LP D +K++A+P P+GG L++G+N +H + A+ +N ++ S +S
Sbjct: 284 SVTRLPSDLFKVVALPPPVGGSLLIGSNELVHIDQAGKTNAVGVNEFSRQASSFSMTDQS 343
Query: 359 SFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLD---LSKTNPSVLTS 413
++ L+ +D LL+ TG L++ DGR V + LS + L S
Sbjct: 344 DLALRLENCVVERFSDDNGDLLLALSTGVFALVSFKLDGRSVSGISVRPLSGPSKEFLAS 403
Query: 414 DITT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
++ +GN F GS DS+L+ ++ S + S + E DA L S
Sbjct: 404 TASSSAFLGNGKVFFGSESADSVLLGWSSASSATKKSFSGSTSNDESEDDAYEDD-LYSS 462
Query: 471 SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASA 530
+ A+ D N + SN++ +A + D L + GP++D G A +
Sbjct: 463 APAAMTD--NPQN-----QPSNSSVAAFG--DLRIHDRLSSPGPIRDIVLGRSSEASSRD 513
Query: 531 T--GI----SKQSNYELVELPGCKGIWTVYHKSSRGHN-ADS----SRMAAYDDEYHAYL 579
T G+ + Q + E + K Y +S + A+S S + +D+ Y+
Sbjct: 514 TKDGVLELVAAQGSDEGGTMVIMKREVDPYLVASMAADTANSLWTVSLLPDNNDQKRDYV 573
Query: 580 IISL-------EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
I+S E+ VLE D L +T T+ G L + RVIQV R
Sbjct: 574 ILSKQEKPDKEESEVFVLE--DKLRPITAPEFNPNHELTVEIGTLASKSRVIQVLRNEVR 631
Query: 633 ILDGSYMTQDLSFGPXXXXXXXXXXXXTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTV 692
D + D ++ ++ DPY+ + D ++ LL D S
Sbjct: 632 SYDAVWDEDD-------------SDERVAVNATLVDPYLAIIRDDSTLLLLQADDSGDLD 678
Query: 693 SVQTPAAIESSKKPVSSCTLYHD 715
V + S K +S+C Y D
Sbjct: 679 EVTLSEDVVSQKW-LSAC-FYSD 699
Score = 150 (57.9 bits), Expect = 3.8e-49, Sum P(2) = 3.8e-49
Identities = 135/596 (22%), Positives = 234/596 (39%)
Query: 44 SELPSKRGIG---PVPNLVVTAANVIEIYVVRVQXXXXXXXXXXXX-TKRRVLMDGISAA 99
+EL S G+ VP L TA N+I +Q T+ R
Sbjct: 5 TELISPTGVTHALAVPFLSATANNLIVARTSLLQIFSLRDVSLSALDTEVRPAQHRQETC 64
Query: 100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
L L Y+L G V + + D++++AF DAK+S++E+D +GL S+
Sbjct: 65 KLVLEREYQLPGTVTDICRVKI--LKTKSGGDAVLVAFRDAKLSLVEWDPERYGLSTISI 122
Query: 160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGDED 218
H +E + + G ++ DP RC + +G + + I+ Q G LV D+
Sbjct: 123 HYYERDDMTRSPWASDLSTCGSILSADPGSRCA-IFQFGARSLAIIPFHQPGDDLVMDD- 180
Query: 219 TFGSGGGFSARIES---SHVINLRDL---DMKHVKDFIFVHGYIEPVMVILHERELTWAG 272
FGS + R+E SH +D + F+ ++P ++H L +
Sbjct: 181 -FGSEPDYENRVEGNSRSHEAKDKDAAEYQTPYASSFVLPLTALDPS--VIHPISLAFLY 237
Query: 273 RVSWKHHTCMISALSISTTL---KQHPLIWSAMNLPHD---AYKLLAV---PSPIGGVLV 323
+ S ++ S L ++ + ++ + L + + LL+V PS + V+
Sbjct: 238 EYREPTFGILYSQVATSHALLHERKDVVFYTVITLDLEQRASTTLLSVTRLPSDLFKVVA 297
Query: 324 ----VGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA-L 378
VG + + ++ A AV ++ SSFS+ + A L+N V
Sbjct: 298 LPPPVGGSLLIGSNELVHIDQAGKTNAVGVNEFSR-QASSFSMTDQSDLALRLENCVVER 356
Query: 379 LSTKTGDLVL-LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
S GDL+L L+ V +LD ++ + ++ G S FL S S +
Sbjct: 357 FSDDNGDLLLALSTGVFALVSFKLD-GRSVSGISVRPLS--GPSKEFLASTASSSAFL-- 411
Query: 438 TCGSGTSMLSSGLKEE--FGDIEADAPSTKRLRRSSS-DALQDMVNGEELSLYGSA-SNN 493
G+G S + G A + + K S+S D +D + E LY SA +
Sbjct: 412 --GNGKVFFGSESADSVLLGWSSASSATKKSFSGSTSNDESED--DAYEDDLYSSAPAAM 467
Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTV 553
T++ Q S S+ G L+ R+++ I + E G+ +
Sbjct: 468 TDNPQNQPS---NSSVAAFGDLRIHD---RLSSPGPIRDIVLGRSSEASSRDTKDGVLEL 521
Query: 554 YHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM-VLETADLLTEVTESV-DYFV 607
+++G + + M E YL+ S+ A T L T LL + + DY +
Sbjct: 522 V--AAQGSD-EGGTMVIMKREVDPYLVASMAADTANSLWTVSLLPDNNDQKRDYVI 574
Score = 49 (22.3 bits), Expect = 1.4e-05, Sum P(2) = 1.4e-05
Identities = 17/43 (39%), Positives = 23/43 (53%)
Query: 588 MVLETADL-LTEVT-ESVDYFVQGRTIAAGNLFGRRRVIQVFE 628
MV++T L ++E T E D V G ++A G R I VFE
Sbjct: 989 MVMKTMSLEVSENTHERRDMIVVGTSLARGEDIPSRGCIYVFE 1031
>CGD|CAL0004251 [details] [associations]
symbol:orf19.2760 species:5476 "Candida albicans" [GO:0042493
"response to drug" evidence=IMP] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IEA]
[GO:0005829 "cytosol" evidence=IEA] [GO:0005739 "mitochondrion"
evidence=IEA] [GO:0006369 "termination of RNA polymerase II
transcription" evidence=IEA] [GO:0006379 "mRNA cleavage"
evidence=IEA] [GO:0006378 "mRNA polyadenylation" evidence=IEA]
[GO:0003723 "RNA binding" evidence=IEA] InterPro:IPR004871
Pfam:PF03178 CGD:CAL0004251 GO:GO:0005634 GO:GO:0042493
GO:GO:0006397 GO:GO:0003723 EMBL:AACQ01000023 EMBL:AACQ01000025
RefSeq:XP_720278.1 RefSeq:XP_720279.1 RefSeq:XP_720280.1
RefSeq:XP_720510.1 STRING:Q5AFT3 GeneID:3637848 GeneID:3638158
GeneID:3638159 GeneID:3638160 KEGG:cal:CaO19.10274
KEGG:cal:CaO19.10275 KEGG:cal:CaO19.10276 KEGG:cal:CaO19.2760
eggNOG:COG5161 KO:K14401 Uniprot:Q5AFT3
Length = 1420
Score = 321 (118.1 bits), Expect = 7.8e-43, Sum P(3) = 7.8e-43
Identities = 115/526 (21%), Positives = 229/526 (43%)
Query: 906 PHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVFRERL-RVHPQLCDGSIVAFTVLHNV 963
P+G +R + F N++G F++G P + + R+ Q + ++ + +
Sbjct: 875 PYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKTVHSIPRIF-QFSKIAAMSISAFSDS 933
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIV-- 1021
+G I++ +Q +IC+LP Y+ P+ K + + + I Y + L
Sbjct: 934 KIKNGLIFLDNQQNARICELPLDFNYEFNLPM-KHVDIGESIKSIAYHETSDTVVLSTFK 992
Query: 1022 SVPV--LKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQ 1079
+P L + ++ +I +++ S+ L Y E L + G +
Sbjct: 993 QIPYDCLDEEGKPIAGII-KDIKDTPAMSFKGSIKLVSPYNWTVIETIELGDNEVGMTLK 1051
Query: 1080 TRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNA 1139
+ S + L +L K+ + IG + ED+AA G ++
Sbjct: 1052 SMILDVGSESGSTLGSDPNSLIKKYNKKKREYIVIGIGKYRMEDLAANGIFKIYEIIDII 1111
Query: 1140 DNPQNLVT-----EVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1194
P T E++ +E +GAI+++ L G L++ G K+I+ +AF D
Sbjct: 1112 PEPGKPETNHKFKEIFKEETRGAITSICELSGRFLVSQGQKVIVRDLQDDGTVPVAFLDT 1171
Query: 1195 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1254
P +YV N ++LGD+ K + + + + ++ +L KD + +F+I+ +
Sbjct: 1172 P-VYVSESKSFGNLLILGDLLKGCWLVGFDAEPFRMIMLGKDTQHISVECADFIINDDEI 1230
Query: 1255 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQML----ATSSDR 1310
++V+D + + Y P +S G KLL++A F + + ++ L ++ + +D
Sbjct: 1231 FVLVADNNNVLHLLNYDPDDPQSINGTKLLTKASFELNSTISCLRSLPLIDIEESVQTDA 1290
Query: 1311 -TGAA-----PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAG 1364
T A P + +N F ++ T DGS + P++E +RR+ LQ++L+D H G
Sbjct: 1291 LTNIAVPPPLPPNTTSNYFQVIGSTQDGSFFNVFPINEAAYRRMYILQQQLIDKEFHYCG 1350
Query: 1365 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1410
LNPR R + + I+D +L+ + L + + +A++
Sbjct: 1351 LNPRLNRIGSIKLQNNETNTKPILDYDLIRSFTKLSDDRKRNLANK 1396
Score = 224 (83.9 bits), Expect = 7.8e-43, Sum P(3) = 7.8e-43
Identities = 77/312 (24%), Positives = 138/312 (44%)
Query: 231 ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
+SS +I+ LD + V D F+H Y EP + +L ++ WAG + L++
Sbjct: 217 DSSFIIDATTLDSSIDTVVDMQFLHNYREPTIAVLSSKQEVWAGNLIKSKDNIQFQVLTL 276
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA- 346
LK ++ NLP++ +++ +PSP+ G L+VG N IH + +A+N +
Sbjct: 277 DLNLKSTISVFKIDNLPYEIDRIIPLPSPLNGTLLVGCNELIHVDNGGVLKRIAVNKFTR 336
Query: 347 ---VSLDSSQELPRSSFSVELDAAHATWLQNDV-ALLSTKTGDLVLLTVVYDGRVVQRLD 402
S S Q+ +S +++L+ + +D LL +TG+ + DG+ ++R+
Sbjct: 337 LITASFKSFQD--QSDLNLKLENCSVVPIPDDHRVLLILQTGEFYFINFELDGKSIKRIH 394
Query: 403 LS----KTNPSVLTS---DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
+ KT + + ++ + ++ F+ + G+S L+Q +S S + +
Sbjct: 395 IDNVDKKTYDKIQLNHPGEVAILDKNMLFIANSNGNSPLIQVRYRD-SSKTSDTKESKLN 453
Query: 456 DIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS-----FAVRDSLV 510
IE K + D D +E LY E QKT S F D L+
Sbjct: 454 KIEE-----KEDNKDDDDNDDD----DEDDLY--KEEEEEETQKTISKSHIEFLYHDELI 502
Query: 511 NIGPLKDFSYGL 522
N GP F+ G+
Sbjct: 503 NNGPSSTFTLGI 514
Score = 61 (26.5 bits), Expect = 7.8e-43, Sum P(3) = 7.8e-43
Identities = 14/63 (22%), Positives = 35/63 (55%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L+L+ ++L G + L + +N D ++++ + AK S++++D ++ + S+H
Sbjct: 57 LKLIDQFKLQGTITDLKSIRT--IENPNL-DYLMVSTKYAKFSIIKWDHHLNTIATVSLH 113
Query: 161 CFE 163
+E
Sbjct: 114 YYE 116
Score = 38 (18.4 bits), Expect = 1.8e-22, Sum P(2) = 1.8e-22
Identities = 10/36 (27%), Positives = 17/36 (47%)
Query: 112 NVESLAILSQGGADNSRRRDSIILAFEDAKISVLEF 147
N ++ILS G D+ + + I + + S L F
Sbjct: 528 NYNEVSILSNAGTDSQTKLNIITPTIQPSISSSLTF 563
Score = 38 (18.4 bits), Expect = 1.8e-22, Sum P(2) = 1.8e-22
Identities = 11/44 (25%), Positives = 18/44 (40%)
Query: 6 YKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSK 49
Y++ H N GF +++ P I I EL + +K
Sbjct: 788 YQLNHVDKFTENLSLGFFDPNQSTVDPFIKQIMLNELGDKFDTK 831
>UNIPROTKB|Q5AFT3 [details] [associations]
symbol:CFT1 "Protein CFT1" species:237561 "Candida albicans
SC5314" [GO:0042493 "response to drug" evidence=IMP]
InterPro:IPR004871 Pfam:PF03178 CGD:CAL0004251 GO:GO:0005634
GO:GO:0042493 GO:GO:0006397 GO:GO:0003723 EMBL:AACQ01000023
EMBL:AACQ01000025 RefSeq:XP_720278.1 RefSeq:XP_720279.1
RefSeq:XP_720280.1 RefSeq:XP_720510.1 STRING:Q5AFT3 GeneID:3637848
GeneID:3638158 GeneID:3638159 GeneID:3638160 KEGG:cal:CaO19.10274
KEGG:cal:CaO19.10275 KEGG:cal:CaO19.10276 KEGG:cal:CaO19.2760
eggNOG:COG5161 KO:K14401 Uniprot:Q5AFT3
Length = 1420
Score = 321 (118.1 bits), Expect = 7.8e-43, Sum P(3) = 7.8e-43
Identities = 115/526 (21%), Positives = 229/526 (43%)
Query: 906 PHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVFRERL-RVHPQLCDGSIVAFTVLHNV 963
P+G +R + F N++G F++G P + + R+ Q + ++ + +
Sbjct: 875 PYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKTVHSIPRIF-QFSKIAAMSISAFSDS 933
Query: 964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIV-- 1021
+G I++ +Q +IC+LP Y+ P+ K + + + I Y + L
Sbjct: 934 KIKNGLIFLDNQQNARICELPLDFNYEFNLPM-KHVDIGESIKSIAYHETSDTVVLSTFK 992
Query: 1022 SVPV--LKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQ 1079
+P L + ++ +I +++ S+ L Y E L + G +
Sbjct: 993 QIPYDCLDEEGKPIAGII-KDIKDTPAMSFKGSIKLVSPYNWTVIETIELGDNEVGMTLK 1051
Query: 1080 TRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNA 1139
+ S + L +L K+ + IG + ED+AA G ++
Sbjct: 1052 SMILDVGSESGSTLGSDPNSLIKKYNKKKREYIVIGIGKYRMEDLAANGIFKIYEIIDII 1111
Query: 1140 DNPQNLVT-----EVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDA 1194
P T E++ +E +GAI+++ L G L++ G K+I+ +AF D
Sbjct: 1112 PEPGKPETNHKFKEIFKEETRGAITSICELSGRFLVSQGQKVIVRDLQDDGTVPVAFLDT 1171
Query: 1195 PPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTL 1254
P +YV N ++LGD+ K + + + + ++ +L KD + +F+I+ +
Sbjct: 1172 P-VYVSESKSFGNLLILGDLLKGCWLVGFDAEPFRMIMLGKDTQHISVECADFIINDDEI 1230
Query: 1255 SLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQML----ATSSDR 1310
++V+D + + Y P +S G KLL++A F + + ++ L ++ + +D
Sbjct: 1231 FVLVADNNNVLHLLNYDPDDPQSINGTKLLTKASFELNSTISCLRSLPLIDIEESVQTDA 1290
Query: 1311 -TGAA-----PGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAG 1364
T A P + +N F ++ T DGS + P++E +RR+ LQ++L+D H G
Sbjct: 1291 LTNIAVPPPLPPNTTSNYFQVIGSTQDGSFFNVFPINEAAYRRMYILQQQLIDKEFHYCG 1350
Query: 1365 LNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQ 1410
LNPR R + + I+D +L+ + L + + +A++
Sbjct: 1351 LNPRLNRIGSIKLQNNETNTKPILDYDLIRSFTKLSDDRKRNLANK 1396
Score = 224 (83.9 bits), Expect = 7.8e-43, Sum P(3) = 7.8e-43
Identities = 77/312 (24%), Positives = 138/312 (44%)
Query: 231 ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
+SS +I+ LD + V D F+H Y EP + +L ++ WAG + L++
Sbjct: 217 DSSFIIDATTLDSSIDTVVDMQFLHNYREPTIAVLSSKQEVWAGNLIKSKDNIQFQVLTL 276
Query: 289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA- 346
LK ++ NLP++ +++ +PSP+ G L+VG N IH + +A+N +
Sbjct: 277 DLNLKSTISVFKIDNLPYEIDRIIPLPSPLNGTLLVGCNELIHVDNGGVLKRIAVNKFTR 336
Query: 347 ---VSLDSSQELPRSSFSVELDAAHATWLQNDV-ALLSTKTGDLVLLTVVYDGRVVQRLD 402
S S Q+ +S +++L+ + +D LL +TG+ + DG+ ++R+
Sbjct: 337 LITASFKSFQD--QSDLNLKLENCSVVPIPDDHRVLLILQTGEFYFINFELDGKSIKRIH 394
Query: 403 LS----KTNPSVLTS---DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
+ KT + + ++ + ++ F+ + G+S L+Q +S S + +
Sbjct: 395 IDNVDKKTYDKIQLNHPGEVAILDKNMLFIANSNGNSPLIQVRYRD-SSKTSDTKESKLN 453
Query: 456 DIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS-----FAVRDSLV 510
IE K + D D +E LY E QKT S F D L+
Sbjct: 454 KIEE-----KEDNKDDDDNDDD----DEDDLY--KEEEEEETQKTISKSHIEFLYHDELI 502
Query: 511 NIGPLKDFSYGL 522
N GP F+ G+
Sbjct: 503 NNGPSSTFTLGI 514
Score = 61 (26.5 bits), Expect = 7.8e-43, Sum P(3) = 7.8e-43
Identities = 14/63 (22%), Positives = 35/63 (55%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L+L+ ++L G + L + +N D ++++ + AK S++++D ++ + S+H
Sbjct: 57 LKLIDQFKLQGTITDLKSIRT--IENPNL-DYLMVSTKYAKFSIIKWDHHLNTIATVSLH 113
Query: 161 CFE 163
+E
Sbjct: 114 YYE 116
Score = 38 (18.4 bits), Expect = 1.8e-22, Sum P(2) = 1.8e-22
Identities = 10/36 (27%), Positives = 17/36 (47%)
Query: 112 NVESLAILSQGGADNSRRRDSIILAFEDAKISVLEF 147
N ++ILS G D+ + + I + + S L F
Sbjct: 528 NYNEVSILSNAGTDSQTKLNIITPTIQPSISSSLTF 563
Score = 38 (18.4 bits), Expect = 1.8e-22, Sum P(2) = 1.8e-22
Identities = 11/44 (25%), Positives = 18/44 (40%)
Query: 6 YKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSK 49
Y++ H N GF +++ P I I EL + +K
Sbjct: 788 YQLNHVDKFTENLSLGFFDPNQSTVDPFIKQIMLNELGDKFDTK 831
>SGD|S000002709 [details] [associations]
symbol:CFT1 "RNA-binding subunit of the mRNA cleavage and
polyadenylation factor" species:4932 "Saccharomyces cerevisiae"
[GO:0005634 "nucleus" evidence=IEA;IDA] [GO:0003723 "RNA binding"
evidence=IEA;IDA] [GO:0006397 "mRNA processing" evidence=IEA]
[GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005739
"mitochondrion" evidence=IDA] [GO:0005847 "mRNA cleavage and
polyadenylation specificity factor complex" evidence=IDA;IPI]
[GO:0006369 "termination of RNA polymerase II transcription"
evidence=IMP] [GO:0006378 "mRNA polyadenylation" evidence=IDA;TAS]
[GO:0006379 "mRNA cleavage" evidence=IDA;TAS] [GO:0005849 "mRNA
cleavage factor complex" evidence=IPI] InterPro:IPR004871
Pfam:PF03178 SGD:S000002709 GO:GO:0005739 GO:GO:0006378
EMBL:BK006938 GO:GO:0003723 EMBL:U28374 eggNOG:COG5161 KO:K14401
OMA:HNDRIFQ GO:GO:0005847 GO:GO:0006379 PIR:S61187
RefSeq:NP_010587.1 ProteinModelPortal:Q06632 DIP:DIP-2467N
IntAct:Q06632 MINT:MINT-375530 STRING:Q06632 PaxDb:Q06632
PeptideAtlas:Q06632 EnsemblFungi:YDR301W GeneID:851895
KEGG:sce:YDR301W CYGD:YDR301w GeneTree:ENSGT00550000075040
HOGENOM:HOG000246682 OrthoDB:EOG4D29XZ NextBio:969889
Genevestigator:Q06632 GermOnline:YDR301W GO:GO:0006369
Uniprot:Q06632
Length = 1357
Score = 278 (102.9 bits), Expect = 1.5e-28, Sum P(4) = 1.5e-28
Identities = 80/346 (23%), Positives = 155/346 (44%)
Query: 1078 WQT--RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFST 1135
W+ + P S N + ++ + + T ++ E ++A G A ED G ++
Sbjct: 1001 WKVIDKIDFPKNSVVNEMRSSMIQINSKTKRKREYIIA-GVANATTEDTPPTGAFHIYDV 1059
Query: 1136 GRNADNPQNLVT-----EVYSKELKGAISALASLQGHLLIASGPKIILHK-WTGTELNGI 1189
P T E++ +E+ G +S + + G +I+ K+++ + +
Sbjct: 1060 IEVVPEPGKPDTNYKLKEIFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPV 1119
Query: 1190 AFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLI 1249
AF D P ++V N +++GD + F+ + + ++ L + + EFL+
Sbjct: 1120 AFLDIP-VFVTDSKSFGNLLIIGDAMQGFQFIGFDAEPYRMISLGRSMSKFQTMSLEFLV 1178
Query: 1250 DGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD 1309
+G + +D +N+ + YAP S GQ+L+ + F + H T ML ++
Sbjct: 1179 NGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLVHCSSFTL--HSTN--SCMMLLPRNE 1234
Query: 1310 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRS 1369
G S + F + G +DGS+ I PL E +RRL +Q++++D + GLNPR
Sbjct: 1235 EFG----SPQVPSFQNVGGQVDGSVFKIVPLSEEKYRRLYVIQQQIIDRELQLGGLNPRM 1290
Query: 1370 FR---QFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
R F+ G + RP ++D ++ + L ++ + IA + G
Sbjct: 1291 ERLANDFYQMGHSMRP----MLDFNVIRRFCGLAIDRRKSIAQKAG 1332
Score = 91 (37.1 bits), Expect = 1.5e-28, Sum P(4) = 1.5e-28
Identities = 35/157 (22%), Positives = 69/157 (43%)
Query: 244 KHVKDFIFVHGYIEPVMVILHERELTWAGR--VSWKHHTCMISALSI----STTLKQHPL 297
K++ D F+ + +P + +L++ +L WAG +S +I L+I S T +
Sbjct: 211 KNIIDIQFLKNFTKPTIALLYQPKLVWAGNTTISKLPTQYVILTLNIQPAESATKIESTT 270
Query: 298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYA-VSLDSSQE 354
I LP D + ++ V + G ++VG N + + + + LN++A L ++
Sbjct: 271 IAFVKELPWDLHTIVPVSN---GAIIVGTNELAFLDNTGVLQSTVLLNSFADKELQKTKI 327
Query: 355 LPRSSFSVELDAAHAT--WLQNDVALLSTKTGDLVLL 389
+ SS + + T W+ + + D LL
Sbjct: 328 INNSSLEIMFREKNTTSIWIPSSKSKNGGSNNDETLL 364
Score = 85 (35.0 bits), Expect = 1.5e-28, Sum P(4) = 1.5e-28
Identities = 70/331 (21%), Positives = 130/331 (39%)
Query: 374 NDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK---TNPSVLTSDITTIGNSLFFLGSRLG 430
++ LL ++ + + +GR++ + D+ K N + + L S
Sbjct: 360 DETLLLMDLKSNIYYIQMEAEGRLLIKFDIFKLPIVNDLLKENSNPKCITRLNATNSNKN 419
Query: 431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPS--TKRLRRSSSDALQDM--VNGEELSL 486
L + F G+ + + LK EA PS T L + D ++M + +E
Sbjct: 420 MDLFIGFGSGNALVLRLNNLKSTIETREAHNPSSGTNSLMDINDDDDEEMDDLYADEAPE 479
Query: 487 YGSASNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINADASATGISK--QSNYEL 541
G +N+++ +T F + SL N+GP+ + G + D G+ ++ Y L
Sbjct: 480 NGLTTNDSKGTVETVQPFDIELLSSLRNVGPITSLTVGKVSSIDDVVKGLPNPNKNEYSL 539
Query: 542 VELPGC-KGIW-TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL--- 596
V G G TV S + + + + ++ + ++ R L T D
Sbjct: 540 VATSGNGSGSHLTVIQTSVQPEIELALKFISITQIWN----LKIKGRDRYLITTDSTKSR 595
Query: 597 TEVTESVDYFVQGRTIAAGNLFGRRRV----IQVFERGARILDGSYMTQDLS-FGPXXXX 651
+++ ES + F + G L RR I +F RI+ + T L +
Sbjct: 596 SDIYESDNNF---KLHKGGRL--RRDATTVYISMFGEEKRIIQVT--TNHLYLYDTHFRR 648
Query: 652 XXXXXXXXTVLSVSIADPYVLLGMSDGSIRL 682
V+ VS+ DPY+L+ +S G I++
Sbjct: 649 LTTIKFDYEVIHVSVMDPYILVTVSRGDIKI 679
Score = 63 (27.2 bits), Expect = 1.5e-28, Sum P(4) = 1.5e-28
Identities = 20/91 (21%), Positives = 41/91 (45%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L L ++ HG + + ++ Q + S ++L AKIS+L+F+ + + S+H
Sbjct: 48 LYLTDEFKFHGLITDIGLIPQKDSPLS----CLLLCTGVAKISILKFNTLTNSIDTLSLH 103
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGRC 191
+E + A+ +++DP C
Sbjct: 104 YYEGK---FKGKSLVELAKISTLRMDPGSSC 131
Score = 41 (19.5 bits), Expect = 3.0e-18, Sum P(2) = 3.0e-18
Identities = 11/33 (33%), Positives = 19/33 (57%)
Query: 37 IQT-EELDSELPSK-RGIGPVPNLVVTAANVIE 67
++T + D EL S R +GP+ +L V + I+
Sbjct: 491 VETVQPFDIELLSSLRNVGPITSLTVGKVSSID 523
>TAIR|locus:2115909 [details] [associations]
symbol:DDB1A "damaged DNA binding protein 1A"
species:3702 "Arabidopsis thaliana" [GO:0003676 "nucleic acid
binding" evidence=IEA] [GO:0003677 "DNA binding" evidence=ISS]
[GO:0005634 "nucleus" evidence=ISM;IEA;IDA;IPI] [GO:0010100
"negative regulation of photomorphogenesis" evidence=IGI;RCA]
[GO:0045892 "negative regulation of transcription, DNA-dependent"
evidence=IPI] [GO:0005515 "protein binding" evidence=IPI]
[GO:0080008 "Cul4-RING ubiquitin ligase complex" evidence=IPI]
[GO:0005829 "cytosol" evidence=IDA] [GO:0000278 "mitotic cell
cycle" evidence=RCA] [GO:0000911 "cytokinesis by cell plate
formation" evidence=RCA] [GO:0003002 "regionalization"
evidence=RCA] [GO:0006281 "DNA repair" evidence=RCA] [GO:0006486
"protein glycosylation" evidence=RCA] [GO:0007155 "cell adhesion"
evidence=RCA] [GO:0008284 "positive regulation of cell
proliferation" evidence=RCA] [GO:0009630 "gravitropism"
evidence=RCA] [GO:0009639 "response to red or far red light"
evidence=RCA] [GO:0010090 "trichome morphogenesis" evidence=RCA]
[GO:0033043 "regulation of organelle organization" evidence=RCA]
[GO:0045010 "actin nucleation" evidence=RCA] [GO:0048449 "floral
organ formation" evidence=RCA] [GO:0048608 "reproductive structure
development" evidence=RCA] InterPro:IPR017986 InterPro:IPR004871
InterPro:IPR015943 Pfam:PF03178 UniPathway:UPA00143 GO:GO:0005829
GO:GO:0005634 GO:GO:0045892 EMBL:CP002687 GenomeReviews:CT486007_GR
Gene3D:2.130.10.10 SUPFAM:SSF50978 GO:GO:0003677 GO:GO:0006281
GO:GO:0016567 GO:GO:0009585 EMBL:AL161503 GO:GO:0080008
GO:GO:0010100 EMBL:AY074257 EMBL:BT001905 EMBL:AK230366
IPI:IPI00548104 PIR:B85068 RefSeq:NP_192451.1 UniGene:At.32663
UniGene:At.47587 ProteinModelPortal:Q9M0V3 DIP:DIP-40455N
IntAct:Q9M0V3 STRING:Q9M0V3 PaxDb:Q9M0V3 PRIDE:Q9M0V3 ProMEX:Q9M0V3
EnsemblPlants:AT4G05420.1 GeneID:825890 KEGG:ath:AT4G05420
GeneFarm:4660 TAIR:At4g05420 eggNOG:NOG247734 HOGENOM:HOG000007241
InParanoid:Q9M0V3 KO:K10610 OMA:CALGDGS PhylomeDB:Q9M0V3
ProtClustDB:CLSN2685347 Genevestigator:Q9M0V3 GermOnline:AT4G05420
Uniprot:Q9M0V3
Length = 1088
Score = 222 (83.2 bits), Expect = 8.9e-23, Sum P(4) = 8.9e-23
Identities = 91/353 (25%), Positives = 157/353 (44%)
Query: 1082 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED-VAARGRVLLFSTGRNAD 1140
+T P+ S E ++ L + T++ +GTAYV E+ +GR+L+F D
Sbjct: 758 STYPLDSFEYGCSI----LSCSFTEDKNVYYCVGTAYVLPEENEPTKGRILVFIV---ED 810
Query: 1141 NPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAP 1195
L+ E KE KGA+ +L + G LL A KI L+KW GT EL +
Sbjct: 811 GRLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGH 867
Query: 1196 --PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGST 1253
LYV + +FI++GD+ KSI L +K + + A+D+ + A E L D
Sbjct: 868 ILALYVQTRG---DFIVVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAVEILDDD-- 922
Query: 1254 LSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSD-RTG 1312
+ L + + + + ++ +G +L E+H+G V +F ++ D G
Sbjct: 923 IYLGAENNFNLLTVKKNSEGATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIG 981
Query: 1313 AAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQ 1372
P ++FGT++G IG IA L + + L+ LQ L + V GL+ +R
Sbjct: 982 QIP--------TVIFGTVNGVIGVIASLPQEQYTFLEKLQSSLRKVIKGVGGLSHEQWRS 1033
Query: 1373 FHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
F N + + +D +L+ + L + +I+ ++ + +L
Sbjct: 1034 F--NNEKRTAEARNFLDGDLIESFLDLSRNKMEDISKSMNVQVEELCKRVEEL 1084
Score = 91 (37.1 bits), Expect = 8.9e-23, Sum P(4) = 8.9e-23
Identities = 33/120 (27%), Positives = 55/120 (45%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F + + N+R L+ V D F+ G +P + +L++ +H +
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLFGCAKPTIAVLYQ------DNKDARH----VKT 192
Query: 286 LSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALAL 342
+S LK + WS +L + A L+ VP P+ GVL++G TI Y S SA A+ +
Sbjct: 193 YEVS--LKDKDFVEGPWSQNSLDNGADLLIPVPPPLCGVLIIGEETIVYCSASAFKAIPI 250
Score = 74 (31.1 bits), Expect = 8.9e-23, Sum P(4) = 8.9e-23
Identities = 18/59 (30%), Positives = 31/59 (52%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
LL G + LL + ++ V L + + + S I+ + N++ F+GS GDS LV+
Sbjct: 269 LLGDHAGMIHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVK 327
Score = 71 (30.1 bits), Expect = 8.9e-23, Sum P(4) = 8.9e-23
Identities = 36/133 (27%), Positives = 64/133 (48%)
Query: 513 GPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYD 572
G KD S LR+ + GI++Q++ VEL G KG+W++ KSS D
Sbjct: 372 GAFKDGS--LRVVRNG--IGINEQAS---VELQGIKGMWSL--KSS------------ID 410
Query: 573 DEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG 630
+ + +L++S E R + + D L E TE + Q +T+ + +++QV
Sbjct: 411 EAFDTFLVVSFISETRILAMNLEDELEE-TEIEGFLSQVQTLFCHDAV-YNQLVQVTSNS 468
Query: 631 ARILDGSYMTQDL 643
R++ + T++L
Sbjct: 469 VRLVSST--TREL 479
Score = 45 (20.9 bits), Expect = 5.6e-13, Sum P(2) = 5.6e-13
Identities = 8/18 (44%), Positives = 11/18 (61%)
Query: 1061 VEEYEVRILEPDRAGGPW 1078
V+ YEV + + D GPW
Sbjct: 190 VKTYEVSLKDKDFVEGPW 207
Score = 39 (18.8 bits), Expect = 1.3e-14, Sum P(3) = 1.3e-14
Identities = 17/77 (22%), Positives = 33/77 (42%)
Query: 213 LVGDED-TFGSGGGFSA-RIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTW 270
++G+E + S F A I S +D+ + + H + ++VI HE+E
Sbjct: 231 IIGEETIVYCSASAFKAIPIRPSITKAYGRVDVDGSRYLLGDHAGMIHLLVITHEKEKVT 290
Query: 271 AGRVSWKHHTCMISALS 287
++ T + S +S
Sbjct: 291 GLKIELLGETSIASTIS 307
Score = 37 (18.1 bits), Expect = 1.8e-16, Sum P(3) = 1.8e-16
Identities = 47/176 (26%), Positives = 67/176 (38%)
Query: 341 ALNNYAVS-LDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVY-DGRVV 398
AL Y VS LD + ++S +L AA W V + S +L L+T G ++
Sbjct: 526 ALLEYEVSCLDINPIGDNPNYS-QL-AAVGMWTDISVRIFSLP--ELTLITKEQLGGEII 581
Query: 399 QRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD-- 456
R SVL I +L LGD L+ F + T L K G
Sbjct: 582 PR--------SVLLCAFEGIS----YLLCALGDGHLLNFQMDTTTGQLKDRKKVSLGTQP 629
Query: 457 IEADAPSTKRLRR--SSSDALQDMVNGEELSLYGSASNNTESAQKTF-SFAVRDSL 509
I S+K ++SD + + + LY + + S F S A DSL
Sbjct: 630 ITLRTFSSKSATHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSL 685
>TAIR|locus:2127368 [details] [associations]
symbol:DDB1B "damaged DNA binding protein 1B"
species:3702 "Arabidopsis thaliana" [GO:0003676 "nucleic acid
binding" evidence=IEA] [GO:0005634 "nucleus" evidence=ISM;IEA;IDA]
[GO:0003684 "damaged DNA binding" evidence=ISS] [GO:0009793 "embryo
development ending in seed dormancy" evidence=IMP] [GO:0005515
"protein binding" evidence=IPI] [GO:0005829 "cytosol" evidence=RCA]
[GO:0006281 "DNA repair" evidence=RCA] [GO:0007062 "sister
chromatid cohesion" evidence=RCA] [GO:0009880 "embryonic pattern
specification" evidence=RCA] [GO:0010072 "primary shoot apical
meristem specification" evidence=RCA] [GO:0010100 "negative
regulation of photomorphogenesis" evidence=RCA] [GO:0010162 "seed
dormancy process" evidence=RCA] [GO:0010431 "seed maturation"
evidence=RCA] [GO:0010564 "regulation of cell cycle process"
evidence=RCA] [GO:0045595 "regulation of cell differentiation"
evidence=RCA] [GO:0048366 "leaf development" evidence=RCA]
[GO:0048608 "reproductive structure development" evidence=RCA]
[GO:0048825 "cotyledon development" evidence=RCA] [GO:0051301 "cell
division" evidence=RCA] InterPro:IPR017986 InterPro:IPR004871
InterPro:IPR015943 Pfam:PF03178 UniPathway:UPA00143 GO:GO:0005634
EMBL:CP002687 GenomeReviews:CT486007_GR Gene3D:2.130.10.10
SUPFAM:SSF50978 EMBL:AL161554 GO:GO:0003677 GO:GO:0006281
GO:GO:0009793 GO:GO:0016567 GO:GO:0009585 EMBL:AL021960
UniGene:At.32663 eggNOG:NOG247734 HOGENOM:HOG000007241 KO:K10610
ProtClustDB:CLSN2685347 EMBL:AK220648 EMBL:AK229805 IPI:IPI00536598
PIR:T04941 RefSeq:NP_193842.1 ProteinModelPortal:O49552 SMR:O49552
DIP:DIP-46981N IntAct:O49552 STRING:O49552 PaxDb:O49552
PRIDE:O49552 EnsemblPlants:AT4G21100.1 GeneID:827857
KEGG:ath:AT4G21100 GeneFarm:4661 TAIR:At4g21100 InParanoid:O49552
OMA:DRPAVIY PhylomeDB:O49552 Genevestigator:O49552
GermOnline:AT4G21100 Uniprot:O49552
Length = 1088
Score = 209 (78.6 bits), Expect = 6.7e-22, Sum P(4) = 6.7e-22
Identities = 92/333 (27%), Positives = 150/333 (45%)
Query: 1105 TKENETLLAIGTAYVQGED-VAARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1163
T + +GTAYV E+ +GR+L+F + L+TE KE KGA+ +L +
Sbjct: 777 TDDKNVYYCVGTAYVLPEENEPTKGRILVFIV---EEGRLQLITE---KETKGAVYSLNA 830
Query: 1164 LQGHLLIASGPKIILHKWT----GT-ELNGIAFYDAP--PLYVVSLNIVKNFILLGDIHK 1216
G LL + KI L+KW GT EL + LYV + +FI +GD+ K
Sbjct: 831 FNGKLLASINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRG---DFIAVGDLMK 887
Query: 1217 SIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSE 1276
SI L +K + + A+D+ + A E L D L +D NI + +
Sbjct: 888 SISLLIYKHEEGAIEERARDYNANWMTAVEILNDDIYLG---TDNCFNIFTVKKNNEGAT 944
Query: 1277 SWKGQKLLSRAEFHVGAHVTKFLR--LQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSI 1334
+ ++ E+H+G V +F L M SD G P ++FGT+ G I
Sbjct: 945 DEERARMEVVGEYHIGEFVNRFRHGSLVMKLPDSD-IGQIP--------TVIFGTVSGMI 995
Query: 1335 GCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGK-AHRPG--PDSIVDCE 1391
G IA L + + L+ LQ L + V GL+ +R F++ + A G +++
Sbjct: 996 GVIASLPQEQYAFLEKLQTSLRKVIKGVGGLSHEQWRSFNNEKRTAEAKGYLDGDLIESF 1055
Query: 1392 L-LSHYEMLPLEEQLEIAHQTGTTRSQILSNLN 1423
L LS +M + + +++ + R + L+ L+
Sbjct: 1056 LDLSRGKMEEISKGMDVQVEELCKRVEELTRLH 1088
Score = 100 (40.3 bits), Expect = 6.7e-22, Sum P(4) = 6.7e-22
Identities = 35/117 (29%), Positives = 56/117 (47%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
F + + N+R L+ V D F++G +P + +L++ A V T +S
Sbjct: 144 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCTKPTIAVLYQDNKD-ARHVK----TYEVSL 197
Query: 286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALAL 342
+ P WS NL + A L+ VPSP+ GVL++G TI Y S +A A+ +
Sbjct: 198 KD--KNFVEGP--WSQNNLDNGADLLIPVPSPLCGVLIIGEETIVYCSANAFKAIPI 250
Score = 73 (30.8 bits), Expect = 6.7e-22, Sum P(4) = 6.7e-22
Identities = 17/59 (28%), Positives = 31/59 (52%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
LL G + LL + ++ V L + + + S I+ + N++ F+GS GDS L++
Sbjct: 269 LLGDHAGLIHLLVITHEKEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIK 327
Score = 68 (29.0 bits), Expect = 6.7e-22, Sum P(4) = 6.7e-22
Identities = 36/133 (27%), Positives = 64/133 (48%)
Query: 513 GPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYD 572
G KD S LRI + GI++Q++ VEL G KG+W++ KSS D
Sbjct: 372 GAYKDGS--LRIVRNG--IGINEQAS---VELQGIKGMWSL--KSS------------ID 410
Query: 573 DEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG 630
+ + +L++S E R + + D L E TE + + +T+ + +++QV
Sbjct: 411 EAFDTFLVVSFISETRILAMNIEDELEE-TEIEGFLSEVQTLFCHDAV-YNQLVQVTSNS 468
Query: 631 ARILDGSYMTQDL 643
R++ + T++L
Sbjct: 469 VRLVSST--TREL 479
Score = 42 (19.8 bits), Expect = 2.9e-13, Sum P(3) = 2.9e-13
Identities = 17/77 (22%), Positives = 34/77 (44%)
Query: 213 LVGDED-TFGSGGGFSA-RIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTW 270
++G+E + S F A I S +D+ + + H + ++VI HE+E
Sbjct: 231 IIGEETIVYCSANAFKAIPIRPSITKAYGRVDLDGSRYLLGDHAGLIHLLVITHEKEKVT 290
Query: 271 AGRVSWKHHTCMISALS 287
++ T + S++S
Sbjct: 291 GLKIELLGETSIASSIS 307
Score = 40 (19.1 bits), Expect = 4.4e-11, Sum P(2) = 4.4e-11
Identities = 7/18 (38%), Positives = 11/18 (61%)
Query: 1061 VEEYEVRILEPDRAGGPW 1078
V+ YEV + + + GPW
Sbjct: 190 VKTYEVSLKDKNFVEGPW 207
>ZFIN|ZDB-GENE-040426-1272
symbol:ddb1 "damage specific DNA binding protein
1" species:7955 "Danio rerio" [GO:0005634 "nucleus" evidence=IEA]
[GO:0003676 "nucleic acid binding" evidence=IEA] InterPro:IPR004871
InterPro:IPR015943 Pfam:PF03178 ZFIN:ZDB-GENE-040426-1272
GO:GO:0005634 Gene3D:2.130.10.10 GO:GO:0003676 EMBL:JQ692623
UniGene:Dr.77970 Uniprot:I1XUS8
Length = 1140
Score = 203 (76.5 bits), Expect = 3.8e-20, Sum P(4) = 3.8e-20
Identities = 81/293 (27%), Positives = 129/293 (44%)
Query: 1114 IGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIAS 1172
+GTA V E+ + GR+++F D V E KE+KGA+ ++ G LL +
Sbjct: 831 VGTAMVYPEEAEPKQGRIIVF---HYTDGKLQTVAE---KEVKGAVYSMVEFNGKLLASI 884
Query: 1173 GPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQG 1227
+ L++WT TE N + + LY L +FIL+GD+ +S+ L++K
Sbjct: 885 NSTVRLYEWTAEKELRTECNH--YNNIMALY---LKTKGDFILVGDLMRSVLLLAYKPME 939
Query: 1228 AQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRA 1287
+A+DF A E L D + L ++ N+ + + + Q L
Sbjct: 940 GSFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEERQHLQEVG 996
Query: 1288 EFHVGAHVTKF----LRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1343
FH+G V F L LQ L SS T GS +LFGT++G IG + L E
Sbjct: 997 LFHLGEFVNVFSHGSLVLQNLGESSTPT---QGS-------VLFGTVNGMIGLVTSLSEG 1046
Query: 1344 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
+ L LQ +L + V + +R FH+ K + +D +L+ +
Sbjct: 1047 WYSLLLDLQNRLNKVIKSVGKIEHSFWRSFHTERKTEQA--TGFIDGDLIESF 1097
Score = 116 (45.9 bits), Expect = 3.8e-20, Sum P(4) = 3.8e-20
Identities = 42/164 (25%), Positives = 74/164 (45%)
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A ++ VP P GG +++G +I YH+ A+A + S + +
Sbjct: 207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAVA----PPIIKQSTIVCHN 262
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQR-LDLSKTNPSVLTS 413
+D + +L D+ G L +L + + DG VV + L + + +
Sbjct: 263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGAVVLKDLHVELLGETSIAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
+T + N + F+GSRLGDS LV+ S G+ E F ++
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDSNDQGSYVGVMETFTNL 356
Score = 71 (30.1 bits), Expect = 3.8e-20, Sum P(4) = 3.8e-20
Identities = 26/93 (27%), Positives = 45/93 (48%)
Query: 542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
++LPG KG+W + +SSR D+ DD L++S +T VL + E TE
Sbjct: 402 IDLPGIKGLWPLRSESSR----DT------DD----MLVLSFVGQTRVLMLSGEEVEETE 447
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
+ +T GN+ +++IQ+ R++
Sbjct: 448 LQGFVDNQQTFFCGNV-AHQQLIQITSVSVRLV 479
Score = 44 (20.5 bits), Expect = 3.8e-20, Sum P(4) = 3.8e-20
Identities = 12/40 (30%), Positives = 20/40 (50%)
Query: 984 PSGSTYDNYWPVQ--KVIPLKATPHQITYFAEKNLYPLIV 1021
PS ST V K+ P +PH+ ++ E ++ L+V
Sbjct: 754 PSASTQALSSSVSSSKLFPSSTSPHETSFGEEVEVHSLLV 793
Score = 44 (20.5 bits), Expect = 3.8e-20, Sum P(4) = 3.8e-20
Identities = 12/50 (24%), Positives = 24/50 (48%)
Query: 792 MREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFL 841
+ ++ S+ +S+S T G + +HS+ VV+ + H+ FL
Sbjct: 761 LSSSVSSSKLFPSSTSPHETSFGEEVEVHSLLVVD--QHTFEVLHAHQFL 808
Score = 43 (20.2 bits), Expect = 1.1e-10, Sum P(2) = 1.1e-10
Identities = 8/19 (42%), Positives = 12/19 (63%)
Query: 1061 VEEYEVRILEPDRAGGPWQ 1079
V+ YEV + E + GPW+
Sbjct: 190 VKTYEVSLREKEFNKGPWK 208
Score = 40 (19.1 bits), Expect = 4.4e-19, Sum P(5) = 4.4e-19
Identities = 11/35 (31%), Positives = 19/35 (54%)
Query: 679 SIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY 713
S R+ + D S T +V+ A+ ++ VSS L+
Sbjct: 737 SSRVEMQDASGTTAAVRPSASTQALSSSVSSSKLF 771
Score = 40 (19.1 bits), Expect = 9.4e-09, Sum P(4) = 9.4e-09
Identities = 19/58 (32%), Positives = 24/58 (41%)
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+TT S + L + LGD L F T +LS K G P+ R RS S
Sbjct: 593 MTTFEGSHYLLCA-LGDGALFYFGLDIQTGVLSERKKVTLGT----QPTVLRTFRSLS 645
Score = 39 (18.8 bits), Expect = 1.2e-08, Sum P(4) = 1.2e-08
Identities = 11/36 (30%), Positives = 22/36 (61%)
Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPE 166
DS+ LA ++ +++ D+ I L I ++ +ESP+
Sbjct: 689 DSLALA-NNSTLTIGTIDE-IQKLHIRTVPLYESPK 722
Score = 37 (18.1 bits), Expect = 4.4e-19, Sum P(5) = 4.4e-19
Identities = 8/20 (40%), Positives = 13/20 (65%)
Query: 1042 GHQIDNHNLSSVDLHRTYTV 1061
G +++ H+L VD H T+ V
Sbjct: 783 GEEVEVHSLLVVDQH-TFEV 801
>MGI|MGI:1202384
symbol:Ddb1 "damage specific DNA binding protein 1"
species:10090 "Mus musculus" [GO:0000075 "cell cycle checkpoint"
evidence=ISO] [GO:0003676 "nucleic acid binding" evidence=IEA]
[GO:0003677 "DNA binding" evidence=IEA] [GO:0003684 "damaged DNA
binding" evidence=ISO] [GO:0005634 "nucleus" evidence=ISO]
[GO:0005737 "cytoplasm" evidence=ISO] [GO:0006281 "DNA repair"
evidence=IEA] [GO:0006974 "response to DNA damage stimulus"
evidence=IEA] [GO:0016055 "Wnt receptor signaling pathway"
evidence=IDA] [GO:0031464 "Cul4A-RING ubiquitin ligase complex"
evidence=ISO] [GO:0031465 "Cul4B-RING ubiquitin ligase complex"
evidence=ISO] [GO:0042787 "protein ubiquitination involved in
ubiquitin-dependent protein catabolic process" evidence=ISO]
[GO:0043161 "proteasomal ubiquitin-dependent protein catabolic
process" evidence=ISO] [GO:0080008 "Cul4-RING ubiquitin ligase
complex" evidence=ISO] InterPro:IPR004871 InterPro:IPR015943
Pfam:PF03178 UniPathway:UPA00143 MGI:MGI:1202384 GO:GO:0005634
GO:GO:0005737 GO:GO:0043161 GO:GO:0016055 Gene3D:2.130.10.10
GO:GO:0003684 GO:GO:0006281 GO:GO:0042787 GO:GO:0000075
GO:GO:0031464 GO:GO:0031465 eggNOG:NOG247734 KO:K10610 OMA:CALGDGS
CTD:1642 GeneTree:ENSGT00530000063396 HOVERGEN:HBG005460
HSSP:Q16531 ChiTaRS:DDB1 EMBL:AB026432 EMBL:AF159853 EMBL:AK146522
EMBL:AK152228 EMBL:AK154303 EMBL:AK155020 EMBL:AK155920
EMBL:AK157491 EMBL:BC002210 EMBL:BC009661 IPI:IPI00316740
PIR:JC7152 RefSeq:NP_056550.1 UniGene:Mm.289915 UniGene:Mm.466856
ProteinModelPortal:Q3U1J4 SMR:Q3U1J4 IntAct:Q3U1J4 STRING:Q3U1J4
PaxDb:Q3U1J4 PRIDE:Q3U1J4 Ensembl:ENSMUST00000025649 GeneID:13194
KEGG:mmu:13194 UCSC:uc008gqm.1 InParanoid:Q3U1J4 NextBio:283320
Bgee:Q3U1J4 CleanEx:MM_DDB1 Genevestigator:Q3U1J4 Uniprot:Q3U1J4
Length = 1140
Score = 208 (78.3 bits), Expect = 2.6e-19, Sum P(5) = 2.6e-19
Identities = 78/297 (26%), Positives = 133/297 (44%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNH--YNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1339
Q L FH+G V F ++ + G A S T ++LFGT++G IG +
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQN---LGEA--STPTQG-SVLFGTVNGMIGLVTS 1042
Query: 1340 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
L E + L +Q +L + V + +R FH+ K P +D +L+ +
Sbjct: 1043 LSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTE-PAT-GFIDGDLIESF 1097
Score = 113 (44.8 bits), Expect = 2.6e-19, Sum P(5) = 2.6e-19
Identities = 54/208 (25%), Positives = 94/208 (45%)
Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ-H 295
N+R L+ HV D F++G P + +++ GR H + +S K+ +
Sbjct: 156 NIR-LEELHVIDVKFLYGCQAPTICFVYQDP---QGR----H----VKTYEVSLREKEFN 203
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--LNNYAVSLDSSQ 353
W N+ +A ++AVP P GG +++G +I YH+ A+A + + + ++
Sbjct: 204 KGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNR 263
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
P S + D ++ LL K + + D RV L L +T+ +
Sbjct: 264 VDPNGSRYLLGDMEGRLFM-----LLLEKEEQMDGTVTLKDLRV--EL-LGETS---IAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGS 441
+T + N + F+GSRLGDS LV+ S
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDS 340
Score = 68 (29.0 bits), Expect = 2.6e-19, Sum P(5) = 2.6e-19
Identities = 23/93 (24%), Positives = 42/93 (45%)
Query: 542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
++LPG KG+W + +S G D + L++S +T VL E TE
Sbjct: 402 IDLPGIKGLWPL--RSDPGRETDDT------------LVLSFVGQTRVLMLNGEEVEETE 447
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
+ + +T GN+ +++IQ+ R++
Sbjct: 448 LMGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 479
Score = 47 (21.6 bits), Expect = 8.6e-13, Sum P(5) = 8.6e-13
Identities = 42/188 (22%), Positives = 69/188 (36%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVH----G-YIEPVMVILHERELTWAGRVSWKHHT 280
F+ R+E HVI+++ L FV+ G +++ V L E+E + WK
Sbjct: 155 FNIRLEELHVIDVKFLYGCQAPTICFVYQDPQGRHVKTYEVSLREKEFN---KGPWKQEN 211
Query: 281 CMISA---LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI--GGVLV----VGANTIHY 331
A +++ +I H+ K LA+ PI +V V N Y
Sbjct: 212 VEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRY 271
Query: 332 --HSQSASCALALNNYAVSLDSSQELP--RSSFSVELDAAHA-TWLQNDVALLSTKTGDL 386
+ L +D + L R E A T+L N V + ++ GD
Sbjct: 272 LLGDMEGRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDS 331
Query: 387 VLLTVVYD 394
L+ + D
Sbjct: 332 QLVKLNVD 339
Score = 43 (20.2 bits), Expect = 3.1e-11, Sum P(2) = 3.1e-11
Identities = 8/19 (42%), Positives = 12/19 (63%)
Query: 1061 VEEYEVRILEPDRAGGPWQ 1079
V+ YEV + E + GPW+
Sbjct: 190 VKTYEVSLREKEFNKGPWK 208
Score = 43 (20.2 bits), Expect = 2.6e-10, Sum P(5) = 2.6e-10
Identities = 19/58 (32%), Positives = 25/58 (43%)
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+TT +S + L + LGD L F T +LS K G P+ R RS S
Sbjct: 593 MTTFESSHYLLCA-LGDGALFYFGLNIETGLLSDRKKVTLGT----QPTVLRTFRSLS 645
Score = 41 (19.5 bits), Expect = 2.6e-19, Sum P(5) = 2.6e-19
Identities = 8/20 (40%), Positives = 13/20 (65%)
Query: 1042 GHQIDNHNLSSVDLHRTYTV 1061
G +++ HNL +D H T+ V
Sbjct: 783 GEEVEVHNLLIIDQH-TFEV 801
Score = 39 (18.8 bits), Expect = 2.6e-19, Sum P(5) = 2.6e-19
Identities = 11/35 (31%), Positives = 19/35 (54%)
Query: 679 SIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY 713
S R+ V D S T +++ A+ ++ VSS L+
Sbjct: 737 SSRIEVQDSSGGTTALRPSASTQALSSSVSSSKLF 771
>UNIPROTKB|A1A4K3
symbol:DDB1 "DNA damage-binding protein 1" species:9913
"Bos taurus" [GO:0080008 "Cul4-RING ubiquitin ligase complex"
evidence=ISS] [GO:0031464 "Cul4A-RING ubiquitin ligase complex"
evidence=ISS] [GO:0043161 "proteasomal ubiquitin-dependent protein
catabolic process" evidence=ISS] [GO:0031465 "Cul4B-RING ubiquitin
ligase complex" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISS]
[GO:0005634 "nucleus" evidence=ISS] [GO:0042787 "protein
ubiquitination involved in ubiquitin-dependent protein catabolic
process" evidence=IEA] [GO:0016055 "Wnt receptor signaling pathway"
evidence=IEA] [GO:0000075 "cell cycle checkpoint" evidence=IEA]
[GO:0006281 "DNA repair" evidence=IEA] [GO:0003677 "DNA binding"
evidence=IEA] InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178
UniPathway:UPA00143 GO:GO:0005634 GO:GO:0005737 GO:GO:0043161
GO:GO:0016055 Gene3D:2.130.10.10 GO:GO:0003677 GO:GO:0006281
GO:GO:0042787 GO:GO:0000075 GO:GO:0031464 GO:GO:0031465
eggNOG:NOG247734 HOGENOM:HOG000007241 KO:K10610 OMA:CALGDGS
EMBL:BC126629 IPI:IPI00713891 RefSeq:NP_001073731.1
UniGene:Bt.62917 STRING:A1A4K3 PRIDE:A1A4K3
Ensembl:ENSBTAT00000028740 GeneID:511951 KEGG:bta:511951 CTD:1642
GeneTree:ENSGT00530000063396 HOVERGEN:HBG005460 InParanoid:A1A4K3
OrthoDB:EOG4KPT91 NextBio:20870176 Uniprot:A1A4K3
Length = 1140
Score = 210 (79.0 bits), Expect = 3.2e-19, Sum P(5) = 3.2e-19
Identities = 76/297 (25%), Positives = 133/297 (44%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNH--YNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1339
Q L FH+G V F ++ + T + P + ++LFGT++G IG +
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGET-STP-----TQGSVLFGTVNGMIGLVTS 1042
Query: 1340 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
L E + L +Q +L + V + +R FH+ K P +D +L+ +
Sbjct: 1043 LSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTE-PAT-GFIDGDLIESF 1097
Score = 113 (44.8 bits), Expect = 3.2e-19, Sum P(5) = 3.2e-19
Identities = 54/208 (25%), Positives = 94/208 (45%)
Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ-H 295
N+R L+ HV D F++G P + +++ GR H + +S K+ +
Sbjct: 156 NIR-LEELHVIDVKFLYGCQAPTICFVYQDP---QGR----H----VKTYEVSLREKEFN 203
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--LNNYAVSLDSSQ 353
W N+ +A ++AVP P GG +++G +I YH+ A+A + + + ++
Sbjct: 204 KGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNR 263
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
P S + D ++ LL K + + D RV L L +T+ +
Sbjct: 264 VDPNGSRYLLGDMEGRLFM-----LLLEKEEQMDGTVTLKDLRV--EL-LGETS---IAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGS 441
+T + N + F+GSRLGDS LV+ S
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDS 340
Score = 65 (27.9 bits), Expect = 3.2e-19, Sum P(5) = 3.2e-19
Identities = 24/93 (25%), Positives = 43/93 (46%)
Query: 542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
++LPG KG+W + +D +R DD L++S +T VL E TE
Sbjct: 402 IDLPGIKGLWPL--------RSDPNRET--DDT----LVLSFVGQTRVLMLNGEEVEETE 447
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
+ + +T GN+ +++IQ+ R++
Sbjct: 448 LMGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 479
Score = 47 (21.6 bits), Expect = 1.1e-12, Sum P(5) = 1.1e-12
Identities = 42/188 (22%), Positives = 69/188 (36%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVH----G-YIEPVMVILHERELTWAGRVSWKHHT 280
F+ R+E HVI+++ L FV+ G +++ V L E+E + WK
Sbjct: 155 FNIRLEELHVIDVKFLYGCQAPTICFVYQDPQGRHVKTYEVSLREKEFN---KGPWKQEN 211
Query: 281 CMISA---LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI--GGVLV----VGANTIHY 331
A +++ +I H+ K LA+ PI +V V N Y
Sbjct: 212 VEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRY 271
Query: 332 --HSQSASCALALNNYAVSLDSSQELP--RSSFSVELDAAHA-TWLQNDVALLSTKTGDL 386
+ L +D + L R E A T+L N V + ++ GD
Sbjct: 272 LLGDMEGRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDS 331
Query: 387 VLLTVVYD 394
L+ + D
Sbjct: 332 QLVKLNVD 339
Score = 43 (20.2 bits), Expect = 1.9e-11, Sum P(2) = 1.9e-11
Identities = 8/19 (42%), Positives = 12/19 (63%)
Query: 1061 VEEYEVRILEPDRAGGPWQ 1079
V+ YEV + E + GPW+
Sbjct: 190 VKTYEVSLREKEFNKGPWK 208
Score = 43 (20.2 bits), Expect = 1.6e-10, Sum P(5) = 1.6e-10
Identities = 19/58 (32%), Positives = 25/58 (43%)
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+TT +S + L + LGD L F T +LS K G P+ R RS S
Sbjct: 593 MTTFESSHYLLCA-LGDGALFYFGLNIETGLLSDRKKVTLGT----QPTVLRTFRSLS 645
Score = 41 (19.5 bits), Expect = 3.2e-19, Sum P(5) = 3.2e-19
Identities = 8/20 (40%), Positives = 13/20 (65%)
Query: 1042 GHQIDNHNLSSVDLHRTYTV 1061
G +++ HNL +D H T+ V
Sbjct: 783 GEEVEVHNLLIIDQH-TFEV 801
Score = 39 (18.8 bits), Expect = 3.2e-19, Sum P(5) = 3.2e-19
Identities = 11/35 (31%), Positives = 19/35 (54%)
Query: 679 SIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY 713
S R+ V D S T +++ A+ ++ VSS L+
Sbjct: 737 SSRIEVQDTSGGTTALRPSASTQALSSSVSSSKLF 771
>UNIPROTKB|E2R9E3
symbol:DDB1 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0043161 "proteasomal ubiquitin-dependent
protein catabolic process" evidence=IEA] [GO:0042787 "protein
ubiquitination involved in ubiquitin-dependent protein catabolic
process" evidence=IEA] [GO:0031465 "Cul4B-RING ubiquitin ligase
complex" evidence=IEA] [GO:0031464 "Cul4A-RING ubiquitin ligase
complex" evidence=IEA] [GO:0016055 "Wnt receptor signaling pathway"
evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0000075
"cell cycle checkpoint" evidence=IEA] [GO:0005634 "nucleus"
evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178 GO:GO:0005634
GO:GO:0005737 GO:GO:0043161 GO:GO:0016055 Gene3D:2.130.10.10
GO:GO:0003676 GO:GO:0042787 GO:GO:0000075 GO:GO:0031464
GO:GO:0031465 KO:K10610 OMA:CALGDGS CTD:1642
GeneTree:ENSGT00530000063396 EMBL:AAEX03011677 RefSeq:XP_533275.2
Ensembl:ENSCAFT00000025824 GeneID:476067 KEGG:cfa:476067
NextBio:20851798 Uniprot:E2R9E3
Length = 1140
Score = 210 (79.0 bits), Expect = 3.2e-19, Sum P(5) = 3.2e-19
Identities = 76/297 (25%), Positives = 133/297 (44%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNH--YNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1339
Q L FH+G V F ++ + T + P + ++LFGT++G IG +
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGET-STP-----TQGSVLFGTVNGMIGLVTS 1042
Query: 1340 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
L E + L +Q +L + V + +R FH+ K P +D +L+ +
Sbjct: 1043 LSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTE-PAT-GFIDGDLIESF 1097
Score = 113 (44.8 bits), Expect = 3.2e-19, Sum P(5) = 3.2e-19
Identities = 54/208 (25%), Positives = 94/208 (45%)
Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ-H 295
N+R L+ HV D F++G P + +++ GR H + +S K+ +
Sbjct: 156 NIR-LEELHVIDVKFLYGCQAPTICFVYQDP---QGR----H----VKTYEVSLREKEFN 203
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--LNNYAVSLDSSQ 353
W N+ +A ++AVP P GG +++G +I YH+ A+A + + + ++
Sbjct: 204 KGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNR 263
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
P S + D ++ LL K + + D RV L L +T+ +
Sbjct: 264 VDPNGSRYLLGDMEGRLFM-----LLLEKEEQMDGTVTLKDLRV--EL-LGETS---IAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGS 441
+T + N + F+GSRLGDS LV+ S
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDS 340
Score = 65 (27.9 bits), Expect = 3.2e-19, Sum P(5) = 3.2e-19
Identities = 24/93 (25%), Positives = 43/93 (46%)
Query: 542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
++LPG KG+W + +D +R DD L++S +T VL E TE
Sbjct: 402 IDLPGIKGLWPL--------RSDPNRET--DDT----LVLSFVGQTRVLMLNGEEVEETE 447
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
+ + +T GN+ +++IQ+ R++
Sbjct: 448 LMGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 479
Score = 47 (21.6 bits), Expect = 1.1e-12, Sum P(5) = 1.1e-12
Identities = 42/188 (22%), Positives = 69/188 (36%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVH----G-YIEPVMVILHERELTWAGRVSWKHHT 280
F+ R+E HVI+++ L FV+ G +++ V L E+E + WK
Sbjct: 155 FNIRLEELHVIDVKFLYGCQAPTICFVYQDPQGRHVKTYEVSLREKEFN---KGPWKQEN 211
Query: 281 CMISA---LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI--GGVLV----VGANTIHY 331
A +++ +I H+ K LA+ PI +V V N Y
Sbjct: 212 VEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRY 271
Query: 332 --HSQSASCALALNNYAVSLDSSQELP--RSSFSVELDAAHA-TWLQNDVALLSTKTGDL 386
+ L +D + L R E A T+L N V + ++ GD
Sbjct: 272 LLGDMEGRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDS 331
Query: 387 VLLTVVYD 394
L+ + D
Sbjct: 332 QLVKLNVD 339
Score = 43 (20.2 bits), Expect = 1.9e-11, Sum P(2) = 1.9e-11
Identities = 8/19 (42%), Positives = 12/19 (63%)
Query: 1061 VEEYEVRILEPDRAGGPWQ 1079
V+ YEV + E + GPW+
Sbjct: 190 VKTYEVSLREKEFNKGPWK 208
Score = 43 (20.2 bits), Expect = 1.6e-10, Sum P(5) = 1.6e-10
Identities = 19/58 (32%), Positives = 25/58 (43%)
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+TT +S + L + LGD L F T +LS K G P+ R RS S
Sbjct: 593 MTTFESSHYLLCA-LGDGALFYFGLNIETGLLSDRKKVTLGT----QPTVLRTFRSLS 645
Score = 41 (19.5 bits), Expect = 3.2e-19, Sum P(5) = 3.2e-19
Identities = 8/20 (40%), Positives = 13/20 (65%)
Query: 1042 GHQIDNHNLSSVDLHRTYTV 1061
G +++ HNL +D H T+ V
Sbjct: 783 GEEVEVHNLLIIDQH-TFEV 801
Score = 39 (18.8 bits), Expect = 3.2e-19, Sum P(5) = 3.2e-19
Identities = 11/35 (31%), Positives = 19/35 (54%)
Query: 679 SIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY 713
S R+ V D S T +++ A+ ++ VSS L+
Sbjct: 737 SSRIEVQDTSGGTTALRPSASTQALSSSVSSSKLF 771
>UNIPROTKB|Q16531
symbol:DDB1 "DNA damage-binding protein 1" species:9606
"Homo sapiens" [GO:0019048 "virus-host interaction" evidence=IEA]
[GO:0016055 "Wnt receptor signaling pathway" evidence=IEA]
[GO:0005515 "protein binding" evidence=IPI] [GO:0005737 "cytoplasm"
evidence=IDA] [GO:0000075 "cell cycle checkpoint" evidence=IMP]
[GO:0005634 "nucleus" evidence=IDA] [GO:0042787 "protein
ubiquitination involved in ubiquitin-dependent protein catabolic
process" evidence=IDA] [GO:0031464 "Cul4A-RING ubiquitin ligase
complex" evidence=IDA] [GO:0031465 "Cul4B-RING ubiquitin ligase
complex" evidence=IDA] [GO:0043161 "proteasomal ubiquitin-dependent
protein catabolic process" evidence=IMP] [GO:0080008 "Cul4-RING
ubiquitin ligase complex" evidence=IDA] [GO:0003677 "DNA binding"
evidence=TAS] [GO:0003684 "damaged DNA binding" evidence=TAS]
[GO:0000718 "nucleotide-excision repair, DNA damage removal"
evidence=TAS] [GO:0005654 "nucleoplasm" evidence=TAS] [GO:0006281
"DNA repair" evidence=TAS] [GO:0006289 "nucleotide-excision repair"
evidence=TAS] Reactome:REACT_216 InterPro:IPR004871
InterPro:IPR015943 Pfam:PF03178 UniPathway:UPA00143 EMBL:U32986
GO:GO:0005737 GO:GO:0019048 GO:GO:0005654 GO:GO:0043161
GO:GO:0016055 Gene3D:2.130.10.10 GO:GO:0003684 EMBL:CH471076
GO:GO:0042787 GO:GO:0000075 GO:GO:0000718 EMBL:AP003108
GO:GO:0031464 PDB:2HYE PDB:4A0K PDBsum:2HYE PDBsum:4A0K PDB:4A0L
PDBsum:4A0L GO:GO:0031465 PDB:3I7P PDBsum:3I7P PDB:3I8C PDBsum:3I8C
PDB:3I89 PDBsum:3I89 PDB:3I7O PDBsum:3I7O PDB:3I8E PDBsum:3I8E
eggNOG:NOG247734 HOGENOM:HOG000007241 KO:K10610 OMA:CALGDGS
CTD:1642 HOVERGEN:HBG005460 OrthoDB:EOG4KPT91 EMBL:U18299
EMBL:L40326 EMBL:AJ002955 EMBL:AK312436 EMBL:AY960579 EMBL:BC011686
EMBL:BC050530 EMBL:BC051764 IPI:IPI00293464 PIR:I38908
RefSeq:NP_001914.3 UniGene:Hs.290758 PDB:2B5L PDB:2B5M PDB:2B5N
PDB:3E0C PDB:3EI1 PDB:3EI2 PDB:3EI3 PDB:3EI4 PDB:3I7H PDB:3I7K
PDB:3I7L PDB:3I7N PDB:4A08 PDB:4A09 PDB:4A0A PDB:4A0B PDB:4A11
PDB:4E54 PDB:4E5Z PDBsum:2B5L PDBsum:2B5M PDBsum:2B5N PDBsum:3E0C
PDBsum:3EI1 PDBsum:3EI2 PDBsum:3EI3 PDBsum:3EI4 PDBsum:3I7H
PDBsum:3I7K PDBsum:3I7L PDBsum:3I7N PDBsum:4A08 PDBsum:4A09
PDBsum:4A0A PDBsum:4A0B PDBsum:4A11 PDBsum:4E54 PDBsum:4E5Z
ProteinModelPortal:Q16531 SMR:Q16531 DIP:DIP-430N IntAct:Q16531
MINT:MINT-1134697 STRING:Q16531 PhosphoSite:Q16531 PaxDb:Q16531
PRIDE:Q16531 Ensembl:ENST00000301764 GeneID:1642 KEGG:hsa:1642
UCSC:uc001nrc.4 GeneCards:GC11M061066 H-InvDB:HIX0171380
HGNC:HGNC:2717 HPA:CAB032821 MIM:600045 neXtProt:NX_Q16531
PharmGKB:PA27187 InParanoid:Q16531 ChiTaRS:DDB1
EvolutionaryTrace:Q16531 GenomeRNAi:1642 NextBio:6750
ArrayExpress:Q16531 Bgee:Q16531 CleanEx:HS_DDB1
Genevestigator:Q16531 GermOnline:ENSG00000167986 Uniprot:Q16531
Length = 1140
Score = 210 (79.0 bits), Expect = 3.2e-19, Sum P(5) = 3.2e-19
Identities = 76/297 (25%), Positives = 133/297 (44%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNH--YNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1339
Q L FH+G V F ++ + T + P + ++LFGT++G IG +
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGET-STP-----TQGSVLFGTVNGMIGLVTS 1042
Query: 1340 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
L E + L +Q +L + V + +R FH+ K P +D +L+ +
Sbjct: 1043 LSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTE-PAT-GFIDGDLIESF 1097
Score = 113 (44.8 bits), Expect = 3.2e-19, Sum P(5) = 3.2e-19
Identities = 54/208 (25%), Positives = 94/208 (45%)
Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ-H 295
N+R L+ HV D F++G P + +++ GR H + +S K+ +
Sbjct: 156 NIR-LEELHVIDVKFLYGCQAPTICFVYQDP---QGR----H----VKTYEVSLREKEFN 203
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--LNNYAVSLDSSQ 353
W N+ +A ++AVP P GG +++G +I YH+ A+A + + + ++
Sbjct: 204 KGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNR 263
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
P S + D ++ LL K + + D RV L L +T+ +
Sbjct: 264 VDPNGSRYLLGDMEGRLFM-----LLLEKEEQMDGTVTLKDLRV--EL-LGETS---IAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGS 441
+T + N + F+GSRLGDS LV+ S
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDS 340
Score = 65 (27.9 bits), Expect = 3.2e-19, Sum P(5) = 3.2e-19
Identities = 24/93 (25%), Positives = 43/93 (46%)
Query: 542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
++LPG KG+W + +D +R DD L++S +T VL E TE
Sbjct: 402 IDLPGIKGLWPL--------RSDPNRET--DDT----LVLSFVGQTRVLMLNGEEVEETE 447
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
+ + +T GN+ +++IQ+ R++
Sbjct: 448 LMGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 479
Score = 47 (21.6 bits), Expect = 1.1e-12, Sum P(5) = 1.1e-12
Identities = 42/188 (22%), Positives = 69/188 (36%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVH----G-YIEPVMVILHERELTWAGRVSWKHHT 280
F+ R+E HVI+++ L FV+ G +++ V L E+E + WK
Sbjct: 155 FNIRLEELHVIDVKFLYGCQAPTICFVYQDPQGRHVKTYEVSLREKEFN---KGPWKQEN 211
Query: 281 CMISA---LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI--GGVLV----VGANTIHY 331
A +++ +I H+ K LA+ PI +V V N Y
Sbjct: 212 VEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRY 271
Query: 332 --HSQSASCALALNNYAVSLDSSQELP--RSSFSVELDAAHA-TWLQNDVALLSTKTGDL 386
+ L +D + L R E A T+L N V + ++ GD
Sbjct: 272 LLGDMEGRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDS 331
Query: 387 VLLTVVYD 394
L+ + D
Sbjct: 332 QLVKLNVD 339
Score = 43 (20.2 bits), Expect = 1.9e-11, Sum P(2) = 1.9e-11
Identities = 8/19 (42%), Positives = 12/19 (63%)
Query: 1061 VEEYEVRILEPDRAGGPWQ 1079
V+ YEV + E + GPW+
Sbjct: 190 VKTYEVSLREKEFNKGPWK 208
Score = 43 (20.2 bits), Expect = 1.6e-10, Sum P(5) = 1.6e-10
Identities = 19/58 (32%), Positives = 25/58 (43%)
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+TT +S + L + LGD L F T +LS K G P+ R RS S
Sbjct: 593 MTTFESSHYLLCA-LGDGALFYFGLNIETGLLSDRKKVTLGT----QPTVLRTFRSLS 645
Score = 41 (19.5 bits), Expect = 3.2e-19, Sum P(5) = 3.2e-19
Identities = 8/20 (40%), Positives = 13/20 (65%)
Query: 1042 GHQIDNHNLSSVDLHRTYTV 1061
G +++ HNL +D H T+ V
Sbjct: 783 GEEVEVHNLLIIDQH-TFEV 801
Score = 39 (18.8 bits), Expect = 3.2e-19, Sum P(5) = 3.2e-19
Identities = 11/35 (31%), Positives = 19/35 (54%)
Query: 679 SIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY 713
S R+ V D S T +++ A+ ++ VSS L+
Sbjct: 737 SSRIEVQDTSGGTTALRPSASTQALSSSVSSSKLF 771
>UNIPROTKB|F1RIE2
symbol:DDB1 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0043161 "proteasomal ubiquitin-dependent protein
catabolic process" evidence=IEA] [GO:0042787 "protein
ubiquitination involved in ubiquitin-dependent protein catabolic
process" evidence=IEA] [GO:0031465 "Cul4B-RING ubiquitin ligase
complex" evidence=IEA] [GO:0031464 "Cul4A-RING ubiquitin ligase
complex" evidence=IEA] [GO:0016055 "Wnt receptor signaling pathway"
evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0000075
"cell cycle checkpoint" evidence=IEA] [GO:0005634 "nucleus"
evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178 GO:GO:0005634
GO:GO:0005737 GO:GO:0043161 GO:GO:0016055 Gene3D:2.130.10.10
GO:GO:0003676 GO:GO:0042787 GO:GO:0000075 GO:GO:0031464
GO:GO:0031465 KO:K10610 OMA:CALGDGS GeneTree:ENSGT00530000063396
EMBL:CU462918 RefSeq:XP_003122699.1 Ensembl:ENSSSCT00000014314
GeneID:100522239 KEGG:ssc:100522239 Uniprot:F1RIE2
Length = 1140
Score = 210 (79.0 bits), Expect = 3.2e-19, Sum P(5) = 3.2e-19
Identities = 76/297 (25%), Positives = 133/297 (44%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNH--YNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1339
Q L FH+G V F ++ + T + P + ++LFGT++G IG +
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGET-STP-----TQGSVLFGTVNGMIGLVTS 1042
Query: 1340 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
L E + L +Q +L + V + +R FH+ K P +D +L+ +
Sbjct: 1043 LSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTE-PAT-GFIDGDLIESF 1097
Score = 113 (44.8 bits), Expect = 3.2e-19, Sum P(5) = 3.2e-19
Identities = 54/208 (25%), Positives = 94/208 (45%)
Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ-H 295
N+R L+ HV D F++G P + +++ GR H + +S K+ +
Sbjct: 156 NIR-LEELHVIDVKFLYGCQAPTICFVYQDP---QGR----H----VKTYEVSLREKEFN 203
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--LNNYAVSLDSSQ 353
W N+ +A ++AVP P GG +++G +I YH+ A+A + + + ++
Sbjct: 204 KGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNR 263
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
P S + D ++ LL K + + D RV L L +T+ +
Sbjct: 264 VDPNGSRYLLGDMEGRLFM-----LLLEKEEQMDGTVTLKDLRV--EL-LGETS---IAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGS 441
+T + N + F+GSRLGDS LV+ S
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDS 340
Score = 65 (27.9 bits), Expect = 3.2e-19, Sum P(5) = 3.2e-19
Identities = 24/93 (25%), Positives = 43/93 (46%)
Query: 542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
++LPG KG+W + +D +R DD L++S +T VL E TE
Sbjct: 402 IDLPGIKGLWPL--------RSDPNRET--DDT----LVLSFVGQTRVLMLNGEEVEETE 447
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
+ + +T GN+ +++IQ+ R++
Sbjct: 448 LMGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 479
Score = 47 (21.6 bits), Expect = 1.1e-12, Sum P(5) = 1.1e-12
Identities = 42/188 (22%), Positives = 69/188 (36%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVH----G-YIEPVMVILHERELTWAGRVSWKHHT 280
F+ R+E HVI+++ L FV+ G +++ V L E+E + WK
Sbjct: 155 FNIRLEELHVIDVKFLYGCQAPTICFVYQDPQGRHVKTYEVSLREKEFN---KGPWKQEN 211
Query: 281 CMISA---LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI--GGVLV----VGANTIHY 331
A +++ +I H+ K LA+ PI +V V N Y
Sbjct: 212 VEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRY 271
Query: 332 --HSQSASCALALNNYAVSLDSSQELP--RSSFSVELDAAHA-TWLQNDVALLSTKTGDL 386
+ L +D + L R E A T+L N V + ++ GD
Sbjct: 272 LLGDMEGRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDS 331
Query: 387 VLLTVVYD 394
L+ + D
Sbjct: 332 QLVKLNVD 339
Score = 43 (20.2 bits), Expect = 1.9e-11, Sum P(2) = 1.9e-11
Identities = 8/19 (42%), Positives = 12/19 (63%)
Query: 1061 VEEYEVRILEPDRAGGPWQ 1079
V+ YEV + E + GPW+
Sbjct: 190 VKTYEVSLREKEFNKGPWK 208
Score = 43 (20.2 bits), Expect = 1.6e-10, Sum P(5) = 1.6e-10
Identities = 19/58 (32%), Positives = 25/58 (43%)
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+TT +S + L + LGD L F T +LS K G P+ R RS S
Sbjct: 593 MTTFESSHYLLCA-LGDGALFYFGLNIETGLLSDRKKVTLGT----QPTVLRTFRSLS 645
Score = 41 (19.5 bits), Expect = 3.2e-19, Sum P(5) = 3.2e-19
Identities = 8/20 (40%), Positives = 13/20 (65%)
Query: 1042 GHQIDNHNLSSVDLHRTYTV 1061
G +++ HNL +D H T+ V
Sbjct: 783 GEEVEVHNLLIIDQH-TFEV 801
Score = 39 (18.8 bits), Expect = 3.2e-19, Sum P(5) = 3.2e-19
Identities = 11/35 (31%), Positives = 19/35 (54%)
Query: 679 SIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY 713
S R+ V D S T +++ A+ ++ VSS L+
Sbjct: 737 SSRIEVQDTSGGTTALRPSASTQALSSSVSSSKLF 771
>UNIPROTKB|P33194
symbol:DDB1 "DNA damage-binding protein 1" species:9534
"Chlorocebus aethiops" [GO:0005634 "nucleus" evidence=ISS]
[GO:0005737 "cytoplasm" evidence=ISS] [GO:0031464 "Cul4A-RING
ubiquitin ligase complex" evidence=ISS] [GO:0031465 "Cul4B-RING
ubiquitin ligase complex" evidence=ISS] [GO:0043161 "proteasomal
ubiquitin-dependent protein catabolic process" evidence=ISS]
[GO:0080008 "Cul4-RING ubiquitin ligase complex" evidence=ISS]
InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178
UniPathway:UPA00143 GO:GO:0005634 GO:GO:0005737 GO:GO:0043161
Gene3D:2.130.10.10 GO:GO:0003677 GO:GO:0006281 GO:GO:0016567
GO:GO:0031464 GO:GO:0031465 HOVERGEN:HBG005460 EMBL:L20216
PIR:S38777 PRIDE:P33194 Uniprot:P33194
Length = 1140
Score = 210 (79.0 bits), Expect = 3.2e-19, Sum P(5) = 3.2e-19
Identities = 76/297 (25%), Positives = 133/297 (44%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNH--YNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1339
Q L FH+G V F ++ + T + P + ++LFGT++G IG +
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGET-STP-----TQGSVLFGTVNGMIGLVTS 1042
Query: 1340 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
L E + L +Q +L + V + +R FH+ K P +D +L+ +
Sbjct: 1043 LSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTE-PAT-GFIDGDLIESF 1097
Score = 113 (44.8 bits), Expect = 3.2e-19, Sum P(5) = 3.2e-19
Identities = 54/208 (25%), Positives = 94/208 (45%)
Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ-H 295
N+R L+ HV D F++G P + +++ GR H + +S K+ +
Sbjct: 156 NIR-LEELHVIDVKFLYGCQAPTICFVYQDP---QGR----H----VKTYEVSLREKEFN 203
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--LNNYAVSLDSSQ 353
W N+ +A ++AVP P GG +++G +I YH+ A+A + + + ++
Sbjct: 204 KGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNR 263
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
P S + D ++ LL K + + D RV L L +T+ +
Sbjct: 264 VDPNGSRYLLGDMEGRLFM-----LLLEKEEQMDGTVTLKDLRV--EL-LGETS---IAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGS 441
+T + N + F+GSRLGDS LV+ S
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDS 340
Score = 65 (27.9 bits), Expect = 3.2e-19, Sum P(5) = 3.2e-19
Identities = 24/93 (25%), Positives = 43/93 (46%)
Query: 542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
++LPG KG+W + +D +R DD L++S +T VL E TE
Sbjct: 402 IDLPGIKGLWPL--------RSDPNRET--DDT----LVLSFVGQTRVLMLNGEEVEETE 447
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
+ + +T GN+ +++IQ+ R++
Sbjct: 448 LMGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 479
Score = 47 (21.6 bits), Expect = 1.1e-12, Sum P(5) = 1.1e-12
Identities = 42/188 (22%), Positives = 69/188 (36%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVH----G-YIEPVMVILHERELTWAGRVSWKHHT 280
F+ R+E HVI+++ L FV+ G +++ V L E+E + WK
Sbjct: 155 FNIRLEELHVIDVKFLYGCQAPTICFVYQDPQGRHVKTYEVSLREKEFN---KGPWKQEN 211
Query: 281 CMISA---LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI--GGVLV----VGANTIHY 331
A +++ +I H+ K LA+ PI +V V N Y
Sbjct: 212 VEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRY 271
Query: 332 --HSQSASCALALNNYAVSLDSSQELP--RSSFSVELDAAHA-TWLQNDVALLSTKTGDL 386
+ L +D + L R E A T+L N V + ++ GD
Sbjct: 272 LLGDMEGRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDS 331
Query: 387 VLLTVVYD 394
L+ + D
Sbjct: 332 QLVKLNVD 339
Score = 43 (20.2 bits), Expect = 1.9e-11, Sum P(2) = 1.9e-11
Identities = 8/19 (42%), Positives = 12/19 (63%)
Query: 1061 VEEYEVRILEPDRAGGPWQ 1079
V+ YEV + E + GPW+
Sbjct: 190 VKTYEVSLREKEFNKGPWK 208
Score = 43 (20.2 bits), Expect = 1.6e-10, Sum P(5) = 1.6e-10
Identities = 19/58 (32%), Positives = 25/58 (43%)
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+TT +S + L + LGD L F T +LS K G P+ R RS S
Sbjct: 593 MTTFESSHYLLCA-LGDGALFYFGLNIETGLLSDRKKVTLGT----QPTVLRTFRSLS 645
Score = 41 (19.5 bits), Expect = 3.2e-19, Sum P(5) = 3.2e-19
Identities = 8/20 (40%), Positives = 13/20 (65%)
Query: 1042 GHQIDNHNLSSVDLHRTYTV 1061
G +++ HNL +D H T+ V
Sbjct: 783 GEEVEVHNLLIIDQH-TFEV 801
Score = 39 (18.8 bits), Expect = 3.2e-19, Sum P(5) = 3.2e-19
Identities = 11/35 (31%), Positives = 19/35 (54%)
Query: 679 SIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY 713
S R+ V D S T +++ A+ ++ VSS L+
Sbjct: 737 SSRIEVQDTSGGTTALRPSASTQALSSSVSSSKLF 771
>UNIPROTKB|Q6P6Z0
symbol:ddb1 "DNA damage-binding protein 1" species:8355
"Xenopus laevis" [GO:0005634 "nucleus" evidence=ISS] [GO:0005737
"cytoplasm" evidence=ISS] [GO:0031464 "Cul4A-RING ubiquitin ligase
complex" evidence=ISS] [GO:0031465 "Cul4B-RING ubiquitin ligase
complex" evidence=ISS] [GO:0043161 "proteasomal ubiquitin-dependent
protein catabolic process" evidence=ISS] [GO:0080008 "Cul4-RING
ubiquitin ligase complex" evidence=ISS] InterPro:IPR004871
InterPro:IPR015943 Pfam:PF03178 UniPathway:UPA00143 GO:GO:0005634
GO:GO:0005737 GO:GO:0043161 Gene3D:2.130.10.10 GO:GO:0003677
GO:GO:0006281 GO:GO:0016567 GO:GO:0031464 GO:GO:0031465 KO:K10610
CTD:1642 HOVERGEN:HBG005460 HSSP:Q16531 EMBL:BC061946
RefSeq:NP_001083624.1 UniGene:Xl.23906 PRIDE:Q6P6Z0 GeneID:399026
KEGG:xla:399026 Xenbase:XB-GENE-967911 Uniprot:Q6P6Z0
Length = 1140
Score = 208 (78.3 bits), Expect = 3.7e-19, Sum P(4) = 3.7e-19
Identities = 81/316 (25%), Positives = 139/316 (43%)
Query: 1087 QSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNL 1145
Q +N T+ +V+ K+ T +GTA V ++ + GR+++F N L
Sbjct: 806 QFLQNEYTLSLVSC--KLGKDPTTYFVVGTAMVYPDEAEPKQGRIVVFQY-----NDGKL 858
Query: 1146 VTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVV 1200
T V KE+KGA+ ++ G LL + + L++WT TE N + + LY
Sbjct: 859 QT-VAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTAEKELRTECNH--YNNIMALY-- 913
Query: 1201 SLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSD 1260
L +FIL+GD+ +S+ L++K +A+DF A E L D + L ++
Sbjct: 914 -LKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AE 969
Query: 1261 EQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKT 1320
N+ + + + Q L FH+G V F ++ + T S T
Sbjct: 970 NAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGET-----SPPT 1024
Query: 1321 NRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAH 1380
++LFGT++G IG + L E + L +Q +L + V + +R FH+ K
Sbjct: 1025 QG-SVLFGTVNGMIGLVTSLSESWYNLLLDVQNRLNKVIKSVGKIEHSFWRSFHTERKTE 1083
Query: 1381 RPGPDSIVDCELLSHY 1396
P +D +L+ +
Sbjct: 1084 -PAT-GFIDGDLIESF 1097
Score = 112 (44.5 bits), Expect = 3.7e-19, Sum P(4) = 3.7e-19
Identities = 40/148 (27%), Positives = 68/148 (45%)
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
W N+ +A ++AVP P GG +++G +I YH+ A+A + S + +
Sbjct: 207 WKQENVEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIA----PPIIKQSTIVCHN 262
Query: 359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTVV----YDGRVVQR-LDLSKTNPSVLTS 413
+D + +L D+ G L +L + DG V + L + + +
Sbjct: 263 ----RVDVNGSRYLLGDME------GRLFMLLLEKEEQMDGSVTLKDLRVELLGETSIAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGS 441
+T + N + F+GSRLGDS LV+ T S
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLTTES 340
Score = 63 (27.2 bits), Expect = 3.7e-19, Sum P(4) = 3.7e-19
Identities = 23/93 (24%), Positives = 42/93 (45%)
Query: 542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
++LPG KG+W + R+AA D + L++S +T VL E T+
Sbjct: 402 IDLPGIKGLWPL-------------RVAA-DRDTDDTLVLSFVGQTRVLTLTGEEVEETD 447
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
+ +T GN+ +++IQ+ R++
Sbjct: 448 LAGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 479
Score = 52 (23.4 bits), Expect = 4.0e-13, Sum P(4) = 4.0e-13
Identities = 42/183 (22%), Positives = 68/183 (37%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVH----G-YIEPVMVILHERELTWAGRVSWKHHT 280
F+ R+E HVI+++ L FV+ G +++ V L E+E + + WK
Sbjct: 155 FNIRLEELHVIDVKFLYSCQAPTICFVYQDPQGRHVKTYEVSLREKEFS---KGPWKQEN 211
Query: 281 CMISA---LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI--GGVLV----VGANTIHY 331
A +++ +I H+ K LA+ PI +V V N Y
Sbjct: 212 VEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDVNGSRY 271
Query: 332 --HSQSASCALALNNYAVSLDSSQELP--RSSFSVELDAAHA-TWLQNDVALLSTKTGDL 386
+ L +D S L R E A T+L N V + ++ GD
Sbjct: 272 LLGDMEGRLFMLLLEKEEQMDGSVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDS 331
Query: 387 VLL 389
L+
Sbjct: 332 QLV 334
Score = 47 (21.6 bits), Expect = 1.6e-11, Sum P(4) = 1.6e-11
Identities = 19/60 (31%), Positives = 25/60 (41%)
Query: 413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
S + T S +L LGD L F+ + T +LS K G P+ R RS S
Sbjct: 590 SILMTSFESSHYLLCALGDGALFYFSLNTDTGLLSDRKKVTLGT----QPTVLRTFRSLS 645
Score = 46 (21.3 bits), Expect = 1.5e-11, Sum P(2) = 1.5e-11
Identities = 8/19 (42%), Positives = 13/19 (68%)
Query: 1061 VEEYEVRILEPDRAGGPWQ 1079
V+ YEV + E + + GPW+
Sbjct: 190 VKTYEVSLREKEFSKGPWK 208
Score = 46 (21.3 bits), Expect = 1.5e-11, Sum P(2) = 1.5e-11
Identities = 28/124 (22%), Positives = 52/124 (41%)
Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLV-KVDPQG 189
DS+ LA ++ +++ D+ I L I ++ FESP + + + F G L +++ Q
Sbjct: 689 DSLALA-NNSTLTIGTIDE-IQKLHIRTVPLFESPRKICYQEVSQCF--GVLSSRIEVQD 744
Query: 190 RCGGV--LVYGLQMIILKASQGGSGLV-GDEDTFGSGGGFSARIESSHVINLRDLDMKHV 246
GG L L +S S L G + G + + +I+ ++ H
Sbjct: 745 ASGGSSPLRPSASTQALSSSVSCSKLFSGSTSPHETSFGEEVEVHNLLIIDQHTFEVLHT 804
Query: 247 KDFI 250
F+
Sbjct: 805 HQFL 808
Score = 41 (19.5 bits), Expect = 3.7e-19, Sum P(4) = 3.7e-19
Identities = 8/20 (40%), Positives = 13/20 (65%)
Query: 1042 GHQIDNHNLSSVDLHRTYTV 1061
G +++ HNL +D H T+ V
Sbjct: 783 GEEVEVHNLLIIDQH-TFEV 801
>UNIPROTKB|Q5R649
symbol:DDB1 "DNA damage-binding protein 1" species:9601
"Pongo abelii" [GO:0005634 "nucleus" evidence=ISS] [GO:0005737
"cytoplasm" evidence=ISS] [GO:0031464 "Cul4A-RING ubiquitin ligase
complex" evidence=ISS] [GO:0031465 "Cul4B-RING ubiquitin ligase
complex" evidence=ISS] [GO:0043161 "proteasomal ubiquitin-dependent
protein catabolic process" evidence=ISS] [GO:0080008 "Cul4-RING
ubiquitin ligase complex" evidence=ISS] InterPro:IPR004871
InterPro:IPR015943 Pfam:PF03178 UniPathway:UPA00143 GO:GO:0005634
GO:GO:0005737 GO:GO:0043161 Gene3D:2.130.10.10 GO:GO:0003677
GO:GO:0006281 GO:GO:0016567 GO:GO:0031464 GO:GO:0031465 KO:K10610
CTD:1642 HOVERGEN:HBG005460 HSSP:Q16531 EMBL:CR860647
RefSeq:NP_001126613.1 UniGene:Pab.18111 GeneID:100173610
KEGG:pon:100173610 InParanoid:Q5R649 Uniprot:Q5R649
Length = 1140
Score = 208 (78.3 bits), Expect = 5.2e-19, Sum P(5) = 5.2e-19
Identities = 76/297 (25%), Positives = 132/297 (44%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ +
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYPMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNH--YNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1339
Q L FH+G V F ++ + T + P + ++LFGT++G IG +
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGET-STP-----TQGSVLFGTVNGMIGLVTS 1042
Query: 1340 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
L E + L +Q +L + V + +R FH+ K P +D +L+ +
Sbjct: 1043 LSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTE-PAT-GFIDGDLIESF 1097
Score = 113 (44.8 bits), Expect = 5.2e-19, Sum P(5) = 5.2e-19
Identities = 54/208 (25%), Positives = 94/208 (45%)
Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ-H 295
N+R L+ HV D F++G P + +++ GR H + +S K+ +
Sbjct: 156 NIR-LEELHVIDVKFLYGCQAPTICFVYQDP---QGR----H----VKTYEVSLREKEFN 203
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--LNNYAVSLDSSQ 353
W N+ +A ++AVP P GG +++G +I YH+ A+A + + + ++
Sbjct: 204 KGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNR 263
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
P S + D ++ LL K + + D RV L L +T+ +
Sbjct: 264 VDPNGSRYLLGDMEGRLFM-----LLLEKEEQMDGTVTLKDLRV--EL-LGETS---IAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGS 441
+T + N + F+GSRLGDS LV+ S
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDS 340
Score = 65 (27.9 bits), Expect = 5.2e-19, Sum P(5) = 5.2e-19
Identities = 24/93 (25%), Positives = 43/93 (46%)
Query: 542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
++LPG KG+W + +D +R DD L++S +T VL E TE
Sbjct: 402 IDLPGIKGLWPL--------RSDPNRET--DDT----LVLSFVGQTRVLMLNGEEVEETE 447
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
+ + +T GN+ +++IQ+ R++
Sbjct: 448 LMGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 479
Score = 47 (21.6 bits), Expect = 1.7e-12, Sum P(5) = 1.7e-12
Identities = 42/188 (22%), Positives = 69/188 (36%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVH----G-YIEPVMVILHERELTWAGRVSWKHHT 280
F+ R+E HVI+++ L FV+ G +++ V L E+E + WK
Sbjct: 155 FNIRLEELHVIDVKFLYGCQAPTICFVYQDPQGRHVKTYEVSLREKEFN---KGPWKQEN 211
Query: 281 CMISA---LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI--GGVLV----VGANTIHY 331
A +++ +I H+ K LA+ PI +V V N Y
Sbjct: 212 VEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRY 271
Query: 332 --HSQSASCALALNNYAVSLDSSQELP--RSSFSVELDAAHA-TWLQNDVALLSTKTGDL 386
+ L +D + L R E A T+L N V + ++ GD
Sbjct: 272 LLGDMEGRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDS 331
Query: 387 VLLTVVYD 394
L+ + D
Sbjct: 332 QLVKLNVD 339
Score = 43 (20.2 bits), Expect = 3.1e-11, Sum P(2) = 3.1e-11
Identities = 8/19 (42%), Positives = 12/19 (63%)
Query: 1061 VEEYEVRILEPDRAGGPWQ 1079
V+ YEV + E + GPW+
Sbjct: 190 VKTYEVSLREKEFNKGPWK 208
Score = 43 (20.2 bits), Expect = 2.6e-10, Sum P(5) = 2.6e-10
Identities = 19/58 (32%), Positives = 25/58 (43%)
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+TT +S + L + LGD L F T +LS K G P+ R RS S
Sbjct: 593 MTTFESSHYLLCA-LGDGALFYFGLNIETGLLSDRKKVTLGT----QPTVLRTFRSLS 645
Score = 41 (19.5 bits), Expect = 5.2e-19, Sum P(5) = 5.2e-19
Identities = 8/20 (40%), Positives = 13/20 (65%)
Query: 1042 GHQIDNHNLSSVDLHRTYTV 1061
G +++ HNL +D H T+ V
Sbjct: 783 GEEVEVHNLLIIDQH-TFEV 801
Score = 39 (18.8 bits), Expect = 5.2e-19, Sum P(5) = 5.2e-19
Identities = 11/35 (31%), Positives = 19/35 (54%)
Query: 679 SIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY 713
S R+ V D S T +++ A+ ++ VSS L+
Sbjct: 737 SSRIEVQDTSGGTTALRPSASTQALSSSVSSSKLF 771
>UNIPROTKB|F5GY55
symbol:DDB1 "Uncharacterized protein" species:9606 "Homo
sapiens" [GO:0003676 "nucleic acid binding" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] InterPro:IPR004871
InterPro:IPR015943 Pfam:PF03178 GO:GO:0005634 Gene3D:2.130.10.10
GO:GO:0003676 EMBL:AP003108 HGNC:HGNC:2717 ChiTaRS:DDB1
EMBL:AP003037 IPI:IPI00977083 SMR:F5GY55 Ensembl:ENST00000540166
Uniprot:F5GY55
Length = 1092
Score = 197 (74.4 bits), Expect = 6.2e-19, Sum P(3) = 6.2e-19
Identities = 97/398 (24%), Positives = 168/398 (42%)
Query: 997 KVIPLKATPHQITYFAEKNLYPLIVS-VPVLKPLNQVLSL---LIDQEVGHQIDNHNL-- 1050
+ +PL +P +I Y + ++ S + V +L Q + + + L
Sbjct: 713 RTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKLFS 772
Query: 1051 SSVDLHRTYTVEEYEVR-ILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENE 1109
SS H T EE EV +L D+ ++ +E AL++ L K+
Sbjct: 773 SSTAPHETSFGEEVEVHNLLIIDQH--TFEVLHAHQFLQNEYALSLVSCKL----GKDPN 826
Query: 1110 TLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1168
T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++ G L
Sbjct: 827 TYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEFNGKL 880
Query: 1169 LIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
L + + L++WT TE N + + LY L +FIL+GD+ +S+ L++
Sbjct: 881 LASINSTVRLYEWTTEKELRTECNH--YNNIMALY---LKTKGDFILVGDLMRSVLLLAY 935
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
K +A+DF A E L D + L ++ N+ + + + Q L
Sbjct: 936 KPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEERQHL 992
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1343
FH+G V F ++ + T + P + ++LFGT++G IG + L E
Sbjct: 993 QEVGLFHLGEFVNVFCHGSLVMQNLGET-STP-----TQGSVLFGTVNGMIGLVTSLSES 1046
Query: 1344 TFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHR 1381
+ L +Q +L + V + FH +HR
Sbjct: 1047 WYNLLLDMQNRLNKVIKSVGKIE----HSFHLEILSHR 1080
Score = 113 (44.8 bits), Expect = 6.2e-19, Sum P(3) = 6.2e-19
Identities = 54/208 (25%), Positives = 94/208 (45%)
Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ-H 295
N+R L+ HV D F++G P + +++ GR H + +S K+ +
Sbjct: 156 NIR-LEELHVIDVKFLYGCQAPTICFVYQDP---QGR----H----VKTYEVSLREKEFN 203
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--LNNYAVSLDSSQ 353
W N+ +A ++AVP P GG +++G +I YH+ A+A + + + ++
Sbjct: 204 KGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNR 263
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
P S + D ++ LL K + + D RV L L +T+ +
Sbjct: 264 VDPNGSRYLLGDMEGRLFM-----LLLEKEEQMDGTVTLKDLRV--EL-LGETS---IAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGS 441
+T + N + F+GSRLGDS LV+ S
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDS 340
Score = 65 (27.9 bits), Expect = 6.2e-19, Sum P(3) = 6.2e-19
Identities = 24/93 (25%), Positives = 43/93 (46%)
Query: 542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
++LPG KG+W + +D +R DD L++S +T VL E TE
Sbjct: 402 IDLPGIKGLWPL--------RSDPNRET--DDT----LVLSFVGQTRVLMLNGEEVEETE 447
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
+ + +T GN+ +++IQ+ R++
Sbjct: 448 LMGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 479
Score = 47 (21.6 bits), Expect = 3.4e-12, Sum P(3) = 3.4e-12
Identities = 42/188 (22%), Positives = 69/188 (36%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVH----G-YIEPVMVILHERELTWAGRVSWKHHT 280
F+ R+E HVI+++ L FV+ G +++ V L E+E + WK
Sbjct: 155 FNIRLEELHVIDVKFLYGCQAPTICFVYQDPQGRHVKTYEVSLREKEFN---KGPWKQEN 211
Query: 281 CMISA---LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI--GGVLV----VGANTIHY 331
A +++ +I H+ K LA+ PI +V V N Y
Sbjct: 212 VEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRY 271
Query: 332 --HSQSASCALALNNYAVSLDSSQELP--RSSFSVELDAAHA-TWLQNDVALLSTKTGDL 386
+ L +D + L R E A T+L N V + ++ GD
Sbjct: 272 LLGDMEGRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDS 331
Query: 387 VLLTVVYD 394
L+ + D
Sbjct: 332 QLVKLNVD 339
Score = 43 (20.2 bits), Expect = 5.7e-10, Sum P(3) = 5.7e-10
Identities = 19/58 (32%), Positives = 25/58 (43%)
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+TT +S + L + LGD L F T +LS K G P+ R RS S
Sbjct: 593 MTTFESSHYLLCA-LGDGALFYFGLNIETGLLSDRKKVTLGT----QPTVLRTFRSLS 645
>UNIPROTKB|J9NVR7
symbol:DDB1 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0005634 "nucleus" evidence=IEA] [GO:0003676
"nucleic acid binding" evidence=IEA] InterPro:IPR004871
InterPro:IPR015943 Pfam:PF03178 GO:GO:0005634 Gene3D:2.130.10.10
GO:GO:0003676 GeneTree:ENSGT00530000063396 EMBL:AAEX03011677
Ensembl:ENSCAFT00000049486 Uniprot:J9NVR7
Length = 1084
Score = 193 (73.0 bits), Expect = 1.6e-18, Sum P(3) = 1.6e-18
Identities = 92/372 (24%), Positives = 160/372 (43%)
Query: 997 KVIPLKATPHQITYFAEKNLYPLIVS-VPVLKPLNQVLSL---LIDQEVGHQIDNHNL-- 1050
+ +PL +P +I Y + ++ S + V +L Q + + + L
Sbjct: 713 RTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKLFS 772
Query: 1051 SSVDLHRTYTVEEYEVR-ILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENE 1109
SS H T EE EV +L D+ ++ +E AL++ L K+
Sbjct: 773 SSTAPHETSFGEEVEVHNLLIIDQH--TFEVLHAHQFLQNEYALSLVSCKL----GKDPN 826
Query: 1110 TLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1168
T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++ G L
Sbjct: 827 TYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEFNGKL 880
Query: 1169 LIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
L + + L++WT TE N + + LY L +FIL+GD+ +S+ L++
Sbjct: 881 LASINSTVRLYEWTTEKELRTECNH--YNNIMALY---LKTKGDFILVGDLMRSVLLLAY 935
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
K +A+DF A E L D + L ++ N+ + + + Q L
Sbjct: 936 KPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEERQHL 992
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1343
FH+G V F ++ + T + P + ++LFGT++G IG + L E
Sbjct: 993 QEVGLFHLGEFVNVFCHGSLVMQNLGET-STP-----TQGSVLFGTVNGMIGLVTSLSES 1046
Query: 1344 TFRRLQSLQKKL 1355
+ L +Q +L
Sbjct: 1047 WYNLLLDMQNRL 1058
Score = 113 (44.8 bits), Expect = 1.6e-18, Sum P(3) = 1.6e-18
Identities = 54/208 (25%), Positives = 94/208 (45%)
Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ-H 295
N+R L+ HV D F++G P + +++ GR H + +S K+ +
Sbjct: 156 NIR-LEELHVIDVKFLYGCQAPTICFVYQDP---QGR----H----VKTYEVSLREKEFN 203
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--LNNYAVSLDSSQ 353
W N+ +A ++AVP P GG +++G +I YH+ A+A + + + ++
Sbjct: 204 KGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNR 263
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
P S + D ++ LL K + + D RV L L +T+ +
Sbjct: 264 VDPNGSRYLLGDMEGRLFM-----LLLEKEEQMDGTVTLKDLRV--EL-LGETS---IAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGS 441
+T + N + F+GSRLGDS LV+ S
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQLVKLNVDS 340
Score = 65 (27.9 bits), Expect = 1.6e-18, Sum P(3) = 1.6e-18
Identities = 24/93 (25%), Positives = 43/93 (46%)
Query: 542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
++LPG KG+W + +D +R DD L++S +T VL E TE
Sbjct: 402 IDLPGIKGLWPL--------RSDPNRET--DDT----LVLSFVGQTRVLMLNGEEVEETE 447
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
+ + +T GN+ +++IQ+ R++
Sbjct: 448 LMGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 479
Score = 47 (21.6 bits), Expect = 8.7e-12, Sum P(3) = 8.7e-12
Identities = 42/188 (22%), Positives = 69/188 (36%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVH----G-YIEPVMVILHERELTWAGRVSWKHHT 280
F+ R+E HVI+++ L FV+ G +++ V L E+E + WK
Sbjct: 155 FNIRLEELHVIDVKFLYGCQAPTICFVYQDPQGRHVKTYEVSLREKEFN---KGPWKQEN 211
Query: 281 CMISA---LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI--GGVLV----VGANTIHY 331
A +++ +I H+ K LA+ PI +V V N Y
Sbjct: 212 VEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRY 271
Query: 332 --HSQSASCALALNNYAVSLDSSQELP--RSSFSVELDAAHA-TWLQNDVALLSTKTGDL 386
+ L +D + L R E A T+L N V + ++ GD
Sbjct: 272 LLGDMEGRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDS 331
Query: 387 VLLTVVYD 394
L+ + D
Sbjct: 332 QLVKLNVD 339
Score = 43 (20.2 bits), Expect = 1.5e-09, Sum P(3) = 1.5e-09
Identities = 19/58 (32%), Positives = 25/58 (43%)
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+TT +S + L + LGD L F T +LS K G P+ R RS S
Sbjct: 593 MTTFESSHYLLCA-LGDGALFYFGLNIETGLLSDRKKVTLGT----QPTVLRTFRSLS 645
>UNIPROTKB|F1P4I8
symbol:DDB1 "DNA damage-binding protein 1" species:9031
"Gallus gallus" [GO:0003676 "nucleic acid binding" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] InterPro:IPR004871
InterPro:IPR015943 Pfam:PF03178 GO:GO:0005634 Gene3D:2.130.10.10
GO:GO:0003676 GeneTree:ENSGT00530000063396 EMBL:AADN02017118
EMBL:AADN02017119 IPI:IPI00818299 Ensembl:ENSGALT00000008352
ArrayExpress:F1P4I8 Uniprot:F1P4I8
Length = 1120
Score = 201 (75.8 bits), Expect = 3.0e-18, Sum P(4) = 3.0e-18
Identities = 80/330 (24%), Positives = 144/330 (43%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F +D + E KE+KGA+ ++
Sbjct: 803 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---HYSDGKLQSLAE---KEVKGAVYSMVEF 856
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 857 NGKLLASINSTVRLYEWTAEKELRTECNH--YNNIMALY---LKTKGDFILVGDLMRSVL 911
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 912 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 968
Query: 1280 GQKLLSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIGCIA 1338
Q L H+G V F ++ + +++ GS +LFGT++G IG +
Sbjct: 969 RQHLQEVGLSHLGEFVNVFCHGSLVMQNLGEKSTPTQGS-------VLFGTVNGMIGLVT 1021
Query: 1339 PLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY-- 1396
L E + L +Q +L + V + ++R FH+ K P +D +L+ +
Sbjct: 1022 SLSESWYNLLLDMQNRLNKVIKSVGKIEHATWRSFHTERKTE-PAT-GFIDGDLIESFLD 1079
Query: 1397 ----EMLPLEEQLEIAHQTGTTRSQILSNL 1422
+M + L+I +G R + +L
Sbjct: 1080 ISRPKMQEVVANLQIDDGSGMKREATVDDL 1109
Score = 108 (43.1 bits), Expect = 3.0e-18, Sum P(4) = 3.0e-18
Identities = 40/145 (27%), Positives = 68/145 (46%)
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--LNNYAVSLDSSQELP 356
W N+ +A ++AVP P GG +++G +I YH+ A+A + + + ++ P
Sbjct: 187 WKQENVEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDP 246
Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDIT 416
S + D ++ LL K + + D RV L L +T+ + +T
Sbjct: 247 NGSRYLLGDMEGRLFM-----LLLEKEEQMDGTVTLKDLRV--EL-LGETS---IAECLT 295
Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGS 441
+ N + F+GSRLGDS LV+ S
Sbjct: 296 YLDNGVVFVGSRLGDSQLVKLNVDS 320
Score = 65 (27.9 bits), Expect = 3.0e-18, Sum P(4) = 3.0e-18
Identities = 24/93 (25%), Positives = 41/93 (44%)
Query: 542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
++LPG KG+W + +DS R E L++S +T VL E TE
Sbjct: 382 IDLPGIKGLWPL--------RSDSHR------EMDNMLVLSFVGQTRVLMLNGEEVEETE 427
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
+ +T GN+ +++IQ+ R++
Sbjct: 428 LTGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 459
Score = 43 (20.2 bits), Expect = 1.6e-10, Sum P(2) = 1.6e-10
Identities = 8/19 (42%), Positives = 12/19 (63%)
Query: 1061 VEEYEVRILEPDRAGGPWQ 1079
V+ YEV + E + GPW+
Sbjct: 170 VKTYEVSLREKEFNKGPWK 188
Score = 43 (20.2 bits), Expect = 9.7e-10, Sum P(3) = 9.7e-10
Identities = 19/58 (32%), Positives = 25/58 (43%)
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+TT +S + L + LGD L F T +LS K G P+ R RS S
Sbjct: 573 MTTFESSHYLLCA-LGDGALFYFGLSLETGLLSDRKKVTLGT----QPTVLRTFRSLS 625
Score = 41 (19.5 bits), Expect = 3.0e-18, Sum P(4) = 3.0e-18
Identities = 8/20 (40%), Positives = 13/20 (65%)
Query: 1042 GHQIDNHNLSSVDLHRTYTV 1061
G +++ HNL +D H T+ V
Sbjct: 763 GEEVEVHNLLIIDQH-TFEV 781
>UNIPROTKB|Q805F9
symbol:DDB1 "DNA damage-binding protein 1" species:9031
"Gallus gallus" [GO:0003677 "DNA binding" evidence=IEA] [GO:0016567
"protein ubiquitination" evidence=IEA] [GO:0005654 "nucleoplasm"
evidence=TAS] [GO:0006281 "DNA repair" evidence=TAS] [GO:0080008
"Cul4-RING ubiquitin ligase complex" evidence=ISS] [GO:0031465
"Cul4B-RING ubiquitin ligase complex" evidence=ISS] [GO:0005634
"nucleus" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISS]
[GO:0043161 "proteasomal ubiquitin-dependent protein catabolic
process" evidence=ISS] [GO:0031464 "Cul4A-RING ubiquitin ligase
complex" evidence=ISS] InterPro:IPR004871 InterPro:IPR015943
Pfam:PF03178 UniPathway:UPA00143 GO:GO:0005737 GO:GO:0005654
GO:GO:0043161 Gene3D:2.130.10.10 GO:GO:0003677 GO:GO:0006281
GO:GO:0016567 Reactome:REACT_115612 GO:GO:0031464 GO:GO:0031465
eggNOG:NOG247734 HOGENOM:HOG000007241 KO:K10610 CTD:1642
HOVERGEN:HBG005460 OrthoDB:EOG4KPT91 HSSP:Q16531 EMBL:AB074298
EMBL:AJ719779 IPI:IPI00597295 RefSeq:NP_989547.1 UniGene:Gga.12977
STRING:Q805F9 PRIDE:Q805F9 GeneID:374050 KEGG:gga:374050
NextBio:20813572 Uniprot:Q805F9
Length = 1140
Score = 200 (75.5 bits), Expect = 4.1e-18, Sum P(4) = 4.1e-18
Identities = 80/329 (24%), Positives = 143/329 (43%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F +D + E KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---HYSDGKLQSLAE---KEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTAEKELRTECNH--YNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1339
Q L H+G V F ++ + T + P + ++LFGT++G IG +
Sbjct: 989 RQHLQEVGLSHLGEFVNVFCHGSLVMQNLGET-STP-----TQGSVLFGTVNGMIGLVTS 1042
Query: 1340 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY--- 1396
L E + L +Q +L + V + +R FH+ K P +D +L+ +
Sbjct: 1043 LSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTE-PAT-GFIDGDLIESFLDI 1100
Query: 1397 ---EMLPLEEQLEIAHQTGTTRSQILSNL 1422
+M + L+I +G R + +L
Sbjct: 1101 SRPKMQEVVANLQIDDGSGMKREATVDDL 1129
Score = 108 (43.1 bits), Expect = 4.1e-18, Sum P(4) = 4.1e-18
Identities = 40/145 (27%), Positives = 68/145 (46%)
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--LNNYAVSLDSSQELP 356
W N+ +A ++AVP P GG +++G +I YH+ A+A + + + ++ P
Sbjct: 207 WKQENVEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDP 266
Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDIT 416
S + D ++ LL K + + D RV L L +T+ + +T
Sbjct: 267 NGSRYLLGDMEGRLFM-----LLLEKEEQMDGTVTLKDLRV--EL-LGETS---IAECLT 315
Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGS 441
+ N + F+GSRLGDS LV+ S
Sbjct: 316 YLDNGVVFVGSRLGDSQLVKLNVDS 340
Score = 65 (27.9 bits), Expect = 4.1e-18, Sum P(4) = 4.1e-18
Identities = 24/93 (25%), Positives = 41/93 (44%)
Query: 542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
++LPG KG+W + +DS R E L++S +T VL E TE
Sbjct: 402 IDLPGIKGLWPL--------RSDSHR------EMDNMLVLSFVGQTRVLMLNGEEVEETE 447
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
+ +T GN+ +++IQ+ R++
Sbjct: 448 LTGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 479
Score = 43 (20.2 bits), Expect = 2.2e-10, Sum P(2) = 2.2e-10
Identities = 8/19 (42%), Positives = 12/19 (63%)
Query: 1061 VEEYEVRILEPDRAGGPWQ 1079
V+ YEV + E + GPW+
Sbjct: 190 VKTYEVSLREKEFNKGPWK 208
Score = 43 (20.2 bits), Expect = 1.3e-09, Sum P(3) = 1.3e-09
Identities = 19/58 (32%), Positives = 25/58 (43%)
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+TT +S + L + LGD L F T +LS K G P+ R RS S
Sbjct: 593 MTTFESSHYLLCA-LGDGALFYFGLSLETGLLSDRKKVTLGT----QPTVLRTFRSLS 645
Score = 41 (19.5 bits), Expect = 4.1e-18, Sum P(4) = 4.1e-18
Identities = 8/20 (40%), Positives = 13/20 (65%)
Query: 1042 GHQIDNHNLSSVDLHRTYTV 1061
G +++ HNL +D H T+ V
Sbjct: 783 GEEVEVHNLLIIDQH-TFEV 801
>UNIPROTKB|F1NVV3
symbol:DDB1 "DNA damage-binding protein 1" species:9031
"Gallus gallus" [GO:0003676 "nucleic acid binding" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] InterPro:IPR004871
InterPro:IPR015943 Pfam:PF03178 GO:GO:0005634 Gene3D:2.130.10.10
GO:GO:0003676 GeneTree:ENSGT00530000063396 EMBL:AADN02017118
EMBL:AADN02017119 IPI:IPI00821712 Ensembl:ENSGALT00000040604
ArrayExpress:F1NVV3 Uniprot:F1NVV3
Length = 1119
Score = 194 (73.4 bits), Expect = 4.7e-18, Sum P(3) = 4.7e-18
Identities = 104/446 (23%), Positives = 184/446 (41%)
Query: 997 KVIPLKATPHQITYFAEKNLYPLIVS-VPVLKPLNQVLSL---LIDQEVGHQIDNHNL-- 1050
+ +PL +P +I Y + ++ S + V +L Q + + L
Sbjct: 693 RTVPLYESPRKICYQEVSQCFGVLSSRIEVQDASGGTTALRPSASTQALSSSVSTSKLFS 752
Query: 1051 SSVDLHRTYTVEEYEVR-ILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENE 1109
SS H T EE EV +L D+ ++ +E AL++ L K+
Sbjct: 753 SSTAPHETSFGEEVEVHNLLIIDQH--TFEVLHAHQFLQNEYALSLVSCKL----GKDPN 806
Query: 1110 TLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1168
T +GTA V E+ + GR+++F +D + E KE+KGA+ ++ G L
Sbjct: 807 TYFIVGTAMVYPEEAEPKQGRIVVF---HYSDGKLQSLAE---KEVKGAVYSMVEFNGKL 860
Query: 1169 LIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
L + + L++WT TE N + + LY L +FIL+GD+ +S+ L++
Sbjct: 861 LASINSTVRLYEWTAEKELRTECNH--YNNIMALY---LKTKGDFILVGDLMRSVLLLAY 915
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
K +A+DF A E L D + L ++ N+ + + + Q L
Sbjct: 916 KPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEERQHL 972
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDE 1342
H+G V F ++ + +++ GS +LFGT++G IG + L E
Sbjct: 973 QEVGLSHLGEFVNVFCHGSLVMQNLGEKSTPTQGS-------VLFGTVNGMIGLVTSLSE 1025
Query: 1343 LTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY------ 1396
+ L +Q +L + V + S FH+ K P +D +L+ +
Sbjct: 1026 SWYNLLLDMQNRLNKVIKSVGKIE-HSLYSFHTERKTE-PAT-GFIDGDLIESFLDISRP 1082
Query: 1397 EMLPLEEQLEIAHQTGTTRSQILSNL 1422
+M + L+I +G R + +L
Sbjct: 1083 KMQEVVANLQIDDGSGMKREATVDDL 1108
Score = 108 (43.1 bits), Expect = 4.7e-18, Sum P(3) = 4.7e-18
Identities = 40/145 (27%), Positives = 68/145 (46%)
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--LNNYAVSLDSSQELP 356
W N+ +A ++AVP P GG +++G +I YH+ A+A + + + ++ P
Sbjct: 187 WKQENVEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDP 246
Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDIT 416
S + D ++ LL K + + D RV L L +T+ + +T
Sbjct: 247 NGSRYLLGDMEGRLFM-----LLLEKEEQMDGTVTLKDLRV--EL-LGETS---IAECLT 295
Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGS 441
+ N + F+GSRLGDS LV+ S
Sbjct: 296 YLDNGVVFVGSRLGDSQLVKLNVDS 320
Score = 65 (27.9 bits), Expect = 4.7e-18, Sum P(3) = 4.7e-18
Identities = 24/93 (25%), Positives = 41/93 (44%)
Query: 542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
++LPG KG+W + +DS R E L++S +T VL E TE
Sbjct: 382 IDLPGIKGLWPL--------RSDSHR------EMDNMLVLSFVGQTRVLMLNGEEVEETE 427
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
+ +T GN+ +++IQ+ R++
Sbjct: 428 LTGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 459
Score = 43 (20.2 bits), Expect = 9.0e-10, Sum P(2) = 9.0e-10
Identities = 19/58 (32%), Positives = 25/58 (43%)
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+TT +S + L + LGD L F T +LS K G P+ R RS S
Sbjct: 573 MTTFESSHYLLCA-LGDGALFYFGLSLETGLLSDRKKVTLGT----QPTVLRTFRSLS 625
>UNIPROTKB|F1NVV2
symbol:DDB1 "DNA damage-binding protein 1" species:9031
"Gallus gallus" [GO:0003676 "nucleic acid binding" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] [GO:0000075 "cell cycle
checkpoint" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA]
[GO:0016055 "Wnt receptor signaling pathway" evidence=IEA]
[GO:0031464 "Cul4A-RING ubiquitin ligase complex" evidence=IEA]
[GO:0031465 "Cul4B-RING ubiquitin ligase complex" evidence=IEA]
[GO:0042787 "protein ubiquitination involved in ubiquitin-dependent
protein catabolic process" evidence=IEA] [GO:0043161 "proteasomal
ubiquitin-dependent protein catabolic process" evidence=IEA]
InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178 GO:GO:0005634
GO:GO:0005737 GO:GO:0043161 GO:GO:0016055 Gene3D:2.130.10.10
GO:GO:0003676 GO:GO:0042787 GO:GO:0000075 GO:GO:0031464
GO:GO:0031465 OMA:CALGDGS GeneTree:ENSGT00530000063396
IPI:IPI00597295 EMBL:AADN02017118 EMBL:AADN02017119
Ensembl:ENSGALT00000040605 ArrayExpress:F1NVV2 Uniprot:F1NVV2
Length = 1123
Score = 194 (73.4 bits), Expect = 4.7e-18, Sum P(3) = 4.7e-18
Identities = 105/449 (23%), Positives = 187/449 (41%)
Query: 997 KVIPLKATPHQITYFAEKNLYPLIVS-VPVLKPLNQVLSL---LIDQEVGHQIDNHNL-- 1050
+ +PL +P +I Y + ++ S + V +L Q + + L
Sbjct: 693 RTVPLYESPRKICYQEVSQCFGVLSSRIEVQDASGGTTALRPSASTQALSSSVSTSKLFS 752
Query: 1051 SSVDLHRTYTVEEYEVR-ILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENE 1109
SS H T EE EV +L D+ ++ +E AL++ L K+
Sbjct: 753 SSTAPHETSFGEEVEVHNLLIIDQH--TFEVLHAHQFLQNEYALSLVSCKL----GKDPN 806
Query: 1110 TLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1168
T +GTA V E+ + GR+++F +D + E KE+KGA+ ++ G L
Sbjct: 807 TYFIVGTAMVYPEEAEPKQGRIVVF---HYSDGKLQSLAE---KEVKGAVYSMVEFNGKL 860
Query: 1169 LIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
L + + L++WT TE N + + LY L +FIL+GD+ +S+ L++
Sbjct: 861 LASINSTVRLYEWTAEKELRTECNH--YNNIMALY---LKTKGDFILVGDLMRSVLLLAY 915
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
K +A+DF A E L D + L ++ N+ + + + Q L
Sbjct: 916 KPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEERQHL 972
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDE 1342
H+G V F ++ + +++ GS +LFGT++G IG + L E
Sbjct: 973 QEVGLSHLGEFVNVFCHGSLVMQNLGEKSTPTQGS-------VLFGTVNGMIGLVTSLSE 1025
Query: 1343 LTFRRLQSLQKKL---VDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY--- 1396
+ L +Q +L + SV + ++R FH+ K P +D +L+ +
Sbjct: 1026 SWYNLLLDMQNRLNKVIKSVGKIEHSLYATWRSFHTERKTE-PAT-GFIDGDLIESFLDI 1083
Query: 1397 ---EMLPLEEQLEIAHQTGTTRSQILSNL 1422
+M + L+I +G R + +L
Sbjct: 1084 SRPKMQEVVANLQIDDGSGMKREATVDDL 1112
Score = 108 (43.1 bits), Expect = 4.7e-18, Sum P(3) = 4.7e-18
Identities = 40/145 (27%), Positives = 68/145 (46%)
Query: 299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--LNNYAVSLDSSQELP 356
W N+ +A ++AVP P GG +++G +I YH+ A+A + + + ++ P
Sbjct: 187 WKQENVEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDP 246
Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDIT 416
S + D ++ LL K + + D RV L L +T+ + +T
Sbjct: 247 NGSRYLLGDMEGRLFM-----LLLEKEEQMDGTVTLKDLRV--EL-LGETS---IAECLT 295
Query: 417 TIGNSLFFLGSRLGDSLLVQFTCGS 441
+ N + F+GSRLGDS LV+ S
Sbjct: 296 YLDNGVVFVGSRLGDSQLVKLNVDS 320
Score = 65 (27.9 bits), Expect = 4.7e-18, Sum P(3) = 4.7e-18
Identities = 24/93 (25%), Positives = 41/93 (44%)
Query: 542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
++LPG KG+W + +DS R E L++S +T VL E TE
Sbjct: 382 IDLPGIKGLWPL--------RSDSHR------EMDNMLVLSFVGQTRVLMLNGEEVEETE 427
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
+ +T GN+ +++IQ+ R++
Sbjct: 428 LTGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 459
Score = 43 (20.2 bits), Expect = 9.1e-10, Sum P(2) = 9.1e-10
Identities = 19/58 (32%), Positives = 25/58 (43%)
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+TT +S + L + LGD L F T +LS K G P+ R RS S
Sbjct: 573 MTTFESSHYLLCA-LGDGALFYFGLSLETGLLSDRKKVTLGT----QPTVLRTFRSLS 625
>FB|FBgn0260962
symbol:pic "piccolo" species:7227 "Drosophila melanogaster"
[GO:0003684 "damaged DNA binding" evidence=ISS] [GO:0006289
"nucleotide-excision repair" evidence=ISS;NAS] [GO:0005634
"nucleus" evidence=IEA] [GO:0006974 "response to DNA damage
stimulus" evidence=IMP] [GO:0035220 "wing disc development"
evidence=IMP] [GO:0005515 "protein binding" evidence=IPI]
[GO:0042787 "protein ubiquitination involved in ubiquitin-dependent
protein catabolic process" evidence=ISS] [GO:0007307 "eggshell
chorion gene amplification" evidence=IDA] [GO:0007095 "mitotic G2
DNA damage checkpoint" evidence=IGI] InterPro:IPR004871
Pfam:PF03178 UniPathway:UPA00143 EMBL:AE014297 GO:GO:0005634
GO:GO:0005737 GO:GO:0007095 GO:GO:0043161 GO:GO:0003677
GO:GO:0006281 GO:GO:0035220 GO:GO:0042787 GO:GO:0007307
eggNOG:NOG247734 KO:K10610 OMA:CALGDGS GeneTree:ENSGT00530000063396
HSSP:Q16531 EMBL:AF132145 RefSeq:NP_650257.1 UniGene:Dm.3215
ProteinModelPortal:Q9XYZ5 SMR:Q9XYZ5 STRING:Q9XYZ5 PaxDb:Q9XYZ5
PRIDE:Q9XYZ5 EnsemblMetazoa:FBtr0082709 GeneID:41611
KEGG:dme:Dmel_CG7769 UCSC:CG7769-RA CTD:41611 FlyBase:FBgn0260962
InParanoid:Q9XYZ5 OrthoDB:EOG4S1RP0 PhylomeDB:Q9XYZ5
GenomeRNAi:41611 NextBio:824642 Bgee:Q9XYZ5 Uniprot:Q9XYZ5
Length = 1140
Score = 161 (61.7 bits), Expect = 1.7e-17, Sum P(6) = 1.7e-17
Identities = 66/289 (22%), Positives = 123/289 (42%)
Query: 1110 TLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1168
T + T+ V E+ + GR+++F +N +T+V ++ G AL G +
Sbjct: 828 TYYVVATSLVIPEEPEPKVGRIIIFHYH------ENKLTQVAETKVDGTCYALVEFNGKV 881
Query: 1169 LIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGA 1228
L G + L++WT + + + + L +FIL+GD+ +SI L K+
Sbjct: 882 LAGIGSFVRLYEWTNEKELRMECNIQNMIAALFLKAKGDFILVGDLMRSITLLQHKQMEG 941
Query: 1229 QLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAE 1288
+A+D A E L D + L S+ N+ + + + Q L A
Sbjct: 942 IFVEIARDCEPKWMRAVEILDDDTFLG---SETNGNLFVCQKDSAATTDEERQLLPELAR 998
Query: 1289 FHVGAHVTKFLRLQMLATS-SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELTFRR 1347
FH+G V F ++ + +RT G +L+GT +G+IG + + + +
Sbjct: 999 FHLGDTVNVFRHGSLVMQNVGERTTPING-------CVLYGTCNGAIGIVTQIPQDFYDF 1051
Query: 1348 LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
L L+++L + V + +R F N K P + +D +L+ +
Sbjct: 1052 LHGLEERLKKIIKSVGKIEHTYYRNFQINSKVE-PS-EGFIDGDLIESF 1098
Score = 141 (54.7 bits), Expect = 1.7e-17, Sum P(6) = 1.7e-17
Identities = 59/205 (28%), Positives = 94/205 (45%)
Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
NLR +D +V D F+HG + P ++++H+ GR H I+ L +K
Sbjct: 156 NLR-MDELNVYDVEFLHGCLNPTVIVIHKDS---DGRHVKSHE---IN-LRDKEFMK--- 204
Query: 297 LIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELP 356
+ W N+ +A L+ VPSPIGGV+V+G +I YH S N +AV+ P
Sbjct: 205 IAWKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA-------P 250
Query: 357 RSSFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVL 411
+ F +A N + LL G L +L + G V+ + + + +
Sbjct: 251 LT-FRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEISI 309
Query: 412 TSDITTIGNSLFFLGSRLGDSLLVQ 436
IT + N ++G+R GDS LV+
Sbjct: 310 PECITYLDNGFLYIGARHGDSQLVR 334
Score = 64 (27.6 bits), Expect = 1.7e-17, Sum P(6) = 1.7e-17
Identities = 31/152 (20%), Positives = 60/152 (39%)
Query: 532 GISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLE 591
GI Q + ++LPG KG+W++ ++ + Y L+++ T +L
Sbjct: 391 GIGIQE-HACIDLPGIKGMWSL-------------KVGVDESPYENTLVLAFVGHTRILT 436
Query: 592 TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXXX 651
+ E TE + +T N+ ++IQV R++ + + P
Sbjct: 437 LSGEEVEETEIPGFASDLQTFLCSNV-DYDQLIQVTSDSVRLVSSATKALVAEWRPTGDR 495
Query: 652 XXXXXXXXT--VLSVSIADPYVLLGMSDGSIR 681
T +L S D + ++ + DGS+R
Sbjct: 496 TIGVVSCNTTQILVASACDIFYIV-IEDGSLR 526
Score = 56 (24.8 bits), Expect = 9.0e-08, Sum P(3) = 9.0e-08
Identities = 20/62 (32%), Positives = 31/62 (50%)
Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
DS+ LA ++A I L D I L I ++ E P + + ++FA L ++D GR
Sbjct: 688 DSLALANKNAVI--LGTIDEIQKLHIRTVPLGEGPRRIAYQESSQTFAVSTL-RIDVHGR 744
Query: 191 CG 192
G
Sbjct: 745 GG 746
Score = 50 (22.7 bits), Expect = 1.7e-17, Sum P(6) = 1.7e-17
Identities = 12/28 (42%), Positives = 16/28 (57%)
Query: 1034 SLLIDQEVGHQIDNHNLSSVDLHRTYTV 1061
S + EVG +ID HNL +D T+ V
Sbjct: 776 STAANAEVGQEIDVHNLLVID-QNTFEV 802
Score = 48 (22.0 bits), Expect = 1.3e-06, Sum P(4) = 1.3e-06
Identities = 50/184 (27%), Positives = 70/184 (38%)
Query: 301 AMNLPHDAYK-LLAVPSPIGG--VLVVGANTIHYHSQSASCALALNNYAVSLDSS-QELP 356
++ L A K L+A P G + VV NT SA C + Y V D S +E
Sbjct: 474 SVRLVSSATKALVAEWRPTGDRTIGVVSCNTTQILVASA-CDIF---YIVIEDGSLREQS 529
Query: 357 RSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD-GRVVQRL-DLSKTNPSVLTSD 414
R + + E+ T L + K DLV + + D V+ L DL L+ +
Sbjct: 530 RRTLAYEVACLDITPLDE-----TQKKSDLVAVGLWTDISAVILSLPDLETIYTEKLSGE 584
Query: 415 IT------TIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
I T + +L LGD + F T L+ K G P+T R
Sbjct: 585 IIPRSILMTTFEGIHYLLCALGDGSMYYFIMDQTTGQLTDKKKVTLGT----QPTTLRTF 640
Query: 469 RSSS 472
RS S
Sbjct: 641 RSLS 644
Score = 42 (19.8 bits), Expect = 1.0e-05, Sum P(5) = 1.0e-05
Identities = 12/38 (31%), Positives = 17/38 (44%)
Query: 539 YELVELPGCKGIWT-VYHKSSRGHNADSSRMAAYDDEY 575
Y++ L GC V HK S G + S + D E+
Sbjct: 165 YDVEFLHGCLNPTVIVIHKDSDGRHVKSHEINLRDKEF 202
Score = 41 (19.5 bits), Expect = 1.7e-17, Sum P(6) = 1.7e-17
Identities = 10/25 (40%), Positives = 15/25 (60%)
Query: 180 GPLVKVDPQGRCGGVLVY-GLQMII 203
G + +DP+ R G+ +Y GL II
Sbjct: 119 GVIAAIDPKARVIGMCLYQGLFTII 143
Score = 38 (18.4 bits), Expect = 1.7e-17, Sum P(6) = 1.7e-17
Identities = 7/20 (35%), Positives = 12/20 (60%)
Query: 670 YVLLGMSDGSIRLLVGDPST 689
Y+L + DGS+ + D +T
Sbjct: 600 YLLCALGDGSMYYFIMDQTT 619
>RGD|621889
symbol:Ddb1 "damage-specific DNA binding protein 1, 127kDa"
species:10116 "Rattus norvegicus" [GO:0000075 "cell cycle
checkpoint" evidence=IEA;ISO] [GO:0003676 "nucleic acid binding"
evidence=IEA] [GO:0003684 "damaged DNA binding" evidence=IMP]
[GO:0005575 "cellular_component" evidence=ND] [GO:0005634 "nucleus"
evidence=IEA;ISO;ISS] [GO:0005737 "cytoplasm" evidence=IEA;ISO;ISS]
[GO:0006281 "DNA repair" evidence=TAS] [GO:0016055 "Wnt receptor
signaling pathway" evidence=IEA;ISO] [GO:0016567 "protein
ubiquitination" evidence=IEA] [GO:0031464 "Cul4A-RING ubiquitin
ligase complex" evidence=IEA;ISO;ISS] [GO:0031465 "Cul4B-RING
ubiquitin ligase complex" evidence=IEA;ISO;ISS] [GO:0042787
"protein ubiquitination involved in ubiquitin-dependent protein
catabolic process" evidence=IEA;ISO] [GO:0043161 "proteasomal
ubiquitin-dependent protein catabolic process"
evidence=IEA;ISO;ISS] [GO:0080008 "Cul4-RING ubiquitin ligase
complex" evidence=ISO;ISS] InterPro:IPR004871 InterPro:IPR015943
Pfam:PF03178 UniPathway:UPA00143 RGD:621889 GO:GO:0005634
GO:GO:0005737 GO:GO:0043161 GO:GO:0016055 Gene3D:2.130.10.10
GO:GO:0003684 GO:GO:0006281 GO:GO:0042787 GO:GO:0000075
GO:GO:0031464 GO:GO:0031465 eggNOG:NOG247734 HOGENOM:HOG000007241
HOVERGEN:HBG005460 HSSP:Q16531 EMBL:AJ277077 IPI:IPI00324451
UniGene:Rn.8402 IntAct:Q9ESW0 MINT:MINT-4784948 STRING:Q9ESW0
PhosphoSite:Q9ESW0 PRIDE:Q9ESW0 UCSC:RGD:621889 InParanoid:Q9ESW0
ArrayExpress:Q9ESW0 Genevestigator:Q9ESW0 Uniprot:Q9ESW0
Length = 1140
Score = 198 (74.8 bits), Expect = 2.3e-17, Sum P(5) = 2.3e-17
Identities = 75/297 (25%), Positives = 130/297 (43%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F L T V KE+KGA+ ++
Sbjct: 823 KDPNTYFIVGTAMVYPEEAEPKQGRIVVFQYSGG-----KLQT-VAEKEVKGAVYSMVEF 876
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 877 NGKLLASINSTVRLYEWTTEKELRTECNH--YNNIMALY---LKTKGDFILVGDLMRSVL 931
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 932 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 988
Query: 1280 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1339
Q L FH+G V F ++ + T + P + ++L GT++G IG +
Sbjct: 989 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGET-STP-----TQGSVLLGTVNGMIGLVTS 1042
Query: 1340 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
L E + L +Q +L + V + +R FH+ K P +D +L+ +
Sbjct: 1043 LSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTE-PAT-GFIDGDLIESF 1097
Score = 106 (42.4 bits), Expect = 2.3e-17, Sum P(5) = 2.3e-17
Identities = 53/208 (25%), Positives = 93/208 (44%)
Query: 237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ-H 295
N+R L+ HV D F++G P + +++ GR H + +S K+ +
Sbjct: 156 NIR-LEELHVIDVKFLYGCQAPTICFVYQDP---QGR----H----VKTYEVSLREKEFN 203
Query: 296 PLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALA--LNNYAVSLDSSQ 353
W N+ +A ++AVP P GG +++G +I YH+ A+A + + + ++
Sbjct: 204 KGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNR 263
Query: 354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTS 413
P S + D ++ LL K + + D RV L L +T+ +
Sbjct: 264 VDPNGSRYLLGDMEGRLFM-----LLLEKEEQMDGTVTLKDLRV--EL-LGETS---IAE 312
Query: 414 DITTIGNSLFFLGSRLGDSLLVQFTCGS 441
+T + N + F+GSRLGDS V+ S
Sbjct: 313 CLTYLDNGVVFVGSRLGDSQPVKLNVDS 340
Score = 65 (27.9 bits), Expect = 2.3e-17, Sum P(5) = 2.3e-17
Identities = 24/93 (25%), Positives = 43/93 (46%)
Query: 542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
++LPG KG+W + +D +R DD L++S +T VL E TE
Sbjct: 402 IDLPGIKGLWPL--------RSDPNRET--DDT----LVLSFVGQTRVLMLNGEEVEETE 447
Query: 602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
+ + +T GN+ +++IQ+ R++
Sbjct: 448 LMGFVDDQQTFFCGNV-AHQQLIQITSASVRLV 479
Score = 46 (21.3 bits), Expect = 1.8e-11, Sum P(5) = 1.8e-11
Identities = 24/101 (23%), Positives = 41/101 (40%)
Query: 226 FSARIESSHVINLRDLDMKHVKDFIFVH----G-YIEPVMVILHERELTWAGRVSWKHHT 280
F+ R+E HVI+++ L FV+ G +++ V L E+E + WK
Sbjct: 155 FNIRLEELHVIDVKFLYGCQAPTICFVYQDPQGRHVKTYEVSLREKEFN---KGPWKQEN 211
Query: 281 CMISA---LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPI 318
A +++ +I H+ K LA+ PI
Sbjct: 212 VEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPI 252
Score = 43 (20.2 bits), Expect = 3.6e-10, Sum P(2) = 3.6e-10
Identities = 8/19 (42%), Positives = 12/19 (63%)
Query: 1061 VEEYEVRILEPDRAGGPWQ 1079
V+ YEV + E + GPW+
Sbjct: 190 VKTYEVSLREKEFNKGPWK 208
Score = 43 (20.2 bits), Expect = 2.5e-09, Sum P(5) = 2.5e-09
Identities = 19/58 (32%), Positives = 25/58 (43%)
Query: 415 ITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
+TT +S + L + LGD L F T +LS K G P+ R RS S
Sbjct: 593 MTTFESSHYLLCA-LGDGALFYFGLNIETGLLSDRKKVTLGT----QPTVLRTFRSLS 645
Score = 41 (19.5 bits), Expect = 2.3e-17, Sum P(5) = 2.3e-17
Identities = 8/20 (40%), Positives = 13/20 (65%)
Query: 1042 GHQIDNHNLSSVDLHRTYTV 1061
G +++ HNL +D H T+ V
Sbjct: 783 GEEVEVHNLLIIDQH-TFEV 801
Score = 40 (19.1 bits), Expect = 2.3e-17, Sum P(5) = 2.3e-17
Identities = 11/35 (31%), Positives = 19/35 (54%)
Query: 679 SIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLY 713
S R+ V D S T +++ A+ ++ VSS L+
Sbjct: 737 STRIEVQDTSGGTTALRPSASTQALSSSVSSSKLF 771
Score = 37 (18.1 bits), Expect = 2.7e-17, Sum P(4) = 2.7e-17
Identities = 11/40 (27%), Positives = 19/40 (47%)
Query: 984 PSGSTYDNYWPVQ--KVIPLKATPHQITYFAEKNLYPLIV 1021
PS ST V K+ A PH+ ++ E ++ L++
Sbjct: 754 PSASTQALSSSVSSSKLFSSSAAPHETSFGEEVEVHNLLI 793
>TAIR|locus:2100616
symbol:SAP130a "spliceosome-associated protein 130 a"
species:3702 "Arabidopsis thaliana" [GO:0003676 "nucleic acid
binding" evidence=IEA;ISS] [GO:0005634 "nucleus"
evidence=ISM;IEA;ISS] [GO:0009507 "chloroplast" evidence=IDA]
[GO:0005829 "cytosol" evidence=RCA] [GO:0009555 "pollen
development" evidence=IMP] [GO:0009846 "pollen germination"
evidence=IMP] [GO:0048481 "ovule development" evidence=IMP]
InterPro:IPR001680 InterPro:IPR004871 InterPro:IPR015943
Pfam:PF03178 SMART:SM00320 GO:GO:0009506 GO:GO:0005634
GO:GO:0009507 EMBL:CP002686 Gene3D:2.130.10.10 GO:GO:0009555
GO:GO:0003676 EMBL:AL132954 GO:GO:0048481 GO:GO:0009846
eggNOG:NOG247734 KO:K12830 HOGENOM:HOG000216677 OMA:FDTIPVA
IPI:IPI00517026 PIR:T47659 RefSeq:NP_567015.1 RefSeq:NP_567016.1
UniGene:At.28226 UniGene:At.72270 ProteinModelPortal:Q9LD60
SMR:Q9LD60 STRING:Q9LD60 PaxDb:Q9LD60 PRIDE:Q9LD60
EnsemblPlants:AT3G55200.1 EnsemblPlants:AT3G55220.1 GeneID:824686
GeneID:824688 KEGG:ath:AT3G55200 KEGG:ath:AT3G55220
KEGG:dosa:Os02t0137400-01 TAIR:At3g55200 TAIR:At3g55220
InParanoid:Q9LD60 PhylomeDB:Q9LD60 ProtClustDB:CLSN2689171
ArrayExpress:Q9LD60 Genevestigator:Q9LD60 Uniprot:Q9LD60
Length = 1214
Score = 176 (67.0 bits), Expect = 3.1e-13, Sum P(5) = 3.1e-13
Identities = 84/372 (22%), Positives = 163/372 (43%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1125
+R+L+P A T + +Q +E A +V V N KE TLLA+GT V+G
Sbjct: 863 IRVLDPKTA----TTTCLLELQDNEAAYSVCTV---NFHDKEYGTLLAVGT--VKGMQFW 913
Query: 1126 ARGRVL--LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1183
+ ++ R ++ ++L ++ +++G AL QG LL GP + L+
Sbjct: 914 PKKNLVAGFIHIYRFVEDGKSLEL-LHKTQVEGVPLALCQFQGRLLAGIGPVLRLYDLGK 972
Query: 1184 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1243
L P ++S+ ++ I +GDI +S ++ ++ QL + A D
Sbjct: 973 KRLLRKCENKLFPNTIISIQTYRDRIYVGDIQESFHYCKYRRDENQLYIFADDCVPRWLT 1032
Query: 1244 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA------HVTK 1297
A+ +D T++ +D+ N+ +SE + + ++ G V +
Sbjct: 1033 ASHH-VDFDTMA--GADKFGNVYFVRLPQDLSEEIEEDPTGGKIKWEQGKLNGAPNKVDE 1089
Query: 1298 FLRLQM--LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQ 1352
++ + + T + PG ++ +++GT+ GSIG + D++ F L+
Sbjct: 1090 IVQFHVGDVVTCLQKASMIPGGSES----IMYGTVMGSIGALHAFTSRDDVDF--FSHLE 1143
Query: 1353 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
+ P + G + ++R A+ P D ++D +L + LP++ Q +IA +
Sbjct: 1144 MHMRQEYPPLCGRDHMAYRS------AYFPVKD-VIDGDLCEQFPTLPMDLQRKIADELD 1196
Query: 1413 TTRSQILSNLND 1424
T ++IL L D
Sbjct: 1197 RTPAEILKKLED 1208
Score = 73 (30.8 bits), Expect = 3.1e-13, Sum P(5) = 3.1e-13
Identities = 19/60 (31%), Positives = 31/60 (51%)
Query: 573 DEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
DE+ AY+++S T+VL + + EV +S F+ A +L G ++QV G R
Sbjct: 466 DEFDAYIVVSFTNATLVLSIGEQVEEVNDSG--FLDTTPSLAVSLIGDDSLMQVHPNGIR 523
Score = 62 (26.9 bits), Expect = 3.1e-13, Sum P(5) = 3.1e-13
Identities = 34/142 (23%), Positives = 56/142 (39%)
Query: 299 WSAMNLPHDAYKLLAVPSPI---GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
WS + + A L+ VP GVLV N + Y +Q A+ + +L
Sbjct: 223 WSNP-VDNGANMLVTVPGGADGPSGVLVCAENFVIYMNQGHPDVRAV------IPRRTDL 275
Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI 415
P + + AA L+ T+ GD+ +T+ ++G V L + + + S I
Sbjct: 276 PAERGVLVVSAAVHKQKTMFFFLIQTEYGDVFKVTLDHNGDHVSELKVKYFDTIPVASSI 335
Query: 416 TTIGNSLFFLGSRLGDSLLVQF 437
+ F S G+ L QF
Sbjct: 336 CVLKLGFLFSASEFGNHGLYQF 357
Score = 52 (23.4 bits), Expect = 3.1e-13, Sum P(5) = 3.1e-13
Identities = 21/90 (23%), Positives = 37/90 (41%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
++ + + G + SLA GA ++D I++ + +I +LE++ +
Sbjct: 49 IQTIHSVEVFGAIRSLAQFRLTGA----QKDYIVVGSDSGRIVILEYNKEKNVFDKVHQE 104
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGR 190
F K G G V VDP+GR
Sbjct: 105 TFG-------KSGCRRIVPGQYVAVDPKGR 127
Score = 48 (22.0 bits), Expect = 3.1e-13, Sum P(5) = 3.1e-13
Identities = 11/36 (30%), Positives = 21/36 (58%)
Query: 670 YVLLGMSDGSIRLLVGDPSTCT--VSVQTPAAIESS 703
++ +G D ++R+L DP C +SVQ+ ++ S
Sbjct: 601 FLAVGSYDNTVRILSLDPDDCLQILSVQSVSSAPES 636
>TAIR|locus:2100646
symbol:SAP130b "spliceosome-associated protein 130 b"
species:3702 "Arabidopsis thaliana" [GO:0003676 "nucleic acid
binding" evidence=IEA;ISS] [GO:0005634 "nucleus"
evidence=ISM;IEA;ISS] [GO:0005829 "cytosol" evidence=RCA]
[GO:0009506 "plasmodesma" evidence=IDA] [GO:0009555 "pollen
development" evidence=IMP] [GO:0009846 "pollen germination"
evidence=IMP] [GO:0048481 "ovule development" evidence=IMP]
InterPro:IPR001680 InterPro:IPR004871 InterPro:IPR015943
Pfam:PF03178 SMART:SM00320 GO:GO:0009506 GO:GO:0005634
GO:GO:0009507 EMBL:CP002686 Gene3D:2.130.10.10 GO:GO:0009555
GO:GO:0003676 EMBL:AL132954 GO:GO:0048481 GO:GO:0009846
eggNOG:NOG247734 KO:K12830 HOGENOM:HOG000216677 OMA:FDTIPVA
IPI:IPI00517026 PIR:T47659 RefSeq:NP_567015.1 RefSeq:NP_567016.1
UniGene:At.28226 UniGene:At.72270 ProteinModelPortal:Q9LD60
SMR:Q9LD60 STRING:Q9LD60 PaxDb:Q9LD60 PRIDE:Q9LD60
EnsemblPlants:AT3G55200.1 EnsemblPlants:AT3G55220.1 GeneID:824686
GeneID:824688 KEGG:ath:AT3G55200 KEGG:ath:AT3G55220
KEGG:dosa:Os02t0137400-01 TAIR:At3g55200 TAIR:At3g55220
InParanoid:Q9LD60 PhylomeDB:Q9LD60 ProtClustDB:CLSN2689171
ArrayExpress:Q9LD60 Genevestigator:Q9LD60 Uniprot:Q9LD60
Length = 1214
Score = 176 (67.0 bits), Expect = 3.1e-13, Sum P(5) = 3.1e-13
Identities = 84/372 (22%), Positives = 163/372 (43%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1125
+R+L+P A T + +Q +E A +V V N KE TLLA+GT V+G
Sbjct: 863 IRVLDPKTA----TTTCLLELQDNEAAYSVCTV---NFHDKEYGTLLAVGT--VKGMQFW 913
Query: 1126 ARGRVL--LFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG 1183
+ ++ R ++ ++L ++ +++G AL QG LL GP + L+
Sbjct: 914 PKKNLVAGFIHIYRFVEDGKSLEL-LHKTQVEGVPLALCQFQGRLLAGIGPVLRLYDLGK 972
Query: 1184 TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCF 1243
L P ++S+ ++ I +GDI +S ++ ++ QL + A D
Sbjct: 973 KRLLRKCENKLFPNTIISIQTYRDRIYVGDIQESFHYCKYRRDENQLYIFADDCVPRWLT 1032
Query: 1244 ATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGA------HVTK 1297
A+ +D T++ +D+ N+ +SE + + ++ G V +
Sbjct: 1033 ASHH-VDFDTMA--GADKFGNVYFVRLPQDLSEEIEEDPTGGKIKWEQGKLNGAPNKVDE 1089
Query: 1298 FLRLQM--LATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQ 1352
++ + + T + PG ++ +++GT+ GSIG + D++ F L+
Sbjct: 1090 IVQFHVGDVVTCLQKASMIPGGSES----IMYGTVMGSIGALHAFTSRDDVDF--FSHLE 1143
Query: 1353 KKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTG 1412
+ P + G + ++R A+ P D ++D +L + LP++ Q +IA +
Sbjct: 1144 MHMRQEYPPLCGRDHMAYRS------AYFPVKD-VIDGDLCEQFPTLPMDLQRKIADELD 1196
Query: 1413 TTRSQILSNLND 1424
T ++IL L D
Sbjct: 1197 RTPAEILKKLED 1208
Score = 73 (30.8 bits), Expect = 3.1e-13, Sum P(5) = 3.1e-13
Identities = 19/60 (31%), Positives = 31/60 (51%)
Query: 573 DEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
DE+ AY+++S T+VL + + EV +S F+ A +L G ++QV G R
Sbjct: 466 DEFDAYIVVSFTNATLVLSIGEQVEEVNDSG--FLDTTPSLAVSLIGDDSLMQVHPNGIR 523
Score = 62 (26.9 bits), Expect = 3.1e-13, Sum P(5) = 3.1e-13
Identities = 34/142 (23%), Positives = 56/142 (39%)
Query: 299 WSAMNLPHDAYKLLAVPSPI---GGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQEL 355
WS + + A L+ VP GVLV N + Y +Q A+ + +L
Sbjct: 223 WSNP-VDNGANMLVTVPGGADGPSGVLVCAENFVIYMNQGHPDVRAV------IPRRTDL 275
Query: 356 PRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDI 415
P + + AA L+ T+ GD+ +T+ ++G V L + + + S I
Sbjct: 276 PAERGVLVVSAAVHKQKTMFFFLIQTEYGDVFKVTLDHNGDHVSELKVKYFDTIPVASSI 335
Query: 416 TTIGNSLFFLGSRLGDSLLVQF 437
+ F S G+ L QF
Sbjct: 336 CVLKLGFLFSASEFGNHGLYQF 357
Score = 52 (23.4 bits), Expect = 3.1e-13, Sum P(5) = 3.1e-13
Identities = 21/90 (23%), Positives = 37/90 (41%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
++ + + G + SLA GA ++D I++ + +I +LE++ +
Sbjct: 49 IQTIHSVEVFGAIRSLAQFRLTGA----QKDYIVVGSDSGRIVILEYNKEKNVFDKVHQE 104
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGR 190
F K G G V VDP+GR
Sbjct: 105 TFG-------KSGCRRIVPGQYVAVDPKGR 127
Score = 48 (22.0 bits), Expect = 3.1e-13, Sum P(5) = 3.1e-13
Identities = 11/36 (30%), Positives = 21/36 (58%)
Query: 670 YVLLGMSDGSIRLLVGDPSTCT--VSVQTPAAIESS 703
++ +G D ++R+L DP C +SVQ+ ++ S
Sbjct: 601 FLAVGSYDNTVRILSLDPDDCLQILSVQSVSSAPES 636
>WB|WBGene00010890
symbol:ddb-1 species:6239 "Caenorhabditis elegans"
[GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005634
"nucleus" evidence=IEA] [GO:0040010 "positive regulation of growth
rate" evidence=IMP] [GO:0002119 "nematode larval development"
evidence=IMP] [GO:0040011 "locomotion" evidence=IMP] [GO:0000003
"reproduction" evidence=IMP] [GO:0009792 "embryo development ending
in birth or egg hatching" evidence=IMP] [GO:0006898
"receptor-mediated endocytosis" evidence=IMP] [GO:0040035
"hermaphrodite genitalia development" evidence=IMP] [GO:0030163
"protein catabolic process" evidence=IMP] [GO:0007276 "gamete
generation" evidence=IMP] [GO:0005515 "protein binding"
evidence=IPI] InterPro:IPR004871 Pfam:PF03178 UniPathway:UPA00143
GO:GO:0005634 GO:GO:0009792 GO:GO:0006898 GO:GO:0005737
GO:GO:0040010 GO:GO:0002119 GO:GO:0003677 GO:GO:0006281
GO:GO:0040011 GO:GO:0016567 GO:GO:0007049 GO:GO:0040035
InterPro:IPR011047 SUPFAM:SSF50998 GO:GO:0030163 GO:GO:0007276
eggNOG:NOG247734 HOGENOM:HOG000007241 KO:K10610 OMA:CALGDGS
GeneTree:ENSGT00530000063396 EMBL:Z68507 PIR:A88855 PIR:T23798
RefSeq:NP_502299.1 HSSP:Q16531 ProteinModelPortal:Q21554
DIP:DIP-25884N IntAct:Q21554 MINT:MINT-1055778 STRING:Q21554
PaxDb:Q21554 EnsemblMetazoa:M18.5.1 EnsemblMetazoa:M18.5.2
GeneID:178156 KEGG:cel:CELE_M18.5 UCSC:M18.5 CTD:178156
WormBase:M18.5 InParanoid:Q21554 NextBio:899950 Uniprot:Q21554
Length = 1134
Score = 152 (58.6 bits), Expect = 8.5e-13, Sum P(5) = 8.5e-13
Identities = 60/292 (20%), Positives = 125/292 (42%)
Query: 1105 TKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1163
T ++ T +GT + ++ + GR+++F D ++ + V+ ++G+ A+
Sbjct: 814 TNDSSTYYVVGTGLIYPDETETKIGRIVVFEVD---DVERSKLRRVHELVVRGSPLAIRI 870
Query: 1164 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
L G L+ A I L +WT + + + + L ++ + + D+ +S+ LS+
Sbjct: 871 LNGKLVAAINSSIRLFEWTTDKELRLECSSFNHVIALDLKVMNEEVAVADVMRSVSLLSY 930
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
+ +AKD+ S EF+ S L +++ P + G+ +
Sbjct: 931 RMLEGNFEEVAKDWNSQWMVTCEFITAESILGGEAHLNLFTVEVDKTRPITDD---GRYV 987
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA--LLFGTLDGSIGCIAPLD 1341
L + + K + L + D +++ ++FGT G+IG I +D
Sbjct: 988 LEPTGYWYLGELPKVMTRSTLVIQPE--------DSIIQYSQPIMFGTNQGTIGMIVQID 1039
Query: 1342 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1393
+ + L +++K + DSV + + S+R F +A P P VD +L+
Sbjct: 1040 DKWKKFLIAIEKAIADSVKNCMHIEHSSYRTFVFQKRAE-P-PSGFVDGDLV 1089
Score = 107 (42.7 bits), Expect = 8.5e-13, Sum P(5) = 8.5e-13
Identities = 39/134 (29%), Positives = 65/134 (48%)
Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--L 364
D+ L+ VP IGGV+V+G+N++ Y + Y SL L ++F+ +
Sbjct: 210 DSSVLIPVPHAIGGVIVLGSNSVLYKPNDNLGEVV--PYTCSL-----LENTTFTCHGIV 262
Query: 365 DAAHATWLQNDV--ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSL 422
DA+ +L +D LL L+ +T G V+ + + + + I I N +
Sbjct: 263 DASGERFLLSDTDGRLLML----LLNVTESQSGYTVKEMRIDYLGETSIADSINYIDNGV 318
Query: 423 FFLGSRLGDSLLVQ 436
F+GSRLGDS L++
Sbjct: 319 VFVGSRLGDSQLIR 332
Score = 59 (25.8 bits), Expect = 8.5e-13, Sum P(5) = 8.5e-13
Identities = 37/157 (23%), Positives = 68/157 (43%)
Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA-----SATGISKQSNYELVELPGCK 548
TE ++S + ++ NIGP++D + + +D + TG K + ++ G
Sbjct: 335 TEPNGGSYS-VILETYSNIGPIRDM---VMVESDGQPQLVTCTGADKDGSLRVIR-NGI- 388
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLE-TADLLTEVTESVDYFV 607
GI + G D Y+I+SL T VL+ T + L +V + ++
Sbjct: 389 GIDELASVDLAG--VVGIFPIRLDSNADNYVIVSLSDETHVLQITGEELEDV-KLLEINT 445
Query: 608 QGRTIAAGNLFGRRR---VIQVFERGARILDGSYMTQ 641
TI A LFG ++Q E+ R++ S +++
Sbjct: 446 DLPTIFASTLFGPNDSGIILQATEKQIRLMSSSGLSK 482
Score = 48 (22.0 bits), Expect = 8.5e-13, Sum P(5) = 8.5e-13
Identities = 14/60 (23%), Positives = 31/60 (51%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + +S L+ VC ++G V ++A++ +R S+I+ E +++L + D
Sbjct: 38 RIDVQLVSPEGLKNVCEIPIYGQVLTIALVKC----KRDKRHSLIVVTEKWHMAILAYRD 93
Score = 47 (21.6 bits), Expect = 6.8e-12, Sum P(4) = 6.8e-12
Identities = 17/66 (25%), Positives = 30/66 (45%)
Query: 445 MLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFA 504
++S G FG ++ D +++ + + S YG SN TES + FA
Sbjct: 694 VISDGNSMVFGTVD-DIQKIHVRSIPMGESVLRIAYQKSTSTYGVCSNRTESKAERV-FA 751
Query: 505 VRDSLV 510
+++LV
Sbjct: 752 SKNALV 757
Score = 43 (20.2 bits), Expect = 2.4e-05, Sum P(2) = 2.4e-05
Identities = 19/86 (22%), Positives = 36/86 (41%)
Query: 972 VTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQ 1031
V+ +G+ +C++P Y + V + H + EK + I++ K + +
Sbjct: 44 VSPEGLKNVCEIP---IYGQVLTIALVKCKRDKRHSLIVVTEK-WHMAILAYRDGKVVTR 99
Query: 1032 VLSLLIDQEVGHQIDNHNLSSVDLHR 1057
+ D G DN L S+ +HR
Sbjct: 100 AAGCIADP-TGRATDN--LFSLTIHR 122
Score = 41 (19.5 bits), Expect = 3.8e-05, Sum P(2) = 3.8e-05
Identities = 12/37 (32%), Positives = 19/37 (51%)
Query: 11 WPTGIANCGSG-FITHSRADYVPQIPLIQTEELDSEL 46
W T ++ C SG F S YV LI +E ++++
Sbjct: 802 WETALS-CISGQFTNDSSTYYVVGTGLIYPDETETKI 837
Score = 40 (19.1 bits), Expect = 8.5e-13, Sum P(5) = 8.5e-13
Identities = 10/41 (24%), Positives = 19/41 (46%)
Query: 949 LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
+ DG+ + F + ++ H + +L+I S STY
Sbjct: 695 ISDGNSMVFGTVDDIQKIHVRSIPMGESVLRIAYQKSTSTY 735
>UNIPROTKB|Q21554
symbol:ddb-1 "DNA damage-binding protein 1" species:6239
"Caenorhabditis elegans" [GO:0005515 "protein binding"
evidence=IPI] [GO:0005737 "cytoplasm" evidence=ISS] [GO:0005634
"nucleus" evidence=ISS] InterPro:IPR004871 Pfam:PF03178
UniPathway:UPA00143 GO:GO:0005634 GO:GO:0009792 GO:GO:0006898
GO:GO:0005737 GO:GO:0040010 GO:GO:0002119 GO:GO:0003677
GO:GO:0006281 GO:GO:0040011 GO:GO:0016567 GO:GO:0007049
GO:GO:0040035 InterPro:IPR011047 SUPFAM:SSF50998 GO:GO:0030163
GO:GO:0007276 eggNOG:NOG247734 HOGENOM:HOG000007241 KO:K10610
OMA:CALGDGS GeneTree:ENSGT00530000063396 EMBL:Z68507 PIR:A88855
PIR:T23798 RefSeq:NP_502299.1 HSSP:Q16531 ProteinModelPortal:Q21554
DIP:DIP-25884N IntAct:Q21554 MINT:MINT-1055778 STRING:Q21554
PaxDb:Q21554 EnsemblMetazoa:M18.5.1 EnsemblMetazoa:M18.5.2
GeneID:178156 KEGG:cel:CELE_M18.5 UCSC:M18.5 CTD:178156
WormBase:M18.5 InParanoid:Q21554 NextBio:899950 Uniprot:Q21554
Length = 1134
Score = 152 (58.6 bits), Expect = 8.5e-13, Sum P(5) = 8.5e-13
Identities = 60/292 (20%), Positives = 125/292 (42%)
Query: 1105 TKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALAS 1163
T ++ T +GT + ++ + GR+++F D ++ + V+ ++G+ A+
Sbjct: 814 TNDSSTYYVVGTGLIYPDETETKIGRIVVFEVD---DVERSKLRRVHELVVRGSPLAIRI 870
Query: 1164 LQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
L G L+ A I L +WT + + + + L ++ + + D+ +S+ LS+
Sbjct: 871 LNGKLVAAINSSIRLFEWTTDKELRLECSSFNHVIALDLKVMNEEVAVADVMRSVSLLSY 930
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
+ +AKD+ S EF+ S L +++ P + G+ +
Sbjct: 931 RMLEGNFEEVAKDWNSQWMVTCEFITAESILGGEAHLNLFTVEVDKTRPITDD---GRYV 987
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFA--LLFGTLDGSIGCIAPLD 1341
L + + K + L + D +++ ++FGT G+IG I +D
Sbjct: 988 LEPTGYWYLGELPKVMTRSTLVIQPE--------DSIIQYSQPIMFGTNQGTIGMIVQID 1039
Query: 1342 ELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELL 1393
+ + L +++K + DSV + + S+R F +A P P VD +L+
Sbjct: 1040 DKWKKFLIAIEKAIADSVKNCMHIEHSSYRTFVFQKRAE-P-PSGFVDGDLV 1089
Score = 107 (42.7 bits), Expect = 8.5e-13, Sum P(5) = 8.5e-13
Identities = 39/134 (29%), Positives = 65/134 (48%)
Query: 307 DAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVE--L 364
D+ L+ VP IGGV+V+G+N++ Y + Y SL L ++F+ +
Sbjct: 210 DSSVLIPVPHAIGGVIVLGSNSVLYKPNDNLGEVV--PYTCSL-----LENTTFTCHGIV 262
Query: 365 DAAHATWLQNDV--ALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSL 422
DA+ +L +D LL L+ +T G V+ + + + + I I N +
Sbjct: 263 DASGERFLLSDTDGRLLML----LLNVTESQSGYTVKEMRIDYLGETSIADSINYIDNGV 318
Query: 423 FFLGSRLGDSLLVQ 436
F+GSRLGDS L++
Sbjct: 319 VFVGSRLGDSQLIR 332
Score = 59 (25.8 bits), Expect = 8.5e-13, Sum P(5) = 8.5e-13
Identities = 37/157 (23%), Positives = 68/157 (43%)
Query: 494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA-----SATGISKQSNYELVELPGCK 548
TE ++S + ++ NIGP++D + + +D + TG K + ++ G
Sbjct: 335 TEPNGGSYS-VILETYSNIGPIRDM---VMVESDGQPQLVTCTGADKDGSLRVIR-NGI- 388
Query: 549 GIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLE-TADLLTEVTESVDYFV 607
GI + G D Y+I+SL T VL+ T + L +V + ++
Sbjct: 389 GIDELASVDLAG--VVGIFPIRLDSNADNYVIVSLSDETHVLQITGEELEDV-KLLEINT 445
Query: 608 QGRTIAAGNLFGRRR---VIQVFERGARILDGSYMTQ 641
TI A LFG ++Q E+ R++ S +++
Sbjct: 446 DLPTIFASTLFGPNDSGIILQATEKQIRLMSSSGLSK 482
Score = 48 (22.0 bits), Expect = 8.5e-13, Sum P(5) = 8.5e-13
Identities = 14/60 (23%), Positives = 31/60 (51%)
Query: 90 RVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD 149
R+ + +S L+ VC ++G V ++A++ +R S+I+ E +++L + D
Sbjct: 38 RIDVQLVSPEGLKNVCEIPIYGQVLTIALVKC----KRDKRHSLIVVTEKWHMAILAYRD 93
Score = 47 (21.6 bits), Expect = 6.8e-12, Sum P(4) = 6.8e-12
Identities = 17/66 (25%), Positives = 30/66 (45%)
Query: 445 MLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFA 504
++S G FG ++ D +++ + + S YG SN TES + FA
Sbjct: 694 VISDGNSMVFGTVD-DIQKIHVRSIPMGESVLRIAYQKSTSTYGVCSNRTESKAERV-FA 751
Query: 505 VRDSLV 510
+++LV
Sbjct: 752 SKNALV 757
Score = 43 (20.2 bits), Expect = 2.4e-05, Sum P(2) = 2.4e-05
Identities = 19/86 (22%), Positives = 36/86 (41%)
Query: 972 VTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQ 1031
V+ +G+ +C++P Y + V + H + EK + I++ K + +
Sbjct: 44 VSPEGLKNVCEIP---IYGQVLTIALVKCKRDKRHSLIVVTEK-WHMAILAYRDGKVVTR 99
Query: 1032 VLSLLIDQEVGHQIDNHNLSSVDLHR 1057
+ D G DN L S+ +HR
Sbjct: 100 AAGCIADP-TGRATDN--LFSLTIHR 122
Score = 41 (19.5 bits), Expect = 3.8e-05, Sum P(2) = 3.8e-05
Identities = 12/37 (32%), Positives = 19/37 (51%)
Query: 11 WPTGIANCGSG-FITHSRADYVPQIPLIQTEELDSEL 46
W T ++ C SG F S YV LI +E ++++
Sbjct: 802 WETALS-CISGQFTNDSSTYYVVGTGLIYPDETETKI 837
Score = 40 (19.1 bits), Expect = 8.5e-13, Sum P(5) = 8.5e-13
Identities = 10/41 (24%), Positives = 19/41 (46%)
Query: 949 LCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY 989
+ DG+ + F + ++ H + +L+I S STY
Sbjct: 695 ISDGNSMVFGTVDDIQKIHVRSIPMGESVLRIAYQKSTSTY 735
>UNIPROTKB|B4DG00
symbol:DDB1 "cDNA FLJ52436, highly similar to DNA
damage-binding protein 1" species:9606 "Homo sapiens" [GO:0003676
"nucleic acid binding" evidence=IEA] [GO:0005634 "nucleus"
evidence=IEA] InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178
GO:GO:0005634 Gene3D:2.130.10.10 GO:GO:0003676 EMBL:AP003108
UniGene:Hs.290758 HGNC:HGNC:2717 ChiTaRS:DDB1 EMBL:AP003037
EMBL:AK294341 IPI:IPI00909177 SMR:B4DG00 STRING:B4DG00
Ensembl:ENST00000450997 UCSC:uc010rle.1 HOGENOM:HOG000069916
HOVERGEN:HBG102355 Uniprot:B4DG00
Length = 451
Score = 210 (79.0 bits), Expect = 1.3e-12, Sum P(2) = 1.3e-12
Identities = 76/297 (25%), Positives = 133/297 (44%)
Query: 1106 KENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASL 1164
K+ T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++
Sbjct: 134 KDPNTYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEF 187
Query: 1165 QGHLLIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
G LL + + L++WT TE N + + LY L +FIL+GD+ +S+
Sbjct: 188 NGKLLASINSTVRLYEWTTEKELRTECNH--YNNIMALY---LKTKGDFILVGDLMRSVL 242
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWK 1279
L++K +A+DF A E L D + L ++ N+ + + +
Sbjct: 243 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEE 299
Query: 1280 GQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAP 1339
Q L FH+G V F ++ + T + P + ++LFGT++G IG +
Sbjct: 300 RQHLQEVGLFHLGEFVNVFCHGSLVMQNLGET-STP-----TQGSVLFGTVNGMIGLVTS 353
Query: 1340 LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
L E + L +Q +L + V + +R FH+ K P +D +L+ +
Sbjct: 354 LSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTE-PAT-GFIDGDLIESF 408
Score = 41 (19.5 bits), Expect = 1.3e-12, Sum P(2) = 1.3e-12
Identities = 8/20 (40%), Positives = 13/20 (65%)
Query: 1042 GHQIDNHNLSSVDLHRTYTV 1061
G +++ HNL +D H T+ V
Sbjct: 94 GEEVEVHNLLIIDQH-TFEV 112
>UNIPROTKB|F1M680
symbol:Ddb1 "DNA damage-binding protein 1" species:10116
"Rattus norvegicus" [GO:0003676 "nucleic acid binding"
evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178 RGD:621889
GO:GO:0005634 Gene3D:2.130.10.10 GO:GO:0003676 IPI:IPI00950036
Ensembl:ENSRNOT00000063867 ArrayExpress:F1M680 Uniprot:F1M680
Length = 600
Score = 209 (78.6 bits), Expect = 1.2e-11, Sum P(2) = 1.2e-11
Identities = 100/414 (24%), Positives = 177/414 (42%)
Query: 997 KVIPLKATPHQITYFAEKNLYPLIVS-VPVLKPLNQVLSL---LIDQEVGHQIDNHNL-- 1050
+ +PL +P +I Y + ++ S + V +L Q + + + L
Sbjct: 172 RTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKLFS 231
Query: 1051 SSVDLHRTYTVEEYEVR-ILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENE 1109
SS H T EE EV +L D+ ++ +E AL++ L K+
Sbjct: 232 SSTAPHETSFGEEVEVHNLLIIDQH--TFEVLHAHQFLQNEYALSLVSCKL----GKDPN 285
Query: 1110 TLLAIGTAYVQGEDVAAR-GRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHL 1168
T +GTA V E+ + GR+++F + +D V E KE+KGA+ ++ G L
Sbjct: 286 TYFIVGTAMVYPEEAEPKQGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEFNGKL 339
Query: 1169 LIASGPKIILHKWTG-----TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSW 1223
L + + L++WT TE N + + LY L +FIL+GD+ +S+ L++
Sbjct: 340 LASINSTVRLYEWTTEKELRTECNH--YNNIMALY---LKTKGDFILVGDLMRSVLLLAY 394
Query: 1224 KEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKL 1283
K +A+DF A E L D + L ++ N+ + + + Q L
Sbjct: 395 KPMEGNFEEIARDFNPNWMSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEERQHL 451
Query: 1284 LSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDEL 1343
FH+G V F ++ + T + P + ++LFGT++G IG + L E
Sbjct: 452 QEVGLFHLGEFVNVFCHGSLVMQNLGET-STP-----TQGSVLFGTVNGMIGLVTSLSES 505
Query: 1344 TFRRLQSLQKKLVDSVPHVAGLNPR-SFRQFHSNGKAHRPGPDSIVDCELLSHY 1396
+ L +Q +L + + L ++R FH+ K P +D +L+ +
Sbjct: 506 WYNLLLDMQNRLNKVIKSLCSLTHLFTWRSFHTERKTE-PAT-GFIDGDLIESF 557
Score = 38 (18.4 bits), Expect = 1.2e-11, Sum P(2) = 1.2e-11
Identities = 17/52 (32%), Positives = 20/52 (38%)
Query: 421 SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS 472
S +L LGD L F T +LS K G P+ R RS S
Sbjct: 57 SSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGT----QPTVLRTFRSLS 104
>DICTYBASE|DDB_G0286013
symbol:repE "UV-damaged DNA binding protein1"
species:44689 "Dictyostelium discoideum" [GO:0005634 "nucleus"
evidence=IEA;ISS;IDA] [GO:0005515 "protein binding" evidence=IPI]
[GO:0006974 "response to DNA damage stimulus" evidence=IEA;IEP]
[GO:0006289 "nucleotide-excision repair" evidence=ISS] [GO:0003684
"damaged DNA binding" evidence=ISS] [GO:0003676 "nucleic acid
binding" evidence=IEA] [GO:0006281 "DNA repair" evidence=IEA]
[GO:0005737 "cytoplasm" evidence=IEA] [GO:0003677 "DNA binding"
evidence=IEA] [GO:0016567 "protein ubiquitination" evidence=IEA]
InterPro:IPR017986 InterPro:IPR004871 Pfam:PF03178
UniPathway:UPA00143 dictyBase:DDB_G0286013 GO:GO:0005634
GO:GO:0005737 GenomeReviews:CM000153_GR SUPFAM:SSF50978
GO:GO:0003684 GO:GO:0016567 EMBL:AAFI02000085 GO:GO:0006289
eggNOG:NOG247734 KO:K10610 OMA:CALGDGS EMBL:U50042 PIR:S71092
RefSeq:XP_637896.2 STRING:B0M0P5 EnsemblProtists:DDB0191144
GeneID:8625406 KEGG:ddi:DDB_G0286013 ProtClustDB:CLSZ2430134
Uniprot:B0M0P5
Length = 1181
Score = 135 (52.6 bits), Expect = 4.7e-11, Sum P(5) = 4.7e-11
Identities = 79/339 (23%), Positives = 140/339 (41%)
Query: 1110 TLLAIGTAYVQGEDVAARGRVLLFSTGRNA--------DNPQN---------LVTEVYSK 1152
T LA+GT+ + + GRVLLFS ++ DN N +T +
Sbjct: 855 TYLAVGTSI--NTPIKSSGRVLLFSLSSSSSSNDKDSLDNNNNNNNNSGANGKLTLLEEI 912
Query: 1153 ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK-----N 1207
+ + ++ L S G L+ A ++ ++T ++ + ++ I+K +
Sbjct: 913 KFRSSVYFLLSFNGRLIAAVHKRLFSIRYTHSKEKNCKVISSESVHKGHTMILKLASRGH 972
Query: 1208 FILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQI 1267
FIL+GD+ KS+ L + G+ L +A++ + + + D + E N I
Sbjct: 973 FILVGDMMKSMSLLVEQSDGS-LEQIARNPQPIWIRSVAMIND----DYFIGAEASNNFI 1027
Query: 1268 FYYAPKMSESWKGQKLL-SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL 1326
S + ++LL S +H+G + +R L P SD+ +L
Sbjct: 1028 VVKKNNDSTNELERELLDSVGHYHIGESINS-MRHGSLVR-------LPDSDQPIIPTIL 1079
Query: 1327 FGTLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDS 1386
+ +++GSIG +A + E F LQK L V V G + ++R F SN H +
Sbjct: 1080 YASVNGSIGVVASISEEDFIFFSKLQKGLNQVVRGVGGFSHETWRAF-SNDH-HTIDSKN 1137
Query: 1387 IVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+D +L+ + L E QL+ G T + L
Sbjct: 1138 FIDGDLIETFLDLKYESQLKAVADLGITPDDAFRRIESL 1176
Score = 86 (35.3 bits), Expect = 4.7e-11, Sum P(5) = 4.7e-11
Identities = 37/111 (33%), Positives = 53/111 (47%)
Query: 234 HVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH-HTCMISALSISTTL 292
+V N+R L+ V D F++G P + +L + KH T IS S T L
Sbjct: 192 NVNNVR-LEELQVLDMTFLYGCKVPTIAVLFK------DTKDEKHISTYEIS--SKDTEL 242
Query: 293 KQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALN 343
P WS N+ Y L VP P+GGVLVV N I Y + + ++A++
Sbjct: 243 VVGP--WSQSNV--GVYSSLLVPVPLGGVLVVADNGITYLNGKVTRSVAVS 289
Score = 59 (25.8 bits), Expect = 4.7e-11, Sum P(5) = 4.7e-11
Identities = 13/53 (24%), Positives = 28/53 (52%)
Query: 384 GDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
G L +L +++ + V L + + S I+ + + + ++GS GDS L++
Sbjct: 313 GRLSVLVLIHQQQKVMELKFEQLGRISIPSSISYLDSGVVYIGSSSGDSQLIR 365
Score = 57 (25.1 bits), Expect = 4.7e-11, Sum P(5) = 4.7e-11
Identities = 26/113 (23%), Positives = 47/113 (41%)
Query: 532 GISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSR-------MAAYDDEYHAYLIISLE 584
GI++Q++ +EL G KGI+ + + ++ +N +++ D YLI S
Sbjct: 426 GIAEQAS---IELEGIKGIFPINNNNNNNNNNNNNNNNNNNNNSNGITDSKDRYLITSFI 482
Query: 585 ARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS 637
T VL E TE T+ G + +IQ+ ++D +
Sbjct: 483 ECTKVLSFQGEEIEETEFEGLESNCSTLYCGTIDKLNLLIQITNVSINLIDSN 535
Score = 53 (23.7 bits), Expect = 4.7e-11, Sum P(5) = 4.7e-11
Identities = 12/27 (44%), Positives = 16/27 (59%)
Query: 493 NTESAQKTFSFAVR-DSLVNIGPLKDF 518
NTE Q T S+ ++ NIGP+ DF
Sbjct: 367 NTEKDQTTDSYVTYLEAFTNIGPVVDF 393
Score = 44 (20.5 bits), Expect = 9.2e-10, Sum P(5) = 9.2e-10
Identities = 11/44 (25%), Positives = 21/44 (47%)
Query: 562 NADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDY 605
N + R + +++ Y +I+++ +L A L E E V Y
Sbjct: 774 NEEMGRRIVHLEDHSCYAVITVKNNEGLLGGAQDLCEEDEEVSY 817
>FB|FBgn0035162
symbol:CG13900 species:7227 "Drosophila melanogaster"
[GO:0003684 "damaged DNA binding" evidence=ISS] [GO:0030532 "small
nuclear ribonucleoprotein complex" evidence=ISS] [GO:0000398 "mRNA
splicing, via spliceosome" evidence=IC;ISS] [GO:0005686 "U2 snRNP"
evidence=ISS;IDA] [GO:0007052 "mitotic spindle organization"
evidence=IMP] [GO:0071011 "precatalytic spliceosome" evidence=IDA]
[GO:0071013 "catalytic step 2 spliceosome" evidence=IDA]
[GO:0022008 "neurogenesis" evidence=IMP] InterPro:IPR004871
InterPro:IPR015943 Pfam:PF03178 GO:GO:0007052 GO:GO:0022008
Gene3D:2.130.10.10 GO:GO:0003676 GO:GO:0071011 GO:GO:0000398
GO:GO:0071013 GO:GO:0005686 eggNOG:NOG247734 EMBL:BT021338
ProteinModelPortal:Q5BI86 SMR:Q5BI86 STRING:Q5BI86 PaxDb:Q5BI86
PRIDE:Q5BI86 FlyBase:FBgn0035162 InParanoid:Q5BI86
OrthoDB:EOG4B5MM0 ArrayExpress:Q5BI86 Bgee:Q5BI86 Uniprot:Q5BI86
Length = 1227
Score = 125 (49.1 bits), Expect = 1.6e-09, Sum P(5) = 1.6e-09
Identities = 73/365 (20%), Positives = 153/365 (41%)
Query: 1079 QTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAY-VQ-GEDVAARGRVLLFSTG 1136
QT ++P+ +E +++ ++ + + LA+G A +Q ++ G + ++
Sbjct: 884 QTMFSVPLTQNEAIMSMAMLKF--SIAADGRYYLAVGIAKDLQLNPRISQGGCIDIYKID 941
Query: 1137 RNADNPQNLV-TEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAP 1195
+ + + T++ E+ GA L QG LL G + ++ + ++
Sbjct: 942 PTCSSLEFMHRTDI--DEIPGA---LCGFQGRLLAGCGRMLRIYDFGKKKMLRKCENKHI 996
Query: 1196 PLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLS 1255
P +V++ + + + + D+ +S++F+ ++ QL + A D AT L+D T++
Sbjct: 997 PYQIVNIQAMGHRVYVSDVQESVFFIRYRRAENQLIIFADDTHPRWVTATT-LLDYDTIA 1055
Query: 1256 LVVSDEQKNIQIFYYA--PKMSESWKGQK------LLSRAEFHVGAHVTKFLRLQMLATS 1307
+ +IQ ++ + E G K LLS A ++ F + + S
Sbjct: 1056 IADKFGNLSIQRLPHSVTDDVDEDPTGTKSLWDRGLLSGAS-QKSENICSF-HVGEIIMS 1113
Query: 1308 SDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPLDELT-FRRLQSLQKKLVDSVPHVAGLN 1366
+ PG + AL++ TL G++G P + Q L+ + + P + G +
Sbjct: 1114 LQKATLIPGGSE----ALIYATLSGTVGAFVPFTSREDYDFFQHLEMHMRNENPPLCGRD 1169
Query: 1367 PRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLA 1426
S+R + K +++D +L Y + +Q IA T +QI L D+
Sbjct: 1170 HLSYRSSYYPVK-------NVLDGDLCEQYLSIEAAKQKSIAGDMFRTPNQICKKLEDIR 1222
Query: 1427 LGTSF 1431
+F
Sbjct: 1223 TRYAF 1227
Score = 82 (33.9 bits), Expect = 1.6e-09, Sum P(5) = 1.6e-09
Identities = 21/61 (34%), Positives = 33/61 (54%)
Query: 572 DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGA 631
DDE+ AY+I+S T+VL + + EVT+S + T+ L G ++QV+ G
Sbjct: 467 DDEFDAYIIVSFVNATLVLSIGETVEEVTDS-GFLGTTPTLCCAAL-GDDALVQVYPDGI 524
Query: 632 R 632
R
Sbjct: 525 R 525
Score = 62 (26.9 bits), Expect = 1.6e-09, Sum P(5) = 1.6e-09
Identities = 12/43 (27%), Positives = 25/43 (58%)
Query: 670 YVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTL 712
++ +G++D ++R+L DP+ C TP ++++ P S L
Sbjct: 604 FLAVGLADNTVRILSLDPNNCL----TPCSMQALPSPAESLCL 642
Score = 56 (24.8 bits), Expect = 1.6e-09, Sum P(5) = 1.6e-09
Identities = 15/59 (25%), Positives = 26/59 (44%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
LL T+ GD+ +T+ D VV + L + + + + F+ S G+ L Q
Sbjct: 302 LLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQ 360
Score = 51 (23.0 bits), Expect = 1.6e-09, Sum P(5) = 1.6e-09
Identities = 15/61 (24%), Positives = 26/61 (42%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+D I++ + +I +LE++ S + L F K G G +DP+G
Sbjct: 75 KDYIVVGSDSGRIVILEYNPSKNALEKVHQETFG-------KSGCRRIVPGQYFAIDPKG 127
Query: 190 R 190
R
Sbjct: 128 R 128
Score = 47 (21.6 bits), Expect = 4.0e-08, Sum P(5) = 4.0e-08
Identities = 39/163 (23%), Positives = 68/163 (41%)
Query: 670 YVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDA 729
Y+ +G+S+G + V DP + ++ + S +PV + +G E L +S
Sbjct: 673 YLNIGLSNGVLLRTVLDPVSGDLADTRTRYLGS--RPVKLFRIKM-QGSEAVLAMSSR-T 728
Query: 730 WLS---------TGVG-EAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDK 779
WLS T + E ++ A G +Q +V + L I + VF
Sbjct: 729 WLSYYHQNRFHLTPLSYETLEYASGFSSEQCS-EGIVAISTNTLRILALEKLGAVFNQVA 787
Query: 780 F---VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENI 819
F + RT ++ L +ET+ N+ +E+ T RKE +
Sbjct: 788 FPLQYTPRTFVIHPDTGRMLI-AETDHNAYTED-TKSARKEQM 828
>DICTYBASE|DDB_G0282569
symbol:sf3b3 "splicing factor 3B subunit 3"
species:44689 "Dictyostelium discoideum" [GO:0005634 "nucleus"
evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
[GO:0030532 "small nuclear ribonucleoprotein complex" evidence=ISS]
[GO:0008380 "RNA splicing" evidence=IEA;ISS] [GO:0006461 "protein
complex assembly" evidence=ISS] [GO:0005681 "spliceosomal complex"
evidence=IEA;ISS] [GO:0006397 "mRNA processing" evidence=IEA]
InterPro:IPR017986 InterPro:IPR004871 InterPro:IPR015943
Pfam:PF03178 dictyBase:DDB_G0282569 GO:GO:0006461 GO:GO:0008380
Gene3D:2.130.10.10 SUPFAM:SSF50978 EMBL:AAFI02000047
GenomeReviews:CM000152_GR GO:GO:0006397 GO:GO:0005681 GO:GO:0003676
GO:GO:0030532 eggNOG:NOG247734 KO:K12830 OMA:FDTIPVA
RefSeq:XP_640132.1 STRING:Q54SA7 EnsemblProtists:DDB0233171
GeneID:8623669 KEGG:ddi:DDB_G0282569 ProtClustDB:CLSZ2729005
Uniprot:Q54SA7
Length = 1256
Score = 151 (58.2 bits), Expect = 5.4e-09, Sum P(3) = 5.4e-09
Identities = 95/463 (20%), Positives = 199/463 (42%)
Query: 996 QKVIPLKATP-----H-QITYF----AEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQI 1045
Q+ I L ATP H Q +Y E N + + + ++ L L +E+ ++
Sbjct: 814 QETIKLNATPKRFIIHPQTSYIIILETETNYNTDNIDIDKINEQSEKLLLEKQKELQQEM 873
Query: 1046 DNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATI--PM--QSSENALTVRVVTLF 1101
D + D + +E ++ ++ +P G W++ I P+ +S E+ + F
Sbjct: 874 D---IDDDDQNNNNEIEPFK-KLFKPKAGKGKWKSYIKIMDPITHESLESLMLEDGEAGF 929
Query: 1102 NTTT----KENETLLAIG--TAYVQGEDVAARGRVLLFSTGRNADNPQNLVTEVYSKELK 1155
+ T + E L +G T V + L+ R D + L +Y E++
Sbjct: 930 SVCTCSFGESGEIFLVVGCVTDMVLNPKSHKSAHLNLY---RFIDGGKKLEL-LYKTEVE 985
Query: 1156 GAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIH 1215
+ A+A QG L+ G I ++ +L P +V+++ + + +++GDI
Sbjct: 986 EPVYAMAQFQGKLVCGVGKSIRIYDMGKKKLLRKCETKNLPNTIVNIHSLGDRLVVGDIQ 1045
Query: 1216 KSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIF----YYA 1271
+SI+F+ +K L + A D + ++D T++ +D+ NI + +
Sbjct: 1046 ESIHFIKYKRSENMLYVFADDLAPR-WMTSSVMLDYDTVA--GADKFGNIFVLRLPLLIS 1102
Query: 1272 PKMSESWKGQKLLSRAEFHVGA-----HVTKFLRLQMLATSSDRTGAAPGSDKTNRFALL 1326
++ E G KL + GA H+ F + T+ ++T G + +L
Sbjct: 1103 DEVEEDPTGTKLKFESGTLNGAPHKLDHIANFF-VGDTVTTLNKTSLVVGGPEV----IL 1157
Query: 1327 FGTLDGSIGCIAPL---DELTFRRLQSLQKKL-VDSVPHVAGLNPRSFRQFHSNGKAHRP 1382
+ T+ G+IG + P +++ F +L+ + D +P + G + ++R ++ K
Sbjct: 1158 YTTISGAIGALIPFTSREDVDF--FSTLEMNMRSDCLP-LCGRDHLAYRSYYFPVK---- 1210
Query: 1383 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+I+D +L + L ++QL I+ + + S+++ L ++
Sbjct: 1211 ---NIIDGDLCEQFSTLNYQKQLSISEELSRSPSEVIKKLEEI 1250
Score = 89 (36.4 bits), Expect = 5.4e-09, Sum P(3) = 5.4e-09
Identities = 46/184 (25%), Positives = 71/184 (38%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L+ ++ GDL +T+ Y G V ++++ + VL + +T + N F S GD L F
Sbjct: 309 LVQSEHGDLYKITLDYQGDQVSEVNVNYFDTIVLANCLTVLKNGFLFAASEFGDHTLYFF 368
Query: 438 TCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
G G + D + T R S ++++ N E S S S
Sbjct: 369 K-SIGDEE-EEGQAKRLEDKDGHLWFTPR--NSCGTKMEELKNLEPTSHLSSLS------ 418
Query: 498 QKTFSFAVRDSLVNIGPLKDFSYGLRINADASAT--GISKQSNYELVELPGC-KGIWTVY 554
F V D + P G +N+ G+S + LPG GIWTV
Sbjct: 419 -PIIDFKVLDLVREENPQLYSLCGTGLNSSLKVLRHGLSV-TTITTANLPGVPSGIWTVP 476
Query: 555 HKSS 558
+S
Sbjct: 477 KSTS 480
Score = 71 (30.1 bits), Expect = 3.4e-07, Sum P(3) = 3.4e-07
Identities = 33/113 (29%), Positives = 44/113 (38%)
Query: 521 GLRINADASATGISKQSNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYL 579
GL + G+S + LPG GIWTV S NA D+ Y+
Sbjct: 443 GLNSSLKVLRHGLSV-TTITTANLPGVPSGIWTV--PKSTSPNAI--------DQTDKYI 491
Query: 580 IISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
++S T VL D + E ES ++ T G +IQVF G R
Sbjct: 492 VVSFVGTTSVLSVGDTIQENHESG--ILETTTTLLVKSMGDDAIIQVFPTGFR 542
Score = 41 (19.5 bits), Expect = 5.4e-09, Sum P(3) = 5.4e-09
Identities = 15/64 (23%), Positives = 29/64 (45%)
Query: 127 SRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVD 186
S +D II+ + ++ +LE++ + + +H E + G G + VD
Sbjct: 71 SGTKDYIIVGSDSGRVVILEYNSQKN--QFDKIH----QETFG-RSGCRRIVPGQYLAVD 123
Query: 187 PQGR 190
P+GR
Sbjct: 124 PKGR 127
Score = 41 (19.5 bits), Expect = 0.00032, Sum P(3) = 0.00032
Identities = 16/43 (37%), Positives = 25/43 (58%)
Query: 406 TNPSVLTSDITTIGNSLF-FLGSRLGDSLLVQFTCGSGTSMLS 447
T+ S +S +T+ G SLF F+G + G ++ + T S T LS
Sbjct: 686 TSTSSASSSVTS-GGSLFLFVGLKNG--VVKRATLDSVTGELS 725
>UNIPROTKB|E9PT66
symbol:Sf3b3 "Protein Sf3b3" species:10116 "Rattus
norvegicus" [GO:0003676 "nucleic acid binding" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] InterPro:IPR004871 Pfam:PF03178
RGD:1311636 GO:GO:0003676 GO:GO:0071013
GeneTree:ENSGT00530000063396 GO:GO:0005689 IPI:IPI00958853
Ensembl:ENSRNOT00000023854 ArrayExpress:E9PT66 Uniprot:E9PT66
Length = 920
Score = 125 (49.1 bits), Expect = 3.8e-08, Sum P(4) = 3.8e-08
Identities = 105/550 (19%), Positives = 211/550 (38%)
Query: 914 ITIFK-NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYV 972
+ +F+ + G + SR ++ R + P L ++ + + C G + +
Sbjct: 402 VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTP-LSYETLEFASGFASEQCPEGIVAI 460
Query: 973 TSQGILKICQLPS-GSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQ 1031
S L+I L G+ ++ Q PL+ TP + E N LI+ +
Sbjct: 461 -STNTLRILALEKLGAVFN-----QVAFPLQYTPRKFVIHPESN--NLIIIETDHNAYTE 512
Query: 1032 VLSLLIDQEVGHQI------DNHNLSSVDLHRTYTVEEYEVRILEPDRAG-GPWQT--RA 1082
Q++ ++ D L++ ++ + E I +AG G W + R
Sbjct: 513 ATKAQRKQQMAEEMVEAPGEDERELAA-EMAAAFLNENLPESIFGAPKAGNGQWASVIRV 571
Query: 1083 TIPMQSS----------ENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLL 1132
P+Q + E A +V V F+ T ++ L+ + + A G V
Sbjct: 572 MNPIQGNTLDLVQLEQNEAAFSVAVCR-FSNTGEDWYVLVGVAKDLILSPRSVAGGFVYT 630
Query: 1133 FSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFY 1192
+ N + + L + ++ +A+A QG +LI G + ++ +L
Sbjct: 631 YKLVNNGEKLEFL----HKTPVEEVPAAIAPFQGRVLIGVGKLLRVYDLGKKKLLRKCEN 686
Query: 1193 DAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGS 1252
Y+ + + + +++ D+ +S ++ +K QL + A D T L+D
Sbjct: 687 KHIANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPR-WVTTASLLDYD 745
Query: 1253 TLSLVVSDEQKNIQIFYYAP----KMSESWKGQKLL-SRAEFHVG---AHVTKFLRLQML 1304
T++ +D+ NI + P ++ E G K L R + A V +
Sbjct: 746 TVA--GADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGET 803
Query: 1305 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPH 1361
S +T PG ++ L++ TL G IG + P ++ F Q ++ L P
Sbjct: 804 VLSLQKTTLIPGGSES----LVYTTLSGGIGILVPFTSHEDHDF--FQHVEMHLRSEHPP 857
Query: 1362 VAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSN 1421
+ G + SFR ++ K +++D +L + + +Q ++ + T ++
Sbjct: 858 LCGRDHLSFRSYYFPVK-------NVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKK 910
Query: 1422 LNDLALGTSF 1431
L D+ +F
Sbjct: 911 LEDIRTRYAF 920
Score = 84 (34.6 bits), Expect = 3.8e-08, Sum P(4) = 3.8e-08
Identities = 30/97 (30%), Positives = 47/97 (48%)
Query: 537 SNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
S + ELPG +WTV R H +DE+ AY+I+S T+VL +
Sbjct: 148 SEMAVSELPGNPNAVWTV-----RRH---------IEDEFDAYIIVSFVNATLVLSIGET 193
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
+ EVT+S + T++ +L G ++QV+ G R
Sbjct: 194 VEEVTDS-GFLGTTPTLSC-SLLGDDALVQVYPDGIR 228
Score = 54 (24.1 bits), Expect = 3.8e-08, Sum P(4) = 3.8e-08
Identities = 13/36 (36%), Positives = 23/36 (63%)
Query: 670 YVLLGMSDGSIRLLVGDPSTCT--VSVQT-PAAIES 702
++ +G+ D ++R++ DPS C +S+Q PA ES
Sbjct: 307 FLAVGLVDNTVRIISLDPSDCLQPLSMQALPAQPES 342
Score = 50 (22.7 bits), Expect = 3.8e-08, Sum P(4) = 3.8e-08
Identities = 19/82 (23%), Positives = 33/82 (40%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + L + + + + + F+ S G+ L Q
Sbjct: 5 LAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGFLFVASEFGNHYLYQI 64
Query: 438 T-CGSGTSM--LSSGLKEEFGD 456
G SS + E GD
Sbjct: 65 AHLGDDDEEPEFSSAMPLEEGD 86
Score = 41 (19.5 bits), Expect = 7.0e-07, Sum P(4) = 7.0e-07
Identities = 20/68 (29%), Positives = 32/68 (47%)
Query: 665 SIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRK 724
SI Y+ +G+ +G + V DP T +S + S +PV + +G E L
Sbjct: 362 SIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGS--RPVKLFRV-RMQGQEAVLAM 418
Query: 725 TSTDAWLS 732
+S +WLS
Sbjct: 419 SSR-SWLS 425
>POMBASE|SPAPJ698.03c
symbol:prp12 "U2 snRNP-associated protein Sap130
(predicted)" species:4896 "Schizosaccharomyces pombe" [GO:0000245
"spliceosomal complex assembly" evidence=ISS] [GO:0005681
"spliceosomal complex" evidence=IEA] [GO:0005686 "U2 snRNP"
evidence=ISS] [GO:0030620 "U2 snRNA binding" evidence=ISS]
[GO:0045292 "mRNA cis splicing, via spliceosome" evidence=ISS]
InterPro:IPR017986 InterPro:IPR004871 InterPro:IPR015943
Pfam:PF03178 PomBase:SPAPJ698.03c EMBL:CU329670
GenomeReviews:CU329670_GR Gene3D:2.130.10.10 SUPFAM:SSF50978
GO:GO:0005681 GO:GO:0007049 GO:GO:0000245 GO:GO:0005686
GO:GO:0045292 eggNOG:NOG247734 GO:GO:0030620 KO:K12830
HOGENOM:HOG000216677 OMA:FDTIPVA OrthoDB:EOG4FR40R EMBL:AB034966
RefSeq:NP_594414.1 IntAct:Q9UTT2 STRING:Q9UTT2
EnsemblFungi:SPAPJ698.03c.1 GeneID:2543278 KEGG:spo:SPAPJ698.03c
NextBio:20804299 Uniprot:Q9UTT2
Length = 1206
Score = 117 (46.2 bits), Expect = 4.9e-08, Sum P(5) = 4.9e-08
Identities = 61/282 (21%), Positives = 120/282 (42%)
Query: 1153 ELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLG 1212
E+ G AL QG +L G + ++ ++ A PL++ + + + I++
Sbjct: 934 EIDGIPMALTPFQGRMLAGVGRFLRIYDLGNKKMLRKGELSAVPLFITHITVQASRIVVA 993
Query: 1213 DIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT-EFLIDGSTLSLVVSDEQKNIQIFY-- 1269
D S+ F+ +K + L A D ++ + T L+D TL+ D+ NI +
Sbjct: 994 DSQYSVRFVVYKPEDNHLLTFADD--TIHRWTTTNVLVDYDTLA--GGDKFGNIWLLRCP 1049
Query: 1270 -YAPKMSESWKGQ-KLLSRAEF-HVGAHVTKFLR---LQMLATSSDRTGAAPGSDKTNRF 1323
+ K+++ + KL+ F + H + + TS + G+ R
Sbjct: 1050 EHVSKLADEENSESKLIHEKPFLNSTPHKLDLMAHFFTNDIPTSLQKVQLVEGA----RE 1105
Query: 1324 ALLFGTLDGSIGCIAP-LDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRP 1382
LL+ L G++G P +++ R Q L+ L P +AG + ++R +++ K
Sbjct: 1106 VLLWTGLLGTVGVFTPFINQEDVRFFQQLEFLLRKECPPLAGRDHLAYRSYYAPVKC--- 1162
Query: 1383 GPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLND 1424
++D +L Y LP Q IA++ T +++ + D
Sbjct: 1163 ----VIDGDLCEMYYSLPHPVQEMIANELDRTIAEVSKKIED 1200
Score = 77 (32.2 bits), Expect = 4.9e-08, Sum P(5) = 4.9e-08
Identities = 18/70 (25%), Positives = 37/70 (52%)
Query: 573 DEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
D Y +Y+I+S T+VL + + E+++S + T+ A + GR ++Q+ +G R
Sbjct: 493 DVYDSYIILSFTNGTLVLSIGETVEEISDS-GFLSSVSTLNARQM-GRDSLVQIHPKGIR 550
Query: 633 ILDGSYMTQD 642
+ + T +
Sbjct: 551 YIRANKQTSE 560
Score = 76 (31.8 bits), Expect = 4.9e-08, Sum P(5) = 4.9e-08
Identities = 45/155 (29%), Positives = 67/155 (43%)
Query: 299 WSAMNLPHDAYKLLAVPS----PIGGVLVVGANTIHY-HSQSASCAL------ALNNYAV 347
WS + + ++Y L+ VP P G LV+ I Y H Q A + A + A+
Sbjct: 230 WSKV-VDRNSYMLIPVPGGNDGP-SGTLVISNGWISYRHLQKAFHQIPILRRQAASANAI 287
Query: 348 SLDSSQELPRSSFSVEL--DAAHATWLQNDVALLSTKTGDLVLLTVVYDGR--VVQ-RLD 402
S +Q S+ L A + LL T GDL+ LT+ +DG+ VV+ RL
Sbjct: 288 STPWNQVNSNSANDGPLIVSAVLHKMKGSFFYLLQTGDGDLLKLTIEHDGQGNVVELRLK 347
Query: 403 LSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
T P + +I G F+ + G+ L QF
Sbjct: 348 YFDTVPLAVQLNILKTG--FLFVATEFGNHQLYQF 380
Score = 46 (21.3 bits), Expect = 4.9e-08, Sum P(5) = 4.9e-08
Identities = 14/64 (21%), Positives = 33/64 (51%)
Query: 87 TKRRVLMDGISAASLELVC--HYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISV 144
T+ R+L+ + A + C + G + ++A L G +RD +++ + +I++
Sbjct: 40 TESRLLIYKVDATDGRMNCILNQNCFGIIRNVAPLRLTGF----KRDYLVVTSDSGRITI 95
Query: 145 LEFD 148
LE++
Sbjct: 96 LEYN 99
Score = 44 (20.5 bits), Expect = 4.9e-08, Sum P(5) = 4.9e-08
Identities = 20/81 (24%), Positives = 38/81 (46%)
Query: 1036 LIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTV 1095
L+ E+ ++ L+ +T T + L P + G R+ + ++A TV
Sbjct: 588 LVYFEMSDDVEGGQLNEYQERKTLTANVTSLA-LGPVQEGS---RRSNFMCLACDDA-TV 642
Query: 1096 RVVTLFNTTTKENETLLAIGT 1116
RV++L TT EN ++ A+ +
Sbjct: 643 RVLSLDLYTTLENLSVQALSS 663
Score = 43 (20.2 bits), Expect = 8.2e-05, Sum P(5) = 8.2e-05
Identities = 6/21 (28%), Positives = 15/21 (71%)
Query: 666 IADPYVLLGMSDGSIRLLVGD 686
+ D Y++L ++G++ L +G+
Sbjct: 494 VYDSYIILSFTNGTLVLSIGE 514
Score = 37 (18.1 bits), Expect = 2.3e-07, Sum P(5) = 2.3e-07
Identities = 7/21 (33%), Positives = 12/21 (57%)
Query: 1096 RVVTLFNTTTKENETLLAIGT 1116
R V ++ T K T+LA+ +
Sbjct: 715 RAVKIYPITMKNQNTVLAVSS 735
>UNIPROTKB|A0JN52
symbol:SF3B3 "Splicing factor 3B subunit 3" species:9913
"Bos taurus" [GO:0071013 "catalytic step 2 spliceosome"
evidence=IEA] [GO:0005689 "U12-type spliceosomal complex"
evidence=IEA] [GO:0008380 "RNA splicing" evidence=IEA] [GO:0006397
"mRNA processing" evidence=IEA] [GO:0003676 "nucleic acid binding"
evidence=IEA] InterPro:IPR004871 Pfam:PF03178 GO:GO:0008380
GO:GO:0006397 GO:GO:0003676 GO:GO:0071013 eggNOG:NOG247734
GeneTree:ENSGT00530000063396 GO:GO:0005689 KO:K12830
HOGENOM:HOG000216677 OMA:FDTIPVA EMBL:BC126518 IPI:IPI00690059
RefSeq:NP_001071319.1 UniGene:Bt.7895 ProteinModelPortal:A0JN52
STRING:A0JN52 PRIDE:A0JN52 Ensembl:ENSBTAT00000014050 GeneID:504962
KEGG:bta:504962 CTD:23450 HOVERGEN:HBG093942 InParanoid:A0JN52
OrthoDB:EOG4RV2QJ BioCyc:CATTLE:504962-MONOMER BindingDB:A0JN52
NextBio:20866909 ArrayExpress:A0JN52 Uniprot:A0JN52
Length = 1217
Score = 125 (49.1 bits), Expect = 5.9e-08, Sum P(5) = 5.9e-08
Identities = 105/550 (19%), Positives = 211/550 (38%)
Query: 914 ITIFK-NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYV 972
+ +F+ + G + SR ++ R + P L ++ + + C G + +
Sbjct: 699 VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTP-LSYETLEFASGFASEQCPEGIVAI 757
Query: 973 TSQGILKICQLPS-GSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQ 1031
S L+I L G+ ++ Q PL+ TP + E N LI+ +
Sbjct: 758 -STNTLRILALEKLGAVFN-----QVAFPLQYTPRKFVIHPESN--NLIIIETDHNAYTE 809
Query: 1032 VLSLLIDQEVGHQI------DNHNLSSVDLHRTYTVEEYEVRILEPDRAG-GPWQT--RA 1082
Q++ ++ D L++ ++ + E I +AG G W + R
Sbjct: 810 ATKAQRKQQMAEEMVEAAGEDERELAA-EMAAAFLNENLPESIFGAPKAGNGQWASVIRV 868
Query: 1083 TIPMQSS----------ENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLL 1132
P+Q + E A +V V F+ T ++ L+ + + A G V
Sbjct: 869 MNPIQGNTLDLVQLEQNEAAFSVAVCR-FSNTGEDWYVLVGVAKDLILNPRSVAGGFVYT 927
Query: 1133 FSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFY 1192
+ N + + L + ++ +A+A QG +LI G + ++ +L
Sbjct: 928 YKLVNNGEKLEFL----HKTPVEEVPAAIAPFQGRVLIGVGKLLRVYDLGKKKLLRKCEN 983
Query: 1193 DAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGS 1252
Y+ + + + +++ D+ +S ++ +K QL + A D T L+D
Sbjct: 984 KHIANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPR-WVTTASLLDYD 1042
Query: 1253 TLSLVVSDEQKNIQIFYYAP----KMSESWKGQKLL-SRAEFHVG---AHVTKFLRLQML 1304
T++ +D+ NI + P ++ E G K L R + A V +
Sbjct: 1043 TVA--GADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGET 1100
Query: 1305 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPH 1361
S +T PG ++ L++ TL G IG + P ++ F Q ++ L P
Sbjct: 1101 VLSLQKTTLIPGGSES----LVYTTLSGGIGILVPFTSHEDHDF--FQHVEMHLRSEHPP 1154
Query: 1362 VAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSN 1421
+ G + SFR ++ K +++D +L + + +Q ++ + T ++
Sbjct: 1155 LCGRDHLSFRSYYFPVK-------NVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKK 1207
Query: 1422 LNDLALGTSF 1431
L D+ +F
Sbjct: 1208 LEDIRTRYAF 1217
Score = 84 (34.6 bits), Expect = 5.9e-08, Sum P(5) = 5.9e-08
Identities = 30/97 (30%), Positives = 47/97 (48%)
Query: 537 SNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
S + ELPG +WTV R H +DE+ AY+I+S T+VL +
Sbjct: 445 SEMAVSELPGNPNAVWTV-----RRH---------IEDEFDAYIIVSFVNATLVLSIGET 490
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
+ EVT+S + T++ +L G ++QV+ G R
Sbjct: 491 VEEVTDS-GFLGTTPTLSC-SLLGDDALVQVYPDGIR 525
Score = 54 (24.1 bits), Expect = 5.9e-08, Sum P(5) = 5.9e-08
Identities = 13/36 (36%), Positives = 23/36 (63%)
Query: 670 YVLLGMSDGSIRLLVGDPSTCT--VSVQT-PAAIES 702
++ +G+ D ++R++ DPS C +S+Q PA ES
Sbjct: 604 FLAVGLVDNTVRIISLDPSDCLQPLSMQALPAQPES 639
Score = 50 (22.7 bits), Expect = 5.9e-08, Sum P(5) = 5.9e-08
Identities = 19/82 (23%), Positives = 33/82 (40%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + L + + + + + F+ S G+ L Q
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 T-CGSGTSM--LSSGLKEEFGD 456
G SS + E GD
Sbjct: 362 AHLGDDDEEPEFSSAMPLEEGD 383
Score = 46 (21.3 bits), Expect = 5.9e-08, Sum P(5) = 5.9e-08
Identities = 15/61 (24%), Positives = 25/61 (40%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+D I++ + +I +LE+ S + F K G G + VDP+G
Sbjct: 75 KDYIVVGSDSGRIVILEYQPSKNMFEKIHQETFG-------KSGCRRIVPGQFLAVDPKG 127
Query: 190 R 190
R
Sbjct: 128 R 128
Score = 41 (19.5 bits), Expect = 1.0e-06, Sum P(5) = 1.0e-06
Identities = 20/68 (29%), Positives = 32/68 (47%)
Query: 665 SIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRK 724
SI Y+ +G+ +G + V DP T +S + S +PV + +G E L
Sbjct: 659 SIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGS--RPVKLFRV-RMQGQEAVLAM 715
Query: 725 TSTDAWLS 732
+S +WLS
Sbjct: 716 SSR-SWLS 722
Score = 39 (18.8 bits), Expect = 2.8e-07, Sum P(5) = 2.8e-07
Identities = 7/23 (30%), Positives = 11/23 (47%)
Query: 12 PTGIANCGSGFITHSRADYVPQI 34
P+G+ C +IT+ P I
Sbjct: 245 PSGVLICSENYITYKNFGDQPDI 267
>UNIPROTKB|Q15393 [details] [associations]
symbol:SF3B3 "Splicing factor 3B subunit 3" species:9606
"Homo sapiens" [GO:0003676 "nucleic acid binding" evidence=IEA]
[GO:0000375 "RNA splicing, via transesterification reactions"
evidence=TAS] [GO:0000398 "mRNA splicing, via spliceosome"
evidence=IC;TAS] [GO:0071013 "catalytic step 2 spliceosome"
evidence=IDA] [GO:0005689 "U12-type spliceosomal complex"
evidence=IDA] [GO:0030532 "small nuclear ribonucleoprotein complex"
evidence=TAS] [GO:0005681 "spliceosomal complex" evidence=TAS]
[GO:0006397 "mRNA processing" evidence=TAS] [GO:0006461 "protein
complex assembly" evidence=TAS] [GO:0005654 "nucleoplasm"
evidence=TAS] [GO:0008380 "RNA splicing" evidence=TAS] [GO:0010467
"gene expression" evidence=TAS] Reactome:REACT_71
InterPro:IPR004871 Pfam:PF03178 GO:GO:0005654 GO:GO:0006461
Reactome:REACT_1675 GO:GO:0003676 GO:GO:0000398 GO:GO:0071013
GO:GO:0030532 eggNOG:NOG247734 GO:GO:0005689 KO:K12830
HOGENOM:HOG000216677 OMA:FDTIPVA CTD:23450 HOVERGEN:HBG093942
OrthoDB:EOG4RV2QJ EMBL:AJ001443 EMBL:D87686 EMBL:D13642
EMBL:BC000463 EMBL:BC003146 EMBL:BC009780 EMBL:BC068974
EMBL:AL110251 IPI:IPI00179138 IPI:IPI00300371 IPI:IPI00828110
PIR:T14779 RefSeq:NP_036558.3 UniGene:Hs.514435
ProteinModelPortal:Q15393 DIP:DIP-28152N IntAct:Q15393
MINT:MINT-1402891 STRING:Q15393 PhosphoSite:Q15393 DMDM:116242787
PaxDb:Q15393 PeptideAtlas:Q15393 PRIDE:Q15393
Ensembl:ENST00000302516 GeneID:23450 KEGG:hsa:23450 UCSC:uc002ezf.3
GeneCards:GC16P070557 HGNC:HGNC:10770 HPA:HPA042986 MIM:605592
neXtProt:NX_Q15393 PharmGKB:PA35688 InParanoid:Q15393
PhylomeDB:Q15393 BindingDB:Q15393 ChEMBL:CHEMBL1250378
GenomeRNAi:23450 NextBio:45731 ArrayExpress:Q15393 Bgee:Q15393
CleanEx:HS_SAP130 CleanEx:HS_SF3B3 Genevestigator:Q15393
GermOnline:ENSG00000189091 Uniprot:Q15393
Length = 1217
Score = 125 (49.1 bits), Expect = 5.9e-08, Sum P(5) = 5.9e-08
Identities = 105/550 (19%), Positives = 211/550 (38%)
Query: 914 ITIFK-NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYV 972
+ +F+ + G + SR ++ R + P L ++ + + C G + +
Sbjct: 699 VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTP-LSYETLEFASGFASEQCPEGIVAI 757
Query: 973 TSQGILKICQLPS-GSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQ 1031
S L+I L G+ ++ Q PL+ TP + E N LI+ +
Sbjct: 758 -STNTLRILALEKLGAVFN-----QVAFPLQYTPRKFVIHPESN--NLIIIETDHNAYTE 809
Query: 1032 VLSLLIDQEVGHQI------DNHNLSSVDLHRTYTVEEYEVRILEPDRAG-GPWQT--RA 1082
Q++ ++ D L++ ++ + E I +AG G W + R
Sbjct: 810 ATKAQRKQQMAEEMVEAAGEDERELAA-EMAAAFLNENLPESIFGAPKAGNGQWASVIRV 868
Query: 1083 TIPMQSS----------ENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLL 1132
P+Q + E A +V V F+ T ++ L+ + + A G V
Sbjct: 869 MNPIQGNTLDLVQLEQNEAAFSVAVCR-FSNTGEDWYVLVGVAKDLILNPRSVAGGFVYT 927
Query: 1133 FSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFY 1192
+ N + + L + ++ +A+A QG +LI G + ++ +L
Sbjct: 928 YKLVNNGEKLEFL----HKTPVEEVPAAIAPFQGRVLIGVGKLLRVYDLGKKKLLRKCEN 983
Query: 1193 DAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGS 1252
Y+ + + + +++ D+ +S ++ +K QL + A D T L+D
Sbjct: 984 KHIANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPR-WVTTASLLDYD 1042
Query: 1253 TLSLVVSDEQKNIQIFYYAP----KMSESWKGQKLL-SRAEFHVG---AHVTKFLRLQML 1304
T++ +D+ NI + P ++ E G K L R + A V +
Sbjct: 1043 TVA--GADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGET 1100
Query: 1305 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPH 1361
S +T PG ++ L++ TL G IG + P ++ F Q ++ L P
Sbjct: 1101 VLSLQKTTLIPGGSES----LVYTTLSGGIGILVPFTSHEDHDF--FQHVEMHLRSEHPP 1154
Query: 1362 VAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSN 1421
+ G + SFR ++ K +++D +L + + +Q ++ + T ++
Sbjct: 1155 LCGRDHLSFRSYYFPVK-------NVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKK 1207
Query: 1422 LNDLALGTSF 1431
L D+ +F
Sbjct: 1208 LEDIRTRYAF 1217
Score = 84 (34.6 bits), Expect = 5.9e-08, Sum P(5) = 5.9e-08
Identities = 30/97 (30%), Positives = 47/97 (48%)
Query: 537 SNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
S + ELPG +WTV R H +DE+ AY+I+S T+VL +
Sbjct: 445 SEMAVSELPGNPNAVWTV-----RRH---------IEDEFDAYIIVSFVNATLVLSIGET 490
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
+ EVT+S + T++ +L G ++QV+ G R
Sbjct: 491 VEEVTDS-GFLGTTPTLSC-SLLGDDALVQVYPDGIR 525
Score = 54 (24.1 bits), Expect = 5.9e-08, Sum P(5) = 5.9e-08
Identities = 13/36 (36%), Positives = 23/36 (63%)
Query: 670 YVLLGMSDGSIRLLVGDPSTCT--VSVQT-PAAIES 702
++ +G+ D ++R++ DPS C +S+Q PA ES
Sbjct: 604 FLAVGLVDNTVRIISLDPSDCLQPLSMQALPAQPES 639
Score = 50 (22.7 bits), Expect = 5.9e-08, Sum P(5) = 5.9e-08
Identities = 19/82 (23%), Positives = 33/82 (40%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + L + + + + + F+ S G+ L Q
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 T-CGSGTSM--LSSGLKEEFGD 456
G SS + E GD
Sbjct: 362 AHLGDDDEEPEFSSAMPLEEGD 383
Score = 46 (21.3 bits), Expect = 5.9e-08, Sum P(5) = 5.9e-08
Identities = 15/61 (24%), Positives = 25/61 (40%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+D I++ + +I +LE+ S + F K G G + VDP+G
Sbjct: 75 KDYIVVGSDSGRIVILEYQPSKNMFEKIHQETFG-------KSGCRRIVPGQFLAVDPKG 127
Query: 190 R 190
R
Sbjct: 128 R 128
Score = 41 (19.5 bits), Expect = 1.0e-06, Sum P(5) = 1.0e-06
Identities = 20/68 (29%), Positives = 32/68 (47%)
Query: 665 SIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRK 724
SI Y+ +G+ +G + V DP T +S + S +PV + +G E L
Sbjct: 659 SIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGS--RPVKLFRV-RMQGQEAVLAM 715
Query: 725 TSTDAWLS 732
+S +WLS
Sbjct: 716 SSR-SWLS 722
Score = 39 (18.8 bits), Expect = 2.8e-07, Sum P(5) = 2.8e-07
Identities = 7/23 (30%), Positives = 11/23 (47%)
Query: 12 PTGIANCGSGFITHSRADYVPQI 34
P+G+ C +IT+ P I
Sbjct: 245 PSGVLICSENYITYKNFGDQPDI 267
>MGI|MGI:1289341 [details] [associations]
symbol:Sf3b3 "splicing factor 3b, subunit 3" species:10090
"Mus musculus" [GO:0003674 "molecular_function" evidence=ND]
[GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005634
"nucleus" evidence=IEA] [GO:0005681 "spliceosomal complex"
evidence=IEA] [GO:0005689 "U12-type spliceosomal complex"
evidence=ISO] [GO:0006397 "mRNA processing" evidence=IEA]
[GO:0008380 "RNA splicing" evidence=IEA] [GO:0071013 "catalytic
step 2 spliceosome" evidence=ISO] InterPro:IPR004871 Pfam:PF03178
MGI:MGI:1289341 GO:GO:0008380 GO:GO:0006397 GO:GO:0003676
GO:GO:0071013 eggNOG:NOG247734 GeneTree:ENSGT00530000063396
HSSP:Q16531 GO:GO:0005689 KO:K12830 HOGENOM:HOG000216677
OMA:FDTIPVA CTD:23450 HOVERGEN:HBG093942 OrthoDB:EOG4RV2QJ
EMBL:AK085705 EMBL:AK088268 EMBL:AK129035 EMBL:AK147914
EMBL:BC011412 EMBL:BC031197 EMBL:BC042580 IPI:IPI00122011
IPI:IPI00625759 RefSeq:NP_598714.1 UniGene:Mm.236123
ProteinModelPortal:Q921M3 IntAct:Q921M3 STRING:Q921M3
PhosphoSite:Q921M3 PaxDb:Q921M3 PRIDE:Q921M3
Ensembl:ENSMUST00000042012 GeneID:101943 KEGG:mmu:101943
UCSC:uc009nlc.1 InParanoid:Q921M3 NextBio:355190 Bgee:Q921M3
CleanEx:MM_SF3B3 Genevestigator:Q921M3 Uniprot:Q921M3
Length = 1217
Score = 125 (49.1 bits), Expect = 5.9e-08, Sum P(5) = 5.9e-08
Identities = 105/550 (19%), Positives = 211/550 (38%)
Query: 914 ITIFK-NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYV 972
+ +F+ + G + SR ++ R + P L ++ + + C G + +
Sbjct: 699 VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTP-LSYETLEFASGFASEQCPEGIVAI 757
Query: 973 TSQGILKICQLPS-GSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQ 1031
S L+I L G+ ++ Q PL+ TP + E N LI+ +
Sbjct: 758 -STNTLRILALEKLGAVFN-----QVAFPLQYTPRKFVIHPESN--NLIIIETDHNAYTE 809
Query: 1032 VLSLLIDQEVGHQI------DNHNLSSVDLHRTYTVEEYEVRILEPDRAG-GPWQT--RA 1082
Q++ ++ D L++ ++ + E I +AG G W + R
Sbjct: 810 ATKAQRKQQMAEEMVEAAGEDERELAA-EMAAAFLNENLPESIFGAPKAGNGQWASVIRV 868
Query: 1083 TIPMQSS----------ENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLL 1132
P+Q + E A +V V F+ T ++ L+ + + A G V
Sbjct: 869 MNPIQGNTLDLVQLEQNEAAFSVAVCR-FSNTGEDWYVLVGVAKDLILSPRSVAGGFVYT 927
Query: 1133 FSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFY 1192
+ N + + L + ++ +A+A QG +LI G + ++ +L
Sbjct: 928 YKLVNNGEKLEFL----HKTPVEEVPAAIAPFQGRVLIGVGKLLRVYDLGKKKLLRKCEN 983
Query: 1193 DAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGS 1252
Y+ + + + +++ D+ +S ++ +K QL + A D T L+D
Sbjct: 984 KHIANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPR-WVTTASLLDYD 1042
Query: 1253 TLSLVVSDEQKNIQIFYYAP----KMSESWKGQKLL-SRAEFHVG---AHVTKFLRLQML 1304
T++ +D+ NI + P ++ E G K L R + A V +
Sbjct: 1043 TVA--GADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGET 1100
Query: 1305 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPH 1361
S +T PG ++ L++ TL G IG + P ++ F Q ++ L P
Sbjct: 1101 VLSLQKTTLIPGGSES----LVYTTLSGGIGILVPFTSHEDHDF--FQHVEMHLRSEHPP 1154
Query: 1362 VAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSN 1421
+ G + SFR ++ K +++D +L + + +Q ++ + T ++
Sbjct: 1155 LCGRDHLSFRSYYFPVK-------NVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKK 1207
Query: 1422 LNDLALGTSF 1431
L D+ +F
Sbjct: 1208 LEDIRTRYAF 1217
Score = 84 (34.6 bits), Expect = 5.9e-08, Sum P(5) = 5.9e-08
Identities = 30/97 (30%), Positives = 47/97 (48%)
Query: 537 SNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
S + ELPG +WTV R H +DE+ AY+I+S T+VL +
Sbjct: 445 SEMAVSELPGNPNAVWTV-----RRH---------IEDEFDAYIIVSFVNATLVLSIGET 490
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
+ EVT+S + T++ +L G ++QV+ G R
Sbjct: 491 VEEVTDS-GFLGTTPTLSC-SLLGDDALVQVYPDGIR 525
Score = 54 (24.1 bits), Expect = 5.9e-08, Sum P(5) = 5.9e-08
Identities = 13/36 (36%), Positives = 23/36 (63%)
Query: 670 YVLLGMSDGSIRLLVGDPSTCT--VSVQT-PAAIES 702
++ +G+ D ++R++ DPS C +S+Q PA ES
Sbjct: 604 FLAVGLVDNTVRIISLDPSDCLQPLSMQALPAQPES 639
Score = 50 (22.7 bits), Expect = 5.9e-08, Sum P(5) = 5.9e-08
Identities = 19/82 (23%), Positives = 33/82 (40%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + L + + + + + F+ S G+ L Q
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 T-CGSGTSM--LSSGLKEEFGD 456
G SS + E GD
Sbjct: 362 AHLGDDDEEPEFSSAMPLEEGD 383
Score = 46 (21.3 bits), Expect = 5.9e-08, Sum P(5) = 5.9e-08
Identities = 15/61 (24%), Positives = 25/61 (40%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+D I++ + +I +LE+ S + F K G G + VDP+G
Sbjct: 75 KDYIVVGSDSGRIVILEYQPSKNMFEKIHQETFG-------KSGCRRIVPGQFLAVDPKG 127
Query: 190 R 190
R
Sbjct: 128 R 128
Score = 41 (19.5 bits), Expect = 1.0e-06, Sum P(5) = 1.0e-06
Identities = 20/68 (29%), Positives = 32/68 (47%)
Query: 665 SIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRK 724
SI Y+ +G+ +G + V DP T +S + S +PV + +G E L
Sbjct: 659 SIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGS--RPVKLFRV-RMQGQEAVLAM 715
Query: 725 TSTDAWLS 732
+S +WLS
Sbjct: 716 SSR-SWLS 722
Score = 39 (18.8 bits), Expect = 2.8e-07, Sum P(5) = 2.8e-07
Identities = 7/23 (30%), Positives = 11/23 (47%)
Query: 12 PTGIANCGSGFITHSRADYVPQI 34
P+G+ C +IT+ P I
Sbjct: 245 PSGVLICSENYITYKNFGDQPDI 267
>UNIPROTKB|E2RR33 [details] [associations]
symbol:SF3B3 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0071013 "catalytic step 2 spliceosome"
evidence=IEA] [GO:0005689 "U12-type spliceosomal complex"
evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
InterPro:IPR004871 Pfam:PF03178 GO:GO:0003676 GO:GO:0071013
GeneTree:ENSGT00530000063396 GO:GO:0005689 KO:K12830 OMA:FDTIPVA
CTD:23450 EMBL:AAEX03004077 RefSeq:XP_536791.2
Ensembl:ENSCAFT00000032086 GeneID:479659 KEGG:cfa:479659
Uniprot:E2RR33
Length = 1217
Score = 123 (48.4 bits), Expect = 9.4e-08, Sum P(5) = 9.4e-08
Identities = 105/550 (19%), Positives = 210/550 (38%)
Query: 914 ITIFK-NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYV 972
+ +F+ + G + SR ++ R + P L ++ + + C G + +
Sbjct: 699 VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTP-LSYETLEFASGFASEQCPEGIVAI 757
Query: 973 TSQGILKICQLPS-GSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQ 1031
S L+I L G+ ++ Q PL+ TP + E N LI+ +
Sbjct: 758 -STNTLRILALEKLGAVFN-----QVAFPLQYTPRKFVIHPESN--NLIIIETDHNAYTE 809
Query: 1032 VLSLLIDQEVGHQI------DNHNLSSVDLHRTYTVEEYEVRILEPDRAG-GPWQT--RA 1082
Q++ ++ D L++ ++ + E I +AG G W + R
Sbjct: 810 ATKAQRKQQMAEEMVEAAGEDERELAA-EMAAAFLNENLPESIFGAPKAGNGQWASVIRV 868
Query: 1083 TIPMQSS----------ENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLL 1132
P+Q + E A +V V F+ T + L+ + + A G V
Sbjct: 869 MNPIQGNTLDLVQLEQNEAAFSVAVCR-FSNTGDDWYVLVGVAKDLILNPRSVAGGFVYT 927
Query: 1133 FSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFY 1192
+ N + + L + ++ +A+A QG +LI G + ++ +L
Sbjct: 928 YKLVNNGEKLEFL----HKTPVEEVPAAIAPFQGRVLIGVGKLLRVYDLGKKKLLRKCEN 983
Query: 1193 DAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGS 1252
Y+ + + + +++ D+ +S ++ +K QL + A D T L+D
Sbjct: 984 KHIANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPR-WVTTASLLDYD 1042
Query: 1253 TLSLVVSDEQKNIQIFYYAP----KMSESWKGQKLL-SRAEFHVG---AHVTKFLRLQML 1304
T++ +D+ NI + P ++ E G K L R + A V +
Sbjct: 1043 TVA--GADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGET 1100
Query: 1305 ATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPH 1361
S +T PG ++ L++ TL G IG + P ++ F Q ++ L P
Sbjct: 1101 VLSLQKTTLIPGGSES----LVYTTLSGGIGILVPFTSHEDHDF--FQHVEMHLRSEHPP 1154
Query: 1362 VAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSN 1421
+ G + SFR ++ K +++D +L + + +Q ++ + T ++
Sbjct: 1155 LCGRDHLSFRSYYFPVK-------NVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKK 1207
Query: 1422 LNDLALGTSF 1431
L D+ +F
Sbjct: 1208 LEDIRTRYAF 1217
Score = 84 (34.6 bits), Expect = 9.4e-08, Sum P(5) = 9.4e-08
Identities = 30/97 (30%), Positives = 47/97 (48%)
Query: 537 SNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
S + ELPG +WTV R H +DE+ AY+I+S T+VL +
Sbjct: 445 SEMAVSELPGNPNAVWTV-----RRH---------IEDEFDAYIIVSFVNATLVLSIGET 490
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
+ EVT+S + T++ +L G ++QV+ G R
Sbjct: 491 VEEVTDS-GFLGTTPTLSC-SLLGDDALVQVYPDGIR 525
Score = 54 (24.1 bits), Expect = 9.4e-08, Sum P(5) = 9.4e-08
Identities = 13/36 (36%), Positives = 23/36 (63%)
Query: 670 YVLLGMSDGSIRLLVGDPSTCT--VSVQT-PAAIES 702
++ +G+ D ++R++ DPS C +S+Q PA ES
Sbjct: 604 FLAVGLVDNTVRIISLDPSDCLQPLSMQALPAQPES 639
Score = 50 (22.7 bits), Expect = 9.4e-08, Sum P(5) = 9.4e-08
Identities = 19/82 (23%), Positives = 33/82 (40%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + L + + + + + F+ S G+ L Q
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 T-CGSGTSM--LSSGLKEEFGD 456
G SS + E GD
Sbjct: 362 AHLGDDDEEPEFSSAMPLEEGD 383
Score = 46 (21.3 bits), Expect = 9.4e-08, Sum P(5) = 9.4e-08
Identities = 15/61 (24%), Positives = 25/61 (40%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+D I++ + +I +LE+ S + F K G G + VDP+G
Sbjct: 75 KDYIVVGSDSGRIVILEYQPSKNMFEKIHQETFG-------KSGCRRIVPGQFLAVDPKG 127
Query: 190 R 190
R
Sbjct: 128 R 128
Score = 41 (19.5 bits), Expect = 1.6e-06, Sum P(5) = 1.6e-06
Identities = 20/68 (29%), Positives = 32/68 (47%)
Query: 665 SIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRK 724
SI Y+ +G+ +G + V DP T +S + S +PV + +G E L
Sbjct: 659 SIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGS--RPVKLFRV-RMQGQEAVLAM 715
Query: 725 TSTDAWLS 732
+S +WLS
Sbjct: 716 SSR-SWLS 722
Score = 39 (18.8 bits), Expect = 4.4e-07, Sum P(5) = 4.4e-07
Identities = 7/23 (30%), Positives = 11/23 (47%)
Query: 12 PTGIANCGSGFITHSRADYVPQI 34
P+G+ C +IT+ P I
Sbjct: 245 PSGVLICSENYITYKNFGDQPDI 267
>ASPGD|ASPL0000031473 [details] [associations]
symbol:AN5452 species:162425 "Emericella nidulans"
[GO:0008150 "biological_process" evidence=ND] [GO:0003676 "nucleic
acid binding" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178 GO:GO:0008380
Gene3D:2.130.10.10 GO:GO:0006397 GO:GO:0005681 GO:GO:0003676
GO:GO:0007049 EMBL:BN001305 EMBL:AACD01000094 eggNOG:NOG247734
KO:K12830 RefSeq:XP_663056.1 STRING:Q5B1X8 GeneID:2871744
KEGG:ani:AN5452.2 HOGENOM:HOG000216677 OMA:FDTIPVA
OrthoDB:EOG4FR40R Uniprot:Q5B1X8
Length = 1209
Score = 133 (51.9 bits), Expect = 1.0e-07, Sum P(5) = 1.0e-07
Identities = 101/478 (21%), Positives = 197/478 (41%)
Query: 965 CNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATP-HQITYFAEKNLYPLIVSV 1023
C G + + Q L+I + DN +Q+ IPL TP H I + E Y +
Sbjct: 760 CVEGMVGIQGQN-LRIFSIEK---LDNNM-LQQSIPLAYTPRHFIKHPEEPLFYVIEADN 814
Query: 1024 PVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYE--VRILEPDRAGGPWQTR 1081
VL P + + L++ D L D + ++I++P A
Sbjct: 815 NVLSPATR--ARLLEDSKARGGDTTVLPPEDFGYPRGTGHWASCIQIIDPLDAKA---VV 869
Query: 1082 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTA--YVQGEDVAARGRVLLFSTGRNA 1139
+ ++ +E A+++ V T++++ET L +GTA +A G + ++ R
Sbjct: 870 GAVELEENEAAVSIAAVPF---TSQDDETFLVVGTAKDMTVNPPSSAGGYIHIY---RFQ 923
Query: 1140 DNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYV 1199
++ + L ++ +++ AL QG LL G + ++ +L P +
Sbjct: 924 EDGKELEF-IHKTKVEEPPLALLGFQGRLLAGVGSVLRIYDLGMKQLLRKCQAAVAPKAI 982
Query: 1200 VSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEF-LIDGSTLSLVV 1258
V L + I++ D+ +S+ ++ +K Q L D S+ + T ++D T +
Sbjct: 983 VGLQTQGSRIVVSDVRESVTYVVYKYQDNVLIPFVDD--SIARWTTAATMVDYETTA--G 1038
Query: 1259 SDEQKNIQIFYYAPKMSES----WKGQKLLSRAEFHVGAHVTKFLRL----QMLATSSDR 1310
D+ N+ + K SE G L+ + G L + Q + TS +
Sbjct: 1039 GDKFGNLWLVRCPKKASEEADEEGSGAHLIHDRGYLQGTPNRLELMIHVFTQDIPTSLHK 1098
Query: 1311 TGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNP 1367
T G R L++ G+IG + P +++ F QSL+ +L P +AG +
Sbjct: 1099 TQLVAGG----RDILVWTGFQGTIGILVPFVSREDVDF--FQSLEMQLASQCPPLAGRDH 1152
Query: 1368 RSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+R +++ K ++D +L Y +L + ++ IA + + +I ++D+
Sbjct: 1153 LIYRSYYAPVKG-------VIDGDLCEQYFLLSNDTKMMIAAELDRSVREIERKISDM 1203
Score = 84 (34.6 bits), Expect = 1.0e-07, Sum P(5) = 1.0e-07
Identities = 20/60 (33%), Positives = 33/60 (55%)
Query: 573 DEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
DE+ AY+++S T+VL + + EVT++ + T+A L G +IQ+ RG R
Sbjct: 484 DEFDAYIVLSFANGTLVLSIGETVEEVTDT-GFLSSAPTLAVQQL-GEDSLIQIHPRGIR 541
Score = 53 (23.7 bits), Expect = 1.0e-07, Sum P(5) = 1.0e-07
Identities = 21/80 (26%), Positives = 38/80 (47%)
Query: 111 GNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHL 170
G + +LA G++ +D II+ + +I+++E+ S + R +H E+
Sbjct: 66 GIIRTLAAFRLAGSN----KDYIIIGSDSGRITIIEYVPSQN--RFNRIH-LET----FG 114
Query: 171 KRGRESFARGPLVKVDPQGR 190
K G G + VDP+GR
Sbjct: 115 KSGVRRVVPGQYLAVDPKGR 134
Score = 46 (21.3 bits), Expect = 1.0e-07, Sum P(5) = 1.0e-07
Identities = 12/40 (30%), Positives = 20/40 (50%)
Query: 666 IADPYVLLGMSDGSIRLLVGDPSTC--TVSVQTPAAIESS 703
+ ++ +G D ++R+L DP T SVQ A S+
Sbjct: 616 VRSSFLAVGCDDSTVRILSLDPDTTLENKSVQALTAAPSA 655
Score = 40 (19.1 bits), Expect = 1.0e-07, Sum P(5) = 1.0e-07
Identities = 18/67 (26%), Positives = 32/67 (47%)
Query: 378 LLSTKTGDLVLLTV--VYD--GRV---VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLG 430
LL T+ GDL LT+ V D G++ V+ L + + L S + + + ++ + G
Sbjct: 307 LLQTEDGDLFKLTLDMVEDDKGQLTGEVKGLKIKYFDTVPLASSLLILKSGFLYVAAEGG 366
Query: 431 DSLLVQF 437
+ QF
Sbjct: 367 NHHFYQF 373
>WB|WBGene00019323 [details] [associations]
symbol:teg-4 species:6239 "Caenorhabditis elegans"
[GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005634
"nucleus" evidence=IEA] [GO:0009792 "embryo development ending in
birth or egg hatching" evidence=IMP] [GO:0040035 "hermaphrodite
genitalia development" evidence=IMP] [GO:0009790 "embryo
development" evidence=IMP] [GO:0001703 "gastrulation with mouth
forming first" evidence=IMP] [GO:0000003 "reproduction"
evidence=IMP] [GO:0040007 "growth" evidence=IMP] [GO:0002119
"nematode larval development" evidence=IMP] [GO:0002009
"morphogenesis of an epithelium" evidence=IMP] [GO:0042127
"regulation of cell proliferation" evidence=IMP] [GO:0040020
"regulation of meiosis" evidence=IMP] [GO:0008406 "gonad
development" evidence=IMP] [GO:0016477 "cell migration"
evidence=IMP] [GO:0007281 "germ cell development" evidence=IMP]
InterPro:IPR017986 InterPro:IPR004871 InterPro:IPR015943
Pfam:PF03178 GO:GO:0005634 GO:GO:0009792 GO:GO:0002009
GO:GO:0040007 GO:GO:0016477 GO:GO:0008406 GO:GO:0002119
Gene3D:2.130.10.10 SUPFAM:SSF50978 GO:GO:0003676 GO:GO:0042127
GO:GO:0040035 GO:GO:0007281 GO:GO:0040020 eggNOG:NOG247734
GeneTree:ENSGT00530000063396 GO:GO:0001703 KO:K12830
HOGENOM:HOG000216677 OMA:FDTIPVA EMBL:FO081029 PIR:T32916
RefSeq:NP_491953.1 ProteinModelPortal:O44985 STRING:O44985
PaxDb:O44985 EnsemblMetazoa:K02F2.3 GeneID:172406
KEGG:cel:CELE_K02F2.3 UCSC:K02F2.3 CTD:172406 WormBase:K02F2.3
InParanoid:O44985 NextBio:875387 Uniprot:O44985
Length = 1220
Score = 149 (57.5 bits), Expect = 1.1e-07, Sum P(5) = 1.1e-07
Identities = 64/317 (20%), Positives = 145/317 (45%)
Query: 1127 RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTEL 1186
RG V F N D L + E + A+ +G L+ G + ++ +L
Sbjct: 925 RGCVYTFHLSANGDRFDFL----HRTETPLPVGAIHDFRGMALVGFGRFLRMYDIGQKKL 980
Query: 1187 NGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT- 1245
P+ +V++ I++ D +S++FL +++ QL + A D + + T
Sbjct: 981 LAKCENKNFPVSIVNIQSTGQRIIVSDSQESVHFLRYRKGDNQLVVFADD--TTPRYVTC 1038
Query: 1246 EFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLA 1305
++D T++ V+D+ N+ + +++E + +S++ + G ++++++
Sbjct: 1039 VCVLDYHTVA--VADKFGNLAVVRLPERVNEDVQDDPTVSKSVWDRGWLNGASQKVELVS 1096
Query: 1306 --------TSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKK 1354
TS +T PG+++ AL++ T+ G+IGC+ DE+ F +L+
Sbjct: 1097 NFFIGDTITSLQKTSLMPGANE----ALVYTTIGGAIGCLVSFMSKDEVDF--FTNLEMH 1150
Query: 1355 LVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTT 1414
+ P + G + ++R +++ K S++D ++ + ++ ++Q ++A + G T
Sbjct: 1151 VRSEYPPLCGRDHLAYRSYYAPCK-------SVIDGDICEQFSLMDTQKQKDVAEELGKT 1203
Query: 1415 RSQILSNLNDLALGTSF 1431
S+I L D+ +F
Sbjct: 1204 VSEISKKLEDIRTRYAF 1220
Score = 72 (30.4 bits), Expect = 1.1e-07, Sum P(5) = 1.1e-07
Identities = 31/131 (23%), Positives = 59/131 (45%)
Query: 507 DSLVNIGPLKDFSYGLRINADA----SATGISKQSNYELVELPGCKGIWTVYHKSSRGH- 561
DS+ ++ PL D G DA S G +S+ +++ G + I + G+
Sbjct: 399 DSMDSLSPLTDAVIGDIAREDAAQIYSLVGRGARSSLKVLR-NGLE-ISEMAVSDLPGNP 456
Query: 562 NADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRR 621
NA + +D+Y +Y+++S T+ L D + E ++S + TI + G
Sbjct: 457 NAVWTVKKNIEDQYDSYIVVSFVNATLALTIGDTVEEASDS-GFLPTTPTIGCA-MIGDD 514
Query: 622 RVIQVFERGAR 632
++Q++ G R
Sbjct: 515 SLVQIYSEGIR 525
Score = 49 (22.3 bits), Expect = 1.1e-07, Sum P(5) = 1.1e-07
Identities = 26/103 (25%), Positives = 43/103 (41%)
Query: 93 MDGISAASLELVCHYRLHGNVESL-AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSI 151
+D ++ ++++CH + G V SL A G RD I + + +I +L+++
Sbjct: 44 LDTVTG-KIKVMCHQDIFGIVRSLLAFRLTAGT-----RDFIAVGSDSGRIVILQYN--- 94
Query: 152 HGLRITSMHCFESPEWLHL----KRGRESFARGPLVKVDPQGR 190
CFE LH K G G + DP+GR
Sbjct: 95 -----AEKTCFER---LHQETFGKTGCRRIVPGHFLVGDPRGR 129
Score = 45 (20.9 bits), Expect = 1.1e-07, Sum P(5) = 1.1e-07
Identities = 18/84 (21%), Positives = 37/84 (44%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L+ + GD+ +T+ D +V + L + + + + + F+ + G+ L Q
Sbjct: 303 LVQAENGDIFKVTLETDEDLVSEMKLKYFDTVPPANALCILKSGFLFVAAEFGNHELYQI 362
Query: 438 -TCGSGTS-MLSSGLKEEFGDIEA 459
+ G G SS + FG+ +A
Sbjct: 363 ASLGEGDDDEFSSAMG--FGENDA 384
Score = 40 (19.1 bits), Expect = 1.1e-07, Sum P(5) = 1.1e-07
Identities = 7/27 (25%), Positives = 16/27 (59%)
Query: 670 YVLLGMSDGSIRLLVGDPSTCTVSVQT 696
++ LG D ++R++ DP+ + + T
Sbjct: 604 FLALGTVDNAVRIISLDPNDMLMPLST 630
Score = 39 (18.8 bits), Expect = 0.00012, Sum P(3) = 0.00012
Identities = 30/140 (21%), Positives = 57/140 (40%)
Query: 553 VYHKSSRGHNADSSRMAAYD-DEYHAYLIISLEARTMV-LETADLLTEVTESVDYFVQGR 610
+Y +S G D +A E A E ++++ +++ D L+ +T++V + R
Sbjct: 359 LYQIASLGEGDDDEFSSAMGFGENDAAFFEPHELKSLIPIDSMDSLSPLTDAVIGDI-AR 417
Query: 611 TIAAG--NLFGR--RRVIQVFERGARILDGSYMTQDLSFGPXXXXXXXXXXXXTVLSVSI 666
AA +L GR R ++V G I + + DL P
Sbjct: 418 EDAAQIYSLVGRGARSSLKVLRNGLEISEMA--VSDLPGNPNAVWTVKKNIEDQY----- 470
Query: 667 ADPYVLLGMSDGSIRLLVGD 686
D Y+++ + ++ L +GD
Sbjct: 471 -DSYIVVSFVNATLALTIGD 489
Score = 38 (18.4 bits), Expect = 5.2e-07, Sum P(5) = 5.2e-07
Identities = 7/12 (58%), Positives = 8/12 (66%)
Query: 215 GDEDTFGSGGGF 226
GD+D F S GF
Sbjct: 368 GDDDEFSSAMGF 379
>UNIPROTKB|F5H0Y5 [details] [associations]
symbol:DDB1 "DNA damage-binding protein 1" species:9606
"Homo sapiens" [GO:0003676 "nucleic acid binding" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] [GO:0016055 "Wnt receptor
signaling pathway" evidence=IEA] InterPro:IPR004871
InterPro:IPR015943 Pfam:PF03178 GO:GO:0005634 GO:GO:0016055
Gene3D:2.130.10.10 GO:GO:0003684 EMBL:AP003108 HGNC:HGNC:2717
ChiTaRS:DDB1 EMBL:AP003037 IPI:IPI00909177
ProteinModelPortal:F5H0Y5 SMR:F5H0Y5 Ensembl:ENST00000539332
ArrayExpress:F5H0Y5 Bgee:F5H0Y5 Uniprot:F5H0Y5
Length = 204
Score = 143 (55.4 bits), Expect = 1.9e-07, P = 1.9e-07
Identities = 56/214 (26%), Positives = 98/214 (45%)
Query: 1127 RGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTG--- 1183
+GR+++F + +D V E KE+KGA+ ++ G LL + + L++WT
Sbjct: 11 QGRIVVF---QYSDGKLQTVAE---KEVKGAVYSMVEFNGKLLASINSTVRLYEWTTEKE 64
Query: 1184 --TELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLD 1241
TE N + + LY L +FIL+GD+ +S+ L++K +A+DF
Sbjct: 65 LRTECNH--YNNIMALY---LKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNW 119
Query: 1242 CFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRL 1301
A E L D + L ++ N+ + + + Q L FH+G V F
Sbjct: 120 MSAVEILDDDNFLG---AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFCHG 176
Query: 1302 QMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIG 1335
++ + T + P + ++LFGT++G IG
Sbjct: 177 SLVMQNLGET-STP-----TQGSVLFGTVNGMIG 204
>UNIPROTKB|F1P529 [details] [associations]
symbol:SF3B3 "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0003676 "nucleic acid binding" evidence=IEA]
[GO:0005689 "U12-type spliceosomal complex" evidence=IEA]
[GO:0071013 "catalytic step 2 spliceosome" evidence=IEA]
InterPro:IPR004871 Pfam:PF03178 GO:GO:0003676 GO:GO:0071013
GeneTree:ENSGT00530000063396 GO:GO:0005689 OMA:FDTIPVA
EMBL:AADN02051593 EMBL:AADN02051594 IPI:IPI00576925
Ensembl:ENSGALT00000003987 ArrayExpress:F1P529 Uniprot:F1P529
Length = 1228
Score = 116 (45.9 bits), Expect = 2.4e-07, Sum P(6) = 2.4e-07
Identities = 75/384 (19%), Positives = 152/384 (39%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1125
+R++ P + T + ++ +E A +V V F+ T +E L+ + +
Sbjct: 866 IRVMNPIQGN----TLDLVQLEQNEAAFSVAVCR-FSNTGEEWYVLVGVAKDLILNPRSV 920
Query: 1126 ARGRVLLFSTGRNADNPQNLVTE------VYSKELKGAISALASLQGHLLIASGPKIILH 1179
A G V + + LV ++ ++ +A+A QG +LI G + ++
Sbjct: 921 AGGFVYTYKLVNGGEXTYKLVNGGEKLEFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY 980
Query: 1180 KWTGTEL-NGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFG 1238
+L Y+ + + + +++ D+ +S ++ +K QL + A D
Sbjct: 981 DLGKKKLLRKCENKKHIANYICGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTY 1040
Query: 1239 SLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP----KMSESWKGQKLL-SRAEFHVG- 1292
T L+D T++ +D+ NI + P ++ E G K L R +
Sbjct: 1041 PR-WVTTATLLDYDTVA--GADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGAS 1097
Query: 1293 --AHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRR 1347
A V + S +T PG ++ L++ TL G IG + P ++ F
Sbjct: 1098 QKAEVIMNYHVGETVLSLQKTTLIPGGSES----LVYTTLSGGIGILVPFTSHEDHDF-- 1151
Query: 1348 LQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEI 1407
Q ++ L P + G + SFR ++ K +++D +L + + +Q +
Sbjct: 1152 FQHVEMHLRSEHPPLCGRDHLSFRSYYFPVK-------NVIDGDLCEQFNSMEPNKQKNV 1204
Query: 1408 AHQTGTTRSQILSNLNDLALGTSF 1431
A + T ++ L D+ +F
Sbjct: 1205 AEELDRTPPEVSKKLEDIRTRYAF 1228
Score = 84 (34.6 bits), Expect = 2.4e-07, Sum P(6) = 2.4e-07
Identities = 30/97 (30%), Positives = 47/97 (48%)
Query: 537 SNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
S + ELPG +WTV R H +DE+ AY+I+S T+VL +
Sbjct: 445 SEMAVSELPGNPNAVWTV-----RRH---------VEDEFDAYIIVSFVNATLVLSIGET 490
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
+ EVT+S + T++ +L G ++QV+ G R
Sbjct: 491 VEEVTDS-GFLGTTPTLSC-SLLGDDALVQVYPDGIR 525
Score = 54 (24.1 bits), Expect = 2.4e-07, Sum P(6) = 2.4e-07
Identities = 13/36 (36%), Positives = 23/36 (63%)
Query: 670 YVLLGMSDGSIRLLVGDPSTCT--VSVQT-PAAIES 702
++ +G+ D ++R++ DPS C +S+Q PA ES
Sbjct: 604 FLAVGLVDNTVRIISLDPSDCLQPLSMQALPAQPES 639
Score = 50 (22.7 bits), Expect = 2.4e-07, Sum P(6) = 2.4e-07
Identities = 19/82 (23%), Positives = 33/82 (40%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + L + + + + + F+ S G+ L Q
Sbjct: 302 LAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGFLFVASEFGNHYLYQI 361
Query: 438 T-CGSGTSM--LSSGLKEEFGD 456
G SS + E GD
Sbjct: 362 AHLGDDDEEPEFSSAMPLEEGD 383
Score = 45 (20.9 bits), Expect = 2.4e-07, Sum P(6) = 2.4e-07
Identities = 15/61 (24%), Positives = 25/61 (40%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+D I++ + +I +LE+ S + F K G G + VDP+G
Sbjct: 75 KDYIVVGSDSGRIVILEYQPSKNVFEKIHQETFG-------KSGCRRIVPGQYLAVDPKG 127
Query: 190 R 190
R
Sbjct: 128 R 128
Score = 45 (20.9 bits), Expect = 2.4e-07, Sum P(6) = 2.4e-07
Identities = 21/104 (20%), Positives = 41/104 (39%)
Query: 914 ITIFK-NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYV 972
+ +F+ + G + SR ++ R + P L ++ + + C G + +
Sbjct: 699 VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTP-LSYETLEFASGFASEQCPEGIVAI 757
Query: 973 TSQGILKICQLPS-GSTYDNYWPVQKVIPLKATPHQITYFAEKN 1015
S L+I L G+ ++ Q PL+ TP + E N
Sbjct: 758 -STNTLRILALEKLGAVFN-----QVAFPLQYTPRKFVIHPESN 795
Score = 41 (19.5 bits), Expect = 1.0e-05, Sum P(5) = 1.0e-05
Identities = 20/68 (29%), Positives = 32/68 (47%)
Query: 665 SIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRK 724
SI Y+ +G+ +G + V DP T +S + S +PV + +G E L
Sbjct: 659 SIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGS--RPVKLFRV-RMQGQEAVLAM 715
Query: 725 TSTDAWLS 732
+S +WLS
Sbjct: 716 SSR-SWLS 722
Score = 39 (18.8 bits), Expect = 8.6e-07, Sum P(6) = 8.6e-07
Identities = 7/23 (30%), Positives = 11/23 (47%)
Query: 12 PTGIANCGSGFITHSRADYVPQI 34
P+G+ C +IT+ P I
Sbjct: 245 PSGVLICSENYITYKNFGDQPDI 267
>ZFIN|ZDB-GENE-040426-2901 [details] [associations]
symbol:sf3b3 "splicing factor 3b, subunit 3"
species:7955 "Danio rerio" [GO:0003676 "nucleic acid binding"
evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0005681
"spliceosomal complex" evidence=IEA] [GO:0006397 "mRNA processing"
evidence=IEA] [GO:0008380 "RNA splicing" evidence=IEA]
InterPro:IPR017986 InterPro:IPR004871 InterPro:IPR015943
Pfam:PF03178 ZFIN:ZDB-GENE-040426-2901 GO:GO:0008380
Gene3D:2.130.10.10 SUPFAM:SSF50978 GO:GO:0006397 GO:GO:0005681
GO:GO:0003676 eggNOG:NOG247734 GeneTree:ENSGT00530000063396
KO:K12830 HOGENOM:HOG000216677 OMA:FDTIPVA CTD:23450
HOVERGEN:HBG093942 OrthoDB:EOG4RV2QJ EMBL:BX784024 EMBL:BC047171
IPI:IPI00508652 RefSeq:NP_998668.1 RefSeq:XP_002667683.2
UniGene:Dr.76176 STRING:Q1LVE8 PRIDE:Q1LVE8
Ensembl:ENSDART00000008310 Ensembl:ENSDART00000122831
Ensembl:ENSDART00000129666 Ensembl:ENSDART00000147743
GeneID:100334114 GeneID:406824 KEGG:dre:100334114 KEGG:dre:406824
InParanoid:Q1LVE8 NextBio:20818331 Bgee:Q1LVE8 Uniprot:Q1LVE8
Length = 1217
Score = 117 (46.2 bits), Expect = 4.7e-07, Sum P(5) = 4.7e-07
Identities = 56/283 (19%), Positives = 117/283 (41%)
Query: 1160 ALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIY 1219
A+A QG +L+ G + ++ +L P V ++ + +++ D+ +S++
Sbjct: 951 AIAPFQGRVLVGVGKLLRIYDLGKKKLLRKCENKHVPNLVTGIHTIGQRVIVSDVQESLF 1010
Query: 1220 FLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMS---- 1275
++ ++ QL + A D T L+D T++ +D+ NI + P S
Sbjct: 1011 WVRYRRNENQLIIFADDTYPR-WITTACLLDYDTMAS--ADKFGNICVVRLPPNTSDDVD 1067
Query: 1276 ESWKGQKLL-SRAEFHVGAHVTKFL---RLQMLATSSDRTGAAPGSDKTNRFALLFGTLD 1331
E G K L R + + + + + S +T PG ++ L++ TL
Sbjct: 1068 EDPTGNKALWDRGLLNGASQKAEIIINYHIGETVLSLQKTTLIPGGSES----LVYTTLS 1123
Query: 1332 GSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1388
G IG + P ++ F Q L+ + P + G + SFR ++ K +++
Sbjct: 1124 GGIGILVPFTSHEDHDF--FQHLEMHMRSEFPPLCGRDHLSFRSYYFPVK-------NVI 1174
Query: 1389 DCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
D +L + + +Q ++ + T ++ L D+ +F
Sbjct: 1175 DGDLCEQFNSMDPHKQKSVSEELDRTPPEVSKKLEDIRTRYAF 1217
Score = 86 (35.3 bits), Expect = 4.7e-07, Sum P(5) = 4.7e-07
Identities = 30/97 (30%), Positives = 47/97 (48%)
Query: 537 SNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
S + ELPG +WTV R H +DE+ AY+I+S T+VL +
Sbjct: 445 SEMAVSELPGNPNAVWTV-----RRH---------VEDEFDAYIIVSFVNATLVLSIGET 490
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
+ EVT+S + T++ +L G ++QV+ G R
Sbjct: 491 VEEVTDS-GFLGTTPTLSC-SLLGEDALVQVYPDGIR 525
Score = 54 (24.1 bits), Expect = 4.7e-07, Sum P(5) = 4.7e-07
Identities = 13/36 (36%), Positives = 23/36 (63%)
Query: 670 YVLLGMSDGSIRLLVGDPSTCT--VSVQT-PAAIES 702
++ +G+ D ++R++ DPS C +S+Q PA ES
Sbjct: 604 FLAVGLVDNTVRIISLDPSDCLQPLSMQALPAQPES 639
Score = 49 (22.3 bits), Expect = 4.7e-07, Sum P(5) = 4.7e-07
Identities = 18/82 (21%), Positives = 33/82 (40%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + + + + + + + F+ S G+ L Q
Sbjct: 302 LAQTEQGDIFKVTLETDEEMVTEIRMKYFDTIPVATAMCVLKTGFLFVSSEFGNHYLYQI 361
Query: 438 T-CGSGTSM--LSSGLKEEFGD 456
G SS + E GD
Sbjct: 362 AHLGDDDEEPEFSSAMPLEEGD 383
Score = 44 (20.5 bits), Expect = 4.7e-07, Sum P(5) = 4.7e-07
Identities = 14/61 (22%), Positives = 25/61 (40%)
Query: 130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
+D +++ + +I +LE+ S + F K G G + VDP+G
Sbjct: 75 KDYVVVGSDSGRIVILEYHPSKNMFEKIHQETFG-------KSGCRRIVPGQFLAVDPKG 127
Query: 190 R 190
R
Sbjct: 128 R 128
Score = 40 (19.1 bits), Expect = 9.9e-06, Sum P(5) = 9.9e-06
Identities = 40/172 (23%), Positives = 71/172 (41%)
Query: 670 YVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDA 729
Y+ +G+ +G + V DP T +S + S +PV + +G E L +S +
Sbjct: 664 YLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGS--RPVKLFRV-RMQGQEAVLAMSSR-S 719
Query: 730 WLS---------TGVG-EAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDK 779
WLS T + E ++ A G +Q +V + L I + VF
Sbjct: 720 WLSYSYQSRFHLTPLSYETLEYASGFASEQCP-EGIVAISTNTLRILALEKLGAVFNQVA 778
Query: 780 F---VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELA 828
F + R ++ + ET+ N+ +E Q RK+ + + ++VE A
Sbjct: 779 FPLQYTPRKFVIHPETNNLIL-IETDHNAYTEATKAQ-RKQQM-AEEMVEAA 827
Score = 39 (18.8 bits), Expect = 1.4e-06, Sum P(5) = 1.4e-06
Identities = 7/23 (30%), Positives = 11/23 (47%)
Query: 12 PTGIANCGSGFITHSRADYVPQI 34
P+G+ C +IT+ P I
Sbjct: 245 PSGVLICSENYITYKNFGDQPDI 267
>GENEDB_PFALCIPARUM|PFL1680w [details] [associations]
symbol:PFL1680w "splicing factor 3b, subunit 3,
130kD, putative" species:5833 "Plasmodium falciparum" [GO:0005681
"spliceosomal complex" evidence=ISS] [GO:0008380 "RNA splicing"
evidence=ISS] InterPro:IPR017986 InterPro:IPR004871
InterPro:IPR015943 Pfam:PF03178 GO:GO:0008380 Gene3D:2.130.10.10
SUPFAM:SSF50978 GO:GO:0005681 GO:GO:0003676 EMBL:AE014188 KO:K12830
HOGENOM:HOG000216677 RefSeq:XP_001350742.1
ProteinModelPortal:Q8I574 PRIDE:Q8I574
EnsemblProtists:PFL1680w:mRNA GeneID:811388 KEGG:pfa:PFL1680w
EuPathDB:PlasmoDB:PF3D7_1234800 OMA:PVTSSMC ProtClustDB:CLSZ2733835
Uniprot:Q8I574
Length = 1329
Score = 113 (44.8 bits), Expect = 8.0e-07, Sum P(4) = 8.0e-07
Identities = 62/273 (22%), Positives = 111/273 (40%)
Query: 1163 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1222
S G L+ + G K+ ++ +L Y P +VS+ I N I DI +S+
Sbjct: 1067 SYNGKLIASIGNKLRIYALGKKKLLKKCEYKDIPEAIVSIKISGNRIFACDIRESVLIFF 1126
Query: 1223 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES----- 1277
+ L L++ D +E L D T+ + +D+ ++ I + +
Sbjct: 1127 YDPNQNTLRLISDDIIPRWITCSEIL-DHHTI--MAADKFDSVFILRVPEEAKQDEYGIT 1183
Query: 1278 ---WKGQKLL-SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1333
W G +++ S + H+ F + + TS + +P S + +++ T+ G+
Sbjct: 1184 NKCWYGGEIMNSSTKNRKLEHMMSF-HIGEIVTSMQKVRLSPTSSE----CIIYSTIMGT 1238
Query: 1334 IGCIAPLDELTFRRL-QSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1392
IG P D L Q L+ L P + G FR ++ H P ++VD +L
Sbjct: 1239 IGAFIPYDNKEELELTQHLEIILRTEKPPLCGREHIFFRSYY-----H-P-VQNVVDGDL 1291
Query: 1393 LSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+ L + Q +IA+ T IL L D+
Sbjct: 1292 CEQFSSLSYDAQKKIANDLERTPEDILRKLEDI 1324
Score = 86 (35.3 bits), Expect = 8.0e-07, Sum P(4) = 8.0e-07
Identities = 18/68 (26%), Positives = 37/68 (54%)
Query: 574 EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 633
EY Y+++S E T++LE + + EV++++ + T N+ IQV++ G R
Sbjct: 501 EYDGYIVVSFEGNTLILEIGESVEEVSDTL--LLNNVTTLHINILYDNSFIQVYDTGIRH 558
Query: 634 LDGSYMTQ 641
++G + +
Sbjct: 559 INGKVVQE 566
Score = 58 (25.5 bits), Expect = 8.0e-07, Sum P(4) = 8.0e-07
Identities = 19/90 (21%), Positives = 43/90 (47%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L ++ + G + S++ G++ +D I++ + ++ +LE+++ + +H
Sbjct: 50 LNVIISKDIFGIIRSISTFRLTGSN----KDYIVIGSDSGRLVILEYNNEKNDF--VRVH 103
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGR 190
C E+ + K G G + VDP+GR
Sbjct: 104 C-ET----YGKTGIRRIIPGEYIAVDPKGR 128
Score = 50 (22.7 bits), Expect = 8.0e-07, Sum P(4) = 8.0e-07
Identities = 13/65 (20%), Positives = 32/65 (49%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L+ ++ GDL + V ++ +V+ + + + + I+ + + F+ + G+ QF
Sbjct: 334 LIQSEYGDLYKIEVDHEDGIVKEIVCKYFDTVPIGNSISVLKSGSLFVAAEFGNHYFYQF 393
Query: 438 TCGSG 442
+ G G
Sbjct: 394 S-GIG 397
Score = 39 (18.8 bits), Expect = 5.4e-05, Sum P(4) = 5.4e-05
Identities = 5/19 (26%), Positives = 11/19 (57%)
Query: 12 PTGIANCGSGFITHSRADY 30
P+G+ C F+ + + D+
Sbjct: 280 PSGVLICCENFLVYKKVDH 298
>UNIPROTKB|Q8I574 [details] [associations]
symbol:PFL1680w "Splicing factor 3b, subunit 3, 130kD,
putative" species:36329 "Plasmodium falciparum 3D7" [GO:0005681
"spliceosomal complex" evidence=ISS] [GO:0008380 "RNA splicing"
evidence=ISS] InterPro:IPR017986 InterPro:IPR004871
InterPro:IPR015943 Pfam:PF03178 GO:GO:0008380 Gene3D:2.130.10.10
SUPFAM:SSF50978 GO:GO:0005681 GO:GO:0003676 EMBL:AE014188 KO:K12830
HOGENOM:HOG000216677 RefSeq:XP_001350742.1
ProteinModelPortal:Q8I574 PRIDE:Q8I574
EnsemblProtists:PFL1680w:mRNA GeneID:811388 KEGG:pfa:PFL1680w
EuPathDB:PlasmoDB:PF3D7_1234800 OMA:PVTSSMC ProtClustDB:CLSZ2733835
Uniprot:Q8I574
Length = 1329
Score = 113 (44.8 bits), Expect = 8.0e-07, Sum P(4) = 8.0e-07
Identities = 62/273 (22%), Positives = 111/273 (40%)
Query: 1163 SLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLS 1222
S G L+ + G K+ ++ +L Y P +VS+ I N I DI +S+
Sbjct: 1067 SYNGKLIASIGNKLRIYALGKKKLLKKCEYKDIPEAIVSIKISGNRIFACDIRESVLIFF 1126
Query: 1223 WKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAPKMSES----- 1277
+ L L++ D +E L D T+ + +D+ ++ I + +
Sbjct: 1127 YDPNQNTLRLISDDIIPRWITCSEIL-DHHTI--MAADKFDSVFILRVPEEAKQDEYGIT 1183
Query: 1278 ---WKGQKLL-SRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGS 1333
W G +++ S + H+ F + + TS + +P S + +++ T+ G+
Sbjct: 1184 NKCWYGGEIMNSSTKNRKLEHMMSF-HIGEIVTSMQKVRLSPTSSE----CIIYSTIMGT 1238
Query: 1334 IGCIAPLDELTFRRL-QSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCEL 1392
IG P D L Q L+ L P + G FR ++ H P ++VD +L
Sbjct: 1239 IGAFIPYDNKEELELTQHLEIILRTEKPPLCGREHIFFRSYY-----H-P-VQNVVDGDL 1291
Query: 1393 LSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDL 1425
+ L + Q +IA+ T IL L D+
Sbjct: 1292 CEQFSSLSYDAQKKIANDLERTPEDILRKLEDI 1324
Score = 86 (35.3 bits), Expect = 8.0e-07, Sum P(4) = 8.0e-07
Identities = 18/68 (26%), Positives = 37/68 (54%)
Query: 574 EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARI 633
EY Y+++S E T++LE + + EV++++ + T N+ IQV++ G R
Sbjct: 501 EYDGYIVVSFEGNTLILEIGESVEEVSDTL--LLNNVTTLHINILYDNSFIQVYDTGIRH 558
Query: 634 LDGSYMTQ 641
++G + +
Sbjct: 559 INGKVVQE 566
Score = 58 (25.5 bits), Expect = 8.0e-07, Sum P(4) = 8.0e-07
Identities = 19/90 (21%), Positives = 43/90 (47%)
Query: 101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
L ++ + G + S++ G++ +D I++ + ++ +LE+++ + +H
Sbjct: 50 LNVIISKDIFGIIRSISTFRLTGSN----KDYIVIGSDSGRLVILEYNNEKNDF--VRVH 103
Query: 161 CFESPEWLHLKRGRESFARGPLVKVDPQGR 190
C E+ + K G G + VDP+GR
Sbjct: 104 C-ET----YGKTGIRRIIPGEYIAVDPKGR 128
Score = 50 (22.7 bits), Expect = 8.0e-07, Sum P(4) = 8.0e-07
Identities = 13/65 (20%), Positives = 32/65 (49%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L+ ++ GDL + V ++ +V+ + + + + I+ + + F+ + G+ QF
Sbjct: 334 LIQSEYGDLYKIEVDHEDGIVKEIVCKYFDTVPIGNSISVLKSGSLFVAAEFGNHYFYQF 393
Query: 438 TCGSG 442
+ G G
Sbjct: 394 S-GIG 397
Score = 39 (18.8 bits), Expect = 5.4e-05, Sum P(4) = 5.4e-05
Identities = 5/19 (26%), Positives = 11/19 (57%)
Query: 12 PTGIANCGSGFITHSRADY 30
P+G+ C F+ + + D+
Sbjct: 280 PSGVLICCENFLVYKKVDH 298
>RGD|1311636 [details] [associations]
symbol:Sf3b3 "splicing factor 3b, subunit 3" species:10116
"Rattus norvegicus" [GO:0003676 "nucleic acid binding"
evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0005689
"U12-type spliceosomal complex" evidence=ISO] [GO:0071013
"catalytic step 2 spliceosome" evidence=ISO] InterPro:IPR004871
Pfam:PF03178 RGD:1311636 GO:GO:0005634 GO:GO:0003676
IPI:IPI00563335 PRIDE:F1LSZ9 Ensembl:ENSRNOT00000044193
UCSC:RGD:1311636 ArrayExpress:F1LSZ9 Uniprot:F1LSZ9
Length = 902
Score = 103 (41.3 bits), Expect = 2.5e-06, Sum P(6) = 2.5e-06
Identities = 58/284 (20%), Positives = 116/284 (40%)
Query: 1159 SALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSI 1218
+A+A QG +LI G + ++ +L Y+ + + + +++ D+ +S
Sbjct: 635 AAIAPFQGRVLIGVGKLLRVYDLGKKKLLRKCENKHIANYISGIQTIGHRVIVSDVQESF 694
Query: 1219 YFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFYYAP----KM 1274
++ +K QL + A D T L+D T++ +D+ NI + P ++
Sbjct: 695 IWVRYKRNENQLIIFADDTYPR-WVTTASLLDYDTVA--GADKFGNICVVRLPPNTNDEV 751
Query: 1275 SESWKGQKLL-SRAEFHVG---AHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFGTL 1330
E G K L R + A V + S +T PG ++ L++ TL
Sbjct: 752 DEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLIPGGSES----LVYTTL 807
Query: 1331 DGSIGCIAPL---DELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSI 1387
G IG + P ++ F Q ++ L P + G + SFR ++ K ++
Sbjct: 808 SGGIGILVPFTSHEDHDF--FQHVEMHLRSEHPPLCGRDHLSFRSYYFPVK-------NV 858
Query: 1388 VDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALGTSF 1431
+D +L + + +Q ++ + T ++ L D+ +F
Sbjct: 859 IDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDIRTRYAF 902
Score = 84 (34.6 bits), Expect = 2.5e-06, Sum P(6) = 2.5e-06
Identities = 30/97 (30%), Positives = 47/97 (48%)
Query: 537 SNYELVELPGC-KGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADL 595
S + ELPG +WTV R H +DE+ AY+I+S T+VL +
Sbjct: 225 SEMAVSELPGNPNAVWTV-----RRH---------IEDEFDAYIIVSFVNATLVLSIGET 270
Query: 596 LTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
+ EVT+S + T++ +L G ++QV+ G R
Sbjct: 271 VEEVTDS-GFLGTTPTLSC-SLLGDDALVQVYPDGIR 305
Score = 54 (24.1 bits), Expect = 2.5e-06, Sum P(6) = 2.5e-06
Identities = 13/36 (36%), Positives = 23/36 (63%)
Query: 670 YVLLGMSDGSIRLLVGDPSTCT--VSVQT-PAAIES 702
++ +G+ D ++R++ DPS C +S+Q PA ES
Sbjct: 384 FLAVGLVDNTVRIISLDPSDCLQPLSMQALPAQPES 419
Score = 50 (22.7 bits), Expect = 2.5e-06, Sum P(6) = 2.5e-06
Identities = 19/82 (23%), Positives = 33/82 (40%)
Query: 378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
L T+ GD+ +T+ D +V + L + + + + + F+ S G+ L Q
Sbjct: 82 LAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGFLFVASEFGNHYLYQI 141
Query: 438 T-CGSGTSM--LSSGLKEEFGD 456
G SS + E GD
Sbjct: 142 AHLGDDDEEPEFSSAMPLEEGD 163
Score = 45 (20.9 bits), Expect = 2.5e-06, Sum P(6) = 2.5e-06
Identities = 21/104 (20%), Positives = 41/104 (39%)
Query: 914 ITIFK-NISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYV 972
+ +F+ + G + SR ++ R + P L ++ + + C G + +
Sbjct: 479 VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTP-LSYETLEFASGFASEQCPEGIVAI 537
Query: 973 TSQGILKICQLPS-GSTYDNYWPVQKVIPLKATPHQITYFAEKN 1015
S L+I L G+ ++ Q PL+ TP + E N
Sbjct: 538 -STNTLRILALEKLGAVFN-----QVAFPLQYTPRKFVIHPESN 575
Score = 41 (19.5 bits), Expect = 0.00015, Sum P(5) = 0.00015
Identities = 20/68 (29%), Positives = 32/68 (47%)
Query: 665 SIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRK 724
SI Y+ +G+ +G + V DP T +S + S +PV + +G E L
Sbjct: 439 SIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGS--RPVKLFRV-RMQGQEAVLAM 495
Query: 725 TSTDAWLS 732
+S +WLS
Sbjct: 496 SSR-SWLS 502
Score = 39 (18.8 bits), Expect = 2.5e-06, Sum P(6) = 2.5e-06
Identities = 7/23 (30%), Positives = 11/23 (47%)
Query: 12 PTGIANCGSGFITHSRADYVPQI 34
P+G+ C +IT+ P I
Sbjct: 25 PSGVLICSENYITYKNFGDQPDI 47
>CGD|CAL0004426 [details] [associations]
symbol:orf19.5391 species:5476 "Candida albicans" [GO:0071004
"U2-type prespliceosome" evidence=IEA] [GO:0005686 "U2 snRNP"
evidence=IEA] [GO:0030620 "U2 snRNA binding" evidence=IEA]
[GO:0000245 "spliceosomal complex assembly" evidence=IEA]
InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178 CGD:CAL0004426
GO:GO:0008380 Gene3D:2.130.10.10 GO:GO:0006397 GO:GO:0005681
GO:GO:0003676 GO:GO:0007049 eggNOG:NOG247734 EMBL:AACQ01000051
EMBL:AACQ01000050 RefSeq:XP_717672.1 RefSeq:XP_717766.1
STRING:Q5A7S5 GeneID:3640538 GeneID:3640666 KEGG:cal:CaO19.12846
KEGG:cal:CaO19.5391 KO:K12830 Uniprot:Q5A7S5
Length = 1219
Score = 94 (38.1 bits), Expect = 7.6e-06, Sum P(4) = 7.6e-06
Identities = 52/208 (25%), Positives = 88/208 (42%)
Query: 288 ISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAV 347
++ +K+ P ++ LP+D L+ +P IGG++V G N Y L+ +
Sbjct: 246 LNHVVKKKPNSSNSDPLPNDVNYLIPLPGHIGGMVVCGTNWCFYDK--------LDGPRI 297
Query: 348 SLDSSQELPRSSFSVELD-AAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQRLDLS 404
L + ++ S+ ++ H + LL GDL LTV YD +++ + ++
Sbjct: 298 YLPLPRRNGQTQDSIIVNHVTHVLKKKKFFILLQNALGDLFKLTVDYDFDKEIIKNISIT 357
Query: 405 --KTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT-CGS----GTSMLSSGLKEEFGDI 457
T P L+ +I N F D LL QF G G +++S E +
Sbjct: 358 YFDTIPPALSLNI--FKNGFLFANVLNNDKLLYQFEKLGDDLTEGELVINSSDYESLNSV 415
Query: 458 EADAPSTKRLRRSSSDALQDMVNGEELS 485
S K L+ + AL D++ E LS
Sbjct: 416 RESVTSFK-LKGLDNLALIDVL--ETLS 440
Score = 80 (33.2 bits), Expect = 7.6e-06, Sum P(4) = 7.6e-06
Identities = 70/303 (23%), Positives = 126/303 (41%)
Query: 1149 VYSKELKGAISALASLQGHLLIASGPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVK-- 1206
V+ EL L + Q LL+ASG I L+ +L + + S NI K
Sbjct: 942 VHKTELDHIPQVLENFQDKLLVASGNHIRLYDIGQKQL----LKKSTTIIDFSTNINKII 997
Query: 1207 ---NFILLGDIHKS-IYFLSWKEQGAQL-----NLLAKDFGSLDCFATEFLIDGSTL-SL 1256
N I++ D HKS I F + E Q +++ + S+ + LI G ++
Sbjct: 998 PQTNRIIICDSHKSSIVFAKFDESQNQFVPFADDVMKRQITSIMNLDIDTLIGGDKFGNI 1057
Query: 1257 VVS--DEQ--KNIQIFYYAPKMSESWKGQ---KLLSRAEFHVGAHVTKFLRLQMLATSSD 1309
V+ DE K + K + KL + EFH+G +T F L L +
Sbjct: 1058 FVTRIDEDISKQADDDWTILKTQDGILNSCPYKLQNLIEFHIGDIITSF-NLGCLNLA-- 1114
Query: 1310 RTGAAPGSDKTNRFALLFGTLDGSIGCIAPL-DELTFRRLQSLQKKLVDSVPHVAGLNPR 1368
G++ ++++ L G+IG + PL + L +LQ + S ++ G +
Sbjct: 1115 ------GTE-----SVIYTGLQGTIGLLIPLVSKSEVELLFNLQLYMQQSQNNLVGKDHL 1163
Query: 1369 SFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTTRSQILSNLNDLALG 1428
R +++ K +++D +LL + + ++EI+ + + + I L DL
Sbjct: 1164 KLRSYYNPIK-------NVIDGDLLERFLEFDISLKIEISRKLNKSVNDIEKKLIDLRNR 1216
Query: 1429 TSF 1431
++F
Sbjct: 1217 SAF 1219
Score = 73 (30.8 bits), Expect = 7.6e-06, Sum P(4) = 7.6e-06
Identities = 21/70 (30%), Positives = 40/70 (57%)
Query: 578 YLIIS--LEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG---AR 632
YL+IS L ++T+VL +++ +V +S ++ + TIA + G V+Q++ G R
Sbjct: 499 YLVISSSLSSKTLVLSIGEVVEDVEDS-EFVLDQPTIAVQQV-GIASVVQIYSNGIKHVR 556
Query: 633 ILDGSYMTQD 642
++G+ T D
Sbjct: 557 TVNGNKKTTD 566
Score = 49 (22.3 bits), Expect = 7.6e-06, Sum P(4) = 7.6e-06
Identities = 17/84 (20%), Positives = 33/84 (39%)
Query: 131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGR 190
D +++ + +S+L++D+ W GR G + +DP+ R
Sbjct: 118 DGVVITSDSGNLSILQYDNKTKKFISKIQEPMTKNGW-----GRNYV--GENLAIDPENR 170
Query: 191 CGGVLVYGLQM--IILKASQGGSG 212
C +LV ++ + K SG
Sbjct: 171 C--ILVAAMEKNKLFYKIESNSSG 192
>POMBASE|SPAC17H9.10c [details] [associations]
symbol:ddb1 "damaged DNA binding protein Ddb1"
species:4896 "Schizosaccharomyces pombe" [GO:0003677 "DNA binding"
evidence=IEA] [GO:0005634 "nucleus" evidence=IDA] [GO:0005730
"nucleolus" evidence=IDA] [GO:0005829 "cytosol" evidence=IDA]
[GO:0006279 "premeiotic DNA replication" evidence=TAS] [GO:0006282
"regulation of DNA repair" evidence=IMP] [GO:0006283
"transcription-coupled nucleotide-excision repair" evidence=IMP]
[GO:0006974 "response to DNA damage stimulus" evidence=IMP]
[GO:0007090 "regulation of S phase of mitotic cell cycle"
evidence=IMP] [GO:0034644 "cellular response to UV" evidence=IMP]
[GO:0040020 "regulation of meiosis" evidence=IGI] [GO:0042787
"protein ubiquitination involved in ubiquitin-dependent protein
catabolic process" evidence=IMP] [GO:0051445 "regulation of meiotic
cell cycle" evidence=IGI] [GO:0070912 "Ddb1-Ckn1 complex"
evidence=IDA] [GO:0070913 "Ddb1-Wdr21 complex" evidence=IDA]
[GO:0008180 "signalosome" evidence=IDA] [GO:0031465 "Cul4B-RING
ubiquitin ligase complex" evidence=IDA] InterPro:IPR004871
InterPro:IPR015943 Pfam:PF03178 UniPathway:UPA00143
PomBase:SPAC17H9.10c GO:GO:0005829 EMBL:CU329670 GO:GO:0005730
GenomeReviews:CU329670_GR Gene3D:2.130.10.10 GO:GO:0003677
GO:GO:0007049 InterPro:IPR011047 SUPFAM:SSF50998 GO:GO:0034644
GO:GO:0040020 GO:GO:0042787 GO:GO:0007090 GO:GO:0006283
GO:GO:0006282 GO:GO:0006279 GO:GO:0070912 eggNOG:NOG247734
KO:K10610 OMA:CALGDGS PIR:T37876 RefSeq:NP_593580.1 IntAct:O13807
STRING:O13807 EnsemblFungi:SPAC17H9.10c.1 GeneID:2542207
KEGG:spo:SPAC17H9.10c OrthoDB:EOG473T0C NextBio:20803277
GO:GO:0070913 Uniprot:O13807
Length = 1072
Score = 103 (41.3 bits), Expect = 1.9e-05, Sum P(4) = 1.9e-05
Identities = 52/257 (20%), Positives = 112/257 (43%)
Query: 1153 ELKGAISALASLQGHLLIAS-GPKIILHKWTGTELNGIAFYDAPPLYVVSLNIVKNFILL 1211
+++G+++ L L HL++A + + ++ ++ + P Y + +++ ++ I+
Sbjct: 807 KVQGSVNTLV-LYKHLIVAGINASVCIFEYEHGTMH-VRNSIRTPTYTIDISVNQDEIIA 864
Query: 1212 GDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQIFY-- 1269
D+ KSI L + + QL +A+D+ L + E L S V++ N I
Sbjct: 865 ADLMKSITVLQFIDD--QLIEVARDYHPLWATSVEIL---SERKYFVTEADGNAVILLRD 919
Query: 1270 -YAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLATSSDRTGAAPGSDKTNRFALLFG 1328
+P++S+ +KL +F++G + K + D++ P LL
Sbjct: 920 NVSPQLSDR---KKLRWYKKFYLGELINKTRHCTFIEPQ-DKSLVTP--------QLLCA 967
Query: 1329 TLDGSIGCIAPLDELTFRRLQSLQKKLVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIV 1388
T+DGS+ + L LQ + +P GL+ + ++++ + P ++
Sbjct: 968 TVDGSLMIVGDAGMSNTPLLLQLQDNIRKVIPSFGGLSHKEWKEYRGENET---SPSDLI 1024
Query: 1389 DCELLSHYEMLPLEEQL 1405
D L+ +L L E +
Sbjct: 1025 DGSLIE--SILGLREPI 1039
Score = 98 (39.6 bits), Expect = 1.9e-05, Sum P(4) = 1.9e-05
Identities = 31/131 (23%), Positives = 60/131 (45%)
Query: 306 HDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELD 365
HD + +PS GGV V G ++Y S+ + L Y P ++FS +
Sbjct: 218 HDIEEGKLIPSENGGVFVFGEMYVYYISKDIQVSKLLLTY----------PITAFSPSIS 267
Query: 366 AAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFL 425
T L + + +++ ++G L ++ V ++L K S + S + + ++ F+
Sbjct: 268 NDPETGLDSSIYIVADESGMLYKFKALFTDETVS-MELEKLGESSIASCLIALPDNHLFV 326
Query: 426 GSRLGDSLLVQ 436
GS +S+L+Q
Sbjct: 327 GSHFNNSVLLQ 337
Score = 50 (22.7 bits), Expect = 1.9e-05, Sum P(4) = 1.9e-05
Identities = 18/65 (27%), Positives = 31/65 (47%)
Query: 174 RESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESS 233
RES + GPL+ VDP R + VY + I+ + + + FS RI+
Sbjct: 111 RESQS-GPLLLVDPFQRVICLHVYQGLLTIIPIFKSKKRFMTSHNNPSLHDNFSVRIQEL 169
Query: 234 HVINL 238
+V+++
Sbjct: 170 NVVDI 174
Score = 38 (18.4 bits), Expect = 1.9e-05, Sum P(4) = 1.9e-05
Identities = 9/28 (32%), Positives = 16/28 (57%)
Query: 957 FTVLHNVNCNHGFIYV-TSQGILKICQL 983
F+ H+++C I+V T G +I Q+
Sbjct: 441 FSANHDLSCEESTIFVSTIYGNSQILQI 468
>UNIPROTKB|F1NZF7 [details] [associations]
symbol:SF3B3 "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0003676 "nucleic acid binding" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] InterPro:IPR004871 Pfam:PF03178
GO:GO:0005634 GO:GO:0003676 GeneTree:ENSGT00530000063396
EMBL:AADN02051593 EMBL:AADN02051594 IPI:IPI00819465
Ensembl:ENSGALT00000040057 ArrayExpress:F1NZF7 Uniprot:F1NZF7
Length = 504
Score = 124 (48.7 bits), Expect = 0.00066, P = 0.00065
Identities = 76/377 (20%), Positives = 150/377 (39%)
Query: 1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVA 1125
+R++ P + T + ++ +E A +V V F+ T +E L+ + +
Sbjct: 152 IRVMNPIQGN----TLDLVQLEQNEAAFSVAVCR-FSNTGEEWYVLVGVAKDLILNPRSV 206
Query: 1126 ARGRVLLFSTGRNADNPQNLVTEVYSKELKGAISALASLQGHLLIASGPKIILHKWTGTE 1185
A G V + N + + +E+ AI A QG +LI G + ++ +
Sbjct: 207 AGGFVYTYKLLVNGGEKLEFLHKTPVEEVPAAI---APFQGRVLIGVGKLLRVYDLGKKK 263
Query: 1186 LNGIAFYDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLNLLAKDFGSLDCFAT 1245
L Y+ + + + +++ D+ +S ++ +K QL + A D T
Sbjct: 264 LLRKCENKHIANYICGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPR-WVTT 322
Query: 1246 EFLIDGSTLSLVVSDEQKNIQIFYYAP----KMSESWKGQKLL-SRAEFHVG---AHVTK 1297
L+D T++ +D+ NI + P ++ E G K L R + A V
Sbjct: 323 ATLLDYDTVA--GADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIM 380
Query: 1298 FLRLQMLATSSDRTGAAPGSDKTNRFALLFGTLDGSIGCIAPL---DELTFRRLQSLQKK 1354
+ S +T PG ++ L++ TL G IG + P ++ F Q ++
Sbjct: 381 NYHVGETVLSLQKTTLIPGGSES----LVYTTLSGGIGILVPFTSHEDHDF--FQHVEMH 434
Query: 1355 LVDSVPHVAGLNPRSFRQFHSNGKAHRPGPDSIVDCELLSHYEMLPLEEQLEIAHQTGTT 1414
L P + G + SFR ++ K +++D +L + + +Q +A + T
Sbjct: 435 LRSEHPPLCGRDHLSFRSYYFPVK-------NVIDGDLCEQFNSMEPNKQKNVAEELDRT 487
Query: 1415 RSQILSNLNDLALGTSF 1431
++ L D+ +F
Sbjct: 488 PPEVSKKLEDIRTRYAF 504
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query                        ----- As Used -----    ----- Computed ----
Frame  MatID  Matrix name    Lambda  K       H      Lambda  K     H
 +0    0      BLOSUM62       0.319   0.135   0.398  same    same  same
              Q=9,R=2        0.244   0.0300  0.180  n/a     n/a   n/a
Query
Frame  MatID  Length  Eff.Length  E        S    W  T   X   E2    S2
 +0    0      1432    1389        0.00090  124  3  11  22  0.39  34
                                                       39  0.45  37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 64
No. of states in DFA: 634 (67 KB)
Total size of DFA: 578 KB (2260 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 113.64u 0.11s 113.75t Elapsed: 00:00:05
Total cpu time: 113.67u 0.11s 113.78t Elapsed: 00:00:05
Start: Tue May 21 16:40:54 2013 End: Tue May 21 16:40:59 2013