BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>001387
MSIWNYVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYG
RIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQI
GIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVL
YQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC
SANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGET
SIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLE
RQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVS
FISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELR
NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPI
GENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCAL
GDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKL
LYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICH
QEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDS
NVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAIN
QKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA
IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLG
EFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVI
KGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKR
VEELTRLH

High Scoring Gene Products

Symbol, full name Information P value
DDB1A
AT4G05420
protein from Arabidopsis thaliana 0.
DDB1B
damaged DNA binding protein 1B
protein from Arabidopsis thaliana 0.
DDB1
Uncharacterized protein
protein from Canis lupus familiaris 1.2e-312
DDB1
DNA damage-binding protein 1
protein from Bos taurus 2.0e-312
Ddb1
damage specific DNA binding protein 1
protein from Mus musculus 2.0e-312
DDB1
Uncharacterized protein
protein from Sus scrofa 4.2e-312
DDB1
DNA damage-binding protein 1
protein from Homo sapiens 5.4e-312
ddb1
DNA damage-binding protein 1
protein from Xenopus laevis 1.1e-311
DDB1
DNA damage-binding protein 1
protein from Pongo abelii 1.4e-311
DDB1
DNA damage-binding protein 1
protein from Chlorocebus aethiops 2.3e-311
DDB1
DNA damage-binding protein 1
protein from Gallus gallus 3.8e-311
ddb1
damage specific DNA binding protein 1
gene_product from Danio rerio 6.1e-311
Ddb1
damage-specific DNA binding protein 1, 127kDa
gene from Rattus norvegicus 2.1e-308
DDB1
DNA damage-binding protein 1
protein from Gallus gallus 3.5e-306
DDB1
DNA damage-binding protein 1
protein from Gallus gallus 6.6e-305
DDB1
DNA damage-binding protein 1
protein from Gallus gallus 9.6e-304
DDB1
Uncharacterized protein
protein from Canis lupus familiaris 8.1e-298
DDB1
DNA damage-binding protein 1
protein from Homo sapiens 1.2e-296
pic
piccolo
protein from Drosophila melanogaster 2.1e-268
repE
UV-damaged DNA binding protein1
gene from Dictyostelium discoideum 7.0e-267
ddb-1 gene from Caenorhabditis elegans 2.7e-167
ddb-1
DNA damage-binding protein 1
protein from Caenorhabditis elegans 2.7e-167
DDB1
DNA damage-binding protein 1
protein from Homo sapiens 7.8e-114
MGG_16867
Uncharacterized protein
protein from Magnaporthe oryzae 70-15 1.4e-79
DDB1
DNA damage-binding protein 1
protein from Homo sapiens 2.3e-75
DDB1
DNA damage-binding protein 1
protein from Homo sapiens 1.8e-69
DDB1
DNA damage-binding protein 1
protein from Homo sapiens 3.4e-64
DDB1
DNA damage-binding protein 1
protein from Homo sapiens 2.2e-60
SAP130a
AT3G55200
protein from Arabidopsis thaliana 2.6e-58
SAP130b
AT3G55220
protein from Arabidopsis thaliana 2.6e-58
sf3b3
splicing factor 3B subunit 3
gene from Dictyostelium discoideum 1.2e-54
SF3B3
Uncharacterized protein
protein from Canis lupus familiaris 1.1e-50
SF3B3
Splicing factor 3B subunit 3
protein from Bos taurus 1.4e-50
SF3B3
Splicing factor 3B subunit 3
protein from Homo sapiens 1.4e-50
Sf3b3
splicing factor 3b, subunit 3
protein from Mus musculus 1.4e-50
CG13900 protein from Drosophila melanogaster 1.7e-50
sf3b3
splicing factor 3b, subunit 3
gene_product from Danio rerio 1.8e-50
DDB1
DNA damage-binding protein 1
protein from Homo sapiens 3.0e-49
SF3B3
Uncharacterized protein
protein from Gallus gallus 3.1e-48
teg-4 gene from Caenorhabditis elegans 1.1e-46
Sf3b3
splicing factor 3b, subunit 3
gene from Rattus norvegicus 1.1e-43
PFL1680w
splicing factor 3b, subunit 3, 130kD, putative
gene from Plasmodium falciparum 1.2e-42
PFL1680w
Splicing factor 3b, subunit 3, 130kD, putative
protein from Plasmodium falciparum 3D7 1.2e-42
DDB1
DNA damage-binding protein 1
protein from Homo sapiens 6.0e-42
DDB1
DNA damage-binding protein 1
protein from Homo sapiens 9.9e-42
AT3G11960 protein from Arabidopsis thaliana 6.6e-31
DDB1
DNA damage-binding protein 1
protein from Homo sapiens 1.8e-29
cpsf-1 gene from Caenorhabditis elegans 4.7e-26
cpsf-1
Probable cleavage and polyadenylation specificity factor subunit 1
protein from Caenorhabditis elegans 4.7e-26
cpsf1
cleavage and polyadenylation specific factor 1
gene_product from Danio rerio 2.3e-25
CPSF1
Cleavage and polyadenylation specificity factor subunit 1
protein from Homo sapiens 3.0e-25
DDB1
DNA damage-binding protein 1
protein from Homo sapiens 1.7e-24
Cpsf1
cleavage and polyadenylation specific factor 1
protein from Mus musculus 2.0e-24
CPSF1
Cleavage and polyadenylation specificity factor subunit 1
protein from Bos taurus 7.9e-24
cpsf1
cleavage and polyadenylation specificity factor 160 kDa subunit
gene from Dictyostelium discoideum 1.4e-23
CPSF160
cleavage and polyadenylation specificity factor 160
protein from Arabidopsis thaliana 3.5e-23
Cpsf160
Cleavage and polyadenylation specificity factor 160
protein from Drosophila melanogaster 3.6e-23
CPSF1
Uncharacterized protein
protein from Canis lupus familiaris 2.1e-22
CPSF1
Uncharacterized protein
protein from Sus scrofa 5.8e-22
orf19.5391 gene_product from Candida albicans 1.8e-21
DDB1
DNA damage-binding protein 1
protein from Homo sapiens 1.2e-20
Cpsf1
cleavage and polyadenylation specific factor 1, 160kDa
gene from Rattus norvegicus 1.5e-18
CPSF1
Uncharacterized protein
protein from Canis lupus familiaris 3.5e-17
SF3B3
Uncharacterized protein
protein from Gallus gallus 4.9e-16
CPSF1
Uncharacterized protein
protein from Sus scrofa 1.6e-15
DDB1
DNA damage-binding protein 1
protein from Gallus gallus 2.1e-13
RSE1
Protein involved in pre-mRNA splicing
gene from Saccharomyces cerevisiae 6.3e-11
LOC100512659
Uncharacterized protein
protein from Sus scrofa 7.5e-09
orf19.2760 gene_product from Candida albicans 1.4e-07
CFT1
Protein CFT1
protein from Candida albicans SC5314 1.4e-07

The BLAST search returned 2 gene products which did not match your query constraints. Please see the full BLAST report below for the details.

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  001387
        (1088 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2115909 - symbol:DDB1A "damaged DNA binding pr...  5144  0.        1
TAIR|locus:2127368 - symbol:DDB1B "damaged DNA binding pr...  4980  0.        1
UNIPROTKB|E2R9E3 - symbol:DDB1 "Uncharacterized protein" ...  2083  1.2e-312  2
UNIPROTKB|A1A4K3 - symbol:DDB1 "DNA damage-binding protei...  2081  2.0e-312  2
MGI|MGI:1202384 - symbol:Ddb1 "damage specific DNA bindin...  2081  2.0e-312  2
UNIPROTKB|F1RIE2 - symbol:DDB1 "Uncharacterized protein" ...  2078  4.2e-312  2
UNIPROTKB|Q16531 - symbol:DDB1 "DNA damage-binding protei...  2077  5.4e-312  2
UNIPROTKB|Q6P6Z0 - symbol:ddb1 "DNA damage-binding protei...  2078  1.1e-311  2
UNIPROTKB|Q5R649 - symbol:DDB1 "DNA damage-binding protei...  2078  1.4e-311  2
UNIPROTKB|P33194 - symbol:DDB1 "DNA damage-binding protei...  2071  2.3e-311  2
UNIPROTKB|Q805F9 - symbol:DDB1 "DNA damage-binding protei...  2069  3.8e-311  2
ZFIN|ZDB-GENE-040426-1272 - symbol:ddb1 "damage specific ...  2057  6.1e-311  2
RGD|621889 - symbol:Ddb1 "damage-specific DNA binding pro...  2056  2.1e-308  2
UNIPROTKB|F1P4I8 - symbol:DDB1 "DNA damage-binding protei...  2022  3.5e-306  2
UNIPROTKB|F1NVV2 - symbol:DDB1 "DNA damage-binding protei...  2022  6.6e-305  2
UNIPROTKB|F1NVV3 - symbol:DDB1 "DNA damage-binding protei...  2022  9.6e-304  2
UNIPROTKB|J9NVR7 - symbol:DDB1 "Uncharacterized protein" ...  2083  8.1e-298  2
UNIPROTKB|F5GY55 - symbol:DDB1 "Uncharacterized protein" ...  2077  1.2e-296  2
FB|FBgn0260962 - symbol:pic "piccolo" species:7227 "Droso...  1856  2.1e-268  2
DICTYBASE|DDB_G0286013 - symbol:repE "UV-damaged DNA bind...   835  7.0e-267  4
WB|WBGene00010890 - symbol:ddb-1 species:6239 "Caenorhabd...  1119  2.7e-167  2
UNIPROTKB|Q21554 - symbol:ddb-1 "DNA damage-binding prote...  1119  2.7e-167  2
UNIPROTKB|F1M680 - symbol:Ddb1 "DNA damage-binding protei...   922  6.2e-142  2
ASPGD|ASPL0000052925 - symbol:ddbA species:162425 "Emeric...   937  3.8e-114  2
UNIPROTKB|B4DG00 - symbol:DDB1 "cDNA FLJ52436, highly sim...   940  7.8e-114  2
UNIPROTKB|G4N4E2 - symbol:MGG_16867 "Uncharacterized prot...   471  1.4e-79   3
UNIPROTKB|F5H6C5 - symbol:DDB1 "DNA damage-binding protei...   760  2.3e-75   1
UNIPROTKB|F5H581 - symbol:DDB1 "DNA damage-binding protei...   686  1.8e-69   2
UNIPROTKB|F5H775 - symbol:DDB1 "DNA damage-binding protei...   661  3.4e-64   1
UNIPROTKB|F5H0Y5 - symbol:DDB1 "DNA damage-binding protei...   626  2.2e-60   1
TAIR|locus:2100616 - symbol:SAP130a "spliceosome-associat...   345  2.6e-58   4
TAIR|locus:2100646 - symbol:SAP130b "spliceosome-associat...   345  2.6e-58   4
POMBASE|SPAC17H9.10c - symbol:ddb1 "damaged DNA binding p...   615  8.3e-57   1
DICTYBASE|DDB_G0282569 - symbol:sf3b3 "splicing factor 3B...   326  1.2e-54   4
ASPGD|ASPL0000031473 - symbol:AN5452 species:162425 "Emer...   343  2.0e-53   5
UNIPROTKB|E2RR33 - symbol:SF3B3 "Uncharacterized protein"...   337  1.1e-50   4
UNIPROTKB|A0JN52 - symbol:SF3B3 "Splicing factor 3B subun...   337  1.4e-50   4
UNIPROTKB|Q15393 - symbol:SF3B3 "Splicing factor 3B subun...   337  1.4e-50   4
MGI|MGI:1289341 - symbol:Sf3b3 "splicing factor 3b, subun...   337  1.4e-50   4
FB|FBgn0035162 - symbol:CG13900 species:7227 "Drosophila ...   282  1.7e-50   4
ZFIN|ZDB-GENE-040426-2901 - symbol:sf3b3 "splicing factor...   338  1.8e-50   4
UNIPROTKB|F5H2L3 - symbol:DDB1 "DNA damage-binding protei...   523  3.0e-49   1
UNIPROTKB|F1P529 - symbol:SF3B3 "Uncharacterized protein"...   333  3.1e-48   4
WB|WBGene00019323 - symbol:teg-4 species:6239 "Caenorhabd...   342  1.1e-46   3
UNIPROTKB|E9PT66 - symbol:Sf3b3 "Protein Sf3b3" species:1...   337  1.8e-46   2
POMBASE|SPAPJ698.03c - symbol:prp12 "U2 snRNP-associated ...   322  7.9e-46   4
RGD|1311636 - symbol:Sf3b3 "splicing factor 3b, subunit 3...   337  1.1e-43   3
GENEDB_PFALCIPARUM|PFL1680w - symbol:PFL1680w "splicing f...   280  1.2e-42   4
UNIPROTKB|Q8I574 - symbol:PFL1680w "Splicing factor 3b, s...   280  1.2e-42   4
UNIPROTKB|F5GZ34 - symbol:DDB1 "DNA damage-binding protei...   455  6.0e-42   1
UNIPROTKB|F5GZY8 - symbol:DDB1 "DNA damage-binding protei...   453  9.9e-42   1
TAIR|locus:2081576 - symbol:AT3G11960 species:3702 "Arabi...   222  6.6e-31   4
UNIPROTKB|F5H198 - symbol:DDB1 "DNA damage-binding protei...   203  1.8e-29   2
POMBASE|SPCC11E10.08 - symbol:rik1 "silencing protein Rik...   255  1.7e-26   3
WB|WBGene00022301 - symbol:cpsf-1 species:6239 "Caenorhab...   214  4.7e-26   4
UNIPROTKB|Q9N4C2 - symbol:cpsf-1 "Probable cleavage and p...   214  4.7e-26   4
ZFIN|ZDB-GENE-040709-2 - symbol:cpsf1 "cleavage and polya...   226  2.3e-25   7
UNIPROTKB|Q10570 - symbol:CPSF1 "Cleavage and polyadenyla...   234  3.0e-25   7
UNIPROTKB|F5GYG8 - symbol:DDB1 "DNA damage-binding protei...   292  1.7e-24   1
MGI|MGI:2679722 - symbol:Cpsf1 "cleavage and polyadenylat...   230  2.0e-24   7
UNIPROTKB|Q10569 - symbol:CPSF1 "Cleavage and polyadenyla...   230  7.9e-24   7
DICTYBASE|DDB_G0281585 - symbol:cpsf1 "cleavage and polya...   228  1.4e-23   7
TAIR|locus:2153122 - symbol:CPSF160 "cleavage and polyade...   208  3.5e-23   4
FB|FBgn0024698 - symbol:Cpsf160 "Cleavage and polyadenyla...   248  3.6e-23   7
UNIPROTKB|F1PC28 - symbol:CPSF1 "Uncharacterized protein"...   229  2.1e-22   6
UNIPROTKB|F1RSN8 - symbol:CPSF1 "Uncharacterized protein"...   229  5.8e-22   6
CGD|CAL0004426 - symbol:orf19.5391 species:5476 "Candida ...   179  1.8e-21   3
UNIPROTKB|F8WF81 - symbol:DDB1 "DNA damage-binding protei...   256  1.2e-20   1
RGD|1306406 - symbol:Cpsf1 "cleavage and polyadenylation ...   161  1.5e-18   8
UNIPROTKB|J9P418 - symbol:CPSF1 "Uncharacterized protein"...   229  3.5e-17   4
UNIPROTKB|F1NZF7 - symbol:SF3B3 "Uncharacterized protein"...   234  4.9e-16   1
UNIPROTKB|K7GNU1 - symbol:CPSF1 "Uncharacterized protein"...   229  1.6e-15   2
UNIPROTKB|E1C725 - symbol:DDB1 "DNA damage-binding protei...   188  2.1e-13   1
POMBASE|SPBC1709.08 - symbol:cft1 "cleavage factor one Cf...   140  2.0e-12   6
SGD|S000004513 - symbol:RSE1 "Protein involved in pre-mRN...   120  6.3e-11   4
ASPGD|ASPL0000050546 - symbol:AN1413 species:162425 "Emer...   168  2.4e-09   6
UNIPROTKB|F1S419 - symbol:LOC100512659 "Uncharacterized p...   164  7.5e-09   1
CGD|CAL0004251 - symbol:orf19.2760 species:5476 "Candida ...    86  1.4e-07   6
UNIPROTKB|Q5AFT3 - symbol:CFT1 "Protein CFT1" species:237...    86  1.4e-07   6


>TAIR|locus:2115909 [details] [associations]
            symbol:DDB1A "damaged DNA binding protein 1A"
            species:3702 "Arabidopsis thaliana" [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0003677 "DNA binding" evidence=ISS]
            [GO:0005634 "nucleus" evidence=ISM;IEA;IDA;IPI] [GO:0010100
            "negative regulation of photomorphogenesis" evidence=IGI;RCA]
            [GO:0045892 "negative regulation of transcription, DNA-dependent"
            evidence=IPI] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0080008 "Cul4-RING ubiquitin ligase complex" evidence=IPI]
            [GO:0005829 "cytosol" evidence=IDA] [GO:0000278 "mitotic cell
            cycle" evidence=RCA] [GO:0000911 "cytokinesis by cell plate
            formation" evidence=RCA] [GO:0003002 "regionalization"
            evidence=RCA] [GO:0006281 "DNA repair" evidence=RCA] [GO:0006486
            "protein glycosylation" evidence=RCA] [GO:0007155 "cell adhesion"
            evidence=RCA] [GO:0008284 "positive regulation of cell
            proliferation" evidence=RCA] [GO:0009630 "gravitropism"
            evidence=RCA] [GO:0009639 "response to red or far red light"
            evidence=RCA] [GO:0010090 "trichome morphogenesis" evidence=RCA]
            [GO:0033043 "regulation of organelle organization" evidence=RCA]
            [GO:0045010 "actin nucleation" evidence=RCA] [GO:0048449 "floral
            organ formation" evidence=RCA] [GO:0048608 "reproductive structure
            development" evidence=RCA] InterPro:IPR017986 InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 UniPathway:UPA00143 GO:GO:0005829
            GO:GO:0005634 GO:GO:0045892 EMBL:CP002687 GenomeReviews:CT486007_GR
            Gene3D:2.130.10.10 SUPFAM:SSF50978 GO:GO:0003677 GO:GO:0006281
            GO:GO:0016567 GO:GO:0009585 EMBL:AL161503 GO:GO:0080008
            GO:GO:0010100 EMBL:AY074257 EMBL:BT001905 EMBL:AK230366
            IPI:IPI00548104 PIR:B85068 RefSeq:NP_192451.1 UniGene:At.32663
            UniGene:At.47587 ProteinModelPortal:Q9M0V3 DIP:DIP-40455N
            IntAct:Q9M0V3 STRING:Q9M0V3 PaxDb:Q9M0V3 PRIDE:Q9M0V3 ProMEX:Q9M0V3
            EnsemblPlants:AT4G05420.1 GeneID:825890 KEGG:ath:AT4G05420
            GeneFarm:4660 TAIR:At4g05420 eggNOG:NOG247734 HOGENOM:HOG000007241
            InParanoid:Q9M0V3 KO:K10610 OMA:CALGDGS PhylomeDB:Q9M0V3
            ProtClustDB:CLSN2685347 Genevestigator:Q9M0V3 GermOnline:AT4G05420
            Uniprot:Q9M0V3
        Length = 1088

 Score = 5144 (1815.8 bits), Expect = 0., P = 0.
 Identities = 981/1088 (90%), Positives = 1034/1088 (95%)

Query:     1 MSIWNYVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYG 60
             MS WNYVVTAHKPT+VTHSCVGNFTSPQELNLI+AKCTRIEIHLLTPQGLQPMLDVPIYG
Sbjct:     1 MSSWNYVVTAHKPTSVTHSCVGNFTSPQELNLIVAKCTRIEIHLLTPQGLQPMLDVPIYG 60

Query:    61 RIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQI 120
             RIATLELFRPHGEAQDFLFIATERYKFCVLQWD ESSELITRAMGDVSDRIGRPTDNGQI
Sbjct:    61 RIATLELFRPHGEAQDFLFIATERYKFCVLQWDPESSELITRAMGDVSDRIGRPTDNGQI 120

Query:   121 GIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVL 180
             GIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL+GCAKPTI VL
Sbjct:   121 GIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLFGCAKPTIAVL 180

Query:   181 YQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC 240
             YQDNKDARHVKTYEV+LKDKDFVEGPWSQN+LDNGADLLIPVPPPLCGVLIIGEETIVYC
Sbjct:   181 YQDNKDARHVKTYEVSLKDKDFVEGPWSQNSLDNGADLLIPVPPPLCGVLIIGEETIVYC 240

Query:   241 SANAFKAIPIRPSITKAYGRVDADGSRYXXXXXXXXXXXXVITHEKEKVTGLKIELLGET 300
             SA+AFKAIPIRPSITKAYGRVD DGSRY            VITHEKEKVTGLKIELLGET
Sbjct:   241 SASAFKAIPIRPSITKAYGRVDVDGSRYLLGDHAGMIHLLVITHEKEKVTGLKIELLGET 300

Query:   301 SIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLE 360
             SIASTISYLDNAVV++GSSYGDSQL+KLNL PDAKGSYVEVLERY+NLGPIVDFCVVDLE
Sbjct:   301 SIASTISYLDNAVVFVGSSYGDSQLVKLNLHPDAKGSYVEVLERYINLGPIVDFCVVDLE 360

Query:   361 RQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVS 420
             RQGQGQVVTCSGA+KDGSLR+VRNGIGINEQASVELQGIKGMWSL+SS D+ FDTFLVVS
Sbjct:   361 RQGQGQVVTCSGAFKDGSLRVVRNGIGINEQASVELQGIKGMWSLKSSIDEAFDTFLVVS 420

Query:   421 FISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELR 480
             FISETRILAMNL          GF SQ QTLFCHDA+YNQLVQVTS SVRLVSST+RELR
Sbjct:   421 FISETRILAMNLEDELEETEIEGFLSQVQTLFCHDAVYNQLVQVTSNSVRLVSSTTRELR 480

Query:   481 NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPI 540
             +EW +P G++VNVATANASQVLLATGGGHLVYLEIGDG LTEV+HA LEYE+SCLDINPI
Sbjct:   481 DEWHAPAGFTVNVATANASQVLLATGGGHLVYLEIGDGKLTEVQHALLEYEVSCLDINPI 540

Query:   541 GENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCAL 600
             G+NP+YSQ+AAVGMWTDISVRIFSLP+L LITKE LGGEIIPRSVLLCAFEGISYLLCAL
Sbjct:   541 GDNPNYSQLAAVGMWTDISVRIFSLPELTLITKEQLGGEIIPRSVLLCAFEGISYLLCAL 600

Query:   601 GDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKL 660
             GDGHLLNF ++  TG+L DRKKVSLGTQPITLRTFSSK+ THVFAASDRPTVIYSSNKKL
Sbjct:   601 GDGHLLNFQMDTTTGQLKDRKKVSLGTQPITLRTFSSKSATHVFAASDRPTVIYSSNKKL 660

Query:   661 LYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICH 720
             LYSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IPLGEH RRICH
Sbjct:   661 LYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPLGEHARRICH 720

Query:   721 QEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDS 780
             QEQ+RTF ICSL NQS +EESEMHFVRLLDDQTFEF+STYPLD+FEYGCSILSCSF++D 
Sbjct:   721 QEQTRTFGICSLGNQSNSEESEMHFVRLLDDQTFEFMSTYPLDSFEYGCSILSCSFTEDK 780

Query:   781 NVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAIN 840
             NVYYCVGTAYVLPEENEPTKGRILVFIVEDG+LQLIAEKETKGAVYSLNAFNGKLLAAIN
Sbjct:   781 NVYYCVGTAYVLPEENEPTKGRILVFIVEDGRLQLIAEKETKGAVYSLNAFNGKLLAAIN 840

Query:   841 QKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA 900
             QKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL+YKHEEGA
Sbjct:   841 QKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLLYKHEEGA 900

Query:   901 IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLG 960
             IEERARDYNANWMSAVEILDDDIYLGAENNFNL TV+KNSEGATDEERGRLEVVGEYHLG
Sbjct:   901 IEERARDYNANWMSAVEILDDDIYLGAENNFNLLTVKKNSEGATDEERGRLEVVGEYHLG 960

Query:   961 EFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVI 1020
             EFVNRFRHGSLVMRLPDS++GQIPTVIFGTVNGVIGVIASLP EQY FLEKLQ++LRKVI
Sbjct:   961 EFVNRFRHGSLVMRLPDSEIGQIPTVIFGTVNGVIGVIASLPQEQYTFLEKLQSSLRKVI 1020

Query:  1021 KGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKR 1080
             KGVGGL+HEQWRSFNNEK+T +A+NFLDGDLIESFLDLSR +M++ISK+MNV VEELCKR
Sbjct:  1021 KGVGGLSHEQWRSFNNEKRTAEARNFLDGDLIESFLDLSRNKMEDISKSMNVQVEELCKR 1080

Query:  1081 VEELTRLH 1088
             VEELTRLH
Sbjct:  1081 VEELTRLH 1088


>TAIR|locus:2127368 [details] [associations]
            symbol:DDB1B "damaged DNA binding protein 1B"
            species:3702 "Arabidopsis thaliana" [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0005634 "nucleus" evidence=ISM;IEA;IDA]
            [GO:0003684 "damaged DNA binding" evidence=ISS] [GO:0009793 "embryo
            development ending in seed dormancy" evidence=IMP] [GO:0005515
            "protein binding" evidence=IPI] [GO:0005829 "cytosol" evidence=RCA]
            [GO:0006281 "DNA repair" evidence=RCA] [GO:0007062 "sister
            chromatid cohesion" evidence=RCA] [GO:0009880 "embryonic pattern
            specification" evidence=RCA] [GO:0010072 "primary shoot apical
            meristem specification" evidence=RCA] [GO:0010100 "negative
            regulation of photomorphogenesis" evidence=RCA] [GO:0010162 "seed
            dormancy process" evidence=RCA] [GO:0010431 "seed maturation"
            evidence=RCA] [GO:0010564 "regulation of cell cycle process"
            evidence=RCA] [GO:0045595 "regulation of cell differentiation"
            evidence=RCA] [GO:0048366 "leaf development" evidence=RCA]
            [GO:0048608 "reproductive structure development" evidence=RCA]
            [GO:0048825 "cotyledon development" evidence=RCA] [GO:0051301 "cell
            division" evidence=RCA] InterPro:IPR017986 InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 UniPathway:UPA00143 GO:GO:0005634
            EMBL:CP002687 GenomeReviews:CT486007_GR Gene3D:2.130.10.10
            SUPFAM:SSF50978 EMBL:AL161554 GO:GO:0003677 GO:GO:0006281
            GO:GO:0009793 GO:GO:0016567 GO:GO:0009585 EMBL:AL021960
            UniGene:At.32663 eggNOG:NOG247734 HOGENOM:HOG000007241 KO:K10610
            ProtClustDB:CLSN2685347 EMBL:AK220648 EMBL:AK229805 IPI:IPI00536598
            PIR:T04941 RefSeq:NP_193842.1 ProteinModelPortal:O49552 SMR:O49552
            DIP:DIP-46981N IntAct:O49552 STRING:O49552 PaxDb:O49552
            PRIDE:O49552 EnsemblPlants:AT4G21100.1 GeneID:827857
            KEGG:ath:AT4G21100 GeneFarm:4661 TAIR:At4g21100 InParanoid:O49552
            OMA:DRPAVIY PhylomeDB:O49552 Genevestigator:O49552
            GermOnline:AT4G21100 Uniprot:O49552
        Length = 1088

 Score = 4980 (1758.1 bits), Expect = 0., P = 0.
 Identities = 948/1088 (87%), Positives = 1014/1088 (93%)

Query:     1 MSIWNYVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYG 60
             MS+WNY VTA KPT VTHSCVGNFTSPQELNLI+AK TRIEIHLL+PQGLQ +LDVP+YG
Sbjct:     1 MSVWNYAVTAQKPTCVTHSCVGNFTSPQELNLIVAKSTRIEIHLLSPQGLQTILDVPLYG 60

Query:    61 RIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQI 120
             RIAT+ELFRPHGEAQDFLF+ATERYKFCVLQWD ESSELITRAMGDVSDRIGRPTDNGQI
Sbjct:    61 RIATMELFRPHGEAQDFLFVATERYKFCVLQWDYESSELITRAMGDVSDRIGRPTDNGQI 120

Query:   121 GIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVL 180
             GIIDPDCR+IGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC KPTI VL
Sbjct:   121 GIIDPDCRVIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCTKPTIAVL 180

Query:   181 YQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC 240
             YQDNKDARHVKTYEV+LKDK+FVEGPWSQNNLDNGADLLIPVP PLCGVLIIGEETIVYC
Sbjct:   181 YQDNKDARHVKTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLCGVLIIGEETIVYC 240

Query:   241 SANAFKAIPIRPSITKAYGRVDADGSRYXXXXXXXXXXXXVITHEKEKVTGLKIELLGET 300
             SANAFKAIPIRPSITKAYGRVD DGSRY            VITHEKEKVTGLKIELLGET
Sbjct:   241 SANAFKAIPIRPSITKAYGRVDLDGSRYLLGDHAGLIHLLVITHEKEKVTGLKIELLGET 300

Query:   301 SIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLE 360
             SIAS+ISYLDNAVV++GSSYGDSQLIKLNLQPDAKGSYVE+LE+YVNLGPIVDFCVVDLE
Sbjct:   301 SIASSISYLDNAVVFVGSSYGDSQLIKLNLQPDAKGSYVEILEKYVNLGPIVDFCVVDLE 360

Query:   361 RQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVS 420
             RQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL+SS D+ FDTFLVVS
Sbjct:   361 RQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLKSSIDEAFDTFLVVS 420

Query:   421 FISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELR 480
             FISETRILAMN+          GF S+ QTLFCHDA+YNQLVQVTS SVRLVSST+RELR
Sbjct:   421 FISETRILAMNIEDELEETEIEGFLSEVQTLFCHDAVYNQLVQVTSNSVRLVSSTTRELR 480

Query:   481 NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPI 540
             N+W +P G+SVNVATANASQVLLATGGGHLVYLEIGDG LTEVKH  LEYE+SCLDINPI
Sbjct:   481 NKWDAPAGFSVNVATANASQVLLATGGGHLVYLEIGDGTLTEVKHVLLEYEVSCLDINPI 540

Query:   541 GENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCAL 600
             G+NP+YSQ+AAVGMWTDISVRIF LPDL LITKE LGGEIIPRSVLLCAFEGISYLLCAL
Sbjct:   541 GDNPNYSQLAAVGMWTDISVRIFVLPDLTLITKEELGGEIIPRSVLLCAFEGISYLLCAL 600

Query:   601 GDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKL 660
             GDGHLLNF L+   G+L DRKKVSLGT+PITLRTFSSK+ THVFAASDRP VIYS+NKKL
Sbjct:   601 GDGHLLNFQLDTSCGKLRDRKKVSLGTRPITLRTFSSKSATHVFAASDRPAVIYSNNKKL 660

Query:   661 LYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICH 720
             LYSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IP+GEH RRICH
Sbjct:   661 LYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPIGEHARRICH 720

Query:   721 QEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDS 780
             QEQ+RTFAI  L+N+  AEESE HFVRLLD Q+FEF+S+YPLD FE GCSILSCSF+DD 
Sbjct:   721 QEQTRTFAISCLRNEPSAEESESHFVRLLDAQSFEFLSSYPLDAFECGCSILSCSFTDDK 780

Query:   781 NVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAIN 840
             NVYYCVGTAYVLPEENEPTKGRILVFIVE+G+LQLI EKETKGAVYSLNAFNGKLLA+IN
Sbjct:   781 NVYYCVGTAYVLPEENEPTKGRILVFIVEEGRLQLITEKETKGAVYSLNAFNGKLLASIN 840

Query:   841 QKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA 900
             QKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFI VGDLMKSISLLIYKHEEGA
Sbjct:   841 QKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIAVGDLMKSISLLIYKHEEGA 900

Query:   901 IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLG 960
             IEERARDYNANWM+AVEIL+DDIYLG +N FN+FTV+KN+EGATDEER R+EVVGEYH+G
Sbjct:   901 IEERARDYNANWMTAVEILNDDIYLGTDNCFNIFTVKKNNEGATDEERARMEVVGEYHIG 960

Query:   961 EFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVI 1020
             EFVNRFRHGSLVM+LPDSD+GQIPTVIFGTV+G+IGVIASLP EQY FLEKLQT+LRKVI
Sbjct:   961 EFVNRFRHGSLVMKLPDSDIGQIPTVIFGTVSGMIGVIASLPQEQYAFLEKLQTSLRKVI 1020

Query:  1021 KGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKR 1080
             KGVGGL+HEQWRSFNNEK+T +AK +LDGDLIESFLDLSR +M+EISK M+V VEELCKR
Sbjct:  1021 KGVGGLSHEQWRSFNNEKRTAEAKGYLDGDLIESFLDLSRGKMEEISKGMDVQVEELCKR 1080

Query:  1081 VEELTRLH 1088
             VEELTRLH
Sbjct:  1081 VEELTRLH 1088


>UNIPROTKB|E2R9E3 [details] [associations]
            symbol:DDB1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0043161 "proteasomal ubiquitin-dependent
            protein catabolic process" evidence=IEA] [GO:0042787 "protein
            ubiquitination involved in ubiquitin-dependent protein catabolic
            process" evidence=IEA] [GO:0031465 "Cul4B-RING ubiquitin ligase
            complex" evidence=IEA] [GO:0031464 "Cul4A-RING ubiquitin ligase
            complex" evidence=IEA] [GO:0016055 "Wnt receptor signaling pathway"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0000075
            "cell cycle checkpoint" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
            InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178 GO:GO:0005634
            GO:GO:0005737 GO:GO:0043161 GO:GO:0016055 Gene3D:2.130.10.10
            GO:GO:0003676 GO:GO:0042787 GO:GO:0000075 GO:GO:0031464
            GO:GO:0031465 KO:K10610 OMA:CALGDGS CTD:1642
            GeneTree:ENSGT00530000063396 EMBL:AAEX03011677 RefSeq:XP_533275.2
            Ensembl:ENSCAFT00000025824 GeneID:476067 KEGG:cfa:476067
            NextBio:20851798 Uniprot:E2R9E3
        Length = 1140

 Score = 2083 (738.3 bits), Expect = 1.2e-312, Sum P(2) = 1.2e-312
 Identities = 406/738 (55%), Positives = 538/738 (72%)

Query:     4 WNYVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYGRIA 63
             +NYVVTA KPT V     G+FTS ++LNL+IAK TR+EI+++T +GL+P+ +V +YG+IA
Sbjct:     3 YNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVVTAEGLRPVKEVGMYGKIA 62

Query:    64 TLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIGRPTDNGQIG 121
              +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIGRP++ G IG
Sbjct:    63 VMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIGRPSETGIIG 122

Query:   122 IIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLY 181
             IIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC  PTI  +Y
Sbjct:   123 IIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGCQAPTICFVY 182

Query:   182 QDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS 241
             QD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +IIG+E+I Y +
Sbjct:   183 QDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHN 241

Query:   242 ANAFKAI--PI-RPSITKAYGRVDADGSRYXXXXXXXXXXXXVITHEKEK---VT--GLK 293
              + + AI  PI + S    + RVD +GSRY            ++  E++    VT   L+
Sbjct:   242 GDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMDGTVTLKDLR 301

Query:   294 IELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVD 353
             +ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E + NLGPIVD
Sbjct:   302 VELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPIVD 361

Query:   354 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPF 413
              CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W LRS  +   
Sbjct:   362 MCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWPLRSDPNRET 421

Query:   414 DTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 473
             D  LV+SF+ +TR+L +N           GF    QT FC +  + QL+Q+TS SVRLVS
Sbjct:   422 DDTLVLSFVGQTRVLMLN-GEEVEETELMGFVDDQQTFFCGNVAHQQLIQITSASVRLVS 480

Query:   474 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEIS 533
                + L +EWK P G +++VA+ N+SQV++A G   L YL+I    L ++ H ++E+E++
Sbjct:   481 QEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQISHTEMEHEVA 539

Query:   534 CLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGI 593
             CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS+L+  FE  
Sbjct:   540 CLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRSILMTTFESS 599

Query:   594 SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVI 653
              YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VFA SDRPTVI
Sbjct:   600 HYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVI 659

Query:   654 YSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGE 713
             YSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKLHIR++PL E
Sbjct:   660 YSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYE 719

Query:   714 HPRRICHQEQSRTFAICS 731
              PR+IC+QE S+ F + S
Sbjct:   720 SPRKICYQEVSQCFGVLS 737

 Score = 940 (336.0 bits), Expect = 1.2e-312, Sum P(2) = 1.2e-312
 Identities = 193/377 (51%), Positives = 254/377 (67%)

Query:   724 SRTFAICSLKNQ-SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV 782
             S+ F+  +  ++ S  EE E+H + ++D  TFE +  +     EY  S++SC    D N 
Sbjct:   768 SKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNT 827

Query:   783 YYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 842
             Y+ VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  
Sbjct:   828 YFIVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINST 887

Query:   843 IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 902
             ++LY+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  E
Sbjct:   888 VRLYEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFE 943

Query:   903 ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 962
             E ARD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEF
Sbjct:   944 EIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEF 1003

Query:   963 VNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK 1021
             VN F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK
Sbjct:  1004 VNVFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIK 1063

Query:  1022 GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN---------- 1071
              VG + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +           
Sbjct:  1064 SVGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKRE 1123

Query:  1072 VSVEELCKRVEELTRLH 1088
              + ++L K VEELTR+H
Sbjct:  1124 ATADDLIKVVEELTRIH 1140


>UNIPROTKB|A1A4K3 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9913
            "Bos taurus" [GO:0080008 "Cul4-RING ubiquitin ligase complex"
            evidence=ISS] [GO:0031464 "Cul4A-RING ubiquitin ligase complex"
            evidence=ISS] [GO:0043161 "proteasomal ubiquitin-dependent protein
            catabolic process" evidence=ISS] [GO:0031465 "Cul4B-RING ubiquitin
            ligase complex" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISS]
            [GO:0005634 "nucleus" evidence=ISS] [GO:0042787 "protein
            ubiquitination involved in ubiquitin-dependent protein catabolic
            process" evidence=IEA] [GO:0016055 "Wnt receptor signaling pathway"
            evidence=IEA] [GO:0000075 "cell cycle checkpoint" evidence=IEA]
            [GO:0006281 "DNA repair" evidence=IEA] [GO:0003677 "DNA binding"
            evidence=IEA] InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178
            UniPathway:UPA00143 GO:GO:0005634 GO:GO:0005737 GO:GO:0043161
            GO:GO:0016055 Gene3D:2.130.10.10 GO:GO:0003677 GO:GO:0006281
            GO:GO:0042787 GO:GO:0000075 GO:GO:0031464 GO:GO:0031465
            eggNOG:NOG247734 HOGENOM:HOG000007241 KO:K10610 OMA:CALGDGS
            EMBL:BC126629 IPI:IPI00713891 RefSeq:NP_001073731.1
            UniGene:Bt.62917 STRING:A1A4K3 PRIDE:A1A4K3
            Ensembl:ENSBTAT00000028740 GeneID:511951 KEGG:bta:511951 CTD:1642
            GeneTree:ENSGT00530000063396 HOVERGEN:HBG005460 InParanoid:A1A4K3
            OrthoDB:EOG4KPT91 NextBio:20870176 Uniprot:A1A4K3
        Length = 1140

 Score = 2081 (737.6 bits), Expect = 2.0e-312, Sum P(2) = 2.0e-312
 Identities = 406/738 (55%), Positives = 538/738 (72%)

Query:     4 WNYVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYGRIA 63
             +NYVVTA KPT V     G+FTS ++LNL+IAK TR+EI+++T +GL+P+ +V +YG+IA
Sbjct:     3 YNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVVTAEGLRPVKEVGMYGKIA 62

Query:    64 TLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIGRPTDNGQIG 121
              +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIGRP++ G IG
Sbjct:    63 VMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIGRPSETGIIG 122

Query:   122 IIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLY 181
             IIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC  PTI  +Y
Sbjct:   123 IIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGCQAPTICFVY 182

Query:   182 QDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS 241
             QD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +IIG+E+I Y +
Sbjct:   183 QDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHN 241

Query:   242 ANAFKAI--PI-RPSITKAYGRVDADGSRYXXXXXXXXXXXXVITHEKEK---VT--GLK 293
              + + AI  PI + S    + RVD +GSRY            ++  E++    VT   L+
Sbjct:   242 GDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMDGTVTLKDLR 301

Query:   294 IELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVD 353
             +ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E + NLGPIVD
Sbjct:   302 VELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPIVD 361

Query:   354 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPF 413
              CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W LRS  +   
Sbjct:   362 MCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWPLRSDPNRET 421

Query:   414 DTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 473
             D  LV+SF+ +TR+L +N           GF    QT FC +  + QL+Q+TS SVRLVS
Sbjct:   422 DDTLVLSFVGQTRVLMLN-GEEVEETELMGFVDDQQTFFCGNVAHQQLIQITSASVRLVS 480

Query:   474 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEIS 533
                + L +EWK P G +++VA+ N+SQV++A G   L YL+I    L ++ H ++E+E++
Sbjct:   481 QEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQISHTEMEHEVA 539

Query:   534 CLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGI 593
             CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS+L+  FE  
Sbjct:   540 CLDITPLGDSNGMSPLCAIGLWTDISARIAKLPSFELLHKEMLGGEIIPRSILMTTFESS 599

Query:   594 SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVI 653
              YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VFA SDRPTVI
Sbjct:   600 HYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVI 659

Query:   654 YSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGE 713
             YSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKLHIR++PL E
Sbjct:   660 YSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYE 719

Query:   714 HPRRICHQEQSRTFAICS 731
              PR+IC+QE S+ F + S
Sbjct:   720 SPRKICYQEVSQCFGVLS 737

 Score = 940 (336.0 bits), Expect = 2.0e-312, Sum P(2) = 2.0e-312
 Identities = 193/377 (51%), Positives = 254/377 (67%)

Query:   724 SRTFAICSLKNQ-SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV 782
             S+ F+  +  ++ S  EE E+H + ++D  TFE +  +     EY  S++SC    D N 
Sbjct:   768 SKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNT 827

Query:   783 YYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 842
             Y+ VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  
Sbjct:   828 YFIVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINST 887

Query:   843 IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 902
             ++LY+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  E
Sbjct:   888 VRLYEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFE 943

Query:   903 ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 962
             E ARD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEF
Sbjct:   944 EIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEF 1003

Query:   963 VNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK 1021
             VN F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK
Sbjct:  1004 VNVFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIK 1063

Query:  1022 GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN---------- 1071
              VG + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +           
Sbjct:  1064 SVGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKRE 1123

Query:  1072 VSVEELCKRVEELTRLH 1088
              + ++L K VEELTR+H
Sbjct:  1124 ATADDLIKVVEELTRIH 1140


>MGI|MGI:1202384 [details] [associations]
            symbol:Ddb1 "damage specific DNA binding protein 1"
            species:10090 "Mus musculus" [GO:0000075 "cell cycle checkpoint"
            evidence=ISO] [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0003677 "DNA binding" evidence=IEA] [GO:0003684 "damaged DNA
            binding" evidence=ISO] [GO:0005634 "nucleus" evidence=ISO]
            [GO:0005737 "cytoplasm" evidence=ISO] [GO:0006281 "DNA repair"
            evidence=IEA] [GO:0006974 "response to DNA damage stimulus"
            evidence=IEA] [GO:0016055 "Wnt receptor signaling pathway"
            evidence=IDA] [GO:0031464 "Cul4A-RING ubiquitin ligase complex"
            evidence=ISO] [GO:0031465 "Cul4B-RING ubiquitin ligase complex"
            evidence=ISO] [GO:0042787 "protein ubiquitination involved in
            ubiquitin-dependent protein catabolic process" evidence=ISO]
            [GO:0043161 "proteasomal ubiquitin-dependent protein catabolic
            process" evidence=ISO] [GO:0080008 "Cul4-RING ubiquitin ligase
            complex" evidence=ISO] InterPro:IPR004871 InterPro:IPR015943
            Pfam:PF03178 UniPathway:UPA00143 MGI:MGI:1202384 GO:GO:0005634
            GO:GO:0005737 GO:GO:0043161 GO:GO:0016055 Gene3D:2.130.10.10
            GO:GO:0003684 GO:GO:0006281 GO:GO:0042787 GO:GO:0000075
            GO:GO:0031464 GO:GO:0031465 eggNOG:NOG247734 KO:K10610 OMA:CALGDGS
            CTD:1642 GeneTree:ENSGT00530000063396 HOVERGEN:HBG005460
            HSSP:Q16531 ChiTaRS:DDB1 EMBL:AB026432 EMBL:AF159853 EMBL:AK146522
            EMBL:AK152228 EMBL:AK154303 EMBL:AK155020 EMBL:AK155920
            EMBL:AK157491 EMBL:BC002210 EMBL:BC009661 IPI:IPI00316740
            PIR:JC7152 RefSeq:NP_056550.1 UniGene:Mm.289915 UniGene:Mm.466856
            ProteinModelPortal:Q3U1J4 SMR:Q3U1J4 IntAct:Q3U1J4 STRING:Q3U1J4
            PaxDb:Q3U1J4 PRIDE:Q3U1J4 Ensembl:ENSMUST00000025649 GeneID:13194
            KEGG:mmu:13194 UCSC:uc008gqm.1 InParanoid:Q3U1J4 NextBio:283320
            Bgee:Q3U1J4 CleanEx:MM_DDB1 Genevestigator:Q3U1J4 Uniprot:Q3U1J4
        Length = 1140

 Score = 2081 (737.6 bits), Expect = 2.0e-312, Sum P(2) = 2.0e-312
 Identities = 406/738 (55%), Positives = 537/738 (72%)

Query:     4 WNYVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYGRIA 63
             +NYVVTA KPT V     G+FTS ++LNL+IAK TR+EI+++T +GL+P+ +V +YG+IA
Sbjct:     3 YNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVVTAEGLRPVKEVGMYGKIA 62

Query:    64 TLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIGRPTDNGQIG 121
              +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIGRP++ G IG
Sbjct:    63 VMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIGRPSETGIIG 122

Query:   122 IIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLY 181
             IIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC  PTI  +Y
Sbjct:   123 IIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGCQAPTICFVY 182

Query:   182 QDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS 241
             QD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +IIG+E+I Y +
Sbjct:   183 QDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHN 241

Query:   242 ANAFKAI--PI-RPSITKAYGRVDADGSRYXXXXXXXXXXXXVITHEKEK---VT--GLK 293
              + + AI  PI + S    + RVD +GSRY            ++  E++    VT   L+
Sbjct:   242 GDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMDGTVTLKDLR 301

Query:   294 IELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVD 353
             +ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E + NLGPIVD
Sbjct:   302 VELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPIVD 361

Query:   354 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPF 413
              CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W LRS      
Sbjct:   362 MCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWPLRSDPGRET 421

Query:   414 DTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 473
             D  LV+SF+ +TR+L +N           GF    QT FC +  + QL+Q+TS SVRLVS
Sbjct:   422 DDTLVLSFVGQTRVLMLN-GEEVEETELMGFVDDQQTFFCGNVAHQQLIQITSASVRLVS 480

Query:   474 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEIS 533
                + L +EWK P G +++VA+ N+SQV++A G   L YL+I    L ++ H ++E+E++
Sbjct:   481 QEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQISHTEMEHEVA 539

Query:   534 CLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGI 593
             CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS+L+  FE  
Sbjct:   540 CLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRSILMTTFESS 599

Query:   594 SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVI 653
              YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VFA SDRPTVI
Sbjct:   600 HYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVI 659

Query:   654 YSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGE 713
             YSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKLHIR++PL E
Sbjct:   660 YSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYE 719

Query:   714 HPRRICHQEQSRTFAICS 731
              PR+IC+QE S+ F + S
Sbjct:   720 SPRKICYQEVSQCFGVLS 737

 Score = 940 (336.0 bits), Expect = 2.0e-312, Sum P(2) = 2.0e-312
 Identities = 193/377 (51%), Positives = 254/377 (67%)

Query:   724 SRTFAICSLKNQ-SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV 782
             S+ F+  +  ++ S  EE E+H + ++D  TFE +  +     EY  S++SC    D N 
Sbjct:   768 SKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNT 827

Query:   783 YYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 842
             Y+ VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  
Sbjct:   828 YFIVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINST 887

Query:   843 IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 902
             ++LY+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  E
Sbjct:   888 VRLYEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFE 943

Query:   903 ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 962
             E ARD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEF
Sbjct:   944 EIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEF 1003

Query:   963 VNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK 1021
             VN F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK
Sbjct:  1004 VNVFCHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIK 1063

Query:  1022 GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN---------- 1071
              VG + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +           
Sbjct:  1064 SVGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKRE 1123

Query:  1072 VSVEELCKRVEELTRLH 1088
              + ++L K VEELTR+H
Sbjct:  1124 ATADDLIKVVEELTRIH 1140


>UNIPROTKB|F1RIE2 [details] [associations]
            symbol:DDB1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0043161 "proteasomal ubiquitin-dependent protein
            catabolic process" evidence=IEA] [GO:0042787 "protein
            ubiquitination involved in ubiquitin-dependent protein catabolic
            process" evidence=IEA] [GO:0031465 "Cul4B-RING ubiquitin ligase
            complex" evidence=IEA] [GO:0031464 "Cul4A-RING ubiquitin ligase
            complex" evidence=IEA] [GO:0016055 "Wnt receptor signaling pathway"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0000075
            "cell cycle checkpoint" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
            InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178 GO:GO:0005634
            GO:GO:0005737 GO:GO:0043161 GO:GO:0016055 Gene3D:2.130.10.10
            GO:GO:0003676 GO:GO:0042787 GO:GO:0000075 GO:GO:0031464
            GO:GO:0031465 KO:K10610 OMA:CALGDGS GeneTree:ENSGT00530000063396
            EMBL:CU462918 RefSeq:XP_003122699.1 Ensembl:ENSSSCT00000014314
            GeneID:100522239 KEGG:ssc:100522239 Uniprot:F1RIE2
        Length = 1140

 Score = 2078 (736.6 bits), Expect = 4.2e-312, Sum P(2) = 4.2e-312
 Identities = 405/738 (54%), Positives = 538/738 (72%)

Query:     4 WNYVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYGRIA 63
             +NYVVTA KPT V     G+FTS ++LNL+IAK TR+EI+++T +GL+P+ +V +YG+IA
Sbjct:     3 YNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVVTAEGLRPVKEVGMYGKIA 62

Query:    64 TLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIGRPTDNGQIG 121
              +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIGRP++ G IG
Sbjct:    63 VMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIGRPSETGIIG 122

Query:   122 IIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLY 181
             IIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC  PTI  +Y
Sbjct:   123 IIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGCQAPTICFVY 182

Query:   182 QDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS 241
             QD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +IIG+E+I Y +
Sbjct:   183 QDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHN 241

Query:   242 ANAFKAI--PI-RPSITKAYGRVDADGSRYXXXXXXXXXXXXVITHEKEK---VT--GLK 293
              + + AI  PI + S    + RVD +GSRY            ++  E++    VT   L+
Sbjct:   242 GDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMDGTVTLKDLR 301

Query:   294 IELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVD 353
             +ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E + NLGPIVD
Sbjct:   302 VELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPIVD 361

Query:   354 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPF 413
              CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W LRS  +   
Sbjct:   362 MCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWPLRSDPNRET 421

Query:   414 DTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 473
             D  LV+SF+ +TR+L +N           GF    QT FC +  + QL+Q+TS SVRLVS
Sbjct:   422 DDTLVLSFVGQTRVLMLN-GEEVEETELMGFVDDQQTFFCGNVAHQQLIQITSASVRLVS 480

Query:   474 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEIS 533
                + L +EWK P G +++VA+ N++QV++A G   L YL+I    L ++ H ++E+E++
Sbjct:   481 QEPKALVSEWKEPQGKNISVASCNSNQVVVAVGRA-LYYLQIHPQELRQISHTEMEHEVA 539

Query:   534 CLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGI 593
             CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS+L+  FE  
Sbjct:   540 CLDITPLGDSNGLSPLCAIGLWTDISARISKLPSFELLHKEMLGGEIIPRSILMTTFESS 599

Query:   594 SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVI 653
              YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VFA SDRPTVI
Sbjct:   600 HYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVI 659

Query:   654 YSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGE 713
             YSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKLHIR++PL E
Sbjct:   660 YSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYE 719

Query:   714 HPRRICHQEQSRTFAICS 731
              PR+IC+QE S+ F + S
Sbjct:   720 SPRKICYQEVSQCFGVLS 737

 Score = 940 (336.0 bits), Expect = 4.2e-312, Sum P(2) = 4.2e-312
 Identities = 193/377 (51%), Positives = 254/377 (67%)

Query:   724 SRTFAICSLKNQ-SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV 782
             S+ F+  +  ++ S  EE E+H + ++D  TFE +  +     EY  S++SC    D N 
Sbjct:   768 SKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNT 827

Query:   783 YYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 842
             Y+ VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  
Sbjct:   828 YFIVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINST 887

Query:   843 IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 902
             ++LY+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  E
Sbjct:   888 VRLYEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFE 943

Query:   903 ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 962
             E ARD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEF
Sbjct:   944 EIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEF 1003

Query:   963 VNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK 1021
             VN F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK
Sbjct:  1004 VNVFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIK 1063

Query:  1022 GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN---------- 1071
              VG + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +           
Sbjct:  1064 SVGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKRE 1123

Query:  1072 VSVEELCKRVEELTRLH 1088
              + ++L K VEELTR+H
Sbjct:  1124 ATADDLIKVVEELTRIH 1140


>UNIPROTKB|Q16531 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9606
            "Homo sapiens" [GO:0019048 "virus-host interaction" evidence=IEA]
            [GO:0016055 "Wnt receptor signaling pathway" evidence=IEA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005737 "cytoplasm"
            evidence=IDA] [GO:0000075 "cell cycle checkpoint" evidence=IMP]
            [GO:0005634 "nucleus" evidence=IDA] [GO:0042787 "protein
            ubiquitination involved in ubiquitin-dependent protein catabolic
            process" evidence=IDA] [GO:0031464 "Cul4A-RING ubiquitin ligase
            complex" evidence=IDA] [GO:0031465 "Cul4B-RING ubiquitin ligase
            complex" evidence=IDA] [GO:0043161 "proteasomal ubiquitin-dependent
            protein catabolic process" evidence=IMP] [GO:0080008 "Cul4-RING
            ubiquitin ligase complex" evidence=IDA] [GO:0003677 "DNA binding"
            evidence=TAS] [GO:0003684 "damaged DNA binding" evidence=TAS]
            [GO:0000718 "nucleotide-excision repair, DNA damage removal"
            evidence=TAS] [GO:0005654 "nucleoplasm" evidence=TAS] [GO:0006281
            "DNA repair" evidence=TAS] [GO:0006289 "nucleotide-excision repair"
            evidence=TAS] Reactome:REACT_216 InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 UniPathway:UPA00143 EMBL:U32986
            GO:GO:0005737 GO:GO:0019048 GO:GO:0005654 GO:GO:0043161
            GO:GO:0016055 Gene3D:2.130.10.10 GO:GO:0003684 EMBL:CH471076
            GO:GO:0042787 GO:GO:0000075 GO:GO:0000718 EMBL:AP003108
            GO:GO:0031464 PDB:2HYE PDB:4A0K PDBsum:2HYE PDBsum:4A0K PDB:4A0L
            PDBsum:4A0L GO:GO:0031465 PDB:3I7P PDBsum:3I7P PDB:3I8C PDBsum:3I8C
            PDB:3I89 PDBsum:3I89 PDB:3I7O PDBsum:3I7O PDB:3I8E PDBsum:3I8E
            eggNOG:NOG247734 HOGENOM:HOG000007241 KO:K10610 OMA:CALGDGS
            CTD:1642 HOVERGEN:HBG005460 OrthoDB:EOG4KPT91 EMBL:U18299
            EMBL:L40326 EMBL:AJ002955 EMBL:AK312436 EMBL:AY960579 EMBL:BC011686
            EMBL:BC050530 EMBL:BC051764 IPI:IPI00293464 PIR:I38908
            RefSeq:NP_001914.3 UniGene:Hs.290758 PDB:2B5L PDB:2B5M PDB:2B5N
            PDB:3E0C PDB:3EI1 PDB:3EI2 PDB:3EI3 PDB:3EI4 PDB:3I7H PDB:3I7K
            PDB:3I7L PDB:3I7N PDB:4A08 PDB:4A09 PDB:4A0A PDB:4A0B PDB:4A11
            PDB:4E54 PDB:4E5Z PDBsum:2B5L PDBsum:2B5M PDBsum:2B5N PDBsum:3E0C
            PDBsum:3EI1 PDBsum:3EI2 PDBsum:3EI3 PDBsum:3EI4 PDBsum:3I7H
            PDBsum:3I7K PDBsum:3I7L PDBsum:3I7N PDBsum:4A08 PDBsum:4A09
            PDBsum:4A0A PDBsum:4A0B PDBsum:4A11 PDBsum:4E54 PDBsum:4E5Z
            ProteinModelPortal:Q16531 SMR:Q16531 DIP:DIP-430N IntAct:Q16531
            MINT:MINT-1134697 STRING:Q16531 PhosphoSite:Q16531 PaxDb:Q16531
            PRIDE:Q16531 Ensembl:ENST00000301764 GeneID:1642 KEGG:hsa:1642
            UCSC:uc001nrc.4 GeneCards:GC11M061066 H-InvDB:HIX0171380
            HGNC:HGNC:2717 HPA:CAB032821 MIM:600045 neXtProt:NX_Q16531
            PharmGKB:PA27187 InParanoid:Q16531 ChiTaRS:DDB1
            EvolutionaryTrace:Q16531 GenomeRNAi:1642 NextBio:6750
            ArrayExpress:Q16531 Bgee:Q16531 CleanEx:HS_DDB1
            Genevestigator:Q16531 GermOnline:ENSG00000167986 Uniprot:Q16531
        Length = 1140

 Score = 2077 (736.2 bits), Expect = 5.4e-312, Sum P(2) = 5.4e-312
 Identities = 405/738 (54%), Positives = 537/738 (72%)

Query:     4 WNYVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYGRIA 63
             +NYVVTA KPT V     G+FTS ++LNL+IAK TR+EI+++T +GL+P+ +V +YG+IA
Sbjct:     3 YNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVVTAEGLRPVKEVGMYGKIA 62

Query:    64 TLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIGRPTDNGQIG 121
              +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIGRP++ G IG
Sbjct:    63 VMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIGRPSETGIIG 122

Query:   122 IIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLY 181
             IIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC  PTI  +Y
Sbjct:   123 IIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGCQAPTICFVY 182

Query:   182 QDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS 241
             QD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +IIG+E+I Y +
Sbjct:   183 QDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHN 241

Query:   242 ANAFKAI--PI-RPSITKAYGRVDADGSRYXXXXXXXXXXXXVITHEKEK---VT--GLK 293
              + + AI  PI + S    + RVD +GSRY            ++  E++    VT   L+
Sbjct:   242 GDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMDGTVTLKDLR 301

Query:   294 IELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVD 353
             +ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E + NLGPIVD
Sbjct:   302 VELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPIVD 361

Query:   354 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPF 413
              CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W LRS  +   
Sbjct:   362 MCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWPLRSDPNRET 421

Query:   414 DTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 473
             D  LV+SF+ +TR+L +N           GF    QT FC +  + QL+Q+TS SVRLVS
Sbjct:   422 DDTLVLSFVGQTRVLMLN-GEEVEETELMGFVDDQQTFFCGNVAHQQLIQITSASVRLVS 480

Query:   474 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEIS 533
                + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ H ++E+E++
Sbjct:   481 QEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQISHTEMEHEVA 539

Query:   534 CLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGI 593
             CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS+L+  FE  
Sbjct:   540 CLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRSILMTTFESS 599

Query:   594 SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVI 653
              YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VFA SDRPTVI
Sbjct:   600 HYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVI 659

Query:   654 YSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGE 713
             YSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKLHIR++PL E
Sbjct:   660 YSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYE 719

Query:   714 HPRRICHQEQSRTFAICS 731
              PR+IC+QE S+ F + S
Sbjct:   720 SPRKICYQEVSQCFGVLS 737

 Score = 940 (336.0 bits), Expect = 5.4e-312, Sum P(2) = 5.4e-312
 Identities = 193/377 (51%), Positives = 254/377 (67%)

Query:   724 SRTFAICSLKNQ-SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV 782
             S+ F+  +  ++ S  EE E+H + ++D  TFE +  +     EY  S++SC    D N 
Sbjct:   768 SKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNT 827

Query:   783 YYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 842
             Y+ VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  
Sbjct:   828 YFIVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINST 887

Query:   843 IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 902
             ++LY+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  E
Sbjct:   888 VRLYEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFE 943

Query:   903 ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 962
             E ARD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEF
Sbjct:   944 EIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEF 1003

Query:   963 VNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK 1021
             VN F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK
Sbjct:  1004 VNVFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIK 1063

Query:  1022 GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN---------- 1071
              VG + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +           
Sbjct:  1064 SVGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKRE 1123

Query:  1072 VSVEELCKRVEELTRLH 1088
              + ++L K VEELTR+H
Sbjct:  1124 ATADDLIKVVEELTRIH 1140


>UNIPROTKB|Q6P6Z0 [details] [associations]
            symbol:ddb1 "DNA damage-binding protein 1" species:8355
            "Xenopus laevis" [GO:0005634 "nucleus" evidence=ISS] [GO:0005737
            "cytoplasm" evidence=ISS] [GO:0031464 "Cul4A-RING ubiquitin ligase
            complex" evidence=ISS] [GO:0031465 "Cul4B-RING ubiquitin ligase
            complex" evidence=ISS] [GO:0043161 "proteasomal ubiquitin-dependent
            protein catabolic process" evidence=ISS] [GO:0080008 "Cul4-RING
            ubiquitin ligase complex" evidence=ISS] InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 UniPathway:UPA00143 GO:GO:0005634
            GO:GO:0005737 GO:GO:0043161 Gene3D:2.130.10.10 GO:GO:0003677
            GO:GO:0006281 GO:GO:0016567 GO:GO:0031464 GO:GO:0031465 KO:K10610
            CTD:1642 HOVERGEN:HBG005460 HSSP:Q16531 EMBL:BC061946
            RefSeq:NP_001083624.1 UniGene:Xl.23906 PRIDE:Q6P6Z0 GeneID:399026
            KEGG:xla:399026 Xenbase:XB-GENE-967911 Uniprot:Q6P6Z0
        Length = 1140

 Score = 2078 (736.6 bits), Expect = 1.1e-311, Sum P(2) = 1.1e-311
 Identities = 414/741 (55%), Positives = 539/741 (72%)

Query:     4 WNYVVTAHKPTNVTHSCV-GNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYGRI 62
             +NYVVTA KPT V ++CV G+FTS  +LNL+IAK TR+EI+++TP+GL+P+ +V +YG+I
Sbjct:     3 YNYVVTAQKPTAV-NACVTGHFTSEDDLNLLIAKNTRLEIYVVTPEGLRPVKEVGMYGKI 61

Query:    63 ATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIGRPTDNGQI 120
             A +ELFRP GE++D LFI T +Y  C+L++    +S ++ITRA G+V DRIGRP++ G I
Sbjct:    62 AVMELFRPKGESKDLLFILTAKYNACILEYKQSGDSIDIITRAHGNVQDRIGRPSETGII 121

Query:   121 GIIDPDCRLIGLHLYDGLFKVIPF--DNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIV 178
             GIIDPDCR+IGL LYDGLFKVIP   DNK +LK AFNIRLEEL V+D+KFLY C  PTI 
Sbjct:   122 GIIDPDCRMIGLRLYDGLFKVIPLERDNK-ELK-AFNIRLEELHVIDVKFLYSCQAPTIC 179

Query:   179 VLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIV 238
              +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +IIG+E+I 
Sbjct:   180 FVYQDPQ-GRHVKTYEVSLREKEFSKGPWKQENVEAEASMVIAVPEPFGGAIIIGQESIT 238

Query:   239 YCSANAFKAI--PI-RPSITKAYGRVDADGSRYXXXXXXXXXXXXVITHEKE---KVT-- 290
             Y + + + AI  PI + S    + RVD +GSRY            ++  E++    VT  
Sbjct:   239 YHNGDKYLAIAPPIIKQSTIVCHNRVDVNGSRYLLGDMEGRLFMLLLEKEEQMDGSVTLK 298

Query:   291 GLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGP 350
              L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KL  + + +GSYV V+E + NLGP
Sbjct:   299 DLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLTTESNEQGSYVVVMETFTNLGP 358

Query:   351 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTD 410
             IVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W LR + D
Sbjct:   359 IVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWPLRVAAD 418

Query:   411 DPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQVTSGSVR 470
                D  LV+SF+ +TR+L +            GF    QT FC +  + QL+Q+TS SVR
Sbjct:   419 RDTDDTLVLSFVGQTRVLTLT-GEEVEETDLAGFVDDQQTFFCGNVAHQQLIQITSASVR 477

Query:   471 LVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY 530
             LVS   + L +EWK P G  V+V + N+ QVLLA G   L YLEI  G L +    ++E+
Sbjct:   478 LVSQNPQNLVSEWKEPQGRKVSVCSCNSRQVLLAVGRV-LYYLEIHPGELRQTSCTEMEH 536

Query:   531 EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAF 590
             E++CLD+ P+G N + S + A+G+WTDIS RI SLP   L+ KE LGGEIIPRS+L+ +F
Sbjct:   537 EVACLDVTPLGGNDTLSSLCAIGLWTDISARILSLPGFQLLHKEMLGGEIIPRSILMTSF 596

Query:   591 EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRP 650
             E   YLLCALGDG L  F LN  TG L+DRKKV+LGTQP  LRTF S +TT+VFA SDRP
Sbjct:   597 ESSHYLLCALGDGALFYFSLNTDTGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFACSDRP 656

Query:   651 TVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIP 710
             TVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKLHIR++P
Sbjct:   657 TVIYSSNHKLVFSNVNLKEVNYMCPLNSEGYPDSLALANNSTLTIGTIDEIQKLHIRTVP 716

Query:   711 LGEHPRRICHQEQSRTFAICS 731
             L E PR+IC+QE S+ F + S
Sbjct:   717 LFESPRKICYQEVSQCFGVLS 737

 Score = 936 (334.5 bits), Expect = 1.1e-311, Sum P(2) = 1.1e-311
 Identities = 191/364 (52%), Positives = 249/364 (68%)

Query:   736 SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEE 795
             S  EE E+H + ++D  TFE + T+     EY  S++SC    D   Y+ VGTA V P+E
Sbjct:   781 SFGEEVEVHNLLIIDQHTFEVLHTHQFLQNEYTLSLVSCKLGKDPTTYFVVGTAMVYPDE 840

Query:   796 NEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGT 855
              EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++LY+W       
Sbjct:   841 AEPKQGRIVVFQYNDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTAE---- 896

Query:   856 RELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSA 915
             +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE ARD+N NWMSA
Sbjct:   897 KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSA 956

Query:   916 VEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR- 974
             VEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN F HGSLVM+ 
Sbjct:   957 VEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFCHGSLVMQN 1016

Query:   975 LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 1034
             L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG + H  WRSF
Sbjct:  1017 LGETSPPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDVQNRLNKVIKSVGKIEHSFWRSF 1076

Query:  1035 NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SVEELCKRVEEL 1084
             + E+KT  A  F+DGDLIESFLD+SR +M E+   + +          +V++L K VEEL
Sbjct:  1077 HTERKTEPATGFIDGDLIESFLDISRPKMQEVIANLQIDDGSGMKRETTVDDLIKVVEEL 1136

Query:  1085 TRLH 1088
             TR+H
Sbjct:  1137 TRIH 1140


>UNIPROTKB|Q5R649 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9601
            "Pongo abelii" [GO:0005634 "nucleus" evidence=ISS] [GO:0005737
            "cytoplasm" evidence=ISS] [GO:0031464 "Cul4A-RING ubiquitin ligase
            complex" evidence=ISS] [GO:0031465 "Cul4B-RING ubiquitin ligase
            complex" evidence=ISS] [GO:0043161 "proteasomal ubiquitin-dependent
            protein catabolic process" evidence=ISS] [GO:0080008 "Cul4-RING
            ubiquitin ligase complex" evidence=ISS] InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 UniPathway:UPA00143 GO:GO:0005634
            GO:GO:0005737 GO:GO:0043161 Gene3D:2.130.10.10 GO:GO:0003677
            GO:GO:0006281 GO:GO:0016567 GO:GO:0031464 GO:GO:0031465 KO:K10610
            CTD:1642 HOVERGEN:HBG005460 HSSP:Q16531 EMBL:CR860647
            RefSeq:NP_001126613.1 UniGene:Pab.18111 GeneID:100173610
            KEGG:pon:100173610 InParanoid:Q5R649 Uniprot:Q5R649
        Length = 1140

 Score = 2078 (736.6 bits), Expect = 1.4e-311, Sum P(2) = 1.4e-311
 Identities = 405/738 (54%), Positives = 537/738 (72%)

Query:     4 WNYVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYGRIA 63
             +NYVVTA KPT V     G+FTS ++LNL+IAK TR+EI+++T +GL+P+ +V +YG+IA
Sbjct:     3 YNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVVTAEGLRPVKEVGMYGKIA 62

Query:    64 TLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIGRPTDNGQIG 121
              +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIGRP++ G IG
Sbjct:    63 VMELFRPKGESKDLLFILTAKYNVCILEYKQSGESIDIITRAHGNVQDRIGRPSETGIIG 122

Query:   122 IIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLY 181
             IIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC  PTI  +Y
Sbjct:   123 IIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGCQAPTICFVY 182

Query:   182 QDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS 241
             QD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +IIG+E+I Y +
Sbjct:   183 QDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHN 241

Query:   242 ANAFKAI--PI-RPSITKAYGRVDADGSRYXXXXXXXXXXXXVITHEKEK---VT--GLK 293
              + + AI  PI + S    + RVD +GSRY            ++  E++    VT   L+
Sbjct:   242 GDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMDGTVTLKDLR 301

Query:   294 IELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVD 353
             +ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E + NLGPIVD
Sbjct:   302 VELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPIVD 361

Query:   354 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPF 413
              CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W LRS  +   
Sbjct:   362 MCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWPLRSDPNRET 421

Query:   414 DTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 473
             D  LV+SF+ +TR+L +N           GF    QT FC +  + QL+Q+TS SVRLVS
Sbjct:   422 DDTLVLSFVGQTRVLMLN-GEEVEETELMGFVDDQQTFFCGNVAHQQLIQITSASVRLVS 480

Query:   474 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEIS 533
                + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ H ++E+E++
Sbjct:   481 QEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQISHTEMEHEVA 539

Query:   534 CLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGI 593
             CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS+L+  FE  
Sbjct:   540 CLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRSILMTTFESS 599

Query:   594 SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVI 653
              YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VFA SDRPTVI
Sbjct:   600 HYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVI 659

Query:   654 YSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGE 713
             YSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKLHIR++PL E
Sbjct:   660 YSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYE 719

Query:   714 HPRRICHQEQSRTFAICS 731
              PR+IC+QE S+ F + S
Sbjct:   720 SPRKICYQEVSQCFGVLS 737

 Score = 935 (334.2 bits), Expect = 1.4e-311, Sum P(2) = 1.4e-311
 Identities = 192/377 (50%), Positives = 253/377 (67%)

Query:   724 SRTFAICSLKNQ-SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV 782
             S+ F+  +  ++ S  EE E+H + ++D  TFE +  +     EY  S++SC    D N 
Sbjct:   768 SKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNT 827

Query:   783 YYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 842
             Y+ VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVY +  FNGKLLA+IN  
Sbjct:   828 YFIVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYPMVEFNGKLLASINST 887

Query:   843 IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 902
             ++LY+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  E
Sbjct:   888 VRLYEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFE 943

Query:   903 ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 962
             E ARD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEF
Sbjct:   944 EIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEF 1003

Query:   963 VNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK 1021
             VN F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK
Sbjct:  1004 VNVFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIK 1063

Query:  1022 GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN---------- 1071
              VG + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +           
Sbjct:  1064 SVGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKRE 1123

Query:  1072 VSVEELCKRVEELTRLH 1088
              + ++L K VEELTR+H
Sbjct:  1124 ATADDLIKVVEELTRIH 1140


>UNIPROTKB|P33194 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9534
            "Chlorocebus aethiops" [GO:0005634 "nucleus" evidence=ISS]
            [GO:0005737 "cytoplasm" evidence=ISS] [GO:0031464 "Cul4A-RING
            ubiquitin ligase complex" evidence=ISS] [GO:0031465 "Cul4B-RING
            ubiquitin ligase complex" evidence=ISS] [GO:0043161 "proteasomal
            ubiquitin-dependent protein catabolic process" evidence=ISS]
            [GO:0080008 "Cul4-RING ubiquitin ligase complex" evidence=ISS]
            InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178
            UniPathway:UPA00143 GO:GO:0005634 GO:GO:0005737 GO:GO:0043161
            Gene3D:2.130.10.10 GO:GO:0003677 GO:GO:0006281 GO:GO:0016567
            GO:GO:0031464 GO:GO:0031465 HOVERGEN:HBG005460 EMBL:L20216
            PIR:S38777 PRIDE:P33194 Uniprot:P33194
        Length = 1140

 Score = 2071 (734.1 bits), Expect = 2.3e-311, Sum P(2) = 2.3e-311
 Identities = 404/738 (54%), Positives = 536/738 (72%)

Query:     4 WNYVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYGRIA 63
             +NYVVTA KPT V      +FTS ++LNL+IAK TR+EI+++T +GL+P+ +V +YG+IA
Sbjct:     3 YNYVVTAQKPTAVNGCVTAHFTSAEDLNLLIAKNTRLEIYVVTAEGLRPVKEVGMYGKIA 62

Query:    64 TLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIGRPTDNGQIG 121
              +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIGRP++ G IG
Sbjct:    63 VMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIGRPSETGIIG 122

Query:   122 IIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLY 181
             IIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC  PTI  +Y
Sbjct:   123 IIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGCQAPTICFVY 182

Query:   182 QDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS 241
             QD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +IIG+E+I Y +
Sbjct:   183 QDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHN 241

Query:   242 ANAFKAI--PI-RPSITKAYGRVDADGSRYXXXXXXXXXXXXVITHEKEK---VT--GLK 293
              + + AI  PI + S    + RVD +GSRY            ++  E++    VT   L+
Sbjct:   242 GDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMDGTVTLKDLR 301

Query:   294 IELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVD 353
             +ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E + NLGPIVD
Sbjct:   302 VELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPIVD 361

Query:   354 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPF 413
              CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W LRS  +   
Sbjct:   362 MCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWPLRSDPNRET 421

Query:   414 DTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 473
             D  LV+SF+ +TR+L +N           GF    QT FC +  + QL+Q+TS SVRLVS
Sbjct:   422 DDTLVLSFVGQTRVLMLN-GEEVEETELMGFVDDQQTFFCGNVAHQQLIQITSASVRLVS 480

Query:   474 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEIS 533
                + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ H ++E+E++
Sbjct:   481 QEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQISHTEMEHEVA 539

Query:   534 CLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGI 593
             CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS+L+  FE  
Sbjct:   540 CLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRSILMTTFESS 599

Query:   594 SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVI 653
              YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VFA SDRPTVI
Sbjct:   600 HYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVI 659

Query:   654 YSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGE 713
             YSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKLHIR++PL E
Sbjct:   660 YSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYE 719

Query:   714 HPRRICHQEQSRTFAICS 731
              PR+IC+QE S+ F + S
Sbjct:   720 SPRKICYQEVSQCFGVLS 737

 Score = 940 (336.0 bits), Expect = 2.3e-311, Sum P(2) = 2.3e-311
 Identities = 193/377 (51%), Positives = 254/377 (67%)

Query:   724 SRTFAICSLKNQ-SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV 782
             S+ F+  +  ++ S  EE E+H + ++D  TFE +  +     EY  S++SC    D N 
Sbjct:   768 SKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNT 827

Query:   783 YYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 842
             Y+ VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  
Sbjct:   828 YFIVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINST 887

Query:   843 IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 902
             ++LY+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  E
Sbjct:   888 VRLYEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFE 943

Query:   903 ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 962
             E ARD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEF
Sbjct:   944 EIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEF 1003

Query:   963 VNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK 1021
             VN F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK
Sbjct:  1004 VNVFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIK 1063

Query:  1022 GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN---------- 1071
              VG + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +           
Sbjct:  1064 SVGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKRE 1123

Query:  1072 VSVEELCKRVEELTRLH 1088
              + ++L K VEELTR+H
Sbjct:  1124 ATADDLIKVVEELTRIH 1140


>UNIPROTKB|Q805F9 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9031
            "Gallus gallus" [GO:0003677 "DNA binding" evidence=IEA] [GO:0016567
            "protein ubiquitination" evidence=IEA] [GO:0005654 "nucleoplasm"
            evidence=TAS] [GO:0006281 "DNA repair" evidence=TAS] [GO:0080008
            "Cul4-RING ubiquitin ligase complex" evidence=ISS] [GO:0031465
            "Cul4B-RING ubiquitin ligase complex" evidence=ISS] [GO:0005634
            "nucleus" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISS]
            [GO:0043161 "proteasomal ubiquitin-dependent protein catabolic
            process" evidence=ISS] [GO:0031464 "Cul4A-RING ubiquitin ligase
            complex" evidence=ISS] InterPro:IPR004871 InterPro:IPR015943
            Pfam:PF03178 UniPathway:UPA00143 GO:GO:0005737 GO:GO:0005654
            GO:GO:0043161 Gene3D:2.130.10.10 GO:GO:0003677 GO:GO:0006281
            GO:GO:0016567 Reactome:REACT_115612 GO:GO:0031464 GO:GO:0031465
            eggNOG:NOG247734 HOGENOM:HOG000007241 KO:K10610 CTD:1642
            HOVERGEN:HBG005460 OrthoDB:EOG4KPT91 HSSP:Q16531 EMBL:AB074298
            EMBL:AJ719779 IPI:IPI00597295 RefSeq:NP_989547.1 UniGene:Gga.12977
            STRING:Q805F9 PRIDE:Q805F9 GeneID:374050 KEGG:gga:374050
            NextBio:20813572 Uniprot:Q805F9
        Length = 1140

 Score = 2069 (733.4 bits), Expect = 3.8e-311, Sum P(2) = 3.8e-311
 Identities = 402/738 (54%), Positives = 538/738 (72%)

Query:     4 WNYVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYGRIA 63
             +NYVVTA KPT V     G+FTS ++LNL+IAK TR+EI+++T +GL+P+ +V +YG+ A
Sbjct:     3 YNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVVTAEGLRPVKEVGMYGKTA 62

Query:    64 TLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIGRPTDNGQIG 121
              +ELFRP GE++D LFI T +Y  C+L++  + ++ ++ITRA G+V DRIGRP++ G IG
Sbjct:    63 VMELFRPKGESKDLLFILTAKYNACILEYKQNGDNIDIITRAHGNVQDRIGRPSETGIIG 122

Query:   122 IIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLY 181
             IIDP+CR+IGL LYDGLFKVIP D + +  +AFNIRLEELQV+D+KFLYGC  PTI  +Y
Sbjct:   123 IIDPECRMIGLRLYDGLFKVIPLDRENKELKAFNIRLEELQVIDVKFLYGCQAPTICFVY 182

Query:   182 QDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS 241
             QD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +IIG+E+I Y +
Sbjct:   183 QDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHN 241

Query:   242 ANAFKAI--PI-RPSITKAYGRVDADGSRYXXXXXXXXXXXXVITHEKEK---VT--GLK 293
              + + AI  PI + S    + RVD +GSRY            ++  E++    VT   L+
Sbjct:   242 GDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMDGTVTLKDLR 301

Query:   294 IELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVD 353
             +ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E + NLGPIVD
Sbjct:   302 VELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPIVD 361

Query:   354 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPF 413
              CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W LRS +    
Sbjct:   362 MCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWPLRSDSHREM 421

Query:   414 DTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 473
             D  LV+SF+ +TR+L +N           GF    QT FC +  + QL+Q+TS SVRLVS
Sbjct:   422 DNMLVLSFVGQTRVLMLN-GEEVEETELTGFVDDQQTFFCGNVAHQQLIQITSASVRLVS 480

Query:   474 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEIS 533
                + L +EWK P G +++VA+ N++QV++A G   L YLEI    L ++   ++E+E++
Sbjct:   481 QEPKALVSEWKEPNGKNISVASCNSNQVVVAVGRA-LYYLEIRPQELRQINCTEMEHEVA 539

Query:   534 CLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGI 593
             CLDI P+G+    S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS+L+  FE  
Sbjct:   540 CLDITPLGDTNGMSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRSILMTTFESS 599

Query:   594 SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVI 653
              YLLCALGDG L  F L+++TG L+DRKKV+LGTQP  LRTF S +TT+VFA SDRPTVI
Sbjct:   600 HYLLCALGDGALFYFGLSLETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVI 659

Query:   654 YSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGE 713
             YSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKLHIR++PL E
Sbjct:   660 YSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYE 719

Query:   714 HPRRICHQEQSRTFAICS 731
              PR+IC+QE S+ F + S
Sbjct:   720 SPRKICYQEVSQCFGVLS 737

 Score = 940 (336.0 bits), Expect = 3.8e-311, Sum P(2) = 3.8e-311
 Identities = 194/377 (51%), Positives = 255/377 (67%)

Query:   724 SRTFAICSLKNQ-SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV 782
             S+ F+  +  ++ S  EE E+H + ++D  TFE +  +     EY  S++SC    D N 
Sbjct:   768 SKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNT 827

Query:   783 YYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 842
             Y+ VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  
Sbjct:   828 YFIVGTAMVYPEEAEPKQGRIVVFHYSDGKLQSLAEKEVKGAVYSMVEFNGKLLASINST 887

Query:   843 IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 902
             ++LY+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  E
Sbjct:   888 VRLYEWTAE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFE 943

Query:   903 ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 962
             E ARD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG  HLGEF
Sbjct:   944 EIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLSHLGEF 1003

Query:   963 VNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK 1021
             VN F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK
Sbjct:  1004 VNVFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIK 1063

Query:  1022 GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV--------- 1072
              VG + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   + +         
Sbjct:  1064 SVGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKRE 1123

Query:  1073 -SVEELCKRVEELTRLH 1088
              +V++L K VEELTR+H
Sbjct:  1124 ATVDDLIKIVEELTRIH 1140


>ZFIN|ZDB-GENE-040426-1272 [details] [associations]
            symbol:ddb1 "damage specific DNA binding protein
            1" species:7955 "Danio rerio" [GO:0005634 "nucleus" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 ZFIN:ZDB-GENE-040426-1272
            GO:GO:0005634 Gene3D:2.130.10.10 GO:GO:0003676 EMBL:JQ692623
            UniGene:Dr.77970 Uniprot:I1XUS8
        Length = 1140

 Score = 2057 (729.2 bits), Expect = 6.1e-311, Sum P(2) = 6.1e-311
 Identities = 402/739 (54%), Positives = 539/739 (72%)

Query:     4 WNYVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYGRIA 63
             +NYVVTA KPT V     G+FTS ++LNL+IAK TR+EI+ +T +GL+P+ +V +YG+IA
Sbjct:     3 YNYVVTAQKPTAVNACITGHFTSAEDLNLLIAKNTRLEIYAVTAEGLRPVKEVGMYGKIA 62

Query:    64 TLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIGRPTDNGQIG 121
              +ELFRP GE++D LFI T +Y  C+L++    +S ++ITRA G+V DRIGRP++ G IG
Sbjct:    63 VMELFRPKGESKDLLFILTAKYNACILEYKQSGDSIDIITRAHGNVQDRIGRPSETGIIG 122

Query:   122 IIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLY 181
             I+DP+CR+IGL LYDGLFKVIP D + +  +AFNIRLEELQV+D++FLYGC  PT+  +Y
Sbjct:   123 IVDPECRMIGLRLYDGLFKVIPLDRENRELKAFNIRLEELQVIDVQFLYGCQAPTVCFIY 182

Query:   182 QDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS 241
             QD +  RHVKTYEV+L++K+F +GPW Q N++  A ++IPVP P  G +IIG+E+I Y +
Sbjct:   183 QDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIPVPEPFGGAIIIGQESITYHN 241

Query:   242 ANAFKAI--PI-RPSITKAYGRVDADGSRYXXXXXXXXXXXXVITHEKEKVTG------L 292
              + + A+  PI + S    + RVD +GSRY            ++  E E + G      L
Sbjct:   242 GDKYLAVAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKE-ELMDGAVVLKDL 300

Query:   293 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 352
              +ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV V+E + NLGPIV
Sbjct:   301 HVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNDQGSYVGVMETFTNLGPIV 360

Query:   353 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 412
             D CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W LRS +   
Sbjct:   361 DMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWPLRSESSRD 420

Query:   413 FDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 472
              D  LV+SF+ +TR+L ++           GF    QT FC +  + QL+Q+TS SVRLV
Sbjct:   421 TDDMLVLSFVGQTRVLMLS-GEEVEETELQGFVDNQQTFFCGNVAHQQLIQITSVSVRLV 479

Query:   473 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 532
             +  S+ L +EWK P G +++VA+ N +QV+LA G   L YL+I  G L ++   ++E+E+
Sbjct:   480 TQDSKALVSEWKEPQGRNISVASCNNTQVVLAVGRV-LYYLQILSGELKQISSTEMEHEV 538

Query:   533 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 592
             +CLDI P+GE  + S I AVG+WTDIS R+  LP    + KE LGGEIIPRS+L+  FEG
Sbjct:   539 ACLDITPLGERTADSCICAVGLWTDISARLLKLPCFTPLHKEMLGGEIIPRSILMTTFEG 598

Query:   593 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 652
               YLLCALGDG L  F L+++TG L++RKKV+LGTQP  LRTF S +T++VFA SDRPTV
Sbjct:   599 SHYLLCALGDGALFYFGLDIQTGVLSERKKVTLGTQPTVLRTFRSLSTSNVFACSDRPTV 658

Query:   653 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 712
             IYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKLHIR++PL 
Sbjct:   659 IYSSNHKLVFSNVNLKEVNYMCPLNSEGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLY 718

Query:   713 EHPRRICHQEQSRTFAICS 731
             E P+RIC+QE S+ F + S
Sbjct:   719 ESPKRICYQEVSQCFGVLS 737

 Score = 950 (339.5 bits), Expect = 6.1e-311, Sum P(2) = 6.1e-311
 Identities = 194/364 (53%), Positives = 250/364 (68%)

Query:   736 SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEE 795
             S  EE E+H + ++D  TFE +  +     EY  S++SC    D  VY+ VGTA V PEE
Sbjct:   781 SFGEEVEVHSLLVVDQHTFEVLHAHQFLQNEYALSMVSCKLGRDPAVYFIVGTAMVYPEE 840

Query:   796 NEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGT 855
              EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++LY+W       
Sbjct:   841 AEPKQGRIIVFHYTDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTAE---- 896

Query:   856 RELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSA 915
             +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG+ EE ARD+N NWMSA
Sbjct:   897 KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGSFEEIARDFNPNWMSA 956

Query:   916 VEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR- 974
             VEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN F HGSLV++ 
Sbjct:   957 VEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFSHGSLVLQN 1016

Query:   975 LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 1034
             L +S      +V+FGTVNG+IG++ SL    Y  L  LQ  L KVIK VG + H  WRSF
Sbjct:  1017 LGESSTPTQGSVLFGTVNGMIGLVTSLSEGWYSLLLDLQNRLNKVIKSVGKIEHSFWRSF 1076

Query:  1035 NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SVEELCKRVEEL 1084
             + E+KT  A  F+DGDLIESFLDL R +M E+  T+ +          +V+E+ K VEEL
Sbjct:  1077 HTERKTEQATGFIDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKREATVDEVIKIVEEL 1136

Query:  1085 TRLH 1088
             TR+H
Sbjct:  1137 TRIH 1140


>RGD|621889 [details] [associations]
            symbol:Ddb1 "damage-specific DNA binding protein 1, 127kDa"
            species:10116 "Rattus norvegicus" [GO:0000075 "cell cycle
            checkpoint" evidence=IEA;ISO] [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0003684 "damaged DNA binding" evidence=IMP]
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005634 "nucleus"
            evidence=IEA;ISO;ISS] [GO:0005737 "cytoplasm" evidence=IEA;ISO;ISS]
            [GO:0006281 "DNA repair" evidence=TAS] [GO:0016055 "Wnt receptor
            signaling pathway" evidence=IEA;ISO] [GO:0016567 "protein
            ubiquitination" evidence=IEA] [GO:0031464 "Cul4A-RING ubiquitin
            ligase complex" evidence=IEA;ISO;ISS] [GO:0031465 "Cul4B-RING
            ubiquitin ligase complex" evidence=IEA;ISO;ISS] [GO:0042787
            "protein ubiquitination involved in ubiquitin-dependent protein
            catabolic process" evidence=IEA;ISO] [GO:0043161 "proteasomal
            ubiquitin-dependent protein catabolic process"
            evidence=IEA;ISO;ISS] [GO:0080008 "Cul4-RING ubiquitin ligase
            complex" evidence=ISO;ISS] InterPro:IPR004871 InterPro:IPR015943
            Pfam:PF03178 UniPathway:UPA00143 RGD:621889 GO:GO:0005634
            GO:GO:0005737 GO:GO:0043161 GO:GO:0016055 Gene3D:2.130.10.10
            GO:GO:0003684 GO:GO:0006281 GO:GO:0042787 GO:GO:0000075
            GO:GO:0031464 GO:GO:0031465 eggNOG:NOG247734 HOGENOM:HOG000007241
            HOVERGEN:HBG005460 HSSP:Q16531 EMBL:AJ277077 IPI:IPI00324451
            UniGene:Rn.8402 IntAct:Q9ESW0 MINT:MINT-4784948 STRING:Q9ESW0
            PhosphoSite:Q9ESW0 PRIDE:Q9ESW0 UCSC:RGD:621889 InParanoid:Q9ESW0
            ArrayExpress:Q9ESW0 Genevestigator:Q9ESW0 Uniprot:Q9ESW0
        Length = 1140

 Score = 2056 (728.8 bits), Expect = 2.1e-308, Sum P(2) = 2.1e-308
 Identities = 399/740 (53%), Positives = 537/740 (72%)

Query:     4 WNYVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYGRIA 63
             +NYVVTA KPT V     G+FTS +++NL+IAK TR+EI+++T +GL+P+ +V +YG+IA
Sbjct:     3 YNYVVTAQKPTAVNGCVTGHFTSAEDINLLIAKNTRLEIYVVTAEGLRPVKEVGMYGKIA 62

Query:    64 TLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIGRPTDNGQIG 121
              +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIGRP++ G IG
Sbjct:    63 VMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIGRPSETGIIG 122

Query:   122 IIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLY 181
             IIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC  PTI  +Y
Sbjct:   123 IIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGCQAPTICFVY 182

Query:   182 QDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS 241
             QD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +IIG+E+I Y +
Sbjct:   183 QDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHN 241

Query:   242 ANAFKAI--PI-RPSITKAYGRVDADGSRYXXXXXXXXXXXXVITHEKEK---VT--GLK 293
              + + AI  PI + S    + RVD +GSRY            ++  E++    VT   L+
Sbjct:   242 GDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMDGTVTLKDLR 301

Query:   294 IELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVD 353
             +ELLGETSIA  ++YLDN VV++GS  GDSQ +KLN+  + +GSYV  +E + NLGPIVD
Sbjct:   302 VELLGETSIAECLTYLDNGVVFVGSRLGDSQPVKLNVDSNEQGSYVVAMETFTNLGPIVD 361

Query:   354 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPF 413
              CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W LRS  +   
Sbjct:   362 MCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWPLRSDPNRET 421

Query:   414 DTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 473
             D  LV+SF+ +TR+L +N           GF    QT FC +  + QL+Q+TS SVRLVS
Sbjct:   422 DDTLVLSFVGQTRVLMLN-GEEVEETELMGFVDDQQTFFCGNVAHQQLIQITSASVRLVS 480

Query:   474 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEIS 533
                + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ H ++E+E++
Sbjct:   481 QEPKALVSEWKEPRAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQISHTEMEHEVA 539

Query:   534 CLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGI 593
             CLD+ P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS+L+  FE  
Sbjct:   540 CLDVTPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRSILMTTFESS 599

Query:   594 SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVI 653
              YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VFA SDRPTVI
Sbjct:   600 HYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVI 659

Query:   654 YSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGE 713
             YSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGT+++IQKLHIR++P+ E
Sbjct:   660 YSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANTSTLTIGTMNEIQKLHIRTVPIYE 719

Query:   714 HPRRICHQEQSRTFAICSLK 733
              PR+IC+QE S+ F + S +
Sbjct:   720 SPRKICYQEVSQCFGVLSTR 739

 Score = 927 (331.4 bits), Expect = 2.1e-308, Sum P(2) = 2.1e-308
 Identities = 191/377 (50%), Positives = 252/377 (66%)

Query:   724 SRTFAICSLKNQ-SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV 782
             S+ F+  +  ++ S  EE E+H + ++D  TFE +  +     EY  S++SC    D N 
Sbjct:   768 SKLFSSSAAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNT 827

Query:   783 YYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 842
             Y+ VGTA V PEE EP +GRI+VF    GKLQ +AEKE KGAVYS+  FNGKLLA+IN  
Sbjct:   828 YFIVGTAMVYPEEAEPKQGRIVVFQYSGGKLQTVAEKEVKGAVYSMVEFNGKLLASINST 887

Query:   843 IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 902
             ++LY+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  E
Sbjct:   888 VRLYEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFE 943

Query:   903 ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 962
             E ARD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEF
Sbjct:   944 EIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEF 1003

Query:   963 VNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK 1021
             VN F HGSLVM+ L ++      +V+ GTVNG+IG++ SL    Y  L  +Q  L KVIK
Sbjct:  1004 VNVFCHGSLVMQNLGETSTPTQGSVLLGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIK 1063

Query:  1022 GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN---------- 1071
              VG + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +           
Sbjct:  1064 SVGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKRE 1123

Query:  1072 VSVEELCKRVEELTRLH 1088
              + ++L K VEELTR+H
Sbjct:  1124 ATADDLIKVVEELTRIH 1140


>UNIPROTKB|F1P4I8 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9031
            "Gallus gallus" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 GO:GO:0005634 Gene3D:2.130.10.10
            GO:GO:0003676 GeneTree:ENSGT00530000063396 EMBL:AADN02017118
            EMBL:AADN02017119 IPI:IPI00818299 Ensembl:ENSGALT00000008352
            ArrayExpress:F1P4I8 Uniprot:F1P4I8
        Length = 1120

 Score = 2022 (716.8 bits), Expect = 3.5e-306, Sum P(2) = 3.5e-306
 Identities = 392/720 (54%), Positives = 527/720 (73%)

Query:    22 GNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYGRIATLELFRPHGEAQDFLFIA 81
             G+FTS ++LNL+IAK TR+EI+++T +GL+P+ +V +YG+ A +ELFRP GE++D LFI 
Sbjct:     1 GHFTSAEDLNLLIAKNTRLEIYVVTAEGLRPVKEVGMYGKTAVMELFRPKGESKDLLFIL 60

Query:    82 TERYKFCVLQW--DAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLF 139
             T +Y  C+L++  + ++ ++ITRA G+V DRIGRP++ G IGIIDP+CR+IGL LYDGLF
Sbjct:    61 TAKYNACILEYKQNGDNIDIITRAHGNVQDRIGRPSETGIIGIIDPECRMIGLRLYDGLF 120

Query:   140 KVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKD 199
             KVIP D + +  +AFNIRLEELQV+D+KFLYGC  PTI  +YQD +  RHVKTYEV+L++
Sbjct:   121 KVIPLDRENKELKAFNIRLEELQVIDVKFLYGCQAPTICFVYQDPQ-GRHVKTYEVSLRE 179

Query:   200 KDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAI--PI-RPSITK 256
             K+F +GPW Q N++  A ++I VP P  G +IIG+E+I Y + + + AI  PI + S   
Sbjct:   180 KEFNKGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIV 239

Query:   257 AYGRVDADGSRYXXXXXXXXXXXXVITHEKEK---VT--GLKIELLGETSIASTISYLDN 311
              + RVD +GSRY            ++  E++    VT   L++ELLGETSIA  ++YLDN
Sbjct:   240 CHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDN 299

Query:   312 AVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCS 371
              VV++GS  GDSQL+KLN+  + +GSYV  +E + NLGPIVD CVVDLERQGQGQ+VTCS
Sbjct:   300 GVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPIVDMCVVDLERQGQGQLVTCS 359

Query:   372 GAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMN 431
             GA+K+GSLRI+RNGIGI+E AS++L GIKG+W LRS +    D  LV+SF+ +TR+L +N
Sbjct:   360 GAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWPLRSDSHREMDNMLVLSFVGQTRVLMLN 419

Query:   432 LXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSV 491
                        GF    QT FC +  + QL+Q+TS SVRLVS   + L +EWK P G ++
Sbjct:   420 -GEEVEETELTGFVDDQQTFFCGNVAHQQLIQITSASVRLVSQEPKALVSEWKEPNGKNI 478

Query:   492 NVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAA 551
             +VA+ N++QV++A G   L YLEI    L ++   ++E+E++CLDI P+G+    S + A
Sbjct:   479 SVASCNSNQVVVAVGRA-LYYLEIRPQELRQINCTEMEHEVACLDITPLGDTNGMSPLCA 537

Query:   552 VGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLN 611
             +G+WTDIS RI  LP   L+ KE LGGEIIPRS+L+  FE   YLLCALGDG L  F L+
Sbjct:   538 IGLWTDISARILKLPSFELLHKEMLGGEIIPRSILMTTFESSHYLLCALGDGALFYFGLS 597

Query:   612 MKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVS 671
             ++TG L+DRKKV+LGTQP  LRTF S +TT+VFA SDRPTVIYSSN KL++SNVNLKEV+
Sbjct:   598 LETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVN 657

Query:   672 HMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICS 731
             +MCP NS  +PDSLA+A    LTIGTID+IQKLHIR++PL E PR+IC+QE S+ F + S
Sbjct:   658 YMCPLNSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYESPRKICYQEVSQCFGVLS 717

 Score = 940 (336.0 bits), Expect = 3.5e-306, Sum P(2) = 3.5e-306
 Identities = 194/377 (51%), Positives = 254/377 (67%)

Query:   724 SRTFAICSLKNQ-SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV 782
             S+ F+  +  ++ S  EE E+H + ++D  TFE +  +     EY  S++SC    D N 
Sbjct:   748 SKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNT 807

Query:   783 YYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 842
             Y+ VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  
Sbjct:   808 YFIVGTAMVYPEEAEPKQGRIVVFHYSDGKLQSLAEKEVKGAVYSMVEFNGKLLASINST 867

Query:   843 IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 902
             ++LY+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  E
Sbjct:   868 VRLYEWTAE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFE 923

Query:   903 ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 962
             E ARD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG  HLGEF
Sbjct:   924 EIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLSHLGEF 983

Query:   963 VNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK 1021
             VN F HGSLVM+ L +       +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK
Sbjct:   984 VNVFCHGSLVMQNLGEKSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIK 1043

Query:  1022 GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV--------- 1072
              VG + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   + +         
Sbjct:  1044 SVGKIEHATWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKRE 1103

Query:  1073 -SVEELCKRVEELTRLH 1088
              +V++L K VEELTR+H
Sbjct:  1104 ATVDDLIKIVEELTRIH 1120


>UNIPROTKB|F1NVV2 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9031
            "Gallus gallus" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0000075 "cell cycle
            checkpoint" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA]
            [GO:0016055 "Wnt receptor signaling pathway" evidence=IEA]
            [GO:0031464 "Cul4A-RING ubiquitin ligase complex" evidence=IEA]
            [GO:0031465 "Cul4B-RING ubiquitin ligase complex" evidence=IEA]
            [GO:0042787 "protein ubiquitination involved in ubiquitin-dependent
            protein catabolic process" evidence=IEA] [GO:0043161 "proteasomal
            ubiquitin-dependent protein catabolic process" evidence=IEA]
            InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178 GO:GO:0005634
            GO:GO:0005737 GO:GO:0043161 GO:GO:0016055 Gene3D:2.130.10.10
            GO:GO:0003676 GO:GO:0042787 GO:GO:0000075 GO:GO:0031464
            GO:GO:0031465 OMA:CALGDGS GeneTree:ENSGT00530000063396
            IPI:IPI00597295 EMBL:AADN02017118 EMBL:AADN02017119
            Ensembl:ENSGALT00000040605 ArrayExpress:F1NVV2 Uniprot:F1NVV2
        Length = 1123

 Score = 2022 (716.8 bits), Expect = 6.6e-305, Sum P(2) = 6.6e-305
 Identities = 392/720 (54%), Positives = 527/720 (73%)

Query:    22 GNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYGRIATLELFRPHGEAQDFLFIA 81
             G+FTS ++LNL+IAK TR+EI+++T +GL+P+ +V +YG+ A +ELFRP GE++D LFI 
Sbjct:     1 GHFTSAEDLNLLIAKNTRLEIYVVTAEGLRPVKEVGMYGKTAVMELFRPKGESKDLLFIL 60

Query:    82 TERYKFCVLQW--DAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLF 139
             T +Y  C+L++  + ++ ++ITRA G+V DRIGRP++ G IGIIDP+CR+IGL LYDGLF
Sbjct:    61 TAKYNACILEYKQNGDNIDIITRAHGNVQDRIGRPSETGIIGIIDPECRMIGLRLYDGLF 120

Query:   140 KVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKD 199
             KVIP D + +  +AFNIRLEELQV+D+KFLYGC  PTI  +YQD +  RHVKTYEV+L++
Sbjct:   121 KVIPLDRENKELKAFNIRLEELQVIDVKFLYGCQAPTICFVYQDPQ-GRHVKTYEVSLRE 179

Query:   200 KDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAI--PI-RPSITK 256
             K+F +GPW Q N++  A ++I VP P  G +IIG+E+I Y + + + AI  PI + S   
Sbjct:   180 KEFNKGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIV 239

Query:   257 AYGRVDADGSRYXXXXXXXXXXXXVITHEKEK---VT--GLKIELLGETSIASTISYLDN 311
              + RVD +GSRY            ++  E++    VT   L++ELLGETSIA  ++YLDN
Sbjct:   240 CHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDN 299

Query:   312 AVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCS 371
              VV++GS  GDSQL+KLN+  + +GSYV  +E + NLGPIVD CVVDLERQGQGQ+VTCS
Sbjct:   300 GVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPIVDMCVVDLERQGQGQLVTCS 359

Query:   372 GAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMN 431
             GA+K+GSLRI+RNGIGI+E AS++L GIKG+W LRS +    D  LV+SF+ +TR+L +N
Sbjct:   360 GAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWPLRSDSHREMDNMLVLSFVGQTRVLMLN 419

Query:   432 LXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSV 491
                        GF    QT FC +  + QL+Q+TS SVRLVS   + L +EWK P G ++
Sbjct:   420 -GEEVEETELTGFVDDQQTFFCGNVAHQQLIQITSASVRLVSQEPKALVSEWKEPNGKNI 478

Query:   492 NVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAA 551
             +VA+ N++QV++A G   L YLEI    L ++   ++E+E++CLDI P+G+    S + A
Sbjct:   479 SVASCNSNQVVVAVGRA-LYYLEIRPQELRQINCTEMEHEVACLDITPLGDTNGMSPLCA 537

Query:   552 VGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLN 611
             +G+WTDIS RI  LP   L+ KE LGGEIIPRS+L+  FE   YLLCALGDG L  F L+
Sbjct:   538 IGLWTDISARILKLPSFELLHKEMLGGEIIPRSILMTTFESSHYLLCALGDGALFYFGLS 597

Query:   612 MKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVS 671
             ++TG L+DRKKV+LGTQP  LRTF S +TT+VFA SDRPTVIYSSN KL++SNVNLKEV+
Sbjct:   598 LETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVN 657

Query:   672 HMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICS 731
             +MCP NS  +PDSLA+A    LTIGTID+IQKLHIR++PL E PR+IC+QE S+ F + S
Sbjct:   658 YMCPLNSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYESPRKICYQEVSQCFGVLS 717

 Score = 928 (331.7 bits), Expect = 6.6e-305, Sum P(2) = 6.6e-305
 Identities = 194/380 (51%), Positives = 254/380 (66%)

Query:   724 SRTFAICSLKNQ-SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV 782
             S+ F+  +  ++ S  EE E+H + ++D  TFE +  +     EY  S++SC    D N 
Sbjct:   748 SKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNT 807

Query:   783 YYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 842
             Y+ VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  
Sbjct:   808 YFIVGTAMVYPEEAEPKQGRIVVFHYSDGKLQSLAEKEVKGAVYSMVEFNGKLLASINST 867

Query:   843 IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 902
             ++LY+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  E
Sbjct:   868 VRLYEWTAE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFE 923

Query:   903 ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 962
             E ARD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG  HLGEF
Sbjct:   924 EIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLSHLGEF 983

Query:   963 VNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK 1021
             VN F HGSLVM+ L +       +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK
Sbjct:   984 VNVFCHGSLVMQNLGEKSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIK 1043

Query:  1022 GVGGLNHE---QWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV------ 1072
              VG + H     WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   + +      
Sbjct:  1044 SVGKIEHSLYATWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGM 1103

Query:  1073 ----SVEELCKRVEELTRLH 1088
                 +V++L K VEELTR+H
Sbjct:  1104 KREATVDDLIKIVEELTRIH 1123


>UNIPROTKB|F1NVV3 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9031
            "Gallus gallus" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 GO:GO:0005634 Gene3D:2.130.10.10
            GO:GO:0003676 GeneTree:ENSGT00530000063396 EMBL:AADN02017118
            EMBL:AADN02017119 IPI:IPI00821712 Ensembl:ENSGALT00000040604
            ArrayExpress:F1NVV3 Uniprot:F1NVV3
        Length = 1119

 Score = 2022 (716.8 bits), Expect = 9.6e-304, Sum P(2) = 9.6e-304
 Identities = 392/720 (54%), Positives = 527/720 (73%)

Query:    22 GNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYGRIATLELFRPHGEAQDFLFIA 81
             G+FTS ++LNL+IAK TR+EI+++T +GL+P+ +V +YG+ A +ELFRP GE++D LFI 
Sbjct:     1 GHFTSAEDLNLLIAKNTRLEIYVVTAEGLRPVKEVGMYGKTAVMELFRPKGESKDLLFIL 60

Query:    82 TERYKFCVLQW--DAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLF 139
             T +Y  C+L++  + ++ ++ITRA G+V DRIGRP++ G IGIIDP+CR+IGL LYDGLF
Sbjct:    61 TAKYNACILEYKQNGDNIDIITRAHGNVQDRIGRPSETGIIGIIDPECRMIGLRLYDGLF 120

Query:   140 KVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKD 199
             KVIP D + +  +AFNIRLEELQV+D+KFLYGC  PTI  +YQD +  RHVKTYEV+L++
Sbjct:   121 KVIPLDRENKELKAFNIRLEELQVIDVKFLYGCQAPTICFVYQDPQ-GRHVKTYEVSLRE 179

Query:   200 KDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAI--PI-RPSITK 256
             K+F +GPW Q N++  A ++I VP P  G +IIG+E+I Y + + + AI  PI + S   
Sbjct:   180 KEFNKGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIV 239

Query:   257 AYGRVDADGSRYXXXXXXXXXXXXVITHEKEK---VT--GLKIELLGETSIASTISYLDN 311
              + RVD +GSRY            ++  E++    VT   L++ELLGETSIA  ++YLDN
Sbjct:   240 CHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDN 299

Query:   312 AVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCS 371
              VV++GS  GDSQL+KLN+  + +GSYV  +E + NLGPIVD CVVDLERQGQGQ+VTCS
Sbjct:   300 GVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPIVDMCVVDLERQGQGQLVTCS 359

Query:   372 GAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMN 431
             GA+K+GSLRI+RNGIGI+E AS++L GIKG+W LRS +    D  LV+SF+ +TR+L +N
Sbjct:   360 GAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWPLRSDSHREMDNMLVLSFVGQTRVLMLN 419

Query:   432 LXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSV 491
                        GF    QT FC +  + QL+Q+TS SVRLVS   + L +EWK P G ++
Sbjct:   420 -GEEVEETELTGFVDDQQTFFCGNVAHQQLIQITSASVRLVSQEPKALVSEWKEPNGKNI 478

Query:   492 NVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAA 551
             +VA+ N++QV++A G   L YLEI    L ++   ++E+E++CLDI P+G+    S + A
Sbjct:   479 SVASCNSNQVVVAVGRA-LYYLEIRPQELRQINCTEMEHEVACLDITPLGDTNGMSPLCA 537

Query:   552 VGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLN 611
             +G+WTDIS RI  LP   L+ KE LGGEIIPRS+L+  FE   YLLCALGDG L  F L+
Sbjct:   538 IGLWTDISARILKLPSFELLHKEMLGGEIIPRSILMTTFESSHYLLCALGDGALFYFGLS 597

Query:   612 MKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVS 671
             ++TG L+DRKKV+LGTQP  LRTF S +TT+VFA SDRPTVIYSSN KL++SNVNLKEV+
Sbjct:   598 LETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVN 657

Query:   672 HMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICS 731
             +MCP NS  +PDSLA+A    LTIGTID+IQKLHIR++PL E PR+IC+QE S+ F + S
Sbjct:   658 YMCPLNSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYESPRKICYQEVSQCFGVLS 717

 Score = 917 (327.9 bits), Expect = 9.6e-304, Sum P(2) = 9.6e-304
 Identities = 192/377 (50%), Positives = 253/377 (67%)

Query:   724 SRTFAICSLKNQ-SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV 782
             S+ F+  +  ++ S  EE E+H + ++D  TFE +  +     EY  S++SC    D N 
Sbjct:   748 SKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNT 807

Query:   783 YYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 842
             Y+ VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  
Sbjct:   808 YFIVGTAMVYPEEAEPKQGRIVVFHYSDGKLQSLAEKEVKGAVYSMVEFNGKLLASINST 867

Query:   843 IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 902
             ++LY+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  E
Sbjct:   868 VRLYEWTAE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFE 923

Query:   903 ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 962
             E ARD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG  HLGEF
Sbjct:   924 EIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLSHLGEF 983

Query:   963 VNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK 1021
             VN F HGSLVM+ L +       +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK
Sbjct:   984 VNVFCHGSLVMQNLGEKSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIK 1043

Query:  1022 GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV--------- 1072
              VG + H  + SF+ E+KT  A  F+DGDLIESFLD+SR +M E+   + +         
Sbjct:  1044 SVGKIEHSLY-SFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKRE 1102

Query:  1073 -SVEELCKRVEELTRLH 1088
              +V++L K VEELTR+H
Sbjct:  1103 ATVDDLIKIVEELTRIH 1119


>UNIPROTKB|J9NVR7 [details] [associations]
            symbol:DDB1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0005634 "nucleus" evidence=IEA] [GO:0003676
            "nucleic acid binding" evidence=IEA] InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 GO:GO:0005634 Gene3D:2.130.10.10
            GO:GO:0003676 GeneTree:ENSGT00530000063396 EMBL:AAEX03011677
            Ensembl:ENSCAFT00000049486 Uniprot:J9NVR7
        Length = 1084

 Score = 2083 (738.3 bits), Expect = 8.1e-298, Sum P(2) = 8.1e-298
 Identities = 406/738 (55%), Positives = 538/738 (72%)

Query:     4 WNYVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYGRIA 63
             +NYVVTA KPT V     G+FTS ++LNL+IAK TR+EI+++T +GL+P+ +V +YG+IA
Sbjct:     3 YNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVVTAEGLRPVKEVGMYGKIA 62

Query:    64 TLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIGRPTDNGQIG 121
              +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIGRP++ G IG
Sbjct:    63 VMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIGRPSETGIIG 122

Query:   122 IIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLY 181
             IIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC  PTI  +Y
Sbjct:   123 IIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGCQAPTICFVY 182

Query:   182 QDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS 241
             QD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +IIG+E+I Y +
Sbjct:   183 QDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHN 241

Query:   242 ANAFKAI--PI-RPSITKAYGRVDADGSRYXXXXXXXXXXXXVITHEKEK---VT--GLK 293
              + + AI  PI + S    + RVD +GSRY            ++  E++    VT   L+
Sbjct:   242 GDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMDGTVTLKDLR 301

Query:   294 IELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVD 353
             +ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E + NLGPIVD
Sbjct:   302 VELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPIVD 361

Query:   354 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPF 413
              CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W LRS  +   
Sbjct:   362 MCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWPLRSDPNRET 421

Query:   414 DTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 473
             D  LV+SF+ +TR+L +N           GF    QT FC +  + QL+Q+TS SVRLVS
Sbjct:   422 DDTLVLSFVGQTRVLMLN-GEEVEETELMGFVDDQQTFFCGNVAHQQLIQITSASVRLVS 480

Query:   474 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEIS 533
                + L +EWK P G +++VA+ N+SQV++A G   L YL+I    L ++ H ++E+E++
Sbjct:   481 QEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQISHTEMEHEVA 539

Query:   534 CLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGI 593
             CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS+L+  FE  
Sbjct:   540 CLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRSILMTTFESS 599

Query:   594 SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVI 653
              YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VFA SDRPTVI
Sbjct:   600 HYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVI 659

Query:   654 YSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGE 713
             YSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKLHIR++PL E
Sbjct:   660 YSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYE 719

Query:   714 HPRRICHQEQSRTFAICS 731
              PR+IC+QE S+ F + S
Sbjct:   720 SPRKICYQEVSQCFGVLS 737

 Score = 800 (286.7 bits), Expect = 8.1e-298, Sum P(2) = 8.1e-298
 Identities = 164/321 (51%), Positives = 216/321 (67%)

Query:   724 SRTFAICSLKNQ-SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV 782
             S+ F+  +  ++ S  EE E+H + ++D  TFE +  +     EY  S++SC    D N 
Sbjct:   768 SKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNT 827

Query:   783 YYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 842
             Y+ VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  
Sbjct:   828 YFIVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINST 887

Query:   843 IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 902
             ++LY+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  E
Sbjct:   888 VRLYEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFE 943

Query:   903 ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 962
             E ARD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEF
Sbjct:   944 EIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEF 1003

Query:   963 VNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK 1021
             VN F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK
Sbjct:  1004 VNVFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIK 1063

Query:  1022 GVGGLNHEQWRSFNNEKKTVD 1042
              VG + H  + S  +    VD
Sbjct:  1064 SVGKIEHSLYPSQRHVPAQVD 1084


>UNIPROTKB|F5GY55 [details] [associations]
            symbol:DDB1 "Uncharacterized protein" species:9606 "Homo
            sapiens" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 GO:GO:0005634 Gene3D:2.130.10.10
            GO:GO:0003676 EMBL:AP003108 HGNC:HGNC:2717 ChiTaRS:DDB1
            EMBL:AP003037 IPI:IPI00977083 SMR:F5GY55 Ensembl:ENST00000540166
            Uniprot:F5GY55
        Length = 1092

 Score = 2077 (736.2 bits), Expect = 1.2e-296, Sum P(2) = 1.2e-296
 Identities = 405/738 (54%), Positives = 537/738 (72%)

Query:     4 WNYVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYGRIA 63
             +NYVVTA KPT V     G+FTS ++LNL+IAK TR+EI+++T +GL+P+ +V +YG+IA
Sbjct:     3 YNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVVTAEGLRPVKEVGMYGKIA 62

Query:    64 TLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIGRPTDNGQIG 121
              +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIGRP++ G IG
Sbjct:    63 VMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIGRPSETGIIG 122

Query:   122 IIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLY 181
             IIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC  PTI  +Y
Sbjct:   123 IIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGCQAPTICFVY 182

Query:   182 QDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS 241
             QD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +IIG+E+I Y +
Sbjct:   183 QDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHN 241

Query:   242 ANAFKAI--PI-RPSITKAYGRVDADGSRYXXXXXXXXXXXXVITHEKEK---VT--GLK 293
              + + AI  PI + S    + RVD +GSRY            ++  E++    VT   L+
Sbjct:   242 GDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMDGTVTLKDLR 301

Query:   294 IELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVD 353
             +ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E + NLGPIVD
Sbjct:   302 VELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPIVD 361

Query:   354 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPF 413
              CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W LRS  +   
Sbjct:   362 MCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWPLRSDPNRET 421

Query:   414 DTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 473
             D  LV+SF+ +TR+L +N           GF    QT FC +  + QL+Q+TS SVRLVS
Sbjct:   422 DDTLVLSFVGQTRVLMLN-GEEVEETELMGFVDDQQTFFCGNVAHQQLIQITSASVRLVS 480

Query:   474 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEIS 533
                + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ H ++E+E++
Sbjct:   481 QEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQISHTEMEHEVA 539

Query:   534 CLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGI 593
             CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS+L+  FE  
Sbjct:   540 CLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRSILMTTFESS 599

Query:   594 SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVI 653
              YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VFA SDRPTVI
Sbjct:   600 HYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVI 659

Query:   654 YSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGE 713
             YSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKLHIR++PL E
Sbjct:   660 YSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYE 719

Query:   714 HPRRICHQEQSRTFAICS 731
              PR+IC+QE S+ F + S
Sbjct:   720 SPRKICYQEVSQCFGVLS 737

 Score = 795 (284.9 bits), Expect = 1.2e-296, Sum P(2) = 1.2e-296
 Identities = 161/307 (52%), Positives = 211/307 (68%)

Query:   724 SRTFAICSLKNQ-SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV 782
             S+ F+  +  ++ S  EE E+H + ++D  TFE +  +     EY  S++SC    D N 
Sbjct:   768 SKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNT 827

Query:   783 YYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 842
             Y+ VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  
Sbjct:   828 YFIVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINST 887

Query:   843 IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 902
             ++LY+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  E
Sbjct:   888 VRLYEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFE 943

Query:   903 ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 962
             E ARD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEF
Sbjct:   944 EIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEF 1003

Query:   963 VNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK 1021
             VN F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK
Sbjct:  1004 VNVFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIK 1063

Query:  1022 GVGGLNH 1028
              VG + H
Sbjct:  1064 SVGKIEH 1070


>FB|FBgn0260962 [details] [associations]
            symbol:pic "piccolo" species:7227 "Drosophila melanogaster"
            [GO:0003684 "damaged DNA binding" evidence=ISS] [GO:0006289
            "nucleotide-excision repair" evidence=ISS;NAS] [GO:0005634
            "nucleus" evidence=IEA] [GO:0006974 "response to DNA damage
            stimulus" evidence=IMP] [GO:0035220 "wing disc development"
            evidence=IMP] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0042787 "protein ubiquitination involved in ubiquitin-dependent
            protein catabolic process" evidence=ISS] [GO:0007307 "eggshell
            chorion gene amplification" evidence=IDA] [GO:0007095 "mitotic G2
            DNA damage checkpoint" evidence=IGI] InterPro:IPR004871
            Pfam:PF03178 UniPathway:UPA00143 EMBL:AE014297 GO:GO:0005634
            GO:GO:0005737 GO:GO:0007095 GO:GO:0043161 GO:GO:0003677
            GO:GO:0006281 GO:GO:0035220 GO:GO:0042787 GO:GO:0007307
            eggNOG:NOG247734 KO:K10610 OMA:CALGDGS GeneTree:ENSGT00530000063396
            HSSP:Q16531 EMBL:AF132145 RefSeq:NP_650257.1 UniGene:Dm.3215
            ProteinModelPortal:Q9XYZ5 SMR:Q9XYZ5 STRING:Q9XYZ5 PaxDb:Q9XYZ5
            PRIDE:Q9XYZ5 EnsemblMetazoa:FBtr0082709 GeneID:41611
            KEGG:dme:Dmel_CG7769 UCSC:CG7769-RA CTD:41611 FlyBase:FBgn0260962
            InParanoid:Q9XYZ5 OrthoDB:EOG4S1RP0 PhylomeDB:Q9XYZ5
            GenomeRNAi:41611 NextBio:824642 Bgee:Q9XYZ5 Uniprot:Q9XYZ5
        Length = 1140

 Score = 1856 (658.4 bits), Expect = 2.1e-268, Sum P(2) = 2.1e-268
 Identities = 368/739 (49%), Positives = 506/739 (68%)

Query:     5 NYVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYGRIAT 64
             +YVVTA KPT V     GNFTSP +LNLIIA+  ++EI L+TP+GL+P+ ++ I G IA 
Sbjct:     4 HYVVTAQKPTAVVACLTGNFTSPTDLNLIIARNNQVEIDLVTPEGLRPLKEININGTIAV 63

Query:    65 LELFRPHGEAQDFLFIATERYKFCVLQWDAESSEL--ITRAMGDVSDRIGRPTDNGQIGI 122
             +  FRP    +D LFI T RY   +L+    +  +  +T+A G+VSD +G P++ G I  
Sbjct:    64 MRHFRPPDSNKDLLFILTRRYNVMILEARMVNDVITVVTKANGNVSDSVGIPSEGGVIAA 123

Query:   123 IDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQ 182
             IDP  R+IG+ LY GLF +IP D      +A N+R++EL V D++FL+GC  PT++V+++
Sbjct:   124 IDPKARVIGMCLYQGLFTIIPMDKDASELKATNLRMDELNVYDVEFLHGCLNPTVIVIHK 183

Query:   183 DNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSA 242
             D+ D RHVK++E+ L+DK+F++  W Q+N++  A +LIPVP P+ GV++IG E+IVY   
Sbjct:   184 DS-DGRHVKSHEINLRDKEFMKIAWKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDG 242

Query:   243 NAFKAI-PI--RPSITKAYGRVDADGSRYXXXXXXXXXXXXVI-THEKEK---VTGLKIE 295
             + + A+ P+  R S    Y RV ++G RY             + T E  K   V  +K+E
Sbjct:   243 SNYHAVAPLTFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVE 302

Query:   296 LLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFC 355
              LGE SI   I+YLDN  +YIG+ +GDSQL++LN +    GSYV  +E + NL PI+D  
Sbjct:   303 QLGEISIPECITYLDNGFLYIGARHGDSQLVRLNSEA-IDGSYVVPVENFTNLAPILDIA 361

Query:   356 VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDD-PFD 414
             VVDL+RQGQGQ++TCSG++KDGSLRI+R GIGI E A ++L GIKGMWSL+   D+ P++
Sbjct:   362 VVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPGIKGMWSLKVGVDESPYE 421

Query:   415 TFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSS 474
               LV++F+  TRIL ++           GF S  QT  C +  Y+QL+QVTS SVRLVSS
Sbjct:   422 NTLVLAFVGHTRILTLS-GEEVEETEIPGFASDLQTFLCSNVDYDQLIQVTSDSVRLVSS 480

Query:   475 TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISC 534
              ++ L  EW+     ++ V + N +Q+L+A+    + Y+ I DG L E     L YE++C
Sbjct:   481 ATKALVAEWRPTGDRTIGVVSCNTTQILVASACD-IFYIVIEDGSLREQSRRTLAYEVAC 539

Query:   535 LDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGIS 594
             LDI P+ E    S + AVG+WTDIS  I SLPDL  I  E L GEIIPRS+L+  FEGI 
Sbjct:   540 LDITPLDETQKKSDLVAVGLWTDISAVILSLPDLETIYTEKLSGEIIPRSILMTTFEGIH 599

Query:   595 YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIY 654
             YLLCALGDG +  F+++  TG+LTD+KKV+LGTQP TLRTF S +TT+VFA SDRPTVIY
Sbjct:   600 YLLCALGDGSMYYFIMDQTTGQLTDKKKVTLGTQPTTLRTFRSLSTTNVFACSDRPTVIY 659

Query:   655 SSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEH 714
             SSN KL++SNVNLKEV+HMC  N+ A+PDSLA+A +  + +GTID+IQKLHIR++PLGE 
Sbjct:   660 SSNHKLVFSNVNLKEVNHMCSLNAQAYPDSLALANKNAVILGTIDEIQKLHIRTVPLGEG 719

Query:   715 PRRICHQEQSRTFAICSLK 733
             PRRI +QE S+TFA+ +L+
Sbjct:   720 PRRIAYQESSQTFAVSTLR 738

 Score = 749 (268.7 bits), Expect = 2.1e-268, Sum P(2) = 2.1e-268
 Identities = 160/369 (43%), Positives = 225/369 (60%)

Query:   734 NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLP 793
             N    +E ++H + ++D  TFE +  +     E   S++S    DD N YY V T+ V+P
Sbjct:   780 NAEVGQEIDVHNLLVIDQNTFEVLHAHQFVAPETISSLMSAKLGDDPNTYYVVATSLVIP 839

Query:   794 EENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD 853
             EE EP  GRI++F   + KL  +AE +  G  Y+L  FNGK+LA I   ++LY+W     
Sbjct:   840 EEPEPKVGRIIIFHYHENKLTQVAETKVDGTCYALVEFNGKVLAGIGSFVRLYEWT---- 895

Query:   854 GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWM 913
               +EL+ EC     I AL+++ +GDFI+VGDLM+SI+LL +K  EG   E ARD    WM
Sbjct:   896 NEKELRMECNIQNMIAALFLKAKGDFILVGDLMRSITLLQHKQMEGIFVEIARDCEPKWM 955

Query:   914 SAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVM 973
              AVEILDDD +LG+E N NLF  +K+S   TDEER  L  +  +HLG+ VN FRHGSLVM
Sbjct:   956 RAVEILDDDTFLGSETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTVNVFRHGSLVM 1015

Query:   974 RLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNH 1028
             +    +VG+  T     V++GT NG IG++  +P + Y FL  L+  L+K+IK VG + H
Sbjct:  1016 Q----NVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKKIIKSVGKIEH 1071

Query:  1029 EQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDE----ISKTMN-----VSVEELCK 1079
               +R+F    K   ++ F+DGDLIESFLDLSR +M +    +  T+N       VE++ K
Sbjct:  1072 TYYRNFQINSKVEPSEGFIDGDLIESFLDLSRDKMRDAVQGLELTLNGERKSADVEDVIK 1131

Query:  1080 RVEELTRLH 1088
              VE+LTR+H
Sbjct:  1132 IVEDLTRMH 1140


>DICTYBASE|DDB_G0286013 [details] [associations]
            symbol:repE "UV-damaged DNA binding protein1"
            species:44689 "Dictyostelium discoideum" [GO:0005634 "nucleus"
            evidence=IEA;ISS;IDA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0006974 "response to DNA damage stimulus" evidence=IEA;IEP]
            [GO:0006289 "nucleotide-excision repair" evidence=ISS] [GO:0003684
            "damaged DNA binding" evidence=ISS] [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0006281 "DNA repair" evidence=IEA]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0003677 "DNA binding"
            evidence=IEA] [GO:0016567 "protein ubiquitination" evidence=IEA]
            InterPro:IPR017986 InterPro:IPR004871 Pfam:PF03178
            UniPathway:UPA00143 dictyBase:DDB_G0286013 GO:GO:0005634
            GO:GO:0005737 GenomeReviews:CM000153_GR SUPFAM:SSF50978
            GO:GO:0003684 GO:GO:0016567 EMBL:AAFI02000085 GO:GO:0006289
            eggNOG:NOG247734 KO:K10610 OMA:CALGDGS EMBL:U50042 PIR:S71092
            RefSeq:XP_637896.2 STRING:B0M0P5 EnsemblProtists:DDB0191144
            GeneID:8625406 KEGG:ddi:DDB_G0286013 ProtClustDB:CLSZ2430134
            Uniprot:B0M0P5
        Length = 1181

 Score = 835 (299.0 bits), Expect = 7.0e-267, Sum P(4) = 7.0e-267
 Identities = 178/412 (43%), Positives = 257/412 (62%)

Query:   411 DPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCH--DAIYNQLVQVTSGS 468
             D  D +L+ SFI  T++L+             G  S   TL+C   D + N L+Q+T+ S
Sbjct:   471 DSKDRYLITSFIECTKVLSFQ-GEEIEETEFEGLESNCSTLYCGTIDKL-NLLIQITNVS 528

Query:   469 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG--ILTEVKHA 526
             + L+ S + +  ++W   P   +N+ + N  Q++L+     L+Y +I      +  VK  
Sbjct:   529 INLIDSNTFKRVSQWNVEPSRRINLVSTNQDQIVLSIDKS-LLYFQINSSNKSIQLVKEI 587

Query:   527 QLEYEISCLDINPIGE-NPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV 585
             +L +EISC+DI+P      + SQ+ +VG+W DI++RIF LP L  I KE LGGEI+PRS+
Sbjct:   588 ELPHEISCIDISPFDSFMDTKSQLVSVGLWNDITLRIFKLPTLEEIWKEPLGGEILPRSI 647

Query:   586 LLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFA 645
             L+ +F+ I Y+ C+LGDGHL  F  +  + +L D++K++LGTQPI L+ F  KNT ++FA
Sbjct:   648 LMISFDSIDYIFCSLGDGHLFKFQFDFSSFKLFDKRKLTLGTQPIILKKFKLKNTINIFA 707

Query:   646 ASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLH 705
              SDRPTVIYS NKKL YS VNLK+V+++  FNS  FP+S+AIA    LTIGTID+IQKLH
Sbjct:   708 ISDRPTVIYSHNKKLFYSVVNLKDVTNVTSFNSDGFPNSMAIATTNSLTIGTIDEIQKLH 767

Query:   706 IRSIPLGEHP-RRICHQEQSRTFAICSLKNQS---------CAEESEMHFVRLLDDQTFE 755
             I++IPL E   RRI H E    +A+ ++KN           C E+ E+ ++R+ +DQTFE
Sbjct:   768 IKTIPLNEEMGRRIVHLEDHSCYAVITVKNNEGLLGGAQDLCEEDEEVSYIRIYNDQTFE 827

Query:   756 FISTYPLDTFEYGCSILSCSFS-DDSNVYYCVGTAYVLPEENEPTKGRILVF 806
              IS+Y LD +E G SI  C F+ DD N Y  VGT+   P ++    GR+L+F
Sbjct:   828 LISSYKLDPYEMGWSITPCKFAGDDVNTYLAVGTSINTPIKSS---GRVLLF 876

 Score = 711 (255.3 bits), Expect = 7.0e-267, Sum P(4) = 7.0e-267
 Identities = 140/264 (53%), Positives = 187/264 (70%)

Query:   151 KEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQN 210
             K   N+RLEELQVLD+ FLYGC  PTI VL++D KD +H+ TYE++ KD + V GPWSQ+
Sbjct:   191 KNVNNVRLEELQVLDMTFLYGCKVPTIAVLFKDTKDEKHISTYEISSKDTELVVGPWSQS 250

Query:   211 NLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITK--AYGRVDADGSRY 268
             N+   + LL+PVP  L GVL++ +  I Y +    +++ +  S TK  A+ RVD DGSR+
Sbjct:   251 NVGVYSSLLVPVP--LGGVLVVADNGITYLNGKVTRSVAV--SYTKFLAFTRVDKDGSRF 306

Query:   269 XXXXXXXXXXXXVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL 328
                         V+ H+++KV  LK E LG  SI S+ISYLD+ VVYIGSS GDSQLI+L
Sbjct:   307 LFGDHFGRLSVLVLIHQQQKVMELKFEQLGRISIPSSISYLDSGVVYIGSSSGDSQLIRL 366

Query:   329 NLQPD-AKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIG 387
             N + D    SYV  LE + N+GP+VDFCVVD E+QGQ Q+VTCSG Y+DGSLRI+RNGIG
Sbjct:   367 NTEKDQTTDSYVTYLEAFTNIGPVVDFCVVDAEKQGQAQIVTCSGTYRDGSLRIIRNGIG 426

Query:   388 INEQASVELQGIKGMWSLRSSTDD 411
             I EQAS+EL+GIKG++ + ++ ++
Sbjct:   427 IAEQASIELEGIKGIFPINNNNNN 450

 Score = 619 (223.0 bits), Expect = 7.0e-267, Sum P(4) = 7.0e-267
 Identities = 116/276 (42%), Positives = 190/276 (68%)

Query:   810 DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM-LRDDGTRELQSECGHHGHI 868
             +GKL L+ E + + +VY L +FNG+L+AA+++++   ++   ++   + + SE  H GH 
Sbjct:   903 NGKLTLLEEIKFRSSVYFLLSFNGRLIAAVHKRLFSIRYTHSKEKNCKVISSESVHKGHT 962

Query:   869 LALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAE 928
             + L + +RG FI+VGD+MKS+SLL+ +  +G++E+ AR+    W+ +V +++DD ++GAE
Sbjct:   963 MILKLASRGHFILVGDMMKSMSLLV-EQSDGSLEQIARNPQPIWIRSVAMINDDYFIGAE 1021

Query:   929 NNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 988
              + N   V+KN++   + ER  L+ VG YH+GE +N  RHGSLV RLPDSD   IPT+++
Sbjct:  1022 ASNNFIVVKKNNDSTNELERELLDSVGHYHIGESINSMRHGSLV-RLPDSDQPIIPTILY 1080

Query:   989 GTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 1048
              +VNG IGV+AS+  E ++F  KLQ  L +V++GVGG +HE WR+F+N+  T+D+KNF+D
Sbjct:  1081 ASVNGSIGVVASISEEDFIFFSKLQKGLNQVVRGVGGFSHETWRAFSNDHHTIDSKNFID 1140

Query:  1049 GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1084
             GDLIE+FLDL      +    + ++ ++  +R+E L
Sbjct:  1141 GDLIETFLDLKYESQLKAVADLGITPDDAFRRIESL 1176

 Score = 463 (168.0 bits), Expect = 7.0e-267, Sum P(4) = 7.0e-267
 Identities = 84/144 (58%), Positives = 109/144 (75%)

Query:     3 IWNYVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYGRI 62
             ++N+V T  KPT+VTHS  GNFT P + NLII+KCT+IEI L+   GL+PM DV IYGRI
Sbjct:     1 MYNFVSTVQKPTSVTHSVTGNFTGPNDKNLIISKCTKIEIFLMDQDGLKPMFDVNIYGRI 60

Query:    63 ATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGI 122
             + L+LF   G  QD+LFI+TE +KFC+L +D E  E+IT+A G+  D IGRPT+ GQ+GI
Sbjct:    61 SVLKLFSVAGSKQDYLFISTESFKFCILAYDYEKKEIITKASGNAEDTIGRPTEAGQLGI 120

Query:   123 IDPDCRLIGLHLYDGLFKVIPFDN 146
             IDPD R++ LHLY+GL K+I  DN
Sbjct:   121 IDPDGRIVALHLYEGLLKLITLDN 144

 Score = 45 (20.9 bits), Expect = 3.1e-39, Sum P(2) = 3.1e-39
 Identities = 44/210 (20%), Positives = 82/210 (39%)

Query:   873 VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDY--NANWMSAVEILDDDI-YLGAEN 929
             V   G   + GD    +S+L+  H++  + E   +     +  S++  LD  + Y+G+ +
Sbjct:   299 VDKDGSRFLFGDHFGRLSVLVLIHQQQKVMELKFEQLGRISIPSSISYLDSGVVYIGSSS 358

Query:   930 -NFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 988
              +  L  +    +  TD     LE     ++G  V+ F     V+        QI T   
Sbjct:   359 GDSQLIRLNTEKDQTTDSYVTYLEAFT--NIGPVVD-F----CVVDAEKQGQAQIVTCS- 410

Query:   989 GTV-NGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 1047
             GT  +G + +I +         E+    L   IKG+  +N+    + NN     +  N  
Sbjct:   411 GTYRDGSLRIIRN----GIGIAEQASIELEG-IKGIFPINNNNNNNNNNNNNNNNNNNNN 465

Query:  1048 DGDLIESFLDLSRTRMDEISKTMNVSVEEL 1077
                + +S      T   E +K ++   EE+
Sbjct:   466 SNGITDSKDRYLITSFIECTKVLSFQGEEI 495


>WB|WBGene00010890 [details] [associations]
            symbol:ddb-1 species:6239 "Caenorhabditis elegans"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0040010 "positive regulation of growth
            rate" evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0040011 "locomotion" evidence=IMP] [GO:0000003
            "reproduction" evidence=IMP] [GO:0009792 "embryo development ending
            in birth or egg hatching" evidence=IMP] [GO:0006898
            "receptor-mediated endocytosis" evidence=IMP] [GO:0040035
            "hermaphrodite genitalia development" evidence=IMP] [GO:0030163
            "protein catabolic process" evidence=IMP] [GO:0007276 "gamete
            generation" evidence=IMP] [GO:0005515 "protein binding"
            evidence=IPI] InterPro:IPR004871 Pfam:PF03178 UniPathway:UPA00143
            GO:GO:0005634 GO:GO:0009792 GO:GO:0006898 GO:GO:0005737
            GO:GO:0040010 GO:GO:0002119 GO:GO:0003677 GO:GO:0006281
            GO:GO:0040011 GO:GO:0016567 GO:GO:0007049 GO:GO:0040035
            InterPro:IPR011047 SUPFAM:SSF50998 GO:GO:0030163 GO:GO:0007276
            eggNOG:NOG247734 HOGENOM:HOG000007241 KO:K10610 OMA:CALGDGS
            GeneTree:ENSGT00530000063396 EMBL:Z68507 PIR:A88855 PIR:T23798
            RefSeq:NP_502299.1 HSSP:Q16531 ProteinModelPortal:Q21554
            DIP:DIP-25884N IntAct:Q21554 MINT:MINT-1055778 STRING:Q21554
            PaxDb:Q21554 EnsemblMetazoa:M18.5.1 EnsemblMetazoa:M18.5.2
            GeneID:178156 KEGG:cel:CELE_M18.5 UCSC:M18.5 CTD:178156
            WormBase:M18.5 InParanoid:Q21554 NextBio:899950 Uniprot:Q21554
        Length = 1134

 Score = 1119 (399.0 bits), Expect = 2.7e-167, Sum P(2) = 2.7e-167
 Identities = 263/760 (34%), Positives = 434/760 (57%)

Query:     5 NYVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYGRIAT 64
             +Y V+A K + V  S VGNFT  + +NLI+A+  RI++ L++P+GL+ + ++PIYG++ T
Sbjct:     4 SYCVSAKKASVVVESVVGNFTGHENVNLIVARGNRIDVQLVSPEGLKNVCEIPIYGQVLT 63

Query:    65 LELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIID 124
             + L +   + +  L + TE++   +L +     +++TRA G ++D  GR TDN    +  
Sbjct:    64 IALVKCKRDKRHSLIVVTEKWHMAILAY--RDGKVVTRAAGCIADPTGRATDN-LFSLTI 120

Query:   125 PDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL-YGCAKPTIVVLYQD 183
                 LI +  ++G  K+I +++   L+  FN+R +   V D KF+  G      V    D
Sbjct:   121 HRNGLIAIRAFEGSVKMIQWESGTDLRH-FNVRFDYPNVSDFKFVDTGEDDVYRVAFIYD 179

Query:   184 NKDARHVKTYEVALKDKDFVEGPWS-QNNLDNGADLLIPVPPPLCGVLIIGEETIVYC-S 241
             +   +H++  ++ + DK+F    +S Q ++   + +LIPVP  + GV+++G  +++Y  +
Sbjct:   180 DDHGKHLQFSDLNMHDKEF--RTYSRQASIAADSSVLIPVPHAIGGVIVLGSNSVLYKPN 237

Query:   242 ANAFKAIPIRPSITK-----AYGRVDADGSRYXXXXXXXXXXXXV--ITHEKEKVT--GL 292
              N  + +P   S+ +      +G VDA G R+            +  +T  +   T   +
Sbjct:   238 DNLGEVVPYTCSLLENTTFTCHGIVDASGERFLLSDTDGRLLMLLLNVTESQSGYTVKEM 297

Query:   293 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 352
             +I+ LGETSIA +I+Y+DN VV++GS  GDSQLI+L  +P+  GSY  +LE Y N+GPI 
Sbjct:   298 RIDYLGETSIADSINYIDNGVVFVGSRLGDSQLIRLMTEPNG-GSYSVILETYSNIGPIR 356

Query:   353 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 412
             D  +V  E  GQ Q+VTC+GA KDGSLR++RNGIGI+E ASV+L G+ G++ +R   D  
Sbjct:   357 DMVMV--ESDGQPQLVTCTGADKDGSLRVIRNGIGIDELASVDLAGVVGIFPIR--LDSN 412

Query:   413 FDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIY-NQ---LVQVTSGS 468
              D +++VS   ET +L +               +   T+F       N    ++Q T   
Sbjct:   413 ADNYVIVSLSDETHVLQIT-GEELEDVKLLEINTDLPTIFASTLFGPNDSGIILQATEKQ 471

Query:   469 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYL------EIG--DGIL 520
             +RL+SS+   L   W+   G  ++  + NA+   +       VYL      E+G  D  L
Sbjct:   472 IRLMSSSG--LSKFWEPTNGEIISKVSVNAANGQIVLAARDTVYLLTCIVDEMGALDIQL 529

Query:   521 TEVKHAQLEYEISCLDINPIGENPSY-SQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE 579
             T  K  + E EI+CLD++  G++P+  +    +  W+  ++ +  LPDL  +    L  +
Sbjct:   530 TAEK--KFENEIACLDLSNEGDDPNNKATFLVLAFWSTFAMEVIQLPDLITVCHTDLPTK 587

Query:   580 IIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN 639
             IIPRS++    E + YLL A GDG L+ ++ ++KTG   + KK ++GT+P +L    +KN
Sbjct:   588 IIPRSIIATCIEEVHYLLVAFGDGALVYYVFDIKTGTHGEPKKSNVGTRPPSLHRVRNKN 647

Query:   640 TTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID 699
               H+F  SDRP +I+S++KKL++SNVN+K V  +C  +S+A+ D L I+    +  GT+D
Sbjct:   648 RQHLFVCSDRPVIIFSASKKLVFSNVNVKLVDTVCSLSSSAYRDCLVISDGNSMVFGTVD 707

Query:   700 DIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAE 739
             DIQK+H+RSIP+GE   RI +Q+ + T+ +CS + +S AE
Sbjct:   708 DIQKIHVRSIPMGESVLRIAYQKSTSTYGVCSNRTESKAE 747

 Score = 530 (191.6 bits), Expect = 2.7e-167, Sum P(2) = 2.7e-167
 Identities = 120/354 (33%), Positives = 196/354 (55%)

Query:   748 LLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFI 807
             +LD  TF+ + ++    +E   S +S  F++DS+ YY VGT  + P+E E   GRI+VF 
Sbjct:   785 VLDQNTFQVLHSHEFGPWETALSCISGQFTNDSSTYYVVGTGLIYPDETETKIGRIVVFE 844

Query:   808 VED---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH 864
             V+D    KL+ + E   +G+  ++   NGKL+AAIN  I+L++W       +EL+ EC  
Sbjct:   845 VDDVERSKLRRVHELVVRGSPLAIRILNGKLVAAINSSIRLFEWTT----DKELRLECSS 900

Query:   865 HGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 924
               H++AL ++   + + V D+M+S+SLL Y+  EG  EE A+D+N+ WM   E +  +  
Sbjct:   901 FNHVIALDLKVMNEEVAVADVMRSVSLLSYRMLEGNFEEVAKDWNSQWMVTCEFITAESI 960

Query:   925 LGAENNFNLFTVRKN-SEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI 983
             LG E + NLFTV  + +   TD+ R  LE  G ++LGE        +LV++  DS +   
Sbjct:   961 LGGEAHLNLFTVEVDKTRPITDDGRYVLEPTGYWYLGELPKVMTRSTLVIQPEDSIIQYS 1020

Query:   984 PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDA 1043
               ++FGT  G IG+I  +  +   FL  ++  +   +K    + H  +R+F  +K+    
Sbjct:  1021 QPIMFGTNQGTIGMIVQIDDKWKKFLIAIEKAIADSVKNCMHIEHSSYRTFVFQKRAEPP 1080

Query:  1044 KNFLDGDLIESFLDLSRT-RMDEISKTMNVSVE--------ELCKRVEELTRLH 1088
               F+DGDL+ES LD+ R+  MD +SK  +   +        E+ K +E+L R+H
Sbjct:  1081 SGFVDGDLVESILDMDRSVAMDILSKVSDKGWDPSLPRDPVEILKVIEDLARMH 1134

 Score = 42 (19.8 bits), Expect = 9.8e-116, Sum P(2) = 9.8e-116
 Identities = 7/29 (24%), Positives = 16/29 (55%)

Query:   681 FPDSLAIAKEGELTIGTIDDIQKLHIRSI 709
             +PD     K G + +  +DD+++  +R +
Sbjct:   829 YPDETE-TKIGRIVVFEVDDVERSKLRRV 856


>UNIPROTKB|Q21554 [details] [associations]
            symbol:ddb-1 "DNA damage-binding protein 1" species:6239
            "Caenorhabditis elegans" [GO:0005515 "protein binding"
            evidence=IPI] [GO:0005737 "cytoplasm" evidence=ISS] [GO:0005634
            "nucleus" evidence=ISS] InterPro:IPR004871 Pfam:PF03178
            UniPathway:UPA00143 GO:GO:0005634 GO:GO:0009792 GO:GO:0006898
            GO:GO:0005737 GO:GO:0040010 GO:GO:0002119 GO:GO:0003677
            GO:GO:0006281 GO:GO:0040011 GO:GO:0016567 GO:GO:0007049
            GO:GO:0040035 InterPro:IPR011047 SUPFAM:SSF50998 GO:GO:0030163
            GO:GO:0007276 eggNOG:NOG247734 HOGENOM:HOG000007241 KO:K10610
            OMA:CALGDGS GeneTree:ENSGT00530000063396 EMBL:Z68507 PIR:A88855
            PIR:T23798 RefSeq:NP_502299.1 HSSP:Q16531 ProteinModelPortal:Q21554
            DIP:DIP-25884N IntAct:Q21554 MINT:MINT-1055778 STRING:Q21554
            PaxDb:Q21554 EnsemblMetazoa:M18.5.1 EnsemblMetazoa:M18.5.2
            GeneID:178156 KEGG:cel:CELE_M18.5 UCSC:M18.5 CTD:178156
            WormBase:M18.5 InParanoid:Q21554 NextBio:899950 Uniprot:Q21554
        Length = 1134

 Score = 1119 (399.0 bits), Expect = 2.7e-167, Sum P(2) = 2.7e-167
 Identities = 263/760 (34%), Positives = 434/760 (57%)

Query:     5 NYVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYGRIAT 64
             +Y V+A K + V  S VGNFT  + +NLI+A+  RI++ L++P+GL+ + ++PIYG++ T
Sbjct:     4 SYCVSAKKASVVVESVVGNFTGHENVNLIVARGNRIDVQLVSPEGLKNVCEIPIYGQVLT 63

Query:    65 LELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIID 124
             + L +   + +  L + TE++   +L +     +++TRA G ++D  GR TDN    +  
Sbjct:    64 IALVKCKRDKRHSLIVVTEKWHMAILAY--RDGKVVTRAAGCIADPTGRATDN-LFSLTI 120

Query:   125 PDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL-YGCAKPTIVVLYQD 183
                 LI +  ++G  K+I +++   L+  FN+R +   V D KF+  G      V    D
Sbjct:   121 HRNGLIAIRAFEGSVKMIQWESGTDLRH-FNVRFDYPNVSDFKFVDTGEDDVYRVAFIYD 179

Query:   184 NKDARHVKTYEVALKDKDFVEGPWS-QNNLDNGADLLIPVPPPLCGVLIIGEETIVYC-S 241
             +   +H++  ++ + DK+F    +S Q ++   + +LIPVP  + GV+++G  +++Y  +
Sbjct:   180 DDHGKHLQFSDLNMHDKEF--RTYSRQASIAADSSVLIPVPHAIGGVIVLGSNSVLYKPN 237

Query:   242 ANAFKAIPIRPSITK-----AYGRVDADGSRYXXXXXXXXXXXXV--ITHEKEKVT--GL 292
              N  + +P   S+ +      +G VDA G R+            +  +T  +   T   +
Sbjct:   238 DNLGEVVPYTCSLLENTTFTCHGIVDASGERFLLSDTDGRLLMLLLNVTESQSGYTVKEM 297

Query:   293 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 352
             +I+ LGETSIA +I+Y+DN VV++GS  GDSQLI+L  +P+  GSY  +LE Y N+GPI 
Sbjct:   298 RIDYLGETSIADSINYIDNGVVFVGSRLGDSQLIRLMTEPNG-GSYSVILETYSNIGPIR 356

Query:   353 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 412
             D  +V  E  GQ Q+VTC+GA KDGSLR++RNGIGI+E ASV+L G+ G++ +R   D  
Sbjct:   357 DMVMV--ESDGQPQLVTCTGADKDGSLRVIRNGIGIDELASVDLAGVVGIFPIR--LDSN 412

Query:   413 FDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIY-NQ---LVQVTSGS 468
              D +++VS   ET +L +               +   T+F       N    ++Q T   
Sbjct:   413 ADNYVIVSLSDETHVLQIT-GEELEDVKLLEINTDLPTIFASTLFGPNDSGIILQATEKQ 471

Query:   469 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYL------EIG--DGIL 520
             +RL+SS+   L   W+   G  ++  + NA+   +       VYL      E+G  D  L
Sbjct:   472 IRLMSSSG--LSKFWEPTNGEIISKVSVNAANGQIVLAARDTVYLLTCIVDEMGALDIQL 529

Query:   521 TEVKHAQLEYEISCLDINPIGENPSY-SQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE 579
             T  K  + E EI+CLD++  G++P+  +    +  W+  ++ +  LPDL  +    L  +
Sbjct:   530 TAEK--KFENEIACLDLSNEGDDPNNKATFLVLAFWSTFAMEVIQLPDLITVCHTDLPTK 587

Query:   580 IIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN 639
             IIPRS++    E + YLL A GDG L+ ++ ++KTG   + KK ++GT+P +L    +KN
Sbjct:   588 IIPRSIIATCIEEVHYLLVAFGDGALVYYVFDIKTGTHGEPKKSNVGTRPPSLHRVRNKN 647

Query:   640 TTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID 699
               H+F  SDRP +I+S++KKL++SNVN+K V  +C  +S+A+ D L I+    +  GT+D
Sbjct:   648 RQHLFVCSDRPVIIFSASKKLVFSNVNVKLVDTVCSLSSSAYRDCLVISDGNSMVFGTVD 707

Query:   700 DIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAE 739
             DIQK+H+RSIP+GE   RI +Q+ + T+ +CS + +S AE
Sbjct:   708 DIQKIHVRSIPMGESVLRIAYQKSTSTYGVCSNRTESKAE 747

 Score = 530 (191.6 bits), Expect = 2.7e-167, Sum P(2) = 2.7e-167
 Identities = 120/354 (33%), Positives = 196/354 (55%)

Query:   748 LLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFI 807
             +LD  TF+ + ++    +E   S +S  F++DS+ YY VGT  + P+E E   GRI+VF 
Sbjct:   785 VLDQNTFQVLHSHEFGPWETALSCISGQFTNDSSTYYVVGTGLIYPDETETKIGRIVVFE 844

Query:   808 VED---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH 864
             V+D    KL+ + E   +G+  ++   NGKL+AAIN  I+L++W       +EL+ EC  
Sbjct:   845 VDDVERSKLRRVHELVVRGSPLAIRILNGKLVAAINSSIRLFEWTT----DKELRLECSS 900

Query:   865 HGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 924
               H++AL ++   + + V D+M+S+SLL Y+  EG  EE A+D+N+ WM   E +  +  
Sbjct:   901 FNHVIALDLKVMNEEVAVADVMRSVSLLSYRMLEGNFEEVAKDWNSQWMVTCEFITAESI 960

Query:   925 LGAENNFNLFTVRKN-SEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI 983
             LG E + NLFTV  + +   TD+ R  LE  G ++LGE        +LV++  DS +   
Sbjct:   961 LGGEAHLNLFTVEVDKTRPITDDGRYVLEPTGYWYLGELPKVMTRSTLVIQPEDSIIQYS 1020

Query:   984 PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDA 1043
               ++FGT  G IG+I  +  +   FL  ++  +   +K    + H  +R+F  +K+    
Sbjct:  1021 QPIMFGTNQGTIGMIVQIDDKWKKFLIAIEKAIADSVKNCMHIEHSSYRTFVFQKRAEPP 1080

Query:  1044 KNFLDGDLIESFLDLSRT-RMDEISKTMNVSVE--------ELCKRVEELTRLH 1088
               F+DGDL+ES LD+ R+  MD +SK  +   +        E+ K +E+L R+H
Sbjct:  1081 SGFVDGDLVESILDMDRSVAMDILSKVSDKGWDPSLPRDPVEILKVIEDLARMH 1134

 Score = 42 (19.8 bits), Expect = 9.8e-116, Sum P(2) = 9.8e-116
 Identities = 7/29 (24%), Positives = 16/29 (55%)

Query:   681 FPDSLAIAKEGELTIGTIDDIQKLHIRSI 709
             +PD     K G + +  +DD+++  +R +
Sbjct:   829 YPDETE-TKIGRIVVFEVDDVERSKLRRV 856


>UNIPROTKB|F1M680 [details] [associations]
            symbol:Ddb1 "DNA damage-binding protein 1" species:10116
            "Rattus norvegicus" [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178 RGD:621889
            GO:GO:0005634 Gene3D:2.130.10.10 GO:GO:0003676 IPI:IPI00950036
            Ensembl:ENSRNOT00000063867 ArrayExpress:F1M680 Uniprot:F1M680
        Length = 600

 Score = 922 (329.6 bits), Expect = 6.2e-142, Sum P(2) = 6.2e-142
 Identities = 192/378 (50%), Positives = 253/378 (66%)

Query:   724 SRTFAICSLKNQ-SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV 782
             S+ F+  +  ++ S  EE E+H + ++D  TFE +  +     EY  S++SC    D N 
Sbjct:   227 SKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNT 286

Query:   783 YYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 842
             Y+ VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  
Sbjct:   287 YFIVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINST 346

Query:   843 IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 902
             ++LY+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  E
Sbjct:   347 VRLYEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFE 402

Query:   903 ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 962
             E ARD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEF
Sbjct:   403 EIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEF 462

Query:   963 VNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK 1021
             VN F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK
Sbjct:   463 VNVFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIK 522

Query:  1022 GVGGLNHE-QWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN--------- 1071
              +  L H   WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +          
Sbjct:   523 SLCSLTHLFTWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKR 582

Query:  1072 -VSVEELCKRVEELTRLH 1088
               + ++L K VEELTR+H
Sbjct:   583 EATADDLIKVVEELTRIH 600

 Score = 487 (176.5 bits), Expect = 6.2e-142, Sum P(2) = 6.2e-142
 Identities = 94/137 (68%), Positives = 113/137 (82%)

Query:   595 YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIY 654
             YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VFA SDRPTVIY
Sbjct:    60 YLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVIY 119

Query:   655 SSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEH 714
             SSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKLHIR++PL E 
Sbjct:   120 SSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYES 179

Query:   715 PRRICHQEQSRTFAICS 731
             PR+IC+QE S+ F + S
Sbjct:   180 PRKICYQEVSQCFGVLS 196


>ASPGD|ASPL0000052925 [details] [associations]
            symbol:ddbA species:162425 "Emericella nidulans"
            [GO:0006282 "regulation of DNA repair" evidence=IEA;ISA]
            [GO:0006974 "response to DNA damage stimulus" evidence=IEP;IMP]
            [GO:0005634 "nucleus" evidence=IDA] [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0005730 "nucleolus" evidence=IEA]
            [GO:0005829 "cytosol" evidence=IEA] [GO:0070913 "Ddb1-Wdr21
            complex" evidence=IEA] [GO:0008180 "signalosome" evidence=IEA]
            [GO:0070912 "Ddb1-Ckn1 complex" evidence=IEA] [GO:0031465
            "Cul4B-RING ubiquitin ligase complex" evidence=IEA] [GO:0006283
            "transcription-coupled nucleotide-excision repair" evidence=IEA]
            [GO:0040020 "regulation of meiosis" evidence=IEA] [GO:0042787
            "protein ubiquitination involved in ubiquitin-dependent protein
            catabolic process" evidence=IEA] [GO:0007090 "regulation of S phase
            of mitotic cell cycle" evidence=IEA] [GO:0034644 "cellular response
            to UV" evidence=IEA] InterPro:IPR004871 InterPro:IPR015943
            Pfam:PF03178 GO:GO:0005634 Gene3D:2.130.10.10 EMBL:BN001308
            GO:GO:0003676 EMBL:AACD01000007 KO:K10610 OMA:DRPAVIY
            OrthoDB:EOG473T0C RefSeq:XP_658200.1 STRING:Q5BFT4
            EnsemblFungi:CADANIAT00002078 GeneID:2876375 KEGG:ani:AN0596.2
            eggNOG:NOG316722 HOGENOM:HOG000216556 Uniprot:Q5BFT4
        Length = 1132

 Score = 937 (334.9 bits), Expect = 3.8e-114, Sum P(2) = 3.8e-114
 Identities = 279/940 (29%), Positives = 467/940 (49%)

Query:   188 RHVKTYEVALKDKDFVE-GPWSQNNLDNGADLLIPVPPPLC---GVLIIGEETIVYCSA- 242
             R +K    A  + +F     ++Q  LD GA  LIPVP PL    G+LI+GE +I Y  A 
Sbjct:   213 RELKYSTAAGAESEFTSIADYAQE-LDLGASHLIPVPAPLAAAGGLLILGETSIKYVDAD 271

Query:   243 -NAFKAIPIRPS-ITKAYGRVDADGSRYXXXXXXXXXXXXVITHEKEKVTGLKIELLGET 300
              N   + P+  + I  A+ +VD+   R+            ++     +V   ++  LG T
Sbjct:   272 NNEIVSQPLEEATIFVAWEQVDSQ--RWLLADDYGRLFFLMLVLRNSEVERWELHSLGNT 329

Query:   301 SIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL- 359
             S AS + YL   VV++GS  GDSQ+I++  Q     S+ +V++   N+ P++DF ++DL 
Sbjct:   330 SRASVLVYLGGGVVFVGSHQGDSQVIRIGDQ-----SF-QVIQTLSNIAPVLDFTIMDLG 383

Query:   360 ----ERQ------GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLR-S 407
                 E Q      GQ ++VT SGA+ DG+LR VR+G+G+ E   + +++ I  +W L+  
Sbjct:   384 NRTSENQMHEFSSGQARIVTGSGAFDDGTLRSVRSGVGLEELGVLGDMEHITDLWGLQVG 443

Query:   408 STDDPFDTFLVVSFISETRILAMNLXXXXXXXXXX-GFCSQTQTLFCHDAIYNQLVQVTS 466
             S  D  DT L+V+F++ETR+   +            G      TL   +   ++++QVT 
Sbjct:   444 SRGDFLDT-LLVTFVNETRVFRFSPDGEAEELESFLGLSLSENTLLAANLPGSRILQVTE 502

Query:   467 GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKH 525
               V +          EW       +  A+AN   ++L  GG H+  L+I  +  +   K 
Sbjct:   503 QRVLIADIECGMTIFEWTPKNQLIITAASANDDTIVLVAGGKHVTVLDIQSEARVVSEKD 562

Query:   526 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPR 583
                + +IS + + P    P+   +  VG      V +  L DL+ I+   LG  GE  PR
Sbjct:   563 FGADNQISGVTL-PT--TPT--DVCIVGFPQLAKVSVLKLQDLSHISSTSLGPAGEAFPR 617

Query:   584 SVLLCAF--EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-T 640
             SVL+ +   E    L  ++ DG ++ +  N +   L+   K+ LG++  T +     N  
Sbjct:   618 SVLVASVLAENAPTLFISMADGSVITYDYNDQDHSLSGMNKLVLGSEQPTFKKLPRGNGL 677

Query:   641 THVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDD 700
             ++VFA  + P++IY S  +++YS VN +  S +C FNS A+P+S+A+A   EL IG +D 
Sbjct:   678 SNVFATCENPSLIYGSEGRIIYSAVNSEGASRICHFNSEAYPESIAVATAQELKIGLVDK 737

Query:   701 IQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEES--EMHFVRLLDDQTFEFIS 758
              +   I+++P+    RR+ +    + F + +++ +  + E   +  FV L D+  F  + 
Sbjct:   738 ERTTQIQTLPIKATVRRVAYSPSEKAFGMGTIERKLVSGEEIVKSQFV-LADEILFRRLD 796

Query:   759 TYPLDTFEY-GCSILS-CSFSDDSNVY--YCVGTAYVLPEENEPTKGRILVFIVEDG-KL 813
              + L+  E   C I +    S D      + VG+AY+  ++ + T G I VF V++G KL
Sbjct:   797 AFDLEGEEIVECVIRAEAPESKDGEAKDRFVVGSAYLGEDDGDSTLGYIRVFEVDNGRKL 856

Query:   814 QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV 873
               +A++  KGA  +L     K++AA+ + + +++ + R  G  +LQ    +      + +
Sbjct:   857 AKVAQERVKGACRALAVMGDKIVAALVKTVVVFQVVPRSGGL-QLQRLASYRTSTAPVDI 915

Query:   874 QTRGDFIVVGDLMKSISLLIYKH-EEGA---IEERARDYNANWMSAVEILDDDIYLGAEN 929
                 + I + DLMKS+ ++ Y   E GA   + E AR +   W + V  +  D YL ++ 
Sbjct:   916 TVTRNVIAIADLMKSVCVVEYHEGENGAPDKLVEVARHFQTVWATGVTSVAPDTYLESDA 975

Query:   930 NFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFG 989
               NL  +R+N  G  +++R RLEV GE  L E VNR R  + + +LP + V  +P     
Sbjct:   976 EGNLIVLRRNRSGVEEDDRRRLEVTGEICLNEMVNRIRPVN-IQQLPSATV--VPRAFLA 1032

Query:   990 TVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNN-EKKTVDAKNFLD 1048
             TV G I + A +  +   FL +LQ  +      +GG+    +R+F    ++  +   F+D
Sbjct:  1033 TVEGSIYLYAIINPDYQDFLMRLQATMASRADSLGGIPFTDYRAFRTMTRQATEPYRFVD 1092

Query:  1049 GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1088
             G+LIE FL        EI   +  S+EE+   VE L RLH
Sbjct:  1093 GELIERFLTCEPAVQKEIVDIVGSSLEEVRAIVEALRRLH 1132

 Score = 446 (162.1 bits), Expect = 1.1e-54, Sum P(2) = 1.1e-54
 Identities = 152/524 (29%), Positives = 251/524 (47%)

Query:   148 GQLKEAFNIRLEELQVLDIKFLYGCA-KPTIVVLYQDNKD-----ARHVKTYEVALKDKD 201
             G+L E    R++EL V    FL+  A  P + +LY+DN+       R +K    A  + +
Sbjct:   167 GELGEPIITRIDELFVRSSAFLHVQAGSPRLALLYEDNQKKVKLKVRELKYSTAAGAESE 226

Query:   202 FVE-GPWSQNNLDNGADLLIPVPPPLC---GVLIIGEETIVYCSA--NAFKAIPIRPS-I 254
             F     ++Q  LD GA  LIPVP PL    G+LI+GE +I Y  A  N   + P+  + I
Sbjct:   227 FTSIADYAQE-LDLGASHLIPVPAPLAAAGGLLILGETSIKYVDADNNEIVSQPLEEATI 285

Query:   255 TKAYGRVDADGSRYXXXXXXXXXXXXVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 314
               A+ +VD+   R+            ++     +V   ++  LG TS AS + YL   VV
Sbjct:   286 FVAWEQVDSQ--RWLLADDYGRLFFLMLVLRNSEVERWELHSLGNTSRASVLVYLGGGVV 343

Query:   315 YIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------G 363
             ++GS  GDSQ+I++  Q     S+ +V++   N+ P++DF ++DL     E Q      G
Sbjct:   344 FVGSHQGDSQVIRIGDQ-----SF-QVIQTLSNIAPVLDFTIMDLGNRTSENQMHEFSSG 397

Query:   364 QGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLR-SSTDDPFDTFLVVSF 421
             Q ++VT SGA+ DG+LR VR+G+G+ E   + +++ I  +W L+  S  D  DT L+V+F
Sbjct:   398 QARIVTGSGAFDDGTLRSVRSGVGLEELGVLGDMEHITDLWGLQVGSRGDFLDT-LLVTF 456

Query:   422 ISETRILAMNLXXXXXXXXXX-GFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELR 480
             ++ETR+   +            G      TL   +   ++++QVT   V +         
Sbjct:   457 VNETRVFRFSPDGEAEELESFLGLSLSENTLLAANLPGSRILQVTEQRVLIADIECGMTI 516

Query:   481 NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLD-INP 539
              EW       +  A+AN   ++L  GG H+  L+I     +E +    E +    + I+ 
Sbjct:   517 FEWTPKNQLIITAASANDDTIVLVAGGKHVTVLDIQ----SEARVVS-EKDFGADNQISG 571

Query:   540 IGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISY 595
             +    + + +  VG      V +  L DL+ I+   LG  GE  PRSVL+ +   E    
Sbjct:   572 VTLPTTPTDVCIVGFPQLAKVSVLKLQDLSHISSTSLGPAGEAFPRSVLVASVLAENAPT 631

Query:   596 LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN 639
             L  ++ DG ++ +  N +   L+   K+ LG++  T +     N
Sbjct:   632 LFISMADGSVITYDYNDQDHSLSGMNKLVLGSEQPTFKKLPRGN 675

 Score = 209 (78.6 bits), Expect = 3.8e-114, Sum P(2) = 3.8e-114
 Identities = 43/141 (30%), Positives = 76/141 (53%)

Query:     5 NYVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYGRIAT 64
             +Y+   H+ +++ H+   +F + ++  L++AK  ++E + +TP GL  +    I+ R+  
Sbjct:     2 SYIAPIHRASSIRHALKLHFLNAEDECLVVAKANQLEFYSVTPDGLALVTSCSIFARVTM 61

Query:    65 LE-LFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGI 122
             L  L  P     D LF+ T+RY +  L WD+  +++ T R   D++D   R    G   +
Sbjct:    62 LACLPAPANSPTDHLFVGTDRYSYFTLSWDSARNQVRTERDYVDIADPSSRDARTGSRCM 121

Query:   123 IDPDCRLIGLHLYDGLFKVIP 143
             IDP  R + L +YDG+  VIP
Sbjct:   122 IDPSGRFMTLEIYDGMIVVIP 142

 Score = 41 (19.5 bits), Expect = 2.0e-11, Sum P(2) = 2.0e-11
 Identities = 14/55 (25%), Positives = 28/55 (50%)

Query:   642 HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 696
             HV A S R  ++Y  N+K +   + ++E+ +     + +   S+A   + EL +G
Sbjct:   189 HVQAGSPRLALLYEDNQKKV--KLKVRELKYSTAAGAESEFTSIADYAQ-ELDLG 240


>UNIPROTKB|B4DG00 [details] [associations]
            symbol:DDB1 "cDNA FLJ52436, highly similar to DNA
            damage-binding protein 1" species:9606 "Homo sapiens" [GO:0003676
            "nucleic acid binding" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178
            GO:GO:0005634 Gene3D:2.130.10.10 GO:GO:0003676 EMBL:AP003108
            UniGene:Hs.290758 HGNC:HGNC:2717 ChiTaRS:DDB1 EMBL:AP003037
            EMBL:AK294341 IPI:IPI00909177 SMR:B4DG00 STRING:B4DG00
            Ensembl:ENST00000450997 UCSC:uc010rle.1 HOGENOM:HOG000069916
            HOVERGEN:HBG102355 Uniprot:B4DG00
        Length = 451

 Score = 940 (336.0 bits), Expect = 7.8e-114, Sum P(2) = 7.8e-114
 Identities = 193/377 (51%), Positives = 254/377 (67%)

Query:   724 SRTFAICSLKNQ-SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV 782
             S+ F+  +  ++ S  EE E+H + ++D  TFE +  +     EY  S++SC    D N 
Sbjct:    79 SKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNT 138

Query:   783 YYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 842
             Y+ VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  
Sbjct:   139 YFIVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINST 198

Query:   843 IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 902
             ++LY+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  E
Sbjct:   199 VRLYEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFE 254

Query:   903 ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 962
             E ARD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEF
Sbjct:   255 EIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEF 314

Query:   963 VNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK 1021
             VN F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK
Sbjct:   315 VNVFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIK 374

Query:  1022 GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN---------- 1071
              VG + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +           
Sbjct:   375 SVGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKRE 434

Query:  1072 VSVEELCKRVEELTRLH 1088
              + ++L K VEELTR+H
Sbjct:   435 ATADDLIKVVEELTRIH 451

 Score = 203 (76.5 bits), Expect = 7.8e-114, Sum P(2) = 7.8e-114
 Identities = 38/67 (56%), Positives = 54/67 (80%)

Query:     4 WNYVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYGRIA 63
             +NYVVTA KPT V     G+FTS ++LNL+IAK TR+EI+++T +GL+P+ +V +YG+IA
Sbjct:     3 YNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVVTAEGLRPVKEVGMYGKIA 62

Query:    64 TLELFRP 70
              +ELFRP
Sbjct:    63 VMELFRP 69


>UNIPROTKB|G4N4E2 [details] [associations]
            symbol:MGG_16867 "Uncharacterized protein" species:242507
            "Magnaporthe oryzae 70-15" [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178
            GO:GO:0005634 Gene3D:2.130.10.10 EMBL:CM001233 GO:GO:0003676
            RefSeq:XP_003712617.1 EnsemblFungi:MGG_16867T0 GeneID:12985117
            KEGG:mgr:MGG_16867 Uniprot:G4N4E2
        Length = 1183

 Score = 471 (170.9 bits), Expect = 1.4e-79, Sum P(3) = 1.4e-79
 Identities = 157/559 (28%), Positives = 252/559 (45%)

Query:   363 GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV--ELQGIKGMWSLRSSTDDPFDTFLVVS 420
             GQ ++VT SGA KDGSLR VR+G+G+ +   +  E+ G+ G++SL+S   D  DT LVVS
Sbjct:   417 GQARIVTASGAQKDGSLRSVRSGVGLEDIGVITDEISGVTGLFSLKSYGSDVEDT-LVVS 475

Query:   421 FISETRILAMNLXXXXXXXXXX-GFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSREL 479
             F++ETR+   +            G      TL         ++ VT     L  +     
Sbjct:   476 FLTETRVFRFDKQGEVEELSQLQGLDISQPTLLVLGLDNGHVLYVTEEKATLFDAEGGVT 535

Query:   480 RNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQL-EYEISCLDIN 538
              + W    G  +  A++N   VLL+  G  LV L IG  +    +  +  E +ISC    
Sbjct:   536 ISSWSPTSGKPITHASSNGRWVLLSVDGRKLVSLNIGLDLKVSAESEERDEDQISC---- 591

Query:   539 PIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLC----AF 590
              +  +P    + AVG W+  ++ I  L  L     E L       ++ R V+L     A 
Sbjct:   592 -VNASPHLLDVGAVGFWSSGTISIIDLKTLEATQTEKLRRNEDDAVVAREVVLARVLPAE 650

Query:   591 EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTHVFAASDR 649
                  L  +  DG ++ F+ N   G L+ RK V LGT+    R     N    +F   + 
Sbjct:   651 VANPTLFVSKDDGEVMTFVYN-DNGTLSSRKSVVLGTREARFRVLPQPNGLCSIFVTCEH 709

Query:   650 PTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG------ELTIGTIDDIQK 703
              ++I+ + ++++YS V     +++CPF++AAF D LA+A E       EL I  ID  ++
Sbjct:   710 SSLIHGAERRIVYSAVTAHSAAYVCPFDTAAFRDCLAVATESAIDRRMELKISRIDRQRQ 769

Query:   704 LHIRSIPLGEHPRRICHQEQSRTFAI-CSLKNQSCAEESEMHFVRLLDDQTFEFI-STYP 761
               + + P+GE+ R I +    + F + C  +  S   E      +L D+  FE   + + 
Sbjct:   770 CQMMTRPMGENVRSIAYSSADKVFGLGCIRRVLSRGIEKVYGTFKLFDEVIFEPKGNVFA 829

Query:   762 LDTFEYG-C---SILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQ-LI 816
             L+  E   C   + L  S+ + +   + VGT Y L        GR+LVF V++ +   LI
Sbjct:   830 LEDGEVPECVTRAPLLDSYGEQAE-RFIVGTRY-LSGTGSGHGGRVLVFGVDESRSPYLI 887

Query:   817 AEKETKGAVYSLNAFNGKLLA-AINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQT 875
                 TK     +   +  LL  A+ + + L ++      + +            A+ V  
Sbjct:   888 HAHSTKSGCRRIATMDDDLLVIALTKTVVLVRYSETSTTSAKFLKVAAFQTSSYAVDVTV 947

Query:   876 RGDFIVVGDLMKSISLLIY 894
              G  I V D+MKSI+LL Y
Sbjct:   948 HGKLIAVADIMKSITLLEY 966

 Score = 320 (117.7 bits), Expect = 7.9e-63, Sum P(3) = 7.9e-63
 Identities = 106/390 (27%), Positives = 179/390 (45%)

Query:   226 LCGVLIIGEETIVYCSANAFK--AIPIRPSIT-KAYGRVDADGSRYXXXXXXXXXXXXVI 282
             L GV+++GE  ++Y    ++      ++ ++   A+ + D   + Y             I
Sbjct:   261 LGGVIVVGESRMLYIDDQSWTWTETALKNAMVFVAWAKFD--NTHYLLADDYGGLHLLTI 318

Query:   283 THEKEKVTG---LKIELLGETSIASTISYLD-NAVVYIGSSYGDSQLIKLNLQPDA-KG- 336
               ++   T    +    +G TS A+ + Y + N  +++ S YGDSQ   +NL  DA KG 
Sbjct:   319 QVKQNSDTAVDHMSTVQIGTTSRATKLVYSETNRTLFVASHYGDSQFYDVNLFADAAKGE 378

Query:   337 SYVEVLERYVNLGPIVDFCVVDL-ERQG-----------QGQVVTCSGAYKDGSLRIVRN 384
             S++E+ +   N+ PI+DF V+D+  R+G           Q ++VT SGA KDGSLR VR+
Sbjct:   379 SFLELRQTIENIAPILDFAVMDMGNREGDSQLGNEYSSGQARIVTASGAQKDGSLRSVRS 438

Query:   385 GIGINEQASV--ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXX 442
             G+G+ +   +  E+ G+ G++SL+S   D  DT LVVSF++ETR+   +           
Sbjct:   439 GVGLEDIGVITDEISGVTGLFSLKSYGSDVEDT-LVVSFLTETRVFRFDKQGEVEELSQL 497

Query:   443 -GFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQV 501
              G      TL         ++ VT     L  +      + W    G  +  A++N   V
Sbjct:   498 QGLDISQPTLLVLGLDNGHVLYVTEEKATLFDAEGGVTISSWSPTSGKPITHASSNGRWV 557

Query:   502 LLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVR 561
             LL+  G  LV L IG  +    +  + + +     I+ +  +P    + AVG W+  ++ 
Sbjct:   558 LLSVDGRKLVSLNIGLDLKVSAESEERDED----QISCVNASPHLLDVGAVGFWSSGTIS 613

Query:   562 IFSLPDLNLITKEHL----GGEIIPRSVLL 587
             I  L  L     E L       ++ R V+L
Sbjct:   614 IIDLKTLEATQTEKLRRNEDDAVVAREVVL 643

 Score = 237 (88.5 bits), Expect = 1.4e-79, Sum P(3) = 1.4e-79
 Identities = 55/195 (28%), Positives = 102/195 (52%)

Query:   897 EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGE 956
             ++  + E  RDY A W +AV  L+ D ++ A+ + NL  + +N+ G T E++ R+++  E
Sbjct:   993 KQAKLVEVCRDYQAMWSTAVSHLEGDSWIVADGDGNLVVLLRNTAGVTLEDKRRMQMTSE 1052

Query:   957 YHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTN 1015
             + LGE VN+ +    VM    ++   +      T  G I +  ++ P  Q L ++  Q N
Sbjct:  1053 FGLGECVNKIQK---VMVETSANAPIVAKAFLSTTEGSIYLFGTVAPKFQSLLMD-FQAN 1108

Query:  1016 LRKVIKG-VGGLNHEQWRSFNN-EKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1073
             +   +   +G L   QWRSF N E++    + FLDG+ +E FLD+      +I + ++ +
Sbjct:  1109 MEAHVSSPLGELQFNQWRSFRNPEREGAGPERFLDGEFLEMFLDMEENTQIDICQGLSYT 1168

Query:  1074 VEELCKRVEELTRLH 1088
              E++   + E+  +H
Sbjct:  1169 AEDMRNLIGEMKNMH 1183

 Score = 232 (86.7 bits), Expect = 1.4e-79, Sum P(3) = 1.4e-79
 Identities = 63/210 (30%), Positives = 104/210 (49%)

Query:     6 YVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQG-LQPMLDVPIYGRIAT 64
             Y+   H+ ++V H+      + +E +L++AK  R+EI   T +G L+      ++G+I  
Sbjct:     3 YIAPIHRSSSVRHALYIQLLAGEEPSLVLAKTNRLEIWRRTDEGQLKLEHSQSVFGKIVM 62

Query:    65 LELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITR-AMGDVSDRIGRPTDNGQIGII 123
             L+  RP     D LF+ T+R+K+   ++D ++ EL+TR A+ D+ ++  R   +    I+
Sbjct:    63 LQAVRPKDSETDMLFVGTDRFKYFTAEYDPDTRELVTRQAISDLGEQFVREVSSRNRCIV 122

Query:   124 DPDCRLIGLHLYDGLFKVIPFDN-KGQLKEAFNIRLE--------ELQVLDIKFLYG-CA 173
             DP  R + L L+ G+  V      KGQ K     RLE        EL + D  FL+   A
Sbjct:   123 DPSGRYMVLLLWSGIMHVWRLHKRKGQQKGQLQTRLELMDQARISELYIKDAVFLHSETA 182

Query:   174 KPTIVVLYQDNKDARHVK--TYEVALKDKD 201
              P I  LYQ   +    K  +Y +   D+D
Sbjct:   183 HPRIAFLYQPRPNEPDCKFASYRLCTDDRD 212

 Score = 60 (26.2 bits), Expect = 1.3e-15, Sum P(3) = 1.3e-15
 Identities = 20/62 (32%), Positives = 33/62 (53%)

Query:   140 KVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKD 199
             +V  FD +G+++E     L +LQ LDI      ++PT++VL  DN    +V   +  L D
Sbjct:   481 RVFRFDKQGEVEE-----LSQLQGLDI------SQPTLLVLGLDNGHVLYVTEEKATLFD 529

Query:   200 KD 201
              +
Sbjct:   530 AE 531

 Score = 49 (22.3 bits), Expect = 3.5e-43, Sum P(3) = 3.5e-43
 Identities = 13/45 (28%), Positives = 26/45 (57%)

Query:   584 SVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 628
             S  +   EG S+++ A GDG+L+  L N     L D++++ + ++
Sbjct:  1009 STAVSHLEGDSWIV-ADGDGNLVVLLRNTAGVTLEDKRRMQMTSE 1052

 Score = 45 (20.9 bits), Expect = 3.0e-14, Sum P(2) = 3.0e-14
 Identities = 13/48 (27%), Positives = 25/48 (52%)

Query:  1033 SFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKR 1080
             S   + +  D   F D    ESFL+L +T ++ I+  ++ +V ++  R
Sbjct:   358 SHYGDSQFYDVNLFADAAKGESFLELRQT-IENIAPILDFAVMDMGNR 404

 Score = 41 (19.5 bits), Expect = 2.5e-33, Sum P(3) = 2.5e-33
 Identities = 16/49 (32%), Positives = 22/49 (44%)

Query:   219 LIPVPPPLCGV--------LIIG-EETIVYCSANAFKAIPIRPSITKAY 258
             ++P P  LC +        LI G E  IVY +  A  A  + P  T A+
Sbjct:   693 VLPQPNGLCSIFVTCEHSSLIHGAERRIVYSAVTAHSAAYVCPFDTAAF 741


>UNIPROTKB|F5H6C5 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9606
            "Homo sapiens" [GO:0016055 "Wnt receptor signaling pathway"
            evidence=IEA] GO:GO:0016055 GO:GO:0003684 EMBL:AP003108
            HGNC:HGNC:2717 ChiTaRS:DDB1 EMBL:AP003037 IPI:IPI00909008
            ProteinModelPortal:F5H6C5 SMR:F5H6C5 Ensembl:ENST00000535967
            ArrayExpress:F5H6C5 Bgee:F5H6C5 Uniprot:F5H6C5
        Length = 272

 Score = 760 (272.6 bits), Expect = 2.3e-75, P = 2.3e-75
 Identities = 145/274 (52%), Positives = 193/274 (70%)

Query:   342 LERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKG 401
             +E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG
Sbjct:     1 METFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKG 60

Query:   402 MWSLRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQL 461
             +W LRS  +   D  LV+SF+ +TR+L +N           GF    QT FC +  + QL
Sbjct:    61 LWPLRSDPNRETDDTLVLSFVGQTRVLMLN-GEEVEETELMGFVDDQQTFFCGNVAHQQL 119

Query:   462 VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILT 521
             +Q+TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L 
Sbjct:   120 IQITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELR 178

Query:   522 EVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEII 581
             ++ H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEII
Sbjct:   179 QISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEII 238

Query:   582 PRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTG 615
             PRS+L+  FE   YLLCALGDG L  F LN++TG
Sbjct:   239 PRSILMTTFESSHYLLCALGDGALFYFGLNIETG 272


>UNIPROTKB|F5H581 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9606
            "Homo sapiens" [GO:0016055 "Wnt receptor signaling pathway"
            evidence=IEA] GO:GO:0016055 GO:GO:0003684 EMBL:AP003108
            HGNC:HGNC:2717 ChiTaRS:DDB1 EMBL:AP003037 IPI:IPI00909251
            ProteinModelPortal:F5H581 SMR:F5H581 Ensembl:ENST00000535147
            ArrayExpress:F5H581 Bgee:F5H581 Uniprot:F5H581
        Length = 267

 Score = 686 (246.5 bits), Expect = 1.8e-69, Sum P(2) = 1.8e-69
 Identities = 129/204 (63%), Positives = 161/204 (78%)

Query:   528 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLL 587
             +E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS+L+
Sbjct:     1 MEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRSILM 60

Query:   588 CAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAAS 647
               FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VFA S
Sbjct:    61 TTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFACS 120

Query:   648 DRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIR 707
             DRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKLHIR
Sbjct:   121 DRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKLHIR 180

Query:   708 SIPLGEHPRRICHQEQSRTFAICS 731
             ++PL E PR+IC+QE S+ F + S
Sbjct:   181 TVPLYESPRKICYQEVSQCFGVLS 204

 Score = 48 (22.0 bits), Expect = 1.8e-69, Sum P(2) = 1.8e-69
 Identities = 11/33 (33%), Positives = 20/33 (60%)

Query:   724 SRTFAICSLKNQ-SCAEESEMHFVRLLDDQTFE 755
             S+ F+  +  ++ S  EE E+H + ++D  TFE
Sbjct:   235 SKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFE 267


>UNIPROTKB|F5H775 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9606
            "Homo sapiens" [GO:0016055 "Wnt receptor signaling pathway"
            evidence=IEA] GO:GO:0016055 GO:GO:0003684 EMBL:AP003108
            HGNC:HGNC:2717 ChiTaRS:DDB1 EMBL:AP003037 IPI:IPI01015574
            ProteinModelPortal:F5H775 SMR:F5H775 Ensembl:ENST00000537877
            ArrayExpress:F5H775 Bgee:F5H775 Uniprot:F5H775
        Length = 240

 Score = 661 (237.7 bits), Expect = 3.4e-64, P = 3.4e-64
 Identities = 128/228 (56%), Positives = 166/228 (72%)

Query:   443 GFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVL 502
             GF    QT FC +  + QL+Q+TS SVRLVS   + L +EWK P   +++VA+ N+SQV+
Sbjct:    14 GFVDDQQTFFCGNVAHQQLIQITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVV 73

Query:   503 LATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRI 562
             +A G   L YL+I    L ++ H ++E+E++CLDI P+G++   S + A+G+WTDIS RI
Sbjct:    74 VAVGRA-LYYLQIHPQELRQISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARI 132

Query:   563 FSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKK 622
               LP   L+ KE LGGEIIPRS+L+  FE   YLLCALGDG L  F LN++TG L+DRKK
Sbjct:   133 LKLPSFELLHKEMLGGEIIPRSILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKK 192

Query:   623 VSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEV 670
             V+LGTQP  LRTF S +TT+VFA SDRPTVIYSSN KL++SNVNLKEV
Sbjct:   193 VTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEV 240


>UNIPROTKB|F5H0Y5 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9606
            "Homo sapiens" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0016055 "Wnt receptor
            signaling pathway" evidence=IEA] InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 GO:GO:0005634 GO:GO:0016055
            Gene3D:2.130.10.10 GO:GO:0003684 EMBL:AP003108 HGNC:HGNC:2717
            ChiTaRS:DDB1 EMBL:AP003037 IPI:IPI00909177
            ProteinModelPortal:F5H0Y5 SMR:F5H0Y5 Ensembl:ENST00000539332
            ArrayExpress:F5H0Y5 Bgee:F5H0Y5 Uniprot:F5H0Y5
        Length = 204

 Score = 626 (225.4 bits), Expect = 2.2e-60, P = 2.2e-60
 Identities = 125/207 (60%), Positives = 157/207 (75%)

Query:   791 VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 850
             V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++LY+W  
Sbjct:     2 VYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTT 61

Query:   851 RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 910
                  +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE ARD+N 
Sbjct:    62 E----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNP 117

Query:   911 NWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGS 970
             NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN F HGS
Sbjct:   118 NWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFCHGS 177

Query:   971 LVMR-LPDSDVGQIPTVIFGTVNGVIG 996
             LVM+ L ++      +V+FGTVNG+IG
Sbjct:   178 LVMQNLGETSTPTQGSVLFGTVNGMIG 204


>TAIR|locus:2100616 [details] [associations]
            symbol:SAP130a "spliceosome-associated protein 130 a"
            species:3702 "Arabidopsis thaliana" [GO:0003676 "nucleic acid
            binding" evidence=IEA;ISS] [GO:0005634 "nucleus"
            evidence=ISM;IEA;ISS] [GO:0009507 "chloroplast" evidence=IDA]
            [GO:0005829 "cytosol" evidence=RCA] [GO:0009555 "pollen
            development" evidence=IMP] [GO:0009846 "pollen germination"
            evidence=IMP] [GO:0048481 "ovule development" evidence=IMP]
            InterPro:IPR001680 InterPro:IPR004871 InterPro:IPR015943
            Pfam:PF03178 SMART:SM00320 GO:GO:0009506 GO:GO:0005634
            GO:GO:0009507 EMBL:CP002686 Gene3D:2.130.10.10 GO:GO:0009555
            GO:GO:0003676 EMBL:AL132954 GO:GO:0048481 GO:GO:0009846
            eggNOG:NOG247734 KO:K12830 HOGENOM:HOG000216677 OMA:FDTIPVA
            IPI:IPI00517026 PIR:T47659 RefSeq:NP_567015.1 RefSeq:NP_567016.1
            UniGene:At.28226 UniGene:At.72270 ProteinModelPortal:Q9LD60
            SMR:Q9LD60 STRING:Q9LD60 PaxDb:Q9LD60 PRIDE:Q9LD60
            EnsemblPlants:AT3G55200.1 EnsemblPlants:AT3G55220.1 GeneID:824686
            GeneID:824688 KEGG:ath:AT3G55200 KEGG:ath:AT3G55220
            KEGG:dosa:Os02t0137400-01 TAIR:At3g55200 TAIR:At3g55220
            InParanoid:Q9LD60 PhylomeDB:Q9LD60 ProtClustDB:CLSN2689171
            ArrayExpress:Q9LD60 Genevestigator:Q9LD60 Uniprot:Q9LD60
        Length = 1214

 Score = 345 (126.5 bits), Expect = 2.6e-58, Sum P(4) = 2.6e-58
 Identities = 116/431 (26%), Positives = 202/431 (46%)

Query:   331 QPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE 390
             QP    + V + ++  +L P++D  V+++  +   Q+ +  G     SLRI+R G+ I E
Sbjct:   386 QPRRLKNLVRI-DQVESLMPLMDMKVLNIFEEETPQIFSLCGRGPRSSLRILRPGLAITE 444

Query:   391 QASVELQGI-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQ 449
              A  +L G    +W+++ +  D FD ++VVSF + T  L +++          GF   T 
Sbjct:   445 MAVSQLPGQPSAVWTVKKNVSDEFDAYIVVSFTNAT--LVLSIGEQVEEVNDSGFLDTTP 502

Query:   450 TLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH 509
             +L       + L+QV    +R +    R   NEW++P   S+     N  QV++A  GG 
Sbjct:   503 SLAVSLIGDDSLMQVHPNGIRHIREDGRI--NEWRTPGKRSIVKVGYNRLQVVIALSGGE 560

Query:   510 LVYLEIG-DGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD 567
             L+Y E    G L EV+  ++  +++CLDI P+ E    S+  AVG + D +VRI SL PD
Sbjct:   561 LIYFEADMTGQLMEVEKHEMSGDVACLDIAPVPEGRKRSRFLAVGSY-DNTVRILSLDPD 619

Query:   568 --LNLITKEHLGGEIIPRSVLLCAFE-------GIS-----YLLCALGDGHLLNFLLNMK 613
               L +++ + +     P S+L    +       G       +L   L +G L   +++M 
Sbjct:   620 DCLQILSVQSVSSA--PESLLFLEVQASIGGDDGADHPANLFLNSGLQNGVLFRTVVDMV 677

Query:   614 TGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHM 673
             TG+L+D +   LG +P  L + S +  + +   S RP + Y        + ++ + +   
Sbjct:   678 TGQLSDSRSRFLGLKPPKLFSISVRGRSAMLCLSSRPWLGYIHRGHFHLTPLSYETLEFA 737

Query:   674 CPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRR-ICHQEQSRTFAICS 731
              PF+S    + +       L I  ID + +  +   +PL   PR+ + H ++     I S
Sbjct:   738 APFSSDQCAEGVVSVAGDALRIFMIDRLGETFNETVVPLRYTPRKFVLHPKRKLLVIIES 797

Query:   732 LKNQSCAEESE 742
              +    AEE E
Sbjct:   798 DQGAFTAEERE 808

 Score = 241 (89.9 bits), Expect = 2.6e-58, Sum P(4) = 2.6e-58
 Identities = 100/369 (27%), Positives = 172/369 (46%)

Query:   740 ESE--MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVL---P 793
             ESE  +  +R+LD +T        L   E   S+ + +F D +      VGT   +   P
Sbjct:   855 ESEKWVSCIRVLDPKTATTTCLLELQDNEAAYSVCTVNFHDKEYGTLLAVGTVKGMQFWP 914

Query:   794 EENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLR 851
             ++N    G I ++  VEDGK L+L+ + + +G   +L  F G+LLA I   ++LY     
Sbjct:   915 KKNL-VAGFIHIYRFVEDGKSLELLHKTQVEGVPLALCQFQGRLLAGIGPVLRLY----- 968

Query:   852 DDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 910
             D G + L  +C +      +  +QT  D I VGD+ +S     Y+ +E  +   A D   
Sbjct:   969 DLGKKRLLRKCENKLFPNTIISIQTYRDRIYVGDIQESFHYCKYRRDENQLYIFADDCVP 1028

Query:   911 NWMSAVEILDDDIYLGAENNFNLFTVRK----NSEGATDEERGRLE-----VVGEYHLGE 961
              W++A   +D D   GA+   N++ VR     + E   D   G+++     + G  +  +
Sbjct:  1029 RWLTASHHVDFDTMAGADKFGNVYFVRLPQDLSEEIEEDPTGGKIKWEQGKLNGAPNKVD 1088

Query:   962 FVNRFRHGSLVMRLPDSDV--GQIPTVIFGTVNGVIGVI-ASLPHEQYLFLEKLQTNLRK 1018
              + +F  G +V  L  + +  G   ++++GTV G IG + A    +   F   L+ ++R+
Sbjct:  1089 EIVQFHVGDVVTCLQKASMIPGGSESIMYGTVMGSIGALHAFTSRDDVDFFSHLEMHMRQ 1148

Query:  1019 VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSV 1074
                 + G +H  +RS          K+ +DGDL E F    +DL R   DE+ +T     
Sbjct:  1149 EYPPLCGRDHMAYRS-----AYFPVKDVIDGDLCEQFPTLPMDLQRKIADELDRTP---- 1199

Query:  1075 EELCKRVEE 1083
              E+ K++E+
Sbjct:  1200 AEILKKLED 1208

 Score = 104 (41.7 bits), Expect = 2.6e-58, Sum P(4) = 2.6e-58
 Identities = 42/181 (23%), Positives = 75/181 (41%)

Query:   188 RHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPL---CGVLIIGEETIVYCSA-- 242
             +H+  YE+ L   + V   WS N +DNGA++L+ VP       GVL+  E  ++Y +   
Sbjct:   205 KHLTFYELDL-GLNHVSRKWS-NPVDNGANMLVTVPGGADGPSGVLVCAENFVIYMNQGH 262

Query:   243 -NAFKAIPIRPSITKAYG------RVDADGSRYXXXXXXXXXXXXVIT--HEKEKVTGLK 293
              +    IP R  +    G       V    + +             +T  H  + V+ LK
Sbjct:   263 PDVRAVIPRRTDLPAERGVLVVSAAVHKQKTMFFFLIQTEYGDVFKVTLDHNGDHVSELK 322

Query:   294 IELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNL---QPDAKGSYVEVLERYVNLGP 350
             ++      +AS+I  L    ++  S +G+  L +      +PD + S   ++E      P
Sbjct:   323 VKYFDTIPVASSICVLKLGFLFSASEFGNHGLYQFQAIGEEPDVESSSSNLMETEEGFQP 382

Query:   351 I 351
             +
Sbjct:   383 V 383

 Score = 102 (41.0 bits), Expect = 2.6e-58, Sum P(4) = 2.6e-58
 Identities = 53/238 (22%), Positives = 101/238 (42%)

Query:     6 YVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQ--G-LQPMLDVPIYGRI 62
             Y +T  + T +  +  GNF+  +   + +A+  +I + LL P   G +Q +  V ++G I
Sbjct:     4 YSLTLQQATGIVCAINGNFSGGKTQEIAVAR-GKI-LDLLRPDENGKIQTIHSVEVFGAI 61

Query:    63 ATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDNGQIG 121
              +L  FR  G  +D++ + ++  +  +L+++ E + +  +   +   + G R    GQ  
Sbjct:    62 RSLAQFRLTGAQKDYIVVGSDSGRIVILEYNKEKN-VFDKVHQETFGKSGCRRIVPGQYV 120

Query:   122 IIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL-YGCA--KPT 176
              +DP  R  +IG      L  V+  D   +L  +  +   +   +        C    P 
Sbjct:   121 AVDPKGRAVMIGACEKQKLVYVLNRDTTARLTISSPLEAHKSHTICYSLCGVDCGFDNPI 180

Query:   177 IVVLYQDNKDARHVKTYEVAL---KDKDFVEGPWSQNNL--------DNGADLLIPVP 223
                +  D  +A    T + A    K   F E     N++        DNGA++L+ VP
Sbjct:   181 FAAIELDYSEADQDPTGQAASEAQKHLTFYELDLGLNHVSRKWSNPVDNGANMLVTVP 238

 Score = 41 (19.5 bits), Expect = 9.3e-15, Sum P(2) = 9.3e-15
 Identities = 23/79 (29%), Positives = 35/79 (44%)

Query:   526 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISV----RIFSLPDLN----LITKEHLG 577
             A +E + S  D +P G+  S +Q        D+ +    R +S P  N    L+T    G
Sbjct:   182 AAIELDYSEADQDPTGQAASEAQKHLTFYELDLGLNHVSRKWSNPVDNGANMLVTVP--G 239

Query:   578 GEIIPRSVLLCAFEGISYL 596
             G   P  VL+CA   + Y+
Sbjct:   240 GADGPSGVLVCAENFVIYM 258

 Score = 39 (18.8 bits), Expect = 1.1e-05, Sum P(3) = 1.1e-05
 Identities = 14/53 (26%), Positives = 22/53 (41%)

Query:   369 TCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSF 421
             T +GA K G++  VR    ++E+   +  G K  W        P     +V F
Sbjct:  1041 TMAGADKFGNVYFVRLPQDLSEEIEEDPTGGKIKWEQGKLNGAPNKVDEIVQF 1093


>TAIR|locus:2100646 [details] [associations]
            symbol:SAP130b "spliceosome-associated protein 130 b"
            species:3702 "Arabidopsis thaliana" [GO:0003676 "nucleic acid
            binding" evidence=IEA;ISS] [GO:0005634 "nucleus"
            evidence=ISM;IEA;ISS] [GO:0005829 "cytosol" evidence=RCA]
            [GO:0009506 "plasmodesma" evidence=IDA] [GO:0009555 "pollen
            development" evidence=IMP] [GO:0009846 "pollen germination"
            evidence=IMP] [GO:0048481 "ovule development" evidence=IMP]
            InterPro:IPR001680 InterPro:IPR004871 InterPro:IPR015943
            Pfam:PF03178 SMART:SM00320 GO:GO:0009506 GO:GO:0005634
            GO:GO:0009507 EMBL:CP002686 Gene3D:2.130.10.10 GO:GO:0009555
            GO:GO:0003676 EMBL:AL132954 GO:GO:0048481 GO:GO:0009846
            eggNOG:NOG247734 KO:K12830 HOGENOM:HOG000216677 OMA:FDTIPVA
            IPI:IPI00517026 PIR:T47659 RefSeq:NP_567015.1 RefSeq:NP_567016.1
            UniGene:At.28226 UniGene:At.72270 ProteinModelPortal:Q9LD60
            SMR:Q9LD60 STRING:Q9LD60 PaxDb:Q9LD60 PRIDE:Q9LD60
            EnsemblPlants:AT3G55200.1 EnsemblPlants:AT3G55220.1 GeneID:824686
            GeneID:824688 KEGG:ath:AT3G55200 KEGG:ath:AT3G55220
            KEGG:dosa:Os02t0137400-01 TAIR:At3g55200 TAIR:At3g55220
            InParanoid:Q9LD60 PhylomeDB:Q9LD60 ProtClustDB:CLSN2689171
            ArrayExpress:Q9LD60 Genevestigator:Q9LD60 Uniprot:Q9LD60
        Length = 1214

 Score = 345 (126.5 bits), Expect = 2.6e-58, Sum P(4) = 2.6e-58
 Identities = 116/431 (26%), Positives = 202/431 (46%)

Query:   331 QPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE 390
             QP    + V + ++  +L P++D  V+++  +   Q+ +  G     SLRI+R G+ I E
Sbjct:   386 QPRRLKNLVRI-DQVESLMPLMDMKVLNIFEEETPQIFSLCGRGPRSSLRILRPGLAITE 444

Query:   391 QASVELQGI-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQ 449
              A  +L G    +W+++ +  D FD ++VVSF + T  L +++          GF   T 
Sbjct:   445 MAVSQLPGQPSAVWTVKKNVSDEFDAYIVVSFTNAT--LVLSIGEQVEEVNDSGFLDTTP 502

Query:   450 TLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH 509
             +L       + L+QV    +R +    R   NEW++P   S+     N  QV++A  GG 
Sbjct:   503 SLAVSLIGDDSLMQVHPNGIRHIREDGRI--NEWRTPGKRSIVKVGYNRLQVVIALSGGE 560

Query:   510 LVYLEIG-DGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD 567
             L+Y E    G L EV+  ++  +++CLDI P+ E    S+  AVG + D +VRI SL PD
Sbjct:   561 LIYFEADMTGQLMEVEKHEMSGDVACLDIAPVPEGRKRSRFLAVGSY-DNTVRILSLDPD 619

Query:   568 --LNLITKEHLGGEIIPRSVLLCAFE-------GIS-----YLLCALGDGHLLNFLLNMK 613
               L +++ + +     P S+L    +       G       +L   L +G L   +++M 
Sbjct:   620 DCLQILSVQSVSSA--PESLLFLEVQASIGGDDGADHPANLFLNSGLQNGVLFRTVVDMV 677

Query:   614 TGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHM 673
             TG+L+D +   LG +P  L + S +  + +   S RP + Y        + ++ + +   
Sbjct:   678 TGQLSDSRSRFLGLKPPKLFSISVRGRSAMLCLSSRPWLGYIHRGHFHLTPLSYETLEFA 737

Query:   674 CPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRR-ICHQEQSRTFAICS 731
              PF+S    + +       L I  ID + +  +   +PL   PR+ + H ++     I S
Sbjct:   738 APFSSDQCAEGVVSVAGDALRIFMIDRLGETFNETVVPLRYTPRKFVLHPKRKLLVIIES 797

Query:   732 LKNQSCAEESE 742
              +    AEE E
Sbjct:   798 DQGAFTAEERE 808

 Score = 241 (89.9 bits), Expect = 2.6e-58, Sum P(4) = 2.6e-58
 Identities = 100/369 (27%), Positives = 172/369 (46%)

Query:   740 ESE--MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVL---P 793
             ESE  +  +R+LD +T        L   E   S+ + +F D +      VGT   +   P
Sbjct:   855 ESEKWVSCIRVLDPKTATTTCLLELQDNEAAYSVCTVNFHDKEYGTLLAVGTVKGMQFWP 914

Query:   794 EENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLR 851
             ++N    G I ++  VEDGK L+L+ + + +G   +L  F G+LLA I   ++LY     
Sbjct:   915 KKNL-VAGFIHIYRFVEDGKSLELLHKTQVEGVPLALCQFQGRLLAGIGPVLRLY----- 968

Query:   852 DDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 910
             D G + L  +C +      +  +QT  D I VGD+ +S     Y+ +E  +   A D   
Sbjct:   969 DLGKKRLLRKCENKLFPNTIISIQTYRDRIYVGDIQESFHYCKYRRDENQLYIFADDCVP 1028

Query:   911 NWMSAVEILDDDIYLGAENNFNLFTVRK----NSEGATDEERGRLE-----VVGEYHLGE 961
              W++A   +D D   GA+   N++ VR     + E   D   G+++     + G  +  +
Sbjct:  1029 RWLTASHHVDFDTMAGADKFGNVYFVRLPQDLSEEIEEDPTGGKIKWEQGKLNGAPNKVD 1088

Query:   962 FVNRFRHGSLVMRLPDSDV--GQIPTVIFGTVNGVIGVI-ASLPHEQYLFLEKLQTNLRK 1018
              + +F  G +V  L  + +  G   ++++GTV G IG + A    +   F   L+ ++R+
Sbjct:  1089 EIVQFHVGDVVTCLQKASMIPGGSESIMYGTVMGSIGALHAFTSRDDVDFFSHLEMHMRQ 1148

Query:  1019 VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSV 1074
                 + G +H  +RS          K+ +DGDL E F    +DL R   DE+ +T     
Sbjct:  1149 EYPPLCGRDHMAYRS-----AYFPVKDVIDGDLCEQFPTLPMDLQRKIADELDRTP---- 1199

Query:  1075 EELCKRVEE 1083
              E+ K++E+
Sbjct:  1200 AEILKKLED 1208

 Score = 104 (41.7 bits), Expect = 2.6e-58, Sum P(4) = 2.6e-58
 Identities = 42/181 (23%), Positives = 75/181 (41%)

Query:   188 RHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPL---CGVLIIGEETIVYCSA-- 242
             +H+  YE+ L   + V   WS N +DNGA++L+ VP       GVL+  E  ++Y +   
Sbjct:   205 KHLTFYELDL-GLNHVSRKWS-NPVDNGANMLVTVPGGADGPSGVLVCAENFVIYMNQGH 262

Query:   243 -NAFKAIPIRPSITKAYG------RVDADGSRYXXXXXXXXXXXXVIT--HEKEKVTGLK 293
              +    IP R  +    G       V    + +             +T  H  + V+ LK
Sbjct:   263 PDVRAVIPRRTDLPAERGVLVVSAAVHKQKTMFFFLIQTEYGDVFKVTLDHNGDHVSELK 322

Query:   294 IELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNL---QPDAKGSYVEVLERYVNLGP 350
             ++      +AS+I  L    ++  S +G+  L +      +PD + S   ++E      P
Sbjct:   323 VKYFDTIPVASSICVLKLGFLFSASEFGNHGLYQFQAIGEEPDVESSSSNLMETEEGFQP 382

Query:   351 I 351
             +
Sbjct:   383 V 383

 Score = 102 (41.0 bits), Expect = 2.6e-58, Sum P(4) = 2.6e-58
 Identities = 53/238 (22%), Positives = 101/238 (42%)

Query:     6 YVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQ--G-LQPMLDVPIYGRI 62
             Y +T  + T +  +  GNF+  +   + +A+  +I + LL P   G +Q +  V ++G I
Sbjct:     4 YSLTLQQATGIVCAINGNFSGGKTQEIAVAR-GKI-LDLLRPDENGKIQTIHSVEVFGAI 61

Query:    63 ATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDNGQIG 121
              +L  FR  G  +D++ + ++  +  +L+++ E + +  +   +   + G R    GQ  
Sbjct:    62 RSLAQFRLTGAQKDYIVVGSDSGRIVILEYNKEKN-VFDKVHQETFGKSGCRRIVPGQYV 120

Query:   122 IIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL-YGCA--KPT 176
              +DP  R  +IG      L  V+  D   +L  +  +   +   +        C    P 
Sbjct:   121 AVDPKGRAVMIGACEKQKLVYVLNRDTTARLTISSPLEAHKSHTICYSLCGVDCGFDNPI 180

Query:   177 IVVLYQDNKDARHVKTYEVAL---KDKDFVEGPWSQNNL--------DNGADLLIPVP 223
                +  D  +A    T + A    K   F E     N++        DNGA++L+ VP
Sbjct:   181 FAAIELDYSEADQDPTGQAASEAQKHLTFYELDLGLNHVSRKWSNPVDNGANMLVTVP 238

 Score = 41 (19.5 bits), Expect = 9.3e-15, Sum P(2) = 9.3e-15
 Identities = 23/79 (29%), Positives = 35/79 (44%)

Query:   526 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISV----RIFSLPDLN----LITKEHLG 577
             A +E + S  D +P G+  S +Q        D+ +    R +S P  N    L+T    G
Sbjct:   182 AAIELDYSEADQDPTGQAASEAQKHLTFYELDLGLNHVSRKWSNPVDNGANMLVTVP--G 239

Query:   578 GEIIPRSVLLCAFEGISYL 596
             G   P  VL+CA   + Y+
Sbjct:   240 GADGPSGVLVCAENFVIYM 258

 Score = 39 (18.8 bits), Expect = 1.1e-05, Sum P(3) = 1.1e-05
 Identities = 14/53 (26%), Positives = 22/53 (41%)

Query:   369 TCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSF 421
             T +GA K G++  VR    ++E+   +  G K  W        P     +V F
Sbjct:  1041 TMAGADKFGNVYFVRLPQDLSEEIEEDPTGGKIKWEQGKLNGAPNKVDEIVQF 1093


>POMBASE|SPAC17H9.10c [details] [associations]
            symbol:ddb1 "damaged DNA binding protein Ddb1"
            species:4896 "Schizosaccharomyces pombe" [GO:0003677 "DNA binding"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IDA] [GO:0005730
            "nucleolus" evidence=IDA] [GO:0005829 "cytosol" evidence=IDA]
            [GO:0006279 "premeiotic DNA replication" evidence=TAS] [GO:0006282
            "regulation of DNA repair" evidence=IMP] [GO:0006283
            "transcription-coupled nucleotide-excision repair" evidence=IMP]
            [GO:0006974 "response to DNA damage stimulus" evidence=IMP]
            [GO:0007090 "regulation of S phase of mitotic cell cycle"
            evidence=IMP] [GO:0034644 "cellular response to UV" evidence=IMP]
            [GO:0040020 "regulation of meiosis" evidence=IGI] [GO:0042787
            "protein ubiquitination involved in ubiquitin-dependent protein
            catabolic process" evidence=IMP] [GO:0051445 "regulation of meiotic
            cell cycle" evidence=IGI] [GO:0070912 "Ddb1-Ckn1 complex"
            evidence=IDA] [GO:0070913 "Ddb1-Wdr21 complex" evidence=IDA]
            [GO:0008180 "signalosome" evidence=IDA] [GO:0031465 "Cul4B-RING
            ubiquitin ligase complex" evidence=IDA] InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 UniPathway:UPA00143
            PomBase:SPAC17H9.10c GO:GO:0005829 EMBL:CU329670 GO:GO:0005730
            GenomeReviews:CU329670_GR Gene3D:2.130.10.10 GO:GO:0003677
            GO:GO:0007049 InterPro:IPR011047 SUPFAM:SSF50998 GO:GO:0034644
            GO:GO:0040020 GO:GO:0042787 GO:GO:0007090 GO:GO:0006283
            GO:GO:0006282 GO:GO:0006279 GO:GO:0070912 eggNOG:NOG247734
            KO:K10610 OMA:CALGDGS PIR:T37876 RefSeq:NP_593580.1 IntAct:O13807
            STRING:O13807 EnsemblFungi:SPAC17H9.10c.1 GeneID:2542207
            KEGG:spo:SPAC17H9.10c OrthoDB:EOG473T0C NextBio:20803277
            GO:GO:0070913 Uniprot:O13807
        Length = 1072

 Score = 615 (221.5 bits), Expect = 8.3e-57, P = 8.3e-57
 Identities = 181/692 (26%), Positives = 351/692 (50%)

Query:   413 FDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIY--NQLVQVTSGSVR 470
             +D ++ +S I ETR + ++             C ++ T+F    IY  +Q++Q+T+  +R
Sbjct:   418 YDNYIFLSLICETRAIIVSPEGVFSANHDLS-CEES-TIFV-STIYGNSQILQITTKEIR 474

Query:   471 LVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY 530
             L     ++L + W SP   S+   ++ A  V +A  GG +++ E   GI TEV   Q + 
Sbjct:   475 LFDG--KKLHS-WISP--MSITCGSSFADNVCVAVAGGLILFFE---GI-TEVGRYQCDT 525

Query:   531 EISCLDINPIGENPSYSQIAAVGMWT-DISVRIFSLPDLNLITKEHLGGEIIPRSVLLCA 589
             E+S L      EN  Y     VG+W+ DI +  +    ++L     L    IPRS++   
Sbjct:   526 EVSSLCFTE--ENVVY-----VGLWSADIIMLTYCQDGISLTHSLKLTD--IPRSIVYSQ 576

Query:   590 FEGIS--YLLCALGDGHLLNFLLNMKTGELTDR--KKVSLGTQPITLRTFSSKNTTHVFA 645
               G     L  +  +G++L F  N + G++ +   ++  LG  PI L+ F SK    +FA
Sbjct:   577 KYGDDGGTLYVSTNNGYVLMF--NFQNGQVIEHSLRRNQLGVAPIILKHFDSKEKNAIFA 634

Query:   646 ASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLH 705
               ++P ++Y  + KL+ + ++  E+ ++  + + +   ++       +++  + +I+ L+
Sbjct:   635 LGEKPQLMYYESDKLVITPLSCTEMLNISSYVNPSLGVNMLYCTNSYISLAKMSEIRSLN 694

Query:   706 IRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESE--MHFVRLLDDQTFEFISTYPLD 763
             ++++ +   PRRIC       F +C    +S   + +  + F+R+ +  T   I+ +  +
Sbjct:   695 VQTVSVKGFPRRICSNSLFY-FVLCMQLEESIGTQEQRLLSFLRVYEKNTLSEIAHHKFN 753

Query:   764 TFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVF-IVEDGKLQLIAEKETK 822
              +E   SI+    +DD  V   VGT +  P+++ P  GR++VF +  D  +++ AE + +
Sbjct:   754 EYEMVESIIL--MNDDKRVV--VGTGFNFPDQDAPDSGRLMVFEMTSDNNIEMQAEHKVQ 809

Query:   823 GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVV 882
             G+V +L  +   ++A IN  + ++++   + GT  +++      + + + V    D I+ 
Sbjct:   810 GSVNTLVLYKHLIVAGINASVCIFEY---EHGTMHVRNSIRTPTYTIDISVNQ--DEIIA 864

Query:   883 GDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEG 942
              DLMKSI++L +  ++  + E ARDY+  W ++VEIL +  Y   E + N   + +++  
Sbjct:   865 ADLMKSITVLQFIDDQ--LIEVARDYHPLWATSVEILSERKYFVTEADGNAVILLRDNVS 922

Query:   943 ATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLP 1002
                 +R +L    +++LGE +N+ RH + +     S V   P ++  TV+G + ++    
Sbjct:   923 PQLSDRKKLRWYKKFYLGELINKTRHCTFIEPQDKSLV--TPQLLCATVDGSLMIVGDAG 980

Query:  1003 HEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTR 1062
                   L +LQ N+RKVI   GGL+H++W+ +  E +T    + +DG LIES L L    
Sbjct:   981 MSNTPLLLQLQDNIRKVIPSFGGLSHKEWKEYRGENET-SPSDLIDGSLIESILGLREPI 1039

Query:  1063 MDEI------SKTMNVSVEELCKRVEELTRLH 1088
             ++EI         +++SV++L   +E L +LH
Sbjct:  1040 LNEIVNGGHEGTKLDISVQDLKSIIENLEKLH 1071

 Score = 605 (218.0 bits), Expect = 1.1e-55, P = 1.1e-55
 Identities = 165/553 (29%), Positives = 292/553 (52%)

Query:     6 YVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYGRIATL 65
             YV   HKP+++ ++    F +    N+I+AK   +E++      L  +    I+ +I  +
Sbjct:     3 YVTYLHKPSSIRNAVFCKFVNASSWNVIVAKVNCLEVYSYENNRLCLITSANIFAKIVNV 62

Query:    66 ELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRA-MGDVSDRIGRPTDNGQIGIID 124
             + F+P     D + +AT+ +++  L WDA  + +     + D S+R  R + +G + ++D
Sbjct:    63 KAFKPVSSPTDHIIVATDSFRYFTLFWDANDNTVSNGIKIQDCSERSLRESQSGPLLLVD 122

Query:   125 PDCRLIGLHLYDGLFKVIPF----------DNKGQLKEAFNIRLEELQVLDIKFLYGCAK 174
             P  R+I LH+Y GL  +IP            N   L + F++R++EL V+DI  LY  ++
Sbjct:   123 PFQRVICLHVYQGLLTIIPIFKSKKRFMTSHNNPSLHDNFSVRIQELNVVDIAMLYNSSR 182

Query:   175 PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGE 234
             P++ VLY+D+K   H+ TY++ +++++  E     ++++ G   LIP      GV + GE
Sbjct:   183 PSLAVLYKDSKSIVHLSTYKINVREQEIDEDDVVCHDIEEGK--LIPSENG--GVFVFGE 238

Query:   235 ETIVYCSAN--AFKAI---PIR---PSITKAYGRVDADGSRYXXXXXXXXXXXXVITHEK 286
               + Y S +    K +   PI    PSI+        D S Y                  
Sbjct:   239 MYVYYISKDIQVSKLLLTYPITAFSPSISND-PETGLDSSIYIVADESGMLYKFKALFTD 297

Query:   287 EKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPD-AKGSY-VEVLER 344
             E V+ +++E LGE+SIAS +  L +  +++GS + +S L++L   P   K ++ +E+L+ 
Sbjct:   298 ETVS-MELEKLGESSIASCLIALPDNHLFVGSHFNNSVLLQL---PSITKNNHKLEILQN 353

Query:   345 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 404
             +VN+ PI DF ++D ++ G   ++TCSGAYKDG+LRI+RN I I   A +E++GIK  +S
Sbjct:   354 FVNIAPISDF-IIDDDQTGSS-IITCSGAYKDGTLRIIRNSINIENVALIEMEGIKDFFS 411

Query:   405 LRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIY--NQLV 462
             +    +  +D ++ +S I ETR + ++             C ++ T+F    IY  +Q++
Sbjct:   412 VSFRAN--YDNYIFLSLICETRAIIVSPEGVFSANHDLS-CEES-TIFV-STIYGNSQIL 466

Query:   463 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 522
             Q+T+  +RL     ++L + W SP   S+   ++ A  V +A  GG +++ E   GI TE
Sbjct:   467 QITTKEIRLFDG--KKLHS-WISP--MSITCGSSFADNVCVAVAGGLILFFE---GI-TE 517

Query:   523 VKHAQLEYEISCL 535
             V   Q + E+S L
Sbjct:   518 VGRYQCDTEVSSL 530


>DICTYBASE|DDB_G0282569 [details] [associations]
            symbol:sf3b3 "splicing factor 3B subunit 3"
            species:44689 "Dictyostelium discoideum" [GO:0005634 "nucleus"
            evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0030532 "small nuclear ribonucleoprotein complex" evidence=ISS]
            [GO:0008380 "RNA splicing" evidence=IEA;ISS] [GO:0006461 "protein
            complex assembly" evidence=ISS] [GO:0005681 "spliceosomal complex"
            evidence=IEA;ISS] [GO:0006397 "mRNA processing" evidence=IEA]
            InterPro:IPR017986 InterPro:IPR004871 InterPro:IPR015943
            Pfam:PF03178 dictyBase:DDB_G0282569 GO:GO:0006461 GO:GO:0008380
            Gene3D:2.130.10.10 SUPFAM:SSF50978 EMBL:AAFI02000047
            GenomeReviews:CM000152_GR GO:GO:0006397 GO:GO:0005681 GO:GO:0003676
            GO:GO:0030532 eggNOG:NOG247734 KO:K12830 OMA:FDTIPVA
            RefSeq:XP_640132.1 STRING:Q54SA7 EnsemblProtists:DDB0233171
            GeneID:8623669 KEGG:ddi:DDB_G0282569 ProtClustDB:CLSZ2729005
            Uniprot:Q54SA7
        Length = 1256

 Score = 326 (119.8 bits), Expect = 1.2e-54, Sum P(4) = 1.2e-54
 Identities = 127/493 (25%), Positives = 223/493 (45%)

Query:   623 VSLGTQPI-TLRTFSSKNTTH-VFAASDRPTVIYSSNKKL-LYSNVNLKEVSHMCPFNSA 679
             V L  +P+      SS+ +   + A S+   +I+S +K   L++   +K   +  P    
Sbjct:   770 VPLSIEPLENASNLSSEQSAESIVATSENKIIIFSIDKLGDLFNQETIK--LNATPKRFI 827

Query:   680 AFPD-SLAIAKEGELTIGTID-DIQKLHIRSIPLGEHPRRICHQE-------QSRTFAIC 730
               P  S  I  E E    T + DI K++ +S  L    ++   QE       Q+    I 
Sbjct:   828 IHPQTSYIIILETETNYNTDNIDIDKINEQSEKLLLEKQKELQQEMDIDDDDQNNNNEIE 887

Query:   731 SLKN---QSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVG 787
               K        +     +++++D  T E + +  L+  E G S+ +CSF +   ++  VG
Sbjct:   888 PFKKLFKPKAGKGKWKSYIKIMDPITHESLESLMLEDGEAGFSVCTCSFGESGEIFLVVG 947

Query:   788 --TAYVL-PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 844
               T  VL P+ ++     +  FI    KL+L+ + E +  VY++  F GKL+  + + I+
Sbjct:   948 CVTDMVLNPKSHKSAHLNLYRFIDGGKKLELLYKTEVEEPVYAMAQFQGKLVCGVGKSIR 1007

Query:   845 LYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 903
             +Y     D G ++L  +C        +  + + GD +VVGD+ +SI  + YK  E  +  
Sbjct:  1008 IY-----DMGKKKLLRKCETKNLPNTIVNIHSLGDRLVVGDIQESIHFIKYKRSENMLYV 1062

Query:   904 RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN---SEGATDEERG---RLE---VV 954
              A D    WM++  +LD D   GA+   N+F +R     S+   ++  G   + E   + 
Sbjct:  1063 FADDLAPRWMTSSVMLDYDTVAGADKFGNIFVLRLPLLISDEVEEDPTGTKLKFESGTLN 1122

Query:   955 GEYHLGEFVNRFRHGSLVMRLPDSD--VGQIPTVIFGTVNGVIG-VIASLPHEQYLFLEK 1011
             G  H  + +  F  G  V  L  +   VG    +++ T++G IG +I     E   F   
Sbjct:  1123 GAPHKLDHIANFFVGDTVTTLNKTSLVVGGPEVILYTTISGAIGALIPFTSREDVDFFST 1182

Query:  1012 LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1071
             L+ N+R     + G +H  +RS+         KN +DGDL E F  L+  +   IS+ ++
Sbjct:  1183 LEMNMRSDCLPLCGRDHLAYRSYY-----FPVKNIIDGDLCEQFSTLNYQKQLSISEELS 1237

Query:  1072 VSVEELCKRVEEL 1084
              S  E+ K++EE+
Sbjct:  1238 RSPSEVIKKLEEI 1250

 Score = 270 (100.1 bits), Expect = 1.2e-54, Sum P(4) = 1.2e-54
 Identities = 67/226 (29%), Positives = 118/226 (52%)

Query:   347 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 405
             +L PI+DF V+DL R+   Q+ +  G   + SL+++R+G+ +    +  L G+  G+W++
Sbjct:   416 SLSPIIDFKVLDLVREENPQLYSLCGTGLNSSLKVLRHGLSVTTITTANLPGVPSGIWTV 475

Query:   406 RSSTD----DPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQL 461
               ST     D  D ++VVSF+  T +L++            G    T TL       + +
Sbjct:   476 PKSTSPNAIDQTDKYIVVSFVGTTSVLSVG--DTIQENHESGILETTTTLLVKSMGDDAI 533

Query:   462 VQVTSGSVRLVSSTSRELR-NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGI- 519
             +QV     R + S   +LR NEW++P   ++  A+AN SQ+ +A  GG ++Y E+     
Sbjct:   534 IQVFPTGFRHIKS---DLRINEWRAPGRKTIVRASANQSQLAIALSGGEIIYFELDQASN 590

Query:   520 LTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL 565
             L E+    L  +I+C++I+PI +  + ++  AV  W    +R+ SL
Sbjct:   591 LIEIIKKDLRRDIACIEISPIPKGRNMARFIAVSDWEG-PIRVLSL 635

 Score = 104 (41.7 bits), Expect = 1.2e-54, Sum P(4) = 1.2e-54
 Identities = 31/149 (20%), Positives = 72/149 (48%)

Query:     6 YVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQG-LQPMLDVPIYGRIAT 64
             Y +T  +PT+V  S  GNF+  +++ +++     +E+      G +Q +L   ++G + +
Sbjct:     4 YNLTLQRPTSVYQSISGNFSGTKQVEIVLNHGRSLELIRYDENGKMQSVLYTEVFGIVRS 63

Query:    65 LELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDNGQIGII 123
             +  FR     +D++ + ++  +  +L+++++ ++   +   +   R G R    GQ   +
Sbjct:    64 IIPFRLTSGTKDYIIVGSDSGRVVILEYNSQKNQF-DKIHQETFGRSGCRRIVPGQYLAV 122

Query:   124 DPDCR--LIGLHLYDGLFKVIPFDNKGQL 150
             DP  R  +IG      L  ++  D+   L
Sbjct:   123 DPKGRAFMIGAIEKQKLVYILNRDSSANL 151

 Score = 104 (41.7 bits), Expect = 2.2e-31, Sum P(4) = 2.2e-31
 Identities = 34/135 (25%), Positives = 64/135 (47%)

Query:   592 GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPT 651
             G  +L   L +G +    L+  TGEL+D +   LG +P+ L     + +  + A S R  
Sbjct:   699 GSLFLFVGLKNGVVKRATLDSVTGELSDIRTRLLGRKPVKLFKVKVRGSNAMLALSSRVW 758

Query:   652 VIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL-HIRSIP 710
             + Y +  KL    ++++ + +    +S    +S+    E ++ I +ID +  L +  +I 
Sbjct:   759 LNYINQGKLDIVPLSIEPLENASNLSSEQSAESIVATSENKIIIFSIDKLGDLFNQETIK 818

Query:   711 LGEHPRR-ICHQEQS 724
             L   P+R I H + S
Sbjct:   819 LNATPKRFIIHPQTS 833

 Score = 59 (25.8 bits), Expect = 1.2e-54, Sum P(4) = 1.2e-54
 Identities = 31/147 (21%), Positives = 64/147 (43%)

Query:   193 YEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPL---CGVLIIGEETIVYCS---ANAFK 246
             YE+ L   + V   WS + +D+ A++++ VP       GVL+  E+ IVY +   A    
Sbjct:   221 YELDLGLNNVVR-KWS-DQVDDSANIVMTVPGGTEGPGGVLVASEDYIVYRNQDHAEVRS 278

Query:   247 AIPIR----PS---ITKAYGRVDADGSRYXXXXXXXXXXXXV-ITHEKEKVTGLKIELLG 298
              IP R    P+   +  ++      G  +            + + ++ ++V+ + +    
Sbjct:   279 RIPRRYGSDPNKGVLIISHSSHKQKGMFFFLVQSEHGDLYKITLDYQGDQVSEVNVNYFD 338

Query:   299 ETSIASTISYLDNAVVYIGSSYGDSQL 325
                +A+ ++ L N  ++  S +GD  L
Sbjct:   339 TIVLANCLTVLKNGFLFAASEFGDHTL 365

 Score = 57 (25.1 bits), Expect = 1.2e-32, Sum P(4) = 1.2e-32
 Identities = 36/175 (20%), Positives = 75/175 (42%)

Query:   468 SVRLVSSTSRELRNEWK---SPPGYSVNVATANASQVLLATGGGHL-VYLEIGDGILTEV 523
             S+ ++     E+  E K   S  G +    T+ +S     T GG L +++ + +G++   
Sbjct:   656 SLSIIEMQLNEMGIETKKSQSQTGQTTTTTTSTSSASSSVTSGGSLFLFVGLKNGVVKRA 715

Query:   524 KHAQLEYEISCLDINPIGENP-SYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP 582
                 +  E+S +    +G  P    ++   G  ++  + + S   LN I +  L  +I+P
Sbjct:   716 TLDSVTGELSDIRTRLLGRKPVKLFKVKVRG--SNAMLALSSRVWLNYINQGKL--DIVP 771

Query:   583 RSVLLCAFEGISYL--------LCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 629
              S+     E  S L        + A  +  ++ F ++ K G+L +++ + L   P
Sbjct:   772 LSIE--PLENASNLSSEQSAESIVATSENKIIIFSID-KLGDLFNQETIKLNATP 823


>ASPGD|ASPL0000031473 [details] [associations]
            symbol:AN5452 species:162425 "Emericella nidulans"
            [GO:0008150 "biological_process" evidence=ND] [GO:0003676 "nucleic
            acid binding" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178 GO:GO:0008380
            Gene3D:2.130.10.10 GO:GO:0006397 GO:GO:0005681 GO:GO:0003676
            GO:GO:0007049 EMBL:BN001305 EMBL:AACD01000094 eggNOG:NOG247734
            KO:K12830 RefSeq:XP_663056.1 STRING:Q5B1X8 GeneID:2871744
            KEGG:ani:AN5452.2 HOGENOM:HOG000216677 OMA:FDTIPVA
            OrthoDB:EOG4FR40R Uniprot:Q5B1X8
        Length = 1209

 Score = 343 (125.8 bits), Expect = 2.0e-53, Sum P(5) = 2.0e-53
 Identities = 117/443 (26%), Positives = 193/443 (43%)

Query:   339 VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 398
             + ++E   +L P+VD  VV++      Q+ T SG     + R +++G+ ++E    EL  
Sbjct:   411 LNLVEAINSLNPLVDSKVVNISEDDAPQIFTVSGTGARSTFRTLKHGLEVSEIVESELPS 470

Query:   399 I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAI 457
             +   +W+ + +  D FD ++V+SF + T  L +++          GF S   TL      
Sbjct:   471 VPSAVWTTKLTRADEFDAYIVLSFANGT--LVLSIGETVEEVTDTGFLSSAPTLAVQQLG 528

Query:   458 YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-G 516
              + L+Q+    +R + +  R   NEW +P   S+  A  N  QV +A   G +VY E+  
Sbjct:   529 EDSLIQIHPRGIRHILADRRV--NEWPAPQHRSIVAAATNERQVAVALSSGEIVYFELDA 586

Query:   517 DGILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKE 574
             DG L E  +  Q+   ++CL +  + E    S   AVG   D +VRI SL PD  L  K 
Sbjct:   587 DGSLAEYDERRQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDTTLENKS 645

Query:   575 HLGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 628
                    P ++ + A    S      YL   L  G  L   L+  TGEL+D +   LG++
Sbjct:   646 VQALTAAPSALNIIAMADSSSGGTTLYLHIGLHSGVYLRTALDEVTGELSDTRTRFLGSK 705

Query:   629 PITLRTFSSKNTTHVFAASDRPTVIYSSN--KKLLYSNVNLKEVSHMCPFNSAAFPDSLA 686
              + L   S    T V A S RP + YS    K  + + ++   +     F+S    + + 
Sbjct:   706 AVKLFQVSVTGQTAVLALSSRPWLGYSDTQTKGFMLTPLDYVGLEWGWNFSSEQCVEGMV 765

Query:   687 IAKEGELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHF 745
               +   L I +I+ +    ++ SIPL   PR      +   F +    N   +  +    
Sbjct:   766 GIQGQNLRIFSIEKLDNNMLQQSIPLAYTPRHFIKHPEEPLFYVIEADNNVLSPATR--- 822

Query:   746 VRLLDDQTFEFISTYPLDTFEYG 768
              RLL+D       T  L   ++G
Sbjct:   823 ARLLEDSKARGGDTTVLPPEDFG 845

 Score = 227 (85.0 bits), Expect = 2.0e-53, Sum P(5) = 2.0e-53
 Identities = 91/351 (25%), Positives = 159/351 (45%)

Query:   757 ISTYPLDTFEYGCSILSCSF-SDDSNVYYCVGTAYVLPEENEPTK--GRILVF-IVEDGK 812
             +    L+  E   SI +  F S D   +  VGTA  +   N P+   G I ++   EDGK
Sbjct:   869 VGAVELEENEAAVSIAAVPFTSQDDETFLVVGTAKDMTV-NPPSSAGGYIHIYRFQEDGK 927

Query:   813 -LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILAL 871
              L+ I + + +    +L  F G+LLA +   +++Y     D G ++L  +C       A+
Sbjct:   928 ELEFIHKTKVEEPPLALLGFQGRLLAGVGSVLRIY-----DLGMKQLLRKCQAAVAPKAI 982

Query:   872 Y-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 930
               +QT+G  IVV D+ +S++ ++YK+++  +     D  A W +A  ++D +   G +  
Sbjct:   983 VGLQTQGSRIVVSDVRESVTYVVYKYQDNVLIPFVDDSIARWTTAATMVDYETTAGGDKF 1042

Query:   931 FNLFTVR---KNSEGATDEERGRLEVVGEYHLGEFVNRFR---HGSLVMRLPDS------ 978
              NL+ VR   K SE A +E  G   +    +L    NR     H      +P S      
Sbjct:  1043 GNLWLVRCPKKASEEADEEGSGAHLIHDRGYLQGTPNRLELMIH-VFTQDIPTSLHKTQL 1101

Query:   979 DVGQIPTVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE 1037
               G    +++    G IG++   +  E   F + L+  L      + G +H  +RS+   
Sbjct:  1102 VAGGRDILVWTGFQGTIGILVPFVSREDVDFFQSLEMQLASQCPPLAGRDHLIYRSYY-- 1159

Query:  1038 KKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL-TRL 1087
                   K  +DGDL E +  LS      I+  ++ SV E+ +++ ++ TR+
Sbjct:  1160 ---APVKGVIDGDLCEQYFLLSNDTKMMIAAELDRSVREIERKISDMRTRV 1207

 Score = 120 (47.3 bits), Expect = 2.0e-53, Sum P(5) = 2.0e-53
 Identities = 33/126 (26%), Positives = 60/126 (47%)

Query:     6 YVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTP-QG-LQPMLDVPIYGRIA 63
             Y +T   PT +T + +G F   +E  ++ A  +++ IH   P QG + P+    ++G I 
Sbjct:    10 YSLTIQPPTAITQAILGQFAGTKEQQIVTASGSKLTIHRPDPTQGKVIPLYTQDVFGIIR 69

Query:    64 TLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDNGQIGI 122
             TL  FR  G  +D++ I ++  +  ++++   S     R   +   + G R    GQ   
Sbjct:    70 TLAAFRLAGSNKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVVPGQYLA 128

Query:   123 IDPDCR 128
             +DP  R
Sbjct:   129 VDPKGR 134

 Score = 50 (22.7 bits), Expect = 2.0e-53, Sum P(5) = 2.0e-53
 Identities = 20/63 (31%), Positives = 32/63 (50%)

Query:   193 YEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPL---CGVLIIGEETIVYCSAN--AFKA 247
             YE+ L   + V   W+ + +D  + +L  VP       GVL+  E+ I Y  +N  AF+ 
Sbjct:   217 YELDL-GLNHVVRKWT-DPVDRTSSMLFQVPGGADGPSGVLVCAEDNITYRHSNQDAFR- 273

Query:   248 IPI 250
             +PI
Sbjct:   274 VPI 276

 Score = 49 (22.3 bits), Expect = 2.0e-53, Sum P(5) = 2.0e-53
 Identities = 10/35 (28%), Positives = 20/35 (57%)

Query:   288 KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGD 322
             +V GLKI+      +AS++  L +  +Y+ +  G+
Sbjct:   333 EVKGLKIKYFDTVPLASSLLILKSGFLYVAAEGGN 367

 Score = 43 (20.2 bits), Expect = 7.5e-21, Sum P(3) = 7.5e-21
 Identities = 9/19 (47%), Positives = 12/19 (63%)

Query:   577 GGEIIPRSVLLCAFEGISY 595
             GG   P  VL+CA + I+Y
Sbjct:   246 GGADGPSGVLVCAEDNITY 264


>UNIPROTKB|E2RR33 [details] [associations]
            symbol:SF3B3 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0071013 "catalytic step 2 spliceosome"
            evidence=IEA] [GO:0005689 "U12-type spliceosomal complex"
            evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
            InterPro:IPR004871 Pfam:PF03178 GO:GO:0003676 GO:GO:0071013
            GeneTree:ENSGT00530000063396 GO:GO:0005689 KO:K12830 OMA:FDTIPVA
            CTD:23450 EMBL:AAEX03004077 RefSeq:XP_536791.2
            Ensembl:ENSCAFT00000032086 GeneID:479659 KEGG:cfa:479659
            Uniprot:E2RR33
        Length = 1217

 Score = 337 (123.7 bits), Expect = 1.1e-50, Sum P(4) = 1.1e-50
 Identities = 112/408 (27%), Positives = 188/408 (46%)

Query:   347 NLGPIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWS 404
             +L PI+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W+
Sbjct:   403 SLSPIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWT 461

Query:   405 LRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQV 464
             +R   +D FD +++VSF++ T  L +++          GF   T TL C     + LVQV
Sbjct:   462 VRRHIEDEFDAYIIVSFVNAT--LVLSIGETVEEVTDSGFLGTTPTLSCSLLGDDALVQV 519

Query:   465 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE- 522
                 +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E 
Sbjct:   520 YPDGIRHIRADKRV--NEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEY 577

Query:   523 VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGE 579
              +  ++  ++ C+ +  +      S+  AVG+  D +VRI SL P   L  ++ + L  +
Sbjct:   578 TERKEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCLQPLSMQALPAQ 636

Query:   580 IIPRSVLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKV 623
               P S  LC  E                G  YL   L +G LL  +L+  TG+L+D +  
Sbjct:   637 --PES--LCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTR 692

Query:   624 SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 683
              LG++P+ L     +    V A S R  + YS   +   + ++ + +     F S   P+
Sbjct:   693 YLGSRPVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPE 752

Query:   684 SLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAI 729
              +       L I  ++ +  +  + + PL   PR+ + H E +    I
Sbjct:   753 GIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIII 800

 Score = 242 (90.2 bits), Expect = 1.1e-50, Sum P(4) = 1.1e-50
 Identities = 87/363 (23%), Positives = 169/363 (46%)

Query:   746 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRI 803
             +R+++      +    L+  E   S+  C FS+  + +Y  VG A  ++        G +
Sbjct:   866 IRVMNPIQGNTLDLVQLEQNEAAFSVAVCRFSNTGDDWYVLVGVAKDLILNPRSVAGGFV 925

Query:   804 LVF-IVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 861
               + +V +G KL+ + +   +    ++  F G++L  + + +++Y     D G ++L  +
Sbjct:   926 YTYKLVNNGEKLEFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRK 980

Query:   862 CGHHGHILALYV---QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEI 918
             C  + HI A Y+   QT G  ++V D+ +S   + YK  E  +   A D    W++   +
Sbjct:   981 C-ENKHI-ANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASL 1038

Query:   919 LDDDIYLGAENNFNLFTVR--KNSEGATDEE---------RGRLEVVGEYHLGEFVNRFR 967
             LD D   GA+   N+  VR   N+    DE+         RG L   G     E +  + 
Sbjct:  1039 LDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLN--GASQKAEVIMNYH 1096

Query:   968 HGSLVMRLPDSDV--GQIPTVIFGTVNGVIGVIASLP-HEQYLFLEKLQTNLRKVIKGVG 1024
              G  V+ L  + +  G   ++++ T++G IG++     HE + F + ++ +LR     + 
Sbjct:  1097 VGETVLSLQKTTLIPGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLC 1156

Query:  1025 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1084
             G +H  +RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct:  1157 GRDHLSFRSYY-----FPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211

Query:  1085 -TR 1086
              TR
Sbjct:  1212 RTR 1214

 Score = 100 (40.3 bits), Expect = 1.1e-50, Sum P(4) = 1.1e-50
 Identities = 27/128 (21%), Positives = 63/128 (49%)

Query:     6 YVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQ-G-LQPMLDVPIYGRIA 63
             Y +T  + T ++ +  GNF+  ++  +++++   +E+    P  G +  +L V ++G I 
Sbjct:     4 YNLTLQRATGISFAIHGNFSGTKQQEIVVSRGKILELLRPDPNTGKVHTLLTVEVFGVIR 63

Query:    64 TLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDNGQIGI 122
             +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G R    GQ   
Sbjct:    64 SLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSGCRRIVPGQFLA 122

Query:   123 IDPDCRLI 130
             +DP  R +
Sbjct:   123 VDPKGRAV 130

 Score = 47 (21.6 bits), Expect = 4.5e-20, Sum P(4) = 4.5e-20
 Identities = 34/111 (30%), Positives = 45/111 (40%)

Query:   494 ATANASQVLLATGGGHLVYLEIGDGILTE-VKHAQLEYEISCLDINPIGENPSYSQIAAV 552
             A    S  L A     LVY  +G  +  E    A LE +    D +P GE  + +Q    
Sbjct:   150 ARLTISSPLEAHKANTLVYHVVGVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLT 209

Query:   553 GMWTDIS----VRIFSLP---DLN-LITKEHLGGEIIPRSVLLCAFEGISY 595
                 D+     VR +S P     N LIT    GG   P  VL+C+   I+Y
Sbjct:   210 FYELDLGLNHVVRKYSEPLEEHGNFLITVP--GGSDGPSGVLICSENYITY 258

 Score = 40 (19.1 bits), Expect = 1.1e-50, Sum P(4) = 1.1e-50
 Identities = 12/41 (29%), Positives = 18/41 (43%)

Query:   345 YVNLGPIVDF-CVV-----DLERQGQGQVVTCSGAYKDGSL 379
             Y N G   D  C +     DL+   +G +  CS  +K  S+
Sbjct:   258 YKNFGDQPDIRCPIPRRRNDLDDPERGMIFVCSATHKTKSM 298

 Score = 38 (18.4 bits), Expect = 4.5e-20, Sum P(4) = 4.5e-20
 Identities = 17/48 (35%), Positives = 22/48 (45%)

Query:   728 AICSLKNQSCAEESEM--HFVRLL-----DDQTFEFISTYPL---DTF 765
             A+C LK       SE   H++  +     DD+  EF S  PL   DTF
Sbjct:   338 AMCVLKTGFLFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTF 385

 Score = 38 (18.4 bits), Expect = 4.5e-20, Sum P(4) = 4.5e-20
 Identities = 16/56 (28%), Positives = 20/56 (35%)

Query:   711 LGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFE 766
             L E+  R   +E S      SL N    E+        L D T   IS  P D  +
Sbjct:   574 LNEYTER---KEMSADVVCMSLANVPPGEQRSRFLAVGLVDNTVRIISLDPSDCLQ 626

 Score = 37 (18.1 bits), Expect = 4.6e-19, Sum P(4) = 4.6e-19
 Identities = 9/33 (27%), Positives = 15/33 (45%)

Query:   233 GEETIVYCSANAFKAIPIRPSITKAYGRVDADG 265
             G++TIV C+ N  + +         Y  +D  G
Sbjct:   540 GKKTIVKCAVNQRQVVIALTGGELVYFEMDPSG 572


>UNIPROTKB|A0JN52 [details] [associations]
            symbol:SF3B3 "Splicing factor 3B subunit 3" species:9913
            "Bos taurus" [GO:0071013 "catalytic step 2 spliceosome"
            evidence=IEA] [GO:0005689 "U12-type spliceosomal complex"
            evidence=IEA] [GO:0008380 "RNA splicing" evidence=IEA] [GO:0006397
            "mRNA processing" evidence=IEA] [GO:0003676 "nucleic acid binding"
            evidence=IEA] InterPro:IPR004871 Pfam:PF03178 GO:GO:0008380
            GO:GO:0006397 GO:GO:0003676 GO:GO:0071013 eggNOG:NOG247734
            GeneTree:ENSGT00530000063396 GO:GO:0005689 KO:K12830
            HOGENOM:HOG000216677 OMA:FDTIPVA EMBL:BC126518 IPI:IPI00690059
            RefSeq:NP_001071319.1 UniGene:Bt.7895 ProteinModelPortal:A0JN52
            STRING:A0JN52 PRIDE:A0JN52 Ensembl:ENSBTAT00000014050 GeneID:504962
            KEGG:bta:504962 CTD:23450 HOVERGEN:HBG093942 InParanoid:A0JN52
            OrthoDB:EOG4RV2QJ BioCyc:CATTLE:504962-MONOMER BindingDB:A0JN52
            NextBio:20866909 ArrayExpress:A0JN52 Uniprot:A0JN52
        Length = 1217

 Score = 337 (123.7 bits), Expect = 1.4e-50, Sum P(4) = 1.4e-50
 Identities = 112/408 (27%), Positives = 188/408 (46%)

Query:   347 NLGPIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWS 404
             +L PI+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W+
Sbjct:   403 SLSPIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWT 461

Query:   405 LRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQV 464
             +R   +D FD +++VSF++ T  L +++          GF   T TL C     + LVQV
Sbjct:   462 VRRHIEDEFDAYIIVSFVNAT--LVLSIGETVEEVTDSGFLGTTPTLSCSLLGDDALVQV 519

Query:   465 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE- 522
                 +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E 
Sbjct:   520 YPDGIRHIRADKRV--NEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEY 577

Query:   523 VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGE 579
              +  ++  ++ C+ +  +      S+  AVG+  D +VRI SL P   L  ++ + L  +
Sbjct:   578 TERKEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCLQPLSMQALPAQ 636

Query:   580 IIPRSVLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKV 623
               P S  LC  E                G  YL   L +G LL  +L+  TG+L+D +  
Sbjct:   637 --PES--LCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTR 692

Query:   624 SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 683
              LG++P+ L     +    V A S R  + YS   +   + ++ + +     F S   P+
Sbjct:   693 YLGSRPVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPE 752

Query:   684 SLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAI 729
              +       L I  ++ +  +  + + PL   PR+ + H E +    I
Sbjct:   753 GIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIII 800

 Score = 241 (89.9 bits), Expect = 1.4e-50, Sum P(4) = 1.4e-50
 Identities = 87/363 (23%), Positives = 168/363 (46%)

Query:   746 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRI 803
             +R+++      +    L+  E   S+  C FS+    +Y  VG A  ++        G +
Sbjct:   866 IRVMNPIQGNTLDLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFV 925

Query:   804 LVF-IVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 861
               + +V +G KL+ + +   +    ++  F G++L  + + +++Y     D G ++L  +
Sbjct:   926 YTYKLVNNGEKLEFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRK 980

Query:   862 CGHHGHILALYV---QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEI 918
             C  + HI A Y+   QT G  ++V D+ +S   + YK  E  +   A D    W++   +
Sbjct:   981 C-ENKHI-ANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASL 1038

Query:   919 LDDDIYLGAENNFNLFTVR--KNSEGATDEE---------RGRLEVVGEYHLGEFVNRFR 967
             LD D   GA+   N+  VR   N+    DE+         RG L   G     E +  + 
Sbjct:  1039 LDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLN--GASQKAEVIMNYH 1096

Query:   968 HGSLVMRLPDSDV--GQIPTVIFGTVNGVIGVIASLP-HEQYLFLEKLQTNLRKVIKGVG 1024
              G  V+ L  + +  G   ++++ T++G IG++     HE + F + ++ +LR     + 
Sbjct:  1097 VGETVLSLQKTTLIPGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLC 1156

Query:  1025 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1084
             G +H  +RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct:  1157 GRDHLSFRSYY-----FPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211

Query:  1085 -TR 1086
              TR
Sbjct:  1212 RTR 1214

 Score = 100 (40.3 bits), Expect = 1.4e-50, Sum P(4) = 1.4e-50
 Identities = 27/128 (21%), Positives = 63/128 (49%)

Query:     6 YVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQ-G-LQPMLDVPIYGRIA 63
             Y +T  + T ++ +  GNF+  ++  +++++   +E+    P  G +  +L V ++G I 
Sbjct:     4 YNLTLQRATGISFAIHGNFSGTKQQEIVVSRGKILELLRPDPNTGKVHTLLTVEVFGVIR 63

Query:    64 TLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDNGQIGI 122
             +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G R    GQ   
Sbjct:    64 SLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSGCRRIVPGQFLA 122

Query:   123 IDPDCRLI 130
             +DP  R +
Sbjct:   123 VDPKGRAV 130

 Score = 47 (21.6 bits), Expect = 5.8e-20, Sum P(4) = 5.8e-20
 Identities = 34/111 (30%), Positives = 45/111 (40%)

Query:   494 ATANASQVLLATGGGHLVYLEIGDGILTE-VKHAQLEYEISCLDINPIGENPSYSQIAAV 552
             A    S  L A     LVY  +G  +  E    A LE +    D +P GE  + +Q    
Sbjct:   150 ARLTISSPLEAHKANTLVYHVVGVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLT 209

Query:   553 GMWTDIS----VRIFSLP---DLN-LITKEHLGGEIIPRSVLLCAFEGISY 595
                 D+     VR +S P     N LIT    GG   P  VL+C+   I+Y
Sbjct:   210 FYELDLGLNHVVRKYSEPLEEHGNFLITVP--GGSDGPSGVLICSENYITY 258

 Score = 40 (19.1 bits), Expect = 1.4e-50, Sum P(4) = 1.4e-50
 Identities = 12/41 (29%), Positives = 18/41 (43%)

Query:   345 YVNLGPIVDF-CVV-----DLERQGQGQVVTCSGAYKDGSL 379
             Y N G   D  C +     DL+   +G +  CS  +K  S+
Sbjct:   258 YKNFGDQPDIRCPIPRRRNDLDDPERGMIFVCSATHKTKSM 298

 Score = 38 (18.4 bits), Expect = 5.8e-20, Sum P(4) = 5.8e-20
 Identities = 17/48 (35%), Positives = 22/48 (45%)

Query:   728 AICSLKNQSCAEESEM--HFVRLL-----DDQTFEFISTYPL---DTF 765
             A+C LK       SE   H++  +     DD+  EF S  PL   DTF
Sbjct:   338 AMCVLKTGFLFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTF 385

 Score = 38 (18.4 bits), Expect = 5.8e-20, Sum P(4) = 5.8e-20
 Identities = 16/56 (28%), Positives = 20/56 (35%)

Query:   711 LGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFE 766
             L E+  R   +E S      SL N    E+        L D T   IS  P D  +
Sbjct:   574 LNEYTER---KEMSADVVCMSLANVPPGEQRSRFLAVGLVDNTVRIISLDPSDCLQ 626

 Score = 37 (18.1 bits), Expect = 5.9e-19, Sum P(4) = 5.9e-19
 Identities = 9/33 (27%), Positives = 15/33 (45%)

Query:   233 GEETIVYCSANAFKAIPIRPSITKAYGRVDADG 265
             G++TIV C+ N  + +         Y  +D  G
Sbjct:   540 GKKTIVKCAVNQRQVVIALTGGELVYFEMDPSG 572


>UNIPROTKB|Q15393 [details] [associations]
            symbol:SF3B3 "Splicing factor 3B subunit 3" species:9606
            "Homo sapiens" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0000375 "RNA splicing, via transesterification reactions"
            evidence=TAS] [GO:0000398 "mRNA splicing, via spliceosome"
            evidence=IC;TAS] [GO:0071013 "catalytic step 2 spliceosome"
            evidence=IDA] [GO:0005689 "U12-type spliceosomal complex"
            evidence=IDA] [GO:0030532 "small nuclear ribonucleoprotein complex"
            evidence=TAS] [GO:0005681 "spliceosomal complex" evidence=TAS]
            [GO:0006397 "mRNA processing" evidence=TAS] [GO:0006461 "protein
            complex assembly" evidence=TAS] [GO:0005654 "nucleoplasm"
            evidence=TAS] [GO:0008380 "RNA splicing" evidence=TAS] [GO:0010467
            "gene expression" evidence=TAS] Reactome:REACT_71
            InterPro:IPR004871 Pfam:PF03178 GO:GO:0005654 GO:GO:0006461
            Reactome:REACT_1675 GO:GO:0003676 GO:GO:0000398 GO:GO:0071013
            GO:GO:0030532 eggNOG:NOG247734 GO:GO:0005689 KO:K12830
            HOGENOM:HOG000216677 OMA:FDTIPVA CTD:23450 HOVERGEN:HBG093942
            OrthoDB:EOG4RV2QJ EMBL:AJ001443 EMBL:D87686 EMBL:D13642
            EMBL:BC000463 EMBL:BC003146 EMBL:BC009780 EMBL:BC068974
            EMBL:AL110251 IPI:IPI00179138 IPI:IPI00300371 IPI:IPI00828110
            PIR:T14779 RefSeq:NP_036558.3 UniGene:Hs.514435
            ProteinModelPortal:Q15393 DIP:DIP-28152N IntAct:Q15393
            MINT:MINT-1402891 STRING:Q15393 PhosphoSite:Q15393 DMDM:116242787
            PaxDb:Q15393 PeptideAtlas:Q15393 PRIDE:Q15393
            Ensembl:ENST00000302516 GeneID:23450 KEGG:hsa:23450 UCSC:uc002ezf.3
            GeneCards:GC16P070557 HGNC:HGNC:10770 HPA:HPA042986 MIM:605592
            neXtProt:NX_Q15393 PharmGKB:PA35688 InParanoid:Q15393
            PhylomeDB:Q15393 BindingDB:Q15393 ChEMBL:CHEMBL1250378
            GenomeRNAi:23450 NextBio:45731 ArrayExpress:Q15393 Bgee:Q15393
            CleanEx:HS_SAP130 CleanEx:HS_SF3B3 Genevestigator:Q15393
            GermOnline:ENSG00000189091 Uniprot:Q15393
        Length = 1217

 Score = 337 (123.7 bits), Expect = 1.4e-50, Sum P(4) = 1.4e-50
 Identities = 112/408 (27%), Positives = 188/408 (46%)

Query:   347 NLGPIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWS 404
             +L PI+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W+
Sbjct:   403 SLSPIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWT 461

Query:   405 LRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQV 464
             +R   +D FD +++VSF++ T  L +++          GF   T TL C     + LVQV
Sbjct:   462 VRRHIEDEFDAYIIVSFVNAT--LVLSIGETVEEVTDSGFLGTTPTLSCSLLGDDALVQV 519

Query:   465 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE- 522
                 +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E 
Sbjct:   520 YPDGIRHIRADKRV--NEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEY 577

Query:   523 VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGE 579
              +  ++  ++ C+ +  +      S+  AVG+  D +VRI SL P   L  ++ + L  +
Sbjct:   578 TERKEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCLQPLSMQALPAQ 636

Query:   580 IIPRSVLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKV 623
               P S  LC  E                G  YL   L +G LL  +L+  TG+L+D +  
Sbjct:   637 --PES--LCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTR 692

Query:   624 SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 683
              LG++P+ L     +    V A S R  + YS   +   + ++ + +     F S   P+
Sbjct:   693 YLGSRPVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPE 752

Query:   684 SLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAI 729
              +       L I  ++ +  +  + + PL   PR+ + H E +    I
Sbjct:   753 GIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIII 800

 Score = 241 (89.9 bits), Expect = 1.4e-50, Sum P(4) = 1.4e-50
 Identities = 87/363 (23%), Positives = 168/363 (46%)

Query:   746 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRI 803
             +R+++      +    L+  E   S+  C FS+    +Y  VG A  ++        G +
Sbjct:   866 IRVMNPIQGNTLDLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFV 925

Query:   804 LVF-IVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 861
               + +V +G KL+ + +   +    ++  F G++L  + + +++Y     D G ++L  +
Sbjct:   926 YTYKLVNNGEKLEFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRK 980

Query:   862 CGHHGHILALYV---QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEI 918
             C  + HI A Y+   QT G  ++V D+ +S   + YK  E  +   A D    W++   +
Sbjct:   981 C-ENKHI-ANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASL 1038

Query:   919 LDDDIYLGAENNFNLFTVR--KNSEGATDEE---------RGRLEVVGEYHLGEFVNRFR 967
             LD D   GA+   N+  VR   N+    DE+         RG L   G     E +  + 
Sbjct:  1039 LDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLN--GASQKAEVIMNYH 1096

Query:   968 HGSLVMRLPDSDV--GQIPTVIFGTVNGVIGVIASLP-HEQYLFLEKLQTNLRKVIKGVG 1024
              G  V+ L  + +  G   ++++ T++G IG++     HE + F + ++ +LR     + 
Sbjct:  1097 VGETVLSLQKTTLIPGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLC 1156

Query:  1025 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1084
             G +H  +RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct:  1157 GRDHLSFRSYY-----FPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211

Query:  1085 -TR 1086
              TR
Sbjct:  1212 RTR 1214

 Score = 100 (40.3 bits), Expect = 1.4e-50, Sum P(4) = 1.4e-50
 Identities = 27/128 (21%), Positives = 63/128 (49%)

Query:     6 YVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQ-G-LQPMLDVPIYGRIA 63
             Y +T  + T ++ +  GNF+  ++  +++++   +E+    P  G +  +L V ++G I 
Sbjct:     4 YNLTLQRATGISFAIHGNFSGTKQQEIVVSRGKILELLRPDPNTGKVHTLLTVEVFGVIR 63

Query:    64 TLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDNGQIGI 122
             +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G R    GQ   
Sbjct:    64 SLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSGCRRIVPGQFLA 122

Query:   123 IDPDCRLI 130
             +DP  R +
Sbjct:   123 VDPKGRAV 130

 Score = 47 (21.6 bits), Expect = 5.8e-20, Sum P(4) = 5.8e-20
 Identities = 34/111 (30%), Positives = 45/111 (40%)

Query:   494 ATANASQVLLATGGGHLVYLEIGDGILTE-VKHAQLEYEISCLDINPIGENPSYSQIAAV 552
             A    S  L A     LVY  +G  +  E    A LE +    D +P GE  + +Q    
Sbjct:   150 ARLTISSPLEAHKANTLVYHVVGVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLT 209

Query:   553 GMWTDIS----VRIFSLP---DLN-LITKEHLGGEIIPRSVLLCAFEGISY 595
                 D+     VR +S P     N LIT    GG   P  VL+C+   I+Y
Sbjct:   210 FYELDLGLNHVVRKYSEPLEEHGNFLITVP--GGSDGPSGVLICSENYITY 258

 Score = 40 (19.1 bits), Expect = 1.4e-50, Sum P(4) = 1.4e-50
 Identities = 12/41 (29%), Positives = 18/41 (43%)

Query:   345 YVNLGPIVDF-CVV-----DLERQGQGQVVTCSGAYKDGSL 379
             Y N G   D  C +     DL+   +G +  CS  +K  S+
Sbjct:   258 YKNFGDQPDIRCPIPRRRNDLDDPERGMIFVCSATHKTKSM 298

 Score = 38 (18.4 bits), Expect = 5.8e-20, Sum P(4) = 5.8e-20
 Identities = 17/48 (35%), Positives = 22/48 (45%)

Query:   728 AICSLKNQSCAEESEM--HFVRLL-----DDQTFEFISTYPL---DTF 765
             A+C LK       SE   H++  +     DD+  EF S  PL   DTF
Sbjct:   338 AMCVLKTGFLFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTF 385

 Score = 38 (18.4 bits), Expect = 5.8e-20, Sum P(4) = 5.8e-20
 Identities = 16/56 (28%), Positives = 20/56 (35%)

Query:   711 LGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFE 766
             L E+  R   +E S      SL N    E+        L D T   IS  P D  +
Sbjct:   574 LNEYTER---KEMSADVVCMSLANVPPGEQRSRFLAVGLVDNTVRIISLDPSDCLQ 626

 Score = 37 (18.1 bits), Expect = 5.9e-19, Sum P(4) = 5.9e-19
 Identities = 9/33 (27%), Positives = 15/33 (45%)

Query:   233 GEETIVYCSANAFKAIPIRPSITKAYGRVDADG 265
             G++TIV C+ N  + +         Y  +D  G
Sbjct:   540 GKKTIVKCAVNQRQVVIALTGGELVYFEMDPSG 572


>MGI|MGI:1289341 [details] [associations]
            symbol:Sf3b3 "splicing factor 3b, subunit 3" species:10090
            "Mus musculus" [GO:0003674 "molecular_function" evidence=ND]
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0005681 "spliceosomal complex"
            evidence=IEA] [GO:0005689 "U12-type spliceosomal complex"
            evidence=ISO] [GO:0006397 "mRNA processing" evidence=IEA]
            [GO:0008380 "RNA splicing" evidence=IEA] [GO:0071013 "catalytic
            step 2 spliceosome" evidence=ISO] InterPro:IPR004871 Pfam:PF03178
            MGI:MGI:1289341 GO:GO:0008380 GO:GO:0006397 GO:GO:0003676
            GO:GO:0071013 eggNOG:NOG247734 GeneTree:ENSGT00530000063396
            HSSP:Q16531 GO:GO:0005689 KO:K12830 HOGENOM:HOG000216677
            OMA:FDTIPVA CTD:23450 HOVERGEN:HBG093942 OrthoDB:EOG4RV2QJ
            EMBL:AK085705 EMBL:AK088268 EMBL:AK129035 EMBL:AK147914
            EMBL:BC011412 EMBL:BC031197 EMBL:BC042580 IPI:IPI00122011
            IPI:IPI00625759 RefSeq:NP_598714.1 UniGene:Mm.236123
            ProteinModelPortal:Q921M3 IntAct:Q921M3 STRING:Q921M3
            PhosphoSite:Q921M3 PaxDb:Q921M3 PRIDE:Q921M3
            Ensembl:ENSMUST00000042012 GeneID:101943 KEGG:mmu:101943
            UCSC:uc009nlc.1 InParanoid:Q921M3 NextBio:355190 Bgee:Q921M3
            CleanEx:MM_SF3B3 Genevestigator:Q921M3 Uniprot:Q921M3
        Length = 1217

 Score = 337 (123.7 bits), Expect = 1.4e-50, Sum P(4) = 1.4e-50
 Identities = 112/408 (27%), Positives = 188/408 (46%)

Query:   347 NLGPIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWS 404
             +L PI+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W+
Sbjct:   403 SLSPIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWT 461

Query:   405 LRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQV 464
             +R   +D FD +++VSF++ T  L +++          GF   T TL C     + LVQV
Sbjct:   462 VRRHIEDEFDAYIIVSFVNAT--LVLSIGETVEEVTDSGFLGTTPTLSCSLLGDDALVQV 519

Query:   465 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE- 522
                 +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E 
Sbjct:   520 YPDGIRHIRADKRV--NEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEY 577

Query:   523 VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGE 579
              +  ++  ++ C+ +  +      S+  AVG+  D +VRI SL P   L  ++ + L  +
Sbjct:   578 TERKEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCLQPLSMQALPAQ 636

Query:   580 IIPRSVLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKV 623
               P S  LC  E                G  YL   L +G LL  +L+  TG+L+D +  
Sbjct:   637 --PES--LCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTR 692

Query:   624 SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 683
              LG++P+ L     +    V A S R  + YS   +   + ++ + +     F S   P+
Sbjct:   693 YLGSRPVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPE 752

Query:   684 SLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAI 729
              +       L I  ++ +  +  + + PL   PR+ + H E +    I
Sbjct:   753 GIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIII 800

 Score = 241 (89.9 bits), Expect = 1.4e-50, Sum P(4) = 1.4e-50
 Identities = 87/363 (23%), Positives = 168/363 (46%)

Query:   746 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRI 803
             +R+++      +    L+  E   S+  C FS+    +Y  VG A  ++        G +
Sbjct:   866 IRVMNPIQGNTLDLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILSPRSVAGGFV 925

Query:   804 LVF-IVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 861
               + +V +G KL+ + +   +    ++  F G++L  + + +++Y     D G ++L  +
Sbjct:   926 YTYKLVNNGEKLEFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRK 980

Query:   862 CGHHGHILALYV---QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEI 918
             C  + HI A Y+   QT G  ++V D+ +S   + YK  E  +   A D    W++   +
Sbjct:   981 C-ENKHI-ANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASL 1038

Query:   919 LDDDIYLGAENNFNLFTVR--KNSEGATDEE---------RGRLEVVGEYHLGEFVNRFR 967
             LD D   GA+   N+  VR   N+    DE+         RG L   G     E +  + 
Sbjct:  1039 LDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLN--GASQKAEVIMNYH 1096

Query:   968 HGSLVMRLPDSDV--GQIPTVIFGTVNGVIGVIASLP-HEQYLFLEKLQTNLRKVIKGVG 1024
              G  V+ L  + +  G   ++++ T++G IG++     HE + F + ++ +LR     + 
Sbjct:  1097 VGETVLSLQKTTLIPGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLC 1156

Query:  1025 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1084
             G +H  +RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct:  1157 GRDHLSFRSYY-----FPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211

Query:  1085 -TR 1086
              TR
Sbjct:  1212 RTR 1214

 Score = 100 (40.3 bits), Expect = 1.4e-50, Sum P(4) = 1.4e-50
 Identities = 27/128 (21%), Positives = 63/128 (49%)

Query:     6 YVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQ-G-LQPMLDVPIYGRIA 63
             Y +T  + T ++ +  GNF+  ++  +++++   +E+    P  G +  +L V ++G I 
Sbjct:     4 YNLTLQRATGISFAIHGNFSGTKQQEIVVSRGKILELLRPDPNTGKVHTLLTVEVFGVIR 63

Query:    64 TLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDNGQIGI 122
             +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G R    GQ   
Sbjct:    64 SLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSGCRRIVPGQFLA 122

Query:   123 IDPDCRLI 130
             +DP  R +
Sbjct:   123 VDPKGRAV 130

 Score = 47 (21.6 bits), Expect = 5.8e-20, Sum P(4) = 5.8e-20
 Identities = 34/111 (30%), Positives = 45/111 (40%)

Query:   494 ATANASQVLLATGGGHLVYLEIGDGILTE-VKHAQLEYEISCLDINPIGENPSYSQIAAV 552
             A    S  L A     LVY  +G  +  E    A LE +    D +P GE  + +Q    
Sbjct:   150 ARLTISSPLEAHKANTLVYHVVGVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLT 209

Query:   553 GMWTDIS----VRIFSLP---DLN-LITKEHLGGEIIPRSVLLCAFEGISY 595
                 D+     VR +S P     N LIT    GG   P  VL+C+   I+Y
Sbjct:   210 FYELDLGLNHVVRKYSEPLEEHGNFLITVP--GGSDGPSGVLICSENYITY 258

 Score = 40 (19.1 bits), Expect = 1.4e-50, Sum P(4) = 1.4e-50
 Identities = 12/41 (29%), Positives = 18/41 (43%)

Query:   345 YVNLGPIVDF-CVV-----DLERQGQGQVVTCSGAYKDGSL 379
             Y N G   D  C +     DL+   +G +  CS  +K  S+
Sbjct:   258 YKNFGDQPDIRCPIPRRRNDLDDPERGMIFVCSATHKTKSM 298

 Score = 38 (18.4 bits), Expect = 5.8e-20, Sum P(4) = 5.8e-20
 Identities = 17/48 (35%), Positives = 22/48 (45%)

Query:   728 AICSLKNQSCAEESEM--HFVRLL-----DDQTFEFISTYPL---DTF 765
             A+C LK       SE   H++  +     DD+  EF S  PL   DTF
Sbjct:   338 AMCVLKTGFLFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTF 385

 Score = 38 (18.4 bits), Expect = 5.8e-20, Sum P(4) = 5.8e-20
 Identities = 16/56 (28%), Positives = 20/56 (35%)

Query:   711 LGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFE 766
             L E+  R   +E S      SL N    E+        L D T   IS  P D  +
Sbjct:   574 LNEYTER---KEMSADVVCMSLANVPPGEQRSRFLAVGLVDNTVRIISLDPSDCLQ 626

 Score = 37 (18.1 bits), Expect = 5.9e-19, Sum P(4) = 5.9e-19
 Identities = 9/33 (27%), Positives = 15/33 (45%)

Query:   233 GEETIVYCSANAFKAIPIRPSITKAYGRVDADG 265
             G++TIV C+ N  + +         Y  +D  G
Sbjct:   540 GKKTIVKCAVNQRQVVIALTGGELVYFEMDPSG 572


>FB|FBgn0035162 [details] [associations]
            symbol:CG13900 species:7227 "Drosophila melanogaster"
            [GO:0003684 "damaged DNA binding" evidence=ISS] [GO:0030532 "small
            nuclear ribonucleoprotein complex" evidence=ISS] [GO:0000398 "mRNA
            splicing, via spliceosome" evidence=IC;ISS] [GO:0005686 "U2 snRNP"
            evidence=ISS;IDA] [GO:0007052 "mitotic spindle organization"
            evidence=IMP] [GO:0071011 "precatalytic spliceosome" evidence=IDA]
            [GO:0071013 "catalytic step 2 spliceosome" evidence=IDA]
            [GO:0022008 "neurogenesis" evidence=IMP] InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 GO:GO:0007052 GO:GO:0022008
            Gene3D:2.130.10.10 GO:GO:0003676 GO:GO:0071011 GO:GO:0000398
            GO:GO:0071013 GO:GO:0005686 eggNOG:NOG247734 EMBL:BT021338
            ProteinModelPortal:Q5BI86 SMR:Q5BI86 STRING:Q5BI86 PaxDb:Q5BI86
            PRIDE:Q5BI86 FlyBase:FBgn0035162 InParanoid:Q5BI86
            OrthoDB:EOG4B5MM0 ArrayExpress:Q5BI86 Bgee:Q5BI86 Uniprot:Q5BI86
        Length = 1227

 Score = 282 (104.3 bits), Expect = 1.7e-50, Sum P(4) = 1.7e-50
 Identities = 72/235 (30%), Positives = 116/235 (49%)

Query:   341 VLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI- 399
             +++   +  PI+   V DL  +   Q+    G     +LR++R+G+ ++E A  EL G  
Sbjct:   397 LVDELPSFAPIITSQVADLANEDTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNP 456

Query:   400 KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYN 459
               +W+++   DD FD +++VSF++ T  L +++          GF   T TL C     +
Sbjct:   457 NAVWTVKKRADDEFDAYIIVSFVNAT--LVLSIGETVEEVTDSGFLGTTPTLCCAALGDD 514

Query:   460 QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-G 518
              LVQV    +R + S  R   NEWK+P   S+     N  QV++   G  LVY E+   G
Sbjct:   515 ALVQVYPDGIRHIRSDKRV--NEWKAPGKKSITKCAVNQRQVVITLSGRELVYFEMDPTG 572

Query:   519 ILTE-VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLIT 572
              L E  + +++  EI C+ +  + E    S   AVG+  D +VRI SL   N +T
Sbjct:   573 ELNEYTERSEMPAEIMCMALGTVPEGEQRSWFLAVGL-ADNTVRILSLDPNNCLT 626

 Score = 229 (85.7 bits), Expect = 1.7e-50, Sum P(4) = 1.7e-50
 Identities = 85/363 (23%), Positives = 161/363 (44%)

Query:   746 VRLLDDQTFEFISTYPLDTFE--YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRI 803
             +R LD    + + + PL   E     ++L  S + D   Y  VG A  L      ++G  
Sbjct:   875 IRCLDAMHGQTMFSVPLTQNEAIMSMAMLKFSIAADGRYYLAVGIAKDLQLNPRISQGGC 934

Query:   804 L-VFIVED--GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQS 860
             + ++ ++     L+ +   +      +L  F G+LLA   + +++Y     D G +++  
Sbjct:   935 IDIYKIDPTCSSLEFMHRTDIDEIPGALCGFQGRLLAGCGRMLRIY-----DFGKKKMLR 989

Query:   861 ECGH-HGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEIL 919
             +C + H     + +Q  G  + V D+ +S+  + Y+  E  +   A D +  W++A  +L
Sbjct:   990 KCENKHIPYQIVNIQAMGHRVYVSDVQESVFFIRYRRAENQLIIFADDTHPRWVTATTLL 1049

Query:   920 DDDIYLGAENNFNLFTVRKNSEGATDE------------ERGRLEVVGEYHLGEFVNRFR 967
             D D    A+   NL ++++     TD+            +RG L   G     E +  F 
Sbjct:  1050 DYDTIAIADKFGNL-SIQRLPHSVTDDVDEDPTGTKSLWDRGLLS--GASQKSENICSFH 1106

Query:   968 HGSLVMRLPDSDV--GQIPTVIFGTVNGVIGVIASLP-HEQYLFLEKLQTNLRKVIKGVG 1024
              G ++M L  + +  G    +I+ T++G +G        E Y F + L+ ++R     + 
Sbjct:  1107 VGEIIMSLQKATLIPGGSEALIYATLSGTVGAFVPFTSREDYDFFQHLEMHMRNENPPLC 1166

Query:  1025 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1084
             G +H  +RS          KN LDGDL E +L +   +   I+  M  +  ++CK++E++
Sbjct:  1167 GRDHLSYRS-----SYYPVKNVLDGDLCEQYLSIEAAKQKSIAGDMFRTPNQICKKLEDI 1221

Query:  1085 -TR 1086
              TR
Sbjct:  1222 RTR 1224

 Score = 113 (44.8 bits), Expect = 1.7e-50, Sum P(4) = 1.7e-50
 Identities = 32/130 (24%), Positives = 63/130 (48%)

Query:     6 YVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQG----LQPMLDVPIYGR 61
             Y +T  K T VTH+  GNF+  ++  +++++   +E  LL P      +  +L   I+G 
Sbjct:     4 YNLTLQKATGVTHAVHGNFSGGKQQEVLLSRGKSLE--LLRPDSNTGKVHTLLSTEIFGC 61

Query:    62 IATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDNGQI 120
             +  L  FR  G  +D++ + ++  +  +L+++  S   + +   +   + G R    GQ 
Sbjct:    62 VRALMAFRLTGGTKDYIVVGSDSGRIVILEYNP-SKNALEKVHQETFGKSGCRRIVPGQY 120

Query:   121 GIIDPDCRLI 130
               IDP  R +
Sbjct:   121 FAIDPKGRAV 130

 Score = 97 (39.2 bits), Expect = 1.7e-50, Sum P(4) = 1.7e-50
 Identities = 32/154 (20%), Positives = 68/154 (44%)

Query:   591 EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRP 650
             +G  YL   L +G LL  +L+  +G+L D +   LG++P+ L     + +  V A S R 
Sbjct:   669 KGTIYLNIGLSNGVLLRTVLDPVSGDLADTRTRYLGSRPVKLFRIKMQGSEAVLAMSSRT 728

Query:   651 TVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIR-SI 709
              + Y    +   + ++ + + +   F+S    + +       L I  ++ +  +  + + 
Sbjct:   729 WLSYYHQNRFHLTPLSYETLEYASGFSSEQCSEGIVAISTNTLRILALEKLGAVFNQVAF 788

Query:   710 PLGEHPRR-ICHQEQSRTFAICSLKNQSCAEESE 742
             PL   PR  + H +  R   I    + +  E+++
Sbjct:   789 PLQYTPRTFVIHPDTGRML-IAETDHNAYTEDTK 821

 Score = 58 (25.5 bits), Expect = 4.2e-21, Sum P(4) = 4.2e-21
 Identities = 33/136 (24%), Positives = 60/136 (44%)

Query:   509 HLVYLEIG-DGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDIS----VRIF 563
             H+V +++G D  +      +++YE +  D++P G+    +Q        D+     VR +
Sbjct:   169 HMVGVDVGFDNPMLAC--LEIDYEEA--DMDPSGDAAQRTQQTLTFYELDLGLNHVVRKY 224

Query:   564 SLP---DLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDR 620
             S P     N +     GG   P  VL+C+   ++Y    LGD H +   +  +  +L D 
Sbjct:   225 SEPLEEHANFLVSVP-GGNDGPSGVLICSENYLTYK--NLGDQHDIRCPIPRRRNDLDDP 281

Query:   621 KKVSLGTQPITLRTFS 636
             ++  +     T RT S
Sbjct:   282 ERGMIFICSATHRTKS 297

 Score = 39 (18.8 bits), Expect = 3.6e-25, Sum P(4) = 3.6e-25
 Identities = 12/41 (29%), Positives = 19/41 (46%)

Query:   345 YVNLGPIVDF-CVV-----DLERQGQGQVVTCSGAYKDGSL 379
             Y NLG   D  C +     DL+   +G +  CS  ++  S+
Sbjct:   258 YKNLGDQHDIRCPIPRRRNDLDDPERGMIFICSATHRTKSM 298

 Score = 38 (18.4 bits), Expect = 2.1e-44, Sum P(4) = 2.1e-44
 Identities = 9/31 (29%), Positives = 15/31 (48%)

Query:   637 SKNTTHVFAASDRPTVIYSSNKKLLYSNVNL 667
             +++TT      D P     +NK  +Y N+ L
Sbjct:   648 TESTTQGGLDDDAPAQRSGNNKGTIYLNIGL 678


>ZFIN|ZDB-GENE-040426-2901 [details] [associations]
            symbol:sf3b3 "splicing factor 3b, subunit 3"
            species:7955 "Danio rerio" [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0005681
            "spliceosomal complex" evidence=IEA] [GO:0006397 "mRNA processing"
            evidence=IEA] [GO:0008380 "RNA splicing" evidence=IEA]
            InterPro:IPR017986 InterPro:IPR004871 InterPro:IPR015943
            Pfam:PF03178 ZFIN:ZDB-GENE-040426-2901 GO:GO:0008380
            Gene3D:2.130.10.10 SUPFAM:SSF50978 GO:GO:0006397 GO:GO:0005681
            GO:GO:0003676 eggNOG:NOG247734 GeneTree:ENSGT00530000063396
            KO:K12830 HOGENOM:HOG000216677 OMA:FDTIPVA CTD:23450
            HOVERGEN:HBG093942 OrthoDB:EOG4RV2QJ EMBL:BX784024 EMBL:BC047171
            IPI:IPI00508652 RefSeq:NP_998668.1 RefSeq:XP_002667683.2
            UniGene:Dr.76176 STRING:Q1LVE8 PRIDE:Q1LVE8
            Ensembl:ENSDART00000008310 Ensembl:ENSDART00000122831
            Ensembl:ENSDART00000129666 Ensembl:ENSDART00000147743
            GeneID:100334114 GeneID:406824 KEGG:dre:100334114 KEGG:dre:406824
            InParanoid:Q1LVE8 NextBio:20818331 Bgee:Q1LVE8 Uniprot:Q1LVE8
        Length = 1217

 Score = 338 (124.0 bits), Expect = 1.8e-50, Sum P(4) = 1.8e-50
 Identities = 106/411 (25%), Positives = 190/411 (46%)

Query:   341 VLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI- 399
             +++   +L PI+   + DL  +   Q+    G     +LR++R+G+ ++E A  EL G  
Sbjct:   397 LVDEQESLSPIMSCQIADLANEDTPQLYVACGRGPRSTLRVLRHGLEVSEMAVSELPGNP 456

Query:   400 KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYN 459
               +W++R   +D FD +++VSF++ T  L +++          GF   T TL C     +
Sbjct:   457 NAVWTVRRHVEDEFDAYIIVSFVNAT--LVLSIGETVEEVTDSGFLGTTPTLSCSLLGED 514

Query:   460 QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DG 518
              LVQV    +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G
Sbjct:   515 ALVQVYPDGIRHIRADKRV--NEWKTPGKKTIIRCAVNQRQVVIALTGGELVYFEMDPSG 572

Query:   519 ILTE-VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKE 574
              L E  +  ++  ++ C+ +  +      S+  AVG+  D +VRI SL P   L  ++ +
Sbjct:   573 QLNEYTERKEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCLQPLSMQ 631

Query:   575 HLGGEIIPRSVLLCAFEGIS--------------YLLCALGDGHLLNFLLNMKTGELTDR 620
              L  +  P S+ +    G+               YL   L +G LL  +L+  TG+L+D 
Sbjct:   632 ALPAQ--PESLCIVEMGGVEKQDELGEKGTIGFLYLNIGLQNGVLLRTVLDPVTGDLSDT 689

Query:   621 KKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAA 680
             +   LG++P+ L     +    V A S R  + YS   +   + ++ + + +   F S  
Sbjct:   690 RTRYLGSRPVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEYASGFASEQ 749

Query:   681 FPDSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAI 729
              P+ +       L I  ++ +  +  + + PL   PR+ + H E +    I
Sbjct:   750 CPEGIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPETNNLILI 800

 Score = 230 (86.0 bits), Expect = 1.8e-50, Sum P(4) = 1.8e-50
 Identities = 85/361 (23%), Positives = 164/361 (45%)

Query:   746 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRI 803
             VRL++      +    L+  E   S+  C F +  + +Y  VG A  ++        G I
Sbjct:   866 VRLINPIQGNTLDLVQLEQNEAAFSVAICRFLNGGDDWYVLVGVARDMILNPRSVGGGYI 925

Query:   804 LVF-IVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 861
               + IV  G KL+ + +   +    ++  F G++L  + + +++Y     D G ++L  +
Sbjct:   926 YTYRIVGGGDKLEFLHKTPVEDVPLAIAPFQGRVLVGVGKLLRIY-----DLGKKKLLRK 980

Query:   862 CGH-HGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILD 920
             C + H   L   + T G  ++V D+ +S+  + Y+  E  +   A D    W++   +LD
Sbjct:   981 CENKHVPNLVTGIHTIGQRVIVSDVQESLFWVRYRRNENQLIIFADDTYPRWITTACLLD 1040

Query:   921 DDIYLGAENNFNLFTVR--KNSEGATDEE---------RGRLEVVGEYHLGEFVNRFRHG 969
              D    A+   N+  VR   N+    DE+         RG L   G     E +  +  G
Sbjct:  1041 YDTMASADKFGNICVVRLPPNTSDDVDEDPTGNKALWDRGLLN--GASQKAEIIINYHIG 1098

Query:   970 SLVMRLPDSDV--GQIPTVIFGTVNGVIGVIASLP-HEQYLFLEKLQTNLRKVIKGVGGL 1026
               V+ L  + +  G   ++++ T++G IG++     HE + F + L+ ++R     + G 
Sbjct:  1099 ETVLSLQKTTLIPGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHLEMHMRSEFPPLCGR 1158

Query:  1027 NHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL-T 1085
             +H  +RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++ T
Sbjct:  1159 DHLSFRSYY-----FPVKNVIDGDLCEQFNSMDPHKQKSVSEELDRTPPEVSKKLEDIRT 1213

Query:  1086 R 1086
             R
Sbjct:  1214 R 1214

 Score = 109 (43.4 bits), Expect = 1.8e-50, Sum P(4) = 1.8e-50
 Identities = 41/201 (20%), Positives = 88/201 (43%)

Query:     6 YVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQG----LQPMLDVPIYGR 61
             Y +T  + T ++H+  GNF+  ++  +++++   +E  LL P      +  +L + ++G 
Sbjct:     4 YNITLQRATGISHAIHGNFSGTKQQEIVVSRGKILE--LLRPDANTGKVHTLLTMEVFGV 61

Query:    62 IATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDNGQI 120
             + +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G R    GQ 
Sbjct:    62 VRSLMAFRLTGGTKDYVVVGSDSGRIVILEYHP-SKNMFEKIHQETFGKSGCRRIVPGQF 120

Query:   121 GIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGCAKP 175
               +DP  R  +IG      L  ++  D   +L  +  +   +   L    +    G   P
Sbjct:   121 LAVDPKGRAVMIGATEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVVGVDVGFENP 180

Query:   176 TIVVLYQDNKDARHVKTYEVA 196
                 L  D ++A +  T E A
Sbjct:   181 MFACLEMDYEEADNDPTGEAA 201

 Score = 44 (20.5 bits), Expect = 5.6e-13, Sum P(3) = 5.6e-13
 Identities = 33/111 (29%), Positives = 45/111 (40%)

Query:   494 ATANASQVLLATGGGHLVYLEIGDGILTE-VKHAQLEYEISCLDINPIGENPSYSQIAAV 552
             A    S  L A     LVY  +G  +  E    A LE +    D +P GE  + +Q    
Sbjct:   150 ARLTISSPLEAHKANTLVYHVVGVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLT 209

Query:   553 GMWTDIS----VRIFS--LPDLN--LITKEHLGGEIIPRSVLLCAFEGISY 595
                 D+     VR +S  L +    LIT    GG   P  VL+C+   I+Y
Sbjct:   210 FYELDLGLNHVVRKYSEALEEHGNFLITVP--GGSDGPSGVLICSENYITY 258

 Score = 40 (19.1 bits), Expect = 1.8e-50, Sum P(4) = 1.8e-50
 Identities = 12/41 (29%), Positives = 18/41 (43%)

Query:   345 YVNLGPIVDF-CVV-----DLERQGQGQVVTCSGAYKDGSL 379
             Y N G   D  C +     DL+   +G +  CS  +K  S+
Sbjct:   258 YKNFGDQPDIRCPIPRRRNDLDDPERGMIFVCSATHKTKSM 298

 Score = 39 (18.8 bits), Expect = 4.3e-19, Sum P(4) = 4.3e-19
 Identities = 17/48 (35%), Positives = 22/48 (45%)

Query:   728 AICSLKNQSCAEESEM--HFVRLL-----DDQTFEFISTYPL---DTF 765
             A+C LK       SE   H++  +     DD+  EF S  PL   DTF
Sbjct:   338 AMCVLKTGFLFVSSEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTF 385

 Score = 38 (18.4 bits), Expect = 5.4e-19, Sum P(4) = 5.4e-19
 Identities = 16/56 (28%), Positives = 20/56 (35%)

Query:   711 LGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFE 766
             L E+  R   +E S      SL N    E+        L D T   IS  P D  +
Sbjct:   574 LNEYTER---KEMSADVVCMSLANVPPGEQRSRFLAVGLVDNTVRIISLDPSDCLQ 626


>UNIPROTKB|F5H2L3 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9606
            "Homo sapiens" [GO:0016055 "Wnt receptor signaling pathway"
            evidence=IEA] GO:GO:0016055 GO:GO:0003684 EMBL:AP003108
            HGNC:HGNC:2717 ChiTaRS:DDB1 EMBL:AP003037 IPI:IPI01011208
            ProteinModelPortal:F5H2L3 SMR:F5H2L3 Ensembl:ENST00000539426
            ArrayExpress:F5H2L3 Bgee:F5H2L3 Uniprot:F5H2L3
        Length = 165

 Score = 523 (189.2 bits), Expect = 3.0e-49, P = 3.0e-49
 Identities = 97/165 (58%), Positives = 128/165 (77%)

Query:    58 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIGRPT 115
             +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIGRP+
Sbjct:     1 MYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIGRPS 60

Query:   116 DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKP 175
             + G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC  P
Sbjct:    61 ETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGCQAP 120

Query:   176 TIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLI 220
             TI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I
Sbjct:   121 TICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVI 164


>UNIPROTKB|F1P529 [details] [associations]
            symbol:SF3B3 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0005689 "U12-type spliceosomal complex" evidence=IEA]
            [GO:0071013 "catalytic step 2 spliceosome" evidence=IEA]
            InterPro:IPR004871 Pfam:PF03178 GO:GO:0003676 GO:GO:0071013
            GeneTree:ENSGT00530000063396 GO:GO:0005689 OMA:FDTIPVA
            EMBL:AADN02051593 EMBL:AADN02051594 IPI:IPI00576925
            Ensembl:ENSGALT00000003987 ArrayExpress:F1P529 Uniprot:F1P529
        Length = 1228

 Score = 333 (122.3 bits), Expect = 3.1e-48, Sum P(4) = 3.1e-48
 Identities = 110/407 (27%), Positives = 186/407 (45%)

Query:   347 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 405
             +L PI+   + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++
Sbjct:   403 SLSPILCCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTV 462

Query:   406 RSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQVT 465
             R   +D FD +++VSF++ T  L +++          GF   T TL C     + LVQV 
Sbjct:   463 RRHVEDEFDAYIIVSFVNAT--LVLSIGETVEEVTDSGFLGTTPTLSCSLLGDDALVQVY 520

Query:   466 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-V 523
                +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  
Sbjct:   521 PDGIRHIRADKRV--NEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYT 578

Query:   524 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEI 580
             +  ++  ++ C+ +  +      S+  AVG+  D +VRI SL P   L  ++ + L  + 
Sbjct:   579 ERKEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCLQPLSMQALPAQ- 636

Query:   581 IPRSVLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVS 624
              P S  LC  E                G  YL   L +G LL  +L+  TG+L+D +   
Sbjct:   637 -PES--LCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRY 693

Query:   625 LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS 684
             LG++P+ L     +    V A S R  + YS   +   + ++ + +     F S   P+ 
Sbjct:   694 LGSRPVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEG 753

Query:   685 LAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAI 729
             +       L I  ++ +  +  + + PL   PR+ + H E +    I
Sbjct:   754 IVAISTNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIII 800

 Score = 222 (83.2 bits), Expect = 3.1e-48, Sum P(4) = 3.1e-48
 Identities = 74/299 (24%), Positives = 142/299 (47%)

Query:   807 IVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHH 865
             +V  G KL+ + +   +    ++  F G++L  + + +++Y     D G ++L  +C + 
Sbjct:   940 LVNGGEKLEFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKCENK 994

Query:   866 GHILALYV---QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD 922
              HI A Y+   QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD D
Sbjct:   995 KHI-ANYICGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTATLLDYD 1053

Query:   923 IYLGAENNFNLFTVR--KNSEGATDEE---------RGRLEVVGEYHLGEFVNRFRHGSL 971
                GA+   N+  VR   N+    DE+         RG L   G     E +  +  G  
Sbjct:  1054 TVAGADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLN--GASQKAEVIMNYHVGET 1111

Query:   972 VMRLPDSDV--GQIPTVIFGTVNGVIGVIASLP-HEQYLFLEKLQTNLRKVIKGVGGLNH 1028
             V+ L  + +  G   ++++ T++G IG++     HE + F + ++ +LR     + G +H
Sbjct:  1112 VLSLQKTTLIPGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDH 1171

Query:  1029 EQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL-TR 1086
               +RS+         KN +DGDL E F  +   +   +++ ++ +  E+ K++E++ TR
Sbjct:  1172 LSFRSYY-----FPVKNVIDGDLCEQFNSMEPNKQKNVAEELDRTPPEVSKKLEDIRTR 1225

 Score = 101 (40.6 bits), Expect = 3.1e-48, Sum P(4) = 3.1e-48
 Identities = 27/128 (21%), Positives = 64/128 (50%)

Query:     6 YVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQ-G-LQPMLDVPIYGRIA 63
             Y +T  + T ++++  GNF+  ++  +++++   +E+    P  G +  +L V ++G I 
Sbjct:     4 YNLTLQRATGISYAIHGNFSGTKQQEIVVSRGKILELLRPDPNTGKVHTLLTVEVFGVIR 63

Query:    64 TLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDNGQIGI 122
             +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G R    GQ   
Sbjct:    64 SLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNVFEKIHQETFGKSGCRRIVPGQYLA 122

Query:   123 IDPDCRLI 130
             +DP  R +
Sbjct:   123 VDPKGRAV 130

 Score = 47 (21.6 bits), Expect = 5.1e-18, Sum P(4) = 5.1e-18
 Identities = 34/111 (30%), Positives = 45/111 (40%)

Query:   494 ATANASQVLLATGGGHLVYLEIGDGILTE-VKHAQLEYEISCLDINPIGENPSYSQIAAV 552
             A    S  L A     LVY  +G  +  E    A LE +    D +P GE  + +Q    
Sbjct:   150 ARLTISSPLEAHKANTLVYHVVGVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLT 209

Query:   553 GMWTDIS----VRIFSLP---DLN-LITKEHLGGEIIPRSVLLCAFEGISY 595
                 D+     VR +S P     N LIT    GG   P  VL+C+   I+Y
Sbjct:   210 FYELDLGLNHVVRKYSEPLEEHGNFLITVP--GGSDGPSGVLICSENYITY 258

 Score = 42 (19.8 bits), Expect = 8.0e-13, Sum P(2) = 8.0e-13
 Identities = 12/36 (33%), Positives = 19/36 (52%)

Query:   799 TKGRILVFIVED---GKLQLIAEKETKGAVYSLNAF 831
             ++G+IL  +  D   GK+  +   E  G + SL AF
Sbjct:    33 SRGKILELLRPDPNTGKVHTLLTVEVFGVIRSLMAF 68

 Score = 40 (19.1 bits), Expect = 3.1e-48, Sum P(4) = 3.1e-48
 Identities = 12/41 (29%), Positives = 18/41 (43%)

Query:   345 YVNLGPIVDF-CVV-----DLERQGQGQVVTCSGAYKDGSL 379
             Y N G   D  C +     DL+   +G +  CS  +K  S+
Sbjct:   258 YKNFGDQPDIRCPIPRRRNDLDDPERGMIFVCSATHKTKSM 298

 Score = 38 (18.4 bits), Expect = 5.1e-18, Sum P(4) = 5.1e-18
 Identities = 17/48 (35%), Positives = 22/48 (45%)

Query:   728 AICSLKNQSCAEESEM--HFVRLL-----DDQTFEFISTYPL---DTF 765
             A+C LK       SE   H++  +     DD+  EF S  PL   DTF
Sbjct:   338 AMCVLKTGFLFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTF 385

 Score = 38 (18.4 bits), Expect = 5.1e-18, Sum P(4) = 5.1e-18
 Identities = 16/56 (28%), Positives = 20/56 (35%)

Query:   711 LGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFE 766
             L E+  R   +E S      SL N    E+        L D T   IS  P D  +
Sbjct:   574 LNEYTER---KEMSADVVCMSLANVPPGEQRSRFLAVGLVDNTVRIISLDPSDCLQ 626

 Score = 37 (18.1 bits), Expect = 5.2e-17, Sum P(4) = 5.2e-17
 Identities = 9/33 (27%), Positives = 15/33 (45%)

Query:   233 GEETIVYCSANAFKAIPIRPSITKAYGRVDADG 265
             G++TIV C+ N  + +         Y  +D  G
Sbjct:   540 GKKTIVKCAVNQRQVVIALTGGELVYFEMDPSG 572


>WB|WBGene00019323 [details] [associations]
            symbol:teg-4 species:6239 "Caenorhabditis elegans"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0009792 "embryo development ending in
            birth or egg hatching" evidence=IMP] [GO:0040035 "hermaphrodite
            genitalia development" evidence=IMP] [GO:0009790 "embryo
            development" evidence=IMP] [GO:0001703 "gastrulation with mouth
            forming first" evidence=IMP] [GO:0000003 "reproduction"
            evidence=IMP] [GO:0040007 "growth" evidence=IMP] [GO:0002119
            "nematode larval development" evidence=IMP] [GO:0002009
            "morphogenesis of an epithelium" evidence=IMP] [GO:0042127
            "regulation of cell proliferation" evidence=IMP] [GO:0040020
            "regulation of meiosis" evidence=IMP] [GO:0008406 "gonad
            development" evidence=IMP] [GO:0016477 "cell migration"
            evidence=IMP] [GO:0007281 "germ cell development" evidence=IMP]
            InterPro:IPR017986 InterPro:IPR004871 InterPro:IPR015943
            Pfam:PF03178 GO:GO:0005634 GO:GO:0009792 GO:GO:0002009
            GO:GO:0040007 GO:GO:0016477 GO:GO:0008406 GO:GO:0002119
            Gene3D:2.130.10.10 SUPFAM:SSF50978 GO:GO:0003676 GO:GO:0042127
            GO:GO:0040035 GO:GO:0007281 GO:GO:0040020 eggNOG:NOG247734
            GeneTree:ENSGT00530000063396 GO:GO:0001703 KO:K12830
            HOGENOM:HOG000216677 OMA:FDTIPVA EMBL:FO081029 PIR:T32916
            RefSeq:NP_491953.1 ProteinModelPortal:O44985 STRING:O44985
            PaxDb:O44985 EnsemblMetazoa:K02F2.3 GeneID:172406
            KEGG:cel:CELE_K02F2.3 UCSC:K02F2.3 CTD:172406 WormBase:K02F2.3
            InParanoid:O44985 NextBio:875387 Uniprot:O44985
        Length = 1220

 Score = 342 (125.4 bits), Expect = 1.1e-46, Sum P(3) = 1.1e-46
 Identities = 101/384 (26%), Positives = 181/384 (47%)

Query:   347 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 405
             +L P+ D  + D+ R+   Q+ +  G     SL+++RNG+ I+E A  +L G    +W++
Sbjct:   403 SLSPLTDAVIGDIAREDAAQIYSLVGRGARSSLKVLRNGLEISEMAVSDLPGNPNAVWTV 462

Query:   406 RSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQVT 465
             + + +D +D+++VVSF++ T  LA+ +          GF   T T+ C     + LVQ+ 
Sbjct:   463 KKNIEDQYDSYIVVSFVNAT--LALTIGDTVEEASDSGFLPTTPTIGCAMIGDDSLVQIY 520

Query:   466 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVK 524
             S  +R + +  R   NEWK+PP   +     N  QV +A  GG LVY E+  +G L E  
Sbjct:   521 SEGIRHIRADKRI--NEWKAPPRRQIVKCAVNRRQVAVALTGGELVYFELDLNGTLNEFT 578

Query:   525 HAQL-EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIP 582
               +L   +I+C+  + I E    S+  A+G   D +VRI SL P+  L+          P
Sbjct:   579 ERKLFNADIACMTFSEISEGELNSRFLALGT-VDNAVRIISLDPNDMLMPLSTQSLPCPP 637

Query:   583 RSVLLCAF-----EGIS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 635
              S+LL        +G++  +L   L +G L    ++  TG + D +   LGT+P+ L   
Sbjct:   638 ESILLIDTPNEDGKGVAAVHLNIGLQNGCLFRNTVDNVTGAIMDTRTRYLGTRPVKLFKV 697

Query:   636 SSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI 695
               +  + +   S R  ++Y   ++   + ++   + +   F S    + +       L I
Sbjct:   698 QCQGRSAILCTSSRSWLLYHFQRRFHLTPLSYANLEYAASFCSNQCSEGIVAISASTLRI 757

Query:   696 GTIDDIQ-KLHIRSIPLGEHPRRI 718
                + +    +++S      PRR+
Sbjct:   758 IAAEKLGVAFNVQSFEHKMTPRRV 781

 Score = 194 (73.4 bits), Expect = 1.1e-46, Sum P(3) = 1.1e-46
 Identities = 71/307 (23%), Positives = 144/307 (46%)

Query:   798 PTKGRILVF-IVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGT 855
             PT+G +  F +  +G +   +   ET   V +++ F G  L    + +++Y     D G 
Sbjct:   923 PTRGCVYTFHLSANGDRFDFLHRTETPLPVGAIHDFRGMALVGFGRFLRMY-----DIGQ 977

Query:   856 RELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMS 914
             ++L ++C +    +++  +Q+ G  I+V D  +S+  L Y+  +  +   A D    +++
Sbjct:   978 KKLLAKCENKNFPVSIVNIQSTGQRIIVSDSQESVHFLRYRKGDNQLVVFADDTTPRYVT 1037

Query:   915 AVEILDDDIYLGAENNFNLFTVR---KNSEGATDE--------ERGRLEVVGEYHLGEFV 963
              V +LD      A+   NL  VR   + +E   D+        +RG L   G     E V
Sbjct:  1038 CVCVLDYHTVAVADKFGNLAVVRLPERVNEDVQDDPTVSKSVWDRGWLN--GASQKVELV 1095

Query:   964 NRFRHGSLVMRLPDSDV--GQIPTVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVI 1020
             + F  G  +  L  + +  G    +++ T+ G IG + S +  ++  F   L+ ++R   
Sbjct:  1096 SNFFIGDTITSLQKTSLMPGANEALVYTTIGGAIGCLVSFMSKDEVDFFTNLEMHVRSEY 1155

Query:  1021 KGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKR 1080
               + G +H  +RS+    K+V     +DGD+ E F  +   +  ++++ +  +V E+ K+
Sbjct:  1156 PPLCGRDHLAYRSYYAPCKSV-----IDGDICEQFSLMDTQKQKDVAEELGKTVSEISKK 1210

Query:  1081 VEEL-TR 1086
             +E++ TR
Sbjct:  1211 LEDIRTR 1217

 Score = 98 (39.6 bits), Expect = 1.1e-46, Sum P(3) = 1.1e-46
 Identities = 45/200 (22%), Positives = 88/200 (44%)

Query:     6 YVVTAHKPTNVTHSCVGNFT-SPQELNLIIAKCTRIEIHLL-TPQG-LQPMLDVPIYGRI 62
             Y +T    + +  +  GNF+ +P+   +++ + + +E+  L T  G ++ M    I+G +
Sbjct:     4 YNLTLQGQSAINQAIQGNFSGTPKAQEIVVGRGSALELLTLDTVTGKIKVMCHQDIFGIV 63

Query:    63 ATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDNGQIG 121
              +L  FR     +DF+ + ++  +  +LQ++AE +    R   +   + G R    G   
Sbjct:    64 RSLLAFRLTAGTRDFIAVGSDSGRIVILQYNAEKT-CFERLHQETFGKTGCRRIVPGHFL 122

Query:   122 IIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGCAKPT 176
             + DP  R  +IG      L  ++  D++  L  +  +   +   L    +    G   PT
Sbjct:   123 VGDPRGRALMIGAVERQKLVYIMNRDSEAHLTISSPLEAHKHHTLCYAMVGIDVGFENPT 182

Query:   177 IVVLYQDNKDARHVKTYEVA 196
                L  D +DA +  T E A
Sbjct:   183 FACLEFDYEDADNDPTGEAA 202

 Score = 37 (18.1 bits), Expect = 1.9e-14, Sum P(3) = 1.9e-14
 Identities = 8/20 (40%), Positives = 12/20 (60%)

Query:   496 ANASQVLLATGGGHLVYLEI 515
             A+A  ++ AT G  L Y E+
Sbjct:   857 ASAISLISATSGDKLSYFEL 876


>UNIPROTKB|E9PT66 [details] [associations]
            symbol:Sf3b3 "Protein Sf3b3" species:10116 "Rattus
            norvegicus" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR004871 Pfam:PF03178
            RGD:1311636 GO:GO:0003676 GO:GO:0071013
            GeneTree:ENSGT00530000063396 GO:GO:0005689 IPI:IPI00958853
            Ensembl:ENSRNOT00000023854 ArrayExpress:E9PT66 Uniprot:E9PT66
        Length = 920

 Score = 337 (123.7 bits), Expect = 1.8e-46, Sum P(2) = 1.8e-46
 Identities = 112/408 (27%), Positives = 188/408 (46%)

Query:   347 NLGPIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWS 404
             +L PI+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W+
Sbjct:   106 SLSPIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWT 164

Query:   405 LRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQV 464
             +R   +D FD +++VSF++ T  L +++          GF   T TL C     + LVQV
Sbjct:   165 VRRHIEDEFDAYIIVSFVNAT--LVLSIGETVEEVTDSGFLGTTPTLSCSLLGDDALVQV 222

Query:   465 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE- 522
                 +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E 
Sbjct:   223 YPDGIRHIRADKRV--NEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEY 280

Query:   523 VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGE 579
              +  ++  ++ C+ +  +      S+  AVG+  D +VRI SL P   L  ++ + L  +
Sbjct:   281 TERKEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCLQPLSMQALPAQ 339

Query:   580 IIPRSVLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKV 623
               P S  LC  E                G  YL   L +G LL  +L+  TG+L+D +  
Sbjct:   340 --PES--LCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTR 395

Query:   624 SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 683
              LG++P+ L     +    V A S R  + YS   +   + ++ + +     F S   P+
Sbjct:   396 YLGSRPVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPE 455

Query:   684 SLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAI 729
              +       L I  ++ +  +  + + PL   PR+ + H E +    I
Sbjct:   456 GIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIII 503

 Score = 241 (89.9 bits), Expect = 1.8e-46, Sum P(2) = 1.8e-46
 Identities = 87/363 (23%), Positives = 168/363 (46%)

Query:   746 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRI 803
             +R+++      +    L+  E   S+  C FS+    +Y  VG A  ++        G +
Sbjct:   569 IRVMNPIQGNTLDLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILSPRSVAGGFV 628

Query:   804 LVF-IVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 861
               + +V +G KL+ + +   +    ++  F G++L  + + +++Y     D G ++L  +
Sbjct:   629 YTYKLVNNGEKLEFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRK 683

Query:   862 CGHHGHILALYV---QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEI 918
             C  + HI A Y+   QT G  ++V D+ +S   + YK  E  +   A D    W++   +
Sbjct:   684 C-ENKHI-ANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASL 741

Query:   919 LDDDIYLGAENNFNLFTVR--KNSEGATDEE---------RGRLEVVGEYHLGEFVNRFR 967
             LD D   GA+   N+  VR   N+    DE+         RG L   G     E +  + 
Sbjct:   742 LDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLN--GASQKAEVIMNYH 799

Query:   968 HGSLVMRLPDSDV--GQIPTVIFGTVNGVIGVIASLP-HEQYLFLEKLQTNLRKVIKGVG 1024
              G  V+ L  + +  G   ++++ T++G IG++     HE + F + ++ +LR     + 
Sbjct:   800 VGETVLSLQKTTLIPGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLC 859

Query:  1025 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1084
             G +H  +RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct:   860 GRDHLSFRSYY-----FPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 914

Query:  1085 -TR 1086
              TR
Sbjct:   915 RTR 917

 Score = 38 (18.4 bits), Expect = 8.6e-15, Sum P(2) = 8.6e-15
 Identities = 17/48 (35%), Positives = 22/48 (45%)

Query:   728 AICSLKNQSCAEESEM--HFVRLL-----DDQTFEFISTYPL---DTF 765
             A+C LK       SE   H++  +     DD+  EF S  PL   DTF
Sbjct:    41 AMCVLKTGFLFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTF 88

 Score = 38 (18.4 bits), Expect = 7.7e-14, Sum P(3) = 7.7e-14
 Identities = 16/56 (28%), Positives = 20/56 (35%)

Query:   711 LGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFE 766
             L E+  R   +E S      SL N    E+        L D T   IS  P D  +
Sbjct:   277 LNEYTER---KEMSADVVCMSLANVPPGEQRSRFLAVGLVDNTVRIISLDPSDCLQ 329

 Score = 37 (18.1 bits), Expect = 7.7e-14, Sum P(3) = 7.7e-14
 Identities = 9/33 (27%), Positives = 15/33 (45%)

Query:   233 GEETIVYCSANAFKAIPIRPSITKAYGRVDADG 265
             G++TIV C+ N  + +         Y  +D  G
Sbjct:   243 GKKTIVKCAVNQRQVVIALTGGELVYFEMDPSG 275


>POMBASE|SPAPJ698.03c [details] [associations]
            symbol:prp12 "U2 snRNP-associated protein Sap130
            (predicted)" species:4896 "Schizosaccharomyces pombe" [GO:0000245
            "spliceosomal complex assembly" evidence=ISS] [GO:0005681
            "spliceosomal complex" evidence=IEA] [GO:0005686 "U2 snRNP"
            evidence=ISS] [GO:0030620 "U2 snRNA binding" evidence=ISS]
            [GO:0045292 "mRNA cis splicing, via spliceosome" evidence=ISS]
            InterPro:IPR017986 InterPro:IPR004871 InterPro:IPR015943
            Pfam:PF03178 PomBase:SPAPJ698.03c EMBL:CU329670
            GenomeReviews:CU329670_GR Gene3D:2.130.10.10 SUPFAM:SSF50978
            GO:GO:0005681 GO:GO:0007049 GO:GO:0000245 GO:GO:0005686
            GO:GO:0045292 eggNOG:NOG247734 GO:GO:0030620 KO:K12830
            HOGENOM:HOG000216677 OMA:FDTIPVA OrthoDB:EOG4FR40R EMBL:AB034966
            RefSeq:NP_594414.1 IntAct:Q9UTT2 STRING:Q9UTT2
            EnsemblFungi:SPAPJ698.03c.1 GeneID:2543278 KEGG:spo:SPAPJ698.03c
            NextBio:20804299 Uniprot:Q9UTT2
        Length = 1206

 Score = 322 (118.4 bits), Expect = 7.9e-46, Sum P(4) = 7.9e-46
 Identities = 107/413 (25%), Positives = 187/413 (45%)

Query:   322 DSQLIKLNLQPDAKG-SYVEVLERYVNLGPIVDFCVVDLERQGQG-QVVTCSGAYKDGSL 379
             D+++   N+    +G   + ++E   +L  + D  ++     G+  Q+ T  G   + SL
Sbjct:   401 DNEVGTKNVHFGVRGLQNLSLVEEIPSLYSLTDTLLMKAPSSGEANQLYTVCGRGSNSSL 460

Query:   380 RIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDTFLVVSFISETRILAMNLXXXXXX 438
             R +R G+   E  + EL G    +W+L+ +  D +D+++++SF + T  L +++      
Sbjct:   461 RQLRRGLETTEIVASELPGAPIAIWTLKLNQTDVYDSYIILSFTNGT--LVLSIGETVEE 518

Query:   439 XXXXGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANA 498
                 GF S   TL       + LVQ+    +R + +  +   +EWK P    V  +  N 
Sbjct:   519 ISDSGFLSSVSTLNARQMGRDSLVQIHPKGIRYIRANKQT--SEWKLPQDVYVVQSAIND 576

Query:   499 SQVLLATGGGHLVYLEIGD----GILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVG 553
              Q+++A   G LVY E+ D    G L E +  + L   ++ L + P+ E    S    + 
Sbjct:   577 MQIVVALSNGELVYFEMSDDVEGGQLNEYQERKTLTANVTSLALGPVQEGSRRSNFMCLA 636

Query:   554 MWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-LLCAF----EGIS--YLLCALGDGHLL 606
                D +VR+ SL DL   T E+L  + +      LC       G+S  YL   L +G  L
Sbjct:   637 C-DDATVRVLSL-DL-YTTLENLSVQALSSPANSLCIIPMNVNGVSTLYLHIGLMNGVYL 693

Query:   607 NFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVN 666
               ++++ +G+L D +   LG + + +   + KN   V A S R  + YS  + L  S + 
Sbjct:   694 RTVIDVTSGQLLDTRTRFLGPRAVKIYPITMKNQNTVLAVSSRTFLAYSYQQNLQLSPIA 753

Query:   667 LKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK-LHIRSIPLGEHPRRI 718
                + H   F S   P+ +   ++  L I T+D +Q  L     PL   PR+I
Sbjct:   754 YSAIDHASSFASEQCPEGIVAIQKNTLKIFTVDSLQDDLKSDIYPLICTPRKI 806

 Score = 205 (77.2 bits), Expect = 7.9e-46, Sum P(4) = 7.9e-46
 Identities = 83/371 (22%), Positives = 161/371 (43%)

Query:   733 KNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVL 792
             K      +S + F+ + D  + + I   PL   E   S+ +  F +    +   G+A  +
Sbjct:   844 KQNEHTSKSWVSFISVFDMISKKIIHESPLGDNEAAFSMTAAFFKNRDEFFLVAGSATNM 903

Query:   793 PEENEP-TKGRILVFIVED-GK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM 849
               E    + G   V+   D GK L+LI+  E  G   +L  F G++LA + + +++Y   
Sbjct:   904 DLECRTCSHGNFRVYRFHDEGKKLELISHTEIDGIPMALTPFQGRMLAGVGRFLRIY--- 960

Query:   850 LRDDGTRELQSECGHHGHI--LALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 907
               D G +++  + G    +     ++  +   IVV D   S+  ++YK E+  +   A D
Sbjct:   961 --DLGNKKMLRK-GELSAVPLFITHITVQASRIVVADSQYSVRFVVYKPEDNHLLTFADD 1017

Query:   908 YNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDEERGRLEVVGEYHLGEFVNR 965
                 W +   ++D D   G +   N++ +R  ++     DEE    +++   H   F+N 
Sbjct:  1018 TIHRWTTTNVLVDYDTLAGGDKFGNIWLLRCPEHVSKLADEENSESKLI---HEKPFLNS 1074

Query:   966 FRHG-SLVMRLPDSDV-----------GQIPTVIFGTVNGVIGVIAS-LPHEQYLFLEKL 1012
               H   L+     +D+           G    +++  + G +GV    +  E   F ++L
Sbjct:  1075 TPHKLDLMAHFFTNDIPTSLQKVQLVEGAREVLLWTGLLGTVGVFTPFINQEDVRFFQQL 1134

Query:  1013 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1072
             +  LRK    + G +H  +RS+    K V     +DGDL E +  L     + I+  ++ 
Sbjct:  1135 EFLLRKECPPLAGRDHLAYRSYYAPVKCV-----IDGDLCEMYYSLPHPVQEMIANELDR 1189

Query:  1073 SVEELCKRVEE 1083
             ++ E+ K++E+
Sbjct:  1190 TIAEVSKKIED 1200

 Score = 82 (33.9 bits), Expect = 7.9e-46, Sum P(4) = 7.9e-46
 Identities = 21/101 (20%), Positives = 51/101 (50%)

Query:     2 SIWNYVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLL--TPQGLQPMLDVPIY 59
             S++ Y +T      V  SC  + +  +   ++IA  +R+ I+ +  T   +  +L+   +
Sbjct:     6 SLFLYSLTIQNSNYVQSSCAASLSGKKAQEIVIATESRLLIYKVDATDGRMNCILNQNCF 65

Query:    60 GRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELI 100
             G I  +   R  G  +D+L + ++  +  +L+++ E ++L+
Sbjct:    66 GIIRNVAPLRLTGFKRDYLVVTSDSGRITILEYNVEKNKLV 106

 Score = 64 (27.6 bits), Expect = 7.9e-46, Sum P(4) = 7.9e-46
 Identities = 13/54 (24%), Positives = 30/54 (55%)

Query:   298 GETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPI 351
             G   +++ +  +  +  Y+  + GD  L+KL ++ D +G+ VE+  +Y +  P+
Sbjct:   302 GPLIVSAVLHKMKGSFFYLLQT-GDGDLLKLTIEHDGQGNVVELRLKYFDTVPL 354

 Score = 44 (20.5 bits), Expect = 1.0e-14, Sum P(3) = 1.0e-14
 Identities = 13/55 (23%), Positives = 22/55 (40%)

Query:   469 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEV 523
             +R  ++++  +   W      S N      S VL    G     L+ GDG L ++
Sbjct:   277 LRRQAASANAISTPWNQVNSNSANDGPLIVSAVLHKMKGSFFYLLQTGDGDLLKL 331

 Score = 44 (20.5 bits), Expect = 1.0e-14, Sum P(3) = 1.0e-14
 Identities = 16/66 (24%), Positives = 29/66 (43%)

Query:   584 SVLLCAFEGISYLLCALGDGHLLNFLLNMK-TGELTDRKKVSLGTQPITLRTFSSKNTTH 642
             S +L   +G  + L   GDG LL   +     G + + +     T P+ ++    K T  
Sbjct:   307 SAVLHKMKGSFFYLLQTGDGDLLKLTIEHDGQGNVVELRLKYFDTVPLAVQLNILK-TGF 365

Query:   643 VFAASD 648
             +F A++
Sbjct:   366 LFVATE 371

 Score = 42 (19.8 bits), Expect = 1.6e-14, Sum P(3) = 1.6e-14
 Identities = 18/59 (30%), Positives = 25/59 (42%)

Query:   318 SSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD 376
             +S  D  LI   +    KGS+  +L+     G   D   + +E  GQG VV     Y D
Sbjct:   297 NSANDGPLIVSAVLHKMKGSFFYLLQ--TGDG---DLLKLTIEHDGQGNVVELRLKYFD 350

 Score = 37 (18.1 bits), Expect = 5.2e-14, Sum P(3) = 5.2e-14
 Identities = 11/26 (42%), Positives = 15/26 (57%)

Query:   502 LLATGGGHLVYLEI---GDGILTEVK 524
             LL TG G L+ L I   G G + E++
Sbjct:   320 LLQTGDGDLLKLTIEHDGQGNVVELR 345


>RGD|1311636 [details] [associations]
            symbol:Sf3b3 "splicing factor 3b, subunit 3" species:10116
            "Rattus norvegicus" [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0005689
            "U12-type spliceosomal complex" evidence=ISO] [GO:0071013
            "catalytic step 2 spliceosome" evidence=ISO] InterPro:IPR004871
            Pfam:PF03178 RGD:1311636 GO:GO:0005634 GO:GO:0003676
            IPI:IPI00563335 PRIDE:F1LSZ9 Ensembl:ENSRNOT00000044193
            UCSC:RGD:1311636 ArrayExpress:F1LSZ9 Uniprot:F1LSZ9
        Length = 902

 Score = 337 (123.7 bits), Expect = 1.1e-43, Sum P(3) = 1.1e-43
 Identities = 112/408 (27%), Positives = 188/408 (46%)

Query:   347 NLGPIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWS 404
             +L PI+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W+
Sbjct:   183 SLSPIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWT 241

Query:   405 LRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQV 464
             +R   +D FD +++VSF++ T  L +++          GF   T TL C     + LVQV
Sbjct:   242 VRRHIEDEFDAYIIVSFVNAT--LVLSIGETVEEVTDSGFLGTTPTLSCSLLGDDALVQV 299

Query:   465 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE- 522
                 +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E 
Sbjct:   300 YPDGIRHIRADKRV--NEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEY 357

Query:   523 VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGE 579
              +  ++  ++ C+ +  +      S+  AVG+  D +VRI SL P   L  ++ + L  +
Sbjct:   358 TERKEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCLQPLSMQALPAQ 416

Query:   580 IIPRSVLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKV 623
               P S  LC  E                G  YL   L +G LL  +L+  TG+L+D +  
Sbjct:   417 --PES--LCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTR 472

Query:   624 SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 683
              LG++P+ L     +    V A S R  + YS   +   + ++ + +     F S   P+
Sbjct:   473 YLGSRPVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPE 532

Query:   684 SLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAI 729
              +       L I  ++ +  +  + + PL   PR+ + H E +    I
Sbjct:   533 GIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIII 580

 Score = 222 (83.2 bits), Expect = 1.1e-43, Sum P(3) = 1.1e-43
 Identities = 75/299 (25%), Positives = 143/299 (47%)

Query:   807 IVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHH 865
             +V +G KL+ + +   +    ++  F G++L  + + +++Y     D G ++L  +C  +
Sbjct:   615 LVNNGEKLEFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-EN 668

Query:   866 GHILALYV---QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD 922
              HI A Y+   QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD D
Sbjct:   669 KHI-ANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYD 727

Query:   923 IYLGAENNFNLFTVR--KNSEGATDEE---------RGRLEVVGEYHLGEFVNRFRHGSL 971
                GA+   N+  VR   N+    DE+         RG L   G     E +  +  G  
Sbjct:   728 TVAGADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLN--GASQKAEVIMNYHVGET 785

Query:   972 VMRLPDSDV--GQIPTVIFGTVNGVIGVIASLP-HEQYLFLEKLQTNLRKVIKGVGGLNH 1028
             V+ L  + +  G   ++++ T++G IG++     HE + F + ++ +LR     + G +H
Sbjct:   786 VLSLQKTTLIPGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDH 845

Query:  1029 EQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL-TR 1086
               +RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++ TR
Sbjct:   846 LSFRSYY-----FPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDIRTR 899

 Score = 40 (19.1 bits), Expect = 1.1e-43, Sum P(3) = 1.1e-43
 Identities = 12/41 (29%), Positives = 18/41 (43%)

Query:   345 YVNLGPIVDF-CVV-----DLERQGQGQVVTCSGAYKDGSL 379
             Y N G   D  C +     DL+   +G +  CS  +K  S+
Sbjct:    38 YKNFGDQPDIRCPIPRRRNDLDDPERGMIFVCSATHKTKSM 78

 Score = 38 (18.4 bits), Expect = 3.8e-12, Sum P(3) = 3.8e-12
 Identities = 17/48 (35%), Positives = 22/48 (45%)

Query:   728 AICSLKNQSCAEESEM--HFVRLL-----DDQTFEFISTYPL---DTF 765
             A+C LK       SE   H++  +     DD+  EF S  PL   DTF
Sbjct:   118 AMCVLKTGFLFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTF 165

 Score = 38 (18.4 bits), Expect = 3.8e-12, Sum P(3) = 3.8e-12
 Identities = 16/56 (28%), Positives = 20/56 (35%)

Query:   711 LGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFE 766
             L E+  R   +E S      SL N    E+        L D T   IS  P D  +
Sbjct:   354 LNEYTER---KEMSADVVCMSLANVPPGEQRSRFLAVGLVDNTVRIISLDPSDCLQ 406

 Score = 37 (18.1 bits), Expect = 7.6e-12, Sum P(3) = 7.6e-12
 Identities = 8/19 (42%), Positives = 11/19 (57%)

Query:   577 GGEIIPRSVLLCAFEGISY 595
             GG   P  VL+C+   I+Y
Sbjct:    20 GGSDGPSGVLICSENYITY 38

 Score = 37 (18.1 bits), Expect = 7.6e-12, Sum P(3) = 7.6e-12
 Identities = 9/33 (27%), Positives = 15/33 (45%)

Query:   233 GEETIVYCSANAFKAIPIRPSITKAYGRVDADG 265
             G++TIV C+ N  + +         Y  +D  G
Sbjct:   320 GKKTIVKCAVNQRQVVIALTGGELVYFEMDPSG 352


>GENEDB_PFALCIPARUM|PFL1680w [details] [associations]
            symbol:PFL1680w "splicing factor 3b, subunit 3,
            130kD, putative" species:5833 "Plasmodium falciparum" [GO:0005681
            "spliceosomal complex" evidence=ISS] [GO:0008380 "RNA splicing"
            evidence=ISS] InterPro:IPR017986 InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 GO:GO:0008380 Gene3D:2.130.10.10
            SUPFAM:SSF50978 GO:GO:0005681 GO:GO:0003676 EMBL:AE014188 KO:K12830
            HOGENOM:HOG000216677 RefSeq:XP_001350742.1
            ProteinModelPortal:Q8I574 PRIDE:Q8I574
            EnsemblProtists:PFL1680w:mRNA GeneID:811388 KEGG:pfa:PFL1680w
            EuPathDB:PlasmoDB:PF3D7_1234800 OMA:PVTSSMC ProtClustDB:CLSZ2733835
            Uniprot:Q8I574
        Length = 1329

 Score = 280 (103.6 bits), Expect = 1.2e-42, Sum P(4) = 1.2e-42
 Identities = 90/371 (24%), Positives = 161/371 (43%)

Query:   341 VLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI- 399
             ++++  +L PI+D  ++D +     Q+ T  G     SLRI+++G+ I E A  EL G  
Sbjct:   429 LVDQIYSLSPILDMKIIDAKNTHTPQIYTLCGRGPRSSLRILQHGLSIEELADNELPGKP 488

Query:   400 KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYN 459
             K +W+++      +D ++VVSF   T IL +               +   TL  +    N
Sbjct:   489 KYIWTIKKDNLSEYDGYIVVSFEGNTLILEIGESVEEVSDTL--LLNNVTTLHINILYDN 546

Query:   460 QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG- 518
               +QV    +R ++    ++  EW +P    +  A++N+SQ++++  GG L+Y EI +  
Sbjct:   547 SFIQVYDTGIRHING---KVVQEWVAPKNKQIKAASSNSSQIVISLSGGELIYFEIDESH 603

Query:   519 ILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG 578
              L E+    L  E+ CL I  I  N   +   AVG   ++ VR+ S+ + +   K+ L  
Sbjct:   604 TLVEIFRKNLNVEVLCLSIQQIPPNRVRANFLAVGCLDNV-VRLLSI-EKDKYFKQ-LST 660

Query:   579 EIIPRSVL---LCAFE-----------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVS 624
              ++P +     +C  E            I +L   L  G LL  +++   G L++     
Sbjct:   661 HLLPNNSSPQDICISEMNDNGNTMKERNIIFLNIGLNTGVLLRSIIDPVAGTLSNHYSKY 720

Query:   625 LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS 684
             LG + I +   +      +    ++  + Y    K LYS +N   + +   F S    D 
Sbjct:   721 LGAKSIKICPVNVNKNPALLVLCEKTYLCYMHQGKFLYSPLNYDMLEYASSFYSPQCSDG 780

Query:   685 LAIAKEGELTI 695
                     L I
Sbjct:   781 YVAISSNSLRI 791

 Score = 196 (74.1 bits), Expect = 1.2e-42, Sum P(4) = 1.2e-42
 Identities = 81/358 (22%), Positives = 155/358 (43%)

Query:   746 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC--VGTAYVLPEENEP-TKGR 802
             +++++    + +    LD  E   S+ +C         +C  VGT   L  + +  T   
Sbjct:   983 IKIINPVNLQILDKISLDMEEAALSVCACELE----ALHCLIVGTTTNLSLKTKSLTSAS 1038

Query:   803 ILVFIVE-DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 861
             + V+  +   KL L+     +   Y   ++NGKL+A+I  K+++Y       G ++L  +
Sbjct:  1039 LRVYTYDIQYKLNLLHITPIEEQPYCFCSYNGKLIASIGNKLRIYAL-----GKKKLLKK 1093

Query:   862 CGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILD 920
             C +     A+  ++  G+ I   D+ +S+ +  Y   +  +   + D    W++  EILD
Sbjct:  1094 CEYKDIPEAIVSIKISGNRIFACDIRESVLIFFYDPNQNTLRLISDDIIPRWITCSEILD 1153

Query:   921 DDIYLGAENNFNLFTVRKNSEGATDEE--RGRLEVVGEYHLGEFVNR-------FRHGSL 971
                 + A+   ++F +R   E   DE     +    GE       NR       F  G +
Sbjct:  1154 HHTIMAADKFDSVFILRVPEEAKQDEYGITNKCWYGGEIMNSSTKNRKLEHMMSFHIGEI 1213

Query:   972 VMRLPDSDVGQIPT----VIFGTVNGVIGVIASLPHEQYLFL-EKLQTNLRKVIKGVGGL 1026
             V  +    V   PT    +I+ T+ G IG      +++ L L + L+  LR     + G 
Sbjct:  1214 VTSM--QKVRLSPTSSECIIYSTIMGTIGAFIPYDNKEELELTQHLEIILRTEKPPLCGR 1271

Query:  1027 NHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1084
              H  +RS+ +       +N +DGDL E F  LS     +I+  +  + E++ +++E++
Sbjct:  1272 EHIFFRSYYHP-----VQNVVDGDLCEQFSSLSYDAQKKIANDLERTPEDILRKLEDI 1324

 Score = 132 (51.5 bits), Expect = 1.2e-42, Sum P(4) = 1.2e-42
 Identities = 34/127 (26%), Positives = 66/127 (51%)

Query:     6 YVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQG-LQPMLDVPIYGRIAT 64
             Y +T  KPT +T +  GNF+ P+   +I+AK   +E+     QG L  ++   I+G I +
Sbjct:     5 YHLTLQKPTAITKTVYGNFSGPRFHEIIVAKGQVLELLRSDKQGKLNVIISKDIFGIIRS 64

Query:    65 LELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDNGQIGII 123
             +  FR  G  +D++ I ++  +  +L+++ E ++ + R   +   + G R    G+   +
Sbjct:    65 ISTFRLTGSNKDYIVIGSDSGRLVILEYNNEKNDFV-RVHCETYGKTGIRRIIPGEYIAV 123

Query:   124 DPDCRLI 130
             DP  R +
Sbjct:   124 DPKGRAL 130

 Score = 62 (26.9 bits), Expect = 0.00018, Sum P(3) = 0.00018
 Identities = 28/135 (20%), Positives = 65/135 (48%)

Query:   954 VGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQ 1013
             + EY  G  V  F   +L++ + +S V ++   +   +N V  +  ++ ++   F++   
Sbjct:   499 LSEYD-GYIVVSFEGNTLILEIGES-VEEVSDTLL--LNNVTTLHINILYDNS-FIQVYD 553

Query:  1014 TNLR----KVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKT 1069
             T +R    KV++      ++Q ++ ++    +   +   G+LI   +D S T ++   K 
Sbjct:   554 TGIRHINGKVVQEWVAPKNKQIKAASSNSSQI-VISLSGGELIYFEIDESHTLVEIFRKN 612

Query:  1070 MNVSVEELCKRVEEL 1084
             +NV V  LC  ++++
Sbjct:   613 LNVEV--LCLSIQQI 625

 Score = 61 (26.5 bits), Expect = 0.00023, Sum P(3) = 0.00023
 Identities = 30/114 (26%), Positives = 47/114 (41%)

Query:   550 AAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFL 609
             A  G W    ++I +  +L ++ K  L  E    SV  C  E +  L+     G   N  
Sbjct:   974 AGQGKWGSC-IKIINPVNLQILDKISLDMEEAALSVCACELEALHCLIV----GTTTN-- 1026

Query:   610 LNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYS 663
             L++KT  LT    + + T  I  +     N  H+    ++P    S N KL+ S
Sbjct:  1027 LSLKTKSLTSAS-LRVYTYDIQYKL----NLLHITPIEEQPYCFCSYNGKLIAS 1075

 Score = 60 (26.2 bits), Expect = 0.00029, Sum P(3) = 0.00029
 Identities = 24/88 (27%), Positives = 44/88 (50%)

Query:   528 LEYEISCLDINPIGENP----SYSQ--IAAVGMWTDISVRIFSLPDLNLITK-EHLGGEI 580
             ++Y+++ L I PI E P    SY+   IA++G      +RI++L    L+ K E+   + 
Sbjct:  1046 IQYKLNLLHITPIEEQPYCFCSYNGKLIASIGN----KLRIYALGKKKLLKKCEY---KD 1098

Query:   581 IPRSVLLCAFEGISYLLCALGDGHLLNF 608
             IP +++     G     C + +  L+ F
Sbjct:  1099 IPEAIVSIKISGNRIFACDIRESVLIFF 1126

 Score = 47 (21.6 bits), Expect = 2.6e-27, Sum P(4) = 2.6e-27
 Identities = 16/86 (18%), Positives = 41/86 (47%)

Query:   995 IGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIES 1054
             I ++A +  +   + E  Q  ++K +K +   + E+ +  +NE   +    + +GD+   
Sbjct:   839 IRMLAIIEADHNSYDENTQREIQKALKDIKLSDTERRKENDNENNNI----YSNGDVDNI 894

Query:  1055 FLDLSRTRMDEISKTMNVSVEELCKR 1080
              ++ S    +E +   N+S+ +  K+
Sbjct:   895 DVNDSANMNEEFNSNDNISLAQNHKK 920

 Score = 39 (18.8 bits), Expect = 1.2e-42, Sum P(4) = 1.2e-42
 Identities = 8/41 (19%), Positives = 20/41 (48%)

Query:   282 ITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGD 322
             + HE   V  +  +      I ++IS L +  +++ + +G+
Sbjct:   347 VDHEDGIVKEIVCKYFDTVPIGNSISVLKSGSLFVAAEFGN 387


>UNIPROTKB|Q8I574 [details] [associations]
            symbol:PFL1680w "Splicing factor 3b, subunit 3, 130kD,
            putative" species:36329 "Plasmodium falciparum 3D7" [GO:0005681
            "spliceosomal complex" evidence=ISS] [GO:0008380 "RNA splicing"
            evidence=ISS] InterPro:IPR017986 InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 GO:GO:0008380 Gene3D:2.130.10.10
            SUPFAM:SSF50978 GO:GO:0005681 GO:GO:0003676 EMBL:AE014188 KO:K12830
            HOGENOM:HOG000216677 RefSeq:XP_001350742.1
            ProteinModelPortal:Q8I574 PRIDE:Q8I574
            EnsemblProtists:PFL1680w:mRNA GeneID:811388 KEGG:pfa:PFL1680w
            EuPathDB:PlasmoDB:PF3D7_1234800 OMA:PVTSSMC ProtClustDB:CLSZ2733835
            Uniprot:Q8I574
        Length = 1329

 Score = 280 (103.6 bits), Expect = 1.2e-42, Sum P(4) = 1.2e-42
 Identities = 90/371 (24%), Positives = 161/371 (43%)

Query:   341 VLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI- 399
             ++++  +L PI+D  ++D +     Q+ T  G     SLRI+++G+ I E A  EL G  
Sbjct:   429 LVDQIYSLSPILDMKIIDAKNTHTPQIYTLCGRGPRSSLRILQHGLSIEELADNELPGKP 488

Query:   400 KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYN 459
             K +W+++      +D ++VVSF   T IL +               +   TL  +    N
Sbjct:   489 KYIWTIKKDNLSEYDGYIVVSFEGNTLILEIGESVEEVSDTL--LLNNVTTLHINILYDN 546

Query:   460 QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG- 518
               +QV    +R ++    ++  EW +P    +  A++N+SQ++++  GG L+Y EI +  
Sbjct:   547 SFIQVYDTGIRHING---KVVQEWVAPKNKQIKAASSNSSQIVISLSGGELIYFEIDESH 603

Query:   519 ILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG 578
              L E+    L  E+ CL I  I  N   +   AVG   ++ VR+ S+ + +   K+ L  
Sbjct:   604 TLVEIFRKNLNVEVLCLSIQQIPPNRVRANFLAVGCLDNV-VRLLSI-EKDKYFKQ-LST 660

Query:   579 EIIPRSVL---LCAFE-----------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVS 624
              ++P +     +C  E            I +L   L  G LL  +++   G L++     
Sbjct:   661 HLLPNNSSPQDICISEMNDNGNTMKERNIIFLNIGLNTGVLLRSIIDPVAGTLSNHYSKY 720

Query:   625 LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS 684
             LG + I +   +      +    ++  + Y    K LYS +N   + +   F S    D 
Sbjct:   721 LGAKSIKICPVNVNKNPALLVLCEKTYLCYMHQGKFLYSPLNYDMLEYASSFYSPQCSDG 780

Query:   685 LAIAKEGELTI 695
                     L I
Sbjct:   781 YVAISSNSLRI 791

 Score = 196 (74.1 bits), Expect = 1.2e-42, Sum P(4) = 1.2e-42
 Identities = 81/358 (22%), Positives = 155/358 (43%)

Query:   746 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC--VGTAYVLPEENEP-TKGR 802
             +++++    + +    LD  E   S+ +C         +C  VGT   L  + +  T   
Sbjct:   983 IKIINPVNLQILDKISLDMEEAALSVCACELE----ALHCLIVGTTTNLSLKTKSLTSAS 1038

Query:   803 ILVFIVE-DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 861
             + V+  +   KL L+     +   Y   ++NGKL+A+I  K+++Y       G ++L  +
Sbjct:  1039 LRVYTYDIQYKLNLLHITPIEEQPYCFCSYNGKLIASIGNKLRIYAL-----GKKKLLKK 1093

Query:   862 CGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILD 920
             C +     A+  ++  G+ I   D+ +S+ +  Y   +  +   + D    W++  EILD
Sbjct:  1094 CEYKDIPEAIVSIKISGNRIFACDIRESVLIFFYDPNQNTLRLISDDIIPRWITCSEILD 1153

Query:   921 DDIYLGAENNFNLFTVRKNSEGATDEE--RGRLEVVGEYHLGEFVNR-------FRHGSL 971
                 + A+   ++F +R   E   DE     +    GE       NR       F  G +
Sbjct:  1154 HHTIMAADKFDSVFILRVPEEAKQDEYGITNKCWYGGEIMNSSTKNRKLEHMMSFHIGEI 1213

Query:   972 VMRLPDSDVGQIPT----VIFGTVNGVIGVIASLPHEQYLFL-EKLQTNLRKVIKGVGGL 1026
             V  +    V   PT    +I+ T+ G IG      +++ L L + L+  LR     + G 
Sbjct:  1214 VTSM--QKVRLSPTSSECIIYSTIMGTIGAFIPYDNKEELELTQHLEIILRTEKPPLCGR 1271

Query:  1027 NHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1084
              H  +RS+ +       +N +DGDL E F  LS     +I+  +  + E++ +++E++
Sbjct:  1272 EHIFFRSYYHP-----VQNVVDGDLCEQFSSLSYDAQKKIANDLERTPEDILRKLEDI 1324

 Score = 132 (51.5 bits), Expect = 1.2e-42, Sum P(4) = 1.2e-42
 Identities = 34/127 (26%), Positives = 66/127 (51%)

Query:     6 YVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQG-LQPMLDVPIYGRIAT 64
             Y +T  KPT +T +  GNF+ P+   +I+AK   +E+     QG L  ++   I+G I +
Sbjct:     5 YHLTLQKPTAITKTVYGNFSGPRFHEIIVAKGQVLELLRSDKQGKLNVIISKDIFGIIRS 64

Query:    65 LELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDNGQIGII 123
             +  FR  G  +D++ I ++  +  +L+++ E ++ + R   +   + G R    G+   +
Sbjct:    65 ISTFRLTGSNKDYIVIGSDSGRLVILEYNNEKNDFV-RVHCETYGKTGIRRIIPGEYIAV 123

Query:   124 DPDCRLI 130
             DP  R +
Sbjct:   124 DPKGRAL 130

 Score = 62 (26.9 bits), Expect = 0.00018, Sum P(3) = 0.00018
 Identities = 28/135 (20%), Positives = 65/135 (48%)

Query:   954 VGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQ 1013
             + EY  G  V  F   +L++ + +S V ++   +   +N V  +  ++ ++   F++   
Sbjct:   499 LSEYD-GYIVVSFEGNTLILEIGES-VEEVSDTLL--LNNVTTLHINILYDNS-FIQVYD 553

Query:  1014 TNLR----KVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKT 1069
             T +R    KV++      ++Q ++ ++    +   +   G+LI   +D S T ++   K 
Sbjct:   554 TGIRHINGKVVQEWVAPKNKQIKAASSNSSQI-VISLSGGELIYFEIDESHTLVEIFRKN 612

Query:  1070 MNVSVEELCKRVEEL 1084
             +NV V  LC  ++++
Sbjct:   613 LNVEV--LCLSIQQI 625

 Score = 61 (26.5 bits), Expect = 0.00023, Sum P(3) = 0.00023
 Identities = 30/114 (26%), Positives = 47/114 (41%)

Query:   550 AAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFL 609
             A  G W    ++I +  +L ++ K  L  E    SV  C  E +  L+     G   N  
Sbjct:   974 AGQGKWGSC-IKIINPVNLQILDKISLDMEEAALSVCACELEALHCLIV----GTTTN-- 1026

Query:   610 LNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYS 663
             L++KT  LT    + + T  I  +     N  H+    ++P    S N KL+ S
Sbjct:  1027 LSLKTKSLTSAS-LRVYTYDIQYKL----NLLHITPIEEQPYCFCSYNGKLIAS 1075

 Score = 60 (26.2 bits), Expect = 0.00029, Sum P(3) = 0.00029
 Identities = 24/88 (27%), Positives = 44/88 (50%)

Query:   528 LEYEISCLDINPIGENP----SYSQ--IAAVGMWTDISVRIFSLPDLNLITK-EHLGGEI 580
             ++Y+++ L I PI E P    SY+   IA++G      +RI++L    L+ K E+   + 
Sbjct:  1046 IQYKLNLLHITPIEEQPYCFCSYNGKLIASIGN----KLRIYALGKKKLLKKCEY---KD 1098

Query:   581 IPRSVLLCAFEGISYLLCALGDGHLLNF 608
             IP +++     G     C + +  L+ F
Sbjct:  1099 IPEAIVSIKISGNRIFACDIRESVLIFF 1126

 Score = 47 (21.6 bits), Expect = 2.6e-27, Sum P(4) = 2.6e-27
 Identities = 16/86 (18%), Positives = 41/86 (47%)

Query:   995 IGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIES 1054
             I ++A +  +   + E  Q  ++K +K +   + E+ +  +NE   +    + +GD+   
Sbjct:   839 IRMLAIIEADHNSYDENTQREIQKALKDIKLSDTERRKENDNENNNI----YSNGDVDNI 894

Query:  1055 FLDLSRTRMDEISKTMNVSVEELCKR 1080
              ++ S    +E +   N+S+ +  K+
Sbjct:   895 DVNDSANMNEEFNSNDNISLAQNHKK 920

 Score = 39 (18.8 bits), Expect = 1.2e-42, Sum P(4) = 1.2e-42
 Identities = 8/41 (19%), Positives = 20/41 (48%)

Query:   282 ITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGD 322
             + HE   V  +  +      I ++IS L +  +++ + +G+
Sbjct:   347 VDHEDGIVKEIVCKYFDTVPIGNSISVLKSGSLFVAAEFGN 387


>UNIPROTKB|F5GZ34 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9606
            "Homo sapiens" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0016055 "Wnt receptor
            signaling pathway" evidence=IEA] InterPro:IPR004871 Pfam:PF03178
            GO:GO:0005634 GO:GO:0016055 GO:GO:0003684 EMBL:AP003108
            HGNC:HGNC:2717 ChiTaRS:DDB1 EMBL:AP003037 IPI:IPI01010454
            ProteinModelPortal:F5GZ34 SMR:F5GZ34 Ensembl:ENST00000538470
            ArrayExpress:F5GZ34 Bgee:F5GZ34 Uniprot:F5GZ34
        Length = 187

 Score = 455 (165.2 bits), Expect = 6.0e-42, P = 6.0e-42
 Identities = 98/187 (52%), Positives = 124/187 (66%)

Query:   913 MSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLV 972
             MSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN F HGSLV
Sbjct:     1 MSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFCHGSLV 60

Query:   973 MR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 1031
             M+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG + H  W
Sbjct:    61 MQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFW 120

Query:  1032 RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSVEELCKRV 1081
             RSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + ++L K V
Sbjct:   121 RSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKVV 180

Query:  1082 EELTRLH 1088
             EELTR+H
Sbjct:   181 EELTRIH 187


>UNIPROTKB|F5GZY8 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9606
            "Homo sapiens" [GO:0016055 "Wnt receptor signaling pathway"
            evidence=IEA] GO:GO:0016055 GO:GO:0003684 EMBL:AP003108
            HGNC:HGNC:2717 ChiTaRS:DDB1 EMBL:AP003037 IPI:IPI01011929
            ProteinModelPortal:F5GZY8 SMR:F5GZY8 Ensembl:ENST00000542337
            ArrayExpress:F5GZY8 Bgee:F5GZY8 Uniprot:F5GZY8
        Length = 146

 Score = 453 (164.5 bits), Expect = 9.9e-42, P = 9.9e-42
 Identities = 85/144 (59%), Positives = 114/144 (79%)

Query:     4 WNYVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYGRIA 63
             +NYVVTA KPT V     G+FTS ++LNL+IAK TR+EI+++T +GL+P+ +V +YG+IA
Sbjct:     3 YNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVVTAEGLRPVKEVGMYGKIA 62

Query:    64 TLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIGRPTDNGQIG 121
              +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIGRP++ G IG
Sbjct:    63 VMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIGRPSETGIIG 122

Query:   122 IIDPDCRLIGLHLYDGLFKVIPFD 145
             IIDP+CR+IGL LYDGLFKVIP D
Sbjct:   123 IIDPECRMIGLRLYDGLFKVIPLD 146


>TAIR|locus:2081576 [details] [associations]
            symbol:AT3G11960 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA;ISS] [GO:0005634
            "nucleus" evidence=ISM;IEA;ISS] [GO:0008150 "biological_process"
            evidence=ND] [GO:0000956 "nuclear-transcribed mRNA catabolic
            process" evidence=RCA] [GO:0006486 "protein glycosylation"
            evidence=RCA] [GO:0009755 "hormone-mediated signaling pathway"
            evidence=RCA] [GO:0010182 "sugar mediated signaling pathway"
            evidence=RCA] [GO:0048825 "cotyledon development" evidence=RCA]
            InterPro:IPR004871 Pfam:PF03178 GO:GO:0005634 EMBL:CP002686
            GenomeReviews:BA000014_GR GO:GO:0003676 KO:K12830 EMBL:BT006164
            EMBL:AK229623 IPI:IPI00530914 RefSeq:NP_187802.2 UniGene:At.5413
            ProteinModelPortal:Q84R20 IntAct:Q84R20 PaxDb:Q84R20 PRIDE:Q84R20
            EnsemblPlants:AT3G11960.1 GeneID:820369 KEGG:ath:AT3G11960
            TAIR:At3g11960 eggNOG:NOG322382 HOGENOM:HOG000030342
            InParanoid:Q84R20 OMA:GMLLRFE PhylomeDB:Q84R20
            ProtClustDB:CLSN2690873 ArrayExpress:Q84R20 Genevestigator:Q84R20
            Uniprot:Q84R20
        Length = 1379

 Score = 222 (83.2 bits), Expect = 6.6e-31, Sum P(4) = 6.6e-31
 Identities = 56/209 (26%), Positives = 102/209 (48%)

Query:   347 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE--QASVELQGIKGMWS 404
             N+ PI+DF V+D + + + Q+  C G   +GSLRI+R+GI + +  + +   QGI G W+
Sbjct:   468 NIAPILDFSVMDDQNEKRDQIFACCGVTPEGSLRIIRSGINVEKLLKTAPVYQGITGTWT 527

Query:   405 LRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAIYNQLVQV 464
             ++    D + +FLV+SF+ ETR+L++ L          GF S   T  C       LVQ+
Sbjct:   528 VKMKLTDVYHSFLVLSFVEETRVLSVGLSFKDVTDSV-GFQSDVCTFACGLVADGLLVQI 586

Query:   465 TSGSVRLVSST----------SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYL- 513
                ++RL   T          S    + W  P   S+++     + ++++T     + + 
Sbjct:   587 HQDAIRLCMPTMDAHSDGIPVSSPFFSSW-FPENVSISLGAVGQNLIVVSTSNPCFLSIL 645

Query:   514 ---EIGDGI--LTEVKHAQLEYEISCLDI 537
                 +      + E++   L+YE+SC+ +
Sbjct:   646 GVKSVSSQCCEIYEIQRVTLQYEVSCISV 674

 Score = 138 (53.6 bits), Expect = 6.6e-31, Sum P(4) = 6.6e-31
 Identities = 43/192 (22%), Positives = 88/192 (45%)

Query:   625 LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS 684
             +G  P+ L  FS    + + A SDRP ++ ++ + L Y++++ +  +H  P  S   P  
Sbjct:   822 IGITPVFLVPFSDSLDSDIIALSDRPWLLQTARQSLSYTSISFQPSTHATPVCSFECPQG 881

Query:   685 LAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK-NQSCAEESEM 743
             +    E  L +  +   ++ + +   LG  PR++ +  +S+   +       +C  +   
Sbjct:   882 ILFVSENCLHLVEMVHSKRRNAQKFQLGGTPRKVIYHSESKLLIVMRTDLYDTCTSD--- 938

Query:   744 HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAY-----VLPE-ENE 797
               +  +D  +   +S+Y L   E G S+      ++  +   VGT+      +LP  E E
Sbjct:   939 --ICCVDPLSGSVLSSYKLKPGETGKSMELVRVGNEHVLV--VGTSLSSGPAILPSGEAE 994

Query:   798 PTKGRILVFIVE 809
              TKGR+++  +E
Sbjct:   995 STKGRVIILCLE 1006

 Score = 121 (47.7 bits), Expect = 6.6e-31, Sum P(4) = 6.6e-31
 Identities = 38/126 (30%), Positives = 61/126 (48%)

Query:   957 YHLGEFVNRFRHGSLVMRLPDSDV----G---QIPT----VIFGTVNGVIGVIASLPHEQ 1005
             Y++GE     + G  + +LP  DV    G    I T    +I GT+ G I V A +  E+
Sbjct:  1220 YYMGEIAMSIKKGCNIYKLPADDVLRSYGLSKSIDTADDTIIAGTLLGSIFVFAPISSEE 1279

Query:  1006 YLFLEKLQTNL--RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRM 1063
             Y  LE +Q  L    +   V G +H ++R   N  +   A+  LDGD++  FL+L+  + 
Sbjct:  1280 YELLEGVQAKLGIHPLTAPVLGNDHNEFRGRENPSQ---ARKILDGDMLAQFLELTNRQQ 1336

Query:  1064 DEISKT 1069
             + +  T
Sbjct:  1337 ESVLST 1342

 Score = 56 (24.8 bits), Expect = 6.6e-31, Sum P(4) = 6.6e-31
 Identities = 14/55 (25%), Positives = 28/55 (50%)

Query:    12 KPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGL-QPMLDVPIYGRIATL 65
             +P+ V     G F SP   +++  K T IE+ ++   G+ + + +  ++G I  L
Sbjct:    43 RPSVVLQVAYGYFRSPSSRDIVFGKETCIELVVIGEDGIVESVCEQYVFGTIKDL 97

 Score = 43 (20.2 bits), Expect = 2.3e-12, Sum P(4) = 2.3e-12
 Identities = 19/70 (27%), Positives = 32/70 (45%)

Query:   501 VLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPI-------GENPSYSQI-AAV 552
             +L   GG    + E+ DG + ++   +L +  S  +I PI        +N    QI A  
Sbjct:   433 ILWIEGGFLATFAEMADGTVFKLGTEKLHWMSSIQNIAPILDFSVMDDQNEKRDQIFACC 492

Query:   553 GMWTDISVRI 562
             G+  + S+RI
Sbjct:   493 GVTPEGSLRI 502


>UNIPROTKB|F5H198 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9606
            "Homo sapiens" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR004871 Pfam:PF03178
            GO:GO:0005634 GO:GO:0003676 EMBL:AP003108 HGNC:HGNC:2717
            ChiTaRS:DDB1 EMBL:AP003037 IPI:IPI01014884
            ProteinModelPortal:F5H198 SMR:F5H198 Ensembl:ENST00000543658
            ArrayExpress:F5H198 Bgee:F5H198 Uniprot:F5H198
        Length = 162

 Score = 203 (76.5 bits), Expect = 1.8e-29, Sum P(2) = 1.8e-29
 Identities = 38/67 (56%), Positives = 54/67 (80%)

Query:     4 WNYVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYGRIA 63
             +NYVVTA KPT V     G+FTS ++LNL+IAK TR+EI+++T +GL+P+ +V +YG+IA
Sbjct:     3 YNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVVTAEGLRPVKEVGMYGKIA 62

Query:    64 TLELFRP 70
              +ELFRP
Sbjct:    63 VMELFRP 69

 Score = 165 (63.1 bits), Expect = 1.8e-29, Sum P(2) = 1.8e-29
 Identities = 34/84 (40%), Positives = 50/84 (59%)

Query:   724 SRTFAICSLKNQ-SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV 782
             S+ F+  +  ++ S  EE E+H + ++D  TFE +  +     EY  S++SC    D N 
Sbjct:    79 SKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNT 138

Query:   783 YYCVGTAYVLPEENEPTKGRILVF 806
             Y+ VGTA V PEE EP +GRI+VF
Sbjct:   139 YFIVGTAMVYPEEAEPKQGRIVVF 162


>POMBASE|SPCC11E10.08 [details] [associations]
            symbol:rik1 "silencing protein Rik1" species:4896
            "Schizosaccharomyces pombe" [GO:0000790 "nuclear chromatin"
            evidence=TAS] [GO:0003677 "DNA binding" evidence=ISS] [GO:0005634
            "nucleus" evidence=IDA] [GO:0005737 "cytoplasm" evidence=IEA]
            [GO:0006348 "chromatin silencing at telomere" evidence=TAS]
            [GO:0007535 "donor selection" evidence=IMP] [GO:0030466 "chromatin
            silencing at silent mating-type cassette" evidence=IMP] [GO:0030702
            "chromatin silencing at centromere" evidence=IMP] [GO:0030989
            "dynein-driven meiotic oscillatory nuclear movement" evidence=IGI]
            [GO:0034613 "cellular protein localization" evidence=IMP]
            [GO:0035391 "maintenance of chromatin silencing at silent
            mating-type cassette" evidence=NAS] [GO:0043234 "protein complex"
            evidence=EXP] [GO:0043494 "CLRC ubiquitin ligase complex"
            evidence=IDA] [GO:0044732 "mitotic spindle pole body" evidence=IDA]
            [GO:0045141 "meiotic telomere clustering" evidence=IMP] [GO:0051572
            "negative regulation of histone H3-K4 methylation" evidence=IMP]
            [GO:0051574 "positive regulation of histone H3-K9 methylation"
            evidence=IMP] [GO:0000723 "telomere maintenance" evidence=IMP]
            InterPro:IPR004871 Pfam:PF03178 PomBase:SPCC11E10.08 GO:GO:0005737
            GO:GO:0034613 EMBL:CU329672 GenomeReviews:CU329672_GR GO:GO:0044732
            GO:GO:0003677 GO:GO:0006351 GO:GO:0016568 GO:GO:0000790
            GO:GO:0030702 GO:GO:0006348 GO:GO:0030989 GO:GO:0051572
            GO:GO:0045141 GO:GO:0051574 GO:GO:0007535 GO:GO:0035391
            GO:GO:0043494 EMBL:AF136156 PIR:T40859 RefSeq:NP_588204.1
            DIP:DIP-35634N IntAct:Q10426 STRING:Q10426
            EnsemblFungi:SPCC11E10.08.1 GeneID:2539050 KEGG:spo:SPCC11E10.08
            eggNOG:NOG255855 OrthoDB:EOG4Q887H NextBio:20800224 Uniprot:Q10426
        Length = 1040

 Score = 255 (94.8 bits), Expect = 1.7e-26, Sum P(3) = 1.7e-26
 Identities = 117/495 (23%), Positives = 219/495 (44%)

Query:   521 TEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKE-HLGGE 579
             TEV     E EISCLD +      +  QI  VG W+   V I +  D + I+        
Sbjct:   497 TEVARKVFESEISCLDFS------AQFQIG-VGFWSK-QVMILTFSDNSSISCAFQTNVP 548

Query:   580 IIPRSVLLCAFEGI----SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 635
              +PR+++L   EG+    + LL + G G   +++L  K   +    K   GT P++ R F
Sbjct:   549 SLPRNIIL---EGVGVDRNLLLVSSGSGEFKSYVL-FKNNLVFSETK-HFGTTPVSFRRF 603

Query:   636 SSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI 695
             +    T++   +D P ++Y  N  L Y  +++ +   +C F   +  D L     G L  
Sbjct:   604 TMNIGTYIICNNDCPHMVYGFNGALCYMPLSMPQSYDVCQFRDNSGKDFLISVSLGGLKF 663

Query:   696 GTIDDIQKLHIRSIPLGEHP-RRICHQEQS--RTFAICSLKNQSCAEESEMHFVRLLDDQ 752
               ++ + +L  R + L   P + I  Q +   RT        +S  E   +  V   DD 
Sbjct:   664 LQLNPLPELTPRKVLLEHVPLQAIIFQNKLLLRTLENRYEDYESYKENYHLELVDSYDDN 723

Query:   753 TFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILV--FIVED 810
             +F   S    +  E    +L  +   +S++   VGT+ +  ++  P  GR+++  F  E 
Sbjct:   724 SFRVFSFTENERCE---KVLKIN---ESSLL--VGTSIIEQDKLVPVNGRLILLEFEKEL 775

Query:   811 GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILA 870
               L++++      AV  L  +N + + A  Q++ + K  L ++    + S       +L 
Sbjct:   776 QSLKVVSSMVLSAAVIDLGVYNDRYIVAFGQQVAIVK--LTEERLM-IDSRISLGSIVLQ 832

Query:   871 LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 930
             L V+  G+ I + D +   +++ +  ++  +  R   +  N + A  + +  +Y+ A N+
Sbjct:   833 LIVE--GNEIAIADSIGRFTIMYFDGQKFIVVARYL-FGENIVKAA-LYEGTVYIIATNS 888

Query:   931 FNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT 990
               L  +R N +     +R   E V  YHL + V++F++      + +++    P ++F T
Sbjct:   889 GLLKLLRYNKDAKNFNDRFICESV--YHLHDKVSKFQN----FPITNTNSFLEPKMLFAT 942

Query:   991 VNGVIGVIASLPHEQ 1005
               G IG I SL  ++
Sbjct:   943 EIGAIGSIVSLKDKE 957

 Score = 109 (43.4 bits), Expect = 1.7e-26, Sum P(3) = 1.7e-26
 Identities = 43/197 (21%), Positives = 86/197 (43%)

Query:    19 SCVGNFTSPQELNLIIAKCTRIEIHLLTP-QGLQPMLDVPIYGRIATLELFRPHGEAQDF 77
             SC  +F S +   L++ +  +I I+L +   GLQ    +P++  +  +  +RP G  +D+
Sbjct:    18 SC--HFISSENC-LVLLQALKINIYLCSEVHGLQFFTSIPLFSTVKHIRPYRPPGLDRDY 74

Query:    78 LFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHL-YD 136
             LF+      +  + WD +  ++I      V  R+  P +         D R+  + L  D
Sbjct:    75 LFVVLNDDTYFSIYWDEDYQKVIVDHP-PVRYRVTFPWNRNAKSYCLVDLRMRAIFLSID 133

Query:   137 GL----FKVIPFDNK---GQ-LKEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDAR 188
              +     +++  + +   G+ +   F        + D+  L   + PT+VVL+ D  D  
Sbjct:   134 EISMICIRILSAEERLKTGRSIDSGFPFSFPVHLIYDMCILNDSSTPTLVVLHSDGLDC- 192

Query:   189 HVKTYEVALKDKDFVEG 205
             +V  + + L  K   +G
Sbjct:   193 YVTAFLLDLSSKSLGKG 209

 Score = 77 (32.2 bits), Expect = 1.7e-26, Sum P(3) = 1.7e-26
 Identities = 32/131 (24%), Positives = 63/131 (48%)

Query:   304 STISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVV--DLER 361
             ++++ +   +++IGS   +S+LI L+   D        ++   NLGPI D  V+  D+E+
Sbjct:   306 TSLNSIHEGLLFIGSKNSESKLINLSTLKD--------VDSIPNLGPIHDLLVLKNDIEK 357

Query:   362 QGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSF 421
                   + C+G  ++ SL   ++ + ++     ++ GI     L S  +      L + F
Sbjct:   358 S----FLVCAGTPRNASLIYFQHALKLDILGQTKISGILRAMVLPSYPEHK----LFLGF 409

Query:   422 ISETRILAMNL 432
              SET  +A N+
Sbjct:   410 PSET--VAFNI 418

 Score = 44 (20.5 bits), Expect = 0.00023, Sum P(4) = 0.00023
 Identities = 13/33 (39%), Positives = 17/33 (51%)

Query:  1042 DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSV 1074
             DAKNF D  + ES   L     D++SK  N  +
Sbjct:   899 DAKNFNDRFICESVYHLH----DKVSKFQNFPI 927

 Score = 42 (19.8 bits), Expect = 0.00023, Sum P(4) = 0.00023
 Identities = 11/40 (27%), Positives = 20/40 (50%)

Query:   807 IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY 846
             I ED +L+L     TK    +L+  NG+ +   +  + +Y
Sbjct:   418 IKEDFQLELDPSLSTKERTIALSGTNGEFVQVTSTFLCIY 457

 Score = 38 (18.4 bits), Expect = 1.8e-22, Sum P(3) = 1.8e-22
 Identities = 6/15 (40%), Positives = 12/15 (80%)

Query:   292 LKIELLGETSIASTI 306
             LK+++LG+T I+  +
Sbjct:   378 LKLDILGQTKISGIL 392


>WB|WBGene00022301 [details] [associations]
            symbol:cpsf-1 species:6239 "Caenorhabditis elegans"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0009792 "embryo development ending in
            birth or egg hatching" evidence=IMP] [GO:0000003 "reproduction"
            evidence=IMP] [GO:0040007 "growth" evidence=IMP] [GO:0002119
            "nematode larval development" evidence=IMP] [GO:0040018 "positive
            regulation of multicellular organism growth" evidence=IMP]
            [GO:0010171 "body morphogenesis" evidence=IMP] [GO:0040027
            "negative regulation of vulval development" evidence=IMP]
            InterPro:IPR004871 Pfam:PF03178 GO:GO:0009792 GO:GO:0040007
            GO:GO:0002119 GO:GO:0006378 GO:GO:0010171 GO:GO:0040018
            GO:GO:0000003 GO:GO:0003723 GO:GO:0040027 eggNOG:COG5161 KO:K14401
            GO:GO:0005847 GO:GO:0006379 GeneTree:ENSGT00550000075040
            OMA:NIGDNRY HOGENOM:HOG000007904 EMBL:FO081666 RefSeq:NP_500157.2
            ProteinModelPortal:Q9N4C2 MINT:MINT-3384281 STRING:Q9N4C2
            PaxDb:Q9N4C2 EnsemblMetazoa:Y76B12C.7.1 EnsemblMetazoa:Y76B12C.7.2
            GeneID:177003 KEGG:cel:CELE_Y76B12C.7 CTD:177003 WormBase:Y76B12C.7
            InParanoid:Q9N4C2 NextBio:894932 Uniprot:Q9N4C2
        Length = 1454

 Score = 214 (80.4 bits), Expect = 4.7e-26, Sum P(4) = 4.7e-26
 Identities = 81/314 (25%), Positives = 152/314 (48%)

Query:   791 VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 850
             V+PE ++PT  R         K++++ +KE KG V  L A NG LL  + QK+  + W  
Sbjct:  1153 VVPEPDQPTSNR---------KIKVLFDKEQKGPVTGLCAINGLLLCGMGQKV--FIWQF 1201

Query:   851 RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN- 909
             +D+    + S    H ++  L+  +     +  D  +S+SL+ ++ +  A+   +RD   
Sbjct:  1202 KDNDLMGI-SFLDMHYYVYQLH--SLRTIAIACDARESMSLIRFQEDNKAMSIASRDDRK 1258

Query:   910 -ANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERG-RLEVVGEYHLGEFVN 964
              A    A +++ D  ++G   ++   N+ T+   +  A +   G RL V    ++G  +N
Sbjct:  1259 CAQPPMASQLVVDGAHVGFLLSDETGNI-TMFNYAPEAPESNGGERLTVRAAINIGTNIN 1317

Query:   965 RF-R---HGSLVMRLPDSD----VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 1016
              F R   H SL ++L + D    + Q  T +F +++G  G +  L  + Y  L  LQT +
Sbjct:  1318 AFVRLRGHTSL-LQLNNEDEKEAIEQRMTTVFASLDGSFGFVRPLTEKSYRRLHFLQTFI 1376

Query:  1017 RKVIKGVGGLNHEQWRSFNNEKKTVD---AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1073
               V   + GL+ +  RS    +  V+   A+N +DGD++E +L LS     ++++ + V 
Sbjct:  1377 GSVTPQIAGLHIKGSRSAKPSQPIVNGRNARNLIDGDVVEQYLHLSLYDKTDLARRLGVG 1436

Query:  1074 VEELCKRVEELTRL 1087
                +   + +L R+
Sbjct:  1437 RYHIIDDLMQLRRM 1450

 Score = 119 (46.9 bits), Expect = 4.7e-26, Sum P(4) = 4.7e-26
 Identities = 36/131 (27%), Positives = 57/131 (43%)

Query:   124 DPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQ-----VLDIKFLYGCAKPTIV 178
             DP  R     +Y     ++PF    +   ++ I L+++      + D+ FL G  +PTI+
Sbjct:   145 DPSNRCAACLVYGKHIAILPFHENSKRIHSYVIPLKQIDPRLDNIADMVFLDGYYEPTIL 204

Query:   179 VLYQ--DNKDARHVKTYE--------VALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCG 228
              LY+       R    Y+        V + D+ F    W   NL      L+P+P PL G
Sbjct:   205 FLYEPIQTTPGRACVRYDTMCIMGVSVNIVDRQFAV-VWQTANLPMDCSQLLPIPKPLGG 263

Query:   229 VLIIGEETIVY 239
              L+ G  T+VY
Sbjct:   264 ALVFGSNTVVY 274

 Score = 92 (37.4 bits), Expect = 4.7e-26, Sum P(4) = 4.7e-26
 Identities = 58/234 (24%), Positives = 99/234 (42%)

Query:   342 LERYVNLGPIVDFCV----------VDLERQGQG-QVVTCSGAYKDGSLRIVRNGIGINE 390
             L+R  N+GP+   CV          VD +R+     +VT SG  K+G+L + +  +    
Sbjct:   443 LDRLRNVGPVKSMCVGRPNYMSNDLVDAKRRDPVFDLVTASGHGKNGALCVHQRSLRPEI 502

Query:   391 QASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQT 450
               S  L+G + +W++    ++    +L+VS +  T IL   L           F +   T
Sbjct:   503 ITSSLLEGAEQLWAVGRKENESHK-YLIVSRVRSTLIL--ELGEELVELEEQLFVTGEPT 559

Query:   451 LFCHDAIYNQL-VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH 509
             +   +     L VQVTS  + LV  T  +   E      + V  A+     V L T  G 
Sbjct:   560 VAAGELSQGALAVQVTSTCIALV--TDGQQMQEVHIDSNFPVIQASIVDPYVALLTQNGR 617

Query:   510 LVYLEIGDGILTEVKHAQL-EYEISCLDI---NPIGENPSYSQIAAVGMWTDIS 559
             L+  E+    + E  + QL E +IS       +   +N   +Q+ ++ ++ D S
Sbjct:   618 LLLYEL----VME-PYVQLREVDISATSFATWHATAQN--LTQLTSISIYADAS 664

 Score = 70 (29.7 bits), Expect = 4.2e-11, Sum P(4) = 4.2e-11
 Identities = 31/110 (28%), Positives = 56/110 (50%)

Query:   740 ESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPT 799
             +SE+   R+  D  F++   YP+   E G +I    +  +S+VY  V +   +P+   P+
Sbjct:   999 KSELRIARMHPD--FDYEMPYPVKKIEVGRTIHHVRYLMNSDVYAVVSS---IPK---PS 1050

Query:   800 KGRILVFIVEDGKLQLIAEKETKGAV-----YSLNAFNGKLLAAI-NQKI 843
               +I V ++ D K + I EK+    +     Y+LN F+ +  AA+ N +I
Sbjct:  1051 N-KIWV-VMNDDKQEEIHEKDENFVLPAPPKYTLNLFSSQDWAAVPNTEI 1098

 Score = 66 (28.3 bits), Expect = 4.7e-26, Sum P(4) = 4.7e-26
 Identities = 17/55 (30%), Positives = 30/55 (54%)

Query:   289 VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLE 343
             V  L+   + ETSIA +++      +++GS  GDSQL++  L    +   V+ L+
Sbjct:   344 VKSLEFSKVYETSIAYSLTVCAPGHLFVGSRLGDSQLLEYTLLKTTRDCAVKRLK 398

 Score = 49 (22.3 bits), Expect = 2.6e-24, Sum P(4) = 2.6e-24
 Identities = 14/40 (35%), Positives = 21/40 (52%)

Query:   304 STISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLE 343
             ST  Y+++  + +GS  GD  L  L L   + G  V+ LE
Sbjct:   311 STSVYMEDGRIAVGSRDGD--LFLLRLMTSSGGGTVKSLE 348

 Score = 43 (20.2 bits), Expect = 1.9e-08, Sum P(4) = 1.9e-08
 Identities = 27/107 (25%), Positives = 48/107 (44%)

Query:   648 DRPTVIYSSNK-KLLYSNVNLKEVSHMCPFNS---AAFPDSLAI--AKEGELT-IGTIDD 700
             D+PT   S+ K K+L+       V+ +C  N          + I   K+ +L  I  +D 
Sbjct:  1158 DQPT---SNRKIKVLFDKEQKGPVTGLCAINGLLLCGMGQKVFIWQFKDNDLMGISFLDM 1214

Query:   701 ---IQKLH-IRSIPLGEHPRR----ICHQEQSRTFAICSLKNQSCAE 739
                + +LH +R+I +    R     I  QE ++  +I S  ++ CA+
Sbjct:  1215 HYYVYQLHSLRTIAIACDARESMSLIRFQEDNKAMSIASRDDRKCAQ 1261

 Score = 40 (19.1 bits), Expect = 3.8e-08, Sum P(4) = 3.8e-08
 Identities = 8/20 (40%), Positives = 15/20 (75%)

Query:  1064 DEISKTMNVSVEELCKRVEE 1083
             DE ++ +N  +++LC+RV E
Sbjct:   837 DE-AEQLNTEMKQLCERVLE 855

 Score = 39 (18.8 bits), Expect = 4.8e-08, Sum P(4) = 4.8e-08
 Identities = 8/17 (47%), Positives = 12/17 (70%)

Query:   602 DGHLLNFLLNMKTGELT 618
             DG  + FLL+ +TG +T
Sbjct:  1271 DGAHVGFLLSDETGNIT 1287


>UNIPROTKB|Q9N4C2 [details] [associations]
            symbol:cpsf-1 "Probable cleavage and polyadenylation
            specificity factor subunit 1" species:6239 "Caenorhabditis elegans"
            [GO:0006378 "mRNA polyadenylation" evidence=NAS] [GO:0006379 "mRNA
            cleavage" evidence=NAS] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=NAS]
            InterPro:IPR004871 Pfam:PF03178 GO:GO:0009792 GO:GO:0040007
            GO:GO:0002119 GO:GO:0006378 GO:GO:0010171 GO:GO:0040018
            GO:GO:0000003 GO:GO:0003723 GO:GO:0040027 eggNOG:COG5161 KO:K14401
            GO:GO:0005847 GO:GO:0006379 GeneTree:ENSGT00550000075040
            OMA:NIGDNRY HOGENOM:HOG000007904 EMBL:FO081666 RefSeq:NP_500157.2
            ProteinModelPortal:Q9N4C2 MINT:MINT-3384281 STRING:Q9N4C2
            PaxDb:Q9N4C2 EnsemblMetazoa:Y76B12C.7.1 EnsemblMetazoa:Y76B12C.7.2
            GeneID:177003 KEGG:cel:CELE_Y76B12C.7 CTD:177003 WormBase:Y76B12C.7
            InParanoid:Q9N4C2 NextBio:894932 Uniprot:Q9N4C2
        Length = 1454

 Score = 214 (80.4 bits), Expect = 4.7e-26, Sum P(4) = 4.7e-26
 Identities = 81/314 (25%), Positives = 152/314 (48%)

Query:   791 VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 850
             V+PE ++PT  R         K++++ +KE KG V  L A NG LL  + QK+  + W  
Sbjct:  1153 VVPEPDQPTSNR---------KIKVLFDKEQKGPVTGLCAINGLLLCGMGQKV--FIWQF 1201

Query:   851 RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN- 909
             +D+    + S    H ++  L+  +     +  D  +S+SL+ ++ +  A+   +RD   
Sbjct:  1202 KDNDLMGI-SFLDMHYYVYQLH--SLRTIAIACDARESMSLIRFQEDNKAMSIASRDDRK 1258

Query:   910 -ANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERG-RLEVVGEYHLGEFVN 964
              A    A +++ D  ++G   ++   N+ T+   +  A +   G RL V    ++G  +N
Sbjct:  1259 CAQPPMASQLVVDGAHVGFLLSDETGNI-TMFNYAPEAPESNGGERLTVRAAINIGTNIN 1317

Query:   965 RF-R---HGSLVMRLPDSD----VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 1016
              F R   H SL ++L + D    + Q  T +F +++G  G +  L  + Y  L  LQT +
Sbjct:  1318 AFVRLRGHTSL-LQLNNEDEKEAIEQRMTTVFASLDGSFGFVRPLTEKSYRRLHFLQTFI 1376

Query:  1017 RKVIKGVGGLNHEQWRSFNNEKKTVD---AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1073
               V   + GL+ +  RS    +  V+   A+N +DGD++E +L LS     ++++ + V 
Sbjct:  1377 GSVTPQIAGLHIKGSRSAKPSQPIVNGRNARNLIDGDVVEQYLHLSLYDKTDLARRLGVG 1436

Query:  1074 VEELCKRVEELTRL 1087
                +   + +L R+
Sbjct:  1437 RYHIIDDLMQLRRM 1450

 Score = 119 (46.9 bits), Expect = 4.7e-26, Sum P(4) = 4.7e-26
 Identities = 36/131 (27%), Positives = 57/131 (43%)

Query:   124 DPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQ-----VLDIKFLYGCAKPTIV 178
             DP  R     +Y     ++PF    +   ++ I L+++      + D+ FL G  +PTI+
Sbjct:   145 DPSNRCAACLVYGKHIAILPFHENSKRIHSYVIPLKQIDPRLDNIADMVFLDGYYEPTIL 204

Query:   179 VLYQ--DNKDARHVKTYE--------VALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCG 228
              LY+       R    Y+        V + D+ F    W   NL      L+P+P PL G
Sbjct:   205 FLYEPIQTTPGRACVRYDTMCIMGVSVNIVDRQFAV-VWQTANLPMDCSQLLPIPKPLGG 263

Query:   229 VLIIGEETIVY 239
              L+ G  T+VY
Sbjct:   264 ALVFGSNTVVY 274

 Score = 92 (37.4 bits), Expect = 4.7e-26, Sum P(4) = 4.7e-26
 Identities = 58/234 (24%), Positives = 99/234 (42%)

Query:   342 LERYVNLGPIVDFCV----------VDLERQGQG-QVVTCSGAYKDGSLRIVRNGIGINE 390
             L+R  N+GP+   CV          VD +R+     +VT SG  K+G+L + +  +    
Sbjct:   443 LDRLRNVGPVKSMCVGRPNYMSNDLVDAKRRDPVFDLVTASGHGKNGALCVHQRSLRPEI 502

Query:   391 QASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQT 450
               S  L+G + +W++    ++    +L+VS +  T IL   L           F +   T
Sbjct:   503 ITSSLLEGAEQLWAVGRKENESHK-YLIVSRVRSTLIL--ELGEELVELEEQLFVTGEPT 559

Query:   451 LFCHDAIYNQL-VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH 509
             +   +     L VQVTS  + LV  T  +   E      + V  A+     V L T  G 
Sbjct:   560 VAAGELSQGALAVQVTSTCIALV--TDGQQMQEVHIDSNFPVIQASIVDPYVALLTQNGR 617

Query:   510 LVYLEIGDGILTEVKHAQL-EYEISCLDI---NPIGENPSYSQIAAVGMWTDIS 559
             L+  E+    + E  + QL E +IS       +   +N   +Q+ ++ ++ D S
Sbjct:   618 LLLYEL----VME-PYVQLREVDISATSFATWHATAQN--LTQLTSISIYADAS 664

 Score = 70 (29.7 bits), Expect = 4.2e-11, Sum P(4) = 4.2e-11
 Identities = 31/110 (28%), Positives = 56/110 (50%)

Query:   740 ESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPT 799
             +SE+   R+  D  F++   YP+   E G +I    +  +S+VY  V +   +P+   P+
Sbjct:   999 KSELRIARMHPD--FDYEMPYPVKKIEVGRTIHHVRYLMNSDVYAVVSS---IPK---PS 1050

Query:   800 KGRILVFIVEDGKLQLIAEKETKGAV-----YSLNAFNGKLLAAI-NQKI 843
               +I V ++ D K + I EK+    +     Y+LN F+ +  AA+ N +I
Sbjct:  1051 N-KIWV-VMNDDKQEEIHEKDENFVLPAPPKYTLNLFSSQDWAAVPNTEI 1098

 Score = 66 (28.3 bits), Expect = 4.7e-26, Sum P(4) = 4.7e-26
 Identities = 17/55 (30%), Positives = 30/55 (54%)

Query:   289 VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLE 343
             V  L+   + ETSIA +++      +++GS  GDSQL++  L    +   V+ L+
Sbjct:   344 VKSLEFSKVYETSIAYSLTVCAPGHLFVGSRLGDSQLLEYTLLKTTRDCAVKRLK 398

 Score = 49 (22.3 bits), Expect = 2.6e-24, Sum P(4) = 2.6e-24
 Identities = 14/40 (35%), Positives = 21/40 (52%)

Query:   304 STISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLE 343
             ST  Y+++  + +GS  GD  L  L L   + G  V+ LE
Sbjct:   311 STSVYMEDGRIAVGSRDGD--LFLLRLMTSSGGGTVKSLE 348

 Score = 43 (20.2 bits), Expect = 1.9e-08, Sum P(4) = 1.9e-08
 Identities = 27/107 (25%), Positives = 48/107 (44%)

Query:   648 DRPTVIYSSNK-KLLYSNVNLKEVSHMCPFNS---AAFPDSLAI--AKEGELT-IGTIDD 700
             D+PT   S+ K K+L+       V+ +C  N          + I   K+ +L  I  +D 
Sbjct:  1158 DQPT---SNRKIKVLFDKEQKGPVTGLCAINGLLLCGMGQKVFIWQFKDNDLMGISFLDM 1214

Query:   701 ---IQKLH-IRSIPLGEHPRR----ICHQEQSRTFAICSLKNQSCAE 739
                + +LH +R+I +    R     I  QE ++  +I S  ++ CA+
Sbjct:  1215 HYYVYQLHSLRTIAIACDARESMSLIRFQEDNKAMSIASRDDRKCAQ 1261

 Score = 40 (19.1 bits), Expect = 3.8e-08, Sum P(4) = 3.8e-08
 Identities = 8/20 (40%), Positives = 15/20 (75%)

Query:  1064 DEISKTMNVSVEELCKRVEE 1083
             DE ++ +N  +++LC+RV E
Sbjct:   837 DE-AEQLNTEMKQLCERVLE 855

 Score = 39 (18.8 bits), Expect = 4.8e-08, Sum P(4) = 4.8e-08
 Identities = 8/17 (47%), Positives = 12/17 (70%)

Query:   602 DGHLLNFLLNMKTGELT 618
             DG  + FLL+ +TG +T
Sbjct:  1271 DGAHVGFLLSDETGNIT 1287


>ZFIN|ZDB-GENE-040709-2 [details] [associations]
            symbol:cpsf1 "cleavage and polyadenylation specific
            factor 1" species:7955 "Danio rerio" [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            [GO:0006378 "mRNA polyadenylation" evidence=IMP] [GO:0060216
            "definitive hemopoiesis" evidence=IMP] InterPro:IPR004871
            Pfam:PF03178 ZFIN:ZDB-GENE-040709-2 GO:GO:0005634 GO:GO:0006378
            GO:GO:0003676 GeneTree:ENSGT00550000075040 GO:GO:0060216
            EMBL:CU467825 IPI:IPI00932321 Ensembl:ENSDART00000110017
            ArrayExpress:F1QCJ8 Bgee:F1QCJ8 Uniprot:F1QCJ8
        Length = 1451

 Score = 226 (84.6 bits), Expect = 2.3e-25, Sum P(7) = 2.3e-25
 Identities = 90/326 (27%), Positives = 157/326 (48%)

Query:   783 YYCVGTAYVLPEENEPTKGRILVFIV-----EDG------KLQLIAEKETKGAVYSLNAF 831
             Y  +GT  +  EE    +GRIL+  V     E G      K +++ EKE KG V +L   
Sbjct:  1128 YVALGTCLMQGEE-VTCRGRILILDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHC 1186

Query:   832 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 891
             +G L++AI QKI L  W L+D+    + +      +I  +Y  +  +FI+  D+MKSISL
Sbjct:  1187 SGFLVSAIGQKIFL--WSLKDNDLTGM-AFIDTQLYIHQMY--SIKNFILAADVMKSISL 1241

Query:   892 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEER 948
             L Y+ E   +   +RD     + ++E + D+  LG   ++ + NL       E       
Sbjct:  1242 LRYQPESKTLSLVSRDAKPLEVYSIEFMVDNNQLGFLVSDRDKNLMVYMYLPEAKESFGG 1301

Query:   949 GRLEVVGEYHLGEFVNRF-R---HGSLVMRLPDSDVGQIPTVI-FGTVNGVIGVIASLPH 1003
              RL    ++++G  VN F R    G+L      +       +  F T++G +G++  +  
Sbjct:  1302 MRLLRRADFNVGSHVNAFWRMPCRGTLDTANKKALTWDNKHITWFATLDGGVGLLLPMQE 1361

Query:  1004 EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRT 1061
             + Y  L  LQ  L  ++    GLN + +R  + +++T+    KN LDG+L+  +L LS  
Sbjct:  1362 KTYRRLLMLQNALTTMLPHHAGLNPKAFRMLHCDRRTLQNAVKNILDGELLNKYLYLSTM 1421

Query:  1062 RMDEISKTMNVSVEELCKRVEELTRL 1087
                E++K +  + + +   + E+ R+
Sbjct:  1422 ERSELAKKIGTTPDIILDDLLEIERV 1447

 Score = 94 (38.1 bits), Expect = 2.3e-25, Sum P(7) = 2.3e-25
 Identities = 23/88 (26%), Positives = 48/88 (54%)

Query:   161 LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG--P--WSQNN 211
             L ++D+KFL+G  +PT+++L++ N+      A    T  +     + ++   P  WS +N
Sbjct:   201 LNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKVHPVIWSLSN 260

Query:   212 LDNGADLLIPVPPPLCGVLIIGEETIVY 239
             L    + ++ VP P+ GV++    +++Y
Sbjct:   261 LPFDCNQVMAVPKPIGGVVVFAVNSLLY 288

 Score = 70 (29.7 bits), Expect = 2.3e-25, Sum P(7) = 2.3e-25
 Identities = 15/47 (31%), Positives = 25/47 (53%)

Query:   366 QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 412
             +VV CSG  K+G+L +++  I      + EL G   MW++    + P
Sbjct:   501 EVVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCHDMWTVIYCEEKP 547

 Score = 65 (27.9 bits), Expect = 2.3e-25, Sum P(7) = 2.3e-25
 Identities = 13/34 (38%), Positives = 21/34 (61%)

Query:    10 AHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIH 43
             AH PT V  +   NF S QE NL++A  +++ ++
Sbjct:     8 AHPPTAVEFAVYCNFISSQEKNLVVAGTSQLYVY 41

 Score = 61 (26.5 bits), Expect = 2.3e-25, Sum P(7) = 2.3e-25
 Identities = 18/79 (22%), Positives = 32/79 (40%)

Query:   670 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDIQKLH----IRSIPLGEHPRRICHQEQS 724
             +    PF++   P   L   K+GEL I  +           +R IPL      + +  +S
Sbjct:   980 IESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTVHYVSYHVES 1039

Query:   725 RTFAICSLKNQSCAEESEM 743
             + +A+C+   + C     M
Sbjct:  1040 KVYAVCTSVKEPCTRIPRM 1058

 Score = 55 (24.4 bits), Expect = 2.3e-25, Sum P(7) = 2.3e-25
 Identities = 20/99 (20%), Positives = 46/99 (46%)

Query:    50 LQPMLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSD 109
             L+ +    ++G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++    +
Sbjct:    66 LEQVASFSLFGNVMSMASVQLVGTNRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEE 125

Query:   110 RIGRP--TDNGQIGII--DPDCRLIGLHLYDGLFKVIPF 144
                R     N  I ++  DP+ R   + +Y     V+PF
Sbjct:   126 PELRDGFVQNVHIPMVRVDPENRCAVMLVYGTCLVVLPF 164

 Score = 46 (21.3 bits), Expect = 4.4e-05, Sum P(6) = 4.4e-05
 Identities = 12/28 (42%), Positives = 16/28 (57%)

Query:   602 DGHLLNFLLNMKTGELTDRKKVSLGTQP 629
             DG LLN  L + T E ++  K  +GT P
Sbjct:  1408 DGELLNKYLYLSTMERSELAK-KIGTTP 1434

 Score = 40 (19.1 bits), Expect = 0.00015, Sum P(6) = 0.00015
 Identities = 9/41 (21%), Positives = 23/41 (56%)

Query:   818 EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL 858
             E+ ++G+  + +A  GK   +  Q+   +  ++R++G  E+
Sbjct:   764 EESSRGSAAASSAHTGKESGSGRQEPSHWCLLVRENGVMEI 804

 Score = 38 (18.4 bits), Expect = 2.3e-25, Sum P(7) = 2.3e-25
 Identities = 14/68 (20%), Positives = 29/68 (42%)

Query:   281 VITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLI----KLNLQPDAKG 336
             +IT     V     +    + + + +  ++   +++GS  G+S L+    KL   P  +G
Sbjct:   349 LITDGMRSVRAFHFDKAAASVLTTCMMTMEPGYLFLGSRLGNSLLLRYTEKLQETPMEEG 408

Query:   337 SYVEVLER 344
                E  E+
Sbjct:   409 KENEEKEK 416


>UNIPROTKB|Q10570 [details] [associations]
            symbol:CPSF1 "Cleavage and polyadenylation specificity
            factor subunit 1" species:9606 "Homo sapiens" [GO:0003730 "mRNA
            3'-UTR binding" evidence=IDA] [GO:0006379 "mRNA cleavage"
            evidence=IDA] [GO:0006378 "mRNA polyadenylation" evidence=IDA]
            [GO:0005847 "mRNA cleavage and polyadenylation specificity factor
            complex" evidence=IDA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0000398 "mRNA splicing, via spliceosome" evidence=TAS]
            [GO:0005654 "nucleoplasm" evidence=TAS] [GO:0006366 "transcription
            from RNA polymerase II promoter" evidence=TAS] [GO:0006369
            "termination of RNA polymerase II transcription" evidence=TAS]
            [GO:0006397 "mRNA processing" evidence=TAS] [GO:0006406 "mRNA
            export from nucleus" evidence=TAS] [GO:0008380 "RNA splicing"
            evidence=TAS] [GO:0010467 "gene expression" evidence=TAS]
            [GO:0031124 "mRNA 3'-end processing" evidence=TAS]
            Reactome:REACT_71 InterPro:IPR004871 Pfam:PF03178
            Reactome:REACT_1675 GO:GO:0006378 GO:GO:0003730 GO:GO:0006406
            GO:GO:0000398 Reactome:REACT_1788 eggNOG:COG5161 KO:K14401
            GO:GO:0005847 GO:GO:0006379 GO:GO:0006369 Reactome:REACT_78
            OMA:NIGDNRY CTD:29894 HOGENOM:HOG000007904 HOVERGEN:HBG051105
            OrthoDB:EOG4BCDM3 EMBL:U37012 EMBL:BC017232 IPI:IPI00026219
            RefSeq:NP_037423.2 UniGene:Hs.493202 ProteinModelPortal:Q10570
            DIP:DIP-32694N IntAct:Q10570 MINT:MINT-1601544 STRING:Q10570
            PhosphoSite:Q10570 DMDM:23503048 PaxDb:Q10570 PeptideAtlas:Q10570
            PRIDE:Q10570 DNASU:29894 Ensembl:ENST00000349769
            Ensembl:ENST00000568627 GeneID:29894 KEGG:hsa:29894 UCSC:uc003zcj.3
            GeneCards:GC08M145618 HGNC:HGNC:2324 MIM:606027 neXtProt:NX_Q10570
            PharmGKB:PA26841 InParanoid:Q10570 PhylomeDB:Q10570 ChiTaRS:CPSF1
            GenomeRNAi:29894 NextBio:52452 ArrayExpress:Q10570 Bgee:Q10570
            CleanEx:HS_CPSF1 Genevestigator:Q10570 GermOnline:ENSG00000071894
            Uniprot:Q10570
        Length = 1443

 Score = 234 (87.4 bits), Expect = 3.0e-25, Sum P(7) = 3.0e-25
 Identities = 101/376 (26%), Positives = 171/376 (45%)

Query:   740 ESEMHFVRLLDDQTFEFI--STYPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVL 792
             + E   ++L+   ++E I  +   L  +E+   + + S   +  V     Y   GT  + 
Sbjct:  1070 QQEAFSIQLISPVSWEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQ 1129

Query:   793 PEENEPTKGRILVFIV-----EDG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQ 841
              EE    +GRIL+  V     E G      K +++ EKE KG V +L   NG L++AI Q
Sbjct:  1130 GEE-VTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQ 1188

Query:   842 KIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAI 901
             KI L  W LR     EL         +    + +  +FI+  D+MKSISLL Y+ E   +
Sbjct:  1189 KIFL--WSLR---ASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTL 1243

Query:   902 EERARDYNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYH 958
                +RD     + +V+ + D+  LG   ++ + NL       E        RL    ++H
Sbjct:  1244 SLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFH 1303

Query:   959 LGEFVNRF-R---HGSLVMRLPDSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQ 1013
             +G  VN F R    G+       S V +   +  F T++G IG++  +  + Y  L  LQ
Sbjct:  1304 VGAHVNTFWRTPCRGATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQ 1363

Query:  1014 TNLRKVIKGVGGLNHEQWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMN 1071
               L  ++    GLN   +R  + +++T+    +N LDG+L+  +L LS     E++K + 
Sbjct:  1364 NALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIG 1423

Query:  1072 VSVEELCKRVEELTRL 1087
              + + +   + E  R+
Sbjct:  1424 TTPDIILDDLLETDRV 1439

 Score = 81 (33.6 bits), Expect = 3.0e-25, Sum P(7) = 3.0e-25
 Identities = 21/88 (23%), Positives = 44/88 (50%)

Query:   161 LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG--P--WSQNN 211
             L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   P  WS  +
Sbjct:   201 LNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTS 260

Query:   212 LDNGADLLIPVPPPLCGVLIIGEETIVY 239
             L       + VP P+ GV++    +++Y
Sbjct:   261 LPFDCTQALAVPKPIGGVVVFAVNSLLY 288

 Score = 74 (31.1 bits), Expect = 3.0e-25, Sum P(7) = 3.0e-25
 Identities = 21/75 (28%), Positives = 37/75 (49%)

Query:   340 EVLERYVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINE 390
             EV +  +N+GP  +  V +   L  + Q       ++V CSG  K+G+L +++  I    
Sbjct:   464 EVCDSILNIGPCANAAVGEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQV 523

Query:   391 QASVELQGIKGMWSL 405
               + EL G   MW++
Sbjct:   524 VTTFELPGCYDMWTV 538

 Score = 60 (26.2 bits), Expect = 3.0e-25, Sum P(7) = 3.0e-25
 Identities = 19/79 (24%), Positives = 32/79 (40%)

Query:   670 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDIQKLH----IRSIPLGEHPRRICHQEQS 724
             V    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct:   972 VDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1031

Query:   725 RTFAICSLKNQSCAEESEM 743
             + +A+ +  N  CA    M
Sbjct:  1032 KVYAVATSTNTPCARIPRM 1050

 Score = 59 (25.8 bits), Expect = 3.0e-25, Sum P(7) = 3.0e-25
 Identities = 12/36 (33%), Positives = 21/36 (58%)

Query:    10 AHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLL 45
             AH PT +  S   NF +  E NL++A  +++ ++ L
Sbjct:     8 AHPPTGLEFSMYCNFFNNSERNLVVAGTSQLYVYRL 43

 Score = 58 (25.5 bits), Expect = 3.0e-25, Sum P(7) = 3.0e-25
 Identities = 19/99 (19%), Positives = 46/99 (46%)

Query:    59 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMG-----DVSDRIGR 113
             +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   +
Sbjct:    75 FGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFVQ 134

Query:   114 PTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKE 152
                  ++ + DPD R   + +Y     V+PF  +   +E
Sbjct:   135 NVHTPRVRV-DPDGRCAAMLVYGTRLVVLPFRRESLAEE 172

 Score = 46 (21.3 bits), Expect = 0.00033, Sum P(6) = 0.00033
 Identities = 12/28 (42%), Positives = 16/28 (57%)

Query:   602 DGHLLNFLLNMKTGELTDRKKVSLGTQP 629
             DG LLN  L + T E ++  K  +GT P
Sbjct:  1400 DGELLNRYLYLSTMERSELAK-KIGTTP 1426

 Score = 41 (19.5 bits), Expect = 3.0e-25, Sum P(7) = 3.0e-25
 Identities = 9/47 (19%), Positives = 23/47 (48%)

Query:   281 VITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 327
             +IT     V     +    + + +++  ++   +++GS  G+S L+K
Sbjct:   349 LITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLK 395


>UNIPROTKB|F5GYG8 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9606
            "Homo sapiens" [GO:0016055 "Wnt receptor signaling pathway"
            evidence=IEA] GO:GO:0016055 GO:GO:0003684 EMBL:AP003108
            HGNC:HGNC:2717 ChiTaRS:DDB1 EMBL:AP003037 IPI:IPI01012348
            ProteinModelPortal:F5GYG8 SMR:F5GYG8 Ensembl:ENST00000543627
            ArrayExpress:F5GYG8 Bgee:F5GYG8 Uniprot:F5GYG8
        Length = 109

 Score = 292 (107.8 bits), Expect = 1.7e-24, P = 1.7e-24
 Identities = 56/106 (52%), Positives = 81/106 (76%)

Query:     4 WNYVVTAHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPIYGRIA 63
             +NYVVTA KPT V     G+FTS ++LNL+IAK TR+EI+++T +GL+P+ +V +YG+IA
Sbjct:     3 YNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVVTAEGLRPVKEVGMYGKIA 62

Query:    64 TLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDV 107
              +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V
Sbjct:    63 VMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNV 108


>MGI|MGI:2679722 [details] [associations]
            symbol:Cpsf1 "cleavage and polyadenylation specific factor
            1" species:10090 "Mus musculus" [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0003723 "RNA binding" evidence=IEA] [GO:0003730
            "mRNA 3'-UTR binding" evidence=ISO] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=ISO] [GO:0006378 "mRNA
            polyadenylation" evidence=ISO] [GO:0006379 "mRNA cleavage"
            evidence=ISO] [GO:0006397 "mRNA processing" evidence=IEA]
            InterPro:IPR004871 Pfam:PF03178 MGI:MGI:2679722 GO:GO:0006378
            GO:GO:0003730 eggNOG:COG5161 KO:K14401 GO:GO:0005847 GO:GO:0006379
            GeneTree:ENSGT00550000075040 OMA:NIGDNRY CTD:29894
            HOGENOM:HOG000007904 HOVERGEN:HBG051105 OrthoDB:EOG4BCDM3
            EMBL:AF322193 EMBL:BC056388 IPI:IPI00110363 RefSeq:NP_001157645.1
            RefSeq:NP_444423.1 UniGene:Mm.45141 ProteinModelPortal:Q9EPU4
            STRING:Q9EPU4 PhosphoSite:Q9EPU4 PaxDb:Q9EPU4 PRIDE:Q9EPU4
            Ensembl:ENSMUST00000071898 GeneID:94230 KEGG:mmu:94230
            UCSC:uc007wky.2 InParanoid:Q9EPU4 NextBio:352239 Bgee:Q9EPU4
            CleanEx:MM_CPSF1 Genevestigator:Q9EPU4
            GermOnline:ENSMUSG00000034022 Uniprot:Q9EPU4
        Length = 1441

 Score = 230 (86.0 bits), Expect = 2.0e-24, Sum P(7) = 2.0e-24
 Identities = 100/376 (26%), Positives = 171/376 (45%)

Query:   740 ESEMHFVRLLDDQTFEFI--STYPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVL 792
             + E   ++L+   ++E I  +   L+ +E+   + + S   +  V     Y   GT  + 
Sbjct:  1068 QQEAFSIQLISPVSWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQ 1127

Query:   793 PEENEPTKGRILVFIV-----EDG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQ 841
              EE    +GRIL+  V     E G      K +++ EKE KG V +L   NG L++AI Q
Sbjct:  1128 GEE-VTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQ 1186

Query:   842 KIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAI 901
             KI L  W LR     EL         +    + +  +FI+  D+MKSISLL Y+ E   +
Sbjct:  1187 KIFL--WSLR---ASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTL 1241

Query:   902 EERARDYNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYH 958
                +RD     + +V+ + D+  LG   ++ + NL       E        RL    ++H
Sbjct:  1242 SLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFH 1301

Query:   959 LGEFVNRF-R---HGSLVMRLPDSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQ 1013
             +G  VN F R    G+       S V +   +  F T++G IG++  +  + Y  L  LQ
Sbjct:  1302 VGAHVNTFWRTPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQ 1361

Query:  1014 TNLRKVIKGVGGLNHEQWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMN 1071
               L  ++    GLN   +R  + +++ +    +N LDG+L+  +L LS     E++K + 
Sbjct:  1362 NALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIG 1421

Query:  1072 VSVEELCKRVEELTRL 1087
              + + +   + E  R+
Sbjct:  1422 TTPDIILDDLLETDRV 1437

 Score = 82 (33.9 bits), Expect = 2.0e-24, Sum P(7) = 2.0e-24
 Identities = 22/88 (25%), Positives = 44/88 (50%)

Query:   161 LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG--P--WSQNN 211
             L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   P  WS  +
Sbjct:   201 LNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTS 260

Query:   212 LDNGADLLIPVPPPLCGVLIIGEETIVY 239
             L       + VP P+ GV+I    +++Y
Sbjct:   261 LPFDCTQALAVPKPIGGVVIFAVNSLLY 288

 Score = 74 (31.1 bits), Expect = 2.0e-24, Sum P(7) = 2.0e-24
 Identities = 21/75 (28%), Positives = 37/75 (49%)

Query:   340 EVLERYVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINE 390
             EV +  +N+GP  +  V +   L  + Q       ++V CSG  K+G+L +++  I    
Sbjct:   463 EVCDSMLNIGPCANAAVGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQV 522

Query:   391 QASVELQGIKGMWSL 405
               + EL G   MW++
Sbjct:   523 VTTFELPGCYDMWTV 537

 Score = 60 (26.2 bits), Expect = 2.0e-24, Sum P(7) = 2.0e-24
 Identities = 20/108 (18%), Positives = 49/108 (45%)

Query:    50 LQPMLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMG---- 105
             L+ +     +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++     
Sbjct:    66 LELVASFSFFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEE 125

Query:   106 -DVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKE 152
              ++ D   +     ++ + DPD R   + +Y     V+PF  +   +E
Sbjct:   126 PELRDGFVQNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEE 172

 Score = 56 (24.8 bits), Expect = 2.0e-24, Sum P(7) = 2.0e-24
 Identities = 11/36 (30%), Positives = 21/36 (58%)

Query:    10 AHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLL 45
             AH PT +  +   NF +  E NL++A  +++ ++ L
Sbjct:     8 AHPPTGLEFTMYCNFFNNSERNLVVAGTSQLYVYRL 43

 Score = 55 (24.4 bits), Expect = 2.0e-24, Sum P(7) = 2.0e-24
 Identities = 17/79 (21%), Positives = 31/79 (39%)

Query:   670 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDIQKLH----IRSIPLGEHPRRICHQEQS 724
             +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct:   970 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1029

Query:   725 RTFAICSLKNQSCAEESEM 743
             + +A+ +  N  C     M
Sbjct:  1030 KVYAVATSTNTPCTRIPRM 1048

 Score = 46 (21.3 bits), Expect = 0.00026, Sum P(6) = 0.00026
 Identities = 12/28 (42%), Positives = 16/28 (57%)

Query:   602 DGHLLNFLLNMKTGELTDRKKVSLGTQP 629
             DG LLN  L + T E ++  K  +GT P
Sbjct:  1398 DGELLNRYLYLSTMERSELAK-KIGTTP 1424

 Score = 42 (19.8 bits), Expect = 2.0e-24, Sum P(7) = 2.0e-24
 Identities = 13/66 (19%), Positives = 29/66 (43%)

Query:   281 VITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLN--LQPDAKGSY 338
             +IT     V     +    + + +++  ++   +++GS  G+S L+K    LQ     S 
Sbjct:   349 LITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSV 408

Query:   339 VEVLER 344
              E  ++
Sbjct:   409 REAADK 414


>UNIPROTKB|Q10569 [details] [associations]
            symbol:CPSF1 "Cleavage and polyadenylation specificity
            factor subunit 1" species:9913 "Bos taurus" [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=ISS] [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006378
            "mRNA polyadenylation" evidence=IEA] [GO:0003730 "mRNA 3'-UTR
            binding" evidence=IEA] InterPro:IPR004871 Pfam:PF03178
            GO:GO:0006378 GO:GO:0003730 eggNOG:COG5161 KO:K14401 GO:GO:0005847
            GO:GO:0006379 GeneTree:ENSGT00550000075040 OMA:NIGDNRY EMBL:X83097
            IPI:IPI00713487 PIR:S57335 RefSeq:NP_777145.1 UniGene:Bt.4911
            STRING:Q10569 PRIDE:Q10569 Ensembl:ENSBTAT00000011004 GeneID:282703
            KEGG:bta:282703 CTD:29894 HOGENOM:HOG000007904 HOVERGEN:HBG051105
            InParanoid:Q10569 OrthoDB:EOG4BCDM3 NextBio:20806363
            ArrayExpress:Q10569 Uniprot:Q10569
        Length = 1444

 Score = 230 (86.0 bits), Expect = 7.9e-24, Sum P(7) = 7.9e-24
 Identities = 100/376 (26%), Positives = 171/376 (45%)

Query:   740 ESEMHFVRLLDDQTFEFI--STYPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVL 792
             + E   ++L+   ++E I  +   L+ +E+   + + S   +  V     Y   GT  + 
Sbjct:  1071 QQEAFCIQLISPVSWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQ 1130

Query:   793 PEENEPTKGRILVFIV-----EDG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQ 841
              EE    +GRIL+  V     E G      K +++ EKE KG V +L   NG L++AI Q
Sbjct:  1131 GEE-VTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQ 1189

Query:   842 KIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAI 901
             KI L  W LR     EL         +    + +  +FI+  D+MKSISLL Y+ E   +
Sbjct:  1190 KIFL--WSLR---ASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTL 1244

Query:   902 EERARDYNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYH 958
                +RD     + +V+ + D+  LG   ++ + NL       E        RL    ++H
Sbjct:  1245 SLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFH 1304

Query:   959 LGEFVNRF-R---HGSLVMRLPDSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQ 1013
             +G  VN F R    G+       S V +   +  F T++G IG++  +  + Y  L  LQ
Sbjct:  1305 VGAHVNTFWRTPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQ 1364

Query:  1014 TNLRKVIKGVGGLNHEQWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMN 1071
               L  ++    GLN   +R  + +++ +    +N LDG+L+  +L LS     E++K + 
Sbjct:  1365 NALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIG 1424

Query:  1072 VSVEELCKRVEELTRL 1087
              + + +   + E  R+
Sbjct:  1425 TTPDIILDDLLETDRV 1440

 Score = 81 (33.6 bits), Expect = 7.9e-24, Sum P(7) = 7.9e-24
 Identities = 22/88 (25%), Positives = 44/88 (50%)

Query:   161 LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG--P--WSQNN 211
             L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   P  WS  +
Sbjct:   204 LNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTS 263

Query:   212 LDNGADLLIPVPPPLCGVLIIGEETIVY 239
             L       + VP P+ GV+I    +++Y
Sbjct:   264 LPFDCTQALAVPKPIGGVVIFAVNSLLY 291

 Score = 71 (30.1 bits), Expect = 7.9e-24, Sum P(7) = 7.9e-24
 Identities = 20/75 (26%), Positives = 37/75 (49%)

Query:   340 EVLERYVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINE 390
             EV +  +N+GP  +  + +   L  + Q       ++V CSG  K+G+L +++  I    
Sbjct:   466 EVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQV 525

Query:   391 QASVELQGIKGMWSL 405
               + EL G   MW++
Sbjct:   526 VTTFELPGCYDMWTV 540

 Score = 60 (26.2 bits), Expect = 7.9e-24, Sum P(7) = 7.9e-24
 Identities = 13/43 (30%), Positives = 22/43 (51%)

Query:    10 AHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQP 52
             AH PT +  S   NF +  E NL++A  +++ ++ L      P
Sbjct:     8 AHPPTGLEFSMYCNFFNNSERNLVVAGTSQLYVYRLNRDSEAP 50

 Score = 60 (26.2 bits), Expect = 7.9e-24, Sum P(7) = 7.9e-24
 Identities = 20/108 (18%), Positives = 49/108 (45%)

Query:    50 LQPMLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMG---- 105
             L+ +     +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++     
Sbjct:    69 LELVASFSFFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEE 128

Query:   106 -DVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKE 152
              ++ D   +     ++ + DPD R   + +Y     V+PF  +   +E
Sbjct:   129 PELRDGFVQNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEE 175

 Score = 50 (22.7 bits), Expect = 7.9e-24, Sum P(7) = 7.9e-24
 Identities = 16/79 (20%), Positives = 31/79 (39%)

Query:   670 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDIQKLH----IRSIPLGEHPRRICHQEQS 724
             +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct:   973 IDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1032

Query:   725 RTFAICSLKNQSCAEESEM 743
             + +A+ +  +  C     M
Sbjct:  1033 KVYAVATSTSTPCTRVPRM 1051

 Score = 41 (19.5 bits), Expect = 7.9e-24, Sum P(7) = 7.9e-24
 Identities = 9/47 (19%), Positives = 23/47 (48%)

Query:   281 VITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 327
             +IT     V     +    + + +++  ++   +++GS  G+S L+K
Sbjct:   352 LITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLK 398


>DICTYBASE|DDB_G0281585 [details] [associations]
            symbol:cpsf1 "cleavage and polyadenylation
            specificity factor 160 kDa subunit" species:44689 "Dictyostelium
            discoideum" [GO:0005634 "nucleus" evidence=IEA] [GO:0003676
            "nucleic acid binding" evidence=IEA] [GO:0006379 "mRNA cleavage"
            evidence=ISS] [GO:0006378 "mRNA polyadenylation" evidence=ISS]
            [GO:0005847 "mRNA cleavage and polyadenylation specificity factor
            complex" evidence=ISS] [GO:0003730 "mRNA 3'-UTR binding"
            evidence=ISS] InterPro:IPR004871 Pfam:PF03178
            dictyBase:DDB_G0281585 GenomeReviews:CM000152_GR GO:GO:0006378
            EMBL:AAFI02000042 GO:GO:0003730 eggNOG:COG5161 KO:K14401
            GO:GO:0005847 GO:GO:0006379 RefSeq:XP_640515.1
            EnsemblProtists:DDB0233702 GeneID:8623125 KEGG:ddi:DDB_G0281585
            InParanoid:Q54TS6 OMA:TSATIQD Uniprot:Q54TS6
        Length = 1628

 Score = 228 (85.3 bits), Expect = 1.4e-23, Sum P(7) = 1.4e-23
 Identities = 99/375 (26%), Positives = 176/375 (46%)

Query:   746 VRLLD---DQTFEFISTYPLDTFE--YGCSILSCSFSDDSNV-----YYCVGTAYVLPEE 795
             ++L+D   D  ++FI ++ L   E      I+S  F++   +     +  +GTA+   E+
Sbjct:  1272 IKLIDPTIDWNWKFIDSFSLQDRETVLAMKIVSLKFTEPDGITRARPFLVIGTAFTFGED 1331

Query:   796 NEPTKGRILVF-IVE-----------DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 843
              +  KGR+LVF IV            + +L L+ EKE KG V +L++ NG LL  I  K+
Sbjct:  1332 TQ-CKGRVLVFEIVSHKTQFESEELGEKRLNLLYEKEQKGPVTALSSVNGLLLMTIGPKL 1390

Query:   844 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 903
              + ++      T  L +   +   I    + T  ++IV+GD+ KS+  L +K  +  +  
Sbjct:  1391 TVNQFY-----TGSLVTLSFYDAQIYICSICTIKNYIVIGDMYKSVYFLQWKDNK-TLNL 1444

Query:   904 RARDYNA-NWMSAVEILDDD----IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYH 958
              ++DY A N  S   I++      +    + N  LF+       +   +  + E+ G   
Sbjct:  1445 LSKDYQALNIFSTEFIVNQKTLSILVSDLDKNILLFSFEPQDPSSRSGQINQ-EINGN-- 1501

Query:   959 LGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 1018
                  N+  +     RLP  +  Q+  VIFGT++G + V+  L  + YL    +Q+ L  
Sbjct:  1502 -----NKNDN-----RLPKKE--QL--VIFGTLDGGLNVLRPLDEKIYLLFYHIQSKLY- 1546

Query:  1019 VIKGVGGLNHEQWRSFNNEKKTVD---------AKNFLDGDLIESFLDLSRTRMDEISKT 1069
              +    GLN +Q+RSF +  +             K  LDGDLI  FL LS++    IS +
Sbjct:  1547 YLPQTAGLNPKQYRSFKSFSQNFHFSPSTFHQLPKFILDGDLISKFLSLSQSEKRLISNS 1606

Query:  1070 MNVSVEELCKRVEEL 1084
             +N + +E+ + ++++
Sbjct:  1607 INSTSDEIIESLKDV 1621

 Score = 88 (36.0 bits), Expect = 1.4e-23, Sum P(7) = 1.4e-23
 Identities = 32/106 (30%), Positives = 53/106 (50%)

Query:   146 NKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKD--AR-HVKTYE-----VAL 197
             N  Q KE  NI +E ++  D  FL+G  +PTI+ L++  +   +R  VK +      ++L
Sbjct:   273 NNNQDKEKKNIEIENVK--DFCFLHGYYEPTILFLHEPIQTWTSRIAVKKFTCQMTAISL 330

Query:   198 ----KDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY 239
                 K   F+   W+ +N     ++L+ VP PL G L+I    + Y
Sbjct:   331 NLLTKAGSFI---WNVSNFPYNCEMLVSVPEPLGGALVITANIMFY 373

 Score = 77 (32.2 bits), Expect = 1.4e-23, Sum P(7) = 1.4e-23
 Identities = 15/40 (37%), Positives = 26/40 (65%)

Query:   366 QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 405
             ++VTCSG  K+GS+ +++N I      + EL GI  +W++
Sbjct:   618 ELVTCSGYGKNGSISVLQNNIKPELVMAFELPGILNVWTV 657

 Score = 58 (25.5 bits), Expect = 1.4e-23, Sum P(7) = 1.4e-23
 Identities = 12/39 (30%), Positives = 22/39 (56%)

Query:   289 VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 327
             V  + +   G + + S I  L N ++++GS  GDS L++
Sbjct:   449 VQRIHVSKAGGSVLTSCICVLSNNLIFLGSRLGDSLLLQ 487

 Score = 53 (23.7 bits), Expect = 1.4e-23, Sum P(7) = 1.4e-23
 Identities = 26/105 (24%), Positives = 48/105 (45%)

Query:    45 LTPQGLQPMLDVPIYGRIATLELFR-PHGEAQDFLFIATERYKFCVLQWDAESSELITRA 103
             L P  L+ +++  ++G I ++   R P+ E +D L +     K  VL +D++  +   R+
Sbjct:    83 LKPS-LELIIEKKLFGNIESMASVRYPNSE-RDSLILTFRDAKISVLDYDSDLLDFEIRS 140

Query:   104 MGDVS-DRI--GRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVIPF 144
             +     D    GR    +  +  +D   R   + LYD    V+PF
Sbjct:   141 LHYFEKDEFKGGRNHFKHPPLLKVDTQQRCAVMLLYDRNLAVLPF 185

 Score = 48 (22.0 bits), Expect = 1.4e-23, Sum P(7) = 1.4e-23
 Identities = 9/31 (29%), Positives = 16/31 (51%)

Query:    13 PTNVTHSCVGNFTSPQELNLIIAKCTRIEIH 43
             PT V      N  +   +NL++AK   ++I+
Sbjct:    14 PTGVEQCIKANLINDDSINLVLAKTNVLQIY 44

 Score = 43 (20.2 bits), Expect = 1.4e-23, Sum P(7) = 1.4e-23
 Identities = 10/48 (20%), Positives = 22/48 (45%)

Query:   502 LLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQI 549
             +  T G + +Y       + +V   + EY+I  ++ N + +N    Q+
Sbjct:   940 IYTTNGSYEIYRLTSQECIFKVSDIKFEYDILGINTN-VSQNQILEQV 986

 Score = 43 (20.2 bits), Expect = 1.8e-18, Sum P(6) = 1.8e-18
 Identities = 10/29 (34%), Positives = 14/29 (48%)

Query:   383 RNGIGINEQASVELQGIKGMWSLRSSTDD 411
             R G+ +NE AS++   I G        DD
Sbjct:   379 RYGLAVNEYASIDTSTIIGSQPFDFPIDD 407

 Score = 39 (18.8 bits), Expect = 7.9e-17, Sum P(5) = 7.9e-17
 Identities = 11/40 (27%), Positives = 20/40 (50%)

Query:    11 HKPTNV-THSCVGNFTSPQELNLIIAKCTRIEIHLLTPQG 49
             ++PT +  H  +  +TS   +     + T I ++LLT  G
Sbjct:   298 YEPTILFLHEPIQTWTSRIAVKKFTCQMTAISLNLLTKAG 337

 Score = 39 (18.8 bits), Expect = 0.00025, Sum P(6) = 0.00025
 Identities = 9/23 (39%), Positives = 14/23 (60%)

Query:  1029 EQWRSFNNEKKTVD-AKNFLDGD 1050
             +Q +  N +KK  D ++ FLD D
Sbjct:   863 QQQQPSNEKKKKKDKSRGFLDSD 885


>TAIR|locus:2153122 [details] [associations]
            symbol:CPSF160 "cleavage and polyadenylation specificity
            factor 160" species:3702 "Arabidopsis thaliana" [GO:0003676
            "nucleic acid binding" evidence=IEA] [GO:0005634 "nucleus"
            evidence=ISM;IEA;IDA] [GO:0006378 "mRNA polyadenylation"
            evidence=ISS] [GO:0006379 "mRNA cleavage" evidence=ISS] [GO:0005515
            "protein binding" evidence=IPI] [GO:0005829 "cytosol" evidence=IDA]
            [GO:0006397 "mRNA processing" evidence=RCA] [GO:0009909 "regulation
            of flower development" evidence=RCA] [GO:0016570 "histone
            modification" evidence=RCA] [GO:0048449 "floral organ formation"
            evidence=RCA] InterPro:IPR004871 Pfam:PF03178 GO:GO:0005829
            GO:GO:0005634 EMBL:CP002688 GenomeReviews:BA000015_GR GO:GO:0006397
            GO:GO:0003723 eggNOG:COG5161 KO:K14401 EMBL:AY140902 EMBL:AB025607
            EMBL:AB018109 IPI:IPI00533913 RefSeq:NP_199979.2 UniGene:At.43551
            IntAct:Q9FGR0 STRING:Q9FGR0 PaxDb:Q9FGR0 PRIDE:Q9FGR0
            EnsemblPlants:AT5G51660.1 GeneID:835240 KEGG:ath:AT5G51660
            TAIR:At5g51660 HOGENOM:HOG000265012 InParanoid:Q9FGR0 OMA:NIGDNRY
            PhylomeDB:Q9FGR0 ProtClustDB:CLSN2680511 Genevestigator:Q9FGR0
            GermOnline:AT5G51660 Uniprot:Q9FGR0
        Length = 1442

 Score = 208 (78.3 bits), Expect = 3.5e-23, Sum P(4) = 3.5e-23
 Identities = 81/295 (27%), Positives = 135/295 (45%)

Query:   761 PLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE---DGKL 813
             P+ T E+  ++    L  + + ++     VGTAYV  E+    +GR+L+F      D   
Sbjct:  1104 PMQTSEHALTVRVVTLLNASTGENETLLAVGTAYVQGED-VAARGRVLLFSFGKNGDNSQ 1162

Query:   814 QLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILA 870
              ++ E   +E KGA+ ++ +  G LL +   KI L+KW    +GT EL            
Sbjct:  1163 NVVTEVYSRELKGAISAVASIQGHLLISSGPKIILHKW----NGT-ELNGVAFFDAP--P 1215

Query:   871 LYVQTRG---DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD--IYL 925
             LYV +      FI++GD+ KSI  L +K +   +   A+D+ +    A E L D   + L
Sbjct:  1216 LYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLSLLAKDFESLDCFATEFLIDGSTLSL 1275

Query:   926 GAENNFNLFTVRKNSEGATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP 984
                +      V   +    +  +G +L    E+H+G  V++F    +V    D  + +  
Sbjct:  1276 AVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKFLRLQMVSSGADK-INRF- 1333

Query:   985 TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 1039
              ++FGT++G  G IA L    +  L+ LQ  L   +  V GLN   +R F +  K
Sbjct:  1334 ALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLNPLAFRQFRSSGK 1388

 Score = 102 (41.0 bits), Expect = 3.5e-23, Sum P(4) = 3.5e-23
 Identities = 38/124 (30%), Positives = 58/124 (46%)

Query:   155 NIR-LEELQVLDIKFLYGCAKPTIVVLYQDNKD-ARHV--KTYEVALK----DKDFVEGP 206
             N+R LE   V D  FL+G  +P IV+L ++    A  V  K +   L     +    + P
Sbjct:   240 NLRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHP 299

Query:   207 --WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDAD 264
               WS  NL + A  L+ VP P+ GVL++   TI Y S +A  A+ +    + A    +  
Sbjct:   300 VIWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELP 359

Query:   265 GSRY 268
              S +
Sbjct:   360 ASNF 363

 Score = 98 (39.6 bits), Expect = 3.5e-23, Sum P(4) = 3.5e-23
 Identities = 34/125 (27%), Positives = 58/125 (46%)

Query:   329 NLQPDAKGSY-VEVLERYVNLGPIVDFC----------VVDLERQGQGQVVTCSGAYKDG 377
             N    A+ S+   V +  VN+GP+ DF              + +Q   ++V CSG  K+G
Sbjct:   494 NNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQSNYELVCCSGHGKNG 553

Query:   378 SLRIVRNGIGINEQASVELQGIKGMWSL--RSST------------DDPFDTFLVVSFIS 423
             +L ++R  I       VEL G KG+W++  +SS             +D +  +L++S  +
Sbjct:   554 ALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAADEDEYHAYLIISLEA 613

Query:   424 ETRIL 428
              T +L
Sbjct:   614 RTMVL 618

 Score = 55 (24.4 bits), Expect = 3.5e-23, Sum P(4) = 3.5e-23
 Identities = 10/28 (35%), Positives = 20/28 (71%)

Query:   302 IASTISYLDNAVVYIGSSYGDSQLIKLN 329
             +AS I+ + N++ ++GS  GDS L++ +
Sbjct:   414 LASDITSVGNSLFFLGSRLGDSLLVQFS 441


>FB|FBgn0024698 [details] [associations]
            symbol:Cpsf160 "Cleavage and polyadenylation specificity
            factor 160" species:7227 "Drosophila melanogaster" [GO:0005847
            "mRNA cleavage and polyadenylation specificity factor complex"
            evidence=ISS;NAS] [GO:0006378 "mRNA polyadenylation"
            evidence=ISS;NAS] [GO:0006379 "mRNA cleavage" evidence=ISS;NAS]
            [GO:0003730 "mRNA 3'-UTR binding" evidence=ISS] [GO:0003729 "mRNA
            binding" evidence=NAS] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0022008 "neurogenesis" evidence=IMP] InterPro:IPR004871
            Pfam:PF03178 EMBL:AE013599 GO:GO:0022008 GO:GO:0006378
            GO:GO:0003723 eggNOG:COG5161 KO:K14401 GO:GO:0005847 GO:GO:0006379
            GeneTree:ENSGT00550000075040 OMA:NIGDNRY EMBL:AF241364
            EMBL:AF241365 EMBL:AF241366 EMBL:AY051896 RefSeq:NP_725397.1
            RefSeq:NP_995833.1 UniGene:Dm.3414 ProteinModelPortal:Q9V726
            STRING:Q9V726 PaxDb:Q9V726 PRIDE:Q9V726 EnsemblMetazoa:FBtr0089258
            GeneID:44250 KEGG:dme:Dmel_CG10110 CTD:44250 FlyBase:FBgn0024698
            InParanoid:Q9V726 OrthoDB:EOG4ZCRK8 PhylomeDB:Q9V726
            GenomeRNAi:44250 NextBio:837008 Bgee:Q9V726 GermOnline:CG10110
            Uniprot:Q9V726
        Length = 1455

 Score = 248 (92.4 bits), Expect = 3.6e-23, Sum P(7) = 3.6e-23
 Identities = 91/328 (27%), Positives = 160/328 (48%)

Query:   783 YYCVGTAYVLPEENEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAF 831
             Y C+GT +   E+   ++G I ++     + E GK      ++ I +KE KG V +++  
Sbjct:  1133 YLCIGTNFNYSEDIT-SRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDV 1191

Query:   832 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 891
              G L+  + QKI  Y W LRD    +L        +I    + T    I + D+ KSISL
Sbjct:  1192 LGFLVTGLGQKI--YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISL 1246

Query:   892 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDE 946
             L ++ E   +   +RD+N   +  +E + D+  LG     AE N  ++  +  +  +   
Sbjct:  1247 LRFQEEYRTLSLASRDFNPLEVYGIEFMVDNSNLGFLVTDAERNIIVYMYQPEARESLGG 1306

Query:   947 ERGRLEVVGEYHLGEFVNR-FR---HGS-LVMRLPDSDVGQIPTVIFGTVNGVIGVIASL 1001
             ++  L    +YHLG+ VN  FR   H   L  R P     +   V++GT++G +G    L
Sbjct:  1307 QK--LLRKADYHLGQVVNTMFRVQCHQKGLHQRQPFLYENK-HFVVYGTLDGALGYCLPL 1363

Query:  1002 PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKT-VD-AKNFLDGDLIESFLDLS 1059
             P + Y     LQ  L    + + GLN +++R+  + KK  ++ ++  +DGDLI S+  ++
Sbjct:  1364 PEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWSYRLMA 1423

Query:  1060 RTRMDEISKTMNVSVEELCKRVEELTRL 1087
              +  +E++K +    EE+   + E+ RL
Sbjct:  1424 NSERNEVAKKIGTRTEEILGDLLEIERL 1451

 Score = 88 (36.0 bits), Expect = 3.6e-23, Sum P(7) = 3.6e-23
 Identities = 43/153 (28%), Positives = 70/153 (45%)

Query:   336 GSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE 395
             G  VE  E  V L P  +  + DL+ +    +V  +G  K+G+L +  N I      S E
Sbjct:   502 GERVEFEEDGVTLRPHAE-SLQDLKIE----LVAATGHSKNGALSVFVNCINPQIITSFE 556

Query:   396 LQGIKGMWSL------RSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQ 449
             L G   +W++      +SS +D  D F+++S  + T  L +            GF     
Sbjct:   557 LDGCLDVWTVFDDATKKSSRNDQHD-FMLLSQRNST--LVLQTGQEINEIENTGFTVNQP 613

Query:   450 TLFCHDAIYNQ-LVQVTSGSVRLVSSTSRELRN 481
             T+F  +    + +VQVT+  VRL+  T R ++N
Sbjct:   614 TIFVGNLGQQRFIVQVTTRHVRLLQGT-RLIQN 645

 Score = 83 (34.3 bits), Expect = 3.6e-23, Sum P(7) = 3.6e-23
 Identities = 39/165 (23%), Positives = 71/165 (43%)

Query:    41 EIHLLTPQGLQPMLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELI 100
             E+ L     L+ +    +YG + +L+     G  +D L I+ +  K  VLQ D ++  L 
Sbjct:    59 EMRLAPKMRLECLATYTLYGNVMSLQCVSLAGAMRDALLISFKDAKLSVLQHDPDTFALK 118

Query:   101 TRAMGDVSDRIGRPTDNGQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNI 156
             T ++    +   R    G+  +    +DPD R   + +Y     V+PF     L      
Sbjct:   119 TLSLHYFEEDDIRGGWTGRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSL------ 172

Query:   157 RLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKD 201
               +E+++ D+K +     PT +V          + +Y +AL+D D
Sbjct:   173 --DEIELADVKPIKKA--PTAMV-----SRTPIMASYLIALRDLD 208

 Score = 76 (31.8 bits), Expect = 1.7e-22, Sum P(7) = 1.7e-22
 Identities = 23/87 (26%), Positives = 44/87 (50%)

Query:   163 VLDIKFLYGCAKPTIVVLYQDNKDAR-HVKTYE-------VALKDKDFVEGP--WSQNNL 212
             VLDI+FL+G  +PT+++LY+  +     +K          ++L  +  V  P  W+ N+L
Sbjct:   214 VLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAISLNIQQRVH-PIIWTVNSL 272

Query:   213 DNGADLLIPVPPPLCGVLIIGEETIVY 239
                   + P+  P+ G L++    ++Y
Sbjct:   273 PFDCLQVYPIQKPIGGCLVMTVNAVIY 299

 Score = 44 (20.5 bits), Expect = 3.6e-23, Sum P(7) = 3.6e-23
 Identities = 14/66 (21%), Positives = 27/66 (40%)

Query:   669 EVSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDIQKLH----IRSIPLGEHPRRICHQEQ 723
             +V     FN+   P+  L      EL I  +           +R +PL   PR++ +  +
Sbjct:   984 DVRSFAAFNNVNIPNGFLYFDTTYELKISVLPSYLSYDSVWPVRKVPLRCTPRQLVYHRE 1043

Query:   724 SRTFAI 729
             +R + +
Sbjct:  1044 NRVYCL 1049

 Score = 42 (19.8 bits), Expect = 3.6e-23, Sum P(7) = 3.6e-23
 Identities = 9/37 (24%), Positives = 18/37 (48%)

Query:    11 HKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTP 47
             H  T V  S    F +  + NL++A    ++++ + P
Sbjct:     9 HSATAVEFSIACRFFNNLDENLVVAGANVLKVYRIAP 45

 Score = 40 (19.1 bits), Expect = 3.6e-23, Sum P(7) = 3.6e-23
 Identities = 8/25 (32%), Positives = 15/25 (60%)

Query:   302 IASTISYLDNAVVYIGSSYGDSQLI 326
             + S I  L +  +++GS  G+S L+
Sbjct:   381 LTSCICVLHSEYIFLGSRLGNSLLL 405

 Score = 40 (19.1 bits), Expect = 3.6e-23, Sum P(7) = 3.6e-23
 Identities = 10/28 (35%), Positives = 16/28 (57%)

Query:   544 PSYSQIAAVGMWTDISVRIFSLPDLNLI 571
             PSY  + A    T   + I+S+PD+ L+
Sbjct:   790 PSYWLVVARQSGT---LEIYSMPDMKLV 814

 Score = 38 (18.4 bits), Expect = 1.8e-18, Sum P(6) = 1.8e-18
 Identities = 7/21 (33%), Positives = 11/21 (52%)

Query:   234 EETIVYCSANAFKAIPIRPSI 254
             +E +V   AN  K   I P++
Sbjct:    27 DENLVVAGANVLKVYRIAPNV 47


>UNIPROTKB|F1PC28 [details] [associations]
            symbol:CPSF1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0006379 "mRNA cleavage" evidence=IEA]
            [GO:0006378 "mRNA polyadenylation" evidence=IEA] [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=IEA] [GO:0003730 "mRNA 3'-UTR binding" evidence=IEA]
            InterPro:IPR004871 Pfam:PF03178 GO:GO:0006378 GO:GO:0003730
            GO:GO:0005847 GO:GO:0006379 GeneTree:ENSGT00550000075040
            OMA:NIGDNRY EMBL:AAEX03008966 ProteinModelPortal:F1PC28
            Ensembl:ENSCAFT00000002514 Uniprot:F1PC28
        Length = 1398

 Score = 229 (85.7 bits), Expect = 2.1e-22, Sum P(6) = 2.1e-22
 Identities = 100/376 (26%), Positives = 171/376 (45%)

Query:   740 ESEMHFVRLLDDQTFEFI--STYPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVL 792
             + E   ++L+   ++E I  +   L+ +E+   + + S   +  V     Y   GT  + 
Sbjct:  1025 QQEAFSIQLISPVSWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQ 1084

Query:   793 PEENEPTKGRILVFIV-----EDG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQ 841
              EE    +GRIL+  V     E G      K +++ EKE KG V +L   NG L++AI Q
Sbjct:  1085 GEE-VTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQ 1143

Query:   842 KIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAI 901
             KI L  W LR     EL         +    + +  +FI+  D+MKSISLL Y+ E   +
Sbjct:  1144 KIFL--WSLR---ASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTL 1198

Query:   902 EERARDYNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYH 958
                +RD     + +V+ + D+  LG   ++ + NL       E        RL    ++H
Sbjct:  1199 SLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFH 1258

Query:   959 LGEFVNRF-R---HGSLVMRLPDSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQ 1013
             +G  VN F R    G+       S V +   +  F T++G IG++  +  + Y  L  LQ
Sbjct:  1259 VGAHVNTFWRTPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQ 1318

Query:  1014 TNLRKVIKGVGGLNHEQWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMN 1071
               L  ++    GLN   +R  + +++ +    +N LDG+L+  +L LS     E++K + 
Sbjct:  1319 NALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIG 1378

Query:  1072 VSVEELCKRVEELTRL 1087
              + + +   + E  R+
Sbjct:  1379 TTPDIILDDLLETDRV 1394

 Score = 80 (33.2 bits), Expect = 2.1e-22, Sum P(6) = 2.1e-22
 Identities = 21/88 (23%), Positives = 44/88 (50%)

Query:   161 LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG--P--WSQNN 211
             L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   P  WS  +
Sbjct:   158 LNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTS 217

Query:   212 LDNGADLLIPVPPPLCGVLIIGEETIVY 239
             L       + VP P+ GV++    +++Y
Sbjct:   218 LPFDCTQALAVPKPIGGVVVFAVNSLLY 245

 Score = 71 (30.1 bits), Expect = 2.1e-22, Sum P(6) = 2.1e-22
 Identities = 20/75 (26%), Positives = 37/75 (49%)

Query:   340 EVLERYVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINE 390
             EV +  +N+GP  +  + +   L  + Q       ++V CSG  K+G+L +++  I    
Sbjct:   420 EVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQV 479

Query:   391 QASVELQGIKGMWSL 405
               + EL G   MW++
Sbjct:   480 VTTFELPGCYDMWTV 494

 Score = 61 (26.5 bits), Expect = 2.1e-22, Sum P(6) = 2.1e-22
 Identities = 22/122 (18%), Positives = 54/122 (44%)

Query:    36 KCTRIEIHLLTPQGLQPMLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAE 95
             + T  + H    + L+ +     +G + ++   +  G  +D L ++ +  K  V+++D  
Sbjct:     9 RSTEGKAHREHREKLELVASFSFFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPG 68

Query:    96 SSELITRAMG-----DVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQL 150
             + +L T ++      ++ D   +     ++ + DPD R   + +Y     V+PF  +   
Sbjct:    69 THDLKTLSLHYFEEPELRDGFVQNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLA 127

Query:   151 KE 152
             +E
Sbjct:   128 EE 129

 Score = 56 (24.8 bits), Expect = 2.1e-22, Sum P(6) = 2.1e-22
 Identities = 17/79 (21%), Positives = 31/79 (39%)

Query:   670 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDIQKLH----IRSIPLGEHPRRICHQEQS 724
             +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct:   927 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 986

Query:   725 RTFAICSLKNQSCAEESEM 743
             + +A+ +  N  C     M
Sbjct:   987 KVYAVATSTNMPCTRIPRM 1005

 Score = 41 (19.5 bits), Expect = 2.1e-22, Sum P(6) = 2.1e-22
 Identities = 9/47 (19%), Positives = 23/47 (48%)

Query:   281 VITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 327
             +IT     V     +    + + +++  ++   +++GS  G+S L+K
Sbjct:   306 LITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLK 352


>UNIPROTKB|F1RSN8 [details] [associations]
            symbol:CPSF1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006378 "mRNA
            polyadenylation" evidence=IEA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA]
            [GO:0003730 "mRNA 3'-UTR binding" evidence=IEA] InterPro:IPR004871
            Pfam:PF03178 GO:GO:0006378 GO:GO:0003730 GO:GO:0005847
            GO:GO:0006379 GeneTree:ENSGT00550000075040 OMA:NIGDNRY
            EMBL:CU468594 Ensembl:ENSSSCT00000006486 Uniprot:F1RSN8
        Length = 1108

 Score = 229 (85.7 bits), Expect = 5.8e-22, Sum P(6) = 5.8e-22
 Identities = 100/376 (26%), Positives = 171/376 (45%)

Query:   740 ESEMHFVRLLDDQTFEFI--STYPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVL 792
             + E   ++L+   ++E I  +   L+ +E+   + + S   +  V     Y   GT  + 
Sbjct:   735 QQEAFSIQLISPVSWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQ 794

Query:   793 PEENEPTKGRILVFIV-----EDG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQ 841
              EE    +GRIL+  V     E G      K +++ EKE KG V +L   NG L++AI Q
Sbjct:   795 GEE-VTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQ 853

Query:   842 KIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAI 901
             KI L  W LR     EL         +    + +  +FI+  D+MKSISLL Y+ E   +
Sbjct:   854 KIFL--WSLR---ASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTL 908

Query:   902 EERARDYNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYH 958
                +RD     + +V+ + D+  LG   ++ + NL       E        RL    ++H
Sbjct:   909 SLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFH 968

Query:   959 LGEFVNRF-R---HGSLVMRLPDSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQ 1013
             +G  VN F R    G+       S V +   +  F T++G IG++  +  + Y  L  LQ
Sbjct:   969 VGAHVNTFWRTPCRGATDGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQ 1028

Query:  1014 TNLRKVIKGVGGLNHEQWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMN 1071
               L  ++    GLN   +R  + +++ +    +N LDG+L+  +L LS     E++K + 
Sbjct:  1029 NALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIG 1088

Query:  1072 VSVEELCKRVEELTRL 1087
              + + +   + E  R+
Sbjct:  1089 TTPDIILDDLLETDRV 1104

 Score = 81 (33.6 bits), Expect = 5.8e-22, Sum P(6) = 5.8e-22
 Identities = 22/88 (25%), Positives = 44/88 (50%)

Query:   161 LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG--P--WSQNN 211
             L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   P  WS  +
Sbjct:   203 LNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTS 262

Query:   212 LDNGADLLIPVPPPLCGVLIIGEETIVY 239
             L       + VP P+ GV+I    +++Y
Sbjct:   263 LPFDCTQALAVPKPIGGVVIFAVNSLLY 290

 Score = 60 (26.2 bits), Expect = 5.8e-22, Sum P(6) = 5.8e-22
 Identities = 13/43 (30%), Positives = 22/43 (51%)

Query:    10 AHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQP 52
             AH PT +  S   NF +  E NL++A  +++ ++ L      P
Sbjct:     8 AHPPTGLEFSMYCNFFNNSERNLVVAGTSQLYVYRLNRDAEAP 50

 Score = 55 (24.4 bits), Expect = 5.8e-22, Sum P(6) = 5.8e-22
 Identities = 17/79 (21%), Positives = 31/79 (39%)

Query:   670 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDIQKLH----IRSIPLGEHPRRICHQEQS 724
             +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct:   637 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 696

Query:   725 RTFAICSLKNQSCAEESEM 743
             + +A+ +  N  C     M
Sbjct:   697 KVYAVATSTNTPCTRIPRM 715

 Score = 53 (23.7 bits), Expect = 5.8e-22, Sum P(6) = 5.8e-22
 Identities = 18/86 (20%), Positives = 40/86 (46%)

Query:    72 GEAQDFLFIATERYKFCVLQWDAESSELITRAMG-----DVSDRIGRPTDNGQIGIIDPD 126
             G  +D L ++ +  K  V+++D  + +L T ++      ++ D   +     ++ + DPD
Sbjct:    90 GAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFVQNVHTPRVRV-DPD 148

Query:   127 CRLIGLHLYDGLFKVIPFDNKGQLKE 152
              R   + +Y     V+PF  +   +E
Sbjct:   149 GRCAAMLIYGTRLVVLPFRRESLAEE 174

 Score = 48 (22.0 bits), Expect = 5.8e-22, Sum P(6) = 5.8e-22
 Identities = 22/112 (19%), Positives = 43/112 (38%)

Query:   467 GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHA 526
             G V + +  S    N+   P G ++N  T   +   L T  G  + L+         +  
Sbjct:   278 GGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCAQAAFISSQDV 337

Query:   527 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG 578
                 + +     P    P  S++ A+ ++ D+S  +F+        ++ LGG
Sbjct:   338 ARPPDPAAAPTEPRPPPPQQSKVIALCVYRDVS-GMFTTESRLGGARDELGG 388


>CGD|CAL0004426 [details] [associations]
            symbol:orf19.5391 species:5476 "Candida albicans" [GO:0071004
            "U2-type prespliceosome" evidence=IEA] [GO:0005686 "U2 snRNP"
            evidence=IEA] [GO:0030620 "U2 snRNA binding" evidence=IEA]
            [GO:0000245 "spliceosomal complex assembly" evidence=IEA]
            InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178 CGD:CAL0004426
            GO:GO:0008380 Gene3D:2.130.10.10 GO:GO:0006397 GO:GO:0005681
            GO:GO:0003676 GO:GO:0007049 eggNOG:NOG247734 EMBL:AACQ01000051
            EMBL:AACQ01000050 RefSeq:XP_717672.1 RefSeq:XP_717766.1
            STRING:Q5A7S5 GeneID:3640538 GeneID:3640666 KEGG:cal:CaO19.12846
            KEGG:cal:CaO19.5391 KO:K12830 Uniprot:Q5A7S5
        Length = 1219

 Score = 179 (68.1 bits), Expect = 1.8e-21, Sum P(3) = 1.8e-21
 Identities = 94/370 (25%), Positives = 167/370 (45%)

Query:   746 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAY---VLPEENE 797
             ++++D ++ + I +  LD  E   S+ + SF+  S       +  VG      +LP  N 
Sbjct:   866 IQVVDSKSNQVIQSLQLDGNESIVSMSAVSFNKTSTPSVPASHLVVGVCTNQTILP--NS 923

Query:   798 PTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRE 857
               K  +  F +    LQL+ + E       L  F  KLL A    I+LY     D G ++
Sbjct:   924 YDKSYLYTFKIGKKHLQLVHKTELDHIPQVLENFQDKLLVASGNHIRLY-----DIGQKQ 978

Query:   858 LQSEC----GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE--ERARDYNAN 911
             L  +         +I  +  QT    I++ D  KS S++  K +E   +    A D    
Sbjct:   979 LLKKSTTIIDFSTNINKIIPQTNR--IIICDSHKS-SIVFAKFDESQNQFVPFADDVMKR 1035

Query:   912 WMSAVEILDDDIYLGAENNFNLFTVRKN---SEGATDE------ERG-------RLEVVG 955
              ++++  LD D  +G +   N+F  R +   S+ A D+      + G       +L+ + 
Sbjct:  1036 QITSIMNLDIDTLIGGDKFGNIFVTRIDEDISKQADDDWTILKTQDGILNSCPYKLQNLI 1095

Query:   956 EYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQT 1014
             E+H+G+ +  F  G L +   +S       VI+  + G IG++  L  +  +  L  LQ 
Sbjct:  1096 EFHIGDIITSFNLGCLNLAGTES-------VIYTGLQGTIGLLIPLVSKSEVELLFNLQL 1148

Query:  1015 NLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSV 1074
              +++    + G +H + RS+ N       KN +DGDL+E FL+   +   EIS+ +N SV
Sbjct:  1149 YMQQSQNNLVGKDHLKLRSYYNP-----IKNVIDGDLLERFLEFDISLKIEISRKLNKSV 1203

Query:  1075 EELCKRVEEL 1084
              ++ K++ +L
Sbjct:  1204 NDIEKKLIDL 1213

 Score = 141 (54.7 bits), Expect = 1.8e-21, Sum P(3) = 1.8e-21
 Identities = 69/323 (21%), Positives = 136/323 (42%)

Query:   339 VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 398
             ++VLE    L PI D  ++D       ++VT S       ++ + +G+         L  
Sbjct:   433 IDVLE---TLSPITDSKIID------SKLVTLSS---HSYVKSITHGVPTTTLVESPLPI 480

Query:   399 IK-GMWSLRSSTDDPFDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQTLFCHDAI 457
                 +++ + S +   D +LV+S    ++ L +++           F     T+      
Sbjct:   481 TPTDIFTTKLSLESANDEYLVISSSLSSKTLVLSIGEVVEDVEDSEFVLDQPTIAVQQVG 540

Query:   458 YNQLVQVTSGSVRLVSSTSRELRN-EWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG 516
                +VQ+ S  ++ V + +   +  +W  P G ++  AT N  QVL+A     +VY EI 
Sbjct:   541 IASVVQIYSNGIKHVRTVNGNKKTTDWFPPAGITITHATTNNQQVLIALSNLSVVYFEI- 599

Query:   517 DGILTEVKHAQLEYEISC-LDINPIGENPSY-SQIAAVGMWTDISVRIFSLPDLNLITKE 574
             D    ++   Q   EI+  +    I EN S  S  A +G  +D ++++ SL + N +  +
Sbjct:   600 DATDDQLIEYQDRLEIATTITAMAIQENISEKSPFAIIGC-SDETIQVVSLQEHNCLEIK 658

Query:   575 HLGGEIIPRSVL-LCAFEGI-SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITL 632
              L       S L +    G  +++   + +G      ++   G L++ +   +G++P++L
Sbjct:   659 SLQALSANSSSLKMLKSSGKETHVHIGMENGVYARIKIDTINGNLSNSRVKYIGSKPVSL 718

Query:   633 RTFSSKNTTH-VFAASDRPTVIY 654
                   N    + A S  P + Y
Sbjct:   719 SVIKFSNEIEGILAISSAPWISY 741

 Score = 78 (32.5 bits), Expect = 1.8e-21, Sum P(3) = 1.8e-21
 Identities = 52/253 (20%), Positives = 103/253 (40%)

Query:     2 SIWNYVVTAHKPTNVTHSCVGNF-----TSPQELNLIIAKCTRIEIHLLTPQG--LQPML 54
             S++ Y +T   P+    S VG F     ++     L++   T +++  +  +   L+   
Sbjct:    39 SVYLYNLTLKPPSYYISSIVGQFYKQDNSTKNAQQLVLVSSTTLQLFEINEEAGKLELQS 98

Query:    55 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 114
                + G I ++E      E  D + I ++     +LQ+D ++ + I++    ++      
Sbjct:    99 SQNLLGIINSIEKICL-SEV-DGVVITSDSGNLSILQYDNKTKKFISKIQEPMTKNGWGR 156

Query:   115 TDNGQIGIIDPD--CRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY-- 170
                G+   IDP+  C L+     + LF  I  ++ G  + +  +     QVL +K +   
Sbjct:   157 NYVGENLAIDPENRCILVAAMEKNKLFYKIESNSSGSKELSSPLEAHSKQVLCLKIVALN 216

Query:   171 -GCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEG-PWSQNN--LDNGADLLIPVPPPL 226
                  P    L    +    +  YE+       V+  P S N+  L N  + LIP+P  +
Sbjct:   217 TDHNNPLFGALELTPEKKCIINYYELDQGLNHVVKKKPNSSNSDPLPNDVNYLIPLPGHI 276

Query:   227 CGVLIIGEETIVY 239
              G+++ G     Y
Sbjct:   277 GGMVVCGTNWCFY 289


>UNIPROTKB|F8WF81 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9606
            "Homo sapiens" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0016055 "Wnt receptor
            signaling pathway" evidence=IEA] InterPro:IPR004871 Pfam:PF03178
            GO:GO:0005634 GO:GO:0016055 GO:GO:0003684 EMBL:AP003108
            HGNC:HGNC:2717 ChiTaRS:DDB1 EMBL:AP003037 IPI:IPI00942006
            ProteinModelPortal:F8WF81 SMR:F8WF81 Ensembl:ENST00000451943
            ArrayExpress:F8WF81 Bgee:F8WF81 Uniprot:F8WF81
        Length = 127

 Score = 256 (95.2 bits), Expect = 1.2e-20, P = 1.2e-20
 Identities = 54/114 (47%), Positives = 72/114 (63%)

Query:   985 TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK 1044
             +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG + H  WRSF+ E+KT  A 
Sbjct:    14 SVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTEPAT 73

Query:  1045 NFLDGDLIESFLDLSRTRMDEISKTMN----------VSVEELCKRVEELTRLH 1088
              F+DGDLIESFLD+SR +M E+   +            + ++L K VEELTR+H
Sbjct:    74 GFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKVVEELTRIH 127


>RGD|1306406 [details] [associations]
            symbol:Cpsf1 "cleavage and polyadenylation specific factor 1,
            160kDa" species:10116 "Rattus norvegicus" [GO:0003730 "mRNA 3'-UTR
            binding" evidence=IEA;ISO] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA;ISO]
            [GO:0006378 "mRNA polyadenylation" evidence=IEA;ISO] [GO:0006379
            "mRNA cleavage" evidence=IEA;ISO] InterPro:IPR004871 Pfam:PF03178
            RGD:1306406 GO:GO:0005634 GO:GO:0003676 EMBL:CH473950 KO:K14401
            GeneTree:ENSGT00550000075040 CTD:29894 IPI:IPI00949657
            RefSeq:NP_001124043.1 UniGene:Rn.40455 Ensembl:ENSRNOT00000066244
            GeneID:366952 KEGG:rno:366952 UCSC:RGD:1306406 NextBio:690318
            Uniprot:D4A0H5
        Length = 1386

 Score = 161 (61.7 bits), Expect = 1.5e-18, Sum P(8) = 1.5e-18
 Identities = 57/220 (25%), Positives = 104/220 (47%)

Query:   878 DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLF 934
             +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+  LG   ++ + NL 
Sbjct:  1163 NFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLM 1222

Query:   935 TVRKNSEGATDEERGRLEVVGEYHLGEFVNRF-R---HGSLVMRLPDSDVGQIPTVI-FG 989
                   E        RL    ++H+G  VN F R    G+       S + +   +  F 
Sbjct:  1223 VYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGAAEGPSKKSVMWENKHITWFA 1282

Query:   990 TVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD--AKNFL 1047
             T++G IG++  +  + Y  L  LQ  L  ++    GLN   +R  + +++ +    +N L
Sbjct:  1283 TLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVL 1342

Query:  1048 DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1087
             DG+L+  +L LS     E++K +  + + +   + E  R+
Sbjct:  1343 DGELLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDRV 1382

 Score = 82 (33.9 bits), Expect = 1.5e-18, Sum P(8) = 1.5e-18
 Identities = 22/88 (25%), Positives = 44/88 (50%)

Query:   161 LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG--P--WSQNN 211
             L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   P  WS  +
Sbjct:   201 LNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTS 260

Query:   212 LDNGADLLIPVPPPLCGVLIIGEETIVY 239
             L       + VP P+ GV+I    +++Y
Sbjct:   261 LPFDCTQALAVPKPIGGVVIFAVNSLLY 288

 Score = 74 (31.1 bits), Expect = 1.5e-18, Sum P(8) = 1.5e-18
 Identities = 20/73 (27%), Positives = 36/73 (49%)

Query:   340 EVLERYVNLGPIVDFCVVD---LERQGQGQ----VVTCSGAYKDGSLRIVRNGIGINEQA 392
             EV +  +N+GP  +  V +   L  +   +    +V CSG  K+G+L +++  I      
Sbjct:   461 EVCDSMLNIGPCANAAVGEPAFLSEENSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVT 520

Query:   393 SVELQGIKGMWSL 405
             + EL G   MW++
Sbjct:   521 TFELPGCYDMWTV 533

 Score = 60 (26.2 bits), Expect = 1.5e-18, Sum P(8) = 1.5e-18
 Identities = 20/108 (18%), Positives = 49/108 (45%)

Query:    50 LQPMLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMG---- 105
             L+ +     +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++     
Sbjct:    66 LELVASFSFFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEE 125

Query:   106 -DVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKE 152
              ++ D   +     ++ + DPD R   + +Y     V+PF  +   +E
Sbjct:   126 PELRDGFVQNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEE 172

 Score = 56 (24.8 bits), Expect = 1.5e-18, Sum P(8) = 1.5e-18
 Identities = 11/36 (30%), Positives = 21/36 (58%)

Query:    10 AHKPTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLL 45
             AH PT +  +   NF +  E NL++A  +++ ++ L
Sbjct:     8 AHPPTGLEFAMYCNFFNNSERNLVVAGTSQLYVYRL 43

 Score = 55 (24.4 bits), Expect = 1.5e-18, Sum P(8) = 1.5e-18
 Identities = 17/79 (21%), Positives = 31/79 (39%)

Query:   670 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDIQKLH----IRSIPLGEHPRRICHQEQS 724
             +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct:   966 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1025

Query:   725 RTFAICSLKNQSCAEESEM 743
             + +A+ +  N  C     M
Sbjct:  1026 KVYAVATSTNTPCTRIPRM 1044

 Score = 51 (23.0 bits), Expect = 1.5e-18, Sum P(8) = 1.5e-18
 Identities = 24/117 (20%), Positives = 55/117 (47%)

Query:   740 ESEMHFVRLLDDQTFEFI--STYPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVL 792
             + E   ++L+   ++E I  +   L+ +E+   + + S   +  V     Y   GT  + 
Sbjct:  1064 QQEAFSIQLISPVSWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQ 1123

Query:   793 PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAA-INQKIQLYKW 848
              EE    +GRI ++ +   +L  +A  +T+  ++ + +    +LAA + + I L ++
Sbjct:  1124 GEE-VTCRGRIFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRY 1179

 Score = 46 (21.3 bits), Expect = 0.00021, Sum P(6) = 0.00021
 Identities = 12/28 (42%), Positives = 16/28 (57%)

Query:   602 DGHLLNFLLNMKTGELTDRKKVSLGTQP 629
             DG LLN  L + T E ++  K  +GT P
Sbjct:  1343 DGELLNRYLYLSTMERSELAK-KIGTTP 1369

 Score = 42 (19.8 bits), Expect = 1.5e-18, Sum P(8) = 1.5e-18
 Identities = 13/66 (19%), Positives = 29/66 (43%)

Query:   281 VITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLN--LQPDAKGSY 338
             +IT     V     +    + + +++  ++   +++GS  G+S L+K    LQ     S 
Sbjct:   349 LITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSV 408

Query:   339 VEVLER 344
              E  ++
Sbjct:   409 REAADK 414


>UNIPROTKB|J9P418 [details] [associations]
            symbol:CPSF1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0005634 "nucleus" evidence=IEA] [GO:0003676
            "nucleic acid binding" evidence=IEA] InterPro:IPR004871
            Pfam:PF03178 GO:GO:0005634 GO:GO:0003676
            GeneTree:ENSGT00550000075040 EMBL:AAEX03008966
            Ensembl:ENSCAFT00000043656 Uniprot:J9P418
        Length = 1107

 Score = 229 (85.7 bits), Expect = 3.5e-17, Sum P(4) = 3.5e-17
 Identities = 100/376 (26%), Positives = 171/376 (45%)

Query:   740 ESEMHFVRLLDDQTFEFI--STYPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVL 792
             + E   ++L+   ++E I  +   L+ +E+   + + S   +  V     Y   GT  + 
Sbjct:   734 QQEAFSIQLISPVSWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQ 793

Query:   793 PEENEPTKGRILVFIV-----EDG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQ 841
              EE    +GRIL+  V     E G      K +++ EKE KG V +L   NG L++AI Q
Sbjct:   794 GEE-VTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQ 852

Query:   842 KIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAI 901
             KI L  W LR     EL         +    + +  +FI+  D+MKSISLL Y+ E   +
Sbjct:   853 KIFL--WSLR---ASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTL 907

Query:   902 EERARDYNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYH 958
                +RD     + +V+ + D+  LG   ++ + NL       E        RL    ++H
Sbjct:   908 SLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFH 967

Query:   959 LGEFVNRF-R---HGSLVMRLPDSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQ 1013
             +G  VN F R    G+       S V +   +  F T++G IG++  +  + Y  L  LQ
Sbjct:   968 VGAHVNTFWRTPCRGAAEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQ 1027

Query:  1014 TNLRKVIKGVGGLNHEQWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMN 1071
               L  ++    GLN   +R  + +++ +    +N LDG+L+  +L LS     E++K + 
Sbjct:  1028 NALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIG 1087

Query:  1072 VSVEELCKRVEELTRL 1087
              + + +   + E  R+
Sbjct:  1088 TTPDIILDDLLETDRV 1103

 Score = 71 (30.1 bits), Expect = 3.5e-17, Sum P(4) = 3.5e-17
 Identities = 20/75 (26%), Positives = 37/75 (49%)

Query:   340 EVLERYVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINE 390
             EV +  +N+GP  +  + +   L  + Q       ++V CSG  K+G+L +++  I    
Sbjct:   129 EVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQV 188

Query:   391 QASVELQGIKGMWSL 405
               + EL G   MW++
Sbjct:   189 VTTFELPGCYDMWTV 203

 Score = 56 (24.8 bits), Expect = 3.5e-17, Sum P(4) = 3.5e-17
 Identities = 17/79 (21%), Positives = 31/79 (39%)

Query:   670 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDIQKLH----IRSIPLGEHPRRICHQEQS 724
             +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct:   636 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 695

Query:   725 RTFAICSLKNQSCAEESEM 743
             + +A+ +  N  C     M
Sbjct:   696 KVYAVATSTNMPCTRIPRM 714

 Score = 41 (19.5 bits), Expect = 3.5e-17, Sum P(4) = 3.5e-17
 Identities = 9/47 (19%), Positives = 23/47 (48%)

Query:   281 VITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 327
             +IT     V     +    + + +++  ++   +++GS  G+S L+K
Sbjct:    15 LITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLK 61


>UNIPROTKB|F1NZF7 [details] [associations]
            symbol:SF3B3 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR004871 Pfam:PF03178
            GO:GO:0005634 GO:GO:0003676 GeneTree:ENSGT00530000063396
            EMBL:AADN02051593 EMBL:AADN02051594 IPI:IPI00819465
            Ensembl:ENSGALT00000040057 ArrayExpress:F1NZF7 Uniprot:F1NZF7
        Length = 504

 Score = 234 (87.4 bits), Expect = 4.9e-16, P = 4.9e-16
 Identities = 86/364 (23%), Positives = 167/364 (45%)

Query:   746 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRI 803
             +R+++      +    L+  E   S+  C FS+    +Y  VG A  ++        G +
Sbjct:   152 IRVMNPIQGNTLDLVQLEQNEAAFSVAVCRFSNTGEEWYVLVGVAKDLILNPRSVAGGFV 211

Query:   804 LVF--IVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQS 860
               +  +V  G KL+ + +   +    ++  F G++L  + + +++Y     D G ++L  
Sbjct:   212 YTYKLLVNGGEKLEFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLR 266

Query:   861 ECGHHGHILALYV---QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVE 917
             +C  + HI A Y+   QT G  ++V D+ +S   + YK  E  +   A D    W++   
Sbjct:   267 KC-ENKHI-ANYICGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTAT 324

Query:   918 ILDDDIYLGAENNFNLFTVR--KNSEGATDEE---------RGRLEVVGEYHLGEFVNRF 966
             +LD D   GA+   N+  VR   N+    DE+         RG L   G     E +  +
Sbjct:   325 LLDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLN--GASQKAEVIMNY 382

Query:   967 RHGSLVMRLPDSDV--GQIPTVIFGTVNGVIGVIASLP-HEQYLFLEKLQTNLRKVIKGV 1023
               G  V+ L  + +  G   ++++ T++G IG++     HE + F + ++ +LR     +
Sbjct:   383 HVGETVLSLQKTTLIPGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPL 442

Query:  1024 GGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEE 1083
              G +H  +RS+         KN +DGDL E F  +   +   +++ ++ +  E+ K++E+
Sbjct:   443 CGRDHLSFRSYY-----FPVKNVIDGDLCEQFNSMEPNKQKNVAEELDRTPPEVSKKLED 497

Query:  1084 L-TR 1086
             + TR
Sbjct:   498 IRTR 501


>UNIPROTKB|K7GNU1 [details] [associations]
            symbol:CPSF1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005634 "nucleus" evidence=IEA] [GO:0003676 "nucleic
            acid binding" evidence=IEA] InterPro:IPR004871 Pfam:PF03178
            GeneTree:ENSGT00550000075040 EMBL:CU468594
            Ensembl:ENSSSCT00000033207 Uniprot:K7GNU1
        Length = 757

 Score = 229 (85.7 bits), Expect = 1.6e-15, Sum P(2) = 1.6e-15
 Identities = 100/376 (26%), Positives = 171/376 (45%)

Query:   740 ESEMHFVRLLDDQTFEFI--STYPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVL 792
             + E   ++L+   ++E I  +   L+ +E+   + + S   +  V     Y   GT  + 
Sbjct:   384 QQEAFSIQLISPVSWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQ 443

Query:   793 PEENEPTKGRILVFIV-----EDG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQ 841
              EE    +GRIL+  V     E G      K +++ EKE KG V +L   NG L++AI Q
Sbjct:   444 GEE-VTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQ 502

Query:   842 KIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAI 901
             KI L  W LR     EL         +    + +  +FI+  D+MKSISLL Y+ E   +
Sbjct:   503 KIFL--WSLR---ASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTL 557

Query:   902 EERARDYNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYH 958
                +RD     + +V+ + D+  LG   ++ + NL       E        RL    ++H
Sbjct:   558 SLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFH 617

Query:   959 LGEFVNRF-R---HGSLVMRLPDSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQ 1013
             +G  VN F R    G+       S V +   +  F T++G IG++  +  + Y  L  LQ
Sbjct:   618 VGAHVNTFWRTPCRGATDGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQ 677

Query:  1014 TNLRKVIKGVGGLNHEQWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMN 1071
               L  ++    GLN   +R  + +++ +    +N LDG+L+  +L LS     E++K + 
Sbjct:   678 NALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIG 737

Query:  1072 VSVEELCKRVEELTRL 1087
              + + +   + E  R+
Sbjct:   738 TTPDIILDDLLETDRV 753

 Score = 55 (24.4 bits), Expect = 1.6e-15, Sum P(2) = 1.6e-15
 Identities = 17/79 (21%), Positives = 31/79 (39%)

Query:   670 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDIQKLH----IRSIPLGEHPRRICHQEQS 724
             +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct:   286 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 345

Query:   725 RTFAICSLKNQSCAEESEM 743
             + +A+ +  N  C     M
Sbjct:   346 KVYAVATSTNTPCTRIPRM 364


>UNIPROTKB|E1C725 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9031
            "Gallus gallus" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0000075 "cell cycle
            checkpoint" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA]
            [GO:0016055 "Wnt receptor signaling pathway" evidence=IEA]
            [GO:0031464 "Cul4A-RING ubiquitin ligase complex" evidence=IEA]
            [GO:0031465 "Cul4B-RING ubiquitin ligase complex" evidence=IEA]
            [GO:0042787 "protein ubiquitination involved in ubiquitin-dependent
            protein catabolic process" evidence=IEA] [GO:0043161 "proteasomal
            ubiquitin-dependent protein catabolic process" evidence=IEA]
            InterPro:IPR004871 Pfam:PF03178 GO:GO:0005634 GO:GO:0005737
            GO:GO:0043161 GO:GO:0016055 GO:GO:0003676 GO:GO:0042787
            GO:GO:0000075 GO:GO:0031464 GO:GO:0031465
            GeneTree:ENSGT00530000063396 EMBL:AADN02017118 EMBL:AADN02017119
            IPI:IPI00820296 Ensembl:ENSGALT00000040602 ArrayExpress:E1C725
            Uniprot:E1C725
        Length = 90

 Score = 188 (71.2 bits), Expect = 2.1e-13, P = 2.1e-13
 Identities = 42/90 (46%), Positives = 56/90 (62%)

Query:  1012 LQTNLRKVIKGVGGLNHE---QWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISK 1068
             +Q  L KVIK VG + H     WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+  
Sbjct:     1 MQNRLNKVIKSVGKIEHSLYATWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVA 60

Query:  1069 TMNV----------SVEELCKRVEELTRLH 1088
              + +          +V++L K VEELTR+H
Sbjct:    61 NLQIDDGSGMKREATVDDLIKIVEELTRIH 90


>POMBASE|SPBC1709.08 [details] [associations]
            symbol:cft1 "cleavage factor one Cft1 (predicted)"
            species:4896 "Schizosaccharomyces pombe" [GO:0003723 "RNA binding"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IDA] [GO:0005829
            "cytosol" evidence=IDA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IDA]
            [GO:0005849 "mRNA cleavage factor complex" evidence=ISO]
            [GO:0006378 "mRNA polyadenylation" evidence=ISO] [GO:0006379 "mRNA
            cleavage" evidence=ISO] InterPro:IPR004871 Pfam:PF03178
            PomBase:SPBC1709.08 GO:GO:0005829 EMBL:CU329671 GO:GO:0006378
            GenomeReviews:CU329671_GR GO:GO:0003723 eggNOG:COG5161 KO:K14401
            OMA:HNDRIFQ OrthoDB:EOG451HZS PIR:T39636 RefSeq:NP_595441.1
            STRING:O74733 EnsemblFungi:SPBC1709.08.1 GeneID:2539694
            KEGG:spo:SPBC1709.08 NextBio:20800847 GO:GO:0005847 GO:GO:0006379
            Uniprot:O74733
        Length = 1441

 Score = 140 (54.3 bits), Expect = 2.0e-12, Sum P(6) = 2.0e-12
 Identities = 75/362 (20%), Positives = 154/362 (42%)

Query:   748 LLDDQTFEFISTYPLDTFEYGCSI--LSCSFSDDSNV---YYCVGTAYVLPEENEPTKGR 802
             L+   T+  I +Y    FE   S+  ++   S+ + +   Y  VGT+ +   E+   +G 
Sbjct:  1079 LVSPLTWTVIDSYEFQQFEIPLSVALVNLEVSETTKLRKPYIAVGTS-ITKGEDIAVRGS 1137

Query:   803 ILVFIVED-----G------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLR 851
               +F + D     G      KL+L+  +E KG V  +   +G LL+   QK+ + + +  
Sbjct:  1138 TYLFEIIDVVPQPGRPETRHKLKLVTREEIKGTVAVVCEVDGYLLSGQGQKV-IVRALED 1196

Query:   852 DDGTRELQS-ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 910
             +D    +   + G +     L  +   + ++ GD+ ++++ + +  E   +   ++   A
Sbjct:  1197 EDHLVGVSFIDLGSY----TLSAKCLRNLLLFGDVRQNVTFVGFAEEPYRMTLFSKGQEA 1252

Query:   911 NWMSAVEIL--DDDIY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 967
               +SA + L   +++Y + A+ + NL  +  + E        RL   G++H+G  +    
Sbjct:  1253 LNVSAADFLVQGENLYFVVADTSGNLRLLAYDPENPESHSGERLVTRGDFHIGNVITAMT 1312

Query:   968 HGSLVMRLPDSDVGQIPTVIFGTV----NGVIGVIASLPHEQYLFLEKLQTNLRKVIKGV 1023
                   +  +++ G      F  V    +G + ++  +    Y  L  +Q  L   +  +
Sbjct:  1313 ILPKEKKHQNAEYGYDTGDDFSCVMVNSDGGLQMLVPISDRVYRRLNIIQNYLANRVNTI 1372

Query:  1024 GGLNHEQWRSFNNEKK-TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVE 1082
             GGLN + +R   +    T   +  LDG LI+ F  +S     E++    V V  +   + 
Sbjct:  1373 GGLNPKSYRLITSPSNLTNPTRRILDGMLIDYFTYMSVAHRHEMAHKCGVPVSTIMNDLV 1432

Query:  1083 EL 1084
             EL
Sbjct:  1433 EL 1434

 Score = 81 (33.6 bits), Expect = 2.0e-12, Sum P(6) = 2.0e-12
 Identities = 25/100 (25%), Positives = 44/100 (44%)

Query:    50 LQPMLDVPIYGRIATLELFRPHGE-AQDFLFIATERYKFCVLQWDAESSELITRAMG--- 105
             L+ +  V ++G I  +   +  G    D L + T+  K   L+WD +S   +T ++    
Sbjct:    92 LRLVSQVKVFGTITEISALKGKGSNGCDLLIMLTDYAKVSTLEWDMQSQSFVTNSLHYYE 151

Query:   106 DV-SDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF 144
             DV S  I       Q+ ++DPD     L     +  +IP+
Sbjct:   152 DVKSSNICSSHTPTQL-LVDPDSDCCLLRFLTDMMAIIPY 190

 Score = 76 (31.8 bits), Expect = 2.0e-12, Sum P(6) = 2.0e-12
 Identities = 27/92 (29%), Positives = 45/92 (48%)

Query:   162 QVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLI- 220
             ++LD+KFLYG  +PT+ +LY   + +    T  + L+ KD V       +L+  A  +I 
Sbjct:   231 RILDVKFLYGYREPTLAILYSPEQTS----TVTLPLR-KDTVLFSLVTLDLEQRASAVIT 285

Query:   221 -----P--------VPPPLCGVLIIGEETIVY 239
                  P        +P PL G L++G   ++Y
Sbjct:   286 TIQSLPYDIYASVSIPTPLGGSLLLGGNELIY 317

 Score = 59 (25.8 bits), Expect = 2.0e-12, Sum P(6) = 2.0e-12
 Identities = 19/78 (24%), Positives = 33/78 (42%)

Query:   339 VEVLERYVNLGPIVDFCV--------VDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE 390
             +E+ +   N+GPI DF V           +  G  ++V  +GA   G L + R  I    
Sbjct:   490 LEICDVLTNIGPITDFAVGKAGSYSYFPQDNHGPLELVGTAGADGAGGLVVFRRNIFPLI 549

Query:   391 QASVELQGIKGMWSLRSS 408
                 +  G + +W++  S
Sbjct:   550 AGEFQFDGCEALWTVSIS 567

 Score = 44 (20.5 bits), Expect = 2.0e-12, Sum P(6) = 2.0e-12
 Identities = 28/126 (22%), Positives = 56/126 (44%)

Query:   393 SVELQG-IKGMWS-LRSSTDDP-FDTFLVVSFISETRILAMNLXXXXXXXXXXGFCSQTQ 449
             +V + G ++ M S +++   +P  +T+LV+S   E+ I                F   ++
Sbjct:   563 TVSISGKLRNMKSRIQAQYSNPELETYLVLSKEKESFIFLAG--ETFDEVQHSDFSKDSK 620

Query:   450 TLFCHDAIYN-QLVQVTSGSVRLVSSTSR--ELRNEWKSPPGYSVNVATANASQVLLATG 506
             TL     +   ++VQ+   S+R+  S  R  +L N  K      V+ +  +   +++  G
Sbjct:   621 TLNVGSLLSGMRMVQICPTSLRVYDSNLRLTQLFNFSKKQ--IVVSTSICDPCIIVVFLG 678

Query:   507 GGHLVY 512
             GG  +Y
Sbjct:   679 GGIALY 684

 Score = 43 (20.2 bits), Expect = 2.0e-12, Sum P(6) = 2.0e-12
 Identities = 11/38 (28%), Positives = 20/38 (52%)

Query:    14 TNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQ 51
             T + ++  G FTS    NL+++K     +HL   + +Q
Sbjct:    12 TVIKNAVQGQFTSLVSNNLVVSKVN--SLHLFEIEKIQ 47


>SGD|S000004513 [details] [associations]
            symbol:RSE1 "Protein involved in pre-mRNA splicing"
            species:4932 "Saccharomyces cerevisiae" [GO:0005686 "U2 snRNP"
            evidence=IDA;IPI] [GO:0000245 "spliceosomal complex assembly"
            evidence=IDA] [GO:0000398 "mRNA splicing, via spliceosome"
            evidence=IGI;IMP;IPI] [GO:0007049 "cell cycle" evidence=IEA]
            [GO:0008380 "RNA splicing" evidence=IEA] [GO:0005681 "spliceosomal
            complex" evidence=IEA] [GO:0006397 "mRNA processing" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0030620 "U2 snRNA binding" evidence=IPI]
            [GO:0071004 "U2-type prespliceosome" evidence=IDA]
            InterPro:IPR004871 Pfam:PF03178 SGD:S000004513 EMBL:BK006946
            GO:GO:0007049 EMBL:Z47816 GO:GO:0000245 GO:GO:0005686 GO:GO:0071004
            GeneTree:ENSGT00530000063396 GO:GO:0030620 KO:K12830
            OrthoDB:EOG4FR40R PIR:S50943 RefSeq:NP_013663.1
            ProteinModelPortal:Q04693 DIP:DIP-856N IntAct:Q04693
            MINT:MINT-368995 STRING:Q04693 PaxDb:Q04693 PeptideAtlas:Q04693
            EnsemblFungi:YML049C GeneID:854956 KEGG:sce:YML049C CYGD:YML049c
            eggNOG:KOG1898 HOGENOM:HOG000066036 OMA:DIHESVT NextBio:978033
            Genevestigator:Q04693 GermOnline:YML049C Uniprot:Q04693
        Length = 1361

 Score = 120 (47.3 bits), Expect = 6.3e-11, Sum P(4) = 6.3e-11
 Identities = 62/323 (19%), Positives = 135/323 (41%)

Query:   328 LNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIG 387
             L  +P  K   + +L + +NL P +   +V         +   +  + +  +  + N + 
Sbjct:   464 LVFEPSIKLQNLSILSQQLNLNPSIKSQIVS-----DSPLSIATKHFTNNKIITLTNAVN 518

Query:   388 INEQASVELQ-GIKGMWSLRS-STDDPFDTFLVVSFISETRILAMN---LXXXXXXXXXX 442
              +   S  L      +W +   +T    +T L ++F  +T IL ++   +          
Sbjct:   519 YSNLISTSLPPNATKLWLIPDPATTGDNNTLLFITFPKKTMILQIDNESMEELTPDEATR 578

Query:   443 GFCSQTQTLFCHDAIY--NQLVQVTSGSVRLVSSTSRE-LRNE--WKSPPGYSVNVATAN 497
                  +Q    H  +   + ++QV +  +R +  T +    N+  W  P G  +  AT++
Sbjct:   579 SAFKLSQDTTIHTCLMGSHSIIQVCTAELRHIVPTGKSRYSNKLTWVPPAGIRIVCATSS 638

Query:   498 ASQVLLATGGGHLVYLEI---GDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGM 554
              +Q++++     LVY +I    D ++    H +L+   S +    I ++  ++ + A+  
Sbjct:   639 KTQLIISLSNYELVYFKIDVSSDSLIELTTHPELDTMPSKV---AIVQDTQHADLLAIAD 695

Query:   555 WTDISVRIFSLPD-----LNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFL 609
               +  ++I SL D     L +I+ + +  +I    ++  +  G   L   L +G  + F 
Sbjct:   696 -NEGMIKIMSLKDQKEDFLTVISLQLVSEKISDMIMVRDSSIGQLNLHVGLENGVYMKFH 754

Query:   610 LNMKTGELTDRKKVSLGTQPITL 632
             +    G  TD K+  LG +P++L
Sbjct:   755 IGDVDGSFTDIKRRFLGLKPVSL 777

 Score = 81 (33.6 bits), Expect = 6.3e-11, Sum P(4) = 6.3e-11
 Identities = 19/61 (31%), Positives = 37/61 (60%)

Query:  1025 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM-NVSVEELCKRVEE 1083
             G +H+++RS+    + V     +DGDL E+FL LS    + ++K + +V VE++ + + E
Sbjct:  1301 GRDHQEYRSYYAPVRKV-----IDGDLCENFLRLSLNEQEFLAKNLKSVQVEDIIQTINE 1355

Query:  1084 L 1084
             +
Sbjct:  1356 V 1356

 Score = 80 (33.2 bits), Expect = 6.3e-11, Sum P(4) = 6.3e-11
 Identities = 31/133 (23%), Positives = 62/133 (46%)

Query:   813 LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHI 868
             ++L+ + E    ++++  F   LL A+   I LY    K +LR   T+   S       I
Sbjct:  1019 IELLHQTEIISPIHAMLKFKNFLLTAMGSTIVLYGLGKKQLLRRSVTQTPVSIT----KI 1074

Query:   869 LALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAE 928
             ++++ Q   + + VGD+ +S++L I+            D     ++ ++ LD+   +GA+
Sbjct:  1075 VSMH-QWNYERLAVGDIHESVTLFIWDPAGNVFIPYVDDSVKRHVTVLKFLDEATVIGAD 1133

Query:   929 NNFNLFTVRKNSE 941
                N +T+R   E
Sbjct:  1134 RYGNAWTLRSPPE 1146

 Score = 63 (27.2 bits), Expect = 6.3e-11, Sum P(4) = 6.3e-11
 Identities = 16/47 (34%), Positives = 27/47 (57%)

Query:     3 IWNYVVTAHKPTNVTHSCVGNFT-----SPQELN-LIIAKCTRIEIH 43
             ++ Y +T  K TN  HSC+G+F      S +E + L +A  T +E++
Sbjct:    58 LYLYHLTLKKQTNFVHSCIGHFVDLEAGSKREQSQLCVATETHLELY 104

 Score = 37 (18.1 bits), Expect = 1.0e-06, Sum P(4) = 1.0e-06
 Identities = 10/37 (27%), Positives = 16/37 (43%)

Query:   902 EERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRK 938
             EE      A WMS V       ++      N++T+R+
Sbjct:   805 EEEINSSGAKWMSCVVCHSSSTWVSYTWK-NVWTIRQ 840


>ASPGD|ASPL0000050546 [details] [associations]
            symbol:AN1413 species:162425 "Emericella nidulans"
            [GO:0008150 "biological_process" evidence=ND] [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=IEA] [GO:0005829 "cytosol" evidence=IEA] [GO:0003676
            "nucleic acid binding" evidence=IEA] InterPro:IPR004871
            Pfam:PF03178 GO:GO:0005634 EMBL:BN001307 GO:GO:0006397
            GO:GO:0003723 eggNOG:COG5161 KO:K14401 EMBL:AACD01000022
            RefSeq:XP_659017.1 EnsemblFungi:CADANIAT00008024 GeneID:2875502
            KEGG:ani:AN1413.2 HOGENOM:HOG000048586 OMA:HNDRIFQ
            OrthoDB:EOG451HZS Uniprot:Q5BDG7
        Length = 1339

 Score = 168 (64.2 bits), Expect = 2.4e-09, Sum P(6) = 2.4e-09
 Identities = 69/291 (23%), Positives = 135/291 (46%)

Query:   812 KLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL 869
             +L+LI ++  KGAV +L+   G+  L+AA  QK  +    L++DG+  L          +
Sbjct:  1047 RLKLIGKEPVKGAVTALSEIGGQGFLIAAQGQKSMVRG--LKEDGSL-LPVAFMDMQCFV 1103

Query:   870 ALYVQTRGD-FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD--IYLG 926
             ++  + +G    + GD +K +    Y  E   +   A+D +   + A + L D   +++ 
Sbjct:  1104 SVIKELKGTGMCIFGDAVKGLWFAGYSEEPYKMSLFAKDLDYLEVLAADFLPDGNKLFIV 1163

Query:   927 -AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR--HGSLVMR---LPDSDV 980
              A+++ NL+ ++ + E        +L    ++H G F +       +LV     +  SD 
Sbjct:  1164 VADSDCNLYVLQYDPEDPNSSNGDKLLNRSKFHTGNFASTVTLLPRTLVSSERAMSGSDK 1223

Query:   981 GQIPT------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 1034
               I        V+  + NG IG++  +P E Y  L  LQ+ L   ++   GLN   +R+ 
Sbjct:  1224 MDIDNTAPLHQVLVTSHNGSIGLVTCVPEESYRRLSALQSQLTNTLEHPCGLNPRAYRAV 1283

Query:  1035 NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELT 1085
              ++      +  LD +L+  +LD+S+ R  EI+  +  +  E+   +E ++
Sbjct:  1284 ESDASA--GRGMLDSNLLLQYLDMSKQRKAEIAGRVGATEWEIRADLEAIS 1332

 Score = 59 (25.8 bits), Expect = 2.4e-09, Sum P(6) = 2.4e-09
 Identities = 16/46 (34%), Positives = 23/46 (50%)

Query:    13 PTNVTHSCVGNFTSPQELNLIIAKCTRIEIHLLTPQGLQPMLDVPI 58
             PT VTH+    F S    NLI+A+ + ++I  L    L   LD  +
Sbjct:    10 PTGVTHALAVPFLSATANNLIVARTSLLQIFSLRDVSLSA-LDTEV 54

 Score = 56 (24.8 bits), Expect = 2.4e-09, Sum P(6) = 2.4e-09
 Identities = 21/85 (24%), Positives = 36/85 (42%)

Query:   164 LDIKFLYGCAKPTIVVLYQDNK-------DARHVKTYEVALKDKDFVEGPW--SQNNLDN 214
             + + FLY   +PT  +LY           + + V  Y V   D +        S   L +
Sbjct:   231 ISLAFLYEYREPTFGILYSQVATSHALLHERKDVVFYTVITLDLEQRASTTLLSVTRLPS 290

Query:   215 GADLLIPVPPPLCGVLIIGEETIVY 239
                 ++ +PPP+ G L+IG   +V+
Sbjct:   291 DLFKVVALPPPVGGSLLIGSNELVH 315

 Score = 45 (20.9 bits), Expect = 2.4e-09, Sum P(6) = 2.4e-09
 Identities = 9/41 (21%), Positives = 21/41 (51%)

Query:   706 IRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESEMH 744
             +R++P+G+   ++ +   S T+ +  C        E+ E+H
Sbjct:   907 MRTVPIGQQIDKLTYVSASDTYVLGTCQRCEFRLPEDDELH 947

 Score = 44 (20.5 bits), Expect = 3.0e-09, Sum P(6) = 3.0e-09
 Identities = 11/30 (36%), Positives = 18/30 (60%)

Query:   791 VLPEENEPTKGRILVFIVEDGKLQLIAEKE 820
             +L ++ +P K    VF++ED KL+ I   E
Sbjct:   574 ILSKQEKPDKEESEVFVLED-KLRPITAPE 602

 Score = 40 (19.1 bits), Expect = 2.4e-09, Sum P(6) = 2.4e-09
 Identities = 20/80 (25%), Positives = 30/80 (37%)

Query:    76 DFLFIATERYKFCVLQWDAESSELITRAMG--DVSDRIGRP--TDNGQIGII---DPDCR 128
             D + +A    K  +++WD E   L T ++   +  D    P  +D    G I   DP  R
Sbjct:    94 DAVLVAFRDAKLSLVEWDPERYGLSTISIHYYERDDMTRSPWASDLSTCGSILSADPGSR 153

Query:   129 LIGLHLYDGLFKVIPFDNKG 148
                         +IPF   G
Sbjct:   154 CAIFQFGARSLAIIPFHQPG 173

 Score = 40 (19.1 bits), Expect = 5.9e-09, Sum P(5) = 5.9e-09
 Identities = 12/33 (36%), Positives = 20/33 (60%)

Query:   720 HQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQ 752
             + +Q R + I S + +   EESE+ FV  L+D+
Sbjct:   565 NNDQKRDYVILSKQEKPDKEESEV-FV--LEDK 594

 Score = 39 (18.8 bits), Expect = 2.4e-09, Sum P(6) = 2.4e-09
 Identities = 12/62 (19%), Positives = 29/62 (46%)

Query:   361 RQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW--SLRSSTDDPFDTFLV 418
             + G  ++V   G+ + G++ I++  +     AS+       +W  SL    +D    +++
Sbjct:   515 KDGVLELVAAQGSDEGGTMVIMKREVDPYLVASMAADTANSLWTVSLLPDNNDQKRDYVI 574

Query:   419 VS 420
             +S
Sbjct:   575 LS 576


>UNIPROTKB|F1S419 [details] [associations]
            symbol:LOC100512659 "Uncharacterized protein" species:9823
            "Sus scrofa" [GO:0005634 "nucleus" evidence=IEA] [GO:0003676
            "nucleic acid binding" evidence=IEA] InterPro:IPR004871
            Pfam:PF03178 GO:GO:0005634 GO:GO:0003676
            GeneTree:ENSGT00530000063396 EMBL:CU915803 EMBL:AEMK01191757
            Ensembl:ENSSSCT00000003019 OMA:SHEVIYS Uniprot:F1S419
        Length = 319

 Score = 164 (62.8 bits), Expect = 7.5e-09, P = 7.5e-09
 Identities = 64/283 (22%), Positives = 127/283 (44%)

Query:   746 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRI 803
             +R+++      +    L+  E   S+  C FS+  + +Y  VG A  ++        G +
Sbjct:    44 IRVMNPIQGNTLDLVQLEQNEAAFSVAVCRFSNTGDDWYVLVGVAKDLILNPRSVAGGFV 103

Query:   804 LVF-IVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 861
               + +V +G KL+ + +   +    ++  F G++L  + + +++Y     D G ++L  +
Sbjct:   104 YTYKLVNNGEKLEFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRK 158

Query:   862 CGHHGHILALYV---QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEI 918
             C  + HI A Y+   QT G  ++V D+ +S   + YK  E  +   A D    W++   +
Sbjct:   159 C-ENKHI-ANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASL 216

Query:   919 LDDDIYLGAENNFNLFTVR--KNSEGATDEE---------RGRLEVVGEYHLGEFVNRFR 967
             LD D   GA+   N+  VR   N+    DE+         RG L   G     E +  + 
Sbjct:   217 LDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLN--GASQKAEVIMNYH 274

Query:   968 HGSLVMRLPDSDV--GQIPTVIFGTVNGVIGVIASLPHEQYLF 1008
              G  V+ L  + +  G   ++++ T++G IG++      + ++
Sbjct:   275 VGETVLSLQKTTLIPGGSESLVYTTLSGGIGILVPFTSHEVIY 317


>CGD|CAL0004251 [details] [associations]
            symbol:orf19.2760 species:5476 "Candida albicans" [GO:0042493
            "response to drug" evidence=IMP] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA]
            [GO:0005829 "cytosol" evidence=IEA] [GO:0005739 "mitochondrion"
            evidence=IEA] [GO:0006369 "termination of RNA polymerase II
            transcription" evidence=IEA] [GO:0006379 "mRNA cleavage"
            evidence=IEA] [GO:0006378 "mRNA polyadenylation" evidence=IEA]
            [GO:0003723 "RNA binding" evidence=IEA] InterPro:IPR004871
            Pfam:PF03178 CGD:CAL0004251 GO:GO:0005634 GO:GO:0042493
            GO:GO:0006397 GO:GO:0003723 EMBL:AACQ01000023 EMBL:AACQ01000025
            RefSeq:XP_720278.1 RefSeq:XP_720279.1 RefSeq:XP_720280.1
            RefSeq:XP_720510.1 STRING:Q5AFT3 GeneID:3637848 GeneID:3638158
            GeneID:3638159 GeneID:3638160 KEGG:cal:CaO19.10274
            KEGG:cal:CaO19.10275 KEGG:cal:CaO19.10276 KEGG:cal:CaO19.2760
            eggNOG:COG5161 KO:K14401 Uniprot:Q5AFT3
        Length = 1420

 Score = 86 (35.3 bits), Expect = 1.4e-07, Sum P(6) = 1.4e-07
 Identities = 31/122 (25%), Positives = 62/122 (50%)

Query:   812 KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILAL 871
             K + I ++ET+GA+ S+   +G+ L +  QK+ +    L+DDGT  +         +   
Sbjct:  1122 KFKEIFKEETRGAITSICELSGRFLVSQGQKVIVRD--LQDDGTVPVAFL---DTPVYVS 1176

Query:   872 YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEIL--DDDIY-LGAE 928
               ++ G+ +++GDL+K   L+ +  E   +    +D     +   + +  DD+I+ L A+
Sbjct:  1177 ESKSFGNLLILGDLLKGCWLVGFDAEPFRMIMLGKDTQHISVECADFIINDDEIFVLVAD 1236

Query:   929 NN 930
             NN
Sbjct:  1237 NN 1238

 Score = 86 (35.3 bits), Expect = 1.4e-07, Sum P(6) = 1.4e-07
 Identities = 35/148 (23%), Positives = 67/148 (45%)

Query:    13 PTNVTHSCVG-NFTSPQELNLIIAKCTRIEIH---------LLTPQGLQPMLD-VPIYGR 61
             P+ V ++CVG NF S  + NLI+ K + ++I          +  PQ    ++D   + G 
Sbjct:    10 PSKV-NNCVGCNFISSTKKNLIVGKGSLLQIFETIQLKQSTINKPQYRLKLIDQFKLQGT 68

Query:    62 IATLELFRP-HGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPT-DNGQ 119
             I  L+  R       D+L ++T+  KF +++WD   + + T ++      I   T +   
Sbjct:    69 ITDLKSIRTIENPNLDYLMVSTKYAKFSIIKWDHHLNTIATVSLHYYEHCIQNSTFEKLA 128

Query:   120 IG--IIDPDCRLIGLHLYDGLFKVIPFD 145
             +   I++P    +    +  L   +PF+
Sbjct:   129 VSELILEPTYNSVSCLRFKNLLCFLPFE 156

 Score = 76 (31.8 bits), Expect = 1.4e-07, Sum P(6) = 1.4e-07
 Identities = 26/98 (26%), Positives = 48/98 (48%)

Query:   163 VLDIKFLYGCAKPTIVVLY--QDNKDARHVKT-----YEVALKDKDFVE--GPWSQNNLD 213
             V+D++FL+   +PTI VL   Q+      +K+     ++V   D +       +  +NL 
Sbjct:   234 VVDMQFLHNYREPTIAVLSSKQEVWAGNLIKSKDNIQFQVLTLDLNLKSTISVFKIDNLP 293

Query:   214 NGADLLIPVPPPLCGVLIIGEETIVYC-SANAFKAIPI 250
                D +IP+P PL G L++G   +++  +    K I +
Sbjct:   294 YEIDRIIPLPSPLNGTLLVGCNELIHVDNGGVLKRIAV 331

 Score = 51 (23.0 bits), Expect = 1.4e-07, Sum P(6) = 1.4e-07
 Identities = 15/37 (40%), Positives = 19/37 (51%)

Query:  1035 NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1071
             NNE  T   K  LD DLI SF  LS  R   ++  ++
Sbjct:  1365 NNETNT---KPILDYDLIRSFTKLSDDRKRNLANKVS 1398

 Score = 50 (22.7 bits), Expect = 1.4e-07, Sum P(6) = 1.4e-07
 Identities = 10/32 (31%), Positives = 21/32 (65%)

Query:   306 ISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGS 337
             ++ LD  +++I +S G+S LI++  +  +K S
Sbjct:   414 VAILDKNMLFIANSNGNSPLIQVRYRDSSKTS 445

 Score = 46 (21.3 bits), Expect = 1.4e-07, Sum P(6) = 1.4e-07
 Identities = 17/77 (22%), Positives = 32/77 (41%)

Query:   677 NSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN-- 734
             N   F D+   A+  EL +    +   L ++ + +GE  + I + E S T  + + K   
Sbjct:   937 NGLIFLDNQQNARICELPLDFNYEFN-LPMKHVDIGESIKSIAYHETSDTVVLSTFKQIP 995

Query:   735 QSCAEESEMHFVRLLDD 751
               C +E       ++ D
Sbjct:   996 YDCLDEEGKPIAGIIKD 1012

 Score = 37 (18.1 bits), Expect = 2.2e-06, Sum P(6) = 2.2e-06
 Identities = 6/24 (25%), Positives = 13/24 (54%)

Query:   288 KVTGLKIELLGETSIASTISYLDN 311
             K+  + I    ++ I + + +LDN
Sbjct:   921 KIAAMSISAFSDSKIKNGLIFLDN 944


>UNIPROTKB|Q5AFT3 [details] [associations]
            symbol:CFT1 "Protein CFT1" species:237561 "Candida albicans
            SC5314" [GO:0042493 "response to drug" evidence=IMP]
            InterPro:IPR004871 Pfam:PF03178 CGD:CAL0004251 GO:GO:0005634
            GO:GO:0042493 GO:GO:0006397 GO:GO:0003723 EMBL:AACQ01000023
            EMBL:AACQ01000025 RefSeq:XP_720278.1 RefSeq:XP_720279.1
            RefSeq:XP_720280.1 RefSeq:XP_720510.1 STRING:Q5AFT3 GeneID:3637848
            GeneID:3638158 GeneID:3638159 GeneID:3638160 KEGG:cal:CaO19.10274
            KEGG:cal:CaO19.10275 KEGG:cal:CaO19.10276 KEGG:cal:CaO19.2760
            eggNOG:COG5161 KO:K14401 Uniprot:Q5AFT3
        Length = 1420

 Score = 86 (35.3 bits), Expect = 1.4e-07, Sum P(6) = 1.4e-07
 Identities = 31/122 (25%), Positives = 62/122 (50%)

Query:   812 KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILAL 871
             K + I ++ET+GA+ S+   +G+ L +  QK+ +    L+DDGT  +         +   
Sbjct:  1122 KFKEIFKEETRGAITSICELSGRFLVSQGQKVIVRD--LQDDGTVPVAFL---DTPVYVS 1176

Query:   872 YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEIL--DDDIY-LGAE 928
               ++ G+ +++GDL+K   L+ +  E   +    +D     +   + +  DD+I+ L A+
Sbjct:  1177 ESKSFGNLLILGDLLKGCWLVGFDAEPFRMIMLGKDTQHISVECADFIINDDEIFVLVAD 1236

Query:   929 NN 930
             NN
Sbjct:  1237 NN 1238

 Score = 86 (35.3 bits), Expect = 1.4e-07, Sum P(6) = 1.4e-07
 Identities = 35/148 (23%), Positives = 67/148 (45%)

Query:    13 PTNVTHSCVG-NFTSPQELNLIIAKCTRIEIH---------LLTPQGLQPMLD-VPIYGR 61
             P+ V ++CVG NF S  + NLI+ K + ++I          +  PQ    ++D   + G 
Sbjct:    10 PSKV-NNCVGCNFISSTKKNLIVGKGSLLQIFETIQLKQSTINKPQYRLKLIDQFKLQGT 68

Query:    62 IATLELFRP-HGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPT-DNGQ 119
             I  L+  R       D+L ++T+  KF +++WD   + + T ++      I   T +   
Sbjct:    69 ITDLKSIRTIENPNLDYLMVSTKYAKFSIIKWDHHLNTIATVSLHYYEHCIQNSTFEKLA 128

Query:   120 IG--IIDPDCRLIGLHLYDGLFKVIPFD 145
             +   I++P    +    +  L   +PF+
Sbjct:   129 VSELILEPTYNSVSCLRFKNLLCFLPFE 156

 Score = 76 (31.8 bits), Expect = 1.4e-07, Sum P(6) = 1.4e-07
 Identities = 26/98 (26%), Positives = 48/98 (48%)

Query:   163 VLDIKFLYGCAKPTIVVLY--QDNKDARHVKT-----YEVALKDKDFVE--GPWSQNNLD 213
             V+D++FL+   +PTI VL   Q+      +K+     ++V   D +       +  +NL 
Sbjct:   234 VVDMQFLHNYREPTIAVLSSKQEVWAGNLIKSKDNIQFQVLTLDLNLKSTISVFKIDNLP 293

Query:   214 NGADLLIPVPPPLCGVLIIGEETIVYC-SANAFKAIPI 250
                D +IP+P PL G L++G   +++  +    K I +
Sbjct:   294 YEIDRIIPLPSPLNGTLLVGCNELIHVDNGGVLKRIAV 331

 Score = 51 (23.0 bits), Expect = 1.4e-07, Sum P(6) = 1.4e-07
 Identities = 15/37 (40%), Positives = 19/37 (51%)

Query:  1035 NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1071
             NNE  T   K  LD DLI SF  LS  R   ++  ++
Sbjct:  1365 NNETNT---KPILDYDLIRSFTKLSDDRKRNLANKVS 1398

 Score = 50 (22.7 bits), Expect = 1.4e-07, Sum P(6) = 1.4e-07
 Identities = 10/32 (31%), Positives = 21/32 (65%)

Query:   306 ISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGS 337
             ++ LD  +++I +S G+S LI++  +  +K S
Sbjct:   414 VAILDKNMLFIANSNGNSPLIQVRYRDSSKTS 445

 Score = 46 (21.3 bits), Expect = 1.4e-07, Sum P(6) = 1.4e-07
 Identities = 17/77 (22%), Positives = 32/77 (41%)

Query:   677 NSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN-- 734
             N   F D+   A+  EL +    +   L ++ + +GE  + I + E S T  + + K   
Sbjct:   937 NGLIFLDNQQNARICELPLDFNYEFN-LPMKHVDIGESIKSIAYHETSDTVVLSTFKQIP 995

Query:   735 QSCAEESEMHFVRLLDD 751
               C +E       ++ D
Sbjct:   996 YDCLDEEGKPIAGIIKD 1012

 Score = 37 (18.1 bits), Expect = 2.2e-06, Sum P(6) = 2.2e-06
 Identities = 6/24 (25%), Positives = 13/24 (54%)

Query:   288 KVTGLKIELLGETSIASTISYLDN 311
             K+  + I    ++ I + + +LDN
Sbjct:   921 KIAAMSISAFSDSKIKNGLIFLDN 944


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.320   0.138   0.405    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0     1088      1066   0.00084  123 3  11 22  0.39    34
                                                     38  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  79
  No. of states in DFA:  627 (67 KB)
  Total size of DFA:  488 KB (2227 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  88.40u 0.09s 88.49t   Elapsed:  00:00:05
  Total cpu time:  88.44u 0.09s 88.53t   Elapsed:  00:00:05
  Start:  Tue May 21 16:14:00 2013   End:  Tue May 21 16:14:05 2013

Back to top