BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>000944
MYLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGA
IRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKIHQETFGKSGCRRIVPGQYL
AVDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSICGIDCGFDNPI
FAAIELDYSEADQDSTGQAASEAQKNLTFYELDLGLNHVSRKWSEPVDNGANMLVTVPGG
GDGPSGVLVCAENFVIYKNQGHPDVRAVIPRRADLPAERGVLIVSAATHRQKTLFFFLLQ
TEYGDIFKVTLEHDNEHVSELKIKYFDTIPVTASMCVLKSGYLFAASEFGNHALYQFQAI
GADPDVEASSSTLMETEEGFQPVFFQPRGLKNLVRIEQVESLMPIMDMRIANLFEEEAPQ
IFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDEFDAYIVVSFNNAT
LVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHIREDGRINEWRTPGKR
TIVKVGSNRLQVVIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSR
FLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFL
NAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIH
RGRFLLTPLSYETLEYAASFSSDQCVEGVVSVAGNALRVFTIERLGETFNETALPLRYTP
RRFVLQPKKKLMVIIETDQGALTAEEREAAKKECFEAAGMGENGNGNMDQMENGDDENKY
DPLSDEQYGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTL
LAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGI
GPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDENQLYI
FADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLN
GAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSH
LEMHMRQEHPPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPGE
ILKKLEEIRNKIV

High Scoring Gene Products

Symbol, full name Information P value
SAP130a
AT3G55200
protein from Arabidopsis thaliana 0.
SAP130b
AT3G55220
protein from Arabidopsis thaliana 0.
Sf3b3
splicing factor 3b, subunit 3
protein from Mus musculus 0.
SF3B3
Splicing factor 3B subunit 3
protein from Bos taurus 0.
SF3B3
Splicing factor 3B subunit 3
protein from Homo sapiens 0.
SF3B3
Uncharacterized protein
protein from Canis lupus familiaris 0.
sf3b3
splicing factor 3b, subunit 3
gene_product from Danio rerio 0.
SF3B3
Uncharacterized protein
protein from Gallus gallus 0.
CG13900 protein from Drosophila melanogaster 0.
teg-4 gene from Caenorhabditis elegans 0.
Sf3b3
splicing factor 3b, subunit 3
gene from Rattus norvegicus 1.7e-288
sf3b3
splicing factor 3B subunit 3
gene from Dictyostelium discoideum 3.1e-287
PFL1680w
splicing factor 3b, subunit 3, 130kD, putative
gene from Plasmodium falciparum 2.6e-275
PFL1680w
Splicing factor 3b, subunit 3, 130kD, putative
protein from Plasmodium falciparum 3D7 2.6e-275
orf19.5391 gene_product from Candida albicans 5.7e-142
SF3B3
Uncharacterized protein
protein from Gallus gallus 1.1e-137
LOC100512659
Uncharacterized protein
protein from Sus scrofa 3.1e-77
RSE1
Protein involved in pre-mRNA splicing
gene from Saccharomyces cerevisiae 5.5e-73
DDB1B
damaged DNA binding protein 1B
protein from Arabidopsis thaliana 2.0e-63
DDB1A
AT4G05420
protein from Arabidopsis thaliana 3.4e-61
pic
piccolo
protein from Drosophila melanogaster 3.6e-59
ddb1
damage specific DNA binding protein 1
gene_product from Danio rerio 5.5e-50
DDB1
DNA damage-binding protein 1
protein from Pongo abelii 2.7e-49
DDB1
DNA damage-binding protein 1
protein from Homo sapiens 4.4e-49
DDB1
Uncharacterized protein
protein from Canis lupus familiaris 5.6e-49
Ddb1
damage specific DNA binding protein 1
protein from Mus musculus 9.9e-49
DDB1
DNA damage-binding protein 1
protein from Bos taurus 1.6e-48
DDB1
DNA damage-binding protein 1
protein from Chlorocebus aethiops 1.8e-48
DDB1
Uncharacterized protein
protein from Sus scrofa 3.3e-48
Ddb1
damage-specific DNA binding protein 1, 127kDa
gene from Rattus norvegicus 8.4e-48
DDB1
DNA damage-binding protein 1
protein from Gallus gallus 1.1e-47
DDB1
DNA damage-binding protein 1
protein from Homo sapiens 1.3e-46
DDB1
Uncharacterized protein
protein from Canis lupus familiaris 1.5e-46
ddb1
DNA damage-binding protein 1
protein from Xenopus laevis 2.8e-46
repE
UV-damaged DNA binding protein1
gene from Dictyostelium discoideum 2.2e-45
DDB1
DNA damage-binding protein 1
protein from Gallus gallus 3.0e-44
DDB1
DNA damage-binding protein 1
protein from Gallus gallus 2.0e-43
DDB1
DNA damage-binding protein 1
protein from Gallus gallus 2.6e-43
ddb-1 gene from Caenorhabditis elegans 8.6e-39
ddb-1
DNA damage-binding protein 1
protein from Caenorhabditis elegans 8.6e-39
AT3G11960 protein from Arabidopsis thaliana 1.3e-36
DDB1
DNA damage-binding protein 1
protein from Homo sapiens 3.0e-23
Cpsf160
Cleavage and polyadenylation specificity factor 160
protein from Drosophila melanogaster 1.5e-17
CPSF160
cleavage and polyadenylation specificity factor 160
protein from Arabidopsis thaliana 8.2e-16
CPSF1
Cleavage and polyadenylation specificity factor subunit 1
protein from Homo sapiens 1.0e-15
MGG_16867
Uncharacterized protein
protein from Magnaporthe oryzae 70-15 1.4e-15
DDB1
DNA damage-binding protein 1
protein from Homo sapiens 3.1e-15
CPSF1
Cleavage and polyadenylation specificity factor subunit 1
protein from Bos taurus 4.1e-15
CPSF1
Uncharacterized protein
protein from Canis lupus familiaris 6.3e-15
cpsf1
cleavage and polyadenylation specific factor 1
gene_product from Danio rerio 1.6e-14
Cpsf1
cleavage and polyadenylation specific factor 1
protein from Mus musculus 5.7e-14
DDB1
DNA damage-binding protein 1
protein from Homo sapiens 3.2e-13
Cpsf1
cleavage and polyadenylation specific factor 1, 160kDa
gene from Rattus norvegicus 1.9e-10
DDB1
DNA damage-binding protein 1
protein from Homo sapiens 3.9e-10
cpsf1
cleavage and polyadenylation specificity factor 160 kDa subunit
gene from Dictyostelium discoideum 1.1e-09
CPSF1
Uncharacterized protein
protein from Canis lupus familiaris 3.4e-09
CPSF1
Uncharacterized protein
protein from Sus scrofa 4.0e-05
DDB1
DNA damage-binding protein 1
protein from Homo sapiens 5.7e-05
cpsf-1 gene from Caenorhabditis elegans 0.00010
cpsf-1
Probable cleavage and polyadenylation specificity factor subunit 1
protein from Caenorhabditis elegans 0.00010
DDB1
DNA damage-binding protein 1
protein from Homo sapiens 0.00011
DDB1
DNA damage-binding protein 1
protein from Homo sapiens 0.00028

The BLAST search returned 2 gene products which did not match your query constraints. Please see the full BLAST report below for the details.

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  000944
        (1213 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2100616 - symbol:SAP130a "spliceosome-associat...  5562  0.        1
TAIR|locus:2100646 - symbol:SAP130b "spliceosome-associat...  5562  0.        1
MGI|MGI:1289341 - symbol:Sf3b3 "splicing factor 3b, subun...  3758  0.        1
UNIPROTKB|A0JN52 - symbol:SF3B3 "Splicing factor 3B subun...  3757  0.        1
UNIPROTKB|Q15393 - symbol:SF3B3 "Splicing factor 3B subun...  3757  0.        1
UNIPROTKB|E2RR33 - symbol:SF3B3 "Uncharacterized protein"...  3756  0.        1
ZFIN|ZDB-GENE-040426-2901 - symbol:sf3b3 "splicing factor...  3743  0.        1
UNIPROTKB|F1P529 - symbol:SF3B3 "Uncharacterized protein"...  3730  0.        1
FB|FBgn0035162 - symbol:CG13900 species:7227 "Drosophila ...  3575  0.        1
WB|WBGene00019323 - symbol:teg-4 species:6239 "Caenorhabd...  3154  0.        1
ASPGD|ASPL0000031473 - symbol:AN5452 species:162425 "Emer...  3038  8.7e-317  1
RGD|1311636 - symbol:Sf3b3 "splicing factor 3b, subunit 3...  1836  1.7e-288  2
DICTYBASE|DDB_G0282569 - symbol:sf3b3 "splicing factor 3B...  1392  3.1e-287  2
UNIPROTKB|E9PT66 - symbol:Sf3b3 "Protein Sf3b3" species:1...  2666  2.3e-277  1
GENEDB_PFALCIPARUM|PFL1680w - symbol:PFL1680w "splicing f...  1306  2.6e-275  3
UNIPROTKB|Q8I574 - symbol:PFL1680w "Splicing factor 3b, s...  1306  2.6e-275  3
POMBASE|SPAPJ698.03c - symbol:prp12 "U2 snRNP-associated ...  1121  4.4e-191  2
CGD|CAL0004426 - symbol:orf19.5391 species:5476 "Candida ...   848  5.7e-142  2
UNIPROTKB|F1NZF7 - symbol:SF3B3 "Uncharacterized protein"...  1348  1.1e-137  1
UNIPROTKB|F1S419 - symbol:LOC100512659 "Uncharacterized p...   783  3.1e-77   1
SGD|S000004513 - symbol:RSE1 "Protein involved in pre-mRN...   525  5.5e-73   4
TAIR|locus:2127368 - symbol:DDB1B "damaged DNA binding pr...   332  2.0e-63   4
TAIR|locus:2115909 - symbol:DDB1A "damaged DNA binding pr...   342  3.4e-61   4
FB|FBgn0260962 - symbol:pic "piccolo" species:7227 "Droso...   370  3.6e-59   4
ZFIN|ZDB-GENE-040426-1272 - symbol:ddb1 "damage specific ...   363  5.5e-50   4
UNIPROTKB|Q5R649 - symbol:DDB1 "DNA damage-binding protei...   353  2.7e-49   4
UNIPROTKB|Q16531 - symbol:DDB1 "DNA damage-binding protei...   353  4.4e-49   4
UNIPROTKB|E2R9E3 - symbol:DDB1 "Uncharacterized protein" ...   352  5.6e-49   4
MGI|MGI:1202384 - symbol:Ddb1 "damage specific DNA bindin...   346  9.9e-49   4
UNIPROTKB|A1A4K3 - symbol:DDB1 "DNA damage-binding protei...   348  1.6e-48   4
UNIPROTKB|P33194 - symbol:DDB1 "DNA damage-binding protei...   353  1.8e-48   4
UNIPROTKB|F1RIE2 - symbol:DDB1 "Uncharacterized protein" ...   345  3.3e-48   4
RGD|621889 - symbol:Ddb1 "damage-specific DNA binding pro...   347  8.4e-48   4
UNIPROTKB|Q805F9 - symbol:DDB1 "DNA damage-binding protei...   353  1.1e-47   5
UNIPROTKB|F5GY55 - symbol:DDB1 "Uncharacterized protein" ...   353  1.3e-46   4
UNIPROTKB|J9NVR7 - symbol:DDB1 "Uncharacterized protein" ...   352  1.5e-46   4
UNIPROTKB|Q6P6Z0 - symbol:ddb1 "DNA damage-binding protei...   337  2.8e-46   4
DICTYBASE|DDB_G0286013 - symbol:repE "UV-damaged DNA bind...   279  2.2e-45   3
UNIPROTKB|F1P4I8 - symbol:DDB1 "DNA damage-binding protei...   353  3.0e-44   5
UNIPROTKB|F1NVV3 - symbol:DDB1 "DNA damage-binding protei...   353  2.0e-43   5
UNIPROTKB|F1NVV2 - symbol:DDB1 "DNA damage-binding protei...   353  2.6e-43   5
WB|WBGene00010890 - symbol:ddb-1 species:6239 "Caenorhabd...   239  8.6e-39   4
UNIPROTKB|Q21554 - symbol:ddb-1 "DNA damage-binding prote...   239  8.6e-39   4
TAIR|locus:2081576 - symbol:AT3G11960 species:3702 "Arabi...   209  1.3e-36   5
UNIPROTKB|F5H6C5 - symbol:DDB1 "DNA damage-binding protei...   281  3.0e-23   1
UNIPROTKB|F1M680 - symbol:Ddb1 "DNA damage-binding protei...   215  9.3e-20   2
FB|FBgn0024698 - symbol:Cpsf160 "Cleavage and polyadenyla...   116  1.5e-17   6
ASPGD|ASPL0000052925 - symbol:ddbA species:162425 "Emeric...   144  9.1e-17   3
TAIR|locus:2153122 - symbol:CPSF160 "cleavage and polyade...   174  8.2e-16   4
UNIPROTKB|Q10570 - symbol:CPSF1 "Cleavage and polyadenyla...   131  1.0e-15   6
UNIPROTKB|G4N4E2 - symbol:MGG_16867 "Uncharacterized prot...   144  1.4e-15   3
UNIPROTKB|B4DG00 - symbol:DDB1 "cDNA FLJ52436, highly sim...   204  3.1e-15   2
UNIPROTKB|Q10569 - symbol:CPSF1 "Cleavage and polyadenyla...   128  4.1e-15   6
UNIPROTKB|F1PC28 - symbol:CPSF1 "Uncharacterized protein"...   114  6.3e-15   5
ZFIN|ZDB-GENE-040709-2 - symbol:cpsf1 "cleavage and polya...   113  1.6e-14   7
MGI|MGI:2679722 - symbol:Cpsf1 "cleavage and polyadenylat...   115  5.7e-14   5
UNIPROTKB|F5H775 - symbol:DDB1 "DNA damage-binding protei...   187  3.2e-13   1
POMBASE|SPAC17H9.10c - symbol:ddb1 "damaged DNA binding p...   117  6.3e-12   5
RGD|1306406 - symbol:Cpsf1 "cleavage and polyadenylation ...    91  1.9e-10   7
UNIPROTKB|F5H0Y5 - symbol:DDB1 "DNA damage-binding protei...   158  3.9e-10   1
DICTYBASE|DDB_G0281585 - symbol:cpsf1 "cleavage and polya...    94  1.1e-09   5
UNIPROTKB|J9P418 - symbol:CPSF1 "Uncharacterized protein"...   114  3.4e-09   4
UNIPROTKB|F1RSN8 - symbol:CPSF1 "Uncharacterized protein"...   114  4.0e-05   4
UNIPROTKB|F5H581 - symbol:DDB1 "DNA damage-binding protei...   128  5.7e-05   1
WB|WBGene00022301 - symbol:cpsf-1 species:6239 "Caenorhab...   112  0.00010   4
UNIPROTKB|Q9N4C2 - symbol:cpsf-1 "Probable cleavage and p...   112  0.00010   4
UNIPROTKB|F5GZY8 - symbol:DDB1 "DNA damage-binding protei...   107  0.00011   1
UNIPROTKB|F5GYG8 - symbol:DDB1 "DNA damage-binding protei...   103  0.00028   1


>TAIR|locus:2100616 [details] [associations]
            symbol:SAP130a "spliceosome-associated protein 130 a"
            species:3702 "Arabidopsis thaliana" [GO:0003676 "nucleic acid
            binding" evidence=IEA;ISS] [GO:0005634 "nucleus"
            evidence=ISM;IEA;ISS] [GO:0009507 "chloroplast" evidence=IDA]
            [GO:0005829 "cytosol" evidence=RCA] [GO:0009555 "pollen
            development" evidence=IMP] [GO:0009846 "pollen germination"
            evidence=IMP] [GO:0048481 "ovule development" evidence=IMP]
            InterPro:IPR001680 InterPro:IPR004871 InterPro:IPR015943
            Pfam:PF03178 SMART:SM00320 GO:GO:0009506 GO:GO:0005634
            GO:GO:0009507 EMBL:CP002686 Gene3D:2.130.10.10 GO:GO:0009555
            GO:GO:0003676 EMBL:AL132954 GO:GO:0048481 GO:GO:0009846
            eggNOG:NOG247734 KO:K12830 HOGENOM:HOG000216677 OMA:FDTIPVA
            IPI:IPI00517026 PIR:T47659 RefSeq:NP_567015.1 RefSeq:NP_567016.1
            UniGene:At.28226 UniGene:At.72270 ProteinModelPortal:Q9LD60
            SMR:Q9LD60 STRING:Q9LD60 PaxDb:Q9LD60 PRIDE:Q9LD60
            EnsemblPlants:AT3G55200.1 EnsemblPlants:AT3G55220.1 GeneID:824686
            GeneID:824688 KEGG:ath:AT3G55200 KEGG:ath:AT3G55220
            KEGG:dosa:Os02t0137400-01 TAIR:At3g55200 TAIR:At3g55220
            InParanoid:Q9LD60 PhylomeDB:Q9LD60 ProtClustDB:CLSN2689171
            ArrayExpress:Q9LD60 Genevestigator:Q9LD60 Uniprot:Q9LD60
        Length = 1214

 Score = 5562 (1963.0 bits), Expect = 0., P = 0.
 Identities = 1042/1214 (85%), Positives = 1137/1214 (93%)

Query:     1 MYLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGA 60
             MYLYSLTLQQ TGI+ AINGNFSG KT EI VARGK+L+LLRP+ +G+I+T+ S E+FGA
Sbjct:     1 MYLYSLTLQQATGIVCAINGNFSGGKTQEIAVARGKILDLLRPDENGKIQTIHSVEVFGA 60

Query:    61 IRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKIHQETFGKSGCRRIVPGQYL 120
             IRSLAQFRLTG+QKDYIVVGSDSGRIVILEYN  KNVFDK+HQETFGKSGCRRIVPGQY+
Sbjct:    61 IRSLAQFRLTGAQKDYIVVGSDSGRIVILEYNKEKNVFDKVHQETFGKSGCRRIVPGQYV 120

Query:   121 AVDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSICGIDCGFDNPI 180
             AVDPKGRAVMIGACEKQKLVYVLNRDT ARLTISSPLEAHKSHTI YS+CG+DCGFDNPI
Sbjct:   121 AVDPKGRAVMIGACEKQKLVYVLNRDTTARLTISSPLEAHKSHTICYSLCGVDCGFDNPI 180

Query:   181 FAAIELDYSEADQDSTGQAASEAQKNLTFYELDLGLNHVSRKWSEPVDNGANMLVTVPGG 240
             FAAIELDYSEADQD TGQAASEAQK+LTFYELDLGLNHVSRKWS PVDNGANMLVTVPGG
Sbjct:   181 FAAIELDYSEADQDPTGQAASEAQKHLTFYELDLGLNHVSRKWSNPVDNGANMLVTVPGG 240

Query:   241 GDGPSGVLVCAENFVIYKNQGHPDVRAVIPRRADLPAERGVLIVSAATHRQKTLFFFLLQ 300
              DGPSGVLVCAENFVIY NQGHPDVRAVIPRR DLPAERGVL+VSAA H+QKT+FFFL+Q
Sbjct:   241 ADGPSGVLVCAENFVIYMNQGHPDVRAVIPRRTDLPAERGVLVVSAAVHKQKTMFFFLIQ 300

Query:   301 TEYGDIFKVTLEHDNEHVSELKIKYFDTIPVTASMCVLKSGYLFAASEFGNHALYQFQAI 360
             TEYGD+FKVTL+H+ +HVSELK+KYFDTIPV +S+CVLK G+LF+ASEFGNH LYQFQAI
Sbjct:   301 TEYGDVFKVTLDHNGDHVSELKVKYFDTIPVASSICVLKLGFLFSASEFGNHGLYQFQAI 360

Query:   361 GADPDVEASSSTLMETEEGFQPVFFQPRGLKNLVRIEQVESLMPIMDMRIANLFEEEAPQ 420
             G +PDVE+SSS LMETEEGFQPVFFQPR LKNLVRI+QVESLMP+MDM++ N+FEEE PQ
Sbjct:   361 GEEPDVESSSSNLMETEEGFQPVFFQPRRLKNLVRIDQVESLMPLMDMKVLNIFEEETPQ 420

Query:   421 IFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDEFDAYIVVSFNNAT 480
             IF+LCGRGPRSSLRILRPGLA++EMAVSQLPG PSAVWTVKKNV+DEFDAYIVVSF NAT
Sbjct:   421 IFSLCGRGPRSSLRILRPGLAITEMAVSQLPGQPSAVWTVKKNVSDEFDAYIVVSFTNAT 480

Query:   481 LVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHIREDGRINEWRTPGKR 540
             LVLSIGE VEEV+DSGFLDTTPSLAVSLIGDDSLMQVHP+GIRHIREDGRINEWRTPGKR
Sbjct:   481 LVLSIGEQVEEVNDSGFLDTTPSLAVSLIGDDSLMQVHPNGIRHIREDGRINEWRTPGKR 540

Query:   541 TIVKVGSNRLQVVIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSR 600
             +IVKVG NRLQVVIALSGGELIYFE DMTGQL+EVEKHEMSGDVACLDIA VPEGRKRSR
Sbjct:   541 SIVKVGYNRLQVVIALSGGELIYFEADMTGQLMEVEKHEMSGDVACLDIAPVPEGRKRSR 600

Query:   601 FLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFL 660
             FLAVGSYDNT+RILSLDPDDC+QILSVQSVSS PESLLFLEVQAS+GG+DGADHPA+LFL
Sbjct:   601 FLAVGSYDNTVRILSLDPDDCLQILSVQSVSSAPESLLFLEVQASIGGDDGADHPANLFL 660

Query:   661 NAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIH 720
             N+GLQNGVLFRTVVDMVTGQLSDSRSRFLGL+PPKLFS+ V GR+AMLCLSSRPWLGYIH
Sbjct:   661 NSGLQNGVLFRTVVDMVTGQLSDSRSRFLGLKPPKLFSISVRGRSAMLCLSSRPWLGYIH 720

Query:   721 RGRFLLTPLSYETLEYAASFSSDQCVEGVVSVAGNALRVFTIERLGETFNETALPLRYTP 780
             RG F LTPLSYETLE+AA FSSDQC EGVVSVAG+ALR+F I+RLGETFNET +PLRYTP
Sbjct:   721 RGHFHLTPLSYETLEFAAPFSSDQCAEGVVSVAGDALRIFMIDRLGETFNETVVPLRYTP 780

Query:   781 RRFVLQPKKKLMVIIETDQGALTAEEREAAKKECFXXXXXXXXXXXXXXXXXXXXX-ENK 839
             R+FVL PK+KL+VIIE+DQGA TAEEREAA+KECF                      E+K
Sbjct:   781 RKFVLHPKRKLLVIIESDQGAFTAEEREAARKECFEAGGVGENGNGNADQMENGADDEDK 840

Query:   840 YDPLSDEQYGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGT 899
              DPLSDEQYGYPKAES+KWVSCIRVLDP++A TTCLLELQDNEAA+S+CTVNFHDKE+GT
Sbjct:   841 EDPLSDEQYGYPKAESEKWVSCIRVLDPKTATTTCLLELQDNEAAYSVCTVNFHDKEYGT 900

Query:   900 LLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAG 959
             LLAVGT KG+QFWPK+N+VAG+IHIYRFVE+GKSLELLHKTQVEG+PLALCQFQGRLLAG
Sbjct:   901 LLAVGTVKGMQFWPKKNLVAGFIHIYRFVEDGKSLELLHKTQVEGVPLALCQFQGRLLAG 960

Query:   960 IGPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDENQLY 1019
             IGPVLRLYDLGKKRLLRKCENKLFPNTI+SI TYRDRIYVGDIQESFH+CKYRRDENQLY
Sbjct:   961 IGPVLRLYDLGKKRLLRKCENKLFPNTIISIQTYRDRIYVGDIQESFHYCKYRRDENQLY 1020

Query:  1020 IFADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKL 1079
             IFADD VPRWLTA+HH+DFDTMAGADKFGN+YFVRLPQD+S+EIEEDPTGGKIKWEQGKL
Sbjct:  1021 IFADDCVPRWLTASHHVDFDTMAGADKFGNVYFVRLPQDLSEEIEEDPTGGKIKWEQGKL 1080

Query:  1080 NGAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFS 1139
             NGAPNK++EIVQFHVGDVVT LQKAS++PGG ES++YGTVMGS+GA+ AF+SRDDVDFFS
Sbjct:  1081 NGAPNKVDEIVQFHVGDVVTCLQKASMIPGGSESIMYGTVMGSIGALHAFTSRDDVDFFS 1140

Query:  1140 HLEMHMRQEHPPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPG 1199
             HLEMHMRQE+PPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTL +DLQRKIADELDRTP 
Sbjct:  1141 HLEMHMRQEYPPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTLPMDLQRKIADELDRTPA 1200

Query:  1200 EILKKLEEIRNKIV 1213
             EILKKLE+ RNKI+
Sbjct:  1201 EILKKLEDARNKII 1214


>TAIR|locus:2100646 [details] [associations]
            symbol:SAP130b "spliceosome-associated protein 130 b"
            species:3702 "Arabidopsis thaliana" [GO:0003676 "nucleic acid
            binding" evidence=IEA;ISS] [GO:0005634 "nucleus"
            evidence=ISM;IEA;ISS] [GO:0005829 "cytosol" evidence=RCA]
            [GO:0009506 "plasmodesma" evidence=IDA] [GO:0009555 "pollen
            development" evidence=IMP] [GO:0009846 "pollen germination"
            evidence=IMP] [GO:0048481 "ovule development" evidence=IMP]
            InterPro:IPR001680 InterPro:IPR004871 InterPro:IPR015943
            Pfam:PF03178 SMART:SM00320 GO:GO:0009506 GO:GO:0005634
            GO:GO:0009507 EMBL:CP002686 Gene3D:2.130.10.10 GO:GO:0009555
            GO:GO:0003676 EMBL:AL132954 GO:GO:0048481 GO:GO:0009846
            eggNOG:NOG247734 KO:K12830 HOGENOM:HOG000216677 OMA:FDTIPVA
            IPI:IPI00517026 PIR:T47659 RefSeq:NP_567015.1 RefSeq:NP_567016.1
            UniGene:At.28226 UniGene:At.72270 ProteinModelPortal:Q9LD60
            SMR:Q9LD60 STRING:Q9LD60 PaxDb:Q9LD60 PRIDE:Q9LD60
            EnsemblPlants:AT3G55200.1 EnsemblPlants:AT3G55220.1 GeneID:824686
            GeneID:824688 KEGG:ath:AT3G55200 KEGG:ath:AT3G55220
            KEGG:dosa:Os02t0137400-01 TAIR:At3g55200 TAIR:At3g55220
            InParanoid:Q9LD60 PhylomeDB:Q9LD60 ProtClustDB:CLSN2689171
            ArrayExpress:Q9LD60 Genevestigator:Q9LD60 Uniprot:Q9LD60
        Length = 1214

 Score = 5562 (1963.0 bits), Expect = 0., P = 0.
 Identities = 1042/1214 (85%), Positives = 1137/1214 (93%)

Query:     1 MYLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGA 60
             MYLYSLTLQQ TGI+ AINGNFSG KT EI VARGK+L+LLRP+ +G+I+T+ S E+FGA
Sbjct:     1 MYLYSLTLQQATGIVCAINGNFSGGKTQEIAVARGKILDLLRPDENGKIQTIHSVEVFGA 60

Query:    61 IRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKIHQETFGKSGCRRIVPGQYL 120
             IRSLAQFRLTG+QKDYIVVGSDSGRIVILEYN  KNVFDK+HQETFGKSGCRRIVPGQY+
Sbjct:    61 IRSLAQFRLTGAQKDYIVVGSDSGRIVILEYNKEKNVFDKVHQETFGKSGCRRIVPGQYV 120

Query:   121 AVDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSICGIDCGFDNPI 180
             AVDPKGRAVMIGACEKQKLVYVLNRDT ARLTISSPLEAHKSHTI YS+CG+DCGFDNPI
Sbjct:   121 AVDPKGRAVMIGACEKQKLVYVLNRDTTARLTISSPLEAHKSHTICYSLCGVDCGFDNPI 180

Query:   181 FAAIELDYSEADQDSTGQAASEAQKNLTFYELDLGLNHVSRKWSEPVDNGANMLVTVPGG 240
             FAAIELDYSEADQD TGQAASEAQK+LTFYELDLGLNHVSRKWS PVDNGANMLVTVPGG
Sbjct:   181 FAAIELDYSEADQDPTGQAASEAQKHLTFYELDLGLNHVSRKWSNPVDNGANMLVTVPGG 240

Query:   241 GDGPSGVLVCAENFVIYKNQGHPDVRAVIPRRADLPAERGVLIVSAATHRQKTLFFFLLQ 300
              DGPSGVLVCAENFVIY NQGHPDVRAVIPRR DLPAERGVL+VSAA H+QKT+FFFL+Q
Sbjct:   241 ADGPSGVLVCAENFVIYMNQGHPDVRAVIPRRTDLPAERGVLVVSAAVHKQKTMFFFLIQ 300

Query:   301 TEYGDIFKVTLEHDNEHVSELKIKYFDTIPVTASMCVLKSGYLFAASEFGNHALYQFQAI 360
             TEYGD+FKVTL+H+ +HVSELK+KYFDTIPV +S+CVLK G+LF+ASEFGNH LYQFQAI
Sbjct:   301 TEYGDVFKVTLDHNGDHVSELKVKYFDTIPVASSICVLKLGFLFSASEFGNHGLYQFQAI 360

Query:   361 GADPDVEASSSTLMETEEGFQPVFFQPRGLKNLVRIEQVESLMPIMDMRIANLFEEEAPQ 420
             G +PDVE+SSS LMETEEGFQPVFFQPR LKNLVRI+QVESLMP+MDM++ N+FEEE PQ
Sbjct:   361 GEEPDVESSSSNLMETEEGFQPVFFQPRRLKNLVRIDQVESLMPLMDMKVLNIFEEETPQ 420

Query:   421 IFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDEFDAYIVVSFNNAT 480
             IF+LCGRGPRSSLRILRPGLA++EMAVSQLPG PSAVWTVKKNV+DEFDAYIVVSF NAT
Sbjct:   421 IFSLCGRGPRSSLRILRPGLAITEMAVSQLPGQPSAVWTVKKNVSDEFDAYIVVSFTNAT 480

Query:   481 LVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHIREDGRINEWRTPGKR 540
             LVLSIGE VEEV+DSGFLDTTPSLAVSLIGDDSLMQVHP+GIRHIREDGRINEWRTPGKR
Sbjct:   481 LVLSIGEQVEEVNDSGFLDTTPSLAVSLIGDDSLMQVHPNGIRHIREDGRINEWRTPGKR 540

Query:   541 TIVKVGSNRLQVVIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSR 600
             +IVKVG NRLQVVIALSGGELIYFE DMTGQL+EVEKHEMSGDVACLDIA VPEGRKRSR
Sbjct:   541 SIVKVGYNRLQVVIALSGGELIYFEADMTGQLMEVEKHEMSGDVACLDIAPVPEGRKRSR 600

Query:   601 FLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFL 660
             FLAVGSYDNT+RILSLDPDDC+QILSVQSVSS PESLLFLEVQAS+GG+DGADHPA+LFL
Sbjct:   601 FLAVGSYDNTVRILSLDPDDCLQILSVQSVSSAPESLLFLEVQASIGGDDGADHPANLFL 660

Query:   661 NAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIH 720
             N+GLQNGVLFRTVVDMVTGQLSDSRSRFLGL+PPKLFS+ V GR+AMLCLSSRPWLGYIH
Sbjct:   661 NSGLQNGVLFRTVVDMVTGQLSDSRSRFLGLKPPKLFSISVRGRSAMLCLSSRPWLGYIH 720

Query:   721 RGRFLLTPLSYETLEYAASFSSDQCVEGVVSVAGNALRVFTIERLGETFNETALPLRYTP 780
             RG F LTPLSYETLE+AA FSSDQC EGVVSVAG+ALR+F I+RLGETFNET +PLRYTP
Sbjct:   721 RGHFHLTPLSYETLEFAAPFSSDQCAEGVVSVAGDALRIFMIDRLGETFNETVVPLRYTP 780

Query:   781 RRFVLQPKKKLMVIIETDQGALTAEEREAAKKECFXXXXXXXXXXXXXXXXXXXXX-ENK 839
             R+FVL PK+KL+VIIE+DQGA TAEEREAA+KECF                      E+K
Sbjct:   781 RKFVLHPKRKLLVIIESDQGAFTAEEREAARKECFEAGGVGENGNGNADQMENGADDEDK 840

Query:   840 YDPLSDEQYGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGT 899
              DPLSDEQYGYPKAES+KWVSCIRVLDP++A TTCLLELQDNEAA+S+CTVNFHDKE+GT
Sbjct:   841 EDPLSDEQYGYPKAESEKWVSCIRVLDPKTATTTCLLELQDNEAAYSVCTVNFHDKEYGT 900

Query:   900 LLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAG 959
             LLAVGT KG+QFWPK+N+VAG+IHIYRFVE+GKSLELLHKTQVEG+PLALCQFQGRLLAG
Sbjct:   901 LLAVGTVKGMQFWPKKNLVAGFIHIYRFVEDGKSLELLHKTQVEGVPLALCQFQGRLLAG 960

Query:   960 IGPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDENQLY 1019
             IGPVLRLYDLGKKRLLRKCENKLFPNTI+SI TYRDRIYVGDIQESFH+CKYRRDENQLY
Sbjct:   961 IGPVLRLYDLGKKRLLRKCENKLFPNTIISIQTYRDRIYVGDIQESFHYCKYRRDENQLY 1020

Query:  1020 IFADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKL 1079
             IFADD VPRWLTA+HH+DFDTMAGADKFGN+YFVRLPQD+S+EIEEDPTGGKIKWEQGKL
Sbjct:  1021 IFADDCVPRWLTASHHVDFDTMAGADKFGNVYFVRLPQDLSEEIEEDPTGGKIKWEQGKL 1080

Query:  1080 NGAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFS 1139
             NGAPNK++EIVQFHVGDVVT LQKAS++PGG ES++YGTVMGS+GA+ AF+SRDDVDFFS
Sbjct:  1081 NGAPNKVDEIVQFHVGDVVTCLQKASMIPGGSESIMYGTVMGSIGALHAFTSRDDVDFFS 1140

Query:  1140 HLEMHMRQEHPPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPG 1199
             HLEMHMRQE+PPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTL +DLQRKIADELDRTP 
Sbjct:  1141 HLEMHMRQEYPPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTLPMDLQRKIADELDRTPA 1200

Query:  1200 EILKKLEEIRNKIV 1213
             EILKKLE+ RNKI+
Sbjct:  1201 EILKKLEDARNKII 1214


>MGI|MGI:1289341 [details] [associations]
            symbol:Sf3b3 "splicing factor 3b, subunit 3" species:10090
            "Mus musculus" [GO:0003674 "molecular_function" evidence=ND]
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0005681 "spliceosomal complex"
            evidence=IEA] [GO:0005689 "U12-type spliceosomal complex"
            evidence=ISO] [GO:0006397 "mRNA processing" evidence=IEA]
            [GO:0008380 "RNA splicing" evidence=IEA] [GO:0071013 "catalytic
            step 2 spliceosome" evidence=ISO] InterPro:IPR004871 Pfam:PF03178
            MGI:MGI:1289341 GO:GO:0008380 GO:GO:0006397 GO:GO:0003676
            GO:GO:0071013 eggNOG:NOG247734 GeneTree:ENSGT00530000063396
            HSSP:Q16531 GO:GO:0005689 KO:K12830 HOGENOM:HOG000216677
            OMA:FDTIPVA CTD:23450 HOVERGEN:HBG093942 OrthoDB:EOG4RV2QJ
            EMBL:AK085705 EMBL:AK088268 EMBL:AK129035 EMBL:AK147914
            EMBL:BC011412 EMBL:BC031197 EMBL:BC042580 IPI:IPI00122011
            IPI:IPI00625759 RefSeq:NP_598714.1 UniGene:Mm.236123
            ProteinModelPortal:Q921M3 IntAct:Q921M3 STRING:Q921M3
            PhosphoSite:Q921M3 PaxDb:Q921M3 PRIDE:Q921M3
            Ensembl:ENSMUST00000042012 GeneID:101943 KEGG:mmu:101943
            UCSC:uc009nlc.1 InParanoid:Q921M3 NextBio:355190 Bgee:Q921M3
            CleanEx:MM_SF3B3 Genevestigator:Q921M3 Uniprot:Q921M3
        Length = 1217

 Score = 3758 (1327.9 bits), Expect = 0., P = 0.
 Identities = 713/1218 (58%), Positives = 927/1218 (76%)

Query:     1 MYLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPE-NSGRIETLVSTEIFG 59
             M+LY+LTLQ+ TGI  AI+GNFSGTK  EIVV+RGK+LELLRP+ N+G++ TL++ E+FG
Sbjct:     1 MFLYNLTLQRATGISFAIHGNFSGTKQQEIVVSRGKILELLRPDPNTGKVHTLLTVEVFG 60

Query:    60 AIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKIHQETFGKSGCRRIVPGQY 119
              IRSL  FRLTG  KDYIVVGSDSGRIVILEY PSKN+F+KIHQETFGKSGCRRIVPGQ+
Sbjct:    61 VIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQPSKNMFEKIHQETFGKSGCRRIVPGQF 120

Query:   120 LAVDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSICGIDCGFDNP 179
             LAVDPKGRAVMI A EKQKLVY+LNRD AARLTISSPLEAHK++T+VY + G+D GF+NP
Sbjct:   121 LAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVVGVDVGFENP 180

Query:   180 IFAAIELDYSEADQDSTGQAASEAQKNLTFYELDLGLNHVSRKWSEPVDNGANMLVTVPG 239
             +FA +E+DY EAD D TG+AA+  Q+ LTFYELDLGLNHV RK+SEP++   N L+TVPG
Sbjct:   181 MFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEHGNFLITVPG 240

Query:   240 GGDGPSGVLVCAENFVIYKNQG-HPDVRAVIPRRA-DLP-AERGVLIVSAATHRQKTLFF 296
             G DGPSGVL+C+EN++ YKN G  PD+R  IPRR  DL   ERG++ V +ATH+ K++FF
Sbjct:   241 GSDGPSGVLICSENYITYKNFGDQPDIRCPIPRRRNDLDDPERGMIFVCSATHKTKSMFF 300

Query:   297 FLLQTEYGDIFKVTLEHDNEHVSELKIKYFDTIPVTASMCVLKSGYLFAASEFGNHALYQ 356
             FL QTE GDIFK+TLE D + V+E+++KYFDT+PV A+MCVLK+G+LF ASEFGNH LYQ
Sbjct:   301 FLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGFLFVASEFGNHYLYQ 360

Query:   357 FQAIGADPDVEASSSTLMETEEGFQPVFFQPRGLKNLVRIEQVESLMPIMDMRIANLFEE 416
                +G D D E   S+ M  EEG    FFQPR LKNLV +++++SL PI+  +IA+L  E
Sbjct:   361 IAHLG-DDDEEPEFSSAMPLEEG-DTFFFQPRPLKNLVLVDELDSLSPILFCQIADLANE 418

Query:   417 EAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDEFDAYIVVSF 476
             + PQ++  CGRGPRSSLR+LR GL VSEMAVS+LPG P+AVWTV++++ DEFDAYI+VSF
Sbjct:   419 DTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRHIEDEFDAYIIVSF 478

Query:   477 NNATLVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHIREDGRINEWRT 536
              NATLVLSIGETVEEV+DSGFL TTP+L+ SL+GDD+L+QV+P GIRHIR D R+NEW+T
Sbjct:   479 VNATLVLSIGETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDGIRHIRADKRVNEWKT 538

Query:   537 PGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQLLE-VEKHEMSGDVACLDIASVPEG 595
             PGK+TIVK   N+ QVVIAL+GGEL+YFE+D +GQL E  E+ EMS DV C+ +A+VP G
Sbjct:   539 PGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERKEMSADVVCMSLANVPPG 598

Query:   596 RKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHP 655
              +RSRFLAVG  DNT+RI+SLDP DC+Q LS+Q++ + PESL  +E+  +   ++  +  
Sbjct:   599 EQRSRFLAVGLVDNTVRIISLDPSDCLQPLSMQALPAQPESLCIVEMGGTEKQDELGERG 658

Query:   656 AS--LFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCLSSR 713
             +   L+LN GLQNGVL RTV+D VTG LSD+R+R+LG RP KLF V + G+ A+L +SSR
Sbjct:   659 SIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRPVKLFRVRMQGQEAVLAMSSR 718

Query:   714 PWLGYIHRGRFLLTPLSYETLEYAASFSSDQCVEGVVSVAGNALRVFTIERLGETFNETA 773
              WL Y ++ RF LTPLSYETLE+A+ F+S+QC EG+V+++ N LR+  +E+LG  FN+ A
Sbjct:   719 SWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAISTNTLRILALEKLGAVFNQVA 778

Query:   774 LPLRYTPRRFVLQPKKKLMVIIETDQGALTAEEREAAKKECFXXXXXXXXXXXXXXXXXX 833
              PL+YTPR+FV+ P+   ++IIETD  A T E  +A +K+                    
Sbjct:   779 FPLQYTPRKFVIHPESNNLIIIETDHNAYT-EATKAQRKQQMAEEMVEAAGEDERELAAE 837

Query:   834 XXXENKYDPLSDEQYGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFH 893
                    + L +  +G PKA + +W S IRV++P   NT  L++L+ NEAAFS+    F 
Sbjct:   838 MAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLDLVQLEQNEAAFSVAVCRFS 897

Query:   894 DKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQ 953
             +      + VG AK L   P R++  G+++ Y+ V  G+ LE LHKT VE +P A+  FQ
Sbjct:   898 NTGEDWYVLVGVAKDLILSP-RSVAGGFVYTYKLVNNGEKLEFLHKTPVEEVPAAIAPFQ 956

Query:   954 GRLLAGIGPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRR 1013
             GR+L G+G +LR+YDLGKK+LLRKCENK   N I  I T   R+ V D+QESF + +Y+R
Sbjct:   957 GRVLIGVGKLLRVYDLGKKKLLRKCENKHIANYISGIQTIGHRVIVSDVQESFIWVRYKR 1016

Query:  1014 DENQLYIFADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIK 1073
             +ENQL IFADD+ PRW+T A  +D+DT+AGADKFGNI  VRLP + +DE++EDPTG K  
Sbjct:  1017 NENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKAL 1076

Query:  1074 WEQGKLNGAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRD 1133
             W++G LNGA  K E I+ +HVG+ V SLQK +L+PGG ES++Y T+ G +G ++ F+S +
Sbjct:  1077 WDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLIPGGSESLVYTTLSGGIGILVPFTSHE 1136

Query:  1134 DVDFFSHLEMHMRQEHPPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTLSLDLQRKIADE 1193
             D DFF H+EMH+R EHPPLCGRDH+++RS YFPVK+VIDGDLCEQF ++  + Q+ +++E
Sbjct:  1137 DHDFFQHVEMHLRSEHPPLCGRDHLSFRSYYFPVKNVIDGDLCEQFNSMEPNKQKNVSEE 1196

Query:  1194 LDRTPGEILKKLEEIRNK 1211
             LDRTP E+ KKLE+IR +
Sbjct:  1197 LDRTPPEVSKKLEDIRTR 1214


>UNIPROTKB|A0JN52 [details] [associations]
            symbol:SF3B3 "Splicing factor 3B subunit 3" species:9913
            "Bos taurus" [GO:0071013 "catalytic step 2 spliceosome"
            evidence=IEA] [GO:0005689 "U12-type spliceosomal complex"
            evidence=IEA] [GO:0008380 "RNA splicing" evidence=IEA] [GO:0006397
            "mRNA processing" evidence=IEA] [GO:0003676 "nucleic acid binding"
            evidence=IEA] InterPro:IPR004871 Pfam:PF03178 GO:GO:0008380
            GO:GO:0006397 GO:GO:0003676 GO:GO:0071013 eggNOG:NOG247734
            GeneTree:ENSGT00530000063396 GO:GO:0005689 KO:K12830
            HOGENOM:HOG000216677 OMA:FDTIPVA EMBL:BC126518 IPI:IPI00690059
            RefSeq:NP_001071319.1 UniGene:Bt.7895 ProteinModelPortal:A0JN52
            STRING:A0JN52 PRIDE:A0JN52 Ensembl:ENSBTAT00000014050 GeneID:504962
            KEGG:bta:504962 CTD:23450 HOVERGEN:HBG093942 InParanoid:A0JN52
            OrthoDB:EOG4RV2QJ BioCyc:CATTLE:504962-MONOMER BindingDB:A0JN52
            NextBio:20866909 ArrayExpress:A0JN52 Uniprot:A0JN52
        Length = 1217

 Score = 3757 (1327.6 bits), Expect = 0., P = 0.
 Identities = 713/1218 (58%), Positives = 927/1218 (76%)

Query:     1 MYLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPE-NSGRIETLVSTEIFG 59
             M+LY+LTLQ+ TGI  AI+GNFSGTK  EIVV+RGK+LELLRP+ N+G++ TL++ E+FG
Sbjct:     1 MFLYNLTLQRATGISFAIHGNFSGTKQQEIVVSRGKILELLRPDPNTGKVHTLLTVEVFG 60

Query:    60 AIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKIHQETFGKSGCRRIVPGQY 119
              IRSL  FRLTG  KDYIVVGSDSGRIVILEY PSKN+F+KIHQETFGKSGCRRIVPGQ+
Sbjct:    61 VIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQPSKNMFEKIHQETFGKSGCRRIVPGQF 120

Query:   120 LAVDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSICGIDCGFDNP 179
             LAVDPKGRAVMI A EKQKLVY+LNRD AARLTISSPLEAHK++T+VY + G+D GF+NP
Sbjct:   121 LAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVVGVDVGFENP 180

Query:   180 IFAAIELDYSEADQDSTGQAASEAQKNLTFYELDLGLNHVSRKWSEPVDNGANMLVTVPG 239
             +FA +E+DY EAD D TG+AA+  Q+ LTFYELDLGLNHV RK+SEP++   N L+TVPG
Sbjct:   181 MFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEHGNFLITVPG 240

Query:   240 GGDGPSGVLVCAENFVIYKNQG-HPDVRAVIPRRA-DLP-AERGVLIVSAATHRQKTLFF 296
             G DGPSGVL+C+EN++ YKN G  PD+R  IPRR  DL   ERG++ V +ATH+ K++FF
Sbjct:   241 GSDGPSGVLICSENYITYKNFGDQPDIRCPIPRRRNDLDDPERGMIFVCSATHKTKSMFF 300

Query:   297 FLLQTEYGDIFKVTLEHDNEHVSELKIKYFDTIPVTASMCVLKSGYLFAASEFGNHALYQ 356
             FL QTE GDIFK+TLE D + V+E+++KYFDT+PV A+MCVLK+G+LF ASEFGNH LYQ
Sbjct:   301 FLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGFLFVASEFGNHYLYQ 360

Query:   357 FQAIGADPDVEASSSTLMETEEGFQPVFFQPRGLKNLVRIEQVESLMPIMDMRIANLFEE 416
                +G D D E   S+ M  EEG    FFQPR LKNLV +++++SL PI+  +IA+L  E
Sbjct:   361 IAHLG-DDDEEPEFSSAMPLEEG-DTFFFQPRPLKNLVLVDELDSLSPILFCQIADLANE 418

Query:   417 EAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDEFDAYIVVSF 476
             + PQ++  CGRGPRSSLR+LR GL VSEMAVS+LPG P+AVWTV++++ DEFDAYI+VSF
Sbjct:   419 DTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRHIEDEFDAYIIVSF 478

Query:   477 NNATLVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHIREDGRINEWRT 536
              NATLVLSIGETVEEV+DSGFL TTP+L+ SL+GDD+L+QV+P GIRHIR D R+NEW+T
Sbjct:   479 VNATLVLSIGETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDGIRHIRADKRVNEWKT 538

Query:   537 PGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQLLE-VEKHEMSGDVACLDIASVPEG 595
             PGK+TIVK   N+ QVVIAL+GGEL+YFE+D +GQL E  E+ EMS DV C+ +A+VP G
Sbjct:   539 PGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERKEMSADVVCMSLANVPPG 598

Query:   596 RKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHP 655
              +RSRFLAVG  DNT+RI+SLDP DC+Q LS+Q++ + PESL  +E+  +   ++  +  
Sbjct:   599 EQRSRFLAVGLVDNTVRIISLDPSDCLQPLSMQALPAQPESLCIVEMGGTEKQDELGERG 658

Query:   656 AS--LFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCLSSR 713
             +   L+LN GLQNGVL RTV+D VTG LSD+R+R+LG RP KLF V + G+ A+L +SSR
Sbjct:   659 SIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRPVKLFRVRMQGQEAVLAMSSR 718

Query:   714 PWLGYIHRGRFLLTPLSYETLEYAASFSSDQCVEGVVSVAGNALRVFTIERLGETFNETA 773
              WL Y ++ RF LTPLSYETLE+A+ F+S+QC EG+V+++ N LR+  +E+LG  FN+ A
Sbjct:   719 SWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAISTNTLRILALEKLGAVFNQVA 778

Query:   774 LPLRYTPRRFVLQPKKKLMVIIETDQGALTAEEREAAKKECFXXXXXXXXXXXXXXXXXX 833
              PL+YTPR+FV+ P+   ++IIETD  A T E  +A +K+                    
Sbjct:   779 FPLQYTPRKFVIHPESNNLIIIETDHNAYT-EATKAQRKQQMAEEMVEAAGEDERELAAE 837

Query:   834 XXXENKYDPLSDEQYGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFH 893
                    + L +  +G PKA + +W S IRV++P   NT  L++L+ NEAAFS+    F 
Sbjct:   838 MAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLDLVQLEQNEAAFSVAVCRFS 897

Query:   894 DKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQ 953
             +      + VG AK L   P R++  G+++ Y+ V  G+ LE LHKT VE +P A+  FQ
Sbjct:   898 NTGEDWYVLVGVAKDLILNP-RSVAGGFVYTYKLVNNGEKLEFLHKTPVEEVPAAIAPFQ 956

Query:   954 GRLLAGIGPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRR 1013
             GR+L G+G +LR+YDLGKK+LLRKCENK   N I  I T   R+ V D+QESF + +Y+R
Sbjct:   957 GRVLIGVGKLLRVYDLGKKKLLRKCENKHIANYISGIQTIGHRVIVSDVQESFIWVRYKR 1016

Query:  1014 DENQLYIFADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIK 1073
             +ENQL IFADD+ PRW+T A  +D+DT+AGADKFGNI  VRLP + +DE++EDPTG K  
Sbjct:  1017 NENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKAL 1076

Query:  1074 WEQGKLNGAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRD 1133
             W++G LNGA  K E I+ +HVG+ V SLQK +L+PGG ES++Y T+ G +G ++ F+S +
Sbjct:  1077 WDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLIPGGSESLVYTTLSGGIGILVPFTSHE 1136

Query:  1134 DVDFFSHLEMHMRQEHPPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTLSLDLQRKIADE 1193
             D DFF H+EMH+R EHPPLCGRDH+++RS YFPVK+VIDGDLCEQF ++  + Q+ +++E
Sbjct:  1137 DHDFFQHVEMHLRSEHPPLCGRDHLSFRSYYFPVKNVIDGDLCEQFNSMEPNKQKNVSEE 1196

Query:  1194 LDRTPGEILKKLEEIRNK 1211
             LDRTP E+ KKLE+IR +
Sbjct:  1197 LDRTPPEVSKKLEDIRTR 1214


>UNIPROTKB|Q15393 [details] [associations]
            symbol:SF3B3 "Splicing factor 3B subunit 3" species:9606
            "Homo sapiens" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0000375 "RNA splicing, via transesterification reactions"
            evidence=TAS] [GO:0000398 "mRNA splicing, via spliceosome"
            evidence=IC;TAS] [GO:0071013 "catalytic step 2 spliceosome"
            evidence=IDA] [GO:0005689 "U12-type spliceosomal complex"
            evidence=IDA] [GO:0030532 "small nuclear ribonucleoprotein complex"
            evidence=TAS] [GO:0005681 "spliceosomal complex" evidence=TAS]
            [GO:0006397 "mRNA processing" evidence=TAS] [GO:0006461 "protein
            complex assembly" evidence=TAS] [GO:0005654 "nucleoplasm"
            evidence=TAS] [GO:0008380 "RNA splicing" evidence=TAS] [GO:0010467
            "gene expression" evidence=TAS] Reactome:REACT_71
            InterPro:IPR004871 Pfam:PF03178 GO:GO:0005654 GO:GO:0006461
            Reactome:REACT_1675 GO:GO:0003676 GO:GO:0000398 GO:GO:0071013
            GO:GO:0030532 eggNOG:NOG247734 GO:GO:0005689 KO:K12830
            HOGENOM:HOG000216677 OMA:FDTIPVA CTD:23450 HOVERGEN:HBG093942
            OrthoDB:EOG4RV2QJ EMBL:AJ001443 EMBL:D87686 EMBL:D13642
            EMBL:BC000463 EMBL:BC003146 EMBL:BC009780 EMBL:BC068974
            EMBL:AL110251 IPI:IPI00179138 IPI:IPI00300371 IPI:IPI00828110
            PIR:T14779 RefSeq:NP_036558.3 UniGene:Hs.514435
            ProteinModelPortal:Q15393 DIP:DIP-28152N IntAct:Q15393
            MINT:MINT-1402891 STRING:Q15393 PhosphoSite:Q15393 DMDM:116242787
            PaxDb:Q15393 PeptideAtlas:Q15393 PRIDE:Q15393
            Ensembl:ENST00000302516 GeneID:23450 KEGG:hsa:23450 UCSC:uc002ezf.3
            GeneCards:GC16P070557 HGNC:HGNC:10770 HPA:HPA042986 MIM:605592
            neXtProt:NX_Q15393 PharmGKB:PA35688 InParanoid:Q15393
            PhylomeDB:Q15393 BindingDB:Q15393 ChEMBL:CHEMBL1250378
            GenomeRNAi:23450 NextBio:45731 ArrayExpress:Q15393 Bgee:Q15393
            CleanEx:HS_SAP130 CleanEx:HS_SF3B3 Genevestigator:Q15393
            GermOnline:ENSG00000189091 Uniprot:Q15393
        Length = 1217

 Score = 3757 (1327.6 bits), Expect = 0., P = 0.
 Identities = 713/1218 (58%), Positives = 927/1218 (76%)

Query:     1 MYLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPE-NSGRIETLVSTEIFG 59
             M+LY+LTLQ+ TGI  AI+GNFSGTK  EIVV+RGK+LELLRP+ N+G++ TL++ E+FG
Sbjct:     1 MFLYNLTLQRATGISFAIHGNFSGTKQQEIVVSRGKILELLRPDPNTGKVHTLLTVEVFG 60

Query:    60 AIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKIHQETFGKSGCRRIVPGQY 119
              IRSL  FRLTG  KDYIVVGSDSGRIVILEY PSKN+F+KIHQETFGKSGCRRIVPGQ+
Sbjct:    61 VIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQPSKNMFEKIHQETFGKSGCRRIVPGQF 120

Query:   120 LAVDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSICGIDCGFDNP 179
             LAVDPKGRAVMI A EKQKLVY+LNRD AARLTISSPLEAHK++T+VY + G+D GF+NP
Sbjct:   121 LAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVVGVDVGFENP 180

Query:   180 IFAAIELDYSEADQDSTGQAASEAQKNLTFYELDLGLNHVSRKWSEPVDNGANMLVTVPG 239
             +FA +E+DY EAD D TG+AA+  Q+ LTFYELDLGLNHV RK+SEP++   N L+TVPG
Sbjct:   181 MFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEHGNFLITVPG 240

Query:   240 GGDGPSGVLVCAENFVIYKNQG-HPDVRAVIPRRA-DLP-AERGVLIVSAATHRQKTLFF 296
             G DGPSGVL+C+EN++ YKN G  PD+R  IPRR  DL   ERG++ V +ATH+ K++FF
Sbjct:   241 GSDGPSGVLICSENYITYKNFGDQPDIRCPIPRRRNDLDDPERGMIFVCSATHKTKSMFF 300

Query:   297 FLLQTEYGDIFKVTLEHDNEHVSELKIKYFDTIPVTASMCVLKSGYLFAASEFGNHALYQ 356
             FL QTE GDIFK+TLE D + V+E+++KYFDT+PV A+MCVLK+G+LF ASEFGNH LYQ
Sbjct:   301 FLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGFLFVASEFGNHYLYQ 360

Query:   357 FQAIGADPDVEASSSTLMETEEGFQPVFFQPRGLKNLVRIEQVESLMPIMDMRIANLFEE 416
                +G D D E   S+ M  EEG    FFQPR LKNLV +++++SL PI+  +IA+L  E
Sbjct:   361 IAHLG-DDDEEPEFSSAMPLEEG-DTFFFQPRPLKNLVLVDELDSLSPILFCQIADLANE 418

Query:   417 EAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDEFDAYIVVSF 476
             + PQ++  CGRGPRSSLR+LR GL VSEMAVS+LPG P+AVWTV++++ DEFDAYI+VSF
Sbjct:   419 DTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRHIEDEFDAYIIVSF 478

Query:   477 NNATLVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHIREDGRINEWRT 536
              NATLVLSIGETVEEV+DSGFL TTP+L+ SL+GDD+L+QV+P GIRHIR D R+NEW+T
Sbjct:   479 VNATLVLSIGETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDGIRHIRADKRVNEWKT 538

Query:   537 PGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQLLE-VEKHEMSGDVACLDIASVPEG 595
             PGK+TIVK   N+ QVVIAL+GGEL+YFE+D +GQL E  E+ EMS DV C+ +A+VP G
Sbjct:   539 PGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERKEMSADVVCMSLANVPPG 598

Query:   596 RKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHP 655
              +RSRFLAVG  DNT+RI+SLDP DC+Q LS+Q++ + PESL  +E+  +   ++  +  
Sbjct:   599 EQRSRFLAVGLVDNTVRIISLDPSDCLQPLSMQALPAQPESLCIVEMGGTEKQDELGERG 658

Query:   656 AS--LFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCLSSR 713
             +   L+LN GLQNGVL RTV+D VTG LSD+R+R+LG RP KLF V + G+ A+L +SSR
Sbjct:   659 SIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRPVKLFRVRMQGQEAVLAMSSR 718

Query:   714 PWLGYIHRGRFLLTPLSYETLEYAASFSSDQCVEGVVSVAGNALRVFTIERLGETFNETA 773
              WL Y ++ RF LTPLSYETLE+A+ F+S+QC EG+V+++ N LR+  +E+LG  FN+ A
Sbjct:   719 SWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAISTNTLRILALEKLGAVFNQVA 778

Query:   774 LPLRYTPRRFVLQPKKKLMVIIETDQGALTAEEREAAKKECFXXXXXXXXXXXXXXXXXX 833
              PL+YTPR+FV+ P+   ++IIETD  A T E  +A +K+                    
Sbjct:   779 FPLQYTPRKFVIHPESNNLIIIETDHNAYT-EATKAQRKQQMAEEMVEAAGEDERELAAE 837

Query:   834 XXXENKYDPLSDEQYGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFH 893
                    + L +  +G PKA + +W S IRV++P   NT  L++L+ NEAAFS+    F 
Sbjct:   838 MAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLDLVQLEQNEAAFSVAVCRFS 897

Query:   894 DKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQ 953
             +      + VG AK L   P R++  G+++ Y+ V  G+ LE LHKT VE +P A+  FQ
Sbjct:   898 NTGEDWYVLVGVAKDLILNP-RSVAGGFVYTYKLVNNGEKLEFLHKTPVEEVPAAIAPFQ 956

Query:   954 GRLLAGIGPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRR 1013
             GR+L G+G +LR+YDLGKK+LLRKCENK   N I  I T   R+ V D+QESF + +Y+R
Sbjct:   957 GRVLIGVGKLLRVYDLGKKKLLRKCENKHIANYISGIQTIGHRVIVSDVQESFIWVRYKR 1016

Query:  1014 DENQLYIFADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIK 1073
             +ENQL IFADD+ PRW+T A  +D+DT+AGADKFGNI  VRLP + +DE++EDPTG K  
Sbjct:  1017 NENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKAL 1076

Query:  1074 WEQGKLNGAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRD 1133
             W++G LNGA  K E I+ +HVG+ V SLQK +L+PGG ES++Y T+ G +G ++ F+S +
Sbjct:  1077 WDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLIPGGSESLVYTTLSGGIGILVPFTSHE 1136

Query:  1134 DVDFFSHLEMHMRQEHPPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTLSLDLQRKIADE 1193
             D DFF H+EMH+R EHPPLCGRDH+++RS YFPVK+VIDGDLCEQF ++  + Q+ +++E
Sbjct:  1137 DHDFFQHVEMHLRSEHPPLCGRDHLSFRSYYFPVKNVIDGDLCEQFNSMEPNKQKNVSEE 1196

Query:  1194 LDRTPGEILKKLEEIRNK 1211
             LDRTP E+ KKLE+IR +
Sbjct:  1197 LDRTPPEVSKKLEDIRTR 1214


>UNIPROTKB|E2RR33 [details] [associations]
            symbol:SF3B3 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0071013 "catalytic step 2 spliceosome"
            evidence=IEA] [GO:0005689 "U12-type spliceosomal complex"
            evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
            InterPro:IPR004871 Pfam:PF03178 GO:GO:0003676 GO:GO:0071013
            GeneTree:ENSGT00530000063396 GO:GO:0005689 KO:K12830 OMA:FDTIPVA
            CTD:23450 EMBL:AAEX03004077 RefSeq:XP_536791.2
            Ensembl:ENSCAFT00000032086 GeneID:479659 KEGG:cfa:479659
            Uniprot:E2RR33
        Length = 1217

 Score = 3756 (1327.2 bits), Expect = 0., P = 0.
 Identities = 713/1218 (58%), Positives = 927/1218 (76%)

Query:     1 MYLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPE-NSGRIETLVSTEIFG 59
             M+LY+LTLQ+ TGI  AI+GNFSGTK  EIVV+RGK+LELLRP+ N+G++ TL++ E+FG
Sbjct:     1 MFLYNLTLQRATGISFAIHGNFSGTKQQEIVVSRGKILELLRPDPNTGKVHTLLTVEVFG 60

Query:    60 AIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKIHQETFGKSGCRRIVPGQY 119
              IRSL  FRLTG  KDYIVVGSDSGRIVILEY PSKN+F+KIHQETFGKSGCRRIVPGQ+
Sbjct:    61 VIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQPSKNMFEKIHQETFGKSGCRRIVPGQF 120

Query:   120 LAVDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSICGIDCGFDNP 179
             LAVDPKGRAVMI A EKQKLVY+LNRD AARLTISSPLEAHK++T+VY + G+D GF+NP
Sbjct:   121 LAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVVGVDVGFENP 180

Query:   180 IFAAIELDYSEADQDSTGQAASEAQKNLTFYELDLGLNHVSRKWSEPVDNGANMLVTVPG 239
             +FA +E+DY EAD D TG+AA+  Q+ LTFYELDLGLNHV RK+SEP++   N L+TVPG
Sbjct:   181 MFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEHGNFLITVPG 240

Query:   240 GGDGPSGVLVCAENFVIYKNQG-HPDVRAVIPRRA-DLP-AERGVLIVSAATHRQKTLFF 296
             G DGPSGVL+C+EN++ YKN G  PD+R  IPRR  DL   ERG++ V +ATH+ K++FF
Sbjct:   241 GSDGPSGVLICSENYITYKNFGDQPDIRCPIPRRRNDLDDPERGMIFVCSATHKTKSMFF 300

Query:   297 FLLQTEYGDIFKVTLEHDNEHVSELKIKYFDTIPVTASMCVLKSGYLFAASEFGNHALYQ 356
             FL QTE GDIFK+TLE D + V+E+++KYFDT+PV A+MCVLK+G+LF ASEFGNH LYQ
Sbjct:   301 FLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGFLFVASEFGNHYLYQ 360

Query:   357 FQAIGADPDVEASSSTLMETEEGFQPVFFQPRGLKNLVRIEQVESLMPIMDMRIANLFEE 416
                +G D D E   S+ M  EEG    FFQPR LKNLV +++++SL PI+  +IA+L  E
Sbjct:   361 IAHLG-DDDEEPEFSSAMPLEEG-DTFFFQPRPLKNLVLVDELDSLSPILFCQIADLANE 418

Query:   417 EAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDEFDAYIVVSF 476
             + PQ++  CGRGPRSSLR+LR GL VSEMAVS+LPG P+AVWTV++++ DEFDAYI+VSF
Sbjct:   419 DTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRHIEDEFDAYIIVSF 478

Query:   477 NNATLVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHIREDGRINEWRT 536
              NATLVLSIGETVEEV+DSGFL TTP+L+ SL+GDD+L+QV+P GIRHIR D R+NEW+T
Sbjct:   479 VNATLVLSIGETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDGIRHIRADKRVNEWKT 538

Query:   537 PGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQLLE-VEKHEMSGDVACLDIASVPEG 595
             PGK+TIVK   N+ QVVIAL+GGEL+YFE+D +GQL E  E+ EMS DV C+ +A+VP G
Sbjct:   539 PGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERKEMSADVVCMSLANVPPG 598

Query:   596 RKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHP 655
              +RSRFLAVG  DNT+RI+SLDP DC+Q LS+Q++ + PESL  +E+  +   ++  +  
Sbjct:   599 EQRSRFLAVGLVDNTVRIISLDPSDCLQPLSMQALPAQPESLCIVEMGGTEKQDELGERG 658

Query:   656 AS--LFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCLSSR 713
             +   L+LN GLQNGVL RTV+D VTG LSD+R+R+LG RP KLF V + G+ A+L +SSR
Sbjct:   659 SIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRPVKLFRVRMQGQEAVLAMSSR 718

Query:   714 PWLGYIHRGRFLLTPLSYETLEYAASFSSDQCVEGVVSVAGNALRVFTIERLGETFNETA 773
              WL Y ++ RF LTPLSYETLE+A+ F+S+QC EG+V+++ N LR+  +E+LG  FN+ A
Sbjct:   719 SWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAISTNTLRILALEKLGAVFNQVA 778

Query:   774 LPLRYTPRRFVLQPKKKLMVIIETDQGALTAEEREAAKKECFXXXXXXXXXXXXXXXXXX 833
              PL+YTPR+FV+ P+   ++IIETD  A T E  +A +K+                    
Sbjct:   779 FPLQYTPRKFVIHPESNNLIIIETDHNAYT-EATKAQRKQQMAEEMVEAAGEDERELAAE 837

Query:   834 XXXENKYDPLSDEQYGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFH 893
                    + L +  +G PKA + +W S IRV++P   NT  L++L+ NEAAFS+    F 
Sbjct:   838 MAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLDLVQLEQNEAAFSVAVCRFS 897

Query:   894 DKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQ 953
             +      + VG AK L   P R++  G+++ Y+ V  G+ LE LHKT VE +P A+  FQ
Sbjct:   898 NTGDDWYVLVGVAKDLILNP-RSVAGGFVYTYKLVNNGEKLEFLHKTPVEEVPAAIAPFQ 956

Query:   954 GRLLAGIGPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRR 1013
             GR+L G+G +LR+YDLGKK+LLRKCENK   N I  I T   R+ V D+QESF + +Y+R
Sbjct:   957 GRVLIGVGKLLRVYDLGKKKLLRKCENKHIANYISGIQTIGHRVIVSDVQESFIWVRYKR 1016

Query:  1014 DENQLYIFADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIK 1073
             +ENQL IFADD+ PRW+T A  +D+DT+AGADKFGNI  VRLP + +DE++EDPTG K  
Sbjct:  1017 NENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKAL 1076

Query:  1074 WEQGKLNGAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRD 1133
             W++G LNGA  K E I+ +HVG+ V SLQK +L+PGG ES++Y T+ G +G ++ F+S +
Sbjct:  1077 WDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLIPGGSESLVYTTLSGGIGILVPFTSHE 1136

Query:  1134 DVDFFSHLEMHMRQEHPPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTLSLDLQRKIADE 1193
             D DFF H+EMH+R EHPPLCGRDH+++RS YFPVK+VIDGDLCEQF ++  + Q+ +++E
Sbjct:  1137 DHDFFQHVEMHLRSEHPPLCGRDHLSFRSYYFPVKNVIDGDLCEQFNSMEPNKQKNVSEE 1196

Query:  1194 LDRTPGEILKKLEEIRNK 1211
             LDRTP E+ KKLE+IR +
Sbjct:  1197 LDRTPPEVSKKLEDIRTR 1214


>ZFIN|ZDB-GENE-040426-2901 [details] [associations]
            symbol:sf3b3 "splicing factor 3b, subunit 3"
            species:7955 "Danio rerio" [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0005681
            "spliceosomal complex" evidence=IEA] [GO:0006397 "mRNA processing"
            evidence=IEA] [GO:0008380 "RNA splicing" evidence=IEA]
            InterPro:IPR017986 InterPro:IPR004871 InterPro:IPR015943
            Pfam:PF03178 ZFIN:ZDB-GENE-040426-2901 GO:GO:0008380
            Gene3D:2.130.10.10 SUPFAM:SSF50978 GO:GO:0006397 GO:GO:0005681
            GO:GO:0003676 eggNOG:NOG247734 GeneTree:ENSGT00530000063396
            KO:K12830 HOGENOM:HOG000216677 OMA:FDTIPVA CTD:23450
            HOVERGEN:HBG093942 OrthoDB:EOG4RV2QJ EMBL:BX784024 EMBL:BC047171
            IPI:IPI00508652 RefSeq:NP_998668.1 RefSeq:XP_002667683.2
            UniGene:Dr.76176 STRING:Q1LVE8 PRIDE:Q1LVE8
            Ensembl:ENSDART00000008310 Ensembl:ENSDART00000122831
            Ensembl:ENSDART00000129666 Ensembl:ENSDART00000147743
            GeneID:100334114 GeneID:406824 KEGG:dre:100334114 KEGG:dre:406824
            InParanoid:Q1LVE8 NextBio:20818331 Bgee:Q1LVE8 Uniprot:Q1LVE8
        Length = 1217

 Score = 3743 (1322.7 bits), Expect = 0., P = 0.
 Identities = 713/1219 (58%), Positives = 923/1219 (75%)

Query:     1 MYLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPE-NSGRIETLVSTEIFG 59
             M+LY++TLQ+ TGI  AI+GNFSGTK  EIVV+RGK+LELLRP+ N+G++ TL++ E+FG
Sbjct:     1 MFLYNITLQRATGISHAIHGNFSGTKQQEIVVSRGKILELLRPDANTGKVHTLLTMEVFG 60

Query:    60 AIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKIHQETFGKSGCRRIVPGQY 119
              +RSL  FRLTG  KDY+VVGSDSGRIVILEY+PSKN+F+KIHQETFGKSGCRRIVPGQ+
Sbjct:    61 VVRSLMAFRLTGGTKDYVVVGSDSGRIVILEYHPSKNMFEKIHQETFGKSGCRRIVPGQF 120

Query:   120 LAVDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSICGIDCGFDNP 179
             LAVDPKGRAVMIGA EKQKLVY+LNRD AARLTISSPLEAHK++T+VY + G+D GF+NP
Sbjct:   121 LAVDPKGRAVMIGATEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVVGVDVGFENP 180

Query:   180 IFAAIELDYSEADQDSTGQAASEAQKNLTFYELDLGLNHVSRKWSEPVDNGANMLVTVPG 239
             +FA +E+DY EAD D TG+AA+  Q+ LTFYELDLGLNHV RK+SE ++   N L+TVPG
Sbjct:   181 MFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEALEEHGNFLITVPG 240

Query:   240 GGDGPSGVLVCAENFVIYKNQG-HPDVRAVIPRRA-DLP-AERGVLIVSAATHRQKTLFF 296
             G DGPSGVL+C+EN++ YKN G  PD+R  IPRR  DL   ERG++ V +ATH+ K++FF
Sbjct:   241 GSDGPSGVLICSENYITYKNFGDQPDIRCPIPRRRNDLDDPERGMIFVCSATHKTKSMFF 300

Query:   297 FLLQTEYGDIFKVTLEHDNEHVSELKIKYFDTIPVTASMCVLKSGYLFAASEFGNHALYQ 356
             FL QTE GDIFKVTLE D E V+E+++KYFDTIPV  +MCVLK+G+LF +SEFGNH LYQ
Sbjct:   301 FLAQTEQGDIFKVTLETDEEMVTEIRMKYFDTIPVATAMCVLKTGFLFVSSEFGNHYLYQ 360

Query:   357 FQAIGADPDVEASSSTLMETEEGFQPVFFQPRGLKNLVRIEQVESLMPIMDMRIANLFEE 416
                +G D D E   S+ M  EEG    FFQPR LKNLV +++ ESL PIM  +IA+L  E
Sbjct:   361 IAHLG-DDDEEPEFSSAMPLEEG-DTFFFQPRPLKNLVLVDEQESLSPIMSCQIADLANE 418

Query:   417 EAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDEFDAYIVVSF 476
             + PQ++  CGRGPRS+LR+LR GL VSEMAVS+LPG P+AVWTV+++V DEFDAYI+VSF
Sbjct:   419 DTPQLYVACGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVRRHVEDEFDAYIIVSF 478

Query:   477 NNATLVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHIREDGRINEWRT 536
              NATLVLSIGETVEEV+DSGFL TTP+L+ SL+G+D+L+QV+P GIRHIR D R+NEW+T
Sbjct:   479 VNATLVLSIGETVEEVTDSGFLGTTPTLSCSLLGEDALVQVYPDGIRHIRADKRVNEWKT 538

Query:   537 PGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQLLE-VEKHEMSGDVACLDIASVPEG 595
             PGK+TI++   N+ QVVIAL+GGEL+YFE+D +GQL E  E+ EMS DV C+ +A+VP G
Sbjct:   539 PGKKTIIRCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERKEMSADVVCMSLANVPPG 598

Query:   596 RKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHP 655
              +RSRFLAVG  DNT+RI+SLDP DC+Q LS+Q++ + PESL  +E+   V  +D     
Sbjct:   599 EQRSRFLAVGLVDNTVRIISLDPSDCLQPLSMQALPAQPESLCIVEM-GGVEKQDELGEK 657

Query:   656 AS---LFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCLSS 712
              +   L+LN GLQNGVL RTV+D VTG LSD+R+R+LG RP KLF V + G+ A+L +SS
Sbjct:   658 GTIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRPVKLFRVRMQGQEAVLAMSS 717

Query:   713 RPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCVEGVVSVAGNALRVFTIERLGETFNET 772
             R WL Y ++ RF LTPLSYETLEYA+ F+S+QC EG+V+++ N LR+  +E+LG  FN+ 
Sbjct:   718 RSWLSYSYQSRFHLTPLSYETLEYASGFASEQCPEGIVAISTNTLRILALEKLGAVFNQV 777

Query:   773 ALPLRYTPRRFVLQPKKKLMVIIETDQGALTAEEREAAKKECFXXXXXXXXXXXXXXXXX 832
             A PL+YTPR+FV+ P+   +++IETD  A T E  +A +K+                   
Sbjct:   778 AFPLQYTPRKFVIHPETNNLILIETDHNAYT-EATKAQRKQQMAEEMVEAAGEDERELAA 836

Query:   833 XXXXENKYDPLSDEQYGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNF 892
                     + L +  +G PKA S +W S +R+++P   NT  L++L+ NEAAFS+    F
Sbjct:   837 EMAAAFLNENLPEAIFGAPKAGSGQWASLVRLINPIQGNTLDLVQLEQNEAAFSVAICRF 896

Query:   893 HDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQF 952
              +      + VG A+ +   P R++  GYI+ YR V  G  LE LHKT VE +PLA+  F
Sbjct:   897 LNGGDDWYVLVGVARDMILNP-RSVGGGYIYTYRIVGGGDKLEFLHKTPVEDVPLAIAPF 955

Query:   953 QGRLLAGIGPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYR 1012
             QGR+L G+G +LR+YDLGKK+LLRKCENK  PN +  I+T   R+ V D+QES  + +YR
Sbjct:   956 QGRVLVGVGKLLRIYDLGKKKLLRKCENKHVPNLVTGIHTIGQRVIVSDVQESLFWVRYR 1015

Query:  1013 RDENQLYIFADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKI 1072
             R+ENQL IFADD+ PRW+T A  +D+DTMA ADKFGNI  VRLP + SD+++EDPTG K 
Sbjct:  1016 RNENQLIIFADDTYPRWITTACLLDYDTMASADKFGNICVVRLPPNTSDDVDEDPTGNKA 1075

Query:  1073 KWEQGKLNGAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSR 1132
              W++G LNGA  K E I+ +H+G+ V SLQK +L+PGG ES++Y T+ G +G ++ F+S 
Sbjct:  1076 LWDRGLLNGASQKAEIIINYHIGETVLSLQKTTLIPGGSESLVYTTLSGGIGILVPFTSH 1135

Query:  1133 DDVDFFSHLEMHMRQEHPPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTLSLDLQRKIAD 1192
             +D DFF HLEMHMR E PPLCGRDH+++RS YFPVK+VIDGDLCEQF ++    Q+ +++
Sbjct:  1136 EDHDFFQHLEMHMRSEFPPLCGRDHLSFRSYYFPVKNVIDGDLCEQFNSMDPHKQKSVSE 1195

Query:  1193 ELDRTPGEILKKLEEIRNK 1211
             ELDRTP E+ KKLE+IR +
Sbjct:  1196 ELDRTPPEVSKKLEDIRTR 1214


>UNIPROTKB|F1P529 [details] [associations]
            symbol:SF3B3 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0005689 "U12-type spliceosomal complex" evidence=IEA]
            [GO:0071013 "catalytic step 2 spliceosome" evidence=IEA]
            InterPro:IPR004871 Pfam:PF03178 GO:GO:0003676 GO:GO:0071013
            GeneTree:ENSGT00530000063396 GO:GO:0005689 OMA:FDTIPVA
            EMBL:AADN02051593 EMBL:AADN02051594 IPI:IPI00576925
            Ensembl:ENSGALT00000003987 ArrayExpress:F1P529 Uniprot:F1P529
        Length = 1228

 Score = 3730 (1318.1 bits), Expect = 0., P = 0.
 Identities = 717/1229 (58%), Positives = 927/1229 (75%)

Query:     1 MYLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPE-NSGRIETLVSTEIFG 59
             M+LY+LTLQ+ TGI  AI+GNFSGTK  EIVV+RGK+LELLRP+ N+G++ TL++ E+FG
Sbjct:     1 MFLYNLTLQRATGISYAIHGNFSGTKQQEIVVSRGKILELLRPDPNTGKVHTLLTVEVFG 60

Query:    60 AIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKIHQETFGKSGCRRIVPGQY 119
              IRSL  FRLTG  KDYIVVGSDSGRIVILEY PSKNVF+KIHQETFGKSGCRRIVPGQY
Sbjct:    61 VIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQPSKNVFEKIHQETFGKSGCRRIVPGQY 120

Query:   120 LAVDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSICGIDCGFDNP 179
             LAVDPKGRAVMI A EKQKLVY+LNRD AARLTISSPLEAHK++T+VY + G+D GF+NP
Sbjct:   121 LAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVVGVDVGFENP 180

Query:   180 IFAAIELDYSEADQDSTGQAASEAQKNLTFYELDLGLNHVSRKWSEPVDNGANMLVTVPG 239
             +FA +E+DY EAD D TG+AA+  Q+ LTFYELDLGLNHV RK+SEP++   N L+TVPG
Sbjct:   181 MFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEHGNFLITVPG 240

Query:   240 GGDGPSGVLVCAENFVIYKNQG-HPDVRAVIPRRA-DLP-AERGVLIVSAATHRQKTLFF 296
             G DGPSGVL+C+EN++ YKN G  PD+R  IPRR  DL   ERG++ V +ATH+ K++FF
Sbjct:   241 GSDGPSGVLICSENYITYKNFGDQPDIRCPIPRRRNDLDDPERGMIFVCSATHKTKSMFF 300

Query:   297 FLLQTEYGDIFKVTLEHDNEHVSELKIKYFDTIPVTASMCVLKSGYLFAASEFGNHALYQ 356
             FL QTE GDIFK+TLE D + V+E+++KYFDT+PV A+MCVLK+G+LF ASEFGNH LYQ
Sbjct:   301 FLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGFLFVASEFGNHYLYQ 360

Query:   357 FQAIGADPDVEASSSTLMETEEGFQPVFFQPRGLKNLVRIEQVESLMPIMDMRIANLFEE 416
                +G D D E   S+ M  EEG    FFQPR LKNLV +++++SL PI+  +IA+L  E
Sbjct:   361 IAHLG-DDDEEPEFSSAMPLEEG-DTFFFQPRPLKNLVLVDELDSLSPILCCQIADLANE 418

Query:   417 EAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDEFDAYIVVSF 476
             + PQ++  CGRGPRSSLR+LR GL VSEMAVS+LPG P+AVWTV+++V DEFDAYI+VSF
Sbjct:   419 DTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRHVEDEFDAYIIVSF 478

Query:   477 NNATLVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHIREDGRINEWRT 536
              NATLVLSIGETVEEV+DSGFL TTP+L+ SL+GDD+L+QV+P GIRHIR D R+NEW+T
Sbjct:   479 VNATLVLSIGETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDGIRHIRADKRVNEWKT 538

Query:   537 PGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQLLE-VEKHEMSGDVACLDIASVPEG 595
             PGK+TIVK   N+ QVVIAL+GGEL+YFE+D +GQL E  E+ EMS DV C+ +A+VP G
Sbjct:   539 PGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERKEMSADVVCMSLANVPPG 598

Query:   596 RKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHP 655
              +RSRFLAVG  DNT+RI+SLDP DC+Q LS+Q++ + PESL  +E+  +   ++  +  
Sbjct:   599 EQRSRFLAVGLVDNTVRIISLDPSDCLQPLSMQALPAQPESLCIVEMGGTEKQDELGERG 658

Query:   656 AS--LFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCLSSR 713
             +   L+LN GLQNGVL RTV+D VTG LSD+R+R+LG RP KLF V + G+ A+L +SSR
Sbjct:   659 SIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRPVKLFRVRMQGQEAVLAMSSR 718

Query:   714 PWLGYIHRGRFLLTPLSYETLEYAASFSSDQCVEGVVSVAGNALRVFTIERLGETFNETA 773
              WL Y ++ RF LTPLSYETLE+A+ F+S+QC EG+V+++ N LR+  +E+LG  FN+ A
Sbjct:   719 SWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAISTNTLRILALEKLGAVFNQVA 778

Query:   774 LPLRYTPRRFVLQPKKKLMVIIETDQGALTAEEREAAKKECFXXXXXXXXXXXXXXXXXX 833
              PL+YTPR+FV+ P+   ++IIETD  A T E  +A +K+                    
Sbjct:   779 FPLQYTPRKFVIHPESNNLIIIETDHNAYT-EATKAQRKQQMAEEMVEAAGEDERELAAE 837

Query:   834 XXXENKYDPLSDEQYGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFH 893
                    + L +  +G PKA + +W S IRV++P   NT  L++L+ NEAAFS+    F 
Sbjct:   838 MAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLDLVQLEQNEAAFSVAVCRFS 897

Query:   894 DKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKS----------LELLHKTQVE 943
             +      + VG AK L   P R++  G+++ Y+ V  G+           LE LHKT VE
Sbjct:   898 NTGEEWYVLVGVAKDLILNP-RSVAGGFVYTYKLVNGGEXTYKLVNGGEKLEFLHKTPVE 956

Query:   944 GIPLALCQFQGRLLAGIGPVLRLYDLGKKRLLRKCENKL-FPNTIVSINTYRDRIYVGDI 1002
              +P A+  FQGR+L G+G +LR+YDLGKK+LLRKCENK    N I  I T   R+ V D+
Sbjct:   957 EVPAAIAPFQGRVLIGVGKLLRVYDLGKKKLLRKCENKKHIANYICGIQTIGHRVIVSDV 1016

Query:  1003 QESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLPQDVSDE 1062
             QESF + +Y+R+ENQL IFADD+ PRW+T A  +D+DT+AGADKFGNI  VRLP + +DE
Sbjct:  1017 QESFIWVRYKRNENQLIIFADDTYPRWVTTATLLDYDTVAGADKFGNICVVRLPPNTNDE 1076

Query:  1063 IEEDPTGGKIKWEQGKLNGAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGS 1122
             ++EDPTG K  W++G LNGA  K E I+ +HVG+ V SLQK +L+PGG ES++Y T+ G 
Sbjct:  1077 VDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLIPGGSESLVYTTLSGG 1136

Query:  1123 LGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTL 1182
             +G ++ F+S +D DFF H+EMH+R EHPPLCGRDH+++RS YFPVK+VIDGDLCEQF ++
Sbjct:  1137 IGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFRSYYFPVKNVIDGDLCEQFNSM 1196

Query:  1183 SLDLQRKIADELDRTPGEILKKLEEIRNK 1211
               + Q+ +A+ELDRTP E+ KKLE+IR +
Sbjct:  1197 EPNKQKNVAEELDRTPPEVSKKLEDIRTR 1225


>FB|FBgn0035162 [details] [associations]
            symbol:CG13900 species:7227 "Drosophila melanogaster"
            [GO:0003684 "damaged DNA binding" evidence=ISS] [GO:0030532 "small
            nuclear ribonucleoprotein complex" evidence=ISS] [GO:0000398 "mRNA
            splicing, via spliceosome" evidence=IC;ISS] [GO:0005686 "U2 snRNP"
            evidence=ISS;IDA] [GO:0007052 "mitotic spindle organization"
            evidence=IMP] [GO:0071011 "precatalytic spliceosome" evidence=IDA]
            [GO:0071013 "catalytic step 2 spliceosome" evidence=IDA]
            [GO:0022008 "neurogenesis" evidence=IMP] InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 GO:GO:0007052 GO:GO:0022008
            Gene3D:2.130.10.10 GO:GO:0003676 GO:GO:0071011 GO:GO:0000398
            GO:GO:0071013 GO:GO:0005686 eggNOG:NOG247734 EMBL:BT021338
            ProteinModelPortal:Q5BI86 SMR:Q5BI86 STRING:Q5BI86 PaxDb:Q5BI86
            PRIDE:Q5BI86 FlyBase:FBgn0035162 InParanoid:Q5BI86
            OrthoDB:EOG4B5MM0 ArrayExpress:Q5BI86 Bgee:Q5BI86 Uniprot:Q5BI86
        Length = 1227

 Score = 3575 (1263.5 bits), Expect = 0., P = 0.
 Identities = 695/1229 (56%), Positives = 897/1229 (72%)

Query:     1 MYLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPE-NSGRIETLVSTEIFG 59
             MYLY+LTLQ+ TG+  A++GNFSG K  E++++RGK LELLRP+ N+G++ TL+STEIFG
Sbjct:     1 MYLYNLTLQKATGVTHAVHGNFSGGKQQEVLLSRGKSLELLRPDSNTGKVHTLLSTEIFG 60

Query:    60 AIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKIHQETFGKSGCRRIVPGQY 119
              +R+L  FRLTG  KDYIVVGSDSGRIVILEYNPSKN  +K+HQETFGKSGCRRIVPGQY
Sbjct:    61 CVRALMAFRLTGGTKDYIVVGSDSGRIVILEYNPSKNALEKVHQETFGKSGCRRIVPGQY 120

Query:   120 LAVDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSICGIDCGFDNP 179
              A+DPKGRAVMIGA EKQKL Y++NRDT ARLTISSPLEAHKS+T+ Y + G+D GFDNP
Sbjct:   121 FAIDPKGRAVMIGAVEKQKLAYIMNRDTQARLTISSPLEAHKSNTLTYHMVGVDVGFDNP 180

Query:   180 IFAAIELDYSEADQDSTGQAASEAQKNLTFYELDLGLNHVSRKWSEPVDNGANMLVTVPG 239
             + A +E+DY EAD D +G AA   Q+ LTFYELDLGLNHV RK+SEP++  AN LV+VPG
Sbjct:   181 MLACLEIDYEEADMDPSGDAAQRTQQTLTFYELDLGLNHVVRKYSEPLEEHANFLVSVPG 240

Query:   240 GGDGPSGVLVCAENFVIYKNQGHP-DVRAVIPRRA-DLP-AERGVLIVSAATHRQKTLFF 296
             G DGPSGVL+C+EN++ YKN G   D+R  IPRR  DL   ERG++ + +ATHR K+++F
Sbjct:   241 GNDGPSGVLICSENYLTYKNLGDQHDIRCPIPRRRNDLDDPERGMIFICSATHRTKSMYF 300

Query:   297 FLLQTEYGDIFKVTLEHDNEHVSELKIKYFDTIPVTASMCVLKSGYLFAASEFGNHALYQ 356
             FLLQTE GDIFK+TLE D++ VSE+K+KYFDT+P   +MCVLK+G+LF ASEFGNH LYQ
Sbjct:   301 FLLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFLFVASEFGNHYLYQ 360

Query:   357 FQAIGADPDVEASSSTLMETEEGFQPVFFQPRGLKNLVRIEQVESLMPIMDMRIANLFEE 416
                +G D D E   S+ M  EEG +  FF PR LKNLV ++++ S  PI+  ++A+L  E
Sbjct:   361 IAHLGDDDD-EPEFSSAMPLEEG-ETFFFAPRALKNLVLVDELPSFAPIITSQVADLANE 418

Query:   417 EAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDEFDAYIVVSF 476
             + PQ++ LCGRGPRS+LR+LR GL VSEMAVS+LPG P+AVWTVKK  +DEFDAYI+VSF
Sbjct:   419 DTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKKRADDEFDAYIIVSF 478

Query:   477 NNATLVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHIREDGRINEWRT 536
              NATLVLSIGETVEEV+DSGFL TTP+L  + +GDD+L+QV+P GIRHIR D R+NEW+ 
Sbjct:   479 VNATLVLSIGETVEEVTDSGFLGTTPTLCCAALGDDALVQVYPDGIRHIRSDKRVNEWKA 538

Query:   537 PGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQLLE-VEKHEMSGDVACLDIASVPEG 595
             PGK++I K   N+ QVVI LSG EL+YFE+D TG+L E  E+ EM  ++ C+ + +VPEG
Sbjct:   539 PGKKSITKCAVNQRQVVITLSGRELVYFEMDPTGELNEYTERSEMPAEIMCMALGTVPEG 598

Query:   596 RKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEV----QASVGGEDG 651
              +RS FLAVG  DNT+RILSLDP++C+   S+Q++ SP ESL  +E+      + GG D 
Sbjct:   599 EQRSWFLAVGLADNTVRILSLDPNNCLTPCSMQALPSPAESLCLVEMGHTESTTQGGLDD 658

Query:   652 ADHPA--------SLFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGG 703
              D PA        +++LN GL NGVL RTV+D V+G L+D+R+R+LG RP KLF + + G
Sbjct:   659 -DAPAQRSGNNKGTIYLNIGLSNGVLLRTVLDPVSGDLADTRTRYLGSRPVKLFRIKMQG 717

Query:   704 RAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCVEGVVSVAGNALRVFTIE 763
               A+L +SSR WL Y H+ RF LTPLSYETLEYA+ FSS+QC EG+V+++ N LR+  +E
Sbjct:   718 SEAVLAMSSRTWLSYYHQNRFHLTPLSYETLEYASGFSSEQCSEGIVAISTNTLRILALE 777

Query:   764 RLGETFNETALPLRYTPRRFVLQPKKKLMVIIETDQGALTAEEREAAKKECFXXXXXXXX 823
             +LG  FN+ A PL+YTPR FV+ P    M+I ETD  A T E+ ++A+KE          
Sbjct:   778 KLGAVFNQVAFPLQYTPRTFVIHPDTGRMLIAETDHNAYT-EDTKSARKEQMAEEMRSAA 836

Query:   824 XXXXXXXXXXXXXENKYDPLSDEQYGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNEA 883
                              + L ++ +  PKA    W S IR LD     T   + L  NEA
Sbjct:   837 GDEERELAREMANAFINEVLPEDVFSSPKAGLGLWASQIRCLDAMHGQTMFSVPLTQNEA 896

Query:   884 AFSICTVNFHDKEHGTL-LAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQV 942
               S+  + F     G   LAVG AK LQ  P+ +   G I IY+      SLE +H+T +
Sbjct:   897 IMSMAMLKFSIAADGRYYLAVGIAKDLQLNPRIS-QGGCIDIYKIDPTCSSLEFMHRTDI 955

Query:   943 EGIPLALCQFQGRLLAGIGPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDI 1002
             + IP ALC FQGRLLAG G +LR+YD GKK++LRKCENK  P  IV+I     R+YV D+
Sbjct:   956 DEIPGALCGFQGRLLAGCGRMLRIYDFGKKKMLRKCENKHIPYQIVNIQAMGHRVYVSDV 1015

Query:  1003 QESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLPQDVSDE 1062
             QES  F +YRR ENQL IFADD+ PRW+TA   +D+DT+A ADKFGN+   RLP  V+D+
Sbjct:  1016 QESVFFIRYRRAENQLIIFADDTHPRWVTATTLLDYDTIAIADKFGNLSIQRLPHSVTDD 1075

Query:  1063 IEEDPTGGKIKWEQGKLNGAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGS 1122
             ++EDPTG K  W++G L+GA  K E I  FHVG+++ SLQKA+L+PGG E++IY T+ G+
Sbjct:  1076 VDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEIIMSLQKATLIPGGSEALIYATLSGT 1135

Query:  1123 LGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTL 1182
             +GA + F+SR+D DFF HLEMHMR E+PPLCGRDH++YRS+Y+PVK+V+DGDLCEQ+ ++
Sbjct:  1136 VGAFVPFTSREDYDFFQHLEMHMRNENPPLCGRDHLSYRSSYYPVKNVLDGDLCEQYLSI 1195

Query:  1183 SLDLQRKIADELDRTPGEILKKLEEIRNK 1211
                 Q+ IA ++ RTP +I KKLE+IR +
Sbjct:  1196 EAAKQKSIAGDMFRTPNQICKKLEDIRTR 1224


>WB|WBGene00019323 [details] [associations]
            symbol:teg-4 species:6239 "Caenorhabditis elegans"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0009792 "embryo development ending in
            birth or egg hatching" evidence=IMP] [GO:0040035 "hermaphrodite
            genitalia development" evidence=IMP] [GO:0009790 "embryo
            development" evidence=IMP] [GO:0001703 "gastrulation with mouth
            forming first" evidence=IMP] [GO:0000003 "reproduction"
            evidence=IMP] [GO:0040007 "growth" evidence=IMP] [GO:0002119
            "nematode larval development" evidence=IMP] [GO:0002009
            "morphogenesis of an epithelium" evidence=IMP] [GO:0042127
            "regulation of cell proliferation" evidence=IMP] [GO:0040020
            "regulation of meiosis" evidence=IMP] [GO:0008406 "gonad
            development" evidence=IMP] [GO:0016477 "cell migration"
            evidence=IMP] [GO:0007281 "germ cell development" evidence=IMP]
            InterPro:IPR017986 InterPro:IPR004871 InterPro:IPR015943
            Pfam:PF03178 GO:GO:0005634 GO:GO:0009792 GO:GO:0002009
            GO:GO:0040007 GO:GO:0016477 GO:GO:0008406 GO:GO:0002119
            Gene3D:2.130.10.10 SUPFAM:SSF50978 GO:GO:0003676 GO:GO:0042127
            GO:GO:0040035 GO:GO:0007281 GO:GO:0040020 eggNOG:NOG247734
            GeneTree:ENSGT00530000063396 GO:GO:0001703 KO:K12830
            HOGENOM:HOG000216677 OMA:FDTIPVA EMBL:FO081029 PIR:T32916
            RefSeq:NP_491953.1 ProteinModelPortal:O44985 STRING:O44985
            PaxDb:O44985 EnsemblMetazoa:K02F2.3 GeneID:172406
            KEGG:cel:CELE_K02F2.3 UCSC:K02F2.3 CTD:172406 WormBase:K02F2.3
            InParanoid:O44985 NextBio:875387 Uniprot:O44985
        Length = 1220

 Score = 3154 (1115.3 bits), Expect = 0., P = 0.
 Identities = 620/1227 (50%), Positives = 840/1227 (68%)

Query:     1 MYLYSLTLQQPTGIIAAINGNFSGT-KTPEIVVARGKVLELLRPEN-SGRIETLVSTEIF 58
             M+LY+LTLQ  + I  AI GNFSGT K  EIVV RG  LELL  +  +G+I+ +   +IF
Sbjct:     1 MHLYNLTLQGQSAINQAIQGNFSGTPKAQEIVVGRGSALELLTLDTVTGKIKVMCHQDIF 60

Query:    59 GAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKIHQETFGKSGCRRIVPGQ 118
             G +RSL  FRLT   +D+I VGSDSGRIVIL+YN  K  F+++HQETFGK+GCRRIVPG 
Sbjct:    61 GIVRSLLAFRLTAGTRDFIAVGSDSGRIVILQYNAEKTCFERLHQETFGKTGCRRIVPGH 120

Query:   119 YLAVDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSICGIDCGFDN 178
             +L  DP+GRA+MIGA E+QKLVY++NRD+ A LTISSPLEAHK HT+ Y++ GID GF+N
Sbjct:   121 FLVGDPRGRALMIGAVERQKLVYIMNRDSEAHLTISSPLEAHKHHTLCYAMVGIDVGFEN 180

Query:   179 PIFAAIELDYSEADQDSTGQAASEAQKNLTFYELDLGLNHVSRKWSEPVDNGANMLVTVP 238
             P FA +E DY +AD D TG+AA   Q+ LTFYELDLGLNHV RK++EP+++  N+L+ VP
Sbjct:   181 PTFACLEFDYEDADNDPTGEAAKRTQQTLTFYELDLGLNHVVRKYAEPLNDPGNLLIAVP 240

Query:   239 GGGDGPSGVLVCAENFVIYKNQG-HPDVRAVIPRRA-DLP-AERGVLIVSAATHRQKTLF 295
             GG DGPSGV+VC EN+++YKN G  PD+R  IPRR  +L  A+R +LI++ ATH+ K ++
Sbjct:   241 GGNDGPSGVIVCCENYLVYKNLGDQPDIRCPIPRRRNELDDADRTMLIIATATHKTKNMY 300

Query:   296 FFLLQTEYGDIFKVTLEHDNEHVSELKIKYFDTIPVTASMCVLKSGYLFAASEFGNHALY 355
             FFL+Q E GDIFKVTLE D + VSE+K+KYFDT+P   ++C+LKSG+LF A+EFGNH LY
Sbjct:   301 FFLVQAENGDIFKVTLETDEDLVSEMKLKYFDTVPPANALCILKSGFLFVAAEFGNHELY 360

Query:   356 QFQAIGADPDVEASSSTLMETEEGFQPVFFQPRGLKNLVRIEQVESLMPIMDMRIANLFE 415
             Q  ++G   D E SS+  M   E     FF+P  LK+L+ I+ ++SL P+ D  I ++  
Sbjct:   361 QIASLGEGDDDEFSSA--MGFGEN-DAAFFEPHELKSLIPIDSMDSLSPLTDAVIGDIAR 417

Query:   416 EEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDEFDAYIVVS 475
             E+A QI++L GRG RSSL++LR GL +SEMAVS LPG P+AVWTVKKN+ D++D+YIVVS
Sbjct:   418 EDAAQIYSLVGRGARSSLKVLRNGLEISEMAVSDLPGNPNAVWTVKKNIEDQYDSYIVVS 477

Query:   476 FNNATLVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHIREDGRINEWR 535
             F NATL L+IG+TVEE SDSGFL TTP++  ++IGDDSL+Q++  GIRHIR D RINEW+
Sbjct:   478 FVNATLALTIGDTVEEASDSGFLPTTPTIGCAMIGDDSLVQIYSEGIRHIRADKRINEWK 537

Query:   536 TPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQLLE-VEKHEMSGDVACLDIASVPE 594
              P +R IVK   NR QV +AL+GGEL+YFE+D+ G L E  E+   + D+AC+  + + E
Sbjct:   538 APPRRQIVKCAVNRRQVAVALTGGELVYFELDLNGTLNEFTERKLFNADIACMTFSEISE 597

Query:   595 GRKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADH 654
             G   SRFLA+G+ DN +RI+SLDP+D +  LS QS+  PPES+L ++       EDG   
Sbjct:   598 GELNSRFLALGTVDNAVRIISLDPNDMLMPLSTQSLPCPPESILLIDTP----NEDGKG- 652

Query:   655 PASLFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCLSSRP 714
              A++ LN GLQNG LFR  VD VTG + D+R+R+LG RP KLF V   GR+A+LC SSR 
Sbjct:   653 VAAVHLNIGLQNGCLFRNTVDNVTGAIMDTRTRYLGTRPVKLFKVQCQGRSAILCTSSRS 712

Query:   715 WLGYIHRGRFLLTPLSYETLEYAASFSSDQCVEGVVSVAGNALRVFTIERLGETFNETAL 774
             WL Y  + RF LTPLSY  LEYAASF S+QC EG+V+++ + LR+   E+LG  FN  + 
Sbjct:   713 WLLYHFQRRFHLTPLSYANLEYAASFCSNQCSEGIVAISASTLRIIAAEKLGVAFNVQSF 772

Query:   775 PLRYTPRRFVLQPKKKLMVIIETDQGALTAEEREAAKKECFXXXXXXXXXXXXXXXXXXX 834
               + TPRR  + P    +++IETD  + T E  +  K+                      
Sbjct:   773 EHKMTPRRVAVHPSMPCLIVIETDHASYT-EVTKNIKRNQMAADVEAMASDETEAQLAQE 831

Query:   835 XXENKYDPLSDEQ-YGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNF- 892
                N  +   DE+ YG P+A   KW S I ++   S +     EL  +E A  +  V F 
Sbjct:   832 IATNLRERRLDERVYGAPRAARGKWASAISLISATSGDKLSYFELPQDENAKCVALVQFS 891

Query:   893 -HDKEHGTLLAVGTAKGLQFW---PKRNIVA---GYIHIYRFVEEGKSLELLHKTQVEGI 945
              H  E   L+  G  + L      P    +    G ++ +     G   + LH+T+   +
Sbjct:   892 KHPNEAMVLVGCGVNEVLNVHDIDPNDTSIRPTRGCVYTFHLSANGDRFDFLHRTETP-L 950

Query:   946 PL-ALCQFQGRLLAGIGPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQE 1004
             P+ A+  F+G  L G G  LR+YD+G+K+LL KCENK FP +IV+I +   RI V D QE
Sbjct:   951 PVGAIHDFRGMALVGFGRFLRMYDIGQKKLLAKCENKNFPVSIVNIQSTGQRIIVSDSQE 1010

Query:  1005 SFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLPQDVSDEIE 1064
             S HF +YR+ +NQL +FADD+ PR++T    +D+ T+A ADKFGN+  VRLP+ V+++++
Sbjct:  1011 SVHFLRYRKGDNQLVVFADDTTPRYVTCVCVLDYHTVAVADKFGNLAVVRLPERVNEDVQ 1070

Query:  1065 EDPTGGKIKWEQGKLNGAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGSLG 1124
             +DPT  K  W++G LNGA  K+E +  F +GD +TSLQK SL+PG  E+++Y T+ G++G
Sbjct:  1071 DDPTVSKSVWDRGWLNGASQKVELVSNFFIGDTITSLQKTSLMPGANEALVYTTIGGAIG 1130

Query:  1125 AMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTLSL 1184
              +++F S+D+VDFF++LEMH+R E+PPLCGRDH+AYRS Y P K VIDGD+CEQF  +  
Sbjct:  1131 CLVSFMSKDEVDFFTNLEMHVRSEYPPLCGRDHLAYRSYYAPCKSVIDGDICEQFSLMDT 1190

Query:  1185 DLQRKIADELDRTPGEILKKLEEIRNK 1211
               Q+ +A+EL +T  EI KKLE+IR +
Sbjct:  1191 QKQKDVAEELGKTVSEISKKLEDIRTR 1217


>ASPGD|ASPL0000031473 [details] [associations]
            symbol:AN5452 species:162425 "Emericella nidulans"
            [GO:0008150 "biological_process" evidence=ND] [GO:0003676 "nucleic
            acid binding" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178 GO:GO:0008380
            Gene3D:2.130.10.10 GO:GO:0006397 GO:GO:0005681 GO:GO:0003676
            GO:GO:0007049 EMBL:BN001305 EMBL:AACD01000094 eggNOG:NOG247734
            KO:K12830 RefSeq:XP_663056.1 STRING:Q5B1X8 GeneID:2871744
            KEGG:ani:AN5452.2 HOGENOM:HOG000216677 OMA:FDTIPVA
            OrthoDB:EOG4FR40R Uniprot:Q5B1X8
        Length = 1209

 Score = 3038 (1074.5 bits), Expect = 8.7e-317, P = 8.7e-317
 Identities = 615/1229 (50%), Positives = 824/1229 (67%)

Query:     1 MYLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPE-NSGRIETLVSTEIFG 59
             M++YSLT+Q PT I  AI G F+GTK  +IV A G  L + RP+   G++  L + ++FG
Sbjct:     7 MFMYSLTIQPPTAITQAILGQFAGTKEQQIVTASGSKLTIHRPDPTQGKVIPLYTQDVFG 66

Query:    60 AIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKIHQETFGKSGCRRIVPGQY 119
              IR+LA FRL GS KDYI++GSDSGRI I+EY PS+N F++IH ETFGKSG RR+VPGQY
Sbjct:    67 IIRTLAAFRLAGSNKDYIIIGSDSGRITIIEYVPSQNRFNRIHLETFGKSGVRRVVPGQY 126

Query:   120 LAVDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSICGIDCGFDNP 179
             LAVDPKGRA +I + EK KLVYVLNR++ A LTISSPLEAHK  T+VYS+  +D G++NP
Sbjct:   127 LAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTLVYSVVALDAGYENP 186

Query:   180 IFAAIELDYSEADQDSTGQAASEAQKNLTFYELDLGLNHVSRKWSEPVDNGANMLVTVPG 239
             +FAA+E+DYSE+DQD TG+A  E +K L +YELDLGLNHV RKW++PVD  ++ML  VPG
Sbjct:   187 VFAALEVDYSESDQDPTGRAYEEVEKLLVYYELDLGLNHVVRKWTDPVDRTSSMLFQVPG 246

Query:   240 GGDGPSGVLVCAENFVIYKNQGHPDVRAVIPRR--ADLPAERGVLIVSAATHRQKTLFFF 297
             G DGPSGVLVCAE+ + Y++      R  IPRR  A    ER   I +   H+ +  FFF
Sbjct:   247 GADGPSGVLVCAEDNITYRHSNQDAFRVPIPRRKGAMENPERKRCITAGVMHKMRGAFFF 306

Query:   298 LLQTEYGDIFKVTLE--HDNE-----HVSELKIKYFDTIPVTASMCVLKSGYLFAASEFG 350
             LLQTE GD+FK+TL+   D++      V  LKIKYFDT+P+ +S+ +LKSG+L+ A+E G
Sbjct:   307 LLQTEDGDLFKLTLDMVEDDKGQLTGEVKGLKIKYFDTVPLASSLLILKSGFLYVAAEGG 366

Query:   351 NHALYQFQAIGADPD-VEASSSTLM-ETEEGFQPVFFQPRGLKNLVRIEQVESLMPIMDM 408
             NH  YQF+ +G D +  E +S     +      PV+FQPRG +NL  +E + SL P++D 
Sbjct:   367 NHHFYQFEKLGDDDEETEFNSDDFSADPAAPCTPVYFQPRGAENLNLVEAINSLNPLVDS 426

Query:   409 RIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDEF 468
             ++ N+ E++APQIFT+ G G RS+ R L+ GL VSE+  S+LP VPSAVWT K    DEF
Sbjct:   427 KVVNISEDDAPQIFTVSGTGARSTFRTLKHGLEVSEIVESELPSVPSAVWTTKLTRADEF 486

Query:   469 DAYIVVSFNNATLVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHIRED 528
             DAYIV+SF N TLVLSIGETVEEV+D+GFL + P+LAV  +G+DSL+Q+HP GIRHI  D
Sbjct:   487 DAYIVLSFANGTLVLSIGETVEEVTDTGFLSSAPTLAVQQLGEDSLIQIHPRGIRHILAD 546

Query:   529 GRINEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQLLEV-EKHEMSGDVACL 587
              R+NEW  P  R+IV   +N  QV +ALS GE++YFE+D  G L E  E+ +MSG V CL
Sbjct:   547 RRVNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFELDADGSLAEYDERRQMSGTVTCL 606

Query:   588 DIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVG 647
              +  VPEGR RS FLAVG  D+T+RILSLDPD  ++  SVQ++++ P +L  + +  S  
Sbjct:   607 SLGEVPEGRVRSSFLAVGCDDSTVRILSLDPDTTLENKSVQALTAAPSALNIIAMADSSS 666

Query:   648 GEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAM 707
             G  G     +L+L+ GL +GV  RT +D VTG+LSD+R+RFLG +  KLF V V G+ A+
Sbjct:   667 G--GT----TLYLHIGLHSGVYLRTALDEVTGELSDTRTRFLGSKAVKLFQVSVTGQTAV 720

Query:   708 LCLSSRPWLGYIH---RGRFLLTPLSYETLEYAASFSSDQCVEGVVSVAGNALRVFTIER 764
             L LSSRPWLGY     +G F+LTPL Y  LE+  +FSS+QCVEG+V + G  LR+F+IE+
Sbjct:   721 LALSSRPWLGYSDTQTKG-FMLTPLDYVGLEWGWNFSSEQCVEGMVGIQGQNLRIFSIEK 779

Query:   765 LGETFNETALPLRYTPRRFVLQPKKKLMVIIETDQGALTAEEREAAKKECFXXXXXXXXX 824
             L     + ++PL YTPR F+  P++ L  +IE D   L+   R    ++           
Sbjct:   780 LDNNMLQQSIPLAYTPRHFIKHPEEPLFYVIEADNNVLSPATRARLLEDSKARGGDTTV- 838

Query:   825 XXXXXXXXXXXXENKYDPLSDEQYGYPKAESDKWVSCIRVLDPRSANTTC-LLELQDNEA 883
                               L  E +GYP+  +  W SCI+++DP  A      +EL++NEA
Sbjct:   839 ------------------LPPEDFGYPRG-TGHWASCIQIIDPLDAKAVVGAVELEENEA 879

Query:   884 AFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVE 943
             A SI  V F  ++  T L VGTAK +   P  +   GYIHIYRF E+GK LE +HKT+VE
Sbjct:   880 AVSIAAVPFTSQDDETFLVVGTAKDMTVNPPSS-AGGYIHIYRFQEDGKELEFIHKTKVE 938

Query:   944 GIPLALCQFQGRLLAGIGPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQ 1003
               PLAL  FQGRLLAG+G VLR+YDLG K+LLRKC+  + P  IV + T   RI V D++
Sbjct:   939 EPPLALLGFQGRLLAGVGSVLRIYDLGMKQLLRKCQAAVAPKAIVGLQTQGSRIVVSDVR 998

Query:  1004 ESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLPQDVSDEI 1063
             ES  +  Y+  +N L  F DDS+ RW TAA  +D++T AG DKFGN++ VR P+  S+E 
Sbjct:   999 ESVTYVVYKYQDNVLIPFVDDSIARWTTAATMVDYETTAGGDKFGNLWLVRCPKKASEEA 1058

Query:  1064 EEDPTGGKIKWEQGKLNGAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGSL 1123
             +E+ +G  +  ++G L G PN++E ++     D+ TSL K  LV GG + +++    G++
Sbjct:  1059 DEEGSGAHLIHDRGYLQGTPNRLELMIHVFTQDIPTSLHKTQLVAGGRDILVWTGFQGTI 1118

Query:  1124 GAMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTLS 1183
             G ++ F SR+DVDFF  LEM +  + PPL GRDH+ YRS Y PVK VIDGDLCEQ+  LS
Sbjct:  1119 GILVPFVSREDVDFFQSLEMQLASQCPPLAGRDHLIYRSYYAPVKGVIDGDLCEQYFLLS 1178

Query:  1184 LDLQRKIADELDRTPGEILKKLEEIRNKI 1212
              D +  IA ELDR+  EI +K+ ++R ++
Sbjct:  1179 NDTKMMIAAELDRSVREIERKISDMRTRV 1207


>RGD|1311636 [details] [associations]
            symbol:Sf3b3 "splicing factor 3b, subunit 3" species:10116
            "Rattus norvegicus" [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0005689
            "U12-type spliceosomal complex" evidence=ISO] [GO:0071013
            "catalytic step 2 spliceosome" evidence=ISO] InterPro:IPR004871
            Pfam:PF03178 RGD:1311636 GO:GO:0005634 GO:GO:0003676
            IPI:IPI00563335 PRIDE:F1LSZ9 Ensembl:ENSRNOT00000044193
            UCSC:RGD:1311636 ArrayExpress:F1LSZ9 Uniprot:F1LSZ9
        Length = 902

 Score = 1836 (651.4 bits), Expect = 1.7e-288, Sum P(2) = 1.7e-288
 Identities = 353/599 (58%), Positives = 465/599 (77%)

Query:   221 RKWSEPVDNGANMLVTVPGGGDGPSGVLVCAENFVIYKNQG-HPDVRAVIPRRA-DLP-A 277
             RK+SEP++   N L+TVPGG DGPSGVL+C+EN++ YKN G  PD+R  IPRR  DL   
Sbjct:     2 RKYSEPLEEHGNFLITVPGGSDGPSGVLICSENYITYKNFGDQPDIRCPIPRRRNDLDDP 61

Query:   278 ERGVLIVSAATHRQKTLFFFLLQTEYGDIFKVTLEHDNEHVSELKIKYFDTIPVTASMCV 337
             ERG++ V +ATH+ K++FFFL QTE GDIFK+TLE D + V+E+++KYFDT+PV A+MCV
Sbjct:    62 ERGMIFVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCV 121

Query:   338 LKSGYLFAASEFGNHALYQFQAIGADPDVEASSSTLMETEEGFQPVFFQPRGLKNLVRIE 397
             LK+G+LF ASEFGNH LYQ   +G D D E   S+ M  EEG    FFQPR LKNLV ++
Sbjct:   122 LKTGFLFVASEFGNHYLYQIAHLG-DDDEEPEFSSAMPLEEG-DTFFFQPRPLKNLVLVD 179

Query:   398 QVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAV 457
             +++SL PI+  +IA+L  E+ PQ++  CGRGPRSSLR+LR GL VSEMAVS+LPG P+AV
Sbjct:   180 ELDSLSPILFCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAV 239

Query:   458 WTVKKNVNDEFDAYIVVSFNNATLVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDSLMQV 517
             WTV++++ DEFDAYI+VSF NATLVLSIGETVEEV+DSGFL TTP+L+ SL+GDD+L+QV
Sbjct:   240 WTVRRHIEDEFDAYIIVSFVNATLVLSIGETVEEVTDSGFLGTTPTLSCSLLGDDALVQV 299

Query:   518 HPSGIRHIREDGRINEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQLLE-VE 576
             +P GIRHIR D R+NEW+TPGK+TIVK   N+ QVVIAL+GGEL+YFE+D +GQL E  E
Sbjct:   300 YPDGIRHIRADKRVNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTE 359

Query:   577 KHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPES 636
             + EMS DV C+ +A+VP G +RSRFLAVG  DNT+RI+SLDP DC+Q LS+Q++ + PES
Sbjct:   360 RKEMSADVVCMSLANVPPGEQRSRFLAVGLVDNTVRIISLDPSDCLQPLSMQALPAQPES 419

Query:   637 LLFLEVQASVGGEDGADHPAS--LFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPP 694
             L  +E+  +   ++  +  +   L+LN GLQNGVL RTV+D VTG LSD+R+R+LG RP 
Sbjct:   420 LCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRPV 479

Query:   695 KLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCVEGVVSVAG 754
             KLF V + G+ A+L +SSR WL Y ++ RF LTPLSYETLE+A+ F+S+QC EG+V+++ 
Sbjct:   480 KLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAIST 539

Query:   755 NALRVFTIERLGETFNETALPLRYTPRRFVLQPKKKLMVIIETDQGALTAEEREAAKKE 813
             N LR+  +E+LG  FN+ A PL+YTPR+FV+ P+   ++IIETD  A T E  +A +K+
Sbjct:   540 NTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYT-EATKAQRKQ 597

 Score = 959 (342.6 bits), Expect = 1.7e-288, Sum P(2) = 1.7e-288
 Identities = 168/297 (56%), Positives = 228/297 (76%)

Query:   915 RNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDLGKKRL 974
             R++  G+++ Y+ V  G+ LE LHKT VE +P A+  FQGR+L G+G +LR+YDLGKK+L
Sbjct:   603 RSVAGGFVYTYKLVNNGEKLEFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVYDLGKKKL 662

Query:   975 LRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAH 1034
             LRKCENK   N I  I T   R+ V D+QESF + +Y+R+ENQL IFADD+ PRW+T A 
Sbjct:   663 LRKCENKHIANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTAS 722

Query:  1035 HIDFDTMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQFHV 1094
              +D+DT+AGADKFGNI  VRLP + +DE++EDPTG K  W++G LNGA  K E I+ +HV
Sbjct:   723 LLDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHV 782

Query:  1095 GDVVTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCG 1154
             G+ V SLQK +L+PGG ES++Y T+ G +G ++ F+S +D DFF H+EMH+R EHPPLCG
Sbjct:   783 GETVLSLQKTTLIPGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCG 842

Query:  1155 RDHMAYRSAYFPVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPGEILKKLEEIRNK 1211
             RDH+++RS YFPVK+VIDGDLCEQF ++  + Q+ +++ELDRTP E+ KKLE+IR +
Sbjct:   843 RDHLSFRSYYFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDIRTR 899


>DICTYBASE|DDB_G0282569 [details] [associations]
            symbol:sf3b3 "splicing factor 3B subunit 3"
            species:44689 "Dictyostelium discoideum" [GO:0005634 "nucleus"
            evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0030532 "small nuclear ribonucleoprotein complex" evidence=ISS]
            [GO:0008380 "RNA splicing" evidence=IEA;ISS] [GO:0006461 "protein
            complex assembly" evidence=ISS] [GO:0005681 "spliceosomal complex"
            evidence=IEA;ISS] [GO:0006397 "mRNA processing" evidence=IEA]
            InterPro:IPR017986 InterPro:IPR004871 InterPro:IPR015943
            Pfam:PF03178 dictyBase:DDB_G0282569 GO:GO:0006461 GO:GO:0008380
            Gene3D:2.130.10.10 SUPFAM:SSF50978 EMBL:AAFI02000047
            GenomeReviews:CM000152_GR GO:GO:0006397 GO:GO:0005681 GO:GO:0003676
            GO:GO:0030532 eggNOG:NOG247734 KO:K12830 OMA:FDTIPVA
            RefSeq:XP_640132.1 STRING:Q54SA7 EnsemblProtists:DDB0233171
            GeneID:8623669 KEGG:ddi:DDB_G0282569 ProtClustDB:CLSZ2729005
            Uniprot:Q54SA7
        Length = 1256

 Score = 1392 (495.1 bits), Expect = 3.1e-287, Sum P(2) = 3.1e-287
 Identities = 274/559 (49%), Positives = 384/559 (68%)

Query:   657 SLFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCLSSRPWL 716
             SLFL  GL+NGV+ R  +D VTG+LSD R+R LG +P KLF V V G  AML LSSR WL
Sbjct:   700 SLFLFVGLKNGVVKRATLDSVTGELSDIRTRLLGRKPVKLFKVKVRGSNAMLALSSRVWL 759

Query:   717 GYIHRGRFLLTPLSYETLEYAASFSSDQCVEGVVSVAGNALRVFTIERLGETFNETALPL 776
              YI++G+  + PLS E LE A++ SS+Q  E +V+ + N + +F+I++LG+ FN+  + L
Sbjct:   760 NYINQGKLDIVPLSIEPLENASNLSSEQSAESIVATSENKIIIFSIDKLGDLFNQETIKL 819

Query:   777 RYTPRRFVLQPKKKLMVIIETDQGALTAE---EREAAKKECFXXXXXXXXXXXXXXXXXX 833
               TP+RF++ P+   ++I+ET+    T     ++   + E                    
Sbjct:   820 NATPKRFIIHPQTSYIIILETETNYNTDNIDIDKINEQSEKLLLEKQKELQQEMDIDDDD 879

Query:   834 XXXENKYDPLSDEQYGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFH 893
                 N+ +P   ++   PKA   KW S I+++DP +  +   L L+D EA FS+CT +F 
Sbjct:   880 QNNNNEIEPF--KKLFKPKAGKGKWKSYIKIMDPITHESLESLMLEDGEAGFSVCTCSFG 937

Query:   894 DKEHGTL-LAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQF 952
               E G + L VG    +   PK +  A ++++YRF++ GK LELL+KT+VE    A+ QF
Sbjct:   938 --ESGEIFLVVGCVTDMVLNPKSHKSA-HLNLYRFIDGGKKLELLYKTEVEEPVYAMAQF 994

Query:   953 QGRLLAGIGPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYR 1012
             QG+L+ G+G  +R+YD+GKK+LLRKCE K  PNTIV+I++  DR+ VGDIQES HF KY+
Sbjct:   995 QGKLVCGVGKSIRIYDMGKKKLLRKCETKNLPNTIVNIHSLGDRLVVGDIQESIHFIKYK 1054

Query:  1013 RDENQLYIFADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKI 1072
             R EN LY+FADD  PRW+T++  +D+DT+AGADKFGNI+ +RLP  +SDE+EEDPTG K+
Sbjct:  1055 RSENMLYVFADDLAPRWMTSSVMLDYDTVAGADKFGNIFVLRLPLLISDEVEEDPTGTKL 1114

Query:  1073 KWEQGKLNGAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSR 1132
             K+E G LNGAP+K++ I  F VGD VT+L K SLV GG E ++Y T+ G++GA++ F+SR
Sbjct:  1115 KFESGTLNGAPHKLDHIANFFVGDTVTTLNKTSLVVGGPEVILYTTISGAIGALIPFTSR 1174

Query:  1133 DDVDFFSHLEMHMRQEHPPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTLSLDLQRKIAD 1192
             +DVDFFS LEM+MR +  PLCGRDH+AYRS YFPVK++IDGDLCEQF TL+   Q  I++
Sbjct:  1175 EDVDFFSTLEMNMRSDCLPLCGRDHLAYRSYYFPVKNIIDGDLCEQFSTLNYQKQLSISE 1234

Query:  1193 ELDRTPGEILKKLEEIRNK 1211
             EL R+P E++KKLEEIR++
Sbjct:  1235 ELSRSPSEVIKKLEEIRSQ 1253

 Score = 1391 (494.7 bits), Expect = 3.1e-287, Sum P(2) = 3.1e-287
 Identities = 267/484 (55%), Positives = 364/484 (75%)

Query:     1 MYLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGA 60
             MYLY+LTLQ+PT +  +I+GNFSGTK  EIV+  G+ LEL+R + +G++++++ TE+FG 
Sbjct:     1 MYLYNLTLQRPTSVYQSISGNFSGTKQVEIVLNHGRSLELIRYDENGKMQSVLYTEVFGI 60

Query:    61 IRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKIHQETFGKSGCRRIVPGQYL 120
             +RS+  FRLT   KDYI+VGSDSGR+VILEYN  KN FDKIHQETFG+SGCRRIVPGQYL
Sbjct:    61 VRSIIPFRLTSGTKDYIIVGSDSGRVVILEYNSQKNQFDKIHQETFGRSGCRRIVPGQYL 120

Query:   121 AVDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSICGIDCGFDNPI 180
             AVDPKGRA MIGA EKQKLVY+LNRD++A LTISSPLEAHKS+TIV+S+CG+D GFDNPI
Sbjct:   121 AVDPKGRAFMIGAIEKQKLVYILNRDSSANLTISSPLEAHKSNTIVFSMCGVDVGFDNPI 180

Query:   181 FAAIELDYSEADQDSTG-----------QAASEAQKNLTFYELDLGLNHVSRKWSEPVDN 229
             FA I +DY+E D  S G           +   + +K LT+YELDLGLN+V RKWS+ VD+
Sbjct:   181 FATISVDYTEEDSSSGGGGGGSIEEMMDEDIGKKKKLLTYYELDLGLNNVVRKWSDQVDD 240

Query:   230 GANMLVTVPGGGDGPSGVLVCAENFVIYKNQGHPDVRAVIPRRADLPAERGVLIVSAATH 289
              AN+++TVPGG +GP GVLV +E++++Y+NQ H +VR+ IPRR      +GVLI+S ++H
Sbjct:   241 SANIVMTVPGGTEGPGGVLVASEDYIVYRNQDHAEVRSRIPRRYGSDPNKGVLIISHSSH 300

Query:   290 RQKTLFFFLLQTEYGDIFKVTLEHDNEHVSELKIKYFDTIPVTASMCVLKSGYLFAASEF 349
             +QK +FFFL+Q+E+GD++K+TL++  + VSE+ + YFDTI +   + VLK+G+LFAASEF
Sbjct:   301 KQKGMFFFLVQSEHGDLYKITLDYQGDQVSEVNVNYFDTIVLANCLTVLKNGFLFAASEF 360

Query:   350 GNHALYQFQAIGADPDVEASSSTLMETEEGFQPVFFQPRG--------LKNLVRIEQVES 401
             G+H LY F++IG D + E  +  L E ++G   ++F PR         LKNL     + S
Sbjct:   361 GDHTLYFFKSIG-DEEEEGQAKRL-EDKDGH--LWFTPRNSCGTKMEELKNLEPTSHLSS 416

Query:   402 LMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVK 461
             L PI+D ++ +L  EE PQ+++LCG G  SSL++LR GL+V+ +  + LPGVPS +WTV 
Sbjct:   417 LSPIIDFKVLDLVREENPQLYSLCGTGLNSSLKVLRHGLSVTTITTANLPGVPSGIWTVP 476

Query:   462 KNVN 465
             K+ +
Sbjct:   477 KSTS 480

 Score = 1139 (406.0 bits), Expect = 1.4e-260, Sum P(2) = 1.4e-260
 Identities = 219/458 (47%), Positives = 326/458 (71%)

Query:   204 QKNLTFYELDLGLNHVSRKWSEPVDNGANMLVTVPGGGDGPSGVLVCAENFVIYKNQGHP 263
             +K LT+YELDLGLN+V RKWS+ VD+ AN+++TVPGG +GP GVLV +E++++Y+NQ H 
Sbjct:   215 KKLLTYYELDLGLNNVVRKWSDQVDDSANIVMTVPGGTEGPGGVLVASEDYIVYRNQDHA 274

Query:   264 DVRAVIPRRADLPAERGVLIVSAATHRQKTLFFFLLQTEYGDIFKVTLEHDNEHVSELKI 323
             +VR+ IPRR      +GVLI+S ++H+QK +FFFL+Q+E+GD++K+TL++  + VSE+ +
Sbjct:   275 EVRSRIPRRYGSDPNKGVLIISHSSHKQKGMFFFLVQSEHGDLYKITLDYQGDQVSEVNV 334

Query:   324 KYFDTIPVTASMCVLKSGYLFAASEFGNHALYQFQAIGADPDVEASSSTLMETEEGFQPV 383
              YFDTI +   + VLK+G+LFAASEFG+H LY F++IG D + E  +  L E ++G   +
Sbjct:   335 NYFDTIVLANCLTVLKNGFLFAASEFGDHTLYFFKSIG-DEEEEGQAKRL-EDKDGH--L 390

Query:   384 FFQPRG--------LKNLVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRI 435
             +F PR         LKNL     + SL PI+D ++ +L  EE PQ+++LCG G  SSL++
Sbjct:   391 WFTPRNSCGTKMEELKNLEPTSHLSSLSPIIDFKVLDLVREENPQLYSLCGTGLNSSLKV 450

Query:   436 LRPGLAVSEMAVSQLPGVPSAVWTVKK----NVNDEFDAYIVVSFNNATLVLSIGETVEE 491
             LR GL+V+ +  + LPGVPS +WTV K    N  D+ D YIVVSF   T VLS+G+T++E
Sbjct:   451 LRHGLSVTTITTANLPGVPSGIWTVPKSTSPNAIDQTDKYIVVSFVGTTSVLSVGDTIQE 510

Query:   492 VSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHIREDGRINEWRTPGKRTIVKVGSNRLQ 551
               +SG L+TT +L V  +GDD+++QV P+G RHI+ D RINEWR PG++TIV+  +N+ Q
Sbjct:   511 NHESGILETTTTLLVKSMGDDAIIQVFPTGFRHIKSDLRINEWRAPGRKTIVRASANQSQ 570

Query:   552 VVIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTI 611
             + IALSGGE+IYFE+D    L+E+ K ++  D+AC++I+ +P+GR  +RF+AV  ++  I
Sbjct:   571 LAIALSGGEIIYFELDQASNLIEIIKKDLRRDIACIEISPIPKGRNMARFIAVSDWEGPI 630

Query:   612 RILSLDPDDCM-QILSVQSVSSPPESLLFLEVQASVGG 648
             R+LSLD D+C+ Q+  + +     ESL  +E+Q +  G
Sbjct:   631 RVLSLDRDNCLGQVSMLDTDKVYIESLSIIEMQLNEMG 668


>UNIPROTKB|E9PT66 [details] [associations]
            symbol:Sf3b3 "Protein Sf3b3" species:10116 "Rattus
            norvegicus" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR004871 Pfam:PF03178
            RGD:1311636 GO:GO:0003676 GO:GO:0071013
            GeneTree:ENSGT00530000063396 GO:GO:0005689 IPI:IPI00958853
            Ensembl:ENSRNOT00000023854 ArrayExpress:E9PT66 Uniprot:E9PT66
        Length = 920

 Score = 2666 (943.5 bits), Expect = 2.3e-277, P = 2.3e-277
 Identities = 506/921 (54%), Positives = 677/921 (73%)

Query:   294 LFFFLLQTEYGDIFKVTLEHDNEHVSELKIKYFDTIPVTASMCVLKSGYLFAASEFGNHA 353
             +FFFL QTE GDIFK+TLE D + V+E+++KYFDT+PV A+MCVLK+G+LF ASEFGNH 
Sbjct:     1 MFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGFLFVASEFGNHY 60

Query:   354 LYQFQAIGADPDVEASSSTLMETEEGFQPVFFQPRGLKNLVRIEQVESLMPIMDMRIANL 413
             LYQ   +G D D E   S+ M  EEG    FFQPR LKNLV +++++SL PI+  +IA+L
Sbjct:    61 LYQIAHLG-DDDEEPEFSSAMPLEEG-DTFFFQPRPLKNLVLVDELDSLSPILFCQIADL 118

Query:   414 FEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDEFDAYIV 473
               E+ PQ++  CGRGPRSSLR+LR GL VSEMAVS+LPG P+AVWTV++++ DEFDAYI+
Sbjct:   119 ANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRHIEDEFDAYII 178

Query:   474 VSFNNATLVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHIREDGRINE 533
             VSF NATLVLSIGETVEEV+DSGFL TTP+L+ SL+GDD+L+QV+P GIRHIR D R+NE
Sbjct:   179 VSFVNATLVLSIGETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDGIRHIRADKRVNE 238

Query:   534 WRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQLLE-VEKHEMSGDVACLDIASV 592
             W+TPGK+TIVK   N+ QVVIAL+GGEL+YFE+D +GQL E  E+ EMS DV C+ +A+V
Sbjct:   239 WKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERKEMSADVVCMSLANV 298

Query:   593 PEGRKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGA 652
             P G +RSRFLAVG  DNT+RI+SLDP DC+Q LS+Q++ + PESL  +E+  +   ++  
Sbjct:   299 PPGEQRSRFLAVGLVDNTVRIISLDPSDCLQPLSMQALPAQPESLCIVEMGGTEKQDELG 358

Query:   653 DHPAS--LFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCL 710
             +  +   L+LN GLQNGVL RTV+D VTG LSD+R+R+LG RP KLF V + G+ A+L +
Sbjct:   359 ERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRPVKLFRVRMQGQEAVLAM 418

Query:   711 SSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCVEGVVSVAGNALRVFTIERLGETFN 770
             SSR WL Y ++ RF LTPLSYETLE+A+ F+S+QC EG+V+++ N LR+  +E+LG  FN
Sbjct:   419 SSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAISTNTLRILALEKLGAVFN 478

Query:   771 ETALPLRYTPRRFVLQPKKKLMVIIETDQGALTAEEREAAKKECFXXXXXXXXXXXXXXX 830
             + A PL+YTPR+FV+ P+   ++IIETD  A T E  +A +K+                 
Sbjct:   479 QVAFPLQYTPRKFVIHPESNNLIIIETDHNAYT-EATKAQRKQQMAEEMVEAPGEDEREL 537

Query:   831 XXXXXXENKYDPLSDEQYGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTV 890
                       + L +  +G PKA + +W S IRV++P   NT  L++L+ NEAAFS+   
Sbjct:   538 AAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLDLVQLEQNEAAFSVAVC 597

Query:   891 NFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALC 950
              F +      + VG AK L   P R++  G+++ Y+ V  G+ LE LHKT VE +P A+ 
Sbjct:   598 RFSNTGEDWYVLVGVAKDLILSP-RSVAGGFVYTYKLVNNGEKLEFLHKTPVEEVPAAIA 656

Query:   951 QFQGRLLAGIGPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCK 1010
              FQGR+L G+G +LR+YDLGKK+LLRKCENK   N I  I T   R+ V D+QESF + +
Sbjct:   657 PFQGRVLIGVGKLLRVYDLGKKKLLRKCENKHIANYISGIQTIGHRVIVSDVQESFIWVR 716

Query:  1011 YRRDENQLYIFADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLPQDVSDEIEEDPTGG 1070
             Y+R+ENQL IFADD+ PRW+T A  +D+DT+AGADKFGNI  VRLP + +DE++EDPTG 
Sbjct:   717 YKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGN 776

Query:  1071 KIKWEQGKLNGAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGSLGAMLAFS 1130
             K  W++G LNGA  K E I+ +HVG+ V SLQK +L+PGG ES++Y T+ G +G ++ F+
Sbjct:   777 KALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLIPGGSESLVYTTLSGGIGILVPFT 836

Query:  1131 SRDDVDFFSHLEMHMRQEHPPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTLSLDLQRKI 1190
             S +D DFF H+EMH+R EHPPLCGRDH+++RS YFPVK+VIDGDLCEQF ++  + Q+ +
Sbjct:   837 SHEDHDFFQHVEMHLRSEHPPLCGRDHLSFRSYYFPVKNVIDGDLCEQFNSMEPNKQKNV 896

Query:  1191 ADELDRTPGEILKKLEEIRNK 1211
             ++ELDRTP E+ KKLE+IR +
Sbjct:   897 SEELDRTPPEVSKKLEDIRTR 917


>GENEDB_PFALCIPARUM|PFL1680w [details] [associations]
            symbol:PFL1680w "splicing factor 3b, subunit 3,
            130kD, putative" species:5833 "Plasmodium falciparum" [GO:0005681
            "spliceosomal complex" evidence=ISS] [GO:0008380 "RNA splicing"
            evidence=ISS] InterPro:IPR017986 InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 GO:GO:0008380 Gene3D:2.130.10.10
            SUPFAM:SSF50978 GO:GO:0005681 GO:GO:0003676 EMBL:AE014188 KO:K12830
            HOGENOM:HOG000216677 RefSeq:XP_001350742.1
            ProteinModelPortal:Q8I574 PRIDE:Q8I574
            EnsemblProtists:PFL1680w:mRNA GeneID:811388 KEGG:pfa:PFL1680w
            EuPathDB:PlasmoDB:PF3D7_1234800 OMA:PVTSSMC ProtClustDB:CLSZ2733835
            Uniprot:Q8I574
        Length = 1329

 Score = 1306 (464.8 bits), Expect = 2.6e-275, Sum P(3) = 2.6e-275
 Identities = 262/604 (43%), Positives = 387/604 (64%)

Query:   187 DYSEADQDSTGQAASEAQKNLTFYELDLGLNHVSRKWSEPVDNGANMLVTVPGGGDGPSG 246
             D +  D      +   A+K L F+ELDLGLNHV +K   P+D  A++L+ +PGG  GPSG
Sbjct:   223 DKNVKDNKDNDFSLDYAKKVLCFWELDLGLNHVIKKHILPIDITAHLLIPLPGGQQGPSG 282

Query:   247 VLVCAENFVIYKNQGHPDVRAVIPRRADLPAERGVLIVSAATHRQKTLFFFLLQTEYGDI 306
             VL+C ENF++YK   H D+    PRR ++  ++ + I+    HR K  FF L+Q+EYGD+
Sbjct:   283 VLICCENFLVYKKVDHEDIYCAYPRRLEIGQDKNISIICWTMHRIKKFFFILIQSEYGDL 342

Query:   307 FKVTLEHDNEHVSELKIKYFDTIPVTASMCVLKSGYLFAASEFGNHALYQFQAIGADPDV 366
             +K+ ++H++  V E+  KYFDT+P+  S+ VLKSG LF A+EFGNH  YQF  IG D   
Sbjct:   343 YKIEVDHEDGIVKEIVCKYFDTVPIGNSISVLKSGSLFVAAEFGNHYFYQFSGIGDDNKQ 402

Query:   367 EASSSTLMETEEGFQPVFFQPRGLKNLVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCG 426
                +S     +     + F+   LKNL  ++Q+ SL PI+DM+I +      PQI+TLCG
Sbjct:   403 FMCTSNHPLGKNAI--IAFKTNKLKNLYLVDQIYSLSPILDMKIIDAKNTHTPQIYTLCG 460

Query:   427 RGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDEFDAYIVVSFNNATLVLSIG 486
             RGPRSSLRIL+ GL++ E+A ++LPG P  +WT+KK+   E+D YIVVSF   TL+L IG
Sbjct:   461 RGPRSSLRILQHGLSIEELADNELPGKPKYIWTIKKDNLSEYDGYIVVSFEGNTLILEIG 520

Query:   487 ETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHIREDGRI-NEWRTPGKRTIVKV 545
             E+VEEVSD+  L+   +L ++++ D+S +QV+ +GIRHI  +G++  EW  P  + I   
Sbjct:   521 ESVEEVSDTLLLNNVTTLHINILYDNSFIQVYDTGIRHI--NGKVVQEWVAPKNKQIKAA 578

Query:   546 GSNRLQVVIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVG 605
              SN  Q+VI+LSGGELIYFE+D +  L+E+ +  ++ +V CL I  +P  R R+ FLAVG
Sbjct:   579 SSNSSQIVISLSGGELIYFEIDESHTLVEIFRKNLNVEVLCLSIQQIPPNRVRANFLAVG 638

Query:   606 SYDNTIRILSLDPDDCMQILSVQSV--SSPPESLLFLEVQASVGGEDGADHPASLFLNAG 663
               DN +R+LS++ D   + LS   +  +S P+ +   E+  +  G    +    +FLN G
Sbjct:   639 CLDNVVRLLSIEKDKYFKQLSTHLLPNNSSPQDICISEMNDN--GNTMKERNI-IFLNIG 695

Query:   664 LQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGR 723
             L  GVL R+++D V G LS+  S++LG +  K+  V V    A+L L  + +L Y+H+G+
Sbjct:   696 LNTGVLLRSIIDPVAGTLSNHYSKYLGAKSIKICPVNVNKNPALLVLCEKTYLCYMHQGK 755

Query:   724 FLLTPLSYETLEYAASFSSDQCVEGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRF 783
             FL +PL+Y+ LEYA+SF S QC +G V+++ N+LR+F   RLGE F++  L L +TPR+ 
Sbjct:   756 FLYSPLNYDMLEYASSFYSPQCSDGYVAISSNSLRIFRFYRLGEVFSQNILHLTFTPRKI 815

Query:   784 VLQP 787
             V  P
Sbjct:   816 VPLP 819

 Score = 751 (269.4 bits), Expect = 2.6e-275, Sum P(3) = 2.6e-275
 Identities = 155/381 (40%), Positives = 239/381 (62%)

Query:   837 ENKYDPLSDEQYGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKE 896
             EN+ D    ++ G  KA   KW SCI++++P +      + L   EAA S+C     +  
Sbjct:   959 ENE-DEYYYDRIGTFKAGQGKWGSCIKIINPVNLQILDKISLDMEEAALSVCACEL-EAL 1016

Query:   897 HGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRL 956
             H   L VGT   L    K ++ +  + +Y +  + K L LLH T +E  P   C + G+L
Sbjct:  1017 H--CLIVGTTTNLSLKTK-SLTSASLRVYTYDIQYK-LNLLHITPIEEQPYCFCSYNGKL 1072

Query:   957 LAGIGPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDEN 1016
             +A IG  LR+Y LGKK+LL+KCE K  P  IVSI    +RI+  DI+ES     Y  ++N
Sbjct:  1073 IASIGNKLRIYALGKKKLLKKCEYKDIPEAIVSIKISGNRIFACDIRESVLIFFYDPNQN 1132

Query:  1017 QLYIFADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIK-WE 1075
              L + +DD +PRW+T +  +D  T+  ADKF +++ +R+P+    E ++D  G   K W 
Sbjct:  1133 TLRLISDDIIPRWITCSEILDHHTIMAADKFDSVFILRVPE----EAKQDEYGITNKCWY 1188

Query:  1076 QGKL-NGAPN--KMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSR 1132
              G++ N +    K+E ++ FH+G++VTS+QK  L P   E +IY T+MG++GA + + ++
Sbjct:  1189 GGEIMNSSTKNRKLEHMMSFHIGEIVTSMQKVRLSPTSSECIIYSTIMGTIGAFIPYDNK 1248

Query:  1133 DDVDFFSHLEMHMRQEHPPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTLSLDLQRKIAD 1192
             ++++   HLE+ +R E PPLCGR+H+ +RS Y PV++V+DGDLCEQF +LS D Q+KIA+
Sbjct:  1249 EELELTQHLEIILRTEKPPLCGREHIFFRSYYHPVQNVVDGDLCEQFSSLSYDAQKKIAN 1308

Query:  1193 ELDRTPGEILKKLEEIRNKIV 1213
             +L+RTP +IL+KLE+IRNKI+
Sbjct:  1309 DLERTPEDILRKLEDIRNKIL 1329

 Score = 631 (227.2 bits), Expect = 2.6e-275, Sum P(3) = 2.6e-275
 Identities = 117/200 (58%), Positives = 154/200 (77%)

Query:     3 LYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAIR 62
             LY LTLQ+PT I   + GNFSG +  EI+VA+G+VLELLR +  G++  ++S +IFG IR
Sbjct:     4 LYHLTLQKPTAITKTVYGNFSGPRFHEIIVAKGQVLELLRSDKQGKLNVIISKDIFGIIR 63

Query:    63 SLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKIHQETFGKSGCRRIVPGQYLAV 122
             S++ FRLTGS KDYIV+GSDSGR+VILEYN  KN F ++H ET+GK+G RRI+PG+Y+AV
Sbjct:    64 SISTFRLTGSNKDYIVIGSDSGRLVILEYNNEKNDFVRVHCETYGKTGIRRIIPGEYIAV 123

Query:   123 DPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSICGIDCGFDNPIFA 182
             DPKGRA+MI A EKQK VY+LNRD    LTISSPLEAHKSH+I + + G++ GF+NP+F 
Sbjct:   124 DPKGRALMICAVEKQKFVYILNRDNKENLTISSPLEAHKSHSICHDVVGLNVGFENPMFV 183

Query:   183 AIELDYSEADQDSTGQAASE 202
             +IE +Y   D+    +  +E
Sbjct:   184 SIEQNYESLDKQINEELENE 203

 Score = 42 (19.8 bits), Expect = 6.0e-57, Sum P(2) = 6.0e-57
 Identities = 20/71 (28%), Positives = 34/71 (47%)

Query:   924 IYRFVEEGKSL-ELLHKT-QVEGIPLALCQFQ-GRLLAG---IG---PVLRLYDLGKKRL 974
             IY  ++E  +L E+  K   VE + L++ Q    R+ A    +G    V+RL  + K + 
Sbjct:   595 IYFEIDESHTLVEIFRKNLNVEVLCLSIQQIPPNRVRANFLAVGCLDNVVRLLSIEKDKY 654

Query:   975 LRKCENKLFPN 985
              ++    L PN
Sbjct:   655 FKQLSTHLLPN 665

 Score = 41 (19.5 bits), Expect = 2.0e-70, Sum P(2) = 2.0e-70
 Identities = 20/81 (24%), Positives = 34/81 (41%)

Query:    66 QFRLTGSQKDYIVVGSDSGRIVILEYNPSKN---VFDK-IHQETFGKSGCRRIVPGQ--- 118
             Q +   S    IV+    G ++  E + S     +F K ++ E    S  ++I P +   
Sbjct:   574 QIKAASSNSSQIVISLSGGELIYFEIDESHTLVEIFRKNLNVEVLCLS-IQQIPPNRVRA 632

Query:   119 -YLAVDPKGRAVMIGACEKQK 138
              +LAV      V + + EK K
Sbjct:   633 NFLAVGCLDNVVRLLSIEKDK 653

 Score = 39 (18.8 bits), Expect = 3.3e-70, Sum P(2) = 3.3e-70
 Identities = 8/20 (40%), Positives = 11/20 (55%)

Query:   546 GSNRLQVVIALSGGELIYFE 565
             GSN+  +VI    G L+  E
Sbjct:    72 GSNKDYIVIGSDSGRLVILE 91

 Score = 37 (18.1 bits), Expect = 5.3e-70, Sum P(2) = 5.3e-70
 Identities = 13/59 (22%), Positives = 25/59 (42%)

Query:   261 GHPDVRAVIPRRADLPAERGVLIVSAATHRQKTLFFFLLQTEYGDIFKVT--LEHDNEH 317
             G   +R +IP        +G  ++  A  +QK  F ++L  +  +   ++  LE    H
Sbjct:   108 GKTGIRRIIPGEYIAVDPKGRALMICAVEKQK--FVYILNRDNKENLTISSPLEAHKSH 164


>UNIPROTKB|Q8I574 [details] [associations]
            symbol:PFL1680w "Splicing factor 3b, subunit 3, 130kD,
            putative" species:36329 "Plasmodium falciparum 3D7" [GO:0005681
            "spliceosomal complex" evidence=ISS] [GO:0008380 "RNA splicing"
            evidence=ISS] InterPro:IPR017986 InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 GO:GO:0008380 Gene3D:2.130.10.10
            SUPFAM:SSF50978 GO:GO:0005681 GO:GO:0003676 EMBL:AE014188 KO:K12830
            HOGENOM:HOG000216677 RefSeq:XP_001350742.1
            ProteinModelPortal:Q8I574 PRIDE:Q8I574
            EnsemblProtists:PFL1680w:mRNA GeneID:811388 KEGG:pfa:PFL1680w
            EuPathDB:PlasmoDB:PF3D7_1234800 OMA:PVTSSMC ProtClustDB:CLSZ2733835
            Uniprot:Q8I574
        Length = 1329

 Score = 1306 (464.8 bits), Expect = 2.6e-275, Sum P(3) = 2.6e-275
 Identities = 262/604 (43%), Positives = 387/604 (64%)

Query:   187 DYSEADQDSTGQAASEAQKNLTFYELDLGLNHVSRKWSEPVDNGANMLVTVPGGGDGPSG 246
             D +  D      +   A+K L F+ELDLGLNHV +K   P+D  A++L+ +PGG  GPSG
Sbjct:   223 DKNVKDNKDNDFSLDYAKKVLCFWELDLGLNHVIKKHILPIDITAHLLIPLPGGQQGPSG 282

Query:   247 VLVCAENFVIYKNQGHPDVRAVIPRRADLPAERGVLIVSAATHRQKTLFFFLLQTEYGDI 306
             VL+C ENF++YK   H D+    PRR ++  ++ + I+    HR K  FF L+Q+EYGD+
Sbjct:   283 VLICCENFLVYKKVDHEDIYCAYPRRLEIGQDKNISIICWTMHRIKKFFFILIQSEYGDL 342

Query:   307 FKVTLEHDNEHVSELKIKYFDTIPVTASMCVLKSGYLFAASEFGNHALYQFQAIGADPDV 366
             +K+ ++H++  V E+  KYFDT+P+  S+ VLKSG LF A+EFGNH  YQF  IG D   
Sbjct:   343 YKIEVDHEDGIVKEIVCKYFDTVPIGNSISVLKSGSLFVAAEFGNHYFYQFSGIGDDNKQ 402

Query:   367 EASSSTLMETEEGFQPVFFQPRGLKNLVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCG 426
                +S     +     + F+   LKNL  ++Q+ SL PI+DM+I +      PQI+TLCG
Sbjct:   403 FMCTSNHPLGKNAI--IAFKTNKLKNLYLVDQIYSLSPILDMKIIDAKNTHTPQIYTLCG 460

Query:   427 RGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDEFDAYIVVSFNNATLVLSIG 486
             RGPRSSLRIL+ GL++ E+A ++LPG P  +WT+KK+   E+D YIVVSF   TL+L IG
Sbjct:   461 RGPRSSLRILQHGLSIEELADNELPGKPKYIWTIKKDNLSEYDGYIVVSFEGNTLILEIG 520

Query:   487 ETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHIREDGRI-NEWRTPGKRTIVKV 545
             E+VEEVSD+  L+   +L ++++ D+S +QV+ +GIRHI  +G++  EW  P  + I   
Sbjct:   521 ESVEEVSDTLLLNNVTTLHINILYDNSFIQVYDTGIRHI--NGKVVQEWVAPKNKQIKAA 578

Query:   546 GSNRLQVVIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVG 605
              SN  Q+VI+LSGGELIYFE+D +  L+E+ +  ++ +V CL I  +P  R R+ FLAVG
Sbjct:   579 SSNSSQIVISLSGGELIYFEIDESHTLVEIFRKNLNVEVLCLSIQQIPPNRVRANFLAVG 638

Query:   606 SYDNTIRILSLDPDDCMQILSVQSV--SSPPESLLFLEVQASVGGEDGADHPASLFLNAG 663
               DN +R+LS++ D   + LS   +  +S P+ +   E+  +  G    +    +FLN G
Sbjct:   639 CLDNVVRLLSIEKDKYFKQLSTHLLPNNSSPQDICISEMNDN--GNTMKERNI-IFLNIG 695

Query:   664 LQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGR 723
             L  GVL R+++D V G LS+  S++LG +  K+  V V    A+L L  + +L Y+H+G+
Sbjct:   696 LNTGVLLRSIIDPVAGTLSNHYSKYLGAKSIKICPVNVNKNPALLVLCEKTYLCYMHQGK 755

Query:   724 FLLTPLSYETLEYAASFSSDQCVEGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRF 783
             FL +PL+Y+ LEYA+SF S QC +G V+++ N+LR+F   RLGE F++  L L +TPR+ 
Sbjct:   756 FLYSPLNYDMLEYASSFYSPQCSDGYVAISSNSLRIFRFYRLGEVFSQNILHLTFTPRKI 815

Query:   784 VLQP 787
             V  P
Sbjct:   816 VPLP 819

 Score = 751 (269.4 bits), Expect = 2.6e-275, Sum P(3) = 2.6e-275
 Identities = 155/381 (40%), Positives = 239/381 (62%)

Query:   837 ENKYDPLSDEQYGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKE 896
             EN+ D    ++ G  KA   KW SCI++++P +      + L   EAA S+C     +  
Sbjct:   959 ENE-DEYYYDRIGTFKAGQGKWGSCIKIINPVNLQILDKISLDMEEAALSVCACEL-EAL 1016

Query:   897 HGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRL 956
             H   L VGT   L    K ++ +  + +Y +  + K L LLH T +E  P   C + G+L
Sbjct:  1017 H--CLIVGTTTNLSLKTK-SLTSASLRVYTYDIQYK-LNLLHITPIEEQPYCFCSYNGKL 1072

Query:   957 LAGIGPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDEN 1016
             +A IG  LR+Y LGKK+LL+KCE K  P  IVSI    +RI+  DI+ES     Y  ++N
Sbjct:  1073 IASIGNKLRIYALGKKKLLKKCEYKDIPEAIVSIKISGNRIFACDIRESVLIFFYDPNQN 1132

Query:  1017 QLYIFADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIK-WE 1075
              L + +DD +PRW+T +  +D  T+  ADKF +++ +R+P+    E ++D  G   K W 
Sbjct:  1133 TLRLISDDIIPRWITCSEILDHHTIMAADKFDSVFILRVPE----EAKQDEYGITNKCWY 1188

Query:  1076 QGKL-NGAPN--KMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSR 1132
              G++ N +    K+E ++ FH+G++VTS+QK  L P   E +IY T+MG++GA + + ++
Sbjct:  1189 GGEIMNSSTKNRKLEHMMSFHIGEIVTSMQKVRLSPTSSECIIYSTIMGTIGAFIPYDNK 1248

Query:  1133 DDVDFFSHLEMHMRQEHPPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTLSLDLQRKIAD 1192
             ++++   HLE+ +R E PPLCGR+H+ +RS Y PV++V+DGDLCEQF +LS D Q+KIA+
Sbjct:  1249 EELELTQHLEIILRTEKPPLCGREHIFFRSYYHPVQNVVDGDLCEQFSSLSYDAQKKIAN 1308

Query:  1193 ELDRTPGEILKKLEEIRNKIV 1213
             +L+RTP +IL+KLE+IRNKI+
Sbjct:  1309 DLERTPEDILRKLEDIRNKIL 1329

 Score = 631 (227.2 bits), Expect = 2.6e-275, Sum P(3) = 2.6e-275
 Identities = 117/200 (58%), Positives = 154/200 (77%)

Query:     3 LYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAIR 62
             LY LTLQ+PT I   + GNFSG +  EI+VA+G+VLELLR +  G++  ++S +IFG IR
Sbjct:     4 LYHLTLQKPTAITKTVYGNFSGPRFHEIIVAKGQVLELLRSDKQGKLNVIISKDIFGIIR 63

Query:    63 SLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKIHQETFGKSGCRRIVPGQYLAV 122
             S++ FRLTGS KDYIV+GSDSGR+VILEYN  KN F ++H ET+GK+G RRI+PG+Y+AV
Sbjct:    64 SISTFRLTGSNKDYIVIGSDSGRLVILEYNNEKNDFVRVHCETYGKTGIRRIIPGEYIAV 123

Query:   123 DPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSICGIDCGFDNPIFA 182
             DPKGRA+MI A EKQK VY+LNRD    LTISSPLEAHKSH+I + + G++ GF+NP+F 
Sbjct:   124 DPKGRALMICAVEKQKFVYILNRDNKENLTISSPLEAHKSHSICHDVVGLNVGFENPMFV 183

Query:   183 AIELDYSEADQDSTGQAASE 202
             +IE +Y   D+    +  +E
Sbjct:   184 SIEQNYESLDKQINEELENE 203

 Score = 42 (19.8 bits), Expect = 6.0e-57, Sum P(2) = 6.0e-57
 Identities = 20/71 (28%), Positives = 34/71 (47%)

Query:   924 IYRFVEEGKSL-ELLHKT-QVEGIPLALCQFQ-GRLLAG---IG---PVLRLYDLGKKRL 974
             IY  ++E  +L E+  K   VE + L++ Q    R+ A    +G    V+RL  + K + 
Sbjct:   595 IYFEIDESHTLVEIFRKNLNVEVLCLSIQQIPPNRVRANFLAVGCLDNVVRLLSIEKDKY 654

Query:   975 LRKCENKLFPN 985
              ++    L PN
Sbjct:   655 FKQLSTHLLPN 665

 Score = 41 (19.5 bits), Expect = 2.0e-70, Sum P(2) = 2.0e-70
 Identities = 20/81 (24%), Positives = 34/81 (41%)

Query:    66 QFRLTGSQKDYIVVGSDSGRIVILEYNPSKN---VFDK-IHQETFGKSGCRRIVPGQ--- 118
             Q +   S    IV+    G ++  E + S     +F K ++ E    S  ++I P +   
Sbjct:   574 QIKAASSNSSQIVISLSGGELIYFEIDESHTLVEIFRKNLNVEVLCLS-IQQIPPNRVRA 632

Query:   119 -YLAVDPKGRAVMIGACEKQK 138
              +LAV      V + + EK K
Sbjct:   633 NFLAVGCLDNVVRLLSIEKDK 653

 Score = 39 (18.8 bits), Expect = 3.3e-70, Sum P(2) = 3.3e-70
 Identities = 8/20 (40%), Positives = 11/20 (55%)

Query:   546 GSNRLQVVIALSGGELIYFE 565
             GSN+  +VI    G L+  E
Sbjct:    72 GSNKDYIVIGSDSGRLVILE 91

 Score = 37 (18.1 bits), Expect = 5.3e-70, Sum P(2) = 5.3e-70
 Identities = 13/59 (22%), Positives = 25/59 (42%)

Query:   261 GHPDVRAVIPRRADLPAERGVLIVSAATHRQKTLFFFLLQTEYGDIFKVT--LEHDNEH 317
             G   +R +IP        +G  ++  A  +QK  F ++L  +  +   ++  LE    H
Sbjct:   108 GKTGIRRIIPGEYIAVDPKGRALMICAVEKQK--FVYILNRDNKENLTISSPLEAHKSH 164


>POMBASE|SPAPJ698.03c [details] [associations]
            symbol:prp12 "U2 snRNP-associated protein Sap130
            (predicted)" species:4896 "Schizosaccharomyces pombe" [GO:0000245
            "spliceosomal complex assembly" evidence=ISS] [GO:0005681
            "spliceosomal complex" evidence=IEA] [GO:0005686 "U2 snRNP"
            evidence=ISS] [GO:0030620 "U2 snRNA binding" evidence=ISS]
            [GO:0045292 "mRNA cis splicing, via spliceosome" evidence=ISS]
            InterPro:IPR017986 InterPro:IPR004871 InterPro:IPR015943
            Pfam:PF03178 PomBase:SPAPJ698.03c EMBL:CU329670
            GenomeReviews:CU329670_GR Gene3D:2.130.10.10 SUPFAM:SSF50978
            GO:GO:0005681 GO:GO:0007049 GO:GO:0000245 GO:GO:0005686
            GO:GO:0045292 eggNOG:NOG247734 GO:GO:0030620 KO:K12830
            HOGENOM:HOG000216677 OMA:FDTIPVA OrthoDB:EOG4FR40R EMBL:AB034966
            RefSeq:NP_594414.1 IntAct:Q9UTT2 STRING:Q9UTT2
            EnsemblFungi:SPAPJ698.03c.1 GeneID:2543278 KEGG:spo:SPAPJ698.03c
            NextBio:20804299 Uniprot:Q9UTT2
        Length = 1206

 Score = 1121 (399.7 bits), Expect = 4.4e-191, Sum P(2) = 4.4e-191
 Identities = 230/532 (43%), Positives = 350/532 (65%)

Query:   277 AERGVLIVSAATHRQKTLFFFLLQTEYGDIFKVTLEHDNE-HVSELKIKYFDTIPVTASM 335
             A  G LIVSA  H+ K  FF+LLQT  GD+ K+T+EHD + +V EL++KYFDT+P+   +
Sbjct:   299 ANDGPLIVSAVLHKMKGSFFYLLQTGDGDLLKLTIEHDGQGNVVELRLKYFDTVPLAVQL 358

Query:   336 CVLKSGYLFAASEFGNHALYQFQAIGADPD-VEASSSTLM--ETEEGFQPVFFQPRGLKN 392
              +LK+G+LF A+EFGNH LYQF+ +G D D +E +S      + E G + V F  RGL+N
Sbjct:   359 NILKTGFLFVATEFGNHQLYQFENLGIDDDELEITSLDFQAQDNEVGTKNVHFGVRGLQN 418

Query:   393 LVRIEQVESLMPIMDMRIANLFEE-EAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLP 451
             L  +E++ SL  + D  +       EA Q++T+CGRG  SSLR LR GL  +E+  S+LP
Sbjct:   419 LSLVEEIPSLYSLTDTLLMKAPSSGEANQLYTVCGRGSNSSLRQLRRGLETTEIVASELP 478

Query:   452 GVPSAVWTVKKNVNDEFDAYIVVSFNNATLVLSIGETVEEVSDSGFLDTTPSLAVSLIGD 511
             G P A+WT+K N  D +D+YI++SF N TLVLSIGETVEE+SDSGFL +  +L    +G 
Sbjct:   479 GAPIAIWTLKLNQTDVYDSYIILSFTNGTLVLSIGETVEEISDSGFLSSVSTLNARQMGR 538

Query:   512 DSLMQVHPSGIRHIREDGRINEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEV--DMT 569
             DSL+Q+HP GIR+IR + + +EW+ P    +V+   N +Q+V+ALS GEL+YFE+  D+ 
Sbjct:   539 DSLVQIHPKGIRYIRANKQTSEWKLPQDVYVVQSAINDMQIVVALSNGELVYFEMSDDVE 598

Query:   570 G-QLLEV-EKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQILSV 627
             G QL E  E+  ++ +V  L +  V EG +RS F+ +   D T+R+LSLD    ++ LSV
Sbjct:   599 GGQLNEYQERKTLTANVTSLALGPVQEGSRRSNFMCLACDDATVRVLSLDLYTTLENLSV 658

Query:   628 QSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRSR 687
             Q++SSP  SL  + +  +V G       ++L+L+ GL NGV  RTV+D+ +GQL D+R+R
Sbjct:   659 QALSSPANSLCIIPM--NVNGV------STLYLHIGLMNGVYLRTVIDVTSGQLLDTRTR 710

Query:   688 FLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCVE 747
             FLG R  K++ + +  +  +L +SSR +L Y ++    L+P++Y  +++A+SF+S+QC E
Sbjct:   711 FLGPRAVKIYPITMKNQNTVLAVSSRTFLAYSYQQNLQLSPIAYSAIDHASSFASEQCPE 770

Query:   748 GVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQPKKKLMVIIETDQ 799
             G+V++  N L++FT++ L +       PL  TPR+ V  P   ++ I+++++
Sbjct:   771 GIVAIQKNTLKIFTVDSLQDDLKSDIYPLICTPRKIVKHPNFPVLYILQSER 822

 Score = 949 (339.1 bits), Expect = 6.7e-173, Sum P(2) = 6.7e-173
 Identities = 193/382 (50%), Positives = 262/382 (68%)

Query:     1 MYLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPE-NSGRIETLVSTEIFG 59
             ++LYSLT+Q    + ++   + SG K  EIV+A    L + + +   GR+  +++   FG
Sbjct:     7 LFLYSLTIQNSNYVQSSCAASLSGKKAQEIVIATESRLLIYKVDATDGRMNCILNQNCFG 66

Query:    60 AIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKIHQETFGKSGCRRIVPGQY 119
              IR++A  RLTG ++DY+VV SDSGRI ILEYN  KN    I+QETFGKSG RR+VPG+Y
Sbjct:    67 IIRNVAPLRLTGFKRDYLVVTSDSGRITILEYNVEKNKLVPIYQETFGKSGIRRVVPGEY 126

Query:   120 LAVDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSICGIDCGFDNP 179
             LA+D KGRA MI + EK KLVYVLNRD+ A LTISSPLEAHK++ I + + G+D G+ NP
Sbjct:   127 LAIDAKGRAAMIASVEKNKLVYVLNRDSEANLTISSPLEAHKANNICFHLIGLDTGYANP 186

Query:   180 IFAAIELDYSEADQDSTGQAASEAQKNLTFYELDLGLNHVSRKWSEPVDNGANMLVTVPG 239
             IFAA+E+DYSE D DST +A + ++K L++YELDLGLNHV ++WS+ VD  + ML+ VPG
Sbjct:   187 IFAALEVDYSEIDHDSTREAFTSSEKVLSYYELDLGLNHVVKRWSKVVDRNSYMLIPVPG 246

Query:   240 GGDGPSGVLVCAENFVIYKNQG---H--PDVR--AVIPRRADLP--------AERGVLIV 284
             G DGPSG LV +  ++ Y++     H  P +R  A        P        A  G LIV
Sbjct:   247 GNDGPSGTLVISNGWISYRHLQKAFHQIPILRRQAASANAISTPWNQVNSNSANDGPLIV 306

Query:   285 SAATHRQKTLFFFLLQTEYGDIFKVTLEHDNE-HVSELKIKYFDTIPVTASMCVLKSGYL 343
             SA  H+ K  FF+LLQT  GD+ K+T+EHD + +V EL++KYFDT+P+   + +LK+G+L
Sbjct:   307 SAVLHKMKGSFFYLLQTGDGDLLKLTIEHDGQGNVVELRLKYFDTVPLAVQLNILKTGFL 366

Query:   344 FAASEFGNHALYQFQAIGADPD 365
             F A+EFGNH LYQF+ +G D D
Sbjct:   367 FVATEFGNHQLYQFENLGIDDD 388

 Score = 753 (270.1 bits), Expect = 4.4e-191, Sum P(2) = 4.4e-191
 Identities = 158/355 (44%), Positives = 218/355 (61%)

Query:   855 SDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPK 914
             S  WVS I V D  S        L DNEAAFS+    F +++   L+A G+A  +     
Sbjct:   850 SKSWVSFISVFDMISKKIIHESPLGDNEAAFSMTAAFFKNRDEFFLVA-GSATNMDL-EC 907

Query:   915 RNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDLGKKRL 974
             R    G   +YRF +EGK LEL+  T+++GIP+AL  FQGR+LAG+G  LR+YDLG K++
Sbjct:   908 RTCSHGNFRVYRFHDEGKKLELISHTEIDGIPMALTPFQGRMLAGVGRFLRIYDLGNKKM 967

Query:   975 LRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAH 1034
             LRK E    P  I  I     RI V D Q S  F  Y+ ++N L  FADD++ RW T   
Sbjct:   968 LRKGELSAVPLFITHITVQASRIVVADSQYSVRFVVYKPEDNHLLTFADDTIHRWTTTNV 1027

Query:  1035 HIDFDTMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQFHV 1094
              +D+DT+AG DKFGNI+ +R P+ VS   +E+ +  K+  E+  LN  P+K++ +  F  
Sbjct:  1028 LVDYDTLAGGDKFGNIWLLRCPEHVSKLADEENSESKLIHEKPFLNSTPHKLDLMAHFFT 1087

Query:  1095 GDVVTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCG 1154
              D+ TSLQK  LV G  E +++  ++G++G    F +++DV FF  LE  +R+E PPL G
Sbjct:  1088 NDIPTSLQKVQLVEGAREVLLWTGLLGTVGVFTPFINQEDVRFFQQLEFLLRKECPPLAG 1147

Query:  1155 RDHMAYRSAYFPVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPGEILKKLEEIR 1209
             RDH+AYRS Y PVK VIDGDLCE + +L   +Q  IA+ELDRT  E+ KK+E+ R
Sbjct:  1148 RDHLAYRSYYAPVKCVIDGDLCEMYYSLPHPVQEMIANELDRTIAEVSKKIEDFR 1202

 Score = 42 (19.8 bits), Expect = 3.4e-71, Sum P(2) = 3.4e-71
 Identities = 11/45 (24%), Positives = 19/45 (42%)

Query:   297 FLLQTEYGDI-FKVTLEHDNEHVSELKIKYFDTIPVTASMCVLKS 340
             F    E GD+    T E  NEH S+  + +     + +   + +S
Sbjct:   827 FKYAQENGDVGSSYTKEKQNEHTSKSWVSFISVFDMISKKIIHES 871


>CGD|CAL0004426 [details] [associations]
            symbol:orf19.5391 species:5476 "Candida albicans" [GO:0071004
            "U2-type prespliceosome" evidence=IEA] [GO:0005686 "U2 snRNP"
            evidence=IEA] [GO:0030620 "U2 snRNA binding" evidence=IEA]
            [GO:0000245 "spliceosomal complex assembly" evidence=IEA]
            InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178 CGD:CAL0004426
            GO:GO:0008380 Gene3D:2.130.10.10 GO:GO:0006397 GO:GO:0005681
            GO:GO:0003676 GO:GO:0007049 eggNOG:NOG247734 EMBL:AACQ01000051
            EMBL:AACQ01000050 RefSeq:XP_717672.1 RefSeq:XP_717766.1
            STRING:Q5A7S5 GeneID:3640538 GeneID:3640666 KEGG:cal:CaO19.12846
            KEGG:cal:CaO19.5391 KO:K12830 Uniprot:Q5A7S5
        Length = 1219

 Score = 848 (303.6 bits), Expect = 5.7e-142, Sum P(2) = 5.7e-142
 Identities = 231/699 (33%), Positives = 375/699 (53%)

Query:     1 MYLYSLTLQQPTGIIAAINGNF----SGTKTPE-IVVARGKVLELLR-PENSGRIETLVS 54
             +YLY+LTL+ P+  I++I G F    + TK  + +V+     L+L    E +G++E   S
Sbjct:    40 VYLYNLTLKPPSYYISSIVGQFYKQDNSTKNAQQLVLVSSTTLQLFEINEEAGKLELQSS 99

Query:    55 TEIFGAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEY-NPSKNVFDKIHQETFGKSGCRR 113
               + G I S+ +  L  S+ D +V+ SDSG + IL+Y N +K    KI QE   K+G  R
Sbjct:   100 QNLLGIINSIEKICL--SEVDGVVITSDSGNLSILQYDNKTKKFISKI-QEPMTKNGWGR 156

Query:   114 IVPGQYLAVDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSICGID 173
                G+ LA+DP+ R +++ A EK KL Y +  +++    +SSPLEAH    +   I  ++
Sbjct:   157 NYVGENLAIDPENRCILVAAMEKNKLFYKIESNSSGSKELSSPLEAHSKQVLCLKIVALN 216

Query:   174 CGFDNPIFAAIELDYSEADQDSTGQAASEAQKNLTFYELDLGLNHVSRKW-----SEPVD 228
                +NP+F A+EL               E +  + +YELD GLNHV +K      S+P+ 
Sbjct:   217 TDHNNPLFGALEL-------------TPEKKCIINYYELDQGLNHVVKKKPNSSNSDPLP 263

Query:   229 NGANMLVTVPGGGDGPSGVLVCAENFVIYKNQGHPDVRAVIPRRADLPAERGVLIVSAAT 288
             N  N L+ +PG      G++VC  N+  Y     P +   +PRR     +   +IV+  T
Sbjct:   264 NDVNYLIPLPGH---IGGMVVCGTNWCFYDKLDGPRIYLPLPRRNGQTQDS--IIVNHVT 318

Query:   289 H-RQKTLFFFLLQTEYGDIFKVTLEHD--NEHVSELKIKYFDTIPVTASMCVLKSGYLFA 345
             H  +K  FF LLQ   GD+FK+T+++D   E +  + I YFDTIP   S+ + K+G+LFA
Sbjct:   319 HVLKKKKFFILLQNALGDLFKLTVDYDFDKEIIKNISITYFDTIPPALSLNIFKNGFLFA 378

Query:   346 ASEFGNHALYQFQAIGAD---PDVEASSSTLMETEEGFQPVF-FQPRGLKNLVRIEQVES 401
                  +  LYQF+ +G D    ++  +SS         + V  F+ +GL NL  I+ +E+
Sbjct:   379 NVLNNDKLLYQFEKLGDDLTEGELVINSSDYESLNSVRESVTSFKLKGLDNLALIDVLET 438

Query:   402 LMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVK 461
             L PI D +I +       ++ TL      S ++ +  G+  + +  S LP  P+ ++T K
Sbjct:   439 LSPITDSKIID------SKLVTLSSH---SYVKSITHGVPTTTLVESPLPITPTDIFTTK 489

Query:   462 KNVNDEFDAYIVVS--FNNATLVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHP 519
              ++    D Y+V+S   ++ TLVLSIGE VE+V DS F+   P++AV  +G  S++Q++ 
Sbjct:   490 LSLESANDEYLVISSSLSSKTLVLSIGEVVEDVEDSEFVLDQPTIAVQQVGIASVVQIYS 549

Query:   520 SGIRHIRE-DG--RINEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTG-QLLEV 575
             +GI+H+R  +G  +  +W  P   TI    +N  QV+IALS   ++YFE+D T  QL+E 
Sbjct:   550 NGIKHVRTVNGNKKTTDWFPPAGITITHATTNNQQVLIALSNLSVVYFEIDATDDQLIEY 609

Query:   576 EKH-EMSGDVACLDIA-SVPEGRKRSRFLAVGSYDNTIRILSLDPDDCMQILSVQSVSSP 633
             +   E++  +  + I  ++ E   +S F  +G  D TI+++SL   +C++I S+Q++S+ 
Sbjct:   610 QDRLEIATTITAMAIQENISE---KSPFAIIGCSDETIQVVSLQEHNCLEIKSLQALSAN 666

Query:   634 PESLLFL-----EVQASVGGEDGADHPASL-FLNAGLQN 666
               SL  L     E    +G E+G      +  +N  L N
Sbjct:   667 SSSLKMLKSSGKETHVHIGMENGVYARIKIDTINGNLSN 705

 Score = 752 (269.8 bits), Expect = 1.1e-129, Sum P(2) = 1.1e-129
 Identities = 200/606 (33%), Positives = 329/606 (54%)

Query:   207 LTFYELDLGLNHVSRKW-----SEPVDNGANMLVTVPGGGDGPSGVLVCAENFVIYKNQG 261
             + +YELD GLNHV +K      S+P+ N  N L+ +PG      G++VC  N+  Y    
Sbjct:   237 INYYELDQGLNHVVKKKPNSSNSDPLPNDVNYLIPLPGH---IGGMVVCGTNWCFYDKLD 293

Query:   262 HPDVRAVIPRRADLPAERGVLIVSAATH-RQKTLFFFLLQTEYGDIFKVTLEHD--NEHV 318
              P +   +PRR     +   +IV+  TH  +K  FF LLQ   GD+FK+T+++D   E +
Sbjct:   294 GPRIYLPLPRRNGQTQDS--IIVNHVTHVLKKKKFFILLQNALGDLFKLTVDYDFDKEII 351

Query:   319 SELKIKYFDTIPVTASMCVLKSGYLFAASEFGNHALYQFQAIGAD---PDVEASSSTLME 375
               + I YFDTIP   S+ + K+G+LFA     +  LYQF+ +G D    ++  +SS    
Sbjct:   352 KNISITYFDTIPPALSLNIFKNGFLFANVLNNDKLLYQFEKLGDDLTEGELVINSSDYES 411

Query:   376 TEEGFQPVF-FQPRGLKNLVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLR 434
                  + V  F+ +GL NL  I+ +E+L PI D +I +       ++ TL      S ++
Sbjct:   412 LNSVRESVTSFKLKGLDNLALIDVLETLSPITDSKIID------SKLVTLSSH---SYVK 462

Query:   435 ILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDEFDAYIVVS--FNNATLVLSIGETVEEV 492
              +  G+  + +  S LP  P+ ++T K ++    D Y+V+S   ++ TLVLSIGE VE+V
Sbjct:   463 SITHGVPTTTLVESPLPITPTDIFTTKLSLESANDEYLVISSSLSSKTLVLSIGEVVEDV 522

Query:   493 SDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHIRE-DG--RINEWRTPGKRTIVKVGSNR 549
              DS F+   P++AV  +G  S++Q++ +GI+H+R  +G  +  +W  P   TI    +N 
Sbjct:   523 EDSEFVLDQPTIAVQQVGIASVVQIYSNGIKHVRTVNGNKKTTDWFPPAGITITHATTNN 582

Query:   550 LQVVIALSGGELIYFEVDMTG-QLLEVEKH-EMSGDVACLDIA-SVPEGRKRSRFLAVGS 606
              QV+IALS   ++YFE+D T  QL+E +   E++  +  + I  ++ E   +S F  +G 
Sbjct:   583 QQVLIALSNLSVVYFEIDATDDQLIEYQDRLEIATTITAMAIQENISE---KSPFAIIGC 639

Query:   607 YDNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQN 666
              D TI+++SL   +C++I S+Q++S+   SL  L+   S G E    H     ++ G++N
Sbjct:   640 SDETIQVVSLQEHNCLEIKSLQALSANSSSLKMLK---SSGKET---H-----VHIGMEN 688

Query:   667 GVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRA-AMLCLSSRPWLGYIHRGRFL 725
             GV  R  +D + G LS+SR +++G +P  L  +        +L +SS PW+ Y++R  F 
Sbjct:   689 GVYARIKIDTINGNLSNSRVKYIGSKPVSLSVIKFSNEIEGILAISSAPWISYLYRDSFK 748

Query:   726 LTPLSYETLEYAASF-SSDQCVEGVVSVAGNALRVFTIERLGETFNET------ALPLRY 778
             +TPL    +   +SF S D   EG+V +  N L +F++ +    F+ +         LRY
Sbjct:   749 ITPLLEIDITNGSSFISEDIGGEGIVGIKDNNLIIFSVGKEDSVFDPSQDLTIATTKLRY 808

Query:   779 TPRRFV 784
             TPR+ +
Sbjct:   809 TPRKMI 814

 Score = 598 (215.6 bits), Expect = 5.7e-142, Sum P(2) = 5.7e-142
 Identities = 129/372 (34%), Positives = 219/372 (58%)

Query:   846 EQYGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTL----L 901
             E +GY + + + W SCI+V+D +S      L+L  NE+  S+  V+F+     ++    L
Sbjct:   851 EAFGY-EWKQNSWASCIQVVDSKSNQVIQSLQLDGNESIVSMSAVSFNKTSTPSVPASHL 909

Query:   902 AVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIG 961
              VG        P  +    Y++ ++  +  K L+L+HKT+++ IP  L  FQ +LL   G
Sbjct:   910 VVGVCTNQTILPN-SYDKSYLYTFKIGK--KHLQLVHKTELDHIPQVLENFQDKLLVASG 966

Query:   962 PVLRLYDLGKKRLLRKCENKL-FPNTIVSINTYRDRIYVGDIQES-FHFCKYRRDENQLY 1019
               +RLYD+G+K+LL+K    + F   I  I    +RI + D  +S   F K+   +NQ  
Sbjct:   967 NHIRLYDIGQKQLLKKSTTIIDFSTNINKIIPQTNRIIICDSHKSSIVFAKFDESQNQFV 1026

Query:  1020 IFADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKL 1079
              FADD + R +T+  ++D DT+ G DKFGNI+  R+ +D+S + ++D T   +K + G L
Sbjct:  1027 PFADDVMKRQITSIMNLDIDTLIGGDKFGNIFVTRIDEDISKQADDDWT--ILKTQDGIL 1084

Query:  1080 NGAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFS 1139
             N  P K++ +++FH+GD++TS     L   G ESVIY  + G++G ++   S+ +V+   
Sbjct:  1085 NSCPYKLQNLIEFHIGDIITSFNLGCLNLAGTESVIYTGLQGTIGLLIPLVSKSEVELLF 1144

Query:  1140 HLEMHMRQEHPPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPG 1199
             +L+++M+Q    L G+DH+  RS Y P+K+VIDGDL E+F    + L+ +I+ +L+++  
Sbjct:  1145 NLQLYMQQSQNNLVGKDHLKLRSYYNPIKNVIDGDLLERFLEFDISLKIEISRKLNKSVN 1204

Query:  1200 EILKKLEEIRNK 1211
             +I KKL ++RN+
Sbjct:  1205 DIEKKLIDLRNR 1216


>UNIPROTKB|F1NZF7 [details] [associations]
            symbol:SF3B3 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR004871 Pfam:PF03178
            GO:GO:0005634 GO:GO:0003676 GeneTree:ENSGT00530000063396
            EMBL:AADN02051593 EMBL:AADN02051594 IPI:IPI00819465
            Ensembl:ENSGALT00000040057 ArrayExpress:F1NZF7 Uniprot:F1NZF7
        Length = 504

 Score = 1348 (479.6 bits), Expect = 1.1e-137, P = 1.1e-137
 Identities = 251/503 (49%), Positives = 347/503 (68%)

Query:   710 LSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCVEGVVSVAGNALRVFTIERLGETF 769
             +SSR WL Y ++ RF LTPLSYETLE+A+ F+S+QC EG+V+++ N LR+  +E+LG  F
Sbjct:     1 MSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAISTNTLRILALEKLGAVF 60

Query:   770 NETALPLRYTPRRFVLQPKKKLMVIIETDQGALTAEEREAAKKECFXXXXXXXXXXXXXX 829
             N+ A PL+YTPR+FV+ P+   ++IIETD  A T E  +A +K+                
Sbjct:    61 NQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYT-EATKAQRKQQMAEEMVEAAGEDERE 119

Query:   830 XXXXXXXENKYDPLSDEQYGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICT 889
                        + L +  +G PKA + +W S IRV++P   NT  L++L+ NEAAFS+  
Sbjct:   120 LAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLDLVQLEQNEAAFSVAV 179

Query:   890 VNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRF-VEEGKSLELLHKTQVEGIPLA 948
               F +      + VG AK L   P R++  G+++ Y+  V  G+ LE LHKT VE +P A
Sbjct:   180 CRFSNTGEEWYVLVGVAKDLILNP-RSVAGGFVYTYKLLVNGGEKLEFLHKTPVEEVPAA 238

Query:   949 LCQFQGRLLAGIGPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHF 1008
             +  FQGR+L G+G +LR+YDLGKK+LLRKCENK   N I  I T   R+ V D+QESF +
Sbjct:   239 IAPFQGRVLIGVGKLLRVYDLGKKKLLRKCENKHIANYICGIQTIGHRVIVSDVQESFIW 298

Query:  1009 CKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLPQDVSDEIEEDPT 1068
              +Y+R+ENQL IFADD+ PRW+T A  +D+DT+AGADKFGNI  VRLP + +DE++EDPT
Sbjct:   299 VRYKRNENQLIIFADDTYPRWVTTATLLDYDTVAGADKFGNICVVRLPPNTNDEVDEDPT 358

Query:  1069 GGKIKWEQGKLNGAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGSLGAMLA 1128
             G K  W++G LNGA  K E I+ +HVG+ V SLQK +L+PGG ES++Y T+ G +G ++ 
Sbjct:   359 GNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLIPGGSESLVYTTLSGGIGILVP 418

Query:  1129 FSSRDDVDFFSHLEMHMRQEHPPLCGRDHMAYRSAYFPVKDVIDGDLCEQFPTLSLDLQR 1188
             F+S +D DFF H+EMH+R EHPPLCGRDH+++RS YFPVK+VIDGDLCEQF ++  + Q+
Sbjct:   419 FTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFRSYYFPVKNVIDGDLCEQFNSMEPNKQK 478

Query:  1189 KIADELDRTPGEILKKLEEIRNK 1211
              +A+ELDRTP E+ KKLE+IR +
Sbjct:   479 NVAEELDRTPPEVSKKLEDIRTR 501


>UNIPROTKB|F1S419 [details] [associations]
            symbol:LOC100512659 "Uncharacterized protein" species:9823
            "Sus scrofa" [GO:0005634 "nucleus" evidence=IEA] [GO:0003676
            "nucleic acid binding" evidence=IEA] InterPro:IPR004871
            Pfam:PF03178 GO:GO:0005634 GO:GO:0003676
            GeneTree:ENSGT00530000063396 EMBL:CU915803 EMBL:AEMK01191757
            Ensembl:ENSSSCT00000003019 OMA:SHEVIYS Uniprot:F1S419
        Length = 319

 Score = 783 (280.7 bits), Expect = 3.1e-77, P = 3.1e-77
 Identities = 146/291 (50%), Positives = 202/291 (69%)

Query:   843 LSDEQYGYPKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLA 902
             L +  +G PKA + +W S IRV++P   NT  L++L+ NEAAFS+    F +      + 
Sbjct:    25 LPESIFGAPKAGNGQWASVIRVMNPIQGNTLDLVQLEQNEAAFSVAVCRFSNTGDDWYVL 84

Query:   903 VGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGP 962
             VG AK L   P R++  G+++ Y+ V  G+ LE LHKT VE +P A+  FQGR+L G+G 
Sbjct:    85 VGVAKDLILNP-RSVAGGFVYTYKLVNNGEKLEFLHKTPVEEVPAAIAPFQGRVLIGVGK 143

Query:   963 VLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDENQLYIFA 1022
             +LR+YDLGKK+LLRKCENK   N I  I T   R+ V D+QESF + +Y+R+ENQL IFA
Sbjct:   144 LLRVYDLGKKKLLRKCENKHIANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFA 203

Query:  1023 DDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGA 1082
             DD+ PRW+T A  +D+DT+AGADKFGNI  VRLP + +DE++EDPTG K  W++G LNGA
Sbjct:   204 DDTYPRWVTTASLLDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGA 263

Query:  1083 PNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRD 1133
               K E I+ +HVG+ V SLQK +L+PGG ES++Y T+ G +G ++ F+S +
Sbjct:   264 SQKAEVIMNYHVGETVLSLQKTTLIPGGSESLVYTTLSGGIGILVPFTSHE 314


>SGD|S000004513 [details] [associations]
            symbol:RSE1 "Protein involved in pre-mRNA splicing"
            species:4932 "Saccharomyces cerevisiae" [GO:0005686 "U2 snRNP"
            evidence=IDA;IPI] [GO:0000245 "spliceosomal complex assembly"
            evidence=IDA] [GO:0000398 "mRNA splicing, via spliceosome"
            evidence=IGI;IMP;IPI] [GO:0007049 "cell cycle" evidence=IEA]
            [GO:0008380 "RNA splicing" evidence=IEA] [GO:0005681 "spliceosomal
            complex" evidence=IEA] [GO:0006397 "mRNA processing" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0030620 "U2 snRNA binding" evidence=IPI]
            [GO:0071004 "U2-type prespliceosome" evidence=IDA]
            InterPro:IPR004871 Pfam:PF03178 SGD:S000004513 EMBL:BK006946
            GO:GO:0007049 EMBL:Z47816 GO:GO:0000245 GO:GO:0005686 GO:GO:0071004
            GeneTree:ENSGT00530000063396 GO:GO:0030620 KO:K12830
            OrthoDB:EOG4FR40R PIR:S50943 RefSeq:NP_013663.1
            ProteinModelPortal:Q04693 DIP:DIP-856N IntAct:Q04693
            MINT:MINT-368995 STRING:Q04693 PaxDb:Q04693 PeptideAtlas:Q04693
            EnsemblFungi:YML049C GeneID:854956 KEGG:sce:YML049C CYGD:YML049c
            eggNOG:KOG1898 HOGENOM:HOG000066036 OMA:DIHESVT NextBio:978033
            Genevestigator:Q04693 GermOnline:YML049C Uniprot:Q04693
        Length = 1361

 Score = 525 (189.9 bits), Expect = 5.5e-73, Sum P(4) = 5.5e-73
 Identities = 174/655 (26%), Positives = 325/655 (49%)

Query:    76 YIVVGSDSGRIVILEYNPSKNVF--DKIHQETFGKSGCRRIVPGQYLAVDPKGRAVMIGA 133
             ++ + SDSG + I++            +  +   ++  RR+ P  Y+ +DP GR +++ +
Sbjct:   147 FLALTSDSGNLSIVQIIMHAGALRLKTLVNQPLTRTTLRRVSPISYMEIDPNGRCIILSS 206

Query:   134 CEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSICGIDCGFDNPIFAAIELDYSEADQ 193
              E+ KL +++  D A +L ISSPLE  + H +   +  +D  F+NP F  +E+D + A Q
Sbjct:   207 VEQNKLCFLV--DYAQKLRISSPLEIIRPHMVTLDMAVVDVNFNNPCFVTLEID-NAATQ 263

Query:   194 DSTGQA--ASEAQKNLTFYELDLGLNHVSR-KWSEPVDNGANMLVTVPGGG-DGPSG--- 246
              S        E   N    + D  +N  +    S P  +  N+  ++     D       
Sbjct:   264 LSVHLIFYVLELGLNHIVKKADYLVNPSANFVLSLPDLSRYNITTSLSDNNYDADYDTLF 323

Query:   247 ---VLVCAENFVIYKNQ-GHPDVRAVIPRRADLPAE-RGVLIVSAATHRQKTLFFFLLQT 301
                V++  EN ++ K+  G   ++  IP+R+   +  + V I+S    + K  FF LLQ+
Sbjct:   324 NPFVVIGFENHILVKDMNGFFSLKVEIPKRSITNSRHKNVTIISGIVQKLKNDFFVLLQS 383

Query:   302 EYGDIFKVTLEHD-NEHVSEL-KIKYFDTIPVTASMCVLKSGYLFAASEFGNHALYQFQA 359
              +GD+FK+T+  D N+    L ++ YFDTI  +  + + K+GYLFA SE  N+ L+QF+ 
Sbjct:   384 NHGDLFKLTVSPDTNDRNRPLVQLSYFDTIQNSHQLHIFKNGYLFALSEMNNNFLFQFEK 443

Query:   360 IGADPDVEASSSTLMETEEGFQPVFFQPR-GLKNLVRIEQVESLMPIMDMRIANLFEEEA 418
             +G + +     S ++ +++  + + F+P   L+NL  + Q  +L P +  +I +    ++
Sbjct:   444 LGVEKN---DFSNVLTSKDPNKSLVFEPSIKLQNLSILSQQLNLNPSIKSQIVS----DS 496

Query:   419 PQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVND-EFDAYIVVSFN 477
             P +         + +  L   +  S +  + LP   + +W +       + +  + ++F 
Sbjct:   497 P-LSIATKHFTNNKIITLTNAVNYSNLISTSLPPNATKLWLIPDPATTGDNNTLLFITFP 555

Query:   478 NATLVLSI-GETVEEVSD-----SGF-LDTTPSLAVSLIGDDSLMQVHPSGIRHIREDGR 530
               T++L I  E++EE++      S F L    ++   L+G  S++QV  + +RHI   G+
Sbjct:   556 KKTMILQIDNESMEELTPDEATRSAFKLSQDTTIHTCLMGSHSIIQVCTAELRHIVPTGK 615

Query:   531 I---NE--WRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQ-LLEVEKH-EMSGD 583
                 N+  W  P    IV   S++ Q++I+LS  EL+YF++D++   L+E+  H E+  D
Sbjct:   616 SRYSNKLTWVPPAGIRIVCATSSKTQLIISLSNYELVYFKIDVSSDSLIELTTHPEL--D 673

Query:   584 VACLDIASVPEGRKRSRFLAVGSYDNTIRILSL--DPDDCMQILSVQSVSSPPESLLFLE 641
                  +A V +  + +  LA+   +  I+I+SL    +D + ++S+Q VS     ++ + 
Sbjct:   674 TMPSKVAIVQD-TQHADLLAIADNEGMIKIMSLKDQKEDFLTVISLQLVSEKISDMIMVR 732

Query:   642 VQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKL 696
               +S+G          L L+ GL+NGV  +  +  V G  +D + RFLGL+P  L
Sbjct:   733 -DSSIG---------QLNLHVGLENGVYMKFHIGDVDGSFTDIKRRFLGLKPVSL 777

 Score = 254 (94.5 bits), Expect = 4.3e-43, Sum P(4) = 4.3e-43
 Identities = 72/255 (28%), Positives = 131/255 (51%)

Query:     1 MYLYSLTLQQPTGIIAAINGNF----SGTKTPE--IVVARGKVLELLRPENSGRIETLVS 54
             +YLY LTL++ T  + +  G+F    +G+K  +  + VA    LEL    + G ++ +  
Sbjct:    58 LYLYHLTLKKQTNFVHSCIGHFVDLEAGSKREQSQLCVATETHLELYDTAD-GELKLIAK 116

Query:    55 TE-IFGAIRSLAQFRL--TGSQKD------YIVVGSDSGRIVILEYNPSKNVF--DKIHQ 103
              + +F  I S+    L  +GS+        ++ + SDSG + I++            +  
Sbjct:   117 FQNLFATITSMKSLDLPHSGSRAKASNWPTFLALTSDSGNLSIVQIIMHAGALRLKTLVN 176

Query:   104 ETFGKSGCRRIVPGQYLAVDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSH 163
             +   ++  RR+ P  Y+ +DP GR +++ + E+ KL +++  D A +L ISSPLE  + H
Sbjct:   177 QPLTRTTLRRVSPISYMEIDPNGRCIILSSVEQNKLCFLV--DYAQKLRISSPLEIIRPH 234

Query:   164 TIVYSICGIDCGFDNPIFAAIELDYSEADQDSTGQAASEAQKNLTFYELDLGLNHVSRKW 223
              +   +  +D  F+NP F  +E+D           AA++   +L FY L+LGLNH+ +K 
Sbjct:   235 MVTLDMAVVDVNFNNPCFVTLEID----------NAATQLSVHLIFYVLELGLNHIVKKA 284

Query:   224 SEPVDNGANMLVTVP 238
                V+  AN ++++P
Sbjct:   285 DYLVNPSANFVLSLP 299

 Score = 213 (80.0 bits), Expect = 5.5e-73, Sum P(4) = 5.5e-73
 Identities = 68/231 (29%), Positives = 113/231 (48%)

Query:   934 LELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDLGKKRLLRKC--ENKLFPNTIVSIN 991
             +ELLH+T++     A+ +F+  LL  +G  + LY LGKK+LLR+   +  +    IVS++
Sbjct:  1019 IELLHQTEIISPIHAMLKFKNFLLTAMGSTIVLYGLGKKQLLRRSVTQTPVSITKIVSMH 1078

Query:   992 TYR-DRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMAGADKFGNI 1050
              +  +R+ VGDI ES     +    N    + DDSV R +T    +D  T+ GAD++GN 
Sbjct:  1079 QWNYERLAVGDIHESVTLFIWDPAGNVFIPYVDDSVKRHVTVLKFLDEATVIGADRYGNA 1138

Query:  1051 YFVRLPQDVSDEIEE-DPT---GGKIKWE------QGKLNGAPN---KMEEIVQFHVGDV 1097
             + +R P +    +   DP+    G IK+       Q KL    +   K + +  F V D+
Sbjct:  1139 WTLRSPPECEKIMSNHDPSELSNGAIKYPLDVITLQQKLPNTYDCKFKFQLLNHFFVNDI 1198

Query:  1098 VTSLQKA-SLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQ 1147
             +T      SL        IY  + G++G  +   S+ +V    ++E  M +
Sbjct:  1199 ITDFHILDSLSNSDRPGCIYMGLQGTVGCFIPLLSKGNVFMMGNIENIMAE 1249

 Score = 134 (52.2 bits), Expect = 5.5e-73, Sum P(4) = 5.5e-73
 Identities = 29/59 (49%), Positives = 37/59 (62%)

Query:  1152 LCGRDHMAYRSAYFPVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPGE-ILKKLEEIR 1209
             + GRDH  YRS Y PV+ VIDGDLCE F  LSL+ Q  +A  L     E I++ + E+R
Sbjct:  1299 ILGRDHQEYRSYYAPVRKVIDGDLCENFLRLSLNEQEFLAKNLKSVQVEDIIQTINEVR 1357

 Score = 55 (24.4 bits), Expect = 5.5e-73, Sum P(4) = 5.5e-73
 Identities = 11/52 (21%), Positives = 27/52 (51%)

Query:   705 AAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASF-SSDQCVEGVVSVAGN 755
             + ++C SS  W+ Y  +  + +  L  + +   + F ++D  + GV S++ +
Sbjct:   817 SCVVCHSSSTWVSYTWKNVWTIRQLKDQNMLSCSKFVNADVAINGVCSISSS 868

 Score = 40 (19.1 bits), Expect = 6.0e-54, Sum P(3) = 6.0e-54
 Identities = 13/42 (30%), Positives = 21/42 (50%)

Query:  1057 QDVSDEIE-EDPTGGKIKWEQGKLNGAPNKMEEIVQFHVGDV 1097
             + +SD I   D + G++    G  NG        ++FH+GDV
Sbjct:   723 EKISDMIMVRDSSIGQLNLHVGLENGV------YMKFHIGDV 758

 Score = 37 (18.1 bits), Expect = 8.5e-20, Sum P(3) = 8.5e-20
 Identities = 8/20 (40%), Positives = 12/20 (60%)

Query:    49 IETLVSTEIFGAIRSLAQFR 68
             IE L  TEI   I ++ +F+
Sbjct:  1019 IELLHQTEIISPIHAMLKFK 1038


>TAIR|locus:2127368 [details] [associations]
            symbol:DDB1B "damaged DNA binding protein 1B"
            species:3702 "Arabidopsis thaliana" [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0005634 "nucleus" evidence=ISM;IEA;IDA]
            [GO:0003684 "damaged DNA binding" evidence=ISS] [GO:0009793 "embryo
            development ending in seed dormancy" evidence=IMP] [GO:0005515
            "protein binding" evidence=IPI] [GO:0005829 "cytosol" evidence=RCA]
            [GO:0006281 "DNA repair" evidence=RCA] [GO:0007062 "sister
            chromatid cohesion" evidence=RCA] [GO:0009880 "embryonic pattern
            specification" evidence=RCA] [GO:0010072 "primary shoot apical
            meristem specification" evidence=RCA] [GO:0010100 "negative
            regulation of photomorphogenesis" evidence=RCA] [GO:0010162 "seed
            dormancy process" evidence=RCA] [GO:0010431 "seed maturation"
            evidence=RCA] [GO:0010564 "regulation of cell cycle process"
            evidence=RCA] [GO:0045595 "regulation of cell differentiation"
            evidence=RCA] [GO:0048366 "leaf development" evidence=RCA]
            [GO:0048608 "reproductive structure development" evidence=RCA]
            [GO:0048825 "cotyledon development" evidence=RCA] [GO:0051301 "cell
            division" evidence=RCA] InterPro:IPR017986 InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 UniPathway:UPA00143 GO:GO:0005634
            EMBL:CP002687 GenomeReviews:CT486007_GR Gene3D:2.130.10.10
            SUPFAM:SSF50978 EMBL:AL161554 GO:GO:0003677 GO:GO:0006281
            GO:GO:0009793 GO:GO:0016567 GO:GO:0009585 EMBL:AL021960
            UniGene:At.32663 eggNOG:NOG247734 HOGENOM:HOG000007241 KO:K10610
            ProtClustDB:CLSN2685347 EMBL:AK220648 EMBL:AK229805 IPI:IPI00536598
            PIR:T04941 RefSeq:NP_193842.1 ProteinModelPortal:O49552 SMR:O49552
            DIP:DIP-46981N IntAct:O49552 STRING:O49552 PaxDb:O49552
            PRIDE:O49552 EnsemblPlants:AT4G21100.1 GeneID:827857
            KEGG:ath:AT4G21100 GeneFarm:4661 TAIR:At4g21100 InParanoid:O49552
            OMA:DRPAVIY PhylomeDB:O49552 Genevestigator:O49552
            GermOnline:AT4G21100 Uniprot:O49552
        Length = 1088

 Score = 332 (121.9 bits), Expect = 2.0e-63, Sum P(4) = 2.0e-63
 Identities = 113/432 (26%), Positives = 203/432 (46%)

Query:   386 QPRGLKNLVRI-EQVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSE 444
             QP    + V I E+  +L PI+D  + +L  +   Q+ T  G     SLRI+R G+ ++E
Sbjct:   331 QPDAKGSYVEILEKYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE 390

Query:   445 MAVSQLPGVPSAVWTVKKNVNDEFDAYIVVSFNNAT--LVLSIGETVEEVSDSGFLDTTP 502
              A  +L G+   +W++K ++++ FD ++VVSF + T  L ++I + +EE    GFL    
Sbjct:   391 QASVELQGI-KGMWSLKSSIDEAFDTFLVVSFISETRILAMNIEDELEETEIEGFLSEVQ 449

Query:   503 SLAVSLIGDDSLMQVHPSGIRHIREDGRI--NEWRTPGKRTIVKVGSNRLQVVIALSGGE 560
             +L       + L+QV  + +R +    R   N+W  P   ++    +N  QV++A  GG 
Sbjct:   450 TLFCHDAVYNQLVQVTSNSVRLVSSTTRELRNKWDAPAGFSVNVATANASQVLLATGGGH 509

Query:   561 LIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSY-DNTIRILSLDPD 619
             L+Y E+   G L EV+   +  +V+CLDI  + +    S+  AVG + D ++RI  L PD
Sbjct:   510 LVYLEIG-DGTLTEVKHVLLEYEVSCLDINPIGDNPNYSQLAAVGMWTDISVRIFVL-PD 567

Query:   620 DCMQILSVQSVSSP--PESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMV 677
               + +++ + +     P S+L    +       G       +L   L +G L    +D  
Sbjct:   568 --LTLITKEELGGEIIPRSVLLCAFE-------GIS-----YLLCALGDGHLLNFQLDTS 613

Query:   678 TGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYA 737
              G+L D +   LG RP  L +        +   S RP + Y +  + L + ++ + + + 
Sbjct:   614 CGKLRDRKKVSLGTRPITLRTFSSKSATHVFAASDRPAVIYSNNKKLLYSNVNLKEVSHM 673

Query:   738 ASFSSDQCVEGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQPKKKLMVIIET 797
               F+S    + +       L + TI+ + +    T +P+    RR   Q + +   I   
Sbjct:   674 CPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRT-IPIGEHARRICHQEQTRTFAI-SC 731

Query:   798 DQGALTAEEREA 809
              +   +AEE E+
Sbjct:   732 LRNEPSAEESES 743

 Score = 250 (93.1 bits), Expect = 2.0e-63, Sum P(4) = 2.0e-63
 Identities = 102/369 (27%), Positives = 170/369 (46%)

Query:   851 PKAESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQ 910
             P AE  +    +R+LD +S        L   E   SI + +F D ++     VGTA  L 
Sbjct:   736 PSAEESE-SHFVRLLDAQSFEFLSSYPLDAFECGCSILSCSFTDDKN-VYYCVGTAYVL- 792

Query:   911 FWPKRNI-VAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLY-- 967
               P+ N    G I ++  VEEG+ L+L+ + + +G   +L  F G+LLA I   ++LY  
Sbjct:   793 --PEENEPTKGRILVF-IVEEGR-LQLITEKETKGAVYSLNAFNGKLLASINQKIQLYKW 848

Query:   968 ---DLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDENQLYIFADD 1024
                D G + L  +C +      +  + T  D I VGD+ +S     Y+ +E  +   A D
Sbjct:   849 MLRDDGTRELQSECGHHGHILALY-VQTRGDFIAVGDLMKSISLLIYKHEEGAIEERARD 907

Query:  1025 SVPRWLTAAHHIDFDTMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPN 1084
                 W+TA   ++ D   G D   NI+ V+   + + + EE     + + E   + G  +
Sbjct:   908 YNANWMTAVEILNDDIYLGTDNCFNIFTVKKNNEGATD-EE-----RARME---VVGEYH 958

Query:  1085 KMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMH 1144
               E + +F  G +V  L  + +  G   +VI+GTV G +G ++A   ++   F   L+  
Sbjct:   959 IGEFVNRFRHGSLVMKLPDSDI--GQIPTVIFGTVSGMIG-VIASLPQEQYAFLEKLQTS 1015

Query:  1145 MRQEHPPLCGRDHMAYRS-----AYFPVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPG 1199
             +R+    + G  H  +RS          K  +DGDL E F  LS     +I+  +D    
Sbjct:  1016 LRKVIKGVGGLSHEQWRSFNNEKRTAEAKGYLDGDLIESFLDLSRGKMEEISKGMDVQVE 1075

Query:  1200 EILKKLEEI 1208
             E+ K++EE+
Sbjct:  1076 ELCKRVEEL 1084

 Score = 133 (51.9 bits), Expect = 2.0e-63, Sum P(4) = 2.0e-63
 Identities = 55/238 (23%), Positives = 107/238 (44%)

Query:     4 YSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLE--LLRPENSGRIETLVSTEIFGAI 61
             Y++T Q+PT +  +  GNF+  +   ++VA+   +E  LL P+    ++T++   ++G I
Sbjct:     6 YAVTAQKPTCVTHSCVGNFTSPQELNLIVAKSTRIEIHLLSPQG---LQTILDVPLYGRI 62

Query:    62 RSLAQFRLTGSQKDYIVVGSDSGRIVILEYN-PSKNVFDKIHQETFGKSGCRRIVPGQYL 120
              ++  FR  G  +D++ V ++  +  +L+++  S  +  +   +   + G R    GQ  
Sbjct:    63 ATMELFRPHGEAQDFLFVATERYKFCVLQWDYESSELITRAMGDVSDRIG-RPTDNGQIG 121

Query:   121 AVDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSICGIDCGFDNPI 180
              +DP  R  +IG      L  V+  D   +L  +  +   +   +         G   P 
Sbjct:   122 IIDPDCR--VIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGCTKPT 176

Query:   181 FAAIELDYSEADQDSTGQAASEAQKNLTFYELDLGLNHVSRKWSEPVDNGANMLVTVP 238
              A +  D  +A    T + + +  KN  F E     N++        DNGA++L+ VP
Sbjct:   177 IAVLYQDNKDARHVKTYEVSLK-DKN--FVEGPWSQNNL--------DNGADLLIPVP 223

 Score = 127 (49.8 bits), Expect = 2.0e-63, Sum P(4) = 2.0e-63
 Identities = 45/182 (24%), Positives = 82/182 (45%)

Query:   205 KNLTFYELDL-GLNHVSRKWSEP-VDNGANMLVTVPGGGDGP-SGVLVCAENFVIYKNQG 261
             +++  YE+ L   N V   WS+  +DNGA++L+ VP     P  GVL+  E  ++Y +  
Sbjct:   188 RHVKTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPS----PLCGVLIIGEETIVYCSA- 242

Query:   262 HPDVRAVIPRRADLPAERGVLIVSAATHRQKTLFFFLLQTEYGDIFKVTLEHDNEHVSEL 321
               +    IP R  +    G + +  +         +LL    G I  + + H+ E V+ L
Sbjct:   243 --NAFKAIPIRPSITKAYGRVDLDGSR--------YLLGDHAGLIHLLVITHEKEKVTGL 292

Query:   322 KIKYFDTIPVTASMCVLKSGYLFAASEFGNHALYQFQAIGADPDVEASSSTLMETEEGFQ 381
             KI+      + +S+  L +  +F  S +G+  L +   +   PD + S   ++E      
Sbjct:   293 KIELLGETSIASSISYLDNAVVFVGSSYGDSQLIK---LNLQPDAKGSYVEILEKYVNLG 349

Query:   382 PV 383
             P+
Sbjct:   350 PI 351

 Score = 37 (18.1 bits), Expect = 5.5e-11, Sum P(3) = 5.5e-11
 Identities = 10/35 (28%), Positives = 18/35 (51%)

Query:  1040 TMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKW 1074
             T +GA K G++  VR    ++++   +  G K  W
Sbjct:   369 TCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW 403


>TAIR|locus:2115909 [details] [associations]
            symbol:DDB1A "damaged DNA binding protein 1A"
            species:3702 "Arabidopsis thaliana" [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0003677 "DNA binding" evidence=ISS]
            [GO:0005634 "nucleus" evidence=ISM;IEA;IDA;IPI] [GO:0010100
            "negative regulation of photomorphogenesis" evidence=IGI;RCA]
            [GO:0045892 "negative regulation of transcription, DNA-dependent"
            evidence=IPI] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0080008 "Cul4-RING ubiquitin ligase complex" evidence=IPI]
            [GO:0005829 "cytosol" evidence=IDA] [GO:0000278 "mitotic cell
            cycle" evidence=RCA] [GO:0000911 "cytokinesis by cell plate
            formation" evidence=RCA] [GO:0003002 "regionalization"
            evidence=RCA] [GO:0006281 "DNA repair" evidence=RCA] [GO:0006486
            "protein glycosylation" evidence=RCA] [GO:0007155 "cell adhesion"
            evidence=RCA] [GO:0008284 "positive regulation of cell
            proliferation" evidence=RCA] [GO:0009630 "gravitropism"
            evidence=RCA] [GO:0009639 "response to red or far red light"
            evidence=RCA] [GO:0010090 "trichome morphogenesis" evidence=RCA]
            [GO:0033043 "regulation of organelle organization" evidence=RCA]
            [GO:0045010 "actin nucleation" evidence=RCA] [GO:0048449 "floral
            organ formation" evidence=RCA] [GO:0048608 "reproductive structure
            development" evidence=RCA] InterPro:IPR017986 InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 UniPathway:UPA00143 GO:GO:0005829
            GO:GO:0005634 GO:GO:0045892 EMBL:CP002687 GenomeReviews:CT486007_GR
            Gene3D:2.130.10.10 SUPFAM:SSF50978 GO:GO:0003677 GO:GO:0006281
            GO:GO:0016567 GO:GO:0009585 EMBL:AL161503 GO:GO:0080008
            GO:GO:0010100 EMBL:AY074257 EMBL:BT001905 EMBL:AK230366
            IPI:IPI00548104 PIR:B85068 RefSeq:NP_192451.1 UniGene:At.32663
            UniGene:At.47587 ProteinModelPortal:Q9M0V3 DIP:DIP-40455N
            IntAct:Q9M0V3 STRING:Q9M0V3 PaxDb:Q9M0V3 PRIDE:Q9M0V3 ProMEX:Q9M0V3
            EnsemblPlants:AT4G05420.1 GeneID:825890 KEGG:ath:AT4G05420
            GeneFarm:4660 TAIR:At4g05420 eggNOG:NOG247734 HOGENOM:HOG000007241
            InParanoid:Q9M0V3 KO:K10610 OMA:CALGDGS PhylomeDB:Q9M0V3
            ProtClustDB:CLSN2685347 Genevestigator:Q9M0V3 GermOnline:AT4G05420
            Uniprot:Q9M0V3
        Length = 1088

 Score = 342 (125.4 bits), Expect = 3.4e-61, Sum P(4) = 3.4e-61
 Identities = 109/420 (25%), Positives = 199/420 (47%)

Query:   396 IEQVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPS 455
             +E+  +L PI+D  + +L  +   Q+ T  G     SLR++R G+ ++E A  +L G+  
Sbjct:   342 LERYINLGPIVDFCVVDLERQGQGQVVTCSGAFKDGSLRVVRNGIGINEQASVELQGI-K 400

Query:   456 AVWTVKKNVNDEFDAYIVVSFNNATLVLSIG--ETVEEVSDSGFLDTTPSLAVSLIGDDS 513
              +W++K ++++ FD ++VVSF + T +L++   + +EE    GFL    +L       + 
Sbjct:   401 GMWSLKSSIDEAFDTFLVVSFISETRILAMNLEDELEETEIEGFLSQVQTLFCHDAVYNQ 460

Query:   514 LMQVHPSGIRHIREDGRI--NEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQ 571
             L+QV  + +R +    R   +EW  P   T+    +N  QV++A  GG L+Y E+   G+
Sbjct:   461 LVQVTSNSVRLVSSTTRELRDEWHAPAGFTVNVATANASQVLLATGGGHLVYLEIG-DGK 519

Query:   572 LLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSY-DNTIRILSLDPDDCMQILSVQSV 630
             L EV+   +  +V+CLDI  + +    S+  AVG + D ++RI SL P+  + +++ + +
Sbjct:   520 LTEVQHALLEYEVSCLDINPIGDNPNYSQLAAVGMWTDISVRIFSL-PE--LTLITKEQL 576

Query:   631 SSP--PESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRSRF 688
                  P S+L    +       G       +L   L +G L    +D  TGQL D +   
Sbjct:   577 GGEIIPRSVLLCAFE-------GIS-----YLLCALGDGHLLNFQMDTTTGQLKDRKKVS 624

Query:   689 LGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCVEG 748
             LG +P  L +        +   S RP + Y    + L + ++ + + +   F+S    + 
Sbjct:   625 LGTQPITLRTFSSKSATHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS 684

Query:   749 VVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQPKKKLMVIIETDQGALTAEERE 808
             +       L + TI+ + +    T +PL    RR   Q + +   I      +  +EE E
Sbjct:   685 LAIAREGELTIGTIDDIQKLHIRT-IPLGEHARRICHQEQTRTFGICSLGNQS-NSEESE 742

 Score = 233 (87.1 bits), Expect = 3.4e-61, Sum P(4) = 3.4e-61
 Identities = 95/361 (26%), Positives = 167/361 (46%)

Query:   862 IRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNI-VAG 920
             +R+LD ++        L   E   SI + +F + ++     VGTA  L   P+ N    G
Sbjct:   746 VRLLDDQTFEFMSTYPLDSFEYGCSILSCSFTEDKN-VYYCVGTAYVL---PEENEPTKG 801

Query:   921 YIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLY-----DLGKKRLL 975
              I ++  VE+G+ L+L+ + + +G   +L  F G+LLA I   ++LY     D G + L 
Sbjct:   802 RILVF-IVEDGR-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQ 859

Query:   976 RKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHH 1035
              +C +      +  + T  D I VGD+ +S     Y+ +E  +   A D    W++A   
Sbjct:   860 SECGHHGHILALY-VQTRGDFIVVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAVEI 918

Query:  1036 IDFDTMAGADKFGNIYFVRLPQD-VSDEIEEDPTGGKIKWEQGKLN--GAPNKMEEIVQF 1092
             +D D   GA+   N+  V+   +  +DE            E+G+L   G  +  E + +F
Sbjct:   919 LDDDIYLGAENNFNLLTVKKNSEGATDE------------ERGRLEVVGEYHLGEFVNRF 966

Query:  1093 HVGDVVTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPPL 1152
               G +V  L  + +  G   +VI+GTV G +G ++A   ++   F   L+  +R+    +
Sbjct:   967 RHGSLVMRLPDSEI--GQIPTVIFGTVNGVIG-VIASLPQEQYTFLEKLQSSLRKVIKGV 1023

Query:  1153 CGRDHMAYRS-----AYFPVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPGEILKKLEE 1207
              G  H  +RS          ++ +DGDL E F  LS +    I+  ++    E+ K++EE
Sbjct:  1024 GGLSHEQWRSFNNEKRTAEARNFLDGDLIESFLDLSRNKMEDISKSMNVQVEELCKRVEE 1083

Query:  1208 I 1208
             +
Sbjct:  1084 L 1084

 Score = 127 (49.8 bits), Expect = 3.4e-61, Sum P(4) = 3.4e-61
 Identities = 54/238 (22%), Positives = 103/238 (43%)

Query:     4 YSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLE--LLRPENSGRIETLVSTEIFGAI 61
             Y +T  +PT +  +  GNF+  +   ++VA+   +E  LL P+    ++ ++   I+G I
Sbjct:     6 YVVTAHKPTSVTHSCVGNFTSPQELNLIVAKCTRIEIHLLTPQG---LQPMLDVPIYGRI 62

Query:    62 RSLAQFRLTGSQKDYIVVGSDSGRIVILEYNP-SKNVFDKIHQETFGKSGCRRIVPGQYL 120
              +L  FR  G  +D++ + ++  +  +L+++P S  +  +   +   + G R    GQ  
Sbjct:    63 ATLELFRPHGEAQDFLFIATERYKFCVLQWDPESSELITRAMGDVSDRIG-RPTDNGQIG 121

Query:   121 AVDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSICGIDCGFDNPI 180
              +DP  R  +IG      L  V+  D   +L  +  +   +   +        C    P 
Sbjct:   122 IIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL-FGCA--KPT 176

Query:   181 FAAIELDYSEADQDSTGQAASEAQKNLTFYELDLGLNHVSRKWSEPVDNGANMLVTVP 238
              A +  D  +A    T + +    K+  F E     N +        DNGA++L+ VP
Sbjct:   177 IAVLYQDNKDARHVKTYEVSL---KDKDFVEGPWSQNSL--------DNGADLLIPVP 223

 Score = 118 (46.6 bits), Expect = 3.4e-61, Sum P(4) = 3.4e-61
 Identities = 45/182 (24%), Positives = 83/182 (45%)

Query:   205 KNLTFYELDL-GLNHVSRKWSE-PVDNGANMLVTVPGGGDGP-SGVLVCAENFVIYKNQG 261
             +++  YE+ L   + V   WS+  +DNGA++L+ VP     P  GVL+  E  ++Y +  
Sbjct:   188 RHVKTYEVSLKDKDFVEGPWSQNSLDNGADLLIPVPP----PLCGVLIIGEETIVYCSAS 243

Query:   262 HPDVRAVIPRRADLPAERGVLIVSAATHRQKTLFFFLLQTEYGDIFKVTLEHDNEHVSEL 321
                 +A IP R  +    G + V  +         +LL    G I  + + H+ E V+ L
Sbjct:   244 A--FKA-IPIRPSITKAYGRVDVDGSR--------YLLGDHAGMIHLLVITHEKEKVTGL 292

Query:   322 KIKYFDTIPVTASMCVLKSGYLFAASEFGNHALYQFQAIGADPDVEASSSTLMETEEGFQ 381
             KI+      + +++  L +  +F  S +G+  L +   +   PD + S   ++E      
Sbjct:   293 KIELLGETSIASTISYLDNAVVFVGSSYGDSQLVK---LNLHPDAKGSYVEVLERYINLG 349

Query:   382 PV 383
             P+
Sbjct:   350 PI 351


>FB|FBgn0260962 [details] [associations]
            symbol:pic "piccolo" species:7227 "Drosophila melanogaster"
            [GO:0003684 "damaged DNA binding" evidence=ISS] [GO:0006289
            "nucleotide-excision repair" evidence=ISS;NAS] [GO:0005634
            "nucleus" evidence=IEA] [GO:0006974 "response to DNA damage
            stimulus" evidence=IMP] [GO:0035220 "wing disc development"
            evidence=IMP] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0042787 "protein ubiquitination involved in ubiquitin-dependent
            protein catabolic process" evidence=ISS] [GO:0007307 "eggshell
            chorion gene amplification" evidence=IDA] [GO:0007095 "mitotic G2
            DNA damage checkpoint" evidence=IGI] InterPro:IPR004871
            Pfam:PF03178 UniPathway:UPA00143 EMBL:AE014297 GO:GO:0005634
            GO:GO:0005737 GO:GO:0007095 GO:GO:0043161 GO:GO:0003677
            GO:GO:0006281 GO:GO:0035220 GO:GO:0042787 GO:GO:0007307
            eggNOG:NOG247734 KO:K10610 OMA:CALGDGS GeneTree:ENSGT00530000063396
            HSSP:Q16531 EMBL:AF132145 RefSeq:NP_650257.1 UniGene:Dm.3215
            ProteinModelPortal:Q9XYZ5 SMR:Q9XYZ5 STRING:Q9XYZ5 PaxDb:Q9XYZ5
            PRIDE:Q9XYZ5 EnsemblMetazoa:FBtr0082709 GeneID:41611
            KEGG:dme:Dmel_CG7769 UCSC:CG7769-RA CTD:41611 FlyBase:FBgn0260962
            InParanoid:Q9XYZ5 OrthoDB:EOG4S1RP0 PhylomeDB:Q9XYZ5
            GenomeRNAi:41611 NextBio:824642 Bgee:Q9XYZ5 Uniprot:Q9XYZ5
        Length = 1140

 Score = 370 (135.3 bits), Expect = 3.6e-59, Sum P(4) = 3.6e-59
 Identities = 111/409 (27%), Positives = 199/409 (48%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPG 452
             +V +E   +L PI+D+ + +L  +   QI T  G     SLRI+R G+ + E A   LPG
Sbjct:   345 VVPVENFTNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPG 404

Query:   453 VPSAVWTVKKNVNDE-FDAYIVVSFNNATLVLSI-GETVEEVSDSGFLDTTPSLAVSLIG 510
             +   +W++K  V++  ++  +V++F   T +L++ GE VEE    GF     +   S + 
Sbjct:   405 I-KGMWSLKVGVDESPYENTLVLAFVGHTRILTLSGEEVEETEIPGFASDLQTFLCSNVD 463

Query:   511 DDSLMQVHPSGIRHIREDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDM 568
              D L+QV    +R +    +  + EWR  G RTI  V  N  Q+++A S  ++ Y  ++ 
Sbjct:   464 YDQLIQVTSDSVRLVSSATKALVAEWRPTGDRTIGVVSCNTTQILVA-SACDIFYIVIE- 521

Query:   569 TGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSY-DNTIRILSLDPDDCMQILSV 627
              G L E  +  ++ +VACLDI  + E +K+S  +AVG + D +  ILSL PD  ++ +  
Sbjct:   522 DGSLREQSRRTLAYEVACLDITPLDETQKKSDLVAVGLWTDISAVILSL-PD--LETIYT 578

Query:   628 QSVSSP--PESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSR 685
             + +S    P S+L    +          H    +L   L +G ++  ++D  TGQL+D +
Sbjct:   579 EKLSGEIIPRSILMTTFEGI--------H----YLLCALGDGSMYYFIMDQTTGQLTDKK 626

Query:   686 SRFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQC 745
                LG +P  L +        +   S RP + Y    + + + ++ + + +  S ++   
Sbjct:   627 KVTLGTQPTTLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAY 686

Query:   746 VEGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQPKKKLMVI 794
              + +     NA+ + TI+ + +    T +PL   PRR   Q   +   +
Sbjct:   687 PDSLALANKNAVILGTIDEIQKLHIRT-VPLGEGPRRIAYQESSQTFAV 734

 Score = 231 (86.4 bits), Expect = 3.6e-59, Sum P(4) = 3.6e-59
 Identities = 91/340 (26%), Positives = 149/340 (43%)

Query:   882 EAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRFVEEGKSLELLHKTQ 941
             E   S+ +    D +  T   V T+  +   P+  +  G I I+ +  E K L  + +T+
Sbjct:   812 ETISSLMSAKLGD-DPNTYYVVATSLVIPEEPEPKV--GRIIIFHY-HENK-LTQVAETK 866

Query:   942 VEGIPLALCQFQGRLLAGIGPVLRLYD-LGKKRLLRKCENKLFPNTIVSI--NTYRDRIY 998
             V+G   AL +F G++LAGIG  +RLY+   +K L  +C      N I ++      D I 
Sbjct:   867 VDGTCYALVEFNGKVLAGIGSFVRLYEWTNEKELRMECN---IQNMIAALFLKAKGDFIL 923

Query:   999 VGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLPQD 1058
             VGD+  S    ++++ E      A D  P+W+ A   +D DT  G++  GN++  +    
Sbjct:   924 VGDLMRSITLLQHKQMEGIFVEIARDCEPKWMRAVEILDDDTFLGSETNGNLFVCQKDSA 983

Query:  1059 VSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIYGT 1118
              + + E        ++  G         + +  F  G +V         P  G  V+YGT
Sbjct:   984 ATTDEERQLLPELARFHLG---------DTVNVFRHGSLVMQNVGERTTPING-CVLYGT 1033

Query:  1119 VMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDHMAYR-----SAYFPVKDVIDG 1173
               G++G +     +D  DF   LE  +++    +   +H  YR     S   P +  IDG
Sbjct:  1034 CNGAIGIVTQIP-QDFYDFLHGLEERLKKIIKSVGKIEHTYYRNFQINSKVEPSEGFIDG 1092

Query:  1174 DLCEQFPTLSLDLQRKIADELDRT-PGEILKKLEEIRNKI 1212
             DL E F  LS D  R     L+ T  GE  +K  ++ + I
Sbjct:  1093 DLIESFLDLSRDKMRDAVQGLELTLNGE--RKSADVEDVI 1130

 Score = 144 (55.7 bits), Expect = 3.6e-59, Sum P(4) = 3.6e-59
 Identities = 65/244 (26%), Positives = 104/244 (42%)

Query:     4 YSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLE--LLRPENSGRIETLVSTEIFGAI 61
             Y +T Q+PT ++A + GNF+      +++AR   +E  L+ PE    +  L    I G I
Sbjct:     5 YVVTAQKPTAVVACLTGNFTSPTDLNLIIARNNQVEIDLVTPEG---LRPLKEININGTI 61

Query:    62 RSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKIHQETFGKSGCRRIVP--GQY 119
               +  FR   S KD + + +    ++ILE     +V   + +     S    I    G  
Sbjct:    62 AVMRHFRPPDSNKDLLFILTRRYNVMILEARMVNDVITVVTKANGNVSDSVGIPSEGGVI 121

Query:   120 LAVDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSICGIDCGFDNP 179
              A+DPK R  +IG C  Q L  ++  D  A    ++ L   + +  VY +  +  G  NP
Sbjct:   122 AAIDPKAR--VIGMCLYQGLFTIIPMDKDASELKATNLRMDELN--VYDVEFLH-GCLNP 176

Query:   180 IFAAIELDYSEADQDSTGQAASEAQKNLTFYELDLGLNHVSRKWSEPVDNGANMLVTVPG 239
                 I        +DS G+     + NL     D     ++ K  + V+  A ML+ VP 
Sbjct:   177 TVIVIH-------KDSDGRHVKSHEINLR----DKEFMKIAWK-QDNVETEATMLIPVPS 224

Query:   240 --GG 241
               GG
Sbjct:   225 PIGG 228

 Score = 55 (24.4 bits), Expect = 3.6e-59, Sum P(4) = 3.6e-59
 Identities = 32/147 (21%), Positives = 59/147 (40%)

Query:   223 WSEP-VDNGANMLVTVPGGGDGP-SGVLVCAENFVIYKNQGHPDVRAVIPRRADLPAERG 280
             W +  V+  A ML+ VP     P  GV+V     ++Y + G  +  AV P    L   + 
Sbjct:   207 WKQDNVETEATMLIPVPS----PIGGVIVIGRESIVY-HDGS-NYHAVAP----LTFRQS 256

Query:   281 VLIVSAATHRQKTLFFFLLQTEYGDIFKV---TLEHDNE-HVSELKIKYFDTIPVTASMC 336
              +   A        +  LL    G ++ +   T E      V ++K++    I +   + 
Sbjct:   257 TINCYARVSSNGLRY--LLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEISIPECIT 314

Query:   337 VLKSGYLFAASEFGNHALYQFQAIGAD 363
              L +G+L+  +  G+  L +  +   D
Sbjct:   315 YLDNGFLYIGARHGDSQLVRLNSEAID 341


>ZFIN|ZDB-GENE-040426-1272 [details] [associations]
            symbol:ddb1 "damage specific DNA binding protein
            1" species:7955 "Danio rerio" [GO:0005634 "nucleus" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 ZFIN:ZDB-GENE-040426-1272
            GO:GO:0005634 Gene3D:2.130.10.10 GO:GO:0003676 EMBL:JQ692623
            UniGene:Dr.77970 Uniprot:I1XUS8
        Length = 1140

 Score = 363 (132.8 bits), Expect = 5.5e-50, Sum P(4) = 5.5e-50
 Identities = 114/429 (26%), Positives = 200/429 (46%)

Query:   396 IEQVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPS 455
             +E   +L PI+DM + +L  +   Q+ T  G     SLRI+R G+ + E A   LPG+  
Sbjct:   350 METFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGI-K 408

Query:   456 AVWTVKKNVNDEFDAYIVVSFNNATLVLSI-GETVEEVSDSGFLDTTPSLAVSLIGDDSL 514
              +W ++   + + D  +V+SF   T VL + GE VEE    GF+D   +     +    L
Sbjct:   409 GLWPLRSESSRDTDDMLVLSFVGQTRVLMLSGEEVEETELQGFVDNQQTFFCGNVAHQQL 468

Query:   515 MQVHPSGIRHIREDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQL 572
             +Q+    +R + +D +  ++EW+ P  R I     N  QVV+A+  G ++Y+   ++G+L
Sbjct:   469 IQITSVSVRLVTQDSKALVSEWKEPQGRNISVASCNNTQVVLAV--GRVLYYLQILSGEL 526

Query:   573 LEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSY-DNTIRILSLDPDDCMQILSVQSVS 631
              ++   EM  +VACLDI  + E    S   AVG + D + R+L L P  C   L  + + 
Sbjct:   527 KQISSTEMEHEVACLDITPLGERTADSCICAVGLWTDISARLLKL-P--CFTPLHKEMLG 583

Query:   632 SP--PESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFL 689
                 P S+L    + S        H    +L   L +G LF   +D+ TG LS+ +   L
Sbjct:   584 GEIIPRSILMTTFEGS--------H----YLLCALGDGALFYFGLDIQTGVLSERKKVTL 631

Query:   690 GLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCVEGV 749
             G +P  L +      + +   S RP + Y    + + + ++ + + Y    +S+   + +
Sbjct:   632 GTQPTVLRTFRSLSTSNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSEGYPDSL 691

Query:   750 VSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQPKKKLMVIIET-----DQGALTA 804
                  + L + TI+ + +    T +PL  +P+R   Q   +   ++ +     D    TA
Sbjct:   692 ALANNSTLTIGTIDEIQKLHIRT-VPLYESPKRICYQEVSQCFGVLSSRVEMQDASGTTA 750

Query:   805 EEREAAKKE 813
               R +A  +
Sbjct:   751 AVRPSASTQ 759

 Score = 195 (73.7 bits), Expect = 5.5e-50, Sum P(4) = 5.5e-50
 Identities = 88/356 (24%), Positives = 153/356 (42%)

Query:   859 VSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKR-NI 917
             V  + V+D  +       +   NE A S+ +     ++      VGTA     +P+    
Sbjct:   788 VHSLLVVDQHTFEVLHAHQFLQNEYALSMVSCKL-GRDPAVYFIVGTA---MVYPEEAEP 843

Query:   918 VAGYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYD-LGKKRLLR 976
               G I ++ + + GK L+ + + +V+G   ++ +F G+LLA I   +RLY+   +K L  
Sbjct:   844 KQGRIIVFHYTD-GK-LQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTAEKELRT 901

Query:   977 KCENKLFPNTIVSI--NTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAH 1034
             +C +    N I+++   T  D I VGD+  S     Y+  E      A D  P W++A  
Sbjct:   902 ECNHY---NNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGSFEEIARDFNPNWMSAVE 958

Query:  1035 HIDFDTMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQ-FH 1093
              +D D   GA+   N++  +  +D +   +E+    +   E G  +     + E V  F 
Sbjct:   959 ILDDDNFLGAENAFNLFVCQ--KDSAATTDEERQHLQ---EVGLFH-----LGEFVNVFS 1008

Query:  1094 VGDVVTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPPLC 1153
              G +V      S  P  G SV++GTV G +G + + S          L+  + +    + 
Sbjct:  1009 HGSLVLQNLGESSTPTQG-SVLFGTVNGMIGLVTSLSE-GWYSLLLDLQNRLNKVIKSVG 1066

Query:  1154 GRDHMAYRSAYFPVKD-----VIDGDLCEQFPTLSLDLQRKIADELDRTPGEILKK 1204
               +H  +RS +   K       IDGDL E F  L     +++   L    G  +K+
Sbjct:  1067 KIEHSFWRSFHTERKTEQATGFIDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKR 1122

 Score = 116 (45.9 bits), Expect = 5.5e-50, Sum P(4) = 5.5e-50
 Identities = 39/134 (29%), Positives = 61/134 (45%)

Query:     2 YLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAI 61
             Y Y +T Q+PT + A I G+F+  +   +++A+   LE+      G +  +    ++G I
Sbjct:     3 YNYVVTAQKPTAVNACITGHFTSAEDLNLLIAKNTRLEIYAVTAEG-LRPVKEVGMYGKI 61

Query:    62 RSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKI---HQETFGKSGCRRIVPGQ 118
               +  FR  G  KD + + +      ILEY  S +  D I   H     + G R    G 
Sbjct:    62 AVMELFRPKGESKDLLFILTAKYNACILEYKQSGDSIDIITRAHGNVQDRIG-RPSETGI 120

Query:   119 YLAVDPKGRAVMIG 132
                VDP+ R  MIG
Sbjct:   121 IGIVDPECR--MIG 132

 Score = 47 (21.6 bits), Expect = 2.4e-17, Sum P(3) = 2.4e-17
 Identities = 10/51 (19%), Positives = 24/51 (47%)

Query:   123 DPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSICGID 173
             +P+GR + + +C   ++V  + R       +S  L+   S  + + +  +D
Sbjct:   492 EPQGRNISVASCNNTQVVLAVGRVLYYLQILSGELKQISSTEMEHEVACLD 542

 Score = 38 (18.4 bits), Expect = 5.5e-50, Sum P(4) = 5.5e-50
 Identities = 7/25 (28%), Positives = 14/25 (56%)

Query:   122 VDPKGRAVMIGACEKQKLVYVLNRD 146
             VDP G   ++G  E +  + +L ++
Sbjct:   264 VDPNGSRYLLGDMEGRLFMLLLEKE 288


>UNIPROTKB|Q5R649 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9601
            "Pongo abelii" [GO:0005634 "nucleus" evidence=ISS] [GO:0005737
            "cytoplasm" evidence=ISS] [GO:0031464 "Cul4A-RING ubiquitin ligase
            complex" evidence=ISS] [GO:0031465 "Cul4B-RING ubiquitin ligase
            complex" evidence=ISS] [GO:0043161 "proteasomal ubiquitin-dependent
            protein catabolic process" evidence=ISS] [GO:0080008 "Cul4-RING
            ubiquitin ligase complex" evidence=ISS] InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 UniPathway:UPA00143 GO:GO:0005634
            GO:GO:0005737 GO:GO:0043161 Gene3D:2.130.10.10 GO:GO:0003677
            GO:GO:0006281 GO:GO:0016567 GO:GO:0031464 GO:GO:0031465 KO:K10610
            CTD:1642 HOVERGEN:HBG005460 HSSP:Q16531 EMBL:CR860647
            RefSeq:NP_001126613.1 UniGene:Pab.18111 GeneID:100173610
            KEGG:pon:100173610 InParanoid:Q5R649 Uniprot:Q5R649
        Length = 1140

 Score = 353 (129.3 bits), Expect = 2.7e-49, Sum P(4) = 2.7e-49
 Identities = 109/400 (27%), Positives = 191/400 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPG 452
             +V +E   +L PI+DM + +L  +   Q+ T  G     SLRI+R G+ + E A   LPG
Sbjct:   347 VVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPG 406

Query:   453 VPSAVWTVKKNVNDEFDAYIVVSFNNATLVLSI-GETVEEVSDSGFLDTTPSLAVSLIGD 511
             +   +W ++ + N E D  +V+SF   T VL + GE VEE    GF+D   +     +  
Sbjct:   407 I-KGLWPLRSDPNRETDDTLVLSFVGQTRVLMLNGEEVEETELMGFVDDQQTFFCGNVAH 465

Query:   512 DSLMQVHPSGIRHIREDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMT 569
               L+Q+  + +R + ++ +  ++EW+ P  + I     N  QVV+A+ G  L Y ++   
Sbjct:   466 QQLIQITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAV-GRALYYLQIHPQ 524

Query:   570 GQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSY-DNTIRILSLDPDDCMQILSVQ 628
              +L ++   EM  +VACLDI  + +    S   A+G + D + RIL L P    ++L  +
Sbjct:   525 -ELRQISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKL-PS--FELLHKE 580

Query:   629 SVSSP--PESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRS 686
              +     P S+L    ++S        H    +L   L +G LF   +++ TG LSD + 
Sbjct:   581 MLGGEIIPRSILMTTFESS--------H----YLLCALGDGALFYFGLNIETGLLSDRKK 628

Query:   687 RFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCV 746
               LG +P  L +        +   S RP + Y    + + + ++ + + Y    +SD   
Sbjct:   629 VTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYP 688

Query:   747 EGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQ 786
             + +     + L + TI+ + +    T +PL  +PR+   Q
Sbjct:   689 DSLALANNSTLTIGTIDEIQKLHIRT-VPLYESPRKICYQ 727

 Score = 202 (76.2 bits), Expect = 2.7e-49, Sum P(4) = 2.7e-49
 Identities = 87/352 (24%), Positives = 157/352 (44%)

Query:   864 VLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKR-NIVAGYI 922
             ++D  +       +   NE A S+ +     K+  T   VGTA     +P+      G I
Sbjct:   793 IIDQHTFEVLHAHQFLQNEYALSLVSCKL-GKDPNTYFIVGTA---MVYPEEAEPKQGRI 848

Query:   923 HIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDLGKKRLLR-KCENK 981
              ++++  +GK L+ + + +V+G    + +F G+LLA I   +RLY+   ++ LR +C + 
Sbjct:   849 VVFQY-SDGK-LQTVAEKEVKGAVYPMVEFNGKLLASINSTVRLYEWTTEKELRTECNHY 906

Query:   982 LFPNTIVSI--NTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFD 1039
                N I+++   T  D I VGD+  S     Y+  E      A D  P W++A   +D D
Sbjct:   907 ---NNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDD 963

Query:  1040 TMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQF--HVGDV 1097
                GA+   N++  +  +D +   +E+    +   E G  +     + E V    H   V
Sbjct:   964 NFLGAENAFNLFVCQ--KDSAATTDEERQHLQ---EVGLFH-----LGEFVNVFCHGSLV 1013

Query:  1098 VTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDH 1157
             + +L + S  P  G SV++GTV G +G + + S     +    ++  + +    +   +H
Sbjct:  1014 MQNLGETS-TPTQG-SVLFGTVNGMIGLVTSLSE-SWYNLLLDMQNRLNKVIKSVGKIEH 1070

Query:  1158 MAYRSAYF-----PVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPGEILKK 1204
               +RS +      P    IDGDL E F  +S    +++   L    G  +K+
Sbjct:  1071 SFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKR 1122

 Score = 113 (44.8 bits), Expect = 2.7e-49, Sum P(4) = 2.7e-49
 Identities = 37/148 (25%), Positives = 65/148 (43%)

Query:     2 YLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAI 61
             Y Y +T Q+PT +   + G+F+  +   +++A+   LE+      G +  +    ++G I
Sbjct:     3 YNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVVTAEG-LRPVKEVGMYGKI 61

Query:    62 RSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKI---HQETFGKSGCRRIVPGQ 118
               +  FR  G  KD + + +    + ILEY  S    D I   H     + G R    G 
Sbjct:    62 AVMELFRPKGESKDLLFILTAKYNVCILEYKQSGESIDIITRAHGNVQDRIG-RPSETGI 120

Query:   119 YLAVDPKGRAVMIGACEKQKLVYVLNRD 146
                +DP+ R + +   +    V  L+RD
Sbjct:   121 IGIIDPECRMIGLRLYDGLFKVIPLDRD 148

 Score = 38 (18.4 bits), Expect = 2.7e-49, Sum P(4) = 2.7e-49
 Identities = 9/36 (25%), Positives = 17/36 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRG 428
             L + EQ++  + + D+R+  L E    +  T    G
Sbjct:   285 LEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNG 320

 Score = 38 (18.4 bits), Expect = 1.6e-41, Sum P(4) = 1.6e-41
 Identities = 7/25 (28%), Positives = 14/25 (56%)

Query:   122 VDPKGRAVMIGACEKQKLVYVLNRD 146
             VDP G   ++G  E +  + +L ++
Sbjct:   264 VDPNGSRYLLGDMEGRLFMLLLEKE 288

 Score = 37 (18.1 bits), Expect = 3.8e-10, Sum P(2) = 3.8e-10
 Identities = 6/21 (28%), Positives = 12/21 (57%)

Query:   600 RFLAVGSYDNTIRILSLDPDD 620
             R + +  YD   +++ LD D+
Sbjct:   129 RMIGLRLYDGLFKVIPLDRDN 149


>UNIPROTKB|Q16531 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9606
            "Homo sapiens" [GO:0019048 "virus-host interaction" evidence=IEA]
            [GO:0016055 "Wnt receptor signaling pathway" evidence=IEA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005737 "cytoplasm"
            evidence=IDA] [GO:0000075 "cell cycle checkpoint" evidence=IMP]
            [GO:0005634 "nucleus" evidence=IDA] [GO:0042787 "protein
            ubiquitination involved in ubiquitin-dependent protein catabolic
            process" evidence=IDA] [GO:0031464 "Cul4A-RING ubiquitin ligase
            complex" evidence=IDA] [GO:0031465 "Cul4B-RING ubiquitin ligase
            complex" evidence=IDA] [GO:0043161 "proteasomal ubiquitin-dependent
            protein catabolic process" evidence=IMP] [GO:0080008 "Cul4-RING
            ubiquitin ligase complex" evidence=IDA] [GO:0003677 "DNA binding"
            evidence=TAS] [GO:0003684 "damaged DNA binding" evidence=TAS]
            [GO:0000718 "nucleotide-excision repair, DNA damage removal"
            evidence=TAS] [GO:0005654 "nucleoplasm" evidence=TAS] [GO:0006281
            "DNA repair" evidence=TAS] [GO:0006289 "nucleotide-excision repair"
            evidence=TAS] Reactome:REACT_216 InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 UniPathway:UPA00143 EMBL:U32986
            GO:GO:0005737 GO:GO:0019048 GO:GO:0005654 GO:GO:0043161
            GO:GO:0016055 Gene3D:2.130.10.10 GO:GO:0003684 EMBL:CH471076
            GO:GO:0042787 GO:GO:0000075 GO:GO:0000718 EMBL:AP003108
            GO:GO:0031464 PDB:2HYE PDB:4A0K PDBsum:2HYE PDBsum:4A0K PDB:4A0L
            PDBsum:4A0L GO:GO:0031465 PDB:3I7P PDBsum:3I7P PDB:3I8C PDBsum:3I8C
            PDB:3I89 PDBsum:3I89 PDB:3I7O PDBsum:3I7O PDB:3I8E PDBsum:3I8E
            eggNOG:NOG247734 HOGENOM:HOG000007241 KO:K10610 OMA:CALGDGS
            CTD:1642 HOVERGEN:HBG005460 OrthoDB:EOG4KPT91 EMBL:U18299
            EMBL:L40326 EMBL:AJ002955 EMBL:AK312436 EMBL:AY960579 EMBL:BC011686
            EMBL:BC050530 EMBL:BC051764 IPI:IPI00293464 PIR:I38908
            RefSeq:NP_001914.3 UniGene:Hs.290758 PDB:2B5L PDB:2B5M PDB:2B5N
            PDB:3E0C PDB:3EI1 PDB:3EI2 PDB:3EI3 PDB:3EI4 PDB:3I7H PDB:3I7K
            PDB:3I7L PDB:3I7N PDB:4A08 PDB:4A09 PDB:4A0A PDB:4A0B PDB:4A11
            PDB:4E54 PDB:4E5Z PDBsum:2B5L PDBsum:2B5M PDBsum:2B5N PDBsum:3E0C
            PDBsum:3EI1 PDBsum:3EI2 PDBsum:3EI3 PDBsum:3EI4 PDBsum:3I7H
            PDBsum:3I7K PDBsum:3I7L PDBsum:3I7N PDBsum:4A08 PDBsum:4A09
            PDBsum:4A0A PDBsum:4A0B PDBsum:4A11 PDBsum:4E54 PDBsum:4E5Z
            ProteinModelPortal:Q16531 SMR:Q16531 DIP:DIP-430N IntAct:Q16531
            MINT:MINT-1134697 STRING:Q16531 PhosphoSite:Q16531 PaxDb:Q16531
            PRIDE:Q16531 Ensembl:ENST00000301764 GeneID:1642 KEGG:hsa:1642
            UCSC:uc001nrc.4 GeneCards:GC11M061066 H-InvDB:HIX0171380
            HGNC:HGNC:2717 HPA:CAB032821 MIM:600045 neXtProt:NX_Q16531
            PharmGKB:PA27187 InParanoid:Q16531 ChiTaRS:DDB1
            EvolutionaryTrace:Q16531 GenomeRNAi:1642 NextBio:6750
            ArrayExpress:Q16531 Bgee:Q16531 CleanEx:HS_DDB1
            Genevestigator:Q16531 GermOnline:ENSG00000167986 Uniprot:Q16531
        Length = 1140

 Score = 353 (129.3 bits), Expect = 4.4e-49, Sum P(4) = 4.4e-49
 Identities = 109/400 (27%), Positives = 191/400 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPG 452
             +V +E   +L PI+DM + +L  +   Q+ T  G     SLRI+R G+ + E A   LPG
Sbjct:   347 VVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPG 406

Query:   453 VPSAVWTVKKNVNDEFDAYIVVSFNNATLVLSI-GETVEEVSDSGFLDTTPSLAVSLIGD 511
             +   +W ++ + N E D  +V+SF   T VL + GE VEE    GF+D   +     +  
Sbjct:   407 I-KGLWPLRSDPNRETDDTLVLSFVGQTRVLMLNGEEVEETELMGFVDDQQTFFCGNVAH 465

Query:   512 DSLMQVHPSGIRHIREDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMT 569
               L+Q+  + +R + ++ +  ++EW+ P  + I     N  QVV+A+ G  L Y ++   
Sbjct:   466 QQLIQITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAV-GRALYYLQIHPQ 524

Query:   570 GQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSY-DNTIRILSLDPDDCMQILSVQ 628
              +L ++   EM  +VACLDI  + +    S   A+G + D + RIL L P    ++L  +
Sbjct:   525 -ELRQISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKL-PS--FELLHKE 580

Query:   629 SVSSP--PESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRS 686
              +     P S+L    ++S        H    +L   L +G LF   +++ TG LSD + 
Sbjct:   581 MLGGEIIPRSILMTTFESS--------H----YLLCALGDGALFYFGLNIETGLLSDRKK 628

Query:   687 RFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCV 746
               LG +P  L +        +   S RP + Y    + + + ++ + + Y    +SD   
Sbjct:   629 VTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYP 688

Query:   747 EGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQ 786
             + +     + L + TI+ + +    T +PL  +PR+   Q
Sbjct:   689 DSLALANNSTLTIGTIDEIQKLHIRT-VPLYESPRKICYQ 727

 Score = 204 (76.9 bits), Expect = 4.4e-49, Sum P(4) = 4.4e-49
 Identities = 87/352 (24%), Positives = 158/352 (44%)

Query:   864 VLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKR-NIVAGYI 922
             ++D  +       +   NE A S+ +     K+  T   VGTA     +P+      G I
Sbjct:   793 IIDQHTFEVLHAHQFLQNEYALSLVSCKL-GKDPNTYFIVGTA---MVYPEEAEPKQGRI 848

Query:   923 HIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDLGKKRLLR-KCENK 981
              ++++  +GK L+ + + +V+G   ++ +F G+LLA I   +RLY+   ++ LR +C + 
Sbjct:   849 VVFQY-SDGK-LQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTTEKELRTECNHY 906

Query:   982 LFPNTIVSI--NTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFD 1039
                N I+++   T  D I VGD+  S     Y+  E      A D  P W++A   +D D
Sbjct:   907 ---NNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDD 963

Query:  1040 TMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQF--HVGDV 1097
                GA+   N++  +  +D +   +E+    +   E G  +     + E V    H   V
Sbjct:   964 NFLGAENAFNLFVCQ--KDSAATTDEERQHLQ---EVGLFH-----LGEFVNVFCHGSLV 1013

Query:  1098 VTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDH 1157
             + +L + S  P  G SV++GTV G +G + + S     +    ++  + +    +   +H
Sbjct:  1014 MQNLGETS-TPTQG-SVLFGTVNGMIGLVTSLSE-SWYNLLLDMQNRLNKVIKSVGKIEH 1070

Query:  1158 MAYRSAYF-----PVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPGEILKK 1204
               +RS +      P    IDGDL E F  +S    +++   L    G  +K+
Sbjct:  1071 SFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKR 1122

 Score = 109 (43.4 bits), Expect = 4.4e-49, Sum P(4) = 4.4e-49
 Identities = 37/148 (25%), Positives = 64/148 (43%)

Query:     2 YLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAI 61
             Y Y +T Q+PT +   + G+F+  +   +++A+   LE+      G +  +    ++G I
Sbjct:     3 YNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVVTAEG-LRPVKEVGMYGKI 61

Query:    62 RSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKI---HQETFGKSGCRRIVPGQ 118
               +  FR  G  KD + + +      ILEY  S    D I   H     + G R    G 
Sbjct:    62 AVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG-RPSETGI 120

Query:   119 YLAVDPKGRAVMIGACEKQKLVYVLNRD 146
                +DP+ R + +   +    V  L+RD
Sbjct:   121 IGIIDPECRMIGLRLYDGLFKVIPLDRD 148

 Score = 38 (18.4 bits), Expect = 4.4e-49, Sum P(4) = 4.4e-49
 Identities = 9/36 (25%), Positives = 17/36 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRG 428
             L + EQ++  + + D+R+  L E    +  T    G
Sbjct:   285 LEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNG 320

 Score = 38 (18.4 bits), Expect = 9.7e-42, Sum P(4) = 9.7e-42
 Identities = 7/25 (28%), Positives = 14/25 (56%)

Query:   122 VDPKGRAVMIGACEKQKLVYVLNRD 146
             VDP G   ++G  E +  + +L ++
Sbjct:   264 VDPNGSRYLLGDMEGRLFMLLLEKE 288

 Score = 37 (18.1 bits), Expect = 2.4e-10, Sum P(2) = 2.4e-10
 Identities = 6/21 (28%), Positives = 12/21 (57%)

Query:   600 RFLAVGSYDNTIRILSLDPDD 620
             R + +  YD   +++ LD D+
Sbjct:   129 RMIGLRLYDGLFKVIPLDRDN 149


>UNIPROTKB|E2R9E3 [details] [associations]
            symbol:DDB1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0043161 "proteasomal ubiquitin-dependent
            protein catabolic process" evidence=IEA] [GO:0042787 "protein
            ubiquitination involved in ubiquitin-dependent protein catabolic
            process" evidence=IEA] [GO:0031465 "Cul4B-RING ubiquitin ligase
            complex" evidence=IEA] [GO:0031464 "Cul4A-RING ubiquitin ligase
            complex" evidence=IEA] [GO:0016055 "Wnt receptor signaling pathway"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0000075
            "cell cycle checkpoint" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
            InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178 GO:GO:0005634
            GO:GO:0005737 GO:GO:0043161 GO:GO:0016055 Gene3D:2.130.10.10
            GO:GO:0003676 GO:GO:0042787 GO:GO:0000075 GO:GO:0031464
            GO:GO:0031465 KO:K10610 OMA:CALGDGS CTD:1642
            GeneTree:ENSGT00530000063396 EMBL:AAEX03011677 RefSeq:XP_533275.2
            Ensembl:ENSCAFT00000025824 GeneID:476067 KEGG:cfa:476067
            NextBio:20851798 Uniprot:E2R9E3
        Length = 1140

 Score = 352 (129.0 bits), Expect = 5.6e-49, Sum P(4) = 5.6e-49
 Identities = 109/400 (27%), Positives = 191/400 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPG 452
             +V +E   +L PI+DM + +L  +   Q+ T  G     SLRI+R G+ + E A   LPG
Sbjct:   347 VVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPG 406

Query:   453 VPSAVWTVKKNVNDEFDAYIVVSFNNATLVLSI-GETVEEVSDSGFLDTTPSLAVSLIGD 511
             +   +W ++ + N E D  +V+SF   T VL + GE VEE    GF+D   +     +  
Sbjct:   407 I-KGLWPLRSDPNRETDDTLVLSFVGQTRVLMLNGEEVEETELMGFVDDQQTFFCGNVAH 465

Query:   512 DSLMQVHPSGIRHIREDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMT 569
               L+Q+  + +R + ++ +  ++EW+ P  + I     N  QVV+A+ G  L Y ++   
Sbjct:   466 QQLIQITSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAV-GRALYYLQIHPQ 524

Query:   570 GQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSY-DNTIRILSLDPDDCMQILSVQ 628
              +L ++   EM  +VACLDI  + +    S   A+G + D + RIL L P    ++L  +
Sbjct:   525 -ELRQISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKL-PS--FELLHKE 580

Query:   629 SVSSP--PESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRS 686
              +     P S+L    ++S        H    +L   L +G LF   +++ TG LSD + 
Sbjct:   581 MLGGEIIPRSILMTTFESS--------H----YLLCALGDGALFYFGLNIETGLLSDRKK 628

Query:   687 RFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCV 746
               LG +P  L +        +   S RP + Y    + + + ++ + + Y    +SD   
Sbjct:   629 VTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYP 688

Query:   747 EGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQ 786
             + +     + L + TI+ + +    T +PL  +PR+   Q
Sbjct:   689 DSLALANNSTLTIGTIDEIQKLHIRT-VPLYESPRKICYQ 727

 Score = 204 (76.9 bits), Expect = 5.6e-49, Sum P(4) = 5.6e-49
 Identities = 87/352 (24%), Positives = 158/352 (44%)

Query:   864 VLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKR-NIVAGYI 922
             ++D  +       +   NE A S+ +     K+  T   VGTA     +P+      G I
Sbjct:   793 IIDQHTFEVLHAHQFLQNEYALSLVSCKL-GKDPNTYFIVGTA---MVYPEEAEPKQGRI 848

Query:   923 HIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDLGKKRLLR-KCENK 981
              ++++  +GK L+ + + +V+G   ++ +F G+LLA I   +RLY+   ++ LR +C + 
Sbjct:   849 VVFQY-SDGK-LQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTTEKELRTECNHY 906

Query:   982 LFPNTIVSI--NTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFD 1039
                N I+++   T  D I VGD+  S     Y+  E      A D  P W++A   +D D
Sbjct:   907 ---NNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDD 963

Query:  1040 TMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQF--HVGDV 1097
                GA+   N++  +  +D +   +E+    +   E G  +     + E V    H   V
Sbjct:   964 NFLGAENAFNLFVCQ--KDSAATTDEERQHLQ---EVGLFH-----LGEFVNVFCHGSLV 1013

Query:  1098 VTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDH 1157
             + +L + S  P  G SV++GTV G +G + + S     +    ++  + +    +   +H
Sbjct:  1014 MQNLGETS-TPTQG-SVLFGTVNGMIGLVTSLSE-SWYNLLLDMQNRLNKVIKSVGKIEH 1070

Query:  1158 MAYRSAYF-----PVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPGEILKK 1204
               +RS +      P    IDGDL E F  +S    +++   L    G  +K+
Sbjct:  1071 SFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKR 1122

 Score = 109 (43.4 bits), Expect = 5.6e-49, Sum P(4) = 5.6e-49
 Identities = 37/148 (25%), Positives = 64/148 (43%)

Query:     2 YLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAI 61
             Y Y +T Q+PT +   + G+F+  +   +++A+   LE+      G +  +    ++G I
Sbjct:     3 YNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVVTAEG-LRPVKEVGMYGKI 61

Query:    62 RSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKI---HQETFGKSGCRRIVPGQ 118
               +  FR  G  KD + + +      ILEY  S    D I   H     + G R    G 
Sbjct:    62 AVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG-RPSETGI 120

Query:   119 YLAVDPKGRAVMIGACEKQKLVYVLNRD 146
                +DP+ R + +   +    V  L+RD
Sbjct:   121 IGIIDPECRMIGLRLYDGLFKVIPLDRD 148

 Score = 43 (20.2 bits), Expect = 5.7e-11, Sum P(2) = 5.7e-11
 Identities = 12/43 (27%), Positives = 22/43 (51%)

Query:   123 DPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHK-SHT 164
             +P+G+ + + +C   ++V  + R     L I  P E  + SHT
Sbjct:   492 EPQGKNISVASCNSSQVVVAVGR-ALYYLQIH-PQELRQISHT 532

 Score = 38 (18.4 bits), Expect = 5.6e-49, Sum P(4) = 5.6e-49
 Identities = 9/36 (25%), Positives = 17/36 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRG 428
             L + EQ++  + + D+R+  L E    +  T    G
Sbjct:   285 LEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNG 320

 Score = 38 (18.4 bits), Expect = 1.2e-41, Sum P(4) = 1.2e-41
 Identities = 7/25 (28%), Positives = 14/25 (56%)

Query:   122 VDPKGRAVMIGACEKQKLVYVLNRD 146
             VDP G   ++G  E +  + +L ++
Sbjct:   264 VDPNGSRYLLGDMEGRLFMLLLEKE 288

 Score = 37 (18.1 bits), Expect = 2.4e-10, Sum P(2) = 2.4e-10
 Identities = 6/21 (28%), Positives = 12/21 (57%)

Query:   600 RFLAVGSYDNTIRILSLDPDD 620
             R + +  YD   +++ LD D+
Sbjct:   129 RMIGLRLYDGLFKVIPLDRDN 149


>MGI|MGI:1202384 [details] [associations]
            symbol:Ddb1 "damage specific DNA binding protein 1"
            species:10090 "Mus musculus" [GO:0000075 "cell cycle checkpoint"
            evidence=ISO] [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0003677 "DNA binding" evidence=IEA] [GO:0003684 "damaged DNA
            binding" evidence=ISO] [GO:0005634 "nucleus" evidence=ISO]
            [GO:0005737 "cytoplasm" evidence=ISO] [GO:0006281 "DNA repair"
            evidence=IEA] [GO:0006974 "response to DNA damage stimulus"
            evidence=IEA] [GO:0016055 "Wnt receptor signaling pathway"
            evidence=IDA] [GO:0031464 "Cul4A-RING ubiquitin ligase complex"
            evidence=ISO] [GO:0031465 "Cul4B-RING ubiquitin ligase complex"
            evidence=ISO] [GO:0042787 "protein ubiquitination involved in
            ubiquitin-dependent protein catabolic process" evidence=ISO]
            [GO:0043161 "proteasomal ubiquitin-dependent protein catabolic
            process" evidence=ISO] [GO:0080008 "Cul4-RING ubiquitin ligase
            complex" evidence=ISO] InterPro:IPR004871 InterPro:IPR015943
            Pfam:PF03178 UniPathway:UPA00143 MGI:MGI:1202384 GO:GO:0005634
            GO:GO:0005737 GO:GO:0043161 GO:GO:0016055 Gene3D:2.130.10.10
            GO:GO:0003684 GO:GO:0006281 GO:GO:0042787 GO:GO:0000075
            GO:GO:0031464 GO:GO:0031465 eggNOG:NOG247734 KO:K10610 OMA:CALGDGS
            CTD:1642 GeneTree:ENSGT00530000063396 HOVERGEN:HBG005460
            HSSP:Q16531 ChiTaRS:DDB1 EMBL:AB026432 EMBL:AF159853 EMBL:AK146522
            EMBL:AK152228 EMBL:AK154303 EMBL:AK155020 EMBL:AK155920
            EMBL:AK157491 EMBL:BC002210 EMBL:BC009661 IPI:IPI00316740
            PIR:JC7152 RefSeq:NP_056550.1 UniGene:Mm.289915 UniGene:Mm.466856
            ProteinModelPortal:Q3U1J4 SMR:Q3U1J4 IntAct:Q3U1J4 STRING:Q3U1J4
            PaxDb:Q3U1J4 PRIDE:Q3U1J4 Ensembl:ENSMUST00000025649 GeneID:13194
            KEGG:mmu:13194 UCSC:uc008gqm.1 InParanoid:Q3U1J4 NextBio:283320
            Bgee:Q3U1J4 CleanEx:MM_DDB1 Genevestigator:Q3U1J4 Uniprot:Q3U1J4
        Length = 1140

 Score = 346 (126.9 bits), Expect = 9.9e-49, Sum P(4) = 9.9e-49
 Identities = 108/400 (27%), Positives = 190/400 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPG 452
             +V +E   +L PI+DM + +L  +   Q+ T  G     SLRI+R G+ + E A   LPG
Sbjct:   347 VVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPG 406

Query:   453 VPSAVWTVKKNVNDEFDAYIVVSFNNATLVLSI-GETVEEVSDSGFLDTTPSLAVSLIGD 511
             +   +W ++ +   E D  +V+SF   T VL + GE VEE    GF+D   +     +  
Sbjct:   407 I-KGLWPLRSDPGRETDDTLVLSFVGQTRVLMLNGEEVEETELMGFVDDQQTFFCGNVAH 465

Query:   512 DSLMQVHPSGIRHIREDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMT 569
               L+Q+  + +R + ++ +  ++EW+ P  + I     N  QVV+A+ G  L Y ++   
Sbjct:   466 QQLIQITSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAV-GRALYYLQIHPQ 524

Query:   570 GQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSY-DNTIRILSLDPDDCMQILSVQ 628
              +L ++   EM  +VACLDI  + +    S   A+G + D + RIL L P    ++L  +
Sbjct:   525 -ELRQISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKL-PS--FELLHKE 580

Query:   629 SVSSP--PESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRS 686
              +     P S+L    ++S        H    +L   L +G LF   +++ TG LSD + 
Sbjct:   581 MLGGEIIPRSILMTTFESS--------H----YLLCALGDGALFYFGLNIETGLLSDRKK 628

Query:   687 RFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCV 746
               LG +P  L +        +   S RP + Y    + + + ++ + + Y    +SD   
Sbjct:   629 VTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYP 688

Query:   747 EGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQ 786
             + +     + L + TI+ + +    T +PL  +PR+   Q
Sbjct:   689 DSLALANNSTLTIGTIDEIQKLHIRT-VPLYESPRKICYQ 727

 Score = 208 (78.3 bits), Expect = 9.9e-49, Sum P(4) = 9.9e-49
 Identities = 88/352 (25%), Positives = 159/352 (45%)

Query:   864 VLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKR-NIVAGYI 922
             ++D  +       +   NE A S+ +     K+  T   VGTA     +P+      G I
Sbjct:   793 IIDQHTFEVLHAHQFLQNEYALSLVSCKL-GKDPNTYFIVGTA---MVYPEEAEPKQGRI 848

Query:   923 HIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDLGKKRLLR-KCENK 981
              ++++  +GK L+ + + +V+G   ++ +F G+LLA I   +RLY+   ++ LR +C + 
Sbjct:   849 VVFQY-SDGK-LQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTTEKELRTECNHY 906

Query:   982 LFPNTIVSI--NTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFD 1039
                N I+++   T  D I VGD+  S     Y+  E      A D  P W++A   +D D
Sbjct:   907 ---NNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDD 963

Query:  1040 TMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQF--HVGDV 1097
                GA+   N++  +  +D +   +E+    +   E G  +     + E V    H   V
Sbjct:   964 NFLGAENAFNLFVCQ--KDSAATTDEERQHLQ---EVGLFH-----LGEFVNVFCHGSLV 1013

Query:  1098 VTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDH 1157
             + +L +AS  P  G SV++GTV G +G + + S     +    ++  + +    +   +H
Sbjct:  1014 MQNLGEAS-TPTQG-SVLFGTVNGMIGLVTSLSE-SWYNLLLDMQNRLNKVIKSVGKIEH 1070

Query:  1158 MAYRSAYF-----PVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPGEILKK 1204
               +RS +      P    IDGDL E F  +S    +++   L    G  +K+
Sbjct:  1071 SFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKR 1122

 Score = 109 (43.4 bits), Expect = 9.9e-49, Sum P(4) = 9.9e-49
 Identities = 37/148 (25%), Positives = 64/148 (43%)

Query:     2 YLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAI 61
             Y Y +T Q+PT +   + G+F+  +   +++A+   LE+      G +  +    ++G I
Sbjct:     3 YNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVVTAEG-LRPVKEVGMYGKI 61

Query:    62 RSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKI---HQETFGKSGCRRIVPGQ 118
               +  FR  G  KD + + +      ILEY  S    D I   H     + G R    G 
Sbjct:    62 AVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG-RPSETGI 120

Query:   119 YLAVDPKGRAVMIGACEKQKLVYVLNRD 146
                +DP+ R + +   +    V  L+RD
Sbjct:   121 IGIIDPECRMIGLRLYDGLFKVIPLDRD 148

 Score = 43 (20.2 bits), Expect = 2.1e-11, Sum P(2) = 2.1e-11
 Identities = 12/43 (27%), Positives = 22/43 (51%)

Query:   123 DPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHK-SHT 164
             +P+G+ + + +C   ++V  + R     L I  P E  + SHT
Sbjct:   492 EPQGKNISVASCNSSQVVVAVGR-ALYYLQIH-PQELRQISHT 532

 Score = 38 (18.4 bits), Expect = 9.9e-49, Sum P(4) = 9.9e-49
 Identities = 9/36 (25%), Positives = 17/36 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRG 428
             L + EQ++  + + D+R+  L E    +  T    G
Sbjct:   285 LEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNG 320

 Score = 38 (18.4 bits), Expect = 2.2e-41, Sum P(4) = 2.2e-41
 Identities = 7/25 (28%), Positives = 14/25 (56%)

Query:   122 VDPKGRAVMIGACEKQKLVYVLNRD 146
             VDP G   ++G  E +  + +L ++
Sbjct:   264 VDPNGSRYLLGDMEGRLFMLLLEKE 288

 Score = 37 (18.1 bits), Expect = 8.9e-11, Sum P(2) = 8.9e-11
 Identities = 6/21 (28%), Positives = 12/21 (57%)

Query:   600 RFLAVGSYDNTIRILSLDPDD 620
             R + +  YD   +++ LD D+
Sbjct:   129 RMIGLRLYDGLFKVIPLDRDN 149


>UNIPROTKB|A1A4K3 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9913
            "Bos taurus" [GO:0080008 "Cul4-RING ubiquitin ligase complex"
            evidence=ISS] [GO:0031464 "Cul4A-RING ubiquitin ligase complex"
            evidence=ISS] [GO:0043161 "proteasomal ubiquitin-dependent protein
            catabolic process" evidence=ISS] [GO:0031465 "Cul4B-RING ubiquitin
            ligase complex" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISS]
            [GO:0005634 "nucleus" evidence=ISS] [GO:0042787 "protein
            ubiquitination involved in ubiquitin-dependent protein catabolic
            process" evidence=IEA] [GO:0016055 "Wnt receptor signaling pathway"
            evidence=IEA] [GO:0000075 "cell cycle checkpoint" evidence=IEA]
            [GO:0006281 "DNA repair" evidence=IEA] [GO:0003677 "DNA binding"
            evidence=IEA] InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178
            UniPathway:UPA00143 GO:GO:0005634 GO:GO:0005737 GO:GO:0043161
            GO:GO:0016055 Gene3D:2.130.10.10 GO:GO:0003677 GO:GO:0006281
            GO:GO:0042787 GO:GO:0000075 GO:GO:0031464 GO:GO:0031465
            eggNOG:NOG247734 HOGENOM:HOG000007241 KO:K10610 OMA:CALGDGS
            EMBL:BC126629 IPI:IPI00713891 RefSeq:NP_001073731.1
            UniGene:Bt.62917 STRING:A1A4K3 PRIDE:A1A4K3
            Ensembl:ENSBTAT00000028740 GeneID:511951 KEGG:bta:511951 CTD:1642
            GeneTree:ENSGT00530000063396 HOVERGEN:HBG005460 InParanoid:A1A4K3
            OrthoDB:EOG4KPT91 NextBio:20870176 Uniprot:A1A4K3
        Length = 1140

 Score = 348 (127.6 bits), Expect = 1.6e-48, Sum P(4) = 1.6e-48
 Identities = 108/400 (27%), Positives = 190/400 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPG 452
             +V +E   +L PI+DM + +L  +   Q+ T  G     SLRI+R G+ + E A   LPG
Sbjct:   347 VVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPG 406

Query:   453 VPSAVWTVKKNVNDEFDAYIVVSFNNATLVLSI-GETVEEVSDSGFLDTTPSLAVSLIGD 511
             +   +W ++ + N E D  +V+SF   T VL + GE VEE    GF+D   +     +  
Sbjct:   407 I-KGLWPLRSDPNRETDDTLVLSFVGQTRVLMLNGEEVEETELMGFVDDQQTFFCGNVAH 465

Query:   512 DSLMQVHPSGIRHIREDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMT 569
               L+Q+  + +R + ++ +  ++EW+ P  + I     N  QVV+A+ G  L Y ++   
Sbjct:   466 QQLIQITSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAV-GRALYYLQIHPQ 524

Query:   570 GQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSY-DNTIRILSLDPDDCMQILSVQ 628
              +L ++   EM  +VACLDI  + +    S   A+G + D + RI  L P    ++L  +
Sbjct:   525 -ELRQISHTEMEHEVACLDITPLGDSNGMSPLCAIGLWTDISARIAKL-PS--FELLHKE 580

Query:   629 SVSSP--PESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRS 686
              +     P S+L    ++S        H    +L   L +G LF   +++ TG LSD + 
Sbjct:   581 MLGGEIIPRSILMTTFESS--------H----YLLCALGDGALFYFGLNIETGLLSDRKK 628

Query:   687 RFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCV 746
               LG +P  L +        +   S RP + Y    + + + ++ + + Y    +SD   
Sbjct:   629 VTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYP 688

Query:   747 EGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQ 786
             + +     + L + TI+ + +    T +PL  +PR+   Q
Sbjct:   689 DSLALANNSTLTIGTIDEIQKLHIRT-VPLYESPRKICYQ 727

 Score = 204 (76.9 bits), Expect = 1.6e-48, Sum P(4) = 1.6e-48
 Identities = 87/352 (24%), Positives = 158/352 (44%)

Query:   864 VLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKR-NIVAGYI 922
             ++D  +       +   NE A S+ +     K+  T   VGTA     +P+      G I
Sbjct:   793 IIDQHTFEVLHAHQFLQNEYALSLVSCKL-GKDPNTYFIVGTA---MVYPEEAEPKQGRI 848

Query:   923 HIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDLGKKRLLR-KCENK 981
              ++++  +GK L+ + + +V+G   ++ +F G+LLA I   +RLY+   ++ LR +C + 
Sbjct:   849 VVFQY-SDGK-LQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTTEKELRTECNHY 906

Query:   982 LFPNTIVSI--NTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFD 1039
                N I+++   T  D I VGD+  S     Y+  E      A D  P W++A   +D D
Sbjct:   907 ---NNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDD 963

Query:  1040 TMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQF--HVGDV 1097
                GA+   N++  +  +D +   +E+    +   E G  +     + E V    H   V
Sbjct:   964 NFLGAENAFNLFVCQ--KDSAATTDEERQHLQ---EVGLFH-----LGEFVNVFCHGSLV 1013

Query:  1098 VTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDH 1157
             + +L + S  P  G SV++GTV G +G + + S     +    ++  + +    +   +H
Sbjct:  1014 MQNLGETS-TPTQG-SVLFGTVNGMIGLVTSLSE-SWYNLLLDMQNRLNKVIKSVGKIEH 1070

Query:  1158 MAYRSAYF-----PVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPGEILKK 1204
               +RS +      P    IDGDL E F  +S    +++   L    G  +K+
Sbjct:  1071 SFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKR 1122

 Score = 109 (43.4 bits), Expect = 1.6e-48, Sum P(4) = 1.6e-48
 Identities = 37/148 (25%), Positives = 64/148 (43%)

Query:     2 YLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAI 61
             Y Y +T Q+PT +   + G+F+  +   +++A+   LE+      G +  +    ++G I
Sbjct:     3 YNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVVTAEG-LRPVKEVGMYGKI 61

Query:    62 RSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKI---HQETFGKSGCRRIVPGQ 118
               +  FR  G  KD + + +      ILEY  S    D I   H     + G R    G 
Sbjct:    62 AVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG-RPSETGI 120

Query:   119 YLAVDPKGRAVMIGACEKQKLVYVLNRD 146
                +DP+ R + +   +    V  L+RD
Sbjct:   121 IGIIDPECRMIGLRLYDGLFKVIPLDRD 148

 Score = 43 (20.2 bits), Expect = 5.7e-11, Sum P(2) = 5.7e-11
 Identities = 12/43 (27%), Positives = 22/43 (51%)

Query:   123 DPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHK-SHT 164
             +P+G+ + + +C   ++V  + R     L I  P E  + SHT
Sbjct:   492 EPQGKNISVASCNSSQVVVAVGR-ALYYLQIH-PQELRQISHT 532

 Score = 38 (18.4 bits), Expect = 1.6e-48, Sum P(4) = 1.6e-48
 Identities = 9/36 (25%), Positives = 17/36 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRG 428
             L + EQ++  + + D+R+  L E    +  T    G
Sbjct:   285 LEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNG 320

 Score = 38 (18.4 bits), Expect = 3.5e-41, Sum P(4) = 3.5e-41
 Identities = 7/25 (28%), Positives = 14/25 (56%)

Query:   122 VDPKGRAVMIGACEKQKLVYVLNRD 146
             VDP G   ++G  E +  + +L ++
Sbjct:   264 VDPNGSRYLLGDMEGRLFMLLLEKE 288

 Score = 37 (18.1 bits), Expect = 2.4e-10, Sum P(2) = 2.4e-10
 Identities = 6/21 (28%), Positives = 12/21 (57%)

Query:   600 RFLAVGSYDNTIRILSLDPDD 620
             R + +  YD   +++ LD D+
Sbjct:   129 RMIGLRLYDGLFKVIPLDRDN 149


>UNIPROTKB|P33194 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9534
            "Chlorocebus aethiops" [GO:0005634 "nucleus" evidence=ISS]
            [GO:0005737 "cytoplasm" evidence=ISS] [GO:0031464 "Cul4A-RING
            ubiquitin ligase complex" evidence=ISS] [GO:0031465 "Cul4B-RING
            ubiquitin ligase complex" evidence=ISS] [GO:0043161 "proteasomal
            ubiquitin-dependent protein catabolic process" evidence=ISS]
            [GO:0080008 "Cul4-RING ubiquitin ligase complex" evidence=ISS]
            InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178
            UniPathway:UPA00143 GO:GO:0005634 GO:GO:0005737 GO:GO:0043161
            Gene3D:2.130.10.10 GO:GO:0003677 GO:GO:0006281 GO:GO:0016567
            GO:GO:0031464 GO:GO:0031465 HOVERGEN:HBG005460 EMBL:L20216
            PIR:S38777 PRIDE:P33194 Uniprot:P33194
        Length = 1140

 Score = 353 (129.3 bits), Expect = 1.8e-48, Sum P(4) = 1.8e-48
 Identities = 109/400 (27%), Positives = 191/400 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPG 452
             +V +E   +L PI+DM + +L  +   Q+ T  G     SLRI+R G+ + E A   LPG
Sbjct:   347 VVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPG 406

Query:   453 VPSAVWTVKKNVNDEFDAYIVVSFNNATLVLSI-GETVEEVSDSGFLDTTPSLAVSLIGD 511
             +   +W ++ + N E D  +V+SF   T VL + GE VEE    GF+D   +     +  
Sbjct:   407 I-KGLWPLRSDPNRETDDTLVLSFVGQTRVLMLNGEEVEETELMGFVDDQQTFFCGNVAH 465

Query:   512 DSLMQVHPSGIRHIREDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMT 569
               L+Q+  + +R + ++ +  ++EW+ P  + I     N  QVV+A+ G  L Y ++   
Sbjct:   466 QQLIQITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAV-GRALYYLQIHPQ 524

Query:   570 GQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSY-DNTIRILSLDPDDCMQILSVQ 628
              +L ++   EM  +VACLDI  + +    S   A+G + D + RIL L P    ++L  +
Sbjct:   525 -ELRQISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKL-PS--FELLHKE 580

Query:   629 SVSSP--PESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRS 686
              +     P S+L    ++S        H    +L   L +G LF   +++ TG LSD + 
Sbjct:   581 MLGGEIIPRSILMTTFESS--------H----YLLCALGDGALFYFGLNIETGLLSDRKK 628

Query:   687 RFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCV 746
               LG +P  L +        +   S RP + Y    + + + ++ + + Y    +SD   
Sbjct:   629 VTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYP 688

Query:   747 EGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQ 786
             + +     + L + TI+ + +    T +PL  +PR+   Q
Sbjct:   689 DSLALANNSTLTIGTIDEIQKLHIRT-VPLYESPRKICYQ 727

 Score = 204 (76.9 bits), Expect = 1.8e-48, Sum P(4) = 1.8e-48
 Identities = 87/352 (24%), Positives = 158/352 (44%)

Query:   864 VLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKR-NIVAGYI 922
             ++D  +       +   NE A S+ +     K+  T   VGTA     +P+      G I
Sbjct:   793 IIDQHTFEVLHAHQFLQNEYALSLVSCKL-GKDPNTYFIVGTA---MVYPEEAEPKQGRI 848

Query:   923 HIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDLGKKRLLR-KCENK 981
              ++++  +GK L+ + + +V+G   ++ +F G+LLA I   +RLY+   ++ LR +C + 
Sbjct:   849 VVFQY-SDGK-LQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTTEKELRTECNHY 906

Query:   982 LFPNTIVSI--NTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFD 1039
                N I+++   T  D I VGD+  S     Y+  E      A D  P W++A   +D D
Sbjct:   907 ---NNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDD 963

Query:  1040 TMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQF--HVGDV 1097
                GA+   N++  +  +D +   +E+    +   E G  +     + E V    H   V
Sbjct:   964 NFLGAENAFNLFVCQ--KDSAATTDEERQHLQ---EVGLFH-----LGEFVNVFCHGSLV 1013

Query:  1098 VTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDH 1157
             + +L + S  P  G SV++GTV G +G + + S     +    ++  + +    +   +H
Sbjct:  1014 MQNLGETS-TPTQG-SVLFGTVNGMIGLVTSLSE-SWYNLLLDMQNRLNKVIKSVGKIEH 1070

Query:  1158 MAYRSAYF-----PVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPGEILKK 1204
               +RS +      P    IDGDL E F  +S    +++   L    G  +K+
Sbjct:  1071 SFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKR 1122

 Score = 103 (41.3 bits), Expect = 1.8e-48, Sum P(4) = 1.8e-48
 Identities = 36/148 (24%), Positives = 63/148 (42%)

Query:     2 YLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAI 61
             Y Y +T Q+PT +   +  +F+  +   +++A+   LE+      G +  +    ++G I
Sbjct:     3 YNYVVTAQKPTAVNGCVTAHFTSAEDLNLLIAKNTRLEIYVVTAEG-LRPVKEVGMYGKI 61

Query:    62 RSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKI---HQETFGKSGCRRIVPGQ 118
               +  FR  G  KD + + +      ILEY  S    D I   H     + G R    G 
Sbjct:    62 AVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG-RPSETGI 120

Query:   119 YLAVDPKGRAVMIGACEKQKLVYVLNRD 146
                +DP+ R + +   +    V  L+RD
Sbjct:   121 IGIIDPECRMIGLRLYDGLFKVIPLDRD 148

 Score = 38 (18.4 bits), Expect = 1.8e-48, Sum P(4) = 1.8e-48
 Identities = 9/36 (25%), Positives = 17/36 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRG 428
             L + EQ++  + + D+R+  L E    +  T    G
Sbjct:   285 LEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNG 320

 Score = 38 (18.4 bits), Expect = 9.7e-42, Sum P(4) = 9.7e-42
 Identities = 7/25 (28%), Positives = 14/25 (56%)

Query:   122 VDPKGRAVMIGACEKQKLVYVLNRD 146
             VDP G   ++G  E +  + +L ++
Sbjct:   264 VDPNGSRYLLGDMEGRLFMLLLEKE 288

 Score = 37 (18.1 bits), Expect = 2.4e-10, Sum P(2) = 2.4e-10
 Identities = 6/21 (28%), Positives = 12/21 (57%)

Query:   600 RFLAVGSYDNTIRILSLDPDD 620
             R + +  YD   +++ LD D+
Sbjct:   129 RMIGLRLYDGLFKVIPLDRDN 149


>UNIPROTKB|F1RIE2 [details] [associations]
            symbol:DDB1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0043161 "proteasomal ubiquitin-dependent protein
            catabolic process" evidence=IEA] [GO:0042787 "protein
            ubiquitination involved in ubiquitin-dependent protein catabolic
            process" evidence=IEA] [GO:0031465 "Cul4B-RING ubiquitin ligase
            complex" evidence=IEA] [GO:0031464 "Cul4A-RING ubiquitin ligase
            complex" evidence=IEA] [GO:0016055 "Wnt receptor signaling pathway"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0000075
            "cell cycle checkpoint" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
            InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178 GO:GO:0005634
            GO:GO:0005737 GO:GO:0043161 GO:GO:0016055 Gene3D:2.130.10.10
            GO:GO:0003676 GO:GO:0042787 GO:GO:0000075 GO:GO:0031464
            GO:GO:0031465 KO:K10610 OMA:CALGDGS GeneTree:ENSGT00530000063396
            EMBL:CU462918 RefSeq:XP_003122699.1 Ensembl:ENSSSCT00000014314
            GeneID:100522239 KEGG:ssc:100522239 Uniprot:F1RIE2
        Length = 1140

 Score = 345 (126.5 bits), Expect = 3.3e-48, Sum P(4) = 3.3e-48
 Identities = 108/400 (27%), Positives = 190/400 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPG 452
             +V +E   +L PI+DM + +L  +   Q+ T  G     SLRI+R G+ + E A   LPG
Sbjct:   347 VVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPG 406

Query:   453 VPSAVWTVKKNVNDEFDAYIVVSFNNATLVLSI-GETVEEVSDSGFLDTTPSLAVSLIGD 511
             +   +W ++ + N E D  +V+SF   T VL + GE VEE    GF+D   +     +  
Sbjct:   407 I-KGLWPLRSDPNRETDDTLVLSFVGQTRVLMLNGEEVEETELMGFVDDQQTFFCGNVAH 465

Query:   512 DSLMQVHPSGIRHIREDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMT 569
               L+Q+  + +R + ++ +  ++EW+ P  + I     N  QVV+A+ G  L Y ++   
Sbjct:   466 QQLIQITSASVRLVSQEPKALVSEWKEPQGKNISVASCNSNQVVVAV-GRALYYLQIHPQ 524

Query:   570 GQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSY-DNTIRILSLDPDDCMQILSVQ 628
              +L ++   EM  +VACLDI  + +    S   A+G + D + RI  L P    ++L  +
Sbjct:   525 -ELRQISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARISKL-PS--FELLHKE 580

Query:   629 SVSSP--PESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRS 686
              +     P S+L    ++S        H    +L   L +G LF   +++ TG LSD + 
Sbjct:   581 MLGGEIIPRSILMTTFESS--------H----YLLCALGDGALFYFGLNIETGLLSDRKK 628

Query:   687 RFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCV 746
               LG +P  L +        +   S RP + Y    + + + ++ + + Y    +SD   
Sbjct:   629 VTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYP 688

Query:   747 EGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQ 786
             + +     + L + TI+ + +    T +PL  +PR+   Q
Sbjct:   689 DSLALANNSTLTIGTIDEIQKLHIRT-VPLYESPRKICYQ 727

 Score = 204 (76.9 bits), Expect = 3.3e-48, Sum P(4) = 3.3e-48
 Identities = 87/352 (24%), Positives = 158/352 (44%)

Query:   864 VLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKR-NIVAGYI 922
             ++D  +       +   NE A S+ +     K+  T   VGTA     +P+      G I
Sbjct:   793 IIDQHTFEVLHAHQFLQNEYALSLVSCKL-GKDPNTYFIVGTA---MVYPEEAEPKQGRI 848

Query:   923 HIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDLGKKRLLR-KCENK 981
              ++++  +GK L+ + + +V+G   ++ +F G+LLA I   +RLY+   ++ LR +C + 
Sbjct:   849 VVFQY-SDGK-LQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTTEKELRTECNHY 906

Query:   982 LFPNTIVSI--NTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFD 1039
                N I+++   T  D I VGD+  S     Y+  E      A D  P W++A   +D D
Sbjct:   907 ---NNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDD 963

Query:  1040 TMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQF--HVGDV 1097
                GA+   N++  +  +D +   +E+    +   E G  +     + E V    H   V
Sbjct:   964 NFLGAENAFNLFVCQ--KDSAATTDEERQHLQ---EVGLFH-----LGEFVNVFCHGSLV 1013

Query:  1098 VTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDH 1157
             + +L + S  P  G SV++GTV G +G + + S     +    ++  + +    +   +H
Sbjct:  1014 MQNLGETS-TPTQG-SVLFGTVNGMIGLVTSLSE-SWYNLLLDMQNRLNKVIKSVGKIEH 1070

Query:  1158 MAYRSAYF-----PVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPGEILKK 1204
               +RS +      P    IDGDL E F  +S    +++   L    G  +K+
Sbjct:  1071 SFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKR 1122

 Score = 109 (43.4 bits), Expect = 3.3e-48, Sum P(4) = 3.3e-48
 Identities = 37/148 (25%), Positives = 64/148 (43%)

Query:     2 YLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAI 61
             Y Y +T Q+PT +   + G+F+  +   +++A+   LE+      G +  +    ++G I
Sbjct:     3 YNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVVTAEG-LRPVKEVGMYGKI 61

Query:    62 RSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKI---HQETFGKSGCRRIVPGQ 118
               +  FR  G  KD + + +      ILEY  S    D I   H     + G R    G 
Sbjct:    62 AVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG-RPSETGI 120

Query:   119 YLAVDPKGRAVMIGACEKQKLVYVLNRD 146
                +DP+ R + +   +    V  L+RD
Sbjct:   121 IGIIDPECRMIGLRLYDGLFKVIPLDRD 148

 Score = 43 (20.2 bits), Expect = 5.7e-11, Sum P(2) = 5.7e-11
 Identities = 12/43 (27%), Positives = 22/43 (51%)

Query:   123 DPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHK-SHT 164
             +P+G+ + + +C   ++V  + R     L I  P E  + SHT
Sbjct:   492 EPQGKNISVASCNSNQVVVAVGR-ALYYLQIH-PQELRQISHT 532

 Score = 38 (18.4 bits), Expect = 3.3e-48, Sum P(4) = 3.3e-48
 Identities = 9/36 (25%), Positives = 17/36 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRG 428
             L + EQ++  + + D+R+  L E    +  T    G
Sbjct:   285 LEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNG 320

 Score = 38 (18.4 bits), Expect = 7.4e-41, Sum P(4) = 7.4e-41
 Identities = 7/25 (28%), Positives = 14/25 (56%)

Query:   122 VDPKGRAVMIGACEKQKLVYVLNRD 146
             VDP G   ++G  E +  + +L ++
Sbjct:   264 VDPNGSRYLLGDMEGRLFMLLLEKE 288

 Score = 37 (18.1 bits), Expect = 2.4e-10, Sum P(2) = 2.4e-10
 Identities = 6/21 (28%), Positives = 12/21 (57%)

Query:   600 RFLAVGSYDNTIRILSLDPDD 620
             R + +  YD   +++ LD D+
Sbjct:   129 RMIGLRLYDGLFKVIPLDRDN 149


>RGD|621889 [details] [associations]
            symbol:Ddb1 "damage-specific DNA binding protein 1, 127kDa"
            species:10116 "Rattus norvegicus" [GO:0000075 "cell cycle
            checkpoint" evidence=IEA;ISO] [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0003684 "damaged DNA binding" evidence=IMP]
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005634 "nucleus"
            evidence=IEA;ISO;ISS] [GO:0005737 "cytoplasm" evidence=IEA;ISO;ISS]
            [GO:0006281 "DNA repair" evidence=TAS] [GO:0016055 "Wnt receptor
            signaling pathway" evidence=IEA;ISO] [GO:0016567 "protein
            ubiquitination" evidence=IEA] [GO:0031464 "Cul4A-RING ubiquitin
            ligase complex" evidence=IEA;ISO;ISS] [GO:0031465 "Cul4B-RING
            ubiquitin ligase complex" evidence=IEA;ISO;ISS] [GO:0042787
            "protein ubiquitination involved in ubiquitin-dependent protein
            catabolic process" evidence=IEA;ISO] [GO:0043161 "proteasomal
            ubiquitin-dependent protein catabolic process"
            evidence=IEA;ISO;ISS] [GO:0080008 "Cul4-RING ubiquitin ligase
            complex" evidence=ISO;ISS] InterPro:IPR004871 InterPro:IPR015943
            Pfam:PF03178 UniPathway:UPA00143 RGD:621889 GO:GO:0005634
            GO:GO:0005737 GO:GO:0043161 GO:GO:0016055 Gene3D:2.130.10.10
            GO:GO:0003684 GO:GO:0006281 GO:GO:0042787 GO:GO:0000075
            GO:GO:0031464 GO:GO:0031465 eggNOG:NOG247734 HOGENOM:HOG000007241
            HOVERGEN:HBG005460 HSSP:Q16531 EMBL:AJ277077 IPI:IPI00324451
            UniGene:Rn.8402 IntAct:Q9ESW0 MINT:MINT-4784948 STRING:Q9ESW0
            PhosphoSite:Q9ESW0 PRIDE:Q9ESW0 UCSC:RGD:621889 InParanoid:Q9ESW0
            ArrayExpress:Q9ESW0 Genevestigator:Q9ESW0 Uniprot:Q9ESW0
        Length = 1140

 Score = 347 (127.2 bits), Expect = 8.4e-48, Sum P(4) = 8.4e-48
 Identities = 107/411 (26%), Positives = 194/411 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPG 452
             +V +E   +L PI+DM + +L  +   Q+ T  G     SLRI+R G+ + E A   LPG
Sbjct:   347 VVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPG 406

Query:   453 VPSAVWTVKKNVNDEFDAYIVVSFNNATLVLSI-GETVEEVSDSGFLDTTPSLAVSLIGD 511
             +   +W ++ + N E D  +V+SF   T VL + GE VEE    GF+D   +     +  
Sbjct:   407 I-KGLWPLRSDPNRETDDTLVLSFVGQTRVLMLNGEEVEETELMGFVDDQQTFFCGNVAH 465

Query:   512 DSLMQVHPSGIRHIREDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMT 569
               L+Q+  + +R + ++ +  ++EW+ P  + I     N  QVV+A+ G  L Y ++   
Sbjct:   466 QQLIQITSASVRLVSQEPKALVSEWKEPRAKNISVASCNSSQVVVAV-GRALYYLQIHPQ 524

Query:   570 GQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSY-DNTIRILSLDPDDCMQILSVQ 628
              +L ++   EM  +VACLD+  + +    S   A+G + D + RIL L P    ++L  +
Sbjct:   525 -ELRQISHTEMEHEVACLDVTPLGDSNGLSPLCAIGLWTDISARILKL-PS--FELLHKE 580

Query:   629 SVSSP--PESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRS 686
              +     P S+L    ++S        H    +L   L +G LF   +++ TG LSD + 
Sbjct:   581 MLGGEIIPRSILMTTFESS--------H----YLLCALGDGALFYFGLNIETGLLSDRKK 628

Query:   687 RFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCV 746
               LG +P  L +        +   S RP + Y    + + + ++ + + Y    +SD   
Sbjct:   629 VTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYP 688

Query:   747 EGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQPKKKLMVIIET 797
             + +     + L + T+  + +    T +P+  +PR+   Q   +   ++ T
Sbjct:   689 DSLALANTSTLTIGTMNEIQKLHIRT-VPIYESPRKICYQEVSQCFGVLST 738

 Score = 198 (74.8 bits), Expect = 8.4e-48, Sum P(4) = 8.4e-48
 Identities = 86/352 (24%), Positives = 155/352 (44%)

Query:   864 VLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKR-NIVAGYI 922
             ++D  +       +   NE A S+ +     K+  T   VGTA     +P+      G I
Sbjct:   793 IIDQHTFEVLHAHQFLQNEYALSLVSCKL-GKDPNTYFIVGTA---MVYPEEAEPKQGRI 848

Query:   923 HIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDLGKKRLLR-KCENK 981
              ++++   G  L+ + + +V+G   ++ +F G+LLA I   +RLY+   ++ LR +C + 
Sbjct:   849 VVFQY--SGGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTTEKELRTECNHY 906

Query:   982 LFPNTIVSI--NTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFD 1039
                N I+++   T  D I VGD+  S     Y+  E      A D  P W++A   +D D
Sbjct:   907 ---NNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDD 963

Query:  1040 TMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQF--HVGDV 1097
                GA+   N++  +  +D +   +E+    +   E G  +     + E V    H   V
Sbjct:   964 NFLGAENAFNLFVCQ--KDSAATTDEERQHLQ---EVGLFH-----LGEFVNVFCHGSLV 1013

Query:  1098 VTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDH 1157
             + +L + S  P  G SV+ GTV G +G + + S     +    ++  + +    +   +H
Sbjct:  1014 MQNLGETS-TPTQG-SVLLGTVNGMIGLVTSLSE-SWYNLLLDMQNRLNKVIKSVGKIEH 1070

Query:  1158 MAYRSAYF-----PVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPGEILKK 1204
               +RS +      P    IDGDL E F  +S    +++   L    G  +K+
Sbjct:  1071 SFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKR 1122

 Score = 109 (43.4 bits), Expect = 8.4e-48, Sum P(4) = 8.4e-48
 Identities = 37/148 (25%), Positives = 64/148 (43%)

Query:     2 YLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAI 61
             Y Y +T Q+PT +   + G+F+  +   +++A+   LE+      G +  +    ++G I
Sbjct:     3 YNYVVTAQKPTAVNGCVTGHFTSAEDINLLIAKNTRLEIYVVTAEG-LRPVKEVGMYGKI 61

Query:    62 RSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKI---HQETFGKSGCRRIVPGQ 118
               +  FR  G  KD + + +      ILEY  S    D I   H     + G R    G 
Sbjct:    62 AVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG-RPSETGI 120

Query:   119 YLAVDPKGRAVMIGACEKQKLVYVLNRD 146
                +DP+ R + +   +    V  L+RD
Sbjct:   121 IGIIDPECRMIGLRLYDGLFKVIPLDRD 148

 Score = 38 (18.4 bits), Expect = 8.4e-48, Sum P(4) = 8.4e-48
 Identities = 9/36 (25%), Positives = 17/36 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRG 428
             L + EQ++  + + D+R+  L E    +  T    G
Sbjct:   285 LEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNG 320

 Score = 38 (18.4 bits), Expect = 1.8e-40, Sum P(4) = 1.8e-40
 Identities = 7/25 (28%), Positives = 14/25 (56%)

Query:   122 VDPKGRAVMIGACEKQKLVYVLNRD 146
             VDP G   ++G  E +  + +L ++
Sbjct:   264 VDPNGSRYLLGDMEGRLFMLLLEKE 288

 Score = 38 (18.4 bits), Expect = 8.0e-10, Sum P(2) = 8.0e-10
 Identities = 11/43 (25%), Positives = 21/43 (48%)

Query:   123 DPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHK-SHT 164
             +P+ + + + +C   ++V  + R     L I  P E  + SHT
Sbjct:   492 EPRAKNISVASCNSSQVVVAVGR-ALYYLQIH-PQELRQISHT 532

 Score = 37 (18.1 bits), Expect = 1.0e-09, Sum P(2) = 1.0e-09
 Identities = 6/21 (28%), Positives = 12/21 (57%)

Query:   600 RFLAVGSYDNTIRILSLDPDD 620
             R + +  YD   +++ LD D+
Sbjct:   129 RMIGLRLYDGLFKVIPLDRDN 149


>UNIPROTKB|Q805F9 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9031
            "Gallus gallus" [GO:0003677 "DNA binding" evidence=IEA] [GO:0016567
            "protein ubiquitination" evidence=IEA] [GO:0005654 "nucleoplasm"
            evidence=TAS] [GO:0006281 "DNA repair" evidence=TAS] [GO:0080008
            "Cul4-RING ubiquitin ligase complex" evidence=ISS] [GO:0031465
            "Cul4B-RING ubiquitin ligase complex" evidence=ISS] [GO:0005634
            "nucleus" evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISS]
            [GO:0043161 "proteasomal ubiquitin-dependent protein catabolic
            process" evidence=ISS] [GO:0031464 "Cul4A-RING ubiquitin ligase
            complex" evidence=ISS] InterPro:IPR004871 InterPro:IPR015943
            Pfam:PF03178 UniPathway:UPA00143 GO:GO:0005737 GO:GO:0005654
            GO:GO:0043161 Gene3D:2.130.10.10 GO:GO:0003677 GO:GO:0006281
            GO:GO:0016567 Reactome:REACT_115612 GO:GO:0031464 GO:GO:0031465
            eggNOG:NOG247734 HOGENOM:HOG000007241 KO:K10610 CTD:1642
            HOVERGEN:HBG005460 OrthoDB:EOG4KPT91 HSSP:Q16531 EMBL:AB074298
            EMBL:AJ719779 IPI:IPI00597295 RefSeq:NP_989547.1 UniGene:Gga.12977
            STRING:Q805F9 PRIDE:Q805F9 GeneID:374050 KEGG:gga:374050
            NextBio:20813572 Uniprot:Q805F9
        Length = 1140

 Score = 353 (129.3 bits), Expect = 1.1e-47, Sum P(5) = 1.1e-47
 Identities = 109/400 (27%), Positives = 191/400 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPG 452
             +V +E   +L PI+DM + +L  +   Q+ T  G     SLRI+R G+ + E A   LPG
Sbjct:   347 VVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPG 406

Query:   453 VPSAVWTVKKNVNDEFDAYIVVSFNNATLVLSI-GETVEEVSDSGFLDTTPSLAVSLIGD 511
             +   +W ++ + + E D  +V+SF   T VL + GE VEE   +GF+D   +     +  
Sbjct:   407 I-KGLWPLRSDSHREMDNMLVLSFVGQTRVLMLNGEEVEETELTGFVDDQQTFFCGNVAH 465

Query:   512 DSLMQVHPSGIRHIREDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMT 569
               L+Q+  + +R + ++ +  ++EW+ P  + I     N  QVV+A+ G  L Y E+   
Sbjct:   466 QQLIQITSASVRLVSQEPKALVSEWKEPNGKNISVASCNSNQVVVAV-GRALYYLEI-RP 523

Query:   570 GQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSY-DNTIRILSLDPDDCMQILSVQ 628
              +L ++   EM  +VACLDI  + +    S   A+G + D + RIL L P    ++L  +
Sbjct:   524 QELRQINCTEMEHEVACLDITPLGDTNGMSPLCAIGLWTDISARILKL-PS--FELLHKE 580

Query:   629 SVSSP--PESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRS 686
              +     P S+L    ++S        H    +L   L +G LF   + + TG LSD + 
Sbjct:   581 MLGGEIIPRSILMTTFESS--------H----YLLCALGDGALFYFGLSLETGLLSDRKK 628

Query:   687 RFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCV 746
               LG +P  L +        +   S RP + Y    + + + ++ + + Y    +SD   
Sbjct:   629 VTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYP 688

Query:   747 EGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQ 786
             + +     + L + TI+ + +    T +PL  +PR+   Q
Sbjct:   689 DSLALANNSTLTIGTIDEIQKLHIRT-VPLYESPRKICYQ 727

 Score = 206 (77.6 bits), Expect = 1.1e-47, Sum P(5) = 1.1e-47
 Identities = 85/350 (24%), Positives = 153/350 (43%)

Query:   864 VLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKR-NIVAGYI 922
             ++D  +       +   NE A S+ +     K+  T   VGTA     +P+      G I
Sbjct:   793 IIDQHTFEVLHAHQFLQNEYALSLVSCKL-GKDPNTYFIVGTA---MVYPEEAEPKQGRI 848

Query:   923 HIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYD-LGKKRLLRKCENK 981
              ++ +  +GK L+ L + +V+G   ++ +F G+LLA I   +RLY+   +K L  +C + 
Sbjct:   849 VVFHY-SDGK-LQSLAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTAEKELRTECNHY 906

Query:   982 LFPNTIVSI--NTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFD 1039
                N I+++   T  D I VGD+  S     Y+  E      A D  P W++A   +D D
Sbjct:   907 ---NNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDD 963

Query:  1040 TMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQFHVGDVVT 1099
                GA+   N++  +  +D +   +E+        +  +  G  +  E +  F  G +V 
Sbjct:   964 NFLGAENAFNLFVCQ--KDSAATTDEER-------QHLQEVGLSHLGEFVNVFCHGSLVM 1014

Query:  1100 SLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDHMA 1159
                  +  P  G SV++GTV G +G + + S     +    ++  + +    +   +H  
Sbjct:  1015 QNLGETSTPTQG-SVLFGTVNGMIGLVTSLSE-SWYNLLLDMQNRLNKVIKSVGKIEHSF 1072

Query:  1160 YRSAYF-----PVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPGEILKK 1204
             +RS +      P    IDGDL E F  +S    +++   L    G  +K+
Sbjct:  1073 WRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKR 1122

 Score = 99 (39.9 bits), Expect = 1.1e-47, Sum P(5) = 1.1e-47
 Identities = 34/134 (25%), Positives = 59/134 (44%)

Query:     2 YLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAI 61
             Y Y +T Q+PT +   + G+F+  +   +++A+   LE+      G +  +    ++G  
Sbjct:     3 YNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVVTAEG-LRPVKEVGMYGKT 61

Query:    62 RSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKI---HQETFGKSGCRRIVPGQ 118
               +  FR  G  KD + + +      ILEY  + +  D I   H     + G R    G 
Sbjct:    62 AVMELFRPKGESKDLLFILTAKYNACILEYKQNGDNIDIITRAHGNVQDRIG-RPSETGI 120

Query:   119 YLAVDPKGRAVMIG 132
                +DP+ R  MIG
Sbjct:   121 IGIIDPECR--MIG 132

 Score = 41 (19.5 bits), Expect = 3.6e-16, Sum P(3) = 3.6e-16
 Identities = 5/23 (21%), Positives = 13/23 (56%)

Query:   123 DPKGRAVMIGACEKQKLVYVLNR 145
             +P G+ + + +C   ++V  + R
Sbjct:   492 EPNGKNISVASCNSNQVVVAVGR 514

 Score = 38 (18.4 bits), Expect = 1.1e-47, Sum P(5) = 1.1e-47
 Identities = 7/25 (28%), Positives = 14/25 (56%)

Query:   122 VDPKGRAVMIGACEKQKLVYVLNRD 146
             VDP G   ++G  E +  + +L ++
Sbjct:   264 VDPNGSRYLLGDMEGRLFMLLLEKE 288

 Score = 38 (18.4 bits), Expect = 1.1e-47, Sum P(5) = 1.1e-47
 Identities = 9/36 (25%), Positives = 17/36 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRG 428
             L + EQ++  + + D+R+  L E    +  T    G
Sbjct:   285 LEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNG 320


>UNIPROTKB|F5GY55 [details] [associations]
            symbol:DDB1 "Uncharacterized protein" species:9606 "Homo
            sapiens" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 GO:GO:0005634 Gene3D:2.130.10.10
            GO:GO:0003676 EMBL:AP003108 HGNC:HGNC:2717 ChiTaRS:DDB1
            EMBL:AP003037 IPI:IPI00977083 SMR:F5GY55 Ensembl:ENST00000540166
            Uniprot:F5GY55
        Length = 1092

 Score = 353 (129.3 bits), Expect = 1.3e-46, Sum P(4) = 1.3e-46
 Identities = 109/400 (27%), Positives = 191/400 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPG 452
             +V +E   +L PI+DM + +L  +   Q+ T  G     SLRI+R G+ + E A   LPG
Sbjct:   347 VVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPG 406

Query:   453 VPSAVWTVKKNVNDEFDAYIVVSFNNATLVLSI-GETVEEVSDSGFLDTTPSLAVSLIGD 511
             +   +W ++ + N E D  +V+SF   T VL + GE VEE    GF+D   +     +  
Sbjct:   407 I-KGLWPLRSDPNRETDDTLVLSFVGQTRVLMLNGEEVEETELMGFVDDQQTFFCGNVAH 465

Query:   512 DSLMQVHPSGIRHIREDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMT 569
               L+Q+  + +R + ++ +  ++EW+ P  + I     N  QVV+A+ G  L Y ++   
Sbjct:   466 QQLIQITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAV-GRALYYLQIHPQ 524

Query:   570 GQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSY-DNTIRILSLDPDDCMQILSVQ 628
              +L ++   EM  +VACLDI  + +    S   A+G + D + RIL L P    ++L  +
Sbjct:   525 -ELRQISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKL-PS--FELLHKE 580

Query:   629 SVSSP--PESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRS 686
              +     P S+L    ++S        H    +L   L +G LF   +++ TG LSD + 
Sbjct:   581 MLGGEIIPRSILMTTFESS--------H----YLLCALGDGALFYFGLNIETGLLSDRKK 628

Query:   687 RFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCV 746
               LG +P  L +        +   S RP + Y    + + + ++ + + Y    +SD   
Sbjct:   629 VTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYP 688

Query:   747 EGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQ 786
             + +     + L + TI+ + +    T +PL  +PR+   Q
Sbjct:   689 DSLALANNSTLTIGTIDEIQKLHIRT-VPLYESPRKICYQ 727

 Score = 179 (68.1 bits), Expect = 1.3e-46, Sum P(4) = 1.3e-46
 Identities = 72/273 (26%), Positives = 128/273 (46%)

Query:   864 VLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKR-NIVAGYI 922
             ++D  +       +   NE A S+ +     K+  T   VGTA     +P+      G I
Sbjct:   793 IIDQHTFEVLHAHQFLQNEYALSLVSCKL-GKDPNTYFIVGTA---MVYPEEAEPKQGRI 848

Query:   923 HIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDLGKKRLLR-KCENK 981
              ++++  +GK L+ + + +V+G   ++ +F G+LLA I   +RLY+   ++ LR +C + 
Sbjct:   849 VVFQY-SDGK-LQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTTEKELRTECNHY 906

Query:   982 LFPNTIVSI--NTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFD 1039
                N I+++   T  D I VGD+  S     Y+  E      A D  P W++A   +D D
Sbjct:   907 ---NNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDD 963

Query:  1040 TMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQF--HVGDV 1097
                GA+   N++  +  +D +   +E+    +   E G  +     + E V    H   V
Sbjct:   964 NFLGAENAFNLFVCQ--KDSAATTDEERQHLQ---EVGLFH-----LGEFVNVFCHGSLV 1013

Query:  1098 VTSLQKASLVPGGGESVIYGTVMGSLGAMLAFS 1130
             + +L + S  P  G SV++GTV G +G + + S
Sbjct:  1014 MQNLGETS-TPTQG-SVLFGTVNGMIGLVTSLS 1044

 Score = 109 (43.4 bits), Expect = 1.3e-46, Sum P(4) = 1.3e-46
 Identities = 37/148 (25%), Positives = 64/148 (43%)

Query:     2 YLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAI 61
             Y Y +T Q+PT +   + G+F+  +   +++A+   LE+      G +  +    ++G I
Sbjct:     3 YNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVVTAEG-LRPVKEVGMYGKI 61

Query:    62 RSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKI---HQETFGKSGCRRIVPGQ 118
               +  FR  G  KD + + +      ILEY  S    D I   H     + G R    G 
Sbjct:    62 AVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG-RPSETGI 120

Query:   119 YLAVDPKGRAVMIGACEKQKLVYVLNRD 146
                +DP+ R + +   +    V  L+RD
Sbjct:   121 IGIIDPECRMIGLRLYDGLFKVIPLDRD 148

 Score = 38 (18.4 bits), Expect = 1.3e-46, Sum P(4) = 1.3e-46
 Identities = 9/36 (25%), Positives = 17/36 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRG 428
             L + EQ++  + + D+R+  L E    +  T    G
Sbjct:   285 LEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNG 320

 Score = 38 (18.4 bits), Expect = 2.8e-39, Sum P(4) = 2.8e-39
 Identities = 7/25 (28%), Positives = 14/25 (56%)

Query:   122 VDPKGRAVMIGACEKQKLVYVLNRD 146
             VDP G   ++G  E +  + +L ++
Sbjct:   264 VDPNGSRYLLGDMEGRLFMLLLEKE 288

 Score = 37 (18.1 bits), Expect = 9.2e-08, Sum P(2) = 9.2e-08
 Identities = 6/21 (28%), Positives = 12/21 (57%)

Query:   600 RFLAVGSYDNTIRILSLDPDD 620
             R + +  YD   +++ LD D+
Sbjct:   129 RMIGLRLYDGLFKVIPLDRDN 149


>UNIPROTKB|J9NVR7 [details] [associations]
            symbol:DDB1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0005634 "nucleus" evidence=IEA] [GO:0003676
            "nucleic acid binding" evidence=IEA] InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 GO:GO:0005634 Gene3D:2.130.10.10
            GO:GO:0003676 GeneTree:ENSGT00530000063396 EMBL:AAEX03011677
            Ensembl:ENSCAFT00000049486 Uniprot:J9NVR7
        Length = 1084

 Score = 352 (129.0 bits), Expect = 1.5e-46, Sum P(4) = 1.5e-46
 Identities = 109/400 (27%), Positives = 191/400 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPG 452
             +V +E   +L PI+DM + +L  +   Q+ T  G     SLRI+R G+ + E A   LPG
Sbjct:   347 VVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPG 406

Query:   453 VPSAVWTVKKNVNDEFDAYIVVSFNNATLVLSI-GETVEEVSDSGFLDTTPSLAVSLIGD 511
             +   +W ++ + N E D  +V+SF   T VL + GE VEE    GF+D   +     +  
Sbjct:   407 I-KGLWPLRSDPNRETDDTLVLSFVGQTRVLMLNGEEVEETELMGFVDDQQTFFCGNVAH 465

Query:   512 DSLMQVHPSGIRHIREDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMT 569
               L+Q+  + +R + ++ +  ++EW+ P  + I     N  QVV+A+ G  L Y ++   
Sbjct:   466 QQLIQITSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAV-GRALYYLQIHPQ 524

Query:   570 GQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSY-DNTIRILSLDPDDCMQILSVQ 628
              +L ++   EM  +VACLDI  + +    S   A+G + D + RIL L P    ++L  +
Sbjct:   525 -ELRQISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKL-PS--FELLHKE 580

Query:   629 SVSSP--PESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRS 686
              +     P S+L    ++S        H    +L   L +G LF   +++ TG LSD + 
Sbjct:   581 MLGGEIIPRSILMTTFESS--------H----YLLCALGDGALFYFGLNIETGLLSDRKK 628

Query:   687 RFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCV 746
               LG +P  L +        +   S RP + Y    + + + ++ + + Y    +SD   
Sbjct:   629 VTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYP 688

Query:   747 EGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQ 786
             + +     + L + TI+ + +    T +PL  +PR+   Q
Sbjct:   689 DSLALANNSTLTIGTIDEIQKLHIRT-VPLYESPRKICYQ 727

 Score = 179 (68.1 bits), Expect = 1.5e-46, Sum P(4) = 1.5e-46
 Identities = 72/273 (26%), Positives = 128/273 (46%)

Query:   864 VLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKR-NIVAGYI 922
             ++D  +       +   NE A S+ +     K+  T   VGTA     +P+      G I
Sbjct:   793 IIDQHTFEVLHAHQFLQNEYALSLVSCKL-GKDPNTYFIVGTA---MVYPEEAEPKQGRI 848

Query:   923 HIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDLGKKRLLR-KCENK 981
              ++++  +GK L+ + + +V+G   ++ +F G+LLA I   +RLY+   ++ LR +C + 
Sbjct:   849 VVFQY-SDGK-LQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTTEKELRTECNHY 906

Query:   982 LFPNTIVSI--NTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFD 1039
                N I+++   T  D I VGD+  S     Y+  E      A D  P W++A   +D D
Sbjct:   907 ---NNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDD 963

Query:  1040 TMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQF--HVGDV 1097
                GA+   N++  +  +D +   +E+    +   E G  +     + E V    H   V
Sbjct:   964 NFLGAENAFNLFVCQ--KDSAATTDEERQHLQ---EVGLFH-----LGEFVNVFCHGSLV 1013

Query:  1098 VTSLQKASLVPGGGESVIYGTVMGSLGAMLAFS 1130
             + +L + S  P  G SV++GTV G +G + + S
Sbjct:  1014 MQNLGETS-TPTQG-SVLFGTVNGMIGLVTSLS 1044

 Score = 109 (43.4 bits), Expect = 1.5e-46, Sum P(4) = 1.5e-46
 Identities = 37/148 (25%), Positives = 64/148 (43%)

Query:     2 YLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAI 61
             Y Y +T Q+PT +   + G+F+  +   +++A+   LE+      G +  +    ++G I
Sbjct:     3 YNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVVTAEG-LRPVKEVGMYGKI 61

Query:    62 RSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKI---HQETFGKSGCRRIVPGQ 118
               +  FR  G  KD + + +      ILEY  S    D I   H     + G R    G 
Sbjct:    62 AVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG-RPSETGI 120

Query:   119 YLAVDPKGRAVMIGACEKQKLVYVLNRD 146
                +DP+ R + +   +    V  L+RD
Sbjct:   121 IGIIDPECRMIGLRLYDGLFKVIPLDRD 148

 Score = 43 (20.2 bits), Expect = 2.2e-08, Sum P(2) = 2.2e-08
 Identities = 12/43 (27%), Positives = 22/43 (51%)

Query:   123 DPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHK-SHT 164
             +P+G+ + + +C   ++V  + R     L I  P E  + SHT
Sbjct:   492 EPQGKNISVASCNSSQVVVAVGR-ALYYLQIH-PQELRQISHT 532

 Score = 38 (18.4 bits), Expect = 1.5e-46, Sum P(4) = 1.5e-46
 Identities = 9/36 (25%), Positives = 17/36 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRG 428
             L + EQ++  + + D+R+  L E    +  T    G
Sbjct:   285 LEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNG 320

 Score = 38 (18.4 bits), Expect = 3.4e-39, Sum P(4) = 3.4e-39
 Identities = 7/25 (28%), Positives = 14/25 (56%)

Query:   122 VDPKGRAVMIGACEKQKLVYVLNRD 146
             VDP G   ++G  E +  + +L ++
Sbjct:   264 VDPNGSRYLLGDMEGRLFMLLLEKE 288

 Score = 37 (18.1 bits), Expect = 9.1e-08, Sum P(2) = 9.1e-08
 Identities = 6/21 (28%), Positives = 12/21 (57%)

Query:   600 RFLAVGSYDNTIRILSLDPDD 620
             R + +  YD   +++ LD D+
Sbjct:   129 RMIGLRLYDGLFKVIPLDRDN 149


>UNIPROTKB|Q6P6Z0 [details] [associations]
            symbol:ddb1 "DNA damage-binding protein 1" species:8355
            "Xenopus laevis" [GO:0005634 "nucleus" evidence=ISS] [GO:0005737
            "cytoplasm" evidence=ISS] [GO:0031464 "Cul4A-RING ubiquitin ligase
            complex" evidence=ISS] [GO:0031465 "Cul4B-RING ubiquitin ligase
            complex" evidence=ISS] [GO:0043161 "proteasomal ubiquitin-dependent
            protein catabolic process" evidence=ISS] [GO:0080008 "Cul4-RING
            ubiquitin ligase complex" evidence=ISS] InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 UniPathway:UPA00143 GO:GO:0005634
            GO:GO:0005737 GO:GO:0043161 Gene3D:2.130.10.10 GO:GO:0003677
            GO:GO:0006281 GO:GO:0016567 GO:GO:0031464 GO:GO:0031465 KO:K10610
            CTD:1642 HOVERGEN:HBG005460 HSSP:Q16531 EMBL:BC061946
            RefSeq:NP_001083624.1 UniGene:Xl.23906 PRIDE:Q6P6Z0 GeneID:399026
            KEGG:xla:399026 Xenbase:XB-GENE-967911 Uniprot:Q6P6Z0
        Length = 1140

 Score = 337 (123.7 bits), Expect = 2.8e-46, Sum P(4) = 2.8e-46
 Identities = 108/400 (27%), Positives = 191/400 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPG 452
             +V +E   +L PI+DM + +L  +   Q+ T  G     SLRI+R G+ + E A   LPG
Sbjct:   347 VVVMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPG 406

Query:   453 VPSAVWTVKKNVNDEFDAYIVVSFNNATLVLSI-GETVEEVSDSGFLDTTPSLAVSLIGD 511
             +   +W ++   + + D  +V+SF   T VL++ GE VEE   +GF+D   +     +  
Sbjct:   407 I-KGLWPLRVAADRDTDDTLVLSFVGQTRVLTLTGEEVEETDLAGFVDDQQTFFCGNVAH 465

Query:   512 DSLMQVHPSGIRHIREDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMT 569
               L+Q+  + +R + ++ +  ++EW+ P  R +     N  QV++A+ G  L Y E+   
Sbjct:   466 QQLIQITSASVRLVSQNPQNLVSEWKEPQGRKVSVCSCNSRQVLLAV-GRVLYYLEIH-P 523

Query:   570 GQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSY-DNTIRILSLDPDDCMQILSVQ 628
             G+L +    EM  +VACLD+  +      S   A+G + D + RILSL P    Q+L  +
Sbjct:   524 GELRQTSCTEMEHEVACLDVTPLGGNDTLSSLCAIGLWTDISARILSL-PG--FQLLHKE 580

Query:   629 SVSSP--PESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRS 686
              +     P S+L    ++S        H    +L   L +G LF   ++  TG LSD + 
Sbjct:   581 MLGGEIIPRSILMTSFESS--------H----YLLCALGDGALFYFSLNTDTGLLSDRKK 628

Query:   687 RFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCV 746
               LG +P  L +        +   S RP + Y    + + + ++ + + Y    +S+   
Sbjct:   629 VTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSEGYP 688

Query:   747 EGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQ 786
             + +     + L + TI+ + +    T +PL  +PR+   Q
Sbjct:   689 DSLALANNSTLTIGTIDEIQKLHIRT-VPLFESPRKICYQ 727

 Score = 196 (74.1 bits), Expect = 2.8e-46, Sum P(4) = 2.8e-46
 Identities = 85/351 (24%), Positives = 156/351 (44%)

Query:   864 VLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIH 923
             ++D  +       +   NE   S+ +     K+  T   VGTA  + +  +     G I 
Sbjct:   793 IIDQHTFEVLHTHQFLQNEYTLSLVSCKL-GKDPTTYFVVGTA--MVYPDEAEPKQGRIV 849

Query:   924 IYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYD-LGKKRLLRKCENKL 982
             ++++  +GK L+ + + +V+G   ++ +F G+LLA I   +RLY+   +K L  +C +  
Sbjct:   850 VFQY-NDGK-LQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTAEKELRTECNHY- 906

Query:   983 FPNTIVSI--NTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDT 1040
               N I+++   T  D I VGD+  S     Y+  E      A D  P W++A   +D D 
Sbjct:   907 --NNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDN 964

Query:  1041 MAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQF--HVGDVV 1098
               GA+   N++  +  +D +   +E+    +   E G  +     + E V    H   V+
Sbjct:   965 FLGAENAFNLFVCQ--KDSAATTDEERQHLQ---EVGLFH-----LGEFVNVFCHGSLVM 1014

Query:  1099 TSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDHM 1158
              +L + S  P  G SV++GTV G +G + + S     +    ++  + +    +   +H 
Sbjct:  1015 QNLGETS-PPTQG-SVLFGTVNGMIGLVTSLSE-SWYNLLLDVQNRLNKVIKSVGKIEHS 1071

Query:  1159 AYRSAYF-----PVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPGEILKK 1204
              +RS +      P    IDGDL E F  +S    +++   L    G  +K+
Sbjct:  1072 FWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVIANLQIDDGSGMKR 1122

 Score = 108 (43.1 bits), Expect = 2.8e-46, Sum P(4) = 2.8e-46
 Identities = 38/148 (25%), Positives = 63/148 (42%)

Query:     2 YLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAI 61
             Y Y +T Q+PT + A + G+F+      +++A+   LE+      G +  +    ++G I
Sbjct:     3 YNYVVTAQKPTAVNACVTGHFTSEDDLNLLIAKNTRLEIYVVTPEG-LRPVKEVGMYGKI 61

Query:    62 RSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKI---HQETFGKSGCRRIVPGQ 118
               +  FR  G  KD + + +      ILEY  S +  D I   H     + G R    G 
Sbjct:    62 AVMELFRPKGESKDLLFILTAKYNACILEYKQSGDSIDIITRAHGNVQDRIG-RPSETGI 120

Query:   119 YLAVDPKGRAVMIGACEKQKLVYVLNRD 146
                +DP  R + +   +    V  L RD
Sbjct:   121 IGIIDPDCRMIGLRLYDGLFKVIPLERD 148

 Score = 42 (19.8 bits), Expect = 5.0e-10, Sum P(2) = 5.0e-10
 Identities = 6/23 (26%), Positives = 15/23 (65%)

Query:   123 DPKGRAVMIGACEKQKLVYVLNR 145
             +P+GR V + +C  ++++  + R
Sbjct:   492 EPQGRKVSVCSCNSRQVLLAVGR 514

 Score = 37 (18.1 bits), Expect = 2.8e-46, Sum P(4) = 2.8e-46
 Identities = 9/36 (25%), Positives = 17/36 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRG 428
             L + EQ++  + + D+R+  L E    +  T    G
Sbjct:   285 LEKEEQMDGSVTLKDLRVELLGETSIAECLTYLDNG 320


>DICTYBASE|DDB_G0286013 [details] [associations]
            symbol:repE "UV-damaged DNA binding protein1"
            species:44689 "Dictyostelium discoideum" [GO:0005634 "nucleus"
            evidence=IEA;ISS;IDA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0006974 "response to DNA damage stimulus" evidence=IEA;IEP]
            [GO:0006289 "nucleotide-excision repair" evidence=ISS] [GO:0003684
            "damaged DNA binding" evidence=ISS] [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0006281 "DNA repair" evidence=IEA]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0003677 "DNA binding"
            evidence=IEA] [GO:0016567 "protein ubiquitination" evidence=IEA]
            InterPro:IPR017986 InterPro:IPR004871 Pfam:PF03178
            UniPathway:UPA00143 dictyBase:DDB_G0286013 GO:GO:0005634
            GO:GO:0005737 GenomeReviews:CM000153_GR SUPFAM:SSF50978
            GO:GO:0003684 GO:GO:0016567 EMBL:AAFI02000085 GO:GO:0006289
            eggNOG:NOG247734 KO:K10610 OMA:CALGDGS EMBL:U50042 PIR:S71092
            RefSeq:XP_637896.2 STRING:B0M0P5 EnsemblProtists:DDB0191144
            GeneID:8625406 KEGG:ddi:DDB_G0286013 ProtClustDB:CLSZ2430134
            Uniprot:B0M0P5
        Length = 1181

 Score = 279 (103.3 bits), Expect = 2.2e-45, Sum P(3) = 2.2e-45
 Identities = 125/539 (23%), Positives = 238/539 (44%)

Query:   297 FLLQTEYGDIFKVTLEHDNEHVSELKIKYFDTIPVTASMCVLKSGYLFAASEFGNHALYQ 356
             FL    +G +  + L H  + V ELK +    I + +S+  L SG ++  S  G+  L +
Sbjct:   306 FLFGDHFGRLSVLVLIHQQQKVMELKFEQLGRISIPSSISYLDSGVVYIGSSSGDSQLIR 365

Query:   357 FQAIGADPDVEASSSTLMETEEGFQPVFFQPRGLKNLVRIEQVESLMPIMDMRIANLFEE 416
                +  + D         +T + +            +  +E   ++ P++D  + +  ++
Sbjct:   366 ---LNTEKD---------QTTDSY------------VTYLEAFTNIGPVVDFCVVDAEKQ 401

Query:   417 EAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVN----------- 465
                QI T  G     SLRI+R G+ ++E A  +L G+   ++ +  N N           
Sbjct:   402 GQAQIVTCSGTYRDGSLRIIRNGIGIAEQASIELEGI-KGIFPINNNNNNNNNNNNNNNN 460

Query:   466 ----------DEFDAYIVVSFNNATLVLSI-GETVEEVSDSGFLDTTPSLAVSLIGD-DS 513
                       D  D Y++ SF   T VLS  GE +EE    G      +L    I   + 
Sbjct:   461 NNNNNSNGITDSKDRYLITSFIECTKVLSFQGEEIEETEFEGLESNCSTLYCGTIDKLNL 520

Query:   514 LMQVHPSGIRHIREDG--RINEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQ 571
             L+Q+    I  I  +   R+++W     R I  V +N+ Q+V+++    L+YF+++ + +
Sbjct:   521 LIQITNVSINLIDSNTFKRVSQWNVEPSRRINLVSTNQDQIVLSIDKS-LLYFQINSSNK 579

Query:   572 LLEVEKH-EMSGDVACLDIASVPEGRK-RSRFLAVGSY-DNTIRILSLDPDDCMQILSVQ 628
              +++ K  E+  +++C+DI+        +S+ ++VG + D T+RI  L P   ++ +  +
Sbjct:   580 SIQLVKEIELPHEISCIDISPFDSFMDTKSQLVSVGLWNDITLRIFKL-PT--LEEIWKE 636

Query:   629 SVSSP--PESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRS 686
              +     P S+L +         D  D+   +F + G  +G LF+   D  + +L D R 
Sbjct:   637 PLGGEILPRSILMISF-------DSIDY---IFCSLG--DGHLFKFQFDFSSFKLFDKRK 684

Query:   687 RFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCV 746
               LG +P  L    +     +  +S RP + Y H  +   + ++ + +    SF+SD   
Sbjct:   685 LTLGTQPIILKKFKLKNTINIFAISDRPTVIYSHNKKLFYSVVNLKDVTNVTSFNSDGFP 744

Query:   747 EGVVSVAGNALRVFTIERLGETFNETALPLRYTP-RRFV-LQPKKKLMVI-IETDQGAL 802
               +     N+L + TI+ + +   +T +PL     RR V L+      VI ++ ++G L
Sbjct:   745 NSMAIATTNSLTIGTIDEIQKLHIKT-IPLNEEMGRRIVHLEDHSCYAVITVKNNEGLL 802

 Score = 179 (68.1 bits), Expect = 2.2e-45, Sum P(3) = 2.2e-45
 Identities = 91/391 (23%), Positives = 160/391 (40%)

Query:   854 ESDKWVSCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWP 913
             E D+ VS IR+ + ++       +L   E  +SI    F   +  T LAVGT+       
Sbjct:   810 EEDEEVSYIRIYNDQTFELISSYKLDPYEMGWSITPCKFAGDDVNTYLAVGTSINTPIKS 869

Query:   914 KRNIVAGYIHIYRFVEEGKSLE----------------LLHKTQVEGIPLALCQFQGRLL 957
                ++   +       +  SL+                LL + +       L  F GRL+
Sbjct:   870 SGRVLLFSLSSSSSSNDKDSLDNNNNNNNNSGANGKLTLLEEIKFRSSVYFLLSFNGRLI 929

Query:   958 AGIGPVLRL--YDLGKKRLLR--KCENKLFPNTIVSINTYRDR-IYVGDIQESFHFCKYR 1012
             A +   L    Y   K++  +    E+    +T++     R   I VGD+ +S      +
Sbjct:   930 AAVHKRLFSIRYTHSKEKNCKVISSESVHKGHTMILKLASRGHFILVGDMMKSMSLLVEQ 989

Query:  1013 RDENQLYIFADDSVPRWLTAAHHIDFDTMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKI 1072
              D   L   A +  P W+ +   I+ D   GA+   N   V+   D ++E+E +      
Sbjct:   990 SD-GSLEQIARNPQPIWIRSVAMINDDYFIGAEASNNFIVVKKNNDSTNELERE------ 1042

Query:  1073 KWEQGKLNGAPNKMEEIVQFHVGDVVTSLQKASLV--PGGGESVI----YGTVMGSLGAM 1126
                          ++ +  +H+G+ + S++  SLV  P   + +I    Y +V GS+G +
Sbjct:  1043 ------------LLDSVGHYHIGESINSMRHGSLVRLPDSDQPIIPTILYASVNGSIGVV 1090

Query:  1127 LAFSSRDDVDFFSHLEMHMRQEHPPLCGRDHMAYRSA---YFPV--KDVIDGDLCEQFPT 1181
              + S  D + FFS L+  + Q    + G  H  +R+    +  +  K+ IDGDL E F  
Sbjct:  1091 ASISEEDFI-FFSKLQKGLNQVVRGVGGFSHETWRAFSNDHHTIDSKNFIDGDLIETFLD 1149

Query:  1182 LSLDLQRKIADELDRTPGEILKKLEEIRNKI 1212
             L  + Q K   +L  TP +  +++E +   I
Sbjct:  1150 LKYESQLKAVADLGITPDDAFRRIESLMQYI 1180

 Score = 168 (64.2 bits), Expect = 2.2e-45, Sum P(3) = 2.2e-45
 Identities = 41/153 (26%), Positives = 77/153 (50%)

Query:     1 MYLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGA 60
             MY +  T+Q+PT +  ++ GNF+G     +++++   +E+   +  G ++ +    I+G 
Sbjct:     1 MYNFVSTVQKPTSVTHSVTGNFTGPNDKNLIISKCTKIEIFLMDQDG-LKPMFDVNIYGR 59

Query:    61 IRSLAQFRLTGSQKDYIVVGSDSGRIVILEYN-PSKNVFDKIHQETFGKSGCRRIVPGQY 119
             I  L  F + GS++DY+ + ++S +  IL Y+   K +  K         G R    GQ 
Sbjct:    60 ISVLKLFSVAGSKQDYLFISTESFKFCILAYDYEKKEIITKASGNAEDTIG-RPTEAGQL 118

Query:   120 LAVDPKGRAVMIGACEKQ-KLVYVLNRDTAARL 151
               +DP GR V +   E   KL+ + N +T  ++
Sbjct:   119 GIIDPDGRIVALHLYEGLLKLITLDNNNTPNKI 151

 Score = 93 (37.8 bits), Expect = 1.1e-25, Sum P(3) = 1.1e-25
 Identities = 48/183 (26%), Positives = 78/183 (42%)

Query:   204 QKNLTFYELDL-GLNHVSRKWSEP-VDNGANMLVTVPGGGDGPSGVLVCAENFVIYKNQG 261
             +K+++ YE+       V   WS+  V   +++LV VP GG     VLV A+N + Y N G
Sbjct:   227 EKHISTYEISSKDTELVVGPWSQSNVGVYSSLLVPVPLGG-----VLVVADNGITYLN-G 280

Query:   262 HPDVRAVIPRRADLPAERGVLIVSAATHRQKTLFFFLLQTEYGDIFKVTLEHDNEHVSEL 321
                    + R   +   + +    A T   K    FL    +G +  + L H  + V EL
Sbjct:   281 K------VTRSVAVSYTKFL----AFTRVDKDGSRFLFGDHFGRLSVLVLIHQQQKVMEL 330

Query:   322 KIKYFDTIPVTASMCVLKSGYLFAASEFGNHALYQFQAIGADPDVEASSS-TLMETEEGF 380
             K +    I + +S+  L SG ++  S  G+  L +   +  + D    S  T +E     
Sbjct:   331 KFEQLGRISIPSSISYLDSGVVYIGSSSGDSQLIR---LNTEKDQTTDSYVTYLEAFTNI 387

Query:   381 QPV 383
              PV
Sbjct:   388 GPV 390

 Score = 39 (18.8 bits), Expect = 2.7e-11, Sum P(3) = 2.7e-11
 Identities = 18/92 (19%), Positives = 40/92 (43%)

Query:   990 INTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMAGADKFGN 1049
             ++T +D+I +  I +S  + +       + +  +  +P  ++      FD+    D    
Sbjct:   554 VSTNQDQIVLS-IDKSLLYFQINSSNKSIQLVKEIELPHEISCIDISPFDSFM--DTKSQ 610

Query:  1050 IYFVRLPQDVS---------DEIEEDPTGGKI 1072
             +  V L  D++         +EI ++P GG+I
Sbjct:   611 LVSVGLWNDITLRIFKLPTLEEIWKEPLGGEI 642


>UNIPROTKB|F1P4I8 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9031
            "Gallus gallus" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 GO:GO:0005634 Gene3D:2.130.10.10
            GO:GO:0003676 GeneTree:ENSGT00530000063396 EMBL:AADN02017118
            EMBL:AADN02017119 IPI:IPI00818299 Ensembl:ENSGALT00000008352
            ArrayExpress:F1P4I8 Uniprot:F1P4I8
        Length = 1120

 Score = 353 (129.3 bits), Expect = 3.0e-44, Sum P(5) = 3.0e-44
 Identities = 109/400 (27%), Positives = 191/400 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPG 452
             +V +E   +L PI+DM + +L  +   Q+ T  G     SLRI+R G+ + E A   LPG
Sbjct:   327 VVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPG 386

Query:   453 VPSAVWTVKKNVNDEFDAYIVVSFNNATLVLSI-GETVEEVSDSGFLDTTPSLAVSLIGD 511
             +   +W ++ + + E D  +V+SF   T VL + GE VEE   +GF+D   +     +  
Sbjct:   387 I-KGLWPLRSDSHREMDNMLVLSFVGQTRVLMLNGEEVEETELTGFVDDQQTFFCGNVAH 445

Query:   512 DSLMQVHPSGIRHIREDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMT 569
               L+Q+  + +R + ++ +  ++EW+ P  + I     N  QVV+A+ G  L Y E+   
Sbjct:   446 QQLIQITSASVRLVSQEPKALVSEWKEPNGKNISVASCNSNQVVVAV-GRALYYLEI-RP 503

Query:   570 GQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSY-DNTIRILSLDPDDCMQILSVQ 628
              +L ++   EM  +VACLDI  + +    S   A+G + D + RIL L P    ++L  +
Sbjct:   504 QELRQINCTEMEHEVACLDITPLGDTNGMSPLCAIGLWTDISARILKL-PS--FELLHKE 560

Query:   629 SVSSP--PESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRS 686
              +     P S+L    ++S        H    +L   L +G LF   + + TG LSD + 
Sbjct:   561 MLGGEIIPRSILMTTFESS--------H----YLLCALGDGALFYFGLSLETGLLSDRKK 608

Query:   687 RFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCV 746
               LG +P  L +        +   S RP + Y    + + + ++ + + Y    +SD   
Sbjct:   609 VTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYP 668

Query:   747 EGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQ 786
             + +     + L + TI+ + +    T +PL  +PR+   Q
Sbjct:   669 DSLALANNSTLTIGTIDEIQKLHIRT-VPLYESPRKICYQ 707

 Score = 207 (77.9 bits), Expect = 3.0e-44, Sum P(5) = 3.0e-44
 Identities = 85/350 (24%), Positives = 152/350 (43%)

Query:   864 VLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKR-NIVAGYI 922
             ++D  +       +   NE A S+ +     K+  T   VGTA     +P+      G I
Sbjct:   773 IIDQHTFEVLHAHQFLQNEYALSLVSCKL-GKDPNTYFIVGTA---MVYPEEAEPKQGRI 828

Query:   923 HIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYD-LGKKRLLRKCENK 981
              ++ +  +GK L+ L + +V+G   ++ +F G+LLA I   +RLY+   +K L  +C + 
Sbjct:   829 VVFHY-SDGK-LQSLAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTAEKELRTECNHY 886

Query:   982 LFPNTIVSI--NTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFD 1039
                N I+++   T  D I VGD+  S     Y+  E      A D  P W++A   +D D
Sbjct:   887 ---NNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDD 943

Query:  1040 TMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQFHVGDVVT 1099
                GA+   N++  +  +D +   +E+        +  +  G  +  E +  F  G +V 
Sbjct:   944 NFLGAENAFNLFVCQ--KDSAATTDEER-------QHLQEVGLSHLGEFVNVFCHGSLVM 994

Query:  1100 SLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDHMA 1159
                     P  G SV++GTV G +G + + S     +    ++  + +    +   +H  
Sbjct:   995 QNLGEKSTPTQG-SVLFGTVNGMIGLVTSLSE-SWYNLLLDMQNRLNKVIKSVGKIEHAT 1052

Query:  1160 YRSAYF-----PVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPGEILKK 1204
             +RS +      P    IDGDL E F  +S    +++   L    G  +K+
Sbjct:  1053 WRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKR 1102

 Score = 64 (27.6 bits), Expect = 3.0e-44, Sum P(5) = 3.0e-44
 Identities = 28/116 (24%), Positives = 49/116 (42%)

Query:    20 GNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAIRSLAQFRLTGSQKDYIVV 79
             G+F+  +   +++A+   LE+      G +  +    ++G    +  FR  G  KD + +
Sbjct:     1 GHFTSAEDLNLLIAKNTRLEIYVVTAEG-LRPVKEVGMYGKTAVMELFRPKGESKDLLFI 59

Query:    80 GSDSGRIVILEYNPSKNVFDKI---HQETFGKSGCRRIVPGQYLAVDPKGRAVMIG 132
              +      ILEY  + +  D I   H     + G R    G    +DP+ R  MIG
Sbjct:    60 LTAKYNACILEYKQNGDNIDIITRAHGNVQDRIG-RPSETGIIGIIDPECR--MIG 112

 Score = 41 (19.5 bits), Expect = 9.8e-13, Sum P(3) = 9.8e-13
 Identities = 5/23 (21%), Positives = 13/23 (56%)

Query:   123 DPKGRAVMIGACEKQKLVYVLNR 145
             +P G+ + + +C   ++V  + R
Sbjct:   472 EPNGKNISVASCNSNQVVVAVGR 494

 Score = 38 (18.4 bits), Expect = 3.0e-44, Sum P(5) = 3.0e-44
 Identities = 7/25 (28%), Positives = 14/25 (56%)

Query:   122 VDPKGRAVMIGACEKQKLVYVLNRD 146
             VDP G   ++G  E +  + +L ++
Sbjct:   244 VDPNGSRYLLGDMEGRLFMLLLEKE 268

 Score = 38 (18.4 bits), Expect = 3.0e-44, Sum P(5) = 3.0e-44
 Identities = 9/36 (25%), Positives = 17/36 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRG 428
             L + EQ++  + + D+R+  L E    +  T    G
Sbjct:   265 LEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNG 300


>UNIPROTKB|F1NVV3 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9031
            "Gallus gallus" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 GO:GO:0005634 Gene3D:2.130.10.10
            GO:GO:0003676 GeneTree:ENSGT00530000063396 EMBL:AADN02017118
            EMBL:AADN02017119 IPI:IPI00821712 Ensembl:ENSGALT00000040604
            ArrayExpress:F1NVV3 Uniprot:F1NVV3
        Length = 1119

 Score = 353 (129.3 bits), Expect = 2.0e-43, Sum P(5) = 2.0e-43
 Identities = 109/400 (27%), Positives = 191/400 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPG 452
             +V +E   +L PI+DM + +L  +   Q+ T  G     SLRI+R G+ + E A   LPG
Sbjct:   327 VVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPG 386

Query:   453 VPSAVWTVKKNVNDEFDAYIVVSFNNATLVLSI-GETVEEVSDSGFLDTTPSLAVSLIGD 511
             +   +W ++ + + E D  +V+SF   T VL + GE VEE   +GF+D   +     +  
Sbjct:   387 I-KGLWPLRSDSHREMDNMLVLSFVGQTRVLMLNGEEVEETELTGFVDDQQTFFCGNVAH 445

Query:   512 DSLMQVHPSGIRHIREDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMT 569
               L+Q+  + +R + ++ +  ++EW+ P  + I     N  QVV+A+ G  L Y E+   
Sbjct:   446 QQLIQITSASVRLVSQEPKALVSEWKEPNGKNISVASCNSNQVVVAV-GRALYYLEI-RP 503

Query:   570 GQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSY-DNTIRILSLDPDDCMQILSVQ 628
              +L ++   EM  +VACLDI  + +    S   A+G + D + RIL L P    ++L  +
Sbjct:   504 QELRQINCTEMEHEVACLDITPLGDTNGMSPLCAIGLWTDISARILKL-PS--FELLHKE 560

Query:   629 SVSSP--PESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRS 686
              +     P S+L    ++S        H    +L   L +G LF   + + TG LSD + 
Sbjct:   561 MLGGEIIPRSILMTTFESS--------H----YLLCALGDGALFYFGLSLETGLLSDRKK 608

Query:   687 RFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCV 746
               LG +P  L +        +   S RP + Y    + + + ++ + + Y    +SD   
Sbjct:   609 VTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYP 668

Query:   747 EGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQ 786
             + +     + L + TI+ + +    T +PL  +PR+   Q
Sbjct:   669 DSLALANNSTLTIGTIDEIQKLHIRT-VPLYESPRKICYQ 707

 Score = 199 (75.1 bits), Expect = 2.0e-43, Sum P(5) = 2.0e-43
 Identities = 84/349 (24%), Positives = 149/349 (42%)

Query:   864 VLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKR-NIVAGYI 922
             ++D  +       +   NE A S+ +     K+  T   VGTA     +P+      G I
Sbjct:   773 IIDQHTFEVLHAHQFLQNEYALSLVSCKL-GKDPNTYFIVGTA---MVYPEEAEPKQGRI 828

Query:   923 HIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYD-LGKKRLLRKCENK 981
              ++ +  +GK L+ L + +V+G   ++ +F G+LLA I   +RLY+   +K L  +C + 
Sbjct:   829 VVFHY-SDGK-LQSLAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTAEKELRTECNHY 886

Query:   982 LFPNTIVSI--NTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFD 1039
                N I+++   T  D I VGD+  S     Y+  E      A D  P W++A   +D D
Sbjct:   887 ---NNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDD 943

Query:  1040 TMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQFHVGDVVT 1099
                GA+   N++  +  +D +   +E+        +  +  G  +  E +  F  G +V 
Sbjct:   944 NFLGAENAFNLFVCQ--KDSAATTDEER-------QHLQEVGLSHLGEFVNVFCHGSLVM 994

Query:  1100 SLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDHMA 1159
                     P  G SV++GTV G +G + + S     +    ++  + +    +   +H  
Sbjct:   995 QNLGEKSTPTQG-SVLFGTVNGMIGLVTSLSE-SWYNLLLDMQNRLNKVIKSVGKIEHSL 1052

Query:  1160 Y----RSAYFPVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPGEILKK 1204
             Y         P    IDGDL E F  +S    +++   L    G  +K+
Sbjct:  1053 YSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKR 1101

 Score = 64 (27.6 bits), Expect = 2.0e-43, Sum P(5) = 2.0e-43
 Identities = 28/116 (24%), Positives = 49/116 (42%)

Query:    20 GNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAIRSLAQFRLTGSQKDYIVV 79
             G+F+  +   +++A+   LE+      G +  +    ++G    +  FR  G  KD + +
Sbjct:     1 GHFTSAEDLNLLIAKNTRLEIYVVTAEG-LRPVKEVGMYGKTAVMELFRPKGESKDLLFI 59

Query:    80 GSDSGRIVILEYNPSKNVFDKI---HQETFGKSGCRRIVPGQYLAVDPKGRAVMIG 132
              +      ILEY  + +  D I   H     + G R    G    +DP+ R  MIG
Sbjct:    60 LTAKYNACILEYKQNGDNIDIITRAHGNVQDRIG-RPSETGIIGIIDPECR--MIG 112

 Score = 41 (19.5 bits), Expect = 6.8e-12, Sum P(3) = 6.8e-12
 Identities = 5/23 (21%), Positives = 13/23 (56%)

Query:   123 DPKGRAVMIGACEKQKLVYVLNR 145
             +P G+ + + +C   ++V  + R
Sbjct:   472 EPNGKNISVASCNSNQVVVAVGR 494

 Score = 38 (18.4 bits), Expect = 2.0e-43, Sum P(5) = 2.0e-43
 Identities = 7/25 (28%), Positives = 14/25 (56%)

Query:   122 VDPKGRAVMIGACEKQKLVYVLNRD 146
             VDP G   ++G  E +  + +L ++
Sbjct:   244 VDPNGSRYLLGDMEGRLFMLLLEKE 268

 Score = 38 (18.4 bits), Expect = 2.0e-43, Sum P(5) = 2.0e-43
 Identities = 9/36 (25%), Positives = 17/36 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRG 428
             L + EQ++  + + D+R+  L E    +  T    G
Sbjct:   265 LEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNG 300


>UNIPROTKB|F1NVV2 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9031
            "Gallus gallus" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0000075 "cell cycle
            checkpoint" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA]
            [GO:0016055 "Wnt receptor signaling pathway" evidence=IEA]
            [GO:0031464 "Cul4A-RING ubiquitin ligase complex" evidence=IEA]
            [GO:0031465 "Cul4B-RING ubiquitin ligase complex" evidence=IEA]
            [GO:0042787 "protein ubiquitination involved in ubiquitin-dependent
            protein catabolic process" evidence=IEA] [GO:0043161 "proteasomal
            ubiquitin-dependent protein catabolic process" evidence=IEA]
            InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178 GO:GO:0005634
            GO:GO:0005737 GO:GO:0043161 GO:GO:0016055 Gene3D:2.130.10.10
            GO:GO:0003676 GO:GO:0042787 GO:GO:0000075 GO:GO:0031464
            GO:GO:0031465 OMA:CALGDGS GeneTree:ENSGT00530000063396
            IPI:IPI00597295 EMBL:AADN02017118 EMBL:AADN02017119
            Ensembl:ENSGALT00000040605 ArrayExpress:F1NVV2 Uniprot:F1NVV2
        Length = 1123

 Score = 353 (129.3 bits), Expect = 2.6e-43, Sum P(5) = 2.6e-43
 Identities = 109/400 (27%), Positives = 191/400 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPG 452
             +V +E   +L PI+DM + +L  +   Q+ T  G     SLRI+R G+ + E A   LPG
Sbjct:   327 VVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPG 386

Query:   453 VPSAVWTVKKNVNDEFDAYIVVSFNNATLVLSI-GETVEEVSDSGFLDTTPSLAVSLIGD 511
             +   +W ++ + + E D  +V+SF   T VL + GE VEE   +GF+D   +     +  
Sbjct:   387 I-KGLWPLRSDSHREMDNMLVLSFVGQTRVLMLNGEEVEETELTGFVDDQQTFFCGNVAH 445

Query:   512 DSLMQVHPSGIRHIREDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMT 569
               L+Q+  + +R + ++ +  ++EW+ P  + I     N  QVV+A+ G  L Y E+   
Sbjct:   446 QQLIQITSASVRLVSQEPKALVSEWKEPNGKNISVASCNSNQVVVAV-GRALYYLEI-RP 503

Query:   570 GQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSY-DNTIRILSLDPDDCMQILSVQ 628
              +L ++   EM  +VACLDI  + +    S   A+G + D + RIL L P    ++L  +
Sbjct:   504 QELRQINCTEMEHEVACLDITPLGDTNGMSPLCAIGLWTDISARILKL-PS--FELLHKE 560

Query:   629 SVSSP--PESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRS 686
              +     P S+L    ++S        H    +L   L +G LF   + + TG LSD + 
Sbjct:   561 MLGGEIIPRSILMTTFESS--------H----YLLCALGDGALFYFGLSLETGLLSDRKK 608

Query:   687 RFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCV 746
               LG +P  L +        +   S RP + Y    + + + ++ + + Y    +SD   
Sbjct:   609 VTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYP 668

Query:   747 EGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQ 786
             + +     + L + TI+ + +    T +PL  +PR+   Q
Sbjct:   669 DSLALANNSTLTIGTIDEIQKLHIRT-VPLYESPRKICYQ 707

 Score = 198 (74.8 bits), Expect = 2.6e-43, Sum P(5) = 2.6e-43
 Identities = 86/353 (24%), Positives = 152/353 (43%)

Query:   864 VLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKR-NIVAGYI 922
             ++D  +       +   NE A S+ +     K+  T   VGTA     +P+      G I
Sbjct:   773 IIDQHTFEVLHAHQFLQNEYALSLVSCKL-GKDPNTYFIVGTA---MVYPEEAEPKQGRI 828

Query:   923 HIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYD-LGKKRLLRKCENK 981
              ++ +  +GK L+ L + +V+G   ++ +F G+LLA I   +RLY+   +K L  +C + 
Sbjct:   829 VVFHY-SDGK-LQSLAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTAEKELRTECNHY 886

Query:   982 LFPNTIVSI--NTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFD 1039
                N I+++   T  D I VGD+  S     Y+  E      A D  P W++A   +D D
Sbjct:   887 ---NNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDD 943

Query:  1040 TMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQFHVGDVVT 1099
                GA+   N++  +  +D +   +E+        +  +  G  +  E +  F  G +V 
Sbjct:   944 NFLGAENAFNLFVCQ--KDSAATTDEER-------QHLQEVGLSHLGEFVNVFCHGSLVM 994

Query:  1100 SLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDHMA 1159
                     P  G SV++GTV G +G + + S     +    ++  + +    +   +H  
Sbjct:   995 QNLGEKSTPTQG-SVLFGTVNGMIGLVTSLSE-SWYNLLLDMQNRLNKVIKSVGKIEHSL 1052

Query:  1160 Y---RSAYF-----PVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPGEILKK 1204
             Y   RS +      P    IDGDL E F  +S    +++   L    G  +K+
Sbjct:  1053 YATWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKR 1105

 Score = 64 (27.6 bits), Expect = 2.6e-43, Sum P(5) = 2.6e-43
 Identities = 28/116 (24%), Positives = 49/116 (42%)

Query:    20 GNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAIRSLAQFRLTGSQKDYIVV 79
             G+F+  +   +++A+   LE+      G +  +    ++G    +  FR  G  KD + +
Sbjct:     1 GHFTSAEDLNLLIAKNTRLEIYVVTAEG-LRPVKEVGMYGKTAVMELFRPKGESKDLLFI 59

Query:    80 GSDSGRIVILEYNPSKNVFDKI---HQETFGKSGCRRIVPGQYLAVDPKGRAVMIG 132
              +      ILEY  + +  D I   H     + G R    G    +DP+ R  MIG
Sbjct:    60 LTAKYNACILEYKQNGDNIDIITRAHGNVQDRIG-RPSETGIIGIIDPECR--MIG 112

 Score = 41 (19.5 bits), Expect = 8.7e-12, Sum P(3) = 8.7e-12
 Identities = 5/23 (21%), Positives = 13/23 (56%)

Query:   123 DPKGRAVMIGACEKQKLVYVLNR 145
             +P G+ + + +C   ++V  + R
Sbjct:   472 EPNGKNISVASCNSNQVVVAVGR 494

 Score = 38 (18.4 bits), Expect = 2.6e-43, Sum P(5) = 2.6e-43
 Identities = 7/25 (28%), Positives = 14/25 (56%)

Query:   122 VDPKGRAVMIGACEKQKLVYVLNRD 146
             VDP G   ++G  E +  + +L ++
Sbjct:   244 VDPNGSRYLLGDMEGRLFMLLLEKE 268

 Score = 38 (18.4 bits), Expect = 2.6e-43, Sum P(5) = 2.6e-43
 Identities = 9/36 (25%), Positives = 17/36 (47%)

Query:   393 LVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRG 428
             L + EQ++  + + D+R+  L E    +  T    G
Sbjct:   265 LEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNG 300


>WB|WBGene00010890 [details] [associations]
            symbol:ddb-1 species:6239 "Caenorhabditis elegans"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0040010 "positive regulation of growth
            rate" evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0040011 "locomotion" evidence=IMP] [GO:0000003
            "reproduction" evidence=IMP] [GO:0009792 "embryo development ending
            in birth or egg hatching" evidence=IMP] [GO:0006898
            "receptor-mediated endocytosis" evidence=IMP] [GO:0040035
            "hermaphrodite genitalia development" evidence=IMP] [GO:0030163
            "protein catabolic process" evidence=IMP] [GO:0007276 "gamete
            generation" evidence=IMP] [GO:0005515 "protein binding"
            evidence=IPI] InterPro:IPR004871 Pfam:PF03178 UniPathway:UPA00143
            GO:GO:0005634 GO:GO:0009792 GO:GO:0006898 GO:GO:0005737
            GO:GO:0040010 GO:GO:0002119 GO:GO:0003677 GO:GO:0006281
            GO:GO:0040011 GO:GO:0016567 GO:GO:0007049 GO:GO:0040035
            InterPro:IPR011047 SUPFAM:SSF50998 GO:GO:0030163 GO:GO:0007276
            eggNOG:NOG247734 HOGENOM:HOG000007241 KO:K10610 OMA:CALGDGS
            GeneTree:ENSGT00530000063396 EMBL:Z68507 PIR:A88855 PIR:T23798
            RefSeq:NP_502299.1 HSSP:Q16531 ProteinModelPortal:Q21554
            DIP:DIP-25884N IntAct:Q21554 MINT:MINT-1055778 STRING:Q21554
            PaxDb:Q21554 EnsemblMetazoa:M18.5.1 EnsemblMetazoa:M18.5.2
            GeneID:178156 KEGG:cel:CELE_M18.5 UCSC:M18.5 CTD:178156
            WormBase:M18.5 InParanoid:Q21554 NextBio:899950 Uniprot:Q21554
        Length = 1134

 Score = 239 (89.2 bits), Expect = 8.6e-39, Sum P(4) = 8.6e-39
 Identities = 101/414 (24%), Positives = 186/414 (44%)

Query:   386 QPRGLKNLVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEM 445
             +P G    V +E   ++ PI DM +     +  PQ+ T  G     SLR++R G+ + E+
Sbjct:   336 EPNGGSYSVILETYSNIGPIRDMVMVE--SDGQPQLVTCTGADKDGSLRVIRNGIGIDEL 393

Query:   446 AVSQLPGVPSAVWTVKKNVNDEFDAYIVVSFNNATLVLSI-GETVEEVSDSGFLDTTPSL 504
             A   L GV   ++ ++ + N   D Y++VS ++ T VL I GE +E+V         P++
Sbjct:   394 ASVDLAGVVG-IFPIRLDSNA--DNYVIVSLSDETHVLQITGEELEDVKLLEINTDLPTI 450

Query:   505 -AVSLIG-DDS--LMQVHPSGIRHIREDGRINEWRTPGKRTIVKVGSNRLQVVIALSGGE 560
              A +L G +DS  ++Q     IR +   G    W       I KV  N     I L+  +
Sbjct:   451 FASTLFGPNDSGIILQATEKQIRLMSSSGLSKFWEPTNGEIISKVSVNAANGQIVLAARD 510

Query:   561 LIYFE---VDMTGQL---LEVEKHEMSGDVACLDIASVPEG-RKRSRFLAVGSYDN-TIR 612
              +Y     VD  G L   L  EK +   ++ACLD+++  +    ++ FL +  +    + 
Sbjct:   511 TVYLLTCIVDEMGALDIQLTAEK-KFENEIACLDLSNEGDDPNNKATFLVLAFWSTFAME 569

Query:   613 ILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRT 672
             ++ L PD    +++V     P + ++   + A+   E    H    +L     +G L   
Sbjct:   570 VIQL-PD----LITVCHTDLPTK-IIPRSIIATCIEEV---H----YLLVAFGDGALVYY 616

Query:   673 VVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYE 732
             V D+ TG   + +   +G RPP L  V    R  +   S RP + +    + + + ++ +
Sbjct:   617 VFDIKTGTHGEPKKSNVGTRPPSLHRVRNKNRQHLFVCSDRPVIIFSASKKLVFSNVNVK 676

Query:   733 TLEYAASFSSDQCVEGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQ 786
              ++   S SS    + +V   GN++   T++ + +  +  ++P+  +  R   Q
Sbjct:   677 LVDTVCSLSSSAYRDCLVISDGNSMVFGTVDDI-QKIHVRSIPMGESVLRIAYQ 729

 Score = 193 (73.0 bits), Expect = 8.6e-39, Sum P(4) = 8.6e-39
 Identities = 87/371 (23%), Positives = 159/371 (42%)

Query:   860 SCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVA 919
             S   VLD  +       E    E A S  +  F + +  T   VGT  GL +  +     
Sbjct:   781 SSFMVLDQNTFQVLHSHEFGPWETALSCISGQFTN-DSSTYYVVGT--GLIYPDETETKI 837

Query:   920 GYIHIYRF--VEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDLGKKRLLR- 976
             G I ++    VE  K L  +H+  V G PLA+    G+L+A I   +RL++    + LR 
Sbjct:   838 GRIVVFEVDDVERSK-LRRVHELVVRGSPLAIRILNGKLVAAINSSIRLFEWTTDKELRL 896

Query:   977 KCENKLFPNTI-VSINTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHH 1035
             +C +  F + I + +    + + V D+  S     YR  E      A D   +W+     
Sbjct:   897 ECSS--FNHVIALDLKVMNEEVAVADVMRSVSLLSYRMLEGNFEEVAKDWNSQWMVTCEF 954

Query:  1036 IDFDTMAGADKFGNIYFVRLPQD--VSDEIEE--DPTGGKIKWEQGKLNGAPNKMEEIVQ 1091
             I  +++ G +   N++ V + +   ++D+     +PTG    W  G+L     +   ++Q
Sbjct:   955 ITAESILGGEAHLNLFTVEVDKTRPITDDGRYVLEPTG---YWYLGELPKVMTRSTLVIQ 1011

Query:  1092 FHVGDVVTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPP 1151
                        + S++    + +++GT  G++G ++    +    F   +E  +      
Sbjct:  1012 ----------PEDSIIQYS-QPIMFGTNQGTIGMIVQIDDKWK-KFLIAIEKAIADSVKN 1059

Query:  1152 LCGRDHMAYRSAYF-----PVKDVIDGDLCEQF----PTLSLDLQRKIADE-----LDRT 1197
                 +H +YR+  F     P    +DGDL E       ++++D+  K++D+     L R 
Sbjct:  1060 CMHIEHSSYRTFVFQKRAEPPSGFVDGDLVESILDMDRSVAMDILSKVSDKGWDPSLPRD 1119

Query:  1198 PGEILKKLEEI 1208
             P EILK +E++
Sbjct:  1120 PVEILKVIEDL 1130

 Score = 94 (38.1 bits), Expect = 8.6e-39, Sum P(4) = 8.6e-39
 Identities = 21/96 (21%), Positives = 52/96 (54%)

Query:     4 YSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLE--LLRPENSGRIETLVSTEIFGAI 61
             Y ++ ++ + ++ ++ GNF+G +   ++VARG  ++  L+ PE    ++ +    I+G +
Sbjct:     5 YCVSAKKASVVVESVVGNFTGHENVNLIVARGNRIDVQLVSPEG---LKNVCEIPIYGQV 61

Query:    62 RSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNV 97
              ++A  +    ++  ++V ++   + IL Y   K V
Sbjct:    62 LTIALVKCKRDKRHSLIVVTEKWHMAILAYRDGKVV 97

 Score = 85 (35.0 bits), Expect = 8.6e-39, Sum P(4) = 8.6e-39
 Identities = 45/186 (24%), Positives = 75/186 (40%)

Query:   205 KNLTFYELDLGLNHVSRKWSEPVDNGANMLVTVPGGGDGPSGVLVCAENFVIYKNQGHPD 264
             K+L F +L++  +   R +S      A+  V +P       GV+V   N V+YK   + +
Sbjct:   184 KHLQFSDLNMH-DKEFRTYSRQASIAADSSVLIPVP-HAIGGVIVLGSNSVLYKP--NDN 239

Query:   265 VRAVIPRRADLPAERGVL---IVSAATHRQKTLFFFLLQTEYGDIF----KVTLEHDNEH 317
             +  V+P    L          IV A+  R      FLL    G +      VT       
Sbjct:   240 LGEVVPYTCSLLENTTFTCHGIVDASGER------FLLSDTDGRLLMLLLNVTESQSGYT 293

Query:   318 VSELKIKYFDTIPVTASMCVLKSGYLFAASEFGNHALYQFQAIGADPDVEASSSTLMETE 377
             V E++I Y     +  S+  + +G +F  S  G+  L +      +P+   S S ++ET 
Sbjct:   294 VKEMRIDYLGETSIADSINYIDNGVVFVGSRLGDSQLIRLMT---EPN-GGSYSVILETY 349

Query:   378 EGFQPV 383
                 P+
Sbjct:   350 SNIGPI 355

 Score = 40 (19.1 bits), Expect = 3.7e-34, Sum P(4) = 3.7e-34
 Identities = 7/27 (25%), Positives = 15/27 (55%)

Query:   296 FFLLQTEYGDIFKVTLEHDNEHVSELK 322
             F  + T   D+++V   +D++H   L+
Sbjct:   161 FKFVDTGEDDVYRVAFIYDDDHGKHLQ 187


>UNIPROTKB|Q21554 [details] [associations]
            symbol:ddb-1 "DNA damage-binding protein 1" species:6239
            "Caenorhabditis elegans" [GO:0005515 "protein binding"
            evidence=IPI] [GO:0005737 "cytoplasm" evidence=ISS] [GO:0005634
            "nucleus" evidence=ISS] InterPro:IPR004871 Pfam:PF03178
            UniPathway:UPA00143 GO:GO:0005634 GO:GO:0009792 GO:GO:0006898
            GO:GO:0005737 GO:GO:0040010 GO:GO:0002119 GO:GO:0003677
            GO:GO:0006281 GO:GO:0040011 GO:GO:0016567 GO:GO:0007049
            GO:GO:0040035 InterPro:IPR011047 SUPFAM:SSF50998 GO:GO:0030163
            GO:GO:0007276 eggNOG:NOG247734 HOGENOM:HOG000007241 KO:K10610
            OMA:CALGDGS GeneTree:ENSGT00530000063396 EMBL:Z68507 PIR:A88855
            PIR:T23798 RefSeq:NP_502299.1 HSSP:Q16531 ProteinModelPortal:Q21554
            DIP:DIP-25884N IntAct:Q21554 MINT:MINT-1055778 STRING:Q21554
            PaxDb:Q21554 EnsemblMetazoa:M18.5.1 EnsemblMetazoa:M18.5.2
            GeneID:178156 KEGG:cel:CELE_M18.5 UCSC:M18.5 CTD:178156
            WormBase:M18.5 InParanoid:Q21554 NextBio:899950 Uniprot:Q21554
        Length = 1134

 Score = 239 (89.2 bits), Expect = 8.6e-39, Sum P(4) = 8.6e-39
 Identities = 101/414 (24%), Positives = 186/414 (44%)

Query:   386 QPRGLKNLVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEM 445
             +P G    V +E   ++ PI DM +     +  PQ+ T  G     SLR++R G+ + E+
Sbjct:   336 EPNGGSYSVILETYSNIGPIRDMVMVE--SDGQPQLVTCTGADKDGSLRVIRNGIGIDEL 393

Query:   446 AVSQLPGVPSAVWTVKKNVNDEFDAYIVVSFNNATLVLSI-GETVEEVSDSGFLDTTPSL 504
             A   L GV   ++ ++ + N   D Y++VS ++ T VL I GE +E+V         P++
Sbjct:   394 ASVDLAGVVG-IFPIRLDSNA--DNYVIVSLSDETHVLQITGEELEDVKLLEINTDLPTI 450

Query:   505 -AVSLIG-DDS--LMQVHPSGIRHIREDGRINEWRTPGKRTIVKVGSNRLQVVIALSGGE 560
              A +L G +DS  ++Q     IR +   G    W       I KV  N     I L+  +
Sbjct:   451 FASTLFGPNDSGIILQATEKQIRLMSSSGLSKFWEPTNGEIISKVSVNAANGQIVLAARD 510

Query:   561 LIYFE---VDMTGQL---LEVEKHEMSGDVACLDIASVPEG-RKRSRFLAVGSYDN-TIR 612
              +Y     VD  G L   L  EK +   ++ACLD+++  +    ++ FL +  +    + 
Sbjct:   511 TVYLLTCIVDEMGALDIQLTAEK-KFENEIACLDLSNEGDDPNNKATFLVLAFWSTFAME 569

Query:   613 ILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRT 672
             ++ L PD    +++V     P + ++   + A+   E    H    +L     +G L   
Sbjct:   570 VIQL-PD----LITVCHTDLPTK-IIPRSIIATCIEEV---H----YLLVAFGDGALVYY 616

Query:   673 VVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYE 732
             V D+ TG   + +   +G RPP L  V    R  +   S RP + +    + + + ++ +
Sbjct:   617 VFDIKTGTHGEPKKSNVGTRPPSLHRVRNKNRQHLFVCSDRPVIIFSASKKLVFSNVNVK 676

Query:   733 TLEYAASFSSDQCVEGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQ 786
              ++   S SS    + +V   GN++   T++ + +  +  ++P+  +  R   Q
Sbjct:   677 LVDTVCSLSSSAYRDCLVISDGNSMVFGTVDDI-QKIHVRSIPMGESVLRIAYQ 729

 Score = 193 (73.0 bits), Expect = 8.6e-39, Sum P(4) = 8.6e-39
 Identities = 87/371 (23%), Positives = 159/371 (42%)

Query:   860 SCIRVLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVA 919
             S   VLD  +       E    E A S  +  F + +  T   VGT  GL +  +     
Sbjct:   781 SSFMVLDQNTFQVLHSHEFGPWETALSCISGQFTN-DSSTYYVVGT--GLIYPDETETKI 837

Query:   920 GYIHIYRF--VEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDLGKKRLLR- 976
             G I ++    VE  K L  +H+  V G PLA+    G+L+A I   +RL++    + LR 
Sbjct:   838 GRIVVFEVDDVERSK-LRRVHELVVRGSPLAIRILNGKLVAAINSSIRLFEWTTDKELRL 896

Query:   977 KCENKLFPNTI-VSINTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHH 1035
             +C +  F + I + +    + + V D+  S     YR  E      A D   +W+     
Sbjct:   897 ECSS--FNHVIALDLKVMNEEVAVADVMRSVSLLSYRMLEGNFEEVAKDWNSQWMVTCEF 954

Query:  1036 IDFDTMAGADKFGNIYFVRLPQD--VSDEIEE--DPTGGKIKWEQGKLNGAPNKMEEIVQ 1091
             I  +++ G +   N++ V + +   ++D+     +PTG    W  G+L     +   ++Q
Sbjct:   955 ITAESILGGEAHLNLFTVEVDKTRPITDDGRYVLEPTG---YWYLGELPKVMTRSTLVIQ 1011

Query:  1092 FHVGDVVTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPP 1151
                        + S++    + +++GT  G++G ++    +    F   +E  +      
Sbjct:  1012 ----------PEDSIIQYS-QPIMFGTNQGTIGMIVQIDDKWK-KFLIAIEKAIADSVKN 1059

Query:  1152 LCGRDHMAYRSAYF-----PVKDVIDGDLCEQF----PTLSLDLQRKIADE-----LDRT 1197
                 +H +YR+  F     P    +DGDL E       ++++D+  K++D+     L R 
Sbjct:  1060 CMHIEHSSYRTFVFQKRAEPPSGFVDGDLVESILDMDRSVAMDILSKVSDKGWDPSLPRD 1119

Query:  1198 PGEILKKLEEI 1208
             P EILK +E++
Sbjct:  1120 PVEILKVIEDL 1130

 Score = 94 (38.1 bits), Expect = 8.6e-39, Sum P(4) = 8.6e-39
 Identities = 21/96 (21%), Positives = 52/96 (54%)

Query:     4 YSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLE--LLRPENSGRIETLVSTEIFGAI 61
             Y ++ ++ + ++ ++ GNF+G +   ++VARG  ++  L+ PE    ++ +    I+G +
Sbjct:     5 YCVSAKKASVVVESVVGNFTGHENVNLIVARGNRIDVQLVSPEG---LKNVCEIPIYGQV 61

Query:    62 RSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNV 97
              ++A  +    ++  ++V ++   + IL Y   K V
Sbjct:    62 LTIALVKCKRDKRHSLIVVTEKWHMAILAYRDGKVV 97

 Score = 85 (35.0 bits), Expect = 8.6e-39, Sum P(4) = 8.6e-39
 Identities = 45/186 (24%), Positives = 75/186 (40%)

Query:   205 KNLTFYELDLGLNHVSRKWSEPVDNGANMLVTVPGGGDGPSGVLVCAENFVIYKNQGHPD 264
             K+L F +L++  +   R +S      A+  V +P       GV+V   N V+YK   + +
Sbjct:   184 KHLQFSDLNMH-DKEFRTYSRQASIAADSSVLIPVP-HAIGGVIVLGSNSVLYKP--NDN 239

Query:   265 VRAVIPRRADLPAERGVL---IVSAATHRQKTLFFFLLQTEYGDIF----KVTLEHDNEH 317
             +  V+P    L          IV A+  R      FLL    G +      VT       
Sbjct:   240 LGEVVPYTCSLLENTTFTCHGIVDASGER------FLLSDTDGRLLMLLLNVTESQSGYT 293

Query:   318 VSELKIKYFDTIPVTASMCVLKSGYLFAASEFGNHALYQFQAIGADPDVEASSSTLMETE 377
             V E++I Y     +  S+  + +G +F  S  G+  L +      +P+   S S ++ET 
Sbjct:   294 VKEMRIDYLGETSIADSINYIDNGVVFVGSRLGDSQLIRLMT---EPN-GGSYSVILETY 349

Query:   378 EGFQPV 383
                 P+
Sbjct:   350 SNIGPI 355

 Score = 40 (19.1 bits), Expect = 3.7e-34, Sum P(4) = 3.7e-34
 Identities = 7/27 (25%), Positives = 15/27 (55%)

Query:   296 FFLLQTEYGDIFKVTLEHDNEHVSELK 322
             F  + T   D+++V   +D++H   L+
Sbjct:   161 FKFVDTGEDDVYRVAFIYDDDHGKHLQ 187


>TAIR|locus:2081576 [details] [associations]
            symbol:AT3G11960 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA;ISS] [GO:0005634
            "nucleus" evidence=ISM;IEA;ISS] [GO:0008150 "biological_process"
            evidence=ND] [GO:0000956 "nuclear-transcribed mRNA catabolic
            process" evidence=RCA] [GO:0006486 "protein glycosylation"
            evidence=RCA] [GO:0009755 "hormone-mediated signaling pathway"
            evidence=RCA] [GO:0010182 "sugar mediated signaling pathway"
            evidence=RCA] [GO:0048825 "cotyledon development" evidence=RCA]
            InterPro:IPR004871 Pfam:PF03178 GO:GO:0005634 EMBL:CP002686
            GenomeReviews:BA000014_GR GO:GO:0003676 KO:K12830 EMBL:BT006164
            EMBL:AK229623 IPI:IPI00530914 RefSeq:NP_187802.2 UniGene:At.5413
            ProteinModelPortal:Q84R20 IntAct:Q84R20 PaxDb:Q84R20 PRIDE:Q84R20
            EnsemblPlants:AT3G11960.1 GeneID:820369 KEGG:ath:AT3G11960
            TAIR:At3g11960 eggNOG:NOG322382 HOGENOM:HOG000030342
            InParanoid:Q84R20 OMA:GMLLRFE PhylomeDB:Q84R20
            ProtClustDB:CLSN2690873 ArrayExpress:Q84R20 Genevestigator:Q84R20
            Uniprot:Q84R20
        Length = 1379

 Score = 209 (78.6 bits), Expect = 1.3e-36, Sum P(5) = 1.3e-36
 Identities = 46/138 (33%), Positives = 78/138 (56%)

Query:   389 GLKNLVRIEQVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVS 448
             G + L  +  ++++ PI+D  + +   E+  QIF  CG  P  SLRI+R G+ V ++  +
Sbjct:   456 GTEKLHWMSSIQNIAPILDFSVMDDQNEKRDQIFACCGVTPEGSLRIIRSGINVEKLLKT 515

Query:   449 Q--LPGVPSAVWTVKKNVNDEFDAYIVVSFNNATLVLSIGETVEEVSDS-GFLDTTPSLA 505
                  G+ +  WTVK  + D + +++V+SF   T VLS+G + ++V+DS GF     + A
Sbjct:   516 APVYQGI-TGTWTVKMKLTDVYHSFLVLSFVEETRVLSVGLSFKDVTDSVGFQSDVCTFA 574

Query:   506 VSLIGDDSLMQVHPSGIR 523
               L+ D  L+Q+H   IR
Sbjct:   575 CGLVADGLLVQIHQDAIR 592

 Score = 151 (58.2 bits), Expect = 1.3e-36, Sum P(5) = 1.3e-36
 Identities = 65/282 (23%), Positives = 118/282 (41%)

Query:   934 LELLHKTQVEGIPLALCQFQGR-LLAGIGPVLRLYDL---GKKRLLRKCENKLFPNTIVS 989
             L L   T   G+ LA+C +     LA  G    +        +R+ R    +     I S
Sbjct:  1070 LRLASSTTWPGMVLAICPYLDHYFLASAGNAFYVCGFPNDSPERMKRFAVGRT-RFMITS 1128

Query:   990 INTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMAGADKFGN 1049
             + TY  RI VGD ++   F  Y  +  +L+    D   R +     +D +++A +D+ G+
Sbjct:  1129 LRTYFTRIVVGDCRDGVLFYSYHEESKKLHQIYCDPAQRLVADCFLMDANSVAVSDRKGS 1188

Query:  1050 IYFVRLPQDVSD----EIE-EDPTGG---KIKWEQGKLNGAPNKMEEIVQFHVGDVVTSL 1101
             I  +   +D SD     +E   P         +  G++  +  K   I +    DV+ S 
Sbjct:  1189 IAILSC-KDHSDFGMKHLEYSSPESNLNLNCAYYMGEIAMSIKKGCNIYKLPADDVLRSY 1247

Query:  1102 QKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHP---PLCGRDHM 1158
               +  +    +++I GT++GS+      SS ++ +    ++  +   HP   P+ G DH 
Sbjct:  1248 GLSKSIDTADDTIIAGTLLGSIFVFAPISS-EEYELLEGVQAKLGI-HPLTAPVLGNDHN 1305

Query:  1159 AYRSAYFP--VKDVIDGDLCEQFPTLSLDLQRKIADELDRTP 1198
              +R    P   + ++DGD+  QF  L+   Q  +      +P
Sbjct:  1306 EFRGRENPSQARKILDGDMLAQFLELTNRQQESVLSTPQPSP 1347

 Score = 118 (46.6 bits), Expect = 1.3e-36, Sum P(5) = 1.3e-36
 Identities = 43/181 (23%), Positives = 79/181 (43%)

Query:     2 YLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAI 61
             YL    L+ P+ ++    G F    + +IV  +   +EL+     G +E++    +FG I
Sbjct:    36 YLAKCILR-PSVVLQVAYGYFRSPSSRDIVFGKETCIELVVIGEDGIVESVCEQYVFGTI 94

Query:    62 RSLAQF----RLTGSQ----KDYIVVGSDSGRIVILEYNPSKNVFDKIHQETFGKSGCRR 113
             + LA      +L  +     KD + V SDSG++  L ++   + F  I        G  R
Sbjct:    95 KDLAVIPQSSKLYSNSLQMGKDLLAVLSDSGKLSFLSFSNEMHRFSPIQHVQLSTPGNSR 154

Query:   114 IVPGQYLAVDPKGRAVMIGACEKQKLVYVLNRDTAARLT---ISSPLEAHKSHTIVYSIC 170
             I  G+ L +D  G  + + A   +  ++ L+  +   +    IS P E   + + + +I 
Sbjct:   155 IQLGRMLTIDSSGLFLAVSAYHDRFALFSLSTSSMGDIIHQRISYPSEDGGNGSSIQAIS 214

Query:   171 G 171
             G
Sbjct:   215 G 215

 Score = 115 (45.5 bits), Expect = 1.3e-36, Sum P(5) = 1.3e-36
 Identities = 29/113 (25%), Positives = 54/113 (47%)

Query:   686 SRFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQC 745
             +R +G+ P  L        + ++ LS RPWL    R     T +S++   +A    S +C
Sbjct:   819 TRRIGITPVFLVPFSDSLDSDIIALSDRPWLLQTARQSLSYTSISFQPSTHATPVCSFEC 878

Query:   746 VEGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQPKKKLMVIIETD 798
              +G++ V+ N L +  +    +  N     L  TPR+ +   + KL++++ TD
Sbjct:   879 PQGILFVSENCLHLVEMVH-SKRRNAQKFQLGGTPRKVIYHSESKLLIVMRTD 930

 Score = 54 (24.1 bits), Expect = 1.3e-20, Sum P(5) = 1.3e-20
 Identities = 32/135 (23%), Positives = 53/135 (39%)

Query:   483 LSIGETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIR------HIREDGR------ 530
             LS  +  + V   GF     + A  L+ D  L+Q+H   IR          DG       
Sbjct:   555 LSFKDVTDSV---GFQSDVCTFACGLVADGLLVQIHQDAIRLCMPTMDAHSDGIPVSSPF 611

Query:   531 INEWRTPGKR-TIVKVGSNRLQVVIALSGGELIYFEVDMTG----QLLEVEKHEMSGDVA 585
              + W       ++  VG N L VV   +   L    V        ++ E+++  +  +V+
Sbjct:   612 FSSWFPENVSISLGAVGQN-LIVVSTSNPCFLSILGVKSVSSQCCEIYEIQRVTLQYEVS 670

Query:   586 CLDIASVPEGRKRSR 600
             C+ +     G+KRSR
Sbjct:   671 CISVPQKHIGKKRSR 685

 Score = 48 (22.0 bits), Expect = 1.3e-36, Sum P(5) = 1.3e-36
 Identities = 16/83 (19%), Positives = 40/83 (48%)

Query:   590 ASVPEGRKRSRFLAVGSYDNTIRILSLDPDDC-MQILSVQSVSSPPESLLFLEVQASVGG 648
             A++P   ++     +G++  ++ +LS   D   +++L+   VS    + +   +   +  
Sbjct:   695 AAIPSAMEQGYTFLIGTHKPSVEVLSFTEDGVGVRVLASGLVSLT--NTMGTVISGCIPQ 752

Query:   649 EDGADHPASLFLNAGLQNGVLFR 671
             +        L++ +GL+NG+L R
Sbjct:   753 DVRLVLVDQLYVLSGLRNGMLLR 775

 Score = 45 (20.9 bits), Expect = 2.0e-12, Sum P(3) = 2.0e-12
 Identities = 12/42 (28%), Positives = 18/42 (42%)

Query:   716 LGYIHRGRFLLTPLSYETLEYAASFSSDQCVEGVVSVAGNAL 757
             L ++   R L   LS++ +  +  F SD C      VA   L
Sbjct:   542 LSFVEETRVLSVGLSFKDVTDSVGFQSDVCTFACGLVADGLL 583

 Score = 45 (20.9 bits), Expect = 4.1e-12, Sum P(3) = 4.1e-12
 Identities = 12/35 (34%), Positives = 17/35 (48%)

Query:   601 FLAVGSYDNTIRILSLDPDDCMQILSVQSVSSPPE 635
             FLAV +Y +   + SL       I+  Q +S P E
Sbjct:   169 FLAVSAYHDRFALFSLSTSSMGDIIH-QRISYPSE 202

 Score = 38 (18.4 bits), Expect = 1.4e-35, Sum P(5) = 1.4e-35
 Identities = 9/31 (29%), Positives = 18/31 (58%)

Query:   602 LAVGSYDNTIRILSLDPDDCMQILSVQSVSS 632
             +++G+    + ++S      + IL V+SVSS
Sbjct:   622 ISLGAVGQNLIVVSTSNPCFLSILGVKSVSS 652


>UNIPROTKB|F5H6C5 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9606
            "Homo sapiens" [GO:0016055 "Wnt receptor signaling pathway"
            evidence=IEA] GO:GO:0016055 GO:GO:0003684 EMBL:AP003108
            HGNC:HGNC:2717 ChiTaRS:DDB1 EMBL:AP003037 IPI:IPI00909008
            ProteinModelPortal:F5H6C5 SMR:F5H6C5 Ensembl:ENST00000535967
            ArrayExpress:F5H6C5 Bgee:F5H6C5 Uniprot:F5H6C5
        Length = 272

 Score = 281 (104.0 bits), Expect = 3.0e-23, P = 3.0e-23
 Identities = 77/256 (30%), Positives = 130/256 (50%)

Query:   396 IEQVESLMPIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPS 455
             +E   +L PI+DM + +L  +   Q+ T  G     SLRI+R G+ + E A   LPG+  
Sbjct:     1 METFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGI-K 59

Query:   456 AVWTVKKNVNDEFDAYIVVSFNNATLVLSI-GETVEEVSDSGFLDTTPSLAVSLIGDDSL 514
              +W ++ + N E D  +V+SF   T VL + GE VEE    GF+D   +     +    L
Sbjct:    60 GLWPLRSDPNRETDDTLVLSFVGQTRVLMLNGEEVEETELMGFVDDQQTFFCGNVAHQQL 119

Query:   515 MQVHPSGIRHIREDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQL 572
             +Q+  + +R + ++ +  ++EW+ P  + I     N  QVV+A+ G  L Y ++    +L
Sbjct:   120 IQITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAV-GRALYYLQIHPQ-EL 177

Query:   573 LEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSY-DNTIRILSLDPDDCMQILSVQSVS 631
              ++   EM  +VACLDI  + +    S   A+G + D + RIL L P    ++L  + + 
Sbjct:   178 RQISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKL-PS--FELLHKEMLG 234

Query:   632 SP--PESLLFLEVQAS 645
                 P S+L    ++S
Sbjct:   235 GEIIPRSILMTTFESS 250


>UNIPROTKB|F1M680 [details] [associations]
            symbol:Ddb1 "DNA damage-binding protein 1" species:10116
            "Rattus norvegicus" [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178 RGD:621889
            GO:GO:0005634 Gene3D:2.130.10.10 GO:GO:0003676 IPI:IPI00950036
            Ensembl:ENSRNOT00000063867 ArrayExpress:F1M680 Uniprot:F1M680
        Length = 600

 Score = 215 (80.7 bits), Expect = 9.3e-20, Sum P(2) = 9.3e-20
 Identities = 89/353 (25%), Positives = 159/353 (45%)

Query:   864 VLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKR-NIVAGYI 922
             ++D  +       +   NE A S+ +     K+  T   VGTA     +P+      G I
Sbjct:   252 IIDQHTFEVLHAHQFLQNEYALSLVSCKL-GKDPNTYFIVGTA---MVYPEEAEPKQGRI 307

Query:   923 HIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDLGKKRLLR-KCENK 981
              ++++  +GK L+ + + +V+G   ++ +F G+LLA I   +RLY+   ++ LR +C + 
Sbjct:   308 VVFQY-SDGK-LQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTTEKELRTECNHY 365

Query:   982 LFPNTIVSI--NTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFD 1039
                N I+++   T  D I VGD+  S     Y+  E      A D  P W++A   +D D
Sbjct:   366 ---NNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDD 422

Query:  1040 TMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQF--HVGDV 1097
                GA+   N++  +  +D +   +E+    +   E G  +     + E V    H   V
Sbjct:   423 NFLGAENAFNLFVCQ--KDSAATTDEERQHLQ---EVGLFH-----LGEFVNVFCHGSLV 472

Query:  1098 VTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDH 1157
             + +L + S  P  G SV++GTV G +G + + S     +    ++  + +    LC   H
Sbjct:   473 MQNLGETS-TPTQG-SVLFGTVNGMIGLVTSLSE-SWYNLLLDMQNRLNKVIKSLCSLTH 529

Query:  1158 M-AYRSAYF-----PVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPGEILKK 1204
             +  +RS +      P    IDGDL E F  +S    +++   L    G  +K+
Sbjct:   530 LFTWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKR 582

 Score = 108 (43.1 bits), Expect = 9.3e-20, Sum P(2) = 9.3e-20
 Identities = 33/136 (24%), Positives = 62/136 (45%)

Query:   651 GADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCL 710
             G   P+S +L   L +G LF   +++ TG LSD +   LG +P  L +        +   
Sbjct:    52 GRLEPSSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFAC 111

Query:   711 SSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCVEGVVSVAGNALRVFTIERLGETFN 770
             S RP + Y    + + + ++ + + Y    +SD   + +     + L + TI+ + +   
Sbjct:   112 SDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKLHI 171

Query:   771 ETALPLRYTPRRFVLQ 786
              T +PL  +PR+   Q
Sbjct:   172 RT-VPLYESPRKICYQ 186

 Score = 38 (18.4 bits), Expect = 1.8e-12, Sum P(2) = 1.8e-12
 Identities = 10/27 (37%), Positives = 17/27 (62%)

Query:   552 VVIALSGGELIYFEVDM-TGQLLEVEK 577
             ++ AL  G L YF +++ TG L + +K
Sbjct:    61 LLCALGDGALFYFGLNIETGLLSDRKK 87


>FB|FBgn0024698 [details] [associations]
            symbol:Cpsf160 "Cleavage and polyadenylation specificity
            factor 160" species:7227 "Drosophila melanogaster" [GO:0005847
            "mRNA cleavage and polyadenylation specificity factor complex"
            evidence=ISS;NAS] [GO:0006378 "mRNA polyadenylation"
            evidence=ISS;NAS] [GO:0006379 "mRNA cleavage" evidence=ISS;NAS]
            [GO:0003730 "mRNA 3'-UTR binding" evidence=ISS] [GO:0003729 "mRNA
            binding" evidence=NAS] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0022008 "neurogenesis" evidence=IMP] InterPro:IPR004871
            Pfam:PF03178 EMBL:AE013599 GO:GO:0022008 GO:GO:0006378
            GO:GO:0003723 eggNOG:COG5161 KO:K14401 GO:GO:0005847 GO:GO:0006379
            GeneTree:ENSGT00550000075040 OMA:NIGDNRY EMBL:AF241364
            EMBL:AF241365 EMBL:AF241366 EMBL:AY051896 RefSeq:NP_725397.1
            RefSeq:NP_995833.1 UniGene:Dm.3414 ProteinModelPortal:Q9V726
            STRING:Q9V726 PaxDb:Q9V726 PRIDE:Q9V726 EnsemblMetazoa:FBtr0089258
            GeneID:44250 KEGG:dme:Dmel_CG10110 CTD:44250 FlyBase:FBgn0024698
            InParanoid:Q9V726 OrthoDB:EOG4ZCRK8 PhylomeDB:Q9V726
            GenomeRNAi:44250 NextBio:837008 Bgee:Q9V726 GermOnline:CG10110
            Uniprot:Q9V726
        Length = 1455

 Score = 116 (45.9 bits), Expect = 1.5e-17, Sum P(6) = 1.5e-17
 Identities = 82/356 (23%), Positives = 147/356 (41%)

Query:   880 DNEAAFSICTVNFHDKEHGT--LLAVGTAKGLQFWPKRNIVA-GYIHIYRFVE----EGK 932
             ++  AF I  +++     G    L +GT     F    +I + G IHIY  +E     GK
Sbjct:  1111 EHVTAFKIVKLSYEGTRSGLKEYLCIGT----NFNYSEDITSRGNIHIYDIIEVVPEPGK 1166

Query:   933 SL------ELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDLGKKRLLRKC--ENKLFP 984
              +      E+  K Q +G   A+    G L+ G+G  + ++ L    L+     +  ++ 
Sbjct:  1167 PMTKFKIKEIFKKEQ-KGPVSAISDVLGFLVTGLGQKIYIWQLRDGDLIGVAFIDTNIYV 1225

Query:   985 NTIVSINTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMAG- 1043
             + I+++ +    I++ D+ +S    +++ +   L + + D  P  +     +  ++  G 
Sbjct:  1226 HQIITVKSL---IFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDNSNLGF 1282

Query:  1044 --ADKFGNIYFVRLPQDVSDEIEEDPTGGKI-KWEQGKLNGAPNKMEEIVQFHVGDVVTS 1100
                D   NI  V + Q    E  E   G K+ +     L    N M   VQ H   +   
Sbjct:  1283 LVTDAERNI-IVYMYQP---EARESLGGQKLLRKADYHLGQVVNTMFR-VQCHQKGLH-- 1335

Query:  1101 LQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMR-QEHPPLCGRDHMA 1159
              Q+   +      V+YGT+ G+LG  L    +    F     + +  QEH  LCG +   
Sbjct:  1336 -QRQPFLYENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEH--LCGLNPKE 1392

Query:  1160 YRS-------AYFPVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPGEILKKLEEI 1208
             YR+          P + +IDGDL   +  ++   + ++A ++     EIL  L EI
Sbjct:  1393 YRTLKSSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDLLEI 1448

 Score = 110 (43.8 bits), Expect = 1.5e-17, Sum P(6) = 1.5e-17
 Identities = 38/144 (26%), Positives = 70/144 (48%)

Query:   457 VWTV------KKNVNDEFDAYIVVSFNNATLVLSIGETVEEVSDSGFLDTTPSLAVSLIG 510
             VWTV      K + ND+ D ++++S  N+TLVL  G+ + E+ ++GF    P++ V  +G
Sbjct:   563 VWTVFDDATKKSSRNDQHD-FMLLSQRNSTLVLQTGQEINEIENTGFTVNQPTIFVGNLG 621

Query:   511 DDS-LMQVHPSGIRHIREDGRINEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMT 569
                 ++QV    +R ++    I          +V+V      V + +  G++I   +  T
Sbjct:   622 QQRFIVQVTTRHVRLLQGTRLIQNVPIDVGSPVVQVSIADPYVCLRVLNGQVITLALRET 681

Query:   570 -GQ-LLEVEKHEMSGDVACLDIAS 591
              G   L + KH +S   A + I++
Sbjct:   682 RGTPRLAINKHTISSSPAVVAISA 705

 Score = 105 (42.0 bits), Expect = 1.5e-17, Sum P(6) = 1.5e-17
 Identities = 55/204 (26%), Positives = 82/204 (40%)

Query:   246 GVLVCAENFVIYKNQGHPDVRAVIPRRAD------LPAERGVLI-VSAATHRQKTLFFFL 298
             G LV   N VIY NQ  P     +   AD      L  + GV I +  A      +   +
Sbjct:   288 GCLVMTVNAVIYLNQSVPPYGVSLNSSADNSTAFPLKPQDGVRISLDCANFAFIDVDKLV 347

Query:   299 LQTEYGDIFKVTLEHDNEH-VSELKIKYFDTIPVTASMCVLKSGYLFAASEFGNHALYQF 357
             +    GD++ +TL  D+   V            +T+ +CVL S Y+F  S  GN  L  F
Sbjct:   348 ISLRTGDLYVLTLCVDSMRTVRNFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLLHF 407

Query:   358 QAIGADPDVEASSSTLMETEEGFQPVFFQPRGLK----NLVRIEQVESLMPIMDMRIANL 413
                      E   ST++  +E  Q    Q R L+    NL  I  V+ L        +  
Sbjct:   408 --------TEEDQSTVITLDEVEQQSEQQQRNLQDEDQNLEEIFDVDQLEMAPTQAKSRR 459

Query:   414 FEEEAPQIFTLCGRGPRSSLRILR 437
              E+E  +++   G G ++S+  LR
Sbjct:   460 IEDEELEVY---GSGAKASVLQLR 480

 Score = 85 (35.0 bits), Expect = 1.5e-17, Sum P(6) = 1.5e-17
 Identities = 22/105 (20%), Positives = 47/105 (44%)

Query:    48 RIETLVSTEIFGAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKIHQETFG 107
             R+E L +  ++G + SL    L G+ +D +++     ++ +L+++P       +    F 
Sbjct:    67 RLECLATYTLYGNVMSLQCVSLAGAMRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFE 126

Query:   108 KSGCRRIVPGQYLA----VDPKGRAVMIGACEKQKLVYVLNRDTA 148
             +   R    G+Y      VDP  R  ++    K+ +V    +D +
Sbjct:   127 EDDIRGGWTGRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNS 171

 Score = 50 (22.7 bits), Expect = 6.9e-06, Sum P(4) = 6.9e-06
 Identities = 31/143 (21%), Positives = 62/143 (43%)

Query:   487 ETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHIREDGRINEWRTPGKRTIVKVG 546
             + +EE+ D   L+  P+ A S   +D  ++V+ SG +       + + R    + I +V 
Sbjct:   437 QNLEEIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAK-----ASVLQLR----KFIFEVC 487

Query:   547 SNRLQV--VIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAV 604
              + + V  +  +  GE + FE D  G  L      +  D+    +A+    +  +  + V
Sbjct:   488 DSLMNVAPINYMCAGERVEFEED--GVTLRPHAESLQ-DLKIELVAATGHSKNGALSVFV 544

Query:   605 GSYDNTIRILSLDPDDCMQILSV 627
                +  I I S + D C+ + +V
Sbjct:   545 NCINPQI-ITSFELDGCLDVWTV 566

 Score = 45 (20.9 bits), Expect = 1.5e-17, Sum P(6) = 1.5e-17
 Identities = 9/30 (30%), Positives = 17/30 (56%)

Query:   774 LPLRYTPRRFVLQPKKKLMVIIETDQGALT 803
             +PLR TPR+ V   + ++  +I   +  +T
Sbjct:  1029 VPLRCTPRQLVYHRENRVYCLITQTEEPMT 1058

 Score = 44 (20.5 bits), Expect = 2.6e-05, Sum P(4) = 2.6e-05
 Identities = 14/53 (26%), Positives = 21/53 (39%)

Query:   618 PDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLF 670
             P DC+Q+  +Q    P    L + V A +        P  + LN+   N   F
Sbjct:   273 PFDCLQVYPIQK---PIGGCLVMTVNAVIYLNQSVP-PYGVSLNSSADNSTAF 321

 Score = 40 (19.1 bits), Expect = 1.5e-17, Sum P(6) = 1.5e-17
 Identities = 16/62 (25%), Positives = 27/62 (43%)

Query:   593 PEGRKRSRFLAVGSY---DNTIRILSLDPDDCMQILSVQSVSSPPESLLFLEVQASVGGE 649
             P+G  + RF  +      D     + LD +D  +   ++S    P+ +  L   A+VGG 
Sbjct:   896 PKGHLKIRFRKMDQLNLLDQQPTHIDLDENDEQE--EIESYQMQPKYVQKLRPFANVGGL 953

Query:   650 DG 651
              G
Sbjct:   954 SG 955

 Score = 37 (18.1 bits), Expect = 0.00012, Sum P(4) = 0.00012
 Identities = 10/21 (47%), Positives = 12/21 (57%)

Query:   243 GPSGVLVCAEN--FVIYKNQG 261
             G SGV+VC  N  FV    +G
Sbjct:   952 GLSGVMVCGVNPCFVFLTFRG 972


>ASPGD|ASPL0000052925 [details] [associations]
            symbol:ddbA species:162425 "Emericella nidulans"
            [GO:0006282 "regulation of DNA repair" evidence=IEA;ISA]
            [GO:0006974 "response to DNA damage stimulus" evidence=IEP;IMP]
            [GO:0005634 "nucleus" evidence=IDA] [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0005730 "nucleolus" evidence=IEA]
            [GO:0005829 "cytosol" evidence=IEA] [GO:0070913 "Ddb1-Wdr21
            complex" evidence=IEA] [GO:0008180 "signalosome" evidence=IEA]
            [GO:0070912 "Ddb1-Ckn1 complex" evidence=IEA] [GO:0031465
            "Cul4B-RING ubiquitin ligase complex" evidence=IEA] [GO:0006283
            "transcription-coupled nucleotide-excision repair" evidence=IEA]
            [GO:0040020 "regulation of meiosis" evidence=IEA] [GO:0042787
            "protein ubiquitination involved in ubiquitin-dependent protein
            catabolic process" evidence=IEA] [GO:0007090 "regulation of S phase
            of mitotic cell cycle" evidence=IEA] [GO:0034644 "cellular response
            to UV" evidence=IEA] InterPro:IPR004871 InterPro:IPR015943
            Pfam:PF03178 GO:GO:0005634 Gene3D:2.130.10.10 EMBL:BN001308
            GO:GO:0003676 EMBL:AACD01000007 KO:K10610 OMA:DRPAVIY
            OrthoDB:EOG473T0C RefSeq:XP_658200.1 STRING:Q5BFT4
            EnsemblFungi:CADANIAT00002078 GeneID:2876375 KEGG:ani:AN0596.2
            eggNOG:NOG316722 HOGENOM:HOG000216556 Uniprot:Q5BFT4
        Length = 1132

 Score = 144 (55.7 bits), Expect = 9.1e-17, Sum P(3) = 9.1e-17
 Identities = 94/430 (21%), Positives = 172/430 (40%)

Query:   396 IEQVESLMPIMDMRIANL-----------FEEEAPQIFTLCGRGPRSSLRILRPGLAVSE 444
             I+ + ++ P++D  I +L           F     +I T  G     +LR +R G+ + E
Sbjct:   365 IQTLSNIAPVLDFTIMDLGNRTSENQMHEFSSGQARIVTGSGAFDDGTLRSVRSGVGLEE 424

Query:   445 MAV-SQLPGVPSAVWTVKKNVNDEFDAYIVVSFNNATLVLSI---GETVEEVSDSGF-LD 499
             + V   +  +   +W ++     +F   ++V+F N T V      GE  E  S  G  L 
Sbjct:   425 LGVLGDMEHITD-LWGLQVGSRGDFLDTLLVTFVNETRVFRFSPDGEAEELESFLGLSLS 483

Query:   500 TTPSLAVSLIGDDSLMQVHPSG--IRHIREDGRINEWRTPGKRTIVKVGSNRLQVVIALS 557
                 LA +L G   ++QV      I  I     I EW TP  + I+   S     ++ ++
Sbjct:   484 ENTLLAANLPGS-RILQVTEQRVLIADIECGMTIFEW-TPKNQLIITAASANDDTIVLVA 541

Query:   558 GGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLAVGSYDNTIRILSLD 617
             GG+ +   +D+  +   V + +   D     I+ V      +    VG +    ++  L 
Sbjct:   542 GGKHVTV-LDIQSEARVVSEKDFGADN---QISGVTLPTTPTDVCIVG-FPQLAKVSVLK 596

Query:   618 PDDCMQILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMV 677
               D   I S  S+    E+     + ASV  E+    P +LF++  + +G +     +  
Sbjct:   597 LQDLSHISST-SLGPAGEAFPRSVLVASVLAENA---P-TLFIS--MADGSVITYDYNDQ 649

Query:   678 TGQLSDSRSRFLGLRPPKLFSVVVG-GRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEY 736
                LS      LG   P    +  G G + +      P L Y   GR + + ++ E    
Sbjct:   650 DHSLSGMNKLVLGSEQPTFKKLPRGNGLSNVFATCENPSLIYGSEGRIIYSAVNSEGASR 709

Query:   737 AASFSSDQCVEGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQPKKKLMVIIE 796
                F+S+   E +       L++  +++   T  +T LP++ T RR    P +K   +  
Sbjct:   710 ICHFNSEAYPESIAVATAQELKIGLVDKERTTQIQT-LPIKATVRRVAYSPSEKAFGMGT 768

Query:   797 TDQGALTAEE 806
              ++  ++ EE
Sbjct:   769 IERKLVSGEE 778

 Score = 131 (51.2 bits), Expect = 9.1e-17, Sum P(3) = 9.1e-17
 Identities = 73/305 (23%), Positives = 131/305 (42%)

Query:   920 GYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDL----GKKRLL 975
             GYI ++  V+ G+ L  + + +V+G   AL     +++A +   + ++ +    G  +L 
Sbjct:   843 GYIRVFE-VDNGRKLAKVAQERVKGACRALAVMGDKIVAALVKTVVVFQVVPRSGGLQLQ 901

Query:   976 RKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDEN----QLYIFADDSVPRWLT 1031
             R    +      V I   R+ I + D+ +S    +Y   EN    +L   A      W T
Sbjct:   902 RLASYRT-STAPVDITVTRNVIAIADLMKSVCVVEYHEGENGAPDKLVEVARHFQTVWAT 960

Query:  1032 AAHHIDFDTMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQ 1091
                 +  DT   +D  GN+  +R  ++ S   E+D    ++  E   LN   N++  +  
Sbjct:   961 GVTSVAPDTYLESDAEGNLIVLR--RNRSGVEEDDRRRLEVTGEIC-LNEMVNRIRPVN- 1016

Query:  1092 FHVGDVVTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPP 1151
                   +  L  A++VP         TV GS+  + A  + D  DF   L+  M      
Sbjct:  1017 ------IQQLPSATVVP----RAFLATVEGSI-YLYAIINPDYQDFLMRLQATMASRADS 1065

Query:  1152 LCG---RDHMAYRS----AYFPVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPGEILKK 1204
             L G    D+ A+R+    A  P +  +DG+L E+F T    +Q++I D +  +  E+   
Sbjct:  1066 LGGIPFTDYRAFRTMTRQATEPYR-FVDGELIERFLTCEPAVQKEIVDIVGSSLEEVRAI 1124

Query:  1205 LEEIR 1209
             +E +R
Sbjct:  1125 VEALR 1129

 Score = 79 (32.9 bits), Expect = 9.1e-17, Sum P(3) = 9.1e-17
 Identities = 70/366 (19%), Positives = 131/366 (35%)

Query:     4 YSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAIRS 63
             Y   + + + I  A+  +F   +   +VVA+   LE       G +  + S  IF  +  
Sbjct:     3 YIAPIHRASSIRHALKLHFLNAEDECLVVAKANQLEFYSVTPDG-LALVTSCSIFARVTM 61

Query:    64 LAQFRL-TGSQKDYIVVGSDSGRIVILEYNPSKN-VFDKIHQETFGKSGCRRIVPGQYLA 121
             LA       S  D++ VG+D      L ++ ++N V  +           R    G    
Sbjct:    62 LACLPAPANSPTDHLFVGTDRYSYFTLSWDSARNQVRTERDYVDIADPSSRDARTGSRCM 121

Query:   122 VDPKGR---------------AVMIGACEKQKLVYVLNRDTAARL-TISSPLEAHKSHTI 165
             +DP GR                + + +  + + V +     A R+  +  P+        
Sbjct:   122 IDPSGRFMTLEIYDGMIVVIPIIQLPSKRRGRQVALPTGPDAPRIGELGEPIITRIDELF 181

Query:   166 VYSICGIDCGFDNPIFAAIELDYSEADQDSTGQAASEAQKNLTFYELDLGLNHVSRKWSE 225
             V S   +     +P  A +   Y E +Q        E  K  T    +     ++  +++
Sbjct:   182 VRSSAFLHVQAGSPRLALL---Y-EDNQKKVKLKVREL-KYSTAAGAESEFTSIA-DYAQ 235

Query:   226 PVDNGANMLVTVPGGGDGPSGVLVCAENFVIYKNQGHPDVRAVIPRRADLPAERGVLIVS 285
              +D GA+ L+ VP       G+L+  E  + Y +  + ++          P E   + V+
Sbjct:   236 ELDLGASHLIPVPAPLAAAGGLLILGETSIKYVDADNNEI-------VSQPLEEATIFVA 288

Query:   286 AATHRQKTLFFFLLQTEYGDIFKVTLEHDNEHVSELKIKYFDTIPVTASMCVLKSGYLFA 345
                  Q     +LL  +YG +F + L   N  V   ++         + +  L  G +F 
Sbjct:   289 ---WEQVDSQRWLLADDYGRLFFLMLVLRNSEVERWELHSLGNTSRASVLVYLGGGVVFV 345

Query:   346 ASEFGN 351
              S  G+
Sbjct:   346 GSHQGD 351

 Score = 52 (23.4 bits), Expect = 2.1e-07, Sum P(3) = 2.1e-07
 Identities = 19/72 (26%), Positives = 32/72 (44%)

Query:   538 GKRTIVKVGSNRLQVVIALSG-GELIYFEV-DMTGQLLEVEKHEMSGDVACLDIAS--VP 593
             G   ++++G    QV+  LS    ++ F + D+  +  E + HE S   A +   S    
Sbjct:   350 GDSQVIRIGDQSFQVIQTLSNIAPVLDFTIMDLGNRTSENQMHEFSSGQARIVTGSGAFD 409

Query:   594 EGRKRSRFLAVG 605
             +G  RS    VG
Sbjct:   410 DGTLRSVRSGVG 421

 Score = 43 (20.2 bits), Expect = 7.5e-08, Sum P(3) = 7.5e-08
 Identities = 6/13 (46%), Positives = 10/13 (76%)

Query:   886 SICTVNFHDKEHG 898
             S+C V +H+ E+G
Sbjct:   930 SVCVVEYHEGENG 942

 Score = 37 (18.1 bits), Expect = 6.5e-06, Sum P(3) = 6.5e-06
 Identities = 11/32 (34%), Positives = 17/32 (53%)

Query:   559 GELIYFEVDMTGQLLEVEKHEMSGDVACLDIA 590
             G +  FEVD   +L +V +  + G  AC  +A
Sbjct:   843 GYIRVFEVDNGRKLAKVAQERVKG--ACRALA 872


>TAIR|locus:2153122 [details] [associations]
            symbol:CPSF160 "cleavage and polyadenylation specificity
            factor 160" species:3702 "Arabidopsis thaliana" [GO:0003676
            "nucleic acid binding" evidence=IEA] [GO:0005634 "nucleus"
            evidence=ISM;IEA;IDA] [GO:0006378 "mRNA polyadenylation"
            evidence=ISS] [GO:0006379 "mRNA cleavage" evidence=ISS] [GO:0005515
            "protein binding" evidence=IPI] [GO:0005829 "cytosol" evidence=IDA]
            [GO:0006397 "mRNA processing" evidence=RCA] [GO:0009909 "regulation
            of flower development" evidence=RCA] [GO:0016570 "histone
            modification" evidence=RCA] [GO:0048449 "floral organ formation"
            evidence=RCA] InterPro:IPR004871 Pfam:PF03178 GO:GO:0005829
            GO:GO:0005634 EMBL:CP002688 GenomeReviews:BA000015_GR GO:GO:0006397
            GO:GO:0003723 eggNOG:COG5161 KO:K14401 EMBL:AY140902 EMBL:AB025607
            EMBL:AB018109 IPI:IPI00533913 RefSeq:NP_199979.2 UniGene:At.43551
            IntAct:Q9FGR0 STRING:Q9FGR0 PaxDb:Q9FGR0 PRIDE:Q9FGR0
            EnsemblPlants:AT5G51660.1 GeneID:835240 KEGG:ath:AT5G51660
            TAIR:At5g51660 HOGENOM:HOG000265012 InParanoid:Q9FGR0 OMA:NIGDNRY
            PhylomeDB:Q9FGR0 ProtClustDB:CLSN2680511 Genevestigator:Q9FGR0
            GermOnline:AT5G51660 Uniprot:Q9FGR0
        Length = 1442

 Score = 174 (66.3 bits), Expect = 8.2e-16, Sum P(4) = 8.2e-16
 Identities = 93/378 (24%), Positives = 166/378 (43%)

Query:   862 IRVLDP-RSAN---TTCLLELQDNEAAFSICTV---NFHDKEHGTLLAVGTAKGLQFWPK 914
             I++L+P RS     T   + +Q +E A ++  V   N    E+ TLLAVGTA    +   
Sbjct:  1085 IQILEPERSGGPWETKAKIPMQTSEHALTVRVVTLLNASTGENETLLAVGTA----YVQG 1140

Query:   915 RNIVA-GYIHIYRFVEEGKSLELL----HKTQVEGIPLALCQFQGRLLAGIGPVLRLYDL 969
              ++ A G + ++ F + G + + +    +  +++G   A+   QG LL   GP + L+  
Sbjct:  1141 EDVAARGRVLLFSFGKNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISSGPKIILHKW 1200

Query:   970 GKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDENQLYIFADD--SVP 1027
                 L         P  +VS+N  +  I +GD+ +S +F  ++   +QL + A D  S+ 
Sbjct:  1201 NGTELNGVAFFDAPPLYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLSLLAKDFESLD 1260

Query:  1028 RWLTAAHHIDFDTM--AGADKFGNIY-FVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPN 1084
              + T    ID  T+  A +D+  NI  F   P+ +              W+  KL     
Sbjct:  1261 CFATE-FLIDGSTLSLAVSDEQKNIQVFYYAPKMIES------------WKGLKLLSR-- 1305

Query:  1085 KMEEIVQFHVGDVVTSLQKASLVPGGGESV-----IYGTVMGSLGAMLAFSSRDDVDF-- 1137
                   +FHVG  V+   +  +V  G + +     ++GT+ GS G +      D+V F  
Sbjct:  1306 -----AEFHVGAHVSKFLRLQMVSSGADKINRFALLFGTLDGSFGCIAPL---DEVTFRR 1357

Query:  1138 FSHLEMHMRQEHPPLCGRDHMAYRSAYFPVK-------DVIDGDLCEQFPTLSLDLQRKI 1190
                L+  +    P + G + +A+R      K        ++D +L   +  L L+ Q ++
Sbjct:  1358 LQSLQKKLVDAVPHVAGLNPLAFRQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLEL 1417

Query:  1191 ADELDRTPGEILKKLEEI 1208
             A ++  T   ILK L ++
Sbjct:  1418 AHQIGTTRYSILKDLVDL 1435

 Score = 95 (38.5 bits), Expect = 8.2e-16, Sum P(4) = 8.2e-16
 Identities = 47/208 (22%), Positives = 86/208 (41%)

Query:   407 DMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTV--KK-- 462
             D     + ++   ++    G G   +L +LR  +    +   +LPG    +WTV  K   
Sbjct:   529 DANATGVSKQSNYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGC-KGIWTVYHKSSR 587

Query:   463 --NVN--------DEFDAYIVVSFNNATLVLSIGETVEEVSDSG--FLDTTPSLAVSLIG 510
               N +        DE+ AY+++S    T+VL   + + EV++S   ++      A +L G
Sbjct:   588 GHNADSSKMAADEDEYHAYLIISLEARTMVLETADLLTEVTESVDYYVQGRTIAAGNLFG 647

Query:   511 DDSLMQVHPSGIRHIREDGRINEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTG 570
                ++QV   G R I +   +N+  + G         +    V ++S  +  Y  + MT 
Sbjct:   648 RRRVIQVFEHGAR-ILDGSFMNQELSFGASNSESNSGSESSTVSSVSIADP-YVLLRMTD 705

Query:   571 QLLEVEKHEMSGDVACLDIASVPEGRKR 598
               + +   + S     +   SV EG KR
Sbjct:   706 DSIRLLVGDPSTCTVSISSPSVLEGSKR 733

 Score = 72 (30.4 bits), Expect = 8.2e-16, Sum P(4) = 8.2e-16
 Identities = 44/177 (24%), Positives = 71/177 (40%)

Query:   223 WSE-PVDNGANMLVTVPGGGDGP-SGVLVCAENFVIYKNQGHPDVRAV--IPRRAD---- 274
             WS   + + A  L+ VP     P  GVLV   N + Y +Q      A+      AD    
Sbjct:   302 WSAINLPHDAYKLLAVPS----PIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQE 357

Query:   275 LPAERGVLIVSAATHRQKTLFFFLLQTEYGDIFKVTLEHDNEHVSELKIKYFDTIPVTAS 334
             LPA    + + AA     +    LL T+ G++  +TL +D   V  L +       + + 
Sbjct:   358 LPASNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASD 417

Query:   335 MCVLKSGYLFAASEFGNHALYQFQAIGADPDVEASSSTLMETEEGFQPVFFQPRGLK 391
             +  + +   F  S  G+  L QF    + P   AS   L + +E  +    Q + L+
Sbjct:   418 ITSVGNSLFFLGSRLGDSLLVQFSC-RSGP--AASLPGLRDEDEDIEGEGHQAKRLR 471

 Score = 53 (23.7 bits), Expect = 8.2e-16, Sum P(4) = 8.2e-16
 Identities = 27/119 (22%), Positives = 50/119 (42%)

Query:    20 GNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAIRSLAQFRLTGSQ----KD 75
             GN    + P++   RG V++ +   +   +E +    + G + S+A   + G      +D
Sbjct:    79 GNTQELRNPKLA-KRGGVMDGVYGVS---LELVCHYRLHGNVESIAVLPMGGGNSSKGRD 134

Query:    76 YIVVGSDSGRIVILEYNPSKNVFDKIHQETF-G------KSGCRRIVPGQYLAVDPKGR 127
              I++     +I +LE++ S +         F G      K G      G  + VDP+GR
Sbjct:   135 SIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESFPRGPLVKVDPQGR 193

 Score = 44 (20.5 bits), Expect = 1.0e-10, Sum P(4) = 1.0e-10
 Identities = 10/23 (43%), Positives = 14/23 (60%)

Query:   608 DNTIRILSLDPDDCMQILSVQSV 630
             D++IR+L  DP  C   +S  SV
Sbjct:   705 DDSIRLLVGDPSTCTVSISSPSV 727

 Score = 43 (20.2 bits), Expect = 1.3e-10, Sum P(4) = 1.3e-10
 Identities = 11/30 (36%), Positives = 15/30 (50%)

Query:   452 GVPSAVWTVKKNVNDEFDAYIVVSFNNATL 481
             GV  AV +V     D+ D Y VV + +  L
Sbjct:   762 GVGEAVDSVDGGPQDQGDIYCVVCYESGAL 791


>UNIPROTKB|Q10570 [details] [associations]
            symbol:CPSF1 "Cleavage and polyadenylation specificity
            factor subunit 1" species:9606 "Homo sapiens" [GO:0003730 "mRNA
            3'-UTR binding" evidence=IDA] [GO:0006379 "mRNA cleavage"
            evidence=IDA] [GO:0006378 "mRNA polyadenylation" evidence=IDA]
            [GO:0005847 "mRNA cleavage and polyadenylation specificity factor
            complex" evidence=IDA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0000398 "mRNA splicing, via spliceosome" evidence=TAS]
            [GO:0005654 "nucleoplasm" evidence=TAS] [GO:0006366 "transcription
            from RNA polymerase II promoter" evidence=TAS] [GO:0006369
            "termination of RNA polymerase II transcription" evidence=TAS]
            [GO:0006397 "mRNA processing" evidence=TAS] [GO:0006406 "mRNA
            export from nucleus" evidence=TAS] [GO:0008380 "RNA splicing"
            evidence=TAS] [GO:0010467 "gene expression" evidence=TAS]
            [GO:0031124 "mRNA 3'-end processing" evidence=TAS]
            Reactome:REACT_71 InterPro:IPR004871 Pfam:PF03178
            Reactome:REACT_1675 GO:GO:0006378 GO:GO:0003730 GO:GO:0006406
            GO:GO:0000398 Reactome:REACT_1788 eggNOG:COG5161 KO:K14401
            GO:GO:0005847 GO:GO:0006379 GO:GO:0006369 Reactome:REACT_78
            OMA:NIGDNRY CTD:29894 HOGENOM:HOG000007904 HOVERGEN:HBG051105
            OrthoDB:EOG4BCDM3 EMBL:U37012 EMBL:BC017232 IPI:IPI00026219
            RefSeq:NP_037423.2 UniGene:Hs.493202 ProteinModelPortal:Q10570
            DIP:DIP-32694N IntAct:Q10570 MINT:MINT-1601544 STRING:Q10570
            PhosphoSite:Q10570 DMDM:23503048 PaxDb:Q10570 PeptideAtlas:Q10570
            PRIDE:Q10570 DNASU:29894 Ensembl:ENST00000349769
            Ensembl:ENST00000568627 GeneID:29894 KEGG:hsa:29894 UCSC:uc003zcj.3
            GeneCards:GC08M145618 HGNC:HGNC:2324 MIM:606027 neXtProt:NX_Q10570
            PharmGKB:PA26841 InParanoid:Q10570 PhylomeDB:Q10570 ChiTaRS:CPSF1
            GenomeRNAi:29894 NextBio:52452 ArrayExpress:Q10570 Bgee:Q10570
            CleanEx:HS_CPSF1 Genevestigator:Q10570 GermOnline:ENSG00000071894
            Uniprot:Q10570
        Length = 1443

 Score = 131 (51.2 bits), Expect = 1.0e-15, Sum P(6) = 1.0e-15
 Identities = 84/397 (21%), Positives = 168/397 (42%)

Query:   837 ENKYDPLS-DEQYGYPKAESDKWVSCIRVLDPRS--ANTTCLLELQDNEAAFSICTVNFH 893
             E +++ +  DE+Y +P+ E+      I+++ P S  A     +ELQ+ E    + TV+  
Sbjct:  1054 EKEFETIERDERYIHPQQEAFS----IQLISPVSWEAIPNARIELQEWEHVTCMKTVSLR 1109

Query:   894 DKEHGTLLAVGTAKGLQFWPKRNIVA-GYIHIYRFVE----EGKSL-----ELLHKTQVE 943
              +E  + L    A G        +   G I I   +E     G+ L     ++L++ + +
Sbjct:  1110 SEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQK 1169

Query:   944 GIPLALCQFQGRLLAGIGPVLRLYDLGKKRL--LRKCENKLFPNTIVSINTYRDRIYVGD 1001
             G   ALC   G L++ IG  + L+ L    L  +   + +L+ + ++S+  +   I   D
Sbjct:  1170 GPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQLYIHQMISVKNF---ILAAD 1226

Query:  1002 IQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMAG---ADKFGNIY-FVRLPQ 1057
             + +S    +Y+ +   L + + D+ P  + +   +  +   G   +D+  N+  ++ LP+
Sbjct:  1227 VMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPE 1286

Query:  1058 DVSDEIEEDPTGGKIKWEQGKLN-GAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIY 1116
                    ++  GG     +   + GA   +    +         L K S+V        +
Sbjct:  1287 ------AKESFGGMRLLRRADFHVGA--HVNTFWRTPCRGATEGLSKKSVVWENKHITWF 1338

Query:  1117 GTVMGSLGAMLAFSSRD--DVDFFSHLEMHMRQEHPPLCGRD----HMAYRSAYFPVKDV 1170
              T+ G +G +L    +    +    +    M   H  L  R     H+  R+    V++V
Sbjct:  1339 ATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNV 1398

Query:  1171 IDGDLCEQFPTLSLDLQRKIADELDRTPGEILKKLEE 1207
             +DG+L  ++  LS   + ++A ++  TP  IL  L E
Sbjct:  1399 LDGELLNRYLYLSTMERSELAKKIGTTPDIILDDLLE 1435

 Score = 91 (37.1 bits), Expect = 1.0e-15, Sum P(6) = 1.0e-15
 Identities = 18/54 (33%), Positives = 34/54 (62%)

Query:   471 YIVVSFNNATLVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDS-LMQVHPSGIR 523
             ++++S  ++T++L  G+ + E+  SGF    P++    IGD+  ++QV P GIR
Sbjct:   574 FLILSREDSTMILQTGQEIMELDTSGFATQGPTVFAGNIGDNRYIVQVSPLGIR 627

 Score = 87 (35.7 bits), Expect = 1.0e-15, Sum P(6) = 1.0e-15
 Identities = 24/106 (22%), Positives = 51/106 (48%)

Query:    48 RIETLVSTEIFGAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFD--KIH--Q 103
             ++E   S   FG + S+A  +L G+++D +++     ++ ++EY+P  +      +H  +
Sbjct:    65 KLELAASFSFFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFE 124

Query:   104 ETFGKSGCRRIVPGQYLAVDPKGRAVMIGACEKQKLVYVLNRDTAA 149
             E   + G  + V    + VDP GR   +     + +V    R++ A
Sbjct:   125 EPELRDGFVQNVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRESLA 170

 Score = 85 (35.0 bits), Expect = 1.0e-15, Sum P(6) = 1.0e-15
 Identities = 35/135 (25%), Positives = 59/135 (43%)

Query:   246 GVLVCAENFVIYKNQGHPD----VRAVIPRRADLP--AERGVLIVSAATHRQKTLFFF-- 297
             GV+V A N ++Y NQ  P     + ++       P   + GV I       Q T   +  
Sbjct:   277 GVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCA--QATFISYDK 334

Query:   298 -LLQTEYGDIFKVTLEHDNEHVSELKIKYFDTIP---VTASMCVLKSGYLFAASEFGNHA 353
              ++  + G+I+ +TL  D   +  ++  +FD      +T SM  ++ GYLF  S  GN  
Sbjct:   335 MVISLKGGEIYVLTLITDG--MRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSL 392

Query:   354 LYQFQAIGADPDVEA 368
             L ++     +P   A
Sbjct:   393 LLKYTEKLQEPPASA 407

 Score = 50 (22.7 bits), Expect = 1.0e-15, Sum P(6) = 1.0e-15
 Identities = 13/48 (27%), Positives = 21/48 (43%)

Query:   420 QIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDE 467
             +I    G G   +L +L+  +    +   +LPG    +WTV   V  E
Sbjct:   499 EIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYD-MWTVIAPVRKE 545

 Score = 45 (20.9 bits), Expect = 4.3e-12, Sum P(5) = 4.3e-12
 Identities = 21/83 (25%), Positives = 39/83 (46%)

Query:   130 MIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSICGIDCGFDNPI-FAAIELDY 188
             ++ A   Q  VY LNRD  A LT +      K+H     +      F N +  A+++L  
Sbjct:    30 LVVAGTSQLYVYRLNRDAEA-LTKNDRSTEGKAHREKLELAASFSFFGNVMSMASVQL-- 86

Query:   189 SEADQDSTGQAASEAQKNLTFYE 211
             + A +D+   +  +A+ ++  Y+
Sbjct:    87 AGAKRDALLLSFKDAKLSVVEYD 109

 Score = 45 (20.9 bits), Expect = 2.4e-06, Sum P(4) = 2.4e-06
 Identities = 13/61 (21%), Positives = 30/61 (49%)

Query:   271 RRADLPAERGVLIVSAATHRQKTLFFFLLQTEYGDIFKVTLEHDNE-HVSELKIKYFDTI 329
             R+ +LP  + VL+V+  + + +   + L+  +   +      HD++     LK++ F  +
Sbjct:   839 RQGELPLVKEVLLVALGSRQSRP--YLLVHVDQELLIYEAFPHDSQLGQGNLKVR-FKKV 895

Query:   330 P 330
             P
Sbjct:   896 P 896

 Score = 37 (18.1 bits), Expect = 1.0e-15, Sum P(6) = 1.0e-15
 Identities = 12/42 (28%), Positives = 17/42 (40%)

Query:     1 MYLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLR 42
             MY        PTG+  ++  NF       +VVA    L + R
Sbjct:     1 MYAVYKQAHPPTGLEFSMYCNFFNNSERNLVVAGTSQLYVYR 42


>UNIPROTKB|G4N4E2 [details] [associations]
            symbol:MGG_16867 "Uncharacterized protein" species:242507
            "Magnaporthe oryzae 70-15" [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178
            GO:GO:0005634 Gene3D:2.130.10.10 EMBL:CM001233 GO:GO:0003676
            RefSeq:XP_003712617.1 EnsemblFungi:MGG_16867T0 GeneID:12985117
            KEGG:mgr:MGG_16867 Uniprot:G4N4E2
        Length = 1183

 Score = 144 (55.7 bits), Expect = 1.4e-15, Sum P(3) = 1.4e-15
 Identities = 103/422 (24%), Positives = 183/422 (43%)

Query:   297 FLLQTEYGDIFKVTLEHDNEHVSELKIKYFDTIPVTASMCVLKSGY------LFAASEFG 350
             +LL  +YG +  +T++      S+  + +  T+ +  +    K  Y      LF AS +G
Sbjct:   304 YLLADDYGGLHLLTIQVKQN--SDTAVDHMSTVQIGTTSRATKLVYSETNRTLFVASHYG 361

Query:   351 NHALYQFQAIGADPDVEASSSTLMETEEGFQPVF-FQPRGLKNLVRIEQVESLMPIMDMR 409
             +   Y      AD     S   L +T E   P+  F    + N       E      D +
Sbjct:   362 DSQFYDVNLF-ADAAKGESFLELRQTIENIAPILDFAVMDMGNR------EG-----DSQ 409

Query:   410 IANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVSEMAV--SQLPGVPSAVWTVKKNVNDE 467
             + N +     +I T  G     SLR +R G+ + ++ V   ++ GV + ++++K   +D 
Sbjct:   410 LGNEYSSGQARIVTASGAQKDGSLRSVRSGVGLEDIGVITDEISGV-TGLFSLKSYGSDV 468

Query:   468 FDAYIVVSFNNATLVLSI---GETVEEVSDSGFLDTT-PSLAVSLIGDDSLMQVHPSGIR 523
              D  +VVSF   T V      GE VEE+S    LD + P+L V  + +  ++ V      
Sbjct:   469 EDT-LVVSFLTETRVFRFDKQGE-VEELSQLQGLDISQPTLLVLGLDNGHVLYVTEEKAT 526

Query:   524 HIREDG--RINEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQLLEVEKHEMS 581
                 +G   I+ W     + I    SN   V++++ G +L+   + +  ++   E  E  
Sbjct:   527 LFDAEGGVTISSWSPTSGKPITHASSNGRWVLLSVDGRKLVSLNIGLDLKV-SAESEERD 585

Query:   582 GD-VACLDIASVPEGRKRSRFLAVGSYDN-TIRILSLDPDDCMQILSVQSVSSPPESLLF 639
              D ++C++ AS P         AVG + + TI I+ L   +  Q   ++   +  ++++ 
Sbjct:   586 EDQISCVN-AS-PHLLDVG---AVGFWSSGTISIIDLKTLEATQTEKLRR--NEDDAVVA 638

Query:   640 LEVQ-ASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFS 698
              EV  A V   + A+ P +LF++    +G +  T V    G LS  +S  LG R  + F 
Sbjct:   639 REVVLARVLPAEVAN-P-TLFVSK--DDGEVM-TFVYNDNGTLSSRKSVVLGTREAR-FR 692

Query:   699 VV 700
             V+
Sbjct:   693 VL 694

 Score = 108 (43.1 bits), Expect = 1.4e-15, Sum P(3) = 1.4e-15
 Identities = 28/105 (26%), Positives = 51/105 (48%)

Query:    28 PEIVVARGKVLELLRPENSGRIETLVSTEIFGAIRSLAQFRLTGSQKDYIVVGSDSGRIV 87
             P +V+A+   LE+ R  + G+++   S  +FG I  L   R   S+ D + VG+D  +  
Sbjct:    27 PSLVLAKTNRLEIWRRTDEGQLKLEHSQSVFGKIVMLQAVRPKDSETDMLFVGTDRFKYF 86

Query:    88 ILEYNP-SKNVFDKIHQETFGKSGCRRIVPGQYLAVDPKGRAVMI 131
               EY+P ++ +  +      G+   R +       VDP GR +++
Sbjct:    87 TAEYDPDTRELVTRQAISDLGEQFVREVSSRNRCIVDPSGRYMVL 131

 Score = 91 (37.1 bits), Expect = 1.4e-15, Sum P(3) = 1.4e-15
 Identities = 45/188 (23%), Positives = 81/188 (43%)

Query:  1029 WLTAAHHIDFDTMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEE 1088
             W TA  H++ D+   AD  GN+  V L ++ +    ED    ++  E G L    NK+++
Sbjct:  1008 WSTAVSHLEGDSWIVADGDGNL--VVLLRNTAGVTLEDKRRMQMTSEFG-LGECVNKIQK 1064

Query:  1089 IVQFHVGDVVTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQE 1148
             ++          + KA L    G   ++GTV     ++L       +DF +++E H+   
Sbjct:  1065 VM-VETSANAPIVAKAFLSTTEGSIYLFGTVAPKFQSLL-------MDFQANMEAHVSS- 1115

Query:  1149 HPPLCGRDHMAYRSAYFPVKD------VIDGDLCEQFPTLSLDLQRKIADELDRTPGEIL 1202
               PL       +RS   P ++       +DG+  E F  +  + Q  I   L  T  ++ 
Sbjct:  1116 --PLGELQFNQWRSFRNPEREGAGPERFLDGEFLEMFLDMEENTQIDICQGLSYTAEDMR 1173

Query:  1203 KKLEEIRN 1210
               + E++N
Sbjct:  1174 NLIGEMKN 1181

 Score = 39 (18.8 bits), Expect = 2.7e-10, Sum P(3) = 2.7e-10
 Identities = 8/24 (33%), Positives = 13/24 (54%)

Query:  1094 VGDVVTSLQKASLVPGGGESVIYG 1117
             V D++ S+     +PG G+S   G
Sbjct:   954 VADIMKSITLLEYIPGVGKSAKTG 977


>UNIPROTKB|B4DG00 [details] [associations]
            symbol:DDB1 "cDNA FLJ52436, highly similar to DNA
            damage-binding protein 1" species:9606 "Homo sapiens" [GO:0003676
            "nucleic acid binding" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178
            GO:GO:0005634 Gene3D:2.130.10.10 GO:GO:0003676 EMBL:AP003108
            UniGene:Hs.290758 HGNC:HGNC:2717 ChiTaRS:DDB1 EMBL:AP003037
            EMBL:AK294341 IPI:IPI00909177 SMR:B4DG00 STRING:B4DG00
            Ensembl:ENST00000450997 UCSC:uc010rle.1 HOGENOM:HOG000069916
            HOVERGEN:HBG102355 Uniprot:B4DG00
        Length = 451

 Score = 204 (76.9 bits), Expect = 3.1e-15, Sum P(2) = 3.1e-15
 Identities = 87/352 (24%), Positives = 158/352 (44%)

Query:   864 VLDPRSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKR-NIVAGYI 922
             ++D  +       +   NE A S+ +     K+  T   VGTA     +P+      G I
Sbjct:   104 IIDQHTFEVLHAHQFLQNEYALSLVSCKL-GKDPNTYFIVGTA---MVYPEEAEPKQGRI 159

Query:   923 HIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDLGKKRLLR-KCENK 981
              ++++  +GK L+ + + +V+G   ++ +F G+LLA I   +RLY+   ++ LR +C + 
Sbjct:   160 VVFQY-SDGK-LQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTTEKELRTECNHY 217

Query:   982 LFPNTIVSI--NTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFD 1039
                N I+++   T  D I VGD+  S     Y+  E      A D  P W++A   +D D
Sbjct:   218 ---NNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDD 274

Query:  1040 TMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQF--HVGDV 1097
                GA+   N++  +  +D +   +E+    +   E G  +     + E V    H   V
Sbjct:   275 NFLGAENAFNLFVCQ--KDSAATTDEERQHLQ---EVGLFH-----LGEFVNVFCHGSLV 324

Query:  1098 VTSLQKASLVPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDH 1157
             + +L + S  P  G SV++GTV G +G + + S     +    ++  + +    +   +H
Sbjct:   325 MQNLGETS-TPTQG-SVLFGTVNGMIGLVTSLSE-SWYNLLLDMQNRLNKVIKSVGKIEH 381

Query:  1158 MAYRSAYF-----PVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPGEILKK 1204
               +RS +      P    IDGDL E F  +S    +++   L    G  +K+
Sbjct:   382 SFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKR 433

 Score = 71 (30.1 bits), Expect = 3.1e-15, Sum P(2) = 3.1e-15
 Identities = 16/67 (23%), Positives = 33/67 (49%)

Query:     2 YLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAI 61
             Y Y +T Q+PT +   + G+F+  +   +++A+   LE+      G +  +    ++G I
Sbjct:     3 YNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVVTAEG-LRPVKEVGMYGKI 61

Query:    62 RSLAQFR 68
               +  FR
Sbjct:    62 AVMELFR 68


>UNIPROTKB|Q10569 [details] [associations]
            symbol:CPSF1 "Cleavage and polyadenylation specificity
            factor subunit 1" species:9913 "Bos taurus" [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=ISS] [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006378
            "mRNA polyadenylation" evidence=IEA] [GO:0003730 "mRNA 3'-UTR
            binding" evidence=IEA] InterPro:IPR004871 Pfam:PF03178
            GO:GO:0006378 GO:GO:0003730 eggNOG:COG5161 KO:K14401 GO:GO:0005847
            GO:GO:0006379 GeneTree:ENSGT00550000075040 OMA:NIGDNRY EMBL:X83097
            IPI:IPI00713487 PIR:S57335 RefSeq:NP_777145.1 UniGene:Bt.4911
            STRING:Q10569 PRIDE:Q10569 Ensembl:ENSBTAT00000011004 GeneID:282703
            KEGG:bta:282703 CTD:29894 HOGENOM:HOG000007904 HOVERGEN:HBG051105
            InParanoid:Q10569 OrthoDB:EOG4BCDM3 NextBio:20806363
            ArrayExpress:Q10569 Uniprot:Q10569
        Length = 1444

 Score = 128 (50.1 bits), Expect = 4.1e-15, Sum P(6) = 4.1e-15
 Identities = 83/397 (20%), Positives = 167/397 (42%)

Query:   837 ENKYDPLS-DEQYGYPKAESDKWVSCIRVLDPRS--ANTTCLLELQDNEAAFSICTVNFH 893
             E +++ +  DE+Y +P+ E+     CI+++ P S  A     +EL++ E    + TV+  
Sbjct:  1055 EKEFETIERDERYVHPQQEA----FCIQLISPVSWEAIPNARIELEEWEHVTCMKTVSLR 1110

Query:   894 DKEHGTLLAVGTAKGLQFWPKRNIVA-GYIHIYRFVE----EGKSL-----ELLHKTQVE 943
              +E  + L    A G        +   G I I   +E     G+ L     ++L++ + +
Sbjct:  1111 SEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQK 1170

Query:   944 GIPLALCQFQGRLLAGIGPVLRLYDLGKKRL--LRKCENKLFPNTIVSINTYRDRIYVGD 1001
             G   ALC   G L++ IG  + L+ L    L  +   + +L+ + ++S+  +   I   D
Sbjct:  1171 GPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQLYIHQMISVKNF---ILAAD 1227

Query:  1002 IQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMAG---ADKFGNIY-FVRLPQ 1057
             + +S    +Y+ +   L + + D+ P  + +   +  +   G   +D+  N+  ++ LP+
Sbjct:  1228 VMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPE 1287

Query:  1058 DVSDEIEEDPTGGKIKWEQGKLN-GAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIY 1116
                    ++  GG     +   + GA   +    +           K S+V        +
Sbjct:  1288 ------AKESFGGMRLLRRADFHVGA--HVNTFWRTPCRGAAEGPSKKSVVWENKHITWF 1339

Query:  1117 GTVMGSLGAMLAFSSRD--DVDFFSHLEMHMRQEHPPLCGRD----HMAYRSAYFPVKDV 1170
              T+ G +G +L    +    +    +    M   H  L  R     H+  R     V++V
Sbjct:  1340 ATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVRNV 1399

Query:  1171 IDGDLCEQFPTLSLDLQRKIADELDRTPGEILKKLEE 1207
             +DG+L  ++  LS   + ++A ++  TP  IL  L E
Sbjct:  1400 LDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLE 1436

 Score = 97 (39.2 bits), Expect = 6.0e-12, Sum P(6) = 6.0e-12
 Identities = 67/358 (18%), Positives = 132/358 (36%)

Query:   695 KLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYE-TLEYAASFSSDQCVEGVVSVA 753
             + F  + G     +C  S  WL    RG   L P+  +  ++  A F +  C  G +   
Sbjct:   933 RYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFN 992

Query:   754 GNA-LRVFTIERL---GETFNETALPLRYTPRRFVLQPKKKLMVIIETDQGALTAEEREA 809
                 LR+  +         +    +PLR T        + K+  +  +     T   R  
Sbjct:   993 RQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTSTPCTRVPRMT 1052

Query:   810 AKKECFXXXXXXXXXXXXXXXXXXXXXENKYDPLSDEQYGYPKAESDKW--VSCIRVLDP 867
              +++ F                          P+S E     + E ++W  V+C++ +  
Sbjct:  1053 GEEKEFETIERDERYVHPQQEAFCIQL---ISPVSWEAIPNARIELEEWEHVTCMKTVSL 1109

Query:   868 RSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRF 927
             RS  T   L+      A   C +   +          T +G      R ++   I +   
Sbjct:  1110 RSEETVSGLK---GYVAAGTCLMQGEEV---------TCRG------RILIMDVIEVVP- 1150

Query:   928 VEEGKSL-----ELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDLGKKRL--LRKCEN 980
              E G+ L     ++L++ + +G   ALC   G L++ IG  + L+ L    L  +   + 
Sbjct:  1151 -EPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDT 1209

Query:   981 KLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDF 1038
             +L+ + ++S+  +   I   D+ +S    +Y+ +   L + + D+ P      + +DF
Sbjct:  1210 QLYIHQMISVKNF---ILAADVMKSISLLRYQEESKTLSLVSRDAKP---LEVYSVDF 1261

 Score = 91 (37.1 bits), Expect = 4.1e-15, Sum P(6) = 4.1e-15
 Identities = 24/93 (25%), Positives = 49/93 (52%)

Query:    44 ENSGRIETLVSTEIFGAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFD--KI 101
             E+  ++E + S   FG + S+A  +L G+++D +++     ++ ++EY+P  +      +
Sbjct:    64 EHREKLELVASFSFFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSL 123

Query:   102 H--QETFGKSGCRRIVPGQYLAVDPKGR-AVMI 131
             H  +E   + G  + V    + VDP GR A M+
Sbjct:   124 HYFEEPELRDGFVQNVHTPRVRVDPDGRCAAML 156

 Score = 90 (36.7 bits), Expect = 4.1e-15, Sum P(6) = 4.1e-15
 Identities = 18/54 (33%), Positives = 34/54 (62%)

Query:   471 YIVVSFNNATLVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDS-LMQVHPSGIR 523
             ++++S  ++T++L  G+ + E+  SGF    P++    IGD+  ++QV P GIR
Sbjct:   575 FLILSREDSTMILQTGQEIMELDASGFATQGPTVFAGNIGDNRYIVQVSPLGIR 628

 Score = 81 (33.6 bits), Expect = 4.1e-15, Sum P(6) = 4.1e-15
 Identities = 36/147 (24%), Positives = 65/147 (44%)

Query:   246 GVLVCAENFVIYKNQGHPD----VRAVIPRRADLP--AERGVLI-VSAATHRQKTLFFFL 298
             GV++ A N ++Y NQ  P     + ++       P   + GV I +  A     +    +
Sbjct:   280 GVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMV 339

Query:   299 LQTEYGDIFKVTLEHDNEHVSELKIKYFDTIP---VTASMCVLKSGYLFAASEFGNHALY 355
             +  + G+I+ +TL  D   +  ++  +FD      +T SM  ++ GYLF  S  GN  L 
Sbjct:   340 ISLKGGEIYVLTLITDG--MRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLL 397

Query:   356 QFQAIGADPDVEASSSTLMETEEGFQP 382
             ++     +P     +ST  E  +  +P
Sbjct:   398 KYTEKLQEPP----ASTAREAADKEEP 420

 Score = 48 (22.0 bits), Expect = 4.1e-15, Sum P(6) = 4.1e-15
 Identities = 13/48 (27%), Positives = 21/48 (43%)

Query:   420 QIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDE 467
             +I    G G   +L +L+  +    +   +LPG    +WTV   V  E
Sbjct:   501 EIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYD-MWTVIAPVRKE 547

 Score = 48 (22.0 bits), Expect = 2.2e-11, Sum P(5) = 2.2e-11
 Identities = 21/85 (24%), Positives = 42/85 (49%)

Query:   130 MIGACEKQKLVYVLNRDTAA--RLTISSPLEAHKSHTIVYSICGIDCGFDNPI-FAAIEL 186
             ++ A   Q  VY LNRD+ A  +   S+  +AH+ H     +      F N +  A+++L
Sbjct:    30 LVVAGTSQLYVYRLNRDSEAPTKNDRSTDGKAHREHREKLELVASFSFFGNVMSMASVQL 89

Query:   187 DYSEADQDSTGQAASEAQKNLTFYE 211
               + A +D+   +  +A+ ++  Y+
Sbjct:    90 --AGAKRDALLLSFKDAKLSVVEYD 112

 Score = 46 (21.3 bits), Expect = 1.6e-06, Sum P(4) = 1.6e-06
 Identities = 13/61 (21%), Positives = 31/61 (50%)

Query:   271 RRADLPAERGVLIVSAATHRQKTLFFFLLQTEYGDIFKVTLEHDNE-HVSELKIKYFDTI 329
             R+ +LP  + VL+V+  + +++   + L+  +   +      HD++     LK++ F  +
Sbjct:   840 RQGELPLVKEVLLVALGSRQRRP--YLLVHVDQELLIYEAFPHDSQLGQGNLKVR-FKKV 896

Query:   330 P 330
             P
Sbjct:   897 P 897

 Score = 37 (18.1 bits), Expect = 4.1e-15, Sum P(6) = 4.1e-15
 Identities = 12/42 (28%), Positives = 17/42 (40%)

Query:     1 MYLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLR 42
             MY        PTG+  ++  NF       +VVA    L + R
Sbjct:     1 MYAVYKQAHPPTGLEFSMYCNFFNNSERNLVVAGTSQLYVYR 42


>UNIPROTKB|F1PC28 [details] [associations]
            symbol:CPSF1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0006379 "mRNA cleavage" evidence=IEA]
            [GO:0006378 "mRNA polyadenylation" evidence=IEA] [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=IEA] [GO:0003730 "mRNA 3'-UTR binding" evidence=IEA]
            InterPro:IPR004871 Pfam:PF03178 GO:GO:0006378 GO:GO:0003730
            GO:GO:0005847 GO:GO:0006379 GeneTree:ENSGT00550000075040
            OMA:NIGDNRY EMBL:AAEX03008966 ProteinModelPortal:F1PC28
            Ensembl:ENSCAFT00000002514 Uniprot:F1PC28
        Length = 1398

 Score = 114 (45.2 bits), Expect = 6.3e-15, Sum P(5) = 6.3e-15
 Identities = 81/397 (20%), Positives = 166/397 (41%)

Query:   837 ENKYDPLS-DEQYGYPKAESDKWVSCIRVLDPRS--ANTTCLLELQDNEAAFSICTVNFH 893
             E +++ +  D++Y +P+ E+      I+++ P S  A     +EL++ E    + TV+  
Sbjct:  1009 EKEFETIERDDRYIHPQQEAFS----IQLISPVSWEAIPNARIELEEWEHVTCMKTVSLR 1064

Query:   894 DKEHGTLLAVGTAKGLQFWPKRNIVA-GYIHIYRFVE----EGKSL-----ELLHKTQVE 943
              +E  + L    A G        +   G I I   +E     G+ L     ++L++ + +
Sbjct:  1065 SEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQK 1124

Query:   944 GIPLALCQFQGRLLAGIGPVLRLYDLGKKRL--LRKCENKLFPNTIVSINTYRDRIYVGD 1001
             G   ALC   G L++ IG  + L+ L    L  +   + +L+ + ++S+  +   I   D
Sbjct:  1125 GPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQLYIHQMISVKNF---ILAAD 1181

Query:  1002 IQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMAG---ADKFGNIY-FVRLPQ 1057
             + +S    +Y+ +   L + + D+ P  + +   +  +   G   +D+  N+  ++ LP+
Sbjct:  1182 VMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPE 1241

Query:  1058 DVSDEIEEDPTGGKIKWEQGKLN-GAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIY 1116
                    ++  GG     +   + GA   +    +           K S+V        +
Sbjct:  1242 ------AKESFGGMRLLRRADFHVGA--HVNTFWRTPCRGAAEGPSKKSVVWENKHITWF 1293

Query:  1117 GTVMGSLGAMLAFSSRD--DVDFFSHLEMHMRQEHPPLCGRD----HMAYRSAYFPVKDV 1170
              T+ G +G +L    +    +    +    M   H  L  R     H+  R     V++V
Sbjct:  1294 ATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNV 1353

Query:  1171 IDGDLCEQFPTLSLDLQRKIADELDRTPGEILKKLEE 1207
             +DG+L  ++  LS   + ++A ++  TP  IL  L E
Sbjct:  1354 LDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLE 1390

 Score = 97 (39.2 bits), Expect = 3.6e-13, Sum P(5) = 3.6e-13
 Identities = 67/358 (18%), Positives = 132/358 (36%)

Query:   695 KLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYE-TLEYAASFSSDQCVEGVVSVA 753
             + F  + G     +C  S  WL    RG   L P+  +  ++  A F +  C  G +   
Sbjct:   887 RYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFN 946

Query:   754 GNA-LRVFTIERL---GETFNETALPLRYTPRRFVLQPKKKLMVIIETDQGALTAEEREA 809
                 LR+  +         +    +PLR T        + K+  +  +     T   R  
Sbjct:   947 RQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTNMPCTRIPRMT 1006

Query:   810 AKKECFXXXXXXXXXXXXXXXXXXXXXENKYDPLSDEQYGYPKAESDKW--VSCIRVLDP 867
              +++ F                          P+S E     + E ++W  V+C++ +  
Sbjct:  1007 GEEKEFETIERDDRYIHPQQEAFSIQL---ISPVSWEAIPNARIELEEWEHVTCMKTVSL 1063

Query:   868 RSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRF 927
             RS  T   L+      A   C +   +          T +G      R ++   I +   
Sbjct:  1064 RSEETVSGLK---GYVAAGTCLMQGEEV---------TCRG------RILIMDVIEVVP- 1104

Query:   928 VEEGKSL-----ELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDLGKKRL--LRKCEN 980
              E G+ L     ++L++ + +G   ALC   G L++ IG  + L+ L    L  +   + 
Sbjct:  1105 -EPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDT 1163

Query:   981 KLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDF 1038
             +L+ + ++S+  +   I   D+ +S    +Y+ +   L + + D+ P      + +DF
Sbjct:  1164 QLYIHQMISVKNF---ILAADVMKSISLLRYQEESKTLSLVSRDAKP---LEVYSVDF 1215

 Score = 95 (38.5 bits), Expect = 6.3e-15, Sum P(5) = 6.3e-15
 Identities = 31/149 (20%), Positives = 67/149 (44%)

Query:    44 ENSGRIETLVSTEIFGAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFD--KI 101
             E+  ++E + S   FG + S+A  +L G+++D +++     ++ ++EY+P  +      +
Sbjct:    18 EHREKLELVASFSFFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSL 77

Query:   102 H--QETFGKSGCRRIVPGQYLAVDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEA 159
             H  +E   + G  + V    + VDP GR   +     + +V    R++ A        E 
Sbjct:    78 HYFEEPELRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEG 137

Query:   160 HKSHTIVYSICGIDCGFDNPIFAAIELDY 188
              +S  +   I  +  G D  +   ++L +
Sbjct:   138 QRSSFLPSYIIDVR-GLDEKLLNIVDLQF 165

 Score = 91 (37.1 bits), Expect = 6.3e-15, Sum P(5) = 6.3e-15
 Identities = 18/54 (33%), Positives = 34/54 (62%)

Query:   471 YIVVSFNNATLVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDS-LMQVHPSGIR 523
             ++++S  ++T++L  G+ + E+  SGF    P++    IGD+  ++QV P GIR
Sbjct:   529 FLILSREDSTMILQTGQEIMELDTSGFATQGPTVFAGNIGDNRYIVQVSPLGIR 582

 Score = 83 (34.3 bits), Expect = 6.3e-15, Sum P(5) = 6.3e-15
 Identities = 34/134 (25%), Positives = 58/134 (43%)

Query:   246 GVLVCAENFVIYKNQGHPDVRAVI------PRRADLPAERGVLI-VSAATHRQKTLFFFL 298
             GV+V A N ++Y NQ  P     +           L  + GV I +  A     +    +
Sbjct:   234 GVVVFAVNSLLYLNQSVPPYGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMV 293

Query:   299 LQTEYGDIFKVTLEHDNEHVSELKIKYFDTIP---VTASMCVLKSGYLFAASEFGNHALY 355
             +  + G+I+ +TL  D   +  ++  +FD      +T SM  ++ GYLF  S  GN  L 
Sbjct:   294 ISLKGGEIYVLTLITDG--MRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLL 351

Query:   356 QFQAIGADPDVEAS 369
             ++     +P   A+
Sbjct:   352 KYTEKLQEPPASAA 365

 Score = 48 (22.0 bits), Expect = 6.3e-15, Sum P(5) = 6.3e-15
 Identities = 13/48 (27%), Positives = 21/48 (43%)

Query:   420 QIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDE 467
             +I    G G   +L +L+  +    +   +LPG    +WTV   V  E
Sbjct:   455 EIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYD-MWTVIAPVRKE 501

 Score = 45 (20.9 bits), Expect = 3.0e-06, Sum P(3) = 3.0e-06
 Identities = 13/61 (21%), Positives = 30/61 (49%)

Query:   271 RRADLPAERGVLIVSAATHRQKTLFFFLLQTEYGDIFKVTLEHDNE-HVSELKIKYFDTI 329
             R+ +LP  + VL+V+  + + +   + L+  +   +      HD++     LK++ F  +
Sbjct:   794 RQGELPLVKEVLLVALGSRQSRP--YLLVHVDQELLIYEAFPHDSQLGQGNLKVR-FKKV 850

Query:   330 P 330
             P
Sbjct:   851 P 851


>ZFIN|ZDB-GENE-040709-2 [details] [associations]
            symbol:cpsf1 "cleavage and polyadenylation specific
            factor 1" species:7955 "Danio rerio" [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            [GO:0006378 "mRNA polyadenylation" evidence=IMP] [GO:0060216
            "definitive hemopoiesis" evidence=IMP] InterPro:IPR004871
            Pfam:PF03178 ZFIN:ZDB-GENE-040709-2 GO:GO:0005634 GO:GO:0006378
            GO:GO:0003676 GeneTree:ENSGT00550000075040 GO:GO:0060216
            EMBL:CU467825 IPI:IPI00932321 Ensembl:ENSDART00000110017
            ArrayExpress:F1QCJ8 Bgee:F1QCJ8 Uniprot:F1QCJ8
        Length = 1451

 Score = 113 (44.8 bits), Expect = 1.6e-14, Sum P(7) = 1.6e-14
 Identities = 82/399 (20%), Positives = 172/399 (43%)

Query:   837 ENKYDPLS-DEQYGYPKAESDKWVSCIRVLDPRS--ANTTCLLELQDNEAAFSICTVNFH 893
             E +++ +  DE+Y +P  + DK+   I+++ P S  A     ++L++ E    + TV   
Sbjct:  1062 EKEFETIERDERYIHP--QQDKF--SIQLISPVSWEAIPNTRVDLEEWEHVTCMKTVALK 1117

Query:   894 DKE--HGT--LLAVGTA--KGLQFWPKRNIVAGYIHIYRFVEE-GKSL-----ELLHKTQ 941
              +E   G    +A+GT   +G +   +  I+   + +   V E G+ L     ++L++ +
Sbjct:  1118 SQETVSGLKGYVALGTCLMQGEEVTCRGRILI--LDVIEVVPEPGQPLTKNKFKVLYEKE 1175

Query:   942 VEGIPLALCQFQGRLLAGIGPVLRLYDLGKKRL--LRKCENKLFPNTIVSINTYRDRIYV 999
              +G   ALC   G L++ IG  + L+ L    L  +   + +L+ + + SI  +   I  
Sbjct:  1176 QKGPVTALCHCSGFLVSAIGQKIFLWSLKDNDLTGMAFIDTQLYIHQMYSIKNF---ILA 1232

Query:  1000 GDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMAG---ADKFGNIY-FVRL 1055
              D+ +S    +Y+ +   L + + D+ P  + +   +  +   G   +D+  N+  ++ L
Sbjct:  1233 ADVMKSISLLRYQPESKTLSLVSRDAKPLEVYSIEFMVDNNQLGFLVSDRDKNLMVYMYL 1292

Query:  1056 PQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVI 1115
             P+       ++  GG     +   N   + +    +      + +  K +L         
Sbjct:  1293 PE------AKESFGGMRLLRRADFN-VGSHVNAFWRMPCRGTLDTANKKALTWDNKHITW 1345

Query:  1116 YGTVMGSLGAMLAFSSRD--DVDFFSHLEMHMRQEHPPLCGRD----HMAYRSAYFPVKD 1169
             + T+ G +G +L    +    +    +    M   H  L  +     H   R+    VK+
Sbjct:  1346 FATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPKAFRMLHCDRRTLQNAVKN 1405

Query:  1170 VIDGDLCEQFPTLSLDLQRKIADELDRTPGEILKKLEEI 1208
             ++DG+L  ++  LS   + ++A ++  TP  IL  L EI
Sbjct:  1406 ILDGELLNKYLYLSTMERSELAKKIGTTPDIILDDLLEI 1444

 Score = 95 (38.5 bits), Expect = 1.6e-14, Sum P(7) = 1.6e-14
 Identities = 29/109 (26%), Positives = 56/109 (51%)

Query:    48 RIETLVSTEIFGAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFD--KIH--Q 103
             ++E + S  +FG + S+A  +L G+ +D +++     ++ ++EY+P  +      +H  +
Sbjct:    65 KLEQVASFSLFGNVMSMASVQLVGTNRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFE 124

Query:   104 ETFGKSGCRRIVPGQYLAVDPKGR-AVMI--GACEKQKLVYVLNRDTAA 149
             E   + G  + V    + VDP+ R AVM+  G C    +V    +DT A
Sbjct:   125 EPELRDGFVQNVHIPMVRVDPENRCAVMLVYGTC---LVVLPFRKDTLA 170

 Score = 93 (37.8 bits), Expect = 1.6e-14, Sum P(7) = 1.6e-14
 Identities = 18/54 (33%), Positives = 34/54 (62%)

Query:   471 YIVVSFNNATLVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDS-LMQVHPSGIR 523
             ++++S  ++T++L  G+ + E+  SGF    P++    IGD+  ++QV P GIR
Sbjct:   577 FLILSREDSTMILQTGQEIMELDTSGFATQGPTVYAGNIGDNKYIIQVSPMGIR 630

 Score = 77 (32.2 bits), Expect = 1.6e-14, Sum P(7) = 1.6e-14
 Identities = 39/164 (23%), Positives = 66/164 (40%)

Query:   226 PVDNGANMLVTVPGGGDGPSGVLVCAENFVIYKNQGHPDVRAVIPRRAD-------LPAE 278
             P D    M V  P GG     V+V A N ++Y NQ  P     +    +        P E
Sbjct:   262 PFDCNQVMAVPKPIGG-----VVVFAVNSLLYLNQSVPPFGVSLNSLTNGTTAFPLRPQE 316

Query:   279 RGVLIVSAATHRQKTLFFFLLQTEYGDIFKVTLEHDNEHVSELKIKYFDTIP---VTASM 335
                + +  +     T    ++  + G+I+ +TL  D   +  ++  +FD      +T  M
Sbjct:   317 EVKITLDCSQASFITSDKMVISLKGGEIYVLTLITDG--MRSVRAFHFDKAAASVLTTCM 374

Query:   336 CVLKSGYLFAASEFGNHALYQF-QAIGADPDVEASSSTLMETEE 378
               ++ GYLF  S  GN  L ++ + +   P  E   +   E +E
Sbjct:   375 MTMEPGYLFLGSRLGNSLLLRYTEKLQETPMEEGKENEEKEKQE 418

 Score = 46 (21.3 bits), Expect = 1.6e-14, Sum P(7) = 1.6e-14
 Identities = 13/29 (44%), Positives = 15/29 (51%)

Query:   716 LGYIHRGRFLLTPLSYETLEYAASFSSDQ 744
             LGY H   +LL  +  E L Y A F  DQ
Sbjct:   862 LGYNHSRPYLLAHVEQELLIYEA-FPYDQ 889

 Score = 46 (21.3 bits), Expect = 1.6e-14, Sum P(7) = 1.6e-14
 Identities = 10/41 (24%), Positives = 19/41 (46%)

Query:   420 QIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTV 460
             ++    G G   +L +L+  +    +   +LPG    +WTV
Sbjct:   501 EVVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCHD-MWTV 540

 Score = 40 (19.1 bits), Expect = 1.6e-14, Sum P(7) = 1.6e-14
 Identities = 12/42 (28%), Positives = 18/42 (42%)

Query:     1 MYLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLR 42
             MY        PT +  A+  NF  ++   +VVA    L + R
Sbjct:     1 MYAVYRQAHPPTAVEFAVYCNFISSQEKNLVVAGTSQLYVYR 42


>MGI|MGI:2679722 [details] [associations]
            symbol:Cpsf1 "cleavage and polyadenylation specific factor
            1" species:10090 "Mus musculus" [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0003723 "RNA binding" evidence=IEA] [GO:0003730
            "mRNA 3'-UTR binding" evidence=ISO] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=ISO] [GO:0006378 "mRNA
            polyadenylation" evidence=ISO] [GO:0006379 "mRNA cleavage"
            evidence=ISO] [GO:0006397 "mRNA processing" evidence=IEA]
            InterPro:IPR004871 Pfam:PF03178 MGI:MGI:2679722 GO:GO:0006378
            GO:GO:0003730 eggNOG:COG5161 KO:K14401 GO:GO:0005847 GO:GO:0006379
            GeneTree:ENSGT00550000075040 OMA:NIGDNRY CTD:29894
            HOGENOM:HOG000007904 HOVERGEN:HBG051105 OrthoDB:EOG4BCDM3
            EMBL:AF322193 EMBL:BC056388 IPI:IPI00110363 RefSeq:NP_001157645.1
            RefSeq:NP_444423.1 UniGene:Mm.45141 ProteinModelPortal:Q9EPU4
            STRING:Q9EPU4 PhosphoSite:Q9EPU4 PaxDb:Q9EPU4 PRIDE:Q9EPU4
            Ensembl:ENSMUST00000071898 GeneID:94230 KEGG:mmu:94230
            UCSC:uc007wky.2 InParanoid:Q9EPU4 NextBio:352239 Bgee:Q9EPU4
            CleanEx:MM_CPSF1 Genevestigator:Q9EPU4
            GermOnline:ENSMUSG00000034022 Uniprot:Q9EPU4
        Length = 1441

 Score = 115 (45.5 bits), Expect = 5.7e-14, Sum P(5) = 5.7e-14
 Identities = 81/397 (20%), Positives = 166/397 (41%)

Query:   837 ENKYDPLS-DEQYGYPKAESDKWVSCIRVLDPRS--ANTTCLLELQDNEAAFSICTVNFH 893
             E +++ +  D++Y +P+ E+      I+++ P S  A     +EL++ E    + TV+  
Sbjct:  1052 EKEFEAIERDDRYIHPQQEAFS----IQLISPVSWEAIPNARIELEEWEHVTCMKTVSLR 1107

Query:   894 DKEHGTLLAVGTAKGLQFWPKRNIVA-GYIHIYRFVE----EGKSL-----ELLHKTQVE 943
              +E  + L    A G        +   G I I   +E     G+ L     ++L++ + +
Sbjct:  1108 SEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQK 1167

Query:   944 GIPLALCQFQGRLLAGIGPVLRLYDLGKKRL--LRKCENKLFPNTIVSINTYRDRIYVGD 1001
             G   ALC   G L++ IG  + L+ L    L  +   + +L+ + ++S+  +   I   D
Sbjct:  1168 GPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQLYIHQMISVKNF---ILAAD 1224

Query:  1002 IQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMAG---ADKFGNIY-FVRLPQ 1057
             + +S    +Y+ +   L + + D+ P  + +   +  +   G   +D+  N+  ++ LP+
Sbjct:  1225 VMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPE 1284

Query:  1058 DVSDEIEEDPTGGKIKWEQGKLN-GAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIY 1116
                    ++  GG     +   + GA   +    +           K S+V        +
Sbjct:  1285 ------AKESFGGMRLLRRADFHVGA--HVNTFWRTPCRGAAEGPSKKSVVWENKHITWF 1336

Query:  1117 GTVMGSLGAMLAFSSRD--DVDFFSHLEMHMRQEHPPLCGRD----HMAYRSAYFPVKDV 1170
              T+ G +G +L    +    +    +    M   H  L  R     H+  R     V++V
Sbjct:  1337 ATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNV 1396

Query:  1171 IDGDLCEQFPTLSLDLQRKIADELDRTPGEILKKLEE 1207
             +DG+L  ++  LS   + ++A ++  TP  IL  L E
Sbjct:  1397 LDGELLNRYLYLSTMERSELAKKIGTTPDIILDDLLE 1433

 Score = 98 (39.6 bits), Expect = 3.1e-12, Sum P(5) = 3.1e-12
 Identities = 67/358 (18%), Positives = 132/358 (36%)

Query:   695 KLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYE-TLEYAASFSSDQCVEGVVSVA 753
             + F  + G     +C  S  WL    RG   L P+  +  ++  A F +  C  G +   
Sbjct:   930 RYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFN 989

Query:   754 GNA-LRVFTIERL---GETFNETALPLRYTPRRFVLQPKKKLMVIIETDQGALTAEEREA 809
                 LR+  +         +    +PLR T        + K+  +  +     T   R  
Sbjct:   990 RQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCTRIPRMT 1049

Query:   810 AKKECFXXXXXXXXXXXXXXXXXXXXXENKYDPLSDEQYGYPKAESDKW--VSCIRVLDP 867
              +++ F                          P+S E     + E ++W  V+C++ +  
Sbjct:  1050 GEEKEFEAIERDDRYIHPQQEAFSIQL---ISPVSWEAIPNARIELEEWEHVTCMKTVSL 1106

Query:   868 RSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRF 927
             RS  T   L+      A   C +   +          T +G      R ++   I +   
Sbjct:  1107 RSEETVSGLK---GYVAAGTCLMQGEEV---------TCRG------RILIMDVIEVVP- 1147

Query:   928 VEEGKSL-----ELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDLGKKRL--LRKCEN 980
              E G+ L     ++L++ + +G   ALC   G L++ IG  + L+ L    L  +   + 
Sbjct:  1148 -EPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDT 1206

Query:   981 KLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDF 1038
             +L+ + ++S+  +   I   D+ +S    +Y+ +   L + + D+ P      + +DF
Sbjct:  1207 QLYIHQMISVKNF---ILAADVMKSISLLRYQEESKTLSLVSRDAKP---LEVYSVDF 1258

 Score = 91 (37.1 bits), Expect = 5.7e-14, Sum P(5) = 5.7e-14
 Identities = 18/54 (33%), Positives = 34/54 (62%)

Query:   471 YIVVSFNNATLVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDS-LMQVHPSGIR 523
             ++++S  ++T++L  G+ + E+  SGF    P++    IGD+  ++QV P GIR
Sbjct:   572 FLILSREDSTMILQTGQEIMELDTSGFATQGPTVFAGNIGDNRYIVQVSPLGIR 625

 Score = 88 (36.0 bits), Expect = 5.7e-14, Sum P(5) = 5.7e-14
 Identities = 23/89 (25%), Positives = 47/89 (52%)

Query:    48 RIETLVSTEIFGAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFD--KIH--Q 103
             ++E + S   FG + S+A  +L G+++D +++     ++ ++EY+P  +      +H  +
Sbjct:    65 KLELVASFSFFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFE 124

Query:   104 ETFGKSGCRRIVPGQYLAVDPKGR-AVMI 131
             E   + G  + V    + VDP GR A M+
Sbjct:   125 EPELRDGFVQNVHTPRVRVDPDGRCAAML 153

 Score = 80 (33.2 bits), Expect = 5.7e-14, Sum P(5) = 5.7e-14
 Identities = 31/122 (25%), Positives = 56/122 (45%)

Query:   246 GVLVCAENFVIYKNQGHPD----VRAVIPRRADLP--AERGVLI-VSAATHRQKTLFFFL 298
             GV++ A N ++Y NQ  P     + ++       P   + GV I +  A     +    +
Sbjct:   277 GVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMV 336

Query:   299 LQTEYGDIFKVTLEHDNEHVSELKIKYFDTIP---VTASMCVLKSGYLFAASEFGNHALY 355
             +  + G+I+ +TL  D   +  ++  +FD      +T SM  ++ GYLF  S  GN  L 
Sbjct:   337 ISLKGGEIYVLTLITDG--MRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLL 394

Query:   356 QF 357
             ++
Sbjct:   395 KY 396

 Score = 48 (22.0 bits), Expect = 5.7e-14, Sum P(5) = 5.7e-14
 Identities = 13/48 (27%), Positives = 21/48 (43%)

Query:   420 QIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDE 467
             +I    G G   +L +L+  +    +   +LPG    +WTV   V  E
Sbjct:   498 EIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYD-MWTVIAPVRKE 544

 Score = 45 (20.9 bits), Expect = 1.3e-05, Sum P(3) = 1.3e-05
 Identities = 13/61 (21%), Positives = 30/61 (49%)

Query:   271 RRADLPAERGVLIVSAATHRQKTLFFFLLQTEYGDIFKVTLEHDNE-HVSELKIKYFDTI 329
             R+ +LP  + VL+V+  + + +   + L+  +   +      HD++     LK++ F  +
Sbjct:   837 RQGELPLVKEVLLVALGSRQSRP--YLLVHVDQELLIYEAFPHDSQLGQGNLKVR-FKKV 893

Query:   330 P 330
             P
Sbjct:   894 P 894

 Score = 44 (20.5 bits), Expect = 1.2e-09, Sum P(5) = 1.2e-09
 Identities = 21/83 (25%), Positives = 39/83 (46%)

Query:   130 MIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSICGIDCGFDNPI-FAAIELDY 188
             ++ A   Q  VY LNRD  A LT +      K+H     +      F N +  A+++L  
Sbjct:    30 LVVAGTSQLYVYRLNRDAEA-LTKNDGSTEGKAHREKLELVASFSFFGNVMSMASVQL-- 86

Query:   189 SEADQDSTGQAASEAQKNLTFYE 211
             + A +D+   +  +A+ ++  Y+
Sbjct:    87 AGAKRDALLLSFKDAKLSVVEYD 109


>UNIPROTKB|F5H775 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9606
            "Homo sapiens" [GO:0016055 "Wnt receptor signaling pathway"
            evidence=IEA] GO:GO:0016055 GO:GO:0003684 EMBL:AP003108
            HGNC:HGNC:2717 ChiTaRS:DDB1 EMBL:AP003037 IPI:IPI01015574
            ProteinModelPortal:F5H775 SMR:F5H775 Ensembl:ENST00000537877
            ArrayExpress:F5H775 Bgee:F5H775 Uniprot:F5H775
        Length = 240

 Score = 187 (70.9 bits), Expect = 3.2e-13, P = 3.2e-13
 Identities = 65/238 (27%), Positives = 110/238 (46%)

Query:   486 GETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHIREDGR--INEWRTPGKRTIV 543
             GE VEE    GF+D   +     +    L+Q+  + +R + ++ +  ++EW+ P  + I 
Sbjct:     4 GEEVEETELMGFVDDQQTFFCGNVAHQQLIQITSASVRLVSQEPKALVSEWKEPQAKNIS 63

Query:   544 KVGSNRLQVVIALSGGELIYFEVDMTGQLLEVEKHEMSGDVACLDIASVPEGRKRSRFLA 603
                 N  QVV+A+ G  L Y ++    +L ++   EM  +VACLDI  + +    S   A
Sbjct:    64 VASCNSSQVVVAV-GRALYYLQIHPQ-ELRQISHTEMEHEVACLDITPLGDSNGLSPLCA 121

Query:   604 VGSY-DNTIRILSLDPDDCMQILSVQSVSSP--PESLLFLEVQASVGGEDGADHPASLFL 660
             +G + D + RIL L P    ++L  + +     P S+L    ++S        H    +L
Sbjct:   122 IGLWTDISARILKL-PS--FELLHKEMLGGEIIPRSILMTTFESS--------H----YL 166

Query:   661 NAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGY 718
                L +G LF   +++ TG LSD +   LG +P  L +        +   S RP + Y
Sbjct:   167 LCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVIY 224


>POMBASE|SPAC17H9.10c [details] [associations]
            symbol:ddb1 "damaged DNA binding protein Ddb1"
            species:4896 "Schizosaccharomyces pombe" [GO:0003677 "DNA binding"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IDA] [GO:0005730
            "nucleolus" evidence=IDA] [GO:0005829 "cytosol" evidence=IDA]
            [GO:0006279 "premeiotic DNA replication" evidence=TAS] [GO:0006282
            "regulation of DNA repair" evidence=IMP] [GO:0006283
            "transcription-coupled nucleotide-excision repair" evidence=IMP]
            [GO:0006974 "response to DNA damage stimulus" evidence=IMP]
            [GO:0007090 "regulation of S phase of mitotic cell cycle"
            evidence=IMP] [GO:0034644 "cellular response to UV" evidence=IMP]
            [GO:0040020 "regulation of meiosis" evidence=IGI] [GO:0042787
            "protein ubiquitination involved in ubiquitin-dependent protein
            catabolic process" evidence=IMP] [GO:0051445 "regulation of meiotic
            cell cycle" evidence=IGI] [GO:0070912 "Ddb1-Ckn1 complex"
            evidence=IDA] [GO:0070913 "Ddb1-Wdr21 complex" evidence=IDA]
            [GO:0008180 "signalosome" evidence=IDA] [GO:0031465 "Cul4B-RING
            ubiquitin ligase complex" evidence=IDA] InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 UniPathway:UPA00143
            PomBase:SPAC17H9.10c GO:GO:0005829 EMBL:CU329670 GO:GO:0005730
            GenomeReviews:CU329670_GR Gene3D:2.130.10.10 GO:GO:0003677
            GO:GO:0007049 InterPro:IPR011047 SUPFAM:SSF50998 GO:GO:0034644
            GO:GO:0040020 GO:GO:0042787 GO:GO:0007090 GO:GO:0006283
            GO:GO:0006282 GO:GO:0006279 GO:GO:0070912 eggNOG:NOG247734
            KO:K10610 OMA:CALGDGS PIR:T37876 RefSeq:NP_593580.1 IntAct:O13807
            STRING:O13807 EnsemblFungi:SPAC17H9.10c.1 GeneID:2542207
            KEGG:spo:SPAC17H9.10c OrthoDB:EOG473T0C NextBio:20803277
            GO:GO:0070913 Uniprot:O13807
        Length = 1072

 Score = 117 (46.2 bits), Expect = 6.3e-12, Sum P(5) = 6.3e-12
 Identities = 36/129 (27%), Positives = 61/129 (47%)

Query:     4 YSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAIRS 63
             Y   L +P+ I  A+   F    +  ++VA+   LE+   EN+ R+  + S  IF  I +
Sbjct:     3 YVTYLHKPSSIRNAVFCKFVNASSWNVIVAKVNCLEVYSYENN-RLCLITSANIFAKIVN 61

Query:    64 LAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKN-VFDKIHQETFGKSGCRRIVPGQYLAV 122
             +  F+   S  D+I+V +DS R   L ++ + N V + I  +   +   R    G  L V
Sbjct:    62 VKAFKPVSSPTDHIIVATDSFRYFTLFWDANDNTVSNGIKIQDCSERSLRESQSGPLLLV 121

Query:   123 DPKGRAVMI 131
             DP  R + +
Sbjct:   122 DPFQRVICL 130

 Score = 108 (43.1 bits), Expect = 6.3e-12, Sum P(5) = 6.3e-12
 Identities = 48/207 (23%), Positives = 98/207 (47%)

Query:   387 PRGLKNLVRIEQVESLM---PIMDMRIANLFEEEAPQIFTLCGRGPRSSLRILRPGLAVS 443
             P   KN  ++E +++ +   PI D  I +  ++    I T  G     +LRI+R  + + 
Sbjct:   339 PSITKNNHKLEILQNFVNIAPISDFIIDD--DQTGSSIITCSGAYKDGTLRIIRNSINIE 396

Query:   444 EMAVSQLPGVPSAVWTVKKNVNDEFDAYIVVSFNNATLVLSIG-ETVEEVSDSGFLDTTP 502
              +A+ ++ G+    ++V    N  +D YI +S    T  + +  E V   +     + + 
Sbjct:   397 NVALIEMEGIKD-FFSVSFRAN--YDNYIFLSLICETRAIIVSPEGVFSANHDLSCEEST 453

Query:   503 SLAVSLIGDDSLMQVHPSGIRHIREDGR-INEWRTPGKRTIVKVGSNRLQ-VVIALSGGE 560
                 ++ G+  ++Q+    IR    DG+ ++ W +P   T    GS+    V +A++GG 
Sbjct:   454 IFVSTIYGNSQILQITTKEIRLF--DGKKLHSWISPMSITC---GSSFADNVCVAVAGGL 508

Query:   561 LIYFEVDMTGQLLEVEKHEMSGDVACL 587
             +++FE    G + EV +++   +V+ L
Sbjct:   509 ILFFE----G-ITEVGRYQCDTEVSSL 530

 Score = 65 (27.9 bits), Expect = 6.3e-12, Sum P(5) = 6.3e-12
 Identities = 35/191 (18%), Positives = 78/191 (40%)

Query:   620 DCMQILSVQSVSSPPESLLFLEVQAS-VGGEDGADHPASLFLNAGLQNGVLFRTVVDMVT 678
             D + +   Q   S   SL   ++  S V  +   D   +L+++    NG +   + +   
Sbjct:   546 DIIMLTYCQDGISLTHSLKLTDIPRSIVYSQKYGDDGGTLYVSTN--NGYVL--MFNFQN 601

Query:   679 GQLSDS--RSRFLGLRPPKLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEY 736
             GQ+ +   R   LG+ P  L       + A+  L  +P L Y    + ++TPLS   +  
Sbjct:   602 GQVIEHSLRRNQLGVAPIILKHFDSKEKNAIFALGEKPQLMYYESDKLVITPLSCTEMLN 661

Query:   737 AASFSSDQCVEGVVSVAGNALRVFTIERLGETFNETALPLRYTPRRFVLQPKKKLMVIIE 796
              +S+ +      ++    + + +  +  +  + N   + ++  PRR         ++ ++
Sbjct:   662 ISSYVNPSLGVNMLYCTNSYISLAKMSEI-RSLNVQTVSVKGFPRRICSNSLFYFVLCMQ 720

Query:   797 TDQGALTAEER 807
              ++   T E+R
Sbjct:   721 LEESIGTQEQR 731

 Score = 60 (26.2 bits), Expect = 6.3e-12, Sum P(5) = 6.3e-12
 Identities = 15/64 (23%), Positives = 30/64 (46%)

Query:   297 FLLQTEYGDIFKVTLEHDNEHVSELKIKYFDTIPVTASMCVLKSGYLFAASEFGNHALYQ 356
             +++  E G ++K      +E VS ++++      + + +  L   +LF  S F N  L Q
Sbjct:   279 YIVADESGMLYKFKALFTDETVS-MELEKLGESSIASCLIALPDNHLFVGSHFNNSVLLQ 337

Query:   357 FQAI 360
               +I
Sbjct:   338 LPSI 341

 Score = 50 (22.7 bits), Expect = 2.9e-10, Sum P(4) = 2.9e-10
 Identities = 13/29 (44%), Positives = 17/29 (58%)

Query:   675 DMVTGQLSDSRSRFLGLRPPKLFSVVVGG 703
             D++ G L +S    LGLR P L  +V GG
Sbjct:  1022 DLIDGSLIES---ILGLREPILNEIVNGG 1047

 Score = 44 (20.5 bits), Expect = 6.3e-12, Sum P(5) = 6.3e-12
 Identities = 11/22 (50%), Positives = 14/22 (63%)

Query:   860 SCIRVLDPRSANTTCLLELQDN 881
             S + V D   +NT  LL+LQDN
Sbjct:   972 SLMIVGDAGMSNTPLLLQLQDN 993


>RGD|1306406 [details] [associations]
            symbol:Cpsf1 "cleavage and polyadenylation specific factor 1,
            160kDa" species:10116 "Rattus norvegicus" [GO:0003730 "mRNA 3'-UTR
            binding" evidence=IEA;ISO] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA;ISO]
            [GO:0006378 "mRNA polyadenylation" evidence=IEA;ISO] [GO:0006379
            "mRNA cleavage" evidence=IEA;ISO] InterPro:IPR004871 Pfam:PF03178
            RGD:1306406 GO:GO:0005634 GO:GO:0003676 EMBL:CH473950 KO:K14401
            GeneTree:ENSGT00550000075040 CTD:29894 IPI:IPI00949657
            RefSeq:NP_001124043.1 UniGene:Rn.40455 Ensembl:ENSRNOT00000066244
            GeneID:366952 KEGG:rno:366952 UCSC:RGD:1306406 NextBio:690318
            Uniprot:D4A0H5
        Length = 1386

 Score = 91 (37.1 bits), Expect = 1.9e-10, Sum P(7) = 1.9e-10
 Identities = 18/54 (33%), Positives = 34/54 (62%)

Query:   471 YIVVSFNNATLVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDS-LMQVHPSGIR 523
             ++++S  ++T++L  G+ + E+  SGF    P++    IGD+  ++QV P GIR
Sbjct:   568 FLILSREDSTMILQTGQEIMELDTSGFATQGPTVFAGNIGDNRYIVQVSPLGIR 621

 Score = 88 (36.0 bits), Expect = 1.9e-10, Sum P(7) = 1.9e-10
 Identities = 23/89 (25%), Positives = 47/89 (52%)

Query:    48 RIETLVSTEIFGAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFD--KIH--Q 103
             ++E + S   FG + S+A  +L G+++D +++     ++ ++EY+P  +      +H  +
Sbjct:    65 KLELVASFSFFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFE 124

Query:   104 ETFGKSGCRRIVPGQYLAVDPKGR-AVMI 131
             E   + G  + V    + VDP GR A M+
Sbjct:   125 EPELRDGFVQNVHTPRVRVDPDGRCAAML 153

 Score = 80 (33.2 bits), Expect = 1.9e-10, Sum P(7) = 1.9e-10
 Identities = 31/122 (25%), Positives = 56/122 (45%)

Query:   246 GVLVCAENFVIYKNQGHPD----VRAVIPRRADLP--AERGVLI-VSAATHRQKTLFFFL 298
             GV++ A N ++Y NQ  P     + ++       P   + GV I +  A     +    +
Sbjct:   277 GVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMV 336

Query:   299 LQTEYGDIFKVTLEHDNEHVSELKIKYFDTIP---VTASMCVLKSGYLFAASEFGNHALY 355
             +  + G+I+ +TL  D   +  ++  +FD      +T SM  ++ GYLF  S  GN  L 
Sbjct:   337 ISLKGGEIYVLTLITDG--MRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLL 394

Query:   356 QF 357
             ++
Sbjct:   395 KY 396

 Score = 59 (25.8 bits), Expect = 1.9e-10, Sum P(7) = 1.9e-10
 Identities = 16/51 (31%), Positives = 28/51 (54%)

Query:  1157 HMAYRSAYFPVKDVIDGDLCEQFPTLSLDLQRKIADELDRTPGEILKKLEE 1207
             H+  R     V++V+DG+L  ++  LS   + ++A ++  TP  IL  L E
Sbjct:  1328 HVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDLLE 1378

 Score = 55 (24.4 bits), Expect = 1.9e-10, Sum P(7) = 1.9e-10
 Identities = 34/185 (18%), Positives = 62/185 (33%)

Query:   695 KLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYE-TLEYAASFSSDQCVEGVVSVA 753
             + F  + G     +C  S  WL    RG   L P+  +  ++  A F +  C  G +   
Sbjct:   926 RYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFN 985

Query:   754 GNA-LRVFTIERL---GETFNETALPLRYTPRRFVLQPKKKLMVIIETDQGALTAEEREA 809
                 LR+  +         +    +PLR T        + K+  +  +     T   R  
Sbjct:   986 RQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCTRIPRMT 1045

Query:   810 AKKECFXXXXXXXXXXXXXXXXXXXXXENKYDPLSDEQYGYPKAESDKW--VSCIRVLDP 867
              +++ F                          P+S E     + E ++W  V+C++ +  
Sbjct:  1046 GEEKEFEAIERDDRYIHPQQEAFSIQL---ISPVSWEAIPNARIELEEWEHVTCMKTVSL 1102

Query:   868 RSANT 872
             RS  T
Sbjct:  1103 RSEET 1107

 Score = 54 (24.1 bits), Expect = 1.9e-10, Sum P(7) = 1.9e-10
 Identities = 18/61 (29%), Positives = 27/61 (44%)

Query:   411 ANLFEEEAPQ----IFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVND 466
             A L EE +P+    I    G G   +L +L+  +    +   +LPG    +WTV   V  
Sbjct:   481 AFLSEENSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYD-MWTVIAPVRK 539

Query:   467 E 467
             E
Sbjct:   540 E 540

 Score = 44 (20.5 bits), Expect = 2.8e-06, Sum P(6) = 2.8e-06
 Identities = 21/83 (25%), Positives = 39/83 (46%)

Query:   130 MIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSICGIDCGFDNPI-FAAIELDY 188
             ++ A   Q  VY LNRD  A LT +      K+H     +      F N +  A+++L  
Sbjct:    30 LVVAGTSQLYVYRLNRDAEA-LTKNDGSTEGKAHREKLELVASFSFFGNVMSMASVQL-- 86

Query:   189 SEADQDSTGQAASEAQKNLTFYE 211
             + A +D+   +  +A+ ++  Y+
Sbjct:    87 AGAKRDALLLSFKDAKLSVVEYD 109

 Score = 40 (19.1 bits), Expect = 1.9e-10, Sum P(7) = 1.9e-10
 Identities = 13/42 (30%), Positives = 17/42 (40%)

Query:     1 MYLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLR 42
             MY        PTG+  A+  NF       +VVA    L + R
Sbjct:     1 MYAVYKQAHPPTGLEFAMYCNFFNNSERNLVVAGTSQLYVYR 42


>UNIPROTKB|F5H0Y5 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9606
            "Homo sapiens" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0016055 "Wnt receptor
            signaling pathway" evidence=IEA] InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 GO:GO:0005634 GO:GO:0016055
            Gene3D:2.130.10.10 GO:GO:0003684 EMBL:AP003108 HGNC:HGNC:2717
            ChiTaRS:DDB1 EMBL:AP003037 IPI:IPI00909177
            ProteinModelPortal:F5H0Y5 SMR:F5H0Y5 Ensembl:ENST00000539332
            ArrayExpress:F5H0Y5 Bgee:F5H0Y5 Uniprot:F5H0Y5
        Length = 204

 Score = 158 (60.7 bits), Expect = 3.9e-10, P = 3.9e-10
 Identities = 59/210 (28%), Positives = 104/210 (49%)

Query:   920 GYIHIYRFVEEGKSLELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDLGKKRLLR-KC 978
             G I ++++  +GK L+ + + +V+G   ++ +F G+LLA I   +RLY+   ++ LR +C
Sbjct:    12 GRIVVFQY-SDGK-LQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTTEKELRTEC 69

Query:   979 ENKLFPNTIVSI--NTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHI 1036
              +    N I+++   T  D I VGD+  S     Y+  E      A D  P W++A   +
Sbjct:    70 NHY---NNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEIL 126

Query:  1037 DFDTMAGADKFGNIYFVRLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKMEEIVQF--HV 1094
             D D   GA+   N++  +  +D +   +E+    +   E G  +     + E V    H 
Sbjct:   127 DDDNFLGAENAFNLFVCQ--KDSAATTDEERQHLQ---EVGLFH-----LGEFVNVFCHG 176

Query:  1095 GDVVTSLQKASLVPGGGESVIYGTVMGSLG 1124
               V+ +L + S  P  G SV++GTV G +G
Sbjct:   177 SLVMQNLGETS-TPTQG-SVLFGTVNGMIG 204


>DICTYBASE|DDB_G0281585 [details] [associations]
            symbol:cpsf1 "cleavage and polyadenylation
            specificity factor 160 kDa subunit" species:44689 "Dictyostelium
            discoideum" [GO:0005634 "nucleus" evidence=IEA] [GO:0003676
            "nucleic acid binding" evidence=IEA] [GO:0006379 "mRNA cleavage"
            evidence=ISS] [GO:0006378 "mRNA polyadenylation" evidence=ISS]
            [GO:0005847 "mRNA cleavage and polyadenylation specificity factor
            complex" evidence=ISS] [GO:0003730 "mRNA 3'-UTR binding"
            evidence=ISS] InterPro:IPR004871 Pfam:PF03178
            dictyBase:DDB_G0281585 GenomeReviews:CM000152_GR GO:GO:0006378
            EMBL:AAFI02000042 GO:GO:0003730 eggNOG:COG5161 KO:K14401
            GO:GO:0005847 GO:GO:0006379 RefSeq:XP_640515.1
            EnsemblProtists:DDB0233702 GeneID:8623125 KEGG:ddi:DDB_G0281585
            InParanoid:Q54TS6 OMA:TSATIQD Uniprot:Q54TS6
        Length = 1628

 Score = 94 (38.1 bits), Expect = 1.1e-09, Sum P(5) = 1.1e-09
 Identities = 54/192 (28%), Positives = 88/192 (45%)

Query:   851 PKAESDKWVSCIRVLDPR-SANTTCL--LELQDNEA--AFSICTVNFHDKEHGT----LL 901
             P    DK+   I+++DP    N   +    LQD E   A  I ++ F + +  T     L
Sbjct:  1263 PILTDDKFQ--IKLIDPTIDWNWKFIDSFSLQDRETVLAMKIVSLKFTEPDGITRARPFL 1320

Query:   902 AVGTAK--GLQFWPKRNIVAGYI--HIYRFVEE--G-KSLELLHKTQVEGIPLALCQFQG 954
              +GTA   G     K  ++   I  H  +F  E  G K L LL++ + +G   AL    G
Sbjct:  1321 VIGTAFTFGEDTQCKGRVLVFEIVSHKTQFESEELGEKRLNLLYEKEQKGPVTALSSVNG 1380

Query:   955 RLLAGIGPVLRL--YDLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFCKYR 1012
              LL  IGP L +  +  G    L   + +++   I SI T ++ I +GD+ +S +F +++
Sbjct:  1381 LLLMTIGPKLTVNQFYTGSLVTLSFYDAQIY---ICSICTIKNYIVIGDMYKSVYFLQWK 1437

Query:  1013 RDENQLYIFADD 1024
              D   L + + D
Sbjct:  1438 -DNKTLNLLSKD 1448

 Score = 79 (32.9 bits), Expect = 1.1e-09, Sum P(5) = 1.1e-09
 Identities = 28/106 (26%), Positives = 53/106 (50%)

Query:    32 VARGKVLELLRPENSGRIETLVSTEIFGAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEY 91
             + + K +EL +P     +E ++  ++FG I S+A  R   S++D +++     +I +L+Y
Sbjct:    75 ITQKKKIEL-KPS----LELIIEKKLFGNIESMASVRYPNSERDSLILTFRDAKISVLDY 129

Query:    92 NPSKNVFD--KIH---QETFGKSGCRRIVPGQYLAVDPKGR-AVMI 131
             +     F+   +H   ++ F K G         L VD + R AVM+
Sbjct:   130 DSDLLDFEIRSLHYFEKDEF-KGGRNHFKHPPLLKVDTQQRCAVML 174

 Score = 78 (32.5 bits), Expect = 1.1e-09, Sum P(5) = 1.1e-09
 Identities = 29/116 (25%), Positives = 56/116 (48%)

Query:  1107 VPGGGESVIYGTVMGSLGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDHMAYRSA--- 1163
             +P   + VI+GT+ G L  +     +  + F+ H++  +    P   G +   YRS    
Sbjct:  1508 LPKKEQLVIFGTLDGGLNVLRPLDEKIYLLFY-HIQSKLYYL-PQTAGLNPKQYRSFKSF 1565

Query:  1164 -----YFPV------KDVIDGDLCEQFPTLSLDLQRKIADELDRTPGEILKKLEEI 1208
                  + P       K ++DGDL  +F +LS   +R I++ ++ T  EI++ L+++
Sbjct:  1566 SQNFHFSPSTFHQLPKFILDGDLISKFLSLSQSEKRLISNSINSTSDEIIESLKDV 1621

 Score = 66 (28.3 bits), Expect = 1.1e-09, Sum P(5) = 1.1e-09
 Identities = 20/72 (27%), Positives = 35/72 (48%)

Query:   400 ESLMPIMDMRIANLFEEEAP---QIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSA 456
             +S+ P  D  I     E  P   ++ T  G G   S+ +L+  +    +   +LPG+ + 
Sbjct:   595 QSIDPTYDETIQPNQPEYVPKTLELVTCSGYGKNGSISVLQNNIKPELVMAFELPGILN- 653

Query:   457 VWTV-KKNVNDE 467
             VWTV K+ + +E
Sbjct:   654 VWTVYKEEIEEE 665

 Score = 65 (27.9 bits), Expect = 1.1e-09, Sum P(5) = 1.1e-09
 Identities = 33/141 (23%), Positives = 55/141 (39%)

Query:   233 MLVTVPGGGDGPSGVLVCAENFVIYKNQ-----------GHPDVRAVIPRRA-DLPAERG 280
             MLV+VP   +   G LV   N + Y NQ              D   +I  +  D P +  
Sbjct:   352 MLVSVP---EPLGGALVITANIMFYVNQTSRYGLAVNEYASIDTSTIIGSQPFDFPIDDT 408

Query:   281 VLIVSAATHRQKTLFF----FLLQTEYGDIFKVTLEHDNEHVSELKIKYFDTIPVTASMC 336
             + +V     R   +F     F+   + G++    L  D   V  + +       +T+ +C
Sbjct:   409 LNLVFTLD-RSNFVFLESDKFIGSLKGGELLIFHLISDGRSVQRIHVSKAGGSVLTSCIC 467

Query:   337 VLKSGYLFAASEFGNHALYQF 357
             VL +  +F  S  G+  L Q+
Sbjct:   468 VLSNNLIFLGSRLGDSLLLQY 488

 Score = 46 (21.3 bits), Expect = 1.5e-06, Sum P(5) = 1.5e-06
 Identities = 12/48 (25%), Positives = 24/48 (50%)

Query:    11 PTGIIAAINGNFSGTKTPEIVVARGKVLEL--LRPENSGRIETLVSTE 56
             PTG+   I  N     +  +V+A+  VL++  +R E   + E +  ++
Sbjct:    14 PTGVEQCIKANLINDDSINLVLAKTNVLQIYKIRYEKIEKYENVSDSQ 61

 Score = 42 (19.8 bits), Expect = 2.5e-05, Sum P(4) = 2.5e-05
 Identities = 11/40 (27%), Positives = 22/40 (55%)

Query:   555 ALSGGELIYFEVDMTGQLLE-VEKHEMSGDV--ACLDIAS 591
             +L GGEL+ F +   G+ ++ +   +  G V  +C+ + S
Sbjct:   431 SLKGGELLIFHLISDGRSVQRIHVSKAGGSVLTSCICVLS 470

 Score = 41 (19.5 bits), Expect = 0.00026, Sum P(5) = 0.00026
 Identities = 9/30 (30%), Positives = 15/30 (50%)

Query:    77 IVVGSDSGRIVILEYNPSKNVFDKIHQETF 106
             I +GS  G  ++L+Y       D++  E F
Sbjct:   474 IFLGSRLGDSLLLQYTEKSITDDQLEHENF 503

 Score = 40 (19.1 bits), Expect = 3.9e-05, Sum P(4) = 3.9e-05
 Identities = 8/18 (44%), Positives = 14/18 (77%)

Query:   312 EHDNEHVSELKIKYFDTI 329
             E++NE+ +E++IK  D I
Sbjct:   919 ENENENENEIEIKDQDNI 936


>UNIPROTKB|J9P418 [details] [associations]
            symbol:CPSF1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0005634 "nucleus" evidence=IEA] [GO:0003676
            "nucleic acid binding" evidence=IEA] InterPro:IPR004871
            Pfam:PF03178 GO:GO:0005634 GO:GO:0003676
            GeneTree:ENSGT00550000075040 EMBL:AAEX03008966
            Ensembl:ENSCAFT00000043656 Uniprot:J9P418
        Length = 1107

 Score = 114 (45.2 bits), Expect = 3.4e-09, Sum P(4) = 3.4e-09
 Identities = 81/397 (20%), Positives = 166/397 (41%)

Query:   837 ENKYDPLS-DEQYGYPKAESDKWVSCIRVLDPRS--ANTTCLLELQDNEAAFSICTVNFH 893
             E +++ +  D++Y +P+ E+      I+++ P S  A     +EL++ E    + TV+  
Sbjct:   718 EKEFETIERDDRYIHPQQEAFS----IQLISPVSWEAIPNARIELEEWEHVTCMKTVSLR 773

Query:   894 DKEHGTLLAVGTAKGLQFWPKRNIVA-GYIHIYRFVE----EGKSL-----ELLHKTQVE 943
              +E  + L    A G        +   G I I   +E     G+ L     ++L++ + +
Sbjct:   774 SEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQK 833

Query:   944 GIPLALCQFQGRLLAGIGPVLRLYDLGKKRL--LRKCENKLFPNTIVSINTYRDRIYVGD 1001
             G   ALC   G L++ IG  + L+ L    L  +   + +L+ + ++S+  +   I   D
Sbjct:   834 GPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQLYIHQMISVKNF---ILAAD 890

Query:  1002 IQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMAG---ADKFGNIY-FVRLPQ 1057
             + +S    +Y+ +   L + + D+ P  + +   +  +   G   +D+  N+  ++ LP+
Sbjct:   891 VMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPE 950

Query:  1058 DVSDEIEEDPTGGKIKWEQGKLN-GAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIY 1116
                    ++  GG     +   + GA   +    +           K S+V        +
Sbjct:   951 ------AKESFGGMRLLRRADFHVGA--HVNTFWRTPCRGAAEGPSKKSVVWENKHITWF 1002

Query:  1117 GTVMGSLGAMLAFSSRD--DVDFFSHLEMHMRQEHPPLCGRD----HMAYRSAYFPVKDV 1170
              T+ G +G +L    +    +    +    M   H  L  R     H+  R     V++V
Sbjct:  1003 ATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQNAVRNV 1062

Query:  1171 IDGDLCEQFPTLSLDLQRKIADELDRTPGEILKKLEE 1207
             +DG+L  ++  LS   + ++A ++  TP  IL  L E
Sbjct:  1063 LDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLE 1099

 Score = 97 (39.2 bits), Expect = 1.8e-07, Sum P(4) = 1.8e-07
 Identities = 67/358 (18%), Positives = 132/358 (36%)

Query:   695 KLFSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYE-TLEYAASFSSDQCVEGVVSVA 753
             + F  + G     +C  S  WL    RG   L P+  +  ++  A F +  C  G +   
Sbjct:   596 RYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFN 655

Query:   754 GNA-LRVFTIERL---GETFNETALPLRYTPRRFVLQPKKKLMVIIETDQGALTAEEREA 809
                 LR+  +         +    +PLR T        + K+  +  +     T   R  
Sbjct:   656 RQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVESKVYAVATSTNMPCTRIPRMT 715

Query:   810 AKKECFXXXXXXXXXXXXXXXXXXXXXENKYDPLSDEQYGYPKAESDKW--VSCIRVLDP 867
              +++ F                          P+S E     + E ++W  V+C++ +  
Sbjct:   716 GEEKEFETIERDDRYIHPQQEAFSIQL---ISPVSWEAIPNARIELEEWEHVTCMKTVSL 772

Query:   868 RSANTTCLLELQDNEAAFSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNIVAGYIHIYRF 927
             RS  T   L+      A   C +   +          T +G      R ++   I +   
Sbjct:   773 RSEETVSGLK---GYVAAGTCLMQGEEV---------TCRG------RILIMDVIEVVP- 813

Query:   928 VEEGKSL-----ELLHKTQVEGIPLALCQFQGRLLAGIGPVLRLYDLGKKRL--LRKCEN 980
              E G+ L     ++L++ + +G   ALC   G L++ IG  + L+ L    L  +   + 
Sbjct:   814 -EPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDT 872

Query:   981 KLFPNTIVSINTYRDRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDF 1038
             +L+ + ++S+  +   I   D+ +S    +Y+ +   L + + D+ P      + +DF
Sbjct:   873 QLYIHQMISVKNF---ILAADVMKSISLLRYQEESKTLSLVSRDAKP---LEVYSVDF 924

 Score = 91 (37.1 bits), Expect = 3.4e-09, Sum P(4) = 3.4e-09
 Identities = 18/54 (33%), Positives = 34/54 (62%)

Query:   471 YIVVSFNNATLVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDS-LMQVHPSGIR 523
             ++++S  ++T++L  G+ + E+  SGF    P++    IGD+  ++QV P GIR
Sbjct:   238 FLILSREDSTMILQTGQEIMELDTSGFATQGPTVFAGNIGDNRYIVQVSPLGIR 291

 Score = 72 (30.4 bits), Expect = 3.4e-09, Sum P(4) = 3.4e-09
 Identities = 20/69 (28%), Positives = 34/69 (49%)

Query:   304 GDIFKVTLEHDNEHVSELKIKYFDTIP---VTASMCVLKSGYLFAASEFGNHALYQFQAI 360
             G+I+ +TL  D   +  ++  +FD      +T SM  ++ GYLF  S  GN  L ++   
Sbjct:     8 GEIYVLTLITDG--MRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEK 65

Query:   361 GADPDVEAS 369
               +P   A+
Sbjct:    66 LQEPPASAA 74

 Score = 48 (22.0 bits), Expect = 3.4e-09, Sum P(4) = 3.4e-09
 Identities = 13/48 (27%), Positives = 21/48 (43%)

Query:   420 QIFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDE 467
             +I    G G   +L +L+  +    +   +LPG    +WTV   V  E
Sbjct:   164 EIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYD-MWTVIAPVRKE 210


>UNIPROTKB|F1RSN8 [details] [associations]
            symbol:CPSF1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006378 "mRNA
            polyadenylation" evidence=IEA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA]
            [GO:0003730 "mRNA 3'-UTR binding" evidence=IEA] InterPro:IPR004871
            Pfam:PF03178 GO:GO:0006378 GO:GO:0003730 GO:GO:0005847
            GO:GO:0006379 GeneTree:ENSGT00550000075040 OMA:NIGDNRY
            EMBL:CU468594 Ensembl:ENSSSCT00000006486 Uniprot:F1RSN8
        Length = 1108

 Score = 114 (45.2 bits), Expect = 4.0e-05, Sum P(4) = 4.0e-05
 Identities = 81/397 (20%), Positives = 166/397 (41%)

Query:   837 ENKYDPLS-DEQYGYPKAESDKWVSCIRVLDPRS--ANTTCLLELQDNEAAFSICTVNFH 893
             E +++ +  D++Y +P+ E+      I+++ P S  A     +EL++ E    + TV+  
Sbjct:   719 EKEFETIDRDDRYIHPQQEAFS----IQLISPVSWEAIPNARIELEEWEHVTCMKTVSLR 774

Query:   894 DKEHGTLLAVGTAKGLQFWPKRNIVA-GYIHIYRFVE----EGKSL-----ELLHKTQVE 943
              +E  + L    A G        +   G I I   +E     G+ L     ++L++ + +
Sbjct:   775 SEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQK 834

Query:   944 GIPLALCQFQGRLLAGIGPVLRLYDLGKKRL--LRKCENKLFPNTIVSINTYRDRIYVGD 1001
             G   ALC   G L++ IG  + L+ L    L  +   + +L+ + ++S+  +   I   D
Sbjct:   835 GPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMAFIDTQLYIHQMISVKNF---ILAAD 891

Query:  1002 IQESFHFCKYRRDENQLYIFADDSVPRWLTAAHHIDFDTMAG---ADKFGNIY-FVRLPQ 1057
             + +S    +Y+ +   L + + D+ P  + +   +  +   G   +D+  N+  ++ LP+
Sbjct:   892 VMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPE 951

Query:  1058 DVSDEIEEDPTGGKIKWEQGKLN-GAPNKMEEIVQFHVGDVVTSLQKASLVPGGGESVIY 1116
                    ++  GG     +   + GA   +    +           K S+V        +
Sbjct:   952 ------AKESFGGMRLLRRADFHVGA--HVNTFWRTPCRGATDGPSKKSVVWENKHITWF 1003

Query:  1117 GTVMGSLGAMLAFSSRD--DVDFFSHLEMHMRQEHPPLCGRD----HMAYRSAYFPVKDV 1170
              T+ G +G +L    +    +    +    M   H  L  R     H+  R     V++V
Sbjct:  1004 ATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRVLQNAVRNV 1063

Query:  1171 IDGDLCEQFPTLSLDLQRKIADELDRTPGEILKKLEE 1207
             +DG+L  ++  LS   + ++A ++  TP  IL  L E
Sbjct:  1064 LDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLE 1100

 Score = 86 (35.3 bits), Expect = 4.0e-05, Sum P(4) = 4.0e-05
 Identities = 51/241 (21%), Positives = 97/241 (40%)

Query:    44 ENSGRIETLVSTEIFGAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFD--KI 101
             E+  ++E + S   FG + S+A  +L G+++D +++     ++ ++EY+P  +      +
Sbjct:    64 EHREKLELVASFSFFG-VMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSL 122

Query:   102 H--QETFGKSGCRRIVPGQYLAVDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEA 159
             H  +E   + G  + V    + VDP GR   +     + +V    R++ A        E 
Sbjct:   123 HYFEEPELRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGEG 182

Query:   160 HKSHTIVYSICGIDCGFDNPIFAAIELDYS------------EADQDSTGQAASEAQKNL 207
              +S  +   I  +    D  +   ++L +             E +Q   G+ A   Q   
Sbjct:   183 QRSSFLPSYIIDVRA-LDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVR-QDTC 240

Query:   208 TFYELDLGLNHVSRK--WSE---PVDNGANMLVTVPGGGDGPSGVLVCAENFVIYKNQGH 262
             +   + L +        WS    P D    + V  P GG     V++ A N ++Y NQ  
Sbjct:   241 SIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGG-----VVIFAVNSLLYLNQSV 295

Query:   263 P 263
             P
Sbjct:   296 P 296

 Score = 46 (21.3 bits), Expect = 4.0e-05, Sum P(4) = 4.0e-05
 Identities = 13/61 (21%), Positives = 31/61 (50%)

Query:   271 RRADLPAERGVLIVSAATHRQKTLFFFLLQTEYGDIFKVTLEHDNE-HVSELKIKYFDTI 329
             R+ +LP  + VL+V+  + +++   + L+  +   +      HD++     LK++ F  +
Sbjct:   504 RQGELPLVKEVLLVALGSRQRRP--YLLVHVDQELLIYEAFPHDSQLGQGNLKVR-FKKV 560

Query:   330 P 330
             P
Sbjct:   561 P 561

 Score = 37 (18.1 bits), Expect = 4.0e-05, Sum P(4) = 4.0e-05
 Identities = 12/42 (28%), Positives = 17/42 (40%)

Query:     1 MYLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLR 42
             MY        PTG+  ++  NF       +VVA    L + R
Sbjct:     1 MYAVYKQAHPPTGLEFSMYCNFFNNSERNLVVAGTSQLYVYR 42


>UNIPROTKB|F5H581 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9606
            "Homo sapiens" [GO:0016055 "Wnt receptor signaling pathway"
            evidence=IEA] GO:GO:0016055 GO:GO:0003684 EMBL:AP003108
            HGNC:HGNC:2717 ChiTaRS:DDB1 EMBL:AP003037 IPI:IPI00909251
            ProteinModelPortal:F5H581 SMR:F5H581 Ensembl:ENST00000535147
            ArrayExpress:F5H581 Bgee:F5H581 Uniprot:F5H581
        Length = 267

 Score = 128 (50.1 bits), Expect = 5.7e-05, P = 5.7e-05
 Identities = 52/210 (24%), Positives = 93/210 (44%)

Query:   580 MSGDVACLDIASVPEGRKRSRFLAVGSY-DNTIRILSLDPDDCMQILSVQSVSSP--PES 636
             M  +VACLDI  + +    S   A+G + D + RIL L P    ++L  + +     P S
Sbjct:     1 MEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKL-PS--FELLHKEMLGGEIIPRS 57

Query:   637 LLFLEVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDSRSRFLGLRPPKL 696
             +L    ++S        H    +L   L +G LF   +++ TG LSD +   LG +P  L
Sbjct:    58 ILMTTFESS--------H----YLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVL 105

Query:   697 FSVVVGGRAAMLCLSSRPWLGYIHRGRFLLTPLSYETLEYAASFSSDQCVEGVVSVAGNA 756
              +        +   S RP + Y    + + + ++ + + Y    +SD   + +     + 
Sbjct:   106 RTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNST 165

Query:   757 LRVFTIERLGETFNETALPLRYTPRRFVLQ 786
             L + TI+ + +    T +PL  +PR+   Q
Sbjct:   166 LTIGTIDEIQKLHIRT-VPLYESPRKICYQ 194


>WB|WBGene00022301 [details] [associations]
            symbol:cpsf-1 species:6239 "Caenorhabditis elegans"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0009792 "embryo development ending in
            birth or egg hatching" evidence=IMP] [GO:0000003 "reproduction"
            evidence=IMP] [GO:0040007 "growth" evidence=IMP] [GO:0002119
            "nematode larval development" evidence=IMP] [GO:0040018 "positive
            regulation of multicellular organism growth" evidence=IMP]
            [GO:0010171 "body morphogenesis" evidence=IMP] [GO:0040027
            "negative regulation of vulval development" evidence=IMP]
            InterPro:IPR004871 Pfam:PF03178 GO:GO:0009792 GO:GO:0040007
            GO:GO:0002119 GO:GO:0006378 GO:GO:0010171 GO:GO:0040018
            GO:GO:0000003 GO:GO:0003723 GO:GO:0040027 eggNOG:COG5161 KO:K14401
            GO:GO:0005847 GO:GO:0006379 GeneTree:ENSGT00550000075040
            OMA:NIGDNRY HOGENOM:HOG000007904 EMBL:FO081666 RefSeq:NP_500157.2
            ProteinModelPortal:Q9N4C2 MINT:MINT-3384281 STRING:Q9N4C2
            PaxDb:Q9N4C2 EnsemblMetazoa:Y76B12C.7.1 EnsemblMetazoa:Y76B12C.7.2
            GeneID:177003 KEGG:cel:CELE_Y76B12C.7 CTD:177003 WormBase:Y76B12C.7
            InParanoid:Q9N4C2 NextBio:894932 Uniprot:Q9N4C2
        Length = 1454

 Score = 112 (44.5 bits), Expect = 0.00010, Sum P(4) = 0.00010
 Identities = 38/159 (23%), Positives = 72/159 (45%)

Query:   421 IFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDEFDAYIVVSFNNAT 480
             + T  G G   +L + +  L    +  S L G    +W V +  N E   Y++VS   +T
Sbjct:   479 LVTASGHGKNGALCVHQRSLRPEIITSSLLEGAEQ-LWAVGRKEN-ESHKYLIVSRVRST 536

Query:   481 LVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDSL-MQVHPSGIRHIREDGRINEWRTPGK 539
             L+L +GE + E+ +  F+   P++A   +   +L +QV  + I  + +  ++ E      
Sbjct:   537 LILELGEELVELEEQLFVTGEPTVAAGELSQGALAVQVTSTCIALVTDGQQMQEVHIDSN 596

Query:   540 RTIVKVGSNRLQVVIALSGGELIYFEVDMTG--QLLEVE 576
               +++       V +    G L+ +E+ M    QL EV+
Sbjct:   597 FPVIQASIVDPYVALLTQNGRLLLYELVMEPYVQLREVD 635

 Score = 65 (27.9 bits), Expect = 0.00010, Sum P(4) = 0.00010
 Identities = 67/335 (20%), Positives = 130/335 (38%)

Query:   899 TLLAVGTAKGLQFWPKRNIVAGYIHIYRFVE---------EGKSLELLHKTQVEGIPLAL 949
             TLLA+GT      + +  +V G I +   +E           + +++L   + +G    L
Sbjct:  1125 TLLAMGTVNN---YGEEVLVRGRIILCEVIEVVPEPDQPTSNRKIKVLFDKEQKGPVTGL 1181

Query:   950 CQFQGRLLAGIGPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFC 1009
             C   G LL G+G  + ++      L+      +    +  +++ R      D +ES    
Sbjct:  1182 CAINGLLLCGMGQKVFIWQFKDNDLMGISFLDMH-YYVYQLHSLRTIAIACDARESMSLI 1240

Query:  1010 KYRRDENQLYIFA-DDSVPRWLTAAHHIDFD-TMAG---ADKFGNIYFVRLPQDVSDEIE 1064
             +++ D   + I + DD        A  +  D    G   +D+ GNI       + + E  
Sbjct:  1241 RFQEDNKAMSIASRDDRKCAQPPMASQLVVDGAHVGFLLSDETGNITMF----NYAPEAP 1296

Query:  1065 EDPTGGKIKWEQGKLNGAPNKMEEIVQF--HVGDVVTSLQKASLVPGGGESVIYGTVMGS 1122
             E   GG+    +  +N   N +   V+   H   +  + +          + ++ ++ GS
Sbjct:  1297 ES-NGGERLTVRAAINIGTN-INAFVRLRGHTSLLQLNNEDEKEAIEQRMTTVFASLDGS 1354

Query:  1123 LGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDHMAYRSAY--FPV------KDVIDGD 1174
              G +   + +        L+  +    P + G      RSA    P+      +++IDGD
Sbjct:  1355 FGFVRPLTEKS-YRRLHFLQTFIGSVTPQIAGLHIKGSRSAKPSQPIVNGRNARNLIDGD 1413

Query:  1175 LCEQFPTLSLDLQRKIADELDRTPGEILKKLEEIR 1209
             + EQ+  LSL  +  +A  L      I+  L ++R
Sbjct:  1414 VVEQYLHLSLYDKTDLARRLGVGRYHIIDDLMQLR 1448

 Score = 59 (25.8 bits), Expect = 0.00010, Sum P(4) = 0.00010
 Identities = 79/353 (22%), Positives = 131/353 (37%)

Query:    48 RIETLVSTEIFGAIRSLAQFRLTG-SQKDYIVVGSDSGRIVILEYNPSKNVFDKIHQETF 106
             ++E + S  +     S+A  R+     +D I++  D  ++ I+  N  +     I    F
Sbjct:    65 KLECMFSCRLLNKCHSIAVARVPQLPDQDSILMTFDDAKLSIVSINEKERNMQTISLHAF 124

Query:   107 GKSGCRRIVPGQY----LAVDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKS 162
                  R      +    +  DP  R     AC    LVY  +    A L       + + 
Sbjct:   125 ENEYLRDGFINHFQPPLVRSDPSNRCA---AC----LVYGKH---IAILPFHE--NSKRI 172

Query:   163 HTIVYSICGIDCGFDNPIFAAIELD-YSEAD--------QDSTGQAASEAQKNLTFYELD 213
             H+ V  +  ID   DN I   + LD Y E          Q + G+A        T   + 
Sbjct:   173 HSYVIPLKQIDPRLDN-IADMVFLDGYYEPTILFLYEPIQTTPGRACVRYD---TMCIMG 228

Query:   214 LGLNHVSRK----WSE---PVDNGANMLVTVPGGGDGP-SGVLVCAENFVIYKNQGHPDV 265
             + +N V R+    W     P+D   + L+ +P     P  G LV   N V+Y NQ  P  
Sbjct:   229 VSVNIVDRQFAVVWQTANLPMD--CSQLLPIPK----PLGGALVFGSNTVVYLNQAVPPC 282

Query:   266 RAVI----------PRRADLPAERGVLIVSAATHRQKTLFFFLLQTEYGDIFKVTL--EH 313
               V+          P + DL   +  L  S + + +       + +  GD+F + L    
Sbjct:   283 GLVLNSCYDGFTKFPLK-DLKHLKMTLDCSTSVYMEDGRI--AVGSRDGDLFLLRLMTSS 339

Query:   314 DNEHVSELKI-KYFDTIPVTASMCVLKSGYLFAASEFGNHALYQFQAIGADPD 365
                 V  L+  K ++T  +  S+ V   G+LF  S  G+  L ++  +    D
Sbjct:   340 GGGTVKSLEFSKVYET-SIAYSLTVCAPGHLFVGSRLGDSQLLEYTLLKTTRD 391

 Score = 53 (23.7 bits), Expect = 0.00030, Sum P(5) = 0.00030
 Identities = 17/63 (26%), Positives = 31/63 (49%)

Query:    37 VLELLRPENSGRIETLVSTEIFGAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKN 96
             +L L+     G +++L  ++++    S+A + LT     ++ VGS  G   +LEY   K 
Sbjct:   332 LLRLMTSSGGGTVKSLEFSKVYET--SIA-YSLTVCAPGHLFVGSRLGDSQLLEYTLLKT 388

Query:    97 VFD 99
               D
Sbjct:   389 TRD 391

 Score = 48 (22.0 bits), Expect = 0.00010, Sum P(4) = 0.00010
 Identities = 12/40 (30%), Positives = 18/40 (45%)

Query:   623 QILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNA 662
             Q+ S+   +   E + F   + S+GG  G D   S   NA
Sbjct:   653 QLTSISIYADASEIMKFAAAEKSMGGGGGGDGEVSTAENA 692

 Score = 43 (20.2 bits), Expect = 0.00030, Sum P(5) = 0.00030
 Identities = 11/30 (36%), Positives = 15/30 (50%)

Query:   177 DNPIFAAIELDYSEADQDSTGQAASEAQKN 206
             DN   AA E++  E D +  G A  E Q +
Sbjct:   400 DNKDPAAAEIELDEDDMELYGGAIEEQQND 429


>UNIPROTKB|Q9N4C2 [details] [associations]
            symbol:cpsf-1 "Probable cleavage and polyadenylation
            specificity factor subunit 1" species:6239 "Caenorhabditis elegans"
            [GO:0006378 "mRNA polyadenylation" evidence=NAS] [GO:0006379 "mRNA
            cleavage" evidence=NAS] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=NAS]
            InterPro:IPR004871 Pfam:PF03178 GO:GO:0009792 GO:GO:0040007
            GO:GO:0002119 GO:GO:0006378 GO:GO:0010171 GO:GO:0040018
            GO:GO:0000003 GO:GO:0003723 GO:GO:0040027 eggNOG:COG5161 KO:K14401
            GO:GO:0005847 GO:GO:0006379 GeneTree:ENSGT00550000075040
            OMA:NIGDNRY HOGENOM:HOG000007904 EMBL:FO081666 RefSeq:NP_500157.2
            ProteinModelPortal:Q9N4C2 MINT:MINT-3384281 STRING:Q9N4C2
            PaxDb:Q9N4C2 EnsemblMetazoa:Y76B12C.7.1 EnsemblMetazoa:Y76B12C.7.2
            GeneID:177003 KEGG:cel:CELE_Y76B12C.7 CTD:177003 WormBase:Y76B12C.7
            InParanoid:Q9N4C2 NextBio:894932 Uniprot:Q9N4C2
        Length = 1454

 Score = 112 (44.5 bits), Expect = 0.00010, Sum P(4) = 0.00010
 Identities = 38/159 (23%), Positives = 72/159 (45%)

Query:   421 IFTLCGRGPRSSLRILRPGLAVSEMAVSQLPGVPSAVWTVKKNVNDEFDAYIVVSFNNAT 480
             + T  G G   +L + +  L    +  S L G    +W V +  N E   Y++VS   +T
Sbjct:   479 LVTASGHGKNGALCVHQRSLRPEIITSSLLEGAEQ-LWAVGRKEN-ESHKYLIVSRVRST 536

Query:   481 LVLSIGETVEEVSDSGFLDTTPSLAVSLIGDDSL-MQVHPSGIRHIREDGRINEWRTPGK 539
             L+L +GE + E+ +  F+   P++A   +   +L +QV  + I  + +  ++ E      
Sbjct:   537 LILELGEELVELEEQLFVTGEPTVAAGELSQGALAVQVTSTCIALVTDGQQMQEVHIDSN 596

Query:   540 RTIVKVGSNRLQVVIALSGGELIYFEVDMTG--QLLEVE 576
               +++       V +    G L+ +E+ M    QL EV+
Sbjct:   597 FPVIQASIVDPYVALLTQNGRLLLYELVMEPYVQLREVD 635

 Score = 65 (27.9 bits), Expect = 0.00010, Sum P(4) = 0.00010
 Identities = 67/335 (20%), Positives = 130/335 (38%)

Query:   899 TLLAVGTAKGLQFWPKRNIVAGYIHIYRFVE---------EGKSLELLHKTQVEGIPLAL 949
             TLLA+GT      + +  +V G I +   +E           + +++L   + +G    L
Sbjct:  1125 TLLAMGTVNN---YGEEVLVRGRIILCEVIEVVPEPDQPTSNRKIKVLFDKEQKGPVTGL 1181

Query:   950 CQFQGRLLAGIGPVLRLYDLGKKRLLRKCENKLFPNTIVSINTYRDRIYVGDIQESFHFC 1009
             C   G LL G+G  + ++      L+      +    +  +++ R      D +ES    
Sbjct:  1182 CAINGLLLCGMGQKVFIWQFKDNDLMGISFLDMH-YYVYQLHSLRTIAIACDARESMSLI 1240

Query:  1010 KYRRDENQLYIFA-DDSVPRWLTAAHHIDFD-TMAG---ADKFGNIYFVRLPQDVSDEIE 1064
             +++ D   + I + DD        A  +  D    G   +D+ GNI       + + E  
Sbjct:  1241 RFQEDNKAMSIASRDDRKCAQPPMASQLVVDGAHVGFLLSDETGNITMF----NYAPEAP 1296

Query:  1065 EDPTGGKIKWEQGKLNGAPNKMEEIVQF--HVGDVVTSLQKASLVPGGGESVIYGTVMGS 1122
             E   GG+    +  +N   N +   V+   H   +  + +          + ++ ++ GS
Sbjct:  1297 ES-NGGERLTVRAAINIGTN-INAFVRLRGHTSLLQLNNEDEKEAIEQRMTTVFASLDGS 1354

Query:  1123 LGAMLAFSSRDDVDFFSHLEMHMRQEHPPLCGRDHMAYRSAY--FPV------KDVIDGD 1174
              G +   + +        L+  +    P + G      RSA    P+      +++IDGD
Sbjct:  1355 FGFVRPLTEKS-YRRLHFLQTFIGSVTPQIAGLHIKGSRSAKPSQPIVNGRNARNLIDGD 1413

Query:  1175 LCEQFPTLSLDLQRKIADELDRTPGEILKKLEEIR 1209
             + EQ+  LSL  +  +A  L      I+  L ++R
Sbjct:  1414 VVEQYLHLSLYDKTDLARRLGVGRYHIIDDLMQLR 1448

 Score = 59 (25.8 bits), Expect = 0.00010, Sum P(4) = 0.00010
 Identities = 79/353 (22%), Positives = 131/353 (37%)

Query:    48 RIETLVSTEIFGAIRSLAQFRLTG-SQKDYIVVGSDSGRIVILEYNPSKNVFDKIHQETF 106
             ++E + S  +     S+A  R+     +D I++  D  ++ I+  N  +     I    F
Sbjct:    65 KLECMFSCRLLNKCHSIAVARVPQLPDQDSILMTFDDAKLSIVSINEKERNMQTISLHAF 124

Query:   107 GKSGCRRIVPGQY----LAVDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKS 162
                  R      +    +  DP  R     AC    LVY  +    A L       + + 
Sbjct:   125 ENEYLRDGFINHFQPPLVRSDPSNRCA---AC----LVYGKH---IAILPFHE--NSKRI 172

Query:   163 HTIVYSICGIDCGFDNPIFAAIELD-YSEAD--------QDSTGQAASEAQKNLTFYELD 213
             H+ V  +  ID   DN I   + LD Y E          Q + G+A        T   + 
Sbjct:   173 HSYVIPLKQIDPRLDN-IADMVFLDGYYEPTILFLYEPIQTTPGRACVRYD---TMCIMG 228

Query:   214 LGLNHVSRK----WSE---PVDNGANMLVTVPGGGDGP-SGVLVCAENFVIYKNQGHPDV 265
             + +N V R+    W     P+D   + L+ +P     P  G LV   N V+Y NQ  P  
Sbjct:   229 VSVNIVDRQFAVVWQTANLPMD--CSQLLPIPK----PLGGALVFGSNTVVYLNQAVPPC 282

Query:   266 RAVI----------PRRADLPAERGVLIVSAATHRQKTLFFFLLQTEYGDIFKVTL--EH 313
               V+          P + DL   +  L  S + + +       + +  GD+F + L    
Sbjct:   283 GLVLNSCYDGFTKFPLK-DLKHLKMTLDCSTSVYMEDGRI--AVGSRDGDLFLLRLMTSS 339

Query:   314 DNEHVSELKI-KYFDTIPVTASMCVLKSGYLFAASEFGNHALYQFQAIGADPD 365
                 V  L+  K ++T  +  S+ V   G+LF  S  G+  L ++  +    D
Sbjct:   340 GGGTVKSLEFSKVYET-SIAYSLTVCAPGHLFVGSRLGDSQLLEYTLLKTTRD 391

 Score = 53 (23.7 bits), Expect = 0.00030, Sum P(5) = 0.00030
 Identities = 17/63 (26%), Positives = 31/63 (49%)

Query:    37 VLELLRPENSGRIETLVSTEIFGAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKN 96
             +L L+     G +++L  ++++    S+A + LT     ++ VGS  G   +LEY   K 
Sbjct:   332 LLRLMTSSGGGTVKSLEFSKVYET--SIA-YSLTVCAPGHLFVGSRLGDSQLLEYTLLKT 388

Query:    97 VFD 99
               D
Sbjct:   389 TRD 391

 Score = 48 (22.0 bits), Expect = 0.00010, Sum P(4) = 0.00010
 Identities = 12/40 (30%), Positives = 18/40 (45%)

Query:   623 QILSVQSVSSPPESLLFLEVQASVGGEDGADHPASLFLNA 662
             Q+ S+   +   E + F   + S+GG  G D   S   NA
Sbjct:   653 QLTSISIYADASEIMKFAAAEKSMGGGGGGDGEVSTAENA 692

 Score = 43 (20.2 bits), Expect = 0.00030, Sum P(5) = 0.00030
 Identities = 11/30 (36%), Positives = 15/30 (50%)

Query:   177 DNPIFAAIELDYSEADQDSTGQAASEAQKN 206
             DN   AA E++  E D +  G A  E Q +
Sbjct:   400 DNKDPAAAEIELDEDDMELYGGAIEEQQND 429


>UNIPROTKB|F5GZY8 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9606
            "Homo sapiens" [GO:0016055 "Wnt receptor signaling pathway"
            evidence=IEA] GO:GO:0016055 GO:GO:0003684 EMBL:AP003108
            HGNC:HGNC:2717 ChiTaRS:DDB1 EMBL:AP003037 IPI:IPI01011929
            ProteinModelPortal:F5GZY8 SMR:F5GZY8 Ensembl:ENST00000542337
            ArrayExpress:F5GZY8 Bgee:F5GZY8 Uniprot:F5GZY8
        Length = 146

 Score = 107 (42.7 bits), Expect = 0.00011, P = 0.00011
 Identities = 36/134 (26%), Positives = 59/134 (44%)

Query:     2 YLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAI 61
             Y Y +T Q+PT +   + G+F+  +   +++A+   LE+      G +  +    ++G I
Sbjct:     3 YNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVVTAEG-LRPVKEVGMYGKI 61

Query:    62 RSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKI---HQETFGKSGCRRIVPGQ 118
               +  FR  G  KD + + +      ILEY  S    D I   H     + G R    G 
Sbjct:    62 AVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG-RPSETGI 120

Query:   119 YLAVDPKGRAVMIG 132
                +DP+ R  MIG
Sbjct:   121 IGIIDPECR--MIG 132


>UNIPROTKB|F5GYG8 [details] [associations]
            symbol:DDB1 "DNA damage-binding protein 1" species:9606
            "Homo sapiens" [GO:0016055 "Wnt receptor signaling pathway"
            evidence=IEA] GO:GO:0016055 GO:GO:0003684 EMBL:AP003108
            HGNC:HGNC:2717 ChiTaRS:DDB1 EMBL:AP003037 IPI:IPI01012348
            ProteinModelPortal:F5GYG8 SMR:F5GYG8 Ensembl:ENST00000543627
            ArrayExpress:F5GYG8 Bgee:F5GYG8 Uniprot:F5GYG8
        Length = 109

 Score = 103 (41.3 bits), Expect = 0.00028, P = 0.00028
 Identities = 26/100 (26%), Positives = 46/100 (46%)

Query:     2 YLYSLTLQQPTGIIAAINGNFSGTKTPEIVVARGKVLELLRPENSGRIETLVSTEIFGAI 61
             Y Y +T Q+PT +   + G+F+  +   +++A+   LE+      G +  +    ++G I
Sbjct:     3 YNYVVTAQKPTAVNGCVTGHFTSAEDLNLLIAKNTRLEIYVVTAEG-LRPVKEVGMYGKI 61

Query:    62 RSLAQFRLTGSQKDYIVVGSDSGRIVILEYNPSKNVFDKI 101
               +  FR  G  KD + + +      ILEY  S    D I
Sbjct:    62 AVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDII 101


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.320   0.137   0.403    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0     1213      1192   0.00096  123 3  11 22  0.39    34
                                                     39  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  68
  No. of states in DFA:  624 (66 KB)
  Total size of DFA:  510 KB (2236 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  98.83u 0.16s 98.99t   Elapsed:  00:00:06
  Total cpu time:  98.87u 0.16s 99.03t   Elapsed:  00:00:06
  Start:  Mon May 20 23:14:12 2013   End:  Mon May 20 23:14:18 2013

Back to top