BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>001003
MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVV
TAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILS
QGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARG
PLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD
LDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWS
AMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSF
SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGN
SLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVN
GEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYE
LVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVT
ESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENST
VLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEP
WLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF
VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPF
LFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAY
TREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVL
HNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLI
VSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT
RATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNAD
NPQNLVLSGSYGPLFSSVQIDFASHFFAICSNSFVFVFLFSFLRSLFIIGNV

High Scoring Gene Products

Symbol      Full name                                                            Information                           P value
CPSF160     cleavage and polyadenylation specificity factor 160                  protein from Arabidopsis thaliana     0.
cpsf1       cleavage and polyadenylation specific factor 1                       gene_product from Danio rerio         1.3e-105
CPSF1       Uncharacterized protein                                              protein from Canis lupus familiaris   7.4e-104
CPSF1       Cleavage and polyadenylation specificity factor subunit 1            protein from Bos taurus               2.0e-103
Cpsf1       cleavage and polyadenylation specific factor 1, 160kDa               gene from Rattus norvegicus           7.0e-103
Cpsf1       cleavage and polyadenylation specific factor 1                       protein from Mus musculus             7.3e-103
CPSF1       Cleavage and polyadenylation specificity factor subunit 1            protein from Homo sapiens             2.9e-102
Cpsf160     Cleavage and polyadenylation specificity factor 160                  protein from Drosophila melanogaster  1.6e-86
cpsf1       cleavage and polyadenylation specificity factor 160 kDa subunit      gene from Dictyostelium discoideum    4.8e-73
CPSF1       Uncharacterized protein                                              protein from Sus scrofa               9.9e-71
cpsf-1                                                                           gene from Caenorhabditis elegans      1.9e-63
cpsf-1      Probable cleavage and polyadenylation specificity factor subunit 1   protein from Caenorhabditis elegans   1.9e-63
CPSF1       Uncharacterized protein                                              protein from Canis lupus familiaris   2.3e-53
CPSF1       Uncharacterized protein                                              protein from Sus scrofa               8.2e-27
orf19.2760                                                                       gene_product from Candida albicans    1.7e-17
CFT1        Protein CFT1                                                         protein from Candida albicans SC5314  1.7e-17
DDB1A       AT4G05420                                                            protein from Arabidopsis thaliana     7.1e-07
pic         piccolo                                                              protein from Drosophila melanogaster  2.9e-06
DDB1B       damaged DNA binding protein 1B                                       protein from Arabidopsis thaliana     3.1e-06
CFT1        RNA-binding subunit of the mRNA cleavage and polyadenylation factor  gene from Saccharomyces cerevisiae    8.2e-05
ddb1        damage specific DNA binding protein 1                                gene_product from Danio rerio         0.00024

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  001003
        (1192 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2153122 - symbol:CPSF160 "cleavage and polyade...  2330  0.        2
ZFIN|ZDB-GENE-040709-2 - symbol:cpsf1 "cleavage and polya...   648  1.3e-105  3
UNIPROTKB|F1PC28 - symbol:CPSF1 "Uncharacterized protein"...   640  7.4e-104  3
UNIPROTKB|Q10569 - symbol:CPSF1 "Cleavage and polyadenyla...   651  2.0e-103  4
RGD|1306406 - symbol:Cpsf1 "cleavage and polyadenylation ...   652  7.0e-103  4
MGI|MGI:2679722 - symbol:Cpsf1 "cleavage and polyadenylat...   648  7.3e-103  4
UNIPROTKB|Q10570 - symbol:CPSF1 "Cleavage and polyadenyla...   651  2.9e-102  4
FB|FBgn0024698 - symbol:Cpsf160 "Cleavage and polyadenyla...   566  1.6e-86   4
DICTYBASE|DDB_G0281585 - symbol:cpsf1 "cleavage and polya...   413  4.8e-73   6
UNIPROTKB|F1RSN8 - symbol:CPSF1 "Uncharacterized protein"...   466  9.9e-71   4
WB|WBGene00022301 - symbol:cpsf-1 species:6239 "Caenorhab...   431  1.9e-63   3
UNIPROTKB|Q9N4C2 - symbol:cpsf-1 "Probable cleavage and p...   431  1.9e-63   3
UNIPROTKB|J9P418 - symbol:CPSF1 "Uncharacterized protein"...   349  2.3e-53   3
POMBASE|SPBC1709.08 - symbol:cft1 "cleavage factor one Cf...   268  3.3e-30   3
UNIPROTKB|K7GNU1 - symbol:CPSF1 "Uncharacterized protein"...   296  8.2e-27   2
ASPGD|ASPL0000050546 - symbol:AN1413 species:162425 "Emer...   209  6.7e-20   2
CGD|CAL0004251 - symbol:orf19.2760 species:5476 "Candida ...   224  1.7e-17   3
UNIPROTKB|Q5AFT3 - symbol:CFT1 "Protein CFT1" species:237...   224  1.7e-17   3
TAIR|locus:2115909 - symbol:DDB1A "damaged DNA binding pr...    91  7.1e-07   4
FB|FBgn0260962 - symbol:pic "piccolo" species:7227 "Droso...   141  2.9e-06   5
TAIR|locus:2127368 - symbol:DDB1B "damaged DNA binding pr...   100  3.1e-06   4
SGD|S000002709 - symbol:CFT1 "RNA-binding subunit of the ...    91  8.2e-05   3
ZFIN|ZDB-GENE-040426-1272 - symbol:ddb1 "damage specific ...   116  0.00024   3


>TAIR|locus:2153122
            symbol:CPSF160 "cleavage and polyadenylation specificity
            factor 160" species:3702 "Arabidopsis thaliana" [GO:0003676
            "nucleic acid binding" evidence=IEA] [GO:0005634 "nucleus"
            evidence=ISM;IEA;IDA] [GO:0006378 "mRNA polyadenylation"
            evidence=ISS] [GO:0006379 "mRNA cleavage" evidence=ISS] [GO:0005515
            "protein binding" evidence=IPI] [GO:0005829 "cytosol" evidence=IDA]
            [GO:0006397 "mRNA processing" evidence=RCA] [GO:0009909 "regulation
            of flower development" evidence=RCA] [GO:0016570 "histone
            modification" evidence=RCA] [GO:0048449 "floral organ formation"
            evidence=RCA] InterPro:IPR004871 Pfam:PF03178 GO:GO:0005829
            GO:GO:0005634 EMBL:CP002688 GenomeReviews:BA000015_GR GO:GO:0006397
            GO:GO:0003723 eggNOG:COG5161 KO:K14401 EMBL:AY140902 EMBL:AB025607
            EMBL:AB018109 IPI:IPI00533913 RefSeq:NP_199979.2 UniGene:At.43551
            IntAct:Q9FGR0 STRING:Q9FGR0 PaxDb:Q9FGR0 PRIDE:Q9FGR0
            EnsemblPlants:AT5G51660.1 GeneID:835240 KEGG:ath:AT5G51660
            TAIR:At5g51660 HOGENOM:HOG000265012 InParanoid:Q9FGR0 OMA:NIGDNRY
            PhylomeDB:Q9FGR0 ProtClustDB:CLSN2680511 Genevestigator:Q9FGR0
            GermOnline:AT5G51660 Uniprot:Q9FGR0
        Length = 1442

 Score = 2330 (825.3 bits), Expect = 0., Sum P(2) = 0.
 Identities = 445/605 (73%), Positives = 500/605 (82%)

Query:   542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
             VELPGCKGIWTVYHKSSRGHNADSS+MAA +DEYHAYLIISLEARTMVLETADLLTEVTE
Sbjct:   570 VELPGCKGIWTVYHKSSRGHNADSSKMAADEDEYHAYLIISLEARTMVLETADLLTEVTE 629

Query:   602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXXXXXXXXXXXTV 661
             SVDY+VQGRTIAAGNLFGRRRVIQVFE GARILDGS+M Q+LSFG             TV
Sbjct:   630 SVDYYVQGRTIAAGNLFGRRRVIQVFEHGARILDGSFMNQELSFGASNSESNSGSESSTV 689

Query:   662 LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPW 721
              SVSIADPYVLL M+D SIRLLVGDPSTCTVS+ +P+ +E SK+ +S+CTLYHDKGPEPW
Sbjct:   690 SSVSIADPYVLLRMTDDSIRLLVGDPSTCTVSISSPSVLEGSKRKISACTLYHDKGPEPW 749

Query:   722 LRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFV 781
             LRK STDAWLS+GVGEA+D  DGGP DQGDIY VVCYESGALEIFDVP+FNCVF+VDKF 
Sbjct:   750 LRKASTDAWLSSGVGEAVDSVDGGPQDQGDIYCVVCYESGALEIFDVPSFNCVFSVDKFA 809

Query:   782 SGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFL 841
             SGR H+ D  + E     E E+N +SE+ T    KE I + +VVELAMQRWS HH+RPFL
Sbjct:   810 SGRRHLSDMPIHEL----EYELNKNSEDNTSS--KE-IKNTRVVELAMQRWSGHHTRPFL 862

Query:   842 FAILTDGTILCYQAYLFEGPENTSKSDDPXXXXXXXXXXXXXXXXXXXLRFSRTPLDAYT 901
             FA+L DGTILCY AYLF+G ++T K+++                    L+F R PLD  T
Sbjct:   863 FAVLADGTILCYHAYLFDGVDST-KAENSLSSENPAALNSSGSSKLRNLKFLRIPLDTST 921

Query:   902 REETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLH 961
             RE T  G   QRIT+FKNISGHQGFFLSGSRP WCM+FRERLR H QLCDGSI AFTVLH
Sbjct:   922 REGTSDGVASQRITMFKNISGHQGFFLSGSRPGWCMLFRERLRFHSQLCDGSIAAFTVLH 981

Query:   962 NVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIV 1021
             NVNCNHGFIYVT+QG+LKICQLPS S YDNYWPVQK IPLKATPHQ+TY+AEKNLYPLIV
Sbjct:   982 NVNCNHGFIYVTAQGVLKICQLPSASIYDNYWPVQK-IPLKATPHQVTYYAEKNLYPLIV 1040

Query:  1022 SVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTR 1081
             S PV KPLNQVLS L+DQE G Q+DNHN+SS DL RTYTVEE+E++ILEP+R+GGPW+T+
Sbjct:  1041 SYPVSKPLNQVLSSLVDQEAGQQLDNHNMSSDDLQRTYTVEEFEIQILEPERSGGPWETK 1100

Query:  1082 ATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADN 1141
             A IPMQ+SE+ALTVRVVTL N +T ENETLLA+GTAYVQGEDVAARGRVLLFS G+N DN
Sbjct:  1101 AKIPMQTSEHALTVRVVTLLNASTGENETLLAVGTAYVQGEDVAARGRVLLFSFGKNGDN 1160

Query:  1142 PQNLV 1146
              QN+V
Sbjct:  1161 SQNVV 1165

 Score = 2091 (741.1 bits), Expect = 0., Sum P(2) = 0.
 Identities = 396/545 (72%), Positives = 464/545 (85%)

Query:     1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQT-EELDSELPS-KRGIGPVPNL 58
             MSFAAYKMMHWPTG+ NC SG+ITHS +D   QIP++   +++++E P+ KRGIGP+PN+
Sbjct:     1 MSFAAYKMMHWPTGVENCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNV 60

Query:    59 VVTAANVIEIYVVRVQXXXXXXXXXX-XXTKRRVLMDGISAASLELVCHYRLHGNVESLA 117
             V+TAAN++E+Y+VR Q              KR  +MDG+   SLELVCHYRLHGNVES+A
Sbjct:    61 VITAANILEVYIVRAQEEGNTQELRNPKLAKRGGVMDGVYGVSLELVCHYRLHGNVESIA 120

Query:   118 ILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESF 177
             +L  GG ++S+ RDSIIL F DAKISVLEFDDSIH LR+TSMHCFE P+WLHLKRGRESF
Sbjct:   121 VLPMGGGNSSKGRDSIILTFRDAKISVLEFDDSIHSLRMTSMHCFEGPDWLHLKRGRESF 180

Query:   178 ARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVIN 237
              RGPLVKVDPQGRCGGVLVYGLQMIILK SQ GSGLVGD+D F SGG  SAR+ESS++IN
Sbjct:   181 PRGPLVKVDPQGRCGGVLVYGLQMIILKTSQVGSGLVGDDDAFSSGGTVSARVESSYIIN 240

Query:   238 LRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPL 297
             LRDL+MKHVKDF+F+HGYIEPV+VIL E E TWAGRVSWKHHTC++SALSI++TLKQHP+
Sbjct:   241 LRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHPV 300

Query:   298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPR 357
             IWSA+NLPHDAYKLLAVPSPIGGVLV+ ANTIHYHSQSASCALALNNYA S DSSQELP 
Sbjct:   301 IWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELPA 360

Query:   358 SSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITT 417
             S+FSVELDAAH TW+ NDVALLSTK+G+L+LLT++YDGR VQRLDLSK+  SVL SDIT+
Sbjct:   361 SNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDITS 420

Query:   418 IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD 477
             +GNSLFFLGSRLGDSLLVQF+C SG +    GL++E  DIE +    KRLR +S D  QD
Sbjct:   421 VGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRMTS-DTFQD 479

Query:   478 MVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQS 537
              +  EELSL+GS  NN++SAQK+FSFAVRDSLVN+GP+KDF+YGLRINADA+ATG+SKQS
Sbjct:   480 TIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539

Query:   538 NYELV 542
             NYELV
Sbjct:   540 NYELV 544


>ZFIN|ZDB-GENE-040709-2
            symbol:cpsf1 "cleavage and polyadenylation specific
            factor 1" species:7955 "Danio rerio" [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            [GO:0006378 "mRNA polyadenylation" evidence=IMP] [GO:0060216
            "definitive hemopoiesis" evidence=IMP] InterPro:IPR004871
            Pfam:PF03178 ZFIN:ZDB-GENE-040709-2 GO:GO:0005634 GO:GO:0006378
            GO:GO:0003676 GeneTree:ENSGT00550000075040 GO:GO:0060216
            EMBL:CU467825 IPI:IPI00932321 Ensembl:ENSDART00000110017
            ArrayExpress:F1QCJ8 Bgee:F1QCJ8 Uniprot:F1QCJ8
        Length = 1451

 Score = 648 (233.2 bits), Expect = 1.3e-105, Sum P(3) = 1.3e-105
 Identities = 169/478 (35%), Positives = 265/478 (55%)

Query:    57 NLVVTAANVIEIYVVRVQXXXXXXXXXXXXTKRRVLMDGISAAS-LELVCHYRLHGNVES 115
             NLVV  A   ++YV R+             +K     DG S    LE V  + L GNV S
Sbjct:    29 NLVV--AGTSQLYVYRI------IYDVESTSKSEKSSDGKSRKEKLEQVASFSLFGNVMS 80

Query:   116 LAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE 175
             +A +   G +    RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G  
Sbjct:    81 MASVQLVGTN----RDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFV 133

Query:   176 SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHV 235
                  P+V+VDP+ RC  +LVYG  +++L   +     + DE     G G  +    S++
Sbjct:   134 QNVHIPMVRVDPENRCAVMLVYGTCLVVLPFRKD---TLADEQEGIVGEGQKSSFLPSYI 190

Query:   236 INLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLK 293
             I++R+LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++   K
Sbjct:   191 IDVRELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQK 250

Query:   294 QHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSS 352
              HP+IWS  NLP D  +++AVP PIGGV+V   N++ Y +QS     ++LN+      + 
Sbjct:   251 VHPVIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLLYLNQSVPPFGVSLNSLTNGTTAF 310

Query:   353 QELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVL 411
                P+    + LD + A+++ +D  ++S K G++ +LT++ DG R V+     K   SVL
Sbjct:   311 PLRPQEEVKITLDCSQASFITSDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVL 370

Query:   412 TSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSS 471
             T+ + T+     FLGSRLG+SLL+++T     + +  G + E  + + + P+ K+ R  S
Sbjct:   371 TTCMMTMEPGYLFLGSRLGNSLLLRYTEKLQETPMEEGKENEEKEKQEEPPNKKK-RVDS 429

Query:   472 SDA-------LQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
             + A       L D +  +E+ +YGS A + T+ A  T+SF V DS++NIGP    S G
Sbjct:   430 NWAGCPGKGNLPDEL--DEIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCASASMG 483

 Score = 372 (136.0 bits), Expect = 1.3e-105, Sum P(3) = 1.3e-105
 Identities = 115/451 (25%), Positives = 205/451 (45%)

Query:   712 LYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGP-LDQGDIYSVVCYESGALEIFDVPN 770
             LY +  P     K  +    S     A  G + G    +   + ++  E+G +EI+ +P+
Sbjct:   751 LYGESNPLTSPNKEESSRG-SAAASSAHTGKESGSGRQEPSHWCLLVRENGVMEIYQLPD 809

Query:   771 FNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQ 830
             +  VF V  F  G+  +VD+    +   S T+     EE T QG   +I  +K  E+A+ 
Sbjct:   810 WRLVFLVKNFPVGQRVLVDS----SASQSATQGELKKEEVTRQG---DIPLVK--EVALV 860

Query:   831 RWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPXXXXXXXXXXXXXXXXXXXL 890
                 +HSRP+L A + +  +L Y+A+ ++  +  S                        +
Sbjct:   861 SLGYNHSRPYLLAHV-EQELLIYEAFPYDQQQAQSNLK---VRFKKMPHNINYREKKVKV 916

Query:   891 RFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQL 949
             R  + P +    +         R   F++ISG+ G F+ G  P W +V  R  +R+HP  
Sbjct:   917 RKDKKP-EGQGEDTLGVKGRVARFRYFQDISGYSGVFICGPSPHWMLVTSRGAMRLHPMT 975

Query:   950 CDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQIT 1009
              DG+I +F+  HN+NC  GF+Y   QG L+I  LP+  +YD  WPV+K IPL+ T H ++
Sbjct:   976 IDGAIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRK-IPLRCTVHYVS 1034

Query:  1010 YFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRIL 1069
             Y  E  +Y +  SV   +P  ++  +  +++    I+        +H     +++ ++++
Sbjct:  1035 YHVESKVYAVCTSVK--EPCTRIPRMTGEEKEFETIERDERY---IHPQQ--DKFSIQLI 1087

Query:  1070 EPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARG 1128
              P        TR  + ++  E+   ++ V L +  T    +  +A+GT  +QGE+V  RG
Sbjct:  1088 SPVSWEAIPNTR--VDLEEWEHVTCMKTVALKSQETVSGLKGYVALGTCLMQGEEVTCRG 1145

Query:  1129 RVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
             R+L+         P   +    +  L+   Q
Sbjct:  1146 RILILDVIEVVPEPGQPLTKNKFKVLYEKEQ 1176

 Score = 162 (62.1 bits), Expect = 1.3e-105, Sum P(3) = 1.3e-105
 Identities = 54/190 (28%), Positives = 90/190 (47%)

Query:   543 ELPGCKGIWTVYH-------KSSRGHNA---DSSRMAAYDDEY--HAYLIISLEARTMVL 590
             ELPGC  +WTV +        S+ G      +  R    +D+   H +LI+S E  TM+L
Sbjct:   530 ELPGCHDMWTVIYCEEKPEKPSAEGDGESPEEEKREPTIEDDKKKHGFLILSREDSTMIL 589

Query:   591 ETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXX 650
             +T   + E+  S  +  QG T+ AGN+   + +IQV   G R+L+G      L F P   
Sbjct:   590 QTGQEIMELDTS-GFATQGPTVYAGNIGDNKYIIQVSPMGIRLLEG---VNQLHFIPVDL 645

Query:   651 XXXXXXXXXTVLSVSIADPYVLLGMSDGSIRLLV--GDP---STCTVSVQTPAAIESSKK 705
                       ++  S+ADPYV++  ++G + + V   D     +  +++Q P  I +  +
Sbjct:   646 GS-------PIVHCSVADPYVVIMTAEGVVTMFVLKNDSYMGKSHRLALQKPQ-IHTQSR 697

Query:   706 PVSSCTLYHD 715
              ++ C  Y D
Sbjct:   698 VITLCA-YRD 706

 Score = 48 (22.0 bits), Expect = 3.1e-29, Sum P(2) = 3.1e-29
 Identities = 15/49 (30%), Positives = 26/49 (53%)

Query:   283 ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY 331
             I  L  S    Q P +++  N+  + Y ++ V SP+G  L+ G N +H+
Sbjct:   595 IMELDTSGFATQGPTVYAG-NIGDNKY-IIQV-SPMGIRLLEGVNQLHF 640

 Score = 39 (18.8 bits), Expect = 1.7e-58, Sum P(2) = 1.7e-58
 Identities = 9/30 (30%), Positives = 17/30 (56%)

Query:   515 LKDFSYGLRINADASATGISKQSNYELVEL 544
             +K+F  G R+  D+SA+  + Q   +  E+
Sbjct:   816 VKNFPVGQRVLVDSSASQSATQGELKKEEV 845


>UNIPROTKB|F1PC28
            symbol:CPSF1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0006379 "mRNA cleavage" evidence=IEA]
            [GO:0006378 "mRNA polyadenylation" evidence=IEA] [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=IEA] [GO:0003730 "mRNA 3'-UTR binding" evidence=IEA]
            InterPro:IPR004871 Pfam:PF03178 GO:GO:0006378 GO:GO:0003730
            GO:GO:0005847 GO:GO:0006379 GeneTree:ENSGT00550000075040
            OMA:NIGDNRY EMBL:AAEX03008966 ProteinModelPortal:F1PC28
            Ensembl:ENSCAFT00000002514 Uniprot:F1PC28
        Length = 1398

 Score = 640 (230.4 bits), Expect = 7.4e-104, Sum P(3) = 7.4e-104
 Identities = 158/431 (36%), Positives = 240/431 (55%)

Query:   101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
             LELV  +   GNV S+A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H
Sbjct:    23 LELVASFSFFGNVMSMASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLH 78

Query:   161 CFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTF 220
              FE PE   L+ G       P V+VDP GRC  +L+YG ++++L   +     + +E   
Sbjct:    79 YFEEPE---LRDGFVQNVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEG 132

Query:   221 GSGGGFSARIESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKH 278
               G G  +    S++I++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ + 
Sbjct:   133 LMGEGQRSSFLPSYIIDVRGLDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQ 192

Query:   279 HTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS- 337
              TC I A+S++ T K HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS   
Sbjct:   193 DTCSIVAISLNITQKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPP 252

Query:   338 CALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-R 396
               +ALN       +     +    + LD A A ++  D  ++S K G++ +LT++ DG R
Sbjct:   253 YGVALNGLTTGTTAFPLRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMR 312

Query:   397 VVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGD 456
              V+     K   SVLT+ + T+     FLGSRLG+SLL+++T        S+    E  D
Sbjct:   313 SVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAA--REAAD 370

Query:   457 IEADAPSTKRLRRSSS-----DALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLV 510
              E      KR+  ++         QD V  +E+ +YGS A + T+ A  T+SF V DS++
Sbjct:   371 KEEPPSKKKRVDCAAGWSGGKSVPQDEV--DEIEVYGSEAQSGTQLA--TYSFEVCDSIL 426

Query:   511 NIGPLKDFSYG 521
             NIGP  + + G
Sbjct:   427 NIGPCANAAMG 437

 Score = 349 (127.9 bits), Expect = 7.4e-104, Sum P(3) = 7.4e-104
 Identities = 110/407 (27%), Positives = 183/407 (44%)

Query:   759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
             E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T QG    
Sbjct:   745 ENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQAEARKEEATRQGELPL 800

Query:   819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPXXXXXXXX 878
             +  + +V L  ++     SRP+L  +  D  +L Y+A+    P + S+            
Sbjct:   801 VKEVLLVALGSRQ-----SRPYLL-VHVDQELLIYEAF----PHD-SQLGQGNLKVRFKK 849

Query:   879 XXXXXXXXXXXLRFSRTPLDAYTREETPHGAPCQ--RITIFKNISGHQGFFLSGSRPCWC 936
                         + S+   +    EE   GA  +  R   F++I G+ G F+ G  P W 
Sbjct:   850 VPHNINFREKKPKPSKKKAEGGGAEEGA-GARGRVARFRYFEDIYGYSGVFICGPSPHWL 908

Query:   937 MVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPV 995
             +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD  WPV
Sbjct:   909 LVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPV 968

Query:   996 QKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDL 1055
             +K IPL+ T H + Y  E  +Y +  S  +  P  +     I +  G + +   +   D 
Sbjct:   969 RK-IPLRCTAHYVAYHVESKVYAVATSTNM--PCTR-----IPRMTGEEKEFETIERDDR 1020

Query:  1056 HRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLL 1112
             +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    +  +
Sbjct:  1021 YIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYV 1076

Query:  1113 AIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
             A GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct:  1077 AAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 1123

 Score = 176 (67.0 bits), Expect = 7.4e-104, Sum P(3) = 7.4e-104
 Identities = 49/152 (32%), Positives = 79/152 (51%)

Query:   543 ELPGCKGIWTVY-------HKSSRGHNAD--SSRMAAYDD-EYHAYLIISLEARTMVLET 592
             ELPGC  +WTV         ++S+G  A+  SS + A DD   H +LI+S E  TM+L+T
Sbjct:   484 ELPGCYDMWTVIAPVRKEQEETSKGEVAEQESSALEAEDDGRRHGFLILSREDSTMILQT 543

Query:   593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXXXX 652
                + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P     
Sbjct:   544 GQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDLGS 599

Query:   653 XXXXXXXTVLSVSIADPYVLLGMSDGSIRLLV 684
                     ++  ++ADPYV++  ++G + + +
Sbjct:   600 -------PIVQCAVADPYVVIMSAEGHVTMFL 624

 Score = 49 (22.3 bits), Expect = 6.4e-27, Sum P(2) = 6.4e-27
 Identities = 21/74 (28%), Positives = 36/74 (48%)

Query:   283 ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY-----HSQSAS 337
             I  L  S    Q P +++  N+  + Y ++ V SP+G  L+ G N +H+      S    
Sbjct:   547 IMELDTSGFATQGPTVFAG-NIGDNRY-IVQV-SPLGIRLLEGVNQLHFIPVDLGSPIVQ 603

Query:   338 CALALNNYAVSLDS 351
             CA+A + Y V + +
Sbjct:   604 CAVA-DPYVVIMSA 616


>UNIPROTKB|Q10569
            symbol:CPSF1 "Cleavage and polyadenylation specificity
            factor subunit 1" species:9913 "Bos taurus" [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=ISS] [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006378
            "mRNA polyadenylation" evidence=IEA] [GO:0003730 "mRNA 3'-UTR
            binding" evidence=IEA] InterPro:IPR004871 Pfam:PF03178
            GO:GO:0006378 GO:GO:0003730 eggNOG:COG5161 KO:K14401 GO:GO:0005847
            GO:GO:0006379 GeneTree:ENSGT00550000075040 OMA:NIGDNRY EMBL:X83097
            IPI:IPI00713487 PIR:S57335 RefSeq:NP_777145.1 UniGene:Bt.4911
            STRING:Q10569 PRIDE:Q10569 Ensembl:ENSBTAT00000011004 GeneID:282703
            KEGG:bta:282703 CTD:29894 HOGENOM:HOG000007904 HOVERGEN:HBG051105
            InParanoid:Q10569 OrthoDB:EOG4BCDM3 NextBio:20806363
            ArrayExpress:Q10569 Uniprot:Q10569
        Length = 1444

 Score = 651 (234.2 bits), Expect = 2.0e-103, Sum P(4) = 2.0e-103
 Identities = 167/475 (35%), Positives = 255/475 (53%)

Query:    57 NLVVTAANVIEIYVVRVQXXXXXXXXXXXXTKRRVLMDGISAASLELVCHYRLHGNVESL 116
             NLVV  A   ++YV R+             T  +   +      LELV  +   GNV S+
Sbjct:    29 NLVV--AGTSQLYVYRLNRDSEAPTKNDRSTDGKAHRE--HREKLELVASFSFFGNVMSM 84

Query:   117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct:    85 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 137

Query:   177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
                 P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct:   138 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLVGEGQRSSFLPSYII 194

Query:   237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
             ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct:   195 DVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 254

Query:   295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQ 353
             HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct:   255 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 314

Query:   354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
                +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct:   315 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 374

Query:   413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
             + + T+     FLGSRLG+SLL+++T        S+    E  D E      KR+  +  
Sbjct:   375 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTA--REAADKEEPPSKKKRVDATTG 432

Query:   471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
                S    QD V  +E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G
Sbjct:   433 WSGSKSVPQDEV--DEIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAMG 483

 Score = 346 (126.9 bits), Expect = 2.0e-103, Sum P(4) = 2.0e-103
 Identities = 108/406 (26%), Positives = 177/406 (43%)

Query:   759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
             E+GA+EI+ +P++  VF V  F  G+  +VD+   +     E       EE T QG    
Sbjct:   791 ENGAMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEAR----KEEATRQGELPL 846

Query:   819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPXXXXXXXX 878
             +  + +V L  ++      RP+L  +  D  +L Y+A+    P ++              
Sbjct:   847 VKEVLLVALGSRQ-----RRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLKVRFKKV 896

Query:   879 XXXXXXXXXXXLRFSRTPLDAYTREET-PHGAPCQRITIFKNISGHQGFFLSGSRPCWCM 937
                            +      T E T P G    R   F++I G+ G F+ G  P W +
Sbjct:   897 PHNINFREKKPKPSKKKAEGGSTEEGTGPRGRVA-RFRYFEDIYGYSGVFICGPSPHWLL 955

Query:   938 VF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQ 996
             V  R  LR+HP   DG I +F   HN+NC  GF+Y   QG L+I  LP+  +YD  WPV+
Sbjct:   956 VTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVR 1015

Query:   997 KVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLH 1056
             K IPL+ T H + Y  E  +Y +  S     P  +V  +  +++    I+        +H
Sbjct:  1016 K-IPLRCTAHYVAYHVESKVYAVATSTST--PCTRVPRMTGEEKEFETIERDERY---VH 1069

Query:  1057 RTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLA 1113
                  E + ++++ P      W+    A I ++  E+   ++ V+L +  T    +  +A
Sbjct:  1070 PQQ--EAFCIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVA 1123

Query:  1114 IGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
              GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct:  1124 AGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 1169

 Score = 160 (61.4 bits), Expect = 2.0e-103, Sum P(4) = 2.0e-103
 Identities = 63/223 (28%), Positives = 103/223 (46%)

Query:   543 ELPGCKGIWTVY-------HKSSRGHNADSSRMA--AYDD-EYHAYLIISLEARTMVLET 592
             ELPGC  +WTV         ++ +G   +    A  A DD   H +LI+S E  TM+L+T
Sbjct:   530 ELPGCYDMWTVIAPVRKEQEETLKGEGTEPEPGAPEAEDDGRRHGFLILSREDSTMILQT 589

Query:   593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXXXX 652
                + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P     
Sbjct:   590 GQEIMELDAS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDLGS 645

Query:   653 XXXXXXXTVLSVSIADPYVLLGMSDGSIR--LLVGDP---STCTVSVQTPAAIESSKKPV 707
                     ++  ++ADPYV++  ++G +   LL  D        +++  P  +    K +
Sbjct:   646 -------PIVQCAVADPYVVIMSAEGHVTMFLLKNDSYGGRHHRLALHKPP-LHHQSKVI 697

Query:   708 SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQG 750
             + C +Y D          +T++ L  GV + + G  GGP  +G
Sbjct:   698 TLC-VYRDVSG-----MFTTESRLG-GVRDELGGR-GGPEAEG 732

 Score = 49 (22.3 bits), Expect = 2.2e-26, Sum P(3) = 2.2e-26
 Identities = 21/74 (28%), Positives = 36/74 (48%)

Query:   283 ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY-----HSQSAS 337
             I  L  S    Q P +++  N+  + Y ++ V SP+G  L+ G N +H+      S    
Sbjct:   593 IMELDASGFATQGPTVFAG-NIGDNRY-IVQV-SPLGIRLLEGVNQLHFIPVDLGSPIVQ 649

Query:   338 CALALNNYAVSLDS 351
             CA+A + Y V + +
Sbjct:   650 CAVA-DPYVVIMSA 662

 Score = 48 (22.0 bits), Expect = 2.0e-103, Sum P(4) = 2.0e-103
 Identities = 16/52 (30%), Positives = 25/52 (48%)

Query:     3 FAAYKMMHWPTGIA-NCGSGFITHSRADYV----PQIPLIQTEELDSELPSK 49
             +A YK  H PTG+  +    F  +S  + V     Q+ + +    DSE P+K
Sbjct:     2 YAVYKQAHPPTGLEFSMYCNFFNNSERNLVVAGTSQLYVYRLNR-DSEAPTK 52


>RGD|1306406
            symbol:Cpsf1 "cleavage and polyadenylation specific factor 1,
            160kDa" species:10116 "Rattus norvegicus" [GO:0003730 "mRNA 3'-UTR
            binding" evidence=IEA;ISO] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA;ISO]
            [GO:0006378 "mRNA polyadenylation" evidence=IEA;ISO] [GO:0006379
            "mRNA cleavage" evidence=IEA;ISO] InterPro:IPR004871 Pfam:PF03178
            RGD:1306406 GO:GO:0005634 GO:GO:0003676 EMBL:CH473950 KO:K14401
            GeneTree:ENSGT00550000075040 CTD:29894 IPI:IPI00949657
            RefSeq:NP_001124043.1 UniGene:Rn.40455 Ensembl:ENSRNOT00000066244
            GeneID:366952 KEGG:rno:366952 UCSC:RGD:1306406 NextBio:690318
            Uniprot:D4A0H5
        Length = 1386

 Score = 652 (234.6 bits), Expect = 7.0e-103, Sum P(4) = 7.0e-103
 Identities = 167/473 (35%), Positives = 256/473 (54%)

Query:    57 NLVVTAANVIEIYVVRVQXXXXXXXXXXXXTKRRVLMDGISAASLELVCHYRLHGNVESL 116
             NLVV  A   ++YV R+             T+ +   +      LELV  +   GNV S+
Sbjct:    29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query:   117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct:    82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query:   177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
                 P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct:   135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query:   237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
             ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct:   192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query:   295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQ 353
             HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct:   252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query:   354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
                +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct:   312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query:   413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
             + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  +  
Sbjct:   372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVDPTVG 429

Query:   471 -SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
              +    QD V  +E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G
Sbjct:   430 WTGGKTQDEV--DEIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVG 478

 Score = 346 (126.9 bits), Expect = 7.0e-103, Sum P(4) = 7.0e-103
 Identities = 104/380 (27%), Positives = 169/380 (44%)

Query:   759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
             E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T QG    
Sbjct:   784 ENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATRQGELPL 839

Query:   819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPXXXXXXXX 878
             +  + +V L  ++     SRP+L  +  D  +L Y+A+    P ++              
Sbjct:   840 VKEVLLVALGSRQ-----SRPYLL-VHVDQELLIYEAF----PHDSQLGQGNLKVRFKKV 889

Query:   879 XXXXXXXXXXXLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV 938
                            +      T E +       R   F++I G+ G F+ G  P W +V
Sbjct:   890 PHNINFREKKPKPSKKKAEGCSTEEGSGVRGRVARFRYFEDIYGYSGVFICGPSPHWLLV 949

Query:   939 F-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQK 997
               R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD  WPV+K
Sbjct:   950 TGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRK 1009

Query:   998 VIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHR 1057
              IPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   +   D + 
Sbjct:  1010 -IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIERDDRYI 1061

Query:  1058 TYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAI 1114
                 E + ++++ P      W+    A I ++  E+   ++ V+L +  T    +  +A 
Sbjct:  1062 HPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAA 1117

Query:  1115 GTAYVQGEDVAARGRVLLFS 1134
             GT  +QGE+V  RGR+ L+S
Sbjct:  1118 GTCLMQGEEVTCRGRIFLWS 1137

 Score = 158 (60.7 bits), Expect = 7.0e-103, Sum P(4) = 7.0e-103
 Identities = 45/152 (29%), Positives = 72/152 (47%)

Query:   543 ELPGCKGIWTVYH----------KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLET 592
             ELPGC  +WTV            K+       S+  A  D   H +LI+S E  TM+L+T
Sbjct:   523 ELPGCYDMWTVIAPVRKEEEETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQT 582

Query:   593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXXXX 652
                + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P     
Sbjct:   583 GQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDLGA 638

Query:   653 XXXXXXXTVLSVSIADPYVLLGMSDGSIRLLV 684
                     ++  ++ADPYV++  ++G + + +
Sbjct:   639 -------PIVQCAVADPYVVIMSAEGHVTMFL 663

 Score = 47 (21.6 bits), Expect = 1.2e-25, Sum P(3) = 1.2e-25
 Identities = 15/49 (30%), Positives = 26/49 (53%)

Query:   283 ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY 331
             I  L  S    Q P +++  N+  + Y ++ V SP+G  L+ G N +H+
Sbjct:   586 IMELDTSGFATQGPTVFAG-NIGDNRY-IVQV-SPLGIRLLEGVNQLHF 631

 Score = 42 (19.8 bits), Expect = 7.0e-103, Sum P(4) = 7.0e-103
 Identities = 7/13 (53%), Positives = 9/13 (69%)

Query:     3 FAAYKMMHWPTGI 15
             +A YK  H PTG+
Sbjct:     2 YAVYKQAHPPTGL 14


>MGI|MGI:2679722
            symbol:Cpsf1 "cleavage and polyadenylation specific factor
            1" species:10090 "Mus musculus" [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0003723 "RNA binding" evidence=IEA] [GO:0003730
            "mRNA 3'-UTR binding" evidence=ISO] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=ISO] [GO:0006378 "mRNA
            polyadenylation" evidence=ISO] [GO:0006379 "mRNA cleavage"
            evidence=ISO] [GO:0006397 "mRNA processing" evidence=IEA]
            InterPro:IPR004871 Pfam:PF03178 MGI:MGI:2679722 GO:GO:0006378
            GO:GO:0003730 eggNOG:COG5161 KO:K14401 GO:GO:0005847 GO:GO:0006379
            GeneTree:ENSGT00550000075040 OMA:NIGDNRY CTD:29894
            HOGENOM:HOG000007904 HOVERGEN:HBG051105 OrthoDB:EOG4BCDM3
            EMBL:AF322193 EMBL:BC056388 IPI:IPI00110363 RefSeq:NP_001157645.1
            RefSeq:NP_444423.1 UniGene:Mm.45141 ProteinModelPortal:Q9EPU4
            STRING:Q9EPU4 PhosphoSite:Q9EPU4 PaxDb:Q9EPU4 PRIDE:Q9EPU4
            Ensembl:ENSMUST00000071898 GeneID:94230 KEGG:mmu:94230
            UCSC:uc007wky.2 InParanoid:Q9EPU4 NextBio:352239 Bgee:Q9EPU4
            CleanEx:MM_CPSF1 Genevestigator:Q9EPU4
            GermOnline:ENSMUSG00000034022 Uniprot:Q9EPU4
        Length = 1441

 Score = 648 (233.2 bits), Expect = 7.3e-103, Sum P(4) = 7.3e-103
 Identities = 167/475 (35%), Positives = 255/475 (53%)

Query:    57 NLVVTAANVIEIYVVRVQXXXXXXXXXXXXTKRRVLMDGISAASLELVCHYRLHGNVESL 116
             NLVV  A   ++YV R+             T+ +   +      LELV  +   GNV S+
Sbjct:    29 NLVV--AGTSQLYVYRLNRDAEALTKNDGSTEGKAHRE-----KLELVASFSFFGNVMSM 81

Query:   117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct:    82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query:   177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
                 P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct:   135 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLMGEGQRSSFLPSYII 191

Query:   237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
             ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct:   192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query:   295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQ 353
             HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct:   252 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query:   354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
                +    + LD A A ++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct:   312 LRTQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query:   413 SDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS-- 470
             + + T+     FLGSRLG+SLL+++T        SS    E  D E      KR+  +  
Sbjct:   372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASS--VREAADKEEPPSKKKRVEPAVG 429

Query:   471 ---SSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
                     QD V  +E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G
Sbjct:   430 WTGGKTVPQDEV--DEIEVYGSEAQSGTQLA--TYSFEVCDSMLNIGPCANAAVG 480

 Score = 352 (129.0 bits), Expect = 7.3e-103, Sum P(4) = 7.3e-103
 Identities = 109/406 (26%), Positives = 179/406 (44%)

Query:   759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
             E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T QG    
Sbjct:   788 ENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEVR----KEEATRQGELPL 843

Query:   819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPXXXXXXXX 878
             +  + +V L  ++     SRP+L  +  D  +L Y+A+    P + S+            
Sbjct:   844 VKEVLLVALGSRQ-----SRPYLL-VHVDQELLIYEAF----PHD-SQLGQGNLKVRFKK 892

Query:   879 XXXXXXXXXXXLRFSRTPLDAYTREETPHG-APCQRITIFKNISGHQGFFLSGSRPCWCM 937
                         + S+   +  + EE   G     R   F++I G+ G F+ G  P W +
Sbjct:   893 VPHNINFREKKPKPSKKKAEGCSTEEGSGGRGRVARFRYFEDIYGYSGVFICGPSPHWLL 952

Query:   938 VF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQ 996
             V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD  WPV+
Sbjct:   953 VTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVR 1012

Query:   997 KVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLH 1056
             K IPL+ T H + Y  E  +Y +  S     P  +     I +  G + +   +   D +
Sbjct:  1013 K-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR-----IPRMTGEEKEFEAIERDDRY 1064

Query:  1057 RTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLLA 1113
                  E + ++++ P      W+    A I ++  E+   ++ V+L +  T    +  +A
Sbjct:  1065 IHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVA 1120

Query:  1114 IGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
              GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct:  1121 AGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 1166

 Score = 158 (60.7 bits), Expect = 7.3e-103, Sum P(4) = 7.3e-103
 Identities = 45/152 (29%), Positives = 72/152 (47%)

Query:   543 ELPGCKGIWTVYH----------KSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLET 592
             ELPGC  +WTV            K+       S+  A  D   H +LI+S E  TM+L+T
Sbjct:   527 ELPGCYDMWTVIAPVRKEEEETPKAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQT 586

Query:   593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXXXX 652
                + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P     
Sbjct:   587 GQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDLGA 642

Query:   653 XXXXXXXTVLSVSIADPYVLLGMSDGSIRLLV 684
                     ++  ++ADPYV++  ++G + + +
Sbjct:   643 -------PIVQCAVADPYVVIMSAEGHVTMFL 667

 Score = 47 (21.6 bits), Expect = 3.3e-26, Sum P(3) = 3.3e-26
 Identities = 15/49 (30%), Positives = 26/49 (53%)

Query:   283 ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY 331
             I  L  S    Q P +++  N+  + Y ++ V SP+G  L+ G N +H+
Sbjct:   590 IMELDTSGFATQGPTVFAG-NIGDNRY-IVQV-SPLGIRLLEGVNQLHF 635

 Score = 42 (19.8 bits), Expect = 7.3e-103, Sum P(4) = 7.3e-103
 Identities = 7/13 (53%), Positives = 9/13 (69%)

Query:     3 FAAYKMMHWPTGI 15
             +A YK  H PTG+
Sbjct:     2 YAVYKQAHPPTGL 14


>UNIPROTKB|Q10570
            symbol:CPSF1 "Cleavage and polyadenylation specificity
            factor subunit 1" species:9606 "Homo sapiens" [GO:0003730 "mRNA
            3'-UTR binding" evidence=IDA] [GO:0006379 "mRNA cleavage"
            evidence=IDA] [GO:0006378 "mRNA polyadenylation" evidence=IDA]
            [GO:0005847 "mRNA cleavage and polyadenylation specificity factor
            complex" evidence=IDA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0000398 "mRNA splicing, via spliceosome" evidence=TAS]
            [GO:0005654 "nucleoplasm" evidence=TAS] [GO:0006366 "transcription
            from RNA polymerase II promoter" evidence=TAS] [GO:0006369
            "termination of RNA polymerase II transcription" evidence=TAS]
            [GO:0006397 "mRNA processing" evidence=TAS] [GO:0006406 "mRNA
            export from nucleus" evidence=TAS] [GO:0008380 "RNA splicing"
            evidence=TAS] [GO:0010467 "gene expression" evidence=TAS]
            [GO:0031124 "mRNA 3'-end processing" evidence=TAS]
            Reactome:REACT_71 InterPro:IPR004871 Pfam:PF03178
            Reactome:REACT_1675 GO:GO:0006378 GO:GO:0003730 GO:GO:0006406
            GO:GO:0000398 Reactome:REACT_1788 eggNOG:COG5161 KO:K14401
            GO:GO:0005847 GO:GO:0006379 GO:GO:0006369 Reactome:REACT_78
            OMA:NIGDNRY CTD:29894 HOGENOM:HOG000007904 HOVERGEN:HBG051105
            OrthoDB:EOG4BCDM3 EMBL:U37012 EMBL:BC017232 IPI:IPI00026219
            RefSeq:NP_037423.2 UniGene:Hs.493202 ProteinModelPortal:Q10570
            DIP:DIP-32694N IntAct:Q10570 MINT:MINT-1601544 STRING:Q10570
            PhosphoSite:Q10570 DMDM:23503048 PaxDb:Q10570 PeptideAtlas:Q10570
            PRIDE:Q10570 DNASU:29894 Ensembl:ENST00000349769
            Ensembl:ENST00000568627 GeneID:29894 KEGG:hsa:29894 UCSC:uc003zcj.3
            GeneCards:GC08M145618 HGNC:HGNC:2324 MIM:606027 neXtProt:NX_Q10570
            PharmGKB:PA26841 InParanoid:Q10570 PhylomeDB:Q10570 ChiTaRS:CPSF1
            GenomeRNAi:29894 NextBio:52452 ArrayExpress:Q10570 Bgee:Q10570
            CleanEx:HS_CPSF1 Genevestigator:Q10570 GermOnline:ENSG00000071894
            Uniprot:Q10570
        Length = 1443

 Score = 651 (234.2 bits), Expect = 2.9e-102, Sum P(4) = 2.9e-102
 Identities = 166/474 (35%), Positives = 257/474 (54%)

Query:    57 NLVVTAANVIEIYVVRVQXXXXXXXXXXXXTKRRVLMDGISAASLELVCHYRLHGNVESL 116
             NLVV  A   ++YV R+             T+ +   +      LEL   +   GNV S+
Sbjct:    29 NLVV--AGTSQLYVYRLNRDAEALTKNDRSTEGKAHRE-----KLELAASFSFFGNVMSM 81

Query:   117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct:    82 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 134

Query:   177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
                 P V+VDP GRC  +LVYG ++++L   +     + +E     G G  +    S++I
Sbjct:   135 NVHTPRVRVDPDGRCAAMLVYGTRLVVLPFRRES---LAEEHEGLVGEGQRSSFLPSYII 191

Query:   237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
             ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct:   192 DVRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 251

Query:   295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQ 353
             HP+IWS  +LP D  + LAVP PIGGV+V   N++ Y +QS     +ALN+      +  
Sbjct:   252 HPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 311

Query:   354 ELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLT 412
                +    + LD A AT++  D  ++S K G++ +LT++ DG R V+     K   SVLT
Sbjct:   312 LRTQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLT 371

Query:   413 SDITTIGNSLFFLGSRLGDSLLVQFTCG----SGTSMLSSGLKEEFGDIEADAPSTKRLR 468
             + + T+     FLGSRLG+SLL+++T        +++  +  KEE    +    +T    
Sbjct:   372 TSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDATAGWS 431

Query:   469 RSSSDALQDMVNGEELSLYGS-ASNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
              +     QD V  +E+ +YGS A + T+ A  T+SF V DS++NIGP  + + G
Sbjct:   432 AAGKSVPQDEV--DEIEVYGSEAQSGTQLA--TYSFEVCDSILNIGPCANAAVG 481

 Score = 343 (125.8 bits), Expect = 2.9e-102, Sum P(4) = 2.9e-102
 Identities = 109/407 (26%), Positives = 181/407 (44%)

Query:   759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
             E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T QG    
Sbjct:   790 ENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEAR----REEATRQGELPL 845

Query:   819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPXXXXXXXX 878
             +  + +V L  ++     SRP+L  +  D  +L Y+A+    P + S+            
Sbjct:   846 VKEVLLVALGSRQ-----SRPYLL-VHVDQELLIYEAF----PHD-SQLGQGNLKVRFKK 894

Query:   879 XXXXXXXXXXXLRFSRTPLDAYTREETPHGAPCQ--RITIFKNISGHQGFFLSGSRPCWC 936
                         + S+   +    EE   GA  +  R   F++I G+ G F+ G  P W 
Sbjct:   895 VPHNINFREKKPKPSKKKAEGGGAEEGA-GARGRVARFRYFEDIYGYSGVFICGPSPHWL 953

Query:   937 MVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPV 995
             +V  R  LR+HP   DG + +F   HNVNC  GF+Y   QG L+I  LP+  +YD  WPV
Sbjct:   954 LVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPV 1013

Query:   996 QKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDL 1055
             +K IPL+ T H + Y  E  +Y +  S     P  ++  +  +++    I+        +
Sbjct:  1014 RK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCARIPRMTGEEKEFETIERDERY---I 1067

Query:  1056 HRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLL 1112
             H     E + ++++ P      W+    A I +Q  E+   ++ V+L +  T    +  +
Sbjct:  1068 HPQQ--EAFSIQLISPVS----WEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYV 1121

Query:  1113 AIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
             A GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct:  1122 AAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 1168

 Score = 158 (60.7 bits), Expect = 2.9e-102, Sum P(4) = 2.9e-102
 Identities = 46/153 (30%), Positives = 75/153 (49%)

Query:   543 ELPGCKGIWTVY-------HKSSRGHNAD---SSRMAAYDD-EYHAYLIISLEARTMVLE 591
             ELPGC  +WTV          + +G   +   S+   A DD   H +LI+S E  TM+L+
Sbjct:   528 ELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQ 587

Query:   592 TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXXX 651
             T   + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P    
Sbjct:   588 TGQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDLG 643

Query:   652 XXXXXXXXTVLSVSIADPYVLLGMSDGSIRLLV 684
                      ++  ++ADPYV++  ++G + + +
Sbjct:   644 A-------PIVQCAVADPYVVIMSAEGHVTMFL 669

 Score = 47 (21.6 bits), Expect = 3.1e-25, Sum P(3) = 3.1e-25
 Identities = 15/49 (30%), Positives = 26/49 (53%)

Query:   283 ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY 331
             I  L  S    Q P +++  N+  + Y ++ V SP+G  L+ G N +H+
Sbjct:   592 IMELDTSGFATQGPTVFAG-NIGDNRY-IVQV-SPLGIRLLEGVNQLHF 637

 Score = 42 (19.8 bits), Expect = 2.9e-102, Sum P(4) = 2.9e-102
 Identities = 7/13 (53%), Positives = 9/13 (69%)

Query:     3 FAAYKMMHWPTGI 15
             +A YK  H PTG+
Sbjct:     2 YAVYKQAHPPTGL 14


>FB|FBgn0024698
            symbol:Cpsf160 "Cleavage and polyadenylation specificity
            factor 160" species:7227 "Drosophila melanogaster" [GO:0005847
            "mRNA cleavage and polyadenylation specificity factor complex"
            evidence=ISS;NAS] [GO:0006378 "mRNA polyadenylation"
            evidence=ISS;NAS] [GO:0006379 "mRNA cleavage" evidence=ISS;NAS]
            [GO:0003730 "mRNA 3'-UTR binding" evidence=ISS] [GO:0003729 "mRNA
            binding" evidence=NAS] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0022008 "neurogenesis" evidence=IMP] InterPro:IPR004871
            Pfam:PF03178 EMBL:AE013599 GO:GO:0022008 GO:GO:0006378
            GO:GO:0003723 eggNOG:COG5161 KO:K14401 GO:GO:0005847 GO:GO:0006379
            GeneTree:ENSGT00550000075040 OMA:NIGDNRY EMBL:AF241364
            EMBL:AF241365 EMBL:AF241366 EMBL:AY051896 RefSeq:NP_725397.1
            RefSeq:NP_995833.1 UniGene:Dm.3414 ProteinModelPortal:Q9V726
            STRING:Q9V726 PaxDb:Q9V726 PRIDE:Q9V726 EnsemblMetazoa:FBtr0089258
            GeneID:44250 KEGG:dme:Dmel_CG10110 CTD:44250 FlyBase:FBgn0024698
            InParanoid:Q9V726 OrthoDB:EOG4ZCRK8 PhylomeDB:Q9V726
            GenomeRNAi:44250 NextBio:837008 Bgee:Q9V726 GermOnline:CG10110
            Uniprot:Q9V726
        Length = 1455

 Score = 566 (204.3 bits), Expect = 1.6e-86, Sum P(4) = 1.6e-86
 Identities = 158/509 (31%), Positives = 260/509 (51%)

Query:    57 NLVVTAANVIEIYVVRVQXXXXXXXXXXXXTKRRVLMDGISAASLELVCHYRLHGNVESL 116
             NLVV  ANV+++Y +                 R           LE +  Y L+GNV SL
Sbjct:    29 NLVVAGANVLKVYRIAPNVEASQRQKLNPSEMRLA-----PKMRLECLATYTLYGNVMSL 83

Query:   117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
               +S  GA     RD+++++F+DAK+SVL+ D     L+  S+H FE  +      GR  
Sbjct:    84 QCVSLAGA----MRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDDIRGGWTGRY- 138

Query:   177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS----GLVGDEDTFGSGGGFSAR--I 230
             F   P V+VDP  RC  +LVYG ++++L   +  S     L   +    +     +R  I
Sbjct:   139 FV--PTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTPI 196

Query:   231 ESSHVINLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
              +S++I LRDLD K  +V D  F+HGY EP ++IL+E   T  GR+  +  TC++ A+S+
Sbjct:   197 MASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAISL 256

Query:   289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAV 347
             +   + HP+IW+  +LP D  ++  +  PIGG LV+  N + Y +QS     ++LN+ A 
Sbjct:   257 NIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNSSAD 316

Query:   348 SLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDG-RVVQRLDLSKT 406
             +  +    P+    + LD A+  ++  D  ++S +TGDL +LT+  D  R V+     K 
Sbjct:   317 NSTAFPLKPQDGVRISLDCANFAFIDVDKLVISLRTGDLYVLTLCVDSMRTVRNFHFHKA 376

Query:   407 NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLS------------SGLKEEF 454
               SVLTS I  + +   FLGSRLG+SLL+ FT    +++++              L++E 
Sbjct:   377 AASVLTSCICVLHSEYIFLGSRLGNSLLLHFTEEDQSTVITLDEVEQQSEQQQRNLQDED 436

Query:   455 GDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP 514
              ++E +     +L  + + A    +  EEL +YGS +  +    + F F V DSL+N+ P
Sbjct:   437 QNLE-EIFDVDQLEMAPTQAKSRRIEDEELEVYGSGAKASVLQLRKFIFEVCDSLMNVAP 495

Query:   515 LKDFSYGLRINADASATGISKQSNYELVE 543
             +     G R+  +    G++ + + E ++
Sbjct:   496 INYMCAGERVEFEED--GVTLRPHAESLQ 522

 Score = 253 (94.1 bits), Expect = 1.6e-86, Sum P(4) = 1.6e-86
 Identities = 66/228 (28%), Positives = 113/228 (49%)

Query:   912 QRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFI 970
             Q++  F N+ G  G  + G  PC+  + FR  LR+H  L +G + +F   +NVN  +GF+
Sbjct:   942 QKLRPFANVGGLSGVMVCGVNPCFVFLTFRGELRIHRLLGNGDVRSFAAFNNVNIPNGFL 1001

Query:   971 YVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLN 1030
             Y  +   LKI  LPS  +YD+ WPV+KV PL+ TP Q+ Y  E  +Y LI      +P+ 
Sbjct:  1002 YFDTTYELKISVLPSYLSYDSVWPVRKV-PLRCTPRQLVYHRENRVYCLITQTE--EPMT 1058

Query:  1031 QVLSLL-IDQEVGHQIDNHNLSSVDLHRTYTV-EEYEVRILEPDRAGGPWQT--RATIPM 1086
             +       D+E+  +              Y +  ++E+ ++ P+     W+    A+I  
Sbjct:  1059 KYYRFNGEDKELSEESRGERF-------IYPIGSQFEMVLISPET----WEIVPDASITF 1107

Query:  1087 QSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLF 1133
             +  E+    ++V L    T+   +  L IGT +   ED+ +RG + ++
Sbjct:  1108 EPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIHIY 1155

 Score = 159 (61.0 bits), Expect = 1.6e-86, Sum P(4) = 1.6e-86
 Identities = 58/198 (29%), Positives = 98/198 (49%)

Query:   543 ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTES 602
             EL GC  +WTV+         D+++ ++ +D+ H ++++S    T+VL+T   + E+ E+
Sbjct:   556 ELDGCLDVWTVFD--------DATKKSSRNDQ-HDFMLLSQRNSTLVLQTGQEINEI-EN 605

Query:   603 VDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXXXXXXXXXXXTVL 662
               + V   TI  GNL  +R ++QV  R  R+L G+ + Q++                 V+
Sbjct:   606 TGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGTRLIQNVPID----------VGSPVV 655

Query:   663 SVSIADPYVLLGMSDGSIRLLV-----GDPSTC----TVSVQTPAAIE-SSKKPVSSCTL 712
              VSIADPYV L + +G +  L      G P       T+S  +PA +  S+ K +S   L
Sbjct:   656 QVSIADPYVCLRVLNGQVITLALRETRGTPRLAINKHTIS-SSPAVVAISAYKDLSG--L 712

Query:   713 YHDKGPEPWLRKTSTDAW 730
             +  KG +  L  +S  A+
Sbjct:   713 FTVKGDDINLTGSSNSAF 730

 Score = 75 (31.5 bits), Expect = 1.6e-86, Sum P(4) = 1.6e-86
 Identities = 28/105 (26%), Positives = 50/105 (47%)

Query:   755 VVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQG 814
             VV  +SG LEI+ +P+   V+ V+   +G   + D    E +  S T    +S+ G  Q 
Sbjct:   795 VVARQSGTLEIYSMPDMKLVYLVNDVGNGSMVLTDAM--EFVPISLTT-QENSKAGIVQA 851

Query:   815 -RKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLF 858
                ++ +S   +EL++     +  RP L  + T   +L YQ + +
Sbjct:   852 CMPQHANSPLPLELSVIGLGLNGERPLLL-VRTRVELLIYQVFRY 895

 Score = 37 (18.1 bits), Expect = 5.7e-18, Sum P(3) = 5.7e-18
 Identities = 9/18 (50%), Positives = 11/18 (61%)

Query:   373 QNDVALLSTKTGDLVLLT 390
             Q+D  LLS +   LVL T
Sbjct:   579 QHDFMLLSQRNSTLVLQT 596


>DICTYBASE|DDB_G0281585
            symbol:cpsf1 "cleavage and polyadenylation
            specificity factor 160 kDa subunit" species:44689 "Dictyostelium
            discoideum" [GO:0005634 "nucleus" evidence=IEA] [GO:0003676
            "nucleic acid binding" evidence=IEA] [GO:0006379 "mRNA cleavage"
            evidence=ISS] [GO:0006378 "mRNA polyadenylation" evidence=ISS]
            [GO:0005847 "mRNA cleavage and polyadenylation specificity factor
            complex" evidence=ISS] [GO:0003730 "mRNA 3'-UTR binding"
            evidence=ISS] InterPro:IPR004871 Pfam:PF03178
            dictyBase:DDB_G0281585 GenomeReviews:CM000152_GR GO:GO:0006378
            EMBL:AAFI02000042 GO:GO:0003730 eggNOG:COG5161 KO:K14401
            GO:GO:0005847 GO:GO:0006379 RefSeq:XP_640515.1
            EnsemblProtists:DDB0233702 GeneID:8623125 KEGG:ddi:DDB_G0281585
            InParanoid:Q54TS6 OMA:TSATIQD Uniprot:Q54TS6
        Length = 1628

 Score = 413 (150.4 bits), Expect = 4.8e-73, Sum P(6) = 4.8e-73
 Identities = 99/283 (34%), Positives = 160/283 (56%)

Query:   239 RDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLI 298
             +++++++VKDF F+HGY EP ++ LHE   TW  R++ K  TC ++A+S++   K    I
Sbjct:   281 KNIEIENVKDFCFLHGYYEPTILFLHEPIQTWTSRIAVKKFTCQMTAISLNLLTKAGSFI 340

Query:   299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
             W+  N P++   L++VP P+GG LV+ AN + Y +Q++   LA+N YA S+D+S  +   
Sbjct:   341 WNVSNFPYNCEMLVSVPEPLGGALVITANIMFYVNQTSRYGLAVNEYA-SIDTSTIIGSQ 399

Query:   359 SFSVE----------LDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP 408
              F             LD ++  +L++D  + S K G+L++  ++ DGR VQR+ +SK   
Sbjct:   400 PFDFPIDDTLNLVFTLDRSNFVFLESDKFIGSLKGGELLIFHLISDGRSVQRIHVSKAGG 459

Query:   409 SVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR 468
             SVLTS I  + N+L FLGSRLGDSLL+Q+T     S+    L+ E        P  K+  
Sbjct:   460 SVLTSCICVLSNNLIFLGSRLGDSLLLQYT---EKSITDDQLEHE----NFSNPYKKQKT 512

Query:   469 RSSSDALQDMVNGEELSLYGSASNNT-ESAQKTFSFAVRDSLV 510
                 D   +  N E  +   S +NN  E+ +K+ S ++   L+
Sbjct:   513 SEVFDLFDE--NSETNNNNNSNNNNNKENQEKSSSSSIASKLL 553

 Score = 210 (79.0 bits), Expect = 4.8e-73, Sum P(6) = 4.8e-73
 Identities = 60/173 (34%), Positives = 88/173 (50%)

Query:    57 NLVVTAANVIEIYVVRVQXXXXXXXXXXXXTKRRVLMD----GISAA-------SLELVC 105
             NLV+   NV++IY +R +             +++   +     I+         SLEL+ 
Sbjct:    32 NLVLAKTNVLQIYKIRYEKIEKYENVSDSQPQQQQEQEQQQQDITQKKKIELKPSLELII 91

Query:   106 HYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESP 165
               +L GN+ES+A +      NS R DS+IL F DAKISVL++D  +    I S+H FE  
Sbjct:    92 EKKLFGNIESMASVRY---PNSER-DSLILTFRDAKISVLDYDSDLLDFEIRSLHYFEKD 147

Query:   166 EWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDED 218
             E+   K GR  F   PL+KVD Q RC  +L+Y   + +L   +  S L  D+D
Sbjct:   148 EF---KGGRNHFKHPPLLKVDTQQRCAVMLLYDRNLAVLPFKKTSSILDDDDD 197

 Score = 179 (68.1 bits), Expect = 4.8e-73, Sum P(6) = 4.8e-73
 Identities = 68/244 (27%), Positives = 113/244 (46%)

Query:   912 QRITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQ-LCDGS---------------IV 955
             +RI  F +ISG +G F+ G +P W    +  LR+H     D S               + 
Sbjct:  1122 KRIFEFSSISGKRGLFIGGKKPIWAFCEKGYLRLHSMDSSDNSNSNNSNNNNNNNSNTVE 1181

Query:   956 AFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV--IPLKATPHQITYFAE 1013
              FT  +N++C  GFIY + +    + ++ + ST  N+     +  IP K + H+I Y +E
Sbjct:  1182 TFTSFNNISCQDGFIYFSKEK--DVIKICTLSTLMNFENDIAIRRIPTKNSCHKIAYHSE 1239

Query:  1014 KNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDR 1073
                Y +IVS P      QV      QE+  Q D+      D       +++++++++P  
Sbjct:  1240 AKCYVVIVSFP------QVT-----QEL--QEDSKKPILTD-------DKFQIKLIDPT- 1278

Query:  1074 AGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENET----LLAIGTAYVQGEDVAARGR 1129
                 W+   +  +Q  E  L +++V+L   T  +  T     L IGTA+  GED   +GR
Sbjct:  1279 IDWNWKFIDSFSLQDRETVLAMKIVSL-KFTEPDGITRARPFLVIGTAFTFGEDTQCKGR 1337

Query:  1130 VLLF 1133
             VL+F
Sbjct:  1338 VLVF 1341

 Score = 119 (46.9 bits), Expect = 4.8e-73, Sum P(6) = 4.8e-73
 Identities = 35/148 (23%), Positives = 72/148 (48%)

Query:   572 DDEYHAYLIISL-EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG 630
             D  +H YL +SL +  T++ ET   L EV +        +++  GNLFGR+R++ +++ G
Sbjct:   712 DKNWHDYLYLSLKDGTTLIFETGRDLKEVGK-----FNFKSLDIGNLFGRKRIVVIYQGG 766

Query:   631 ARILDG-SYMTQDLSFGPXXXXXXXXXXXXTVLSVSIADPYVLLGMSDGSIRLLVG-DPS 688
              ++++G   + Q++                 + S  I DP++LL   +G+I++  G D  
Sbjct:   767 IKLINGFDRVIQEIQINE------------PIKSSYICDPFILLQFHNGTIQIFKGIDEE 814

Query:   689 TCTVSVQTPAAIESSKKPVSSCTLYHDK 716
                +     +   +  + + S +L+ D+
Sbjct:   815 NQLIQFSINSISNNLNQSIFSSSLFFDR 842

 Score = 73 (30.8 bits), Expect = 1.2e-36, Sum P(6) = 1.2e-36
 Identities = 22/82 (26%), Positives = 35/82 (42%)

Query:   470 SSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADAS 529
             S +  L + +  EE  L+    N      K++   + D ++NIGP+ D   G  I+    
Sbjct:   547 SIASKLLEEIEDEEDQLFKEKKNQL----KSYQLGICDQIINIGPIGDIVVGQSIDPTYD 602

Query:   530 ATGISKQSNY--ELVELPGCKG 549
              T    Q  Y  + +EL  C G
Sbjct:   603 ETIQPNQPEYVPKTLELVTCSG 624

 Score = 64 (27.6 bits), Expect = 1.8e-65, Sum P(4) = 1.8e-65
 Identities = 21/89 (23%), Positives = 38/89 (42%)

Query:   766 FDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVV 825
             F++P    V+TV K      HI     +   K    + N+++E+   +      +  +  
Sbjct:   646 FELPGILNVWTVYKEEIEEEHIEKEIKKNTSKKRSRDENNNNEQEDNEQEDNEDNEEEEE 705

Query:   826 ELAMQRWSAHHSRPFLFAILTDGTILCYQ 854
             E  MQ+    H   +L+  L DGT L ++
Sbjct:   706 EEKMQKDKNWHD--YLYLSLKDGTTLIFE 732

 Score = 58 (25.5 bits), Expect = 4.8e-73, Sum P(6) = 4.8e-73
 Identities = 19/77 (24%), Positives = 38/77 (49%)

Query:   748 DQGDIYSVVCYESGALEIFDVPNFNCVFTVD--KF---VSG-RTHIVDTYMREALKDSET 801
             DQ +IY  +   +G+ EI+ + +  C+F V   KF   + G  T++    + E +   ++
Sbjct:   932 DQDNIYLNIYTTNGSYEIYRLTSQECIFKVSDIKFEYDILGINTNVSQNQILEQVLTPKS 991

Query:   802 EINSSSEEGTGQGRKEN 818
              ++    +   Q +KEN
Sbjct:   992 SLSKKQLQQHLQKQKEN 1008

 Score = 53 (23.7 bits), Expect = 1.6e-72, Sum P(6) = 1.6e-72
 Identities = 14/60 (23%), Positives = 31/60 (51%)

Query:   797 KDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
             K  E  INS +       + +N   + +VE+++  ++  +S P+LF     G ++ Y+++
Sbjct:  1004 KQKENGINSKNN----YNQIQNSEILDIVEISLHNFN--NSDPYLFMFNKIGDLIIYKSF 1057

 Score = 49 (22.3 bits), Expect = 4.8e-73, Sum P(6) = 4.8e-73
 Identities = 8/12 (66%), Positives = 9/12 (75%)

Query:   543 ELPGCKGIWTVY 554
             ELPG   +WTVY
Sbjct:   647 ELPGILNVWTVY 658


>UNIPROTKB|F1RSN8
            symbol:CPSF1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006378 "mRNA
            polyadenylation" evidence=IEA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA]
            [GO:0003730 "mRNA 3'-UTR binding" evidence=IEA] InterPro:IPR004871
            Pfam:PF03178 GO:GO:0006378 GO:GO:0003730 GO:GO:0005847
            GO:GO:0006379 GeneTree:ENSGT00550000075040 OMA:NIGDNRY
            EMBL:CU468594 Ensembl:ENSSSCT00000006486 Uniprot:F1RSN8
        Length = 1108

 Score = 466 (169.1 bits), Expect = 9.9e-71, Sum P(4) = 9.9e-71
 Identities = 114/325 (35%), Positives = 176/325 (54%)

Query:    57 NLVVTAANVIEIYVVRVQXXXXXXXXXXXXTKRRVLMDGISAASLELVCHYRLHGNVESL 116
             NLVV  A   ++YV R+             T+ +   +      LELV  +   G V S+
Sbjct:    29 NLVV--AGTSQLYVYRLNRDAEAPTKNDRSTEGKAHRE--HREKLELVASFSFFG-VMSM 83

Query:   117 AILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRES 176
             A +   GA    +RD+++L+F+DAK+SV+E+D   H L+  S+H FE PE   L+ G   
Sbjct:    84 ASVQLAGA----KRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPE---LRDGFVQ 136

Query:   177 FARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVI 236
                 P V+VDP GRC  +L+YG ++++L   +     + +E     G G  +    S++I
Sbjct:   137 NVHTPRVRVDPDGRCAAMLIYGTRLVVLPFRRES---LAEEHEGLVGEGQRSSFLPSYII 193

Query:   237 NLRDLDMK--HVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ 294
             ++R LD K  ++ D  F+HGY EP ++IL E   TW GRV+ +  TC I A+S++ T K 
Sbjct:   194 DVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKV 253

Query:   295 HPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNYAVSLDSSQ 353
             HP+IWS  +LP D  + LAVP PIGGV++   N++ Y +QS     +ALN+      +  
Sbjct:   254 HPVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFP 313

Query:   354 ELPRSSFSVELDAAHATWLQN-DVA 377
                +    + LD A A ++ + DVA
Sbjct:   314 LRTQEGVRITLDCAQAAFISSQDVA 338

 Score = 296 (109.3 bits), Expect = 9.9e-71, Sum P(4) = 9.9e-71
 Identities = 75/251 (29%), Positives = 121/251 (48%)

Query:   913 RITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIY 971
             R   F++I G+ G F+ G  P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y
Sbjct:   595 RFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLY 654

Query:   972 VTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQ 1031
                QG L+I  LP+  +YD  WPV+K IPL+ T H + Y  E  +Y +  S     P  +
Sbjct:   655 FNRQGELRISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR 711

Query:  1032 VLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSS 1089
             +  +  +++    ID  +     +H     E + ++++ P      W+    A I ++  
Sbjct:   712 IPRMTGEEKEFETIDRDDRY---IHPQQ--EAFSIQLISPVS----WEAIPNARIELEEW 762

Query:  1090 ENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLS 1148
             E+   ++ V+L +  T    +  +A GT  +QGE+V  RGR+L+         P   +  
Sbjct:   763 EHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTK 822

Query:  1149 GSYGPLFSSVQ 1159
               +  L+   Q
Sbjct:   823 NKFKVLYEKEQ 833

 Score = 94 (38.1 bits), Expect = 9.9e-71, Sum P(4) = 9.9e-71
 Identities = 27/98 (27%), Positives = 47/98 (47%)

Query:   759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
             E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T QG    
Sbjct:   455 ENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEAR----KEEATRQGELPL 510

Query:   819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
             +  + +V L  ++      RP+L  +  D  +L Y+A+
Sbjct:   511 VKEVLLVALGSRQ-----RRPYLL-VHVDQELLIYEAF 542

 Score = 56 (24.8 bits), Expect = 1.6e-40, Sum P(3) = 1.6e-40
 Identities = 26/88 (29%), Positives = 37/88 (42%)

Query:  1071 PDRAGGPWQTRATIPMQSSENALTV-RVVT-LFNTTTKENETLLAIGTAYVQGEDVAARG 1128
             PD A  P + R   P QS   AL V R V+ +F T ++       +G     G +   +G
Sbjct:   341 PDPAAAPTEPRPPPPQQSKVIALCVYRDVSGMFTTESRLGGARDELGGR--SGSEAEGQG 398

Query:  1129 RVLLFSTGRNADNPQNLVLSGSYGPLFS 1156
                   T    D+ + + L G  G LFS
Sbjct:   399 S----ETSPTVDDEEEM-LYGDSGSLFS 421

 Score = 45 (20.9 bits), Expect = 9.9e-71, Sum P(4) = 9.9e-71
 Identities = 15/52 (28%), Positives = 25/52 (48%)

Query:     3 FAAYKMMHWPTGIA-NCGSGFITHSRADYV----PQIPLIQTEELDSELPSK 49
             +A YK  H PTG+  +    F  +S  + V     Q+ + +    D+E P+K
Sbjct:     2 YAVYKQAHPPTGLEFSMYCNFFNNSERNLVVAGTSQLYVYRLNR-DAEAPTK 52

 Score = 40 (19.1 bits), Expect = 7.4e-39, Sum P(3) = 7.4e-39
 Identities = 12/34 (35%), Positives = 21/34 (61%)

Query:   468 RRSSSDALQDMVNGEELS--LYGSASNNTESAQK 499
             RR   +A++++++GE L+  LY S     E A+K
Sbjct:  1053 RRVLQNAVRNVLDGELLNRYLYLSTMERGELAKK 1086


>WB|WBGene00022301 [details] [associations]
            symbol:cpsf-1 species:6239 "Caenorhabditis elegans"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0009792 "embryo development ending in
            birth or egg hatching" evidence=IMP] [GO:0000003 "reproduction"
            evidence=IMP] [GO:0040007 "growth" evidence=IMP] [GO:0002119
            "nematode larval development" evidence=IMP] [GO:0040018 "positive
            regulation of multicellular organism growth" evidence=IMP]
            [GO:0010171 "body morphogenesis" evidence=IMP] [GO:0040027
            "negative regulation of vulval development" evidence=IMP]
            InterPro:IPR004871 Pfam:PF03178 GO:GO:0009792 GO:GO:0040007
            GO:GO:0002119 GO:GO:0006378 GO:GO:0010171 GO:GO:0040018
            GO:GO:0000003 GO:GO:0003723 GO:GO:0040027 eggNOG:COG5161 KO:K14401
            GO:GO:0005847 GO:GO:0006379 GeneTree:ENSGT00550000075040
            OMA:NIGDNRY HOGENOM:HOG000007904 EMBL:FO081666 RefSeq:NP_500157.2
            ProteinModelPortal:Q9N4C2 MINT:MINT-3384281 STRING:Q9N4C2
            PaxDb:Q9N4C2 EnsemblMetazoa:Y76B12C.7.1 EnsemblMetazoa:Y76B12C.7.2
            GeneID:177003 KEGG:cel:CELE_Y76B12C.7 CTD:177003 WormBase:Y76B12C.7
            InParanoid:Q9N4C2 NextBio:894932 Uniprot:Q9N4C2
        Length = 1454

 Score = 431 (156.8 bits), Expect = 1.9e-63, Sum P(3) = 1.9e-63
 Identities = 158/551 (28%), Positives = 257/551 (46%)

Query:   169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA 228
             +L+ G  +  + PLV+ DP  RC   LVYG  + IL   +                  S 
Sbjct:   128 YLRDGFINHFQPPLVRSDPSNRCAACLVYGKHIAILPFHEN-----------------SK 170

Query:   229 RIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISAL 286
             RI S +VI L+ +D  + ++ D +F+ GY EP ++ L+E   T  GR   ++ T  I  +
Sbjct:   171 RIHS-YVIPLKQIDPRLDNIADMVFLDGYYEPTILFLYEPIQTTPGRACVRYDTMCIMGV 229

Query:   287 SISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNY 345
             S++   +Q  ++W   NLP D  +LL +P P+GG LV G+NT+ Y +Q+   C L LN+ 
Sbjct:   230 SVNIVDRQFAVVWQTANLPMDCSQLLPIPKPLGGALVFGSNTVVYLNQAVPPCGLVLNS- 288

Query:   346 AVSLDSSQELPRSSFS---VELDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQR 400
                 D   + P        + LD + + ++++    + ++ GDL LL ++    G  V+ 
Sbjct:   289 --CYDGFTKFPLKDLKHLKMTLDCSTSVYMEDGRIAVGSRDGDLFLLRLMTSSGGGTVKS 346

Query:   401 LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
             L+ SK   + +   +T       F+GSRLGDS L+++T           LK        D
Sbjct:   347 LEFSKVYETSIAYSLTVCAPGHLFVGSRLGDSQLLEYTL----------LKTT-----RD 391

Query:   461 APSTKRLRRSSSD--ALQDMVNGEELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIG 513
               + KRL+  + D  A +  ++ +++ LYG A     +++ E   ++  F   D L N+G
Sbjct:   392 C-AVKRLKIDNKDPAAAEIELDEDDMELYGGAIEEQQNDDDEQIDESLQFRELDRLRNVG 450

Query:   514 PLKDFSYGLRINADASATGISKQSN--YELVELPGC--KGIWTVYHKSSRGHNADSSRMA 569
             P+K    G R N  ++    +K+ +  ++LV   G    G   V+ +S R     SS + 
Sbjct:   451 PVKSMCVG-RPNYMSNDLVDAKRRDPVFDLVTASGHGKNGALCVHQRSLRPEIITSSLLE 509

Query:   570 AYD---------DEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR-TIAAGNLFG 619
               +         +E H YLI+S    T++LE  + L E+ E +  FV G  T+AAG L  
Sbjct:   510 GAEQLWAVGRKENESHKYLIVSRVRSTLILELGEELVELEEQL--FVTGEPTVAAGELSQ 567

Query:   620 RRRVIQVFERG-ARILDGSYMTQDLSFGPXXXXXXXXXXXXTVLSVSIADPYVLLGMSDG 678
                 +QV     A + DG  M Q++                 V+  SI DPYV L   +G
Sbjct:   568 GALAVQVTSTCIALVTDGQQM-QEVHID----------SNFPVIQASIVDPYVALLTQNG 616

Query:   679 SIRL--LVGDP 687
              + L  LV +P
Sbjct:   617 RLLLYELVMEP 627

 Score = 228 (85.3 bits), Expect = 1.9e-63, Sum P(3) = 1.9e-63
 Identities = 98/425 (23%), Positives = 180/425 (42%)

Query:   723 RKTSTDAWLSTGVGEAIDGADGG-PLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFV 781
             ++   DA  S+  GE  D  D          + +V +E+G L I  +P    V+ + +F 
Sbjct:   744 KRLGHDAIQSSRGGEQSDAIDPTRTFSSISHWLIVSHENGRLSIHSLPEMEVVYQIGRFS 803

Query:   782 SGRTHIVDTYMREALKDSETEINSSSEEG---TGQGRKENIHSMKVVELAMQR----WSA 834
             +    +VD  + E  K+ + +   +++E    T +  + N    ++ E  ++        
Sbjct:   804 NVPELLVDLTVEEEEKERKAKAQQAAKEASVPTDEAEQLNTEMKQLCERVLEAQIVGMGI 863

Query:   835 HHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPXXXXXXXXXXXXXXXXXXXLRFSR 894
             + + P L AI+ +  +L Y+ +       +S +  P                      + 
Sbjct:   864 NQAHPILMAIVDEQVVL-YEMF-------SSSNPIPGHLGISFRKLPHFICLRTSSHLNS 915

Query:   895 TPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFRE--RLRVHPQLCD 951
                 A    +  +G     I  F+ +S  + G  + G+ P   +V+     ++ H    D
Sbjct:   916 DGKRAPFEMKINNGKRFSLIHPFERVSSVNNGVMIVGAVPT-LLVYGAWGGMQTHQMTVD 974

Query:   952 GSIVAFTVLHNVNCNHGFIYVTS-QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITY 1010
             G I AFT  +N N  HG +Y+T  +  L+I ++     Y+  +PV+K I +  T H + Y
Sbjct:   975 GPIKAFTPFNNENVLHGIVYMTQHKSELRIARMHPDFDYEMPYPVKK-IEVGRTIHHVRY 1033

Query:  1011 FAEKNLYPLIVSVPVLKPLNQVLSLLID--QEVGHQIDNHNLSSVDLHRTYTVEEYEVRI 1068
                 ++Y ++ S+P  KP N++  ++ D  QE  H+ D + +  +     YT+  +  + 
Sbjct:  1034 LMNSDVYAVVSSIP--KPSNKIWVVMNDDKQEEIHEKDENFV--LPAPPKYTLNLFSSQ- 1088

Query:  1069 LEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAAR 1127
                D A  P      I  +  E       V L + +T    ETLLA+GT    GE+V  R
Sbjct:  1089 ---DWAAVP---NTEISFEDMEAVTACEDVALKSESTISGLETLLAMGTVNNYGEEVLVR 1142

Query:  1128 GRVLL 1132
             GR++L
Sbjct:  1143 GRIIL 1147

 Score = 136 (52.9 bits), Expect = 1.9e-63, Sum P(3) = 1.9e-63
 Identities = 27/75 (36%), Positives = 46/75 (61%)

Query:   130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
             +DSI++ F+DAK+S++  ++    ++  S+H FE+    +L+ G  +  + PLV+ DP  
Sbjct:    92 QDSILMTFDDAKLSIVSINEKERNMQTISLHAFENE---YLRDGFINHFQPPLVRSDPSN 148

Query:   190 RCGGVLVYGLQMIIL 204
             RC   LVYG  + IL
Sbjct:   149 RCAACLVYGKHIAIL 163

 Score = 43 (20.2 bits), Expect = 3.4e-22, Sum P(3) = 3.4e-22
 Identities = 9/40 (22%), Positives = 19/40 (47%)

Query:   444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE 483
             +++      E G+      +T++ +R   DA+Q    GE+
Sbjct:   720 TIMEQNFPVENGEATIKQSNTRKRKRLGHDAIQSSRGGEQ 759

 Score = 37 (18.1 bits), Expect = 4.5e-12, Sum P(3) = 4.5e-12
 Identities = 8/20 (40%), Positives = 11/20 (55%)

Query:    45 ELPSKRGIGPVPNLVVTAAN 64
             EL   R +GPV ++ V   N
Sbjct:   442 ELDRLRNVGPVKSMCVGRPN 461


>UNIPROTKB|Q9N4C2 [details] [associations]
            symbol:cpsf-1 "Probable cleavage and polyadenylation
            specificity factor subunit 1" species:6239 "Caenorhabditis elegans"
            [GO:0006378 "mRNA polyadenylation" evidence=NAS] [GO:0006379 "mRNA
            cleavage" evidence=NAS] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=NAS]
            InterPro:IPR004871 Pfam:PF03178 GO:GO:0009792 GO:GO:0040007
            GO:GO:0002119 GO:GO:0006378 GO:GO:0010171 GO:GO:0040018
            GO:GO:0000003 GO:GO:0003723 GO:GO:0040027 eggNOG:COG5161 KO:K14401
            GO:GO:0005847 GO:GO:0006379 GeneTree:ENSGT00550000075040
            OMA:NIGDNRY HOGENOM:HOG000007904 EMBL:FO081666 RefSeq:NP_500157.2
            ProteinModelPortal:Q9N4C2 MINT:MINT-3384281 STRING:Q9N4C2
            PaxDb:Q9N4C2 EnsemblMetazoa:Y76B12C.7.1 EnsemblMetazoa:Y76B12C.7.2
            GeneID:177003 KEGG:cel:CELE_Y76B12C.7 CTD:177003 WormBase:Y76B12C.7
            InParanoid:Q9N4C2 NextBio:894932 Uniprot:Q9N4C2
        Length = 1454

 Score = 431 (156.8 bits), Expect = 1.9e-63, Sum P(3) = 1.9e-63
 Identities = 158/551 (28%), Positives = 257/551 (46%)

Query:   169 HLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSA 228
             +L+ G  +  + PLV+ DP  RC   LVYG  + IL   +                  S 
Sbjct:   128 YLRDGFINHFQPPLVRSDPSNRCAACLVYGKHIAILPFHEN-----------------SK 170

Query:   229 RIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISAL 286
             RI S +VI L+ +D  + ++ D +F+ GY EP ++ L+E   T  GR   ++ T  I  +
Sbjct:   171 RIHS-YVIPLKQIDPRLDNIADMVFLDGYYEPTILFLYEPIQTTPGRACVRYDTMCIMGV 229

Query:   287 SISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS-CALALNNY 345
             S++   +Q  ++W   NLP D  +LL +P P+GG LV G+NT+ Y +Q+   C L LN+ 
Sbjct:   230 SVNIVDRQFAVVWQTANLPMDCSQLLPIPKPLGGALVFGSNTVVYLNQAVPPCGLVLNS- 288

Query:   346 AVSLDSSQELPRSSFS---VELDAAHATWLQNDVALLSTKTGDLVLLTVVYD--GRVVQR 400
                 D   + P        + LD + + ++++    + ++ GDL LL ++    G  V+ 
Sbjct:   289 --CYDGFTKFPLKDLKHLKMTLDCSTSVYMEDGRIAVGSRDGDLFLLRLMTSSGGGTVKS 346

Query:   401 LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEAD 460
             L+ SK   + +   +T       F+GSRLGDS L+++T           LK        D
Sbjct:   347 LEFSKVYETSIAYSLTVCAPGHLFVGSRLGDSQLLEYTL----------LKTT-----RD 391

Query:   461 APSTKRLRRSSSD--ALQDMVNGEELSLYGSA-----SNNTESAQKTFSFAVRDSLVNIG 513
               + KRL+  + D  A +  ++ +++ LYG A     +++ E   ++  F   D L N+G
Sbjct:   392 C-AVKRLKIDNKDPAAAEIELDEDDMELYGGAIEEQQNDDDEQIDESLQFRELDRLRNVG 450

Query:   514 PLKDFSYGLRINADASATGISKQSN--YELVELPGC--KGIWTVYHKSSRGHNADSSRMA 569
             P+K    G R N  ++    +K+ +  ++LV   G    G   V+ +S R     SS + 
Sbjct:   451 PVKSMCVG-RPNYMSNDLVDAKRRDPVFDLVTASGHGKNGALCVHQRSLRPEIITSSLLE 509

Query:   570 AYD---------DEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR-TIAAGNLFG 619
               +         +E H YLI+S    T++LE  + L E+ E +  FV G  T+AAG L  
Sbjct:   510 GAEQLWAVGRKENESHKYLIVSRVRSTLILELGEELVELEEQL--FVTGEPTVAAGELSQ 567

Query:   620 RRRVIQVFERG-ARILDGSYMTQDLSFGPXXXXXXXXXXXXTVLSVSIADPYVLLGMSDG 678
                 +QV     A + DG  M Q++                 V+  SI DPYV L   +G
Sbjct:   568 GALAVQVTSTCIALVTDGQQM-QEVHID----------SNFPVIQASIVDPYVALLTQNG 616

Query:   679 SIRL--LVGDP 687
              + L  LV +P
Sbjct:   617 RLLLYELVMEP 627

 Score = 228 (85.3 bits), Expect = 1.9e-63, Sum P(3) = 1.9e-63
 Identities = 98/425 (23%), Positives = 180/425 (42%)

Query:   723 RKTSTDAWLSTGVGEAIDGADGG-PLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFV 781
             ++   DA  S+  GE  D  D          + +V +E+G L I  +P    V+ + +F 
Sbjct:   744 KRLGHDAIQSSRGGEQSDAIDPTRTFSSISHWLIVSHENGRLSIHSLPEMEVVYQIGRFS 803

Query:   782 SGRTHIVDTYMREALKDSETEINSSSEEG---TGQGRKENIHSMKVVELAMQR----WSA 834
             +    +VD  + E  K+ + +   +++E    T +  + N    ++ E  ++        
Sbjct:   804 NVPELLVDLTVEEEEKERKAKAQQAAKEASVPTDEAEQLNTEMKQLCERVLEAQIVGMGI 863

Query:   835 HHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPXXXXXXXXXXXXXXXXXXXLRFSR 894
             + + P L AI+ +  +L Y+ +       +S +  P                      + 
Sbjct:   864 NQAHPILMAIVDEQVVL-YEMF-------SSSNPIPGHLGISFRKLPHFICLRTSSHLNS 915

Query:   895 TPLDAYTREETPHGAPCQRITIFKNISG-HQGFFLSGSRPCWCMVFRE--RLRVHPQLCD 951
                 A    +  +G     I  F+ +S  + G  + G+ P   +V+     ++ H    D
Sbjct:   916 DGKRAPFEMKINNGKRFSLIHPFERVSSVNNGVMIVGAVPT-LLVYGAWGGMQTHQMTVD 974

Query:   952 GSIVAFTVLHNVNCNHGFIYVTS-QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITY 1010
             G I AFT  +N N  HG +Y+T  +  L+I ++     Y+  +PV+K I +  T H + Y
Sbjct:   975 GPIKAFTPFNNENVLHGIVYMTQHKSELRIARMHPDFDYEMPYPVKK-IEVGRTIHHVRY 1033

Query:  1011 FAEKNLYPLIVSVPVLKPLNQVLSLLID--QEVGHQIDNHNLSSVDLHRTYTVEEYEVRI 1068
                 ++Y ++ S+P  KP N++  ++ D  QE  H+ D + +  +     YT+  +  + 
Sbjct:  1034 LMNSDVYAVVSSIP--KPSNKIWVVMNDDKQEEIHEKDENFV--LPAPPKYTLNLFSSQ- 1088

Query:  1069 LEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAAR 1127
                D A  P      I  +  E       V L + +T    ETLLA+GT    GE+V  R
Sbjct:  1089 ---DWAAVP---NTEISFEDMEAVTACEDVALKSESTISGLETLLAMGTVNNYGEEVLVR 1142

Query:  1128 GRVLL 1132
             GR++L
Sbjct:  1143 GRIIL 1147

 Score = 136 (52.9 bits), Expect = 1.9e-63, Sum P(3) = 1.9e-63
 Identities = 27/75 (36%), Positives = 46/75 (61%)

Query:   130 RDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQG 189
             +DSI++ F+DAK+S++  ++    ++  S+H FE+    +L+ G  +  + PLV+ DP  
Sbjct:    92 QDSILMTFDDAKLSIVSINEKERNMQTISLHAFENE---YLRDGFINHFQPPLVRSDPSN 148

Query:   190 RCGGVLVYGLQMIIL 204
             RC   LVYG  + IL
Sbjct:   149 RCAACLVYGKHIAIL 163

 Score = 43 (20.2 bits), Expect = 3.4e-22, Sum P(3) = 3.4e-22
 Identities = 9/40 (22%), Positives = 19/40 (47%)

Query:   444 SMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEE 483
             +++      E G+      +T++ +R   DA+Q    GE+
Sbjct:   720 TIMEQNFPVENGEATIKQSNTRKRKRLGHDAIQSSRGGEQ 759

 Score = 37 (18.1 bits), Expect = 4.5e-12, Sum P(3) = 4.5e-12
 Identities = 8/20 (40%), Positives = 11/20 (55%)

Query:    45 ELPSKRGIGPVPNLVVTAAN 64
             EL   R +GPV ++ V   N
Sbjct:   442 ELDRLRNVGPVKSMCVGRPN 461


>UNIPROTKB|J9P418 [details] [associations]
            symbol:CPSF1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0005634 "nucleus" evidence=IEA] [GO:0003676
            "nucleic acid binding" evidence=IEA] InterPro:IPR004871
            Pfam:PF03178 GO:GO:0005634 GO:GO:0003676
            GeneTree:ENSGT00550000075040 EMBL:AAEX03008966
            Ensembl:ENSCAFT00000043656 Uniprot:J9P418
        Length = 1107

 Score = 349 (127.9 bits), Expect = 2.3e-53, Sum P(3) = 2.3e-53
 Identities = 110/407 (27%), Positives = 183/407 (44%)

Query:   759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
             E+G +EI+ +P++  VF V  F  G+  +VD+    +     T+  +  EE T QG    
Sbjct:   454 ENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDS----SFGQPTTQAEARKEEATRQGELPL 509

Query:   819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPXXXXXXXX 878
             +  + +V L  ++     SRP+L  +  D  +L Y+A+    P + S+            
Sbjct:   510 VKEVLLVALGSRQ-----SRPYLL-VHVDQELLIYEAF----PHD-SQLGQGNLKVRFKK 558

Query:   879 XXXXXXXXXXXLRFSRTPLDAYTREETPHGAPCQ--RITIFKNISGHQGFFLSGSRPCWC 936
                         + S+   +    EE   GA  +  R   F++I G+ G F+ G  P W 
Sbjct:   559 VPHNINFREKKPKPSKKKAEGGGAEEGA-GARGRVARFRYFEDIYGYSGVFICGPSPHWL 617

Query:   937 MVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPV 995
             +V  R  LR+HP   DG I +F   HNVNC  GF+Y   QG L+I  LP+  +YD  WPV
Sbjct:   618 LVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPV 677

Query:   996 QKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDL 1055
             +K IPL+ T H + Y  E  +Y +  S  +  P  +     I +  G + +   +   D 
Sbjct:   678 RK-IPLRCTAHYVAYHVESKVYAVATSTNM--PCTR-----IPRMTGEEKEFETIERDDR 729

Query:  1056 HRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSSENALTVRVVTLFNTTTKEN-ETLL 1112
             +     E + ++++ P      W+    A I ++  E+   ++ V+L +  T    +  +
Sbjct:   730 YIHPQQEAFSIQLISPVS----WEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYV 785

Query:  1113 AIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQ 1159
             A GT  +QGE+V  RGR+L+         P   +    +  L+   Q
Sbjct:   786 AAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQ 832

 Score = 176 (67.0 bits), Expect = 2.3e-53, Sum P(3) = 2.3e-53
 Identities = 49/152 (32%), Positives = 79/152 (51%)

Query:   543 ELPGCKGIWTVY-------HKSSRGHNAD--SSRMAAYDD-EYHAYLIISLEARTMVLET 592
             ELPGC  +WTV         ++S+G  A+  SS + A DD   H +LI+S E  TM+L+T
Sbjct:   193 ELPGCYDMWTVIAPVRKEQEETSKGEVAEQESSALEAEDDGRRHGFLILSREDSTMILQT 252

Query:   593 ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXXXX 652
                + E+  S  +  QG T+ AGN+   R ++QV   G R+L+G      L F P     
Sbjct:   253 GQEIMELDTS-GFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEG---VNQLHFIPVDLGS 308

Query:   653 XXXXXXXTVLSVSIADPYVLLGMSDGSIRLLV 684
                     ++  ++ADPYV++  ++G + + +
Sbjct:   309 -------PIVQCAVADPYVVIMSAEGHVTMFL 333

 Score = 172 (65.6 bits), Expect = 2.3e-53, Sum P(3) = 2.3e-53
 Identities = 53/151 (35%), Positives = 82/151 (54%)

Query:   378 LLSTKTGDLVLLTVVYDG-RVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
             ++S K G++ +LT++ DG R V+     K   SVLT+ + T+     FLGSRLG+SLL++
Sbjct:     2 VISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLFLGSRLGNSLLLK 61

Query:   437 FTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSS-----DALQDMVNGEELSLYGS-A 490
             +T        S+    E  D E      KR+  ++         QD V  +E+ +YGS A
Sbjct:    62 YTEKLQEPPASAA--REAADKEEPPSKKKRVDCAAGWSGGKSVPQDEV--DEIEVYGSEA 117

Query:   491 SNNTESAQKTFSFAVRDSLVNIGPLKDFSYG 521
              + T+ A  T+SF V DS++NIGP  + + G
Sbjct:   118 QSGTQLA--TYSFEVCDSILNIGPCANAAMG 146

 Score = 49 (22.3 bits), Expect = 3.1e-27, Sum P(2) = 3.1e-27
 Identities = 21/74 (28%), Positives = 36/74 (48%)

Query:   283 ISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY-----HSQSAS 337
             I  L  S    Q P +++  N+  + Y ++ V SP+G  L+ G N +H+      S    
Sbjct:   256 IMELDTSGFATQGPTVFAG-NIGDNRY-IVQV-SPLGIRLLEGVNQLHFIPVDLGSPIVQ 312

Query:   338 CALALNNYAVSLDS 351
             CA+A + Y V + +
Sbjct:   313 CAVA-DPYVVIMSA 325


>POMBASE|SPBC1709.08 [details] [associations]
            symbol:cft1 "cleavage factor one Cft1 (predicted)"
            species:4896 "Schizosaccharomyces pombe" [GO:0003723 "RNA binding"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IDA] [GO:0005829
            "cytosol" evidence=IDA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IDA]
            [GO:0005849 "mRNA cleavage factor complex" evidence=ISO]
            [GO:0006378 "mRNA polyadenylation" evidence=ISO] [GO:0006379 "mRNA
            cleavage" evidence=ISO] InterPro:IPR004871 Pfam:PF03178
            PomBase:SPBC1709.08 GO:GO:0005829 EMBL:CU329671 GO:GO:0006378
            GenomeReviews:CU329671_GR GO:GO:0003723 eggNOG:COG5161 KO:K14401
            OMA:HNDRIFQ OrthoDB:EOG451HZS PIR:T39636 RefSeq:NP_595441.1
            STRING:O74733 EnsemblFungi:SPBC1709.08.1 GeneID:2539694
            KEGG:spo:SPBC1709.08 NextBio:20800847 GO:GO:0005847 GO:GO:0006379
            Uniprot:O74733
        Length = 1441

 Score = 268 (99.4 bits), Expect = 3.3e-30, Sum P(3) = 3.3e-30
 Identities = 117/467 (25%), Positives = 200/467 (42%)

Query:   101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
             L LV   ++ G +  ++ L   G++     D +I+  + AK+S LE+D         S+H
Sbjct:    92 LRLVSQVKVFGTITEISALKGKGSNGC---DLLIMLTDYAKVSTLEWDMQSQSFVTNSLH 148

Query:   161 CFESPEWLHLKRGRESFARGPL-VKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDT 219
              +E      +K      +  P  + VDP   C  +L +   M+ +        L  +E  
Sbjct:   149 YYED-----VKSSNICSSHTPTQLLVDPDSDCC-LLRFLTDMMAIIPYPANEDLDMEEAA 202

Query:   220 F-GSGGGFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSW 276
                S    S   + S V+    LD  +  + D  F++GY EP + IL+  E T    +  
Sbjct:   203 IENSKISSSYAYKPSFVLASSQLDASISRILDVKFLYGYREPTLAILYSPEQTSTVTLPL 262

Query:   277 KHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHY-HSQS 335
             +  T + S +++    +   +I +  +LP+D Y  +++P+P+GG L++G N + Y  S  
Sbjct:   263 RKDTVLFSLVTLDLEQRASAVITTIQSLPYDIYASVSIPTPLGGSLLLGGNELIYVDSAG 322

Query:   336 ASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQND------VALLSTKTGDLVLL 389
              +  + +N+Y           +S F++EL+   A  L +       V L+ T +G    L
Sbjct:   323 RTVGIGVNSYYSKCTDFPLQDQSDFNLELEGTIAIPLTSSKTETPFVVLVHT-SGQFFYL 381

Query:   390 TVVYDGRVVQRLDLS----KTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFTCGSG 442
               + DG+ V+ L L     + N   L S IT     G +L FLGS+  DS L++++    
Sbjct:   382 DFLLDGKSVKGLSLQALDLEINDDFLKSGITCAVPAGENLVFLGSQTTDSYLLRWS--RR 439

Query:   443 TSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS 502
             T+     L E  GD   D      L  ++   + DM++  E      +            
Sbjct:   440 TTNEEVRLDE--GD---DT-----LYGTNDAEMDDMLDIYETDESVGSKRKIAYENGPLR 489

Query:   503 FAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKG 549
               + D L NIGP+ DF+ G      A +     Q N+  +EL G  G
Sbjct:   490 LEICDVLTNIGPITDFAVG-----KAGSYSYFPQDNHGPLELVGTAG 531

 Score = 177 (67.4 bits), Expect = 3.3e-30, Sum P(3) = 3.3e-30
 Identities = 80/338 (23%), Positives = 134/338 (39%)

Query:   801 TEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG 860
             T  N    E T    KE+  S ++VEL +         P LF       I  Y+A+L+  
Sbjct:   831 TLFNGMESERT-YFNKES--SQELVELLVADLGDDFKEPHLFLRSRLNEITVYKAFLYS- 886

Query:   861 PENTSKSDDPXXXXXXXXXXXXXXXXXXXLRFSRTPLDAYTREETPHGAPCQ--RITIFK 918
               NT K  +                         TP DA +  E    +     ++T  +
Sbjct:   887 --NTDKHKNLLAFAKVPQETMTREFQANV----GTPRDAESTMEKKASSSVDHLKMTALE 940

Query:   919 NISGHQGFFLSGSRPCWCM-VFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI 977
              +  H   F++G +P   +       +  P   +  I++    H  +   G+IYV     
Sbjct:   941 VVGNHSAVFVTGRKPFLILSTLHSNAKFFPISSNIPILSVAPFHAHHAPQGYIYVDENSF 1000

Query:   978 LKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPV-LKPLNQVLSLL 1036
             ++IC+      YDN WP +KV  L    + I Y   K +Y +  +VP+  K  ++     
Sbjct:  1001 IRICKFQEDFEYDNKWPYKKV-SLGKQINGIAYHPTKMVYAVGSAVPIEFKVTDE----- 1054

Query:  1037 IDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVR 1096
              D    + I + N   + +  T +++     ++ P      W    +   Q  E  L+V 
Sbjct:  1055 -DGNEPYAITDDN-DYLPMANTGSLD-----LVSPLT----WTVIDSYEFQQFEIPLSVA 1103

Query:  1097 VVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133
             +V L  + TTK  +  +A+GT+  +GED+A RG   LF
Sbjct:  1104 LVNLEVSETTKLRKPYIAVGTSITKGEDIAVRGSTYLF 1141

 Score = 151 (58.2 bits), Expect = 4.0e-19, Sum P(2) = 4.0e-19
 Identities = 101/432 (23%), Positives = 181/432 (41%)

Query:   285 ALSISTTLKQHPL-IWSAMNLPHD--------AYKLLAVPSPIGGVLVVGANTIHYHSQS 335
             A ++ TT++  P  I++++++P            +L+ V S  G  + +G N+  Y+S+ 
Sbjct:   280 ASAVITTIQSLPYDIYASVSIPTPLGGSLLLGGNELIYVDSA-GRTVGIGVNS--YYSKC 336

Query:   336 ASCALA-LNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD 394
                 L   +++ + L+ +  +P +S   E               L     D +L      
Sbjct:   337 TDFPLQDQSDFNLELEGTIAIPLTSSKTETPFVVLVHTSGQFFYL-----DFLLDGKSVK 391

Query:   395 GRVVQRLDLSKTNPSVLTSDITTI---GNSLFFLGSRLGDSLLVQFT---------CGSG 442
             G  +Q LDL + N   L S IT     G +L FLGS+  DS L++++            G
Sbjct:   392 GLSLQALDL-EINDDFLKSGITCAVPAGENLVFLGSQTTDSYLLRWSRRTTNEEVRLDEG 450

Query:   443 TSMLSSGLKEEFGDI----EAD-APSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESA 497
                L      E  D+    E D +  +KR     +  L+  +  + L+  G  ++     
Sbjct:   451 DDTLYGTNDAEMDDMLDIYETDESVGSKRKIAYENGPLRLEIC-DVLTNIGPITDFAVGK 509

Query:   498 QKTFSFAVRDSLVNIGPLKDFSYGLRINAD-ASATGISKQSNYELV----ELPGCKGIWT 552
               ++S+  +D   N GPL+    G    AD A    + +++ + L+    +  GC+ +WT
Sbjct:   510 AGSYSYFPQD---NHGPLE--LVGTA-GADGAGGLVVFRRNIFPLIAGEFQFDGCEALWT 563

Query:   553 VYHKSSRGHNADSSRMAAYDD-EYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT 611
             V   S +  N  S   A Y + E   YL++S E  + +    +   EV  S D+    +T
Sbjct:   564 V-SISGKLRNMKSRIQAQYSNPELETYLVLSKEKESFIFLAGETFDEVQHS-DFSKDSKT 621

Query:   612 IAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPXXXXXXXXXXXXTVLSVSIADPY 670
             +  G+L    R++Q+     R+ D +  +TQ  +F               V+S SI DP 
Sbjct:   622 LNVGSLLSGMRMVQICPTSLRVYDSNLRLTQLFNFSKKQI----------VVSTSICDPC 671

Query:   671 VLLGMSDGSIRL 682
             +++    G I L
Sbjct:   672 IIVVFLGGGIAL 683

 Score = 38 (18.4 bits), Expect = 3.3e-30, Sum P(3) = 3.3e-30
 Identities = 8/18 (44%), Positives = 13/18 (72%)

Query:   527 DASATGISKQSNYELVEL 544
             ++  T  +K+S+ ELVEL
Sbjct:   837 ESERTYFNKESSQELVEL 854

 Score = 38 (18.4 bits), Expect = 6.8e-16, Sum P(3) = 6.8e-16
 Identities = 7/18 (38%), Positives = 12/18 (66%)

Query:   670 YVLLGMSDGSIRLLVGDP 687
             Y ++  + G++RLL  DP
Sbjct:  1268 YFVVADTSGNLRLLAYDP 1285


>UNIPROTKB|K7GNU1 [details] [associations]
            symbol:CPSF1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005634 "nucleus" evidence=IEA] [GO:0003676 "nucleic
            acid binding" evidence=IEA] InterPro:IPR004871 Pfam:PF03178
            GeneTree:ENSGT00550000075040 EMBL:CU468594
            Ensembl:ENSSSCT00000033207 Uniprot:K7GNU1
        Length = 757

 Score = 296 (109.3 bits), Expect = 8.2e-27, Sum P(2) = 8.2e-27
 Identities = 75/251 (29%), Positives = 121/251 (48%)

Query:   913 RITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIY 971
             R   F++I G+ G F+ G  P W +V  R  LR+HP   DG I +F   HNVNC  GF+Y
Sbjct:   244 RFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLY 303

Query:   972 VTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQ 1031
                QG L+I  LP+  +YD  WPV+K IPL+ T H + Y  E  +Y +  S     P  +
Sbjct:   304 FNRQGELRISVLPAYLSYDAPWPVRK-IPLRCTAHYVAYHVESKVYAVATSTNT--PCTR 360

Query:  1032 VLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQT--RATIPMQSS 1089
             +  +  +++    ID  +     +H     E + ++++ P      W+    A I ++  
Sbjct:   361 IPRMTGEEKEFETIDRDDRY---IHPQQ--EAFSIQLISPVS----WEAIPNARIELEEW 411

Query:  1090 ENALTVRVVTLFNTTTKEN-ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLS 1148
             E+   ++ V+L +  T    +  +A GT  +QGE+V  RGR+L+         P   +  
Sbjct:   412 EHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDVIEVVPEPGQPLTK 471

Query:  1149 GSYGPLFSSVQ 1159
               +  L+   Q
Sbjct:   472 NKFKVLYEKEQ 482

 Score = 94 (38.1 bits), Expect = 8.2e-27, Sum P(2) = 8.2e-27
 Identities = 27/98 (27%), Positives = 47/98 (47%)

Query:   759 ESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN 818
             E+G +EI+ +P++  VF V  F  G+  +VD+   +     E       EE T QG    
Sbjct:   104 ENGTMEIYQLPDWRLVFLVKNFPVGQRVLVDSSFGQPTTQGEAR----KEEATRQGELPL 159

Query:   819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY 856
             +  + +V L  ++      RP+L  +  D  +L Y+A+
Sbjct:   160 VKEVLLVALGSRQ-----RRPYLL-VHVDQELLIYEAF 191


>ASPGD|ASPL0000050546 [details] [associations]
            symbol:AN1413 species:162425 "Emericella nidulans"
            [GO:0008150 "biological_process" evidence=ND] [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=IEA] [GO:0005829 "cytosol" evidence=IEA] [GO:0003676
            "nucleic acid binding" evidence=IEA] InterPro:IPR004871
            Pfam:PF03178 GO:GO:0005634 EMBL:BN001307 GO:GO:0006397
            GO:GO:0003723 eggNOG:COG5161 KO:K14401 EMBL:AACD01000022
            RefSeq:XP_659017.1 EnsemblFungi:CADANIAT00008024 GeneID:2875502
            KEGG:ani:AN1413.2 HOGENOM:HOG000048586 OMA:HNDRIFQ
            OrthoDB:EOG451HZS Uniprot:Q5BDG7
        Length = 1339

 Score = 209 (78.6 bits), Expect = 6.7e-20, Sum P(2) = 6.7e-20
 Identities = 117/503 (23%), Positives = 202/503 (40%)

Query:   240 DLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 299
             D  + H     F++ Y EP   IL+ +  T    +  +      + +++    +    + 
Sbjct:   224 DPSVIHPISLAFLYEYREPTFGILYSQVATSHALLHERKDVVFYTVITLDLEQRASTTLL 283

Query:   300 SAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYAVSLDSSQELPRS 358
             S   LP D +K++A+P P+GG L++G+N  +H      + A+ +N ++    S     +S
Sbjct:   284 SVTRLPSDLFKVVALPPPVGGSLLIGSNELVHIDQAGKTNAVGVNEFSRQASSFSMTDQS 343

Query:   359 SFSVELDAAHATWLQNDVA--LLSTKTGDLVLLTVVYDGRVVQRLD---LSKTNPSVLTS 413
               ++ L+        +D    LL+  TG   L++   DGR V  +    LS  +   L S
Sbjct:   344 DLALRLENCVVERFSDDNGDLLLALSTGVFALVSFKLDGRSVSGISVRPLSGPSKEFLAS 403

Query:   414 DITT---IGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRS 470
               ++   +GN   F GS   DS+L+ ++  S  +  S        + E DA     L  S
Sbjct:   404 TASSSAFLGNGKVFFGSESADSVLLGWSSASSATKKSFSGSTSNDESEDDAYEDD-LYSS 462

Query:   471 SSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASA 530
             +  A+ D  N +        SN++ +A       + D L + GP++D   G    A +  
Sbjct:   463 APAAMTD--NPQN-----QPSNSSVAAFG--DLRIHDRLSSPGPIRDIVLGRSSEASSRD 513

Query:   531 T--GI----SKQSNYELVELPGCKGIWTVYHKSSRGHN-ADS----SRMAAYDDEYHAYL 579
             T  G+    + Q + E   +   K     Y  +S   + A+S    S +   +D+   Y+
Sbjct:   514 TKDGVLELVAAQGSDEGGTMVIMKREVDPYLVASMAADTANSLWTVSLLPDNNDQKRDYV 573

Query:   580 IISL-------EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGAR 632
             I+S        E+   VLE  D L  +T          T+  G L  + RVIQV     R
Sbjct:   574 ILSKQEKPDKEESEVFVLE--DKLRPITAPEFNPNHELTVEIGTLASKSRVIQVLRNEVR 631

Query:   633 ILDGSYMTQDLSFGPXXXXXXXXXXXXTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTV 692
               D  +   D                   ++ ++ DPY+ +   D ++ LL  D S    
Sbjct:   632 SYDAVWDEDD-------------SDERVAVNATLVDPYLAIIRDDSTLLLLQADDSGDLD 678

Query:   693 SVQTPAAIESSKKPVSSCTLYHD 715
              V     + S K  +S+C  Y D
Sbjct:   679 EVTLSEDVVSQKW-LSAC-FYSD 699

 Score = 150 (57.9 bits), Expect = 1.3e-13, Sum P(2) = 1.3e-13
 Identities = 135/596 (22%), Positives = 234/596 (39%)

Query:    44 SELPSKRGIG---PVPNLVVTAANVIEIYVVRVQXXXXXXXXXXXX-TKRRVLMDGISAA 99
             +EL S  G+     VP L  TA N+I      +Q             T+ R         
Sbjct:     5 TELISPTGVTHALAVPFLSATANNLIVARTSLLQIFSLRDVSLSALDTEVRPAQHRQETC 64

Query:   100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSM 159
              L L   Y+L G V  +  +           D++++AF DAK+S++E+D   +GL   S+
Sbjct:    65 KLVLEREYQLPGTVTDICRVKI--LKTKSGGDAVLVAFRDAKLSLVEWDPERYGLSTISI 122

Query:   160 HCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMI-ILKASQGGSGLVGDED 218
             H +E  +        +    G ++  DP  RC  +  +G + + I+   Q G  LV D+ 
Sbjct:   123 HYYERDDMTRSPWASDLSTCGSILSADPGSRCA-IFQFGARSLAIIPFHQPGDDLVMDD- 180

Query:   219 TFGSGGGFSARIES---SHVINLRDL---DMKHVKDFIFVHGYIEPVMVILHERELTWAG 272
              FGS   +  R+E    SH    +D       +   F+     ++P   ++H   L +  
Sbjct:   181 -FGSEPDYENRVEGNSRSHEAKDKDAAEYQTPYASSFVLPLTALDPS--VIHPISLAFLY 237

Query:   273 RVSWKHHTCMISALSISTTL---KQHPLIWSAMNLPHD---AYKLLAV---PSPIGGVLV 323
                      + S ++ S  L   ++  + ++ + L  +   +  LL+V   PS +  V+ 
Sbjct:   238 EYREPTFGILYSQVATSHALLHERKDVVFYTVITLDLEQRASTTLLSVTRLPSDLFKVVA 297

Query:   324 ----VGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVA-L 378
                 VG + +   ++      A    AV ++       SSFS+   +  A  L+N V   
Sbjct:   298 LPPPVGGSLLIGSNELVHIDQAGKTNAVGVNEFSR-QASSFSMTDQSDLALRLENCVVER 356

Query:   379 LSTKTGDLVL-LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQF 437
              S   GDL+L L+      V  +LD  ++   +    ++  G S  FL S    S  +  
Sbjct:   357 FSDDNGDLLLALSTGVFALVSFKLD-GRSVSGISVRPLS--GPSKEFLASTASSSAFL-- 411

Query:   438 TCGSGTSMLSSGLKEE--FGDIEADAPSTKRLRRSSS-DALQDMVNGEELSLYGSA-SNN 493
               G+G     S   +    G   A + + K    S+S D  +D  +  E  LY SA +  
Sbjct:   412 --GNGKVFFGSESADSVLLGWSSASSATKKSFSGSTSNDESED--DAYEDDLYSSAPAAM 467

Query:   494 TESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTV 553
             T++ Q   S     S+   G L+      R+++      I    + E        G+  +
Sbjct:   468 TDNPQNQPS---NSSVAAFGDLRIHD---RLSSPGPIRDIVLGRSSEASSRDTKDGVLEL 521

Query:   554 YHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTM-VLETADLLTEVTESV-DYFV 607
                +++G + +   M     E   YL+ S+ A T   L T  LL +  +   DY +
Sbjct:   522 V--AAQGSD-EGGTMVIMKREVDPYLVASMAADTANSLWTVSLLPDNNDQKRDYVI 574

 Score = 125 (49.1 bits), Expect = 6.7e-20, Sum P(2) = 6.7e-20
 Identities = 38/178 (21%), Positives = 82/178 (46%)

Query:   968 GFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLK 1027
             GF Y+ S G L + +LP G+     W + + +P+     ++TY +  + Y       VL 
Sbjct:   881 GFAYLDSHG-LHLAKLPEGTQLGYPW-IMRTVPIGQQIDKLTYVSASDTY-------VLG 931

Query:  1028 PLNQV-LSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPM 1086
                +    L  D E+  +  N  +S +       V +  ++++ P      W    + P+
Sbjct:   932 TCQRCEFRLPEDDELHPEWRNEEISFLP-----EVNQSSLKVVSPKT----WSVIDSYPL 982

Query:  1087 QSSENALTVRVVTL-FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQ 1143
             + +E+ + ++ ++L  +  T E   ++ +GT+  +GED+ +RG + +F       +P+
Sbjct:   983 EPAEHIMVMKTMSLEVSENTHERRDMIVVGTSLARGEDIPSRGCIYVFEVIEVVPDPE 1040

 Score = 49 (22.3 bits), Expect = 8.6e-06, Sum P(2) = 8.6e-06
 Identities = 17/43 (39%), Positives = 23/43 (53%)

Query:   588 MVLETADL-LTEVT-ESVDYFVQGRTIAAGNLFGRRRVIQVFE 628
             MV++T  L ++E T E  D  V G ++A G     R  I VFE
Sbjct:   989 MVMKTMSLEVSENTHERRDMIVVGTSLARGEDIPSRGCIYVFE 1031


>CGD|CAL0004251
            symbol:orf19.2760 species:5476 "Candida albicans" [GO:0042493
            "response to drug" evidence=IMP] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA]
            [GO:0005829 "cytosol" evidence=IEA] [GO:0005739 "mitochondrion"
            evidence=IEA] [GO:0006369 "termination of RNA polymerase II
            transcription" evidence=IEA] [GO:0006379 "mRNA cleavage"
            evidence=IEA] [GO:0006378 "mRNA polyadenylation" evidence=IEA]
            [GO:0003723 "RNA binding" evidence=IEA] InterPro:IPR004871
            Pfam:PF03178 CGD:CAL0004251 GO:GO:0005634 GO:GO:0042493
            GO:GO:0006397 GO:GO:0003723 EMBL:AACQ01000023 EMBL:AACQ01000025
            RefSeq:XP_720278.1 RefSeq:XP_720279.1 RefSeq:XP_720280.1
            RefSeq:XP_720510.1 STRING:Q5AFT3 GeneID:3637848 GeneID:3638158
            GeneID:3638159 GeneID:3638160 KEGG:cal:CaO19.10274
            KEGG:cal:CaO19.10275 KEGG:cal:CaO19.10276 KEGG:cal:CaO19.2760
            eggNOG:COG5161 KO:K14401 Uniprot:Q5AFT3
        Length = 1420

 Score = 224 (83.9 bits), Expect = 1.7e-17, Sum P(3) = 1.7e-17
 Identities = 77/312 (24%), Positives = 138/312 (44%)

Query:   231 ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
             +SS +I+   LD  +  V D  F+H Y EP + +L  ++  WAG +           L++
Sbjct:   217 DSSFIIDATTLDSSIDTVVDMQFLHNYREPTIAVLSSKQEVWAGNLIKSKDNIQFQVLTL 276

Query:   289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA- 346
                LK    ++   NLP++  +++ +PSP+ G L+VG N  IH  +      +A+N +  
Sbjct:   277 DLNLKSTISVFKIDNLPYEIDRIIPLPSPLNGTLLVGCNELIHVDNGGVLKRIAVNKFTR 336

Query:   347 ---VSLDSSQELPRSSFSVELDAAHATWLQNDV-ALLSTKTGDLVLLTVVYDGRVVQRLD 402
                 S  S Q+  +S  +++L+      + +D   LL  +TG+   +    DG+ ++R+ 
Sbjct:   337 LITASFKSFQD--QSDLNLKLENCSVVPIPDDHRVLLILQTGEFYFINFELDGKSIKRIH 394

Query:   403 LS----KTNPSVLTS---DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
             +     KT   +  +   ++  +  ++ F+ +  G+S L+Q      +S  S   + +  
Sbjct:   395 IDNVDKKTYDKIQLNHPGEVAILDKNMLFIANSNGNSPLIQVRYRD-SSKTSDTKESKLN 453

Query:   456 DIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS-----FAVRDSLV 510
              IE      K   +   D   D    +E  LY       E  QKT S     F   D L+
Sbjct:   454 KIEE-----KEDNKDDDDNDDD----DEDDLY--KEEEEEETQKTISKSHIEFLYHDELI 502

Query:   511 NIGPLKDFSYGL 522
             N GP   F+ G+
Sbjct:   503 NNGPSSTFTLGI 514

 Score = 76 (31.8 bits), Expect = 1.7e-17, Sum P(3) = 1.7e-17
 Identities = 21/95 (22%), Positives = 45/95 (47%)

Query:   906 PHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVFRERL-RVHPQLCDGSIVAFTVLHNV 963
             P+G   +R +  F N++G    F++G  P   +     + R+  Q    + ++ +   + 
Sbjct:   875 PYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKTVHSIPRIF-QFSKIAAMSISAFSDS 933

Query:   964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
                +G I++ +Q   +IC+LP    Y+   P++ V
Sbjct:   934 KIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHV 968

 Score = 61 (26.5 bits), Expect = 1.7e-17, Sum P(3) = 1.7e-17
 Identities = 14/63 (22%), Positives = 35/63 (55%)

Query:   101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
             L+L+  ++L G +  L  +     +N    D ++++ + AK S++++D  ++ +   S+H
Sbjct:    57 LKLIDQFKLQGTITDLKSIRT--IENPNL-DYLMVSTKYAKFSIIKWDHHLNTIATVSLH 113

Query:   161 CFE 163
              +E
Sbjct:   114 YYE 116


>UNIPROTKB|Q5AFT3
            symbol:CFT1 "Protein CFT1" species:237561 "Candida albicans
            SC5314" [GO:0042493 "response to drug" evidence=IMP]
            InterPro:IPR004871 Pfam:PF03178 CGD:CAL0004251 GO:GO:0005634
            GO:GO:0042493 GO:GO:0006397 GO:GO:0003723 EMBL:AACQ01000023
            EMBL:AACQ01000025 RefSeq:XP_720278.1 RefSeq:XP_720279.1
            RefSeq:XP_720280.1 RefSeq:XP_720510.1 STRING:Q5AFT3 GeneID:3637848
            GeneID:3638158 GeneID:3638159 GeneID:3638160 KEGG:cal:CaO19.10274
            KEGG:cal:CaO19.10275 KEGG:cal:CaO19.10276 KEGG:cal:CaO19.2760
            eggNOG:COG5161 KO:K14401 Uniprot:Q5AFT3
        Length = 1420

 Score = 224 (83.9 bits), Expect = 1.7e-17, Sum P(3) = 1.7e-17
 Identities = 77/312 (24%), Positives = 138/312 (44%)

Query:   231 ESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
             +SS +I+   LD  +  V D  F+H Y EP + +L  ++  WAG +           L++
Sbjct:   217 DSSFIIDATTLDSSIDTVVDMQFLHNYREPTIAVLSSKQEVWAGNLIKSKDNIQFQVLTL 276

Query:   289 STTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANT-IHYHSQSASCALALNNYA- 346
                LK    ++   NLP++  +++ +PSP+ G L+VG N  IH  +      +A+N +  
Sbjct:   277 DLNLKSTISVFKIDNLPYEIDRIIPLPSPLNGTLLVGCNELIHVDNGGVLKRIAVNKFTR 336

Query:   347 ---VSLDSSQELPRSSFSVELDAAHATWLQNDV-ALLSTKTGDLVLLTVVYDGRVVQRLD 402
                 S  S Q+  +S  +++L+      + +D   LL  +TG+   +    DG+ ++R+ 
Sbjct:   337 LITASFKSFQD--QSDLNLKLENCSVVPIPDDHRVLLILQTGEFYFINFELDGKSIKRIH 394

Query:   403 LS----KTNPSVLTS---DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFG 455
             +     KT   +  +   ++  +  ++ F+ +  G+S L+Q      +S  S   + +  
Sbjct:   395 IDNVDKKTYDKIQLNHPGEVAILDKNMLFIANSNGNSPLIQVRYRD-SSKTSDTKESKLN 453

Query:   456 DIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFS-----FAVRDSLV 510
              IE      K   +   D   D    +E  LY       E  QKT S     F   D L+
Sbjct:   454 KIEE-----KEDNKDDDDNDDD----DEDDLY--KEEEEEETQKTISKSHIEFLYHDELI 502

Query:   511 NIGPLKDFSYGL 522
             N GP   F+ G+
Sbjct:   503 NNGPSSTFTLGI 514

 Score = 76 (31.8 bits), Expect = 1.7e-17, Sum P(3) = 1.7e-17
 Identities = 21/95 (22%), Positives = 45/95 (47%)

Query:   906 PHGAPCQR-ITIFKNISGHQGFFLSGSRPCWCMVFRERL-RVHPQLCDGSIVAFTVLHNV 963
             P+G   +R +  F N++G    F++G  P   +     + R+  Q    + ++ +   + 
Sbjct:   875 PYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKTVHSIPRIF-QFSKIAAMSISAFSDS 933

Query:   964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKV 998
                +G I++ +Q   +IC+LP    Y+   P++ V
Sbjct:   934 KIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHV 968

 Score = 61 (26.5 bits), Expect = 1.7e-17, Sum P(3) = 1.7e-17
 Identities = 14/63 (22%), Positives = 35/63 (55%)

Query:   101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
             L+L+  ++L G +  L  +     +N    D ++++ + AK S++++D  ++ +   S+H
Sbjct:    57 LKLIDQFKLQGTITDLKSIRT--IENPNL-DYLMVSTKYAKFSIIKWDHHLNTIATVSLH 113

Query:   161 CFE 163
              +E
Sbjct:   114 YYE 116


>TAIR|locus:2115909
            symbol:DDB1A "damaged DNA binding protein 1A"
            species:3702 "Arabidopsis thaliana" [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0003677 "DNA binding" evidence=ISS]
            [GO:0005634 "nucleus" evidence=ISM;IEA;IDA;IPI] [GO:0010100
            "negative regulation of photomorphogenesis" evidence=IGI;RCA]
            [GO:0045892 "negative regulation of transcription, DNA-dependent"
            evidence=IPI] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0080008 "Cul4-RING ubiquitin ligase complex" evidence=IPI]
            [GO:0005829 "cytosol" evidence=IDA] [GO:0000278 "mitotic cell
            cycle" evidence=RCA] [GO:0000911 "cytokinesis by cell plate
            formation" evidence=RCA] [GO:0003002 "regionalization"
            evidence=RCA] [GO:0006281 "DNA repair" evidence=RCA] [GO:0006486
            "protein glycosylation" evidence=RCA] [GO:0007155 "cell adhesion"
            evidence=RCA] [GO:0008284 "positive regulation of cell
            proliferation" evidence=RCA] [GO:0009630 "gravitropism"
            evidence=RCA] [GO:0009639 "response to red or far red light"
            evidence=RCA] [GO:0010090 "trichome morphogenesis" evidence=RCA]
            [GO:0033043 "regulation of organelle organization" evidence=RCA]
            [GO:0045010 "actin nucleation" evidence=RCA] [GO:0048449 "floral
            organ formation" evidence=RCA] [GO:0048608 "reproductive structure
            development" evidence=RCA] InterPro:IPR017986 InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 UniPathway:UPA00143 GO:GO:0005829
            GO:GO:0005634 GO:GO:0045892 EMBL:CP002687 GenomeReviews:CT486007_GR
            Gene3D:2.130.10.10 SUPFAM:SSF50978 GO:GO:0003677 GO:GO:0006281
            GO:GO:0016567 GO:GO:0009585 EMBL:AL161503 GO:GO:0080008
            GO:GO:0010100 EMBL:AY074257 EMBL:BT001905 EMBL:AK230366
            IPI:IPI00548104 PIR:B85068 RefSeq:NP_192451.1 UniGene:At.32663
            UniGene:At.47587 ProteinModelPortal:Q9M0V3 DIP:DIP-40455N
            IntAct:Q9M0V3 STRING:Q9M0V3 PaxDb:Q9M0V3 PRIDE:Q9M0V3 ProMEX:Q9M0V3
            EnsemblPlants:AT4G05420.1 GeneID:825890 KEGG:ath:AT4G05420
            GeneFarm:4660 TAIR:At4g05420 eggNOG:NOG247734 HOGENOM:HOG000007241
            InParanoid:Q9M0V3 KO:K10610 OMA:CALGDGS PhylomeDB:Q9M0V3
            ProtClustDB:CLSN2685347 Genevestigator:Q9M0V3 GermOnline:AT4G05420
            Uniprot:Q9M0V3
        Length = 1088

 Score = 91 (37.1 bits), Expect = 7.1e-07, Sum P(4) = 7.1e-07
 Identities = 33/120 (27%), Positives = 55/120 (45%)

Query:   226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
             F  + +     N+R L+   V D  F+ G  +P + +L++           +H    +  
Sbjct:   144 FDNKGQLKEAFNIR-LEELQVLDIKFLFGCAKPTIAVLYQ------DNKDARH----VKT 192

Query:   286 LSISTTLKQHPLI---WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALAL 342
               +S  LK    +   WS  +L + A  L+ VP P+ GVL++G  TI Y S SA  A+ +
Sbjct:   193 YEVS--LKDKDFVEGPWSQNSLDNGADLLIPVPPPLCGVLIIGEETIVYCSASAFKAIPI 250

 Score = 74 (31.1 bits), Expect = 7.1e-07, Sum P(4) = 7.1e-07
 Identities = 18/59 (30%), Positives = 31/59 (52%)

Query:   378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
             LL    G + LL + ++   V  L +     + + S I+ + N++ F+GS  GDS LV+
Sbjct:   269 LLGDHAGMIHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVFVGSSYGDSQLVK 327

 Score = 71 (30.1 bits), Expect = 7.1e-07, Sum P(4) = 7.1e-07
 Identities = 36/133 (27%), Positives = 64/133 (48%)

Query:   513 GPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYD 572
             G  KD S  LR+  +    GI++Q++   VEL G KG+W++  KSS             D
Sbjct:   372 GAFKDGS--LRVVRNG--IGINEQAS---VELQGIKGMWSL--KSS------------ID 410

Query:   573 DEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG 630
             + +  +L++S   E R + +   D L E TE   +  Q +T+   +     +++QV    
Sbjct:   411 EAFDTFLVVSFISETRILAMNLEDELEE-TEIEGFLSQVQTLFCHDAV-YNQLVQVTSNS 468

Query:   631 ARILDGSYMTQDL 643
              R++  +  T++L
Sbjct:   469 VRLVSST--TREL 479

 Score = 65 (27.9 bits), Expect = 7.1e-07, Sum P(4) = 7.1e-07
 Identities = 26/99 (26%), Positives = 45/99 (45%)

Query:  1039 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1095
             + + HQ          L      EE E   VR+L+ D+    ++  +T P+ S E   ++
Sbjct:   716 RRICHQEQTRTFGICSLGNQSNSEESEMHFVRLLD-DQT---FEFMSTYPLDSFEYGCSI 771

Query:  1096 RVVTLFNTTTKENETLLAIGTAYVQGED-VAARGRVLLF 1133
                 L  + T++      +GTAYV  E+    +GR+L+F
Sbjct:   772 ----LSCSFTEDKNVYYCVGTAYVLPEENEPTKGRILVF 806


>FB|FBgn0260962
            symbol:pic "piccolo" species:7227 "Drosophila melanogaster"
            [GO:0003684 "damaged DNA binding" evidence=ISS] [GO:0006289
            "nucleotide-excision repair" evidence=ISS;NAS] [GO:0005634
            "nucleus" evidence=IEA] [GO:0006974 "response to DNA damage
            stimulus" evidence=IMP] [GO:0035220 "wing disc development"
            evidence=IMP] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0042787 "protein ubiquitination involved in ubiquitin-dependent
            protein catabolic process" evidence=ISS] [GO:0007307 "eggshell
            chorion gene amplification" evidence=IDA] [GO:0007095 "mitotic G2
            DNA damage checkpoint" evidence=IGI] InterPro:IPR004871
            Pfam:PF03178 UniPathway:UPA00143 EMBL:AE014297 GO:GO:0005634
            GO:GO:0005737 GO:GO:0007095 GO:GO:0043161 GO:GO:0003677
            GO:GO:0006281 GO:GO:0035220 GO:GO:0042787 GO:GO:0007307
            eggNOG:NOG247734 KO:K10610 OMA:CALGDGS GeneTree:ENSGT00530000063396
            HSSP:Q16531 EMBL:AF132145 RefSeq:NP_650257.1 UniGene:Dm.3215
            ProteinModelPortal:Q9XYZ5 SMR:Q9XYZ5 STRING:Q9XYZ5 PaxDb:Q9XYZ5
            PRIDE:Q9XYZ5 EnsemblMetazoa:FBtr0082709 GeneID:41611
            KEGG:dme:Dmel_CG7769 UCSC:CG7769-RA CTD:41611 FlyBase:FBgn0260962
            InParanoid:Q9XYZ5 OrthoDB:EOG4S1RP0 PhylomeDB:Q9XYZ5
            GenomeRNAi:41611 NextBio:824642 Bgee:Q9XYZ5 Uniprot:Q9XYZ5
        Length = 1140

 Score = 141 (54.7 bits), Expect = 2.9e-06, Sum P(5) = 2.9e-06
 Identities = 59/205 (28%), Positives = 94/205 (45%)

Query:   237 NLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHP 296
             NLR +D  +V D  F+HG + P ++++H+      GR    H    I+ L     +K   
Sbjct:   156 NLR-MDELNVYDVEFLHGCLNPTVIVIHKDS---DGRHVKSHE---IN-LRDKEFMK--- 204

Query:   297 LIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELP 356
             + W   N+  +A  L+ VPSPIGGV+V+G  +I YH  S       N +AV+       P
Sbjct:   205 IAWKQDNVETEATMLIPVPSPIGGVIVIGRESIVYHDGS-------NYHAVA-------P 250

Query:   357 RSSFSVELDAAHATWLQNDVA-LLSTKTGDLVLLTV----VYDGRVVQRLDLSKTNPSVL 411
              + F       +A    N +  LL    G L +L +       G  V+ + + +     +
Sbjct:   251 LT-FRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSKGVTVKDIKVEQLGEISI 309

Query:   412 TSDITTIGNSLFFLGSRLGDSLLVQ 436
                IT + N   ++G+R GDS LV+
Sbjct:   310 PECITYLDNGFLYIGARHGDSQLVR 334

 Score = 64 (27.6 bits), Expect = 2.9e-06, Sum P(5) = 2.9e-06
 Identities = 31/152 (20%), Positives = 60/152 (39%)

Query:   532 GISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLE 591
             GI  Q  +  ++LPG KG+W++             ++   +  Y   L+++    T +L 
Sbjct:   391 GIGIQE-HACIDLPGIKGMWSL-------------KVGVDESPYENTLVLAFVGHTRILT 436

Query:   592 TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPXXXX 651
              +    E TE   +    +T    N+    ++IQV     R++  +       + P    
Sbjct:   437 LSGEEVEETEIPGFASDLQTFLCSNV-DYDQLIQVTSDSVRLVSSATKALVAEWRPTGDR 495

Query:   652 XXXXXXXXT--VLSVSIADPYVLLGMSDGSIR 681
                     T  +L  S  D + ++ + DGS+R
Sbjct:   496 TIGVVSCNTTQILVASACDIFYIV-IEDGSLR 526

 Score = 50 (22.7 bits), Expect = 2.9e-06, Sum P(5) = 2.9e-06
 Identities = 12/28 (42%), Positives = 16/28 (57%)

Query:  1034 SLLIDQEVGHQIDNHNLSSVDLHRTYTV 1061
             S   + EVG +ID HNL  +D   T+ V
Sbjct:   776 STAANAEVGQEIDVHNLLVID-QNTFEV 802

 Score = 41 (19.5 bits), Expect = 2.9e-06, Sum P(5) = 2.9e-06
 Identities = 10/25 (40%), Positives = 15/25 (60%)

Query:   180 GPLVKVDPQGRCGGVLVY-GLQMII 203
             G +  +DP+ R  G+ +Y GL  II
Sbjct:   119 GVIAAIDPKARVIGMCLYQGLFTII 143

 Score = 38 (18.4 bits), Expect = 2.9e-06, Sum P(5) = 2.9e-06
 Identities = 7/20 (35%), Positives = 12/20 (60%)

Query:   670 YVLLGMSDGSIRLLVGDPST 689
             Y+L  + DGS+   + D +T
Sbjct:   600 YLLCALGDGSMYYFIMDQTT 619


>TAIR|locus:2127368
            symbol:DDB1B "damaged DNA binding protein 1B"
            species:3702 "Arabidopsis thaliana" [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0005634 "nucleus" evidence=ISM;IEA;IDA]
            [GO:0003684 "damaged DNA binding" evidence=ISS] [GO:0009793 "embryo
            development ending in seed dormancy" evidence=IMP] [GO:0005515
            "protein binding" evidence=IPI] [GO:0005829 "cytosol" evidence=RCA]
            [GO:0006281 "DNA repair" evidence=RCA] [GO:0007062 "sister
            chromatid cohesion" evidence=RCA] [GO:0009880 "embryonic pattern
            specification" evidence=RCA] [GO:0010072 "primary shoot apical
            meristem specification" evidence=RCA] [GO:0010100 "negative
            regulation of photomorphogenesis" evidence=RCA] [GO:0010162 "seed
            dormancy process" evidence=RCA] [GO:0010431 "seed maturation"
            evidence=RCA] [GO:0010564 "regulation of cell cycle process"
            evidence=RCA] [GO:0045595 "regulation of cell differentiation"
            evidence=RCA] [GO:0048366 "leaf development" evidence=RCA]
            [GO:0048608 "reproductive structure development" evidence=RCA]
            [GO:0048825 "cotyledon development" evidence=RCA] [GO:0051301 "cell
            division" evidence=RCA] InterPro:IPR017986 InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 UniPathway:UPA00143 GO:GO:0005634
            EMBL:CP002687 GenomeReviews:CT486007_GR Gene3D:2.130.10.10
            SUPFAM:SSF50978 EMBL:AL161554 GO:GO:0003677 GO:GO:0006281
            GO:GO:0009793 GO:GO:0016567 GO:GO:0009585 EMBL:AL021960
            UniGene:At.32663 eggNOG:NOG247734 HOGENOM:HOG000007241 KO:K10610
            ProtClustDB:CLSN2685347 EMBL:AK220648 EMBL:AK229805 IPI:IPI00536598
            PIR:T04941 RefSeq:NP_193842.1 ProteinModelPortal:O49552 SMR:O49552
            DIP:DIP-46981N IntAct:O49552 STRING:O49552 PaxDb:O49552
            PRIDE:O49552 EnsemblPlants:AT4G21100.1 GeneID:827857
            KEGG:ath:AT4G21100 GeneFarm:4661 TAIR:At4g21100 InParanoid:O49552
            OMA:DRPAVIY PhylomeDB:O49552 Genevestigator:O49552
            GermOnline:AT4G21100 Uniprot:O49552
        Length = 1088

 Score = 100 (40.3 bits), Expect = 3.1e-06, Sum P(4) = 3.1e-06
 Identities = 35/117 (29%), Positives = 56/117 (47%)

Query:   226 FSARIESSHVINLRDLDMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISA 285
             F  + +     N+R L+   V D  F++G  +P + +L++     A  V     T  +S 
Sbjct:   144 FDNKGQLKEAFNIR-LEELQVLDIKFLYGCTKPTIAVLYQDNKD-ARHVK----TYEVSL 197

Query:   286 LSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALAL 342
                     + P  WS  NL + A  L+ VPSP+ GVL++G  TI Y S +A  A+ +
Sbjct:   198 KD--KNFVEGP--WSQNNLDNGADLLIPVPSPLCGVLIIGEETIVYCSANAFKAIPI 250

 Score = 73 (30.8 bits), Expect = 3.1e-06, Sum P(4) = 3.1e-06
 Identities = 17/59 (28%), Positives = 31/59 (52%)

Query:   378 LLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQ 436
             LL    G + LL + ++   V  L +     + + S I+ + N++ F+GS  GDS L++
Sbjct:   269 LLGDHAGLIHLLVITHEKEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIK 327

 Score = 68 (29.0 bits), Expect = 3.1e-06, Sum P(4) = 3.1e-06
 Identities = 36/133 (27%), Positives = 64/133 (48%)

Query:   513 GPLKDFSYGLRINADASATGISKQSNYELVELPGCKGIWTVYHKSSRGHNADSSRMAAYD 572
             G  KD S  LRI  +    GI++Q++   VEL G KG+W++  KSS             D
Sbjct:   372 GAYKDGS--LRIVRNG--IGINEQAS---VELQGIKGMWSL--KSS------------ID 410

Query:   573 DEYHAYLIISL--EARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERG 630
             + +  +L++S   E R + +   D L E TE   +  + +T+   +     +++QV    
Sbjct:   411 EAFDTFLVVSFISETRILAMNIEDELEE-TEIEGFLSEVQTLFCHDAV-YNQLVQVTSNS 468

Query:   631 ARILDGSYMTQDL 643
              R++  +  T++L
Sbjct:   469 VRLVSST--TREL 479

 Score = 53 (23.7 bits), Expect = 3.1e-06, Sum P(4) = 3.1e-06
 Identities = 23/99 (23%), Positives = 44/99 (44%)

Query:  1039 QEVGHQIDNHNLSSVDLHRTYTVEEYE---VRILEPDRAGGPWQTRATIPMQSSENALTV 1095
             + + HQ      +   L    + EE E   VR+L+       ++  ++ P+ + E   ++
Sbjct:   716 RRICHQEQTRTFAISCLRNEPSAEESESHFVRLLDAQS----FEFLSSYPLDAFECGCSI 771

Query:  1096 RVVTLFNTTTKENETLLAIGTAYVQGED-VAARGRVLLF 1133
                 L  + T +      +GTAYV  E+    +GR+L+F
Sbjct:   772 ----LSCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVF 806


>SGD|S000002709
            symbol:CFT1 "RNA-binding subunit of the mRNA cleavage and
            polyadenylation factor" species:4932 "Saccharomyces cerevisiae"
            [GO:0005634 "nucleus" evidence=IEA;IDA] [GO:0003723 "RNA binding"
            evidence=IEA;IDA] [GO:0006397 "mRNA processing" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005739
            "mitochondrion" evidence=IDA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IDA;IPI]
            [GO:0006369 "termination of RNA polymerase II transcription"
            evidence=IMP] [GO:0006378 "mRNA polyadenylation" evidence=IDA;TAS]
            [GO:0006379 "mRNA cleavage" evidence=IDA;TAS] [GO:0005849 "mRNA
            cleavage factor complex" evidence=IPI] InterPro:IPR004871
            Pfam:PF03178 SGD:S000002709 GO:GO:0005739 GO:GO:0006378
            EMBL:BK006938 GO:GO:0003723 EMBL:U28374 eggNOG:COG5161 KO:K14401
            OMA:HNDRIFQ GO:GO:0005847 GO:GO:0006379 PIR:S61187
            RefSeq:NP_010587.1 ProteinModelPortal:Q06632 DIP:DIP-2467N
            IntAct:Q06632 MINT:MINT-375530 STRING:Q06632 PaxDb:Q06632
            PeptideAtlas:Q06632 EnsemblFungi:YDR301W GeneID:851895
            KEGG:sce:YDR301W CYGD:YDR301w GeneTree:ENSGT00550000075040
            HOGENOM:HOG000246682 OrthoDB:EOG4D29XZ NextBio:969889
            Genevestigator:Q06632 GermOnline:YDR301W GO:GO:0006369
            Uniprot:Q06632
        Length = 1357

 Score = 91 (37.1 bits), Expect = 8.2e-05, Sum P(3) = 8.2e-05
 Identities = 35/157 (22%), Positives = 69/157 (43%)

Query:   244 KHVKDFIFVHGYIEPVMVILHERELTWAGR--VSWKHHTCMISALSI----STTLKQHPL 297
             K++ D  F+  + +P + +L++ +L WAG   +S      +I  L+I    S T  +   
Sbjct:   211 KNIIDIQFLKNFTKPTIALLYQPKLVWAGNTTISKLPTQYVILTLNIQPAESATKIESTT 270

Query:   298 IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA--SCALALNNYA-VSLDSSQE 354
             I     LP D + ++ V +   G ++VG N + +   +      + LN++A   L  ++ 
Sbjct:   271 IAFVKELPWDLHTIVPVSN---GAIIVGTNELAFLDNTGVLQSTVLLNSFADKELQKTKI 327

Query:   355 LPRSSFSVELDAAHAT--WLQNDVALLSTKTGDLVLL 389
             +  SS  +     + T  W+ +  +       D  LL
Sbjct:   328 INNSSLEIMFREKNTTSIWIPSSKSKNGGSNNDETLL 364

 Score = 85 (35.0 bits), Expect = 8.2e-05, Sum P(3) = 8.2e-05
 Identities = 70/331 (21%), Positives = 130/331 (39%)

Query:   374 NDVALLSTKTGDLVLLTVVYDGRVVQRLDLSK---TNPSVLTSDITTIGNSLFFLGSRLG 430
             ++  LL     ++  + +  +GR++ + D+ K    N  +  +        L    S   
Sbjct:   360 DETLLLMDLKSNIYYIQMEAEGRLLIKFDIFKLPIVNDLLKENSNPKCITRLNATNSNKN 419

Query:   431 DSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPS--TKRLRRSSSDALQDM--VNGEELSL 486
               L + F  G+   +  + LK      EA  PS  T  L   + D  ++M  +  +E   
Sbjct:   420 MDLFIGFGSGNALVLRLNNLKSTIETREAHNPSSGTNSLMDINDDDDEEMDDLYADEAPE 479

Query:   487 YGSASNNTESAQKT---FSFAVRDSLVNIGPLKDFSYGLRINADASATGISK--QSNYEL 541
              G  +N+++   +T   F   +  SL N+GP+   + G   + D    G+    ++ Y L
Sbjct:   480 NGLTTNDSKGTVETVQPFDIELLSSLRNVGPITSLTVGKVSSIDDVVKGLPNPNKNEYSL 539

Query:   542 VELPGC-KGIW-TVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLL--- 596
             V   G   G   TV   S +     + +  +    ++    + ++ R   L T D     
Sbjct:   540 VATSGNGSGSHLTVIQTSVQPEIELALKFISITQIWN----LKIKGRDRYLITTDSTKSR 595

Query:   597 TEVTESVDYFVQGRTIAAGNLFGRRRV----IQVFERGARILDGSYMTQDLS-FGPXXXX 651
             +++ ES + F   +    G L  RR      I +F    RI+  +  T  L  +      
Sbjct:   596 SDIYESDNNF---KLHKGGRL--RRDATTVYISMFGEEKRIIQVT--TNHLYLYDTHFRR 648

Query:   652 XXXXXXXXTVLSVSIADPYVLLGMSDGSIRL 682
                      V+ VS+ DPY+L+ +S G I++
Sbjct:   649 LTTIKFDYEVIHVSVMDPYILVTVSRGDIKI 679

 Score = 63 (27.2 bits), Expect = 8.2e-05, Sum P(3) = 8.2e-05
 Identities = 20/91 (21%), Positives = 41/91 (45%)

Query:   101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH 160
             L L   ++ HG +  + ++ Q  +  S     ++L    AKIS+L+F+   + +   S+H
Sbjct:    48 LYLTDEFKFHGLITDIGLIPQKDSPLS----CLLLCTGVAKISILKFNTLTNSIDTLSLH 103

Query:   161 CFESPEWLHLKRGRESFARGPLVKVDPQGRC 191
              +E        +     A+   +++DP   C
Sbjct:   104 YYEGK---FKGKSLVELAKISTLRMDPGSSC 131


>ZFIN|ZDB-GENE-040426-1272
            symbol:ddb1 "damage specific DNA binding protein
            1" species:7955 "Danio rerio" [GO:0005634 "nucleus" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 ZFIN:ZDB-GENE-040426-1272
            GO:GO:0005634 Gene3D:2.130.10.10 GO:GO:0003676 EMBL:JQ692623
            UniGene:Dr.77970 Uniprot:I1XUS8
        Length = 1140

 Score = 116 (45.9 bits), Expect = 0.00024, Sum P(3) = 0.00024
 Identities = 42/164 (25%), Positives = 74/164 (45%)

Query:   299 WSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRS 358
             W   N+  +A  ++ VP P GG +++G  +I YH+     A+A       +  S  +  +
Sbjct:   207 WKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAVA----PPIIKQSTIVCHN 262

Query:   359 SFSVELDAAHATWLQNDVALLSTKTGDLVLLTV----VYDGRVVQR-LDLSKTNPSVLTS 413
                  +D   + +L  D+       G L +L +    + DG VV + L +     + +  
Sbjct:   263 ----RVDPNGSRYLLGDME------GRLFMLLLEKEELMDGAVVLKDLHVELLGETSIAE 312

Query:   414 DITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDI 457
              +T + N + F+GSRLGDS LV+    S       G+ E F ++
Sbjct:   313 CLTYLDNGVVFVGSRLGDSQLVKLNVDSNDQGSYVGVMETFTNL 356

 Score = 71 (30.1 bits), Expect = 0.00024, Sum P(3) = 0.00024
 Identities = 26/93 (27%), Positives = 45/93 (48%)

Query:   542 VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTE 601
             ++LPG KG+W +  +SSR    D+      DD     L++S   +T VL  +    E TE
Sbjct:   402 IDLPGIKGLWPLRSESSR----DT------DD----MLVLSFVGQTRVLMLSGEEVEETE 447

Query:   602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARIL 634
                +    +T   GN+   +++IQ+     R++
Sbjct:   448 LQGFVDNQQTFFCGNV-AHQQLIQITSVSVRLV 479

 Score = 44 (20.5 bits), Expect = 0.00024, Sum P(3) = 0.00024
 Identities = 12/40 (30%), Positives = 20/40 (50%)

Query:   984 PSGSTYDNYWPVQ--KVIPLKATPHQITYFAEKNLYPLIV 1021
             PS ST      V   K+ P   +PH+ ++  E  ++ L+V
Sbjct:   754 PSASTQALSSSVSSSKLFPSSTSPHETSFGEEVEVHSLLV 793

 Score = 44 (20.5 bits), Expect = 0.00024, Sum P(3) = 0.00024
 Identities = 12/50 (24%), Positives = 24/50 (48%)

Query:   792 MREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFL 841
             +  ++  S+   +S+S   T  G +  +HS+ VV+     +   H+  FL
Sbjct:   761 LSSSVSSSKLFPSSTSPHETSFGEEVEVHSLLVVD--QHTFEVLHAHQFL 808
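
Each alignment block in this report is headed by a line of the form "Score = N (B bits), Expect = E, Sum P(n) = P". The Python sketch below shows one possible way to pull those fields out of such a line. The function name parse_score_line and the dictionary keys are hypothetical, chosen for illustration only. The pattern is written against the lines as they appear in this report and does not attempt to cover other score-line variants (for example a plain "P = ..." form).

```python
import re

# Pattern modeled on the score lines in this report, e.g.
# " Score = 224 (83.9 bits), Expect = 1.7e-17, Sum P(3) = 1.7e-17"
SCORE_RE = re.compile(
    r"Score = (?P<raw>\d+) \((?P<bits>[\d.]+) bits\), "
    r"Expect = (?P<expect>[\d.eE+-]+), "
    r"Sum P\((?P<n>\d+)\) = (?P<sum_p>[\d.eE+-]+)"
)


def parse_score_line(line):
    """Return the fields of one score line as a dict, or None if no match."""
    m = SCORE_RE.search(line)
    if m is None:
        return None
    return {
        "raw": int(m["raw"]),          # raw alignment score
        "bits": float(m["bits"]),      # normalized bit score
        "expect": float(m["expect"]),  # Expect value
        "group_size": int(m["n"]),     # the n in "Sum P(n)"
        "sum_p": float(m["sum_p"]),    # Sum P value
    }


if __name__ == "__main__":
    line = " Score = 224 (83.9 bits), Expect = 1.7e-17, Sum P(3) = 1.7e-17"
    print(parse_score_line(line))
```

Lines that are not score lines (for example the "Identities = ..." line) return None, so the function can be applied to every line of a report and the results filtered.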


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.319   0.135   0.399    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0     1192      1134   0.00091  123 3  11 22  0.39    34
                                                     38  0.45    37
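
The bit scores printed in parentheses next to each raw score can be related to the Lambda and K columns of the table above through the standard Karlin-Altschul normalization, bits = (Lambda * S - ln K) / ln 2. The Python sketch below applies that formula. The function name bit_score is hypothetical. Using the second table row (Q=9,R=2: Lambda = 0.244, K = 0.0300) is an assumption based on the observation that it reproduces the bit scores shown in this report (for example 224 -> 83.9, 150 -> 57.9, 125 -> 49.1), whereas the BLOSUM62 row (0.319, 0.135) does not.

```python
import math


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Values copied from the "Q=9,R=2" row of the parameter table.
# Treating this row as the one behind the reported bit scores is an
# assumption inferred from the numbers, not something the report states.
LAMBDA_GAPPED = 0.244
K_GAPPED = 0.0300

if __name__ == "__main__":
    # (raw score, bit score as printed in the report)
    for raw, reported in [(224, 83.9), (150, 57.9), (125, 49.1)]:
        computed = bit_score(raw, LAMBDA_GAPPED, K_GAPPED)
        print(f"raw={raw}  computed={computed:.1f}  reported={reported}")
```

Because the bit score folds Lambda and K into the number itself, it can be compared across searches run with different scoring systems, which the raw score cannot.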


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  23
  No. of states in DFA:  632 (67 KB)
  Total size of DFA:  501 KB (2232 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  90.30u 0.12s 90.42t   Elapsed:  00:00:04
  Total cpu time:  90.31u 0.12s 90.43t   Elapsed:  00:00:04
  Start:  Tue May 21 02:40:06 2013   End:  Tue May 21 02:40:10 2013
