BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>psy92
MGNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGALKLRFK
KLKVLFVSDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELR
AHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTP
HFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPF
SWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLF
DIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTG
IAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYA
GNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFM
YQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGAL
GFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVW
KFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF

High Scoring Gene Products

Symbol | Full name | Information | P value
cpsf1 | cleavage and polyadenylation specific factor 1 | gene_product from Danio rerio | 6.4e-166
F1RSN8 | Uncharacterized protein | protein from Sus scrofa | 3.5e-165
K7GNU1 | Uncharacterized protein | protein from Sus scrofa | 3.5e-165
CPSF1 | Cleavage and polyadenylation specificity factor subunit 1 | protein from Bos taurus | 2.5e-164
CPSF1 | Uncharacterized protein | protein from Canis lupus familiaris | 5.1e-164
CPSF1 | Uncharacterized protein | protein from Canis lupus familiaris | 5.1e-164
CPSF1 | Cleavage and polyadenylation specificity factor subunit 1 | protein from Homo sapiens | 1.4e-163
Cpsf1 | cleavage and polyadenylation specific factor 1 | protein from Mus musculus | 1.4e-163
Cpsf160 | Cleavage and polyadenylation specificity factor 160 | protein from Drosophila melanogaster | 1.4e-159
Cpsf1 | cleavage and polyadenylation specific factor 1, 160kDa | gene from Rattus norvegicus | 3.6e-118
cpsf-1 | - | gene from Caenorhabditis elegans | 2.0e-89
cpsf-1 | Probable cleavage and polyadenylation specificity factor subunit 1 | protein from Caenorhabditis elegans | 2.0e-89
CPSF160 | cleavage and polyadenylation specificity factor 160 | protein from Arabidopsis thaliana | 3.6e-65
cpsf1 | cleavage and polyadenylation specificity factor 160 kDa subunit | gene from Dictyostelium discoideum | 4.0e-53
orf19.2760 | - | gene_product from Candida albicans | 6.8e-33
CFT1 | Protein CFT1 | protein from Candida albicans SC5314 | 6.8e-33
CFT1 | RNA-binding subunit of the mRNA cleavage and polyadenylation factor | gene from Saccharomyces cerevisiae | 3.7e-31
DDB1A | AT4G05420 | protein from Arabidopsis thaliana | 2.0e-11
DDB1B | damaged DNA binding protein 1B | protein from Arabidopsis thaliana | 5.0e-11
PFL1680w | splicing factor 3b, subunit 3, 130kD, putative | gene from Plasmodium falciparum | 4.4e-05
PFL1680w | Splicing factor 3b, subunit 3, 130kD, putative | protein from Plasmodium falciparum 3D7 | 4.4e-05
MGG_16867 | Uncharacterized protein | protein from Magnaporthe oryzae 70-15 | 0.00016
ddb1 | damage specific DNA binding protein 1 | gene_product from Danio rerio | 0.00034
ddb-1 | - | gene from Caenorhabditis elegans | 0.00085
ddb-1 | DNA damage-binding protein 1 | protein from Caenorhabditis elegans | 0.00085
sf3b3 | splicing factor 3B subunit 3 | gene from Dictyostelium discoideum | 0.00090

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  psy92
        (638 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

ZFIN|ZDB-GENE-040709-2 - symbol:cpsf1 "cleavage and polya...  1220  6.4e-166  2
UNIPROTKB|F1RSN8 - symbol:CPSF1 "Uncharacterized protein"...  1198  3.5e-165  2
UNIPROTKB|K7GNU1 - symbol:CPSF1 "Uncharacterized protein"...  1198  3.5e-165  2
UNIPROTKB|Q10569 - symbol:CPSF1 "Cleavage and polyadenyla...  1194  2.5e-164  2
UNIPROTKB|F1PC28 - symbol:CPSF1 "Uncharacterized protein"...  1191  5.1e-164  2
UNIPROTKB|J9P418 - symbol:CPSF1 "Uncharacterized protein"...  1191  5.1e-164  2
UNIPROTKB|Q10570 - symbol:CPSF1 "Cleavage and polyadenyla...  1191  1.4e-163  2
MGI|MGI:2679722 - symbol:Cpsf1 "cleavage and polyadenylat...  1185  1.4e-163  2
FB|FBgn0024698 - symbol:Cpsf160 "Cleavage and polyadenyla...  1126  1.4e-159  2
RGD|1306406 - symbol:Cpsf1 "cleavage and polyadenylation ...   774  3.6e-118  2
WB|WBGene00022301 - symbol:cpsf-1 species:6239 "Caenorhab...   654  2.0e-89   2
UNIPROTKB|Q9N4C2 - symbol:cpsf-1 "Probable cleavage and p...   654  2.0e-89   2
TAIR|locus:2153122 - symbol:CPSF160 "cleavage and polyade...   455  3.6e-65   2
POMBASE|SPBC1709.08 - symbol:cft1 "cleavage factor one Cf...   450  1.6e-60   2
DICTYBASE|DDB_G0281585 - symbol:cpsf1 "cleavage and polya...   449  4.0e-53   2
ASPGD|ASPL0000050546 - symbol:AN1413 species:162425 "Emer...   370  3.6e-46   2
CGD|CAL0004251 - symbol:orf19.2760 species:5476 "Candida ...   326  6.8e-33   2
UNIPROTKB|Q5AFT3 - symbol:CFT1 "Protein CFT1" species:237...   326  6.8e-33   2
SGD|S000002709 - symbol:CFT1 "RNA-binding subunit of the ...   266  3.7e-31   3
TAIR|locus:2115909 - symbol:DDB1A "damaged DNA binding pr...   128  2.0e-11   2
TAIR|locus:2127368 - symbol:DDB1B "damaged DNA binding pr...   120  5.0e-11   3
GENEDB_PFALCIPARUM|PFL1680w - symbol:PFL1680w "splicing f...   106  4.4e-05   2
UNIPROTKB|Q8I574 - symbol:PFL1680w "Splicing factor 3b, s...   106  4.4e-05   2
UNIPROTKB|G4N4E2 - symbol:MGG_16867 "Uncharacterized prot...   120  0.00016   2
ZFIN|ZDB-GENE-040426-1272 - symbol:ddb1 "damage specific ...   127  0.00034   1
WB|WBGene00010890 - symbol:ddb-1 species:6239 "Caenorhabd...   104  0.00085   2
UNIPROTKB|Q21554 - symbol:ddb-1 "DNA damage-binding prote...   104  0.00085   2
DICTYBASE|DDB_G0282569 - symbol:sf3b3 "splicing factor 3B...    93  0.00090   2
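
The summary table above has a regular layout (database|accession, truncated description, high score, smallest sum probability P(N), and N), so it can be read programmatically. The following is a minimal sketch, not part of the BLAST output itself: the regular expression, the function names `parse_hit_line` and `passes_threshold`, and the dictionary keys are hypothetical and assume the column layout seen in this report. The default cutoff of 0.001 is taken from the Threshold value listed under Parameters.

```python
import re

# Hypothetical pattern for one row of the "Sequences producing
# High-scoring Segment Pairs" table as it appears in this report:
#   DB|ACCESSION - description...  SCORE  P(N)  N
HIT_LINE = re.compile(
    r"^(?P<db>[^|\s]+)\|(?P<acc>\S+) - (?P<desc>.*?)"
    r"\s+(?P<score>\d+)\s+(?P<p>\d\S*)\s+(?P<n>\d+)\s*$"
)


def parse_hit_line(line):
    """Return a dict for one summary row, or None if the line does not match."""
    m = HIT_LINE.match(line)
    if m is None:
        return None
    return {
        "db": m.group("db"),
        "accession": m.group("acc"),
        "description": m.group("desc"),  # still truncated with "..." as printed
        "score": int(m.group("score")),
        "p_value": float(m.group("p")),
        "n": int(m.group("n")),
    }


def passes_threshold(hit, threshold=0.001):
    """True if the hit's P(N) is at or below the cutoff (assumed inclusive)."""
    return hit["p_value"] <= threshold


if __name__ == "__main__":
    row = 'ZFIN|ZDB-GENE-040709-2 - symbol:cpsf1 "cleavage and polya...  1220  6.4e-166  2'
    print(parse_hit_line(row))
```

Descriptions are kept exactly as printed, including the trailing "...", because the full text appears only in the per-hit records further down.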


>ZFIN|ZDB-GENE-040709-2
            symbol:cpsf1 "cleavage and polyadenylation specific
            factor 1" species:7955 "Danio rerio" [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            [GO:0006378 "mRNA polyadenylation" evidence=IMP] [GO:0060216
            "definitive hemopoiesis" evidence=IMP] InterPro:IPR004871
            Pfam:PF03178 ZFIN:ZDB-GENE-040709-2 GO:GO:0005634 GO:GO:0006378
            GO:GO:0003676 GeneTree:ENSGT00550000075040 GO:GO:0060216
            EMBL:CU467825 IPI:IPI00932321 Ensembl:ENSDART00000110017
            ArrayExpress:F1QCJ8 Bgee:F1QCJ8 Uniprot:F1QCJ8
        Length = 1451

 Score = 1220 (434.5 bits), Expect = 6.4e-166, Sum P(2) = 6.4e-166
 Identities = 230/429 (53%), Positives = 302/429 (70%)

Query:    13 DETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGAXXXXXXXXXXXXXSD--- 69
             D  +V+E+  VSLG + +RP LL   + ELLIY+AF + +                +   
Sbjct:   850 DIPLVKEVALVSLGYNHSRPYLLAHVEQELLIYEAFPYDQQQAQSNLKVRFKKMPHNINY 909

Query:    70 RSK----RANEQP-GLPR---GV--RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGEL 119
             R K    R +++P G      GV  R+++ RYF +I+GY GVF+CGP P W+ +TSRG +
Sbjct:   910 REKKVKVRKDKKPEGQGEDTLGVKGRVARFRYFQDISGYSGVFICGPSPHWMLVTSRGAM 969

Query:   120 RAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCT 179
             R HPMTIDG + + +PFHN+NCP+GFLYFN + ELRISVLPT+LSYDAPWPVRK+PL+CT
Sbjct:   970 RLHPMTIDGAIESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCT 1029

Query:   180 PHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSP 239
              H+++YH+E+K Y + TS  EP T   +  GE+KE  T  RD R+I P   +F + L SP
Sbjct:  1030 VHYVSYHVESKVYAVCTSVKEPCTRIPRMTGEEKEFETIERDERYIHPQQDKFSIQLISP 1089

Query:   240 FSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILL 299
              SWE IP T   L EWEHV C+K V+++ + T+SGL+GY+ALGT     E+VTCRGRIL+
Sbjct:  1090 VSWEAIPNTRVDLEEWEHVTCMKTVALKSQETVSGLKGYVALGTCLMQGEEVTCRGRILI 1149

Query:   300 FDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLT 359
              D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH +GFLV+A+GQKI++W LKDNDLT
Sbjct:  1150 LDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCSGFLVSAIGQKIFLWSLKDNDLT 1209

Query:   360 GIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYY 419
             G+AFIDT++YI  M S+KN IL  D  +SI+LLRYQPE +TLSLV+RD KP +  S  + 
Sbjct:  1210 GMAFIDTQLYIHQMYSIKNFILAADVMKSISLLRYQPESKTLSLVSRDAKPLEVYSIEFM 1269

Query:   420 AGNPSRGII 428
               N   G +
Sbjct:  1270 VDNNQLGFL 1278

 Score = 416 (151.5 bits), Expect = 6.4e-166, Sum P(2) = 6.4e-166
 Identities = 80/179 (44%), Positives = 122/179 (68%)

Query:   463 SSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCK---PSSI 519
             + +GF++SD+DKN++++MY PEA+ES GG RL+++ DF++G HVN F+++ C+    ++ 
Sbjct:  1273 NQLGFLVSDRDKNLMVYMYLPEAKESFGGMRLLRRADFNVGSHVNAFWRMPCRGTLDTAN 1332

Query:   520 SDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY 579
               A    ++ +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   H  GLNP+AFR  
Sbjct:  1333 KKALTWDNKHITWFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPKAFRML 1392

Query:   580 KGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
                     N  + I+DG L+ K+L LS  ER E+ KKIG+  + ILD+L +IE +++HF
Sbjct:  1393 HCDRRTLQNAVKNILDGELLNKYLYLSTMERSELAKKIGTTPDIILDDLLEIERVTAHF 1451

 Score = 86 (35.3 bits), Expect = 4.8e-131, Sum P(2) = 4.8e-131
 Identities = 19/40 (47%), Positives = 27/40 (67%)

Query:   422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
             N  + I+DG L+ K+L LS  ER E+ KKIG+  + ILD+
Sbjct:  1401 NAVKNILDGELLNKYLYLSTMERSELAKKIGTTPDIILDD 1440
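
The Identities and Positives lines of each segment pair above give a count, an alignment length, and a percentage. Comparing the figures in this report (for example 80/179 shown as 44%, where 44.7% would round to 45%) suggests the percentages are truncated rather than rounded; this is an observation from the values printed here, not documented behaviour. The sketch below parses such a line and recomputes the percentage under that assumption. The names `parse_stats` and `truncated_percent` are hypothetical.

```python
import re

# Matches the statistics line printed under each "Score = ..." header.
STATS = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\), "
    r"Positives = (\d+)/(\d+) \((\d+)%\)"
)


def truncated_percent(count, length):
    """Integer percentage with the fractional part dropped (assumed convention)."""
    return (100 * count) // length


def parse_stats(line):
    """Return identity/positive counts and reported percentages, or None."""
    m = STATS.search(line)
    if m is None:
        return None
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = map(int, m.groups())
    return {
        "identities": ident,
        "positives": pos,
        "length": ident_len,  # both fractions share the alignment length here
        "identity_pct_reported": ident_pct,
        "positive_pct_reported": pos_pct,
    }


if __name__ == "__main__":
    stats = parse_stats(" Identities = 80/179 (44%), Positives = 122/179 (68%)")
    print(stats)
    print(truncated_percent(stats["identities"], stats["length"]))
```

Recomputing from the raw counts is preferable when comparing hits, since the printed percentages lose up to one percentage point.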


>UNIPROTKB|F1RSN8
            symbol:CPSF1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006378 "mRNA
            polyadenylation" evidence=IEA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA]
            [GO:0003730 "mRNA 3'-UTR binding" evidence=IEA] InterPro:IPR004871
            Pfam:PF03178 GO:GO:0006378 GO:GO:0003730 GO:GO:0005847
            GO:GO:0006379 GeneTree:ENSGT00550000075040 OMA:NIGDNRY
            EMBL:CU468594 Ensembl:ENSSSCT00000006486 Uniprot:F1RSN8
        Length = 1108

 Score = 1198 (426.8 bits), Expect = 3.5e-165, Sum P(2) = 3.5e-165
 Identities = 233/441 (52%), Positives = 291/441 (65%)

Query:     2 GNFRSHSPSAMDET-IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGAXXXXXX 60
             G  R    +   E  +V+E+L V+LG    RP LLV    ELLIY+AF H          
Sbjct:   495 GEARKEEATRQGELPLVKEVLLVALGSRQRRPYLLVHVDQELLIYEAFPHDSQLGQGNLK 554

Query:    61 XXXXXXXSD---R------SKRANEQPGLPRGV----RISQMRYFSNIAGYQGVFLCGPH 107
                     +   R      SK+  E  G   GV    R+++ RYF +I GY GVF+CGP 
Sbjct:   555 VRFKKVPHNINFREKKPKPSKKKTEGGGSEEGVGARGRVARFRYFEDIYGYSGVFICGPS 614

Query:   108 PAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDA 167
             P WL +T RG LR HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDA
Sbjct:   615 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 674

Query:   168 PWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPP 227
             PWPVRK+PL+CT H++AYH+E+K Y + TST  P T   +  GE+KE  T  RD R+I P
Sbjct:   675 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFETIDRDDRYIHP 734

Query:   228 LVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY 287
                 F + L SP SWE IP     L EWEHV C+K VS+  E T+SGL+GY+A GT    
Sbjct:   735 QQEAFSIQLISPVSWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQ 794

Query:   288 SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQK 347
              E+VTCRGRIL+ D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQK
Sbjct:   795 GEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQK 854

Query:   348 IYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD 407
             I++W L+ ++LTG+AFIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD
Sbjct:   855 IFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRD 914

Query:   408 YKPTQPNSKGYYAGNPSRGII 428
              KP +  S  +   N   G +
Sbjct:   915 AKPLEVYSVDFMVDNAQLGFL 935

 Score = 431 (156.8 bits), Expect = 3.5e-165, Sum P(2) = 3.5e-165
 Identities = 82/187 (43%), Positives = 126/187 (67%)

Query:   457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
             D + + + +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF++  C+ 
Sbjct:   924 DFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRG 983

Query:   517 SSISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
             ++  D P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   H  GL
Sbjct:   984 AT--DGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGL 1041

Query:   572 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
             NPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD+L + 
Sbjct:  1042 NPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLET 1101

Query:   632 EALSSHF 638
             + +++HF
Sbjct:  1102 DRVTAHF 1108

 Score = 83 (34.3 bits), Expect = 2.1e-128, Sum P(2) = 2.1e-128
 Identities = 18/40 (45%), Positives = 27/40 (67%)

Query:   422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
             N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD+
Sbjct:  1058 NAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDD 1097


>UNIPROTKB|K7GNU1
            symbol:CPSF1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005634 "nucleus" evidence=IEA] [GO:0003676 "nucleic
            acid binding" evidence=IEA] InterPro:IPR004871 Pfam:PF03178
            GeneTree:ENSGT00550000075040 EMBL:CU468594
            Ensembl:ENSSSCT00000033207 Uniprot:K7GNU1
        Length = 757

 Score = 1198 (426.8 bits), Expect = 3.5e-165, Sum P(2) = 3.5e-165
 Identities = 233/441 (52%), Positives = 291/441 (65%)

Query:     2 GNFRSHSPSAMDET-IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGAXXXXXX 60
             G  R    +   E  +V+E+L V+LG    RP LLV    ELLIY+AF H          
Sbjct:   144 GEARKEEATRQGELPLVKEVLLVALGSRQRRPYLLVHVDQELLIYEAFPHDSQLGQGNLK 203

Query:    61 XXXXXXXSD---R------SKRANEQPGLPRGV----RISQMRYFSNIAGYQGVFLCGPH 107
                     +   R      SK+  E  G   GV    R+++ RYF +I GY GVF+CGP 
Sbjct:   204 VRFKKVPHNINFREKKPKPSKKKTEGGGSEEGVGARGRVARFRYFEDIYGYSGVFICGPS 263

Query:   108 PAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDA 167
             P WL +T RG LR HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDA
Sbjct:   264 PHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 323

Query:   168 PWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPP 227
             PWPVRK+PL+CT H++AYH+E+K Y + TST  P T   +  GE+KE  T  RD R+I P
Sbjct:   324 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFETIDRDDRYIHP 383

Query:   228 LVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY 287
                 F + L SP SWE IP     L EWEHV C+K VS+  E T+SGL+GY+A GT    
Sbjct:   384 QQEAFSIQLISPVSWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQ 443

Query:   288 SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQK 347
              E+VTCRGRIL+ D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQK
Sbjct:   444 GEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQK 503

Query:   348 IYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD 407
             I++W L+ ++LTG+AFIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD
Sbjct:   504 IFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRD 563

Query:   408 YKPTQPNSKGYYAGNPSRGII 428
              KP +  S  +   N   G +
Sbjct:   564 AKPLEVYSVDFMVDNAQLGFL 584

 Score = 431 (156.8 bits), Expect = 3.5e-165, Sum P(2) = 3.5e-165
 Identities = 82/187 (43%), Positives = 126/187 (67%)

Query:   457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
             D + + + +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF++  C+ 
Sbjct:   573 DFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRG 632

Query:   517 SSISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
             ++  D P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   H  GL
Sbjct:   633 AT--DGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGL 690

Query:   572 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
             NPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD+L + 
Sbjct:   691 NPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLET 750

Query:   632 EALSSHF 638
             + +++HF
Sbjct:   751 DRVTAHF 757

 Score = 83 (34.3 bits), Expect = 2.1e-128, Sum P(2) = 2.1e-128
 Identities = 18/40 (45%), Positives = 27/40 (67%)

Query:   422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
             N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD+
Sbjct:   707 NAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDD 746


>UNIPROTKB|Q10569
            symbol:CPSF1 "Cleavage and polyadenylation specificity
            factor subunit 1" species:9913 "Bos taurus" [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=ISS] [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006378
            "mRNA polyadenylation" evidence=IEA] [GO:0003730 "mRNA 3'-UTR
            binding" evidence=IEA] InterPro:IPR004871 Pfam:PF03178
            GO:GO:0006378 GO:GO:0003730 eggNOG:COG5161 KO:K14401 GO:GO:0005847
            GO:GO:0006379 GeneTree:ENSGT00550000075040 OMA:NIGDNRY EMBL:X83097
            IPI:IPI00713487 PIR:S57335 RefSeq:NP_777145.1 UniGene:Bt.4911
            STRING:Q10569 PRIDE:Q10569 Ensembl:ENSBTAT00000011004 GeneID:282703
            KEGG:bta:282703 CTD:29894 HOGENOM:HOG000007904 HOVERGEN:HBG051105
            InParanoid:Q10569 OrthoDB:EOG4BCDM3 NextBio:20806363
            ArrayExpress:Q10569 Uniprot:Q10569
        Length = 1444

 Score = 1194 (425.4 bits), Expect = 2.5e-164, Sum P(2) = 2.5e-164
 Identities = 230/442 (52%), Positives = 293/442 (66%)

Query:     2 GNFRSHSPSAMDET-IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGAXXXXXX 60
             G  R    +   E  +V+E+L V+LG    RP LLV    ELLIY+AF H          
Sbjct:   831 GEARKEEATRQGELPLVKEVLLVALGSRQRRPYLLVHVDQELLIYEAFPHDSQLGQGNLK 890

Query:    61 XXXXXXXSD---RSKR-----------ANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGP 106
                     +   R K+           + E+   PRG R+++ RYF +I GY GVF+CGP
Sbjct:   891 VRFKKVPHNINFREKKPKPSKKKAEGGSTEEGTGPRG-RVARFRYFEDIYGYSGVFICGP 949

Query:   107 HPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYD 166
              P WL +T RG LR HPM IDGP+ + APFHN+NCPRGFLYFN + ELRISVLP +LSYD
Sbjct:   950 SPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYD 1009

Query:   167 APWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIP 226
             APWPVRK+PL+CT H++AYH+E+K Y + TST+ P T   +  GE+KE  T  RD R++ 
Sbjct:  1010 APWPVRKIPLRCTAHYVAYHVESKVYAVATSTSTPCTRVPRMTGEEKEFETIERDERYVH 1069

Query:   227 PLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYN 286
             P    F + L SP SWE IP     L EWEHV C+K VS+  E T+SGL+GY+A GT   
Sbjct:  1070 PQQEAFCIQLISPVSWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLM 1129

Query:   287 YSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ 346
               E+VTCRGRIL+ D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQ
Sbjct:  1130 QGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQ 1189

Query:   347 KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVAR 406
             KI++W L+ ++LTG+AFIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+R
Sbjct:  1190 KIFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSR 1249

Query:   407 DYKPTQPNSKGYYAGNPSRGII 428
             D KP +  S  +   N   G +
Sbjct:  1250 DAKPLEVYSVDFMVDNAQLGFL 1271

 Score = 427 (155.4 bits), Expect = 2.5e-164, Sum P(2) = 2.5e-164
 Identities = 81/187 (43%), Positives = 126/187 (67%)

Query:   457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
             D + + + +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF++  C+ 
Sbjct:  1260 DFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRG 1319

Query:   517 SSISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
             ++  + P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   H  GL
Sbjct:  1320 AA--EGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGL 1377

Query:   572 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
             NPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD+L + 
Sbjct:  1378 NPRAFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLET 1437

Query:   632 EALSSHF 638
             + +++HF
Sbjct:  1438 DRVTAHF 1444

 Score = 83 (34.3 bits), Expect = 5.5e-128, Sum P(2) = 5.5e-128
 Identities = 18/40 (45%), Positives = 27/40 (67%)

Query:   422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
             N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD+
Sbjct:  1394 NAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDD 1433


>UNIPROTKB|F1PC28
            symbol:CPSF1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0006379 "mRNA cleavage" evidence=IEA]
            [GO:0006378 "mRNA polyadenylation" evidence=IEA] [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=IEA] [GO:0003730 "mRNA 3'-UTR binding" evidence=IEA]
            InterPro:IPR004871 Pfam:PF03178 GO:GO:0006378 GO:GO:0003730
            GO:GO:0005847 GO:GO:0006379 GeneTree:ENSGT00550000075040
            OMA:NIGDNRY EMBL:AAEX03008966 ProteinModelPortal:F1PC28
            Ensembl:ENSCAFT00000002514 Uniprot:F1PC28
        Length = 1398

 Score = 1191 (424.3 bits), Expect = 5.1e-164, Sum P(2) = 5.1e-164
 Identities = 229/426 (53%), Positives = 287/426 (67%)

Query:    16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGAXXXXXXXXXXXXXSD---R-- 70
             +V+E+L V+LG   +RP LLV    ELLIY+AF H                  +   R  
Sbjct:   800 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 859

Query:    71 ----SKRANEQPGLPRGV----RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
                 SK+  E  G   G     R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct:   860 KPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 919

Query:   123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
             PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct:   920 PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 979

Query:   183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
             +AYH+E+K Y + TST  P T   +  GE+KE  T  RD R+I P    F + L SP SW
Sbjct:   980 VAYHVESKVYAVATSTNMPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSW 1039

Query:   243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
             E IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D+
Sbjct:  1040 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 1099

Query:   303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
             IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+A
Sbjct:  1100 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 1159

Query:   363 FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
             FIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   N
Sbjct:  1160 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 1219

Query:   423 PSRGII 428
                G +
Sbjct:  1220 AQLGFL 1225

 Score = 427 (155.4 bits), Expect = 5.1e-164, Sum P(2) = 5.1e-164
 Identities = 81/187 (43%), Positives = 126/187 (67%)

Query:   457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
             D + + + +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF++  C+ 
Sbjct:  1214 DFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRG 1273

Query:   517 SSISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
             ++  + P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   H  GL
Sbjct:  1274 AA--EGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGL 1331

Query:   572 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
             NPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD+L + 
Sbjct:  1332 NPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLET 1391

Query:   632 EALSSHF 638
             + +++HF
Sbjct:  1392 DRVTAHF 1398

 Score = 83 (34.3 bits), Expect = 1.1e-127, Sum P(2) = 1.1e-127
 Identities = 18/40 (45%), Positives = 27/40 (67%)

Query:   422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
             N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD+
Sbjct:  1348 NAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDD 1387


>UNIPROTKB|J9P418
            symbol:CPSF1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0005634 "nucleus" evidence=IEA] [GO:0003676
            "nucleic acid binding" evidence=IEA] InterPro:IPR004871
            Pfam:PF03178 GO:GO:0005634 GO:GO:0003676
            GeneTree:ENSGT00550000075040 EMBL:AAEX03008966
            Ensembl:ENSCAFT00000043656 Uniprot:J9P418
        Length = 1107

 Score = 1191 (424.3 bits), Expect = 5.1e-164, Sum P(2) = 5.1e-164
 Identities = 229/426 (53%), Positives = 287/426 (67%)

Query:    16 IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGAXXXXXXXXXXXXXSD---R-- 70
             +V+E+L V+LG   +RP LLV    ELLIY+AF H                  +   R  
Sbjct:   509 LVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLKVRFKKVPHNINFREK 568

Query:    71 ----SKRANEQPGLPRGV----RISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAH 122
                 SK+  E  G   G     R+++ RYF +I GY GVF+CGP P WL +T RG LR H
Sbjct:   569 KPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPSPHWLLVTGRGALRLH 628

Query:   123 PMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHF 182
             PM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYDAPWPVRK+PL+CT H+
Sbjct:   629 PMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHY 688

Query:   183 LAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSW 242
             +AYH+E+K Y + TST  P T   +  GE+KE  T  RD R+I P    F + L SP SW
Sbjct:   689 VAYHVESKVYAVATSTNMPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSW 748

Query:   243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
             E IP     L EWEHV C+K VS+  E T+SGL+GY+A GT     E+VTCRGRIL+ D+
Sbjct:   749 EAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVTCRGRILIMDV 808

Query:   303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIA 362
             IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQKI++W L+ ++LTG+A
Sbjct:   809 IEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFLWSLRASELTGMA 868

Query:   363 FIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGN 422
             FIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD KP +  S  +   N
Sbjct:   869 FIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDN 928

Query:   423 PSRGII 428
                G +
Sbjct:   929 AQLGFL 934

 Score = 427 (155.4 bits), Expect = 5.1e-164, Sum P(2) = 5.1e-164
 Identities = 81/187 (43%), Positives = 126/187 (67%)

Query:   457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
             D + + + +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF++  C+ 
Sbjct:   923 DFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRG 982

Query:   517 SSISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
             ++  + P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   H  GL
Sbjct:   983 AA--EGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGL 1040

Query:   572 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
             NPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD+L + 
Sbjct:  1041 NPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLET 1100

Query:   632 EALSSHF 638
             + +++HF
Sbjct:  1101 DRVTAHF 1107

 Score = 83 (34.3 bits), Expect = 1.1e-127, Sum P(2) = 1.1e-127
 Identities = 18/40 (45%), Positives = 27/40 (67%)

Query:   422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
             N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD+
Sbjct:  1057 NAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDD 1096


>UNIPROTKB|Q10570
            symbol:CPSF1 "Cleavage and polyadenylation specificity
            factor subunit 1" species:9606 "Homo sapiens" [GO:0003730 "mRNA
            3'-UTR binding" evidence=IDA] [GO:0006379 "mRNA cleavage"
            evidence=IDA] [GO:0006378 "mRNA polyadenylation" evidence=IDA]
            [GO:0005847 "mRNA cleavage and polyadenylation specificity factor
            complex" evidence=IDA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0000398 "mRNA splicing, via spliceosome" evidence=TAS]
            [GO:0005654 "nucleoplasm" evidence=TAS] [GO:0006366 "transcription
            from RNA polymerase II promoter" evidence=TAS] [GO:0006369
            "termination of RNA polymerase II transcription" evidence=TAS]
            [GO:0006397 "mRNA processing" evidence=TAS] [GO:0006406 "mRNA
            export from nucleus" evidence=TAS] [GO:0008380 "RNA splicing"
            evidence=TAS] [GO:0010467 "gene expression" evidence=TAS]
            [GO:0031124 "mRNA 3'-end processing" evidence=TAS]
            Reactome:REACT_71 InterPro:IPR004871 Pfam:PF03178
            Reactome:REACT_1675 GO:GO:0006378 GO:GO:0003730 GO:GO:0006406
            GO:GO:0000398 Reactome:REACT_1788 eggNOG:COG5161 KO:K14401
            GO:GO:0005847 GO:GO:0006379 GO:GO:0006369 Reactome:REACT_78
            OMA:NIGDNRY CTD:29894 HOGENOM:HOG000007904 HOVERGEN:HBG051105
            OrthoDB:EOG4BCDM3 EMBL:U37012 EMBL:BC017232 IPI:IPI00026219
            RefSeq:NP_037423.2 UniGene:Hs.493202 ProteinModelPortal:Q10570
            DIP:DIP-32694N IntAct:Q10570 MINT:MINT-1601544 STRING:Q10570
            PhosphoSite:Q10570 DMDM:23503048 PaxDb:Q10570 PeptideAtlas:Q10570
            PRIDE:Q10570 DNASU:29894 Ensembl:ENST00000349769
            Ensembl:ENST00000568627 GeneID:29894 KEGG:hsa:29894 UCSC:uc003zcj.3
            GeneCards:GC08M145618 HGNC:HGNC:2324 MIM:606027 neXtProt:NX_Q10570
            PharmGKB:PA26841 InParanoid:Q10570 PhylomeDB:Q10570 ChiTaRS:CPSF1
            GenomeRNAi:29894 NextBio:52452 ArrayExpress:Q10570 Bgee:Q10570
            CleanEx:HS_CPSF1 Genevestigator:Q10570 GermOnline:ENSG00000071894
            Uniprot:Q10570
        Length = 1443

 Score = 1191 (424.3 bits), Expect = 1.4e-163, Sum P(2) = 1.4e-163
 Identities = 232/441 (52%), Positives = 290/441 (65%)

Query:     2 GNFRSHSPSAMDET-IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGAXXXXXX 60
             G  R    +   E  +V+E+L V+LG   +RP LLV    ELLIY+AF H          
Sbjct:   830 GEARREEATRQGELPLVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLK 889

Query:    61 XXXXXXXSD---R------SKRANEQPGLPRGV----RISQMRYFSNIAGYQGVFLCGPH 107
                     +   R      SK+  E  G   G     R+++ RYF +I GY GVF+CGP 
Sbjct:   890 VRFKKVPHNINFREKKPKPSKKKAEGGGAEEGAGARGRVARFRYFEDIYGYSGVFICGPS 949

Query:   108 PAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDA 167
             P WL +T RG LR HPM IDGPV + APFHNVNCPRGFLYFN + ELRISVLP +LSYDA
Sbjct:   950 PHWLLVTGRGALRLHPMAIDGPVDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDA 1009

Query:   168 PWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPP 227
             PWPVRK+PL+CT H++AYH+E+K Y + TST  P     +  GE+KE  T  RD R+I P
Sbjct:  1010 PWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHP 1069

Query:   228 LVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY 287
                 F + L SP SWE IP     L EWEHV C+K VS+  E T+SGL+GY+A GT    
Sbjct:  1070 QQEAFSIQLISPVSWEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQ 1129

Query:   288 SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQK 347
              E+VTCRGRIL+ D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQK
Sbjct:  1130 GEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQK 1189

Query:   348 IYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD 407
             I++W L+ ++LTG+AFIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+RD
Sbjct:  1190 IFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRD 1249

Query:   408 YKPTQPNSKGYYAGNPSRGII 428
              KP +  S  +   N   G +
Sbjct:  1250 AKPLEVYSVDFMVDNAQLGFL 1270

 Score = 423 (154.0 bits), Expect = 1.4e-163, Sum P(2) = 1.4e-163
 Identities = 80/185 (43%), Positives = 125/185 (67%)

Query:   457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
             D + + + +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF++  C+ 
Sbjct:  1259 DFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRG 1318

Query:   517 SS--ISDAPGA-RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
             ++  +S       ++ +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   H  GLNP
Sbjct:  1319 ATEGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNP 1378

Query:   574 RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
             RAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD+L + + 
Sbjct:  1379 RAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDLLETDR 1438

Query:   634 LSSHF 638
             +++HF
Sbjct:  1439 VTAHF 1443

 Score = 85 (35.0 bits), Expect = 7.0e-128, Sum P(2) = 7.0e-128
 Identities = 18/40 (45%), Positives = 27/40 (67%)

Query:   422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
             N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD+
Sbjct:  1393 NAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDD 1432


>MGI|MGI:2679722
            symbol:Cpsf1 "cleavage and polyadenylation specific factor
            1" species:10090 "Mus musculus" [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0003723 "RNA binding" evidence=IEA] [GO:0003730
            "mRNA 3'-UTR binding" evidence=ISO] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=ISO] [GO:0006378 "mRNA
            polyadenylation" evidence=ISO] [GO:0006379 "mRNA cleavage"
            evidence=ISO] [GO:0006397 "mRNA processing" evidence=IEA]
            InterPro:IPR004871 Pfam:PF03178 MGI:MGI:2679722 GO:GO:0006378
            GO:GO:0003730 eggNOG:COG5161 KO:K14401 GO:GO:0005847 GO:GO:0006379
            GeneTree:ENSGT00550000075040 OMA:NIGDNRY CTD:29894
            HOGENOM:HOG000007904 HOVERGEN:HBG051105 OrthoDB:EOG4BCDM3
            EMBL:AF322193 EMBL:BC056388 IPI:IPI00110363 RefSeq:NP_001157645.1
            RefSeq:NP_444423.1 UniGene:Mm.45141 ProteinModelPortal:Q9EPU4
            STRING:Q9EPU4 PhosphoSite:Q9EPU4 PaxDb:Q9EPU4 PRIDE:Q9EPU4
            Ensembl:ENSMUST00000071898 GeneID:94230 KEGG:mmu:94230
            UCSC:uc007wky.2 InParanoid:Q9EPU4 NextBio:352239 Bgee:Q9EPU4
            CleanEx:MM_CPSF1 Genevestigator:Q9EPU4
            GermOnline:ENSMUSG00000034022 Uniprot:Q9EPU4
        Length = 1441

 Score = 1185 (422.2 bits), Expect = 1.4e-163, Sum P(2) = 1.4e-163
 Identities = 230/442 (52%), Positives = 291/442 (65%)

Query:     2 GNFRSHSPSAMDET-IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGAXXXXXX 60
             G  R    +   E  +V+E+L V+LG   +RP LLV    ELLIY+AF H          
Sbjct:   828 GEVRKEEATRQGELPLVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLK 887

Query:    61 XXXXXXXSD---RSKR-----------ANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGP 106
                     +   R K+           + E+    RG R+++ RYF +I GY GVF+CGP
Sbjct:   888 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGGRG-RVARFRYFEDIYGYSGVFICGP 946

Query:   107 HPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYD 166
              P WL +T RG LR HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYD
Sbjct:   947 SPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYD 1006

Query:   167 APWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIP 226
             APWPVRK+PL+CT H++AYH+E+K Y + TST  P T   +  GE+KE     RD R+I 
Sbjct:  1007 APWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIH 1066

Query:   227 PLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYN 286
             P    F + L SP SWE IP     L EWEHV C+K VS+  E T+SGL+GY+A GT   
Sbjct:  1067 PQQEAFSIQLISPVSWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLM 1126

Query:   287 YSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ 346
               E+VTCRGRIL+ D+IEVVPEPGQPLTKNK K++Y KEQKGPVTA+CH  G LV+A+GQ
Sbjct:  1127 QGEEVTCRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQ 1186

Query:   347 KIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVAR 406
             KI++W L+ ++LTG+AFIDT++YI  M+SVKN IL  D  +SI+LLRYQ E +TLSLV+R
Sbjct:  1187 KIFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSR 1246

Query:   407 DYKPTQPNSKGYYAGNPSRGII 428
             D KP +  S  +   N   G +
Sbjct:  1247 DAKPLEVYSVDFMVDNAQLGFL 1268

 Score = 429 (156.1 bits), Expect = 1.4e-163, Sum P(2) = 1.4e-163
 Identities = 81/187 (43%), Positives = 126/187 (67%)

Query:   457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
             D + + + +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF++  C+ 
Sbjct:  1257 DFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRG 1316

Query:   517 SSISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
             ++  + P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   H  GL
Sbjct:  1317 AA--EGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGL 1374

Query:   572 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
             NPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD+L + 
Sbjct:  1375 NPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDLLET 1434

Query:   632 EALSSHF 638
             + +++HF
Sbjct:  1435 DRVTAHF 1441

 Score = 85 (35.0 bits), Expect = 3.0e-127, Sum P(2) = 3.0e-127
 Identities = 18/40 (45%), Positives = 27/40 (67%)

Query:   422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
             N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD+
Sbjct:  1391 NAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDD 1430


>FB|FBgn0024698
            symbol:Cpsf160 "Cleavage and polyadenylation specificity
            factor 160" species:7227 "Drosophila melanogaster" [GO:0005847
            "mRNA cleavage and polyadenylation specificity factor complex"
            evidence=ISS;NAS] [GO:0006378 "mRNA polyadenylation"
            evidence=ISS;NAS] [GO:0006379 "mRNA cleavage" evidence=ISS;NAS]
            [GO:0003730 "mRNA 3'-UTR binding" evidence=ISS] [GO:0003729 "mRNA
            binding" evidence=NAS] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0022008 "neurogenesis" evidence=IMP] InterPro:IPR004871
            Pfam:PF03178 EMBL:AE013599 GO:GO:0022008 GO:GO:0006378
            GO:GO:0003723 eggNOG:COG5161 KO:K14401 GO:GO:0005847 GO:GO:0006379
            GeneTree:ENSGT00550000075040 OMA:NIGDNRY EMBL:AF241364
            EMBL:AF241365 EMBL:AF241366 EMBL:AY051896 RefSeq:NP_725397.1
            RefSeq:NP_995833.1 UniGene:Dm.3414 ProteinModelPortal:Q9V726
            STRING:Q9V726 PaxDb:Q9V726 PRIDE:Q9V726 EnsemblMetazoa:FBtr0089258
            GeneID:44250 KEGG:dme:Dmel_CG10110 CTD:44250 FlyBase:FBgn0024698
            InParanoid:Q9V726 OrthoDB:EOG4ZCRK8 PhylomeDB:Q9V726
            GenomeRNAi:44250 NextBio:837008 Bgee:Q9V726 GermOnline:CG10110
            Uniprot:Q9V726
        Length = 1455

 Score = 1126 (401.4 bits), Expect = 1.4e-159, Sum P(2) = 1.4e-159
 Identities = 217/430 (50%), Positives = 287/430 (66%)

Query:     9 PSAMDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGAXXXXXXXXXXXXXS 68
             P   +  +  EL  + LGL+G RPLLLVRT+ ELLIYQ FR+PKG               
Sbjct:   854 PQHANSPLPLELSVIGLGLNGERPLLLVRTRVELLIYQVFRYPKGHLKIRFRKMDQLNLL 913

Query:    69 DRSK------RANEQPGLP----RGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGE 118
             D+          +EQ  +     +   + ++R F+N+ G  GV +CG +P ++FLT RGE
Sbjct:   914 DQQPTHIDLDENDEQEEIESYQMQPKYVQKLRPFANVGGLSGVMVCGVNPCFVFLTFRGE 973

Query:   119 LRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKC 178
             LR H +  +G V + A F+NVN P GFLYF+   EL+ISVLP++LSYD+ WPVRKVPL+C
Sbjct:   974 LRIHRLLGNGDVRSFAAFNNVNIPNGFLYFDTTYELKISVLPSYLSYDSVWPVRKVPLRC 1033

Query:   179 TPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFS 238
             TP  L YH E + YC++T T EP T YY+FNGEDKEL  + R  RFI P+ SQF + L S
Sbjct:  1034 TPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIGSQFEMVLIS 1093

Query:   239 PFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRIL 298
             P +WE +P  +     WEHV   K V + YEGT SGL+ Y+ +GTN+NYSED+T RG I 
Sbjct:  1094 PETWEIVPDASITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNYSEDITSRGNIH 1153

Query:   299 LFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDL 358
             ++DIIEVVPEPG+P+TK KIK I+ KEQKGPV+AI  V GFLVT +GQKIYIWQL+D DL
Sbjct:  1154 IYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKIYIWQLRDGDL 1213

Query:   359 TGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYKPTQPNSKGY 418
              G+AFIDT +Y+  +++VK+LI + D  +SI+LLR+Q EYRTLSL +RD+ P +     +
Sbjct:  1214 IGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEF 1273

Query:   419 YAGNPSRGII 428
                N + G +
Sbjct:  1274 MVDNSNLGFL 1283

 Score = 450 (163.5 bits), Expect = 1.4e-159, Sum P(2) = 1.4e-159
 Identities = 84/178 (47%), Positives = 124/178 (69%)

Query:   463 SSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSISDA 522
             S++GF+++D ++N++++MYQPEARES GG +L++K D+HLGQ VNT F+++C    +   
Sbjct:  1278 SNLGFLVTDAERNIIVYMYQPEARESLGGQKLLRKADYHLGQVVNTMFRVQCHQKGLHQR 1337

Query:   523 -PGA-RSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYK 580
              P    ++    Y +LDGALG+ LPLPEK YRR LMLQNV++++  H  GLNP+ +RT K
Sbjct:  1338 QPFLYENKHFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQEHLCGLNPKEYRTLK 1397

Query:   581 GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
                    NPSR IIDG L+W +  ++  ER E+ KKIG++  +IL +L +IE L+S F
Sbjct:  1398 SSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGTRTEEILGDLLEIERLASVF 1455

 Score = 110 (43.8 bits), Expect = 1.2e-123, Sum P(2) = 1.2e-123
 Identities = 27/71 (38%), Positives = 42/71 (59%)

Query:   391 LLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKK 450
             LL YQ E+    L  ++Y+  + + K     NPSR IIDG L+W +  ++  ER E+ KK
Sbjct:  1378 LLSYQ-EH-LCGLNPKEYRTLKSSKK--QGINPSRCIIDGDLIWSYRLMANSERNEVAKK 1433

Query:   451 IGSKHNDILDE 461
             IG++  +IL +
Sbjct:  1434 IGTRTEEILGD 1444


>RGD|1306406
            symbol:Cpsf1 "cleavage and polyadenylation specific factor 1,
            160kDa" species:10116 "Rattus norvegicus" [GO:0003730 "mRNA 3'-UTR
            binding" evidence=IEA;ISO] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA;ISO]
            [GO:0006378 "mRNA polyadenylation" evidence=IEA;ISO] [GO:0006379
            "mRNA cleavage" evidence=IEA;ISO] InterPro:IPR004871 Pfam:PF03178
            RGD:1306406 GO:GO:0005634 GO:GO:0003676 EMBL:CH473950 KO:K14401
            GeneTree:ENSGT00550000075040 CTD:29894 IPI:IPI00949657
            RefSeq:NP_001124043.1 UniGene:Rn.40455 Ensembl:ENSRNOT00000066244
            GeneID:366952 KEGG:rno:366952 UCSC:RGD:1306406 NextBio:690318
            Uniprot:D4A0H5
        Length = 1386

 Score = 774 (277.5 bits), Expect = 3.6e-118, Sum P(2) = 3.6e-118
 Identities = 173/413 (41%), Positives = 226/413 (54%)

Query:     2 GNFRSHSPSAMDET-IVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPKGAXXXXXX 60
             G  R    +   E  +V+E+L V+LG   +RP LLV    ELLIY+AF H          
Sbjct:   824 GEVRKEEATRQGELPLVKEVLLVALGSRQSRPYLLVHVDQELLIYEAFPHDSQLGQGNLK 883

Query:    61 XXXXXXXSD---RSKR-----------ANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGP 106
                     +   R K+           + E+    RG R+++ RYF +I GY GVF+CGP
Sbjct:   884 VRFKKVPHNINFREKKPKPSKKKAEGCSTEEGSGVRG-RVARFRYFEDIYGYSGVFICGP 942

Query:   107 HPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYD 166
              P WL +T RG LR HPM IDGP+ + APFHNVNCPRGFLYFN + ELRISVLP +LSYD
Sbjct:   943 SPHWLLVTGRGALRLHPMGIDGPIDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYD 1002

Query:   167 APWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIP 226
             APWPVRK+PL+CT H++AYH+E+K Y + TST  P T   +  GE+KE     RD R+I 
Sbjct:  1003 APWPVRKIPLRCTAHYVAYHVESKVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIH 1062

Query:   227 PLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYN 286
             P    F + L SP SWE IP     L EWEHV C+K VS+  E T+SGL+GY+A GT   
Sbjct:  1063 PQQEAFSIQLISPVSWEAIPNARIELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLM 1122

Query:   287 YSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ 346
               E+VTCRGRI L+ +       G      ++   Y  +       I  +A  ++ ++  
Sbjct:  1123 QGEEVTCRGRIFLWSL-RASELTGMAFIDTQL---YIHQMISVKNFI--LAADVMKSISL 1176

Query:   347 KIYIWQLKDNDLTGIAFIDTEVY-IASMVSVKNL-ILVGDYARSIALLRYQPE 397
               Y  + K   L        EVY +  MV    L  LV D  R++ +  Y PE
Sbjct:  1177 LRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPE 1229

 Score = 429 (156.1 bits), Expect = 3.6e-118, Sum P(2) = 3.6e-118
 Identities = 81/187 (43%), Positives = 126/187 (67%)

Query:   457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
             D + + + +GF++SD+D+N++++MY PEA+ES GG RL+++ DFH+G HVNTF++  C+ 
Sbjct:  1202 DFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRG 1261

Query:   517 SSISDAPGARS-----RFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGL 571
             ++  + P  +S     + +TW+A+LDG +G  LP+ EK YRRLLMLQN + T   H  GL
Sbjct:  1262 AA--EGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGL 1319

Query:   572 NPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
             NPRAFR          N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD+L + 
Sbjct:  1320 NPRAFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDDLLET 1379

Query:   632 EALSSHF 638
             + +++HF
Sbjct:  1380 DRVTAHF 1386

 Score = 229 (85.7 bits), Expect = 1.1e-56, Sum P(2) = 1.1e-56
 Identities = 50/103 (48%), Positives = 68/103 (66%)

Query:   327 KGPVTA-ICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDY 385
             KG V A  C + G  VT  G +I++W L+ ++LTG+AFIDT++YI  M+SVKN IL  D 
Sbjct:  1112 KGYVAAGTCLMQGEEVTCRG-RIFLWSLRASELTGMAFIDTQLYIHQMISVKNFILAADV 1170

Query:   386 ARSIALLRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPSRGII 428
              +SI+LLRYQ E +TLSLV+RD KP +  S  +   N   G +
Sbjct:  1171 MKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFL 1213

 Score = 85 (35.0 bits), Expect = 7.3e-82, Sum P(2) = 7.3e-82
 Identities = 18/40 (45%), Positives = 27/40 (67%)

Query:   422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
             N  R ++DG L+ ++L LS  ER E+ KKIG+  + ILD+
Sbjct:  1336 NAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDIILDD 1375


>WB|WBGene00022301
            symbol:cpsf-1 species:6239 "Caenorhabditis elegans"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0009792 "embryo development ending in
            birth or egg hatching" evidence=IMP] [GO:0000003 "reproduction"
            evidence=IMP] [GO:0040007 "growth" evidence=IMP] [GO:0002119
            "nematode larval development" evidence=IMP] [GO:0040018 "positive
            regulation of multicellular organism growth" evidence=IMP]
            [GO:0010171 "body morphogenesis" evidence=IMP] [GO:0040027
            "negative regulation of vulval development" evidence=IMP]
            InterPro:IPR004871 Pfam:PF03178 GO:GO:0009792 GO:GO:0040007
            GO:GO:0002119 GO:GO:0006378 GO:GO:0010171 GO:GO:0040018
            GO:GO:0000003 GO:GO:0003723 GO:GO:0040027 eggNOG:COG5161 KO:K14401
            GO:GO:0005847 GO:GO:0006379 GeneTree:ENSGT00550000075040
            OMA:NIGDNRY HOGENOM:HOG000007904 EMBL:FO081666 RefSeq:NP_500157.2
            ProteinModelPortal:Q9N4C2 MINT:MINT-3384281 STRING:Q9N4C2
            PaxDb:Q9N4C2 EnsemblMetazoa:Y76B12C.7.1 EnsemblMetazoa:Y76B12C.7.2
            GeneID:177003 KEGG:cel:CELE_Y76B12C.7 CTD:177003 WormBase:Y76B12C.7
            InParanoid:Q9N4C2 NextBio:894932 Uniprot:Q9N4C2
        Length = 1454

 Score = 654 (235.3 bits), Expect = 2.0e-89, Sum P(2) = 2.0e-89
 Identities = 137/410 (33%), Positives = 224/410 (54%)

Query:    17 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFR--HP----KGAXXXXXXXXXXXXXSDR 70
             V E   V +G++   P+L+     ++++Y+ F   +P     G              S  
Sbjct:   853 VLEAQIVGMGINQAHPILMAIVDEQVVLYEMFSSSNPIPGHLGISFRKLPHFICLRTSSH 912

Query:    71 ----SKRANEQPGLPRGVRISQMRYFSNIAGYQ-GVFLCGPHPAWLFLTSRGELRAHPMT 125
                  KRA  +  +  G R S +  F  ++    GV + G  P  L   + G ++ H MT
Sbjct:   913 LNSDGKRAPFEMKINNGKRFSLIHPFERVSSVNNGVMIVGAVPTLLVYGAWGGMQTHQMT 972

Query:   126 IDGPVSTLAPFHNVNCPRGFLYFNA-KSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLA 184
             +DGP+    PF+N N   G +Y    KSELRI+ +     Y+ P+PV+K+ +  T H + 
Sbjct:   973 VDGPIKAFTPFNNENVLHGIVYMTQHKSELRIARMHPDFDYEMPYPVKKIEVGRTIHHVR 1032

Query:   185 YHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEE 244
             Y + +  Y +V+S  +PS   +    +DK+     +D  F+ P   ++ ++LFS   W  
Sbjct:  1033 YLMNSDVYAVVSSIPKPSNKIWVVMNDDKQEEIHEKDENFVLPAPPKYTLNLFSSQDWAA 1092

Query:   245 IPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIE 304
             +P T     + E V   ++V+++ E T+SGL   +A+GT  NY E+V  RGRI+L ++IE
Sbjct:  1093 VPNTEISFEDMEAVTACEDVALKSESTISGLETLLAMGTVNNYGEEVLVRGRIILCEVIE 1152

Query:   305 VVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFI 364
             VVPEP QP +  KIK+++ KEQKGPVT +C + G L+  +GQK++IWQ KDNDL GI+F+
Sbjct:  1153 VVPEPDQPTSNRKIKVLFDKEQKGPVTGLCAINGLLLCGMGQKVFIWQFKDNDLMGISFL 1212

Query:   365 DTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD-YKPTQP 413
             D   Y+  + S++ + +  D   S++L+R+Q + + +S+ +RD  K  QP
Sbjct:  1213 DMHYYVYQLHSLRTIAIACDARESMSLIRFQEDNKAMSIASRDDRKCAQP 1262

 Score = 281 (104.0 bits), Expect = 2.0e-89, Sum P(2) = 2.0e-89
 Identities = 60/180 (33%), Positives = 103/180 (57%)

Query:   465 MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS---- 520
             +GF++SD+  N+ +F Y PEA ESNGG RL  +   ++G ++N F ++R   S +     
Sbjct:  1275 VGFLLSDETGNITMFNYAPEAPESNGGERLTVRAAINIGTNINAFVRLRGHTSLLQLNNE 1334

Query:   521 -DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY 579
              +      R  T +ASLDG+ GF  PL EK+YRRL  LQ  + + T    GL+ +  R+ 
Sbjct:  1335 DEKEAIEQRMTTVFASLDGSFGFVRPLTEKSYRRLHFLQTFIGSVTPQIAGLHIKGSRSA 1394

Query:   580 K-GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
             K  +    G  +R +IDG +V ++L LSL ++ ++ +++G     I+D+L  +  ++ ++
Sbjct:  1395 KPSQPIVNGRNARNLIDGDVVEQYLHLSLYDKTDLARRLGVGRYHIIDDLMQLRRMAFYY 1454

 Score = 84 (34.6 bits), Expect = 1.2e-68, Sum P(2) = 1.2e-68
 Identities = 19/61 (31%), Positives = 35/61 (57%)

Query:   405 ARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSS 464
             +R  KP+QP   G  A    R +IDG +V ++L LSL ++ ++ +++G     I+D+   
Sbjct:  1391 SRSAKPSQPIVNGRNA----RNLIDGDVVEQYLHLSLYDKTDLARRLGVGRYHIIDDLMQ 1446

Query:   465 M 465
             +
Sbjct:  1447 L 1447


>UNIPROTKB|Q9N4C2
            symbol:cpsf-1 "Probable cleavage and polyadenylation
            specificity factor subunit 1" species:6239 "Caenorhabditis elegans"
            [GO:0006378 "mRNA polyadenylation" evidence=NAS] [GO:0006379 "mRNA
            cleavage" evidence=NAS] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=NAS]
            InterPro:IPR004871 Pfam:PF03178 GO:GO:0009792 GO:GO:0040007
            GO:GO:0002119 GO:GO:0006378 GO:GO:0010171 GO:GO:0040018
            GO:GO:0000003 GO:GO:0003723 GO:GO:0040027 eggNOG:COG5161 KO:K14401
            GO:GO:0005847 GO:GO:0006379 GeneTree:ENSGT00550000075040
            OMA:NIGDNRY HOGENOM:HOG000007904 EMBL:FO081666 RefSeq:NP_500157.2
            ProteinModelPortal:Q9N4C2 MINT:MINT-3384281 STRING:Q9N4C2
            PaxDb:Q9N4C2 EnsemblMetazoa:Y76B12C.7.1 EnsemblMetazoa:Y76B12C.7.2
            GeneID:177003 KEGG:cel:CELE_Y76B12C.7 CTD:177003 WormBase:Y76B12C.7
            InParanoid:Q9N4C2 NextBio:894932 Uniprot:Q9N4C2
        Length = 1454

 Score = 654 (235.3 bits), Expect = 2.0e-89, Sum P(2) = 2.0e-89
 Identities = 137/410 (33%), Positives = 224/410 (54%)

Query:    17 VQELLTVSLGLHGNRPLLLVRTQHELLIYQAFR--HP----KGAXXXXXXXXXXXXXSDR 70
             V E   V +G++   P+L+     ++++Y+ F   +P     G              S  
Sbjct:   853 VLEAQIVGMGINQAHPILMAIVDEQVVLYEMFSSSNPIPGHLGISFRKLPHFICLRTSSH 912

Query:    71 ----SKRANEQPGLPRGVRISQMRYFSNIAGYQ-GVFLCGPHPAWLFLTSRGELRAHPMT 125
                  KRA  +  +  G R S +  F  ++    GV + G  P  L   + G ++ H MT
Sbjct:   913 LNSDGKRAPFEMKINNGKRFSLIHPFERVSSVNNGVMIVGAVPTLLVYGAWGGMQTHQMT 972

Query:   126 IDGPVSTLAPFHNVNCPRGFLYFNA-KSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLA 184
             +DGP+    PF+N N   G +Y    KSELRI+ +     Y+ P+PV+K+ +  T H + 
Sbjct:   973 VDGPIKAFTPFNNENVLHGIVYMTQHKSELRIARMHPDFDYEMPYPVKKIEVGRTIHHVR 1032

Query:   185 YHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEE 244
             Y + +  Y +V+S  +PS   +    +DK+     +D  F+ P   ++ ++LFS   W  
Sbjct:  1033 YLMNSDVYAVVSSIPKPSNKIWVVMNDDKQEEIHEKDENFVLPAPPKYTLNLFSSQDWAA 1092

Query:   245 IPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIE 304
             +P T     + E V   ++V+++ E T+SGL   +A+GT  NY E+V  RGRI+L ++IE
Sbjct:  1093 VPNTEISFEDMEAVTACEDVALKSESTISGLETLLAMGTVNNYGEEVLVRGRIILCEVIE 1152

Query:   305 VVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFI 364
             VVPEP QP +  KIK+++ KEQKGPVT +C + G L+  +GQK++IWQ KDNDL GI+F+
Sbjct:  1153 VVPEPDQPTSNRKIKVLFDKEQKGPVTGLCAINGLLLCGMGQKVFIWQFKDNDLMGISFL 1212

Query:   365 DTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD-YKPTQP 413
             D   Y+  + S++ + +  D   S++L+R+Q + + +S+ +RD  K  QP
Sbjct:  1213 DMHYYVYQLHSLRTIAIACDARESMSLIRFQEDNKAMSIASRDDRKCAQP 1262

 Score = 281 (104.0 bits), Expect = 2.0e-89, Sum P(2) = 2.0e-89
 Identities = 60/180 (33%), Positives = 103/180 (57%)

Query:   465 MGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKPSSIS---- 520
             +GF++SD+  N+ +F Y PEA ESNGG RL  +   ++G ++N F ++R   S +     
Sbjct:  1275 VGFLLSDETGNITMFNYAPEAPESNGGERLTVRAAINIGTNINAFVRLRGHTSLLQLNNE 1334

Query:   521 -DAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTY 579
              +      R  T +ASLDG+ GF  PL EK+YRRL  LQ  + + T    GL+ +  R+ 
Sbjct:  1335 DEKEAIEQRMTTVFASLDGSFGFVRPLTEKSYRRLHFLQTFIGSVTPQIAGLHIKGSRSA 1394

Query:   580 K-GKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSSHF 638
             K  +    G  +R +IDG +V ++L LSL ++ ++ +++G     I+D+L  +  ++ ++
Sbjct:  1395 KPSQPIVNGRNARNLIDGDVVEQYLHLSLYDKTDLARRLGVGRYHIIDDLMQLRRMAFYY 1454

 Score = 84 (34.6 bits), Expect = 1.2e-68, Sum P(2) = 1.2e-68
 Identities = 19/61 (31%), Positives = 35/61 (57%)

Query:   405 ARDYKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSS 464
             +R  KP+QP   G  A    R +IDG +V ++L LSL ++ ++ +++G     I+D+   
Sbjct:  1391 SRSAKPSQPIVNGRNA----RNLIDGDVVEQYLHLSLYDKTDLARRLGVGRYHIIDDLMQ 1446

Query:   465 M 465
             +
Sbjct:  1447 L 1447


>TAIR|locus:2153122
            symbol:CPSF160 "cleavage and polyadenylation specificity
            factor 160" species:3702 "Arabidopsis thaliana" [GO:0003676
            "nucleic acid binding" evidence=IEA] [GO:0005634 "nucleus"
            evidence=ISM;IEA;IDA] [GO:0006378 "mRNA polyadenylation"
            evidence=ISS] [GO:0006379 "mRNA cleavage" evidence=ISS] [GO:0005515
            "protein binding" evidence=IPI] [GO:0005829 "cytosol" evidence=IDA]
            [GO:0006397 "mRNA processing" evidence=RCA] [GO:0009909 "regulation
            of flower development" evidence=RCA] [GO:0016570 "histone
            modification" evidence=RCA] [GO:0048449 "floral organ formation"
            evidence=RCA] InterPro:IPR004871 Pfam:PF03178 GO:GO:0005829
            GO:GO:0005634 EMBL:CP002688 GenomeReviews:BA000015_GR GO:GO:0006397
            GO:GO:0003723 eggNOG:COG5161 KO:K14401 EMBL:AY140902 EMBL:AB025607
            EMBL:AB018109 IPI:IPI00533913 RefSeq:NP_199979.2 UniGene:At.43551
            IntAct:Q9FGR0 STRING:Q9FGR0 PaxDb:Q9FGR0 PRIDE:Q9FGR0
            EnsemblPlants:AT5G51660.1 GeneID:835240 KEGG:ath:AT5G51660
            TAIR:At5g51660 HOGENOM:HOG000265012 InParanoid:Q9FGR0 OMA:NIGDNRY
            PhylomeDB:Q9FGR0 ProtClustDB:CLSN2680511 Genevestigator:Q9FGR0
            GermOnline:AT5G51660 Uniprot:Q9FGR0
        Length = 1442

 Score = 455 (165.2 bits), Expect = 3.6e-65, Sum P(2) = 3.6e-65
 Identities = 112/343 (32%), Positives = 175/343 (51%)

Query:    79 GLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHN 138
             G   GV   ++  F NI+G+QG FL G  P W  L  R  LR H    DG ++     HN
Sbjct:   924 GTSDGVASQRITMFKNISGHQGFFLSGSRPGWCMLF-RERLRFHSQLCDGSIAAFTVLHN 982

Query:   139 VNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTS- 197
             VNC  GF+Y  A+  L+I  LP+   YD  WPV+K+PLK TPH + Y+ E   Y ++ S 
Sbjct:   983 VNCNHGFIYVTAQGVLKICQLPSASIYDNYWPVQKIPLKATPHQVTYYAEKNLYPLIVSY 1042

Query:   198 -TAEP-----STDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFS----WEEIPQ 247
               ++P     S+   +  G+  +      D       V +F + +  P      WE   +
Sbjct:  1043 PVSKPLNQVLSSLVDQEAGQQLDNHNMSSDDLQRTYTVEEFEIQILEPERSGGPWET--K 1100

Query:   248 TNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVP 307
                P+   EH L ++ V++    T       +A+GT Y   EDV  RGR+LLF       
Sbjct:  1101 AKIPMQTSEHALTVRVVTLLNASTGEN-ETLLAVGTAYVQGEDVAARGRVLLFSF----G 1155

Query:   308 EPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTE 367
             + G   ++N +  +Y++E KG ++A+  + G L+ + G KI + +    +L G+AF D  
Sbjct:  1156 KNGDN-SQNVVTEVYSRELKGAISAVASIQGHLLISSGPKIILHKWNGTELNGVAFFDAP 1214

Query:   368 -VYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARDYK 409
              +Y+ SM  VK+ IL+GD  +SI  L ++ +   LSL+A+D++
Sbjct:  1215 PLYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLSLLAKDFE 1257

 Score = 270 (100.1 bits), Expect = 3.6e-65, Sum P(2) = 3.6e-65
 Identities = 59/180 (32%), Positives = 97/180 (53%)

Query:   457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
             + L + S++   +SD+ KN+ +F Y P+  ES  G +L+ + +FH+G HV+ F +++   
Sbjct:  1265 EFLIDGSTLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKFLRLQM-- 1322

Query:   517 SSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAF 576
               +S      +RF   + +LDG+ G   PL E  +RRL  LQ  +V    H  GLNP AF
Sbjct:  1323 --VSSGADKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLNPLAF 1380

Query:   577 RTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIEALSS 636
             R ++  G    +    I+D  L+  +  L L E+LE+  +IG+    IL +L D+   +S
Sbjct:  1381 RQFRSSGKARRSGPDSIVDCELLCHYEMLPLEEQLELAHQIGTTRYSILKDLVDLSVGTS 1440

 Score = 54 (24.1 bits), Expect = 5.3e-19, Sum P(3) = 5.3e-19
 Identities = 16/65 (24%), Positives = 29/65 (44%)

Query:   140 NCPRGFLYFN-AKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTST 198
             NC  G++  + + S L+I ++  H   +A WP  K  +   P+ +          IV + 
Sbjct:    17 NCASGYITHSLSDSTLQIPIVSVHDDIEAEWPNPKRGIGPLPNVVITAANILEVYIVRAQ 76

Query:   199 AEPST 203
              E +T
Sbjct:    77 EEGNT 81

 Score = 52 (23.4 bits), Expect = 3.1e-42, Sum P(2) = 3.1e-42
 Identities = 15/54 (27%), Positives = 28/54 (51%)

Query:   408 YKPTQPNSKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 461
             ++  + + K   +G  S  I+D  L+  +  L L E+LE+  +IG+    IL +
Sbjct:  1380 FRQFRSSGKARRSGPDS--IVDCELLCHYEMLPLEEQLELAHQIGTTRYSILKD 1431

 Score = 37 (18.1 bits), Expect = 5.3e-19, Sum P(3) = 5.3e-19
 Identities = 11/44 (25%), Positives = 21/44 (47%)

Query:   254 EWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRI 297
             E  +++ L+++ M++      L GYI         E+ T  GR+
Sbjct:   234 ESSYIINLRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRV 277


>POMBASE|SPBC1709.08
            symbol:cft1 "cleavage factor one Cft1 (predicted)"
            species:4896 "Schizosaccharomyces pombe" [GO:0003723 "RNA binding"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IDA] [GO:0005829
            "cytosol" evidence=IDA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IDA]
            [GO:0005849 "mRNA cleavage factor complex" evidence=ISO]
            [GO:0006378 "mRNA polyadenylation" evidence=ISO] [GO:0006379 "mRNA
            cleavage" evidence=ISO] InterPro:IPR004871 Pfam:PF03178
            PomBase:SPBC1709.08 GO:GO:0005829 EMBL:CU329671 GO:GO:0006378
            GenomeReviews:CU329671_GR GO:GO:0003723 eggNOG:COG5161 KO:K14401
            OMA:HNDRIFQ OrthoDB:EOG451HZS PIR:T39636 RefSeq:NP_595441.1
            STRING:O74733 EnsemblFungi:SPBC1709.08.1 GeneID:2539694
            KEGG:spo:SPBC1709.08 NextBio:20800847 GO:GO:0005847 GO:GO:0006379
            Uniprot:O74733
        Length = 1441

 Score = 450 (163.5 bits), Expect = 1.6e-60, Sum P(2) = 1.6e-60
 Identities = 117/414 (28%), Positives = 200/414 (48%)

Query:     1 MGNFRSHSPSAMDETIVQELLTVSLGLHGNRPLLLVRTQ-HELLIYQAFRHPKGAXXXXX 59
             M + R++      + +V ELL   LG     P L +R++ +E+ +Y+AF +         
Sbjct:   836 MESERTYFNKESSQELV-ELLVADLGDDFKEPHLFLRSRLNEITVYKAFLYSNTDKHKNL 894

Query:    60 XXXXXXXXSDRSKRANEQPGLPRGVRISQMRYFSN------------IAGYQGVFLCGPH 107
                        ++      G PR    +  +  S+            +  +  VF+ G  
Sbjct:   895 LAFAKVPQETMTREFQANVGTPRDAESTMEKKASSSVDHLKMTALEVVGNHSAVFVTGRK 954

Query:   108 PAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYFNAKSELRISVLPTHLSYDA 167
             P  +  T     +  P++ + P+ ++APFH  + P+G++Y +  S +RI        YD 
Sbjct:   955 PFLILSTLHSNAKFFPISSNIPILSVAPFHAHHAPQGYIYVDENSFIRICKFQEDFEYDN 1014

Query:   168 PWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPP 227
              WP +KV L    + +AYH  TK    V S           +G +   +TD  D  ++P 
Sbjct:  1015 KWPYKKVSLGKQINGIAYH-PTKMVYAVGSAVPIEFKVTDEDGNEPYAITDDND--YLP- 1070

Query:   228 LVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNY 287
             + +   + L SP +W  I    F   ++E  L +  V++E   T    + YIA+GT+   
Sbjct:  1071 MANTGSLDLVSPLTWTVIDSYEF--QQFEIPLSVALVNLEVSETTKLRKPYIAVGTSITK 1128

Query:   288 SEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQK 347
              ED+  RG   LF+II+VVP+PG+P T++K+K++  +E KG V  +C V G+L++  GQK
Sbjct:  1129 GEDIAVRGSTYLFEIIDVVPQPGRPETRHKLKLVTREEIKGTVAVVCEVDGYLLSGQGQK 1188

Query:   348 IYIWQLKDND-LTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPE-YR 399
             + +  L+D D L G++FID   Y  S   ++NL+L GD  +++  + +  E YR
Sbjct:  1189 VIVRALEDEDHLVGVSFIDLGSYTLSAKCLRNLLLFGDVRQNVTFVGFAEEPYR 1242

 Score = 236 (88.1 bits), Expect = 1.6e-60, Sum P(2) = 1.6e-60
 Identities = 59/183 (32%), Positives = 96/183 (52%)

Query:   457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKIRCKP 516
             D L +  ++ F+++D   N+ L  Y PE  ES+ G RL+ + DFH+G +V T   I  K 
Sbjct:  1259 DFLVQGENLYFVVADTSGNLRLLAYDPENPESHSGERLVTRGDFHIG-NVITAMTILPKE 1317

Query:   517 SSISDAP---GARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNP 573
                 +A         F     + DG L   +P+ ++ YRRL ++QN +    +  GGLNP
Sbjct:  1318 KKHQNAEYGYDTGDDFSCVMVNSDGGLQMLVPISDRVYRRLNIIQNYLANRVNTIGGLNP 1377

Query:   574 RAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDI-E 632
             +++R          NP+R I+DG L+  F  +S+  R E+  K G   + I+++L ++ E
Sbjct:  1378 KSYRLITSPSNLT-NPTRRILDGMLIDYFTYMSVAHRHEMAHKCGVPVSTIMNDLVELDE 1436

Query:   633 ALS 635
             ALS
Sbjct:  1437 ALS 1439

 Score = 67 (28.6 bits), Expect = 2.3e-43, Sum P(3) = 2.3e-43
 Identities = 15/49 (30%), Positives = 27/49 (55%)

Query:   422 NPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDEFSSMGFMIS 470
             NP+R I+DG L+  F  +S+  R E+  K G   + I+++   +   +S
Sbjct:  1391 NPTRRILDGMLIDYFTYMSVAHRHEMAHKCGVPVSTIMNDLVELDEALS 1439

 Score = 41 (19.5 bits), Expect = 2.3e-43, Sum P(3) = 2.3e-43
 Identities = 10/26 (38%), Positives = 16/26 (61%)

Query:   373 MVSVKNL-ILVGDYARSIALLRYQPE 397
             +V  +NL  +V D + ++ LL Y PE
Sbjct:  1261 LVQGENLYFVVADTSGNLRLLAYDPE 1286

 Score = 40 (19.1 bits), Expect = 1.1e-14, Sum P(2) = 1.1e-14
 Identities = 19/72 (26%), Positives = 30/72 (41%)

Query:   107 HPAWLFLTSRGELRAHPMT-----IDGPVSTLAP--FHNVNCPRGFLYFNAKSELR-ISV 158
             HP    LT  G+L+ + +      ++  V  L P  F+ +   R   YFN +S    + +
Sbjct:   797 HPILFALTDEGKLKVYNLADFSLLMECDVFDLPPTLFNGMESER--TYFNKESSQELVEL 854

Query:   159 LPTHLSYDAPWP 170
             L   L  D   P
Sbjct:   855 LVADLGDDFKEP 866


>DICTYBASE|DDB_G0281585
            symbol:cpsf1 "cleavage and polyadenylation
            specificity factor 160 kDa subunit" species:44689 "Dictyostelium
            discoideum" [GO:0005634 "nucleus" evidence=IEA] [GO:0003676
            "nucleic acid binding" evidence=IEA] [GO:0006379 "mRNA cleavage"
            evidence=ISS] [GO:0006378 "mRNA polyadenylation" evidence=ISS]
            [GO:0005847 "mRNA cleavage and polyadenylation specificity factor
            complex" evidence=ISS] [GO:0003730 "mRNA 3'-UTR binding"
            evidence=ISS] InterPro:IPR004871 Pfam:PF03178
            dictyBase:DDB_G0281585 GenomeReviews:CM000152_GR GO:GO:0006378
            EMBL:AAFI02000042 GO:GO:0003730 eggNOG:COG5161 KO:K14401
            GO:GO:0005847 GO:GO:0006379 RefSeq:XP_640515.1
            EnsemblProtists:DDB0233702 GeneID:8623125 KEGG:ddi:DDB_G0281585
            InParanoid:Q54TS6 OMA:TSATIQD Uniprot:Q54TS6
        Length = 1628

 Score = 449 (163.1 bits), Expect = 4.0e-53, Sum P(2) = 4.0e-53
 Identities = 108/364 (29%), Positives = 201/364 (55%)

Query:    70 RSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPM-TIDG 128
             + K   E+  L R  RI +   FS+I+G +G+F+ G  P W F   +G LR H M + D 
Sbjct:  1108 KKKEEEEEENLNRQKRIFE---FSSISGKRGLFIGGKKPIWAFC-EKGYLRLHSMDSSDN 1163

Query:   129 P---------------VSTLAPFHNVNCPRGFLYFNAKSE-LRISVLPTHLSYDAPWPVR 172
                             V T   F+N++C  GF+YF+ + + ++I  L T ++++    +R
Sbjct:  1164 SNSNNSNNNNNNNSNTVETFTSFNNISCQDGFIYFSKEKDVIKICTLSTLMNFENDIAIR 1223

Query:   173 KVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQF 232
             ++P K + H +AYH E K Y ++ S  + + +  +     K ++TD +           F
Sbjct:  1224 RIPTKNSCHKIAYHSEAKCYVVIVSFPQVTQELQE--DSKKPILTDDK-----------F 1270

Query:   233 HVSLFSP---FSWEEIPQTNFPLHEWEHVLCLKNVSMEY---EGTLSGLRGYIALGTNYN 286
              + L  P   ++W+ I   +F L + E VL +K VS+++   +G ++  R ++ +GT + 
Sbjct:  1271 QIKLIDPTIDWNWKFID--SFSLQDRETVLAMKIVSLKFTEPDG-ITRARPFLVIGTAFT 1327

Query:   287 YSEDVTCRGRILLFDIIEVVPE-PGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVG 345
             + ED  C+GR+L+F+I+    +   + L + ++ ++Y KEQKGPVTA+  V G L+  +G
Sbjct:  1328 FGEDTQCKGRVLVFEIVSHKTQFESEELGEKRLNLLYEKEQKGPVTALSSVNGLLLMTIG 1387

Query:   346 QKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVA 405
              K+ + Q     L  ++F D ++YI S+ ++KN I++GD  +S+  L+++ + +TL+L++
Sbjct:  1388 PKLTVNQFYTGSLVTLSFYDAQIYICSICTIKNYIVIGDMYKSVYFLQWK-DNKTLNLLS 1446

Query:   406 RDYK 409
             +DY+
Sbjct:  1447 KDYQ 1450

 Score = 169 (64.5 bits), Expect = 4.0e-53, Sum P(2) = 4.0e-53
 Identities = 58/207 (28%), Positives = 94/207 (45%)

Query:   436 FLQLSLGERLEICKKIGSKHNDILDEF----SSMGFMISDKDKNVVLFMYQPEARESNGG 491
             FLQ    + L +  K     N    EF     ++  ++SD DKN++LF ++P+   S  G
Sbjct:  1433 FLQWKDNKTLNLLSKDYQALNIFSTEFIVNQKTLSILVSDLDKNILLFSFEPQDPSSRSG 1492

Query:   492 HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNY 551
                       + Q +N   K        +D    +   L  + +LDG L    PL EK Y
Sbjct:  1493 Q---------INQEINGNNK--------NDNRLPKKEQLVIFGTLDGGLNVLRPLDEKIY 1535

Query:   552 RRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGY-YAGNPS------RGIIDGSLVWKFLQ 604
                  +Q+ +  +   T GLNP+ +R++K     +  +PS      + I+DG L+ KFL 
Sbjct:  1536 LLFYHIQSKLY-YLPQTAGLNPKQYRSFKSFSQNFHFSPSTFHQLPKFILDGDLISKFLS 1594

Query:   605 LSLGERLEICKKIGSKHNDILDELYDI 631
             LS  E+  I   I S  ++I++ L D+
Sbjct:  1595 LSQSEKRLISNSINSTSDEIIESLKDV 1621

 Score = 71 (30.1 bits), Expect = 8.0e-43, Sum P(2) = 8.0e-43
 Identities = 23/80 (28%), Positives = 40/80 (50%)

Query:   392 LRYQPEYRTLSLVARDYKPTQPNSKGYYAGNPS------RGIIDGSLVWKFLQLSLGERL 445
             L Y P+  T  L  + Y+  +  S+ ++  +PS      + I+DG L+ KFL LS  E+ 
Sbjct:  1545 LYYLPQ--TAGLNPKQYRSFKSFSQNFHF-SPSTFHQLPKFILDGDLISKFLSLSQSEKR 1601

Query:   446 EICKKIGSKHNDILDEFSSM 465
              I   I S  ++I++    +
Sbjct:  1602 LISNSINSTSDEIIESLKDV 1621


>ASPGD|ASPL0000050546 [details] [associations]
            symbol:AN1413 species:162425 "Emericella nidulans"
            [GO:0008150 "biological_process" evidence=ND] [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=IEA] [GO:0005829 "cytosol" evidence=IEA] [GO:0003676
            "nucleic acid binding" evidence=IEA] InterPro:IPR004871
            Pfam:PF03178 GO:GO:0005634 EMBL:BN001307 GO:GO:0006397
            GO:GO:0003723 eggNOG:COG5161 KO:K14401 EMBL:AACD01000022
            RefSeq:XP_659017.1 EnsemblFungi:CADANIAT00008024 GeneID:2875502
            KEGG:ani:AN1413.2 HOGENOM:HOG000048586 OMA:HNDRIFQ
            OrthoDB:EOG451HZS Uniprot:Q5BDG7
        Length = 1339

 Score = 370 (135.3 bits), Expect = 3.6e-46, Sum P(2) = 3.6e-46
 Identities = 119/410 (29%), Positives = 199/410 (48%)

Query:    10 SAMDETIVQELLTVSLG-LHGNRPLLLVRTQHE-LLIYQAFRHPKGAXXXXXXXXXXXXX 67
             S   E ++Q +  V LG  + + P L++RT+++ L++Y+ F     +             
Sbjct:   753 STTRENVLQ-IAVVELGDSYSSLPFLILRTENDDLVVYKPFF--TNSKELTGLRFLKEAN 809

Query:    68 SDRSKRANEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTID 127
                 K  N    L   ++   +R   NIAG   +F+ GP   ++F  S      H + + 
Sbjct:   810 HTLPKTPNTTDELQSEMK--PLRILPNIAGCSSIFMPGPSAGFIFRASTTS--PHFIRLR 865

Query:   128 GP-VSTLAPFHNVNCPRGFLYFNAKSELRISVLP--THLSYDAPWPVRKVPLKCTPHFLA 184
             G  +  L  F + +  +GF Y ++   L ++ LP  T L Y  PW +R VP+      L 
Sbjct:   866 GGFIKGLGCFDSPD--KGFAYLDSHG-LHLAKLPEGTQLGY--PWIMRTVPIGQQIDKLT 920

Query:   185 YHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSR--FIPPLVSQFHVSLFSPFSW 242
             Y   + TY  V  T +     ++   ED EL  + R+    F+P  V+Q  + + SP +W
Sbjct:   921 YVSASDTY--VLGTCQRCE--FRLP-EDDELHPEWRNEEISFLPE-VNQSSLKVVSPKTW 974

Query:   243 EEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDI 302
               I   ++PL   EH++ +K +S+E        R  I +GT+    ED+  RG I +F++
Sbjct:   975 SVID--SYPLEPAEHIMVMKTMSLEVSENTHERRDMIVVGTSLARGEDIPSRGCIYVFEV 1032

Query:   303 IEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAG--FLVTAVGQKIYIWQLK-DNDLT 359
             IEVVP+P QP T  ++K+I  +  KG VTA+  + G  FL+ A GQK  +  LK D  L 
Sbjct:  1033 IEVVPDPEQPETNRRLKLIGKEPVKGAVTALSEIGGQGFLIAAQGQKSMVRGLKEDGSLL 1092

Query:   360 GIAFIDTEVYIASMVSVKN--LILVGDYARSIALLRYQPEYRTLSLVARD 407
              +AF+D + +++ +  +K   + + GD  + +    Y  E   +SL A+D
Sbjct:  1093 PVAFMDMQCFVSVIKELKGTGMCIFGDAVKGLWFAGYSEEPYKMSLFAKD 1142

 Score = 197 (74.4 bits), Expect = 3.6e-46, Sum P(2) = 3.6e-46
 Identities = 55/188 (29%), Positives = 96/188 (51%)

Query:   457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFKI-RCK 515
             D L + + +  +++D D N+ +  Y PE   S+ G +L+ ++ FH G   +T   + R  
Sbjct:  1152 DFLPDGNKLFIVVADSDCNLYVLQYDPEDPNSSNGDKLLNRSKFHTGNFASTVTLLPRTL 1211

Query:   516 PSSISDAPGARSRFLTWYASL--------DGALGFFLPLPEKNYRRLLMLQNVMVTHTSH 567
              SS     G+    +   A L        +G++G    +PE++YRRL  LQ+ +     H
Sbjct:  1212 VSSERAMSGSDKMDIDNTAPLHQVLVTSHNGSIGLVTCVPEESYRRLSALQSQLTNTLEH 1271

Query:   568 TGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDE 627
               GLNPRA+R  +     AG   RG++D +L+ ++L +S   + EI  ++G+   +I   
Sbjct:  1272 PCGLNPRAYRAVESDAS-AG---RGMLDSNLLLQYLDMSKQRKAEIAGRVGATEWEIRA- 1326

Query:   628 LYDIEALS 635
               D+EA+S
Sbjct:  1327 --DLEAIS 1332

 Score = 73 (30.8 bits), Expect = 3.7e-33, Sum P(2) = 3.7e-33
 Identities = 26/90 (28%), Positives = 44/90 (48%)

Query:   380 ILVGDYARSIALLRYQPE--YRTLSLVARDYKPT--QP---NSKGYYA----GNPSRGII 428
             +LV  +  SI L+   PE  YR LS +      T   P   N + Y A     +  RG++
Sbjct:  1235 VLVTSHNGSIGLVTCVPEESYRRLSALQSQLTNTLEHPCGLNPRAYRAVESDASAGRGML 1294

Query:   429 DGSLVWKFLQLSLGERLEICKKIGSKHNDI 458
             D +L+ ++L +S   + EI  ++G+   +I
Sbjct:  1295 DSNLLLQYLDMSKQRKAEIAGRVGATEWEI 1324


>CGD|CAL0004251 [details] [associations]
            symbol:orf19.2760 species:5476 "Candida albicans" [GO:0042493
            "response to drug" evidence=IMP] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA]
            [GO:0005829 "cytosol" evidence=IEA] [GO:0005739 "mitochondrion"
            evidence=IEA] [GO:0006369 "termination of RNA polymerase II
            transcription" evidence=IEA] [GO:0006379 "mRNA cleavage"
            evidence=IEA] [GO:0006378 "mRNA polyadenylation" evidence=IEA]
            [GO:0003723 "RNA binding" evidence=IEA] InterPro:IPR004871
            Pfam:PF03178 CGD:CAL0004251 GO:GO:0005634 GO:GO:0042493
            GO:GO:0006397 GO:GO:0003723 EMBL:AACQ01000023 EMBL:AACQ01000025
            RefSeq:XP_720278.1 RefSeq:XP_720279.1 RefSeq:XP_720280.1
            RefSeq:XP_720510.1 STRING:Q5AFT3 GeneID:3637848 GeneID:3638158
            GeneID:3638159 GeneID:3638160 KEGG:cal:CaO19.10274
            KEGG:cal:CaO19.10275 KEGG:cal:CaO19.10276 KEGG:cal:CaO19.2760
            eggNOG:COG5161 KO:K14401 Uniprot:Q5AFT3
        Length = 1420

 Score = 326 (119.8 bits), Expect = 6.8e-33, Sum P(2) = 6.8e-33
 Identities = 88/342 (25%), Positives = 163/342 (47%)

Query:    81 PRGVRISQ-MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNV 139
             P G  I + + YF N+ G+  +F+ G  P  +  T     R    +    +S ++ F + 
Sbjct:   875 PYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKTVHSIPRIFQFSKIAAMS-ISAFSDS 933

Query:   140 NCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTA 199
                 G ++ + +   RI  LP   +Y+   P++ V +  +   +AYH  + T  +V ST 
Sbjct:   934 KIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHVDIGESIKSIAYHETSDT--VVLSTF 991

Query:   200 EPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVL 259
             +    Y   + E K +    +D +  P +  +  + L SP++W  I       +E    L
Sbjct:   992 K-QIPYDCLDEEGKPIAGIIKDIKDTPAMSFKGSIKLVSPYNWTVIETIELGDNEVGMTL 1050

Query:   260 --CLKNVSMEYEGTLSG------------LRGYIALGTNYNYSEDVTCRGRILLFDIIEV 305
                + +V  E   TL               R YI +G      ED+   G   +++II++
Sbjct:  1051 KSMILDVGSESGSTLGSDPNSLIKKYNKKKREYIVIGIGKYRMEDLAANGIFKIYEIIDI 1110

Query:   306 VPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFID 365
             +PEPG+P T +K K I+ +E +G +T+IC ++G  + + GQK+ +  L+D+    +AF+D
Sbjct:  1111 IPEPGKPETNHKFKEIFKEETRGAITSICELSGRFLVSQGQKVIVRDLQDDGTVPVAFLD 1170

Query:   366 TEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD 407
             T VY++   S  NL+++GD  +   L+ +  E   + ++ +D
Sbjct:  1171 TPVYVSESKSFGNLLILGDLLKGCWLVGFDAEPFRMIMLGKD 1212

 Score = 119 (46.9 bits), Expect = 6.8e-33, Sum P(2) = 6.8e-33
 Identities = 45/182 (24%), Positives = 76/182 (41%)

Query:   468 MISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK---IRCKPSSISDA-- 522
             +++D +  + L  Y P+  +S  G +L+ K  F L   ++       I  + S  +DA  
Sbjct:  1233 LVADNNNVLHLLNYDPDDPQSINGTKLLTKASFELNSTISCLRSLPLIDIEESVQTDALT 1292

Query:   523 ---------PGARSRFLTWYASL-DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
                      P   S +     S  DG+     P+ E  YRR+ +LQ  ++    H  GLN
Sbjct:  1293 NIAVPPPLPPNTTSNYFQVIGSTQDGSFFNVFPINEAAYRRMYILQQQLIDKEFHYCGLN 1352

Query:   573 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK--HNDILDELYD 630
             PR  R    K       ++ I+D  L+  F +LS   +  +  K+  K  + DI  ++  
Sbjct:  1353 PRLNRIGSIKLQNNETNTKPILDYDLIRSFTKLSDDRKRNLANKVSGKGIYQDIWKDIIR 1412

Query:   631 IE 632
              E
Sbjct:  1413 FE 1414

 Score = 40 (19.1 bits), Expect = 1.3e-24, Sum P(2) = 1.3e-24
 Identities = 11/50 (22%), Positives = 23/50 (46%)

Query:   424 SRGIIDGSLVWKFLQLSLGERLEICKKIGSK--HNDILDEFSSMGFMISD 471
             ++ I+D  L+  F +LS   +  +  K+  K  + DI  +       ++D
Sbjct:  1370 TKPILDYDLIRSFTKLSDDRKRNLANKVSGKGIYQDIWKDIIRFEHTLND 1419


>UNIPROTKB|Q5AFT3 [details] [associations]
            symbol:CFT1 "Protein CFT1" species:237561 "Candida albicans
            SC5314" [GO:0042493 "response to drug" evidence=IMP]
            InterPro:IPR004871 Pfam:PF03178 CGD:CAL0004251 GO:GO:0005634
            GO:GO:0042493 GO:GO:0006397 GO:GO:0003723 EMBL:AACQ01000023
            EMBL:AACQ01000025 RefSeq:XP_720278.1 RefSeq:XP_720279.1
            RefSeq:XP_720280.1 RefSeq:XP_720510.1 STRING:Q5AFT3 GeneID:3637848
            GeneID:3638158 GeneID:3638159 GeneID:3638160 KEGG:cal:CaO19.10274
            KEGG:cal:CaO19.10275 KEGG:cal:CaO19.10276 KEGG:cal:CaO19.2760
            eggNOG:COG5161 KO:K14401 Uniprot:Q5AFT3
        Length = 1420

 Score = 326 (119.8 bits), Expect = 6.8e-33, Sum P(2) = 6.8e-33
 Identities = 88/342 (25%), Positives = 163/342 (47%)

Query:    81 PRGVRISQ-MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNV 139
             P G  I + + YF N+ G+  +F+ G  P  +  T     R    +    +S ++ F + 
Sbjct:   875 PYGTSIERRLVYFPNLNGFTSIFVTGVIPYLILKTVHSIPRIFQFSKIAAMS-ISAFSDS 933

Query:   140 NCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTA 199
                 G ++ + +   RI  LP   +Y+   P++ V +  +   +AYH  + T  +V ST 
Sbjct:   934 KIKNGLIFLDNQQNARICELPLDFNYEFNLPMKHVDIGESIKSIAYHETSDT--VVLSTF 991

Query:   200 EPSTDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSLFSPFSWEEIPQTNFPLHEWEHVL 259
             +    Y   + E K +    +D +  P +  +  + L SP++W  I       +E    L
Sbjct:   992 K-QIPYDCLDEEGKPIAGIIKDIKDTPAMSFKGSIKLVSPYNWTVIETIELGDNEVGMTL 1050

Query:   260 --CLKNVSMEYEGTLSG------------LRGYIALGTNYNYSEDVTCRGRILLFDIIEV 305
                + +V  E   TL               R YI +G      ED+   G   +++II++
Sbjct:  1051 KSMILDVGSESGSTLGSDPNSLIKKYNKKKREYIVIGIGKYRMEDLAANGIFKIYEIIDI 1110

Query:   306 VPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFID 365
             +PEPG+P T +K K I+ +E +G +T+IC ++G  + + GQK+ +  L+D+    +AF+D
Sbjct:  1111 IPEPGKPETNHKFKEIFKEETRGAITSICELSGRFLVSQGQKVIVRDLQDDGTVPVAFLD 1170

Query:   366 TEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLSLVARD 407
             T VY++   S  NL+++GD  +   L+ +  E   + ++ +D
Sbjct:  1171 TPVYVSESKSFGNLLILGDLLKGCWLVGFDAEPFRMIMLGKD 1212

 Score = 119 (46.9 bits), Expect = 6.8e-33, Sum P(2) = 6.8e-33
 Identities = 45/182 (24%), Positives = 76/182 (41%)

Query:   468 MISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTFFK---IRCKPSSISDA-- 522
             +++D +  + L  Y P+  +S  G +L+ K  F L   ++       I  + S  +DA  
Sbjct:  1233 LVADNNNVLHLLNYDPDDPQSINGTKLLTKASFELNSTISCLRSLPLIDIEESVQTDALT 1292

Query:   523 ---------PGARSRFLTWYASL-DGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
                      P   S +     S  DG+     P+ E  YRR+ +LQ  ++    H  GLN
Sbjct:  1293 NIAVPPPLPPNTTSNYFQVIGSTQDGSFFNVFPINEAAYRRMYILQQQLIDKEFHYCGLN 1352

Query:   573 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSK--HNDILDELYD 630
             PR  R    K       ++ I+D  L+  F +LS   +  +  K+  K  + DI  ++  
Sbjct:  1353 PRLNRIGSIKLQNNETNTKPILDYDLIRSFTKLSDDRKRNLANKVSGKGIYQDIWKDIIR 1412

Query:   631 IE 632
              E
Sbjct:  1413 FE 1414

 Score = 40 (19.1 bits), Expect = 1.3e-24, Sum P(2) = 1.3e-24
 Identities = 11/50 (22%), Positives = 23/50 (46%)

Query:   424 SRGIIDGSLVWKFLQLSLGERLEICKKIGSK--HNDILDEFSSMGFMISD 471
             ++ I+D  L+  F +LS   +  +  K+  K  + DI  +       ++D
Sbjct:  1370 TKPILDYDLIRSFTKLSDDRKRNLANKVSGKGIYQDIWKDIIRFEHTLND 1419


>SGD|S000002709 [details] [associations]
            symbol:CFT1 "RNA-binding subunit of the mRNA cleavage and
            polyadenylation factor" species:4932 "Saccharomyces cerevisiae"
            [GO:0005634 "nucleus" evidence=IEA;IDA] [GO:0003723 "RNA binding"
            evidence=IEA;IDA] [GO:0006397 "mRNA processing" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005739
            "mitochondrion" evidence=IDA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IDA;IPI]
            [GO:0006369 "termination of RNA polymerase II transcription"
            evidence=IMP] [GO:0006378 "mRNA polyadenylation" evidence=IDA;TAS]
            [GO:0006379 "mRNA cleavage" evidence=IDA;TAS] [GO:0005849 "mRNA
            cleavage factor complex" evidence=IPI] InterPro:IPR004871
            Pfam:PF03178 SGD:S000002709 GO:GO:0005739 GO:GO:0006378
            EMBL:BK006938 GO:GO:0003723 EMBL:U28374 eggNOG:COG5161 KO:K14401
            OMA:HNDRIFQ GO:GO:0005847 GO:GO:0006379 PIR:S61187
            RefSeq:NP_010587.1 ProteinModelPortal:Q06632 DIP:DIP-2467N
            IntAct:Q06632 MINT:MINT-375530 STRING:Q06632 PaxDb:Q06632
            PeptideAtlas:Q06632 EnsemblFungi:YDR301W GeneID:851895
            KEGG:sce:YDR301W CYGD:YDR301w GeneTree:ENSGT00550000075040
            HOGENOM:HOG000246682 OrthoDB:EOG4D29XZ NextBio:969889
            Genevestigator:Q06632 GermOnline:YDR301W GO:GO:0006369
            Uniprot:Q06632
        Length = 1357

 Score = 266 (98.7 bits), Expect = 3.7e-31, Sum P(3) = 3.7e-31
 Identities = 82/325 (25%), Positives = 149/325 (45%)

Query:    89 MRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCPRGFLYF 148
             M YF +  GY  +F+ G  P  L        +      + P+ ++ P+      R  +  
Sbjct:   851 MHYFPDYNGYSVIFVTGSVPYILIKEDDSTPKIFKFG-NIPLVSVTPWSE----RSVMCV 905

Query:   149 NAKSELRISVLPT-HLSYDAPWPVRKVPLKC------TPHFLAYHLETKTYCIVTSTAEP 201
             +     R+  L T ++ Y    P++++ +        T   L YH   + + +      P
Sbjct:   906 DDIKNARVYTLTTDNMYYGNKLPLKQIKISNVLDDYKTLQKLVYHERAQLFLVSYCKRVP 965

Query:   202 STDYYKFNGEDKELVTDPRDSRFIPPLVS-QFHVSLFSPFSWEEIPQTNFPLHEWEHVLC 260
                 Y+  GED E V    ++  +P     Q  + L +P SW+ I + +FP +   + + 
Sbjct:   966 ----YEALGEDGEKVIGYDEN--VPHAEGFQSGILLINPKSWKVIDKIDFPKNSVVNEM- 1018

Query:   261 LKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKM 320
              ++  ++        R YI  G     +ED    G   ++D+IEVVPEPG+P T  K+K 
Sbjct:  1019 -RSSMIQINSKTKRKREYIIAGVANATTEDTPPTGAFHIYDVIEVVPEPGKPDTNYKLKE 1077

Query:   321 IYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLK-DNDLTGIAFIDTEVYIASMVSVKNL 379
             I+ +E  G V+ +C V+G  + +  QK+ +  ++ DN +  +AF+D  V++    S  NL
Sbjct:  1078 IFQEEVSGTVSTVCEVSGRFMISQSQKVLVRDIQEDNSVIPVAFLDIPVFVTDSKSFGNL 1137

Query:   380 ILVGDYARSIALLRYQPE-YRTLSL 403
             +++GD  +    + +  E YR +SL
Sbjct:  1138 LIIGDAMQGFQFIGFDAEPYRMISL 1162

 Score = 171 (65.3 bits), Expect = 3.7e-31, Sum P(3) = 3.7e-31
 Identities = 53/198 (26%), Positives = 91/198 (45%)

Query:   436 FLQLSLGERLEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLI 495
             +  +SLG  +    K  +   + L     M F  +D D+NV +  Y P+   S  G RL+
Sbjct:  1157 YRMISLGRSMS---KFQTMSLEFLVNGGDMYFAATDADRNVHVLKYAPDEPNSLSGQRLV 1213

Query:   496 KKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLL 555
               + F L    N+   +  +      +P   S F      +DG++   +PL E+ YRRL 
Sbjct:  1214 HCSSFTL-HSTNSCMMLLPRNEEFG-SPQVPS-FQNVGGQVDGSVFKIVPLSEEKYRRLY 1270

Query:   556 MLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICK 615
             ++Q  ++      GGLNPR  R      Y  G+  R ++D +++ +F  L++  R  I +
Sbjct:  1271 VIQQQIIDRELQLGGLNPRMERL-ANDFYQMGHSMRPMLDFNVIRRFCGLAIDRRKSIAQ 1329

Query:   616 KIGSK-HNDILDELYDIE 632
             K G   H +   ++ +IE
Sbjct:  1330 KAGRHAHFEAWRDIINIE 1347

 Score = 47 (21.6 bits), Expect = 4.6e-08, Sum P(3) = 4.6e-08
 Identities = 18/70 (25%), Positives = 29/70 (41%)

Query:   340 LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARS--IALLRYQPE 397
             LVT     I I++L++ +   +  +D    +  MV    LIL  +      I L + Q E
Sbjct:   669 LVTVSRGDIKIFELEEKNKRKLLKVDLPEILNEMVITSGLILKSNMCNEFLIGLSKSQEE 728

Query:   398 YRTLSLVARD 407
                 + V  D
Sbjct:   729 QLLFTFVTAD 738

 Score = 41 (19.5 bits), Expect = 3.7e-31, Sum P(3) = 3.7e-31
 Identities = 13/41 (31%), Positives = 19/41 (46%)

Query:    12 MDETIVQELLTVSLGLHGNRPLLLVRTQHELLIYQAFRHPK 52
             +D T+V   L           LL+VRT + L +Y+  R  K
Sbjct:     8 LDATVVSHSLATHFTTSDYEELLVVRT-NILSVYRPTRDGK 47


>TAIR|locus:2115909 [details] [associations]
            symbol:DDB1A "damaged DNA binding protein 1A"
            species:3702 "Arabidopsis thaliana" [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0003677 "DNA binding" evidence=ISS]
            [GO:0005634 "nucleus" evidence=ISM;IEA;IDA;IPI] [GO:0010100
            "negative regulation of photomorphogenesis" evidence=IGI;RCA]
            [GO:0045892 "negative regulation of transcription, DNA-dependent"
            evidence=IPI] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0080008 "Cul4-RING ubiquitin ligase complex" evidence=IPI]
            [GO:0005829 "cytosol" evidence=IDA] [GO:0000278 "mitotic cell
            cycle" evidence=RCA] [GO:0000911 "cytokinesis by cell plate
            formation" evidence=RCA] [GO:0003002 "regionalization"
            evidence=RCA] [GO:0006281 "DNA repair" evidence=RCA] [GO:0006486
            "protein glycosylation" evidence=RCA] [GO:0007155 "cell adhesion"
            evidence=RCA] [GO:0008284 "positive regulation of cell
            proliferation" evidence=RCA] [GO:0009630 "gravitropism"
            evidence=RCA] [GO:0009639 "response to red or far red light"
            evidence=RCA] [GO:0010090 "trichome morphogenesis" evidence=RCA]
            [GO:0033043 "regulation of organelle organization" evidence=RCA]
            [GO:0045010 "actin nucleation" evidence=RCA] [GO:0048449 "floral
            organ formation" evidence=RCA] [GO:0048608 "reproductive structure
            development" evidence=RCA] InterPro:IPR017986 InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 UniPathway:UPA00143 GO:GO:0005829
            GO:GO:0005634 GO:GO:0045892 EMBL:CP002687 GenomeReviews:CT486007_GR
            Gene3D:2.130.10.10 SUPFAM:SSF50978 GO:GO:0003677 GO:GO:0006281
            GO:GO:0016567 GO:GO:0009585 EMBL:AL161503 GO:GO:0080008
            GO:GO:0010100 EMBL:AY074257 EMBL:BT001905 EMBL:AK230366
            IPI:IPI00548104 PIR:B85068 RefSeq:NP_192451.1 UniGene:At.32663
            UniGene:At.47587 ProteinModelPortal:Q9M0V3 DIP:DIP-40455N
            IntAct:Q9M0V3 STRING:Q9M0V3 PaxDb:Q9M0V3 PRIDE:Q9M0V3 ProMEX:Q9M0V3
            EnsemblPlants:AT4G05420.1 GeneID:825890 KEGG:ath:AT4G05420
            GeneFarm:4660 TAIR:At4g05420 eggNOG:NOG247734 HOGENOM:HOG000007241
            InParanoid:Q9M0V3 KO:K10610 OMA:CALGDGS PhylomeDB:Q9M0V3
            ProtClustDB:CLSN2685347 Genevestigator:Q9M0V3 GermOnline:AT4G05420
            Uniprot:Q9M0V3
        Length = 1088

 Score = 128 (50.1 bits), Expect = 2.0e-11, Sum P(2) = 2.0e-11
 Identities = 55/189 (29%), Positives = 92/189 (48%)

Query:   230 SQFH-VSLFSPFSWEEIPQTNFPLHEWEHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYS 288
             S+ H V L    ++E +  + +PL  +E+   + + S   +  +     Y  +GT Y   
Sbjct:   741 SEMHFVRLLDDQTFEFM--STYPLDSFEYGCSILSCSFTEDKNV-----YYCVGTAYVLP 793

Query:   289 ED-VTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQK 347
             E+    +GRIL+F     + E G      ++++I  KE KG V ++    G L+ A+ QK
Sbjct:   794 EENEPTKGRILVF-----IVEDG------RLQLIAEKETKGAVYSLNAFNGKLLAAINQK 842

Query:   348 I--YIWQLKDNDLTGIAFIDTEV-----YIASMVSVK-NLILVGDYARSIALLRYQPEYR 399
             I  Y W L+D+   G   + +E       +A  V  + + I+VGD  +SI+LL Y+ E  
Sbjct:   843 IQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLLYKHEEG 899

Query:   400 TLSLVARDY 408
              +   ARDY
Sbjct:   900 AIEERARDY 908

 Score = 118 (46.6 bits), Expect = 2.0e-11, Sum P(2) = 2.0e-11
 Identities = 42/182 (23%), Positives = 80/182 (43%)

Query:   457 DILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQHVNTF----FKI 512
             +ILD+   +G   ++ + N++      E        RL    ++HLG+ VN F      +
Sbjct:   917 EILDDDIYLG---AENNFNLLTVKKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVM 973

Query:   513 RCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLN 572
             R   S I   P         + +++G +G    LP++ Y  L  LQ+ +       GGL+
Sbjct:   974 RLPDSEIGQIP------TVIFGTVNGVIGVIASLPQEQYTFLEKLQSSLRKVIKGVGGLS 1027

Query:   573 PRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHNDILDELYDIE 632
                +R++  +   A   +R  +DG L+  FL LS  +  +I K +  +  ++   + ++ 
Sbjct:  1028 HEQWRSFNNEKRTA--EARNFLDGDLIESFLDLSRNKMEDISKSMNVQVEELCKRVEELT 1085

Query:   633 AL 634
              L
Sbjct:  1086 RL 1087

 Score = 48 (22.0 bits), Expect = 0.00031, Sum P(2) = 0.00031
 Identities = 10/28 (35%), Positives = 16/28 (57%)

Query:   424 SRGIIDGSLVWKFLQLSLGERLEICKKI 451
             +R  +DG L+  FL LS  +  +I K +
Sbjct:  1043 ARNFLDGDLIESFLDLSRNKMEDISKSM 1070


>TAIR|locus:2127368 [details] [associations]
            symbol:DDB1B "damaged DNA binding protein 1B"
            species:3702 "Arabidopsis thaliana" [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0005634 "nucleus" evidence=ISM;IEA;IDA]
            [GO:0003684 "damaged DNA binding" evidence=ISS] [GO:0009793 "embryo
            development ending in seed dormancy" evidence=IMP] [GO:0005515
            "protein binding" evidence=IPI] [GO:0005829 "cytosol" evidence=RCA]
            [GO:0006281 "DNA repair" evidence=RCA] [GO:0007062 "sister
            chromatid cohesion" evidence=RCA] [GO:0009880 "embryonic pattern
            specification" evidence=RCA] [GO:0010072 "primary shoot apical
            meristem specification" evidence=RCA] [GO:0010100 "negative
            regulation of photomorphogenesis" evidence=RCA] [GO:0010162 "seed
            dormancy process" evidence=RCA] [GO:0010431 "seed maturation"
            evidence=RCA] [GO:0010564 "regulation of cell cycle process"
            evidence=RCA] [GO:0045595 "regulation of cell differentiation"
            evidence=RCA] [GO:0048366 "leaf development" evidence=RCA]
            [GO:0048608 "reproductive structure development" evidence=RCA]
            [GO:0048825 "cotyledon development" evidence=RCA] [GO:0051301 "cell
            division" evidence=RCA] InterPro:IPR017986 InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 UniPathway:UPA00143 GO:GO:0005634
            EMBL:CP002687 GenomeReviews:CT486007_GR Gene3D:2.130.10.10
            SUPFAM:SSF50978 EMBL:AL161554 GO:GO:0003677 GO:GO:0006281
            GO:GO:0009793 GO:GO:0016567 GO:GO:0009585 EMBL:AL021960
            UniGene:At.32663 eggNOG:NOG247734 HOGENOM:HOG000007241 KO:K10610
            ProtClustDB:CLSN2685347 EMBL:AK220648 EMBL:AK229805 IPI:IPI00536598
            PIR:T04941 RefSeq:NP_193842.1 ProteinModelPortal:O49552 SMR:O49552
            DIP:DIP-46981N IntAct:O49552 STRING:O49552 PaxDb:O49552
            PRIDE:O49552 EnsemblPlants:AT4G21100.1 GeneID:827857
            KEGG:ath:AT4G21100 GeneFarm:4661 TAIR:At4g21100 InParanoid:O49552
            OMA:DRPAVIY PhylomeDB:O49552 Genevestigator:O49552
            GermOnline:AT4G21100 Uniprot:O49552
        Length = 1088

 Score = 120 (47.3 bits), Expect = 5.0e-11, Sum P(3) = 5.0e-11
 Identities = 45/140 (32%), Positives = 71/140 (50%)

Query:   278 YIALGTNYNYSED-VTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHV 336
             Y  +GT Y   E+    +GRIL+F I+E          + ++++I  KE KG V ++   
Sbjct:   783 YYCVGTAYVLPEENEPTKGRILVF-IVE----------EGRLQLITEKETKGAVYSLNAF 831

Query:   337 AGFLVTAVGQKI--YIWQLKDNDLTGIAFIDTEV-----YIASMVSVK-NLILVGDYARS 388
              G L+ ++ QKI  Y W L+D+   G   + +E       +A  V  + + I VGD  +S
Sbjct:   832 NGKLLASINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDFIAVGDLMKS 888

Query:   389 IALLRYQPEYRTLSLVARDY 408
             I+LL Y+ E   +   ARDY
Sbjct:   889 ISLLIYKHEEGAIEERARDY 908

 Score = 112 (44.5 bits), Expect = 5.0e-11, Sum P(3) = 5.0e-11
 Identities = 34/140 (24%), Positives = 64/140 (45%)

Query:   499 DFHLGQHVNTF----FKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRL 554
             ++H+G+ VN F      ++   S I   P         + ++ G +G    LP++ Y  L
Sbjct:   956 EYHIGEFVNRFRHGSLVMKLPDSDIGQIP------TVIFGTVSGMIGVIASLPQEQYAFL 1009

Query:   555 LMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEIC 614
               LQ  +       GGL+   +R++  +   A   ++G +DG L+  FL LS G+  EI 
Sbjct:  1010 EKLQTSLRKVIKGVGGLSHEQWRSFNNEKRTA--EAKGYLDGDLIESFLDLSRGKMEEIS 1067

Query:   615 KKIGSKHNDILDELYDIEAL 634
             K +  +  ++   + ++  L
Sbjct:  1068 KGMDVQVEELCKRVEELTRL 1087

 Score = 58 (25.5 bits), Expect = 1.3e-05, Sum P(3) = 1.3e-05
 Identities = 12/26 (46%), Positives = 17/26 (65%)

Query:   424 SRGIIDGSLVWKFLQLSLGERLEICK 449
             ++G +DG L+  FL LS G+  EI K
Sbjct:  1043 AKGYLDGDLIESFLDLSRGKMEEISK 1068

 Score = 56 (24.8 bits), Expect = 5.0e-11, Sum P(3) = 5.0e-11
 Identities = 27/122 (22%), Positives = 51/122 (41%)

Query:    83 GVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLAPFHNVNCP 142
             G R   +R FS+ +    VF     PA ++  ++  L ++    +  VS + PF++   P
Sbjct:   626 GTRPITLRTFSSKSATH-VFAASDRPAVIYSNNKKLLYSNVNLKE--VSHMCPFNSAAFP 682

Query:   143 RGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCIVTSTAEPS 202
                L    + EL I  +           +R +P+      + +  +T+T+ I     EPS
Sbjct:   683 DS-LAIAREGELTIGTIDDIQKLH----IRTIPIGEHARRICHQEQTRTFAISCLRNEPS 737

Query:   203 TD 204
              +
Sbjct:   738 AE 739


>GENEDB_PFALCIPARUM|PFL1680w [details] [associations]
            symbol:PFL1680w "splicing factor 3b, subunit 3,
            130kD, putative" species:5833 "Plasmodium falciparum" [GO:0005681
            "spliceosomal complex" evidence=ISS] [GO:0008380 "RNA splicing"
            evidence=ISS] InterPro:IPR017986 InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 GO:GO:0008380 Gene3D:2.130.10.10
            SUPFAM:SSF50978 GO:GO:0005681 GO:GO:0003676 EMBL:AE014188 KO:K12830
            HOGENOM:HOG000216677 RefSeq:XP_001350742.1
            ProteinModelPortal:Q8I574 PRIDE:Q8I574
            EnsemblProtists:PFL1680w:mRNA GeneID:811388 KEGG:pfa:PFL1680w
            EuPathDB:PlasmoDB:PF3D7_1234800 OMA:PVTSSMC ProtClustDB:CLSZ2733835
            Uniprot:Q8I574
        Length = 1329

 Score = 106 (42.4 bits), Expect = 4.4e-05, Sum P(2) = 4.4e-05
 Identities = 41/153 (26%), Positives = 69/153 (45%)

Query:   481 YQPEARESNGGHRLIKKT-DFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGA 539
             Y  E   S+  +R ++    FH+G+ V +  K+R  P+S        S  +  Y+++ G 
Sbjct:  1188 YGGEIMNSSTKNRKLEHMMSFHIGEIVTSMQKVRLSPTS--------SECII-YSTIMGT 1238

Query:   540 LGFFLPLPEKNYRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 598
             +G F+P   K    L   L+ ++ T      G     FR+Y    Y+   P + ++DG L
Sbjct:  1239 IGAFIPYDNKEELELTQHLEIILRTEKPPLCGREHIFFRSY----YH---PVQNVVDGDL 1291

Query:   599 VWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
               +F  LS   + +I   +     DIL +L DI
Sbjct:  1292 CEQFSSLSYDAQKKIANDLERTPEDILRKLEDI 1324

 Score = 81 (33.6 bits), Expect = 4.4e-05, Sum P(2) = 4.4e-05
 Identities = 24/95 (25%), Positives = 40/95 (42%)

Query:   317 KIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIASMVS 375
             K+ +++    +      C   G L+ ++G K+ I+ L K   L    + D    I S+  
Sbjct:  1049 KLNLLHITPIEEQPYCFCSYNGKLIASIGNKLRIYALGKKKLLKKCEYKDIPEAIVSIKI 1108

Query:   376 VKNLILVGDYARSIALLRYQPEYRTLSLVARDYKP 410
               N I   D   S+ +  Y P   TL L++ D  P
Sbjct:  1109 SGNRIFACDIRESVLIFFYDPNQNTLRLISDDIIP 1143


>UNIPROTKB|Q8I574 [details] [associations]
            symbol:PFL1680w "Splicing factor 3b, subunit 3, 130kD,
            putative" species:36329 "Plasmodium falciparum 3D7" [GO:0005681
            "spliceosomal complex" evidence=ISS] [GO:0008380 "RNA splicing"
            evidence=ISS] InterPro:IPR017986 InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 GO:GO:0008380 Gene3D:2.130.10.10
            SUPFAM:SSF50978 GO:GO:0005681 GO:GO:0003676 EMBL:AE014188 KO:K12830
            HOGENOM:HOG000216677 RefSeq:XP_001350742.1
            ProteinModelPortal:Q8I574 PRIDE:Q8I574
            EnsemblProtists:PFL1680w:mRNA GeneID:811388 KEGG:pfa:PFL1680w
            EuPathDB:PlasmoDB:PF3D7_1234800 OMA:PVTSSMC ProtClustDB:CLSZ2733835
            Uniprot:Q8I574
        Length = 1329

 Score = 106 (42.4 bits), Expect = 4.4e-05, Sum P(2) = 4.4e-05
 Identities = 41/153 (26%), Positives = 69/153 (45%)

Query:   481 YQPEARESNGGHRLIKKT-DFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGA 539
             Y  E   S+  +R ++    FH+G+ V +  K+R  P+S        S  +  Y+++ G 
Sbjct:  1188 YGGEIMNSSTKNRKLEHMMSFHIGEIVTSMQKVRLSPTS--------SECII-YSTIMGT 1238

Query:   540 LGFFLPLPEKNYRRLLM-LQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSL 598
             +G F+P   K    L   L+ ++ T      G     FR+Y    Y+   P + ++DG L
Sbjct:  1239 IGAFIPYDNKEELELTQHLEIILRTEKPPLCGREHIFFRSY----YH---PVQNVVDGDL 1291

Query:   599 VWKFLQLSLGERLEICKKIGSKHNDILDELYDI 631
               +F  LS   + +I   +     DIL +L DI
Sbjct:  1292 CEQFSSLSYDAQKKIANDLERTPEDILRKLEDI 1324

 Score = 81 (33.6 bits), Expect = 4.4e-05, Sum P(2) = 4.4e-05
 Identities = 24/95 (25%), Positives = 40/95 (42%)

Query:   317 KIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQL-KDNDLTGIAFIDTEVYIASMVS 375
             K+ +++    +      C   G L+ ++G K+ I+ L K   L    + D    I S+  
Sbjct:  1049 KLNLLHITPIEEQPYCFCSYNGKLIASIGNKLRIYALGKKKLLKKCEYKDIPEAIVSIKI 1108

Query:   376 VKNLILVGDYARSIALLRYQPEYRTLSLVARDYKP 410
               N I   D   S+ +  Y P   TL L++ D  P
Sbjct:  1109 SGNRIFACDIRESVLIFFYDPNQNTLRLISDDIIP 1143


>UNIPROTKB|G4N4E2
            symbol:MGG_16867 "Uncharacterized protein" species:242507
            "Magnaporthe oryzae 70-15" [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR004871 InterPro:IPR015943 Pfam:PF03178
            GO:GO:0005634 Gene3D:2.130.10.10 EMBL:CM001233 GO:GO:0003676
            RefSeq:XP_003712617.1 EnsemblFungi:MGG_16867T0 GeneID:12985117
            KEGG:mgr:MGG_16867 Uniprot:G4N4E2
        Length = 1183

 Score = 120 (47.3 bits), Expect = 0.00016, Sum P(2) = 0.00016
 Identities = 40/182 (21%), Positives = 82/182 (45%)

Query:   445 LEICKKIGSKHNDILDEFSSMGFMISDKDKNVVLFMYQPEARESNGGHRLIKKTDFHLGQ 504
             +E+C+   +  +  +       ++++D D N+V+ +            R+   ++F LG+
Sbjct:   998 VEVCRDYQAMWSTAVSHLEGDSWIVADGDGNLVVLLRNTAGVTLEDKRRMQMTSEFGLGE 1057

Query:   505 HVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGFFLPLPEKNYRRLLM-LQNVMVT 563
              VN   K+  + S+  +AP     FL+   + +G++  F  +  K ++ LLM  Q  M  
Sbjct:  1058 CVNKIQKVMVETSA--NAPIVAKAFLS---TTEGSIYLFGTVAPK-FQSLLMDFQANMEA 1111

Query:   564 HTSHT-GGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWKFLQLSLGERLEICKKIGSKHN 622
             H S   G L    +R+++        P R  +DG  +  FL +    +++IC+ +     
Sbjct:  1112 HVSSPLGELQFNQWRSFRNPEREGAGPER-FLDGEFLEMFLDMEENTQIDICQGLSYTAE 1170

Query:   623 DI 624
             D+
Sbjct:  1171 DM 1172

 Score = 60 (26.2 bits), Expect = 0.00016, Sum P(2) = 0.00016
 Identities = 45/182 (24%), Positives = 69/182 (37%)

Query:   222 SRFIPPLVSQFHVSLFSPFSWEEIPQTN-FPLHEWEHVLCLKNVSMEYEGTLSGLRGYIA 280
             SR I  +   F   LF    +E  P+ N F L + E   C+    +  +        +I 
Sbjct:   803 SRGIEKVYGTF--KLFDEVIFE--PKGNVFALEDGEVPECVTRAPL-LDSYGEQAERFI- 856

Query:   281 LGTNYNYSEDVTCRGRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAG-F 339
             +GT Y         GR+L+F     V E   P       +I+A   K     I  +    
Sbjct:   857 VGTRYLSGTGSGHGGRVLVFG----VDESRSPY------LIHAHSTKSGCRRIATMDDDL 906

Query:   340 LVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSVK-----NLILVGDYARSIALLRY 394
             LV A+ + + + +  +   T   F+    +  S  +V       LI V D  +SI LL Y
Sbjct:   907 LVIALTKTVVLVRYSETSTTSAKFLKVAAFQTSSYAVDVTVHGKLIAVADIMKSITLLEY 966

Query:   395 QP 396
              P
Sbjct:   967 IP 968

 Score = 54 (24.1 bits), Expect = 0.00065, Sum P(2) = 0.00065
 Identities = 21/93 (22%), Positives = 39/93 (41%)

Query:   306 VPEPGQPLT--KNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAF 363
             VP+PG  +     +++       + P    CH+ G +V    + +YI   +    T  A 
Sbjct:   229 VPDPGSTMMIPVERVETERRHNFRNPARDECHLGGVIVVGESRMLYIDD-QSWTWTETAL 287

Query:   364 IDTEVYIA-SMVSVKNLILVGDYARSIALLRYQ 395
              +  V++A +     + +L  DY   + LL  Q
Sbjct:   288 KNAMVFVAWAKFDNTHYLLADDYG-GLHLLTIQ 319


>ZFIN|ZDB-GENE-040426-1272
            symbol:ddb1 "damage specific DNA binding protein
            1" species:7955 "Danio rerio" [GO:0005634 "nucleus" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] InterPro:IPR004871
            InterPro:IPR015943 Pfam:PF03178 ZFIN:ZDB-GENE-040426-1272
            GO:GO:0005634 Gene3D:2.130.10.10 GO:GO:0003676 EMBL:JQ692623
            UniGene:Dr.77970 Uniprot:I1XUS8
        Length = 1140

 Score = 127 (49.8 bits), Expect = 0.00034, P = 0.00034
 Identities = 77/352 (21%), Positives = 145/352 (41%)

Query:    75 NEQPGLPRGVRISQMRYFSNIAGYQGVFLCGPHPAWLFLTSRGELRAHPMTIDGPVSTLA 134
             +E+  +  G + + +R F +++    VF C   P  ++ +S  +L    + +   V+ + 
Sbjct:   624 SERKKVTLGTQPTVLRTFRSLST-SNVFACSDRPTVIY-SSNHKLVFSNVNLK-EVNYMC 680

Query:   135 PFHNVNCPRGFLYFNAKSELRISVLPTHLSYDAPWPVRKVPLKCTPHFLAYHLETKTYCI 194
             P ++   P      N  S L I  +           +R VPL  +P  + Y   ++ + +
Sbjct:   681 PLNSEGYPDSLALAN-NSTLTIGTIDEIQKLH----IRTVPLYESPKRICYQEVSQCFGV 735

Query:   195 VTSTAEP-----STDYYKFNGEDKELVTDPRDSRFIPPLVSQFHVSL---FSPFSWEEIP 246
             ++S  E      +T   + +   + L +    S+  P   S    S        S   + 
Sbjct:   736 LSSRVEMQDASGTTAAVRPSASTQALSSSVSSSKLFPSSTSPHETSFGEEVEVHSLLVVD 795

Query:   247 QTNFPL---HEW-EHVLCLKNVSMEYEGTLSGLRGYIALGTNYNYSEDVTCR-GRILLFD 301
             Q  F +   H++ ++   L  VS +  G    +  Y  +GT   Y E+   + GRI++F 
Sbjct:   796 QHTFEVLHAHQFLQNEYALSMVSCKL-GRDPAV--YFIVGTAMVYPEEAEPKQGRIIVFH 852

Query:   302 IIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTAVGQ--KIYIWQLKDNDLT 359
                         T  K++ +  KE KG V ++    G L+ ++    ++Y W  +    T
Sbjct:   853 Y-----------TDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTAEKELRT 901

Query:   360 GIAFIDTEVYIASMVSVK-NLILVGDYARSIALLRYQPEYRTLSLVARDYKP 410
                  +    +A  +  K + ILVGD  RS+ LL Y+P   +   +ARD+ P
Sbjct:   902 ECNHYNN--IMALYLKTKGDFILVGDLMRSVLLLAYKPMEGSFEEIARDFNP 951


>WB|WBGene00010890
            symbol:ddb-1 species:6239 "Caenorhabditis elegans"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0040010 "positive regulation of growth
            rate" evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0040011 "locomotion" evidence=IMP] [GO:0000003
            "reproduction" evidence=IMP] [GO:0009792 "embryo development ending
            in birth or egg hatching" evidence=IMP] [GO:0006898
            "receptor-mediated endocytosis" evidence=IMP] [GO:0040035
            "hermaphrodite genitalia development" evidence=IMP] [GO:0030163
            "protein catabolic process" evidence=IMP] [GO:0007276 "gamete
            generation" evidence=IMP] [GO:0005515 "protein binding"
            evidence=IPI] InterPro:IPR004871 Pfam:PF03178 UniPathway:UPA00143
            GO:GO:0005634 GO:GO:0009792 GO:GO:0006898 GO:GO:0005737
            GO:GO:0040010 GO:GO:0002119 GO:GO:0003677 GO:GO:0006281
            GO:GO:0040011 GO:GO:0016567 GO:GO:0007049 GO:GO:0040035
            InterPro:IPR011047 SUPFAM:SSF50998 GO:GO:0030163 GO:GO:0007276
            eggNOG:NOG247734 HOGENOM:HOG000007241 KO:K10610 OMA:CALGDGS
            GeneTree:ENSGT00530000063396 EMBL:Z68507 PIR:A88855 PIR:T23798
            RefSeq:NP_502299.1 HSSP:Q16531 ProteinModelPortal:Q21554
            DIP:DIP-25884N IntAct:Q21554 MINT:MINT-1055778 STRING:Q21554
            PaxDb:Q21554 EnsemblMetazoa:M18.5.1 EnsemblMetazoa:M18.5.2
            GeneID:178156 KEGG:cel:CELE_M18.5 UCSC:M18.5 CTD:178156
            WormBase:M18.5 InParanoid:Q21554 NextBio:899950 Uniprot:Q21554
        Length = 1134

 Score = 104 (41.7 bits), Expect = 0.00085, Sum P(2) = 0.00085
 Identities = 53/246 (21%), Positives = 100/246 (40%)

Query:   171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
             VR +P+  +   +AY   T TY + ++  E   +  +       LVT     +       
Sbjct:   714 VRSIPMGESVLRIAYQKSTSTYGVCSNRTESKAE--RVFASKNALVTSQSRPKVASTRAD 771

Query:   231 QFHVSLFSPFSWEEIPQTNFPL---HE---WEHVLCLKNVSMEYEGTLSGLRGYIALGTN 284
                    +  S+  + Q  F +   HE   WE  L    +S ++    S    Y  +GT 
Sbjct:   772 MDESPPNTTSSFMVLDQNTFQVLHSHEFGPWETALSC--ISGQFTNDSST---YYVVGTG 826

Query:   285 YNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTA 343
               Y ++   + GRI++F++ +V         ++K++ ++    +G   AI  + G LV A
Sbjct:   827 LIYPDETETKIGRIVVFEVDDV--------ERSKLRRVHELVVRGSPLAIRILNGKLVAA 878

Query:   344 VGQKIYIWQ-LKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLS 402
             +   I +++   D +L         V    +  +   + V D  RS++LL Y+       
Sbjct:   879 INSSIRLFEWTTDKELRLECSSFNHVIALDLKVMNEEVAVADVMRSVSLLSYRMLEGNFE 938

Query:   403 LVARDY 408
              VA+D+
Sbjct:   939 EVAKDW 944

 Score = 69 (29.3 bits), Expect = 0.00085, Sum P(2) = 0.00085
 Identities = 16/88 (18%), Positives = 38/88 (43%)

Query:   533 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 592
             + +  G +G  + + +K  + L+ ++  +     +   +   ++RT+  +      P  G
Sbjct:  1025 FGTNQGTIGMIVQIDDKWKKFLIAIEKAIADSVKNCMHIEHSSYRTFVFQK--RAEPPSG 1082

Query:   593 IIDGSLVWKFLQLSLGERLEICKKIGSK 620
              +DG LV   L +     ++I  K+  K
Sbjct:  1083 FVDGDLVESILDMDRSVAMDILSKVSDK 1110


>UNIPROTKB|Q21554
            symbol:ddb-1 "DNA damage-binding protein 1" species:6239
            "Caenorhabditis elegans" [GO:0005515 "protein binding"
            evidence=IPI] [GO:0005737 "cytoplasm" evidence=ISS] [GO:0005634
            "nucleus" evidence=ISS] InterPro:IPR004871 Pfam:PF03178
            UniPathway:UPA00143 GO:GO:0005634 GO:GO:0009792 GO:GO:0006898
            GO:GO:0005737 GO:GO:0040010 GO:GO:0002119 GO:GO:0003677
            GO:GO:0006281 GO:GO:0040011 GO:GO:0016567 GO:GO:0007049
            GO:GO:0040035 InterPro:IPR011047 SUPFAM:SSF50998 GO:GO:0030163
            GO:GO:0007276 eggNOG:NOG247734 HOGENOM:HOG000007241 KO:K10610
            OMA:CALGDGS GeneTree:ENSGT00530000063396 EMBL:Z68507 PIR:A88855
            PIR:T23798 RefSeq:NP_502299.1 HSSP:Q16531 ProteinModelPortal:Q21554
            DIP:DIP-25884N IntAct:Q21554 MINT:MINT-1055778 STRING:Q21554
            PaxDb:Q21554 EnsemblMetazoa:M18.5.1 EnsemblMetazoa:M18.5.2
            GeneID:178156 KEGG:cel:CELE_M18.5 UCSC:M18.5 CTD:178156
            WormBase:M18.5 InParanoid:Q21554 NextBio:899950 Uniprot:Q21554
        Length = 1134

 Score = 104 (41.7 bits), Expect = 0.00085, Sum P(2) = 0.00085
 Identities = 53/246 (21%), Positives = 100/246 (40%)

Query:   171 VRKVPLKCTPHFLAYHLETKTYCIVTSTAEPSTDYYKFNGEDKELVTDPRDSRFIPPLVS 230
             VR +P+  +   +AY   T TY + ++  E   +  +       LVT     +       
Sbjct:   714 VRSIPMGESVLRIAYQKSTSTYGVCSNRTESKAE--RVFASKNALVTSQSRPKVASTRAD 771

Query:   231 QFHVSLFSPFSWEEIPQTNFPL---HE---WEHVLCLKNVSMEYEGTLSGLRGYIALGTN 284
                    +  S+  + Q  F +   HE   WE  L    +S ++    S    Y  +GT 
Sbjct:   772 MDESPPNTTSSFMVLDQNTFQVLHSHEFGPWETALSC--ISGQFTNDSST---YYVVGTG 826

Query:   285 YNYSEDVTCR-GRILLFDIIEVVPEPGQPLTKNKIKMIYAKEQKGPVTAICHVAGFLVTA 343
               Y ++   + GRI++F++ +V         ++K++ ++    +G   AI  + G LV A
Sbjct:   827 LIYPDETETKIGRIVVFEVDDV--------ERSKLRRVHELVVRGSPLAIRILNGKLVAA 878

Query:   344 VGQKIYIWQ-LKDNDLTGIAFIDTEVYIASMVSVKNLILVGDYARSIALLRYQPEYRTLS 402
             +   I +++   D +L         V    +  +   + V D  RS++LL Y+       
Sbjct:   879 INSSIRLFEWTTDKELRLECSSFNHVIALDLKVMNEEVAVADVMRSVSLLSYRMLEGNFE 938

Query:   403 LVARDY 408
              VA+D+
Sbjct:   939 EVAKDW 944

 Score = 69 (29.3 bits), Expect = 0.00085, Sum P(2) = 0.00085
 Identities = 16/88 (18%), Positives = 38/88 (43%)

Query:   533 YASLDGALGFFLPLPEKNYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRG 592
             + +  G +G  + + +K  + L+ ++  +     +   +   ++RT+  +      P  G
Sbjct:  1025 FGTNQGTIGMIVQIDDKWKKFLIAIEKAIADSVKNCMHIEHSSYRTFVFQK--RAEPPSG 1082

Query:   593 IIDGSLVWKFLQLSLGERLEICKKIGSK 620
              +DG LV   L +     ++I  K+  K
Sbjct:  1083 FVDGDLVESILDMDRSVAMDILSKVSDK 1110


>DICTYBASE|DDB_G0282569
            symbol:sf3b3 "splicing factor 3B subunit 3"
            species:44689 "Dictyostelium discoideum" [GO:0005634 "nucleus"
            evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0030532 "small nuclear ribonucleoprotein complex" evidence=ISS]
            [GO:0008380 "RNA splicing" evidence=IEA;ISS] [GO:0006461 "protein
            complex assembly" evidence=ISS] [GO:0005681 "spliceosomal complex"
            evidence=IEA;ISS] [GO:0006397 "mRNA processing" evidence=IEA]
            InterPro:IPR017986 InterPro:IPR004871 InterPro:IPR015943
            Pfam:PF03178 dictyBase:DDB_G0282569 GO:GO:0006461 GO:GO:0008380
            Gene3D:2.130.10.10 SUPFAM:SSF50978 EMBL:AAFI02000047
            GenomeReviews:CM000152_GR GO:GO:0006397 GO:GO:0005681 GO:GO:0003676
            GO:GO:0030532 eggNOG:NOG247734 KO:K12830 OMA:FDTIPVA
            RefSeq:XP_640132.1 STRING:Q54SA7 EnsemblProtists:DDB0233171
            GeneID:8623669 KEGG:ddi:DDB_G0282569 ProtClustDB:CLSZ2729005
            Uniprot:Q54SA7
        Length = 1256

 Score = 93 (37.8 bits), Expect = 0.00091, Sum P(2) = 0.00090
 Identities = 27/97 (27%), Positives = 50/97 (51%)

Query:   317 KIKMIYAKEQKGPVTAICHVAGFLVTAVGQKIYIWQLKDNDLTGIAFIDTEVYIASMVSV 376
             K++++Y  E + PV A+    G LV  VG+ I I+ +    L  +   +T+    ++V++
Sbjct:   975 KLELLYKTEVEEPVYAMAQFQGKLVCGVGKSIRIYDMGKKKL--LRKCETKNLPNTIVNI 1032

Query:   377 KNL---ILVGDYARSIALLRYQPEYRTLSLVARDYKP 410
              +L   ++VGD   SI  ++Y+     L + A D  P
Sbjct:  1033 HSLGDRLVVGDIQESIHFIKYKRSENMLYVFADDLAP 1069

 Score = 81 (33.6 bits), Expect = 0.00091, Sum P(2) = 0.00090
 Identities = 36/152 (23%), Positives = 69/152 (45%)

Query:   484 EARESNGG-HRLIKKTDFHLGQHVNTFFKIRCKPSSISDAPGARSRFLTWYASLDGALGF 542
             E+   NG  H+L    +F +G  V T  K     S +   P      +  Y ++ GA+G 
Sbjct:  1117 ESGTLNGAPHKLDHIANFFVGDTVTTLNKT----SLVVGGPE-----VILYTTISGAIGA 1167

Query:   543 FLPLPEK-NYRRLLMLQNVMVTHTSHTGGLNPRAFRTYKGKGYYAGNPSRGIIDGSLVWK 601
              +P   + +      L+  M +      G +  A+R+Y    Y+   P + IIDG L  +
Sbjct:  1168 LIPFTSREDVDFFSTLEMNMRSDCLPLCGRDHLAYRSY----YF---PVKNIIDGDLCEQ 1220

Query:   602 FLQLSLGERLEICKKIGSKHNDILDELYDIEA 633
             F  L+  ++L I +++    ++++ +L +I +
Sbjct:  1221 FSTLNYQKQLSISEELSRSPSEVIKKLEEIRS 1252


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.321   0.139   0.428    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      638       625   0.00090  120 3  11 22  0.37    34
                                                     36  0.45    37
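
Note on scores: the bit scores shown in parentheses next to each raw score in the alignments above appear to follow from the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2, using the gapped "As Used" values from the Q=9,R=2 row (Lambda = 0.244, K = 0.0300). The sketch below is illustrative only and is not part of the BLAST output; the function name bit_score is an assumption made for this example.

```python
import math

# Gapped Karlin-Altschul parameters copied from the "As Used" columns
# of the Q=9,R=2 row in the parameter table of this report.
LAMBDA = 0.244
K = 0.0300


def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score S to bits: (lam * S - ln k) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    # Raw scores taken from the "Score = ..." lines in this section.
    for raw in (127, 120, 106, 104, 93, 81, 69, 60, 54):
        print(f"Score = {raw} ({bit_score(raw):.1f} bits)")
```

Running the script on the raw scores listed in this section should reproduce the parenthesised values, for example Score = 106 (42.4 bits) and Score = 81 (33.6 bits). The ungapped row (Lambda = 0.321, K = 0.139) does not reproduce them, which suggests the gapped parameters were used for reporting.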


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  28
  No. of states in DFA:  620 (66 KB)
  Total size of DFA:  362 KB (2179 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  49.02u 0.09s 49.11t   Elapsed:  00:00:12
  Total cpu time:  49.03u 0.09s 49.12t   Elapsed:  00:00:12
  Start:  Thu Aug 15 11:36:39 2013   End:  Thu Aug 15 11:36:51 2013
