BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>014918
MPDNRQVKSNAVANQSADNIEEAIWRLKIHDNQEGGGVAQASPYPARPGEPDCLFYRRTG
LCGYGSNCRFNHPAYAAQGAQYREELPERNGQPDCGYYLKTGTCKYGSTCKYHHPKDRNG
AGPVSFNILGLPMRQDEKSCPYYMRTGSFLPSSGLQYAGSLPTWSLQRAPYLSSRLQGTQ
SYMPLIVSPSQGIVPAPGWNTYMGNIGPLSPTSIAGSNLIYSSRNQGDLGAGAQMHILSA
SSQNLPERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQSAASNIGPLGLPSRPGQAIC
SNYSMYGICKFGPTCRFDHPYAGYPINYGLSLPPLSILDSSLMNHQAISATHSIETSPDA
SSKIPNWVQNSDAVSVQHQNPDMKNSTTKNSDDSSKVDHPPHSVPNCSEPPHDQSN

High Scoring Gene Products

Symbol, full name | Information | P value
AT1G04990 | protein from Arabidopsis thaliana | 8.2e-83
ZFN3, AT5G16540 | protein from Arabidopsis thaliana | 5.1e-58
AT2G47850 | protein from Arabidopsis thaliana | 2.8e-50
ZFN1, AT3G02830 | protein from Arabidopsis thaliana | 8.8e-47
AT5G18550 | protein from Arabidopsis thaliana | 1.4e-46
AT3G06410 | protein from Arabidopsis thaliana | 1.8e-46
HUA1, ENHANCER OF AG-4 1 | protein from Arabidopsis thaliana | 2.8e-41
AT3G48440 | protein from Arabidopsis thaliana | 1.7e-36
AT1G48195 | protein from Arabidopsis thaliana | 3.5e-21
cth1 | gene_product from Danio rerio | 1.2e-08
AT1G29570 | protein from Arabidopsis thaliana | 1.1e-07
CPSF4, Cleavage and polyadenylation-specificity factor subunit 4 | protein from Homo sapiens | 7.2e-07
LOC100518830, Uncharacterized protein | protein from Sus scrofa | 9.1e-07
CPSF4, Cleavage and polyadenylation specificity factor subunit 4 | protein from Homo sapiens | 1.2e-06
CPSF4, Uncharacterized protein | protein from Gallus gallus | 1.4e-06
AT1G29560 | protein from Arabidopsis thaliana | 1.6e-06
CPSF4, Cleavage and polyadenylation specificity factor subunit 4 | protein from Bos taurus | 2.6e-06
Cpsf4, cleavage and polyadenylation specific factor 4 | gene from Rattus norvegicus | 2.6e-06
CPSF4, Uncharacterized protein | protein from Canis lupus familiaris | 4.0e-06
cpsf4, Cleavage and polyadenylation specificity factor subunit 4 | protein from Xenopus (Silurana) tropicalis | 1.4e-05
cpsf4, Cleavage and polyadenylation specificity factor subunit 4 | protein from Xenopus laevis | 1.4e-05
LOC100738395, Uncharacterized protein | protein from Sus scrofa | 2.0e-05
CPSF4, Cleavage and polyadenylation-specificity factor subunit 4 | protein from Homo sapiens | 5.7e-05
ccch-5 | gene from Caenorhabditis elegans | 6.6e-05
ccch-2 | gene from Caenorhabditis elegans | 0.00019
Y116A8C.20 | gene from Caenorhabditis elegans | 0.00021
CPSF4L, Putative cleavage and polyadenylation-specificity factor subunit 4-like protein | protein from Homo sapiens | 0.00022
CPSF4, Cleavage and polyadenylation-specificity factor subunit 4 | protein from Homo sapiens | 0.00033
Cpsf4, cleavage and polyadenylation specific factor 4 | protein from Mus musculus | 0.00033
CPSF4, Uncharacterized protein | protein from Canis lupus familiaris | 0.00034
F1LWJ4, Uncharacterized protein | protein from Rattus norvegicus | 0.00043
dct-13 | gene from Caenorhabditis elegans | 0.00068
cpsf4, cleavage and polyadenylation specificity factor 30 kDa subunit | gene from Dictyostelium discoideum | 0.00071
cpsf4, cleavage and polyadenylation specific factor 4 | gene_product from Danio rerio | 0.00080
CPSF4, Uncharacterized protein | protein from Canis lupus familiaris | 0.00088

The BLAST search returned 1 gene product that did not match your query constraints. See the full BLAST report below for details.


Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  014918
        (416 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2010562 - symbol:AT1G04990 species:3702 "Arabi...   830  8.2e-83   1
TAIR|locus:2171407 - symbol:ZFN3 "zinc finger nuclease 3"...   596  5.1e-58   1
TAIR|locus:2043368 - symbol:AT2G47850 species:3702 "Arabi...   523  2.8e-50   1
TAIR|locus:2075477 - symbol:ZFN1 "zinc finger protein 1" ...   490  8.8e-47   1
TAIR|locus:2182988 - symbol:AT5G18550 species:3702 "Arabi...   488  1.4e-46   1
TAIR|locus:2081066 - symbol:AT3G06410 species:3702 "Arabi...   487  1.8e-46   1
TAIR|locus:2087775 - symbol:HUA1 "ENHANCER OF AG-4 1" spe...   438  2.8e-41   1
TAIR|locus:2101170 - symbol:AT3G48440 species:3702 "Arabi...   393  1.7e-36   1
TAIR|locus:1006230718 - symbol:AT1G48195 species:3702 "Ar...   251  3.5e-21   1
ZFIN|ZDB-GENE-990806-20 - symbol:cth1 "cth1" species:7955...   116  1.2e-08   2
TAIR|locus:2013763 - symbol:AT1G29570 species:3702 "Arabi...   148  1.1e-07   1
UNIPROTKB|D4A905 - symbol:Cpsf4 "Cleavage and polyadenyla...   109  4.1e-07   2
UNIPROTKB|C9K0K2 - symbol:CPSF4 "Cleavage and polyadenyla...    91  7.2e-07   2
UNIPROTKB|F1REX3 - symbol:LOC100518830 "Uncharacterized p...   105  9.1e-07   3
UNIPROTKB|O95639 - symbol:CPSF4 "Cleavage and polyadenyla...   104  1.2e-06   3
UNIPROTKB|E1BV31 - symbol:CPSF4 "Uncharacterized protein"...   108  1.4e-06   2
TAIR|locus:2013758 - symbol:AT1G29560 species:3702 "Arabi...   142  1.6e-06   1
UNIPROTKB|O19137 - symbol:CPSF4 "Cleavage and polyadenyla...   104  2.6e-06   2
RGD|620440 - symbol:Cpsf4 "cleavage and polyadenylation s...   104  2.6e-06   2
UNIPROTKB|J9P398 - symbol:CPSF4 "Uncharacterized protein"...   104  4.0e-06   2
UNIPROTKB|Q66KE3 - symbol:cpsf4 "Cleavage and polyadenyla...   101  1.4e-05   2
UNIPROTKB|Q6DJP7 - symbol:cpsf4 "Cleavage and polyadenyla...   101  1.4e-05   2
UNIPROTKB|I3LCK9 - symbol:LOC100738395 "Uncharacterized p...   104  2.0e-05   3
POMBASE|SPAC227.08c - symbol:yth1 "mRNA cleavage and poly...   105  2.2e-05   2
UNIPROTKB|B7Z7B0 - symbol:CPSF4 "Cleavage and polyadenyla...   104  5.7e-05   2
WB|WBGene00013319 - symbol:ccch-5 species:6239 "Caenorhab...   118  6.6e-05   1
WB|WBGene00009537 - symbol:ccch-2 species:6239 "Caenorhab...   113  0.00019   1
WB|WBGene00013797 - symbol:Y116A8C.20 species:6239 "Caeno...   114  0.00021   1
UNIPROTKB|H9KVA5 - symbol:CPSF4L "Putative cleavage and p...    91  0.00022   2
UNIPROTKB|C9JEV9 - symbol:CPSF4 "Cleavage and polyadenyla...   113  0.00033   1
MGI|MGI:1861602 - symbol:Cpsf4 "cleavage and polyadenylat...   113  0.00033   1
UNIPROTKB|E2RBK7 - symbol:CPSF4 "Uncharacterized protein"...   113  0.00034   1
UNIPROTKB|F1LWJ4 - symbol:F1LWJ4 "Uncharacterized protein...    91  0.00043   2
WB|WBGene00013794 - symbol:dct-13 species:6239 "Caenorhab...   110  0.00068   1
DICTYBASE|DDB_G0270148 - symbol:cpsf4 "cleavage and polya...    90  0.00071   2
ZFIN|ZDB-GENE-990415-180 - symbol:cpsf4 "cleavage and pol...    92  0.00080   2
UNIPROTKB|E2RBM0 - symbol:CPSF4 "Uncharacterized protein"...    91  0.00088   2
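The one-line summaries above share a fixed layout: a database-prefixed identifier, a truncated description, then the high score, the smallest sum probability P(N), and N. A minimal Python sketch for reading such rows follows. The function name `parse_hit_row` and the regular expression are illustrative assumptions, not part of any BLAST distribution, and the pattern has only been checked against the rows in this report.

```python
import re

# Assumed row layout (inferred from this report, not from a formal spec):
#   DB|ACCESSION - description...   SCORE  P(N)  N
ROW = re.compile(
    r"^(?P<db>[A-Za-z]+)\|(?P<acc>\S+) - (?P<desc>.*?)\s+"
    r"(?P<score>\d+)\s+(?P<p>\d[\d.]*(?:e[-+]?\d+)?)\s+(?P<n>\d+)\s*$"
)

def parse_hit_row(line):
    """Return a dict for one summary row, or None if the line does not match."""
    m = ROW.match(line)
    if m is None:
        return None
    return {
        "db": m["db"],                # source database, e.g. TAIR or UNIPROTKB
        "accession": m["acc"],        # identifier after the '|'
        "description": m["desc"],     # truncated free text, kept verbatim
        "score": int(m["score"]),     # high score of the best HSP
        "p": float(m["p"]),           # smallest sum probability P(N)
        "n": int(m["n"]),             # number of HSPs combined into P(N)
    }
```

Rows that do not fit the pattern (blank lines, the column header) return `None`, so a caller can filter the whole block line by line.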


>TAIR|locus:2010562
            symbol:AT1G04990 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA;ISS] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0007623 "circadian rhythm" evidence=RCA] InterPro:IPR000571
            Pfam:PF00642 PROSITE:PS50103 SMART:SM00356 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0005634 GO:GO:0046872 GO:GO:0003677
            GO:GO:0008270 GO:GO:0003723 GO:GO:0090305 EMBL:AC004809
            GO:GO:0004518 HOGENOM:HOG000237733 EMBL:AY048253 EMBL:AY113065
            IPI:IPI00522113 PIR:F86183 RefSeq:NP_563725.1 RefSeq:NP_973759.1
            UniGene:At.21743 ProteinModelPortal:Q94AD9 SMR:Q94AD9 PaxDb:Q94AD9
            PRIDE:Q94AD9 EnsemblPlants:AT1G04990.1 EnsemblPlants:AT1G04990.2
            GeneID:839351 KEGG:ath:AT1G04990 TAIR:At1g04990 eggNOG:NOG290936
            InParanoid:Q94AD9 OMA:THQRISP PhylomeDB:Q94AD9
            ProtClustDB:CLSN2687681 Genevestigator:Q94AD9 GermOnline:AT1G04990
            Uniprot:Q94AD9
        Length = 404

 Score = 830 (297.2 bits), Expect = 8.2e-83, P = 8.2e-83
 Identities = 179/409 (43%), Positives = 237/409 (57%)

Query:     1 MPDNRQVKSNAVANQSADNIEEAIWRLKIHDNQEGGGVAQASPYPARPGEPDCLFYRRTG 60
             M D + V+S+ V+ +S+D IE+A  ++K+++     GV + +PYP RPGE DC FY RTG
Sbjct:     5 MSDTQHVQSSLVSIRSSDKIEDAFRKMKVNET----GVEELNPYPDRPGERDCQFYLRTG 60

Query:    61 LCGYGSNCRFNHPAYAAQG-AQYREELPERNGQPDCGYYLKTGTCKYGSTCKYHHPKDRN 119
             LCGYGS+CR+NHP +  Q  A Y+EELPER GQPDC Y+LKTG CKYG TCKYHHPKDRN
Sbjct:    61 LCGYGSSCRYNHPTHLPQDVAYYKEELPERIGQPDCEYFLKTGACKYGPTCKYHHPKDRN 120

Query:   120 GAGPVSFNILGLPMRQDEKSCPYYMRTGSFLPSSGLQYAGSLPTWSLQRAPYLSSRLQGT 179
             GA PV FN++GLPMR  EK CPYY+RTG+       ++    P      A  +SS     
Sbjct:   121 GAQPVMFNVIGLPMRLGEKPCPYYLRTGTCRFGVACKFHHPQPDNGHSTAYGMSSFPAAD 180

Query:   180 QSYMP-LIVSPSQGIVPAPGW-NTYMGNI-----GPLSPTS----IAGSNLIYSSRNQGD 228
               Y   L +  + G +P P    +Y+  +     G L P      +A SN +Y+ +NQ  
Sbjct:   181 LRYASGLTMMSTYGTLPRPQVPQSYVPILVSPSQGFLPPQGWAPYMAASNSMYNVKNQPY 240

Query:   229 L-GAGAQMHILSASSQNLPERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQSAASNIG 287
               G+ A M +  A ++ L E  DQP+CR++MNTGTCKYG DCK+ HP  RI+Q   S I 
Sbjct:   241 YSGSSASMAMAVALNRGLSESSDQPECRFFMNTGTCKYGDDCKYSHPGVRISQPPPSLIN 300

Query:   288 PLGLPSRPGQAICSNYSMYGICKFGPTCRFDHPYAGYPINYGXXXXXXXXXXXXXMNHQA 347
             P  LP+RPGQ  C N+  YG CKFGP C+FDHP   YP                   HQ 
Sbjct:   301 PFVLPARPGQPACGNFRSYGFCKFGPNCKFDHPMLPYP-GLTMATSLPTPFASPVTTHQR 359

Query:   348 ISATHSIETSPDASSKIPNWVQNSDAVSVQHQNPDMKNSTTKN-SDDSS 395
             IS T +   S   S+  P+  + S     + + PD  N   ++ S+D+S
Sbjct:   360 ISPTPNRSDSKSLSNGKPDVKKESS----ETEKPD--NGEVQDLSEDAS 402

 Score = 190 (71.9 bits), Expect = 3.4e-12, P = 3.4e-12
 Identities = 41/117 (35%), Positives = 63/117 (53%)

Query:   208 PLSPTSIAGSNLIYSSRNQGDL-GAGAQMHILSASSQNL---PERPDQPDCRYYMNTGTC 263
             P+S T    S+L+ S R+   +  A  +M +     + L   P+RP + DC++Y+ TG C
Sbjct:     4 PMSDTQHVQSSLV-SIRSSDKIEDAFRKMKVNETGVEELNPYPDRPGERDCQFYLRTGLC 62

Query:   264 KYGADCKFHHPKERIAQSAASNIGPLGLPSRPGQAICSNYSMYGICKFGPTCRFDHP 320
              YG+ C+++HP   + Q  A     L  P R GQ  C  +   G CK+GPTC++ HP
Sbjct:    63 GYGSSCRYNHPTH-LPQDVAYYKEEL--PERIGQPDCEYFLKTGACKYGPTCKYHHP 116
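
Each HSP in this report opens with two statistics lines: a `Score = ... Expect = ... P = ...` line (or `Sum P(n) = ...` when several HSPs are combined) and an `Identities = ... Positives = ...` line. The Python sketch below parses both and recomputes the percentages. Two points are assumptions rather than documented behavior: the percentages appear to be truncated to whole numbers (this reproduces the figures shown in the HSPs here), and P is taken to follow the usual Poisson relation P = 1 - exp(-E), which is indistinguishable from E at the small values reported. The function name `parse_hsp_header` is hypothetical.

```python
import math
import re

SCORE = re.compile(
    r"Score = (?P<score>\d+) \((?P<bits>[\d.]+) bits\), "
    r"Expect = (?P<expect>[\d.eE+-]+), "
    r"(?:Sum P\((?P<n>\d+)\)|P) = (?P<p>[\d.eE+-]+)"
)
IDENT = re.compile(
    r"Identities = (?P<ident>\d+)/(?P<length>\d+) \((?P<ipct>\d+)%\), "
    r"Positives = (?P<pos>\d+)/\d+ \((?P<ppct>\d+)%\)"
)

def parse_hsp_header(score_line, ident_line):
    """Parse the two HSP statistics lines; return None if either fails to match."""
    s = SCORE.search(score_line)
    d = IDENT.search(ident_line)
    if s is None or d is None:
        return None
    length = int(d["length"])
    expect = float(s["expect"])
    return {
        "score": int(s["score"]),
        "bits": float(s["bits"]),
        "expect": expect,
        "p": float(s["p"]),
        "sum_n": int(s["n"]) if s["n"] else 1,   # 1 when a plain "P =" is reported
        "reported_identity_pct": int(d["ipct"]),
        "reported_positive_pct": int(d["ppct"]),
        # Assumption: the report truncates rather than rounds.
        "identity_pct": 100 * int(d["ident"]) // length,
        "positive_pct": 100 * int(d["pos"]) // length,
        # Assumption: Poisson relation between Expect and P.
        "p_from_expect": -math.expm1(-expect),
    }
```

For the first HSP of this hit, 179/409 and 237/409 truncate to 43% and 57%, matching the reported values.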


>TAIR|locus:2171407
            symbol:ZFN3 "zinc finger nuclease 3" species:3702
            "Arabidopsis thaliana" [GO:0003676 "nucleic acid binding"
            evidence=IEA;ISS] [GO:0005634 "nucleus" evidence=ISM] [GO:0008270
            "zinc ion binding" evidence=IEA] [GO:0003677 "DNA binding"
            evidence=TAS] [GO:0004518 "nuclease activity" evidence=TAS]
            InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103 SMART:SM00356
            GO:GO:0005634 EMBL:CP002688 GenomeReviews:BA000015_GR GO:GO:0046872
            GO:GO:0003677 GO:GO:0008270 EMBL:AB005242 GO:GO:0004518
            HOGENOM:HOG000237733 EMBL:AF138872 EMBL:AY084634 EMBL:AY128342
            EMBL:BT000014 EMBL:BX831982 IPI:IPI00516322 IPI:IPI00528450
            IPI:IPI00528912 RefSeq:NP_568332.2 RefSeq:NP_851041.1
            RefSeq:NP_974790.1 UniGene:At.21711 ProteinModelPortal:Q8L7N8
            SMR:Q8L7N8 EnsemblPlants:AT5G16540.1 GeneID:831516
            KEGG:ath:AT5G16540 GeneFarm:4900 TAIR:At5g16540 eggNOG:NOG281021
            InParanoid:Q8L7N8 OMA:SAGNQGM PhylomeDB:Q8L7N8
            ProtClustDB:CLSN2690167 Genevestigator:Q8L7N8 Uniprot:Q8L7N8
        Length = 375

 Score = 596 (214.9 bits), Expect = 5.1e-58, P = 5.1e-58
 Identities = 124/318 (38%), Positives = 180/318 (56%)

Query:    21 EEAIWRLKIHDNQEGGGVAQASPYPARPGEPDCLFYRRTGLCGYGSNCRFNHP---AYAA 77
             + A+W++ +  +   G     S YP R GEPDC +Y RTGLC +GS CRFNHP       
Sbjct:    19 QNAMWQMNLGSDDTMG--VDGS-YPERHGEPDCAYYIRTGLCRFGSTCRFNHPHDRKLVI 75

Query:    78 QGAQYREELPERNGQPDCGYYLKTGTCKYGSTCKYHHPKDRNGA-GPVSFNILGLPMRQD 136
               A+ + E PER GQP+C +YLKTGTCK+G TCK+HHP+++ G  G VS N+L  P+R +
Sbjct:    76 ATARIKGEYPERIGQPECEFYLKTGTCKFGVTCKFHHPRNKAGIDGSVSVNVLSYPLRPN 135

Query:   137 EKSCPYYMRTGSFLPSSGLQYAGSLPTWSLQ-RAPYLSSRLQGTQSYMPLIVSPSQGIVP 195
             E  C Y++R G        ++ G+      Q ++  L   ++G+  Y  L     Q   P
Sbjct:   136 EDDCSYFLRIGQ------CKFGGTCKFNHPQTQSTNLMVSVRGSPVYSALQSLTGQ---P 186

Query:   196 APGWN--TYMGNIGPLS-PTSIAGSNL--IYSSRNQGDLGAGAQMHILSASSQNL-PERP 249
             +  W+  +++ N   L  P+  A  +   ++SS      G    +   +   +N+ PERP
Sbjct:   187 SYSWSRTSFVANPPRLQDPSGFASGSQGGLFSSGFHS--GNSVPLGFYALPRENVFPERP 244

Query:   250 DQPDCRYYMNTGTCKYGADCKFHHPKERIAQSAASNIGPLGLPSRPGQAICSNYSMYGIC 309
              QP+C++YM TG CK+G  CKFHHP++R        +  +GLP RPG+ +C  YS YGIC
Sbjct:   245 GQPECQFYMKTGDCKFGTVCKFHHPRDRQTPPPDCVLSSVGLPLRPGEPLCVFYSRYGIC 304

Query:   310 KFGPTCRFDHPYAGYPIN 327
             KFGP+C+FDHP   +  N
Sbjct:   305 KFGPSCKFDHPMRVFTYN 322


>TAIR|locus:2043368
            symbol:AT2G47850 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA;ISS] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0008270 "zinc ion binding" evidence=IEA]
            InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103 SMART:SM00356
            GO:GO:0005634 EMBL:CP002685 GenomeReviews:CT485783_GR GO:GO:0046872
            GO:GO:0003677 GO:GO:0008270 EMBL:AC005309 EMBL:BT030391
            EMBL:BT004106 IPI:IPI00519400 PIR:C84920 RefSeq:NP_001078078.1
            RefSeq:NP_182306.2 UniGene:At.21006 ProteinModelPortal:Q84W91
            SMR:Q84W91 PaxDb:Q84W91 PRIDE:Q84W91 EnsemblPlants:AT2G47850.1
            EnsemblPlants:AT2G47850.3 GeneID:819397 KEGG:ath:AT2G47850
            TAIR:At2g47850 eggNOG:NOG312935 HOGENOM:HOG000237733
            InParanoid:Q84W91 OMA:RYGVACK PhylomeDB:Q84W91
            ProtClustDB:CLSN2680305 Genevestigator:Q84W91 GermOnline:AT2G47850
            Uniprot:Q84W91
        Length = 468

 Score = 523 (189.2 bits), Expect = 2.8e-50, P = 2.8e-50
 Identities = 119/302 (39%), Positives = 162/302 (53%)

Query:    38 VAQASPYPARPGEPDCLFYRRTGLCGYGSNCRFNHPAYAAQGAQYREEL-----PERNGQ 92
             V     YP R GEP C FY +TG C +G++C+F+HP  A  G+     L     P R G 
Sbjct:    81 VRATGQYPERFGEPPCQFYLKTGTCKFGASCKFHHPKNAG-GSMSHVPLNIYGYPVREGD 139

Query:    93 PDCGYYLKTGTCKYGSTCKYHHPKDRNGAGPVSFNILGLPMRQDEKSCP-YYMRTGSFLP 151
              +C YYLKTG CK+G TCK+HHP+      P    +   P      S P +Y    S +P
Sbjct:   140 NECSYYLKTGQCKFGITCKFHHPQ------PAGTTVPPPPA-----SAPQFYPSVQSLMP 188

Query:   152 SSGLQYAGSLPTWSLQRAPYL--SSRLQGTQSYMPLIVSPSQGIVPAPGWNTYMGNIGP- 208
                 QY G  P+ SL+ A  L   S +QG  +Y P++++P  G+VP PGW+ Y   + P 
Sbjct:   189 D---QYGG--PSSSLRVARTLLPGSYMQG--AYGPMLLTP--GVVPIPGWSPYSAPVSPA 239

Query:   209 LSP--------TSIAGSNLIYSSRNQ--GDLGAGAQMHILSASSQNLPERPDQPDCRYYM 258
             LSP        TS+ G   + S+     G   + +    +    Q  PERP +P+C+YY+
Sbjct:   240 LSPGAQHAVGATSLYGVTQLTSTTPSLPGVYPSLSSPTGVIQKEQAFPERPGEPECQYYL 299

Query:   259 NTGTCKYGADCKFHHPKERIAQSAASNIGPLGLPSRPGQAICSNYSMYGICKFGPTCRFD 318
              TG CK+G  CKFHHP++R+   A   + P+GLP RPG   C+ Y   G CKFG TC+FD
Sbjct:   300 KTGDCKFGTSCKFHHPRDRVPPRANCVLSPIGLPLRPGVQRCTFYVQNGFCKFGSTCKFD 359

Query:   319 HP 320
             HP
Sbjct:   360 HP 361

 Score = 356 (130.4 bits), Expect = 5.2e-41, Sum P(2) = 5.2e-41
 Identities = 95/243 (39%), Positives = 130/243 (53%)

Query:    17 ADN-IEEAIWRLKIHDNQEGGGVAQASPYPARPGEPDCLFYRRTGLCGYGSNCRFNHPA- 74
             AD  ++E++WRL         G+   S YP RPG PDC +Y RTG+CGYG+ CR+NHP  
Sbjct:    24 ADTGLQESMWRL---------GLGSDS-YPERPGAPDCAYYMRTGVCGYGNRCRYNHPRD 73

Query:    75 YAAQGAQYRE--ELPERNGQPDCGYYLKTGTCKYGSTCKYHHPKDRNGA-GPVSFNILGL 131
              A+  A  R   + PER G+P C +YLKTGTCK+G++CK+HHPK+  G+   V  NI G 
Sbjct:    74 RASVEATVRATGQYPERFGEPPCQFYLKTGTCKFGASCKFHHPKNAGGSMSHVPLNIYGY 133

Query:   132 PMRQDEKSCPYYMRTGSFLPSSGLQYAGSLPTWSLQRAPYLSSR--LQGTQSYMP-LIVS 188
             P+R+ +  C YY++TG        ++    P  +    P  S+       QS MP     
Sbjct:   134 PVREGDNECSYYLKTGQCKFGITCKFHHPQPAGTTVPPPPASAPQFYPSVQSLMPDQYGG 193

Query:   189 PSQGIVPA----PGWNTYM-GNIGP--LSP--TSIAGSNLIYSSRNQGDLGAGAQMHILS 239
             PS  +  A    PG  +YM G  GP  L+P    I G +  YS+     L  GAQ H + 
Sbjct:   194 PSSSLRVARTLLPG--SYMQGAYGPMLLTPGVVPIPGWSP-YSAPVSPALSPGAQ-HAVG 249

Query:   240 ASS 242
             A+S
Sbjct:   250 ATS 252

 Score = 211 (79.3 bits), Expect = 2.0e-14, P = 2.0e-14
 Identities = 40/94 (42%), Positives = 51/94 (54%)

Query:   228 DLGAGAQMHILSASSQNLPERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQSAASNIG 287
             D G    M  L   S + PERP  PDC YYM TG C YG  C+++HP++R   S  + + 
Sbjct:    25 DTGLQESMWRLGLGSDSYPERPGAPDCAYYMRTGVCGYGNRCRYNHPRDRA--SVEATVR 82

Query:   288 PLG-LPSRPGQAICSNYSMYGICKFGPTCRFDHP 320
               G  P R G+  C  Y   G CKFG +C+F HP
Sbjct:    83 ATGQYPERFGEPPCQFYLKTGTCKFGASCKFHHP 116

 Score = 190 (71.9 bits), Expect = 4.9e-12, P = 4.9e-12
 Identities = 36/102 (35%), Positives = 56/102 (54%)

Query:    36 GGVAQASPYPARPGEPDCLFYRRTGLCGYGSNCRFNHPAYA----AQGAQYREELPERNG 91
             G + +   +P RPGEP+C +Y +TG C +G++C+F+HP       A        LP R G
Sbjct:   278 GVIQKEQAFPERPGEPECQYYLKTGDCKFGTSCKFHHPRDRVPPRANCVLSPIGLPLRPG 337

Query:    92 QPDCGYYLKTGTCKYGSTCKYHHPKDRNGAGPVSFNILGLPM 133
                C +Y++ G CK+GSTCK+ HP       P + ++   P+
Sbjct:   338 VQRCTFYVQNGFCKFGSTCKFDHPMGTIRYNPSASSLADAPV 379

 Score = 96 (38.9 bits), Expect = 5.2e-41, Sum P(2) = 5.2e-41
 Identities = 18/43 (41%), Positives = 26/43 (60%)

Query:   245 LPERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQS-AASNI 286
             LP RP    C +Y+  G CK+G+ CKF HP   I  + +AS++
Sbjct:   332 LPLRPGVQRCTFYVQNGFCKFGSTCKFDHPMGTIRYNPSASSL 374


>TAIR|locus:2075477
            symbol:ZFN1 "zinc finger protein 1" species:3702
            "Arabidopsis thaliana" [GO:0003676 "nucleic acid binding"
            evidence=IEA;ISS] [GO:0005634 "nucleus" evidence=ISM] [GO:0008270
            "zinc ion binding" evidence=IEA] [GO:0003677 "DNA binding"
            evidence=TAS] [GO:0004518 "nuclease activity" evidence=TAS]
            [GO:0005829 "cytosol" evidence=IDA] [GO:0010313 "phytochrome
            binding" evidence=IPI] [GO:0017148 "negative regulation of
            translation" evidence=IMP] [GO:0048027 "mRNA 5'-UTR binding"
            evidence=IPI] InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103
            SMART:SM00356 GO:GO:0005829 GO:GO:0005634 EMBL:CP002686
            GenomeReviews:BA000014_GR GO:GO:0017148 GO:GO:0046872 GO:GO:0003677
            GO:GO:0008270 GO:GO:0048027 GO:GO:0004518 HOGENOM:HOG000237733
            EMBL:AF138743 EMBL:AC018363 EMBL:AK117978 EMBL:BT025966
            IPI:IPI00539955 PIR:T48874 RefSeq:NP_566183.1 UniGene:At.23706
            ProteinModelPortal:Q8GXX7 SMR:Q8GXX7 STRING:Q8GXX7 PaxDb:Q8GXX7
            PRIDE:Q8GXX7 EnsemblPlants:AT3G02830.1 GeneID:821230
            KEGG:ath:AT3G02830 GeneFarm:4898 TAIR:At3g02830 eggNOG:NOG329662
            InParanoid:Q8GXX7 OMA:SSDDQQR PhylomeDB:Q8GXX7
            ProtClustDB:CLSN2917075 Genevestigator:Q8GXX7 GermOnline:AT3G02830
            Uniprot:Q8GXX7
        Length = 397

 Score = 490 (177.5 bits), Expect = 8.8e-47, P = 8.8e-47
 Identities = 124/321 (38%), Positives = 167/321 (52%)

Query:    21 EEAIWRLKIHDNQEGGGVAQASPYPARPGEPDCLFYRRTGLCGYGSNCRFNHPA---YAA 77
             ++A+W++ +  ++      +   YP RPGEPDC +Y RTGLC +GS CRFNHP       
Sbjct:    18 QDAMWQMNLSSDE----TMETGSYPERPGEPDCSYYIRTGLCRFGSTCRFNHPRDRELVI 73

Query:    78 QGAQYREELPERNGQPDCGYYLKTGTCKYGSTCKYHHPKDRNG-AGPVSFNILGLPMRQD 136
               A+ R E PER GQP+C YYLKTGTCK+G TCK+HHP+++ G AG VS N+LG P+R +
Sbjct:    74 ATARMRGEYPERIGQPECEYYLKTGTCKFGVTCKFHHPRNKAGIAGRVSLNMLGYPLRSN 133

Query:   137 EKSCPYYMRTG--SFLPSSGLQYAGSLPT-------------WSLQRAPYLSS-RLQGTQ 180
             E  C Y++RTG   F  +    +    PT             WS  RA +++S R Q   
Sbjct:   134 EVDCAYFLRTGHCKFGGTCKFNHPQPQPTNMMVPTSGQQSYPWS--RASFIASPRWQDPS 191

Query:   181 SYMPLIVSPSQGIVPAPGWNTYMGNIGPLSPTSIAGSNLIYSSRNQGD-LGAGAQMHILS 239
             SY  LI+ P QG+VP  GWN Y G +G +SP+   G++  Y +  Q + + +G+Q    S
Sbjct:   192 SYASLIM-P-QGVVPVQGWNPYSGQLGSVSPSG-TGNDQNYRNLQQNETIESGSQSQG-S 247

Query:   240 ASSQNLPERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQSAASNIGPLGLPSRPGQAI 299
              S  N P     P   YY                P+E +             P RPGQ  
Sbjct:   248 FSGYN-PGS-SVPLGGYYAL--------------PRENV------------FPERPGQPE 279

Query:   300 CSNYSMYGICKFGPTCRFDHP 320
             C  Y   G CKFG  C+F HP
Sbjct:   280 CQFYMKTGDCKFGTVCKFHHP 300

 Score = 447 (162.4 bits), Expect = 3.2e-42, P = 3.2e-42
 Identities = 112/301 (37%), Positives = 160/301 (53%)

Query:    44 YPARPGEPDCLFYRRTGLCGYGSNCRFNHPAYAAQGAQYREEL-----PERNGQPDCGYY 98
             YP R G+P+C +Y +TG C +G  C+F+HP   A G   R  L     P R+ + DC Y+
Sbjct:    82 YPERIGQPECEYYLKTGTCKFGVTCKFHHPRNKA-GIAGRVSLNMLGYPLRSNEVDCAYF 140

Query:    99 LKTGTCKYGSTCKYHHPKDRNGAGPVSFNILGLPMRQDEKSCPYYMRTGSFLPSSGLQYA 158
             L+TG CK+G TCK++HP+ +                      P    T   +P+SG Q  
Sbjct:   141 LRTGHCKFGGTCKFNHPQPQ----------------------P----TNMMVPTSGQQ-- 172

Query:   159 GSLPTWSLQRAPYLSS-RLQGTQSYMPLI----VSPSQGIVPAPGWNTYMGNIGPLSP-- 211
              S P WS  RA +++S R Q   SY  LI    V P QG  P  G    +G++ P     
Sbjct:   173 -SYP-WS--RASFIASPRWQDPSSYASLIMPQGVVPVQGWNPYSG---QLGSVSPSGTGN 225

Query:   212 ----TSIAGSNLIYS-SRNQGDLGA---GAQMHI---LSASSQNL-PERPDQPDCRYYMN 259
                  ++  +  I S S++QG       G+ + +    +   +N+ PERP QP+C++YM 
Sbjct:   226 DQNYRNLQQNETIESGSQSQGSFSGYNPGSSVPLGGYYALPRENVFPERPGQPECQFYMK 285

Query:   260 TGTCKYGADCKFHHPKERIAQSAASNIGPLGLPSRPGQAICSNYSMYGICKFGPTCRFDH 319
             TG CK+G  CKFHHP++R A      +  +GLP RPG+ +C  Y+ YGICKFGP+C+FDH
Sbjct:   286 TGDCKFGTVCKFHHPRDRQAPPPDCLLSSIGLPLRPGEPLCVFYTRYGICKFGPSCKFDH 345

Query:   320 P 320
             P
Sbjct:   346 P 346


>TAIR|locus:2182988
            symbol:AT5G18550 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA;ISS] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=RCA] InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103
            SMART:SM00356 GO:GO:0005634 EMBL:CP002688 GenomeReviews:BA000015_GR
            GO:GO:0046872 GO:GO:0003677 GO:GO:0008270 eggNOG:NOG312935
            HOGENOM:HOG000237733 ProtClustDB:CLSN2681554 EMBL:AC069328
            EMBL:BT010886 EMBL:AK230175 IPI:IPI00533261 RefSeq:NP_197356.2
            UniGene:At.22535 ProteinModelPortal:Q6NPN3 SMR:Q6NPN3 STRING:Q6NPN3
            PaxDb:Q6NPN3 PRIDE:Q6NPN3 EnsemblPlants:AT5G18550.1 GeneID:831973
            KEGG:ath:AT5G18550 TAIR:At5g18550 InParanoid:Q6NPN3 OMA:GSQPCAY
            PhylomeDB:Q6NPN3 Genevestigator:Q6NPN3 GermOnline:AT5G18550
            Uniprot:Q6NPN3
        Length = 465

 Score = 488 (176.8 bits), Expect = 1.4e-46, P = 1.4e-46
 Identities = 105/309 (33%), Positives = 157/309 (50%)

Query:    35 GGGVAQASPYPARPGEPDCLFYRRTGLCGYGSNCRFNHPAYAAQGAQYREE------LPE 88
             GG   +A  +P R G+P C  + RTG C +G++C+++HP     G             P 
Sbjct:    85 GGLRTEAGEFPERMGQPVCQHFMRTGTCKFGASCKYHHPRQGGGGDSVTPVSLNYMGFPL 144

Query:    89 RNGQPDCGYYLKTGTCKYGSTCKYHHPKDRNGAGPVSFNILGLPMRQDEKSCP--YYMRT 146
             R G+ +C Y+++TG CK+GSTC+YHHP       P          +Q   + P  Y    
Sbjct:   145 RPGEKECSYFMRTGQCKFGSTCRYHHPVPPGVQAPSQ------QQQQQLSAGPTMYPSLQ 198

Query:   147 GSFLPSSGLQYAGSLPTWSLQRAPYLSSRLQGTQSYMPLIVSPSQGIVPAPGWNTYMGNI 206
                +PSS  QY   L    L    Y+    Q    Y  +++ P  G+VP  GWN Y  ++
Sbjct:   199 SQTVPSSQ-QYGVVLARPQLLPGSYV----QSPYGYGQMVLPP--GMVPYSGWNPYQASV 251

Query:   207 GPL-SPTS--IAGSNLIYS----SRNQGDLGAGAQMHILSASSQNLPERPDQPDCRYYMN 259
               + SP +    G++ +Y     S +     +G     +S   Q  P+RP+QP+C+Y+M 
Sbjct:   252 SAMPSPGTQPSMGTSSVYGITPLSPSAPAYQSGPSSTGVSNKEQTFPQRPEQPECQYFMR 311

Query:   260 TGTCKYGADCKFHHPKERIAQSAASNIGPLGLPSRPGQAICSNYSMYGICKFGPTCRFDH 319
             TG CK+G  C+FHHP E  A   AS +  +GLP RPG   C++++ +GICKFGP C+FDH
Sbjct:   312 TGDCKFGTSCRFHHPMEA-ASPEASTLSHIGLPLRPGAVPCTHFAQHGICKFGPACKFDH 370

Query:   320 PYAGYPINY 328
                   ++Y
Sbjct:   371 SLGSSSLSY 379

 Score = 433 (157.5 bits), Expect = 9.6e-41, P = 9.6e-41
 Identities = 122/336 (36%), Positives = 159/336 (47%)

Query:     3 DNRQVKSNAVANQSADN-IEEAIWRLKIHDNQEGGGVAQASPYPARPGEPDCLFYRRTGL 61
             ++R   S+  + Q  +  IE ++WRL +     GGG      +P RP EPDC++Y RTG+
Sbjct:    11 ESRSDPSHEWSAQGTETGIEASMWRLGLRGG--GGG---GETFPERPDEPDCIYYLRTGV 65

Query:    62 CGYGSNCRFNHPAYAAQ--GAQYRE--ELPERNGQPDCGYYLKTGTCKYGSTCKYHHPKD 117
             CGYGS CRFNHP   A   G    E  E PER GQP C ++++TGTCK+G++CKYHHP+ 
Sbjct:    66 CGYGSRCRFNHPRNRAPVLGGLRTEAGEFPERMGQPVCQHFMRTGTCKFGASCKYHHPRQ 125

Query:   118 RNGAG---PVSFNILGLPMRQDEKSCPYYMRTGSFLPSSGLQYAGSLPTWSLQRAPYLSS 174
               G     PVS N +G P+R  EK C Y+MRTG     S  +Y   +P   +Q AP  S 
Sbjct:   126 GGGGDSVTPVSLNYMGFPLRPGEKECSYFMRTGQCKFGSTCRYHHPVPP-GVQ-AP--SQ 181

Query:   175 RLQGTQSYMPLIVSPSQGIVPAPGWNTYMGNIGPLSPTSIAGSNLIYSSRNQGDLGAGAQ 234
             + Q   S  P +  PS      P    Y   +    P  + GS  + S    G +     
Sbjct:   182 QQQQQLSAGPTMY-PSLQSQTVPSSQQY--GVVLARPQLLPGS-YVQSPYGYGQMVLPPG 237

Query:   235 MHILS------ASSQNLPERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQSAASNIGP 288
             M   S      AS   +P    QP     M T +  YG       P     QS  S+ G 
Sbjct:   238 MVPYSGWNPYQASVSAMPSPGTQPS----MGTSSV-YGITPL--SPSAPAYQSGPSSTGV 290

Query:   289 LG----LPSRPGQAICSNYSMYGICKFGPTCRFDHP 320
                    P RP Q  C  +   G CKFG +CRF HP
Sbjct:   291 SNKEQTFPQRPEQPECQYFMRTGDCKFGTSCRFHHP 326

 Score = 215 (80.7 bits), Expect = 7.0e-15, P = 7.0e-15
 Identities = 34/81 (41%), Positives = 47/81 (58%)

Query:   243 QNLPERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQSAASNIGPLGLPSRPGQAICSN 302
             +  PERPD+PDC YY+ TG C YG+ C+F+HP+ R              P R GQ +C +
Sbjct:    46 ETFPERPDEPDCIYYLRTGVCGYGSRCRFNHPRNRAPVLGGLRTEAGEFPERMGQPVCQH 105

Query:   303 YSMYGICKFGPTCRFDHPYAG 323
             +   G CKFG +C++ HP  G
Sbjct:   106 FMRTGTCKFGASCKYHHPRQG 126

 Score = 162 (62.1 bits), Expect = 6.8e-09, P = 6.8e-09
 Identities = 33/107 (30%), Positives = 59/107 (55%)

Query:    73 PAY----AAQGAQYREE-LPERNGQPDCGYYLKTGTCKYGSTCKYHHPKDRNGAGPVSFN 127
             PAY    ++ G   +E+  P+R  QP+C Y+++TG CK+G++C++HHP +       + +
Sbjct:   279 PAYQSGPSSTGVSNKEQTFPQRPEQPECQYFMRTGDCKFGTSCRFHHPMEAASPEASTLS 338

Query:   128 ILGLPMRQDEKSCPYYMRTGSFLPSSGLQYAGSLPTWSLQRAPYLSS 174
              +GLP+R     C ++ + G        ++  SL + SL  +P  SS
Sbjct:   339 HIGLPLRPGAVPCTHFAQHGICKFGPACKFDHSLGSSSLSYSPSPSS 385


>TAIR|locus:2081066
            symbol:AT3G06410 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA;ISS] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=RCA] InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103
            SMART:SM00356 GO:GO:0005634 EMBL:CP002686 GenomeReviews:BA000014_GR
            GO:GO:0046872 GO:GO:0003677 GO:GO:0008270 EMBL:AC011623
            eggNOG:NOG312935 HOGENOM:HOG000237733 EMBL:AK230312 EMBL:AK230438
            IPI:IPI00535086 RefSeq:NP_187292.2 UniGene:At.27771
            ProteinModelPortal:Q9SQU4 SMR:Q9SQU4 EnsemblPlants:AT3G06410.1
            GeneID:819815 KEGG:ath:AT3G06410 TAIR:At3g06410 InParanoid:Q9SQU4
            OMA:SSQQYGL PhylomeDB:Q9SQU4 ProtClustDB:CLSN2681554
            Genevestigator:Q9SQU4 GermOnline:AT3G06410 Uniprot:Q9SQU4
        Length = 462

 Score = 487 (176.5 bits), Expect = 1.8e-46, P = 1.8e-46
 Identities = 106/299 (35%), Positives = 153/299 (51%)

Query:    36 GGV-AQASPYPARPGEPDCLFYRRTGLCGYGSNCRFNHPAYAAQGAQYREE------LPE 88
             GGV  +A   P R G P C  + RTG C +G++C+++HP     G             P 
Sbjct:    88 GGVRGEAGALPERMGHPVCQHFMRTGTCKFGASCKYHHPRQGGGGGSVAPVSLSYLGYPL 147

Query:    89 RNGQPDCGYYLKTGTCKYGSTCKYHHPKDRNGAGPVSFNILGLPMRQDEKSCPYYMRTGS 148
             R G+ +C YYL+TG CK+G TC+++HP      GP        P  Q +    Y      
Sbjct:   148 RPGEKECSYYLRTGQCKFGLTCRFNHPVPLAVQGPPQQPQQQQPQPQPQLQTIYPTLQSQ 207

Query:   149 FLPSSGLQYAGSLPTWSLQRAPYLSSRLQGTQSYMPLIVSPSQGIVPAPGWNTYMGNIGP 208
              +PSS  QY   L   S     YL S       Y P +V P  G+VP  GWN Y  ++  
Sbjct:   208 SIPSSQ-QYGLVLTRPSFLTGSYLQS------PYGPPMVLPP-GMVPYSGWNPYQASLSA 259

Query:   209 L-SPTS--IAGSNLIY-----SSRNQGDLGAGAQMHILSASSQNLPERPDQPDCRYYMNT 260
             + SP +    GS+ IY     S       G    +   +++S+  P+RPDQP+C+Y+M T
Sbjct:   260 MPSPGTQPSIGSSSIYGLTPLSPSATAYTGTYQSVPSSNSTSKEFPQRPDQPECQYFMRT 319

Query:   261 GTCKYGADCKFHHPKERIAQSAASNIGPLGLPSRPGQAICSNYSMYGICKFGPTCRFDH 319
             G CK+G+ C++HHP + +       +  +GLP RPG A C++++ +GICKFGP C+FDH
Sbjct:   320 GDCKFGSSCRYHHPVDAVPPKTGIVLSSIGLPLRPGVAQCTHFAQHGICKFGPACKFDH 378

 Score = 416 (151.5 bits), Expect = 6.1e-39, P = 6.1e-39
 Identities = 108/321 (33%), Positives = 146/321 (45%)

Query:    20 IEEAIWRLKIHDNQEGGGVAQASPYPARPGEPDCLFYRRTGLCGYGSNCRFNHPAY--AA 77
             +E  +WRL +     GGG  ++  YP RP EPDC++Y RTG+CGYGS CRFNHP    A 
Sbjct:    29 VEAPMWRLGLSGGGGGGGGGES--YPERPDEPDCIYYLRTGVCGYGSRCRFNHPRDRGAV 86

Query:    78 QGAQYREE--LPERNGQPDCGYYLKTGTCKYGSTCKYHHPKDRNGAG---PVSFNILGLP 132
              G    E   LPER G P C ++++TGTCK+G++CKYHHP+   G G   PVS + LG P
Sbjct:    87 IGGVRGEAGALPERMGHPVCQHFMRTGTCKFGASCKYHHPRQGGGGGSVAPVSLSYLGYP 146

Query:   133 MRQDEKSCPYYMRTGSFLPSSGLQYAGSLPTWSLQRAPYLSSRLQGTQSYMPLIVSPSQG 192
             +R  EK C YY+RTG        ++   +P  ++Q  P    + Q         + P+  
Sbjct:   147 LRPGEKECSYYLRTGQCKFGLTCRFNHPVPL-AVQGPPQQPQQQQPQPQPQLQTIYPTLQ 205

Query:   193 IVPAPGWNTYMGNIGPLSPTSIAGSNLIYSSRNQGDLGAG----AQMHILSASSQNLPER 248
                 P    Y G +    P+ + GS L         L  G    +  +   AS   +P  
Sbjct:   206 SQSIPSSQQY-GLV-LTRPSFLTGSYLQSPYGPPMVLPPGMVPYSGWNPYQASLSAMPSP 263

Query:   249 PDQPDCRYYMNTGTCKYGADCKFHHPKERIAQSAASNIGPLGLPSRPGQAICSNYSMYGI 308
               QP        G          +    +   S  SN      P RP Q  C  +   G 
Sbjct:   264 GTQPSIGSSSIYGLTPLSPSATAYTGTYQSVPS--SNSTSKEFPQRPDQPECQYFMRTGD 321

Query:   309 CKFGPTCRFDHPYAGYPINYG 329
             CKFG +CR+ HP    P   G
Sbjct:   322 CKFGSSCRYHHPVDAVPPKTG 342


>TAIR|locus:2087775
            symbol:HUA1 "ENHANCER OF AG-4 1" species:3702
            "Arabidopsis thaliana" [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0005634 "nucleus" evidence=ISM;IDA;TAS]
            [GO:0008270 "zinc ion binding" evidence=IEA] [GO:0001709 "cell fate
            determination" evidence=TAS] [GO:0003723 "RNA binding"
            evidence=ISS;IDA] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=RCA] InterPro:IPR000571 Pfam:PF00642
            PROSITE:PS50103 SMART:SM00356 GO:GO:0005634 EMBL:CP002686
            GenomeReviews:BA000014_GR GO:GO:0046872 GO:GO:0003677 GO:GO:0016607
            GO:GO:0008270 GO:GO:0006397 GO:GO:0003723 GO:GO:0009908
            EMBL:AB024033 GO:GO:0001709 EMBL:AY024357 EMBL:AC069474
            EMBL:AK229145 IPI:IPI00536814 RefSeq:NP_187874.2 UniGene:At.5670
            ProteinModelPortal:Q941Q3 SMR:Q941Q3 STRING:Q941Q3 PaxDb:Q941Q3
            PRIDE:Q941Q3 EnsemblPlants:AT3G12680.1 GeneID:820448
            KEGG:ath:AT3G12680 TAIR:At3g12680 eggNOG:NOG250655
            HOGENOM:HOG000078745 InParanoid:Q941Q3 OMA:LGAHNTI PhylomeDB:Q941Q3
            ProtClustDB:CLSN2690537 Genevestigator:Q941Q3 Uniprot:Q941Q3
        Length = 524

 Score = 438 (159.2 bits), Expect = 2.8e-41, P = 2.8e-41
 Identities = 105/299 (35%), Positives = 151/299 (50%)

Query:    44 YPARPGEPDCLFYRRTGLCGYGSNCRFNHPAY-AAQGAQYREELPERNGQPDCGYYLKTG 102
             YP RPGEPDC +Y +T  C YGS C+FNHP   AA   + ++ LPER  +P C +Y+KTG
Sbjct:   222 YPERPGEPDCPYYIKTQRCKYGSKCKFNHPREEAAVSVETQDSLPERPSEPMCTFYMKTG 281

Query:   103 TCKYGSTCKYHHPKD---RNGAGPVSFNILGLPMRQDEKSCPYYMRTGSFLPSS-GLQY- 157
              CK+G +CK+HHPKD    + +  +  ++ GL    D  + P+   T +   +S GL   
Sbjct:   282 KCKFGLSCKFHHPKDIQLPSSSQDIGSSV-GLTSEPDATNNPHVTFTPALYHNSKGLPVR 340

Query:   158 AGS------LPTWSLQRAPYLSSRLQGTQSYMPLIVSPSQGIVPAPGWNTYMGNIGPLSP 211
             +G       L T S +             +++P     +  +V +   NT   N+G ++P
Sbjct:   341 SGEVDCPFYLKTGSCKYGATCRYNHPERTAFIPQAAGVNYSLVSS---NTANLNLGLVTP 397

Query:   212 TSIAGSNLIYSSRNQGDLGAGAQMHILSASSQNLPERPDQPDCRYYMNTGTCKYGADCKF 271
              +       Y +  Q  LG      ++SA+    P+RP Q +C YYM TG CK+G  CKF
Sbjct:   398 ATS-----FYQTLTQPTLG------VISAT---YPQRPGQSECDYYMKTGECKFGERCKF 443

Query:   272 HHPKERIA-------QSAASNIGPLGLPSRPGQAICSNYSMYGICKFGPTCRFDHPYAG 323
             HHP +R++       Q     +   G P R G   C  Y   G CK+G TC+FDHP  G
Sbjct:   444 HHPADRLSAMTKQAPQQPNVKLSLAGYPRREGALNCPYYMKTGTCKYGATCKFDHPPPG 502

 Score = 258 (95.9 bits), Expect = 9.3e-20, P = 9.3e-20
 Identities = 48/113 (42%), Positives = 66/113 (58%)

Query:    44 YPARPGEPDCLFYRRTGLCGYGSNCRFNHPAYAAQGA--QYRE-------ELPERNGQPD 94
             YP R GE DC  Y +T  C +G +CRF+HP +  +G    ++E       E PER G+PD
Sbjct:   171 YPQRAGEKDCTHYMQTRTCKFGESCRFDHPIWVPEGGIPDWKEAPVVPNEEYPERPGEPD 230

Query:    95 CGYYLKTGTCKYGSTCKYHHPKDRNGAGPVSFNILGLPMRQDEKSCPYYMRTG 147
             C YY+KT  CKYGS CK++HP++       + +   LP R  E  C +YM+TG
Sbjct:   231 CPYYIKTQRCKYGSKCKFNHPREEAAVSVETQD--SLPERPSEPMCTFYMKTG 281

 Score = 216 (81.1 bits), Expect = 7.3e-15, P = 7.3e-15
 Identities = 37/79 (46%), Positives = 50/79 (63%)

Query:   242 SQNLPERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQSAASNIGPLGLPSRPGQAICS 301
             ++  PERP +PDC YY+ T  CKYG+ CKF+HP+E  A S  +      LP RP + +C+
Sbjct:   219 NEEYPERPGEPDCPYYIKTQRCKYGSKCKFNHPREEAAVSVETQDS---LPERPSEPMCT 275

Query:   302 NYSMYGICKFGPTCRFDHP 320
              Y   G CKFG +C+F HP
Sbjct:   276 FYMKTGKCKFGLSCKFHHP 294

 Score = 209 (78.6 bits), Expect = 4.5e-14, P = 4.5e-14
 Identities = 41/90 (45%), Positives = 54/90 (60%)

Query:    37 GVAQASPYPARPGEPDCLFYRRTGLCGYGSNCRFNHPA--------YAAQGAQYREEL-- 86
             GV  A+ YP RPG+ +C +Y +TG C +G  C+F+HPA         A Q    +  L  
Sbjct:   411 GVISAT-YPQRPGQSECDYYMKTGECKFGERCKFHHPADRLSAMTKQAPQQPNVKLSLAG 469

Query:    87 -PERNGQPDCGYYLKTGTCKYGSTCKYHHP 115
              P R G  +C YY+KTGTCKYG+TCK+ HP
Sbjct:   470 YPRREGALNCPYYMKTGTCKYGATCKFDHP 499

 Score = 153 (58.9 bits), Expect = 8.3e-08, P = 8.3e-08
 Identities = 30/80 (37%), Positives = 42/80 (52%)

Query:   246 PERPDQPDCRYYMNTGTCKYGADCKFHHP----KERIAQSAASNIGPLG-LPSRPGQAIC 300
             P+R  + DC +YM T TCK+G  C+F HP    +  I     + + P    P RPG+  C
Sbjct:   172 PQRAGEKDCTHYMQTRTCKFGESCRFDHPIWVPEGGIPDWKEAPVVPNEEYPERPGEPDC 231

Query:   301 SNYSMYGICKFGPTCRFDHP 320
               Y     CK+G  C+F+HP
Sbjct:   232 PYYIKTQRCKYGSKCKFNHP 251


>TAIR|locus:2101170
            symbol:AT3G48440 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA;ISS] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=RCA] InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103
            SMART:SM00356 GO:GO:0005634 EMBL:CP002686 GenomeReviews:BA000014_GR
            GO:GO:0046872 GO:GO:0003677 GO:GO:0008270 EMBL:AL049659
            HOGENOM:HOG000237733 EMBL:BT033139 IPI:IPI00517303 PIR:T06698
            RefSeq:NP_190414.1 UniGene:At.50258 ProteinModelPortal:Q9STM4
            SMR:Q9STM4 PaxDb:Q9STM4 PRIDE:Q9STM4 EnsemblPlants:AT3G48440.1
            GeneID:824003 KEGG:ath:AT3G48440 TAIR:At3g48440 eggNOG:NOG288127
            InParanoid:Q9STM4 OMA:PEWNGYQ PhylomeDB:Q9STM4
            ProtClustDB:CLSN2719348 Genevestigator:Q9STM4 GermOnline:AT3G48440
            Uniprot:Q9STM4
        Length = 448

 Score = 393 (143.4 bits), Expect = 1.7e-36, P = 1.7e-36
 Identities = 98/279 (35%), Positives = 134/279 (48%)

Query:    52 DCLFYRRTGLCGYGSNCRFNHPAYAAQGAQYREELPERNGQPDCGYYLKTGT--CKY--- 106
             DC +Y RTG C YG  CRFNH      G       PE N     G  L+ G   C Y   
Sbjct:   163 DCKYYFRTGGCKYGETCRFNH-TIPKSGLA---SAPELNF---LGLPLRPGEVECPYYMR 215

Query:   107 GSTCKYHHPKDRNGAGPVSFNILGLPMRQDEKSCPYYMRTGSFLPSSGLQYAGSLPTWSL 166
               +CKY      N   P +      P  +        +  G+F P +  Q   S  +WS 
Sbjct:   216 NGSCKYGAECKFNHPDPTTIGGTDSPSFRGNNG----VSIGTFSPKATFQ--ASSTSWSS 269

Query:   167 QRAPYLSSRLQGTQSYMPLIVSPSQGIVPA-PGWNTYMGNI-----GPLSPTSIAGSNLI 220
              R       + GT  ++P+++S + G+    P WN Y  ++     G  SP++   + L+
Sbjct:   270 PR------HVNGTSPFIPVMLSQTHGVTSQNPEWNGYQASVYSSERGVFSPST---TYLM 320

Query:   221 YSSRNQGDLGAGAQMHILSASSQNLPERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQ 280
              +S  +  +      H + A  +  PERPDQP+C YYM TG CK+  +CK+HHPK R+ +
Sbjct:   321 NNSSAETSMLLSQYRHQMPA--EEFPERPDQPECSYYMKTGDCKFKFNCKYHHPKNRLPK 378

Query:   281 SAASNIGPLGLPSRPGQAICSNYSMYGICKFGPTCRFDH 319
                  +   GLP RP Q IC+ YS YGICKFGP CRFDH
Sbjct:   379 LPPYALNDKGLPLRPDQNICTYYSRYGICKFGPACRFDH 417

 Score = 322 (118.4 bits), Expect = 5.6e-29, P = 5.6e-29
 Identities = 98/329 (29%), Positives = 148/329 (44%)

Query:     4 NRQVKSNAVANQSADNIEEAIWRLKIHDNQEGGGVAQA-SPYPARPGEPDCLFYRRTGLC 62
             N  + SNAV   + +  EE   R   +   +G    ++ + YP RPG  DC FY RTG C
Sbjct:    67 NGGLDSNAVVTINQEEEEEEEDR-DGYGYGDGWSENESENVYPVRPGAEDCSFYMRTGSC 125

Query:    63 GYGSNCRFNHPA----YAAQGAQYREELPE--RNGQPDCGYYLKTGTCKYGSTCKYHHPK 116
              +GS+C+FNHP       A+  + RE+  +  + G  DC YY +TG CKYG TC+++H  
Sbjct:   126 KFGSSCKFNHPLARKFQIARDNKVREKEDDGGKLGLIDCKYYFRTGGCKYGETCRFNHTI 185

Query:   117 DRNG-AGPVSFNILGLPMRQDEKSCPYYMRTGSFLPSSGLQYAGSLPTWSLQRAPYLSSR 175
              ++G A     N LGLP+R  E  CPYYMR GS    +  ++    PT ++      S  
Sbjct:   186 PKSGLASAPELNFLGLPLRPGEVECPYYMRNGSCKYGAECKFNHPDPT-TIGGTD--SPS 242

Query:   176 LQGTQSYMPLIVSPSQGI-VPAPGWNT--YMGNIGPLSPTSIAGSNLIYSSRNQGDLGAG 232
              +G         SP       +  W++  ++    P  P  ++ ++ + S   + +   G
Sbjct:   243 FRGNNGVSIGTFSPKATFQASSTSWSSPRHVNGTSPFIPVMLSQTHGVTSQNPEWN---G 299

Query:   233 AQMHILSASSQNLPERPDQPDCRYYMNTGTCKYGADC-KFHHPKERIAQSAASNIGPLGL 291
              Q  + S S + +      P   Y MN  + +      ++ H      Q  A        
Sbjct:   300 YQASVYS-SERGV----FSPSTTYLMNNSSAETSMLLSQYRH------QMPAEEF----- 343

Query:   292 PSRPGQAICSNYSMYGICKFGPTCRFDHP 320
             P RP Q  CS Y   G CKF   C++ HP
Sbjct:   344 PERPDQPECSYYMKTGDCKFKFNCKYHHP 372


>TAIR|locus:1006230718
            symbol:AT1G48195 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA;ISS] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0008270 "zinc ion binding" evidence=IEA]
            InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103 SMART:SM00356
            EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0046872 GO:GO:0003677
            GO:GO:0008270 EMBL:AC023673 EMBL:BX818039 IPI:IPI00522286
            RefSeq:NP_973988.1 UniGene:At.38465 UniGene:At.63148
            ProteinModelPortal:Q3ECU8 SMR:Q3ECU8 EnsemblPlants:AT1G48195.1
            GeneID:2745816 KEGG:ath:AT1G48195 TAIR:At1g48195 eggNOG:NOG304278
            HOGENOM:HOG000107451 InParanoid:Q3ECU8 OMA:AICPHYS PhylomeDB:Q3ECU8
            ProtClustDB:CLSN2681286 Genevestigator:Q3ECU8 Uniprot:Q3ECU8
        Length = 82

 Score = 251 (93.4 bits), Expect = 3.5e-21, P = 3.5e-21
 Identities = 40/79 (50%), Positives = 51/79 (64%)

Query:   241 SSQNLPERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQSAASNIGPLGLPSRPGQAIC 300
             S +  PERP +P+C YY+ TG C    +CK+HHPK          +   GLP RPGQAIC
Sbjct:     2 SEEKFPERPGEPECSYYLRTGNCYLKQNCKYHHPKNITPSEPQCTLNDKGLPLRPGQAIC 61

Query:   301 SNYSMYGICKFGPTCRFDH 319
              +YS +GIC+ GPTC+FDH
Sbjct:    62 PHYSRFGICRSGPTCKFDH 80

 Score = 179 (68.1 bits), Expect = 3.8e-13, P = 3.8e-13
 Identities = 31/65 (47%), Positives = 41/65 (63%)

Query:    84 EELPERNGQPDCGYYLKTGTCKYGSTCKYHHPKDRNGAGP-VSFNILGLPMRQDEKSCPY 142
             E+ PER G+P+C YYL+TG C     CKYHHPK+   + P  + N  GLP+R  +  CP+
Sbjct:     4 EKFPERPGEPECSYYLRTGNCYLKQNCKYHHPKNITPSEPQCTLNDKGLPLRPGQAICPH 63

Query:   143 YMRTG 147
             Y R G
Sbjct:    64 YSRFG 68

 Score = 164 (62.8 bits), Expect = 1.7e-11, P = 1.7e-11
 Identities = 31/75 (41%), Positives = 42/75 (56%)

Query:    44 YPARPGEPDCLFYRRTGLCGYGSNCRFNHPAYAA----QGAQYREELPERNGQPDCGYYL 99
             +P RPGEP+C +Y RTG C    NC+++HP        Q     + LP R GQ  C +Y 
Sbjct:     6 FPERPGEPECSYYLRTGNCYLKQNCKYHHPKNITPSEPQCTLNDKGLPLRPGQAICPHYS 65

Query:   100 KTGTCKYGSTCKYHH 114
             + G C+ G TCK+ H
Sbjct:    66 RFGICRSGPTCKFDH 80


>ZFIN|ZDB-GENE-990806-20
            symbol:cth1 "cth1" species:7955 "Danio rerio"
            [GO:0008270 "zinc ion binding" evidence=IEA] [GO:0003676 "nucleic
            acid binding" evidence=IEA] InterPro:IPR000571 Pfam:PF00642
            PROSITE:PS50103 SMART:SM00356 ZFIN:ZDB-GENE-990806-20 GO:GO:0008270
            GO:GO:0003676 HSSP:P22893 GeneTree:ENSGT00530000063262
            EMBL:AL954709 EMBL:BC107984 EMBL:AJ249490 IPI:IPI00509714
            RefSeq:NP_571014.1 UniGene:Dr.621 SMR:Q9PU62 STRING:Q9PU62
            Ensembl:ENSDART00000101601 GeneID:30114 KEGG:dre:30114 CTD:30114
            HOGENOM:HOG000153347 HOVERGEN:HBG078993 InParanoid:Q9PU62 KO:K13056
            OMA:FTFSSQH NextBio:20806593 Uniprot:Q9PU62
        Length = 319

 Score = 116 (45.9 bits), Expect = 1.2e-08, Sum P(2) = 1.2e-08
 Identities = 37/136 (27%), Positives = 48/136 (35%)

Query:   235 MHILSASSQNLPERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQSAASNIGPLGLPS- 293
             +H L       P R + P CR +   G C +G  C F H +       A        PS 
Sbjct:   121 VHNLKEQRPIRPRRRNVP-CRTFRAFGVCPFGNRCHFLHVEGGSESDGAEEEQTWQPPSQ 179

Query:   294 ----RPGQAICSNYSMYGICKFGPTCRFDHPYAGYPINYGXXXXXXXXXXXXXMNHQAIS 349
                 +P  A+C  +S +G C +G  CRF H   G P                  N  +IS
Sbjct:   180 SQEWKPRGALCRTFSAFGFCLYGTRCRFQH---GLPNTIKGHNANHTSWPQQMTNGGSIS 236

Query:   350 ATHSIETSPDASSKIP 365
                   TSP   S  P
Sbjct:   237 PISDTCTSPSPPSSSP 252

 Score = 84 (34.6 bits), Expect = 1.2e-08, Sum P(2) = 1.2e-08
 Identities = 23/81 (28%), Positives = 35/81 (43%)

Query:    53 CLFYRRTGLCGYGSNCRFNHPAY----AAQGAQYREELPERNGQPDCGYYLKTGTCKYGS 108
             C  Y  TG C Y   C+F H  +     ++  +Y+ EL        C  Y   G C YG+
Sbjct:    65 CSRYAETGTCKYAERCQFAHGLHDLHVPSRHPKYKTEL--------CRTYHTAGYCVYGT 116

Query:   109 TCKY-HHPKDRNGAGPVSFNI 128
              C + H+ K++    P   N+
Sbjct:   117 RCLFVHNLKEQRPIRPRRRNV 137


>TAIR|locus:2013763
            symbol:AT1G29570 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=RCA] [GO:0048445 "carpel morphogenesis" evidence=RCA]
            InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103 SMART:SM00356
            EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0046872 GO:GO:0003677
            GO:GO:0008270 EMBL:AC068667 IPI:IPI00519526 PIR:G86418
            RefSeq:NP_174250.1 UniGene:At.51822 EnsemblPlants:AT1G29570.1
            GeneID:839834 KEGG:ath:AT1G29570 TAIR:At1g29570 eggNOG:NOG325481
            HOGENOM:HOG000107458 OMA:HIMDRNV PhylomeDB:Q9C7P4
            ProtClustDB:CLSN2914472 Genevestigator:Q9C7P4 Uniprot:Q9C7P4
        Length = 321

 Score = 148 (57.2 bits), Expect = 1.1e-07, P = 1.1e-07
 Identities = 27/53 (50%), Positives = 34/53 (64%)

Query:    40 QASPYPARPGEPDCLFYRRTGLCGYGSNCRFNHPAYAAQGAQYR--EELPERN 90
             Q+SPYP RPG+ DC FY + GLC Y S+CRFNHP    Q    R  + + +RN
Sbjct:    48 QSSPYPVRPGKKDCQFYLKNGLCRYRSSCRFNHPTQRPQELPVRICKHIMDRN 100


>UNIPROTKB|D4A905
            symbol:Cpsf4 "Cleavage and polyadenylation-specificity
            factor subunit 4" species:10116 "Rattus norvegicus" [GO:0003676
            "nucleic acid binding" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098
            Pfam:PF00642 PROSITE:PS50103 PROSITE:PS50158 SMART:SM00343
            SMART:SM00356 RGD:620440 GO:GO:0046872 GO:GO:0008270 GO:GO:0003676
            Gene3D:4.10.60.10 SUPFAM:SSF57756 GeneTree:ENSGT00390000009627
            OMA:PLDQVTC OrthoDB:EOG4KH2VQ IPI:IPI00358639
            Ensembl:ENSRNOT00000038958 Uniprot:D4A905
        Length = 243

 Score = 109 (43.4 bits), Expect = 4.1e-07, Sum P(2) = 4.1e-07
 Identities = 27/82 (32%), Positives = 37/82 (45%)

Query:   252 PDCRYYMNTGTCKYGADCKFHH--PKERIAQS-----AASNIGPLGLPSRPGQAICSNYS 304
             P+C +Y   G C    +C F H  P+ +I             GPL    R  + IC NY 
Sbjct:    94 PECYFYSKFGECS-NKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRRTRRVICVNY- 151

Query:   305 MYGICKFGPTCRFDHPYAGYPI 326
             + G C  GP+C+F HP    P+
Sbjct:   152 LVGFCPEGPSCKFMHPRFELPM 173

 Score = 72 (30.4 bits), Expect = 4.1e-07, Sum P(2) = 4.1e-07
 Identities = 29/101 (28%), Positives = 44/101 (43%)

Query:    19 NIEEAIWRLKIHDNQEGGGVAQASPYPA--RPGEPDCLFYRRTGLCGYGSNCRFNHPAYA 76
             +++   + L I   Q+ G  AQ  P+P   + G   C F+ +   CG GS C F H    
Sbjct:     7 SVDHIKFDLAIAVEQQLG--AQPLPFPGMDKSGTAVCEFFLKAA-CGKGSMCPFRH---- 59

Query:    77 AQGAQYREELPERNGQPDCGYYLKTGTCKYGSTCKYHHPKD 117
               G        E+     C ++L+ G CK G  C++ H  D
Sbjct:    60 ISG--------EKTVV--CKHWLR-GLCKKGDQCEFLHEYD 89


>UNIPROTKB|C9K0K2
            symbol:CPSF4 "Cleavage and polyadenylation-specificity
            factor subunit 4" species:9606 "Homo sapiens" [GO:0003676 "nucleic
            acid binding" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103
            SMART:SM00356 GO:GO:0008270 GO:GO:0003676 HOGENOM:HOG000212457
            HGNC:HGNC:2327 EMBL:AC073063 IPI:IPI01014332
            ProteinModelPortal:C9K0K2 SMR:C9K0K2 STRING:C9K0K2
            Ensembl:ENST00000412686 ArrayExpress:C9K0K2 Bgee:C9K0K2
            Uniprot:C9K0K2
        Length = 112

 Score = 91 (37.1 bits), Expect = 7.2e-07, Sum P(2) = 7.2e-07
 Identities = 23/73 (31%), Positives = 32/73 (43%)

Query:   252 PDCRYYMNTGTCKYGADCKFHH--PKERIAQS-----AASNIGPLGLPSRPGQAICSNYS 304
             P+C +Y   G C    +C F H  P+ +I             GPL       + IC NY 
Sbjct:    41 PECYFYSKFGECS-NKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRHTRRVICVNY- 98

Query:   305 MYGICKFGPTCRF 317
             + G C  GP+C+F
Sbjct:    99 LVGFCPEGPSCKF 111

 Score = 51 (23.0 bits), Expect = 7.2e-07, Sum P(2) = 7.2e-07
 Identities = 9/23 (39%), Positives = 14/23 (60%)

Query:    95 CGYYLKTGTCKYGSTCKYHHPKD 117
             C ++L+ G CK G  C++ H  D
Sbjct:    15 CKHWLR-GLCKKGDQCEFLHEYD 36


>UNIPROTKB|F1REX3
            symbol:LOC100518830 "Uncharacterized protein" species:9823
            "Sus scrofa" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] InterPro:IPR000571
            InterPro:IPR001878 Pfam:PF00098 Pfam:PF00642 PROSITE:PS50103
            PROSITE:PS50158 SMART:SM00343 SMART:SM00356 GO:GO:0046872
            GO:GO:0008270 GO:GO:0003676 Gene3D:4.10.60.10 SUPFAM:SSF57756
            GeneTree:ENSGT00390000009627 KO:K14404 EMBL:FP102617
            RefSeq:XP_003124350.1 Ensembl:ENSSSCT00000008355 GeneID:100518830
            KEGG:ssc:100518830 OMA:MQDIVAS Uniprot:F1REX3
        Length = 269

 Score = 105 (42.0 bits), Expect = 9.1e-07, Sum P(3) = 9.1e-07
 Identities = 26/82 (31%), Positives = 37/82 (45%)

Query:   252 PDCRYYMNTGTCKYGADCKFHH--PKERIAQ-----SAASNIGPLGLPSRPGQAICSNYS 304
             P+C +Y   G C    +C F H  P+ +I       +     GPL       + IC NY 
Sbjct:    94 PECYFYSKFGECS-NKECPFLHIDPESKIKDCPWYDARFCKHGPLCRHRHTRRVICVNY- 151

Query:   305 MYGICKFGPTCRFDHPYAGYPI 326
             + G C  GP+C+F HP    P+
Sbjct:   152 LVGFCPEGPSCKFMHPRFELPM 173

 Score = 70 (29.7 bits), Expect = 9.1e-07, Sum P(3) = 9.1e-07
 Identities = 28/101 (27%), Positives = 44/101 (43%)

Query:    19 NIEEAIWRLKIHDNQEGGGVAQASPYPA--RPGEPDCLFYRRTGLCGYGSNCRFNHPAYA 76
             +++   + L+I   Q+ G  AQ  P+P   + G   C F+ +   CG G  C F H    
Sbjct:     7 SVDHIKFDLEIAVEQQLG--AQPLPFPGMDKSGAAVCEFFLKAA-CGKGGMCPFRH---- 59

Query:    77 AQGAQYREELPERNGQPDCGYYLKTGTCKYGSTCKYHHPKD 117
               G        E+     C ++L+ G CK G  C++ H  D
Sbjct:    60 ISG--------EKTVV--CKHWLR-GLCKKGDQCEFLHEYD 89

 Score = 39 (18.8 bits), Expect = 9.1e-07, Sum P(3) = 9.1e-07
 Identities = 8/19 (42%), Positives = 12/19 (63%)

Query:   398 DHPPHSVPNCSEPPHDQSN 416
             + PP  +P  ++PP  QSN
Sbjct:   177 EQPP--LPQQTQPPAKQSN 193


>UNIPROTKB|O95639
            symbol:CPSF4 "Cleavage and polyadenylation specificity
            factor subunit 4" species:9606 "Homo sapiens" [GO:0008270 "zinc ion
            binding" evidence=IEA] [GO:0003723 "RNA binding" evidence=IEA]
            [GO:0006397 "mRNA processing" evidence=IEA] [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=IDA] [GO:0005654 "nucleoplasm" evidence=TAS] [GO:0019048
            "virus-host interaction" evidence=TAS] [GO:0019054 "modulation by
            virus of host cellular process" evidence=TAS] [GO:0019058 "viral
            infectious cycle" evidence=TAS] [GO:0046778 "modification by virus
            of host mRNA processing" evidence=TAS] [GO:0005634 "nucleus"
            evidence=IDA] [GO:0005730 "nucleolus" evidence=IDA] [GO:0005739
            "mitochondrion" evidence=IDA] InterPro:IPR000571 InterPro:IPR001878
            Pfam:PF00098 Pfam:PF00642 PROSITE:PS50103 PROSITE:PS50158
            SMART:SM00343 SMART:SM00356 GO:GO:0005739 Reactome:REACT_116125
            GO:GO:0046872 GO:GO:0008270 GO:GO:0006397 GO:GO:0003723
            EMBL:CH236956 EMBL:CH471091 GO:GO:0019058 Gene3D:4.10.60.10
            SUPFAM:SSF57756 eggNOG:COG5084 GO:GO:0005847 HOGENOM:HOG000212457
            HOVERGEN:HBG051108 CTD:10898 KO:K14404 OMA:PLDQVTC
            OrthoDB:EOG4KH2VQ EMBL:U79569 EMBL:CR542161 EMBL:EF191081
            EMBL:BC003101 EMBL:BC050738 IPI:IPI00009137 IPI:IPI00029707
            IPI:IPI00375469 RefSeq:NP_001075028.1 RefSeq:NP_006684.1
            UniGene:Hs.489287 PDB:2D9N PDB:2RHK PDBsum:2D9N PDBsum:2RHK
            ProteinModelPortal:O95639 SMR:O95639 DIP:DIP-48675N IntAct:O95639
            MINT:MINT-1429837 STRING:O95639 PhosphoSite:O95639 PaxDb:O95639
            PRIDE:O95639 DNASU:10898 Ensembl:ENST00000292476
            Ensembl:ENST00000436336 GeneID:10898 KEGG:hsa:10898 UCSC:uc003uqi.3
            UCSC:uc003uqj.3 UCSC:uc003uqk.3 GeneCards:GC07P099036
            HGNC:HGNC:2327 HPA:HPA049094 MIM:603052 neXtProt:NX_O95639
            PharmGKB:PA26844 InParanoid:O95639 PhylomeDB:O95639
            EvolutionaryTrace:O95639 GenomeRNAi:10898 NextBio:41385
            ArrayExpress:O95639 Bgee:O95639 CleanEx:HS_CPSF4
            Genevestigator:O95639 GermOnline:ENSG00000160917 GO:GO:0046778
            Uniprot:O95639
        Length = 269

 Score = 104 (41.7 bits), Expect = 1.2e-06, Sum P(3) = 1.2e-06
 Identities = 26/82 (31%), Positives = 36/82 (43%)

Query:   252 PDCRYYMNTGTCKYGADCKFHH--PKERIAQS-----AASNIGPLGLPSRPGQAICSNYS 304
             P+C +Y   G C    +C F H  P+ +I             GPL       + IC NY 
Sbjct:    94 PECYFYSKFGECS-NKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRHTRRVICVNY- 151

Query:   305 MYGICKFGPTCRFDHPYAGYPI 326
             + G C  GP+C+F HP    P+
Sbjct:   152 LVGFCPEGPSCKFMHPRFELPM 173

 Score = 70 (29.7 bits), Expect = 1.2e-06, Sum P(3) = 1.2e-06
 Identities = 28/101 (27%), Positives = 44/101 (43%)

Query:    19 NIEEAIWRLKIHDNQEGGGVAQASPYPA--RPGEPDCLFYRRTGLCGYGSNCRFNHPAYA 76
             +++   + L+I   Q+ G  AQ  P+P   + G   C F+ +   CG G  C F H    
Sbjct:     7 SVDHIKFDLEIAVEQQLG--AQPLPFPGMDKSGAAVCEFFLKAA-CGKGGMCPFRH---- 59

Query:    77 AQGAQYREELPERNGQPDCGYYLKTGTCKYGSTCKYHHPKD 117
               G        E+     C ++L+ G CK G  C++ H  D
Sbjct:    60 ISG--------EKTVV--CKHWLR-GLCKKGDQCEFLHEYD 89

 Score = 39 (18.8 bits), Expect = 1.2e-06, Sum P(3) = 1.2e-06
 Identities = 8/19 (42%), Positives = 12/19 (63%)

Query:   398 DHPPHSVPNCSEPPHDQSN 416
             + PP  +P  ++PP  QSN
Sbjct:   177 EQPP--LPQQTQPPAKQSN 193


>UNIPROTKB|E1BV31
            symbol:CPSF4 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0008270 "zinc ion binding" evidence=IEA] [GO:0005739
            "mitochondrion" evidence=IEA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA]
            InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098 Pfam:PF00642
            PROSITE:PS50103 PROSITE:PS50158 SMART:SM00343 SMART:SM00356
            GO:GO:0046872 GO:GO:0008270 GO:GO:0003676 Gene3D:4.10.60.10
            SUPFAM:SSF57756 GO:GO:0005847 CTD:10898
            GeneTree:ENSGT00390000009627 KO:K14404 OMA:PLDQVTC
            EMBL:AADN02023770 IPI:IPI00572429 RefSeq:XP_414800.1
            UniGene:Gga.12217 Ensembl:ENSGALT00000007510 GeneID:416494
            KEGG:gga:416494 NextBio:20819939 Uniprot:E1BV31
        Length = 243

 Score = 108 (43.1 bits), Expect = 1.4e-06, Sum P(2) = 1.4e-06
 Identities = 27/82 (32%), Positives = 36/82 (43%)

Query:   252 PDCRYYMNTGTCKYGADCKFHH--PKERIAQS-----AASNIGPLGLPSRPGQAICSNYS 304
             P+C +Y   G C    +C F H  P+ +I             GPL       + IC NY 
Sbjct:    94 PECYFYSKFGECS-NKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRHTRRVICVNY- 151

Query:   305 MYGICKFGPTCRFDHPYAGYPI 326
             + G C  GPTC+F HP    P+
Sbjct:   152 LVGFCPEGPTCKFMHPRFELPM 173

 Score = 68 (29.0 bits), Expect = 1.4e-06, Sum P(2) = 1.4e-06
 Identities = 27/101 (26%), Positives = 44/101 (43%)

Query:    19 NIEEAIWRLKIHDNQEGGGVAQASPYPA--RPGEPDCLFYRRTGLCGYGSNCRFNHPAYA 76
             +++   + L++   Q+ G  AQ  P+P   + G   C F+ +   CG G  C F H    
Sbjct:     7 SVDHIKFDLELAVEQQLG--AQPLPFPGMDKSGAAVCEFFLKAA-CGKGGMCPFRH---- 59

Query:    77 AQGAQYREELPERNGQPDCGYYLKTGTCKYGSTCKYHHPKD 117
               G        E+     C ++L+ G CK G  C++ H  D
Sbjct:    60 ISG--------EKTVV--CKHWLR-GLCKKGDQCEFLHEYD 89


>TAIR|locus:2013758
            symbol:AT1G29560 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR000571 PROSITE:PS50103 SMART:SM00356
            EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0008270 GO:GO:0003676
            EMBL:AC068667 eggNOG:KOG1677 IPI:IPI00527997 RefSeq:NP_174249.2
            UniGene:At.73942 ProteinModelPortal:B3H4U9 PRIDE:B3H4U9
            EnsemblPlants:AT1G29560.1 GeneID:839833 KEGG:ath:AT1G29560
            TAIR:At1g29560 HOGENOM:HOG000064587 OMA:WRDSESR PhylomeDB:B3H4U9
            ProtClustDB:CLSN2682005 Genevestigator:B3H4U9 Uniprot:B3H4U9
        Length = 572

 Score = 142 (55.0 bits), Expect = 1.6e-06, P = 1.6e-06
 Identities = 35/84 (41%), Positives = 46/84 (54%)

Query:    45 PAR-PGEPDCLFYRRTGLCGYGSNCRFNHPAYAAQGAQYREELPERNGQPDCGYYLKTGT 103
             P R PGE +C   R    C  G +CR+NHP       Q  +ELP RN    C Y+L+ G 
Sbjct:   215 PVRWPGE-ECWCLR----CRNGGSCRYNHPT------QLPQELPVRNRLQICRYFLR-GY 262

Query:   104 CKYGSTCKYHHPKDRNGAGPVSFN 127
             CK+GS C + H +DR+ A P+  N
Sbjct:   263 CKFGSVCGFQHIRDRDVAEPMYEN 286


>UNIPROTKB|O19137
            symbol:CPSF4 "Cleavage and polyadenylation specificity
            factor subunit 4" species:9913 "Bos taurus" [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=ISS] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0006397
            "mRNA processing" evidence=IEA] [GO:0003723 "RNA binding"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098 Pfam:PF00642
            PROSITE:PS50103 PROSITE:PS50158 SMART:SM00343 SMART:SM00356
            GO:GO:0046872 GO:GO:0008270 GO:GO:0006397 GO:GO:0003723
            Gene3D:4.10.60.10 SUPFAM:SSF57756 eggNOG:COG5084 HSSP:P47974
            GO:GO:0005847 HOGENOM:HOG000212457 HOVERGEN:HBG051108 EMBL:U96448
            IPI:IPI00715166 RefSeq:NP_776367.1 UniGene:Bt.55595
            ProteinModelPortal:O19137 SMR:O19137 STRING:O19137
            Ensembl:ENSBTAT00000002701 GeneID:280875 KEGG:bta:280875 CTD:10898
            GeneTree:ENSGT00390000009627 InParanoid:O19137 KO:K14404
            OMA:PLDQVTC OrthoDB:EOG4KH2VQ NextBio:20805014 Uniprot:O19137
        Length = 243

 Score = 104 (41.7 bits), Expect = 2.6e-06, Sum P(2) = 2.6e-06
 Identities = 26/82 (31%), Positives = 36/82 (43%)

Query:   252 PDCRYYMNTGTCKYGADCKFHH--PKERIAQS-----AASNIGPLGLPSRPGQAICSNYS 304
             P+C +Y   G C    +C F H  P+ +I             GPL       + IC NY 
Sbjct:    94 PECYFYSKFGECS-NKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRHTRRVICVNY- 151

Query:   305 MYGICKFGPTCRFDHPYAGYPI 326
             + G C  GP+C+F HP    P+
Sbjct:   152 LVGFCPEGPSCKFMHPRFELPM 173

 Score = 70 (29.7 bits), Expect = 2.6e-06, Sum P(2) = 2.6e-06
 Identities = 28/101 (27%), Positives = 44/101 (43%)

Query:    19 NIEEAIWRLKIHDNQEGGGVAQASPYPA--RPGEPDCLFYRRTGLCGYGSNCRFNHPAYA 76
             +++   + L+I   Q+ G  AQ  P+P   + G   C F+ +   CG G  C F H    
Sbjct:     7 SVDHIKFDLEIAVEQQLG--AQPLPFPGMDKSGAAVCEFFLKAA-CGKGGMCPFRH---- 59

Query:    77 AQGAQYREELPERNGQPDCGYYLKTGTCKYGSTCKYHHPKD 117
               G        E+     C ++L+ G CK G  C++ H  D
Sbjct:    60 ISG--------EKTVV--CKHWLR-GLCKKGDQCEFLHEYD 89


>RGD|620440
            symbol:Cpsf4 "cleavage and polyadenylation specific factor 4"
            species:10116 "Rattus norvegicus" [GO:0003723 "RNA binding"
            evidence=IEA] [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=ISS] [GO:0006397 "mRNA
            processing" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098
            Pfam:PF00642 PROSITE:PS50103 PROSITE:PS50158 SMART:SM00343
            SMART:SM00356 RGD:620440 GO:GO:0046872 GO:GO:0008270 GO:GO:0006397
            GO:GO:0003723 Gene3D:4.10.60.10 SUPFAM:SSF57756 eggNOG:COG5084
            HSSP:P47974 GO:GO:0005847 HOGENOM:HOG000212457 HOVERGEN:HBG051108
            CTD:10898 GeneTree:ENSGT00390000009627 KO:K14404 OrthoDB:EOG4KH2VQ
            EMBL:BC089824 IPI:IPI00553898 RefSeq:NP_001012351.1
            UniGene:Rn.104788 ProteinModelPortal:Q5FVR7 SMR:Q5FVR7
            Ensembl:ENSRNOT00000042474 GeneID:304277 KEGG:rno:304277
            InParanoid:Q5FVR7 NextBio:652764 ArrayExpress:Q5FVR7
            Genevestigator:Q5FVR7 GermOnline:ENSRNOG00000025217 Uniprot:Q5FVR7
        Length = 243

 Score = 104 (41.7 bits), Expect = 2.6e-06, Sum P(2) = 2.6e-06
 Identities = 26/82 (31%), Positives = 36/82 (43%)

Query:   252 PDCRYYMNTGTCKYGADCKFHH--PKERIAQS-----AASNIGPLGLPSRPGQAICSNYS 304
             P+C +Y   G C    +C F H  P+ +I             GPL       + IC NY 
Sbjct:    94 PECYFYSKFGECS-NKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRHTRRVICVNY- 151

Query:   305 MYGICKFGPTCRFDHPYAGYPI 326
             + G C  GP+C+F HP    P+
Sbjct:   152 LVGFCPEGPSCKFMHPRFELPM 173

 Score = 70 (29.7 bits), Expect = 2.6e-06, Sum P(2) = 2.6e-06
 Identities = 28/101 (27%), Positives = 44/101 (43%)

Query:    19 NIEEAIWRLKIHDNQEGGGVAQASPYPA--RPGEPDCLFYRRTGLCGYGSNCRFNHPAYA 76
             +++   + L+I   Q+ G  AQ  P+P   + G   C F+ +   CG G  C F H    
Sbjct:     7 SVDHIKFDLEIAVEQQLG--AQPLPFPGMDKSGAAVCEFFLKAA-CGKGGMCPFRH---- 59

Query:    77 AQGAQYREELPERNGQPDCGYYLKTGTCKYGSTCKYHHPKD 117
               G        E+     C ++L+ G CK G  C++ H  D
Sbjct:    60 ISG--------EKTVV--CKHWLR-GLCKKGDQCEFLHEYD 89


>UNIPROTKB|J9P398
            symbol:CPSF4 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] InterPro:IPR000571
            InterPro:IPR001878 Pfam:PF00098 Pfam:PF00642 PROSITE:PS50103
            PROSITE:PS50158 SMART:SM00343 SMART:SM00356 GO:GO:0046872
            GO:GO:0008270 GO:GO:0003676 Gene3D:4.10.60.10 SUPFAM:SSF57756
            CTD:10898 GeneTree:ENSGT00390000009627 KO:K14404 OMA:PLDQVTC
            EMBL:AAEX03004276 RefSeq:XP_850149.1 ProteinModelPortal:J9P398
            Ensembl:ENSCAFT00000043832 GeneID:489859 KEGG:cfa:489859
            Uniprot:J9P398
        Length = 269

 Score = 104 (41.7 bits), Expect = 4.0e-06, Sum P(2) = 4.0e-06
 Identities = 26/82 (31%), Positives = 36/82 (43%)

Query:   252 PDCRYYMNTGTCKYGADCKFHH--PKERIAQS-----AASNIGPLGLPSRPGQAICSNYS 304
             P+C +Y   G C    +C F H  P+ +I             GPL       + IC NY 
Sbjct:    94 PECYFYSKFGECS-NKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRHTRRVICVNY- 151

Query:   305 MYGICKFGPTCRFDHPYAGYPI 326
             + G C  GP+C+F HP    P+
Sbjct:   152 LVGFCPEGPSCKFMHPRFELPM 173

 Score = 70 (29.7 bits), Expect = 4.0e-06, Sum P(2) = 4.0e-06
 Identities = 28/101 (27%), Positives = 44/101 (43%)

Query:    19 NIEEAIWRLKIHDNQEGGGVAQASPYPA--RPGEPDCLFYRRTGLCGYGSNCRFNHPAYA 76
             +++   + L+I   Q+ G  AQ  P+P   + G   C F+ +   CG G  C F H    
Sbjct:     7 SVDHIKFDLEIAVEQQLG--AQPLPFPGMDKSGAAVCEFFLKAA-CGKGGMCPFRH---- 59

Query:    77 AQGAQYREELPERNGQPDCGYYLKTGTCKYGSTCKYHHPKD 117
               G        E+     C ++L+ G CK G  C++ H  D
Sbjct:    60 ISG--------EKTVV--CKHWLR-GLCKKGDQCEFLHEYD 89


>UNIPROTKB|Q66KE3
            symbol:cpsf4 "Cleavage and polyadenylation specificity
            factor subunit 4" species:8364 "Xenopus (Silurana) tropicalis"
            [GO:0005847 "mRNA cleavage and polyadenylation specificity factor
            complex" evidence=ISS] InterPro:IPR000571 InterPro:IPR001878
            Pfam:PF00098 Pfam:PF00642 PROSITE:PS50103 PROSITE:PS50158
            SMART:SM00343 SMART:SM00356 GO:GO:0046872 GO:GO:0008270
            GO:GO:0006397 GO:GO:0003723 Gene3D:4.10.60.10 SUPFAM:SSF57756
            eggNOG:COG5084 GO:GO:0042462 GO:GO:0005847 HOVERGEN:HBG051108
            CTD:10898 KO:K14404 OrthoDB:EOG4KH2VQ EMBL:BC080440
            RefSeq:NP_001007933.1 UniGene:Str.3196 ProteinModelPortal:Q66KE3
            SMR:Q66KE3 STRING:Q66KE3 GeneID:493312 KEGG:xtr:493312
            Xenbase:XB-GENE-948302 InParanoid:Q66KE3 Bgee:Q66KE3 Uniprot:Q66KE3
        Length = 269

 Score = 101 (40.6 bits), Expect = 1.4e-05, Sum P(2) = 1.4e-05
 Identities = 26/82 (31%), Positives = 35/82 (42%)

Query:   252 PDCRYYMNTGTCKYGADCKFHH--PKERIAQS-----AASNIGPLGLPSRPGQAICSNYS 304
             P+C +Y   G C    +C F H  P+ +I             GPL       + IC NY 
Sbjct:    94 PECYFYSKFGECS-NKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRHTRRVICVNY- 151

Query:   305 MYGICKFGPTCRFDHPYAGYPI 326
             + G C  GP C+F HP    P+
Sbjct:   152 LVGFCIEGPNCKFMHPRFELPM 173

 Score = 68 (29.0 bits), Expect = 1.4e-05, Sum P(2) = 1.4e-05
 Identities = 24/81 (29%), Positives = 35/81 (43%)

Query:    39 AQASPYPA--RPGEPDCLFYRRTGLCGYGSNCRFNHPAYAAQGAQYREELPERNGQPDCG 96
             AQ  P+P   + G   C F+ ++  CG G  C F H      G        E+     C 
Sbjct:    25 AQPLPFPGMDKSGAAVCEFFLKSA-CGKGGMCPFRH----ISG--------EKTVV--CK 69

Query:    97 YYLKTGTCKYGSTCKYHHPKD 117
             ++L+ G CK G  C++ H  D
Sbjct:    70 HWLR-GLCKKGDQCEFLHEYD 89


>UNIPROTKB|Q6DJP7
            symbol:cpsf4 "Cleavage and polyadenylation specificity
            factor subunit 4" species:8355 "Xenopus laevis" [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=ISS] InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098
            Pfam:PF00642 PROSITE:PS50103 PROSITE:PS50158 SMART:SM00343
            SMART:SM00356 GO:GO:0046872 GO:GO:0008270 GO:GO:0006397
            GO:GO:0003723 Gene3D:4.10.60.10 SUPFAM:SSF57756 GO:GO:0005847
            HOVERGEN:HBG051108 CTD:10898 KO:K14404 EMBL:BC075128
            RefSeq:NP_001086337.1 UniGene:Xl.25683 ProteinModelPortal:Q6DJP7
            SMR:Q6DJP7 GeneID:444766 KEGG:xla:444766 Xenbase:XB-GENE-948308
            Uniprot:Q6DJP7
        Length = 269

 Score = 101 (40.6 bits), Expect = 1.4e-05, Sum P(2) = 1.4e-05
 Identities = 26/82 (31%), Positives = 35/82 (42%)

Query:   252 PDCRYYMNTGTCKYGADCKFHH--PKERIAQS-----AASNIGPLGLPSRPGQAICSNYS 304
             P+C +Y   G C    +C F H  P+ +I             GPL       + IC NY 
Sbjct:    94 PECYFYSKFGECS-NKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRHTRRVICVNY- 151

Query:   305 MYGICKFGPTCRFDHPYAGYPI 326
             + G C  GP C+F HP    P+
Sbjct:   152 LVGFCIEGPNCKFMHPRFELPM 173

 Score = 68 (29.0 bits), Expect = 1.4e-05, Sum P(2) = 1.4e-05
 Identities = 24/81 (29%), Positives = 35/81 (43%)

Query:    39 AQASPYPA--RPGEPDCLFYRRTGLCGYGSNCRFNHPAYAAQGAQYREELPERNGQPDCG 96
             AQ  P+P   + G   C F+ ++  CG G  C F H      G        E+     C 
Sbjct:    25 AQPLPFPGMDKSGAAVCEFFLKSA-CGKGGMCPFRH----ISG--------EKTVV--CK 69

Query:    97 YYLKTGTCKYGSTCKYHHPKD 117
             ++L+ G CK G  C++ H  D
Sbjct:    70 HWLR-GLCKKGDQCEFLHEYD 89


>UNIPROTKB|I3LCK9 [details] [associations]
            symbol:LOC100738395 "Uncharacterized protein" species:9823
            "Sus scrofa" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] InterPro:IPR000571
            InterPro:IPR001878 Pfam:PF00098 Pfam:PF00642 PROSITE:PS50103
            PROSITE:PS50158 SMART:SM00343 SMART:SM00356 GO:GO:0046872
            GO:GO:0008270 GO:GO:0003676 Gene3D:4.10.60.10 SUPFAM:SSF57756
            GeneTree:ENSGT00390000009627 OMA:PLDQVTC EMBL:FP103031
            Ensembl:ENSSSCT00000031676 Uniprot:I3LCK9
        Length = 243

 Score = 104 (41.7 bits), Expect = 2.0e-05, Sum P(3) = 2.0e-05
 Identities = 26/82 (31%), Positives = 36/82 (43%)

Query:   252 PDCRYYMNTGTCKYGADCKFHH--PKERIAQS-----AASNIGPLGLPSRPGQAICSNYS 304
             P+C +Y   G C    +C F H  P+ +I             GPL       + IC NY 
Sbjct:    68 PECYFYSKFGECS-NKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRHTRRVICVNY- 125

Query:   305 MYGICKFGPTCRFDHPYAGYPI 326
             + G C  GP+C+F HP    P+
Sbjct:   126 LVGFCPEGPSCKFMHPRFELPM 147

 Score = 55 (24.4 bits), Expect = 2.0e-05, Sum P(3) = 2.0e-05
 Identities = 20/71 (28%), Positives = 29/71 (40%)

Query:    47 RPGEPDCLFYRRTGLCGYGSNCRFNHPAYAAQGAQYREELPERNGQPDCGYYLKTGTCKY 106
             + G   C F+ +   CG G  C F H      G        E+     C ++L+ G CK 
Sbjct:     9 KSGAAVCEFFLKAA-CGKGGMCPFRH----ISG--------EKTVV--CKHWLR-GLCKK 52

Query:   107 GSTCKYHHPKD 117
             G  C++ H  D
Sbjct:    53 GDQCEFLHEYD 63

 Score = 52 (23.4 bits), Expect = 3.9e-05, Sum P(3) = 3.9e-05
 Identities = 9/29 (31%), Positives = 15/29 (51%)

Query:    86 LPERNGQPDCGYYLKTGTCKYGSTCKYHH 114
             + E++G   C ++LK   C  G  C + H
Sbjct:     6 MAEKSGAAVCEFFLKAA-CGKGGMCPFRH 33

 Score = 39 (18.8 bits), Expect = 2.0e-05, Sum P(3) = 2.0e-05
 Identities = 8/19 (42%), Positives = 12/19 (63%)

Query:   398 DHPPHSVPNCSEPPHDQSN 416
             + PP  +P  ++PP  QSN
Sbjct:   151 EQPP--LPQQTQPPAKQSN 167


>POMBASE|SPAC227.08c [details] [associations]
            symbol:yth1 "mRNA cleavage and polyadenylation
            specificity factor complex Yth1" species:4896 "Schizosaccharomyces
            pombe" [GO:0003723 "RNA binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=IDA] [GO:0005829 "cytosol" evidence=IDA]
            [GO:0005847 "mRNA cleavage and polyadenylation specificity factor
            complex" evidence=IDA] [GO:0006378 "mRNA polyadenylation"
            evidence=IC] [GO:0008270 "zinc ion binding" evidence=IEA]
            InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103 SMART:SM00356
            PomBase:SPAC227.08c GO:GO:0005829 EMBL:CU329670
            GenomeReviews:CU329670_GR GO:GO:0046872 GO:GO:0008270 GO:GO:0006378
            GO:GO:0003723 eggNOG:COG5084 GO:GO:0005847 HOGENOM:HOG000212457
            KO:K14404 OrthoDB:EOG4PG99D PIR:T50164 RefSeq:NP_592962.1
            ProteinModelPortal:Q9UTD1 SMR:Q9UTD1 STRING:Q9UTD1
            EnsemblFungi:SPAC227.08c.1 GeneID:2541506 KEGG:spo:SPAC227.08c
            NextBio:20802605 Uniprot:Q9UTD1
        Length = 170

 Score = 105 (42.0 bits), Expect = 2.2e-05, Sum P(2) = 2.2e-05
 Identities = 27/83 (32%), Positives = 37/83 (44%)

Query:   252 PDCRYYMNTGTCKYGADCKFHH--PKERIAQSAASNIG--PLGLPSRPGQAI-----CSN 302
             P C +Y   G C  G +C + H  P +++   A  N+G  PLG P   G+ +     C  
Sbjct:    80 PPCHFYAERGWCSNGEECLYLHLDPSKQVGVCAWYNMGFCPLG-PICRGKHVRKPRPCPK 138

Query:   303 YSMYGICKFGPTCRFDHPYAGYP 325
             Y + G C  GP C   HP    P
Sbjct:   139 Y-LAGFCPLGPNCPDAHPKHSEP 160

 Score = 49 (22.3 bits), Expect = 2.2e-05, Sum P(2) = 2.2e-05
 Identities = 11/36 (30%), Positives = 16/36 (44%)

Query:    91 GQPDCGYYLKTGTCKYGSTCKYHHPKDRNGAGPVSF 126
             G   C ++L+ G CK G  C + H  +     P  F
Sbjct:    50 GSVVCKHWLR-GLCKKGEQCDFLHEYNLKKMPPCHF 84


>UNIPROTKB|B7Z7B0 [details] [associations]
            symbol:CPSF4 "Cleavage and polyadenylation-specificity
            factor subunit 4" species:9606 "Homo sapiens" [GO:0003676 "nucleic
            acid binding" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098
            Pfam:PF00642 PROSITE:PS50103 PROSITE:PS50158 SMART:SM00343
            SMART:SM00356 GO:GO:0046872 GO:GO:0008270 GO:GO:0003676
            Gene3D:4.10.60.10 SUPFAM:SSF57756 HOGENOM:HOG000212457
            HOVERGEN:HBG051108 OrthoDB:EOG4KH2VQ UniGene:Hs.489287
            HGNC:HGNC:2327 EMBL:AC073063 EMBL:AK301745 IPI:IPI00924476
            SMR:B7Z7B0 STRING:B7Z7B0 Ensembl:ENST00000441580 UCSC:uc011kix.2
            Uniprot:B7Z7B0
        Length = 191

 Score = 104 (41.7 bits), Expect = 5.7e-05, Sum P(2) = 5.7e-05
 Identities = 26/82 (31%), Positives = 36/82 (43%)

Query:   252 PDCRYYMNTGTCKYGADCKFHH--PKERIAQS-----AASNIGPLGLPSRPGQAICSNYS 304
             P+C +Y   G C    +C F H  P+ +I             GPL       + IC NY 
Sbjct:    41 PECYFYSKFGECS-NKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRHTRRVICVNY- 98

Query:   305 MYGICKFGPTCRFDHPYAGYPI 326
             + G C  GP+C+F HP    P+
Sbjct:    99 LVGFCPEGPSCKFMHPRFELPM 120

 Score = 51 (23.0 bits), Expect = 5.7e-05, Sum P(2) = 5.7e-05
 Identities = 9/23 (39%), Positives = 14/23 (60%)

Query:    95 CGYYLKTGTCKYGSTCKYHHPKD 117
             C ++L+ G CK G  C++ H  D
Sbjct:    15 CKHWLR-GLCKKGDQCEFLHEYD 36


>WB|WBGene00013319 [details] [associations]
            symbol:ccch-5 species:6239 "Caenorhabditis elegans"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0000003 "reproduction" evidence=IMP]
            [GO:0009792 "embryo development ending in birth or egg hatching"
            evidence=IMP] InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103
            SMART:SM00356 GO:GO:0009792 EMBL:Z99281 GO:GO:0008270 GO:GO:0000003
            GO:GO:0003676 eggNOG:COG5063 GeneTree:ENSGT00530000063262
            PIR:T27239 RefSeq:NP_502805.1 ProteinModelPortal:O18251 SMR:O18251
            STRING:O18251 EnsemblMetazoa:Y57G11C.25 GeneID:178412
            KEGG:cel:CELE_Y57G11C.25 UCSC:Y57G11C.25 CTD:178412
            WormBase:Y57G11C.25 HOGENOM:HOG000114059 InParanoid:O18251
            NextBio:901036 Uniprot:O18251
        Length = 199

 Score = 118 (46.6 bits), Expect = 6.6e-05, P = 6.6e-05
 Identities = 24/66 (36%), Positives = 33/66 (50%)

Query:   254 CRYYMNTGTCKYGADCKFHHPKERIAQSAASNIGPLGLPSRPGQAICSNYSMYGICKFGP 313
             C+ +  T  C YG  CKF H  E + Q    N+G +  P      +C N+S  G CK+G 
Sbjct:    74 CKTFQLTKACSYGEQCKFAHSVEEL-QLKHQNLG-INNPKYK-TVLCDNFSTTGHCKYGT 130

Query:   314 TCRFDH 319
              C+F H
Sbjct:   131 KCQFIH 136


>WB|WBGene00009537 [details] [associations]
            symbol:ccch-2 species:6239 "Caenorhabditis elegans"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0009792 "embryo development ending
            in birth or egg hatching" evidence=IMP] InterPro:IPR000571
            Pfam:PF00642 PROSITE:PS50103 SMART:SM00356 GO:GO:0009792
            GO:GO:0008270 GO:GO:0003676 eggNOG:COG5063
            GeneTree:ENSGT00530000063262 EMBL:Z82267 HOGENOM:HOG000114059
            PIR:T21961 RefSeq:NP_502931.1 ProteinModelPortal:O45491 SMR:O45491
            IntAct:O45491 STRING:O45491 EnsemblMetazoa:F38C2.5 GeneID:178454
            KEGG:cel:CELE_F38C2.5 UCSC:F38C2.5 CTD:178454 WormBase:F38C2.5
            InParanoid:O45491 NextBio:901202 Uniprot:O45491
        Length = 186

 Score = 113 (44.8 bits), Expect = 0.00019, P = 0.00019
 Identities = 24/66 (36%), Positives = 32/66 (48%)

Query:   254 CRYYMNTGTCKYGADCKFHHPKERIAQSAASNIGPLGLPSRPGQAICSNYSMYGICKFGP 313
             C+ +  T  C YG  CKF H  E + Q    N G +  P      +C N+S  G CK+G 
Sbjct:    78 CKTFQLTRACSYGEQCKFAHSVEEL-QLKQKNRG-VNHPKYK-TVLCDNFSRTGHCKYGT 134

Query:   314 TCRFDH 319
              C+F H
Sbjct:   135 KCQFIH 140


>WB|WBGene00013797 [details] [associations]
            symbol:Y116A8C.20 species:6239 "Caenorhabditis elegans"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] InterPro:IPR000571 Pfam:PF00642
            PROSITE:PS50103 SMART:SM00356 GO:GO:0008270 GO:GO:0003676
            eggNOG:COG5063 GeneTree:ENSGT00530000063262 EMBL:AL117204
            HOGENOM:HOG000114669 PIR:T31492 RefSeq:NP_503020.1
            ProteinModelPortal:Q9U2V1 SMR:Q9U2V1 STRING:Q9U2V1
            EnsemblMetazoa:Y116A8C.20 GeneID:178478 KEGG:cel:CELE_Y116A8C.20
            UCSC:Y116A8C.20 CTD:178478 WormBase:Y116A8C.20 InParanoid:Q9U2V1
            NextBio:901292 Uniprot:Q9U2V1
        Length = 201

 Score = 114 (45.2 bits), Expect = 0.00021, P = 0.00021
 Identities = 31/103 (30%), Positives = 49/103 (47%)

Query:    53 CLFYRRTGLCGYGSNCRFNHPAYAAQGAQYREELPERNGQPD-CGYYLKTGTCKYGSTCK 111
             CL ++R   C YG  C+F H  +  +  Q ++    RN +   C  +  TG CKYG  C+
Sbjct:    94 CLSHKRGKTCIYGEQCKFAHGVHELRCQQAKKN--HRNYKTVLCDKFTTTGYCKYGIRCQ 151

Query:   112 Y-HHPKDR-NGAGPVSFNILGLPMRQDEKSCPYYMRTGSFLPS 152
             + H   D  N   P+      L ++ D  S  + + + SFLP+
Sbjct:   152 FIHRSMDATNVTRPIDTADFKLDVQSD-LSRAFALDSSSFLPN 193


>UNIPROTKB|H9KVA5 [details] [associations]
            symbol:CPSF4L "Putative cleavage and
            polyadenylation-specificity factor subunit 4-like protein"
            species:9606 "Homo sapiens" [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103 SMART:SM00356
            GO:GO:0008270 GO:GO:0003676 EMBL:AC087301 HGNC:HGNC:33632
            ProteinModelPortal:H9KVA5 SMR:H9KVA5 PRIDE:H9KVA5
            Ensembl:ENST00000397671 Bgee:H9KVA5 Uniprot:H9KVA5
        Length = 152

 Score = 91 (37.1 bits), Expect = 0.00022, Sum P(2) = 0.00022
 Identities = 23/76 (30%), Positives = 31/76 (40%)

Query:   252 PDCRYYMNTGTCKYGADCKFHHPKERIA--------QSAASNIGPLGLPSRPGQAICSNY 303
             P+C +Y   G C    +C F H K            Q    + GPL       + +C NY
Sbjct:    30 PECYFYSKFGDCS-NKECSFLHVKPAFKSQDCPWYDQGFCKDAGPLCKYRHVPRIMCLNY 88

Query:   304 SMYGICKFGPTCRFDH 319
              + G C  GP C+F H
Sbjct:    89 -LVGFCPEGPKCQFAH 103

 Score = 54 (24.1 bits), Expect = 0.00022, Sum P(2) = 0.00022
 Identities = 10/23 (43%), Positives = 14/23 (60%)

Query:    95 CGYYLKTGTCKYGSTCKYHHPKD 117
             C ++L+ G CK G  CK+ H  D
Sbjct:     4 CKHWLR-GLCKKGDHCKFLHQYD 25


>UNIPROTKB|C9JEV9 [details] [associations]
            symbol:CPSF4 "Cleavage and polyadenylation-specificity
            factor subunit 4" species:9606 "Homo sapiens" [GO:0003676 "nucleic
            acid binding" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IDA] [GO:0005730
            "nucleolus" evidence=IDA] [GO:0005739 "mitochondrion" evidence=IDA]
            InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098 Pfam:PF00642
            PROSITE:PS50103 PROSITE:PS50158 SMART:SM00343 SMART:SM00356
            GO:GO:0005739 GO:GO:0005634 GO:GO:0046872 GO:GO:0008270
            GO:GO:0003676 Gene3D:4.10.60.10 SUPFAM:SSF57756
            HOGENOM:HOG000212457 HGNC:HGNC:2327 EMBL:AC073063 IPI:IPI00927478
            ProteinModelPortal:C9JEV9 SMR:C9JEV9 STRING:C9JEV9
            Ensembl:ENST00000451876 ArrayExpress:C9JEV9 Bgee:C9JEV9
            Uniprot:C9JEV9
        Length = 211

 Score = 113 (44.8 bits), Expect = 0.00033, P = 0.00033
 Identities = 25/76 (32%), Positives = 36/76 (47%)

Query:   254 CRYYMNTGTCKYGADCKFHHPKERIAQSAA---SNIGPLGLPSRPGQAICSNYSMYGICK 310
             C++++  G CK G  C+F H  +          S  GPL       + IC NY + G C 
Sbjct:    68 CKHWLR-GLCKKGDQCEFLHEYDMTKMPECYFYSKFGPLCRHRHTRRVICVNY-LVGFCP 125

Query:   311 FGPTCRFDHPYAGYPI 326
              GP+C+F HP    P+
Sbjct:   126 EGPSCKFMHPRFELPM 141


>MGI|MGI:1861602 [details] [associations]
            symbol:Cpsf4 "cleavage and polyadenylation specific factor
            4" species:10090 "Mus musculus" [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0003723 "RNA binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=ISO]
            [GO:0006397 "mRNA processing" evidence=IEA] [GO:0008150
            "biological_process" evidence=ND] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0046872 "metal ion binding" evidence=IEA]
            InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098 Pfam:PF00642
            PROSITE:PS50103 PROSITE:PS50158 SMART:SM00343 SMART:SM00356
            MGI:MGI:1861602 GO:GO:0046872 GO:GO:0008270 GO:GO:0006397
            GO:GO:0003723 Gene3D:4.10.60.10 SUPFAM:SSF57756 eggNOG:COG5084
            GO:GO:0005847 HOGENOM:HOG000212457 HOVERGEN:HBG051108 CTD:10898
            GeneTree:ENSGT00390000009627 KO:K14404 OrthoDB:EOG4KH2VQ
            EMBL:AK046064 EMBL:AF033201 EMBL:BC057067 IPI:IPI00309761
            IPI:IPI00380450 IPI:IPI01027761 RefSeq:NP_848671.1
            UniGene:Mm.196884 ProteinModelPortal:Q8BQZ5 SMR:Q8BQZ5
            STRING:Q8BQZ5 PhosphoSite:Q8BQZ5 PaxDb:Q8BQZ5 PRIDE:Q8BQZ5
            Ensembl:ENSMUST00000070487 GeneID:54188 KEGG:mmu:54188
            UCSC:uc009amj.1 ChiTaRS:CPSF4 NextBio:311022 Bgee:Q8BQZ5
            CleanEx:MM_CPSF4 Genevestigator:Q8BQZ5
            GermOnline:ENSMUSG00000029625 Uniprot:Q8BQZ5
        Length = 211

 Score = 113 (44.8 bits), Expect = 0.00033, P = 0.00033
 Identities = 25/76 (32%), Positives = 36/76 (47%)

Query:   254 CRYYMNTGTCKYGADCKFHHPKERIAQSAA---SNIGPLGLPSRPGQAICSNYSMYGICK 310
             C++++  G CK G  C+F H  +          S  GPL       + IC NY + G C 
Sbjct:    68 CKHWLR-GLCKKGDQCEFLHEYDMTKMPECYFYSKFGPLCRHRHTRRVICVNY-LVGFCP 125

Query:   311 FGPTCRFDHPYAGYPI 326
              GP+C+F HP    P+
Sbjct:   126 EGPSCKFMHPRFELPM 141


>UNIPROTKB|E2RBK7 [details] [associations]
            symbol:CPSF4 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=IEA] [GO:0005739
            "mitochondrion" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
            InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098 Pfam:PF00642
            PROSITE:PS50103 PROSITE:PS50158 SMART:SM00343 SMART:SM00356
            GO:GO:0046872 GO:GO:0008270 GO:GO:0003676 Gene3D:4.10.60.10
            SUPFAM:SSF57756 GO:GO:0005847 GeneTree:ENSGT00390000009627
            EMBL:AAEX03004276 Ensembl:ENSCAFT00000023892 Uniprot:E2RBK7
        Length = 212

 Score = 113 (44.8 bits), Expect = 0.00034, P = 0.00034
 Identities = 25/76 (32%), Positives = 36/76 (47%)

Query:   254 CRYYMNTGTCKYGADCKFHHPKERIAQSAA---SNIGPLGLPSRPGQAICSNYSMYGICK 310
             C++++  G CK G  C+F H  +          S  GPL       + IC NY + G C 
Sbjct:    68 CKHWLR-GLCKKGDQCEFLHEYDMTKMPECYFYSKFGPLCRHRHTRRVICVNY-LVGFCP 125

Query:   311 FGPTCRFDHPYAGYPI 326
              GP+C+F HP    P+
Sbjct:   126 EGPSCKFMHPRFELPM 141


>UNIPROTKB|F1LWJ4 [details] [associations]
            symbol:F1LWJ4 "Uncharacterized protein" species:10116
            "Rattus norvegicus" [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098 Pfam:PF00642
            PROSITE:PS50103 PROSITE:PS50158 SMART:SM00343 SMART:SM00356
            GO:GO:0046872 GO:GO:0008270 GO:GO:0003676 Gene3D:4.10.60.10
            GeneTree:ENSGT00390000009627 IPI:IPI00776496
            Ensembl:ENSRNOT00000029618 Uniprot:F1LWJ4
        Length = 243

 Score = 91 (37.1 bits), Expect = 0.00043, Sum P(2) = 0.00043
 Identities = 24/81 (29%), Positives = 35/81 (43%)

Query:   253 DCRYYMNTGTCKYGADCKFHH--PKERIAQS-----AASNIGPLGLPSRPGQAICSNYSM 305
             +C +Y     C  G DC F H  P+ +I        +    GPL       + +C NY +
Sbjct:    96 ECYFYSKFWKCS-GKDCSFVHMDPESKIKDCPWYDCSFCKHGPLCRYQHTRRVLCVNY-L 153

Query:   306 YGICKFGPTCRFDHPYAGYPI 326
              G C  G +C+F HP    P+
Sbjct:   154 VGFCPGGASCKFIHPRFELPM 174

 Score = 63 (27.2 bits), Expect = 0.00043, Sum P(2) = 0.00043
 Identities = 27/101 (26%), Positives = 41/101 (40%)

Query:    17 ADNIEEAIWRLKIHDNQEGGGVAQASPYPARPGEPDCLFYRRTGLCGYGSNCRFNHPAYA 76
             A  I+   + L+I   Q+ G    + P   + G   C F+ +   CG G  C F H    
Sbjct:     6 AGTIDHNKFALEITMEQQLGAQQLSFPSMDKSGAAVCEFFVKAA-CGKGGMCPFCH---- 60

Query:    77 AQGAQYREELPERNGQPDCGYYLKTGTCKYGSTCKYHHPKD 117
               G        E+     C ++L+ G CK G  C++ H  D
Sbjct:    61 ISG--------EKTVV--CQHWLR-GLCKKGDQCEFLHKYD 90


>WB|WBGene00013794 [details] [associations]
            symbol:dct-13 species:6239 "Caenorhabditis elegans"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] InterPro:IPR000571 Pfam:PF00642
            PROSITE:PS50103 SMART:SM00356 GO:GO:0008270 GO:GO:0003676
            eggNOG:COG5063 GeneTree:ENSGT00530000063262 EMBL:AL117204
            HOGENOM:HOG000114669 PIR:T31489 RefSeq:NP_503017.1
            ProteinModelPortal:Q9U2V4 SMR:Q9U2V4 STRING:Q9U2V4
            EnsemblMetazoa:Y116A8C.17 GeneID:178476 KEGG:cel:CELE_Y116A8C.17
            UCSC:Y116A8C.17 CTD:178476 WormBase:Y116A8C.17 InParanoid:Q9U2V4
            NextBio:901284 Uniprot:Q9U2V4
        Length = 205

 Score = 110 (43.8 bits), Expect = 0.00068, P = 0.00068
 Identities = 30/102 (29%), Positives = 45/102 (44%)

Query:    53 CLFYRRTGLCGYGSNCRFNHPAYAAQGAQYREELPERNGQPD-CGYYLKTGTCKYGSTCK 111
             CL ++R   C YG  C+F H  +  +  Q       RN +   C  +  TG CKYG+ C+
Sbjct:    98 CLSHKRGKTCIYGEACKFAHGVHELRCQQTTRN--HRNYKTVLCDKFTTTGYCKYGARCQ 155

Query:   112 Y-HHPKDRNGAGPVSFNILGLPMRQDEKSCPYYMRTGSFLPS 152
             + H   D   A          P  Q + S  + + + SFLP+
Sbjct:   156 FIHRSMDTTPAAKPMETADFKPNVQSDLSRAFALDSSSFLPN 197


>DICTYBASE|DDB_G0270148 [details] [associations]
            symbol:cpsf4 "cleavage and polyadenylation
            specificity factor 30 kDa subunit" species:44689 "Dictyostelium
            discoideum" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0006379 "mRNA
            cleavage" evidence=ISS] [GO:0006378 "mRNA polyadenylation"
            evidence=ISS] [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=ISS] [GO:0003723 "RNA binding"
            evidence=ISS] [GO:0046872 "metal ion binding" evidence=IEA]
            InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098 PROSITE:PS50103
            PROSITE:PS50158 SMART:SM00343 SMART:SM00356 dictyBase:DDB_G0270148
            EMBL:AAFI02000005 GenomeReviews:CM000150_GR GO:GO:0046872
            GO:GO:0008270 GO:GO:0006378 GO:GO:0003723 Gene3D:4.10.60.10
            SUPFAM:SSF57756 eggNOG:COG5084 GO:GO:0005847 GO:GO:0006379
            KO:K14404 RefSeq:XP_646578.1 ProteinModelPortal:Q55CA3 SMR:Q55CA3
            STRING:Q55CA3 EnsemblProtists:DDB0233701 GeneID:8617548
            KEGG:ddi:DDB_G0270148 InParanoid:Q55CA3 OMA:ECMYLHV
            ProtClustDB:CLSZ2437480 Uniprot:Q55CA3
        Length = 372

 Score = 90 (36.7 bits), Expect = 0.00071, Sum P(2) = 0.00071
 Identities = 20/76 (26%), Positives = 34/76 (44%)

Query:   252 PDCRYYMNTGTCKYGADCKFHH--PKERIAQS-----AASNIGPLGLPSRPGQAICSNYS 304
             P+C ++   G C    +C + H  P+E++ +           GP        + +C NY 
Sbjct:    91 PECYFFSKHGECN-NQECMYLHVNPEEKVRECPWYSRGFCKHGPKCRHKHIKKLLCENYY 149

Query:   305 MYGICKFGPTCRFDHP 320
             + G C  GP C++ HP
Sbjct:   150 L-GFCPEGPKCKYGHP 164

 Score = 68 (29.0 bits), Expect = 0.00071, Sum P(2) = 0.00071
 Identities = 12/29 (41%), Positives = 18/29 (62%)

Query:    88 ERNGQPDCGYYLKTGTCKYGSTCKYHHPK 116
             +++G   C ++LK G+C  GS C Y H K
Sbjct:    31 DKDGSDICRFFLK-GSCTKGSDCPYKHTK 58


>ZFIN|ZDB-GENE-990415-180 [details] [associations]
            symbol:cpsf4 "cleavage and polyadenylation specific
            factor 4" species:7955 "Danio rerio" [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0042462 "eye photoreceptor cell development" evidence=IMP]
            [GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000571
            InterPro:IPR001878 Pfam:PF00098 Pfam:PF00642 PROSITE:PS50103
            PROSITE:PS50158 SMART:SM00343 SMART:SM00356
            ZFIN:ZDB-GENE-990415-180 GO:GO:0046872 GO:GO:0008270 GO:GO:0003676
            Gene3D:4.10.60.10 SUPFAM:SSF57756 eggNOG:COG5084 GO:GO:0042462
            HOGENOM:HOG000212457 HOVERGEN:HBG051108 CTD:10898 KO:K14404
            OrthoDB:EOG4KH2VQ EMBL:U70479 EMBL:BC045289 IPI:IPI00630205
            RefSeq:NP_571084.1 UniGene:Dr.75095 SMR:Q98881 STRING:Q98881
            GeneID:30203 KEGG:dre:30203 InParanoid:Q98881 NextBio:20806666
            Uniprot:Q98881
        Length = 271

 Score = 92 (37.4 bits), Expect = 0.00080, Sum P(2) = 0.00080
 Identities = 38/144 (26%), Positives = 55/144 (38%)

Query:   252 PDCRYYMNTGTCKYGADCKFHH--PKERIAQS-----AASNIGPLGLPSRPGQAICSNYS 304
             P+C +Y   G C    +C F H  P+ +I             GP        + IC NY 
Sbjct:    94 PECYFYSKFGECS-NKECPFLHIDPESKIKDCPWYDRGFCKHGPDCRHRHTRRVICVNY- 151

Query:   305 MYGICKFGPTCRFDHPYAGYPINYGXX--XXXXXXXXXXXMNHQAI--SATHSIE-TSPD 359
             + G C  G +C+F HP    P+                   N Q I  S+   I+ T+P+
Sbjct:   152 LVGFCPEGKSCKFMHPRFELPMGATEQPPLPQQVQTQQKQQNMQPINRSSQSLIQLTNPN 211

Query:   360 ASSKIPNWVQNSDAVSVQHQNPDM 383
              S+     + N  AV + H N  M
Sbjct:   212 ISNNNHQRIPN--AVGIVHSNSHM 233

 Score = 61 (26.5 bits), Expect = 0.00080, Sum P(2) = 0.00080
 Identities = 27/93 (29%), Positives = 39/93 (41%)

Query:    27 LKIHDNQEGGGVAQASPYPA--RPGEPDCLFYRRTGLCGYGSNCRFNHPAYAAQGAQYRE 84
             L+I   Q+ G  AQ  P+P   + G   C ++ R   C  G  C F H      G     
Sbjct:    15 LEIAVEQQLG--AQPLPFPGMDKSGAAVCEYFMRAA-CMKGGMCPFRH----ISG----- 62

Query:    85 ELPERNGQPDCGYYLKTGTCKYGSTCKYHHPKD 117
                E+     C ++L+ G CK G  C++ H  D
Sbjct:    63 ---EKTVV--CKHWLR-GLCKKGDQCEFLHEYD 89


>UNIPROTKB|E2RBM0 [details] [associations]
            symbol:CPSF4 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] InterPro:IPR000571
            Pfam:PF00642 PROSITE:PS50103 SMART:SM00356 GO:GO:0008270
            GO:GO:0003676 GeneTree:ENSGT00390000009627 EMBL:AAEX03004276
            Ensembl:ENSCAFT00000023887 NextBio:20862973 Uniprot:E2RBM0
        Length = 164

 Score = 91 (37.1 bits), Expect = 0.00088, Sum P(2) = 0.00088
 Identities = 23/73 (31%), Positives = 32/73 (43%)

Query:   252 PDCRYYMNTGTCKYGADCKFHH--PKERIAQS-----AASNIGPLGLPSRPGQAICSNYS 304
             P+C +Y   G C    +C F H  P+ +I             GPL       + IC NY 
Sbjct:    92 PECYFYSKFGECS-NKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRHTRRVICVNY- 149

Query:   305 MYGICKFGPTCRF 317
             + G C  GP+C+F
Sbjct:   150 LVGFCPEGPSCKF 162

 Score = 51 (23.0 bits), Expect = 0.00088, Sum P(2) = 0.00088
 Identities = 9/23 (39%), Positives = 14/23 (60%)

Query:    95 CGYYLKTGTCKYGSTCKYHHPKD 117
             C ++L+ G CK G  C++ H  D
Sbjct:    66 CKHWLR-GLCKKGDQCEFLHEYD 87


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.314   0.132   0.419    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      416       403   0.00099  117 3  11 23  0.46    34
                                                     34  0.45    37
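
Note on bit scores: the values in parentheses on each "Score =" line appear to follow from the raw score and the gapped Lambda and K listed above (the Q=9,R=2 row) through the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2. The sketch below is an illustration under that assumption and is not part of the BLAST output. The function and constant names are hypothetical, and because the table prints Lambda and K to only three significant figures, the last digit may not match for every score.

```python
import math

# Gapped parameters copied from the "As Used" columns of the Q=9,R=2 row.
# Assumption: these are the values behind the reported bit scores.
GAPPED_LAMBDA = 0.244
GAPPED_K = 0.0300


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores taken from "Score =" lines in the report above.
for raw in (101, 113, 118):
    print(raw, round(bit_score(raw, GAPPED_LAMBDA, GAPPED_K), 1))
```

For the three raw scores used here, the rounded results agree with the report: 101 (40.6 bits), 113 (44.8 bits), and 118 (46.6 bits).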


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  37
  No. of states in DFA:  608 (65 KB)
  Total size of DFA:  289 KB (2148 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  36.53u 0.09s 36.62t   Elapsed:  00:00:06
  Total cpu time:  36.54u 0.09s 36.63t   Elapsed:  00:00:06
  Start:  Tue May 21 01:25:26 2013   End:  Tue May 21 01:25:32 2013
