BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on
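
As a rough illustration of how the parameters above could act on a hit list, the sketch below keeps hits whose P value is at or below the threshold and caps how many are shown. The function name `select_hits` and the `(identifier, P value)` tuple layout are hypothetical, and treating the threshold as inclusive is an assumption rather than documented server behaviour.

```python
from typing import List, Tuple

# (identifier, P value); this layout is a hypothetical choice for the sketch.
Hit = Tuple[str, float]


def select_hits(hits: List[Hit],
                threshold: float = 0.001,
                max_alignments: int = 100) -> List[Hit]:
    """Keep hits with P <= threshold, best (smallest P) first, capped in number."""
    kept = [h for h in hits if h[1] <= threshold]  # inclusive cut-off is assumed
    kept.sort(key=lambda h: h[1])
    return kept[:max_alignments]


# Two P values taken from the report plus one made-up weak hit.
example = [("LEE1", 0.00089),
           ("hypothetical_weak_hit", 0.05),
           ("AT1G04990", 3.7e-48)]
print(select_hits(example))
```

With the defaults, the made-up weak hit is dropped and the remaining two are returned in order of increasing P value.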

Query Sequence

>020994
MRQDEKSCPYYMRTGSCKFGVACKFHHPQPSSLGTALPLTGNASLGSMGSSVLPSSGLQY
AGSLPTWSLQRAPYLSSRLQGTQSYMPLIVSPSQGIVPAPGWNTYMGNIGPLSPTSIAGS
NLIYSSRNQGDLGAGAQMHILSASSQNLPERPDQPDCRYYMNTGTCKYGADCKFHHPKER
IAQSAASNIGPLGLPSRPGQAICSNYSMYGICKFGPTCRFDHPYAGYPINYGLSLPPLSI
LDSSLMNHQAISATHSIETSPDASSKIPNWVQNSDAVSVQHQNPDMKNSTTKNSDDSSKV
DHPPHSVPNCSEPPHDQSN

High Scoring Gene Products

Symbol, full name | Information | P value
AT1G04990 | protein from Arabidopsis thaliana | 3.7e-48
AT2G47850 | protein from Arabidopsis thaliana | 1.9e-42
AT3G48440 | protein from Arabidopsis thaliana | 1.4e-41
AT5G18550 | protein from Arabidopsis thaliana | 2.0e-40
AT3G06410 | protein from Arabidopsis thaliana | 3.9e-37
ZFN1, AT3G02830 | protein from Arabidopsis thaliana | 3.1e-35
ZFN3, AT5G16540 | protein from Arabidopsis thaliana | 2.9e-32
HUA1, ENHANCER OF AG-4 1 | protein from Arabidopsis thaliana | 1.5e-25
AT1G48195 | protein from Arabidopsis thaliana | 1.9e-21
CPSF4, Cleavage and polyadenylation-specificity factor subunit 4 | protein from Homo sapiens | 6.4e-07
cth1 | gene_product from Danio rerio | 1.0e-06
LOC100738395, Uncharacterized protein | protein from Sus scrofa | 2.7e-05
LOC100518830, Uncharacterized protein | protein from Sus scrofa | 3.7e-05
ccch-5 | gene from Caenorhabditis elegans | 3.9e-05
CPSF4, Cleavage and polyadenylation-specificity factor subunit 4 | protein from Homo sapiens | 4.7e-05
CPSF4, Cleavage and polyadenylation specificity factor subunit 4 | protein from Homo sapiens | 5.1e-05
CPSF4, Uncharacterized protein | protein from Gallus gallus | 6.0e-05
ccch-2 | gene from Caenorhabditis elegans | 0.00011
CPSF4, Cleavage and polyadenylation specificity factor subunit 4 | protein from Bos taurus | 0.00019
Cpsf4, cleavage and polyadenylation specific factor 4 | gene from Rattus norvegicus | 0.00019
CPSF4L, Putative cleavage and polyadenylation-specificity factor subunit 4-like protein | protein from Homo sapiens | 0.00020
CPSF4, Cleavage and polyadenylation-specificity factor subunit 4 | protein from Homo sapiens | 0.00020
Cpsf4, cleavage and polyadenylation specific factor 4 | protein from Mus musculus | 0.00020
CPSF4, Uncharacterized protein | protein from Canis lupus familiaris | 0.00021
CPSF4, Uncharacterized protein | protein from Canis lupus familiaris | 0.00029
YTH1 | gene_product from Candida albicans | 0.00030
YTH1, mRNA 3'-end-processing protein YTH1 | protein from Candida albicans SC5314 | 0.00030
MBNL1, Muscleblind-like protein 1 | protein from Homo sapiens | 0.00030
MBNL1, Muscleblind-like protein 1 | protein from Homo sapiens | 0.00037
Mbnl1, muscleblind-like 1 (Drosophila) | protein from Mus musculus | 0.00040
MBNL1, Muscleblind-like protein 1 | protein from Gallus gallus | 0.00046
MBNL1, Muscleblind-like protein 1 | protein from Gallus gallus | 0.00049
MBNL1, Uncharacterized protein | protein from Canis lupus familiaris | 0.00052
MBNL1, Muscleblind-like protein 1 | protein from Homo sapiens | 0.00052
MBNL1, Uncharacterized protein | protein from Bos taurus | 0.00060
MBNL1, Uncharacterized protein | protein from Canis lupus familiaris | 0.00063
MBNL1, Uncharacterized protein | protein from Canis lupus familiaris | 0.00064
cpsf4, Cleavage and polyadenylation specificity factor subunit 4 | protein from Xenopus (Silurana) tropicalis | 0.00064
cpsf4, Cleavage and polyadenylation specificity factor subunit 4 | protein from Xenopus laevis | 0.00064
MBNL1, Uncharacterized protein | protein from Canis lupus familiaris | 0.00068
MBNL1, Uncharacterized protein | protein from Sus scrofa | 0.00068
CPSF4, Uncharacterized protein | protein from Canis lupus familiaris | 0.00078
LEE1, Zinc-finger protein of unknown function | gene from Saccharomyces cerevisiae | 0.00089

The BLAST search returned 3 gene products which did not match your query constraints. Please see the full BLAST report below for the details.

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  020994
        (319 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2010562 - symbol:AT1G04990 species:3702 "Arabi...   422  3.7e-48   2
TAIR|locus:2043368 - symbol:AT2G47850 species:3702 "Arabi...   449  1.9e-42   1
TAIR|locus:2101170 - symbol:AT3G48440 species:3702 "Arabi...   441  1.4e-41   1
TAIR|locus:2182988 - symbol:AT5G18550 species:3702 "Arabi...   430  2.0e-40   1
TAIR|locus:2081066 - symbol:AT3G06410 species:3702 "Arabi...   399  3.9e-37   1
TAIR|locus:2075477 - symbol:ZFN1 "zinc finger protein 1" ...   381  3.1e-35   1
TAIR|locus:2171407 - symbol:ZFN3 "zinc finger nuclease 3"...   353  2.9e-32   1
TAIR|locus:2087775 - symbol:HUA1 "ENHANCER OF AG-4 1" spe...   295  1.5e-25   1
TAIR|locus:1006230718 - symbol:AT1G48195 species:3702 "Ar...   251  1.9e-21   1
UNIPROTKB|C9K0K2 - symbol:CPSF4 "Cleavage and polyadenyla...    91  6.4e-07   2
ZFIN|ZDB-GENE-990806-20 - symbol:cth1 "cth1" species:7955...   116  1.0e-06   2
POMBASE|SPAC227.08c - symbol:yth1 "mRNA cleavage and poly...   105  1.4e-05   2
UNIPROTKB|I3LCK9 - symbol:LOC100738395 "Uncharacterized p...   104  2.7e-05   3
UNIPROTKB|F1REX3 - symbol:LOC100518830 "Uncharacterized p...   105  3.7e-05   3
WB|WBGene00013319 - symbol:ccch-5 species:6239 "Caenorhab...   118  3.9e-05   1
UNIPROTKB|D4A905 - symbol:Cpsf4 "Cleavage and polyadenyla...   109  4.6e-05   2
UNIPROTKB|B7Z7B0 - symbol:CPSF4 "Cleavage and polyadenyla...   104  4.7e-05   2
UNIPROTKB|O95639 - symbol:CPSF4 "Cleavage and polyadenyla...   104  5.1e-05   3
UNIPROTKB|E1BV31 - symbol:CPSF4 "Uncharacterized protein"...   108  6.0e-05   2
WB|WBGene00009537 - symbol:ccch-2 species:6239 "Caenorhab...   113  0.00011   1
ASPGD|ASPL0000062209 - symbol:AN0298 species:162425 "Emer...    94  0.00013   2
UNIPROTKB|O19137 - symbol:CPSF4 "Cleavage and polyadenyla...   104  0.00019   2
RGD|620440 - symbol:Cpsf4 "cleavage and polyadenylation s...   104  0.00019   2
UNIPROTKB|H9KVA5 - symbol:CPSF4L "Putative cleavage and p...    91  0.00020   2
UNIPROTKB|C9JEV9 - symbol:CPSF4 "Cleavage and polyadenyla...   113  0.00020   1
MGI|MGI:1861602 - symbol:Cpsf4 "cleavage and polyadenylat...   113  0.00020   1
UNIPROTKB|E2RBK7 - symbol:CPSF4 "Uncharacterized protein"...   113  0.00021   1
UNIPROTKB|J9P398 - symbol:CPSF4 "Uncharacterized protein"...   104  0.00029   2
CGD|CAL0005897 - symbol:YTH1 species:5476 "Candida albica...   101  0.00030   2
UNIPROTKB|Q59T36 - symbol:YTH1 "mRNA 3'-end-processing pr...   101  0.00030   2
UNIPROTKB|H7C4T5 - symbol:MBNL1 "Muscleblind-like protein...   111  0.00030   2
UNIPROTKB|F1LPR3 - symbol:Mbnl1 "Protein Mbnl1" species:1...   114  0.00036   1
UNIPROTKB|C9JP00 - symbol:MBNL1 "Muscleblind-like protein...   111  0.00037   2
MGI|MGI:1928482 - symbol:Mbnl1 "muscleblind-like 1 (Droso...   115  0.00040   1
UNIPROTKB|Q5ZKW9 - symbol:MBNL1 "Muscleblind-like protein...   115  0.00046   1
UNIPROTKB|F1M9N4 - symbol:Mbnl1 "Protein Mbnl1" species:1...   114  0.00047   1
UNIPROTKB|F1NBC8 - symbol:MBNL1 "Muscleblind-like protein...   115  0.00049   1
UNIPROTKB|E2QSA8 - symbol:MBNL1 "Uncharacterized protein"...   114  0.00052   1
UNIPROTKB|Q9NR56 - symbol:MBNL1 "Muscleblind-like protein...   111  0.00052   2
UNIPROTKB|G3X6F9 - symbol:MBNL1 "Uncharacterized protein"...   114  0.00060   1
UNIPROTKB|E2QSC6 - symbol:MBNL1 "Uncharacterized protein"...   114  0.00063   1
UNIPROTKB|F6UUJ0 - symbol:MBNL1 "Uncharacterized protein"...   114  0.00064   1
UNIPROTKB|Q66KE3 - symbol:cpsf4 "Cleavage and polyadenyla...   101  0.00064   2
UNIPROTKB|Q6DJP7 - symbol:cpsf4 "Cleavage and polyadenyla...   101  0.00064   2
UNIPROTKB|F6UU15 - symbol:MBNL1 "Uncharacterized protein"...   114  0.00068   1
UNIPROTKB|F1SJM7 - symbol:MBNL1 "Uncharacterized protein"...   114  0.00068   1
UNIPROTKB|E2RBM0 - symbol:CPSF4 "Uncharacterized protein"...    91  0.00078   2
SGD|S000005975 - symbol:LEE1 "Zinc-finger protein of unkn...   111  0.00089   1
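
The one-line hit summaries above follow a regular layout: an identifier, a truncated description, then the High Score, the smallest sum probability P(N), and N. A minimal parsing sketch, assuming that layout holds for every row, might look like the following. The names `HitSummary` and `parse_hit_line` are hypothetical and are not part of any BLAST distribution.

```python
import re
from typing import NamedTuple, Optional


class HitSummary(NamedTuple):
    identifier: str   # e.g. "TAIR|locus:2010562"
    description: str  # truncated description, kept as printed
    score: int        # High Score column
    p_value: float    # Smallest Sum Probability P(N)
    n: int            # N column


# identifier, " - ", lazy description, then three whitespace-separated columns.
_HIT_LINE = re.compile(r"^(\S+) - (.*?)\s+(\d+)\s+(\S+)\s+(\d+)\s*$")


def parse_hit_line(line: str) -> Optional[HitSummary]:
    """Return the parsed fields, or None if the line is not a hit summary."""
    m = _HIT_LINE.match(line)
    if m is None:
        return None
    ident, desc, score, p, n = m.groups()
    try:
        p_value = float(p)
    except ValueError:
        return None
    return HitSummary(ident, desc, int(score), p_value, int(n))


line = 'TAIR|locus:2010562 - symbol:AT1G04990 species:3702 "Arabi...   422  3.7e-48   2'
print(parse_hit_line(line))
```

Lines that do not match, such as the column header, yield `None`, so the function can be mapped over the whole table and the results filtered.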


>TAIR|locus:2010562 [details] [associations]
            symbol:AT1G04990 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA;ISS] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0007623 "circadian rhythm" evidence=RCA] InterPro:IPR000571
            Pfam:PF00642 PROSITE:PS50103 SMART:SM00356 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0005634 GO:GO:0046872 GO:GO:0003677
            GO:GO:0008270 GO:GO:0003723 GO:GO:0090305 EMBL:AC004809
            GO:GO:0004518 HOGENOM:HOG000237733 EMBL:AY048253 EMBL:AY113065
            IPI:IPI00522113 PIR:F86183 RefSeq:NP_563725.1 RefSeq:NP_973759.1
            UniGene:At.21743 ProteinModelPortal:Q94AD9 SMR:Q94AD9 PaxDb:Q94AD9
            PRIDE:Q94AD9 EnsemblPlants:AT1G04990.1 EnsemblPlants:AT1G04990.2
            GeneID:839351 KEGG:ath:AT1G04990 TAIR:At1g04990 eggNOG:NOG290936
            InParanoid:Q94AD9 OMA:THQRISP PhylomeDB:Q94AD9
            ProtClustDB:CLSN2687681 Genevestigator:Q94AD9 GermOnline:AT1G04990
            Uniprot:Q94AD9
        Length = 404

 Score = 422 (153.6 bits), Expect = 3.7e-48, Sum P(2) = 3.7e-48
 Identities = 101/263 (38%), Positives = 134/263 (50%)

Query:    38 PLTGNASLGSMGSSVLPSSGLQYAGSLPTWSLQRAPYLSSRLQGTQSYMPLIVSPSQGIV 97
             P   N    + G S  P++ L+YA  L   S         R Q  QSY+P++VSPSQG +
Sbjct:   161 PQPDNGHSTAYGMSSFPAADLRYASGLTMMSTYGT---LPRPQVPQSYVPILVSPSQGFL 217

Query:    98 PAPGWNTYMGNIGPLSPTSIAGSNLIYSSRNQGDL-GAGAQMHILSASSQNLPERPDQPD 156
             P  GW  YM           A SN +Y+ +NQ    G+ A M +  A ++ L E  DQP+
Sbjct:   218 PPQGWAPYM-----------AASNSMYNVKNQPYYSGSSASMAMAVALNRGLSESSDQPE 266

Query:   157 CRYYMNTGTCKYGADCKFHHPKERIAQSAASNIGPLGLPSRPGQAICSNYSMYGICKFGP 216
             CR++MNTGTCKYG DCK+ HP  RI+Q   S I P  LP+RPGQ  C N+  YG CKFGP
Sbjct:   267 CRFFMNTGTCKYGDDCKYSHPGVRISQPPPSLINPFVLPARPGQPACGNFRSYGFCKFGP 326

Query:   217 TCRFDHPYAGYPINYGXXXXXXXXXXXXXMNHQAISATHSIETSPDASSKIPNWVQNSDA 276
              C+FDHP   YP                   HQ IS T +   S   S+  P+  + S  
Sbjct:   327 NCKFDHPMLPYP-GLTMATSLPTPFASPVTTHQRISPTPNRSDSKSLSNGKPDVKKESS- 384

Query:   277 VSVQHQNPDMKNSTTKN-SDDSS 298
                + + PD  N   ++ S+D+S
Sbjct:   385 ---ETEKPD--NGEVQDLSEDAS 402

 Score = 236 (88.1 bits), Expect = 1.8e-19, P = 1.8e-19
 Identities = 50/106 (47%), Positives = 65/106 (61%)

Query:     1 MRQDEKSCPYYMRTGSCKFGVACKFHHPQPSSLGTALPLTGNASLGSMGSSVLPSSGLQY 60
             MR  EK CPYY+RTG+C+FGVACKFHHPQP +        G+++  + G S  P++ L+Y
Sbjct:   134 MRLGEKPCPYYLRTGTCRFGVACKFHHPQPDN--------GHST--AYGMSSFPAADLRY 183

Query:    61 AGSLPTWSLQRAPYLSSRLQGTQSYMPLIVSPSQGIVPAPGWNTYM 106
             A  L   S         R Q  QSY+P++VSPSQG +P  GW  YM
Sbjct:   184 ASGLTMMSTYGT---LPRPQVPQSYVPILVSPSQGFLPPQGWAPYM 226

 Score = 230 (86.0 bits), Expect = 3.4e-26, Sum P(2) = 3.4e-26
 Identities = 43/90 (47%), Positives = 54/90 (60%)

Query:   146 QNLPERPDQPDCRYYMNTGTCKYGADCKFHHPKERI-AQSAASNIGPLGLPSRPGQAICS 204
             + LPER  QPDC Y++ TG CKYG  CK+HHPK+R  AQ    N+  +GLP R G+  C 
Sbjct:    85 EELPERIGQPDCEYFLKTGACKYGPTCKYHHPKDRNGAQPVMFNV--IGLPMRLGEKPCP 142

Query:   205 NYSMYGICKFGPTCRFDHPYA--GYPINYG 232
              Y   G C+FG  C+F HP    G+   YG
Sbjct:   143 YYLRTGTCRFGVACKFHHPQPDNGHSTAYG 172

 Score = 190 (71.9 bits), Expect = 1.1e-12, P = 1.1e-12
 Identities = 41/117 (35%), Positives = 63/117 (53%)

Query:   111 PLSPTSIAGSNLIYSSRNQGDL-GAGAQMHILSASSQNL---PERPDQPDCRYYMNTGTC 166
             P+S T    S+L+ S R+   +  A  +M +     + L   P+RP + DC++Y+ TG C
Sbjct:     4 PMSDTQHVQSSLV-SIRSSDKIEDAFRKMKVNETGVEELNPYPDRPGERDCQFYLRTGLC 62

Query:   167 KYGADCKFHHPKERIAQSAASNIGPLGLPSRPGQAICSNYSMYGICKFGPTCRFDHP 223
              YG+ C+++HP   + Q  A     L  P R GQ  C  +   G CK+GPTC++ HP
Sbjct:    63 GYGSSCRYNHPTH-LPQDVAYYKEEL--PERIGQPDCEYFLKTGACKYGPTCKYHHP 116

 Score = 98 (39.6 bits), Expect = 3.7e-48, Sum P(2) = 3.7e-48
 Identities = 13/28 (46%), Positives = 21/28 (75%)

Query:     2 RQDEKSCPYYMRTGSCKFGVACKFHHPQ 29
             R  +  C Y+++TG+CK+G  CK+HHP+
Sbjct:    90 RIGQPDCEYFLKTGACKYGPTCKYHHPK 117

 Score = 89 (36.4 bits), Expect = 3.3e-47, Sum P(2) = 3.3e-47
 Identities = 12/27 (44%), Positives = 20/27 (74%)

Query:     2 RQDEKSCPYYMRTGSCKFGVACKFHHP 28
             R  E+ C +Y+RTG C +G +C+++HP
Sbjct:    47 RPGERDCQFYLRTGLCGYGSSCRYNHP 73
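
Each HSP above is introduced by a `Score = ...` line and an `Identities = ...` line. The sketch below pulls the numbers out of such a pair and recomputes the percentages. The function name `parse_hsp_header` is hypothetical. The use of integer truncation is an inference from this report, where 134/263 (about 50.95%) is printed as 50%, and not a documented rule.

```python
import re

_NUM = r"[\d.eE+-]+"
_SCORE = re.compile(
    r"Score = (\d+) \(([\d.]+) bits\), Expect = (" + _NUM + r"), "
    r"(?:Sum P\((\d+)\)|P) = (" + _NUM + r")"
)
_IDENT = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\), Positives = (\d+)/(\d+) \((\d+)%\)"
)


def parse_hsp_header(score_line: str, ident_line: str) -> dict:
    """Parse one HSP's Score and Identities lines into a dict of numbers."""
    s = _SCORE.search(score_line)
    i = _IDENT.search(ident_line)
    if s is None or i is None:
        raise ValueError("not an HSP header pair")
    identities, length = int(i.group(1)), int(i.group(2))
    positives = int(i.group(4))
    return {
        "score": int(s.group(1)),
        "bits": float(s.group(2)),
        "expect": float(s.group(3)),
        # "P = ..." (no "Sum") is treated as a group of one HSP.
        "sum_n": int(s.group(4)) if s.group(4) else 1,
        "p": float(s.group(5)),
        "identities": identities,
        "positives": positives,
        "length": length,
        # Truncated percentages (assumption inferred from the report).
        "pct_identity": 100 * identities // length,
        "pct_positive": 100 * positives // length,
    }


hsp = parse_hsp_header(
    " Score = 422 (153.6 bits), Expect = 3.7e-48, Sum P(2) = 3.7e-48",
    " Identities = 101/263 (38%), Positives = 134/263 (50%)",
)
print(hsp)
```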


>TAIR|locus:2043368 [details] [associations]
            symbol:AT2G47850 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA;ISS] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0008270 "zinc ion binding" evidence=IEA]
            InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103 SMART:SM00356
            GO:GO:0005634 EMBL:CP002685 GenomeReviews:CT485783_GR GO:GO:0046872
            GO:GO:0003677 GO:GO:0008270 EMBL:AC005309 EMBL:BT030391
            EMBL:BT004106 IPI:IPI00519400 PIR:C84920 RefSeq:NP_001078078.1
            RefSeq:NP_182306.2 UniGene:At.21006 ProteinModelPortal:Q84W91
            SMR:Q84W91 PaxDb:Q84W91 PRIDE:Q84W91 EnsemblPlants:AT2G47850.1
            EnsemblPlants:AT2G47850.3 GeneID:819397 KEGG:ath:AT2G47850
            TAIR:At2g47850 eggNOG:NOG312935 HOGENOM:HOG000237733
            InParanoid:Q84W91 OMA:RYGVACK PhylomeDB:Q84W91
            ProtClustDB:CLSN2680305 Genevestigator:Q84W91 GermOnline:AT2G47850
            Uniprot:Q84W91
        Length = 468

 Score = 449 (163.1 bits), Expect = 1.9e-42, P = 1.9e-42
 Identities = 97/236 (41%), Positives = 134/236 (56%)

Query:     1 MRQDEKSCPYYMRTGSCKFGVACKFHHPQPSSLGTALPLTGNASLGSMGSSVLPSSGLQY 60
             +R+ +  C YY++TG CKFG+ CKFHHPQP+  GT +P    AS      SV      QY
Sbjct:   135 VREGDNECSYYLKTGQCKFGITCKFHHPQPA--GTTVP-PPPASAPQFYPSVQSLMPDQY 191

Query:    61 AGSLPTWSLQRAPYL--SSRLQGTQSYMPLIVSPSQGIVPAPGWNTYMGNIGP-LSP--- 114
              G  P+ SL+ A  L   S +QG  +Y P++++P  G+VP PGW+ Y   + P LSP   
Sbjct:   192 GG--PSSSLRVARTLLPGSYMQG--AYGPMLLTP--GVVPIPGWSPYSAPVSPALSPGAQ 245

Query:   115 -----TSIAGSNLIYSSRNQ--GDLGAGAQMHILSASSQNLPERPDQPDCRYYMNTGTCK 167
                  TS+ G   + S+     G   + +    +    Q  PERP +P+C+YY+ TG CK
Sbjct:   246 HAVGATSLYGVTQLTSTTPSLPGVYPSLSSPTGVIQKEQAFPERPGEPECQYYLKTGDCK 305

Query:   168 YGADCKFHHPKERIAQSAASNIGPLGLPSRPGQAICSNYSMYGICKFGPTCRFDHP 223
             +G  CKFHHP++R+   A   + P+GLP RPG   C+ Y   G CKFG TC+FDHP
Sbjct:   306 FGTSCKFHHPRDRVPPRANCVLSPIGLPLRPGVQRCTFYVQNGFCKFGSTCKFDHP 361

 Score = 211 (79.3 bits), Expect = 5.2e-21, Sum P(2) = 5.2e-21
 Identities = 41/78 (52%), Positives = 50/78 (64%)

Query:   149 PERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQSAASNIGPL---GLPSRPGQAICSN 205
             PER  +P C++Y+ TGTCK+GA CKFHHPK   A  + S++ PL   G P R G   CS 
Sbjct:    88 PERFGEPPCQFYLKTGTCKFGASCKFHHPKN--AGGSMSHV-PLNIYGYPVREGDNECSY 144

Query:   206 YSMYGICKFGPTCRFDHP 223
             Y   G CKFG TC+F HP
Sbjct:   145 YLKTGQCKFGITCKFHHP 162

 Score = 211 (79.3 bits), Expect = 3.3e-15, P = 3.3e-15
 Identities = 40/94 (42%), Positives = 51/94 (54%)

Query:   131 DLGAGAQMHILSASSQNLPERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQSAASNIG 190
             D G    M  L   S + PERP  PDC YYM TG C YG  C+++HP++R   S  + + 
Sbjct:    25 DTGLQESMWRLGLGSDSYPERPGAPDCAYYMRTGVCGYGNRCRYNHPRDRA--SVEATVR 82

Query:   191 PLG-LPSRPGQAICSNYSMYGICKFGPTCRFDHP 223
               G  P R G+  C  Y   G CKFG +C+F HP
Sbjct:    83 ATGQYPERFGEPPCQFYLKTGTCKFGASCKFHHP 116

 Score = 114 (45.2 bits), Expect = 7.0e-11, Sum P(2) = 7.0e-11
 Identities = 19/45 (42%), Positives = 25/45 (55%)

Query:   134 AGAQMHILSASSQNLPERPDQPDCRYYMNTGTCKYGADCKFHHPK 178
             AG  M  +  +    P R    +C YY+ TG CK+G  CKFHHP+
Sbjct:   119 AGGSMSHVPLNIYGYPVREGDNECSYYLKTGQCKFGITCKFHHPQ 163

 Score = 109 (43.4 bits), Expect = 7.0e-11, Sum P(2) = 7.0e-11
 Identities = 16/28 (57%), Positives = 22/28 (78%)

Query:     2 RQDEKSCPYYMRTGSCKFGVACKFHHPQ 29
             R  E  C +Y++TG+CKFG +CKFHHP+
Sbjct:    90 RFGEPPCQFYLKTGTCKFGASCKFHHPK 117

 Score = 96 (38.9 bits), Expect = 5.8e-09, Sum P(2) = 5.8e-09
 Identities = 18/43 (41%), Positives = 26/43 (60%)

Query:   148 LPERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQS-AASNI 189
             LP RP    C +Y+  G CK+G+ CKF HP   I  + +AS++
Sbjct:   332 LPLRPGVQRCTFYVQNGFCKFGSTCKFDHPMGTIRYNPSASSL 374

 Score = 88 (36.0 bits), Expect = 5.2e-21, Sum P(2) = 5.2e-21
 Identities = 16/41 (39%), Positives = 24/41 (58%)

Query:     2 RQDEKSCPYYMRTGSCKFGVACKFHHPQP-SSLGTALPLTG 41
             R     C YYMRTG C +G  C+++HP+  +S+   +  TG
Sbjct:    45 RPGAPDCAYYMRTGVCGYGNRCRYNHPRDRASVEATVRATG 85
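
In the HSP headers above, the Expect and P values are printed as the same number. That is what the usual Poisson relation P = 1 - exp(-E) predicts when E is very small, since the two then agree to within rounding. The sketch below evaluates that relation; the function name `expect_to_p` is hypothetical, and applying the formula to this report's values is an illustration, not a reconstruction of how the server computed them.

```python
import math


def expect_to_p(expect: float) -> float:
    """P = 1 - exp(-E), using expm1 so tiny E values keep their precision."""
    return -math.expm1(-expect)


# Two Expect values that appear in this report, plus two larger ones
# (made up for contrast) where P and E clearly diverge.
for e in (1.9e-42, 0.00051, 1.0, 10.0):
    print(f"E = {e:g}  ->  P = {expect_to_p(e):.3g}")
```

For the larger made-up values P saturates below 1 while E keeps growing, which is why the two columns only look interchangeable for strong hits.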


>TAIR|locus:2101170 [details] [associations]
            symbol:AT3G48440 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA;ISS] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=RCA] InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103
            SMART:SM00356 GO:GO:0005634 EMBL:CP002686 GenomeReviews:BA000014_GR
            GO:GO:0046872 GO:GO:0003677 GO:GO:0008270 EMBL:AL049659
            HOGENOM:HOG000237733 EMBL:BT033139 IPI:IPI00517303 PIR:T06698
            RefSeq:NP_190414.1 UniGene:At.50258 ProteinModelPortal:Q9STM4
            SMR:Q9STM4 PaxDb:Q9STM4 PRIDE:Q9STM4 EnsemblPlants:AT3G48440.1
            GeneID:824003 KEGG:ath:AT3G48440 TAIR:At3g48440 eggNOG:NOG288127
            InParanoid:Q9STM4 OMA:PEWNGYQ PhylomeDB:Q9STM4
            ProtClustDB:CLSN2719348 Genevestigator:Q9STM4 GermOnline:AT3G48440
            Uniprot:Q9STM4
        Length = 448

 Score = 441 (160.3 bits), Expect = 1.4e-41, P = 1.4e-41
 Identities = 93/230 (40%), Positives = 131/230 (56%)

Query:     1 MRQDEKSCPYYMRTGSCKFGVACKFHHPQPSSLG-TALP-LTGNASLGSMGSSVLPSSGL 58
             +R  E  CPYYMR GSCK+G  CKF+HP P+++G T  P   GN  + S+G+   P +  
Sbjct:   203 LRPGEVECPYYMRNGSCKYGAECKFNHPDPTTIGGTDSPSFRGNNGV-SIGT-FSPKATF 260

Query:    59 QYAGSLPTWSLQRAPYLSSRLQGTQSYMPLIVSPSQGIVPA-PGWNTYMGNI-----GPL 112
             Q   S  +WS  R       + GT  ++P+++S + G+    P WN Y  ++     G  
Sbjct:   261 Q--ASSTSWSSPR------HVNGTSPFIPVMLSQTHGVTSQNPEWNGYQASVYSSERGVF 312

Query:   113 SPTSIAGSNLIYSSRNQGDLGAGAQMHILSASSQNLPERPDQPDCRYYMNTGTCKYGADC 172
             SP++   + L+ +S  +  +      H + A  +  PERPDQP+C YYM TG CK+  +C
Sbjct:   313 SPST---TYLMNNSSAETSMLLSQYRHQMPA--EEFPERPDQPECSYYMKTGDCKFKFNC 367

Query:   173 KFHHPKERIAQSAASNIGPLGLPSRPGQAICSNYSMYGICKFGPTCRFDH 222
             K+HHPK R+ +     +   GLP RP Q IC+ YS YGICKFGP CRFDH
Sbjct:   368 KYHHPKNRLPKLPPYALNDKGLPLRPDQNICTYYSRYGICKFGPACRFDH 417

 Score = 191 (72.3 bits), Expect = 3.4e-20, Sum P(2) = 3.4e-20
 Identities = 34/70 (48%), Positives = 43/70 (61%)

Query:   156 DCRYYMNTGTCKYGADCKFHH--PKERIAQSAASNIGPLGLPSRPGQAICSNYSMYGICK 213
             DC+YY  TG CKYG  C+F+H  PK  +A +   N   LGLP RPG+  C  Y   G CK
Sbjct:   163 DCKYYFRTGGCKYGETCRFNHTIPKSGLASAPELNF--LGLPLRPGEVECPYYMRNGSCK 220

Query:   214 FGPTCRFDHP 223
             +G  C+F+HP
Sbjct:   221 YGAECKFNHP 230

 Score = 168 (64.2 bits), Expect = 6.0e-10, P = 6.0e-10
 Identities = 37/87 (42%), Positives = 48/87 (55%)

Query:   145 SQNL-PERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQSAASNI--GPLGLPSRPGQA 201
             S+N+ P RP   DC +YM TG+CK+G+ CKF+HP  R  Q A  N          + G  
Sbjct:   103 SENVYPVRPGAEDCSFYMRTGSCKFGSSCKFNHPLARKFQIARDNKVREKEDDGGKLGLI 162

Query:   202 ICSNYSMYGICKFGPTCRFDH--PYAG 226
              C  Y   G CK+G TCRF+H  P +G
Sbjct:   163 DCKYYFRTGGCKYGETCRFNHTIPKSG 189

 Score = 123 (48.4 bits), Expect = 3.7e-16, Sum P(3) = 3.7e-16
 Identities = 19/30 (63%), Positives = 24/30 (80%)

Query:   148 LPERPDQPDCRYYMNTGTCKYGADCKFHHP 177
             LP RP + +C YYM  G+CKYGA+CKF+HP
Sbjct:   201 LPLRPGEVECPYYMRNGSCKYGAECKFNHP 230

 Score = 108 (43.1 bits), Expect = 3.4e-20, Sum P(2) = 3.4e-20
 Identities = 17/28 (60%), Positives = 22/28 (78%)

Query:     1 MRQDEKSCPYYMRTGSCKFGVACKFHHP 28
             +R   + C +YMRTGSCKFG +CKF+HP
Sbjct:   109 VRPGAEDCSFYMRTGSCKFGSSCKFNHP 136

 Score = 92 (37.4 bits), Expect = 1.6e-14, Sum P(3) = 1.6e-14
 Identities = 14/25 (56%), Positives = 17/25 (68%)

Query:     8 CPYYMRTGSCKFGVACKFHHPQPSS 32
             C YY RTG CK+G  C+F+H  P S
Sbjct:   164 CKYYFRTGGCKYGETCRFNHTIPKS 188

 Score = 78 (32.5 bits), Expect = 3.7e-16, Sum P(3) = 3.7e-16
 Identities = 14/29 (48%), Positives = 16/29 (55%)

Query:   195 PSRPGQAICSNYSMYGICKFGPTCRFDHP 223
             P RP Q  CS Y   G CKF   C++ HP
Sbjct:   344 PERPDQPECSYYMKTGDCKFKFNCKYHHP 372


>TAIR|locus:2182988 [details] [associations]
            symbol:AT5G18550 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA;ISS] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=RCA] InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103
            SMART:SM00356 GO:GO:0005634 EMBL:CP002688 GenomeReviews:BA000015_GR
            GO:GO:0046872 GO:GO:0003677 GO:GO:0008270 eggNOG:NOG312935
            HOGENOM:HOG000237733 ProtClustDB:CLSN2681554 EMBL:AC069328
            EMBL:BT010886 EMBL:AK230175 IPI:IPI00533261 RefSeq:NP_197356.2
            UniGene:At.22535 ProteinModelPortal:Q6NPN3 SMR:Q6NPN3 STRING:Q6NPN3
            PaxDb:Q6NPN3 PRIDE:Q6NPN3 EnsemblPlants:AT5G18550.1 GeneID:831973
            KEGG:ath:AT5G18550 TAIR:At5g18550 InParanoid:Q6NPN3 OMA:GSQPCAY
            PhylomeDB:Q6NPN3 Genevestigator:Q6NPN3 GermOnline:AT5G18550
            Uniprot:Q6NPN3
        Length = 465

 Score = 430 (156.4 bits), Expect = 2.0e-40, P = 2.0e-40
 Identities = 89/241 (36%), Positives = 129/241 (53%)

Query:     1 MRQDEKSCPYYMRTGSCKFGVACKFHHPQPSSLGTALPLTGNASLGSMGSSVLPSSGLQY 60
             +R  EK C Y+MRTG CKFG  C++HHP P   G   P        S G ++ PS   Q 
Sbjct:   144 LRPGEKECSYFMRTGQCKFGSTCRYHHPVPP--GVQAPSQQQQQQLSAGPTMYPSLQSQT 201

Query:    61 AGSLPTWSLQRA-PYL--SSRLQGTQSYMPLIVSPSQGIVPAPGWNTYMGNIGPL-SPTS 116
               S   + +  A P L   S +Q    Y  +++ P  G+VP  GWN Y  ++  + SP +
Sbjct:   202 VPSSQQYGVVLARPQLLPGSYVQSPYGYGQMVLPP--GMVPYSGWNPYQASVSAMPSPGT 259

Query:   117 --IAGSNLIYS----SRNQGDLGAGAQMHILSASSQNLPERPDQPDCRYYMNTGTCKYGA 170
                 G++ +Y     S +     +G     +S   Q  P+RP+QP+C+Y+M TG CK+G 
Sbjct:   260 QPSMGTSSVYGITPLSPSAPAYQSGPSSTGVSNKEQTFPQRPEQPECQYFMRTGDCKFGT 319

Query:   171 DCKFHHPKERIAQSAASNIGPLGLPSRPGQAICSNYSMYGICKFGPTCRFDHPYAGYPIN 230
              C+FHHP E  A   AS +  +GLP RPG   C++++ +GICKFGP C+FDH      ++
Sbjct:   320 SCRFHHPMEA-ASPEASTLSHIGLPLRPGAVPCTHFAQHGICKFGPACKFDHSLGSSSLS 378

Query:   231 Y 231
             Y
Sbjct:   379 Y 379

 Score = 218 (81.8 bits), Expect = 2.8e-23, Sum P(2) = 2.8e-23
 Identities = 37/85 (43%), Positives = 53/85 (62%)

Query:   141 LSASSQNLPERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQSAAS--NIGPLGLPSRP 198
             L   +   PER  QP C+++M TGTCK+GA CK+HHP++     + +  ++  +G P RP
Sbjct:    87 LRTEAGEFPERMGQPVCQHFMRTGTCKFGASCKYHHPRQGGGGDSVTPVSLNYMGFPLRP 146

Query:   199 GQAICSNYSMYGICKFGPTCRFDHP 223
             G+  CS +   G CKFG TCR+ HP
Sbjct:   147 GEKECSYFMRTGQCKFGSTCRYHHP 171

 Score = 215 (80.7 bits), Expect = 8.8e-16, P = 8.8e-16
 Identities = 34/81 (41%), Positives = 47/81 (58%)

Query:   146 QNLPERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQSAASNIGPLGLPSRPGQAICSN 205
             +  PERPD+PDC YY+ TG C YG+ C+F+HP+ R              P R GQ +C +
Sbjct:    46 ETFPERPDEPDCIYYLRTGVCGYGSRCRFNHPRNRAPVLGGLRTEAGEFPERMGQPVCQH 105

Query:   206 YSMYGICKFGPTCRFDHPYAG 226
             +   G CKFG +C++ HP  G
Sbjct:   106 FMRTGTCKFGASCKYHHPRQG 126

 Score = 151 (58.2 bits), Expect = 5.7e-14, Sum P(2) = 5.7e-14
 Identities = 49/194 (25%), Positives = 75/194 (38%)

Query:   119 GSNLIYSSRNQGDLGAGAQMHILSASSQNLPERPDQPDCRYYMNTGTCKYGADCKFHHPK 178
             G++  Y    QG  G G  +  +S +    P RP + +C Y+M TG CK+G+ C++HHP 
Sbjct:   115 GASCKYHHPRQG--GGGDSVTPVSLNYMGFPLRPGEKECSYFMRTGQCKFGSTCRYHHPV 172

Query:   179 ERIAQSAAS------NIGPLGLPSRPGQAICSNYSMYGICKFGPTCRFDHPYAGYPINYG 232
                 Q+ +       + GP   PS   Q + S+   YG+    P       Y   P  YG
Sbjct:   173 PPGVQAPSQQQQQQLSAGPTMYPSLQSQTVPSS-QQYGVVLARPQL-LPGSYVQSPYGYG 230

Query:   233 XXXXXXXXXXXXXMN-HQA-ISATHSIETSPD-ASSKIPNWVQNSDAVSVQHQNPDMKNS 289
                           N +QA +SA  S  T P   +S +      S +       P     
Sbjct:   231 QMVLPPGMVPYSGWNPYQASVSAMPSPGTQPSMGTSSVYGITPLSPSAPAYQSGPSSTGV 290

Query:   290 TTKNSDDSSKVDHP 303
             + K      + + P
Sbjct:   291 SNKEQTFPQRPEQP 304

 Score = 97 (39.2 bits), Expect = 2.8e-23, Sum P(2) = 2.8e-23
 Identities = 15/28 (53%), Positives = 20/28 (71%)

Query:     2 RQDEKSCPYYMRTGSCKFGVACKFHHPQ 29
             R DE  C YY+RTG C +G  C+F+HP+
Sbjct:    51 RPDEPDCIYYLRTGVCGYGSRCRFNHPR 78


>TAIR|locus:2081066 [details] [associations]
            symbol:AT3G06410 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA;ISS] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=RCA] InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103
            SMART:SM00356 GO:GO:0005634 EMBL:CP002686 GenomeReviews:BA000014_GR
            GO:GO:0046872 GO:GO:0003677 GO:GO:0008270 EMBL:AC011623
            eggNOG:NOG312935 HOGENOM:HOG000237733 EMBL:AK230312 EMBL:AK230438
            IPI:IPI00535086 RefSeq:NP_187292.2 UniGene:At.27771
            ProteinModelPortal:Q9SQU4 SMR:Q9SQU4 EnsemblPlants:AT3G06410.1
            GeneID:819815 KEGG:ath:AT3G06410 TAIR:At3g06410 InParanoid:Q9SQU4
            OMA:SSQQYGL PhylomeDB:Q9SQU4 ProtClustDB:CLSN2681554
            Genevestigator:Q9SQU4 GermOnline:AT3G06410 Uniprot:Q9SQU4
        Length = 462

 Score = 399 (145.5 bits), Expect = 3.9e-37, P = 3.9e-37
 Identities = 86/235 (36%), Positives = 129/235 (54%)

Query:     1 MRQDEKSCPYYMRTGSCKFGVACKFHHPQPSSL-GTAL-PLTGNASLGSMGSSVLPSSGL 58
             +R  EK C YY+RTG CKFG+ C+F+HP P ++ G    P            ++ P+   
Sbjct:   147 LRPGEKECSYYLRTGQCKFGLTCRFNHPVPLAVQGPPQQPQQQQPQPQPQLQTIYPTLQS 206

Query:    59 QYAGSLPTWSL--QRAPYLS-SRLQGTQSYMPLIVSPSQGIVPAPGWNTYMGNIGPL-SP 114
             Q   S   + L   R  +L+ S LQ    Y P +V P  G+VP  GWN Y  ++  + SP
Sbjct:   207 QSIPSSQQYGLVLTRPSFLTGSYLQSP--YGPPMVLPP-GMVPYSGWNPYQASLSAMPSP 263

Query:   115 TS--IAGSNLIY-----SSRNQGDLGAGAQMHILSASSQNLPERPDQPDCRYYMNTGTCK 167
              +    GS+ IY     S       G    +   +++S+  P+RPDQP+C+Y+M TG CK
Sbjct:   264 GTQPSIGSSSIYGLTPLSPSATAYTGTYQSVPSSNSTSKEFPQRPDQPECQYFMRTGDCK 323

Query:   168 YGADCKFHHPKERIAQSAASNIGPLGLPSRPGQAICSNYSMYGICKFGPTCRFDH 222
             +G+ C++HHP + +       +  +GLP RPG A C++++ +GICKFGP C+FDH
Sbjct:   324 FGSSCRYHHPVDAVPPKTGIVLSSIGLPLRPGVAQCTHFAQHGICKFGPACKFDH 378

 Score = 222 (83.2 bits), Expect = 2.4e-24, Sum P(2) = 2.4e-24
 Identities = 39/78 (50%), Positives = 52/78 (66%)

Query:   148 LPERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQSAAS--NIGPLGLPSRPGQAICSN 205
             LPER   P C+++M TGTCK+GA CK+HHP++     + +  ++  LG P RPG+  CS 
Sbjct:    97 LPERMGHPVCQHFMRTGTCKFGASCKYHHPRQGGGGGSVAPVSLSYLGYPLRPGEKECSY 156

Query:   206 YSMYGICKFGPTCRFDHP 223
             Y   G CKFG TCRF+HP
Sbjct:   157 YLRTGQCKFGLTCRFNHP 174

 Score = 220 (82.5 bits), Expect = 1.2e-16, P = 1.2e-16
 Identities = 37/82 (45%), Positives = 52/82 (63%)

Query:   146 QNLPERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQSAASNIGPLG-LPSRPGQAICS 204
             ++ PERPD+PDC YY+ TG C YG+ C+F+HP++R A       G  G LP R G  +C 
Sbjct:    49 ESYPERPDEPDCIYYLRTGVCGYGSRCRFNHPRDRGAVIGGVR-GEAGALPERMGHPVCQ 107

Query:   205 NYSMYGICKFGPTCRFDHPYAG 226
             ++   G CKFG +C++ HP  G
Sbjct:   108 HFMRTGTCKFGASCKYHHPRQG 129

 Score = 124 (48.7 bits), Expect = 2.0e-16, Sum P(3) = 2.0e-16
 Identities = 23/59 (38%), Positives = 33/59 (55%)

Query:   119 GSNLIYSSRNQGDLGAGAQMHILSASSQNLPERPDQPDCRYYMNTGTCKYGADCKFHHP 177
             G++  Y    QG  G G  +  +S S    P RP + +C YY+ TG CK+G  C+F+HP
Sbjct:   118 GASCKYHHPRQG--GGGGSVAPVSLSYLGYPLRPGEKECSYYLRTGQCKFGLTCRFNHP 174

 Score = 116 (45.9 bits), Expect = 0.00051, P = 0.00051
 Identities = 39/129 (30%), Positives = 58/129 (44%)

Query:     2 RQDEKSCPYYMRTGSCKFGVACKFHHPQPSSLGTALPLTGNASLGSMGSSVLPS-SGLQY 60
             R D+  C Y+MRTG CKFG +C++HHP    +    P TG   L S+G  + P  +   +
Sbjct:   307 RPDQPECQYFMRTGDCKFGSSCRYHHP----VDAVPPKTGIV-LSSIGLPLRPGVAQCTH 361

Query:    61 AGSLPTWSLQRAPYLSSRLQGTQSYMPLIVSPSQGIVPAPGWNTYMGNIGPLSPTSIAGS 120
                        A      +  + SY P   S +   V AP    Y     P+  +S++GS
Sbjct:   362 FAQHGICKFGPACKFDHSMSSSLSYSPSASSLTDMPV-AP----Y-----PIGSSSLSGS 411

Query:   121 NLIYSSRNQ 129
             +   SS N+
Sbjct:   412 SAPVSSSNE 420

 Score = 109 (43.4 bits), Expect = 1.8e-08, Sum P(2) = 1.8e-08
 Identities = 22/54 (40%), Positives = 31/54 (57%)

Query:     2 RQDEKSCPYYMRTGSCKFGVACKFHHP-QPSSLGTALPLTGNASLGSMGSSVLP 54
             R     C ++MRTG+CKFG +CK+HHP Q    G+  P+    SL  +G  + P
Sbjct:   100 RMGHPVCQHFMRTGTCKFGASCKYHHPRQGGGGGSVAPV----SLSYLGYPLRP 149

 Score = 97 (39.2 bits), Expect = 2.4e-24, Sum P(2) = 2.4e-24
 Identities = 15/28 (53%), Positives = 20/28 (71%)

Query:     2 RQDEKSCPYYMRTGSCKFGVACKFHHPQ 29
             R DE  C YY+RTG C +G  C+F+HP+
Sbjct:    54 RPDEPDCIYYLRTGVCGYGSRCRFNHPR 81

 Score = 91 (37.1 bits), Expect = 2.0e-16, Sum P(3) = 2.0e-16
 Identities = 18/47 (38%), Positives = 22/47 (46%)

Query:   186 ASNIGPLGLPSRPGQAICSNYSMYGICKFGPTCRFDHPYAGYPINYG 232
             +SN      P RP Q  C  +   G CKFG +CR+ HP    P   G
Sbjct:   296 SSNSTSKEFPQRPDQPECQYFMRTGDCKFGSSCRYHHPVDAVPPKTG 342


>TAIR|locus:2075477 [details] [associations]
            symbol:ZFN1 "zinc finger protein 1" species:3702
            "Arabidopsis thaliana" [GO:0003676 "nucleic acid binding"
            evidence=IEA;ISS] [GO:0005634 "nucleus" evidence=ISM] [GO:0008270
            "zinc ion binding" evidence=IEA] [GO:0003677 "DNA binding"
            evidence=TAS] [GO:0004518 "nuclease activity" evidence=TAS]
            [GO:0005829 "cytosol" evidence=IDA] [GO:0010313 "phytochrome
            binding" evidence=IPI] [GO:0017148 "negative regulation of
            translation" evidence=IMP] [GO:0048027 "mRNA 5'-UTR binding"
            evidence=IPI] InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103
            SMART:SM00356 GO:GO:0005829 GO:GO:0005634 EMBL:CP002686
            GenomeReviews:BA000014_GR GO:GO:0017148 GO:GO:0046872 GO:GO:0003677
            GO:GO:0008270 GO:GO:0048027 GO:GO:0004518 HOGENOM:HOG000237733
            EMBL:AF138743 EMBL:AC018363 EMBL:AK117978 EMBL:BT025966
            IPI:IPI00539955 PIR:T48874 RefSeq:NP_566183.1 UniGene:At.23706
            ProteinModelPortal:Q8GXX7 SMR:Q8GXX7 STRING:Q8GXX7 PaxDb:Q8GXX7
            PRIDE:Q8GXX7 EnsemblPlants:AT3G02830.1 GeneID:821230
            KEGG:ath:AT3G02830 GeneFarm:4898 TAIR:At3g02830 eggNOG:NOG329662
            InParanoid:Q8GXX7 OMA:SSDDQQR PhylomeDB:Q8GXX7
            ProtClustDB:CLSN2917075 Genevestigator:Q8GXX7 GermOnline:AT3G02830
            Uniprot:Q8GXX7
        Length = 397

 Score = 381 (139.2 bits), Expect = 3.1e-35, P = 3.1e-35
 Identities = 85/225 (37%), Positives = 116/225 (51%)

Query:     1 MRQDEKSCPYYMRTGSCKFGVACKFHHPQPSSLGTALPLTGNASLGSMGSSVLPSSGLQY 60
             +R +E  C Y++RTG CKFG  CKF+HPQP      +P +G  S     +S + S   Q 
Sbjct:   130 LRSNEVDCAYFLRTGHCKFGGTCKFNHPQPQPTNMMVPTSGQQSYPWSRASFIASPRWQD 189

Query:    61 AGSLPTWSLQRAPYLSSRLQGTQSYMPLI--VSPSQGIVPAPGWNTYMGNIGPLSPTSIA 118
               S  +  +   P     +QG   Y   +  VSPS G      +     N    S +   
Sbjct:   190 PSSYASLIM---PQGVVPVQGWNPYSGQLGSVSPS-GTGNDQNYRNLQQNETIESGSQSQ 245

Query:   119 GSNLIYSSRNQGDLGAGAQMHILSASSQNLPERPDQPDCRYYMNTGTCKYGADCKFHHPK 178
             GS   +S  N G        + L   +   PERP QP+C++YM TG CK+G  CKFHHP+
Sbjct:   246 GS---FSGYNPGSSVPLGGYYALPRENV-FPERPGQPECQFYMKTGDCKFGTVCKFHHPR 301

Query:   179 ERIAQSAASNIGPLGLPSRPGQAICSNYSMYGICKFGPTCRFDHP 223
             +R A      +  +GLP RPG+ +C  Y+ YGICKFGP+C+FDHP
Sbjct:   302 DRQAPPPDCLLSSIGLPLRPGEPLCVFYTRYGICKFGPSCKFDHP 346

 Score = 220 (82.5 bits), Expect = 1.1e-24, Sum P(2) = 1.1e-24
 Identities = 35/75 (46%), Positives = 48/75 (64%)

Query:   149 PERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQSAASNIGPLGLPSRPGQAICSNYSM 208
             PER  QP+C YY+ TGTCK+G  CKFHHP+ +   +   ++  LG P R  +  C+ +  
Sbjct:    83 PERIGQPECEYYLKTGTCKFGVTCKFHHPRNKAGIAGRVSLNMLGYPLRSNEVDCAYFLR 142

Query:   209 YGICKFGPTCRFDHP 223
              G CKFG TC+F+HP
Sbjct:   143 TGHCKFGGTCKFNHP 157

 Score = 206 (77.6 bits), Expect = 1.0e-14, P = 1.0e-14
 Identities = 38/90 (42%), Positives = 54/90 (60%)

Query:   137 QMHILSASSQ---NLPERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQSAASNIGPLG 193
             QM++ S  +    + PERP +PDC YY+ TG C++G+ C+F+HP++R    A + +    
Sbjct:    23 QMNLSSDETMETGSYPERPGEPDCSYYIRTGLCRFGSTCRFNHPRDRELVIATARMRG-E 81

Query:   194 LPSRPGQAICSNYSMYGICKFGPTCRFDHP 223
              P R GQ  C  Y   G CKFG TC+F HP
Sbjct:    82 YPERIGQPECEYYLKTGTCKFGVTCKFHHP 111

 Score = 169 (64.5 bits), Expect = 3.5e-10, P = 3.5e-10
 Identities = 66/234 (28%), Positives = 99/234 (42%)

Query:     2 RQDEKSCPYYMRTGSCKFGVACKFHHPQPSSLGTALPLTGNASLGSMGSSVLPSSGLQYA 61
             R  +  C YY++TG+CKFGV CKFHHP+ +  G A    G  SL  +G   L S+ +  A
Sbjct:    85 RIGQPECEYYLKTGTCKFGVTCKFHHPR-NKAGIA----GRVSLNMLGYP-LRSNEVDCA 138

Query:    62 GSLPTWSLQ---RAPYLSSRLQGTQSYMPLIVSPSQ-----GIVPAPGW---NTYMGNIG 110
               L T   +      +   + Q T   +P     S        + +P W   ++Y   I 
Sbjct:   139 YFLRTGHCKFGGTCKFNHPQPQPTNMMVPTSGQQSYPWSRASFIASPRWQDPSSYASLIM 198

Query:   111 PLSPTSIAGSNLIYSSRNQGDLGAGAQMHILSASSQNLPERPDQPDCRYYMNTGTCKYGA 170
             P     + G N  YS    G LG+ +     + + QN      Q +    + +G+   G+
Sbjct:   199 PQGVVPVQGWNP-YS----GQLGSVSPSG--TGNDQNY-RNLQQNET---IESGSQSQGS 247

Query:   171 DCKFHHPKERIAQSAASNIGPLGL-PSRPGQAICSNYSMYGICKFGPTCRFDHP 223
                ++ P   +       +    + P RPGQ  C  Y   G CKFG  C+F HP
Sbjct:   248 FSGYN-PGSSVPLGGYYALPRENVFPERPGQPECQFYMKTGDCKFGTVCKFHHP 300

 Score = 101 (40.6 bits), Expect = 1.1e-24, Sum P(2) = 1.1e-24
 Identities = 16/32 (50%), Positives = 21/32 (65%)

Query:     2 RQDEKSCPYYMRTGSCKFGVACKFHHPQPSSL 33
             R  E  C YY+RTG C+FG  C+F+HP+   L
Sbjct:    40 RPGEPDCSYYIRTGLCRFGSTCRFNHPRDREL 71


>TAIR|locus:2171407
            symbol:ZFN3 "zinc finger nuclease 3" species:3702
            "Arabidopsis thaliana" [GO:0003676 "nucleic acid binding"
            evidence=IEA;ISS] [GO:0005634 "nucleus" evidence=ISM] [GO:0008270
            "zinc ion binding" evidence=IEA] [GO:0003677 "DNA binding"
            evidence=TAS] [GO:0004518 "nuclease activity" evidence=TAS]
            InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103 SMART:SM00356
            GO:GO:0005634 EMBL:CP002688 GenomeReviews:BA000015_GR GO:GO:0046872
            GO:GO:0003677 GO:GO:0008270 EMBL:AB005242 GO:GO:0004518
            HOGENOM:HOG000237733 EMBL:AF138872 EMBL:AY084634 EMBL:AY128342
            EMBL:BT000014 EMBL:BX831982 IPI:IPI00516322 IPI:IPI00528450
            IPI:IPI00528912 RefSeq:NP_568332.2 RefSeq:NP_851041.1
            RefSeq:NP_974790.1 UniGene:At.21711 ProteinModelPortal:Q8L7N8
            SMR:Q8L7N8 EnsemblPlants:AT5G16540.1 GeneID:831516
            KEGG:ath:AT5G16540 GeneFarm:4900 TAIR:At5g16540 eggNOG:NOG281021
            InParanoid:Q8L7N8 OMA:SAGNQGM PhylomeDB:Q8L7N8
            ProtClustDB:CLSN2690167 Genevestigator:Q8L7N8 Uniprot:Q8L7N8
        Length = 375

 Score = 353 (129.3 bits), Expect = 2.9e-32, P = 2.9e-32
 Identities = 82/244 (33%), Positives = 125/244 (51%)

Query:     2 RQDEKSCPYYMRTGSCKFGVACKFHHPQPSSLGTALPLTGNASLGSMGSSVLPSSG-LQY 60
             R  +  C +Y++TG+CKFGV CKFHHP+ +  G    + G+ S+  +   + P+     Y
Sbjct:    87 RIGQPECEFYLKTGTCKFGVTCKFHHPR-NKAG----IDGSVSVNVLSYPLRPNEDDCSY 141

Query:    61 ---AGSLP---TWSLQRAPYLSSRLQGTQSYMPLIVSPSQGIV--PAPGWN--TYMGNIG 110
                 G      T         S+ L  +    P + S  Q +   P+  W+  +++ N  
Sbjct:   142 FLRIGQCKFGGTCKFNHPQTQSTNLMVSVRGSP-VYSALQSLTGQPSYSWSRTSFVANPP 200

Query:   111 PLS-PTSIAGSNL--IYSSRNQGDLGAGAQMHILSASSQNL-PERPDQPDCRYYMNTGTC 166
              L  P+  A  +   ++SS      G    +   +   +N+ PERP QP+C++YM TG C
Sbjct:   201 RLQDPSGFASGSQGGLFSSGFHS--GNSVPLGFYALPRENVFPERPGQPECQFYMKTGDC 258

Query:   167 KYGADCKFHHPKERIAQSAASNIGPLGLPSRPGQAICSNYSMYGICKFGPTCRFDHPYAG 226
             K+G  CKFHHP++R        +  +GLP RPG+ +C  YS YGICKFGP+C+FDHP   
Sbjct:   259 KFGTVCKFHHPRDRQTPPPDCVLSSVGLPLRPGEPLCVFYSRYGICKFGPSCKFDHPMRV 318

Query:   227 YPIN 230
             +  N
Sbjct:   319 FTYN 322

 Score = 222 (83.2 bits), Expect = 2.0e-25, Sum P(2) = 2.0e-25
 Identities = 35/75 (46%), Positives = 48/75 (64%)

Query:   149 PERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQSAASNIGPLGLPSRPGQAICSNYSM 208
             PER  QP+C +Y+ TGTCK+G  CKFHHP+ +     + ++  L  P RP +  CS +  
Sbjct:    85 PERIGQPECEFYLKTGTCKFGVTCKFHHPRNKAGIDGSVSVNVLSYPLRPNEDDCSYFLR 144

Query:   209 YGICKFGPTCRFDHP 223
              G CKFG TC+F+HP
Sbjct:   145 IGQCKFGGTCKFNHP 159

 Score = 195 (73.7 bits), Expect = 2.2e-13, P = 2.2e-13
 Identities = 35/75 (46%), Positives = 45/75 (60%)

Query:   149 PERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQSAASNIGPLGLPSRPGQAICSNYSM 208
             PER  +PDC YY+ TG C++G+ C+F+HP +R    A + I     P R GQ  C  Y  
Sbjct:    40 PERHGEPDCAYYIRTGLCRFGSTCRFNHPHDRKLVIATARIKG-EYPERIGQPECEFYLK 98

Query:   209 YGICKFGPTCRFDHP 223
              G CKFG TC+F HP
Sbjct:    99 TGTCKFGVTCKFHHP 113

 Score = 188 (71.2 bits), Expect = 1.6e-12, P = 1.6e-12
 Identities = 63/200 (31%), Positives = 88/200 (44%)

Query:     1 MRQDEKSCPYYMRTGSCKFGVACKFHHPQPSSLGTALPLTGNASLGSMGSSVLPSSGLQY 60
             +R +E  C Y++R G CKFG  CKF+HPQ  S         N  +   GS V   S LQ 
Sbjct:   132 LRPNEDDCSYFLRIGQCKFGGTCKFNHPQTQST--------NLMVSVRGSPVY--SALQS 181

Query:    61 AGSLPTWSLQRAPYLSS--RLQGTQSYMPLIVSPSQGIVPAPGWNTYMGNIGPLSPTSIA 118
                 P++S  R  ++++  RLQ    +     S SQG + + G+++  GN  PL   ++ 
Sbjct:   182 LTGQPSYSWSRTSFVANPPRLQDPSGF----ASGSQGGLFSSGFHS--GNSVPLGFYALP 235

Query:   119 GSNLIYSSRNQ---------GD--LGAGAQMHI----------LSASSQNLPERPDQPDC 157
               N+      Q         GD   G   + H              SS  LP RP +P C
Sbjct:   236 RENVFPERPGQPECQFYMKTGDCKFGTVCKFHHPRDRQTPPPDCVLSSVGLPLRPGEPLC 295

Query:   158 RYYMNTGTCKYGADCKFHHP 177
              +Y   G CK+G  CKF HP
Sbjct:   296 VFYSRYGICKFGPSCKFDHP 315

 Score = 101 (40.6 bits), Expect = 2.0e-25, Sum P(2) = 2.0e-25
 Identities = 16/32 (50%), Positives = 20/32 (62%)

Query:     2 RQDEKSCPYYMRTGSCKFGVACKFHHPQPSSL 33
             R  E  C YY+RTG C+FG  C+F+HP    L
Sbjct:    42 RHGEPDCAYYIRTGLCRFGSTCRFNHPHDRKL 73


>TAIR|locus:2087775
            symbol:HUA1 "ENHANCER OF AG-4 1" species:3702
            "Arabidopsis thaliana" [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0005634 "nucleus" evidence=ISM;IDA;TAS]
            [GO:0008270 "zinc ion binding" evidence=IEA] [GO:0001709 "cell fate
            determination" evidence=TAS] [GO:0003723 "RNA binding"
            evidence=ISS;IDA] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=RCA] InterPro:IPR000571 Pfam:PF00642
            PROSITE:PS50103 SMART:SM00356 GO:GO:0005634 EMBL:CP002686
            GenomeReviews:BA000014_GR GO:GO:0046872 GO:GO:0003677 GO:GO:0016607
            GO:GO:0008270 GO:GO:0006397 GO:GO:0003723 GO:GO:0009908
            EMBL:AB024033 GO:GO:0001709 EMBL:AY024357 EMBL:AC069474
            EMBL:AK229145 IPI:IPI00536814 RefSeq:NP_187874.2 UniGene:At.5670
            ProteinModelPortal:Q941Q3 SMR:Q941Q3 STRING:Q941Q3 PaxDb:Q941Q3
            PRIDE:Q941Q3 EnsemblPlants:AT3G12680.1 GeneID:820448
            KEGG:ath:AT3G12680 TAIR:At3g12680 eggNOG:NOG250655
            HOGENOM:HOG000078745 InParanoid:Q941Q3 OMA:LGAHNTI PhylomeDB:Q941Q3
            ProtClustDB:CLSN2690537 Genevestigator:Q941Q3 Uniprot:Q941Q3
        Length = 524

 Score = 295 (108.9 bits), Expect = 1.5e-25, P = 1.5e-25
 Identities = 83/248 (33%), Positives = 116/248 (46%)

Query:     2 RQDEKSCPYYMRTGSCKFGVACKFHHPQ----PSS---LGTALPLTGNASLGSMGSSVLP 54
             R  E  C +YM+TG CKFG++CKFHHP+    PSS   +G+++ LT      +       
Sbjct:   268 RPSEPMCTFYMKTGKCKFGLSCKFHHPKDIQLPSSSQDIGSSVGLTSEPDATNNPHVTFT 327

Query:    55 SSGLQYAGSLPTWSLQ-RAP-YL---SSRLQGTQSYMPLIVSPSQ-GIVP-APGWNTYM- 106
              +    +  LP  S +   P YL   S +   T  Y      P +   +P A G N  + 
Sbjct:   328 PALYHNSKGLPVRSGEVDCPFYLKTGSCKYGATCRYN----HPERTAFIPQAAGVNYSLV 383

Query:   107 -GNIGPLSPTSIAGSNLIYSSRNQGDLGAGAQMHILSASSQNLPERPDQPDCRYYMNTGT 165
               N   L+   +  +   Y +  Q  LG      ++SA+    P+RP Q +C YYM TG 
Sbjct:   384 SSNTANLNLGLVTPATSFYQTLTQPTLG------VISAT---YPQRPGQSECDYYMKTGE 434

Query:   166 CKYGADCKFHHPKERIA-------QSAASNIGPLGLPSRPGQAICSNYSMYGICKFGPTC 218
             CK+G  CKFHHP +R++       Q     +   G P R G   C  Y   G CK+G TC
Sbjct:   435 CKFGERCKFHHPADRLSAMTKQAPQQPNVKLSLAGYPRREGALNCPYYMKTGTCKYGATC 494

Query:   219 RFDHPYAG 226
             +FDHP  G
Sbjct:   495 KFDHPPPG 502

 Score = 216 (81.1 bits), Expect = 1.8e-22, Sum P(2) = 1.8e-22
 Identities = 37/79 (46%), Positives = 50/79 (63%)

Query:   145 SQNLPERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQSAASNIGPLGLPSRPGQAICS 204
             ++  PERP +PDC YY+ T  CKYG+ CKF+HP+E  A S  +      LP RP + +C+
Sbjct:   219 NEEYPERPGEPDCPYYIKTQRCKYGSKCKFNHPREEAAVSVETQDS---LPERPSEPMCT 275

Query:   205 NYSMYGICKFGPTCRFDHP 223
              Y   G CKFG +C+F HP
Sbjct:   276 FYMKTGKCKFGLSCKFHHP 294

 Score = 162 (62.1 bits), Expect = 2.5e-15, Sum P(2) = 2.5e-15
 Identities = 30/60 (50%), Positives = 41/60 (68%)

Query:   141 LSASSQN-LPERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQSAASNIGP-LGLPSRP 198
             +S  +Q+ LPERP +P C +YM TG CK+G  CKFHHPK+    S++ +IG  +GL S P
Sbjct:   257 VSVETQDSLPERPSEPMCTFYMKTGKCKFGLSCKFHHPKDIQLPSSSQDIGSSVGLTSEP 316

 Score = 153 (58.9 bits), Expect = 4.2e-08, P = 4.2e-08
 Identities = 30/80 (37%), Positives = 42/80 (52%)

Query:   149 PERPDQPDCRYYMNTGTCKYGADCKFHHP----KERIAQSAASNIGPLG-LPSRPGQAIC 203
             P+R  + DC +YM T TCK+G  C+F HP    +  I     + + P    P RPG+  C
Sbjct:   172 PQRAGEKDCTHYMQTRTCKFGESCRFDHPIWVPEGGIPDWKEAPVVPNEEYPERPGEPDC 231

Query:   204 SNYSMYGICKFGPTCRFDHP 223
               Y     CK+G  C+F+HP
Sbjct:   232 PYYIKTQRCKYGSKCKFNHP 251

 Score = 117 (46.2 bits), Expect = 0.00047, P = 0.00047
 Identities = 17/32 (53%), Positives = 23/32 (71%)

Query:     2 RQDEKSCPYYMRTGSCKFGVACKFHHPQPSSL 33
             R+   +CPYYM+TG+CK+G  CKF HP P  +
Sbjct:   473 REGALNCPYYMKTGTCKYGATCKFDHPPPGEV 504

 Score = 116 (45.9 bits), Expect = 0.00061, P = 0.00061
 Identities = 25/80 (31%), Positives = 43/80 (53%)

Query:     1 MRQDEKSCPYYMRTGSCKFGVACKFHHPQ-----PSSLGTALPL-TGNASLGSMGSSVLP 54
             +R  E  CP+Y++TGSCK+G  C+++HP+     P + G    L + N +  ++G  V P
Sbjct:   339 VRSGEVDCPFYLKTGSCKYGATCRYNHPERTAFIPQAAGVNYSLVSSNTANLNLGL-VTP 397

Query:    55 SSGLQYAGSLPTWSLQRAPY 74
             ++      + PT  +  A Y
Sbjct:   398 ATSFYQTLTQPTLGVISATY 417

 Score = 99 (39.9 bits), Expect = 2.5e-15, Sum P(2) = 2.5e-15
 Identities = 15/28 (53%), Positives = 20/28 (71%)

Query:     2 RQDEKSCPYYMRTGSCKFGVACKFHHPQ 29
             R  E  CPYY++T  CK+G  CKF+HP+
Sbjct:   225 RPGEPDCPYYIKTQRCKYGSKCKFNHPR 252

 Score = 96 (38.9 bits), Expect = 1.8e-22, Sum P(2) = 1.8e-22
 Identities = 15/27 (55%), Positives = 20/27 (74%)

Query:     2 RQDEKSCPYYMRTGSCKFGVACKFHHP 28
             R  EK C +YM+T +CKFG +C+F HP
Sbjct:   174 RAGEKDCTHYMQTRTCKFGESCRFDHP 200


>TAIR|locus:1006230718
            symbol:AT1G48195 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA;ISS] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0008270 "zinc ion binding" evidence=IEA]
            InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103 SMART:SM00356
            EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0046872 GO:GO:0003677
            GO:GO:0008270 EMBL:AC023673 EMBL:BX818039 IPI:IPI00522286
            RefSeq:NP_973988.1 UniGene:At.38465 UniGene:At.63148
            ProteinModelPortal:Q3ECU8 SMR:Q3ECU8 EnsemblPlants:AT1G48195.1
            GeneID:2745816 KEGG:ath:AT1G48195 TAIR:At1g48195 eggNOG:NOG304278
            HOGENOM:HOG000107451 InParanoid:Q3ECU8 OMA:AICPHYS PhylomeDB:Q3ECU8
            ProtClustDB:CLSN2681286 Genevestigator:Q3ECU8 Uniprot:Q3ECU8
        Length = 82

 Score = 251 (93.4 bits), Expect = 1.9e-21, P = 1.9e-21
 Identities = 40/79 (50%), Positives = 51/79 (64%)

Query:   144 SSQNLPERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQSAASNIGPLGLPSRPGQAIC 203
             S +  PERP +P+C YY+ TG C    +CK+HHPK          +   GLP RPGQAIC
Sbjct:     2 SEEKFPERPGEPECSYYLRTGNCYLKQNCKYHHPKNITPSEPQCTLNDKGLPLRPGQAIC 61

Query:   204 SNYSMYGICKFGPTCRFDH 222
              +YS +GIC+ GPTC+FDH
Sbjct:    62 PHYSRFGICRSGPTCKFDH 80


>UNIPROTKB|C9K0K2
            symbol:CPSF4 "Cleavage and polyadenylation-specificity
            factor subunit 4" species:9606 "Homo sapiens" [GO:0003676 "nucleic
            acid binding" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103
            SMART:SM00356 GO:GO:0008270 GO:GO:0003676 HOGENOM:HOG000212457
            HGNC:HGNC:2327 EMBL:AC073063 IPI:IPI01014332
            ProteinModelPortal:C9K0K2 SMR:C9K0K2 STRING:C9K0K2
            Ensembl:ENST00000412686 ArrayExpress:C9K0K2 Bgee:C9K0K2
            Uniprot:C9K0K2
        Length = 112

 Score = 91 (37.1 bits), Expect = 6.4e-07, Sum P(2) = 6.4e-07
 Identities = 23/73 (31%), Positives = 32/73 (43%)

Query:   155 PDCRYYMNTGTCKYGADCKFHH--PKERIAQS-----AASNIGPLGLPSRPGQAICSNYS 207
             P+C +Y   G C    +C F H  P+ +I             GPL       + IC NY 
Sbjct:    41 PECYFYSKFGECS-NKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRHTRRVICVNY- 98

Query:   208 MYGICKFGPTCRF 220
             + G C  GP+C+F
Sbjct:    99 LVGFCPEGPSCKF 111

 Score = 48 (22.0 bits), Expect = 6.4e-07, Sum P(2) = 6.4e-07
 Identities = 9/20 (45%), Positives = 13/20 (65%)

Query:     8 CPYYMRTGSCKFGVACKFHH 27
             C +++R G CK G  C+F H
Sbjct:    15 CKHWLR-GLCKKGDQCEFLH 33


>ZFIN|ZDB-GENE-990806-20
            symbol:cth1 "cth1" species:7955 "Danio rerio"
            [GO:0008270 "zinc ion binding" evidence=IEA] [GO:0003676 "nucleic
            acid binding" evidence=IEA] InterPro:IPR000571 Pfam:PF00642
            PROSITE:PS50103 SMART:SM00356 ZFIN:ZDB-GENE-990806-20 GO:GO:0008270
            GO:GO:0003676 HSSP:P22893 GeneTree:ENSGT00530000063262
            EMBL:AL954709 EMBL:BC107984 EMBL:AJ249490 IPI:IPI00509714
            RefSeq:NP_571014.1 UniGene:Dr.621 SMR:Q9PU62 STRING:Q9PU62
            Ensembl:ENSDART00000101601 GeneID:30114 KEGG:dre:30114 CTD:30114
            HOGENOM:HOG000153347 HOVERGEN:HBG078993 InParanoid:Q9PU62 KO:K13056
            OMA:FTFSSQH NextBio:20806593 Uniprot:Q9PU62
        Length = 319

 Score = 116 (45.9 bits), Expect = 1.0e-06, Sum P(2) = 1.0e-06
 Identities = 37/136 (27%), Positives = 48/136 (35%)

Query:   138 MHILSASSQNLPERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQSAASNIGPLGLPS- 196
             +H L       P R + P CR +   G C +G  C F H +       A        PS 
Sbjct:   121 VHNLKEQRPIRPRRRNVP-CRTFRAFGVCPFGNRCHFLHVEGGSESDGAEEEQTWQPPSQ 179

Query:   197 ----RPGQAICSNYSMYGICKFGPTCRFDHPYAGYPINYGXXXXXXXXXXXXXMNHQAIS 252
                 +P  A+C  +S +G C +G  CRF H   G P                  N  +IS
Sbjct:   180 SQEWKPRGALCRTFSAFGFCLYGTRCRFQH---GLPNTIKGHNANHTSWPQQMTNGGSIS 236

Query:   253 ATHSIETSPDASSKIP 268
                   TSP   S  P
Sbjct:   237 PISDTCTSPSPPSSSP 252

 Score = 61 (26.5 bits), Expect = 1.0e-06, Sum P(2) = 1.0e-06
 Identities = 10/26 (38%), Positives = 14/26 (53%)

Query:     2 RQDEKSCPYYMRTGSCKFGVACKFHH 27
             R   + C  Y  TG+CK+   C+F H
Sbjct:    59 RYKTELCSRYAETGTCKYAERCQFAH 84

 Score = 46 (21.3 bits), Expect = 3.5e-05, Sum P(2) = 3.5e-05
 Identities = 8/20 (40%), Positives = 9/20 (45%)

Query:     8 CPYYMRTGSCKFGVACKFHH 27
             C  Y   G C +G  C F H
Sbjct:   103 CRTYHTAGYCVYGTRCLFVH 122


>POMBASE|SPAC227.08c
            symbol:yth1 "mRNA cleavage and polyadenylation
            specificity factor complex Yth1" species:4896 "Schizosaccharomyces
            pombe" [GO:0003723 "RNA binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=IDA] [GO:0005829 "cytosol" evidence=IDA]
            [GO:0005847 "mRNA cleavage and polyadenylation specificity factor
            complex" evidence=IDA] [GO:0006378 "mRNA polyadenylation"
            evidence=IC] [GO:0008270 "zinc ion binding" evidence=IEA]
            InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103 SMART:SM00356
            PomBase:SPAC227.08c GO:GO:0005829 EMBL:CU329670
            GenomeReviews:CU329670_GR GO:GO:0046872 GO:GO:0008270 GO:GO:0006378
            GO:GO:0003723 eggNOG:COG5084 GO:GO:0005847 HOGENOM:HOG000212457
            KO:K14404 OrthoDB:EOG4PG99D PIR:T50164 RefSeq:NP_592962.1
            ProteinModelPortal:Q9UTD1 SMR:Q9UTD1 STRING:Q9UTD1
            EnsemblFungi:SPAC227.08c.1 GeneID:2541506 KEGG:spo:SPAC227.08c
            NextBio:20802605 Uniprot:Q9UTD1
        Length = 170

 Score = 105 (42.0 bits), Expect = 1.4e-05, Sum P(2) = 1.4e-05
 Identities = 27/83 (32%), Positives = 37/83 (44%)

Query:   155 PDCRYYMNTGTCKYGADCKFHH--PKERIAQSAASNIG--PLGLPSRPGQAI-----CSN 205
             P C +Y   G C  G +C + H  P +++   A  N+G  PLG P   G+ +     C  
Sbjct:    80 PPCHFYAERGWCSNGEECLYLHLDPSKQVGVCAWYNMGFCPLG-PICRGKHVRKPRPCPK 138

Query:   206 YSMYGICKFGPTCRFDHPYAGYP 228
             Y + G C  GP C   HP    P
Sbjct:   139 Y-LAGFCPLGPNCPDAHPKHSEP 160

 Score = 47 (21.6 bits), Expect = 1.4e-05, Sum P(2) = 1.4e-05
 Identities = 9/20 (45%), Positives = 12/20 (60%)

Query:     8 CPYYMRTGSCKFGVACKFHH 27
             C +++R G CK G  C F H
Sbjct:    54 CKHWLR-GLCKKGEQCDFLH 72


>UNIPROTKB|I3LCK9
            symbol:LOC100738395 "Uncharacterized protein" species:9823
            "Sus scrofa" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] InterPro:IPR000571
            InterPro:IPR001878 Pfam:PF00098 Pfam:PF00642 PROSITE:PS50103
            PROSITE:PS50158 SMART:SM00343 SMART:SM00356 GO:GO:0046872
            GO:GO:0008270 GO:GO:0003676 Gene3D:4.10.60.10 SUPFAM:SSF57756
            GeneTree:ENSGT00390000009627 OMA:PLDQVTC EMBL:FP103031
            Ensembl:ENSSSCT00000031676 Uniprot:I3LCK9
        Length = 243

 Score = 104 (41.7 bits), Expect = 2.7e-05, Sum P(3) = 2.7e-05
 Identities = 26/82 (31%), Positives = 36/82 (43%)

Query:   155 PDCRYYMNTGTCKYGADCKFHH--PKERIAQS-----AASNIGPLGLPSRPGQAICSNYS 207
             P+C +Y   G C    +C F H  P+ +I             GPL       + IC NY 
Sbjct:    68 PECYFYSKFGECS-NKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRHTRRVICVNY- 125

Query:   208 MYGICKFGPTCRFDHPYAGYPI 229
             + G C  GP+C+F HP    P+
Sbjct:   126 LVGFCPEGPSCKFMHPRFELPM 147

 Score = 48 (22.0 bits), Expect = 2.7e-05, Sum P(3) = 2.7e-05
 Identities = 9/20 (45%), Positives = 13/20 (65%)

Query:     8 CPYYMRTGSCKFGVACKFHH 27
             C +++R G CK G  C+F H
Sbjct:    42 CKHWLR-GLCKKGDQCEFLH 60

 Score = 39 (18.8 bits), Expect = 2.7e-05, Sum P(3) = 2.7e-05
 Identities = 8/19 (42%), Positives = 12/19 (63%)

Query:   301 DHPPHSVPNCSEPPHDQSN 319
             + PP  +P  ++PP  QSN
Sbjct:   151 EQPP--LPQQTQPPAKQSN 167


>UNIPROTKB|F1REX3
            symbol:LOC100518830 "Uncharacterized protein" species:9823
            "Sus scrofa" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] InterPro:IPR000571
            InterPro:IPR001878 Pfam:PF00098 Pfam:PF00642 PROSITE:PS50103
            PROSITE:PS50158 SMART:SM00343 SMART:SM00356 GO:GO:0046872
            GO:GO:0008270 GO:GO:0003676 Gene3D:4.10.60.10 SUPFAM:SSF57756
            GeneTree:ENSGT00390000009627 KO:K14404 EMBL:FP102617
            RefSeq:XP_003124350.1 Ensembl:ENSSSCT00000008355 GeneID:100518830
            KEGG:ssc:100518830 OMA:MQDIVAS Uniprot:F1REX3
        Length = 269

 Score = 105 (42.0 bits), Expect = 3.7e-05, Sum P(3) = 3.7e-05
 Identities = 26/82 (31%), Positives = 37/82 (45%)

Query:   155 PDCRYYMNTGTCKYGADCKFHH--PKERIAQ-----SAASNIGPLGLPSRPGQAICSNYS 207
             P+C +Y   G C    +C F H  P+ +I       +     GPL       + IC NY 
Sbjct:    94 PECYFYSKFGECS-NKECPFLHIDPESKIKDCPWYDARFCKHGPLCRHRHTRRVICVNY- 151

Query:   208 MYGICKFGPTCRFDHPYAGYPI 229
             + G C  GP+C+F HP    P+
Sbjct:   152 LVGFCPEGPSCKFMHPRFELPM 173

 Score = 48 (22.0 bits), Expect = 3.7e-05, Sum P(3) = 3.7e-05
 Identities = 9/20 (45%), Positives = 13/20 (65%)

Query:     8 CPYYMRTGSCKFGVACKFHH 27
             C +++R G CK G  C+F H
Sbjct:    68 CKHWLR-GLCKKGDQCEFLH 86

 Score = 39 (18.8 bits), Expect = 3.7e-05, Sum P(3) = 3.7e-05
 Identities = 8/19 (42%), Positives = 12/19 (63%)

Query:   301 DHPPHSVPNCSEPPHDQSN 319
             + PP  +P  ++PP  QSN
Sbjct:   177 EQPP--LPQQTQPPAKQSN 193


>WB|WBGene00013319
            symbol:ccch-5 species:6239 "Caenorhabditis elegans"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0000003 "reproduction" evidence=IMP]
            [GO:0009792 "embryo development ending in birth or egg hatching"
            evidence=IMP] InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103
            SMART:SM00356 GO:GO:0009792 EMBL:Z99281 GO:GO:0008270 GO:GO:0000003
            GO:GO:0003676 eggNOG:COG5063 GeneTree:ENSGT00530000063262
            PIR:T27239 RefSeq:NP_502805.1 ProteinModelPortal:O18251 SMR:O18251
            STRING:O18251 EnsemblMetazoa:Y57G11C.25 GeneID:178412
            KEGG:cel:CELE_Y57G11C.25 UCSC:Y57G11C.25 CTD:178412
            WormBase:Y57G11C.25 HOGENOM:HOG000114059 InParanoid:O18251
            NextBio:901036 Uniprot:O18251
        Length = 199

 Score = 118 (46.6 bits), Expect = 3.9e-05, P = 3.9e-05
 Identities = 24/66 (36%), Positives = 33/66 (50%)

Query:   157 CRYYMNTGTCKYGADCKFHHPKERIAQSAASNIGPLGLPSRPGQAICSNYSMYGICKFGP 216
             C+ +  T  C YG  CKF H  E + Q    N+G +  P      +C N+S  G CK+G 
Sbjct:    74 CKTFQLTKACSYGEQCKFAHSVEEL-QLKHQNLG-INNPKYK-TVLCDNFSTTGHCKYGT 130

Query:   217 TCRFDH 222
              C+F H
Sbjct:   131 KCQFIH 136


>UNIPROTKB|D4A905
            symbol:Cpsf4 "Cleavage and polyadenylation-specificity
            factor subunit 4" species:10116 "Rattus norvegicus" [GO:0003676
            "nucleic acid binding" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098
            Pfam:PF00642 PROSITE:PS50103 PROSITE:PS50158 SMART:SM00343
            SMART:SM00356 RGD:620440 GO:GO:0046872 GO:GO:0008270 GO:GO:0003676
            Gene3D:4.10.60.10 SUPFAM:SSF57756 GeneTree:ENSGT00390000009627
            OMA:PLDQVTC OrthoDB:EOG4KH2VQ IPI:IPI00358639
            Ensembl:ENSRNOT00000038958 Uniprot:D4A905
        Length = 243

 Score = 109 (43.4 bits), Expect = 4.6e-05, Sum P(2) = 4.6e-05
 Identities = 27/82 (32%), Positives = 37/82 (45%)

Query:   155 PDCRYYMNTGTCKYGADCKFHH--PKERIAQS-----AASNIGPLGLPSRPGQAICSNYS 207
             P+C +Y   G C    +C F H  P+ +I             GPL    R  + IC NY 
Sbjct:    94 PECYFYSKFGECS-NKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRRTRRVICVNY- 151

Query:   208 MYGICKFGPTCRFDHPYAGYPI 229
             + G C  GP+C+F HP    P+
Sbjct:   152 LVGFCPEGPSCKFMHPRFELPM 173

 Score = 48 (22.0 bits), Expect = 4.6e-05, Sum P(2) = 4.6e-05
 Identities = 9/20 (45%), Positives = 13/20 (65%)

Query:     8 CPYYMRTGSCKFGVACKFHH 27
             C +++R G CK G  C+F H
Sbjct:    68 CKHWLR-GLCKKGDQCEFLH 86


>UNIPROTKB|B7Z7B0
            symbol:CPSF4 "Cleavage and polyadenylation-specificity
            factor subunit 4" species:9606 "Homo sapiens" [GO:0003676 "nucleic
            acid binding" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098
            Pfam:PF00642 PROSITE:PS50103 PROSITE:PS50158 SMART:SM00343
            SMART:SM00356 GO:GO:0046872 GO:GO:0008270 GO:GO:0003676
            Gene3D:4.10.60.10 SUPFAM:SSF57756 HOGENOM:HOG000212457
            HOVERGEN:HBG051108 OrthoDB:EOG4KH2VQ UniGene:Hs.489287
            HGNC:HGNC:2327 EMBL:AC073063 EMBL:AK301745 IPI:IPI00924476
            SMR:B7Z7B0 STRING:B7Z7B0 Ensembl:ENST00000441580 UCSC:uc011kix.2
            Uniprot:B7Z7B0
        Length = 191

 Score = 104 (41.7 bits), Expect = 4.7e-05, Sum P(2) = 4.7e-05
 Identities = 26/82 (31%), Positives = 36/82 (43%)

Query:   155 PDCRYYMNTGTCKYGADCKFHH--PKERIAQS-----AASNIGPLGLPSRPGQAICSNYS 207
             P+C +Y   G C    +C F H  P+ +I             GPL       + IC NY 
Sbjct:    41 PECYFYSKFGECS-NKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRHTRRVICVNY- 98

Query:   208 MYGICKFGPTCRFDHPYAGYPI 229
             + G C  GP+C+F HP    P+
Sbjct:    99 LVGFCPEGPSCKFMHPRFELPM 120

 Score = 48 (22.0 bits), Expect = 4.7e-05, Sum P(2) = 4.7e-05
 Identities = 9/20 (45%), Positives = 13/20 (65%)

Query:     8 CPYYMRTGSCKFGVACKFHH 27
             C +++R G CK G  C+F H
Sbjct:    15 CKHWLR-GLCKKGDQCEFLH 33


>UNIPROTKB|O95639
            symbol:CPSF4 "Cleavage and polyadenylation specificity
            factor subunit 4" species:9606 "Homo sapiens" [GO:0008270 "zinc ion
            binding" evidence=IEA] [GO:0003723 "RNA binding" evidence=IEA]
            [GO:0006397 "mRNA processing" evidence=IEA] [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=IDA] [GO:0005654 "nucleoplasm" evidence=TAS] [GO:0019048
            "virus-host interaction" evidence=TAS] [GO:0019054 "modulation by
            virus of host cellular process" evidence=TAS] [GO:0019058 "viral
            infectious cycle" evidence=TAS] [GO:0046778 "modification by virus
            of host mRNA processing" evidence=TAS] [GO:0005634 "nucleus"
            evidence=IDA] [GO:0005730 "nucleolus" evidence=IDA] [GO:0005739
            "mitochondrion" evidence=IDA] InterPro:IPR000571 InterPro:IPR001878
            Pfam:PF00098 Pfam:PF00642 PROSITE:PS50103 PROSITE:PS50158
            SMART:SM00343 SMART:SM00356 GO:GO:0005739 Reactome:REACT_116125
            GO:GO:0046872 GO:GO:0008270 GO:GO:0006397 GO:GO:0003723
            EMBL:CH236956 EMBL:CH471091 GO:GO:0019058 Gene3D:4.10.60.10
            SUPFAM:SSF57756 eggNOG:COG5084 GO:GO:0005847 HOGENOM:HOG000212457
            HOVERGEN:HBG051108 CTD:10898 KO:K14404 OMA:PLDQVTC
            OrthoDB:EOG4KH2VQ EMBL:U79569 EMBL:CR542161 EMBL:EF191081
            EMBL:BC003101 EMBL:BC050738 IPI:IPI00009137 IPI:IPI00029707
            IPI:IPI00375469 RefSeq:NP_001075028.1 RefSeq:NP_006684.1
            UniGene:Hs.489287 PDB:2D9N PDB:2RHK PDBsum:2D9N PDBsum:2RHK
            ProteinModelPortal:O95639 SMR:O95639 DIP:DIP-48675N IntAct:O95639
            MINT:MINT-1429837 STRING:O95639 PhosphoSite:O95639 PaxDb:O95639
            PRIDE:O95639 DNASU:10898 Ensembl:ENST00000292476
            Ensembl:ENST00000436336 GeneID:10898 KEGG:hsa:10898 UCSC:uc003uqi.3
            UCSC:uc003uqj.3 UCSC:uc003uqk.3 GeneCards:GC07P099036
            HGNC:HGNC:2327 HPA:HPA049094 MIM:603052 neXtProt:NX_O95639
            PharmGKB:PA26844 InParanoid:O95639 PhylomeDB:O95639
            EvolutionaryTrace:O95639 GenomeRNAi:10898 NextBio:41385
            ArrayExpress:O95639 Bgee:O95639 CleanEx:HS_CPSF4
            Genevestigator:O95639 GermOnline:ENSG00000160917 GO:GO:0046778
            Uniprot:O95639
        Length = 269

 Score = 104 (41.7 bits), Expect = 5.1e-05, Sum P(3) = 5.1e-05
 Identities = 26/82 (31%), Positives = 36/82 (43%)

Query:   155 PDCRYYMNTGTCKYGADCKFHH--PKERIAQS-----AASNIGPLGLPSRPGQAICSNYS 207
             P+C +Y   G C    +C F H  P+ +I             GPL       + IC NY 
Sbjct:    94 PECYFYSKFGECS-NKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRHTRRVICVNY- 151

Query:   208 MYGICKFGPTCRFDHPYAGYPI 229
             + G C  GP+C+F HP    P+
Sbjct:   152 LVGFCPEGPSCKFMHPRFELPM 173

 Score = 48 (22.0 bits), Expect = 5.1e-05, Sum P(3) = 5.1e-05
 Identities = 9/20 (45%), Positives = 13/20 (65%)

Query:     8 CPYYMRTGSCKFGVACKFHH 27
             C +++R G CK G  C+F H
Sbjct:    68 CKHWLR-GLCKKGDQCEFLH 86

 Score = 39 (18.8 bits), Expect = 5.1e-05, Sum P(3) = 5.1e-05
 Identities = 8/19 (42%), Positives = 12/19 (63%)

Query:   301 DHPPHSVPNCSEPPHDQSN 319
             + PP  +P  ++PP  QSN
Sbjct:   177 EQPP--LPQQTQPPAKQSN 193


>UNIPROTKB|E1BV31
            symbol:CPSF4 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0008270 "zinc ion binding" evidence=IEA] [GO:0005739
            "mitochondrion" evidence=IEA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA]
            InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098 Pfam:PF00642
            PROSITE:PS50103 PROSITE:PS50158 SMART:SM00343 SMART:SM00356
            GO:GO:0046872 GO:GO:0008270 GO:GO:0003676 Gene3D:4.10.60.10
            SUPFAM:SSF57756 GO:GO:0005847 CTD:10898
            GeneTree:ENSGT00390000009627 KO:K14404 OMA:PLDQVTC
            EMBL:AADN02023770 IPI:IPI00572429 RefSeq:XP_414800.1
            UniGene:Gga.12217 Ensembl:ENSGALT00000007510 GeneID:416494
            KEGG:gga:416494 NextBio:20819939 Uniprot:E1BV31
        Length = 243

 Score = 108 (43.1 bits), Expect = 6.0e-05, Sum P(2) = 6.0e-05
 Identities = 27/82 (32%), Positives = 36/82 (43%)

Query:   155 PDCRYYMNTGTCKYGADCKFHH--PKERIAQS-----AASNIGPLGLPSRPGQAICSNYS 207
             P+C +Y   G C    +C F H  P+ +I             GPL       + IC NY 
Sbjct:    94 PECYFYSKFGECS-NKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRHTRRVICVNY- 151

Query:   208 MYGICKFGPTCRFDHPYAGYPI 229
             + G C  GPTC+F HP    P+
Sbjct:   152 LVGFCPEGPTCKFMHPRFELPM 173

 Score = 48 (22.0 bits), Expect = 6.0e-05, Sum P(2) = 6.0e-05
 Identities = 9/20 (45%), Positives = 13/20 (65%)

Query:     8 CPYYMRTGSCKFGVACKFHH 27
             C +++R G CK G  C+F H
Sbjct:    68 CKHWLR-GLCKKGDQCEFLH 86


>WB|WBGene00009537
            symbol:ccch-2 species:6239 "Caenorhabditis elegans"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0009792 "embryo development ending
            in birth or egg hatching" evidence=IMP] InterPro:IPR000571
            Pfam:PF00642 PROSITE:PS50103 SMART:SM00356 GO:GO:0009792
            GO:GO:0008270 GO:GO:0003676 eggNOG:COG5063
            GeneTree:ENSGT00530000063262 EMBL:Z82267 HOGENOM:HOG000114059
            PIR:T21961 RefSeq:NP_502931.1 ProteinModelPortal:O45491 SMR:O45491
            IntAct:O45491 STRING:O45491 EnsemblMetazoa:F38C2.5 GeneID:178454
            KEGG:cel:CELE_F38C2.5 UCSC:F38C2.5 CTD:178454 WormBase:F38C2.5
            InParanoid:O45491 NextBio:901202 Uniprot:O45491
        Length = 186

 Score = 113 (44.8 bits), Expect = 0.00011, P = 0.00011
 Identities = 24/66 (36%), Positives = 32/66 (48%)

Query:   157 CRYYMNTGTCKYGADCKFHHPKERIAQSAASNIGPLGLPSRPGQAICSNYSMYGICKFGP 216
             C+ +  T  C YG  CKF H  E + Q    N G +  P      +C N+S  G CK+G 
Sbjct:    78 CKTFQLTRACSYGEQCKFAHSVEEL-QLKQKNRG-VNHPKYK-TVLCDNFSRTGHCKYGT 134

Query:   217 TCRFDH 222
              C+F H
Sbjct:   135 KCQFIH 140


>ASPGD|ASPL0000062209
            symbol:AN0298 species:162425 "Emericella nidulans"
            [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006378 "mRNA
            polyadenylation" evidence=IEA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA]
            [GO:0003723 "RNA binding" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] InterPro:IPR000571 Pfam:PF00642
            PROSITE:PS50103 SMART:SM00356 GO:GO:0005634 EMBL:BN001308
            GO:GO:0046872 GO:GO:0008270 GO:GO:0006397 GO:GO:0003723
            eggNOG:COG5084 EMBL:AACD01000006 HOGENOM:HOG000212457 KO:K14404
            RefSeq:XP_657902.1 ProteinModelPortal:Q5BGN2 STRING:Q5BGN2
            EnsemblFungi:CADANIAT00002417 GeneID:2876077 KEGG:ani:AN0298.2
            OMA:DPDRPVC OrthoDB:EOG4PG99D Uniprot:Q5BGN2
        Length = 254

 Score = 94 (38.1 bits), Expect = 0.00013, Sum P(2) = 0.00013
 Identities = 21/76 (27%), Positives = 33/76 (43%)

Query:   155 PDCRYYMNTGTCKYGADCKFHHPKERIAQSAASN-------IGPLGLPSRPGQAICSNYS 207
             P+C+ +  +G C  G DC + H +E+       +       +GPL       + +C  Y 
Sbjct:   118 PECQSFSRSGYCPNGDDCLYQHVREQARLPPCEHYDQGFCPLGPLCAKRHVRRRLCPYY- 176

Query:   208 MYGICKFGPTCRFDHP 223
             + G C  GP C   HP
Sbjct:   177 VAGFCPEGPNCANAHP 192

 Score = 62 (26.9 bits), Expect = 0.00013, Sum P(2) = 0.00013
 Identities = 26/81 (32%), Positives = 33/81 (40%)

Query:     4 DEKSCPYYMRTGSCKFGVACKFHHPQPSSLGTALPL-TGNASLGSMGSSVLPS--SGLQY 60
             D   C  Y   G C  G AC   HP PS + T+    +G A   + GS V      GL  
Sbjct:    43 DVPVCKAYSE-GHCPLGPACPDRHPTPSRVTTSTTTASGLAPSTTHGSLVCKHFLKGLCK 101

Query:    61 AGS----LPTWSLQRAPYLSS 77
              G     L  ++L+R P   S
Sbjct:   102 KGMKCEYLHEYNLRRMPECQS 122


>UNIPROTKB|O19137
            symbol:CPSF4 "Cleavage and polyadenylation specificity
            factor subunit 4" species:9913 "Bos taurus" [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=ISS] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0006397
            "mRNA processing" evidence=IEA] [GO:0003723 "RNA binding"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098 Pfam:PF00642
            PROSITE:PS50103 PROSITE:PS50158 SMART:SM00343 SMART:SM00356
            GO:GO:0046872 GO:GO:0008270 GO:GO:0006397 GO:GO:0003723
            Gene3D:4.10.60.10 SUPFAM:SSF57756 eggNOG:COG5084 HSSP:P47974
            GO:GO:0005847 HOGENOM:HOG000212457 HOVERGEN:HBG051108 EMBL:U96448
            IPI:IPI00715166 RefSeq:NP_776367.1 UniGene:Bt.55595
            ProteinModelPortal:O19137 SMR:O19137 STRING:O19137
            Ensembl:ENSBTAT00000002701 GeneID:280875 KEGG:bta:280875 CTD:10898
            GeneTree:ENSGT00390000009627 InParanoid:O19137 KO:K14404
            OMA:PLDQVTC OrthoDB:EOG4KH2VQ NextBio:20805014 Uniprot:O19137
        Length = 243

 Score = 104 (41.7 bits), Expect = 0.00019, Sum P(2) = 0.00019
 Identities = 26/82 (31%), Positives = 36/82 (43%)

Query:   155 PDCRYYMNTGTCKYGADCKFHH--PKERIAQS-----AASNIGPLGLPSRPGQAICSNYS 207
             P+C +Y   G C    +C F H  P+ +I             GPL       + IC NY 
Sbjct:    94 PECYFYSKFGECS-NKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRHTRRVICVNY- 151

Query:   208 MYGICKFGPTCRFDHPYAGYPI 229
             + G C  GP+C+F HP    P+
Sbjct:   152 LVGFCPEGPSCKFMHPRFELPM 173

 Score = 48 (22.0 bits), Expect = 0.00019, Sum P(2) = 0.00019
 Identities = 9/20 (45%), Positives = 13/20 (65%)

Query:     8 CPYYMRTGSCKFGVACKFHH 27
             C +++R G CK G  C+F H
Sbjct:    68 CKHWLR-GLCKKGDQCEFLH 86


>RGD|620440
            symbol:Cpsf4 "cleavage and polyadenylation specific factor 4"
            species:10116 "Rattus norvegicus" [GO:0003723 "RNA binding"
            evidence=IEA] [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=ISS] [GO:0006397 "mRNA
            processing" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098
            Pfam:PF00642 PROSITE:PS50103 PROSITE:PS50158 SMART:SM00343
            SMART:SM00356 RGD:620440 GO:GO:0046872 GO:GO:0008270 GO:GO:0006397
            GO:GO:0003723 Gene3D:4.10.60.10 SUPFAM:SSF57756 eggNOG:COG5084
            HSSP:P47974 GO:GO:0005847 HOGENOM:HOG000212457 HOVERGEN:HBG051108
            CTD:10898 GeneTree:ENSGT00390000009627 KO:K14404 OrthoDB:EOG4KH2VQ
            EMBL:BC089824 IPI:IPI00553898 RefSeq:NP_001012351.1
            UniGene:Rn.104788 ProteinModelPortal:Q5FVR7 SMR:Q5FVR7
            Ensembl:ENSRNOT00000042474 GeneID:304277 KEGG:rno:304277
            InParanoid:Q5FVR7 NextBio:652764 ArrayExpress:Q5FVR7
            Genevestigator:Q5FVR7 GermOnline:ENSRNOG00000025217 Uniprot:Q5FVR7
        Length = 243

 Score = 104 (41.7 bits), Expect = 0.00019, Sum P(2) = 0.00019
 Identities = 26/82 (31%), Positives = 36/82 (43%)

Query:   155 PDCRYYMNTGTCKYGADCKFHH--PKERIAQS-----AASNIGPLGLPSRPGQAICSNYS 207
             P+C +Y   G C    +C F H  P+ +I             GPL       + IC NY 
Sbjct:    94 PECYFYSKFGECS-NKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRHTRRVICVNY- 151

Query:   208 MYGICKFGPTCRFDHPYAGYPI 229
             + G C  GP+C+F HP    P+
Sbjct:   152 LVGFCPEGPSCKFMHPRFELPM 173

 Score = 48 (22.0 bits), Expect = 0.00019, Sum P(2) = 0.00019
 Identities = 9/20 (45%), Positives = 13/20 (65%)

Query:     8 CPYYMRTGSCKFGVACKFHH 27
             C +++R G CK G  C+F H
Sbjct:    68 CKHWLR-GLCKKGDQCEFLH 86


>UNIPROTKB|H9KVA5
            symbol:CPSF4L "Putative cleavage and
            polyadenylation-specificity factor subunit 4-like protein"
            species:9606 "Homo sapiens" [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103 SMART:SM00356
            GO:GO:0008270 GO:GO:0003676 EMBL:AC087301 HGNC:HGNC:33632
            ProteinModelPortal:H9KVA5 SMR:H9KVA5 PRIDE:H9KVA5
            Ensembl:ENST00000397671 Bgee:H9KVA5 Uniprot:H9KVA5
        Length = 152

 Score = 91 (37.1 bits), Expect = 0.00020, Sum P(2) = 0.00020
 Identities = 23/76 (30%), Positives = 31/76 (40%)

Query:   155 PDCRYYMNTGTCKYGADCKFHHPKERIA--------QSAASNIGPLGLPSRPGQAICSNY 206
             P+C +Y   G C    +C F H K            Q    + GPL       + +C NY
Sbjct:    30 PECYFYSKFGDCS-NKECSFLHVKPAFKSQDCPWYDQGFCKDAGPLCKYRHVPRIMCLNY 88

Query:   207 SMYGICKFGPTCRFDH 222
              + G C  GP C+F H
Sbjct:    89 -LVGFCPEGPKCQFAH 103

 Score = 51 (23.0 bits), Expect = 0.00020, Sum P(2) = 0.00020
 Identities = 10/20 (50%), Positives = 13/20 (65%)

Query:     8 CPYYMRTGSCKFGVACKFHH 27
             C +++R G CK G  CKF H
Sbjct:     4 CKHWLR-GLCKKGDHCKFLH 22


>UNIPROTKB|C9JEV9
            symbol:CPSF4 "Cleavage and polyadenylation-specificity
            factor subunit 4" species:9606 "Homo sapiens" [GO:0003676 "nucleic
            acid binding" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IDA] [GO:0005730
            "nucleolus" evidence=IDA] [GO:0005739 "mitochondrion" evidence=IDA]
            InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098 Pfam:PF00642
            PROSITE:PS50103 PROSITE:PS50158 SMART:SM00343 SMART:SM00356
            GO:GO:0005739 GO:GO:0005634 GO:GO:0046872 GO:GO:0008270
            GO:GO:0003676 Gene3D:4.10.60.10 SUPFAM:SSF57756
            HOGENOM:HOG000212457 HGNC:HGNC:2327 EMBL:AC073063 IPI:IPI00927478
            ProteinModelPortal:C9JEV9 SMR:C9JEV9 STRING:C9JEV9
            Ensembl:ENST00000451876 ArrayExpress:C9JEV9 Bgee:C9JEV9
            Uniprot:C9JEV9
        Length = 211

 Score = 113 (44.8 bits), Expect = 0.00020, P = 0.00020
 Identities = 25/76 (32%), Positives = 36/76 (47%)

Query:   157 CRYYMNTGTCKYGADCKFHHPKERIAQSAA---SNIGPLGLPSRPGQAICSNYSMYGICK 213
             C++++  G CK G  C+F H  +          S  GPL       + IC NY + G C 
Sbjct:    68 CKHWLR-GLCKKGDQCEFLHEYDMTKMPECYFYSKFGPLCRHRHTRRVICVNY-LVGFCP 125

Query:   214 FGPTCRFDHPYAGYPI 229
              GP+C+F HP    P+
Sbjct:   126 EGPSCKFMHPRFELPM 141


>MGI|MGI:1861602
            symbol:Cpsf4 "cleavage and polyadenylation specific factor
            4" species:10090 "Mus musculus" [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0003723 "RNA binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=ISO]
            [GO:0006397 "mRNA processing" evidence=IEA] [GO:0008150
            "biological_process" evidence=ND] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0046872 "metal ion binding" evidence=IEA]
            InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098 Pfam:PF00642
            PROSITE:PS50103 PROSITE:PS50158 SMART:SM00343 SMART:SM00356
            MGI:MGI:1861602 GO:GO:0046872 GO:GO:0008270 GO:GO:0006397
            GO:GO:0003723 Gene3D:4.10.60.10 SUPFAM:SSF57756 eggNOG:COG5084
            GO:GO:0005847 HOGENOM:HOG000212457 HOVERGEN:HBG051108 CTD:10898
            GeneTree:ENSGT00390000009627 KO:K14404 OrthoDB:EOG4KH2VQ
            EMBL:AK046064 EMBL:AF033201 EMBL:BC057067 IPI:IPI00309761
            IPI:IPI00380450 IPI:IPI01027761 RefSeq:NP_848671.1
            UniGene:Mm.196884 ProteinModelPortal:Q8BQZ5 SMR:Q8BQZ5
            STRING:Q8BQZ5 PhosphoSite:Q8BQZ5 PaxDb:Q8BQZ5 PRIDE:Q8BQZ5
            Ensembl:ENSMUST00000070487 GeneID:54188 KEGG:mmu:54188
            UCSC:uc009amj.1 ChiTaRS:CPSF4 NextBio:311022 Bgee:Q8BQZ5
            CleanEx:MM_CPSF4 Genevestigator:Q8BQZ5
            GermOnline:ENSMUSG00000029625 Uniprot:Q8BQZ5
        Length = 211

 Score = 113 (44.8 bits), Expect = 0.00020, P = 0.00020
 Identities = 25/76 (32%), Positives = 36/76 (47%)

Query:   157 CRYYMNTGTCKYGADCKFHHPKERIAQSAA---SNIGPLGLPSRPGQAICSNYSMYGICK 213
             C++++  G CK G  C+F H  +          S  GPL       + IC NY + G C 
Sbjct:    68 CKHWLR-GLCKKGDQCEFLHEYDMTKMPECYFYSKFGPLCRHRHTRRVICVNY-LVGFCP 125

Query:   214 FGPTCRFDHPYAGYPI 229
              GP+C+F HP    P+
Sbjct:   126 EGPSCKFMHPRFELPM 141


>UNIPROTKB|E2RBK7
            symbol:CPSF4 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=IEA] [GO:0005739
            "mitochondrion" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
            InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098 Pfam:PF00642
            PROSITE:PS50103 PROSITE:PS50158 SMART:SM00343 SMART:SM00356
            GO:GO:0046872 GO:GO:0008270 GO:GO:0003676 Gene3D:4.10.60.10
            SUPFAM:SSF57756 GO:GO:0005847 GeneTree:ENSGT00390000009627
            EMBL:AAEX03004276 Ensembl:ENSCAFT00000023892 Uniprot:E2RBK7
        Length = 212

 Score = 113 (44.8 bits), Expect = 0.00021, P = 0.00021
 Identities = 25/76 (32%), Positives = 36/76 (47%)

Query:   157 CRYYMNTGTCKYGADCKFHHPKERIAQSAA---SNIGPLGLPSRPGQAICSNYSMYGICK 213
             C++++  G CK G  C+F H  +          S  GPL       + IC NY + G C 
Sbjct:    68 CKHWLR-GLCKKGDQCEFLHEYDMTKMPECYFYSKFGPLCRHRHTRRVICVNY-LVGFCP 125

Query:   214 FGPTCRFDHPYAGYPI 229
              GP+C+F HP    P+
Sbjct:   126 EGPSCKFMHPRFELPM 141


>UNIPROTKB|J9P398
            symbol:CPSF4 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] InterPro:IPR000571
            InterPro:IPR001878 Pfam:PF00098 Pfam:PF00642 PROSITE:PS50103
            PROSITE:PS50158 SMART:SM00343 SMART:SM00356 GO:GO:0046872
            GO:GO:0008270 GO:GO:0003676 Gene3D:4.10.60.10 SUPFAM:SSF57756
            CTD:10898 GeneTree:ENSGT00390000009627 KO:K14404 OMA:PLDQVTC
            EMBL:AAEX03004276 RefSeq:XP_850149.1 ProteinModelPortal:J9P398
            Ensembl:ENSCAFT00000043832 GeneID:489859 KEGG:cfa:489859
            Uniprot:J9P398
        Length = 269

 Score = 104 (41.7 bits), Expect = 0.00029, Sum P(2) = 0.00029
 Identities = 26/82 (31%), Positives = 36/82 (43%)

Query:   155 PDCRYYMNTGTCKYGADCKFHH--PKERIAQS-----AASNIGPLGLPSRPGQAICSNYS 207
             P+C +Y   G C    +C F H  P+ +I             GPL       + IC NY 
Sbjct:    94 PECYFYSKFGECS-NKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRHTRRVICVNY- 151

Query:   208 MYGICKFGPTCRFDHPYAGYPI 229
             + G C  GP+C+F HP    P+
Sbjct:   152 LVGFCPEGPSCKFMHPRFELPM 173

 Score = 48 (22.0 bits), Expect = 0.00029, Sum P(2) = 0.00029
 Identities = 9/20 (45%), Positives = 13/20 (65%)

Query:     8 CPYYMRTGSCKFGVACKFHH 27
             C +++R G CK G  C+F H
Sbjct:    68 CKHWLR-GLCKKGDQCEFLH 86


>CGD|CAL0005897
            symbol:YTH1 species:5476 "Candida albicans" [GO:0042493
            "response to drug" evidence=IMP] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA]
            [GO:0005829 "cytosol" evidence=IEA] [GO:0006379 "mRNA cleavage"
            evidence=IEA] [GO:0006378 "mRNA polyadenylation" evidence=IEA]
            [GO:0003723 "RNA binding" evidence=IEA] InterPro:IPR000571
            Pfam:PF00642 PROSITE:PS50103 SMART:SM00356 CGD:CAL0005897
            GO:GO:0005634 GO:GO:0042493 GO:GO:0046872 GO:GO:0008270
            GO:GO:0006397 GO:GO:0003723 eggNOG:COG5084 KO:K14404
            EMBL:AACQ01000145 EMBL:AACQ01000144 RefSeq:XP_712810.1
            RefSeq:XP_712839.1 ProteinModelPortal:Q59T36 SMR:Q59T36
            STRING:Q59T36 GeneID:3645540 GeneID:3645572 KEGG:cal:CaO19.14170
            KEGG:cal:CaO19.6881 Uniprot:Q59T36
        Length = 215

 Score = 101 (40.6 bits), Expect = 0.00030, Sum P(2) = 0.00030
 Identities = 23/76 (30%), Positives = 34/76 (44%)

Query:   155 PDCRYYMNTGTCKYGADCKFHH--PKERIAQSAASNIG-----PLGLPSRPGQAICSNYS 207
             P+C +Y   G C   ++C + H  P+ +I +    N G     P        + +C  Y 
Sbjct:    97 PECLFYSKNGYCTQTSECLYLHVDPQSKIPECLNYNQGFCSEGPNCKNRHVRRVLCPLY- 155

Query:   208 MYGICKFGPTCRFDHP 223
             +YG C  GP C F HP
Sbjct:   156 LYGFCPKGPECEFTHP 171

 Score = 47 (21.6 bits), Expect = 0.00030, Sum P(2) = 0.00030
 Identities = 9/20 (45%), Positives = 13/20 (65%)

Query:     8 CPYYMRTGSCKFGVACKFHH 27
             C +++R G CK G  C+F H
Sbjct:    71 CKHWLR-GLCKKGDHCEFLH 89


>UNIPROTKB|Q59T36
            symbol:YTH1 "mRNA 3'-end-processing protein YTH1"
            species:237561 "Candida albicans SC5314" [GO:0042493 "response to
            drug" evidence=IMP] InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103
            SMART:SM00356 CGD:CAL0005897 GO:GO:0005634 GO:GO:0042493
            GO:GO:0046872 GO:GO:0008270 GO:GO:0006397 GO:GO:0003723
            eggNOG:COG5084 KO:K14404 EMBL:AACQ01000145 EMBL:AACQ01000144
            RefSeq:XP_712810.1 RefSeq:XP_712839.1 ProteinModelPortal:Q59T36
            SMR:Q59T36 STRING:Q59T36 GeneID:3645540 GeneID:3645572
            KEGG:cal:CaO19.14170 KEGG:cal:CaO19.6881 Uniprot:Q59T36
        Length = 215

 Score = 101 (40.6 bits), Expect = 0.00030, Sum P(2) = 0.00030
 Identities = 23/76 (30%), Positives = 34/76 (44%)

Query:   155 PDCRYYMNTGTCKYGADCKFHH--PKERIAQSAASNIG-----PLGLPSRPGQAICSNYS 207
             P+C +Y   G C   ++C + H  P+ +I +    N G     P        + +C  Y 
Sbjct:    97 PECLFYSKNGYCTQTSECLYLHVDPQSKIPECLNYNQGFCSEGPNCKNRHVRRVLCPLY- 155

Query:   208 MYGICKFGPTCRFDHP 223
             +YG C  GP C F HP
Sbjct:   156 LYGFCPKGPECEFTHP 171

 Score = 47 (21.6 bits), Expect = 0.00030, Sum P(2) = 0.00030
 Identities = 9/20 (45%), Positives = 13/20 (65%)

Query:     8 CPYYMRTGSCKFGVACKFHH 27
             C +++R G CK G  C+F H
Sbjct:    71 CKHWLR-GLCKKGDHCEFLH 89


>UNIPROTKB|H7C4T5
            symbol:MBNL1 "Muscleblind-like protein 1" species:9606
            "Homo sapiens" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0008270 "zinc ion binding" evidence=IEA] InterPro:IPR000571
            Pfam:PF00642 PROSITE:PS50103 SMART:SM00356 GO:GO:0008270
            GO:GO:0003676 EMBL:AC026347 EMBL:AC106722 HGNC:HGNC:6923
            ChiTaRS:MBNL1 ProteinModelPortal:H7C4T5 Ensembl:ENST00000464596
            Uniprot:H7C4T5
        Length = 329

 Score = 111 (44.1 bits), Expect = 0.00030, Sum P(2) = 0.00030
 Identities = 57/209 (27%), Positives = 86/209 (41%)

Query:    23 CKFHHPQPSSLGTALPLTGNASLGSMGSSVLPSSGLQYAGSL-PTWSLQRAPYLSSRLQG 81
             CK+ HP P  L T L + G  +L    +  + +  +Q A ++ P   LQ  P  S     
Sbjct:    65 CKYLHPPPH-LKTQLEINGRNNLIQQKNMAMLAQQMQLANAMMPGAPLQPVPMFS----- 118

Query:    82 TQSYMPLIVSPSQGI-VPAPGWNTYMGNIGP-LSPTSIAGSNLIYSSRNQG-DLGAGAQM 138
                     V+PS      A  +N Y+G + P L P  I  +  +  + N G  + A A  
Sbjct:   119 --------VAPSLATNASAAAFNPYLGPVSPSLVPAEILPTAPMLVTGNPGVPVPAAA-- 168

Query:   139 HILSASSQNLPERPDQPD-CRYYMNTGTCKYGA-DCKFHHPKERIAQSAASNIGPLGLPS 196
                +A++Q L  R D+ + CR Y   G C  G  DC+F HP +        N        
Sbjct:   169 ---AAAAQKLM-RTDRLEVCREYQR-GNCNRGENDCRFAHPADSTMIDTNDNT------- 216

Query:   197 RPGQAICSNYSMYGICKFGPTCRFDHPYA 225
                  +C +Y + G C     C++ HP A
Sbjct:   217 ---VTVCMDY-IKGRCS-REKCKYFHPPA 240

 Score = 43 (20.2 bits), Expect = 0.00030, Sum P(2) = 0.00030
 Identities = 10/25 (40%), Positives = 13/25 (52%)

Query:     8 CPYYMRTGSC-KFGVACKFHHPQPS 31
             C  + R G+C +    CKF HP  S
Sbjct:    18 CREFQR-GTCSRPDTECKFAHPSKS 41


>UNIPROTKB|F1LPR3
            symbol:Mbnl1 "Protein Mbnl1" species:10116 "Rattus
            norvegicus" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0008270 "zinc ion binding" evidence=IEA] InterPro:IPR000571
            Pfam:PF00642 PROSITE:PS50103 SMART:SM00356 GO:GO:0008270
            GO:GO:0003676 IPI:IPI00567436 Ensembl:ENSRNOT00000018867
            ArrayExpress:F1LPR3 Uniprot:F1LPR3
        Length = 283

 Score = 114 (45.2 bits), Expect = 0.00036, P = 0.00036
 Identities = 56/208 (26%), Positives = 86/208 (41%)

Query:    23 CKFHHPQPSSLGTALPLTGNASLGSMGSSVLPSSGLQYAGSL-PTWSLQRAPYLSSRLQG 81
             CK+ HP P  L T L + G  +L    +  + +  +Q A ++ P   LQ  P  S     
Sbjct:     8 CKYLHPPPH-LKTQLEINGRNNLIQQKNMAMLAQQMQLANAMMPGAPLQPVPMFS----- 61

Query:    82 TQSYMPLIVSPSQGIVPAPGWNTYMGNIGP-LSPTSIAGSNLIYSSRNQG-DLGAGAQMH 139
                     V+PS     +  +N Y+G + P L P  I  +  +  + N G  + A A   
Sbjct:    62 --------VAPSLATNASAAFNPYLGPVSPSLVPAEILPTAPMLVTGNPGVPVPAAA--- 110

Query:   140 ILSASSQNLPERPDQPD-CRYYMNTGTCKYGA-DCKFHHPKERIAQSAASNIGPLGLPSR 197
               +A++Q L  R D+ + CR Y   G C  G  DC+F HP +        N         
Sbjct:   111 --AAAAQKLM-RTDRLEVCREYQR-GNCNRGENDCRFAHPADSTMIDTNDNT-------- 158

Query:   198 PGQAICSNYSMYGICKFGPTCRFDHPYA 225
                 +C +Y + G C     C++ HP A
Sbjct:   159 --VTVCMDY-IKGRCS-REKCKYFHPPA 182


>UNIPROTKB|C9JP00
            symbol:MBNL1 "Muscleblind-like protein 1" species:9606
            "Homo sapiens" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0008270 "zinc ion binding" evidence=IEA] InterPro:IPR000571
            Pfam:PF00642 PROSITE:PS50103 SMART:SM00356 GO:GO:0008270
            GO:GO:0003676 EMBL:AC026347 EMBL:AC106722 HGNC:HGNC:6923
            HOGENOM:HOG000230928 ChiTaRS:MBNL1 IPI:IPI00945690
            ProteinModelPortal:C9JP00 SMR:C9JP00 STRING:C9JP00
            Ensembl:ENST00000498502 ArrayExpress:C9JP00 Bgee:C9JP00
            Uniprot:C9JP00
        Length = 348

 Score = 111 (44.1 bits), Expect = 0.00037, Sum P(2) = 0.00037
 Identities = 57/209 (27%), Positives = 86/209 (41%)

Query:    23 CKFHHPQPSSLGTALPLTGNASLGSMGSSVLPSSGLQYAGSL-PTWSLQRAPYLSSRLQG 81
             CK+ HP P  L T L + G  +L    +  + +  +Q A ++ P   LQ  P  S     
Sbjct:    66 CKYLHPPPH-LKTQLEINGRNNLIQQKNMAMLAQQMQLANAMMPGAPLQPVPMFS----- 119

Query:    82 TQSYMPLIVSPSQGI-VPAPGWNTYMGNIGP-LSPTSIAGSNLIYSSRNQG-DLGAGAQM 138
                     V+PS      A  +N Y+G + P L P  I  +  +  + N G  + A A  
Sbjct:   120 --------VAPSLATNASAAAFNPYLGPVSPSLVPAEILPTAPMLVTGNPGVPVPAAA-- 169

Query:   139 HILSASSQNLPERPDQPD-CRYYMNTGTCKYGA-DCKFHHPKERIAQSAASNIGPLGLPS 196
                +A++Q L  R D+ + CR Y   G C  G  DC+F HP +        N        
Sbjct:   170 ---AAAAQKLM-RTDRLEVCREYQR-GNCNRGENDCRFAHPADSTMIDTNDNT------- 217

Query:   197 RPGQAICSNYSMYGICKFGPTCRFDHPYA 225
                  +C +Y + G C     C++ HP A
Sbjct:   218 ---VTVCMDY-IKGRCS-REKCKYFHPPA 241

 Score = 43 (20.2 bits), Expect = 0.00037, Sum P(2) = 0.00037
 Identities = 10/25 (40%), Positives = 13/25 (52%)

Query:     8 CPYYMRTGSC-KFGVACKFHHPQPS 31
             C  + R G+C +    CKF HP  S
Sbjct:    19 CREFQR-GTCSRPDTECKFAHPSKS 42


>MGI|MGI:1928482
            symbol:Mbnl1 "muscleblind-like 1 (Drosophila)" species:10090
            "Mus musculus" [GO:0000380 "alternative mRNA splicing, via
            spliceosome" evidence=ISO;IMP] [GO:0000381 "regulation of
            alternative mRNA splicing, via spliceosome" evidence=IMP]
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0003723 "RNA
            binding" evidence=ISO] [GO:0003725 "double-stranded RNA binding"
            evidence=ISO] [GO:0005634 "nucleus" evidence=ISO;IDA] [GO:0005737
            "cytoplasm" evidence=ISO;IDA] [GO:0006376 "mRNA splice site
            selection" evidence=IMP] [GO:0006397 "mRNA processing"
            evidence=IEA] [GO:0007519 "skeletal muscle tissue development"
            evidence=IMP] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0008380 "RNA splicing" evidence=ISO] [GO:0010494 "cytoplasmic
            stress granule" evidence=ISO] [GO:0043484 "regulation of RNA
            splicing" evidence=ISO] [GO:0045445 "myoblast differentiation"
            evidence=IDA] [GO:0046872 "metal ion binding" evidence=IEA]
            InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103 SMART:SM00356
            MGI:MGI:1928482 GO:GO:0005634 GO:GO:0007399 GO:GO:0046872
            GO:GO:0008270 GO:GO:0001701 GO:GO:0030326 GO:GO:0003725
            GO:GO:0045445 GO:GO:0010494 GO:GO:0007519 GO:GO:0000380
            GO:GO:0006376 GO:GO:0000381 eggNOG:NOG241142 KO:K14943 CTD:4154
            HOVERGEN:HBG006999 HOGENOM:HOG000230928 EMBL:AF231110
            IPI:IPI00466710 RefSeq:NP_001240637.1 UniGene:Mm.255723
            ProteinModelPortal:Q9JKP5 SMR:Q9JKP5 STRING:Q9JKP5
            PhosphoSite:Q9JKP5 PaxDb:Q9JKP5 PRIDE:Q9JKP5 GeneID:56758
            KEGG:mmu:56758 UCSC:uc008pjh.1 CleanEx:MM_MBNL1
            Genevestigator:Q9JKP5 GermOnline:ENSMUSG00000027763 Uniprot:Q9JKP5
        Length = 341

 Score = 115 (45.5 bits), Expect = 0.00040, P = 0.00040
 Identities = 56/208 (26%), Positives = 86/208 (41%)

Query:    23 CKFHHPQPSSLGTALPLTGNASLGSMGSSVLPSSGLQYAGSL-PTWSLQRAPYLSSRLQG 81
             CK+ HP P  L T L + G  +L    +  + +  +Q A ++ P   LQ  P  S     
Sbjct:    66 CKYLHPPPH-LKTQLEINGRNNLIQQKNMAMLAQQMQLANAMMPGAPLQPVPMFS----- 119

Query:    82 TQSYMPLIVSPSQGIVPAPGWNTYMGNIGP-LSPTSIAGSNLIYSSRNQG-DLGAGAQMH 139
                     V+PS     +  +N Y+G + P L P  I  +  +  + N G  + A A   
Sbjct:   120 --------VAPSLATSASAAFNPYLGPVSPSLVPAEILPTAPMLVTGNPGVPVPAAA--- 168

Query:   140 ILSASSQNLPERPDQPD-CRYYMNTGTCKYGA-DCKFHHPKERIAQSAASNIGPLGLPSR 197
               +A++Q L  R D+ + CR Y   G C  G  DC+F HP +        N         
Sbjct:   169 --AAAAQKLM-RTDRLEVCREYQR-GNCNRGENDCRFAHPADSTMIDTNDNT-------- 216

Query:   198 PGQAICSNYSMYGICKFGPTCRFDHPYA 225
                 +C +Y + G C     C++ HP A
Sbjct:   217 --VTVCMDY-IKGRCS-REKCKYFHPPA 240


>UNIPROTKB|Q5ZKW9
            symbol:MBNL1 "Muscleblind-like protein 1" species:9031
            "Gallus gallus" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0003723 "RNA binding" evidence=IEA] [GO:0006397 "mRNA
            processing" evidence=IEA] [GO:0008380 "RNA splicing" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0005737 "cytoplasm"
            evidence=IEA] InterPro:IPR000571 PROSITE:PS50103 SMART:SM00356
            GO:GO:0005634 GO:GO:0005737 GO:GO:0008380 GO:GO:0046872
            GO:GO:0008270 GO:GO:0006397 GO:GO:0003723 eggNOG:NOG241142
            GeneTree:ENSGT00390000001586 KO:K14943 EMBL:AJ719965
            IPI:IPI00587570 RefSeq:NP_001026493.1 UniGene:Gga.4840
            ProteinModelPortal:Q5ZKW9 SMR:Q5ZKW9 STRING:Q5ZKW9
            Ensembl:ENSGALT00000016870 GeneID:425033 KEGG:gga:425033 CTD:4154
            HOVERGEN:HBG006999 OrthoDB:EOG4BRWN5 NextBio:20827282
            ArrayExpress:Q5ZKW9 Uniprot:Q5ZKW9
        Length = 369

 Score = 115 (45.5 bits), Expect = 0.00046, P = 0.00046
 Identities = 58/208 (27%), Positives = 89/208 (42%)

Query:    23 CKFHHPQPSSLGTALPLTGNASLGSMGSSVLPSSGLQYAGSL-PTWSLQRAPYLSSRLQG 81
             CK+ HP P  L T L + G  +L    +  + +  +Q A ++ P   LQ  P  S     
Sbjct:    66 CKYLHPPPH-LKTQLEINGRNNLIQQKNMAMLAQQMQLANAMMPGAPLQPVPMFS----- 119

Query:    82 TQSYMPLIVSPSQGIVPAPGWNTYMGNIGP-LSPTSIAGSNLIYSSRNQG-DLGAGAQMH 139
                     V+PS     +  +N Y+G + P L P  I  +  +  + N G  + A A   
Sbjct:   120 --------VAPSLATNASAAFNPYLGPVSPGLVPAEILPTAPMLVAGNPGVPVPAAA--- 168

Query:   140 ILSASSQNLPERPDQPD-CRYYMNTGTCKYGA-DCKFHHPKERIAQSAASNIGPLGLPSR 197
               +A++Q L  R D+ + CR Y   G C  G  DC+F HP    A SA  +     +   
Sbjct:   169 --AAAAQKLM-RTDRLEVCREYQR-GNCNRGENDCRFAHP----ADSAMIDTNDNTV--- 217

Query:   198 PGQAICSNYSMYGICKFGPTCRFDHPYA 225
                 +C +Y + G C     C++ HP A
Sbjct:   218 ---TVCMDY-IKGRCS-REKCKYFHPPA 240


>UNIPROTKB|F1M9N4
            symbol:Mbnl1 "Protein Mbnl1" species:10116 "Rattus
            norvegicus" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0008270 "zinc ion binding" evidence=IEA] InterPro:IPR000571
            Pfam:PF00642 PROSITE:PS50103 SMART:SM00356 GO:GO:0005634
            GO:GO:0008270 GO:GO:0003725 GO:GO:0010494 GO:GO:0007519
            GO:GO:0006376 GO:GO:0000381 GeneTree:ENSGT00390000001586
            IPI:IPI00373207 Ensembl:ENSRNOT00000036808 ArrayExpress:F1M9N4
            Uniprot:F1M9N4
        Length = 323

 Score = 114 (45.2 bits), Expect = 0.00047, P = 0.00047
 Identities = 56/208 (26%), Positives = 86/208 (41%)

Query:    23 CKFHHPQPSSLGTALPLTGNASLGSMGSSVLPSSGLQYAGSL-PTWSLQRAPYLSSRLQG 81
             CK+ HP P  L T L + G  +L    +  + +  +Q A ++ P   LQ  P  S     
Sbjct:     8 CKYLHPPPH-LKTQLEINGRNNLIQQKNMAMLAQQMQLANAMMPGAPLQPVPMFS----- 61

Query:    82 TQSYMPLIVSPSQGIVPAPGWNTYMGNIGP-LSPTSIAGSNLIYSSRNQG-DLGAGAQMH 139
                     V+PS     +  +N Y+G + P L P  I  +  +  + N G  + A A   
Sbjct:    62 --------VAPSLATNASAAFNPYLGPVSPSLVPAEILPTAPMLVTGNPGVPVPAAA--- 110

Query:   140 ILSASSQNLPERPDQPD-CRYYMNTGTCKYGA-DCKFHHPKERIAQSAASNIGPLGLPSR 197
               +A++Q L  R D+ + CR Y   G C  G  DC+F HP +        N         
Sbjct:   111 --AAAAQKLM-RTDRLEVCREYQR-GNCNRGENDCRFAHPADSTMIDTNDNT-------- 158

Query:   198 PGQAICSNYSMYGICKFGPTCRFDHPYA 225
                 +C +Y + G C     C++ HP A
Sbjct:   159 --VTVCMDY-IKGRCS-REKCKYFHPPA 182


>UNIPROTKB|F1NBC8
            symbol:MBNL1 "Muscleblind-like protein 1" species:9031
            "Gallus gallus" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0000380 "alternative mRNA splicing, via spliceosome"
            evidence=IEA] [GO:0000381 "regulation of alternative mRNA splicing,
            via spliceosome" evidence=IEA] [GO:0003725 "double-stranded RNA
            binding" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            [GO:0006376 "mRNA splice site selection" evidence=IEA] [GO:0007519
            "skeletal muscle tissue development" evidence=IEA] [GO:0010494
            "cytoplasmic stress granule" evidence=IEA] InterPro:IPR000571
            PROSITE:PS50103 SMART:SM00356 GO:GO:0005634 GO:GO:0008270
            GO:GO:0003725 GO:GO:0010494 GO:GO:0006376 GO:GO:0000381
            GeneTree:ENSGT00390000001586 OMA:QQQAAFI EMBL:AADN02021113
            EMBL:AADN02021114 EMBL:AADN02021115 EMBL:AADN02021116
            IPI:IPI00944391 Ensembl:ENSGALT00000016871 ArrayExpress:F1NBC8
            Uniprot:F1NBC8
        Length = 384

 Score = 115 (45.5 bits), Expect = 0.00049, P = 0.00049
 Identities = 58/208 (27%), Positives = 89/208 (42%)

Query:    23 CKFHHPQPSSLGTALPLTGNASLGSMGSSVLPSSGLQYAGSL-PTWSLQRAPYLSSRLQG 81
             CK+ HP P  L T L + G  +L    +  + +  +Q A ++ P   LQ  P  S     
Sbjct:    69 CKYLHPPPH-LKTQLEINGRNNLIQQKNMAMLAQQMQLANAMMPGAPLQPVPMFS----- 122

Query:    82 TQSYMPLIVSPSQGIVPAPGWNTYMGNIGP-LSPTSIAGSNLIYSSRNQG-DLGAGAQMH 139
                     V+PS     +  +N Y+G + P L P  I  +  +  + N G  + A A   
Sbjct:   123 --------VAPSLATNASAAFNPYLGPVSPGLVPAEILPTAPMLVAGNPGVPVPAAA--- 171

Query:   140 ILSASSQNLPERPDQPD-CRYYMNTGTCKYGA-DCKFHHPKERIAQSAASNIGPLGLPSR 197
               +A++Q L  R D+ + CR Y   G C  G  DC+F HP    A SA  +     +   
Sbjct:   172 --AAAAQKLM-RTDRLEVCREYQR-GNCNRGENDCRFAHP----ADSAMIDTNDNTV--- 220

Query:   198 PGQAICSNYSMYGICKFGPTCRFDHPYA 225
                 +C +Y + G C     C++ HP A
Sbjct:   221 ---TVCMDY-IKGRCS-REKCKYFHPPA 243


>UNIPROTKB|E2QSA8
            symbol:MBNL1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] InterPro:IPR000571
            Pfam:PF00642 PROSITE:PS50103 SMART:SM00356 GO:GO:0008270
            GO:GO:0003676 KO:K14943 CTD:4154 RefSeq:XP_866180.2
            Ensembl:ENSCAFT00000013668 GeneID:477116 KEGG:cfa:477116
            NextBio:20852652 Uniprot:E2QSA8
        Length = 341

 Score = 114 (45.2 bits), Expect = 0.00052, P = 0.00052
 Identities = 56/208 (26%), Positives = 86/208 (41%)

Query:    23 CKFHHPQPSSLGTALPLTGNASLGSMGSSVLPSSGLQYAGSL-PTWSLQRAPYLSSRLQG 81
             CK+ HP P  L T L + G  +L    +  + +  +Q A ++ P   LQ  P  S     
Sbjct:    66 CKYLHPPPH-LKTQLEINGRNNLIQQKNMAMLAQQMQLANAMMPGAPLQPVPMFS----- 119

Query:    82 TQSYMPLIVSPSQGIVPAPGWNTYMGNIGP-LSPTSIAGSNLIYSSRNQG-DLGAGAQMH 139
                     V+PS     +  +N Y+G + P L P  I  +  +  + N G  + A A   
Sbjct:   120 --------VAPSLATNASAAFNPYLGPVSPSLVPAEILPTAPMLVTGNPGVPVPAAA--- 168

Query:   140 ILSASSQNLPERPDQPD-CRYYMNTGTCKYGA-DCKFHHPKERIAQSAASNIGPLGLPSR 197
               +A++Q L  R D+ + CR Y   G C  G  DC+F HP +        N         
Sbjct:   169 --AAAAQKLM-RTDRLEVCREYQR-GNCNRGENDCRFAHPADSTMIDTNDNT-------- 216

Query:   198 PGQAICSNYSMYGICKFGPTCRFDHPYA 225
                 +C +Y + G C     C++ HP A
Sbjct:   217 --VTVCMDY-IKGRCS-REKCKYFHPPA 240


>UNIPROTKB|Q9NR56
            symbol:MBNL1 "Muscleblind-like protein 1" species:9606
            "Homo sapiens" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0000380 "alternative mRNA splicing, via spliceosome"
            evidence=IEA] [GO:0000381 "regulation of alternative mRNA splicing,
            via spliceosome" evidence=IEA] [GO:0006376 "mRNA splice site
            selection" evidence=IEA] [GO:0007519 "skeletal muscle tissue
            development" evidence=IEA] [GO:0005634 "nucleus" evidence=IDA]
            [GO:0005737 "cytoplasm" evidence=IDA] [GO:0001701 "in utero
            embryonic development" evidence=ISS] [GO:0030326 "embryonic limb
            morphogenesis" evidence=ISS] [GO:0045445 "myoblast differentiation"
            evidence=ISS] [GO:0007399 "nervous system development"
            evidence=ISS] [GO:0003725 "double-stranded RNA binding"
            evidence=IDA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0008380 "RNA splicing" evidence=IDA] [GO:0010494 "cytoplasmic
            stress granule" evidence=IDA] [GO:0043484 "regulation of RNA
            splicing" evidence=IDA] [GO:0003723 "RNA binding" evidence=IDA]
            InterPro:IPR000571 PROSITE:PS50103 SMART:SM00356 GO:GO:0005634
            GO:GO:0007399 GO:GO:0008380 GO:GO:0046872 EMBL:CH471052
            GO:GO:0008270 GO:GO:0001701 GO:GO:0003676 GO:GO:0030326
            GO:GO:0003725 GO:GO:0043484 GO:GO:0045445 GO:GO:0010494
            GO:GO:0007519 GO:GO:0006376 GO:GO:0000381 MIM:160900
            eggNOG:NOG241142 KO:K14943 CTD:4154 HOVERGEN:HBG006999
            OrthoDB:EOG4BRWN5 EMBL:Y13829 EMBL:AB007888 EMBL:AF255334
            EMBL:AF395876 EMBL:AF401998 EMBL:AJ308400 EMBL:AF497718
            EMBL:AF497719 EMBL:AC026347 EMBL:AC106722 EMBL:BC043493
            IPI:IPI00021692 IPI:IPI00178116 IPI:IPI00218263 IPI:IPI00384261
            IPI:IPI00395357 IPI:IPI00410205 IPI:IPI00410279 RefSeq:NP_066368.2
            RefSeq:NP_997175.1 RefSeq:NP_997176.1 RefSeq:NP_997177.1
            RefSeq:NP_997178.1 RefSeq:NP_997179.1 RefSeq:NP_997180.1
            UniGene:Hs.201858 UniGene:Hs.725347 PDB:3D2N PDB:3D2Q PDB:3D2S
            PDBsum:3D2N PDBsum:3D2Q PDBsum:3D2S ProteinModelPortal:Q9NR56
            SMR:Q9NR56 IntAct:Q9NR56 MINT:MINT-3072751 STRING:Q9NR56
            PhosphoSite:Q9NR56 DMDM:17369313 PaxDb:Q9NR56 PeptideAtlas:Q9NR56
            PRIDE:Q9NR56 DNASU:4154 Ensembl:ENST00000282486
            Ensembl:ENST00000282488 Ensembl:ENST00000324196
            Ensembl:ENST00000324210 Ensembl:ENST00000355460
            Ensembl:ENST00000357472 Ensembl:ENST00000463374
            Ensembl:ENST00000465907 Ensembl:ENST00000485910
            Ensembl:ENST00000492948 Ensembl:ENST00000545754 GeneID:4154
            KEGG:hsa:4154 UCSC:uc003ezi.3 UCSC:uc003ezm.3 UCSC:uc003ezo.3
            GeneCards:GC03P151961 HGNC:HGNC:6923 HPA:CAB016398 MIM:606516
            neXtProt:NX_Q9NR56 PharmGKB:PA30666 HOGENOM:HOG000230928
            InParanoid:Q9NR56 OMA:QQQAAFI PhylomeDB:Q9NR56 ChEMBL:CHEMBL1293317
            ChiTaRS:MBNL1 EvolutionaryTrace:Q9NR56 GenomeRNAi:4154
            NextBio:16346 ArrayExpress:Q9NR56 Bgee:Q9NR56 CleanEx:HS_MBNL1
            Genevestigator:Q9NR56 GermOnline:ENSG00000152601 Uniprot:Q9NR56
        Length = 388

 Score = 111 (44.1 bits), Expect = 0.00052, Sum P(2) = 0.00052
 Identities = 57/209 (27%), Positives = 86/209 (41%)

Query:    23 CKFHHPQPSSLGTALPLTGNASLGSMGSSVLPSSGLQYAGSL-PTWSLQRAPYLSSRLQG 81
             CK+ HP P  L T L + G  +L    +  + +  +Q A ++ P   LQ  P  S     
Sbjct:    66 CKYLHPPPH-LKTQLEINGRNNLIQQKNMAMLAQQMQLANAMMPGAPLQPVPMFS----- 119

Query:    82 TQSYMPLIVSPSQGI-VPAPGWNTYMGNIGP-LSPTSIAGSNLIYSSRNQG-DLGAGAQM 138
                     V+PS      A  +N Y+G + P L P  I  +  +  + N G  + A A  
Sbjct:   120 --------VAPSLATNASAAAFNPYLGPVSPSLVPAEILPTAPMLVTGNPGVPVPAAA-- 169

Query:   139 HILSASSQNLPERPDQPD-CRYYMNTGTCKYGA-DCKFHHPKERIAQSAASNIGPLGLPS 196
                +A++Q L  R D+ + CR Y   G C  G  DC+F HP +        N        
Sbjct:   170 ---AAAAQKLM-RTDRLEVCREYQR-GNCNRGENDCRFAHPADSTMIDTNDNT------- 217

Query:   197 RPGQAICSNYSMYGICKFGPTCRFDHPYA 225
                  +C +Y + G C     C++ HP A
Sbjct:   218 ---VTVCMDY-IKGRCS-REKCKYFHPPA 241

 Score = 43 (20.2 bits), Expect = 0.00052, Sum P(2) = 0.00052
 Identities = 10/25 (40%), Positives = 13/25 (52%)

Query:     8 CPYYMRTGSC-KFGVACKFHHPQPS 31
             C  + R G+C +    CKF HP  S
Sbjct:    19 CREFQR-GTCSRPDTECKFAHPSKS 42


>UNIPROTKB|G3X6F9
            symbol:MBNL1 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0010494 "cytoplasmic stress granule" evidence=IEA]
            [GO:0007519 "skeletal muscle tissue development" evidence=IEA]
            [GO:0006376 "mRNA splice site selection" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0003725 "double-stranded RNA binding"
            evidence=IEA] [GO:0000381 "regulation of alternative mRNA splicing,
            via spliceosome" evidence=IEA] [GO:0000380 "alternative mRNA
            splicing, via spliceosome" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] InterPro:IPR000571 PROSITE:PS50103
            SMART:SM00356 GO:GO:0005634 GO:GO:0008270 GO:GO:0003725
            GO:GO:0010494 GO:GO:0007519 GO:GO:0006376 GO:GO:0000381
            GeneTree:ENSGT00390000001586 OMA:QQQAAFI EMBL:DAAA02002626
            Ensembl:ENSBTAT00000005998 Uniprot:G3X6F9
        Length = 370

 Score = 114 (45.2 bits), Expect = 0.00060, P = 0.00060
 Identities = 56/208 (26%), Positives = 86/208 (41%)

Query:    23 CKFHHPQPSSLGTALPLTGNASLGSMGSSVLPSSGLQYAGSL-PTWSLQRAPYLSSRLQG 81
             CK+ HP P  L T L + G  +L    +  + +  +Q A ++ P   LQ  P  S     
Sbjct:    66 CKYLHPPPH-LKTQLEINGRNNLIQQKNMAMLAQQMQLANAMMPGAPLQPVPMFS----- 119

Query:    82 TQSYMPLIVSPSQGIVPAPGWNTYMGNIGP-LSPTSIAGSNLIYSSRNQG-DLGAGAQMH 139
                     V+PS     +  +N Y+G + P L P  I  +  +  + N G  + A A   
Sbjct:   120 --------VAPSLATNASAAFNPYLGPVSPSLVPAEILPTAPMLVTGNPGVPVPAAA--- 168

Query:   140 ILSASSQNLPERPDQPD-CRYYMNTGTCKYGA-DCKFHHPKERIAQSAASNIGPLGLPSR 197
               +A++Q L  R D+ + CR Y   G C  G  DC+F HP +        N         
Sbjct:   169 --AAAAQKLM-RTDRLEVCREYQR-GNCNRGENDCRFAHPADSTMIDTNDNT-------- 216

Query:   198 PGQAICSNYSMYGICKFGPTCRFDHPYA 225
                 +C +Y + G C     C++ HP A
Sbjct:   217 --VTVCMDY-IKGRCS-REKCKYFHPPA 240


>UNIPROTKB|E2QSC6
            symbol:MBNL1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] InterPro:IPR000571
            PROSITE:PS50103 SMART:SM00356 GO:GO:0008270 GO:GO:0003676
            Ensembl:ENSCAFT00000013656 Uniprot:E2QSC6
        Length = 381

 Score = 114 (45.2 bits), Expect = 0.00063, P = 0.00063
 Identities = 56/208 (26%), Positives = 86/208 (41%)

Query:    23 CKFHHPQPSSLGTALPLTGNASLGSMGSSVLPSSGLQYAGSL-PTWSLQRAPYLSSRLQG 81
             CK+ HP P  L T L + G  +L    +  + +  +Q A ++ P   LQ  P  S     
Sbjct:    66 CKYLHPPPH-LKTQLEINGRNNLIQQKNMAMLAQQMQLANAMMPGAPLQPVPMFS----- 119

Query:    82 TQSYMPLIVSPSQGIVPAPGWNTYMGNIGP-LSPTSIAGSNLIYSSRNQG-DLGAGAQMH 139
                     V+PS     +  +N Y+G + P L P  I  +  +  + N G  + A A   
Sbjct:   120 --------VAPSLATNASAAFNPYLGPVSPSLVPAEILPTAPMLVTGNPGVPVPAAA--- 168

Query:   140 ILSASSQNLPERPDQPD-CRYYMNTGTCKYGA-DCKFHHPKERIAQSAASNIGPLGLPSR 197
               +A++Q L  R D+ + CR Y   G C  G  DC+F HP +        N         
Sbjct:   169 --AAAAQKLM-RTDRLEVCREYQR-GNCNRGENDCRFAHPADSTMIDTNDNT-------- 216

Query:   198 PGQAICSNYSMYGICKFGPTCRFDHPYA 225
                 +C +Y + G C     C++ HP A
Sbjct:   217 --VTVCMDY-IKGRCS-REKCKYFHPPA 240


>UNIPROTKB|F6UUJ0
            symbol:MBNL1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] InterPro:IPR000571
            PROSITE:PS50103 SMART:SM00356 GO:GO:0008270 GO:GO:0003676
            GeneTree:ENSGT00390000001586 Ensembl:ENSCAFT00000013656
            EMBL:AAEX03013688 EMBL:AAEX03013689 EMBL:AAEX03013690
            Uniprot:F6UUJ0
        Length = 387

 Score = 114 (45.2 bits), Expect = 0.00064, P = 0.00064
 Identities = 56/208 (26%), Positives = 86/208 (41%)

Query:    23 CKFHHPQPSSLGTALPLTGNASLGSMGSSVLPSSGLQYAGSL-PTWSLQRAPYLSSRLQG 81
             CK+ HP P  L T L + G  +L    +  + +  +Q A ++ P   LQ  P  S     
Sbjct:    66 CKYLHPPPH-LKTQLEINGRNNLIQQKNMAMLAQQMQLANAMMPGAPLQPVPMFS----- 119

Query:    82 TQSYMPLIVSPSQGIVPAPGWNTYMGNIGP-LSPTSIAGSNLIYSSRNQG-DLGAGAQMH 139
                     V+PS     +  +N Y+G + P L P  I  +  +  + N G  + A A   
Sbjct:   120 --------VAPSLATNASAAFNPYLGPVSPSLVPAEILPTAPMLVTGNPGVPVPAAA--- 168

Query:   140 ILSASSQNLPERPDQPD-CRYYMNTGTCKYGA-DCKFHHPKERIAQSAASNIGPLGLPSR 197
               +A++Q L  R D+ + CR Y   G C  G  DC+F HP +        N         
Sbjct:   169 --AAAAQKLM-RTDRLEVCREYQR-GNCNRGENDCRFAHPADSTMIDTNDNT-------- 216

Query:   198 PGQAICSNYSMYGICKFGPTCRFDHPYA 225
                 +C +Y + G C     C++ HP A
Sbjct:   217 --VTVCMDY-IKGRCS-REKCKYFHPPA 240


>UNIPROTKB|Q66KE3
            symbol:cpsf4 "Cleavage and polyadenylation specificity
            factor subunit 4" species:8364 "Xenopus (Silurana) tropicalis"
            [GO:0005847 "mRNA cleavage and polyadenylation specificity factor
            complex" evidence=ISS] InterPro:IPR000571 InterPro:IPR001878
            Pfam:PF00098 Pfam:PF00642 PROSITE:PS50103 PROSITE:PS50158
            SMART:SM00343 SMART:SM00356 GO:GO:0046872 GO:GO:0008270
            GO:GO:0006397 GO:GO:0003723 Gene3D:4.10.60.10 SUPFAM:SSF57756
            eggNOG:COG5084 GO:GO:0042462 GO:GO:0005847 HOVERGEN:HBG051108
            CTD:10898 KO:K14404 OrthoDB:EOG4KH2VQ EMBL:BC080440
            RefSeq:NP_001007933.1 UniGene:Str.3196 ProteinModelPortal:Q66KE3
            SMR:Q66KE3 STRING:Q66KE3 GeneID:493312 KEGG:xtr:493312
            Xenbase:XB-GENE-948302 InParanoid:Q66KE3 Bgee:Q66KE3 Uniprot:Q66KE3
        Length = 269

 Score = 101 (40.6 bits), Expect = 0.00064, Sum P(2) = 0.00064
 Identities = 26/82 (31%), Positives = 35/82 (42%)

Query:   155 PDCRYYMNTGTCKYGADCKFHH--PKERIAQS-----AASNIGPLGLPSRPGQAICSNYS 207
             P+C +Y   G C    +C F H  P+ +I             GPL       + IC NY 
Sbjct:    94 PECYFYSKFGECS-NKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRHTRRVICVNY- 151

Query:   208 MYGICKFGPTCRFDHPYAGYPI 229
             + G C  GP C+F HP    P+
Sbjct:   152 LVGFCIEGPNCKFMHPRFELPM 173

 Score = 48 (22.0 bits), Expect = 0.00064, Sum P(2) = 0.00064
 Identities = 9/20 (45%), Positives = 13/20 (65%)

Query:     8 CPYYMRTGSCKFGVACKFHH 27
             C +++R G CK G  C+F H
Sbjct:    68 CKHWLR-GLCKKGDQCEFLH 86


>UNIPROTKB|Q6DJP7
            symbol:cpsf4 "Cleavage and polyadenylation specificity
            factor subunit 4" species:8355 "Xenopus laevis" [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=ISS] InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098
            Pfam:PF00642 PROSITE:PS50103 PROSITE:PS50158 SMART:SM00343
            SMART:SM00356 GO:GO:0046872 GO:GO:0008270 GO:GO:0006397
            GO:GO:0003723 Gene3D:4.10.60.10 SUPFAM:SSF57756 GO:GO:0005847
            HOVERGEN:HBG051108 CTD:10898 KO:K14404 EMBL:BC075128
            RefSeq:NP_001086337.1 UniGene:Xl.25683 ProteinModelPortal:Q6DJP7
            SMR:Q6DJP7 GeneID:444766 KEGG:xla:444766 Xenbase:XB-GENE-948308
            Uniprot:Q6DJP7
        Length = 269

 Score = 101 (40.6 bits), Expect = 0.00064, Sum P(2) = 0.00064
 Identities = 26/82 (31%), Positives = 35/82 (42%)

Query:   155 PDCRYYMNTGTCKYGADCKFHH--PKERIAQS-----AASNIGPLGLPSRPGQAICSNYS 207
             P+C +Y   G C    +C F H  P+ +I             GPL       + IC NY 
Sbjct:    94 PECYFYSKFGECS-NKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRHTRRVICVNY- 151

Query:   208 MYGICKFGPTCRFDHPYAGYPI 229
             + G C  GP C+F HP    P+
Sbjct:   152 LVGFCIEGPNCKFMHPRFELPM 173

 Score = 48 (22.0 bits), Expect = 0.00064, Sum P(2) = 0.00064
 Identities = 9/20 (45%), Positives = 13/20 (65%)

Query:     8 CPYYMRTGSCKFGVACKFHH 27
             C +++R G CK G  C+F H
Sbjct:    68 CKHWLR-GLCKKGDQCEFLH 86


>UNIPROTKB|F6UU15
            symbol:MBNL1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] InterPro:IPR000571
            PROSITE:PS50103 SMART:SM00356 GO:GO:0008270 GO:GO:0003676
            GeneTree:ENSGT00390000001586 OMA:QQQAAFI Ensembl:ENSCAFT00000013668
            EMBL:AAEX03013688 EMBL:AAEX03013689 EMBL:AAEX03013690
            Uniprot:F6UU15
        Length = 399

 Score = 114 (45.2 bits), Expect = 0.00068, P = 0.00068
 Identities = 56/208 (26%), Positives = 86/208 (41%)

Query:    23 CKFHHPQPSSLGTALPLTGNASLGSMGSSVLPSSGLQYAGSL-PTWSLQRAPYLSSRLQG 81
             CK+ HP P  L T L + G  +L    +  + +  +Q A ++ P   LQ  P  S     
Sbjct:    66 CKYLHPPPH-LKTQLEINGRNNLIQQKNMAMLAQQMQLANAMMPGAPLQPVPMFS----- 119

Query:    82 TQSYMPLIVSPSQGIVPAPGWNTYMGNIGP-LSPTSIAGSNLIYSSRNQG-DLGAGAQMH 139
                     V+PS     +  +N Y+G + P L P  I  +  +  + N G  + A A   
Sbjct:   120 --------VAPSLATNASAAFNPYLGPVSPSLVPAEILPTAPMLVTGNPGVPVPAAA--- 168

Query:   140 ILSASSQNLPERPDQPD-CRYYMNTGTCKYGA-DCKFHHPKERIAQSAASNIGPLGLPSR 197
               +A++Q L  R D+ + CR Y   G C  G  DC+F HP +        N         
Sbjct:   169 --AAAAQKLM-RTDRLEVCREYQR-GNCNRGENDCRFAHPADSTMIDTNDNT-------- 216

Query:   198 PGQAICSNYSMYGICKFGPTCRFDHPYA 225
                 +C +Y + G C     C++ HP A
Sbjct:   217 --VTVCMDY-IKGRCS-REKCKYFHPPA 240


>UNIPROTKB|F1SJM7
            symbol:MBNL1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0010494 "cytoplasmic stress granule" evidence=IEA]
            [GO:0007519 "skeletal muscle tissue development" evidence=IEA]
            [GO:0006376 "mRNA splice site selection" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0003725 "double-stranded RNA binding"
            evidence=IEA] [GO:0000381 "regulation of alternative mRNA splicing,
            via spliceosome" evidence=IEA] [GO:0000380 "alternative mRNA
            splicing, via spliceosome" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] InterPro:IPR000571 PROSITE:PS50103
            SMART:SM00356 GO:GO:0005634 GO:GO:0008270 GO:GO:0003725
            GO:GO:0010494 GO:GO:0007519 GO:GO:0006376 GO:GO:0000381
            GeneTree:ENSGT00390000001586 OMA:QQQAAFI EMBL:CU856483
            Ensembl:ENSSSCT00000012830 Uniprot:F1SJM7
        Length = 399

 Score = 114 (45.2 bits), Expect = 0.00068, P = 0.00068
 Identities = 56/208 (26%), Positives = 86/208 (41%)

Query:    23 CKFHHPQPSSLGTALPLTGNASLGSMGSSVLPSSGLQYAGSL-PTWSLQRAPYLSSRLQG 81
             CK+ HP P  L T L + G  +L    +  + +  +Q A ++ P   LQ  P  S     
Sbjct:    66 CKYLHPPPH-LKTQLEINGRNNLIQQKNMAMLAQQMQLANAMMPGAPLQPVPMFS----- 119

Query:    82 TQSYMPLIVSPSQGIVPAPGWNTYMGNIGP-LSPTSIAGSNLIYSSRNQG-DLGAGAQMH 139
                     V+PS     +  +N Y+G + P L P  I  +  +  + N G  + A A   
Sbjct:   120 --------VAPSLATNASAAFNPYLGPVSPSLVPAEILPTAPMLVTGNPGVPVPAAA--- 168

Query:   140 ILSASSQNLPERPDQPD-CRYYMNTGTCKYGA-DCKFHHPKERIAQSAASNIGPLGLPSR 197
               +A++Q L  R D+ + CR Y   G C  G  DC+F HP +        N         
Sbjct:   169 --AAAAQKLM-RTDRLEVCREYQR-GNCNRGENDCRFAHPADSTMIDTNDNT-------- 216

Query:   198 PGQAICSNYSMYGICKFGPTCRFDHPYA 225
                 +C +Y + G C     C++ HP A
Sbjct:   217 --VTVCMDY-IKGRCS-REKCKYFHPPA 240


>UNIPROTKB|E2RBM0
            symbol:CPSF4 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] InterPro:IPR000571
            Pfam:PF00642 PROSITE:PS50103 SMART:SM00356 GO:GO:0008270
            GO:GO:0003676 GeneTree:ENSGT00390000009627 EMBL:AAEX03004276
            Ensembl:ENSCAFT00000023887 NextBio:20862973 Uniprot:E2RBM0
        Length = 164

 Score = 91 (37.1 bits), Expect = 0.00078, Sum P(2) = 0.00078
 Identities = 23/73 (31%), Positives = 32/73 (43%)

Query:   155 PDCRYYMNTGTCKYGADCKFHH--PKERIAQS-----AASNIGPLGLPSRPGQAICSNYS 207
             P+C +Y   G C    +C F H  P+ +I             GPL       + IC NY 
Sbjct:    92 PECYFYSKFGECS-NKECPFLHIDPESKIKDCPWYDRGFCKHGPLCRHRHTRRVICVNY- 149

Query:   208 MYGICKFGPTCRF 220
             + G C  GP+C+F
Sbjct:   150 LVGFCPEGPSCKF 162

 Score = 48 (22.0 bits), Expect = 0.00078, Sum P(2) = 0.00078
 Identities = 9/20 (45%), Positives = 13/20 (65%)

Query:     8 CPYYMRTGSCKFGVACKFHH 27
             C +++R G CK G  C+F H
Sbjct:    66 CKHWLR-GLCKKGDQCEFLH 84


>SGD|S000005975
            symbol:LEE1 "Zinc-finger protein of unknown function"
            species:4932 "Saccharomyces cerevisiae" [GO:0008150
            "biological_process" evidence=ND] [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0046872 "metal ion binding"
            evidence=IEA] InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103
            SMART:SM00356 SGD:S000005975 GO:GO:0046872 GO:GO:0008270
            GO:GO:0003676 EMBL:BK006949 EMBL:U39205 eggNOG:COG5084
            InterPro:IPR026290 PANTHER:PTHR11224 OrthoDB:EOG49S9H4 EMBL:X86735
            EMBL:AY693073 PIR:S60936 RefSeq:NP_015271.1
            ProteinModelPortal:Q02799 SMR:Q02799 STRING:Q02799
            EnsemblFungi:YPL054W GeneID:856053 KEGG:sce:YPL054W CYGD:YPL054w
            NextBio:981014 Genevestigator:Q02799 GermOnline:YPL054W
            Uniprot:Q02799
        Length = 301

 Score = 111 (44.1 bits), Expect = 0.00089, P = 0.00089
 Identities = 58/218 (26%), Positives = 92/218 (42%)

Query:    68 SLQRAPYLSSRLQGTQSYMPLIVSPSQGIVPAP-GWNTYMGNIGPLSPTSIAGSNLIYSS 126
             S+   P  ++R + +QS   ++ S  Q     P  +N    N+  LS     G+ L +SS
Sbjct:     8 SVSNHPGGNAR-RNSQSANEMLASQIQDFQNIPRSFNDSNANVN-LSKNCTVGNQLPFSS 65

Query:   127 RNQGDLGAGAQMHILSASSQNLPERPDQPDCRYYMNTGTCKYGADCKFHHPKERIAQSAA 186
             R Q  +     + I   +SQ   +    P C+++   G C+ G+ C F H  + I  S+A
Sbjct:    66 RQQKIIME--HLLITKNNSQQQKDYSHVP-CKFF-KMGNCQAGSSCPFSHSPDII--SSA 119

Query:   187 SNIGPLGLPSRPGQAICSNYSMYGICKFGPTCRFDHPYA-GYPINYGXXXXXXXXXXXXX 245
             +N     LP       C  Y   G CKFG  C   H    G+ +N               
Sbjct:   120 NN-----LP-------CK-YFAKGNCKFGNKCVNAHVLPNGFKMNSKEPIDITPPSQNNY 166

Query:   246 MNHQAISATHSIETSP--DASSKIPNWVQNSDAVSVQH 281
             ++H A SA+ S  TSP   A ++  +   N++  S Q+
Sbjct:   167 LSH-ARSASFSTYTSPPLSAQTEFSHSASNANYFSSQY 203


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.313   0.129   0.406    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      319       306   0.00099  115 3  11 23  0.46    34
                                                     33  0.45    37
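
The bit scores shown in parentheses next to each raw score can be reproduced from the Lambda and K values in the table above using the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2. The sketch below is illustrative only. It assumes that the gapped row (Q=9,R=2: Lambda 0.244, K 0.0300) is the one applied to the reported scores; the report does not say so, but the arithmetic agrees with the values printed for the hits in this report. The function name bit_score is hypothetical.

```python
import math

# Gapped Karlin-Altschul parameters copied from the Q=9,R=2 row of the
# parameter table. Assumption: these are the values behind the reported bits.
LAMBDA = 0.244
K = 0.0300


def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    # Raw scores taken from the "Score = ..." lines in this report.
    for raw in (114, 111, 101, 91, 48, 43):
        print(f"Score = {raw} ({bit_score(raw):.1f} bits)")
```

Run against the raw scores above, this gives 45.2, 44.1, 40.6, 37.1, 22.0 and 20.2 bits, matching the values printed in the "Score =" lines.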


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  48
  No. of states in DFA:  606 (64 KB)
  Total size of DFA:  237 KB (2126 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  28.18u 0.10s 28.28t   Elapsed:  00:00:01
  Total cpu time:  28.19u 0.10s 28.29t   Elapsed:  00:00:01
  Start:  Fri May 10 20:08:30 2013   End:  Fri May 10 20:08:31 2013
