BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>033320
MLVDEFPERPGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKGLPLRPGQNV
CSYYSRYGICKFGPACKYDHPIHPDASAEYGLDPPPSFGDSTTRQETGMAGTGNGNGSDK
NI
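
The query is given above in FASTA format, and the raw report further down states its length as 122 letters. A minimal Python sketch for reading such a record and checking that length follows. The helper name read_fasta is an assumption made for this illustration and is not part of any BLAST tool.

```python
def read_fasta(text):
    """Return a list of (header, sequence) pairs from FASTA-formatted text."""
    records = []
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:], []
        else:
            chunks.append(line)
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


# Query record copied from this report.
QUERY_FASTA = """>033320
MLVDEFPERPGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKGLPLRPGQNV
CSYYSRYGICKFGPACKYDHPIHPDASAEYGLDPPPSFGDSTTRQETGMAGTGNGNGSDK
NI
"""

records = read_fasta(QUERY_FASTA)
name, seq = records[0]
print(name, len(seq))
```

The printed length can be compared with the "(122 letters)" line in the raw report.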

High Scoring Gene Products

Symbol | Full name | Information | P value
AT3G48440 | - | protein from Arabidopsis thaliana | 1.5e-37
AT1G48195 | - | protein from Arabidopsis thaliana | 1.6e-31
ZFN3 | AT5G16540 | protein from Arabidopsis thaliana | 6.2e-30
ZFN1 | AT3G02830 | protein from Arabidopsis thaliana | 1.5e-28
AT2G47850 | - | protein from Arabidopsis thaliana | 6.3e-26
AT3G06410 | - | protein from Arabidopsis thaliana | 9.8e-26
AT5G18550 | - | protein from Arabidopsis thaliana | 1.1e-23
AT1G04990 | - | protein from Arabidopsis thaliana | 1.8e-22
HUA1 | ENHANCER OF AG-4 1 | protein from Arabidopsis thaliana | 2.6e-18
AT1G29570 | - | protein from Arabidopsis thaliana | 1.0e-07
mex-1 | - | gene from Caenorhabditis elegans | 4.2e-07
cpsf4 | cleavage and polyadenylation specificity factor 30 kDa subunit | gene from Dictyostelium discoideum | 1.9e-05
YTH1 | - | gene_product from Candida albicans | 2.0e-05
YTH1 | mRNA 3'-end-processing protein YTH1 | protein from Candida albicans SC5314 | 2.0e-05
orf19.5334 | - | gene_product from Candida albicans | 6.4e-05
YTH1 | Essential RNA-binding component of cleavage and polyadenylation factor | gene from Saccharomyces cerevisiae | 0.00011
DDB_G0270816 | Zinc finger CCCH domain-containing protein 14 | gene from Dictyostelium discoideum | 0.00011
CPSF4 | Uncharacterized protein | protein from Gallus gallus | 0.00013
Mkrn1 | Makorin 1 | protein from Drosophila melanogaster | 0.00014
ccch-1 | - | gene from Caenorhabditis elegans | 0.00015
TIS11 | mRNA-binding protein expressed during iron starvation | gene from Saccharomyces cerevisiae | 0.00018
AT1G29560 | - | protein from Arabidopsis thaliana | 0.00019
CPSF4L | Putative cleavage and polyadenylation-specificity factor subunit 4-like protein | protein from Homo sapiens | 0.00021
CPSF4 | Cleavage and polyadenylation-specificity factor subunit 4 | protein from Homo sapiens | 0.00026
Cpsf4 | cleavage and polyadenylation specific factor 4 | protein from Mus musculus | 0.00026
CPSF4 | Uncharacterized protein | protein from Canis lupus familiaris | 0.00026
CPSF4 | Cleavage and polyadenylation-specificity factor subunit 4 | protein from Homo sapiens | 0.00034
CPSF4 | Cleavage and polyadenylation-specificity factor subunit 4 | protein from Homo sapiens | 0.00035
CPSF4L | Putative cleavage and polyadenylation specificity factor subunit 4-like protein | protein from Homo sapiens | 0.00048
CPSF4 | Cleavage and polyadenylation specificity factor subunit 4 | protein from Bos taurus | 0.00060
LOC100738395 | Uncharacterized protein | protein from Sus scrofa | 0.00060
Cpsf4 | cleavage and polyadenylation specific factor 4 | gene from Rattus norvegicus | 0.00060
CPSF4 | Uncharacterized protein | protein from Canis lupus familiaris | 0.00073
CPSF4 | Cleavage and polyadenylation specificity factor subunit 4 | protein from Homo sapiens | 0.00073
ZC3H3 | Uncharacterized protein | protein from Gallus gallus | 0.00077
CPSF4 | Uncharacterized protein | protein from Canis lupus familiaris | 0.00092

The BLAST search returned 2 gene products which did not match your query constraints. Please see the full BLAST report below for the details.

Raw BLAST Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  033320
        (122 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2101170 - symbol:AT3G48440 species:3702 "Arabi...   403  1.5e-37   1
TAIR|locus:1006230718 - symbol:AT1G48195 species:3702 "Ar...   346  1.6e-31   1
TAIR|locus:2171407 - symbol:ZFN3 "zinc finger nuclease 3"...   331  6.2e-30   1
TAIR|locus:2075477 - symbol:ZFN1 "zinc finger protein 1" ...   318  1.5e-28   1
TAIR|locus:2043368 - symbol:AT2G47850 species:3702 "Arabi...   296  6.3e-26   1
TAIR|locus:2081066 - symbol:AT3G06410 species:3702 "Arabi...   294  9.8e-26   1
TAIR|locus:2182988 - symbol:AT5G18550 species:3702 "Arabi...   276  1.1e-23   1
TAIR|locus:2010562 - symbol:AT1G04990 species:3702 "Arabi...   262  1.8e-22   1
TAIR|locus:2087775 - symbol:HUA1 "ENHANCER OF AG-4 1" spe...   229  2.6e-18   1
TAIR|locus:2013763 - symbol:AT1G29570 species:3702 "Arabi...   113  1.0e-07   2
WB|WBGene00003228 - symbol:mex-1 species:6239 "Caenorhabd...   112  4.2e-07   2
ASPGD|ASPL0000062209 - symbol:AN0298 species:162425 "Emer...   117  8.4e-07   1
DICTYBASE|DDB_G0270148 - symbol:cpsf4 "cleavage and polya...    95  1.9e-05   2
CGD|CAL0005897 - symbol:YTH1 species:5476 "Candida albica...   103  2.0e-05   1
UNIPROTKB|Q59T36 - symbol:YTH1 "mRNA 3'-end-processing pr...   103  2.0e-05   1
ASPGD|ASPL0000000121 - symbol:AN6331 species:162425 "Emer...    98  3.3e-05   2
POMBASE|SPAC227.08c - symbol:yth1 "mRNA cleavage and poly...    97  4.4e-05   1
CGD|CAL0004295 - symbol:orf19.5334 species:5476 "Candida ...    98  6.4e-05   1
SGD|S000006311 - symbol:YTH1 "Essential RNA-binding compo...    96  0.00011   1
DICTYBASE|DDB_G0270816 - symbol:DDB_G0270816 "Zinc finger...   102  0.00011   1
UNIPROTKB|E1BV31 - symbol:CPSF4 "Uncharacterized protein"...    97  0.00013   1
FB|FBgn0029152 - symbol:Mkrn1 "Makorin 1" species:7227 "D...   100  0.00014   1
WB|WBGene00009532 - symbol:ccch-1 species:6239 "Caenorhab...    89  0.00015   2
SGD|S000004126 - symbol:TIS11 "mRNA-binding protein expre...    97  0.00018   1
TAIR|locus:2013758 - symbol:AT1G29560 species:3702 "Arabi...   101  0.00019   1
UNIPROTKB|H9KVA5 - symbol:CPSF4L "Putative cleavage and p...    90  0.00021   1
UNIPROTKB|D4A905 - symbol:Cpsf4 "Cleavage and polyadenyla...    95  0.00022   1
UNIPROTKB|C9JEV9 - symbol:CPSF4 "Cleavage and polyadenyla...    93  0.00026   1
MGI|MGI:1861602 - symbol:Cpsf4 "cleavage and polyadenylat...    93  0.00026   1
UNIPROTKB|E2RBK7 - symbol:CPSF4 "Uncharacterized protein"...    93  0.00026   1
UNIPROTKB|G3V789 - symbol:Mkrn1 "Protein Mkrn1" species:1...    75  0.00033   2
UNIPROTKB|B7Z7B0 - symbol:CPSF4 "Cleavage and polyadenyla...    91  0.00034   1
UNIPROTKB|C9K0K2 - symbol:CPSF4 "Cleavage and polyadenyla...    88  0.00035   1
UNIPROTKB|A6NMK7 - symbol:CPSF4L "Putative cleavage and p...    89  0.00048   1
UNIPROTKB|O19137 - symbol:CPSF4 "Cleavage and polyadenyla...    91  0.00060   1
UNIPROTKB|I3LCK9 - symbol:LOC100738395 "Uncharacterized p...    91  0.00060   1
RGD|620440 - symbol:Cpsf4 "cleavage and polyadenylation s...    91  0.00060   1
UNIPROTKB|J9P398 - symbol:CPSF4 "Uncharacterized protein"...    91  0.00073   1
UNIPROTKB|O95639 - symbol:CPSF4 "Cleavage and polyadenyla...    91  0.00073   1
UNIPROTKB|F1NHU3 - symbol:ZC3H3 "Uncharacterized protein"...    93  0.00077   1
UNIPROTKB|E2RBM0 - symbol:CPSF4 "Uncharacterized protein"...    88  0.00092   1
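
Each line of the hit list above ends with three columns: the high score, the smallest sum probability P(N), and N. A minimal Python sketch for splitting such lines into fields and filtering them against the 0.001 threshold listed under Parameters follows. The regular expression, the field names, and the inclusive comparison against the threshold are assumptions made for this illustration, not a documented WU-BLAST interface.

```python
import re

# One hit line: database id, " - ", truncated description, score, P(N), N.
HIT_LINE = re.compile(
    r"^(?P<id>\S+) - (?P<desc>.*?)\s+(?P<score>\d+)\s+(?P<p>\d\S*)\s+(?P<n>\d+)\s*$"
)

# A few lines copied from the hit list in this report.
SAMPLE = """\
TAIR|locus:2101170 - symbol:AT3G48440 species:3702 "Arabi...   403  1.5e-37   1
TAIR|locus:2087775 - symbol:HUA1 "ENHANCER OF AG-4 1" spe...   229  2.6e-18   1
WB|WBGene00003228 - symbol:mex-1 species:6239 "Caenorhabd...   112  4.2e-07   2
UNIPROTKB|E2RBM0 - symbol:CPSF4 "Uncharacterized protein"...    88  0.00092   1
"""


def parse_hits(text):
    """Parse hit-list lines into dictionaries; lines that do not match are skipped."""
    hits = []
    for line in text.splitlines():
        m = HIT_LINE.match(line)
        if m:
            hits.append({
                "id": m.group("id"),
                "desc": m.group("desc"),
                "score": int(m.group("score")),
                "p": float(m.group("p")),
                "n": int(m.group("n")),
            })
    return hits


THRESHOLD = 0.001  # value listed under Parameters in this report

hits = parse_hits(SAMPLE)
kept = [h for h in hits if h["p"] <= THRESHOLD]      # assumed inclusive cutoff
strong = [h for h in hits if h["p"] <= 1e-10]        # stricter, illustrative cutoff

for h in kept:
    print(h["id"], h["score"], h["p"], h["n"])
```

Because the descriptions in the hit list are truncated with "...", the full symbol and species should be taken from the per-hit records that follow rather than from these lines.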


>TAIR|locus:2101170
            symbol:AT3G48440 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA;ISS] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=RCA] InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103
            SMART:SM00356 GO:GO:0005634 EMBL:CP002686 GenomeReviews:BA000014_GR
            GO:GO:0046872 GO:GO:0003677 GO:GO:0008270 EMBL:AL049659
            HOGENOM:HOG000237733 EMBL:BT033139 IPI:IPI00517303 PIR:T06698
            RefSeq:NP_190414.1 UniGene:At.50258 ProteinModelPortal:Q9STM4
            SMR:Q9STM4 PaxDb:Q9STM4 PRIDE:Q9STM4 EnsemblPlants:AT3G48440.1
            GeneID:824003 KEGG:ath:AT3G48440 TAIR:At3g48440 eggNOG:NOG288127
            InParanoid:Q9STM4 OMA:PEWNGYQ PhylomeDB:Q9STM4
            ProtClustDB:CLSN2719348 Genevestigator:Q9STM4 GermOnline:AT3G48440
            Uniprot:Q9STM4
        Length = 448

 Score = 403 (146.9 bits), Expect = 1.5e-37, P = 1.5e-37
 Identities = 66/98 (67%), Positives = 80/98 (81%)

Query:     1 MLVDEFPERPGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKGLPLRPGQNV 60
             M  +EFPERP QPECSY+++TGDCK+K NCKYHHPKNR+PK PP  L+DKGLPLRP QN+
Sbjct:   338 MPAEEFPERPDQPECSYYMKTGDCKFKFNCKYHHPKNRLPKLPPYALNDKGLPLRPDQNI 397

Query:    61 CSYYSRYGICKFGPACKYDHPIHPDASAEYG---LDPP 95
             C+YYSRYGICKFGPAC++DH + P  S E     ++PP
Sbjct:   398 CTYYSRYGICKFGPACRFDHSVQPPYSTESSQAIVEPP 435

 Score = 200 (75.5 bits), Expect = 2.5e-15, P = 2.5e-15
 Identities = 39/93 (41%), Positives = 55/93 (59%)

Query:    11 GQPECSYFLRTGDCKYKSNCKYHH--PKNRIPKSPPCTLSDKGLPLRPGQNVCSYYSRYG 68
             G  +C Y+ RTG CKY   C+++H  PK+ +  +P   L+  GLPLRPG+  C YY R G
Sbjct:   160 GLIDCKYYFRTGGCKYGETCRFNHTIPKSGLASAPE--LNFLGLPLRPGEVECPYYMRNG 217

Query:    69 ICKFGPACKYDHPIHPDASAEYGLDPPPSFGDS 101
              CK+G  CK++HP   D +   G D P   G++
Sbjct:   218 SCKYGAECKFNHP---DPTTIGGTDSPSFRGNN 247

 Score = 166 (63.5 bits), Expect = 1.3e-11, P = 1.3e-11
 Identities = 33/81 (40%), Positives = 48/81 (59%)

Query:     6 FPERPGQPECSYFLRTGDCKYKSNCKYHHP---KNRIPKSPPCT-LSDKGLPLRPGQNVC 61
             +P RPG  +CS+++RTG CK+ S+CK++HP   K +I +        D G  L  G   C
Sbjct:   107 YPVRPGAEDCSFYMRTGSCKFGSSCKFNHPLARKFQIARDNKVREKEDDGGKL--GLIDC 164

Query:    62 SYYSRYGICKFGPACKYDHPI 82
              YY R G CK+G  C+++H I
Sbjct:   165 KYYFRTGGCKYGETCRFNHTI 185

 Score = 114 (45.2 bits), Expect = 5.4e-06, P = 5.4e-06
 Identities = 18/35 (51%), Positives = 25/35 (71%)

Query:    48 SDKGLPLRPGQNVCSYYSRYGICKFGPACKYDHPI 82
             S+   P+RPG   CS+Y R G CKFG +CK++HP+
Sbjct:   103 SENVYPVRPGAEDCSFYMRTGSCKFGSSCKFNHPL 137

 Score = 111 (44.1 bits), Expect = 1.1e-05, P = 1.1e-05
 Identities = 16/29 (55%), Positives = 22/29 (75%)

Query:     7 PERPGQPECSYFLRTGDCKYKSNCKYHHP 35
             P RPG+ EC Y++R G CKY + CK++HP
Sbjct:   202 PLRPGEVECPYYMRNGSCKYGAECKFNHP 230
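
Each alignment above is preceded by a Score line and an Identities line. A minimal Python sketch for reading those two lines follows. It also illustrates two points visible in the report. First, the percentages appear to be truncated rather than rounded (for example 39/93 is reported as 41%). Second, Expect and P are numerically equal for small values, which is consistent with the Poisson relation P = 1 - exp(-E). The function names and regular expressions are assumptions made for this illustration.

```python
import math
import re

SCORE_RE = re.compile(
    r"Score = (\d+) \(([\d.]+) bits\), Expect = ([\d.eE+-]+), "
    r"(?:Sum )?P(?:\(\d+\))? = ([\d.eE+-]+)"
)
IDENT_RE = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\), Positives = (\d+)/(\d+) \((\d+)%\)"
)


def parse_hsp_header(text):
    """Extract score, bits, Expect, P, identities and positives from an HSP header."""
    s = SCORE_RE.search(text)
    i = IDENT_RE.search(text)
    if s is None or i is None:
        raise ValueError("not an HSP header")
    return {
        "score": int(s.group(1)),
        "bits": float(s.group(2)),
        "expect": float(s.group(3)),
        "p": float(s.group(4)),
        "identities": (int(i.group(1)), int(i.group(2))),
        "positives": (int(i.group(4)), int(i.group(5))),
        "reported_pct": (int(i.group(3)), int(i.group(6))),
    }


def truncated_percent(matches, length):
    """Integer percentage, truncated toward zero as the report appears to do."""
    return 100 * matches // length


def p_from_expect(expect):
    """Poisson probability of at least one chance hit: P = 1 - exp(-E)."""
    return -math.expm1(-expect)  # expm1 keeps precision for very small E


HEADER = (
    " Score = 403 (146.9 bits), Expect = 1.5e-37, P = 1.5e-37\n"
    " Identities = 66/98 (67%), Positives = 80/98 (81%)\n"
)

hsp = parse_hsp_header(HEADER)
print(hsp["score"], hsp["bits"], hsp["expect"])
print(truncated_percent(*hsp["identities"]), truncated_percent(*hsp["positives"]))
print(p_from_expect(hsp["expect"]))
```

The optional "Sum " and "(N)" parts of the pattern are there so that lines of the form "Sum P(2) = ..." found later in the report are also accepted.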


>TAIR|locus:1006230718
            symbol:AT1G48195 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA;ISS] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0008270 "zinc ion binding" evidence=IEA]
            InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103 SMART:SM00356
            EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0046872 GO:GO:0003677
            GO:GO:0008270 EMBL:AC023673 EMBL:BX818039 IPI:IPI00522286
            RefSeq:NP_973988.1 UniGene:At.38465 UniGene:At.63148
            ProteinModelPortal:Q3ECU8 SMR:Q3ECU8 EnsemblPlants:AT1G48195.1
            GeneID:2745816 KEGG:ath:AT1G48195 TAIR:At1g48195 eggNOG:NOG304278
            HOGENOM:HOG000107451 InParanoid:Q3ECU8 OMA:AICPHYS PhylomeDB:Q3ECU8
            ProtClustDB:CLSN2681286 Genevestigator:Q3ECU8 Uniprot:Q3ECU8
        Length = 82

 Score = 346 (126.9 bits), Expect = 1.6e-31, P = 1.6e-31
 Identities = 55/80 (68%), Positives = 66/80 (82%)

Query:     1 MLVDEFPERPGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKGLPLRPGQNV 60
             M  ++FPERPG+PECSY+LRTG+C  K NCKYHHPKN  P  P CTL+DKGLPLRPGQ +
Sbjct:     1 MSEEKFPERPGEPECSYYLRTGNCYLKQNCKYHHPKNITPSEPQCTLNDKGLPLRPGQAI 60

Query:    61 CSYYSRYGICKFGPACKYDH 80
             C +YSR+GIC+ GP CK+DH
Sbjct:    61 CPHYSRFGICRSGPTCKFDH 80

 Score = 95 (38.5 bits), Expect = 6.3e-05, P = 6.3e-05
 Identities = 17/35 (48%), Positives = 21/35 (60%)

Query:    47 LSDKGLPLRPGQNVCSYYSRYGICKFGPACKYDHP 81
             +S++  P RPG+  CSYY R G C     CKY HP
Sbjct:     1 MSEEKFPERPGEPECSYYLRTGNCYLKQNCKYHHP 35
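
Every Score line in this report gives a raw score and a value in bits. Under Karlin-Altschul statistics the bit score is a linear function of the raw score, bits = (lambda * S - ln K) / ln 2. A minimal Python sketch follows that fits a straight line through the (raw score, bits) pairs of the first two hits. The lambda and K it prints are estimates recovered from rounded values and are assumptions of this illustration, since the excerpt does not show the parameters the program actually used.

```python
import math

# (raw score, bits) pairs copied from the Score lines of the first two hits.
PAIRS = [
    (403, 146.9), (200, 75.5), (166, 63.5), (114, 45.2), (111, 44.1),
    (346, 126.9), (95, 38.5),
]


def fit_line(pairs):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(pairs)
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in pairs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


slope, intercept = fit_line(PAIRS)

# bits = (lambda / ln 2) * S - ln K / ln 2, so:
lam_estimate = slope * math.log(2)
k_estimate = 2.0 ** (-intercept)

# Largest deviation between the fitted line and the reported bit values.
max_residual = max(abs(slope * x + intercept - y) for x, y in PAIRS)

print(f"bits per raw score unit: {slope:.4f}")
print(f"estimated lambda: {lam_estimate:.4f}, estimated K: {k_estimate:.4f}")
print(f"largest residual: {max_residual:.3f} bits")
```

A small residual indicates that the bit values in the report are consistent with a single linear rescaling of the raw scores.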


>TAIR|locus:2171407
            symbol:ZFN3 "zinc finger nuclease 3" species:3702
            "Arabidopsis thaliana" [GO:0003676 "nucleic acid binding"
            evidence=IEA;ISS] [GO:0005634 "nucleus" evidence=ISM] [GO:0008270
            "zinc ion binding" evidence=IEA] [GO:0003677 "DNA binding"
            evidence=TAS] [GO:0004518 "nuclease activity" evidence=TAS]
            InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103 SMART:SM00356
            GO:GO:0005634 EMBL:CP002688 GenomeReviews:BA000015_GR GO:GO:0046872
            GO:GO:0003677 GO:GO:0008270 EMBL:AB005242 GO:GO:0004518
            HOGENOM:HOG000237733 EMBL:AF138872 EMBL:AY084634 EMBL:AY128342
            EMBL:BT000014 EMBL:BX831982 IPI:IPI00516322 IPI:IPI00528450
            IPI:IPI00528912 RefSeq:NP_568332.2 RefSeq:NP_851041.1
            RefSeq:NP_974790.1 UniGene:At.21711 ProteinModelPortal:Q8L7N8
            SMR:Q8L7N8 EnsemblPlants:AT5G16540.1 GeneID:831516
            KEGG:ath:AT5G16540 GeneFarm:4900 TAIR:At5g16540 eggNOG:NOG281021
            InParanoid:Q8L7N8 OMA:SAGNQGM PhylomeDB:Q8L7N8
            ProtClustDB:CLSN2690167 Genevestigator:Q8L7N8 Uniprot:Q8L7N8
        Length = 375

 Score = 331 (121.6 bits), Expect = 6.2e-30, P = 6.2e-30
 Identities = 56/104 (53%), Positives = 73/104 (70%)

Query:     6 FPERPGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKGLPLRPGQNVCSYYS 65
             FPERPGQPEC ++++TGDCK+ + CK+HHP++R    P C LS  GLPLRPG+ +C +YS
Sbjct:   240 FPERPGQPECQFYMKTGDCKFGTVCKFHHPRDRQTPPPDCVLSSVGLPLRPGEPLCVFYS 299

Query:    66 RYGICKFGPACKYDHPIHPDASAEYGLDPPPSFGDSTTRQETGM 109
             RYGICKFGP+CK+DHP+           P PS   S+  QET +
Sbjct:   300 RYGICKFGPSCKFDHPMRVFTYNNNTASPSPS---SSLHQETAI 340

 Score = 234 (87.4 bits), Expect = 2.0e-19, P = 2.0e-19
 Identities = 37/77 (48%), Positives = 53/77 (68%)

Query:     5 EFPERPGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKGLPLRPGQNVCSYY 64
             E+PER GQPEC ++L+TG CK+   CK+HHP+N+       +++    PLRP ++ CSY+
Sbjct:    83 EYPERIGQPECEFYLKTGTCKFGVTCKFHHPRNKAGIDGSVSVNVLSYPLRPNEDDCSYF 142

Query:    65 SRYGICKFGPACKYDHP 81
              R G CKFG  CK++HP
Sbjct:   143 LRIGQCKFGGTCKFNHP 159

 Score = 200 (75.5 bits), Expect = 1.5e-15, P = 1.5e-15
 Identities = 38/83 (45%), Positives = 53/83 (63%)

Query:     1 MLVD-EFPERPGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKG-LPLRPGQ 58
             M VD  +PER G+P+C+Y++RTG C++ S C+++HP +R  K    T   KG  P R GQ
Sbjct:    33 MGVDGSYPERHGEPDCAYYIRTGLCRFGSTCRFNHPHDR--KLVIATARIKGEYPERIGQ 90

Query:    59 NVCSYYSRYGICKFGPACKYDHP 81
               C +Y + G CKFG  CK+ HP
Sbjct:    91 PECEFYLKTGTCKFGVTCKFHHP 113

 Score = 114 (45.2 bits), Expect = 4.0e-06, P = 4.0e-06
 Identities = 20/49 (40%), Positives = 31/49 (63%)

Query:     6 FPERPGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKGLPL 54
             +P RP + +CSYFLR G CK+   CK++HP+ +   S    +S +G P+
Sbjct:   130 YPLRPNEDDCSYFLRIGQCKFGGTCKFNHPQTQ---STNLMVSVRGSPV 175

 Score = 105 (42.0 bits), Expect = 3.8e-05, P = 3.8e-05
 Identities = 19/48 (39%), Positives = 24/48 (50%)

Query:    34 HPKNRIPKSPPCTLSDKGLPLRPGQNVCSYYSRYGICKFGPACKYDHP 81
             H  N +P        +   P RPGQ  C +Y + G CKFG  CK+ HP
Sbjct:   222 HSGNSVPLGFYALPRENVFPERPGQPECQFYMKTGDCKFGTVCKFHHP 269


>TAIR|locus:2075477
            symbol:ZFN1 "zinc finger protein 1" species:3702
            "Arabidopsis thaliana" [GO:0003676 "nucleic acid binding"
            evidence=IEA;ISS] [GO:0005634 "nucleus" evidence=ISM] [GO:0008270
            "zinc ion binding" evidence=IEA] [GO:0003677 "DNA binding"
            evidence=TAS] [GO:0004518 "nuclease activity" evidence=TAS]
            [GO:0005829 "cytosol" evidence=IDA] [GO:0010313 "phytochrome
            binding" evidence=IPI] [GO:0017148 "negative regulation of
            translation" evidence=IMP] [GO:0048027 "mRNA 5'-UTR binding"
            evidence=IPI] InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103
            SMART:SM00356 GO:GO:0005829 GO:GO:0005634 EMBL:CP002686
            GenomeReviews:BA000014_GR GO:GO:0017148 GO:GO:0046872 GO:GO:0003677
            GO:GO:0008270 GO:GO:0048027 GO:GO:0004518 HOGENOM:HOG000237733
            EMBL:AF138743 EMBL:AC018363 EMBL:AK117978 EMBL:BT025966
            IPI:IPI00539955 PIR:T48874 RefSeq:NP_566183.1 UniGene:At.23706
            ProteinModelPortal:Q8GXX7 SMR:Q8GXX7 STRING:Q8GXX7 PaxDb:Q8GXX7
            PRIDE:Q8GXX7 EnsemblPlants:AT3G02830.1 GeneID:821230
            KEGG:ath:AT3G02830 GeneFarm:4898 TAIR:At3g02830 eggNOG:NOG329662
            InParanoid:Q8GXX7 OMA:SSDDQQR PhylomeDB:Q8GXX7
            ProtClustDB:CLSN2917075 Genevestigator:Q8GXX7 GermOnline:AT3G02830
            Uniprot:Q8GXX7
        Length = 397

 Score = 318 (117.0 bits), Expect = 1.5e-28, P = 1.5e-28
 Identities = 59/123 (47%), Positives = 81/123 (65%)

Query:     6 FPERPGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKGLPLRPGQNVCSYYS 65
             FPERPGQPEC ++++TGDCK+ + CK+HHP++R    P C LS  GLPLRPG+ +C +Y+
Sbjct:   271 FPERPGQPECQFYMKTGDCKFGTVCKFHHPRDRQAPPPDCLLSSIGLPLRPGEPLCVFYT 330

Query:    66 RYGICKFGPACKYDHPI----HPDASAEYGLDPPPSFGDST--TRQETGMAGTGNGNGSD 119
             RYGICKFGP+CK+DHP+    + + ++E       S G S   +  ET  A T   +G D
Sbjct:   331 RYGICKFGPSCKFDHPMRVFTYDNTASETDEVVETSTGKSRRLSVSETRQAAT-TSSGKD 389

Query:   120 KNI 122
               I
Sbjct:   390 TTI 392

 Score = 231 (86.4 bits), Expect = 6.1e-19, P = 6.1e-19
 Identities = 38/77 (49%), Positives = 53/77 (68%)

Query:     5 EFPERPGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKGLPLRPGQNVCSYY 64
             E+PER GQPEC Y+L+TG CK+   CK+HHP+N+   +   +L+  G PLR  +  C+Y+
Sbjct:    81 EYPERIGQPECEYYLKTGTCKFGVTCKFHHPRNKAGIAGRVSLNMLGYPLRSNEVDCAYF 140

Query:    65 SRYGICKFGPACKYDHP 81
              R G CKFG  CK++HP
Sbjct:   141 LRTGHCKFGGTCKFNHP 157

 Score = 211 (79.3 bits), Expect = 1.1e-16, P = 1.1e-16
 Identities = 37/82 (45%), Positives = 53/82 (64%)

Query:     1 MLVDEFPERPGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKG-LPLRPGQN 59
             M    +PERPG+P+CSY++RTG C++ S C+++HP++R  +    T   +G  P R GQ 
Sbjct:    32 METGSYPERPGEPDCSYYIRTGLCRFGSTCRFNHPRDR--ELVIATARMRGEYPERIGQP 89

Query:    60 VCSYYSRYGICKFGPACKYDHP 81
              C YY + G CKFG  CK+ HP
Sbjct:    90 ECEYYLKTGTCKFGVTCKFHHP 111

 Score = 110 (43.8 bits), Expect = 1.2e-05, P = 1.2e-05
 Identities = 17/36 (47%), Positives = 24/36 (66%)

Query:    46 TLSDKGLPLRPGQNVCSYYSRYGICKFGPACKYDHP 81
             T+     P RPG+  CSYY R G+C+FG  C+++HP
Sbjct:    31 TMETGSYPERPGEPDCSYYIRTGLCRFGSTCRFNHP 66

 Score = 104 (41.7 bits), Expect = 5.4e-05, P = 5.4e-05
 Identities = 15/31 (48%), Positives = 23/31 (74%)

Query:     6 FPERPGQPECSYFLRTGDCKYKSNCKYHHPK 36
             +P R  + +C+YFLRTG CK+   CK++HP+
Sbjct:   128 YPLRSNEVDCAYFLRTGHCKFGGTCKFNHPQ 158


>TAIR|locus:2043368
            symbol:AT2G47850 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA;ISS] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0008270 "zinc ion binding" evidence=IEA]
            InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103 SMART:SM00356
            GO:GO:0005634 EMBL:CP002685 GenomeReviews:CT485783_GR GO:GO:0046872
            GO:GO:0003677 GO:GO:0008270 EMBL:AC005309 EMBL:BT030391
            EMBL:BT004106 IPI:IPI00519400 PIR:C84920 RefSeq:NP_001078078.1
            RefSeq:NP_182306.2 UniGene:At.21006 ProteinModelPortal:Q84W91
            SMR:Q84W91 PaxDb:Q84W91 PRIDE:Q84W91 EnsemblPlants:AT2G47850.1
            EnsemblPlants:AT2G47850.3 GeneID:819397 KEGG:ath:AT2G47850
            TAIR:At2g47850 eggNOG:NOG312935 HOGENOM:HOG000237733
            InParanoid:Q84W91 OMA:RYGVACK PhylomeDB:Q84W91
            ProtClustDB:CLSN2680305 Genevestigator:Q84W91 GermOnline:AT2G47850
            Uniprot:Q84W91
        Length = 468

 Score = 296 (109.3 bits), Expect = 6.3e-26, P = 6.3e-26
 Identities = 45/77 (58%), Positives = 60/77 (77%)

Query:     6 FPERPGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKGLPLRPGQNVCSYYS 65
             FPERPG+PEC Y+L+TGDCK+ ++CK+HHP++R+P    C LS  GLPLRPG   C++Y 
Sbjct:   286 FPERPGEPECQYYLKTGDCKFGTSCKFHHPRDRVPPRANCVLSPIGLPLRPGVQRCTFYV 345

Query:    66 RYGICKFGPACKYDHPI 82
             + G CKFG  CK+DHP+
Sbjct:   346 QNGFCKFGSTCKFDHPM 362

 Score = 233 (87.1 bits), Expect = 6.9e-19, P = 6.9e-19
 Identities = 41/93 (44%), Positives = 58/93 (62%)

Query:     5 EFPERPGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKGLPLRPGQNVCSYY 64
             ++PER G+P C ++L+TG CK+ ++CK+HHPKN         L+  G P+R G N CSYY
Sbjct:    86 QYPERFGEPPCQFYLKTGTCKFGASCKFHHPKNAGGSMSHVPLNIYGYPVREGDNECSYY 145

Query:    65 SRYGICKFGPACKYDHPIHPDASAEYGLDPPPS 97
              + G CKFG  CK+ HP  P  +    + PPP+
Sbjct:   146 LKTGQCKFGITCKFHHP-QPAGTT---VPPPPA 174

 Score = 213 (80.0 bits), Expect = 1.1e-16, P = 1.1e-16
 Identities = 37/88 (42%), Positives = 55/88 (62%)

Query:     4 DEFPERPGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKG-LPLRPGQNVCS 62
             D +PERPG P+C+Y++RTG C Y + C+Y+HP++R   S   T+   G  P R G+  C 
Sbjct:    40 DSYPERPGAPDCAYYMRTGVCGYGNRCRYNHPRDRA--SVEATVRATGQYPERFGEPPCQ 97

Query:    63 YYSRYGICKFGPACKYDHPIHPDASAEY 90
             +Y + G CKFG +CK+ HP +   S  +
Sbjct:    98 FYLKTGTCKFGASCKFHHPKNAGGSMSH 125

 Score = 104 (41.7 bits), Expect = 6.9e-05, P = 6.9e-05
 Identities = 16/33 (48%), Positives = 22/33 (66%)

Query:    49 DKGLPLRPGQNVCSYYSRYGICKFGPACKYDHP 81
             ++  P RPG+  C YY + G CKFG +CK+ HP
Sbjct:   283 EQAFPERPGEPECQYYLKTGDCKFGTSCKFHHP 315

 Score = 101 (40.6 bits), Expect = 0.00015, P = 0.00015
 Identities = 19/43 (44%), Positives = 24/43 (55%)

Query:    47 LSDKGLPLRPGQNVCSYYSRYGICKFGPACKYDHPIHPDASAE 89
             L     P RPG   C+YY R G+C +G  C+Y+HP    AS E
Sbjct:    37 LGSDSYPERPGAPDCAYYMRTGVCGYGNRCRYNHP-RDRASVE 78

 Score = 100 (40.3 bits), Expect = 0.00019, P = 0.00019
 Identities = 19/51 (37%), Positives = 30/51 (58%)

Query:     7 PERPGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCT-LSDKGLPLRP 56
             P RPG   C+++++ G CK+ S CK+ HP   I  +P  + L+D   P+ P
Sbjct:   333 PLRPGVQRCTFYVQNGFCKFGSTCKFDHPMGTIRYNPSASSLADA--PVAP 381


>TAIR|locus:2081066
            symbol:AT3G06410 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA;ISS] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=RCA] InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103
            SMART:SM00356 GO:GO:0005634 EMBL:CP002686 GenomeReviews:BA000014_GR
            GO:GO:0046872 GO:GO:0003677 GO:GO:0008270 EMBL:AC011623
            eggNOG:NOG312935 HOGENOM:HOG000237733 EMBL:AK230312 EMBL:AK230438
            IPI:IPI00535086 RefSeq:NP_187292.2 UniGene:At.27771
            ProteinModelPortal:Q9SQU4 SMR:Q9SQU4 EnsemblPlants:AT3G06410.1
            GeneID:819815 KEGG:ath:AT3G06410 TAIR:At3g06410 InParanoid:Q9SQU4
            OMA:SSQQYGL PhylomeDB:Q9SQU4 ProtClustDB:CLSN2681554
            Genevestigator:Q9SQU4 GermOnline:AT3G06410 Uniprot:Q9SQU4
        Length = 462

 Score = 294 (108.6 bits), Expect = 9.8e-26, P = 9.8e-26
 Identities = 47/83 (56%), Positives = 62/83 (74%)

Query:     5 EFPERPGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKGLPLRPGQNVCSYY 64
             EFP+RP QPEC YF+RTGDCK+ S+C+YHHP + +P      LS  GLPLRPG   C+++
Sbjct:   303 EFPQRPDQPECQYFMRTGDCKFGSSCRYHHPVDAVPPKTGIVLSSIGLPLRPGVAQCTHF 362

Query:    65 SRYGICKFGPACKYDHPIHPDAS 87
             +++GICKFGPACK+DH +    S
Sbjct:   363 AQHGICKFGPACKFDHSMSSSLS 385

 Score = 232 (86.7 bits), Expect = 8.5e-19, P = 8.5e-19
 Identities = 40/78 (51%), Positives = 52/78 (66%)

Query:     7 PERPGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSP--PCTLSDKGLPLRPGQNVCSYY 64
             PER G P C +F+RTG CK+ ++CKYHHP+         P +LS  G PLRPG+  CSYY
Sbjct:    98 PERMGHPVCQHFMRTGTCKFGASCKYHHPRQGGGGGSVAPVSLSYLGYPLRPGEKECSYY 157

Query:    65 SRYGICKFGPACKYDHPI 82
              R G CKFG  C+++HP+
Sbjct:   158 LRTGQCKFGLTCRFNHPV 175

 Score = 205 (77.2 bits), Expect = 7.7e-16, P = 7.7e-16
 Identities = 35/78 (44%), Positives = 48/78 (61%)

Query:     4 DEFPERPGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKGLPLRPGQNVCSY 63
             + +PERP +P+C Y+LRTG C Y S C+++HP++R             LP R G  VC +
Sbjct:    49 ESYPERPDEPDCIYYLRTGVCGYGSRCRFNHPRDRGAVIGGVRGEAGALPERMGHPVCQH 108

Query:    64 YSRYGICKFGPACKYDHP 81
             + R G CKFG +CKY HP
Sbjct:   109 FMRTGTCKFGASCKYHHP 126

 Score = 123 (48.4 bits), Expect = 4.0e-16, Sum P(2) = 4.0e-16
 Identities = 19/39 (48%), Positives = 27/39 (69%)

Query:     6 FPERPGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPP 44
             +P RPG+ ECSY+LRTG CK+   C+++HP     + PP
Sbjct:   145 YPLRPGEKECSYYLRTGQCKFGLTCRFNHPVPLAVQGPP 183

 Score = 109 (43.4 bits), Expect = 4.0e-16, Sum P(2) = 4.0e-16
 Identities = 18/40 (45%), Positives = 23/40 (57%)

Query:    43 PPCTLSDKGLPLRPGQNVCSYYSRYGICKFGPACKYDHPI 82
             P    + K  P RP Q  C Y+ R G CKFG +C+Y HP+
Sbjct:   295 PSSNSTSKEFPQRPDQPECQYFMRTGDCKFGSSCRYHHPV 334


>TAIR|locus:2182988
            symbol:AT5G18550 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA;ISS] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=RCA] InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103
            SMART:SM00356 GO:GO:0005634 EMBL:CP002688 GenomeReviews:BA000015_GR
            GO:GO:0046872 GO:GO:0003677 GO:GO:0008270 eggNOG:NOG312935
            HOGENOM:HOG000237733 ProtClustDB:CLSN2681554 EMBL:AC069328
            EMBL:BT010886 EMBL:AK230175 IPI:IPI00533261 RefSeq:NP_197356.2
            UniGene:At.22535 ProteinModelPortal:Q6NPN3 SMR:Q6NPN3 STRING:Q6NPN3
            PaxDb:Q6NPN3 PRIDE:Q6NPN3 EnsemblPlants:AT5G18550.1 GeneID:831973
            KEGG:ath:AT5G18550 TAIR:At5g18550 InParanoid:Q6NPN3 OMA:GSQPCAY
            PhylomeDB:Q6NPN3 Genevestigator:Q6NPN3 GermOnline:AT5G18550
            Uniprot:Q6NPN3
        Length = 465

 Score = 276 (102.2 bits), Expect = 1.1e-23, P = 1.1e-23
 Identities = 49/93 (52%), Positives = 65/93 (69%)

Query:     6 FPERPGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPC-TLSDKGLPLRPGQNVCSYY 64
             FP+RP QPEC YF+RTGDCK+ ++C++HHP      SP   TLS  GLPLRPG   C+++
Sbjct:   297 FPQRPEQPECQYFMRTGDCKFGTSCRFHHPMEAA--SPEASTLSHIGLPLRPGAVPCTHF 354

Query:    65 SRYGICKFGPACKYDHPIHPDASAEYGLDPPPS 97
             +++GICKFGPACK+DH +    S+     P PS
Sbjct:   355 AQHGICKFGPACKFDHSL---GSSSLSYSPSPS 384

 Score = 257 (95.5 bits), Expect = 1.5e-21, P = 1.5e-21
 Identities = 44/86 (51%), Positives = 56/86 (65%)

Query:     5 EFPERPGQPECSYFLRTGDCKYKSNCKYHHPKNRI--PKSPPCTLSDKGLPLRPGQNVCS 62
             EFPER GQP C +F+RTG CK+ ++CKYHHP+         P +L+  G PLRPG+  CS
Sbjct:    93 EFPERMGQPVCQHFMRTGTCKFGASCKYHHPRQGGGGDSVTPVSLNYMGFPLRPGEKECS 152

Query:    63 YYSRYGICKFGPACKYDHPIHPDASA 88
             Y+ R G CKFG  C+Y HP+ P   A
Sbjct:   153 YFMRTGQCKFGSTCRYHHPVPPGVQA 178

 Score = 225 (84.3 bits), Expect = 5.1e-18, P = 5.1e-18
 Identities = 38/78 (48%), Positives = 49/78 (62%)

Query:     4 DEFPERPGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKGLPLRPGQNVCSY 63
             + FPERP +P+C Y+LRTG C Y S C+++HP+NR P            P R GQ VC +
Sbjct:    46 ETFPERPDEPDCIYYLRTGVCGYGSRCRFNHPRNRAPVLGGLRTEAGEFPERMGQPVCQH 105

Query:    64 YSRYGICKFGPACKYDHP 81
             + R G CKFG +CKY HP
Sbjct:   106 FMRTGTCKFGASCKYHHP 123

 Score = 106 (42.4 bits), Expect = 4.2e-05, P = 4.2e-05
 Identities = 26/79 (32%), Positives = 38/79 (48%)

Query:    40 PKSPPCTLSDKGLPLRPGQNVCSYYSRYGICKFGPACKYDHPIH---PDAS--AEYGLDP 94
             P S   +  ++  P RP Q  C Y+ R G CKFG +C++ HP+    P+AS  +  GL  
Sbjct:   285 PSSTGVSNKEQTFPQRPEQPECQYFMRTGDCKFGTSCRFHHPMEAASPEASTLSHIGLPL 344

Query:    95 PPSFGDSTTRQETGMAGTG 113
              P     T   + G+   G
Sbjct:   345 RPGAVPCTHFAQHGICKFG 363


>TAIR|locus:2010562
            symbol:AT1G04990 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA;ISS] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0007623 "circadian rhythm" evidence=RCA] InterPro:IPR000571
            Pfam:PF00642 PROSITE:PS50103 SMART:SM00356 EMBL:CP002684
            GenomeReviews:CT485782_GR GO:GO:0005634 GO:GO:0046872 GO:GO:0003677
            GO:GO:0008270 GO:GO:0003723 GO:GO:0090305 EMBL:AC004809
            GO:GO:0004518 HOGENOM:HOG000237733 EMBL:AY048253 EMBL:AY113065
            IPI:IPI00522113 PIR:F86183 RefSeq:NP_563725.1 RefSeq:NP_973759.1
            UniGene:At.21743 ProteinModelPortal:Q94AD9 SMR:Q94AD9 PaxDb:Q94AD9
            PRIDE:Q94AD9 EnsemblPlants:AT1G04990.1 EnsemblPlants:AT1G04990.2
            GeneID:839351 KEGG:ath:AT1G04990 TAIR:At1g04990 eggNOG:NOG290936
            InParanoid:Q94AD9 OMA:THQRISP PhylomeDB:Q94AD9
            ProtClustDB:CLSN2687681 Genevestigator:Q94AD9 GermOnline:AT1G04990
            Uniprot:Q94AD9
        Length = 404

 Score = 262 (97.3 bits), Expect = 1.8e-22, P = 1.8e-22
 Identities = 52/108 (48%), Positives = 66/108 (61%)

Query:     4 DEFPERPGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKGLPLRPGQNVCSY 63
             +E PER GQP+C YFL+TG CKY   CKYHHPK+R   + P   +  GLP+R G+  C Y
Sbjct:    85 EELPERIGQPDCEYFLKTGACKYGPTCKYHHPKDR-NGAQPVMFNVIGLPMRLGEKPCPY 143

Query:    64 YSRYGICKFGPACKYDHPIHPDA--SAEYGLDPPPSFGDSTTRQETGM 109
             Y R G C+FG ACK+ HP  PD   S  YG+    SF  +  R  +G+
Sbjct:   144 YLRTGTCRFGVACKFHHP-QPDNGHSTAYGMS---SFPAADLRYASGL 187

 Score = 246 (91.7 bits), Expect = 1.3e-20, P = 1.3e-20
 Identities = 46/111 (41%), Positives = 59/111 (53%)

Query:     8 ERPGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKGLPLRPGQNVCSYYSRY 67
             E   QPEC +F+ TG CKY  +CKY HP  RI + PP  ++   LP RPGQ  C  +  Y
Sbjct:   260 ESSDQPECRFFMNTGTCKYGDDCKYSHPGVRISQPPPSLINPFVLPARPGQPACGNFRSY 319

Query:    68 GICKFGPACKYDHPIHPDASAEYGLDPPPSFGDSTTRQETGMAGTGNGNGS 118
             G CKFGP CK+DHP+ P          P  F    T  +  ++ T N + S
Sbjct:   320 GFCKFGPNCKFDHPMLPYPGLTMATSLPTPFASPVTTHQR-ISPTPNRSDS 369

 Score = 202 (76.2 bits), Expect = 1.1e-15, P = 1.1e-15
 Identities = 34/79 (43%), Positives = 52/79 (65%)

Query:     3 VDEFPERPGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKGLPLRPGQNVCS 62
             ++ +P+RPG+ +C ++LRTG C Y S+C+Y+HP + +P+        + LP R GQ  C 
Sbjct:    41 LNPYPDRPGERDCQFYLRTGLCGYGSSCRYNHPTH-LPQD--VAYYKEELPERIGQPDCE 97

Query:    63 YYSRYGICKFGPACKYDHP 81
             Y+ + G CK+GP CKY HP
Sbjct:    98 YFLKTGACKYGPTCKYHHP 116

 Score = 113 (44.8 bits), Expect = 5.9e-06, P = 5.9e-06
 Identities = 20/48 (41%), Positives = 28/48 (58%)

Query:    53 PLRPGQNVCSYYSRYGICKFGPACKYDHPIH-PDASAEYGLDPPPSFG 99
             P RPG+  C +Y R G+C +G +C+Y+HP H P   A Y  + P   G
Sbjct:    45 PDRPGERDCQFYLRTGLCGYGSSCRYNHPTHLPQDVAYYKEELPERIG 92

 Score = 99 (39.9 bits), Expect = 0.00019, P = 0.00019
 Identities = 18/36 (50%), Positives = 22/36 (61%)

Query:     2 LVDEF--PERPGQPECSYFLRTGDCKYKSNCKYHHP 35
             L++ F  P RPGQP C  F   G CK+  NCK+ HP
Sbjct:   298 LINPFVLPARPGQPACGNFRSYGFCKFGPNCKFDHP 333


>TAIR|locus:2087775
            symbol:HUA1 "ENHANCER OF AG-4 1" species:3702
            "Arabidopsis thaliana" [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0005634 "nucleus" evidence=ISM;IDA;TAS]
            [GO:0008270 "zinc ion binding" evidence=IEA] [GO:0001709 "cell fate
            determination" evidence=TAS] [GO:0003723 "RNA binding"
            evidence=ISS;IDA] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=RCA] InterPro:IPR000571 Pfam:PF00642
            PROSITE:PS50103 SMART:SM00356 GO:GO:0005634 EMBL:CP002686
            GenomeReviews:BA000014_GR GO:GO:0046872 GO:GO:0003677 GO:GO:0016607
            GO:GO:0008270 GO:GO:0006397 GO:GO:0003723 GO:GO:0009908
            EMBL:AB024033 GO:GO:0001709 EMBL:AY024357 EMBL:AC069474
            EMBL:AK229145 IPI:IPI00536814 RefSeq:NP_187874.2 UniGene:At.5670
            ProteinModelPortal:Q941Q3 SMR:Q941Q3 STRING:Q941Q3 PaxDb:Q941Q3
            PRIDE:Q941Q3 EnsemblPlants:AT3G12680.1 GeneID:820448
            KEGG:ath:AT3G12680 TAIR:At3g12680 eggNOG:NOG250655
            HOGENOM:HOG000078745 InParanoid:Q941Q3 OMA:LGAHNTI PhylomeDB:Q941Q3
            ProtClustDB:CLSN2690537 Genevestigator:Q941Q3 Uniprot:Q941Q3
        Length = 524

 Score = 229 (85.7 bits), Expect = 2.6e-18, P = 2.6e-18
 Identities = 38/83 (45%), Positives = 52/83 (62%)

Query:     6 FPERPGQPECSYFLRTGDCKYKSNCKYHHPKNRI-------PKSPPCTLSDKGLPLRPGQ 58
             +P+RPGQ EC Y+++TG+CK+   CK+HHP +R+       P+ P   LS  G P R G 
Sbjct:   417 YPQRPGQSECDYYMKTGECKFGERCKFHHPADRLSAMTKQAPQQPNVKLSLAGYPRREGA 476

Query:    59 NVCSYYSRYGICKFGPACKYDHP 81
               C YY + G CK+G  CK+DHP
Sbjct:   477 LNCPYYMKTGTCKYGATCKFDHP 499

 Score = 216 (81.1 bits), Expect = 6.6e-17, P = 6.6e-17
 Identities = 38/91 (41%), Positives = 58/91 (63%)

Query:     4 DEFPERPGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKGLPLRPGQNVCSY 63
             +E+PERPG+P+C Y+++T  CKY S CK++HP+     S     +   LP RP + +C++
Sbjct:   220 EEYPERPGEPDCPYYIKTQRCKYGSKCKFNHPREEAAVSVE---TQDSLPERPSEPMCTF 276

Query:    64 YSRYGICKFGPACKYDHP--IH-PDASAEYG 91
             Y + G CKFG +CK+ HP  I  P +S + G
Sbjct:   277 YMKTGKCKFGLSCKFHHPKDIQLPSSSQDIG 307

 Score = 163 (62.4 bits), Expect = 3.6e-11, P = 3.6e-11
 Identities = 38/117 (32%), Positives = 61/117 (52%)

Query:     6 FPERPGQPECSYFLRTGDCKYKSNCKYHHP----KNRIP--KSPPCTLSDKGLPLRPGQN 59
             +P+R G+ +C+++++T  CK+  +C++ HP    +  IP  K  P   +++  P RPG+ 
Sbjct:   171 YPQRAGEKDCTHYMQTRTCKFGESCRFDHPIWVPEGGIPDWKEAPVVPNEE-YPERPGEP 229

Query:    60 VCSYYSRYGICKFGPACKYDHPIHPDA-SAEY--GLDPPPSFGDSTTRQETGMAGTG 113
              C YY +   CK+G  CK++HP    A S E    L   PS    T   +TG    G
Sbjct:   230 DCPYYIKTQRCKYGSKCKFNHPREEAAVSVETQDSLPERPSEPMCTFYMKTGKCKFG 286

 Score = 107 (42.7 bits), Expect = 3.8e-05, P = 3.8e-05
 Identities = 17/40 (42%), Positives = 26/40 (65%)

Query:    42 SPPCTLSDKGLPLRPGQNVCSYYSRYGICKFGPACKYDHP 81
             +P    + KGLP+R G+  C +Y + G CK+G  C+Y+HP
Sbjct:   327 TPALYHNSKGLPVRSGEVDCPFYLKTGSCKYGATCRYNHP 366

 Score = 104 (41.7 bits), Expect = 8.1e-05, P = 8.1e-05
 Identities = 16/38 (42%), Positives = 27/38 (71%)

Query:     7 PERPGQPECSYFLRTGDCKYKSNCKYHHPKNR--IPKS 42
             P R G+ +C ++L+TG CKY + C+Y+HP+    IP++
Sbjct:   338 PVRSGEVDCPFYLKTGSCKYGATCRYNHPERTAFIPQA 375
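
Note on the Identities and Positives lines: the percentages in this report appear to be truncated rather than rounded. For example, 38/83 is about 45.8% but is shown as 45%. The sketch below parses one such line and recomputes the percentage both ways. The regular expression and the helper name parse_counts are illustrative assumptions, not part of any BLAST tool.

```python
import re

# One Identities/Positives line copied from the HUA1 record above.
LINE = " Identities = 38/83 (45%), Positives = 52/83 (62%)"

PATTERN = re.compile(r"(Identities|Positives) = (\d+)/(\d+) \((\d+)%\)")


def parse_counts(line: str) -> dict:
    """Return {label: (matches, aligned_length, reported_percent)}."""
    return {
        m.group(1): (int(m.group(2)), int(m.group(3)), int(m.group(4)))
        for m in PATTERN.finditer(line)
    }


for label, (hits, length, reported) in parse_counts(LINE).items():
    truncated = hits * 100 // length       # integer division drops the fraction
    rounded = round(hits * 100 / length)   # conventional rounding, for comparison
    print(f"{label}: reported {reported}%, truncated {truncated}%, rounded {rounded}%")
```

When comparing these figures with output from other tools, it is safer to recompute the fraction from the raw counts than to rely on the printed percentage.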


>TAIR|locus:2013763
            symbol:AT1G29570 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=RCA] [GO:0048445 "carpel morphogenesis" evidence=RCA]
            InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103 SMART:SM00356
            EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0046872 GO:GO:0003677
            GO:GO:0008270 EMBL:AC068667 IPI:IPI00519526 PIR:G86418
            RefSeq:NP_174250.1 UniGene:At.51822 EnsemblPlants:AT1G29570.1
            GeneID:839834 KEGG:ath:AT1G29570 TAIR:At1g29570 eggNOG:NOG325481
            HOGENOM:HOG000107458 OMA:HIMDRNV PhylomeDB:Q9C7P4
            ProtClustDB:CLSN2914472 Genevestigator:Q9C7P4 Uniprot:Q9C7P4
        Length = 321

 Score = 113 (44.8 bits), Expect = 1.0e-07, Sum P(2) = 1.0e-07
 Identities = 16/39 (41%), Positives = 29/39 (74%)

Query:     6 FPERPGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPP 44
             +P RPG+ +C ++L+ G C+Y+S+C+++HP  R P+  P
Sbjct:    52 YPVRPGKKDCQFYLKNGLCRYRSSCRFNHPTQR-PQELP 89

 Score = 91 (37.1 bits), Expect = 2.6e-05, Sum P(2) = 2.6e-05
 Identities = 11/29 (37%), Positives = 22/29 (75%)

Query:    53 PLRPGQNVCSYYSRYGICKFGPACKYDHP 81
             P+RPG+  C +Y + G+C++  +C+++HP
Sbjct:    53 PVRPGKKDCQFYLKNGLCRYRSSCRFNHP 81

 Score = 33 (16.7 bits), Expect = 1.0e-07, Sum P(2) = 1.0e-07
 Identities = 6/17 (35%), Positives = 11/17 (64%)

Query:    97 SFGDSTTRQETGMAGTG 113
             +FGD  T++  G+  +G
Sbjct:   125 TFGDERTQRRYGIEYSG 141
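
Note on the Score lines: each line gives a raw score followed by a bit score in parentheses. Under the standard Karlin-Altschul normalization, bits = (lambda * S - ln K) / ln 2, so the bit score should be a linear function of the raw score S. The sketch below fits a straight line to five (raw, bits) pairs copied from this report. It assumes that normalization applies here. The fitted slope and intercept are estimates from rounded values, not the parameters the search program actually used.

```python
# (raw score, bit score) pairs copied from Score lines in this report.
PAIRS = [(202, 76.2), (113, 44.8), (99, 39.9), (229, 85.7), (33, 16.7)]


def fit_line(pairs):
    """Ordinary least-squares fit of bits = slope * raw + intercept."""
    n = len(pairs)
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in pairs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


slope, intercept = fit_line(PAIRS)
print(f"bits ~= {slope:.4f} * raw + {intercept:.2f}")

# Check the fit against a pair that was not used for fitting (216 -> 81.1 bits).
print(f"predicted bits for raw 216: {slope * 216 + intercept:.1f}")
```

A fit like this is only a sanity check on the report. The exact lambda and K values depend on the scoring matrix and gap settings, which this section does not state.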


>WB|WBGene00003228
            symbol:mex-1 species:6239 "Caenorhabditis elegans"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0002009 "morphogenesis of an
            epithelium" evidence=IMP] [GO:0009792 "embryo development ending in
            birth or egg hatching" evidence=IMP] [GO:0000003 "reproduction"
            evidence=IMP] [GO:0001708 "cell fate specification" evidence=IMP]
            [GO:0030010 "establishment of cell polarity" evidence=IMP]
            [GO:0001704 "formation of primary germ layer" evidence=IMP]
            [GO:0009880 "embryonic pattern specification" evidence=IGI;IMP]
            [GO:0005737 "cytoplasm" evidence=IDA] [GO:0043186 "P granule"
            evidence=IDA] [GO:0003674 "molecular_function" evidence=NAS]
            [GO:0003723 "RNA binding" evidence=ISS] InterPro:IPR000571
            Pfam:PF00642 PROSITE:PS50103 SMART:SM00356 GO:GO:0001708
            GO:GO:0009792 GO:GO:0002009 GO:GO:0008270 GO:GO:0000003
            GO:GO:0003723 GO:GO:0009880 GO:GO:0030010 EMBL:AL023828 HSSP:P22893
            GO:GO:0043186 GeneTree:ENSGT00530000063262 GO:GO:0001704 CTD:174836
            EMBL:U81043 EMBL:Z66516 PIR:T26124 RefSeq:NP_001254325.1
            UniGene:Cel.7263 ProteinModelPortal:G5ECB9 SMR:G5ECB9 IntAct:G5ECB9
            EnsemblMetazoa:W03C9.7a.1 EnsemblMetazoa:W03C9.7a.2
            EnsemblMetazoa:W03C9.7a.3 GeneID:174836 KEGG:cel:CELE_W03C9.7
            WormBase:W03C9.7a OMA:HVERNQT NextBio:885712 Uniprot:G5ECB9
        Length = 494

 Score = 112 (44.5 bits), Expect = 4.2e-07, Sum P(2) = 4.2e-07
 Identities = 23/70 (32%), Positives = 35/70 (50%)

Query:    15 CSYFLRTGDCKYKSNCKYHHPKN--RIPKSPPCTLSDKGLPLRPGQNVCSYYSRYGICKF 72
             C  F R+G C Y   C++ H +N  R+P  P      K  P    Q +C  +S +G C +
Sbjct:   144 CDAFKRSGSCPYGEACRFAHGENELRMPSQP----RGKAHPKYKTQ-LCDKFSNFGQCPY 198

Query:    73 GPACKYDHPI 82
             GP C++ H +
Sbjct:   199 GPRCQFIHKL 208

 Score = 34 (17.0 bits), Expect = 4.2e-07, Sum P(2) = 4.2e-07
 Identities = 17/55 (30%), Positives = 21/55 (38%)

Query:    73 GPACKYDHPIHPDASAEYGLDPPPSFG--DSTTRQ---ETGMAGTGNGNGSDKNI 122
             GP    +H  H        +  PPS G   S   Q   E  MA +G   G  +NI
Sbjct:   329 GPKRSDNHHYHHHHHQRRHIPAPPSTGPLSSLIDQADYEASMA-SGKMFGKPENI 382


>ASPGD|ASPL0000062209
            symbol:AN0298 species:162425 "Emericella nidulans"
            [GO:0006379 "mRNA cleavage" evidence=IEA] [GO:0006378 "mRNA
            polyadenylation" evidence=IEA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA]
            [GO:0003723 "RNA binding" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] InterPro:IPR000571 Pfam:PF00642
            PROSITE:PS50103 SMART:SM00356 GO:GO:0005634 EMBL:BN001308
            GO:GO:0046872 GO:GO:0008270 GO:GO:0006397 GO:GO:0003723
            eggNOG:COG5084 EMBL:AACD01000006 HOGENOM:HOG000212457 KO:K14404
            RefSeq:XP_657902.1 ProteinModelPortal:Q5BGN2 STRING:Q5BGN2
            EnsemblFungi:CADANIAT00002417 GeneID:2876077 KEGG:ani:AN0298.2
            OMA:DPDRPVC OrthoDB:EOG4PG99D Uniprot:Q5BGN2
        Length = 254

 Score = 117 (46.2 bits), Expect = 8.4e-07, P = 8.4e-07
 Identities = 28/77 (36%), Positives = 36/77 (46%)

Query:    13 PECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKGL-PLRP-------GQNVCSYY 64
             PEC  F R+G C    +C Y H + +  + PPC   D+G  PL P        + +C YY
Sbjct:   118 PECQSFSRSGYCPNGDDCLYQHVREQA-RLPPCEHYDQGFCPLGPLCAKRHVRRRLCPYY 176

Query:    65 SRYGICKFGPACKYDHP 81
                G C  GP C   HP
Sbjct:   177 VA-GFCPEGPNCANAHP 192


>DICTYBASE|DDB_G0270148
            symbol:cpsf4 "cleavage and polyadenylation
            specificity factor 30 kDa subunit" species:44689 "Dictyostelium
            discoideum" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0006379 "mRNA
            cleavage" evidence=ISS] [GO:0006378 "mRNA polyadenylation"
            evidence=ISS] [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=ISS] [GO:0003723 "RNA binding"
            evidence=ISS] [GO:0046872 "metal ion binding" evidence=IEA]
            InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098 PROSITE:PS50103
            PROSITE:PS50158 SMART:SM00343 SMART:SM00356 dictyBase:DDB_G0270148
            EMBL:AAFI02000005 GenomeReviews:CM000150_GR GO:GO:0046872
            GO:GO:0008270 GO:GO:0006378 GO:GO:0003723 Gene3D:4.10.60.10
            SUPFAM:SSF57756 eggNOG:COG5084 GO:GO:0005847 GO:GO:0006379
            KO:K14404 RefSeq:XP_646578.1 ProteinModelPortal:Q55CA3 SMR:Q55CA3
            STRING:Q55CA3 EnsemblProtists:DDB0233701 GeneID:8617548
            KEGG:ddi:DDB_G0270148 InParanoid:Q55CA3 OMA:ECMYLHV
            ProtClustDB:CLSZ2437480 Uniprot:Q55CA3
        Length = 372

 Score = 95 (38.5 bits), Expect = 1.9e-05, Sum P(2) = 1.9e-05
 Identities = 25/77 (32%), Positives = 35/77 (45%)

Query:    13 PECSYFLRTGDCKYKSNCKYHH--PKNRIPKSPP-----CTLSDKGLPLRPGQNVC-SYY 64
             PEC +F + G+C     C Y H  P+ ++ + P      C    K       + +C +YY
Sbjct:    91 PECYFFSKHGECN-NQECMYLHVNPEEKVRECPWYSRGFCKHGPKCRHKHIKKLLCENYY 149

Query:    65 SRYGICKFGPACKYDHP 81
                G C  GP CKY HP
Sbjct:   150 --LGFCPEGPKCKYGHP 164

 Score = 32 (16.3 bits), Expect = 1.9e-05, Sum P(2) = 1.9e-05
 Identities = 6/13 (46%), Positives = 7/13 (53%)

Query:   104 RQETGMAGTGNGN 116
             +Q  GM G  N N
Sbjct:   206 QQFNGMGGNNNNN 218


>CGD|CAL0005897
            symbol:YTH1 species:5476 "Candida albicans" [GO:0042493
            "response to drug" evidence=IMP] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA]
            [GO:0005829 "cytosol" evidence=IEA] [GO:0006379 "mRNA cleavage"
            evidence=IEA] [GO:0006378 "mRNA polyadenylation" evidence=IEA]
            [GO:0003723 "RNA binding" evidence=IEA] InterPro:IPR000571
            Pfam:PF00642 PROSITE:PS50103 SMART:SM00356 CGD:CAL0005897
            GO:GO:0005634 GO:GO:0042493 GO:GO:0046872 GO:GO:0008270
            GO:GO:0006397 GO:GO:0003723 eggNOG:COG5084 KO:K14404
            EMBL:AACQ01000145 EMBL:AACQ01000144 RefSeq:XP_712810.1
            RefSeq:XP_712839.1 ProteinModelPortal:Q59T36 SMR:Q59T36
            STRING:Q59T36 GeneID:3645540 GeneID:3645572 KEGG:cal:CaO19.14170
            KEGG:cal:CaO19.6881 Uniprot:Q59T36
        Length = 215

 Score = 103 (41.3 bits), Expect = 2.0e-05, P = 2.0e-05
 Identities = 26/79 (32%), Positives = 39/79 (49%)

Query:    13 PECSYFLRTGDCKYKSNCKYHH--PKNRIPKSPPCTLSDKGL----PLRPGQNV----CS 62
             PEC ++ + G C   S C Y H  P+++IP+   C   ++G     P    ++V    C 
Sbjct:    97 PECLFYSKNGYCTQTSECLYLHVDPQSKIPE---CLNYNQGFCSEGPNCKNRHVRRVLCP 153

Query:    63 YYSRYGICKFGPACKYDHP 81
              Y  YG C  GP C++ HP
Sbjct:   154 LYL-YGFCPKGPECEFTHP 171


>UNIPROTKB|Q59T36
            symbol:YTH1 "mRNA 3'-end-processing protein YTH1"
            species:237561 "Candida albicans SC5314" [GO:0042493 "response to
            drug" evidence=IMP] InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103
            SMART:SM00356 CGD:CAL0005897 GO:GO:0005634 GO:GO:0042493
            GO:GO:0046872 GO:GO:0008270 GO:GO:0006397 GO:GO:0003723
            eggNOG:COG5084 KO:K14404 EMBL:AACQ01000145 EMBL:AACQ01000144
            RefSeq:XP_712810.1 RefSeq:XP_712839.1 ProteinModelPortal:Q59T36
            SMR:Q59T36 STRING:Q59T36 GeneID:3645540 GeneID:3645572
            KEGG:cal:CaO19.14170 KEGG:cal:CaO19.6881 Uniprot:Q59T36
        Length = 215

 Score = 103 (41.3 bits), Expect = 2.0e-05, P = 2.0e-05
 Identities = 26/79 (32%), Positives = 39/79 (49%)

Query:    13 PECSYFLRTGDCKYKSNCKYHH--PKNRIPKSPPCTLSDKGL----PLRPGQNV----CS 62
             PEC ++ + G C   S C Y H  P+++IP+   C   ++G     P    ++V    C 
Sbjct:    97 PECLFYSKNGYCTQTSECLYLHVDPQSKIPE---CLNYNQGFCSEGPNCKNRHVRRVLCP 153

Query:    63 YYSRYGICKFGPACKYDHP 81
              Y  YG C  GP C++ HP
Sbjct:   154 LYL-YGFCPKGPECEFTHP 171


>ASPGD|ASPL0000000121
            symbol:AN6331 species:162425 "Emericella nidulans"
            [GO:0006397 "mRNA processing" evidence=IEA] [GO:0003676 "nucleic
            acid binding" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0000166 "nucleotide binding" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR000504
            InterPro:IPR000571 InterPro:IPR002483 InterPro:IPR012677
            Pfam:PF01480 PROSITE:PS50102 PROSITE:PS50103 SMART:SM00360
            GO:GO:0000166 GO:GO:0046872 GO:GO:0008270 GO:GO:0006397
            Gene3D:3.30.70.330 GO:GO:0003723 EMBL:AACD01000107 EMBL:BN001301
            eggNOG:NOG280405 Gene3D:1.20.1390.10 SUPFAM:SSF101233
            OrthoDB:EOG4BGD4C RefSeq:XP_663935.1 ProteinModelPortal:Q5AZE9
            EnsemblFungi:CADANIAT00006655 GeneID:2871230 KEGG:ani:AN6331.2
            HOGENOM:HOG000166860 OMA:SSRRRCK Uniprot:Q5AZE9
        Length = 733

 Score = 98 (39.6 bits), Expect = 3.3e-05, Sum P(2) = 3.3e-05
 Identities = 31/96 (32%), Positives = 40/96 (41%)

Query:    38 RIPKSPPCTLSDKGLP-LRPGQ------NVCSYYSRYGICKFGPACKYDHPIHPDASAEY 90
             ++P  PP  +   G P  +P Q        C +Y   GIC  G AC Y H     A  + 
Sbjct:   207 QMPGMPPMPMPPGGGPGQQPDQMGPKSTEKCPFYETQGICYLGGACPYQHDTVAGAPKDD 266

Query:    91 GLDPPPS--FGDSTTRQETGMAGT----GNGNGSDK 120
               DP  S    DS  R +  M G+    G G GSD+
Sbjct:   267 EYDPKTSGIVPDSRRRLDGSMRGSDRGRGRGRGSDR 302

 Score = 34 (17.0 bits), Expect = 3.3e-05, Sum P(2) = 3.3e-05
 Identities = 5/8 (62%), Positives = 6/8 (75%)

Query:     6 FPERPGQP 13
             FP+ PG P
Sbjct:   205 FPQMPGMP 212


>POMBASE|SPAC227.08c
            symbol:yth1 "mRNA cleavage and polyadenylation
            specificity factor complex Yth1" species:4896 "Schizosaccharomyces
            pombe" [GO:0003723 "RNA binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=IDA] [GO:0005829 "cytosol" evidence=IDA]
            [GO:0005847 "mRNA cleavage and polyadenylation specificity factor
            complex" evidence=IDA] [GO:0006378 "mRNA polyadenylation"
            evidence=IC] [GO:0008270 "zinc ion binding" evidence=IEA]
            InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103 SMART:SM00356
            PomBase:SPAC227.08c GO:GO:0005829 EMBL:CU329670
            GenomeReviews:CU329670_GR GO:GO:0046872 GO:GO:0008270 GO:GO:0006378
            GO:GO:0003723 eggNOG:COG5084 GO:GO:0005847 HOGENOM:HOG000212457
            KO:K14404 OrthoDB:EOG4PG99D PIR:T50164 RefSeq:NP_592962.1
            ProteinModelPortal:Q9UTD1 SMR:Q9UTD1 STRING:Q9UTD1
            EnsemblFungi:SPAC227.08c.1 GeneID:2541506 KEGG:spo:SPAC227.08c
            NextBio:20802605 Uniprot:Q9UTD1
        Length = 170

 Score = 97 (39.2 bits), Expect = 4.4e-05, P = 4.4e-05
 Identities = 29/93 (31%), Positives = 42/93 (45%)

Query:    11 GQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTL-SDKG----------LPLRPGQN 59
             G   C ++LR G CK    C + H  N + K PPC   +++G          L L P + 
Sbjct:    50 GSVVCKHWLR-GLCKKGEQCDFLHEYN-LKKMPPCHFYAERGWCSNGEECLYLHLDPSKQ 107

Query:    60 V--CSYYSRYGICKFGPACKYDHPIHPDASAEY 90
             V  C++Y+  G C  GP C+  H   P    +Y
Sbjct:   108 VGVCAWYNM-GFCPLGPICRGKHVRKPRPCPKY 139


>CGD|CAL0004295
            symbol:orf19.5334 species:5476 "Candida albicans" [GO:0003729
            "mRNA binding" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0006879 "cellular iron
            ion homeostasis" evidence=IEA] [GO:0000956 "nuclear-transcribed
            mRNA catabolic process" evidence=IEA] InterPro:IPR000571
            Pfam:PF00642 PROSITE:PS50103 SMART:SM00356 CGD:CAL0004295
            GO:GO:0008270 GO:GO:0003676 EMBL:AACQ01000059 EMBL:AACQ01000058
            eggNOG:COG5063 RefSeq:XP_717058.1 RefSeq:XP_717137.1
            ProteinModelPortal:Q5A5R5 GeneID:3641257 GeneID:3641293
            KEGG:cal:CaO19.12794 KEGG:cal:CaO19.5334 Uniprot:Q5A5R5
        Length = 203

 Score = 98 (39.6 bits), Expect = 6.4e-05, P = 6.4e-05
 Identities = 20/68 (29%), Positives = 37/68 (54%)

Query:    15 CSYFLRTGDCKYKSNCKYHHPKNRIP--KSPPCTLSDKGLPLRPGQNVCSYYSRYGICKF 72
             C+ F++TG C Y + C++ H +N +   + PP   S      +P    C+ +++YG C++
Sbjct:   144 CASFMKTGVCPYANKCQFAHGENELKHVERPPKWRS------KP----CANWTKYGSCRY 193

Query:    73 GPACKYDH 80
             G  C + H
Sbjct:   194 GNRCCFKH 201


>SGD|S000006311
            symbol:YTH1 "Essential RNA-binding component of cleavage and
            polyadenylation factor" species:4932 "Saccharomyces cerevisiae"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0003723 "RNA binding"
            evidence=IEA;IDA] [GO:0005634 "nucleus" evidence=IEA] [GO:0006397
            "mRNA processing" evidence=IEA] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=IDA] [GO:0006379 "mRNA
            cleavage" evidence=IMP;IDA;TAS] [GO:0006378 "mRNA polyadenylation"
            evidence=IDA;IMP;TAS] InterPro:IPR000571 PROSITE:PS50103
            PROSITE:PS50158 SMART:SM00356 SGD:S000006311 GO:GO:0046872
            GO:GO:0008270 GO:GO:0006378 GO:GO:0003723 EMBL:BK006949
            eggNOG:COG5084 GO:GO:0005847 GO:GO:0006379 EMBL:U32445
            HOGENOM:HOG000212457 GeneTree:ENSGT00390000009627 KO:K14404
            OMA:DPDRPVC OrthoDB:EOG4PG99D EMBL:AY558061 PIR:S59772
            RefSeq:NP_015432.1 ProteinModelPortal:Q06102 SMR:Q06102
            DIP:DIP-2028N IntAct:Q06102 MINT:MINT-375481 STRING:Q06102
            PaxDb:Q06102 PeptideAtlas:Q06102 EnsemblFungi:YPR107C GeneID:856222
            KEGG:sce:YPR107C CYGD:YPR107c NextBio:981453 Genevestigator:Q06102
            GermOnline:YPR107C Uniprot:Q06102
        Length = 208

 Score = 96 (38.9 bits), Expect = 0.00011, P = 0.00011
 Identities = 29/89 (32%), Positives = 43/89 (48%)

Query:    13 PECSYFLRTGDCKYKSNCKYHH--PKNRIPKSPPCTLSDKGL-PLR---PGQNVCSYY-S 65
             PEC +F + G C    +C+Y H  P ++IPK   C   + G  PL    P +++   +  
Sbjct:    93 PECVFFSKNGYCTQSPDCQYLHIDPASKIPK---CENYEMGFCPLGSSCPRRHIKKVFCQ 149

Query:    66 RY--GICKFGP-ACKYDHP--IHPDASAE 89
             RY  G C  G   C  +HP  I PD  ++
Sbjct:   150 RYMTGFCPLGKDECDMEHPQFIIPDEGSK 178


>DICTYBASE|DDB_G0270816
            symbol:DDB_G0270816 "Zinc finger CCCH
            domain-containing protein 14" species:44689 "Dictyostelium
            discoideum" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0006397 "mRNA processing" evidence=IEA] [GO:0003676 "nucleic
            acid binding" evidence=IEA] InterPro:IPR000571 InterPro:IPR002483
            Pfam:PF01480 PROSITE:PS50103 dictyBase:DDB_G0270816
            EMBL:AAFI02000005 GO:GO:0008270 GO:GO:0006397 GO:GO:0003676
            eggNOG:NOG330419 RefSeq:XP_646803.1 ProteinModelPortal:Q55BM8
            EnsemblProtists:DDB0202048 GeneID:8617776 KEGG:ddi:DDB_G0270816
            InParanoid:Q55BM8 OMA:RKNINRD ProtClustDB:CLSZ2495641
            Uniprot:Q55BM8
        Length = 472

 Score = 102 (41.0 bits), Expect = 0.00011, P = 0.00011
 Identities = 26/80 (32%), Positives = 36/80 (45%)

Query:     7 PERPGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKGLPLRPGQNVCSYYSR 66
             P++  +  CSY+     C+    C YHHP  +    P CT  DK L + P     S   +
Sbjct:   265 PKKSKKERCSYWPL---CRNAEACIYHHPTTQCLLFPNCTYGDKCLYIHP-----SIPCK 316

Query:    67 YGICKFGPACKYDHPIHPDA 86
             +GI      C Y+HP  P A
Sbjct:   317 FGINCTNVDCVYNHPQRPTA 336


>UNIPROTKB|E1BV31
            symbol:CPSF4 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0008270 "zinc ion binding" evidence=IEA] [GO:0005739
            "mitochondrion" evidence=IEA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=IEA]
            InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098 Pfam:PF00642
            PROSITE:PS50103 PROSITE:PS50158 SMART:SM00343 SMART:SM00356
            GO:GO:0046872 GO:GO:0008270 GO:GO:0003676 Gene3D:4.10.60.10
            SUPFAM:SSF57756 GO:GO:0005847 CTD:10898
            GeneTree:ENSGT00390000009627 KO:K14404 OMA:PLDQVTC
            EMBL:AADN02023770 IPI:IPI00572429 RefSeq:XP_414800.1
            UniGene:Gga.12217 Ensembl:ENSGALT00000007510 GeneID:416494
            KEGG:gga:416494 NextBio:20819939 Uniprot:E1BV31
        Length = 243

 Score = 97 (39.2 bits), Expect = 0.00013, P = 0.00013
 Identities = 36/121 (29%), Positives = 52/121 (42%)

Query:    13 PECSYFLRTGDCKYKSNCKYHH--PKNRIPKSPPCTLSDKGL----PL---RPGQNV-CS 62
             PEC ++ + G+C  K  C + H  P+++I     C   D+G     PL   R  + V C 
Sbjct:    94 PECYFYSKFGECSNKE-CPFLHIDPESKIKD---CPWYDRGFCKHGPLCRHRHTRRVICV 149

Query:    63 YYSRYGICKFGPACKYDHPIH--PDASAEYG-LDPPPSFGDSTTRQETGMAGTGNGNGSD 119
              Y   G C  GP CK+ HP    P  + E   L  P       T Q  G+  + N N  +
Sbjct:   150 NYL-VGFCPEGPTCKFMHPRFELPMGTTEQPPLPQPAQTQQKRTPQVIGVMQSQNNNTGN 208

Query:   120 K 120
             +
Sbjct:   209 R 209


>FB|FBgn0029152
            symbol:Mkrn1 "Makorin 1" species:7227 "Drosophila
            melanogaster" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0008270 "zinc ion binding" evidence=IEA] InterPro:IPR001841
            InterPro:IPR000571 Pfam:PF00642 Pfam:PF13639 PROSITE:PS50089
            PROSITE:PS50103 SMART:SM00184 SMART:SM00356 Prosite:PS00518
            GO:GO:0046872 GO:GO:0008270 GO:GO:0003676 Gene3D:3.30.40.10
            InterPro:IPR013083 InterPro:IPR017907 InterPro:IPR026290
            PANTHER:PTHR11224 eggNOG:NOG268458 ChiTaRS:MKRN1 EMBL:AF192794
            EMBL:AF192788 STRING:Q9TVX6 FlyBase:FBgn0029152 InParanoid:Q9TVX6
            OrthoDB:EOG4CC2GZ Uniprot:Q9TVX6
        Length = 386

 Score = 100 (40.3 bits), Expect = 0.00014, P = 0.00014
 Identities = 21/58 (36%), Positives = 30/58 (51%)

Query:    51 GLPLRPGQNVCSYYSRYGICKFGPACKYDHPI---HPDASAEYGLD--PPPSFGDSTT 103
             G+ L   Q +C YY R GIC+FG  C++ H +    P+   +   D  P PS   S+T
Sbjct:    13 GMALGRSQTICRYYVR-GICRFGELCRFSHDLSRGRPECEEQVATDVLPKPSTSSSST 69


>WB|WBGene00009532
            symbol:ccch-1 species:6239 "Caenorhabditis elegans"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0016529 "sarcoplasmic reticulum"
            evidence=IDA] InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103
            SMART:SM00356 GO:GO:0008270 GO:GO:0003676 GO:GO:0016529 HSSP:P22893
            eggNOG:COG5063 GeneTree:ENSGT00530000063262 EMBL:Z74033 PIR:T21954
            RefSeq:NP_505926.2 ProteinModelPortal:Q20155 SMR:Q20155
            PaxDb:Q20155 EnsemblMetazoa:F38B7.1a GeneID:179584
            KEGG:cel:CELE_F38B7.1 UCSC:F38B7.1a CTD:179584 WormBase:F38B7.1a
            HOGENOM:HOG000022515 InParanoid:Q20155 OMA:PCSSNDS NextBio:906034
            ArrayExpress:Q20155 Uniprot:Q20155
        Length = 460

 Score = 89 (36.4 bits), Expect = 0.00015, Sum P(2) = 0.00015
 Identities = 21/78 (26%), Positives = 32/78 (41%)

Query:    15 CSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKGLPLRPGQNV--CSYYSRYGICKF 72
             C  ++  G C Y   C+Y H    + K P        +P  P      C  + + G C +
Sbjct:   204 CRSWMDHGRCNYGERCQYAH--GELEKRP--------VPRHPKYKTEACQSFHQSGYCPY 253

Query:    73 GPACKYDHPIHPDASAEY 90
             GP C + H   P A ++Y
Sbjct:   254 GPRCHFIHNEPPSAQSQY 271

 Score = 32 (16.3 bits), Expect = 0.00015, Sum P(2) = 0.00015
 Identities = 8/23 (34%), Positives = 9/23 (39%)

Query:    91 GLDPPPSFGDSTTRQETGMAGTG 113
             G  PP S  DS +    G    G
Sbjct:   329 GESPPCSSNDSGSESPNGSFSPG 351


>SGD|S000004126
            symbol:TIS11 "mRNA-binding protein expressed during iron
            starvation" species:4932 "Saccharomyces cerevisiae" [GO:0005634
            "nucleus" evidence=IEA;IDA] [GO:0000956 "nuclear-transcribed mRNA
            catabolic process" evidence=IMP] [GO:0005737 "cytoplasm"
            evidence=IEA;IDA] [GO:0000932 "cytoplasmic mRNA processing body"
            evidence=IEA] [GO:0003723 "RNA binding" evidence=IEA] [GO:0046872
            "metal ion binding" evidence=IEA] [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0003729 "mRNA binding" evidence=IDA] [GO:0006879 "cellular iron
            ion homeostasis" evidence=IMP] InterPro:IPR000571 Pfam:PF00642
            PROSITE:PS50103 SMART:SM00356 SGD:S000004126 GO:GO:0005634
            GO:GO:0005737 GO:GO:0046872 GO:GO:0008270 GO:GO:0003729
            EMBL:BK006945 GO:GO:0006879 EMBL:X91258 EMBL:U53881 GO:GO:0000932
            GO:GO:0000956 eggNOG:COG5063 GeneTree:ENSGT00530000063262
            HOGENOM:HOG000001038 OrthoDB:EOG4W3WXK EMBL:S76619 EMBL:L42134
            EMBL:Z73308 EMBL:AY558210 PIR:S59328 RefSeq:NP_013237.1
            ProteinModelPortal:P47977 SMR:P47977 DIP:DIP-5614N IntAct:P47977
            MINT:MINT-504800 STRING:P47977 EnsemblFungi:YLR136C GeneID:850827
            KEGG:sce:YLR136C CYGD:YLR136c NextBio:967088 Genevestigator:P47977
            GermOnline:YLR136C Uniprot:P47977
        Length = 285

 Score = 97 (39.2 bits), Expect = 0.00018, P = 0.00018
 Identities = 31/121 (25%), Positives = 46/121 (38%)

Query:     3 VDEFPERPGQPE-CSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKGLPLRPGQNVC 61
             V E P++  + E C  F   G C Y S C++ H    +     C    K    +P    C
Sbjct:   162 VQETPKQLYKTELCESFTLKGSCPYGSKCQFAHGLGELKVKKSC----KNFRTKP----C 213

Query:    62 SYYSRYGICKFGPACKYDHPIHPDASAEYGLDPPPSFGDSTTRQETGMAGTGNGNGSDKN 121
               + + G C +G  C + H    D  A Y          ST++Q       G G+   KN
Sbjct:   214 VNWEKLGYCPYGRRCCFKHGDDNDI-AVYVKAGTYCNVSSTSKQSDEKRSNGRGSAKKKN 272

Query:   122 I 122
             +
Sbjct:   273 L 273


>TAIR|locus:2013758
            symbol:AT1G29560 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR000571 PROSITE:PS50103 SMART:SM00356
            EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0008270 GO:GO:0003676
            EMBL:AC068667 eggNOG:KOG1677 IPI:IPI00527997 RefSeq:NP_174249.2
            UniGene:At.73942 ProteinModelPortal:B3H4U9 PRIDE:B3H4U9
            EnsemblPlants:AT1G29560.1 GeneID:839833 KEGG:ath:AT1G29560
            TAIR:At1g29560 HOGENOM:HOG000064587 OMA:WRDSESR PhylomeDB:B3H4U9
            ProtClustDB:CLSN2682005 Genevestigator:B3H4U9 Uniprot:B3H4U9
        Length = 572

 Score = 101 (40.6 bits), Expect = 0.00019, P = 0.00019
 Identities = 27/75 (36%), Positives = 40/75 (53%)

Query:     7 PER-PGQPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKGLPLRPGQNVCSYYS 65
             P R PG+ EC + LR   C+   +C+Y+HP  ++P+          LP+R    +C Y+ 
Sbjct:   215 PVRWPGE-EC-WCLR---CRNGGSCRYNHP-TQLPQE---------LPVRNRLQICRYFL 259

Query:    66 RYGICKFGPACKYDH 80
             R G CKFG  C + H
Sbjct:   260 R-GYCKFGSVCGFQH 273


>UNIPROTKB|H9KVA5
            symbol:CPSF4L "Putative cleavage and
            polyadenylation-specificity factor subunit 4-like protein"
            species:9606 "Homo sapiens" [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103 SMART:SM00356
            GO:GO:0008270 GO:GO:0003676 EMBL:AC087301 HGNC:HGNC:33632
            ProteinModelPortal:H9KVA5 SMR:H9KVA5 PRIDE:H9KVA5
            Ensembl:ENST00000397671 Bgee:H9KVA5 Uniprot:H9KVA5
        Length = 152

 Score = 90 (36.7 bits), Expect = 0.00021, P = 0.00021
 Identities = 26/77 (33%), Positives = 35/77 (45%)

Query:    13 PECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKGL-----PLRPGQNV----CSY 63
             PEC ++ + GDC  K  C + H K    KS  C   D+G      PL   ++V    C  
Sbjct:    30 PECYFYSKFGDCSNKE-CSFLHVKPAF-KSQDCPWYDQGFCKDAGPLCKYRHVPRIMCLN 87

Query:    64 YSRYGICKFGPACKYDH 80
             Y   G C  GP C++ H
Sbjct:    88 YL-VGFCPEGPKCQFAH 103


>UNIPROTKB|D4A905
            symbol:Cpsf4 "Cleavage and polyadenylation-specificity
            factor subunit 4" species:10116 "Rattus norvegicus" [GO:0003676
            "nucleic acid binding" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098
            Pfam:PF00642 PROSITE:PS50103 PROSITE:PS50158 SMART:SM00343
            SMART:SM00356 RGD:620440 GO:GO:0046872 GO:GO:0008270 GO:GO:0003676
            Gene3D:4.10.60.10 SUPFAM:SSF57756 GeneTree:ENSGT00390000009627
            OMA:PLDQVTC OrthoDB:EOG4KH2VQ IPI:IPI00358639
            Ensembl:ENSRNOT00000038958 Uniprot:D4A905
        Length = 243

 Score = 95 (38.5 bits), Expect = 0.00022, P = 0.00022
 Identities = 26/79 (32%), Positives = 39/79 (49%)

Query:    13 PECSYFLRTGDCKYKSNCKYHH--PKNRIPKSPPCTLSDKGL----PL----RPGQNVCS 62
             PEC ++ + G+C  K  C + H  P+++I     C   D+G     PL    R  + +C 
Sbjct:    94 PECYFYSKFGECSNKE-CPFLHIDPESKIKD---CPWYDRGFCKHGPLCRHRRTRRVICV 149

Query:    63 YYSRYGICKFGPACKYDHP 81
              Y   G C  GP+CK+ HP
Sbjct:   150 NYL-VGFCPEGPSCKFMHP 167


>UNIPROTKB|C9JEV9
            symbol:CPSF4 "Cleavage and polyadenylation-specificity
            factor subunit 4" species:9606 "Homo sapiens" [GO:0003676 "nucleic
            acid binding" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IDA] [GO:0005730
            "nucleolus" evidence=IDA] [GO:0005739 "mitochondrion" evidence=IDA]
            InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098 Pfam:PF00642
            PROSITE:PS50103 PROSITE:PS50158 SMART:SM00343 SMART:SM00356
            GO:GO:0005739 GO:GO:0005634 GO:GO:0046872 GO:GO:0008270
            GO:GO:0003676 Gene3D:4.10.60.10 SUPFAM:SSF57756
            HOGENOM:HOG000212457 HGNC:HGNC:2327 EMBL:AC073063 IPI:IPI00927478
            ProteinModelPortal:C9JEV9 SMR:C9JEV9 STRING:C9JEV9
            Ensembl:ENST00000451876 ArrayExpress:C9JEV9 Bgee:C9JEV9
            Uniprot:C9JEV9
        Length = 211

 Score = 93 (37.8 bits), Expect = 0.00026, P = 0.00026
 Identities = 26/71 (36%), Positives = 35/71 (49%)

Query:    15 CSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKGLPL---RPGQNV-CSYYSRYGIC 70
             C ++LR G CK    C++ H  + + K P C    K  PL   R  + V C  Y   G C
Sbjct:    68 CKHWLR-GLCKKGDQCEFLHEYD-MTKMPECYFYSKFGPLCRHRHTRRVICVNYL-VGFC 124

Query:    71 KFGPACKYDHP 81
               GP+CK+ HP
Sbjct:   125 PEGPSCKFMHP 135


>MGI|MGI:1861602
            symbol:Cpsf4 "cleavage and polyadenylation specific factor
            4" species:10090 "Mus musculus" [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0003723 "RNA binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0005847 "mRNA cleavage and
            polyadenylation specificity factor complex" evidence=ISO]
            [GO:0006397 "mRNA processing" evidence=IEA] [GO:0008150
            "biological_process" evidence=ND] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0046872 "metal ion binding" evidence=IEA]
            InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098 Pfam:PF00642
            PROSITE:PS50103 PROSITE:PS50158 SMART:SM00343 SMART:SM00356
            MGI:MGI:1861602 GO:GO:0046872 GO:GO:0008270 GO:GO:0006397
            GO:GO:0003723 Gene3D:4.10.60.10 SUPFAM:SSF57756 eggNOG:COG5084
            GO:GO:0005847 HOGENOM:HOG000212457 HOVERGEN:HBG051108 CTD:10898
            GeneTree:ENSGT00390000009627 KO:K14404 OrthoDB:EOG4KH2VQ
            EMBL:AK046064 EMBL:AF033201 EMBL:BC057067 IPI:IPI00309761
            IPI:IPI00380450 IPI:IPI01027761 RefSeq:NP_848671.1
            UniGene:Mm.196884 ProteinModelPortal:Q8BQZ5 SMR:Q8BQZ5
            STRING:Q8BQZ5 PhosphoSite:Q8BQZ5 PaxDb:Q8BQZ5 PRIDE:Q8BQZ5
            Ensembl:ENSMUST00000070487 GeneID:54188 KEGG:mmu:54188
            UCSC:uc009amj.1 ChiTaRS:CPSF4 NextBio:311022 Bgee:Q8BQZ5
            CleanEx:MM_CPSF4 Genevestigator:Q8BQZ5
            GermOnline:ENSMUSG00000029625 Uniprot:Q8BQZ5
        Length = 211

 Score = 93 (37.8 bits), Expect = 0.00026, P = 0.00026
 Identities = 26/71 (36%), Positives = 35/71 (49%)

Query:    15 CSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKGLPL---RPGQNV-CSYYSRYGIC 70
             C ++LR G CK    C++ H  + + K P C    K  PL   R  + V C  Y   G C
Sbjct:    68 CKHWLR-GLCKKGDQCEFLHEYD-MTKMPECYFYSKFGPLCRHRHTRRVICVNYL-VGFC 124

Query:    71 KFGPACKYDHP 81
               GP+CK+ HP
Sbjct:   125 PEGPSCKFMHP 135


>UNIPROTKB|E2RBK7
            symbol:CPSF4 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=IEA] [GO:0005739
            "mitochondrion" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
            InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098 Pfam:PF00642
            PROSITE:PS50103 PROSITE:PS50158 SMART:SM00343 SMART:SM00356
            GO:GO:0046872 GO:GO:0008270 GO:GO:0003676 Gene3D:4.10.60.10
            SUPFAM:SSF57756 GO:GO:0005847 GeneTree:ENSGT00390000009627
            EMBL:AAEX03004276 Ensembl:ENSCAFT00000023892 Uniprot:E2RBK7
        Length = 212

 Score = 93 (37.8 bits), Expect = 0.00026, P = 0.00026
 Identities = 26/71 (36%), Positives = 35/71 (49%)

Query:    15 CSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKGLPL---RPGQNV-CSYYSRYGIC 70
             C ++LR G CK    C++ H  + + K P C    K  PL   R  + V C  Y   G C
Sbjct:    68 CKHWLR-GLCKKGDQCEFLHEYD-MTKMPECYFYSKFGPLCRHRHTRRVICVNYL-VGFC 124

Query:    71 KFGPACKYDHP 81
               GP+CK+ HP
Sbjct:   125 PEGPSCKFMHP 135


>UNIPROTKB|G3V789
            symbol:Mkrn1 "Protein Mkrn1" species:10116 "Rattus
            norvegicus" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0008270 "zinc ion binding" evidence=IEA] InterPro:IPR001841
            InterPro:IPR000571 Pfam:PF00642 Pfam:PF13639 PROSITE:PS50089
            PROSITE:PS50103 SMART:SM00184 SMART:SM00356 Prosite:PS00518
            GO:GO:0046872 GO:GO:0008270 GO:GO:0003676 Gene3D:3.30.40.10
            InterPro:IPR013083 EMBL:CH473959 InterPro:IPR017907
            InterPro:IPR026290 PANTHER:PTHR11224 GeneTree:ENSGT00390000014093
            OMA:YQRGCCA ProteinModelPortal:G3V789 Ensembl:ENSRNOT00000013124
            Uniprot:G3V789
        Length = 481

 Score = 75 (31.5 bits), Expect = 0.00033, Sum P(2) = 0.00033
 Identities = 17/66 (25%), Positives = 28/66 (42%)

Query:    60 VCSYYSRYGICKFGPACKYDH--PIHPD--ASAEYGLDPPPSFGDSTTRQETGMAGTGNG 115
             VC Y+ R G C +G  C+Y+H  P+  +   + +    P P+   S       +A    G
Sbjct:    89 VCKYFQR-GYCVYGDRCRYEHSKPLKQEEVTATDLSAKPSPAASSSLPSGVGSLAEMNPG 147

Query:   116 NGSDKN 121
                 +N
Sbjct:   148 EAESRN 153

 Score = 68 (29.0 bits), Expect = 0.00033, Sum P(2) = 0.00033
 Identities = 14/40 (35%), Positives = 18/40 (45%)

Query:    12 QPECSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDKG 51
             Q  C YF+  G CK   NC+Y H  +  P    C    +G
Sbjct:    58 QVTCRYFMH-GVCKEGDNCRYSHDLSDSPYGVVCKYFQRG 96


>UNIPROTKB|B7Z7B0
            symbol:CPSF4 "Cleavage and polyadenylation-specificity
            factor subunit 4" species:9606 "Homo sapiens" [GO:0003676 "nucleic
            acid binding" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098
            Pfam:PF00642 PROSITE:PS50103 PROSITE:PS50158 SMART:SM00343
            SMART:SM00356 GO:GO:0046872 GO:GO:0008270 GO:GO:0003676
            Gene3D:4.10.60.10 SUPFAM:SSF57756 HOGENOM:HOG000212457
            HOVERGEN:HBG051108 OrthoDB:EOG4KH2VQ UniGene:Hs.489287
            HGNC:HGNC:2327 EMBL:AC073063 EMBL:AK301745 IPI:IPI00924476
            SMR:B7Z7B0 STRING:B7Z7B0 Ensembl:ENST00000441580 UCSC:uc011kix.2
            Uniprot:B7Z7B0
        Length = 191

 Score = 91 (37.1 bits), Expect = 0.00034, P = 0.00034
 Identities = 27/79 (34%), Positives = 39/79 (49%)

Query:    13 PECSYFLRTGDCKYKSNCKYHH--PKNRIPKSPPCTLSDKGL----PL---RPGQNV-CS 62
             PEC ++ + G+C  K  C + H  P+++I     C   D+G     PL   R  + V C 
Sbjct:    41 PECYFYSKFGECSNKE-CPFLHIDPESKIKD---CPWYDRGFCKHGPLCRHRHTRRVICV 96

Query:    63 YYSRYGICKFGPACKYDHP 81
              Y   G C  GP+CK+ HP
Sbjct:    97 NYL-VGFCPEGPSCKFMHP 114


>UNIPROTKB|C9K0K2
            symbol:CPSF4 "Cleavage and polyadenylation-specificity
            factor subunit 4" species:9606 "Homo sapiens" [GO:0003676 "nucleic
            acid binding" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR000571 Pfam:PF00642 PROSITE:PS50103
            SMART:SM00356 GO:GO:0008270 GO:GO:0003676 HOGENOM:HOG000212457
            HGNC:HGNC:2327 EMBL:AC073063 IPI:IPI01014332
            ProteinModelPortal:C9K0K2 SMR:C9K0K2 STRING:C9K0K2
            Ensembl:ENST00000412686 ArrayExpress:C9K0K2 Bgee:C9K0K2
            Uniprot:C9K0K2
        Length = 112

 Score = 88 (36.0 bits), Expect = 0.00035, P = 0.00035
 Identities = 25/78 (32%), Positives = 37/78 (47%)

Query:    15 CSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTL-------SDKGLP---LRPGQNV--CS 62
             C ++LR G CK    C++ H  + + K P C         S+K  P   + P   +  C 
Sbjct:    15 CKHWLR-GLCKKGDQCEFLHEYD-MTKMPECYFYSKFGECSNKECPFLHIDPESKIKDCP 72

Query:    63 YYSRYGICKFGPACKYDH 80
             +Y R G CK GP C++ H
Sbjct:    73 WYDR-GFCKHGPLCRHRH 89


>UNIPROTKB|A6NMK7
            symbol:CPSF4L "Putative cleavage and polyadenylation
            specificity factor subunit 4-like protein" species:9606 "Homo
            sapiens" [GO:0008270 "zinc ion binding" evidence=IEA] [GO:0003723
            "RNA binding" evidence=IEA] InterPro:IPR000571 Pfam:PF00642
            PROSITE:PS50103 SMART:SM00356 GO:GO:0046872 GO:GO:0008270
            GO:GO:0003723 eggNOG:COG5084 EMBL:AC087301 EMBL:BC157870
            IPI:IPI00376104 RefSeq:NP_001123357.1 UniGene:Hs.534707
            ProteinModelPortal:A6NMK7 SMR:A6NMK7 PhosphoSite:A6NMK7
            PRIDE:A6NMK7 Ensembl:ENST00000344935 GeneID:642843 KEGG:hsa:642843
            UCSC:uc010dfk.1 CTD:642843 GeneCards:GC17M071244 HGNC:HGNC:33632
            HPA:HPA044047 neXtProt:NX_A6NMK7 PharmGKB:PA162382768
            HOGENOM:HOG000212457 HOVERGEN:HBG051108 OMA:HVKPASK
            GenomeRNAi:642843 NextBio:114229 Bgee:A6NMK7 CleanEx:HS_CPSF4L
            Genevestigator:A6NMK7 Uniprot:A6NMK7
        Length = 179

 Score = 89 (36.4 bits), Expect = 0.00048, P = 0.00048
 Identities = 26/78 (33%), Positives = 37/78 (47%)

Query:    15 CSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTLSDK-G---------LPLRPG--QNVCS 62
             C ++LR G CK   +CK+ H  + + + P C    K G         L ++P      C 
Sbjct:    68 CKHWLR-GLCKKGDHCKFLHQYD-LTRMPECYFYSKFGDCSNKECSFLHVKPAFKSQDCP 125

Query:    63 YYSRYGICKFGPACKYDH 80
             +Y + G CK GP CKY H
Sbjct:   126 WYDQ-GFCKDGPLCKYRH 142


>UNIPROTKB|O19137
            symbol:CPSF4 "Cleavage and polyadenylation specificity
            factor subunit 4" species:9913 "Bos taurus" [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=ISS] [GO:0005739 "mitochondrion" evidence=IEA] [GO:0006397
            "mRNA processing" evidence=IEA] [GO:0003723 "RNA binding"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098 Pfam:PF00642
            PROSITE:PS50103 PROSITE:PS50158 SMART:SM00343 SMART:SM00356
            GO:GO:0046872 GO:GO:0008270 GO:GO:0006397 GO:GO:0003723
            Gene3D:4.10.60.10 SUPFAM:SSF57756 eggNOG:COG5084 HSSP:P47974
            GO:GO:0005847 HOGENOM:HOG000212457 HOVERGEN:HBG051108 EMBL:U96448
            IPI:IPI00715166 RefSeq:NP_776367.1 UniGene:Bt.55595
            ProteinModelPortal:O19137 SMR:O19137 STRING:O19137
            Ensembl:ENSBTAT00000002701 GeneID:280875 KEGG:bta:280875 CTD:10898
            GeneTree:ENSGT00390000009627 InParanoid:O19137 KO:K14404
            OMA:PLDQVTC OrthoDB:EOG4KH2VQ NextBio:20805014 Uniprot:O19137
        Length = 243

 Score = 91 (37.1 bits), Expect = 0.00060, P = 0.00060
 Identities = 27/79 (34%), Positives = 39/79 (49%)

Query:    13 PECSYFLRTGDCKYKSNCKYHH--PKNRIPKSPPCTLSDKGL----PL---RPGQNV-CS 62
             PEC ++ + G+C  K  C + H  P+++I     C   D+G     PL   R  + V C 
Sbjct:    94 PECYFYSKFGECSNKE-CPFLHIDPESKIKD---CPWYDRGFCKHGPLCRHRHTRRVICV 149

Query:    63 YYSRYGICKFGPACKYDHP 81
              Y   G C  GP+CK+ HP
Sbjct:   150 NYL-VGFCPEGPSCKFMHP 167


>UNIPROTKB|I3LCK9
            symbol:LOC100738395 "Uncharacterized protein" species:9823
            "Sus scrofa" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] InterPro:IPR000571
            InterPro:IPR001878 Pfam:PF00098 Pfam:PF00642 PROSITE:PS50103
            PROSITE:PS50158 SMART:SM00343 SMART:SM00356 GO:GO:0046872
            GO:GO:0008270 GO:GO:0003676 Gene3D:4.10.60.10 SUPFAM:SSF57756
            GeneTree:ENSGT00390000009627 OMA:PLDQVTC EMBL:FP103031
            Ensembl:ENSSSCT00000031676 Uniprot:I3LCK9
        Length = 243

 Score = 91 (37.1 bits), Expect = 0.00060, P = 0.00060
 Identities = 27/79 (34%), Positives = 39/79 (49%)

Query:    13 PECSYFLRTGDCKYKSNCKYHH--PKNRIPKSPPCTLSDKGL----PL---RPGQNV-CS 62
             PEC ++ + G+C  K  C + H  P+++I     C   D+G     PL   R  + V C 
Sbjct:    68 PECYFYSKFGECSNKE-CPFLHIDPESKIKD---CPWYDRGFCKHGPLCRHRHTRRVICV 123

Query:    63 YYSRYGICKFGPACKYDHP 81
              Y   G C  GP+CK+ HP
Sbjct:   124 NYL-VGFCPEGPSCKFMHP 141


>RGD|620440
            symbol:Cpsf4 "cleavage and polyadenylation specific factor 4"
            species:10116 "Rattus norvegicus" [GO:0003723 "RNA binding"
            evidence=IEA] [GO:0005847 "mRNA cleavage and polyadenylation
            specificity factor complex" evidence=ISS] [GO:0006397 "mRNA
            processing" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR000571 InterPro:IPR001878 Pfam:PF00098
            Pfam:PF00642 PROSITE:PS50103 PROSITE:PS50158 SMART:SM00343
            SMART:SM00356 RGD:620440 GO:GO:0046872 GO:GO:0008270 GO:GO:0006397
            GO:GO:0003723 Gene3D:4.10.60.10 SUPFAM:SSF57756 eggNOG:COG5084
            HSSP:P47974 GO:GO:0005847 HOGENOM:HOG000212457 HOVERGEN:HBG051108
            CTD:10898 GeneTree:ENSGT00390000009627 KO:K14404 OrthoDB:EOG4KH2VQ
            EMBL:BC089824 IPI:IPI00553898 RefSeq:NP_001012351.1
            UniGene:Rn.104788 ProteinModelPortal:Q5FVR7 SMR:Q5FVR7
            Ensembl:ENSRNOT00000042474 GeneID:304277 KEGG:rno:304277
            InParanoid:Q5FVR7 NextBio:652764 ArrayExpress:Q5FVR7
            Genevestigator:Q5FVR7 GermOnline:ENSRNOG00000025217 Uniprot:Q5FVR7
        Length = 243

 Score = 91 (37.1 bits), Expect = 0.00060, P = 0.00060
 Identities = 27/79 (34%), Positives = 39/79 (49%)

Query:    13 PECSYFLRTGDCKYKSNCKYHH--PKNRIPKSPPCTLSDKGL----PL---RPGQNV-CS 62
             PEC ++ + G+C  K  C + H  P+++I     C   D+G     PL   R  + V C 
Sbjct:    94 PECYFYSKFGECSNKE-CPFLHIDPESKIKD---CPWYDRGFCKHGPLCRHRHTRRVICV 149

Query:    63 YYSRYGICKFGPACKYDHP 81
              Y   G C  GP+CK+ HP
Sbjct:   150 NYL-VGFCPEGPSCKFMHP 167


>UNIPROTKB|J9P398
            symbol:CPSF4 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] InterPro:IPR000571
            InterPro:IPR001878 Pfam:PF00098 Pfam:PF00642 PROSITE:PS50103
            PROSITE:PS50158 SMART:SM00343 SMART:SM00356 GO:GO:0046872
            GO:GO:0008270 GO:GO:0003676 Gene3D:4.10.60.10 SUPFAM:SSF57756
            CTD:10898 GeneTree:ENSGT00390000009627 KO:K14404 OMA:PLDQVTC
            EMBL:AAEX03004276 RefSeq:XP_850149.1 ProteinModelPortal:J9P398
            Ensembl:ENSCAFT00000043832 GeneID:489859 KEGG:cfa:489859
            Uniprot:J9P398
        Length = 269

 Score = 91 (37.1 bits), Expect = 0.00073, P = 0.00073
 Identities = 27/79 (34%), Positives = 39/79 (49%)

Query:    13 PECSYFLRTGDCKYKSNCKYHH--PKNRIPKSPPCTLSDKGL----PL---RPGQNV-CS 62
             PEC ++ + G+C  K  C + H  P+++I     C   D+G     PL   R  + V C 
Sbjct:    94 PECYFYSKFGECSNKE-CPFLHIDPESKIKD---CPWYDRGFCKHGPLCRHRHTRRVICV 149

Query:    63 YYSRYGICKFGPACKYDHP 81
              Y   G C  GP+CK+ HP
Sbjct:   150 NYL-VGFCPEGPSCKFMHP 167


>UNIPROTKB|O95639
            symbol:CPSF4 "Cleavage and polyadenylation specificity
            factor subunit 4" species:9606 "Homo sapiens" [GO:0008270 "zinc ion
            binding" evidence=IEA] [GO:0003723 "RNA binding" evidence=IEA]
            [GO:0006397 "mRNA processing" evidence=IEA] [GO:0005847 "mRNA
            cleavage and polyadenylation specificity factor complex"
            evidence=IDA] [GO:0005654 "nucleoplasm" evidence=TAS] [GO:0019048
            "virus-host interaction" evidence=TAS] [GO:0019054 "modulation by
            virus of host cellular process" evidence=TAS] [GO:0019058 "viral
            infectious cycle" evidence=TAS] [GO:0046778 "modification by virus
            of host mRNA processing" evidence=TAS] [GO:0005634 "nucleus"
            evidence=IDA] [GO:0005730 "nucleolus" evidence=IDA] [GO:0005739
            "mitochondrion" evidence=IDA] InterPro:IPR000571 InterPro:IPR001878
            Pfam:PF00098 Pfam:PF00642 PROSITE:PS50103 PROSITE:PS50158
            SMART:SM00343 SMART:SM00356 GO:GO:0005739 Reactome:REACT_116125
            GO:GO:0046872 GO:GO:0008270 GO:GO:0006397 GO:GO:0003723
            EMBL:CH236956 EMBL:CH471091 GO:GO:0019058 Gene3D:4.10.60.10
            SUPFAM:SSF57756 eggNOG:COG5084 GO:GO:0005847 HOGENOM:HOG000212457
            HOVERGEN:HBG051108 CTD:10898 KO:K14404 OMA:PLDQVTC
            OrthoDB:EOG4KH2VQ EMBL:U79569 EMBL:CR542161 EMBL:EF191081
            EMBL:BC003101 EMBL:BC050738 IPI:IPI00009137 IPI:IPI00029707
            IPI:IPI00375469 RefSeq:NP_001075028.1 RefSeq:NP_006684.1
            UniGene:Hs.489287 PDB:2D9N PDB:2RHK PDBsum:2D9N PDBsum:2RHK
            ProteinModelPortal:O95639 SMR:O95639 DIP:DIP-48675N IntAct:O95639
            MINT:MINT-1429837 STRING:O95639 PhosphoSite:O95639 PaxDb:O95639
            PRIDE:O95639 DNASU:10898 Ensembl:ENST00000292476
            Ensembl:ENST00000436336 GeneID:10898 KEGG:hsa:10898 UCSC:uc003uqi.3
            UCSC:uc003uqj.3 UCSC:uc003uqk.3 GeneCards:GC07P099036
            HGNC:HGNC:2327 HPA:HPA049094 MIM:603052 neXtProt:NX_O95639
            PharmGKB:PA26844 InParanoid:O95639 PhylomeDB:O95639
            EvolutionaryTrace:O95639 GenomeRNAi:10898 NextBio:41385
            ArrayExpress:O95639 Bgee:O95639 CleanEx:HS_CPSF4
            Genevestigator:O95639 GermOnline:ENSG00000160917 GO:GO:0046778
            Uniprot:O95639
        Length = 269

 Score = 91 (37.1 bits), Expect = 0.00073, P = 0.00073
 Identities = 27/79 (34%), Positives = 39/79 (49%)

Query:    13 PECSYFLRTGDCKYKSNCKYHH--PKNRIPKSPPCTLSDKGL----PL---RPGQNV-CS 62
             PEC ++ + G+C  K  C + H  P+++I     C   D+G     PL   R  + V C 
Sbjct:    94 PECYFYSKFGECSNKE-CPFLHIDPESKIKD---CPWYDRGFCKHGPLCRHRHTRRVICV 149

Query:    63 YYSRYGICKFGPACKYDHP 81
              Y   G C  GP+CK+ HP
Sbjct:   150 NYL-VGFCPEGPSCKFMHP 167


>UNIPROTKB|F1NHU3
            symbol:ZC3H3 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0008270 "zinc ion binding" evidence=IEA] InterPro:IPR000571
            Pfam:PF00642 PROSITE:PS50103 SMART:SM00356 GO:GO:0005634
            GO:GO:0008270 GO:GO:0006378 GO:GO:0003676
            GeneTree:ENSGT00390000009627 GO:GO:0016973 GO:GO:0010793
            EMBL:AADN02037362 EMBL:AADN02037363 IPI:IPI00580233
            Ensembl:ENSGALT00000028087 OMA:CNRGESC Uniprot:F1NHU3
        Length = 377

 Score = 93 (37.8 bits), Expect = 0.00077, P = 0.00077
 Identities = 36/95 (37%), Positives = 46/95 (48%)

Query:    13 PECSYFLRTGDCKYKSNCKYHHPK-NRIPKSPPCTLSDKGL-PL-----RPGQNVCSYYS 65
             P CSYFL+ G C   SNC Y H   +R  K+  C    KG  P+     +    VC  ++
Sbjct:   169 PVCSYFLK-GICN-NSNCPYSHVYVSR--KAEVCQDFLKGYCPMGEKCKKKHTLVCPDFA 224

Query:    66 RYGICKFGPACKYDHPI---H-P-DASAEYGLDPP 95
             + GIC  G  CK  HP    H P + + E G DPP
Sbjct:   225 KKGICPRGACCKLLHPKKKRHSPGNCAGEDG-DPP 258


>UNIPROTKB|E2RBM0
            symbol:CPSF4 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] InterPro:IPR000571
            Pfam:PF00642 PROSITE:PS50103 SMART:SM00356 GO:GO:0008270
            GO:GO:0003676 GeneTree:ENSGT00390000009627 EMBL:AAEX03004276
            Ensembl:ENSCAFT00000023887 NextBio:20862973 Uniprot:E2RBM0
        Length = 164

 Score = 88 (36.0 bits), Expect = 0.00092, P = 0.00092
 Identities = 25/78 (32%), Positives = 37/78 (47%)

Query:    15 CSYFLRTGDCKYKSNCKYHHPKNRIPKSPPCTL-------SDKGLP---LRPGQNV--CS 62
             C ++LR G CK    C++ H  + + K P C         S+K  P   + P   +  C 
Sbjct:    66 CKHWLR-GLCKKGDQCEFLHEYD-MTKMPECYFYSKFGECSNKECPFLHIDPESKIKDCP 123

Query:    63 YYSRYGICKFGPACKYDH 80
             +Y R G CK GP C++ H
Sbjct:   124 WYDR-GFCKHGPLCRHRH 140


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.316   0.139   0.460    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      122       122   0.00091  102 3  12 22  0.43    31
                                                     29  0.49    32
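
Note: the bit scores printed next to each raw score in the alignments can be reproduced from the gapped Lambda and K values in the Q=9,R=2 row of the table above, using the standard Karlin-Altschul conversion bits = (Lambda * S - ln K) / ln 2. The sketch below is an illustration only, not part of the BLAST output; the function names are hypothetical. It also shows the usual relation P = 1 - exp(-E) for single-segment hits, which is why Expect and P agree to the printed precision at these small values. The "Sum P(2)" values for multi-segment hits come from sum statistics and are not covered here.

```python
import math

# Gapped Karlin-Altschul parameters copied from the "As Used" columns
# of the Q=9,R=2 row in the parameter table of this report.
LAMBDA_GAPPED = 0.244
K_GAPPED = 0.0300


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def p_from_expect(expect):
    """Probability of at least one hit this good by chance: 1 - exp(-E)."""
    return -math.expm1(-expect)


# Raw scores that appear in the alignments of this section.
for raw in (93, 91, 89, 88, 75, 68):
    print(raw, round(bit_score(raw), 1))
```

With these parameters a raw score of 93 gives 37.8 bits and 91 gives 37.1 bits, matching the "Score =" lines in the records above.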


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  41
  No. of states in DFA:  485 (52 KB)
  Total size of DFA:  106 KB (2074 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  11.62u 0.18s 11.80t   Elapsed:  00:00:00
  Total cpu time:  11.63u 0.18s 11.81t   Elapsed:  00:00:00
  Start:  Fri May 10 23:18:40 2013   End:  Fri May 10 23:18:40 2013
