BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>001486
MEVQISNLESLSAEPNSPVGFGKQGLEEFIAEGSLDFDEWTSLLSEIENSCPDDIEMIGL
VYDSFLAEFPLCYGYWRKYADHKARLCSIDKVVEVFERAVQSATYSVDVWFHYCSLSMST
FEDPNDVRRLFKRALSFVGKDYLCHTMWDKYIEFEISQQRWSSLAQIFVQTLRFPSKKLH
HYYDSFKKLAGAWKEELECESDSAMEFQSELVLEGEVPAYYKDDETSSVIKDLLDPSVDL
VRSKAIQKYRFIGEQIYKEASQLDEKINCFENLIRRPYFHVKPLDDIQLKNWHDYLSFAE
KQGDFDWVVKLYERCLIPCADYPEFWMRYVDFMESKGGREIASYALDRATQIFLKRLPVI
HLFNARYKEQIGDTSAARAAFPESYIDSDSRFIEKVTFKANMERRLGNFVAACDTYKEAL
ETAAEQRKFHTLPLLYVQFSRLTYTTTGSADNARDILIDGIKHVPNCKLLLEELIKFTMV
HGGRSHISIVDAVISNALYSRPDVLKVFSLEDVEDISSLYLQFLDLCGTIHDIRNAWNQH
IKLFPHTVRTAYECPGRETKSLRAFIRGKRESNVASLPQPFESEHLMPSASQDKKFSPPE
KSDSESGDDATSLPSNQKSPLPENHDIRSDGAEVDILLSGEADSSSQDRMQQVPPEAAEQ
HSQDACDPEVLSLDLAHQVTNENETVQASEAFSEEDDVQREYEHESKKDLKPLSLEGLSL
DPGGNDSPGSLCATSHECEAPQKTNFSHESMLKSEAPRETSLSDGSVLGASQNNNGSHFA
PSSMGTQASSSAPIQTRTVSPSSSASHQNFIPEAHSHPQTPANSGRNWHEQQNPDRVHRD
LRFGYRGHSHKRQHQQRRFSSQRYPRNESGDQMPMNSRFPSQPLPSQNPQAQQGSQAQSQ
FLHSLTAQAWPMQNMQQQTFASASQSEVPAQPVFYPQAQMSQYPSQSSEQQGLLQSNLAY
NQMWQYYYYQQQQQQQLFLQQQHLQLQQQHLQPLQQQQFVQQQQYQQQHSLYLQQQPQHQ
QLEQYQMQQQVQQQDQHPPQQWQLEQRQSEQQIGMSQIEKWNNSSKQVCD

High Scoring Gene Products

Symbol, full name Information P value
PRP39-2 protein from Arabidopsis thaliana 5.6e-167
Prpf39
PRP39 pre-mRNA processing factor 39 homolog (S. cerevisiae)
gene from Rattus norvegicus 1.0e-50
PRPF39
Uncharacterized protein
protein from Gallus gallus 1.2e-50
PRPF39
Uncharacterized protein
protein from Sus scrofa 8.8e-50
PRPF39
Uncharacterized protein
protein from Canis lupus familiaris 1.5e-49
PRPF39
Pre-mRNA-processing factor 39
protein from Homo sapiens 1.9e-49
MGG_04558
Pre-mRNA-processing factor 39
protein from Magnaporthe oryzae 70-15 1.9e-49
Prpf39
PRP39 pre-mRNA processing factor 39 homolog (yeast)
protein from Mus musculus 7.4e-48
CG1646 protein from Drosophila melanogaster 1.1e-45
prpf39
pre-mRNA processing factor 39
gene from Dictyostelium discoideum 2.1e-44
prpf39
PRP39 pre-mRNA processing factor 39 homolog (yeast)
gene_product from Danio rerio 3.0e-43
PRPF39
PRPF39 protein
protein from Bos taurus 6.7e-38
PRPF39
Uncharacterized protein
protein from Canis lupus familiaris 2.8e-27
PRP39
U1 snRNP protein involved in splicing
gene from Saccharomyces cerevisiae 6.5e-22
F25B4.5 gene from Caenorhabditis elegans 1.1e-14
PRP39 gene_product from Candida albicans 6.8e-12
PRP39
Potential spliceosomal U1 snRNP protein Prp39
protein from Candida albicans SC5314 6.8e-12
cstf3
cleavage stimulation factor subunit 3
gene from Dictyostelium discoideum 2.6e-09
PRP42 gene_product from Candida albicans 4.1e-09
PRP42
Potential spliceosomal U1 snRNP protein
protein from Candida albicans SC5314 4.1e-09
PRP42
U1 snRNP protein involved in splicing
gene from Saccharomyces cerevisiae 7.8e-09
CSTF3
Cleavage stimulation factor subunit 3
protein from Homo sapiens 6.4e-08
AT3G51110 protein from Arabidopsis thaliana 1.2e-07
crnkl1
crooked neck pre-mRNA splicing factor-like 1 (Drosophila)
gene_product from Danio rerio 3.0e-06
CG6197 protein from Drosophila melanogaster 3.3e-06
cstf3
cleavage stimulation factor, 3' pre-RNA, subunit 3
gene_product from Danio rerio 5.3e-06
CSTF3
Uncharacterized protein
protein from Canis lupus familiaris 6.8e-06
CSTF3
Cleavage stimulation factor subunit 3
protein from Homo sapiens 6.8e-06
Cstf3
cleavage stimulation factor, 3' pre-RNA, subunit 3
protein from Mus musculus 6.8e-06
CSTF3
Uncharacterized protein
protein from Gallus gallus 6.8e-06
CRNKL1
Uncharacterized protein
protein from Bos taurus 2.2e-05
CRNKL1
Uncharacterized protein
protein from Canis lupus familiaris 2.4e-05
CSTF3
Uncharacterized protein
protein from Bos taurus 2.6e-05
CSTF77 protein from Arabidopsis thaliana 3.5e-05
suf-1 gene from Caenorhabditis elegans 9.4e-05
AT5G45990 protein from Arabidopsis thaliana 0.00011
CRNKL1
Uncharacterized protein
protein from Canis lupus familiaris 0.00012
DDB_G0295719
unknown
gene from Dictyostelium discoideum 0.00012
CRNKL1
Crooked neck-like protein 1
protein from Homo sapiens 0.00013
Crnkl1
Crn, crooked neck-like 1 (Drosophila)
protein from Mus musculus 0.00014
Crnkl1
crooked neck pre-mRNA splicing factor-like 1 (Drosophila)
gene from Rattus norvegicus 0.00014
xab2
TPR-like helical domain-containing protein
gene from Dictyostelium discoideum 0.00014
CRNKL1
Crooked neck-like protein 1
protein from Homo sapiens 0.00014
SRP40
Nucleolar serine-rich protein
gene from Saccharomyces cerevisiae 0.00018
SART3
Uncharacterized protein
protein from Gallus gallus 0.00021
crn
crooked neck
protein from Drosophila melanogaster 0.00024
AT5G41770 protein from Arabidopsis thaliana 0.00025
SART3
Uncharacterized protein
protein from Gallus gallus 0.00030
DDB_G0278819
HAT repeat-containing protein
gene from Dictyostelium discoideum 0.00031
DDB_G0271670 gene from Dictyostelium discoideum 0.00042
M03F8.3 gene from Caenorhabditis elegans 0.00056
G3MZB1
Uncharacterized protein
protein from Bos taurus 0.00081

The BLAST search returned 1 gene product which did not match your query constraints. Please see the full BLAST report below for the details.

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  001486
        (1070 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2170453 - symbol:PRP39-2 species:3702 "Arabido...  1609  5.6e-167  2
ASPGD|ASPL0000046692 - symbol:AN1635 species:162425 "Emer...   556  4.8e-53   1
POMBASE|SPBC4B4.09 - symbol:usp105 "U1 snRNP-associated p...   550  2.2e-52   1
RGD|1308702 - symbol:Prpf39 "PRP39 pre-mRNA processing fa...   521  1.0e-50   2
UNIPROTKB|E1C8G8 - symbol:PRPF39 "Uncharacterized protein...   534  1.2e-50   1
UNIPROTKB|F1SI15 - symbol:PRPF39 "Uncharacterized protein...   526  8.8e-50   1
UNIPROTKB|F1PV57 - symbol:PRPF39 "Uncharacterized protein...   524  1.5e-49   1
UNIPROTKB|Q86UA1 - symbol:PRPF39 "Pre-mRNA-processing fac...   523  1.9e-49   1
UNIPROTKB|G4MRU5 - symbol:MGG_04558 "Pre-mRNA-processing ...   523  1.9e-49   1
MGI|MGI:104602 - symbol:Prpf39 "PRP39 pre-mRNA processing...   496  7.4e-48   2
FB|FBgn0039600 - symbol:CG1646 species:7227 "Drosophila m...   287  1.1e-45   3
DICTYBASE|DDB_G0283307 - symbol:prpf39 "pre-mRNA processi...   330  2.1e-44   3
ZFIN|ZDB-GENE-030616-420 - symbol:prpf39 "PRP39 pre-mRNA ...   484  3.0e-43   1
UNIPROTKB|A8E4M9 - symbol:PRPF39 "Uncharacterized protein...   416  6.7e-38   1
UNIPROTKB|D4A0B1 - symbol:D4A0B1 "Uncharacterized protein...   405  1.0e-36   1
UNIPROTKB|E2RPB9 - symbol:PRPF39 "Uncharacterized protein...   317  2.8e-27   1
SGD|S000004509 - symbol:PRP39 "U1 snRNP protein involved ...   208  6.5e-22   2
WB|WBGene00017768 - symbol:F25B4.5 species:6239 "Caenorha...   217  1.1e-14   3
CGD|CAL0004777 - symbol:PRP39 species:5476 "Candida albic...   191  6.8e-12   2
UNIPROTKB|Q5ALT2 - symbol:PRP39 "Potential spliceosomal U...   191  6.8e-12   2
DICTYBASE|DDB_G0286645 - symbol:cstf3 "cleavage stimulati...   141  2.6e-09   3
CGD|CAL0001111 - symbol:PRP42 species:5476 "Candida albic...   168  4.1e-09   1
UNIPROTKB|Q5A014 - symbol:PRP42 "Potential spliceosomal U...   168  4.1e-09   1
SGD|S000002643 - symbol:PRP42 "U1 snRNP protein involved ...   168  7.8e-09   1
UNIPROTKB|E9PLP8 - symbol:CSTF3 "Cleavage stimulation fac...   136  6.4e-08   1
TAIR|locus:2080853 - symbol:AT3G51110 species:3702 "Arabi...   155  1.2e-07   1
ZFIN|ZDB-GENE-040426-694 - symbol:crnkl1 "crooked neck pr...   107  3.0e-06   3
FB|FBgn0033859 - symbol:CG6197 species:7227 "Drosophila m...    84  3.3e-06   4
ZFIN|ZDB-GENE-040426-1997 - symbol:cstf3 "cleavage stimul...   136  5.3e-06   3
UNIPROTKB|E2R479 - symbol:CSTF3 "Uncharacterized protein"...   136  6.8e-06   2
UNIPROTKB|Q12996 - symbol:CSTF3 "Cleavage stimulation fac...   136  6.8e-06   2
MGI|MGI:1351825 - symbol:Cstf3 "cleavage stimulation fact...   136  6.8e-06   2
UNIPROTKB|Q5F4A0 - symbol:CSTF3 "Uncharacterized protein"...   136  6.8e-06   2
UNIPROTKB|F1MZT2 - symbol:CRNKL1 "Uncharacterized protein...   103  2.2e-05   2
UNIPROTKB|J9P5Z1 - symbol:CRNKL1 "Uncharacterized protein...   103  2.4e-05   2
UNIPROTKB|E1BGY7 - symbol:CSTF3 "Uncharacterized protein"...   136  2.6e-05   3
TAIR|locus:2007973 - symbol:CSTF77 species:3702 "Arabidop...   136  3.5e-05   1
WB|WBGene00006307 - symbol:suf-1 species:6239 "Caenorhabd...   139  9.4e-05   2
TAIR|locus:2161363 - symbol:AT5G45990 species:3702 "Arabi...    91  0.00011   2
UNIPROTKB|F1PYE9 - symbol:CRNKL1 "Uncharacterized protein...   103  0.00012   2
DICTYBASE|DDB_G0295719 - symbol:DDB_G0295719 "unknown" sp...   130  0.00012   1
UNIPROTKB|Q5JY65 - symbol:CRNKL1 "Crooked neck-like prote...   103  0.00013   2
MGI|MGI:1914127 - symbol:Crnkl1 "Crn, crooked neck-like 1...   103  0.00014   2
RGD|620507 - symbol:Crnkl1 "crooked neck pre-mRNA splicin...   103  0.00014   2
DICTYBASE|DDB_G0277977 - symbol:xab2 "TPR-like helical do...   102  0.00014   3
UNIPROTKB|Q9BZJ0 - symbol:CRNKL1 "Crooked neck-like prote...   103  0.00014   2
SGD|S000001800 - symbol:SRP40 "Nucleolar serine-rich prot...   126  0.00018   1
UNIPROTKB|E1BWJ0 - symbol:SART3 "Uncharacterized protein"...   101  0.00021   3
FB|FBgn0000377 - symbol:crn "crooked neck" species:7227 "...   128  0.00024   1
TAIR|locus:2152965 - symbol:AT5G41770 species:3702 "Arabi...    94  0.00025   2
UNIPROTKB|F1NF69 - symbol:SART3 "Uncharacterized protein"...   106  0.00030   3
DICTYBASE|DDB_G0278819 - symbol:DDB_G0278819 "HAT repeat-...   125  0.00031   2
DICTYBASE|DDB_G0271670 - symbol:DDB_G0271670 species:4468...   122  0.00042   1
WB|WBGene00019762 - symbol:M03F8.3 species:6239 "Caenorha...    99  0.00056   3
ASPGD|ASPL0000051943 - symbol:AN0461 species:162425 "Emer...   121  0.00069   1
UNIPROTKB|G3MZB1 - symbol:G3MZB1 "Uncharacterized protein...   123  0.00081   1


>TAIR|locus:2170453 [details] [associations]
            symbol:PRP39-2 species:3702 "Arabidopsis thaliana"
            [GO:0005622 "intracellular" evidence=IEA] [GO:0005634 "nucleus"
            evidence=ISM] [GO:0006396 "RNA processing" evidence=IEA]
            InterPro:IPR003107 SMART:SM00386 EMBL:CP002688 GO:GO:0005622
            GO:GO:0006396 KO:K13217 IPI:IPI00548633 RefSeq:NP_199452.2
            UniGene:At.29958 ProteinModelPortal:F4KHG8 SMR:F4KHG8 PRIDE:F4KHG8
            EnsemblPlants:AT5G46400.1 GeneID:834683 KEGG:ath:AT5G46400
            OMA:HIKLFPH ArrayExpress:F4KHG8 Uniprot:F4KHG8
        Length = 1036

 Score = 1609 (571.5 bits), Expect = 5.6e-167, Sum P(2) = 5.6e-167
 Identities = 335/723 (46%), Positives = 456/723 (63%)

Query:     5 ISNLESLSAEP--NSPVGF-GKQGLEEFIAEGSLDFDEWTSLLSEIEN-SCPDDIEMIGL 60
             +S+ E L   P  +S   F     L+E  + G+LDFDEWT L+SEIE  S PDDIE + L
Sbjct:    10 VSDKEPLQRSPELDSSTDFLDNDRLKETFSSGALDFDEWTLLISEIETTSFPDDIEKLCL 69

Query:    61 VYDSFLAEFPLCYGYWRKYADHKARLCSIDKVVEVFERAVQSATYSVDVWFHYCSLSMST 120
             VYD+FL EFPLC+GYWRKYA HK +LC+++  VEVFERAVQ+ATYSV VW  YC+ +++ 
Sbjct:    70 VYDAFLLEFPLCHGYWRKYAYHKIKLCTLEDAVEVFERAVQAATYSVAVWLDYCAFAVAA 129

Query:   121 FEDPNDVRRLFKRALSFVGKDYLCHTMWDKYIEFEISQQRWSSLAQIFVQTLRFPSKKLH 180
             +EDP+DV RLF+R LSF+GKDY C T+WDKYIE+ + QQ+WSSLA ++++TL++PSKKL 
Sbjct:   130 YEDPHDVSRLFERGLSFIGKDYSCCTLWDKYIEYLLGQQQWSSLANVYLRTLKYPSKKLD 189

Query:   181 HYYDSFKKLAGAWKEELECESDSAMEFQSELVLEGEVPAYYKDDETSSVIKDLLDPSVDL 240
              YY +F+K+A + KE+++C  D   +  S+ + E  V   + D+E S V+++L+ PS   
Sbjct:   190 LYYKNFRKIAASLKEKIKCRIDVNGDLSSDPMEEDLVHTRHTDEEISIVVRELMGPSSSS 249

Query:   241 VRSKAIQKYRFIGEQIYKEASQLDEKINCFENLIRRPYFHVKPLDDIQLKNWHDYLSFAE 300
               SKA+  Y  IGEQ Y+++ QL EKI+CFE  IRRPYFHVKPLD  QL NWH YLSF E
Sbjct:   250 AVSKALHTYLSIGEQFYQDSRQLMEKISCFETQIRRPYFHVKPLDTNQLDNWHAYLSFGE 309

Query:   301 KQGDFDWVVKLYERCLIPCADYPEFWMRYVDFMESKGGREIASYALDRATQIFLKRLPVI 360
               GDFDW + LYERCLIPCA+Y EFW RYVDF+ESKGGRE+A++AL RA+Q F+K   VI
Sbjct:   310 TYGDFDWAINLYERCLIPCANYTEFWFRYVDFVESKGGRELANFALARASQTFVKSASVI 369

Query:   361 HLFNARYKEQIGDTSAARAAFPESYIDSDSRFIEKVTFKANMERRLGNFVAACDTYKEAL 420
             HLFNAR+KE +GD SAA  A      +    F+E VT KANME+RLGNF AA  TY+EAL
Sbjct:   370 HLFNARFKEHVGDASAASVALSRCGEELGFGFVENVTKKANMEKRLGNFEAAVTTYREAL 429

Query:   421 -ETAAEQRKFHTLPLLYVQFSRLTYTTTGSADNARDILIDGIKHVPNCKLLLEELIKFTM 479
              +T   +    T   LYVQFSRL Y  T SAD+A  IL++G ++VP+CKLLLEEL++  M
Sbjct:   430 NKTLIGKENLETTARLYVQFSRLKYVITNSADDAAQILLEGNENVPHCKLLLEELMRLLM 489

Query:   480 VHGGRSHISIVDAVISNALYSRPDVLKVFSLEDVEDISSLYLQFLDLCGTIHDIRNAWNQ 539
             +HGG   + ++D +I   L  + D     S ED E+IS+LY++F+DL GTIHD+R A  +
Sbjct:   490 MHGGSRQVDLLDPIIDKELSHQADSSDGLSAEDKEEISNLYMEFIDLSGTIHDVRKALGR 549

Query:   540 HIKLFPHTVRTAYECPGRETKSLRAFIRGKRESNVASLPQPFESEH----LMPSASQDKK 595
             HIKLFPH+ R             R  I+ +RE     L Q   +      ++ S  ++KK
Sbjct:   550 HIKLFPHSARAKLRGSRPSGNLFRELIQ-RREKTRERLNQDLLTNKGISSIVDSPPKEKK 608

Query:   596 FSPPEKSDSESGD----DATSLPSNQKSPLPENHDIR-SDGA-EVDILLSGEADSSSQDR 649
              S  +   ++S D    D  +   NQ   L   H +  +D   E + L   ++D S   +
Sbjct:   609 ESSLDSYGTQSKDAVRADYVNTEPNQGC-LTSGHLVEGNDNVIERETLCESQSDLSMGLK 667

Query:   650 MQQVPPEAAEQHSQDACDPEV-LSLDLAHQVTNENETVQASEAFSEEDDVQREYEHESKK 708
               +    + E        PE       AH  +N  +TV++     +    Q    ++S++
Sbjct:   668 ANEGGKRSHEVSLPIQASPEHGFVTKQAHFSSNSVDTVKSDAIVIQPSGSQSPQSYQSQE 727

Query:   709 DLK 711
              L+
Sbjct:   728 SLR 730

 Score = 37 (18.1 bits), Expect = 5.6e-167, Sum P(2) = 5.6e-167
 Identities = 9/31 (29%), Positives = 16/31 (51%)

Query:   916 QQQTFASASQSEVPAQPV---FYPQAQMSQY 943
             Q QT  +  Q+++P  PV   +  + QM  +
Sbjct:   846 QVQTSFAYPQTQIPQNPVQSNYQQEGQMQSH 876


>ASPGD|ASPL0000046692 [details] [associations]
            symbol:AN1635 species:162425 "Emericella nidulans"
            [GO:0006396 "RNA processing" evidence=IEA] [GO:0005685 "U1 snRNP"
            evidence=IEA] [GO:0003674 "molecular_function" evidence=ND]
            InterPro:IPR003107 InterPro:IPR011990 SMART:SM00386 EMBL:BN001307
            GO:GO:0005622 GO:GO:0006396 eggNOG:COG0457 Gene3D:1.25.40.10
            EMBL:AACD01000026 KO:K13217 HOGENOM:HOG000189748 OMA:ARYFERY
            OrthoDB:EOG4DNJD8 RefSeq:XP_659239.1 ProteinModelPortal:Q5BCU5
            EnsemblFungi:CADANIAT00008273 GeneID:2874721 KEGG:ani:AN1635.2
            Uniprot:Q5BCU5
        Length = 588

 Score = 556 (200.8 bits), Expect = 4.8e-53, P = 4.8e-53
 Identities = 140/443 (31%), Positives = 223/443 (50%)

Query:    26 LEEFIAEGSLDFDEWTSLLSEIE--------NSCPDDIEMIGLVYDSFLAEFPLCYGYWR 77
             LE  + +   +F+ W  L+   E        NS P  I  +  VYD FLA+FPL +GYW+
Sbjct:    19 LEAELLDDPDNFETWERLVRAAEALEGGVNRNSNPQAITTVRNVYDRFLAKFPLLFGYWK 78

Query:    78 KYADHKARLCSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFEDPNDVRRLFKRALSF 137
             KYAD +  +   +    V+ER V S + SVD+W +YC+    T  D + +R LF+R  + 
Sbjct:    79 KYADLEFSITGTEAADMVYERGVASISSSVDLWTNYCTFKAETSHDTDIIRELFERGANC 138

Query:   138 VGKDYLCHTMWDKYIEFEISQQRWSSLAQIFVQTLRFPSKKLHHYYDSFKKLAGAWKEEL 197
             VG D+L H  WDKYIE+E   + +  +  I  + +  P  +   Y++ +++LA   +   
Sbjct:   139 VGLDFLSHPFWDKYIEYEERVEGYDKIFAILARVIEIPMHQYARYFERYRQLAQT-RPVA 197

Query:   198 ECESDSAM-EFQSEL-VLEGEVPAYYKDDETSSVIKDLLDPSVDLVRSKAIQKYRFIGEQ 255
             E    + + +F+++L    G V    K D  + + +DL        R + +  Y     +
Sbjct:   198 ELAPPNVISQFRADLDAAAGIVAPGAKAD--AEIERDL--------RLR-LDGYHL---E 243

Query:   256 IYKEASQLDEKINCFENLIRRPYFHVKPLDDIQLKNWHDYLSFAEKQGDFDWVVKLYERC 315
             I+ +      K   +E+ I+RPYFHV  LD+ QL NW  YL F E +G +  +  LYERC
Sbjct:   244 IFSKTQTETTKRWTYESEIKRPYFHVTELDEGQLANWRKYLDFEEAEGSYARIQFLYERC 303

Query:   316 LIPCADYPEFWMRYVDFMESKGGREI-ASYALDRATQIFLKRL-PVIHLFNARYKEQIGD 373
             L+ CA Y EFW RY  +M ++ G+E        RA+ +++    P   L  A ++E  G 
Sbjct:   304 LVTCAHYDEFWQRYARWMSAQPGKEEDVRNIYQRASYLYVPIANPATRLQYAYFEEMCGR 363

Query:   374 TSAARAAFPESYIDSDSRFIEKVTFKANMERRLGNFVAACDTYKEALETAAEQRKFHTLP 433
              S A+    E+ + +    +E +   ANM RR G   AA + YK  L++   Q +  T  
Sbjct:   364 VSVAKEIH-EAILINIPNHVETIVSLANMCRRHGGLEAAIEVYKSQLDSP--QCEMSTKA 420

Query:   434 LLYVQFSRLTYTTTGSADNARDI 456
              L  +++RL +   GS + AR +
Sbjct:   421 ALVAEWARLLWKIKGSTEEARQV 443


>POMBASE|SPBC4B4.09 [details] [associations]
            symbol:usp105 "U1 snRNP-associated protein Usp105"
            species:4896 "Schizosaccharomyces pombe" [GO:0000243 "commitment
            complex" evidence=ISO] [GO:0000395 "mRNA 5'-splice site
            recognition" evidence=IC;ISO] [GO:0005685 "U1 snRNP" evidence=IDA]
            [GO:0045292 "mRNA cis splicing, via spliceosome" evidence=ISO]
            [GO:0030627 "pre-mRNA 5'-splice site binding" evidence=ISO]
            InterPro:IPR003107 SMART:SM00386 PomBase:SPBC4B4.09 EMBL:CU329671
            GenomeReviews:CU329671_GR eggNOG:COG0457 GO:GO:0000243
            GO:GO:0005685 GO:GO:0000395 KO:K13217 PIR:T40481 RefSeq:NP_596426.1
            ProteinModelPortal:O74970 EnsemblFungi:SPBC4B4.09.1 GeneID:2540869
            KEGG:spo:SPBC4B4.09 HOGENOM:HOG000189748 OMA:ARYFERY
            OrthoDB:EOG4DNJD8 NextBio:20801985 Uniprot:O74970
        Length = 612

 Score = 550 (198.7 bits), Expect = 2.2e-52, P = 2.2e-52
 Identities = 144/502 (28%), Positives = 249/502 (49%)

Query:    36 DFDEWTSLL--SE-IE-----NSCPDDIEMIGLVYDSFLAEFPLCYGYWRKYADHKARLC 87
             DFD W  L+  SE +E     NS    I  +  VYD FL ++PL +GYW+KYAD +  + 
Sbjct:    27 DFDAWEGLVRASEHLEGGVGRNSSKQAINTLRSVYDRFLGKYPLLFGYWKKYADFEFFVA 86

Query:    88 SIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFEDPNDVRRLFKRALSFVGKDYLCHTM 147
               +    ++ER +    +SVD+W +YC+  M T  D N+VR LF +  + VG D+L H  
Sbjct:    87 GAEASEHIYERGIAGIPHSVDLWTNYCAFKMETNGDANEVRELFMQGANMVGLDFLSHPF 146

Query:   148 WDKYIEFEISQQRWSSLAQIFVQTLRFPSKKLHHYYDSFKKLAGAWKEELECESDSAMEF 207
             WDKY+EFE  Q+R  ++ Q+  + +  P  +   Y++ F +++ +   +     D     
Sbjct:   147 WDKYLEFEERQERPDNVFQLLERLIHIPLHQYARYFERFVQVSQSQPIQQLLPPDVLASI 206

Query:   208 QSELVLEGEVPAYYKDDETSSVIKDLLDPSVDLVRSKAIQKYRFIGEQIYKEASQLDEKI 267
             ++++  E   PA      +  +  +  +  +++ R    + Y  I  QI+++      K 
Sbjct:   207 RADVTRE---PAKVVSAGSKQITVERGE--LEIEREMRARIYN-IHLQIFQKVQLETAKR 260

Query:   268 NCFENLIRRPYFHVKPLDDIQLKNWHDYLSFAEKQGDFDWVVKLYERCLIPCADYPEFWM 327
               FE+ I+RPYFHVK LD+ QL NW  YL F E +GDF  +  LYERCLI CA Y EFW 
Sbjct:   261 WTFESEIKRPYFHVKELDEAQLVNWRKYLDFEEVEGDFQRICHLYERCLITCALYDEFWF 320

Query:   328 RYVDFMESKGGR-EIASYALDRATQIFLK-RLPVIHLFNARYKEQIGDTSAARAAFPESY 385
             RY  +M ++       S   +RA+ IF     P I +  A ++E  G+ ++A+A + +S 
Sbjct:   321 RYARWMSAQPDHLNDVSIIYERASCIFASISRPGIRVQYALFEESQGNIASAKAIY-QSI 379

Query:   386 IDSDSRFIEKVTFKANMERRLGNFVAACDTYKEALETAAEQRKFHT--LPLLYVQFSRLT 443
             +      +E V     +ERR        + +   L +   + K +T    +L  +  +L 
Sbjct:   380 LTQLPGNLEAVLGWVGLERRNAPNYDLTNAHA-VLRSIINEGKCNTGITEVLITEDIKLV 438

Query:   444 YTTTGSADNARDILIDGIKHVPNCKLLLEELIKFTMVHGGRS-HISIVDAVISNALYSRP 502
             +   G  + AR++ +     + +C+      ++F +     S + +   A +SN +    
Sbjct:   439 WKIEGDIELARNMFLQNAPALLDCRHFWISFLRFELEQPLNSKNYTEHHARVSNVMEMIR 498

Query:   503 DVLKVFSLEDVEDISSLYLQFL 524
             +  ++     + D++ LY+++L
Sbjct:   499 NKTRL-PPRTIMDLTKLYMEYL 519


>RGD|1308702 [details] [associations]
            symbol:Prpf39 "PRP39 pre-mRNA processing factor 39 homolog (S.
            cerevisiae)" species:10116 "Rattus norvegicus" [GO:0005634
            "nucleus" evidence=IEA;ISO] [GO:0006396 "RNA processing"
            evidence=IEA] [GO:0005730 "nucleolus" evidence=ISO]
            InterPro:IPR003107 InterPro:IPR011990 SMART:SM00386 RGD:1308702
            GO:GO:0005634 GO:GO:0006396 Gene3D:1.25.40.10 CTD:55015
            GeneTree:ENSGT00390000005033 KO:K13217 OrthoDB:EOG49GKG9
            IPI:IPI00948194 RefSeq:XP_003750219.1 RefSeq:XP_003754228.1
            UniGene:Rn.12521 Ensembl:ENSRNOT00000066702 GeneID:314171
            KEGG:rno:314171 UCSC:RGD:1308702 ArrayExpress:D4A5S9 Uniprot:D4A5S9
        Length = 664

 Score = 521 (188.5 bits), Expect = 1.0e-50, Sum P(2) = 1.0e-50
 Identities = 162/584 (27%), Positives = 275/584 (47%)

Query:     7 NLESLSAEPNSPVGFGKQGLEEFIAEGSLDFDEWTSLLSEIENSCPDDIEMIGLVYDSFL 66
             +L     E N P  F K    + +     DF  W  LL  +E    + +      +D F 
Sbjct:    64 DLPVTETEGNFPPEFEK--FWKTVETNPQDFTGWVYLLQYVEQE--NHLMAARKAFDKFF 119

Query:    67 AEFPLCYGYWRKYADHKARLCSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFE--DP 124
               +P CYGYW+KYAD + R  +I +  EV+ R +Q+   SVD+W HY +    T +  DP
Sbjct:   120 IHYPYCYGYWKKYADLEKRHDNIKQSDEVYRRGLQAIPLSVDLWIHYINFLKETLDPGDP 179

Query:   125 ---NDVRRLFKRALSFVGKDYLCHTMWDKYIEFEISQQRWSSLAQIFVQTLRFPSKKLHH 181
                + +R  F+ A+   G D+    +W+ YI +E  Q     +  ++ + L  P++   H
Sbjct:   180 ETNSTIRGTFEHAVLAAGTDFRSDKLWEMYINWENEQGNLREVTAVYDRILGIPTQLYSH 239

Query:   182 YYDSFKK-LAGAWKEELECESDSAMEFQSELVLEGEVPAYYKDD-----ETSSVIKDLLD 235
             ++  FK+ +      +L    +  ++ + EL     V  +  DD     +  S I+D+ D
Sbjct:   240 HFQRFKEHVQNNLPRDL-LTGEQFIQLRRELA---SVNGHNGDDGPPGDDLPSGIEDITD 295

Query:   236 PSVDLVRSKAIQKYRFIGEQIYKEASQLDE-KIN---CFENLIRRPYFHVKPLDDIQLKN 291
             P+  L+      ++R I  +I++E    +E +++    FE  I+RPYFHVKPL+  QLKN
Sbjct:   296 PA-KLITEIENMRHRII--EIHQEMFNYNEHEVSKRWTFEEGIKRPYFHVKPLEKAQLKN 352

Query:   292 WHDYLSFAEKQGDFDWVVKLYERCLIPCADYPEFWMRYVDFMESKGGREIASYALDRATQ 351
             W +YL F  + G  + VV L+ERC+I CA Y EFW++Y  +ME+    E   +   RA  
Sbjct:   353 WKEYLEFEIENGTHERVVVLFERCVISCALYEEFWIKYAKYMENHS-IEGVRHVFSRACT 411

Query:   352 IFLKRLPVIHLFNARYKEQIGDTSAARA---AFPESYIDSDSRFIEKVTFKANMERRLGN 408
             + L + P+ H+  A ++EQ G+ + AR     F E  +      + +V+    +ERR GN
Sbjct:   412 VHLPKKPMAHMLWAAFEEQQGNINEARIILRTFEECVLGLAMVRLRRVS----LERRHGN 467

Query:   409 FVAACDTYKEALETAAEQRKFHTLPLLY-VQFSRLTYTTTGSADNARDILIDGI-KHVPN 466
                A    +  L+ A    K +     Y ++ +R  +    +   +R +L++ I K   N
Sbjct:   468 MEEA----EHLLQDAIRNAKSNNESSFYAIKLARHLFKIQKNLPKSRKVLLEAIEKDKEN 523

Query:   467 CKL---LLEELIKFTMVHGGRSHISIVDAVISNALYSRPDVLKV-FSLEDVEDISSLYLQ 522
              KL   LLE      +     + ++  D  I  +L   P  +++ FS   VE     +L+
Sbjct:   524 TKLYLNLLEMEYSCDLKQNEENILNCFDKAIHGSL---PIKMRITFSQRKVE-----FLE 575

Query:   523 FLDLCGTIHDIRNAWNQHIKLFPH--TVRTAYECPGRETKSLRA 564
               D    ++ + NA+++H  L     T++   E    E +  +A
Sbjct:   576 --DFGSDVNKLLNAYDEHQTLLKEQDTLKRKAENGSEEPEEKKA 617

 Score = 39 (18.8 bits), Expect = 1.0e-50, Sum P(2) = 1.0e-50
 Identities = 8/25 (32%), Positives = 15/25 (60%)

Query:   655 PEAAEQHSQDACDPEVLSLDL-AHQ 678
             PE  + H++D    +++  DL A+Q
Sbjct:   612 PEEKKAHTEDVSSAQIIDGDLQANQ 636


>UNIPROTKB|E1C8G8 [details] [associations]
            symbol:PRPF39 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006396 "RNA processing" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR003107
            InterPro:IPR011990 SMART:SM00386 GO:GO:0005634 GO:GO:0006396
            Gene3D:1.25.40.10 GeneTree:ENSGT00390000005033 OMA:GWVYLLQ
            EMBL:AADN02004089 IPI:IPI00598251 Ensembl:ENSGALT00000020376
            Uniprot:E1C8G8
        Length = 628

 Score = 534 (193.0 bits), Expect = 1.2e-50, P = 1.2e-50
 Identities = 157/558 (28%), Positives = 269/558 (48%)

Query:     5 ISNLESLSAEPNSPVGFGKQGLEEFIAEGSLDFDEWTSLLSEIENSCPDDIEMIGLVYDS 64
             I +L++   E   P+ F K    + + +   DF  W  LL  +E    + +      +D 
Sbjct:    25 IGSLQTTDIEAGFPLDFDK--FWKVVEDNPQDFTGWVYLLQYVEQE--NHLPAARKAFDK 80

Query:    65 FLAEFPLCYGYWRKYADHKARLCSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFEDP 124
             F   +P CYGYW+KYAD + R  +I +  EV+ R +Q+   SVD+W HY +    T  DP
Sbjct:    81 FFTHYPYCYGYWKKYADLERRHDNIKQSDEVYRRGLQAIPLSVDLWIHYINFLKDTL-DP 139

Query:   125 ND------VRRLFKRALSFVGKDYLCHTMWDKYIEFEISQQRWSSLAQIFVQTLRFPSKK 178
             +D      +R  ++ A+   G D+    +W+ YI +E  Q     +  I+ + L  P++ 
Sbjct:   140 DDPEANSTIRGAYEHAVLAAGTDFRSDRLWEMYINWEDEQGNLREVTSIYDRILGIPTQL 199

Query:   179 LHHYYDSFKK-LAGAWKEELECESDSAMEFQSELV-LEGEVPAYYK-DDETSSVIKDLLD 235
               H++  FK  +      +L   S+  ++ + EL  + G         D+  S  +D+ D
Sbjct:   200 YSHHFQRFKDHVQNNLPRDL-LTSEQFIQLRRELASVNGHAGGDASAGDDLPSGTEDITD 258

Query:   236 PSVDLVRSKAIQKYRFIGEQIYKEASQLDE-KIN---CFENLIRRPYFHVKPLDDIQLKN 291
             P+  L+      ++R I  +I++E    +E +++    FE  I+RPYFHVKPL+  QLKN
Sbjct:   259 PA-KLITEIENMRHRII--EIHQEMFNHNEHEVSKRWTFEEGIKRPYFHVKPLEKAQLKN 315

Query:   292 WHDYLSFAEKQGDFDWVVKLYERCLIPCADYPEFWMRYVDFMESKGGREIASYALDRATQ 351
             W +YL F  + G  + VV L+ERC+I CA Y +FW++Y  +ME+    E   +   RA  
Sbjct:   316 WKEYLEFEIENGTHERVVVLFERCVISCALYEDFWIKYAKYMENHS-IEGVRHVYSRACT 374

Query:   352 IFLKRLPVIHLFNARYKEQIGDTSAARA---AFPESYIDSDSRFIEKVTFKANMERRLGN 408
             I L + P++H+  A ++EQ G+   AR     F E  +      + +V+    +ERR GN
Sbjct:   375 IHLPKKPMVHMLWAAFEEQQGNIDEARRILKTFEECILGLAMVRLRRVS----LERRHGN 430

Query:   409 FVAACDTYKEALETA--AEQRKFHTLPLLYVQFSRLTYTTTGSADNARDILIDGIK-HVP 465
                A    +EA+  A    +  F+ + L     +R  +    +   AR +L D I+    
Sbjct:   431 MEEAERLLEEAVRNAKSVSESSFYAIKL-----ARHLFKVQKNLPKARKVLSDAIEIDKE 485

Query:   466 NCKLLLEEL-IKFT--MVHGGRSHISIVDAVISNALYSRPDVLKVFSLEDVEDISSLYLQ 522
             N KL L  L +++   +     + +S  D  ++ +L  +  V   FS   VE     +L+
Sbjct:   486 NTKLYLNLLEMEYCGDLTQNEENILSCFDKAVNGSLSIKMRV--TFSQRKVE-----FLE 538

Query:   523 FLDLCGTIHDIRNAWNQH 540
               D    ++ + +A+++H
Sbjct:   539 --DFGSDVNKLLDAYDEH 554


>UNIPROTKB|F1SI15 [details] [associations]
            symbol:PRPF39 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005634 "nucleus" evidence=IEA] [GO:0006396 "RNA
            processing" evidence=IEA] InterPro:IPR003107 InterPro:IPR011990
            SMART:SM00386 GO:GO:0005634 GO:GO:0006396 Gene3D:1.25.40.10
            GeneTree:ENSGT00390000005033 OMA:GWVYLLQ EMBL:CU570961
            Ensembl:ENSSSCT00000005511 Uniprot:F1SI15
        Length = 667

 Score = 526 (190.2 bits), Expect = 8.8e-50, P = 8.8e-50
 Identities = 155/558 (27%), Positives = 271/558 (48%)

Query:     7 NLESLSAEPNSPVGFGKQGLEEFIAEGSLDFDEWTSLLSEIENSCPDDIEMIGLVYDSFL 66
             +L     E N P  + K    + +     DF  W  LL  +E    + +      +D F 
Sbjct:    65 DLPVTETEANFPPEYEK--FWKTVENNPQDFTGWVYLLQYVEQE--NHLMAARKAFDKFF 120

Query:    67 AEFPLCYGYWRKYADHKARLCSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFE--DP 124
               +P CYGYW+KYAD + R  +I +  EV+ R +Q+   SVD+W HY +    T +  DP
Sbjct:   121 IHYPYCYGYWKKYADLEKRHDNIKQSDEVYRRGLQAIPLSVDLWIHYINFLKETLDPGDP 180

Query:   125 ---NDVRRLFKRALSFVGKDYLCHTMWDKYIEFEISQQRWSSLAQIFVQTLRFPSKKLHH 181
                + ++  F+ A+   G D+    +W+ YI +E  Q     +  I+ + L  P++   H
Sbjct:   181 ETTSTIKGTFEHAVLAAGTDFRSDRLWEMYINWENEQGNLREVTAIYDRILGIPTQLYSH 240

Query:   182 YYDSFKK-LAGAWKEELECESDSAMEFQSELVLEGEVPAYYKDD-----ETSSVIKDLLD 235
             ++  FK+ +      +L    +  ++ + EL     V  +  DD     +  S I+D+ D
Sbjct:   241 HFQRFKEHVQNNLPRDL-LTGEQFIQLRRELA---SVNGHSGDDGPPGDDLPSGIEDITD 296

Query:   236 PSVDLVRSKAIQKYRFIGEQIYKEASQLDE-KIN---CFENLIRRPYFHVKPLDDIQLKN 291
             P+  L+      ++R I  +I++E    +E +++    FE  I+RPYFHVKPL+  QLKN
Sbjct:   297 PAKKLITEIENMRHRII--EIHQEMFNYNEHEVSKRWTFEEGIKRPYFHVKPLEKAQLKN 354

Query:   292 WHDYLSFAEKQGDFDWVVKLYERCLIPCADYPEFWMRYVDFMESKGGREIASYALDRATQ 351
             W +YL F  + G  + VV L+ERC+I CA Y EFW++Y  +ME+    E   +   RA  
Sbjct:   355 WKEYLEFEIENGTHERVVVLFERCVISCALYEEFWIKYAKYMENHS-IEGVRHVFSRACT 413

Query:   352 IFLKRLPVIHLFNARYKEQIGDTSAAR---AAFPESYIDSDSRFIEKVTFKANMERRLGN 408
             I L + P++H+  A ++EQ G+ + AR     F E  +      + +V+    +ERR GN
Sbjct:   414 IHLPKKPMVHMLWAAFEEQQGNINEARNILRTFEECVLGLAMVRLRRVS----LERRHGN 469

Query:   409 FVAACDTYKEALETAAEQRKFHTLPLLY-VQFSRLTYTTTGSADNARDILIDGIKH-VPN 466
                A    +  L+ A +  K +     Y ++ +R  +    +   +R +L++ I+    N
Sbjct:   470 MEEA----ERLLQDAIKNAKANNESSFYAIKLARHLFKIQKNLPKSRKVLLEAIERDKEN 525

Query:   467 CKLLLEEL-IKFT--MVHGGRSHISIVDAVISNALYSRPDVLKV-FSLEDVEDISSLYLQ 522
              KL L  L ++++  +     + ++  D  I  +L   P  +++ FS   VE     +L+
Sbjct:   526 TKLYLNLLEMEYSGDLKQNEENILNCFDKAIHGSL---PIKMRITFSQRKVE-----FLE 577

Query:   523 FLDLCGTIHDIRNAWNQH 540
               D    ++ + NA+++H
Sbjct:   578 --DFGSDVNKLLNAYDEH 593


>UNIPROTKB|F1PV57 [details] [associations]
            symbol:PRPF39 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0005634 "nucleus" evidence=IEA] [GO:0006396
            "RNA processing" evidence=IEA] InterPro:IPR003107
            InterPro:IPR011990 SMART:SM00386 GO:GO:0005634 GO:GO:0006396
            Gene3D:1.25.40.10 CTD:55015 GeneTree:ENSGT00390000005033 KO:K13217
            OMA:GWVYLLQ EMBL:AAEX03005706 RefSeq:XP_851059.2
            Ensembl:ENSCAFT00000022300 GeneID:480305 KEGG:cfa:480305
            Uniprot:F1PV57
        Length = 667

 Score = 524 (189.5 bits), Expect = 1.5e-49, P = 1.5e-49
 Identities = 159/566 (28%), Positives = 275/566 (48%)

Query:     2 EVQISNLESL---SAEPNSPVGFGKQGLEEFIAEGSLDFDEWTSLLSEIENSCPDDIEMI 58
             E +I+N   L     E N P  + K    + +     DF  W  LL  +E    + +   
Sbjct:    57 ENEIANAVDLPVTETEANFPPEYEK--FWKTVENNPQDFTGWVYLLQYVEQE--NHLMAA 112

Query:    59 GLVYDSFLAEFPLCYGYWRKYADHKARLCSIDKVVEVFERAVQSATYSVDVWFHYCSLSM 118
                +D F   +P CYGYW+KYAD + R  +I +  EV+ R +Q+   SVD+W HY +   
Sbjct:   113 RKAFDKFFIHYPYCYGYWKKYADLEKRHDNIKQSDEVYRRGLQAIPLSVDLWIHYINFLK 172

Query:   119 STFE--DP---NDVRRLFKRALSFVGKDYLCHTMWDKYIEFEISQQRWSSLAQIFVQTLR 173
              T +  DP   + +R  F+ A+   G D+    +W+ YI +E  Q     +  I+ + L 
Sbjct:   173 ETLDPGDPETNSTIRGTFEHAVLAAGTDFRSDRLWEMYINWENEQGNLREVTAIYDRILG 232

Query:   174 FPSKKLHHYYDSFKK-LAGAWKEELECESDSAMEFQSELVLEGEVPAYYKDD-----ETS 227
              P++   H++  FK+ +      +L    +  ++ + EL     V  +  DD     +  
Sbjct:   233 IPTQLYSHHFQRFKEHVQNNLPRDL-LTGEQFIQLRRELA---SVNGHSGDDGPPGDDLP 288

Query:   228 SVIKDLLDPSVDLVRSKAIQKYRFIGEQIYKEASQLDE-KIN---CFENLIRRPYFHVKP 283
             S I+D+ DP+  L+      ++R I  +I++E    +E +++    FE  I+RPYFHVKP
Sbjct:   289 SGIEDITDPA-KLITEIENMRHRII--EIHQEMFNYNEHEVSKRWTFEEGIKRPYFHVKP 345

Query:   284 LDDIQLKNWHDYLSFAEKQGDFDWVVKLYERCLIPCADYPEFWMRYVDFMESKGGREIAS 343
             L+  QLKNW +YL F  + G  + VV L+ERC+I CA Y EFW++Y  +ME+    E   
Sbjct:   346 LEKAQLKNWKEYLEFEIENGTHERVVVLFERCVISCALYEEFWIKYAKYMENHS-IEGVR 404

Query:   344 YALDRATQIFLKRLPVIHLFNARYKEQIGDTSAAR---AAFPESYIDSDSRFIEKVTFKA 400
             +   RA  I L + P++H+  A ++EQ G+ + AR     F E  +      + +V+   
Sbjct:   405 HVFSRACTIHLPKKPMVHMLWAAFEEQQGNINEARNILRTFEECVLGLAMVRLRRVS--- 461

Query:   401 NMERRLGNFVAACDTYKEALETAAEQRKFHTLPLLY-VQFSRLTYTTTGSADNARDILID 459
              +ERR GN   A    +  L+ A +  K +     Y ++ +R  +    +   +R +L++
Sbjct:   462 -LERRHGNMEEA----EHLLQDAIKNAKSNNESSFYAIKLARHLFKIQKNLPKSRKVLLE 516

Query:   460 GIKH-VPNCKLLLEEL-IKFT--MVHGGRSHISIVDAVISNALYSRPDVLKV-FSLEDVE 514
              I+    N KL L  L ++++  +     + ++  D  I  +L   P  +++ FS   VE
Sbjct:   517 AIERDKENTKLYLNLLEMEYSGDLKQNEENILNCFDKAIHGSL---PIKMRITFSQRKVE 573

Query:   515 DISSLYLQFLDLCGTIHDIRNAWNQH 540
                  +L+  D    ++ + NA+++H
Sbjct:   574 -----FLE--DFGSDVNKLLNAYDEH 592


>UNIPROTKB|Q86UA1 [details] [associations]
            symbol:PRPF39 "Pre-mRNA-processing factor 39" species:9606
            "Homo sapiens" [GO:0006397 "mRNA processing" evidence=IEA]
            [GO:0008380 "RNA splicing" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IDA] [GO:0005730 "nucleolus" evidence=IDA]
            InterPro:IPR003107 InterPro:IPR011990 SMART:SM00386 GO:GO:0005634
            GO:GO:0008380 GO:GO:0006397 Gene3D:1.25.40.10 eggNOG:COG5107
            EMBL:AL121809 CTD:55015 HOGENOM:HOG000010277 HOVERGEN:HBG082194
            KO:K13217 OMA:GWVYLLQ OrthoDB:EOG49GKG9 EMBL:AK001990 EMBL:BC051886
            EMBL:BC125126 EMBL:BC125127 IPI:IPI00789246 IPI:IPI00878754
            RefSeq:NP_060392.3 UniGene:Hs.274337 ProteinModelPortal:Q86UA1
            SMR:Q86UA1 IntAct:Q86UA1 STRING:Q86UA1 PhosphoSite:Q86UA1
            DMDM:223590245 PaxDb:Q86UA1 PRIDE:Q86UA1 Ensembl:ENST00000355765
            GeneID:55015 KEGG:hsa:55015 UCSC:uc001wvy.4 UCSC:uc001wwa.1
            GeneCards:GC14P045553 H-InvDB:HIX0011621 HGNC:HGNC:20314
            HPA:HPA001176 MIM:614907 neXtProt:NX_Q86UA1 PharmGKB:PA142671127
            InParanoid:Q86UA1 GenomeRNAi:55015 NextBio:58381
            ArrayExpress:Q86UA1 Bgee:Q86UA1 CleanEx:HS_PRPF39
            Genevestigator:Q86UA1 GermOnline:ENSG00000185246 Uniprot:Q86UA1
        Length = 669

 Score = 523 (189.2 bits), Expect = 1.9e-49, P = 1.9e-49
 Identities = 156/551 (28%), Positives = 268/551 (48%)

Query:    14 EPNSPVGFGKQGLEEFIAEGSLDFDEWTSLLSEIENSCPDDIEMIGLVYDSFLAEFPLCY 73
             E N P  + K    + +     DF  W  LL  +E    + +      +D F   +P CY
Sbjct:    74 EANFPPEYEK--FWKTVENNPQDFTGWVYLLQYVEQE--NHLMAARKAFDRFFIHYPYCY 129

Query:    74 GYWRKYADHKARLCSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFE--DP---NDVR 128
             GYW+KYAD + R  +I    EV+ R +Q+   SVD+W HY +    T +  DP   N +R
Sbjct:   130 GYWKKYADLEKRHDNIKPSDEVYRRGLQAIPLSVDLWIHYINFLKETLDPGDPETNNTIR 189

Query:   129 RLFKRALSFVGKDYLCHTMWDKYIEFEISQQRWSSLAQIFVQTLRFPSKKLHHYYDSFKK 188
               F+ A+   G D+    +W+ YI +E  Q     +  I+ + L  P++   H++  FK+
Sbjct:   190 GTFEHAVLAAGTDFRSDRLWEMYINWENEQGNLREVTAIYDRILGIPTQLYSHHFQRFKE 249

Query:   189 -LAGAWKEELECESDSAMEFQSELVLEGEVPAYYKDD-----ETSSVIKDLLDPSVDLVR 242
              +      +L    +  ++ + EL     V  +  DD     +  S I+D+ DP+  L+ 
Sbjct:   250 HVQNNLPRDL-LTGEQFIQLRRELA---SVNGHSGDDGPPGDDLPSGIEDITDPA-KLIT 304

Query:   243 SKAIQKYRFIGEQIYKEASQLDE-KIN---CFENLIRRPYFHVKPLDDIQLKNWHDYLSF 298
                  ++R I  +I++E    +E +++    FE  I+RPYFHVKPL+  QLKNW +YL F
Sbjct:   305 EIENMRHRII--EIHQEMFNYNEHEVSKRWTFEEGIKRPYFHVKPLEKAQLKNWKEYLEF 362

Query:   299 AEKQGDFDWVVKLYERCLIPCADYPEFWMRYVDFMESKGGREIASYALDRATQIFLKRLP 358
               + G  + VV L+ERC+I CA Y EFW++Y  +ME+    E   +   RA  I L + P
Sbjct:   363 EIENGTHERVVVLFERCVISCALYEEFWIKYAKYMENHS-IEGVRHVFSRACTIHLPKKP 421

Query:   359 VIHLFNARYKEQIGDTSAAR---AAFPESYIDSDSRFIEKVTFKANMERRLGNFVAACDT 415
             ++H+  A ++EQ G+ + AR     F E  +      + +V+    +ERR GN   A   
Sbjct:   422 MVHMLWAAFEEQQGNINEARNILKTFEECVLGLAMVRLRRVS----LERRHGNLEEA--- 474

Query:   416 YKEALETAAEQRKFHTLPLLY-VQFSRLTYTTTGSADNARDILIDGIKH-VPNCKLLLEE 473
              +  L+ A +  K +     Y V+ +R  +    +   +R +L++ I+    N KL L  
Sbjct:   475 -EHLLQDAIKNAKSNNESSFYAVKLARHLFKIQKNLPKSRKVLLEAIERDKENTKLYLNL 533

Query:   474 L-IKFT--MVHGGRSHISIVDAVISNALYSRPDVLKV-FSLEDVEDISSLYLQFLDLCGT 529
             L ++++  +     + ++  D  +  +L   P  +++ FS   VE     +L+  D    
Sbjct:   534 LEMEYSGDLKQNEENILNCFDKAVHGSL---PIKMRITFSQRKVE-----FLE--DFGSD 583

Query:   530 IHDIRNAWNQH 540
             ++ + NA+++H
Sbjct:   584 VNKLLNAYDEH 594


>UNIPROTKB|G4MRU5 [details] [associations]
            symbol:MGG_04558 "Pre-mRNA-processing factor 39"
            species:242507 "Magnaporthe oryzae 70-15" [GO:0003674
            "molecular_function" evidence=ND] InterPro:IPR003107 SMART:SM00386
            GO:GO:0005622 GO:GO:0006396 EMBL:CM001231 KO:K13217
            RefSeq:XP_003710923.1 ProteinModelPortal:G4MRU5
            EnsemblFungi:MGG_04558T0 GeneID:2677921 KEGG:mgr:MGG_04558
            Uniprot:G4MRU5
        Length = 586

 Score = 523 (189.2 bits), Expect = 1.9e-49, P = 1.9e-49
 Identities = 136/432 (31%), Positives = 211/432 (48%)

Query:    37 FDEWTSLLSEIE--------NSCPDDIEMIGLVYDSFLAEFPLCYGYWRKYADHKARLCS 88
             F+ W  L+   E        NS P  +  +   YD FL +FPL +GYW+KYAD +  +  
Sbjct:    31 FENWEKLVRACEALDGGLTRNSSPQALATLRDAYDRFLLKFPLLFGYWKKYADLEFTIAG 90

Query:    89 IDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFEDPNDVRRLFKRALSFVGKDYLCHTMW 148
              +    V+ER   S T SVD+W  YCS  M T   P  VR LF+R  + VG D++ H  W
Sbjct:    91 PESAEMVYERGCASITNSVDLWTEYCSFKMETTHVPQLVRDLFERGAACVGLDFMAHPFW 150

Query:   149 DKYIEFEISQQRWSSLAQIFVQTLRFPSKKLHHYYDSFKKLAGAWKEELECESDSAMEFQ 208
             +KY+E+E  Q+   ++ +I  + +  P  +   YY+ F  +      +    ++    F+
Sbjct:   151 NKYLEYEERQEAHENIFKILQRVIHIPMYQYARYYERFSTMVHTRALDDVVSAELQARFK 210

Query:   209 SELVLEGEVPAYYKDDETSSVIKDLLDPSVDL-VRSKAIQKYRFIGEQIYKEASQLDEKI 267
             +E+  E E  AY        V K   +P  +  +R K    Y   GE   K  +++ ++ 
Sbjct:   211 TEI--EAEAAAY-------GVTKT--EPEFEQEMRRKVDAHY---GEIFTKTQTEVTKRW 256

Query:   268 NCFENLIRRPYFHVKPLDDIQLKNWHDYLSFAEKQGDFDWVVKLYERCLIPCADYPEFWM 327
               +E  I+RPYFHV  L+  +L NW  YL F E +G F     LYERCL+ CA Y EFW 
Sbjct:   257 -LYEAEIKRPYFHVTELEKKELSNWRKYLDFEEAEGSFVRTAFLYERCLVTCAFYDEFWF 315

Query:   328 RYVDFMESKGGR--EIASYALDRATQIFLK-RLPVIHLFNARYKEQIGDTSAARAAFPES 384
             RY  +M ++  +  E+ +  L RA  IF+    P I L  A ++E  G  + AR      
Sbjct:   316 RYARWMSAQPDKTEEVRNIYL-RAATIFVPISRPGIRLQFAYFEESCGRVAMAREVHNAI 374

Query:   385 YIDSDSRFIEKVTFKANMERRLGNFVAACDTYKEALETAAEQRKFHTLPLLYVQFSRLTY 444
              +      IE +   AN+ERR  +   A +  K+ +E+   +    T  +L  +++ L +
Sbjct:   375 LLRLPG-CIEVIISLANLERRHNDIDTAIEVLKQQIESP--EVDIWTKAVLVTEWASLLW 431

Query:   445 TTTGSADNARDI 456
             T  G+A+ AR +
Sbjct:   432 TVKGTAEEARAV 443


>MGI|MGI:104602 [details] [associations]
            symbol:Prpf39 "PRP39 pre-mRNA processing factor 39 homolog
            (yeast)" species:10090 "Mus musculus" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005622 "intracellular"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0006396 "RNA
            processing" evidence=IEA] [GO:0006397 "mRNA processing"
            evidence=IEA] [GO:0008150 "biological_process" evidence=ND]
            [GO:0008380 "RNA splicing" evidence=IEA] InterPro:IPR003107
            InterPro:IPR011990 SMART:SM00386 MGI:MGI:104602 GO:GO:0005634
            GO:GO:0008380 GO:GO:0006397 Gene3D:1.25.40.10 eggNOG:COG5107
            HOGENOM:HOG000010277 HOVERGEN:HBG082194 OrthoDB:EOG49GKG9
            EMBL:AK017379 EMBL:AK154170 EMBL:AK168462 EMBL:BC029153
            IPI:IPI00170040 IPI:IPI00761305 UniGene:Mm.283339
            ProteinModelPortal:Q8K2Z2 SMR:Q8K2Z2 STRING:Q8K2Z2
            PhosphoSite:Q8K2Z2 PaxDb:Q8K2Z2 PRIDE:Q8K2Z2 UCSC:uc007nqw.1
            UCSC:uc007nra.1 InParanoid:Q8K2Z2 ChiTaRS:PRPF39
            Genevestigator:Q8K2Z2 GermOnline:ENSMUSG00000035597 Uniprot:Q8K2Z2
        Length = 665

 Score = 496 (179.7 bits), Expect = 7.4e-48, Sum P(2) = 7.4e-48
 Identities = 163/586 (27%), Positives = 272/586 (46%)

Query:     7 NLESLSAEPNSPVGFGKQGLEEFIAEGSLDFDEWTSLLSEIENSCPDDIEMIGLVYDSFL 66
             NL    AE + P  F K    + +     DF  W  LL  +E    + +      +D F 
Sbjct:    65 NLPVTEAEGDFPPEFEK--FWKTVEMNPQDFTGWVYLLQYVEQE--NHLMAARKAFDKFF 120

Query:    67 AEFPLCYGYWRKYADHKARLCSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFEDPND 126
               +P CYGYW+KYAD + R  +I +  EV+ R +Q+   SVD+W HY +    T E P D
Sbjct:   121 VHYPYCYGYWKKYADLEKRHDNIKQSDEVYRRGLQAIPLSVDLWIHYINFLKETLE-PGD 179

Query:   127 ------VRRLFKRALSFVGKDYLCHTMWDKYIEFEISQQRWSSLAQIFVQTLRFPSKKLH 180
                   +R  F+ A+   G D+    +W+ YI +E  Q     +  ++ + L  P++   
Sbjct:   180 QETNTTIRGTFEHAVLAAGTDFRSDKLWEMYINWENEQGNLREVTAVYDRILGIPTQLYS 239

Query:   181 HYYDSFKK-LAGAWKEELECESDSAMEFQSELVLEGEVPAYYKDD-----ETSSVIKDLL 234
             H++  FK+ +      +L    +  ++ + EL     V  +  DD     +  S I+D+ 
Sbjct:   240 HHFQRFKEHVQNNLPRDL-LTGEQFIQLRRELA---SVNGHSGDDGPPGDDLPSGIEDI- 294

Query:   235 DPSVDLVRSKAIQKYRFIGEQIYKEASQLDE----KINCFENLIRRPYFHVKPLDDIQ-L 289
              P+  L+      ++R I  +I++E    +E    K   FE  I+RPYFHVKPL+  Q  
Sbjct:   295 SPA-KLITEIENMRHRII--EIHQEMFNYNEHEVSKRWTFEEGIKRPYFHVKPLEKAQPK 351

Query:   290 KNWHDYLSFAEKQGDFDWVVKLYERCLIPCADYPEFWMRYVDFMESKGGREIASYALDRA 349
             KNW +YL F  + G  + VV L+ERC+I CA Y EFW++Y  +ME+    E   +   RA
Sbjct:   352 KNWKEYLEFEIENGTHERVVVLFERCVISCALYEEFWIKYAKYMENHS-IEGVRHVFSRA 410

Query:   350 TQIFLKRLPVIHLFNARYKEQIGDTSAARA---AFPESYIDSDSRFIEKVTFKANMERRL 406
               + L + P+ H+  A ++EQ G+ + AR     F E  +      + +V+    +ERR 
Sbjct:   411 CTVHLPKKPMAHMLWAAFEEQQGNINEARIILRTFEECVLGLAMVRLRRVS----LERRH 466

Query:   407 GNFVAACDTYKEALETAAEQRKFHTLPLLY-VQFSRLTYTTTGSADNARDILIDGI-KHV 464
             GN   A    +  L+ A +  K +     Y ++ +R  +    +   +R +L++ I K  
Sbjct:   467 GNMEEA----EHLLQDAIKNAKSNNESSFYAIKLARHLFKIQKNLPKSRKVLLEAIEKDK 522

Query:   465 PNCKL---LLEELIKFTMVHGGRSHISIVDAVISNALYSRPDVLKV-FSLEDVEDISSLY 520
              N KL   LLE      +     + ++  D  I  +L   P  +++ FS   VE     +
Sbjct:   523 ENTKLYLNLLEMEYSCDLKQNEENILNCFDKAIHGSL---PIKMRITFSQRKVE-----F 574

Query:   521 LQFLDLCGTIHDIRNAWNQHIKLFPH--TVRTAYECPGRETKSLRA 564
             L+  D    ++ + NA+++H  L     T++   E    E +  +A
Sbjct:   575 LE--DFGSDVNKLLNAYDEHQTLLKEQDTLKRKAENGSEEPEEKKA 618

 Score = 38 (18.4 bits), Expect = 7.4e-48, Sum P(2) = 7.4e-48
 Identities = 8/25 (32%), Positives = 15/25 (60%)

Query:   655 PEAAEQHSQDACDPEVLSLDL-AHQ 678
             PE  + H++D    +++  DL A+Q
Sbjct:   613 PEEKKAHTEDLSSAQIIDGDLQANQ 637


>FB|FBgn0039600 [details] [associations]
            symbol:CG1646 species:7227 "Drosophila melanogaster"
            [GO:0005685 "U1 snRNP" evidence=ISS] [GO:0000398 "mRNA splicing,
            via spliceosome" evidence=ISS] [GO:0005634 "nucleus" evidence=IC]
            [GO:0000381 "regulation of alternative mRNA splicing, via
            spliceosome" evidence=IMP] InterPro:IPR003107 InterPro:IPR011990
            SMART:SM00386 InterPro:IPR001623 EMBL:AE014297 Gene3D:1.25.40.10
            SMART:SM00271 GO:GO:0000398 GO:GO:0000381 eggNOG:COG5107
            GO:GO:0005685 GeneTree:ENSGT00390000005033 KO:K13217 EMBL:AY051737
            RefSeq:NP_001097957.1 RefSeq:NP_001097958.1 RefSeq:NP_651634.1
            RefSeq:NP_733256.2 RefSeq:NP_788753.1 RefSeq:NP_788754.2
            UniGene:Dm.31288 ProteinModelPortal:Q7KRW8 SMR:Q7KRW8 IntAct:Q7KRW8
            MINT:MINT-820225 STRING:Q7KRW8 PaxDb:Q7KRW8 PRIDE:Q7KRW8
            EnsemblMetazoa:FBtr0085322 GeneID:43399 KEGG:dme:Dmel_CG1646
            UCSC:CG1646-RB FlyBase:FBgn0039600 InParanoid:Q7KRW8 OMA:IRWENES
            OrthoDB:EOG4ZKH2F PhylomeDB:Q7KRW8 GenomeRNAi:43399 NextBio:833726
            Bgee:Q7KRW8 Uniprot:Q7KRW8
        Length = 1066

 Score = 287 (106.1 bits), Expect = 1.1e-45, Sum P(3) = 1.1e-45
 Identities = 57/161 (35%), Positives = 90/161 (55%)

Query:    30 IAEGSLDFDEWTSLLSEIENSCPDDIEMIGLVYDSFLAEFPLCYGYWRKYADHKARLCSI 89
             + E S DF  WT LL  ++N    D E     YD+FL+ +P CYGYWRKYAD++ R    
Sbjct:   372 VKEDSTDFTGWTYLLQYVDNE--SDAEAAREAYDTFLSHYPYCYGYWRKYADYEKRKGIK 429

Query:    90 DKVVEVFERAVQSATYSVDVWFHYCSLSMSTF-EDPNDVRRLFKRALSFVGKDYLCHTMW 148
                 +VFER +++   SVD+W HY     S   +D   VR  ++RA+   G ++    +W
Sbjct:   430 ANCYKVFERGLEAIPLSVDLWIHYLMHVKSNHGDDETFVRSQYERAVKACGLEFRSDKLW 489

Query:   149 DKYIEFEISQQRWSSLAQIFVQTLRFPSKKLHHYYDSFKKL 189
             D YI +E   +R+  + QI+ + L  P++  + ++D+F+ L
Sbjct:   490 DAYIRWENESKRYHRVVQIYDRLLAIPTQGYNGHFDNFQDL 530

 Score = 270 (100.1 bits), Expect = 1.1e-45, Sum P(3) = 1.1e-45
 Identities = 83/249 (33%), Positives = 127/249 (51%)

Query:   200 ESDSAMEF--QSELVLEGEVPAYYKDDETSSVIKDLLDPSVDLVRSKAIQKYRFIGEQIY 257
             ESDS  +   +SE       PA   D    S +  L D  V  +R +AI   R    +++
Sbjct:   621 ESDSTTDLTTESESSHAASKPAMQID---FSDLSTLNDEEVVSIRDRAISARR----KVH 673

Query:   258 K-EASQLDEKINCFENLIRRPYFHVKPLDDIQLKNWHDYLSFAEKQGDFDWVVKLYERCL 316
             K   S +  + + FE  I+RPYFHVKPL+  QLKNW DYL F  ++GD + V+ L+ERCL
Sbjct:   674 KLTVSAVTARWS-FEEGIKRPYFHVKPLERAQLKNWKDYLDFEIEKGDRERVLVLFERCL 732

Query:   317 IPCADYPEFWM---RYVDFMESKGGR-EIASYALDRATQIFLKRLPVIHLFNARYKE-QI 371
             I CA Y EFW+   RY++ +E + G  ++      RA +I     P +HL  A ++E Q+
Sbjct:   733 IACALYDEFWLKMLRYLESLEDQSGVVDLVRDVYRRACRIHHPDKPSLHLMWAAFEECQM 792

Query:   372 GDTSAARAAFPESYIDSDSRFIEKVTFKA-NMERRLGNFVAACDTYKEALETAAEQRKFH 430
                 AA        ID     + +++++  N+ERR G      + YK  +E+   +    
Sbjct:   793 NFDDAAEIL---QRIDQRCPNLLQLSYRRINVERRRGALDKCRELYKHYIESTKNKGIAG 849

Query:   431 TLPLLYVQF 439
             +L + Y +F
Sbjct:   850 SLAIKYARF 858

 Score = 65 (27.9 bits), Expect = 1.1e-45, Sum P(3) = 1.1e-45
 Identities = 37/164 (22%), Positives = 71/164 (43%)

Query:   669 EVLSLDLAHQVTNENETVQASEAFSEEDDVQREYEHE-SKKDLKPLSLEGLSLDPGGNDS 727
             +++ L L     +E E V+  + F    D++ + +   +++ ++ L   G S   G  D+
Sbjct:   889 QMIDLCLQRPKVDEQEVVEIMDKFMARADIEPDQKVLFAQRKVEFLEDFG-STARGLQDA 947

Query:   728 PGSLCATSHECEAPQKTNFSHESMLKSEAPRETSLSDGSVLGASQNNNGSHFAPSSMGTQ 787
               +L     + +  QK +    S   S + +E  +  GS   A+ NN GS  A +     
Sbjct:   948 QRALQQALTKAKEAQKKSDGSPSRKNSSSSKEGPVPTGSA-AAAYNNGGSAAAVAGYNYG 1006

Query:   788 ASSSAPIQTRTVS--PSSS---ASHQNFIPE-AHSHPQTPANSG 825
             A++    Q  T +  PS +   AS+ ++  +  +      ANSG
Sbjct:  1007 AANPYYGQQNTAAAYPSQTPQQASYDSYYNQWGYGSGGASANSG 1050

 Score = 46 (21.3 bits), Expect = 1.1e-19, Sum P(3) = 1.1e-19
 Identities = 35/147 (23%), Positives = 54/147 (36%)

Query:   688 ASEAFSEEDDVQREYEHESKKDLKPLSLEGLSLDPGG-NDSPGSLCATSHECEAPQKTNF 746
             +S +  ++D  +RE E +  KD K    E      GG   SP     T  + E+   T+ 
Sbjct:   571 SSSSSKDKDSKEREREKDKDKD-KDKDKEKRETVGGGVGKSPKDTSETQVD-ESDSTTDL 628

Query:   747 SHESMLKSEAPRETSLSDGSVLGASQNNNGSHFAPSSMGTQASSSAPIQTRTVSPSSSA- 805
             + ES     A +     D S L    +        S      S+   +   TVS  ++  
Sbjct:   629 TTESESSHAASKPAMQIDFSDLSTLNDEE----VVSIRDRAISARRKVHKLTVSAVTARW 684

Query:   806 SHQNFI--PEAHSHPQTPANSGRNWHE 830
             S +  I  P  H  P   A   +NW +
Sbjct:   685 SFEEGIKRPYFHVKPLERAQL-KNWKD 710

 Score = 38 (18.4 bits), Expect = 6.9e-43, Sum P(3) = 6.9e-43
 Identities = 8/17 (47%), Positives = 11/17 (64%)

Query:   906 TAQAWPMQNMQQQTFAS 922
             TA A+P Q  QQ ++ S
Sbjct:  1017 TAAAYPSQTPQQASYDS 1033

 Score = 37 (18.1 bits), Expect = 1.1e-19, Sum P(3) = 1.1e-19
 Identities = 8/25 (32%), Positives = 14/25 (56%)

Query:   195 EELECESDSAMEFQSELVLEGEVPA 219
             EEL     + +   +E++ E E+PA
Sbjct:   234 EELPAPQRAELPEDAEVISEDELPA 258


>DICTYBASE|DDB_G0283307 [details] [associations]
            symbol:prpf39 "pre-mRNA processing factor 39"
            species:44689 "Dictyostelium discoideum" [GO:0006397 "mRNA
            processing" evidence=IEA] [GO:0006396 "RNA processing"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0005622
            "intracellular" evidence=IEA] [GO:0003674 "molecular_function"
            evidence=ND] InterPro:IPR003107 InterPro:IPR008847
            InterPro:IPR011990 Pfam:PF05843 SMART:SM00386
            dictyBase:DDB_G0283307 GO:GO:0005634 GenomeReviews:CM000153_GR
            GO:GO:0006397 Gene3D:1.25.40.10 EMBL:AAFI02000052 KO:K13217
            eggNOG:NOG298273 RefSeq:XP_639156.1 ProteinModelPortal:Q54R91
            EnsemblProtists:DDB0233547 GeneID:8624026 KEGG:ddi:DDB_G0283307
            InParanoid:Q54R91 OMA:ADHEYAH Uniprot:Q54R91
        Length = 699

 Score = 330 (121.2 bits), Expect = 2.1e-44, Sum P(3) = 2.1e-44
 Identities = 65/177 (36%), Positives = 104/177 (58%)

Query:    12 SAEPNSPVGFGKQG-LEEFIAEGSLDFDEWTSLLSEIENSCPDDIEMIGLVYDSFLAEFP 70
             S    SP    ++  L + +    L F++WT L+  IE +  +DIE I  VY  FL EFP
Sbjct:    12 STPTTSPQNLSEEDKLWKIVQTNPLAFNQWTFLIGVIEKT--NDIEKIRKVYSEFLNEFP 69

Query:    71 LCYGYWRKYADHKARLCSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFEDPNDVRRL 130
             LC+ YW+++ADH+    +  + +E+FE+AV S  +SVD+W +YC+  +      +++R +
Sbjct:    70 LCFLYWKRFADHEYAHNNTTQSIEIFEKAVSSIPHSVDIWLNYCTHLIDKSYPVDEIRSV 129

Query:   131 FKRALSFVGKDYLCHTMWDKYIEFEISQQRWSSLAQIFVQTLRFPSKKLHHYYDSFK 187
             FKR ++ +G DY     W+KYIEFE+ Q+  + LA IF   L+ P + L  + + FK
Sbjct:   130 FKRGINIIGTDYQSGKFWEKYIEFEMGQEN-NELASIFNSILKTPLENLQIFNEKFK 185

 Score = 229 (85.7 bits), Expect = 2.1e-44, Sum P(3) = 2.1e-44
 Identities = 57/195 (29%), Positives = 104/195 (53%)

Query:   254 EQIYKEASQLDEKINCFENLI-RRPYFHVKPLDDIQLKNWHDYLSFAEKQGDF--DWVVK 310
             E+ Y E  +   K + FE+++ +R +FH++P+D++ L  W  Y ++ E       + V+K
Sbjct:   221 EKWYHETLEKISKRSNFESIVNKRFFFHIQPIDEMTLSVWRSYFNYMESDPSVTQEEVIK 280

Query:   311 LYERCLIPCADYPEFWMRYVDFM-ESKGG---REIASYALDRATQIFLKRLPVIHLFNAR 366
             L+ERCL+PC  Y EFW++Y+ F+ ES  G    E+     +RAT+IFLK+   IHL  + 
Sbjct:   281 LFERCLVPCCYYSEFWLKYIKFLQESYVGDNKNELIESIFERATKIFLKKRADIHLEYSL 340

Query:   367 YKEQ-IGDTSAARAAFPESYIDSDSRFIEKVTFKANMERRLGNFVAACDTYKEALETA-A 424
             + E  +G+   A +   E+        +E +    + +RR  +   A   +K+ L +  +
Sbjct:   341 FVESTLGNIEKAFSIL-ENIHSLLPTHLEVILRLVSFKRRNHSIQQANQFFKKVLTSLQS 399

Query:   425 EQRKFHTLPLLYVQF 439
             + + +  L + Y+ F
Sbjct:   400 DSKTYPFLSINYISF 414

 Score = 61 (26.5 bits), Expect = 5.9e-27, Sum P(3) = 5.9e-27
 Identities = 37/154 (24%), Positives = 65/154 (42%)

Query:   340 EIASYALDRATQIFLKRLPVIHLFNARYKEQ-IGDTSAARAAFPESYIDSDSRFIEKVTF 398
             E+     +RAT+IFLK+   IHL  + + E  +G+   A +   E+        +E +  
Sbjct:   314 ELIESIFERATKIFLKKRADIHLEYSLFVESTLGNIEKAFSIL-ENIHSLLPTHLEVILR 372

Query:   399 KANMERRLGNFVAACDTYKEALETA-AEQRKFHTLPLLYVQF-----SRLTYTTTGS--A 450
               + +RR  +   A   +K+ L +  ++ + +  L + Y+ F       L     G   A
Sbjct:   373 LVSFKRRNHSIQQANQFFKKVLTSLQSDSKTYPFLSINYISFLLSNKQSLQGEKEGEEEA 432

Query:   451 DN-------ARDILIDGIKHVPNCKLLLEELIKF 477
             D        +R++L   I   P+ KLL    I F
Sbjct:   433 DKIVDVFETSREVLKKSISLYPDSKLLWLYFINF 466

 Score = 47 (21.6 bits), Expect = 1.6e-25, Sum P(3) = 1.6e-25
 Identities = 13/29 (44%), Positives = 17/29 (58%)

Query:   285 DDIQLKNWHDYLSFAEKQGDFDWVVKLYE 313
             DD +L  W+DYL F   Q D D  +K Y+
Sbjct:   556 DDEKLNIWNDYLEF-NLQYDND--IKGYK 581

 Score = 37 (18.1 bits), Expect = 2.1e-44, Sum P(3) = 2.1e-44
 Identities = 5/29 (17%), Positives = 15/29 (51%)

Query:   680 TNENETVQASEAFSEEDDVQREYEHESKK 708
             +N N  +Q +    ++   Q ++ H+ ++
Sbjct:   644 SNNNGVIQNNSYIYQQQQAQPQHHHQQQQ 672


>ZFIN|ZDB-GENE-030616-420 [details] [associations]
            symbol:prpf39 "PRP39 pre-mRNA processing factor 39
            homolog (yeast)" species:7955 "Danio rerio" [GO:0006396 "RNA
            processing" evidence=IEA] [GO:0005622 "intracellular" evidence=IEA]
            [GO:0008380 "RNA splicing" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0006397 "mRNA processing" evidence=IEA]
            InterPro:IPR003107 InterPro:IPR011990 SMART:SM00386
            ZFIN:ZDB-GENE-030616-420 GO:GO:0005634 GO:GO:0008380 GO:GO:0006397
            Gene3D:1.25.40.10 eggNOG:COG5107 EMBL:AL591492 EMBL:BC116540
            IPI:IPI00481955 RefSeq:NP_001004520.1 UniGene:Dr.340
            ProteinModelPortal:Q1JPZ7 STRING:Q1JPZ7 PRIDE:Q1JPZ7
            Ensembl:ENSDART00000061672 GeneID:368864 KEGG:dre:368864 CTD:55015
            GeneTree:ENSGT00390000005033 HOGENOM:HOG000010277
            HOVERGEN:HBG082194 InParanoid:Q1JPZ7 KO:K13217 OMA:GWVYLLQ
            OrthoDB:EOG49GKG9 NextBio:20813235 ArrayExpress:Q1JPZ7 Bgee:Q1JPZ7
            Uniprot:Q1JPZ7
        Length = 752

 Score = 484 (175.4 bits), Expect = 3.0e-43, P = 3.0e-43
 Identities = 142/550 (25%), Positives = 260/550 (47%)

Query:    14 EPNSPVGFGKQGLEEFIAEGSLDFDEWTSLLSEIENSCPDDIEMIGL--VYDSFLAEFPL 71
             EP  P  + +  L + + +   DF+ W  LL  +E     +  ++G    +D+F   +P 
Sbjct:   145 EPELPTEYER--LSKVVEDNPEDFNGWVYLLQYVEQ----ENHLLGSRKAFDAFFLHYPY 198

Query:    72 CYGYWRKYADHKARLCSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFEDPND----- 126
             CYGYW+KYAD + +   I    EV+ R +Q+   SVD+W HY +      +D +D     
Sbjct:   199 CYGYWKKYADIERKHGYIQMADEVYRRGLQAIPLSVDLWLHYITFLREN-QDTSDGEAES 257

Query:   127 -VRRLFKRALSFVGKDYLCHTMWDKYIEFEISQQRWSSLAQIFVQTLRFPSKKLHHYYDS 185
              +R  ++ A+   G D+    +W+ YI +E  Q + +++  I+ + L  P++    ++  
Sbjct:   258 RIRASYEHAVLACGTDFRSDRLWEAYIAWETEQGKLANVTAIYDRLLCIPTQLYSQHFQK 317

Query:   186 FKKLAGAWKEELECESDSAMEFQSELVLEGEVPAYYKDDETSSVIKDLLDPSVDLVR-SK 244
             FK    +   +     +  +  + EL      P+  +D ET +  ++L   + DL   +K
Sbjct:   318 FKDHVQSNNPKHFLSEEEFVSLRVELA-NANKPSGDEDAETEAPGEELPPGTEDLPDPAK 376

Query:   245 AIQKYRFIGEQIYKEASQL----DEKIN---CFENLIRRPYFHVKPLDDIQLKNWHDYLS 297
              + +   +  ++ +   ++    + +++    FE  I+RPYFHVK L+  QL NW +YL 
Sbjct:   377 RVTEIENMRHKVIETRQEMFNHNEHEVSKRWAFEEGIKRPYFHVKALEKTQLNNWREYLD 436

Query:   298 FAEKQGDFDWVVKLYERCLIPCADYPEFWMRYVDFMESKGGREIASYALDRATQIFLKRL 357
             F  + G  + VV L+ERCLI CA Y EFW++Y  ++ES    E   +   +A  + L + 
Sbjct:   437 FELENGTPERVVVLFERCLIACALYEEFWIKYAKYLESYS-TEAVRHIYKKACTVHLPKK 495

Query:   358 PVIHLFNARYKEQIGDTSAARAAFPESYIDSDSRFIEKVTFKANMERRLGNFVAACDTYK 417
             P +HL  A ++EQ G    AR+      +      + ++  + ++ERR GN   A    +
Sbjct:   496 PNVHLLWAAFEEQQGSIDEARSILKAVEVSVPGLAMVRLR-RVSLERRHGNMEEAEALLQ 554

Query:   418 EALETA--AEQRKFHTLPLLYVQFSRLTYTTTGSADNARDILIDGI-KHVPNCKLLLEEL 474
             +A+     + +  F++     V+ +R       S   A+ +L++ + K   N KL L  L
Sbjct:   555 DAITNGRNSSESSFYS-----VKLARQLVKVQKSIGRAKKVLLEAVEKDETNPKLYLN-L 608

Query:   475 IKFTMVHGGRSHISIVDAVISNALYSRPDVLKVFSLEDVEDISSLYLQFLDLCGT-IHDI 533
             ++       + + + + A    AL S        +LE     S   + FL+  G+ I+ +
Sbjct:   609 LELEYSGDVQQNEAEIIACFDRALSSS------MALESRITFSQRKVDFLEDFGSDINTL 662

Query:   534 RNAWNQHIKL 543
               A+ QH +L
Sbjct:   663 MAAYEQHQRL 672


>UNIPROTKB|A8E4M9 [details] [associations]
            symbol:PRPF39 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0005634 "nucleus" evidence=IEA] [GO:0006396 "RNA
            processing" evidence=IEA] InterPro:IPR003107 InterPro:IPR011990
            SMART:SM00386 GO:GO:0005634 GO:GO:0006396 Gene3D:1.25.40.10
            CTD:55015 GeneTree:ENSGT00390000005033 HOGENOM:HOG000010277
            HOVERGEN:HBG082194 KO:K13217 OMA:GWVYLLQ OrthoDB:EOG49GKG9
            eggNOG:NOG298273 EMBL:DAAA02052958 EMBL:BC149776 IPI:IPI00710325
            RefSeq:NP_001103259.1 UniGene:Bt.27128 Ensembl:ENSBTAT00000003373
            GeneID:505547 KEGG:bta:505547 InParanoid:A8E4M9 NextBio:20867193
            Uniprot:A8E4M9
        Length = 548

 Score = 416 (151.5 bits), Expect = 6.7e-38, P = 6.7e-38
 Identities = 132/470 (28%), Positives = 231/470 (49%)

Query:    95 VFERAVQSATYSVDVWFHYCSLSMSTFE--DP---NDVRRLFKRALSFVGKDYLCHTMWD 149
             V+ R +Q+   SVD+W HY +    T +  DP   + VR  F+ A+   G D+    +W+
Sbjct:    30 VYRRGLQAIPLSVDLWIHYINFLKETLDPGDPETNSTVRGTFEHAVLAAGTDFRSDRLWE 89

Query:   150 KYIEFEISQQRWSSLAQIFVQTLRFPSKKLHHYYDSFKK-LAGAWKEELECESDSAMEFQ 208
              YI +E  Q     +  I+ + L  P++   H++  FK  +      +L    +  ++ +
Sbjct:    90 MYINWENEQGNLREVTAIYDRILGIPTQLYSHHFQRFKDHVQNNLPRDL-LTGEQFIQLR 148

Query:   209 SELVLEGEVPAYYKDD-----ETSSVIKDLLDPSVDLVRSKAIQKYRFIGEQIYKEASQL 263
              EL     V  +  DD     +  S I+D+ DP+  L+      ++R I  +I++E    
Sbjct:   149 RELA---SVNGHSGDDGPPGDDLPSGIEDITDPA-KLITEIENMRHRII--EIHQEMFNY 202

Query:   264 DE-KIN---CFENLIRRPYFHVKPLDDIQLKNWHDYLSFAEKQGDFDWVVKLYERCLIPC 319
             +E +++    FE  I+RPYFHVKPL+  QLKNW +YL F  + G  + VV L+ERC+I C
Sbjct:   203 NEHEVSKRWTFEEGIKRPYFHVKPLEKAQLKNWKEYLEFEIENGTHERVVVLFERCVISC 262

Query:   320 ADYPEFWMRYVDFMESKGGREIASYALDRATQIFLKRLPVIHLFNARYKEQIGDTSAAR- 378
             A Y EFW++Y  +ME+    E   +   RA  I L + P++H+  A ++EQ G+ + AR 
Sbjct:   263 ALYEEFWIKYAKYMENHS-IEGVRHVFSRACTIHLPKKPMVHMLWAAFEEQQGNINEARN 321

Query:   379 --AAFPESYIDSDSRFIEKVTFKANMERRLGNFVAACDTYKEALETAAEQRKFHTLPLLY 436
                 F E  +      + +V+    +ERR GN   A    +EA++ A    +        
Sbjct:   322 ILRTFEECVLGLAMVRLRRVS----LERRHGNMEEAERLLQEAIKNAKSNNESS---FYA 374

Query:   437 VQFSRLTYTTTGSADNARDILIDGIKH-VPNCKLLLEELIKFTMVHGG---RSHISIVDA 492
             ++ +R  +    +   +R +L++ I+    N KL L  L    M + G   ++  +I++ 
Sbjct:   375 IKLARHLFKIQKNLPKSRKVLLEAIERDKENTKLYLNLL---EMEYSGDLKQNEDNILNC 431

Query:   493 VISNALY-SRPDVLKV-FSLEDVEDISSLYLQFLDLCGTIHDIRNAWNQH 540
                 A++ S P  +++ FS   VE     +L+  D    ++ + NA+++H
Sbjct:   432 -FDKAIHGSLPIKMRITFSQRKVE-----FLE--DFGSDVNKLLNAYDEH 473


>UNIPROTKB|D4A0B1 [details] [associations]
            symbol:D4A0B1 "Uncharacterized protein" species:10116
            "Rattus norvegicus" [GO:0005622 "intracellular" evidence=IEA]
            [GO:0006396 "RNA processing" evidence=IEA] InterPro:IPR003107
            InterPro:IPR011990 SMART:SM00386 GO:GO:0005622 GO:GO:0006396
            Gene3D:1.25.40.10 GeneTree:ENSGT00390000005033 IPI:IPI00951935
            Ensembl:ENSRNOT00000065090 ArrayExpress:D4A0B1 Uniprot:D4A0B1
        Length = 426

 Score = 405 (147.6 bits), Expect = 1.0e-36, P = 1.0e-36
 Identities = 116/400 (29%), Positives = 197/400 (49%)

Query:    95 VFERAVQSATYSVDVWFHYCSLSMSTFE--DP---NDVRRLFKRALSFVGKDYLCHTMWD 149
             V+ R +Q+   SVD+W HY +    T +  DP   + +R  F+ A+   G D+    +W+
Sbjct:    30 VYRRGLQAIPLSVDLWIHYINFLKETLDPGDPETNSTIRGTFEHAVLAAGTDFRSDKLWE 89

Query:   150 KYIEFEISQQRWSSLAQIFVQTLRFPSKKLHHYYDSFKK-LAGAWKEELECESDSAMEFQ 208
              YI +E  Q     +  ++ + L  P++   H++  FK+ +      +L    +  ++ +
Sbjct:    90 MYINWENEQGNLREVTAVYDRILGIPTQLYSHHFQRFKEHVQNNLPRDL-LTGEQFIQLR 148

Query:   209 SELVLEGEVPAYYKDD-----ETSSVIKDLLDPSVDLVRSKAIQKYRFIGEQIYKEASQL 263
              EL     V  +  DD     +  S I+D+ DP+  L+      ++R I  +I++E    
Sbjct:   149 RELA---SVNGHNGDDGPPGDDLPSGIEDITDPA-KLITEIENMRHRII--EIHQEMFNY 202

Query:   264 DE-KIN---CFENLIRRPYFHVKPLDDIQLKNWHDYLSFAEKQGDFDWVVKLYERCLIPC 319
             +E +++    FE  I+RPYFHVKPL+  QLKNW +YL F  + G  + VV L+ERC+I C
Sbjct:   203 NEHEVSKRWTFEEGIKRPYFHVKPLEKAQLKNWKEYLEFEIENGTHERVVVLFERCVISC 262

Query:   320 ADYPEFWMRYVDFMESKGGREIASYALDRATQIFLKRLPVIHLFNARYKEQIGDTSAARA 379
             A Y EFW++Y  +ME+    E   +   RA  + L + P+ H+  A ++EQ G+ + AR 
Sbjct:   263 ALYEEFWIKYAKYMENHS-IEGVRHVFSRACTVHLPKKPMAHMLWAAFEEQQGNINEARI 321

Query:   380 ---AFPESYIDSDSRFIEKVTFKANMERRLGNFVAACDTYKEALETAAEQRKFHTLPLLY 436
                 F E  +      + +V+    +ERR GN   A    +  L+ A    K +     Y
Sbjct:   322 ILRTFEECVLGLAMVRLRRVS----LERRHGNMEEA----EHLLQDAIRNAKSNNESSFY 373

Query:   437 -VQFSRLTYTTTGSADNARDILIDGI-KHVPNCKLLLEEL 474
              ++ +R  +    +   +R +L++ I K   N KL L  L
Sbjct:   374 AIKLARHLFKIQKNLPKSRKVLLEAIEKDKENTKLYLNLL 413


>UNIPROTKB|E2RPB9 [details] [associations]
            symbol:PRPF39 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0006396 "RNA processing" evidence=IEA]
            [GO:0005622 "intracellular" evidence=IEA] InterPro:IPR003107
            SMART:SM00386 GO:GO:0005622 GO:GO:0006396
            GeneTree:ENSGT00390000005033 EMBL:AAEX03005706
            Ensembl:ENSCAFT00000022296 Uniprot:E2RPB9
        Length = 299

 Score = 317 (116.6 bits), Expect = 2.8e-27, P = 2.8e-27
 Identities = 79/249 (31%), Positives = 131/249 (52%)

Query:    95 VFERAVQSATYSVDVWFHYCSLSMSTFE--DP---NDVRRLFKRALSFVGKDYLCHTMWD 149
             V+ R +Q+   SVD+W HY +    T +  DP   + +R  F+ A+   G D+    +W+
Sbjct:    30 VYRRGLQAIPLSVDLWIHYINFLKETLDPGDPETNSTIRGTFEHAVLAAGTDFRSDRLWE 89

Query:   150 KYIEFEISQQRWSSLAQIFVQTLRFPSKKLHHYYDSFKK-LAGAWKEELECESDSAMEFQ 208
              YI +E  Q     +  I+ + L  P++   H++  FK+ +      +L    +  ++ +
Sbjct:    90 MYINWENEQGNLREVTAIYDRILGIPTQLYSHHFQRFKEHVQNNLPRDL-LTGEQFIQLR 148

Query:   209 SELVLEGEVPAYYKDD-----ETSSVIKDLLDPSVDLVRSKAIQKYRFIGEQIYKEASQL 263
              EL     V  +  DD     +  S I+D+ DP+  L+      ++R I  +I++E    
Sbjct:   149 RELA---SVNGHSGDDGPPGDDLPSGIEDITDPA-KLITEIENMRHRII--EIHQEMFNY 202

Query:   264 DE-KIN---CFENLIRRPYFHVKPLDDIQLKNWHDYLSFAEKQGDFDWVVKLYERCLIPC 319
             +E +++    FE  I+RPYFHVKPL+  QLKNW +YL F  + G  + VV L+ERC+I C
Sbjct:   203 NEHEVSKRWTFEEGIKRPYFHVKPLEKAQLKNWKEYLEFEIENGTHERVVVLFERCVISC 262

Query:   320 ADYPEFWMR 328
             A Y EFW++
Sbjct:   263 ALYEEFWIK 271


>SGD|S000004509 [details] [associations]
            symbol:PRP39 "U1 snRNP protein involved in splicing"
            species:4932 "Saccharomyces cerevisiae" [GO:0005622 "intracellular"
            evidence=IEA] [GO:0006396 "RNA processing" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0005685 "U1 snRNP"
            evidence=IDA] [GO:0000243 "commitment complex" evidence=IGI]
            [GO:0000395 "mRNA 5'-splice site recognition" evidence=IGI]
            [GO:0030627 "pre-mRNA 5'-splice site binding" evidence=IGI]
            [GO:0071004 "U2-type prespliceosome" evidence=IDA] [GO:0008380 "RNA
            splicing" evidence=IEA] [GO:0006397 "mRNA processing" evidence=IEA]
            InterPro:IPR003107 SMART:SM00386 SGD:S000004509 EMBL:BK006946
            eggNOG:COG0457 EMBL:Z47816 GO:GO:0000243 GO:GO:0071004
            GO:GO:0005685 GO:GO:0000395 GeneTree:ENSGT00390000005033 KO:K13217
            OrthoDB:EOG4DNJD8 EMBL:L29224 EMBL:AY692912 PIR:S47920
            RefSeq:NP_013667.1 ProteinModelPortal:P39682 DIP:DIP-812N
            IntAct:P39682 MINT:MINT-612602 STRING:P39682 PaxDb:P39682
            PRIDE:P39682 EnsemblFungi:YML046W GeneID:854960 KEGG:sce:YML046W
            CYGD:YML046w HOGENOM:HOG000115710 OMA:NGHPGIA NextBio:978042
            Genevestigator:P39682 GermOnline:YML046W Uniprot:P39682
        Length = 629

 Score = 208 (78.3 bits), Expect = 6.5e-22, Sum P(2) = 6.5e-22
 Identities = 52/198 (26%), Positives = 102/198 (51%)

Query:     8 LESLSAEPNSPVGFGKQGLEEFIA----EGSLDFDEWTSL---LSEIENSC-----PDDI 55
             +E +   P++  G   Q L++  A       LD+ + +SL   +  IE +      P+D 
Sbjct:     9 IEDIEPRPDALRGLDTQFLQDNTALVQAYRGLDWSDISSLTQMVDVIEQTVVKYGNPNDS 68

Query:    56 EMIGL--VYDSFLAEFPLCYGYWRKYADHKARLCSIDKVVEVFERAVQSATYSVDVWFHY 113
               + L  +    L ++PL +G+W+++A  + +L  + K + V   +V+    S+++W  Y
Sbjct:    69 IKLALETILWQILRKYPLLFGFWKRFATIEYQLFGLKKSIAVLATSVKWFPTSLELWCDY 128

Query:   114 CSLSMSTFEDPND---VRRLFKRALSFVGKDYLCHTMWDKYIEFEISQQRWSSLAQIFVQ 170
               L++    +PN+   +R  F+ A   +GK +L H  WDK+IEFE+ Q+ W ++ +I+  
Sbjct:   129 --LNVLCVNNPNETDFIRNNFEIAKDLIGKQFLSHPFWDKFIEFEVGQKNWHNVQRIYEY 186

Query:   171 TLRFPSKKLHHYYDSFKK 188
              +  P  +   ++ S+KK
Sbjct:   187 IIEVPLHQYARFFTSYKK 204

 Score = 134 (52.2 bits), Expect = 6.5e-22, Sum P(2) = 6.5e-22
 Identities = 30/99 (30%), Positives = 56/99 (56%)

Query:   241 VRSKAIQKYRFIGEQIYKEASQLDEKINCFENLIRRPYFHVKPLDDIQLKNWHDYLSFA- 299
             +  K ++  R I   + K  + ++E I  FE+ I++P+F++  + +  L+NW  YL F  
Sbjct:   206 LNEKNLKTTRNIDIVLRKTQTTVNE-IWQFESKIKQPFFNLGQVLNDDLENWSRYLKFVT 264

Query:   300 --EKQGDFDWVVKLYERCLIPCADYPEFWMRYVDFMESK 336
                K  D ++V+ +++RCLIPC  +   WM Y+ ++  K
Sbjct:   265 DPSKSLDKEFVMSVFDRCLIPCLYHENTWMMYIKWLTKK 303


>WB|WBGene00017768 [details] [associations]
            symbol:F25B4.5 species:6239 "Caenorhabditis elegans"
            [GO:0005622 "intracellular" evidence=IEA] [GO:0006396 "RNA
            processing" evidence=IEA] [GO:0009792 "embryo development ending in
            birth or egg hatching" evidence=IMP] [GO:0007281 "germ cell
            development" evidence=IMP] InterPro:IPR003107 InterPro:IPR011990
            SMART:SM00386 GO:GO:0009792 GO:GO:0005622 GO:GO:0006396
            Gene3D:1.25.40.10 GO:GO:0007281 EMBL:FO080538
            GeneTree:ENSGT00390000005033 KO:K13217 PIR:T25725
            RefSeq:NP_504495.1 ProteinModelPortal:Q22961 SMR:Q22961
            STRING:Q22961 PaxDb:Q22961 EnsemblMetazoa:F25B4.5.1
            EnsemblMetazoa:F25B4.5.2 GeneID:178955 KEGG:cel:CELE_F25B4.5
            UCSC:F25B4.5.1 CTD:178955 WormBase:F25B4.5 eggNOG:NOG298273
            HOGENOM:HOG000018990 InParanoid:Q22961 OMA:VLLELRY NextBio:903274
            Uniprot:Q22961
        Length = 710

 Score = 217 (81.4 bits), Expect = 1.1e-14, Sum P(3) = 1.1e-14
 Identities = 46/148 (31%), Positives = 83/148 (56%)

Query:    10 SLSAEPNSPVGFGKQGLEEFIAEGSLDFDEWTSLLSEIENSCPDDIEMIGLVYDSFLAEF 69
             SL   P+    FG   ++  +A    DFD W ++L++++ S  DD++     Y SFL+ +
Sbjct:    78 SLMTRPSVASHFGTPPID--VA----DFDNWVNILAKVDQS--DDVDFAREKYRSFLSRY 129

Query:    70 PLCYGYWRKYADHKARLCSIDKVVEVFERAVQSATYSVDVWFHYCS--LSMSTFEDPNDV 127
             P CYG+W+KYA+++ ++ +I +   V+E+ + S   S+D+W  Y +   ++  F  P  +
Sbjct:   130 PNCYGFWQKYAEYEKKMGNIAEAKAVWEKGIISIPLSIDLWLGYTADVKNIKNFP-PESL 188

Query:   128 RRLFKRALSFVGKDYLCHTMWDKYIEFE 155
             R L+ RA+   G +Y    +W + I FE
Sbjct:   189 RDLYARAIEIAGLEYQSDRLWLEAIGFE 216

 Score = 201 (75.8 bits), Expect = 1.3e-11, Sum P(2) = 1.3e-11
 Identities = 85/336 (25%), Positives = 154/336 (45%)

Query:    14 EPNSPVGFGKQGLEEFIAEGSLDFDEWTSLLSEIENSCPDDIEMIGLVYDSFLAEFPLCY 73
             + +  V F ++    F++     +  W    +E E     +I     V++  +   PL  
Sbjct:   110 DQSDDVDFAREKYRSFLSRYPNCYGFWQKY-AEYEKKM-GNIAEAKAVWEKGIISIPLSI 167

Query:    74 GYWRKY-ADHK-ARLCSIDKVVEVFERAVQSA--TYSVD-VWFHYCSLSMSTFEDP---- 124
               W  Y AD K  +    + + +++ RA++ A   Y  D +W        + + D     
Sbjct:   168 DLWLGYTADVKNIKNFPPESLRDLYARAIEIAGLEYQSDRLWLEAIGFERAVYMDELCKG 227

Query:   125 --N-DVRR---LFKRALSFVGKDYLCHTMWDKYIEFEISQQRWSSLA-QIFVQTLRFPSK 177
               N   +R   LF + LS     +   + +D+Y+++  + +    L+ + + + ++   K
Sbjct:   228 NTNASCKRIGVLFDKLLST--PTFHAPSHFDRYVQYLNTIEPHLLLSDREYEEIMKMVCK 285

Query:   178 KLHHYYDSFKKLAGAWKEELECES--DSAMEFQSELVLEGEVPAYYKDDETSSVIKDLLD 235
             +L     S ++L    +    C+S  +  +   +E   EG  P       T + ++   D
Sbjct:   286 QLGK---SIEELVQQVQLSYICQSGENGMLNIMTESA-EGTFPI------TVNSLQH--D 333

Query:   236 PS-VDLVRSKAIQKYRFIGEQIYKEASQLDEKINC-FENLIRRPYFHVKPLDDIQLKNWH 293
             P+ + L+R + + + + I ++  KE      +I   FE  I+RPYFHVKPLD  QL NW 
Sbjct:   334 PTALQLIRGEIVARRKRIYDKNMKEC-----EIRAGFEANIKRPYFHVKPLDYPQLFNWM 388

Query:   294 DYLSFAEKQGDFDWVVKLYERCLIPCADYPEFWMRY 329
              YL F  K+G  + V  L++RCLIPC+ Y EFW++Y
Sbjct:   389 SYLDFEIKEGHEERVKILFDRCLIPCSLYEEFWIKY 424

 Score = 57 (25.1 bits), Expect = 1.1e-14, Sum P(3) = 1.1e-14
 Identities = 12/30 (40%), Positives = 15/30 (50%)

Query:   321 DYPEF--WMRYVDFMESKGGREIASYALDR 348
             DYP+   WM Y+DF   +G  E      DR
Sbjct:   380 DYPQLFNWMSYLDFEIKEGHEERVKILFDR 409

 Score = 46 (21.3 bits), Expect = 1.5e-13, Sum P(2) = 1.5e-13
 Identities = 19/90 (21%), Positives = 45/90 (50%)

Query:   257 YKEASQLDEKINCFENLIRRPYFHVKPLDDIQLKNWHDYLSFA-EKQGDFDWVVKLYERC 315
             Y + S+ D K+   + ++++    + P + +QL + +  ++++ E   + D V++ ++  
Sbjct:   541 YHQKSRRDPKLA--QKVLKKA-ISIDPFN-LQLYSQYVDIAYSSESMSELD-VIQSFDVA 595

Query:   316 L---IPCADYPEFWMRYVDFMESKGGREIA 342
             L   +   D   F  R +DF+E  G   +A
Sbjct:   596 LDSNLRLEDKVRFSQRKLDFLEELGNNILA 625

 Score = 44 (20.5 bits), Expect = 1.1e-14, Sum P(3) = 1.1e-14
 Identities = 11/39 (28%), Positives = 21/39 (53%)

Query:   436 YVQFSRLTYTTTGSADNARDILIDGIKHVP-NCKLLLEE 473
             +++++R T+ T  S   +R+I +    H P +  L L E
Sbjct:   421 WIKYARWTWKTYKSKTKSREIYMKAKIHCPTSLNLALSE 459

 Score = 37 (18.1 bits), Expect = 1.3e-12, Sum P(2) = 1.3e-12
 Identities = 10/43 (23%), Positives = 18/43 (41%)

Query:   298 FAEKQGDFDWVVKLYERCLIPCADYPEFWMRYVDFMESKGGRE 340
             F E   +FD  +K+ +        Y    +RY+  +  K  +E
Sbjct:   462 FEESVENFDDAIKILDNFREEYPGYVLLELRYLGVLRRKSEKE 504


>CGD|CAL0004777 [details] [associations]
            symbol:PRP39 species:5476 "Candida albicans" [GO:0000375 "RNA
            splicing, via transesterification reactions" evidence=ISO]
            [GO:0000395 "mRNA 5'-splice site recognition" evidence=IEA;ISO]
            [GO:0000243 "commitment complex" evidence=IEA;ISO] [GO:0005685 "U1
            snRNP" evidence=IEA;ISO] [GO:0030627 "pre-mRNA 5'-splice site
            binding" evidence=IEA;ISO] [GO:0071004 "U2-type prespliceosome"
            evidence=IEA] InterPro:IPR003107 SMART:SM00386 CGD:CAL0004777
            eggNOG:COG0457 EMBL:AACQ01000008 EMBL:AACQ01000007 GO:GO:0000243
            GO:GO:0005685 GO:GO:0000395 KO:K13217 RefSeq:XP_722403.1
            RefSeq:XP_722540.1 GeneID:3635832 GeneID:3635925
            KEGG:cal:CaO19.1492 KEGG:cal:CaO19.9069 Uniprot:Q5ALT2
        Length = 655

 Score = 191 (72.3 bits), Expect = 6.8e-12, Sum P(2) = 6.8e-12
 Identities = 64/269 (23%), Positives = 128/269 (47%)

Query:    99 AVQSATYSVDVWFHYCSLSMSTFEDPNDV---RRLFKRALSFVGKDYLCHTMWDKYIEFE 155
             AV++   S+ +W  Y S  +   +D  D    R  +K+AL   G D+  H +WD  IEFE
Sbjct:     2 AVENYPNSISLWTQYLSSILIHDKDKTDTELFRNAYKQALIHNGYDFNSHPIWDMAIEFE 61

Query:   156 ISQQRWSS-LAQIFVQTLRFPSKKLHHYYDSFKKLAGAWK-EELECESDSAMEFQSELVL 213
              +Q + S  L +++++ ++ P  +   YY+ F ++   ++ +++   SD   ++ +E   
Sbjct:    62 TNQSKQSKELLELYLRIIKIPLYQYAQYYNQFSEINKQFEIQQIITSSDQLNQYVNEF-- 119

Query:   214 EGEVPAYYKDD----ETSSVIKDLLDPSVDLVRSKAIQKYRFIGEQIYKEASQLDEKINC 269
              G+    + DD    E   +I D         +S+  + ++F  E +  E  +   +I  
Sbjct:   120 -GK---NHLDDLSLLEKHQIIDDFTASIFSNTQSRVNKNWQF--ESLL-ETQEFTLEIGI 172

Query:   270 FENLIRRPYFHVKPLDDIQLKNWHDYL-SFAEKQGD--FDWVVKLYERCLIP-CADYPEF 325
               N       ++     I +   H  + ++A+K  +  F+ V  L++RCLIP C D  E 
Sbjct:   173 SNN--NNNNNNIAKEKPIWINYLHQEIDTYAKKPNNDQFELVCTLFQRCLIPNCYD-SEI 229

Query:   326 WMRYVDFMESKGGREIASYALDRATQIFL 354
             W++Y+DF+ +    +   +  D+  +I++
Sbjct:   230 WLKYLDFINNSSLSKQEKF--DKQQEIYI 256

 Score = 56 (24.8 bits), Expect = 6.8e-12, Sum P(2) = 6.8e-12
 Identities = 14/43 (32%), Positives = 25/43 (58%)

Query:   472 EELIKFTMVHGGRSHISIV--DAVISNALYSRPDVLKVFSLED 512
             + LI F  ++ GRS  +I+  D  ISN+LY     +K  ++++
Sbjct:   493 DNLINFLKLNDGRSDETIIMKDLEISNSLYYNKTTIKRHAMKN 535


>UNIPROTKB|Q5ALT2 [details] [associations]
            symbol:PRP39 "Potential spliceosomal U1 snRNP protein
            Prp39" species:237561 "Candida albicans SC5314" [GO:0000243
            "commitment complex" evidence=ISO] [GO:0000375 "RNA splicing, via
            transesterification reactions" evidence=ISO] [GO:0000395 "mRNA
            5'-splice site recognition" evidence=ISO] [GO:0005685 "U1 snRNP"
            evidence=ISO] [GO:0030627 "pre-mRNA 5'-splice site binding"
            evidence=ISO] InterPro:IPR003107 SMART:SM00386 CGD:CAL0004777
            eggNOG:COG0457 EMBL:AACQ01000008 EMBL:AACQ01000007 GO:GO:0000243
            GO:GO:0005685 GO:GO:0000395 KO:K13217 RefSeq:XP_722403.1
            RefSeq:XP_722540.1 GeneID:3635832 GeneID:3635925
            KEGG:cal:CaO19.1492 KEGG:cal:CaO19.9069 Uniprot:Q5ALT2
        Length = 655

 Score = 191 (72.3 bits), Expect = 6.8e-12, Sum P(2) = 6.8e-12
 Identities = 64/269 (23%), Positives = 128/269 (47%)

Query:    99 AVQSATYSVDVWFHYCSLSMSTFEDPNDV---RRLFKRALSFVGKDYLCHTMWDKYIEFE 155
             AV++   S+ +W  Y S  +   +D  D    R  +K+AL   G D+  H +WD  IEFE
Sbjct:     2 AVENYPNSISLWTQYLSSILIHDKDKTDTELFRNAYKQALIHNGYDFNSHPIWDMAIEFE 61

Query:   156 ISQQRWSS-LAQIFVQTLRFPSKKLHHYYDSFKKLAGAWK-EELECESDSAMEFQSELVL 213
              +Q + S  L +++++ ++ P  +   YY+ F ++   ++ +++   SD   ++ +E   
Sbjct:    62 TNQSKQSKELLELYLRIIKIPLYQYAQYYNQFSEINKQFEIQQIITSSDQLNQYVNEF-- 119

Query:   214 EGEVPAYYKDD----ETSSVIKDLLDPSVDLVRSKAIQKYRFIGEQIYKEASQLDEKINC 269
              G+    + DD    E   +I D         +S+  + ++F  E +  E  +   +I  
Sbjct:   120 -GK---NHLDDLSLLEKHQIIDDFTASIFSNTQSRVNKNWQF--ESLL-ETQEFTLEIGI 172

Query:   270 FENLIRRPYFHVKPLDDIQLKNWHDYL-SFAEKQGD--FDWVVKLYERCLIP-CADYPEF 325
               N       ++     I +   H  + ++A+K  +  F+ V  L++RCLIP C D  E 
Sbjct:   173 SNN--NNNNNNIAKEKPIWINYLHQEIDTYAKKPNNDQFELVCTLFQRCLIPNCYD-SEI 229

Query:   326 WMRYVDFMESKGGREIASYALDRATQIFL 354
             W++Y+DF+ +    +   +  D+  +I++
Sbjct:   230 WLKYLDFINNSSLSKQEKF--DKQQEIYI 256

 Score = 56 (24.8 bits), Expect = 6.8e-12, Sum P(2) = 6.8e-12
 Identities = 14/43 (32%), Positives = 25/43 (58%)

Query:   472 EELIKFTMVHGGRSHISIV--DAVISNALYSRPDVLKVFSLED 512
             + LI F  ++ GRS  +I+  D  ISN+LY     +K  ++++
Sbjct:   493 DNLINFLKLNDGRSDETIIMKDLEISNSLYYNKTTIKRHAMKN 535


>DICTYBASE|DDB_G0286645 [details] [associations]
            symbol:cstf3 "cleavage stimulation factor subunit 3"
            species:44689 "Dictyostelium discoideum" [GO:0006397 "mRNA
            processing" evidence=IEA] [GO:0006396 "RNA processing"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA;ISS] [GO:0005622
            "intracellular" evidence=IEA] [GO:0006379 "mRNA cleavage"
            evidence=ISS] [GO:0006378 "mRNA polyadenylation" evidence=ISS]
            [GO:0003723 "RNA binding" evidence=ISS] InterPro:IPR003107
            InterPro:IPR008847 InterPro:IPR011990 InterPro:IPR013026
            Pfam:PF05843 PROSITE:PS50293 SMART:SM00386 dictyBase:DDB_G0286645
            GO:GO:0005634 GenomeReviews:CM000153_GR GO:GO:0006378 GO:GO:0003723
            EMBL:AAFI02000089 Gene3D:1.25.40.10 GO:GO:0006379 eggNOG:COG5107
            KO:K14408 RefSeq:XP_637594.1 ProteinModelPortal:Q54LG7 PRIDE:Q54LG7
            EnsemblProtists:DDB0233707 GeneID:8625734 KEGG:ddi:DDB_G0286645
            InParanoid:Q54LG7 OMA:MARDIFE ProtClustDB:CLSZ2430079
            Uniprot:Q54LG7
        Length = 1065

 Score = 141 (54.7 bits), Expect = 2.6e-09, Sum P(3) = 2.6e-09
 Identities = 40/175 (22%), Positives = 84/175 (48%)

Query:    26 LEEFIAEGSLDFDEWTSLLSEIENSCPDDIEMIGLVYDSFLAEFPLCYGYWRKYADHKAR 85
             LE  I     D + WT LL+E+++  P  I +   +Y  FL+ FP    YW+ Y + + +
Sbjct:   168 LENRINNDMYDTEAWTLLLNEVQSQ-P--ISIARDIYKRFLSVFPTAGRYWKLYVEEEMK 224

Query:    86 LCSIDKVVEVFERAVQSATYSVDVWFHYCS----LSMSTFEDPNDVRRLFKRALSFVGKD 141
               + D V ++F   ++S   +V+ W  Y +    +     E+  ++ + F+ AL  +G D
Sbjct:   225 EKNYDIVEKIFFENLRSVK-NVEFWKSYIAYIKQIKGDKVENREEIIKAFEFALESIGMD 283

Query:   142 YLCHTMWDKYIEF--------EISQ-QRWSSLAQIFVQTLRFPSKKLHHYYDSFK 187
                 ++W  YI+F        +  + Q+ +++ +++ + +  P   L + Y  ++
Sbjct:   284 ISSTSIWTDYIQFLKDEKASTQFEEGQKMTAIRKLYQRAIENPMHDLDNIYKEYE 338

 Score = 92 (37.4 bits), Expect = 2.6e-09, Sum P(3) = 2.6e-09
 Identities = 46/226 (20%), Positives = 82/226 (36%)

Query:   248 KYRFIGEQIYKEASQLDEKINCFENLIRRPYFHVKPLDDIQLKNWHDYLSFAEKQGD-FD 306
             KY+     +Y++   L E I    N++ +P       ++ Q++ W   +++       FD
Sbjct:   359 KYQH-ARNVYRDRKSLLEGI--LRNMLAKPP-RSSDKEEHQVRLWRKLITYERSNPQKFD 414

Query:   307 WV------VKLYERCLIPCADYPEFWMRYVDFMESKGGREIASYALDRATQIFLKRLPVI 360
              V      +  Y +CL+    YP+ W     ++   G         DR+     K L  I
Sbjct:   415 AVTLRNRVIATYNQCLLCLYHYPDIWYEAATYLADCGDSSGCIAMFDRSLIALPKNL-FI 473

Query:   361 HLFNARYKEQIGDTSAARAAFPESYIDSDSRFIEKVTFKANMERRLGNFVAACDTYKEAL 420
             H   A Y E       A+  + E  + ++   +  + +     RR          +K A 
Sbjct:   474 HFAYADYLESQKKQPQAKEIY-EKILQANPEPLVWIQYM-KFSRRTERIEGPRKIFKRAK 531

Query:   421 ETAAEQRKFHTLPLLYVQFSRLTYTTTGSADNARDILIDGIKHVPN 466
              T      +H    +Y+    + Y        ARDI   G+K  P+
Sbjct:   532 STP--DCTYH----VYIALGLIEYYINQDTRMARDIFEIGLKKFPS 571

 Score = 48 (22.0 bits), Expect = 3.4e-09, Sum P(4) = 3.4e-09
 Identities = 12/41 (29%), Positives = 17/41 (41%)

Query:   795 QTRTVSPSSSASHQNFIPEAHSHPQTPANSGRNWHEQQNPD 835
             Q    SP+ S +    IP     PQ P       H+++ PD
Sbjct:   926 QANRTSPTLS-NETLIIPNKPQQPQQPQPQQTQTHKRKQPD 965

 Score = 47 (21.6 bits), Expect = 4.3e-09, Sum P(4) = 4.3e-09
 Identities = 8/22 (36%), Positives = 13/22 (59%)

Query:   643 DSSSQDRMQQVPPEAAEQHSQD 664
             D S+ ++ QQ PP+   Q  Q+
Sbjct:   966 DESNNEQQQQPPPQQPPQQQQE 987

 Score = 43 (20.2 bits), Expect = 2.6e-09, Sum P(3) = 2.6e-09
 Identities = 11/34 (32%), Positives = 15/34 (44%)

Query:   911 PMQNMQQQTFASASQSEVPAQPVFYPQAQMSQYP 944
             P Q  QQQ       + + +QP+   Q Q  Q P
Sbjct:   787 PQQQQQQQ-----QPTPISSQPISQQQQQQQQQP 815

 Score = 39 (18.8 bits), Expect = 2.6e-08, Sum P(4) = 2.6e-08
 Identities = 12/42 (28%), Positives = 17/42 (40%)

Query:   901 FLHSLTAQAWPMQNMQQQTFASASQSEVPAQPVFYPQAQMSQ 942
             FL+    Q    Q +QQQ      Q ++  Q +   Q Q  Q
Sbjct:   886 FLNQQQLQL-QQQQLQQQQLQLQQQQQLQLQQIQQHQQQQQQ 926

 Score = 37 (18.1 bits), Expect = 3.4e-09, Sum P(4) = 3.4e-09
 Identities = 12/41 (29%), Positives = 19/41 (46%)

Query:   579 QPFESEHLMPSASQDKKFSPPEKSDSESGDD--ATSLPSNQ 617
             QP  S+  + + +Q +   PP +S           +LPSNQ
Sbjct:   821 QPISSQPSLQTNTQQQGNQPPNRSGLPDFIFYFLQNLPSNQ 861


>CGD|CAL0001111 [details] [associations]
            symbol:PRP42 species:5476 "Candida albicans" [GO:0003723 "RNA
            binding" evidence=IEA;ISO] [GO:0000398 "mRNA splicing, via
            spliceosome" evidence=IEA;ISO] [GO:0005685 "U1 snRNP"
            evidence=IEA;ISO] [GO:0071004 "U2-type prespliceosome"
            evidence=IEA] InterPro:IPR003107 SMART:SM00386 CGD:CAL0001111
            GO:GO:0003723 eggNOG:COG0457 GO:GO:0000398 EMBL:AACQ01000091
            EMBL:AACQ01000090 GO:GO:0005685 RefSeq:XP_715099.1
            RefSeq:XP_715150.1 ProteinModelPortal:Q5A014 GeneID:3643177
            GeneID:3643241 KEGG:cal:CaO19.11852 KEGG:cal:CaO19.4374
            Uniprot:Q5A014
        Length = 393

 Score = 168 (64.2 bits), Expect = 4.1e-09, P = 4.1e-09
 Identities = 57/299 (19%), Positives = 132/299 (44%)

Query:    39 EWTSLLSEIENSCPDDIEMIGLVYDSFLAEFPLCYGYWRKYADHKARLCSIDKVVEVFER 98
             EW       +++  +++ ++   Y+SFL +FP  + YW +YA+ + +L +     +++ R
Sbjct:    33 EWNEKRGINKSTSEEELNVLRTSYNSFLEKFPFQFKYWIRYAEWEFKLGNTSTAEQIYLR 92

Query:    99 AVQSA-TYSVDVWFHYCSLSMSTFEDP-NDVRRLFKRALSFVGKDYLCHTMWDKYIEFEI 156
              + +  ++ +++W  Y +  ++T  D  +++ + F+ A   +G  +     ++ Y+ F  
Sbjct:    93 GLNTQLSHCIELWISYLNFKINTINDNISEILQKFEAARDLIGFHFFGFEFYELYLSFLD 152

Query:   157 SQQRWSS-LAQIFVQTLRFPSK-KLHHYYDSFKKLAGAWKEELECESDSAMEFQSELVLE 214
             + +  ++   + +   LR   +  ++HY   +KK      + L  +   A +    +   
Sbjct:   153 NYKNDNNEFEKKYYILLRIILEIPIYHYGIFYKKWFDLI-DNLSKDEKLAKQIAPYIAPA 211

Query:   215 GEVPAYYKDDETSSVIKDLLDPSVDLVRSKAIQKYRFIGEQIYKEASQLDEKINCFENLI 274
              E+ A     + +S+  +L     D     A Q + F   ++Y+   +L  K N      
Sbjct:   212 NEI-ATLASKKNTSIFNELKKRFTDAYI--ATQYHSF---ELYEFEKKLIPKNN------ 259

Query:   275 RRPYFHVKPLDDIQLKNWHDYLSFAE-KQGDFDWVVKLYERCLIPCADYPEFWMRYVDF 332
             + P          +L  W  Y+ + E KQ    ++  +Y R L    +YP+ W ++ D+
Sbjct:   260 KNPQQDNDLRSRQELDAWMSYIDYLEIKQYPIKFIELVYYRFLYNARNYPQTWSKFADY 318


>UNIPROTKB|Q5A014 [details] [associations]
            symbol:PRP42 "Potential spliceosomal U1 snRNP protein"
            species:237561 "Candida albicans SC5314" [GO:0000398 "mRNA
            splicing, via spliceosome" evidence=ISO] [GO:0003723 "RNA binding"
            evidence=ISO] [GO:0005685 "U1 snRNP" evidence=ISO]
            InterPro:IPR003107 SMART:SM00386 CGD:CAL0001111 GO:GO:0003723
            eggNOG:COG0457 GO:GO:0000398 EMBL:AACQ01000091 EMBL:AACQ01000090
            GO:GO:0005685 RefSeq:XP_715099.1 RefSeq:XP_715150.1
            ProteinModelPortal:Q5A014 GeneID:3643177 GeneID:3643241
            KEGG:cal:CaO19.11852 KEGG:cal:CaO19.4374 Uniprot:Q5A014
        Length = 393

 Score = 168 (64.2 bits), Expect = 4.1e-09, P = 4.1e-09
 Identities = 57/299 (19%), Positives = 132/299 (44%)

Query:    39 EWTSLLSEIENSCPDDIEMIGLVYDSFLAEFPLCYGYWRKYADHKARLCSIDKVVEVFER 98
             EW       +++  +++ ++   Y+SFL +FP  + YW +YA+ + +L +     +++ R
Sbjct:    33 EWNEKRGINKSTSEEELNVLRTSYNSFLEKFPFQFKYWIRYAEWEFKLGNTSTAEQIYLR 92

Query:    99 AVQSA-TYSVDVWFHYCSLSMSTFEDP-NDVRRLFKRALSFVGKDYLCHTMWDKYIEFEI 156
              + +  ++ +++W  Y +  ++T  D  +++ + F+ A   +G  +     ++ Y+ F  
Sbjct:    93 GLNTQLSHCIELWISYLNFKINTINDNISEILQKFEAARDLIGFHFFGFEFYELYLSFLD 152

Query:   157 SQQRWSS-LAQIFVQTLRFPSK-KLHHYYDSFKKLAGAWKEELECESDSAMEFQSELVLE 214
             + +  ++   + +   LR   +  ++HY   +KK      + L  +   A +    +   
Sbjct:   153 NYKNDNNEFEKKYYILLRIILEIPIYHYGIFYKKWFDLI-DNLSKDEKLAKQIAPYIAPA 211

Query:   215 GEVPAYYKDDETSSVIKDLLDPSVDLVRSKAIQKYRFIGEQIYKEASQLDEKINCFENLI 274
              E+ A     + +S+  +L     D     A Q + F   ++Y+   +L  K N      
Sbjct:   212 NEI-ATLASKKNTSIFNELKKRFTDAYI--ATQYHSF---ELYEFEKKLIPKNN------ 259

Query:   275 RRPYFHVKPLDDIQLKNWHDYLSFAE-KQGDFDWVVKLYERCLIPCADYPEFWMRYVDF 332
             + P          +L  W  Y+ + E KQ    ++  +Y R L    +YP+ W ++ D+
Sbjct:   260 KNPQQDNDLRSRQELDAWMSYIDYLEIKQYPIKFIELVYYRFLYNARNYPQTWSKFADY 318


>SGD|S000002643 [details] [associations]
            symbol:PRP42 "U1 snRNP protein involved in splicing"
            species:4932 "Saccharomyces cerevisiae" [GO:0005681 "spliceosomal
            complex" evidence=IEA] [GO:0003723 "RNA binding" evidence=IEA;IDA]
            [GO:0000398 "mRNA splicing, via spliceosome" evidence=IMP]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0005685 "U1 snRNP"
            evidence=IDA] [GO:0008380 "RNA splicing" evidence=IEA] [GO:0030529
            "ribonucleoprotein complex" evidence=IEA] [GO:0071004 "U2-type
            prespliceosome" evidence=IDA] [GO:0006397 "mRNA processing"
            evidence=IEA] SGD:S000002643 EMBL:BK006938 GO:GO:0003723
            eggNOG:COG0457 EMBL:Z49701 GO:GO:0000398 GO:GO:0071004
            GO:GO:0005685 GeneTree:ENSGT00390000005033 OrthoDB:EOG4DNJD8
            EMBL:AF020682 PIR:S54531 RefSeq:NP_010521.1
            ProteinModelPortal:Q03776 SMR:Q03776 DIP:DIP-2836N IntAct:Q03776
            MINT:MINT-619717 STRING:Q03776 PaxDb:Q03776 EnsemblFungi:YDR235W
            GeneID:851821 KEGG:sce:YDR235W CYGD:YDR235w HOGENOM:HOG000115713
            OMA:FWEMYLE NextBio:969692 Genevestigator:Q03776 GermOnline:YDR235W
            Uniprot:Q03776
        Length = 544

 Score = 168 (64.2 bits), Expect = 7.8e-09, P = 7.8e-09
 Identities = 75/317 (23%), Positives = 134/317 (42%)

Query:    52 PDDIEMIGLVYDSFLAEFPLCYGYWRKYADHKARLCSIDKVVEVFERAVQSATY-SVDVW 110
             P  +++I   Y S L EFP    Y+  +A  + +L ++    ++F+R +Q+    S+ +W
Sbjct:    50 PQLLKLIRCTYSSMLNEFPYLENYYIDFALLEYKLGNVSMSHKIFQRGLQAFNQRSLLLW 109

Query:   111 FHYCSLSMSTFEDPNDVRRLFKRALSFVGKDYLCHTMWDKYIEFEISQQRWSSLA--QIF 168
               Y     +       + + ++ A  +VG  +     WD Y+E +IS +  SS     + 
Sbjct:   110 TSYLKFCNNVISHQKQLFKKYETAEEYVGLHFFSGEFWDLYLE-QISSRCTSSKKYWNVL 168

Query:   169 VQTLRFPSKKLHHYYDSFKKLAGAWKEELECESDSAMEFQSELVLEGEVPAYYKDDETSS 228
              + L  P   LH    SF K    W + ++   D  ++  S+L  + E+    K D   S
Sbjct:   169 RKILEIP---LH----SFSKFYALWLQRIDDIMD--LKQLSQLTSKDELLKKLKIDINYS 219

Query:   229 VIKD-LLDPSVDLVRSKAIQKYRFIGEQIYKEASQLDEKINCFENLIRRPYFHVKPLDDI 287
               K   L  +   ++    + Y  +  Q+ +  S  + KI  + N    P   V   D+I
Sbjct:   220 GRKGPYLQDAKKKLKKITKEMYMVVQYQVLEIYSIFESKI--YINYYTSPETLVSS-DEI 276

Query:   288 QLKNWHDYLSFAEKQGDFDWVVKL-YERCLIPCADYPEFWMRYVDFM-ESKGGREIASYA 345
             +   W  YL +       D +  L ++R L+P A Y   W++Y  ++  SK     A   
Sbjct:   277 E--TWIKYLDYTITL-QTDSLTHLNFQRALLPLAHYDLVWIKYSKWLINSKNDLLGAKNV 333

Query:   346 LDRATQIFLKRLPVIHL 362
             L    +  LK+  +I L
Sbjct:   334 LLMGLKFSLKKTEIIKL 350


>UNIPROTKB|E9PLP8 [details] [associations]
            symbol:CSTF3 "Cleavage stimulation factor subunit 3"
            species:9606 "Homo sapiens" [GO:0005622 "intracellular"
            evidence=IEA] [GO:0006396 "RNA processing" evidence=IEA]
            InterPro:IPR003107 InterPro:IPR011990 SMART:SM00386 GO:GO:0005622
            GO:GO:0006396 Gene3D:1.25.40.10 EMBL:AC131263 HGNC:HGNC:2485
            IPI:IPI00982702 ProteinModelPortal:E9PLP8 SMR:E9PLP8
            Ensembl:ENST00000524827 ArrayExpress:E9PLP8 Bgee:E9PLP8
            Uniprot:E9PLP8
        Length = 185

 Score = 136 (52.9 bits), Expect = 6.4e-08, P = 6.4e-08
 Identities = 34/132 (25%), Positives = 65/132 (49%)

Query:    27 EEFIAEGSLDFDEWTSLLSEIENSCPDDIEMIGLVYDSFLAEFPLCYGYWRKYADHKARL 86
             E+ + E   D D W+ L+ E +N  P  I+     Y+  +A+FP    +W+ Y + + + 
Sbjct:    54 EKKLEENPYDLDAWSILIREAQNQ-P--IDKARKTYERLVAQFPSSGRFWKLYIEAEIKA 110

Query:    87 CSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFED-PNDVRRL---FKRALSFVGKDY 142
              + DKV ++F+R +    + +D+W  Y S    T    P+   ++   +  AL  +G + 
Sbjct:   111 KNYDKVEKLFQRCLMKVLH-IDLWKCYLSYVRETKGKLPSYKEKMAQAYDFALDKIGMEI 169

Query:   143 LCHTMWDKYIEF 154
             + + +W  YI F
Sbjct:   170 MSYQIWVDYINF 181


>TAIR|locus:2080853 [details] [associations]
            symbol:AT3G51110 species:3702 "Arabidopsis thaliana"
            [GO:0005622 "intracellular" evidence=IEA;ISS] [GO:0005634 "nucleus"
            evidence=ISM] [GO:0006396 "RNA processing" evidence=IEA;ISS]
            InterPro:IPR003107 InterPro:IPR011990 InterPro:IPR013026
            Pfam:PF02184 PROSITE:PS50293 SMART:SM00386 EMBL:CP002686
            GO:GO:0005622 GO:GO:0006396 Gene3D:1.25.40.10 IPI:IPI00546615
            RefSeq:NP_566944.1 UniGene:At.857 ProteinModelPortal:F4J390
            SMR:F4J390 EnsemblPlants:AT3G51110.1 GeneID:824275
            KEGG:ath:AT3G51110 OMA:ERSHTIF Uniprot:F4J390
        Length = 413

 Score = 155 (59.6 bits), Expect = 1.2e-07, P = 1.2e-07
 Identities = 66/294 (22%), Positives = 127/294 (43%)

Query:    76 WRKYADHKARLCSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFEDPNDVRRLFKRAL 135
             W KYA+ + R  S++    V++RAV+        W+ Y  +      + +  R++F+R +
Sbjct:   109 WLKYAEFEMRNKSVNHARNVWDRAVKILPRVDQFWYKYIHME-EILGNIDGARKIFERWM 167

Query:   136 SFVGKD--YLCHTMWD-KYIEFEISQQRWSSLAQIFVQTLRFPSKKLHHYYDSFKKLAG- 191
              +      +LC   ++ +Y E E S+  +        +   F         +S   LA  
Sbjct:   168 DWSPDQQAWLCFIKFELRYNEIERSRSIYERFVLCHPKASSFIRYAKFEMKNSQVSLARI 227

Query:   192 AWKEELECESDSAMEFQSELVLEGEVPAYYKDDETSSVI-KDLLDPSVDLVRSKAIQKYR 250
              ++  +E   D   E +   V   E     K+ E +  + K  LD  +   R++ + K +
Sbjct:   228 VYERAIEMLKDVEEEAEMIFVAFAEFEELCKEVERARFLYKYALD-HIPKGRAEDLYK-K 285

Query:   251 FIG-EQIYKEASQLDEKINCFENLIRRPYFHVKPLDDIQLKNWHDYLSFAEKQGDFDWVV 309
             F+  E+ Y     +D+ I     L         PL+     +W DY+S  E  GD D + 
Sbjct:   286 FVAFEKQYGNKEGIDDAIVGRRKLQYEGEVRKNPLN---YDSWFDYISLEETLGDKDRIR 342

Query:   310 KLYERCL--IPCADYPEFWMRYVD-FMESKGGREIASYALDRATQIFLKRLPVI 360
             ++YER +  +P A+   +W RY+  +++     EI +  ++R   ++ + L +I
Sbjct:   343 EVYERAIANVPLAEEKRYWQRYIYLWIDYALFEEILAEDVERTRAVYRECLNLI 396


>ZFIN|ZDB-GENE-040426-694 [details] [associations]
            symbol:crnkl1 "crooked neck pre-mRNA splicing
            factor-like 1 (Drosophila)" species:7955 "Danio rerio" [GO:0005622
            "intracellular" evidence=IEA] [GO:0006396 "RNA processing"
            evidence=IEA] InterPro:IPR003107 InterPro:IPR011990
            InterPro:IPR013026 Pfam:PF02184 PROSITE:PS50293 SMART:SM00386
            ZFIN:ZDB-GENE-040426-694 GO:GO:0005622 GO:GO:0006396
            Gene3D:1.25.40.10 GeneTree:ENSGT00550000074931 EMBL:CABZ01008082
            EMBL:CABZ01008083 EMBL:CABZ01008084 IPI:IPI00932828
            Ensembl:ENSDART00000112689 ArrayExpress:E7FGM7 Bgee:E7FGM7
            Uniprot:E7FGM7
        Length = 754

 Score = 107 (42.7 bits), Expect = 0.00034, Sum P(3) = 0.00034
 Identities = 35/143 (24%), Positives = 64/143 (44%)

Query:    76 WRKYADHKARLCSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFEDPNDV----RRLF 131
             W  Y + + R   +DK   ++E+A+      V  W  Y       FE+ +      R++F
Sbjct:   184 WHSYINFELRYKEVDKARSIYEKALVMVHPEVKNWIKYAH-----FEEKHGYVARGRKVF 238

Query:   132 KRALSFVGKDYLCHTMWDKYIEFEISQQRWSSLAQIFVQTL-RFP---SKKLHHYYDSFK 187
             +RA+ F G++ +   ++  +  FE  Q+ +  +  I+   L R P   +++L   Y  F+
Sbjct:   239 ERAVEFFGEEQVSENLYVAFARFEEKQKEFERVRVIYKYALDRIPKQQAQELFKNYTVFE 298

Query:   188 KLAGAWKEELECESDSAMEFQSE 210
             K  G  +  +E    S   FQ E
Sbjct:   299 KRFGD-RRGIEDVIVSKRRFQYE 320

 Score = 102 (41.0 bits), Expect = 3.0e-06, Sum P(3) = 3.0e-06
 Identities = 25/113 (22%), Positives = 51/113 (45%)

Query:    76 WRKYADHKARLCSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFEDPNDVRRLFKRAL 135
             W KYA  +  L  + +   ++ERA+     ++ +W  Y  + M      N  R ++ RA+
Sbjct:    83 WIKYAQWEESLQEVQRSRSIYERALDVDHRNITLWLKYAEMEMKN-RQVNHARNIWDRAI 141

Query:   136 SFVGKDYLCHTMWDKYIEFEISQQRWSSLAQIFVQTLRF-PSKKLHHYYDSFK 187
             + + +    +  W KY   E      +   Q+F + + + P ++  H Y +F+
Sbjct:   142 TILPR---VNQFWYKYTYMEEMLGNIAGCRQVFERWMEWEPEEQAWHSYINFE 191

 Score = 87 (35.7 bits), Expect = 3.0e-06, Sum P(3) = 3.0e-06
 Identities = 38/181 (20%), Positives = 73/181 (40%)

Query:   292 WHDYLSFAEKQGDFDWVVKLYERCLIPCADYPEFWMRYVDFMESKGGREIASYALDRATQ 351
             WH Y++F  +  + D    +YE+ L+      + W++Y  F E  G         +RA +
Sbjct:   184 WHSYINFELRYKEVDKARSIYEKALVMVHPEVKNWIKYAHFEEKHGYVARGRKVFERAVE 243

Query:   352 IFLKRLPVIHLFNA--RYKEQIGDTSAARAAFPESYIDSDSRFIEKVTFK--ANMERRLG 407
              F +     +L+ A  R++E+  +    R  +  + +D   +   +  FK     E+R G
Sbjct:   244 FFGEEQVSENLYVAFARFEEKQKEFERVRVIYKYA-LDRIPKQQAQELFKNYTVFEKRFG 302

Query:   408 NFVAACDTY--KEALETAAEQRKF-HTLPLLYVQFSRLTYTTTGSADNARDILIDGIKHV 464
             +     D    K   +   E +   H     +  + RL   +   AD  R++    I ++
Sbjct:   303 DRRGIEDVIVSKRRFQYEEEVKANPHNYDAWF-DYLRLV-ESDADADTVREVYERAIANI 360

Query:   465 P 465
             P
Sbjct:   361 P 361

 Score = 61 (26.5 bits), Expect = 0.00034, Sum P(3) = 0.00034
 Identities = 12/41 (29%), Positives = 21/41 (51%)

Query:   292 WHDYLSFAEKQGDFDWVVKLYERCLIPCADYPEFWMRYVDF 332
             W  Y+ F  +Q ++D    LY+R L+    + + W+ Y  F
Sbjct:   518 WKSYIDFEIEQEEYDNTRGLYKR-LLQRTQHVKVWISYAQF 557

 Score = 53 (23.7 bits), Expect = 3.0e-06, Sum P(3) = 3.0e-06
 Identities = 20/74 (27%), Positives = 36/74 (48%)

Query:   641 EADSSSQDRMQQVPPEAAEQHSQDACDPE-VLSLDLAHQV----TNENETVQASEAFSEE 695
             E D + ++  QQVP     Q S +A + E     + A +      +E+E+  +S + SE 
Sbjct:   678 EEDENDKEE-QQVPEGKDAQESPEAEEQEPAAGGEQAEKEYDDRDDEDESSSSSSSDSES 736

Query:   696 DDVQREYEHESKKD 709
             D+ Q++    +K D
Sbjct:   737 DEDQKDKTEHNKPD 750


>FB|FBgn0033859 [details] [associations]
            symbol:CG6197 species:7227 "Drosophila melanogaster"
            [GO:0000381 "regulation of alternative mRNA splicing, via
            spliceosome" evidence=IMP] [GO:0005634 "nucleus" evidence=IC]
            [GO:0006911 "phagocytosis, engulfment" evidence=IMP] [GO:0071011
            "precatalytic spliceosome" evidence=IDA] [GO:0000398 "mRNA
            splicing, via spliceosome" evidence=IC] [GO:0071013 "catalytic step
            2 spliceosome" evidence=IDA] InterPro:IPR003107 InterPro:IPR011990
            SMART:SM00386 EMBL:AE013599 GO:GO:0006911 Gene3D:1.25.40.10
            GO:GO:0071011 GO:GO:0000398 GO:GO:0071013 GO:GO:0000381
            eggNOG:NOG289100 KO:K12867 OMA:PRSYKLW GeneTree:ENSGT00550000075140
            EMBL:BT133178 RefSeq:NP_610891.1 UniGene:Dm.606 SMR:A1Z9G2
            IntAct:A1Z9G2 STRING:A1Z9G2 EnsemblMetazoa:FBtr0087623 GeneID:36514
            KEGG:dme:Dmel_CG6197 UCSC:CG6197-RA FlyBase:FBgn0033859
            InParanoid:A1Z9G2 OrthoDB:EOG42FQZD GenomeRNAi:36514 NextBio:798953
            Uniprot:A1Z9G2
        Length = 883

 Score = 84 (34.6 bits), Expect = 3.3e-06, Sum P(4) = 3.3e-06
 Identities = 45/177 (25%), Positives = 75/177 (42%)

Query:   294 DYLSFAEKQGDFDWVVKLYER--CLIPCADYPEFWMRYVD-FMESKGGREIASYALDRAT 350
             +Y  F E+   F+   + YE+   L    +  + W  Y+  F+E  GG +     L+RA 
Sbjct:   522 NYGMFLEEHNYFEEAYRAYEKGISLFKWPNVYDIWNSYLTKFLERYGGTK-----LERAR 576

Query:   351 QIF---LKRLPVIH-----LFNARYKEQIGDTSAARAAFPE--SYIDSDSRF-IEKVTFK 399
              +F   L + P  H     L  A+ +E+ G    A + +    S +  D  F +  +  K
Sbjct:   577 DLFEQCLDQCPPEHAKYFYLLYAKLEEEHGLARHAMSVYDRATSAVKEDEMFDMYNIFIK 636

Query:   400 ANMERRLGNFVAACDTYKEALETAAEQRKFHTLPLLYVQFSRLTYTTTGSADNARDI 456
                E  +       + Y++A+E+  EQ   H    + V+F+ L  T  G  D AR I
Sbjct:   637 KAAE--IYGLPRTREIYEKAIESLPEQNMRH----MCVKFAELE-TKLGEVDRARAI 686

 Score = 75 (31.5 bits), Expect = 3.3e-06, Sum P(4) = 3.3e-06
 Identities = 22/102 (21%), Positives = 49/102 (48%)

Query:    91 KVVEVFERAVQSATYSVDVW-FHYCSLSMSTFEDPN----DVRRLFKRA--LSFVGKDYL 143
             +++  +  AVQ+      V   H   +  + F + N    D R +F+R   + +V  + L
Sbjct:   372 EIISTYTEAVQTVQPKQAVGKLHTLWVEFAKFYEANGQVEDARVVFERGTEVEYVKVEDL 431

Query:   144 CHTMWDKYIEFEISQQRWSSLAQIFVQTLRFPSKKLHHYYDS 185
                +W ++ E E+ QQ++ +  ++  +    P +K+ +Y D+
Sbjct:   432 A-AVWCEWAEMELRQQQFEAALKLMQRATAMPKRKIAYYDDT 472

 Score = 71 (30.1 bits), Expect = 6.4e-05, Sum P(4) = 6.4e-05
 Identities = 37/147 (25%), Positives = 54/147 (36%)

Query:   289 LKNWHDYLSFAEKQGDFDWVVKLYERCLIPCADYPEFWMRYVDFMESKGGREIASYALDR 348
             LK W  Y    E  G F     +YER +      P+  + Y  F+E     E A  A ++
Sbjct:   483 LKVWSMYADLEESFGTFKTCKAVYERIIDLKICTPQIIINYGMFLEEHNYFEEAYRAYEK 542

Query:   349 ATQIFLKRLPVIHLFNA---RYKEQIGDTSAARAA-FPESYIDSDSRFIEKVTFK--ANM 402
                +F K   V  ++N+   ++ E+ G T   RA    E  +D       K  +   A +
Sbjct:   543 GISLF-KWPNVYDIWNSYLTKFLERYGGTKLERARDLFEQCLDQCPPEHAKYFYLLYAKL 601

Query:   403 ERRLGNFVAACDTYKEALETAAEQRKF 429
             E   G    A   Y  A     E   F
Sbjct:   602 EEEHGLARHAMSVYDRATSAVKEDEMF 628

 Score = 69 (29.3 bits), Expect = 0.00090, Sum P(3) = 0.00090
 Identities = 39/186 (20%), Positives = 69/186 (37%)

Query:     8 LESLSAEPNSPVGFGKQGLEEFIAEGSLDFDEWTSLLSEIENSCPDDIEMIGLVYDSFLA 67
             ++SL+ E N  V       EE I   +     W   L  I++        + +VY+  L 
Sbjct:     6 IKSLNLEINFEVE--DVPYEEEILRNAYSVKHW---LRYIDHKAKAPNNGVNMVYERALK 60

Query:    68 EFPLCYGYWRKYADHKARLCS--------IDKVVEVFERAVQSATYSVDVWFHYCSLSMS 119
             E P  Y  W  Y   + +            ++V   FERA+        +W  Y +   S
Sbjct:    61 ELPGSYKIWHNYLRTRRKQVRGKIPTDPMYEEVNSAFERALVFMHKMPRIWMDYGAFMTS 120

Query:   120 TFEDPNDVRRLFKRALSFVGKDYLCH-TMWDKYIEFEISQQRWSSLAQIFVQTLRFPSKK 178
               +     R +F RAL  +      H  +W  Y++F    +   +  +++ + L+   + 
Sbjct:   121 QCKITR-TRHVFDRALRAL--PITQHGRIWPLYLQFVRRFEMPETALRVYRRYLKLFPED 177

Query:   179 LHHYYD 184
                Y D
Sbjct:   178 TEEYVD 183

 Score = 67 (28.6 bits), Expect = 3.3e-06, Sum P(4) = 3.3e-06
 Identities = 34/128 (26%), Positives = 54/128 (42%)

Query:   606 SGDDATSLPSNQ-KSPLPENHDIRSDGAEVDIL-LSGEADSSSQDRMQQVP-PEAAEQHS 662
             +G DA  L   + +    E+     + A  +I+ + GE    ++D+   V  P+  +   
Sbjct:   759 AGPDAMRLLEEKARQAAAESKQKPIEKAASNIMFVRGETQGGAKDKKDTVVNPDEIDIGD 818

Query:   663 QDACDPEVLSLDLAHQVTNENETVQASEAFSEEDDVQREYEHESKKDLKPLSLEGLSLDP 722
              D  D E    D  +++TNEN+   A     EE  V ++   E K    P  + G SL P
Sbjct:   819 SDEDDEEEDD-DEENEMTNENQASAAVTKTDEEGLVMKKLRFEQKAI--PAKVFG-SLKP 874

Query:   723 GGN-DSPG 729
                 DS G
Sbjct:   875 SNQGDSDG 882

 Score = 61 (26.5 bits), Expect = 3.3e-06, Sum P(4) = 3.3e-06
 Identities = 18/82 (21%), Positives = 36/82 (43%)

Query:    27 EEFIAE-GSLDFDEWTSLLSEIENSCPDDIEMIGL--VYDSFLAEFPLCYGY-WRKYADH 82
             E F+++ G  +   W  L   I  + P  +  + +  +    L  +    G+ W   AD+
Sbjct:   205 EHFVSKHGKSNHQLWNELCDLISKN-PHKVHSLNVDAIIRGGLRRYTDQLGHLWNSLADY 263

Query:    83 KARLCSIDKVVEVFERAVQSAT 104
               R    D+  +++E A+Q+ T
Sbjct:   264 YVRSGLFDRARDIYEEAIQTVT 285


>ZFIN|ZDB-GENE-040426-1997 [details] [associations]
            symbol:cstf3 "cleavage stimulation factor, 3'
            pre-RNA, subunit 3" species:7955 "Danio rerio" [GO:0006396 "RNA
            processing" evidence=IEA] [GO:0005622 "intracellular" evidence=IEA]
            [GO:0006397 "mRNA processing" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] InterPro:IPR003107 InterPro:IPR008847
            InterPro:IPR011990 Pfam:PF05843 SMART:SM00386
            ZFIN:ZDB-GENE-040426-1997 GO:GO:0005634 GO:GO:0006397
            Gene3D:1.25.40.10 CTD:1479 HOGENOM:HOG000231786 HOVERGEN:HBG053813
            KO:K14408 EMBL:BC045871 IPI:IPI00497601 RefSeq:NP_998218.2
            UniGene:Dr.104620 ProteinModelPortal:Q7ZVG5 SMR:Q7ZVG5
            STRING:Q7ZVG5 PRIDE:Q7ZVG5 GeneID:406326 KEGG:dre:406326
            InParanoid:Q7ZVG5 NextBio:20817950 ArrayExpress:Q7ZVG5 Bgee:Q7ZVG5
            Uniprot:Q7ZVG5
        Length = 716

 Score = 136 (52.9 bits), Expect = 5.3e-06, Sum P(3) = 5.3e-06
 Identities = 34/132 (25%), Positives = 65/132 (49%)

Query:    27 EEFIAEGSLDFDEWTSLLSEIENSCPDDIEMIGLVYDSFLAEFPLCYGYWRKYADHKARL 86
             E+ + E   D D W+ L+ E +N  P  I+     Y+  +A+FP    +W+ Y + + + 
Sbjct:    22 EKKLEENPYDLDAWSILIREAQNQ-P--IDKARKTYERLVAQFPSSGRFWKLYIEAEIKA 78

Query:    87 CSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFED-PNDVRRL---FKRALSFVGKDY 142
              + DKV ++F+R +    + +D+W  Y S    T    P+   ++   +  AL  +G + 
Sbjct:    79 KNYDKVEKLFQRCLMKVLH-IDLWKCYLSYVRETKGKLPSYKEKMPQAYDFALDKIGMEI 137

Query:   143 LCHTMWDKYIEF 154
             + + +W  YI F
Sbjct:   138 MSYQIWVDYINF 149

 Score = 56 (24.8 bits), Expect = 5.3e-06, Sum P(3) = 5.3e-06
 Identities = 19/86 (22%), Positives = 34/86 (39%)

Query:   283 PLDDIQLKNWHDYLSFAE----KQGDFDWVVK----LYERCLIPCADYPEFWMRYVDFME 334
             P +  Q++ W  Y+ + +    +  D   + K     YE+CL+    +P+ W     ++E
Sbjct:   243 PQEAQQVEMWKKYIQWEKSNPLRTEDQTLITKRVMFAYEQCLLVLGHHPDIWYEAAQYLE 302

Query:   335 S-------KGGREIASYALDRATQIF 353
                     KG    A    D A  I+
Sbjct:   303 QSSKLLAEKGDMNNAKLFSDEAANIY 328

 Score = 47 (21.6 bits), Expect = 4.1e-05, Sum P(3) = 4.1e-05
 Identities = 8/24 (33%), Positives = 15/24 (62%)

Query:   310 KLYERCLIPCADYPEFWMRYVDFM 333
             K++E  L    D PE+ + Y+D++
Sbjct:   431 KIFELGLKKYGDIPEYILAYIDYL 454

 Score = 45 (20.9 bits), Expect = 5.3e-06, Sum P(3) = 5.3e-06
 Identities = 10/20 (50%), Positives = 12/20 (60%)

Query:   803 SSASHQNFIPEAHSHPQTPA 822
             S A + + IPEA   P TPA
Sbjct:   550 SRAKYASLIPEAVVAPSTPA 569

 Score = 37 (18.1 bits), Expect = 0.00040, Sum P(3) = 0.00040
 Identities = 7/23 (30%), Positives = 12/23 (52%)

Query:   292 WHDYLSFAEKQGDFDWVVKLYER 314
             W  +L+F    GD   ++K+  R
Sbjct:   485 WARFLAFESNIGDLASILKVERR 507


>UNIPROTKB|E2R479 [details] [associations]
            symbol:CSTF3 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0005634 "nucleus" evidence=IEA] [GO:0006397
            "mRNA processing" evidence=IEA] InterPro:IPR003107
            InterPro:IPR008847 Pfam:PF05843 SMART:SM00386 GO:GO:0005634
            GO:GO:0006397 CTD:1479 KO:K14408 OMA:IAFRIFE
            GeneTree:ENSGT00390000006758 EMBL:AAEX03011399 RefSeq:XP_533159.2
            ProteinModelPortal:E2R479 Ensembl:ENSCAFT00000011665 GeneID:475948
            KEGG:cfa:475948 NextBio:20851693 Uniprot:E2R479
        Length = 717

 Score = 136 (52.9 bits), Expect = 6.8e-06, Sum P(2) = 6.8e-06
 Identities = 34/132 (25%), Positives = 65/132 (49%)

Query:    27 EEFIAEGSLDFDEWTSLLSEIENSCPDDIEMIGLVYDSFLAEFPLCYGYWRKYADHKARL 86
             E+ + E   D D W+ L+ E +N  P  I+     Y+  +A+FP    +W+ Y + + + 
Sbjct:    22 EKKLEENPYDLDAWSILIREAQNQ-P--IDKARKTYERLVAQFPSSGRFWKLYIEAEIKA 78

Query:    87 CSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFED-PNDVRRL---FKRALSFVGKDY 142
              + DKV ++F+R +    + +D+W  Y S    T    P+   ++   +  AL  +G + 
Sbjct:    79 KNYDKVEKLFQRCLMKVLH-IDLWKCYLSYVRETKGKLPSYKEKMAQAYDFALDKIGMEI 137

Query:   143 LCHTMWDKYIEF 154
             + + +W  YI F
Sbjct:   138 MSYQIWVDYINF 149

 Score = 56 (24.8 bits), Expect = 6.8e-06, Sum P(2) = 6.8e-06
 Identities = 14/60 (23%), Positives = 25/60 (41%)

Query:   308 VVKLYERCLIPCADYPEFWMRYVDFMES-------KGGREIASYALDRATQIFLKRLPVI 360
             V+  YE+CL+    +P+ W     ++E        KG    A    D A  I+ + +  +
Sbjct:   276 VMFAYEQCLLVLGHHPDIWYEAAQYLEQSSKLLAEKGDMNNAKLFSDEAANIYERAISTL 335

 Score = 50 (22.7 bits), Expect = 5.2e-05, Sum P(3) = 5.2e-05
 Identities = 28/110 (25%), Positives = 44/110 (40%)

Query:   310 KLYERCLIPCADYPEFWMRYVDFMESKGGREIASYALDRATQIFLKRLPVIHLFNARYKE 369
             K++E  L    D PE+ + Y+D++             D  T++  +R   +    +   E
Sbjct:   431 KIFELGLKKYGDIPEYVLAYIDYLSHLNE--------DNNTRVLFER---VLTSGSLPPE 479

Query:   370 QIGDTSAARAAFPESYIDSDSRF-IEKVTFKANMERRLGNFVAAC-DTYK 417
             + G+  A   AF  +  D  S   +EK  F A  E   G   A   D YK
Sbjct:   480 KSGEIWARFLAFESNIGDLASILKVEKRRFTAFKEEYEGKETALLVDRYK 529

 Score = 41 (19.5 bits), Expect = 5.2e-05, Sum P(3) = 5.2e-05
 Identities = 28/128 (21%), Positives = 58/128 (45%)

Query:   147 MWDKYIEFEISQQRWSSLAQIFVQTLRFPSKKLHHYYDSFKKLAGAWKEELECESDSAME 206
             MW KYI++E S    +       QTL   +K++   Y+    + G    ++  E+   +E
Sbjct:   251 MWKKYIQWEKSNPLRTE-----DQTLI--TKRVMFAYEQCLLVLGH-HPDIWYEAAQYLE 302

Query:   207 FQSELVLE-GEVP-AYYKDDETSSVIKDLLDPSVDLVRSKAIQKYRFIGEQIYKEASQLD 264
               S+L+ E G++  A    DE +++ +  +     L++   +  + +     Y+E+    
Sbjct:   303 QSSKLLAEKGDMNNAKLFSDEAANIYERAIST---LLKKNMLLYFAYAD---YEESRMKY 356

Query:   265 EKINCFEN 272
             EK++   N
Sbjct:   357 EKVHSIYN 364


>UNIPROTKB|Q12996 [details] [associations]
            symbol:CSTF3 "Cleavage stimulation factor subunit 3"
            species:9606 "Homo sapiens" [GO:0003723 "RNA binding" evidence=TAS]
            [GO:0006378 "mRNA polyadenylation" evidence=TAS] [GO:0006379 "mRNA
            cleavage" evidence=TAS] [GO:0000398 "mRNA splicing, via
            spliceosome" evidence=TAS] [GO:0005654 "nucleoplasm" evidence=TAS]
            [GO:0006366 "transcription from RNA polymerase II promoter"
            evidence=TAS] [GO:0006369 "termination of RNA polymerase II
            transcription" evidence=TAS] [GO:0006397 "mRNA processing"
            evidence=TAS] [GO:0008380 "RNA splicing" evidence=TAS] [GO:0010467
            "gene expression" evidence=TAS] [GO:0031124 "mRNA 3'-end
            processing" evidence=TAS] [GO:0005634 "nucleus" evidence=IDA]
            [GO:0005730 "nucleolus" evidence=IDA] Reactome:REACT_71
            InterPro:IPR003107 InterPro:IPR008847 Pfam:PF05843 SMART:SM00386
            GO:GO:0005654 EMBL:CH471064 Reactome:REACT_1675 GO:GO:0006378
            GO:GO:0003723 GO:GO:0000398 Reactome:REACT_1788 GO:GO:0006379
            GO:GO:0006369 Reactome:REACT_78 EMBL:U15782 EMBL:AK290836
            EMBL:AC131263 EMBL:AL121926 EMBL:BC009792 EMBL:BC010533
            EMBL:BC059948 EMBL:BC108319 EMBL:BM014288 IPI:IPI00015195
            IPI:IPI00382661 IPI:IPI00651748 PIR:S50852 RefSeq:NP_001028677.1
            RefSeq:NP_001028678.1 RefSeq:NP_001317.1 UniGene:Hs.44402
            ProteinModelPortal:Q12996 SMR:Q12996 DIP:DIP-48674N IntAct:Q12996
            STRING:Q12996 PhosphoSite:Q12996 DMDM:71153231 PaxDb:Q12996
            PeptideAtlas:Q12996 PRIDE:Q12996 DNASU:1479 Ensembl:ENST00000323959
            Ensembl:ENST00000431742 Ensembl:ENST00000438862 GeneID:1479
            KEGG:hsa:1479 UCSC:uc001muh.3 CTD:1479 GeneCards:GC11M033106
            H-InvDB:HIX0021822 HGNC:HGNC:2485 HPA:HPA039743 HPA:HPA040168
            MIM:600367 neXtProt:NX_Q12996 PharmGKB:PA26987 eggNOG:COG5107
            HOGENOM:HOG000231786 HOVERGEN:HBG053813 InParanoid:Q12996 KO:K14408
            OMA:IAFRIFE OrthoDB:EOG47H5PF PhylomeDB:Q12996 GenomeRNAi:1479
            NextBio:6077 PMAP-CutDB:Q12996 ArrayExpress:Q12996 Bgee:Q12996
            CleanEx:HS_CSTF3 Genevestigator:Q12996 GermOnline:ENSG00000176102
            Uniprot:Q12996
        Length = 717

 Score = 136 (52.9 bits), Expect = 6.8e-06, Sum P(2) = 6.8e-06
 Identities = 34/132 (25%), Positives = 65/132 (49%)

Query:    27 EEFIAEGSLDFDEWTSLLSEIENSCPDDIEMIGLVYDSFLAEFPLCYGYWRKYADHKARL 86
             E+ + E   D D W+ L+ E +N  P  I+     Y+  +A+FP    +W+ Y + + + 
Sbjct:    22 EKKLEENPYDLDAWSILIREAQNQ-P--IDKARKTYERLVAQFPSSGRFWKLYIEAEIKA 78

Query:    87 CSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFED-PNDVRRL---FKRALSFVGKDY 142
              + DKV ++F+R +    + +D+W  Y S    T    P+   ++   +  AL  +G + 
Sbjct:    79 KNYDKVEKLFQRCLMKVLH-IDLWKCYLSYVRETKGKLPSYKEKMAQAYDFALDKIGMEI 137

Query:   143 LCHTMWDKYIEF 154
             + + +W  YI F
Sbjct:   138 MSYQIWVDYINF 149

 Score = 56 (24.8 bits), Expect = 6.8e-06, Sum P(2) = 6.8e-06
 Identities = 14/60 (23%), Positives = 25/60 (41%)

Query:   308 VVKLYERCLIPCADYPEFWMRYVDFMES-------KGGREIASYALDRATQIFLKRLPVI 360
             V+  YE+CL+    +P+ W     ++E        KG    A    D A  I+ + +  +
Sbjct:   276 VMFAYEQCLLVLGHHPDIWYEAAQYLEQSSKLLAEKGDMNNAKLFSDEAANIYERAISTL 335

 Score = 50 (22.7 bits), Expect = 5.2e-05, Sum P(3) = 5.2e-05
 Identities = 28/110 (25%), Positives = 44/110 (40%)

Query:   310 KLYERCLIPCADYPEFWMRYVDFMESKGGREIASYALDRATQIFLKRLPVIHLFNARYKE 369
             K++E  L    D PE+ + Y+D++             D  T++  +R   +    +   E
Sbjct:   431 KIFELGLKKYGDIPEYVLAYIDYLSHLNE--------DNNTRVLFER---VLTSGSLPPE 479

Query:   370 QIGDTSAARAAFPESYIDSDSRF-IEKVTFKANMERRLGNFVAAC-DTYK 417
             + G+  A   AF  +  D  S   +EK  F A  E   G   A   D YK
Sbjct:   480 KSGEIWARFLAFESNIGDLASILKVEKRRFTAFKEEYEGKETALLVDRYK 529

 Score = 41 (19.5 bits), Expect = 5.2e-05, Sum P(3) = 5.2e-05
 Identities = 28/128 (21%), Positives = 58/128 (45%)

Query:   147 MWDKYIEFEISQQRWSSLAQIFVQTLRFPSKKLHHYYDSFKKLAGAWKEELECESDSAME 206
             MW KYI++E S    +       QTL   +K++   Y+    + G    ++  E+   +E
Sbjct:   251 MWKKYIQWEKSNPLRTE-----DQTLI--TKRVMFAYEQCLLVLGH-HPDIWYEAAQYLE 302

Query:   207 FQSELVLE-GEVP-AYYKDDETSSVIKDLLDPSVDLVRSKAIQKYRFIGEQIYKEASQLD 264
               S+L+ E G++  A    DE +++ +  +     L++   +  + +     Y+E+    
Sbjct:   303 QSSKLLAEKGDMNNAKLFSDEAANIYERAIST---LLKKNMLLYFAYAD---YEESRMKY 356

Query:   265 EKINCFEN 272
             EK++   N
Sbjct:   357 EKVHSIYN 364


>MGI|MGI:1351825 [details] [associations]
            symbol:Cstf3 "cleavage stimulation factor, 3' pre-RNA,
            subunit 3" species:10090 "Mus musculus" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005622 "intracellular"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0006396 "RNA
            processing" evidence=IEA] [GO:0006397 "mRNA processing"
            evidence=IEA] InterPro:IPR003107 InterPro:IPR008847 Pfam:PF05843
            SMART:SM00386 MGI:MGI:1351825 GO:GO:0005634 GO:GO:0006397 CTD:1479
            eggNOG:COG5107 HOGENOM:HOG000231786 HOVERGEN:HBG053813 KO:K14408
            OMA:IAFRIFE OrthoDB:EOG47H5PF EMBL:BC003241 IPI:IPI00116929
            RefSeq:NP_663504.1 UniGene:Mm.259876 UniGene:Mm.479443 PDB:2OND
            PDB:2OOE PDBsum:2OND PDBsum:2OOE ProteinModelPortal:Q99LI7
            SMR:Q99LI7 STRING:Q99LI7 PhosphoSite:Q99LI7 PaxDb:Q99LI7
            PRIDE:Q99LI7 Ensembl:ENSMUST00000028599 GeneID:228410
            KEGG:mmu:228410 GeneTree:ENSGT00390000006758 InParanoid:Q99LI7
            ChiTaRS:CSTF3 EvolutionaryTrace:Q99LI7 NextBio:378988 Bgee:Q99LI7
            Genevestigator:Q99LI7 GermOnline:ENSMUSG00000027176 Uniprot:Q99LI7
        Length = 717

 Score = 136 (52.9 bits), Expect = 6.8e-06, Sum P(2) = 6.8e-06
 Identities = 34/132 (25%), Positives = 65/132 (49%)

Query:    27 EEFIAEGSLDFDEWTSLLSEIENSCPDDIEMIGLVYDSFLAEFPLCYGYWRKYADHKARL 86
             E+ + E   D D W+ L+ E +N  P  I+     Y+  +A+FP    +W+ Y + + + 
Sbjct:    22 EKKLEENPYDLDAWSILIREAQNQ-P--IDKARKTYERLVAQFPSSGRFWKLYIEAEIKA 78

Query:    87 CSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFED-PNDVRRL---FKRALSFVGKDY 142
              + DKV ++F+R +    + +D+W  Y S    T    P+   ++   +  AL  +G + 
Sbjct:    79 KNYDKVEKLFQRCLMKVLH-IDLWKCYLSYVRETKGKLPSYKEKMAQAYDFALDKIGMEI 137

Query:   143 LCHTMWDKYIEF 154
             + + +W  YI F
Sbjct:   138 MSYQIWVDYINF 149

 Score = 56 (24.8 bits), Expect = 6.8e-06, Sum P(2) = 6.8e-06
 Identities = 14/60 (23%), Positives = 25/60 (41%)

Query:   308 VVKLYERCLIPCADYPEFWMRYVDFMES-------KGGREIASYALDRATQIFLKRLPVI 360
             V+  YE+CL+    +P+ W     ++E        KG    A    D A  I+ + +  +
Sbjct:   276 VMFAYEQCLLVLGHHPDIWYEAAQYLEQSSKLLAEKGDMNNAKLFSDEAANIYERAISTL 335

 Score = 50 (22.7 bits), Expect = 5.2e-05, Sum P(3) = 5.2e-05
 Identities = 28/110 (25%), Positives = 44/110 (40%)

Query:   310 KLYERCLIPCADYPEFWMRYVDFMESKGGREIASYALDRATQIFLKRLPVIHLFNARYKE 369
             K++E  L    D PE+ + Y+D++             D  T++  +R   +    +   E
Sbjct:   431 KIFELGLKKYGDIPEYVLAYIDYLSHLNE--------DNNTRVLFER---VLTSGSLPPE 479

Query:   370 QIGDTSAARAAFPESYIDSDSRF-IEKVTFKANMERRLGNFVAAC-DTYK 417
             + G+  A   AF  +  D  S   +EK  F A  E   G   A   D YK
Sbjct:   480 KSGEIWARFLAFESNIGDLASILKVEKRRFTAFREEYEGKETALLVDRYK 529

 Score = 41 (19.5 bits), Expect = 5.2e-05, Sum P(3) = 5.2e-05
 Identities = 28/128 (21%), Positives = 58/128 (45%)

Query:   147 MWDKYIEFEISQQRWSSLAQIFVQTLRFPSKKLHHYYDSFKKLAGAWKEELECESDSAME 206
             MW KYI++E S    +       QTL   +K++   Y+    + G    ++  E+   +E
Sbjct:   251 MWKKYIQWEKSNPLRTE-----DQTLI--TKRVMFAYEQCLLVLGH-HPDIWYEAAQYLE 302

Query:   207 FQSELVLE-GEVP-AYYKDDETSSVIKDLLDPSVDLVRSKAIQKYRFIGEQIYKEASQLD 264
               S+L+ E G++  A    DE +++ +  +     L++   +  + +     Y+E+    
Sbjct:   303 QSSKLLAEKGDMNNAKLFSDEAANIYERAIST---LLKKNMLLYFAYAD---YEESRMKY 356

Query:   265 EKINCFEN 272
             EK++   N
Sbjct:   357 EKVHSIYN 364


>UNIPROTKB|Q5F4A0 [details] [associations]
            symbol:CSTF3 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006397 "mRNA processing" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] InterPro:IPR003107 InterPro:IPR008847
            Pfam:PF05843 SMART:SM00386 GO:GO:0005634 GO:GO:0006397 CTD:1479
            eggNOG:COG5107 HOGENOM:HOG000231786 HOVERGEN:HBG053813 KO:K14408
            OMA:IAFRIFE OrthoDB:EOG47H5PF GeneTree:ENSGT00390000006758
            EMBL:AADN02065603 EMBL:AJ851400 IPI:IPI00592390
            RefSeq:NP_001012586.1 UniGene:Gga.22714 SMR:Q5F4A0 STRING:Q5F4A0
            Ensembl:ENSGALT00000019104 GeneID:421595 KEGG:gga:421595
            InParanoid:Q5F4A0 NextBio:20824338 Uniprot:Q5F4A0
        Length = 718

 Score = 136 (52.9 bits), Expect = 6.8e-06, Sum P(2) = 6.8e-06
 Identities = 34/132 (25%), Positives = 65/132 (49%)

Query:    27 EEFIAEGSLDFDEWTSLLSEIENSCPDDIEMIGLVYDSFLAEFPLCYGYWRKYADHKARL 86
             E+ + E   D D W+ L+ E +N  P  I+     Y+  +A+FP    +W+ Y + + + 
Sbjct:    23 EKKLEENPYDLDAWSILIREAQNQ-P--IDKARKTYERLVAQFPSSGRFWKLYIEAEIKA 79

Query:    87 CSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFED-PNDVRRL---FKRALSFVGKDY 142
              + DKV ++F+R +    + +D+W  Y S    T    P+   ++   +  AL  +G + 
Sbjct:    80 KNYDKVEKLFQRCLMKVLH-IDLWKCYLSYVRETKGKLPSYKEKMAQAYDFALDKIGMEI 138

Query:   143 LCHTMWDKYIEF 154
             + + +W  YI F
Sbjct:   139 MSYQIWVDYINF 150

 Score = 56 (24.8 bits), Expect = 6.8e-06, Sum P(2) = 6.8e-06
 Identities = 14/60 (23%), Positives = 25/60 (41%)

Query:   308 VVKLYERCLIPCADYPEFWMRYVDFMES-------KGGREIASYALDRATQIFLKRLPVI 360
             V+  YE+CL+    +P+ W     ++E        KG    A    D A  I+ + +  +
Sbjct:   277 VMFAYEQCLLVLGHHPDIWYEAAQYLEQSSKLLAEKGDMNNAKLFSDEAANIYERAISTL 336

 Score = 50 (22.7 bits), Expect = 5.2e-05, Sum P(3) = 5.2e-05
 Identities = 28/110 (25%), Positives = 44/110 (40%)

Query:   310 KLYERCLIPCADYPEFWMRYVDFMESKGGREIASYALDRATQIFLKRLPVIHLFNARYKE 369
             K++E  L    D PE+ + Y+D++             D  T++  +R   +    +   E
Sbjct:   432 KIFELGLKKYGDIPEYVLAYIDYLSHLNE--------DNNTRVLFER---VLTSGSLPPE 480

Query:   370 QIGDTSAARAAFPESYIDSDSRF-IEKVTFKANMERRLGNFVAAC-DTYK 417
             + G+  A   AF  +  D  S   +EK  F A  E   G   A   D YK
Sbjct:   481 KSGEIWARFLAFESNIGDLASILKVEKRRFTAFKEEYEGKETALLVDRYK 530

 Score = 41 (19.5 bits), Expect = 5.2e-05, Sum P(3) = 5.2e-05
 Identities = 28/128 (21%), Positives = 58/128 (45%)

Query:   147 MWDKYIEFEISQQRWSSLAQIFVQTLRFPSKKLHHYYDSFKKLAGAWKEELECESDSAME 206
             MW KYI++E S    +       QTL   +K++   Y+    + G    ++  E+   +E
Sbjct:   252 MWKKYIQWEKSNPLRTE-----DQTLI--TKRVMFAYEQCLLVLGH-HPDIWYEAAQYLE 303

Query:   207 FQSELVLE-GEVP-AYYKDDETSSVIKDLLDPSVDLVRSKAIQKYRFIGEQIYKEASQLD 264
               S+L+ E G++  A    DE +++ +  +     L++   +  + +     Y+E+    
Sbjct:   304 QSSKLLAEKGDMNNAKLFSDEAANIYERAIST---LLKKNMLLYFAYAD---YEESRMKY 357

Query:   265 EKINCFEN 272
             EK++   N
Sbjct:   358 EKVHSIYN 365

 Score = 38 (18.4 bits), Expect = 0.00079, Sum P(3) = 0.00079
 Identities = 7/23 (30%), Positives = 13/23 (56%)

Query:   292 WHDYLSFAEKQGDFDWVVKLYER 314
             W  +L+F    GD   ++K+ +R
Sbjct:   486 WARFLAFESNIGDLASILKVEKR 508


>UNIPROTKB|F1MZT2 [details] [associations]
            symbol:CRNKL1 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0071013 "catalytic step 2 spliceosome" evidence=IEA]
            [GO:0003723 "RNA binding" evidence=IEA] [GO:0000245 "spliceosomal
            complex assembly" evidence=IEA] InterPro:IPR003107
            InterPro:IPR011990 InterPro:IPR013026 Pfam:PF02184 PROSITE:PS50293
            SMART:SM00386 GO:GO:0003723 Gene3D:1.25.40.10 GO:GO:0071013
            GO:GO:0000245 OMA:KFTFAKI GeneTree:ENSGT00550000074931
            EMBL:DAAA02035750 IPI:IPI01017666 Ensembl:ENSBTAT00000011148
            Uniprot:F1MZT2
        Length = 781

 Score = 103 (41.3 bits), Expect = 2.2e-05, Sum P(2) = 2.2e-05
 Identities = 26/113 (23%), Positives = 51/113 (45%)

Query:    76 WRKYADHKARLCSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFEDPNDVRRLFKRAL 135
             W KYA  +  L  I +   ++ERA+     ++ +W  Y  + M      N  R ++ RA+
Sbjct:   168 WIKYAQWEESLKEIQRARSIYERALDVDYRNITLWLKYAEMEMKN-RQVNHARNIWDRAI 226

Query:   136 SFVGKDYLCHTMWDKYIEFEISQQRWSSLAQIFVQTLRF-PSKKLHHYYDSFK 187
             + + +    +  W KY   E      +   Q+F + + + P ++  H Y +F+
Sbjct:   227 TTLPR---VNQFWYKYTYMEEMLGNIAGARQVFERWMEWRPEEQAWHSYINFE 276

 Score = 86 (35.3 bits), Expect = 2.2e-05, Sum P(2) = 2.2e-05
 Identities = 53/219 (24%), Positives = 83/219 (37%)

Query:   278 YFHVKPLDDIQLKNWHDYLSFAEKQGDFDWVVKLYERCLIPCADY---PEFWMRYVDFME 334
             Y H   L    +KNW  Y  F EK G F    K+YER +    D       ++ +  F E
Sbjct:   292 YIHSLVLVHPDVKNWIKYARFEEKHGYFAHARKVYERAVEFFGDEHMDEHLYVAFAKFEE 351

Query:   335 SKGG----REIASYALDRATQIFLKRLPVIHLFNARYKEQIGDTSAARAAFPESYIDSDS 390
             ++      R I  YALDR ++   + L   +     ++++ GD         E  I S  
Sbjct:   352 NQKEFERVRVIYKYALDRISKQEAQELFKNYTI---FEKKFGDRRGI-----EDIIVSKR 403

Query:   391 RFIEKVTFKANMER--------RLGNFVAACDTYKEALETAAE-----QRKFHTLPLLYV 437
             RF  +   KAN           RL    A  +T +E  E A       Q K H    +Y+
Sbjct:   404 RFQYEEEVKANPHNYDAWFDYLRLVESDAEAETVREVYERAIANVPPVQEKRHWKRYIYL 463

Query:   438 QFSRLTYTTTGSAD--NARDILIDGIKHVPNCKLLLEEL 474
               +   Y    + D    R +    ++ +P+ K    ++
Sbjct:   464 WINYALYEELEAKDPERTRQVYQASLELIPHKKFTFAKM 502

 Score = 77 (32.2 bits), Expect = 0.00018, Sum P(2) = 0.00018
 Identities = 29/124 (23%), Positives = 54/124 (43%)

Query:   292 WHDYLSFAEKQGDFDWVVKLYERCLIPCA-DYPEF--WMRYVDFMESKGGREIASYALDR 348
             WH Y++F  +  + D    +YER +      +P+   W++Y  F E  G    A    +R
Sbjct:   269 WHSYINFELRYKEVDRARTIYERYIHSLVLVHPDVKNWIKYARFEEKHGYFAHARKVYER 328

Query:   349 ATQIFLKRLPVIHLFNA--RYKEQIGDTSAARAAFPESYIDSDSRFIEKVTFK--ANMER 404
             A + F       HL+ A  +++E   +    R  +  + +D  S+   +  FK     E+
Sbjct:   329 AVEFFGDEHMDEHLYVAFAKFEENQKEFERVRVIYKYA-LDRISKQEAQELFKNYTIFEK 387

Query:   405 RLGN 408
             + G+
Sbjct:   388 KFGD 391


>UNIPROTKB|J9P5Z1 [details] [associations]
            symbol:CRNKL1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0006396 "RNA processing" evidence=IEA]
            [GO:0005622 "intracellular" evidence=IEA] InterPro:IPR003107
            InterPro:IPR011990 InterPro:IPR013026 Pfam:PF02184 PROSITE:PS50293
            SMART:SM00386 GO:GO:0005622 GO:GO:0006396 Gene3D:1.25.40.10
            GeneTree:ENSGT00550000074931 EMBL:AAEX03013754
            Ensembl:ENSCAFT00000047479 Uniprot:J9P5Z1
        Length = 728

 Score = 103 (41.3 bits), Expect = 2.4e-05, Sum P(2) = 2.4e-05
 Identities = 26/113 (23%), Positives = 51/113 (45%)

Query:    76 WRKYADHKARLCSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFEDPNDVRRLFKRAL 135
             W KYA  +  L  I +   ++ERA+     ++ +W  Y  + M      N  R ++ RA+
Sbjct:   125 WIKYAQWEESLKEIQRARSIYERALDVDYRNITLWLKYAEMEMKN-RQVNHARNIWDRAI 183

Query:   136 SFVGKDYLCHTMWDKYIEFEISQQRWSSLAQIFVQTLRF-PSKKLHHYYDSFK 187
             + + +    +  W KY   E      +   Q+F + + + P ++  H Y +F+
Sbjct:   184 TTLPR---VNQFWYKYTYMEEMLGNIAGARQVFERWMEWQPEEQAWHSYINFE 233

 Score = 85 (35.0 bits), Expect = 2.4e-05, Sum P(2) = 2.4e-05
 Identities = 29/121 (23%), Positives = 53/121 (43%)

Query:   292 WHDYLSFAEKQGDFDWVVKLYERCLIPCADYPEFWMRYVDFMESKGGREIASYALDRATQ 351
             WH Y++F  +  + D    +YER ++   D    W++Y  F E  G    A    +RA +
Sbjct:   226 WHSYINFELRYKEVDRARTIYERFVLVHPDVKN-WIKYARFEEKHGYFAHARKVYERAVE 284

Query:   352 IFLKRLPVIHLFNA--RYKEQIGDTSAARAAFPESYIDSDSRFIEKVTFK--ANMERRLG 407
              F       HL+ A  +++E   +    R  +  + +D  S+   +  FK     E++ G
Sbjct:   285 FFGDEHMDEHLYVAFAKFEENQKEFERVRVIYKYA-LDRISKQEAQELFKNYTIFEKKFG 343

Query:   408 N 408
             +
Sbjct:   344 D 344

 Score = 79 (32.9 bits), Expect = 9.7e-05, Sum P(2) = 9.7e-05
 Identities = 50/208 (24%), Positives = 80/208 (38%)

Query:   289 LKNWHDYLSFAEKQGDFDWVVKLYERCLIPCADY---PEFWMRYVDFMESKGG----REI 341
             +KNW  Y  F EK G F    K+YER +    D       ++ +  F E++      R I
Sbjct:   256 VKNWIKYARFEEKHGYFAHARKVYERAVEFFGDEHMDEHLYVAFAKFEENQKEFERVRVI 315

Query:   342 ASYALDRATQIFLKRLPVIHLFNARYKEQIGDTSAARAAFPESYIDSDSRFIEKVTFKAN 401
               YALDR ++   + L   +     ++++ GD         E  I S  RF  +   KAN
Sbjct:   316 YKYALDRISKQEAQELFKNYTI---FEKKFGDRRGI-----EDIIVSKRRFQYEEEVKAN 367

Query:   402 MER--------RLGNFVAACDTYKEALETAAE-----QRKFHTLPLLYVQFSRLTYTTTG 448
                        RL    A  +T +E  E A       Q K H    +Y+  +   Y    
Sbjct:   368 PHNYDAWFDYLRLVESDAEAETVREVYERAIANVPPIQEKRHWKRYIYLWVNYALYEELE 427

Query:   449 SAD--NARDILIDGIKHVPNCKLLLEEL 474
             + D    R +    ++ +P+ K    ++
Sbjct:   428 AKDPERTRQVYQASLELIPHKKFTFAKM 455


>UNIPROTKB|E1BGY7 [details] [associations]
            symbol:CSTF3 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0005634 "nucleus" evidence=IEA] [GO:0006397 "mRNA
            processing" evidence=IEA] InterPro:IPR003107 InterPro:IPR008847
            InterPro:IPR011990 Pfam:PF05843 SMART:SM00386 GO:GO:0005634
            GO:GO:0006397 Gene3D:1.25.40.10 OMA:IAFRIFE
            GeneTree:ENSGT00390000006758 EMBL:DAAA02041256 EMBL:DAAA02041257
            IPI:IPI00709818 Ensembl:ENSBTAT00000011369 Uniprot:E1BGY7
        Length = 718

 Score = 136 (52.9 bits), Expect = 2.6e-05, Sum P(3) = 2.6e-05
 Identities = 34/132 (25%), Positives = 65/132 (49%)

Query:    27 EEFIAEGSLDFDEWTSLLSEIENSCPDDIEMIGLVYDSFLAEFPLCYGYWRKYADHKARL 86
             E+ + E   D D W+ L+ E +N  P  I+     Y+  +A+FP    +W+ Y + + + 
Sbjct:    22 EKKLEENPYDLDAWSILIREAQNQ-P--IDKARKTYERLVAQFPSSGRFWKLYIEAEIKA 78

Query:    87 CSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFED-PNDVRRL---FKRALSFVGKDY 142
              + DKV ++F+R +    + +D+W  Y S    T    P+   ++   +  AL  +G + 
Sbjct:    79 KNYDKVEKLFQRCLMKVLH-IDLWKCYLSYVRETKGKLPSYKEKMAQAYDFALDKIGMEI 137

Query:   143 LCHTMWDKYIEF 154
             + + +W  YI F
Sbjct:   138 MSYQIWVDYINF 149

 Score = 54 (24.1 bits), Expect = 2.6e-05, Sum P(3) = 2.6e-05
 Identities = 8/27 (29%), Positives = 15/27 (55%)

Query:   308 VVKLYERCLIPCADYPEFWMRYVDFME 334
             V+  YE+CL+    +P+ W     ++E
Sbjct:   276 VMFAYEQCLLVLGHHPDIWYEAAQYLE 302

 Score = 50 (22.7 bits), Expect = 6.5e-05, Sum P(3) = 6.5e-05
 Identities = 28/110 (25%), Positives = 44/110 (40%)

Query:   310 KLYERCLIPCADYPEFWMRYVDFMESKGGREIASYALDRATQIFLKRLPVIHLFNARYKE 369
             K++E  L    D PE+ + Y+D++             D  T++  +R   +    +   E
Sbjct:   432 KIFELGLKKYGDIPEYVLAYIDYLSHLNE--------DNNTRVLFER---VLTSGSLPPE 480

Query:   370 QIGDTSAARAAFPESYIDSDSRF-IEKVTFKANMERRLGNFVAAC-DTYK 417
             + G+  A   AF  +  D  S   +EK  F A  E   G   A   D YK
Sbjct:   481 KSGEIWARFLAFESNIGDLASILKVEKRRFTAFKEEYEGKETALLVDRYK 530

 Score = 40 (19.1 bits), Expect = 2.6e-05, Sum P(3) = 2.6e-05
 Identities = 7/11 (63%), Positives = 9/11 (81%)

Query:   147 MWDKYIEFEIS 157
             MW KYI++E S
Sbjct:   251 MWKKYIQWEKS 261


>TAIR|locus:2007973 [details] [associations]
            symbol:CSTF77 species:3702 "Arabidopsis thaliana"
            [GO:0005622 "intracellular" evidence=IEA] [GO:0005634 "nucleus"
            evidence=ISM;IEA] [GO:0006397 "mRNA processing" evidence=ISS]
            [GO:0003729 "mRNA binding" evidence=IDA] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0031123 "RNA 3'-end processing"
            evidence=IMP] [GO:0045892 "negative regulation of transcription,
            DNA-dependent" evidence=IMP] [GO:0000278 "mitotic cell cycle"
            evidence=RCA] [GO:0006396 "RNA processing" evidence=RCA]
            [GO:0009630 "gravitropism" evidence=RCA] InterPro:IPR003107
            InterPro:IPR008847 InterPro:IPR011990 InterPro:IPR013026
            InterPro:IPR019734 Pfam:PF05843 PROSITE:PS50005 PROSITE:PS50293
            SMART:SM00386 EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0005634
            GO:GO:0045892 GO:GO:0003729 GO:GO:0006397 Gene3D:1.25.40.10
            eggNOG:COG5107 KO:K14408 GO:GO:0031123 UniGene:At.27878
            UniGene:At.28561 EMBL:BT002320 IPI:IPI00548656 RefSeq:NP_173218.2
            SMR:Q8GUP1 IntAct:Q8GUP1 STRING:Q8GUP1 EnsemblPlants:AT1G17760.1
            GeneID:838354 KEGG:ath:AT1G17760 TAIR:At1g17760
            HOGENOM:HOG000030800 InParanoid:Q8GUP1 OMA:FEQTYGD
            ProtClustDB:CLSN2690404 Genevestigator:Q8GUP1 Uniprot:Q8GUP1
        Length = 734

 Score = 136 (52.9 bits), Expect = 3.5e-05, P = 3.5e-05
 Identities = 89/505 (17%), Positives = 195/505 (38%)

Query:    61 VYDSFLAEFPLCYGYWRKYADHKARLCSIDKVVEVFERAVQSATYSVDVWFHYCSLSMST 120
             +Y+  L+ +P    +W++Y + +  + + D   ++F R + +    V +W  Y       
Sbjct:    28 IYEQLLSLYPTSARFWKQYVEAQMAVNNDDATKQIFSRCLLTCL-QVPLWQCYIRFIRKV 86

Query:   121 F-----EDPNDVRRLFKRALSFVGKDYLCHTMWDKYIEF-------EISQQ--RWSSLAQ 166
             +     E   +  + F+  L+++G D     +W +YI F        +++   R ++L +
Sbjct:    87 YDKKGAEGQEETTKAFEFMLNYIGTDIASGPIWTEYIAFLKSLPALNLNEDLHRKTALRK 146

Query:   167 IFVQTLRFPSKKLHHYYDSFKKLAGAWKEELE--CESDSAMEFQSELVLEGEVPAYYKDD 224
             ++ + +  P+  +   +  ++        +L     ++   +F S   +  E   Y ++ 
Sbjct:   147 VYHRAILTPTHHVEQLWKDYENFENTVNRQLAKGLVNEYQPKFNSARAVYRERKKYIEEI 206

Query:   225 ETSSVIKDLLDPSVDLVRSKAIQKYRFIGE---QIYKEASQLDEKINCFENLIRRPYFHV 281
             + + +       S +  +  A +K+    +   Q    AS     I  +E  +   Y + 
Sbjct:   207 DWNMLAVPPTGTSKEETQWVAWKKFLSFEKGNPQRIDTASSTKRIIYAYEQCLMCLYHY- 265

Query:   282 KPLDDIQLKNWHDYLSFAEKQGDFDWVVKLYERCLIPCADYPEFWMRYVDFMESKGGREI 341
              P  D+    W+DY  +  K G  D  +K+++R L    D       + +  ES+G  + 
Sbjct:   266 -P--DV----WYDYAEWHVKSGSTDAAIKVFQRALKAIPDSEMLKYAFAEMEESRGAIQS 318

Query:   342 ASYALDRATQIFLKRLPVIHLFNARYKEQIGDTSAARAAFPESYIDSDSRFIEKVTFKAN 401
             A    +         L   H+   R+  +     AAR  F ++       +   + F A 
Sbjct:   319 AKKLYENILGASTNSLA--HIQYLRFLRRAEGVEAARKYFLDARKSPSCTYHVYIAF-AT 375

Query:   402 MERRLGNFV-AACDTYKEALETAAEQRKFHTLPLLYVQFSRLTYTTTGSADNARDILIDG 460
             M   +      A + ++E L+    +  +    L Y  F     T      N R +    
Sbjct:   376 MAFCIDKEPKVAHNIFEEGLKLYMSEPVYI---LKYADF----LTRLNDDRNIRALFERA 428

Query:   461 IKHVP--NCKLLLEELIKFTMVHGGRSHISIVDAVISNALYSRPDVLKVFSLEDVEDISS 518
             +  +P  +   + +  I+F   +G  + I  V+  +  AL  + +         ++D+ S
Sbjct:   429 LSTLPVEDSAEVWKRFIQFEQTYGDLASILKVEQRMKEALSGKGEEGSSPPESSLQDVVS 488

Query:   519 LYLQFLDL--CGTIHDIRNAWNQHI 541
              Y  ++DL  C T +D+ +   Q +
Sbjct:   489 RY-SYMDLWPC-TSNDLDHLARQEL 511


>WB|WBGene00006307 [details] [associations]
            symbol:suf-1 species:6239 "Caenorhabditis elegans"
            [GO:0005622 "intracellular" evidence=IEA] [GO:0006396 "RNA
            processing" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            [GO:0006397 "mRNA processing" evidence=IEA] [GO:0009792 "embryo
            development ending in birth or egg hatching" evidence=IMP]
            [GO:0000003 "reproduction" evidence=IMP] InterPro:IPR003107
            InterPro:IPR008847 Pfam:PF05843 SMART:SM00386 GO:GO:0005634
            GO:GO:0009792 GO:GO:0006397 GO:GO:0000003 KO:K14408 OMA:IAFRIFE
            GeneTree:ENSGT00390000006758 EMBL:Z68315 PIR:T21484
            RefSeq:NP_495825.1 ProteinModelPortal:Q19866 SMR:Q19866
            STRING:Q19866 EnsemblMetazoa:F28C6.6.1 EnsemblMetazoa:F28C6.6.2
            GeneID:174380 KEGG:cel:CELE_F28C6.6 UCSC:F28C6.6.1 CTD:174380
            WormBase:F28C6.6 InParanoid:Q19866 NextBio:883790
            ArrayExpress:Q19866 Uniprot:Q19866
        Length = 735

 Score = 139 (54.0 bits), Expect = 9.4e-05, Sum P(2) = 9.4e-05
 Identities = 46/180 (25%), Positives = 78/180 (43%)

Query:    20 GFGKQGLEEFIAEGSLDFDEWTSLLSEIENSCPDDIEMIGLVYDSFLAEFPLCYGYWRKY 79
             G   +  E  I     D D W  LL E   S P D E     Y+S + +FP    YW+ Y
Sbjct:     3 GLSMRNPERRIETNPFDVDAWNLLLRE-HQSRPIDQERD--FYESLVKQFPNSGRYWKAY 59

Query:    80 ADHKARLCSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFEDPNDVRRLFKRALSF-- 137
              +H+ R  + + V ++F R + S   ++D+W  Y      T    +  R    +A  F  
Sbjct:    60 IEHELRSKNFENVEKLFSRCLVSVL-NIDLWKCYIHYVFETKGQRDQYREEMAKAYDFAL 118

Query:   138 --VGKDYLCHTMWDKYIEF-----EISQ----QRWSSLAQIFVQTLRFPSKKLHHYYDSF 186
               VG D   ++++ +YI F      + Q    QR +++ +I+ + L  P   L   ++ +
Sbjct:   119 EKVGMDVQAYSIFTEYIAFLKKVPAVGQYAENQRITAVRKIYQKALATPMHNLELIWNDY 178

 Score = 42 (19.8 bits), Expect = 9.4e-05, Sum P(2) = 9.4e-05
 Identities = 8/32 (25%), Positives = 18/32 (56%)

Query:   304 DFDWVVKLYERCLIPCADYPEFWMRYVDFMES 335
             D +  +++++  L    + PEF + Y DF+ +
Sbjct:   413 DKEVAIRVFKLGLKKYENEPEFGLAYADFLSN 444

 Score = 41 (19.5 bits), Expect = 0.00012, Sum P(2) = 0.00012
 Identities = 8/22 (36%), Positives = 12/22 (54%)

Query:   295 YLSFAEKQGDFDWVVKLYERCL 316
             Y  F E+   F+ V  +Y+R L
Sbjct:   334 YADFQEEHKQFEAVKNIYDRLL 355

 Score = 40 (19.1 bits), Expect = 0.00015, Sum P(2) = 0.00015
 Identities = 8/29 (27%), Positives = 15/29 (51%)

Query:   286 DIQLKNWHDYLSFAEKQGDFDWVVKLYER 314
             D  ++ W  +L F    GD   ++K+ +R
Sbjct:   467 DKSIRIWDRFLDFESCVGDLASILKVEKR 495

 Score = 37 (18.1 bits), Expect = 0.00030, Sum P(2) = 0.00030
 Identities = 9/18 (50%), Positives = 11/18 (61%)

Query:   830 EQQNPDRVHRDLRFGYRG 847
             E QN  RV +DL+   RG
Sbjct:   200 EYQNARRVEKDLQQMTRG 217


>TAIR|locus:2161363 [details] [associations]
            symbol:AT5G45990 species:3702 "Arabidopsis thaliana"
            [GO:0005622 "intracellular" evidence=IEA;ISS] [GO:0005634 "nucleus"
            evidence=ISM;IEA] [GO:0006396 "RNA processing" evidence=IEA;ISS]
            [GO:0006397 "mRNA processing" evidence=IEA] InterPro:IPR003107
            InterPro:IPR008847 InterPro:IPR011990 InterPro:IPR013026
            Pfam:PF02184 Pfam:PF05843 PROSITE:PS50293 SMART:SM00386
            GO:GO:0005634 EMBL:CP002688 GenomeReviews:BA000015_GR GO:GO:0006397
            Gene3D:1.25.40.10 EMBL:AB006698 eggNOG:NOG327505
            HOGENOM:HOG000207972 IPI:IPI00521045 RefSeq:NP_199411.1
            UniGene:At.55396 ProteinModelPortal:Q9FNM3 SMR:Q9FNM3 PaxDb:Q9FNM3
            PRIDE:Q9FNM3 EnsemblPlants:AT5G45990.1 GeneID:834639
            KEGG:ath:AT5G45990 TAIR:At5g45990 InParanoid:Q9FNM3 OMA:SAFIRYA
            PhylomeDB:Q9FNM3 ProtClustDB:CLSN2684756 Genevestigator:Q9FNM3
            Uniprot:Q9FNM3
        Length = 673

 Score = 91 (37.1 bits), Expect = 0.00011, Sum P(2) = 0.00011
 Identities = 22/64 (34%), Positives = 34/64 (53%)

Query:    96 FERAVQSATYSVDVWFHYCSLSMSTFEDPNDVRRLFKRALSFVGKDYLCHTMWDKYIEFE 155
             FE  ++ A +++ VW  Y     S   D    R +++RAL   G +Y  HT+W KY EFE
Sbjct:    67 FEDQIRRARWNIQVWVKYAKWEESQM-DYARARSVWERALE--G-EYRNHTLWVKYAEFE 122

Query:   156 ISQQ 159
             +  +
Sbjct:   123 MKNK 126

 Score = 90 (36.7 bits), Expect = 0.00011, Sum P(2) = 0.00011
 Identities = 62/291 (21%), Positives = 117/291 (40%)

Query:   295 YLSFAEKQG-DFDWVVKLYERCLIPCADYPEFWMRYVDFMESKG-------GREIASYAL 346
             Y  F  K+G       ++YER +   A+  E  + +V F E +         R I  +AL
Sbjct:   218 YAKFEMKRGGQVKLAREVYERAVDKLANDEEAEILFVSFAEFEERCKEVERARFIYKFAL 277

Query:   347 D-----RATQIFLKRLPVIHLFNARY--KEQIGDTSAARAAFP-ESYIDSDSRFIEKVTF 398
             D     RA +++ K +     F  +Y  KE I D    +  F  E  +  +    +    
Sbjct:   278 DHIRKGRAEELYKKFVA----FEKQYGDKEGIEDAIVGKKRFEYEDEVSKNPLNYDSWFD 333

Query:   399 KANMERRLGNFVAACDTYKEALET---AAEQRKFHTLPLLYVQFSRLTYTTTGSADNARD 455
                +E  +GN     + Y+ A+     A E+R +     L++ ++      T   +  RD
Sbjct:   334 YVRLEESVGNKDRIREIYERAIANVPPAQEKRFWQRYIYLWINYALYEEIETKDVERTRD 393

Query:   456 ILIDGIKHVPNCKLLLEELIKFTMVHGGRSHISIVDA--VISNALYSRPDVLKVFSLEDV 513
             +  + +K +P+ K    ++      +  R  +++  A  ++ NA+   P V K+F  + +
Sbjct:   394 VYRECLKLIPHTKFSFAKIWLLAAEYEIRQ-LNLTGARQILGNAIGKAPKV-KIFK-KYI 450

Query:   514 EDISSLYLQFLDLCGTIHDIRNAWNQHIKLFPHTVRTAYECPGRETKSLRA 564
             E    L L  +D C  +++    W+     +       +E    ET+  RA
Sbjct:   451 E--MELKLVNIDRCRKLYERFLEWSPE-NCYAWRNYAEFEISLAETERARA 498

 Score = 84 (34.6 bits), Expect = 0.00058, Sum P(2) = 0.00058
 Identities = 24/104 (23%), Positives = 47/104 (45%)

Query:    76 WRKYADHKARLCSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFEDPNDVRRLFKRAL 135
             W KYA  +       +   V+ERA++    +  +W  Y    M   +  N+ R ++ R++
Sbjct:    81 WVKYAKWEESQMDYARARSVWERALEGEYRNHTLWVKYAEFEMKN-KFVNNARNVWDRSV 139

Query:   136 SFVGKDYLCHTMWDKYIEFEISQQRWSSLAQIFVQTLRF-PSKK 178
             + + +      +W+KYI  E      +   QIF + + + P +K
Sbjct:   140 TLLPR---VDQLWEKYIYMEEKLGNVTGARQIFERWMNWSPDQK 180


>UNIPROTKB|F1PYE9 [details] [associations]
            symbol:CRNKL1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0006396 "RNA processing" evidence=IEA]
            [GO:0005622 "intracellular" evidence=IEA] InterPro:IPR003107
            InterPro:IPR011990 InterPro:IPR013026 Pfam:PF02184 PROSITE:PS50293
            SMART:SM00386 GO:GO:0005622 GO:GO:0006396 Gene3D:1.25.40.10
            OMA:KFTFAKI GeneTree:ENSGT00550000074931 EMBL:AAEX03013754
            Ensembl:ENSCAFT00000008599 Uniprot:F1PYE9
        Length = 797

 Score = 103 (41.3 bits), Expect = 0.00012, Sum P(2) = 0.00012
 Identities = 26/113 (23%), Positives = 51/113 (45%)

Query:    76 WRKYADHKARLCSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFEDPNDVRRLFKRAL 135
             W KYA  +  L  I +   ++ERA+     ++ +W  Y  + M      N  R ++ RA+
Sbjct:   189 WIKYAQWEESLKEIQRARSIYERALDVDYRNITLWLKYAEMEMKN-RQVNHARNIWDRAI 247

Query:   136 SFVGKDYLCHTMWDKYIEFEISQQRWSSLAQIFVQTLRF-PSKKLHHYYDSFK 187
             + + +    +  W KY   E      +   Q+F + + + P ++  H Y +F+
Sbjct:   248 TTLPR---VNQFWYKYTYMEEMLGNIAGARQVFERWMEWQPEEQAWHSYINFE 297

 Score = 79 (32.9 bits), Expect = 0.00012, Sum P(2) = 0.00012
 Identities = 50/208 (24%), Positives = 80/208 (38%)

Query:   289 LKNWHDYLSFAEKQGDFDWVVKLYERCLIPCADY---PEFWMRYVDFMESKGG----REI 341
             +KNW  Y  F EK G F    K+YER +    D       ++ +  F E++      R I
Sbjct:   325 VKNWIKYARFEEKHGYFAHARKVYERAVEFFGDEHMDEHLYVAFAKFEENQKEFERVRVI 384

Query:   342 ASYALDRATQIFLKRLPVIHLFNARYKEQIGDTSAARAAFPESYIDSDSRFIEKVTFKAN 401
               YALDR ++   + L   +     ++++ GD         E  I S  RF  +   KAN
Sbjct:   385 YKYALDRISKQEAQELFKNYTI---FEKKFGDRRGI-----EDIIVSKRRFQYEEEVKAN 436

Query:   402 MER--------RLGNFVAACDTYKEALETAAE-----QRKFHTLPLLYVQFSRLTYTTTG 448
                        RL    A  +T +E  E A       Q K H    +Y+  +   Y    
Sbjct:   437 PHNYDAWFDYLRLVESDAEAETVREVYERAIANVPPIQEKRHWKRYIYLWVNYALYEELE 496

Query:   449 SAD--NARDILIDGIKHVPNCKLLLEEL 474
             + D    R +    ++ +P+ K    ++
Sbjct:   497 AKDPERTRQVYQASLELIPHKKFTFAKM 524


>DICTYBASE|DDB_G0295719 [details] [associations]
            symbol:DDB_G0295719 "unknown" species:44689
            "Dictyostelium discoideum" [GO:0008150 "biological_process"
            evidence=ND] [GO:0003674 "molecular_function" evidence=ND]
            dictyBase:DDB_G0295719 EMBL:AAFI02000070 KO:K10867
            RefSeq:XP_002649141.1 EnsemblProtists:DDB0252814 GeneID:8624735
            KEGG:ddi:DDB_G0295719 OMA:ILITSEF Uniprot:C7G031
        Length = 616

 Score = 130 (50.8 bits), Expect = 0.00012, P = 0.00012
 Identities = 54/232 (23%), Positives = 91/232 (39%)

Query:   579 QPFESEHLMPSASQDKKFSPPEKSDSESGDDATSLPSNQKSPLPE--NHDIRSDGAEVDI 636
             QP ES    P+ S      PP  ++S +  +AT +P+       E  N + +   ++   
Sbjct:   261 QPAESNEA-PAQSSSTVEQPP--AESSAAPEATEVPAESAEQPTESSNAEQQQTDSQQPT 317

Query:   637 LLSGEADSSSQDRMQQVPPEAAEQHSQDACDPEVLSLDLAHQVTNENETVQASEAFSEED 696
               SGE      D  QQ  P  ++Q + D+   +        Q + E +  ++S A    D
Sbjct:   318 QSSGEEQQQPTDSQQQ--PTDSQQQTTDSQQQQTSESSNPTQSSGEQQPTESSNAEQPTD 375

Query:   697 DVQREYEHESKKDLKPLSLEGLSLDPGGNDSPGSLCATS-HECEAPQKTNFSHESMLKSE 755
               Q   E  S  +      +     P  +    S   T  ++  + ++     +S    E
Sbjct:   376 SQQPPAESSSAPEATESPEQSGEEQPTESQQETSASPTEENQTSSTEEQQQPADSTASVE 435

Query:   756 APRETSLSDG-SVLGASQNNNGSHFAPSSMGTQASSSAPIQTRTVSPSSSAS 806
              P ETS S   S   A  N   +  + SS  T++SS+ P ++ T    SSA+
Sbjct:   436 QPAETSSSQQTSEAPAQSNEQPTESSASSNPTESSSATPTESSTAPTESSAT 487


>UNIPROTKB|Q5JY65 [details] [associations]
            symbol:CRNKL1 "Crooked neck-like protein 1" species:9606
            "Homo sapiens" [GO:0005622 "intracellular" evidence=IEA]
            [GO:0006396 "RNA processing" evidence=IEA] InterPro:IPR003107
            InterPro:IPR011990 InterPro:IPR013026 Pfam:PF02184 PROSITE:PS50293
            SMART:SM00386 GO:GO:0005622 GO:GO:0006396 Gene3D:1.25.40.10
            HOGENOM:HOG000207972 EMBL:AL035454 IPI:IPI00219317
            UniGene:Hs.171342 HGNC:HGNC:15762 HOVERGEN:HBG051046
            OrthoDB:EOG4SJ5DC SMR:Q5JY65 IntAct:Q5JY65 Ensembl:ENST00000377327
            Uniprot:Q5JY65
        Length = 836

 Score = 103 (41.3 bits), Expect = 0.00013, Sum P(2) = 0.00013
 Identities = 26/113 (23%), Positives = 51/113 (45%)

Query:    76 WRKYADHKARLCSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFEDPNDVRRLFKRAL 135
             W KYA  +  L  I +   ++ERA+     ++ +W  Y  + M      N  R ++ RA+
Sbjct:   233 WIKYAQWEESLKEIQRARSIYERALDVDYRNITLWLKYAEMEMKN-RQVNHARNIWDRAI 291

Query:   136 SFVGKDYLCHTMWDKYIEFEISQQRWSSLAQIFVQTLRF-PSKKLHHYYDSFK 187
             + + +    +  W KY   E      +   Q+F + + + P ++  H Y +F+
Sbjct:   292 TTLPR---VNQFWYKYTYMEEMLGNVAGARQVFERWMEWQPEEQAWHSYINFE 341

 Score = 79 (32.9 bits), Expect = 0.00013, Sum P(2) = 0.00013
 Identities = 28/121 (23%), Positives = 52/121 (42%)

Query:   292 WHDYLSFAEKQGDFDWVVKLYERCLIPCADYPEFWMRYVDFMESKGGREIASYALDRATQ 351
             WH Y++F  +  + D    +YER ++   D    W++Y  F E       A    +RA +
Sbjct:   334 WHSYINFELRYKEVDRARTIYERFVLVHPDVKN-WIKYARFEEKHAYFAHARKVYERAVE 392

Query:   352 IFLKRLPVIHLFNA--RYKEQIGDTSAARAAFPESYIDSDSRFIEKVTFK--ANMERRLG 407
              F       HL+ A  +++E   +    R  +  + +D  S+   +  FK     E++ G
Sbjct:   393 FFGDEHMDEHLYVAFAKFEENQKEFERVRVIYKYA-LDRISKQDAQELFKNYTIFEKKFG 451

Query:   408 N 408
             +
Sbjct:   452 D 452

 Score = 76 (31.8 bits), Expect = 0.00027, Sum P(2) = 0.00027
 Identities = 58/267 (21%), Positives = 103/267 (38%)

Query:   289 LKNWHDYLSFAEKQGDFDWVVKLYERCLIPCADY---PEFWMRYVDFMESKGG----REI 341
             +KNW  Y  F EK   F    K+YER +    D       ++ +  F E++      R I
Sbjct:   364 VKNWIKYARFEEKHAYFAHARKVYERAVEFFGDEHMDEHLYVAFAKFEENQKEFERVRVI 423

Query:   342 ASYALDRATQIFLKRLPVIHLFNARYKEQIGDTSAARAAFPESYIDSDSRFIEKVTFKAN 401
               YALDR ++   + L   +     ++++ GD         E  I S  RF  +   KAN
Sbjct:   424 YKYALDRISKQDAQELFKNYTI---FEKKFGDRRGI-----EDIIVSKRRFQYEEEVKAN 475

Query:   402 MER--------RLGNFVAACDTYKEALETAAE-----QRKFHTLPLLYVQFSRLTYTTTG 448
                        RL    A  +  +E  E A       Q K H    +Y+  +   Y    
Sbjct:   476 PHNYDAWFDYLRLVESDAEAEAVREVYERAIANVPPIQEKRHWKRYIYLWINYALYEELE 535

Query:   449 SAD--NARDILIDGIKHVPNCKLLLEEL-IKFTMVHGGRSHISIVDAVISNALYSRPDVL 505
             + D    R +    ++ +P+ K    ++ I +      + ++S+    +  ++   P   
Sbjct:   536 AKDPERTRQVYQASLELIPHKKFTFAKMWILYAQFEIRQKNLSLARRALGTSIGKCPKN- 594

Query:   506 KVFSLEDVEDISSLYLQFLDLCGTIHD 532
             K+F +  +E    L L+  D C  +++
Sbjct:   595 KLFKVY-IE--LELQLREFDRCRKLYE 618


>MGI|MGI:1914127 [details] [associations]
            symbol:Crnkl1 "Crn, crooked neck-like 1 (Drosophila)"
            species:10090 "Mus musculus" [GO:0000245 "spliceosomal complex
            assembly" evidence=ISO] [GO:0000398 "mRNA splicing, via
            spliceosome" evidence=ISO] [GO:0003723 "RNA binding" evidence=ISO]
            [GO:0005622 "intracellular" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0005681 "spliceosomal complex" evidence=ISO]
            [GO:0006396 "RNA processing" evidence=IEA] [GO:0006397 "mRNA
            processing" evidence=IEA] [GO:0008380 "RNA splicing" evidence=IEA]
            [GO:0071013 "catalytic step 2 spliceosome" evidence=ISO]
            InterPro:IPR003107 InterPro:IPR011990 InterPro:IPR013026
            Pfam:PF02184 SMART:SM00386 MGI:MGI:1914127 GO:GO:0016607
            GO:GO:0005681 GO:GO:0003723 Gene3D:1.25.40.10 GO:GO:0000245
            eggNOG:NOG327505 KO:K12869 HOGENOM:HOG000207972 OMA:KFTFAKI
            GeneTree:ENSGT00550000074931 CTD:51340 HOVERGEN:HBG051046
            EMBL:AK004749 EMBL:AK012962 EMBL:AK088882 EMBL:BC029187
            IPI:IPI00132376 RefSeq:NP_080096.1 UniGene:Mm.248755
            ProteinModelPortal:P63154 SMR:P63154 STRING:P63154
            PhosphoSite:P63154 PaxDb:P63154 PRIDE:P63154
            Ensembl:ENSMUST00000001818 GeneID:66877 KEGG:mmu:66877
            InParanoid:P63154 OrthoDB:EOG4SJ5DC ChiTaRS:CRNKL1 NextBio:322905
            Bgee:P63154 CleanEx:MM_CRNKL1 Genevestigator:P63154
            GermOnline:ENSMUSG00000001767 Uniprot:P63154
        Length = 690

 Score = 103 (41.3 bits), Expect = 0.00014, Sum P(2) = 0.00014
 Identities = 26/113 (23%), Positives = 51/113 (45%)

Query:    76 WRKYADHKARLCSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFEDPNDVRRLFKRAL 135
             W KYA  +  L  I +   ++ERA+     ++ +W  Y  + M      N  R ++ RA+
Sbjct:    84 WIKYAQWEESLKEIQRARSIYERALDVDYRNITLWLKYAEMEMKN-RQVNHARNIWDRAI 142

Query:   136 SFVGKDYLCHTMWDKYIEFEISQQRWSSLAQIFVQTLRF-PSKKLHHYYDSFK 187
             + + +    +  W KY   E      +   Q+F + + + P ++  H Y +F+
Sbjct:   143 TTLPR---VNQFWYKYTYMEEMLGNVAGARQVFERWMEWQPEEQAWHSYINFE 192

 Score = 77 (32.2 bits), Expect = 0.00014, Sum P(2) = 0.00014
 Identities = 50/208 (24%), Positives = 79/208 (37%)

Query:   289 LKNWHDYLSFAEKQGDFDWVVKLYERCLIPCADY---PEFWMRYVDFMESKGG----REI 341
             +KNW  Y  F EK   F    K+YER +    D       ++ +  F E++      R I
Sbjct:   215 VKNWIKYARFEEKHAYFAHARKVYERAVEFFGDEHMDEHLYVAFAKFEENQKEFERVRVI 274

Query:   342 ASYALDRATQIFLKRLPVIHLFNARYKEQIGDTSAARAAFPESYIDSDSRFIEKVTFKAN 401
               YALDR ++   + L   +     ++++ GD         E  I S  RF  +   KAN
Sbjct:   275 YKYALDRISKQEAQELFKNYTI---FEKKFGDRRGI-----EDIIVSKRRFQYEEEVKAN 326

Query:   402 MER--------RLGNFVAACDTYKEALETAAE-----QRKFHTLPLLYVQFSRLTYTTTG 448
                        RL    A  DT +E  E A       Q K H    +Y+  +   Y    
Sbjct:   327 PHNYDAWFDYLRLVESDAEADTVREVYERAIANVPPIQEKRHWKRYIYLWVNYALYEELE 386

Query:   449 SAD--NARDILIDGIKHVPNCKLLLEEL 474
             + D    R +    ++ +P+ K    ++
Sbjct:   387 AKDPERTRQVYQASLELIPHKKFTFAKM 414

 Score = 69 (29.3 bits), Expect = 0.00089, Sum P(2) = 0.00089
 Identities = 38/181 (20%), Positives = 71/181 (39%)

Query:   292 WHDYLSFAEKQGDFDWVVKLYERCLIPCADYPEFWMRYVDFMESKGGREIASYALDRATQ 351
             WH Y++F  +  + +    +YER ++        W++Y  F E       A    +RA +
Sbjct:   185 WHSYINFELRYKEVERARTIYERFVLVHPAVKN-WIKYARFEEKHAYFAHARKVYERAVE 243

Query:   352 IFLKRLPVIHLFNA--RYKEQIGDTSAARAAFPESYIDSDSRFIEKVTFK--ANMERRLG 407
              F       HL+ A  +++E   +    R  +  + +D  S+   +  FK     E++ G
Sbjct:   244 FFGDEHMDEHLYVAFAKFEENQKEFERVRVIYKYA-LDRISKQEAQELFKNYTIFEKKFG 302

Query:   408 NFVAACDTY--KEALETAAEQRKF-HTLPLLYVQFSRLTYTTTGSADNARDILIDGIKHV 464
             +     D    K   +   E +   H     +  + RL   +   AD  R++    I +V
Sbjct:   303 DRRGIEDIIVSKRRFQYEEEVKANPHNYDAWF-DYLRLV-ESDAEADTVREVYERAIANV 360

Query:   465 P 465
             P
Sbjct:   361 P 361


>RGD|620507 [details] [associations]
            symbol:Crnkl1 "crooked neck pre-mRNA splicing factor-like 1
            (Drosophila)" species:10116 "Rattus norvegicus" [GO:0000245
            "spliceosomal complex assembly" evidence=ISO;ISS] [GO:0003723 "RNA
            binding" evidence=ISO;ISS] [GO:0005681 "spliceosomal complex"
            evidence=ISO;ISS] [GO:0071013 "catalytic step 2 spliceosome"
            evidence=IEA;ISO] InterPro:IPR003107 InterPro:IPR011990
            InterPro:IPR013026 Pfam:PF02184 SMART:SM00386 RGD:620507
            GO:GO:0005681 GO:GO:0003723 Gene3D:1.25.40.10 GO:GO:0071013
            GO:GO:0000245 eggNOG:NOG327505 KO:K12869 HOGENOM:HOG000207972
            GeneTree:ENSGT00550000074931 CTD:51340 HOVERGEN:HBG051046
            OrthoDB:EOG4SJ5DC EMBL:AF245018 EMBL:BC085718 IPI:IPI00327482
            RefSeq:NP_446249.1 RefSeq:XP_003749628.1 UniGene:Rn.162694
            ProteinModelPortal:P63155 SMR:P63155 STRING:P63155 PRIDE:P63155
            Ensembl:ENSRNOT00000014632 GeneID:100910202 GeneID:116481
            KEGG:rno:100910202 KEGG:rno:116481 UCSC:RGD:620507
            InParanoid:P63155 NextBio:619051 Genevestigator:P63155
            GermOnline:ENSRNOG00000040045 Uniprot:P63155
        Length = 690

 Score = 103 (41.3 bits), Expect = 0.00014, Sum P(2) = 0.00014
 Identities = 26/113 (23%), Positives = 51/113 (45%)

Query:    76 WRKYADHKARLCSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFEDPNDVRRLFKRAL 135
             W KYA  +  L  I +   ++ERA+     ++ +W  Y  + M      N  R ++ RA+
Sbjct:    84 WIKYAQWEESLKEIQRARSIYERALDVDYRNITLWLKYAEMEMKN-RQVNHARNIWDRAI 142

Query:   136 SFVGKDYLCHTMWDKYIEFEISQQRWSSLAQIFVQTLRF-PSKKLHHYYDSFK 187
             + + +    +  W KY   E      +   Q+F + + + P ++  H Y +F+
Sbjct:   143 TTLPR---VNQFWYKYTYMEEMLGNVAGARQVFERWMEWQPEEQAWHSYINFE 192

 Score = 77 (32.2 bits), Expect = 0.00014, Sum P(2) = 0.00014
 Identities = 50/208 (24%), Positives = 79/208 (37%)

Query:   289 LKNWHDYLSFAEKQGDFDWVVKLYERCLIPCADY---PEFWMRYVDFMESKGG----REI 341
             +KNW  Y  F EK   F    K+YER +    D       ++ +  F E++      R I
Sbjct:   215 VKNWIKYARFEEKHAYFAHARKVYERAVEFFGDEHMDEHLYVAFAKFEENQKEFERVRVI 274

Query:   342 ASYALDRATQIFLKRLPVIHLFNARYKEQIGDTSAARAAFPESYIDSDSRFIEKVTFKAN 401
               YALDR ++   + L   +     ++++ GD         E  I S  RF  +   KAN
Sbjct:   275 YKYALDRISKQEAQELFKNYTI---FEKKFGDRRGI-----EDIIVSKRRFQYEEEVKAN 326

Query:   402 MER--------RLGNFVAACDTYKEALETAAE-----QRKFHTLPLLYVQFSRLTYTTTG 448
                        RL    A  DT +E  E A       Q K H    +Y+  +   Y    
Sbjct:   327 PHNYDAWFDYLRLVESDAEADTVREVYERAIANVPPIQEKRHWKRYIYLWVNYALYEELE 386

Query:   449 SAD--NARDILIDGIKHVPNCKLLLEEL 474
             + D    R +    ++ +P+ K    ++
Sbjct:   387 AKDPERTRQVYQASLELIPHKKFTFAKM 414

 Score = 69 (29.3 bits), Expect = 0.00089, Sum P(2) = 0.00089
 Identities = 38/181 (20%), Positives = 71/181 (39%)

Query:   292 WHDYLSFAEKQGDFDWVVKLYERCLIPCADYPEFWMRYVDFMESKGGREIASYALDRATQ 351
             WH Y++F  +  + +    +YER ++        W++Y  F E       A    +RA +
Sbjct:   185 WHSYINFELRYKEVERARTIYERFVLVHPAVKN-WIKYARFEEKHAYFAHARKVYERAVE 243

Query:   352 IFLKRLPVIHLFNA--RYKEQIGDTSAARAAFPESYIDSDSRFIEKVTFK--ANMERRLG 407
              F       HL+ A  +++E   +    R  +  + +D  S+   +  FK     E++ G
Sbjct:   244 FFGDEHMDEHLYVAFAKFEENQKEFERVRVIYKYA-LDRISKQEAQELFKNYTIFEKKFG 302

Query:   408 NFVAACDTY--KEALETAAEQRKF-HTLPLLYVQFSRLTYTTTGSADNARDILIDGIKHV 464
             +     D    K   +   E +   H     +  + RL   +   AD  R++    I +V
Sbjct:   303 DRRGIEDIIVSKRRFQYEEEVKANPHNYDAWF-DYLRLV-ESDAEADTVREVYERAIANV 360

Query:   465 P 465
             P
Sbjct:   361 P 361


>DICTYBASE|DDB_G0277977 [details] [associations]
            symbol:xab2 "TPR-like helical domain-containing
            protein" species:44689 "Dictyostelium discoideum" [GO:0006396 "RNA
            processing" evidence=IEA] [GO:0005622 "intracellular" evidence=IEA]
            [GO:0008380 "RNA splicing" evidence=IEA] [GO:0006974 "response to
            DNA damage stimulus" evidence=IEA] [GO:0006397 "mRNA processing"
            evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0006281 "DNA repair" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] InterPro:IPR003107 InterPro:IPR011990
            SMART:SM00386 dictyBase:DDB_G0277977 GO:GO:0005634 GO:GO:0008380
            GenomeReviews:CM000152_GR EMBL:AAFI02000023 GO:GO:0006281
            GO:GO:0006397 GO:GO:0006351 Gene3D:1.25.40.10 RefSeq:XP_642044.1
            ProteinModelPortal:Q54Z08 STRING:Q54Z08 EnsemblProtists:DDB0233461
            GeneID:8621256 KEGG:ddi:DDB_G0277977 eggNOG:NOG289100 KO:K12867
            OMA:PRSYKLW ProtClustDB:CLSZ2729105 Uniprot:Q54Z08
        Length = 850

 Score = 102 (41.0 bits), Expect = 0.00014, Sum P(3) = 0.00014
 Identities = 39/179 (21%), Positives = 74/179 (41%)

Query:    27 EEFIAEGSLDFDEWTSLLSEIENSCPDDIEMIGLVYDSFLAEFPLCYGYWRKYADHKARL 86
             EE +++     + W   L E +   P   +    +Y+  + E P  Y  W +Y   +   
Sbjct:    35 EEDVSKNPYSVNCWLRYL-EFKQGSPQ--KQRNYIYERAIRELPRSYKIWHQYLLERTLA 91

Query:    87 ----C----SIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFEDPNDVRRLFKRALSFV 138
                 C    S + V  +FER++        +W  YC   M   E     R+ F RAL  +
Sbjct:    92 IRGKCILENSFEAVNTLFERSLVFLDKMPRIWIEYCEFLMIQ-EKITLTRKTFDRAL--I 148

Query:   139 GKDYLCH-TMWDKYIEFEISQQRWS-SLAQIFVQTLRFPSKKLHHYYDSFKKLAGAWKE 195
                   H  +W++Y +F + +   S +  +++ + L+   +K+  Y +   K+   W+E
Sbjct:   149 ALPVTQHYRIWNEYTKFILKRSIPSLTCIRVYKRYLKIQPEKVEEYIEYLIKIK-EWQE 206

 Score = 64 (27.6 bits), Expect = 0.00014, Sum P(3) = 0.00014
 Identities = 39/197 (19%), Positives = 79/197 (40%)

Query:   289 LKNWHDYLSFAEKQGDFDWVVKLYERCLIPCADYPEFWMRYVDFMESKGGREIASYALDR 348
             +K W  Y+   E  G F     +YE+ +      P+  + +  ++E     E    A + 
Sbjct:   497 IKIWTFYVDLEESFGTFHNTKSIYEKMIQLKVVTPQIILNFAKYLEENKYFEDMFKAYEH 556

Query:   349 ATQIFL----KRLPVIHL--FNARYKEQIGDTSAARAAFPE--SYIDSDSRFIEKVTFKA 400
               Q+FL    + + + +L  F  RY          R  F +  S +      I  + + A
Sbjct:   557 GVQLFLFPHVQDIWITYLTKFIQRYAGM--KLERTRDLFEQVLSKVPPKESIIFYLMY-A 613

Query:   401 NMERRLGNFVAACDTYKEALETAAEQRKFHTLPLLYVQFSRLTYTTTGSADNARDILIDG 460
             N E + G    +   Y  A ++  ++ +F  + LLY+  +   +      +  R+I    
Sbjct:   614 NFEEQYGLARHSMAVYDRAAKSVDKEDRFK-MYLLYIHRASEFF----GVNQTREIFSKA 668

Query:   461 IKHVPNCKLLLEELIKF 477
             I+ +P+ + + +  +KF
Sbjct:   669 IEQLPD-QYVRDMCLKF 684

 Score = 61 (26.5 bits), Expect = 0.00014, Sum P(3) = 0.00014
 Identities = 16/75 (21%), Positives = 37/75 (49%)

Query:   643 DSSSQDRMQQVPPEAAEQHSQDACDPEVLSLDLAHQVT-NENETVQASEAFSEEDDVQRE 701
             D + Q + QQ   E  +Q  Q     +  +L  +  VT +  ET+Q ++    +D++  +
Sbjct:   768 DKNQQQKQQQQQQEKQQQQQQQ--QQQASTLTKSKPVTVSLPETIQYNKKIENDDEINLD 825

Query:   702 YEHESKKDLKPLSLE 716
              + E +++   L+++
Sbjct:   826 DDEEEEEEEDQLAIK 840


>UNIPROTKB|Q9BZJ0 [details] [associations]
            symbol:CRNKL1 "Crooked neck-like protein 1" species:9606
            "Homo sapiens" [GO:0005737 "cytoplasm" evidence=IEA] [GO:0016607
            "nuclear speck" evidence=IEA] [GO:0000245 "spliceosomal complex
            assembly" evidence=IDA] [GO:0005681 "spliceosomal complex"
            evidence=IDA] [GO:0003723 "RNA binding" evidence=IDA] [GO:0000398
            "mRNA splicing, via spliceosome" evidence=IC] [GO:0071013
            "catalytic step 2 spliceosome" evidence=IDA] InterPro:IPR003107
            InterPro:IPR011990 InterPro:IPR013026 Pfam:PF02184 SMART:SM00386
            EMBL:AF318303 GO:GO:0005737 GO:GO:0016607 GO:GO:0003723
            Gene3D:1.25.40.10 GO:GO:0071013 GO:GO:0000245 eggNOG:NOG327505
            KO:K12869 OMA:KFTFAKI EMBL:AF255443 EMBL:AF318302 EMBL:AF318304
            EMBL:AF318305 EMBL:AF111802 EMBL:AK023246 EMBL:AK023728
            EMBL:AK292799 EMBL:AL035454 EMBL:AK022908 IPI:IPI00177437
            IPI:IPI00219317 IPI:IPI00219318 IPI:IPI00219320 IPI:IPI01011870
            RefSeq:NP_057736.4 UniGene:Hs.171342 ProteinModelPortal:Q9BZJ0
            SMR:Q9BZJ0 IntAct:Q9BZJ0 STRING:Q9BZJ0 PhosphoSite:Q9BZJ0
            DMDM:147744555 PaxDb:Q9BZJ0 PRIDE:Q9BZJ0 DNASU:51340
            Ensembl:ENST00000377340 Ensembl:ENST00000490910
            Ensembl:ENST00000496549 Ensembl:ENST00000536226 GeneID:51340
            KEGG:hsa:51340 UCSC:uc002wrs.3 CTD:51340 GeneCards:GC20M019963
            H-InvDB:HIX0015678 HGNC:HGNC:15762 MIM:610952 neXtProt:NX_Q9BZJ0
            PharmGKB:PA26886 HOVERGEN:HBG051046 InParanoid:Q9BZJ0
            GenomeRNAi:51340 NextBio:54782 ArrayExpress:Q9BZJ0 Bgee:Q9BZJ0
            Genevestigator:Q9BZJ0 GermOnline:ENSG00000101343 Uniprot:Q9BZJ0
        Length = 848

 Score = 103 (41.3 bits), Expect = 0.00014, Sum P(2) = 0.00014
 Identities = 26/113 (23%), Positives = 51/113 (45%)

Query:    76 WRKYADHKARLCSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFEDPNDVRRLFKRAL 135
             W KYA  +  L  I +   ++ERA+     ++ +W  Y  + M      N  R ++ RA+
Sbjct:   245 WIKYAQWEESLKEIQRARSIYERALDVDYRNITLWLKYAEMEMKN-RQVNHARNIWDRAI 303

Query:   136 SFVGKDYLCHTMWDKYIEFEISQQRWSSLAQIFVQTLRF-PSKKLHHYYDSFK 187
             + + +    +  W KY   E      +   Q+F + + + P ++  H Y +F+
Sbjct:   304 TTLPR---VNQFWYKYTYMEEMLGNVAGARQVFERWMEWQPEEQAWHSYINFE 353

 Score = 79 (32.9 bits), Expect = 0.00014, Sum P(2) = 0.00014
 Identities = 28/121 (23%), Positives = 52/121 (42%)

Query:   292 WHDYLSFAEKQGDFDWVVKLYERCLIPCADYPEFWMRYVDFMESKGGREIASYALDRATQ 351
             WH Y++F  +  + D    +YER ++   D    W++Y  F E       A    +RA +
Sbjct:   346 WHSYINFELRYKEVDRARTIYERFVLVHPDVKN-WIKYARFEEKHAYFAHARKVYERAVE 404

Query:   352 IFLKRLPVIHLFNA--RYKEQIGDTSAARAAFPESYIDSDSRFIEKVTFK--ANMERRLG 407
              F       HL+ A  +++E   +    R  +  + +D  S+   +  FK     E++ G
Sbjct:   405 FFGDEHMDEHLYVAFAKFEENQKEFERVRVIYKYA-LDRISKQDAQELFKNYTIFEKKFG 463

Query:   408 N 408
             +
Sbjct:   464 D 464

 Score = 76 (31.8 bits), Expect = 0.00028, Sum P(2) = 0.00028
 Identities = 58/267 (21%), Positives = 103/267 (38%)

Query:   289 LKNWHDYLSFAEKQGDFDWVVKLYERCLIPCADY---PEFWMRYVDFMESKGG----REI 341
             +KNW  Y  F EK   F    K+YER +    D       ++ +  F E++      R I
Sbjct:   376 VKNWIKYARFEEKHAYFAHARKVYERAVEFFGDEHMDEHLYVAFAKFEENQKEFERVRVI 435

Query:   342 ASYALDRATQIFLKRLPVIHLFNARYKEQIGDTSAARAAFPESYIDSDSRFIEKVTFKAN 401
               YALDR ++   + L   +     ++++ GD         E  I S  RF  +   KAN
Sbjct:   436 YKYALDRISKQDAQELFKNYTI---FEKKFGDRRGI-----EDIIVSKRRFQYEEEVKAN 487

Query:   402 MER--------RLGNFVAACDTYKEALETAAE-----QRKFHTLPLLYVQFSRLTYTTTG 448
                        RL    A  +  +E  E A       Q K H    +Y+  +   Y    
Sbjct:   488 PHNYDAWFDYLRLVESDAEAEAVREVYERAIANVPPIQEKRHWKRYIYLWINYALYEELE 547

Query:   449 SAD--NARDILIDGIKHVPNCKLLLEEL-IKFTMVHGGRSHISIVDAVISNALYSRPDVL 505
             + D    R +    ++ +P+ K    ++ I +      + ++S+    +  ++   P   
Sbjct:   548 AKDPERTRQVYQASLELIPHKKFTFAKMWILYAQFEIRQKNLSLARRALGTSIGKCPKN- 606

Query:   506 KVFSLEDVEDISSLYLQFLDLCGTIHD 532
             K+F +  +E    L L+  D C  +++
Sbjct:   607 KLFKVY-IE--LELQLREFDRCRKLYE 630


>SGD|S000001800 [details] [associations]
            symbol:SRP40 "Nucleolar serine-rich protein" species:4932
            "Saccharomyces cerevisiae" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0006913 "nucleocytoplasmic transport"
            evidence=ISS] [GO:0005730 "nucleolus" evidence=IDA] SGD:S000001800
            Pfam:PF05022 GO:GO:0005730 GO:GO:0006913 EMBL:BK006944 EMBL:X73541
            GeneTree:ENSGT00700000104548 eggNOG:NOG318801 InterPro:IPR007718
            RefSeq:NP_013018.3 GeneID:853967 KEGG:sce:YKR092C KO:K02927
            RefSeq:NP_013020.3 GeneID:853969 KEGG:sce:YKR094C EMBL:L11275
            EMBL:Z28317 PIR:S38170 ProteinModelPortal:P32583 DIP:DIP-2115N
            IntAct:P32583 MINT:MINT-472548 STRING:P32583 PaxDb:P32583
            EnsemblFungi:YKR092C CYGD:YKR092c OMA:DAXESAR OrthoDB:EOG4M68TB
            NextBio:975404 Genevestigator:P32583 GermOnline:YKR092C
            Uniprot:P32583
        Length = 406

 Score = 126 (49.4 bits), Expect = 0.00018, P = 0.00018
 Identities = 59/247 (23%), Positives = 105/247 (42%)

Query:   587 MPSASQDKKFSPPEKSDSESGDDATSLPSNQKSPLPENHDIRSDGAEVDILLSGEADSS- 645
             +P  S  +K    EKS S S   ++S  S+  S    +    S  +      S  +DSS 
Sbjct:    11 VPKLSVKEK-EIEEKSSSSSSSSSSSSSSSSSSSSSSSSSGESSSSSSSSSSSSSSDSSD 69

Query:   646 SQDRMQQVPPEAAEQHSQDACDPEVLS-LDLAHQVTNENETVQASEAFSE---EDDVQ-- 699
             S D        ++   S  + D E  S  D +   ++ + +  + E+ SE   ED+ +  
Sbjct:    70 SSDSESSSSSSSSSSSSSSSSDSESSSESDSSSSGSSSSSSSSSDESSSESESEDETKKR 129

Query:   700 -REYEHESKKDLKPLSLEGLSLDPGGNDSPGSLCATSHECEAPQKTNFSHESMLKSEAPR 758
              RE ++E  K+ K    E  S     + S GS  ++S E E+  +++    S   S +  
Sbjct:   130 ARESDNEDAKETKKAKTEPESSSSSESSSSGS--SSSSESESGSESDSDSSSSSSSSSDS 187

Query:   759 ET-SLSDGSVLGASQNNNGSHFAPSSMGTQASSSAPIQTRTVSPSSSASHQNFIPEAHSH 817
             E+ S SD     +S +++ S  + SS    +S S    + + S S S S  +   ++ S 
Sbjct:   188 ESDSESDSQSSSSSSSSDSSSDSDSSSSDSSSDSDSSSSSSSSSSDSDSDSDSSSDSDSS 247

Query:   818 PQTPANS 824
               + ++S
Sbjct:   248 GSSDSSS 254


>UNIPROTKB|E1BWJ0 [details] [associations]
            symbol:SART3 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0000166 "nucleotide binding" evidence=IEA] [GO:0003676
            "nucleic acid binding" evidence=IEA] [GO:0005622 "intracellular"
            evidence=IEA] [GO:0006396 "RNA processing" evidence=IEA]
            InterPro:IPR000504 InterPro:IPR003107 InterPro:IPR012677
            Pfam:PF00076 PROSITE:PS50102 SMART:SM00360 SMART:SM00386
            GO:GO:0000166 Gene3D:3.30.70.330 GO:GO:0003676 GO:GO:0005622
            GO:GO:0006396 GeneTree:ENSGT00550000074656 OMA:SQAVMKM
            EMBL:AADN02034959 EMBL:AADN02034960 EMBL:AADN02034961
            EMBL:AADN02034962 EMBL:AADN02034963 IPI:IPI00823214
            ProteinModelPortal:E1BWJ0 Ensembl:ENSGALT00000038892
            ArrayExpress:E1BWJ0 Uniprot:E1BWJ0
        Length = 731

 Score = 101 (40.6 bits), Expect = 0.00021, Sum P(3) = 0.00021
 Identities = 44/176 (25%), Positives = 80/176 (45%)

Query:     1 MEVQISNLESLSAEP--NSPVGFGKQGLEEFIAEGSLDFDEWTSLLSEIENSCPD----- 53
             ME   +  E  S +P   + +   K+ L++   E    ++E   LLS++    P      
Sbjct:   221 MEASYAEYEEWSEDPIPETTIKNYKKALQQL--EKCKPYEEALDLLSQLGAETPKLAEYQ 278

Query:    54 ---DIEM-------IGLVYDSFLAEFPLCYGYWRKYADHKARLCSIDKVV-EVFERAVQS 102
                D EM       I L+Y+  LAE  L    W +Y  +  R   + ++V    +RAV++
Sbjct:   279 AYIDFEMKAGDPARIQLIYERALAENCLVPDLWARYNQYLDRQLKVKELVLSAHDRAVRN 338

Query:   103 ATYSVDVWFHYCSLSMSTFE-DPNDVRRLFKRALS--FV-GKDYLCHTMWDKYIEF 154
               ++V +W  Y  L+M     D   +  +F++AL+  F+   DY+   +W  Y+++
Sbjct:   339 CPWTVGLWIQYL-LAMERHGVDHCIISDMFEKALNAGFIQATDYV--EIWQAYLDY 391

 Score = 73 (30.8 bits), Expect = 0.00021, Sum P(3) = 0.00021
 Identities = 23/83 (27%), Positives = 41/83 (49%)

Query:   246 IQKYRFIGEQIYKEASQLDEKINCFENLIRRPYFHVKPLDDIQLKNWHDYLSFAEKQGDF 305
             +Q  R++G+Q    AS  D +I     ++    F++   +  +L  +  Y+ F  K GD 
Sbjct:   427 LQTCRYLGQQ----ASHGDLEIGGAAEVVGACSFYLLGAETPKLAEYQAYIDFEMKAGDP 482

Query:   306 DWVVKLYERCL---IPC-ADYPE 324
               +  +YE+ L   + C +DYPE
Sbjct:   483 ARIQLIYEKALHRAVQCTSDYPE 505

 Score = 49 (22.3 bits), Expect = 0.00021, Sum P(3) = 0.00021
 Identities = 14/58 (24%), Positives = 26/58 (44%)

Query:   656 EAAEQHSQDACDPEVLSLDLAHQVTNENETVQASEAF---SEEDDVQREYEHESKKDL 710
             E AEQ  Q   + +       +++  + +     E      EE+D  R+ EH+SK ++
Sbjct:   580 EKAEQRKQSRAEKKASKKAKKNRIGEKRKADDDDEGEWGQEEEEDPLRQREHDSKDNI 637


>FB|FBgn0000377 [details] [associations]
            symbol:crn "crooked neck" species:7227 "Drosophila
            melanogaster" [GO:0007443 "Malpighian tubule morphogenesis"
            evidence=IMP] [GO:0007417 "central nervous system development"
            evidence=NAS] [GO:0005634 "nucleus" evidence=IC;NAS] [GO:0007422
            "peripheral nervous system development" evidence=NAS] [GO:0048663
            "neuron fate commitment" evidence=NAS] [GO:0007405 "neuroblast
            proliferation" evidence=NAS] [GO:0005681 "spliceosomal complex"
            evidence=ISS] [GO:0000398 "mRNA splicing, via spliceosome"
            evidence=IC;ISS] [GO:0016607 "nuclear speck" evidence=IDA]
            [GO:0000381 "regulation of alternative mRNA splicing, via
            spliceosome" evidence=IMP] [GO:0007438 "oenocyte development"
            evidence=IMP] [GO:0008347 "glial cell migration" evidence=IMP]
            [GO:0008366 "axon ensheathment" evidence=IMP] [GO:0071013
            "catalytic step 2 spliceosome" evidence=IDA] [GO:0071011
            "precatalytic spliceosome" evidence=IDA] [GO:0051298 "centrosome
            duplication" evidence=IMP] [GO:0005813 "centrosome" evidence=IDA]
            [GO:0072686 "mitotic spindle" evidence=IDA] [GO:0022008
            "neurogenesis" evidence=IMP] InterPro:IPR003107 InterPro:IPR011990
            InterPro:IPR013026 Pfam:PF02184 SMART:SM00386 GO:GO:0005813
            GO:GO:0051298 GO:GO:0016607 EMBL:AE014298 GO:GO:0007438
            Gene3D:1.25.40.10 GO:GO:0071011 GO:GO:0000398 GO:GO:0071013
            GO:GO:0008347 GO:GO:0072686 GO:GO:0007443 GO:GO:0000381
            GO:GO:0008366 eggNOG:NOG327505 KO:K12869 OMA:KFTFAKI
            GeneTree:ENSGT00550000074931 EMBL:X58374 EMBL:AL009195
            EMBL:AY051666 PIR:T13427 RefSeq:NP_477118.1 UniGene:Dm.3140
            ProteinModelPortal:P17886 SMR:P17886 DIP:DIP-17429N IntAct:P17886
            MINT:MINT-282072 STRING:P17886 PaxDb:P17886
            EnsemblMetazoa:FBtr0070418 GeneID:31208 KEGG:dme:Dmel_CG3193
            CTD:12935 FlyBase:FBgn0000377 HOGENOM:HOG000264173
            InParanoid:P17886 OrthoDB:EOG4J9KDW PhylomeDB:P17886
            GenomeRNAi:31208 NextBio:772455 Bgee:P17886 GermOnline:CG3193
            Uniprot:P17886
        Length = 702

 Score = 128 (50.1 bits), Expect = 0.00024, P = 0.00024
 Identities = 59/298 (19%), Positives = 124/298 (41%)

Query:    76 WRKYADHKARLCSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFEDPNDVRRLFKRAL 135
             W KYA+ + +   ++    +++RAV         W+ Y  +     E+    R++F+R +
Sbjct:   113 WLKYAEMEMKNKQVNHARNLWDRAVTIMPRVNQFWYKYTYME-EMLENVAGARQVFERWM 171

Query:   136 SFVGKDYLCHTMWDKYIEFEISQQRWSSLAQIFVQTLRF-PSKKLHHYYDSFKKLAG--- 191
              +  ++      W  Y+ FE+  +      +I+ + +   P  K    +  F++  G   
Sbjct:   172 EWQPEEQA----WQTYVNFELRYKEIDRAREIYERFVYVHPDVKNWIKFARFEESHGFIH 227

Query:   192 ----AWKEELECESDSAMEFQSELVLEGEVPAYYKDDETSSVIKDLLDPSVDLVRSKAIQ 247
                  ++  +E   D  +E +  +          + D    + K  LD  +   R++ + 
Sbjct:   228 GSRRVFERAVEFFGDDYIEERLFIAFARFEEGQKEHDRARIIYKYALD-HLPKDRTQELF 286

Query:   248 KYRFIGEQIYKEASQLDEKINCFENLIRRPYFHVKPL--DDIQLKNWHDYLSFAEKQGDF 305
             K     E+ Y + + +++ I     + +R Y + + +  +      W DYL   E +GD 
Sbjct:   287 KAYTKHEKKYGDRAGIEDVI-----VSKRKYQYEQEVAANPTNYDAWFDYLRLIEAEGDR 341

Query:   306 DWVVKLYERCL--IPCADYPEFWMRYVDFMESKG-GREIASYALDRATQIFLKRLPVI 360
             D + + YER +  +P A+   FW RY+    +     E+ +   +R  QI+   L +I
Sbjct:   342 DQIRETYERAISNVPPANEKNFWRRYIYLWINYALYEELEAEDAERTRQIYKTCLELI 399


>TAIR|locus:2152965 [details] [associations]
            symbol:AT5G41770 species:3702 "Arabidopsis thaliana"
            [GO:0005622 "intracellular" evidence=IEA;ISS] [GO:0005634 "nucleus"
            evidence=ISM] [GO:0006396 "RNA processing" evidence=IEA;ISS]
            InterPro:IPR003107 InterPro:IPR011990 InterPro:IPR013026
            Pfam:PF02184 PROSITE:PS50293 SMART:SM00386 EMBL:CP002688
            GO:GO:0005622 GO:GO:0006396 Gene3D:1.25.40.10 KO:K12869 OMA:KFTFAKI
            IPI:IPI00530971 RefSeq:NP_198992.2 UniGene:At.9341
            ProteinModelPortal:F4JZX8 SMR:F4JZX8 PRIDE:F4JZX8
            EnsemblPlants:AT5G41770.1 GeneID:834182 KEGG:ath:AT5G41770
            Uniprot:F4JZX8
        Length = 705

 Score = 94 (38.1 bits), Expect = 0.00025, Sum P(2) = 0.00025
 Identities = 23/73 (31%), Positives = 39/73 (53%)

Query:    96 FERAVQSATYSVDVWFHYCSLSMSTFEDPNDVRRLFKRALSFVGKDYLCHTMWDKYIEFE 155
             FE  ++ A +++ VW  Y     S  +D    R +++RA+   G DY  HT+W KY EFE
Sbjct:    81 FEDQIRRARWNIQVWVKYAQWEESQ-KDYARARSVWERAIE--G-DYRNHTLWLKYAEFE 136

Query:   156 ISQQRWSSLAQIF 168
             +  +  +S   ++
Sbjct:   137 MKNKFVNSARNVW 149

 Score = 84 (34.6 bits), Expect = 0.00025, Sum P(2) = 0.00025
 Identities = 59/265 (22%), Positives = 106/265 (40%)

Query:   295 YLSFAEKQGDFDWVVKLYERCLIPCADYPEFWMRYVDFMESKG-------GREIASYALD 347
             Y  F  K G+      +YER     AD  E  + +V F E +         R I  +ALD
Sbjct:   232 YAKFEMKGGEVARCRSVYERATEKLADDEEAEILFVAFAEFEERCKEVERARFIYKFALD 291

Query:   348 -----RATQIFLKRLPVIHLFNARY--KEQIGDTSAARAAFP-ESYI-DSDSRFIEKVTF 398
                  RA  ++ K +     F  +Y  KE I D    +  F  E  +  S S +     +
Sbjct:   292 HIPKGRAEDLYRKFVA----FEKQYGDKEGIEDAIVGKRRFQYEDEVRKSPSNYDSWFDY 347

Query:   399 KANMERRLGNFVAACDTYKEALET---AAEQRKFHTLPLLYVQFSRLTYTTTGSADNARD 455
                +E  +GN     + Y+ A+     A E+R +     L++ ++      T   +  RD
Sbjct:   348 -VRLEESVGNKDRIREIYERAIANVPPAEEKRYWQRYIYLWINYALFEEIETEDIERTRD 406

Query:   456 ILIDGIKHVPNCKLLLEELIKFTMVHGGRSHISIVDA--VISNALYSRPDVLKVFSLEDV 513
             +  + +K +P+ K    ++         R  +++  A  ++ NA+   P   K+F  + +
Sbjct:   407 VYRECLKLIPHSKFSFAKIWLLAAQFEIRQ-LNLTGARQILGNAIGKAPKD-KIFK-KYI 463

Query:   514 EDISSLYLQFLDLCGTIHDIRNAWN 538
             E    L L  +D C  +++    W+
Sbjct:   464 E--IELQLGNMDRCRKLYERYLEWS 486


>UNIPROTKB|F1NF69 [details] [associations]
            symbol:SART3 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0000166 "nucleotide binding" evidence=IEA] [GO:0003676
            "nucleic acid binding" evidence=IEA] [GO:0005622 "intracellular"
            evidence=IEA] [GO:0006396 "RNA processing" evidence=IEA]
            InterPro:IPR000504 InterPro:IPR003107 InterPro:IPR012677
            Pfam:PF00076 PROSITE:PS50102 SMART:SM00360 SMART:SM00386
            GO:GO:0000166 Gene3D:3.30.70.330 GO:GO:0003676 GO:GO:0005622
            GO:GO:0006396 GeneTree:ENSGT00550000074656 EMBL:AADN02034959
            EMBL:AADN02034960 EMBL:AADN02034961 EMBL:AADN02034962
            EMBL:AADN02034963 IPI:IPI00577406 Ensembl:ENSGALT00000007806
            ArrayExpress:F1NF69 Uniprot:F1NF69
        Length = 772

 Score = 106 (42.4 bits), Expect = 0.00030, Sum P(3) = 0.00030
 Identities = 38/139 (27%), Positives = 62/139 (44%)

Query:    69 FPLCYGYWRKYADHKARLCSI----DKVVEVFERAVQSATYSVDVWFHYCSLSMSTFEDP 124
             FPL    W  +   + ++ S     +KV E+FERAV+      ++W  Y   S+      
Sbjct:    16 FPLTEEIWLDWLKDEIKMASEVSEREKVYELFERAVKDYICP-EIWLEYAQYSIGGIGQE 74

Query:   125 ND---VRRLFKRALSFVGKDYLCHT-MWDKYIEFEISQQRWSSLAQI---FVQTLRFPSK 177
                  VR +F+RAL+ VG      T +W+ Y EFE +    + L +I   F + L  P  
Sbjct:    75 GGIEKVRSIFERALTAVGLHVTKGTALWEAYREFENAILETAQLERIHTLFRRQLGIPLL 134

Query:   178 KLHHYYDSFKKLAGAWKEE 196
              +   Y  +++    W E+
Sbjct:   135 DMEASYAEYEE----WSED 149

 Score = 69 (29.3 bits), Expect = 0.00030, Sum P(3) = 0.00030
 Identities = 23/91 (25%), Positives = 39/91 (42%)

Query:   292 WHDYLSFAEKQG-DFDWVVKLYERCL----IPCADYPEFWMRYVDFMESKGG-REIASYA 345
             W  YL   E+ G D   +  ++E+ L    I   DY E W  Y+D++  +    + +S  
Sbjct:   271 WIQYLLAMERHGVDHCIISDMFEKALNAGFIQATDYVEIWQAYLDYLRRRVDFTQDSSKE 330

Query:   346 LDRATQIFLKRLPVIHL-FNARYKEQIGDTS 375
             L+     F + +  +      R+ E  GD S
Sbjct:   331 LEELRSAFARAVEYLKQEVEERFSES-GDPS 360

 Score = 47 (21.6 bits), Expect = 0.00030, Sum P(3) = 0.00030
 Identities = 16/59 (27%), Positives = 27/59 (45%)

Query:   656 EAAEQHSQDACDPEVLSLDLAHQVTNENETVQASEA-FSEED-DVQREYEHESKKDLKP 712
             E AEQ  Q   + +       +++  + +     E  + +ED D  R+ EHES  + KP
Sbjct:   502 EKAEQRKQSRAEKKASKKAKKNRIGEKRKADDDDEGEWGQEDKDPLRQREHESGPE-KP 559

 Score = 45 (20.9 bits), Expect = 0.00052, Sum P(4) = 0.00052
 Identities = 12/37 (32%), Positives = 18/37 (48%)

Query:   592 QDKKFSPPEKSDSESGDDATSLPSNQKS-PLPENHDI 627
             +DK   P  + + ESG +    P+NQK  P     D+
Sbjct:   542 EDK--DPLRQREHESGPEKPDHPTNQKEKPSTSRKDV 576

 Score = 40 (19.1 bits), Expect = 0.00052, Sum P(4) = 0.00052
 Identities = 7/20 (35%), Positives = 14/20 (70%)

Query:   701 EYEHESKKDLKPLSLEGLSL 720
             EYE+E++     L ++GL++
Sbjct:   731 EYENEAQASQAVLKMDGLTI 750


>DICTYBASE|DDB_G0278819 [details] [associations]
            symbol:DDB_G0278819 "HAT repeat-containing protein"
            species:44689 "Dictyostelium discoideum" [GO:0006396 "RNA
            processing" evidence=IEA] [GO:0005622 "intracellular" evidence=IEA]
            [GO:0003674 "molecular_function" evidence=ND] [GO:0005681
            "spliceosomal complex" evidence=IEA;ISS] [GO:0003723 "RNA binding"
            evidence=ISS] [GO:0000398 "mRNA splicing, via spliceosome"
            evidence=IC] [GO:0000245 "spliceosomal complex assembly"
            evidence=ISS] [GO:0008380 "RNA splicing" evidence=IEA] [GO:0006397
            "mRNA processing" evidence=IEA] [GO:0005737 "cytoplasm"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0016607
            "nuclear speck" evidence=IEA] [GO:0044351 "macropinocytosis"
            evidence=RCA] InterPro:IPR003107 InterPro:IPR011990
            InterPro:IPR013026 Pfam:PF02184 SMART:SM00386
            dictyBase:DDB_G0278819 GO:GO:0005737 GenomeReviews:CM000152_GR
            GO:GO:0016607 GO:GO:0005681 GO:GO:0003723 Gene3D:1.25.40.10
            EMBL:AAFI02000024 GO:GO:0000245 eggNOG:NOG327505 KO:K12869
            OMA:KFTFAKI RefSeq:XP_641986.1 ProteinModelPortal:Q54XP4
            STRING:Q54XP4 PRIDE:Q54XP4 EnsemblProtists:DDB0233480
            GeneID:8621718 KEGG:ddi:DDB_G0278819 ProtClustDB:CLSZ2729102
            Uniprot:Q54XP4
        Length = 705

 Score = 125 (49.1 bits), Expect = 0.00031, Sum P(2) = 0.00031
 Identities = 64/262 (24%), Positives = 105/262 (40%)

Query:   292 WHDYLSFAEKQGDFDWVVKLYER--CLIPCADYPEFWMRYVDFMESKGGREIASYALDRA 349
             W  Y     K  + +    +++R  CL+P     + W +Y  FME   G   A+ A+   
Sbjct:   112 WIKYAEMEMKNKNINLARNIWDRAVCLLPRVS--QLWFKYT-FMEDMLGNYPAARAI--- 165

Query:   350 TQIFLKRLPVIHLFNA--RYKEQIGDTSAARAAFPESYIDSDSRFIEKVTFKANMERRLG 407
              + +++  P    +N+  ++++++      R  F E YI     +I+        E RLG
Sbjct:   166 FERWMQWKPEPQAWNSYLKFEQRLKLFENTRLIF-EKYILVHP-YIKTWIKYTKFEERLG 223

Query:   408 NFVAACDTYKEALETAAEQRKFHTLPLLYVQFSRLTYTTTGSADNARDILIDGIKHVPN- 466
             N   A   ++ A+E   E      L + + +F    Y      + AR I    I HVP  
Sbjct:   224 NIENARTIFQRAIEFLGEDGNDEQLFIAFAKFEE-KYK---EIERARVIYKYAIDHVPKS 279

Query:   467 -CKLLLEELIKFTMVHGGRSHISIVDAVISNALYSRPDVLKVFSLEDVEDISSLYLQFLD 525
               K L +    F   HG R  I I D V+    +   + +K  S     DI   YL+  +
Sbjct:   280 RAKDLFDTFTNFEKQHGDR--IGIEDVVLGKKRFQYEEEIKKNSKN--YDIWFDYLKMEE 335

Query:   526 LCGTIHDIRNAWNQHIKLFPHT 547
             + G I   R  + + I   P T
Sbjct:   336 INGEIEKTREIYERSIGNLPPT 357

 Score = 51 (23.0 bits), Expect = 0.00031, Sum P(2) = 0.00031
 Identities = 14/51 (27%), Positives = 25/51 (49%)

Query:   109 VWFHYCSLSMSTFEDPNDVRRLFKRALSFVGKDYLCHTMWDKYIEFEISQQ 159
             ++  Y +   S  +D    R +F+R   F+  D+   T+W KY E E+  +
Sbjct:    77 IYIKYAAWEESQ-KDLTRARSVFER---FLDIDHRIPTVWIKYAEMEMKNK 123


>DICTYBASE|DDB_G0271670 [details] [associations]
            symbol:DDB_G0271670 species:44689 "Dictyostelium
            discoideum" [GO:0005576 "extracellular region" evidence=IEA]
            dictyBase:DDB_G0271670 GO:GO:0005576 EMBL:AAFI02000006
            ProtClustDB:CLSZ2431310 RefSeq:XP_645495.1
            ProteinModelPortal:Q75JC9 EnsemblProtists:DDB0168484 GeneID:8618123
            KEGG:ddi:DDB_G0271670 OMA:EITNEEP Uniprot:Q75JC9
        Length = 374

 Score = 122 (48.0 bits), Expect = 0.00042, P = 0.00042
 Identities = 49/256 (19%), Positives = 92/256 (35%)

Query:   572 SNVASLPQPFESEHLMPSASQDKKFSPPEKSDSESGDDATSLPSNQKSPLPENHDIRSDG 631
             S+ +S      S     S+S     S    S S S   ++S  S+  S    +    S  
Sbjct:   119 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS 178

Query:   632 AEVDILLSGEADSSSQDRMQQVPPEAAEQHSQDACDPEVLSLDLAHQVTNENETVQASEA 691
             +      S  + SSS          ++   S  +      S   +   ++ + +  +S +
Sbjct:   179 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS 238

Query:   692 FSEEDDVQREYEHESKKDLKPLSLEGLSLDPGGNDSPGSLCATSHECEAPQKTNFSHESM 751
              S            S       S    S     + S  S  ++S    +   ++ S  S 
Sbjct:   239 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS 298

Query:   752 LKSEAPRETSLSDGSVLGASQNNNGSHFAPSSMGTQASSSAPIQTRTVSPSSSASHQNFI 811
               S +   +S S  S   +S +++ S  + SS  + +SSS+   + + S SSS+S  +  
Sbjct:   299 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS 358

Query:   812 PEAHSHPQTPANSGRN 827
               + S   + ++SG N
Sbjct:   359 SSSSSSSSSSSSSGEN 374


>WB|WBGene00019762 [details] [associations]
            symbol:M03F8.3 species:6239 "Caenorhabditis elegans"
            [GO:0005622 "intracellular" evidence=IEA] [GO:0006396 "RNA
            processing" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            [GO:0006397 "mRNA processing" evidence=IEA] [GO:0009792 "embryo
            development ending in birth or egg hatching" evidence=IMP]
            [GO:0000003 "reproduction" evidence=IMP] [GO:0018996 "molting
            cycle, collagen and cuticulin-based cuticle" evidence=IMP]
            [GO:0040011 "locomotion" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] InterPro:IPR003107 InterPro:IPR011990
            InterPro:IPR013026 Pfam:PF02184 PROSITE:PS50293 SMART:SM00386
            GO:GO:0009792 GO:GO:0040007 GO:GO:0002119 GO:GO:0018996
            GO:GO:0040011 GO:GO:0000003 GO:GO:0005622 GO:GO:0006396
            Gene3D:1.25.40.10 eggNOG:NOG327505 KO:K12869 HOGENOM:HOG000207972
            OMA:KFTFAKI GeneTree:ENSGT00550000074931 EMBL:FO080495
            RefSeq:NP_001122979.1 ProteinModelPortal:A9D4S6 SMR:A9D4S6
            STRING:A9D4S6 PaxDb:A9D4S6 EnsemblMetazoa:M03F8.3b GeneID:178979
            KEGG:cel:CELE_M03F8.3 UCSC:M03F8.3a CTD:178979 WormBase:M03F8.3b
            InParanoid:A9D4S6 NextBio:903380 ArrayExpress:A9D4S6 Uniprot:A9D4S6
        Length = 747

 Score = 99 (39.9 bits), Expect = 0.00056, Sum P(3) = 0.00056
 Identities = 28/113 (24%), Positives = 49/113 (43%)

Query:    76 WRKYADHKARLCSIDKVVEVFERAVQSATYSVDVWFHYCSLSMSTFEDPNDVRRLFKRAL 135
             W KY   +  +  I +   VFERA+     S+ +W  Y  + M   +  N  R +F RA+
Sbjct:    89 WIKYGKWEESIGEIQRARSVFERALDVDHRSISIWLQYAEMEMRC-KQINHARNVFDRAI 147

Query:   136 SFVGKDYLCHTMWDKYIEFEISQQRWSSLAQIFVQTLRF-PSKKLHHYYDSFK 187
             + + +       W KY   E   +      QIF + + + P ++    Y +F+
Sbjct:   148 TIMPR---AMQFWLKYSYMEEVIENIPGARQIFERWIEWEPPEQAWQTYINFE 197

 Score = 72 (30.4 bits), Expect = 0.00056, Sum P(3) = 0.00056
 Identities = 33/140 (23%), Positives = 58/140 (41%)

Query:   292 WHDYLSFAEKQGDFDWVVKLYERCLIPCADYPEFWMRYVDFMESKGGREIASYALDRATQ 351
             W  Y++F  +  + D    +Y+R L       + W++Y  F E  G    A  A ++A +
Sbjct:   190 WQTYINFELRYKEIDRARSVYQRFLHVHGINVQNWIKYAKFEERNGYIGNARAAYEKAME 249

Query:   352 IFLKR---LPVIHLFNARYKEQIGDTSAARAAFPESYIDSDSRFIEKVTFK--ANMERRL 406
              F +      V+  F A ++E+  +   AR  F     +  S   E++ FK     E++ 
Sbjct:   250 YFGEEDINETVLVAF-ALFEERQKEHERARGIFKYGLDNLPSNRTEEI-FKHYTQHEKKF 307

Query:   407 GNFVAACDTYKEALETAAEQ 426
             G  V   D      +T  E+
Sbjct:   308 GERVGIEDVIISKRKTQYEK 327

 Score = 48 (22.0 bits), Expect = 0.00056, Sum P(3) = 0.00056
 Identities = 21/78 (26%), Positives = 30/78 (38%)

Query:   560 KSLRAFIRGKRESNVASLPQPFESEHLMPSASQDKKFSPPEKSDSES---GDDATSLP-S 615
             K L A  R KRE   A+     E +  +P    D++     K   E    GD  T L  S
Sbjct:   662 KLLEAAARWKREREEAAARAAQELDAPIPEGDDDEEKEEAGKDAEEKVREGDSDTDLSES 721

Query:   616 NQKSPLPENHDIRSDGAE 633
             +  S    +    SD ++
Sbjct:   722 SSSSDSESSSSSSSDSSD 739


>ASPGD|ASPL0000051943 [details] [associations]
            symbol:AN0461 species:162425 "Emericella nidulans"
            [GO:0008150 "biological_process" evidence=ND] [GO:0005575
            "cellular_component" evidence=ND] [GO:0003674 "molecular_function"
            evidence=ND] InterPro:IPR006594 PROSITE:PS50896 SMART:SM00667
            Pfam:PF05022 EMBL:BN001308 EMBL:AACD01000007 InterPro:IPR007718
            OMA:RKHFSRI RefSeq:XP_658065.1 EnsemblFungi:CADANIAT00002232
            GeneID:2876234 KEGG:ani:AN0461.2 eggNOG:NOG131021
            HOGENOM:HOG000211567 OrthoDB:EOG4MKRS7 Uniprot:Q5BG69
        Length = 435

 Score = 121 (47.7 bits), Expect = 0.00069, P = 0.00069
 Identities = 62/275 (22%), Positives = 109/275 (39%)

Query:   559 TKSL-RAFIRGKRESNVASLPQPFES--EHLMPSASQDKKFSPPEKSDSESGDDATSLPS 615
             TK L +  I   ++S+V SL + F+S    L          S    SD++S  D+ S  S
Sbjct:    49 TKELAKKSISASKKSDVPSLLEIFQSWESQLNRKNVPSSSSSASSDSDADSSSDSDSSDS 108

Query:   616 N-QKSPLPENHDIRSDGAEVDILLSGEADSSSQDRMQQVPPEAAEQHSQDACDPEVLSLD 674
             + + S  P+    RS         S  A SSS           ++   +D  +  +    
Sbjct:   109 DVEMSEAPKVQRRRSSSTSSSSS-SSSASSSSSSSSSSSSSSDSDADDEDEDEAALAPGP 167

Query:   675 LAHQVTNENET-VQASEAFSEEDDVQREYEHESKKDLKPLSLEGLSLDPGGNDSPGSLCA 733
              A  V  + E+   +S + SEE    ++ +  SK +    S    S D   +    S  +
Sbjct:   168 AAKGVKRKAESSASSSGSDSEETPKAKKTKLTSKAEESSSSSSESSSDSSSDSDSDSDSS 227

Query:   734 TSHECEAPQKTNFSHESMLKSEAPRETSLSDGSVLGASQNNNGSHFAPSSMGTQASSSAP 793
             +S E E+  +++ SH S   S +    S SD S   +S +++ S     S   + +    
Sbjct:   228 SSSESESESESDASHSSSSSSSSDSSDSSSDSSSDSSSDSSSDSSSESESDAAKKADKKA 287

Query:   794 IQ--TRTVSPSSSASHQNFIPEAHSHPQTPANSGR 826
             ++  T T  P S +S       + S  +  +++ R
Sbjct:   288 LKAATETPLPPSDSSSSGSSDSSSSSGEESSSTSR 322


>UNIPROTKB|G3MZB1 [details] [associations]
            symbol:G3MZB1 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0071460 "cellular response to cell-matrix adhesion"
            evidence=IEA] GeneTree:ENSGT00550000074777 GO:GO:0071460
            EMBL:DAAA02018360 OMA:NSCGNEG EMBL:DAAA02018358 EMBL:DAAA02018359
            Ensembl:ENSBTAT00000063098 Uniprot:G3MZB1
        Length = 682

 Score = 123 (48.4 bits), Expect = 0.00081, P = 0.00081
 Identities = 50/195 (25%), Positives = 87/195 (44%)

Query:   589 SASQDKKFSPPEKSDSESGDDATSLPSNQKSPLPENHDIRSDGAEVDILLSGEADSSSQD 648
             S S D K    + SDS+S  D+ S  S+  S   ++ D  SD ++ D   S ++DS S  
Sbjct:   469 SDSSDSKSDSSDSSDSDSKSDSDSSDSSDSSD-SDSSD-SSDSSDSDSSDSSDSDSKSDS 526

Query:   649 RMQQVPPEAAEQHSQDACDPEVLSLDLAHQVTNENETVQASEAFSEEDDVQREYEHESKK 708
                     +    S D+ D +  S       ++ +++  +S++    D    +  + S  
Sbjct:   527 DSSDSSNSSDSSDSSDS-DTKSDSDSSDSSDSDSSDSSDSSDSSDSSDSSDSDSSNSSDS 585

Query:   709 DLKPLSLEGLSLDPGGNDSPGSLCATSHECEAPQKTNFSHESMLKSEAPRETSLSDGSVL 768
             D K  S    S D   +DS  S  + S + ++   ++ S++S  KS++    S SD S  
Sbjct:   586 DSKSDSDSSDSSDSDSSDSSDSDSSDSSDSDSSDSSD-SNDSNTKSDSDSSDS-SDSSD- 642

Query:   769 GASQNNNGSHFAPSS 783
               S++ NG+H   SS
Sbjct:   643 SKSKSGNGNHNGGSS 657


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.316   0.131   0.389    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0     1070       933   0.00092  122 3  11 22  0.42    34
                                                     38  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  56
  No. of states in DFA:  631 (67 KB)
  Total size of DFA:  496 KB (2230 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  104.84u 0.19s 105.03t   Elapsed:  00:00:04
  Total cpu time:  104.85u 0.19s 105.04t   Elapsed:  00:00:04
  Start:  Mon May 20 18:12:09 2013   End:  Mon May 20 18:12:13 2013

Back to top