BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>001744
MMMMNQNQQGSTQNIASSVDPNSVENRYVVDASQSQASSYFPSTTGSGAVSWATHGVNNQ
STENGNLSNASYHHEQHTESHVKSLQDGLNATSLTSSSNLGTTNVAQDYSGYTSYPNSSD
PYAYGSTAYPGYYSSYQQQPNHSYPQPVGAYQNSGAPYQPISSFQNSGSYVGPASYSATY
YNPGDYQTAGGYPSSGYSHQTTSWNEGNYTNYTSHQYSNYTSDTSGAYSSGTAPATSLQY
QQQYKQWADYYSQTEVSCAPGTENLSVASSSNQVLQPPGVTAGYPTAHSQPAPIYHQSWQ
QDSSSSHVSSLQPAATSNGSHDSYWKHGTPSFQNRQVSPVQPHYSKPLEQKTSYNNFQDQ
HKAACPQGPSSQYAIGQQMAPSYQSPPVQTSPQLDNRRVSKLQIPTNPRIASNLALGLPK
TDKDSSTANAAAKPAYIGVSLAKSNEKVVSHADSRVEPGTFPKSLCGYVERALARCKGDA
EIAASQAVMGEIIKKANSDGTLFSRDWDVEPLFPKPTTEAVTKDLPTSTPLSALSKNKRS
PSRRTKSRWEPLPEEKPIDKLASSTNEIVKFSGWIHANEKDRKHISGSVSKEDRLNNIKF
HLSEQKSASKSFQRPVKRQRLSADGFKTEDNGDASSDSDKEQSLTSYYSGAIALANSPEE
RMRRENRSKRFDRGQGNRSETNRFKGKNAGTGNLYVRRASALLISKSFDDGGSRAVEDID
WDALTVKGTCQEIEKRYLRLTSAPDPSTVRPEEVLEKALQMVQNSQKNYLYKCDQLKSIR
QDLTVQRIRNQLTAKVYETHARLAIENGDLPEYNQCQSQLKILYAEGIEGCCMEFSAYHL
LCVILHSNNKRELLSLMSRLSDKAKQDKAVKHALAVRAAVSSGNYIMFFRLYKTAPNLNT
CLMDLYVEKMRFKAVSCMSRSYRPTVPVSYVAQVLGFTGVSPTNEECEERDSDGLEECVE
WLKAHGASLVTDANGEVQLDAKASSSTLFMPEPEDAVSHGDANLAVNDFLARASSQPS

High Scoring Gene Products

Symbol | Full name | Information | P value
SAC3A | AT2G39340 | protein from Arabidopsis thaliana | 9.0e-299
zgc:158262 | - | gene_product from Danio rerio | 1.4e-60
LENG8 | Leukocyte receptor cluster member 8 | protein from Homo sapiens | 2.0e-58
Leng8 | leukocyte receptor cluster (LRC) member 8 | protein from Mus musculus | 7.2e-53
THP3 | Protein that may have a role in transcription elongation | gene from Saccharomyces cerevisiae | 4.7e-32
DDB_G0277813 | SAC3/GANP family protein | gene from Dictyostelium discoideum | 5.5e-31
orf19.6271 | - | gene_product from Candida albicans | 4.1e-22
CaO19.6271 | Putative uncharacterized protein | protein from Candida albicans SC5314 | 4.1e-22
Muc91C | Mucin 91C | protein from Drosophila melanogaster | 1.1e-07
Vml | Vitelline membrane-like | protein from Drosophila melanogaster | 1.1e-07
MCM3AP | 80 kDa MCM3-associated protein | protein from Homo sapiens | 1.9e-05
SAC3D1 | Uncharacterized protein | protein from Canis lupus familiaris | 1.9e-05
mcm3ap | MCM3 minichromosome maintenance deficient 3 (S. cerevisiae) associated protein | gene_product from Danio rerio | 2.5e-05
POLR2A | DNA-directed RNA polymerase II subunit RPB1 | protein from Cricetulus griseus | 2.5e-05
I3LQ53 | Uncharacterized protein | protein from Sus scrofa | 3.1e-05
Muc68D | Mucin 68D | protein from Drosophila melanogaster | 7.1e-05
LOC507750 | Uncharacterized protein | protein from Bos taurus | 0.00013
Mcm3ap | minichromosome maintenance deficient 3 (S. cerevisiae) associated protein | protein from Mus musculus | 0.00015
POLR2A | DNA-directed RNA polymerase II subunit RPB1 | protein from Homo sapiens | 0.00016
Polr2a | polymerase (RNA) II (DNA directed) polypeptide A | gene from Rattus norvegicus | 0.00016
Mur89F | Mucin related 89F | protein from Drosophila melanogaster | 0.00018
LOC507750 | Uncharacterized protein | protein from Bos taurus | 0.00018
SAC3B | AT3G06290 | protein from Arabidopsis thaliana | 0.00020
POLR2A | DNA-directed RNA polymerase | protein from Bos taurus | 0.00025
Polr2a | polymerase (RNA) II (DNA directed) polypeptide A | protein from Mus musculus | 0.00025
MCM3AP | Uncharacterized protein | protein from Sus scrofa | 0.00042
Sac3d1 | SAC3 domain containing 1 | gene from Rattus norvegicus | 0.00043
lig | Protein lingerer | protein from Aedes aegypti | 0.00047
ddx17 | DEAD/DEAH box helicase | gene from Dictyostelium discoideum | 0.00049
POLR2A | DNA-directed RNA polymerase | protein from Canis lupus familiaris | 0.00053
polr2a | polymerase (RNA) II (DNA directed) polypeptide A | gene_product from Danio rerio | 0.00087
rtoA | unknown | gene from Dictyostelium discoideum | 0.00091
Sac3d1 | SAC3 domain containing 1 | protein from Mus musculus | 0.00091
DDB_G0271670 | - | gene from Dictyostelium discoideum | 0.00094

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  001744
        (1018 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2056058 - symbol:SAC3A "AT2G39340" species:370...  2868  9.0e-299  1
ZFIN|ZDB-GENE-070410-6 - symbol:zgc:158262 "zgc:158262" s...   601  1.4e-60   2
UNIPROTKB|Q96PV6 - symbol:LENG8 "Leukocyte receptor clust...   577  2.0e-58   2
POMBASE|SPBC2A9.11c - symbol:SPBC2A9.11c "nuclear export ...   545  3.4e-55   2
MGI|MGI:2142195 - symbol:Leng8 "leukocyte receptor cluste...   503  7.2e-53   3
SGD|S000006249 - symbol:THP3 "Protein that may have a rol...   362  4.7e-32   1
DICTYBASE|DDB_G0277813 - symbol:DDB_G0277813 "SAC3/GANP f...   386  5.5e-31   3
CGD|CAL0000561 - symbol:orf19.6271 species:5476 "Candida ...   295  4.1e-22   2
UNIPROTKB|Q5AAX8 - symbol:CaO19.6271 "Putative uncharacte...   295  4.1e-22   2
FB|FBgn0038642 - symbol:Muc91C "Mucin 91C" species:7227 "...   161  1.1e-07   1
FB|FBgn0085362 - symbol:Vml "Vitelline membrane-like" spe...   158  1.1e-07   1
UNIPROTKB|O60318 - symbol:MCM3AP "80 kDa MCM3-associated ...   138  1.9e-05   3
UNIPROTKB|F1PJG2 - symbol:SAC3D1 "Uncharacterized protein...   135  1.9e-05   1
ZFIN|ZDB-GENE-040715-1 - symbol:mcm3ap "MCM3 minichromoso...   115  2.5e-05   2
UNIPROTKB|P11414 - symbol:POLR2A "DNA-directed RNA polyme...   135  2.5e-05   1
UNIPROTKB|I3LQ53 - symbol:I3LQ53 "Uncharacterized protein...   135  3.1e-05   1
FB|FBgn0036203 - symbol:Muc68D "Mucin 68D" species:7227 "...   137  7.1e-05   1
UNIPROTKB|F1N4T6 - symbol:LOC507750 "Uncharacterized prot...   126  0.00013   1
MGI|MGI:1930089 - symbol:Mcm3ap "minichromosome maintenan...   135  0.00015   3
UNIPROTKB|P24928 - symbol:POLR2A "DNA-directed RNA polyme...   135  0.00016   1
RGD|1587326 - symbol:Polr2a "polymerase (RNA) II (DNA dir...   135  0.00016   1
FB|FBgn0038492 - symbol:Mur89F "Mucin related 89F" specie...   138  0.00018   3
UNIPROTKB|G3N1X9 - symbol:LOC507750 "Uncharacterized prot...   126  0.00018   1
TAIR|locus:2082485 - symbol:SAC3B "AT3G06290" species:370...   110  0.00020   4
UNIPROTKB|G3MZY8 - symbol:POLR2A "DNA-directed RNA polyme...   133  0.00025   1
MGI|MGI:98086 - symbol:Polr2a "polymerase (RNA) II (DNA d...   133  0.00025   1
UNIPROTKB|I3LCF4 - symbol:MCM3AP "Uncharacterized protein...   146  0.00042   5
RGD|1308049 - symbol:Sac3d1 "SAC3 domain containing 1" sp...   123  0.00043   1
UNIPROTKB|Q16VD3 - symbol:lig "Protein lingerer" species:...   143  0.00047   2
DICTYBASE|DDB_G0293168 - symbol:ddx17 "DEAD/DEAH box heli...   126  0.00049   1
UNIPROTKB|F1PGS0 - symbol:POLR2A "DNA-directed RNA polyme...   130  0.00053   1
ZFIN|ZDB-GENE-041008-78 - symbol:polr2a "polymerase (RNA)...   128  0.00087   1
DICTYBASE|DDB_G0271916 - symbol:rtoA "unknown" species:44...   119  0.00091   1
MGI|MGI:1913656 - symbol:Sac3d1 "SAC3 domain containing 1...   120  0.00091   1
DICTYBASE|DDB_G0271670 - symbol:DDB_G0271670 species:4468...   119  0.00094   1
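
The hit summary lines above share a fixed layout: a `DB|accession` identifier, a truncated description, then the high score, the smallest sum probability P(N), and N. A minimal sketch for extracting those fields in Python follows. It assumes the last three whitespace-separated fields are always score, P(N), and N; the name `parse_hit_line` and the regular expression are illustrative and are not part of the BLAST software.

```python
import re

# Assumed layout of one summary line (inferred from the listing, not from a spec):
#   DB|accession - description ...   score  P(N)  N
HIT_LINE = re.compile(
    r"^(?P<db>[^|\s]+)\|(?P<acc>\S+) - (?P<desc>.*?)"
    r"\s+(?P<score>\d+)\s+(?P<p>\S+)\s+(?P<n>\d+)\s*$"
)


def parse_hit_line(line):
    """Return the fields of one summary line as a dict, or None if it does not match."""
    m = HIT_LINE.match(line)
    if m is None:
        return None
    return {
        "db": m.group("db"),
        "accession": m.group("acc"),
        "description": m.group("desc"),
        "score": int(m.group("score")),
        "p_value": float(m.group("p")),
        "n": int(m.group("n")),
    }


hit = parse_hit_line(
    'TAIR|locus:2056058 - symbol:SAC3A "AT2G39340" species:370...  2868  9.0e-299  1'
)
```

Lines that do not follow this layout, such as `Query=  001744`, yield `None`, so the function can be applied to every line of the report and the results filtered.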


>TAIR|locus:2056058
            symbol:SAC3A "AT2G39340" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM;IDA] [GO:0008150 "biological_process"
            evidence=ND] Pfam:PF03399 GO:GO:0005634 EMBL:CP002685
            UniGene:At.48570 UniGene:At.70170 InterPro:IPR005062
            IPI:IPI00526769 RefSeq:NP_181466.2 UniGene:At.75352
            ProteinModelPortal:F4IUY8 PRIDE:F4IUY8 EnsemblPlants:AT2G39340.1
            GeneID:818519 KEGG:ath:AT2G39340 OMA:QQPSHSY Uniprot:F4IUY8
        Length = 1006

 Score = 2868 (1014.6 bits), Expect = 9.0e-299, P = 9.0e-299
 Identities = 586/1022 (57%), Positives = 713/1022 (69%)

Query:     7 NQQGSTQNIASSVDPNSVENRYVVDASQSQASSYFPSTTGSGAVSWATHGVNNQSTENGN 66
             N  G+TQ +A  +DPNS+ENRY VD SQ+Q  SY  ST GS +  W  H V NQ+ ENGN
Sbjct:     2 NHGGNTQAVAP-MDPNSIENRYGVDGSQTQKYSYQYST-GSESAPWTGHSVENQAVENGN 59

Query:    67 LSNASYHHEQHTESHVKSLQDXXXXXXXXXXXXXXXXXVAQDYSGYTSYPNSSDPYAYGS 126
              SN++Y+H Q T     ++Q+                 VAQDYSGYT Y  SSDP+ Y +
Sbjct:    60 YSNSNYYHPQPTGPATGNVQEIPNTVSFTISSTSGTANVAQDYSGYTPYQTSSDPHNYSN 119

Query:   127 TAYPGXXXXXXXXPNHSYPQPVGAYQNSGAPYQPISSFQNSGSYVGPASYSATYYNPGDY 186
             T Y          P+ SYPQPVGAYQN+GAP QP+SSFQN GSY G  SYS TYYNP DY
Sbjct:   120 TGYSNYYSGYQQQPSQSYPQPVGAYQNTGAP-QPLSSFQNPGSYAGTPSYSGTYYNPADY 178

Query:   187 QTAGGY-------------PSSGYSHQTTSWNEGNYTNYTSHQYSNYTSDTSGAYSSGTA 233
             QTAGGY             PS+ YS+QT + N+GNYT+YTS+ Y NYT D +  +SS  A
Sbjct:   179 QTAGGYQSTNYNNQTAGSYPSTNYSNQTPASNQGNYTDYTSNPYQNYTPDAANTHSSTIA 238

Query:   234 PATSLQYQQQYKQWADYYSQTEVSCAPGTENLSVASSS--NQVLQPPGVTAGYPTAHSQP 291
                 + YQQ Y+QW +YYSQTEV CAPGTE LS  ++S  +Q    PGVT+  P ++SQP
Sbjct:   239 TTPPVHYQQNYQQWTEYYSQTEVPCAPGTEKLSTPTTSAYSQSFPVPGVTSEMPASNSQP 298

Query:   292 APIYHXXXXXXXXXXXXXXXXPAATSNGSHDSYWKHGTPSFQNRQVSPVQPHYSKPLEQK 351
             AP Y                 P A  + S+D+YW H  PS Q     P Q +Y  PLE K
Sbjct:   299 APSYVQPWRPETDSSHPPSQQPGAAVSTSNDTYWMHQAPSLQAHHPVPPQNNYQSPLETK 358

Query:   352 TSYNN-FQDQHKAACPQGPSSQYAIGQQMAPSYQSPPVQTSPQLDNRRVSKLQIPTNPRI 410
               Y   FQ   +A  PQ  +SQ +  Q  AP     P QT+P +D++RVSK+QIPTNPRI
Sbjct:   359 PLYETPFQGHQRATYPQEMNSQSSFHQ--APLGYRQPTQTAPLVDSQRVSKVQIPTNPRI 416

Query:   411 ASNLALGLPKTDKDSSTANAAAKPAYIGVSLAKSNEKVVSHADSRVEPGTFPKSLCGYVE 470
             ASNL  G  K DKDS+ A+AA  PAY+ VS+ K  +    H  +  +PGTFPKSL G+VE
Sbjct:   417 ASNLPSGFTKMDKDSTAASAAQAPAYVSVSMPKPKD----HTTAMSDPGTFPKSLRGFVE 472

Query:   471 RALARCKGDAEIAASQAVMGEIIKKANSDGTLFSRDWDVEPLFPKPTTEAVTKDLPTSTP 530
             RA ARCK D E  + +  + +I+KKA  D TL++RDWD EPL    TT  VT    +S  
Sbjct:   473 RAFARCKDDKEKESCEVALRKIVKKAKEDNTLYTRDWDTEPLSTVTTTN-VTNSESSSAQ 531

Query:   531 LSALSKNKRSPSRRTKSRWEPLPEEKPIDKLASSTNEIVKFSGWIHANEKDRKHISGSVS 590
             LS+L +NK SP+RR KSRWEPL E KP  K AS+ +  VKF  W H NE ++K  S S  
Sbjct:   532 LSSL-QNK-SPTRRPKSRWEPLVEGKPFVKPASTFSSAVKFGVWNHQNENNKKS-SESFQ 588

Query:   591 KEDRLNNIKFHLSEQKSASKSFQRPVKRQRLSADGFKTEDNGDASSDSDKEQSLTSYYSG 650
             K D     K   S Q SA KSFQRPVKRQR S  G  T  + +ASSDSDK+  LT YYS 
Sbjct:   589 KVDAATGFKPTYSGQNSAKKSFQRPVKRQRFSG-GAATAIDDEASSDSDKD--LTPYYSS 645

Query:   651 AIALANSPEERMRRENRSKRFDRGQGNRSETNRFKGKNAGTGNLYVRRASALLISKSFDD 710
             A+ALA S EE+ RR++RSKRF++ QG+    +  K KNA  GNL+ RRA+AL +SK FD+
Sbjct:   646 AMALAGSAEEKKRRDSRSKRFEKIQGHSRGNDLTKPKNANVGNLHSRRATALRLSKVFDE 705

Query:   711 GGSRAVEDIDWDALTVKGTCQEIEKRYLRLTSAPDPSTVRPEEVLEKALQMVQNSQKNYL 770
              GSRAVEDIDWDALTVKGTCQEIEKRYLRLTSAPDP+TVRPE+VLEKAL MVQ+SQKNYL
Sbjct:   706 SGSRAVEDIDWDALTVKGTCQEIEKRYLRLTSAPDPATVRPEDVLEKALIMVQDSQKNYL 765

Query:   771 YKCDQLKSIRQDLTVQRIRNQLTAKVYETHARLAIENGDLPEYNQCQSQLKILYAEGIEG 830
             +KCDQLKSIRQDLTVQRI N LTAKVYETHARLA+E GDLPEYNQC SQLK LYAEG+EG
Sbjct:   766 FKCDQLKSIRQDLTVQRIHNHLTAKVYETHARLALEAGDLPEYNQCLSQLKTLYAEGVEG 825

Query:   831 CCMEFSAYHLLCVILHSNNKRELLSLMSRLSDKAKQDKAVKHALAVRAAVSSGNYIMFFR 890
             C +EF+AY LL + LHSNN RELLS MSRLS++ K+D+AV+HAL+VRAAV+SGNY+MFFR
Sbjct:   826 CSLEFAAYSLLYITLHSNNNRELLSSMSRLSEEDKKDEAVRHALSVRAAVTSGNYVMFFR 885

Query:   891 LYKTAPNLNTCLMDLYVEKMRFKAVSCMSRSYRPTVPVSYVAQVLGFTGVSPTNEECEER 950
             LYKTAPN+N+CLMDLYVEKMR+KAV+ MSRS RPT+PVSY+ QVLGFTG +  +E  +E+
Sbjct:   886 LYKTAPNMNSCLMDLYVEKMRYKAVNFMSRSCRPTIPVSYIVQVLGFTGAA--SEGTDEK 943

Query:   951 DSDGLEECVEWLKAHGASLVTDANGEVQLDAKASSSTLFMPEPEDAVSHGDANLAVNDFL 1010
             ++DG+E+C+EWLK HGA+++TD+NG++ LD KA+S++LFMPEPEDAV+HGD NL VNDF 
Sbjct:   944 ETDGMEDCLEWLKTHGANIITDSNGDMLLDTKATSTSLFMPEPEDAVAHGDRNLDVNDFF 1003

Query:  1011 AR 1012
              R
Sbjct:  1004 TR 1005
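
The percentages on the `Identities` and `Positives` lines are the match counts divided by the aligned length. Judging only from the numbers in this report (713/1022 is 69.77% but is shown as 69%), the values appear to be truncated rather than rounded. A small sketch under that assumption; `percent_truncated` is a hypothetical helper, not a function of the BLAST software.

```python
def percent_truncated(matches, aligned_length):
    """Whole-number percentage, truncated toward zero (assumed display rule)."""
    return (100 * matches) // aligned_length


identities = percent_truncated(586, 1022)  # Identities = 586/1022
positives = percent_truncated(713, 1022)   # Positives = 713/1022
```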


>ZFIN|ZDB-GENE-070410-6
            symbol:zgc:158262 "zgc:158262" species:7955 "Danio
            rerio" [GO:0005575 "cellular_component" evidence=ND] [GO:0003674
            "molecular_function" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] Pfam:PF03399 ZFIN:ZDB-GENE-070410-6 InterPro:IPR005062
            EMBL:BC139488 IPI:IPI00845920 RefSeq:NP_001082988.1
            UniGene:Dr.44195 Ensembl:ENSDART00000113345 GeneID:100037367
            KEGG:dre:100037367 eggNOG:NOG313808 GeneTree:ENSGT00390000008006
            HOGENOM:HOG000001575 HOVERGEN:HBG070702 OMA:CEQMKSI
            OrthoDB:EOG4J6RQC NextBio:20788530 Bgee:A4QNR8 Uniprot:A4QNR8
        Length = 839

 Score = 601 (216.6 bits), Expect = 1.4e-60, Sum P(2) = 1.4e-60
 Identities = 172/548 (31%), Positives = 265/548 (48%)

Query:   401 KLQIPTNPRIASNLALGLPKTDKDSSTANAAAKPAYIGVSLAKSNEKVVSHADSRVEPGT 460
             K  IP  P + SN     P  ++    A   A     G + A S+    S   ++  P  
Sbjct:   279 KFNIPKRPYVMSNQ--NFPPGEQGPGLAPPTATNPSSGSTSATSS---ASTGGAQTRPQD 333

Query:   461 FPKSLCGYVERALARCKGDAEIAASQAVMGEIIKKANSDGTLFSRDWDVEPL---FPKPT 517
             +P+++  YV+R    C+ + +   ++ ++ ++++    DG+ ++ DW+ EPL    PK +
Sbjct:   334 WPQAMKEYVQRCFTACESEEDKDRTEKMLKDLLQSRLQDGSAYTIDWNREPLPDLKPKQS 393

Query:   518 TEAVTKDLPTSTPLSALSKNKRSPSRRTKSRWEPLPEEKPIDKLASSTNEIVKFSGWIHA 577
                V   L     +      +     R  +      +  P    + S+    + S   H+
Sbjct:   394 RWEVMPQLSQENSIGRGGATRGRGGARLGNYRNVFSQRSPSSSSSGSSRSSSR-SPSPHS 452

Query:   578 NEKDRK--HISGSVSKEDRLNNIKFHLSEQKSASKSFQ-RPVKRQR-LSADGFKTEDNGD 633
               +DR   H S S S+ D   +    LS  +   K  + R  +R+R   A G +    G 
Sbjct:   453 RHRDRSRHHRSDSGSQSDGSMSSDLRLSLTRRQQKGGRGRGAERERGRGAAGERGRGRGK 512

Query:   634 ASSDSDKEQSLTSYYSGAIALANSPEERMRRENRSKRFDRGQGNRSETNRFKGKNAGTGN 693
             A+      + L S   G            +R+  +   D    NR      K   A   +
Sbjct:   513 ANRGRRNMEDLAS---GV---------GKKRKGGNAGLDFHDPNREAK---KQSRAARFH 557

Query:   694 LYVRRASALLISKSFD-DGGSRAVEDIDWDALTVKGTCQEIEKRYLRLTSAPDPSTVRPE 752
               +R    +L    FD   G++  E + WD   + GTCQ+I K YLRLT APDPSTVRP 
Sbjct:   558 TKLRTEPLVLNINVFDLPNGTQ--EGLSWDDCPIVGTCQDITKNYLRLTCAPDPSTVRPV 615

Query:   753 EVLEKALQMVQ---NSQKNYLYKCDQLKSIRQDLTVQRIRNQLTAKVYETHARLAIENGD 809
              VL K+L  V+    S ++Y+Y C+Q+KSIRQDLTVQ +R   T +VYETHAR+A+E GD
Sbjct:   616 PVLRKSLIAVKAHWKSNQDYVYACEQMKSIRQDLTVQGVRTDFTVEVYETHARIALEKGD 675

Query:   810 LPEYNQCQSQLKILYAEGIEGCCMEFSAYHLLCVILHSNNKRELLSLMSRLSDKAKQDKA 869
               E+NQCQ+QLK LY +       EF+AY L+  I  + N  +L + +  L+ + + D  
Sbjct:   676 HEEFNQCQTQLKALYKDCPSDNVGEFTAYRLIYYIF-TKNSGDLTTELVYLTTELRADPC 734

Query:   870 VKHALAVRAAVSSGNYIMFFRLYKTAPNLNTCLMDLYVEKMRFKAVSCMSRSYRPTVPVS 929
             V HAL +R A + GN+  FFRLY+ AP +   L+D +VE+ R  A+  + +S+RP+V V 
Sbjct:   735 VAHALELRTAWALGNFHRFFRLYQKAPRMAAYLIDKFVERERNIALRAILKSFRPSVSVE 794

Query:   930 YVAQVLGF 937
             YV   L F
Sbjct:   795 YVQSSLAF 802

 Score = 135 (52.6 bits), Expect = 5.7e-05, P = 5.7e-05
 Identities = 115/547 (21%), Positives = 185/547 (33%)

Query:   159 QPISSFQNSGSYVGPASYSATYYNPGDYQTAGGYPSSGYSHQT-TSWNEGNYTNYTSHQY 217
             Q ++S   S +   P++   T    G YQ A    S+    Q    W + N   Y  + Y
Sbjct:    37 QALASISKSQNSTKPSTNQTTQ-QAGQYQAAVTDSSAMQQQQYYQQWYQQNQQQYAGYPY 95

Query:   218 S-NY--------------TSDTSGAYSSGTAP---ATSLQYQQQYKQWADYYSQTEVSCA 259
               NY                   G Y + T P   A S+   +   Q ++Y  Q      
Sbjct:    96 PYNYYYPMPPYGGPYPPGQYGVPGGYQTPTTPTGQAPSMPTMED--QSSNYPPQPPPPAT 153

Query:   260 PGTENLSVASSSNQVLQPPGVTAGYPTAHSQPAPIYHXXXXXXXXXXXXXXXXPAATSNG 319
             P     S  SS      PP      P    QP   Y                     SN 
Sbjct:   154 PQPPPQSPTSSDQPPPPPP------PPIPPQPNAQYPRPPSQTPYSPNGVLPYSPTDSNP 207

Query:   320 SHDSYWKHGT--PSFQNRQVSPVQPHYSKPLEQKTSYNNFQDQHKAACPQGPSSQYAIGQ 377
                 Y   G   P +Q  Q    QPH  +       Y     +  A  P   S+    GQ
Sbjct:   208 MMRGY-NPGQIRPGYQGYQTPVYQPHQQQA-PAPGLYGG-AGRVDAKKPNN-SNMNKGGQ 263

Query:   378 QMAPSYQSPPVQTSPQLDNRRVSKLQIPTNPRIASNLALGLPKTDKDSSTANAAAKPAYI 437
             Q+         Q   Q       K  IP  P + SN     P  ++    A   A     
Sbjct:   264 QLW--------QRMKQAPGTGAVKFNIPKRPYVMSNQ--NFPPGEQGPGLAPPTATNPSS 313

Query:   438 GVSLAKSNEKVVSHADSRVEPGTFPKSLCGYVERALARCKGDAEIAASQAVMGEIIKKAN 497
             G + A S+    S   ++  P  +P+++  YV+R    C+ + +   ++ ++ ++++   
Sbjct:   314 GSTSATSS---ASTGGAQTRPQDWPQAMKEYVQRCFTACESEEDKDRTEKMLKDLLQSRL 370

Query:   498 SDGTLFSRDWDVEPL---FPKPTTEAVTKDLPTSTPLSALSKNKRSPSRRTKSRWEPLPE 554
              DG+ ++ DW+ EPL    PK +   V   L     +      +     R  +      +
Sbjct:   371 QDGSAYTIDWNREPLPDLKPKQSRWEVMPQLSQENSIGRGGATRGRGGARLGNYRNVFSQ 430

Query:   555 EKPIDKLASSTNEIVKFSGWIHANEKDRK--HISGSVSKEDRLNNIKFHLSEQKSASKSF 612
               P    + S+    + S   H+  +DR   H S S S+ D   +    LS  +   K  
Sbjct:   431 RSPSSSSSGSSRSSSR-SPSPHSRHRDRSRHHRSDSGSQSDGSMSSDLRLSLTRRQQKGG 489

Query:   613 Q-RPVKRQR-LSADGFKTEDNGDASSDSDKEQSLTSYYS-----GAIALA-NSPEERMRR 664
             + R  +R+R   A G +    G A+      + L S        G   L  + P    ++
Sbjct:   490 RGRGAERERGRGAAGERGRGRGKANRGRRNMEDLASGVGKKRKGGNAGLDFHDPNREAKK 549

Query:   665 ENRSKRF 671
             ++R+ RF
Sbjct:   550 QSRAARF 556

 Score = 76 (31.8 bits), Expect = 1.4e-60, Sum P(2) = 1.4e-60
 Identities = 31/117 (26%), Positives = 47/117 (40%)

Query:   191 GYPSSGYS--HQTTSWNEGNYT--NYTSHQYS-----NYTSDTSGAYSSGTAPATSLQYQ 241
             G  SSG    H+   W +      + +  Q S     N T+  +G Y +    ++++Q Q
Sbjct:    17 GNNSSGEGPVHENPEWEKARQALASISKSQNSTKPSTNQTTQQAGQYQAAVTDSSAMQQQ 76

Query:   242 QQYKQWADYYSQTEVSCAPGTENLSVASSSNQVLQPPG---VTAGY--PTAHSQPAP 293
             Q Y+QW    +Q + +  P   N            PPG   V  GY  PT  +  AP
Sbjct:    77 QYYQQWYQQ-NQQQYAGYPYPYNYYYPMPPYGGPYPPGQYGVPGGYQTPTTPTGQAP 132

 Score = 51 (23.0 bits), Expect = 6.1e-58, Sum P(2) = 6.1e-58
 Identities = 15/41 (36%), Positives = 18/41 (43%)

Query:   152 QNSGAPYQPISSFQNSGSYVGPASYSATYYNPGDYQTAGGY 192
             Q +G PY P + +     Y GP       Y PG Y   GGY
Sbjct:    89 QYAGYPY-PYNYYYPMPPYGGP-------YPPGQYGVPGGY 121

 Score = 49 (22.3 bits), Expect = 9.9e-58, Sum P(2) = 9.9e-58
 Identities = 14/52 (26%), Positives = 17/52 (32%)

Query:   109 YSGYTSYPNSSDPYAYGSTAYPGXXXXXXXXPNHSYPQPVGAYQNSGAPYQP 160
             Y+ Y   P    PY  G    PG           +   P    Q+S  P QP
Sbjct:    97 YNYYYPMPPYGGPYPPGQYGVPGGYQTPTTPTGQAPSMPTMEDQSSNYPPQP 148
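
The `Expect` and `P` values in the segment headers above coincide because both are very small. Under the usual Poisson model for BLAST statistics, P = 1 - exp(-E), so P is close to E whenever E is much smaller than 1. A small sketch, assuming that relation applies to this report; `p_from_expect` is a hypothetical helper name.

```python
import math


def p_from_expect(expect):
    """Probability of at least one chance hit, given the expected number of hits."""
    # -expm1(-E) equals 1 - exp(-E) but keeps precision for very small E.
    return -math.expm1(-expect)


p_small = p_from_expect(5.7e-05)  # the header reports Expect = 5.7e-05, P = 5.7e-05
p_large = p_from_expect(1.0)      # for E = 1 the two values differ clearly
```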


>UNIPROTKB|Q96PV6
            symbol:LENG8 "Leukocyte receptor cluster member 8"
            species:9606 "Homo sapiens" [GO:0005515 "protein binding"
            evidence=IPI] Pfam:PF03399 EMBL:CH471135 EMBL:CU467002
            EMBL:CU207370 InterPro:IPR005062 eggNOG:NOG313808
            HOGENOM:HOG000001575 HOVERGEN:HBG070702 EMBL:AB067519 EMBL:AL834532
            EMBL:BC028048 IPI:IPI00177888 IPI:IPI00217858 IPI:IPI00386039
            RefSeq:NP_443157.1 UniGene:Hs.502378 UniGene:Hs.740924
            ProteinModelPortal:Q96PV6 IntAct:Q96PV6 MINT:MINT-1435849
            STRING:Q96PV6 PhosphoSite:Q96PV6 DMDM:158705886 PaxDb:Q96PV6
            PRIDE:Q96PV6 DNASU:114823 Ensembl:ENST00000326764
            Ensembl:ENST00000575939 GeneID:114823 KEGG:hsa:114823
            UCSC:uc002qfw.2 CTD:114823 GeneCards:GC19P054960 HGNC:HGNC:15500
            HPA:HPA042004 HPA:HPA042056 neXtProt:NX_Q96PV6 PharmGKB:PA134903953
            ChiTaRS:LENG8 GenomeRNAi:114823 NextBio:79308 ArrayExpress:Q96PV6
            Bgee:Q96PV6 CleanEx:HS_LENG8 Genevestigator:Q96PV6 Uniprot:Q96PV6
        Length = 779

 Score = 577 (208.2 bits), Expect = 2.0e-58, Sum P(2) = 2.0e-58
 Identities = 188/629 (29%), Positives = 286/629 (45%)

Query:   322 DSYWKHGTPSFQNRQVSPVQPHYSKPLEQKTSYNNFQDQHKAACPQGPSSQYAIGQQMAP 381
             D    +  P  Q     P QP  S P     + N+      A   Q   +  A GQ   P
Sbjct:   112 DESMSYQAPPQQLPSAQPPQP--SNPPHGAHTLNSGPQPGTAPATQHSQAGPATGQAYGP 169

Query:   382 SYQSPPVQTSPQLDNRRVSKLQ-IPTNPRIASNLALGLPKTDKDSSTANAAAKPAYIGVS 440
                + P +  P+   +  ++++  P    +  N+          S  +NA  +  + G  
Sbjct:   170 HTYTEPAK--PKKGQQLWNRMKPAPGTGGLKFNIQKRPFAVTTQSFGSNAEGQ--HSGFG 225

Query:   441 LAKSNEKVVSHADSRV------EPGTFPKSLCGYVERALARCKGDAEIAASQAVMGEIIK 494
                + EKV +H+ S        +P  +P+ +  YVER    C+ + +   ++ ++ E+++
Sbjct:   226 PQPNPEKVQNHSGSSARGNLSGKPDDWPQDMKEYVERCFTACESEEDKDRTEKLLKEVLQ 285

Query:   495 KANSDGTLFSRDWDVEPLFPKPTTEAVTKDLPTSTPLSALSK--NKRSPSRRTKSRWEPL 552
                 DG+ ++ DW  EPL P  T E V +  P      A S     R     T+    P 
Sbjct:   286 ARLQDGSAYTIDWSREPL-PGLTREPVAES-PKKKRWEAASSLHPPRGAGSATRGGGAPS 343

Query:   553 PEEKPIDKLASST--NEIVKFSG---WIHANEK----DRKHISGSVSKEDRLNNIKFHLS 603
                 P    A     N   KF     ++  N      D +  S S S          H  
Sbjct:   344 QRGTPGAGGAGRARGNSFTKFGNRNVFMKDNSSSSSTDSRSRSSSRSPTRHFRRSDSHSD 403

Query:   604 EQKSASKSFQRPVKRQRLSADGFKTEDNGDASSDSDKEQSLTSYYSGAIALANSPEERMR 663
                S S +   PV R+     G      G   +  D+ +   +       LA  P +R R
Sbjct:   404 SDSSYSGNECHPVGRRNPPPKG-----RGGRGAHMDRGRG-RAQRGKRHDLA--PTKRSR 455

Query:   664 RENRSKRFDRGQGNRSETNRFKGKNAGTGNLYVRRASALLISKSFDDGGSRAVEDIDWDA 723
             ++  +   +  +    +  R      G     +R    +L   S +  G+    D DW  
Sbjct:   456 KKMAALECEDPERELKKQKRAARFQHGHSRR-LRLEPLVLQMSSLESSGA----DPDWQE 510

Query:   724 LTVKGTCQEIEKRYLRLTSAPDPSTVRPEEVLEKALQMVQ---NSQKNYLYKCDQLKSIR 780
             L + GTC +I K YLRLT APDPSTVRP  VL+K+L MV+     +++Y + C+Q+KSIR
Sbjct:   511 LQIVGTCPDITKHYLRLTCAPDPSTVRPVAVLKKSLCMVKCHWKEKQDYAFACEQMKSIR 570

Query:   781 QDLTVQRIRNQLTAKVYETHARLAIENGDLPEYNQCQSQLKILYAEGIEGCCMEFSAYHL 840
             QDLTVQ IR + T +VYETHAR+A+E GD  E+NQCQ+QLK LYAE + G   EF+AY +
Sbjct:   571 QDLTVQGIRTEFTVEVYETHARIALEKGDHEEFNQCQTQLKSLYAENLPGNVGEFTAYRI 630

Query:   841 LCVILHSNNKRELLSLMSRLSDKAKQDKAVKHALAVRAAVSSGNYIMFFRLYKTAPNLNT 900
             L  I  + N  ++ + ++ L+ + K D  V HALA+R A + GNY  FFRLY  AP ++ 
Sbjct:   631 LYYIF-TKNSGDITTELAYLTRELKADPCVAHALALRTAWALGNYHRFFRLYCHAPCMSG 689

Query:   901 CLMDLYVEKMRFKAVSCMSRSYRPTVPVS 929
              L+D + ++ R  A+  M ++Y   VP S
Sbjct:   690 YLVDKFADRERKVALKAMIKTY--VVPSS 716

 Score = 55 (24.4 bits), Expect = 2.0e-58, Sum P(2) = 2.0e-58
 Identities = 27/82 (32%), Positives = 37/82 (45%)

Query:   225 SGAYSSGTAPATSLQYQQQYKQWADYYSQT-------EVSCAPGT-ENLSVASSSNQV-- 274
             S  Y S  A A++LQ QQQY QW   Y+          +   PG  E++S  +   Q+  
Sbjct:    69 SAQYVS-QAEASALQ-QQQYYQWYQQYNYAYPYSYYYPMPPVPGMDESMSYQAPPQQLPS 126

Query:   275 LQPPGVTA---GYPTAHSQPAP 293
              QPP  +    G  T +S P P
Sbjct:   127 AQPPQPSNPPHGAHTLNSGPQP 148


>POMBASE|SPBC2A9.11c
            symbol:SPBC2A9.11c "nuclear export factor (predicted)"
            species:4896 "Schizosaccharomyces pombe" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005634 "nucleus"
            evidence=IDA] [GO:0005829 "cytosol" evidence=IDA] [GO:0006351
            "transcription, DNA-dependent" evidence=IEA] [GO:0051168 "nuclear
            export" evidence=ISM] PomBase:SPBC2A9.11c Pfam:PF03399
            GO:GO:0005829 GO:GO:0005634 EMBL:CU329671 GO:GO:0006351
            GO:GO:0051168 InterPro:IPR005062 eggNOG:NOG313808
            RefSeq:NP_596221.1 EnsemblFungi:SPBC2A9.11c.1 GeneID:2540452
            KEGG:spo:SPBC2A9.11c HOGENOM:HOG000170697 OrthoDB:EOG4DFSXX
            NextBio:20801579 Uniprot:Q1MTP1
        Length = 395

 Score = 545 (196.9 bits), Expect = 3.4e-55, Sum P(2) = 3.4e-55
 Identities = 107/213 (50%), Positives = 152/213 (71%)

Query:   728 GTCQEIEKRYLRLTSAPDPSTVRPEEVLEKALQMVQNS---QKNYLYKCDQLKSIRQDLT 784
             G   E+EKRYLRLTSAPDP TVRP  VL++ L++++     +KNY Y CDQ KS+RQDLT
Sbjct:   134 GRSTELEKRYLRLTSAPDPDTVRPLPVLKQTLELLKKKWKEEKNYAYICDQFKSLRQDLT 193

Query:   785 VQRIRNQLTAKVYETHARLAIENGDLPEYNQCQSQLKILYAEGIEGCCMEFSAYHLLCVI 844
             VQRI+N+ +  VYE HAR+A+E GD+ EYNQCQ+QL  LY+ GI G   EF AY +L  +
Sbjct:   194 VQRIQNEFSVLVYEIHARIALEKGDVGEYNQCQTQLFHLYSFGIPGNTKEFLAYRIL-YM 252

Query:   845 LHSNNKRELLSLMSRLSDKAKQDKAVKHALAVRAAVSSGNYIMFFRLYKTAPNLNTCLMD 904
             L + N+ E+ SL++ L ++ K + AV HAL VR+A+++G+Y  FF LY  APN+   LMD
Sbjct:   253 LFTKNRSEMNSLLANLKEEDKTNAAVTHALEVRSAMATGDYYKFFHLYLVAPNMGGYLMD 312

Query:   905 LYVEKMRFKAVSCMSRSYRPTVPVSYVAQVLGF 937
             L++E+ R +A+  M ++YRP++ + ++A  L F
Sbjct:   313 LFIERERVQAMIMMCKAYRPSLTMEFLANTLAF 345

 Score = 58 (25.5 bits), Expect = 3.4e-55, Sum P(2) = 3.4e-55
 Identities = 20/94 (21%), Positives = 43/94 (45%)

Query:   590 SKEDRLNNIKFHLSEQKSASKSFQRPVKRQRLSADGFKTEDNGDASSDSDKEQSLTSYYS 649
             S+ +  + +K  +S Q      +   V    ++ +  +   N   +   DK++ +    S
Sbjct:    28 SQPELEDEVKLLISRQYEMGNIWN--VDWSSMNLESLRKLTNAQNTIIEDKKRKVEKPVS 85

Query:   650 G-AIALANSPEERMRRENRSKRFDRGQGNRSETN 682
             G   +L +  +E  ++E R +RF+ G  +RS+ N
Sbjct:    86 GNQFSLLSEEDEVDKKEKRRRRFENG--SRSQNN 117


>MGI|MGI:2142195
            symbol:Leng8 "leukocyte receptor cluster (LRC) member 8"
            species:10090 "Mus musculus" [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            MGI:MGI:2142195 Pfam:PF03399 InterPro:IPR005062 eggNOG:NOG313808
            GeneTree:ENSGT00390000008006 HOGENOM:HOG000001575
            HOVERGEN:HBG070702 CTD:114823 ChiTaRS:LENG8 EMBL:AK034310
            EMBL:AK085304 EMBL:AK170629 EMBL:BC042658 EMBL:BC066768
            IPI:IPI00228486 IPI:IPI00458937 RefSeq:NP_766324.1 UniGene:Mm.22831
            ProteinModelPortal:Q8CBY3 STRING:Q8CBY3 PhosphoSite:Q8CBY3
            PaxDb:Q8CBY3 PRIDE:Q8CBY3 Ensembl:ENSMUST00000037472
            Ensembl:ENSMUST00000117274 GeneID:232798 KEGG:mmu:232798
            UCSC:uc009exb.1 UCSC:uc009exc.1 NextBio:381234 Bgee:Q8CBY3
            CleanEx:MM_LENG8 Genevestigator:Q8CBY3 Uniprot:Q8CBY3
        Length = 785

 Score = 503 (182.1 bits), Expect = 7.2e-53, Sum P(3) = 7.2e-53
 Identities = 104/208 (50%), Positives = 142/208 (68%)

Query:   718 DIDWDALTVKGTCQEIEKRYLRLTSAPDPSTVRPEEVLEKALQMVQNSQK---NYLYKCD 774
             D DW  L + GTC +I K YLRLT APDPSTVRP  VL+K+L MV++  K   +Y + C+
Sbjct:   543 DPDWQELQIVGTCPDITKHYLRLTCAPDPSTVRPVAVLKKSLCMVKSHWKEKQDYAFACE 602

Query:   775 QLKSIRQDLTVQRIRNQLTAKVYETHARLAIENGDLPEYNQCQSQLKILYAEGIEGCCME 834
             Q+KSIRQDLTVQ IR + T +VYETHAR+A+E GD  E+NQCQ+QLK LYAE + G   E
Sbjct:   603 QMKSIRQDLTVQGIRTEFTVEVYETHARIALEKGDHEEFNQCQTQLKSLYAENLAGNVGE 662

Query:   835 FSAYHLLCVILHSNNKRELLSLMSRLSDKAKQDKAVKHALAVRAAVSSGNYIMFFRLYKT 894
             F+AY +L  I  + N  ++ + ++ L+ + K D  V HALA+RAA + GNY  FFRLY  
Sbjct:   663 FTAYRILYYIF-TKNSGDITTELAYLTREMKADPCVAHALALRAAWALGNYHRFFRLYCH 721

Query:   895 APNLNTCLMDLYVEKMRFKAVSCMSRSY 922
             AP ++  L+D + ++ R  A+  M ++Y
Sbjct:   722 APCMSGYLVDKFADRERKAALKAMIKTY 749

 Score = 110 (43.8 bits), Expect = 7.2e-53, Sum P(3) = 7.2e-53
 Identities = 70/320 (21%), Positives = 121/320 (37%)

Query:   213 TSHQYSNYTSDTSGAYSSGTAPATSLQYQQQ-YKQWADYYSQTEVSCA--PGTENLSVAS 269
             TS   ++ +   + A     A A++LQ QQQ Y QW   Y+         P +   S  S
Sbjct:    55 TSSSKASSSGPVASAQYVSQAEASALQQQQQQYYQWYQQYNYAYPYSYYYPMSMYQSYGS 114

Query:   270 SSNQVLQPPGVTAGYPTAHSQ-PAPIYHXXXXXXXXXXXXXXXXPAATSNGSHDSYWKHG 328
              S       G+ + Y +A +Q P+   H                P     G  +S     
Sbjct:   115 PSQY-----GMASSYGSATAQQPSAPQHQGTLNQ----------PPVP--GMDESMAYQA 157

Query:   329 TPSFQNRQVSPVQPHYSKPLEQKTSYNNFQDQHKAACPQGPSSQYAIGQQMAPSYQSPPV 388
             +P  Q     P QP  S+      S +N      A   Q   +    GQ   P   S P 
Sbjct:   158 SPQ-QLPAAQPPQPSNSQ--HGTHSLSNGPQPGTAPSTQHSQAGAPTGQAYGPHSYSEPA 214

Query:   389 QTSPQLDNRRVSKLQ-IPTNPRIASNLALGLPKTDKDSSTANAAAKPAYIGVSLAKSNEK 447
             +  P+   +  ++++  P    +  N+          S ++N+  + +  G      N +
Sbjct:   215 K--PKKGQQLWTRMKPAPGTGGLKFNIQKRPFAVTSQSFSSNSEGQHSSFGPQPNSENTQ 272

Query:   448 VVSHADSRV----EPGTFPKSLCGYVERALARCKGDAEIAASQAVMGEIIKKANSDGTLF 503
               S    R     +P  +P+ +  YVER    C+ + +   ++ ++ E+++    DG+ +
Sbjct:   273 NRSGPSGRGNLSGKPDDWPQDMKEYVERCFTACESEEDKDRTEKLLKEVLQARLQDGSAY 332

Query:   504 SRDWDVEPLFPKPTTEAVTK 523
             + DW  EPL P  T E V +
Sbjct:   333 TIDWSREPL-PGLTREPVAE 351

 Score = 59 (25.8 bits), Expect = 1.5e-47, Sum P(3) = 1.5e-47
 Identities = 43/199 (21%), Positives = 64/199 (32%)

Query:    49 AVSWATHG--VNNQSTENGNLSNASYHHEQHTESHVKSLQDXXXXXXXXXXXXXXXXXVA 106
             A  W++    V   S ENG       H     E   ++L                   VA
Sbjct:    10 AADWSSQYSMVTGNSRENG--METPMHENPEWEKARQALASISKAGATSSSKASSSGPVA 67

Query:   107 QDYSGYTSYPNSSDPYAYGSTAYPGXXXXXXXXPNHSYPQPVGAYQNSGAPYQ-PISSFQ 165
                + Y S   +S         Y          P +SY  P+  YQ+ G+P Q  ++S  
Sbjct:    68 S--AQYVSQAEASALQQQQQQYYQWYQQYNYAYP-YSYYYPMSMYQSYGSPSQYGMASSY 124

Query:   166 NSGSYVGPAS--YSATYYNPGDYQTAGGYPSSGYSHQTTSWNEGNYTNYTSHQYSNYTSD 223
              S +   P++  +  T   P      G   S  Y               ++ Q+   T  
Sbjct:   125 GSATAQQPSAPQHQGTLNQP---PVPGMDESMAYQASPQQLPAAQPPQPSNSQHG--THS 179

Query:   224 TSGAYSSGTAPATSLQYQQ 242
              S     GTAP+T  Q+ Q
Sbjct:   180 LSNGPQPGTAPST--QHSQ 196

 Score = 47 (21.6 bits), Expect = 7.2e-53, Sum P(3) = 7.2e-53
 Identities = 7/21 (33%), Positives = 12/21 (57%)

Query:   658 PEERMRRENRSKRFDRGQGNR 678
             PE  ++++ R+ RF  G   R
Sbjct:   504 PERELKKQKRAARFQHGHSRR 524

 Score = 44 (20.5 bits), Expect = 5.6e-46, Sum P(3) = 5.6e-46
 Identities = 12/57 (21%), Positives = 24/57 (42%)

Query:   193 PSSGYSHQTTSWNEGNYTNYTSHQYSNYTSDTSGAYSSGTAPATSLQYQQQYKQWAD 249
             P +  S   +S +EG ++++     S  T + SG    G        + Q  K++ +
Sbjct:   243 PFAVTSQSFSSNSEGQHSSFGPQPNSENTQNRSGPSGRGNLSGKPDDWPQDMKEYVE 299


>SGD|S000006249
            symbol:THP3 "Protein that may have a role in transcription
            elongation" species:4932 "Saccharomyces cerevisiae" [GO:0005634
            "nucleus" evidence=IEA;IDA] [GO:0035327 "transcriptionally active
            chromatin" evidence=IDA] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0000398 "mRNA splicing, via
            spliceosome" evidence=IMP] [GO:0003674 "molecular_function"
            evidence=ND] SGD:S000006249 Pfam:PF03399 GO:GO:0005634
            GO:GO:0006355 GO:GO:0006351 EMBL:Z71255 EMBL:BK006949 GO:GO:0000398
            EMBL:Z49219 InterPro:IPR005062 EMBL:Z73616 eggNOG:NOG313808
            GeneTree:ENSGT00390000008006 OrthoDB:EOG4DFSXX PIR:S54069
            RefSeq:NP_015370.1 ProteinModelPortal:Q12049 DIP:DIP-3903N
            IntAct:Q12049 MINT:MINT-2492728 STRING:Q12049 PaxDb:Q12049
            PRIDE:Q12049 EnsemblFungi:YPR045C GeneID:856158 KEGG:sce:YPR045C
            CYGD:YPR045c OMA:TYLCDQF NextBio:981295 Genevestigator:Q12049
            GermOnline:YPR045C Uniprot:Q12049
        Length = 470

 Score = 362 (132.5 bits), Expect = 4.7e-32, P = 4.7e-32
 Identities = 118/385 (30%), Positives = 180/385 (46%)

Query:   605 QKSASKSFQRPVKRQRLSADGFKTEDNGDASSDSD--KEQSLTSYYSGAIALANSPEERM 662
             QK  +K+ ++ + R    A       +G+  S+S+     ++ S     + +  S +E  
Sbjct:    85 QKRMNKNIKKKLPRVSKKASALSNGVSGNVMSNSNIVGHGAVGSASGWKVEMGGS-DELE 143

Query:   663 RRENRSKRFDRGQGNRSETNRFKGKNAGTGNLYVRRASALLISKSFDDGGSRAVEDIDWD 722
             RR+ R++RF   QG  + TN     N    NL        + SKS            D  
Sbjct:   144 RRKRRAERFS--QGPSATTNSNDNLNEDFANLNA------ISSKS---------HQYD-K 185

Query:   723 ALTVKGTCQEIEKRYLRLTSAPDPSTVRPEEVLEKA----LQMVQNSQKNYLYKCDQLKS 778
              + V G CQ +EK YLRLTS P+P  +RP  +L+K     +   Q+    Y Y CDQ KS
Sbjct:   186 KIHVVGRCQTLEKSYLRLTSEPNPDLIRPPNILQKMYCLLMDKYQSKTATYTYLCDQFKS 245

Query:   779 IRQDLTVQRIRNQLTAKVYETHARLAIENGDLPEYNQCQSQLKILYAEGI--EGCCMEFS 836
             +RQDL VQ I N  T KVY+THAR+A+ENGDL E+NQCQ+++  L+      +    EF 
Sbjct:   246 MRQDLRVQMIENSFTIKVYQTHARIALENGDLGEFNQCQNRIMALFENPTIPKKSYSEFI 305

Query:   837 AYHLLCVIL---HSNNKRELLSLMSRLSDKAKQDKAVKHALAVRAAVSSGNYIMFFRLYK 893
              Y +L  +L   + +     L L+   S +  +D+ VK    +      GNY  F + Y 
Sbjct:   306 CYSVLYSMLTEDYPSISHLKLKLIDDGSSEILEDEHVKMIFELSDMKLVGNYHYFMKNYL 365

Query:   894 TAPNLNTCLMD--LYVEKMRFKAVSCMSRSYRPTVPVSYVAQVLGFTGVSPTNEECEERD 951
                    CL++  L +EK+ F  + C  +SY   V + +V     F  +  T     E++
Sbjct:   366 KLHKFEKCLINSFLNLEKLIFLTIIC--KSYNQ-VNLDFVKSEFNFNSIEETTNFLNEQN 422

Query:   952 SDGLEECVEWLKAHGASLVTDANGE 976
                L E +  L       +TD+NG+
Sbjct:   423 ---LTEFI--LNKQ----ITDSNGK 438


>DICTYBASE|DDB_G0277813
            symbol:DDB_G0277813 "SAC3/GANP family protein"
            species:44689 "Dictyostelium discoideum" [GO:0008150
            "biological_process" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0003674 "molecular_function" evidence=ND]
            dictyBase:DDB_G0277813 Pfam:PF03399 EMBL:AAFI02000022
            InterPro:IPR005062 eggNOG:NOG313808 RefSeq:XP_642492.1
            EnsemblProtists:DDB0233557 GeneID:8621203 KEGG:ddi:DDB_G0277813
            InParanoid:Q54Z61 ProtClustDB:CLSZ2846842 Uniprot:Q54Z61
        Length = 774

 Score = 386 (140.9 bits), Expect = 5.5e-31, Sum P(3) = 5.5e-31
 Identities = 101/321 (31%), Positives = 172/321 (53%)

Query:   630 DNGDASSDSDKEQSLTSYYSGAIALANSPEERMRRENRSKRFDRGQGNRSETNRFKGKNA 689
             +N + + +S    S+T          +SP++ +   N++ + +    N +  N     N 
Sbjct:   425 NNSNNNRNSQNSPSITPTRGPLKFNVDSPKQPIVFNNQNNQNNNNNNNNNNNNNNNNNNN 484

Query:   690 GTGNLYVRRASALLISKSFDDGGSRAVEDIDWDALTVKGTCQEIEKRYLRLTSAPDPSTV 749
                N      +  L ++S D     +V+ I        GTC++ EK YLRLT   DP+ +
Sbjct:   485 NNNNN--NNKNVKLTNESID---LNSVKPII-------GTCKDYEKSYLRLTGPADPAKI 532

Query:   750 RPEEVLE----KALQMVQNSQKNYLYKCDQLKSIRQDLTVQRIRNQLTAKVYETHARLAI 805
             R  E+LE    K ++  QN+ KN+ Y  DQLKSIRQDL VQ IRN+ T  VYE +A++ +
Sbjct:   533 RSIEILETWFPKLIRKYQNN-KNFNYALDQLKSIRQDLMVQHIRNKFTVNVYEANAKICL 591

Query:   806 ENGDLPEYNQCQSQLKILYAEGIEGCCME----FSAYHLLC-VILHSNNKRELLSLMSR- 859
             EN D  E+ QC SQ+K LY    +  C+E    F +Y LL  +I   +N+ EL+SL+ + 
Sbjct:   592 ENSDFIEFGQCLSQIKELYHSISDQSCLENKFEFISYDLLFNLIFIKDNELELISLLPKI 651

Query:   860 LSDKA-KQDKAVKHALAVRAAVSSGNYIMFFRLYKTAPNLNTCLMDLYV-EKMRFKAVSC 917
             L+D++   ++ +KH   +  +V   NY  F +LY T  N+   L++  + ++ R  +++ 
Sbjct:   652 LNDESFYSNENIKHTFEIIKSVLENNYCKFNKLYLTCYNMEKYLLEKILNDRFRVYSINA 711

Query:   918 MSRSYRPTVPVSYVAQVLGFT 938
             M +SY+P++ ++ + + L F+
Sbjct:   712 MIKSYKPSIHLNLLEKQLSFS 732

 Score = 42 (19.8 bits), Expect = 5.5e-31, Sum P(3) = 5.5e-31
 Identities = 13/47 (27%), Positives = 23/47 (48%)

Query:   329 TPSFQNRQVSPVQPHYSK----PLEQKTSYNNFQDQHKAAC-P-QGP 369
             TP  + R +  + P+ S     P+++  S NN   Q+  +  P +GP
Sbjct:   399 TPITKRRDIEDLTPYSSPNPSTPIQRNNSNNNRNSQNSPSITPTRGP 445

 Score = 41 (19.5 bits), Expect = 7.0e-31, Sum P(3) = 7.0e-31
 Identities = 9/28 (32%), Positives = 16/28 (57%)

Query:   340 VQPHYSKPLEQKTSYNNFQDQHKAACPQ 367
             VQ H  +  +Q+ +YNN    ++ + PQ
Sbjct:   213 VQQHQHQQQQQQNNYNN---NNQTSSPQ 237

 Score = 41 (19.5 bits), Expect = 7.0e-31, Sum P(3) = 7.0e-31
 Identities = 19/70 (27%), Positives = 28/70 (40%)

Query:   332 FQNRQVSPVQPHYSKPLEQKTSYNNFQDQ--HKAACPQG---PSSQYAIGQQMAPSYQSP 386
             F +   SP QPH  +  +Q+      Q Q  HK         P ++    + + P Y SP
Sbjct:   359 FNSPSSSP-QPHQQQQQQQQQQQQQQQQQQQHKPQQRSNLSTPITKRRDIEDLTP-YSSP 416

Query:   387 PVQTSPQLDN 396
                T  Q +N
Sbjct:   417 NPSTPIQRNN 426

 Score = 40 (19.1 bits), Expect = 8.9e-31, Sum P(3) = 8.9e-31
 Identities = 22/87 (25%), Positives = 33/87 (37%)

Query:   194 SSGYSHQTTSWNEGNYT--NYTSHQYSNYTSDTSGAYSSGTAPATSLQYQQQYKQWADYY 251
             SS    Q     EG  T  +  S ++S + S +S             Q QQQ +Q   + 
Sbjct:   331 SSPQQQQKQHHQEGEDTPSSLMSARFSRFNSPSSSPQPHQQQQQQQQQQQQQQQQQQQHK 390

Query:   252 SQ------TEVSCAPGTENLSVASSSN 272
              Q      T ++     E+L+  SS N
Sbjct:   391 PQQRSNLSTPITKRRDIEDLTPYSSPN 417

 Score = 38 (18.4 bits), Expect = 5.5e-31, Sum P(3) = 5.5e-31
 Identities = 6/25 (24%), Positives = 15/25 (60%)

Query:    58 NNQSTENGNLSNASYHHEQHTESHV 82
             NN +  N N +N + ++  + +S++
Sbjct:    59 NNNNNNNNNNNNNNNNNNNNNKSNI 83

 Score = 37 (18.1 bits), Expect = 7.0e-31, Sum P(3) = 7.0e-31
 Identities = 9/35 (25%), Positives = 14/35 (40%)

Query:   205 NEGNYTNYTSHQYSNYTSDTSGAYSSGTAPATSLQ 239
             N  N  N  ++  SN     S    +   P TS++
Sbjct:    68 NNNNNNNNNNNNKSNINKKASPLKKNTMTPRTSIK 102


>CGD|CAL0000561
            symbol:orf19.6271 species:5476 "Candida albicans" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0035327 "transcriptionally active chromatin"
            evidence=IEA] [GO:0000398 "mRNA splicing, via spliceosome"
            evidence=IEA] CGD:CAL0000561 Pfam:PF03399 EMBL:AACQ01000037
            InterPro:IPR005062 RefSeq:XP_718858.1 GeneID:3639557
            KEGG:cal:CaO19.6271 Uniprot:Q5AAX8
        Length = 710

 Score = 295 (108.9 bits), Expect = 4.1e-22, Sum P(2) = 4.1e-22
 Identities = 82/235 (34%), Positives = 124/235 (52%)

Query:   599 KFHLSEQKSASKSFQRPVKRQRLSADGFKTEDNGDASSDSDKEQSLTSYYSGAIALANSP 658
             K + +E  +  KS ++P      + + F  + NGDA  D  K+ + +   + + + +N+ 
Sbjct:   295 KSNSTEDDTNKKSKEKP---SNTNNNSFYIDVNGDAPYDPTKKPTTSKILNNSSSSSNNN 351

Query:   659 EERMRRENRSKRFDRGQGNRSETNRFKGKNAGTGNLYVRRASALLISKSFDDGG-----S 713
                 + +  +K+F +  GN  + N  + K       Y   A     S+ F         S
Sbjct:   352 NNNKKTKGDNKKF-KETGNNFQKN--ENKKRQLQEEYDSEARKRARSERFAQISEFKPIS 408

Query:   714 RAVEDIDW-DALTVKGTCQEIEKRYLRLTSAPDPSTVRPEEVLEKALQMVQNSQK---NY 769
                ED    D+  V GT +++EK Y RLTSAP+P+ VR  +VL  +L+ V    +   NY
Sbjct:   409 TYYEDQRRKDSGAVVGTSEQLEKSYFRLTSAPNPAQVRSLKVLHDSLKYVVRKYEESHNY 468

Query:   770 LYKCDQLKSIRQDLTVQRIRNQLTAKVYETHARLAIENGDLPEYNQCQSQLKILY 824
              Y  DQL SIR DLTVQ I+++ T  VYE +AR++IEN DL E+NQCQ+QLK LY
Sbjct:   469 SYIIDQLNSIRLDLTVQHIKDEFTVHVYEKNARISIENNDLGEFNQCQAQLKSLY 523

 Score = 47 (21.6 bits), Expect = 4.1e-22, Sum P(2) = 4.1e-22
 Identities = 16/80 (20%), Positives = 34/80 (42%)

Query:     5 NQNQQGSTQNIASSVDP---NSVENRYVVDA-SQSQASSYFPSTTGSGAVSWATHGVNNQ 60
             ++ Q G  QN  + ++    ++ E  YV     Q+  ++ + S    G +    +   NQ
Sbjct:    92 DEGQNGPNQNQLNDLEGRIYDANEEAYVPPRLRQNPPNTAYSSANNFGTLPMTNNNNLNQ 151

Query:    61 STENGNLSNASYHHEQHTES 80
              T   N +N   ++ + T +
Sbjct:   152 DTVGNNQNNTVNNNSESTNN 171

 Score = 39 (18.8 bits), Expect = 2.8e-21, Sum P(2) = 2.8e-21
 Identities = 16/74 (21%), Positives = 34/74 (45%)

Query:   166 NSGSYVGPASYSATYYNPGD--YQTAGGYPSSGYSHQTTSWNEGNYTNYTSHQYSNYTSD 223
             N  +YV P        NP +  Y +A  + +   ++   + N+    N  ++  +N  S+
Sbjct:   114 NEEAYVPPRLRQ----NPPNTAYSSANNFGTLPMTNNN-NLNQDTVGNNQNNTVNN-NSE 167

Query:   224 TSGAYSSGTAPATS 237
             ++  + +G APA +
Sbjct:   168 STNNFQAGMAPAAA 181


>UNIPROTKB|Q5AAX8
            symbol:CaO19.6271 "Putative uncharacterized protein"
            species:237561 "Candida albicans SC5314" [GO:0003674
            "molecular_function" evidence=ND] CGD:CAL0000561 Pfam:PF03399
            EMBL:AACQ01000037 InterPro:IPR005062 RefSeq:XP_718858.1
            GeneID:3639557 KEGG:cal:CaO19.6271 Uniprot:Q5AAX8
        Length = 710

 Score = 295 (108.9 bits), Expect = 4.1e-22, Sum P(2) = 4.1e-22
 Identities = 82/235 (34%), Positives = 124/235 (52%)

Query:   599 KFHLSEQKSASKSFQRPVKRQRLSADGFKTEDNGDASSDSDKEQSLTSYYSGAIALANSP 658
             K + +E  +  KS ++P      + + F  + NGDA  D  K+ + +   + + + +N+ 
Sbjct:   295 KSNSTEDDTNKKSKEKP---SNTNNNSFYIDVNGDAPYDPTKKPTTSKILNNSSSSSNNN 351

Query:   659 EERMRRENRSKRFDRGQGNRSETNRFKGKNAGTGNLYVRRASALLISKSFDDGG-----S 713
                 + +  +K+F +  GN  + N  + K       Y   A     S+ F         S
Sbjct:   352 NNNKKTKGDNKKF-KETGNNFQKN--ENKKRQLQEEYDSEARKRARSERFAQISEFKPIS 408

Query:   714 RAVEDIDW-DALTVKGTCQEIEKRYLRLTSAPDPSTVRPEEVLEKALQMVQNSQK---NY 769
                ED    D+  V GT +++EK Y RLTSAP+P+ VR  +VL  +L+ V    +   NY
Sbjct:   409 TYYEDQRRKDSGAVVGTSEQLEKSYFRLTSAPNPAQVRSLKVLHDSLKYVVRKYEESHNY 468

Query:   770 LYKCDQLKSIRQDLTVQRIRNQLTAKVYETHARLAIENGDLPEYNQCQSQLKILY 824
              Y  DQL SIR DLTVQ I+++ T  VYE +AR++IEN DL E+NQCQ+QLK LY
Sbjct:   469 SYIIDQLNSIRLDLTVQHIKDEFTVHVYEKNARISIENNDLGEFNQCQAQLKSLY 523

 Score = 47 (21.6 bits), Expect = 4.1e-22, Sum P(2) = 4.1e-22
 Identities = 16/80 (20%), Positives = 34/80 (42%)

Query:     5 NQNQQGSTQNIASSVDP---NSVENRYVVDA-SQSQASSYFPSTTGSGAVSWATHGVNNQ 60
             ++ Q G  QN  + ++    ++ E  YV     Q+  ++ + S    G +    +   NQ
Sbjct:    92 DEGQNGPNQNQLNDLEGRIYDANEEAYVPPRLRQNPPNTAYSSANNFGTLPMTNNNNLNQ 151

Query:    61 STENGNLSNASYHHEQHTES 80
              T   N +N   ++ + T +
Sbjct:   152 DTVGNNQNNTVNNNSESTNN 171

 Score = 39 (18.8 bits), Expect = 2.8e-21, Sum P(2) = 2.8e-21
 Identities = 16/74 (21%), Positives = 34/74 (45%)

Query:   166 NSGSYVGPASYSATYYNPGD--YQTAGGYPSSGYSHQTTSWNEGNYTNYTSHQYSNYTSD 223
             N  +YV P        NP +  Y +A  + +   ++   + N+    N  ++  +N  S+
Sbjct:   114 NEEAYVPPRLRQ----NPPNTAYSSANNFGTLPMTNNN-NLNQDTVGNNQNNTVNN-NSE 167

Query:   224 TSGAYSSGTAPATS 237
             ++  + +G APA +
Sbjct:   168 STNNFQAGMAPAAA 181


>FB|FBgn0038642
            symbol:Muc91C "Mucin 91C" species:7227 "Drosophila
            melanogaster" [GO:0005201 "extracellular matrix structural
            constituent" evidence=ISM] [GO:0031012 "extracellular matrix"
            evidence=ISM] [GO:0022008 "neurogenesis" evidence=IMP]
            EMBL:AE014297 GO:GO:0022008 eggNOG:NOG12793 GO:GO:0031012
            GO:GO:0005201 GeneTree:ENSGT00700000104744 RefSeq:NP_650744.1
            UniGene:Dm.10760 EnsemblMetazoa:FBtr0083687 GeneID:42246
            KEGG:dme:Dmel_CG7709 UCSC:CG7709-RA CTD:42246 FlyBase:FBgn0038642
            InParanoid:Q9VE45 OMA:GPYPSAP PhylomeDB:Q9VE45 GenomeRNAi:42246
            NextBio:827869 ArrayExpress:Q9VE45 Bgee:Q9VE45 Uniprot:Q9VE45
        Length = 950

 Score = 161 (61.7 bits), Expect = 1.1e-07, P = 1.1e-07
 Identities = 102/408 (25%), Positives = 153/408 (37%)

Query:     5 NQNQQGSTQNIASSVDPNSVENRYVVDASQSQASSYFPSTTGSGAVSWATHGVNNQSTEN 64
             N      TQ++ SS   +   + Y    +   +SSY   ++    +S  +      S+ +
Sbjct:   510 NYGAPSKTQSLGSSGYSSGPSSSYEAPVAPP-SSSYGAPSSSFQPISPPSSSYGAPSSGS 568

Query:    65 GNLSNASYHHEQHTESHVKSLQDXXXXXXXXXXXXXXXXXVAQDYSG-YTSYPNSSDPYA 123
             G+ S+ S+     +     S                     + +  G Y S P+SS  Y+
Sbjct:   569 GS-SSGSFSAAPSSLYSAPSKGSSGGSFQSAPSSSYSAPSASANSGGSYPSAPSSS--YS 625

Query:   124 YGSTAYPGXXXXXXXXPNHSYPQPVGAYQNSGAPYQ--PISSFQ------NSG-SYVGPA 174
               S++           P+ SY  P     NSG PY   P SS+       NSG SY  P+
Sbjct:   626 APSSS-SSSGGPYASAPSSSYSAPSSG-SNSGGPYPAAPSSSYSAPSASANSGGSY--PS 681

Query:   175 SYSATYYNPGDYQTAGG-YP---SSGYSHQTTSWNEGN-YTNYTSHQYS--NYTSDTSGA 227
             + S++Y  P     +GG YP   SS YS  + S N G  Y +  S  YS  + +S++ G 
Sbjct:   682 APSSSYSAPSPGSNSGGPYPAAPSSSYSAPSPSANSGGPYASAPSSSYSAPSSSSNSGGP 741

Query:   228 Y-----SSGTAPATSLQYQQQYKQW-ADYYSQTEVSCAPGTENLSVASSSNQVLQPPGVT 281
             Y     SS +AP++S      Y    +  YS    S + G    S  SSS     P   +
Sbjct:   742 YAAAPSSSYSAPSSSSSSGGPYPSAPSSSYSAPSSSLSSGGPYPSAPSSSYAAPSPSSNS 801

Query:   282 AG-YPTAHSQPAPIYHXXXXXXXXXXXXXXXXPAATSNGSHDSYWKHGTPSFQNRQVSPV 340
              G YP A   P+  Y                 P+ + +    SY   G PS  +   S  
Sbjct:   802 GGPYPAA---PSNSYSAPIAPPSSSYGAPASGPSPSFSAPSSSY---GAPSTGSGSSS-- 853

Query:   341 QPHYSKPLEQKTSYNNFQDQHKAACPQGPSSQY-AIGQQMAPSYQSPP 387
                +S      +S++       A  P  PSS Y A       S+ S P
Sbjct:   854 ---FSS---SSSSFSGASSSSSAGYPSAPSSSYGAPSTGSGHSFSSAP 895


>FB|FBgn0085362
            symbol:Vml "Vitelline membrane-like" species:7227 "Drosophila
            melanogaster" [GO:0009950 "dorsal/ventral axis specification"
            evidence=IGI] [GO:0060388 "vitelline envelope" evidence=IDA]
            [GO:0007305 "vitelline membrane formation involved in
            chorion-containing eggshell formation" evidence=ISM] [GO:0008316
            "structural constituent of vitelline membrane" evidence=ISM]
            [GO:0035805 "egg coat" evidence=ISM] EMBL:AE014298 GO:GO:0009950
            GeneTree:ENSGT00700000104744 PROSITE:PS51137 GO:GO:0060388
            InterPro:IPR013135 RefSeq:NP_001096866.1 UniGene:Dm.32785
            STRING:A8JUV4 EnsemblMetazoa:FBtr0112535 GeneID:5740271
            KEGG:dme:Dmel_CG34333 UCSC:CG34333-RA CTD:5740271
            FlyBase:FBgn0085362 eggNOG:NOG284187 InParanoid:A8JUV4 OMA:ISKYETI
            OrthoDB:EOG4KPRTT GenomeRNAi:5740271 NextBio:20891311 Bgee:A8JUV4
            Uniprot:A8JUV4
        Length = 578

 Score = 158 (60.7 bits), Expect = 1.1e-07, P = 1.1e-07
 Identities = 98/399 (24%), Positives = 151/399 (37%)

Query:   106 AQDYSGYTSYPNSSDPYAYGSTAYPGXXXXXXXXPNHSYPQPVGAYQNSGAPYQPISSFQ 165
             A  YS   + P+ S P A   +A           P++S P    A  +  AP  P  S Q
Sbjct:   174 APSYSAPAA-PSYSAPAAPSYSAPAAPSYSAPAAPSYSAP----AAPSYSAPSAPSYSAQ 228

Query:   166 NSGSYVGPA--SYSATYYNPGDYQTAGGYPSSGYSHQTT-SWNEGNYT----NYTSHQYS 218
              + SY  PA  SY A       Y    G PS  YS     S++  +Y+    +Y++ +  
Sbjct:   229 KTSSYSAPAAPSYHAPAAPASSYSAPAG-PS--YSAPAAPSYSAPSYSAPASSYSALKAP 285

Query:   219 NYTSDTSGAYSSGTAPATSLQYQQQYKQWADY-YSQTEVSC--APGTENLSV-ASSSNQV 274
             +Y++  + +YS+  AP+ S      Y   A   YS        AP  ++ S  A+ S   
Sbjct:   286 SYSAPAAPSYSAPAAPSYSSSASPSYSSPASSSYSAPAAPTYSAPKAQSYSAPAAPSYSA 345

Query:   275 LQPPGVTAGYPTAHSQPA-PIYHXXXXXXXXXXXXXXXXPAATSNGSHDSYWKHGTPSFQ 333
                P  +A   +++S PA P Y                   A+S     SY     PS+ 
Sbjct:   346 PAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPAAPSYSAPASS-----SYSAPAAPSYS 400

Query:   334 NRQVSPVQPHYSKPLEQKTSYNNFQDQHKAACPQGPSSQYAIGQQMAPSYQSP--PVQTS 391
                 +P  P YS P    +SY+       +A P  PS  Y+     APSY +P  P  ++
Sbjct:   401 ----APAAPSYSAPAS--SSYSAPAAPSYSA-PAAPS--YSA--PAAPSYSAPAAPSYSA 449

Query:   392 PQLDNRRVSKLQIPTNPRIASNLALGLPKTDKDSSTANAAAKPAYIGVSLAKSNEKVVSH 451
             P       ++     +   AS  +   PKT    S   ++  PA    S   S+     +
Sbjct:   450 PASSGYSAARAYSAGSAAPASGYSA--PKTSSGYSAPASSGSPAASSYSAPASSTASSGY 507

Query:   452 ADSRVEPGTFPKSLCGYVERALARCKGDAEIAASQAVMG 490
             +    +   + +S   +    +AR  G    AA  A  G
Sbjct:   508 SAPASKSSGYARSEMDHQILGMARTAGGYGSAAPSAAYG 546

 Score = 144 (55.7 bits), Expect = 3.6e-06, P = 3.6e-06
 Identities = 98/390 (25%), Positives = 150/390 (38%)

Query:   106 AQDYSGYTSYPNSSDPYAYGSTAYPGXXXXXXXXPNHSYPQPVGAYQNSGAPYQPISSFQ 165
             A  YS   + P+ S P A   +A           P++S P    A  +  AP  P  S  
Sbjct:   110 APSYSAPAA-PSYSAPAAPSYSAPASSSYSAPAAPSYSAP----AAPSYSAPAAPSYSAP 164

Query:   166 NSGSYVGPA--SYSA----TYYNPG--DYQT--AGGY--PSS-GYSHQTT-SWNEGNYTN 211
              S SY  PA  SYSA    +Y  P    Y    A  Y  P++  YS     S++  +  +
Sbjct:   165 ASSSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPSAPS 224

Query:   212 YTSHQYSNYTSDTSGAYSSGTAPATSLQYQQQYKQWADYYSQTEVSCAPGTENLSVASSS 271
             Y++ + S+Y++  + +Y +  APA+S      Y   A        + +    + S  +SS
Sbjct:   225 YSAQKTSSYSAPAAPSYHAPAAPASS------YSAPAGPSYSAPAAPSYSAPSYSAPASS 278

Query:   272 NQVLQPPGVTAGYPTAHSQPA-PIYHXXXXXXXXX-XXXXXXXPAATSNGSH--DSYWKH 327
                L+ P  +A    ++S PA P Y                  PAA +  +    SY   
Sbjct:   279 YSALKAPSYSAPAAPSYSAPAAPSYSSSASPSYSSPASSSYSAPAAPTYSAPKAQSYSAP 338

Query:   328 GTPSFQNRQVSPVQPHYSKPLEQKTSYNNFQDQHKAACPQGPS-SQYAIGQQMAP---SY 383
               PS+     +P  P YS P    +SY+       +A P  PS S  A     AP   SY
Sbjct:   339 AAPSYS----APAAPSYSAPAS--SSYSAPAAPSYSA-PAAPSYSAPAAPSYSAPASSSY 391

Query:   384 QSP--PVQTSPQLDNRRVSKLQIPTNPRIASNLALGLPKTDKDSSTA-NAAAKPAYIG-V 439
              +P  P  ++P   +         + P   S  A   P     ++ + +A A P+Y    
Sbjct:   392 SAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPA 451

Query:   440 SLAKSNEKVVSHADSRVEPG-TFPKSLCGY 468
             S   S  +  S   +    G + PK+  GY
Sbjct:   452 SSGYSAARAYSAGSAAPASGYSAPKTSSGY 481


>UNIPROTKB|O60318
            symbol:MCM3AP "80 kDa MCM3-associated protein" species:9606
            "Homo sapiens" [GO:0005634 "nucleus" evidence=IDA] [GO:0005829
            "cytosol" evidence=IDA] [GO:0003677 "DNA binding" evidence=TAS]
            [GO:0006260 "DNA replication" evidence=TAS] [GO:0006606 "protein
            import into nucleus" evidence=TAS] Pfam:PF03399 GO:GO:0005829
            GO:GO:0005634 GO:GO:0003677 GO:GO:0006260 GO:GO:0006606
            InterPro:IPR005062 PDB:4DHX PDBsum:4DHX EMBL:AJ010089 EMBL:AY590469
            EMBL:BC104958 EMBL:BC104960 EMBL:AB011144 EMBL:AB005543
            IPI:IPI00028954 PIR:T00339 RefSeq:NP_003897.2 UniGene:Hs.389037
            ProteinModelPortal:O60318 SMR:O60318 DIP:DIP-31696N IntAct:O60318
            MINT:MINT-1180375 STRING:O60318 PhosphoSite:O60318 PaxDb:O60318
            PeptideAtlas:O60318 PRIDE:O60318 Ensembl:ENST00000291688
            Ensembl:ENST00000397708 GeneID:8888 KEGG:hsa:8888 UCSC:uc002zir.1
            CTD:8888 GeneCards:GC21M047655 H-InvDB:HIX0016189 HGNC:HGNC:6946
            HPA:HPA021527 MIM:603294 neXtProt:NX_O60318 PharmGKB:PA30692
            eggNOG:COG5079 HOGENOM:HOG000113500 HOVERGEN:HBG052431
            InParanoid:O60318 OMA:AHQMKVQ OrthoDB:EOG44J2H4 PhylomeDB:O60318
            ChiTaRS:MCM3AP GenomeRNAi:8888 NextBio:33379 ArrayExpress:O60318
            Bgee:O60318 CleanEx:HS_MCM3AP Genevestigator:O60318
            GermOnline:ENSG00000160294 Uniprot:O60318
        Length = 1980

 Score = 138 (53.6 bits), Expect = 1.9e-05, Sum P(3) = 1.9e-05
 Identities = 58/236 (24%), Positives = 107/236 (45%)

Query:   769 YLYKCDQLKSIRQDLTVQRIRNQLTAKVYETHARLAIE-------------NGDLPEYN- 814
             Y +  ++ + IR+D+T Q + + LT  + E   R  I              +  +   N 
Sbjct:   720 YDFVWNRTRGIRKDITQQHLCDPLTVSLIEKCTRFHIHCAHFMCEEPMSSFDAKINNENM 779

Query:   815 -QCQSQLKILYAE----GIEGCCME--FSAYHLLCVILHSNNKRELLSLMSRLSDKAKQD 867
              +C   LK +Y +    G+  C  E  F  Y++L     S NK ++L  + +     +  
Sbjct:   780 TKCLQSLKEMYQDLRNKGVF-CASEAEFQGYNVLL----SLNKGDILREVQQFHPAVRNS 834

Query:   868 KAVKHALAVRAAVSSGNYIMFFRLYKTAPNLNTCLMDLYVEKMRFKAVSCMSRSYRPTVP 927
               VK A+   AA++S N++ FF+L ++A  LN CL+  Y  ++R  A+  ++ +Y  +  
Sbjct:   835 SEVKFAVQAFAALNSNNFVRFFKLVQSASYLNACLLHCYFSQIRKDALRALNFAYTVSTQ 894

Query:   928 VSYVAQVLGFTGVSPTNEECEERDSDGLEECVEWLKAHGASLVTDANGEVQLDAKA 983
              S +  + G   +     +CEE          ++L  HG   +T ++G V+L+  A
Sbjct:   895 RSTIFPLDGVVRML-LFRDCEE--------ATDFLTCHG---LTVSDGCVELNRSA 938

 Score = 67 (28.6 bits), Expect = 1.9e-05, Sum P(3) = 1.9e-05
 Identities = 41/164 (25%), Positives = 64/164 (39%)

Query:    10 GSTQNIASSVDPNS-VENRYVVDASQSQASSYFPSTTGSGAVSWATHGVNNQST----EN 64
             G TQ   SSV P S +E+     A+   +SS     TG    S  + G    ++    E 
Sbjct:    74 GFTQT--SSVGPFSGLEHTSTFVATSGPSSSSVLGNTGFSFKSPTSVGAFPSTSAFGQEA 131

Query:    65 GNLSNASYHHEQHTESHVKSLQDXXXXXXXXXXXXXXXXXVAQDYSGYT--SYPNSSDPY 122
             G + N+ +     TE   K L++                  +Q  SG+   S+P SS P 
Sbjct:   132 GEIVNSGFGK---TEFSFKPLENAVFKPILGAESEPEKTQ-SQIASGFFTFSHPISSAPG 187

Query:   123 AYGSTAYPGXXXXXXXXPNHSYPQPVGAYQNSGAPYQPISSFQN 166
                  ++P          N ++ +PV +  NS + + P  S QN
Sbjct:   188 GLAPFSFPQVTSSSATTSNFTFSKPVSS-NNSLSAFTPALSNQN 230

 Score = 42 (19.8 bits), Expect = 1.9e-05, Sum P(3) = 1.9e-05
 Identities = 10/29 (34%), Positives = 18/29 (62%)

Query:   542 SRRTKSRWEPLPEE-KPIDKLASSTNEIV 569
             SR +  + EPLP E +P+  L+ + + +V
Sbjct:   677 SRSSADQEEPLPHELRPLPVLSRTMDYLV 705

 Score = 39 (18.8 bits), Expect = 0.00015, Sum P(4) = 0.00015
 Identities = 10/28 (35%), Positives = 14/28 (50%)

Query:   682 NRFKGKNAGTGNLYVRRASALLISKSFD 709
             N F GK A    ++ RR+  L +   FD
Sbjct:   454 NHF-GKIAKVQRIFTRRSKKLAVVHFFD 480

 Score = 39 (18.8 bits), Expect = 0.00015, Sum P(4) = 0.00015
 Identities = 38/164 (23%), Positives = 58/164 (35%)

Query:   530 PLSALSKNKRSPSRRTKSRWEPLPEEKPIDKLASSTNEIVKFSGWIHANEKDRKHISGSV 589
             P  A     R       S+ EPLP      K     +   +  G  H   +D   +S   
Sbjct:   262 PFQASKAGVRQGCEEAVSQVEPLPSLMKGLKRKEDQDRSPRRHG--HEPAEDSDPLSRGD 319

Query:   590 SKED----RLNNIKFHLSEQKSASKSFQRPVKRQRLSADGFKTEDNGDASSDSDKEQSLT 645
                D    RLN  +      ++    F+   +  RL     K E     S++SD   ++ 
Sbjct:   320 HPPDKRPVRLNRPRGGTLFGRTIQDVFKSNKEVGRLGNKEAKKETGFVESAESD-HMAIP 378

Query:   646 SYYSGAIALANSP--EERMRRENRSKRFD--RG----QGNRSET 681
                   +A +  P   +    E+R K+ D  RG    Q NRSE+
Sbjct:   379 GGNQSVLAPSRIPGVNKEEETESREKKEDSLRGTPARQSNRSES 422


>UNIPROTKB|F1PJG2
            symbol:SAC3D1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0051298 "centrosome duplication"
            evidence=IEA] [GO:0051225 "spindle assembly" evidence=IEA]
            [GO:0005819 "spindle" evidence=IEA] [GO:0005813 "centrosome"
            evidence=IEA] Pfam:PF03399 GO:GO:0005813 GO:GO:0051298
            GO:GO:0051225 GO:GO:0005819 InterPro:IPR005062
            GeneTree:ENSGT00530000063781 EMBL:AAEX03011643
            Ensembl:ENSCAFT00000022212 OMA:RLHRFEV Uniprot:F1PJG2
        Length = 407

 Score = 135 (52.6 bits), Expect = 1.9e-05, P = 1.9e-05
 Identities = 67/252 (26%), Positives = 111/252 (44%)

Query:   771 YKCDQLKSIRQDLTVQRIRNQLTAKVYETH--------ARLAIEN--GDL-PEYNQCQSQ 819
             +  D+L+++R DL +Q       A V E          ARL  +   G   P   Q Q Q
Sbjct:   149 FVADRLRAVRLDLALQGADGVEAAAVLEAALAVLLAVVARLGPDGTRGPADPVLLQAQVQ 208

Query:   820 -----LKILYAEGIEGCCMEFSAYHLLCVILHSNNKRELLSLMSRLSDKAKQDKAVKHAL 874
                  L+  YA+G  G     +A+  L  +L++    E L  + +L D  +   A++ AL
Sbjct:   209 EGFGSLRRCYAQGA-GSRPRQAAFQGL-FLLYNLGSVEALHEVLQLPDALRSCPALRRAL 266

Query:   875 AVRAAVSSGNYIMFFRLYKTAPNLNTCLMDLYVEKMRFKAVSCMSRSYR-P---TVPVSY 930
             AV +A   GN    FRL +  P L +C +  ++ + R  A++ ++R+   P   T+P+ +
Sbjct:   267 AVDSAFREGNTARLFRLLRILPYLQSCAVRCHIGRARRGALARLARALSTPKGQTLPLGF 326

Query:   931 VAQVLGFTGVSPTNEECEERDS--DGLEECVEWLKAHGAS--LVTDANGEVQLDAKASSS 986
             +  +L   G     + C+      DG E  V +L+ H     L      +V + +K +  
Sbjct:   327 IVHLLALDGPEEARDLCQAHGLPLDGQERVV-FLRGHYTEEGLPPAGTCKVLVGSKLAGR 385

Query:   987 TL---FMPEPED 995
             TL    M E ED
Sbjct:   386 TLEEVVMAEEED 397


>ZFIN|ZDB-GENE-040715-1
            symbol:mcm3ap "MCM3 minichromosome maintenance
            deficient 3 (S. cerevisiae) associated protein" species:7955 "Danio
            rerio" [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0000166
            "nucleotide binding" evidence=IEA] InterPro:IPR000504
            InterPro:IPR012677 Pfam:PF00076 PROSITE:PS50102 SMART:SM00360
            Pfam:PF03399 ZFIN:ZDB-GENE-040715-1 GO:GO:0000166
            Gene3D:3.30.70.330 GO:GO:0003676 InterPro:IPR005062
            GeneTree:ENSGT00530000063781 EMBL:BX255930 EMBL:BX001051
            IPI:IPI00608713 Ensembl:ENSDART00000008053 Bgee:F1Q712
            Uniprot:F1Q712
        Length = 2118

 Score = 115 (45.5 bits), Expect = 2.5e-05, Sum P(2) = 2.5e-05
 Identities = 51/223 (22%), Positives = 101/223 (45%)

Query:   764 NSQKNYLYKCDQLKSIRQDLTVQRIRNQLTAKVYETHARLAIE-------------NGDL 810
             N +  Y +  ++ + IR+D+T Q + +  T  + E   R  I              +  +
Sbjct:   767 NCRDWYDFVWNRTRGIRKDITQQHLCDPETVSLIEKCTRFHIHCAHHLCQEPMMSFDAKI 826

Query:   811 PEYN--QCQSQLKILYAEGI--EGCC---MEFSAYHLLCVILHSNNKRELLSLMSRLSDK 863
                N  +C   LK +Y +    E  C    EF  Y++L  +    N  ++L  + +   +
Sbjct:   827 NNENMTKCLQSLKEMYQDLATKEVYCPKEAEFRQYNVLVKL----NDGDILREVQQFRKE 882

Query:   864 AKQDKAVKHALAVRAAVSSGNYIMFFRLYKTAPNLNTCLMDLYVEKMRFKAVSCMSRSY- 922
              ++   V  A+ V AA++S N++ FF+L   A  L++C++  Y  ++R +A+  ++ ++ 
Sbjct:   883 IRESPEVTFAVQVFAALNSNNFVRFFKLVSAASYLSSCILHRYFNQVRRQALKILNVAFT 942

Query:   923 ----RPTV-PVSYVAQVLGFTGVSPTNEECEERD---SDGLEE 957
                 R T+ PV    ++L F   +   E  ++     SDG+ E
Sbjct:   943 VGSQRSTIFPVEDFVRMLMFRNATEATEFIQQYGLTVSDGMVE 985

 Score = 83 (34.3 bits), Expect = 2.5e-05, Sum P(2) = 2.5e-05
 Identities = 118/552 (21%), Positives = 195/552 (35%)

Query:   143 SYPQPVGAYQNSGAPYQPISSFQNSGSYVGPASYSATYYNPGDYQTAGGYPSSGYSHQTT 202
             ++ Q  G  Q SG      SSF   G++  PA +       G     G  P   YS  T 
Sbjct:    64 TFGQTGGLSQGSGQT----SSFSFLGTH--PA-FGQPSLGQGGTLGFGQPPPPSYSQATG 116

Query:   203 SWNEGNYTNYTSHQYSNYTSDTSGAYSSGTAPATSLQYQQQYKQWADYY--SQTEVSCAP 260
                   +   ++    +Y S +S    S T   TS   Q  + Q +     + T +S   
Sbjct:   117 QTQNSAFGQPSAFSLPSYVSQSSSTSFSSTV--TS---QPSFGQLSGLPVPAATSISSIS 171

Query:   261 G-TENLSVASSSNQV-LQPPGVTAGYPTAHSQPAPIYHXXXXXXXXXXXXXXXXPAATSN 318
             G  EN    +  NQ   +PP      P     P P                        N
Sbjct:   172 GRAEN---PTGGNQFSFKPPNEAVFKPIFSVSPEP-----TSTNMSSASETAGSSKVVDN 223

Query:   319 GSHDSYWKHGTPSFQNRQVSPVQPHYSKPLEQKTSYNNFQDQHKAACPQGPSSQYAIGQQ 378
              S  S +    PS      S  QP  +  +    S  NF  Q +     G S Q+   Q 
Sbjct:   224 SSGGSLFSCVKPSALGFSFS--QPAAAPSVSLSNS--NFS-QKETVGGGGSSIQFTFSQP 278

Query:   379 MAPSYQSPPVQ--TSPQLDNRRVSKLQIPTNPRIAS-------NLALGLPKT----DKDS 425
               PS  S      T+P   +     LQ  ++ ++ +         A G+PKT    DK  
Sbjct:   279 ANPSSSSTSASQPTTPSTFSFTPQSLQPQSDTKVPAFGGTGIGPFAFGVPKTVRTEDKTG 338

Query:   426 STANAAAKPAYIGVSLAKSNEKVVSHADSRVEPGTF-PKSLCGYVERALARCKGDAEIAA 484
                ++  + A++   + +  E   +      E G   P+      +R L R +  A    
Sbjct:   339 DGQSSGGETAFVSFGMKRKEEPADAAKSDPSETGADGPRQPA---KRPLLRSRVMAG-GL 394

Query:   485 SQAVMGEIIKKANSDGTLFSRDWDV--EPLFPKPTTEAVTKDL-PTSTPLSALSKNKR-S 540
              +  M +++K   S      R+ ++   P  P P+++  T  L P ++ +S  +++    
Sbjct:   395 FRVAMSDVLKSKVSP---VKREDNLPERPDPPGPSSDLATALLRPQASLVSKKAEDVSVQ 451

Query:   541 PSRRTKSRWEPLPEEKPIDKLA--SSTNEIVKFSGWI--HANEKD--RKHIS--GSVSK- 591
             P  +T++       ++  D L   S T+  V     I  + N KD   +H    G V + 
Sbjct:   452 PEPQTRASGRRAARKESTDSLGGLSPTDATVIQCKNIPSNLNRKDLLMQHFGHFGKVLRV 511

Query:   592 --EDRLNNIKFHLSEQKSASKSFQRPVKRQRLSADGF---KTEDNGDASSDSDKEQSLTS 646
               + + N    H  +  SA+K+ ++    QR     F   K +  GD  +   + + +  
Sbjct:   512 YSKPQKNLAVVHFQDHTSAAKAKKKGKLFQRNEIQIFWQRKKQSPGDKPARPSEIKDVIE 571

Query:   647 YYSGAIALANSP 658
                 A AL  SP
Sbjct:   572 DPESASALLQSP 583


>UNIPROTKB|P11414
            symbol:POLR2A "DNA-directed RNA polymerase II subunit RPB1"
            species:10029 "Cricetulus griseus" [GO:0005634 "nucleus"
            evidence=ISS] [GO:0005665 "DNA-directed RNA polymerase II, core
            complex" evidence=ISS] [GO:0006366 "transcription from RNA
            polymerase II promoter" evidence=ISS] [GO:0006468 "protein
            phosphorylation" evidence=ISS] [GO:0004672 "protein kinase
            activity" evidence=ISS] InterPro:IPR000684 Pfam:PF05001
            PROSITE:PS00115 GO:GO:0003677 GO:GO:0006468 GO:GO:0006366
            GO:GO:0003899 GO:GO:0005665 EMBL:M19538 PIR:A27677
            ProteinModelPortal:P11414 Uniprot:P11414
        Length = 467

 Score = 135 (52.6 bits), Expect = 2.5e-05, P = 2.5e-05
 Identities = 96/348 (27%), Positives = 142/348 (40%)

Query:   106 AQDYSGYT-SYPNSSDPYAYGSTAYPGXXXXXXXXPNHSY-PQPVGAYQNSGAPYQPISS 163
             A D SG++  Y  +  P   GS   PG        P+  Y P P GA   S +P  P   
Sbjct:    48 ASDASGFSPGYSPAWSPTP-GSPGSPG--------PSSPYIPSPGGAMSPSYSPTSPAYE 98

Query:   164 FQNSGSYVGPA-SYSATYYNPGDYQTAGGY-PSS-GYSHQTTSWNEGNYT-NYTSHQYSN 219
              ++ G Y   + SYS T  +P    T+  Y P+S  YS  + S++  + + + TS  YS 
Sbjct:    99 PRSPGGYTPQSPSYSPT--SPSYSPTSPSYSPTSPNYSPTSPSYSPTSPSYSPTSPSYSP 156

Query:   220 YT---SDTSGAYSSGTAPATSLQYQQQYKQWADYYSQTEVSCAPGTENLSVASSSNQVLQ 276
              +   S TS +YS  T+P+ S      Y   +  YS T  S +P + + S  S S     
Sbjct:   157 TSPSYSPTSPSYSP-TSPSYS-PTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTS 214

Query:   277 P---PGVTAGYPTA--HSQPAPIYHXXXXXXXXXXXXXX-XXPA-ATSNGSHDSYWKHGT 329
             P   P   +  PT+  +S  +P Y                  P+ + ++ S+     + T
Sbjct:   215 PSYSPTSPSYSPTSPSYSPTSPSYSPTSPNYSPTSPNYTPTSPSYSPTSPSYSPTSPNYT 274

Query:   330 PSFQNRQVSPVQPHYS--KPLEQKTS--YNNFQDQHKAACPQ-GPSS-QYAIGQQMAPSY 383
             P+  N   SP  P YS   P    TS  Y+    ++    P   PSS  Y+     +PSY
Sbjct:   275 PTSPN--YSPTSPSYSPTSPSYSPTSPSYSPSSPRYTPQSPTYTPSSPSYS---PSSPSY 329

Query:   384 Q--SPP-VQTSPQLDNRRVSKLQIPTNPRIASNLALGLPKTDKDSSTA 428
                SP    TSP       S    PT+P+ +       P + K S T+
Sbjct:   330 SPTSPKYTPTSPSYSPS--SPEYTPTSPKYSPTSPKYSPTSPKYSPTS 375


>UNIPROTKB|I3LQ53
            symbol:I3LQ53 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0006366 "transcription from RNA polymerase II promoter"
            evidence=IEA] [GO:0005665 "DNA-directed RNA polymerase II, core
            complex" evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA]
            InterPro:IPR000684 Pfam:PF05001 PROSITE:PS00115 GO:GO:0003677
            GO:GO:0006366 GO:GO:0005665 GeneTree:ENSGT00700000104490
            EMBL:FP565284 Ensembl:ENSSSCT00000030016 OMA:YAESDYL Uniprot:I3LQ53
        Length = 543

 Score = 135 (52.6 bits), Expect = 3.1e-05, P = 3.1e-05
 Identities = 96/348 (27%), Positives = 142/348 (40%)

Query:   106 AQDYSGYT-SYPNSSDPYAYGSTAYPGXXXXXXXXPNHSY-PQPVGAYQNSGAPYQPISS 163
             A D SG++  Y  +  P   GS   PG        P+  Y P P GA   S +P  P   
Sbjct:   124 ASDASGFSPGYSPAWSPTP-GSPGSPG--------PSSPYIPSPGGAMSPSYSPTSPAYE 174

Query:   164 FQNSGSYVGPA-SYSATYYNPGDYQTAGGY-PSS-GYSHQTTSWNEGNYT-NYTSHQYSN 219
              ++ G Y   + SYS T  +P    T+  Y P+S  YS  + S++  + + + TS  YS 
Sbjct:   175 PRSPGGYTPQSPSYSPT--SPSYSPTSPSYSPTSPNYSPTSPSYSPTSPSYSPTSPSYSP 232

Query:   220 YT---SDTSGAYSSGTAPATSLQYQQQYKQWADYYSQTEVSCAPGTENLSVASSSNQVLQ 276
              +   S TS +YS  T+P+ S      Y   +  YS T  S +P + + S  S S     
Sbjct:   233 TSPSYSPTSPSYSP-TSPSYS-PTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTS 290

Query:   277 P---PGVTAGYPTA--HSQPAPIYHXXXXXXXXXXXXXX-XXPA-ATSNGSHDSYWKHGT 329
             P   P   +  PT+  +S  +P Y                  P+ + ++ S+     + T
Sbjct:   291 PSYSPTSPSYSPTSPSYSPTSPSYSPTSPNYSPTSPNYTPTSPSYSPTSPSYSPTSPNYT 350

Query:   330 PSFQNRQVSPVQPHYS--KPLEQKTS--YNNFQDQHKAACPQ-GPSS-QYAIGQQMAPSY 383
             P+  N   SP  P YS   P    TS  Y+    ++    P   PSS  Y+     +PSY
Sbjct:   351 PTSPN--YSPTSPSYSPTSPSYSPTSPSYSPSSPRYTPQSPTYTPSSPSYS---PSSPSY 405

Query:   384 Q--SPP-VQTSPQLDNRRVSKLQIPTNPRIASNLALGLPKTDKDSSTA 428
                SP    TSP       S    PT+P+ +       P + K S T+
Sbjct:   406 SPTSPKYTPTSPSYSPS--SPEYTPTSPKYSPTSPKYSPTSPKYSPTS 451


>FB|FBgn0036203
            symbol:Muc68D "Mucin 68D" species:7227 "Drosophila
            melanogaster" [GO:0016490 "structural constituent of peritrophic
            membrane" evidence=ISS] [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0008061 "chitin binding" evidence=IEA]
            [GO:0006030 "chitin metabolic process" evidence=IEA] [GO:0031012
            "extracellular matrix" evidence=ISM] [GO:0005201 "extracellular
            matrix structural constituent" evidence=ISM] InterPro:IPR002557
            Pfam:PF01607 PROSITE:PS50940 SMART:SM00494 GO:GO:0005576
            EMBL:AE014296 eggNOG:NOG12793 GO:GO:0031012 GO:GO:0008061
            GO:GO:0005201 CAZy:CBM14 Gene3D:2.170.140.10 SUPFAM:SSF57625
            GO:GO:0006030 GeneTree:ENSGT00700000104174 EMBL:AY075323
            RefSeq:NP_648504.2 UniGene:Dm.20068 SMR:Q9VTN2 MINT:MINT-900668
            STRING:Q9VTN2 EnsemblMetazoa:FBtr0076119 GeneID:39326
            KEGG:dme:Dmel_CG6004 UCSC:CG6004-RB CTD:39326 FlyBase:FBgn0036203
            InParanoid:Q9VTN2 OMA:STESSQD OrthoDB:EOG4WSTSF GenomeRNAi:39326
            NextBio:813085 Uniprot:Q9VTN2
        Length = 1514

 Score = 137 (53.3 bits), Expect = 7.1e-05, P = 7.1e-05
 Identities = 126/692 (18%), Positives = 252/692 (36%)

Query:     5 NQNQQGSTQNIASSVD-PNSVENRYVVDASQSQASSYFPSTTGSGAVSWATHGVNNQSTE 63
             N+   G+T + +S+   P+S +      +S S++   F  +T +   S ++  + N ST+
Sbjct:   283 NEPSTGATDDSSSTESLPDSTQE----SSSSSESPVSFELSTEATNESSSSESLPNSSTQ 338

Query:    64 NGNLSN-ASYHHEQHTESHVKSLQ-DXXXXXXXXXXXXXXXXXVAQDYSGYTSYPNSSDP 121
             + + S   S+  E  T++  +S   +                 ++ + S   +  +SS  
Sbjct:   339 DSSSSTETSFQTESTTDATDESSSTESQPDSTTQESSSSTEGPLSTESSTAVTDQSSSTE 398

Query:   122 YAYGSTAYPGXXXXXXXXPNHSYPQPVGAYQNSGAPYQPISSFQNSGSYVGPASY-SATY 180
              +  ST               S  +      +S    Q  ++ ++S S  GP S  S+T 
Sbjct:   399 SSQDSTTQESSSSTEGPLSTESSTEATNE-SSSTESSQDSTTQESSSSTEGPLSTESSTE 457

Query:   181 YNPGDYQTAGGYPSSGYSHQTTSWNEGNYTNYTSHQYSNYTSDTSGAYSSGTAPATSLQY 240
                    T     S+  + +++S  EG  +  +S + +N +S T  +  S T  ++S   
Sbjct:   458 ATNESSSTESSQDST--TQESSSSTEGPLSTESSTEATNESSSTESSQDSTTQESSSSS- 514

Query:   241 QQQYKQWADYYSQTEVSCAPGTENLSVASSSNQVLQPPGVTAGYPTAHSQPAPIYHXXXX 300
             +      +   +  E S    +++ +   SS+    P       P+  +  +        
Sbjct:   515 EGPLSTESSTEATNESSSTESSQDSTTQESSSSTESPLSTE---PSTEANESSSTESSQD 571

Query:   301 XXXXXXXXXXXXPAATSNGSHDSYWKHGTPSFQNRQVSPVQPHYSKPLEQKTSYNNFQDQ 360
                         P +T + +  +     T S Q+            PL  ++S     + 
Sbjct:   572 STTQESSSSTEDPLSTESSTEATNESSSTESSQDSTTQESSSSTEGPLSTESSTEGSNES 631

Query:   361 HKAACPQGPSSQYAIGQQMAPSYQSPPVQTSPQLDNRRVSKL---QIPTNPRIASNLALG 417
                      SSQ +  Q+ + S +SP + T P  +    S     Q  T    +S+    
Sbjct:   632 SSTE-----SSQDSTTQKSSSSTESP-LSTEPSTEANESSSTESSQDSTTQESSSSTEGP 685

Query:   418 L---PKTDKDSSTANAAAKPAYIGVSLAKSNEKVVSHADSRV-EPGTFPKSLCGYVERAL 473
             L   P T+ + S++  +++ +    S + S   + + + +   E  +   S     + + 
Sbjct:   686 LSTEPSTEANESSSTESSQDSTTQESSSSSEGPLSTESSTEANESSSTESSQDSTTQESS 745

Query:   474 ARCKGDAEIAASQAVMGEIIKKANSDGTLFSRDWDVE-PLFPKPTTEAVTKDLPTSTPLS 532
             +  +       S         +++ D T        E PL  +P+TEA   +  +ST  S
Sbjct:   746 SSTESPLSTEPSTEANESSSTESSQDSTTQESSSSTEGPLSTEPSTEA---NESSSTESS 802

Query:   533 ALSKNKRSPSRRTKSRWEPLPEEKPIDKLASSTNEIVKFSGWIHANEKDRKHISGSVSKE 592
               S  + S S    S   PL  E   +   SS+ E  + S    ++      +S   S E
Sbjct:   803 QDSTTQESSS----SSEGPLSTESSTEANESSSTESSQDSTTQESSSSTEDPLSTESSTE 858

Query:   593 DRLNNIKFHLSEQKS---ASKSFQRPVKRQRLSADGFKTEDNGDASSDSDKEQSLTSYYS 649
                 +     S+  +   +S S + P+  +  S +G     + ++S DS  ++S +S  S
Sbjct:   859 ATYESSSTESSQDSTTQESSSSTEGPLSTES-STEGSNESSSTESSQDSTTQESSSSTES 917

Query:   650 GAIALANSPEERMRRENRSKRFDRGQGNRSET 681
               ++   S E        S +    Q + S T
Sbjct:   918 -PLSTEPSTEANESSSTESSQDSTTQESSSST 948


>UNIPROTKB|F1N4T6
            symbol:LOC507750 "Uncharacterized protein" species:9913
            "Bos taurus" [GO:0051298 "centrosome duplication" evidence=IEA]
            [GO:0051225 "spindle assembly" evidence=IEA] [GO:0005819 "spindle"
            evidence=IEA] [GO:0005813 "centrosome" evidence=IEA] Pfam:PF03399
            GO:GO:0005813 GO:GO:0051298 GO:GO:0051225 GO:GO:0005819
            InterPro:IPR005062 GeneTree:ENSGT00530000063781 EMBL:DAAA02063546
            IPI:IPI00701413 Ensembl:ENSBTAT00000002904 Uniprot:F1N4T6
        Length = 335

 Score = 126 (49.4 bits), Expect = 0.00013, P = 0.00013
 Identities = 58/211 (27%), Positives = 97/211 (45%)

Query:   771 YKCDQLKSIRQDLTVQRIRNQLTAKVYETH--------ARLA--IENGDL-PEYNQCQSQ 819
             +  D+L+++R DL +Q   +  TA V E+         ARL     +G + P   Q Q Q
Sbjct:    87 FVADRLRAVRLDLALQTASDVETALVLESALAVLLAVVARLGPNATHGPVDPMLLQAQVQ 146

Query:   820 -----LKILYAEGIEGCCMEFSAYHLLCVILHSNNKRELLSLMSRLSDKAKQDKAVKHAL 874
                  L+  YA G  G     + +  L  +L++    E L  + RL    +   A++ AL
Sbjct:   147 ESFGSLRRCYALGA-GPHPRQATFQGL-FLLYNLGSVEALHEILRLPAALRSCPALRTAL 204

Query:   875 AVRAAVSSGNYIMFFRLYKTAPNLNTCLMDLYVEKMRFKAVSCMSRSYR-P---TVPVSY 930
             AV +A   GN    FRL +T P L +C +  +V + R  A++ ++R+   P   T+P+ +
Sbjct:   205 AVDSAFREGNAARLFRLLRTLPYLQSCAVQCHVGRARRGALARLARALSTPKGQTLPLGF 264

Query:   931 VAQVLGFTGVSPTNEECEERDS--DGLEECV 959
             +  +L   G +   + C+      DG E  V
Sbjct:   265 MVHLLALDGPNEARDLCQAHGLPLDGQERVV 295


>MGI|MGI:1930089
            symbol:Mcm3ap "minichromosome maintenance deficient 3 (S.
            cerevisiae) associated protein" species:10090 "Mus musculus"
            [GO:0000166 "nucleotide binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=ISO;IDA] [GO:0005737 "cytoplasm" evidence=IDA]
            [GO:0005829 "cytosol" evidence=ISO] MGI:MGI:1930089 Pfam:PF03399
            GO:GO:0005829 GO:GO:0005634 GO:GO:0005737 EMBL:CH466553
            InterPro:IPR005062 CTD:8888 eggNOG:COG5079 HOVERGEN:HBG052431
            OMA:AHQMKVQ OrthoDB:EOG44J2H4 EMBL:AJ006590 EMBL:BC052452
            IPI:IPI00125222 RefSeq:NP_062307.2 UniGene:Mm.30098
            ProteinModelPortal:Q9WUU9 SMR:Q9WUU9 STRING:Q9WUU9
            PhosphoSite:Q9WUU9 PaxDb:Q9WUU9 PRIDE:Q9WUU9
            Ensembl:ENSMUST00000170795 GeneID:54387 KEGG:mmu:54387
            GeneTree:ENSGT00530000063781 InParanoid:Q7TS87 NextBio:311220
            Bgee:Q9WUU9 Genevestigator:Q9WUU9 GermOnline:ENSMUSG00000001150
            Uniprot:Q9WUU9
        Length = 1971

 Score = 135 (52.6 bits), Expect = 0.00015, Sum P(3) = 0.00015
 Identities = 62/236 (26%), Positives = 106/236 (44%)

Query:   769 YLYKCDQLKSIRQDLTVQRIRNQLTAKVYETHARLAIE-------------NGDLPEYN- 814
             Y +  ++ + IR+D+T Q + + LT  + E   R  I              +  +   N 
Sbjct:   713 YDFVWNRTRGIRKDITQQHLCDPLTVSLIEKCTRFHIHCAHFMCEEPMSSFDAKINNENM 772

Query:   815 -QCQSQLKILYAE----GIEGCCME--FSAYHLLCVILHSNNKRELLSLMSRLSDKAKQD 867
              +C   LK +Y +    G+  C  E  F  Y++L  +    NK ++L  + +     +  
Sbjct:   773 TKCLQSLKEMYQDLRNKGVF-CASEAEFQGYNVLLNL----NKGDILREVQQFHPDVRNS 827

Query:   868 KAVKHALAVRAAVSSGNYIMFFRLYKTAPNLNTCLMDLYVEKMRFKAVSCMSRSYRPTVP 927
               V  A+   AA++S N++ FF+L ++A  LN CL+  Y  ++R  A+  ++ +Y  TV 
Sbjct:   828 PEVNFAVQAFAALNSNNFVRFFKLVQSASYLNACLLHCYFNQIRKDALRALNVAY--TVS 885

Query:   928 VSYVAQVLGFTGVSPTNEECEERDSDGLEECVEWLKAHGASLVTDANGEVQLDAKA 983
                 + V    GV         RDS   EE   +L  HG   +T A+G V+L+  A
Sbjct:   886 TQR-STVFPLDGVV---RMLLFRDS---EEATNFLNYHG---LTVADGCVELNRSA 931

 Score = 60 (26.2 bits), Expect = 0.00015, Sum P(3) = 0.00015
 Identities = 29/123 (23%), Positives = 50/123 (40%)

Query:   173 PASYSATYYNPGDYQTAG----GYPSSGYSHQTTSWNEGNYTNYTSHQYSNYTSDTSGAY 228
             P++++ +    G YQT      G PS    + T S       +    Q  ++ + + G++
Sbjct:    12 PSAFAVSSSTTGTYQTKSPFRFGQPSLFGQNSTPS------KSLAFSQVPSFATPSGGSH 65

Query:   229 SSGTAPATSLQYQQQYKQWADY-----YSQTEVSCAPGTENLSVASSSNQVLQPPGVTAG 283
             SS + PA  L        ++       ++ T  S  PG    S  S+S+  + P G T G
Sbjct:    66 SS-SLPAFGLTQTSSVGLFSSLESTPSFAATSSSSVPGNTAFSFKSTSSVGVFPSGATFG 124

Query:   284 YPT 286
               T
Sbjct:   125 PET 127

 Score = 43 (20.2 bits), Expect = 0.00015, Sum P(3) = 0.00015
 Identities = 15/50 (30%), Positives = 21/50 (42%)

Query:   511 PLFPKP-TTEAVTKDLPTSTPLSALSKNKRS--PSRRTKSRWEPLPEEKP 557
             P F  P + + V ++   ST     S +  S  P+    S  EP P  KP
Sbjct:   214 PAFASPLSNQNVEEEKRVSTSAFGSSNSSFSTFPTASPGSLGEPFPANKP 263

 Score = 37 (18.1 bits), Expect = 0.00058, Sum P(3) = 0.00057
 Identities = 10/29 (34%), Positives = 17/29 (58%)

Query:   542 SRRTKSRWEPLPEE-KPIDKLASSTNEIV 569
             SR +  + EPLP E +P   L+ + + +V
Sbjct:   670 SRSSADQEEPLPHELRPSAVLSRTMDYLV 698


>UNIPROTKB|P24928
            symbol:POLR2A "DNA-directed RNA polymerase II subunit RPB1"
            species:9606 "Homo sapiens" [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0003968 "RNA-directed RNA polymerase activity"
            evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0003677 "DNA binding" evidence=NAS] [GO:0003899 "DNA-directed
            RNA polymerase activity" evidence=NAS] [GO:0006355 "regulation of
            transcription, DNA-dependent" evidence=NAS] [GO:0006366
            "transcription from RNA polymerase II promoter"
            evidence=IDA;NAS;TAS] [GO:0005634 "nucleus" evidence=IDA;NAS]
            [GO:0005665 "DNA-directed RNA polymerase II, core complex"
            evidence=IDA] [GO:0004672 "protein kinase activity" evidence=IDA]
            [GO:0005730 "nucleolus" evidence=IDA] [GO:0000398 "mRNA splicing,
            via spliceosome" evidence=TAS] [GO:0005654 "nucleoplasm"
            evidence=TAS] [GO:0006281 "DNA repair" evidence=TAS] [GO:0006283
            "transcription-coupled nucleotide-excision repair" evidence=TAS]
            [GO:0006289 "nucleotide-excision repair" evidence=TAS] [GO:0006367
            "transcription initiation from RNA polymerase II promoter"
            evidence=TAS] [GO:0006368 "transcription elongation from RNA
            polymerase II promoter" evidence=TAS] [GO:0006370
            "7-methylguanosine mRNA capping" evidence=TAS] [GO:0008380 "RNA
            splicing" evidence=TAS] [GO:0010467 "gene expression" evidence=TAS]
            [GO:0016032 "viral reproduction" evidence=TAS] [GO:0050434
            "positive regulation of viral transcription" evidence=TAS]
            [GO:0031625 "ubiquitin protein ligase binding" evidence=IPI]
            [GO:0006468 "protein phosphorylation" evidence=IDA]
            Reactome:REACT_216 Reactome:REACT_71 InterPro:IPR000684
            InterPro:IPR000722 InterPro:IPR006592 InterPro:IPR007066
            InterPro:IPR007073 InterPro:IPR007075 InterPro:IPR007080
            InterPro:IPR007081 InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983
            Pfam:PF04990 Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000
            Pfam:PF05001 PROSITE:PS00115 SMART:SM00663 Reactome:REACT_116125
            EMBL:CH471108 GO:GO:0016032 GO:GO:0006355 GO:GO:0046872
            GO:GO:0003677 Reactome:REACT_1675 GO:GO:0006468 GO:GO:0006368
            Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0006367 GO:GO:0000398
            Reactome:REACT_1788 GO:GO:0006370 GO:GO:0050434 GO:GO:0006283
            Reactome:REACT_1892 EMBL:AC113189 GO:GO:0003899 PDB:2GHQ PDB:2GHT
            PDBsum:2GHQ PDBsum:2GHT eggNOG:COG0086 GO:GO:0003968 GO:GO:0005665
            HOGENOM:HOG000222975 OMA:KVLPWST KO:K03006 EMBL:X63564 EMBL:X74874
            EMBL:X74873 EMBL:X74872 EMBL:X74871 EMBL:X74870 EMBL:BC137231
            IPI:IPI00031627 PIR:I38186 PIR:S21054 RefSeq:NP_000928.1
            UniGene:Hs.270017 PDB:2LTO PDBsum:2LTO ProteinModelPortal:P24928
            SMR:P24928 DIP:DIP-29011N IntAct:P24928 MINT:MINT-156582
            STRING:P24928 PhosphoSite:P24928 DMDM:281185484 PaxDb:P24928
            PRIDE:P24928 Ensembl:ENST00000322644 GeneID:5430 KEGG:hsa:5430
            UCSC:uc002ghf.4 CTD:5430 GeneCards:GC17P007387 H-InvDB:HIX0173727
            HGNC:HGNC:9187 HPA:CAB012226 HPA:CAB016388 HPA:CAB022311
            HPA:HPA021563 MIM:180660 neXtProt:NX_P24928 PharmGKB:PA33507
            HOVERGEN:HBG004339 InParanoid:P24928 OrthoDB:EOG4JWVCM
            BindingDB:P24928 ChEMBL:CHEMBL1641353 ChiTaRS:POLR2A
            EvolutionaryTrace:P24928 GenomeRNAi:5430 NextBio:21009
            ArrayExpress:P24928 Bgee:P24928 CleanEx:HS_POLR2A
            Genevestigator:P24928 GermOnline:ENSG00000181222 Uniprot:P24928
        Length = 1970

 Score = 135 (52.6 bits), Expect = 0.00016, P = 0.00016
 Identities = 96/348 (27%), Positives = 142/348 (40%)

Query:   106 AQDYSGYT-SYPNSSDPYAYGSTAYPGXXXXXXXXPNHSY-PQPVGAYQNSGAPYQPISS 163
             A D SG++  Y  +  P   GS   PG        P+  Y P P GA   S +P  P   
Sbjct:  1551 ASDASGFSPGYSPAWSPTP-GSPGSPG--------PSSPYIPSPGGAMSPSYSPTSPAYE 1601

Query:   164 FQNSGSYVGPA-SYSATYYNPGDYQTAGGY-PSS-GYSHQTTSWNEGNYT-NYTSHQYSN 219
              ++ G Y   + SYS T  +P    T+  Y P+S  YS  + S++  + + + TS  YS 
Sbjct:  1602 PRSPGGYTPQSPSYSPT--SPSYSPTSPSYSPTSPNYSPTSPSYSPTSPSYSPTSPSYSP 1659

Query:   220 YT---SDTSGAYSSGTAPATSLQYQQQYKQWADYYSQTEVSCAPGTENLSVASSSNQVLQ 276
              +   S TS +YS  T+P+ S      Y   +  YS T  S +P + + S  S S     
Sbjct:  1660 TSPSYSPTSPSYSP-TSPSYS-PTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTS 1717

Query:   277 P---PGVTAGYPTA--HSQPAPIYHXXXXXXXXXXXXXX-XXPA-ATSNGSHDSYWKHGT 329
             P   P   +  PT+  +S  +P Y                  P+ + ++ S+     + T
Sbjct:  1718 PSYSPTSPSYSPTSPSYSPTSPSYSPTSPNYSPTSPNYTPTSPSYSPTSPSYSPTSPNYT 1777

Query:   330 PSFQNRQVSPVQPHYS--KPLEQKTS--YNNFQDQHKAACPQ-GPSS-QYAIGQQMAPSY 383
             P+  N   SP  P YS   P    TS  Y+    ++    P   PSS  Y+     +PSY
Sbjct:  1778 PTSPN--YSPTSPSYSPTSPSYSPTSPSYSPSSPRYTPQSPTYTPSSPSYS---PSSPSY 1832

Query:   384 Q--SPP-VQTSPQLDNRRVSKLQIPTNPRIASNLALGLPKTDKDSSTA 428
                SP    TSP       S    PT+P+ +       P + K S T+
Sbjct:  1833 SPASPKYTPTSPSYSPS--SPEYTPTSPKYSPTSPKYSPTSPKYSPTS 1878


>RGD|1587326
            symbol:Polr2a "polymerase (RNA) II (DNA directed) polypeptide A"
            species:10116 "Rattus norvegicus" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0003677 "DNA binding" evidence=IEA;ISO]
            [GO:0003899 "DNA-directed RNA polymerase activity" evidence=IEA]
            [GO:0004672 "protein kinase activity" evidence=IEA;ISO] [GO:0005575
            "cellular_component" evidence=ND] [GO:0005634 "nucleus"
            evidence=ISO] [GO:0005665 "DNA-directed RNA polymerase II, core
            complex" evidence=IEA;ISO] [GO:0006366 "transcription from RNA
            polymerase II promoter" evidence=IEA;ISO] [GO:0006468 "protein
            phosphorylation" evidence=ISO] [GO:0008150 "biological_process"
            evidence=ND] [GO:0031625 "ubiquitin protein ligase binding"
            evidence=IEA;ISO] [GO:0005730 "nucleolus" evidence=ISO]
            InterPro:IPR000684 InterPro:IPR000722 InterPro:IPR006592
            InterPro:IPR007066 InterPro:IPR007073 InterPro:IPR007075
            InterPro:IPR007080 InterPro:IPR007081 InterPro:IPR007083
            Pfam:PF00623 Pfam:PF04983 Pfam:PF04990 Pfam:PF04992 Pfam:PF04997
            Pfam:PF04998 Pfam:PF05000 Pfam:PF05001 PROSITE:PS00115
            SMART:SM00663 RGD:1587326 GO:GO:0003677 GO:GO:0004672 GO:GO:0006366
            Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899 GO:GO:0005665
            GeneTree:ENSGT00700000104490 KO:K03006 CTD:5430 OrthoDB:EOG4JWVCM
            IPI:IPI00952328 RefSeq:XP_001079162.1 RefSeq:XP_343923.3
            UniGene:Rn.163136 Ensembl:ENSRNOT00000068013 GeneID:363633
            KEGG:rno:363633 UCSC:RGD:1587326 NextBio:683839 ArrayExpress:D4A5A6
            Uniprot:D4A5A6
        Length = 1970

 Score = 135 (52.6 bits), Expect = 0.00016, P = 0.00016
 Identities = 96/348 (27%), Positives = 142/348 (40%)

Query:   106 AQDYSGYT-SYPNSSDPYAYGSTAYPGXXXXXXXXPNHSY-PQPVGAYQNSGAPYQPISS 163
             A D SG++  Y  +  P   GS   PG        P+  Y P P GA   S +P  P   
Sbjct:  1551 ASDASGFSPGYSPAWSPTP-GSPGSPG--------PSSPYIPSPGGAMSPSYSPTSPAYE 1601

Query:   164 FQNSGSYVGPA-SYSATYYNPGDYQTAGGY-PSS-GYSHQTTSWNEGNYT-NYTSHQYSN 219
              ++ G Y   + SYS T  +P    T+  Y P+S  YS  + S++  + + + TS  YS 
Sbjct:  1602 PRSPGGYTPQSPSYSPT--SPSYSPTSPSYSPTSPNYSPTSPSYSPTSPSYSPTSPSYSP 1659

Query:   220 YT---SDTSGAYSSGTAPATSLQYQQQYKQWADYYSQTEVSCAPGTENLSVASSSNQVLQ 276
              +   S TS +YS  T+P+ S      Y   +  YS T  S +P + + S  S S     
Sbjct:  1660 TSPSYSPTSPSYSP-TSPSYS-PTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTS 1717

Query:   277 P---PGVTAGYPTA--HSQPAPIYHXXXXXXXXXXXXXX-XXPA-ATSNGSHDSYWKHGT 329
             P   P   +  PT+  +S  +P Y                  P+ + ++ S+     + T
Sbjct:  1718 PSYSPTSPSYSPTSPSYSPTSPSYSPTSPNYSPTSPNYTPTSPSYSPTSPSYSPTSPNYT 1777

Query:   330 PSFQNRQVSPVQPHYS--KPLEQKTS--YNNFQDQHKAACPQ-GPSS-QYAIGQQMAPSY 383
             P+  N   SP  P YS   P    TS  Y+    ++    P   PSS  Y+     +PSY
Sbjct:  1778 PTSPN--YSPTSPSYSPTSPSYSPTSPSYSPSSPRYTPQSPTYTPSSPSYS---PSSPSY 1832

Query:   384 Q--SPP-VQTSPQLDNRRVSKLQIPTNPRIASNLALGLPKTDKDSSTA 428
                SP    TSP       S    PT+P+ +       P + K S T+
Sbjct:  1833 SPTSPKYTPTSPSYSPS--SPEYTPTSPKYSPTSPKYSPTSPKYSPTS 1878


>FB|FBgn0038492
            symbol:Mur89F "Mucin related 89F" species:7227 "Drosophila
            melanogaster" [GO:0008061 "chitin binding" evidence=IEA]
            [GO:0006030 "chitin metabolic process" evidence=IEA] [GO:0005576
            "extracellular region" evidence=IEA] [GO:0031012 "extracellular
            matrix" evidence=ISM] [GO:0005201 "extracellular matrix structural
            constituent" evidence=ISM] InterPro:IPR002557 Pfam:PF01607
            PROSITE:PS50940 SMART:SM00494 EMBL:AE014297 GO:GO:0005576
            eggNOG:NOG12793 GO:GO:0031012 GO:GO:0008061 GO:GO:0005201
            CAZy:CBM14 Gene3D:2.170.140.10 SUPFAM:SSF57625 GO:GO:0006030
            GeneTree:ENSGT00700000104174 RefSeq:NP_650611.1 UniGene:Dm.16781
            ProteinModelPortal:Q9VEL9 SMR:Q9VEL9 PRIDE:Q9VEL9
            EnsemblMetazoa:FBtr0083413 GeneID:42080 KEGG:dme:Dmel_CG4090
            UCSC:CG4090-RA CTD:42080 FlyBase:FBgn0038492 InParanoid:Q9VEL9
            OMA:QWNNNNQ OrthoDB:EOG46Q59B PhylomeDB:Q9VEL9 GenomeRNAi:42080
            NextBio:827084 ArrayExpress:Q9VEL9 Bgee:Q9VEL9 Uniprot:Q9VEL9
        Length = 2112

 Score = 138 (53.6 bits), Expect = 0.00018, Sum P(3) = 0.00018
 Identities = 68/269 (25%), Positives = 106/269 (39%)

Query:     9 QGSTQNIASSVDPNSVENRYVVDASQSQASSYFPSTTGSGAVSWATHGVNNQSTENGNLS 68
             Q STQ+  +S   ++  ++     S S +S    ST+ S + S ++   +  S+ N + S
Sbjct:   224 QTSTQSSQTSYQNSTTSSQQSSSTSSSNSSQSSSSTSSSTSNSSSSQESSTSSSSNQSSS 283

Query:    69 NASYHHEQHTESHVKSLQDXXXXXXXXXXXXXXXXXVAQDYSGYTSYPNSSDPYAYGSTA 128
              +S H E  +ES      +                  +   +  +S  N S     GS  
Sbjct:   284 TSSNHQESSSESSNNQESNSGSSSNQESSTSSSSNQGSSSQNAGSSNQNQSS----GSNQ 339

Query:   129 YPGXXXXXXXXPNHSYPQPVGAYQNSGAPYQPISSFQNSGSYVGPASYSATYYNPGDYQT 188
               G         N S  Q   + QNSG+  Q  SS QNSGS     + S+     G  Q+
Sbjct:   340 SSGSSQSSNSNQNSSNNQNQSS-QNSGSN-QSSSSSQNSGS-----NQSS-----GSNQS 387

Query:   189 AGGYPSSGYSHQTTSWNEGNYTNYTSHQYSNYTSDTSGAYSSGTAPA-TSLQYQQQY--- 244
             +G   SS  S+Q++  N+ +  N +S   SN TS +S   S    P  T  + +  Y   
Sbjct:   388 SGNNQSSS-SNQSSGSNQSSNNNQSSSSNSNQTSSSSQNNSGSNKPVPTECEDENTYIPD 446

Query:   245 -KQWADYY----------SQTEVSCAPGT 262
              +  A +Y           Q   +C PGT
Sbjct:   447 KEDCAKFYRCRQDKDGKLEQVPFTCGPGT 475

 Score = 125 (49.1 bits), Expect = 0.00095, Sum P(3) = 0.00095
 Identities = 133/683 (19%), Positives = 229/683 (33%)

Query:    34 QSQASSYFPSTTGSGAVSWATHGVNNQSTENGNLSNASYHHEQHTESHVKSLQDXXXXXX 93
             QS +SS    +  SG  S  +   N  S+ N + SN S  + Q + ++  S  +      
Sbjct:   498 QSGSSS---GSNNSGQQSSGSSSNNQGSSNNQSSSNQSSSNNQGSSNNQGSSSNQGSSSN 554

Query:    94 XXXXXXXXXXXVAQDYSGYTSYPNSSDPYAYGSTAYPGXXXXXXXXPNHSYPQPVGAYQN 153
                          Q  S   S  N S     GS++  G         N S     G+  N
Sbjct:   555 QGSSSNQGSSS-NQGSSSNQSSSNQSSSSNQGSSSNQGSSSNQGSSSNQSSSSNQGSSSN 613

Query:   154 SGAPY-QPISSFQNSGSYVGPASYSATYYNPGDYQTAGGYPSSGYSHQTTSWNEGNYTNY 212
              G+   Q  SS Q S S  G +S   +  N G     G   + G S   +S N+G+ ++ 
Sbjct:   614 QGSSSNQSSSSNQGSSSNQGSSSNQGSSSNQGSSSNQGSSSNQGSSSNQSS-NQGSTSSS 672

Query:   213 TSHQYSNYTSDTSGAYSSGTAPATSLQYQQQY----KQWADYYSQTEVSCAPGTENLSVA 268
             ++   S+  ++++   +  + P    Q  + Y    K  A +Y   E + + G   +   
Sbjct:   673 SNQSSSSSNNNSTSTQTKPSNPDGECQDTETYLADKKDCARFYRCVE-NGSGGFNKVPFD 731

Query:   269 SSSNQVLQPPGVTAGYPTAHSQPAPIYHXXXXXXXXXXXXXXXXPAATSNGSHD---SYW 325
              S   V  P      +PT   +                       +++S GS     S  
Sbjct:   732 CSPGTVWDPDTKGCNHPTDVQKEQCKAMANGSGSSSSQGSSSNQGSSSSQGSSSNQGSSS 791

Query:   326 KHGTPSFQ----NRQVSPVQPHYSKP---LEQKTSYNNFQDQHK-AACPQGPSSQYAIGQ 377
               G+ S Q    N+  S  Q   S       Q +S N     ++ ++  QG SS      
Sbjct:   792 NQGSSSNQGSSSNQGSSSNQGSSSNQGSSSNQGSSSNQGSSSNQGSSSSQGSSSNQGSSS 851

Query:   378 QMAPSYQSPPVQTSPQLDNRRVSKLQ-IPTNPRIASNLALGLPKTDKDSSTANAAAKPAY 436
                 S             N+  S  Q   +N   +SN   G   +   SS   +++    
Sbjct:   852 NQGSSSNQGSSSNQGSSSNQGSSSSQGSSSNEGSSSNQ--GSSSSQGSSSNQGSSSNQGS 909

Query:   437 IGVSLAKSNEKVVSHADSRVEPGTFPKSLCGYVERALARCKGDAEIAASQAVMGEIIK-- 494
                  + SN+   S+  S    G+         + +  +   +   +++Q       K  
Sbjct:   910 SSNQGSSSNQGSSSNQGSSSNQGSSSNQSSSSNQSSSNQSSSNQSSSSNQTSSSTTQKPF 969

Query:   495 ----KANSDGTLFSRDWDVEPLFPKPTTE--AVTKDLPTSTPLSALSKNKRSPSRRTKSR 548
                 K  S+ T  + + +    +          TK   T  P +       S +   + +
Sbjct:   970 KPAEKCESEETFLADNENCSKFYRCVDNGKGGFTKVSFTCPPNTLWDPEANSCNHPDQIQ 1029

Query:   549 WEPLPEEKPIDKLASSTNEIVKFSGWIHANEKDRKHISGSVSKEDRLNNIKFHLSEQKSA 608
              +PL  +K + +  SS+N     S    +N       SGS S     N+     S   + 
Sbjct:  1030 TKPLKCKKVVSQGGSSSNSTSNSSS--SSNNSGSSSNSGSSSSSSSSNSG----SSSNTG 1083

Query:   609 SKSFQRPVKRQRLSADGFKTEDNGDASSDSDKEQSLTSYYSGAIALANSPEERMRRENRS 668
             S S          S  G  +     + S+S   QS +S  S + + +N+  +     + S
Sbjct:  1084 SSSNSGASSSGGSSNQGSSSNSGSSSGSNSSGNQSTSSSTSSSSSSSNNNNQGSSSSSSS 1143

Query:   669 KRFDRG-QGNRSETNRFKGKNAG 690
                    + N SET +  G+  G
Sbjct:  1144 SSSSTSSKPNPSETCKVNGQFIG 1166

 Score = 67 (28.6 bits), Expect = 0.00095, Sum P(3) = 0.00095
 Identities = 24/82 (29%), Positives = 37/82 (45%)

Query:     5 NQNQQ-GSTQNIASSVDPNSVEN--RYVVDASQSQASSYFPSTT---GSGAVSWATHGVN 58
             NQNQ  GS Q+  SS   NS +N       +SQ+  S+   S++   GS   S +     
Sbjct:   330 NQNQSSGSNQSSGSSQSSNSNQNSSNNQNQSSQNSGSNQSSSSSQNSGSNQSSGSNQSSG 389

Query:    59 NQSTENGNLSNASYHHEQHTES 80
             N  + + N S+ S     + +S
Sbjct:   390 NNQSSSSNQSSGSNQSSNNNQS 411

 Score = 61 (26.5 bits), Expect = 0.00018, Sum P(3) = 0.00018
 Identities = 29/134 (21%), Positives = 54/134 (40%)

Query:   562 ASSTNEIVKFSGWIHANEKDRKHISGSVSKEDRLNNIKFHLSEQKSASKSFQRPVKRQRL 621
             +S +N   + S    +N +   +   S S +   NN     S     S S Q     Q  
Sbjct:   502 SSGSNNSGQQSSGSSSNNQGSSNNQSS-SNQSSSNN---QGSSNNQGSSSNQGSSSNQGS 557

Query:   622 SADGFKTEDNGDASSDSDKEQSLTSYYSGAIALANSPEERMRRENRSKRFDRGQGNR--S 679
             S++   + + G +S+ S   QS +S   G+ +   S   +    N+S   ++G  +   S
Sbjct:   558 SSNQGSSSNQGSSSNQSSSNQSSSSN-QGSSSNQGSSSNQGSSSNQSSSSNQGSSSNQGS 616

Query:   680 ETNRFKGKNAGTGN 693
              +N+    N G+ +
Sbjct:   617 SSNQSSSSNQGSSS 630

 Score = 54 (24.1 bits), Expect = 0.00087, Sum P(3) = 0.00087
 Identities = 27/139 (19%), Positives = 50/139 (35%)

Query:   558 IDKLASSTNEIVKFSGWIHANEKDRKHISGSVSKEDRLNN--IKFHLSEQKSASKSFQRP 615
             +DK+     E  K    I +      + SG  S     NN     + S    +S + Q  
Sbjct:   480 VDKVCDLPTEDQKKKCNIQSGSSSGSNNSGQQSSGSSSNNQGSSNNQSSSNQSSSNNQGS 539

Query:   616 VKRQRLSADGFKTEDNGDASSD-SDKEQSLTSYYSGAIALANSPEERMRRENRSKRFDRG 674
                Q  S++   + + G +S+  S   Q  +S  S +   ++S +     +  S      
Sbjct:   540 SNNQGSSSNQGSSSNQGSSSNQGSSSNQGSSSNQSSSNQSSSSNQGSSSNQGSSSNQGSS 599

Query:   675 QGNRSETNRFKGKNAGTGN 693
                 S +N+    N G+ +
Sbjct:   600 SNQSSSSNQGSSSNQGSSS 618

 Score = 39 (18.8 bits), Expect = 0.00018, Sum P(3) = 0.00018
 Identities = 11/26 (42%), Positives = 13/26 (50%)

Query:   970 VTDANGEVQLDAKASSSTLFMPEPED 995
             VT    E Q +   SS   F P+PED
Sbjct:  1747 VTTLQPEPQPNYNCSSEGFF-PDPED 1771


>UNIPROTKB|G3N1X9
            symbol:LOC507750 "Uncharacterized protein" species:9913
            "Bos taurus" [GO:0051298 "centrosome duplication" evidence=IEA]
            [GO:0051225 "spindle assembly" evidence=IEA] [GO:0005819 "spindle"
            evidence=IEA] [GO:0005813 "centrosome" evidence=IEA] Pfam:PF03399
            GO:GO:0005813 GO:GO:0051298 GO:GO:0051225 GO:GO:0005819
            InterPro:IPR005062 GeneTree:ENSGT00530000063781 EMBL:DAAA02063546
            OMA:RLHRFEV Ensembl:ENSBTAT00000064082 Uniprot:G3N1X9
        Length = 402

 Score = 126 (49.4 bits), Expect = 0.00018, P = 0.00018
 Identities = 58/211 (27%), Positives = 97/211 (45%)

Query:   771 YKCDQLKSIRQDLTVQRIRNQLTAKVYETH--------ARLA--IENGDL-PEYNQCQSQ 819
             +  D+L+++R DL +Q   +  TA V E+         ARL     +G + P   Q Q Q
Sbjct:   144 FVADRLRAVRLDLALQTASDVETALVLESALAVLLAVVARLGPNATHGPVDPMLLQAQVQ 203

Query:   820 -----LKILYAEGIEGCCMEFSAYHLLCVILHSNNKRELLSLMSRLSDKAKQDKAVKHAL 874
                  L+  YA G  G     + +  L  +L++    E L  + RL    +   A++ AL
Sbjct:   204 ESFGSLRRCYALGA-GPHPRQATFQGL-FLLYNLGSVEALHEILRLPAALRSCPALRTAL 261

Query:   875 AVRAAVSSGNYIMFFRLYKTAPNLNTCLMDLYVEKMRFKAVSCMSRSYR-P---TVPVSY 930
             AV +A   GN    FRL +T P L +C +  +V + R  A++ ++R+   P   T+P+ +
Sbjct:   262 AVDSAFREGNAARLFRLLRTLPYLQSCAVQCHVGRARRGALARLARALSTPKGQTLPLGF 321

Query:   931 VAQVLGFTGVSPTNEECEERDS--DGLEECV 959
             +  +L   G +   + C+      DG E  V
Sbjct:   322 MVHLLALDGPNEARDLCQAHGLPLDGQERVV 352


>TAIR|locus:2082485
            symbol:SAC3B "AT3G06290" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0034399 "nuclear periphery" evidence=IDA]
            Pfam:PF03399 EMBL:CP002686 GO:GO:0034399 InterPro:IPR005062
            IPI:IPI01020080 RefSeq:NP_187280.3 UniGene:At.27787 PRIDE:F4JAU2
            EnsemblPlants:AT3G06290.1 GeneID:819803 KEGG:ath:AT3G06290
            OMA:DHNISKR Uniprot:F4JAU2
        Length = 1697

 Score = 110 (43.8 bits), Expect = 0.00020, Sum P(4) = 0.00020
 Identities = 35/109 (32%), Positives = 57/109 (52%)

Query:   834 EFSAYH-LLCVILHSNNKREL--LSL-MSRLSDKAKQDKAVKHALAVRAAVSSGNYIMFF 889
             EF  Y+ LL +  H   K E   LSL ++ ++ + +Q   V  A  V  A  +GN+I FF
Sbjct:   655 EFRGYYALLKLDKHPGYKVEPSELSLDLANMTPEIRQTSEVLFARNVARACRTGNFIAFF 714

Query:   890 RLYKTAPNLNTCLMDLYVEKMRFKAVSCMSRSYRPT--VPVSYVAQVLG 936
             RL + A  L  CLM  +  K+R +A++ +    +    +PVS ++  +G
Sbjct:   715 RLARKASYLQACLMHAHFSKLRTQALASLHSGLQINQGLPVSDMSNWIG 763

 Score = 71 (30.1 bits), Expect = 0.00020, Sum P(4) = 0.00020
 Identities = 25/91 (27%), Positives = 46/91 (50%)

Query:   735 KRYLRLTSAPDPSTVRPEEVLEKALQMVQNS-----QKNYL----YKCDQLKSIRQDLTV 785
             K+Y R T+  +   +RP  +L+  ++ + +       +N+L    +  D++++IR DL +
Sbjct:   523 KKYTR-TAEREAILIRPMPILQNTMEYLLSLLDRPYNENFLGMYNFLWDRMRAIRMDLRM 581

Query:   786 QRIRNQLTAKVYETHARL-AIENGDLPEYNQ 815
             Q I NQ    + E   RL  I   +L EY +
Sbjct:   582 QHIFNQEAITLLEQMIRLHIIAMHELCEYTK 612

 Score = 53 (23.7 bits), Expect = 0.00020, Sum P(4) = 0.00020
 Identities = 37/154 (24%), Positives = 64/154 (41%)

Query:   536 KNKRSPS-RRTKSRWEPL-PEEKPIDKLASSTNEIVKFSGWIHANEKDRKHISGSVSK-E 592
             K   SP+ +RT+S   P+ P E+ I + +  + +  +  G   A  K      G +    
Sbjct:   373 KTNSSPATKRTRS--PPVYPIEEDIPRNSFPSQDCTE--GEEQARAKRLARFKGELEPIA 428

Query:   593 DRLNNIKFHLSEQKSASKSFQRPVKRQRLSADGFKTEDNGDASSDSDKEQSLTSYYSGAI 652
             DR  +I+   S      K      K+   S +  +    GDA  D +  +   S   G +
Sbjct:   429 DRPVDIQLTKSPVNKTMKPLDN--KQTFNSLESSRDALKGDALPDYENSEQ-PSLIIG-V 484

Query:   653 ALANSPE-ERMRRENRSK--RFDRGQGNRSETNR 683
                  PE ER  RE +     ++R  G+R++T++
Sbjct:   485 CPDMCPESERGERERKGDLDHYERVDGDRNQTSK 518

 Score = 46 (21.3 bits), Expect = 0.00020, Sum P(4) = 0.00020
 Identities = 21/85 (24%), Positives = 33/85 (38%)

Query:   116 PNSSDPYAYGSTAY-PGXXXXXXXXPNHSYPQPVGAYQNS----GAPYQP--------IS 162
             P S +  A+   ++ PG        P+     P+ A QN     G PY+P        I+
Sbjct:    55 PASQNHSAFAGQSFGPGGIRSG---PSIQRAPPLSASQNPQLSIGKPYRPGGVQSVPPIN 111

Query:   163 SFQNSGSYVGPASYSATYYNPGDYQ 187
                +  ++  P+  S   Y PG  Q
Sbjct:   112 RIPSPSAFQNPSPSSGQPYQPGGIQ 136

 Score = 46 (21.3 bits), Expect = 0.00091, Sum P(4) = 0.00091
 Identities = 45/183 (24%), Positives = 72/183 (39%)

Query:   325 WKHGTPSFQNRQV-SPVQPHYSKPLEQKTSYNNFQDQHKAACPQGPSSQYAIGQQMAPSY 383
             W     S +N  V S   P+     EQ T  ++F   H+ A  Q  + + +    +APS 
Sbjct:   263 WMRSPSSAENNPVRSRSNPNQLIHQEQ-TGNSSFPYAHEVAEIQEATRRKS--SAVAPS- 318

Query:   384 QSPPVQTSP---QLDNRRVSKLQIPTNPRIASNLALG----LPKTDKDSSTANAAAKPAY 436
                P+   P   Q D++R S    PT+   +  L+       P      ++ N A K   
Sbjct:   319 -DKPLGDDPILSQHDSQRFSTSP-PTSGTKSYTLSRSSDSQFPGQPSSVNSFNNARK-TN 375

Query:   437 IGVSLAKSNEKVVSHADSRVEPGTFPKSLCGYVE---RA--LARCKGDAEIAASQAVMGE 491
                +  ++    V   +  +   +FP   C   E   RA  LAR KG+ E  A + V  +
Sbjct:   376 SSPATKRTRSPPVYPIEEDIPRNSFPSQDCTEGEEQARAKRLARFKGELEPIADRPVDIQ 435

Query:   492 IIK 494
             + K
Sbjct:   436 LTK 438


>UNIPROTKB|G3MZY8
            symbol:POLR2A "DNA-directed RNA polymerase" species:9913
            "Bos taurus" [GO:0031625 "ubiquitin protein ligase binding"
            evidence=IEA] [GO:0005665 "DNA-directed RNA polymerase II, core
            complex" evidence=IEA] [GO:0004672 "protein kinase activity"
            evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA] [GO:0003899
            "DNA-directed RNA polymerase activity" evidence=IEA] [GO:0006366
            "transcription from RNA polymerase II promoter" evidence=IEA]
            InterPro:IPR000684 InterPro:IPR000722 InterPro:IPR006592
            InterPro:IPR007066 InterPro:IPR007073 InterPro:IPR007075
            InterPro:IPR007080 InterPro:IPR007081 InterPro:IPR007083
            Pfam:PF00623 Pfam:PF04983 Pfam:PF04990 Pfam:PF04992 Pfam:PF04997
            Pfam:PF04998 Pfam:PF05000 Pfam:PF05001 PROSITE:PS00115
            SMART:SM00663 GO:GO:0003677 GO:GO:0004672 GO:GO:0006366
            Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899 GO:GO:0005665
            GeneTree:ENSGT00700000104490 OMA:KVLPWST EMBL:DAAA02048777
            EMBL:DAAA02048778 EMBL:DAAA02048779 EMBL:DAAA02048780
            EMBL:DAAA02048781 Ensembl:ENSBTAT00000064788 Uniprot:G3MZY8
        Length = 1970

 Score = 133 (51.9 bits), Expect = 0.00025, P = 0.00025
 Identities = 87/353 (24%), Positives = 128/353 (36%)

Query:   111 GYTSYPNSSDPYAYGSTAYPGXXXXXXXXPNH-SY-PQPVGAYQNSGAPYQPIS-SFQNS 167
             G    P  S PY       PG        P   +Y P+  G Y      Y P S S+  +
Sbjct:  1571 GSPGSPGPSSPYIPS----PGGAMSPSYSPTSPAYEPRSPGGYTPQSPSYSPTSPSYSPT 1626

Query:   168 GSYVGPAS--YSATYYNPGDYQTAGGYPSSGYSHQTTSWNEGNYTNYTSHQYSNYT---S 222
                  P S  YS T  +P    T+  Y  +  S+  TS    +Y+  TS  YS  +   S
Sbjct:  1627 SPSYSPTSPNYSPT--SPSYSPTSPSYSPTSPSYSPTS---PSYSP-TSPSYSPTSPSYS 1680

Query:   223 DTSGAYSSGTAPATSLQYQQQYKQWADYYSQTEVSCAPGTENLSVASSSNQVLQP---PG 279
              TS +YS  T+P+ S      Y   +  YS T  S +P + + S  S S     P   P 
Sbjct:  1681 PTSPSYSP-TSPSYS-PTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPT 1738

Query:   280 VTAGYPTA--HSQPAPIYHXXXXXXXXXXXXXXXXPAATSNGSHDSYWKHGTPSFQ--NR 335
               +  PT+  +S  +P Y                 P + +       +   +PS+   + 
Sbjct:  1739 SPSYSPTSPNYSPTSPNY--TPTSPSYSPTSPSYSPTSPNYTPTSPNYSPTSPSYSLTSP 1796

Query:   336 QVSPVQPHYS--KP--LEQKTSYNNFQDQHKAACPQ-GPSS-QYAIGQQMAPSYQSPPVQ 389
               SP  P YS   P    Q  +Y      +  + P   P+S +Y      +PSY     +
Sbjct:  1797 SYSPTSPSYSPSSPRYTPQSPTYTPSSPSYSPSSPSYSPTSPKYT---PASPSYSPSSPE 1853

Query:   390 TSPQLDNRRVSKLQIPTNPRIASNLALGLPKTDKDSSTAN--AAAKPAYIGVS 440
              +P       S    PT+P+ +       P T K S T+   +   P Y   S
Sbjct:  1854 YTPSPKYSPTSPKYSPTSPKYSPTSPTYSPTTPKYSPTSPTYSPTSPVYTPTS 1906


>MGI|MGI:98086
            symbol:Polr2a "polymerase (RNA) II (DNA directed) polypeptide
            A" species:10090 "Mus musculus" [GO:0003677 "DNA binding"
            evidence=IDA] [GO:0003899 "DNA-directed RNA polymerase activity"
            evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005634 "nucleus" evidence=ISO] [GO:0005665 "DNA-directed RNA
            polymerase II, core complex" evidence=ISO] [GO:0005730 "nucleolus"
            evidence=ISO] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0006366 "transcription from RNA polymerase II
            promoter" evidence=ISO] [GO:0006468 "protein phosphorylation"
            evidence=ISO] [GO:0016740 "transferase activity" evidence=IEA]
            [GO:0016779 "nucleotidyltransferase activity" evidence=IEA]
            [GO:0031625 "ubiquitin protein ligase binding" evidence=ISO]
            [GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000684
            InterPro:IPR000722 InterPro:IPR006592 InterPro:IPR007066
            InterPro:IPR007073 InterPro:IPR007075 InterPro:IPR007080
            InterPro:IPR007081 InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983
            Pfam:PF04990 Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000
            Pfam:PF05001 PROSITE:PS00115 SMART:SM00663 MGI:MGI:98086
            GO:GO:0046872 GO:GO:0003677 GO:GO:0004672 GO:GO:0006366
            EMBL:AL603707 Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899
            eggNOG:COG0086 GO:GO:0005665 GeneTree:ENSGT00700000104490
            HOGENOM:HOG000222975 OMA:KVLPWST KO:K03006 CTD:5430
            HOVERGEN:HBG004339 OrthoDB:EOG4JWVCM ChiTaRS:POLR2A EMBL:M12130
            EMBL:M14101 IPI:IPI00136207 PIR:A28490 RefSeq:NP_033115.1
            UniGene:Mm.16533 DisProt:DP00181 ProteinModelPortal:P08775
            SMR:P08775 DIP:DIP-46369N IntAct:P08775 STRING:P08775
            PhosphoSite:P08775 PaxDb:P08775 PRIDE:P08775
            Ensembl:ENSMUST00000058470 Ensembl:ENSMUST00000071213 GeneID:20020
            KEGG:mmu:20020 UCSC:uc007jrj.1 InParanoid:Q5F298 NextBio:297535
            Bgee:P08775 CleanEx:MM_POLR2A Genevestigator:P08775
            GermOnline:ENSMUSG00000005198 Uniprot:P08775
        Length = 1970

 Score = 133 (51.9 bits), Expect = 0.00025, P = 0.00025
 Identities = 93/341 (27%), Positives = 139/341 (40%)

Query:   106 AQDYSGYT-SYPNSSDPYAYGSTAYPGXXXXXXXXPNHSY-PQPVGAYQNSGAPYQPISS 163
             A D SG++  Y  +  P   GS   PG        P+  Y P P GA   S +P  P   
Sbjct:  1551 ASDASGFSPGYSPAWSPTP-GSPGSPG--------PSSPYIPSPGGAMSPSYSPTSPAYE 1601

Query:   164 FQNSGSYVGPA-SYSATYYNPGDYQTAGGY-PSS-GYSHQTTSWNEGNYT-NYTSHQYSN 219
              ++ G Y   + SYS T  +P    T+  Y P+S  YS  + S++  + + + TS  YS 
Sbjct:  1602 PRSPGGYTPQSPSYSPT--SPSYSPTSPSYSPTSPNYSPTSPSYSPTSPSYSPTSPSYSP 1659

Query:   220 YT---SDTSGAYSSGTAPATSLQYQQQYKQWADYYSQTEVSCAPGTENLSVASSSNQVLQ 276
              +   S TS +YS  T+P+ S      Y   +  YS T  S +P + + S  S S     
Sbjct:  1660 TSPSYSPTSPSYSP-TSPSYS-PTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTS 1717

Query:   277 P---PGVTAGYPTA--HSQPAPIYHXXXXXXXXXXXXXX-XXPA-ATSNGSHDSYWKHGT 329
             P   P   +  PT+  +S  +P Y                  P+ + ++ S+     + T
Sbjct:  1718 PSYSPTSPSYSPTSPSYSPTSPSYSPTSPNYSPTSPNYTPTSPSYSPTSPSYSPTSPNYT 1777

Query:   330 PSFQNRQVSPVQPHYS--KPLEQKTSYNNFQDQHKAACPQGPSSQYAIGQQMAPSYQSPP 387
             P+  N   SP  P YS   P    TS  ++        PQ P+  Y      +PSY SP 
Sbjct:  1778 PTSPN--YSPTSPSYSPTSPSYSPTS-PSYSPSSPRYTPQSPT--YT---PSSPSY-SP- 1827

Query:   388 VQTSPQLDNRRVSKLQIPTNPRIASNLALGLPKTDKDSSTA 428
               +SP       S    PT+P  + +     P + K S T+
Sbjct:  1828 --SSPSYSP--TSPKYTPTSPSYSPSSPEYTPASPKYSPTS 1864


>UNIPROTKB|I3LCF4
            symbol:MCM3AP "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005829 "cytosol" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] Pfam:PF03399 GO:GO:0005829 GO:GO:0005634
            InterPro:IPR005062 OMA:AHQMKVQ GeneTree:ENSGT00530000063781
            EMBL:FP236737 Ensembl:ENSSSCT00000029980 Uniprot:I3LCF4
        Length = 1990

 Score = 146 (56.5 bits), Expect = 0.00042, Sum P(5) = 0.00042
 Identities = 55/218 (25%), Positives = 98/218 (44%)

Query:   769 YLYKCDQLKSIRQDLTVQRIRNQLTAKVYETHARLAIE---------------NGDLPEY 813
             Y +  ++ + IR+D+T Q + + +T  + E   R  I                N +    
Sbjct:   723 YDFLWNRTRGIRKDITQQHLCDPVTVSLIEKCTRFHIHCAHFMCEEPMSSFDANINSENM 782

Query:   814 NQCQSQLKILYAE-GIEG--CC--MEFSAYHLLCVILHSNNKRELLSLMSRLSDKAKQDK 868
              +C   LK +Y +  ++G  C    EF  Y+   V+LH N K ++L  + +     +   
Sbjct:   783 TRCLQSLKEMYQDLRVKGVFCAGEAEFQGYN---VLLHLN-KGDILREVQQFHPAVRNSS 838

Query:   869 AVKHALAVRAAVSSGNYIMFFRLYKTAPNLNTCLMDLYVEKMRFKAVSCMSRSY-----R 923
              VK A+   AA++S N++ FF+L ++A  LN CL+  Y  ++R  A+  ++ +Y     R
Sbjct:   839 EVKFAVQAFAALNSNNFVRFFKLVQSASYLNACLLHRYFNQIRRDALRALNVAYTVSPQR 898

Query:   924 PTV-PVSYVAQVLGFTGVSPTNEECEERDSDGLEECVE 960
              TV P+  V ++L F       +          + CVE
Sbjct:   899 STVFPLDSVVRMLLFQDCEEATDVLSCHGLTASDGCVE 936

 Score = 50 (22.7 bits), Expect = 0.00042, Sum P(5) = 0.00042
 Identities = 37/171 (21%), Positives = 62/171 (36%)

Query:     6 QNQQGSTQNIA----SSVDPNS-VENRYVVDASQSQASSYFPSTTGSGAVSWATHGVNNQ 60
             Q+   S Q +     S+V P S +E+     A+    SS  P   G    S       + 
Sbjct:    66 QSHSSSAQTLGVSQTSNVGPFSGLEHTPAFVATSGPTSSCVPGNPGFSFKSPNLGTFPST 125

Query:    61 ST---ENGNLSNASYHHEQHTESHVKSLQDXXXXXXXXXXXXXXXXXVAQDYSGYT--SY 115
             ST   E G ++++ +     TE   K L++                  +Q  SG+   S+
Sbjct:   126 STFGPETGEMASSGFGK---TEFSFKPLENSVFRPISGTESEPEKTQ-SQITSGFFTFSH 181

Query:   116 PNSSDPYAYGSTAYPGXXXXXXXXPNHSYPQPVGAYQNSGAPYQPISSFQN 166
             P SS P      ++P          N ++ +P     N  + + P  S Q+
Sbjct:   182 PVSSGPGGLAPFSFPQMTSTSATNSNFTFSKP--GNNNLSSAFTPALSNQS 230

 Score = 45 (20.9 bits), Expect = 0.00042, Sum P(5) = 0.00042
 Identities = 13/34 (38%), Positives = 18/34 (52%)

Query:   252 SQTEVSCAPGTEN--LSVASSSNQVLQP---PGV 280
             S+ +V CA   EN  +++   S  VL P   PGV
Sbjct:   360 SKKDVGCAESGENDHVAIPGGSQSVLVPSRLPGV 393

 Score = 40 (19.1 bits), Expect = 0.00042, Sum P(5) = 0.00042
 Identities = 10/33 (30%), Positives = 15/33 (45%)

Query:   677 NRSETNRFKGKNAGTGNLYVRRASALLISKSFD 709
             +R+   +  GK A    +Y RR   L +   FD
Sbjct:   448 DRTSLEKHFGKIAKVQRIYTRRNKKLAVVYFFD 480

 Score = 39 (18.8 bits), Expect = 0.00042, Sum P(5) = 0.00042
 Identities = 8/15 (53%), Positives = 9/15 (60%)

Query:   950 RDSDGLEECVEWLKA 964
             R  D +EE V WL A
Sbjct:  1476 RSEDAVEEDVYWLAA 1490

 Score = 39 (18.8 bits), Expect = 0.00051, Sum P(5) = 0.00051
 Identities = 9/25 (36%), Positives = 15/25 (60%)

Query:   597 NIKFHLSEQKSASKSFQRPVKRQRL 621
             NI  +L+++ S  K F +  K QR+
Sbjct:   441 NIPDYLNDRTSLEKHFGKIAKVQRI 465

 Score = 37 (18.1 bits), Expect = 0.00078, Sum P(5) = 0.00078
 Identities = 10/29 (34%), Positives = 17/29 (58%)

Query:   542 SRRTKSRWEPLPEE-KPIDKLASSTNEIV 569
             SR +  + EPLP E +P   L+ + + +V
Sbjct:   680 SRSSADQEEPLPHELRPSAVLSRTMDYLV 708


>RGD|1308049
            symbol:Sac3d1 "SAC3 domain containing 1" species:10116 "Rattus
            norvegicus" [GO:0005813 "centrosome" evidence=IEA;ISO] [GO:0005819
            "spindle" evidence=IEA;ISO] [GO:0015630 "microtubule cytoskeleton"
            evidence=ISO] [GO:0051225 "spindle assembly" evidence=IEA;ISO]
            [GO:0051298 "centrosome duplication" evidence=IEA;ISO] Pfam:PF03399
            RGD:1308049 GO:GO:0005813 GO:GO:0051298 GO:GO:0051225 GO:GO:0005819
            InterPro:IPR005062 GeneTree:ENSGT00530000063781 OrthoDB:EOG4HMJ9Z
            EMBL:AC120237 IPI:IPI00371474 Ensembl:ENSRNOT00000028509
            Uniprot:D3ZT97
        Length = 428

 Score = 123 (48.4 bits), Expect = 0.00043, P = 0.00043
 Identities = 49/196 (25%), Positives = 89/196 (45%)

Query:   771 YKCDQLKSIRQDLTVQRIRNQLTAKVYETH-ARLAIENGDL-PEYNQCQSQLKILYAEGI 828
             +  D+L+++R DL++Q + +   A V E   A L +    + PE  +  +   +L  +  
Sbjct:   172 FVADRLRAVRLDLSLQGVDDAEAAAVLEPALATLLVVVARMRPEETRGVADPVLLQTQVQ 231

Query:   829 EG------CCMEFSAYHLL------CVILHSNNKRELLSLMSRLSDKAKQDKAVKHALAV 876
             EG      C     A H          +L++    E L  + +L    +    ++ ALAV
Sbjct:   232 EGFGSLRRCYARGKAPHPRQAAFQGLFLLYNLGSVEALQEVLQLPAALRACPPLQTALAV 291

Query:   877 RAAVSSGNYIMFFRLYKTAPNLNTCLMDLYVEKMRFKAVSCMSRSYR-P---TVPVSYVA 932
              +A    NY   FRL +T P L +C +  ++   R KA+S +SR+   P   T+P+ ++ 
Sbjct:   292 DSAFREDNYARLFRLLRTLPYLQSCAVQEHIGYARRKALSRLSRALSTPRGQTLPLDFIV 351

Query:   933 QVLGFTGVSPTNEECE 948
              +L   G+    + C+
Sbjct:   352 HLLALDGLQEAQDLCQ 367


>UNIPROTKB|Q16VD3
            symbol:lig "Protein lingerer" species:7159 "Aedes aegypti"
            [GO:0003674 "molecular_function" evidence=ND] [GO:0005737
            "cytoplasm" evidence=ISS] [GO:0007620 "copulation" evidence=ISS]
            InterPro:IPR009060 GO:GO:0005737 GO:GO:0007610 PROSITE:PS50030
            eggNOG:NOG12793 SUPFAM:SSF46934 GO:GO:0007620 EMBL:CH477597
            RefSeq:XP_001653854.1 RefSeq:XP_001653855.1 RefSeq:XP_001653856.1
            RefSeq:XP_001653857.1 RefSeq:XP_001653858.1 UniGene:Aae.18497
            ProteinModelPortal:Q16VD3 EnsemblMetazoa:AAEL009607-RA
            GeneID:5572191 KEGG:aag:AaeL_AAEL009607 VectorBase:AAEL009607
            HOGENOM:HOG000046379 OMA:PTHHNIN OrthoDB:EOG4V9S5P PhylomeDB:Q16VD3
            InterPro:IPR022166 Pfam:PF12478 Uniprot:Q16VD3
        Length = 1250

 Score = 143 (55.4 bits), Expect = 0.00047, Sum P(2) = 0.00047
 Identities = 76/286 (26%), Positives = 116/286 (40%)

Query:    12 TQNIASSVDPNSVENRYVVDASQSQASSYFPSTTGSGAVSWATHGVNNQSTENGNLSNAS 71
             +Q   S+V P+SV +   V++  S  S+     T +   S +T G N  S   G   N S
Sbjct:   659 SQRNTSTVVPSSVSSGVNVNSINSTTST-LEQLTKTDPYSQST-GTNAGS---GGYQNVS 713

Query:    72 YHHEQ--HTESHVKSLQDXXXXXXXXXXXXXXXXXVAQDYSGYTSYPNSSDPYAYGSTAY 129
             Y   Q   T S+  S                         + Y+SY N S   +Y     
Sbjct:   714 YSTSQANKTSSYPSSAAPQGYNNSSYSSTQVSTNTYPASSNNYSSY-NQSGVNSYQQ--- 769

Query:   130 PGXXXXXXXXPNHSYPQPVGAYQ-NSGAPYQPISS-FQNSGSYVGPASYSATYYNPGDYQ 187
             P         PN++    VG    N   P  P+++   N+ S      Y ++ Y P   Q
Sbjct:   770 PSSNVSSSVVPNNNSTSSVGVSTVNQSNPNLPVNNNVSNNSSSNTNTGYLSSQY-PVS-Q 827

Query:   188 TAGGYPSSGYSHQTTSWNEGNYTNYTSHQ-YSNYTSDTSGAYSSGTAPATSLQYQQQYKQ 246
             T+  +PS   S+Q +S N    T   S+  YS  TS +SG YS+ ++  + L+      Q
Sbjct:   828 TSSAFPSQ-QSYQNSSQNVYGNTGLNSNTGYSGSTSTSSGQYSNFSS--SKLKDTPVSTQ 884

Query:   247 WADYYSQTEVSCAPGTENLSVASSSNQVLQPPGVTAGYPTAHSQPA 292
             + D  S + VS      N SV+SSS+ +   P  +    T+ S P+
Sbjct:   885 F-DSVSSSTVSNNNSANNNSVSSSSSSL---PNSSVVSTTSMSSPS 926

 Score = 37 (18.1 bits), Expect = 0.00047, Sum P(2) = 0.00047
 Identities = 7/20 (35%), Positives = 12/20 (60%)

Query:   596 NNIKFHLSEQKSASKSFQRP 615
             +NI  H +  + ++ S QRP
Sbjct:  1208 HNINMHQAMHQDSNSSGQRP 1227


>DICTYBASE|DDB_G0293168
            symbol:ddx17 "DEAD/DEAH box helicase" species:44689
            "Dictyostelium discoideum" [GO:0008026 "ATP-dependent helicase
            activity" evidence=IEA] [GO:0005524 "ATP binding" evidence=IEA]
            [GO:0004386 "helicase activity" evidence=IEA] [GO:0003676 "nucleic
            acid binding" evidence=IEA] [GO:0008186 "RNA-dependent ATPase
            activity" evidence=ISS] [GO:0006396 "RNA processing" evidence=ISS]
            [GO:0005634 "nucleus" evidence=IEA;ISS] [GO:0003724 "RNA helicase
            activity" evidence=ISS] [GO:0003723 "RNA binding" evidence=IEA;ISS]
            [GO:0042254 "ribosome biogenesis" evidence=IEA] [GO:0016787
            "hydrolase activity" evidence=IEA] [GO:0006364 "rRNA processing"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0000184
            "nuclear-transcribed mRNA catabolic process, nonsense-mediated
            decay" evidence=IEA] [GO:0000166 "nucleotide binding" evidence=IEA]
            [GO:0044351 "macropinocytosis" evidence=RCA] InterPro:IPR000629
            InterPro:IPR001650 InterPro:IPR011545 Pfam:PF00270 Pfam:PF00271
            PROSITE:PS00039 PROSITE:PS51194 SMART:SM00490
            dictyBase:DDB_G0293168 GO:GO:0005524 GO:GO:0005634 GO:GO:0005737
            GenomeReviews:CM000155_GR GO:GO:0000184 GO:GO:0003723 GO:GO:0006396
            InterPro:IPR014001 SMART:SM00487 PROSITE:PS51192 GO:GO:0006364
            GO:GO:0008026 eggNOG:COG0513 GO:GO:0003724 InterPro:IPR014014
            PROSITE:PS51195 EMBL:AAFI02000199 KO:K12823 GO:GO:0008186
            RefSeq:XP_629279.1 HSSP:P09052 ProteinModelPortal:Q54CE0
            PRIDE:Q54CE0 EnsemblProtists:DDB0233431 GeneID:8629001
            KEGG:ddi:DDB_G0293168 Uniprot:Q54CE0
        Length = 785

 Score = 126 (49.4 bits), Expect = 0.00049, P = 0.00049
 Identities = 77/300 (25%), Positives = 115/300 (38%)

Query:    11 STQNIASSVDPNSVENR-YVVDA-SQSQASSY-FPSTTGSGAVSWATHGVNNQSTENGNL 67
             ST    SS    S  +R Y  D  S ++ SS  + S++GSG+ + ++    N+   + + 
Sbjct:    25 STSRGGSSYGNRSGSDRDYNRDGGSYNRDSSRDYNSSSGSGSGNGSSS--YNKYPSSSSS 82

Query:    68 SNASYHHEQHTESHVKSLQDXXXXXXXXXXXXXXXXXVAQDYS--GYTSYPNSSDPYAYG 125
             S++S     +  S  K  QD                    + S  GY S  NSS  Y   
Sbjct:    83 SSSSSSTSSYGPSKGKDFQDSWGSSSTGTTNGYNGSSNGYNSSSNGYNS-SNSSSSYGAS 141

Query:   126 STAYPGXXXXXXXXPNHSYPQPVGAYQNSGAP----Y-QPISSFQNSGSYVGPASYSATY 180
             +  Y           + S     G+Y NSG+     Y +P S++  S  Y GP +  ++Y
Sbjct:   142 NNGYNNSSGSSSSGSSGS--SNGGSYNNSGSSNSNGYSKPTSNYSYSNGYTGPTTNYSSY 199

Query:   181 YNPGDYQTAGGYPSSGYSHQTTSWNEGNYTNYT--SHQYSNYTSDTSGAYSSGTAPATSL 238
              N G Y T     S+  S  TT+      T+Y   S  Y   TS +S  Y   + P    
Sbjct:   200 SN-G-YSTPPTSTSTSSSSTTTTTTTTPSTSYNGGSTSYGYSTSGSSNGYGGYSQPPIP- 256

Query:   239 QYQQQYKQWADYYSQTEVSCAPGTENLSVASSS--NQVLQPPGV-TAGYPTAHSQPAPIY 295
              Y       +   S   V+ A  + N SV  SS  N   +  G     Y T +S  +  Y
Sbjct:   257 SYDP-----SSVSSYGAVTPASSSYNASVPGSSYGNSTYRSSGYGNQSYATTNSYGSSSY 311


>UNIPROTKB|F1PGS0
            symbol:POLR2A "DNA-directed RNA polymerase" species:9615
            "Canis lupus familiaris" [GO:0003899 "DNA-directed RNA polymerase
            activity" evidence=IEA] [GO:0006366 "transcription from RNA
            polymerase II promoter" evidence=IEA] [GO:0005665 "DNA-directed RNA
            polymerase II, core complex" evidence=IEA] [GO:0003677 "DNA
            binding" evidence=IEA] InterPro:IPR000684 InterPro:IPR000722
            InterPro:IPR006592 InterPro:IPR007066 InterPro:IPR007073
            InterPro:IPR007075 InterPro:IPR007080 InterPro:IPR007081
            InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983 Pfam:PF04990
            Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000 Pfam:PF05001
            PROSITE:PS00115 SMART:SM00663 GO:GO:0003677 GO:GO:0006366
            Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899 GO:GO:0005665
            GeneTree:ENSGT00700000104490 OMA:KVLPWST EMBL:AAEX03003616
            EMBL:AAEX03003617 Ensembl:ENSCAFT00000026237 Uniprot:F1PGS0
        Length = 1969

 Score = 130 (50.8 bits), Expect = 0.00053, P = 0.00053
 Identities = 87/337 (25%), Positives = 138/337 (40%)

Query:   106 AQDYSGYT-SYPNSSDPYAYGSTAYPGXXXXXXXXPNHSY-PQPVGAYQNSGAPYQPISS 163
             A D SG++  Y  +  P   GS   PG        P+  Y P P GA   S +P  P   
Sbjct:  1551 ASDASGFSPGYSPAWSPTP-GSPGSPG--------PSSPYIPSPGGAMSPSYSPTSPAYE 1601

Query:   164 FQNSGSYVGPA-SYSATYYNPGDYQTAGGY-PSS-GYSHQTTSWNEGNYT-NYTSHQYSN 219
              ++ G Y   + SYS T  +P    T+  Y P+S  YS  + S++  + + + TS  YS 
Sbjct:  1602 PRSPGGYTPQSPSYSPT--SPSYSPTSPSYSPTSPNYSPTSPSYSPTSPSYSPTSPSYSP 1659

Query:   220 YT---SDTSGAYSSGTAPATSLQYQQQYKQWADYYSQTEVSCAPGTENLSVASSSNQVLQ 276
              +   S TS +YS  T+P+ S      Y   +  YS T  S +P + + S  S S     
Sbjct:  1660 TSPSYSPTSPSYSP-TSPSYS-PTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTS 1717

Query:   277 PPGVTAGYPTAHSQPAPIYHXXXXXXXXXXXXXXXXPAATSNGSHDSYWKHGTPSFQ--N 334
             P   +   P+ +S  +P Y                 P+  +       +   +PS+   +
Sbjct:  1718 P-SYSPTSPS-YSPTSPSY--SPTSPSYSPTSPNYSPSRPNYTPTSPSYSPTSPSYSPTS 1773

Query:   335 RQVSPVQPHYS--KP-LEQKTSYNNFQDQHKAACPQGPSSQYAIGQQMAPSYQSPPVQTS 391
                +P+ P+YS   P L+   SY+     +  + P+  + Q       +PSY SP   +S
Sbjct:  1774 PNYTPMSPNYSPTSPNLQLSPSYSPTSPSYSPSSPRY-TPQSPTYTPSSPSY-SP---SS 1828

Query:   392 PQLDNRRVSKLQIPTNPRIASNLALGLPKTDKDSSTA 428
             P       S    PT+P  + +     P + K S T+
Sbjct:  1829 PSYSP--TSPKYTPTSPSYSPSSPEYTPTSPKYSPTS 1863


>ZFIN|ZDB-GENE-041008-78
            symbol:polr2a "polymerase (RNA) II (DNA directed)
            polypeptide A" species:7955 "Danio rerio" [GO:0003677 "DNA binding"
            evidence=IEA] [GO:0003899 "DNA-directed RNA polymerase activity"
            evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0005665 "DNA-directed RNA polymerase II, core
            complex" evidence=IEA] [GO:0006366 "transcription from RNA
            polymerase II promoter" evidence=IEA] [GO:0016740 "transferase
            activity" evidence=IEA] [GO:0016779 "nucleotidyltransferase
            activity" evidence=IEA] InterPro:IPR000684 InterPro:IPR000722
            InterPro:IPR006592 InterPro:IPR007066 InterPro:IPR007073
            InterPro:IPR007075 InterPro:IPR007080 InterPro:IPR007081
            InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983 Pfam:PF04990
            Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000 Pfam:PF05001
            PROSITE:PS00115 SMART:SM00663 ZFIN:ZDB-GENE-041008-78 GO:GO:0003677
            GO:GO:0006366 Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899
            GO:GO:0005665 GeneTree:ENSGT00700000104490 EMBL:AL929346
            IPI:IPI00608319 Ensembl:ENSDART00000077495 Bgee:F1Q9K4
            Uniprot:F1Q9K4
        Length = 1965

 Score = 128 (50.1 bits), Expect = 0.00087, P = 0.00087
 Identities = 88/349 (25%), Positives = 132/349 (37%)

Query:   116 PNSSDPYAYGSTAYPGXXXXXXXXPNHSYPQPVGAYQNSGAPYQPISSFQN--SGSYVGP 173
             P    P + G  A P         PN+S   P    ++ G  Y P S   +  S SY  P
Sbjct:  1564 PTPGSPGSPGP-ASPYIPSPGALSPNYSPTSPAYEPRSPGGGYTPQSPGYSPTSPSY-SP 1621

Query:   174 ASYSATYYNPGDYQTAGGY-PSS-GYSHQTTSWNEGNYT-NYTSHQYSNYT---SDTSGA 227
              S S +  +P    T+  Y P+S  YS  + S++  + + + TS  YS  +   S TS +
Sbjct:  1622 TSPSYSPTSPNYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPS 1681

Query:   228 YSSGTAPATSLQYQQQYKQWADYYSQTEVSCAPGTENLSVASSSNQVLQP---PGVTAGY 284
             YS  T+P+ S      Y   +  YS T  S +P + + S  S S     P   P   +  
Sbjct:  1682 YSP-TSPSYS-PTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYSPTSPSYS 1739

Query:   285 PTA--HSQPAPIYHXXXXXXXXXXXXXXXXPAATSNGSHDSYWKHGTPSFQ--NRQVSPV 340
             PT+  +S  +P Y                 P + S       +   +P++   +   SP 
Sbjct:  1740 PTSPSYSPTSPNY--TPTSPSYSPTSPSYSPTSPSYSPTSPNYTPTSPNYSPTSPSYSPT 1797

Query:   341 QPHYS--KP--LEQKTSYNNFQDQHKAACPQ-GPSS-QYAIGQQMAPSYQ-SPPVQTSPQ 393
              P YS   P    Q  +Y      +  + P   P+S +Y      +PSY  S P  T   
Sbjct:  1798 SPSYSPSSPRYTPQSPTYTPSSPSYSPSSPSYSPTSPKYT---PTSPSYSPSSPEYTPTS 1854

Query:   394 LDNRRVSKLQIPTNPRIASNLALGLPKTDKDSSTAN--AAAKPAYIGVS 440
                   S    PT+P+ +       P T K S T+   +   P Y   S
Sbjct:  1855 PKYSPTSPKYSPTSPKYSPTSPTYSPTTPKYSPTSPTYSPTSPTYTPTS 1903


>DICTYBASE|DDB_G0271916
            symbol:rtoA "unknown" species:44689 "Dictyostelium
            discoideum" [GO:0006972 "hyperosmotic response" evidence=IEP]
            [GO:0006906 "vesicle fusion" evidence=IMP] [GO:0006979 "response to
            oxidative stress" evidence=IEP] [GO:0006874 "cellular calcium ion
            homeostasis" evidence=IMP] [GO:0048388 "endosomal lumen
            acidification" evidence=IMP] [GO:0031210 "phosphatidylcholine
            binding" evidence=IDA] [GO:0031154 "culmination involved in
            sorocarp development" evidence=IMP] [GO:0005509 "calcium ion
            binding" evidence=IDA] [GO:0006970 "response to osmotic stress"
            evidence=IMP] [GO:0006944 "cellular membrane fusion" evidence=IDA]
            [GO:0006887 "exocytosis" evidence=IMP] [GO:0006885 "regulation of
            pH" evidence=IMP] [GO:0005737 "cytoplasm" evidence=IEA;IDA]
            [GO:0030587 "sorocarp development" evidence=IMP] [GO:0007049 "cell
            cycle" evidence=IMP] [GO:0045595 "regulation of cell
            differentiation" evidence=IMP] dictyBase:DDB_G0271916 GO:GO:0005524
            GO:GO:0005737 GO:GO:0006979 GO:GO:0045595 GenomeReviews:CM000151_GR
            EMBL:AAFI02000007 GO:GO:0006972 GO:GO:0048388 GO:GO:0031210
            GO:GO:0006874 GO:GO:0006887 eggNOG:NOG12793 GO:GO:0031154
            GO:GO:0006906 EMBL:U48298 RefSeq:XP_645420.1
            EnsemblProtists:DDB0185120 GeneID:8618200 KEGG:ddi:DDB_G0271916
            OMA:CSKETHI ProtClustDB:CLSZ2431310 Uniprot:P54681
        Length = 367

 Score = 119 (46.9 bits), Expect = 0.00091, P = 0.00091
 Identities = 64/286 (22%), Positives = 113/286 (39%)

Query:     7 NQQGS----TQNIASSVDPNSVENRYVVDASQSQAS------SYFPSTTGSGAVSWATHG 56
             N QGS    +Q+++S VD + + +      + S+ S      S   ST+ SG+ +  +  
Sbjct:    18 NAQGSIGSSSQSLSSEVDSSDISSSGSNSTASSEGSVSSSSNSGSQSTSNSGSEASGSSN 77

Query:    57 VNNQSTENGNLSNASYHHEQHTESHVKSLQDXXXXXXXXXXXXXXXXXVAQDYSGYTSYP 116
               +QST N   S AS      ++S   S                     + + SG  S  
Sbjct:    78 SGSQSTSNSG-SEASGSSNSGSQSSTDSSNSGSQGSTGSSNSGSQSSTDSSN-SGSQSST 135

Query:   117 NSSDPYAYGSTAYPGXXXXXXXXPNHSYPQPVGAYQNSGAPYQPISSFQNSGSYVGPASY 176
             +SS+  + GST             N       G+  NSG+  +  S   NSGS     S 
Sbjct:   136 DSSNSGSQGSTGSSNSGSESSGSSNSGSEGSTGS-SNSGS--ESSSGSSNSGSESSSGSS 192

Query:   177 SATYYNPGDYQTAGGYPSSGYSHQTTSWNEG--NYTNYTSHQYSNYTSDTS-GAYSSGTA 233
             ++   +      +G   SSG S+  +  + G  N  + +S   SN  S++S G+ +SG+ 
Sbjct:   193 NSGSESSSGSSNSGSESSSGSSNSGSESSSGSSNSGSESSSGSSNSGSESSSGSSNSGSE 252

Query:   234 PATSLQYQQQYKQWADYYSQTEVSCAPGTENLSVASSSNQVLQPPG 279
              + S     +    +   S  + +C    + LS+ +  +  ++  G
Sbjct:   253 SSGSSNSGSESSSDSGSSSDGKTTCISFHDTLSINTVDDDEIECTG 298


>MGI|MGI:1913656
            symbol:Sac3d1 "SAC3 domain containing 1" species:10090 "Mus
            musculus" [GO:0003674 "molecular_function" evidence=ND] [GO:0005737
            "cytoplasm" evidence=IEA] [GO:0005813 "centrosome" evidence=IDA]
            [GO:0005819 "spindle" evidence=IDA] [GO:0005856 "cytoskeleton"
            evidence=IEA] [GO:0007049 "cell cycle" evidence=IEA] [GO:0007067
            "mitosis" evidence=IEA] [GO:0015630 "microtubule cytoskeleton"
            evidence=IDA] [GO:0051225 "spindle assembly" evidence=IMP]
            [GO:0051298 "centrosome duplication" evidence=IMP] [GO:0051301
            "cell division" evidence=IEA] MGI:MGI:1913656 Pfam:PF03399
            GO:GO:0005737 GO:GO:0005813 GO:GO:0051301 GO:GO:0007067
            GO:GO:0051298 GO:GO:0051225 GO:GO:0005819 InterPro:IPR005062
            eggNOG:NOG296331 HOGENOM:HOG000168632 HOVERGEN:HBG108455
            EMBL:AJ131957 EMBL:BC144812 EMBL:BC145789 EMBL:AK075930
            IPI:IPI00226152 RefSeq:NP_598439.3 UniGene:Mm.303924
            ProteinModelPortal:A6H687 STRING:A6H687 PhosphoSite:A6H687
            PRIDE:A6H687 GeneID:66406 KEGG:mmu:66406 UCSC:uc008ghl.1 CTD:29901
            InParanoid:A6H687 NextBio:321591 CleanEx:MM_SAC3D1
            Genevestigator:A6H687 Uniprot:A6H687
        Length = 427

 Score = 120 (47.3 bits), Expect = 0.00091, P = 0.00091
 Identities = 52/198 (26%), Positives = 91/198 (45%)

Query:   771 YKCDQLKSIRQDLTVQRIRNQLTAKVYETHAR--LAIENGDLPEYN---------QCQSQ 819
             +  D+L+++R DL++Q + +   A V E      LA+     PE           Q Q Q
Sbjct:   172 FVADRLRAVRLDLSLQGVDDADAATVLEAALATLLAVVARVRPEETRGAADPVLLQTQVQ 231

Query:   820 -----LKILYAEGIEGCCMEFSAYHLLCVILHSNNKRELLSLMSRLSDKAKQDKAVKHAL 874
                  L+  YA G +G     +A+  L  +L++    E L  + +L    +    ++ AL
Sbjct:   232 EGFGSLRRCYARG-KGPYPRQAAFQGL-FLLYNLGSVEALQEVLQLPAALRACPPLQAAL 289

Query:   875 AVRAAVSSGNYIMFFRLYKTAPNLNTCLMDLYVEKMRFKAVSCMSRSYR-P---TVPVSY 930
             AV AA    N+   FRL +T P L +C +  ++   R KA++ +SR+   P   T+P+ +
Sbjct:   290 AVDAAFREDNHARLFRLLRTLPYLQSCAVQEHIGYARRKALARLSRALSTPKGQTLPLDF 349

Query:   931 VAQVLGFTGVSPTNEECE 948
             +   L   G+    + C+
Sbjct:   350 IEHFLALDGLQEARDLCQ 367


>DICTYBASE|DDB_G0271670
            symbol:DDB_G0271670 species:44689 "Dictyostelium
            discoideum" [GO:0005576 "extracellular region" evidence=IEA]
            dictyBase:DDB_G0271670 GO:GO:0005576 EMBL:AAFI02000006
            ProtClustDB:CLSZ2431310 RefSeq:XP_645495.1
            ProteinModelPortal:Q75JC9 EnsemblProtists:DDB0168484 GeneID:8618123
            KEGG:ddi:DDB_G0271670 OMA:EITNEEP Uniprot:Q75JC9
        Length = 374

 Score = 119 (46.9 bits), Expect = 0.00094, P = 0.00094
 Identities = 47/295 (15%), Positives = 111/295 (37%)

Query:    29 VVDASQSQASSYFPSTTGSGAVSWATHGVNNQSTENGNLSNASYHHEQHTESHVKSLQDX 88
             ++D S S +SS   S++ S + S ++   ++ S+ + + S++S      + S   S    
Sbjct:    62 ILDTSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS 121

Query:    89 XXXXXXXXXXXXXXXXVAQDYSGYTSYPNSSDPYAYGSTAYPGXXXXXXXXPNHSYPQPV 148
                              +   S  +S  +SS   +  S++            + S     
Sbjct:   122 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS 181

Query:   149 GAYQNSGAPYQPISSFQNSGSYVGPASYSATYYNPGDYQTAGGYPSSGYSHQTTSWNEGN 208
              +  +S +     SS  +S S    +S S++  +     ++    SS  S  ++S +  +
Sbjct:   182 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS 241

Query:   209 YTNYTSHQYSNYTSDTSGAYSSGTAPATSLQYQQQYKQWADYYSQTEVSCAPGTENLSVA 268
              ++ +S   S+ +S +S + SS ++ ++S          +   S +  S +  + + S +
Sbjct:   242 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS 301

Query:   269 SSSNQVLQPPGVTAGYPTAHSQPAPIYHXXXXXXXXXXXXXXXXPAATSNGSHDS 323
             SSS+        ++   ++ S  +                     +++S+ S  S
Sbjct:   302 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS 356


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.310   0.124   0.359    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0     1018       977   0.00097  122 3  11 23  0.49    34
                                                     38  0.45    37
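
Note on bit scores: the bit scores printed in parentheses after each raw
Score in this report appear to follow the standard Karlin-Altschul
normalization, bits = (Lambda * S - ln K) / ln 2, using the gapped
"As Used" values listed for Q=9,R=2 (Lambda = 0.244, K = 0.0300). The
short Python sketch below is not part of the BLAST output; the function
name bit_score is illustrative only. It reproduces several of the
reported values under that assumption.

```python
import math

# Gapped "As Used" parameters from the Q=9,R=2 row of the table above.
# Assumption: these (not the ungapped BLOSUM62 row) are what the report
# uses when converting raw scores to bits.
LAMBDA = 0.244
K = 0.0300


def bit_score(raw_score: float, lam: float = LAMBDA, k: float = K) -> float:
    """Convert a raw alignment score S to bits: (lam * S - ln k) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores taken from "Score = ..." lines in this report.
for raw in (146, 133, 123, 50, 37):
    print(f"Score = {raw} ({bit_score(raw):.1f} bits)")
```

Run as written, this should agree to one decimal place with the report
for these scores, for example Score = 133 (51.9 bits) and Score = 146
(56.5 bits).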


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  35
  No. of states in DFA:  628 (67 KB)
  Total size of DFA:  477 KB (2222 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  113.36u 0.14s 113.50t   Elapsed:  00:00:08
  Total cpu time:  113.38u 0.15s 113.53t   Elapsed:  00:00:08
  Start:  Mon May 20 16:30:31 2013   End:  Mon May 20 16:30:39 2013
