BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>010712
MGSKGRIPPPHLRRPPPGPGMMHPDPFVSGMRPPMPGAFPPFDMMPPPEVMEQKIASQHV
EMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQMRNLTEKIAKMEA
ELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQQIPALLSELESL
RQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRRADGSY
GGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSG
TPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKG
SNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYE
AQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVP
YGSATPPARSGSGQPRGGNPARR

High Scoring Gene Products

Symbol, full name | Information | P value
AT1G67170 | protein from Arabidopsis thaliana | 1.4e-87
AT3G14750 | protein from Arabidopsis thaliana | 1.4e-39
AT1G55170 | protein from Arabidopsis thaliana | 1.6e-31
AT5G61920 | protein from Arabidopsis thaliana | 2.2e-25
Vml, Vitelline membrane-like | protein from Drosophila melanogaster | 2.0e-22
AT2G30120 | protein from Arabidopsis thaliana | 5.7e-18
eif3a, Eukaryotic translation initiation factor 3 subunit A | protein from Xenopus laevis | 9.0e-15
eif3a, Eukaryotic translation initiation factor 3 subunit A | protein from Xenopus (Silurana) tropicalis | 1.3e-14
LOC100518332, Uncharacterized protein | protein from Sus scrofa | 3.1e-13
POLR2A, DNA-directed RNA polymerase II subunit RPB1 | protein from Cricetulus griseus | 3.6e-11
T17H7.1 | gene from Caenorhabditis elegans | 1.5e-09
prc, pericardin | protein from Drosophila melanogaster | 6.3e-09
fhaA, FHA domain-containing protein FhaA | protein from Mycobacterium tuberculosis | 1.2e-08
TAF15, TATA-binding protein-associated factor 2N | protein from Homo sapiens | 1.9e-08
TAF15, Uncharacterized protein | protein from Canis lupus familiaris | 2.0e-08
K02E11.10 | gene from Caenorhabditis elegans | 4.4e-08
cbpP, calcium-binding protein | gene from Dictyostelium discoideum | 5.8e-08
CG30203 | protein from Drosophila melanogaster | 9.8e-08
spt-5 | gene from Caenorhabditis elegans | 1.1e-07
spt-5, Transcription elongation factor SPT5 | protein from Caenorhabditis elegans | 1.1e-07
Krtap6-2, keratin associated protein 6-2 | protein from Mus musculus | 1.9e-07
let-2 | gene from Caenorhabditis elegans | 2.1e-07
let-2, Collagen alpha-2(IV) chain | protein from Caenorhabditis elegans | 2.1e-07
ama-1 | gene from Caenorhabditis elegans | 2.3e-07
ama-1, DNA-directed RNA polymerase II subunit RPB1 | protein from Caenorhabditis elegans | 2.3e-07
ego-2 | gene from Caenorhabditis elegans | 2.4e-07
arid1ab, AT rich interactive domain 1Ab (SWI-like) | gene_product from Danio rerio | 3.3e-07
COL4A4, Collagen alpha-4(IV) chain | protein from Homo sapiens | 5.5e-07
COL4A4, Collagen alpha-4(IV) chain | protein from Homo sapiens | 5.5e-07
CG7185 | protein from Drosophila melanogaster | 6.6e-07
AT1G33680 | protein from Arabidopsis thaliana | 9.6e-07
COL1A1, Collagen alpha-1(I) chain | protein from Gallus gallus | 1.3e-06
MGG_04961, Uncharacterized protein | protein from Magnaporthe oryzae 70-15 | 1.5e-06
swsn-1 | gene from Caenorhabditis elegans | 1.6e-06
PPP1R10, Serine/threonine-protein phosphatase 1 regulatory subunit 10 | protein from Homo sapiens | 2.0e-06
RPO21, RNA polymerase II largest subunit B220 | gene from Saccharomyces cerevisiae | 2.2e-06
COL4A4, Uncharacterized protein | protein from Nomascus leucogenys | 2.5e-06
zgc:172323 | gene_product from Danio rerio | 2.9e-06
gho, ghost | protein from Drosophila melanogaster | 3.2e-06
PPP1R10, Serine/threonine-protein phosphatase 1 regulatory subunit 10 | protein from Macaca mulatta | 3.4e-06
PPP1R10, Serine/threonine-protein phosphatase 1 regulatory subunit 10 | protein from Pan troglodytes | 3.4e-06
COL7A1, Uncharacterized protein | protein from Sus scrofa | 3.6e-06
COL3A1, Collagen alpha-1(III) chain | protein from Gallus gallus | 3.7e-06
AT1G10390 | protein from Arabidopsis thaliana | 3.8e-06
Ldb3, LIM domain binding 3 | protein from Mus musculus | 3.9e-06
LDB3, LIM domain-binding protein 3 | protein from Homo sapiens | 4.0e-06
EGK_04858, Putative uncharacterized protein | protein from Macaca mulatta | 4.1e-06
EGM_04376, Putative uncharacterized protein | protein from Macaca fascicularis | 4.1e-06
pygo2, pygopus homolog 2 (Drosophila) | gene_product from Danio rerio | 4.6e-06
COL3A1, Collagen alpha-1(III) chain | protein from Bos taurus | 4.9e-06
EWSR1, Ewing sarcoma breakpoint region 1, isoform CRA_e | protein from Homo sapiens | 5.0e-06
PPP1R10, Uncharacterized protein | protein from Canis lupus familiaris | 5.5e-06
osa | protein from Drosophila melanogaster | 6.1e-06
fus, fusion (involved in t(12;16) in malignant liposarcoma) | gene_product from Danio rerio | 7.1e-06
I3LQ53, Uncharacterized protein | protein from Sus scrofa | 7.1e-06
COL3A1, Collagen alpha-1(III) chain | protein from Bos taurus | 7.3e-06
AT2G25970 | protein from Arabidopsis thaliana | 8.3e-06
COL5A1, Uncharacterized protein | protein from Canis lupus familiaris | 9.0e-06
TFG, Uncharacterized protein | protein from Gallus gallus | 9.0e-06
Col11a1, collagen, type XI, alpha 1 | gene from Rattus norvegicus | 9.3e-06
Col11a1, Collagen alpha-1(XI) chain | protein from Rattus norvegicus | 9.3e-06
AT3G07030 | protein from Arabidopsis thaliana | 9.4e-06
RPO21 | gene_product from Candida albicans | 1.1e-05
RPO21, DNA-directed RNA polymerase | protein from Candida albicans SC5314 | 1.1e-05
SFPQ, Uncharacterized protein | protein from Gallus gallus | 1.2e-05
COL5A1, Uncharacterized protein | protein from Canis lupus familiaris | 1.2e-05
Zfp768, zinc finger protein 768 | protein from Mus musculus | 1.3e-05
Krtap21-1, keratin associated protein 21-1 | protein from Mus musculus | 1.3e-05
COL4A5, Uncharacterized protein | protein from Bos taurus | 1.4e-05
ewsr1b, Ewing sarcoma breakpoint region 1b | gene_product from Danio rerio | 1.7e-05
TAF15, TATA-binding protein-associated factor 2N | protein from Homo sapiens | 1.7e-05
E2RS29, Uncharacterized protein | protein from Canis lupus familiaris | 1.9e-05
COL3A1, Uncharacterized protein | protein from Sus scrofa | 2.0e-05
COL3A1, Collagen alpha-1(III) chain | protein from Gallus gallus | 2.2e-05
col-51 | gene from Caenorhabditis elegans | 2.3e-05
FUS, RNA-binding protein FUS | protein from Bos taurus | 2.3e-05
EWSR1, Uncharacterized protein | protein from Sus scrofa | 2.5e-05
bli-1 | gene from Caenorhabditis elegans | 2.5e-05
col11a1b, collagen, type XI, alpha 1b | gene_product from Danio rerio | 2.5e-05
COL3A1, Uncharacterized protein | protein from Canis lupus familiaris | 2.6e-05
COL4A2, Collagen alpha-2(IV) chain | protein from Bos taurus | 2.9e-05
CROCC, Uncharacterized protein | protein from Canis lupus familiaris | 3.1e-05
CROCC, Uncharacterized protein | protein from Canis lupus familiaris | 3.1e-05
Col3a1, collagen, type III, alpha 1 | protein from Mus musculus | 3.3e-05
LOC100858979, Uncharacterized protein | protein from Gallus gallus | 3.4e-05
COL5A2, Uncharacterized protein | protein from Sus scrofa | 3.6e-05
COL3A1, Uncharacterized protein | protein from Canis lupus familiaris | 4.2e-05
COL5A2, Uncharacterized protein | protein from Bos taurus | 4.3e-05
COL5A2, Uncharacterized protein | protein from Canis lupus familiaris | 4.3e-05
COL10A1, Collagen alpha-1(X) chain | protein from Gallus gallus | 4.4e-05
eif3s10, eukaryotic translation initiation factor 3, subunit 10 (theta) | gene_product from Danio rerio | 4.6e-05
swsn-1, SWI3-like protein | protein from Caenorhabditis elegans | 5.4e-05
Ccdc88b, coiled-coil domain containing 88B | protein from Mus musculus | 5.5e-05
col-103 | gene from Caenorhabditis elegans | 6.2e-05
Prpmp5, proline-rich protein MP5 | gene from Rattus norvegicus | 6.4e-05
col10a1, collagen, type X, alpha 1 | gene_product from Danio rerio | 7.0e-05

The BLAST search returned 4 gene products which did not match your query constraints. Please see the full BLAST report below for the details.


Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  010712
        (503 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2033681 - symbol:AT1G67170 "AT1G67170" species...   875  1.4e-87   1
TAIR|locus:2089616 - symbol:AT3G14750 "AT3G14750" species...   422  1.4e-39   1
TAIR|locus:2035751 - symbol:AT1G55170 "AT1G55170" species...   346  1.6e-31   1
TAIR|locus:2156146 - symbol:AT5G61920 "AT5G61920" species...   292  2.2e-25   1
FB|FBgn0085362 - symbol:Vml "Vitelline membrane-like" spe...   286  2.0e-22   1
TAIR|locus:2060848 - symbol:AT2G30120 species:3702 "Arabi...   225  5.7e-18   1
UNIPROTKB|A2VD00 - symbol:eif3a "Eukaryotic translation i...   195  9.0e-15   2
UNIPROTKB|A4II09 - symbol:eif3a "Eukaryotic translation i...   186  1.3e-14   2
UNIPROTKB|F1S187 - symbol:LOC100518332 "Uncharacterized p...   201  3.1e-13   1
UNIPROTKB|P11414 - symbol:POLR2A "DNA-directed RNA polyme...   184  3.6e-11   1
WB|WBGene00020550 - symbol:T17H7.1 species:6239 "Caenorha...   172  1.5e-09   1
FB|FBgn0028573 - symbol:prc "pericardin" species:7227 "Dr...   171  6.3e-09   1
UNIPROTKB|P71590 - symbol:fhaA "FHA domain-containing pro...   162  1.2e-08   1
UNIPROTKB|Q92804 - symbol:TAF15 "TATA-binding protein-ass...   159  1.9e-08   2
UNIPROTKB|F1PB61 - symbol:TAF15 "Uncharacterized protein"...   160  2.0e-08   2
WB|WBGene00044109 - symbol:K02E11.10 species:6239 "Caenor...   154  4.4e-08   1
DICTYBASE|DDB_G0277909 - symbol:cbpP "calcium-binding pro...   155  5.8e-08   1
FB|FBgn0050203 - symbol:CG30203 species:7227 "Drosophila ...   157  9.8e-08   1
WB|WBGene00005015 - symbol:spt-5 species:6239 "Caenorhabd...   158  1.1e-07   1
UNIPROTKB|Q21338 - symbol:spt-5 "Transcription elongation...   158  1.1e-07   1
MGI|MGI:1330280 - symbol:Krtap6-2 "keratin associated pro...   128  1.9e-07   1
WB|WBGene00002280 - symbol:let-2 species:6239 "Caenorhabd...   157  2.1e-07   1
UNIPROTKB|P17140 - symbol:let-2 "Collagen alpha-2(IV) cha...   157  2.1e-07   1
WB|WBGene00000123 - symbol:ama-1 species:6239 "Caenorhabd...   157  2.3e-07   1
UNIPROTKB|P16356 - symbol:ama-1 "DNA-directed RNA polymer...   157  2.3e-07   1
WB|WBGene00001215 - symbol:ego-2 species:6239 "Caenorhabd...   136  2.4e-07   2
ZFIN|ZDB-GENE-030131-5725 - symbol:arid1ab "AT rich inter...   157  3.3e-07   2
UNIPROTKB|J3KNM7 - symbol:COL4A4 "Collagen alpha-4(IV) ch...   153  5.5e-07   1
UNIPROTKB|P53420 - symbol:COL4A4 "Collagen alpha-4(IV) ch...   153  5.5e-07   1
UNIPROTKB|D4ADB1 - symbol:D4ADB1 "Uncharacterized protein...   148  6.3e-07   1
FB|FBgn0035872 - symbol:CG7185 species:7227 "Drosophila m...   141  6.6e-07   2
TAIR|locus:2012713 - symbol:AT1G33680 "AT1G33680" species...   144  9.6e-07   2
UNIPROTKB|P02457 - symbol:COL1A1 "Collagen alpha-1(I) cha...   149  1.3e-06   1
UNIPROTKB|G4N3H5 - symbol:MGG_04961 "Uncharacterized prot...   144  1.5e-06   1
WB|WBGene00004203 - symbol:swsn-1 species:6239 "Caenorhab...   145  1.6e-06   1
UNIPROTKB|Q96QC0 - symbol:PPP1R10 "Serine/threonine-prote...   145  2.0e-06   1
SGD|S000002299 - symbol:RPO21 "RNA polymerase II largest ...   159  2.2e-06   2
UNIPROTKB|G1RSL2 - symbol:COL4A4 "Uncharacterized protein...   147  2.5e-06   1
ZFIN|ZDB-GENE-080204-113 - symbol:zgc:172323 "zgc:172323"...   143  2.9e-06   1
FB|FBgn0262126 - symbol:gho "ghost" species:7227 "Drosoph...   135  3.2e-06   2
UNIPROTKB|Q5TM61 - symbol:PPP1R10 "Serine/threonine-prote...   143  3.4e-06   1
UNIPROTKB|Q7YR38 - symbol:PPP1R10 "Serine/threonine-prote...   143  3.4e-06   1
UNIPROTKB|F1SKM1 - symbol:COL7A1 "Uncharacterized protein...   148  3.6e-06   1
UNIPROTKB|P12105 - symbol:COL3A1 "Collagen alpha-1(III) c...   144  3.7e-06   1
TAIR|locus:2012788 - symbol:AT1G10390 "AT1G10390" species...   143  3.8e-06   1
MGI|MGI:1344412 - symbol:Ldb3 "LIM domain binding 3" spec...   141  3.9e-06   1
UNIPROTKB|O75112 - symbol:LDB3 "LIM domain-binding protei...   141  4.0e-06   1
UNIPROTKB|G7N928 - symbol:EGK_04858 "Putative uncharacter...   145  4.1e-06   1
UNIPROTKB|G7PK77 - symbol:EGM_04376 "Putative uncharacter...   145  4.1e-06   1
ZFIN|ZDB-GENE-050809-108 - symbol:pygo2 "pygopus homolog ...   139  4.6e-06   1
UNIPROTKB|P04258 - symbol:COL3A1 "Collagen alpha-1(III) c...   142  4.9e-06   1
UNIPROTKB|C9JGE3 - symbol:EWSR1 "Ewing sarcoma breakpoint...   127  5.0e-06   2
UNIPROTKB|E2R2K8 - symbol:PPP1R10 "Uncharacterized protei...   141  5.5e-06   1
FB|FBgn0261885 - symbol:osa "osa" species:7227 "Drosophil...   153  6.1e-06   2
ZFIN|ZDB-GENE-040426-1010 - symbol:fus "fusion (involved ...   137  7.1e-06   1
UNIPROTKB|I3LQ53 - symbol:I3LQ53 "Uncharacterized protein...   137  7.1e-06   1
UNIPROTKB|F1MXS8 - symbol:COL3A1 "Collagen alpha-1(III) c...   142  7.3e-06   1
TAIR|locus:2043530 - symbol:AT2G25970 "AT2G25970" species...   140  8.3e-06   2
UNIPROTKB|J9P8F7 - symbol:COL5A1 "Uncharacterized protein...   141  9.0e-06   1
UNIPROTKB|E1C0T1 - symbol:TFG "Uncharacterized protein" s...   134  9.0e-06   1
UNIPROTKB|F1LLX1 - symbol:Col11a1 "Collagen alpha-1(XI) c...   142  9.3e-06   1
RGD|2372 - symbol:Col11a1 "collagen, type XI, alpha 1" sp...   142  9.3e-06   1
UNIPROTKB|P20909 - symbol:Col11a1 "Collagen alpha-1(XI) c...   142  9.3e-06   1
TAIR|locus:2077547 - symbol:AT3G07030 species:3702 "Arabi...   134  9.4e-06   1
CGD|CAL0000919 - symbol:RPO21 species:5476 "Candida albic...   141  1.1e-05   1
UNIPROTKB|Q5ACI7 - symbol:RPO21 "DNA-directed RNA polymer...   141  1.1e-05   1
UNIPROTKB|F1P555 - symbol:SFPQ "Uncharacterized protein" ...   136  1.2e-05   1
UNIPROTKB|F1PHX8 - symbol:COL5A1 "Uncharacterized protein...   141  1.2e-05   1
MGI|MGI:2384582 - symbol:Zfp768 "zinc finger protein 768"...   135  1.3e-05   1
MGI|MGI:2157767 - symbol:Krtap21-1 "keratin associated pr...   111  1.3e-05   1
UNIPROTKB|F1N474 - symbol:COL4A5 "Uncharacterized protein...   140  1.4e-05   1
ZFIN|ZDB-GENE-030131-1600 - symbol:ewsr1b "Ewing sarcoma ...   139  1.7e-05   2
UNIPROTKB|K7EKB2 - symbol:TAF15 "TATA-binding protein-ass...   125  1.7e-05   1
UNIPROTKB|E2RS29 - symbol:E2RS29 "Uncharacterized protein...   133  1.9e-05   1
UNIPROTKB|F1RYI8 - symbol:COL3A1 "Uncharacterized protein...   138  2.0e-05   1
UNIPROTKB|F1NI73 - symbol:COL3A1 "Collagen alpha-1(III) c...   137  2.2e-05   1
WB|WBGene00000628 - symbol:col-51 species:6239 "Caenorhab...   131  2.3e-05   1
UNIPROTKB|Q28009 - symbol:FUS "RNA-binding protein FUS" s...   132  2.3e-05   1
UNIPROTKB|F1RFI8 - symbol:EWSR1 "Uncharacterized protein"...   121  2.5e-05   2
WB|WBGene00000251 - symbol:bli-1 species:6239 "Caenorhabd...   135  2.5e-05   1
ZFIN|ZDB-GENE-070912-607 - symbol:col11a1b "collagen, typ...   138  2.5e-05   1
UNIPROTKB|J9P0L0 - symbol:COL3A1 "Uncharacterized protein...   137  2.6e-05   1
UNIPROTKB|F1N7Q7 - symbol:COL4A2 "Collagen alpha-2(IV) ch...   137  2.9e-05   1
UNIPROTKB|F1LRJ1 - symbol:Col4a3 "Protein Col4a3" species...   137  3.0e-05   1
UNIPROTKB|J9P8I1 - symbol:CROCC "Uncharacterized protein"...   116  3.1e-05   2
UNIPROTKB|F1Q2C0 - symbol:CROCC "Uncharacterized protein"...   116  3.1e-05   2
MGI|MGI:88453 - symbol:Col3a1 "collagen, type III, alpha ...   136  3.3e-05   1
UNIPROTKB|F1NRH2 - symbol:LOC100858979 "Uncharacterized p...   132  3.4e-05   1
UNIPROTKB|F1RXW0 - symbol:COL5A2 "Uncharacterized protein...   135  3.6e-05   1
TAIR|locus:4010713902 - symbol:AT4G22505 species:3702 "Ar...   130  4.0e-05   1
UNIPROTKB|F1PG69 - symbol:COL3A1 "Uncharacterized protein...   135  4.2e-05   1
UNIPROTKB|F1N2Y2 - symbol:COL5A2 "Uncharacterized protein...   135  4.3e-05   1
UNIPROTKB|F1PG08 - symbol:COL5A2 "Uncharacterized protein...   135  4.3e-05   1
UNIPROTKB|P08125 - symbol:COL10A1 "Collagen alpha-1(X) ch...   131  4.4e-05   1
ZFIN|ZDB-GENE-030131-5726 - symbol:eif3s10 "eukaryotic tr...   134  4.6e-05   1
UNIPROTKB|G5EF87 - symbol:swsn-1 "SWI3-like protein" spec...   131  5.4e-05   1
MGI|MGI:1925567 - symbol:Ccdc88b "coiled-coil domain cont...   134  5.5e-05   1
WB|WBGene00000677 - symbol:col-103 species:6239 "Caenorha...   126  6.2e-05   1
RGD|628797 - symbol:Prpmp5 "proline-rich protein MP5" spe...   124  6.4e-05   1
ZFIN|ZDB-GENE-030131-8373 - symbol:col10a1 "collagen, typ...   129  7.0e-05   1

WARNING:  Descriptions of 141 database sequences were not reported due to the
          limiting value of parameter V = 100.
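
The summary table above can be read programmatically. The following is a minimal sketch, assuming each hit line has the layout seen in this report: a `DB|accession` identifier, a ` - ` separator, a truncated description, then the integer score, the P(N) value in decimal or exponent notation, and N. The name `parse_hit_line` and the regular expression are illustrative assumptions, not part of any BLAST distribution.

```python
import re

# Assumed layout of one summary line (taken from the lines in this report):
#   DB|accession - description   SCORE  P(N)  N
HIT_LINE = re.compile(
    r"^(?P<db>[A-Za-z]+)\|(?P<acc>\S+) - (?P<desc>.*?)\s+"
    r"(?P<score>\d+)\s+(?P<p>\d+(?:\.\d+)?(?:e[-+]?\d+)?)\s+(?P<n>\d+)\s*$"
)


def parse_hit_line(line):
    """Return a dict for one hit line, or None if the line is not a hit."""
    m = HIT_LINE.match(line)
    if m is None:
        return None
    return {
        "db": m.group("db"),            # source database, e.g. TAIR
        "accession": m.group("acc"),    # identifier within that database
        "description": m.group("desc"), # truncated description, kept as-is
        "score": int(m.group("score")), # high score
        "p_value": float(m.group("p")), # smallest sum probability P(N)
        "n": int(m.group("n")),         # number of segment pairs, N
    }


line = 'TAIR|locus:2033681 - symbol:AT1G67170 "AT1G67170" species...   875  1.4e-87   1'
print(parse_hit_line(line))
```

Lines that do not start with a `DB|accession` identifier, such as the column headers or the WARNING line, return `None` and can be skipped.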


>TAIR|locus:2033681
            symbol:AT1G67170 "AT1G67170" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] EMBL:CP002684 EMBL:BT005883
            EMBL:AK228253 IPI:IPI00547288 RefSeq:NP_176888.2 UniGene:At.35681
            ProteinModelPortal:Q84TD8 SMR:Q84TD8 IntAct:Q84TD8 PRIDE:Q84TD8
            EnsemblPlants:AT1G67170.1 GeneID:843037 KEGG:ath:AT1G67170
            TAIR:At1g67170 HOGENOM:HOG000005883 InParanoid:Q84TD8 OMA:MESKGRI
            PhylomeDB:Q84TD8 ProtClustDB:CLSN2918424 Genevestigator:Q84TD8
            Uniprot:Q84TD8
        Length = 359

 Score = 875 (313.1 bits), Expect = 1.4e-87, P = 1.4e-87
 Identities = 189/332 (56%), Positives = 229/332 (68%)

Query:    30 GMRPPMP--GAFPPFDMMPPPEVMEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQ 87
             G  PP    G +P F+M+PPPEVMEQK  +QH E+Q+LA ENQRL  THG+LRQELAAAQ
Sbjct:    35 GAIPPSAAQGVYPSFNMLPPPEVMEQKFVAQHGELQRLAIENQRLGGTHGSLRQELAAAQ 94

Query:    88 HELQILHGQIGGMKSERELQMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVARE 147
             HE+Q+LH QIG MKSERE +M  L EK+AKME EL+ +E VKLE Q+++ EA++LVVARE
Sbjct:    95 HEIQMLHAQIGSMKSEREQRMMGLAEKVAKMETELQKSEAVKLEMQQARAEARSLVVARE 154

Query:   148 ELIAKVHQLTQDLQRAHTDVQQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQV 207
             EL++KVHQLTQ+LQ++ +DVQQIPAL+SELE+LRQEY  CR TY+YEKKFYNDHLESLQ 
Sbjct:   155 ELMSKVHQLTQELQKSRSDVQQIPALMSELENLRQEYQQCRATYDYEKKFYNDHLESLQA 214

Query:   208 MEKNYITMATEVEKLRAELMNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGV 267
             MEKNY+TMA EVEKL+A+LMN  N DRRA G YG    N+E + SG   G   YED +G 
Sbjct:   215 MEKNYMTMAREVEKLQAQLMNNANSDRRAGGPYGNNI-NAEIDASGHQSGNGYYEDAFG- 272

Query:   268 PQGHGPPPSATTAGVVGAGPNTSTSA--Y---AATQSGT-PMRAAYDIPRGPGYEASKGP 321
             PQG+ P P A  A     GPN+   A  Y     TQ G  P R  Y+ PRGP    S  P
Sbjct:   273 PQGYIPQPVAGNA----TGPNSVVGAAQYPYQGVTQPGYFPQRPGYNFPRGP--PGSYDP 326

Query:   322 GYDASKAPSYDP-TKGPSYD-PAKGPGYDPTK 351
                    P   P   GPS + P  G   +P++
Sbjct:   327 TTRLPTGPYGAPFPPGPSNNTPYAGTHGNPSR 358
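
The figures in the alignment header above can be rechecked by hand. The sketch below assumes two things: that the Identities and Positives percentages are the match counts divided by the aligned length and truncated to an integer (this is what the numbers in this report are consistent with, for example 189/332 printed as 56%), and that P and Expect are linked by the usual Poisson relationship P = 1 - exp(-E). The function names `percent` and `p_from_expect` are hypothetical.

```python
import math


def percent(matches, aligned_length):
    """Integer percentage, truncated rather than rounded (an assumption
    inferred from the values printed in this report)."""
    return (100 * matches) // aligned_length


def p_from_expect(expect):
    """P = 1 - exp(-E). expm1 keeps precision when E is very small."""
    return -math.expm1(-expect)


print(percent(189, 332))        # identities for the AT1G67170 alignment
print(percent(229, 332))        # positives for the AT1G67170 alignment
print(p_from_expect(1.4e-87))   # for tiny E, P is numerically equal to E
```

For very small Expect values the two statistics are indistinguishable, which is why every alignment in this report prints the same number for Expect and P.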


>TAIR|locus:2089616
            symbol:AT3G14750 "AT3G14750" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0048573 "photoperiodism, flowering" evidence=RCA]
            EMBL:CP002686 EMBL:AY035083 EMBL:AY051034 IPI:IPI00544941
            RefSeq:NP_566492.1 UniGene:At.20367 ProteinModelPortal:Q93V84
            SMR:Q93V84 PaxDb:Q93V84 PRIDE:Q93V84 EnsemblPlants:AT3G14750.1
            GeneID:820703 KEGG:ath:AT3G14750 TAIR:At3g14750 eggNOG:NOG236769
            HOGENOM:HOG000242815 InParanoid:Q93V84 OMA:YAENYEH PhylomeDB:Q93V84
            ProtClustDB:CLSN2688383 ArrayExpress:Q93V84 Genevestigator:Q93V84
            Uniprot:Q93V84
        Length = 331

 Score = 422 (153.6 bits), Expect = 1.4e-39, P = 1.4e-39
 Identities = 99/256 (38%), Positives = 149/256 (58%)

Query:    45 MPPP-EVMEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQ-ILHGQIGGMKS 102
             +PP   ++E ++A+Q+ ++Q L  +NQRLAATH  L+QEL  AQHELQ I+H  I  +++
Sbjct:    63 LPPQFSILEDRLAAQNQDVQGLLADNQRLAATHVALKQELEVAQHELQRIMH-YIDSLRA 121

Query:   103 ERELQMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQR 162
             E E+ MR + +K  + E EL+  + ++ E QK + + +     R+EL ++VH +TQDL R
Sbjct:   122 EEEIMMREMYDKSMRSEMELREVDAMRAEIQKIRADIKEFTSGRQELTSQVHLMTQDLAR 181

Query:   163 AHTDVQQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKL 222
                D+QQIP L +E+E+ +QE    R   +YEKK Y ++ E  ++ME   + MA E+EKL
Sbjct:   182 LTADLQQIPTLTAEIENTKQELQRARAAIDYEKKGYAENYEHGKIMEHKLVAMARELEKL 241

Query:   223 RAELMNAPNVDRRADG--------SYGGATGNSENETSGRPVGQNAYEDGYGV-PQ---- 269
             RAE+ N+      A+G        +YGG  GN E   +G PV  N Y+  Y + P     
Sbjct:   242 RAEIANS-ETSAYANGPVGNPGGVAYGGGYGNPE---AGYPV--NPYQPNYTMNPAQTGV 295

Query:   270 -GHGPPPSATTAGVVG 284
              G+ PPP    A   G
Sbjct:   296 VGYYPPPYGPQAAWAG 311


>TAIR|locus:2035751
            symbol:AT1G55170 "AT1G55170" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] EMBL:CP002684 GenomeReviews:CT485782_GR EMBL:AC073944
            EMBL:AY084916 EMBL:BT006117 EMBL:AK118721 IPI:IPI00529305
            RefSeq:NP_564678.1 UniGene:At.37108 ProteinModelPortal:Q9C717
            SMR:Q9C717 PaxDb:Q9C717 PRIDE:Q9C717 EnsemblPlants:AT1G55170.1
            GeneID:841960 KEGG:ath:AT1G55170 TAIR:At1g55170 eggNOG:NOG306311
            InParanoid:Q9C717 OMA:ELHRMNL PhylomeDB:Q9C717
            ProtClustDB:CLSN2688822 ArrayExpress:Q9C717 Genevestigator:Q9C717
            Uniprot:Q9C717
        Length = 283

 Score = 346 (126.9 bits), Expect = 1.6e-31, P = 1.6e-31
 Identities = 87/240 (36%), Positives = 131/240 (54%)

Query:    32 RPPMPGAFPPFDMMPPPEVMEQ------KIASQHVEMQKLATENQRLAATHGTLRQELAA 85
             RP + G  PP    PPP ++E       +I  Q  E+++L ++N  LA     L +EL A
Sbjct:    25 RPFLRG--PPLLQPPPPSLLEDLQIQEGEIRRQDAEIRRLLSDNHGLADDRMVLERELVA 82

Query:    86 AQHELQILHGQIGGMKSERELQMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVA 145
             A+ EL  ++  I  +++E++LQ+R  +EK  K+E +++  E  K E  + + E Q L   
Sbjct:    83 AKEELHRMNLMISDLRAEQDLQLREFSEKRHKLEGDVRAMESYKKEASQLRGEVQKLDEI 142

Query:   146 REELIAKVHQLTQDLQRAHTDVQQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESL 205
             + EL   V  L +DL +  +D +QIP + +E++ L++E  H R   EYEKK   + +E  
Sbjct:   143 KRELSGNVQLLRKDLAKLQSDNKQIPGMRAEVKDLQKELMHARDAIEYEKKEKFELMEQR 202

Query:   206 QVMEKNYITMATEVEKLRAELMNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGY 265
             Q MEKN ++MA EVEKLRAEL     VD R  G +GG+ G + N   G   G     D Y
Sbjct:   203 QTMEKNMVSMAREVEKLRAELAT---VDSRPWG-FGGSYGMNYNNMDGTFRGSYGENDTY 258


>TAIR|locus:2156146
            symbol:AT5G61920 "AT5G61920" species:3702 "Arabidopsis
            thaliana" [GO:0008150 "biological_process" evidence=ND]
            EMBL:CP002688 GenomeReviews:BA000015_GR EMBL:AB022212
            UniGene:At.55672 EMBL:DQ447104 IPI:IPI00520542
            RefSeq:NP_001119474.1 RefSeq:NP_200998.1 PRIDE:Q9FH51
            EnsemblPlants:AT5G61920.1 EnsemblPlants:AT5G61920.2 GeneID:836313
            KEGG:ath:AT5G61920 TAIR:At5g61920 eggNOG:NOG265125
            HOGENOM:HOG000090683 InParanoid:Q9FH51 OMA:KAHIRSI PhylomeDB:Q9FH51
            ProtClustDB:CLSN2686951 Genevestigator:Q9FH51 Uniprot:Q9FH51
        Length = 238

 Score = 292 (107.8 bits), Expect = 2.2e-25, P = 2.2e-25
 Identities = 64/183 (34%), Positives = 107/183 (58%)

Query:    49 EVMEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQM 108
             +++E KIA Q  E+ +L+ +N++LA+++  L+++L  A  E+Q L   I   +++ E+Q+
Sbjct:    51 DILENKIAVQAAEIDRLSNDNRKLASSYVALKEDLTVADREVQGLRAHIRKTETDHEIQI 110

Query:   109 RNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQ 168
             R+  EKIAKME  +K  E ++ E Q +  EA  L   REEL +KV    +DL++   + +
Sbjct:   111 RSTLEKIAKMEGMVKNRENIRREVQSAHIEAHRLAREREELASKVKLGMKDLKKVCLEAE 170

Query:   169 QIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMN 228
              + A   ELE L++E+   R  +E EK    + L  L+ ME+  I     +EKLR+E+  
Sbjct:   171 SLEASSQELERLKEEHQRLRKEFEEEKSGNVEKLAQLKGMERKIIGAVKAIEKLRSEIST 230

Query:   229 APN 231
             A N
Sbjct:   231 ARN 233


>FB|FBgn0085362
            symbol:Vml "Vitelline membrane-like" species:7227 "Drosophila
            melanogaster" [GO:0009950 "dorsal/ventral axis specification"
            evidence=IGI] [GO:0060388 "vitelline envelope" evidence=IDA]
            [GO:0007305 "vitelline membrane formation involved in
            chorion-containing eggshell formation" evidence=ISM] [GO:0008316
            "structural constituent of vitelline membrane" evidence=ISM]
            [GO:0035805 "egg coat" evidence=ISM] EMBL:AE014298 GO:GO:0009950
            GeneTree:ENSGT00700000104744 PROSITE:PS51137 GO:GO:0060388
            InterPro:IPR013135 RefSeq:NP_001096866.1 UniGene:Dm.32785
            STRING:A8JUV4 EnsemblMetazoa:FBtr0112535 GeneID:5740271
            KEGG:dme:Dmel_CG34333 UCSC:CG34333-RA CTD:5740271
            FlyBase:FBgn0085362 eggNOG:NOG284187 InParanoid:A8JUV4 OMA:ISKYETI
            OrthoDB:EOG4KPRTT GenomeRNAi:5740271 NextBio:20891311 Bgee:A8JUV4
            Uniprot:A8JUV4
        Length = 578

 Score = 286 (105.7 bits), Expect = 2.0e-22, P = 2.0e-22
 Identities = 83/283 (29%), Positives = 99/283 (34%)

Query:   229 APNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQG---HGPPPSATTAGVVGA 285
             AP+    A  SY      S +  S  P         Y  P     H P   A++     A
Sbjct:   198 APSYSAPAAPSYSAPAAPSYSAPSA-PSYSAQKTSSYSAPAAPSYHAPAAPASSYSAP-A 255

Query:   286 GPNTSTSA---YAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPA 342
             GP+ S  A   Y+A     P  ++Y   + P Y A   P Y A  APSY  +  PSY   
Sbjct:   256 GPSYSAPAAPSYSAPSYSAPA-SSYSALKAPSYSAPAAPSYSAPAAPSYSSSASPSYSSP 314

Query:   343 KGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPG 402
                 Y     P Y A K  +Y A   P+Y     PSY       Y     P+Y     P 
Sbjct:   315 ASSSYSAPAAPTYSAPKAQSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPS 374

Query:   403 YETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGA 462
             Y     P Y       Y A  APSY     P Y       Y    APSY       +  A
Sbjct:   375 YSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYS-A 433

Query:   463 PRGAAPHGQVPP-PLNNVPYGSATPPARS---GSGQPRGGNPA 501
             P  AAP    P  P  + P  S    AR+   GS  P  G  A
Sbjct:   434 P--AAPSYSAPAAPSYSAPASSGYSAARAYSAGSAAPASGYSA 474

 Score = 274 (101.5 bits), Expect = 4.7e-21, P = 4.7e-21
 Identities = 80/271 (29%), Positives = 97/271 (35%)

Query:   243 ATGNSENETSGRPVGQNAYEDGYG--VP-QGHGPP------PSATTAGVVG-AGPNTSTS 292
             AT N E +  G P  +  YE+ +   +P Q + PP       S + A   G + P     
Sbjct:    24 ATRNEEFD-DGFPESEFDYEERHTREIPAQAYAPPIVYNSQSSYSPAKDQGYSAPAAPVY 82

Query:   293 AYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG 352
             + AA     P   +Y  P  P Y A   P Y A  APSY     PSY       Y     
Sbjct:    83 SPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPASSSYSAPAA 142

Query:   353 PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYD 412
             P Y A    +Y A   P+Y      SY       Y     P+Y     P Y     P Y 
Sbjct:   143 PSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYS 202

Query:   413 VQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYD-PSRGTGFDGAPRGAAPHGQ 471
                 P Y A  APSY     P Y  Q+   Y    APSY  P+       AP G  P   
Sbjct:   203 APAAPSYSAPAAPSYSAPSAPSYSAQKTSSYSAPAAPSYHAPAAPASSYSAPAG--PSYS 260

Query:   472 VPP-PLNNVPYGSATPPARSGSGQPRGGNPA 501
              P  P  + P  SA   + S    P    PA
Sbjct:   261 APAAPSYSAPSYSAPASSYSALKAPSYSAPA 291

 Score = 262 (97.3 bits), Expect = 1.1e-19, P = 1.1e-19
 Identities = 69/246 (28%), Positives = 83/246 (33%)

Query:   265 YGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGY 323
             Y  P  +    S + A   G + P     + AA     P   +Y  P  P Y A   P Y
Sbjct:    54 YAPPIVYNSQSSYSPAKDQGYSAPAAPVYSPAAPSYSAPAAPSYSAPAAPSYSAPAAPSY 113

Query:   324 DASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQR 383
              A  APSY     PSY       Y     P Y A    +Y A   P+Y      SY    
Sbjct:   114 SAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPAAPSYSAPASSSYSAPA 173

Query:   384 GLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGY 443
                Y     P+Y     P Y     P Y     P Y A  APSY     P Y  Q+   Y
Sbjct:   174 APSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPSAPSYSAQKTSSY 233

Query:   444 DMRRAPSYD-PSRGTGFDGAPRGAAPHGQVPP----PLNNVP---YGSATPPARSGSGQP 495
                 APSY  P+       AP G +      P    P  + P   Y +   P+ S    P
Sbjct:   234 SAPAAPSYHAPAAPASSYSAPAGPSYSAPAAPSYSAPSYSAPASSYSALKAPSYSAPAAP 293

Query:   496 RGGNPA 501
                 PA
Sbjct:   294 SYSAPA 299

 Score = 259 (96.2 bits), Expect = 2.4e-19, P = 2.4e-19
 Identities = 66/241 (27%), Positives = 84/241 (34%)

Query:   265 YGVPQG--HGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPG 322
             Y  P G  +  P + + +    + P +S SA  A     P   +Y  P  P Y +S  P 
Sbjct:   251 YSAPAGPSYSAPAAPSYSAPSYSAPASSYSALKAPSYSAPAAPSYSAPAAPSYSSSASPS 310

Query:   323 YD--------ASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIH 374
             Y         A  AP+Y   K  SY     P Y     P Y A   S+Y A   P+Y   
Sbjct:   311 YSSPASSSYSAPAAPTYSAPKAQSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAP 370

Query:   375 RGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPG 434
               PSY       Y      +Y     P Y     P Y       Y A  APSY     P 
Sbjct:   371 AAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPS 430

Query:   435 YDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQ 494
             Y       Y    APSY     +G+  A R  +     P    + P  S+   A + SG 
Sbjct:   431 YSAPAAPSYSAPAAPSYSAPASSGYSAA-RAYSAGSAAPASGYSAPKTSSGYSAPASSGS 489

Query:   495 P 495
             P
Sbjct:   490 P 490

 Score = 254 (94.5 bits), Expect = 8.7e-19, P = 8.7e-19
 Identities = 73/277 (26%), Positives = 91/277 (32%)

Query:   229 APNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPN 288
             AP+    A  SY      S +  +       A    Y  P         T++    A P+
Sbjct:   182 APSYSAPAAPSYSAPAAPSYSAPAAPSYSAPA-APSYSAPSAPSYSAQKTSSYSAPAAPS 240

Query:   289 TSTSAYAATQSGTPMRAAYDIPRGPGYEASK--GPG--YDASKAPSYDPTKGPSYDPAKG 344
                 A  A+    P   +Y  P  P Y A     P   Y A KAPSY     PSY     
Sbjct:   241 YHAPAAPASSYSAPAGPSYSAPAAPSYSAPSYSAPASSYSALKAPSYSAPAAPSYSAPAA 300

Query:   345 PGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE 404
             P Y  +  P Y +   S+Y A   P Y   +  SY       Y     P+Y       Y 
Sbjct:   301 PSYSSSASPSYSSPASSSYSAPAAPTYSAPKAQSYSAPAAPSYSAPAAPSYSAPASSSYS 360

Query:   405 TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPR 464
                 P Y     P Y A  APSY       Y       Y    APSY     + +  AP 
Sbjct:   361 APAAPSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPASSSYS-AP- 418

Query:   465 GAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPA 501
              AAP    P   +   Y +   P+ S    P    PA
Sbjct:   419 -AAPSYSAPAAPS---YSAPAAPSYSAPAAPSYSAPA 451

 Score = 220 (82.5 bits), Expect = 5.6e-15, P = 5.6e-15
 Identities = 80/278 (28%), Positives = 94/278 (33%)

Query:   227 MNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAG 286
             + AP+    A  SY      S + +S  P   +     Y  P    P  SA  A    A 
Sbjct:   282 LKAPSYSAPAAPSYSAPAAPSYS-SSASPSYSSPASSSYSAPAA--PTYSAPKAQSYSAP 338

Query:   287 PNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG 346
                S SA AA     P  ++Y  P  P Y A   P Y A  APSY      SY     P 
Sbjct:   339 AAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAPS 398

Query:   347 YDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQ 406
             Y     P Y A   S+Y A   P+Y     PSY       Y     P+Y      GY   
Sbjct:   399 YSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPASSGYSAA 458

Query:   407 RVPGYDVQRGPVYEAQRAPSY-IPQRGPGYDLQRGQGYDMRRAPSYD-PSRGTGFDG--A 462
             R   Y    G    A  A  Y  P+   GY      G     A SY  P+  T   G  A
Sbjct:   459 RA--YSA--G---SAAPASGYSAPKTSSGYSAPASSGSPA--ASSYSAPASSTASSGYSA 509

Query:   463 P--------RGAAPHGQVPPPLNNVPYGSATPPARSGS 492
             P        R    H  +        YGSA P A  G+
Sbjct:   510 PASKSSGYARSEMDHQILGMARTAGGYGSAAPSAAYGA 547


>TAIR|locus:2060848
            symbol:AT2G30120 species:3702 "Arabidopsis thaliana"
            [GO:0009507 "chloroplast" evidence=ISM] EMBL:CP002685
            IPI:IPI00938894 RefSeq:NP_001154541.1 UniGene:At.19562
            EnsemblPlants:AT2G30120.2 GeneID:817564 KEGG:ath:AT2G30120
            OMA:PEANGTH Uniprot:F4IMQ0
        Length = 288

 Score = 225 (84.3 bits), Expect = 5.7e-18, P = 5.7e-18
 Identities = 68/234 (29%), Positives = 113/234 (48%)

Query:    50 VMEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQMR 109
             ++E +IA QH E+Q L  +NQRLA  H  L+ +L  A+ EL+ L      +K+E E ++R
Sbjct:    38 ILEDRIAIQHREIQSLLNDNQRLAVAHIGLKDQLNVAKRELERLLETAVKVKAEGEAKVR 97

Query:   110 NLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQQ 169
              + +   +MEAE +  + +  E  + +++ Q L   R+EL  ++     ++ +A  +  +
Sbjct:    98 EVYQNALRMEAEARVIDGLGAELGQVRSDVQRLGSDRQELATELAMFDDEMAKAKPNSDR 157

Query:   170 IPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNA 229
                +  E+E LR E    R   E EKK    +L   + MEK    +  E+ KL  EL++ 
Sbjct:   158 AIEVKLEIEILRGEIRKGRAALELEKKTRASNLHHERGMEKTIDHLNREIVKLEEELVDL 217

Query:   230 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVV 283
                 R A+ +   A   S    +    G N  +D YG  QG   P +  T  +V
Sbjct:   218 ETKAREANAAAEAAPTPSPGLAAS--YGNNT-DDIYG-GQGRQYPEANGTHELV 267


>UNIPROTKB|A2VD00
            symbol:eif3a "Eukaryotic translation initiation factor 3
            subunit A" species:8355 "Xenopus laevis" [GO:0001732 "formation of
            translation initiation complex" evidence=ISS] [GO:0005852
            "eukaryotic translation initiation factor 3 complex" evidence=ISS]
            [GO:0003743 "translation initiation factor activity" evidence=ISS]
            InterPro:IPR000717 Pfam:PF01399 SMART:SM00088 GO:GO:0003743
            GO:GO:0005852 KO:K03254 HAMAP:MF_03000 HOVERGEN:HBG006128
            GO:GO:0001732 EMBL:BC129055 RefSeq:NP_001085285.1 UniGene:Xl.57279
            PRIDE:A2VD00 GeneID:443632 KEGG:xla:443632 Uniprot:A2VD00
        Length = 1424

 Score = 195 (73.7 bits), Expect = 1.2e-11, P = 1.2e-11
 Identities = 120/453 (26%), Positives = 179/453 (39%)

Query:    50 VMEQKIASQHVEMQKLATENQRLAATHGT-LRQELAAAQHELQILHGQIGGMKSER--EL 106
             V   K + Q V   KL    +RLA      L +     + E +I + +    + +R  E 
Sbjct:   761 VSNLKASRQSVYDAKLKQFQERLAEEKRVRLEERKRQRKEERRISYYRDKEEEEQRLIEE 820

Query:   107 QMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTD 166
             Q++   E   K+E E + AE  + + +  K E Q       EL  +  +  +D +R   D
Sbjct:   821 QLKQEREDREKIENEKREAEQREYQERLKKLEEQERKKRLRELEIEEREKKRDEERRGPD 880

Query:   167 ----VQQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKL 222
                  Q  P+   +    R+E    RG    E+K      +     + +      + E  
Sbjct:   881 DSFRKQDTPSRWGD----REESGWRRGADPDERKQAPPERDWRSGGQDSKPVKDEDREGD 936

Query:   223 RAELMNAPNVD-RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAG 281
                ++        R DG    A   S   T  R   ++  EDG G  +G    P      
Sbjct:   937 EDSVLRKDEEQVARGDGDEERAA--SWRGTDDRGPKRSVEEDG-GPRRGFNDEPGPRRGF 993

Query:   282 VVGAGPNTSTSAYAATQSGTPMRAAYDIPRGP--GYEASKGP--GYDASKAP--SYDPTK 335
                 GP          +   P R   D  RGP  G +  +GP  G D  + P    D  +
Sbjct:   994 EDDQGPRRGLD-----EDRGPRRGL-DEDRGPRRGLDEDRGPRRGLDEDRGPRRGLDEDR 1047

Query:   336 GP--SYDPAKGP--GYDPTKGP--GYDAQKGSN--YDAQRGP--NYDIHRGPSYDPQRGL 385
             GP    D  +GP  G D  +GP  G+D  +G    +D  RGP  ++D  RGP    +RG 
Sbjct:  1048 GPRRGLDEDRGPRRGLDEDRGPRRGFDEDRGPRRGFDEDRGPRRDFDEDRGP----RRG- 1102

Query:   386 GYDMQRGPN--YDMQRGP--GYETQRVP--GYDVQRGPV--YEAQRAP--SYIPQRGP-- 433
              +D  RGP   +D  RGP  G++  R P  G+D  RGP   ++  R P   +   RGP  
Sbjct:  1103 -FDEDRGPRRGFDEDRGPRRGFDEDRGPRRGFDDDRGPRRGFDDDRGPRRGFEDDRGPRR 1161

Query:   434 GYDLQRG--QGYDMRRAPSYDPSRGTGFDGAPR 464
             G++  RG  +G++  R P     RG   D  PR
Sbjct:  1162 GFEDDRGPRRGFEDDRGPR----RGFDEDRTPR 1190

 Score = 184 (69.8 bits), Expect = 9.0e-15, Sum P(2) = 9.0e-15
 Identities = 66/197 (33%), Positives = 90/197 (45%)

Query:   304 RAAYDIPRGP--GYEASKGP--GYDASKAP--SYDPTKGP--SYDPAKGP--GYDPTKGP 353
             R   D  RGP  G +  +GP  G D  + P    D  +GP   +D  +GP  G+D  +GP
Sbjct:  1030 RRGLDEDRGPRRGLDEDRGPRRGLDEDRGPRRGLDEDRGPRRGFDEDRGPRRGFDEDRGP 1089

Query:   354 GYDAQKGSNYDAQRGPN--YDIHRGP--SYDPQRG--LGYDMQRGPN--YDMQRGP--GY 403
               D      +D  RGP   +D  RGP   +D  RG   G+D  RGP   +D  RGP  G+
Sbjct:  1090 RRD------FDEDRGPRRGFDEDRGPRRGFDEDRGPRRGFDEDRGPRRGFDDDRGPRRGF 1143

Query:   404 ETQRVP--GYDVQRGPV--YEAQRAP--SYIPQRGP--GYDLQRG--QGYDMRRAP--SY 451
             +  R P  G++  RGP   +E  R P   +   RGP  G+D  R   +G++  R P    
Sbjct:  1144 DDDRGPRRGFEDDRGPRRGFEDDRGPRRGFEDDRGPRRGFDEDRTPRRGFEDDRGPRRGM 1203

Query:   452 DPSRGTGFDGAPRGAAP 468
             D  R +   GA     P
Sbjct:  1204 DEERVSWRGGAEEDRGP 1220

 Score = 159 (61.0 bits), Expect = 4.9e-12, Sum P(2) = 4.9e-12
 Identities = 61/197 (30%), Positives = 91/197 (46%)

Query:   304 RAAYDIPRGP--GYEASKGP--GYDASKAP--SYDPTKGP--SYDPAKGP--GYDPTKGP 353
             R  +D  RGP   ++  +GP  G+D  + P   +D  +GP   +D  +GP  G+D  +GP
Sbjct:  1080 RRGFDEDRGPRRDFDEDRGPRRGFDEDRGPRRGFDEDRGPRRGFDEDRGPRRGFDDDRGP 1139

Query:   354 GYDAQKGSNYDAQRGPN--YDIHRGPSYDPQRGLGYDMQRGPN--YDMQRGP--GYETQR 407
                 ++G  +D  RGP   ++  RGP    +RG  ++  RGP   ++  RGP  G++  R
Sbjct:  1140 ----RRG--FDDDRGPRRGFEDDRGP----RRG--FEDDRGPRRGFEDDRGPRRGFDEDR 1187

Query:   408 VP--GYDVQRGPV--YEAQRAP---SYIPQRGPGYDLQRGQGYDMRRAPSYD--PSRGTG 458
              P  G++  RGP    + +R          RGP    +  +G   RR    D  P RG  
Sbjct:  1188 TPRRGFEDDRGPRRGMDEERVSWRGGAEEDRGPRRGAEEDRG--PRRGAEEDRGPRRGAE 1245

Query:   459 FDGAPRGAAPH--GQVP 473
              D  PR  A    GQ P
Sbjct:  1246 EDRGPRRGAEEDRGQTP 1262

 Score = 91 (37.1 bits), Expect = 9.0e-15, Sum P(2) = 9.0e-15
 Identities = 42/178 (23%), Positives = 84/178 (47%)

Query:    53 QKIASQHVEMQ--KLATENQRLAAT--HGTLRQELAAAQHELQILHGQIGGMKSERELQM 108
             Q + S+H+  Q   ++T   +  AT     + QE    QH++ I   Q    K  + +  
Sbjct:   512 QNMPSEHIRNQLTAMSTVLSKAVATIKPAHVLQE-KEEQHQIAISAYQKNSRKEHQRILT 570

Query:   109 RNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQ 168
             R  T +  K   E    +  K E ++ + E Q +  A EE + +  +  ++ +R   + +
Sbjct:   571 RRQTIEERKERLENLNIQREKEEHEQREAELQKVRKAEEERLRQEAK-EREKERILQEHE 629

Query:   169 QIPALLSELESLRQEYHHCRGTYEYEKKFYND-HLESLQVMEKNYITMATEVEKLRAE 225
             QI     + +++R+     + T E+  K + D  +E+L+ ++ ++I MA +VE+L  E
Sbjct:   630 QI-----KKKTVRERLEQIKKT-EFGAKAFKDIDIENLEELDPDFI-MAKQVEQLEKE 680

 Score = 54 (24.1 bits), Expect = 6.1e-11, Sum P(2) = 6.1e-11
 Identities = 17/62 (27%), Positives = 32/62 (51%)

Query:   162 RAHTDVQQI--PALLSELE---SLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMA 216
             + H   Q+I  P +L  LE    LR+ +    G Y+Y+      +++SL+ + + Y+ +A
Sbjct:    38 KKHRTWQKIHEPIMLKYLELCVDLRKSHLAKEGLYQYKNICQQVNIKSLEDVVRAYLKLA 97

Query:   217 TE 218
              E
Sbjct:    98 EE 99


>UNIPROTKB|A4II09
            symbol:eif3a "Eukaryotic translation initiation factor 3
            subunit A" species:8364 "Xenopus (Silurana) tropicalis" [GO:0001732
            "formation of translation initiation complex" evidence=ISS]
            [GO:0005852 "eukaryotic translation initiation factor 3 complex"
            evidence=ISS] [GO:0003743 "translation initiation factor activity"
            evidence=ISS] InterPro:IPR000717 Pfam:PF01399 SMART:SM00088
            GO:GO:0003743 GO:GO:0005852 eggNOG:NOG236708 HOGENOM:HOG000246822
            KO:K03254 HAMAP:MF_03000 HOVERGEN:HBG006128 GO:GO:0001732 CTD:8661
            EMBL:BC135790 RefSeq:NP_001096173.1 UniGene:Str.55518 STRING:A4II09
            PRIDE:A4II09 GeneID:100124719 KEGG:xtr:100124719
            Xenbase:XB-GENE-994394 Uniprot:A4II09
        Length = 1391

 Score = 186 (70.5 bits), Expect = 1.3e-14, Sum P(2) = 1.3e-14
 Identities = 68/224 (30%), Positives = 101/224 (45%)

Query:   266 GVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGP--GYEASKGP-- 321
             G+ +  GP      AG    G           +     R  +D  RGP  G++  +GP  
Sbjct:   981 GLEEDRGPRRGIDDAGP-RRGFEEDRGPRRGIEDDRAPRRGFDDDRGPRRGFDDDRGPRR 1039

Query:   322 GYDASKAP--SYDPTKGP--SYDPAKGP--GYDPTKGP--GYDAQKGSN--YDAQRGPN- 370
             G+D  + P    D  +GP   +D  + P  G+D  +GP  G+D  +G    +D  RGP  
Sbjct:  1040 GFDEDRGPRRGIDDDRGPRRGFDEDRTPRRGFDDDRGPRRGFDDDRGPRRGFDEDRGPRR 1099

Query:   371 -YDIHRGP--SYDPQRG--LGYDMQRGPN--YDMQRGP--GYETQRVP--GYDVQRGPVY 419
              ++  RGP   ++  RG   G++  RGP   ++  RGP  G+E  R P  G+D  RGP  
Sbjct:  1100 GFEDDRGPRRGFEDDRGPRRGFEDDRGPRRGFEDDRGPRRGFEDDRGPRRGFDEDRGP-- 1157

Query:   420 EAQRAPSYIPQRGP--GYDLQRG--QGYDMRRAP--SYDPSRGT 457
               +R   +   RGP  G+D  R   +G+D  R P    D  RG+
Sbjct:  1158 --RRG--FEDDRGPRRGFDEDRTPRRGFDDDRGPRRGLDEDRGS 1197

 Score = 183 (69.5 bits), Expect = 2.8e-14, Sum P(2) = 2.8e-14
 Identities = 65/191 (34%), Positives = 92/191 (48%)

Query:   304 RAAYDIPRGP--GYEASKGP--GYDASKAP--SYDPTKGP--SYDPAKGP--GYDPTKGP 353
             R  ++  RGP  G E  + P  G+D  + P   +D  +GP   +D  +GP  G D  +GP
Sbjct:   998 RRGFEEDRGPRRGIEDDRAPRRGFDDDRGPRRGFDDDRGPRRGFDEDRGPRRGIDDDRGP 1057

Query:   354 GYDAQKGSNYDAQRGPN--YDIHRGP--SYDPQRG--LGYDMQRGPN--YDMQRGP--GY 403
                 ++G  +D  R P   +D  RGP   +D  RG   G+D  RGP   ++  RGP  G+
Sbjct:  1058 ----RRG--FDEDRTPRRGFDDDRGPRRGFDDDRGPRRGFDEDRGPRRGFEDDRGPRRGF 1111

Query:   404 ETQRVP--GYDVQRGPV--YEAQRAP--SYIPQRGP--GYDLQRG--QGYDMRRAPSYDP 453
             E  R P  G++  RGP   +E  R P   +   RGP  G+D  RG  +G++  R P    
Sbjct:  1112 EDDRGPRRGFEDDRGPRRGFEDDRGPRRGFEDDRGPRRGFDEDRGPRRGFEDDRGPR--- 1168

Query:   454 SRGTGFDGAPR 464
              RG   D  PR
Sbjct:  1169 -RGFDEDRTPR 1178

 Score = 167 (63.8 bits), Expect = 1.6e-12, Sum P(2) = 1.6e-12
 Identities = 71/225 (31%), Positives = 103/225 (45%)

Query:   310 PRGPGYEASKGP--GYDASKAP--SYDPTKGP--SYDPAKGP--GYDPTKGP--GYDAQK 359
             PR  G++  + P  G+D  + P   +D  +GP   +D  +GP  G++  +GP  G++  +
Sbjct:  1057 PRR-GFDEDRTPRRGFDDDRGPRRGFDDDRGPRRGFDEDRGPRRGFEDDRGPRRGFEDDR 1115

Query:   360 GSN--YDAQRGPN--YDIHRGP--SYDPQRG--LGYDMQRGPN--YDMQRGP--GYETQR 407
             G    ++  RGP   ++  RGP   ++  RG   G+D  RGP   ++  RGP  G++  R
Sbjct:  1116 GPRRGFEDDRGPRRGFEDDRGPRRGFEDDRGPRRGFDEDRGPRRGFEDDRGPRRGFDEDR 1175

Query:   408 VP--GYDVQRGPV--YEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 463
              P  G+D  RGP    +  R  S+   RG G D+ R +G D  R P     RG   D  P
Sbjct:  1176 TPRRGFDDDRGPRRGLDEDRG-SW---RG-GDDVPR-RGADDDRGPR----RGADDDRGP 1225

Query:   464 RGAAPHGQVP--PPLNNVPYG-SATPPARSGS-GQPRGGN-PARR 503
             R      Q P  P   + P G      AR  S G PR    P  R
Sbjct:  1226 RRGEDRDQTPWKPMAASRPGGWREREKAREDSWGPPRDSQAPEER 1270

 Score = 160 (61.4 bits), Expect = 7.7e-08, P = 7.7e-08
 Identities = 107/442 (24%), Positives = 176/442 (39%)

Query:    58 QHVEMQKLATENQRLAATH-GTLRQELAAA----QHELQILH-GQIGGMKSERELQMRNL 111
             + + + K A E QR+       L++E   +    + E  + H  ++  M  ++EL +  L
Sbjct:   705 EEIPLLKKAYEEQRINDMELWELQEEERISTLLLEREKAVEHKNRMSRMVEDKELFVSKL 764

Query:   112 -TEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQQI 170
                + +  EA+LK  +    E + ++ E +      E  +       ++ +R   +  Q+
Sbjct:   765 KASRQSLYEAKLKQFQERLAEEKAARLEERKRERKEERRVNYYRDKEEEEERLREE--QL 822

Query:   171 PALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVE---KLRAELM 227
                  E E +  E        E E++ Y + L+ L+  E+       E+E   + R E  
Sbjct:   823 KQEREEQEKVENEKR------EAEQRDYQERLKKLEEQERKKRQRELEIEERERKREEER 876

Query:   228 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQG---HGPPPSATTAGVVG 284
                +   R D S  G     E E SG   G +  E     P+     G P S        
Sbjct:   877 RGGDDTFRKDSSRWG-----EREESGWRRGADPDERKQVPPERDWRRGGPDSKPVINEDA 931

Query:   285 AG-PNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGY-DASKAPS--YDPTKGP--S 338
             +       +A    +     RA  +    P  +  KG  + D  + P    +  +GP   
Sbjct:   932 SNREEDENAALRKDEEQVSSRAFEEKVSLPDADEEKGGSWRDEDRGPKRGLEEDRGPRRG 991

Query:   339 YDPAKGP--GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGP--SYDPQRG--LGYDMQRG 392
              D A GP  G++  +GP    ++G   D      +D  RGP   +D  RG   G+D  RG
Sbjct:   992 IDDA-GPRRGFEEDRGP----RRGIEDDRAPRRGFDDDRGPRRGFDDDRGPRRGFDEDRG 1046

Query:   393 PN--YDMQRGP--GYETQRVP--GYDVQRGPVYEAQRAPSYIPQRGP--GYDLQRG--QG 442
             P    D  RGP  G++  R P  G+D  RGP    +R   +   RGP  G+D  RG  +G
Sbjct:  1047 PRRGIDDDRGPRRGFDEDRTPRRGFDDDRGP----RRG--FDDDRGPRRGFDEDRGPRRG 1100

Query:   443 YDMRRAP--SYDPSRGT--GFD 460
             ++  R P   ++  RG   GF+
Sbjct:  1101 FEDDRGPRRGFEDDRGPRRGFE 1122

 Score = 87 (35.7 bits), Expect = 1.3e-14, Sum P(2) = 1.3e-14
 Identities = 40/187 (21%), Positives = 81/187 (43%)

Query:    39 FPPFDMMPPPEVMEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIG 98
             F PF    P E +  ++ +    + K       +   H    +E    QH++ I   Q  
Sbjct:   507 FGPFLQNMPSEQIRNQLTAMSCVLSKAVGA---IKPAHVLQEKE---EQHQIAITAYQKN 560

Query:    99 GMKSERELQMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQ 158
               K  + +  R  T +  K   E    +  K E ++ + E Q +  A EE + +  +  +
Sbjct:   561 SRKEHQRILARRQTIEERKERLENLNIQREKEEMEQKEAELQKVRKAEEERLRQEAK-ER 619

Query:   159 DLQRAHTDVQQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATE 218
             + +R   + +QI     + +++R+     + T    K F +  +E+L+ ++ ++I MA +
Sbjct:   620 EKERILQEHEQI-----KKKTVRERLEQIKKTELGAKAFKDIDIENLEELDPDFI-MAKQ 673

Query:   219 VEKLRAE 225
             VE+L  E
Sbjct:   674 VEQLEKE 680

 Score = 54 (24.1 bits), Expect = 3.5e-11, Sum P(2) = 3.5e-11
 Identities = 17/62 (27%), Positives = 32/62 (51%)

Query:   162 RAHTDVQQI--PALLSELE---SLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMA 216
             + H   Q+I  P +L  LE    LR+ +    G Y+Y+      +++SL+ + + Y+ +A
Sbjct:    38 KKHRTWQKIHEPIMLKYLELCVDLRKSHLAKEGLYQYKNICQQVNIKSLEDVVRAYLKLA 97

Query:   217 TE 218
              E
Sbjct:    98 EE 99

 Score = 51 (23.0 bits), Expect = 7.2e-11, Sum P(2) = 7.2e-11
 Identities = 27/120 (22%), Positives = 51/120 (42%)

Query:    50 VMEQKIASQHVEMQKLATENQRLAATHGT-LRQELAAAQHELQILHGQIGGMKSER--EL 106
             V + K + Q +   KL    +RLA      L +     + E ++ + +    + ER  E 
Sbjct:   761 VSKLKASRQSLYEAKLKQFQERLAEEKAARLEERKRERKEERRVNYYRDKEEEEERLREE 820

Query:   107 QMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTD 166
             Q++   E+  K+E E + AE    + +  K E Q     + EL  +  +  ++ +R   D
Sbjct:   821 QLKQEREEQEKVENEKREAEQRDYQERLKKLEEQERKKRQRELEIEERERKREEERRGGD 880


>UNIPROTKB|F1S187
            symbol:LOC100518332 "Uncharacterized protein" species:9823
            "Sus scrofa" [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0000166 "nucleotide binding" evidence=IEA] InterPro:IPR000504
            InterPro:IPR001876 InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641
            PROSITE:PS01358 PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360
            SMART:SM00547 GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330
            GO:GO:0003676 GO:GO:0005622 GeneTree:ENSGT00530000063105
            EMBL:CU896616 Ensembl:ENSSSCT00000019273 OMA:TESSSGX Uniprot:F1S187
        Length = 406

 Score = 201 (75.8 bits), Expect = 3.1e-13, P = 3.1e-13
 Identities = 69/221 (31%), Positives = 84/221 (38%)

Query:   230 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNT 289
             P   R + G + G     E    GR  G+     GYG  +  G      + G  G G + 
Sbjct:   187 PEDSRPSGGDFRGRGYGGERGYRGRG-GRGGDRGGYGGDRSGGGYGGDRSGGG-GYGGDR 244

Query:   290 STSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDP 349
             S   Y   +SG      Y   RG GY   +G GY   +   Y   +   Y   +G GY  
Sbjct:   245 SGGGYGGDRSG----GGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRSGGYGGDRGGGYGG 300

Query:   350 TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQR-GPNY--DMQRGPGYETQ 406
              +G GY   +G  Y   RG  Y   RG  Y   RG GY   R G  Y  D   G GY   
Sbjct:   301 DRGGGYGGDRGGGYGGDRGGGYGGDRG-GYGGDRG-GYGGDRSGGGYGGDRGSGSGYGGD 358

Query:   407 RVPGYDVQR-GPVYEAQRAPSYIPQRGPGYDLQRGQGYDMR 446
             R  GY   R G  Y   R+  Y   RG GY  + G   D R
Sbjct:   359 RSGGYGGDRSGGGYGGDRSGGYGGDRG-GYGGKMGGRNDYR 398

 Score = 170 (64.9 bits), Expect = 9.5e-10, P = 9.5e-10
 Identities = 57/163 (34%), Positives = 65/163 (39%)

Query:   311 RGPGYEASK-GPGY--DASKAPSYDPTK-GPSYDPAK-GPGYDPTKGPGYDAQKGSNYDA 365
             RG GY   + G GY  D S    Y   + G  Y   + G GY   +G GY   +G  Y  
Sbjct:   218 RG-GYGGDRSGGGYGGDRSGGGGYGGDRSGGGYGGDRSGGGYGGDRGGGYGGDRGGGYGG 276

Query:   366 QRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAP 425
              RG  Y   R   Y   RG GY   RG  Y   RG GY   R  GY   RG  Y   R  
Sbjct:   277 DRGGGYGGDRSGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRGG-YGGDRGG 335

Query:   426 SYIPQRGPGY--DLQRGQGYDMRRAPSYDPSR-GTGFDGAPRG 465
                 + G GY  D   G GY   R+  Y   R G G+ G   G
Sbjct:   336 YGGDRSGGGYGGDRGSGSGYGGDRSGGYGGDRSGGGYGGDRSG 378

 Score = 141 (54.7 bits), Expect = 1.6e-06, P = 1.6e-06
 Identities = 50/137 (36%), Positives = 55/137 (40%)

Query:   337 PSYDPAKGPGYDPTKG-PGYDAQKGSN--YDAQR-GPNY--DIHRGPSYDPQR-GLGYDM 389
             PS    +G GY   +G  G   + G    Y   R G  Y  D   G  Y   R G GY  
Sbjct:   192 PSGGDFRGRGYGGERGYRGRGGRGGDRGGYGGDRSGGGYGGDRSGGGGYGGDRSGGGYGG 251

Query:   390 QR-GPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRA 448
              R G  Y   RG GY   R  GY   RG  Y   R+  Y   RG GY   RG GY   R 
Sbjct:   252 DRSGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRSGGYGGDRGGGYGGDRGGGYGGDRG 311

Query:   449 PSYDPSRGTGFDGAPRG 465
               Y   RG G+ G  RG
Sbjct:   312 GGYGGDRGGGY-GGDRG 327


>UNIPROTKB|P11414
            symbol:POLR2A "DNA-directed RNA polymerase II subunit RPB1"
            species:10029 "Cricetulus griseus" [GO:0005634 "nucleus"
            evidence=ISS] [GO:0005665 "DNA-directed RNA polymerase II, core
            complex" evidence=ISS] [GO:0006366 "transcription from RNA
            polymerase II promoter" evidence=ISS] [GO:0006468 "protein
            phosphorylation" evidence=ISS] [GO:0004672 "protein kinase
            activity" evidence=ISS] InterPro:IPR000684 Pfam:PF05001
            PROSITE:PS00115 GO:GO:0003677 GO:GO:0006468 GO:GO:0006366
            GO:GO:0003899 GO:GO:0005665 EMBL:M19538 PIR:A27677
            ProteinModelPortal:P11414 Uniprot:P11414
        Length = 467

 Score = 184 (69.8 bits), Expect = 3.6e-11, P = 3.6e-11
 Identities = 77/263 (29%), Positives = 101/263 (38%)

Query:   242 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTS---TSAYAATQ 298
             GA G S           +A  D  G   G+ P  S T       GP++    +   A + 
Sbjct:    29 GAAGRSGMTPGAAGFSPSAASDASGFSPGYSPAWSPTPGSPGSPGPSSPYIPSPGGAMSP 88

Query:   299 SGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQ 358
             S +P   AY+ PR PG    + P Y  + +PSY PT  PSY P   P Y PT  P Y   
Sbjct:    89 SYSPTSPAYE-PRSPGGYTPQSPSYSPT-SPSYSPTS-PSYSPTS-PNYSPTS-PSYSPT 143

Query:   359 KGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPV 418
               S Y +   P+Y     PSY P     Y     P+Y     P Y     P Y     P 
Sbjct:   144 SPS-Y-SPTSPSYS-PTSPSYSPTSP-SYS-PTSPSYS-PTSPSYSPTS-PSYS-PTSPS 195

Query:   419 YEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGA--APHGQVPPPL 476
             Y +  +PSY P   P Y       Y    +PSY P+  +    +P  +  +P+     P 
Sbjct:   196 Y-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPTSPSYSPTSPSYSPTSPNYSPTSP- 250

Query:   477 NNVPYGSATPPARSGSGQPRGGN 499
             N  P   +  P  S S  P   N
Sbjct:   251 NYTPTSPSYSPT-SPSYSPTSPN 272

 Score = 165 (63.1 bits), Expect = 4.6e-09, P = 4.6e-09
 Identities = 69/236 (29%), Positives = 93/236 (39%)

Query:   228 NAPNVDRRA-DGSYGGATG---NSENETSGRPVGQN-AYEDGYGVPQGHGP--PPSATTA 280
             N P +      G   GA G   ++ ++ SG   G + A+    G P   GP  P   +  
Sbjct:    24 NIPGLGAAGRSGMTPGAAGFSPSAASDASGFSPGYSPAWSPTPGSPGSPGPSSPYIPSPG 83

Query:   281 GVVGAGPNTSTSAYAATQSG--TPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPS 338
             G +    + ++ AY     G  TP   +Y  P  P Y  +  P Y  + +P+Y PT  PS
Sbjct:    84 GAMSPSYSPTSPAYEPRSPGGYTPQSPSYS-PTSPSYSPTS-PSYSPT-SPNYSPTS-PS 139

Query:   339 YDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ 398
             Y P   P Y PT  P Y     S Y +   P+Y     PSY P     Y     P+Y   
Sbjct:   140 YSPTS-PSYSPTS-PSYSPTSPS-Y-SPTSPSYS-PTSPSYSPTSP-SYS-PTSPSYS-P 191

Query:   399 RGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPS 454
               P Y     P Y     P Y +  +PSY P   P Y       Y    +PSY P+
Sbjct:   192 TSPSYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPT 241

 Score = 121 (47.7 bits), Expect = 0.00032, P = 0.00032
 Identities = 63/225 (28%), Positives = 80/225 (35%)

Query:   273 PPPSATTAGVVGAGPNTSTSAYAATQSG-TPMRAAYDIPRGPGYEASKGPGYDAS----- 326
             P  S T+       PN     Y  T    +P   +Y  P  P Y  +  P Y  S     
Sbjct:   257 PSYSPTSPSYSPTSPN-----YTPTSPNYSPTSPSYS-PTSPSYSPTS-PSYSPSSPRYT 309

Query:   327 -KAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL 385
              ++P+Y P+  PSY P+  P Y PT  P Y     S Y     P Y     P Y P    
Sbjct:   310 PQSPTYTPSS-PSYSPSS-PSYSPTS-PKYTPTSPS-YSPS-SPEYT-PTSPKYSPTSPK 363

Query:   386 GYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDM 445
              Y     P Y     P Y +   P Y     P Y +  +P Y P   P Y       Y  
Sbjct:   364 -YS-PTSPKYS-PTSPTY-SPTTPKYS-PTSPTY-SPTSPVYTPT-SPKYS-PTSPTYSP 415

Query:   446 RRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARS 490
               +P Y P+  T    +P+G+      P      P  S T PA S
Sbjct:   416 T-SPKYSPTSPTYSPTSPKGSTYSPTSPGYSPTSPTYSLTSPAIS 459


>WB|WBGene00020550
            symbol:T17H7.1 species:6239 "Caenorhabditis elegans"
            [GO:0019915 "lipid storage" evidence=IMP] [GO:0009792 "embryo
            development ending in birth or egg hatching" evidence=IMP]
            GO:GO:0009792 GO:GO:0019915 InterPro:IPR003677 Pfam:PF02520
            EMBL:FO080638 PIR:T28899 RefSeq:NP_497250.1
            ProteinModelPortal:Q22537 PaxDb:Q22537 EnsemblMetazoa:T17H7.1
            GeneID:175228 KEGG:cel:CELE_T17H7.1 UCSC:T17H7.1 CTD:175228
            WormBase:T17H7.1 eggNOG:NOG271901 GeneTree:ENSGT00700000104820
            HOGENOM:HOG000020548 InParanoid:Q22537 OMA:GRGQGPD NextBio:887312
            Uniprot:Q22537
        Length = 682

 Score = 172 (65.6 bits), Expect = 1.5e-09, P = 1.5e-09
 Identities = 75/273 (27%), Positives = 101/273 (36%)

Query:   235 RADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAY 294
             R DG  G   G  +N   G   G+      +G P  +    +  +      GP++  S  
Sbjct:   229 RGDGP-GFVPGTQDNNQRGS--GERGQRQNFG-PSDNLTNGNQFSKKQFARGPSSMNSDL 284

Query:   295 AATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKG-PGYDPTKGP 353
             +     +   + +D PRGPG    +G G D          +GP + P    PG   + GP
Sbjct:   285 SENSQHSDSNSQFDFPRGPGGRGGRGQGPDFGPGGQGGRGQGPDFGPQDDFPGRRGSGGP 344

Query:   354 GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRG-LGYDMQRGPNYDM--QRG---PGYETQR 407
             G    +G   D +   ++   RG     +RG  G     GP  D   +RG   PG    R
Sbjct:   345 GGRGGRGQGPDFEPQDDFPGRRGSGGPGRRGGRGQGPDFGPQDDFPGRRGSGGPGGRGGR 404

Query:   408 VPGYDVQRGPVYEAQRAPSYIPQRGPGYDL--QRGQGYDMRRAPSYDPSRGTGFDGAPRG 465
               G D   GP  +  R     P  GP  D   +RG G      P     RG   D  P G
Sbjct:   405 GQGPDF--GPGRQGGRGQG--PDFGPQDDFSGRRGSG-----GPGGRGGRGQEPDFGPGG 455

Query:   466 AAPHGQVPP--PLNNVP--YGSATPPARSGSGQ 494
                 GQ P   P ++ P   GS  P  R G GQ
Sbjct:   456 QGGRGQGPDFGPQDDFPGRRGSGGPEGRDGRGQ 488

 Score = 139 (54.0 bits), Expect = 6.0e-06, P = 6.0e-06
 Identities = 76/265 (28%), Positives = 93/265 (35%)

Query:   241 GGATGNSENETSGRPVGQNAYEDG--YGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQ 298
             GG  G  +    G P GQ     G  +G PQ   P    +  G  G G       +   Q
Sbjct:   304 GGRGGRGQGPDFG-PGGQGGRGQGPDFG-PQDDFPGRRGS-GGPGGRGGRGQGPDFEP-Q 359

Query:   299 SGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP-GYDPTKGPGYDA 357
                P R       GPG    +G G D      +   +G      +G  G  P  GPG   
Sbjct:   360 DDFPGRRGSG---GPGRRGGRGQGPDFGPQDDFPGRRGSGGPGGRGGRGQGPDFGPGRQG 416

Query:   358 QKGSNYDAQRGPNYDI--HRGPSYDPQRG-LGYDMQRGPNYDMQRG--PGYETQR-VPGY 411
              +G   D   GP  D    RG      RG  G +   GP     RG  P +  Q   PG 
Sbjct:   417 GRGQGPDF--GPQDDFSGRRGSGGPGGRGGRGQEPDFGPGGQGGRGQGPDFGPQDDFPGR 474

Query:   412 DVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQ 471
                 GP  E +      P  GPG    RGQ  D     ++   RG+G  G  RG  P   
Sbjct:   475 RGSGGP--EGRDGRGQGPDFGPGSQGGRGQDSDSGSQDAFPGRRGSGGPGG-RGQGPDFG 531

Query:   472 VPPPLNNVP--YGSATPPARSGSGQ 494
                P ++ P   GS  P  R G GQ
Sbjct:   532 ---PQDDFPGRRGSGGPEGRDGRGQ 553

 Score = 120 (47.3 bits), Expect = 0.00071, P = 0.00071
 Identities = 72/265 (27%), Positives = 94/265 (35%)

Query:   234 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVP-----QGHGPPPSATTAGVVGAGPN 288
             RR  G  G   G  +    G P        G G P     +G GP       G  G GP+
Sbjct:   365 RRGSGGPGRRGGRGQGPDFG-PQDDFPGRRGSGGPGGRGGRGQGPDFGPGRQGGRGQGPD 423

Query:   289 TSTSA-YAATQ-SGTPM-RAA--YDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAK 343
                   ++  + SG P  R     +   GPG +  +G G D      +   +G      +
Sbjct:   424 FGPQDDFSGRRGSGGPGGRGGRGQEPDFGPGGQGGRGQGPDFGPQDDFPGRRGSGGPEGR 483

Query:   344 -GPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDM--QRG 400
              G G  P  GPG    +G + D+     +   RG      RG G D   GP  D   +RG
Sbjct:   484 DGRGQGPDFGPGSQGGRGQDSDSGSQDAFPGRRGSGGPGGRGQGPDF--GPQDDFPGRRG 541

Query:   401 PGYETQRV---------PGYDVQRGPVYEAQRAPSYIPQRGPGYD--LQ-RGQGYDMRRA 448
              G    R          PG    RG   ++    ++  +RGPG    L  RGQG D    
Sbjct:   542 SGGPEGRDGRGQGPDFGPGSQGGRGQDSDSGSQDAFPGRRGPGGPGGLGGRGQGPDF--G 599

Query:   449 PSYDPSRGTGFDGAPRGAAPHGQVP 473
             P     RG G D   R     GQ P
Sbjct:   600 PGGQGDRGQGPDFGARSQGNRGQGP 624


>FB|FBgn0028573
            symbol:prc "pericardin" species:7227 "Drosophila
            melanogaster" [GO:0005605 "basal lamina" evidence=NAS] [GO:0007507
            "heart development" evidence=IMP;TAS] [GO:0005578 "proteinaceous
            extracellular matrix" evidence=IDA] [GO:0035088 "establishment or
            maintenance of apical/basal cell polarity" evidence=TAS]
            [GO:0016477 "cell migration" evidence=TAS] [GO:0002009
            "morphogenesis of an epithelium" evidence=TAS] GO:GO:0002009
            GO:GO:0007507 GO:GO:0005578 FlyBase:FBgn0028573 InterPro:IPR009765
            Pfam:PF07054 EMBL:AF203342 STRING:Q9U617 PRIDE:Q9U617
            InParanoid:Q9U617 ArrayExpress:Q9U617 Bgee:Q9U617 Uniprot:Q9U617
        Length = 1729

 Score = 171 (65.3 bits), Expect = 6.3e-09, P = 6.3e-09
 Identities = 81/274 (29%), Positives = 98/274 (35%)

Query:   240 YGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGV---VGAG-PNTSTSAYA 295
             YG   G      +G+P G    + G G   G G P   T  G+    GAG P   T    
Sbjct:   417 YGTQPGIGGQTGAGQP-GYGT-QPGIGAQTGAGQPGYGTQPGIGGQTGAGQPGYGTQPGI 474

Query:   296 ATQSGTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKG-PSYDPAKGPGYDPTKG- 352
               Q+G   +  Y    G G +   G PGY +          G P Y    G G     G 
Sbjct:   475 GVQTGAG-QPGYGSQPGIGAQTGAGQPGYGSQPGIGGQTGAGQPGYGSQPGIGGQTGAGQ 533

Query:   353 PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRGPGYETQR-VPG 410
             PGY  Q G    AQ G        P Y  Q G+G     G P Y  Q G G +T    PG
Sbjct:   534 PGYGTQPGIG--AQTGAGQ-----PGYGSQPGIGGQTGAGQPGYGSQPGIGGQTGAGQPG 586

Query:   411 YDVQRGPVYEAQRA-PSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGT-GFDGAPRGAA 467
             Y  Q G   +     P Y  Q G G  +  GQ GY  +         G  G+   P    
Sbjct:   587 YGTQPGVGAQTGTGQPGYGSQPGVGTQIGAGQPGYGSQPGIGGQTGAGQPGYGSQPGIGG 646

Query:   468 PHGQVPPPLNNVPYGSATPPARSGSGQPR-GGNP 500
               G   P     P G     A++G+GQP  G  P
Sbjct:   647 QTGAAQPGYGTQP-GVG---AQTGTGQPGYGAQP 676

 Score = 169 (64.5 bits), Expect = 1.0e-08, P = 1.0e-08
 Identities = 86/271 (31%), Positives = 99/271 (36%)

Query:   240 YGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPS--ATTAGVVGAGPNTSTSAYAAT 297
             YGG  G S     G+P G        G+P G+G  P   A TA V G      T      
Sbjct:   876 YGGQPGISGQTGGGQP-GYGGQATISGLP-GYGTQPGIGALTA-VPGGHYGYETQPGIGG 932

Query:   298 QSGTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYD 356
             Q+GT        P G G +   G PGY     P      G S    + PGY    G G  
Sbjct:   933 QTGTNQPGFGGQP-GIGGQTGAGQPGYGFIGQPGIGGQTGTS---GRQPGYGTQPGIGGQ 988

Query:   357 AQKGS-NYDAQRGPNYDIHRG-PSYDPQRGLGYDMQRG-PNYDMQRGPGYETQR-VPGYD 412
                G   Y +Q G       G P Y  Q G+G  +  G P Y  Q G G +T    PGY 
Sbjct:   989 TAAGQPGYGSQPGIGGQTGAGQPGYGSQTGVGGQIGAGQPGYGSQPGIGGQTGAGQPGYG 1048

Query:   413 VQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQV 472
              Q G  +  Q  P Y  Q  PG   Q G G      P Y    G G  G      P   V
Sbjct:  1049 AQPG--FGGQ--PGYGNQ--PGVGGQTGAGQ-----PGYGSQPGVG--GQTGAGQPGYGV 1095

Query:   473 PPPLNNVP-YGSATPPARSG-SGQPR-GGNP 500
              P     P  G  T   + G  GQP  GG+P
Sbjct:  1096 IPGFGGQPGIGGQTAAGKPGYGGQPGIGGSP 1126

 Score = 154 (59.3 bits), Expect = 4.4e-07, P = 4.4e-07
 Identities = 78/247 (31%), Positives = 90/247 (36%)

Query:   241 GGATGNSENETS-G-RPV--GQNAY-EDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYA 295
             GG TG S  +   G +P   GQ A  + GYG   G G     T AG  G G  T      
Sbjct:   967 GGQTGTSGRQPGYGTQPGIGGQTAAGQPGYGSQPGIG---GQTGAGQPGYGSQTGVGGQI 1023

Query:   296 ATQSGTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPG 354
                +G P    Y    G G +   G PGY A   P +    G    P  G G      PG
Sbjct:  1024 G--AGQP---GYGSQPGIGGQTGAGQPGYGAQ--PGFGGQPGYGNQPGVG-GQTGAGQPG 1075

Query:   355 YDAQKGSNYDAQRG-PNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRG----PGYETQRV 408
             Y +Q G       G P Y +   P +  Q G+G     G P Y  Q G    P Y TQ+ 
Sbjct:  1076 YGSQPGVGGQTGAGQPGYGVI--PGFGGQPGIGGQTAAGKPGYGGQPGIGGSPVYGTQQG 1133

Query:   409 PG--YDVQRG-PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRA-PSYDPSRGTGFDGAP- 463
              G    +  G P Y  Q  P       PGY    G G       P Y P    G  GAP 
Sbjct:  1134 TGGPSGISGGQPGYGTQ--PGQTGAGQPGYGSLPGTGGQATAGQPGYGPGSQPGIGGAPV 1191

Query:   464 RGAAPHG 470
              G  P G
Sbjct:  1192 YGTQPGG 1198

 Score = 151 (58.2 bits), Expect = 9.4e-07, P = 9.4e-07
 Identities = 85/282 (30%), Positives = 100/282 (35%)

Query:   230 PNVDRRADGSYGGATGNSENETS--GRPVGQN-AYEDGYGVPQGHGPPPSATTAGVVGAG 286
             P+  R  D S  G  G  ++  S  G   GQ  A + GYG   G G     T  G  G G
Sbjct:   107 PSSGRILDASGSGGIGRPDSIISLPGGVGGQTGAGQPGYGSQPGIG---GQTATGQPGYG 163

Query:   287 PNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKG-PSYDPAKG 344
                   A A   +G P    Y    G G +   G PGY +          G P Y    G
Sbjct:   164 SQLGVGAQAG--AGQP---GYGAQPGVGAQTGAGQPGYGSQTGIGGQTGAGQPGYGSQPG 218

Query:   345 PGYDPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRGPG 402
              G     G PGY +Q G     Q G        P Y  Q G+G     G P Y  Q G G
Sbjct:   219 IGGQTGAGQPGYGSQPGIG--GQTGAGQ-----PGYGSQPGIGGQTGAGQPGYGSQPGIG 271

Query:   403 YETQR-VPGYDVQRGPVYEAQRA-PSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGTGF 459
              +T    PGY  Q G   +     P Y  Q G G     GQ GY  +  P      G G 
Sbjct:   272 GQTGAGQPGYGSQPGIGGQTGAGQPGYGTQPGIGGQTGAGQPGYGSQ--PGIGGQTGAGQ 329

Query:   460 DGAPRGAAPHGQVPPPLNNVPYGSATPPA---RSGSGQPRGG 498
              G        GQ         YG  T P    ++G+GQP  G
Sbjct:   330 PGYGSQPGIGGQTGA--GQPGYG--TQPGIGGQTGAGQPGYG 367

 Score = 142 (55.0 bits), Expect = 8.9e-06, P = 8.9e-06
 Identities = 85/297 (28%), Positives = 102/297 (34%)

Query:   230 PNVDRRADGS---YGGATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGA 285
             P +  +  G    YGG    S     G   G  A      VP GH G        G  G 
Sbjct:   880 PGISGQTGGGQPGYGGQATISGLPGYGTQPGIGALT---AVPGGHYGYETQPGIGGQTGT 936

Query:   286 G-PNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKG 344
               P          Q+G   +  Y     PG     G    + + P Y    G     A G
Sbjct:   937 NQPGFGGQPGIGGQTGAG-QPGYGFIGQPGIGGQTGT---SGRQPGYGTQPGIGGQTAAG 992

Query:   345 -PGYDPTKGPGYDAQKGS-NYDAQRGPNYDIHRG-PSYDPQRGLGYDMQRG-PNYDMQRG 400
              PGY    G G     G   Y +Q G    I  G P Y  Q G+G     G P Y  Q G
Sbjct:   993 QPGYGSQPGIGGQTGAGQPGYGSQTGVGGQIGAGQPGYGSQPGIGGQTGAGQPGYGAQPG 1052

Query:   401 ----PGYETQRVPGYDVQRG---PVYEAQRAPSYIPQRG---PGYDL------QRGQGYD 444
                 PGY  Q  PG   Q G   P Y +Q  P    Q G   PGY +      Q G G  
Sbjct:  1053 FGGQPGYGNQ--PGVGGQTGAGQPGYGSQ--PGVGGQTGAGQPGYGVIPGFGGQPGIGGQ 1108

Query:   445 MRRA-PSYDPSRGTGFDGAPRGAAPHGQVPPP-LNNVPYGSATPPARSGSGQPRGGN 499
                  P Y    G G  G+P      G   P  ++    G  T P ++G+GQP  G+
Sbjct:  1109 TAAGKPGYGGQPGIG--GSPVYGTQQGTGGPSGISGGQPGYGTQPGQTGAGQPGYGS 1163


>UNIPROTKB|P71590
            symbol:fhaA "FHA domain-containing protein FhaA"
            species:1773 "Mycobacterium tuberculosis" [GO:0005618 "cell wall"
            evidence=IDA] [GO:0005829 "cytosol" evidence=IDA]
            InterPro:IPR000253 InterPro:IPR008984 Pfam:PF00498 PROSITE:PS50006
            SMART:SM00240 GO:GO:0005829 GO:GO:0005618 GenomeReviews:AL123456_GR
            EMBL:BX842572 Gene3D:2.60.200.20 SUPFAM:SSF49879 PIR:B70700
            RefSeq:NP_214534.1 RefSeq:YP_006513334.1 PDB:2LC0 PDB:2LC1 PDB:3OUN
            PDB:3PO8 PDB:3POA PDBsum:2LC0 PDBsum:2LC1 PDBsum:3OUN PDBsum:3PO8
            PDBsum:3POA ProteinModelPortal:P71590 SMR:P71590 DIP:DIP-59047N
            PhosSite:P12071703 PRIDE:P71590 EnsemblBacteria:EBMYCT00000001781
            GeneID:13315997 GeneID:887067 KEGG:mtu:Rv0020c KEGG:mtv:RVBD_0020c
            PATRIC:18148538 TubercuList:Rv0020c HOGENOM:HOG000235804
            OMA:DQGYGQP ProtClustDB:CLSK790198 EvolutionaryTrace:P71590
            InterPro:IPR022128 Pfam:PF12401 Uniprot:P71590
        Length = 527

 Score = 162 (62.1 bits), Expect = 1.2e-08, P = 1.2e-08
 Identities = 84/244 (34%), Positives = 98/244 (40%)

Query:   274 PPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKA-PSYD 332
             P   T   V+      S  A+ A     PM        G G +      YD   A P  D
Sbjct:   127 PDVETHPPVIDCARPQSNHAFGAEPGVAPMSDNSSYRGGQG-QGRPDEYYDDRYARPQED 185

Query:   333 PTKGPSYDPAKGP--GYDPTKGPGYDAQKGSNYDAQRGPNY-DIHRGPSYDPQRGLGYDM 389
             P  GP       P  GY P  G GY  Q G  Y   R P+  D      Y P +G GY  
Sbjct:   186 PRGGPDPQGGSDPRGGYPPETG-GYPPQPG--YPRPRHPDQGDYPEQIGY-PDQG-GYPE 240

Query:   390 QRGPNYDMQRG-P---GYETQRVPGY-DVQRG---PVYEAQRAP-SYIPQRG---PGYDL 437
             QRG  Y  QRG P   GY+ Q   GY D  +G   P YE QR P S  P  G   PGYD 
Sbjct:   241 QRG--YPEQRGYPDQRGYQDQG-RGYPDQGQGGYPPPYE-QRPPVSPGPAAGYGAPGYD- 295

Query:   438 QRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNN---VPYGSATPPARSGSGQ 494
                QGY  R++  Y PS G G  G   G   +G+ P        VP G   PP +  +  
Sbjct:   296 ---QGY--RQSGGYGPSPGGGQPGYG-GYGEYGRGPARHEEGSYVPSGPPGPPEQRPAYP 349

Query:   495 PRGG 498
              +GG
Sbjct:   350 DQGG 353

 Score = 120 (47.3 bits), Expect = 0.00050, P = 0.00050
 Identities = 92/303 (30%), Positives = 111/303 (36%)

Query:   230 PNVDRRADGS-YGGATGNSENETSGRPVGQNAYEDGYGVPQGH---GPPPSATTAGVVGA 285
             P V   +D S Y G  G       GRP     Y+D Y  PQ     GP P   +    G 
Sbjct:   151 PGVAPMSDNSSYRGGQGQ------GRP--DEYYDDRYARPQEDPRGGPDPQGGSDPRGGY 202

Query:   286 GPNTSTSAYAATQSGTPMRAAY----DIPRGPGYEASKG-P---GYDASKAPSYDPTKGP 337
              P T    Y   Q G P R  +    D P   GY    G P   GY   +   Y   +G 
Sbjct:   203 PPETG--GYPP-QPGYP-RPRHPDQGDYPEQIGYPDQGGYPEQRGYPEQRG--YPDQRG- 255

Query:   338 SYDPAKGPGYDPTKGPG-YDAQKGSNYDAQRGPNYDIHRGPSYDP---QRGLGYDMQRG- 392
              Y   +G GY P +G G Y            GP    +  P YD    Q G GY    G 
Sbjct:   256 -YQD-QGRGY-PDQGQGGYPPPYEQRPPVSPGPAAG-YGAPGYDQGYRQSG-GYGPSPGG 310

Query:   393 --PNY----DMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMR 446
               P Y    +  RGP    +   G  V  GP    ++ P+Y P +G GYD    QG    
Sbjct:   311 GQPGYGGYGEYGRGPARHEE---GSYVPSGPPGPPEQRPAY-PDQG-GYDQGYQQGATTY 365

Query:   447 RAPSYDPSRG-TGFDGAPR--GAAPHG--QVPPPLNNVPYG-SATP----PARSG-SGQP 495
                 Y      T +  +PR  G AP G     P   +  YG S  P    PA  G SG  
Sbjct:   366 GRQDYGGGADYTRYTESPRVPGYAPQGGGYAEPAGRDYDYGQSGAPDYGQPAPGGYSGYG 425

Query:   496 RGG 498
             +GG
Sbjct:   426 QGG 428


>UNIPROTKB|Q92804
            symbol:TAF15 "TATA-binding protein-associated factor 2N"
            species:9606 "Homo sapiens" [GO:0000166 "nucleotide binding"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0003677 "DNA binding" evidence=IEA] [GO:0003723 "RNA binding"
            evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005634 "nucleus" evidence=IDA] [GO:0005730 "nucleolus"
            evidence=IDA] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0045893
            "positive regulation of transcription, DNA-dependent" evidence=TAS]
            InterPro:IPR000504 InterPro:IPR001876 InterPro:IPR012677
            Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102
            PROSITE:PS50199 SMART:SM00360 SMART:SM00547 GO:GO:0005634
            GO:GO:0005737 GO:GO:0045893 GO:GO:0000166 GO:GO:0046872
            GO:GO:0003677 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003723
            EMBL:CH471147 eggNOG:NOG240581 HOGENOM:HOG000038010 EMBL:AC015849
            EMBL:U51334 EMBL:X98893 EMBL:AB010067 EMBL:AY197697 EMBL:AK313223
            IPI:IPI00020194 IPI:IPI00294426 PIR:S71954 RefSeq:NP_003478.1
            RefSeq:NP_631961.1 UniGene:Hs.402752 ProteinModelPortal:Q92804
            SMR:Q92804 IntAct:Q92804 STRING:Q92804 PhosphoSite:Q92804
            DMDM:8928305 PaxDb:Q92804 PRIDE:Q92804 DNASU:8148
            Ensembl:ENST00000311979 GeneID:8148 KEGG:hsa:8148 UCSC:uc002hkc.3
            UCSC:uc002hkd.3 CTD:8148 GeneCards:GC17P034136 HGNC:HGNC:11547
            HPA:HPA052059 MIM:601574 neXtProt:NX_Q92804 PharmGKB:PA36322
            HOVERGEN:HBG005755 InParanoid:Q92804 KO:K14651 OMA:YGNQGSQ
            OrthoDB:EOG4MW872 PhylomeDB:Q92804 ChiTaRS:TAF15 GenomeRNAi:8148
            NextBio:30819 PMAP-CutDB:Q92804 ArrayExpress:Q92804 Bgee:Q92804
            CleanEx:HS_TAF15 Genevestigator:Q92804 GermOnline:ENSG00000172660
            Uniprot:Q92804
        Length = 592

 Score = 159 (61.0 bits), Expect = 3.2e-08, P = 3.2e-08
 Identities = 68/220 (30%), Positives = 83/220 (37%)

Query:   230 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNT 289
             P   R + G + G     E    GR  G+     GYG  +  G      ++G  G   + 
Sbjct:   384 PEDSRPSGGDFRGRGYGGERGYRGRG-GRGGDRGGYGGDRSGGGYGGDRSSGG-GYSGDR 441

Query:   290 STSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDP 349
             S   Y   +SG      Y   RG GY   +G GY   +   Y   +G  Y   +G GY  
Sbjct:   442 SGGGYGGDRSG----GGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRG-GYGGDRGGGYGG 496

Query:   350 TKGPGYDAQKGSNYDAQRGPNYDIHRGPSY--DPQRGLGYDMQRGPNYDMQRGPGYETQR 407
              +G GY   +G  Y   RG  Y   RG  Y  D  RG GY   RG       G GY   R
Sbjct:   497 DRG-GYGGDRGG-YGGDRG-GYGGDRG-GYGGDRSRG-GYGGDRGG------GSGYGGDR 545

Query:   408 VPGYDVQR-GPVYEAQRAPSYIPQRGPGYDLQRGQGYDMR 446
               GY   R G  Y   R   Y   RG GY  + G   D R
Sbjct:   546 SGGYGGDRSGGGYGGDRGGGYGGDRG-GYGGKMGGRNDYR 584

 Score = 153 (58.9 bits), Expect = 1.9e-08, Sum P(2) = 1.9e-08
 Identities = 60/164 (36%), Positives = 68/164 (41%)

Query:   311 RGPGYEASK-GPGY--DASKAPSYDPTK-GPSYDPAK-GPGYDPTKGPGYDAQKGSNYDA 365
             RG GY   + G GY  D S    Y   + G  Y   + G GY   +G GY   +G  Y  
Sbjct:   415 RG-GYGGDRSGGGYGGDRSSGGGYSGDRSGGGYGGDRSGGGYGGDRGGGYGGDRGGGYGG 473

Query:   366 QRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAP 425
              RG  Y   RG  Y   RG GY   RG  Y   RG GY   R  GY   RG  Y   R+ 
Sbjct:   474 DRGGGYGGDRG-GYGGDRGGGYGGDRG-GYGGDRG-GYGGDR-GGYGGDRGG-YGGDRSR 528

Query:   426 S-YIPQRG--PGYDLQRGQGYDMRRAPS-YDPSRGTGFDGAPRG 465
               Y   RG   GY   R  GY   R+   Y   RG G+ G  RG
Sbjct:   529 GGYGGDRGGGSGYGGDRSGGYGGDRSGGGYGGDRGGGY-GGDRG 571

 Score = 53 (23.7 bits), Expect = 1.9e-08, Sum P(2) = 1.9e-08
 Identities = 21/96 (21%), Positives = 40/96 (41%)

Query:   188 RGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRRADGSYGGATGNS 247
             +  Y+ +   Y+ + +S     +NY   +   +  R ++      +R   GS GG  G  
Sbjct:   132 QSNYDQQHDSYSQNQQSYHSQRENY---SHHTQDDRRDVSRYGEDNRGYGGSQGGGRGRG 188

Query:   248 ENETSGR-PV-GQNAYEDG----YGVPQGHGPPPSA 277
               +  GR P+ G +  + G    +G  + +GP   A
Sbjct:   189 GYDKDGRGPMTGSSGGDRGGFKNFGGHRDYGPRTDA 224


>UNIPROTKB|F1PB61
            symbol:TAF15 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0005622 "intracellular" evidence=IEA] [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0000166 "nucleotide binding"
            evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358
            PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360 SMART:SM00547
            GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
            GO:GO:0005622 GeneTree:ENSGT00530000063105 CTD:8148 KO:K14651
            OMA:YGNQGSQ EMBL:AAEX03006620 EMBL:AAEX03006619 RefSeq:XP_548255.2
            ProteinModelPortal:F1PB61 Ensembl:ENSCAFT00000028877 GeneID:491135
            KEGG:cfa:491135 Uniprot:F1PB61
        Length = 571

 Score = 160 (61.4 bits), Expect = 2.3e-08, P = 2.3e-08
 Identities = 70/240 (29%), Positives = 87/240 (36%)

Query:   234 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQ-GHGPPPSATTAGVVGAGPNTSTS 292
             RR +   GG +G       G   G+  ++   G P+ G    P+ +   +  A  N+   
Sbjct:   319 RRPEFMRGGGSGGGRRGRGGYR-GRGGFQGRGGDPKSGDWVCPNPSCGNMNFARRNSCNQ 377

Query:   293 AYAAT-QSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGY--DP 349
                   +   P    +   RG GY   +G  Y        D   G   D + G GY  D 
Sbjct:   378 CNEPRPEDSRPSGGDF---RGRGYGGERG--YRGRGGRGGD-RGGYGADRSSG-GYGGDR 430

Query:   350 TKGPGYDAQK-GSNYDAQR-GPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQR 407
             + G GY   + G  Y   R G  Y   RG  Y   RG GY   RG  Y   RG GY   R
Sbjct:   431 SGGGGYGGDRSGGGYGGDRSGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDR 490

Query:   408 VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQG--YDMRRAPSYDPSRGTGFDGAPRG 465
               GY   RG  Y   R      + G GY   RG G  Y   R   Y   R  G  G  RG
Sbjct:   491 GGGYGGDRGGGYGGDRGGYGGDRSGGGYGGDRGGGGGYGGDRGGGYGGDRSGGGYGGDRG 550

 Score = 145 (56.1 bits), Expect = 2.0e-08, Sum P(2) = 2.0e-08
 Identities = 52/152 (34%), Positives = 61/152 (40%)

Query:   304 RAAYDIPR---GPGYEASKGPGYDASKAPS-YDPTK-GPSYDPAKGPGYDPTKGPGYDAQ 358
             R  Y   R   G G + S G GY   ++   Y   + G  Y   +G GY   +G GY   
Sbjct:   414 RGGYGADRSSGGYGGDRSGGGGYGGDRSGGGYGGDRSGGGYGGDRGGGYGGDRGGGYGGD 473

Query:   359 KGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQR-GPGYETQRVPG--YDVQR 415
             +G  Y   RG  Y   RG  Y   RG GY   RG  Y   R G GY   R  G  Y   R
Sbjct:   474 RGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRG-GYGGDRSGGGYGGDRGGGGGYGGDR 532

Query:   416 GPVYEAQRAPS-YIPQRGPGYDLQRGQGYDMR 446
             G  Y   R+   Y   RG GY  + G   D R
Sbjct:   533 GGGYGGDRSGGGYGGDRG-GYGGKMGGRNDYR 563

 Score = 139 (54.0 bits), Expect = 9.0e-08, Sum P(2) = 9.0e-08
 Identities = 68/219 (31%), Positives = 76/219 (34%)

Query:   257 GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYE 316
             G+  Y  G G  QG G  P +     V   P+     +A   S             P   
Sbjct:   335 GRGGYR-GRGGFQGRGGDPKS--GDWVCPNPSCGNMNFARRNSCNQCNEPRPEDSRPSGG 391

Query:   317 ASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGY--DAQKGSNYDAQR-GPNYDI 373
               +G GY   +   Y    G   D   G G D + G GY  D   G  Y   R G  Y  
Sbjct:   392 DFRGRGYGGERG--YRGRGGRGGDRG-GYGADRSSG-GYGGDRSGGGGYGGDRSGGGYGG 447

Query:   374 HR-GPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG 432
              R G  Y   RG GY   RG  Y   RG GY   R  GY   RG  Y   R   Y   RG
Sbjct:   448 DRSGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRG 507

Query:   433 PGYDLQR-GQGYDMRRAPSYDPSRGTGFDGAPRGAAPHG 470
              GY   R G GY        D   G G+ G  RG    G
Sbjct:   508 -GYGGDRSGGGY------GGDRGGGGGY-GGDRGGGYGG 538

 Score = 121 (47.7 bits), Expect = 0.00043, P = 0.00043
 Identities = 48/167 (28%), Positives = 62/167 (37%)

Query:   235 RADGSYGGATGN-SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSA 293
             R  G  GG  G    + +SG   G  +   GYG  +  G      + G  G G +     
Sbjct:   405 RGRGGRGGDRGGYGADRSSGGYGGDRSGGGGYGGDRSGGGYGGDRSGG--GYGGDRG-GG 461

Query:   294 YAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK-G 352
             Y   + G      Y   RG GY   +G GY   +   Y   +G  Y   +G GY   + G
Sbjct:   462 YGGDRGG-----GYGGDRGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRG-GYGGDRSG 515

Query:   353 PGYDAQKGSN--YDAQRGPNYDIHR-GPSYDPQRGLGYDMQRGPNYD 396
              GY   +G    Y   RG  Y   R G  Y   RG GY  + G   D
Sbjct:   516 GGYGGDRGGGGGYGGDRGGGYGGDRSGGGYGGDRG-GYGGKMGGRND 561

 Score = 61 (26.5 bits), Expect = 2.0e-08, Sum P(2) = 2.0e-08
 Identities = 25/106 (23%), Positives = 44/106 (41%)

Query:   184 YHHCRGTYEYEKKF------YNDHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRRAD 237
             Y   +G+Y+ +  +      YN + +S      NY   +   +  R ++      +R   
Sbjct:   121 YDQHQGSYDEQSNYGPQHDSYNQNQQSYHSQRDNY---SHHTQDDRRDVSRYGEDNRGYG 177

Query:   238 GSYGGATGNSENETSGR-PV-GQNAYEDG----YGVPQGHGPPPSA 277
             GS GG  G    +  GR P+ G +  + G    +G  + +GP P A
Sbjct:   178 GSQGGGRGRGGYDKDGRGPMTGSSGGDRGGFKNFGGHRDYGPRPDA 223


>WB|WBGene00044109
            symbol:K02E11.10 species:6239 "Caenorhabditis elegans"
            [GO:0016021 "integral to membrane" evidence=IEA] EMBL:Z77665
            RefSeq:NP_001024024.1 ProteinModelPortal:Q5FC49
            EnsemblMetazoa:K02E11.10 GeneID:259661 KEGG:cel:CELE_K02E11.10
            UCSC:K02E11.10 CTD:259661 WormBase:K02E11.10
            GeneTree:ENSGT00530000065030 InParanoid:Q5FC49 OMA:VQASGYQ
            NextBio:952394 Uniprot:Q5FC49
        Length = 360

 Score = 154 (59.3 bits), Expect = 4.4e-08, P = 4.4e-08
 Identities = 69/224 (30%), Positives = 91/224 (40%)

Query:   264 GYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPG 322
             G+G   G    P A   G+ G  G      A+     G     A     G G     G G
Sbjct:    81 GFGGAGGSYAAP-ALGGGLGGFGGAPAPAPAFGGLGGGYQAAPALGGGLGGGLGGGPGGG 139

Query:   323 YDASKAPSYDPTKGPSYDPA---KGPGYD--PTKGPGYDAQKGSNYDAQRGP---NYDIH 374
             Y A+ A        P+  PA    G GY   PT G G  AQ G+ Y  Q+GP    +   
Sbjct:   140 YQAAPALQLPGLGAPA--PAFGGLGGGYQGAPTLGGG-QAQGGAGY--QQGPAQGRFVAQ 194

Query:   375 RGPSYDPQRGLGYDMQRGP---NYDMQRGPGYETQRVPGYDVQRGPV---YEAQRAPSYI 428
             +G +   Q G GY  Q+GP    +  Q+GP    Q   GY  Q+GP    + AQ+ P+  
Sbjct:   195 QGSAQGVQGGAGY--QQGPAQGGFTAQQGPAQVVQGGAGY--QQGPAQGGFVAQQGPAPA 250

Query:   429 PQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG-AAPHGQ 471
              Q G GY     QG     A     ++G G+  A +G +AP  Q
Sbjct:   251 AQGGAGYQQGSTQGGFEAVAQQGQVAQGAGYQSAAQGQSAPVSQ 294


>DICTYBASE|DDB_G0277909
            symbol:cbpP "calcium-binding protein" species:44689
            "Dictyostelium discoideum" [GO:0005509 "calcium ion binding"
            evidence=IEA] [GO:0046872 "metal ion binding" evidence=IEA]
            [GO:0044351 "macropinocytosis" evidence=RCA] InterPro:IPR002048
            InterPro:IPR011992 Pfam:PF13499 PROSITE:PS50222 SMART:SM00054
            dictyBase:DDB_G0277909 Prosite:PS00018 GenomeReviews:CM000152_GR
            EMBL:AAFI02000023 GO:GO:0005509 Gene3D:1.10.238.10
            InterPro:IPR018247 EMBL:U03413 RefSeq:XP_642080.1
            ProteinModelPortal:P35085 PRIDE:P35085 EnsemblProtists:DDB0214957
            GeneID:8621293 KEGG:ddi:DDB_G0277909 eggNOG:NOG135385 OMA:MGAYPPQ
            ProtClustDB:CLSZ2846833 Uniprot:P35085
        Length = 467

 Score = 155 (59.6 bits), Expect = 5.8e-08, P = 5.8e-08
 Identities = 73/247 (29%), Positives = 89/247 (36%)

Query:   268 PQGHGPPPSATTAGVVGAGPNT--STSAYAATQS--GTPMRAAYDIPRGPGYEASKGPGY 323
             PQ   PPP+ + A      P     T     +QS  G P       P+ PG   S  P Y
Sbjct:     4 PQN--PPPAGSAADFYSQMPVKVMGTPGAPGSQSTPGAPGAPGQYPPQQPGAPGSNLPPY 61

Query:   324 DASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRG-PNYDIHRGPSYDPQ 382
               ++ P      G  Y P + PG  P + PG   Q       Q G P     +   Y PQ
Sbjct:    62 PGTQQPGAPGAPG-QYPPQQ-PGQYPPQQPGAPGQYPPQQPGQPGYPPQQPGQSGQYPPQ 119

Query:   383 R-GL-GYDMQR--GPN-YDMQRG-PGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYD 436
             + G  GY  Q+   P  Y  Q+G PG    + PG   Q  P  + Q  P    Q G    
Sbjct:   120 QPGQPGYPPQQPGAPGQYPPQQGQPGQYPPQQPGQPGQYPPQQQGQYPPQQPGQPGAYPP 179

Query:   437 LQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPP---ARSGSG 493
              Q GQ        +Y P +G     A  GA     VPPP    P     PP   A  G  
Sbjct:   180 QQSGQ------PGAYPPQQGVQNTLAKTGAPGQPGVPPPQGAYPGQPGVPPQQGAYPGQQ 233

Query:   494 QPRGGNP 500
              P G  P
Sbjct:   234 PPMGAYP 240

 Score = 139 (54.0 bits), Expect = 3.4e-06, P = 3.4e-06
 Identities = 79/251 (31%), Positives = 98/251 (39%)

Query:   272 GPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSY 331
             G P S +T G  GA P      Y   Q G P     ++P  PG +    PG      P  
Sbjct:    29 GAPGSQSTPGAPGA-PGQ----YPPQQPGAP---GSNLPPYPGTQQPGAPGAPGQYPPQ- 79

Query:   332 DPTKGPSYDPAKGPG-YDPTK-G-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQR-GL-G 386
              P + P   P   PG Y P + G PGY  Q+      Q  P       P Y PQ+ G  G
Sbjct:    80 QPGQYPPQQPG-APGQYPPQQPGQPGYPPQQPGQ-SGQYPPQQPGQ--PGYPPQQPGAPG 135

Query:   387 -YDMQRG-PNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG--PG-YDLQRGQ 441
              Y  Q+G P     + PG   Q  P    Q  P    Q   +Y PQ+   PG Y  Q+G 
Sbjct:   136 QYPPQQGQPGQYPPQQPGQPGQYPPQQQGQYPPQQPGQPG-AYPPQQSGQPGAYPPQQGV 194

Query:   442 GYDMRRA-----PSYDPSRGT--GFDGAP--RGAAPHGQVPPPLNNVPYGSATPPARSGS 492
                + +      P   P +G   G  G P  +GA P GQ PP     P G   P A    
Sbjct:   195 QNTLAKTGAPGQPGVPPPQGAYPGQPGVPPQQGAYP-GQQPPMGAYPPQGQ--PGAYPPQ 251

Query:   493 GQPRGGNPARR 503
             GQP G  P ++
Sbjct:   252 GQP-GAYPPQQ 261

 Score = 133 (51.9 bits), Expect = 1.5e-05, P = 1.5e-05
 Identities = 83/276 (30%), Positives = 101/276 (36%)

Query:   242 GATGNSENETSGRPVGQNAYEDGY-GVPQGHGPP-PSATTAGVVGA-G--PNTSTSAYAA 296
             GA G+    T G P     Y     G P  + PP P     G  GA G  P      Y  
Sbjct:    29 GAPGSQS--TPGAPGAPGQYPPQQPGAPGSNLPPYPGTQQPGAPGAPGQYPPQQPGQYPP 86

Query:   297 TQSGTPMRAAYDIPRGPGYEASKGPG----YDASKA--PSYDPTK--GPS-YDPAKG-PG 346
              Q G P +     P  PGY   + PG    Y   +   P Y P +   P  Y P +G PG
Sbjct:    87 QQPGAPGQYPPQQPGQPGYPPQQ-PGQSGQYPPQQPGQPGYPPQQPGAPGQYPPQQGQPG 145

Query:   347 YDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQR-GL--GYDMQRGPNYDMQRGPGY 403
               P + PG   Q       Q  P      G +Y PQ+ G    Y  Q+G    + +  G 
Sbjct:   146 QYPPQQPGQPGQYPPQQQGQYPPQQPGQPG-AYPPQQSGQPGAYPPQQGVQNTLAK-TGA 203

Query:   404 ETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGA- 462
               Q  PG    +G  Y  Q  P   PQ+G  Y    GQ   M    +Y P    G  GA 
Sbjct:   204 PGQ--PGVPPPQG-AYPGQ--PGVPPQQG-AYP---GQQPPMG---AYPPQ---GQPGAY 248

Query:   463 PRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 498
             P    P G  PP    V Y    PP   G+  P+ G
Sbjct:   249 PPQGQP-GAYPPQQQQVAYPGQQPPM--GAYPPQQG 281


>FB|FBgn0050203
            symbol:CG30203 species:7227 "Drosophila melanogaster"
            [GO:0004867 "serine-type endopeptidase inhibitor activity"
            evidence=IEA] InterPro:IPR002223 Pfam:PF00014 PROSITE:PS50279
            SMART:SM00131 EMBL:AE013599 GO:GO:0004867 Gene3D:4.10.410.10
            SUPFAM:SSF57362 InterPro:IPR000884 Pfam:PF00090 SMART:SM00209
            SUPFAM:SSF82895 PROSITE:PS50092 InterPro:IPR002861 Pfam:PF02014
            PROSITE:PS51019 GeneTree:ENSGT00640000091268 InterPro:IPR009465
            Pfam:PF06468 PROSITE:PS51020 EMBL:BT023853 RefSeq:NP_725128.2
            UniGene:Dm.23753 SMR:Q3ZAL6 EnsemblMetazoa:FBtr0273303
            GeneID:246514 KEGG:dme:Dmel_CG30203 FlyBase:FBgn0050203
            eggNOG:NOG244582 OMA:KWARNTH OrthoDB:EOG43R22N GenomeRNAi:246514
            NextBio:842774 Uniprot:Q3ZAL6
        Length = 924

 Score = 157 (60.3 bits), Expect = 9.8e-08, P = 9.8e-08
 Identities = 39/105 (37%), Positives = 49/105 (46%)

Query:   304 RAAYDIP--RGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGS 361
             R +YD    RG  Y+ + G  Y  ++  SYD   G SYD   G  Y  T G  YD  +  
Sbjct:   793 RRSYDQKQTRGRPYDQTSGIAYGQTEGRSYDQKVGKSYDRTFGRSYKQTGGGSYDQPEDR 852

Query:   362 NYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY-ET 405
             +YD   G +Y      SYD  RG  YD   G +YD+  G  Y ET
Sbjct:   853 SYDLSTGRSYVQPEDRSYDLSRGRSYDQPVGRSYDLAGGRSYGET 897

 Score = 153 (58.9 bits), Expect = 2.7e-07, P = 2.7e-07
 Identities = 46/148 (31%), Positives = 60/148 (40%)

Query:   316 EASKGPGYDASKAPSYDP--TKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDI 373
             E S+    D     SYD   T+G  YD   G  Y  T+G  YD + G +YD   G +Y  
Sbjct:   781 ERSENDAMDLYGRRSYDQKQTRGRPYDQTSGIAYGQTEGRSYDQKVGKSYDRTFGRSYKQ 840

Query:   374 HRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGP 433
               G SYD      YD+  G +Y       Y+  R   YD   G  Y+     SY      
Sbjct:   841 TGGGSYDQPEDRSYDLSTGRSYVQPEDRSYDLSRGRSYDQPVGRSYDLAGGRSYGETSEA 900

Query:   434 GYDLQRGQGYDMRRAPSYDPSRGTGFDG 461
             G D+  G+     R+  YD SR   + G
Sbjct:   901 G-DI--GEPMSQTRS-RYDTSRRGRYGG 924

 Score = 134 (52.2 bits), Expect = 3.1e-05, P = 3.1e-05
 Identities = 36/111 (32%), Positives = 45/111 (40%)

Query:   355 YDAQ--KGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYD 412
             YD +  +G  YD   G  Y    G SYD + G  YD   G +Y    G  Y+      YD
Sbjct:   796 YDQKQTRGRPYDQTSGIAYGQTEGRSYDQKVGKSYDRTFGRSYKQTGGGSYDQPEDRSYD 855

Query:   413 VQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 463
             +  G  Y      SY   RG  YD   G+ YD+    SY  +   G  G P
Sbjct:   856 LSTGRSYVQPEDRSYDLSRGRSYDQPVGRSYDLAGGRSYGETSEAGDIGEP 906

 Score = 123 (48.4 bits), Expect = 0.00049, P = 0.00049
 Identities = 38/119 (31%), Positives = 52/119 (43%)

Query:   289 TSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYD 348
             TS  AY  T+       +YD   G  Y+ + G  Y  +   SYD  +  SYD + G  Y 
Sbjct:   809 TSGIAYGQTEG-----RSYDQKVGKSYDRTFGRSYKQTGGGSYDQPEDRSYDLSTGRSYV 863

Query:   349 PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDP--QRG-LGYDM-QRGPNYDMQRGPGY 403
               +   YD  +G +YD   G +YD+  G SY    + G +G  M Q    YD  R   Y
Sbjct:   864 QPEDRSYDLSRGRSYDQPVGRSYDLAGGRSYGETSEAGDIGEPMSQTRSRYDTSRRGRY 922


>WB|WBGene00005015
            symbol:spt-5 species:6239 "Caenorhabditis elegans"
            [GO:0032968 "positive regulation of transcription elongation from
            RNA polymerase II promoter" evidence=IEA] [GO:0006357 "regulation
            of transcription from RNA polymerase II promoter" evidence=IEA]
            [GO:0032784 "regulation of DNA-dependent transcription, elongation"
            evidence=IEA] [GO:0009792 "embryo development ending in birth or
            egg hatching" evidence=IMP] [GO:0040007 "growth" evidence=IMP]
            [GO:0002119 "nematode larval development" evidence=IMP] [GO:0040035
            "hermaphrodite genitalia development" evidence=IMP] [GO:0000003
            "reproduction" evidence=IMP] InterPro:IPR006645 InterPro:IPR017071
            InterPro:IPR024945 PIRSF:PIRSF036945 SMART:SM00738 SMART:SM01104
            Pfam:PF00467 GO:GO:0009792 GO:GO:0040007 GO:GO:0002119
            GO:GO:0006351 GO:GO:0040035 GO:GO:0032968 EMBL:Z68752
            InterPro:IPR008991 SUPFAM:SSF50104 InterPro:IPR005824 SMART:SM00739
            eggNOG:COG0250 GO:GO:0032044 HOGENOM:HOG000038564 KO:K15172
            InterPro:IPR022581 InterPro:IPR005100 PANTHER:PTHR11125:SF7
            Pfam:PF03439 Pfam:PF11942 EMBL:Z68316 PIR:T23467 RefSeq:NP_502283.1
            ProteinModelPortal:Q21338 SMR:Q21338 STRING:Q21338 PaxDb:Q21338
            EnsemblMetazoa:K08E4.1 GeneID:178143 KEGG:cel:CELE_K08E4.1
            UCSC:K08E4.1 CTD:178143 WormBase:K08E4.1
            GeneTree:ENSGT00440000037640 InParanoid:Q21338 OMA:PAYGNES
            NextBio:899898 Uniprot:Q21338
        Length = 1208

 Score = 158 (60.7 bits), Expect = 1.1e-07, P = 1.1e-07
 Identities = 60/182 (32%), Positives = 76/182 (41%)

Query:   289 TSTSAYAA-TQSGTPMRAAYDIPRGPGY-EASKGPGYDASKAPSYDPTKGPSYDPAKGPG 346
             + T  Y A T     M  AYD  R P Y E  + P Y  SK P+Y      S       G
Sbjct:   813 SKTPMYGAQTPMYGSMTPAYDGGRTPAYGEGGRTPAY-GSKTPAYGDLDEHSSSRTPAYG 871

Query:   347 YDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ-RGPGYET 405
              D ++ P Y    GS  D  R P Y    G    P  G   D  R P YD   R PGYE+
Sbjct:   872 NDSSRTPAY----GSA-DGARTPAYGSTEG-GRTPAYG-SMDNSRTPAYDDSGRTPGYES 924

Query:   406 Q--RVPGYDVQ-RGPVY-EAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDG 461
                R P YD   + P Y E++ +      R P Y+      YD+  +P+Y+P     +D 
Sbjct:   925 MPSRTPNYDSSSKTPAYPESEHSA-----RTPAYN----NDYDIPLSPAYEPDAPEAYDN 975

Query:   462 AP 463
             AP
Sbjct:   976 AP 977

 Score = 143 (55.4 bits), Expect = 4.6e-06, P = 4.6e-06
 Identities = 73/253 (28%), Positives = 95/253 (37%)

Query:   236 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYA 295
             A GS   A G+ +  +S R     AY +       +G    A T    G+     T AY 
Sbjct:   848 AYGSKTPAYGDLDEHSSSRTP---AYGNDSSRTPAYGSADGARTPAY-GSTEGGRTPAYG 903

Query:   296 ATQ-SGTPMRAAYDIP-RGPGYEA--SKGPGYDAS-KAPSYDPTKGPSYDPAKGPGYDPT 350
             +   S TP   AYD   R PGYE+  S+ P YD+S K P+Y  ++  +  PA    YD  
Sbjct:   904 SMDNSRTP---AYDDSGRTPGYESMPSRTPNYDSSSKTPAYPESEHSARTPAYNNDYDIP 960

Query:   351 KGPGYDAQKGSNYD---------AQRGPNYDIHRG--PSYDPQRGLGYDMQRG----PNY 395
               P Y+      YD           R P YD +    P+Y+P      +   G    P Y
Sbjct:   961 LSPAYEPDAPEAYDNAPARTPAFVSRTPGYDTYENSSPTYEPDAATKVEEDIGDTSSPTY 1020

Query:   396 DMQRGPGYETQRVPGYDVQRG-P-VYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS--- 450
             D    P       PG  +    P  Y     P +     PG     G  YD   APS   
Sbjct:  1021 DSP--PHSYVVPTPGAMLNPATPGAYHVD-TPGFAAPMTPG----SGGAYDQYVAPSPFA 1073

Query:   451 -YDPSRGTGFDGA 462
              YD +     DGA
Sbjct:  1074 GYDSNNYNNADGA 1086

 Score = 133 (51.9 bits), Expect = 5.5e-05, P = 5.5e-05
 Identities = 67/218 (30%), Positives = 84/218 (38%)

Query:   304 RAAYDIPRGPGYEASKGPG---YDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKG 360
             RA   +    G  A  G G   Y +SK P  D  K P Y  +K P Y   + P Y +   
Sbjct:   773 RARVMVVGDTGITAGSGGGSSFYSSSKTPMRDSGKTPMYG-SKTPMYG-AQTPMYGSMTP 830

Query:   361 SNYDAQRGPNY-DIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY--ETQRVPGY---DVQ 414
             + YD  R P Y +  R P+Y  +     D+     +   R P Y  ++ R P Y   D  
Sbjct:   831 A-YDGGRTPAYGEGGRTPAYGSKTPAYGDLDE---HSSSRTPAYGNDSSRTPAYGSADGA 886

Query:   415 RGPVY---EAQRAPSYIPQ---RGPGYDLQ-RGQGYDMR--RAPSYDPSRGTGFDGAPRG 465
             R P Y   E  R P+Y      R P YD   R  GY+    R P+YD S  T     P  
Sbjct:   887 RTPAYGSTEGGRTPAYGSMDNSRTPAYDDSGRTPGYESMPSRTPNYDSSSKT--PAYPE- 943

Query:   466 AAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGN-PAR 502
              + H    P  NN  Y     PA          N PAR
Sbjct:   944 -SEHSARTPAYNN-DYDIPLSPAYEPDAPEAYDNAPAR 979


>UNIPROTKB|Q21338
            symbol:spt-5 "Transcription elongation factor SPT5"
            species:6239 "Caenorhabditis elegans" [GO:0032044 "DSIF complex"
            evidence=ISS] InterPro:IPR006645 InterPro:IPR017071
            InterPro:IPR024945 PIRSF:PIRSF036945 SMART:SM00738 SMART:SM01104
            Pfam:PF00467 GO:GO:0009792 GO:GO:0040007 GO:GO:0002119
            GO:GO:0006351 GO:GO:0040035 GO:GO:0032968 EMBL:Z68752
            InterPro:IPR008991 SUPFAM:SSF50104 InterPro:IPR005824 SMART:SM00739
            eggNOG:COG0250 GO:GO:0032044 HOGENOM:HOG000038564 KO:K15172
            InterPro:IPR022581 InterPro:IPR005100 PANTHER:PTHR11125:SF7
            Pfam:PF03439 Pfam:PF11942 EMBL:Z68316 PIR:T23467 RefSeq:NP_502283.1
            ProteinModelPortal:Q21338 SMR:Q21338 STRING:Q21338 PaxDb:Q21338
            EnsemblMetazoa:K08E4.1 GeneID:178143 KEGG:cel:CELE_K08E4.1
            UCSC:K08E4.1 CTD:178143 WormBase:K08E4.1
            GeneTree:ENSGT00440000037640 InParanoid:Q21338 OMA:PAYGNES
            NextBio:899898 Uniprot:Q21338
        Length = 1208

 Score = 158 (60.7 bits), Expect = 1.1e-07, P = 1.1e-07
 Identities = 60/182 (32%), Positives = 76/182 (41%)

Query:   289 TSTSAYAA-TQSGTPMRAAYDIPRGPGY-EASKGPGYDASKAPSYDPTKGPSYDPAKGPG 346
             + T  Y A T     M  AYD  R P Y E  + P Y  SK P+Y      S       G
Sbjct:   813 SKTPMYGAQTPMYGSMTPAYDGGRTPAYGEGGRTPAY-GSKTPAYGDLDEHSSSRTPAYG 871

Query:   347 YDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ-RGPGYET 405
              D ++ P Y    GS  D  R P Y    G    P  G   D  R P YD   R PGYE+
Sbjct:   872 NDSSRTPAY----GSA-DGARTPAYGSTEG-GRTPAYG-SMDNSRTPAYDDSGRTPGYES 924

Query:   406 Q--RVPGYDVQ-RGPVY-EAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDG 461
                R P YD   + P Y E++ +      R P Y+      YD+  +P+Y+P     +D 
Sbjct:   925 MPSRTPNYDSSSKTPAYPESEHSA-----RTPAYN----NDYDIPLSPAYEPDAPEAYDN 975

Query:   462 AP 463
             AP
Sbjct:   976 AP 977

 Score = 143 (55.4 bits), Expect = 4.6e-06, P = 4.6e-06
 Identities = 73/253 (28%), Positives = 95/253 (37%)

Query:   236 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYA 295
             A GS   A G+ +  +S R     AY +       +G    A T    G+     T AY 
Sbjct:   848 AYGSKTPAYGDLDEHSSSRTP---AYGNDSSRTPAYGSADGARTPAY-GSTEGGRTPAYG 903

Query:   296 ATQ-SGTPMRAAYDIP-RGPGYEA--SKGPGYDAS-KAPSYDPTKGPSYDPAKGPGYDPT 350
             +   S TP   AYD   R PGYE+  S+ P YD+S K P+Y  ++  +  PA    YD  
Sbjct:   904 SMDNSRTP---AYDDSGRTPGYESMPSRTPNYDSSSKTPAYPESEHSARTPAYNNDYDIP 960

Query:   351 KGPGYDAQKGSNYD---------AQRGPNYDIHRG--PSYDPQRGLGYDMQRG----PNY 395
               P Y+      YD           R P YD +    P+Y+P      +   G    P Y
Sbjct:   961 LSPAYEPDAPEAYDNAPARTPAFVSRTPGYDTYENSSPTYEPDAATKVEEDIGDTSSPTY 1020

Query:   396 DMQRGPGYETQRVPGYDVQRG-P-VYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS--- 450
             D    P       PG  +    P  Y     P +     PG     G  YD   APS   
Sbjct:  1021 DSP--PHSYVVPTPGAMLNPATPGAYHVD-TPGFAAPMTPG----SGGAYDQYVAPSPFA 1073

Query:   451 -YDPSRGTGFDGA 462
              YD +     DGA
Sbjct:  1074 GYDSNNYNNADGA 1086

 Score = 133 (51.9 bits), Expect = 5.5e-05, P = 5.5e-05
 Identities = 67/218 (30%), Positives = 84/218 (38%)

Query:   304 RAAYDIPRGPGYEASKGPG---YDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKG 360
             RA   +    G  A  G G   Y +SK P  D  K P Y  +K P Y   + P Y +   
Sbjct:   773 RARVMVVGDTGITAGSGGGSSFYSSSKTPMRDSGKTPMYG-SKTPMYG-AQTPMYGSMTP 830

Query:   361 SNYDAQRGPNY-DIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY--ETQRVPGY---DVQ 414
             + YD  R P Y +  R P+Y  +     D+     +   R P Y  ++ R P Y   D  
Sbjct:   831 A-YDGGRTPAYGEGGRTPAYGSKTPAYGDLDE---HSSSRTPAYGNDSSRTPAYGSADGA 886

Query:   415 RGPVY---EAQRAPSYIPQ---RGPGYDLQ-RGQGYDMR--RAPSYDPSRGTGFDGAPRG 465
             R P Y   E  R P+Y      R P YD   R  GY+    R P+YD S  T     P  
Sbjct:   887 RTPAYGSTEGGRTPAYGSMDNSRTPAYDDSGRTPGYESMPSRTPNYDSSSKT--PAYPE- 943

Query:   466 AAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGN-PAR 502
              + H    P  NN  Y     PA          N PAR
Sbjct:   944 -SEHSARTPAYNN-DYDIPLSPAYEPDAPEAYDNAPAR 979


>MGI|MGI:1330280
            symbol:Krtap6-2 "keratin associated protein 6-2"
            species:10090 "Mus musculus" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0005882 "intermediate filament" evidence=IEA] [GO:0008150
            "biological_process" evidence=ND] MGI:MGI:1330280 GO:GO:0005882
            CTD:337967 EMBL:D89902 IPI:IPI00116464 RefSeq:NP_034803.2
            UniGene:Mm.3524 PRIDE:O08884 DNASU:16701 GeneID:16701
            KEGG:mmu:16701 UCSC:uc007zvp.1 NextBio:290464 Genevestigator:O08884
            Uniprot:O08884
        Length = 159

 Score = 128 (50.1 bits), Expect = 1.9e-07, P = 1.9e-07
 Identities = 38/124 (30%), Positives = 40/124 (32%)

Query:   312 GPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNY 371
             G GY +  G GY       Y    G  Y    G GY    G GY    GS Y    G  Y
Sbjct:    13 GCGYGSGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGCGYGSGY 72

Query:   372 DIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQR 431
                 G  Y    G GY    G  Y    G GY      GY    G  Y +     Y    
Sbjct:    73 GCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGSGYGSGC 132

Query:   432 GPGY 435
             G GY
Sbjct:   133 GCGY 136

 Score = 126 (49.4 bits), Expect = 3.1e-07, P = 3.1e-07
 Identities = 39/130 (30%), Positives = 40/130 (30%)

Query:   314 GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDI 373
             G     G GY +     Y    G  Y    G GY    G GY    GS Y    G  Y  
Sbjct:     7 GNSCGYGCGYGSGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGC 66

Query:   374 HRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGP 433
               G  Y    G GY    G  Y    G GY      GY    G  Y       Y    G 
Sbjct:    67 GYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGS 126

Query:   434 GYDLQRGQGY 443
             GY    G GY
Sbjct:   127 GYGSGCGCGY 136

 Score = 125 (49.1 bits), Expect = 3.9e-07, P = 3.9e-07
 Identities = 40/136 (29%), Positives = 42/136 (30%)

Query:   336 GPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNY 395
             G  Y    G GY    G GY    GS Y    G  Y    G  Y    G GY    G  Y
Sbjct:    13 GCGYGSGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGCGYGSGY 72

Query:   396 DMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSR 455
                 G GY      GY    G  Y       Y    G GY    G GY       Y    
Sbjct:    73 GCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGSGYGSGC 132

Query:   456 GTGFDGAPR-GAAPHG 470
             G G+    R G   +G
Sbjct:   133 GCGYGSYYRSGCCGYG 148

 Score = 124 (48.7 bits), Expect = 5.0e-07, P = 5.0e-07
 Identities = 34/112 (30%), Positives = 37/112 (33%)

Query:   300 GTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQK 359
             G+   + Y    G GY    G GY       Y    G  Y    G GY    G GY    
Sbjct:    17 GSGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGY 76

Query:   360 GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGY 411
             GS Y    G  Y    G  Y    G GY    G  Y    G GY +    GY
Sbjct:    77 GSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGSGY 128

 Score = 118 (46.6 bits), Expect = 2.2e-06, P = 2.2e-06
 Identities = 33/107 (30%), Positives = 35/107 (32%)

Query:   305 AAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYD 364
             + Y    G GY    G GY       Y    G  Y    G GY    G GY    GS Y 
Sbjct:    30 SGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYG 89

Query:   365 AQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGY 411
                G  Y    G  Y    G GY    G  Y    G GY +    GY
Sbjct:    90 CGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGSGYGSGCGCGY 136

 Score = 118 (46.6 bits), Expect = 2.2e-06, P = 2.2e-06
 Identities = 34/120 (28%), Positives = 39/120 (32%)

Query:   284 GAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAK 343
             G+G  +     + +  G    + Y    G GY    G GY       Y    G  Y    
Sbjct:    17 GSGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGY 76

Query:   344 GPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY 403
             G GY    G GY    GS Y    G  Y    G  Y    G GY    G  Y    G GY
Sbjct:    77 GSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGSGYGSGCGCGY 136

 Score = 111 (44.1 bits), Expect = 0.00010, P = 0.00010
 Identities = 35/127 (27%), Positives = 40/127 (31%)

Query:   261 YEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKG 320
             Y  GYG   G+G      +    G G  +       +  G    + Y    G GY    G
Sbjct:    12 YGCGYG--SGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGCGYG 69

Query:   321 PGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYD 380
              GY       Y    G  Y    G GY    G GY    GS Y    G  Y    G  Y 
Sbjct:    70 SGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGSGYG 129

Query:   381 PQRGLGY 387
                G GY
Sbjct:   130 SGCGCGY 136


>WB|WBGene00002280 [details] [associations]
            symbol:let-2 species:6239 "Caenorhabditis elegans"
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA] [GO:0005581 "collagen" evidence=IEA] [GO:0040007
            "growth" evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0009792 "embryo development ending in birth or
            egg hatching" evidence=IMP] [GO:0000003 "reproduction"
            evidence=IMP] [GO:0040011 "locomotion" evidence=IMP] [GO:0040039
            "inductive cell migration" evidence=IMP] [GO:0040018 "positive
            regulation of multicellular organism growth" evidence=IMP]
            [GO:0005604 "basement membrane" evidence=IDA] [GO:0005198
            "structural molecule activity" evidence=IDA] InterPro:IPR001442
            Pfam:PF01413 PROSITE:PS51403 SMART:SM00111 GO:GO:0009792
            GO:GO:0040007 GO:GO:0002119 InterPro:IPR016187 SUPFAM:SSF56436
            GO:GO:0040018 GO:GO:0000003 GO:GO:0040039 InterPro:IPR008160
            Pfam:PF01391 eggNOG:NOG12793 GO:GO:0030020 HOGENOM:HOG000085652
            GO:GO:0005587 Gene3D:2.170.240.10 KO:K06237 EMBL:Z22964 EMBL:U22327
            EMBL:FO081065 EMBL:J05066 PIR:A34476 PIR:T29350 PIR:T29351
            RefSeq:NP_510663.1 RefSeq:NP_510664.1 ProteinModelPortal:P17140
            SMR:P17140 STRING:P17140 PaxDb:P17140 PRIDE:P17140 GeneID:181708
            KEGG:cel:CELE_F01G12.5 UCSC:F01G12.5b.1 CTD:181708
            WormBase:F01G12.5a WormBase:F01G12.5b InParanoid:P17140
            NextBio:915032 GO:GO:0016043 Uniprot:P17140
        Length = 1758

 Score = 157 (60.3 bits), Expect = 2.1e-07, P = 2.1e-07
 Identities = 82/261 (31%), Positives = 95/261 (36%)

Query:   234 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQG-HGPPPSATTAGVVGAGPNTSTS 292
             ++ +  Y G  G   N     P G   + DG   P G  G P +    G  G  P     
Sbjct:   335 QKGEAGYPGRDGPKGNSGPPGPPGGGTFNDGAPGPPGLPGRPGNPGPPGTDGY-PGAPGP 393

Query:   293 AYAATQSGTPMRAAYDIPRG-PGYEASKGPGYDASKAPSYDPTKG-PSYDPAKG-PGYDP 349
             A     +G P    Y    G PG +  KG G     AP      G P     KG PGY  
Sbjct:   394 AGPIGNTGGPGLPGYPGNEGLPGPKGDKGDG-GIPGAPGVSGPSGIPGLPGPKGEPGYRG 452

Query:   350 TKG------PGYDAQKG-SNYDAQRGPN-YDIHRGPSYDPQRGL-GYDMQRG---PN-YD 396
             T G      PG D + G      ++G N     RGP  D   GL G   QRG   PN YD
Sbjct:   453 TPGQSIPGLPGKDGKPGLDGAPGRKGENGLPGVRGPPGDSLNGLPGAPGQRGAPGPNGYD 512

Query:   397 MQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYD-PS 454
              + G        PG    RG    A  AP    ++G PGY  Q G   D R  P    P 
Sbjct:   513 GRDGVN-GLPGAPGTKGDRGGTCSAC-APGTKGEKGLPGYSGQPGPQGD-RGLPGMPGPV 569

Query:   455 RGTGFDGAPRGAAPHGQVPPP 475
                G DG P  A   G   PP
Sbjct:   570 GDAGDDGLPGPAGRPGSPGPP 590


>UNIPROTKB|P17140 [details] [associations]
            symbol:let-2 "Collagen alpha-2(IV) chain" species:6239
            "Caenorhabditis elegans" [GO:0016043 "cellular component
            organization" evidence=NAS] [GO:0030020 "extracellular matrix
            structural constituent conferring tensile strength" evidence=IMP]
            [GO:0005587 "collagen type IV" evidence=IMP] InterPro:IPR001442
            Pfam:PF01413 PROSITE:PS51403 SMART:SM00111 GO:GO:0009792
            GO:GO:0040007 GO:GO:0002119 InterPro:IPR016187 SUPFAM:SSF56436
            GO:GO:0040018 GO:GO:0000003 GO:GO:0040039 InterPro:IPR008160
            Pfam:PF01391 eggNOG:NOG12793 GO:GO:0030020 HOGENOM:HOG000085652
            GO:GO:0005587 Gene3D:2.170.240.10 KO:K06237 EMBL:Z22964 EMBL:U22327
            EMBL:FO081065 EMBL:J05066 PIR:A34476 PIR:T29350 PIR:T29351
            RefSeq:NP_510663.1 RefSeq:NP_510664.1 ProteinModelPortal:P17140
            SMR:P17140 STRING:P17140 PaxDb:P17140 PRIDE:P17140 GeneID:181708
            KEGG:cel:CELE_F01G12.5 UCSC:F01G12.5b.1 CTD:181708
            WormBase:F01G12.5a WormBase:F01G12.5b InParanoid:P17140
            NextBio:915032 GO:GO:0016043 Uniprot:P17140
        Length = 1758

 Score = 157 (60.3 bits), Expect = 2.1e-07, P = 2.1e-07
 Identities = 82/261 (31%), Positives = 95/261 (36%)

Query:   234 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQG-HGPPPSATTAGVVGAGPNTSTS 292
             ++ +  Y G  G   N     P G   + DG   P G  G P +    G  G  P     
Sbjct:   335 QKGEAGYPGRDGPKGNSGPPGPPGGGTFNDGAPGPPGLPGRPGNPGPPGTDGY-PGAPGP 393

Query:   293 AYAATQSGTPMRAAYDIPRG-PGYEASKGPGYDASKAPSYDPTKG-PSYDPAKG-PGYDP 349
             A     +G P    Y    G PG +  KG G     AP      G P     KG PGY  
Sbjct:   394 AGPIGNTGGPGLPGYPGNEGLPGPKGDKGDG-GIPGAPGVSGPSGIPGLPGPKGEPGYRG 452

Query:   350 TKG------PGYDAQKG-SNYDAQRGPN-YDIHRGPSYDPQRGL-GYDMQRG---PN-YD 396
             T G      PG D + G      ++G N     RGP  D   GL G   QRG   PN YD
Sbjct:   453 TPGQSIPGLPGKDGKPGLDGAPGRKGENGLPGVRGPPGDSLNGLPGAPGQRGAPGPNGYD 512

Query:   397 MQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYD-PS 454
              + G        PG    RG    A  AP    ++G PGY  Q G   D R  P    P 
Sbjct:   513 GRDGVN-GLPGAPGTKGDRGGTCSAC-APGTKGEKGLPGYSGQPGPQGD-RGLPGMPGPV 569

Query:   455 RGTGFDGAPRGAAPHGQVPPP 475
                G DG P  A   G   PP
Sbjct:   570 GDAGDDGLPGPAGRPGSPGPP 590


>WB|WBGene00000123 [details] [associations]
            symbol:ama-1 species:6239 "Caenorhabditis elegans"
            [GO:0003899 "DNA-directed RNA polymerase activity"
            evidence=IEA;ISS] [GO:0003677 "DNA binding" evidence=IEA]
            [GO:0005665 "DNA-directed RNA polymerase II, core complex"
            evidence=IEA] [GO:0006366 "transcription from RNA polymerase II
            promoter" evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA;IMP] [GO:0009792 "embryo development ending in birth
            or egg hatching" evidence=IMP] [GO:0000003 "reproduction"
            evidence=IMP] [GO:0040010 "positive regulation of growth rate"
            evidence=IMP] [GO:0007052 "mitotic spindle organization"
            evidence=IMP] [GO:0010458 "exit from mitosis" evidence=IMP]
            [GO:0008356 "asymmetric cell division" evidence=IMP] [GO:0032502
            "developmental process" evidence=IMP] [GO:0006479 "protein
            methylation" evidence=IMP] [GO:0007369 "gastrulation" evidence=IMP]
            [GO:0005634 "nucleus" evidence=IDA] [GO:0001055 "RNA polymerase II
            activity" evidence=IMP] [GO:0042789 "mRNA transcription from RNA
            polymerase II promoter" evidence=IMP] InterPro:IPR000684
            InterPro:IPR000722 InterPro:IPR006592 InterPro:IPR007066
            InterPro:IPR007073 InterPro:IPR007075 InterPro:IPR007080
            InterPro:IPR007081 InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983
            Pfam:PF04990 Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000
            Pfam:PF05001 PROSITE:PS00115 SMART:SM00663 GO:GO:0005634
            GO:GO:0009792 GO:GO:0040010 GO:GO:0007052 GO:GO:0010458
            GO:GO:0046872 GO:GO:0003677 GO:GO:0000003 Gene3D:2.40.40.20
            InterPro:IPR009010 GO:GO:0006479 GO:GO:0008356 GO:GO:0007369
            GO:GO:0042789 EMBL:FO081153 eggNOG:COG0086 GO:GO:0005665
            EMBL:M29235 PIR:A34092 PIR:T29959 RefSeq:NP_500523.4 IntAct:P16356
            STRING:P16356 PaxDb:P16356 EnsemblMetazoa:F36A4.7.1
            EnsemblMetazoa:F36A4.7.2 GeneID:177190 KEGG:cel:CELE_F36A4.7
            UCSC:F36A4.7 CTD:247749 WormBase:F36A4.7
            GeneTree:ENSGT00700000104490 HOGENOM:HOG000222975 InParanoid:P16356
            OMA:KVLPWST NextBio:895720 GO:GO:0001055 Uniprot:P16356
        Length = 1856

 Score = 157 (60.3 bits), Expect = 2.3e-07, P = 2.3e-07
 Identities = 68/254 (26%), Positives = 93/254 (36%)

Query:   238 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAAT 297
             G   GA  +    T G   G + + +G   P   G P  A +      G   S   Y+ +
Sbjct:  1527 GMSPGAGFSPAGNTDG---GASPFNEGGWSPASPGDPLGALSPRTPSYG-GMSPGVYSPS 1582

Query:   298 QSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDA 357
                  M + +  P  P Y  +      +  +PSY PT  PSY P   P Y PT  P Y  
Sbjct:  1583 SPQFSMTSPHYSPTSPSYSPTSPAAGQSPVSPSYSPTS-PSYSPTS-PSYSPTS-PSYSP 1639

Query:   358 QKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGP 417
                S Y +   P+Y     PSY P     Y     P+Y     P Y     P Y     P
Sbjct:  1640 TSPS-Y-SPTSPSYS-PTSPSYSPSSP-SYSPS-SPSYSPS-SPRYSPTS-PTYS-PTSP 1691

Query:   418 VYEAQRAPSYIPQRGPGYD-----LQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQV 472
              Y +  +P+Y P   P Y       + G GY    +P Y PS  T    +P  +    Q 
Sbjct:  1692 TY-SPTSPTYSPT-SPTYSPTSPSYESGGGYSPS-SPKYSPSSPTYSPTSPSYSPTSPQY 1748

Query:   473 PPPLNNVPYGSATP 486
              P   +  Y  ++P
Sbjct:  1749 SP--TSPQYSPSSP 1760

 Score = 154 (59.3 bits), Expect = 4.8e-07, P = 4.8e-07
 Identities = 65/219 (29%), Positives = 87/219 (39%)

Query:   274 PPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDP 333
             P  + T+   G  P  S S    + S +P   +Y  P  P Y  +  P Y  + +PSY P
Sbjct:  1598 PSYSPTSPAAGQSP-VSPSYSPTSPSYSPTSPSYS-PTSPSYSPTS-PSYSPT-SPSYSP 1653

Query:   334 TKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGP 393
             T  PSY P+  P Y P+  P Y +     Y +   P Y     P+Y P     Y     P
Sbjct:  1654 TS-PSYSPSS-PSYSPSS-PSY-SPSSPRY-SPTSPTYS-PTSPTYSPTSPT-YS-PTSP 1705

Query:   394 NYD-----MQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRA 448
              Y       + G GY     P Y     P Y +  +PSY P   P Y     Q Y    +
Sbjct:  1706 TYSPTSPSYESGGGYSPSS-PKYSPS-SPTY-SPTSPSYSPT-SPQYSPTSPQ-YSPS-S 1759

Query:   449 PSYDPSRGTGFDGAPRG-AAPHGQVPPPLNNVPYGSATP 486
             P+Y PS  T    +PRG ++P      P  +    S TP
Sbjct:  1760 PTYTPSSPTYNPTSPRGFSSPQYSPTSPTYSPTSPSYTP 1798


>UNIPROTKB|P16356 [details] [associations]
            symbol:ama-1 "DNA-directed RNA polymerase II subunit RPB1"
            species:6239 "Caenorhabditis elegans" [GO:0005515 "protein binding"
            evidence=IPI] InterPro:IPR000684 InterPro:IPR000722
            InterPro:IPR006592 InterPro:IPR007066 InterPro:IPR007073
            InterPro:IPR007075 InterPro:IPR007080 InterPro:IPR007081
            InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983 Pfam:PF04990
            Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000 Pfam:PF05001
            PROSITE:PS00115 SMART:SM00663 GO:GO:0005634 GO:GO:0009792
            GO:GO:0040010 GO:GO:0007052 GO:GO:0010458 GO:GO:0046872
            GO:GO:0003677 GO:GO:0000003 Gene3D:2.40.40.20 InterPro:IPR009010
            GO:GO:0006479 GO:GO:0008356 GO:GO:0007369 GO:GO:0042789
            EMBL:FO081153 eggNOG:COG0086 GO:GO:0005665 EMBL:M29235 PIR:A34092
            PIR:T29959 RefSeq:NP_500523.4 IntAct:P16356 STRING:P16356
            PaxDb:P16356 EnsemblMetazoa:F36A4.7.1 EnsemblMetazoa:F36A4.7.2
            GeneID:177190 KEGG:cel:CELE_F36A4.7 UCSC:F36A4.7 CTD:247749
            WormBase:F36A4.7 GeneTree:ENSGT00700000104490 HOGENOM:HOG000222975
            InParanoid:P16356 OMA:KVLPWST NextBio:895720 GO:GO:0001055
            Uniprot:P16356
        Length = 1856

 Score = 157 (60.3 bits), Expect = 2.3e-07, P = 2.3e-07
 Identities = 68/254 (26%), Positives = 93/254 (36%)

Query:   238 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAAT 297
             G   GA  +    T G   G + + +G   P   G P  A +      G   S   Y+ +
Sbjct:  1527 GMSPGAGFSPAGNTDG---GASPFNEGGWSPASPGDPLGALSPRTPSYG-GMSPGVYSPS 1582

Query:   298 QSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDA 357
                  M + +  P  P Y  +      +  +PSY PT  PSY P   P Y PT  P Y  
Sbjct:  1583 SPQFSMTSPHYSPTSPSYSPTSPAAGQSPVSPSYSPTS-PSYSPTS-PSYSPTS-PSYSP 1639

Query:   358 QKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGP 417
                S Y +   P+Y     PSY P     Y     P+Y     P Y     P Y     P
Sbjct:  1640 TSPS-Y-SPTSPSYS-PTSPSYSPSSP-SYSPS-SPSYSPS-SPRYSPTS-PTYS-PTSP 1691

Query:   418 VYEAQRAPSYIPQRGPGYD-----LQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQV 472
              Y +  +P+Y P   P Y       + G GY    +P Y PS  T    +P  +    Q 
Sbjct:  1692 TY-SPTSPTYSPT-SPTYSPTSPSYESGGGYSPS-SPKYSPSSPTYSPTSPSYSPTSPQY 1748

Query:   473 PPPLNNVPYGSATP 486
              P   +  Y  ++P
Sbjct:  1749 SP--TSPQYSPSSP 1760

 Score = 154 (59.3 bits), Expect = 4.8e-07, P = 4.8e-07
 Identities = 65/219 (29%), Positives = 87/219 (39%)

Query:   274 PPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDP 333
             P  + T+   G  P  S S    + S +P   +Y  P  P Y  +  P Y  + +PSY P
Sbjct:  1598 PSYSPTSPAAGQSP-VSPSYSPTSPSYSPTSPSYS-PTSPSYSPTS-PSYSPT-SPSYSP 1653

Query:   334 TKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGP 393
             T  PSY P+  P Y P+  P Y +     Y +   P Y     P+Y P     Y     P
Sbjct:  1654 TS-PSYSPSS-PSYSPSS-PSY-SPSSPRY-SPTSPTYS-PTSPTYSPTSPT-YS-PTSP 1705

Query:   394 NYD-----MQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRA 448
              Y       + G GY     P Y     P Y +  +PSY P   P Y     Q Y    +
Sbjct:  1706 TYSPTSPSYESGGGYSPSS-PKYSPS-SPTY-SPTSPSYSPT-SPQYSPTSPQ-YSPS-S 1759

Query:   449 PSYDPSRGTGFDGAPRG-AAPHGQVPPPLNNVPYGSATP 486
             P+Y PS  T    +PRG ++P      P  +    S TP
Sbjct:  1760 PTYTPSSPTYNPTSPRGFSSPQYSPTSPTYSPTSPSYTP 1798


>WB|WBGene00001215 [details] [associations]
            symbol:ego-2 species:6239 "Caenorhabditis elegans"
            [GO:0040002 "collagen and cuticulin-based cuticle development"
            evidence=IMP] [GO:0002009 "morphogenesis of an epithelium"
            evidence=IMP] [GO:0040007 "growth" evidence=IMP] [GO:0002119
            "nematode larval development" evidence=IMP] [GO:0040035
            "hermaphrodite genitalia development" evidence=IMP] [GO:0009792
            "embryo development ending in birth or egg hatching" evidence=IMP]
            [GO:0045747 "positive regulation of Notch signaling pathway"
            evidence=IGI] InterPro:IPR025304 Pfam:PF13949 GO:GO:0009792
            GO:GO:0002009 GO:GO:0040007 GO:GO:0002119 GO:GO:0045747
            GO:GO:0040035 Gene3D:1.25.40.280 InterPro:IPR004328 Pfam:PF03097
            SMART:SM01041 PROSITE:PS51180 GO:GO:0040002 EMBL:AL117201
            UniGene:Cel.16377 GeneID:190251 KEGG:cel:CELE_Y53H1C.2 CTD:190251
            RefSeq:NP_001251634.1 ProteinModelPortal:H8ESG1 WormBase:Y53H1C.2c
            Uniprot:H8ESG1
        Length = 1494

 Score = 136 (52.9 bits), Expect = 2.4e-07, Sum P(2) = 2.4e-07
 Identities = 79/280 (28%), Positives = 107/280 (38%)

Query:   239 SYGGATGNSENETSGRPVGQNAYEDGYGVPQG-----HGPPPSATTAGVVGAGPNTSTSA 293
             SYG  T      + G   G + Y++G   P G      GPP +   A    A P TS   
Sbjct:  1050 SYGAPT--PPQASYGPAPGAHGYQNGAQGPPGAEVGAQGPPGAHFGAHGASAPPPTS--- 1104

Query:   294 YAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP-GYDP--- 349
             Y A     P +A+Y     PG +   G  ++A  A +  PT   +  P +GP G  P   
Sbjct:  1105 YGAPTPQRPPQASYGA--APGAQGPPGGQFEAHGAAALPPTSHGAPTP-QGPFGAAPGAQ 1161

Query:   350 --TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQR 407
                +GP Y  Q+G+ Y+AQ+ P   I   P   PQ    +  Q G        PG +   
Sbjct:  1162 FGAQGP-Y-GQQGARYEAQKSPGAAIFGAPGAPPQHQGSFGAQFGVPPPQNSAPGAQFGA 1219

Query:   408 VPGYDVQRGPVYEAQRAPSY-IPQRGPGYDL-QRG-QGYDMRRAP---SYD-----P-SR 455
              P       P    Q  PSY  P   P   + Q   QG  +   P   S+      P +R
Sbjct:  1220 KPEAS-SHAPTPPPQPHPSYQAPAPPPALSVFQHSPQGAPITAPPPASSHHEHIAAPQAR 1278

Query:   456 GTGFDGAPRG--AAPHG-QVPPPLNNVPYGSATPPARSGS 492
              T   GAP    A P   +   P N  P   A P A++ +
Sbjct:  1279 FTPTPGAPSPWHATPAELKFQTPWNTTPQYHAPPGAQAAA 1318

 Score = 70 (29.7 bits), Expect = 2.4e-07, Sum P(2) = 2.4e-07
 Identities = 30/122 (24%), Positives = 58/122 (47%)

Query:    57 SQHVEMQKLATENQRLA-ATHGTLRQELAAAQHEL--QIL----HGQIGGMKSERELQMR 109
             ++H+E  K    +   A A H    Q L     E+  +I+     G++    S  ELQ+R
Sbjct:   520 AEHLEQAKAHNVSLNKAIAQHSANLQLLTLPCREMWMKIVPPEQQGEMRNGSSPEELQVR 579

Query:   110 NLTEKIAKMEAEL-KTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQ 168
              + EK+ +M+A+  K  E  + +  K+   +  L+   E    ++  +  +L + HT++Q
Sbjct:   580 KMIEKVMEMQAQRRKLVEQFEADL-KADNISNKLMGTNERGAEEI--MKSELTK-HTNIQ 635

Query:   169 QI 170
             Q+
Sbjct:   636 QL 637


>ZFIN|ZDB-GENE-030131-5725 [details] [associations]
            symbol:arid1ab "AT rich interactive domain 1Ab
            (SWI-like)" species:7955 "Danio rerio" [GO:0003677 "DNA binding"
            evidence=IEA] [GO:0005622 "intracellular" evidence=IEA]
            InterPro:IPR001606 Pfam:PF01388 PROSITE:PS51011 SMART:SM00501
            ZFIN:ZDB-GENE-030131-5725 GO:GO:0003677 GO:GO:0005622
            Gene3D:1.10.150.60 InterPro:IPR021906 Pfam:PF12031 SUPFAM:SSF46774
            GeneTree:ENSGT00550000074575 EMBL:CABZ01050711 EMBL:CT027837
            IPI:IPI00485842 Ensembl:ENSDART00000084272 Bgee:F1RE50
            Uniprot:F1RE50
        Length = 2135

 Score = 157 (60.3 bits), Expect = 3.3e-07, Sum P(2) = 3.3e-07
 Identities = 78/257 (30%), Positives = 104/257 (40%)

Query:   238 GSYGGATGNSENETSGRPVGQNAYEDGYGVP-QGHGPP-PSATTAGVVGAGPNTSTSAYA 295
             G + GA GN  ++  G P      + G   P QG+GPP P     G+ G    TS +  +
Sbjct:   312 GQHYGA-GNPYSQQQGPPPSS---QQGPPYPGQGYGPPGPQRYPMGMQG---RTSGNL-S 363

Query:   296 ATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGP---SY--DPAKGPGYDP- 349
               Q G  M   Y    GPG       GY   + PS  P  GP   SY   P+ GPG  P 
Sbjct:   364 GIQYGQQM--GYG-QHGPGGYGQNQAGYYGQQGPS--PHGGPQQSSYPQQPSTGPGSQPP 418

Query:   350 -TKGPGYD--AQKGSNYDAQRGPNYDIHRGPSYD--PQRGLG---YDMQRGPNYDMQRGP 401
              ++ P      Q G++Y   +GP+      P Y   PQ   G   +   +GP        
Sbjct:   419 YSQQPSGTPHGQSGTSYGQPQGPHVPNQGQPPYSQTPQSQSGQSPFPQSQGPTQSQGPSQ 478

Query:   402 GYE-TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSY---DPSRGT 457
             G + +Q  PGY     P    Q A     Q+GP    Q+ QG    + PS     PS+ T
Sbjct:   479 GQQGSQSQPGYT--HPPSGSGQPAQ----QQGPS---QQQQGPPQSQTPSSAPPQPSQQT 529

Query:   458 GFDGAPRGAAPHGQVPP 474
                G P   +P+ Q PP
Sbjct:   530 SGQGQP---SPYSQTPP 543

 Score = 125 (49.1 bits), Expect = 0.00086, Sum P(2) = 0.00086
 Identities = 79/298 (26%), Positives = 109/298 (36%)

Query:   225 ELMNAPNVDRRADGSYGGATGNSENETSGR-PVGQNA-YEDGYGVPQ--GHGPPPSATTA 280
             +L+ +P+  R          G  E    G   +G ++ Y  G+   Q   H PPP +   
Sbjct:   232 QLLTSPSSTRSYQNYPASEYGGQEGAAKGPGDMGSSSQYGGGHPAWQQRSHHPPPMSP-- 289

Query:   281 GVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYD 340
             G  G    T        Q G      Y    G  Y   +GP   + + P Y P +G  Y 
Sbjct:   290 GNTGQANRTQPPG-PMDQVGKIRGQHYGA--GNPYSQQQGPPPSSQQGPPY-PGQG--YG 343

Query:   341 PAKGPGYDPTKGPGYDAQK--GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPN---- 394
             P  GP   P    G  +    G  Y  Q G  Y  H GP    Q   GY  Q+GP+    
Sbjct:   344 PP-GPQRYPMGMQGRTSGNLSGIQYGQQMG--YGQH-GPGGYGQNQAGYYGQQGPSPHGG 399

Query:   395 -----YDMQ--RGPGYE---TQRVPGYDV-QRGPVYEAQRAPSYIPQRG-PGYDLQRGQG 442
                  Y  Q   GPG +   +Q+  G    Q G  Y   + P ++P +G P Y  Q  Q 
Sbjct:   400 PQQSSYPQQPSTGPGSQPPYSQQPSGTPHGQSGTSYGQPQGP-HVPNQGQPPYS-QTPQS 457

Query:   443 YDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNP 500
                 ++P +  S+G      P       Q  P   + P GS  P  + G  Q + G P
Sbjct:   458 QS-GQSP-FPQSQGPTQSQGPSQGQQGSQSQPGYTHPPSGSGQPAQQQGPSQQQQGPP 513

 Score = 50 (22.7 bits), Expect = 3.3e-07, Sum P(2) = 3.3e-07
 Identities = 9/12 (75%), Positives = 9/12 (75%)

Query:    30 GMRPPMPGAFPP 41
             GM P  PGAFPP
Sbjct:   101 GMAPHHPGAFPP 112


>UNIPROTKB|J3KNM7 [details] [associations]
            symbol:COL4A4 "Collagen alpha-4(IV) chain" species:9606
            "Homo sapiens" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
            EMBL:CH471063 InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 Gene3D:2.170.240.10
            EMBL:AC079235 EMBL:AC073149 UniGene:Hs.591645 HGNC:HGNC:2206
            ChiTaRS:COL4A4 ProteinModelPortal:J3KNM7 Ensembl:ENST00000329662
            Uniprot:J3KNM7
        Length = 1687

 Score = 153 (58.9 bits), Expect = 5.5e-07, P = 5.5e-07
 Identities = 81/253 (32%), Positives = 101/253 (39%)

Query:   261 YEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKG 320
             Y   +G P   GPP      G  GA P  S S     + GTP  A  +IP  PG+    G
Sbjct:   672 YPGRHGPPGFDGPPGPKGFPGPQGA-PGLSGSDGHKGRPGTPGTA--EIPGPPGFRGDMG 728

Query:   321 -PGYDASKAPSYDPTKGPSYDPA----KG-PGYDPTKGP-GYDAQKG-SNYDAQRGPNYD 372
              PG+   K  S     GP   P     KG PG DP  G  G   ++G S     +GP  D
Sbjct:   729 DPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPG-DPAFGHLGPPGKRGLSGVPGIKGPRGD 787

Query:   373 IHRG-PSYDPQRGL-GYDMQRGPN-YDMQRG-PGYETQRVPGYDVQRG-PVYEAQRA-PS 426
                G P  +   G+ G+   +GP   +   G PG      PG+  +RG P    Q   P 
Sbjct:   788 --PGCPGAEGPAGIPGFLGLKGPKGREGHAGFPGVPGP--PGHSCERGAPGIPGQPGLPG 843

Query:   427 YIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVP--YGSA 484
             Y P   PG    +GQ  D+   P   P+   G  G P     HG  PP L  +P  +G  
Sbjct:   844 Y-PG-SPGAPGGKGQPGDV--GPP-GPAGMKGLPGLPGRPGAHG--PPGLPGIPGPFGDD 896

Query:   485 TPPARSGSGQPRG 497
               P   G   PRG
Sbjct:   897 GLPGPPGPKGPRG 909

 Score = 130 (50.8 bits), Expect = 0.00017, P = 0.00017
 Identities = 81/260 (31%), Positives = 104/260 (40%)

Query:   262 EDGY-GVPQGHGPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEAS 318
             E G+ GVP GH  P      G+ G  G   S +     + G P    +D P GP G+   
Sbjct:   640 ERGHPGVP-GH--PGVRGPDGLKGQKGDTISCNVTYPGRHGPP---GFDGPPGPKGFPGP 693

Query:   319 KG-PGYDASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGP----NY 371
             +G PG   S      P T G +  P   PG+    G PG+  +KGS+     GP      
Sbjct:   694 QGAPGLSGSDGHKGRPGTPGTAEIPGP-PGFRGDMGDPGFGGEKGSSPVGPPGPPGSPGV 752

Query:   372 DIHRGPSYDPQRG-LGYDMQRG----PNYDMQRG----PGYETQR-VPGYDVQRGPVYEA 421
             +  +G   DP  G LG   +RG    P     RG    PG E    +PG+   +GP    
Sbjct:   753 NGQKGIPGDPAFGHLGPPGKRGLSGVPGIKGPRGDPGCPGAEGPAGIPGFLGLKGPKGRE 812

Query:   422 QRA--PSYIPQRGPGYDLQRGQ-GYDMRRA-PSYDPSRGTGFDGAPRGAAPHGQVPPPLN 477
               A  P  +P   PG+  +RG  G   +   P Y P    G  GAP G    G V PP  
Sbjct:   813 GHAGFPG-VPGP-PGHSCERGAPGIPGQPGLPGY-P----GSPGAPGGKGQPGDVGPPGP 865

Query:   478 NVPYGSATPPARSGSGQPRG 497
                 G    P R G+  P G
Sbjct:   866 AGMKGLPGLPGRPGAHGPPG 885

 Score = 123 (48.4 bits), Expect = 0.00099, P = 0.00099
 Identities = 81/280 (28%), Positives = 104/280 (37%)

Query:   242 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGT 301
             GA+G  +    G PVG    +   G P   G  P     G  G  P    S+     +G 
Sbjct:  1190 GASGLHDVGPPG-PVGIPGLKGERGDPGSPGISPPGPR-GKKGP-PGPPGSSGPPGPAGA 1246

Query:   302 PMRAAYDIPRGPGYEASKGP-GYDASK-AP-------SYDPTKGPSYD-----PAKGPGY 347
               RA  DIP  PG    +GP G D  + AP       S D  +G   D     P   PG 
Sbjct:  1247 TGRAPKDIP-DPGPPGDQGPPGPDGPRGAPGPPGLPGSVDLLRGEPGDCGLPGPPGPPG- 1304

Query:   348 DPTKGPGYDAQKGSN-YDAQRGP-NYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRGP-GY 403
              P   PGY    G +  D Q+GP  +   +GP   P    G   ++G P    ++GP G 
Sbjct:  1305 -PPGPPGYKGFPGCDGKDGQKGPVGFPGPQGPHGFP----GPPGEKGLPGPPGRKGPTGL 1359

Query:   404 ETQRVPGYDVQRGP-VYEAQRAPSYI-PQRGPGYDLQRG-QGYDMRRAPSYDPSRGTGFD 460
               +  P  DV   P +     AP    P+   G    RG  G   +  P  D  RG   D
Sbjct:  1360 PGEPGPPADVDDCPRIPGLPGAPGMRGPEGAMGLPGMRGPSGPGCKGEPGLDGRRGV--D 1417

Query:   461 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNP 500
             G P    P G+      +   G   PP   G   P+G  P
Sbjct:  1418 GVPGSPGPPGRKGDTGEDGYPGGPGPPGPIGDPGPKGFGP 1457


>UNIPROTKB|P53420 [details] [associations]
            symbol:COL4A4 "Collagen alpha-4(IV) chain" species:9606
            "Homo sapiens" [GO:0005587 "collagen type IV" evidence=IDA]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IMP] [GO:0032836 "glomerular basement membrane
            development" evidence=IMP] [GO:0005605 "basal lamina" evidence=IDA]
            [GO:0005576 "extracellular region" evidence=TAS] [GO:0005788
            "endoplasmic reticulum lumen" evidence=TAS] [GO:0007411 "axon
            guidance" evidence=TAS] [GO:0030198 "extracellular matrix
            organization" evidence=TAS] InterPro:IPR001442 Pfam:PF01413
            PROSITE:PS51403 SMART:SM00111 Reactome:REACT_118779
            Reactome:REACT_111045 Reactome:REACT_111102 GO:GO:0007411
            InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005788 GO:GO:0005605
            InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793 GO:GO:0005201
            HOVERGEN:HBG004933 HOGENOM:HOG000085652 GO:GO:0005587
            Gene3D:2.170.240.10 KO:K06237 OrthoDB:EOG4XGZZF EMBL:AC079235
            EMBL:AB008496 MIM:141200 MIM:203780 Orphanet:88919 Orphanet:97562
            GO:GO:0032836 EMBL:X81053 EMBL:Y17397 EMBL:Y17398 EMBL:Y17399
            EMBL:Y17400 EMBL:Y17401 EMBL:Y17402 EMBL:Y17403 EMBL:Y17404
            EMBL:Y17405 EMBL:Y17406 EMBL:Y17407 EMBL:Y17408 EMBL:Y17409
            EMBL:Y17410 EMBL:Y17411 EMBL:Y17412 EMBL:Y17413 EMBL:Y17427
            EMBL:Y17426 EMBL:Y17414 EMBL:Y17415 EMBL:Y17416 EMBL:Y17417
            EMBL:Y17418 EMBL:Y17419 EMBL:Y17420 EMBL:Y17443 EMBL:Y17442
            EMBL:Y17441 EMBL:Y17440 EMBL:Y17439 EMBL:Y17438 EMBL:Y17437
            EMBL:Y17436 EMBL:Y17435 EMBL:Y17434 EMBL:Y17433 EMBL:Y17432
            EMBL:Y17431 EMBL:Y17430 EMBL:Y17429 EMBL:Y17428 EMBL:Y17421
            EMBL:Y17422 EMBL:Y17423 EMBL:Y17424 EMBL:Y17425 EMBL:AC073149
            EMBL:D17391 IPI:IPI00478572 PIR:A55360 RefSeq:NP_000083.3
            UniGene:Hs.591645 ProteinModelPortal:P53420 SMR:P53420
            IntAct:P53420 STRING:P53420 PhosphoSite:P53420 DMDM:259016360
            PaxDb:P53420 PRIDE:P53420 Ensembl:ENST00000396625 GeneID:1286
            KEGG:hsa:1286 UCSC:uc021vxr.1 CTD:1286 GeneCards:GC02M227867
            H-InvDB:HIX0030014 HGNC:HGNC:2206 MIM:120131 neXtProt:NX_P53420
            PharmGKB:PA26721 InParanoid:P53420 OMA:FRGDMGD ChiTaRS:COL4A4
            GenomeRNAi:1286 NextBio:5201 Bgee:P53420 CleanEx:HS_COL4A4
            Genevestigator:P53420 GermOnline:ENSG00000081052 Uniprot:P53420
        Length = 1690

 Score = 153 (58.9 bits), Expect = 5.5e-07, P = 5.5e-07
 Identities = 81/253 (32%), Positives = 101/253 (39%)

Query:   261 YEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKG 320
             Y   +G P   GPP      G  GA P  S S     + GTP  A  +IP  PG+    G
Sbjct:   672 YPGRHGPPGFDGPPGPKGFPGPQGA-PGLSGSDGHKGRPGTPGTA--EIPGPPGFRGDMG 728

Query:   321 -PGYDASKAPSYDPTKGPSYDPA----KG-PGYDPTKGP-GYDAQKG-SNYDAQRGPNYD 372
              PG+   K  S     GP   P     KG PG DP  G  G   ++G S     +GP  D
Sbjct:   729 DPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPG-DPAFGHLGPPGKRGLSGVPGIKGPRGD 787

Query:   373 IHRG-PSYDPQRGL-GYDMQRGPN-YDMQRG-PGYETQRVPGYDVQRG-PVYEAQRA-PS 426
                G P  +   G+ G+   +GP   +   G PG      PG+  +RG P    Q   P 
Sbjct:   788 --PGCPGAEGPAGIPGFLGLKGPKGREGHAGFPGVPGP--PGHSCERGAPGIPGQPGLPG 843

Query:   427 YIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVP--YGSA 484
             Y P   PG    +GQ  D+   P   P+   G  G P     HG  PP L  +P  +G  
Sbjct:   844 Y-PG-SPGAPGGKGQPGDV--GPP-GPAGMKGLPGLPGRPGAHG--PPGLPGIPGPFGDD 896

Query:   485 TPPARSGSGQPRG 497
               P   G   PRG
Sbjct:   897 GLPGPPGPKGPRG 909

 Score = 130 (50.8 bits), Expect = 0.00017, P = 0.00017
 Identities = 81/260 (31%), Positives = 104/260 (40%)

Query:   262 EDGY-GVPQGHGPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEAS 318
             E G+ GVP GH  P      G+ G  G   S +     + G P    +D P GP G+   
Sbjct:   640 ERGHPGVP-GH--PGVRGPDGLKGQKGDTISCNVTYPGRHGPP---GFDGPPGPKGFPGP 693

Query:   319 KG-PGYDASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGP----NY 371
             +G PG   S      P T G +  P   PG+    G PG+  +KGS+     GP      
Sbjct:   694 QGAPGLSGSDGHKGRPGTPGTAEIPGP-PGFRGDMGDPGFGGEKGSSPVGPPGPPGSPGV 752

Query:   372 DIHRGPSYDPQRG-LGYDMQRG----PNYDMQRG----PGYETQR-VPGYDVQRGPVYEA 421
             +  +G   DP  G LG   +RG    P     RG    PG E    +PG+   +GP    
Sbjct:   753 NGQKGIPGDPAFGHLGPPGKRGLSGVPGIKGPRGDPGCPGAEGPAGIPGFLGLKGPKGRE 812

Query:   422 QRA--PSYIPQRGPGYDLQRGQ-GYDMRRA-PSYDPSRGTGFDGAPRGAAPHGQVPPPLN 477
               A  P  +P   PG+  +RG  G   +   P Y P    G  GAP G    G V PP  
Sbjct:   813 GHAGFPG-VPGP-PGHSCERGAPGIPGQPGLPGY-P----GSPGAPGGKGQPGDVGPPGP 865

Query:   478 NVPYGSATPPARSGSGQPRG 497
                 G    P R G+  P G
Sbjct:   866 AGMKGLPGLPGRPGAHGPPG 885


>UNIPROTKB|D4ADB1
            symbol:D4ADB1 "Uncharacterized protein" species:10116
            "Rattus norvegicus" [GO:0008270 "zinc ion binding" evidence=IEA]
            Pfam:PF00595 Pfam:PF00412 InterPro:IPR001478 InterPro:IPR001781
            PROSITE:PS00478 PROSITE:PS50023 PROSITE:PS50106 SMART:SM00132
            SMART:SM00228 GO:GO:0046872 GO:GO:0008270 Gene3D:2.10.110.10
            SUPFAM:SSF50156 InterPro:IPR006643 SMART:SM00735 IPI:IPI00951885
            PRIDE:D4ADB1 Ensembl:ENSRNOT00000043713 ArrayExpress:D4ADB1
            Uniprot:D4ADB1
        Length = 684

 Score = 148 (57.2 bits), Expect = 6.3e-07, P = 6.3e-07
 Identities = 50/182 (27%), Positives = 70/182 (38%)

Query:   251 TSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIP 310
             TS  P    +Y +G   P    P P   T   +   P+      A+  S +P  A Y  P
Sbjct:   331 TSPAPAAHTSYSEGPAAP---APKPRVVTTASIR--PSVYQPVPASPYSPSP-GANYS-P 383

Query:   311 RGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPN 370
               P Y  S  P Y  S AP+Y P+  P+Y P+  P Y P+  P Y+    + Y    GP+
Sbjct:   384 T-P-YTPSPAPAYTPSPAPTYTPSPAPTYSPSPAPAYTPSPAPNYNPTPSAAYSG--GPS 439

Query:   371 YDIHRGP-----SYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVPGYDVQRGPVYEAQRA 424
                 R P     S+  +   G          + RG P Y         + RG    A+R 
Sbjct:   440 ESASRPPWVTDDSFSQKFAPGKSTTSVSKQTLPRGAPAYNPTGPQVTPLARGTFQRAERF 499

Query:   425 PS 426
             P+
Sbjct:   500 PA 501


>FB|FBgn0035872
            symbol:CG7185 species:7227 "Drosophila melanogaster"
            [GO:0003729 "mRNA binding" evidence=ISS] [GO:0006379 "mRNA
            cleavage" evidence=ISS] [GO:0005849 "mRNA cleavage factor complex"
            evidence=ISS] [GO:0000166 "nucleotide binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=IC] [GO:0000381 "regulation of alternative mRNA
            splicing, via spliceosome" evidence=IMP] InterPro:IPR000504
            InterPro:IPR012677 PROSITE:PS50102 SMART:SM00360 EMBL:AE014296
            GO:GO:0000166 GO:GO:0003729 Gene3D:3.30.70.330 GO:GO:0000381
            GO:GO:0006379 GO:GO:0005849 eggNOG:NOG313287 KO:K14398
            GeneTree:ENSGT00690000101901 EMBL:AY058563 RefSeq:NP_648206.1
            UniGene:Dm.887 ProteinModelPortal:Q9VSH4 SMR:Q9VSH4 IntAct:Q9VSH4
            MINT:MINT-1562127 STRING:Q9VSH4 PaxDb:Q9VSH4
            EnsemblMetazoa:FBtr0076710 GeneID:38937 KEGG:dme:Dmel_CG7185
            UCSC:CG7185-RA FlyBase:FBgn0035872 InParanoid:Q9VSH4 OMA:PYERGDY
            OrthoDB:EOG4S1RQ4 PhylomeDB:Q9VSH4 ChiTaRS:CG7185 GenomeRNAi:38937
            NextBio:811101 Bgee:Q9VSH4 Uniprot:Q9VSH4
        Length = 652

 Score = 141 (54.7 bits), Expect = 6.6e-07, Sum P(2) = 6.6e-07
 Identities = 63/199 (31%), Positives = 79/199 (39%)

Query:   310 PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPG-YDAQKGSNYDAQRG 368
             PRGP    S G G   +  P      GP   P +G   +    PG Y  Q  S      G
Sbjct:   197 PRGPA-PPSMGGGPMPTGHPGGPQGGGPPGHPPRG--MNSIMQPGQYRPQHMSQVPQVGG 253

Query:   369 PNYDIHR-GPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSY 427
             PN    R  P   PQ GL  + Q  P Y   +G  +  QR PG   + GP     + P +
Sbjct:   254 PNSGPPRMQPPMHPQGGLMGNQQPPPRYPSAQGQ-WPGQR-PG-GPRPGPPNGPPQRPMF 310

Query:   428 IPQRGP-GYDLQRGQGYDMRRAPSYD--PSRGT--GFDGAPRGAAPHGQVPPPLNNVPYG 482
               Q GP G  ++   G D RR P +   P +G   G   AP    PHG   P +N   + 
Sbjct:   311 --QGGPMGMPVRGPAGPDWRRPPMHGGFPPQGPPRGLPPAPGPGGPHGAPAPHVNPAFFN 368

Query:   483 SATPPARS-GSGQPRGGNP 500
                 PA+  G G P  G P
Sbjct:   369 QPGGPAQHPGMGGPPHGAP 387

 Score = 112 (44.5 bits), Expect = 0.00091, Sum P(2) = 0.00091
 Identities = 53/171 (30%), Positives = 61/171 (35%)

Query:   333 PTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG 392
             P +GP+  P+ G G  PT  PG     G      RG N  +  G  Y PQ         G
Sbjct:   196 PPRGPA-PPSMGGGPMPTGHPGGPQGGGPPGHPPRGMNSIMQPG-QYRPQHMSQVPQVGG 253

Query:   393 PNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG--PGYDLQRGQGYDMRRAPS 450
             PN     GP    +  P    Q G +   Q  P Y   +G  PG   QR  G   R  P 
Sbjct:   254 PN----SGP---PRMQPPMHPQGGLMGNQQPPPRYPSAQGQWPG---QRPGG--PRPGPP 301

Query:   451 YDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPA 501
               P +   F G P G    G   P     P     PP     G PRG  PA
Sbjct:   302 NGPPQRPMFQGGPMGMPVRGPAGPDWRRPPMHGGFPP----QGPPRGLPPA 348

 Score = 52 (23.4 bits), Expect = 6.6e-07, Sum P(2) = 6.6e-07
 Identities = 24/76 (31%), Positives = 30/76 (39%)

Query:   245 GNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMR 304
             G +++E  G   G + Y+D  G     GP  SA + G  G G   S    A   SG P  
Sbjct:    19 GQAQDEFGGD--GVDLYDD-IG-----GPTESAASGG--GGGGTPSADGAAGPGSGEPGE 68

Query:   305 AAYDIPRGPGYEASKG 320
                  P G  Y  S G
Sbjct:    69 RNSGGPNGV-YHQSSG 83

 Score = 41 (19.5 bits), Expect = 8.8e-06, Sum P(2) = 8.8e-06
 Identities = 9/22 (40%), Positives = 11/22 (50%)

Query:   236 ADGSYGGATGNSENETSGRPVG 257
             ADG+ G  +G      SG P G
Sbjct:    55 ADGAAGPGSGEPGERNSGGPNG 76


>TAIR|locus:2012713
            symbol:AT1G33680 "AT1G33680" species:3702 "Arabidopsis
            thaliana" [GO:0003676 "nucleic acid binding" evidence=ISS]
            [GO:0003723 "RNA binding" evidence=IEA] [GO:0005634 "nucleus"
            evidence=ISM] [GO:0008150 "biological_process" evidence=ND]
            [GO:0005829 "cytosol" evidence=IDA] InterPro:IPR004087
            InterPro:IPR004088 Pfam:PF13014 PROSITE:PS50084 SMART:SM00322
            EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0005829 GO:GO:0003723
            eggNOG:NOG300923 KO:K13210 UniGene:At.39892 UniGene:At.71035
            HOGENOM:HOG000242545 EMBL:AK229850 EMBL:AK229909 EMBL:AK230055
            IPI:IPI00786006 RefSeq:NP_174629.3 ProteinModelPortal:Q0WLY0
            SMR:Q0WLY0 STRING:Q0WLY0 PaxDb:Q0WLY0 PRIDE:Q0WLY0
            EnsemblPlants:AT1G33680.1 GeneID:840259 KEGG:ath:AT1G33680
            TAIR:At1g33680 InParanoid:Q0WLY0 OMA:PSYGSTP PhylomeDB:Q0WLY0
            ProtClustDB:CLSN2690290 Genevestigator:Q0WLY0 Uniprot:Q0WLY0
        Length = 763

 Score = 144 (55.7 bits), Expect = 9.6e-07, Sum P(2) = 9.6e-07
 Identities = 65/233 (27%), Positives = 82/233 (35%)

Query:   240 YGGATGNSENETSGRPVG-QNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQ 298
             Y  A G  + +   RP G Q + E GYG P+   PP      G   A P+  ++  AA+ 
Sbjct:   537 YPSAGGQHQMQQPSRPYGMQGSAEQGYGPPRPAAPPGDVPYQGPTPAAPSYGSTPAAASY 596

Query:   299 SGTPMRAAY-DIPRGPGYEASKGP----GYDASKAPSYDPTKGPSYDPAK-GPGYD---- 348
               TP   +Y   P  P Y ++       GY AS AP+      PSY  A    GY+    
Sbjct:   597 GSTPAAPSYGSTPAAPSYGSNMAQQQQYGY-ASSAPTQQTY--PSYSSAAPSDGYNGTQP 653

Query:   349 PTKGPGYD---AQKGSNYDAQRG------PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQR 399
             P   P Y+   AQ  S      G      P       PS  P  G     Q   NY    
Sbjct:   654 PAVAPAYEQHGAQPASGVQQTSGGYGQVPPTGGYSSYPSTQPAYG-NTPAQSNGNY---- 708

Query:   400 GPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGP---GYDLQRGQGYDMRRAP 449
               GY   + P Y       Y A    +   Q  P   GY+    Q      AP
Sbjct:   709 --GYIGSQYPSYGGGNASAYAAPTGQTAYSQTAPPQAGYEQSATQSAGYAAAP 759

 Score = 49 (22.3 bits), Expect = 9.6e-07, Sum P(2) = 9.6e-07
 Identities = 10/19 (52%), Positives = 11/19 (57%)

Query:    29 SGMRPPMPGAFPPFDMMPP 47
             S  RPP  G +PP   MPP
Sbjct:   444 SHFRPPNSGGYPP-QHMPP 461


>UNIPROTKB|P02457
            symbol:COL1A1 "Collagen alpha-1(I) chain" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            [GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000885
            InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410 ProDom:PD002078
            PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038
            SMART:SM00214 GO:GO:0046872 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 HOVERGEN:HBG004933 EMBL:M17839
            EMBL:M17838 EMBL:V00401 EMBL:M10571 EMBL:M17607 IPI:IPI00572548
            PIR:A27179 PIR:A90458 PIR:I50629 PIR:S07234 UniGene:Gga.2073
            UniGene:Gga.43371 IntAct:P02457 PRIDE:P02457 Uniprot:P02457
        Length = 1453

 Score = 149 (57.5 bits), Expect = 1.3e-06, P = 1.3e-06
 Identities = 90/285 (31%), Positives = 109/285 (38%)

Query:   236 ADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNTSTS 292
             ADG  G  G TG++  +    P G  A   G   P G  G P      G   AGP  +T 
Sbjct:   808 ADGQPGAKGETGDAGAKGDAGPPGP-AGPTGAPGPAGZVGAPGPKGARG--SAGPPGATG 864

Query:   293 AYAATQSGTPMRAAYDI----PRGP-GYEASKGPGYDASKA--PSYDPTKGPSYDPA-KG 344
                A     P   + +I    P GP G + SKGP  +   A  P      GP   P  KG
Sbjct:   865 FPGAAGRVGPPGPSGNIGLPGPPGPAGKZGSKGPRGETGPAGRPGEPGPAGPPGPPGEKG 924

Query:   345 -PGYD-P-----TKGP-GYDAQKGS-NYDAQRGPN-YDIHRGPSYDP-QRG-LGYDMQRG 392
              PG D P     T GP G   Q+G      QRG   +    GPS +P ++G  G   +RG
Sbjct:   925 SPGADGPIGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQGPSGASGERG 984

Query:   393 PNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYD 452
             P   M  GP       PG     GP  EA R  +   +  PG D   G   D        
Sbjct:   985 PPGPM--GP-------PGL---AGPPGEAGREGAPGAEGAPGRDGAAGPKGDRGETGPAG 1032

Query:   453 PSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 497
             P    G  GAP    P G+        P G A PP  +G+  P G
Sbjct:  1033 PPGAPGAPGAPGPVGPAGKNGDRGETGPAGPAGPPGPAGARGPAG 1077


>UNIPROTKB|G4N3H5
            symbol:MGG_04961 "Uncharacterized protein" species:242507
            "Magnaporthe oryzae 70-15" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0008150 "biological_process" evidence=ND] EMBL:CM001233
            RefSeq:XP_003712457.1 EnsemblFungi:MGG_04961T0 GeneID:2675293
            KEGG:mgr:MGG_04961 Uniprot:G4N3H5
        Length = 616

 Score = 144 (55.7 bits), Expect = 1.5e-06, P = 1.5e-06
 Identities = 61/185 (32%), Positives = 80/185 (43%)

Query:   230 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNT 289
             P   R   G       + ++ +SGR     +     G P G   PP + TA +   GP+ 
Sbjct:   445 PGYQRNQPGGPPSRFDSYDDYSSGRASPAPSMYPSRG-PGGPNMPPRSATAPIPPRGPD- 502

Query:   290 STSAYAATQSG--TPMRAAYDIPRGPGYEASKGPGYDASKAPS-YDPTKGPSYDPAKGPG 346
                AY    +G  +P  + Y  PRGPG     GP   AS APS Y+P + P    A GP 
Sbjct:   503 ---AYDDYSNGRASPAPSMYP-PRGPG-----GPNGRASPAPSMYNPPRAPPQRSATGPM 553

Query:   347 YDPTKGPGYDAQKGSNYDAQRGPN--YDIHRGP----SYDPQRGLGYDMQRGPNYDM--Q 398
               P +GPG+  Q+     A  GP+  YD +  P    S  P RG       G N D+  Q
Sbjct:   554 --PPRGPGFPPQRNMTAPAP-GPDDPYDYNTRPPTSSSQAPPRGA---FGNGWNSDLENQ 607

Query:   399 RG-PG 402
             RG PG
Sbjct:   608 RGGPG 612

 Score = 128 (50.1 bits), Expect = 8.3e-05, P = 8.3e-05
 Identities = 81/289 (28%), Positives = 97/289 (33%)

Query:   223 RAELMNA--PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTA 280
             RA+ M    P   R   G+ G    NS ++    P  Q      Y   Q     P    A
Sbjct:   332 RADTMTTLPPYASR--PGTPGSIELNSLDQKRPMPSRQGTMNSSYSSRQ-----PLVGAA 384

Query:   281 GVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYD 340
                G   + + S  +   SG        + R     +S    Y AS AP    T  P+  
Sbjct:   385 AEFGRSASPAPSIPSTNYSGRTYGGQPPMSRMQSNASSMSRAYTASPAPFSSDTV-PAL- 442

Query:   341 PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 400
             P   PGY   + PG    +  +YD            PS  P RG G     GPN   +  
Sbjct:   443 PR--PGYQRNQ-PGGPPSRFDSYDDYSSGRAS--PAPSMYPSRGPG-----GPNMPPRSA 492

Query:   401 PGYETQRVP-GYD-VQRGPVYEAQRAPSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGT 457
                   R P  YD    G    A  APS  P RGPG    R      M   P   P R  
Sbjct:   493 TAPIPPRGPDAYDDYSNG---RASPAPSMYPPRGPGGPNGRASPAPSMYNPPRAPPQRSA 549

Query:   458 GFDGAPRGAA--PHGQV--PPPLNNVPYGSAT-PPARSGSGQPRG--GN 499
                  PRG    P   +  P P  + PY   T PP  S    PRG  GN
Sbjct:   550 TGPMPPRGPGFPPQRNMTAPAPGPDDPYDYNTRPPTSSSQAPPRGAFGN 598


>WB|WBGene00004203
            symbol:swsn-1 species:6239 "Caenorhabditis elegans"
            [GO:0003682 "chromatin binding" evidence=IEA] [GO:0000003
            "reproduction" evidence=IGI;IMP] [GO:0040035 "hermaphrodite
            genitalia development" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IGI;IMP] [GO:0009792 "embryo development ending in birth
            or egg hatching" evidence=IGI;IMP] [GO:0040018 "positive regulation
            of multicellular organism growth" evidence=IGI;IMP] [GO:0040010
            "positive regulation of growth rate" evidence=IMP] [GO:0040027
            "negative regulation of vulval development" evidence=IMP]
            [GO:0046662 "regulation of oviposition" evidence=IMP] [GO:0002009
            "morphogenesis of an epithelium" evidence=IMP] [GO:0035262 "gonad
            morphogenesis" evidence=IMP] InterPro:IPR001005 InterPro:IPR007526
            InterPro:IPR009057 Pfam:PF00249 Pfam:PF04433 PROSITE:PS50934
            SMART:SM00717 GO:GO:0005634 GO:GO:0009792 GO:GO:0002009
            GO:GO:0040007 GO:GO:0040010 GO:GO:0002119 GO:GO:0003677
            GO:GO:0040018 Gene3D:1.10.10.10 InterPro:IPR011991 GO:GO:0003682
            Gene3D:1.10.10.60 SUPFAM:SSF46689 GO:GO:0046662 GO:GO:0040035
            InterPro:IPR017884 PROSITE:PS51293 GO:GO:0040027 GO:GO:0035262
            EMBL:AL110477 KO:K11649 UniGene:Cel.7072 GeneID:180324
            KEGG:cel:CELE_Y113G7B.23 CTD:180324 RefSeq:NP_001256907.1
            ProteinModelPortal:H8ESF3 SMR:H8ESF3 WormBase:Y113G7B.23c
            Uniprot:H8ESF3
        Length = 792

 Score = 145 (56.1 bits), Expect = 1.6e-06, P = 1.6e-06
 Identities = 86/316 (27%), Positives = 123/316 (38%)

Query:   201 HLESL-QVMEKNYITMATEVEKLRAELMNAPNVDRRADGSYGGATGNSENETSGRPVGQN 259
             H + L Q+M+K   ++  +  +L  E   A ++D+     Y      +++E   R     
Sbjct:   493 HFDELEQIMDKERESLEYQRHQLILE-RQAFHMDQL---KY--LENRAKHEAHSRMTSSG 546

Query:   260 AYEDGYGVPQGH---GPP---PSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGP 313
             A   G  +P G    GPP   P    +    A P    ++ AAT +  P  +    P+ P
Sbjct:   547 ALPAG--LPPGFEVTGPPQPTPQVQISAQEAAIPEKMDTSEAATAARPP--STPQAPQAP 602

Query:   314 GYEASKGP--GYDASKAP--SYDPTKGPSYDPAKGPGYDPTKGPGYDA----QKGSNYDA 365
               +A+  P     A +AP  +Y    GP   P +   Y P +G  Y      Q+   + A
Sbjct:   603 PVQAAPAPVQAPQAPQAPPQAYQGYGGPGGPPQQAYRYPPQQGQQYSPYPPPQQQQQHQA 662

Query:   366 QRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRG-PVYEAQRA 424
             Q+  +   H GP    Q G     Q    Y     PG       GY  Q+  P Y+AQ  
Sbjct:   663 QQAQS-QAHYGPPGGGQ-GPPPPPQGQQYYGGPPPPGQPYGPPGGYPPQQQRPPYQAQPY 720

Query:   425 PSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSA 484
             P   P   P    QRG GY     P   P     F G P    P+GQ+PPP    P+G  
Sbjct:   721 PG--P---PPPQQQRGYGYP----PPPQPV----FSGHPY-QQPYGQMPPP----PHGQY 762

Query:   485 TPPARSGSGQ-PRGGN 499
              P  + G    P GG+
Sbjct:   763 QPQQQQGGPMGPPGGH 778


>UNIPROTKB|Q96QC0
            symbol:PPP1R10 "Serine/threonine-protein phosphatase 1
            regulatory subunit 10" species:9606 "Homo sapiens" [GO:0006351
            "transcription, DNA-dependent" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA]
            [GO:0003723 "RNA binding" evidence=IEA] [GO:0004864 "protein
            phosphatase inhibitor activity" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0072357 "PTW/PP1 phosphatase complex"
            evidence=ISS] [GO:0000785 "chromatin" evidence=ISS] [GO:0006606
            "protein import into nucleus" evidence=TAS] InterPro:IPR000571
            InterPro:IPR003617 InterPro:IPR017923 Pfam:PF00642 Pfam:PF08711
            PROSITE:PS50103 PROSITE:PS51319 SMART:SM00356 SMART:SM00509
            GO:GO:0005634 EMBL:BA000025 GO:GO:0046872 GO:GO:0003677
            GO:GO:0008270 GO:GO:0000785 GO:GO:0006351 GO:GO:0003723
            EMBL:AL662800 EMBL:AL662825 GO:GO:0000790 GO:GO:0006606
            GO:GO:0004864 Gene3D:1.20.930.10 SUPFAM:SSF47676 GO:GO:0072357
            EMBL:Y13247 EMBL:AJ544537 EMBL:AB088097 EMBL:BX248507
            IPI:IPI00298731 PIR:JE0291 RefSeq:NP_002705.2 UniGene:Hs.106019
            ProteinModelPortal:Q96QC0 SMR:Q96QC0 DIP:DIP-39343N IntAct:Q96QC0
            MINT:MINT-1197376 STRING:Q96QC0 PhosphoSite:Q96QC0 DMDM:61214507
            PaxDb:Q96QC0 PeptideAtlas:Q96QC0 PRIDE:Q96QC0
            Ensembl:ENST00000376511 Ensembl:ENST00000383586
            Ensembl:ENST00000420949 Ensembl:ENST00000424446
            Ensembl:ENST00000426299 Ensembl:ENST00000429597
            Ensembl:ENST00000449113 GeneID:5514 KEGG:hsa:5514 UCSC:uc003nqn.1
            CTD:5514 GeneCards:GC06M030568 H-InvDB:HIX0165052
            H-InvDB:HIX0166290 H-InvDB:HIX0166579 H-InvDB:HIX0166833
            H-InvDB:HIX0167082 H-InvDB:HIX0167322 H-InvDB:HIX0167569
            HGNC:HGNC:9284 HPA:CAB025501 MIM:603771 neXtProt:NX_Q96QC0
            PharmGKB:PA33612 eggNOG:NOG69306 HOGENOM:HOG000049285
            HOVERGEN:HBG053646 InParanoid:Q96QC0 OMA:PPPHEHR OrthoDB:EOG451DQK
            PhylomeDB:Q96QC0 ChiTaRS:PPP1R10 GenomeRNAi:5514 NextBio:21326
            ArrayExpress:Q96QC0 Bgee:Q96QC0 CleanEx:HS_PPP1R10
            Genevestigator:Q96QC0 GermOnline:ENSG00000204569 Uniprot:Q96QC0
        Length = 940

 Score = 145 (56.1 bits), Expect = 2.0e-06, P = 2.0e-06
 Identities = 63/248 (25%), Positives = 83/248 (33%)

Query:   238 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQ-------GHGPPPSATTAGVVGAGPNTS 290
             G  GG  G         P G + + DG G P        G GP P     G  G G N  
Sbjct:   656 GGPGGPVGPRLLGPPPPPRGGDPFWDGPGDPMRGGPMRGGPGPGPGPYHRGRGGRGGNEP 715

Query:   291 TSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPT 350
                    +     R+    P G G     GPG        + P +GP        G+ P 
Sbjct:   716 PPPPPPFRGARGGRSGGGPPNGRG-----GPGGGMVGGGGHRPHEGPGGGMGNSSGHRPH 770

Query:   351 KGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPG 410
             +GPG     GS +    GP   +  G  + P  G G  +  G  +    GPG       G
Sbjct:   771 EGPG--GGMGSGHRPHEGPGGSMGGGGGHRPHEGPGGGISGGSGHRPHEGPGGGMGAGGG 828

Query:   411 YDVQRGPVYEAQRAPSYIPQRGPGY---------DLQRGQGYDMRRAPSYDPSRGTGFDG 461
             +    GP      +  + P  GPG+         D+   +G+D R  P   P    G DG
Sbjct:   829 HRPHEGPGGSMGGSGGHRPHEGPGHGGPHGHRPHDVPGHRGHDHRGPP---PHEHRGHDG 885

Query:   462 APRGAAPH 469
                G   H
Sbjct:   886 PGHGGGGH 893

 Score = 144 (55.7 bits), Expect = 2.6e-06, P = 2.6e-06
 Identities = 71/268 (26%), Positives = 90/268 (33%)

Query:   253 GRPVGQNAYEDGYG--VPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIP 310
             G P G   +  G G  +P  HG P       ++G  P            G PMR    + 
Sbjct:   635 GGPKGMQHFPPGPGGPMPGPHGGPGGPVGPRLLGPPPPPRGGDPFWDGPGDPMRGG-PMR 693

Query:   311 RGPGYEASKGPG-YD---ASKAPSYDPTKGPSYDPAKG--PGYDPTKG---PGYDAQKGS 361
              GPG     GPG Y      +  +  P   P +  A+G   G  P  G   PG     G 
Sbjct:   694 GGPG----PGPGPYHRGRGGRGGNEPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGG 749

Query:   362 NYDAQRGP-----NYDIHRGPSYDPQRGLG--YDMQRGPNYDMQRGPGYETQRVPGYDVQ 414
              +    GP     N   HR P   P  G+G  +    GP   M  G G+     PG  + 
Sbjct:   750 GHRPHEGPGGGMGNSSGHR-PHEGPGGGMGSGHRPHEGPGGSMGGGGGHRPHEGPGGGIS 808

Query:   415 RGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPP 474
              G  +     P      G G+    G G  M  +  + P  G G  G P G  PH  VP 
Sbjct:   809 GGSGHRPHEGPGGGMGAGGGHRPHEGPGGSMGGSGGHRPHEGPGH-GGPHGHRPH-DVPG 866

Query:   475 PLNNVPYGSATPPARSGSGQPRGGNPAR 502
                +   G      R   G   GG   R
Sbjct:   867 HRGHDHRGPPPHEHRGHDGPGHGGGGHR 894

 Score = 130 (50.8 bits), Expect = 8.7e-05, P = 8.7e-05
 Identities = 53/213 (24%), Positives = 72/213 (33%)

Query:   242 GATGNSENETSGRPV-GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSG 300
             G  G +E      P  G      G G P G G P      G  G  P+          SG
Sbjct:   708 GGRGGNEPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGG-GHRPHEGPGGGMGNSSG 766

Query:   301 TPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKG 360
                        G G+   +GPG        + P +GP    + G G+ P +GPG     G
Sbjct:   767 HRPHEGPGGGMGSGHRPHEGPGGSMGGGGGHRPHEGPGGGISGGSGHRPHEGPGGGMGAG 826

Query:   361 SNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPN-YDMQRGPGYETQRVPGYDVQRGPVY 419
               +    GP   +     + P  G G+    G   +D+   PG+      G+D  RGP  
Sbjct:   827 GGHRPHEGPGGSMGGSGGHRPHEGPGHGGPHGHRPHDV---PGHR-----GHD-HRGPPP 877

Query:   420 EAQRA---PSYIPQRGPGYDLQRGQGYDMRRAP 449
                R    P +      G+D     G DM   P
Sbjct:   878 HEHRGHDGPGHGGGGHRGHDGGHSHGGDMSNRP 910


>SGD|S000002299
            symbol:RPO21 "RNA polymerase II largest subunit B220"
            species:4932 "Saccharomyces cerevisiae" [GO:0016740 "transferase
            activity" evidence=IEA] [GO:0016779 "nucleotidyltransferase
            activity" evidence=IEA] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0006366 "transcription from RNA polymerase II
            promoter" evidence=IEA;IMP] [GO:0003899 "DNA-directed RNA
            polymerase activity" evidence=IEA;IDA] [GO:0005739 "mitochondrion"
            evidence=IDA] [GO:0005634 "nucleus" evidence=IEA;IDA] [GO:0005665
            "DNA-directed RNA polymerase II, core complex" evidence=IEA;IDA]
            [GO:0003677 "DNA binding" evidence=IEA] [GO:0003968 "RNA-directed
            RNA polymerase activity" evidence=IDA] [GO:0006351 "transcription,
            DNA-dependent" evidence=IEA] InterPro:IPR000684 InterPro:IPR000722
            InterPro:IPR006592 InterPro:IPR007066 InterPro:IPR007073
            InterPro:IPR007075 InterPro:IPR007080 InterPro:IPR007081
            InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983 Pfam:PF04990
            Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000 Pfam:PF05001
            PROSITE:PS00115 SMART:SM00663 SGD:S000002299 GO:GO:0005739
            GO:GO:0046872 GO:GO:0003677 EMBL:BK006938 GO:GO:0006366
            Gene3D:2.40.40.20 InterPro:IPR009010 EMBL:X96876 EMBL:U27182
            GO:GO:0003899 PDB:4GWQ PDBsum:4GWQ PDB:2LO6 PDBsum:2LO6
            eggNOG:COG0086 GO:GO:0005665 PDB:1I3Q PDB:1I50 PDB:1I6H PDB:1K83
            PDB:1NIK PDB:1NT9 PDB:1PQV PDB:1R5U PDB:1R9S PDB:1R9T PDB:1SFO
            PDB:1TWA PDB:1TWC PDB:1TWF PDB:1TWG PDB:1TWH PDB:1WCM PDB:1Y1V
            PDB:1Y1W PDB:1Y1Y PDB:1Y77 PDB:2B63 PDB:2B8K PDB:2E2H PDB:2E2I
            PDB:2E2J PDB:2JA5 PDB:2JA6 PDB:2JA7 PDB:2JA8 PDB:2NVQ PDB:2NVT
            PDB:2NVX PDB:2NVY PDB:2NVZ PDB:2R7Z PDB:2R92 PDB:2R93 PDB:2VUM
            PDB:2YU9 PDB:3CQZ PDB:3FKI PDB:3GTG PDB:3GTJ PDB:3GTK PDB:3GTL
            PDB:3GTM PDB:3GTO PDB:3GTP PDB:3GTQ PDB:3H3V PDB:3HOU PDB:3HOV
            PDB:3HOW PDB:3HOX PDB:3HOY PDB:3HOZ PDB:3I4M PDB:3I4N PDB:3K1F
            PDB:3K7A PDB:3M3Y PDB:3M4O PDB:3PO2 PDB:3PO3 PDB:3QT1 PDB:3RZD
            PDB:3RZO PDB:3S14 PDB:3S15 PDB:3S16 PDB:3S17 PDB:3S1M PDB:3S1N
            PDB:3S1Q PDB:3S1R PDB:3S2D PDB:3S2H PDB:4A3B PDB:4A3C PDB:4A3D
            PDB:4A3E PDB:4A3F PDB:4A3G PDB:4A3I PDB:4A3J PDB:4A3K PDB:4A3L
            PDB:4A3M PDB:4A93 PDB:4BBR PDB:4BBS PDBsum:1I3Q PDBsum:1I50
            PDBsum:1I6H PDBsum:1K83 PDBsum:1NIK PDBsum:1NT9 PDBsum:1PQV
            PDBsum:1R5U PDBsum:1R9S PDBsum:1R9T PDBsum:1SFO PDBsum:1TWA
            PDBsum:1TWC PDBsum:1TWF PDBsum:1TWG PDBsum:1TWH PDBsum:1WCM
            PDBsum:1Y1V PDBsum:1Y1W PDBsum:1Y1Y PDBsum:1Y77 PDBsum:2B63
            PDBsum:2B8K PDBsum:2E2H PDBsum:2E2I PDBsum:2E2J PDBsum:2JA5
            PDBsum:2JA6 PDBsum:2JA7 PDBsum:2JA8 PDBsum:2NVQ PDBsum:2NVT
            PDBsum:2NVX PDBsum:2NVY PDBsum:2NVZ PDBsum:2R7Z PDBsum:2R92
            PDBsum:2R93 PDBsum:2VUM PDBsum:2YU9 PDBsum:3CQZ PDBsum:3FKI
            PDBsum:3GTG PDBsum:3GTJ PDBsum:3GTK PDBsum:3GTL PDBsum:3GTM
            PDBsum:3GTO PDBsum:3GTP PDBsum:3GTQ PDBsum:3H3V PDBsum:3HOU
            PDBsum:3HOV PDBsum:3HOW PDBsum:3HOX PDBsum:3HOY PDBsum:3HOZ
            PDBsum:3I4M PDBsum:3I4N PDBsum:3K1F PDBsum:3K7A PDBsum:3M3Y
            PDBsum:3M4O PDBsum:3PO2 PDBsum:3PO3 PDBsum:3QT1 PDBsum:3RZD
            PDBsum:3RZO PDBsum:3S14 PDBsum:3S15 PDBsum:3S16 PDBsum:3S17
            PDBsum:3S1M PDBsum:3S1N PDBsum:3S1Q PDBsum:3S1R PDBsum:3S2D
            PDBsum:3S2H PDBsum:4A3B PDBsum:4A3C PDBsum:4A3D PDBsum:4A3E
            PDBsum:4A3F PDBsum:4A3G PDBsum:4A3I PDBsum:4A3J PDBsum:4A3K
            PDBsum:4A3L PDBsum:4A3M PDBsum:4A93 PDBsum:4BBR PDBsum:4BBS
            HOGENOM:HOG000222975 OMA:KVLPWST KO:K03006 OrthoDB:EOG4J14H5
            EMBL:X03128 EMBL:Z74188 PIR:S67686 RefSeq:NP_010141.1 PDB:2L0I
            PDBsum:2L0I ProteinModelPortal:P04050 SMR:P04050 DIP:DIP-611N
            IntAct:P04050 MINT:MINT-432838 STRING:P04050 PaxDb:P04050
            PeptideAtlas:P04050 EnsemblFungi:YDL140C GeneID:851415
            KEGG:sce:YDL140C CYGD:YDL140c GeneTree:ENSGT00700000105212
            EvolutionaryTrace:P04050 NextBio:968606 ArrayExpress:P04050
            Genevestigator:P04050 GermOnline:YDL140C Uniprot:P04050
        Length = 1733

 Score = 159 (61.0 bits), Expect = 2.2e-06, Sum P(2) = 2.2e-06
 Identities = 67/218 (30%), Positives = 90/218 (41%)

Query:   222 LRAELMNAPNVDRRA-DGSYGGAT--GNSENETSGRPVGQNAYEDGYGVPQGHGPPPSAT 278
             ++ ELM +P VD  + D   GG T  G ++   +  P G  AY          G  P++ 
Sbjct:  1486 VKDELMFSPLVDSGSNDAMAGGFTAYGGADYGEATSPFG--AY----------GEAPTSP 1533

Query:   279 TAGVVGAGPNTSTSAYAATQSG-TPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGP 337
               GV   G + ++  Y+ T    +P   +Y  P  P Y  +  P Y  + +PSY PT  P
Sbjct:  1534 GFGVSSPGFSPTSPTYSPTSPAYSPTSPSYS-PTSPSYSPTS-PSYSPT-SPSYSPTS-P 1589

Query:   338 SYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDM 397
             SY P   P Y PT  P Y     S Y +   P+Y     PSY P     Y     P+Y  
Sbjct:  1590 SYSPTS-PSYSPTS-PSYSPTSPS-Y-SPTSPSYS-PTSPSYSPTSP-SYS-PTSPSYS- 1641

Query:   398 QRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGY 435
                P Y     P Y     P Y +  +PSY P   P Y
Sbjct:  1642 PTSPSYSPTS-PSYS-PTSPAY-SPTSPSYSPT-SPSY 1675

 Score = 38 (18.4 bits), Expect = 2.2e-06, Sum P(2) = 2.2e-06
 Identities = 12/39 (30%), Positives = 16/39 (41%)

Query:    52 EQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHEL 90
             E  + + H+E Q L T     AA     R +L    H L
Sbjct:   870 EDGMDAAHIEKQSLDTIGGSDAAFEKRYRVDLLNTDHTL 908


>UNIPROTKB|G1RSL2
            symbol:COL4A4 "Uncharacterized protein" species:61853
            "Nomascus leucogenys" [GO:0005201 "extracellular matrix structural
            constituent" evidence=ISS] [GO:0005587 "collagen type IV"
            evidence=ISS] [GO:0005605 "basal lamina" evidence=ISS] [GO:0032836
            "glomerular basement membrane development" evidence=ISS]
            InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
            InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005605 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 GO:GO:0005587 Gene3D:2.170.240.10
            GeneTree:ENSGT00690000101772 GO:GO:0032836 OMA:FRGDMGD
            EMBL:ADFV01083072 EMBL:ADFV01083073 EMBL:ADFV01083074
            EMBL:ADFV01083075 EMBL:ADFV01083076 EMBL:ADFV01083077
            EMBL:ADFV01083078 Ensembl:ENSNLET00000017067 Uniprot:G1RSL2
        Length = 1690

 Score = 147 (56.8 bits), Expect = 2.5e-06, P = 2.5e-06
 Identities = 79/253 (31%), Positives = 99/253 (39%)

Query:   261 YEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKG 320
             Y    G P   G P      G  GA P  S S     + GTP     +IP  PG+    G
Sbjct:   671 YPGRQGPPGFDGLPGPKGFPGPQGA-PGLSGSDGHKGRPGTP--GTSEIPGPPGFRGDMG 727

Query:   321 -PGYDASKAPSYDPTKGPSYDPA----KG-PGYDPTKGP-GYDAQKG-SNYDAQRGPNYD 372
              PG+   +  S     GP   P     KG PG DP  GP G   ++G S     +GP  D
Sbjct:   728 DPGFGGERGSSPVGPPGPPGSPGVNGQKGIPG-DPAFGPLGPPGKRGLSGVPGIKGPRGD 786

Query:   373 IHRG-PSYDPQRGL-GYDMQRGPN-YDMQRG-PGYETQRVPGYDVQRG-PVYEAQRA-PS 426
                G P  +   G+ G+   +GP   +   G PG      PG+  +RG P    Q   P 
Sbjct:   787 --PGCPGAEGPAGIPGFPGLKGPKGREGHAGFPGVPGP--PGHSCERGAPGIPGQPGLPG 842

Query:   427 YIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVP--YGSA 484
             Y P   PG    +GQ  D+   P   P+   G  G P     HG  PP L  +P  +G  
Sbjct:   843 Y-PG-SPGAPGGKGQPGDV--GPP-GPAGMKGLPGLPGRPGAHG--PPGLPGIPGPFGDD 895

Query:   485 TPPARSGSGQPRG 497
               P   G   PRG
Sbjct:   896 GLPGPPGPKGPRG 908

 Score = 123 (48.4 bits), Expect = 0.00099, P = 0.00099
 Identities = 76/253 (30%), Positives = 97/253 (38%)

Query:   269 QGH-GPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKG-PGYD 324
             +GH G P      G  G  G    T +   T  G      +D   GP G+   +G PG  
Sbjct:   640 RGHPGVPGRPGVRGPDGLKGQKGDTISCNVTYPGRQGPPGFDGLPGPKGFPGPQGAPGLS 699

Query:   325 ASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGP----NYDIHRGPS 378
              S      P T G S  P   PG+    G PG+  ++GS+     GP      +  +G  
Sbjct:   700 GSDGHKGRPGTPGTSEIPGP-PGFRGDMGDPGFGGERGSSPVGPPGPPGSPGVNGQKGIP 758

Query:   379 YDPQRG-LGYDMQRG----PNYDMQRG----PGYETQR-VPGYDVQRGPVYEAQRA--PS 426
              DP  G LG   +RG    P     RG    PG E    +PG+   +GP      A  P 
Sbjct:   759 GDPAFGPLGPPGKRGLSGVPGIKGPRGDPGCPGAEGPAGIPGFPGLKGPKGREGHAGFPG 818

Query:   427 YIPQRGPGYDLQRGQ-GYDMRRA-PSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSA 484
              +P   PG+  +RG  G   +   P Y P    G  GAP G    G V PP      G  
Sbjct:   819 -VPGP-PGHSCERGAPGIPGQPGLPGY-P----GSPGAPGGKGQPGDVGPPGPAGMKGLP 871

Query:   485 TPPARSGSGQPRG 497
               P R G+  P G
Sbjct:   872 GLPGRPGAHGPPG 884


>ZFIN|ZDB-GENE-080204-113
            symbol:zgc:172323 "zgc:172323" species:7955 "Danio
            rerio" [GO:0005882 "intermediate filament" evidence=IEA]
            [GO:0005198 "structural molecule activity" evidence=IEA]
            [GO:0008150 "biological_process" evidence=ND] InterPro:IPR001664
            InterPro:IPR006821 Pfam:PF04732 ZFIN:ZDB-GENE-080204-113
            GO:GO:0005198 GO:GO:0005882 HOVERGEN:HBG013015 InterPro:IPR016044
            PANTHER:PTHR23239 Pfam:PF00038 GeneTree:ENSGT00560000076873
            EMBL:CR848819 EMBL:BC155653 IPI:IPI00492297 RefSeq:NP_001107899.1
            UniGene:Dr.18713 SMR:A9JRG7 Ensembl:ENSDART00000075191
            GeneID:564165 KEGG:dre:564165 eggNOG:NOG147695 HOGENOM:HOG000207709
            NextBio:20885253 Uniprot:A9JRG7
        Length = 847

 Score = 143 (55.4 bits), Expect = 2.9e-06, P = 2.9e-06
 Identities = 112/465 (24%), Positives = 174/465 (37%)

Query:    58 QHVEMQKLATEN-QRLAATHGTLRQEL--AAAQHELQILHGQIGGMKSERELQ-----MR 109
             QH +   +A +N Q + + +     +L    ++H  Q+ H + G   +++++Q     M 
Sbjct:   281 QH-QYDDIAAKNLQEMDSWYKNKFDDLNNKTSKHVDQVRHVREGIASAKKDIQNKERDMD 339

Query:   110 NLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQQ 169
             ++  K   +EA+++  +    +++K   + Q  + A +  +    Q T  L R + D   
Sbjct:   340 SMNTKNEALEAQIRDTQD---KYRKELEDLQARIEALQLELKSSKQRTALLLREYQD--- 393

Query:   170 IPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNA 229
                LL+   SL  E    R   E E       ++S+Q M    ++ +T V  + A   N 
Sbjct:   394 ---LLNVKMSLEIEITTYRKLIEGEDSRLTSMVQSMQTM--TLMSGSTSVHTVAAGAAN- 447

Query:   230 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGV------- 282
                 R   G  GG  G+   E +G  +G  A     GV  G G   SAT  G        
Sbjct:   448 ----RGGRGLAGGLGGDVGLEFAGG-LGGPATGLERGV--GRGLDGSATVLGESVGGDAA 500

Query:   283 --VGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYD 340
               VG GP T    +     G  + +   I  G G     GP    +     DP KG    
Sbjct:   501 RGVGGGPTTVLGGHVDGGLGGGIGSGPAIGLGGG--VGSGPATGFAGGVGGDPAKGLPGG 558

Query:   341 PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 400
                GP      G G D  KG       GP   +  G   DP +GL  D+   P   +  G
Sbjct:   559 VGGGPATGLGGGVGGDPAKGLPGGVGGGPATGLTGGVGGDPGKGLS-DVGGVPATSLAGG 617

Query:   401 PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPSRGT- 457
              G +  + +PG  V  GP           P +G PG  +  G    +      D ++G  
Sbjct:   618 VGGDPAKGLPG-GVGGGPATGLAGGVGVDPAKGLPG-GVSGGPASGLAGGVGGDTAKGLP 675

Query:   458 -GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPA 501
              G  G P      G    P+  +  G    P++   G   GG PA
Sbjct:   676 GGVGGGPATGLAGGVGGVPVTGLAGGVGGDPSKGLPGGV-GGGPA 719


>FB|FBgn0262126
            symbol:gho "ghost" species:7227 "Drosophila melanogaster"
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0006888 "ER to Golgi vesicle-mediated transport" evidence=IEA]
            [GO:0006886 "intracellular protein transport" evidence=IEA]
            [GO:0008270 "zinc ion binding" evidence=IEA] [GO:0030127 "COPII
            vesicle coat" evidence=IEA] [GO:0005811 "lipid particle"
            evidence=IDA] [GO:0035158 "regulation of tube diameter, open
            tracheal system" evidence=IMP] [GO:0009306 "protein secretion"
            evidence=IMP] [GO:0035151 "regulation of tube size, open tracheal
            system" evidence=IMP] [GO:0070971 "endoplasmic reticulum exit site"
            evidence=IDA] [GO:0003331 "positive regulation of extracellular
            matrix constituent secretion" evidence=IMP] [GO:0007029
            "endoplasmic reticulum organization" evidence=IMP] [GO:0048081
            "positive regulation of cuticle pigmentation" evidence=IMP]
            [GO:0030011 "maintenance of cell polarity" evidence=IMP]
            [GO:0007030 "Golgi organization" evidence=IMP] [GO:0016203 "muscle
            attachment" evidence=IMP] [GO:0035149 "lumen formation, open
            tracheal system" evidence=IMP] [GO:0034394 "protein localization to
            cell surface" evidence=IMP] [GO:0040003 "chitin-based cuticle
            development" evidence=IMP] [GO:0022409 "positive regulation of
            cell-cell adhesion" evidence=IMP] [GO:0008360 "regulation of cell
            shape" evidence=IMP] [GO:0071711 "basement membrane organization"
            evidence=IMP] [GO:0000902 "cell morphogenesis" evidence=IMP]
            InterPro:IPR006895 InterPro:IPR006896 InterPro:IPR006900
            Pfam:PF04810 Pfam:PF04811 Pfam:PF04815 GO:GO:0006886 EMBL:AE014134
            GO:GO:0008360 GO:GO:0005811 GO:GO:0008270 GO:GO:0009306
            GO:GO:0016787 GO:GO:0016203 GO:GO:0000902 InterPro:IPR007123
            Pfam:PF00626 GO:GO:0006888 GO:GO:0040003 GO:GO:0034394
            GO:GO:0003331 GO:GO:0071711 GO:GO:0007030 GO:GO:0007029
            GO:GO:0030011 GO:GO:0035158 GO:GO:0022409 GO:GO:0035149
            GO:GO:0030127 SUPFAM:SSF82919 GO:GO:0070971 InterPro:IPR012990
            Pfam:PF08033 SUPFAM:SSF81811 eggNOG:COG5028 KO:K14007
            GeneTree:ENSGT00590000082962 HSSP:P40482 OMA:QDQGNCN GO:GO:0048081
            EMBL:AY052042 RefSeq:NP_608664.2 UniGene:Dm.269 SMR:Q9VQ94
            IntAct:Q9VQ94 MINT:MINT-283494 STRING:Q9VQ94
            EnsemblMetazoa:FBtr0077810 EnsemblMetazoa:FBtr0329964 GeneID:33409
            KEGG:dme:Dmel_CG10882 UCSC:CG10882-RA CTD:33409 FlyBase:FBgn0262126
            InParanoid:Q9VQ94 OrthoDB:EOG4CVDNW GenomeRNAi:33409 NextBio:783418
            Uniprot:Q9VQ94
        Length = 1193

 Score = 135 (52.6 bits), Expect = 3.2e-06, Sum P(2) = 3.2e-06
 Identities = 65/231 (28%), Positives = 84/231 (36%)

Query:   266 GVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIP---RGPGYEASKGPG 322
             G P   G PP +    +  + P  S     +++ G P       P     PG    +  G
Sbjct:   211 GQPPLPGQPPFS--GQIPTSQPAPSPYGVPSSRPGQPQLPPGATPPTYTQPGLPPQQQQG 268

Query:   323 YDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQ 382
                 + P   P + P + P + PG  P   PG   Q G+ Y A +   Y    G  +  Q
Sbjct:   269 IPPLQQPGI-PQQQPGFPPQQ-PGLPPLSQPGLPPQPGAPYGAPQQGGYS---G-GFPGQ 322

Query:   383 RGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG----PGYDLQ 438
                G+     P       PG +    P +   + P Y  Q+ P Y PQ G    PGY  Q
Sbjct:   323 APGGFPGAPPPL------PGQQAAAPPQFGAPQ-PGYPGQQ-PGYPPQPGQQPMPGYPPQ 374

Query:   439 RGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPAR 489
              GQ       P Y P  G GF G P G     Q P P     Y  A P AR
Sbjct:   375 PGQQLG---GPGYPPQPGAGFPGQP-GRPGFNQPPMPGAGNMYQQA-PQAR 420

 Score = 127 (49.8 bits), Expect = 2.3e-05, Sum P(2) = 2.3e-05
 Identities = 75/283 (26%), Positives = 100/283 (35%)

Query:   238 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHG-PPPSATTAGVVGAGPNTSTSAY-- 294
             G  GGA         G P        G+     +  PPP+       GA P T   +Y  
Sbjct:    90 GGVGGANPLKPPLPQGAPAAAAPPPTGFNQFNSNAAPPPTNNNNAAFGAPPPTQAGSYVN 149

Query:   295 -AATQSGTPMRAAYDIPRGPGYEASKG--PGYDASKAPSYDPTKGPSYDPAKG------- 344
              A   S TP   A  I +     A+    P     KA +     G    PA G       
Sbjct:   150 GALPPSSTPQSVASGINQMSLNSATLAGLPHMPPPKAATPGAAPGQPPIPAAGSTSQPPL 209

Query:   345 PGYDPTKGPGYDAQKGSNYDAQRGPN-YDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY 403
             PG  P   PG     G    +Q  P+ Y +       PQ   G      P Y     P  
Sbjct:   210 PGQPPL--PGQPPFSGQIPTSQPAPSPYGVPSSRPGQPQLPPG---ATPPTYTQPGLPPQ 264

Query:   404 ETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYD-PSRGTGFDGA 462
             + Q +P   +Q+  +   Q+ P + PQ+ PG       G   +    Y  P +G G+ G 
Sbjct:   265 QQQGIP--PLQQPGI--PQQQPGFPPQQ-PGLPPLSQPGLPPQPGAPYGAPQQG-GYSGG 318

Query:   463 PRGAAPHG--QVPPPLNNVPYGSATPPARSGSGQPRGGNPARR 503
               G AP G    PPPL   P   A  P + G+ QP  G P ++
Sbjct:   319 FPGQAPGGFPGAPPPL---PGQQAAAPPQFGAPQP--GYPGQQ 356

 Score = 116 (45.9 bits), Expect = 0.00034, Sum P(2) = 0.00034
 Identities = 69/272 (25%), Positives = 96/272 (35%)

Query:   238 GSY--GGATGNSENETSGRPVGQNAYEDGY--GVPQGHGPPPSATTAGVVGAGPNTSTSA 293
             GSY  G    +S  ++    + Q +       G+P  H PPP A T G   A P      
Sbjct:   145 GSYVNGALPPSSTPQSVASGINQMSLNSATLAGLP--HMPPPKAATPG---AAPGQPPIP 199

Query:   294 YAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK-- 351
              A + S  P+     +P  P + + + P    + +P   P+  P   P   PG  P    
Sbjct:   200 AAGSTSQPPLPGQPPLPGQPPF-SGQIPTSQPAPSPYGVPSSRPG-QPQLPPGATPPTYT 257

Query:   352 GPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGY 411
              PG   Q+       + P     + P + PQ+  G      P    Q G  Y   +  GY
Sbjct:   258 QPGLPPQQQQGIPPLQQPGIP-QQQPGFPPQQP-GLPPLSQPGLPPQPGAPYGAPQQGGY 315

Query:   412 DVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYD-PSRGTGFDGAPRGAAPHG 470
                 G  +  Q AP   P   P    Q+        AP    P +  G+   P G  P  
Sbjct:   316 S---GG-FPGQ-APGGFPGAPPPLPGQQAAAPPQFGAPQPGYPGQQPGYPPQP-GQQPMP 369

Query:   471 QVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 502
               PP       G   PP + G+G P  G P R
Sbjct:   370 GYPPQPGQQLGGPGYPP-QPGAGFP--GQPGR 398

 Score = 58 (25.5 bits), Expect = 3.2e-06, Sum P(2) = 3.2e-06
 Identities = 20/76 (26%), Positives = 35/76 (46%)

Query:    30 GMRPPMPGAFPPFDM-MPPPEVMEQKIASQHVEMQ-KLATENQRLAAT----HGTLRQEL 83
             G  PP  G +PP    +P  +  +Q++  Q  + Q +        AA+    +G  +Q+L
Sbjct:    20 GAPPPNSGGWPPQQQQLPQQQPPQQQLPPQQQQQQPQYGAPPPTSAASQPYLNGNYQQQL 79

Query:    84 AAAQHELQILHGQIGG 99
             A +   L +  G +GG
Sbjct:    80 ATSMGGLSV-GGGVGG 94

 Score = 40 (19.1 bits), Expect = 0.00022, Sum P(2) = 0.00022
 Identities = 10/24 (41%), Positives = 13/24 (54%)

Query:    31 MRPPMP-GAFPPFDMMPPPEVMEQ 53
             ++PP+P GA  P    PPP    Q
Sbjct:    98 LKPPLPQGA--PAAAAPPPTGFNQ 119


>UNIPROTKB|Q5TM61
            symbol:PPP1R10 "Serine/threonine-protein phosphatase 1
            regulatory subunit 10" species:9544 "Macaca mulatta" [GO:0000785
            "chromatin" evidence=ISS] [GO:0072357 "PTW/PP1 phosphatase complex"
            evidence=ISS] InterPro:IPR000571 InterPro:IPR003617
            InterPro:IPR017923 Pfam:PF00642 Pfam:PF08711 PROSITE:PS50103
            PROSITE:PS51319 SMART:SM00356 SMART:SM00509 GO:GO:0005634
            GO:GO:0046872 GO:GO:0003677 GO:GO:0008270 GO:GO:0000785
            GO:GO:0006351 GO:GO:0003723 EMBL:AB128049 GO:GO:0004864
            Gene3D:1.20.930.10 SUPFAM:SSF47676 GO:GO:0072357 CTD:5514
            eggNOG:NOG69306 HOVERGEN:HBG053646 RefSeq:NP_001108416.1
            UniGene:Mmu.17467 ProteinModelPortal:Q5TM61 GeneID:711949
            KEGG:mcc:711949 NextBio:19975847 Uniprot:Q5TM61
        Length = 940

 Score = 143 (55.4 bits), Expect = 3.4e-06, P = 3.4e-06
 Identities = 73/271 (26%), Positives = 93/271 (34%)

Query:   253 GRPVGQNAYEDGYG--VPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIP 310
             G P G   +  G G  +P  HG P       ++G  P            G PMR    + 
Sbjct:   635 GGPKGMQHFPPGPGGPMPGPHGGPGGPVGPRLLGPPPPPRGGDPFWDGPGDPMRGG-PMR 693

Query:   311 RGPG-----YEASKGPGYDASKAPSYDPTKGPSYDPAKG--PGYDPTKG---PGYDAQKG 360
              GPG     Y   +G G   ++ P   P   P +  A+G   G  P  G   PG     G
Sbjct:   694 GGPGPGPGPYHRGRG-GRGGNEPP---PPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGG 749

Query:   361 SNYDAQRGP-----NYDIHRGPSYDPQRGLG--YDMQRGPNYDMQRGPGYETQRVPGYDV 413
               +    GP     N   HR P   P  G+G  +    GP   M  G G+     PG  +
Sbjct:   750 GGHRPHEGPGGGMGNSSGHR-PHEGPGSGMGSGHRPHEGPGSSMGGGGGHRPHEGPGGGI 808

Query:   414 QRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVP 473
               G  +     P      G G+    G G  M  +  + P  G G  G P G  PH  VP
Sbjct:   809 SGGSGHRPHEGPGGGMGAGGGHRPHEGPGGSMGGSGGHRPHEGPGH-GGPHGHRPH-DVP 866

Query:   474 PPLNNVPYGSATPPA--RSGSGQPRGGNPAR 502
                 +   G   PP   R   G   GG   R
Sbjct:   867 GHRGHDHRG---PPHEHRGHDGPGHGGGGHR 894

 Score = 142 (55.0 bits), Expect = 4.3e-06, P = 4.3e-06
 Identities = 54/213 (25%), Positives = 73/213 (34%)

Query:   241 GGATGNSENETSGRPV-GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQS 299
             GG  GN        P  G      G G P G G P      G  G  P+          S
Sbjct:   708 GGRGGNEPPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGG-GHRPHEGPGGGMGNSS 766

Query:   300 GTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQK 359
             G           G G+   +GPG        + P +GP    + G G+ P +GPG     
Sbjct:   767 GHRPHEGPGSGMGSGHRPHEGPGSSMGGGGGHRPHEGPGGGISGGSGHRPHEGPGGGMGA 826

Query:   360 GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPN-YDMQRGPGYETQRVPGYDVQRGPV 418
             G  +    GP   +     + P  G G+    G   +D+   PG+      G+D  RGP 
Sbjct:   827 GGGHRPHEGPGGSMGGSGGHRPHEGPGHGGPHGHRPHDV---PGHR-----GHD-HRGPP 877

Query:   419 YE--AQRAPSYIPQRGPGYDLQRGQGYDMRRAP 449
             +E      P +      G+D     G DM   P
Sbjct:   878 HEHRGHDGPGHGGGGHRGHDGGHSHGGDMSNRP 910

 Score = 140 (54.3 bits), Expect = 7.1e-06, P = 7.1e-06
 Identities = 62/245 (25%), Positives = 83/245 (33%)

Query:   238 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQ-------GHGPPPSATTAGVVGAGPNTS 290
             G  GG  G         P G + + DG G P        G GP P     G  G G N  
Sbjct:   656 GGPGGPVGPRLLGPPPPPRGGDPFWDGPGDPMRGGPMRGGPGPGPGPYHRGRGGRGGNEP 715

Query:   291 TSAYAATQSGTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDP 349
                        P R A     G G    +G PG        + P +GP        G+ P
Sbjct:   716 PPP-----PPPPFRGARGGRSGGGPPNGRGGPGGGMVGGGGHRPHEGPGGGMGNSSGHRP 770

Query:   350 TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVP 409
              +GPG  +  GS +    GP   +  G  + P  G G  +  G  +    GPG       
Sbjct:   771 HEGPG--SGMGSGHRPHEGPGSSMGGGGGHRPHEGPGGGISGGSGHRPHEGPGGGMGAGG 828

Query:   410 GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG-QGYDMRRAPSYD----PSRGTGFDGAPR 464
             G+    GP      +  + P  GPG+    G + +D+     +D    P    G DG   
Sbjct:   829 GHRPHEGPGGSMGGSGGHRPHEGPGHGGPHGHRPHDVPGHRGHDHRGPPHEHRGHDGPGH 888

Query:   465 GAAPH 469
             G   H
Sbjct:   889 GGGGH 893


>UNIPROTKB|Q7YR38
            symbol:PPP1R10 "Serine/threonine-protein phosphatase 1
            regulatory subunit 10" species:9598 "Pan troglodytes" [GO:0000785
            "chromatin" evidence=ISS] [GO:0072357 "PTW/PP1 phosphatase complex"
            evidence=ISS] InterPro:IPR000571 InterPro:IPR003617
            InterPro:IPR017923 Pfam:PF00642 Pfam:PF08711 PROSITE:PS50103
            PROSITE:PS51319 SMART:SM00356 SMART:SM00509 GO:GO:0005634
            GO:GO:0046872 GO:GO:0003677 GO:GO:0008270 GO:GO:0000785
            GO:GO:0006351 GO:GO:0003723 EMBL:BA000041 GO:GO:0004864
            Gene3D:1.20.930.10 SUPFAM:SSF47676 GO:GO:0072357 CTD:5514
            eggNOG:NOG69306 HOGENOM:HOG000049285 HOVERGEN:HBG053646 OMA:PPPHEHR
            GeneTree:ENSGT00530000063820 EMBL:AB210175 EMBL:AB210176
            RefSeq:NP_001038965.1 UniGene:Ptr.6270 ProteinModelPortal:Q7YR38
            Ensembl:ENSPTRT00000033108 GeneID:462544 KEGG:ptr:462544
            NextBio:20841794 Uniprot:Q7YR38
        Length = 940

 Score = 143 (55.4 bits), Expect = 3.4e-06, P = 3.4e-06
 Identities = 63/248 (25%), Positives = 83/248 (33%)

Query:   238 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQ-------GHGPPPSATTAGVVGAGPNTS 290
             G  GG  G         P G + + DG G P        G GP P     G  G G N  
Sbjct:   656 GGPGGPVGPRLLGPPPPPRGGDPFWDGPGDPMRGGPMRGGPGPGPGPYHRGRGGRGGNEP 715

Query:   291 TSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPT 350
                    +     R+    P G G     GPG        + P +GP        G+ P 
Sbjct:   716 PPPPPPFRGARGGRSGGGPPNGRG-----GPGGGMVGGGGHRPHEGPGGGMGNNSGHRPH 770

Query:   351 KGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPG 410
             +GPG     GS +    GP   +  G  + P  G G  +  G  +    GPG       G
Sbjct:   771 EGPG--GGMGSGHRPHEGPAGSMGGGGGHRPHEGPGGGISGGSGHRPHEGPGGGMGAGGG 828

Query:   411 YDVQRGPVYEAQRAPSYIPQRGPGY---------DLQRGQGYDMRRAPSYDPSRGTGFDG 461
             +    GP      +  + P  GPG+         D+   +G+D R  P   P    G DG
Sbjct:   829 HRPHEGPGGSMGGSGGHRPHEGPGHGGPHGHRPHDVPGHRGHDHRGPP---PHEHRGHDG 885

Query:   462 APRGAAPH 469
                G   H
Sbjct:   886 PGHGGGGH 893

 Score = 142 (55.0 bits), Expect = 4.3e-06, P = 4.3e-06
 Identities = 71/268 (26%), Positives = 90/268 (33%)

Query:   253 GRPVGQNAYEDGYG--VPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIP 310
             G P G   +  G G  +P  HG P       ++G  P            G PMR    + 
Sbjct:   635 GGPKGMQHFPPGPGGPMPGPHGGPGGPVGPRLLGPPPPPRGGDPFWDGPGDPMRGG-PMR 693

Query:   311 RGPGYEASKGPG-YD---ASKAPSYDPTKGPSYDPAKG--PGYDPTKG---PGYDAQKGS 361
              GPG     GPG Y      +  +  P   P +  A+G   G  P  G   PG     G 
Sbjct:   694 GGPG----PGPGPYHRGRGGRGGNEPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGG 749

Query:   362 NYDAQRGP-----NYDIHRGPSYDPQRGLG--YDMQRGPNYDMQRGPGYETQRVPGYDVQ 414
              +    GP     N   HR P   P  G+G  +    GP   M  G G+     PG  + 
Sbjct:   750 GHRPHEGPGGGMGNNSGHR-PHEGPGGGMGSGHRPHEGPAGSMGGGGGHRPHEGPGGGIS 808

Query:   415 RGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPP 474
              G  +     P      G G+    G G  M  +  + P  G G  G P G  PH  VP 
Sbjct:   809 GGSGHRPHEGPGGGMGAGGGHRPHEGPGGSMGGSGGHRPHEGPGH-GGPHGHRPH-DVPG 866

Query:   475 PLNNVPYGSATPPARSGSGQPRGGNPAR 502
                +   G      R   G   GG   R
Sbjct:   867 HRGHDHRGPPPHEHRGHDGPGHGGGGHR 894

 Score = 132 (51.5 bits), Expect = 5.3e-05, P = 5.3e-05
 Identities = 54/214 (25%), Positives = 72/214 (33%)

Query:   257 GQNAYEDGYGVPQGHGPPPS-----ATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 311
             G   Y  G G   G+ PPP          G  G GP            G      ++ P 
Sbjct:   699 GPGPYHRGRGGRGGNEPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGGGHRPHEGPG 758

Query:   312 G-----PGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQ 366
             G      G+   +GPG        + P +GP+     G G+ P +GPG     GS +   
Sbjct:   759 GGMGNNSGHRPHEGPG--GGMGSGHRPHEGPAGSMGGGGGHRPHEGPGGGISGGSGHRPH 816

Query:   367 RGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY------ETQRVPGY--DVQRGPV 418
              GP   +  G  + P  G G  M     +    GPG+          VPG+     RGP 
Sbjct:   817 EGPGGGMGAGGGHRPHEGPGGSMGGSGGHRPHEGPGHGGPHGHRPHDVPGHRGHDHRGPP 876

Query:   419 YEAQRA---PSYIPQRGPGYDLQRGQGYDMRRAP 449
                 R    P +      G+D     G DM   P
Sbjct:   877 PHEHRGHDGPGHGGGGHRGHDGGHSHGGDMSNRP 910


>UNIPROTKB|F1SKM1
            symbol:COL7A1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0031012 "extracellular matrix" evidence=IDA]
            [GO:0004867 "serine-type endopeptidase inhibitor activity"
            evidence=IEA] InterPro:IPR002035 InterPro:IPR002223
            InterPro:IPR003961 Pfam:PF00014 Pfam:PF00041 Pfam:PF00092
            PRINTS:PR00759 PROSITE:PS50234 PROSITE:PS50279 PROSITE:PS50853
            SMART:SM00060 SMART:SM00327 Gene3D:2.60.40.10 InterPro:IPR013783
            GO:GO:0004867 SUPFAM:SSF49265 Gene3D:4.10.410.10 InterPro:IPR020901
            SUPFAM:SSF57362 PROSITE:PS00280 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0005604 OMA:RRVCTTA GeneTree:ENSGT00700000104301
            EMBL:CU633242 Ensembl:ENSSSCT00000012432 ArrayExpress:F1SKM1
            Uniprot:F1SKM1
        Length = 2939

 Score = 148 (57.2 bits), Expect = 3.6e-06, P = 3.6e-06
 Identities = 82/272 (30%), Positives = 105/272 (38%)

Query:   255 PVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG-P 313
             P G        G P   GPP SA   G  G  P    S  +  + GTP  +    P+G P
Sbjct:  1270 PPGPPGLPGRIGAPGPPGPPGSAIAKGERGF-PGADGSPGSPGRPGTPGTSG---PKGSP 1325

Query:   314 GYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGPNY 371
             G+   +G PG    + P  +P +       +GPG    KG PG     GS     RGP+ 
Sbjct:  1326 GWPGPRGEPGERGPRGPKGEPGEPGRVIGGEGPGLPGQKGDPGLPGPPGS-----RGPSG 1380

Query:   372 DIH-RGPSYDPQRGL----GYDMQRGPNY--DMQRGPGYE-TQRVPGYDVQRGPV----Y 419
             D   RGP   P   +    G   +RGP    D    PG      +PG    +GPV     
Sbjct:  1381 DPGPRGPPGFPGTAVKGEKGDRGERGPPGPGDGTAAPGDPGLPGLPGSPGPQGPVGPPGE 1440

Query:   420 EAQRAPSYIPQRG----PGYDLQRG-QGYDMRRAPSYDPSRG-TGFDGAPRGAAPHGQVP 473
             + ++  S     G    PG   +RG +G+     P  D  RG TG  G P      G  P
Sbjct:  1441 KGEKGDSEDGAPGLPGQPGVPGERGLRGFPGDTGPKGD--RGLTGAVGEPGEKGERGS-P 1497

Query:   474 PPLNNVPYGSATPPARSGSGQPRG--GNPARR 503
              P+   P G    P R G+  P G  G   RR
Sbjct:  1498 GPVG--PQGPPGVPGRPGAEGPEGPPGPTGRR 1527


>UNIPROTKB|P12105
            symbol:COL3A1 "Collagen alpha-1(III) chain" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            [GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000885
            InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410 ProDom:PD002078
            PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038
            SMART:SM00214 GO:GO:0046872 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 eggNOG:NOG12793 GO:GO:0005201 HOVERGEN:HBG004933
            EMBL:U07973 EMBL:X00822 EMBL:X00823 EMBL:X00826 EMBL:X00825
            EMBL:X00827 EMBL:X00828 EMBL:X00830 EMBL:X00831 EMBL:K02302
            EMBL:K02301 EMBL:V00391 EMBL:V00392 EMBL:M36662 IPI:IPI00590578
            PIR:A05269 PIR:I50694 UniGene:Gga.42140 ProteinModelPortal:P12105
            STRING:P12105 Uniprot:P12105
        Length = 1262

 Score = 144 (55.7 bits), Expect = 3.7e-06, P = 3.7e-06
 Identities = 84/280 (30%), Positives = 109/280 (38%)

Query:   242 GATGN-SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP-NTSTSAYAATQ 298
             GA G   +N   G P G+       G+P  +G P     AG  G+ GP   S  A    Q
Sbjct:   467 GANGEPGQNGVPGTP-GERGSPGFRGLPGSNGLPGEKGPAGERGSPGPPGPSGPAGDRGQ 525

Query:   299 SGTP----MRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP- 353
              G P    MR    IP  PG +   GP  +  + P      GP+  P   PG     GP 
Sbjct:   526 DGGPGLPGMRGLPGIPGSPGSDGKPGPPGNQGE-PGRSGPPGPA-GPRGQPGVMGFPGPK 583

Query:   354 GYDAQKGSNYDAQRGPNYDIHR-GPS-YDPQRGL-GYDMQRGPNYDM-QRGPGYET--QR 407
             G +   G N   +RGP       GP+  +   GL G     GP  D  + GP      Q 
Sbjct:   584 GNEGAPGKN--GERGPGGPPGTPGPAGKNGDVGLPGPPGPAGPAGDRGEPGPSGSPGLQG 641

Query:   408 VPGYDVQRGPVYEAQRAPSYIPQR---GPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAP 463
             +PG     GP  E  +     P+    GPG+   +G+ G    R P   P   TG  G P
Sbjct:   642 LPGGP---GPAGENGKPGEPGPKGDIGGPGFPGPKGENGIPGERGPQGPPGP-TGARGGP 697

Query:   464 RGAAPHG-QVPPPLNNVPYGSATPPARSGSGQPRG--GNP 500
               A   G + PP     P G+  P  +   G+ RG  G+P
Sbjct:   698 GPAGSEGAKGPPGPPGAPGGTGLPGLQGMPGE-RGASGSP 736

 Score = 128 (50.1 bits), Expect = 0.00020, P = 0.00020
 Identities = 87/281 (30%), Positives = 107/281 (38%)

Query:   241 GGATGNSENETSGRPVGQNAY-EDGY-GVPQGHGPPPSATTAGVVGAGPNTSTSAYA--- 295
             GG TG  E    G P G  A+ +DG  G     GPP    TAG  G+ P     A     
Sbjct:   301 GGPTG--ERGRPGNPGGPGAHGKDGAPGTAGPLGPPGPPGTAGFPGS-PGFKGEAGPPGP 357

Query:   296 ATQSGTPMRAAYDIPRGP-GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG-P 353
             A  SG P       P+G  G    +GP   A  +P      GPS  P  GPG    +G P
Sbjct:   358 AGASGNPGERGEPGPQGQAGPPGPQGPPGRAG-SPGGKGEMGPSGIPG-GPGPPGGRGLP 415

Query:   354 GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDM-QRGPGYETQRVPGYD 412
             G     G N  A+  P      G   DP    G   +RG N     RGP       PG +
Sbjct:   416 GPPGTSG-NPGAKGTPGEPGKNGAKGDP----GPKGERGENGTPGARGP-------PGEE 463

Query:   413 VQRGPVYEAQR--APSYIPQRG-PGY-DLQRGQGYDMRRAPSYDPSRGTGFDGAPRG-AA 467
              +RG   E  +   P    +RG PG+  L    G    + P+ +  RG+     P G A 
Sbjct:   464 GKRGANGEPGQNGVPGTPGERGSPGFRGLPGSNGLPGEKGPAGE--RGSPGPPGPSGPAG 521

Query:   468 PHGQV--P--PPLNNVPYGSATPPARSGSGQPRG--GNPAR 502
               GQ   P  P +  +P G    P   G   P G  G P R
Sbjct:   522 DRGQDGGPGLPGMRGLP-GIPGSPGSDGKPGPPGNQGEPGR 561

 Score = 127 (49.8 bits), Expect = 0.00026, P = 0.00026
 Identities = 78/276 (28%), Positives = 97/276 (35%)

Query:   242 GATGN-SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSG 300
             G  G   +N   G P G        G P   GPP      G  G  P  +       + G
Sbjct:   428 GTPGEPGKNGAKGDP-GPKGERGENGTPGARGPPGEEGKRGANGE-PGQNGVPGTPGERG 485

Query:   301 TPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAK--GPGYDPTKG-PGYD 356
             +P      +P   G    KGP G   S  P   P+ GP+ D  +  GPG    +G PG  
Sbjct:   486 SP--GFRGLPGSNGLPGEKGPAGERGSPGPP-GPS-GPAGDRGQDGGPGLPGMRGLPGIP 541

Query:   357 AQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYD--MQRGPGYE-TQRVPGYDV 413
                GS  D + GP      G   +P R  G     GP     +   PG +  +  PG + 
Sbjct:   542 GSPGS--DGKPGPP-----GNQGEPGRS-GPPGPAGPRGQPGVMGFPGPKGNEGAPGKNG 593

Query:   414 QRGPVYEAQRAPSYIPQRG----PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPH 469
             +RGP       P    + G    PG     G   D R  P   PS   G  G P G  P 
Sbjct:   594 ERGPG-GPPGTPGPAGKNGDVGLPGPPGPAGPAGD-RGEPG--PSGSPGLQGLPGGPGPA 649

Query:   470 GQVPPPLNNVPYGSATPPARSGSGQPRGGN--PARR 503
             G+   P    P G    P   G   P+G N  P  R
Sbjct:   650 GENGKPGEPGPKGDIGGPGFPG---PKGENGIPGER 682

 Score = 125 (49.1 bits), Expect = 0.00043, P = 0.00043
 Identities = 74/259 (28%), Positives = 95/259 (36%)

Query:   255 PVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRG- 312
             P G N Y+   G P   GP      AG++G AGP          + G P R   +  RG 
Sbjct:   192 PPGSNGYQGPPGEPGQPGPSGPPGPAGMIGPAGP--------PGKDGEPGRPGRNGDRGI 243

Query:   313 PGYEASKG-PGYDASKAPSYDPTKG-PSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRG 368
             PG    KG PG      P     +G    D AKG    P  GP G   Q G+N    Q G
Sbjct:   244 PGLPGHKGHPGMPGM--PGMKGARGFDGKDGAKGDSGAP--GPKGEAGQPGANGSPGQPG 299

Query:   369 PNYDI-HRGPSYDPQRGLGYDMQRGPNYDMQRGP-GYE-TQRVPGYDVQRGPVYEAQRAP 425
             P      RG   +P     +     P      GP G   T   PG      P ++ +  P
Sbjct:   300 PGGPTGERGRPGNPGGPGAHGKDGAPGTAGPLGPPGPPGTAGFPG-----SPGFKGEAGP 354

Query:   426 SYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSAT 485
                P    G   +RG+     +A    P    G  G+P G    G++ P  + +P G   
Sbjct:   355 PG-PAGASGNPGERGEPGPQGQAGPPGPQGPPGRAGSPGGK---GEMGP--SGIPGGPGP 408

Query:   486 PPARSGSGQP-RGGNPARR 503
             P  R   G P   GNP  +
Sbjct:   409 PGGRGLPGPPGTSGNPGAK 427


>TAIR|locus:2012788
            symbol:AT1G10390 "AT1G10390" species:3702 "Arabidopsis
            thaliana" [GO:0005215 "transporter activity" evidence=ISS]
            [GO:0005634 "nucleus" evidence=ISM] [GO:0005643 "nuclear pore"
            evidence=IEA] [GO:0006810 "transport" evidence=IEA] [GO:0005515
            "protein binding" evidence=IPI] [GO:0005635 "nuclear envelope"
            evidence=IDA] InterPro:IPR007230 Pfam:PF04096 PROSITE:PS51434
            EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0005635 GO:GO:0006810
            GO:GO:0005643 eggNOG:NOG12793 SUPFAM:SSF82215 KO:K14297 HSSP:Q9Y6J4
            EMBL:AY078948 EMBL:BT003030 EMBL:AK226964 IPI:IPI00523265
            RefSeq:NP_001031018.1 RefSeq:NP_172510.2 UniGene:At.27877
            ProteinModelPortal:Q8RY25 SMR:Q8RY25 STRING:Q8RY25 MEROPS:S59.A02
            PaxDb:Q8RY25 PRIDE:Q8RY25 EnsemblPlants:AT1G10390.1
            EnsemblPlants:AT1G10390.2 GeneID:837579 KEGG:ath:AT1G10390
            TAIR:At1g10390 HOGENOM:HOG000085153 InParanoid:Q8RY25 OMA:ESISAMP
            PhylomeDB:Q8RY25 ProtClustDB:CLSN2713828 Genevestigator:Q8RY25
            Uniprot:Q8RY25
        Length = 1041

 Score = 143 (55.4 bits), Expect = 3.8e-06, P = 3.8e-06
 Identities = 52/263 (19%), Positives = 89/263 (33%)

Query:   242 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP---SATTAGVVGAGPNTSTSAYAATQ 298
             GA+ +     S    G +     +G   G G  P   S   +   G     S  A+  T 
Sbjct:    80 GASSSPAFGNSTPAFGASPASSPFGGSSGFGQKPLGFSTPQSNPFGNSTQQSQPAFGNTS 139

Query:   299 SG--TPMRA----AYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG 352
              G  TP  A    A+  P  P + A+  P + AS  P++  T  P++  +  P +  T  
Sbjct:   140 FGSSTPFGATNTPAFGAPSTPSFGATSTPSFGASSTPAFGATNTPAFGASNSPSFGATNT 199

Query:   353 PGYDAQKGSNYDAQRGP--NYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPG 410
             P + A     + +      N     G ++       +     P +     P +     P 
Sbjct:   200 PAFGASPTPAFGSTGTTFGNTGFGSGGAFGASNTPAFGASGTPAFGASGTPAFGASSTPA 259

Query:   411 YDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHG 470
             +     P + A   P++     P +       +    +P++  S  + F     G++  G
Sbjct:   260 FGASSTPAFGASSTPAFGGSSTPSFGASNTSSFSFGSSPAFGQST-SAF-----GSSAFG 313

Query:   471 QVPPPLNNVPYGSATPPARSGSG 493
               P P        A+ P   GSG
Sbjct:   314 STPSPFGGA---QASTPTFGGSG 333


>MGI|MGI:1344412
            symbol:Ldb3 "LIM domain binding 3" species:10090 "Mus
            musculus" [GO:0005080 "protein kinase C binding" evidence=IDA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005737 "cytoplasm"
            evidence=IEA] [GO:0005856 "cytoskeleton" evidence=ISO] [GO:0008092
            "cytoskeletal protein binding" evidence=ISO] [GO:0008270 "zinc ion
            binding" evidence=IEA] [GO:0030018 "Z disc" evidence=ISO;IDA]
            [GO:0042995 "cell projection" evidence=IEA] [GO:0045214 "sarcomere
            organization" evidence=IMP] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0051371 "muscle alpha-actinin binding"
            evidence=IDA;IPI] Pfam:PF00595 Pfam:PF00412 InterPro:IPR001478
            InterPro:IPR001781 PROSITE:PS00478 PROSITE:PS50023 PROSITE:PS50106
            SMART:SM00132 SMART:SM00228 MGI:MGI:1344412 GO:GO:0048471
            GO:GO:0005080 GO:GO:0030018 GO:GO:0005856 GO:GO:0046872
            GO:GO:0008270 GO:GO:0031143 Gene3D:2.10.110.10 SUPFAM:SSF50156
            CTD:11155 eggNOG:NOG286537 HOVERGEN:HBG051478 OMA:CTSQATT
            OrthoDB:EOG4GTKDQ InterPro:IPR006643 SMART:SM00735 EMBL:AF114378
            EMBL:AF114379 EMBL:AJ005621 EMBL:AF228057 EMBL:AF228058
            EMBL:AY206011 EMBL:AY206012 EMBL:AY206013 EMBL:AY206015
            EMBL:AK172980 EMBL:AK004020 EMBL:AK137181 EMBL:AK142292
            EMBL:BC099596 EMBL:BC138793 EMBL:BC145420 IPI:IPI00123369
            IPI:IPI00323030 IPI:IPI00403041 IPI:IPI00621572 IPI:IPI00625287
            IPI:IPI00656173 RefSeq:NP_001034160.1 RefSeq:NP_001034161.1
            RefSeq:NP_001034162.1 RefSeq:NP_001034163.1 RefSeq:NP_001034164.1
            RefSeq:NP_001034165.1 RefSeq:NP_036048.3 UniGene:Mm.29733 PDB:1WJL
            PDBsum:1WJL ProteinModelPortal:Q9JKS4 SMR:Q9JKS4 IntAct:Q9JKS4
            MINT:MINT-97840 STRING:Q9JKS4 PhosphoSite:Q9JKS4 PaxDb:Q9JKS4
            PRIDE:Q9JKS4 Ensembl:ENSMUST00000022327 Ensembl:ENSMUST00000022328
            Ensembl:ENSMUST00000022330 Ensembl:ENSMUST00000090040 GeneID:24131
            KEGG:mmu:24131 UCSC:uc007taz.1 UCSC:uc007tba.1 UCSC:uc007tbc.1
            UCSC:uc007tbd.1 UCSC:uc007tbe.1 UCSC:uc007tbf.1
            GeneTree:ENSGT00700000104411 InParanoid:B2RSB0
            EvolutionaryTrace:Q9JKS4 NextBio:304169 Bgee:Q9JKS4 CleanEx:MM_LDB3
            Genevestigator:Q9JKS4 GermOnline:ENSMUSG00000021798 Uniprot:Q9JKS4
        Length = 723

 Score = 141 (54.7 bits), Expect = 3.9e-06, P = 3.9e-06
 Identities = 49/181 (27%), Positives = 69/181 (38%)

Query:   252 SGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 311
             S  P    +Y +G   P    P P   T   +   P+      A++ S +P  A Y  P 
Sbjct:   371 SPAPSAHTSYSEGPAAP---APKPRVVTTASIR--PSVYQPVPASSYSPSP-GANYS-PT 423

Query:   312 GPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNY 371
              P Y  S  P Y  S AP+Y P+  P+Y P+  P Y P+  P Y     + Y    GP+ 
Sbjct:   424 -P-YTPSPAPAYTPSPAPTYTPSPAPTYSPSPAPAYTPSPAPNYTPTPSAAYSG--GPSE 479

Query:   372 DIHRGP-----SYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVPGYDVQRGPVYEAQRAP 425
                R P     S+  +   G          + RG P Y         + RG    A+R P
Sbjct:   480 SASRPPWVTDDSFSQKFAPGKSTTTVSKQTLPRGAPAYNPTGPQVTPLARGTFQRAERFP 539

Query:   426 S 426
             +
Sbjct:   540 A 540

 Score = 135 (52.6 bits), Expect = 1.8e-05, P = 1.8e-05
 Identities = 55/192 (28%), Positives = 70/192 (36%)

Query:   265 YGVPQGHGPPPSATTAGVVGAG-----PNTSTSAYAATQSGTPMRAAYDIPRGPGYEASK 319
             Y       P PSA T+   G       P   T+A        P+ A+   P  PG   S 
Sbjct:   364 YSPAAAASPAPSAHTSYSEGPAAPAPKPRVVTTASIRPSVYQPVPASSYSP-SPGANYSP 422

Query:   320 GPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSY 379
              P Y  S AP+Y P+  P+Y P+  P Y P+  P Y      NY       Y    GPS 
Sbjct:   423 TP-YTPSPAPAYTPSPAPTYTPSPAPTYSPSPAPAYTPSPAPNYTPTPSAAYS--GGPSE 479

Query:   380 DPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYD-LQ 438
                R          ++  +  PG  T  V    + RG       AP+Y P  GP    L 
Sbjct:   480 SASRP---PWVTDDSFSQKFAPGKSTTTVSKQTLPRG-------APAYNPT-GPQVTPLA 528

Query:   439 RGQGYDMRRAPS 450
             RG      R P+
Sbjct:   529 RGTFQRAERFPA 540

 Score = 132 (51.5 bits), Expect = 3.8e-05, P = 3.8e-05
 Identities = 56/213 (26%), Positives = 74/213 (34%)

Query:   276 SATTAGVVGA---GPNTSTSAYAATQSGTPMRAAY-DIPRGPGYEASKGPGY--DASKAP 329
             +A+ AG   +    P    SAY+   + +P  +A+     GP   A K P     AS  P
Sbjct:   343 AASAAGPAASPVENPRPQASAYSPAAAASPAPSAHTSYSEGPAAPAPK-PRVVTTASIRP 401

Query:   330 S-YDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYD 388
             S Y P    SY P+ G  Y PT  P Y       Y     P Y     P+Y P     Y 
Sbjct:   402 SVYQPVPASSYSPSPGANYSPT--P-YTPSPAPAYTPSPAPTYTPSPAPTYSPSPAPAYT 458

Query:   389 MQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR- 447
                 PNY       Y     P     R P        S+  +  PG          + R 
Sbjct:   459 PSPAPNYTPTPSAAYSGG--PSESASRPPWVTDD---SFSQKFAPGKSTTTVSKQTLPRG 513

Query:   448 APSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVP 480
             AP+Y+P+ G       RG     +  P  +  P
Sbjct:   514 APAYNPT-GPQVTPLARGTFQRAERFPASSRTP 545


>UNIPROTKB|O75112
            symbol:LDB3 "LIM domain-binding protein 3" species:9606
            "Homo sapiens" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0005080 "protein kinase C binding" evidence=IEA] [GO:0031143
            "pseudopodium" evidence=IEA] [GO:0048471 "perinuclear region of
            cytoplasm" evidence=IEA] [GO:0005515 "protein binding"
            evidence=IPI] [GO:0005856 "cytoskeleton" evidence=IDA] [GO:0008092
            "cytoskeletal protein binding" evidence=IPI] [GO:0030018 "Z disc"
            evidence=IDA] Pfam:PF00595 Pfam:PF00412 InterPro:IPR001478
            InterPro:IPR001781 PROSITE:PS00478 PROSITE:PS50023 PROSITE:PS50106
            SMART:SM00132 SMART:SM00228 GO:GO:0048471 GO:GO:0030018
            GO:GO:0005856 GO:GO:0046872 GO:GO:0008270 Orphanet:154
            GO:GO:0031143 Gene3D:2.10.110.10 Orphanet:54260 SUPFAM:SSF50156
            EMBL:AJ133766 EMBL:AJ133767 EMBL:AJ133768 EMBL:AF276807
            EMBL:AF276808 EMBL:AF276809 EMBL:AB014513 EMBL:AK304760
            EMBL:EF179181 EMBL:AC067750 EMBL:BC010929 IPI:IPI00165263
            IPI:IPI00294958 IPI:IPI00294959 IPI:IPI00514458 IPI:IPI00552865
            IPI:IPI00654766 IPI:IPI00909817 RefSeq:NP_001073583.1
            RefSeq:NP_001073584.1 RefSeq:NP_001073585.1 RefSeq:NP_001165081.1
            RefSeq:NP_001165082.1 RefSeq:NP_009009.1 UniGene:Hs.657271 PDB:1RGW
            PDBsum:1RGW ProteinModelPortal:O75112 SMR:O75112 IntAct:O75112
            STRING:O75112 PhosphoSite:O75112 UCD-2DPAGE:O75112
            UCD-2DPAGE:Q9Y4Z5 PaxDb:O75112 PRIDE:O75112 DNASU:11155
            Ensembl:ENST00000263066 Ensembl:ENST00000310944
            Ensembl:ENST00000352360 Ensembl:ENST00000361373
            Ensembl:ENST00000372056 Ensembl:ENST00000372066
            Ensembl:ENST00000429277 Ensembl:ENST00000458213
            Ensembl:ENST00000542786 GeneID:11155 KEGG:hsa:11155 UCSC:uc001kdr.3
            UCSC:uc001kds.3 UCSC:uc001kdu.3 UCSC:uc001kdv.3 UCSC:uc009xsy.3
            UCSC:uc009xsz.3 CTD:11155 GeneCards:GC10P088426 HGNC:HGNC:15710
            HPA:HPA048955 MIM:601493 MIM:605906 MIM:609452 neXtProt:NX_O75112
            Orphanet:247 Orphanet:609 Orphanet:98912 PharmGKB:PA30318
            eggNOG:NOG286537 HOGENOM:HOG000220936 HOVERGEN:HBG051478
            InParanoid:O75112 OMA:CTSQATT OrthoDB:EOG4GTKDQ ChiTaRS:LDB3
            EvolutionaryTrace:O75112 GenomeRNAi:11155 NextBio:42413
            ArrayExpress:O75112 Bgee:O75112 Genevestigator:O75112
            GermOnline:ENSG00000122367 InterPro:IPR006643 SMART:SM00735
            Uniprot:O75112
        Length = 727

 Score = 141 (54.7 bits), Expect = 4.0e-06, P = 4.0e-06
 Identities = 53/183 (28%), Positives = 72/183 (39%)

Query:   252 SGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 311
             S  P    +Y +G   P    P P   T   +   P+      A+T S +P  A Y  P 
Sbjct:   375 SSAPATHTSYSEGPAAP---APKPRVVTTASIR--PSVYQPVPASTYSPSP-GANYS-PT 427

Query:   312 GPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNY 371
              P Y  S  P Y  S AP+Y P+  P+Y P+  P Y P+  P Y+      Y    GP  
Sbjct:   428 -P-YTPSPAPAYTPSPAPAYTPSPVPTYTPSPAPAYTPSPAPNYNPAPSVAYSG--GPAE 483

Query:   372 DIHRGP-----SYDPQRGLGYDMQRGPNYDMQRG-PGYETQ--RVPGYDVQRGPVYEAQR 423
                R P     S+  +   G          + RG P Y     +VP   + RG V  A+R
Sbjct:   484 PASRPPWVTDDSFSQKFAPGKSTTSISKQTLPRGGPAYTPAGPQVP--PLARGTVQRAER 541

Query:   424 APS 426
              P+
Sbjct:   542 FPA 544


>UNIPROTKB|G7N928
            symbol:EGK_04858 "Putative uncharacterized protein"
            species:9544 "Macaca mulatta" [GO:0005201 "extracellular matrix
            structural constituent" evidence=ISS] [GO:0005587 "collagen type
            IV" evidence=ISS] [GO:0005605 "basal lamina" evidence=ISS]
            [GO:0032836 "glomerular basement membrane development"
            evidence=ISS] InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403
            SMART:SM00111 InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005605
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 GO:GO:0005587
            Gene3D:2.170.240.10 GO:GO:0032836 EMBL:CM001264 Uniprot:G7N928
        Length = 1692

 Score = 145 (56.1 bits), Expect = 4.1e-06, P = 4.1e-06
 Identities = 81/261 (31%), Positives = 100/261 (38%)

Query:   253 GRPVGQNA-YEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 311
             G  V  N  Y    G P   GPP      G  GA P  S S     + GTP     +IP 
Sbjct:   663 GDTVSCNVTYPGRQGPPGFDGPPGPKGFPGPQGA-PGLSGSDGHKGRPGTP--GTSEIPG 719

Query:   312 GPGYEASKG-PGYDASKAPSYDPTKGPSYDPA----KG-PGYDPTKGP-GYDAQKG-SNY 363
              PG+    G PG+   K  S     GP   P     KG PG DP  G  G   ++G S  
Sbjct:   720 PPGFRGDIGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPG-DPAFGHLGPPGKRGLSGV 778

Query:   364 DAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRG-PGYETQRVPGYDVQRG-PVY 419
                +GP  D    P  +   G+ G+   +GP   +   G PG      PG+  +RG P  
Sbjct:   779 PGIKGPRGDPGY-PGAEGPAGIPGFPGLKGPKGREGHAGFPGVPGP--PGHSCERGAPGI 835

Query:   420 EAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNN 478
               Q  P      G PG    +GQ  D+   P   P+   G  G P     HG  PP L  
Sbjct:   836 PGQ--PGLPGDPGSPGAPGGKGQPGDV--GPP-GPAGMKGLPGLPGRPGAHG--PPGLPG 888

Query:   479 VP--YGSATPPARSGSGQPRG 497
             +P  +G    P   G   PRG
Sbjct:   889 IPGPFGDDGLPGPPGPKGPRG 909

 Score = 141 (54.7 bits), Expect = 1.1e-05, P = 1.1e-05
 Identities = 77/252 (30%), Positives = 97/252 (38%)

Query:   269 QGH-GPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKG-PGYD 324
             +GH G P      G  G  G    T +   T  G      +D P GP G+   +G PG  
Sbjct:   641 RGHPGVPGRPGVRGPDGLKGQKGDTVSCNVTYPGRQGPPGFDGPPGPKGFPGPQGAPGLS 700

Query:   325 ASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGP----NYDIHRGPS 378
              S      P T G S  P   PG+    G PG+  +KGS+     GP      +  +G  
Sbjct:   701 GSDGHKGRPGTPGTSEIPGP-PGFRGDIGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIP 759

Query:   379 YDPQRG-LGYDMQRG----PNYDMQRG-PGYETQR----VPGYDVQRGPVYEAQRA--PS 426
              DP  G LG   +RG    P     RG PGY        +PG+   +GP      A  P 
Sbjct:   760 GDPAFGHLGPPGKRGLSGVPGIKGPRGDPGYPGAEGPAGIPGFPGLKGPKGREGHAGFPG 819

Query:   427 YIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSAT 485
              +P   PG+  +RG  G   +     DP    G  GAP G    G V PP      G   
Sbjct:   820 -VPGP-PGHSCERGAPGIPGQPGLPGDP----GSPGAPGGKGQPGDVGPPGPAGMKGLPG 873

Query:   486 PPARSGSGQPRG 497
              P R G+  P G
Sbjct:   874 LPGRPGAHGPPG 885

 Score = 124 (48.7 bits), Expect = 0.00077, P = 0.00077
 Identities = 81/259 (31%), Positives = 100/259 (38%)

Query:   255 PVGQNAYEDGY-GVP--QGH-GPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDI 309
             PVG      G+ G P  +GH G P      G  G  G    T +   T  G      +D 
Sbjct:   626 PVGPPGL--GFPGPPGERGHPGVPGRPGVRGPDGLKGQKGDTVSCNVTYPGRQGPPGFDG 683

Query:   310 PRGP-GYEASKG-PGYDASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDA 365
             P GP G+   +G PG   S      P T G S  P   PG+    G PG+  +KGS+   
Sbjct:   684 PPGPKGFPGPQGAPGLSGSDGHKGRPGTPGTSEIPGP-PGFRGDIGDPGFGGEKGSSPVG 742

Query:   366 QRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE-TQRVPGYDVQRG-PVYEAQR 423
               GP       P  + Q+G+  D    P +     PG      VPG    RG P Y    
Sbjct:   743 PPGPP----GSPGVNGQKGIPGD----PAFGHLGPPGKRGLSGVPGIKGPRGDPGYPGAE 794

Query:   424 APSYIPQRGPGYDLQRG-QGYDMRRAPSYDPSRGTGFD-GAPRGAAPHGQVPPPLNNVPY 481
              P+ IP   PG    +G +G+     P      G   + GAP    P GQ  P L   P 
Sbjct:   795 GPAGIPGF-PGLKGPKGREGH--AGFPGVPGPPGHSCERGAP--GIP-GQ--PGLPGDP- 845

Query:   482 GSATPPARSGSGQPRGGNP 500
             GS  P A  G GQP    P
Sbjct:   846 GS--PGAPGGKGQPGDVGP 862


>UNIPROTKB|G7PK77
            symbol:EGM_04376 "Putative uncharacterized protein"
            species:9541 "Macaca fascicularis" [GO:0005201 "extracellular
            matrix structural constituent" evidence=ISS] [GO:0005587 "collagen
            type IV" evidence=ISS] [GO:0005605 "basal lamina" evidence=ISS]
            [GO:0032836 "glomerular basement membrane development"
            evidence=ISS] InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403
            SMART:SM00111 InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005605
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 GO:GO:0005587
            Gene3D:2.170.240.10 GO:GO:0032836 EMBL:CM001287 Uniprot:G7PK77
        Length = 1695

 Score = 145 (56.1 bits), Expect = 4.1e-06, P = 4.1e-06
 Identities = 81/261 (31%), Positives = 100/261 (38%)

Query:   253 GRPVGQNA-YEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 311
             G  V  N  Y    G P   GPP      G  GA P  S S     + GTP     +IP 
Sbjct:   663 GDTVSCNVTYPGRQGPPGFDGPPGPKGFPGPQGA-PGLSGSDGHKGRPGTP--GTSEIPG 719

Query:   312 GPGYEASKG-PGYDASKAPSYDPTKGPSYDPA----KG-PGYDPTKGP-GYDAQKG-SNY 363
              PG+    G PG+   K  S     GP   P     KG PG DP  G  G   ++G S  
Sbjct:   720 PPGFRGDIGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPG-DPAFGHLGPPGKRGLSGV 778

Query:   364 DAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRG-PGYETQRVPGYDVQRG-PVY 419
                +GP  D    P  +   G+ G+   +GP   +   G PG      PG+  +RG P  
Sbjct:   779 PGIKGPRGDPGY-PGAEGPAGIPGFPGLKGPKGREGHAGFPGVPGP--PGHSCERGAPGI 835

Query:   420 EAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNN 478
               Q  P      G PG    +GQ  D+   P   P+   G  G P     HG  PP L  
Sbjct:   836 PGQ--PGLPGDPGSPGAPGGKGQPGDV--GPP-GPAGMKGLPGLPGRPGAHG--PPGLPG 888

Query:   479 VP--YGSATPPARSGSGQPRG 497
             +P  +G    P   G   PRG
Sbjct:   889 IPGPFGDDGLPGPPGPKGPRG 909

 Score = 141 (54.7 bits), Expect = 1.1e-05, P = 1.1e-05
 Identities = 77/252 (30%), Positives = 97/252 (38%)

Query:   269 QGH-GPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKG-PGYD 324
             +GH G P      G  G  G    T +   T  G      +D P GP G+   +G PG  
Sbjct:   641 RGHPGVPGRPGVRGPDGLKGQKGDTVSCNVTYPGRQGPPGFDGPPGPKGFPGPQGAPGLS 700

Query:   325 ASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGP----NYDIHRGPS 378
              S      P T G S  P   PG+    G PG+  +KGS+     GP      +  +G  
Sbjct:   701 GSDGHKGRPGTPGTSEIPGP-PGFRGDIGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIP 759

Query:   379 YDPQRG-LGYDMQRG----PNYDMQRG-PGYETQR----VPGYDVQRGPVYEAQRA--PS 426
              DP  G LG   +RG    P     RG PGY        +PG+   +GP      A  P 
Sbjct:   760 GDPAFGHLGPPGKRGLSGVPGIKGPRGDPGYPGAEGPAGIPGFPGLKGPKGREGHAGFPG 819

Query:   427 YIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSAT 485
              +P   PG+  +RG  G   +     DP    G  GAP G    G V PP      G   
Sbjct:   820 -VPGP-PGHSCERGAPGIPGQPGLPGDP----GSPGAPGGKGQPGDVGPPGPAGMKGLPG 873

Query:   486 PPARSGSGQPRG 497
              P R G+  P G
Sbjct:   874 LPGRPGAHGPPG 885

 Score = 124 (48.7 bits), Expect = 0.00077, P = 0.00077
 Identities = 81/259 (31%), Positives = 100/259 (38%)

Query:   255 PVGQNAYEDGY-GVP--QGH-GPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDI 309
             PVG      G+ G P  +GH G P      G  G  G    T +   T  G      +D 
Sbjct:   626 PVGPPGL--GFPGPPGERGHPGVPGRPGVRGPDGLKGQKGDTVSCNVTYPGRQGPPGFDG 683

Query:   310 PRGP-GYEASKG-PGYDASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDA 365
             P GP G+   +G PG   S      P T G S  P   PG+    G PG+  +KGS+   
Sbjct:   684 PPGPKGFPGPQGAPGLSGSDGHKGRPGTPGTSEIPGP-PGFRGDIGDPGFGGEKGSSPVG 742

Query:   366 QRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE-TQRVPGYDVQRG-PVYEAQR 423
               GP       P  + Q+G+  D    P +     PG      VPG    RG P Y    
Sbjct:   743 PPGPP----GSPGVNGQKGIPGD----PAFGHLGPPGKRGLSGVPGIKGPRGDPGYPGAE 794

Query:   424 APSYIPQRGPGYDLQRG-QGYDMRRAPSYDPSRGTGFD-GAPRGAAPHGQVPPPLNNVPY 481
              P+ IP   PG    +G +G+     P      G   + GAP    P GQ  P L   P 
Sbjct:   795 GPAGIPGF-PGLKGPKGREGH--AGFPGVPGPPGHSCERGAP--GIP-GQ--PGLPGDP- 845

Query:   482 GSATPPARSGSGQPRGGNP 500
             GS  P A  G GQP    P
Sbjct:   846 GS--PGAPGGKGQPGDVGP 862


>ZFIN|ZDB-GENE-050809-108
            symbol:pygo2 "pygopus homolog 2 (Drosophila)"
            species:7955 "Danio rerio" [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR001965
            InterPro:IPR019787 Pfam:PF00628 PROSITE:PS50016 SMART:SM00249
            ZFIN:ZDB-GENE-050809-108 GO:GO:0046872 GO:GO:0008270
            Gene3D:3.30.40.10 InterPro:IPR011011 InterPro:IPR013083
            SUPFAM:SSF57903 InterPro:IPR019786 PROSITE:PS01359
            GeneTree:ENSGT00530000063948 CTD:90780 OrthoDB:EOG4QZ7MB
            EMBL:CR628394 IPI:IPI00650328 RefSeq:NP_001028283.2
            UniGene:Dr.159286 SMR:Q1L8T6 Ensembl:ENSDART00000131324
            GeneID:613247 KEGG:dre:613247 InParanoid:Q1L8T6 OMA:RFGMPPQ
            NextBio:20898499 Uniprot:Q1L8T6
        Length = 571

 Score = 139 (54.0 bits), Expect = 4.6e-06, P = 4.6e-06
 Identities = 83/301 (27%), Positives = 103/301 (34%)

Query:   227 MNAPNVDRRADGSYGGATGN-SENETSGRP-----VGQNAYEDGYGVPQ---GHGPPPSA 277
             M +P   +R   S G A  + SE      P     V  N ++D +G P    G G P  A
Sbjct:    16 MKSPEKKKRKSNSQGAAFSHLSEFAPPPTPMVDHLVASNPFDDDFGPPSRSAGGGGPGGA 75

Query:   278 TTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGP 337
             T     GAG       Y     G  M        GPG   S  PG      P   P  GP
Sbjct:    76 TFLPSPGAGGG----GYGGP--GR-MGGGMGFMGGPGGPGSGQPGRRPPFGPP-TPNTGP 127

Query:   338 SYDPAKG--PGYDPTKGPGYDA----QKGSNYDAQRGPNYD--IHRGPSYDPQRGLGYDM 389
              +    G  PG+    G G         G        PN+   +H G  ++P    G  M
Sbjct:   128 HHPLGFGGMPGFGGGGGGGGGGGGGFPPGGPSQFNMPPNFSPPMHPGQGFNPMLSPGA-M 186

Query:   390 QRGPNYDMQRGPGYET----QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQR---GQG 442
               GP      GP +      Q+ P +  Q G  + +   P     RGP +       G G
Sbjct:   187 GGGPGGG--GGPPHPRFGMPQQQPPHG-QGGHPFNSPPLPGGPGPRGPPHGPMNPMGGMG 243

Query:   443 YDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPY-GSATPPARSGS--GQPRGGN 499
               M          G    G   G  P GQ PPP +  PY GS+ P    G   G P GG 
Sbjct:   244 GGMNMMGMGGGGGGGNMVGGHPGMPPQGQFPPPQDG-PYPGSSPPVGEEGKNFGGPGGGP 302

Query:   500 P 500
             P
Sbjct:   303 P 303


>UNIPROTKB|P04258
            symbol:COL3A1 "Collagen alpha-1(III) chain" species:9913
            "Bos taurus" [GO:0005581 "collagen" evidence=IEA] PROSITE:PS01208
            GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793
            HOGENOM:HOG000085654 HOVERGEN:HBG004933 IPI:IPI00731432 PIR:A02862
            UniGene:Bt.64714 STRING:P04258 PRIDE:P04258 Uniprot:P04258
        Length = 1049

 Score = 142 (55.0 bits), Expect = 4.9e-06, P = 4.9e-06
 Identities = 82/262 (31%), Positives = 97/262 (37%)

Query:   252 SGRPVGQNAYEDGYGVPQ---GHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYD 308
             SG P G+       G P    G GPP      G  G  P    SA      G P      
Sbjct:   521 SGAP-GERGPPGAGGPPGPRGGAGPPGPEGGKGAAGP-PGPPGSAGTPGLQGMPGERGG- 577

Query:   309 IPRGPGYEASKG-PGYDA-SKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQ 366
              P GPG +  KG PG      AP  D  +GP+  P   PG  P   PG   + G+     
Sbjct:   578 -PGGPGPKGDKGEPGSSGVDGAPGKDGPRGPT-GPIGPPG--PAGQPGDKGESGA----- 628

Query:   367 RGPNYDIHRGPSYDP-QRG-LGYDMQRG-PNYDMQRG-PGYETQR-VPGYDVQRGPVYEA 421
               P      GP   P +RG  G     G P    Q G PG + +R  PG   + GP   A
Sbjct:   629 --PGVPGIAGPRGGPGERGEQGPPGPAGFPGAPGQNGEPGAKGERGAPGEKGEGGPPGAA 686

Query:   422 QRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPY 481
               A    P   PG    +G+    R +P      G G  G P G  P G  PP  N  P 
Sbjct:   687 GPAGGSGPAGPPGPQGVKGE----RGSPG-----GPGAAGFPGGRGPPG--PPGSNGNPG 735

Query:   482 --GSATPPARSGSGQPRGGNPA 501
               GS+  P + G   P G N A
Sbjct:   736 PPGSSGAPGKDGPPGPPGSNGA 757

 Score = 139 (54.0 bits), Expect = 1.0e-05, P = 1.0e-05
 Identities = 86/286 (30%), Positives = 103/286 (36%)

Query:   236 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTSAY 294
             A G   G  G +       P G + +    G P   GPP     AG  G  GP  +    
Sbjct:    12 AGGGIAGYPGPAGPPGPPGPPGTSGHPGAPGAPGYQGPPGEPGQAGPAGPPGPPGAIGPS 71

Query:   295 AAT-QSGTPMRAAYDIPRG-PGYEASKGP----GYDASKAP-SYDPTKGPSYDPAKGPGY 347
                 +SG P R     PRG PG    KGP    G+   K    +D   G   +P   PG 
Sbjct:    72 GKDGESGRPGRPG---PRGFPGPPGMKGPAGMPGFPGMKGHRGFDGRNGEKGEPG-APGL 127

Query:   348 DPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGP-GYE 404
                 G PG D   G      RG   +  R P      G  G D  RG   D Q GP G  
Sbjct:   128 KGENGVPGEDGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQPGPPGPP 182

Query:   405 -TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 463
              T   PG    +G V  A    S      PG   QRG+      A +  P    G DG+P
Sbjct:   183 GTAGFPGSPGAKGEVGPAGSPGS---SGAPG---QRGEPGPQGHAGAPGPPGPPGSDGSP 236

Query:   464 RGAAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RG--GNPAR 502
              G    G  P  +   P   G+  PP   G+ G P  RG  G P +
Sbjct:   237 GGKGEMG--PAGIPGAPGLIGARGPPGPPGTNGVPGQRGAAGEPGK 280

 Score = 122 (48.0 bits), Expect = 0.00073, P = 0.00073
 Identities = 84/289 (29%), Positives = 101/289 (34%)

Query:   230 PNVDRRADGSYG--GATG----NSENETSG-R-PVGQNAYEDGYGVPQGHGPPPSATTAG 281
             P  +   DGS G  GA G      E    G R P G N      G P   G P  A   G
Sbjct:   304 PKGEDGKDGSPGEPGANGLPGAAGERGVPGFRGPAGANGLPGEKGPPGDRGGPGPAGPRG 363

Query:   282 VVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDP 341
             V G  P  +         G  +R     P GPG     GP     +     P   P   P
Sbjct:   364 VAGE-PGRN-----GLPGGPGLRGIPGSPGGPGSNGKPGPPGSQGETGRPGPPGSPG--P 415

Query:   342 AKGPGYDPTKGP-GYDAQKGSNYDAQRGPNYDIHRGPS-YDPQRG-LGYDMQRGPNYDM- 397
                PG     GP G D   G N + + GP     +GP+  + + G  G     GP+ D  
Sbjct:   416 RGQPGVMGFPGPKGNDGAPGKNGE-RGGPGGPGPQGPAGKNGETGPQGPPGPTGPSGDKG 474

Query:   398 QRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYD-LQRGQGYDMRRAPSYDPSRG 456
               GP    Q + G     GP  E  +     P+   G   +  G+G D   AP     RG
Sbjct:   475 DTGPP-GPQGLQGLPGTSGPPGENGKPGEPGPKGEAGAPGIPGGKG-DSG-APG---ERG 528

Query:   457 TGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNPARR 503
                 G P G  P G   PP      G+A PP   GS    G  G P  R
Sbjct:   529 PPGAGGPPG--PRGGAGPPGPEGGKGAAGPPGPPGSAGTPGLQGMPGER 575


>UNIPROTKB|C9JGE3
            symbol:EWSR1 "Ewing sarcoma breakpoint region 1, isoform
            CRA_e" species:9606 "Homo sapiens" [GO:0000166 "nucleotide binding"
            evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0005622 "intracellular" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102
            PROSITE:PS50199 SMART:SM00360 SMART:SM00547 GO:GO:0000166
            EMBL:CH471095 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
            GO:GO:0005622 EMBL:AC002059 EMBL:AL031186 EMBL:AC000026
            UniGene:Hs.374477 HGNC:HGNC:3508 HOGENOM:HOG000038010 ChiTaRS:EWSR1
            IPI:IPI00953325 SMR:C9JGE3 STRING:C9JGE3 Ensembl:ENST00000332050
            UCSC:uc003aez.3 Uniprot:C9JGE3
        Length = 583

 Score = 127 (49.8 bits), Expect = 5.0e-06, Sum P(2) = 5.0e-06
 Identities = 68/254 (26%), Positives = 95/254 (37%)

Query:   238 GSYGGATGNSENETSGRPV-GQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 290
             G+YG  T  S  +       GQ AY   YG P  G+  P  P A +  V G G    +T+
Sbjct:    42 GTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTT 101

Query:   291 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPS--YDPTKGPSYDPAKGPGY 347
             T+    TQ+    ++AY   P  P Y   + P   A   P     PT+      + G GY
Sbjct:   102 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPTRPQDGNKPTETSQPQSSTG-GY 158

Query:   348 D-PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE 404
             + P+ G G   Q   +Y    G  P   +   PSY P     Y   +  +YD      Y 
Sbjct:   159 NQPSLGYG---QSNYSYPQVPGSYPMQPVTAPPSYPPT---SYSSTQPTSYDQS---SYS 209

Query:   405 TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGF-DGAP 463
              Q   G     G      +  SY  Q    Y  Q G  Y   +APS    + + +    P
Sbjct:   210 QQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGS-YS--QAPSQYSQQSSSYGQQRP 266

Query:   464 RGAAPHGQVPPPLN 477
                 P   + PP++
Sbjct:   267 MDEGPDLDLGPPVD 280

 Score = 57 (25.1 bits), Expect = 5.0e-06, Sum P(2) = 5.0e-06
 Identities = 19/46 (41%), Positives = 21/46 (45%)

Query:   464 RGAAP--HGQ-VPPPLNNVPYGSATP--PA--RSGSGQPRGGNPAR 502
             RG  P   G+ +PPPL   P G   P  P     G G  RGG P R
Sbjct:   382 RGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPR 427


>UNIPROTKB|E2R2K8
            symbol:PPP1R10 "Uncharacterized protein" species:9615
            "Canis lupus familiaris" [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0003677 "DNA
            binding" evidence=IEA] InterPro:IPR000571 InterPro:IPR003617
            InterPro:IPR017923 Pfam:PF00642 Pfam:PF08711 PROSITE:PS50103
            PROSITE:PS51319 SMART:SM00356 SMART:SM00509 GO:GO:0005634
            GO:GO:0003677 GO:GO:0008270 GO:GO:0006351 Gene3D:1.20.930.10
            SUPFAM:SSF47676 CTD:5514 OMA:PPPHEHR GeneTree:ENSGT00530000063820
            EMBL:AAEX03008197 RefSeq:XP_848400.1 Ensembl:ENSCAFT00000000645
            Ensembl:ENSCAFT00000048295 GeneID:481705 KEGG:cfa:481705
            NextBio:20856447 Uniprot:E2R2K8
        Length = 940

 Score = 141 (54.7 bits), Expect = 5.5e-06, P = 5.5e-06
 Identities = 68/268 (25%), Positives = 87/268 (32%)

Query:   238 GSYGGATGNSENETSGRPV---GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAY 294
             G +GG  G+      G P    G + + DG G P   GP       G  G GP       
Sbjct:   653 GPHGGPGGSVGPRLLGPPPPPRGGDPFWDGPGDPMRGGP-----MRGGPGPGPGPYHRGR 707

Query:   295 AATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPG 354
                    P       P  P +  ++G G      P+     GP      G G+ P +GPG
Sbjct:   708 GGRGGNEPP------PPPPPFRGARG-GRSGGGPPN--GRGGPGGGMVGGGGHRPHEGPG 758

Query:   355 YDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQ 414
                   S +    GP   +  G  + P  G G  M  G  +    GPG       G+   
Sbjct:   759 GGMNSSSGHRPHEGPGGGM--GGGHRPHEGPGSSMGGGGGHRPHEGPGGGMGSGSGHRPH 816

Query:   415 RGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPP 474
              GP         + P  GPG  +  G G+         P  G G  G P G  PH  VP 
Sbjct:   817 EGPGSGMGGGSGHRPHEGPGGGMGAGGGHR--------PHEGPGH-GGPHGHRPH-DVPS 866

Query:   475 PLNNVPYGSATPPARSGSGQPRGGNPAR 502
                +   G      R   G   GG   R
Sbjct:   867 HRGHDHRGPPPHEHRGHDGPGHGGGGHR 894

 Score = 139 (54.0 bits), Expect = 9.1e-06, P = 9.1e-06
 Identities = 56/215 (26%), Positives = 74/215 (34%)

Query:   242 GATGNSENETSGRPV-GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSG 300
             G  G +E      P  G      G G P G G P      G  G  P+        + SG
Sbjct:   708 GGRGGNEPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGG-GHRPHEGPGGGMNSSSG 766

Query:   301 TPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKG 360
                        G G+   +GPG        + P +GP      G G+ P +GPG     G
Sbjct:   767 HRPHEGPGGGMGGGHRPHEGPGSSMGGGGGHRPHEGPGGGMGSGSGHRPHEGPGSGMGGG 826

Query:   361 SNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVP---GYDVQRGP 417
             S +    GP   +  G  + P  G G+    GP+       G+    VP   G+D  RGP
Sbjct:   827 SGHRPHEGPGGGMGAGGGHRPHEGPGHG---GPH-------GHRPHDVPSHRGHD-HRGP 875

Query:   418 VYEAQRA---PSYIPQRGPGYDLQRGQGYDMRRAP 449
                  R    P +      G+D     G DM   P
Sbjct:   876 PPHEHRGHDGPGHGGGGHRGHDGGHSHGGDMSNRP 910


>FB|FBgn0261885
            symbol:osa "osa" species:7227 "Drosophila melanogaster"
            [GO:0046530 "photoreceptor cell differentiation" evidence=IMP]
            [GO:0005634 "nucleus" evidence=NAS;IDA] [GO:0016055 "Wnt receptor
            signaling pathway" evidence=IMP] [GO:0008587 "imaginal disc-derived
            wing margin morphogenesis" evidence=IMP] [GO:0007379 "segment
            specification" evidence=IMP] [GO:0003677 "DNA binding"
            evidence=ISS;IDA;NAS] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IDA;IMP] [GO:0045893 "positive regulation
            of transcription, DNA-dependent" evidence=IDA] [GO:0035060 "brahma
            complex" evidence=IDA;TAS] [GO:0003713 "transcription coactivator
            activity" evidence=IC] [GO:0007476 "imaginal disc-derived wing
            morphogenesis" evidence=IMP] [GO:0048190 "wing disc dorsal/ventral
            pattern formation" evidence=IGI] [GO:0042058 "regulation of
            epidermal growth factor receptor signaling pathway" evidence=IMP]
            [GO:0007480 "imaginal disc-derived leg morphogenesis" evidence=IMP]
            [GO:0008586 "imaginal disc-derived wing vein morphogenesis"
            evidence=IMP] [GO:0022008 "neurogenesis" evidence=IMP]
            InterPro:IPR001606 Pfam:PF01388 PROSITE:PS51011 SMART:SM00501
            EMBL:AE014297 GO:GO:0048190 GO:GO:0045893 GO:GO:0016055
            GO:GO:0003677 GO:GO:0008586 GO:GO:0006351 GO:GO:0016568
            eggNOG:NOG12793 GO:GO:0007379 GO:GO:0007480 KO:K11653
            Gene3D:1.10.150.60 InterPro:IPR021906 Pfam:PF12031 SUPFAM:SSF46774
            GeneTree:ENSGT00550000074575 GO:GO:0046530 GO:GO:0008587
            GO:GO:0035060 GO:GO:0042058 EMBL:AF053091 PIR:T13049
            RefSeq:NP_001163639.1 RefSeq:NP_524392.2 RefSeq:NP_732263.1
            UniGene:Dm.2989 ProteinModelPortal:Q8IN94 SMR:Q8IN94 DIP:DIP-20699N
            IntAct:Q8IN94 MINT:MINT-297379 STRING:Q8IN94 PaxDb:Q8IN94
            PRIDE:Q8IN94 EnsemblMetazoa:FBtr0089581 EnsemblMetazoa:FBtr0301487
            GeneID:42130 KEGG:dme:Dmel_CG7467 CTD:42130 FlyBase:FBgn0261885
            InParanoid:Q8IN94 OMA:SQMGQGP OrthoDB:EOG4MCVF9 PhylomeDB:Q8IN94
            ChiTaRS:osa GenomeRNAi:42130 NextBio:827314 Bgee:Q8IN94
            GermOnline:CG7467 Uniprot:Q8IN94
        Length = 2716

 Score = 153 (58.9 bits), Expect = 6.1e-06, Sum P(2) = 6.1e-06
 Identities = 91/348 (26%), Positives = 134/348 (38%)

Query:   170 IPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNA 229
             I A  S   +LR+  H+ +    +E  F    ++ L ++++  +   ++ +  +A  + +
Sbjct:  1065 IGASSSAAYTLRK--HYTKNLLTFECHFDRGDIDPLPIIQQ--VEAGSKKKTAKAASVPS 1120

Query:   230 PNVDRRADGSYGGATGNSENETS-GRPVGQ--NAYEDGY-GVPQGHGPPPSATTAGVVGA 285
             P      D     +TG+S ++ S   P G   NA  DGY G P G  P P A+     G 
Sbjct:  1121 PG-GGHLDAGTTNSTGSSNSQDSFPAPPGSAPNAAIDGYPGYPGG-SPYPVAS-----GP 1173

Query:   286 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKA-PSYDPTK---GPSYDP 341
              P+ +T   A      P +     P  PG  A+   G + S + P  DP     GP    
Sbjct:  1174 QPDYAT---AGQMQRPPSQNNPQTPH-PGAAAAVAAGDNISVSNPFEDPIAAGGGPGSGT 1229

Query:   342 AKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDP----QRGLGYDMQRGPNYDM 397
               GPG  P  GPG  A  G+      G     H  P + P    Q+  G   Q+ P +  
Sbjct:  1230 GPGPGQGP--GPGA-ASGGAGAVGAVGGGPQPHPPPPHSPHTAAQQAAGQHQQQHPQHQH 1286

Query:   398 QRGPGYET-QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRG 456
                PG    Q+  G   Q+ P       P    Q GPG      Q +    A +  P  G
Sbjct:  1287 PGLPGPPPPQQQQGQQGQQPPPSVGGGPPPAPQQHGPGQVPPSPQQHVRPAAGAPYPPGG 1346

Query:   457 TGFDGAPRGAAPHGQVPP-PLNNVPYGSATPPARSGS-GQPRGGNPAR 502
             +G+   P    P    P  P     YGS+     +G  GQP G  P +
Sbjct:  1347 SGYP-TPVSRTPGSPYPSQPGAYGQYGSSDQYNATGPPGQPFGQGPGQ 1393

 Score = 44 (20.5 bits), Expect = 6.1e-06, Sum P(2) = 6.1e-06
 Identities = 7/16 (43%), Positives = 9/16 (56%)

Query:    34 PMPGAFPPFDMMPPPE 49
             P+PG  PP    P P+
Sbjct:   166 PLPGGKPPQQQQPHPQ 181

 Score = 43 (20.2 bits), Expect = 7.7e-06, Sum P(2) = 7.7e-06
 Identities = 8/18 (44%), Positives = 11/18 (61%)

Query:    30 GMRPPMPGAFPPFDMMPP 47
             GM P   G +PP+  +PP
Sbjct:   706 GM-PNHTGQYPPYQWVPP 722

 Score = 43 (20.2 bits), Expect = 7.7e-06, Sum P(2) = 7.7e-06
 Identities = 9/23 (39%), Positives = 13/23 (56%)

Query:    29 SGMRP--PMPGAFPPFDMMPPPE 49
             +G +P  P+PG  PP     PP+
Sbjct:   344 AGQQPGGPVPGGPPPGTGQQPPQ 366

 Score = 42 (19.8 bits), Expect = 9.8e-06, Sum P(2) = 9.8e-06
 Identities = 7/16 (43%), Positives = 7/16 (43%)

Query:    33 PPMPGAFPPFDMMPPP 48
             P  P   PP    PPP
Sbjct:   427 PASPHHVPPLQQQPPP 442

 Score = 39 (18.8 bits), Expect = 2.0e-05, Sum P(2) = 2.0e-05
 Identities = 7/20 (35%), Positives = 10/20 (50%)

Query:    30 GMRPPMPGAFPPFDMMPPPE 49
             G  P  P  +PP +  P P+
Sbjct:   648 GYPPQQPQQYPPGNYPPRPQ 667

 Score = 38 (18.4 bits), Expect = 2.5e-05, Sum P(2) = 2.5e-05
 Identities = 9/20 (45%), Positives = 9/20 (45%)

Query:    28 VSGMRPPMPGAFPPFDMMPP 47
             V G  PP  G  PP    PP
Sbjct:   352 VPGGPPPGTGQQPPQQNTPP 371

 Score = 37 (18.1 bits), Expect = 3.2e-05, Sum P(2) = 3.2e-05
 Identities = 6/12 (50%), Positives = 7/12 (58%)

Query:    36 PGAFPPFDMMPP 47
             PG +PP    PP
Sbjct:   659 PGNYPPRPQYPP 670


>ZFIN|ZDB-GENE-040426-1010
            symbol:fus "fusion (involved in t(12;16) in
            malignant liposarcoma)" species:7955 "Danio rerio" [GO:0000166
            "nucleotide binding" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0005622 "intracellular" evidence=IEA] InterPro:IPR000504
            InterPro:IPR001876 InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641
            PROSITE:PS01358 PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360
            SMART:SM00547 ZFIN:ZDB-GENE-040426-1010 GO:GO:0000166 GO:GO:0008270
            Gene3D:3.30.70.330 GO:GO:0003676 GO:GO:0005622
            GeneTree:ENSGT00530000063105 KO:K13098 CTD:2521 EMBL:BX571714
            IPI:IPI00785727 RefSeq:NP_957377.2 UniGene:Dr.114403
            Ensembl:ENSDART00000055340 GeneID:394058 KEGG:dre:394058
            NextBio:20815017 Bgee:F1R0M4 Uniprot:F1R0M4
        Length = 541

 Score = 137 (53.3 bits), Expect = 7.1e-06, P = 7.1e-06
 Identities = 64/250 (25%), Positives = 91/250 (36%)

Query:   230 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNT 289
             P+    +  SYGG   N  +E+S  P  Q  Y   YG  Q  G    A + G   +  + 
Sbjct:    28 PSAQNYSQQSYGGY--NQSSESSSAPYNQGGYSSNYGQSQSGGYGSQAPSQGYSQSSQSY 85

Query:   290 STSAYAATQSGTPMRAAYDIPRGPGY-EASKGPGYDASKAPSYDPTKGPSYDPAKGPGYD 348
             S+  Y+ T    P ++        GY + S   GY+ S +P+  P    S   + G G  
Sbjct:    86 SSGGYSNTSQPPPAQSG-------GYSQQSSYSGYNQS-SPASAPGGYSSSSQSSGYGQQ 137

Query:   349 PTK-GPGYDAQKGSN--YDAQRGPNYDIH-RGPSYDPQRGLGYDMQRGPNYDMQRGPGYE 404
               + G GY    G +  Y +  G +      G  +   +  G      PNY       Y 
Sbjct:   138 QQQSGGGYGGSGGQSGGYGSSGGQSSGFGGSGGQHQSSQSGGGSYSPSPNYSSPPPQSYG 197

Query:   405 TQRV---PGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDG 461
              Q      GY+    P+        Y  Q G GY  Q G+G    R   +      GFD 
Sbjct:   198 QQSQYGQGGYNQDSPPMSGGGGGGGYGGQDG-GYS-QDGRG-GRGRGGGFGGRGAGGFDR 254

Query:   462 APRGAAPHGQ 471
               RG  P G+
Sbjct:   255 GGRGG-PRGR 263


>UNIPROTKB|I3LQ53
            symbol:I3LQ53 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0006366 "transcription from RNA polymerase II promoter"
            evidence=IEA] [GO:0005665 "DNA-directed RNA polymerase II, core
            complex" evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA]
            InterPro:IPR000684 Pfam:PF05001 PROSITE:PS00115 GO:GO:0003677
            GO:GO:0006366 GO:GO:0005665 GeneTree:ENSGT00700000104490
            EMBL:FP565284 Ensembl:ENSSSCT00000030016 OMA:YAESDYL Uniprot:I3LQ53
        Length = 543

 Score = 137 (53.3 bits), Expect = 7.1e-06, P = 7.1e-06
 Identities = 71/265 (26%), Positives = 95/265 (35%)

Query:   228 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP--SATTAGVVGA 285
             N P +   A G  G   G++ +   G       +  G     G   P   S  T G  G 
Sbjct:    62 NIPGLG--AAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGF 119

Query:   286 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP 345
              P+ ++ A   +   +P  A    P  PG      PG  +   PS      PSY P   P
Sbjct:   120 SPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGAMSPSYSPTS-P 171

Query:   346 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 405
              Y+P    GY  Q  S Y +   P+Y     PSY P     Y     P+Y     P Y  
Sbjct:   172 AYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPSYS-PTSPSYSP 225

Query:   406 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 465
                P Y     P Y +  +PSY P   P Y       Y    +PSY P+  +    +P  
Sbjct:   226 TS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPTSPSYSPTSP-S 278

Query:   466 AAPHGQVPPPLNNVPYGSATPPARS 490
              +P      P +  P  S T P+ S
Sbjct:   279 YSPTSPSYSPTS--PSYSPTSPSYS 301

 Score = 121 (47.7 bits), Expect = 0.00040, P = 0.00040
 Identities = 63/225 (28%), Positives = 80/225 (35%)

Query:   273 PPPSATTAGVVGAGPNTSTSAYAATQSG-TPMRAAYDIPRGPGYEASKGPGYDAS----- 326
             P  S T+       PN     Y  T    +P   +Y  P  P Y  +  P Y  S     
Sbjct:   333 PSYSPTSPSYSPTSPN-----YTPTSPNYSPTSPSYS-PTSPSYSPTS-PSYSPSSPRYT 385

Query:   327 -KAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL 385
              ++P+Y P+  PSY P+  P Y PT  P Y     S Y     P Y     P Y P    
Sbjct:   386 PQSPTYTPSS-PSYSPSS-PSYSPTS-PKYTPTSPS-YSPS-SPEYT-PTSPKYSPTSPK 439

Query:   386 GYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDM 445
              Y     P Y     P Y +   P Y     P Y +  +P Y P   P Y       Y  
Sbjct:   440 -YS-PTSPKYS-PTSPTY-SPTTPKYS-PTSPTY-SPTSPVYTPT-SPKYS-PTSPTYSP 491

Query:   446 RRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARS 490
               +P Y P+  T    +P+G+      P      P  S T PA S
Sbjct:   492 T-SPKYSPTSPTYSPTSPKGSTYSPTSPGYSPTSPTYSLTSPAIS 535


>UNIPROTKB|F1MXS8
            symbol:COL3A1 "Collagen alpha-1(III) chain" species:9913
            "Bos taurus" [GO:0071230 "cellular response to amino acid stimulus"
            evidence=IEA] [GO:0050777 "negative regulation of immune response"
            evidence=IEA] [GO:0048565 "digestive tract development"
            evidence=IEA] [GO:0048407 "platelet-derived growth factor binding"
            evidence=IEA] [GO:0046332 "SMAD binding" evidence=IEA] [GO:0043588
            "skin development" evidence=IEA] [GO:0043206 "extracellular fibril
            organization" evidence=IEA] [GO:0042060 "wound healing"
            evidence=IEA] [GO:0034097 "response to cytokine stimulus"
            evidence=IEA] [GO:0032964 "collagen biosynthetic process"
            evidence=IEA] [GO:0030199 "collagen fibril organization"
            evidence=IEA] [GO:0018149 "peptide cross-linking" evidence=IEA]
            [GO:0009314 "response to radiation" evidence=IEA] [GO:0007507
            "heart development" evidence=IEA] [GO:0007229 "integrin-mediated
            signaling pathway" evidence=IEA] [GO:0007179 "transforming growth
            factor beta receptor signaling pathway" evidence=IEA] [GO:0007160
            "cell-matrix adhesion" evidence=IEA] [GO:0005615 "extracellular
            space" evidence=IEA] [GO:0005586 "collagen type III" evidence=IEA]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA] [GO:0001568 "blood vessel development" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0043588 GO:GO:0005615
            GO:GO:0007507 GO:GO:0034097 GO:GO:0030199 GO:GO:0007179
            GO:GO:0007229 GO:GO:0007160 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0042060 GO:GO:0001568 GO:GO:0048565 GO:GO:0050777
            GO:GO:0009314 GO:GO:0018149 GO:GO:0032964 GO:GO:0071230
            GO:GO:0043206 GO:GO:0005201 GeneTree:ENSGT00660000095287
            IPI:IPI00731432 OMA:EGSPGHP GO:GO:0005586 EMBL:DAAA02003919
            EMBL:DAAA02003920 Ensembl:ENSBTAT00000028617 ArrayExpress:F1MXS8
            Uniprot:F1MXS8
        Length = 1466

 Score = 142 (55.0 bits), Expect = 7.3e-06, P = 7.3e-06
 Identities = 82/262 (31%), Positives = 97/262 (37%)

Query:   252 SGRPVGQNAYEDGYGVPQ---GHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYD 308
             SG P G+       G P    G GPP      G  G  P    SA      G P      
Sbjct:   677 SGAP-GERGPPGAGGPPGPRGGAGPPGPEGGKGAAGP-PGPPGSAGTPGLQGMPGERGG- 733

Query:   309 IPRGPGYEASKG-PGYDA-SKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQ 366
              P GPG +  KG PG      AP  D  +GP+  P   PG  P   PG   + G+     
Sbjct:   734 -PGGPGPKGDKGEPGSSGVDGAPGKDGPRGPT-GPIGPPG--PAGQPGDKGESGA----- 784

Query:   367 RGPNYDIHRGPSYDP-QRG-LGYDMQRG-PNYDMQRG-PGYETQR-VPGYDVQRGPVYEA 421
               P      GP   P +RG  G     G P    Q G PG + +R  PG   + GP   A
Sbjct:   785 --PGVPGIAGPRGGPGERGEQGPPGPAGFPGAPGQNGEPGAKGERGAPGEKGEGGPPGAA 842

Query:   422 QRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPY 481
               A    P   PG    +G+    R +P      G G  G P G  P G  PP  N  P 
Sbjct:   843 GPAGGSGPAGPPGPQGVKGE----RGSPG-----GPGAAGFPGGRGPPG--PPGSNGNPG 891

Query:   482 --GSATPPARSGSGQPRGGNPA 501
               GS+  P + G   P G N A
Sbjct:   892 PPGSSGAPGKDGPPGPPGSNGA 913

 Score = 129 (50.5 bits), Expect = 0.00019, P = 0.00019
 Identities = 78/257 (30%), Positives = 104/257 (40%)

Query:   266 GVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMR--AAYDIPRGP----GYEASK 319
             G P   GPP    +    G   +    AY   +SG      A Y  P GP    G   + 
Sbjct:   130 GSPGSPGPPGICESCPTGGQNYSPQYEAYDV-KSGVAGGGIAGYPGPAGPPGPPGPPGTS 188

Query:   320 G-PGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP----GYDAQKGS-NYDAQRG-PNYD 372
             G PG    KA    P +  SY P   PG     GP    G D + G      +RG P   
Sbjct:   189 GHPGAPHLKAWQKPPQQSTSYSPIGPPGPPGAIGPSGPAGKDGESGRPGRPGERGFPGPP 248

Query:   373 IHRGPSYDP----QRG-LGYDMQRGPNYDMQRGPGYETQR-VPGYDVQRGPVYEAQRAPS 426
               +GP+  P     +G  G+D + G   +    PG + +  VPG +   GP+   + AP 
Sbjct:   249 GMKGPAGMPGFPGMKGHRGFDGRNGEKGETG-APGLKGENGVPGENGAPGPM-GPRGAPG 306

Query:   427 YIPQRG-PGYDLQRG----QGYDMRRAPSYDPSRGT-GFDGAPRGAAPHGQVPPPLNNVP 480
                + G PG    RG    +G D +  P   P  GT GF G+P GA   G+V P     P
Sbjct:   307 ERGRPGLPGAAGARGNDGARGSDGQPGPPGPP--GTAGFPGSP-GAK--GEVGPA--GSP 359

Query:   481 YGSATPPARSGSGQPRG 497
              GS+  P + G   P+G
Sbjct:   360 -GSSGAPGQRGEPGPQG 375

 Score = 123 (48.4 bits), Expect = 0.00084, P = 0.00084
 Identities = 84/289 (29%), Positives = 101/289 (34%)

Query:   230 PNVDRRADGSYG--GATG----NSENETSG-R-PVGQNAYEDGYGVPQGHGPPPSATTAG 281
             P  +   DGS G  GA G      E    G R P G N      G P   G P  A   G
Sbjct:   460 PKGEDGKDGSPGEPGANGLPGAAGERGVPGFRGPAGANGLPGEKGPPGDRGGPGPAGPRG 519

Query:   282 VVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDP 341
             V G  P            G  +R     P GPG +   GP     +     P   P   P
Sbjct:   520 VAGE-PGRD-----GLPGGPGLRGIPGSPGGPGSDGKPGPPGSQGETGRPGPPGSPG--P 571

Query:   342 AKGPGYDPTKGP-GYDAQKGSNYDAQRGPNYDIHRGPS-YDPQRG-LGYDMQRGPNYDM- 397
                PG     GP G D   G N + + GP     +GP+  + + G  G     GP+ D  
Sbjct:   572 RGQPGVMGFPGPKGNDGAPGKNGE-RGGPGGPGPQGPAGKNGETGPQGPPGPTGPSGDKG 630

Query:   398 QRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYD-LQRGQGYDMRRAPSYDPSRG 456
               GP    Q + G     GP  E  +     P+   G   +  G+G D   AP     RG
Sbjct:   631 DTGPP-GPQGLQGLPGTSGPPGENGKPGEPGPKGEAGAPGIPGGKG-DSG-APG---ERG 684

Query:   457 TGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNPARR 503
                 G P G  P G   PP      G+A PP   GS    G  G P  R
Sbjct:   685 PPGAGGPPG--PRGGAGPPGPEGGKGAAGPPGPPGSAGTPGLQGMPGER 731


>TAIR|locus:2043530
            symbol:AT2G25970 "AT2G25970" species:3702 "Arabidopsis
            thaliana" [GO:0003676 "nucleic acid binding" evidence=ISS]
            [GO:0003723 "RNA binding" evidence=IEA] [GO:0005829 "cytosol"
            evidence=IDA] [GO:0006606 "protein import into nucleus"
            evidence=RCA] InterPro:IPR004087 InterPro:IPR004088 Pfam:PF00013
            PROSITE:PS50084 SMART:SM00322 GO:GO:0005829 EMBL:CP002685
            GenomeReviews:CT485783_GR GO:GO:0003723 EMBL:AC004747 EMBL:AC005395
            eggNOG:NOG300923 KO:K13210 HSSP:Q9UNW9 EMBL:AY078954 EMBL:AK226845
            IPI:IPI00540360 PIR:T02627 RefSeq:NP_180167.1 UniGene:At.21555
            ProteinModelPortal:O82762 SMR:O82762 STRING:O82762 PaxDb:O82762
            PRIDE:O82762 ProMEX:O82762 EnsemblPlants:AT2G25970.1 GeneID:817137
            KEGG:ath:AT2G25970 TAIR:At2g25970 HOGENOM:HOG000242545
            InParanoid:O82762 OMA:AANSTQD PhylomeDB:O82762
            ProtClustDB:CLSN2913011 ArrayExpress:O82762 Genevestigator:O82762
            Uniprot:O82762
        Length = 632

 Score = 140 (54.3 bits), Expect = 8.3e-06, Sum P(2) = 8.3e-06
 Identities = 76/283 (26%), Positives = 100/283 (35%)

Query:   230 PNVDRRADGSYGGATGNSENETSGRPVGQNAYE---DGYGVPQGHGPPPSATTAGVVGAG 286
             P   +   GSY   T     + S  P  Q + +   D YG  Q   P    ++A      
Sbjct:   355 PQYGQSPYGSYPQQTSAGYYDQSSVPPSQQSAQGEYDYYGQQQSQQPSSGGSSA------ 408

Query:   287 PNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG 346
             P T T+ Y   Q  +    A     G GY+      Y+AS+   Y    G  YD  +G G
Sbjct:   409 PPTDTTGYNYYQHASGYGQA-----GQGYQQDGYGAYNASQQSGYGQAAG--YDQ-QG-G 459

Query:   347 YDPTKGPGYD---AQKGSNYDAQRGP-NYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPG 402
             Y  T  P  +   +Q      AQ G   Y    G     Q   G   Q G         G
Sbjct:   460 YGSTTNPSQEEDASQAAPPSSAQSGQAGYGT-TGQQPPAQGSTG---QAGYGAPPTSQAG 515

Query:   403 YETQRVPGYDVQRGPVYEAQRAPSY-IPQRGPGYDLQRGQ--GYDMRRAPSYDPSRGTGF 459
             Y +Q    Y+   G    A + P+Y   Q+ PG     G   GY    A  Y      G+
Sbjct:   516 YSSQPAAAYNSGYGAPPPASKPPTYGQSQQSPGAPGSYGSQSGYAQPAASGYGQPPAYGY 575

Query:   460 DGAPRGAAPHGQVPPPLNNVPYGS-ATPPARSGSGQPRGGNPA 501
               AP+G   +G    P     Y S  +  A +G G   GG PA
Sbjct:   576 GQAPQGYGSYGGYTQPAAGGGYSSDGSAGATAGGG---GGTPA 615

 Score = 123 (48.4 bits), Expect = 0.00057, Sum P(2) = 0.00057
 Identities = 69/265 (26%), Positives = 89/265 (33%)

Query:   246 NSENETSGRPVGQN-AYEDGYGV-PQGHGPPPSATTAGVVGAGPNTSTSAYAAT-QSGTP 302
             + EN      +G     + GY   P     PP    A   G G      AY    Q G  
Sbjct:   302 SGENRMRNSAMGGGYPQQGGYQARPPSSWAPPGGPPAQP-GYGGYMQPGAYPGPPQYGQS 360

Query:   303 MRAAYDIPRGPGY-EASKGPGYDASKAPSYDPT-KGPSYDPAKG-PGYDPTKGPGYDA-Q 358
                +Y      GY + S  P    S    YD   +  S  P+ G     PT   GY+  Q
Sbjct:   361 PYGSYPQQTSAGYYDQSSVPPSQQSAQGEYDYYGQQQSQQPSSGGSSAPPTDTTGYNYYQ 420

Query:   359 KGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPV 418
               S Y  Q G  Y      +Y+  +  GY    G  YD Q G G  T   P  +      
Sbjct:   421 HASGY-GQAGQGYQQDGYGAYNASQQSGYGQAAG--YDQQGGYGSTTN--PSQEEDA--- 472

Query:   419 YEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNN 478
               +Q AP    Q G     Q G G   ++ P+   +   G+   P   A +   P    N
Sbjct:   473 --SQAAPPSSAQSG-----QAGYGTTGQQPPAQGSTGQAGYGAPPTSQAGYSSQPAAAYN 525

Query:   479 VPYGSATP---PARSGSGQPRGGNP 500
               YG+  P   P   G  Q   G P
Sbjct:   526 SGYGAPPPASKPPTYGQSQQSPGAP 550

 Score = 107 (42.7 bits), Expect = 0.00022, Sum P(2) = 0.00022
 Identities = 57/201 (28%), Positives = 76/201 (37%)

Query:   312 GPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNY 371
             G GY   +G GY A    S+ P  GP   PA+ PGY      GY  Q G+ Y     P Y
Sbjct:   313 GGGYP-QQG-GYQARPPSSWAPPGGP---PAQ-PGYG-----GY-MQPGA-YPGP--PQY 357

Query:   372 DIHRGPSYDPQRGLGYDMQRG--PNYDMQRGP----GYETQRVPGYDVQRGPVYEAQRAP 425
                   SY  Q   GY  Q    P+    +G     G +  + P       P  +     
Sbjct:   358 GQSPYGSYPQQTSAGYYDQSSVPPSQQSAQGEYDYYGQQQSQQPSSGGSSAPPTDTTGYN 417

Query:   426 SYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG---AAPHGQVPPPLNNVPYG 482
              Y  Q   GY  Q GQGY      +Y+ S+ +G+ G   G      +G    P       
Sbjct:   418 YY--QHASGYG-QAGQGYQQDGYGAYNASQQSGY-GQAAGYDQQGGYGSTTNPSQEEDAS 473

Query:   483 SATPPARSGSGQPRGGNPARR 503
              A PP+ + SGQ   G   ++
Sbjct:   474 QAAPPSSAQSGQAGYGTTGQQ 494

 Score = 63 (27.2 bits), Expect = 0.00022, Sum P(2) = 0.00022
 Identities = 26/107 (24%), Positives = 43/107 (40%)

Query:   218 EVEKLRAELMNA-----PNVDRRADGSYGGATGNSENETSGRPVG---QNAYEDGYGVPQ 269
             + +++ A L+N+     P VD  A   YG   G S   + G+ +     ++    YG  Q
Sbjct:    73 KAQEIAARLLNSADAKRPRVDNGASYDYGDNKGFSSYPSEGKQMSGTVPSSIPVSYGSFQ 132

Query:   270 GHGPP---PSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGP 313
             G       P+     ++G G  T    Y   QSG  ++   D+   P
Sbjct:   133 GTTKKIDIPNMRVGVIIGKGGETIK--YLQLQSGAKIQVTRDMDADP 177

 Score = 42 (19.8 bits), Expect = 8.3e-06, Sum P(2) = 8.3e-06
 Identities = 13/40 (32%), Positives = 20/40 (50%)

Query:    78 TLRQELAAAQHELQI--LHGQIGGMKSERELQMRNLTEKI 115
             T++   A     +Q+  LH   G    ER LQ+  +TE+I
Sbjct:   251 TIKSMQAKTGARIQVIPLHLPPGDPTPERTLQIDGITEQI 290


>UNIPROTKB|J9P8F7
            symbol:COL5A1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0005201 GeneTree:ENSGT00700000104155 EMBL:AAEX03006798
            EMBL:AAEX03006799 EMBL:AAEX03006800 Ensembl:ENSCAFT00000044143
            Uniprot:J9P8F7
        Length = 1405

 Score = 141 (54.7 bits), Expect = 9.0e-06, P = 9.0e-06
 Identities = 77/254 (30%), Positives = 100/254 (39%)

Query:   255 PVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG-P 313
             PVG    +   G P   GP  S    G  GA            Q G P  A     +G P
Sbjct:   634 PVGALGLKGSEGPPGPPGPAGSPGERGPAGAAGPIGIPGRPGPQ-GPPGPAG---EKGAP 689

Query:   314 GYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRGPN 370
             G +  +GP G D  + P   P  GP+  P   PG D  KG  G   QKGS  D  ++GP 
Sbjct:   690 GEKGPQGPAGRDGLQGPVGLP--GPA-GPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGPP 746

Query:   371 YDIHRGPSYDPQRGLGYDMQRGPN-YDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIP 429
                  GP+  PQ  +G   Q GP+  D + GP  + Q + G     GP       P  + 
Sbjct:   747 -----GPT-GPQGPIG---QPGPSGADGEPGPRGQ-QGLFGQKGDEGP-RGFPGPPGPVG 795

Query:   430 QRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNV-PYGSATPP 487
              +G PG   ++G+  D+ +     P    G  GAP    P G  P  + N    G    P
Sbjct:   796 LQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGAPGADGPQGP-PGGIGNPGAVGEKGEP 854

Query:   488 ARSGS-GQPRGGNP 500
               +G  G P  G P
Sbjct:   855 GEAGEPGLPGEGGP 868


>UNIPROTKB|E1C0T1
            symbol:TFG "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0004871 "signal transducer activity" evidence=IEA]
            [GO:0042802 "identical protein binding" evidence=IEA] [GO:0043123
            "positive regulation of I-kappaB kinase/NF-kappaB cascade"
            evidence=IEA] InterPro:IPR000270 Pfam:PF00564 SMART:SM00666
            GO:GO:0043123 GO:GO:0004871 CTD:10342 KO:K09292 OMA:YTTQTSQ
            GeneTree:ENSGT00510000047809 EMBL:AADN02032793 IPI:IPI00599103
            RefSeq:XP_416608.1 UniGene:Gga.1550 PRIDE:E1C0T1
            Ensembl:ENSGALT00000024692 GeneID:418391 KEGG:gga:418391
            NextBio:20821576 Uniprot:E1C0T1
        Length = 395

 Score = 134 (52.2 bits), Expect = 9.0e-06, P = 9.0e-06
 Identities = 57/210 (27%), Positives = 81/210 (38%)

Query:   285 AGPNTSTSAYAATQSGTP--MRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPA 342
             AGP    SA A  +SGTP  + ++      PG +  + P Y  ++  +    +G  Y   
Sbjct:   194 AGP---PSAPAEERSGTPDSIASSSSAAHPPGVQPQQAP-YPGAQPQTGQQVEGQMYQQY 249

Query:   343 KGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPG 402
             + PGY P + P   AQ    Y  Q    Y   +  S   Q+   Y  Q  P      G G
Sbjct:   250 QQPGY-PAQQP--QAQPQQQYGVQYPAGYSPQQAASQPTQQFPAYSQQPAPAAAFP-GQG 305

Query:   403 YETQRVPGYDVQRGPV--YEAQ----RAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRG 456
              + Q++P    Q+ P   +  Q    +A    P  GP    Q   G    R P + P  G
Sbjct:   306 -QAQQLPAQQPQQYPAGSFPPQPYTTQASQPAPYSGPP-GAQAAPGTFQPR-PGFTPPPG 362

Query:   457 TGFDGAPRGAAPHGQVPPPLNNVPYGSATP 486
             +     P G  P+ +  PP    P G A P
Sbjct:   363 STMTPPPSGPNPYARTRPPFG--PQGYAQP 390

 Score = 133 (51.9 bits), Expect = 1.2e-05, P = 1.2e-05
 Identities = 54/197 (27%), Positives = 70/197 (35%)

Query:   310 PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGP 369
             P  P  E S  P   AS + +  P   P   P + P Y     PG   Q G   + Q   
Sbjct:   197 PSAPAEERSGTPDSIASSSSAAHP---PGVQPQQAP-Y-----PGAQPQTGQQVEGQM-- 245

Query:   370 NYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSY-I 428
              Y  ++ P Y  Q+      Q+   Y +Q   GY  Q+      Q+ P Y  Q AP+   
Sbjct:   246 -YQQYQQPGYPAQQPQAQPQQQ---YGVQYPAGYSPQQAASQPTQQFPAYSQQPAPAAAF 301

Query:   429 PQRGPGYDL--QRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATP 486
             P +G    L  Q+ Q Y     P   P        AP    P  Q  P       G   P
Sbjct:   302 PGQGQAQQLPAQQPQQYPAGSFPP-QPYTTQASQPAPYSGPPGAQAAPGTFQPRPGFTPP 360

Query:   487 PARSGSGQPRGGNPARR 503
             P  + +  P G NP  R
Sbjct:   361 PGSTMTPPPSGPNPYAR 377


>UNIPROTKB|F1LLX1
            symbol:Col11a1 "Collagen alpha-1(XI) chain" species:10116
            "Rattus norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 Gene3D:2.60.120.200 InterPro:IPR008985
            InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 InterPro:IPR001791 SMART:SM00282 Pfam:PF02210
            GO:GO:0005201 SMART:SM00210 GeneTree:ENSGT00700000104155
            OMA:HPGKEGQ IPI:IPI00949317 Ensembl:ENSRNOT00000024138
            ArrayExpress:F1LLX1 Uniprot:F1LLX1
        Length = 1803

 Score = 142 (55.0 bits), Expect = 9.3e-06, P = 9.3e-06
 Identities = 87/280 (31%), Positives = 107/280 (38%)

Query:   242 GATGNSENE-TSGR--PVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTST----SA 293
             GA G+   +  SG+  P G   +    G+P   G P      G  G  GP  S     SA
Sbjct:  1003 GAKGDPGPQGISGKDGPAGIRGFPGERGLPGAQGAPGLKGGEGPQGPQGPIGSPGERGSA 1062

Query:   294 YAATQSGTPMRAAYDIPRGP----GYEASKGP----GYDASKAPSYDPTKGPSYDPAKGP 345
               A   G P R     P GP    G    KGP    G D  + P   P  GP+  PA  P
Sbjct:  1063 GTAGPIGLPGRPGPQGPPGPAGEKGAPGEKGPQGPAGRDGVQGPVGLP--GPA-GPAGSP 1119

Query:   346 GYDPTKGP-GYDAQKGSNYD-AQRGPNYDIH-RGPSYDPQRGL-GYDMQRGPN--YDMQR 399
             G D  KG  G   QKGS  D  + GP      +GP   P  G+ G D + GP     M  
Sbjct:  1120 GEDGDKGEIGEPGQKGSKGDKGENGPPGPPGLQGPVGAP--GIAGGDGEAGPRGQQGMFG 1177

Query:   400 GPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDL-QRGQ-GYDMRRAPSYDPSRGT 457
               G E  R  G+    GP+   Q  P    ++G   D+   G  G    R P   P+   
Sbjct:  1178 QKGDEGAR--GFPGPPGPI-GLQGLPGPPGEKGENGDVGPMGPPGPPGPRGPQ-GPNGAD 1233

Query:   458 GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 497
             G  G P      G V         G+  PP  +GSG P+G
Sbjct:  1234 GPQGPPGSIGSVGGVGEKGEPGEAGNPGPPGEAGSGGPKG 1273


>RGD|2372
            symbol:Col11a1 "collagen, type XI, alpha 1" species:10116 "Rattus
          norvegicus" [GO:0001502 "cartilage condensation" evidence=ISO]
          [GO:0001503 "ossification" evidence=IEP] [GO:0002063 "chondrocyte
          development" evidence=ISO] [GO:0003007 "heart morphogenesis"
          evidence=ISO] [GO:0005201 "extracellular matrix structural
          constituent" evidence=TAS] [GO:0005581 "collagen" evidence=ISO]
          [GO:0005592 "collagen type XI" evidence=ISO] [GO:0006029
          "proteoglycan metabolic process" evidence=ISO] [GO:0007601 "visual
          perception" evidence=ISO] [GO:0007605 "sensory perception of sound"
          evidence=ISO] [GO:0030199 "collagen fibril organization"
          evidence=ISO;TAS] [GO:0031012 "extracellular matrix"
          evidence=ISO;IDA] [GO:0035989 "tendon development" evidence=ISO]
          [GO:0042472 "inner ear morphogenesis" evidence=ISO] [GO:0048704
          "embryonic skeletal system morphogenesis" evidence=ISO] [GO:0048705
          "skeletal system morphogenesis" evidence=ISO] [GO:0050910 "detection
          of mechanical stimulus involved in sensory perception of sound"
          evidence=ISO] [GO:0051216 "cartilage development" evidence=ISO]
          [GO:0055010 "ventricular cardiac muscle tissue morphogenesis"
          evidence=ISO] InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078
          PROSITE:PS51461 SMART:SM00038 RGD:2372 GO:GO:0046872 GO:GO:0007601
          GO:GO:0030199 Gene3D:2.60.120.200 InterPro:IPR008985
          InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0042472 GO:GO:0050910
          GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793
          InterPro:IPR001791 SMART:SM00282 PROSITE:PS50025 GO:GO:0001502
          GO:GO:0048704 GO:GO:0006029 GO:GO:0055010 Pfam:PF02210 GO:GO:0005201
          GO:GO:0002063 HOGENOM:HOG000085654 KO:K06236 HOVERGEN:HBG103137
          OrthoDB:EOG49GKHM SMART:SM00210 GeneTree:ENSGT00700000104155 CTD:1301
          EMBL:AABR03012126 EMBL:AABR03013126 EMBL:AABR03014171
          EMBL:AABR03015382 EMBL:AABR03015832 EMBL:AABR03016562
          EMBL:AABR03017847 EMBL:AABR03017951 EMBL:AABR03018245
          EMBL:AABR03019675 EMBL:AABR03023874 EMBL:U20116 EMBL:U20118
          EMBL:U20121 IPI:IPI00189470 IPI:IPI00189494 IPI:IPI00325589
          IPI:IPI00949317 IPI:IPI00959233 PIR:B31795 RefSeq:NP_037249.1
          UniGene:Rn.260 IntAct:P20909 STRING:P20909 PhosphoSite:P20909
          PRIDE:P20909 Ensembl:ENSRNOT00000023693 Ensembl:ENSRNOT00000068413
          GeneID:25654 KEGG:rno:25654 UCSC:RGD:2372 InParanoid:P20909
          NextBio:607535 ArrayExpress:P20909 Genevestigator:P20909
          GermOnline:ENSRNOG00000023148 Uniprot:P20909
        Length = 1804

 Score = 142 (55.0 bits), Expect = 9.3e-06, P = 9.3e-06
 Identities = 87/280 (31%), Positives = 107/280 (38%)

Query:   242 GATGNSENE-TSGR--PVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTST----SA 293
             GA G+   +  SG+  P G   +    G+P   G P      G  G  GP  S     SA
Sbjct:  1004 GAKGDPGPQGISGKDGPAGIRGFPGERGLPGAQGAPGLKGGEGPQGPQGPIGSPGERGSA 1063

Query:   294 YAATQSGTPMRAAYDIPRGP----GYEASKGP----GYDASKAPSYDPTKGPSYDPAKGP 345
               A   G P R     P GP    G    KGP    G D  + P   P  GP+  PA  P
Sbjct:  1064 GTAGPIGLPGRPGPQGPPGPAGEKGAPGEKGPQGPAGRDGVQGPVGLP--GPA-GPAGSP 1120

Query:   346 GYDPTKGP-GYDAQKGSNYD-AQRGPNYDIH-RGPSYDPQRGL-GYDMQRGPN--YDMQR 399
             G D  KG  G   QKGS  D  + GP      +GP   P  G+ G D + GP     M  
Sbjct:  1121 GEDGDKGEIGEPGQKGSKGDKGENGPPGPPGLQGPVGAP--GIAGGDGEAGPRGQQGMFG 1178

Query:   400 GPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDL-QRGQ-GYDMRRAPSYDPSRGT 457
               G E  R  G+    GP+   Q  P    ++G   D+   G  G    R P   P+   
Sbjct:  1179 QKGDEGAR--GFPGPPGPI-GLQGLPGPPGEKGENGDVGPMGPPGPPGPRGPQ-GPNGAD 1234

Query:   458 GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 497
             G  G P      G V         G+  PP  +GSG P+G
Sbjct:  1235 GPQGPPGSIGSVGGVGEKGEPGEAGNPGPPGEAGSGGPKG 1274


>UNIPROTKB|P20909
            symbol:Col11a1 "Collagen alpha-1(XI) chain" species:10116
            "Rattus norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            [GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000885
            Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461 SMART:SM00038 RGD:2372
            GO:GO:0046872 GO:GO:0007601 GO:GO:0030199 Gene3D:2.60.120.200
            InterPro:IPR008985 InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0042472
            GO:GO:0050910 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            eggNOG:NOG12793 InterPro:IPR001791 SMART:SM00282 PROSITE:PS50025
            GO:GO:0001502 GO:GO:0048704 GO:GO:0006029 GO:GO:0055010
            Pfam:PF02210 GO:GO:0005201 GO:GO:0002063 HOGENOM:HOG000085654
            KO:K06236 HOVERGEN:HBG103137 OrthoDB:EOG49GKHM SMART:SM00210
            GeneTree:ENSGT00700000104155 CTD:1301 EMBL:AABR03012126
            EMBL:AABR03013126 EMBL:AABR03014171 EMBL:AABR03015382
            EMBL:AABR03015832 EMBL:AABR03016562 EMBL:AABR03017847
            EMBL:AABR03017951 EMBL:AABR03018245 EMBL:AABR03019675
            EMBL:AABR03023874 EMBL:U20116 EMBL:U20118 EMBL:U20121
            IPI:IPI00189470 IPI:IPI00189494 IPI:IPI00325589 IPI:IPI00949317
            IPI:IPI00959233 PIR:B31795 RefSeq:NP_037249.1 UniGene:Rn.260
            IntAct:P20909 STRING:P20909 PhosphoSite:P20909 PRIDE:P20909
            Ensembl:ENSRNOT00000023693 Ensembl:ENSRNOT00000068413 GeneID:25654
            KEGG:rno:25654 UCSC:RGD:2372 InParanoid:P20909 NextBio:607535
            ArrayExpress:P20909 Genevestigator:P20909
            GermOnline:ENSRNOG00000023148 Uniprot:P20909
        Length = 1804

 Score = 142 (55.0 bits), Expect = 9.3e-06, P = 9.3e-06
 Identities = 87/280 (31%), Positives = 107/280 (38%)

Query:   242 GATGNSENE-TSGR--PVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTST----SA 293
             GA G+   +  SG+  P G   +    G+P   G P      G  G  GP  S     SA
Sbjct:  1004 GAKGDPGPQGISGKDGPAGIRGFPGERGLPGAQGAPGLKGGEGPQGPQGPIGSPGERGSA 1063

Query:   294 YAATQSGTPMRAAYDIPRGP----GYEASKGP----GYDASKAPSYDPTKGPSYDPAKGP 345
               A   G P R     P GP    G    KGP    G D  + P   P  GP+  PA  P
Sbjct:  1064 GTAGPIGLPGRPGPQGPPGPAGEKGAPGEKGPQGPAGRDGVQGPVGLP--GPA-GPAGSP 1120

Query:   346 GYDPTKGP-GYDAQKGSNYD-AQRGPNYDIH-RGPSYDPQRGL-GYDMQRGPN--YDMQR 399
             G D  KG  G   QKGS  D  + GP      +GP   P  G+ G D + GP     M  
Sbjct:  1121 GEDGDKGEIGEPGQKGSKGDKGENGPPGPPGLQGPVGAP--GIAGGDGEAGPRGQQGMFG 1178

Query:   400 GPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDL-QRGQ-GYDMRRAPSYDPSRGT 457
               G E  R  G+    GP+   Q  P    ++G   D+   G  G    R P   P+   
Sbjct:  1179 QKGDEGAR--GFPGPPGPI-GLQGLPGPPGEKGENGDVGPMGPPGPPGPRGPQ-GPNGAD 1234

Query:   458 GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 497
             G  G P      G V         G+  PP  +GSG P+G
Sbjct:  1235 GPQGPPGSIGSVGGVGEKGEPGEAGNPGPPGEAGSGGPKG 1274


>TAIR|locus:2077547
            symbol:AT3G07030 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005829
            "cytosol" evidence=IDA] InterPro:IPR002775 Pfam:PF01918
            GO:GO:0005829 EMBL:CP002686 GO:GO:0003676 IPI:IPI00519674
            RefSeq:NP_187359.2 UniGene:At.74527 ProteinModelPortal:F4JD88
            SMR:F4JD88 PRIDE:F4JD88 EnsemblPlants:AT3G07030.1 GeneID:3768790
            KEGG:ath:AT3G07030 OMA:ERRNDGY Uniprot:F4JD88
        Length = 405

 Score = 134 (52.2 bits), Expect = 9.4e-06, P = 9.4e-06
 Identities = 57/209 (27%), Positives = 72/209 (34%)

Query:   259 NAY-EDGYGVPQGHGPPP--SATTAGVVGAGPNTSTSAYAATQS-GTPMRA-AYDI-PRG 312
             NAY E+G  V +G         TT GV+      +      T   G   RA A D+    
Sbjct:   150 NAYGEEGEVVAEGEAGEEVDMETTKGVMKEKTKGTIKKIIKTMKVGIQTRAEAVDVVDEA 209

Query:   313 PGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYD 372
                   +G GY   +   Y   +   Y   +  GY   +   Y   +   Y   R   Y 
Sbjct:   210 MAIVGGRG-GYGGGRDGGYGGGRDDGYGERRNDGYGERRNDRYGGGRDDGYGGGRDDGYG 268

Query:   373 IHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG 432
               R   Y  +RG G+   RG   D   G G       G    +G  Y   R   Y   RG
Sbjct:   269 GGRNDGYGGRRG-GFRGGRGGGRDEGYGGG--RGGYGGRSGGQGDGYGGGRGDGYGGGRG 325

Query:   433 PGYDLQRGQGYDMRRAPSYDPSRGTGFDG 461
              GY   RG GY   R   YD  R  G+ G
Sbjct:   326 DGYGGGRGDGYGGGRVDRYDGGRRDGYGG 354

 Score = 125 (49.1 bits), Expect = 9.3e-05, P = 9.3e-05
 Identities = 50/158 (31%), Positives = 59/158 (37%)

Query:   311 RGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPN 370
             R  GY   +  GY   +   Y   +G  +   +G G D     GY   +G  Y  + G  
Sbjct:   255 RDDGYGGGRDDGYGGGRNDGYGGRRG-GFRGGRGGGRDE----GYGGGRGG-YGGRSGG- 307

Query:   371 YDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQ 430
                 +G  Y   RG GY   RG  Y   RG GY   RV  YD  R   Y   R   Y   
Sbjct:   308 ----QGDGYGGGRGDGYGGGRGDGYGGGRGDGYGGGRVDRYDGGRRDGYGGGRYDGYGGG 363

Query:   431 RGPGYDLQRGQGYDMRRAPSYDPSRGT-GFDGAPRGAA 467
             +  GY   RG GY   R   Y   RG  G  G  R  A
Sbjct:   364 KSDGYGGGRG-GYRGGRG-GYGRGRGRMGNGGRSRDGA 399


>CGD|CAL0000919
            symbol:RPO21 species:5476 "Candida albicans" [GO:0005665
            "DNA-directed RNA polymerase II, core complex" evidence=IEA]
            [GO:0005829 "cytosol" evidence=IEA] [GO:0005739 "mitochondrion"
            evidence=IEA] [GO:0030447 "filamentous growth" evidence=IMP]
            [GO:0006366 "transcription from RNA polymerase II promoter"
            evidence=IEA] [GO:0009267 "cellular response to starvation"
            evidence=IMP] [GO:0036170 "filamentous growth of a population of
            unicellular organisms in response to starvation" evidence=IMP]
            [GO:0036180 "filamentous growth of a population of unicellular
            organisms in response to biotic stimulus" evidence=IMP] [GO:0071216
            "cellular response to biotic stimulus" evidence=IMP] [GO:0003899
            "DNA-directed RNA polymerase activity" evidence=IEA] [GO:0003677
            "DNA binding" evidence=IEA] [GO:0003968 "RNA-directed RNA
            polymerase activity" evidence=IEA] InterPro:IPR000684
            InterPro:IPR000722 InterPro:IPR006592 InterPro:IPR007066
            InterPro:IPR007073 InterPro:IPR007075 InterPro:IPR007080
            InterPro:IPR007081 InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983
            Pfam:PF04990 Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000
            Pfam:PF05001 PROSITE:PS00115 SMART:SM00663 CGD:CAL0000919
            GO:GO:0071216 GO:GO:0036180 GO:GO:0003677 GO:GO:0006366
            GO:GO:0009267 Gene3D:2.40.40.20 InterPro:IPR009010
            EMBL:AACQ01000032 GO:GO:0036170 GO:GO:0003899 eggNOG:COG0086
            GO:GO:0005665 KO:K03006 RefSeq:XP_719414.1 STRING:Q5ACI7
            GeneID:3638991 KEGG:cal:CaO19.7655 Uniprot:Q5ACI7
        Length = 1728

 Score = 141 (54.7 bits), Expect = 1.1e-05, P = 1.1e-05
 Identities = 72/234 (30%), Positives = 91/234 (38%)

Query:   226 LMNAPN---VDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGV 282
             L  AP+   +D  ADG  GGAT   + E        NA ++   +  G G  P       
Sbjct:  1452 LQKAPSSMAMDDIADG--GGATPYKDYE--------NARDENIDIDAGAGFSPIHIAQMN 1501

Query:   283 VG-AGPNTSTSAYAATQSGT-PMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYD 340
              G  G  TS      + + T P    Y+    PGY  S G GY  + +PSY PT  PSY 
Sbjct:  1502 EGNIGGLTSYGGQPTSPAATSPFSYGYNSITSPGY-TSPGYGYSPT-SPSYSPTS-PSYA 1558

Query:   341 PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 400
             P   P Y PT  P Y A     Y +   P+Y     P+Y P     Y     P+Y     
Sbjct:  1559 PTS-PAYSPTS-PSY-APTSPAY-SPTSPSY-APTSPAYSPTSP-AYS-PTSPSYS-PTS 1610

Query:   401 PGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPS 454
             P Y     P Y     P Y +  +PSY P   P Y            +PSY P+
Sbjct:  1611 PQYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPAYS---------PTSPSYSPT 1651


>UNIPROTKB|Q5ACI7
            symbol:RPO21 "DNA-directed RNA polymerase" species:237561
            "Candida albicans SC5314" [GO:0009267 "cellular response to
            starvation" evidence=IMP] [GO:0030447 "filamentous growth"
            evidence=IMP] [GO:0036170 "filamentous growth of a population of
            unicellular organisms in response to starvation" evidence=IMP]
            [GO:0036180 "filamentous growth of a population of unicellular
            organisms in response to biotic stimulus" evidence=IMP] [GO:0071216
            "cellular response to biotic stimulus" evidence=IMP]
            InterPro:IPR000684 InterPro:IPR000722 InterPro:IPR006592
            InterPro:IPR007066 InterPro:IPR007073 InterPro:IPR007075
            InterPro:IPR007080 InterPro:IPR007081 InterPro:IPR007083
            Pfam:PF00623 Pfam:PF04983 Pfam:PF04990 Pfam:PF04992 Pfam:PF04997
            Pfam:PF04998 Pfam:PF05000 Pfam:PF05001 PROSITE:PS00115
            SMART:SM00663 CGD:CAL0000919 GO:GO:0071216 GO:GO:0036180
            GO:GO:0003677 GO:GO:0006366 GO:GO:0009267 Gene3D:2.40.40.20
            InterPro:IPR009010 EMBL:AACQ01000032 GO:GO:0036170 GO:GO:0003899
            eggNOG:COG0086 GO:GO:0005665 KO:K03006 RefSeq:XP_719414.1
            STRING:Q5ACI7 GeneID:3638991 KEGG:cal:CaO19.7655 Uniprot:Q5ACI7
        Length = 1728

 Score = 141 (54.7 bits), Expect = 1.1e-05, P = 1.1e-05
 Identities = 72/234 (30%), Positives = 91/234 (38%)

Query:   226 LMNAPN---VDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGV 282
             L  AP+   +D  ADG  GGAT   + E        NA ++   +  G G  P       
Sbjct:  1452 LQKAPSSMAMDDIADG--GGATPYKDYE--------NARDENIDIDAGAGFSPIHIAQMN 1501

Query:   283 VG-AGPNTSTSAYAATQSGT-PMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYD 340
              G  G  TS      + + T P    Y+    PGY  S G GY  + +PSY PT  PSY 
Sbjct:  1502 EGNIGGLTSYGGQPTSPAATSPFSYGYNSITSPGY-TSPGYGYSPT-SPSYSPTS-PSYA 1558

Query:   341 PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 400
             P   P Y PT  P Y A     Y +   P+Y     P+Y P     Y     P+Y     
Sbjct:  1559 PTS-PAYSPTS-PSY-APTSPAY-SPTSPSY-APTSPAYSPTSP-AYS-PTSPSYS-PTS 1610

Query:   401 PGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPS 454
             P Y     P Y     P Y +  +PSY P   P Y            +PSY P+
Sbjct:  1611 PQYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPAYS---------PTSPSYSPT 1651


>UNIPROTKB|F1P555
            symbol:SFPQ "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0000166 "nucleotide binding" evidence=IEA] [GO:0003676
            "nucleic acid binding" evidence=IEA] [GO:0000380 "alternative mRNA
            splicing, via spliceosome" evidence=IEA] [GO:0016363 "nuclear
            matrix" evidence=IEA] [GO:0042382 "paraspeckles" evidence=IEA]
            InterPro:IPR000504 InterPro:IPR012677 Pfam:PF00076 PROSITE:PS50102
            SMART:SM00360 GO:GO:0000166 Gene3D:3.30.70.330 GO:GO:0003676
            GO:GO:0016363 GO:GO:0000380 GO:GO:0042382 InterPro:IPR012975
            Pfam:PF08075 GeneTree:ENSGT00390000005004 OMA:APGGHPK
            EMBL:AADN02043825 EMBL:AADN02043826 IPI:IPI00574618
            Ensembl:ENSGALT00000003963 ArrayExpress:F1P555 Uniprot:F1P555
        Length = 647

 Score = 136 (52.9 bits), Expect = 1.2e-05, P = 1.2e-05
 Identities = 62/219 (28%), Positives = 89/219 (40%)

Query:   234 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSA-------TTAGVVGAG 286
             R   G  GG   +  +   G  +GQN    G G PQG G PP                A 
Sbjct:    19 RGGGGGRGGPNHDFRSPPPGMGMGQNRGPMGGG-PQGPGGPPGGGPKSEPPKPPASTSAP 77

Query:   287 PNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDAS-KAPSYDPTKGPSYDPAKGP 345
             P++S+S+ A T      ++    P      A + P   A   APS  P+ GP       P
Sbjct:    78 PSSSSSSSATTAGPAGSQSGPGAPPPSALPAGQPPQQQAQGSAPSSAPS-GPGGQQQPQP 136

Query:   346 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 405
                P+  P    +KG       GP     +GP   PQ+G G   + GP +  + GPG E+
Sbjct:   137 KPSPSPTPAGGPKKGQGQSPGGGP-----KGPG-GPQQGPGGPHKGGPGH--RGGPGGES 188

Query:   406 QRVPGYDVQRGPVYEAQRAPSYIPQRGP-GYDLQRGQGY 443
             +   G    RG  ++ Q++ S   Q+GP G D    +G+
Sbjct:   189 R---G----RGQQHQGQQSLSL--QQGPAGGDQLSDEGF 218


>UNIPROTKB|F1PHX8
            symbol:COL5A1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 Gene3D:2.60.120.200 InterPro:IPR008985
            InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 InterPro:IPR001791 Pfam:PF02210 GO:GO:0005201
            OMA:TIYEGIG SMART:SM00210 GeneTree:ENSGT00700000104155
            EMBL:AAEX03006798 EMBL:AAEX03006799 EMBL:AAEX03006800
            Ensembl:ENSCAFT00000031582 Uniprot:F1PHX8
        Length = 1814

 Score = 141 (54.7 bits), Expect = 1.2e-05, P = 1.2e-05
 Identities = 77/254 (30%), Positives = 100/254 (39%)

Query:   255 PVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG-P 313
             PVG    +   G P   GP  S    G  GA            Q G P  A     +G P
Sbjct:  1043 PVGALGLKGSEGPPGPPGPAGSPGERGPAGAAGPIGIPGRPGPQ-GPPGPAG---EKGAP 1098

Query:   314 GYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRGPN 370
             G +  +GP G D  + P   P  GP+  P   PG D  KG  G   QKGS  D  ++GP 
Sbjct:  1099 GEKGPQGPAGRDGLQGPVGLP--GPA-GPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGPP 1155

Query:   371 YDIHRGPSYDPQRGLGYDMQRGPN-YDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIP 429
                  GP+  PQ  +G   Q GP+  D + GP  + Q + G     GP       P  + 
Sbjct:  1156 -----GPT-GPQGPIG---QPGPSGADGEPGPRGQ-QGLFGQKGDEGP-RGFPGPPGPVG 1204

Query:   430 QRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNV-PYGSATPP 487
              +G PG   ++G+  D+ +     P    G  GAP    P G  P  + N    G    P
Sbjct:  1205 LQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGAPGADGPQGP-PGGIGNPGAVGEKGEP 1263

Query:   488 ARSGS-GQPRGGNP 500
               +G  G P  G P
Sbjct:  1264 GEAGEPGLPGEGGP 1277


>MGI|MGI:2384582
            symbol:Zfp768 "zinc finger protein 768" species:10090 "Mus
            musculus" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0003677 "DNA binding" evidence=IEA] [GO:0005622 "intracellular"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0006351
            "transcription, DNA-dependent" evidence=IEA] [GO:0006355
            "regulation of transcription, DNA-dependent" evidence=IEA]
            [GO:0008150 "biological_process" evidence=ND] [GO:0008270 "zinc ion
            binding" evidence=IEA] [GO:0046872 "metal ion binding"
            evidence=IEA] Pfam:PF00096 InterPro:IPR007087 InterPro:IPR013087
            InterPro:IPR015880 PROSITE:PS00028 PROSITE:PS50157 SMART:SM00355
            MGI:MGI:2384582 GO:GO:0005634 GO:GO:0006355 GO:GO:0046872
            GO:GO:0003677 GO:GO:0008270 GO:GO:0006351 eggNOG:COG5048
            Gene3D:3.30.160.60 HOGENOM:HOG000234617
            GeneTree:ENSGT00700000104520 KO:K09228 HSSP:P17028
            HOVERGEN:HBG105926 OMA:SRYESQN OrthoDB:EOG4CNQQT EMBL:AK155155
            EMBL:BC026432 IPI:IPI00153270 RefSeq:NP_666314.1 UniGene:Mm.23031
            ProteinModelPortal:Q8R0T2 SMR:Q8R0T2 IntAct:Q8R0T2 STRING:Q8R0T2
            PhosphoSite:Q8R0T2 PRIDE:Q8R0T2 Ensembl:ENSMUST00000060783
            GeneID:233890 KEGG:mmu:233890 UCSC:uc009jvc.1 CTD:233890
            InParanoid:Q8R0T2 NextBio:381919 Bgee:Q8R0T2 CleanEx:MM_ZFP768
            Genevestigator:Q8R0T2 Uniprot:Q8R0T2
        Length = 568

 Score = 135 (52.6 bits), Expect = 1.3e-05, P = 1.3e-05
 Identities = 70/278 (25%), Positives = 107/278 (38%)

Query:   229 APNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPN 288
             A N     +G      GN + E    P G       +  PQ       +        G  
Sbjct:    32 AGNTSENEEGEISQREGNGDYEVEEIPFGLEPQSPEFE-PQSPEFESQSPRFEPESPGFE 90

Query:   289 TSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYD 348
             + +  +         R+    P+ P +E S+ P Y+  ++P   P + P  +P   P Y+
Sbjct:    91 SRSPGFVPPSPEFAPRSPESDPQSPEFE-SQSPKYEP-RSPGCHP-RSPGCEPGS-PRYE 146

Query:   349 PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ-RGPGYETQR 407
             P K PGY + K   +++Q  P Y+  + P Y+PQ   G  +Q   N + +   P +ETQ 
Sbjct:   147 P-KSPGYGS-KSPEFESQ-SPGYE-SQSPGYEPQNS-GDGVQ---NSEFKTHSPEFETQS 198

Query:   408 VPGYDVQRGPVYEAQRAPSYIPQRGPGYD-LQRGQGYDMRRA-PSYD-PSRGTGFDGAPR 464
                 +    P+   ++ P  I       D   +G G     A P +D PS      GA  
Sbjct:   199 SKFQEGAEMPLSPEEKNPLSISLGVHPLDSFTQGFGEQPTGALPPFDMPS------GALL 252

Query:   465 GAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 502
              A     +  PLN    G+   P R G G+ RGG   R
Sbjct:   253 AAPQFEMLQNPLNLT--GTLRGPGRRG-GRARGGQGPR 287


>MGI|MGI:2157767
            symbol:Krtap21-1 "keratin associated protein 21-1"
            species:10090 "Mus musculus" [GO:0001942 "hair follicle
            development" evidence=IMP] [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0005882 "intermediate filament" evidence=IEA] [GO:0007165
            "signal transduction" evidence=IMP] [GO:0008283 "cell
            proliferation" evidence=IMP] [GO:0022405 "hair cycle process"
            evidence=IMP] [GO:0031077 "post-embryonic camera-type eye
            development" evidence=IMP] [GO:0042640 "anagen" evidence=IMP]
            [GO:0043480 "pigment accumulation in tissues" evidence=IMP]
            [GO:0043588 "skin development" evidence=IMP] [GO:0048589
            "developmental growth" evidence=IMP] [GO:0051726 "regulation of
            cell cycle" evidence=IMP] MGI:MGI:2157767 GO:GO:0007165
            GO:GO:0043588 GO:GO:0008283 GO:GO:0005882 GO:GO:0051726
            GO:GO:0042640 GO:GO:0031077 EMBL:AF345297 EMBL:AK003736
            IPI:IPI00126890 UniGene:Mm.46109 HSSP:P10969 Genevestigator:Q925H4
            GO:GO:0043480 Uniprot:Q925H4
        Length = 128

 Score = 111 (44.1 bits), Expect = 1.3e-05, P = 1.3e-05
 Identities = 32/103 (31%), Positives = 32/103 (31%)

Query:   300 GTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQK 359
             G   R  Y    G GY    G GY       Y    G  Y    G GY    G GY    
Sbjct:    14 GYGSRYGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGY 73

Query:   360 GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPG 402
             GS Y    G  Y    G  Y    G GY    G  Y    G G
Sbjct:    74 GSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSRYGCGYGSG 116

 Score = 103 (41.3 bits), Expect = 9.3e-05, P = 9.3e-05
 Identities = 31/98 (31%), Positives = 33/98 (33%)

Query:   314 GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDI 373
             GY    G GY       Y    G  Y    G GY    G GY    GS Y    G  Y  
Sbjct:    20 GYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGC 79

Query:   374 HRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGY 411
               G  Y    G GY    G  Y    G GY ++   GY
Sbjct:    80 GYGSGY----GCGYGSGYGCGYGSGYGCGYGSRYGCGY 113


>UNIPROTKB|F1N474
            symbol:COL4A5 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0031594 "neuromuscular junction" evidence=IEA]
            [GO:0007528 "neuromuscular junction development" evidence=IEA]
            [GO:0005605 "basal lamina" evidence=IEA] [GO:0005587 "collagen type
            IV" evidence=IEA] [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] InterPro:IPR001442 Pfam:PF01413
            PROSITE:PS51403 SMART:SM00111 GO:GO:0007528 InterPro:IPR016187
            SUPFAM:SSF56436 GO:GO:0031594 GO:GO:0005605 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 GO:GO:0005587 Gene3D:2.170.240.10
            GeneTree:ENSGT00690000101772 EMBL:DAAA02071513 EMBL:DAAA02071512
            IPI:IPI00729819 Ensembl:ENSBTAT00000019400 OMA:MPMNMEP
            Uniprot:F1N474
        Length = 1688

 Score = 140 (54.3 bits), Expect = 1.4e-05, P = 1.4e-05
 Identities = 62/203 (30%), Positives = 76/203 (37%)

Query:   310 PRGPGYEASKGP--GYDASKAPSYDPTK-G-PSYDPAKG-PGYDPTKG-PGYDAQKGSNY 363
             P  PG     GP  G    K    +P K G P  D   G PG     G PGY  + G   
Sbjct:   266 PGPPGIRGPPGPPGGVKGEKGEQGEPGKRGKPGKDGENGQPGIPGLPGDPGYPGEPGR-- 323

Query:   364 DAQRGPNYDIHR-GPS--YDPQRGLGYDMQRGPNYDMQRGPGYETQR-VPGYDVQRGPVY 419
             D ++G   D    GP     P+ G G  +    N  +   PG +  R  PG  +Q  P  
Sbjct:   324 DGEKGQKGDTGLPGPPGLVIPRPGTGVTVGEKGNIGLPGLPGDKGDRGFPG--IQGPPGL 381

Query:   420 EAQRAPSYI-PQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNN 478
                  P+ I P   PG+  +RGQ  D    P        G DG P    P G   PP  +
Sbjct:   382 PGPPGPAVIGPPGPPGFPGERGQKGD-EGPPGISIPGSPGLDGQPGAPGPPGPPGPPGPH 440

Query:   479 VPYGS----ATPPARSGSGQPRG 497
             +P       A PP   GS   RG
Sbjct:   441 IPPSDKICEAGPPGPPGSPGDRG 463


>ZFIN|ZDB-GENE-030131-1600
            symbol:ewsr1b "Ewing sarcoma breakpoint region 1b"
            species:7955 "Danio rerio" [GO:0005622 "intracellular"
            evidence=IEA] [GO:0000166 "nucleotide binding" evidence=IEA]
            [GO:0008270 "zinc ion binding" evidence=IEA] [GO:0003676 "nucleic
            acid binding" evidence=IEA] [GO:0021954 "central nervous system
            neuron development" evidence=IMP] [GO:0007067 "mitosis"
            evidence=IMP] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358
            PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360 SMART:SM00547
            ZFIN:ZDB-GENE-030131-1600 GO:GO:0007067 GO:GO:0000166 GO:GO:0008270
            Gene3D:3.30.70.330 GO:GO:0003676 GO:GO:0005622 GO:GO:0021954
            GeneTree:ENSGT00530000063105 HOGENOM:HOG000038010
            HOVERGEN:HBG000970 EMBL:BX664747 EMBL:BC097019 UniGene:Dr.76923
            SMR:Q4QRG0 STRING:Q4QRG0 Ensembl:ENSDART00000003998 OMA:PVINIYL
            Uniprot:Q4QRG0
        Length = 579

 Score = 139 (54.0 bits), Expect = 1.7e-05, Sum P(2) = 1.7e-05
 Identities = 78/283 (27%), Positives = 100/283 (35%)

Query:   236 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYA 295
             A  SYG  T     +T G+   Q   +  Y     +  P +A  A    A P  S  AYA
Sbjct:    15 AQQSYGSYTAPPA-QTYGQTAQQGYTQQDYS---SYAQPAAAPEATYSQAAP--SAGAYA 68

Query:   296 ATQSGTPM-RAAYDIPRGPGYEASKGPGYDASKAPSYDPTK--GPSYDPAKGPGYDPTKG 352
               Q G+   +AA      P    +  PG     A SY  +   G +  PA    Y     
Sbjct:    69 QQQYGSTYGQAAATAAAAPAAYGTPQPGAYTQPAQSYGASSYTGSTAAPAAQASYGSQ-- 126

Query:   353 PGYDAQKG-SNYDAQ---RGP-NYDIHRGPSYDPQRGLGYDMQRG---PNYDMQRGPGYE 404
             PGY  Q   S Y  Q     P +Y     P+Y+      Y    G   P Y  Q+ PGY 
Sbjct:   127 PGYSTQPAYSGYSQQPAASAPQSYSASSQPAYNQS---AYSQPAGYSQPGYQAQQ-PGYG 182

Query:   405 TQRVPGYDVQRGPVYEAQRAPS-YIPQRGPGY-DLQRGQGY----DMRRAPSYDPSRGT- 457
              Q+   Y   + P    Q  P+ Y PQ    Y   Q GQ      D ++ P    S+G  
Sbjct:   183 QQQQSAYGQGQPPQQHQQGPPAAYPPQGSSSYAQTQYGQQSAPQNDYQQNPYNSYSQGGV 242

Query:   458 --GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 498
               G+ G+ RG    G         P G        G    RGG
Sbjct:   243 SGGYPGSQRGGYQDGGRDGYDRGGPRGRGMGRGGMGIAGDRGG 285

 Score = 39 (18.8 bits), Expect = 1.7e-05, Sum P(2) = 1.7e-05
 Identities = 8/16 (50%), Positives = 8/16 (50%)

Query:   487 PARSGSGQPRGGNPAR 502
             P R G G  RGG   R
Sbjct:   410 PMRGGPGMDRGGMMGR 425


>UNIPROTKB|K7EKB2
            symbol:TAF15 "TATA-binding protein-associated factor 2N"
            species:9606 "Homo sapiens" [GO:0005622 "intracellular"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            InterPro:IPR001876 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50199
            SMART:SM00547 EMBL:AC015849 HGNC:HGNC:11547 Ensembl:ENST00000585577
            Uniprot:K7EKB2
        Length = 214

 Score = 125 (49.1 bits), Expect = 1.7e-05, P = 1.7e-05
 Identities = 48/140 (34%), Positives = 52/140 (37%)

Query:   314 GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK-GPGYDAQK-GSNYDAQRGPNY 371
             GY    G G D           G   D + G GY   + G GY   + G  Y   RG  Y
Sbjct:    69 GYRGRGGRGGDRGGYGGDRSGGGYGGDRSSGGGYSGDRSGGGYGGDRSGGGYGGDRGGGY 128

Query:   372 DIHRGPSYDPQRGLGY--DMQRGPNYDMQRG--PGYETQRVPGYDVQR-GPVYEAQRAPS 426
                RG  Y   RG GY  D  RG  Y   RG   GY   R  GY   R G  Y   R   
Sbjct:   129 GGDRGGGYGGDRGGGYGGDRSRG-GYGGDRGGGSGYGGDRSGGYGGDRSGGGYGGDRGGG 187

Query:   427 YIPQRGPGYDLQRGQGYDMR 446
             Y   RG GY  + G   D R
Sbjct:   188 YGGDRG-GYGGKMGGRNDYR 206

 Score = 120 (47.3 bits), Expect = 7.0e-05, P = 7.0e-05
 Identities = 48/170 (28%), Positives = 62/170 (36%)

Query:   230 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNT 289
             P   R + G + G     E    GR  G+     GYG  +  G      ++G  G   + 
Sbjct:    49 PEDSRPSGGDFRGRGYGGERGYRGRG-GRGGDRGGYGGDRSGGGYGGDRSSGG-GYSGDR 106

Query:   290 STSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSY--DPTKGPSYDPAKGPGY 347
             S   Y   +SG      Y   RG GY   +G GY   +   Y  D ++G       G G 
Sbjct:   107 SGGGYGGDRSG----GGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRSRG-------GYGG 155

Query:   348 DPTKGPGYDAQKGSNYDAQR-GPNYDIHRGPSYDPQRGLGYDMQRGPNYD 396
             D   G GY   +   Y   R G  Y   RG  Y   RG GY  + G   D
Sbjct:   156 DRGGGSGYGGDRSGGYGGDRSGGGYGGDRGGGYGGDRG-GYGGKMGGRND 204


>UNIPROTKB|E2RS29
            symbol:E2RS29 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0000166 "nucleotide binding" evidence=IEA] InterPro:IPR000504
            InterPro:IPR012677 Pfam:PF00076 PROSITE:PS50102 SMART:SM00360
            GO:GO:0000166 Gene3D:3.30.70.330 GO:GO:0003676
            GeneTree:ENSGT00530000063105 EMBL:AAEX03026460
            Ensembl:ENSCAFT00000019701 Uniprot:E2RS29
        Length = 538

 Score = 133 (51.9 bits), Expect = 1.9e-05, P = 1.9e-05
 Identities = 80/314 (25%), Positives = 115/314 (36%)

Query:   209 EKNYITMATEVEKLRAELMNAPNVDRRADGSYGGATGNSENETSGRPV-GQNAYEDGYGV 267
             ++ Y    T+  +  A+   A    +++ G+YG  T  S  +       GQ AY   YG 
Sbjct:    15 QQGYSAYTTQPTQGYAQTTQA--YGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQ 72

Query:   268 PQ-GHGPP--PSATTAGVVG--AGP-NTSTSAYAATQSGTPMRAAYDI-PRGPGY---EA 317
             P  G+  P  P A +  V G   G  +T+T+    TQ+    ++AY   P  P Y    A
Sbjct:    73 PPAGYTTPTAPQAYSQPVQGYSTGAYDTTTATVTTTQASYEAQSAYGTQPAYPAYGQQPA 132

Query:   318 SKGPG--YDASK-APSYDP--TKGPSYDPAKGPG---YDPTKGPG-YDAQKGSNYDAQRG 368
             +  P    D +K A +  P  + G    P+ G G   Y   + PG Y  Q  +   +   
Sbjct:   133 ATAPARPQDGNKPAETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPP 192

Query:   369 PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYI 428
              +Y   +  SYD Q   G     G      +   Y  Q    Y  Q G  Y   +APS  
Sbjct:   193 TSYSSTQPTSYDQQNTYGQPSSYGQQSSYGQQSSYGQQLPTSYPPQTGS-YS--QAPSQY 249

Query:   429 PQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPA 488
              Q+   Y  Q     D  R+         GF       +  G   P      +      +
Sbjct:   250 SQQSSSYGQQSSFQQDHPRSMGVYGQESGGFSRPGENRSMSGPDNPGRGRGGFDRGDM-S 308

Query:   489 RSGSGQPRGGNPAR 502
             R G G  RGG  AR
Sbjct:   309 RGGRGGGRGGMGAR 322


>UNIPROTKB|F1RYI8
            symbol:COL3A1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0071230 "cellular response to amino acid stimulus"
            evidence=IEA] [GO:0050777 "negative regulation of immune response"
            evidence=IEA] [GO:0048565 "digestive tract development"
            evidence=IEA] [GO:0048407 "platelet-derived growth factor binding"
            evidence=IEA] [GO:0046332 "SMAD binding" evidence=IEA] [GO:0043588
            "skin development" evidence=IEA] [GO:0043206 "extracellular fibril
            organization" evidence=IEA] [GO:0042060 "wound healing"
            evidence=IEA] [GO:0034097 "response to cytokine stimulus"
            evidence=IEA] [GO:0032964 "collagen biosynthetic process"
            evidence=IEA] [GO:0030199 "collagen fibril organization"
            evidence=IEA] [GO:0018149 "peptide cross-linking" evidence=IEA]
            [GO:0009314 "response to radiation" evidence=IEA] [GO:0007507
            "heart development" evidence=IEA] [GO:0007229 "integrin-mediated
            signaling pathway" evidence=IEA] [GO:0007179 "transforming growth
            factor beta receptor signaling pathway" evidence=IEA] [GO:0007160
            "cell-matrix adhesion" evidence=IEA] [GO:0005615 "extracellular
            space" evidence=IEA] [GO:0005586 "collagen type III" evidence=IEA]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA] [GO:0001568 "blood vessel development" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0043588 GO:GO:0005615
            GO:GO:0007507 GO:GO:0034097 GO:GO:0030199 GO:GO:0007179
            GO:GO:0007229 GO:GO:0007160 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0042060 GO:GO:0001568 GO:GO:0048565 GO:GO:0050777
            GO:GO:0009314 GO:GO:0018149 GO:GO:0032964 GO:GO:0071230
            GO:GO:0043206 GO:GO:0005201 GeneTree:ENSGT00660000095287 KO:K06236
            CTD:1281 OMA:EGSPGHP GO:GO:0005586 EMBL:CU467671
            RefSeq:NP_001230226.1 UniGene:Ssc.24309 UniGene:Ssc.97562
            Ensembl:ENSSSCT00000017459 GeneID:100152001 KEGG:ssc:100152001
            Uniprot:F1RYI8
        Length = 1466

 Score = 138 (53.6 bits), Expect = 2.0e-05, P = 2.0e-05
 Identities = 85/286 (29%), Positives = 105/286 (36%)

Query:   236 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP----NTS 290
             A G  GG  G +       P G + +    G P   GPP     AG  G  GP      S
Sbjct:   166 AGGGIGGYPGPAGPPGPPGPPGVSGHPGAPGSPGYQGPPGEPGQAGPAGPPGPPGAIGPS 225

Query:   291 TSAYAATQSGTPMRAAY-DIPRGPGYEASKG-PGYDASKAP-SYDPTKGPSYDPAKGPGY 347
               A    +SG P R     +P  PG +   G PG+   K    +D   G   D    PG 
Sbjct:   226 GPAGKDGESGRPGRPGERGLPGPPGLKGPAGMPGFPGMKGHRGFDGRNGEKGDTG-APGL 284

Query:   348 DPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGP-GYE 404
                 G PG +   G      RG   +  R P      G  G D  RG   D Q GP G  
Sbjct:   285 KGENGLPGENGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQPGPPGPP 339

Query:   405 -TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 463
              T   PG    +G V  A  +P   P   PG   QRG+      A +  P    G +G+P
Sbjct:   340 GTAGFPGSPGAKGEVGPAG-SPG--PSGSPG---QRGEPGPQGHAGAAGPPGPPGSNGSP 393

Query:   464 RGAAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RG--GNPAR 502
              G    G  P  +   P   G+  PP   G+ G P  RG  G P +
Sbjct:   394 GGKGEMG--PAGIPGAPGLMGARGPPGPPGTNGAPGQRGAAGEPGK 437


>UNIPROTKB|F1NI73
            symbol:COL3A1 "Collagen alpha-1(III) chain" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 GeneTree:ENSGT00660000095287
            EMBL:AADN02034558 EMBL:AADN02034559 IPI:IPI01017330
            Ensembl:ENSGALT00000004032 ArrayExpress:F1NI73 Uniprot:F1NI73
        Length = 1260

 Score = 137 (53.3 bits), Expect = 2.2e-05, P = 2.2e-05
 Identities = 83/280 (29%), Positives = 109/280 (38%)

Query:   242 GATGN-SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP-NTSTSAYAATQ 298
             GA G   +N   G P G+       G+P  +G P     AG  G+ GP   S  A    Q
Sbjct:   465 GANGEPGQNGVPGTP-GERGSPGFRGLPGSNGLPGEKGPAGERGSPGPPGPSGPAGDRGQ 523

Query:   299 SGTP----MRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP- 353
              G P    MR    IP  PG +   GP  +  + P      GP+  P   PG     GP 
Sbjct:   524 DGGPGLPGMRGLPGIPGSPGSDGKPGPPGNQGE-PGRSGPPGPA-GPRGQPGVMGFPGPK 581

Query:   354 GYDAQKGSNYDAQRGPNYDIHR-GPS-YDPQRGL-GYDMQRGPNYDM-QRGPGYET--QR 407
             G +   G N   +RGP       GP+  +   GL G     GP  D  + GP      Q 
Sbjct:   582 GNEGAPGKN--GERGPGGPPGTPGPAGKNGDVGLPGPPGPAGPAGDRGEPGPSGSPGLQG 639

Query:   408 VPGYDVQRGPVYEAQRAPSYIPQR---GPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAP 463
             +PG     GP  E  +     P+    GPG+   +G+ G    R  +  P   TG  G P
Sbjct:   640 LPGGP---GPAGENGKPGEPGPKGDIGGPGFPGPKGENGIPGERG-AQGPPGPTGARGGP 695

Query:   464 RGAAPHG-QVPPPLNNVPYGSATPPARSGSGQPRG--GNP 500
               A   G + PP     P G+  P  +   G+ RG  G+P
Sbjct:   696 GPAGSEGAKGPPGPPGAPGGTGLPGLQGMPGE-RGASGSP 734

 Score = 123 (48.4 bits), Expect = 0.00071, P = 0.00071
 Identities = 84/275 (30%), Positives = 104/275 (37%)

Query:   252 SGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 311
             +G P G        G+P   G P      G+ G  P TS +  A    G P +       
Sbjct:   386 AGSP-GNKGEMGPSGIPGAPGLPGGR---GLPGP-PGTSGNPGAKGTPGEPGKNGAKGDP 440

Query:   312 GP-GYEASKG-PGYDASKAPSYDPTKGPSYDPAKG--PGYDPTKG-PGYDAQKGSN-YDA 365
             GP G     G PG  A   P  +  +G + +P +   PG    +G PG+    GSN    
Sbjct:   441 GPKGERGENGTPG--APGPPGEEGKRGANGEPGQNGVPGTPGERGSPGFRGLPGSNGLPG 498

Query:   366 QRGPNYDIHR----GPSYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVPGYDVQRGPVYE 420
             ++GP  +       GPS  P    G D   GP     RG PG      PG D + GP   
Sbjct:   499 EKGPAGERGSPGPPGPS-GPAGDRGQD--GGPGLPGMRGLPGIPGS--PGSDGKPGPPGN 553

Query:   421 AQRAPSYIPQRGP-GYDLQRG-QGYDMRR----APSYDPSRGTGFD-GAPRGAAPHGQV- 472
              Q  P      GP G   Q G  G+   +    AP  +  RG G   G P  A  +G V 
Sbjct:   554 -QGEPGRSGPPGPAGPRGQPGVMGFPGPKGNEGAPGKNGERGPGGPPGTPGPAGKNGDVG 612

Query:   473 -P-PPLNNVPYGSATPPARSGS----GQPRGGNPA 501
              P PP    P G    P  SGS    G P G  PA
Sbjct:   613 LPGPPGPAGPAGDRGEPGPSGSPGLQGLPGGPGPA 647

 Score = 122 (48.0 bits), Expect = 0.00091, P = 0.00091
 Identities = 80/269 (29%), Positives = 105/269 (39%)

Query:   255 PVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAA-YDIPRG 312
             P G N Y+   G P   GP      AG++G AGP          + G P R     IP  
Sbjct:   190 PPGSNGYQGPPGEPGQPGPSGPPGPAGMIGPAGPPGKDG-----EPGRPGRNGDRGIPGL 244

Query:   313 PGYEASKG-PGYDASK-APSYDPTKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGP 369
             PG++   G PG    K A  +D   G   D    PG     G PG +   G      RGP
Sbjct:   245 PGHKGHPGMPGMPGMKGARGFDGKDGAKGDSG-APGPKGEAGQPGANGSPGQ--PGPRGP 301

Query:   370 NYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP-------GYETQRVPGYDVQRGPVYEAQ 422
               +  RG   +P   + Y         + +GP       G+     PG+  + GP   A 
Sbjct:   302 TGE--RGRPGNPGGPVTYRCDIVVFLSLFKGPPGPPGTAGFPGS--PGFKGEAGPPGPAG 357

Query:   423 RAPSYIP-QRG-PGYDLQRG----QGYDMRR-APSYDPSRG-TGFDGAPRGAAPHGQ-VP 473
              + S  P +RG PG   Q G    QG   R  +P      G +G  GAP    P G+ +P
Sbjct:   358 ASGS--PGERGEPGPQGQAGPPGPQGPPGRAGSPGNKGEMGPSGIPGAP--GLPGGRGLP 413

Query:   474 PPLNNVPYGSATPPARSGSGQPRGGNPAR 502
              P    P  S  P A+   G+P G N A+
Sbjct:   414 GP----PGTSGNPGAKGTPGEP-GKNGAK 437


>WB|WBGene00000628
            symbol:col-51 species:6239 "Caenorhabditis elegans"
            [GO:0042302 "structural constituent of cuticle" evidence=IEA]
            [GO:0016021 "integral to membrane" evidence=IEA] InterPro:IPR002486
            Pfam:PF01484 SMART:SM01088 GO:GO:0042302 HOGENOM:HOG000085656
            GeneTree:ENSGT00530000064217 EMBL:FO080999 RefSeq:NP_491195.1
            UniGene:Cel.29694 ProteinModelPortal:Q7Z152 MINT:MINT-3384184
            STRING:Q7Z152 EnsemblMetazoa:T28F2.8 GeneID:189052
            KEGG:cel:CELE_T28F2.8 UCSC:T28F2.8 CTD:189052 WormBase:T28F2.8
            eggNOG:NOG245561 InParanoid:Q7Z152 OMA:MMASRRI NextBio:941036
            Uniprot:Q7Z152
        Length = 435

 Score = 131 (51.2 bits), Expect = 2.3e-05, P = 2.3e-05
 Identities = 90/299 (30%), Positives = 102/299 (34%)

Query:   220 EKLRAE-LMNAPNVDRRADGSYGG--ATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPP 275
             EK+  E L  A      A G  GG  A G       G   G +      G P G  GPP 
Sbjct:    84 EKVAFEGLFRAKRQYATAAGGGGGYAAGGGGGGGGGGGGGGCHCAAQASGCPAGPPGPPG 143

Query:   276 SATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRGP----GYEASKGP-GYDASKAP 329
              A T G  G AG +          SG+  +A    P GP    G + + GP G      P
Sbjct:   144 EAGTDGEPGQAGQDGQPGQAGQADSGSSGQACITCPAGPPGPPGPDGNAGPAGAPGVPGP 203

Query:   330 SYD----PTKGPSYDPAKGPGYDPTKG-PGYDAQKGS----NYDAQRGPNYDIHRGPSYD 380
               D    P  GP   P   PG D   G PG D Q G+      ++  GP      GP   
Sbjct:   204 DGDAGSPPPPGPPGPPGP-PGNDGQPGAPGQDGQPGAPGTNTVNSPGGPGPAGPPGPPGP 262

Query:   381 P-QRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQR 439
             P Q G G   Q GP       PG      PG D Q G        P   P  GPG D   
Sbjct:   263 PGQDGSGGAAQPGP-------PG--PPGPPGNDGQPG-------GPGQ-PG-GPGQD--G 302

Query:   440 GQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 498
             G G D    P   P R       P G    G  P       Y +     R+ SG   GG
Sbjct:   303 GPGTDAAYCPC--PPR------TPAGGGGGGDFPAGGGGGGYSTGGGGGRADSGGAAGG 353


>UNIPROTKB|Q28009
            symbol:FUS "RNA-binding protein FUS" species:9913 "Bos
            taurus" [GO:0045944 "positive regulation of transcription from RNA
            polymerase II promoter" evidence=ISS] [GO:0045893 "positive
            regulation of transcription, DNA-dependent" evidence=ISS]
            [GO:0005737 "cytoplasm" evidence=ISS] [GO:0005634 "nucleus"
            evidence=ISS] [GO:0003723 "RNA binding" evidence=IEA] [GO:0003677
            "DNA binding" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0000166 "nucleotide binding" evidence=IEA]
            InterPro:IPR000504 InterPro:IPR001876 InterPro:IPR012677
            Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102
            PROSITE:PS50199 SMART:SM00360 SMART:SM00547 GO:GO:0005634
            GO:GO:0005737 GO:GO:0000166 GO:GO:0046872 GO:GO:0003677
            GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0045944 GO:GO:0003723
            eggNOG:NOG240581 GeneTree:ENSGT00530000063105 KO:K13098
            HOGENOM:HOG000038010 CTD:2521 EMBL:U26024 EMBL:BC119965
            IPI:IPI00705463 RefSeq:NP_776337.1 UniGene:Bt.2474
            ProteinModelPortal:Q28009 STRING:Q28009 PRIDE:Q28009
            Ensembl:ENSBTAT00000007571 GeneID:280796 KEGG:bta:280796
            InParanoid:Q28009 OrthoDB:EOG4DV5NH NextBio:20804952 Uniprot:Q28009
        Length = 513

 Score = 132 (51.5 bits), Expect = 2.3e-05, P = 2.3e-05
 Identities = 67/237 (28%), Positives = 93/237 (39%)

Query:   238 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAAT 297
             G+Y    G   ++ S +P GQ +Y  GYG          ++ +G  G   NT  S  +A 
Sbjct:    15 GAYPTQPGQGYSQQSNQPYGQQSY-GGYGQSTDTSGYGQSSYSGSYGQTQNTGYSTQSAP 73

Query:   298 QSGTPMRAAYDIPRGP--GY-EASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPG 354
             Q G      Y   +     Y + S  PGY    APS   T G     ++  GY   +G G
Sbjct:    74 Q-GYSSAGGYGSSQSSQSSYGQQSSYPGYGQQPAPS--GTSGSYGSSSQSSGYGQPQGGG 130

Query:   355 YDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPG--YETQRVPGYD 412
             Y  Q G  Y  Q+  +Y   +  SY+P +G G   Q   +     G G  Y   +     
Sbjct:   131 YGQQSG--YGGQQ-QSYGQQQ--SYNPPQGYGQQSQYNSSGGGGGGGGGSYGQDQPSMSS 185

Query:   413 VQRGPVYEAQ-RAPSY---IPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 465
                G  Y  Q ++  Y      RG G     G GY+ R +  Y+P RG G     RG
Sbjct:   186 GGGGGGYGNQDQSGGYGGGQQDRG-GRGRGGGGGYN-RSSGGYEP-RGRGGGRGGRG 239


>UNIPROTKB|F1RFI8
            symbol:EWSR1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005634 "nucleus" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0000166 "nucleotide binding" evidence=IEA]
            InterPro:IPR000504 InterPro:IPR001876 InterPro:IPR012677
            Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102
            PROSITE:PS50199 SMART:SM00360 SMART:SM00547 GO:GO:0005634
            GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
            GeneTree:ENSGT00530000063105 OMA:EGTSTGY EMBL:CU640468
            EMBL:CT737304 Ensembl:ENSSSCT00000010930 Uniprot:F1RFI8
        Length = 606

 Score = 121 (47.7 bits), Expect = 2.5e-05, Sum P(2) = 2.5e-05
 Identities = 54/178 (30%), Positives = 75/178 (42%)

Query:   238 GSYGGATGNSENETSGRPV-GQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 290
             G+YG  T  S  +       GQ AY   YG P  G+  P  P A +  V G G    +T+
Sbjct:    43 GTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYSTPTAPQAYSQPVQGYGTGAYDTT 102

Query:   291 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAP-SYDPTKGPSYDPAKGPGYD 348
             T+    TQ+    ++AY   P  P Y   + P   A+ AP SY  T+  SYD +     +
Sbjct:   103 TATVTTTQASYAAQSAYGTQPAYPAY--GQQP---AATAPASYSSTQPTSYDQSSYSQQN 157

Query:   349 PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQ 406
                 P    Q+ S+Y  Q   +Y      SY PQ G  Y   + P+   Q+   Y  Q
Sbjct:   158 TYGQPSSYGQQ-SSYGQQS--SYGQQPPTSYPPQTG-SYS--QAPSQYSQQSSSYGQQ 209

 Score = 57 (25.1 bits), Expect = 2.5e-05, Sum P(2) = 2.5e-05
 Identities = 19/46 (41%), Positives = 21/46 (45%)

Query:   464 RGAAP--HGQ-VPPPLNNVPYGSATP--PA--RSGSGQPRGGNPAR 502
             RG  P   G+ +PPPL   P G   P  P     G G  RGG P R
Sbjct:   404 RGGMPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPR 449

 Score = 49 (22.3 bits), Expect = 0.00016, Sum P(2) = 0.00016
 Identities = 25/86 (29%), Positives = 33/86 (38%)

Query:   421 AQRAPSYIPQRGPGYDLQRGQGYD--MRRAPSYDPSRGTGFDGAPRG-----AAPHGQVP 473
             A++ P     RG G   + G+G    +R  P      G G  G P G         G  P
Sbjct:   394 ARKKPPMNSMRG-GMPPREGRGMPPPLRGGPG-----GPGGPGGPMGRMGGRGGDRGGFP 447

Query:   474 PPLNNVPYGSATPPARSGSGQPRGGN 499
             P     P GS   P+  G+ Q R G+
Sbjct:   448 P---RGPRGSRGNPSGGGNVQHRAGD 470


>WB|WBGene00000251
            symbol:bli-1 species:6239 "Caenorhabditis elegans"
            [GO:0042302 "structural constituent of cuticle" evidence=IEA]
            [GO:0016021 "integral to membrane" evidence=IEA] [GO:0009792
            "embryo development ending in birth or egg hatching" evidence=IMP]
            [GO:0000003 "reproduction" evidence=IMP] [GO:0040002 "collagen and
            cuticulin-based cuticle development" evidence=IMP] [GO:0040011
            "locomotion" evidence=IMP] [GO:0002119 "nematode larval
            development" evidence=IMP] [GO:0018996 "molting cycle, collagen and
            cuticulin-based cuticle" evidence=IMP] [GO:0005578 "proteinaceous
            extracellular matrix" evidence=ISS] [GO:0042329 "structural
            constituent of collagen and cuticulin-based cuticle" evidence=ISS]
            InterPro:IPR002486 InterPro:IPR012613 Pfam:PF01484 Pfam:PF08175
            SMART:SM01088 GO:GO:0009792 GO:GO:0002119 GO:GO:0018996
            GO:GO:0005578 GO:GO:0040011 GO:GO:0000003 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793 GO:GO:0040002
            EMBL:Z46791 PIR:T19140 RefSeq:NP_496311.2 ProteinModelPortal:Q09457
            STRING:Q09457 PaxDb:Q09457 EnsemblMetazoa:C09G5.6 GeneID:174653
            KEGG:cel:CELE_C09G5.6 UCSC:C09G5.6 CTD:174653 WormBase:C09G5.6
            GeneTree:ENSGT00690000102663 HOGENOM:HOG000016778 InParanoid:Q09457
            OMA:WEEHRKS NextBio:884926 GO:GO:0042601 GO:GO:0042329
            GO:GO:0030436 Uniprot:Q09457
        Length = 948

 Score = 135 (52.6 bits), Expect = 2.5e-05, P = 2.5e-05
 Identities = 89/338 (26%), Positives = 120/338 (35%)

Query:   197 FYNDHLESLQVMEK--NYITMATEVEKLRAELMNAPNVDRRA-----DGSYGGATGNSEN 249
             FY++  E L   +   N I      E    E+  A + DR       +G Y   T     
Sbjct:    36 FYSEAQEELVEFKDIANNIWEEMVFELTPEEMREAEDNDREKRSYEPEGPYQSETTTPST 95

Query:   250 ETSGRPVGQNAYED--GYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAY 307
              TS       A ED  GY     +GPP S          P T     A   + T   + Y
Sbjct:    96 TTSTAATTTEAAEDESGYDFVNDNGPPSSRPRKPEPPTMPRTIQGFRAPPPAAT---STY 152

Query:   308 DIPRGPGYEASKGPGYDASKAPSYDPTKGPSYD-PAKGPG-----YDPTKGP--GYDAQK 359
               P G  Y+ + G    +S+ P Y P + PS   P   P      Y+P   P  GY    
Sbjct:   153 RPPHGSNYD-NYGREPASSRRP-YPPQQPPSTSAPHSSPNNRTSLYNPQPPPKTGYPTNP 210

Query:   360 GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP--GYET--QRVPG----Y 411
                Y+  + PNY   R P+Y       Y   R PN    R P  GY++  Q  P     Y
Sbjct:   211 RVPYNPPQ-PNYT--RQPTYPEDNRAPYKPTRSPNTPPPRQPSGGYDSDGQTPPSSPRIY 267

Query:   412 DVQR----GPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAA 467
             + +R    GP Y   + P+  P   PG   QR      R  P+   +R       P    
Sbjct:   268 NTRRPNNHGPGYPEDQVPTAPPV--PGQ--QRVPPTQTRNPPNPTNTRQPSRPVPPTSDG 323

Query:   468 PHGQVPPPLN-NVPYGSATPPARSGSG--QPRGGNPAR 502
              H +   P N +  Y +    +  G G  +PR G   R
Sbjct:   324 -HIEATTPYNPSAQYPTGKRGSHPGFGPQRPRPGTRPR 360

 Score = 131 (51.2 bits), Expect = 6.8e-05, P = 6.8e-05
 Identities = 76/266 (28%), Positives = 102/266 (38%)

Query:   255 PVGQNAYEDGYGVPQGHG----PPPSATTAGVVGAGPNTSTSAY---AATQSGTPM--RA 305
             P G N Y D YG          PP    +     + PN  TS Y      ++G P   R 
Sbjct:   155 PHGSN-Y-DNYGREPASSRRPYPPQQPPSTSAPHSSPNNRTSLYNPQPPPKTGYPTNPRV 212

Query:   306 AYDIPRGPGYEASKGPGY-DASKAPSYDPTKGPSYDPAKGP--GYD-----PTKGPG-YD 356
              Y+ P+ P Y  ++ P Y + ++AP Y PT+ P+  P + P  GYD     P   P  Y+
Sbjct:   213 PYNPPQ-PNY--TRQPTYPEDNRAP-YKPTRSPNTPPPRQPSGGYDSDGQTPPSSPRIYN 268

Query:   357 AQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRG 416
              ++ +N+    GP Y   + P+  P  G     QR P    +  P     R P   V   
Sbjct:   269 TRRPNNH----GPGYPEDQVPTAPPVPG----QQRVPPTQTRNPPNPTNTRQPSRPVPPT 320

Query:   417 PVYEAQRAPSYIPQRGPGYDL-QRGQ--GYDMRRA-PSYDPSRGTGFDGAPRGAAP-HGQ 471
                  +    Y P     Y   +RG   G+  +R  P   P RG   D     A P H  
Sbjct:   321 SDGHIEATTPYNPSAQ--YPTGKRGSHPGFGPQRPRPGTRP-RGNPCDQC--SAQPNHCP 375

Query:   472 VPPPLNNVPYGSATPPARSGSGQPRG 497
               PP    P G   PP   G   PRG
Sbjct:   376 SGPP---GPRGRPGPPGFPGQDGPRG 398

 Score = 130 (50.8 bits), Expect = 8.8e-05, P = 8.8e-05
 Identities = 76/265 (28%), Positives = 97/265 (36%)

Query:   230 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPN 288
             P  +R  DG+  G  G    +      GQ+      G P  HG   S  T G  G  G N
Sbjct:   427 PPGERGPDGT-PGVPGEDGIDGEQGVNGQDGQPGAPGAPGYHGMNGSPGTPGKPGLPGRN 485

Query:   289 TSTSAYAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKA----PSYDPTKGPSYDPA- 342
               +        G P      +P   G   + G  G D S      P  D T GP   P  
Sbjct:   486 GQSCKSIPGPPGQP--GVMGVPGRDGDPGTDGEHGQDGSPGIQGPPGRDGTSGPDGQPGV 543

Query:   343 KGPGYDPTKGPGYDA--QKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 400
               PG   T G GY    ++ S +D     N D  RG   +  R  GYD +R      +  
Sbjct:   544 SAPGAPGTDG-GYCPCPKRSSKFDFNDAYNDDEKRG--LEEHRPRGYDSERAE----EPR 596

Query:   401 PGYETQRVPGYDVQRGPVYEAQRAPSY------IPQRGPGY-DLQRGQGYDMRRAPSYDP 453
             P  +T R   YD   G   E QR P+Y       P R   Y D +R +    +R P   P
Sbjct:   597 PR-QTVRTNTYDENSGA--EHQRRPNYEPSAEVAPPRQDRYEDEERVREPPPKRPPP--P 651

Query:   454 SRGTGFDGAPRGAAPHGQVPPPLNN 478
              R T  +  P    P+ + PPP  N
Sbjct:   652 HRQTPHELYPE-EQPYVRRPPPPQN 675

 Score = 122 (48.0 bits), Expect = 0.00065, P = 0.00065
 Identities = 71/243 (29%), Positives = 88/243 (36%)

Query:   273 PPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYD 332
             P P  +   +    P   ++ Y   + G+        PR PG      P    S  P++ 
Sbjct:   316 PVPPTSDGHIEATTPYNPSAQYPTGKRGSHPGFGPQRPR-PGTRPRGNPCDQCSAQPNHC 374

Query:   333 PTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPN--YDIHRGPSYDPQRG-----L 385
             P+ GP   P   PG  P   PG D  +G      RG N  Y   +  SYDP  G     +
Sbjct:   375 PS-GPP-GPRGRPG--PPGFPGQDGPRGL-----RGLNGGYSGVQPSSYDPVIGCVQCPI 425

Query:   386 GYDMQRGPNYDMQRG-PGYE----TQRVPGYDVQRG----PVYEAQRAPSYIPQRGPGYD 436
             G   +RGP  D   G PG +     Q V G D Q G    P Y         P + PG  
Sbjct:   426 GPPGERGP--DGTPGVPGEDGIDGEQGVNGQDGQPGAPGAPGYHGMNGSPGTPGK-PGLP 482

Query:   437 LQRGQGYDMRRAPSYDPS-RGT-GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQ 494
              + GQ       P   P   G  G DG P     HGQ   P      G   PP R G+  
Sbjct:   483 GRNGQSCKSIPGPPGQPGVMGVPGRDGDPGTDGEHGQDGSP------GIQGPPGRDGTSG 536

Query:   495 PRG 497
             P G
Sbjct:   537 PDG 539


>ZFIN|ZDB-GENE-070912-607
            symbol:col11a1b "collagen, type XI, alpha 1b"
            species:7955 "Danio rerio" [GO:0005201 "extracellular matrix
            structural constituent" evidence=IEA] [GO:0005581 "collagen"
            evidence=IEA] InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078
            PROSITE:PS51461 SMART:SM00038 ZFIN:ZDB-GENE-070912-607
            Gene3D:2.60.120.200 InterPro:IPR008985 InterPro:IPR013320
            SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            InterPro:IPR001791 SMART:SM00282 Pfam:PF02210 GO:GO:0005201
            HOGENOM:HOG000085654 SMART:SM00210 GeneTree:ENSGT00700000104155
            UniGene:Dr.3536 EMBL:BX510342 EMBL:BX547933 EMBL:CT583637
            EMBL:GQ485665 IPI:IPI00511026 RefSeq:NP_001171883.1
            UniGene:Dr.42128 Ensembl:ENSDART00000049589 GeneID:555202
            KEGG:dre:555202 CTD:555202 NextBio:20880850 Uniprot:D6MUD3
        Length = 1815

 Score = 138 (53.6 bits), Expect = 2.5e-05, P = 2.5e-05
 Identities = 71/250 (28%), Positives = 100/250 (40%)

Query:   266 GVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDA 325
             G P  HG P      G  G          + T    P RA  +  +GP   A +     A
Sbjct:   469 GSPGLHGDPGERGPPGRPGLPGGDGAPGPSGTILMLPFRAGGESSKGPVVSAQEAQA-QA 527

Query:   326 SKAPSYDPTKGPSYDPAKGPGYD-PTKGPGYDAQKGSNYDA-QRGPNYDIHRGPSYDP-- 381
               A +    +GP   P    G   P  GPG    KG + D+  +GP     +GP+  P  
Sbjct:   528 ILAQARLTMRGPP-GPMGLTGRSGPVGGPGAPGAKGESGDSGPQGPRG--LQGPTGSPGK 584

Query:   382 --QRGL-GYDMQRG-PNYDMQRGP-GYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYD 436
               +RG  G D  RG P     +G  G++   +PG   ++G  +  ++ P  +P   PG D
Sbjct:   585 PGKRGRNGADGARGIPGESGAKGDRGFDG--LPGLPGEKG--HRGEQGPIGLPG-SPGED 639

Query:   437 LQRGQGYDM--RRAPSYDPSRGT-GFDGAPRGAAPHGQV----PP-PLNNV-PYGSATPP 487
               RG+  ++  R  P     RG  G  G+P  A   G      PP P  N+ P G   PP
Sbjct:   640 GPRGEDGEIGQRGMPGESGPRGLLGPRGSPGTAGQRGLTGLDGPPGPKGNMGPQGEPGPP 699

Query:   488 ARSGSGQPRG 497
              + G+  P G
Sbjct:   700 GQQGNTGPHG 709


>UNIPROTKB|J9P0L0
            symbol:COL3A1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 GeneTree:ENSGT00660000095287 KO:K06236
            CTD:1281 EMBL:AAEX03017880 RefSeq:XP_851009.1
            Ensembl:ENSCAFT00000047312 GeneID:478835 KEGG:cfa:478835
            Uniprot:J9P0L0
        Length = 1465

 Score = 137 (53.3 bits), Expect = 2.6e-05, P = 2.6e-05
 Identities = 83/284 (29%), Positives = 105/284 (36%)

Query:   237 DGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYA 295
             +G  G      E+ + G P G+    D  G P   GPP +A   G  G AGP        
Sbjct:   653 NGKPGEPGPKGESGSPGVPGGKG---DS-GAPGERGPPGAAGPMGPRGGAGPPGPEGGKG 708

Query:   296 AT-------QSGTP----MRAAYDIPRGPGYEASKG-PGY-DASKAPSYDPTKGPSYDPA 342
             A         +GTP    M      P GPG +  KG PG   A  AP  D  +GP+  P 
Sbjct:   709 AAGPPGPPGSAGTPGLQGMPGERGGPGGPGPKGDKGEPGSAGADGAPGKDGPRGPT-GPI 767

Query:   343 KGPGYDPTKGPGYDAQKGS-NYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP 401
               PG  P   PG   + G+       GP         + P    G+    G N +    P
Sbjct:   768 GPPG--PAGQPGDKGEGGAPGLPGIAGPRGGPGERGEHGPPGPAGFPGAPGQNGE----P 821

Query:   402 GYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFD 460
             G + +R  PG   + GP   A       P   PG    +G+    R +P      G G  
Sbjct:   822 GAKGERGAPGEKGEGGPPGVAGPPGGAGPAGPPGPQGVKGE----RGSPG-----GPGAA 872

Query:   461 GAPRGAAPHGQVPPPLNNV---PYGSATPPARSGSGQPRGGNPA 501
             G P G    G   PP NN    P GS+  P + G   P G N A
Sbjct:   873 GFPGGRGLPG---PPGNNGNPGPPGSSGAPGKDGPPGPPGNNGA 913

 Score = 132 (51.5 bits), Expect = 8.9e-05, P = 8.9e-05
 Identities = 83/280 (29%), Positives = 101/280 (36%)

Query:   236 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP----NTS 290
             A G  GG  G +       P G + +    G P   GPP     AG  G  GP      S
Sbjct:   165 AGGGIGGYPGPAGPPGPPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPAGPPGPPGAMGPS 224

Query:   291 TSAYAATQSGTPMRAAY-DIPRGPGYEASKG-PGYDASKAP-SYDPTKGPSYDPAKGPGY 347
               A    +SG P R     +P  PG +   G PG+   K    +D   G   D    PG 
Sbjct:   225 GPAGKDGESGRPGRPGERGLPGPPGMKGPAGMPGFPGMKGHRGFDGRNGEKGDTG-APGL 283

Query:   348 DPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGP-GYE 404
                 G PG +   G      RG   +  R P      G  G D  RG   D Q GP G  
Sbjct:   284 KGENGLPGENGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQPGPPGPP 338

Query:   405 -TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 463
              T   PG    +G V  A    S      PG   QRG+      A +  P    G +G+P
Sbjct:   339 GTAGFPGSPGAKGEVGPAGSPGS---NGSPG---QRGEPGPQGHAGAPGPPGPPGSNGSP 392

Query:   464 RGAAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RGG 498
              G    G  P  +   P   G+  PP   G+ G P  RGG
Sbjct:   393 GGKGEMG--PAGIPGAPGLIGARGPPGPPGTNGAPGQRGG 430

 Score = 123 (48.4 bits), Expect = 0.00084, P = 0.00084
 Identities = 78/261 (29%), Positives = 98/261 (37%)

Query:   257 GQNAYEDGYGVPQGHGPPPSATTAGVVGA----GPNTSTSAYAAT-QSGTPMRAAY-DIP 310
             G +      G P   GPP +A   G  GA    GP  S  +  +  Q G P    +   P
Sbjct:   321 GNDGARGSDGQPGPPGPPGTAGFPGSPGAKGEVGPAGSPGSNGSPGQRGEPGPQGHAGAP 380

Query:   311 RGPGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDP-TKG-PGYDAQKGS-NYDAQ 366
               PG   S G PG      P+  P   P    A+GP   P T G PG     G    +  
Sbjct:   381 GPPGPPGSNGSPGGKGEMGPAGIPG-APGLIGARGPPGPPGTNGAPGQRGGAGEPGKNGA 439

Query:   367 RG-PNYDIHRGPSYDPQ-RG-LGYDMQRG-PNYDMQRG-PGYETQR-VPGYDVQRGPV-- 418
             +G P     RG +  P   G  G D + G P      G PG   +R  PG+   RGP   
Sbjct:   440 KGEPGPRGERGEAGSPGIPGPKGEDGKDGSPGEPGANGLPGAAGERGAPGF---RGPAGA 496

Query:   419 --YEAQRAPSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPP 475
                  ++ P+   + GPG    RG  G   R      P    G  G+P G    G+  PP
Sbjct:   497 NGLPGEKGPAG-ERGGPGPAGPRGAPGEPGRDGVPGGPGM-RGMPGSPGGPGSDGKPGPP 554

Query:   476 LNNVPYGSATPPARSGS-GQP 495
              +    G   PP  SG  GQP
Sbjct:   555 GSQGESGRPGPPGPSGPRGQP 575


>UNIPROTKB|F1N7Q7
            symbol:COL4A2 "Collagen alpha-2(IV) chain" species:9913
            "Bos taurus" [GO:0071560 "cellular response to transforming growth
            factor beta stimulus" evidence=IEA] [GO:0016525 "negative
            regulation of angiogenesis" evidence=IEA] [GO:0006351
            "transcription, DNA-dependent" evidence=IEA] [GO:0005587 "collagen
            type IV" evidence=IEA] [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] InterPro:IPR001442 Pfam:PF01413
            PROSITE:PS51403 SMART:SM00111 GO:GO:0071560 InterPro:IPR016187
            SUPFAM:SSF56436 GO:GO:0006351 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0016525 GO:GO:0005201 GO:GO:0005587 Gene3D:2.170.240.10
            GeneTree:ENSGT00690000101772 EMBL:DAAA02034911 IPI:IPI00712524
            Ensembl:ENSBTAT00000005916 OMA:QETIQPG Uniprot:F1N7Q7
        Length = 1650

 Score = 137 (53.3 bits), Expect = 2.9e-05, P = 2.9e-05
 Identities = 75/251 (29%), Positives = 98/251 (39%)

Query:   226 LMNAPNVD-RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVV 283
             L   P +  R+ D    GA G +  +    P G + +    G+P GH G        G  
Sbjct:    18 LQGFPGLQGRKGDKGQRGAPGITGPKGDVGPRGVSGFPGADGIP-GHPGQGGPRGPPGYD 76

Query:   284 GAGPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGPGYDASKAPSYDPTKGPSYDPA 342
             G       S YA    G P    +  PRGP G +  KG  Y A  +   D  +G   +P 
Sbjct:    77 GCNGTVGDSGYA----GPPGPGGFLGPRGPQGPKGQKGEPY-ALSSEDRDKYRGEPGEPG 131

Query:   343 KGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDP-QRGLGYDMQRGPNYDMQ-RG 400
                   P   PG   Q G    A   P      GP   P  RGLG+  ++G   DM  +G
Sbjct:   132 LVGLQGPPGRPGPVGQMGP-VGAPGRPGPPGPPGPKGQPGNRGLGFYGEKGEKGDMGLQG 190

Query:   401 PGYETQRVP---GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG-QGYDMRRAPSYDPSRG 456
             PG     +P   GY  +  PVYE       +P++  G   ++G QG   R   S     G
Sbjct:   191 PG----GIPPDNGYVEKPTPVYEL------LPEQYKG---EKGSQGEPGRIGVSLKGEEG 237

Query:   457 T-GFDGAPRGA 466
               GF G PRGA
Sbjct:   238 VVGFSG-PRGA 247


>UNIPROTKB|F1LRJ1
            symbol:Col4a3 "Protein Col4a3" species:10116 "Rattus
            norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
            RGD:71085 GO:GO:0006917 GO:GO:0008283 InterPro:IPR016187
            SUPFAM:SSF56436 GO:GO:0006919 GO:GO:0007166 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0016525 GO:GO:0005201 GO:GO:0005587
            Gene3D:2.170.240.10 GeneTree:ENSGT00690000101772 KO:K06237 CTD:1285
            GO:GO:0032836 IPI:IPI00367109 RefSeq:NP_001129231.1
            UniGene:Rn.121139 Ensembl:ENSRNOT00000020669 GeneID:363265
            KEGG:rno:363265 NextBio:683046 ArrayExpress:F1LRJ1 Uniprot:F1LRJ1
        Length = 1670

 Score = 137 (53.3 bits), Expect = 3.0e-05, P = 3.0e-05
 Identities = 93/289 (32%), Positives = 106/289 (36%)

Query:   237 DGSYGGATGNSENETSGRPV--GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAY 294
             DGS GG          G P   G+   +   G P   GPP  A  AG  G GP       
Sbjct:   568 DGSPGGPGAKGPRGPRGEPALSGRKGDQGPPGAPGSPGPPGPAGPAGPPGYGPQGEPGPK 627

Query:   295 AATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAK-GP-GYDPTKG 352
              A   G P   A     GP  EA    G  ++  P   P  GP   P + GP G     G
Sbjct:   628 GA--QGVP--GAL----GPPGEAGL-KGESSASIPVLGPP-GPPGPPGQAGPRGLPGLPG 677

Query:   353 PGYDAQKGS-NYDAQRG-PNYDIH--RGPSYDPQRGLGYDMQRG-PNYDMQRGPGYETQR 407
             P      G    D + G P       RGP  D     G+    G P Y     PG ET R
Sbjct:   678 PVGTCDPGHPGPDGEPGIPEVGFPGARGPKGDQ----GFPGTIGLPGY-----PG-ETGR 727

Query:   408 VPGYDVQRGPVYEAQRAPSY-IP-QRG-PGYDLQRGQGYDMRRA--PSYDPSRGT----G 458
              PGY  + G V  A+  PS   P + G PG+  +RG   +      P      GT    G
Sbjct:   728 -PGYPGEMG-VPGAKGEPSVGRPGEPGKPGFPGERGNSGENGDIGLPGLPGPPGTPGKDG 785

Query:   459 FDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQP--RG--GNPARR 503
             FDG P    P GQ  PP    P G   P  R   G P   G  G P RR
Sbjct:   786 FDGPP--GDP-GQSGPPGAKGPPGRCIPGPRGTQGLPGLNGLKGQPGRR 831


>UNIPROTKB|J9P8I1
            symbol:CROCC "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0051297 "centrosome organization"
            evidence=IEA] [GO:0035253 "ciliary rootlet" evidence=IEA]
            InterPro:IPR026733 GO:GO:0051297 GO:GO:0035253
            GeneTree:ENSGT00700000104019 PANTHER:PTHR23159:SF5
            EMBL:AAEX03001849 Ensembl:ENSCAFT00000047339 Uniprot:J9P8I1
        Length = 2015

 Score = 116 (45.9 bits), Expect = 3.1e-05, Sum P(2) = 3.1e-05
 Identities = 38/135 (28%), Positives = 69/135 (51%)

Query:    51 MEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIG--GMKSERELQM 108
             +E++  S   E ++L T+ + L      LR+EL  AQ   Q+  GQ G    + E     
Sbjct:  1147 LERQRESSTREAEELRTQLRLLEDARDGLRRELLEAQR--QVREGQDGREAQRQEASELR 1204

Query:   109 RNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQ 168
             R+L+E + + EA  +T E ++   +K+++E  +L +A E+   K+  LT+       + +
Sbjct:  1205 RSLSEGVQEREALRRTNEELRAAVKKAESERISLKLANEDKEQKLALLTEARVAVGKEAE 1264

Query:   169 QIPALLSELESLRQE 183
             ++ A L E+E  R E
Sbjct:  1265 ELRAGLQEVERSRLE 1279

 Score = 106 (42.4 bits), Expect = 0.00036, Sum P(2) = 0.00036
 Identities = 52/200 (26%), Positives = 90/200 (45%)

Query:    43 DMMPPPEVMEQKIASQHVEMQKLATEN--QRLAATHGTLRQELAAAQHELQI-LHGQIGG 99
             ++M    V  ++ A   +  Q++A E   QRL        +EL A + +LQ  L  +   
Sbjct:   969 ELMAQKLVQAEREAQASLREQRVAHEEDLQRLQQEKEAAWRELEAERAQLQSQLQREREE 1028

Query:   100 MKSERELQMRNLTEKIAKMEAE----LKTAEPVK---LEFQKSKTEA--QNLVVAREELI 150
             + +  E +   L+E+IA ++ E    L  AE  K   L  ++S+  A  + L+  +  L 
Sbjct:  1029 LLARLEAEKEELSEEIAALQQERDEGLLLAESEKQQALSLKESEKTALSEKLMGTQHSLA 1088

Query:   151 A---KVHQLTQDLQ-RAHTDVQQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQ 206
             A   ++ +  +D Q R   D   + AL+SEL  LR +      T+  E K   +   +L 
Sbjct:  1089 AISLEMERQKRDAQSRQEQDRSTVNALMSELRDLRAQLEEAADTHAQEVKRLQEQARNL- 1147

Query:   207 VMEKNYITMATEVEKLRAEL 226
               E+   +   E E+LR +L
Sbjct:  1148 --ERQRESSTREAEELRTQL 1165

 Score = 73 (30.8 bits), Expect = 3.1e-05, Sum P(2) = 3.1e-05
 Identities = 29/92 (31%), Positives = 39/92 (42%)

Query:   407 RVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFD--GAPR 464
             R  G + +   V EAQR    +   G    L+RG G  + R+PS  P   T F    AP 
Sbjct:  1413 RGQGLEAELARV-EAQRRADEVQLGGLRSALRRGLG--LGRSPSPPPLPSTSFPTGSAPA 1469

Query:   465 GAAPHG-QVPPPLNNVPYGSATPPARSGSGQP 495
             G +  G + P PL   P     PP+   +  P
Sbjct:  1470 GGSGEGLRSPSPLERSP--GCEPPSPGPTTSP 1499


>UNIPROTKB|F1Q2C0
            symbol:CROCC "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0051297 "centrosome organization"
            evidence=IEA] [GO:0035253 "ciliary rootlet" evidence=IEA]
            InterPro:IPR026733 GO:GO:0051297 GO:GO:0035253
            GeneTree:ENSGT00700000104019 PANTHER:PTHR23159:SF5
            EMBL:AAEX03001849 Ensembl:ENSCAFT00000025161 OMA:SDWRREE
            Uniprot:F1Q2C0
        Length = 2018

 Score = 116 (45.9 bits), Expect = 3.1e-05, Sum P(2) = 3.1e-05
 Identities = 38/135 (28%), Positives = 69/135 (51%)

Query:    51 MEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIG--GMKSERELQM 108
             +E++  S   E ++L T+ + L      LR+EL  AQ   Q+  GQ G    + E     
Sbjct:  1147 LERQRESSTREAEELRTQLRLLEDARDGLRRELLEAQR--QVREGQDGREAQRQEASELR 1204

Query:   109 RNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQ 168
             R+L+E + + EA  +T E ++   +K+++E  +L +A E+   K+  LT+       + +
Sbjct:  1205 RSLSEGVQEREALRRTNEELRAAVKKAESERISLKLANEDKEQKLALLTEARVAVGKEAE 1264

Query:   169 QIPALLSELESLRQE 183
             ++ A L E+E  R E
Sbjct:  1265 ELRAGLQEVERSRLE 1279

 Score = 106 (42.4 bits), Expect = 0.00036, Sum P(2) = 0.00036
 Identities = 52/200 (26%), Positives = 90/200 (45%)

Query:    43 DMMPPPEVMEQKIASQHVEMQKLATEN--QRLAATHGTLRQELAAAQHELQI-LHGQIGG 99
             ++M    V  ++ A   +  Q++A E   QRL        +EL A + +LQ  L  +   
Sbjct:   969 ELMAQKLVQAEREAQASLREQRVAHEEDLQRLQQEKEAAWRELEAERAQLQSQLQREREE 1028

Query:   100 MKSERELQMRNLTEKIAKMEAE----LKTAEPVK---LEFQKSKTEA--QNLVVAREELI 150
             + +  E +   L+E+IA ++ E    L  AE  K   L  ++S+  A  + L+  +  L 
Sbjct:  1029 LLARLEAEKEELSEEIAALQQERDEGLLLAESEKQQALSLKESEKTALSEKLMGTQHSLA 1088

Query:   151 A---KVHQLTQDLQ-RAHTDVQQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQ 206
             A   ++ +  +D Q R   D   + AL+SEL  LR +      T+  E K   +   +L 
Sbjct:  1089 AISLEMERQKRDAQSRQEQDRSTVNALMSELRDLRAQLEEAADTHAQEVKRLQEQARNL- 1147

Query:   207 VMEKNYITMATEVEKLRAEL 226
               E+   +   E E+LR +L
Sbjct:  1148 --ERQRESSTREAEELRTQL 1165

 Score = 73 (30.8 bits), Expect = 3.1e-05, Sum P(2) = 3.1e-05
 Identities = 29/92 (31%), Positives = 39/92 (42%)

Query:   407 RVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFD--GAPR 464
             R  G + +   V EAQR    +   G    L+RG G  + R+PS  P   T F    AP 
Sbjct:  1413 RGQGLEAELARV-EAQRRADEVQLGGLRSALRRGLG--LGRSPSPPPLPSTSFPTGSAPA 1469

Query:   465 GAAPHG-QVPPPLNNVPYGSATPPARSGSGQP 495
             G +  G + P PL   P     PP+   +  P
Sbjct:  1470 GGSGEGLRSPSPLERSP--GCEPPSPGPTTSP 1499


>MGI|MGI:88453
            symbol:Col3a1 "collagen, type III, alpha 1" species:10090 "Mus
            musculus" [GO:0001568 "blood vessel development" evidence=IMP]
            [GO:0005178 "integrin binding" evidence=ISO] [GO:0005201
            "extracellular matrix structural constituent" evidence=ISO]
            [GO:0005576 "extracellular region" evidence=IEA] [GO:0005578
            "proteinaceous extracellular matrix" evidence=IEA] [GO:0005581
            "collagen" evidence=IDA] [GO:0005586 "collagen type III"
            evidence=ISO;IDA] [GO:0005615 "extracellular space" evidence=ISO]
            [GO:0007160 "cell-matrix adhesion" evidence=ISO] [GO:0007179
            "transforming growth factor beta receptor signaling pathway"
            evidence=ISO] [GO:0007229 "integrin-mediated signaling pathway"
            evidence=ISO] [GO:0007507 "heart development" evidence=ISO]
            [GO:0009314 "response to radiation" evidence=ISO] [GO:0018149
            "peptide cross-linking" evidence=ISO] [GO:0030199 "collagen fibril
            organization" evidence=ISO;IMP] [GO:0031012 "extracellular matrix"
            evidence=ISO;IDA] [GO:0032964 "collagen biosynthetic process"
            evidence=ISO] [GO:0034097 "response to cytokine stimulus"
            evidence=ISO] [GO:0042060 "wound healing" evidence=ISO] [GO:0043206
            "extracellular fibril organization" evidence=ISO] [GO:0043588 "skin
            development" evidence=ISO] [GO:0046332 "SMAD binding" evidence=IPI]
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0048407
            "platelet-derived growth factor binding" evidence=ISO] [GO:0048565
            "digestive tract development" evidence=IMP] [GO:0050777 "negative
            regulation of immune response" evidence=ISO] [GO:0071230 "cellular
            response to amino acid stimulus" evidence=IDA] InterPro:IPR000885
            InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410 ProDom:PD002078
            PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038
            SMART:SM00214 MGI:MGI:88453 GO:GO:0043588 GO:GO:0005615
            GO:GO:0007507 GO:GO:0046872 GO:GO:0034097 GO:GO:0030199
            GO:GO:0001501 GO:GO:0007179 GO:GO:0007229 GO:GO:0007160
            InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793 GO:GO:0042060
            GO:GO:0001568 GO:GO:0048565 GO:GO:0050777 GO:GO:0009314
            GO:GO:0018149 GO:GO:0032964 GO:GO:0071230 GO:GO:0043206
            GO:GO:0005201 HOVERGEN:HBG004933 KO:K06236 OrthoDB:EOG4FTW1C
            CTD:1281 OMA:EGSPGHP ChiTaRS:COL3A1 GO:GO:0005586 EMBL:X52046
            EMBL:BC043089 EMBL:BC058724 EMBL:M18933 EMBL:K03037 EMBL:AK019448
            EMBL:X57983 IPI:IPI00129571 PIR:A27353 PIR:S59856
            RefSeq:NP_034060.2 UniGene:Mm.249555 ProteinModelPortal:P08121
            SMR:P08121 STRING:P08121 PhosphoSite:P08121 PaxDb:P08121
            PRIDE:P08121 Ensembl:ENSMUST00000087883 GeneID:12825 KEGG:mmu:12825
            InParanoid:P08121 NextBio:282310 Bgee:P08121 CleanEx:MM_COL3A1
            Genevestigator:P08121 Uniprot:P08121
        Length = 1464

 Score = 136 (52.9 bits), Expect = 3.3e-05, P = 3.3e-05
 Identities = 86/285 (30%), Positives = 101/285 (35%)

Query:   230 PNVDRRADGSYGGATGNSENETSGR--PVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGP 287
             P  +   DGS G    N     +G   P G        G+P   GPP      G  G   
Sbjct:   459 PKGEDGKDGSPGEPGANGLPGAAGERGPSGFRGPAGPNGIPGEKGPPGERGGPGPAGPRG 518

Query:   288 NTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDP-TKGPSYDPAKGPG 346
                      T  G  +R     P GPG +   GP    S+  S  P   GPS  P   PG
Sbjct:   519 VAGEPGRDGTPGGPGIRGMPGSPGGPGNDGKPGP--PGSQGESGRPGPPGPS-GPRGQPG 575

Query:   347 YDPTKGP-GYDAQKGSNYDAQRGPNYDIHRGPS-YDPQRG-LGYDMQRGPNYDM-QRGP- 401
                  GP G D   G N + + GP      GP+  + + G  G     GP  D    GP 
Sbjct:   576 VMGFPGPKGNDGAPGKNGE-RGGPGGPGLPGPAGKNGETGPQGPPGPTGPAGDKGDSGPP 634

Query:   402 GYE-TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGT-GF 459
             G +  Q +PG     GP  E  +     P+   G     G G     AP      GT G 
Sbjct:   635 GPQGLQGIPGTG---GPPGENGKPGEPGPKGEVGAPGAPG-GKGDSGAPGERGPPGTAGI 690

Query:   460 DGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS----GQP--RGG 498
              GA  GA P G   P     P G   PP  SGS    G P  RGG
Sbjct:   691 PGARGGAGPPG---PEGGKGPAGPPGPPGASGSPGLQGMPGERGG 732


>UNIPROTKB|F1NRH2
            symbol:LOC100858979 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0005581 "collagen" evidence=IEA] [GO:0005938
            "cell cortex" evidence=IEA] InterPro:IPR008983 GO:GO:0005938
            GO:GO:0005581 Gene3D:2.60.120.40 InterPro:IPR001073
            InterPro:IPR008160 Pfam:PF00386 Pfam:PF01391 PRINTS:PR00007
            SMART:SM00110 SUPFAM:SSF49842 PROSITE:PS50871
            GeneTree:ENSGT00700000104270 OMA:IKGPPPN EMBL:AC147437
            IPI:IPI01017314 RefSeq:XP_003641055.1 Ensembl:ENSGALT00000024133
            GeneID:100858979 KEGG:gga:100858979 Uniprot:F1NRH2
        Length = 674

 Score = 132 (51.5 bits), Expect = 3.4e-05, P = 3.4e-05
 Identities = 87/283 (30%), Positives = 107/283 (37%)

Query:   235 RADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTSA 293
             + D    GA G +       P G+   E G G P   GPP  A   G  G  GP      
Sbjct:   227 KGDRGLPGARGEAGIPGPQGPPGEPG-EVGIGKPGPMGPPGPAGIPGAKGLPGP------ 279

Query:   294 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAK-GP-GYDPT 350
               A   G+P    +  P  PG +  +GP G      P  D  +GP+  P + GP G    
Sbjct:   280 --AGLPGSPGLPGFGKPGLPGMKGHRGPEGPPGFPGPKGD--QGPAGVPGEPGPAGPQGN 335

Query:   351 KGP-GYDAQKGSNYDAQRGPNYDIHR-GPSYDP----QRGL-GYDMQRGPNYDMQRG-PG 402
              GP G     G N     GP  D+   GP+  P    +RGL G D +  P Y  ++G PG
Sbjct:   336 MGPQGLKGLPGEN--GLPGPKGDMGPVGPAGFPGAKGERGLPGLDGK--PGYPGEQGLPG 391

Query:   403 YETQRVPGYDVQRGPVYEAQRAPSYIPQR-GP-GYDLQRG-QGYDMRRAPSYDPS-RGT- 457
              +    PG   Q+G    A   P  +P   GP G     G  G    R PS  P  RG  
Sbjct:   392 PKGH--PGLPGQKGDTGHA--GPPGLPGPVGPQGVKGVPGINGEPGPRGPSGIPGIRGPI 447

Query:   458 GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNP 500
             G  G P      G+   P    P G AT   R   G P    P
Sbjct:   448 GPPGMPGAPGAKGEAGAPGLPGPAGIATKGLRGPMGPPGPPGP 490


>UNIPROTKB|F1RXW0
            symbol:COL5A2 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0071230 "cellular response to amino acid stimulus"
            evidence=IEA] [GO:0048592 "eye morphogenesis" evidence=IEA]
            [GO:0046332 "SMAD binding" evidence=IEA] [GO:0043588 "skin
            development" evidence=IEA] [GO:0030199 "collagen fibril
            organization" evidence=IEA] [GO:0005588 "collagen type V"
            evidence=IEA] [GO:0001501 "skeletal system development"
            evidence=IEA] [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] InterPro:IPR000885 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS51461 SMART:SM00038 GO:GO:0043588
            GO:GO:0030199 GO:GO:0001501 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0071230 GO:GO:0005201 GO:GO:0048592
            GeneTree:ENSGT00660000095287 GO:GO:0005588 OMA:PDHKPVW
            EMBL:CU467671 Ensembl:ENSSSCT00000017460 ArrayExpress:F1RXW0
            Uniprot:F1RXW0
        Length = 1269

 Score = 135 (52.6 bits), Expect = 3.6e-05, P = 3.6e-05
 Identities = 87/293 (29%), Positives = 109/293 (37%)

Query:   233 DRRADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNT 289
             ++ A+G+ G  GA G         P G    E G   P+G  GPP S    G  G    T
Sbjct:   554 EKGAEGTAGNDGARGLPGPLGPPGPAGPTG-EKGEPGPRGLVGPPGSRGNPGSRGENGPT 612

Query:   290 STSAYAATQS--GTP-MRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGP 345
                 +A  Q   G P ++     P   G   S GP G   S  P + P   P     +G 
Sbjct:   613 GAVGFAGPQGPDGQPGVKGEPGEPGQKGDAGSPGPQGLAGSPGP-HGPNGVPGLKGGRGT 671

Query:   346 GYDP--TKGPGYDAQKGS-NYDAQRGPNYDIHRGPSYDPQRGLGYDM-------QRGPNY 395
                P  T  PG   + G        GP   +   P  +   GL  D         RGP  
Sbjct:   672 QGPPGATGFPGSAGRVGPPGPTGAPGPAGPLGE-PGKEGPPGLRGDPGSHGRVGDRGPA- 729

Query:   396 DMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYD 452
                 GPG +    PG D Q GP  +    P+    QRG  G   QRG+ G      P+  
Sbjct:   730 GPPGGPGDKGD--PGEDGQPGP--DGPPGPAGTTGQRGIVGMPGQRGERGMPGLPGPAGT 785

Query:   453 PSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP-RGGNPARR 503
             P +  G  GAP    P G V PP +N P G   P   +G+ G P R G    R
Sbjct:   786 PGK-VGPTGAPGDKGPPGPVGPPGSNGPVGEPGPEGPAGNDGTPGRDGAVGER 837


>TAIR|locus:4010713902
            symbol:AT4G22505 species:3702 "Arabidopsis thaliana"
            [GO:0006869 "lipid transport" evidence=IEA] EMBL:CP002687
            GO:GO:0006869 InterPro:IPR016140 SUPFAM:SSF47699 UniGene:At.22887
            UniGene:At.74604 IPI:IPI00938995 RefSeq:NP_001154263.1 PRIDE:F4JLV7
            EnsemblPlants:AT4G22505.1 GeneID:5008157 KEGG:ath:AT4G22505
            OMA:GSEMAGM Uniprot:F4JLV7
        Length = 530

 Score = 130 (50.8 bits), Expect = 4.0e-05, P = 4.0e-05
 Identities = 54/229 (23%), Positives = 67/229 (29%)

Query:   268 PQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASK 327
             P+   PPP  T      A P T   +        P       P+ P     K P     +
Sbjct:    74 PRTPPPPPPRTPRTPPTAPPRTPPVSPRIPPILPPKTPPTAPPQTPPVSPPKSPPNSPPR 133

Query:   328 APSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGY 387
             AP   P + P   P + P   P + P     +       R P+    R P   P R    
Sbjct:   134 APPLSPPRTPPTSPPRVPPLSPPRTPPTSPPRAPPIPPPRTPSTSPPRAPPLSPPRTPPT 193

Query:   388 DMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR 447
                R P       P     R P     R P     R P   P R P     R       R
Sbjct:   194 SPPRAPPVPPPNTPPTSPPRAPPLSPPRTPPNSPPRTPPTSPPRAPPVPPPRISPTAPPR 253

Query:   448 APSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPR 496
             AP   P R T     PR       + PP +       +PP    +  PR
Sbjct:   254 APPLSPPR-TPPTSPPRTPPLSPPITPPTSPPRAPPLSPPRTPPTSPPR 301

 Score = 121 (47.7 bits), Expect = 0.00039, P = 0.00039
 Identities = 58/231 (25%), Positives = 69/231 (29%)

Query:   268 PQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASK 327
             P+   PPP  T        P T  +   A     P+      PR P     K P     +
Sbjct:    63 PRTPPPPPPRTPRTPPPPPPRTPRTPPTAPPRTPPVS-----PRIPPILPPKTPPTAPPQ 117

Query:   328 APSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGY 387
              P   P K P   P + P   P + P     +       R P     R P   P R    
Sbjct:   118 TPPVSPPKSPPNSPPRAPPLSPPRTPPTSPPRVPPLSPPRTPPTSPPRAPPIPPPRTPST 177

Query:   388 DMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR 447
                R P     R P     R P       P     RAP   P R P     R       R
Sbjct:   178 SPPRAPPLSPPRTPPTSPPRAPPVPPPNTPPTSPPRAPPLSPPRTPPNSPPRTPPTSPPR 237

Query:   448 APSYDPSRGTGFDGAPRGAAPHGQVP--PPLNNVPYGSATPPARSGSGQPR 496
             AP   P R +     PR  AP    P  PP +       +PP    +  PR
Sbjct:   238 APPVPPPRISP-TAPPR--APPLSPPRTPPTSPPRTPPLSPPITPPTSPPR 285


>UNIPROTKB|F1PG69
            symbol:COL3A1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 GeneTree:ENSGT00660000095287 OMA:EGSPGHP
            EMBL:AAEX03017880 Ensembl:ENSCAFT00000023503 Uniprot:F1PG69
        Length = 1467

 Score = 135 (52.6 bits), Expect = 4.2e-05, P = 4.2e-05
 Identities = 85/274 (31%), Positives = 106/274 (38%)

Query:   252 SGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIP 310
             +G+P G+ +++   G P   GPP +A   G  G AGP           SG  +R    I 
Sbjct:   653 NGKP-GEPSHQGDSGAPGERGPPGAAGPMGPRGGAGP---PGPEGGKVSGGDLRPP--IS 706

Query:   311 RGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG-PGYDAQKGS-NYDAQRG 368
              G G     GP   A   P      G    P  GPG    KG PG     G+   D  RG
Sbjct:   707 AGAGAAGPPGPPGSAG-TPGLQGMPGERGGPG-GPGPKGDKGEPGSAGADGAPGKDGPRG 764

Query:   369 PNYDIHR-GPSYDP-QRGLG--------YDMQRGPNYDMQRGPGYETQRVPGYDVQRG-P 417
             P   I   GP+  P  +G G           + GP    + GP       PG   Q G P
Sbjct:   765 PTGPIGPPGPAGQPGDKGEGGAPGLPGIAGPRGGPGERGEHGPPGPAG-FPGAPGQNGEP 823

Query:   418 VYEAQR-APSYIPQRGP-GYDLQRGQGYDMRRAPSYDPSRGT-GFDGAPRGAA--PHGQ- 471
               + +R AP    + GP G     G G      P     +G  G  G P GAA  P G+ 
Sbjct:   824 GAKGERGAPGEKGEGGPPGVAGPPG-GAGPAGPPGPQGVKGERGSPGGP-GAAGFPGGRG 881

Query:   472 VP-PPLNNV---PYGSATPPARSGSGQPRGGNPA 501
             +P PP NN    P GS+  P + G   P G N A
Sbjct:   882 LPGPPGNNGNPGPPGSSGAPGKDGPPGPPGNNGA 915

 Score = 132 (51.5 bits), Expect = 8.9e-05, P = 8.9e-05
 Identities = 83/280 (29%), Positives = 101/280 (36%)

Query:   236 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP----NTS 290
             A G  GG  G +       P G + +    G P   GPP     AG  G  GP      S
Sbjct:   165 AGGGIGGYPGPAGPPGPPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPAGPPGPPGAMGPS 224

Query:   291 TSAYAATQSGTPMRAAY-DIPRGPGYEASKG-PGYDASKAP-SYDPTKGPSYDPAKGPGY 347
               A    +SG P R     +P  PG +   G PG+   K    +D   G   D    PG 
Sbjct:   225 GPAGKDGESGRPGRPGERGLPGPPGMKGPAGMPGFPGMKGHRGFDGRNGEKGDTG-APGL 283

Query:   348 DPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGP-GYE 404
                 G PG +   G      RG   +  R P      G  G D  RG   D Q GP G  
Sbjct:   284 KGENGLPGENGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQPGPPGPP 338

Query:   405 -TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 463
              T   PG    +G V  A    S      PG   QRG+      A +  P    G +G+P
Sbjct:   339 GTAGFPGSPGAKGEVGPAGSPGS---NGSPG---QRGEPGPQGHAGAPGPPGPPGSNGSP 392

Query:   464 RGAAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RGG 498
              G    G  P  +   P   G+  PP   G+ G P  RGG
Sbjct:   393 GGKGEMG--PAGIPGAPGLIGARGPPGPPGTNGAPGQRGG 430

 Score = 123 (48.4 bits), Expect = 0.00084, P = 0.00084
 Identities = 78/261 (29%), Positives = 98/261 (37%)

Query:   257 GQNAYEDGYGVPQGHGPPPSATTAGVVGA----GPNTSTSAYAAT-QSGTPMRAAY-DIP 310
             G +      G P   GPP +A   G  GA    GP  S  +  +  Q G P    +   P
Sbjct:   321 GNDGARGSDGQPGPPGPPGTAGFPGSPGAKGEVGPAGSPGSNGSPGQRGEPGPQGHAGAP 380

Query:   311 RGPGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDP-TKG-PGYDAQKGS-NYDAQ 366
               PG   S G PG      P+  P   P    A+GP   P T G PG     G    +  
Sbjct:   381 GPPGPPGSNGSPGGKGEMGPAGIPG-APGLIGARGPPGPPGTNGAPGQRGGAGEPGKNGA 439

Query:   367 RG-PNYDIHRGPSYDPQ-RG-LGYDMQRG-PNYDMQRG-PGYETQR-VPGYDVQRGPV-- 418
             +G P     RG +  P   G  G D + G P      G PG   +R  PG+   RGP   
Sbjct:   440 KGEPGPRGERGEAGSPGIPGPKGEDGKDGSPGEPGANGLPGAAGERGAPGF---RGPAGA 496

Query:   419 --YEAQRAPSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPP 475
                  ++ P+   + GPG    RG  G   R      P    G  G+P G    G+  PP
Sbjct:   497 NGLPGEKGPAG-ERGGPGPAGPRGAPGEPGRDGVPGGPGM-RGMPGSPGGPGSDGKPGPP 554

Query:   476 LNNVPYGSATPPARSGS-GQP 495
              +    G   PP  SG  GQP
Sbjct:   555 GSQGESGRPGPPGPSGPRGQP 575


>UNIPROTKB|F1N2Y2
            symbol:COL5A2 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0071230 "cellular response to amino acid stimulus"
            evidence=IEA] [GO:0048592 "eye morphogenesis" evidence=IEA]
            [GO:0046332 "SMAD binding" evidence=IEA] [GO:0043588 "skin
            development" evidence=IEA] [GO:0030199 "collagen fibril
            organization" evidence=IEA] [GO:0005588 "collagen type V"
            evidence=IEA] [GO:0001501 "skeletal system development"
            evidence=IEA] [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] InterPro:IPR000885 InterPro:IPR001007
            Pfam:PF00093 Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208
            PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038 SMART:SM00214
            GO:GO:0043588 GO:GO:0030199 GO:GO:0001501 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0071230 GO:GO:0005201 GO:GO:0048592
            GeneTree:ENSGT00660000095287 GO:GO:0005588 OMA:PDHKPVW
            EMBL:DAAA02003915 EMBL:DAAA02003916 EMBL:DAAA02003917
            EMBL:DAAA02003918 IPI:IPI00826022 Ensembl:ENSBTAT00000038684
            Uniprot:F1N2Y2
        Length = 1491

 Score = 135 (52.6 bits), Expect = 4.3e-05, P = 4.3e-05
 Identities = 88/293 (30%), Positives = 110/293 (37%)

Query:   233 DRRADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNT 289
             ++ A+G+ G  GA G         P G    E G   P+G  GPP S    G  G    T
Sbjct:   785 EKGAEGTAGNDGARGLPGPLGPPGPSGPTG-EKGEPGPRGLVGPPGSRGNPGSRGENGPT 843

Query:   290 STSAYAATQS--GTP-MRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGP 345
                 +A  Q   G P ++     P   G   S GP G   S  P + P   P     +G 
Sbjct:   844 GAVGFAGPQGPDGQPGVKGEPGEPGQKGDAGSPGPQGLAGSPGP-HGPNGVPGLKGGRGT 902

Query:   346 GYDP--TKGPGYDAQKGSNYDAQR-GPNYDIHRGPSYDPQRGLGYDM-------QRGPNY 395
                P  T  PG   + G    A   GP   +   P  +   GL  D         RGP  
Sbjct:   903 QGPPGATGFPGSAGRVGPPGPAGAPGPAGPLGE-PGKEGPPGLRGDPGSHGRVGDRGPA- 960

Query:   396 DMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYD 452
                 GPG +    PG D Q GP  +    P+    QRG  G   QRG+ G      P+  
Sbjct:   961 GPPGGPGDKGD--PGEDGQPGP--DGPPGPAGTTGQRGIVGMPGQRGERGMPGLPGPAGT 1016

Query:   453 PSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP-RGGNPARR 503
             P +  G  GAP    P G V PP +N P G   P   +G+ G P R G    R
Sbjct:  1017 PGK-VGPTGAPGDKGPPGPVGPPGSNGPVGEPGPEGPAGNDGTPGRDGAVGER 1068


>UNIPROTKB|F1PG08
            symbol:COL5A2 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 GeneTree:ENSGT00660000095287
            EMBL:AAEX03017882 EMBL:AAEX03017883 EMBL:AAEX03017884
            Ensembl:ENSCAFT00000023545 OMA:ETCNGLD Uniprot:F1PG08
        Length = 1499

 Score = 135 (52.6 bits), Expect = 4.3e-05, P = 4.3e-05
 Identities = 87/293 (29%), Positives = 109/293 (37%)

Query:   233 DRRADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNT 289
             ++ A+G+ G  GA G         P G    E G   P+G  GPP S    G  G    T
Sbjct:   784 EKGAEGTAGNDGARGLPGPLGPPGPAGPTG-EKGEPGPRGLVGPPGSRGNPGSRGENGPT 842

Query:   290 STSAYAATQS--GTP-MRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGP 345
                 +A  Q   G P ++     P   G   S GP G   S  P + P   P     +G 
Sbjct:   843 GAVGFAGPQGPDGQPGVKGEPGEPGQKGDAGSPGPQGLAGSPGP-HGPNGVPGLKGGRGT 901

Query:   346 GYDP--TKGPGYDAQKGS-NYDAQRGPNYDIHRGPSYDPQRGLGYDM-------QRGPNY 395
                P  T  PG   + G        GP   +   P  +   GL  D         RGP  
Sbjct:   902 QGPPGATGFPGSAGRVGPPGPPGAPGPAGPLGE-PGKEGPPGLRGDPGSHGRVGDRGPA- 959

Query:   396 DMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYD 452
                 GPG +    PG D Q GP  +    P+    QRG  G   QRG+ G      P+  
Sbjct:   960 GPPGGPGDKGD--PGEDGQPGP--DGPPGPAGTTGQRGIVGMPGQRGERGMPGLPGPAGT 1015

Query:   453 PSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP-RGGNPARR 503
             P +  G  GAP    P G V PP +N P G   P   +G+ G P R G    R
Sbjct:  1016 PGK-VGPTGAPGDKGPPGPVGPPGSNGPVGEPGPEGPAGNDGTPGRDGAVGER 1067


>UNIPROTKB|P08125
            symbol:COL10A1 "Collagen alpha-1(X) chain" species:9031
            "Gallus gallus" [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR008983 HOGENOM:HOG000085653 HOVERGEN:HBG108220
            GO:GO:0005581 Gene3D:2.60.120.40 InterPro:IPR001073
            InterPro:IPR008160 Pfam:PF00386 Pfam:PF01391 PRINTS:PR00007
            SMART:SM00110 SUPFAM:SSF49842 PROSITE:PS50871 eggNOG:NOG114228
            OrthoDB:EOG4FFD29 EMBL:M13496 EMBL:J04194 IPI:IPI00600819
            PIR:S23297 ProteinModelPortal:P08125 SMR:P08125 STRING:P08125
            InParanoid:P08125 Reactome:REACT_132934 PMAP-CutDB:P08125
            Uniprot:P08125
        Length = 674

 Score = 131 (51.2 bits), Expect = 4.4e-05, P = 4.4e-05
 Identities = 91/293 (31%), Positives = 116/293 (39%)

Query:   235 RADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTSA 293
             + D    GA G +       P G+   E G G P   GPP  A   G  G  GP      
Sbjct:   227 KGDRGLPGARGEAGIPGPQGPPGEPG-EVGIGKPGPMGPPGPAGIPGAKGLPGP------ 279

Query:   294 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAK-GP-GYDPT 350
               A   G+P    +  P  PG +  +GP G      P  D  +GP+  P + GP G    
Sbjct:   280 --AGLPGSPGLPGFGKPGLPGMKGHRGPEGPPGFPGPKGD--QGPAGVPGELGPAGPQGN 335

Query:   351 KGP-GYDAQKGSNYDAQRGPNYDIHR-GPSYDP----QRGL-GYDMQRGPNYDMQRG-PG 402
              GP G     G N     GP  D+   GP+  P    +RGL G D +  P Y  ++G PG
Sbjct:   336 MGPQGLKGLPGEN--GLPGPKGDMGPVGPAGFPGAKGERGLPGLDGK--PGYPGEQGLPG 391

Query:   403 YETQRVPGYDVQRGPVYEAQRA--PSYI-PQ--RG-PGYDLQRGQGYDMRRAPSYDPS-R 455
              +    PG   Q+G    A     P  + PQ  +G PG + + G      R PS  P  R
Sbjct:   392 PKGH--PGLPGQKGDTGHAGHPGLPGPVGPQGVKGVPGINGEPGP-----RGPSGIPGVR 444

Query:   456 GT----GFDGAP--RGAAPHGQVPPPLNNV------PYGSATPPARSG-SGQP 495
             G     G  GAP  +G A    +P P   V      P G   PP   G SG+P
Sbjct:   445 GPIGPPGMPGAPGAKGEAGAPGLPGPAGIVTKGLRGPMGPLGPPGPKGNSGEP 497


>ZFIN|ZDB-GENE-030131-5726
            symbol:eif3s10 "eukaryotic translation initiation
            factor 3, subunit 10 (theta)" species:7955 "Danio rerio"
            [GO:0001732 "formation of translation initiation complex"
            evidence=ISS] [GO:0005852 "eukaryotic translation initiation factor
            3 complex" evidence=ISS] [GO:0003743 "translation initiation factor
            activity" evidence=IEA;ISS] [GO:0006413 "translational initiation"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0006412
            "translation" evidence=IEA] InterPro:IPR000717 Pfam:PF01399
            SMART:SM00088 ZFIN:ZDB-GENE-030131-5726 GO:GO:0003743 GO:GO:0005852
            HOGENOM:HOG000246822 KO:K03254 HAMAP:MF_03000
            GeneTree:ENSGT00690000102108 EMBL:BC059196 EMBL:BC066670
            IPI:IPI00489212 RefSeq:NP_956114.2 UniGene:Dr.132282
            ProteinModelPortal:Q6PCR7 STRING:Q6PCR7 PRIDE:Q6PCR7
            Ensembl:ENSDART00000111462 GeneID:327515 KEGG:dre:327515 CTD:327515
            eggNOG:NOG123880 HOVERGEN:HBG006128 InParanoid:Q6PCR7
            NextBio:20810067 Bgee:Q6PCR7 GO:GO:0001732 Uniprot:Q6PCR7
        Length = 1267

 Score = 134 (52.2 bits), Expect = 4.6e-05, P = 4.6e-05
 Identities = 109/437 (24%), Positives = 175/437 (40%)

Query:    58 QHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKS-ERELQMRNLTEKIA 116
             + + + K A E QR+         EL   Q E +I + ++   K+ E + +M  + E   
Sbjct:   705 EEIPLIKKAYEEQRIKD------MELWELQEEERITNMKMEREKALEHKQRMSRMMEDKE 758

Query:   117 KMEAELKTAEPVKLEFQKSKTEAQNLVVAREE-LIAKVHQLTQDLQRA--HTDVQQIPAL 173
                +++K A     E +K K   + LV  R++ L  +  Q  +D ++A  H   ++   +
Sbjct:   759 NFLSKIKAARSFIYE-EKLKQFQERLVEERKKRLEERKKQRKEDRRKAFYHQKEEEAQRI 817

Query:   174 LSE-LESLRQEYHHCRGTY-EYEKKFYNDHLESLQVMEKNYITMATEVE---KLRAELMN 228
               E L+  R+E         E E++ Y + L  L+  E+       E+E   + + E   
Sbjct:   818 REEQLKKEREERERLEQEQREEEEREYQERLRKLEEQERKQRARQQEIEERERRKEEERR 877

Query:   229 APNVDRR---ADGSYGGATGNSENETSGR-PVGQNAY-EDGYGVPQGHGPPPSATTAGVV 283
             AP        A+    G     E E+  R PV    + ++G    +G   P         
Sbjct:   878 APEEKPNKEWAEREESGWRKRGEGESEWRRPVPDRDWRQEGR---EGREEPDREDRDLPF 934

Query:   284 GAGPNTSTSAYAATQSGTPMRAAYDIPRGP--GYEASKGP--GYDASKAP--SYDPTKGP 337
               G  ++    A+ + G  +R   D  RGP  G +  + P  G+D  +     +D  +G 
Sbjct:   935 RRGGESARRG-ASDEKG--LRRGCDDDRGPRRGGDDERPPRRGFDDDRGTRRGFDDDRGQ 991

Query:   338 SY-DPAKGP--GYDPTKGPG--YDAQKGSNY-DAQRGPN--YDIHRGPSYDPQRGLGYDM 389
                D  +GP  G D  +GP    D  +G    D  RGP   +D  RGP    +RG+  D 
Sbjct:   992 RRGDDDRGPRRGMDDDRGPRRPIDDDRGPRRSDDDRGPRRGFDDDRGP----RRGM--DE 1045

Query:   390 QRGPNY--DMQRGP--GYETQRVP--GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGY 443
              RGP    D   GP  G + +R    G D   GP       P + P   PG    R +  
Sbjct:  1046 PRGPRRGADDDWGPRRGGDDERGGRRGMD-DSGPRRGEDSRP-WKPLGRPGAGGWRER-- 1101

Query:   444 DMRRAPSYDPSRGTGFD 460
             +  R  S+ P R +G D
Sbjct:  1102 EKAREESWGPPRDSGHD 1118


>UNIPROTKB|G5EF87
            symbol:swsn-1 "SWI3-like protein" species:6239
            "Caenorhabditis elegans" [GO:0042802 "identical protein binding"
            evidence=IPI] [GO:0005515 "protein binding" evidence=IPI]
            InterPro:IPR001005 InterPro:IPR007526 InterPro:IPR009057
            Pfam:PF00249 Pfam:PF04433 PROSITE:PS50934 SMART:SM00717
            GO:GO:0005634 GO:GO:0009792 GO:GO:0002009 GO:GO:0040007
            GO:GO:0040010 GO:GO:0002119 GO:GO:0003677 GO:GO:0040018
            Gene3D:1.10.10.10 InterPro:IPR011991 GO:GO:0003682
            Gene3D:1.10.10.60 SUPFAM:SSF46689 GO:GO:0046662 GO:GO:0040035
            InterPro:IPR017884 PROSITE:PS51293 GO:GO:0040027 GO:GO:0035262
            EMBL:AL110477 KO:K11649 GeneTree:ENSGT00390000018166 EMBL:AF230279
            PIR:T26449 RefSeq:NP_001256906.1 UniGene:Cel.7072 SMR:G5EF87
            IntAct:G5EF87 EnsemblMetazoa:Y113G7B.23 GeneID:180324
            KEGG:cel:CELE_Y113G7B.23 CTD:180324 WormBase:Y113G7B.23a
            OMA:HFDELEQ NextBio:908892 Uniprot:G5EF87
        Length = 789

 Score = 131 (51.2 bits), Expect = 5.4e-05, P = 5.4e-05
 Identities = 71/248 (28%), Positives = 92/248 (37%)

Query:   266 GVPQGH---GPP---PSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASK 319
             G+P G    GPP   P    +    A P    ++ AAT +  P  +    P+ P  +A+ 
Sbjct:   551 GLPPGFEVTGPPQPTPQVQISAQEAAIPEKMDTSEAATAARPP--STPQAPQAPPVQAAP 608

Query:   320 GPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDI-HRGPS 378
              P   A +AP   P    +Y    GPG  P +   Y  Q+G  Y     P     H+   
Sbjct:   609 AP-VQAPQAPQAPPQ---AYQGYGGPGGPPQQAYRYPPQQGQQYSPYPPPQQQQQHQAQQ 664

Query:   379 YDPQRGLGYDMQ-RGPNYDMQRGPGYETQRVPG--YDVQRGPVYEAQRAPSYIPQRGPGY 435
                Q   G     +GP    Q    Y     PG  Y    G   + QR P Y  Q  PG 
Sbjct:   665 AQSQAHYGPPGGGQGPPPPPQGQQYYGGPPPPGQPYGPPGGYPPQQQRPP-YQAQPYPGP 723

Query:   436 ---DLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS 492
                  QRG GY     P   P       G P    P+GQ+PPP    P+G   P  + G 
Sbjct:   724 PPPQQQRGYGYP----PPPQP-------GHPY-QQPYGQMPPP----PHGQYQPQQQQGG 767

Query:   493 GQ-PRGGN 499
                P GG+
Sbjct:   768 PMGPPGGH 775


>MGI|MGI:1925567
            symbol:Ccdc88b "coiled-coil domain containing 88B"
            species:10090 "Mus musculus" [GO:0000226 "microtubule cytoskeleton
            organization" evidence=IEA] [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0008017 "microtubule
            binding" evidence=IEA] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR008636 Pfam:PF05622 MGI:MGI:1925567
            GO:GO:0005737 GO:GO:0000226 CTD:283234 eggNOG:NOG287357
            HOVERGEN:HBG104809 OMA:EGLEVQE OrthoDB:EOG4NS39S EMBL:AC120557
            EMBL:BC076600 EMBL:BC151001 EMBL:BC151009 IPI:IPI00608004
            IPI:IPI00874526 RefSeq:NP_001074760.1 UniGene:Mm.329596 HSSP:Q09013
            ProteinModelPortal:Q4QRL3 SMR:Q4QRL3 PhosphoSite:Q4QRL3
            PaxDb:Q4QRL3 PRIDE:Q4QRL3 Ensembl:ENSMUST00000113440 GeneID:78317
            KEGG:mmu:78317 UCSC:uc008gjb.1 GeneTree:ENSGT00690000101702
            HOGENOM:HOG000060297 InParanoid:B2RX63 NextBio:348677 Bgee:Q4QRL3
            CleanEx:MM_CCDC88B Genevestigator:Q4QRL3 Uniprot:Q4QRL3
        Length = 1481

 Score = 134 (52.2 bits), Expect = 5.5e-05, P = 5.5e-05
 Identities = 51/189 (26%), Positives = 92/189 (48%)

Query:    51 MEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQMRN 110
             +E ++ S     Q+L  ++QR       L+ E +  + + Q LH ++G ++ E     R 
Sbjct:  1009 LEGQLGSLQGRAQELLLQSQRAQEHSSRLQAEKSMMEMQGQELHRKLGVLEEEVRAARRA 1068

Query:   111 LTEKIAKMEAELKTAEP-VKLEFQKSKTEAQNLVVAREELIAKVHQLT---QDLQ----- 161
               E   + +A L+  E  V+L+ ++ +TE + L+V   +L A +  L    ++LQ     
Sbjct:  1069 QEETRGQQQALLRDHEALVQLQ-RRQETELEGLLVRHRDLKANMRALELAHRELQGRHEQ 1127

Query:   162 ----RAHTDVQQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMAT 217
                 RA+ + Q++ ALL+E E L Q+ H  RG  E  ++  N+H E  Q++         
Sbjct:  1128 LQAQRANVEAQEV-ALLAERERLMQDGHRQRGLEEELRRLQNEH-ERAQMLLAEVSRERG 1185

Query:   218 EVEKLRAEL 226
             E++  R EL
Sbjct:  1186 ELQGERGEL 1194


>WB|WBGene00000677
            symbol:col-103 species:6239 "Caenorhabditis elegans"
            [GO:0042302 "structural constituent of cuticle" evidence=IEA]
            [GO:0016021 "integral to membrane" evidence=IEA] [GO:0040011
            "locomotion" evidence=IMP] InterPro:IPR002486 Pfam:PF01484
            SMART:SM01088 GO:GO:0040011 GeneTree:ENSGT00690000102663
            GO:GO:0042302 HOGENOM:HOG000085656 EMBL:FO081484 PIR:E88633
            RefSeq:NP_499982.1 ProteinModelPortal:O45114 STRING:O45114
            EnsemblMetazoa:F56B3.1 GeneID:176901 KEGG:cel:CELE_F56B3.1
            UCSC:F56B3.1 CTD:176901 WormBase:F56B3.1 eggNOG:NOG301529
            InParanoid:O45114 OMA:SNTCPPG NextBio:894512 Uniprot:O45114
        Length = 371

 Score = 126 (49.4 bits), Expect = 6.2e-05, P = 6.2e-05
 Identities = 87/287 (30%), Positives = 103/287 (35%)

Query:   229 APNVDRRA------DGSYGGATGNSE-NETSGRPVGQNA---YEDGYGVPQGHGPPPSAT 278
             APN ++R        G YGG  G +      G  VG      Y  G+G   GHG      
Sbjct:    63 APNREKRGYAQYGGGGGYGGGHGGAAVGGGYGGAVGGGGGGGYGGGHG--GGHGGAVGGG 120

Query:   279 TAGVVGAGPNTSTSAYAAT----QSGTPMRAAYD-IPRGPGYEASKGPGYDASKAPSYDP 333
               G  G G     S  + T      G P +A  D +P  PG   S G     S   S   
Sbjct:   121 YGGGGGGGGGCQCSPSSNTCPPGPRGPPGQAGLDGLPGAPGQPGSNGGA--GSNGASEGS 178

Query:   334 TKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGP 393
               G    PA  PG  P  GP   A +  N D Q G        PS+    G+G     GP
Sbjct:   179 AGGCKTCPAGPPG--PP-GPAGQAGRPGN-DGQPG-------APSFGG--GVGAPGAPGP 225

Query:   394 NYDM-QRG-PGYETQRV-PGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS 450
               D    G PG   Q   PG + Q G        P+  P   PG +   G GY +   P 
Sbjct:   226 AGDAGSPGQPGAPGQPGRPGKNAQGGSSRPGPPGPAG-PPGPPGNNGAPGGGYGV--GPP 282

Query:   451 YDPSRGTGFDGAPRGAAPHGQVPPPLNN-VPYGSAT--P-PARSGSG 493
               P   +G  GAP    P GQ   P N+  P   A   P P R G G
Sbjct:   283 GPPGP-SGRPGAPGQPGPDGQPGAPGNDGTPGTDAAYCPCPGRGGGG 328


>RGD|628797
            symbol:Prpmp5 "proline-rich protein MP5" species:10116 "Rattus
            norvegicus" [GO:0005576 "extracellular region" evidence=IEA]
            RGD:628797 GO:GO:0005576 InterPro:IPR026086 PANTHER:PTHR23203
            CTD:5542 KO:K13911 EMBL:L17318 EMBL:M11899 IPI:IPI00187926
            PIR:B48013 RefSeq:NP_742062.1 UniGene:Rn.29950 GeneID:257651
            KEGG:rno:257651 UCSC:RGD:628797 NextBio:624204
            Genevestigator:P10165 Uniprot:P10165
        Length = 295

 Score = 124 (48.7 bits), Expect = 6.4e-05, P = 6.4e-05
 Identities = 63/200 (31%), Positives = 77/200 (38%)

Query:   310 PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQR-- 367
             P   G +    PG  + + P   P  GP   P +GP   P  GP    Q GS        
Sbjct:   101 PPAAGPQRPPQPG--SPQGPP--PPGGPQQRPPQGP--PPQGGPQRPPQPGSPQGPPPPG 154

Query:   368 GPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP---GYETQRVP-GYDVQRGPVYEAQR 423
             GP     +GP   PQ G     QR P     +GP   G   QR P G   Q GP    QR
Sbjct:   155 GPQQRPPQGPP--PQGG----PQRPPQPGSPQGPPPPGGPQQRAPQGPPPQGGP----QR 204

Query:   424 APSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPP--PLNNVPY 481
              P     +GP        G   +R P   P +G G    P+  +P G  PP  P    P 
Sbjct:   205 PPQPGSPQGPP-----PPGGPQQRPPQGPPPQG-GPQRPPQPGSPQGPPPPGGPQQRPPQ 258

Query:   482 GSATPPARSGSGQP-RGGNP 500
             G   PP + G  +P + GNP
Sbjct:   259 G---PPPQGGPQRPPQPGNP 275


>ZFIN|ZDB-GENE-030131-8373
            symbol:col10a1 "collagen, type X, alpha 1"
            species:7955 "Danio rerio" [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR008983 ZFIN:ZDB-GENE-030131-8373 GO:GO:0005581
            Gene3D:2.60.120.40 InterPro:IPR001073 InterPro:IPR008160
            Pfam:PF00386 Pfam:PF01391 PRINTS:PR00007 SMART:SM00110
            SUPFAM:SSF49842 PROSITE:PS50871 GeneTree:ENSGT00700000104270
            OMA:KPGHGSP EMBL:CU306817 IPI:IPI00491103
            Ensembl:ENSDART00000091021 ArrayExpress:F1QXD5 Bgee:F1QXD5
            Uniprot:F1QXD5
        Length = 655

 Score = 129 (50.5 bits), Expect = 7.0e-05, P = 7.0e-05
 Identities = 81/269 (30%), Positives = 107/269 (39%)

Query:   255 PVGQNAYEDGYGVPQGHGPP----PSATTA-GVVGA--GPNTSTSAYAATQSGTPMRAAY 307
             P G  A +DG G+P   GPP    P+  +A G  G+  GP    +  A    G       
Sbjct:    64 PPGP-AGQDGEGLPGPQGPPGAPGPAGYSAPGKPGSPGGPGKPGATGAPGLKGDTGAPGL 122

Query:   308 DIPRG-PGYEASKGP-GYDASKAPSYDPTKGPSYDP-AKGP-GYDPTKG----PGYDAQK 359
               PRG PG   S GP G  A+  P      GP+  P A GP G    KG    PG   QK
Sbjct:   123 QGPRGMPGPSGSPGPAGISATGKP------GPAGLPGAMGPRGEQGFKGHPGIPGLPGQK 176

Query:   360 GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG-PGYETQR-VPGYDVQRGP 417
             G      +GP  +  RGP+  P    G     G     + G PG   +   PG D + GP
Sbjct:   177 GEMGVGVQGPAGE--RGPT-GPVGPSGKPGAPGVGLPGKPGAPGEAGKSGSPGRDGESGP 233

Query:   418 VY-EAQRAPSYIPQRG-PGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPP 474
             +  + Q+  +  P  G PG   + G  G      P   P   +G  GAP G   +G+  P
Sbjct:   234 MGPQGQKGQTGAPGVGIPGKPGENGAPGMPGPTGPK-GPQGASGAPGAP-GVPGYGK--P 289

Query:   475 PLNNVPYGSATPPARSGSGQPRGGNPARR 503
               N +      P +   +GQ   G P  +
Sbjct:   290 GENGLKGDRGVPGSPGTTGQK--GEPGAK 316


>UNIPROTKB|Q04118
            symbol:PRB3 "Basic salivary proline-rich protein 3"
            species:9606 "Homo sapiens" [GO:0005576 "extracellular region"
            evidence=NAS] [GO:0051636 "Gram-negative bacterial cell surface
            binding" evidence=NAS] [GO:0008150 "biological_process"
            evidence=ND] GO:GO:0005576 GO:GO:0051636 InterPro:IPR026086
            PANTHER:PTHR23203 EMBL:X07637 EMBL:X07881 EMBL:BC096209
            EMBL:BC096210 EMBL:BC096211 IPI:IPI00006699 PIR:A36298 PIR:B36298
            PIR:S10889 RefSeq:NP_006240.4 UniGene:Hs.73031 STRING:Q04118
            DMDM:229462763 PaxDb:Q04118 PRIDE:Q04118 Ensembl:ENST00000381842
            GeneID:5544 KEGG:hsa:5544 CTD:5544 GeneCards:GC12M011418
            H-InvDB:HIX0201930 HGNC:HGNC:9339 MIM:168840 neXtProt:NX_Q04118
            PharmGKB:PA33701 HOGENOM:HOG000060075 GenomeRNAi:5544 NextBio:21478
            ArrayExpress:Q04118 Bgee:Q04118 CleanEx:HS_PRB3
            Genevestigator:Q04118 GermOnline:ENSG00000197870 Uniprot:Q04118
        Length = 309

 Score = 124 (48.7 bits), Expect = 7.1e-05, P = 7.1e-05
 Identities = 79/271 (29%), Positives = 99/271 (36%)

Query:   247 SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAA 306
             S +  SG+P G+     G   PQ   PPP     G    G N S         G P R  
Sbjct:    28 SPSVISGKPEGRRP--QGGNQPQ-RTPPPPGKPEGRPPQGGNQS--------QGPPPRPG 76

Query:   307 YDIPRGP---GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNY 363
                P GP   G   S+GP     K P   P +G +   ++GP   P K  G   Q G N 
Sbjct:    77 K--PEGPPPQGGNQSQGPPPRPGK-PEGQPPQGGNQ--SQGPPPRPGKPEGPPPQ-GGNQ 130

Query:   364 DAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP----GYETQRVPGYDVQ-RGPV 418
                  P      GP   P +G        P+     GP    G ++Q  P    +  GP 
Sbjct:   131 SQGPPPRPGKPEGP---PPQGGNQSQGPPPHPGKPEGPPPQGGNQSQGPPPRPGKPEGPP 187

Query:   419 YEAQRAPSYIPQRGPGY-DLQRGQGYDMRRAPSYDPSR--GTGFDGA--PRGAAPH-G-- 470
              +        P R PG  +    QG +  + P   P +  G+   G   P+G  PH G  
Sbjct:   188 PQGGNQSQGPPPR-PGKPEGPPPQGGNQSQGPPPRPGKPEGSPSQGGNKPQGPPPHPGKP 246

Query:   471 QVPPPLN-NVPYGSATPPARSGSGQPRGGNP 500
             Q PPP   N P     PP R     P GGNP
Sbjct:   247 QGPPPQEGNKPQ-RPPPPGRPQGPPPPGGNP 276


>TAIR|locus:2204400
            symbol:AT1G76010 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005576
            "extracellular region" evidence=ISM] [GO:0008150
            "biological_process" evidence=ND] [GO:0005829 "cytosol"
            evidence=IDA] InterPro:IPR002775 Pfam:PF01918 EMBL:CP002684
            GO:GO:0005829 GO:GO:0003676 EMBL:AF412102 EMBL:AY054208
            EMBL:AF428441 EMBL:AY124847 IPI:IPI00531013 RefSeq:NP_565124.1
            UniGene:At.24580 UniGene:At.67776 UniGene:At.75066 HSSP:P60849
            ProteinModelPortal:Q93VA8 SMR:Q93VA8 STRING:Q93VA8 PRIDE:Q93VA8
            EnsemblPlants:AT1G76010.1 GeneID:843932 KEGG:ath:AT1G76010
            TAIR:At1g76010 HOGENOM:HOG000240806 InParanoid:Q93VA8 OMA:YDGPPQG
            PhylomeDB:Q93VA8 ProtClustDB:CLSN2917456 Genevestigator:Q93VA8
            Uniprot:Q93VA8
        Length = 350

 Score = 125 (49.1 bits), Expect = 7.1e-05, P = 7.1e-05
 Identities = 70/207 (33%), Positives = 88/207 (42%)

Query:   254 RPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQ---SGTPMRAAYDIP 310
             +P+G   YE   G P G G           G G     +AY   +    G     +Y   
Sbjct:   134 KPMGDIDYEGREGSPGGRGRGRGRGRGR--GRGRGGRGNAYVNVEHEDGGWEREQSYGRG 191

Query:   311 RGPGY-EASKGPGYDASKAP--SYDPTK--GPSYD-PAKGPGYDPTKGPGYDA--QKGSN 362
             RG G   +S+G G      P   YD  +  G  YD P +  GYD  +G GYDA  Q    
Sbjct:   192 RGRGRGRSSRGRGRGGYNGPPNEYDAPQDGGYGYDAPHEHRGYDD-RG-GYDAPPQGRGG 249

Query:   363 YDAQRGPN-YDIHRGP-SYD--PQ-RGLGYDMQRGPNYDMQRGPGYE--TQRVPGYDVQR 415
             YD  +G   YD  +G   YD  PQ RG GYD   GP+    RG GY+  +Q   GYD   
Sbjct:   250 YDGPQGRGGYDGPQGRRGYDGPPQGRG-GYD---GPSQG--RG-GYDGPSQGRGGYD--- 299

Query:   416 GPVYEAQRAPSYIPQRGPGYDLQRGQG 442
             GP   +Q    Y   +G G    RG+G
Sbjct:   300 GP---SQGRGGYDGPQGRGRGRGRGRG 323


>UNIPROTKB|F1RZK4
            symbol:COL10A1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005938 "cell cortex" evidence=IEA] [GO:0005581
            "collagen" evidence=IEA] InterPro:IPR008983 GO:GO:0005938
            GO:GO:0005581 Gene3D:2.60.120.40 InterPro:IPR001073
            InterPro:IPR008160 Pfam:PF00386 Pfam:PF01391 PRINTS:PR00007
            SMART:SM00110 SUPFAM:SSF49842 PROSITE:PS50871
            GeneTree:ENSGT00700000104270 OMA:IKGPPPN EMBL:CU062641
            Ensembl:ENSSSCT00000004901 Uniprot:F1RZK4
        Length = 675

 Score = 129 (50.5 bits), Expect = 7.3e-05, P = 7.3e-05
 Identities = 88/296 (29%), Positives = 113/296 (38%)

Query:   233 DRRADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG--AGPN 288
             ++ A G  G  G  G +     GRP G        G P G   PP     G  G    P 
Sbjct:   176 EKGAPGVPGINGQKGETGYGAPGRP-GDRGLPGPQG-PMGPPGPPGVGKRGENGFPGQPG 233

Query:   289 TSTSAYAATQSGTPMRAAYDIPRGP-GYEASKG---PGYD-ASKAPSYDPTKG----PSY 339
                      +SG P  A    P+GP G +  +G   PG   A+  P    TKG    P  
Sbjct:   234 IKGDRGFPGESG-P--AGPPGPQGPPGEQGREGIGKPGAPGAAGQPGLPGTKGHPGAPGM 290

Query:   340 -DPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDP-QRGL-GYDMQRGPNYD 396
               P   PG+     PG   Q+G        P     +GP+  P + GL G    RGP   
Sbjct:   291 AGPPGAPGFGKPGLPGLKGQRGP-IGLPGAPGAKGEQGPAGHPGEPGLTGPPGSRGP--- 346

Query:   397 MQRGPGYETQRVPGYDVQRGPVYEAQRA-PSYIP----QRGP-GYDLQRGQ-GYDMRRAP 449
               +GP    + +PG +   GP  E   A P+  P    +RGP G D + G  G      P
Sbjct:   347 --QGP----KGIPGNNGVPGPKGEIGLAGPAGFPGAKGERGPSGLDGKPGYPGEPGLNGP 400

Query:   450 SYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNPARR 503
               +P    G  G P    P G +P P+   P G+   P  +G G PRG  G P  R
Sbjct:   401 KGNPGL-PGPKGDPGIGGPPG-LPGPVG--PAGAKGVPGHNGEGGPRGAPGIPGTR 452


>UNIPROTKB|H7BZW9
            symbol:SLMAP "Sarcolemmal membrane-associated protein"
            species:9606 "Homo sapiens" [GO:0006457 "protein folding"
            evidence=IEA] [GO:0016272 "prefoldin complex" evidence=IEA]
            [GO:0051082 "unfolded protein binding" evidence=IEA]
            InterPro:IPR002777 Pfam:PF01920 GO:GO:0006457 GO:GO:0016272
            EMBL:AC114480 HGNC:HGNC:16643 ChiTaRS:SLMAP EMBL:AC099777
            ProteinModelPortal:H7BZW9 PRIDE:H7BZW9 Ensembl:ENST00000438794
            Uniprot:H7BZW9
        Length = 316

 Score = 124 (48.7 bits), Expect = 7.4e-05, P = 7.4e-05
 Identities = 40/165 (24%), Positives = 82/165 (49%)

Query:    49 EVMEQKIASQHVEMQKLATE-NQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQ 107
             +V++ ++   H++ + L  E +  + +T    R EL +A+ E+ +LH     + SER+  
Sbjct:    84 QVLQAQLQRLHIDTENLREEKDSEITST----RDELLSARDEILLLHQAAAKVASERDTD 139

Query:   108 MRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDV 167
             + +L E++ K+ AEL+       E++K  T  QN    R +      Q  ++  R   ++
Sbjct:   140 IASLQEELKKVRAELERWRKAASEYEKEITSLQNSFQLRCQQCED--QQREEATRLQGEL 197

Query:   168 QQIP----ALLSELESLRQEYHHCRGTYE-YEKKFYNDHLESLQV 207
             +++     AL +E  SL++E        +  EK+ +N   +SL++
Sbjct:   198 EKLRKEWNALETECHSLKRENVLLSSELQRQEKELHNSQKQSLEL 242


>UNIPROTKB|B7Z863
            symbol:SLMAP "cDNA FLJ54742, highly similar to Mus musculus
            sarcolemma associated protein (Slmap), mRNA" species:9606 "Homo
            sapiens" [GO:0006457 "protein folding" evidence=IEA] [GO:0016272
            "prefoldin complex" evidence=IEA] [GO:0051082 "unfolded protein
            binding" evidence=IEA] InterPro:IPR002777 Pfam:PF01920
            GO:GO:0006457 GO:GO:0016272 EMBL:AC114480 UniGene:Hs.476432
            HGNC:HGNC:16643 ChiTaRS:SLMAP HOGENOM:HOG000072709 EMBL:AC099777
            EMBL:AK302934 IPI:IPI00945565 STRING:B7Z863 Ensembl:ENST00000494088
            UCSC:uc011bfc.1 HOVERGEN:HBG087998 Uniprot:B7Z863
        Length = 318

 Score = 124 (48.7 bits), Expect = 7.5e-05, P = 7.5e-05
 Identities = 40/165 (24%), Positives = 82/165 (49%)

Query:    49 EVMEQKIASQHVEMQKLATE-NQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQ 107
             +V++ ++   H++ + L  E +  + +T    R EL +A+ E+ +LH     + SER+  
Sbjct:    39 QVLQAQLQRLHIDTENLREEKDSEITST----RDELLSARDEILLLHQAAAKVASERDTD 94

Query:   108 MRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDV 167
             + +L E++ K+ AEL+       E++K  T  QN    R +      Q  ++  R   ++
Sbjct:    95 IASLQEELKKVRAELERWRKAASEYEKEITSLQNSFQLRCQQCED--QQREEATRLQGEL 152

Query:   168 QQIP----ALLSELESLRQEYHHCRGTYE-YEKKFYNDHLESLQV 207
             +++     AL +E  SL++E        +  EK+ +N   +SL++
Sbjct:   153 EKLRKEWNALETECHSLKRENVLLSSELQRQEKELHNSQKQSLEL 197


>ZFIN|ZDB-GENE-030131-2281
            symbol:col4a5 "collagen, type IV, alpha 5 (Alport
            syndrome)" species:7955 "Danio rerio" [GO:0005201 "extracellular
            matrix structural constituent" evidence=IEA] [GO:0005581 "collagen"
            evidence=IEA] [GO:0031290 "retinal ganglion cell axon guidance"
            evidence=IMP] [GO:0007412 "axon target recognition" evidence=IMP]
            [GO:0030198 "extracellular matrix organization" evidence=IMP]
            InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
            ZFIN:ZDB-GENE-030131-2281 InterPro:IPR016187 SUPFAM:SSF56436
            GO:GO:0030198 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            eggNOG:NOG12793 GO:GO:0007412 GO:GO:0031290 GO:GO:0005201
            HOVERGEN:HBG004933 HOGENOM:HOG000085652 OrthoDB:EOG45DWPF
            Gene3D:2.170.240.10 GeneTree:ENSGT00690000101772 KO:K06237 CTD:1287
            OMA:MPMNMEP EMBL:CR354588 EMBL:CR936978 IPI:IPI00835382
            RefSeq:NP_001116702.1 UniGene:Dr.77841 SMR:B0UXF7
            Ensembl:ENSDART00000073827 GeneID:323561 KEGG:dre:323561
            NextBio:20808319 Uniprot:B0UXF7
        Length = 1659

 Score = 133 (51.9 bits), Expect = 8.0e-05, P = 8.0e-05
 Identities = 83/294 (28%), Positives = 100/294 (34%)

Query:   227 MNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPP--PSATTAGVVG 284
             M  P V  R      G  G+        P GQ  +    G+P   G P  P     G  G
Sbjct:   652 MTVPEVGERGPPGQDGDPGSQGRPGDSGPPGQPGFP---GLPGSKGEPGLPGIGLPGPPG 708

Query:   285 AGPNTSTSAYAATQSGTPMRAAYD-IPRGPGYEASKG-PGYDASKAPSYDPTKGPSYDPA 342
             A       A +    G P R   D +P  PG   SKG PGY     P   PT  P     
Sbjct:   709 A-KGFPGIAGSPGGPGIPGRPGLDGLPGQPGLPGSKGDPGYGLPGPPG--PTGSPGIKGG 765

Query:   343 KGPGYDPTKGPGYDAQKGS-NYDAQRGPNYD--IHRGPS-YDPQRGLGYDMQRGPNYDMQ 398
              GP  D +  PG   Q G    D   GP  D     GP    P     + +Q  P     
Sbjct:   766 PGPKGD-SGFPGSPGQPGRPGLDGAPGPKGDAGFPGGPGPRGPPGAPAFGLQGPPG--PP 822

Query:   399 RGPG-YETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG-QGYDMRRAPSYDPS-R 455
               PG   +  VPG + ++G      R P  +    PG+   RG  G      P   P   
Sbjct:   823 GAPGSIGSPGVPGANGEKG-----DRGPPGLST--PGFQGDRGISGLPGPPGPVGPPGVP 875

Query:   456 GT-GFDGAPRGAAPHGQV----PPPLNNVPYGSATP--PARSGS-GQP-RGGNP 500
             G  G DG P      G++    PP     P     P  P   G  G P + GNP
Sbjct:   876 GRPGQDGLPGLPGSKGEMGSMGPPGSKGNPGNPGAPGFPGPKGDDGVPGQSGNP 929

 Score = 126 (49.4 bits), Expect = 0.00046, P = 0.00046
 Identities = 82/275 (29%), Positives = 97/275 (35%)

Query:   242 GATGNSENETSGR-PVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-G-PNTSTSAYAATQ 298
             G  G +  E   R P GQ+      G P   GPP      G+ G+ G P           
Sbjct:   648 GEPGMTVPEVGERGPPGQDGDPGSQGRPGDSGPPGQPGFPGLPGSKGEPGLPGIGLPGPP 707

Query:   299 SGTPMRAAYDIPRGPGYEASKGPGYDASKA-PSYDPTKG-PSYDPAKGPGYDPTKGPGYD 356
                        P GPG      PG D     P    +KG P Y     PG  PT  PG  
Sbjct:   708 GAKGFPGIAGSPGGPGIPGR--PGLDGLPGQPGLPGSKGDPGYGLPGPPG--PTGSPGI- 762

Query:   357 AQKGSNYDAQRGPNYDI-HRGPSYDPQRGLGYDMQRGPNYD--MQRGPGYE-TQRVPGYD 412
               KG       GP  D    G    P R  G D   GP  D     GPG       P + 
Sbjct:   763 --KGGP-----GPKGDSGFPGSPGQPGRP-GLDGAPGPKGDAGFPGGPGPRGPPGAPAFG 814

Query:   413 VQRGPVYEAQRAPSYIPQRG-PGYDLQRG-QGYDMRRAPSYDPSRG-TGFDGAPRGAAPH 469
             +Q GP      AP  I   G PG + ++G +G      P +   RG +G  G P    P 
Sbjct:   815 LQ-GPP-GPPGAPGSIGSPGVPGANGEKGDRGPPGLSTPGFQGDRGISGLPGPPGPVGPP 872

Query:   470 GQVP--PPLNNVPYGSATPPARSGSGQPRG--GNP 500
             G VP  P  + +P G        GS  P G  GNP
Sbjct:   873 G-VPGRPGQDGLP-GLPGSKGEMGSMGPPGSKGNP 905


>UNIPROTKB|G3N3C9
            symbol:LDB3 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0030018 "Z disc" evidence=IEA] [GO:0008092
            "cytoskeletal protein binding" evidence=IEA] [GO:0005856
            "cytoskeleton" evidence=IEA] [GO:0005080 "protein kinase C binding"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            Pfam:PF00595 Pfam:PF00412 InterPro:IPR001478 InterPro:IPR001781
            PROSITE:PS00478 PROSITE:PS50023 PROSITE:PS50106 SMART:SM00132
            SMART:SM00228 GO:GO:0030018 GO:GO:0005856 GO:GO:0046872
            GO:GO:0008270 Gene3D:2.10.110.10 SUPFAM:SSF50156 OMA:CTSQATT
            InterPro:IPR006643 SMART:SM00735 GeneTree:ENSGT00700000104411
            EMBL:DAAA02062163 Ensembl:ENSBTAT00000065403 Uniprot:G3N3C9
        Length = 730

 Score = 129 (50.5 bits), Expect = 8.1e-05, P = 8.1e-05
 Identities = 54/206 (26%), Positives = 76/206 (36%)

Query:   225 ELMNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG 284
             E M  P+ +         +T +    TS  P   + Y +    P    P P   T   + 
Sbjct:   353 EYMQDPDEEALRRSRPQASTYSPAVATSPAPAA-HTYSEAPAAP---APKPRVVTTASIR 408

Query:   285 AGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKG 344
               P+      A+T S +P  A Y  P  P Y  S  P Y  S  P+Y P+  P+Y P+  
Sbjct:   409 --PSVYQPVPASTYSPSP-GANYS-PT-P-YTPSPAPAYTPSPTPAYTPSPAPTYSPSPA 462

Query:   345 PGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG-PGY 403
             P Y P+  P Y+    S   A+           S+  +   G          + RG P Y
Sbjct:   463 PAYTPSPAPSYNPTLYSGGPAESASRPPWVTDDSFSQKFAPGKTTTTVSKQSLPRGAPAY 522

Query:   404 ETQRVPGYDVQ---RGPVYEAQRAPS 426
              T   P   V    RG V  A+R P+
Sbjct:   523 -TPPPPAPQVSPLARGTVQRAERFPA 547


>UNIPROTKB|G8ENL4
            symbol:FUS "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0000166
            "nucleotide binding" evidence=IEA] InterPro:IPR000504
            InterPro:IPR001876 InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641
            PROSITE:PS01358 PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360
            SMART:SM00547 GO:GO:0005634 GO:GO:0005737 GO:GO:0000166
            GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
            GeneTree:ENSGT00530000063105 EMBL:CU464163 EMBL:JF940526
            Ensembl:ENSSSCT00000036326 Uniprot:G8ENL4
        Length = 517

 Score = 127 (49.8 bits), Expect = 8.2e-05, P = 8.2e-05
 Identities = 68/240 (28%), Positives = 93/240 (38%)

Query:   238 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAAT 297
             G+Y    G   ++ S +P GQ +Y  GYG          ++     G   NT   A +A 
Sbjct:    15 GAYPTQPGQGYSQQSNQPYGQQSYS-GYGQSADTSGYGQSSYGSSYGQTQNTGYGAQSAP 73

Query:   298 QSGTPMRAAYDIPRGP--GY-EASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPG 354
             Q G      Y   +G    Y + S  PGY    APS   T G     ++  GY   +  G
Sbjct:    74 Q-GYGSTGGYGSGQGSQSSYGQQSSYPGYGQQPAPS--STSGSYGTSSQSSGYGQPQSGG 130

Query:   355 YDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET--QRVPGYD 412
             Y  Q G  Y  Q+  +Y   +  SY+P +G G   Q   +     G G  +  Q  P   
Sbjct:   131 YGQQSG--YGGQQ-QSYGQQQ--SYNPPQGYGQQNQYNSSSGGGGGGGGGSYGQDQPSMS 185

Query:   413 VQRGPVYEAQ-RAPSYI--PQ----RGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 465
                G  Y  Q ++  Y    Q    RG G     G GY+ R +  Y+P RG G     RG
Sbjct:   186 GGGGGGYGNQDQSGGYGGGQQDRGGRGRGGGSGGGGGYN-RSSGGYEP-RGRGGGRGGRG 243


>UNIPROTKB|E2RA07
            symbol:EWSR1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0005622 "intracellular" evidence=IEA] [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0000166 "nucleotide binding"
            evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358
            PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360 SMART:SM00547
            GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
            GO:GO:0005622 GeneTree:ENSGT00530000063105 OMA:EGTSTGY
            EMBL:AAEX03014786 EMBL:AAEX03014787 Ensembl:ENSCAFT00000019384
            Uniprot:E2RA07
        Length = 671

 Score = 117 (46.2 bits), Expect = 8.7e-05, Sum P(2) = 8.7e-05
 Identities = 63/238 (26%), Positives = 87/238 (36%)

Query:   238 GSYGGATGNSENETSGRPV-GQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 290
             G+YG  T  S  +       GQ AY   YG P  G+  P  P A +  V G G    +T+
Sbjct:    43 GTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPAGYTTPTAPQAYSQPVQGYGTGAYDTT 102

Query:   291 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG-YD 348
             T+    TQ+    ++AY   P  P Y   + P   A   P        +  P    G Y+
Sbjct:   103 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPARPQDGNKPAETSQPQSSTGGYN 160

Query:   349 -PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 405
              P+ G G   Q   +Y    G  P   +   PSY P R   ++      Y   R   Y +
Sbjct:   161 QPSLGYG---QSNYSYPQVPGSYPMQPVTAPPSYPPTR---FNSSSLKLYHYSRS--YSS 212

Query:   406 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 463
              +   YD            PS   Q+   Y  Q    Y  +   SY P  G+ +  AP
Sbjct:   213 TQPTSYDQSSYSQQNTYGQPSSYGQQS-SYGQQ--SSYGQQPPTSYPPQTGS-YSQAP 266

 Score = 57 (25.1 bits), Expect = 8.7e-05, Sum P(2) = 8.7e-05
 Identities = 19/46 (41%), Positives = 21/46 (45%)

Query:   464 RGAAP--HGQ-VPPPLNNVPYGSATP--PA--RSGSGQPRGGNPAR 502
             RG  P   G+ +PPPL   P G   P  P     G G  RGG P R
Sbjct:   470 RGGMPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPR 515


>RGD|71029
            symbol:Col3a1 "collagen, type III, alpha 1" species:10116 "Rattus
           norvegicus" [GO:0001501 "skeletal system development" evidence=IEP]
           [GO:0001568 "blood vessel development" evidence=IEA;ISO] [GO:0005201
           "extracellular matrix structural constituent" evidence=IEA]
           [GO:0005581 "collagen" evidence=ISO] [GO:0005586 "collagen type III"
           evidence=ISO;TAS] [GO:0005615 "extracellular space" evidence=IEA]
           [GO:0007160 "cell-matrix adhesion" evidence=IEA] [GO:0007179
           "transforming growth factor beta receptor signaling pathway"
           evidence=IEA] [GO:0007229 "integrin-mediated signaling pathway"
           evidence=IEA] [GO:0007507 "heart development" evidence=IEA]
           [GO:0009314 "response to radiation" evidence=IEA] [GO:0018149
           "peptide cross-linking" evidence=IEA] [GO:0030199 "collagen fibril
           organization" evidence=IEA;ISO] [GO:0031012 "extracellular matrix"
           evidence=ISO] [GO:0032964 "collagen biosynthetic process"
           evidence=IEA] [GO:0034097 "response to cytokine stimulus"
           evidence=IEA] [GO:0042060 "wound healing" evidence=IEA] [GO:0043206
           "extracellular fibril organization" evidence=IEA] [GO:0043588 "skin
           development" evidence=IEA] [GO:0046332 "SMAD binding"
           evidence=IEA;ISO] [GO:0046872 "metal ion binding" evidence=IEA]
           [GO:0048407 "platelet-derived growth factor binding" evidence=IEA]
           [GO:0048565 "digestive tract development" evidence=IEA;ISO]
           [GO:0050777 "negative regulation of immune response" evidence=IEA]
           [GO:0071230 "cellular response to amino acid stimulus"
           evidence=IEA;ISO] InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093
           Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184
           PROSITE:PS51461 SMART:SM00038 SMART:SM00214 RGD:71029 GO:GO:0043588
           GO:GO:0005615 GO:GO:0007507 GO:GO:0046872 GO:GO:0034097
           GO:GO:0030199 GO:GO:0001501 GO:GO:0007179 GO:GO:0007229
           GO:GO:0007160 InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793
           GO:GO:0042060 GO:GO:0001568 GO:GO:0048565 GO:GO:0050777
           GO:GO:0009314 GO:GO:0018149 GO:GO:0032964 GO:GO:0071230
           GO:GO:0043206 GO:GO:0005201 GeneTree:ENSGT00660000095287
           HOGENOM:HOG000085654 HOVERGEN:HBG004933 KO:K06236 OrthoDB:EOG4FTW1C
           CTD:1281 OMA:EGSPGHP GO:GO:0005586 EMBL:BC087039 EMBL:X70369
           EMBL:AJ005395 EMBL:M21354 IPI:IPI00366944 PIR:S41067
           RefSeq:NP_114474.1 UniGene:Rn.3247 ProteinModelPortal:P13941
           IntAct:P13941 STRING:P13941 PRIDE:P13941 Ensembl:ENSRNOT00000004956
           GeneID:84032 KEGG:rno:84032 UCSC:RGD:71029 InParanoid:P13941
           NextBio:616623 Genevestigator:P13941 GermOnline:ENSRNOG00000003357
           Uniprot:P13941
        Length = 1463

 Score = 132 (51.5 bits), Expect = 8.9e-05, P = 8.9e-05
 Identities = 76/261 (29%), Positives = 102/261 (39%)

Query:   257 GQNAYEDGYGVPQGHGPPPSATTAGVVGA----GPNTSTSAYAAT-QSGTPMRAAYDIPR 311
             G +      G P   GPP +A   G  GA    GP  S  +  +  Q G P    +   +
Sbjct:   320 GNDGARGSDGQPGPPGPPGTAGFPGSPGAKGEVGPAGSPGSNGSPGQRGEPGPQGHAGAQ 379

Query:   312 GP-GYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGP 369
             GP G   + G PG      P+  P   P    A+GP   P    G   Q+G +   + G 
Sbjct:   380 GPPGPPGNNGSPGGKGEMGPAGIPG-APGLLGARGPP-GPAGANGAPGQRGPS--GEPGK 435

Query:   370 NYDIHRGPSYDPQRG-LGYDMQRGPN-YDMQRG-PGYE-TQRVPGYDVQRG-PVYEAQRA 424
             N      P    +RG  G     GP   D + G PG      VPG   +RG P +     
Sbjct:   436 N-GAKGEPGARGERGEAGSPGIPGPKGEDGKDGSPGEPGANGVPGNPGERGAPGFRGPAG 494

Query:   425 PSYIP-QRGPGYDLQRGQGYDMRRAPSYDPSR-GT-------GFDGAPRGAAPHGQVPPP 475
             P+  P ++GP  + + G G    R  + +P R GT       G  G+P G    G+  PP
Sbjct:   495 PNGAPGEKGPAGE-RGGPGPAGPRGVAGEPGRDGTPGGPGIRGMPGSPGGPGNDGKPGPP 553

Query:   476 LNNVPYGSATPPARSGS-GQP 495
              +    G   PP  SG  GQP
Sbjct:   554 GSQGESGRPGPPGPSGPRGQP 574

 Score = 126 (49.4 bits), Expect = 0.00040, P = 0.00040
 Identities = 82/284 (28%), Positives = 103/284 (36%)

Query:   238 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP----NTSTS 292
             G  GG  G +       P G + +    G P   GPP     AG  G  GP      S  
Sbjct:   166 GGMGGYPGPAGPPGPPGPPGSSGHPGSPGSPGYQGPPGEPGQAGPAGPPGPPGAIGPSGP 225

Query:   293 AYAATQSGTPMRAAY-DIPRGPGYEASKG-PGYDASKAP-SYDPTKGPSYDPAKGPGYDP 349
             A    +SG P R     +P  PG +   G PG+   K    +D   G   +    PG   
Sbjct:   226 AGKDGESGRPGRPGERGLPGPPGIKGPAGIPGFPGMKGHRGFDGRNGEKGETG-APGLKG 284

Query:   350 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGP-GYE-T 405
               G PG +   G      RG   +  R P      G  G D  RG   D Q GP G   T
Sbjct:   285 ENGLPGDNGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQPGPPGPPGT 339

Query:   406 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 465
                PG    +G V  A    S      PG   QRG+      A +  P    G +G+P G
Sbjct:   340 AGFPGSPGAKGEVGPAGSPGS---NGSPG---QRGEPGPQGHAGAQGPPGPPGNNGSPGG 393

Query:   466 AAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RG--GNPAR 502
                 G  P  +   P   G+  PP  +G+ G P  RG  G P +
Sbjct:   394 KGEMG--PAGIPGAPGLLGARGPPGPAGANGAPGQRGPSGEPGK 435


>FB|FBgn0052685
            symbol:ZAP3 species:7227 "Drosophila melanogaster"
            [GO:0005634 "nucleus" evidence=IEA] [GO:0008157 "protein
            phosphatase 1 binding" evidence=IPI] [GO:0048812 "neuron projection
            morphogenesis" evidence=IMP] InterPro:IPR026314 GO:GO:0005634
            EMBL:AE014298 PANTHER:PTHR13413 GeneTree:ENSGT00440000039837
            FlyBase:FBgn0052685 RefSeq:NP_727393.1 UniGene:Dm.10734
            ProteinModelPortal:Q9W2Y5 SMR:Q9W2Y5 IntAct:Q9W2Y5 MINT:MINT-741898
            STRING:Q9W2Y5 EnsemblMetazoa:FBtr0071489 GeneID:31942
            KEGG:dme:Dmel_CG32685 UCSC:CG32685-RC InParanoid:Q9W2Y5
            PhylomeDB:Q9W2Y5 GenomeRNAi:31942 NextBio:776058
            ArrayExpress:Q9W2Y5 Bgee:Q9W2Y5 Uniprot:Q9W2Y5
        Length = 1884

 Score = 136 (52.9 bits), Expect = 9.1e-05, Sum P(2) = 9.1e-05
 Identities = 77/285 (27%), Positives = 109/285 (38%)

Query:   228 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPS-ATTAGVVGAG 286
             N+ N ++  D     +T N E   +  P G     +G G   G GP  +  +   V G  
Sbjct:   994 NSGNENKSQDAGDSVSTNNGEKPDNNGPPGGFGPGNGPGGGPGSGPGQNDGSRFDVFGPN 1053

Query:   287 PNTSTSAYAATQSGTPMRAAYDI---PRGPGYEASKGPGYDASKAPSYD--PTKGPSYDP 341
               +  +      +G P          P GPG   + GP +  +  P     P   P+  P
Sbjct:  1054 QVSGNNFIDLDNNGPPGFGPPGRNFGPNGPGPRGNFGPNFGHNFGPRGPGGPFIRPN-GP 1112

Query:   342 AKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP 401
               GPG  P  GP +    G N+    GPN+    GP++ P+ G      RGP+     GP
Sbjct:  1113 LPGPG--PNFGPHF-RPNGPNF----GPNF----GPNFGPRPGSRNFGPRGPD-----GP 1156

Query:   402 -GYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAP--SYDPSRGTG 458
              G      PG D   GP +   R P   P  GPG++++   G  +   P        G G
Sbjct:  1157 FG------PGRDDFGGPPFGGPR-PHMGPN-GPGHNMRGFNGGPISDNPFRRQGGPPGPG 1208

Query:   459 FDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPARR 503
             F     GA P  + P    N  +G+   P   G G   GGN  R+
Sbjct:  1209 FGNDDLGAGPP-RGPRNFGN-RFGN---PGGGGGGGGGGGNNNRK 1248

 Score = 47 (21.6 bits), Expect = 9.1e-05, Sum P(2) = 9.1e-05
 Identities = 8/16 (50%), Positives = 8/16 (50%)

Query:    33 PPMPGAFPPFDMMPPP 48
             PP P   PP    PPP
Sbjct:    18 PPQPSVPPPLPDAPPP 33


>ZFIN|ZDB-GENE-050302-9
            symbol:col2a1b "collagen type II, alpha-1b"
            species:7955 "Danio rerio" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0033333 "fin development" evidence=IMP]
            [GO:0033334 "fin morphogenesis" evidence=IMP] [GO:0005581
            "collagen" evidence=IEA] EMBL:HF563615 EMBL:HF563616 EMBL:HF563617
            Uniprot:L0S5L0
        Length = 1493

 Score = 132 (51.5 bits), Expect = 9.1e-05, P = 9.1e-05
 Identities = 82/282 (29%), Positives = 99/282 (35%)

Query:   233 DRRADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNT 289
             +R   G  G  GA GN        P G        G P   G P +   AG  GA GP  
Sbjct:   337 ERGRPGPSGASGARGNDGLPGGAGPPGPVGTAGSPGFP---GSPGAKGEAGPTGARGPEG 393

Query:   290 STSAYAATQSGTPMRAAYDIPRG-PGYEASKG-PGYDASK-APSYDPTKG-PSYDPAKGP 345
             +       +SG P  +    P G  G   S G PG   S  AP      G P   P   P
Sbjct:   394 AQGPRG--ESGVPGASG---PSGVSGNPGSDGMPGAKGSVGAPGIGGAPGFPG--PRGPP 446

Query:   346 GYDPTKGP-GYDAQKGSN----YDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 400
             G     GP G   Q G +    +  + GP  +I            G + +RGP  +    
Sbjct:   447 GPQGATGPLGPKGQSGDSGLAGFKGEAGPKGEIGNAGLQGAPGPAGEEGKRGPRGEPGAA 506

Query:   401 --PGYETQR-VPGYDVQRG-PVYEAQRAPSYIP-QRGP-GYDLQRGQGYDMRRAPSYDPS 454
               PG   +R  PG    RG P  +    P   P +RGP G    +G G D  R       
Sbjct:   507 GPPGPTGERGTPG---NRGFPGQDGLAGPKGAPGERGPAGVSGPKGAGGDPGRPGEPGLP 563

Query:   455 RGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSG-SGQP 495
                G  G P  A P G+V P       G   PP   G  GQP
Sbjct:   564 GARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGVRGQP 605

 Score = 131 (51.2 bits), Expect = 0.00012, P = 0.00012
 Identities = 88/298 (29%), Positives = 112/298 (37%)

Query:   236 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYA 295
             A G  G A    E    G+P G + ++   G+P   GPP      G  G  P  +  A A
Sbjct:   646 AAGPPGPAGSAGERGEQGQP-GPSGFQ---GLPGPPGPPGEGGKPGDQGV-PGEAGGAGA 700

Query:   296 AT---QSGTPMRAAYDIPRG-PGYEASKG-PGYDASKAPSYDP--TKGPSYDPA-KG-PG 346
                  + G P       P+G  G     G PG D  K     P  T G    P  +G PG
Sbjct:   701 TGPRGERGFPGERGGAGPQGLQGPRGLPGTPGTDGPKG-GVGPAGTAGAQGPPGLQGMPG 759

Query:   347 YDPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL----GYDMQRGPNYDM-QRG 400
                T G PG    +G N D  +GP       P  D  RGL    G     GPN +  + G
Sbjct:   760 ERGTSGNPGPKGDRGDNGD--KGPE----GAPGKDGSRGLTGPIGPTGPAGPNGEKGESG 813

Query:   401 P----GYE-TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSR 455
             P    G   T+ VPG   + GP   A  A        PG   ++G+G     A +  P  
Sbjct:   814 PAGPSGVAGTRGVPGDRGETGPPGPAGFAGPPGADGQPGVKGEQGEGGQKGDAGAPGPQG 873

Query:   456 GTGFDG--APRGAA-PHG----QVPPPLNNVP--YGSATPPARSGSGQPRG--GNPAR 502
              +G  G   P G + P G    Q PP     P   G   PP  +G+  P G  G P +
Sbjct:   874 PSGAPGPQGPTGVSGPKGARGAQGPPGATGFPGAAGRVGPPGPNGNPGPAGPAGPPGK 931

 Score = 124 (48.7 bits), Expect = 0.00067, P = 0.00067
 Identities = 78/259 (30%), Positives = 90/259 (34%)

Query:   257 GQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAAT-QSGTPMRAAYDIPRGPG 314
             G+   +   G P   GP  +    G  G +GP  +  A      +G P  A    P GP 
Sbjct:   858 GEGGQKGDAGAPGPQGPSGAPGPQGPTGVSGPKGARGAQGPPGATGFPGAAGRVGPPGPN 917

Query:   315 YEASKGPGYDASKAPSYDPTKGPSYD--PAKGPGYDPTKGP-GYDAQKGS-NYDAQRGPN 370
                + GP   A   P  D  KG   D  P   PG    +G  G   +KG    D   GP 
Sbjct:   918 --GNPGPAGPAGP-PGKDGPKGVRGDGGPPGRPGDAGLRGSAGPAGEKGDPGEDGPHGP- 973

Query:   371 YDIHRGPS-YDPQRGL-GYDMQRGPN-YDMQRGPGYET--QRVPGYDVQRGPVYEAQRAP 425
              D   GP     QRG+ G   QRG   +    GP  E   Q  PG    RGP      AP
Sbjct:   974 -DGPAGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPGGPGDRGPPGPVG-AP 1031

Query:   426 SYIPQRG-PGYDLQRGQGYDMRRAPS--YDPSRG----TGFDGAPRGAAPHGQVPPPLNN 478
                   G PG +   G      R  S      RG     G  GAP G    G V P    
Sbjct:  1032 GLTGAAGEPGREGNPGSDGPPGRDGSAGIKGDRGDTGPAGAPGAPGGPGAPGPVGPTGKQ 1091

Query:   479 VPYGSATPPARSGSGQPRG 497
                G A P   SG   P G
Sbjct:  1092 GDRGEAGPHGPSGPPGPAG 1110

 Score = 123 (48.4 bits), Expect = 0.00086, P = 0.00086
 Identities = 79/280 (28%), Positives = 96/280 (34%)

Query:   233 DRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQG-HGPPPSATTAGVVGAGPNTST 291
             D+   G  GGA         G P G+     G G PQG  GP     T G  G       
Sbjct:   688 DQGVPGEAGGAGATGPRGERGFP-GERG---GAG-PQGLQGPRGLPGTPGTDGPKGGVGP 742

Query:   292 SAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKG-PG-YDP 349
             +  A  Q G P      +P   G   + GP  D        P   P  D ++G  G   P
Sbjct:   743 AGTAGAQ-GPP--GLQGMPGERGTSGNPGPKGDRGDNGDKGPEGAPGKDGSRGLTGPIGP 799

Query:   350 TKGPGYDAQKGSNYDAQRGPNYDIHRGPS-YDPQRGLGYDM-QRGPN--YDMQRGPGYET 405
             T   G + +KG +     GP      GPS     RG+  D  + GP         PG + 
Sbjct:   800 TGPAGPNGEKGES-----GP-----AGPSGVAGTRGVPGDRGETGPPGPAGFAGPPGADG 849

Query:   406 QR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPR 464
             Q  V G   + G   +A       P   PG     G         +  P   TGF GA  
Sbjct:   850 QPGVKGEQGEGGQKGDAGAPGPQGPSGAPGPQGPTGVSGPKGARGAQGPPGATGFPGAAG 909

Query:   465 GAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPARR 503
                P G   P  N  P G A PP + G    RG G P  R
Sbjct:   910 RVGPPG---PNGNPGPAGPAGPPGKDGPKGVRGDGGPPGR 946


>UNIPROTKB|B7Z964
            symbol:SLMAP "cDNA, FLJ79335, highly similar to Homo
            sapiens sarcolemma associated protein (SLMAP), mRNA" species:9606
            "Homo sapiens" [GO:0006457 "protein folding" evidence=IEA]
            [GO:0016272 "prefoldin complex" evidence=IEA] [GO:0051082 "unfolded
            protein binding" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] InterPro:IPR002777 Pfam:PF01920 GO:GO:0016021
            GO:GO:0006457 GO:GO:0016272 EMBL:AC114480 UniGene:Hs.476432
            HGNC:HGNC:16643 ChiTaRS:SLMAP HOGENOM:HOG000072709 EMBL:AC099777
            HOVERGEN:HBG087998 EMBL:AK304493 EMBL:AK316436 IPI:IPI00946123
            STRING:B7Z964 Ensembl:ENST00000495364 UCSC:uc011bfa.1
            Uniprot:B7Z964
        Length = 362

 Score = 124 (48.7 bits), Expect = 9.7e-05, P = 9.7e-05
 Identities = 40/165 (24%), Positives = 82/165 (49%)

Query:    49 EVMEQKIASQHVEMQKLATE-NQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQ 107
             +V++ ++   H++ + L  E +  + +T    R EL +A+ E+ +LH     + SER+  
Sbjct:    80 QVLQAQLQRLHIDTENLREEKDSEITST----RDELLSARDEILLLHQAAAKVASERDTD 135

Query:   108 MRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDV 167
             + +L E++ K+ AEL+       E++K  T  QN    R +      Q  ++  R   ++
Sbjct:   136 IASLQEELKKVRAELERWRKAASEYEKEITSLQNSFQLRCQQCED--QQREEATRLQGEL 193

Query:   168 QQIP----ALLSELESLRQEYHHCRGTYE-YEKKFYNDHLESLQV 207
             +++     AL +E  SL++E        +  EK+ +N   +SL++
Sbjct:   194 EKLRKEWNALETECHSLKRENVLLSSELQRQEKELHNSQKQSLEL 238


>UNIPROTKB|Q8WML4
            symbol:MUC1 "Mucin-1" species:9913 "Bos taurus" [GO:0016324
            "apical plasma membrane" evidence=IBA] [GO:0009986 "cell surface"
            evidence=IBA] [GO:0005737 "cytoplasm" evidence=IBA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] PANTHER:PTHR10006 GO:GO:0016021 GO:GO:0005634
            GO:GO:0005737 GO:GO:0009986 GO:GO:0016324 InterPro:IPR000082
            Pfam:PF01390 SMART:SM00200 PROSITE:PS50024 EMBL:AJ400824
            EMBL:AF399757 IPI:IPI00706283 RefSeq:NP_776540.1 UniGene:Bt.9561
            HSSP:Q16615 ProteinModelPortal:Q8WML4 SMR:Q8WML4 STRING:Q8WML4
            MEROPS:S71.001 Ensembl:ENSBTAT00000014051 GeneID:281333
            KEGG:bta:281333 CTD:4582 eggNOG:NOG77744
            GeneTree:ENSGT00700000104548 HOGENOM:HOG000290201
            HOVERGEN:HBG003075 InParanoid:Q8WML4 KO:K06568 OMA:PPAHGVT
            OrthoDB:EOG4NGGNM NextBio:20805343 PMAP-CutDB:Q8WML4
            ArrayExpress:Q8WML4 InterPro:IPR023217 Uniprot:Q8WML4
        Length = 580

 Score = 127 (49.8 bits), Expect = 9.8e-05, P = 9.8e-05
 Identities = 49/202 (24%), Positives = 71/202 (35%)

Query:   263 DGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG-PGYEASKGP 321
             DG   P     P  A + G  GA  +T TS+ A + + +P       P   P    +  P
Sbjct:    81 DGASTPTSSPAPSPAASPGHDGA--STPTSSPAPSPAASPGHDGASTPTSSPAPSPAASP 138

Query:   322 GYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDP 381
             G+D +  P+  P   P+  P       PT  P         +D    P       P+  P
Sbjct:   139 GHDGASTPTSSPAPSPAASPGHNGTSSPTGSPAPSPAASPGHDGASTPT----SSPAPSP 194

Query:   382 QRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQ 441
                 G++    P       P       PG+D    P   +  APS  P   PG++   G 
Sbjct:   195 AASPGHNGTSSPTGSPAPSPAAS----PGHDGASTPT--SSPAPS--PAASPGHN---GT 243

Query:   442 GYDMRRAPSYDPSRGTGFDGAP 463
                   +P+  P+   G D AP
Sbjct:   244 S-SPTGSPAPSPTASPGHDSAP 264

 Score = 121 (47.7 bits), Expect = 0.00044, P = 0.00044
 Identities = 59/236 (25%), Positives = 82/236 (34%)

Query:   275 PSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASK----GPGYDASKAPS 330
             P +TT     + P   TS    T   T    A      PG++ +      P    + +P 
Sbjct:    40 PVSTTQSSPTSSPTKETSWSTTTTLLTASSPAPSPAASPGHDGASTPTSSPAPSPAASPG 99

Query:   331 YD----PTKGPSYDPAKGPGYD----PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQ 382
             +D    PT  P+  PA  PG+D    PT  P         +D    P       P+  P 
Sbjct:   100 HDGASTPTSSPAPSPAASPGHDGASTPTSSPAPSPAASPGHDGASTPT----SSPAPSPA 155

Query:   383 RGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQG 442
                G++    P       P       PG+D    P   +  APS  P   PG++   G  
Sbjct:   156 ASPGHNGTSSPTGSPAPSPAAS----PGHDGASTPT--SSPAPS--PAASPGHN---GTS 204

Query:   443 YDMRRAPSYDPSRGTGFDGA--PRGA-APHGQVPPPLNNV--PYGSATPPARSGSG 493
                  +P+  P+   G DGA  P  + AP     P  N    P GS  P   +  G
Sbjct:   205 -SPTGSPAPSPAASPGHDGASTPTSSPAPSPAASPGHNGTSSPTGSPAPSPTASPG 259

 Score = 121 (47.7 bits), Expect = 0.00044, P = 0.00044
 Identities = 55/234 (23%), Positives = 80/234 (34%)

Query:   274 PPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDP 333
             P   T      + P +S +   +  + T +  A   P  P   AS  PG+D +  P+  P
Sbjct:    35 PRRTTPVSTTQSSPTSSPTKETSWSTTTTLLTASS-P-APSPAAS--PGHDGASTPTSSP 90

Query:   334 TKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGP 393
                P+  P       PT  P         +D    P       P+  P    G+D    P
Sbjct:    91 APSPAASPGHDGASTPTSSPAPSPAASPGHDGASTPT----SSPAPSPAASPGHDGASTP 146

Query:   394 NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDP 453
                    P       PG++    P      APS  P   PG+D           +P+  P
Sbjct:   147 TSSPAPSPAAS----PGHNGTSSPT--GSPAPS--PAASPGHDGASTPTSSPAPSPAASP 198

Query:   454 SR-GTGFD-GAPR---GAAP-HGQVPPPLNNVPYGSATPPARSGSGQPRGGNPA 501
                GT    G+P     A+P H     P ++     A  P  +G+  P G +PA
Sbjct:   199 GHNGTSSPTGSPAPSPAASPGHDGASTPTSSPAPSPAASPGHNGTSSPTG-SPA 251


>UNIPROTKB|F6UV28
            symbol:TPR "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0006606 "protein import into nucleus"
            evidence=IEA] [GO:0006434 "seryl-tRNA aminoacylation" evidence=IEA]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005643 "nuclear pore"
            evidence=IEA] [GO:0005524 "ATP binding" evidence=IEA] [GO:0004828
            "serine-tRNA ligase activity" evidence=IEA] InterPro:IPR012929
            InterPro:IPR015866 Pfam:PF07926 GO:GO:0005524 GO:GO:0005737
            GO:GO:0005643 GO:GO:0006606 InterPro:IPR009053 SUPFAM:SSF46579
            GeneTree:ENSGT00700000104019 GO:GO:0004828 GO:GO:0006434
            Gene3D:1.10.287.40 OMA:RFIRREK EMBL:AAEX03005165
            Ensembl:ENSCAFT00000021777 Uniprot:F6UV28
        Length = 2127

 Score = 126 (49.4 bits), Expect = 0.00010, Sum P(2) = 0.00010
 Identities = 41/186 (22%), Positives = 87/186 (46%)

Query:    48 PEVME-QKIASQ---HVE-MQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKS 102
             P+  E +K+ S+   H + +Q+L  E  RL A        L   Q+ +Q L   +  +++
Sbjct:  1113 PDTEEYRKLLSEKEVHTKRIQQLTEEIGRLKAEIARSNASLTNNQNLIQSLKEDLNKVRT 1172

Query:   103 ERELQMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQR 162
             E+E   ++L  KI  ++ ++KT   VK   ++ KT+ + L   +++++    Q + D Q 
Sbjct:  1173 EKETIQKDLDAKIIDIQEKVKTITQVKKIGRRYKTQYEELKAQQDKVMETSAQSSGDHQE 1232

Query:   163 AHTDVQQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKL 222
              H  VQ++  L   L     +        E  +K  ++     + +++  + +  E+ +L
Sbjct:  1233 QHVSVQEMQELKETLNQAETKAKSLESQVENLQKTLSEKEAEARNLQEQTVQLQCELSRL 1292

Query:   223 RAELMN 228
             R +L +
Sbjct:  1293 RQDLQD 1298

 Score = 58 (25.5 bits), Expect = 0.00010, Sum P(2) = 0.00010
 Identities = 19/63 (30%), Positives = 25/63 (39%)

Query:   233 DRRADGSYGGATGNSENETSGRPVGQNAYE--DGYGVPQGHGPPPSATTAGVVGAGPNTS 290
             D   D +  G  G   NE +G   G + YE  D  G   G G  P   T   +G G +  
Sbjct:  1737 DDDEDDTGMGDEGEDSNEGTGSADGNDGYEADDAEG---GDGTDPGTETEESMGGGESNQ 1793

Query:   291 TSA 293
              +A
Sbjct:  1794 RAA 1796


>UNIPROTKB|F1S4P6
            symbol:EIF3A "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005852 "eukaryotic translation initiation factor 3
            complex" evidence=IEA] [GO:0005730 "nucleolus" evidence=IEA]
            [GO:0003743 "translation initiation factor activity" evidence=IEA]
            [GO:0001732 "formation of translation initiation complex"
            evidence=IEA] InterPro:IPR000717 Pfam:PF01399 SMART:SM00088
            GO:GO:0005730 GO:GO:0003743 GO:GO:0005852 OMA:QDRDEND
            GeneTree:ENSGT00690000102108 GO:GO:0001732 EMBL:CU407047
            Ensembl:ENSSSCT00000011680 Uniprot:F1S4P6
        Length = 1378

 Score = 131 (51.2 bits), Expect = 0.00011, P = 0.00011
 Identities = 110/425 (25%), Positives = 153/425 (36%)

Query:    50 VMEQKIASQHVEMQKLATENQRLAAT-HGTLRQELAAAQHELQILHGQIGGMKSERELQM 108
             VM  K A Q V  +KL    +RLA   H  L +     + E +I +      + + E + 
Sbjct:   761 VMRLKAARQSVYEEKLKQFEERLAEERHNRLEERKRQRKEERRITY-----YREKEEEEQ 815

Query:   109 RNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQ 168
             R   E++ K   E + AE  K E  +   E Q  V   EE+  K  Q   +++      +
Sbjct:   816 RRAEEQMLKEREERERAERAKRE--EELREYQERVKKLEEVERKKRQRELEIEERERRRE 873

Query:   169 QIPAL----LSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRA 224
             +   L    LS  +S R       GT+   +K     ++S       +     E +  R 
Sbjct:   874 EERRLGEDPLSRKDS-RWGDRDSEGTW---RK--GPEIDS------EWRRGPPEKDWRRG 921

Query:   225 ELMNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG 284
             E  +     RR D        + E E+S RP        G  +    GP           
Sbjct:   922 EGRDEERPHRRDDDRPRRLGDDEERESSLRPDEDRGPRRG--MDDDRGPRRGLDEDRFSR 979

Query:   285 AGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGP--SYDPA 342
              G +    ++  T    P R   D  RG    A      D       D  +G   + D  
Sbjct:   980 RGADDDRPSWRNTDDDRPPRRIGDEDRGSWRHADD----DRPPRRGLDEDRGSWRTADED 1035

Query:   343 KGP--GYDPTKGP---GYDAQKGS--NYDAQRGPN-YDIHRGP--SYDPQRG--LGYDMQ 390
             +GP  G D  +GP   G D ++ S  N D  R     D  RGP    D  RG   G D  
Sbjct:  1036 RGPRRGMDEDRGPRRGGVDDERSSWRNADDDRPRRGMDDDRGPRRGMDDDRGPRRGMDDD 1095

Query:   391 RGPN--YDMQRGPGYETQ--RVP--GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYD 444
             RGP    D  RGP   T   R+   G D  RGP          IP+RG    + R +G D
Sbjct:  1096 RGPRRGLDDDRGPWRNTDDDRISRRGADDDRGPWRNMD--DDRIPRRGDDDRIPR-RGDD 1152

Query:   445 MRRAP 449
              R  P
Sbjct:  1153 SRPGP 1157


>UNIPROTKB|P0CG41
            symbol:CTAGE8 "Cutaneous T-cell lymphoma-associated antigen
            8" species:9606 "Homo sapiens" [GO:0016021 "integral to membrane"
            evidence=IEA] GO:GO:0016021 HPA:HPA000387 HPA:HPA000922
            EMBL:AC004889 UniGene:Hs.661442 IPI:IPI00969223
            ProteinModelPortal:P0CG41 PhosphoSite:P0CG41 DMDM:300680906
            PRIDE:P0CG41 Ensembl:ENST00000487179 GeneCards:GC07M143963
            HGNC:HGNC:37294 neXtProt:NX_P0CG41 OMA:LERELMV ArrayExpress:P0CG41
            Bgee:P0CG41 Uniprot:P0CG41
        Length = 777

 Score = 128 (50.1 bits), Expect = 0.00011, P = 0.00011
 Identities = 105/458 (22%), Positives = 177/458 (38%)

Query:    56 ASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKSER---ELQMRNLT 112
             A  +V ++ L  E   +      + +        ++ L  Q   ++SE    E + + L 
Sbjct:   322 AKLNVSLKSLEGERNHIIIQLSEVDKTKEELTEHIKNLQTQQASLQSENIYFESENQKLQ 381

Query:   113 EKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQQIPA 172
             +K+ K+  E      +KL ++K   E +N  +  EE +++V +    + RA   ++    
Sbjct:   382 QKL-KIMTEFYQENEMKL-YRKLTVE-ENYRIEEEEKLSRVEE---KISRATEGLETYRK 435

Query:   173 LLSELES-LRQEYH-HCRGTYEYEKKFYNDHLESLQVMEKNYITMATE----VEKL-RAE 225
             L  +LE  L +  H + +    YEK+ +++ L + +  E+N   +  E     +KL   E
Sbjct:   436 LAKDLEEELERTVHFYQKQVISYEKRGHDNWLAA-RTAERNLSDLRKENAHNKQKLTETE 494

Query:   226 L-MNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYED-GYGVPQGHGPPPSATTAGVV 283
             L       D  A      A G   +  S  P+G+ + E   +  PQ     P   +  + 
Sbjct:   495 LKFELLEKDPNALDVSNTAFGREHSPCSPSPLGRPSSETRAFPSPQTLLEDPLRLSPVLP 554

Query:   284 GAGPNTSTSAYAATQSGTPMRAAYDIPRG-PGYEA----SKGPGYDASKAPSYDPTKGPS 338
             G G    +S       G P+       RG P Y+      + P    S +   +  +   
Sbjct:   555 GGGGRGPSSP------GNPLDHQITNERGEPSYDRLIDPHRAPSDTGSLSSPVEQDRRMM 608

Query:   339 YDPAKGPGY-DPTKGPGYDAQKGSNYDAQRGPNYD-IHRGPSYDPQ-RGLGYDMQRGPNY 395
             + P  G  Y D T  P  + +  SN +   GP      +  S D   R +  +M+   N 
Sbjct:   609 FPPP-GQSYPDSTLPPQREDRFYSNSERLSGPAEPRSFKMTSLDKMDRSMPSEMESSRN- 666

Query:   396 DMQRGPG---YETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYD 452
             D +   G        +P  +   GP +     P   P RGP + +   +G  MRR P + 
Sbjct:   667 DAKDDLGNLNVPDSSLPAENEATGPGFIP---PPLAPVRGPLFPVDT-RGPFMRRGPPFP 722

Query:   453 PSR-GTGFDGAPRGAAPHGQVPPPLNNVPYGSAT--PP 487
             P   GT F GA RG  P    P P  + P+      PP
Sbjct:   723 PPPPGTMF-GASRGYFPPRDFPGP-PHAPFAMRNIYPP 758


>UNIPROTKB|F1SN69
            symbol:F1SN69 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 InterPro:IPR008985 SUPFAM:SSF49899 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 InterPro:IPR001791 GO:GO:0005201
            SMART:SM00210 GeneTree:ENSGT00700000104301 OMA:YSYPDRL
            EMBL:CU618340 EMBL:CU606988 EMBL:CU861519
            Ensembl:ENSSSCT00000006033 Uniprot:F1SN69
        Length = 1869

 Score = 132 (51.5 bits), Expect = 0.00012, P = 0.00012
 Identities = 74/250 (29%), Positives = 98/250 (39%)

Query:   266 GVPQGHGPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRG-------PGYEA 317
             GVP   GPP +    G  G+ GP  +     A   G P  A YD  +G       PG + 
Sbjct:  1274 GVPGDPGPPGTPGPKGSRGSLGPTGAPGRMGA--QGEPGLAGYDGHKGIMGPLGPPGPKG 1331

Query:   318 SKGP-GYDA-SKAPSYDP-TKGPSYDPAKGPGYDPTKGPGYDAQKG-----SNYDAQRGP 369
              KG  G D  ++ P   P  +GP  D  +G   +P   PGY  Q+G      N   Q  P
Sbjct:  1332 EKGEQGEDGKAEGPPGPPGDRGPVGD--RGDRGEPGD-PGYPGQEGVQGLRGNPGQQGQP 1388

Query:   370 NYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE-TQRVPGYDVQRGPVYEAQRAPSYI 428
              +   RG    P+   G +  +G        PG   TQ +PG    RG V   ++ P  +
Sbjct:  1389 GHPGPRGRP-GPKGSKGEEGPKGKQ-GKAGAPGRRGTQGLPGLPGPRGVV--GRQGPEGV 1444

Query:   429 --PQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAA-PHGQVPPPL---NNVPYG 482
               P   PG D Q GQ  +        P    G  G P  A  P  Q PP     + +P G
Sbjct:  1445 AGPDGLPGLDGQAGQQGEQGDDGDPGPLGPAGKRGNPGVAGLPGAQGPPGFKGESGLP-G 1503

Query:   483 SATPPARSGS 492
                PP + G+
Sbjct:  1504 QLGPPGKRGT 1513


>UNIPROTKB|H7C3M8
            symbol:SLMAP "Sarcolemmal membrane-associated protein"
            species:9606 "Homo sapiens" [GO:0006457 "protein folding"
            evidence=IEA] [GO:0016272 "prefoldin complex" evidence=IEA]
            [GO:0051082 "unfolded protein binding" evidence=IEA]
            InterPro:IPR002777 Pfam:PF01920 GO:GO:0006457 GO:GO:0016272
            EMBL:AC114480 HGNC:HGNC:16643 ChiTaRS:SLMAP EMBL:AC099777
            ProteinModelPortal:H7C3M8 PRIDE:H7C3M8 Ensembl:ENST00000417128
            Uniprot:H7C3M8
        Length = 409

 Score = 124 (48.7 bits), Expect = 0.00012, P = 0.00012
 Identities = 40/165 (24%), Positives = 82/165 (49%)

Query:    49 EVMEQKIASQHVEMQKLATE-NQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQ 107
             +V++ ++   H++ + L  E +  + +T    R EL +A+ E+ +LH     + SER+  
Sbjct:   130 QVLQAQLQRLHIDTENLREEKDSEITST----RDELLSARDEILLLHQAAAKVASERDTD 185

Query:   108 MRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDV 167
             + +L E++ K+ AEL+       E++K  T  QN    R +      Q  ++  R   ++
Sbjct:   186 IASLQEELKKVRAELERWRKAASEYEKEITSLQNSFQLRCQQCED--QQREEATRLQGEL 243

Query:   168 QQIP----ALLSELESLRQEYHHCRGTYE-YEKKFYNDHLESLQV 207
             +++     AL +E  SL++E        +  EK+ +N   +SL++
Sbjct:   244 EKLRKEWNALETECHSLKRENVLLSSELQRQEKELHNSQKQSLEL 288


>WB|WBGene00000694
            symbol:col-120 species:6239 "Caenorhabditis elegans"
            [GO:0042302 "structural constituent of cuticle" evidence=IEA]
            [GO:0016021 "integral to membrane" evidence=IEA] InterPro:IPR002486
            Pfam:PF01484 SMART:SM01088 GO:GO:0042302 HOGENOM:HOG000085656
            GeneTree:ENSGT00610000086159 EMBL:AL032632 PIR:T26465
            RefSeq:NP_501617.1 ProteinModelPortal:Q9XWR2 DIP:DIP-26936N
            IntAct:Q9XWR2 MINT:MINT-1070946 STRING:Q9XWR2
            EnsemblMetazoa:Y11D7A.11 GeneID:177748 KEGG:cel:CELE_Y11D7A.11
            UCSC:Y11D7A.11 CTD:177748 WormBase:Y11D7A.11 eggNOG:NOG265281
            InParanoid:Q9XWR2 OMA:HWELLED NextBio:898216 Uniprot:Q9XWR2
        Length = 313

 Score = 122 (48.0 bits), Expect = 0.00012, P = 0.00012
 Identities = 77/268 (28%), Positives = 97/268 (36%)

Query:   246 NSENE-TSGRPVGQNAY--EDGYGV--PQ---GHGPPPSATTAGVVGAGPNTSTSAYAAT 297
             N EN   S + VG      + GYG   P    G  P PS   A    A  ++S+S+ +  
Sbjct:    64 NLENMYESTKAVGSGPVKRQAGYGASSPSRASGSHPAPSPYDA----ASTSSSSSSDSCC 119

Query:   298 QSGTPMRAAYDIPRGPGYEASKGP----GYDASKAPSYDPTKGPSYD---PAKGPGYDPT 350
               G  +      P  PG +   GP    G D         + G   +   PA  PG  P 
Sbjct:   120 SCGIGLAGPAGFPGRPGRDGIDGPAGKPGRDGQDLDGESSSDGSQIELDCPAGPPG--PP 177

Query:   351 KGPGYDAQKGS-NYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGPGYETQRV 408
               PG     G    D   G N    R P    +RG  G D + G   D    PG     +
Sbjct:   178 GNPGPQGNSGRPGMDGMPGRNGRCGR-PGEQGERGPNGEDGRPGRRGD-DGMPG-TVNEI 234

Query:   409 PGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAA 467
             PG   Q GP    + AP     +GP     RG  G    + P+  P    GFDGAP G  
Sbjct:   235 PG---QAGPP-GLRGAPGATGSQGP-----RGNDGRPGNKGPAGPPG-DQGFDGAPGGPG 284

Query:   468 PHGQ--VPPPLNNVPYGSATPPARSGSG 493
               G+     PL      S  PP R+  G
Sbjct:   285 ADGEPGAQGPLGAKGECSHCPPPRTAPG 312


>UNIPROTKB|P12270
            symbol:TPR "Nucleoprotein TPR" species:9606 "Homo sapiens"
            [GO:0004828 "serine-tRNA ligase activity" evidence=IEA] [GO:0005524
            "ATP binding" evidence=IEA] [GO:0006434 "seryl-tRNA aminoacylation"
            evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0031965 "nuclear membrane" evidence=IDA] [GO:0005643 "nuclear
            pore" evidence=IDA] [GO:0007094 "mitotic spindle assembly
            checkpoint" evidence=IMP] [GO:0000776 "kinetochore" evidence=IDA]
            [GO:0006404 "RNA import into nucleus" evidence=IDA] [GO:0006606
            "protein import into nucleus" evidence=IMP;IDA] [GO:0005635
            "nuclear envelope" evidence=IDA] [GO:0034399 "nuclear periphery"
            evidence=IDA] [GO:0042803 "protein homodimerization activity"
            evidence=IDA] [GO:0042405 "nuclear inclusion body" evidence=IDA]
            [GO:0090267 "positive regulation of mitotic cell cycle spindle
            assembly checkpoint" evidence=IMP] [GO:0090316 "positive regulation
            of intracellular protein transport" evidence=IMP] [GO:1901673
            "regulation of spindle assembly involved in mitosis" evidence=IMP]
            [GO:0035457 "cellular response to interferon-alpha" evidence=ISS]
            [GO:0005634 "nucleus" evidence=IDA] [GO:0000122 "negative
            regulation of transcription from RNA polymerase II promoter"
            evidence=IMP] [GO:0046832 "negative regulation of RNA export from
            nucleus" evidence=IDA;IMP] [GO:0045947 "negative regulation of
            translational initiation" evidence=IMP] [GO:0031647 "regulation of
            protein stability" evidence=IMP] [GO:0010793 "regulation of mRNA
            export from nucleus" evidence=IMP] [GO:0042306 "regulation of
            protein import into nucleus" evidence=IMP] [GO:0046825 "regulation
            of protein export from nucleus" evidence=IMP] [GO:0005487
            "nucleocytoplasmic transporter activity" evidence=IDA] [GO:0031453
            "positive regulation of heterochromatin assembly" evidence=IMP]
            [GO:0044615 "nuclear pore nuclear basket" evidence=IDA] [GO:0005737
            "cytoplasm" evidence=IDA] [GO:0019898 "extrinsic to membrane"
            evidence=IDA] [GO:0043495 "protein anchor" evidence=IMP]
            [GO:0051019 "mitogen-activated protein kinase binding"
            evidence=IDA] [GO:0070849 "response to epidermal growth factor
            stimulus" evidence=IDA] [GO:0000189 "MAPK import into nucleus"
            evidence=IMP] [GO:0042307 "positive regulation of protein import
            into nucleus" evidence=IMP] [GO:0070840 "dynein complex binding"
            evidence=IDA] [GO:0005868 "cytoplasmic dynein complex"
            evidence=IDA] [GO:0015631 "tubulin binding" evidence=IDA]
            [GO:0072686 "mitotic spindle" evidence=IDA] [GO:0010965 "regulation
            of mitotic sister chromatid separation" evidence=IMP] [GO:0046827
            "positive regulation of protein export from nucleus" evidence=ISS]
            [GO:0031990 "mRNA export from nucleus in response to heat stress"
            evidence=IDA] [GO:0031072 "heat shock protein binding"
            evidence=IDA] [GO:0034605 "cellular response to heat" evidence=IDA]
            [GO:0003682 "chromatin binding" evidence=IDA] [GO:0003729 "mRNA
            binding" evidence=IDA] [GO:0006999 "nuclear pore organization"
            evidence=IMP] [GO:0043578 "nuclear matrix organization"
            evidence=IMP] [GO:0006611 "protein export from nucleus"
            evidence=IMP] [GO:0005215 "transporter activity" evidence=IMP]
            [GO:0006405 "RNA export from nucleus" evidence=IMP] [GO:0051292
            "nuclear pore complex assembly" evidence=IMP] [GO:0005654
            "nucleoplasm" evidence=TAS] [GO:0005975 "carbohydrate metabolic
            process" evidence=TAS] [GO:0008645 "hexose transport" evidence=TAS]
            [GO:0010827 "regulation of glucose transport" evidence=TAS]
            [GO:0015758 "glucose transport" evidence=TAS] [GO:0016032 "viral
            reproduction" evidence=TAS] [GO:0019221 "cytokine-mediated
            signaling pathway" evidence=TAS] [GO:0044281 "small molecule
            metabolic process" evidence=TAS] [GO:0055085 "transmembrane
            transport" evidence=TAS] Reactome:REACT_111217 Reactome:REACT_15518
            InterPro:IPR012929 InterPro:IPR015866 Pfam:PF07926 GO:GO:0005524
            GO:GO:0005737 Reactome:REACT_116125 Reactome:REACT_6900
            GO:GO:0005654 GO:GO:0016032 GO:GO:0007094 GO:GO:0044281
            GO:GO:0005975 GO:GO:0031965 EMBL:CH471067 GO:GO:0005643
            GO:GO:0019221 GO:GO:0015758 GO:GO:0010827 GO:GO:0055085
            GO:GO:0006606 eggNOG:NOG12793 KO:K09291 GO:GO:0051028 GO:GO:0000777
            InterPro:IPR009053 SUPFAM:SSF46579 MIM:188550 Orphanet:146
            EMBL:AL133553 EMBL:X62947 PIR:S23741 EMBL:AL596220 GO:GO:0004828
            GO:GO:0006434 Gene3D:1.10.287.40 EMBL:X66397 EMBL:Y00672
            IPI:IPI00742682 RefSeq:NP_003283.2 UniGene:Hs.279640
            ProteinModelPortal:P12270 IntAct:P12270 MINT:MINT-1144652
            STRING:P12270 PhosphoSite:P12270 DMDM:215274208 PaxDb:P12270
            PRIDE:P12270 Ensembl:ENST00000367478 GeneID:7175 KEGG:hsa:7175
            UCSC:uc001grv.3 CTD:7175 GeneCards:GC01M186281 HGNC:HGNC:12017
            HPA:HPA019661 HPA:HPA019663 HPA:HPA024336 MIM:189940
            neXtProt:NX_P12270 PharmGKB:PA36696 HOGENOM:HOG000139431
            HOVERGEN:HBG009158 InParanoid:P12270 OMA:RFIRREK OrthoDB:EOG42RD6D
            GenomeRNAi:7175 NextBio:28128 PMAP-CutDB:P12270 ArrayExpress:P12270
            Bgee:P12270 CleanEx:HS_TPR Genevestigator:P12270
            GermOnline:ENSG00000047410 Uniprot:P12270
        Length = 2363

 Score = 128 (50.1 bits), Expect = 0.00012, Sum P(2) = 0.00012
 Identities = 41/186 (22%), Positives = 88/186 (47%)

Query:    48 PEVME-QKIASQ---HVE-MQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKS 102
             P+  E +K+ S+   H + +Q+L  E  RL A        L   Q+ +Q L   +  +++
Sbjct:  1349 PDTEEYRKLLSEKEVHTKRIQQLTEEIGRLKAEIARSNASLTNNQNLIQSLKEDLNKVRT 1408

Query:   103 ERELQMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQR 162
             E+E   ++L  KI  ++ ++KT   VK   ++ KT+ + L   +++++    Q + D Q 
Sbjct:  1409 EKETIQKDLDAKIIDIQEKVKTITQVKKIGRRYKTQYEELKAQQDKVMETSAQSSGDHQE 1468

Query:   163 AHTDVQQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKL 222
              H  VQ++  L   L     +        E  +K  ++     + +++  + + +E+ +L
Sbjct:  1469 QHVSVQEMQELKETLNQAETKSKSLESQVENLQKTLSEKETEARNLQEQTVQLQSELSRL 1528

Query:   223 RAELMN 228
             R +L +
Sbjct:  1529 RQDLQD 1534

 Score = 56 (24.8 bits), Expect = 0.00012, Sum P(2) = 0.00012
 Identities = 19/63 (30%), Positives = 24/63 (38%)

Query:   233 DRRADGSYGGATGNSENETSGRPVGQNAYE--DGYGVPQGHGPPPSATTAGVVGAGPNTS 290
             D   D +  G  G   NE +G   G + YE  D  G   G G  P   T   +G G    
Sbjct:  1973 DDDEDDTGMGDEGEDSNEGTGSADGNDGYEADDAEG---GDGTDPGTETEESMGGGEGNH 2029

Query:   291 TSA 293
              +A
Sbjct:  2030 RAA 2032


>UNIPROTKB|F1S300
            symbol:TPR "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0031965 "nuclear membrane" evidence=IEA] [GO:0007094
            "mitotic spindle assembly checkpoint" evidence=IEA] [GO:0005643
            "nuclear pore" evidence=IEA] [GO:0000776 "kinetochore"
            evidence=IEA] [GO:0006606 "protein import into nucleus"
            evidence=IEA] [GO:0006434 "seryl-tRNA aminoacylation" evidence=IEA]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005524 "ATP binding"
            evidence=IEA] [GO:0004828 "serine-tRNA ligase activity"
            evidence=IEA] InterPro:IPR012929 InterPro:IPR015866 Pfam:PF07926
            GO:GO:0005524 GO:GO:0005737 GO:GO:0000776 GO:GO:0007094
            GO:GO:0031965 GO:GO:0005643 GO:GO:0006606 InterPro:IPR009053
            SUPFAM:SSF46579 GeneTree:ENSGT00700000104019 GO:GO:0004828
            GO:GO:0006434 Gene3D:1.10.287.40 OMA:RFIRREK EMBL:CU657929
            EMBL:FP340191 Ensembl:ENSSSCT00000016969 Uniprot:F1S300
        Length = 2365

 Score = 128 (50.1 bits), Expect = 0.00013, Sum P(2) = 0.00013
 Identities = 43/187 (22%), Positives = 88/187 (47%)

Query:    48 PEVME-QKIASQ---HVE-MQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKS 102
             P+  E +K+ S+   H + +Q+L  E  RL A        L   Q+ +Q L   +  +++
Sbjct:  1349 PDTEEYRKLLSEKEVHTKRIQQLTEEIGRLKAEIARSNASLTNNQNLIQSLKEDLNKVRT 1408

Query:   103 ERELQMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQR 162
             E+E   ++L  KI  ++ ++KT   VK   ++ KT+ + L   +++++    Q + D Q 
Sbjct:  1409 EKESIQKDLDAKIIDIQEKVKTITQVKKIGRRYKTQYEELKAQQDKVMETSAQSSGDHQE 1468

Query:   163 AHTDVQQIPALLSELESLRQEYHHCRGTYE-YEKKFYNDHLESLQVMEKNYITMATEVEK 221
              H  VQ++  L   L     +        E  +K      +E+  + E+  + + +E+ +
Sbjct:  1469 QHVSVQEMQELKEALNQAEAKSKSLESQVENLQKTLSEKEMEARNLQEQT-VQLQSELSR 1527

Query:   222 LRAELMN 228
             LR +L +
Sbjct:  1528 LRQDLQD 1534

 Score = 56 (24.8 bits), Expect = 0.00013, Sum P(2) = 0.00013
 Identities = 19/63 (30%), Positives = 25/63 (39%)

Query:   233 DRRADGSYGGATGNSENETSGRPVGQNAYE--DGYGVPQGHGPPPSATTAGVVGAGPNTS 290
             D   D +  G  G   NE +G   G + YE  D  G   G G  P   T   +G G +  
Sbjct:  1975 DDDEDDTGMGDEGEVSNEGTGSADGNDGYEADDAEG---GDGTDPGTETEESMGGGESNQ 2031

Query:   291 TSA 293
              +A
Sbjct:  2032 RAA 2034


>UNIPROTKB|F1NCR0
            symbol:COL1A2 "Collagen alpha-2(I) chain" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0001501 "skeletal system
            development" evidence=IEA] [GO:0001568 "blood vessel development"
            evidence=IEA] [GO:0005584 "collagen type I" evidence=IEA]
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0007179
            "transforming growth factor beta receptor signaling pathway"
            evidence=IEA] [GO:0007266 "Rho protein signal transduction"
            evidence=IEA] [GO:0008217 "regulation of blood pressure"
            evidence=IEA] [GO:0030199 "collagen fibril organization"
            evidence=IEA] [GO:0030674 "protein binding, bridging" evidence=IEA]
            [GO:0042802 "identical protein binding" evidence=IEA] [GO:0043589
            "skin morphogenesis" evidence=IEA] [GO:0046332 "SMAD binding"
            evidence=IEA] [GO:0048407 "platelet-derived growth factor binding"
            evidence=IEA] [GO:0070208 "protein heterotrimerization"
            evidence=IEA] [GO:0071230 "cellular response to amino acid
            stimulus" evidence=IEA] InterPro:IPR000885 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS51461 SMART:SM00038 GO:GO:0005615
            GO:GO:0030199 GO:GO:0007179 GO:GO:0007266 GO:GO:0070208
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0071230 GO:GO:0005201
            GeneTree:ENSGT00660000095287 GO:GO:0005584 EMBL:AADN02000724
            IPI:IPI00821202 Ensembl:ENSGALT00000015706 ArrayExpress:F1NCR0
            Uniprot:F1NCR0
        Length = 1318

 Score = 130 (50.8 bits), Expect = 0.00013, P = 0.00013
 Identities = 81/262 (30%), Positives = 97/262 (37%)

Query:   255 PVGQNAYEDGYGVPQGH----GPPPSATTAGVVGA-GPNTSTSAYAATQSGTP-MRAAYD 308
             PVG+   E G   P G     GP   A  AG  G  GP     A      G P  R    
Sbjct:   781 PVGRTG-EQGIAGPPGFAGEKGPSGEAGAAGPPGTPGPQGILGAPGIL--GLPGSRGERG 837

Query:   309 IPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYDAQR 367
             +P   G     GP    S  P     +GPS  P   PG +   G  G D   G++    R
Sbjct:   838 LPGIAGATGEPGP-LGVSGPPG---ARGPS-GPVGSPGPNGAPGEAGRDGNPGNDGPPGR 892

Query:   368 --GPNYDIHRG-PSYD-PQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQR 423
                P +   RG P    P   LG     GP+   Q GP  +    PG     GPV     
Sbjct:   893 DGAPGFKGERGAPGNPGPSGALGAP---GPHG--QVGPSGK----PGNRGDPGPVGPVGP 943

Query:   424 APSYIPQ--RGP-GYDLQRGQGYDM--RRAPSYDPSRG-TGFDGAPRGAAPHGQVPPPLN 477
             A ++ P+   GP G   ++G+  D   R  P     +G  G  G P  A  HG   PP N
Sbjct:   944 AGAFGPRGLAGPQGPRGEKGEPGDKGHRGLPGL---KGHNGLQGLPGLAGQHGDQGPPGN 1000

Query:   478 NVPYGSATPPARSGSGQPRGGN 499
             N P G   PP  SG     G N
Sbjct:  1001 NGPAGPRGPPGPSGPPGKDGRN 1022


>UNIPROTKB|F1M6Q3
            symbol:Col4a2 "Protein Col4a2" species:10116 "Rattus
            norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
            GO:GO:0071560 InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0006351
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0016525 GO:GO:0005201
            GO:GO:0005587 Gene3D:2.170.240.10 GeneTree:ENSGT00690000101772
            IPI:IPI00778948 Ensembl:ENSRNOT00000057461 Uniprot:F1M6Q3
        Length = 1647

 Score = 131 (51.2 bits), Expect = 0.00013, P = 0.00013
 Identities = 90/302 (29%), Positives = 112/302 (37%)

Query:   229 APNVDRRADGSYGGATGN----SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG 284
             +P VD   D  + G TG+     E  T   PVG    +   G+P   GP  S    G  G
Sbjct:  1145 SPGVDAHGDPGFPGPTGDRGDRGEANTLPGPVGAPGQKGEQGIPGERGPVGSPGLQGFPG 1204

Query:   285 AGPNTSTSAYAATQSGTPM---RAAYDIPRGP-GYEASKGPGYD--ASKAPSYDPTKGPS 338
               P ++ S       G P       Y  P GP G  A  G   D  +S A  +   KG  
Sbjct:  1205 ISPPSNISGLPG-DVGAPGIFGLQGYQGPPGPPGPNALPGIKGDEGSSGAAGFPGEKGWV 1263

Query:   339 YDPAKGPGYDP-TKG-PGYDAQKGSN-YDAQRGPNYDI-HRGPSYDPQRGLGYDMQRGPN 394
              DP  GP   P   G PG    KG   +    GP+  +  RGP   P+   G+    G  
Sbjct:  1264 GDP--GPQGQPGVHGLPGEKGPKGEQGFMGNTGPSGAVGDRGPK-GPKGDQGFPGAPGS- 1319

Query:   395 YDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDP 453
                   PG     +PG   Q+  V      P    +RG PG   + G      + P  DP
Sbjct:  1320 ---MGSPG-----IPGIP-QKIAVQPGTMGPQ--GRRGLPGALGEMGP-----QGPPGDP 1363

Query:   454 SRGTGFDGAPRGAAPHGQ-----VPP------PLNNV-PYGSATPPARSGS-GQPRGGNP 500
                 GF GAP  A P G+     VP       P+ +  P G    P R GS G P  G P
Sbjct:  1364 ----GFRGAPGKAGPQGRGGVSAVPGFRGDQGPMGHQGPIGQEGEPGRPGSPGLP--GMP 1417

Query:   501 AR 502
              R
Sbjct:  1418 GR 1419


>UNIPROTKB|H7BZK0
            symbol:SLMAP "Sarcolemmal membrane-associated protein"
            species:9606 "Homo sapiens" [GO:0006457 "protein folding"
            evidence=IEA] [GO:0016272 "prefoldin complex" evidence=IEA]
            [GO:0051082 "unfolded protein binding" evidence=IEA]
            InterPro:IPR002777 Pfam:PF01920 GO:GO:0006457 GO:GO:0016272
            EMBL:AC114480 HGNC:HGNC:16643 ChiTaRS:SLMAP EMBL:AC099777
            ProteinModelPortal:H7BZK0 PRIDE:H7BZK0 Ensembl:ENST00000416658
            Uniprot:H7BZK0
        Length = 433

 Score = 124 (48.7 bits), Expect = 0.00013, P = 0.00013
 Identities = 40/165 (24%), Positives = 82/165 (49%)

Query:    49 EVMEQKIASQHVEMQKLATE-NQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQ 107
             +V++ ++   H++ + L  E +  + +T    R EL +A+ E+ +LH     + SER+  
Sbjct:   154 QVLQAQLQRLHIDTENLREEKDSEITST----RDELLSARDEILLLHQAAAKVASERDTD 209

Query:   108 MRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDV 167
             + +L E++ K+ AEL+       E++K  T  QN    R +      Q  ++  R   ++
Sbjct:   210 IASLQEELKKVRAELERWRKAASEYEKEITSLQNSFQLRCQQCED--QQREEATRLQGEL 267

Query:   168 QQIP----ALLSELESLRQEYHHCRGTYE-YEKKFYNDHLESLQV 207
             +++     AL +E  SL++E        +  EK+ +N   +SL++
Sbjct:   268 EKLRKEWNALETECHSLKRENVLLSSELQRQEKELHNSQKQSLEL 312


>UNIPROTKB|P02467
            symbol:COL1A2 "Collagen alpha-2(I) chain" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0005583 "fibrillar collagen" evidence=IDA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 GO:GO:0046872 InterPro:IPR008160 Pfam:PF01391
            eggNOG:NOG12793 GO:GO:0005201 HOVERGEN:HBG004933 EMBL:M25963
            EMBL:M25956 EMBL:M25959 EMBL:M25961 EMBL:M25962 EMBL:M25965
            EMBL:M25964 EMBL:M25984 EMBL:M25957 EMBL:M25966 EMBL:M25967
            EMBL:M25969 EMBL:M25970 EMBL:M25971 EMBL:M25972 EMBL:M25973
            EMBL:M25974 EMBL:M25976 EMBL:M25977 EMBL:M25978 EMBL:M25979
            EMBL:M25980 EMBL:M25981 EMBL:M25982 EMBL:M25983 EMBL:J00826
            EMBL:J00821 EMBL:K00792 EMBL:J00830 EMBL:J00829 EMBL:J00837
            EMBL:J00812 EMBL:J00811 EMBL:J00814 EMBL:J00815 EMBL:X02657
            EMBL:K00794 EMBL:V00390 EMBL:M17608 EMBL:M10581 EMBL:M10540
            EMBL:J00828 EMBL:J00827 EMBL:J00832 EMBL:J00831 EMBL:J00833
            EMBL:J00822 IPI:IPI00914483 PIR:I50173 PIR:I50206 PIR:S10847
            UniGene:Gga.5097 STRING:P02467 PRIDE:P02467 InParanoid:P02467
            PMAP-CutDB:P02467 GO:GO:0005583 Uniprot:P02467
        Length = 1362

 Score = 130 (50.8 bits), Expect = 0.00013, P = 0.00013
 Identities = 81/262 (30%), Positives = 97/262 (37%)

Query:   255 PVGQNAYEDGYGVPQGH----GPPPSATTAGVVGA-GPNTSTSAYAATQSGTP-MRAAYD 308
             PVG+   E G   P G     GP   A  AG  G  GP     A      G P  R    
Sbjct:   825 PVGRTG-EQGIAGPPGFAGEKGPSGEAGAAGPPGTPGPQGILGAPGIL--GLPGSRGERG 881

Query:   309 IPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYDAQR 367
             +P   G     GP    S  P     +GPS  P   PG +   G  G D   G++    R
Sbjct:   882 LPGIAGATGEPGP-LGVSGPPG---ARGPS-GPVGSPGPNGAPGEAGRDGNPGNDGPPGR 936

Query:   368 --GPNYDIHRG-PSYD-PQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQR 423
                P +   RG P    P   LG     GP+   Q GP  +    PG     GPV     
Sbjct:   937 DGAPGFKGERGAPGNPGPSGALGAP---GPHG--QVGPSGK----PGNRGDPGPVGPVGP 987

Query:   424 APSYIPQ--RGP-GYDLQRGQGYDM--RRAPSYDPSRG-TGFDGAPRGAAPHGQVPPPLN 477
             A ++ P+   GP G   ++G+  D   R  P     +G  G  G P  A  HG   PP N
Sbjct:   988 AGAFGPRGLAGPQGPRGEKGEPGDKGHRGLPGL---KGHNGLQGLPGLAGQHGDQGPPGN 1044

Query:   478 NVPYGSATPPARSGSGQPRGGN 499
             N P G   PP  SG     G N
Sbjct:  1045 NGPAGPRGPPGPSGPPGKDGRN 1066


>UNIPROTKB|F1P0H9
            symbol:COL1A2 "Collagen alpha-2(I) chain" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0001501 "skeletal system
            development" evidence=IEA] [GO:0001568 "blood vessel development"
            evidence=IEA] [GO:0005584 "collagen type I" evidence=IEA]
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0007179
            "transforming growth factor beta receptor signaling pathway"
            evidence=IEA] [GO:0007266 "Rho protein signal transduction"
            evidence=IEA] [GO:0008217 "regulation of blood pressure"
            evidence=IEA] [GO:0030199 "collagen fibril organization"
            evidence=IEA] [GO:0030674 "protein binding, bridging" evidence=IEA]
            [GO:0042802 "identical protein binding" evidence=IEA] [GO:0043589
            "skin morphogenesis" evidence=IEA] [GO:0046332 "SMAD binding"
            evidence=IEA] [GO:0048407 "platelet-derived growth factor binding"
            evidence=IEA] [GO:0070208 "protein heterotrimerization"
            evidence=IEA] [GO:0071230 "cellular response to amino acid
            stimulus" evidence=IEA] InterPro:IPR000885 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS51461 SMART:SM00038 GO:GO:0005615
            GO:GO:0030199 GO:GO:0007179 GO:GO:0007266 GO:GO:0070208
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0071230 GO:GO:0005201
            GeneTree:ENSGT00660000095287 KO:K06236 GO:GO:0005584 CTD:1278
            IPI:IPI00914483 UniGene:Gga.5097 EMBL:AADN02000724
            RefSeq:NP_001073182.2 PRIDE:F1P0H9 Ensembl:ENSGALT00000015703
            GeneID:396243 KEGG:gga:396243 OMA:IGMPGAR NextBio:20816295
            ArrayExpress:F1P0H9 Uniprot:F1P0H9
        Length = 1363

 Score = 130 (50.8 bits), Expect = 0.00013, P = 0.00013
 Identities = 81/262 (30%), Positives = 97/262 (37%)

Query:   255 PVGQNAYEDGYGVPQGH----GPPPSATTAGVVGA-GPNTSTSAYAATQSGTP-MRAAYD 308
             PVG+   E G   P G     GP   A  AG  G  GP     A      G P  R    
Sbjct:   826 PVGRTG-EQGIAGPPGFAGEKGPSGEAGAAGPPGTPGPQGILGAPGIL--GLPGSRGERG 882

Query:   309 IPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYDAQR 367
             +P   G     GP    S  P     +GPS  P   PG +   G  G D   G++    R
Sbjct:   883 LPGIAGATGEPGP-LGVSGPPG---ARGPS-GPVGSPGPNGAPGEAGRDGNPGNDGPPGR 937

Query:   368 --GPNYDIHRG-PSYD-PQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQR 423
                P +   RG P    P   LG     GP+   Q GP  +    PG     GPV     
Sbjct:   938 DGAPGFKGERGAPGNPGPSGALGAP---GPHG--QVGPSGK----PGNRGDPGPVGPVGP 988

Query:   424 APSYIPQ--RGP-GYDLQRGQGYDM--RRAPSYDPSRG-TGFDGAPRGAAPHGQVPPPLN 477
             A ++ P+   GP G   ++G+  D   R  P     +G  G  G P  A  HG   PP N
Sbjct:   989 AGAFGPRGLAGPQGPRGEKGEPGDKGHRGLPGL---KGHNGLQGLPGLAGQHGDQGPPGN 1045

Query:   478 NVPYGSATPPARSGSGQPRGGN 499
             N P G   PP  SG     G N
Sbjct:  1046 NGPAGPRGPPGPSGPPGKDGRN 1067


>UNIPROTKB|F1SNP1
            symbol:COL4A4 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0032836 "glomerular basement membrane development"
            evidence=IEA] [GO:0005605 "basal lamina" evidence=IEA] [GO:0005587
            "collagen type IV" evidence=IEA] [GO:0005201 "extracellular matrix
            structural constituent" evidence=IEA] InterPro:IPR001442
            Pfam:PF01413 PROSITE:PS51403 SMART:SM00111 InterPro:IPR016187
            SUPFAM:SSF56436 GO:GO:0005605 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0005201 GO:GO:0005587 Gene3D:2.170.240.10
            GeneTree:ENSGT00690000101772 GO:GO:0032836 OMA:FRGDMGD
            EMBL:CU466451 EMBL:FP690341 Ensembl:ENSSSCT00000017688
            Uniprot:F1SNP1
        Length = 1711

 Score = 131 (51.2 bits), Expect = 0.00014, P = 0.00014
 Identities = 76/260 (29%), Positives = 89/260 (34%)

Query:   253 GRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG 312
             G P G    E   G+P   GPP      G  G               G P         G
Sbjct:  1207 GVP-GPRGPEGSMGLPGQRGPP-GPECKGEPGPDGRRGEDGLPGPP-GPPGHKGDMGEAG 1263

Query:   313 -PGYEASKG-PGYDASKAPSYDPTKGPSYDPAKG--PGYDPTKGPGYDAQKGSNYDAQRG 368
              PG    KG PG   +  PS    +G + DP  G   G  P   PG     G N   QRG
Sbjct:  1264 CPGAPGPKGFPGRRGTPGPSLIGFRGDTGDPGFGGEKGSSPIGPPGSPGSPGMN--GQRG 1321

Query:   369 PNYDIHRG-PSYDPQRGL-GYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRA-- 424
             P  D   G P    +RGL G    +G   D  R        +PG+   +GP     RA  
Sbjct:  1322 PPGDPALGYPGPPGKRGLFGSPGSKGLRGDPGRPGATGPAGMPGFPGLKGPKGREGRAGF 1381

Query:   425 PSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSA 484
             P  +P   PG+  + G     R  P   P    G  GAP      G + PP      G  
Sbjct:  1382 PG-VPGP-PGHSCESGA--PGRPGPPGLPG-APGSPGAPGWKGQRGDMGPPGPAGMKGVP 1436

Query:   485 TPPARSGSGQPRG--GNPAR 502
               P R G   P G  G P R
Sbjct:  1437 GVPGRPGPDGPPGPPGVPGR 1456


>TAIR|locus:2079502
            symbol:RS31 "arginine/serine-rich splicing factor 31"
            species:3702 "Arabidopsis thaliana" [GO:0000166 "nucleotide
            binding" evidence=IEA] [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0003723 "RNA binding" evidence=ISS] [GO:0005737
            "cytoplasm" evidence=ISM] [GO:0016607 "nuclear speck" evidence=IDA]
            [GO:0008380 "RNA splicing" evidence=NAS] [GO:0000398 "mRNA
            splicing, via spliceosome" evidence=IDA;RCA] [GO:0006355
            "regulation of transcription, DNA-dependent" evidence=RCA]
            [GO:0030422 "production of siRNA involved in RNA interference"
            evidence=RCA] [GO:0035196 "production of miRNAs involved in gene
            silencing by miRNA" evidence=RCA] [GO:0043687 "post-translational
            protein modification" evidence=RCA] [GO:0045893 "positive
            regulation of transcription, DNA-dependent" evidence=RCA]
            [GO:0005681 "spliceosomal complex" evidence=TAS] InterPro:IPR000504
            InterPro:IPR012677 Pfam:PF00076 PROSITE:PS50102 SMART:SM00360
            EMBL:CP002686 GenomeReviews:BA000014_GR GO:GO:0000166 GO:GO:0016607
            Gene3D:3.30.70.330 GO:GO:0005681 GO:GO:0003723 GO:GO:0000398
            EMBL:AL138642 HOGENOM:HOG000276234 KO:K12893 EMBL:X99435
            EMBL:AF439831 EMBL:AY125565 IPI:IPI00530595 PIR:T47978 PIR:T51304
            RefSeq:NP_567120.1 UniGene:At.24231 ProteinModelPortal:P92964
            SMR:P92964 IntAct:P92964 STRING:P92964 PaxDb:P92964 PRIDE:P92964
            EnsemblPlants:AT3G61860.1 GeneID:825359 KEGG:ath:AT3G61860
            TAIR:At3g61860 eggNOG:NOG277933 InParanoid:P92964 OMA:FEYETRQ
            PhylomeDB:P92964 ProtClustDB:CLSN2917489 Genevestigator:P92964
            GermOnline:AT3G61860 Uniprot:P92964
        Length = 264

 Score = 120 (47.3 bits), Expect = 0.00014, P = 0.00014
 Identities = 30/88 (34%), Positives = 41/88 (46%)

Query:   301 TPMRAAYDIPR---GPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG-YDPTKGPGYD 356
             +P R+   + R    P Y     PG     +P Y   + P YD  KGP  Y+  + P Y 
Sbjct:   177 SPRRSLSPVYRRRPSPDYGRRPSPGQGRRPSPDYGRARSPEYDRYKGPAAYERRRSPDY- 235

Query:   357 AQKGSNYDAQRGPNYDIHRGPSYDPQRG 384
              ++ S+Y  QR P YD +R  S  P RG
Sbjct:   236 GRRSSDYGRQRSPGYDRYRSRSPVP-RG 262


>UNIPROTKB|F1MSR8
            symbol:COL2A1 "Collagen alpha-1(II) chain" species:9913
            "Bos taurus" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0005201 GeneTree:ENSGT00660000095287 KO:K06236
            UniGene:Bt.21390 GeneID:407142 KEGG:bta:407142 CTD:1280
            NextBio:20818406 EMBL:DAAA02012985 EMBL:DAAA02012986
            IPI:IPI00786510 RefSeq:NP_001106695.1 PRIDE:F1MSR8
            Ensembl:ENSBTAT00000017509 Uniprot:F1MSR8
        Length = 1418

 Score = 130 (50.8 bits), Expect = 0.00014, P = 0.00014
 Identities = 89/295 (30%), Positives = 112/295 (37%)

Query:   230 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATT--AGVVGAGP 287
             P  DR  D    GA G    +  G P G        G P   GPP       A + G   
Sbjct:    64 PRGDR-GDKGEKGAPGPRGRD--GEP-GTPGNPGPPGPPGPPGPPGLGGNFAAQMAGGFD 119

Query:   288 NTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAKG 344
               +  A      G PM      PRGP G   + GP G+  +     +P   GP   P +G
Sbjct:   120 EKAGGAQMGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GP-RG 174

Query:   345 PGYDPTKGPGYDAQKGS-NYDAQRGP-NYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQRG 400
             P   P K PG D + G      +RGP      RG    P  GL G    RG P  D  +G
Sbjct:   175 PPGPPGK-PGDDGEAGKPGKSGERGPPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAKG 231

Query:   401 ----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS----- 450
                 PG + +   PG +   GP+   +  P    + GP       +G D +  P+     
Sbjct:   232 EAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPGP 289

Query:   451 YDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATP---PARSGSGQPRG--GNP 500
               P+ G GF GAP GA   G+  P     P G+  P   P   GS  P G  GNP
Sbjct:   290 VGPAGGPGFPGAP-GAK--GEAGPTGARGPEGAQGPRGEPGTPGSPGPAGAAGNP 341

 Score = 128 (50.1 bits), Expect = 0.00023, P = 0.00023
 Identities = 88/282 (31%), Positives = 102/282 (36%)

Query:   238 GSYGGATGNSENETSGRPVGQNAYEDGYGVP--QGH-GPPPSATTAGVVGAG--PNTSTS 292
             G  G A  N E    G P G        G P  +G  GPP  A  AG  GA   P     
Sbjct:   723 GPPGPAGANGEKGEVGPP-GPAGTAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGE 781

Query:   293 AYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAP--SYDPTKGPSYDPAKG---- 344
                A Q G    A    P+GP G    +GP G    K    +  P     +  A G    
Sbjct:   782 QGEAGQKGD---AGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGP 838

Query:   345 PGYDPTKGP-GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY 403
             PG +   GP G     G   D  +G      RG S  P R     +Q GP      GP  
Sbjct:   839 PGSNGNPGPPGPPGPSGK--DGPKGA-----RGDSGPPGRAGDPGLQ-GP-----AGPPG 885

Query:   404 ETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYDPSRGTGFD 460
             E    PG D   GP  +    P  +  QRG  G   QRG+ G+     PS +P +  G  
Sbjct:   886 EKGE-PGDDGPSGP--DGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGK-QGAP 941

Query:   461 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 502
             GA     P G V PP    P G    P R GS     G P R
Sbjct:   942 GASGDRGPPGPVGPPGLTGPAGE---PGREGS-PGADGPPGR 979


>MGI|MGI:88467
            symbol:Col1a1 "collagen, type I, alpha 1" species:10090 "Mus
            musculus" [GO:0001501 "skeletal system development"
            evidence=ISO;IMP] [GO:0001568 "blood vessel development"
            evidence=ISO;IMP] [GO:0001957 "intramembranous ossification"
            evidence=IGI] [GO:0001958 "endochondral ossification" evidence=IMP]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IDA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005576 "extracellular region" evidence=IEA] [GO:0005578
            "proteinaceous extracellular matrix" evidence=IDA] [GO:0005581
            "collagen" evidence=IMP;IDA] [GO:0005584 "collagen type I"
            evidence=ISO;IMP;IDA] [GO:0005615 "extracellular space"
            evidence=ISO] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0007601
            "visual perception" evidence=ISO] [GO:0007605 "sensory perception
            of sound" evidence=ISO] [GO:0010718 "positive regulation of
            epithelial to mesenchymal transition" evidence=ISO] [GO:0010812
            "negative regulation of cell-substrate adhesion" evidence=IDA]
            [GO:0015031 "protein transport" evidence=IMP] [GO:0030199 "collagen
            fibril organization" evidence=ISO] [GO:0030335 "positive regulation
            of cell migration" evidence=ISO] [GO:0031012 "extracellular matrix"
            evidence=IDA] [GO:0032964 "collagen biosynthetic process"
            evidence=ISO] [GO:0034504 "protein localization to nucleus"
            evidence=ISO] [GO:0034505 "tooth mineralization" evidence=ISO]
            [GO:0042060 "wound healing" evidence=ISO] [GO:0042802 "identical
            protein binding" evidence=ISO] [GO:0043588 "skin development"
            evidence=IMP] [GO:0043589 "skin morphogenesis" evidence=ISO]
            [GO:0045893 "positive regulation of transcription, DNA-dependent"
            evidence=ISO] [GO:0046872 "metal ion binding" evidence=IEA]
            [GO:0048407 "platelet-derived growth factor binding" evidence=ISO]
            [GO:0048705 "skeletal system morphogenesis" evidence=IGI]
            [GO:0048706 "embryonic skeletal system development" evidence=ISO]
            [GO:0060325 "face morphogenesis" evidence=IGI] [GO:0060346 "bone
            trabecula formation" evidence=IGI] [GO:0060351 "cartilage
            development involved in endochondral bone morphogenesis"
            evidence=IMP] [GO:0070208 "protein heterotrimerization"
            evidence=IDA] [GO:0071230 "cellular response to amino acid
            stimulus" evidence=IDA] [GO:0090263 "positive regulation of
            canonical Wnt receptor signaling pathway" evidence=ISO]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 MGI:MGI:88467 GO:GO:0005737
            GO:GO:0045893 GO:GO:0043588 GO:GO:0005615 GO:GO:0071363
            GO:GO:0046872 GO:GO:0015031 GO:GO:0007601 GO:GO:0071300
            GO:GO:0043434 GO:GO:0030199 GO:GO:0007584 GO:GO:0010035
            GO:GO:0007605 GO:GO:0010718 GO:GO:0030335 GO:GO:0042542
            GO:GO:0070208 InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793
            GO:GO:0071260 GO:GO:0001568 GO:GO:0001649 GO:GO:0051591
            GO:GO:0034505 GO:GO:0090263 GO:GO:0010812 GO:GO:0060325
            GO:GO:0032964 GO:GO:0071230 GO:GO:0048706 GO:GO:0001957
            GO:GO:0034504 GO:GO:0001958 GO:GO:0060351 GO:GO:0005201
            GO:GO:0043589 CTD:1277 GeneTree:ENSGT00660000095287
            HOVERGEN:HBG004933 KO:K06236 OMA:VAYMDQQ OrthoDB:EOG4S4PHP
            GO:GO:0005584 GO:GO:0060346 ChiTaRS:COL1A1 GO:GO:0031960
            EMBL:U08020 EMBL:AL662790 EMBL:AL606480 EMBL:BC050014 EMBL:BC059281
            EMBL:K01688 EMBL:S67530 EMBL:S67482 EMBL:X54876 EMBL:M14423
            EMBL:M17491 EMBL:K03036 EMBL:K03029 EMBL:K03030 EMBL:K03031
            EMBL:K03032 EMBL:K03033 EMBL:K03034 EMBL:K03035 EMBL:X06753
            EMBL:X15896 EMBL:X57981 IPI:IPI00329872 IPI:IPI00623191 PIR:I49558
            PIR:S57243 RefSeq:NP_031768.2 UniGene:Mm.277735 UniGene:Mm.458212
            ProteinModelPortal:P11087 SMR:P11087 IntAct:P11087 STRING:P11087
            PhosphoSite:P11087 PaxDb:P11087 PRIDE:P11087
            Ensembl:ENSMUST00000001547 GeneID:12842 KEGG:mmu:12842
            UCSC:uc007kzn.1 InParanoid:P11087 NextBio:282376 PMAP-CutDB:P11087
            Bgee:P11087 CleanEx:MM_COL1A1 Genevestigator:P11087
            GermOnline:ENSMUSG00000001506 Uniprot:P11087
        Length = 1453

 Score = 130 (50.8 bits), Expect = 0.00015, P = 0.00015
 Identities = 79/254 (31%), Positives = 95/254 (37%)

Query:   255 PVGQNAYEDGYGVPQGHGPPPSAT----TAGVVGAGPNTSTSAYAATQSGTPMRAAYDIP 310
             P+G N    G   P+G   PP AT     AG VG  P  S +A      G   +     P
Sbjct:   841 PIG-NVGAPGPKGPRGAAGPPGATGFPGAAGRVGP-PGPSGNAGPPGPPGPVGKEGGKGP 898

Query:   311 RGPGYEASKGPGYDASKAPSYDPTKGPSYDP-AKGPGYDP-TKGP-GYDAQKGS-NYDAQ 366
             RG    A + PG      P   P  G    P A GP   P T GP G   Q+G      Q
Sbjct:   899 RGETGPAGR-PGEVGPPGPP-GPA-GEKGSPGADGPAGSPGTPGPQGIAGQRGVVGLPGQ 955

Query:   367 RGPN-YDIHRGPSYDP-QRG-LGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQR 423
             RG   +    GPS +P ++G  G   +RGP   M  GP       PG     GP  E+ R
Sbjct:   956 RGERGFPGLPGPSGEPGKQGPSGSSGERGPPGPM--GP-------PGL---AGPPGESGR 1003

Query:   424 APSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGS 483
               S   +  PG D   G   D        P    G  GAP    P G+        P G 
Sbjct:  1004 EGSPGAEGSPGRDGAPGAKGDRGETGPAGPPGAPGAPGAPGPVGPAGKNGDRGETGPAGP 1063

Query:   484 ATPPARSGSGQPRG 497
             A P   +G+  P G
Sbjct:  1064 AGPIGPAGARGPAG 1077


>UNIPROTKB|P04280
            symbol:PRB1 "Basic salivary proline-rich protein 1"
            species:9606 "Homo sapiens" [GO:0008150 "biological_process"
            evidence=ND] [GO:0003674 "molecular_function" evidence=ND]
            [GO:0005576 "extracellular region" evidence=NAS] GO:GO:0005576
            PIR:B40750 InterPro:IPR026086 PANTHER:PTHR23203 EMBL:K03204
            EMBL:K03205 EMBL:K03206 EMBL:S52986 EMBL:M97220 EMBL:K02575
            EMBL:K02576 EMBL:X07516 EMBL:X07517 EMBL:S62928 EMBL:S62941
            IPI:IPI00023038 PIR:C38355 PIR:D40750 RefSeq:NP_005030.2
            RefSeq:NP_955385.1 RefSeq:NP_955386.1 UniGene:Hs.631726
            ProteinModelPortal:P04280 STRING:P04280 PhosphoSite:P04280
            DMDM:52001469 PRIDE:P04280 GeneID:5542 KEGG:hsa:5542 CTD:5542
            GeneCards:GC12M011504 HGNC:HGNC:9337 MIM:180989 neXtProt:NX_P04280
            PharmGKB:PA33699 KO:K13911 GenomeRNAi:5542 NextBio:21470
            ArrayExpress:P04280 CleanEx:HS_PRB1 Genevestigator:P04280
            Uniprot:P04280
        Length = 392

 Score = 123 (48.4 bits), Expect = 0.00015, P = 0.00015
 Identities = 76/279 (27%), Positives = 94/279 (33%)

Query:   241 GGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNT-STSAYAATQS 299
             GG          G+P G      G   PQG  PPP     G    G  + S  +      
Sbjct:    43 GGNKPQGPPPPPGKPQGPPP--QGGNKPQG--PPPPGKPQGPPPQGDKSRSPRSPPGKPQ 98

Query:   300 GTPMRAAYDIPRGPGYEASK--GPGYDASKAPSYDPTKG------PSYDPAKGPGYDPTK 351
             G P +     P+GP     K  GP       P   P  G      P  D ++ P   P K
Sbjct:    99 GPPPQGGNQ-PQGPPPPPGKPQGPPPQGGNKPQGPPPPGKPQGPPPQGDKSQSPRSPPGK 157

Query:   352 GPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP---GYETQ-- 406
               G   Q G N      P     +GP   P +G G   Q  P     +GP   G ++Q  
Sbjct:   158 PQGPPPQ-GGNQPQGPPPPPGKPQGP---PPQG-GNKPQGPPPPGKPQGPPPQGDKSQSP 212

Query:   407 RVP-----GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDG 461
             R P     G   Q G   +    P   PQ  P     R QG      P   P +G     
Sbjct:   213 RSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPQQGGNRPQGPPPPGKPQGPPPQGDK-SR 271

Query:   462 APRGAAPHGQVPPPLN-NVPYGSATPPARSGSGQPRGGN 499
             +P+      Q PPP   N P G   PP +     P+GGN
Sbjct:   272 SPQSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPPQGGN 310


>UNIPROTKB|P02459
            symbol:COL2A1 "Collagen alpha-1(II) chain" species:9913
            "Bos taurus" [GO:0071773 "cellular response to BMP stimulus"
            evidence=IEA] [GO:0071599 "otic vesicle development" evidence=IEA]
            [GO:0060351 "cartilage development involved in endochondral bone
            morphogenesis" evidence=IEA] [GO:0060272 "embryonic skeletal joint
            morphogenesis" evidence=IEA] [GO:0060174 "limb bud formation"
            evidence=IEA] [GO:0060021 "palate development" evidence=IEA]
            [GO:0048407 "platelet-derived growth factor binding" evidence=IEA]
            [GO:0043066 "negative regulation of apoptotic process"
            evidence=IEA] [GO:0042802 "identical protein binding" evidence=IEA]
            [GO:0042472 "inner ear morphogenesis" evidence=IEA] [GO:0030903
            "notochord development" evidence=IEA] [GO:0030199 "collagen fibril
            organization" evidence=IEA] [GO:0010468 "regulation of gene
            expression" evidence=IEA] [GO:0007605 "sensory perception of sound"
            evidence=IEA] [GO:0007601 "visual perception" evidence=IEA]
            [GO:0007417 "central nervous system development" evidence=IEA]
            [GO:0006029 "proteoglycan metabolic process" evidence=IEA]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005615 "extracellular
            space" evidence=IEA] [GO:0005604 "basement membrane" evidence=IEA]
            [GO:0005585 "collagen type II" evidence=IEA] [GO:0003007 "heart
            morphogenesis" evidence=IEA] [GO:0002062 "chondrocyte
            differentiation" evidence=IEA] [GO:0001958 "endochondral
            ossification" evidence=IEA] [GO:0001894 "tissue homeostasis"
            evidence=IEA] [GO:0001502 "cartilage condensation" evidence=IEA]
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0005737 GO:GO:0043066
            GO:GO:0005615 GO:GO:0046872 GO:GO:0003007 GO:GO:0007601
            GO:GO:0030199 GO:GO:0007417 GO:GO:0042472 GO:GO:0001894
            GO:GO:0007605 GO:GO:0071773 InterPro:IPR008160 Pfam:PF01391
            eggNOG:NOG12793 GO:GO:0071599 GO:GO:0005604 GO:GO:0001502
            GO:GO:0060021 GO:GO:0002062 GO:GO:0010468 GO:GO:0060272
            GO:GO:0006029 GO:GO:0001958 GO:GO:0060351 GO:GO:0005201
            GeneTree:ENSGT00660000095287 HOGENOM:HOG000085654
            HOVERGEN:HBG004933 KO:K06236 EMBL:AAFC03017082 EMBL:AAFC03017085
            EMBL:AAFC03056593 EMBL:L28918 EMBL:AF138883 EMBL:AF138957
            EMBL:X02420 IPI:IPI01028216 PIR:A90369 PIR:I45876
            RefSeq:NP_001001135.2 UniGene:Bt.21390 IntAct:P02459 STRING:P02459
            PRIDE:P02459 Ensembl:ENSBTAT00000017505 GeneID:407142
            KEGG:bta:407142 CTD:1280 InParanoid:Q9XT25 OMA:SSCRICV
            Reactome:REACT_133391 NextBio:20818406 PMAP-CutDB:P02459
            ArrayExpress:P02459 GO:GO:0005585 GO:GO:0060174 GO:GO:0030903
            Uniprot:P02459
        Length = 1487

 Score = 130 (50.8 bits), Expect = 0.00015, P = 0.00015
 Identities = 89/295 (30%), Positives = 112/295 (37%)

Query:   230 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATT--AGVVGAGP 287
             P  DR  D    GA G    +  G P G        G P   GPP       A + G   
Sbjct:   133 PRGDR-GDKGEKGAPGPRGRD--GEP-GTPGNPGPPGPPGPPGPPGLGGNFAAQMAGGFD 188

Query:   288 NTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAKG 344
               +  A      G PM      PRGP G   + GP G+  +     +P   GP   P +G
Sbjct:   189 EKAGGAQMGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GP-RG 243

Query:   345 PGYDPTKGPGYDAQKGS-NYDAQRGP-NYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQRG 400
             P   P K PG D + G      +RGP      RG    P  GL G    RG P  D  +G
Sbjct:   244 PPGPPGK-PGDDGEAGKPGKSGERGPPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAKG 300

Query:   401 ----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS----- 450
                 PG + +   PG +   GP+   +  P    + GP       +G D +  P+     
Sbjct:   301 EAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPGP 358

Query:   451 YDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATP---PARSGSGQPRG--GNP 500
               P+ G GF GAP GA   G+  P     P G+  P   P   GS  P G  GNP
Sbjct:   359 VGPAGGPGFPGAP-GAK--GEAGPTGARGPEGAQGPRGEPGTPGSPGPAGAAGNP 410

 Score = 128 (50.1 bits), Expect = 0.00025, P = 0.00025
 Identities = 88/282 (31%), Positives = 102/282 (36%)

Query:   238 GSYGGATGNSENETSGRPVGQNAYEDGYGVP--QGH-GPPPSATTAGVVGAG--PNTSTS 292
             G  G A  N E    G P G        G P  +G  GPP  A  AG  GA   P     
Sbjct:   792 GPPGPAGANGEKGEVGPP-GPAGTAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGE 850

Query:   293 AYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAP--SYDPTKGPSYDPAKG---- 344
                A Q G    A    P+GP G    +GP G    K    +  P     +  A G    
Sbjct:   851 QGEAGQKGD---AGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGP 907

Query:   345 PGYDPTKGP-GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY 403
             PG +   GP G     G   D  +G      RG S  P R     +Q GP      GP  
Sbjct:   908 PGSNGNPGPPGPPGPSGK--DGPKGA-----RGDSGPPGRAGDPGLQ-GP-----AGPPG 954

Query:   404 ETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYDPSRGTGFD 460
             E    PG D   GP  +    P  +  QRG  G   QRG+ G+     PS +P +  G  
Sbjct:   955 EKGE-PGDDGPSGP--DGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGK-QGAP 1010

Query:   461 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 502
             GA     P G V PP    P G    P R GS     G P R
Sbjct:  1011 GASGDRGPPGPVGPPGLTGPAGE---PGREGS-PGADGPPGR 1048


>UNIPROTKB|P02458
            symbol:COL2A1 "Collagen alpha-1(II) chain" species:9606
            "Homo sapiens" [GO:0046872 "metal ion binding" evidence=IEA]
            [GO:0001502 "cartilage condensation" evidence=IEA] [GO:0001894
            "tissue homeostasis" evidence=IEA] [GO:0001958 "endochondral
            ossification" evidence=IEA] [GO:0002062 "chondrocyte
            differentiation" evidence=IEA] [GO:0003007 "heart morphogenesis"
            evidence=IEA] [GO:0005604 "basement membrane" evidence=IEA]
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0006029
            "proteoglycan metabolic process" evidence=IEA] [GO:0007417 "central
            nervous system development" evidence=IEA] [GO:0010468 "regulation
            of gene expression" evidence=IEA] [GO:0030903 "notochord
            development" evidence=IEA] [GO:0042472 "inner ear morphogenesis"
            evidence=IEA] [GO:0043066 "negative regulation of apoptotic
            process" evidence=IEA] [GO:0060021 "palate development"
            evidence=IEA] [GO:0060174 "limb bud formation" evidence=IEA]
            [GO:0060351 "cartilage development involved in endochondral bone
            morphogenesis" evidence=IEA] [GO:0071599 "otic vesicle development"
            evidence=IEA] [GO:0071773 "cellular response to BMP stimulus"
            evidence=IEA] [GO:0007601 "visual perception" evidence=IMP]
            [GO:0048407 "platelet-derived growth factor binding" evidence=IDA]
            [GO:0005576 "extracellular region" evidence=TAS] [GO:0005788
            "endoplasmic reticulum lumen" evidence=TAS] [GO:0007411 "axon
            guidance" evidence=TAS] [GO:0030198 "extracellular matrix
            organization" evidence=TAS] [GO:0042802 "identical protein binding"
            evidence=NAS] [GO:0001501 "skeletal system development"
            evidence=IMP] [GO:0007605 "sensory perception of sound"
            evidence=IMP] [GO:0060272 "embryonic skeletal joint morphogenesis"
            evidence=IMP] [GO:0051216 "cartilage development" evidence=TAS]
            [GO:0030199 "collagen fibril organization" evidence=IMP]
            [GO:0005585 "collagen type II" evidence=IDA] [GO:0030020
            "extracellular matrix structural constituent conferring tensile
            strength" evidence=IC] InterPro:IPR000885 InterPro:IPR001007
            Pfam:PF00093 Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208
            PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038 SMART:SM00214
            Reactome:REACT_118779 Reactome:REACT_111045 Reactome:REACT_111102
            GO:GO:0007411 GO:GO:0043066 GO:GO:0005615 PDB:2FSE PDBsum:2FSE
            PDB:2SEB PDBsum:2SEB GO:GO:0046872 GO:GO:0003007 GO:GO:0007601
            GO:GO:0030199 GO:GO:0007417 GO:GO:0005788 GO:GO:0042472
            GO:GO:0001894 GO:GO:0042802 GO:GO:0007605 GO:GO:0071773
            GO:GO:0051216 InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793
            GO:GO:0071599 GO:GO:0005604 GO:GO:0001502 GO:GO:0060021
            GO:GO:0002062 GO:GO:0010468 GO:GO:0060272 GO:GO:0006029
            GO:GO:0001958 GO:GO:0060351 HOVERGEN:HBG004933 KO:K06236
            DrugBank:DB00048 GO:GO:0048407 CTD:1280 OMA:SSCRICV GO:GO:0005585
            GO:GO:0060174 GO:GO:0030903 OrthoDB:EOG4FTW1C EMBL:X16468
            EMBL:L10347 EMBL:BT007205 EMBL:AC004801 EMBL:BC007252 EMBL:BC116449
            EMBL:X16711 EMBL:M25730 EMBL:M32168 EMBL:M25655 EMBL:M25656
            EMBL:M64345 EMBL:M60299 EMBL:M25698 EMBL:X58709 EMBL:X57010
            EMBL:U15195 EMBL:X13783 EMBL:M25728 EMBL:X02371 EMBL:X02372
            EMBL:X02373 EMBL:X02374 EMBL:X02375 EMBL:X02376 EMBL:X02377
            EMBL:X02378 EMBL:X16158 EMBL:J00116 EMBL:L00977 EMBL:M63281
            EMBL:M27468 EMBL:X06268 EMBL:X00339 EMBL:M12048 IPI:IPI00186460
            IPI:IPI00748487 IPI:IPI00936892 PIR:A38513 RefSeq:NP_001835.3
            RefSeq:NP_149162.2 UniGene:Hs.408182 PDB:1U5M PDBsum:1U5M
            ProteinModelPortal:P02458 SMR:P02458 IntAct:P02458
            MINT:MINT-6796075 STRING:P02458 PhosphoSite:P02458 DMDM:124056489
            PaxDb:P02458 PRIDE:P02458 DNASU:1280 Ensembl:ENST00000337299
            Ensembl:ENST00000380518 GeneID:1280 KEGG:hsa:1280 UCSC:uc001rqt.3
            UCSC:uc001rqu.3 UCSC:uc001rqv.3 GeneCards:GC12M048266
            HGNC:HGNC:2200 HPA:CAB002214 MIM:108300 MIM:120140 MIM:132450
            MIM:150600 MIM:151210 MIM:156550 MIM:183900 MIM:184250 MIM:200610
            MIM:271700 MIM:604864 MIM:608805 MIM:609162 MIM:609508
            neXtProt:NX_P02458 Orphanet:93296 Orphanet:209867 Orphanet:137678
            Orphanet:86820 Orphanet:93297 Orphanet:485 Orphanet:2380
            Orphanet:93279 Orphanet:166011 Orphanet:1427 Orphanet:85166
            Orphanet:93346 Orphanet:94068 Orphanet:93315 Orphanet:1856
            Orphanet:90653 PharmGKB:PA26715 ChiTaRS:COL2A1
            EvolutionaryTrace:P02458 GenomeRNAi:1280 NextBio:5171
            PMAP-CutDB:P02458 Bgee:P02458 Genevestigator:P02458
            GermOnline:ENSG00000139219 GO:GO:0030020 Uniprot:P02458
        Length = 1487

 Score = 130 (50.8 bits), Expect = 0.00015, P = 0.00015
 Identities = 90/295 (30%), Positives = 113/295 (38%)

Query:   230 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATT--AGVVGAGP 287
             P  DR  D    GA G    +  G P G        G P   GPP       A + G   
Sbjct:   133 PRGDR-GDKGEKGAPGPRGRD--GEP-GTPGNPGPPGPPGPPGPPGLGGNFAAQMAGGFD 188

Query:   288 NTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAKG 344
               +  A      G PM      PRGP G   + GP G+  +     +P   GP   P +G
Sbjct:   189 EKAGGAQLGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GP-RG 243

Query:   345 PGYDPTKGPGYDAQKGSNYDA-QRGP-NYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQRG 400
             P   P K PG D + G    A +RGP      RG    P  GL G    RG P  D  +G
Sbjct:   244 PPGPPGK-PGDDGEAGKPGKAGERGPPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAKG 300

Query:   401 ----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS----- 450
                 PG + +   PG +   GP+   +  P    + GP       +G D +  P+     
Sbjct:   301 EAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPGP 358

Query:   451 YDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATP---PARSGSGQPRG--GNP 500
               P+ G GF GAP GA   G+  P     P G+  P   P   GS  P G  GNP
Sbjct:   359 VGPAGGPGFPGAP-GAK--GEAGPTGARGPEGAQGPRGEPGTPGSPGPAGASGNP 410

 Score = 124 (48.7 bits), Expect = 0.00067, P = 0.00067
 Identities = 88/282 (31%), Positives = 101/282 (35%)

Query:   238 GSYGGATGNSENETSGRPVGQNAYEDGYGVP--QGH-GPPPSATTAGVVGAG--PNTSTS 292
             G  G A  N E    G P G        G P  +G  GPP  A  AG  GA   P     
Sbjct:   792 GPPGPAGANGEKGEVGPP-GPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGE 850

Query:   293 AYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAP--SYDPTKGPSYDPAKG---- 344
                A Q G    A    P+GP G    +GP G    K    +  P     +  A G    
Sbjct:   851 QGEAGQKGD---AGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGP 907

Query:   345 PGYDPTKGP-GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY 403
             PG +   GP G     G   D  +G      RG S  P R  G    +GP      GP  
Sbjct:   908 PGSNGNPGPPGPPGPSGK--DGPKGA-----RGDSGPPGRA-GEPGLQGP-----AGPPG 954

Query:   404 ETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYDPSRGTGFD 460
             E    PG D   G   E    P  +  QRG  G   QRG+ G+     PS +P +  G  
Sbjct:   955 EKGE-PGDDGPSGA--EGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGK-QGAP 1010

Query:   461 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 502
             GA     P G V PP    P G    P R GS     G P R
Sbjct:  1011 GASGDRGPPGPVGPPGLTGPAGE---PGREGS-PGADGPPGR 1048


>UNIPROTKB|A7E348
            symbol:PYGO2 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0060070 "canonical Wnt receptor signaling pathway"
            evidence=IEA] [GO:0060021 "palate development" evidence=IEA]
            [GO:0051569 "regulation of histone H3-K4 methylation" evidence=IEA]
            [GO:0048589 "developmental growth" evidence=IEA] [GO:0042393
            "histone binding" evidence=IEA] [GO:0033599 "regulation of mammary
            gland epithelial cell proliferation" evidence=IEA] [GO:0030879
            "mammary gland development" evidence=IEA] [GO:0009791
            "post-embryonic development" evidence=IEA] [GO:0007420 "brain
            development" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            [GO:0002088 "lens development in camera-type eye" evidence=IEA]
            [GO:0001822 "kidney development" evidence=IEA] [GO:0001701 "in
            utero embryonic development" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] InterPro:IPR001965 InterPro:IPR019787
            Pfam:PF00628 PROSITE:PS50016 SMART:SM00249 GO:GO:0005634
            GO:GO:0007420 GO:GO:0046872 GO:GO:0008270 GO:GO:0001701
            GO:GO:0009791 GO:GO:0001822 Gene3D:3.30.40.10 InterPro:IPR011011
            InterPro:IPR013083 SUPFAM:SSF57903 GO:GO:0048589 InterPro:IPR019786
            PROSITE:PS01359 GO:GO:0060021 GO:GO:0060070 GO:GO:0030879
            GO:GO:0033599 GO:GO:0051569 GO:GO:0002088 eggNOG:NOG72798
            HOGENOM:HOG000001580 HOVERGEN:HBG053774
            GeneTree:ENSGT00530000063948 CTD:90780 OMA:PGLVYPC
            OrthoDB:EOG4QZ7MB EMBL:DAAA02007156 EMBL:BC151715 IPI:IPI00866934
            RefSeq:NP_001095712.1 UniGene:Bt.102068 SMR:A7E348
            Ensembl:ENSBTAT00000005670 GeneID:540401 KEGG:bta:540401
            InParanoid:A7E348 NextBio:20878610 Uniprot:A7E348
        Length = 405

 Score = 123 (48.4 bits), Expect = 0.00015, P = 0.00015
 Identities = 78/298 (26%), Positives = 111/298 (37%)

Query:   227 MNAPNVDRRADGSYGGATGN-SENETSGRP-----VGQNAYEDGYGVPQGHGPPPSATTA 280
             M +P   RR   + G A  + +E      P     V  N +ED +G P+  G  P    +
Sbjct:    38 MKSPEKKRRKSNTQGPAYSHLTEFAPPPTPMVDHLVASNPFEDDFGAPKVGGAAPPFLGS 97

Query:   281 GVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTK-GPSY 339
              V   G           Q G     A  +P G G     GP     + P + P+  GP++
Sbjct:    98 PVPFGG--------FRVQGGM----AGQVPPGYGTGGGGGPQPLRRQPPPFPPSPMGPAF 145

Query:   340 D-PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYD-PQRGLGY----DMQRGP 393
             + P +GPGY P     + +Q    ++   G N+    G     P  G G      M + P
Sbjct:   146 NMPPQGPGYPPPGNMNFPSQP---FNQPLGQNFSPPGGQMMPGPVGGFGPMISPTMGQPP 202

Query:   394 NYDMQRGPGYETQRVPGYDVQRGPVYE--AQRAPSYIPQRGP--GYDLQ-RGQGYDMRRA 448
               ++  GP    QR        GP  +   Q  PS  P   P  G D    G G +    
Sbjct:   203 RGEL--GPPSLPQRFAQPGAPFGPSLQRPGQGLPSLPPNTSPFPGPDPGFPGPGGEDGGK 260

Query:   449 PSYDPSRGTGFDGAPRGAAP----HGQVPP-PLNNVPYGSATPPARSGS--GQPRGGN 499
             P  +P   T F   P   +P    +G  P  P N+   G  TP A S +  G+  GG+
Sbjct:   261 P-LNPPAATAFPQEPHSGSPAAAVNGNQPSFPPNSSGRGGGTPDANSLAPPGKAGGGS 317


>ZFIN|ZDB-GENE-041008-78
            symbol:polr2a "polymerase (RNA) II (DNA directed)
            polypeptide A" species:7955 "Danio rerio" [GO:0003677 "DNA binding"
            evidence=IEA] [GO:0003899 "DNA-directed RNA polymerase activity"
            evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0005665 "DNA-directed RNA polymerase II, core
            complex" evidence=IEA] [GO:0006366 "transcription from RNA
            polymerase II promoter" evidence=IEA] [GO:0016740 "transferase
            activity" evidence=IEA] [GO:0016779 "nucleotidyltransferase
            activity" evidence=IEA] InterPro:IPR000684 InterPro:IPR000722
            InterPro:IPR006592 InterPro:IPR007066 InterPro:IPR007073
            InterPro:IPR007075 InterPro:IPR007080 InterPro:IPR007081
            InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983 Pfam:PF04990
            Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000 Pfam:PF05001
            PROSITE:PS00115 SMART:SM00663 ZFIN:ZDB-GENE-041008-78 GO:GO:0003677
            GO:GO:0006366 Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899
            GO:GO:0005665 GeneTree:ENSGT00700000104490 EMBL:AL929346
            IPI:IPI00608319 Ensembl:ENSDART00000077495 Bgee:F1Q9K4
            Uniprot:F1Q9K4
        Length = 1965

 Score = 131 (51.2 bits), Expect = 0.00016, P = 0.00016
 Identities = 67/234 (28%), Positives = 87/234 (37%)

Query:   270 GHGPPPSATTAGVVGAGPNTSTSAYAATQ----SG-TPMRAAYDIPRGPGYEASKGPGYD 324
             G  P P +  +  +      +T AY A      SG TP  A +  P      +   PGY 
Sbjct:  1501 GSAPSPMSGMSPAMTPWNTGATPAYGAWSPSVGSGMTPGAAGFS-PSAASDASGFSPGYS 1559

Query:   325 A--SKAPSYDPTKGPS--YDPAKG---PGYDPTKGPGYDAQK-GSNYDAQRGPNYDIHRG 376
                S  P    + GP+  Y P+ G   P Y PT  P Y+ +  G  Y  Q  P Y     
Sbjct:  1560 PAWSPTPGSPGSPGPASPYIPSPGALSPNYSPTS-PAYEPRSPGGGYTPQ-SPGYS-PTS 1616

Query:   377 PSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYD 436
             PSY P     Y     PNY     P Y     P Y     P Y +  +PSY P   P Y 
Sbjct:  1617 PSYSPTSP-SYS-PTSPNYS-PTSPSYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS 1669

Query:   437 LQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARS 490
                   Y    +PSY P+  +    +P   +P      P +  P  S T P+ S
Sbjct:  1670 -PTSPSYSPT-SPSYSPTSPSYSPTSP-SYSPTSPSYSPTS--PSYSPTSPSYS 1718


>UNIPROTKB|Q5T171
            symbol:PYGO2 "Pygopus homolog 2" species:9606 "Homo
            sapiens" [GO:0008270 "zinc ion binding" evidence=IEA] [GO:0001701
            "in utero embryonic development" evidence=IEA] [GO:0001822 "kidney
            development" evidence=IEA] [GO:0002088 "lens development in
            camera-type eye" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            [GO:0007420 "brain development" evidence=IEA] [GO:0009791
            "post-embryonic development" evidence=IEA] [GO:0030879 "mammary
            gland development" evidence=IEA] [GO:0033599 "regulation of mammary
            gland epithelial cell proliferation" evidence=IEA] [GO:0042393
            "histone binding" evidence=IEA] [GO:0048589 "developmental growth"
            evidence=IEA] [GO:0051569 "regulation of histone H3-K4 methylation"
            evidence=IEA] [GO:0060021 "palate development" evidence=IEA]
            [GO:0060070 "canonical Wnt receptor signaling pathway"
            evidence=IEA] InterPro:IPR001965 InterPro:IPR019787 Pfam:PF00628
            PROSITE:PS50016 SMART:SM00249 GO:GO:0005634 GO:GO:0007420
            GO:GO:0046872 GO:GO:0008270 GO:GO:0001701 GO:GO:0009791
            GO:GO:0001822 EMBL:AL451085 Gene3D:3.30.40.10 InterPro:IPR011011
            InterPro:IPR013083 SUPFAM:SSF57903 GO:GO:0048589 InterPro:IPR019786
            PROSITE:PS01359 GO:GO:0060021 EMBL:CH471121 GO:GO:0060070
            GO:GO:0030879 GO:GO:0033599 GO:GO:0051569 GO:GO:0002088
            HOGENOM:HOG000001580 HOVERGEN:HBG053774 UniGene:Hs.533597
            HGNC:HGNC:30257 IPI:IPI00642524 SMR:Q5T171 STRING:Q5T171
            Ensembl:ENST00000368456 Uniprot:Q5T171
        Length = 369

 Score = 122 (48.0 bits), Expect = 0.00017, P = 0.00017
 Identities = 80/302 (26%), Positives = 113/302 (37%)

Query:   227 MNAPNVDRRADGSYGGATGN-SENETSGRP-----VGQNAYEDGYGVPQ-GHGPPPSATT 279
             M +P   RR   + G A  + +E      P     V  N +ED +G P+ G   PP   +
Sbjct:     1 MKSPEKKRRKSNTQGPAYSHLTEFAPPPTPMVDHLVASNPFEDDFGAPKVGVAAPPFLGS 60

Query:   280 AGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKA--PSYDPTK-G 336
                 G             Q G     A  +P  PGY    G G    +   P + P   G
Sbjct:    61 PVPFGG---------FRVQGGM----AGQVP--PGYSTGGGGGPQPLRRQPPPFPPNPMG 105

Query:   337 PSYD-PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYD-PQRGLGY----DMQ 390
             P+++ P +GPGY P     + +Q    ++   G N+    G     P  G G      M 
Sbjct:   106 PAFNMPPQGPGYPPPGNMNFPSQP---FNQPLGQNFSPPSGQMMPGPVGGFGPMISPTMG 162

Query:   391 RGPNYDMQRGPGYETQRV--PGYDVQRGPVYE-AQRAPSYIPQRGP--GYDLQ-RGQGYD 444
             + P  ++  GP   +QR   PG      P+    Q  PS  P   P  G D    G G +
Sbjct:   163 QPPRAEL--GPPSLSQRFAQPGAPFGPSPLQRPGQGLPSLPPNTSPFPGPDPGFPGPGGE 220

Query:   445 MRRAPSYDPSRGTGFDGAPRGAAP----HGQVPP-PLNNVPYGSATPPARSGS--GQPRG 497
                 P  +P   T F   P   +P    +G  P  P N+   G  TP A S +  G+  G
Sbjct:   221 DGGKP-LNPPASTAFPQEPHSGSPAAAVNGNQPSFPPNSSGRGGGTPDANSLAPPGKAGG 279

Query:   498 GN 499
             G+
Sbjct:   280 GS 281


>MGI|MGI:88452
            symbol:Col2a1 "collagen, type II, alpha 1" species:10090 "Mus
            musculus" [GO:0001501 "skeletal system development" evidence=ISO]
            [GO:0001502 "cartilage condensation" evidence=IMP] [GO:0001894
            "tissue homeostasis" evidence=IMP] [GO:0001958 "endochondral
            ossification" evidence=IMP] [GO:0002062 "chondrocyte
            differentiation" evidence=IMP] [GO:0003007 "heart morphogenesis"
            evidence=IMP] [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0005578 "proteinaceous extracellular matrix"
            evidence=IEA] [GO:0005581 "collagen" evidence=IDA] [GO:0005585
            "collagen type II" evidence=ISO;IDA;IMP] [GO:0005604 "basement
            membrane" evidence=IDA] [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0006029
            "proteoglycan metabolic process" evidence=IMP] [GO:0007601 "visual
            perception" evidence=ISO] [GO:0007605 "sensory perception of sound"
            evidence=ISO] [GO:0010468 "regulation of gene expression"
            evidence=IMP] [GO:0030199 "collagen fibril organization"
            evidence=ISO;IMP] [GO:0031012 "extracellular matrix" evidence=IDA]
            [GO:0035108 "limb morphogenesis" evidence=IMP] [GO:0042472 "inner
            ear morphogenesis" evidence=IMP] [GO:0042802 "identical protein
            binding" evidence=IPI] [GO:0043066 "negative regulation of
            apoptotic process" evidence=IMP] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0048407 "platelet-derived growth factor binding"
            evidence=ISO] [GO:0048705 "skeletal system morphogenesis"
            evidence=IMP] [GO:0048839 "inner ear development" evidence=IMP]
            [GO:0051216 "cartilage development" evidence=IMP] [GO:0060021
            "palate development" evidence=IMP] [GO:0060272 "embryonic skeletal
            joint morphogenesis" evidence=ISO] [GO:0060348 "bone development"
            evidence=IMP] [GO:0060351 "cartilage development involved in
            endochondral bone morphogenesis" evidence=IMP] [GO:0071773
            "cellular response to BMP stimulus" evidence=IDA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 MGI:MGI:88452 GO:GO:0005737
            GO:GO:0043066 GO:GO:0005615 GO:GO:0046872 GO:GO:0003007
            GO:GO:0007601 GO:GO:0030199 GO:GO:0007417 GO:GO:0042472
            GO:GO:0001894 GO:GO:0007605 GO:GO:0071773 InterPro:IPR008160
            Pfam:PF01391 eggNOG:NOG12793 GO:GO:0071599 GO:GO:0005604
            GO:GO:0001502 GO:GO:0060021 GO:GO:0002062 GO:GO:0010468
            GO:GO:0060272 GO:GO:0006029 GO:GO:0001958 GO:GO:0060351
            GO:GO:0005201 GeneTree:ENSGT00660000095287 HOVERGEN:HBG004933
            KO:K06236 CTD:1280 OMA:SSCRICV GO:GO:0005585 GO:GO:0060174
            GO:GO:0030903 OrthoDB:EOG4FTW1C ChiTaRS:COL2A1 EMBL:M65161
            EMBL:BC030913 EMBL:BC051383 EMBL:BC052326 EMBL:BC082331 EMBL:S63190
            EMBL:M63708 EMBL:M63709 EMBL:M63710 EMBL:AK028295 EMBL:X57982
            IPI:IPI00471183 IPI:IPI00621255 IPI:IPI00622890 IPI:IPI00623625
            IPI:IPI00828467 IPI:IPI00828653 IPI:IPI00828753 PIR:A41182
            PIR:B41182 RefSeq:NP_001106987.2 RefSeq:NP_112440.2 UniGene:Mm.2423
            PDB:2W65 PDBsum:2W65 ProteinModelPortal:P28481 SMR:P28481
            IntAct:P28481 STRING:P28481 PhosphoSite:P28481 PRIDE:P28481
            Ensembl:ENSMUST00000023123 Ensembl:ENSMUST00000088355 GeneID:12824
            KEGG:mmu:12824 UCSC:uc007xlp.2 UCSC:uc007xlq.2 InParanoid:P28481
            EvolutionaryTrace:P28481 NextBio:282306 Bgee:P28481
            CleanEx:MM_COL2A1 Genevestigator:P28481
            GermOnline:ENSMUSG00000022483 Uniprot:P28481
        Length = 1487

 Score = 129 (50.5 bits), Expect = 0.00019, P = 0.00019
 Identities = 88/296 (29%), Positives = 110/296 (37%)

Query:   230 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSAT---TAGVVGAG 286
             P  DR  D    GA G    +  G P G        G P   GPP  +     A + G  
Sbjct:   132 PRGDR-GDKGEKGAPGPRGRD--GEP-GTPGNPGPAGPPGPPGPPGLSAGNFAAQMAGGY 187

Query:   287 PNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAK 343
                +  A      G PM      PRGP G   + GP G+  +     +P   GP   P  
Sbjct:   188 DEKAGGAQMGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GPRG 243

Query:   344 GPGYDPTKGPGYDAQKGS-NYDAQRG-PNYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQR 399
              PG  P   PG D + G      +RG P     RG    P  GL G    RG P  D  +
Sbjct:   244 PPG--PAGKPGDDGEAGKPGKSGERGLPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAK 299

Query:   400 G----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS---- 450
             G    PG + +   PG +   GP+   +  P    + GP       +G D +  P+    
Sbjct:   300 GEAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPG 357

Query:   451 -YDPSRGTGFDGAP--RG-AAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNP 500
                P+ G GF GAP  +G A P G   P       GS   P   GS  P G  GNP
Sbjct:   358 PVGPAGGPGFPGAPGAKGEAGPTGARGP---EGAQGSRGEPGNPGSPGPAGASGNP 410


>UNIPROTKB|P05997
            symbol:COL5A2 "Collagen alpha-2(V) chain" species:9606
            "Homo sapiens" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0001501 "skeletal system development"
            evidence=IEA] [GO:0046332 "SMAD binding" evidence=IEA] [GO:0071230
            "cellular response to amino acid stimulus" evidence=IEA]
            [GO:0030199 "collagen fibril organization" evidence=ISS;IMP]
            [GO:0043588 "skin development" evidence=ISS;IMP] [GO:0031012
            "extracellular matrix" evidence=NAS] [GO:0003674
            "molecular_function" evidence=ND] [GO:0048592 "eye morphogenesis"
            evidence=IMP] [GO:0005588 "collagen type V" evidence=IMP]
            [GO:0005576 "extracellular region" evidence=TAS] [GO:0005788
            "endoplasmic reticulum lumen" evidence=TAS] [GO:0007411 "axon
            guidance" evidence=TAS] [GO:0030198 "extracellular matrix
            organization" evidence=TAS] InterPro:IPR000885 InterPro:IPR001007
            Pfam:PF00093 Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208
            PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038 SMART:SM00214
            Reactome:REACT_118779 Reactome:REACT_111045 Reactome:REACT_111102
            GO:GO:0007411 GO:GO:0043588 GO:GO:0046872 GO:GO:0030199
            GO:GO:0005788 GO:GO:0001501 InterPro:IPR008160 Pfam:PF01391
            eggNOG:NOG12793 GO:GO:0071230 GO:GO:0005201 GO:GO:0048592
            HOVERGEN:HBG004933 KO:K06236 MIM:130000 Orphanet:90309
            EMBL:AY016295 PDB:1A9A PDBsum:1A9A MIM:130010 Orphanet:90318
            GO:GO:0005588 EMBL:Y14690 EMBL:AB209045 EMBL:AC064833 EMBL:AC133106
            EMBL:J04478 EMBL:AY016288 EMBL:AY016287 EMBL:AY016289 EMBL:AY016290
            EMBL:AY016291 EMBL:AY016292 EMBL:AY016293 EMBL:AY016294 EMBL:M58529
            EMBL:X04758 EMBL:BC043613 EMBL:M10956 EMBL:M11135 EMBL:M11718
            EMBL:J03051 IPI:IPI00739099 PIR:A31427 RefSeq:NP_000384.2
            UniGene:Hs.445827 ProteinModelPortal:P05997 SMR:P05997
            STRING:P05997 PhosphoSite:P05997 DMDM:143811378 PaxDb:P05997
            PRIDE:P05997 Ensembl:ENST00000374866 GeneID:1290 KEGG:hsa:1290
            UCSC:uc002uqk.3 CTD:1290 GeneCards:GC02M189861 HGNC:HGNC:2210
            MIM:120190 neXtProt:NX_P05997 PharmGKB:PA26725 InParanoid:P05997
            OMA:PDHKPVW OrthoDB:EOG4K0QMS ChiTaRS:COL5A2 GenomeRNAi:1290
            NextBio:5223 PMAP-CutDB:P05997 ArrayExpress:P05997 Bgee:P05997
            Genevestigator:P05997 GermOnline:ENSG00000204262 Uniprot:P05997
        Length = 1499

 Score = 129 (50.5 bits), Expect = 0.00019, P = 0.00019
 Identities = 87/293 (29%), Positives = 109/293 (37%)

Query:   233 DRRADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNT 289
             ++ A+G+ G  GA G         P G    E G   P+G  GPP S    G  G    T
Sbjct:   784 EKGAEGTAGNDGARGLPGPLGPPGPAGPTG-EKGEPGPRGLVGPPGSRGNPGSRGENGPT 842

Query:   290 STSAYAATQS--GTP-MRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGP 345
                 +A  Q   G P ++     P   G   S GP G   S  P + P   P     +G 
Sbjct:   843 GAVGFAGPQGPDGQPGVKGEPGEPGQKGDAGSPGPQGLAGSPGP-HGPNGVPGLKGGRGT 901

Query:   346 GYDP--TKGPGYDAQKGSNYDAQR-GPNYDIHRGPSYDPQRGLGYDM-------QRGPNY 395
                P  T  PG   + G    A   GP   +   P  +   GL  D         RGP  
Sbjct:   902 QGPPGATGFPGSAGRVGPPGPAGAPGPAGPLGE-PGKEGPPGLRGDPGSHGRVGDRGPA- 959

Query:   396 DMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYD 452
                 GPG +    PG D Q GP  +    P+    QRG  G   QRG+ G      P+  
Sbjct:   960 GPPGGPGDKGD--PGEDGQPGP--DGPPGPAGTTGQRGIVGMPGQRGERGMPGLPGPAGT 1015

Query:   453 PSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP-RGGNPARR 503
             P +  G  GA     P G V PP +N P G   P   +G+ G P R G    R
Sbjct:  1016 PGK-VGPTGATGDKGPPGPVGPPGSNGPVGEPGPEGPAGNDGTPGRDGAVGER 1067


>UNIPROTKB|D3ZZM1
            symbol:Taf15 "Protein Taf15" species:10116 "Rattus
            norvegicus" [GO:0000166 "nucleotide binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005622
            "intracellular" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358
            PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360 SMART:SM00547
            RGD:1309595 GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330
            GO:GO:0003676 GO:GO:0005622 EMBL:AC119615 IPI:IPI00950003
            ProteinModelPortal:D3ZZM1 Ensembl:ENSRNOT00000064396
            ArrayExpress:D3ZZM1 Uniprot:D3ZZM1
        Length = 558

 Score = 124 (48.7 bits), Expect = 0.00020, P = 0.00020
 Identities = 67/238 (28%), Positives = 89/238 (37%)

Query:   234 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQ-GHGPPPSATTAGVVGAGPNTSTS 292
             RR +   GG +G       G   G+  ++   G P+ G    P+ +   +  A  N+   
Sbjct:   318 RRPEFMRGGGSGGGRRGRGGYR-GRGGFQGRGGDPKNGDWVCPNPSCGNMNFARRNSCNQ 376

Query:   293 AYAAT-QSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK 351
                   +   P    +   RG GY   +G  +        D  +G       G GY   +
Sbjct:   377 CNEPRPEDSRPSGGDF---RGRGYGGERG--FRGRGGRGGD--RGGYGADRSGGGYGGDR 429

Query:   352 GPG-YDAQK-GSNYDAQR-GPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRV 408
               G Y A + G  Y   R G  Y   RG  Y   RG GY   RG +Y   RG GY   R 
Sbjct:   430 SGGSYGADRSGGGYGGDRSGGGYGGDRGGGYGGDRG-GYGGDRGGSYGGDRG-GYGGDR- 486

Query:   409 PGYDVQRGPVYEAQRAP-SYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 465
              GY   RG  Y   R+  +Y   RG G       GY   R+  Y   RG G+ G  RG
Sbjct:   487 GGYGGDRGG-YGGDRSRGAYGGDRGGG-----SGGYGGDRSGGYGGDRGGGY-GGDRG 537


>UNIPROTKB|Q9BRQ0
            symbol:PYGO2 "Pygopus homolog 2" species:9606 "Homo
            sapiens" [GO:0008270 "zinc ion binding" evidence=IEA] [GO:0001701
            "in utero embryonic development" evidence=IEA] [GO:0001822 "kidney
            development" evidence=IEA] [GO:0002088 "lens development in
            camera-type eye" evidence=IEA] [GO:0007420 "brain development"
            evidence=IEA] [GO:0009791 "post-embryonic development"
            evidence=IEA] [GO:0030879 "mammary gland development" evidence=IEA]
            [GO:0033599 "regulation of mammary gland epithelial cell
            proliferation" evidence=IEA] [GO:0042393 "histone binding"
            evidence=IEA] [GO:0048589 "developmental growth" evidence=IEA]
            [GO:0051569 "regulation of histone H3-K4 methylation" evidence=IEA]
            [GO:0060021 "palate development" evidence=IEA] [GO:0060070
            "canonical Wnt receptor signaling pathway" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR001965
            InterPro:IPR019787 Pfam:PF00628 PROSITE:PS50016 SMART:SM00249
            GO:GO:0005634 GO:GO:0007420 GO:GO:0046872 GO:GO:0008270
            GO:GO:0001701 GO:GO:0009791 GO:GO:0001822 Gene3D:3.30.40.10
            InterPro:IPR011011 InterPro:IPR013083 SUPFAM:SSF57903 GO:GO:0048589
            InterPro:IPR019786 PROSITE:PS01359 GO:GO:0060021 GO:GO:0060070
            GO:GO:0030879 GO:GO:0033599 PDB:2XB1 PDBsum:2XB1 GO:GO:0051569
            GO:GO:0002088 eggNOG:NOG72798 HOGENOM:HOG000001580
            HOVERGEN:HBG053774 EMBL:AF457208 EMBL:BC006132 EMBL:BC013725
            EMBL:BC032099 EMBL:AF289598 IPI:IPI00042099 RefSeq:NP_612157.1
            UniGene:Hs.533597 ProteinModelPortal:Q9BRQ0 SMR:Q9BRQ0
            IntAct:Q9BRQ0 STRING:Q9BRQ0 PhosphoSite:Q9BRQ0 DMDM:23396825
            PaxDb:Q9BRQ0 PRIDE:Q9BRQ0 DNASU:90780 Ensembl:ENST00000368457
            GeneID:90780 KEGG:hsa:90780 UCSC:uc001fft.3 CTD:90780
            GeneCards:GC01M154929 HGNC:HGNC:30257 HPA:HPA023689 MIM:606903
            neXtProt:NX_Q9BRQ0 PharmGKB:PA134881185 InParanoid:Q9BRQ0
            OMA:PGLVYPC OrthoDB:EOG4QZ7MB PhylomeDB:Q9BRQ0 GenomeRNAi:90780
            NextBio:76956 ArrayExpress:Q9BRQ0 Bgee:Q9BRQ0 CleanEx:HS_PYGO2
            Genevestigator:Q9BRQ0 GermOnline:ENSG00000163348 Uniprot:Q9BRQ0
        Length = 406

 Score = 122 (48.0 bits), Expect = 0.00020, P = 0.00020
 Identities = 80/302 (26%), Positives = 113/302 (37%)

Query:   227 MNAPNVDRRADGSYGGATGN-SENETSGRP-----VGQNAYEDGYGVPQ-GHGPPPSATT 279
             M +P   RR   + G A  + +E      P     V  N +ED +G P+ G   PP   +
Sbjct:    38 MKSPEKKRRKSNTQGPAYSHLTEFAPPPTPMVDHLVASNPFEDDFGAPKVGVAAPPFLGS 97

Query:   280 AGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKA--PSYDPTK-G 336
                 G             Q G     A  +P  PGY    G G    +   P + P   G
Sbjct:    98 PVPFGG---------FRVQGGM----AGQVP--PGYSTGGGGGPQPLRRQPPPFPPNPMG 142

Query:   337 PSYD-PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYD-PQRGLGY----DMQ 390
             P+++ P +GPGY P     + +Q    ++   G N+    G     P  G G      M 
Sbjct:   143 PAFNMPPQGPGYPPPGNMNFPSQP---FNQPLGQNFSPPSGQMMPGPVGGFGPMISPTMG 199

Query:   391 RGPNYDMQRGPGYETQRV--PGYDVQRGPVYE-AQRAPSYIPQRGP--GYDLQ-RGQGYD 444
             + P  ++  GP   +QR   PG      P+    Q  PS  P   P  G D    G G +
Sbjct:   200 QPPRAEL--GPPSLSQRFAQPGAPFGPSPLQRPGQGLPSLPPNTSPFPGPDPGFPGPGGE 257

Query:   445 MRRAPSYDPSRGTGFDGAPRGAAP----HGQVPP-PLNNVPYGSATPPARSGS--GQPRG 497
                 P  +P   T F   P   +P    +G  P  P N+   G  TP A S +  G+  G
Sbjct:   258 DGGKP-LNPPASTAFPQEPHSGSPAAAVNGNQPSFPPNSSGRGGGTPDANSLAPPGKAGG 316

Query:   498 GN 499
             G+
Sbjct:   317 GS 318


>UNIPROTKB|E2RRS5
            symbol:RBM12B "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0000166 "nucleotide binding" evidence=IEA] InterPro:IPR000504
            InterPro:IPR012677 PROSITE:PS50102 SMART:SM00360 GO:GO:0000166
            Gene3D:3.30.70.330 GO:GO:0003676 GeneTree:ENSGT00620000087828
            OMA:EHFRRPP CTD:389677 EMBL:AAEX03015951 RefSeq:XP_544177.3
            Ensembl:ENSCAFT00000014490 GeneID:487048 KEGG:cfa:487048
            NextBio:20860720 Uniprot:E2RRS5
        Length = 994

 Score = 124 (48.7 bits), Expect = 0.00021, Sum P(2) = 0.00021
 Identities = 45/174 (25%), Positives = 71/174 (40%)

Query:   302 PMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGS 361
             P    +  PR   +   +    D  + P  D  + P  D  + P  D  + P  D ++  
Sbjct:   591 PWEEGFRYPREEDFRYPREE--DWRRPPEEDFRRPPKDDFRRPPEEDWRRLPEGDFRRPP 648

Query:   362 NYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEA 421
               D +R P  D  R P  + +R    D +R P  D +R P  + +R+P  D +R P  + 
Sbjct:   649 EEDWRRPPEDDFRRLPQGEWRRPPEEDFRRPPEEDFRRLPEEDFRRLPEEDFRRPPEEDF 708

Query:   422 QRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPP 475
             +R+P    +R P  D +R      RR P  +  R    +   R    H + PPP
Sbjct:   709 RRSPEEDFRRSPEEDFRRPPPEHFRRPPP-EHLRRPPPEHFRRPPPEHFRRPPP 761

 Score = 50 (22.7 bits), Expect = 0.00021, Sum P(2) = 0.00021
 Identities = 14/57 (24%), Positives = 24/57 (42%)

Query:   212 YITMATEVEKLRAELMNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVP 268
             +++   E++K   E+     + R   GS  GA+G          + + A   GYG P
Sbjct:    72 FLSSKAEMQKT-IEMRRTDRIGRERPGS--GASGAGSLSNFVEAIKEEASNSGYGSP 125


>UNIPROTKB|E1BF47
            symbol:TPR "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0031965 "nuclear membrane" evidence=IEA] [GO:0007094
            "mitotic spindle assembly checkpoint" evidence=IEA] [GO:0005643
            "nuclear pore" evidence=IEA] [GO:0000776 "kinetochore"
            evidence=IEA] [GO:0006606 "protein import into nucleus"
            evidence=IEA] [GO:0006434 "seryl-tRNA aminoacylation" evidence=IEA]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005524 "ATP binding"
            evidence=IEA] [GO:0004828 "serine-tRNA ligase activity"
            evidence=IEA] InterPro:IPR012929 InterPro:IPR015866 Pfam:PF07926
            GO:GO:0005524 GO:GO:0005737 GO:GO:0000776 GO:GO:0007094
            GO:GO:0031965 GO:GO:0005643 GO:GO:0006606 KO:K09291
            InterPro:IPR009053 SUPFAM:SSF46579 GeneTree:ENSGT00700000104019
            GO:GO:0004828 GO:GO:0006434 Gene3D:1.10.287.40 CTD:7175 OMA:RFIRREK
            EMBL:DAAA02043627 IPI:IPI00694835 RefSeq:NP_001192552.1
            UniGene:Bt.1386 Ensembl:ENSBTAT00000015848 GeneID:507869
            KEGG:bta:507869 NextBio:20868255 Uniprot:E1BF47
        Length = 2360

 Score = 124 (48.7 bits), Expect = 0.00021, Sum P(2) = 0.00021
 Identities = 43/187 (22%), Positives = 87/187 (46%)

Query:    48 PEVME-QKIASQ---HVE-MQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKS 102
             P+  E +K+ S+   H + +Q+L  E  RL A        L   Q+ +Q L   +  +++
Sbjct:  1347 PDTEEYRKLLSEKEVHTKRIQQLTEELGRLKAEIARSNASLTNNQNLIQSLKEDLNKVRT 1406

Query:   103 ERELQMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQR 162
             E+E   + L  KI  ++ ++KT   VK   ++ KT+ + L   +++++    Q + D Q 
Sbjct:  1407 EKENIQKELDAKIIDIQEKVKTITQVKKIGRRYKTQYEELKAQQDKVMETSAQSSGDHQE 1466

Query:   163 AHTDVQQIPALLSELESLRQEYHHCRGTYE-YEKKFYNDHLESLQVMEKNYITMATEVEK 221
              H  VQ++  L   L     +        E  +K      +E+  + E+  + + +E+ +
Sbjct:  1467 QHVSVQEMQELKETLSQAETKSKSLENQVENLQKTLSEKEIEARSLQEQT-LELQSELAR 1525

Query:   222 LRAELMN 228
             LR +L +
Sbjct:  1526 LRQDLQD 1532

 Score = 58 (25.5 bits), Expect = 0.00021, Sum P(2) = 0.00021
 Identities = 19/63 (30%), Positives = 25/63 (39%)

Query:   233 DRRADGSYGGATGNSENETSGRPVGQNAYE--DGYGVPQGHGPPPSATTAGVVGAGPNTS 290
             D   D +  G  G   NE +G   G + YE  D  G   G G  P   T   +G G +  
Sbjct:  1970 DDDEDDTGMGDEGEDSNEGTGSADGNDGYEADDAEG---GDGTDPGTETEESMGGGESNQ 2026

Query:   291 TSA 293
              +A
Sbjct:  2027 RAA 2029


>RGD|1311417
            symbol:Col7a1 "collagen, type VII, alpha 1" species:10116
            "Rattus norvegicus" [GO:0003674 "molecular_function" evidence=ND]
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005604 "basement
            membrane" evidence=ISO] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR002035 InterPro:IPR003961 Pfam:PF00041
            Pfam:PF00092 PROSITE:PS50234 PROSITE:PS50853 SMART:SM00060
            SMART:SM00327 RGD:1311417 Gene3D:2.60.40.10 InterPro:IPR013783
            SUPFAM:SSF49265 InterPro:IPR008160 Pfam:PF01391 IPI:IPI00951759
            Ensembl:ENSRNOT00000066518 UCSC:RGD:1311417 ArrayExpress:D3ZQ14
            Uniprot:D3ZQ14
        Length = 2585

 Score = 131 (51.2 bits), Expect = 0.00021, P = 0.00021
 Identities = 75/262 (28%), Positives = 96/262 (36%)

Query:   253 GRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG 312
             G P G    +   G P   GPP S    GV G+     +  ++  +     R     P+G
Sbjct:  1285 GAP-GSTQAKGERGFPGPEGPPGSPGLPGVPGSPGVKGSPGWSGPRGDRGERGPQG-PKG 1342

Query:   313 ----PGYEASKG-PGYDASKAPSYDPTKGPSYDPAK-GPGYDPTKGP-GYDAQKGSNYDA 365
                 PG     G PG    K    DP  GPS  P   GP  DP  GP G     G++   
Sbjct:  1343 EPGEPGQVIGGGRPGLPGKKG---DP--GPSGPPGPHGPLGDP--GPRGPPGLPGTSVKG 1395

Query:   366 QRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVPGYDVQRGPVYEAQRA 424
              +G   +  RGP   P  G G   Q  P      G PG   Q  PG   ++G   + +  
Sbjct:  1396 DKGDRGE--RGP---PGPGTGASEQGSPGLPGLPGSPG--PQGPPGRTGEKGEKGDCEDG 1448

Query:   425 PSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGT-GFDGAPRGAAPHGQVPPPLNNVPYGS 483
                +P + PG   + G    +R AP     +G  G  G P      G+  PP    P G 
Sbjct:  1449 GPGLPGQ-PGVPGEPG----LRGAPGVTGPKGDRGLTGTPGEPGEKGERGPPGPVGPQGL 1503

Query:   484 ATPPARSGSGQPRG--GNPARR 503
                  R G   P G  G P RR
Sbjct:  1504 PGAAGRPGVEGPEGPPGPPGRR 1525


>ZFIN|ZDB-GENE-030516-3
            symbol:col18a1 "collagen type XVIII, alpha 1"
            species:7955 "Danio rerio" [GO:0031012 "extracellular matrix"
            evidence=IEA] [GO:0005198 "structural molecule activity"
            evidence=IEA] [GO:0007155 "cell adhesion" evidence=IEA] [GO:0005581
            "collagen" evidence=IEA] InterPro:IPR010515 InterPro:IPR020067
            Pfam:PF01392 Pfam:PF06482 PROSITE:PS50038 ZFIN:ZDB-GENE-030516-3
            GO:GO:0005198 Gene3D:3.10.100.10 InterPro:IPR016186
            InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0007155 InterPro:IPR008985
            SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            InterPro:IPR001791 SMART:SM00282 Gene3D:1.10.2000.10
            SUPFAM:SSF63501 SMART:SM00210 GeneTree:ENSGT00700000104250
            HOGENOM:HOG000231591 HOVERGEN:HBG053241 EMBL:BX927363 EMBL:CT030212
            IPI:IPI00616856 UniGene:Dr.52833 SMR:B0S8G4
            Ensembl:ENSDART00000130434 OMA:DRFNRYD Uniprot:B0S8G4
        Length = 1645

 Score = 129 (50.5 bits), Expect = 0.00021, P = 0.00021
 Identities = 73/277 (26%), Positives = 99/277 (35%)

Query:   235 RADGSYGGATGNSENETSGRPVGQNAYE--DGYGVPQGHGP--PPSATTAGVVGA-GPNT 289
             + D   G  +G       G P G+   +   G+G P   G   PP     G  G  GP  
Sbjct:   609 KGDVGSGSVSGGGSKGDKGVP-GEKGMKGTSGFGYPGSKGDRGPP-----GPPGPPGPQG 662

Query:   290 STSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGY 347
              ++       G+ ++     PRGP G +   GP G +       +  K     P+  PG 
Sbjct:   663 PSAEVEVRGDGSVVQKVTG-PRGPPGPQGPPGPPGPEGEPGDPGEDGKAGQVGPSGFPGN 721

Query:   348 DPTKGP-GYDAQKGSNYDAQRGP-NYDIHRGPSYDPQRGLGYDMQRGPNYDMQ--RG-PG 402
                 GP G    +G +    RGP       GPS    R    DM+ G  +DM   R  PG
Sbjct:   722 PGNPGPKGDKGDRGESQPGPRGPPGPPGPPGPSSGFDRPTFVDME-GSGFDMDSVRAVPG 780

Query:   403 YETQRVPGYDVQRGPVYEAQRAPS-YIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDG 461
                   PG     GP   A      + P   PG +   GQ   +   P  D   G     
Sbjct:   781 LPGP--PGPPGPPGPPGSASSGSGGFGPPGPPGQNGAPGQP-GLSGVPGADGKPGLPGPK 837

Query:   462 APRGAAPHGQVPPPLNNV-PYGSATPPARSGSGQPRG 497
               +G A    +P P+      GS+ PP  +G G P G
Sbjct:   838 GEKGDAGELGLPGPVGEKGAKGSSGPPGTTGIGGPAG 874


>UNIPROTKB|O46392
            symbol:COL1A2 "Collagen alpha-2(I) chain" species:9615
            "Canis lupus familiaris" [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 GO:GO:0046872 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 eggNOG:NOG12793 GO:GO:0005201 HOGENOM:HOG000085654
            HOVERGEN:HBG004933 KO:K06236 CTD:1278 EMBL:AF035120
            RefSeq:NP_001003187.1 UniGene:Cfa.1262 STRING:O46392 GeneID:403824
            KEGG:cfa:403824 NextBio:20817320 Uniprot:O46392
        Length = 1366

 Score = 128 (50.1 bits), Expect = 0.00022, P = 0.00022
 Identities = 86/283 (30%), Positives = 105/283 (37%)

Query:   242 GATG-NSENETSGRP--VGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQ 298
             GA G       +G P   G        G+P   G   +    G+VG  P  + S   +  
Sbjct:   301 GANGLTGAKGAAGLPGVAGAPGLPGPRGIPGPVGAAGATGARGIVGE-PGPAGSKGESGN 359

Query:   299 SGTPMRAAYDIPRGP-GYEASKGPGYDASKA-PSYDPTKGPSYDP-AKG-PGYDPTKGP- 353
              G P  A    P GP G E  +GP  +A  A PS  P  G    P ++G PG D   G  
Sbjct:   360 KGEPGSAGAQGPPGPSGEEGKRGPNGEAGSAGPSGPP--GLRGSPGSRGLPGADGPAGVM 417

Query:   354 GYDAQKGSNYDAQ-RGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRGP-GYE-TQRVP 409
             G    +G+   A  RGPN D  R P  +P    G    RG P      GP G E    +P
Sbjct:   418 GPPGPRGATGPAGVRGPNGDSGR-PG-EP----GLMGPRGFPGAPGNVGPAGKEGPMGLP 471

Query:   410 GYDVQRGPVYEA--QRAPSYIPQRGP-GYDLQRGQGYDMRRAPSYDPSRGT-GFDGAPRG 465
             G D + GP+  A  +  P  I   GP G     G+  D   A     +RG  G DG    
Sbjct:   472 GIDGRPGPIGPAGARGEPGNIGFPGPKGPTGDPGKNGDKGHA-GLAGARGAPGPDGNNGA 530

Query:   466 AAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-----GNPARR 503
               P G           G A PP   G   P G     G P  R
Sbjct:   531 QGPPGPQGVQGGKGEQGPAGPPGFQGLPGPAGTAGEVGKPGER 573


>UNIPROTKB|F1KQQ4
            symbol:F1KQQ4 "Collagen alpha-1(IV) chain" species:6253
            "Ascaris suum" [GO:0031012 "extracellular matrix" evidence=IDA]
            InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
            InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 Gene3D:2.170.240.10 EMBL:JI164326
            Uniprot:F1KQQ4
        Length = 1759

 Score = 129 (50.5 bits), Expect = 0.00023, P = 0.00023
 Identities = 86/285 (30%), Positives = 105/285 (36%)

Query:   238 GSYGGATGNSENETSGRP--VGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYA 295
             G  G A  N      G P   G+   +  +G P   GP  +   +G+ GA P        
Sbjct:  1164 GIPGDAGFNGRAGLPGLPGIKGERGQDGQHGYPGEPGPVGAHGESGLTGA-PGLQGEPGL 1222

Query:   296 ATQSGTPMR----AAYDIPRGPGYEASKG----PGYDASKA-PSYD--PTKGPSYDPAKG 344
               + G P +     A   P  PG E   G     G D     P  D  P +GP  D A  
Sbjct:  1223 PGRMGLPGQPGELGAPGFPGAPGLEGIPGIRGERGDDGLPGLPGIDGIPIQGPEGD-AGY 1281

Query:   345 PGYDPTKG-PGYDAQKGSNYDAQRG-PNYDIHRG----PSYDPQRGL-GYDMQRGPNYDM 397
             PG D   G PG   Q+G   D   G P     RG    P Y  +RGL G D +RGP  D 
Sbjct:  1282 PGRDGNDGLPGLPGQRGD--DGLPGLPGLIGERGDDGLPGYPGERGLRGIDGKRGP--DG 1337

Query:   398 QRG-PGYE-TQRVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPS 454
              RG PG       PG   +RG        P +  + G PGY  +RG+       P     
Sbjct:  1338 ARGLPGPPGLDGYPGAPGERG----MDGLPGFPGKDGIPGYPGERGEV----GLPGLPGM 1389

Query:   455 RGT-GFDGAPRGAAPHG-QVPPPLNNVPYGSATPPARSGSGQPRG 497
             RG  G  G P  A   G +    L  +P G   P    G   P G
Sbjct:  1390 RGEDGLPGLPGLAGQKGARGDDGLPGLP-GLPGPVGARGRPGPPG 1433


>UNIPROTKB|F1LNY9
            symbol:Col2a1 "Collagen alpha-1(II) chain" species:10116
            "Rattus norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 RGD:2375 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 IPI:IPI00558825
            Ensembl:ENSRNOT00000049994 ArrayExpress:F1LNY9 Uniprot:F1LNY9
        Length = 1441

 Score = 128 (50.1 bits), Expect = 0.00024, P = 0.00024
 Identities = 81/280 (28%), Positives = 105/280 (37%)

Query:   234 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS 292
             R       GA GN        P G      G G P   G P +   AG  GA GP  +  
Sbjct:   288 RTGPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFP---GAPGAKGEAGPTGARGPEGAQG 344

Query:   293 AYAATQS-GTPMRAAYDIPRG----PGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGP- 345
             +     + G+P  A      G    PG + S G PG   + AP +   +GP   P  GP 
Sbjct:   345 SRGEPGNPGSPGPAGASGNPGTDGIPGAKGSAGAPGI--AGAPGFPGPRGP---P--GPQ 397

Query:   346 GYDPTKGP-GYDAQKG-SNYDAQRGPNYDIH-RGPSYDPQRGLGYDMQRGPNYDMQ-RGP 401
             G     GP G   + G + +  ++GP  +    GP   P    G + +RG   +    GP
Sbjct:   398 GATGPLGPKGQTGEPGIAGFKGEQGPKGETGPAGPQGAPGPA-GEEGKRGARGEPGGAGP 456

Query:   402 -GYETQR-VPGYDVQRG-PVYEAQRAPSYIP-QRGP-GYDLQRGQGYDMRRAPSYDPSRG 456
              G   +R  PG    RG P  +    P   P +RGP G    +G   D  R         
Sbjct:   457 IGPPGERGAPG---NRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEPGLPGA 513

Query:   457 TGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP 495
              G  G P  A P G+V P       G   PP   G+ GQP
Sbjct:   514 RGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQP 553

 Score = 125 (49.1 bits), Expect = 0.00050, P = 0.00050
 Identities = 87/281 (30%), Positives = 99/281 (35%)

Query:   236 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 293
             ADG  G      E    G   G    +   G P   GP       G  GA GP  +T   
Sbjct:   795 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 853

Query:   294 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 349
              AA + G P       P GP      GP G D  K    D    G + DP  +GP   P 
Sbjct:   854 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 908

Query:   350 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 405
              KG PG D   GS  D   GP     +G +   QRG+ G   QRG   +    GP  E  
Sbjct:   909 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 959

Query:   406 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 461
              Q  PG    RGP           P   PG +   G      R  A      RG TG  G
Sbjct:   960 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 1019

Query:   462 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 501
             AP    P G  P P    P G       +G+  P G   PA
Sbjct:  1020 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1057


>UNIPROTKB|F1LQ06
            symbol:Col2a1 "Collagen alpha-1(II) chain" species:10116
            "Rattus norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 RGD:2375 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 IPI:IPI00949996
            Ensembl:ENSRNOT00000066385 ArrayExpress:F1LQ06 Uniprot:F1LQ06
        Length = 1441

 Score = 128 (50.1 bits), Expect = 0.00024, P = 0.00024
 Identities = 81/280 (28%), Positives = 105/280 (37%)

Query:   234 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS 292
             R       GA GN        P G      G G P   G P +   AG  GA GP  +  
Sbjct:   288 RTGPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFP---GAPGAKGEAGPTGARGPEGAQG 344

Query:   293 AYAATQS-GTPMRAAYDIPRG----PGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGP- 345
             +     + G+P  A      G    PG + S G PG   + AP +   +GP   P  GP 
Sbjct:   345 SRGEPGNPGSPGPAGASGNPGTDGIPGAKGSAGAPGI--AGAPGFPGPRGP---P--GPQ 397

Query:   346 GYDPTKGP-GYDAQKG-SNYDAQRGPNYDIH-RGPSYDPQRGLGYDMQRGPNYDMQ-RGP 401
             G     GP G   + G + +  ++GP  +    GP   P    G + +RG   +    GP
Sbjct:   398 GATGPLGPKGQTGEPGIAGFKGEQGPKGETGPAGPQGAPGPA-GEEGKRGARGEPGGAGP 456

Query:   402 -GYETQR-VPGYDVQRG-PVYEAQRAPSYIP-QRGP-GYDLQRGQGYDMRRAPSYDPSRG 456
              G   +R  PG    RG P  +    P   P +RGP G    +G   D  R         
Sbjct:   457 IGPPGERGAPG---NRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEPGLPGA 513

Query:   457 TGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP 495
              G  G P  A P G+V P       G   PP   G+ GQP
Sbjct:   514 RGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQP 553

 Score = 125 (49.1 bits), Expect = 0.00050, P = 0.00050
 Identities = 87/281 (30%), Positives = 99/281 (35%)

Query:   236 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 293
             ADG  G      E    G   G    +   G P   GP       G  GA GP  +T   
Sbjct:   795 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 853

Query:   294 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 349
              AA + G P       P GP      GP G D  K    D    G + DP  +GP   P 
Sbjct:   854 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 908

Query:   350 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 405
              KG PG D   GS  D   GP     +G +   QRG+ G   QRG   +    GP  E  
Sbjct:   909 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 959

Query:   406 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 461
              Q  PG    RGP           P   PG +   G      R  A      RG TG  G
Sbjct:   960 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 1019

Query:   462 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 501
             AP    P G  P P    P G       +G+  P G   PA
Sbjct:  1020 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1057


>UNIPROTKB|Q9XSK0
            symbol:CRX "Cone-rod homeobox protein" species:9913 "Bos
            taurus" [GO:0060041 "retina development in camera-type eye"
            evidence=IEA] [GO:0045944 "positive regulation of transcription
            from RNA polymerase II promoter" evidence=IEA] [GO:0043522 "leucine
            zipper domain binding" evidence=IEA] [GO:0005667 "transcription
            factor complex" evidence=IEA] [GO:0003700 "sequence-specific DNA
            binding transcription factor activity" evidence=IEA] [GO:0003682
            "chromatin binding" evidence=IEA] [GO:0006351 "transcription,
            DNA-dependent" evidence=IEA] [GO:0043565 "sequence-specific DNA
            binding" evidence=IEA] InterPro:IPR001356 InterPro:IPR009057
            InterPro:IPR013851 InterPro:IPR017970 Pfam:PF00046 Pfam:PF03529
            PROSITE:PS00027 PROSITE:PS50071 SMART:SM00389 GO:GO:0043565
            GO:GO:0045944 GO:GO:0003700 GO:GO:0006351 GO:GO:0003682
            Gene3D:1.10.10.60 SUPFAM:SSF46689 GO:GO:0005667 GO:GO:0060041
            EMBL:AF154123 IPI:IPI00695402 RefSeq:NP_776329.1 UniGene:Bt.283
            ProteinModelPortal:Q9XSK0 SMR:Q9XSK0 STRING:Q9XSK0 PRIDE:Q9XSK0
            Ensembl:ENSBTAT00000028232 GeneID:280756 KEGG:bta:280756 CTD:1406
            eggNOG:NOG324074 GeneTree:ENSGT00700000104128 HOGENOM:HOG000082677
            HOVERGEN:HBG004028 InParanoid:Q9XSK0 KO:K09337 OMA:QTKARPA
            OrthoDB:EOG4NKBWG NextBio:20804923 Uniprot:Q9XSK0
        Length = 299

 Score = 119 (46.9 bits), Expect = 0.00024, P = 0.00024
 Identities = 29/96 (30%), Positives = 42/96 (43%)

Query:   268 PQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASK 327
             P    P P A  AG+V +GP+ +++ YA T +  P  A    P   G  +S   G D   
Sbjct:   165 PASESPLPEAQRAGLVASGPSLTSAPYAMTYA--PASAFCSSPSAYGSPSSYFSGLDPYL 222

Query:   328 APSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNY 363
             +P   P  GP+  P  GP   P+      +  G +Y
Sbjct:   223 SPMVPPLGGPALSPLSGPSVGPSLTQSPTSLSGQSY 258


>UNIPROTKB|F1M8G1
            symbol:Col2a1 "Collagen alpha-1(II) chain" species:10116
            "Rattus norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 RGD:2375 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 IPI:IPI00475975
            Ensembl:ENSRNOT00000050833 ArrayExpress:F1M8G1 Uniprot:F1M8G1
        Length = 1458

 Score = 128 (50.1 bits), Expect = 0.00024, P = 0.00024
 Identities = 81/280 (28%), Positives = 105/280 (37%)

Query:   234 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS 292
             R       GA GN        P G      G G P   G P +   AG  GA GP  +  
Sbjct:   305 RTGPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFP---GAPGAKGEAGPTGARGPEGAQG 361

Query:   293 AYAATQS-GTPMRAAYDIPRG----PGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGP- 345
             +     + G+P  A      G    PG + S G PG   + AP +   +GP   P  GP 
Sbjct:   362 SRGEPGNPGSPGPAGASGNPGTDGIPGAKGSAGAPGI--AGAPGFPGPRGP---P--GPQ 414

Query:   346 GYDPTKGP-GYDAQKG-SNYDAQRGPNYDIH-RGPSYDPQRGLGYDMQRGPNYDMQ-RGP 401
             G     GP G   + G + +  ++GP  +    GP   P    G + +RG   +    GP
Sbjct:   415 GATGPLGPKGQTGEPGIAGFKGEQGPKGETGPAGPQGAPGPA-GEEGKRGARGEPGGAGP 473

Query:   402 -GYETQR-VPGYDVQRG-PVYEAQRAPSYIP-QRGP-GYDLQRGQGYDMRRAPSYDPSRG 456
              G   +R  PG    RG P  +    P   P +RGP G    +G   D  R         
Sbjct:   474 IGPPGERGAPG---NRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEPGLPGA 530

Query:   457 TGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP 495
              G  G P  A P G+V P       G   PP   G+ GQP
Sbjct:   531 RGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQP 570

 Score = 125 (49.1 bits), Expect = 0.00051, P = 0.00051
 Identities = 87/281 (30%), Positives = 99/281 (35%)

Query:   236 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 293
             ADG  G      E    G   G    +   G P   GP       G  GA GP  +T   
Sbjct:   812 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 870

Query:   294 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 349
              AA + G P       P GP      GP G D  K    D    G + DP  +GP   P 
Sbjct:   871 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 925

Query:   350 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 405
              KG PG D   GS  D   GP     +G +   QRG+ G   QRG   +    GP  E  
Sbjct:   926 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 976

Query:   406 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 461
              Q  PG    RGP           P   PG +   G      R  A      RG TG  G
Sbjct:   977 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 1036

Query:   462 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 501
             AP    P G  P P    P G       +G+  P G   PA
Sbjct:  1037 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1074

 Score = 123 (48.4 bits), Expect = 0.00084, P = 0.00084
 Identities = 87/286 (30%), Positives = 109/286 (38%)

Query:   237 DGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAA 296
             DG+ G    + E  T G P G        G P G G    A  A + G     +  A   
Sbjct:   113 DGAPGPRGRDGEPGTPGNP-GPPGPPGPPG-PPGLGGGNFA--AQMAGGFDEKAGGAQMG 168

Query:   297 TQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAKGPGYDPTKGP 353
                G PM      PRGP G   + GP G+  +     +P   GP   P   PG  P   P
Sbjct:   169 VMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GPRGPPG--PAGKP 222

Query:   354 GYDAQKGSNYDA-QRG-PNYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQRG----PGYET 405
             G D + G    A +RG P     RG    P  GL G    RG P  D  +G    PG + 
Sbjct:   223 GDDGEAGKPGKAGERGLPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAKGEAGAPGVKG 280

Query:   406 QR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS-----YDPSRGTGF 459
             +   PG +   GP+   +  P    + GP       +G D +  P+       P+ G GF
Sbjct:   281 ESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPGPVGPAGGPGF 338

Query:   460 DGAP--RG-AAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNP 500
              GAP  +G A P G   P       GS   P   GS  P G  GNP
Sbjct:   339 PGAPGAKGEAGPTGARGP---EGAQGSRGEPGNPGSPGPAGASGNP 381


>UNIPROTKB|F1PS24
            symbol:COL2A1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0071773 "cellular response to BMP stimulus"
            evidence=IEA] [GO:0071599 "otic vesicle development" evidence=IEA]
            [GO:0060351 "cartilage development involved in endochondral bone
            morphogenesis" evidence=IEA] [GO:0060272 "embryonic skeletal joint
            morphogenesis" evidence=IEA] [GO:0060174 "limb bud formation"
            evidence=IEA] [GO:0060021 "palate development" evidence=IEA]
            [GO:0048407 "platelet-derived growth factor binding" evidence=IEA]
            [GO:0043066 "negative regulation of apoptotic process"
            evidence=IEA] [GO:0042802 "identical protein binding" evidence=IEA]
            [GO:0042472 "inner ear morphogenesis" evidence=IEA] [GO:0030903
            "notochord development" evidence=IEA] [GO:0030199 "collagen fibril
            organization" evidence=IEA] [GO:0010468 "regulation of gene
            expression" evidence=IEA] [GO:0007605 "sensory perception of sound"
            evidence=IEA] [GO:0007601 "visual perception" evidence=IEA]
            [GO:0007417 "central nervous system development" evidence=IEA]
            [GO:0006029 "proteoglycan metabolic process" evidence=IEA]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005615 "extracellular
            space" evidence=IEA] [GO:0005604 "basement membrane" evidence=IEA]
            [GO:0005585 "collagen type II" evidence=IEA] [GO:0003007 "heart
            morphogenesis" evidence=IEA] [GO:0002062 "chondrocyte
            differentiation" evidence=IEA] [GO:0001958 "endochondral
            ossification" evidence=IEA] [GO:0001894 "tissue homeostasis"
            evidence=IEA] [GO:0001502 "cartilage condensation" evidence=IEA]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA] InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093
            Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184
            PROSITE:PS51461 SMART:SM00038 SMART:SM00214 GO:GO:0005737
            GO:GO:0043066 GO:GO:0005615 GO:GO:0003007 GO:GO:0007601
            GO:GO:0030199 GO:GO:0007417 GO:GO:0042472 GO:GO:0001894
            GO:GO:0007605 GO:GO:0071773 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0071599 GO:GO:0005604 GO:GO:0001502 GO:GO:0060021
            GO:GO:0002062 GO:GO:0010468 GO:GO:0060272 GO:GO:0006029
            GO:GO:0001958 GO:GO:0060351 GO:GO:0005201
            GeneTree:ENSGT00660000095287 GO:GO:0005585 GO:GO:0060174
            GO:GO:0030903 EMBL:AAEX03015088 EMBL:AAEX03015089
            Ensembl:ENSCAFT00000014414 OMA:CPICPTE Uniprot:F1PS24
        Length = 1489

 Score = 128 (50.1 bits), Expect = 0.00025, P = 0.00025
 Identities = 88/282 (31%), Positives = 102/282 (36%)

Query:   238 GSYGGATGNSENETSGRPVGQNAYEDGYGVP--QGH-GPPPSATTAGVVGAG--PNTSTS 292
             G  G A  N E    G P G        G P  +G  GPP  A  AG  GA   P     
Sbjct:   794 GPPGPAGANGEKGEVGPP-GPAGTAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGE 852

Query:   293 AYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAP--SYDPTKGPSYDPAKG---- 344
                A Q G    A    P+GP G    +GP G    K    +  P     +  A G    
Sbjct:   853 QGEAGQKGD---AGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGP 909

Query:   345 PGYDPTKGP-GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY 403
             PG +   GP G     G   D  +G      RG S  P R     +Q GP      GP  
Sbjct:   910 PGSNGNPGPPGPPGPSGK--DGPKGA-----RGDSGPPGRAGDPGLQ-GP-----AGPPG 956

Query:   404 ETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYDPSRGTGFD 460
             E    PG D   GP  +    P  +  QRG  G   QRG+ G+     PS +P +  G  
Sbjct:   957 EKGE-PGDDGPSGP--DGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGK-QGAP 1012

Query:   461 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 502
             GA     P G V PP    P G    P R GS     G P R
Sbjct:  1013 GASGDRGPPGPVGPPGLTGPSGE---PGREGS-PGADGPPGR 1050

 Score = 125 (49.1 bits), Expect = 0.00052, P = 0.00052
 Identities = 72/271 (26%), Positives = 92/271 (33%)

Query:   238 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAA- 296
             G  G      +    G P G    +   G P   GPP      G  G G N +       
Sbjct:   130 GEQGPRGDRGDKGEKGAP-GPRGRDGEPGTPGNPGPPGPPGPPGPPGLGGNFAAQMAGGF 188

Query:   297 -TQSGTPMRAAYDIPRGP-GYEASKGPGYDASKAPSYDPTKGPSYDPA-KGPGYDPTKGP 353
               ++G         P GP G     GP   A     +    G   +P   GP   P   P
Sbjct:   189 DEKAGGAQMGVMQGPMGPMGPRGPPGPA-GAPGPQGFQGNPGEPGEPGVSGP-MGPRGPP 246

Query:   354 GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRG-PGYETQR---- 407
             G   + G + +A + P     RGP   PQ   G+    G P     RG PG +  +    
Sbjct:   247 GPPGKPGDDGEAGK-PGKSGERGPP-GPQGARGFPGTPGLPGVKGHRGYPGLDGAKGEAG 304

Query:   408 VPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGA 466
              PG   + G   E   +P  +  RG PG   +RG     R  P+   +   G DG P  A
Sbjct:   305 APGVKGESGSPGE-NGSPGPMGPRGLPG---ERG-----RTGPA-GAAGARGNDGQPGPA 354

Query:   467 APHGQVPPPLNNVPYGSATPPARSGSGQPRG 497
              P G V P     P     P A  G   P G
Sbjct:   355 GPPGPVSPA--GGPGFPGAPGASQGEAGPTG 383


>RGD|1309595
            symbol:Taf15 "TAF15 RNA polymerase II, TATA box binding protein
            (TBP)-associated factor" species:10116 "Rattus norvegicus"
            [GO:0000166 "nucleotide binding" evidence=IEA] [GO:0003674
            "molecular_function" evidence=ND] [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0005622 "intracellular" evidence=IEA] [GO:0008150
            "biological_process" evidence=ND] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358
            PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360 SMART:SM00547
            RGD:1309595 GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330
            GO:GO:0003676 GO:GO:0005622 EMBL:AC119615 IPI:IPI00950713
            PRIDE:F1M8P1 Ensembl:ENSRNOT00000014438 ArrayExpress:F1M8P1
            Uniprot:F1M8P1
        Length = 554

 Score = 123 (48.4 bits), Expect = 0.00025, P = 0.00025
 Identities = 72/237 (30%), Positives = 86/237 (36%)

Query:   234 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSA 293
             RR +   GG +G       GR  G+  Y  G G  QG G  P       V   P+     
Sbjct:   318 RRPEFMRGGGSGG------GRR-GRGGYR-GRGGFQGRGGDPK--NGDWVCPNPSCGNMN 367

Query:   294 YAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP 353
             +A   S             P     +G GY   +   +    G   D   G G D + G 
Sbjct:   368 FARRNSCNQCNEPRPEDSRPSGGDFRGRGYGGERG--FRGRGGRGGDRG-GYGADRSGG- 423

Query:   354 GYDAQK-GSNYDAQR-GPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGY 411
             GY   + G +Y A R G  Y   R   Y   RG GY   RG +Y   RG GY   R  GY
Sbjct:   424 GYGGDRSGGSYGADRSGGGYGGDRS-GYGGDRG-GYGGDRGGSYGGDRG-GYGGDR-GGY 479

Query:   412 DVQRGPVYEAQRAPSYIPQRGPGYDLQRGQG---YDMRRAPSYDPSRGTGFDGAPRG 465
                RG  Y   R   Y   R   Y   RG G   Y   R+  Y   RG G+ G  RG
Sbjct:   480 GGDRGG-YGGDRG-GYGGDRRGAYGGDRGGGSGGYGGDRSGGYGGDRGGGY-GGDRG 533


>UNIPROTKB|F1SEN8
            symbol:LDB3 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0030018 "Z disc" evidence=IEA] [GO:0008092
            "cytoskeletal protein binding" evidence=IEA] [GO:0005856
            "cytoskeleton" evidence=IEA] [GO:0005080 "protein kinase C binding"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            Pfam:PF00595 Pfam:PF00412 InterPro:IPR001478 InterPro:IPR001781
            PROSITE:PS00478 PROSITE:PS50023 PROSITE:PS50106 SMART:SM00132
            SMART:SM00228 GO:GO:0030018 GO:GO:0005856 GO:GO:0046872
            GO:GO:0008270 Gene3D:2.10.110.10 SUPFAM:SSF50156 CTD:11155
            OMA:CTSQATT InterPro:IPR006643 SMART:SM00735
            GeneTree:ENSGT00700000104411 EMBL:CU468409 RefSeq:XP_003359314.1
            UniGene:Ssc.97236 Ensembl:ENSSSCT00000011341 GeneID:100151883
            KEGG:ssc:100151883 Uniprot:F1SEN8
        Length = 715

 Score = 124 (48.7 bits), Expect = 0.00028, P = 0.00028
 Identities = 50/192 (26%), Positives = 69/192 (35%)

Query:   243 ATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGP----NTSTSAYAATQ 298
             AT ++    S            Y       P P+A T     A P       T+A     
Sbjct:   344 ATASAAAPASSPADSPRPQASAYSPAVATSPAPAAHTYSEAPAAPAPKPRVVTTASIRPS 403

Query:   299 SGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQ 358
                P+ A+   P  PG   S  P Y  S AP+Y P+  P+Y P+  P Y P+  P Y+  
Sbjct:   404 VYQPVPASTYSP-SPGANYSPTP-YTPSPAPAYTPSPAPTYSPSPAPAYTPSPAPSYNPT 461

Query:   359 KGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVPGYDVQ--- 414
               S   A+           S+  +   G          + RG P Y T  + G  V    
Sbjct:   462 PYSGGPAESASRPPWVTDDSFSQKFAPGKSTTSISKQSLPRGAPAY-TPPLQGPQVSPLA 520

Query:   415 RGPVYEAQRAPS 426
             RG V  A+R P+
Sbjct:   521 RGTVQRAERFPA 532


>RGD|1311620
            symbol:Zmiz1 "zinc finger, MIZ-type containing 1" species:10116
            "Rattus norvegicus" [GO:0001570 "vasculogenesis" evidence=IEA;ISO]
            [GO:0001701 "in utero embryonic development" evidence=IEA;ISO]
            [GO:0003007 "heart morphogenesis" evidence=IEA;ISO] [GO:0007296
            "vitellogenesis" evidence=IEA;ISO] [GO:0007569 "cell aging"
            evidence=IEA;ISO] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0045944 "positive regulation of transcription from RNA
            polymerase II promoter" evidence=IEA;ISO] [GO:0048146 "positive
            regulation of fibroblast proliferation" evidence=IEA;ISO]
            [GO:0048589 "developmental growth" evidence=IEA;ISO] [GO:0048844
            "artery morphogenesis" evidence=IEA;ISO] InterPro:IPR004181
            Pfam:PF02891 PROSITE:PS51044 RGD:1311620 GO:GO:0008270
            Gene3D:3.30.40.10 InterPro:IPR013083 GeneTree:ENSGT00550000074410
            CTD:57178 OMA:MNQYGPM OrthoDB:EOG45MN70 EMBL:CH474067
            IPI:IPI00364462 RefSeq:NP_001101863.1 UniGene:Rn.1712
            Ensembl:ENSRNOT00000014004 GeneID:361103 KEGG:rno:361103
            UCSC:RGD:1311620 NextBio:675228 Uniprot:D4AE97
        Length = 1072

 Score = 126 (49.4 bits), Expect = 0.00028, P = 0.00028
 Identities = 66/233 (28%), Positives = 87/233 (37%)

Query:   286 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYD-ASKAPSYDPT--KGP--SYD 340
             GP  S+     TQ+          PRGP   AS G   + AS A    P+   GP    +
Sbjct:   318 GPVCSSFQMGPTQAYNSQFMNQPGPRGP---ASMGGSMNPASMAAGMTPSGMSGPPMGMN 374

Query:   341 PAKGPGYDP--TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ 398
               + PG  P  T G     Q       Q  P   I R    +P  G   + Q GPN    
Sbjct:   375 QPRPPGISPFGTHGQRMPQQTYPGPRPQSLPIQSIKRSYPGEPNYG---NQQYGPNSQFP 431

Query:   399 RGPG-YETQRVPGYDVQRGPVYEAQRAPSYIPQRG--PGYDLQRGQGYDMRRAPSYDP-- 453
               PG Y T   P       P Y  QR PS  P  G  P   +  GQ Y   +    +   
Sbjct:   432 TQPGQYPTPNPPR--PLTSPNYPGQRMPSQ-PSTGQYPPPTVNMGQYYKPEQFNGQNNTF 488

Query:   454 SRGTGFDGAPRGAAPHGQVPPPLNNVPY----GSATPPARSGSGQPRGGNPAR 502
             S G+ +    +G+      P P+ N P+    G+ TPP   GS  P   +P++
Sbjct:   489 SSGSSYSSYSQGSVNRPPRPVPVANYPHSPVPGNPTPPMTPGSSIPPYLSPSQ 541


>UNIPROTKB|F1NI79
            symbol:COL5A1 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 Gene3D:2.60.120.200 InterPro:IPR008985
            InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 InterPro:IPR001791 Pfam:PF02210 GO:GO:0005201
            SMART:SM00210 GeneTree:ENSGT00700000104155 EMBL:AADN02026433
            EMBL:AADN02026434 EMBL:AADN02026427 EMBL:AADN02026428
            EMBL:AADN02026429 EMBL:AADN02026430 EMBL:AADN02026431
            EMBL:AADN02026432 IPI:IPI00602965 Ensembl:ENSGALT00000004020
            ArrayExpress:F1NI79 Uniprot:F1NI79
        Length = 1702

 Score = 128 (50.1 bits), Expect = 0.00029, P = 0.00029
 Identities = 75/249 (30%), Positives = 96/249 (38%)

Query:   255 PVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRG- 312
             P+G    +   G P   GP  S    G  G AGP          Q G P  A     +G 
Sbjct:   930 PIGSPGLKGNEGPPGPPGPAGSPGERGPAGSAGP-IGLPGRPGPQ-GPPGPAG---EKGA 984

Query:   313 PGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRGP 369
             PG +  +GP G D  + P   P  GP+  P   PG D  KG  G   QKGS  D  ++GP
Sbjct:   985 PGEKGPQGPAGRDGIQGPVGLP--GPA-GPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGP 1041

Query:   370 NYDIHRGPSYDPQRGLGYDMQRGP-NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYI 428
                   GP+  PQ  +G   Q GP   D + GP  + Q + G     GP       P  +
Sbjct:  1042 P-----GPT-GPQGPIG---QPGPAGADGEPGPRGQ-QGLFGQKGDEGP-RGFPGPPGPV 1090

Query:   429 PQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPP 487
               +G PG   ++G+  D+ +     P    G  G P    P G      N    G    P
Sbjct:  1091 GLQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGPPGADGPQGPAGGIGNPGAVGEKGEP 1150

Query:   488 ARSGS-GQP 495
               SG  G P
Sbjct:  1151 GESGEPGLP 1159


>UNIPROTKB|E1BF96
            symbol:PPP1R10 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0072357 "PTW/PP1 phosphatase complex" evidence=IEA]
            [GO:0000785 "chromatin" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0003677 "DNA
            binding" evidence=IEA] InterPro:IPR000571 InterPro:IPR003617
            InterPro:IPR017923 Pfam:PF00642 Pfam:PF08711 PROSITE:PS50103
            PROSITE:PS51319 SMART:SM00356 SMART:SM00509 GO:GO:0005634
            GO:GO:0003677 GO:GO:0008270 GO:GO:0000785 GO:GO:0006351
            Gene3D:1.20.930.10 SUPFAM:SSF47676 GO:GO:0072357 CTD:5514
            OMA:PPPHEHR GeneTree:ENSGT00530000063820 EMBL:DAAA02055402
            IPI:IPI00698425 RefSeq:NP_001137335.1 UniGene:Bt.27784
            Ensembl:ENSBTAT00000009104 GeneID:510825 KEGG:bta:510825
            NextBio:20869636 Uniprot:E1BF96
        Length = 924

 Score = 125 (49.1 bits), Expect = 0.00030, P = 0.00030
 Identities = 71/271 (26%), Positives = 87/271 (32%)

Query:   238 GSYGGATGNSENETS-GRPV-GQNAYEDGYGVPQGH---GPPPSATTAGVVGAGPNTSTS 292
             G  GG  G        G P+ G +    G G P G    GPPP          GP     
Sbjct:   631 GGPGGPKGMQHFPPGPGGPMPGPHGGPGGPGGPVGPRLLGPPPPPRGGDPFWDGPGDP-- 688

Query:   293 AYAATQSGTPMRAAYDIPRGPG-YEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK 351
                    G PMR     P GPG Y   +G        P   P +G     + G   +   
Sbjct:   689 -----MRGGPMRGGPG-P-GPGPYHRGRGGRGGNEPPPPPPPFRGARGGRSGGGPPNGRG 741

Query:   352 GPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGY 411
             GPG     G  +    GP   ++ G  + P  G G  M  G  +    GPG       G+
Sbjct:   742 GPGGGMVGGGGHRPHEGPGGGMNSGSGHRPHEGPGSGM--GGGHRPHEGPGGSMGG--GH 797

Query:   412 DVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQ 471
                 GP         + P  GPG  +  G G+         P  G G  G P G  PH  
Sbjct:   798 RPHEGPGGGMGGGSGHRPHEGPGGGMGAGGGHR--------PHEGPGH-GGPHGHRPH-D 847

Query:   472 VPPPLNNVPYGSATPPARSGSGQPRGGNPAR 502
             VP    +   G      R   G   GG   R
Sbjct:   848 VPGHRGHDHRGPPPHEHRGHDGPGHGGGGHR 878

 Score = 121 (47.7 bits), Expect = 0.00081, P = 0.00081
 Identities = 49/192 (25%), Positives = 68/192 (35%)

Query:   242 GATGNSENETSGRPV-GQNAYEDGYGVPQGHGPPPSATTAGV-------VGAGPNTSTSA 293
             G  G +E      P  G      G G P G G P      G         G G N+ +  
Sbjct:   710 GGRGGNEPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGGGHRPHEGPGGGMNSGSGH 769

Query:   294 YAATQSGTPMRAAYDIPRGPG------YEASKGPGYDASKAPSYDPTKGPSYDPAKGPGY 347
                   G+ M   +    GPG      +   +GPG        + P +GP      G G+
Sbjct:   770 RPHEGPGSGMGGGHRPHEGPGGSMGGGHRPHEGPGGGMGGGSGHRPHEGPGGGMGAGGGH 829

Query:   348 DPTKGPGYDAQKGSN-YDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQ 406
              P +GPG+    G   +D      +D HRGP   P    G+D   GP +      G++  
Sbjct:   830 RPHEGPGHGGPHGHRPHDVPGHRGHD-HRGPP--PHEHRGHD---GPGHGGGGHRGHDGG 883

Query:   407 RVPGYDVQRGPV 418
                G D+   PV
Sbjct:   884 HSHGGDMSNRPV 895


>ZFIN|ZDB-GENE-030707-4
            symbol:anxa11a "annexin A11a" species:7955 "Danio
            rerio" [GO:0005509 "calcium ion binding" evidence=IEA] [GO:0005544
            "calcium-dependent phospholipid binding" evidence=IEA]
            InterPro:IPR001464 InterPro:IPR008157 InterPro:IPR018252
            InterPro:IPR018502 Pfam:PF00191 PRINTS:PR00196 PROSITE:PS00223
            SMART:SM00335 ZFIN:ZDB-GENE-030707-4 GO:GO:0005509 eggNOG:NOG267770
            GO:GO:0005544 Gene3D:1.10.220.10 PANTHER:PTHR10502 SUPFAM:SSF47874
            HOVERGEN:HBG061815 PANTHER:PTHR10502:SF29 HSSP:P79134 EMBL:AY178801
            IPI:IPI00498021 UniGene:Dr.77310 ProteinModelPortal:Q804G4
            SMR:Q804G4 PRIDE:Q804G4 InParanoid:Q804G4 NextBio:20812811
            ArrayExpress:Q804G4 Bgee:Q804G4 Uniprot:Q804G4
        Length = 526

 Score = 122 (48.0 bits), Expect = 0.00030, P = 0.00030
 Identities = 58/201 (28%), Positives = 73/201 (36%)

Query:   300 GTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQK 359
             G P ++ Y  P+G GY     PG     A  Y P  G  Y P  G GY P  G  Y  Q 
Sbjct:     5 GYPPQSGYP-PQGGGYPPQ--PGAYPPAAGGYPPQPG-MYPPQAG-GYPPQPG-AYPPQP 58

Query:   360 GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVY 419
             G+ +  Q G    +  G    P   +G D    P ++     G   Q          P  
Sbjct:    59 GA-FPGQPGQYPSVPSGGWGAP---IGLDNLPNPGFNASNIQGMANQFAADGGFAPNPSM 114

Query:   420 EAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNV 479
              +   P   PQ G    +   Q Y M   P     +  G  G P G  P GQ  P   N+
Sbjct:   115 FSGGYPG--PQPGGPPAVSPNQPYGMYPQPGGGMPQNPGM-GYP-GGPPPGQQMPSYPNI 170

Query:   480 PYGSATPPARSGSGQPRGGNP 500
             P  + TP   SG   PR  +P
Sbjct:   171 P--APTP---SGPSYPRAPSP 186


>UNIPROTKB|F1NR01
            symbol:COL5A1 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 Gene3D:2.60.120.200 InterPro:IPR008985
            InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 InterPro:IPR001791 SMART:SM00282 Pfam:PF02210
            GO:GO:0005201 SMART:SM00210 GeneTree:ENSGT00700000104155
            EMBL:AADN02026433 EMBL:AADN02026434 EMBL:AADN02026427
            EMBL:AADN02026428 EMBL:AADN02026429 EMBL:AADN02026430
            EMBL:AADN02026431 EMBL:AADN02026432 IPI:IPI00822317
            Ensembl:ENSGALT00000039037 ArrayExpress:F1NR01 Uniprot:F1NR01
        Length = 1773

 Score = 128 (50.1 bits), Expect = 0.00030, P = 0.00030
 Identities = 75/249 (30%), Positives = 96/249 (38%)

Query:   255 PVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRG- 312
             P+G    +   G P   GP  S    G  G AGP          Q G P  A     +G 
Sbjct:  1001 PIGSPGLKGNEGPPGPPGPAGSPGERGPAGSAGP-IGLPGRPGPQ-GPPGPAG---EKGA 1055

Query:   313 PGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRGP 369
             PG +  +GP G D  + P   P  GP+  P   PG D  KG  G   QKGS  D  ++GP
Sbjct:  1056 PGEKGPQGPAGRDGIQGPVGLP--GPA-GPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGP 1112

Query:   370 NYDIHRGPSYDPQRGLGYDMQRGP-NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYI 428
                   GP+  PQ  +G   Q GP   D + GP  + Q + G     GP       P  +
Sbjct:  1113 P-----GPT-GPQGPIG---QPGPAGADGEPGPRGQ-QGLFGQKGDEGP-RGFPGPPGPV 1161

Query:   429 PQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPP 487
               +G PG   ++G+  D+ +     P    G  G P    P G      N    G    P
Sbjct:  1162 GLQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGPPGADGPQGPAGGIGNPGAVGEKGEP 1221

Query:   488 ARSGS-GQP 495
               SG  G P
Sbjct:  1222 GESGEPGLP 1230


>UNIPROTKB|F1NR03 [details] [associations]
            symbol:COL5A1 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 Gene3D:2.60.120.200 InterPro:IPR008985
            InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 InterPro:IPR001791 SMART:SM00282 Pfam:PF02210
            GO:GO:0005201 SMART:SM00210 GeneTree:ENSGT00700000104155
            EMBL:AADN02026433 EMBL:AADN02026434 EMBL:AADN02026427
            EMBL:AADN02026428 EMBL:AADN02026429 EMBL:AADN02026430
            EMBL:AADN02026431 EMBL:AADN02026432 IPI:IPI00818113
            Ensembl:ENSGALT00000039034 ArrayExpress:F1NR03 Uniprot:F1NR03
        Length = 1804

 Score = 128 (50.1 bits), Expect = 0.00031, P = 0.00031
 Identities = 75/249 (30%), Positives = 96/249 (38%)

Query:   255 PVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRG- 312
             P+G    +   G P   GP  S    G  G AGP          Q G P  A     +G 
Sbjct:  1032 PIGSPGLKGNEGPPGPPGPAGSPGERGPAGSAGP-IGLPGRPGPQ-GPPGPAG---EKGA 1086

Query:   313 PGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRGP 369
             PG +  +GP G D  + P   P  GP+  P   PG D  KG  G   QKGS  D  ++GP
Sbjct:  1087 PGEKGPQGPAGRDGIQGPVGLP--GPA-GPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGP 1143

Query:   370 NYDIHRGPSYDPQRGLGYDMQRGP-NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYI 428
                   GP+  PQ  +G   Q GP   D + GP  + Q + G     GP       P  +
Sbjct:  1144 P-----GPT-GPQGPIG---QPGPAGADGEPGPRGQ-QGLFGQKGDEGP-RGFPGPPGPV 1192

Query:   429 PQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPP 487
               +G PG   ++G+  D+ +     P    G  G P    P G      N    G    P
Sbjct:  1193 GLQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGPPGADGPQGPAGGIGNPGAVGEKGEP 1252

Query:   488 ARSGS-GQP 495
               SG  G P
Sbjct:  1253 GESGEPGLP 1261


>UNIPROTKB|C9JPE6 [details] [associations]
            symbol:SLMAP "Sarcolemmal membrane-associated protein"
            species:9606 "Homo sapiens" [GO:0006457 "protein folding"
            evidence=IEA] [GO:0016272 "prefoldin complex" evidence=IEA]
            [GO:0051082 "unfolded protein binding" evidence=IEA]
            InterPro:IPR002777 Pfam:PF01920 GO:GO:0006457 GO:GO:0016272
            EMBL:AC114480 HGNC:HGNC:16643 ChiTaRS:SLMAP HOGENOM:HOG000072709
            EMBL:AC099777 IPI:IPI01019103 ProteinModelPortal:C9JPE6
            STRING:C9JPE6 Ensembl:ENST00000442599 UCSC:uc011bez.1
            ArrayExpress:C9JPE6 Bgee:C9JPE6 Uniprot:C9JPE6
        Length = 296

 Score = 118 (46.6 bits), Expect = 0.00031, P = 0.00031
 Identities = 39/164 (23%), Positives = 81/164 (49%)

Query:    50 VMEQKIASQHVEMQKLATE-NQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQM 108
             +++ ++   H++ + L  E +  + +T    R EL +A+ E+ +LH     + SER+  +
Sbjct:    15 LLKAQLQRLHIDTENLREEKDSEITST----RDELLSARDEILLLHQAAAKVASERDTDI 70

Query:   109 RNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQ 168
              +L E++ K+ AEL+       E++K  T  QN    R +      Q  ++  R   +++
Sbjct:    71 ASLQEELKKVRAELERWRKAASEYEKEITSLQNSFQLRCQQCED--QQREEATRLQGELE 128

Query:   169 QIP----ALLSELESLRQEYHHCRGTYE-YEKKFYNDHLESLQV 207
             ++     AL +E  SL++E        +  EK+ +N   +SL++
Sbjct:   129 KLRKEWNALETECHSLKRENVLLSSELQRQEKELHNSQKQSLEL 172


>UNIPROTKB|F1NR02 [details] [associations]
            symbol:COL5A1 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0001568 "blood vessel development"
            evidence=IEA] [GO:0003007 "heart morphogenesis" evidence=IEA]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA] [GO:0005588 "collagen type V" evidence=IEA]
            [GO:0005604 "basement membrane" evidence=IEA] [GO:0007155 "cell
            adhesion" evidence=IEA] [GO:0008201 "heparin binding" evidence=IEA]
            [GO:0030199 "collagen fibril organization" evidence=IEA]
            [GO:0032964 "collagen biosynthetic process" evidence=IEA]
            [GO:0035313 "wound healing, spreading of epidermal cells"
            evidence=IEA] [GO:0043206 "extracellular fibril organization"
            evidence=IEA] [GO:0043394 "proteoglycan binding" evidence=IEA]
            [GO:0043588 "skin development" evidence=IEA] [GO:0045112 "integrin
            biosynthetic process" evidence=IEA] [GO:0048407 "platelet-derived
            growth factor binding" evidence=IEA] [GO:0048592 "eye
            morphogenesis" evidence=IEA] [GO:0051128 "regulation of cellular
            component organization" evidence=IEA] InterPro:IPR000885
            Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461 SMART:SM00038
            GO:GO:0030199 GO:GO:0008201 GO:GO:0007155 Gene3D:2.60.120.200
            InterPro:IPR008985 InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0035313
            InterPro:IPR008160 Pfam:PF01391 InterPro:IPR001791 SMART:SM00282
            GO:GO:0005604 GO:GO:0043206 Pfam:PF02210 GO:GO:0005201 OMA:TIYEGIG
            GO:GO:0005588 GO:GO:0045112 GO:GO:0051128 SMART:SM00210
            GeneTree:ENSGT00700000104155 EMBL:AADN02026433 EMBL:AADN02026434
            EMBL:AADN02026427 EMBL:AADN02026428 EMBL:AADN02026429
            EMBL:AADN02026430 EMBL:AADN02026431 EMBL:AADN02026432
            IPI:IPI00821684 Ensembl:ENSGALT00000039035 ArrayExpress:F1NR02
            Uniprot:F1NR02
        Length = 1815

 Score = 128 (50.1 bits), Expect = 0.00031, P = 0.00031
 Identities = 75/249 (30%), Positives = 96/249 (38%)

Query:   255 PVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRG- 312
             P+G    +   G P   GP  S    G  G AGP          Q G P  A     +G 
Sbjct:  1043 PIGSPGLKGNEGPPGPPGPAGSPGERGPAGSAGP-IGLPGRPGPQ-GPPGPAG---EKGA 1097

Query:   313 PGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRGP 369
             PG +  +GP G D  + P   P  GP+  P   PG D  KG  G   QKGS  D  ++GP
Sbjct:  1098 PGEKGPQGPAGRDGIQGPVGLP--GPA-GPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGP 1154

Query:   370 NYDIHRGPSYDPQRGLGYDMQRGP-NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYI 428
                   GP+  PQ  +G   Q GP   D + GP  + Q + G     GP       P  +
Sbjct:  1155 P-----GPT-GPQGPIG---QPGPAGADGEPGPRGQ-QGLFGQKGDEGP-RGFPGPPGPV 1203

Query:   429 PQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPP 487
               +G PG   ++G+  D+ +     P    G  G P    P G      N    G    P
Sbjct:  1204 GLQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGPPGADGPQGPAGGIGNPGAVGEKGEP 1263

Query:   488 ARSGS-GQP 495
               SG  G P
Sbjct:  1264 GESGEPGLP 1272


>UNIPROTKB|E9PQW6 [details] [associations]
            symbol:ARID1A "AT-rich interactive domain-containing
            protein 1A" species:9606 "Homo sapiens" [GO:0006325 "chromatin
            organization" evidence=IEA] [GO:0016514 "SWI/SNF complex"
            evidence=IEA] [GO:0071564 "npBAF complex" evidence=IEA] [GO:0071565
            "nBAF complex" evidence=IEA] EMBL:AL034380 GO:GO:0016514
            EMBL:AL512408 HGNC:HGNC:11110 ChiTaRS:ARID1A GO:GO:0006325
            IPI:IPI00979164 Ensembl:ENST00000524572 ArrayExpress:E9PQW6
            Bgee:E9PQW6 Uniprot:E9PQW6
        Length = 123

 Score = 98 (39.6 bits), Expect = 0.00032, P = 0.00032
 Identities = 36/108 (33%), Positives = 47/108 (43%)

Query:   339 YDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ 398
             Y   +GP   P +G GY  Q   +   QR P     +G +     GL Y  Q  P Y  Q
Sbjct:    18 YSQQQGPPSGPQQGHGYPGQPYGSQTPQRYPM--TMQGRAQSAMGGLSYTQQIPP-YG-Q 73

Query:   399 RGP-GYETQ-RVPGYDVQ------RGPVYEAQRAPSYIPQRGPGYDLQ 438
             +GP GY  Q + P Y+ Q      + P Y +Q+ PS  P   P Y  Q
Sbjct:    74 QGPSGYGQQGQTPYYNQQSPHPQQQQPPY-SQQPPSQTPHAQPSYQQQ 120


>UNIPROTKB|F1MA98 [details] [associations]
            symbol:Tpr "Protein Tpr" species:10116 "Rattus norvegicus"
            [GO:0000122 "negative regulation of transcription from RNA
            polymerase II promoter" evidence=ISS] [GO:0000189 "MAPK import into
            nucleus" evidence=ISS] [GO:0000776 "kinetochore" evidence=ISS]
            [GO:0003682 "chromatin binding" evidence=ISS] [GO:0003729 "mRNA
            binding" evidence=ISS] [GO:0004828 "serine-tRNA ligase activity"
            evidence=IEA] [GO:0005487 "nucleocytoplasmic transporter activity"
            evidence=ISS] [GO:0005524 "ATP binding" evidence=IEA] [GO:0005635
            "nuclear envelope" evidence=ISS] [GO:0005643 "nuclear pore"
            evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISS] [GO:0005868
            "cytoplasmic dynein complex" evidence=ISS] [GO:0006404 "RNA import
            into nucleus" evidence=ISS] [GO:0006405 "RNA export from nucleus"
            evidence=ISS] [GO:0006434 "seryl-tRNA aminoacylation" evidence=IEA]
            [GO:0006999 "nuclear pore organization" evidence=ISS] [GO:0007094
            "mitotic spindle assembly checkpoint" evidence=ISS] [GO:0010965
            "regulation of mitotic sister chromatid separation" evidence=ISS]
            [GO:0019898 "extrinsic to membrane" evidence=ISS] [GO:0031072 "heat
            shock protein binding" evidence=ISS] [GO:0031453 "positive
            regulation of heterochromatin assembly" evidence=ISS] [GO:0031965
            "nuclear membrane" evidence=IDA] [GO:0031990 "mRNA export from
            nucleus in response to heat stress" evidence=ISS] [GO:0034399
            "nuclear periphery" evidence=IDA] [GO:0034605 "cellular response to
            heat" evidence=ISS] [GO:0035457 "cellular response to
            interferon-alpha" evidence=ISS] [GO:0042307 "positive regulation of
            protein import into nucleus" evidence=ISS] [GO:0042405 "nuclear
            inclusion body" evidence=IDA] [GO:0042803 "protein homodimerization
            activity" evidence=ISS] [GO:0044615 "nuclear pore nuclear basket"
            evidence=IDA] [GO:0045947 "negative regulation of translational
            initiation" evidence=ISS] [GO:0046827 "positive regulation of
            protein export from nucleus" evidence=IMP] [GO:0046832 "negative
            regulation of RNA export from nucleus" evidence=ISS] [GO:0051019
            "mitogen-activated protein kinase binding" evidence=ISS]
            [GO:0070849 "response to epidermal growth factor stimulus"
            evidence=ISS] [GO:0072686 "mitotic spindle" evidence=ISS]
            [GO:0090267 "positive regulation of mitotic cell cycle spindle
            assembly checkpoint" evidence=ISS] [GO:0090316 "positive regulation
            of intracellular protein transport" evidence=ISS] [GO:1901673
            "regulation of spindle assembly involved in mitosis" evidence=ISS]
            [GO:0005215 "transporter activity" evidence=ISS] [GO:0006606
            "protein import into nucleus" evidence=ISS] [GO:0006611 "protein
            export from nucleus" evidence=ISS] [GO:0031647 "regulation of
            protein stability" evidence=ISS] [GO:0042306 "regulation of protein
            import into nucleus" evidence=IMP] [GO:0043495 "protein anchor"
            evidence=ISS] [GO:0043578 "nuclear matrix organization"
            evidence=ISS] [GO:0051292 "nuclear pore complex assembly"
            evidence=IMP] InterPro:IPR012929 InterPro:IPR015866 Pfam:PF07926
            RGD:1310664 GO:GO:0005524 GO:GO:0005737 GO:GO:0005643 GO:GO:0006606
            KO:K09291 InterPro:IPR009053 SUPFAM:SSF46579
            GeneTree:ENSGT00700000104019 GO:GO:0004828 GO:GO:0006434
            Gene3D:1.10.287.40 CTD:7175 IPI:IPI00950468 RefSeq:NP_001100655.1
            UniGene:Rn.58980 Ensembl:ENSRNOT00000063833 GeneID:304862
            KEGG:rno:304862 NextBio:653738 ArrayExpress:F1MA98 Uniprot:F1MA98
        Length = 2360

 Score = 124 (48.7 bits), Expect = 0.00033, Sum P(2) = 0.00033
 Identities = 44/186 (23%), Positives = 88/186 (47%)

Query:    48 PEVME-QKIASQ---HVE-MQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKS 102
             P+  E +K+ S+   H + +Q+L  E  RL A        L   Q+ +Q L   +  +++
Sbjct:  1349 PDTEEYRKLLSEKEIHTKRIQQLNEEVGRLKAEIARSNASLTNNQNLIQSLKEDLSKVRT 1408

Query:   103 ERELQMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQR 162
             E+E   ++L  KI  ++ ++KT   VK   ++ KT+ + L  A+++ +    Q + D Q 
Sbjct:  1409 EKESIQKDLDAKIIDIQEKVKTITQVKKIGRRYKTQFEELK-AQQKAMETSTQSSGDHQE 1467

Query:   163 AHTDVQQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKL 222
              H  VQ++  L   L     +     G  E  +K  ++     + +++    + +E+ +L
Sbjct:  1468 QHISVQEMQELKDNLSQSETKTKSLEGQVENLQKTLSEKETEARSLQEQTAQLQSELSRL 1527

Query:   223 RAELMN 228
             R EL +
Sbjct:  1528 RQELQD 1533

 Score = 56 (24.8 bits), Expect = 0.00033, Sum P(2) = 0.00033
 Identities = 21/70 (30%), Positives = 28/70 (40%)

Query:   233 DRRADGSYGGATGNSENETSGRPVGQNAYE--DGYGVPQGHGPPPSATTAGVVGAGPNTS 290
             D   D +  G  G   NE +G   G + YE  D  G   G G  P   T   +G G  ++
Sbjct:  1970 DDEEDDTGMGDEGEDSNEGTGSADGNDGYEADDAEG---GDGTDPGTETEESMG-GAESN 2025

Query:   291 TSAYAATQSG 300
               A  +  SG
Sbjct:  2026 QRAADSQNSG 2035


>UNIPROTKB|Q14BN4 [details] [associations]
            symbol:SLMAP "Sarcolemmal membrane-associated protein"
            species:9606 "Homo sapiens" [GO:0006457 "protein folding"
            evidence=IEA] [GO:0016272 "prefoldin complex" evidence=IEA]
            [GO:0051082 "unfolded protein binding" evidence=IEA] [GO:0005815
            "microtubule organizing center" evidence=IEA] [GO:0042383
            "sarcolemma" evidence=IEA] [GO:0005790 "smooth endoplasmic
            reticulum" evidence=TAS] [GO:0005887 "integral to plasma membrane"
            evidence=TAS] [GO:0006936 "muscle contraction" evidence=TAS]
            InterPro:IPR000253 InterPro:IPR002777 InterPro:IPR008984
            Pfam:PF00498 Pfam:PF01920 PROSITE:PS50006 SMART:SM00240
            GO:GO:0006457 GO:GO:0005887 Gene3D:2.60.200.20 SUPFAM:SSF49879
            GO:GO:0005815 GO:GO:0042383 GO:GO:0006936 GO:GO:0016272
            GO:GO:0005790 eggNOG:COG1716 EMBL:AF304450 EMBL:AF100750
            EMBL:AY358410 EMBL:AK124200 EMBL:AL834538 EMBL:CR627321
            EMBL:BC114627 EMBL:BC115701 EMBL:AB046821 IPI:IPI00026691
            IPI:IPI00030531 IPI:IPI00432472 IPI:IPI00446339 IPI:IPI00791574
            IPI:IPI00794462 IPI:IPI00794566 IPI:IPI00795406 RefSeq:NP_009090.2
            UniGene:Hs.476432 ProteinModelPortal:Q14BN4 SMR:Q14BN4
            IntAct:Q14BN4 STRING:Q14BN4 PhosphoSite:Q14BN4 DMDM:118597508
            PaxDb:Q14BN4 PRIDE:Q14BN4 Ensembl:ENST00000295951
            Ensembl:ENST00000295952 Ensembl:ENST00000383718
            Ensembl:ENST00000416870 Ensembl:ENST00000428312
            Ensembl:ENST00000449503 GeneID:7871 KEGG:hsa:7871 UCSC:uc003djc.1
            UCSC:uc003djd.1 UCSC:uc003dje.1 UCSC:uc003djf.1 UCSC:uc003djg.1
            UCSC:uc003djh.3 UCSC:uc003dji.1 CTD:7871 GeneCards:GC03P057802
            H-InvDB:HIX0003396 HGNC:HGNC:16643 HPA:HPA002357 HPA:HPA002358
            MIM:602701 neXtProt:NX_Q14BN4 PharmGKB:PA38179 HOVERGEN:HBG082442
            OMA:RTSKQKC ChiTaRS:SLMAP GenomeRNAi:7871 NextBio:30324
            ArrayExpress:Q14BN4 Bgee:Q14BN4 Genevestigator:Q14BN4
            Uniprot:Q14BN4
        Length = 828

 Score = 124 (48.7 bits), Expect = 0.00033, P = 0.00033
 Identities = 40/165 (24%), Positives = 82/165 (49%)

Query:    49 EVMEQKIASQHVEMQKLATE-NQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQ 107
             +V++ ++   H++ + L  E +  + +T    R EL +A+ E+ +LH     + SER+  
Sbjct:   546 QVLQAQLQRLHIDTENLREEKDSEITST----RDELLSARDEILLLHQAAAKVASERDTD 601

Query:   108 MRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDV 167
             + +L E++ K+ AEL+       E++K  T  QN    R +      Q  ++  R   ++
Sbjct:   602 IASLQEELKKVRAELERWRKAASEYEKEITSLQNSFQLRCQQCED--QQREEATRLQGEL 659

Query:   168 QQIP----ALLSELESLRQEYHHCRGTYE-YEKKFYNDHLESLQV 207
             +++     AL +E  SL++E        +  EK+ +N   +SL++
Sbjct:   660 EKLRKEWNALETECHSLKRENVLLSSELQRQEKELHNSQKQSLEL 704


>ZFIN|ZDB-GENE-030707-5 [details] [associations]
            symbol:anxa11b "annexin A11b" species:7955 "Danio
            rerio" [GO:0005509 "calcium ion binding" evidence=IEA] [GO:0005544
            "calcium-dependent phospholipid binding" evidence=IEA]
            InterPro:IPR001464 InterPro:IPR008157 InterPro:IPR018252
            InterPro:IPR018502 Pfam:PF00191 PRINTS:PR00196 PROSITE:PS00223
            SMART:SM00335 ZFIN:ZDB-GENE-030707-5 GO:GO:0005509 eggNOG:NOG267770
            GO:GO:0005544 Gene3D:1.10.220.10 PANTHER:PTHR10502 SUPFAM:SSF47874
            HOGENOM:HOG000158803 HOVERGEN:HBG061815 PANTHER:PTHR10502:SF29
            OrthoDB:EOG4Z0B60 InterPro:IPR013286 PRINTS:PR01871 HSSP:P79134
            EMBL:BC068366 EMBL:AY178802 IPI:IPI00484212 RefSeq:NP_861431.1
            UniGene:Dr.76267 SMR:Q804G3 STRING:Q804G3 GeneID:353365
            KEGG:dre:353365 CTD:353365 NextBio:20812741 Uniprot:Q804G3
        Length = 485

 Score = 121 (47.7 bits), Expect = 0.00034, P = 0.00034
 Identities = 59/175 (33%), Positives = 71/175 (40%)

Query:   329 PSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYD 388
             P Y P  G SY PA GP   P  G  Y  Q G+ Y  Q G  Y    G ++ PQ G  + 
Sbjct:     4 PGYPPAGG-SYPPASGPYQQPAAG--YPPQPGA-YPPQAG-YYPPQPG-AFPPQPG-AFP 56

Query:   389 MQRG--P---NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSY--IPQRG-----PGYD 436
              Q G  P    Y  Q G GY      G+  Q G  Y A +  +Y  +P  G     PG+ 
Sbjct:    57 PQPGAFPPGAGYPPQAG-GYPAAPGGGFPPQAGG-YPAAQPGAYPNMPAAGGWGGHPGFG 114

Query:   437 LQRG---QGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPA 488
                G   QGY    AP   P     + GAP    P+  +P      P G  TPPA
Sbjct:   115 APAGGMPQGYPGVPAPGQQPM--PAYPGAP---VPNPGMPGYGGGAPTGP-TPPA 163


>UNIPROTKB|P02812 [details] [associations]
            symbol:PRB2 "Basic salivary proline-rich protein 2"
            species:9606 "Homo sapiens" [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0003674 "molecular_function" evidence=ND]
            [GO:0005575 "cellular_component" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] GO:GO:0005576 EMBL:AC078950
            EMBL:BX484538 EMBL:S80905 EMBL:K03208 IPI:IPI00552432 PIR:B40750
            PIR:E25372 UniGene:Hs.654486 STRING:P02812 DMDM:160409933
            PaxDb:P02812 PRIDE:P02812 Ensembl:ENST00000389362 UCSC:uc010shk.1
            GeneCards:GC12M011544 HGNC:HGNC:9338 MIM:168810 neXtProt:NX_P02812
            ArrayExpress:P02812 Bgee:P02812 CleanEx:HS_PRB2
            Genevestigator:P02812 GermOnline:ENSG00000173342 InterPro:IPR026086
            PANTHER:PTHR23203 Uniprot:P02812
        Length = 416

 Score = 120 (47.3 bits), Expect = 0.00035, P = 0.00035
 Identities = 69/257 (26%), Positives = 88/257 (34%)

Query:   252 SGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYA--ATQSGTPMRAAYDI 309
             +G P  Q A   G   PQG  P P     G    G N             G P +   + 
Sbjct:    33 AGNP--QGAPPQGGNKPQGP-PSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPPQGG-NK 88

Query:   310 PRGP---GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQ 366
             P+GP   G      P  D S++P   P K P   P +G G  P +GP     K      Q
Sbjct:    89 PQGPPPPGKPQGPPPQGDKSRSPRSPPGK-PQGPPPQG-GNQP-QGPPPPPGKPQGPPPQ 145

Query:   367 RGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPS 426
              G      +GP   P +  G   Q        R P  + Q  P    Q G   +    P 
Sbjct:   146 GGNK---PQGPP-PPGKPQGPPPQGDNKSRSSRSPPGKPQGPPP---QGGNQPQGPPPPP 198

Query:   427 YIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLN-NVPYGSAT 485
               PQ  P     + QG      P   P +G     + R      Q PPP   N P G   
Sbjct:   199 GKPQGPPPQGGNKPQGPPPPGKPQGPPPQGDNKSQSARSPPGKPQGPPPQGGNQPQGPPP 258

Query:   486 PPARSGSGQPRGGNPAR 502
             PP +     P+GGN ++
Sbjct:   259 PPGKPQGPPPQGGNKSQ 275

 Score = 118 (46.6 bits), Expect = 0.00057, P = 0.00057
 Identities = 76/272 (27%), Positives = 99/272 (36%)

Query:   245 GNSENETSGRPVG--QNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTP 302
             G++++ +S  P G  Q     G   PQG  PPP        G  P            G P
Sbjct:   166 GDNKSRSSRSPPGKPQGPPPQGGNQPQGPPPPPGKPQ----GPPPQGGNKPQGPPPPGKP 221

Query:   303 MRAAYDIPRGPGY-EASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGS 361
                    P+G    ++++ P     K P   P +G +  P +GP   P K  G   Q G+
Sbjct:   222 QGPP---PQGDNKSQSARSP---PGK-PQGPPPQGGN-QP-QGPPPPPGKPQGPPPQGGN 272

Query:   362 NYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP---GYETQRVPGYDVQ-RGP 417
                +Q  P     +GP   PQ G      R P    Q  P   G + Q  P    + +GP
Sbjct:   273 K--SQGPPPPGKPQGPP--PQGGSKSRSSRSPPGKPQGPPPQGGNQPQGPPPPPGKPQGP 328

Query:   418 VYEAQRAPSYIPQRG-P-GYDLQRGQGYDMRRAPSYDPSRGTGFDGAPR--GAAPHGQVP 473
               +    P   P  G P G   Q G      R+P   P       G P+  G  P G  P
Sbjct:   329 PPQGGNKPQGPPPPGKPQGPPPQGGSKSRSARSPPGKPQ------GPPQQEGNNPQGP-P 381

Query:   474 PPLNNVPYGSATPPARSGSGQPR---GGNPAR 502
             PP    P     PPA    G PR   GG P+R
Sbjct:   382 PPAGGNPQQPQAPPAGQPQGPPRPPQGGRPSR 413


>UNIPROTKB|F1Q0F7 [details] [associations]
            symbol:COL4A5 "Collagen alpha-5(IV) chain" species:9615
            "Canis lupus familiaris" [GO:0005581 "collagen" evidence=IEA]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA] InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403
            SMART:SM00111 InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 Gene3D:2.170.240.10
            GeneTree:ENSGT00690000101772 EMBL:AAEX03026757 EMBL:AAEX03026761
            EMBL:AAEX03026758 EMBL:AAEX03026759 EMBL:AAEX03026760
            Ensembl:ENSCAFT00000018078 Uniprot:F1Q0F7
        Length = 1678

 Score = 127 (49.8 bits), Expect = 0.00036, P = 0.00036
 Identities = 59/197 (29%), Positives = 72/197 (36%)

Query:   310 PRGPGYEASKGP--GYDASKAPSYDPTK-G-PSYDPAKG-PGYDPTKG-PGYDAQKGSNY 363
             P  PG     GP  G    K    +P K G P  D   G PG     G PGY  + G   
Sbjct:   269 PGPPGIRGPPGPPGGMKGEKGEQGEPGKRGKPGKDGENGQPGIPGLPGDPGYPGEPGR-- 326

Query:   364 DAQRGPNYDI-HRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQR-VPGYDVQRG-PVYE 420
             D ++G   DI   GP    + G G  +    N  +   PG + +R  PG     G P   
Sbjct:   327 DGEKGQKGDIGSTGPPGLSKPGTGVTVGEKGNMGLPGLPGEKGERGFPGIQGPPGLPGPP 386

Query:   421 AQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVP 480
                     P   PG+  +RGQ  D    P        G DG P      G   PP    P
Sbjct:   387 VLGTAVMGPPGPPGFPGERGQKGD-EGPPGISIPGFPGLDGQPGAPGLRGPPGPP---GP 442

Query:   481 YGSATPPARSGSGQPRG 497
             + S +PP   GS   RG
Sbjct:   443 HISPSPPGPPGSPGDRG 459


>UNIPROTKB|F1PHY1 [details] [associations]
            symbol:COL1A2 "Collagen alpha-2(I) chain" species:9615
            "Canis lupus familiaris" [GO:0071230 "cellular response to amino
            acid stimulus" evidence=IEA] [GO:0070208 "protein
            heterotrimerization" evidence=IEA] [GO:0048407 "platelet-derived
            growth factor binding" evidence=IEA] [GO:0046332 "SMAD binding"
            evidence=IEA] [GO:0043589 "skin morphogenesis" evidence=IEA]
            [GO:0042802 "identical protein binding" evidence=IEA] [GO:0030674
            "protein binding, bridging" evidence=IEA] [GO:0030199 "collagen
            fibril organization" evidence=IEA] [GO:0008217 "regulation of blood
            pressure" evidence=IEA] [GO:0007266 "Rho protein signal
            transduction" evidence=IEA] [GO:0007179 "transforming growth factor
            beta receptor signaling pathway" evidence=IEA] [GO:0005615
            "extracellular space" evidence=IEA] [GO:0005584 "collagen type I"
            evidence=IEA] [GO:0001568 "blood vessel development" evidence=IEA]
            [GO:0001501 "skeletal system development" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 GO:GO:0005615 GO:GO:0030199 GO:GO:0001501
            GO:GO:0008217 GO:GO:0007179 GO:GO:0007266 GO:GO:0070208
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0001568 GO:GO:0071230
            GO:GO:0005201 GO:GO:0043589 GeneTree:ENSGT00660000095287
            GO:GO:0005584 OMA:TGPIGSA EMBL:AAEX03009315
            Ensembl:ENSCAFT00000031580 Uniprot:F1PHY1
        Length = 1366

 Score = 126 (49.4 bits), Expect = 0.00037, P = 0.00037
 Identities = 83/261 (31%), Positives = 99/261 (37%)

Query:   266 GVPQGHGPPPSATTAGVVGA----G-PNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASK 319
             G+P   G P     AG  GA    G P  + S   +   G P  A    P GP G E  +
Sbjct:   322 GLPGPRGIPGPVGAAGATGARGLVGEPGPAGSKGESGNKGEPGSAGAQGPPGPSGEEGKR 381

Query:   320 GPGYDASKA-PSYDPTKGPSYDP-AKG-PGYDPTKGP-GYDAQKGSNYDAQ-RGPNYDIH 374
             GP  +A  A PS  P  G    P ++G PG D   G  G    +G+   A  RGPN D  
Sbjct:   382 GPNGEAGSAGPSGPP--GLRGSPGSRGLPGADGRAGVMGPPGPRGATGPAGVRGPNGDSG 439

Query:   375 RGPSYDPQRGLGYDMQRG-PNYDMQRGP-GYE-TQRVPGYDVQRGPVYEA--QRAPSYIP 429
             R P  +P    G    RG P      GP G E    +PG D + GP+  A  +  P  I 
Sbjct:   440 R-PG-EP----GLMGPRGFPGAPGNVGPAGKEGPMGLPGIDGRPGPIGPAGARGEPGNIG 493

Query:   430 QRGP-GYDLQRGQGYDMRRAPSYDPSRGT-GFDGAPRGAAPHGQVPPPLNNVPYGSATPP 487
               GP G     G+  D   A     +RG  G DG      P G           G A PP
Sbjct:   494 FPGPKGPTGDPGKNGDKGHA-GLAGARGAPGPDGNNGAQGPPGPQGVQGGKGEQGPAGPP 552

Query:   488 ARSGSGQPRG-----GNPARR 503
                G   P G     G P  R
Sbjct:   553 GFQGLPGPAGTAGEVGKPGER 573


>UNIPROTKB|E1BC70 [details] [associations]
            symbol:VPS37C "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR009851
            Pfam:PF07200 PROSITE:PS51314 KO:K12185 GeneTree:ENSGT00530000063123
            CTD:55048 OMA:VERCQEQ EMBL:DAAA02063396 IPI:IPI00692039
            RefSeq:NP_001193079.1 UniGene:Bt.105953 Ensembl:ENSBTAT00000010607
            GeneID:613817 KEGG:bta:613817 NextBio:20898788 Uniprot:E1BC70
        Length = 350

 Score = 91 (37.1 bits), Expect = 0.00037, Sum P(2) = 0.00037
 Identities = 61/196 (31%), Positives = 71/196 (36%)

Query:   325 ASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRG 384
             AS  P+ D T  P   P   PG   T  P  DAQ      +   P Y +   P Y P  G
Sbjct:   162 ASLEPAGD-TPPPRPPPPLHPGPQTTPPPAEDAQPQPPQPSVVPP-YPL---P-YSPSPG 215

Query:   385 LGYDMQRGPNYDMQRGPG-YETQRVPG--YDVQRGPVYEAQ----RAPS---YIPQRG-- 432
                 M  GP       P  +     P   Y    GP Y A     RAPS   + PQR   
Sbjct:   216 ----MPVGPTAHGALPPAPFPVVSQPSFSYSGPLGPPYAAAQPGTRAPSGYSWSPQRSMP 271

Query:   433 --PGYDLQ----RGQGYDM--RRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSA 484
               PGY +      G GY +   RAPS  P    G+   P   +  G+ P P    P G  
Sbjct:   272 PRPGYPVAPTGASGPGYPVVGGRAPS--P----GYPQQPPYLSTGGKPPYPTQPQPSGPL 325

Query:   485 TPPARSGSGQPRGGNP 500
              PP   G   P G  P
Sbjct:   326 QPPYPPGPAPPYGFPP 341

 Score = 71 (30.1 bits), Expect = 0.00037, Sum P(2) = 0.00037
 Identities = 31/144 (21%), Positives = 66/144 (45%)

Query:    43 DMMPPPEVMEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKS 102
             +M   PE ++ ++A    E+Q L  E +   AT+ +L +     Q  L+I    +    S
Sbjct:    14 EMQNDPEAID-RLAQDSPEVQDLQLEREMALATNRSLAERNLEFQGPLEISRSNL----S 68

Query:   103 ERELQMRNLTEKIAKMEAEL-KTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQ 161
             ++  ++R L E+  + +A+L K +  ++L       + +++ +  EE  A   +  +   
Sbjct:    69 DKYQELRKLVERYQEQKAKLEKFSSALQLGTLLDLLQIESMKI-EEESEAMAEKFLEGEV 127

Query:   162 RAHTDVQQIPAL--LSELESLRQE 183
                T ++   ++  LS L  +R E
Sbjct:   128 PLDTFLENFSSMRTLSHLRRVRVE 151


>RGD|61817 [details] [associations]
            symbol:Col1a1 "collagen, type I, alpha 1" species:10116 "Rattus
           norvegicus" [GO:0001501 "skeletal system development" evidence=ISO]
           [GO:0001503 "ossification" evidence=IEP] [GO:0001568 "blood vessel
           development" evidence=IEA;ISO] [GO:0001649 "osteoblast
           differentiation" evidence=IEA] [GO:0001957 "intramembranous
           ossification" evidence=IEA;ISO] [GO:0001958 "endochondral
           ossification" evidence=IEA;ISO] [GO:0003674 "molecular_function"
           evidence=ND] [GO:0005201 "extracellular matrix structural
           constituent" evidence=IEA;ISO] [GO:0005578 "proteinaceous
           extracellular matrix" evidence=ISO] [GO:0005581 "collagen"
           evidence=ISO] [GO:0005584 "collagen type I" evidence=IEA;ISO]
           [GO:0005615 "extracellular space" evidence=ISO;IDA] [GO:0005737
           "cytoplasm" evidence=IEA;ISO] [GO:0007584 "response to nutrient"
           evidence=IEP] [GO:0007601 "visual perception" evidence=IEA;ISO]
           [GO:0007605 "sensory perception of sound" evidence=IEA;ISO]
           [GO:0009612 "response to mechanical stimulus" evidence=IEP]
           [GO:0010035 "response to inorganic substance" evidence=IEP]
           [GO:0010718 "positive regulation of epithelial to mesenchymal
           transition" evidence=IEA;ISO] [GO:0010812 "negative regulation of
           cell-substrate adhesion" evidence=IEA;ISO] [GO:0015031 "protein
           transport" evidence=IEA;ISO] [GO:0030199 "collagen fibril
           organization" evidence=IEA;ISO] [GO:0030335 "positive regulation of
           cell migration" evidence=IEA;ISO] [GO:0031012 "extracellular matrix"
           evidence=ISO] [GO:0031960 "response to corticosteroid stimulus"
           evidence=IEP] [GO:0032964 "collagen biosynthetic process"
           evidence=IEA;ISO] [GO:0034504 "protein localization to nucleus"
           evidence=IEA;ISO] [GO:0034505 "tooth mineralization"
           evidence=IEA;ISO] [GO:0042060 "wound healing" evidence=IMP]
           [GO:0042542 "response to hydrogen peroxide" evidence=IEP]
           [GO:0042802 "identical protein binding" evidence=IEA;ISO]
           [GO:0043434 "response to peptide hormone stimulus" evidence=IEP]
           [GO:0043588 "skin development" evidence=ISO] [GO:0043589 "skin
           morphogenesis" evidence=IEA;ISO] [GO:0045893 "positive regulation of
           transcription, DNA-dependent" evidence=IEA;ISO] [GO:0046872 "metal
           ion binding" evidence=IEA] [GO:0048407 "platelet-derived growth
           factor binding" evidence=IEA;ISO] [GO:0048705 "skeletal system
           morphogenesis" evidence=ISO] [GO:0048706 "embryonic skeletal system
           development" evidence=IEA;ISO] [GO:0051591 "response to cAMP"
           evidence=IEP] [GO:0060325 "face morphogenesis" evidence=IEA;ISO]
           [GO:0060346 "bone trabecula formation" evidence=IEA;ISO] [GO:0060351
           "cartilage development involved in endochondral bone morphogenesis"
           evidence=IEA;ISO] [GO:0070208 "protein heterotrimerization"
           evidence=IEA;ISO] [GO:0071230 "cellular response to amino acid
           stimulus" evidence=IEA;ISO] [GO:0071260 "cellular response to
           mechanical stimulus" evidence=IEA] [GO:0071300 "cellular response to
           retinoic acid" evidence=IEP] [GO:0071363 "cellular response to
           growth factor stimulus" evidence=IEP] [GO:0071560 "cellular response
           to transforming growth factor beta stimulus" evidence=IEP]
           [GO:0090263 "positive regulation of canonical Wnt receptor signaling
           pathway" evidence=IEA;ISO] InterPro:IPR000885 InterPro:IPR001007
           Pfam:PF00093 Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208
           PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038 SMART:SM00214
           RGD:61817 GO:GO:0005737 GO:GO:0045893 GO:GO:0005615 GO:GO:0009612
           GO:GO:0071560 GO:GO:0046872 GO:GO:0015031 GO:GO:0007601
           GO:GO:0071300 GO:GO:0043434 GO:GO:0030199 GO:GO:0007584
           GO:GO:0010035 GO:GO:0007605 GO:GO:0010718 GO:GO:0030335
           GO:GO:0042542 GO:GO:0070208 InterPro:IPR008160 Pfam:PF01391
           eggNOG:NOG12793 GO:GO:0042060 GO:GO:0071260 GO:GO:0001568
           GO:GO:0001649 GO:GO:0051591 GO:GO:0034505 GO:GO:0090263
           GO:GO:0001503 GO:GO:0010812 GO:GO:0060325 EMBL:CH473948
           GO:GO:0032964 GO:GO:0071230 GO:GO:0048706 GO:GO:0001957
           GO:GO:0034504 GO:GO:0001958 GO:GO:0060351 GO:GO:0005201
           GO:GO:0043589 CTD:1277 GeneTree:ENSGT00660000095287
           HOGENOM:HOG000085654 HOVERGEN:HBG004933 KO:K06236 OMA:VAYMDQQ
           GO:GO:0005584 GO:GO:0060346 GO:GO:0031960 EMBL:Z78279 EMBL:BC133728
           EMBL:M11432 IPI:IPI00188909 PIR:A90559 RefSeq:NP_445756.1
           UniGene:Rn.2953 PDB:3HQV PDB:3HR2 PDBsum:3HQV PDBsum:3HR2
           ProteinModelPortal:P02454 IntAct:P02454 STRING:P02454 PRIDE:P02454
           Ensembl:ENSRNOT00000005311 GeneID:29393 KEGG:rno:29393
           UCSC:RGD:61817 InParanoid:A3KNA1 Reactome:REACT_150387
           EvolutionaryTrace:P02454 NextBio:609017 ArrayExpress:P02454
           Genevestigator:P02454 GermOnline:ENSRNOG00000003897 Uniprot:P02454
        Length = 1453

 Score = 126 (49.4 bits), Expect = 0.00039, P = 0.00039
 Identities = 88/285 (30%), Positives = 108/285 (37%)

Query:   236 ADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNTSTS 292
             ADG  G  G  G++  +    P G  A   G   P G+ G P    + G  G  P  +  
Sbjct:   808 ADGQPGAKGEPGDTGVKGDAGPPGP-AGPAGPPGPIGNVGAPGPKGSRGAAGP-PGATGF 865

Query:   293 AYAATQSGTPMRAAYDIPRGP----GYEASKGPGYDASKA--PSYDPTKGPSYDPA--KG 344
               AA + G P  +    P GP    G E  KGP  +   A  P      GP   PA  KG
Sbjct:   866 PGAAGRVGPPGPSGNAGPPGPPGPVGKEGGKGPRGETGPAGRPGEVGPPGPP-GPAGEKG 924

Query:   345 -PGYD-P-----TKGP-GYDAQKGS-NYDAQRGPN-YDIHRGPSYDP-QRG-LGYDMQRG 392
              PG D P     T GP G   Q+G      QRG   +    GPS +P ++G  G   +RG
Sbjct:   925 SPGADGPAGSPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQGPSGASGERG 984

Query:   393 PNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYD 452
             P   M  GP       PG     GP  E+ R  S   +  PG D   G   D        
Sbjct:   985 PPGPM--GP-------PGL---AGPPGESGREGSPGAEGSPGRDGAPGAKGDRGETGPAG 1032

Query:   453 PSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 497
             P    G  GAP    P G+        P G A P   +G+  P G
Sbjct:  1033 PPGAPGAPGAPGPVGPAGKNGDRGETGPAGPAGPIGPAGARGPAG 1077


>UNIPROTKB|F1LQ00 [details] [associations]
            symbol:Col5a2 "Protein Col5a2" species:10116 "Rattus
            norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 RGD:70921 GO:GO:0043588 GO:GO:0030199
            GO:GO:0001501 InterPro:IPR008160 Pfam:PF01391 GO:GO:0071230
            GO:GO:0005201 GO:GO:0048592 GeneTree:ENSGT00660000095287
            GO:GO:0005588 IPI:IPI00366945 Ensembl:ENSRNOT00000005073
            Uniprot:F1LQ00
        Length = 1467

 Score = 126 (49.4 bits), Expect = 0.00040, P = 0.00040
 Identities = 87/290 (30%), Positives = 109/290 (37%)

Query:   233 DRRADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNT 289
             ++ A+G+ G  GA G   +     P G    E G   P+G  GPP S    G  G    T
Sbjct:   752 EKGAEGTAGNDGARGLPGSLGPPGPAGPTG-EKGEPGPRGLVGPPGSRGNPGSRGENGPT 810

Query:   290 STSAYAATQS--GTP-MRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGP 345
                 +A  Q   G P ++     P   G   S GP G   S  P + P   P     +G 
Sbjct:   811 GAVGFAGPQGPDGQPGVKGEPGEPGQKGDAGSPGPQGLAGSPGP-HGPHGVPGLKGGRGT 869

Query:   346 GYDP--TKGPGYDAQKGSNYDAQR-GPNYDIHRGPSYDPQRGLGYDM-QRGPNYDM-QRG 400
                P  T  PG   + G    A   GP   I   P  +   GL  D    G   D    G
Sbjct:   870 QGPPGATGFPGSAGRVGPPGPAGAPGPAGPIGE-PGKEGPPGLRGDPGSHGRVGDRGPAG 928

Query:   401 P-GYETQRV-PGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYDPSR 455
             P G    +  PG D Q GP  +    P+    QRG  G   QRG+ G      P+  P +
Sbjct:   929 PPGSPGDKGDPGEDGQPGP--DGPPGPAGTTGQRGIVGMPGQRGERGMPGLPGPAGTPGK 986

Query:   456 GTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP-RGGNPARR 503
               G  GA     P G V PP +N P G   P   +G+ G P R G    R
Sbjct:   987 -VGPTGATGDKGPPGPVGPPGSNGPVGEPGPEGPAGNDGTPGRDGAVGER 1035


>UNIPROTKB|Q02388 [details] [associations]
            symbol:COL7A1 "Collagen alpha-1(VII) chain" species:9606
            "Homo sapiens" [GO:0004867 "serine-type endopeptidase inhibitor
            activity" evidence=IEA] [GO:0007155 "cell adhesion" evidence=IEA]
            [GO:0005604 "basement membrane" evidence=IEA] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0005590 "collagen type VII"
            evidence=TAS] [GO:0008544 "epidermis development" evidence=TAS]
            [GO:0005576 "extracellular region" evidence=TAS] [GO:0005788
            "endoplasmic reticulum lumen" evidence=TAS] [GO:0030198
            "extracellular matrix organization" evidence=TAS] [GO:0031012
            "extracellular matrix" evidence=ISS] InterPro:IPR002035
            InterPro:IPR002223 InterPro:IPR003961 Pfam:PF00014 Pfam:PF00041
            Pfam:PF00092 PRINTS:PR00759 PROSITE:PS50234 PROSITE:PS50279
            PROSITE:PS50853 SMART:SM00060 SMART:SM00327 Reactome:REACT_118779
            Gene3D:2.60.40.10 InterPro:IPR013783 GO:GO:0004867 SUPFAM:SSF49265
            GO:GO:0030198 GO:GO:0007155 Gene3D:4.10.410.10 InterPro:IPR020901
            SUPFAM:SSF57362 PROSITE:PS00280 GO:GO:0005788 InterPro:IPR008160
            Pfam:PF01391 eggNOG:NOG12793 GO:GO:0008544 GO:GO:0005604
            EMBL:L23982 EMBL:L02870 EMBL:D13694 EMBL:M96984 EMBL:S51236
            EMBL:M65158 EMBL:L06862 IPI:IPI00025418 IPI:IPI00795118 PIR:A54849
            RefSeq:NP_000085.1 UniGene:Hs.476218 ProteinModelPortal:Q02388
            SMR:Q02388 IntAct:Q02388 MINT:MINT-1390694 STRING:Q02388
            MEROPS:I02.967 PhosphoSite:Q02388 DMDM:1345650 PaxDb:Q02388
            PRIDE:Q02388 Ensembl:ENST00000328333 Ensembl:ENST00000454817
            GeneID:1294 KEGG:hsa:1294 UCSC:uc003ctz.2 CTD:1294
            GeneCards:GC03M048576 HGNC:HGNC:2214 HPA:CAB016357 MIM:120120
            MIM:131705 MIM:131750 MIM:131850 MIM:132000 MIM:226600 MIM:604129
            MIM:607523 neXtProt:NX_Q02388 Orphanet:158673 Orphanet:79407
            Orphanet:216989 Orphanet:79408 Orphanet:89842 Orphanet:89841
            Orphanet:79409 Orphanet:89839 Orphanet:158676 Orphanet:79410
            Orphanet:89843 Orphanet:79411 PharmGKB:PA26730 HOGENOM:HOG000111866
            HOVERGEN:HBG051053 InParanoid:Q02388 KO:K16628 OMA:RRVCTTA
            OrthoDB:EOG4J117P PhylomeDB:Q02388 ChiTaRS:COL7A1 GenomeRNAi:1294
            NextBio:5251 ArrayExpress:Q02388 Bgee:Q02388 CleanEx:HS_COL7A1
            Genevestigator:Q02388 GermOnline:ENSG00000114270 GO:GO:0005590
            Uniprot:Q02388
        Length = 2944

 Score = 129 (50.5 bits), Expect = 0.00040, P = 0.00040
 Identities = 83/269 (30%), Positives = 99/269 (36%)

Query:   255 PVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPG 314
             P G        G P   GPP SAT  G  G  P       A  + G+P RA    P  PG
Sbjct:  1270 PPGDPGLPGRTGAPGPQGPPGSATAKGERGF-PG------ADGRPGSPGRAGN--PGTPG 1320

Query:   315 YEASKG-PGYDASKA-PSYDPTKGPSYDPAKGPGYD-PTKGPGYDAQKG----SNYDAQR 367
                 KG PG    +  P     +GP  +P   PG     +GPG   +KG    S     R
Sbjct:  1321 APGLKGSPGLPGPRGDPGERGPRGPKGEPG-APGQVIGGEGPGLPGRKGDPGPSGPPGPR 1379

Query:   368 GPNYDIHRGPSYDPQRGLGYDMQRGPNYDM-QRGP-GY-ETQRVPGYDVQRG-PVYEAQR 423
             GP  D   GP   P  GL     +G   D  +RGP G  E    PG   + G P      
Sbjct:  1380 GPLGD--PGPRGPP--GLPGTAMKGDKGDRGERGPPGPGEGGIAPG---EPGLPGLPGSP 1432

Query:   424 APSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGA----APHGQ--VPPPLN 477
              P   P   PG   ++G   D   AP      G+  +  PRG      P G    P PL 
Sbjct:  1433 GPQG-PVGPPGKKGEKGDSED--GAPGLPGQPGSPGEQGPRGPPGAIGPKGDRGFPGPLG 1489

Query:   478 NV-PYGSATPPARSGS-GQPR-GGNPARR 503
                  G   PP  +GS G P   G P  +
Sbjct:  1490 EAGEKGERGPPGPAGSRGLPGVAGRPGAK 1518


>ZFIN|ZDB-GENE-980526-192 [details] [associations]
            symbol:col2a1a "collagen type II, alpha-1a"
            species:7955 "Danio rerio" [GO:0005581 "collagen" evidence=IEA;ISS]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA] [GO:0030903 "notochord development" evidence=IGI]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 ZFIN:ZDB-GENE-980526-192 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201
            GeneTree:ENSGT00660000095287 HOVERGEN:HBG004933 KO:K06236
            GO:GO:0030903 EMBL:BX927144 EMBL:DQ335127 IPI:IPI00505438
            RefSeq:NP_571367.1 UniGene:Dr.75057 SMR:Q2LDA1 STRING:Q2LDA1
            Ensembl:ENSDART00000100234 GeneID:562496 KEGG:dre:562496 CTD:562496
            InParanoid:Q2LDA1 NextBio:20884441 Uniprot:Q2LDA1
        Length = 1491

 Score = 126 (49.4 bits), Expect = 0.00041, P = 0.00041
 Identities = 83/270 (30%), Positives = 96/270 (35%)

Query:   242 GATGNSENETSGRPVGQNAYEDGYGVPQG-HGPPPSATTAGVVGA-GPNTSTSAYAATQ- 298
             GA G   N+      GQ   + G   PQG  G P      GV G  G   +  A  AT  
Sbjct:   844 GADGQPGNKGEQGESGQKG-DSGAPGPQGPSGAPGPVGPTGVTGPKGARGAQGAPGATGF 902

Query:   299 SGTPMRAAYDIPRG-PGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGP-GYDPTKGP 353
              G   R     P G PG     GP G D  K    D    G + D   +GP G    KG 
Sbjct:   903 PGAAGRVGPPGPNGNPGAAGPAGPSGKDGPKGVRGDAGPPGRAGDAGLRGPPGAPGEKGE 962

Query:   354 -GYDAQKGSNYDAQRGP-NYDIHRGPSYDP-QRG-LGYDMQRGPNYD--MQRGPGYETQR 407
              G D   G   D   GP      RG    P QRG  G+    GP+ +   Q  PG    R
Sbjct:   963 AGEDGPPGP--DGPSGPAGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPGGSGDR 1020

Query:   408 VP----GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG-QGYDMRRAPSYDPSRGTGFDGA 462
              P    G     GP  E  R  +      PG D   G +G      P   P    G  GA
Sbjct:  1021 GPPGPVGPPGLTGPAGETGREGNPGSDGPPGRDGAAGVKGERGNTGPIGAPG-APGAPGA 1079

Query:   463 PRGAAPHGQVPPPLNNVPYGSATPPARSGS 492
             P    P G+      N P G A PP  +G+
Sbjct:  1080 PGSVGPIGKQGDRGENGPQGPAGPPGPAGA 1109


>WB|WBGene00001076 [details] [associations]
            symbol:dpy-17 species:6239 "Caenorhabditis elegans"
            [GO:0042302 "structural constituent of cuticle" evidence=IEA]
            [GO:0016021 "integral to membrane" evidence=IEA] [GO:0010171 "body
            morphogenesis" evidence=IMP] [GO:0040002 "collagen and
            cuticulin-based cuticle development" evidence=IMP] [GO:0040035
            "hermaphrodite genitalia development" evidence=IMP] [GO:0040007
            "growth" evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] InterPro:IPR002486 Pfam:PF01484 SMART:SM01088
            GO:GO:0040007 GO:GO:0002119 GO:GO:0010171 GO:GO:0040035
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0040002 EMBL:FO080874
            GO:GO:0042302 HOGENOM:HOG000085656 GeneTree:ENSGT00390000012316
            RefSeq:NP_498086.1 ProteinModelPortal:Q20778 SMR:Q20778
            DIP:DIP-26150N MINT:MINT-1080630 STRING:Q20778 PaxDb:Q20778
            EnsemblMetazoa:F54D8.1.1 EnsemblMetazoa:F54D8.1.2 GeneID:175696
            KEGG:cel:CELE_F54D8.1 UCSC:F54D8.1.1 CTD:175696 WormBase:F54D8.1
            eggNOG:NOG253878 InParanoid:Q20778 OMA:TEMEAWR NextBio:889252
            Uniprot:Q20778
        Length = 352

 Score = 118 (46.6 bits), Expect = 0.00043, P = 0.00043
 Identities = 74/296 (25%), Positives = 104/296 (35%)

Query:   218 EVEKLRAELMNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGY-GVPQGHGPPPS 276
             E +++  ++     V R+A G YGG  G      SG P G +    G+ G PQGH P  +
Sbjct:    48 ESDQIYMDMQKFGRVRRQA-GGYGGYGGYGSGP-SG-PSGPSGPHGGFPGGPQGHFPGNT 104

Query:   277 ATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKG 336
              ++      G      +      G+P+        GPG + +          P+  P   
Sbjct:   105 GSSNTPTLPGVIGVPPSVTGHPGGSPINPDGSPSAGPGDKCNCNTENSCPAGPA-GPKGT 163

Query:   337 PSYDPAKG-PGYDPTKGPGYDAQKGSNYDAQRGPNYD----IHRGPSYDP-QRGL-GYDM 389
             P +D   G PG      PG D +   +  AQ    YD       GP   P  +G  G   
Sbjct:   164 PGHDGPDGIPGV-----PGVDGEDADDAKAQT-QQYDGCFTCPAGPQGPPGSQGKPGARG 217

Query:   390 QRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYD-MRRA 448
              RG        PG +    PG     GP+     A    P   PG D++   G    +  
Sbjct:   218 MRGARGQAAM-PGRDGS--PGMPGSLGPIGPPGAAGEEGPTGEPGADVEHQIGLPGAKGT 274

Query:   449 PSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP-RGGNPAR 502
             P      G   +   RGA   G   PP    P G       +G+ G P   G P +
Sbjct:   275 PGAPGESGDQGEQGDRGAT--GIAGPPGERGPQGEKGDDGPNGAAGSPGEEGEPGQ 328


>ZFIN|ZDB-GENE-040426-2678 [details] [associations]
            symbol:pdcd6ip "programmed cell death 6
            interacting protein" species:7955 "Danio rerio" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] InterPro:IPR025304 Pfam:PF13949
            ZFIN:ZDB-GENE-040426-2678 Gene3D:1.25.40.280 InterPro:IPR004328
            Pfam:PF03097 SMART:SM01041 PROSITE:PS51180
            GeneTree:ENSGT00670000098017 EMBL:CU469582 IPI:IPI00503522
            Ensembl:ENSDART00000028592 ArrayExpress:F1Q5T7 Bgee:F1Q5T7
            Uniprot:F1Q5T7
        Length = 873

 Score = 123 (48.4 bits), Expect = 0.00046, P = 0.00046
 Identities = 74/329 (22%), Positives = 123/329 (37%)

Query:    79 LRQELAA---AQHELQILHGQIGGMKSERELQMRNLTEKIAKMEAELKTAEPVKLEFQKS 135
             LR +LA     + E ++L G++  +  +   +      +   +  E+ T+  +   +   
Sbjct:   556 LRSQLAQLDEVKREREVLEGEVKSVTFDLTAKFLTALAQDGAINEEVMTSSELDARYGSH 615

Query:   136 KTEAQNLVVAREELIAKV---HQLTQDLQRAHTDVQQIPALLSELESLRQEYHHCRGTYE 192
                 Q  +  +EEL++++   HQ    L++++++      +L +L S    Y       +
Sbjct:   616 NQRVQQNLRRQEELLSQIQVSHQEFSALKQSNSEANTREDVLKKLASAHDSYIEISSNIK 675

Query:   193 YEKKFYNDHLESLQVMEKNY--ITMA--TEVEKLRAELMNA----PNVDRRADGSYGGAT 244
                KFYND  E L   +     I  A  TE ++L  EL  +    P+    +  SY   T
Sbjct:   676 EGTKFYNDLTEILLKFQNKCSDIVFARKTERDELLKELQQSIAREPSAPSFSVPSYQSNT 735

Query:   245 GNSENETSGRPVGQNAYEDGYGVPQ--GHGPPPSATTAGVVGAGPNTSTSAYAATQSGTP 302
                    +  P    + +     PQ     PPPS        A P    SA  A  S  P
Sbjct:   736 PAPAGGPTPAPRTVFSQQQPQAKPQPPARPPPPSIAPQAASAAVP---VSAPMAPGSSNP 792

Query:   303 MRAAYDIPRGPGYEASKGPGYDASKA-PSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGS 361
                A   P GP    ++GP Y + +  P Y      +Y+P     Y+    P Y AQ  +
Sbjct:   793 PPVA---PTGPSQ--AQGPPYPSYQGYPGYYQMP-MAYNPYAYGQYNMPYMP-YQAQGQA 845

Query:   362 NYDAQRGPNYDIHRGPSYDPQRGLGYDMQ 390
              Y          +  P   PQ+   Y  Q
Sbjct:   846 GYPGAPATQQP-YPYPQQPPQQQPYYPQQ 873


>UNIPROTKB|G4MYW7 [details] [associations]
            symbol:MGG_10829 "Uncharacterized protein" species:242507
            "Magnaporthe oryzae 70-15" [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR000571 PROSITE:PS50103 GO:GO:0008270 GO:GO:0003676
            EMBL:CM001232 InterPro:IPR019496 Pfam:PF10453 RefSeq:XP_003713435.1
            EnsemblFungi:MGG_10829T0 GeneID:2676344 KEGG:mgr:MGG_10829
            Uniprot:G4MYW7
        Length = 600

 Score = 121 (47.7 bits), Expect = 0.00046, P = 0.00046
 Identities = 61/238 (25%), Positives = 82/238 (34%)

Query:   270 GHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYD--IPRGPGYEASKGPGYDASK 327
             G+GPPP        GA P      Y   Q        +    PRG G  A  G G     
Sbjct:     5 GYGPPPPPPA----GAPPQAYQQQYGQYQQPPATGHVHGGHAPRG-GRGAHSGRGDFHGS 59

Query:   328 APSYDPTKGPSYDPA-KGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLG 386
              PSY     P   P+  GP + P   P +      NY     P +  ++ P Y  Q+   
Sbjct:    60 PPSYPYNNQPQPPPSYTGPHHAPP--PPHTPLAPQNYHPNYAPQH--YQQPQYAHQQQYP 115

Query:   387 YDMQRGPNYDMQRGPGYETQRVPGY-DVQRGPVYEAQRAPSYIPQR--GPG-YDLQRGQG 442
             +   + P    Q+ P Y     P Y      P ++    P+    +  GP  Y   RG+G
Sbjct:   116 HQQPQQPPQPPQQAP-Y-AHHYPSYPQAPNAPPHQPWGGPATAGHQPAGPAHYGSGRGRG 173

Query:   443 YDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNP 500
                     + P+   G      G    G  PP L  V   +  PP     G P+GG P
Sbjct:   174 GHQGDRGGHKPAAAMG-PPLRMGFDNRGPEPPAL--VSSATVYPP--QPFGPPQGGAP 226


>ZFIN|ZDB-GENE-041221-2 [details] [associations]
            symbol:prnpb "prion protein b" species:7955 "Danio
            rerio" [GO:0051260 "protein homooligomerization" evidence=IEA]
            [GO:0016020 "membrane" evidence=IEA] [GO:0003674
            "molecular_function" evidence=ND] [GO:0016338 "calcium-independent
            cell-cell adhesion" evidence=IMP] [GO:0007156 "homophilic cell
            adhesion" evidence=IDA] [GO:0055113 "epiboly involved in
            gastrulation with mouth forming second" evidence=IGI;IMP]
            [GO:2000047 "regulation of cell-cell adhesion mediated by cadherin"
            evidence=IMP] [GO:0005886 "plasma membrane" evidence=IDA]
            [GO:0007417 "central nervous system development" evidence=IGI]
            [GO:0009986 "cell surface" evidence=IDA] InterPro:IPR022416
            ZFIN:ZDB-GENE-041221-2 GO:GO:0005886 GO:GO:0009986 GO:GO:0051260
            GO:GO:0007156 GO:GO:0055113 GO:GO:0016338 Gene3D:1.10.790.10
            SUPFAM:SSF54098 EMBL:AJ850286 IPI:IPI00485089 UniGene:Dr.90045
            ProteinModelPortal:Q5K0E1 PRIDE:Q5K0E1 HOVERGEN:HBG056090
            InParanoid:Q5K0E1 Bgee:Q5K0E1 GO:GO:2000047 Uniprot:Q5K0E1
        Length = 606

 Score = 121 (47.7 bits), Expect = 0.00047, P = 0.00047
 Identities = 89/287 (31%), Positives = 108/287 (37%)

Query:   236 ADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAG--PNTST 291
             A GSY   G  G+S      +  G  +Y  G   P   G P      G    G  PN + 
Sbjct:    94 AGGSYPYPGRGGSSPGGYPNQNPGAGSYPSGGSYPSAGGNPNQYPGRGGYNPGGYPNQNP 153

Query:   292 SAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK 351
              A +    G+   A  +  + PG   +   GY     P+ +P  G SY PA G  Y    
Sbjct:   154 GAGSYPAGGSYPSAGGNPNQYPGRGGTSPAGY-----PNQNPGAG-SY-PAGG-SYPSAG 205

Query:   352 G-PG-YDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRG---PGYET 405
             G P  Y  + GSN      PN +   G SY P  G  Y    G PN    RG   PG   
Sbjct:   206 GNPNQYPGRGGSNPGGY--PNQNPGAG-SY-PAGG-SYPSAGGNPNQYPGRGGSSPGGNP 260

Query:   406 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQR-GQ-GYDMRRAP---SYDPSRGTGFD 460
              + PG     G  Y     P+  P  G GY  Q  G+ GY     P   SY P R  G  
Sbjct:   261 NQNPGAGTYAGGGY-----PNQYPGGG-GYSNQNPGRSGYSPGGYPGAGSY-PVRNAGQP 313

Query:   461 GAPRGAAPH--GQVPP--PLNNV--P-YGSATPPARSGSGQPRGGNP 500
             G   GA P   G  P   P N +  P YG +      G G   GG+P
Sbjct:   314 GVYPGAHPSAGGGYPNWNPNNQILSPRYGGSF----GGGGFGTGGSP 356


>WB|WBGene00001263 [details] [associations]
            symbol:emb-9 species:6239 "Caenorhabditis elegans"
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA;TAS] [GO:0005581 "collagen" evidence=IEA] [GO:0040010
            "positive regulation of growth rate" evidence=IMP] [GO:0008340
            "determination of adult lifespan" evidence=IMP] [GO:0000003
            "reproduction" evidence=IMP] [GO:0009792 "embryo development ending
            in birth or egg hatching" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0040011 "locomotion" evidence=IMP] [GO:0006898
            "receptor-mediated endocytosis" evidence=IMP] [GO:0040039
            "inductive cell migration" evidence=IMP] [GO:0030198 "extracellular
            matrix organization" evidence=IMP] [GO:0009790 "embryo development"
            evidence=IMP] [GO:0050714 "positive regulation of protein
            secretion" evidence=IMP] [GO:0007517 "muscle organ development"
            evidence=IMP] [GO:0005604 "basement membrane" evidence=IDA]
            InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
            GO:GO:0008340 GO:GO:0009792 GO:GO:0006898 GO:GO:0040007
            GO:GO:0040010 GO:GO:0002119 InterPro:IPR016187 SUPFAM:SSF56436
            GO:GO:0030198 GO:GO:0000003 GO:GO:0050714 GO:GO:0007517
            GO:GO:0040039 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            eggNOG:NOG12793 GO:GO:0005604 GO:GO:0005201 HOGENOM:HOG000085652
            Gene3D:2.170.240.10 EMBL:X56979 EMBL:Z27078 EMBL:J05067 PIR:S40991
            RefSeq:NP_001022662.1 RefSeq:NP_001022663.1
            ProteinModelPortal:P17139 SMR:P17139 IntAct:P17139
            MINT:MINT-1091171 STRING:P17139 PaxDb:P17139 PRIDE:P17139
            EnsemblMetazoa:K04H4.1a GeneID:176314 KEGG:cel:CELE_K04H4.1
            UCSC:K04H4.1b CTD:176314 WormBase:K04H4.1a WormBase:K04H4.1b
            GeneTree:ENSGT00690000101772 InParanoid:P17139 OMA:EEGIPGC
            NextBio:892048 Uniprot:P17139
        Length = 1759

 Score = 126 (49.4 bits), Expect = 0.00049, P = 0.00049
 Identities = 79/282 (28%), Positives = 100/282 (35%)

Query:   238 GSYGGATGNSENETSGRP----VGQNAYEDGY-GVP--QGHGPPPSATTAGVVGAGPNTS 290
             G+YG      E    G P        A E GY G P  +G   P      G   AGP+  
Sbjct:   315 GNYGEKGSQGEQGLGGTPGYPGTKGGAGEPGYPGRPGFEGDCGPEGPLGEGTGEAGPH-G 373

Query:   291 TSAYAATQSGTPMRAAYDIPRGP-GYEASKG-PGYDASKAPSYDPTKGPSYDPAKGP-GY 347
                +   Q G  +     +P GP G     G PG  A   P  D   G +    +G  GY
Sbjct:   374 AQGFDGVQGGKGLPGHDGLP-GPVGPRGPVGAPG--APGQPGIDGMPGYTEKGDRGEDGY 430

Query:   348 DPTKG-PGYDAQKGS-NYDAQRG-PNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGPGY 403
                 G PG   + G   Y  + G P YDI   P  D Q G  G+    G   D    PGY
Sbjct:   431 PGFAGEPGLPGEPGDCGYPGEDGLPGYDIQGPPGLDGQSGRDGFPGIPGDIGD----PGY 486

Query:   404 ETQR-VPGYDVQR-GP--VYEAQRAPSYIPQR-G-PGYDLQRGQGYDMRRAPSYDPSRGT 457
               ++  PG  V + GP  +      P  +P R G  GY    G   +      Y P    
Sbjct:   487 SGEKGFPGTGVNKVGPPGMTGLPGEPG-MPGRIGVDGYPGPPGNNGERGEDCGYCPDGVP 545

Query:   458 GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGN 499
             G  G P     +G   PP  N  +G    P   G  +  G +
Sbjct:   546 GNAGDPGFPGMNGYPGPPGPNGDHGDCGMPGAPGKPRSAGSD 587


>UNIPROTKB|E1BT66 [details] [associations]
            symbol:TAF15 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0000166 "nucleotide binding" evidence=IEA] [GO:0003676
            "nucleic acid binding" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0005737
            "cytoplasm" evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358
            PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360 SMART:SM00547
            GO:GO:0005634 GO:GO:0005737 GO:GO:0000166 GO:GO:0008270
            Gene3D:3.30.70.330 GO:GO:0003676 GeneTree:ENSGT00530000063105
            OMA:YGNQGSQ EMBL:AADN02025953 EMBL:AADN02025954 IPI:IPI00575015
            ProteinModelPortal:E1BT66 Ensembl:ENSGALT00000003204 Uniprot:E1BT66
        Length = 443

 Score = 119 (46.9 bits), Expect = 0.00049, P = 0.00049
 Identities = 70/232 (30%), Positives = 89/232 (38%)

Query:   247 SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAA 306
             S++ + G+  GQ +Y   YG     G      T G  G G +   S+Y   QS       
Sbjct:     3 SDSGSYGQSGGQQSYSS-YG---NQGNQSYGQTQGYSGYGQSGDNSSYG--QSYGNYHGN 56

Query:   307 YDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP--GYDAQKGSNYD 364
             Y      GY      GYD     SYD     SY+          KG   G      S+YD
Sbjct:    57 YG-QNQTGY-GQDSHGYDDES--SYDNQNQSSYNQQSYSNQGQQKGSSRGGRGSYSSSYD 112

Query:   365 AQRGPNYDIHRGPSYDPQRGLG----YDMQRGPNYDMQRGPGYETQRVPGYDVQRGPV-Y 419
              Q G  Y  H+G SYD Q G G    YD + G N   Q   G+  Q    Y  Q+G   +
Sbjct:   113 QQSG--YG-HQG-SYDQQSGYGHQSSYDQKSGYNQH-QSSYGHSQQ---SYQSQKGSYSH 164

Query:   420 EAQ---RAPSYIPQRGPGYDLQRGQGYDMRRAPSYD-PSRG--TGFDGAPRG 465
              +Q   R  S   +   GY   +G G    R   YD   RG  +G+ G  RG
Sbjct:   165 NSQDDRREKSRYGEDNRGYGGSQGGG----RG-GYDMDGRGHMSGYSGGDRG 211


>UNIPROTKB|F1LRM7 [details] [associations]
            symbol:Col2a1 "Collagen alpha-1(II) chain" species:10116
            "Rattus norvegicus" [GO:0001502 "cartilage condensation"
            evidence=IEA] [GO:0001894 "tissue homeostasis" evidence=IEA]
            [GO:0001958 "endochondral ossification" evidence=IEA] [GO:0002062
            "chondrocyte differentiation" evidence=IEA] [GO:0003007 "heart
            morphogenesis" evidence=IEA] [GO:0005201 "extracellular matrix
            structural constituent" evidence=IEA] [GO:0005585 "collagen type
            II" evidence=IEA] [GO:0005604 "basement membrane" evidence=IEA]
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0005737
            "cytoplasm" evidence=IEA] [GO:0006029 "proteoglycan metabolic
            process" evidence=IEA] [GO:0007417 "central nervous system
            development" evidence=IEA] [GO:0007601 "visual perception"
            evidence=IEA] [GO:0007605 "sensory perception of sound"
            evidence=IEA] [GO:0010468 "regulation of gene expression"
            evidence=IEA] [GO:0030199 "collagen fibril organization"
            evidence=IEA] [GO:0030903 "notochord development" evidence=IEA]
            [GO:0042472 "inner ear morphogenesis" evidence=IEA] [GO:0042802
            "identical protein binding" evidence=IEA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IEA] [GO:0048407
            "platelet-derived growth factor binding" evidence=IEA] [GO:0060021
            "palate development" evidence=IEA] [GO:0060174 "limb bud formation"
            evidence=IEA] [GO:0060272 "embryonic skeletal joint morphogenesis"
            evidence=IEA] [GO:0060351 "cartilage development involved in
            endochondral bone morphogenesis" evidence=IEA] [GO:0071599 "otic
            vesicle development" evidence=IEA] [GO:0071773 "cellular response
            to BMP stimulus" evidence=IEA] InterPro:IPR000885 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS51461 SMART:SM00038 RGD:2375
            GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201
            GeneTree:ENSGT00660000095287 IPI:IPI00394380
            Ensembl:ENSRNOT00000016044 ArrayExpress:F1LRM7 Uniprot:F1LRM7
        Length = 1419

 Score = 125 (49.1 bits), Expect = 0.00049, P = 0.00049
 Identities = 87/281 (30%), Positives = 99/281 (35%)

Query:   236 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 293
             ADG  G      E    G   G    +   G P   GP       G  GA GP  +T   
Sbjct:   773 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 831

Query:   294 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 349
              AA + G P       P GP      GP G D  K    D    G + DP  +GP   P 
Sbjct:   832 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 886

Query:   350 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 405
              KG PG D   GS  D   GP     +G +   QRG+ G   QRG   +    GP  E  
Sbjct:   887 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 937

Query:   406 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 461
              Q  PG    RGP           P   PG +   G      R  A      RG TG  G
Sbjct:   938 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 997

Query:   462 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 501
             AP    P G  P P    P G       +G+  P G   PA
Sbjct:   998 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1035

 Score = 125 (49.1 bits), Expect = 0.00049, P = 0.00049
 Identities = 89/296 (30%), Positives = 110/296 (37%)

Query:   230 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSAT---TAGVVGAG 286
             P  DR  D    GA G    +  G P G        G P   GPP        A + G  
Sbjct:    64 PRGDR-GDKGERGAPGPRGRD--GEP-GTPGNPGPPGPPGPPGPPGLGGGNFAAQMAGGF 119

Query:   287 PNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAK 343
                +  A      G PM      PRGP G   + GP G+  +     +P   GP   P  
Sbjct:   120 DEKAGGAQMGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GPRG 175

Query:   344 GPGYDPTKGPGYDAQKGSNYDA-QRG-PNYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQR 399
              PG  P   PG D + G    A +RG P     RG    P  GL G    RG P  D  +
Sbjct:   176 PPG--PAGKPGDDGEAGKPGKAGERGLPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAK 231

Query:   400 G----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS---- 450
             G    PG + +   PG +   GP+   +  P    + GP       +G D +  P+    
Sbjct:   232 GEAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPG 289

Query:   451 -YDPSRGTGFDGAP--RG-AAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNP 500
                P+ G GF GAP  +G A P G   P       GS   P   GS  P G  GNP
Sbjct:   290 PVGPAGGPGFPGAPGAKGEAGPTGARGP---EGAQGSRGEPGNPGSPGPAGASGNP 342


>RGD|2375 [details] [associations]
            symbol:Col2a1 "collagen, type II, alpha 1" species:10116 "Rattus
          norvegicus" [GO:0001501 "skeletal system development" evidence=ISO]
          [GO:0001502 "cartilage condensation" evidence=ISO] [GO:0001894
          "tissue homeostasis" evidence=ISO] [GO:0001958 "endochondral
          ossification" evidence=ISO] [GO:0002062 "chondrocyte differentiation"
          evidence=ISO] [GO:0003007 "heart morphogenesis" evidence=ISO]
          [GO:0005201 "extracellular matrix structural constituent"
          evidence=TAS] [GO:0005581 "collagen" evidence=ISO] [GO:0005585
          "collagen type II" evidence=ISO;TAS] [GO:0005604 "basement membrane"
          evidence=ISO] [GO:0005615 "extracellular space" evidence=ISO]
          [GO:0005737 "cytoplasm" evidence=ISO] [GO:0006029 "proteoglycan
          metabolic process" evidence=ISO] [GO:0007601 "visual perception"
          evidence=ISO] [GO:0007605 "sensory perception of sound" evidence=ISO]
          [GO:0010468 "regulation of gene expression" evidence=ISO] [GO:0030199
          "collagen fibril organization" evidence=ISO] [GO:0031012
          "extracellular matrix" evidence=ISO] [GO:0035108 "limb morphogenesis"
          evidence=ISO] [GO:0042472 "inner ear morphogenesis" evidence=ISO]
          [GO:0042802 "identical protein binding" evidence=ISO] [GO:0043066
          "negative regulation of apoptotic process" evidence=ISO] [GO:0046872
          "metal ion binding" evidence=IEA] [GO:0048407 "platelet-derived
          growth factor binding" evidence=ISO] [GO:0048705 "skeletal system
          morphogenesis" evidence=ISO] [GO:0048839 "inner ear development"
          evidence=ISO] [GO:0051216 "cartilage development" evidence=IEP;ISO]
          [GO:0060021 "palate development" evidence=ISO] [GO:0060272 "embryonic
          skeletal joint morphogenesis" evidence=ISO] [GO:0060348 "bone
          development" evidence=ISO] [GO:0060351 "cartilage development
          involved in endochondral bone morphogenesis" evidence=ISO]
          [GO:0071773 "cellular response to BMP stimulus" evidence=ISO]
          InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
          SMART:SM00038 RGD:2375 GO:GO:0046872 GO:GO:0051216 InterPro:IPR008160
          Pfam:PF01391 eggNOG:NOG12793 GO:GO:0005201 HOGENOM:HOG000085654
          HOVERGEN:HBG004933 KO:K06236 CTD:1280 Reactome:REACT_133391
          GO:GO:0005585 EMBL:L48440 EMBL:K02804 EMBL:M10613 EMBL:X79816
          IPI:IPI00394380 PIR:A05152 PIR:I60384 RefSeq:NP_037061.1
          UniGene:Rn.10124 IntAct:P05539 STRING:P05539 PRIDE:P05539
          GeneID:25412 KEGG:rno:25412 UCSC:RGD:2375 NextBio:606543
          ArrayExpress:P05539 Genevestigator:P05539
          GermOnline:ENSRNOG00000022282 Uniprot:P05539
        Length = 1419

 Score = 125 (49.1 bits), Expect = 0.00049, P = 0.00049
 Identities = 87/281 (30%), Positives = 99/281 (35%)

Query:   236 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 293
             ADG  G      E    G   G    +   G P   GP       G  GA GP  +T   
Sbjct:   773 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 831

Query:   294 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 349
              AA + G P       P GP      GP G D  K    D    G + DP  +GP   P 
Sbjct:   832 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 886

Query:   350 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 405
              KG PG D   GS  D   GP     +G +   QRG+ G   QRG   +    GP  E  
Sbjct:   887 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 937

Query:   406 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 461
              Q  PG    RGP           P   PG +   G      R  A      RG TG  G
Sbjct:   938 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 997

Query:   462 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 501
             AP    P G  P P    P G       +G+  P G   PA
Sbjct:   998 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1035


>DICTYBASE|DDB_G0279193
            symbol:rpb1 "RNA polymerase II core subunit"
            species:44689 "Dictyostelium discoideum" [GO:0005634 "nucleus"
            evidence=IEA;IDA] [GO:0006366 "transcription from RNA polymerase II
            promoter" evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0005665 "DNA-directed RNA polymerase II, core
            complex" evidence=IEA;ISS] [GO:0003899 "DNA-directed RNA polymerase
            activity" evidence=IEA;ISS] [GO:0003677 "DNA binding" evidence=IEA]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=ISS] [GO:0046872 "metal ion binding" evidence=IEA]
            [GO:0016779 "nucleotidyltransferase activity" evidence=IEA]
            [GO:0016740 "transferase activity" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000684
            InterPro:IPR000722 InterPro:IPR006592 InterPro:IPR007066
            InterPro:IPR007073 InterPro:IPR007075 InterPro:IPR007080
            InterPro:IPR007081 InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983
            Pfam:PF04990 Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000
            Pfam:PF05001 PROSITE:PS00115 SMART:SM00663 dictyBase:DDB_G0279193
            GO:GO:0006355 GenomeReviews:CM000152_GR GO:GO:0046872 GO:GO:0003677
            GO:GO:0006366 Gene3D:2.40.40.20 InterPro:IPR009010
            EMBL:AAFI02000030 GO:GO:0003899 eggNOG:COG0086 GO:GO:0005665
            OMA:KVLPWST EMBL:S52651 PIR:A56823 RefSeq:XP_641735.1 STRING:P35084
            PRIDE:P35084 EnsemblProtists:DDB0215406 GeneID:8621932
            KEGG:ddi:DDB_G0279193 KO:K03006 ProtClustDB:CLSZ2428993
            Uniprot:P35084
        Length = 1727

 Score = 135 (52.6 bits), Expect = 0.00050, Sum P(2) = 0.00050
 Identities = 65/219 (29%), Positives = 85/219 (38%)

Query:   287 PNTSTSAYA-ATQSGTPMRAAYDIPRGPGYEASKG---------PGYDASKA--PSYDP- 333
             P + T +Y+    S TP    YD P  P  E  +G         PGY+A+K+   SY   
Sbjct:  1488 PGSQTPSYSYGDGSTTPFHNPYDAPLSPFNETFRGDFSPSAMNSPGYNANKSYGSSYQYF 1547

Query:   334 TKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGP 393
              + P+Y P   P Y PT  P Y     S Y +   P+Y     PSY P     Y     P
Sbjct:  1548 PQSPTYSPTS-PSYSPTS-PSYSPTSPS-Y-SPTSPSYS-PTSPSYSPTSP-SYS-PTSP 1600

Query:   394 NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDP 453
              Y     P Y     P Y     P Y +  +PSY P   P Y       Y    +PSY P
Sbjct:  1601 FYS-PTSPSYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSP 1653

Query:   454 SRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS 492
             +  +    +P   +P      P +  P  S T P+ S S
Sbjct:  1654 TSPSYSPTSP-SYSPTSPSYSPTS--PSYSPTSPSYSPS 1689

 Score = 40 (19.1 bits), Expect = 0.00050, Sum P(2) = 0.00050
 Identities = 12/43 (27%), Positives = 20/43 (46%)

Query:   195 KKFYNDHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRRAD 237
             +K +N  ++  +V + N   +  E+EKL A L      D   D
Sbjct:   978 QKLFN--IDIRRVSDLNPAVVVLEIEKLVARLKIIATADTTED 1018


>UNIPROTKB|F1RIA5
            symbol:VPS37C "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR009851
            Pfam:PF07200 PROSITE:PS51314 KO:K12185 GeneTree:ENSGT00530000063123
            OMA:VERCQEQ EMBL:CU914270 RefSeq:XP_003122720.1
            Ensembl:ENSSSCT00000032280 GeneID:100511491 KEGG:ssc:100511491
            Uniprot:F1RIA5
        Length = 358

 Score = 91 (37.1 bits), Expect = 0.00050, Sum P(2) = 0.00050
 Identities = 38/117 (32%), Positives = 48/117 (41%)

Query:   272 GPP-PSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDA--SK 327
             GPP PSA        GP  S     + Q  TP R  Y + P G     + GPGY     +
Sbjct:   251 GPPYPSAQP------GPRASAGYSWSPQRSTPPRPGYPVAPTG-----ASGPGYPVVGGR 299

Query:   328 APSYD-PTKGPSYDPAKGPGYDPTKG--PGYDAQKGSNYDAQRGPNYDIH--RGPSY 379
             APS   P + P   P   P Y PT+   PG+  Q    Y     P Y     +GP++
Sbjct:   300 APSPGYPQQPPYLSPGGKPPY-PTQPQPPGFAGQPQPPYPPGPAPPYGFPPPQGPTW 355

 Score = 89 (36.4 bits), Expect = 0.00083, Sum P(2) = 0.00083
 Identities = 53/188 (28%), Positives = 64/188 (34%)

Query:   310 PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGP 369
             PR P  +A+     D    P   P   PS  P     Y P+  PG      ++   Q  P
Sbjct:   179 PR-PSPQATPPVAEDRQPPPPLPPPPQPSVVPPYPLPYSPS--PGMSVGPTAHGALQPAP 235

Query:   370 NYDIHRGPSYDPQRGLG--Y-DMQRGPNYDM--QRGPGYETQRVPGYDVQ----RGPVYE 420
              + +   PS+     LG  Y   Q GP         P   T   PGY V      GP Y 
Sbjct:   236 -FPVVSQPSFSYSGPLGPPYPSAQPGPRASAGYSWSPQRSTPPRPGYPVAPTGASGPGYP 294

Query:   421 AQ--RAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNN 478
                 RAPS      PGY  Q        + P     +  GF G P+   P G  PP    
Sbjct:   295 VVGGRAPS------PGYPQQPPYLSPGGKPPYPTQPQPPGFAGQPQPPYPPGPAPPYGFP 348

Query:   479 VPYGSATP 486
              P G   P
Sbjct:   349 PPQGPTWP 356

 Score = 70 (29.7 bits), Expect = 0.00050, Sum P(2) = 0.00050
 Identities = 31/143 (21%), Positives = 65/143 (45%)

Query:    44 MMPPPEVMEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKSE 103
             M   PE ++ ++A +  E+Q L  E +   AT+ +L +     Q  L+I    +    S+
Sbjct:    15 MQNDPEAID-RLAQESPEVQDLQLEREMALATNRSLAERNLEFQGPLEISRSNL----SD 69

Query:   104 RELQMRNLTEKIAKMEAEL-KTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQR 162
             +  ++R L E+  + +A+L K +  ++L       + + + +  EE  A   +  +    
Sbjct:    70 KYQELRKLVERCQEQKAKLEKFSSALQLGTLLDLLQIEGMKI-EEESEAMAEKFLEGEVP 128

Query:   163 AHTDVQQIPAL--LSELESLRQE 183
               T ++   ++  LS L  +R E
Sbjct:   129 LETFLETFSSMRMLSHLRRVRVE 151


>UNIPROTKB|E7ENY8
            symbol:COL3A1 "Collagen alpha-1(III) chain" species:9606
            "Homo sapiens" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 EMBL:AC066694 HGNC:HGNC:2201
            ChiTaRS:COL3A1 IPI:IPI00981037 PDB:4GYX PDBsum:4GYX
            ProteinModelPortal:E7ENY8 SMR:E7ENY8 PRIDE:E7ENY8
            Ensembl:ENST00000317840 ArrayExpress:E7ENY8 Bgee:E7ENY8
            Uniprot:E7ENY8
        Length = 1163

 Score = 124 (48.7 bits), Expect = 0.00050, P = 0.00050
 Identities = 81/280 (28%), Positives = 101/280 (36%)

Query:   236 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP----NTS 290
             A G   G  G +       P G + +    G P   GPP     AG  G  GP      S
Sbjct:   165 AVGGLAGYPGPAGPPGPPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPSGPPGPPGAIGPS 224

Query:   291 TSAYAATQSGTPMRAAY-DIPRGPGYEASKG-PGYDASKAP-SYDPTKGPSYDPAKGPGY 347
               A    +SG P R     +P  PG +   G PG+   K    +D   G   +    PG 
Sbjct:   225 GPAGKDGESGRPGRPGERGLPGPPGIKGPAGIPGFPGMKGHRGFDGRNGEKGETG-APGL 283

Query:   348 DPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGP-GYE 404
                 G PG +   G      RG   +  R P      G  G D  RG   D Q GP G  
Sbjct:   284 KGENGLPGENGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQPGPPGPP 338

Query:   405 -TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 463
              T   PG    +G V  A    S      PG   QRG+      A +  P    G +G+P
Sbjct:   339 GTAGFPGSPGAKGEVGPAGSPGS---NGAPG---QRGEPGPQGHAGAQGPPGPPGINGSP 392

Query:   464 RGAAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RGG 498
              G    G  P  +   P   G+  PP  +G+ G P  RGG
Sbjct:   393 GGKGEMG--PAGIPGAPGLMGARGPPGPAGANGAPGLRGG 430

 Score = 123 (48.4 bits), Expect = 0.00065, P = 0.00065
 Identities = 85/284 (29%), Positives = 101/284 (35%)

Query:   236 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-G-PNTSTSA 293
             A G  GGA    +N   G P G        G+P   G P +    G  G+ G P  +   
Sbjct:   424 APGLRGGAGEPGKNGAKGEP-GPRGERGEAGIP---GVPGAKGEDGKDGSPGEPGANGLP 479

Query:   294 YAATQSGTPMRAAYDIPRGP-GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG 352
              AA + G P    +  P GP G    KGP  +   AP   P  GP    A  PG D   G
Sbjct:   480 GAAGERGAP---GFRGPAGPNGIPGEKGPAGERG-APG--PA-GPR-GAAGEPGRDGVPG 531

Query:   353 -PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDM-QRG----PGYE-T 405
              PG     GS      GP  D   GP    Q   G     GP+    Q G    PG +  
Sbjct:   532 GPGMRGMPGS----PGGPGSDGKPGPP-GSQGESGRPGPPGPSGPRGQPGVMGFPGPKGN 586

Query:   406 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG-QGYDMRRAPSYD-----PSRGTGF 459
                PG + +RG        P   PQ  PG + + G QG      P  D     P    G 
Sbjct:   587 DGAPGKNGERG----GPGGPG--PQGPPGKNGETGPQGPPGPTGPGGDKGDTGPPGPQGL 640

Query:   460 DGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPARR 503
              G P    P G+   P    P G A  P   G G+   G P  R
Sbjct:   641 QGLPGTGGPPGENGKPGEPGPKGDAGAPGAPG-GKGDAGAPGER 683


>UNIPROTKB|A4FU28
            symbol:CTAGE9 "Cutaneous T-cell lymphoma-associated antigen
            9" species:9606 "Homo sapiens" [GO:0016021 "integral to membrane"
            evidence=IEA] GO:GO:0016021 HOVERGEN:HBG051216 HOGENOM:HOG000112043
            OrthoDB:EOG4WSWC5 EMBL:AC005587 EMBL:BC101322 IPI:IPI00740858
            RefSeq:NP_001139131.1 UniGene:Hs.632613 ProteinModelPortal:A4FU28
            PhosphoSite:A4FU28 PRIDE:A4FU28 Ensembl:ENST00000314099
            GeneID:643854 KEGG:hsa:643854 UCSC:uc011ece.2 CTD:643854
            GeneCards:GC06M132030 HGNC:HGNC:37275 neXtProt:NX_A4FU28
            PharmGKB:PA165617886 OMA:CEGLESS PhylomeDB:A4FU28 GenomeRNAi:643854
            NextBio:115484 Bgee:A4FU28 Uniprot:A4FU28
        Length = 777

 Score = 122 (48.0 bits), Expect = 0.00051, P = 0.00051
 Identities = 106/470 (22%), Positives = 181/470 (38%)

Query:    46 PPPEVMEQKI--ASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKSE 103
             PP   +++ I  A  +V ++ L  E   +      + +        ++ L  Q   ++SE
Sbjct:   310 PPKGALKKLIHAAKLNVSLKSLEGERNHIIIQLSEVDKTKEELTEHIKNLQTQQASLQSE 369

Query:   104 R---ELQMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDL 160
                 E + + L +K+ K+  E      +KL ++K   E +N  +  EE +++V +    +
Sbjct:   370 NIYFESENQKLQQKL-KIMTEFYQENEMKL-YRKLTVE-ENYRIEEEEKLSRVEE---KI 423

Query:   161 QRAHTDVQQIPALLSELES-LRQEYH-HCRGTYEYEKKFYNDHLESLQVMEKNYITMATE 218
               A  +++    L  +LE  L +  H + +    YEK+ +++ L + +  E+N   +  E
Sbjct:   424 SHATEELETYRKLAKDLEEELERTVHFYQKQVISYEKRGHDNWLAA-RTAERNLSDLRKE 482

Query:   219 ----VEKL-RAEL-MNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYED-GYGVPQGH 271
                  +KL   EL       D  A      A G   +  S  P+G+ + E   +  PQ  
Sbjct:   483 NAHNKQKLTERELKFELLEKDPNALDVSNTAFGREHSPCSPSPLGRPSSETRAFPSPQTL 542

Query:   272 GPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG-PGYEA----SKGPGYDAS 326
                P   +  + G G    +S       G P+       RG P Y+      + P    S
Sbjct:   543 LEDPLRLSPVLPGGGGRGPSSP------GNPLDHQITNERGEPSYDRLIDPHRAPSDTGS 596

Query:   327 KAPSYDPTKGPSYDPAKGPGY-DPTKGPGYDAQKGSNYDAQRGPNYD-IHRGPSYDPQ-R 383
              +   +  +   + P  G  Y D T  P  + +  SN +   GP      +  S D   R
Sbjct:   597 LSSPVEQDRRMMFPPP-GQSYPDSTLPPQREDRFYSNSERLSGPAEPRSFKMTSLDKMDR 655

Query:   384 GLGYDMQRGPNYDMQRGPG---YETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG 440
              +  +M+   N D +   G        +P  +   GP       P   P  GP + +   
Sbjct:   656 SMPSEMESSRN-DAKDDLGNLNVPDSSLPAENEATGP---GLIPPPLAPISGPLFPVDT- 710

Query:   441 QGYDMRRAPSYDPSR-GTGFDGAPRGAAPHGQVPPPLNNVPYGSAT--PP 487
             +G  MRR P + P   GT F GA RG  P    P P  + P+      PP
Sbjct:   711 RGPFMRRGPPFPPPPPGTMF-GASRGYFPPRDFPGP-PHAPFAMRNIYPP 758


>UNIPROTKB|F1LP41
            symbol:Col2a1 "Collagen alpha-1(II) chain" species:10116
            "Rattus norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 RGD:2375 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 IPI:IPI00205809
            Ensembl:ENSRNOT00000012441 ArrayExpress:F1LP41 Uniprot:F1LP41
        Length = 1458

 Score = 125 (49.1 bits), Expect = 0.00051, P = 0.00051
 Identities = 87/281 (30%), Positives = 99/281 (35%)

Query:   236 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 293
             ADG  G      E    G   G    +   G P   GP       G  GA GP  +T   
Sbjct:   812 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 870

Query:   294 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 349
              AA + G P       P GP      GP G D  K    D    G + DP  +GP   P 
Sbjct:   871 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 925

Query:   350 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 405
              KG PG D   GS  D   GP     +G +   QRG+ G   QRG   +    GP  E  
Sbjct:   926 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 976

Query:   406 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 461
              Q  PG    RGP           P   PG +   G      R  A      RG TG  G
Sbjct:   977 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 1036

Query:   462 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 501
             AP    P G  P P    P G       +G+  P G   PA
Sbjct:  1037 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1074


>UNIPROTKB|P02453
            symbol:COL1A1 "Collagen alpha-1(I) chain" species:9913 "Bos
            taurus" [GO:0090263 "positive regulation of canonical Wnt receptor
            signaling pathway" evidence=IEA] [GO:0071260 "cellular response to
            mechanical stimulus" evidence=IEA] [GO:0071230 "cellular response
            to amino acid stimulus" evidence=IEA] [GO:0070208 "protein
            heterotrimerization" evidence=IEA] [GO:0060351 "cartilage
            development involved in endochondral bone morphogenesis"
            evidence=IEA] [GO:0060346 "bone trabecula formation" evidence=IEA]
            [GO:0060325 "face morphogenesis" evidence=IEA] [GO:0048706
            "embryonic skeletal system development" evidence=IEA] [GO:0048407
            "platelet-derived growth factor binding" evidence=IEA] [GO:0045893
            "positive regulation of transcription, DNA-dependent" evidence=IEA]
            [GO:0043589 "skin morphogenesis" evidence=IEA] [GO:0042802
            "identical protein binding" evidence=IEA] [GO:0034505 "tooth
            mineralization" evidence=IEA] [GO:0034504 "protein localization to
            nucleus" evidence=IEA] [GO:0032964 "collagen biosynthetic process"
            evidence=IEA] [GO:0030335 "positive regulation of cell migration"
            evidence=IEA] [GO:0030199 "collagen fibril organization"
            evidence=IEA] [GO:0015031 "protein transport" evidence=IEA]
            [GO:0010812 "negative regulation of cell-substrate adhesion"
            evidence=IEA] [GO:0010718 "positive regulation of epithelial to
            mesenchymal transition" evidence=IEA] [GO:0007605 "sensory
            perception of sound" evidence=IEA] [GO:0007601 "visual perception"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005615
            "extracellular space" evidence=IEA] [GO:0005584 "collagen type I"
            evidence=IEA] [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0001958 "endochondral ossification"
            evidence=IEA] [GO:0001957 "intramembranous ossification"
            evidence=IEA] [GO:0001649 "osteoblast differentiation"
            evidence=IEA] [GO:0001568 "blood vessel development" evidence=IEA]
            [GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000885
            InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410 ProDom:PD002078
            PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038
            SMART:SM00214 GO:GO:0005737 GO:GO:0045893 GO:GO:0005615
            GO:GO:0046872 GO:GO:0015031 GO:GO:0007601 GO:GO:0030199
            GO:GO:0007605 GO:GO:0010718 GO:GO:0030335 GO:GO:0070208
            InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793 GO:GO:0071260
            GO:GO:0001568 GO:GO:0001649 GO:GO:0034505 GO:GO:0090263
            GO:GO:0010812 GO:GO:0060325 GO:GO:0032964 GO:GO:0071230
            GO:GO:0048706 GO:GO:0001957 GO:GO:0034504 GO:GO:0001958
            GO:GO:0060351 GO:GO:0005201 GO:GO:0043589 EMBL:BC105184
            IPI:IPI00707857 PIR:A91193 RefSeq:NP_001029211.1 UniGene:Bt.23316
            IntAct:P02453 STRING:P02453 PRIDE:P02453 Ensembl:ENSBTAT00000017420
            GeneID:282187 KEGG:bta:282187 CTD:1277 GeneTree:ENSGT00660000095287
            HOGENOM:HOG000085654 HOVERGEN:HBG004933 InParanoid:P02453 KO:K06236
            OMA:VAYMDQQ OrthoDB:EOG4S4PHP NextBio:20806015 PMAP-CutDB:P02453
            ArrayExpress:P02453 GO:GO:0005584 GO:GO:0060346 Uniprot:P02453
        Length = 1463

 Score = 125 (49.1 bits), Expect = 0.00051, P = 0.00051
 Identities = 90/286 (31%), Positives = 109/286 (38%)

Query:   236 ADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNTSTS 292
             ADG  G  G  G++  +    P G  A   G   P G+ G P      G   AGP  +T 
Sbjct:   818 ADGQPGAKGEPGDAGAKGDAGPPGP-AGPAGPPGPIGNVGAPGPKGARG--SAGPPGATG 874

Query:   293 -AYAATQSGTPMRAAYDIPRGP----GYEASKGPGYDASKA--PSYDPTKGPSYDPA--K 343
                AA + G P  +    P GP    G E SKGP  +   A  P      GP   PA  K
Sbjct:   875 FPGAAGRVGPPGPSGNAGPPGPPGPAGKEGSKGPRGETGPAGRPGEVGPPGPP-GPAGEK 933

Query:   344 G-PGYD-P-----TKGP-GYDAQKGS-NYDAQRGPN-YDIHRGPSYDP-QRG-LGYDMQR 391
             G PG D P     T GP G   Q+G      QRG   +    GPS +P ++G  G   +R
Sbjct:   934 GAPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQGPSGASGER 993

Query:   392 GPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSY 451
             GP   M  GP       PG     GP  E+ R  +   +  PG D   G   D       
Sbjct:   994 GPPGPM--GP-------PGL---AGPPGESGREGAPGAEGSPGRDGSPGAKGDRGETGPA 1041

Query:   452 DPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 497
              P    G  GAP    P G+        P G A P    G+  P G
Sbjct:  1042 GPPGAPGAPGAPGPVGPAGKSGDRGETGPAGPAGPIGPVGARGPAG 1087

 Score = 124 (48.7 bits), Expect = 0.00065, P = 0.00065
 Identities = 82/275 (29%), Positives = 108/275 (39%)

Query:   240 YGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQS 299
             + GA G ++ E  G P G    E   GV    GPP  A  AG  G  P       A   +
Sbjct:   344 FPGAVG-AKGE--GGPQGPRGSEGPQGVRGEPGPPGPAGAAGPAG-NPGADGQPGAKGAN 399

Query:   300 GTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDP-TKGPSYDPAKGPGYDPTKGP-GYDA 357
             G P      I   PG+  ++GP     + PS  P  KG S +P   PG   +KG  G   
Sbjct:   400 GAP-----GIAGAPGFPGARGPS--GPQGPSGPPGPKGNSGEPG-APG---SKGDTGAKG 448

Query:   358 QKG-SNYDAQRGP-NYDIHRGPSYDP-QRGL-GYDMQRGPNYDMQRGPGYETQRVPGYDV 413
             + G +      GP   +  RG   +P   GL G   +RG       GPG  ++  PG D 
Sbjct:   449 EPGPTGIQGPPGPAGEEGKRGARGEPGPAGLPGPPGERG-------GPG--SRGFPGADG 499

Query:   414 QRGPVYEA-QR-APSYI-PQRGPGYDLQRGQGYDMRRAPSYDPSRGT-GFDGAPRGAAPH 469
               GP   A +R AP    P+  PG   + G+   +  A     S G+ G DG      P 
Sbjct:   500 VAGPKGPAGERGAPGPAGPKGSPGEAGRPGEA-GLPGAKGLTGSPGSPGPDGKTGPPGPA 558

Query:   470 GQVPPPLNNVPYGSATPPARSGSGQPRG--GNPAR 502
             GQ   P    P G+       G   P+G  G P +
Sbjct:   559 GQDGRPGPPGPPGARGQAGVMGFPGPKGAAGEPGK 593


>UNIPROTKB|E1BLD0
            symbol:LOC100847165 "Uncharacterized protein" species:9913
            "Bos taurus" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0005622 "intracellular" evidence=IEA] [GO:0003676 "nucleic acid
            binding" evidence=IEA] Pfam:PF00096 InterPro:IPR007087
            InterPro:IPR013087 InterPro:IPR015880 PROSITE:PS00028
            PROSITE:PS50157 SMART:SM00355 GO:GO:0008270 GO:GO:0003676
            GO:GO:0005622 Gene3D:3.30.160.60 GeneTree:ENSGT00700000104520
            OMA:SRYESQN EMBL:DAAA02057905 IPI:IPI00717370
            Ensembl:ENSBTAT00000061583 Uniprot:E1BLD0
        Length = 540

 Score = 120 (47.3 bits), Expect = 0.00051, P = 0.00051
 Identities = 40/160 (25%), Positives = 70/160 (43%)

Query:   227 MNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYED---GYGV-PQGHGPPPSATTAGV 282
             M +P+     +GS  G    +E E   +  G   YE     +G+ PQ  G  P +     
Sbjct:    15 MQSPDEMGSPEGSLKGNMSENEEEEISQQEGTGDYEVEEIAFGLEPQSPGFGPQSPEFEP 74

Query:   283 VGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPA 342
                     +  + +   G    +    PR P  + S+ P ++  ++P Y+P + P Y+P 
Sbjct:    75 QSPRFEPESPGFESRSPGFVPPSPEFAPRSPESD-SQSPDFEP-QSPRYEP-QSPGYEP- 130

Query:   343 KGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQ 382
             K PGY+P + PGY+  K   Y+ Q  P +   + P ++ +
Sbjct:   131 KSPGYEP-RSPGYEP-KSPGYEPQN-PEFKT-QSPEFEAE 166


>UNIPROTKB|O43186
            symbol:CRX "Cone-rod homeobox protein" species:9606 "Homo
            sapiens" [GO:0043565 "sequence-specific DNA binding" evidence=IEA]
            [GO:0006351 "transcription, DNA-dependent" evidence=IEA]
            [GO:0007601 "visual perception" evidence=IEA] [GO:0050896 "response
            to stimulus" evidence=IEA] [GO:0003682 "chromatin binding"
            evidence=IEA] [GO:0003700 "sequence-specific DNA binding
            transcription factor activity" evidence=IEA] [GO:0005667
            "transcription factor complex" evidence=IEA] [GO:0045944 "positive
            regulation of transcription from RNA polymerase II promoter"
            evidence=IEA] [GO:0060041 "retina development in camera-type eye"
            evidence=IEA] [GO:0043522 "leucine zipper domain binding"
            evidence=IPI] [GO:0009887 "organ morphogenesis" evidence=TAS]
            InterPro:IPR001356 InterPro:IPR009057 InterPro:IPR013851
            InterPro:IPR017970 Pfam:PF00046 Pfam:PF03529 PROSITE:PS00027
            PROSITE:PS50071 SMART:SM00389 GO:GO:0007601 GO:GO:0043565
            GO:GO:0045944 GO:GO:0003700 GO:GO:0006351 GO:GO:0003682
            Orphanet:1872 Orphanet:791 GO:GO:0050896 Gene3D:1.10.10.60
            SUPFAM:SSF46689 GO:GO:0005667 GO:GO:0009887 GO:GO:0060041
            Orphanet:65 MIM:268000 CTD:1406 eggNOG:NOG324074
            HOGENOM:HOG000082677 HOVERGEN:HBG004028 KO:K09337 OMA:QTKARPA
            OrthoDB:EOG4NKBWG EMBL:AF024711 EMBL:BT007364 EMBL:AC008745
            EMBL:BC016664 EMBL:BC053672 IPI:IPI00011226 RefSeq:NP_000545.1
            UniGene:Hs.617342 UniGene:Hs.633434 UniGene:Hs.639114
            ProteinModelPortal:O43186 SMR:O43186 IntAct:O43186
            MINT:MINT-1442706 STRING:O43186 PhosphoSite:O43186 PRIDE:O43186
            DNASU:1406 Ensembl:ENST00000221996 Ensembl:ENST00000539067
            Ensembl:ENST00000556900 Ensembl:ENST00000557738 GeneID:1406
            KEGG:hsa:1406 UCSC:uc002phq.4 GeneCards:GC19P048327 HGNC:HGNC:2383
            HPA:HPA036762 HPA:HPA036763 MIM:120970 MIM:602225 MIM:613829
            neXtProt:NX_O43186 PharmGKB:PA26903 InParanoid:O43186
            PhylomeDB:O43186 ChiTaRS:CRX GenomeRNAi:1406 NextBio:5749
            ArrayExpress:O43186 Bgee:O43186 CleanEx:HS_CRX
            Genevestigator:O43186 GermOnline:ENSG00000105392 Uniprot:O43186
        Length = 299

 Score = 116 (45.9 bits), Expect = 0.00052, P = 0.00052
 Identities = 29/98 (29%), Positives = 42/98 (42%)

Query:   268 PQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASK 327
             P    P P A  AG+V +GP+ +++ YA T +  P  A    P   G  +S   G D   
Sbjct:   165 PASESPLPEAQRAGLVASGPSLTSAPYAMTYA--PASAFCSSPSAYGSPSSYFSGLDPYL 222

Query:   328 APSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDA 365
             +P      GP+  P  GP   P+      +  G +Y A
Sbjct:   223 SPMVPQLGGPALSPLSGPSVGPSLAQSPTSLSGQSYGA 260


>UNIPROTKB|F1LN37
            symbol:Col2a1 "Collagen alpha-1(II) chain" species:10116
            "Rattus norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 RGD:2375 GO:GO:0005737 GO:GO:0043066
            GO:GO:0005615 GO:GO:0003007 GO:GO:0007601 GO:GO:0030199
            GO:GO:0007417 GO:GO:0042472 GO:GO:0001894 GO:GO:0007605
            GO:GO:0071773 InterPro:IPR008160 Pfam:PF01391 GO:GO:0071599
            GO:GO:0005604 GO:GO:0001502 GO:GO:0060021 GO:GO:0002062
            GO:GO:0010468 GO:GO:0060272 GO:GO:0006029 GO:GO:0001958
            GO:GO:0060351 GO:GO:0005201 GO:GO:0005585 GO:GO:0060174
            GO:GO:0030903 IPI:IPI00388575 Ensembl:ENSRNOT00000037840
            ArrayExpress:F1LN37 Uniprot:F1LN37
        Length = 1487

 Score = 125 (49.1 bits), Expect = 0.00052, P = 0.00052
 Identities = 87/281 (30%), Positives = 99/281 (35%)

Query:   236 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 293
             ADG  G      E    G   G    +   G P   GP       G  GA GP  +T   
Sbjct:   841 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 899

Query:   294 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 349
              AA + G P       P GP      GP G D  K    D    G + DP  +GP   P 
Sbjct:   900 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 954

Query:   350 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 405
              KG PG D   GS  D   GP     +G +   QRG+ G   QRG   +    GP  E  
Sbjct:   955 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 1005

Query:   406 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 461
              Q  PG    RGP           P   PG +   G      R  A      RG TG  G
Sbjct:  1006 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 1065

Query:   462 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 501
             AP    P G  P P    P G       +G+  P G   PA
Sbjct:  1066 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1103

 Score = 125 (49.1 bits), Expect = 0.00052, P = 0.00052
 Identities = 89/296 (30%), Positives = 110/296 (37%)

Query:   230 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSAT---TAGVVGAG 286
             P  DR  D    GA G    +  G P G        G P   GPP        A + G  
Sbjct:   132 PRGDR-GDKGERGAPGPRGRD--GEP-GTPGNPGPPGPPGPPGPPGLGGGNFAAQMAGGF 187

Query:   287 PNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAK 343
                +  A      G PM      PRGP G   + GP G+  +     +P   GP   P  
Sbjct:   188 DEKAGGAQMGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GPRG 243

Query:   344 GPGYDPTKGPGYDAQKGSNYDA-QRG-PNYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQR 399
              PG  P   PG D + G    A +RG P     RG    P  GL G    RG P  D  +
Sbjct:   244 PPG--PAGKPGDDGEAGKPGKAGERGLPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAK 299

Query:   400 G----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS---- 450
             G    PG + +   PG +   GP+   +  P    + GP       +G D +  P+    
Sbjct:   300 GEAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPG 357

Query:   451 -YDPSRGTGFDGAP--RG-AAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNP 500
                P+ G GF GAP  +G A P G   P       GS   P   GS  P G  GNP
Sbjct:   358 PVGPAGGPGFPGAPGAKGEAGPTGARGP---EGAQGSRGEPGNPGSPGPAGASGNP 410


>UNIPROTKB|F1NI72
            symbol:COL3A1 "Collagen alpha-1(III) chain" species:9031
            "Gallus gallus" [GO:0001568 "blood vessel development"
            evidence=IEA] [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005586 "collagen type III"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0007160 "cell-matrix adhesion" evidence=IEA] [GO:0007179
            "transforming growth factor beta receptor signaling pathway"
            evidence=IEA] [GO:0007229 "integrin-mediated signaling pathway"
            evidence=IEA] [GO:0007507 "heart development" evidence=IEA]
            [GO:0009314 "response to radiation" evidence=IEA] [GO:0018149
            "peptide cross-linking" evidence=IEA] [GO:0030199 "collagen fibril
            organization" evidence=IEA] [GO:0032964 "collagen biosynthetic
            process" evidence=IEA] [GO:0034097 "response to cytokine stimulus"
            evidence=IEA] [GO:0042060 "wound healing" evidence=IEA] [GO:0043206
            "extracellular fibril organization" evidence=IEA] [GO:0043588 "skin
            development" evidence=IEA] [GO:0046332 "SMAD binding" evidence=IEA]
            [GO:0048407 "platelet-derived growth factor binding" evidence=IEA]
            [GO:0048565 "digestive tract development" evidence=IEA] [GO:0050777
            "negative regulation of immune response" evidence=IEA] [GO:0071230
            "cellular response to amino acid stimulus" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0005615 GO:GO:0034097
            GO:GO:0030199 GO:GO:0007179 GO:GO:0007229 GO:GO:0007160
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0042060 GO:GO:0050777
            GO:GO:0009314 GO:GO:0018149 GO:GO:0071230 GO:GO:0043206
            GO:GO:0005201 GeneTree:ENSGT00660000095287 GO:GO:0005586
            EMBL:AADN02034558 EMBL:AADN02034559 IPI:IPI00589264
            Ensembl:ENSGALT00000004033 OMA:ETCLSAN ArrayExpress:F1NI72
            Uniprot:F1NI72
        Length = 1498

 Score = 125 (49.1 bits), Expect = 0.00052, P = 0.00052
 Identities = 78/276 (28%), Positives = 97/276 (35%)

Query:   242 GATGN-SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSG 300
             G  G   +N   G P G        G P   GPP      G  G  P  +       + G
Sbjct:   464 GTPGEPGKNGAKGDP-GPKGERGENGTPGAPGPPGEEGKRGANGE-PGQNGVPGTPGERG 521

Query:   301 TPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAK--GPGYDPTKG-PGYD 356
             +P      +P   G    KGP G   S  P   P+ GP+ D  +  GPG    +G PG  
Sbjct:   522 SP--GFRGLPGSNGLPGEKGPAGERGSPGPP-GPS-GPAGDRGQDGGPGLPGMRGLPGIP 577

Query:   357 AQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYD--MQRGPGYE-TQRVPGYDV 413
                GS  D + GP      G   +P R  G     GP     +   PG +  +  PG + 
Sbjct:   578 GSPGS--DGKPGPP-----GNQGEPGRS-GPPGPAGPRGQPGVMGFPGPKGNEGAPGKNG 629

Query:   414 QRGPVYEAQRAPSYIPQRG----PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPH 469
             +RGP       P    + G    PG     G   D R  P   PS   G  G P G  P 
Sbjct:   630 ERGPG-GPPGTPGPAGKNGDVGLPGPPGPAGPAGD-RGEPG--PSGSPGLQGLPGGPGPA 685

Query:   470 GQVPPPLNNVPYGSATPPARSGSGQPRGGN--PARR 503
             G+   P    P G    P   G   P+G N  P  R
Sbjct:   686 GENGKPGEPGPKGDIGGPGFPG---PKGENGIPGER 718

 Score = 123 (48.4 bits), Expect = 0.00086, P = 0.00086
 Identities = 84/275 (30%), Positives = 104/275 (37%)

Query:   252 SGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 311
             +G P G        G+P   G P      G+ G  P TS +  A    G P +       
Sbjct:   424 AGSP-GNKGEMGPSGIPGAPGLPGGR---GLPGP-PGTSGNPGAKGTPGEPGKNGAKGDP 478

Query:   312 GP-GYEASKG-PGYDASKAPSYDPTKGPSYDPAKG--PGYDPTKG-PGYDAQKGSN-YDA 365
             GP G     G PG  A   P  +  +G + +P +   PG    +G PG+    GSN    
Sbjct:   479 GPKGERGENGTPG--APGPPGEEGKRGANGEPGQNGVPGTPGERGSPGFRGLPGSNGLPG 536

Query:   366 QRGPNYDIHR----GPSYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVPGYDVQRGPVYE 420
             ++GP  +       GPS  P    G D   GP     RG PG      PG D + GP   
Sbjct:   537 EKGPAGERGSPGPPGPS-GPAGDRGQD--GGPGLPGMRGLPGIPGS--PGSDGKPGPPGN 591

Query:   421 AQRAPSYIPQRGP-GYDLQRG-QGYDMRR----APSYDPSRGTGFD-GAPRGAAPHGQV- 472
              Q  P      GP G   Q G  G+   +    AP  +  RG G   G P  A  +G V 
Sbjct:   592 -QGEPGRSGPPGPAGPRGQPGVMGFPGPKGNEGAPGKNGERGPGGPPGTPGPAGKNGDVG 650

Query:   473 -P-PPLNNVPYGSATPPARSGS----GQPRGGNPA 501
              P PP    P G    P  SGS    G P G  PA
Sbjct:   651 LPGPPGPAGPAGDRGEPGPSGSPGLQGLPGGPGPA 685


>UNIPROTKB|E2QSE6
            symbol:TPR "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0006606 "protein import into nucleus"
            evidence=IEA] [GO:0006434 "seryl-tRNA aminoacylation" evidence=IEA]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005643 "nuclear pore"
            evidence=IEA] [GO:0005524 "ATP binding" evidence=IEA] [GO:0004828
            "serine-tRNA ligase activity" evidence=IEA] InterPro:IPR012929
            InterPro:IPR015866 Pfam:PF07926 GO:GO:0005524 GO:GO:0005737
            GO:GO:0005643 GO:GO:0006606 InterPro:IPR009053 SUPFAM:SSF46579
            GO:GO:0004828 GO:GO:0006434 Gene3D:1.10.287.40
            Ensembl:ENSCAFT00000021777 Uniprot:E2QSE6
        Length = 2366

 Score = 127 (49.8 bits), Expect = 0.00053, P = 0.00053
 Identities = 42/187 (22%), Positives = 88/187 (47%)

Query:    48 PEVME-QKIASQ---HVE-MQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKS 102
             P+  E +K+ S+   H + +Q+L  E  RL A        L   Q+ +Q L   +  +++
Sbjct:  1351 PDTEEYRKLLSEKEVHTKRIQQLTEEIGRLKAEIARSNASLTNNQNLIQSLKEDLNKVRT 1410

Query:   103 ERELQMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQR 162
             E+E   ++L  KI  ++ ++KT   VK   ++ KT+ + L   +++++    Q + D Q 
Sbjct:  1411 EKETIQKDLDAKIIDIQEKVKTITQVKKIGRRYKTQYEELKAQQDKVMETSAQSSGDHQE 1470

Query:   163 AHTDVQQIPALLSELESLRQEYHHCRGTYE-YEKKFYNDHLESLQVMEKNYITMATEVEK 221
              H  VQ++  L   L     +        E  +KK  ++     + +++  + +  E+ +
Sbjct:  1471 QHVSVQEMQELKETLNQAETKAKSLESQVENLQKKTLSEKEAEARNLQEQTVQLQCELSR 1530

Query:   222 LRAELMN 228
             LR +L +
Sbjct:  1531 LRQDLQD 1537


>ZFIN|ZDB-GENE-030131-4487
            symbol:sec24c "SEC24 family, member C (S.
            cerevisiae)" species:7955 "Danio rerio" [GO:0030127 "COPII vesicle
            coat" evidence=IEA] [GO:0006886 "intracellular protein transport"
            evidence=IEA] [GO:0006888 "ER to Golgi vesicle-mediated transport"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0006810 "transport" evidence=IEA] [GO:0015031 "protein
            transport" evidence=IEA] InterPro:IPR006895 InterPro:IPR006896
            InterPro:IPR006900 Pfam:PF04810 Pfam:PF04811 Pfam:PF04815
            ZFIN:ZDB-GENE-030131-4487 GO:GO:0006886 GO:GO:0008270
            InterPro:IPR007123 Pfam:PF00626 GO:GO:0006888 GO:GO:0030127
            SUPFAM:SSF82919 InterPro:IPR012990 Pfam:PF08033 SUPFAM:SSF81811
            GeneTree:ENSGT00590000082962 EMBL:CU469520 EMBL:CU694198
            IPI:IPI00972073 Ensembl:ENSDART00000085476 ArrayExpress:F1R9P2
            Bgee:F1R9P2 Uniprot:F1R9P2
        Length = 1241

 Score = 124 (48.7 bits), Expect = 0.00054, P = 0.00054
 Identities = 82/291 (28%), Positives = 110/291 (37%)

Query:   241 GGATGNSENETSGRPV--GQNAYED-GYGVPQGHGPPPS-ATTAGVVGAGPNTSTSAYAA 296
             G   G  E  TSG P   G  +Y   G G  Q +GPPP  A   G + + P+T  +   +
Sbjct:    70 GPPQGMREPPTSGTPPVSGAQSYSQFGQGETQ-NGPPPMVAPPQGTLVSQPHTPNAVSLS 128

Query:   297 TQSGTPMRAAYDIPR-GPGYEASKGPGYDA-SKAPSYDPTKGPSYDP---AKGP---GYD 348
               +  P    +  P  G     ++       S APS  P  GP Y P   A+ P    Y 
Sbjct:   129 GPTQPPYGQQFGSPPIGMQQMTNQMASMQVGSTAPS--PA-GPGYAPPSTAQAPISAAYT 185

Query:   349 PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDM---QRGPGYE 404
             P+  P +     S+  +Q  P   + + P   P  G     Q+  PN        GP  +
Sbjct:   186 PSAPPTFPPT--SSAPSQPPPTEAVAQAPP-QPYYGAPPPAQQPFPNAVSTFSSAGPT-Q 241

Query:   405 TQRVPGYDVQRGPVYEAQRAPSY--IPQRGP----GYDLQRGQGYDMRRAPSYDPSRGTG 458
              Q  P    Q  P   A   P +   P  GP    G  L   Q    +RAP      G  
Sbjct:   242 PQAPPSVSQQSFPQAPAVSQPPFSTAPPPGPSQSYGGPLPPTQP-SFQRAPLPTSQPGV- 299

Query:   459 FDGAPRGAAPHGQVP------PPLNNV-PYGSATPPARSGSGQPRGGNPAR 502
             F G P   + H Q+P      PP++   PY S  PP  + S  P+ G P R
Sbjct:   300 FPGGPPPTSTHSQLPGPMPPQPPVSQPSPYYSEPPPT-TASFPPQVGAPPR 349


>UNIPROTKB|P15941
            symbol:MUC1 "Mucin-1" species:9606 "Homo sapiens"
            [GO:0005576 "extracellular region" evidence=IEA] [GO:0005737
            "cytoplasm" evidence=IBA] [GO:0009986 "cell surface" evidence=IBA]
            [GO:0016324 "apical plasma membrane" evidence=IBA] [GO:0005887
            "integral to plasma membrane" evidence=TAS] [GO:0005796 "Golgi
            lumen" evidence=TAS] [GO:0016266 "O-glycan processing"
            evidence=TAS] [GO:0043687 "post-translational protein modification"
            evidence=TAS] [GO:0044267 "cellular protein metabolic process"
            evidence=TAS] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0002039 "p53 binding" evidence=IPI] [GO:0006977 "DNA damage
            response, signal transduction by p53 class mediator resulting in
            cell cycle arrest" evidence=IDA] [GO:0000790 "nuclear chromatin"
            evidence=IDA] [GO:0090240 "positive regulation of histone H4
            acetylation" evidence=IDA] [GO:0000978 "RNA polymerase II core
            promoter proximal region sequence-specific DNA binding"
            evidence=IDA] [GO:0043618 "regulation of transcription from RNA
            polymerase II promoter in response to stress" evidence=IDA]
            [GO:0006978 "DNA damage response, signal transduction by p53 class
            mediator resulting in transcription of p21 class mediator"
            evidence=IDA] [GO:0010944 "negative regulation of transcription by
            competitive promoter binding" evidence=IDA] [GO:0003712
            "transcription cofactor activity" evidence=IDA] [GO:0036003
            "positive regulation of transcription from RNA polymerase II
            promoter in response to stress" evidence=IDA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IDA] Reactome:REACT_17015
            PANTHER:PTHR10006 GO:GO:0043066 GO:GO:0005576 GO:GO:0009986
            GO:GO:0005887 GO:GO:0006977 GO:GO:0016324 GO:GO:0000978
            GO:GO:0000790 GO:GO:0003712 GO:GO:0043687 InterPro:IPR000082
            Pfam:PF01390 SMART:SM00200 PROSITE:PS50024 GO:GO:0005796
            EMBL:CH471121 GO:GO:0010944 GO:GO:0090240 PDB:2FO4 PDBsum:2FO4
            GO:GO:0016266 GO:GO:0006978 EMBL:AL713999 GO:GO:0036003
            MEROPS:S71.001 CTD:4582 eggNOG:NOG77744 KO:K06568
            InterPro:IPR023217 EMBL:J05582 EMBL:M32738 EMBL:M32739 EMBL:M34089
            EMBL:M34088 EMBL:J05581 EMBL:M61170 EMBL:X52229 EMBL:X52228
            EMBL:M35093 EMBL:X80761 EMBL:U60259 EMBL:U60260 EMBL:U60261
            EMBL:AF125525 EMBL:AF348143 EMBL:AY327582 EMBL:AY463543
            EMBL:BC120974 EMBL:Z17324 EMBL:Z17325 EMBL:M31823 EMBL:S81781
            EMBL:S81736 EMBL:M21868 IPI:IPI00013955 IPI:IPI00218163
            IPI:IPI00218164 IPI:IPI00218165 IPI:IPI00218166 IPI:IPI00218168
            IPI:IPI00218169 IPI:IPI00607673 IPI:IPI00902840 IPI:IPI00978078
            PIR:A35175 RefSeq:NP_001018016.1 RefSeq:NP_001018017.1
            RefSeq:NP_001037855.1 RefSeq:NP_001037856.1 RefSeq:NP_001037857.1
            RefSeq:NP_001037858.1 RefSeq:NP_001191214.1 RefSeq:NP_001191215.1
            RefSeq:NP_001191216.1 RefSeq:NP_001191217.1 RefSeq:NP_001191218.1
            RefSeq:NP_001191219.1 RefSeq:NP_001191220.1 RefSeq:NP_001191221.1
            RefSeq:NP_001191222.1 RefSeq:NP_001191223.1 RefSeq:NP_001191224.1
            RefSeq:NP_001191225.1 RefSeq:NP_001191226.1 RefSeq:NP_002447.4
            UniGene:Hs.89603 PDB:2ACM PDBsum:2ACM ProteinModelPortal:P15941
            SMR:P15941 IntAct:P15941 MINT:MINT-156679 STRING:P15941
            GlycoSuiteDB:P15941 PhosphoSite:P15941 DMDM:296439295 PaxDb:P15941
            PRIDE:P15941 DNASU:4582 Ensembl:ENST00000337604
            Ensembl:ENST00000343256 Ensembl:ENST00000368389
            Ensembl:ENST00000368390 Ensembl:ENST00000368398 GeneID:4582
            KEGG:hsa:4582 UCSC:uc001fib.3 GeneCards:GC01M155158 HGNC:HGNC:7508
            HPA:CAB000036 HPA:CAB001986 HPA:HPA004179 HPA:HPA007235
            HPA:HPA008855 MIM:113720 MIM:158340 neXtProt:NX_P15941
            PharmGKB:PA31309 ChiTaRS:MUC1 EvolutionaryTrace:P15941
            GenomeRNAi:4582 NextBio:17597 Bgee:P15941 Genevestigator:P15941
            GermOnline:ENSG00000185499 Uniprot:P15941
        Length = 1255

 Score = 124 (48.7 bits), Expect = 0.00055, P = 0.00055
 Identities = 65/275 (23%), Positives = 91/275 (33%)

Query:   236 ADGSYGGATGNSENETSGRPVG--QNAYEDGYGVPQGHGPPP-SATTAGV-VGAGPNTST 291
             A  + GG    S  + S  P    +NA      V   H P   S+TT G  V   P T  
Sbjct:    27 ASSTPGGEKETSATQRSSVPSSTEKNAVSMTSSVLSSHSPGSGSSTTQGQDVTLAPATEP 86

Query:   292 SAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK 351
             ++ +A   G  + +   + R P   ++  P +D + AP   P  G +  PA G    P  
Sbjct:    87 ASGSAATWGQDVTSV-PVTR-PALGSTTPPAHDVTSAPDNKPAPGSTAPPAHGVTSAPDT 144

Query:   352 GPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL--GYDMQRGPNYDMQRGPGY----ET 405
              P   +     +     P+     G +  P  G+    D +  P        G     +T
Sbjct:   145 RPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDT 204

Query:   406 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG--QGYDMRRAPSYDPSRGTGFDGAP 463
             +  PG      P +    AP   P  G       G     D R AP        G   AP
Sbjct:   205 RPAPGSTAP--PAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAP 262

Query:   464 RGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 498
                   G   PP + V     T PA   +  P  G
Sbjct:   263 DTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHG 297


>RGD|1308535
            symbol:Pygo2 "pygopus 2" species:10116 "Rattus norvegicus"
            [GO:0001701 "in utero embryonic development" evidence=IEA;ISO]
            [GO:0001822 "kidney development" evidence=IEA;ISO] [GO:0002088
            "lens development in camera-type eye" evidence=IEA;ISO] [GO:0005634
            "nucleus" evidence=IEA;ISO] [GO:0007420 "brain development"
            evidence=IEA;ISO] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0009791 "post-embryonic development" evidence=IEA;ISO]
            [GO:0016055 "Wnt receptor signaling pathway" evidence=ISO]
            [GO:0030879 "mammary gland development" evidence=IEA;ISO]
            [GO:0033599 "regulation of mammary gland epithelial cell
            proliferation" evidence=IEA;ISO] [GO:0042393 "histone binding"
            evidence=IEA;ISO] [GO:0048589 "developmental growth"
            evidence=IEA;ISO] [GO:0051569 "regulation of histone H3-K4
            methylation" evidence=IEA;ISO] [GO:0060021 "palate development"
            evidence=IEA;ISO] [GO:0060070 "canonical Wnt receptor signaling
            pathway" evidence=IEA;ISO] InterPro:IPR001965 InterPro:IPR019787
            Pfam:PF00628 PROSITE:PS50016 SMART:SM00249 RGD:1308535
            GO:GO:0005634 GO:GO:0007420 GO:GO:0046872 GO:GO:0008270
            GO:GO:0001701 GO:GO:0009791 GO:GO:0001822 Gene3D:3.30.40.10
            InterPro:IPR011011 InterPro:IPR013083 SUPFAM:SSF57903 GO:GO:0048589
            InterPro:IPR019786 PROSITE:PS01359 GO:GO:0060021 GO:GO:0060070
            GO:GO:0030879 GO:GO:0033599 GO:GO:0051569 GO:GO:0002088
            EMBL:CH473976 eggNOG:NOG72798 HOGENOM:HOG000001580
            HOVERGEN:HBG053774 GeneTree:ENSGT00530000063948 CTD:90780
            OMA:PGLVYPC OrthoDB:EOG4QZ7MB EMBL:BC169054 IPI:IPI00368626
            RefSeq:NP_001099917.1 UniGene:Rn.24988 STRING:B5DFG8
            Ensembl:ENSRNOT00000028052 GeneID:295251 KEGG:rno:295251
            UCSC:RGD:1308535 NextBio:639221 Genevestigator:B5DFG8
            Uniprot:B5DFG8
        Length = 405

 Score = 118 (46.6 bits), Expect = 0.00055, P = 0.00055
 Identities = 79/294 (26%), Positives = 110/294 (37%)

Query:   227 MNAPNVDRRADGSYGGATGN-SENETSGRP-----VGQNAYEDGYGVPQ--GHGPPPSAT 278
             M +P   RR   + G A  + +E      P     V  N +ED +G P+  G GPP    
Sbjct:    38 MKSPEKKRRKSNTQGPAYSHLTEFAPPPTPMVDHLVASNPFEDDFGAPKMGGAGPP---- 93

Query:   279 TAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTK-GP 337
                 +G+ P      +   Q G     A  +P G G     GP     + P + P   GP
Sbjct:    94 ---FLGS-P-VPFGGFRV-QGGM----AGQVPPGYGTGGGGGPQPLRRQPPPFPPNPMGP 143

Query:   338 SYD-PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYD-PQRGLGY----DMQR 391
             +++ P +GPGY P     + +Q    ++   G N+    G     P  G G      M +
Sbjct:   144 AFNMPPQGPGYPPPGNMNFPSQP---FNQSLGQNFSPPGGQMIPGPVGGFGPMISPTMGQ 200

Query:   392 GPNYDMQRGPGYETQRVPGYDVQRGPVYE--AQRAPSYIPQRGP--GYDLQ-RGQGYDMR 446
              P  ++  GP    QR        GP  +   Q  PS  P   P  G D    G G +  
Sbjct:   201 PPRGEL--GPPPLPQRFTQPGAPFGPSLQRPGQGLPSLPPNTSPFPGPDPGFPGPGGEDG 258

Query:   447 RAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNP 500
               P  +P   T F   P   +P   V     N P   + PP+ SG G   GG P
Sbjct:   259 GKP-LNPPAPTAFPQEPHSGSPAAAVN---GNQP---SFPPSSSGRG---GGTP 302


>UNIPROTKB|F1LNH3
            symbol:Col6a2 "Protein Col6a2" species:10116 "Rattus
            norvegicus" [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0031012 "extracellular matrix" evidence=IEA] [GO:0042383
            "sarcolemma" evidence=IEA] [GO:0043234 "protein complex"
            evidence=IEA] [GO:0070208 "protein heterotrimerization"
            evidence=IEA] InterPro:IPR002035 Pfam:PF00092 PROSITE:PS50234
            SMART:SM00327 RGD:1305585 GO:GO:0005615 GO:GO:0043234 GO:GO:0042383
            GO:GO:0070208 InterPro:IPR008160 Pfam:PF01391 GO:GO:0031012
            GeneTree:ENSGT00530000063022 OMA:RALCNHD IPI:IPI00372839
            Ensembl:ENSRNOT00000001695 ArrayExpress:F1LNH3 Uniprot:F1LNH3
        Length = 1025

 Score = 123 (48.4 bits), Expect = 0.00056, P = 0.00056
 Identities = 88/284 (30%), Positives = 99/284 (34%)

Query:   236 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AG-PNTSTSA 293
             +DG  G      +N T G    Q       G P   G P S    G  G AG P      
Sbjct:   320 SDGRKGAPGLAGKNGTDG----QKGKLGRIGPPGCKGDPGSRGPDGYPGEAGSPGEQGDQ 375

Query:   294 YAATQSGTPMRAAYDIPRGPGYEASKGPGYDA-SKAPSYDPTKGPSYDPA----KG-PGY 347
              A   SG P R     P  PG + SKG  Y   S AP     KG    P     KG PG 
Sbjct:   376 GAKGDSGRPGRRGP--PGNPGDKGSKG--YRGNSGAPGSPGVKGGKGGPGPRGPKGEPGR 431

Query:   348 --DP-TKG-PGYDAQKGSNYD-AQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPG 402
               DP TKG PG D  KG   D    GP        S   +   G    RGP   +   PG
Sbjct:   432 RGDPGTKGGPGSDGPKGEKGDPGPEGPRGLAGEIGSKGAKGDRGLPGPRGPQGALGE-PG 490

Query:   403 YETQRV-PGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRA----PSYDPSRGT 457
              +  R  PG    RG     Q  P   P R PG+     +G    +     P  +  RG 
Sbjct:   491 KQGSRGDPGDAGPRGD--SGQPGPKGDPGR-PGFSYPGPRGTPGEKGEPGPPGPEGGRGD 547

Query:   458 -GFDGAPRGAAPHGQV--P-PPLNNVPYGSATPPARSGSGQPRG 497
              G  GAP      G+   P PP    P G    P   G   P G
Sbjct:   548 FGLKGAPGRKGEKGEPADPGPPGEPGPRGPRGIPGPEGEPGPPG 591


>FB|FBgn0035060
            symbol:Eps-15 "Epidermal growth factor receptor pathway
            substrate clone 15" species:7227 "Drosophila melanogaster"
            [GO:0007269 "neurotransmitter secretion" evidence=NAS] [GO:0048488
            "synaptic vesicle endocytosis" evidence=IMP;TAS] [GO:0006898
            "receptor-mediated endocytosis" evidence=NAS] [GO:0016192
            "vesicle-mediated transport" evidence=IMP] [GO:0005509 "calcium ion
            binding" evidence=IEA] [GO:0045746 "negative regulation of Notch
            signaling pathway" evidence=IMP] [GO:0008021 "synaptic vesicle"
            evidence=IDA] [GO:0008582 "regulation of synaptic growth at
            neuromuscular junction" evidence=IMP] InterPro:IPR000261
            InterPro:IPR002048 InterPro:IPR011992 PROSITE:PS50031
            PROSITE:PS50222 SMART:SM00027 SMART:SM00054 Prosite:PS00018
            GO:GO:0006898 GO:GO:0005509 Gene3D:1.10.238.10 InterPro:IPR018247
            GO:GO:0048488 GO:GO:0007269 GO:GO:0008582 GO:GO:0045746 HSSP:P42566
            FlyBase:FBgn0035060 EMBL:AY122260 EMBL:AJ421624 IntAct:Q8WQ61
            STRING:Q8WQ61 InParanoid:Q8WQ61 Uniprot:Q8WQ61
        Length = 1253

 Score = 103 (41.3 bits), Expect = 0.00056, Sum P(3) = 0.00056
 Identities = 47/221 (21%), Positives = 106/221 (47%)

Query:    29 SGMRPP-MPGAFPPFDMMPPPEVMEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQ 87
             + M PP M       D+ P     E K    + E++ ++ E + LA     L  E+A  +
Sbjct:   394 ANMVPPSMRATVAGVDLQP----QEVKPTYSNPELEMISKEIEELARERRVLETEIAQKE 449

Query:    88 HELQILHGQIGGMKSERELQMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVARE 147
              +++I +G++  ++SE +     LT  + ++E +   A+  +L+  +++      V+A  
Sbjct:   450 ADVRIKNGEVRSLQSELD----TLTATLKQLENQRGEAQK-RLDDLQAQVSHNTAVLANV 504

Query:   148 EL-IAKVH-QLTQDLQRAHT-DVQ------QIPALLSELESLRQEYHHCRGTYEYEKKFY 198
              L I++ + Q+T+   + H  +V       ++ A  SEL+ L+ E    +  Y+   +  
Sbjct:   505 SLDISRTNEQVTKIRDQCHMQEVTINEQEGELNAKRSELQKLKDEEASLQKEYDSNNREL 564

Query:   199 N---DHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRRA 236
             +   +HL++ Q+   +  +M T++ + + ++ +A  + R A
Sbjct:   565 SKLTNHLQATQLQISSVRSMVTQLLETQRQMTDALLICRAA 605

 Score = 63 (27.2 bits), Expect = 0.00056, Sum P(3) = 0.00056
 Identities = 45/167 (26%), Positives = 61/167 (36%)

Query:   349 PTKGPGYDAQKGSNYDAQRGPNYDIHRG-PSYDPQRGLGYD----MQRG--PNYDM--QR 399
             P   P  +   G+   A  G   D   G P   P    G+D    M  G    +D   Q 
Sbjct:   639 PKDDPFEENNSGAANQATNGFGSDPFSGQPVNKPAISTGFDDSFNMSSGFDSGFDAFGQS 698

Query:   400 GPGY---ETQRVP-GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDP-- 453
             G G    +TQR P G D      + A ++ +  P+  PG D      +    AP+     
Sbjct:   699 GAGSAFGQTQRDPFGSDA-----FAANKSNAITPE--PGKDDFGSDPFAALHAPTGQGQV 751

Query:   454 -SRGTGFDGAP-RGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 498
              S      G P R  +P   +PP  + VP     PP    + QP GG
Sbjct:   752 LSPNAQKSGPPPRPESPSPALPPKKSKVPPPRPAPPR---AAQPTGG 795

 Score = 50 (22.7 bits), Expect = 0.00056, Sum P(3) = 0.00056
 Identities = 8/15 (53%), Positives = 10/15 (66%)

Query:    33 PPMPGAFPPFDMMPP 47
             PP+P A PP   +PP
Sbjct:   266 PPLPVAVPPMTRIPP 280


>FB|FBgn0003980
            symbol:Vm26Ab "Vitelline membrane 26Ab" species:7227
            "Drosophila melanogaster" [GO:0007304 "chorion-containing eggshell
            formation" evidence=IMP] [GO:0007305 "vitelline membrane formation
            involved in chorion-containing eggshell formation" evidence=NAS]
            [GO:0008316 "structural constituent of vitelline membrane"
            evidence=NAS] [GO:0007343 "egg activation" evidence=IMP]
            [GO:0060388 "vitelline envelope" evidence=IDA] GO:GO:0005576
            EMBL:AE014134 GO:GO:0007304 GO:GO:0007343 eggNOG:NOG295326
            PROSITE:PS51137 GeneTree:ENSGT00540000073505 GO:GO:0060388
            InterPro:IPR013135 Pfam:PF10542 EMBL:M20936 EMBL:EF441676
            PIR:A45943 RefSeq:NP_476784.1 UniGene:Dm.26740 DIP:DIP-19185N
            IntAct:P13238 MINT:MINT-1563965 STRING:P13238
            EnsemblMetazoa:FBtr0079171 GeneID:33827 KEGG:dme:Dmel_CG9046
            CTD:33827 FlyBase:FBgn0003980 InParanoid:P13238 OMA:RAAYGGY
            PhylomeDB:P13238 GenomeRNAi:33827 NextBio:785460 Bgee:P13238
            GermOnline:CG9046 Uniprot:P13238
        Length = 168

 Score = 108 (43.1 bits), Expect = 0.00056, P = 0.00056
 Identities = 28/92 (30%), Positives = 35/92 (38%)

Query:   276 SATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTK 335
             S    G  GA P  +  +Y+A  +  P   AY  P  P Y A   P Y A  AP+Y    
Sbjct:    45 SRAAYGGYGAAP--AAPSYSAPAA--PAAQAYSAPAAPAYSAPAAPAYSAPAAPAYSAPA 100

Query:   336 GPSYDPAKGPGYD-PTKGPGYDAQKGSNYDAQ 366
              P+Y     P Y  P   P     K   +  Q
Sbjct:   101 APAYSAPAAPAYSAPASIPSPPCPKNYLFSCQ 132


>UNIPROTKB|I3L781
            symbol:I3L781 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0005201 GeneTree:ENSGT00660000095287
            Ensembl:ENSSSCT00000024528 OMA:EVSMPEI Uniprot:I3L781
        Length = 1087

 Score = 123 (48.4 bits), Expect = 0.00060, P = 0.00060
 Identities = 83/271 (30%), Positives = 99/271 (36%)

Query:   242 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPP-----PS-ATTAGVVGAGPNTSTSAYA 295
             GA G   N  +  P G    + G G     GPP     P  A TAG VG           
Sbjct:   518 GAPGPDGNNGAQGPPGPQGVQGGKGEQGPAGPPGFQGLPGPAGTAGEVGKPGERGIPG-- 575

Query:   296 ATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDPTKGPSYDPAKG-PGYDPTKG 352
               + G P  A     RGP G   + GP G   S+ PS  P  GP  D  KG PG      
Sbjct:   576 --EFGLPGPAGPRGERGPPGESGAAGPAGPIGSRGPSGPP--GP--DGNKGEPGV--LGA 627

Query:   353 PGYDAQKG-SNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVP- 409
             PG     G S    +RG    I  G     + GL  D+   P  D  RG PG      P 
Sbjct:   628 PGTAGPSGPSGLPGERGAA-GIPGGKGEKGETGLRGDVG-SPGRDGARGAPGAVGAPGPA 685

Query:   410 GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP--RGA- 466
             G +  RG    A  A    P+  PG   +RG+           P+   G  GA   RG  
Sbjct:   686 GANGDRGEAGPAGPAGPAGPRGSPG---ERGEVGPAGPNGFAGPAGAAGQPGAKGERGTK 742

Query:   467 APHGQVPPPLNNVPYGSATPPARSGSGQPRG 497
              P G+  P     P G+A P   +G   P G
Sbjct:   743 GPKGENGPVGPTGPVGAAGPAGPNGPPGPAG 773


>WB|WBGene00000618
            symbol:col-41 species:6239 "Caenorhabditis elegans"
            [GO:0042302 "structural constituent of cuticle" evidence=IEA]
            [GO:0016021 "integral to membrane" evidence=IEA] InterPro:IPR002486
            Pfam:PF01484 SMART:SM01088 InterPro:IPR008160 Pfam:PF01391
            eggNOG:NOG12793 GO:GO:0042302 HOGENOM:HOG000085656
            GeneTree:ENSGT00530000064674 EMBL:Z72514 PIR:T24769
            RefSeq:NP_510522.1 ProteinModelPortal:Q22369 IntAct:Q22369
            MINT:MINT-213826 STRING:Q22369 PaxDb:Q22369 EnsemblMetazoa:T10B10.1
            GeneID:181610 KEGG:cel:CELE_T10B10.1 UCSC:T10B10.1 CTD:181610
            WormBase:T10B10.1 InParanoid:Q22369 OMA:CSIGHIV NextBio:914648
            Uniprot:Q22369
        Length = 428

 Score = 118 (46.6 bits), Expect = 0.00060, P = 0.00060
 Identities = 93/347 (26%), Positives = 124/347 (35%)

Query:   173 LLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKL-RAELMNAP- 230
             ++ ++ +LR E     G  +  K   +D  + L +++      A  V  L R +    P 
Sbjct:    26 IVQDINNLRSEVE---GRVDEFKVLADDTWDRLLILQSPTGESANPVPSLLRNKRFVYPG 82

Query:   231 --NVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPN 288
               N D  + G   GA G   N   G+  G   +    G  +G      ATT  + G    
Sbjct:    83 MCNCDSNSQGCPAGAPGPPGNP--GKR-GDEGHPGDEG-RRGASGISLATTHDIPGGCIK 138

Query:   289 TSTSAYAATQSGTPMRAAYDIPRG-PGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGP- 345
                       +G P       P G PG +   GP G D   AP  +   G   +  +GP 
Sbjct:   139 CPEGP-----AGPPGPDGDSGPEGFPGLQGQSGPSGEDG--APGQEGAPGDQGE--QGPK 189

Query:   346 GYDPTKGPGYDAQKGSNY-DAQRG-PNYDIHRG-PSYDPQRGL-GYDMQRGPNYDMQRGP 401
             GYD T GP  D Q G+ Y   Q G P      G P    Q G  G D + GP    Q  P
Sbjct:   190 GYDGTDGP--DGQPGTTYFPGQAGQPGEPGWLGEPGLPGQHGEPGKDGEEGP----QGAP 243

Query:   402 GYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS------YDPSR 455
             G  T    G+D   G   +A + P   P +   Y     Q  D R  PS        P R
Sbjct:   244 G--TPGNAGHDAFPGTPGQAGK-PG-APGKDANY-CPCPQRQDDRTPPSSGTSAPQPPPR 298

Query:   456 GTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 502
             G+    AP   AP     PP    P  +   P  +    P    P R
Sbjct:   299 GS--TAAPGTRAPPATRAPPATRAPPATTRAPPATTRPAPASQPPVR 343


>UNIPROTKB|P08123
            symbol:COL1A2 "Collagen alpha-2(I) chain" species:9606
            "Homo sapiens" [GO:0046872 "metal ion binding" evidence=IEA]
            [GO:0046332 "SMAD binding" evidence=IEA] [GO:0070208 "protein
            heterotrimerization" evidence=IEA] [GO:0071230 "cellular response
            to amino acid stimulus" evidence=IEA] [GO:0005584 "collagen type I"
            evidence=IDA;IMP;TAS] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0001501 "skeletal system development" evidence=IMP] [GO:0042476
            "odontogenesis" evidence=NAS] [GO:0008217 "regulation of blood
            pressure" evidence=IMP] [GO:0007179 "transforming growth factor
            beta receptor signaling pathway" evidence=IDA] [GO:0005201
            "extracellular matrix structural constituent" evidence=NAS]
            [GO:0042802 "identical protein binding" evidence=IDA] [GO:0005615
            "extracellular space" evidence=IDA] [GO:0030674 "protein binding,
            bridging" evidence=IMP] [GO:0030199 "collagen fibril organization"
            evidence=IMP] [GO:0007266 "Rho protein signal transduction"
            evidence=IDA] [GO:0043589 "skin morphogenesis" evidence=IMP]
            [GO:0001568 "blood vessel development" evidence=IMP] [GO:0070062
            "extracellular vesicular exosome" evidence=IDA] [GO:0048407
            "platelet-derived growth factor binding" evidence=IDA] [GO:0005576
            "extracellular region" evidence=TAS] [GO:0005788 "endoplasmic
            reticulum lumen" evidence=TAS] [GO:0007411 "axon guidance"
            evidence=TAS] [GO:0007596 "blood coagulation" evidence=TAS]
            [GO:0030168 "platelet activation" evidence=TAS] [GO:0030198
            "extracellular matrix organization" evidence=TAS] [GO:0050900
            "leukocyte migration" evidence=TAS] [GO:0031012 "extracellular
            matrix" evidence=IDA] Reactome:REACT_604 InterPro:IPR000885
            Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461 SMART:SM00038
            Reactome:REACT_118779 Reactome:REACT_111045 Reactome:REACT_111102
            GO:GO:0007411 GO:GO:0005615 GO:GO:0030168 GO:GO:0046872
            GO:GO:0050900 GO:GO:0070062 GO:GO:0030199 GO:GO:0030674
            GO:GO:0005788 GO:GO:0042802 GO:GO:0001501 GO:GO:0008217
            GO:GO:0007179 GO:GO:0007266
            Pathway_Interaction_DB:endothelinpathway GO:GO:0070208
            InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793 GO:GO:0001568
            Pathway_Interaction_DB:il4_2pathway
            Pathway_Interaction_DB:smad2_3nuclearpathway
            Pathway_Interaction_DB:lymphangiogenesis_pathway GO:GO:0042476
            GO:GO:0071230 Orphanet:216812 EMBL:AC002528 GO:GO:0005201
            GO:GO:0043589 HOVERGEN:HBG004933 KO:K06236 GO:GO:0005584 MIM:130060
            MIM:166200 MIM:166210 MIM:166220 MIM:259420 Orphanet:230857
            Orphanet:216796 Orphanet:216804 Orphanet:216820 DrugBank:DB00048
            GO:GO:0048407 CTD:1278 OrthoDB:EOG412M65 EMBL:J03464 EMBL:Z74616
            EMBL:AF004877 EMBL:BC042586 EMBL:BC054498 EMBL:Y00724 EMBL:X02488
            EMBL:AB004317 EMBL:M35391 EMBL:S98904 EMBL:M21671 EMBL:S41099
            EMBL:M21353 EMBL:M28985 EMBL:V00503 EMBL:S96821 EMBL:L47668
            EMBL:X55525 EMBL:J00114 EMBL:M22816 EMBL:M22817 EMBL:K01078
            EMBL:K02568 IPI:IPI00304962 PIR:A28500 RefSeq:NP_000080.2
            UniGene:Hs.489142 ProteinModelPortal:P08123 SMR:P08123
            DIP:DIP-36079N IntAct:P08123 MINT:MINT-4791958 STRING:P08123
            PhosphoSite:P08123 DMDM:296439507 PaxDb:P08123 PRIDE:P08123
            Ensembl:ENST00000297268 GeneID:1278 KEGG:hsa:1278 UCSC:uc003ung.1
            GeneCards:GC07P094023 H-InvDB:HIX0006854 HGNC:HGNC:2198
            HPA:CAB032650 MIM:120160 MIM:225320 neXtProt:NX_P08123
            Orphanet:99876 Orphanet:230851 PharmGKB:PA35042 ChEMBL:CHEMBL2685
            ChiTaRS:COL1A2 GenomeRNAi:1278 NextBio:5165 ArrayExpress:P08123
            Bgee:P08123 Genevestigator:P08123 GermOnline:ENSG00000164692
            Uniprot:P08123
        Length = 1366

 Score = 124 (48.7 bits), Expect = 0.00060, P = 0.00060
 Identities = 79/261 (30%), Positives = 99/261 (37%)

Query:   266 GVPQGHGPPPSATTAGVVGA----G-PNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASK 319
             G+P   G P     AG  GA    G P  + S   +   G P  A    P GP G E  +
Sbjct:   322 GLPGPRGIPGPVGAAGATGARGLVGEPGPAGSKGESGNKGEPGSAGPQGPPGPSGEEGKR 381

Query:   320 GPGYDASKAPSYDPTKGPSYDP-AKG-PGYDPTKGP-GYDAQKGSNYDAQ-RGPNYDIHR 375
             GP  +A  A    P  G    P ++G PG D   G  G    +G++  A  RGPN D  R
Sbjct:   382 GPNGEAGSAGPPGPP-GLRGSPGSRGLPGADGRAGVMGPPGSRGASGPAGVRGPNGDAGR 440

Query:   376 -G-PSYDPQRGL-GYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEA--QRAPSYI-- 428
              G P     RGL G     GP    + GP      +PG D + GP+  A  +  P  I  
Sbjct:   441 PGEPGLMGPRGLPGSPGNIGPAG--KEGP----VGLPGIDGRPGPIGPAGARGEPGNIGF 494

Query:   429 -----PQRGPGYDLQRGQG--YDMRRAPSYDPSRGT----GFDGAPRGAAPHGQVPPP-L 476
                  P   PG +  +G       R AP  D + G     G  G   G    G   PP  
Sbjct:   495 PGPKGPTGDPGKNGDKGHAGLAGARGAPGPDGNNGAQGPPGPQGVQGGKGEQGPPGPPGF 554

Query:   477 NNVPYGSATPPARSGSGQPRG 497
               +P G + P    G    RG
Sbjct:   555 QGLP-GPSGPAGEVGKPGERG 574


>UNIPROTKB|Q51MB1
            symbol:RIM9 "pH-response regulator protein palI/RIM9"
            species:242507 "Magnaporthe oryzae 70-15" [GO:0003674
            "molecular_function" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] Pfam:PF06687 GO:GO:0016021 GO:GO:0005886
            eggNOG:NOG12793 EMBL:CM000230 EMBL:CM001237 OrthoDB:EOG4DBXQ8
            InterPro:IPR009571 RefSeq:XP_003721159.1 EnsemblFungi:MGG_02630T0
            GeneID:2682829 KEGG:mgr:MGG_02630 Uniprot:Q51MB1
        Length = 736

 Score = 121 (47.7 bits), Expect = 0.00061, P = 0.00061
 Identities = 56/176 (31%), Positives = 69/176 (39%)

Query:   226 LMNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQG----HGPPPSATTAG 281
             +  AP+ +R   G+ GG  G         P G+  Y  GYG P G    +GPP      G
Sbjct:   303 VQRAPSAERMNPGARGGYRGRGYG-----PPGRGGY--GYGPPPGSRGGYGPPGR----G 351

Query:   282 VVGAGPNTSTSAYAATQSGTPMRAAYDIP-RG----PGYEASK-GPGYDASKAPSYDPTK 335
               G GPN     Y     G P R  Y  P RG    PGY+  + G   +A   P   P +
Sbjct:   352 GYGPGPN-GRGGY-----GPPPRGGYGPPMRGRAPPPGYQYDRRGSPAEAYGPP---PGQ 402

Query:   336 GPSYDPAKGPGYDPTKGPGYDAQKGSN-------YDAQRGPNYDIHRGPSYDPQRG 384
             GP     + PG  P   PGY    GS        Y  Q  P+ D+ R  S  P  G
Sbjct:   403 GPYGQRQQSPG--PPSAPGY-GMNGSTPTVSSAAYGHQHTPSDDLPRAESPPPLPG 455


>UNIPROTKB|B0QYK0
            symbol:EWSR1 "RNA-binding protein EWS" species:9606 "Homo
            sapiens" [GO:0000166 "nucleotide binding" evidence=IEA] [GO:0003676
            "nucleic acid binding" evidence=IEA] [GO:0005622 "intracellular"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            InterPro:IPR000504 InterPro:IPR001876 InterPro:IPR012677
            Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102 PROSITE:PS50199
            SMART:SM00360 SMART:SM00547 GO:GO:0000166 GO:GO:0008270
            Gene3D:3.30.70.330 GO:GO:0003676 GO:GO:0005622 EMBL:AC002059
            EMBL:AL031186 EMBL:AC000026 UniGene:Hs.374477 HGNC:HGNC:3508
            HOGENOM:HOG000038010 HOVERGEN:HBG000970 ChiTaRS:EWSR1
            IPI:IPI00879242 SMR:B0QYK0 STRING:B0QYK0 Ensembl:ENST00000331029
            Uniprot:B0QYK0
        Length = 618

 Score = 120 (47.3 bits), Expect = 0.00062, P = 0.00062
 Identities = 75/279 (26%), Positives = 102/279 (36%)

Query:   238 GSYGGATGNSENETSGRPV-GQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 290
             G+YG  T  S  +       GQ AY   YG P  G+  P  P A +  V G G    +T+
Sbjct:    42 GTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTT 101

Query:   291 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPS--YDPTKGPSYDPAKGPGY 347
             T+    TQ+    ++AY   P  P Y   + P   A   P     PT+      + G GY
Sbjct:   102 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPTRPQDGNKPTETSQPQSSTG-GY 158

Query:   348 D-PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE 404
             + P+ G G   Q   +Y    G  P   +   PSY P     Y   +  +YD        
Sbjct:   159 NQPSLGYG---QSNYSYPQVPGSYPMQPVTAPPSYPPT---SYSSTQPTSYDQSSYSQQN 212

Query:   405 TQRVPG-YDVQRG----PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGF 459
             T   P  Y  Q        Y  Q   SY PQ G  Y     Q Y  +++ SY        
Sbjct:   213 TYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTG-SYSQAPSQ-YS-QQSSSYGQQSSFRQ 269

Query:   460 DGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 498
             D  P     +GQ     +  P  + +       G+ RGG
Sbjct:   270 DH-PSSMGVYGQESGGFSG-PGENRSMSGPDNRGRGRGG 306


>UNIPROTKB|D4A458 [details] [associations]
            symbol:Ewsr1 "Protein Ewsr1" species:10116 "Rattus
            norvegicus" [GO:0000166 "nucleotide binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005622
            "intracellular" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102
            PROSITE:PS50199 SMART:SM00360 SMART:SM00547 RGD:1307258
            GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
            GO:GO:0005622 IPI:IPI00767290 Ensembl:ENSRNOT00000057377
            ArrayExpress:D4A458 Uniprot:D4A458
        Length = 618

 Score = 120 (47.3 bits), Expect = 0.00062, P = 0.00062
 Identities = 74/278 (26%), Positives = 100/278 (35%)

Query:   238 GSYGGATGNSENET-SGRPVGQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 290
             G+YG  T  S  +  S    GQ AY   YG P  G+  P  P A +  V G G    +T+
Sbjct:    42 GTYGQPTDVSYTQAQSTATXGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTT 101

Query:   291 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG-YD 348
             T+    TQ+    ++AY   P  P Y   + P   A   P        +  P    G Y+
Sbjct:   102 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPTRPQDGNKPAETSQPQSSTGGYN 159

Query:   349 -PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 405
              P+ G G   Q   +Y    G  P   +   PSY P     Y   +  +YD        T
Sbjct:   160 QPSLGYG---QSNYSYPQVPGSYPMQPVTAPPSYPPT---SYSSSQPTSYDQSSYSQQNT 213

Query:   406 QRVPG-YDVQRG----PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFD 460
                P  Y  Q        Y  Q   SY PQ G  Y     Q Y  +++ SY        D
Sbjct:   214 YGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTG-SYSQAPSQ-YS-QQSSSYGQQSSFRQD 270

Query:   461 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 498
               P     +GQ     +  P  + +       G+ RGG
Sbjct:   271 H-PSSMGVYGQESGGFSG-PGENRSLSGPDNRGRGRGG 306


>UNIPROTKB|P02461 [details] [associations]
            symbol:COL3A1 "Collagen alpha-1(III) chain" species:9606
            "Homo sapiens" [GO:0046872 "metal ion binding" evidence=IEA]
            [GO:0001501 "skeletal system development" evidence=IEA] [GO:0001568
            "blood vessel development" evidence=IEA] [GO:0046332 "SMAD binding"
            evidence=IEA] [GO:0048565 "digestive tract development"
            evidence=IEA] [GO:0071230 "cellular response to amino acid
            stimulus" evidence=IEA] [GO:0031012 "extracellular matrix"
            evidence=IMP;TAS] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0007160 "cell-matrix adhesion" evidence=IDA] [GO:0018149
            "peptide cross-linking" evidence=IDA] [GO:0050777 "negative
            regulation of immune response" evidence=IMP] [GO:0005178 "integrin
            binding" evidence=NAS;IMP] [GO:0030168 "platelet activation"
            evidence=NAS] [GO:0007179 "transforming growth factor beta receptor
            signaling pathway" evidence=IDA] [GO:0034097 "response to cytokine
            stimulus" evidence=IDA] [GO:0009314 "response to radiation"
            evidence=IDA] [GO:0042060 "wound healing" evidence=IDA;NAS]
            [GO:0043206 "extracellular fibril organization" evidence=IMP]
            [GO:0030199 "collagen fibril organization" evidence=NAS;IMP]
            [GO:0007507 "heart development" evidence=IMP] [GO:0032964 "collagen
            biosynthetic process" evidence=IMP;TAS] [GO:0005615 "extracellular
            space" evidence=IDA;NAS] [GO:0043588 "skin development"
            evidence=IMP] [GO:0005201 "extracellular matrix structural
            constituent" evidence=IMP] [GO:0007229 "integrin-mediated signaling
            pathway" evidence=IMP] [GO:0005586 "collagen type III"
            evidence=NAS;IMP] [GO:0048407 "platelet-derived growth factor
            binding" evidence=IDA] [GO:0005576 "extracellular region"
            evidence=TAS] [GO:0005788 "endoplasmic reticulum lumen"
            evidence=TAS] [GO:0007411 "axon guidance" evidence=TAS] [GO:0030198
            "extracellular matrix organization" evidence=TAS]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 Reactome:REACT_118779
            Reactome:REACT_111045 Reactome:REACT_111102 GO:GO:0007411
            GO:GO:0043588 GO:GO:0005615 GO:GO:0030168 GO:GO:0007507
            GO:GO:0046872 GO:GO:0034097 GO:GO:0030199 GO:GO:0005788
            GO:GO:0001501 EMBL:CH471058 GO:GO:0005178 GO:GO:0007179
            GO:GO:0007229 GO:GO:0007160
            Pathway_Interaction_DB:endothelinpathway InterPro:IPR008160
            Pfam:PF01391 eggNOG:NOG12793 GO:GO:0001568 GO:GO:0048565
            GO:GO:0050777 GO:GO:0009314 GO:GO:0018149 GO:GO:0032964
            GO:GO:0071230 GO:GO:0043206 GO:GO:0005201 HOVERGEN:HBG004933
            KO:K06236 DrugBank:DB00048 DrugBank:DB00039 GO:GO:0048407
            OrthoDB:EOG4FTW1C EMBL:X14420 EMBL:AY054301 EMBL:AY016295
            EMBL:AC066694 EMBL:BC028178 EMBL:M26939 EMBL:X07240 EMBL:X15332
            EMBL:S62925 EMBL:S79877 EMBL:M59312 EMBL:M59227 EMBL:M55603
            EMBL:X06700 EMBL:X01655 EMBL:X01742 EMBL:M13146 EMBL:M11134
            IPI:IPI00021033 IPI:IPI00167087 PIR:S05272 RefSeq:NP_000081.1
            UniGene:Hs.443625 PDB:2V53 PDB:3DMW PDB:4AE2 PDB:4AEJ PDB:4AK3
            PDBsum:2V53 PDBsum:3DMW PDBsum:4AE2 PDBsum:4AEJ PDBsum:4AK3
            ProteinModelPortal:P02461 SMR:P02461 DIP:DIP-57177N IntAct:P02461
            STRING:P02461 PhosphoSite:P02461 DMDM:124056490 PaxDb:P02461
            PRIDE:P02461 Ensembl:ENST00000304636 GeneID:1281 KEGG:hsa:1281
            UCSC:uc002uqj.1 CTD:1281 GeneCards:GC02P189803 HGNC:HGNC:2201
            HPA:CAB016766 HPA:HPA007583 MIM:100070 MIM:120180 MIM:130020
            MIM:130050 neXtProt:NX_P02461 Orphanet:2500 Orphanet:285
            Orphanet:286 Orphanet:86 PharmGKB:PA26716 InParanoid:P02461
            OMA:EGSPGHP PhylomeDB:P02461 ChiTaRS:COL3A1
            EvolutionaryTrace:P02461 GenomeRNAi:1281 NextBio:5177
            ArrayExpress:P02461 Bgee:P02461 Genevestigator:P02461
            GermOnline:ENSG00000168542 GO:GO:0005586 Uniprot:P02461
        Length = 1466

 Score = 124 (48.7 bits), Expect = 0.00065, P = 0.00065
 Identities = 81/280 (28%), Positives = 101/280 (36%)

Query:   236 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP----NTS 290
             A G   G  G +       P G + +    G P   GPP     AG  G  GP      S
Sbjct:   165 AVGGLAGYPGPAGPPGPPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPSGPPGPPGAIGPS 224

Query:   291 TSAYAATQSGTPMRAAY-DIPRGPGYEASKG-PGYDASKAP-SYDPTKGPSYDPAKGPGY 347
               A    +SG P R     +P  PG +   G PG+   K    +D   G   +    PG 
Sbjct:   225 GPAGKDGESGRPGRPGERGLPGPPGIKGPAGIPGFPGMKGHRGFDGRNGEKGETG-APGL 283

Query:   348 DPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGP-GYE 404
                 G PG +   G      RG   +  R P      G  G D  RG   D Q GP G  
Sbjct:   284 KGENGLPGENGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQPGPPGPP 338

Query:   405 -TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 463
              T   PG    +G V  A    S      PG   QRG+      A +  P    G +G+P
Sbjct:   339 GTAGFPGSPGAKGEVGPAGSPGS---NGAPG---QRGEPGPQGHAGAQGPPGPPGINGSP 392

Query:   464 RGAAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RGG 498
              G    G  P  +   P   G+  PP  +G+ G P  RGG
Sbjct:   393 GGKGEMG--PAGIPGAPGLMGARGPPGPAGANGAPGLRGG 430

 Score = 123 (48.4 bits), Expect = 0.00084, P = 0.00084
 Identities = 85/284 (29%), Positives = 101/284 (35%)

Query:   236 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-G-PNTSTSA 293
             A G  GGA    +N   G P G        G+P   G P +    G  G+ G P  +   
Sbjct:   424 APGLRGGAGEPGKNGAKGEP-GPRGERGEAGIP---GVPGAKGEDGKDGSPGEPGANGLP 479

Query:   294 YAATQSGTPMRAAYDIPRGP-GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG 352
              AA + G P    +  P GP G    KGP  +   AP   P  GP    A  PG D   G
Sbjct:   480 GAAGERGAP---GFRGPAGPNGIPGEKGPAGERG-APG--PA-GPR-GAAGEPGRDGVPG 531

Query:   353 -PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDM-QRG----PGYE-T 405
              PG     GS      GP  D   GP    Q   G     GP+    Q G    PG +  
Sbjct:   532 GPGMRGMPGS----PGGPGSDGKPGPP-GSQGESGRPGPPGPSGPRGQPGVMGFPGPKGN 586

Query:   406 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG-QGYDMRRAPSYD-----PSRGTGF 459
                PG + +RG        P   PQ  PG + + G QG      P  D     P    G 
Sbjct:   587 DGAPGKNGERG----GPGGPG--PQGPPGKNGETGPQGPPGPTGPGGDKGDTGPPGPQGL 640

Query:   460 DGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPARR 503
              G P    P G+   P    P G A  P   G G+   G P  R
Sbjct:   641 QGLPGTGGPPGENGKPGEPGPKGDAGAPGAPG-GKGDAGAPGER 683


>UNIPROTKB|B4DR34 [details] [associations]
            symbol:SS18 "Protein SSXT" species:9606 "Homo sapiens"
            [GO:0000226 "microtubule cytoskeleton organization" evidence=IEA]
            [GO:0000902 "cell morphogenesis" evidence=IEA] [GO:0005881
            "cytoplasmic microtubule" evidence=IEA] [GO:0007243 "intracellular
            protein kinase cascade" evidence=IEA] [GO:0042493 "response to
            drug" evidence=IEA] [GO:0048013 "ephrin receptor signaling pathway"
            evidence=IEA] GO:GO:0000226 GO:GO:0042493 GO:GO:0007243
            GO:GO:0000902 GO:GO:0048013 GO:GO:0005881 HOVERGEN:HBG003892
            InterPro:IPR007726 PANTHER:PTHR23107 UniGene:Hs.129261
            EMBL:AC091021 HGNC:HGNC:11340 ChiTaRS:SS18 EMBL:AK299082
            IPI:IPI01015658 STRING:B4DR34 Ensembl:ENST00000539849
            Uniprot:B4DR34
        Length = 336

 Score = 116 (45.9 bits), Expect = 0.00066, P = 0.00065
 Identities = 66/236 (27%), Positives = 88/236 (37%)

Query:   238 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQG---H--GPPPSATTAGVVGAGPNT--- 289
             G+YG     S     G  + Q      Y +PQG   H  G  P     G V  G +    
Sbjct:   106 GNYGPRPNMSMQPNQGPMMHQQPPSQQYNMPQGGGQHYQGQQPPMGMMGQVNQGNHMMGQ 165

Query:   290 -STSAYAATQSGTPMR-AAYDIPRGPGYE-ASKGP--GYDASKAPSYDPTKG---PSYDP 341
                  Y   Q G P + +  +   G  Y    +GP  G +    P      G   PSY P
Sbjct:   166 RQIPPYRPPQQGPPQQYSGQEDYYGDQYSHGGQGPPEGMNQQYYPDGHNDYGYQQPSY-P 224

Query:   342 AKGPGYD-PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 400
              +G  YD P +       +G N  +Q G   D ++GP   PQ+G     Q+ P      G
Sbjct:   225 EQG--YDRPYEDSSQHYYEGGN--SQYGQQQDAYQGPP--PQQGYPPQQQQYPGQQGYPG 278

Query:   401 P--GY-ETQRVPGYDVQRGPVYEAQRAPSYIP-QRGPGYD-LQRGQGYDMRRAPSY 451
                GY  +Q  PG      P  + Q+   Y P Q GP     QR  GYD  +  +Y
Sbjct:   279 QQQGYGPSQGGPGPQYPNYPQGQGQQYGGYRPTQPGPPQPPQQRPYGYDQGQYGNY 334


>UNIPROTKB|J9NW09 [details] [associations]
            symbol:POLR2A "DNA-directed RNA polymerase" species:9615
            "Canis lupus familiaris" [GO:0003899 "DNA-directed RNA polymerase
            activity" evidence=IEA] [GO:0006366 "transcription from RNA
            polymerase II promoter" evidence=IEA] [GO:0005665 "DNA-directed RNA
            polymerase II, core complex" evidence=IEA] [GO:0003677 "DNA
            binding" evidence=IEA] InterPro:IPR000684 InterPro:IPR000722
            InterPro:IPR006592 InterPro:IPR007066 InterPro:IPR007073
            InterPro:IPR007075 InterPro:IPR007080 InterPro:IPR007081
            InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983 Pfam:PF04990
            Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000 Pfam:PF05001
            PROSITE:PS00115 SMART:SM00663 GO:GO:0003677 GO:GO:0006366
            Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899 GO:GO:0005665
            GeneTree:ENSGT00700000104490 EMBL:AAEX03003616 EMBL:AAEX03003617
            Ensembl:ENSCAFT00000050029 Uniprot:J9NW09
        Length = 1789

 Score = 137 (53.3 bits), Expect = 0.00066, Sum P(2) = 0.00066
 Identities = 71/265 (26%), Positives = 95/265 (35%)

Query:   228 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP--SATTAGVVGA 285
             N P +   A G  G   G++ +   G       +  G     G   P   S  T G  G 
Sbjct:  1489 NIPGLG--AAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGF 1546

Query:   286 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP 345
              P+ ++ A   +   +P  A    P  PG      PG  +   PS      PSY P   P
Sbjct:  1547 SPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGAMSPSYSPTS-P 1598

Query:   346 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 405
              Y+P    GY  Q  S Y +   P+Y     PSY P     Y     P+Y     P Y  
Sbjct:  1599 AYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPSYS-PTSPSYSP 1652

Query:   406 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 465
                P Y     P Y +  +PSY P   P Y       Y    +PSY P+  +    +P  
Sbjct:  1653 TS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPTSPSYSPTSP-S 1705

Query:   466 AAPHGQVPPPLNNVPYGSATPPARS 490
              +P      P +  P  S T P+ S
Sbjct:  1706 YSPTSPSYSPTS--PSYSPTSPSYS 1728

 Score = 37 (18.1 bits), Expect = 0.00066, Sum P(2) = 0.00066
 Identities = 8/16 (50%), Positives = 10/16 (62%)

Query:    52 EQKIASQHVEMQKLAT 67
             E  +A + VE Q LAT
Sbjct:   893 EDGLAGESVEFQNLAT 908


>MGI|MGI:88462 [details] [associations]
            symbol:Col7a1 "collagen, type VII, alpha 1" species:10090 "Mus
            musculus" [GO:0004867 "serine-type endopeptidase inhibitor
            activity" evidence=IEA] [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0005578 "proteinaceous extracellular matrix"
            evidence=IEA] [GO:0005581 "collagen" evidence=IEA] [GO:0005604
            "basement membrane" evidence=IDA] [GO:0007155 "cell adhesion"
            evidence=IEA] [GO:0010466 "negative regulation of peptidase
            activity" evidence=IEA] [GO:0030414 "peptidase inhibitor activity"
            evidence=IEA] InterPro:IPR002035 InterPro:IPR002223
            InterPro:IPR003961 Pfam:PF00014 Pfam:PF00041 Pfam:PF00092
            PRINTS:PR00759 PROSITE:PS50234 PROSITE:PS50279 PROSITE:PS50853
            SMART:SM00060 SMART:SM00327 MGI:MGI:88462 Gene3D:2.60.40.10
            InterPro:IPR013783 GO:GO:0004867 SUPFAM:SSF49265 GO:GO:0007155
            Gene3D:4.10.410.10 InterPro:IPR020901 SUPFAM:SSF57362
            PROSITE:PS00280 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            eggNOG:NOG12793 GO:GO:0005604 EMBL:AC174646 MEROPS:I02.967 CTD:1294
            HOGENOM:HOG000111866 HOVERGEN:HBG051053 KO:K16628 OMA:RRVCTTA
            OrthoDB:EOG4J117P EMBL:U32107 EMBL:S63654 IPI:IPI00134652
            PIR:A45748 RefSeq:NP_031764.2 UniGene:Mm.6200 HSSP:P12111
            ProteinModelPortal:Q63870 SMR:Q63870 STRING:Q63870
            PhosphoSite:Q63870 PaxDb:Q63870 PRIDE:Q63870
            Ensembl:ENSMUST00000026740 Ensembl:ENSMUST00000112070 GeneID:12836
            KEGG:mmu:12836 UCSC:uc009rrh.1 GeneTree:ENSGT00700000104250
            InParanoid:Q63870 NextBio:282356 Bgee:Q63870 CleanEx:MM_COL7A1
            Genevestigator:Q63870 Uniprot:Q63870
        Length = 2944

 Score = 127 (49.8 bits), Expect = 0.00066, P = 0.00066
 Identities = 86/270 (31%), Positives = 103/270 (38%)

Query:   255 PVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG-- 312
             P G    +   G P   GPP S    GV G+ P    S       G         P+G  
Sbjct:  1289 PPGSTQAKGERGFPGPEGPPGSPGLPGVPGS-PGIKGSTGRPGPRGEQGERGPQGPKGEP 1347

Query:   313 --PGY-EASKGPGYDASKAPSYDPTKGPSYDP-AKGPGYDP-TKGP-GYD--AQKGSNYD 364
               PG      GPG+   K    DP  GPS  P ++GP  DP  +GP G    + KG   D
Sbjct:  1348 GEPGQITGGGGPGFPGKKG---DP--GPSGPPGSRGPVGDPGPRGPPGLPGISVKGDKGD 1402

Query:   365 -AQRGP-NYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEA 421
               +RGP    I      DP  GL G     GP     R PG + ++  G     GP    
Sbjct:  1403 RGERGPPGPGIGASEQGDP--GLPGLPGSPGPQGPAGR-PGEKGEK--GDCEDGGPGLPG 1457

Query:   422 QRAPSYIPQ-RG-PGYDLQRG-QGYDMRRA-PSYDPSRG----TGFDGAPRGAAPHGQVP 473
             Q  P   P  RG PG    +G +G       P     RG     G  G P GAA H    
Sbjct:  1458 QPGPPGEPGLRGAPGMTGPKGDRGLTGTPGEPGVKGERGHPGPVGPQGLP-GAAGH---- 1512

Query:   474 PPLNNVPYGSATPPARSGS-GQP-RGGNPA 501
             P +   P G   P  R G  G+P R G+PA
Sbjct:  1513 PGVEG-PEGPPGPTGRRGEKGEPGRPGDPA 1541


>UNIPROTKB|I3LNI2 [details] [associations]
            symbol:TFG "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0043123 "positive regulation of I-kappaB
            kinase/NF-kappaB cascade" evidence=IEA] [GO:0042802 "identical
            protein binding" evidence=IEA] [GO:0004871 "signal transducer
            activity" evidence=IEA] GO:GO:0043123 GO:GO:0004871 OMA:YTTQTSQ
            GeneTree:ENSGT00510000047809 EMBL:CU928320 EMBL:AEMK01189642
            Ensembl:ENSSSCT00000026186 Uniprot:I3LNI2
        Length = 340

 Score = 116 (45.9 bits), Expect = 0.00067, P = 0.00067
 Identities = 76/301 (25%), Positives = 114/301 (37%)

Query:   216 ATEVEKLRAELMNAPN-VDRRAD-----GSYGGATGNSENET-SGRPVGQNAYEDGYGVP 268
             +++V+ LR EL+   N V+R  D     G  G +T  +EN+T  GR   + A  D  G  
Sbjct:    38 SSQVKYLRRELIELRNKVNRLLDSLEPPGEPGPSTNITENDTVDGREE-KPAASDSSGKQ 96

Query:   269 QGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKA 328
                    S +    +      + +  +A   G         P  P  + S  P   AS +
Sbjct:    97 STQVMAASMSAFDPLKNQDEINKNVMSAF--GLTDDQVSGPPSAPAEDRSGTPDSIASSS 154

Query:   329 PSYDPTKGPSYDPAKGPGYDPTKGPGY-DAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGY 387
              +  P   P   P + P        G  + Q    Y  Q G      + P   PQ+  G 
Sbjct:   155 SAAHP---PGVQPQQPPYTGALTQAGQSEGQMYQQYPQQAGYGTQQPQAPPQPPQQS-GS 210

Query:   388 DMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYI--PQRGPGYDLQRGQGYDM 445
              + +G  Y  Q GP  + Q+  GY  Q  P  +A  AP++   PQ+ P    Q+ Q    
Sbjct:   211 SLSKG--YSQQTGP-QQPQQFQGYGQQ--PTSQAP-APAFSGQPQQMPAQPPQQYQASSY 264

Query:   446 R-RAPSYDPSRGTGFDGAPRGAAPHGQVP--PPLNNVPYGSATPPARSGSGQPRGGNPAR 502
               +  +   S+ T +  AP  A+  G  P  P       G   PP  + +  P G NP  
Sbjct:   265 PPQTYTTQTSQPTNYTVAP--ASQPGMAPSQPGAYQPRPGFTPPPGSTMTPLPSGSNPYA 322

Query:   503 R 503
             R
Sbjct:   323 R 323


>UNIPROTKB|A8E651 [details] [associations]
            symbol:EWSR1 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0005634 "nucleus" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0000166 "nucleotide binding" evidence=IEA]
            InterPro:IPR000504 InterPro:IPR001876 InterPro:IPR012677
            Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102 PROSITE:PS50199
            SMART:SM00360 SMART:SM00547 GO:GO:0005634 GO:GO:0000166
            GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676 eggNOG:NOG240581
            GeneTree:ENSGT00530000063105 CTD:2130 HOGENOM:HOG000038010
            HOVERGEN:HBG000970 KO:K13209 OMA:EGTSTGY OrthoDB:EOG42NJ15
            EMBL:DAAA02045602 EMBL:BC153844 IPI:IPI00871084
            RefSeq:NP_001103270.1 UniGene:Bt.33949 SMR:A8E651 STRING:A8E651
            Ensembl:ENSBTAT00000023612 GeneID:534073 KEGG:bta:534073
            InParanoid:A8E651 NextBio:20876260 Uniprot:A8E651
        Length = 655

 Score = 120 (47.3 bits), Expect = 0.00067, P = 0.00067
 Identities = 73/278 (26%), Positives = 99/278 (35%)

Query:   238 GSYGGATGNSENETSGRPV-GQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 290
             G+YG  T  S  +       GQ AY   YG P  G+  P  P A +  V G G    +T+
Sbjct:    42 GTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYSTPAAPQAYSQPVQGYGTGAYDTT 101

Query:   291 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG-YD 348
             T+    TQ+    ++AY   P  P Y   + P   A   P        +  P    G Y+
Sbjct:   102 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPARPQDGNKPAETSQPQSSTGGYN 159

Query:   349 -PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 405
              P+ G G   Q   +Y    G  P   +   PSY P     Y   +  +YD        T
Sbjct:   160 QPSLGYG---QSNYSYPQVPGSYPMQPVSAPPSYPPT---SYSSTQPTSYDQSSYSQQNT 213

Query:   406 QRVPG-YDVQRG----PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFD 460
                P  Y  Q        Y  Q   SY PQ G  Y     Q Y  +++ SY        D
Sbjct:   214 YGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTG-SYSQAPSQ-YS-QQSSSYGQQSSFRQD 270

Query:   461 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 498
               P     +GQ     +  P  + +       G+ RGG
Sbjct:   271 H-PSSMGVYGQESGGFSG-PGENRSMSGPDNRGRGRGG 306


>UNIPROTKB|Q01844 [details] [associations]
            symbol:EWSR1 "RNA-binding protein EWS" species:9606 "Homo
            sapiens" [GO:0000166 "nucleotide binding" evidence=IEA] [GO:0008270
            "zinc ion binding" evidence=IEA] [GO:0003723 "RNA binding"
            evidence=IEA] [GO:0005516 "calmodulin binding" evidence=IEA]
            [GO:0006351 "transcription, DNA-dependent" evidence=IEA]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0005737
            "cytoplasm" evidence=IEA] [GO:0005886 "plasma membrane"
            evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            InterPro:IPR000504 InterPro:IPR001876 InterPro:IPR012677
            Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50096 PROSITE:PS50102
            PROSITE:PS50199 SMART:SM00360 SMART:SM00547 GO:GO:0005886
            GO:GO:0005634 GO:GO:0005737 GO:GO:0006355 GO:GO:0000166
            GO:GO:0046872 EMBL:CH471095 GO:GO:0008270 Gene3D:3.30.70.330
            GO:GO:0006351 GO:GO:0003723 EMBL:AC002059 MIM:612160 Orphanet:97338
            Pathway_Interaction_DB:bard1pathway eggNOG:NOG240581 EMBL:AL031186
            MIM:612219 Orphanet:319 EMBL:X66899 EMBL:X72990 EMBL:X72991
            EMBL:X72992 EMBL:X72993 EMBL:X72994 EMBL:X72995 EMBL:X72996
            EMBL:X72997 EMBL:X72998 EMBL:X72999 EMBL:X73000 EMBL:X73001
            EMBL:X73002 EMBL:X73003 EMBL:X73004 EMBL:Y07848 EMBL:CR456490
            EMBL:AK056309 EMBL:AK056681 EMBL:AC000026 EMBL:BC000527
            EMBL:BC004817 EMBL:BC011048 EMBL:BC072442 EMBL:Y08806 EMBL:AB016435
            IPI:IPI00065554 IPI:IPI00293254 IPI:IPI00335961 IPI:IPI00872855
            IPI:IPI00879259 PIR:A49358 RefSeq:NP_001156757.1
            RefSeq:NP_001156759.1 RefSeq:NP_005234.1 RefSeq:NP_053733.2
            UniGene:Hs.374477 PDB:2CPE PDBsum:2CPE ProteinModelPortal:Q01844
            SMR:Q01844 IntAct:Q01844 MINT:MINT-2858561 STRING:Q01844
            PhosphoSite:Q01844 DMDM:544261 PaxDb:Q01844 PRIDE:Q01844 DNASU:2130
            Ensembl:ENST00000332035 Ensembl:ENST00000333395
            Ensembl:ENST00000397938 Ensembl:ENST00000406548
            Ensembl:ENST00000414183 GeneID:2130 KEGG:hsa:2130 UCSC:uc003aet.3
            CTD:2130 GeneCards:GC22P029663 HGNC:HGNC:3508 HPA:CAB004230
            MIM:133450 neXtProt:NX_Q01844 Orphanet:83469 PharmGKB:PA27921
            HOGENOM:HOG000038010 HOVERGEN:HBG000970 KO:K13209 OMA:EGTSTGY
            OrthoDB:EOG42NJ15 PhylomeDB:Q01844 ChiTaRS:EWSR1
            EvolutionaryTrace:Q01844 GenomeRNAi:2130 NextBio:8605
            ArrayExpress:Q01844 Bgee:Q01844 CleanEx:HS_EWSR1
            Genevestigator:Q01844 GermOnline:ENSG00000182944 Uniprot:Q01844
        Length = 656

 Score = 120 (47.3 bits), Expect = 0.00067, P = 0.00067
 Identities = 75/279 (26%), Positives = 102/279 (36%)

Query:   238 GSYGGATGNSENETSGRPV-GQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 290
             G+YG  T  S  +       GQ AY   YG P  G+  P  P A +  V G G    +T+
Sbjct:    42 GTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTT 101

Query:   291 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPS--YDPTKGPSYDPAKGPGY 347
             T+    TQ+    ++AY   P  P Y   + P   A   P     PT+      + G GY
Sbjct:   102 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPTRPQDGNKPTETSQPQSSTG-GY 158

Query:   348 D-PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE 404
             + P+ G G   Q   +Y    G  P   +   PSY P     Y   +  +YD        
Sbjct:   159 NQPSLGYG---QSNYSYPQVPGSYPMQPVTAPPSYPPT---SYSSTQPTSYDQSSYSQQN 212

Query:   405 TQRVPG-YDVQRG----PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGF 459
             T   P  Y  Q        Y  Q   SY PQ G  Y     Q Y  +++ SY        
Sbjct:   213 TYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTG-SYSQAPSQ-YS-QQSSSYGQQSSFRQ 269

Query:   460 DGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 498
             D  P     +GQ     +  P  + +       G+ RGG
Sbjct:   270 DH-PSSMGVYGQESGGFSG-PGENRSMSGPDNRGRGRGG 306


>UNIPROTKB|F1LN98 [details] [associations]
            symbol:Ewsr1 "Protein Ewsr1" species:10116 "Rattus
            norvegicus" [GO:0000166 "nucleotide binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005622
            "intracellular" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102
            PROSITE:PS50199 SMART:SM00360 SMART:SM00547 RGD:1307258
            GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
            GO:GO:0005622 GeneTree:ENSGT00530000063105 IPI:IPI00364603
            Ensembl:ENSRNOT00000012634 ArrayExpress:F1LN98 Uniprot:F1LN98
        Length = 656

 Score = 120 (47.3 bits), Expect = 0.00067, P = 0.00067
 Identities = 74/278 (26%), Positives = 100/278 (35%)

Query:   238 GSYGGATGNSENET-SGRPVGQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 290
             G+YG  T  S  +  S    GQ AY   YG P  G+  P  P A +  V G G    +T+
Sbjct:    42 GTYGQPTDVSYTQAQSTATXGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTT 101

Query:   291 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG-YD 348
             T+    TQ+    ++AY   P  P Y   + P   A   P        +  P    G Y+
Sbjct:   102 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPTRPQDGNKPAETSQPQSSTGGYN 159

Query:   349 -PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 405
              P+ G G   Q   +Y    G  P   +   PSY P     Y   +  +YD        T
Sbjct:   160 QPSLGYG---QSNYSYPQVPGSYPMQPVTAPPSYPPT---SYSSSQPTSYDQSSYSQQNT 213

Query:   406 QRVPG-YDVQRG----PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFD 460
                P  Y  Q        Y  Q   SY PQ G  Y     Q Y  +++ SY        D
Sbjct:   214 YGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTG-SYSQAPSQ-YS-QQSSSYGQQSSFRQD 270

Query:   461 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 498
               P     +GQ     +  P  + +       G+ RGG
Sbjct:   271 H-PSSMGVYGQESGGFSG-PGENRSLSGPDNRGRGRGG 306


>UNIPROTKB|F1RY40 [details] [associations]
            symbol:RBM12B "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0000166 "nucleotide binding" evidence=IEA] InterPro:IPR000504
            InterPro:IPR012677 PROSITE:PS50102 SMART:SM00360 GO:GO:0000166
            Gene3D:3.30.70.330 GO:GO:0003676 GeneTree:ENSGT00620000087828
            OMA:EHFRRPP CTD:389677 EMBL:CU633952 RefSeq:XP_003125614.1
            UniGene:Ssc.32661 Ensembl:ENSSSCT00000006702 GeneID:100514101
            KEGG:ssc:100514101 Uniprot:F1RY40
        Length = 986

 Score = 122 (48.0 bits), Expect = 0.00068, P = 0.00068
 Identities = 42/150 (28%), Positives = 65/150 (43%)

Query:   327 KAPSYDPTKGPSYDPAKGPGYDPTKGPGY-DAQKGSNYDAQRGPNYDIHRGPSYDPQRGL 385
             + P  D  + P  +  + P  +  + P   D ++    D +R P  D  R P  D +R  
Sbjct:   581 RRPPEDDFRRPWEEDFRYPREEDFRYPREEDWRRPPEEDFRRPPKDDFRRPPEEDWRRPP 640

Query:   386 GYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDM 445
               D +R P  D +R P  + +R P  + +R P  + +R P    +R P  D +R    D 
Sbjct:   641 EGDFRRPPEEDWRRPPEEDFRRPPPGEWRRPPEEDFRRPPEEDFRRLPEEDFRRPHEEDF 700

Query:   446 RRAPSYDPSRGTGFDGAPRGAAPHGQVPPP 475
             RR+P  D  R +  D   R    H + PPP
Sbjct:   701 RRSPEED-FRHSPEDDFRRPPPEHFRRPPP 729


>ZFIN|ZDB-GENE-030131-6410 [details] [associations]
            symbol:tprb "translocated promoter region b (to
            activated MET oncogene)" species:7955 "Danio rerio" [GO:0006606
            "protein import into nucleus" evidence=IEA] [GO:0005643 "nuclear
            pore" evidence=IEA] InterPro:IPR012929 Pfam:PF07926
            ZFIN:ZDB-GENE-030131-6410 GO:GO:0005643 GO:GO:0006606 KO:K09291
            EMBL:BX323056 GeneTree:ENSGT00700000104019 HOGENOM:HOG000139431
            HOVERGEN:HBG009158 IPI:IPI00507729 RefSeq:NP_001025294.1
            UniGene:Dr.52426 Ensembl:ENSDART00000017941 GeneID:558883
            KEGG:dre:558883 CTD:558883 InParanoid:Q5RI09 OMA:RVSWEEQ
            NextBio:20882676 Uniprot:Q5RI09
        Length = 2352

 Score = 125 (49.1 bits), Expect = 0.00070, Sum P(4) = 0.00070
 Identities = 41/179 (22%), Positives = 75/179 (41%)

Query:    59 HVE-MQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQMRNLTEKIAK 117
             H++ +Q+L  E  RL A        L   Q ++Q L   +G +  ER+   ++   KI  
Sbjct:  1367 HLKRIQQLVEETGRLKADAARSSGSLTTLQSQVQNLRENLGKVMVERDNLKKDQEAKILD 1426

Query:   118 MEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQL-TQDLQRAHTDVQQIPALLSE 176
             ++ ++KT   VK   ++ KT+ + L V  E+L+A       QD +      Q++  L   
Sbjct:  1427 IQEKIKTITQVKKIGRRYKTQYEELKVEYEKLVAAAASAPAQDQEAQQASAQELQNLKES 1486

Query:   177 LESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRR 235
             L           G  E   +   +     +  ++    + TE+ +LR EL    + + R
Sbjct:  1487 LNQSETRIRELEGQLENLNRTVGEREMEARSAQEQASRLQTELTRLRQELQEKSSQEER 1545

 Score = 48 (22.0 bits), Expect = 0.00070, Sum P(4) = 0.00070
 Identities = 18/60 (30%), Positives = 27/60 (45%)

Query:   275 PSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPT 334
             P +T+ G+  A P+TS+   A+  S +P  A    PR    E S      +   P+  PT
Sbjct:  1810 PLSTSTGLWSATPSTSS---ASAVSASPGSALSKRPREEEQE-SMSADTQSQDEPNDSPT 1865

 Score = 46 (21.3 bits), Expect = 0.00070, Sum P(4) = 0.00070
 Identities = 13/37 (35%), Positives = 18/37 (48%)

Query:   468 PHGQVPPPLNNVPYGSATPP-ARSGSGQPRGGNPARR 503
             P     P  ++    S+ PP ARSGSG+   G+   R
Sbjct:  2297 PSTSQEPSSSSADTSSSQPPKARSGSGRQWTGSRGSR 2333

 Score = 45 (20.9 bits), Expect = 0.00070, Sum P(4) = 0.00070
 Identities = 37/153 (24%), Positives = 55/153 (35%)

Query:   324 DASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQK--GSNYDAQR---GPNYDIHRGPS 378
             D S   S D  +    D  +GP  DPT  PG + ++  G+    QR     +++ +    
Sbjct:  2004 DESNEESRDDNEAYEGDDTEGP--DPTD-PGTETEESLGATDSTQRMADSQSFESNTLEM 2060

Query:   379 YD-PQRGLGYDMQRGPNYDMQR-GPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYD 436
             ++ P         + P        P       P  ++  GP  + QR P+     G G  
Sbjct:  2061 FEVPVTSSAPRPPQSPRRPQHPLPPRLNILAAPAQEL--GPPAQVQRLPARRQSVGRGLQ 2118

Query:   437 LQRG-----QGY---DMRRAPSYD--PSRGTGF 459
             L  G     Q +   D R  PS    P R  GF
Sbjct:  2119 LASGMASSAQPFFEDDDRMVPSTPTLPLRSDGF 2151


>UNIPROTKB|E1BYQ6 [details] [associations]
            symbol:TPR "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006606 "protein import into nucleus" evidence=IEA]
            [GO:0000776 "kinetochore" evidence=IEA] [GO:0005643 "nuclear pore"
            evidence=IEA] [GO:0007094 "mitotic spindle assembly checkpoint"
            evidence=IEA] [GO:0031965 "nuclear membrane" evidence=IEA]
            InterPro:IPR012929 Pfam:PF07926 GO:GO:0000776 GO:GO:0007094
            GO:GO:0031965 GO:GO:0005643 GO:GO:0006606 KO:K09291
            GeneTree:ENSGT00700000104019 CTD:7175 OMA:RFIRREK EMBL:AADN02061595
            IPI:IPI00591857 RefSeq:XP_422300.2 UniGene:Gga.14251
            Ensembl:ENSGALT00000008185 GeneID:424457 KEGG:gga:424457
            NextBio:20826784 Uniprot:E1BYQ6
        Length = 2368

 Score = 119 (46.9 bits), Expect = 0.00070, Sum P(2) = 0.00070
 Identities = 36/179 (20%), Positives = 83/179 (46%)

Query:    49 EVMEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQM 108
             +++ +K A+    +Q+++ E  RL A        L  +Q+ LQ L  ++  +++E+E   
Sbjct:  1359 KLLSEKEANTK-RIQQMSEETGRLKAEIARTTASLTTSQNLLQNLKDEVAKIRTEKETLQ 1417

Query:   109 RNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVH-QLTQDLQRAHTDV 167
             + L  K+A ++ ++KT   VK   ++ KT+ + L    ++++A+   Q   + Q     V
Sbjct:  1418 KELDAKVADIQEKVKTITQVKKIGRRYKTQYEELKAQHDKMVAEAATQSFVEQQEEQVSV 1477

Query:   168 QQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAEL 226
             Q++  L   L     +        E  +K   +     + +++    + +E+ + R +L
Sbjct:  1478 QEVQELKDSLSQAEGKTKTLENQVENLQKTVAEKETEARNLQEQISQLQSELARFRQDL 1536

 Score = 58 (25.5 bits), Expect = 0.00070, Sum P(2) = 0.00070
 Identities = 29/113 (25%), Positives = 40/113 (35%)

Query:   233 DRRADGSYG-GATGNSENETSGRPVGQNAYE--DGYGVPQGHGPPPSATTAGVVGAGPNT 289
             D   D   G G  G+  NE +G   G + YE  D  G     G  P   T   +G G + 
Sbjct:  1976 DEDDDEDTGMGDEGDDSNEGTGSADGNDGYEADDAEGAD---GTDPGTETEESLGGGESN 2032

Query:   290 STSAYAATQ-SGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDP 341
               +A +     G+   A    P     E    P   AS+  +  P + P   P
Sbjct:  2033 QRAADSQNSCEGSTSTAESTFPHESSREQQ--PS-SASERQAPRPPQSPRRPP 2082


>ZFIN|ZDB-GENE-041221-3
            symbol:prnprs3 "prion protein, related sequence 3"
            species:7955 "Danio rerio" [GO:0005509 "calcium ion binding"
            evidence=IEA] [GO:0005544 "calcium-dependent phospholipid binding"
            evidence=IEA] [GO:0051260 "protein homooligomerization"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0048854
            "brain morphogenesis" evidence=IMP] [GO:0005886 "plasma membrane"
            evidence=IDA] [GO:0007156 "homophilic cell adhesion" evidence=IDA]
            [GO:0021731 "trigeminal motor nucleus development" evidence=IMP]
            [GO:0042981 "regulation of apoptotic process" evidence=IMP]
            InterPro:IPR001464 InterPro:IPR022416 ZFIN:ZDB-GENE-041221-3
            GO:GO:0005886 GO:GO:0042981 GO:GO:0051260 GO:GO:0005509
            GO:GO:0007156 GO:GO:0005544 PANTHER:PTHR10502 GO:GO:0048854
            Gene3D:1.10.790.10 SUPFAM:SSF54098 HOVERGEN:HBG056090 EMBL:AJ620614
            IPI:IPI00679275 RefSeq:NP_001013316.1 UniGene:Dr.162496
            UniGene:Dr.84038 ProteinModelPortal:Q5K4F8 GeneID:503702
            KEGG:dre:503702 CTD:503702 InParanoid:Q5K4F8 NextBio:20866258
            ArrayExpress:Q5K4F8 GO:GO:0021731 Uniprot:Q5K4F8
        Length = 567

 Score = 119 (46.9 bits), Expect = 0.00071, P = 0.00071
 Identities = 70/224 (31%), Positives = 94/224 (41%)

Query:   228 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYG----VPQ--GHGPPPSATTAG 281
             ++ N    + G+ GG++ +S + +S +    +      G     PQ     PPP     G
Sbjct:    36 SSSNKGGSSSGNKGGSSSSSSSSSSSKGTSSHGTHTSPGNYPRQPQVPNQNPPPYP---G 92

Query:   282 VVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDP 341
               G  P       A +  G P + +Y  P   GY  ++G GY A     Y P +G  Y P
Sbjct:    93 AGGGYPGQGRYPPAGSNPGYPNQGSY--PGRAGYP-NQG-GYPAQGG--Y-PAQG-GY-P 143

Query:   342 AKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRG 400
             A+G GY P +G GY AQ G  Y AQ G     + G S  P +G GY  Q G P      G
Sbjct:   144 AQG-GY-PAQG-GYPAQGG--YPAQGGYPQGNYPGRSGYPGQG-GYPAQGGYPGGASYPG 197

Query:   401 PGYET--QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQG 442
              G  +   R PG +    PV  +   P Y P RG     Q G G
Sbjct:   198 AGAGSYPNRYPGGNPY--PVGGSY--PGY-PVRGGSSPNQFGGG 236


>UNIPROTKB|F1NL02
            symbol:COL22A1 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0005198 "structural molecule activity"
            evidence=IEA] [GO:0005587 "collagen type IV" evidence=IEA]
            [GO:0030198 "extracellular matrix organization" evidence=IEA]
            [GO:0071230 "cellular response to amino acid stimulus"
            evidence=IEA] InterPro:IPR002035 Pfam:PF00092 PROSITE:PS50234
            SMART:SM00327 InterPro:IPR008985 SUPFAM:SSF49899 InterPro:IPR008160
            Pfam:PF01391 InterPro:IPR001791 SMART:SM00210
            GeneTree:ENSGT00700000104250 OMA:KRENGAQ EMBL:AADN02037495
            EMBL:AADN02037496 EMBL:AADN02037497 EMBL:AADN02037498
            IPI:IPI00577055 Ensembl:ENSGALT00000026109 Uniprot:F1NL02
        Length = 1588

 Score = 124 (48.7 bits), Expect = 0.00072, P = 0.00072
 Identities = 78/265 (29%), Positives = 99/265 (37%)

Query:   255 PVGQNAYEDGYGVPQGHGPPPSAT-TAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRG 312
             P G    E G   P G G PP      G +G  GP          ++G P  A    P G
Sbjct:  1248 PPGPRG-EPGATGPAGRGGPPGKDGDTGPIGPQGPRGLRGQPG--KNGLPGSAGEPGPAG 1304

Query:   313 -PGYEASKG-------PGYDASKAPSYDP-TKGP-SYDPAKG-PGYDPTKG----PGYDA 357
              PG + +KG       PG+   + P  DP  KGP   + A G PG   +KG    PG   
Sbjct:  1305 NPGPKGNKGENGSPGLPGFIGPRGPPGDPGEKGPPGKEGAPGKPGETGSKGERGEPGIKG 1364

Query:   358 QKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDM-QRGP-GYETQRVPGYDVQ 414
             +KG     Q+GP  +    P     +G  G     GP  D  Q GP G   Q  PG+   
Sbjct:  1365 EKGPQ--GQKGPPGE----PGIPGHKGHPGLMGPHGPPGDTGQVGPPGPPGQ--PGFPGP 1416

Query:   415 RG--PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQV 472
             RG  P  E  R    + Q      L     Y + + P   P+      G P    P G+ 
Sbjct:  1417 RGEPPSLETLRR---LIQEELAKQLDAKLAYLLAQIP---PAHVKASHGRPGPPGPPGKE 1470

Query:   473 PPPLNNVPYGSATPPARSGSGQPRG 497
               P    P G    P ++GS  P G
Sbjct:  1471 GLPGRTGPPGEPGRPGQTGSEGPPG 1495


>MGI|MGI:1932491
            symbol:Prp2 "proline rich protein 2" species:10090 "Mus
            musculus" [GO:0003674 "molecular_function" evidence=ND] [GO:0005575
            "cellular_component" evidence=ND] [GO:0005576 "extracellular
            region" evidence=IEA] [GO:0008150 "biological_process" evidence=ND]
            MGI:MGI:1932491 GO:GO:0005576 InterPro:IPR026086 PANTHER:PTHR23203
            UniGene:Mm.425348 UniGene:Mm.484054 CleanEx:MM_PRH1 EMBL:M23236
            EMBL:M12100 EMBL:M19419 IPI:IPI00474263 IPI:IPI00855123 PIR:A28996
            PIR:D29149 UniGene:Mm.333439 Genevestigator:P05143
            GermOnline:ENSMUSG00000058295 Uniprot:P05143
        Length = 317

 Score = 115 (45.5 bits), Expect = 0.00076, P = 0.00076
 Identities = 67/242 (27%), Positives = 77/242 (31%)

Query:   266 GVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGP-GYD 324
             G P   GP P          GP            G   R     P  PG    + P G  
Sbjct:    79 GPPPPGGPQPRPPQGPPPPGGPQPRPPQGPPPPGGPQPRPPQG-PPPPGGPQPRPPQGPP 137

Query:   325 ASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRG 384
                 P   P +GP   P  GP   P +GP   A  G      +GP      GP   P +G
Sbjct:   138 PPGGPQQRPPQGPP--PPGGPQPRPPQGPPPPA--GPQPRPPQGPPPPA--GPHLRPTQG 191

Query:   385 ---LGYDMQRGPNYDMQRGPGYETQRVP-GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG 440
                 G   QR P       PG    R P G     GP     + P   P  GP    +  
Sbjct:   192 PPPTGGPQQRYPQSPPP--PGGPQPRPPQGPPPPGGPHPRPTQGP---PPTGP--QPRPT 244

Query:   441 QGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQ--PRGG 498
             QG      P   P +G    G P+   P G  PPP    P  +  P    G  Q  P  G
Sbjct:   245 QGPPPTGGPQQRPPQGPPPPGGPQPRPPQGP-PPPTGPQPRPTQGPHPTGGPQQTPPLAG 303

Query:   499 NP 500
             NP
Sbjct:   304 NP 305


>MGI|MGI:88455
            symbol:Col4a2 "collagen, type IV, alpha 2" species:10090 "Mus
            musculus" [GO:0001525 "angiogenesis" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=IEA] [GO:0005578
            "proteinaceous extracellular matrix" evidence=IEA] [GO:0005581
            "collagen" evidence=IEA] [GO:0005587 "collagen type IV"
            evidence=IDA] [GO:0005604 "basement membrane" evidence=IDA]
            [GO:0006351 "transcription, DNA-dependent" evidence=IDA]
            [GO:0016525 "negative regulation of angiogenesis" evidence=ISO]
            InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
            MGI:MGI:88455 GO:GO:0071560 InterPro:IPR016187 SUPFAM:SSF56436
            GO:GO:0006351 GO:GO:0001525 InterPro:IPR008160 Pfam:PF01391
            eggNOG:NOG12793 GO:GO:0016525 GO:GO:0005201 HOVERGEN:HBG004933
            GO:GO:0005587 Gene3D:2.170.240.10 GeneTree:ENSGT00690000101772
            KO:K06237 EMBL:J04448 EMBL:M23333 OrthoDB:EOG4XGZZF CTD:1284
            OMA:TTIPEQN ChiTaRS:COL4A2 EMBL:J04695 EMBL:AK053858 EMBL:AK075619
            EMBL:AK164096 EMBL:BC013560 EMBL:BC080789 EMBL:BC107685 EMBL:M23334
            EMBL:X02896 EMBL:X02897 EMBL:X02898 EMBL:X02899 EMBL:X04410
            EMBL:X04647 EMBL:M15833 EMBL:AY375463 EMBL:AY502946 EMBL:AY502947
            IPI:IPI00338452 PIR:A33526 RefSeq:NP_034062.3 UniGene:Mm.181021
            ProteinModelPortal:P08122 SMR:P08122 STRING:P08122
            PhosphoSite:P08122 PaxDb:P08122 PRIDE:P08122
            Ensembl:ENSMUST00000033899 GeneID:12827 KEGG:mmu:12827
            InParanoid:P08122 NextBio:282318 Bgee:P08122 CleanEx:MM_COL4A2
            Genevestigator:P08122 GermOnline:ENSMUSG00000031503 Uniprot:P08122
        Length = 1707

 Score = 124 (48.7 bits), Expect = 0.00078, P = 0.00078
 Identities = 91/301 (30%), Positives = 110/301 (36%)

Query:   229 APNVDRRADGSYGGATGN----SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG 284
             +P VD   D  + G TG+     E  T   PVG    +   G P   GP  S    G  G
Sbjct:  1205 SPGVDAHGDPGFPGPTGDRGDRGEANTLPGPVGVPGQKGERGTPGERGPAGSPGLQGFPG 1264

Query:   285 AGP--NTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGPGYD--ASKAPSYDPTKGPSY 339
               P  N S S       G      Y  P GP G  A  G   D  +S A  +   KG   
Sbjct:  1265 ISPPSNISGSPGDVGAPGIFGLQGYQGPPGPPGPNALPGIKGDEGSSGAAGFPGQKGWVG 1324

Query:   340 DPAKGPGYDP-TKG-PGYDAQKGSN-YDAQRGPNYDI-HRGPSYDPQRGLGYDMQRGPNY 395
             DP  GP   P   G PG    KG   +    GP+  +  RGP   P+   G+    G   
Sbjct:  1325 DP--GPQGQPGVLGLPGEKGPKGEQGFMGNTGPSGAVGDRGPK-GPKGDQGFPGAPGS-- 1379

Query:   396 DMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPS 454
                  PG     +PG   Q+  V      P    +RG PG   + G      + P  DP 
Sbjct:  1380 --MGSPG-----IPGIP-QKIAVQPGTLGPQ--GRRGLPGALGEIGP-----QGPPGDP- 1423

Query:   455 RGTGFDGAPRGAAPHGQ-----VPP------PLNNV-PYGSATPPARSGS-GQPRGGNPA 501
                GF GAP  A P G+     VP       P+ +  P G    P R GS G P  G P 
Sbjct:  1424 ---GFRGAPGKAGPQGRGGVSAVPGFRGDQGPMGHQGPVGQEGEPGRPGSPGLP--GMPG 1478

Query:   502 R 502
             R
Sbjct:  1479 R 1479


>UNIPROTKB|I3LSV6
            symbol:COL2A1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0071773 "cellular response to BMP stimulus"
            evidence=IEA] [GO:0071599 "otic vesicle development" evidence=IEA]
            [GO:0060351 "cartilage development involved in endochondral bone
            morphogenesis" evidence=IEA] [GO:0060272 "embryonic skeletal joint
            morphogenesis" evidence=IEA] [GO:0060174 "limb bud formation"
            evidence=IEA] [GO:0060021 "palate development" evidence=IEA]
            [GO:0048407 "platelet-derived growth factor binding" evidence=IEA]
            [GO:0043066 "negative regulation of apoptotic process"
            evidence=IEA] [GO:0042802 "identical protein binding" evidence=IEA]
            [GO:0042472 "inner ear morphogenesis" evidence=IEA] [GO:0030903
            "notochord development" evidence=IEA] [GO:0030199 "collagen fibril
            organization" evidence=IEA] [GO:0010468 "regulation of gene
            expression" evidence=IEA] [GO:0007605 "sensory perception of sound"
            evidence=IEA] [GO:0007601 "visual perception" evidence=IEA]
            [GO:0007417 "central nervous system development" evidence=IEA]
            [GO:0006029 "proteoglycan metabolic process" evidence=IEA]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005615 "extracellular
            space" evidence=IEA] [GO:0005604 "basement membrane" evidence=IEA]
            [GO:0005585 "collagen type II" evidence=IEA] [GO:0003007 "heart
            morphogenesis" evidence=IEA] [GO:0002062 "chondrocyte
            differentiation" evidence=IEA] [GO:0001958 "endochondral
            ossification" evidence=IEA] [GO:0001894 "tissue homeostasis"
            evidence=IEA] [GO:0001502 "cartilage condensation" evidence=IEA]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA] InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078
            PROSITE:PS51461 SMART:SM00038 GO:GO:0005737 GO:GO:0043066
            GO:GO:0005615 GO:GO:0003007 GO:GO:0007601 GO:GO:0030199
            GO:GO:0007417 GO:GO:0042472 GO:GO:0001894 GO:GO:0007605
            GO:GO:0071773 InterPro:IPR008160 Pfam:PF01391 GO:GO:0071599
            GO:GO:0005604 GO:GO:0001502 GO:GO:0060021 GO:GO:0002062
            GO:GO:0010468 GO:GO:0060272 GO:GO:0006029 GO:GO:0001958
            GO:GO:0060351 GO:GO:0005201 GeneTree:ENSGT00660000095287
            GO:GO:0005585 GO:GO:0060174 GO:GO:0030903 OMA:CPICPTE
            Ensembl:ENSSSCT00000031054 Uniprot:I3LSV6
        Length = 1365

 Score = 123 (48.4 bits), Expect = 0.00078, P = 0.00078
 Identities = 89/295 (30%), Positives = 111/295 (37%)

Query:   230 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATT--AGVVGAGP 287
             P  DR  D    GA G    +  G P G        G P   GPP       A + G   
Sbjct:    35 PRGDR-GDKGEKGAPGPRGRD--GEP-GTPGNPGPPGPPGPPGPPGLGGNFAAQMAGGFD 90

Query:   288 NTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAKG 344
               +  A      G PM      PRGP G   + GP G+      +  P   GP   P +G
Sbjct:    91 EKAGGAQMGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGRVEDNSLPKATGPM-GP-RG 145

Query:   345 PGYDPTKGPGYDAQKGS-NYDAQRGP-NYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQRG 400
             P   P K PG D + G      +RGP      RG    P  GL G    RG P  D  +G
Sbjct:   146 PPGPPGK-PGDDGEAGKPGKSGERGPPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAKG 202

Query:   401 ----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS----- 450
                 PG + +   PG +   GP+   +  P    + GP       +G D +  P+     
Sbjct:   203 EAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPGP 260

Query:   451 YDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATP---PARSGSGQPRG--GNP 500
               P+ G GF GAP GA   G+  P     P G+  P   P   GS  P G  GNP
Sbjct:   261 VGPAGGPGFPGAP-GAK--GEAGPTGARGPEGAQGPRGEPGNPGSPGPAGASGNP 312


>UNIPROTKB|F1NFF0
            symbol:Gga.41084 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0005634 "nucleus" evidence=IEA] [GO:0006351
            "transcription, DNA-dependent" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] InterPro:IPR001965 InterPro:IPR003618
            InterPro:IPR017890 InterPro:IPR019787 Pfam:PF00628 Pfam:PF07500
            PROSITE:PS50016 PROSITE:PS51321 SMART:SM00249 SMART:SM00510
            GO:GO:0005634 GO:GO:0046872 GO:GO:0008270 GO:GO:0006351
            Gene3D:3.30.40.10 InterPro:IPR011011 InterPro:IPR013083
            SUPFAM:SSF57903 InterPro:IPR019786 PROSITE:PS01359
            Gene3D:1.10.472.30 InterPro:IPR012921 Pfam:PF07744 SUPFAM:SSF46942
            GeneTree:ENSGT00530000063844 EMBL:AADN02019222 EMBL:AADN02019223
            IPI:IPI00821338 Ensembl:ENSGALT00000039659 ArrayExpress:F1NFF0
            Uniprot:F1NFF0
        Length = 2253

 Score = 121 (47.7 bits), Expect = 0.00078, Sum P(2) = 0.00078
 Identities = 69/225 (30%), Positives = 91/225 (40%)

Query:   298 QSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKG-PSYDPAKGPG---YDPTKGP 353
             + G P    ++ P GP      GP +    AP +    G P+ D  +GP    + P KGP
Sbjct:  1778 KGGPPQPPIHNAPGGP-----HGPNF-RGPAPQFSEEHGSPNSDGQRGPAPGRFGPQKGP 1831

Query:   354 G---YDAQKGSN-YDAQRGP--NYDIHRG--PSYDPQRGLGYDMQR---GPNY-DMQRGP 401
                 + +Q GS  +   RGP  +Y + RG  PS        +  QR      Y +M R P
Sbjct:  1832 IPSLFSSQHGSPLFGDNRGPAPSYGVPRGMSPSQFEDHMEPHMEQREFSDSQYNEMIRPP 1891

Query:   402 G-YETQRVPGYDVQRGPV-YEAQRAP---SYIPQRG-PGYDLQRGQGYDMRRAPSYDPSR 455
             G +E    P +   RGP  +  QR P    +  QRG P +   RG        P   P  
Sbjct:  1892 GQFEGSDQPQFMGNRGPFPFGGQRRPPPAQFKGQRGGPQFGGPRGPAPGHFGGPR-GPHT 1950

Query:   456 GTGFDGAPRGAAP-HGQVPPPLNNVPYGSATPPARSGSGQPRGGN 499
                F+G  RG AP HG  P  L   P+       R GS  PR  N
Sbjct:  1951 NQ-FEGQ-RGPAPNHGPGPRGLLPQPFEE-----RRGSPPPRFAN 1988

 Score = 55 (24.4 bits), Expect = 0.00078, Sum P(2) = 0.00078
 Identities = 30/123 (24%), Positives = 50/123 (40%)

Query:    30 GMRPPMPGAFP-PFDMMPPPEVME--QKIASQHVEMQK---LATENQRLAATHGTLRQEL 83
             G  PP P   P P      P V++    I S           AT +  + ATH +  +  
Sbjct:  1288 GSPPPPPPPLPVPESSAVTPSVLKILSSIKSGSTTTAPPPVSATASPSVTATHSSSSKTA 1347

Query:    84 AAAQHELQILHGQIGGMK-SERELQMRNLTEKIAKMEAE-----LKTAEPVKLEF-QKSK 136
                +H LQ L G+    + + +E +    + + A+  AE     +   +P+  +F Q SK
Sbjct:  1348 TPLEHILQTLFGKKKSFEPAAKESETAQSSNQKAQAAAEGGMAAVPLLDPIVQQFGQMSK 1407

Query:   137 TEA 139
              +A
Sbjct:  1408 DKA 1410


>UNIPROTKB|F1NGH5
            symbol:Gga.41084 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0006915 "apoptotic process"
            evidence=IEA] InterPro:IPR001965 InterPro:IPR003618
            InterPro:IPR017890 InterPro:IPR019787 Pfam:PF00628 Pfam:PF07500
            PROSITE:PS50016 PROSITE:PS51321 SMART:SM00249 SMART:SM00510
            GO:GO:0005634 GO:GO:0006915 GO:GO:0046872 GO:GO:0008270
            GO:GO:0006351 Gene3D:3.30.40.10 InterPro:IPR011011
            InterPro:IPR013083 SUPFAM:SSF57903 InterPro:IPR019786
            PROSITE:PS01359 Gene3D:1.10.472.30 InterPro:IPR012921 Pfam:PF07744
            SUPFAM:SSF46942 OMA:PNRMCAD GeneTree:ENSGT00530000063844
            EMBL:AADN02019222 EMBL:AADN02019223 IPI:IPI00577866
            Ensembl:ENSGALT00000009066 ArrayExpress:F1NGH5 Uniprot:F1NGH5
        Length = 2287

 Score = 121 (47.7 bits), Expect = 0.00081, Sum P(2) = 0.00081
 Identities = 69/225 (30%), Positives = 91/225 (40%)

Query:   298 QSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKG-PSYDPAKGPG---YDPTKGP 353
             + G P    ++ P GP      GP +    AP +    G P+ D  +GP    + P KGP
Sbjct:  1806 KGGPPQPPIHNAPGGP-----HGPNF-RGPAPQFSEEHGSPNSDGQRGPAPGRFGPQKGP 1859

Query:   354 G---YDAQKGSN-YDAQRGP--NYDIHRG--PSYDPQRGLGYDMQR---GPNY-DMQRGP 401
                 + +Q GS  +   RGP  +Y + RG  PS        +  QR      Y +M R P
Sbjct:  1860 IPSLFSSQHGSPLFGDNRGPAPSYGVPRGMSPSQFEDHMEPHMEQREFSDSQYNEMIRPP 1919

Query:   402 G-YETQRVPGYDVQRGPV-YEAQRAP---SYIPQRG-PGYDLQRGQGYDMRRAPSYDPSR 455
             G +E    P +   RGP  +  QR P    +  QRG P +   RG        P   P  
Sbjct:  1920 GQFEGSDQPQFMGNRGPFPFGGQRRPPPAQFKGQRGGPQFGGPRGPAPGHFGGPR-GPHT 1978

Query:   456 GTGFDGAPRGAAP-HGQVPPPLNNVPYGSATPPARSGSGQPRGGN 499
                F+G  RG AP HG  P  L   P+       R GS  PR  N
Sbjct:  1979 NQ-FEGQ-RGPAPNHGPGPRGLLPQPFEE-----RRGSPPPRFAN 2016

 Score = 55 (24.4 bits), Expect = 0.00081, Sum P(2) = 0.00081
 Identities = 30/123 (24%), Positives = 50/123 (40%)

Query:    30 GMRPPMPGAFP-PFDMMPPPEVME--QKIASQHVEMQK---LATENQRLAATHGTLRQEL 83
             G  PP P   P P      P V++    I S           AT +  + ATH +  +  
Sbjct:  1288 GSPPPPPPPLPVPESSAVTPSVLKILSSIKSGSTTTAPPPVSATASPSVTATHSSSSKTA 1347

Query:    84 AAAQHELQILHGQIGGMK-SERELQMRNLTEKIAKMEAE-----LKTAEPVKLEF-QKSK 136
                +H LQ L G+    + + +E +    + + A+  AE     +   +P+  +F Q SK
Sbjct:  1348 TPLEHILQTLFGKKKSFEPAAKESETAQSSNQKAQAAAEGGMAAVPLLDPIVQQFGQMSK 1407

Query:   137 TEA 139
              +A
Sbjct:  1408 DKA 1410


>UNIPROTKB|F1PGS0
            symbol:POLR2A "DNA-directed RNA polymerase" species:9615
            "Canis lupus familiaris" [GO:0003899 "DNA-directed RNA polymerase
            activity" evidence=IEA] [GO:0006366 "transcription from RNA
            polymerase II promoter" evidence=IEA] [GO:0005665 "DNA-directed RNA
            polymerase II, core complex" evidence=IEA] [GO:0003677 "DNA
            binding" evidence=IEA] InterPro:IPR000684 InterPro:IPR000722
            InterPro:IPR006592 InterPro:IPR007066 InterPro:IPR007073
            InterPro:IPR007075 InterPro:IPR007080 InterPro:IPR007081
            InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983 Pfam:PF04990
            Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000 Pfam:PF05001
            PROSITE:PS00115 SMART:SM00663 GO:GO:0003677 GO:GO:0006366
            Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899 GO:GO:0005665
            GeneTree:ENSGT00700000104490 OMA:KVLPWST EMBL:AAEX03003616
            EMBL:AAEX03003617 Ensembl:ENSCAFT00000026237 Uniprot:F1PGS0
        Length = 1969

 Score = 137 (53.3 bits), Expect = 0.00081, Sum P(2) = 0.00081
 Identities = 71/265 (26%), Positives = 95/265 (35%)

Query:   228 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP--SATTAGVVGA 285
             N P +   A G  G   G++ +   G       +  G     G   P   S  T G  G 
Sbjct:  1489 NIPGLG--AAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGF 1546

Query:   286 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP 345
              P+ ++ A   +   +P  A    P  PG      PG  +   PS      PSY P   P
Sbjct:  1547 SPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGAMSPSYSPTS-P 1598

Query:   346 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 405
              Y+P    GY  Q  S Y +   P+Y     PSY P     Y     P+Y     P Y  
Sbjct:  1599 AYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPSYS-PTSPSYSP 1652

Query:   406 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 465
                P Y     P Y +  +PSY P   P Y       Y    +PSY P+  +    +P  
Sbjct:  1653 TS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPTSPSYSPTSP-S 1705

Query:   466 AAPHGQVPPPLNNVPYGSATPPARS 490
              +P      P +  P  S T P+ S
Sbjct:  1706 YSPTSPSYSPTS--PSYSPTSPSYS 1728

 Score = 37 (18.1 bits), Expect = 0.00081, Sum P(2) = 0.00081
 Identities = 8/16 (50%), Positives = 10/16 (62%)

Query:    52 EQKIASQHVEMQKLAT 67
             E  +A + VE Q LAT
Sbjct:   893 EDGLAGESVEFQNLAT 908


>UNIPROTKB|G3MZY8
            symbol:POLR2A "DNA-directed RNA polymerase" species:9913
            "Bos taurus" [GO:0031625 "ubiquitin protein ligase binding"
            evidence=IEA] [GO:0005665 "DNA-directed RNA polymerase II, core
            complex" evidence=IEA] [GO:0004672 "protein kinase activity"
            evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA] [GO:0003899
            "DNA-directed RNA polymerase activity" evidence=IEA] [GO:0006366
            "transcription from RNA polymerase II promoter" evidence=IEA]
            InterPro:IPR000684 InterPro:IPR000722 InterPro:IPR006592
            InterPro:IPR007066 InterPro:IPR007073 InterPro:IPR007075
            InterPro:IPR007080 InterPro:IPR007081 InterPro:IPR007083
            Pfam:PF00623 Pfam:PF04983 Pfam:PF04990 Pfam:PF04992 Pfam:PF04997
            Pfam:PF04998 Pfam:PF05000 Pfam:PF05001 PROSITE:PS00115
            SMART:SM00663 GO:GO:0003677 GO:GO:0004672 GO:GO:0006366
            Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899 GO:GO:0005665
            GeneTree:ENSGT00700000104490 OMA:KVLPWST EMBL:DAAA02048777
            EMBL:DAAA02048778 EMBL:DAAA02048779 EMBL:DAAA02048780
            EMBL:DAAA02048781 Ensembl:ENSBTAT00000064788 Uniprot:G3MZY8
        Length = 1970

 Score = 137 (53.3 bits), Expect = 0.00081, Sum P(2) = 0.00081
 Identities = 71/265 (26%), Positives = 95/265 (35%)

Query:   228 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP--SATTAGVVGA 285
             N P +   A G  G   G++ +   G       +  G     G   P   S  T G  G 
Sbjct:  1490 NIPGLG--AAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGF 1547

Query:   286 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP 345
              P+ ++ A   +   +P  A    P  PG      PG  +   PS      PSY P   P
Sbjct:  1548 SPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGAMSPSYSPTS-P 1599

Query:   346 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 405
              Y+P    GY  Q  S Y +   P+Y     PSY P     Y     P+Y     P Y  
Sbjct:  1600 AYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPSYS-PTSPSYSP 1653

Query:   406 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 465
                P Y     P Y +  +PSY P   P Y       Y    +PSY P+  +    +P  
Sbjct:  1654 TS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPTSPSYSPTSP-S 1706

Query:   466 AAPHGQVPPPLNNVPYGSATPPARS 490
              +P      P +  P  S T P+ S
Sbjct:  1707 YSPTSPSYSPTS--PSYSPTSPSYS 1729

 Score = 37 (18.1 bits), Expect = 0.00081, Sum P(2) = 0.00081
 Identities = 8/16 (50%), Positives = 10/16 (62%)

Query:    52 EQKIASQHVEMQKLAT 67
             E  +A + VE Q LAT
Sbjct:   893 EDGLAGESVEFQNLAT 908


>UNIPROTKB|P24928
            symbol:POLR2A "DNA-directed RNA polymerase II subunit RPB1"
            species:9606 "Homo sapiens" [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0003968 "RNA-directed RNA polymerase activity"
            evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0003677 "DNA binding" evidence=NAS] [GO:0003899 "DNA-directed
            RNA polymerase activity" evidence=NAS] [GO:0006355 "regulation of
            transcription, DNA-dependent" evidence=NAS] [GO:0006366
            "transcription from RNA polymerase II promoter"
            evidence=IDA;NAS;TAS] [GO:0005634 "nucleus" evidence=IDA;NAS]
            [GO:0005665 "DNA-directed RNA polymerase II, core complex"
            evidence=IDA] [GO:0004672 "protein kinase activity" evidence=IDA]
            [GO:0005730 "nucleolus" evidence=IDA] [GO:0000398 "mRNA splicing,
            via spliceosome" evidence=TAS] [GO:0005654 "nucleoplasm"
            evidence=TAS] [GO:0006281 "DNA repair" evidence=TAS] [GO:0006283
            "transcription-coupled nucleotide-excision repair" evidence=TAS]
            [GO:0006289 "nucleotide-excision repair" evidence=TAS] [GO:0006367
            "transcription initiation from RNA polymerase II promoter"
            evidence=TAS] [GO:0006368 "transcription elongation from RNA
            polymerase II promoter" evidence=TAS] [GO:0006370
            "7-methylguanosine mRNA capping" evidence=TAS] [GO:0008380 "RNA
            splicing" evidence=TAS] [GO:0010467 "gene expression" evidence=TAS]
            [GO:0016032 "viral reproduction" evidence=TAS] [GO:0050434
            "positive regulation of viral transcription" evidence=TAS]
            [GO:0031625 "ubiquitin protein ligase binding" evidence=IPI]
            [GO:0006468 "protein phosphorylation" evidence=IDA]
            Reactome:REACT_216 Reactome:REACT_71 InterPro:IPR000684
            InterPro:IPR000722 InterPro:IPR006592 InterPro:IPR007066
            InterPro:IPR007073 InterPro:IPR007075 InterPro:IPR007080
            InterPro:IPR007081 InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983
            Pfam:PF04990 Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000
            Pfam:PF05001 PROSITE:PS00115 SMART:SM00663 Reactome:REACT_116125
            EMBL:CH471108 GO:GO:0016032 GO:GO:0006355 GO:GO:0046872
            GO:GO:0003677 Reactome:REACT_1675 GO:GO:0006468 GO:GO:0006368
            Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0006367 GO:GO:0000398
            Reactome:REACT_1788 GO:GO:0006370 GO:GO:0050434 GO:GO:0006283
            Reactome:REACT_1892 EMBL:AC113189 GO:GO:0003899 PDB:2GHQ PDB:2GHT
            PDBsum:2GHQ PDBsum:2GHT eggNOG:COG0086 GO:GO:0003968 GO:GO:0005665
            HOGENOM:HOG000222975 OMA:KVLPWST KO:K03006 EMBL:X63564 EMBL:X74874
            EMBL:X74873 EMBL:X74872 EMBL:X74871 EMBL:X74870 EMBL:BC137231
            IPI:IPI00031627 PIR:I38186 PIR:S21054 RefSeq:NP_000928.1
            UniGene:Hs.270017 PDB:2LTO PDBsum:2LTO ProteinModelPortal:P24928
            SMR:P24928 DIP:DIP-29011N IntAct:P24928 MINT:MINT-156582
            STRING:P24928 PhosphoSite:P24928 DMDM:281185484 PaxDb:P24928
            PRIDE:P24928 Ensembl:ENST00000322644 GeneID:5430 KEGG:hsa:5430
            UCSC:uc002ghf.4 CTD:5430 GeneCards:GC17P007387 H-InvDB:HIX0173727
            HGNC:HGNC:9187 HPA:CAB012226 HPA:CAB016388 HPA:CAB022311
            HPA:HPA021563 MIM:180660 neXtProt:NX_P24928 PharmGKB:PA33507
            HOVERGEN:HBG004339 InParanoid:P24928 OrthoDB:EOG4JWVCM
            BindingDB:P24928 ChEMBL:CHEMBL1641353 ChiTaRS:POLR2A
            EvolutionaryTrace:P24928 GenomeRNAi:5430 NextBio:21009
            ArrayExpress:P24928 Bgee:P24928 CleanEx:HS_POLR2A
            Genevestigator:P24928 GermOnline:ENSG00000181222 Uniprot:P24928
        Length = 1970

 Score = 137 (53.3 bits), Expect = 0.00081, Sum P(2) = 0.00081
 Identities = 71/265 (26%), Positives = 95/265 (35%)

Query:   228 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP--SATTAGVVGA 285
             N P +   A G  G   G++ +   G       +  G     G   P   S  T G  G 
Sbjct:  1489 NIPGLG--AAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGF 1546

Query:   286 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP 345
              P+ ++ A   +   +P  A    P  PG      PG  +   PS      PSY P   P
Sbjct:  1547 SPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGAMSPSYSPTS-P 1598

Query:   346 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 405
              Y+P    GY  Q  S Y +   P+Y     PSY P     Y     P+Y     P Y  
Sbjct:  1599 AYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPSYS-PTSPSYSP 1652

Query:   406 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 465
                P Y     P Y +  +PSY P   P Y       Y    +PSY P+  +    +P  
Sbjct:  1653 TS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPTSPSYSPTSP-S 1705

Query:   466 AAPHGQVPPPLNNVPYGSATPPARS 490
              +P      P +  P  S T P+ S
Sbjct:  1706 YSPTSPSYSPTS--PSYSPTSPSYS 1728

 Score = 37 (18.1 bits), Expect = 0.00081, Sum P(2) = 0.00081
 Identities = 8/16 (50%), Positives = 10/16 (62%)

Query:    52 EQKIASQHVEMQKLAT 67
             E  +A + VE Q LAT
Sbjct:   893 EDGLAGESVEFQNLAT 908


>MGI|MGI:98086
            symbol:Polr2a "polymerase (RNA) II (DNA directed) polypeptide
            A" species:10090 "Mus musculus" [GO:0003677 "DNA binding"
            evidence=IDA] [GO:0003899 "DNA-directed RNA polymerase activity"
            evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005634 "nucleus" evidence=ISO] [GO:0005665 "DNA-directed RNA
            polymerase II, core complex" evidence=ISO] [GO:0005730 "nucleolus"
            evidence=ISO] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0006366 "transcription from RNA polymerase II
            promoter" evidence=ISO] [GO:0006468 "protein phosphorylation"
            evidence=ISO] [GO:0016740 "transferase activity" evidence=IEA]
            [GO:0016779 "nucleotidyltransferase activity" evidence=IEA]
            [GO:0031625 "ubiquitin protein ligase binding" evidence=ISO]
            [GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000684
            InterPro:IPR000722 InterPro:IPR006592 InterPro:IPR007066
            InterPro:IPR007073 InterPro:IPR007075 InterPro:IPR007080
            InterPro:IPR007081 InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983
            Pfam:PF04990 Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000
            Pfam:PF05001 PROSITE:PS00115 SMART:SM00663 MGI:MGI:98086
            GO:GO:0046872 GO:GO:0003677 GO:GO:0004672 GO:GO:0006366
            EMBL:AL603707 Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899
            eggNOG:COG0086 GO:GO:0005665 GeneTree:ENSGT00700000104490
            HOGENOM:HOG000222975 OMA:KVLPWST KO:K03006 CTD:5430
            HOVERGEN:HBG004339 OrthoDB:EOG4JWVCM ChiTaRS:POLR2A EMBL:M12130
            EMBL:M14101 IPI:IPI00136207 PIR:A28490 RefSeq:NP_033115.1
            UniGene:Mm.16533 DisProt:DP00181 ProteinModelPortal:P08775
            SMR:P08775 DIP:DIP-46369N IntAct:P08775 STRING:P08775
            PhosphoSite:P08775 PaxDb:P08775 PRIDE:P08775
            Ensembl:ENSMUST00000058470 Ensembl:ENSMUST00000071213 GeneID:20020
            KEGG:mmu:20020 UCSC:uc007jrj.1 InParanoid:Q5F298 NextBio:297535
            Bgee:P08775 CleanEx:MM_POLR2A Genevestigator:P08775
            GermOnline:ENSMUSG00000005198 Uniprot:P08775
        Length = 1970

 Score = 137 (53.3 bits), Expect = 0.00081, Sum P(2) = 0.00081
 Identities = 71/265 (26%), Positives = 95/265 (35%)

Query:   228 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP--SATTAGVVGA 285
             N P +   A G  G   G++ +   G       +  G     G   P   S  T G  G 
Sbjct:  1489 NIPGLG--AAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGF 1546

Query:   286 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP 345
              P+ ++ A   +   +P  A    P  PG      PG  +   PS      PSY P   P
Sbjct:  1547 SPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGAMSPSYSPTS-P 1598

Query:   346 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 405
              Y+P    GY  Q  S Y +   P+Y     PSY P     Y     P+Y     P Y  
Sbjct:  1599 AYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPSYS-PTSPSYSP 1652

Query:   406 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 465
                P Y     P Y +  +PSY P   P Y       Y    +PSY P+  +    +P  
Sbjct:  1653 TS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPTSPSYSPTSP-S 1705

Query:   466 AAPHGQVPPPLNNVPYGSATPPARS 490
              +P      P +  P  S T P+ S
Sbjct:  1706 YSPTSPSYSPTS--PSYSPTSPSYS 1728

 Score = 37 (18.1 bits), Expect = 0.00081, Sum P(2) = 0.00081
 Identities = 8/16 (50%), Positives = 10/16 (62%)

Query:    52 EQKIASQHVEMQKLAT 67
             E  +A + VE Q LAT
Sbjct:   893 EDGLAGESVEFQNLAT 908


>RGD|1587326
            symbol:Polr2a "polymerase (RNA) II (DNA directed) polypeptide A"
            species:10116 "Rattus norvegicus" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0003677 "DNA binding" evidence=IEA;ISO]
            [GO:0003899 "DNA-directed RNA polymerase activity" evidence=IEA]
            [GO:0004672 "protein kinase activity" evidence=IEA;ISO] [GO:0005575
            "cellular_component" evidence=ND] [GO:0005634 "nucleus"
            evidence=ISO] [GO:0005665 "DNA-directed RNA polymerase II, core
            complex" evidence=IEA;ISO] [GO:0006366 "transcription from RNA
            polymerase II promoter" evidence=IEA;ISO] [GO:0006468 "protein
            phosphorylation" evidence=ISO] [GO:0008150 "biological_process"
            evidence=ND] [GO:0031625 "ubiquitin protein ligase binding"
            evidence=IEA;ISO] [GO:0005730 "nucleolus" evidence=ISO]
            InterPro:IPR000684 InterPro:IPR000722 InterPro:IPR006592
            InterPro:IPR007066 InterPro:IPR007073 InterPro:IPR007075
            InterPro:IPR007080 InterPro:IPR007081 InterPro:IPR007083
            Pfam:PF00623 Pfam:PF04983 Pfam:PF04990 Pfam:PF04992 Pfam:PF04997
            Pfam:PF04998 Pfam:PF05000 Pfam:PF05001 PROSITE:PS00115
            SMART:SM00663 RGD:1587326 GO:GO:0003677 GO:GO:0004672 GO:GO:0006366
            Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899 GO:GO:0005665
            GeneTree:ENSGT00700000104490 KO:K03006 CTD:5430 OrthoDB:EOG4JWVCM
            IPI:IPI00952328 RefSeq:XP_001079162.1 RefSeq:XP_343923.3
            UniGene:Rn.163136 Ensembl:ENSRNOT00000068013 GeneID:363633
            KEGG:rno:363633 UCSC:RGD:1587326 NextBio:683839 ArrayExpress:D4A5A6
            Uniprot:D4A5A6
        Length = 1970

 Score = 137 (53.3 bits), Expect = 0.00081, Sum P(2) = 0.00081
 Identities = 71/265 (26%), Positives = 95/265 (35%)

Query:   228 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP--SATTAGVVGA 285
             N P +   A G  G   G++ +   G       +  G     G   P   S  T G  G 
Sbjct:  1489 NIPGLG--AAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGF 1546

Query:   286 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP 345
              P+ ++ A   +   +P  A    P  PG      PG  +   PS      PSY P   P
Sbjct:  1547 SPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGAMSPSYSPTS-P 1598

Query:   346 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 405
              Y+P    GY  Q  S Y +   P+Y     PSY P     Y     P+Y     P Y  
Sbjct:  1599 AYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPSYS-PTSPSYSP 1652

Query:   406 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 465
                P Y     P Y +  +PSY P   P Y       Y    +PSY P+  +    +P  
Sbjct:  1653 TS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPTSPSYSPTSP-S 1705

Query:   466 AAPHGQVPPPLNNVPYGSATPPARS 490
              +P      P +  P  S T P+ S
Sbjct:  1706 YSPTSPSYSPTS--PSYSPTSPSYS 1728

 Score = 37 (18.1 bits), Expect = 0.00081, Sum P(2) = 0.00081
 Identities = 8/16 (50%), Positives = 10/16 (62%)

Query:    52 EQKIASQHVEMQKLAT 67
             E  +A + VE Q LAT
Sbjct:   893 EDGLAGESVEFQNLAT 908


>UNIPROTKB|F1LRC5
            symbol:Cux1 "Homeobox protein cut-like 1" species:10116
            "Rattus norvegicus" [GO:0003700 "sequence-specific DNA binding
            transcription factor activity" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0043565 "sequence-specific DNA binding"
            evidence=IEA] InterPro:IPR001356 InterPro:IPR003350
            InterPro:IPR009057 InterPro:IPR010982 InterPro:IPR017970
            Pfam:PF00046 Pfam:PF02376 PROSITE:PS00027 PROSITE:PS50071
            PROSITE:PS51042 SMART:SM00389 RGD:620618 GO:GO:0005634
            GO:GO:0043565 GO:GO:0003700 Gene3D:1.10.10.60 SUPFAM:SSF46689
            Gene3D:1.10.260.40 SUPFAM:SSF47413 GeneTree:ENSGT00530000063019
            IPI:IPI00769084 EMBL:AC091536 EMBL:AC091618
            Ensembl:ENSRNOT00000059486 ArrayExpress:F1LRC5 Uniprot:F1LRC5
        Length = 1434

 Score = 88 (36.0 bits), Expect = 0.00084, Sum P(2) = 0.00084
 Identities = 45/173 (26%), Positives = 81/173 (46%)

Query:    62 MQKLATENQRLAATHG---TLRQELAAAQHELQILHGQIGGMKSERELQMRNLTEKIAKM 118
             M  L   NQR         TLR++L++A H LQ L  QI   +   ++ +  LT    ++
Sbjct:   174 MTDLERANQRAEVAQREAETLREQLSSANHSLQ-LASQI---QKAPDVAIEVLTRSSLEV 229

Query:   119 EAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQQIPALL---S 175
             E   K  E  +L     + +A +L   RE   +++ QL Q L   ++ ++Q+   L   +
Sbjct:   230 ELAAKEREIAQLVEDVQRLQA-SLTKLRENSASQISQLEQQLNAKNSTLKQLEEKLKGQA 288

Query:   176 ELESLRQEYHHCRGTYEY---EKKFYNDHLESLQVM--EKNYITMATEVEKLR 223
             + E +++E    + + E+   E     D  + L+V+  EKN  ++ +E   LR
Sbjct:   289 DYEDVKKELTTLK-SMEFAPSEGAGTQDSTKPLEVLLLEKNR-SLQSENATLR 339

 Score = 85 (35.0 bits), Expect = 0.00084, Sum P(2) = 0.00084
 Identities = 41/146 (28%), Positives = 58/146 (39%)

Query:   200 DHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRRADGSYGGATGNSENETSGRPVGQN 259
             D +E+    E     +AT+ +   AE+  AP  DR  + +       ++  +SG P GQ+
Sbjct:  1253 DGVEAADTEEPGGNIVATKSQGGPAEVTAAP-ADRE-EATQPAEKAKAQPLSSGTP-GQD 1309

Query:   260 AYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASK 319
               ED      G   PP    A      PN +  A A   + T   A       PG  A  
Sbjct:  1310 DGEDA-----GRSRPPPEGLADAPAPVPNLAAPA-AGEDAATSATAPAMATEAPG-AARA 1362

Query:   320 GPGYDASKAPSYDPTKGPSYDPAKGP 345
             GP   +S  PS   T  P+  PA+ P
Sbjct:  1363 GPAERSSALPS---TSAPANAPARRP 1385


>UNIPROTKB|F1LM15
            symbol:Cux1 "Homeobox protein cut-like 1" species:10116
            "Rattus norvegicus" [GO:0003700 "sequence-specific DNA binding
            transcription factor activity" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0043565 "sequence-specific DNA binding"
            evidence=IEA] InterPro:IPR001356 InterPro:IPR003350
            InterPro:IPR009057 InterPro:IPR010982 InterPro:IPR017970
            Pfam:PF00046 Pfam:PF02376 PROSITE:PS00027 PROSITE:PS50071
            PROSITE:PS51042 SMART:SM00389 RGD:620618 GO:GO:0005634
            GO:GO:0005737 GO:GO:0030324 GO:GO:0043565 GO:GO:0003700
            GO:GO:0003682 Gene3D:1.10.10.60 SUPFAM:SSF46689 GO:GO:0000122
            Gene3D:1.10.260.40 SUPFAM:SSF47413 GeneTree:ENSGT00530000063019
            GO:GO:0042491 EMBL:AC091536 EMBL:AC091618 IPI:IPI00370330
            Ensembl:ENSRNOT00000001928 ArrayExpress:F1LM15 Uniprot:F1LM15
        Length = 1456

 Score = 88 (36.0 bits), Expect = 0.00087, Sum P(2) = 0.00087
 Identities = 45/173 (26%), Positives = 81/173 (46%)

Query:    62 MQKLATENQRLAATHG---TLRQELAAAQHELQILHGQIGGMKSERELQMRNLTEKIAKM 118
             M  L   NQR         TLR++L++A H LQ L  QI   +   ++ +  LT    ++
Sbjct:   174 MTDLERANQRAEVAQREAETLREQLSSANHSLQ-LASQI---QKAPDVAIEVLTRSSLEV 229

Query:   119 EAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQQIPALL---S 175
             E   K  E  +L     + +A +L   RE   +++ QL Q L   ++ ++Q+   L   +
Sbjct:   230 ELAAKEREIAQLVEDVQRLQA-SLTKLRENSASQISQLEQQLNAKNSTLKQLEEKLKGQA 288

Query:   176 ELESLRQEYHHCRGTYEY---EKKFYNDHLESLQVM--EKNYITMATEVEKLR 223
             + E +++E    + + E+   E     D  + L+V+  EKN  ++ +E   LR
Sbjct:   289 DYEDVKKELTTLK-SMEFAPSEGAGTQDSTKPLEVLLLEKNR-SLQSENATLR 339

 Score = 85 (35.0 bits), Expect = 0.00087, Sum P(2) = 0.00087
 Identities = 41/146 (28%), Positives = 58/146 (39%)

Query:   200 DHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRRADGSYGGATGNSENETSGRPVGQN 259
             D +E+    E     +AT+ +   AE+  AP  DR  + +       ++  +SG P GQ+
Sbjct:  1275 DGVEAADTEEPGGNIVATKSQGGPAEVTAAP-ADRE-EATQPAEKAKAQPLSSGTP-GQD 1331

Query:   260 AYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASK 319
               ED      G   PP    A      PN +  A A   + T   A       PG  A  
Sbjct:  1332 DGEDA-----GRSRPPPEGLADAPAPVPNLAAPA-AGEDAATSATAPAMATEAPG-AARA 1384

Query:   320 GPGYDASKAPSYDPTKGPSYDPAKGP 345
             GP   +S  PS   T  P+  PA+ P
Sbjct:  1385 GPAERSSALPS---TSAPANAPARRP 1407


>UNIPROTKB|B4DLD3
            symbol:SS18 "cDNA FLJ58120, highly similar to SSXT protein"
            species:9606 "Homo sapiens" [GO:0000226 "microtubule cytoskeleton
            organization" evidence=IEA] [GO:0000902 "cell morphogenesis"
            evidence=IEA] [GO:0005881 "cytoplasmic microtubule" evidence=IEA]
            [GO:0007243 "intracellular protein kinase cascade" evidence=IEA]
            [GO:0042493 "response to drug" evidence=IEA] [GO:0048013 "ephrin
            receptor signaling pathway" evidence=IEA] GO:GO:0000226
            GO:GO:0042493 GO:GO:0007243 GO:GO:0000902 GO:GO:0048013
            GO:GO:0005881 HOVERGEN:HBG003892 InterPro:IPR007726
            PANTHER:PTHR23107 Pfam:PF05030 UniGene:Hs.129261 EMBL:AC091021
            HGNC:HGNC:11340 ChiTaRS:SS18 EMBL:AK296949 IPI:IPI01011245
            STRING:B4DLD3 Ensembl:ENST00000542420 Uniprot:B4DLD3
        Length = 395

 Score = 116 (45.9 bits), Expect = 0.00087, P = 0.00087
 Identities = 66/236 (27%), Positives = 88/236 (37%)

Query:   238 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQG---H--GPPPSATTAGVVGAGPNT--- 289
             G+YG     S     G  + Q      Y +PQG   H  G  P     G V  G +    
Sbjct:   165 GNYGPRPNMSMQPNQGPMMHQQPPSQQYNMPQGGGQHYQGQQPPMGMMGQVNQGNHMMGQ 224

Query:   290 -STSAYAATQSGTPMR-AAYDIPRGPGYE-ASKGP--GYDASKAPSYDPTKG---PSYDP 341
                  Y   Q G P + +  +   G  Y    +GP  G +    P      G   PSY P
Sbjct:   225 RQIPPYRPPQQGPPQQYSGQEDYYGDQYSHGGQGPPEGMNQQYYPDGHNDYGYQQPSY-P 283

Query:   342 AKGPGYD-PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 400
              +G  YD P +       +G N  +Q G   D ++GP   PQ+G     Q+ P      G
Sbjct:   284 EQG--YDRPYEDSSQHYYEGGN--SQYGQQQDAYQGPP--PQQGYPPQQQQYPGQQGYPG 337

Query:   401 P--GY-ETQRVPGYDVQRGPVYEAQRAPSYIP-QRGPGYD-LQRGQGYDMRRAPSY 451
                GY  +Q  PG      P  + Q+   Y P Q GP     QR  GYD  +  +Y
Sbjct:   338 QQQGYGPSQGGPGPQYPNYPQGQGQQYGGYRPTQPGPPQPPQQRPYGYDQGQYGNY 393


>UNIPROTKB|J9P0I3
            symbol:ZNF768 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0005622 "intracellular" evidence=IEA] [GO:0003676 "nucleic acid
            binding" evidence=IEA] Pfam:PF00096 InterPro:IPR007087
            InterPro:IPR013087 InterPro:IPR015880 PROSITE:PS00028
            PROSITE:PS50157 SMART:SM00355 GO:GO:0008270 GO:GO:0003676
            GO:GO:0005622 Gene3D:3.30.160.60 GeneTree:ENSGT00700000104520
            KO:K09228 CTD:79724 OMA:SRYESQN EMBL:AAEX03004391
            RefSeq:XP_547025.2 Ensembl:ENSCAFT00000045233 GeneID:489906
            KEGG:cfa:489906 Uniprot:J9P0I3
        Length = 554

 Score = 118 (46.6 bits), Expect = 0.00088, P = 0.00088
 Identities = 27/71 (38%), Positives = 42/71 (59%)

Query:   302 PMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQ--- 358
             P    Y+ P+ PGYE  + PGY+  K+P Y+P K P Y+P + PGY+ ++ PGY+ Q   
Sbjct:   116 PQSPRYE-PQSPGYEP-RSPGYEP-KSPGYEP-KSPGYEP-RSPGYE-SQSPGYEPQNPE 169

Query:   359 ---KGSNYDAQ 366
                +   ++AQ
Sbjct:   170 FKTQSPEFEAQ 180


>UNIPROTKB|F1NGZ3
            symbol:F1NGZ3 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0000278 "mitotic cell cycle" evidence=IEA]
            [GO:0005814 "centriole" evidence=IEA] [GO:0008022 "protein
            C-terminus binding" evidence=IEA] [GO:0008104 "protein
            localization" evidence=IEA] [GO:0010457 "centriole-centriole
            cohesion" evidence=IEA] [GO:0019901 "protein kinase binding"
            evidence=IEA] [GO:0030997 "regulation of centriole-centriole
            cohesion" evidence=IEA] [GO:0031616 "spindle pole centrosome"
            evidence=IEA] InterPro:IPR026048 GO:GO:0043234 GO:GO:0008104
            GO:GO:0005814 GO:GO:0000278 GeneTree:ENSGT00700000104019
            GO:GO:0030997 GO:GO:0010457 PANTHER:PTHR23159:SF1 EMBL:AADN02019503
            EMBL:AADN02019504 EMBL:AADN02019505 IPI:IPI00570644
            Ensembl:ENSGALT00000002729 Uniprot:F1NGZ3
        Length = 2417

 Score = 125 (49.1 bits), Expect = 0.00089, P = 0.00089
 Identities = 51/189 (26%), Positives = 90/189 (47%)

Query:    56 ASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQMRNLTEKI 115
             A+Q +  +K   E + L  T    + EL  A H+L+ L  ++   K ++E + +N+TEK+
Sbjct:   886 ANQEILTEK-ENEKKALLETLLQTQGELTEACHQLEQLRQEV---KEQQEYE-QNITEKL 940

Query:   116 AKMEAELKTAE-PVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQQIPALL 174
                +AEL+     +K+     K E +N+   R++L  +V +LT  L  +    Q I    
Sbjct:   941 ---QAELQETHCKIKMVENMHKEEMENIKEQRDDLQKQVEELTSQLAASEESHQAIGHKA 997

Query:   175 SELESLRQEYHHCRGTYEYEKKFYNDHLE----SLQVMEKNYITMATEVEKLRAELMNAP 230
              +  S  QE    +   E E++  +  LE    SL+ +E+N +    EV KL + +  A 
Sbjct:   998 QQELSEAQELSRQKAL-ESERERLSLSLEQKELSLKTLEENNLVQQNEVSKLHSAIQQAQ 1056

Query:   231 NV--DRRAD 237
              +  D R +
Sbjct:  1057 QLHSDHRRE 1065


>WB|WBGene00000653
            symbol:col-77 species:6239 "Caenorhabditis elegans"
            [GO:0042302 "structural constituent of cuticle" evidence=IEA]
            [GO:0016021 "integral to membrane" evidence=IEA] InterPro:IPR002486
            Pfam:PF01484 SMART:SM01088 InterPro:IPR008160 Pfam:PF01391
            EMBL:Z66498 GO:GO:0042302 HOGENOM:HOG000085656
            GeneTree:ENSGT00610000086159 PIR:T23801 RefSeq:NP_495759.1
            ProteinModelPortal:Q21562 DIP:DIP-26119N MINT:MINT-1050309
            STRING:Q21562 EnsemblMetazoa:M195.1 GeneID:174336
            KEGG:cel:CELE_M195.1 UCSC:M195.1 CTD:174336 WormBase:M195.1
            eggNOG:NOG315089 InParanoid:Q21562 OMA:IAFFGIC NextBio:883606
            Uniprot:Q21562
        Length = 304

 Score = 114 (45.2 bits), Expect = 0.00090, P = 0.00090
 Identities = 71/238 (29%), Positives = 87/238 (36%)

Query:   264 GYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG-PGYEASKGPG 322
             GYG P  +    + +  G    G   S  +  A   GTP     D   G PG +   G  
Sbjct:    85 GYGAPAEYSTDAAVSAGGSEAGGQCCSCGSGPAGPPGTPGEDGRDGNDGQPGPDGQPGSD 144

Query:   323 YDASKAPSYDPTKGPSYD-PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDP 381
               A   P+ D      +D PA  PG     GP     KG+  +A   P  D   G    P
Sbjct:   145 APAEAIPTADDF---CFDCPAGPPGPAGNAGP-----KGAPGNAG-APGNDGQAGAPGAP 195

Query:   382 QRGLGYDMQRGP-NYDMQRG-PGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQR 439
                 G D  +GP   D   G PG + Q  PG  V+   V      P   PQ  PG D Q 
Sbjct:   196 ----GNDGPQGPPGQDGAAGQPGPDGQ--PGV-VEEVAVPAGPPGPPG-PQGAPGTDGQP 247

Query:   440 GQ-GYDMRRAPSYDPSRGTGFDGAP--RGAA-PHGQVPPPLNNVPYGSATPPARSGSG 493
             G  G   +  P   P+   G DGAP   GAA   G+   P          PP R+  G
Sbjct:   248 GSAGQPGQDGPQ-GPAGDAGTDGAPGQAGAAGEQGEAGQPGEGGGCDHCPPP-RTAPG 303


>FB|FBgn0038642
            symbol:Muc91C "Mucin 91C" species:7227 "Drosophila
            melanogaster" [GO:0005201 "extracellular matrix structural
            constituent" evidence=ISM] [GO:0031012 "extracellular matrix"
            evidence=ISM] [GO:0022008 "neurogenesis" evidence=IMP]
            EMBL:AE014297 GO:GO:0022008 eggNOG:NOG12793 GO:GO:0031012
            GO:GO:0005201 GeneTree:ENSGT00700000104744 RefSeq:NP_650744.1
            UniGene:Dm.10760 EnsemblMetazoa:FBtr0083687 GeneID:42246
            KEGG:dme:Dmel_CG7709 UCSC:CG7709-RA CTD:42246 FlyBase:FBgn0038642
            InParanoid:Q9VE45 OMA:GPYPSAP PhylomeDB:Q9VE45 GenomeRNAi:42246
            NextBio:827869 ArrayExpress:Q9VE45 Bgee:Q9VE45 Uniprot:Q9VE45
        Length = 950

 Score = 129 (50.5 bits), Expect = 0.00091, Sum P(2) = 0.00091
 Identities = 72/281 (25%), Positives = 98/281 (34%)

Query:   234 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVP--QGHGPPPSATTAGVVGAGPNTST 291
             RR   SYG       +++ G P   + Y      P  Q +G P  A  +   G   +  +
Sbjct:   222 RRPSSSYGAPRPAPPSQSYGAPPSAS-YGPPKSAPPSQSYGAP--APPSSKYGPPKSAPS 278

Query:   292 SAYAATQSGTPMRAAYDIPRGPG--YEASKGPG--YDASKAPS--YDPTKGPSYDPAKGP 345
             S+Y A +   P  ++Y  P  P   Y A   P   Y A  APS  Y     PS   + G 
Sbjct:   279 SSYGAPRPAAPS-SSYGAPAPPSSSYGAPAAPSSSYGAPAAPSSSYGAPAAPS--SSYGA 335

Query:   346 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQR--GPGY 403
                P+K  G  A   S+Y A   P+     G    P    G       +Y       P Y
Sbjct:   336 PAPPSKSYGAPAPPSSSYGAPAAPSKSY--GAPAPPSSSYGAPAPPSSSYGAPAPPSPSY 393

Query:   404 ETQRVPGYDVQR-GPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGA 462
                  P        P   +  AP+  P +  G        Y    AP+  PS   G   A
Sbjct:   394 GAPAPPSKSYGAPAPPSSSYGAPA-APSKSYGAPAPPSSSYG---APA-PPSSSYGAPSA 448

Query:   463 PRGA-APHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 502
             P  +  P    P P ++  YG A P A   S  P    P++
Sbjct:   449 PSSSYGPPKPAPAPPSS-SYG-APPQAPVSSYLPPASRPSK 487

 Score = 38 (18.4 bits), Expect = 0.00091, Sum P(2) = 0.00091
 Identities = 8/19 (42%), Positives = 9/19 (47%)

Query:    28 VSGMRPPMPGAFPPFDMMP 46
             VS   PP  G  P F+  P
Sbjct:   142 VSSYLPPASGPAPSFNSAP 160


>UNIPROTKB|E2RQK9
            symbol:PYGO2 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0060070 "canonical Wnt receptor signaling
            pathway" evidence=IEA] [GO:0060021 "palate development"
            evidence=IEA] [GO:0051569 "regulation of histone H3-K4 methylation"
            evidence=IEA] [GO:0048589 "developmental growth" evidence=IEA]
            [GO:0042393 "histone binding" evidence=IEA] [GO:0033599 "regulation
            of mammary gland epithelial cell proliferation" evidence=IEA]
            [GO:0030879 "mammary gland development" evidence=IEA] [GO:0009791
            "post-embryonic development" evidence=IEA] [GO:0007420 "brain
            development" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            [GO:0002088 "lens development in camera-type eye" evidence=IEA]
            [GO:0001822 "kidney development" evidence=IEA] [GO:0001701 "in
            utero embryonic development" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] InterPro:IPR001965 InterPro:IPR019787
            Pfam:PF00628 PROSITE:PS50016 SMART:SM00249 GO:GO:0005634
            GO:GO:0007420 GO:GO:0046872 GO:GO:0008270 GO:GO:0001701
            GO:GO:0009791 GO:GO:0001822 Gene3D:3.30.40.10 InterPro:IPR011011
            InterPro:IPR013083 SUPFAM:SSF57903 GO:GO:0048589 InterPro:IPR019786
            PROSITE:PS01359 GO:GO:0060021 GO:GO:0060070 GO:GO:0030879
            GO:GO:0033599 GO:GO:0051569 GO:GO:0002088
            GeneTree:ENSGT00530000063948 CTD:90780 OMA:PGLVYPC
            EMBL:AAEX03005346 RefSeq:XP_547562.2 Ensembl:ENSCAFT00000027172
            GeneID:490440 KEGG:cfa:490440 NextBio:20863469 Uniprot:E2RQK9
        Length = 405

 Score = 116 (45.9 bits), Expect = 0.00091, P = 0.00091
 Identities = 80/294 (27%), Positives = 106/294 (36%)

Query:   227 MNAPNVDRRADGSYGGATGN-SENETSGRP-----VGQNAYEDGYGVPQGHGPPPSATTA 280
             M +P   RR   + G A  + +E      P     V  N +ED +G P+  G  P    +
Sbjct:    38 MKSPEKKRRKSNTQGPAYSHLTEFAPPPTPMVDHLVASNPFEDDFGAPKVGGAAPPFLGS 97

Query:   281 GVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTK-GPSY 339
              V   G           Q G     A  +P G G     GP     + P + P   GP++
Sbjct:    98 PVPFGG--------FRVQGGM----AGQVPPGYGTGGGGGPQPLRRQPPPFPPNPMGPAF 145

Query:   340 D-PAKGPGYDPTKGPGYDAQK-----GSNYDAQRG---PNYDIHRGPSYDPQRGLGYDMQ 390
             + P +GPGY P     + +Q      G N+    G   P      GP   P  G     +
Sbjct:   146 NMPPQGPGYPPPGNMNFPSQPFNQPLGQNFSPPGGQMMPGPVGGFGPMISPTMGQPPRGE 205

Query:   391 RGPNYDMQRGPGYETQRVP-GYDVQRGPVYEAQRAPSYIPQRGP--GYDLQ-RGQGYDMR 446
              GP+   QR   +     P G  +QR P    Q  PS  P   P  G D    G G +  
Sbjct:   206 LGPHSLPQR---FAQPGAPFGPSLQR-P---GQGLPSLPPNTSPFPGPDPGFPGPGGEDG 258

Query:   447 RAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNP 500
               P  +P   T F   P   +P   V     N P   + PP  SG G   GG P
Sbjct:   259 GKP-LNPPAPTAFPQEPHSGSPAAAVN---GNQP---SFPPNSSGRG---GGTP 302


>ZFIN|ZDB-GENE-060526-207
            symbol:specc1 "sperm antigen with calponin homology
            and coiled-coil domains 1" species:7955 "Danio rerio" [GO:0060325
            "face morphogenesis" evidence=IMP;IDA] InterPro:IPR001715
            Pfam:PF00307 PROSITE:PS50021 SMART:SM00033 ZFIN:ZDB-GENE-060526-207
            eggNOG:COG5069 Gene3D:1.10.418.10 SUPFAM:SSF47576 GO:GO:0060325
            GeneTree:ENSGT00530000062761 HOVERGEN:HBG056096 OMA:VEKDYSY
            EMBL:AL928675 IPI:IPI00486418 UniGene:Dr.160202 UniGene:Dr.83172
            Ensembl:ENSDART00000137052 InParanoid:A2ASQ4 NextBio:20884360
            Uniprot:A2ASQ4
        Length = 1035

 Score = 121 (47.7 bits), Expect = 0.00092, P = 0.00092
 Identities = 56/213 (26%), Positives = 99/213 (46%)

Query:    53 QKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQMRNLT 112
             Q++A Q   +Q+L  EN+RLA   G L+  L   +  +++L  Q        E  ++ L 
Sbjct:   360 QELADQQQVVQELTAENERLAEEKGLLQTSLQQQRERVELLAQQ-------NETLLQRLR 412

Query:   113 EKIAKMEAELKTAEPV-KLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQQIP 171
             E+    EAE   A  + +LE Q+   + ++    RE+L+    QLT  L+    + Q+  
Sbjct:   413 EQAQSQEAEASRASRMAELE-QRLAEQVESSRFEREKLVDIQQQLTGSLRALEKENQEAQ 471

Query:   172 ALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITM-ATEVEKLRAELMNAP 230
                + ++SLR+E    +G  E EK    D  E+++  E+  + M A  V+   A +    
Sbjct:   472 ---TAVKSLREEEGLLQGHLESEK-LARD--EAVRKTEEQRLAMEALRVDN--ASMKAQV 523

Query:   231 NVDRRADGSYGGATGNSENETSGRPVGQNAYED 263
              V+R+           S+N T  + + + A+ED
Sbjct:   524 EVERQKVAELKAVQSASDN-TELQSLLKVAHED 555


>UNIPROTKB|Q15532
            symbol:SS18 "Protein SSXT" species:9606 "Homo sapiens"
            [GO:0006351 "transcription, DNA-dependent" evidence=IEA]
            [GO:0000226 "microtubule cytoskeleton organization" evidence=IEA]
            [GO:0000902 "cell morphogenesis" evidence=IEA] [GO:0005881
            "cytoplasmic microtubule" evidence=IEA] [GO:0007243 "intracellular
            protein kinase cascade" evidence=IEA] [GO:0042493 "response to
            drug" evidence=IEA] [GO:0048013 "ephrin receptor signaling pathway"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0030374
            "ligand-dependent nuclear receptor transcription coactivator
            activity" evidence=IDA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0045944 "positive regulation of transcription from RNA
            polymerase II promoter" evidence=IDA] GO:GO:0005634 GO:GO:0000226
            GO:GO:0042493 GO:GO:0045944 GO:GO:0007243 GO:GO:0006351
            EMBL:CH471088 GO:GO:0000902 Orphanet:3273 GO:GO:0048013
            GO:GO:0005881 GO:GO:0030374 HOVERGEN:HBG003892 InterPro:IPR007726
            PANTHER:PTHR23107 Pfam:PF05030 UniGene:Hs.129261 EMBL:X79200
            EMBL:S79894 EMBL:X79201 EMBL:AF343880 EMBL:EF445031 EMBL:BC096223
            IPI:IPI00452919 IPI:IPI00940186 PIR:S46269 RefSeq:NP_001007560.1
            RefSeq:NP_005628.2 ProteinModelPortal:Q15532 IntAct:Q15532
            STRING:Q15532 PhosphoSite:Q15532 DMDM:20141795 PaxDb:Q15532
            PRIDE:Q15532 DNASU:6760 Ensembl:ENST00000269137
            Ensembl:ENST00000415083 GeneID:6760 KEGG:hsa:6760 UCSC:uc002kvm.3
            CTD:6760 GeneCards:GC18M023596 HGNC:HGNC:11340 MIM:600192
            neXtProt:NX_Q15532 PharmGKB:PA36164 eggNOG:NOG274014
            InParanoid:Q15532 KO:K15623 OrthoDB:EOG4RFKTH PhylomeDB:Q15532
            ChiTaRS:SS18 GenomeRNAi:6760 NextBio:26388 ArrayExpress:Q15532
            Bgee:Q15532 CleanEx:HS_SS18 Genevestigator:Q15532
            GermOnline:ENSG00000141380 Uniprot:Q15532
        Length = 418

 Score = 116 (45.9 bits), Expect = 0.00096, P = 0.00096
 Identities = 66/236 (27%), Positives = 88/236 (37%)

Query:   238 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQG---H--GPPPSATTAGVVGAGPNT--- 289
             G+YG     S     G  + Q      Y +PQG   H  G  P     G V  G +    
Sbjct:   188 GNYGPRPNMSMQPNQGPMMHQQPPSQQYNMPQGGGQHYQGQQPPMGMMGQVNQGNHMMGQ 247

Query:   290 -STSAYAATQSGTPMR-AAYDIPRGPGYE-ASKGP--GYDASKAPSYDPTKG---PSYDP 341
                  Y   Q G P + +  +   G  Y    +GP  G +    P      G   PSY P
Sbjct:   248 RQIPPYRPPQQGPPQQYSGQEDYYGDQYSHGGQGPPEGMNQQYYPDGHNDYGYQQPSY-P 306

Query:   342 AKGPGYD-PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 400
              +G  YD P +       +G N  +Q G   D ++GP   PQ+G     Q+ P      G
Sbjct:   307 EQG--YDRPYEDSSQHYYEGGN--SQYGQQQDAYQGPP--PQQGYPPQQQQYPGQQGYPG 360

Query:   401 P--GY-ETQRVPGYDVQRGPVYEAQRAPSYIP-QRGPGYD-LQRGQGYDMRRAPSY 451
                GY  +Q  PG      P  + Q+   Y P Q GP     QR  GYD  +  +Y
Sbjct:   361 QQQGYGPSQGGPGPQYPNYPQGQGQQYGGYRPTQPGPPQPPQQRPYGYDQGQYGNY 416


>WB|WBGene00000627
            symbol:col-50 species:6239 "Caenorhabditis elegans"
            [GO:0042302 "structural constituent of cuticle" evidence=IEA]
            [GO:0016021 "integral to membrane" evidence=IEA] InterPro:IPR002486
            Pfam:PF01484 SMART:SM01088 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0042302 HOGENOM:HOG000085656 GeneTree:ENSGT00530000064217
            EMBL:FO080999 PIR:T15142 RefSeq:NP_491194.1 UniGene:Cel.16665
            ProteinModelPortal:O01662 EnsemblMetazoa:T28F2.6 GeneID:189050
            KEGG:cel:CELE_T28F2.6 UCSC:T28F2.6 CTD:189050 WormBase:T28F2.6
            eggNOG:NOG279371 InParanoid:O01662 OMA:AGNCITC NextBio:941028
            Uniprot:O01662
        Length = 418

 Score = 116 (45.9 bits), Expect = 0.00096, P = 0.00096
 Identities = 79/285 (27%), Positives = 95/285 (33%)

Query:   230 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNT 289
             P  +  A+G+ GG       + SG P G        G     G P  A   G  G   + 
Sbjct:    96 PAKEGYAEGAGGGGGCQCAAQASGCPAGPPGPPGEAGAD---GEPGEAGQDGAAGEAGSA 152

Query:   290 STSAYAATQSGTPMRAAYDIPRGP-GYEASKGP----GYDASKAPSYDPTKGPSYDPAKG 344
              T A AA    T   A    P GP G     GP    G D   A   +P  GP+  PA  
Sbjct:   153 DTYAGAAGNCIT-CPAGPPGPPGPDGNAGPAGPAGAAGPDGEGAGYAEP--GPA-GPAGP 208

Query:   345 PGYDPTKG-PGYDAQKGSNYDAQRG-PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP- 401
             PG D   G PG D Q G+        P      GP   P    G D    P+     GP 
Sbjct:   209 PGPDGQPGAPGPDGQPGAGGTTSTNQPGPPGPAGPP-GPAGPAGEDAYAQPSPAGTPGPP 267

Query:   402 ---GYETQR-------VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSY 451
                G + +         PG D   GP  +A   P      G G   + G       A  Y
Sbjct:   268 GPPGKDGEAGPDGPAGAPGTDGAPGP--DAAYCPCPPRTLGAGAYPEGGDAAAAAPAGGY 325

Query:   452 DPSRGTGFDGAPRGAAPHGQVPPPLNNV-PYGSATPPARSGSGQP 495
             D   G   + AP  AA     P P     P G     A +G+  P
Sbjct:   326 DGGAGAAPEAAPAAAAAPQPAPAPAAAAAPAGGYQGGAAAGAAAP 370


>MGI|MGI:3040693
            symbol:Zmiz1 "zinc finger, MIZ-type containing 1"
            species:10090 "Mus musculus" [GO:0001570 "vasculogenesis"
            evidence=IMP] [GO:0001701 "in utero embryonic development"
            evidence=IMP] [GO:0003007 "heart morphogenesis" evidence=IMP]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0005737 "cytoplasm"
            evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0007296 "vitellogenesis"
            evidence=IMP] [GO:0007569 "cell aging" evidence=IDA] [GO:0008270
            "zinc ion binding" evidence=IEA] [GO:0045944 "positive regulation
            of transcription from RNA polymerase II promoter" evidence=IMP]
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0048146 "positive
            regulation of fibroblast proliferation" evidence=IMP] [GO:0048589
            "developmental growth" evidence=IMP] [GO:0048844 "artery
            morphogenesis" evidence=IMP] InterPro:IPR004181 Pfam:PF02891
            PROSITE:PS51044 MGI:MGI:3040693 GO:GO:0005737 GO:GO:0046872
            GO:GO:0016607 GO:GO:0003007 GO:GO:0008270 GO:GO:0001701
            GO:GO:0045944 GO:GO:0006351 Gene3D:3.30.40.10 InterPro:IPR013083
            GO:GO:0048589 GO:GO:0001570 GO:GO:0048146 GO:GO:0048844
            GO:GO:0007569 GO:GO:0007296 GeneTree:ENSGT00550000074410 CTD:57178
            eggNOG:NOG237400 HOGENOM:HOG000253014 HOVERGEN:HBG056252
            OMA:MNQYGPM OrthoDB:EOG45MN70 ChiTaRS:ZMIZ1 EMBL:BC057691
            EMBL:BC058646 EMBL:BC065120 EMBL:AK054366 IPI:IPI00226072
            IPI:IPI00480418 RefSeq:NP_899031.2 UniGene:Mm.227484
            UniGene:Mm.486339 UniGene:Mm.489608 ProteinModelPortal:Q6P1E1
            SMR:Q6P1E1 IntAct:Q6P1E1 STRING:Q6P1E1 PhosphoSite:Q6P1E1
            PaxDb:Q6P1E1 PRIDE:Q6P1E1 Ensembl:ENSMUST00000007961
            Ensembl:ENSMUST00000162645 GeneID:328365 KEGG:mmu:328365
            UCSC:uc007srn.1 UCSC:uc007sro.1 InParanoid:Q6P1E1 NextBio:398259
            Bgee:Q6P1E1 CleanEx:MM_ZMIZ1 Genevestigator:Q6P1E1
            GermOnline:ENSMUSG00000007817 Uniprot:Q6P1E1
        Length = 1072

 Score = 121 (47.7 bits), Expect = 0.00096, P = 0.00096
 Identities = 65/232 (28%), Positives = 84/232 (36%)

Query:   286 GPNTSTSAYAATQSGTPMRAAYDIPRGPG-YEASKGP-GYDASKAPSYDPTKGP--SYDP 341
             GP  S+     TQ+          PRGP     S  P G  A   PS     GP    + 
Sbjct:   318 GPVCSSFQMGPTQAYNSQFMNQPGPRGPASMGGSLNPAGMAAGMTPS--GMSGPPMGMNQ 375

Query:   342 AKGPGYDP--TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQR 399
              + PG  P  T G     Q       Q  P   I R    +P  G   + Q GPN     
Sbjct:   376 PRPPGISPFGTHGQRMPQQTYPGPRPQSLPIQSIKRPYPGEPNYG---NQQYGPNSQFPT 432

Query:   400 GPG-YETQRVPGYDVQRGPVYEAQRAPSYIPQRG--PGYDLQRGQGYDMRRAPSYDP--S 454
              PG Y T   P       P Y  QR PS  P  G  P   +  GQ Y   +    +   S
Sbjct:   433 QPGQYPTPNPPR--PLTSPNYPGQRMPSQ-PSTGQYPPPTVNMGQYYKPEQFNGQNNTFS 489

Query:   455 RGTGFDGAPRGAAPHGQVPPPLNNVPY----GSATPPARSGSGQPRGGNPAR 502
              G+ +    +G+      P P+ N P+    G+ TPP   GS  P   +P++
Sbjct:   490 SGSSYSSYSQGSVNRPPRPVPVANYPHSPVPGNPTPPMTPGSSIPPYLSPSQ 541


>MGI|MGI:2147661
            symbol:Vps37c "vacuolar protein sorting 37C (yeast)"
            species:10090 "Mus musculus" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005768 "endosome" evidence=IEA] [GO:0006810
            "transport" evidence=IEA] [GO:0015031 "protein transport"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] MGI:MGI:2147661
            GO:GO:0031902 GO:GO:0015031 InterPro:IPR009851 Pfam:PF07200
            PROSITE:PS51314 KO:K12185 GeneTree:ENSGT00530000063123
            HOGENOM:HOG000234744 HOVERGEN:HBG073355 CTD:55048 eggNOG:NOG311749
            OMA:VERCQEQ OrthoDB:EOG4B2SZG EMBL:AK158833 EMBL:AK159309
            EMBL:BC025865 IPI:IPI00153241 IPI:IPI00877200 RefSeq:NP_852068.1
            UniGene:Mm.19091 ProteinModelPortal:Q8R105 IntAct:Q8R105
            STRING:Q8R105 PhosphoSite:Q8R105 PaxDb:Q8R105 PRIDE:Q8R105
            Ensembl:ENSMUST00000087951 GeneID:107305 KEGG:mmu:107305
            UCSC:uc008gqr.1 UCSC:uc008gqs.1 InParanoid:Q8R105 NextBio:358674
            Bgee:Q8R105 CleanEx:MM_VPS37C Genevestigator:Q8R105 Uniprot:Q8R105
        Length = 352

 Score = 90 (36.7 bits), Expect = 0.00098, Sum P(2) = 0.00098
 Identities = 46/178 (25%), Positives = 60/178 (33%)

Query:   267 VPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPG-YDA 325
             VP    PPP                     T    P   +  +P GP  + +  P  +  
Sbjct:   170 VPPKRPPPPRPVPQATPPETEEQPPQPSVVTPYPLPYSPSPGLPVGPTAQGALQPAPFPV 229

Query:   326 SKAPS-YDPTKGPSYDPAKGP----GYDPTKGPGYDAQKG--SNYDAQRGPNYDIHRGPS 378
                PS Y    GP   P  GP    GY  +       Q G  +   +  GP Y +  G +
Sbjct:   230 VAQPSSYGGPLGPYPSPHPGPRAMVGYSWSPQRSGPPQPGYPTAPTSTSGPGYPLVGGRT 289

Query:   379 YDPQRGLGYDMQRGPNYDMQRGPGYETQ-RVPGYDVQRGPVYEAQRAPSYIPQRGPGY 435
               P    GY  Q+ P       P Y TQ ++PG+  Q  P    Q  P Y P   P Y
Sbjct:   290 PGP----GYP-QQSPYLPSGNKPPYPTQPQLPGFPGQPQPPVPPQ--PPYPPGTTPSY 340

 Score = 68 (29.0 bits), Expect = 0.00098, Sum P(2) = 0.00098
 Identities = 20/81 (24%), Positives = 42/81 (51%)

Query:    43 DMMPPPEVMEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKS 102
             +M   PE +  ++A +  E+Q L  E +   AT+ +L ++    Q  L+I    +    S
Sbjct:    14 EMQNDPEAIA-RLALESPEVQDLQLEREMALATNRSLAEQNLEFQGPLEISRSNL----S 68

Query:   103 ERELQMRNLTEKIAKMEAELK 123
             ++  ++R L E+  + +A+L+
Sbjct:    69 DKYQELRKLVERCQEQKAKLE 89
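
Each alignment header reports a raw score, a bit score, an Expect value and a P value. The following is a minimal sketch of how such a header line could be parsed and how P relates to Expect, assuming the usual Poisson relation P = 1 - exp(-E). The regular expression and the function names are illustrative and are not part of the BLAST output.

```python
import math
import re

# Pattern for header lines such as:
#  " Score = 121 (47.7 bits), Expect = 0.00096, P = 0.00096"
# Assumption: the layout matches the lines in this report.
SCORE_RE = re.compile(
    r"Score = (?P<raw>\d+) \((?P<bits>[\d.]+) bits\), "
    r"Expect = (?P<expect>[\d.eE+-]+)"
)


def parse_score_line(line):
    """Return (raw_score, bit_score, expect) from a 'Score = ...' line."""
    m = SCORE_RE.search(line)
    if m is None:
        raise ValueError("not a score line: %r" % line)
    return int(m.group("raw")), float(m.group("bits")), float(m.group("expect"))


def p_from_expect(expect):
    """P = 1 - exp(-E); expm1 keeps precision when E is small."""
    return -math.expm1(-expect)


raw, bits, expect = parse_score_line(
    " Score = 121 (47.7 bits), Expect = 0.00096, P = 0.00096"
)
print(raw, bits, expect, format(p_from_expect(expect), ".2g"))
```

For small Expect values the two numbers are nearly identical, which is why the report shows the same figure for both.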


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.311   0.132   0.392    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      503       484   0.00080  119 3  11 23  0.36    35
                                                     35  0.45    37
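
The bit scores in the alignment headers appear to follow from the raw scores and the Lambda and K values on the Q=9,R=2 row of the table above. A minimal sketch, assuming the standard Karlin-Altschul conversion bits = (Lambda * S - ln K) / ln 2. The constant and function names are illustrative only.

```python
import math

# Values copied from the Q=9,R=2 row of the parameter table.
GAPPED_LAMBDA = 0.244
GAPPED_K = 0.0300


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to bits (Karlin-Altschul normalization)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores taken from the alignment headers in this report.
for raw in (121, 90, 68):
    print(raw, round(bit_score(raw), 1))
```

Under this assumption the raw scores 121, 90 and 68 come out at 47.7, 36.7 and 29.0 bits, matching the values printed in the report. The ungapped BLOSUM62 row (Lambda 0.311, K 0.132) does not reproduce them.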


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  241
  No. of states in DFA:  586 (62 KB)
  Total size of DFA:  256 KB (2132 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  60.21u 0.15s 60.36t   Elapsed:  00:00:02
  Total cpu time:  60.28u 0.15s 60.43t   Elapsed:  00:00:02
  Start:  Sat May 11 05:25:26 2013   End:  Sat May 11 05:25:28 2013
WARNINGS ISSUED:  1
