BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>016220
MPKVGAHKLEIRCTLIFTCTLDFLFRQVYSKALHFGHPWICESSSPQYYFLHLAFQHCYC
AIFLKIWSKNAITFHLCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNAP
NVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTS
TSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPT
KGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPG
YDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHG
QVPPPLNNVPYGSATPPARSGSGQPRGGNPARR

High Scoring Gene Products

Symbol, full name Information P value
AT1G67170 protein from Arabidopsis thaliana 2.3e-30
Vml
Vitelline membrane-like
protein from Drosophila melanogaster 1.0e-23
LOC100518332
Uncharacterized protein
protein from Sus scrofa 1.8e-13
POLR2A
DNA-directed RNA polymerase II subunit RPB1
protein from Cricetulus griseus 2.2e-11
eif3a
Eukaryotic translation initiation factor 3 subunit A
protein from Xenopus (Silurana) tropicalis 8.8e-11
eif3a
Eukaryotic translation initiation factor 3 subunit A
protein from Xenopus laevis 1.6e-10
T17H7.1 gene from Caenorhabditis elegans 9.4e-10
prc
pericardin
protein from Drosophila melanogaster 4.0e-09
polr2a
polymerase (RNA) II (DNA directed) polypeptide A
gene_product from Danio rerio 6.0e-09
fhaA
FHA domain-containing protein FhaA
protein from Mycobacterium tuberculosis 8.0e-09
TAF15
TATA-binding protein-associated factor 2N
protein from Homo sapiens 8.8e-09
TAF15
Uncharacterized protein
protein from Canis lupus familiaris 1.5e-08
K02E11.10 gene from Caenorhabditis elegans 2.9e-08
cbpP
calcium-binding protein
gene from Dictyostelium discoideum 3.9e-08
CG30203 protein from Drosophila melanogaster 6.5e-08
spt-5 gene from Caenorhabditis elegans 7.1e-08
spt-5
Transcription elongation factor SPT5
protein from Caenorhabditis elegans 7.1e-08
RPO21
RNA polymerase II largest subunit B220
gene from Saccharomyces cerevisiae 8.4e-08
Krtap6-2
keratin associated protein 6-2
protein from Mus musculus 1.3e-07
let-2 gene from Caenorhabditis elegans 1.4e-07
let-2
Collagen alpha-2(IV) chain
protein from Caenorhabditis elegans 1.4e-07
arid1ab
AT rich interactive domain 1Ab (SWI-like)
gene_product from Danio rerio 1.5e-07
ama-1 gene from Caenorhabditis elegans 1.5e-07
ama-1
DNA-directed RNA polymerase II subunit RPB1
protein from Caenorhabditis elegans 1.5e-07
ZNF768
Uncharacterized protein
protein from Canis lupus familiaris 3.1e-07
CG7185 protein from Drosophila melanogaster 3.2e-07
COL4A4
Collagen alpha-4(IV) chain
protein from Homo sapiens 3.7e-07
COL4A4
Collagen alpha-4(IV) chain
protein from Homo sapiens 3.7e-07
COL1A1
Collagen alpha-1(I) chain
protein from Gallus gallus 8.6e-07
MGG_04961
Uncharacterized protein
protein from Magnaporthe oryzae 70-15 1.0e-06
swsn-1 gene from Caenorhabditis elegans 1.1e-06
AT1G33680 protein from Arabidopsis thaliana 1.4e-06
PPP1R10
Serine/threonine-protein phosphatase 1 regulatory subunit 10
protein from Homo sapiens 1.4e-06
COL4A4
Uncharacterized protein
protein from Nomascus leucogenys 1.7e-06
osa protein from Drosophila melanogaster 2.2e-06
PPP1R10
Serine/threonine-protein phosphatase 1 regulatory subunit 10
protein from Macaca mulatta 2.3e-06
PPP1R10
Serine/threonine-protein phosphatase 1 regulatory subunit 10
protein from Pan troglodytes 2.3e-06
EWSR1
RNA-binding protein EWS
protein from Homo sapiens 2.6e-06
COL3A1
Collagen alpha-1(III) chain
protein from Gallus gallus 2.6e-06
AT1G10390 protein from Arabidopsis thaliana 2.6e-06
Ldb3
LIM domain binding 3
protein from Mus musculus 2.7e-06
LDB3
LIM domain-binding protein 3
protein from Homo sapiens 2.7e-06
EGK_04858
Putative uncharacterized protein
protein from Macaca mulatta 2.8e-06
EGM_04376
Putative uncharacterized protein
protein from Macaca fascicularis 2.8e-06
AT2G25970 protein from Arabidopsis thaliana 2.9e-06
pygo2
pygopus homolog 2 (Drosophila)
gene_product from Danio rerio 3.2e-06
COL3A1
Collagen alpha-1(III) chain
protein from Bos taurus 3.4e-06
PPP1R10
Uncharacterized protein
protein from Canis lupus familiaris 3.8e-06
ewsr1b
Ewing sarcoma breakpoint region 1b
gene_product from Danio rerio 3.8e-06
fus
fusion (involved in t(12;16) in malignant liposarcoma)
gene_product from Danio rerio 4.9e-06
I3LQ53
Uncharacterized protein
protein from Sus scrofa 5.0e-06
COL3A1
Collagen alpha-1(III) chain
protein from Bos taurus 5.1e-06
COL5A1
Uncharacterized protein
protein from Canis lupus familiaris 6.2e-06
TFG
Uncharacterized protein
protein from Gallus gallus 6.3e-06
Col11a1
collagen, type XI, alpha 1
gene from Rattus norvegicus 6.4e-06
Col11a1
Collagen alpha-1(XI) chain
protein from Rattus norvegicus 6.4e-06
AT3G07030 protein from Arabidopsis thaliana 6.6e-06
MUC1
Mucin-1
protein from Bos taurus 7.0e-06
RPO21 gene_product from Candida albicans 7.8e-06
RPO21
DNA-directed RNA polymerase
protein from Candida albicans SC5314 7.8e-06
SFPQ
Uncharacterized protein
protein from Gallus gallus 8.2e-06
COL5A1
Uncharacterized protein
protein from Canis lupus familiaris 8.3e-06
Zfp768
zinc finger protein 768
protein from Mus musculus 8.8e-06
Krtap21-1
keratin associated protein 21-1
protein from Mus musculus 9.3e-06
COL4A5
Uncharacterized protein
protein from Bos taurus 9.8e-06
RpII215
RNA polymerase II 215kD subunit
protein from Drosophila melanogaster 1.1e-05
AT1G55170 protein from Arabidopsis thaliana 1.1e-05
TAF15
TATA-binding protein-associated factor 2N
protein from Homo sapiens 1.2e-05
EWSR1
Uncharacterized protein
protein from Sus scrofa 1.3e-05
E2RS29
Uncharacterized protein
protein from Canis lupus familiaris 1.3e-05
COL3A1
Uncharacterized protein
protein from Sus scrofa 1.4e-05
COL3A1
Collagen alpha-1(III) chain
protein from Gallus gallus 1.5e-05
col-51 gene from Caenorhabditis elegans 1.6e-05
FUS
RNA-binding protein FUS
protein from Bos taurus 1.6e-05
col11a1b
collagen, type XI, alpha 1b
gene_product from Danio rerio 1.8e-05
bli-1 gene from Caenorhabditis elegans 1.8e-05
COL3A1
Uncharacterized protein
protein from Canis lupus familiaris 1.8e-05
COL4A2
Collagen alpha-2(IV) chain
protein from Bos taurus 2.0e-05
POLR2A
DNA-directed RNA polymerase
protein from Canis lupus familiaris 2.2e-05
Col3a1
collagen, type III, alpha 1
protein from Mus musculus 2.3e-05
gho
ghost
protein from Drosophila melanogaster 2.3e-05
ego-2 gene from Caenorhabditis elegans 2.3e-05
COL7A1
Uncharacterized protein
protein from Sus scrofa 2.4e-05
LOC100858979
Uncharacterized protein
protein from Gallus gallus 2.4e-05
POLR2A
DNA-directed RNA polymerase
protein from Canis lupus familiaris 2.5e-05
POLR2A
DNA-directed RNA polymerase
protein from Bos taurus 2.5e-05
POLR2A
DNA-directed RNA polymerase II subunit RPB1
protein from Homo sapiens 2.5e-05
Polr2a
polymerase (RNA) II (DNA directed) polypeptide A
protein from Mus musculus 2.5e-05
Polr2a
polymerase (RNA) II (DNA directed) polypeptide A
gene from Rattus norvegicus 2.5e-05
COL5A2
Uncharacterized protein
protein from Sus scrofa 2.5e-05
AT3G14750 protein from Arabidopsis thaliana 2.7e-05
COL2A1
Uncharacterized protein
protein from Sus scrofa 2.7e-05
COL3A1
Uncharacterized protein
protein from Canis lupus familiaris 3.0e-05
COL5A2
Uncharacterized protein
protein from Bos taurus 3.0e-05
COL5A2
Uncharacterized protein
protein from Canis lupus familiaris 3.0e-05
ZAP3 protein from Drosophila melanogaster 3.0e-05

The BLAST search returned 4 gene products which did not match your query constraints. Please see the full BLAST report below for the details.

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  016220
        (393 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2033681 - symbol:AT1G67170 "AT1G67170" species...   335  2.3e-30   1
FB|FBgn0085362 - symbol:Vml "Vitelline membrane-like" spe...   286  1.0e-23   1
UNIPROTKB|F1S187 - symbol:LOC100518332 "Uncharacterized p...   201  1.8e-13   1
UNIPROTKB|P11414 - symbol:POLR2A "DNA-directed RNA polyme...   184  2.2e-11   1
UNIPROTKB|A4II09 - symbol:eif3a "Eukaryotic translation i...   186  8.8e-11   2
UNIPROTKB|A2VD00 - symbol:eif3a "Eukaryotic translation i...   184  1.6e-10   2
WB|WBGene00020550 - symbol:T17H7.1 species:6239 "Caenorha...   172  9.4e-10   1
FB|FBgn0028573 - symbol:prc "pericardin" species:7227 "Dr...   171  4.0e-09   1
ZFIN|ZDB-GENE-041008-78 - symbol:polr2a "polymerase (RNA)...   170  6.0e-09   1
UNIPROTKB|P71590 - symbol:fhaA "FHA domain-containing pro...   162  8.0e-09   1
UNIPROTKB|Q92804 - symbol:TAF15 "TATA-binding protein-ass...   160  8.8e-09   2
UNIPROTKB|F1PB61 - symbol:TAF15 "Uncharacterized protein"...   160  1.5e-08   1
WB|WBGene00044109 - symbol:K02E11.10 species:6239 "Caenor...   154  2.9e-08   1
DICTYBASE|DDB_G0277909 - symbol:cbpP "calcium-binding pro...   155  3.9e-08   1
FB|FBgn0050203 - symbol:CG30203 species:7227 "Drosophila ...   157  6.5e-08   1
WB|WBGene00005015 - symbol:spt-5 species:6239 "Caenorhabd...   158  7.1e-08   1
UNIPROTKB|Q21338 - symbol:spt-5 "Transcription elongation...   158  7.1e-08   1
SGD|S000002299 - symbol:RPO21 "RNA polymerase II largest ...   159  8.4e-08   1
MGI|MGI:1330280 - symbol:Krtap6-2 "keratin associated pro...   128  1.3e-07   1
WB|WBGene00002280 - symbol:let-2 species:6239 "Caenorhabd...   157  1.4e-07   1
UNIPROTKB|P17140 - symbol:let-2 "Collagen alpha-2(IV) cha...   157  1.4e-07   1
ZFIN|ZDB-GENE-030131-5725 - symbol:arid1ab "AT rich inter...   157  1.5e-07   2
WB|WBGene00000123 - symbol:ama-1 species:6239 "Caenorhabd...   157  1.5e-07   1
UNIPROTKB|P16356 - symbol:ama-1 "DNA-directed RNA polymer...   157  1.5e-07   1
UNIPROTKB|J9P0I3 - symbol:ZNF768 "Uncharacterized protein...   148  3.1e-07   1
FB|FBgn0035872 - symbol:CG7185 species:7227 "Drosophila m...   141  3.2e-07   2
UNIPROTKB|J3KNM7 - symbol:COL4A4 "Collagen alpha-4(IV) ch...   153  3.7e-07   1
UNIPROTKB|P53420 - symbol:COL4A4 "Collagen alpha-4(IV) ch...   153  3.7e-07   1
UNIPROTKB|D4ADB1 - symbol:D4ADB1 "Uncharacterized protein...   148  4.3e-07   1
UNIPROTKB|P02457 - symbol:COL1A1 "Collagen alpha-1(I) cha...   149  8.6e-07   1
UNIPROTKB|G4N3H5 - symbol:MGG_04961 "Uncharacterized prot...   144  1.0e-06   1
WB|WBGene00004203 - symbol:swsn-1 species:6239 "Caenorhab...   145  1.1e-06   1
TAIR|locus:2012713 - symbol:AT1G33680 "AT1G33680" species...   144  1.4e-06   1
UNIPROTKB|Q96QC0 - symbol:PPP1R10 "Serine/threonine-prote...   145  1.4e-06   1
UNIPROTKB|G1RSL2 - symbol:COL4A4 "Uncharacterized protein...   147  1.7e-06   1
FB|FBgn0261885 - symbol:osa "osa" species:7227 "Drosophil...   148  2.2e-06   1
UNIPROTKB|Q5TM61 - symbol:PPP1R10 "Serine/threonine-prote...   143  2.3e-06   1
UNIPROTKB|Q7YR38 - symbol:PPP1R10 "Serine/threonine-prote...   143  2.3e-06   1
UNIPROTKB|C9JGE3 - symbol:EWSR1 "Ewing sarcoma breakpoint...   127  2.6e-06   2
UNIPROTKB|P12105 - symbol:COL3A1 "Collagen alpha-1(III) c...   144  2.6e-06   1
TAIR|locus:2012788 - symbol:AT1G10390 "AT1G10390" species...   143  2.6e-06   1
MGI|MGI:1344412 - symbol:Ldb3 "LIM domain binding 3" spec...   141  2.7e-06   1
UNIPROTKB|O75112 - symbol:LDB3 "LIM domain-binding protei...   141  2.7e-06   1
UNIPROTKB|G7N928 - symbol:EGK_04858 "Putative uncharacter...   145  2.8e-06   1
UNIPROTKB|G7PK77 - symbol:EGM_04376 "Putative uncharacter...   145  2.8e-06   1
TAIR|locus:2043530 - symbol:AT2G25970 "AT2G25970" species...   140  2.9e-06   1
ZFIN|ZDB-GENE-050809-108 - symbol:pygo2 "pygopus homolog ...   139  3.2e-06   1
UNIPROTKB|P04258 - symbol:COL3A1 "Collagen alpha-1(III) c...   142  3.4e-06   1
UNIPROTKB|E2R2K8 - symbol:PPP1R10 "Uncharacterized protei...   141  3.8e-06   1
ZFIN|ZDB-GENE-030131-1600 - symbol:ewsr1b "Ewing sarcoma ...   142  3.8e-06   2
ZFIN|ZDB-GENE-040426-1010 - symbol:fus "fusion (involved ...   137  4.9e-06   1
UNIPROTKB|I3LQ53 - symbol:I3LQ53 "Uncharacterized protein...   137  5.0e-06   1
UNIPROTKB|F1MXS8 - symbol:COL3A1 "Collagen alpha-1(III) c...   142  5.1e-06   1
UNIPROTKB|J9P8F7 - symbol:COL5A1 "Uncharacterized protein...   141  6.2e-06   1
UNIPROTKB|E1C0T1 - symbol:TFG "Uncharacterized protein" s...   134  6.3e-06   1
UNIPROTKB|F1LLX1 - symbol:Col11a1 "Collagen alpha-1(XI) c...   142  6.4e-06   1
RGD|2372 - symbol:Col11a1 "collagen, type XI, alpha 1" sp...   142  6.4e-06   1
UNIPROTKB|P20909 - symbol:Col11a1 "Collagen alpha-1(XI) c...   142  6.4e-06   1
TAIR|locus:2077547 - symbol:AT3G07030 species:3702 "Arabi...   134  6.6e-06   1
UNIPROTKB|Q8WML4 - symbol:MUC1 "Mucin-1" species:9913 "Bo...   136  7.0e-06   1
CGD|CAL0000919 - symbol:RPO21 species:5476 "Candida albic...   141  7.8e-06   1
UNIPROTKB|Q5ACI7 - symbol:RPO21 "DNA-directed RNA polymer...   141  7.8e-06   1
UNIPROTKB|F1P555 - symbol:SFPQ "Uncharacterized protein" ...   136  8.2e-06   1
UNIPROTKB|F1PHX8 - symbol:COL5A1 "Uncharacterized protein...   141  8.3e-06   1
MGI|MGI:2384582 - symbol:Zfp768 "zinc finger protein 768"...   135  8.8e-06   1
MGI|MGI:2157767 - symbol:Krtap21-1 "keratin associated pr...   111  9.3e-06   1
UNIPROTKB|F1N474 - symbol:COL4A5 "Uncharacterized protein...   140  9.8e-06   1
FB|FBgn0003277 - symbol:RpII215 "RNA polymerase II 215kD ...   140  1.1e-05   1
TAIR|locus:2035751 - symbol:AT1G55170 "AT1G55170" species...   129  1.1e-05   1
UNIPROTKB|K7EKB2 - symbol:TAF15 "TATA-binding protein-ass...   125  1.2e-05   1
UNIPROTKB|F1RFI8 - symbol:EWSR1 "Uncharacterized protein"...   121  1.3e-05   2
UNIPROTKB|E2RS29 - symbol:E2RS29 "Uncharacterized protein...   133  1.3e-05   1
UNIPROTKB|F1RYI8 - symbol:COL3A1 "Uncharacterized protein...   138  1.4e-05   1
UNIPROTKB|F1NI73 - symbol:COL3A1 "Collagen alpha-1(III) c...   137  1.5e-05   1
WB|WBGene00000628 - symbol:col-51 species:6239 "Caenorhab...   131  1.6e-05   1
UNIPROTKB|Q28009 - symbol:FUS "RNA-binding protein FUS" s...   132  1.6e-05   1
ZFIN|ZDB-GENE-070912-607 - symbol:col11a1b "collagen, typ...   138  1.8e-05   1
WB|WBGene00000251 - symbol:bli-1 species:6239 "Caenorhabd...   135  1.8e-05   1
UNIPROTKB|J9P0L0 - symbol:COL3A1 "Uncharacterized protein...   137  1.8e-05   1
UNIPROTKB|F1N7Q7 - symbol:COL4A2 "Collagen alpha-2(IV) ch...   137  2.0e-05   1
UNIPROTKB|F1LRJ1 - symbol:Col4a3 "Protein Col4a3" species...   137  2.1e-05   1
UNIPROTKB|J9NW09 - symbol:POLR2A "DNA-directed RNA polyme...   137  2.2e-05   1
MGI|MGI:88453 - symbol:Col3a1 "collagen, type III, alpha ...   136  2.3e-05   1
FB|FBgn0262126 - symbol:gho "ghost" species:7227 "Drosoph...   135  2.3e-05   1
WB|WBGene00001215 - symbol:ego-2 species:6239 "Caenorhabd...   136  2.3e-05   1
UNIPROTKB|F1SKM1 - symbol:COL7A1 "Uncharacterized protein...   148  2.4e-05   2
UNIPROTKB|F1NRH2 - symbol:LOC100858979 "Uncharacterized p...   132  2.4e-05   1
UNIPROTKB|F1PGS0 - symbol:POLR2A "DNA-directed RNA polyme...   137  2.5e-05   1
UNIPROTKB|G3MZY8 - symbol:POLR2A "DNA-directed RNA polyme...   137  2.5e-05   1
UNIPROTKB|P24928 - symbol:POLR2A "DNA-directed RNA polyme...   137  2.5e-05   1
MGI|MGI:98086 - symbol:Polr2a "polymerase (RNA) II (DNA d...   137  2.5e-05   1
RGD|1587326 - symbol:Polr2a "polymerase (RNA) II (DNA dir...   137  2.5e-05   1
UNIPROTKB|F1RXW0 - symbol:COL5A2 "Uncharacterized protein...   135  2.5e-05   1
TAIR|locus:2089616 - symbol:AT3G14750 "AT3G14750" species...   127  2.7e-05   1
UNIPROTKB|I3LSV6 - symbol:COL2A1 "Uncharacterized protein...   135  2.7e-05   1
TAIR|locus:4010713902 - symbol:AT4G22505 species:3702 "Ar...   130  2.8e-05   1
UNIPROTKB|F1PG69 - symbol:COL3A1 "Uncharacterized protein...   135  3.0e-05   1
UNIPROTKB|F1N2Y2 - symbol:COL5A2 "Uncharacterized protein...   135  3.0e-05   1
UNIPROTKB|F1PG08 - symbol:COL5A2 "Uncharacterized protein...   135  3.0e-05   1
FB|FBgn0052685 - symbol:ZAP3 species:7227 "Drosophila mel...   136  3.0e-05   1

WARNING:  Descriptions of 139 database sequences were not reported due to the
          limiting value of parameter V = 100.


>TAIR|locus:2033681 [details] [associations]
            symbol:AT1G67170 "AT1G67170" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] EMBL:CP002684 EMBL:BT005883
            EMBL:AK228253 IPI:IPI00547288 RefSeq:NP_176888.2 UniGene:At.35681
            ProteinModelPortal:Q84TD8 SMR:Q84TD8 IntAct:Q84TD8 PRIDE:Q84TD8
            EnsemblPlants:AT1G67170.1 GeneID:843037 KEGG:ath:AT1G67170
            TAIR:At1g67170 HOGENOM:HOG000005883 InParanoid:Q84TD8 OMA:MESKGRI
            PhylomeDB:Q84TD8 ProtClustDB:CLSN2918424 Genevestigator:Q84TD8
            Uniprot:Q84TD8
        Length = 359

 Score = 335 (123.0 bits), Expect = 2.3e-30, P = 2.3e-30
 Identities = 84/176 (47%), Positives = 99/176 (56%)

Query:    74 FHLCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRRADGSYGGA 133
             +  CR TY+YEKKFYNDHLESLQ MEKNY+TMA EVEKL+A+LMN  N DRRA G YG  
Sbjct:   191 YQQCRATYDYEKKFYNDHLESLQAMEKNYMTMAREVEKLQAQLMNNANSDRRAGGPYGNN 250

Query:   134 TGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSA--Y---AATQ 188
               N+E + SG   G   YED +G PQG+ P P A  A     GPN+   A  Y     TQ
Sbjct:   251 I-NAEIDASGHQSGNGYYEDAFG-PQGYIPQPVAGNA----TGPNSVVGAAQYPYQGVTQ 304

Query:   189 SGT-PMRAAYDIPRGPGYEASKGPGYDASKAPSYDP-TKGPSYD-PAKGPGYDPTK 241
              G  P R  Y+ PRGP    S  P       P   P   GPS + P  G   +P++
Sbjct:   305 PGYFPQRPGYNFPRGP--PGSYDPTTRLPTGPYGAPFPPGPSNNTPYAGTHGNPSR 358


>FB|FBgn0085362 [details] [associations]
            symbol:Vml "Vitelline membrane-like" species:7227 "Drosophila
            melanogaster" [GO:0009950 "dorsal/ventral axis specification"
            evidence=IGI] [GO:0060388 "vitelline envelope" evidence=IDA]
            [GO:0007305 "vitelline membrane formation involved in
            chorion-containing eggshell formation" evidence=ISM] [GO:0008316
            "structural constituent of vitelline membrane" evidence=ISM]
            [GO:0035805 "egg coat" evidence=ISM] EMBL:AE014298 GO:GO:0009950
            GeneTree:ENSGT00700000104744 PROSITE:PS51137 GO:GO:0060388
            InterPro:IPR013135 RefSeq:NP_001096866.1 UniGene:Dm.32785
            STRING:A8JUV4 EnsemblMetazoa:FBtr0112535 GeneID:5740271
            KEGG:dme:Dmel_CG34333 UCSC:CG34333-RA CTD:5740271
            FlyBase:FBgn0085362 eggNOG:NOG284187 InParanoid:A8JUV4 OMA:ISKYETI
            OrthoDB:EOG4KPRTT GenomeRNAi:5740271 NextBio:20891311 Bgee:A8JUV4
            Uniprot:A8JUV4
        Length = 578

 Score = 286 (105.7 bits), Expect = 1.0e-23, P = 1.0e-23
 Identities = 83/283 (29%), Positives = 99/283 (34%)

Query:   119 APNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQG---HGPPPSATTAGVVGA 175
             AP+    A  SY      S +  S  P         Y  P     H P   A++     A
Sbjct:   198 APSYSAPAAPSYSAPAAPSYSAPSA-PSYSAQKTSSYSAPAAPSYHAPAAPASSYSAP-A 255

Query:   176 GPNTSTSA---YAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPA 232
             GP+ S  A   Y+A     P  ++Y   + P Y A   P Y A  APSY  +  PSY   
Sbjct:   256 GPSYSAPAAPSYSAPSYSAPA-SSYSALKAPSYSAPAAPSYSAPAAPSYSSSASPSYSSP 314

Query:   233 KGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPG 292
                 Y     P Y A K  +Y A   P+Y     PSY       Y     P+Y     P 
Sbjct:   315 ASSSYSAPAAPTYSAPKAQSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPS 374

Query:   293 YETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGA 352
             Y     P Y       Y A  APSY     P Y       Y    APSY       +  A
Sbjct:   375 YSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYS-A 433

Query:   353 PRGAAPHGQVPP-PLNNVPYGSATPPARS---GSGQPRGGNPA 391
             P  AAP    P  P  + P  S    AR+   GS  P  G  A
Sbjct:   434 P--AAPSYSAPAAPSYSAPASSGYSAARAYSAGSAAPASGYSA 474

 Score = 274 (101.5 bits), Expect = 8.8e-22, P = 8.8e-22
 Identities = 80/271 (29%), Positives = 97/271 (35%)

Query:   133 ATGNSENETSGRPVGQNAYEDGYG--VP-QGHGPP------PSATTAGVVG-AGPNTSTS 182
             AT N E +  G P  +  YE+ +   +P Q + PP       S + A   G + P     
Sbjct:    24 ATRNEEFD-DGFPESEFDYEERHTREIPAQAYAPPIVYNSQSSYSPAKDQGYSAPAAPVY 82

Query:   183 AYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG 242
             + AA     P   +Y  P  P Y A   P Y A  APSY     PSY       Y     
Sbjct:    83 SPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPASSSYSAPAA 142

Query:   243 PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYD 302
             P Y A    +Y A   P+Y      SY       Y     P+Y     P Y     P Y 
Sbjct:   143 PSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYS 202

Query:   303 VQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYD-PSRGTGFDGAPRGAAPHGQ 361
                 P Y A  APSY     P Y  Q+   Y    APSY  P+       AP G  P   
Sbjct:   203 APAAPSYSAPAAPSYSAPSAPSYSAQKTSSYSAPAAPSYHAPAAPASSYSAPAG--PSYS 260

Query:   362 VPP-PLNNVPYGSATPPARSGSGQPRGGNPA 391
              P  P  + P  SA   + S    P    PA
Sbjct:   261 APAAPSYSAPSYSAPASSYSALKAPSYSAPA 291

 Score = 262 (97.3 bits), Expect = 3.1e-20, P = 3.1e-20
 Identities = 69/246 (28%), Positives = 83/246 (33%)

Query:   155 YGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGY 213
             Y  P  +    S + A   G + P     + AA     P   +Y  P  P Y A   P Y
Sbjct:    54 YAPPIVYNSQSSYSPAKDQGYSAPAAPVYSPAAPSYSAPAAPSYSAPAAPSYSAPAAPSY 113

Query:   214 DASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQR 273
              A  APSY     PSY       Y     P Y A    +Y A   P+Y      SY    
Sbjct:   114 SAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPAAPSYSAPASSSYSAPA 173

Query:   274 GLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGY 333
                Y     P+Y     P Y     P Y     P Y A  APSY     P Y  Q+   Y
Sbjct:   174 APSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPSAPSYSAQKTSSY 233

Query:   334 DMRRAPSYD-PSRGTGFDGAPRGAAPHGQVPP----PLNNVP---YGSATPPARSGSGQP 385
                 APSY  P+       AP G +      P    P  + P   Y +   P+ S    P
Sbjct:   234 SAPAAPSYHAPAAPASSYSAPAGPSYSAPAAPSYSAPSYSAPASSYSALKAPSYSAPAAP 293

Query:   386 RGGNPA 391
                 PA
Sbjct:   294 SYSAPA 299

 Score = 259 (96.2 bits), Expect = 7.3e-20, P = 7.3e-20
 Identities = 66/241 (27%), Positives = 84/241 (34%)

Query:   155 YGVPQG--HGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPG 212
             Y  P G  +  P + + +    + P +S SA  A     P   +Y  P  P Y +S  P 
Sbjct:   251 YSAPAGPSYSAPAAPSYSAPSYSAPASSYSALKAPSYSAPAAPSYSAPAAPSYSSSASPS 310

Query:   213 YD--------ASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIH 264
             Y         A  AP+Y   K  SY     P Y     P Y A   S+Y A   P+Y   
Sbjct:   311 YSSPASSSYSAPAAPTYSAPKAQSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAP 370

Query:   265 RGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPG 324
               PSY       Y      +Y     P Y     P Y       Y A  APSY     P 
Sbjct:   371 AAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPS 430

Query:   325 YDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQ 384
             Y       Y    APSY     +G+  A R  +     P    + P  S+   A + SG 
Sbjct:   431 YSAPAAPSYSAPAAPSYSAPASSGYSAA-RAYSAGSAAPASGYSAPKTSSGYSAPASSGS 489

Query:   385 P 385
             P
Sbjct:   490 P 490

 Score = 254 (94.5 bits), Expect = 3.0e-19, P = 3.0e-19
 Identities = 73/277 (26%), Positives = 91/277 (32%)

Query:   119 APNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPN 178
             AP+    A  SY      S +  +       A    Y  P         T++    A P+
Sbjct:   182 APSYSAPAAPSYSAPAAPSYSAPAAPSYSAPA-APSYSAPSAPSYSAQKTSSYSAPAAPS 240

Query:   179 TSTSAYAATQSGTPMRAAYDIPRGPGYEASK--GPG--YDASKAPSYDPTKGPSYDPAKG 234
                 A  A+    P   +Y  P  P Y A     P   Y A KAPSY     PSY     
Sbjct:   241 YHAPAAPASSYSAPAGPSYSAPAAPSYSAPSYSAPASSYSALKAPSYSAPAAPSYSAPAA 300

Query:   235 PGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE 294
             P Y  +  P Y +   S+Y A   P Y   +  SY       Y     P+Y       Y 
Sbjct:   301 PSYSSSASPSYSSPASSSYSAPAAPTYSAPKAQSYSAPAAPSYSAPAAPSYSAPASSSYS 360

Query:   295 TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPR 354
                 P Y     P Y A  APSY       Y       Y    APSY     + +  AP 
Sbjct:   361 APAAPSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPASSSYS-AP- 418

Query:   355 GAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPA 391
              AAP    P   +   Y +   P+ S    P    PA
Sbjct:   419 -AAPSYSAPAAPS---YSAPAAPSYSAPAAPSYSAPA 451

 Score = 220 (82.5 bits), Expect = 2.8e-15, P = 2.8e-15
 Identities = 80/278 (28%), Positives = 94/278 (33%)

Query:   117 MNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAG 176
             + AP+    A  SY      S + +S  P   +     Y  P    P  SA  A    A 
Sbjct:   282 LKAPSYSAPAAPSYSAPAAPSYS-SSASPSYSSPASSSYSAPAA--PTYSAPKAQSYSAP 338

Query:   177 PNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG 236
                S SA AA     P  ++Y  P  P Y A   P Y A  APSY      SY     P 
Sbjct:   339 AAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAPS 398

Query:   237 YDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQ 296
             Y     P Y A   S+Y A   P+Y     PSY       Y     P+Y      GY   
Sbjct:   399 YSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPASSGYSAA 458

Query:   297 RVPGYDVQRGPVYEAQRAPSY-IPQRGPGYDLQRGQGYDMRRAPSYD-PSRGTGFDG--A 352
             R   Y    G    A  A  Y  P+   GY      G     A SY  P+  T   G  A
Sbjct:   459 RA--YSA--G---SAAPASGYSAPKTSSGYSAPASSGSPA--ASSYSAPASSTASSGYSA 509

Query:   353 P--------RGAAPHGQVPPPLNNVPYGSATPPARSGS 382
             P        R    H  +        YGSA P A  G+
Sbjct:   510 PASKSSGYARSEMDHQILGMARTAGGYGSAAPSAAYGA 547


>UNIPROTKB|F1S187 [details] [associations]
            symbol:LOC100518332 "Uncharacterized protein" species:9823
            "Sus scrofa" [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0000166 "nucleotide binding" evidence=IEA] InterPro:IPR000504
            InterPro:IPR001876 InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641
            PROSITE:PS01358 PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360
            SMART:SM00547 GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330
            GO:GO:0003676 GO:GO:0005622 GeneTree:ENSGT00530000063105
            EMBL:CU896616 Ensembl:ENSSSCT00000019273 OMA:TESSSGX Uniprot:F1S187
        Length = 406

 Score = 201 (75.8 bits), Expect = 1.8e-13, P = 1.8e-13
 Identities = 69/221 (31%), Positives = 84/221 (38%)

Query:   120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNT 179
             P   R + G + G     E    GR  G+     GYG  +  G      + G  G G + 
Sbjct:   187 PEDSRPSGGDFRGRGYGGERGYRGRG-GRGGDRGGYGGDRSGGGYGGDRSGGG-GYGGDR 244

Query:   180 STSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDP 239
             S   Y   +SG      Y   RG GY   +G GY   +   Y   +   Y   +G GY  
Sbjct:   245 SGGGYGGDRSG----GGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRSGGYGGDRGGGYGG 300

Query:   240 TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQR-GPNY--DMQRGPGYETQ 296
              +G GY   +G  Y   RG  Y   RG  Y   RG GY   R G  Y  D   G GY   
Sbjct:   301 DRGGGYGGDRGGGYGGDRGGGYGGDRG-GYGGDRG-GYGGDRSGGGYGGDRGSGSGYGGD 358

Query:   297 RVPGYDVQR-GPVYEAQRAPSYIPQRGPGYDLQRGQGYDMR 336
             R  GY   R G  Y   R+  Y   RG GY  + G   D R
Sbjct:   359 RSGGYGGDRSGGGYGGDRSGGYGGDRG-GYGGKMGGRNDYR 398

 Score = 190 (71.9 bits), Expect = 3.2e-12, P = 3.2e-12
 Identities = 71/225 (31%), Positives = 84/225 (37%)

Query:   136 NSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNTSTSAYAATQSGTPMR 194
             N       RP G +    GYG  +G+ G        G  G G + S   Y   +SG    
Sbjct:   183 NEPRPEDSRPSGGDFRGRGYGGERGYRGRGGRGGDRG--GYGGDRSGGGYGGDRSGG--- 237

Query:   195 AAYDIPR-GPGYEASK-GPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSN 252
               Y   R G GY   + G GY   +   Y   +G  Y   +G GY   +  GY   +G  
Sbjct:   238 GGYGGDRSGGGYGGDRSGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRSGGYGGDRGGG 297

Query:   253 YDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPG-YDVQRGPVYEA 311
             Y   RG  Y   RG  Y   RG GY   RG  Y   RG GY   R  G Y   RG     
Sbjct:   298 YGGDRGGGYGGDRGGGYGGDRGGGYGGDRG-GYGGDRG-GYGGDRSGGGYGGDRGS---- 351

Query:   312 QRAPSYIPQRGPGYDLQR-GQGYDMRRAPSYDPSRGTGFDGAPRG 355
                  Y   R  GY   R G GY   R+  Y   RG G+ G   G
Sbjct:   352 --GSGYGGDRSGGYGGDRSGGGYGGDRSGGYGGDRG-GYGGKMGG 393


>UNIPROTKB|P11414 [details] [associations]
            symbol:POLR2A "DNA-directed RNA polymerase II subunit RPB1"
            species:10029 "Cricetulus griseus" [GO:0005634 "nucleus"
            evidence=ISS] [GO:0005665 "DNA-directed RNA polymerase II, core
            complex" evidence=ISS] [GO:0006366 "transcription from RNA
            polymerase II promoter" evidence=ISS] [GO:0006468 "protein
            phosphorylation" evidence=ISS] [GO:0004672 "protein kinase
            activity" evidence=ISS] InterPro:IPR000684 Pfam:PF05001
            PROSITE:PS00115 GO:GO:0003677 GO:GO:0006468 GO:GO:0006366
            GO:GO:0003899 GO:GO:0005665 EMBL:M19538 PIR:A27677
            ProteinModelPortal:P11414 Uniprot:P11414
        Length = 467

 Score = 184 (69.8 bits), Expect = 2.2e-11, P = 2.2e-11
 Identities = 77/263 (29%), Positives = 101/263 (38%)

Query:   132 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTS---TSAYAATQ 188
             GA G S           +A  D  G   G+ P  S T       GP++    +   A + 
Sbjct:    29 GAAGRSGMTPGAAGFSPSAASDASGFSPGYSPAWSPTPGSPGSPGPSSPYIPSPGGAMSP 88

Query:   189 SGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQ 248
             S +P   AY+ PR PG    + P Y  + +PSY PT  PSY P   P Y PT  P Y   
Sbjct:    89 SYSPTSPAYE-PRSPGGYTPQSPSYSPT-SPSYSPTS-PSYSPTS-PNYSPTS-PSYSPT 143

Query:   249 KGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPV 308
               S Y +   P+Y     PSY P     Y     P+Y     P Y     P Y     P 
Sbjct:   144 SPS-Y-SPTSPSYS-PTSPSYSPTSP-SYS-PTSPSYS-PTSPSYSPTS-PSYS-PTSPS 195

Query:   309 YEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGA--APHGQVPPPL 366
             Y +  +PSY P   P Y       Y    +PSY P+  +    +P  +  +P+     P 
Sbjct:   196 Y-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPTSPSYSPTSPSYSPTSPNYSPTSP- 250

Query:   367 NNVPYGSATPPARSGSGQPRGGN 389
             N  P   +  P  S S  P   N
Sbjct:   251 NYTPTSPSYSPT-SPSYSPTSPN 272

 Score = 165 (63.1 bits), Expect = 3.0e-09, P = 3.0e-09
 Identities = 69/236 (29%), Positives = 93/236 (39%)

Query:   118 NAPNVDRRA-DGSYGGATG---NSENETSGRPVGQN-AYEDGYGVPQGHGP--PPSATTA 170
             N P +      G   GA G   ++ ++ SG   G + A+    G P   GP  P   +  
Sbjct:    24 NIPGLGAAGRSGMTPGAAGFSPSAASDASGFSPGYSPAWSPTPGSPGSPGPSSPYIPSPG 83

Query:   171 GVVGAGPNTSTSAYAATQSG--TPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPS 228
             G +    + ++ AY     G  TP   +Y  P  P Y  +  P Y  + +P+Y PT  PS
Sbjct:    84 GAMSPSYSPTSPAYEPRSPGGYTPQSPSYS-PTSPSYSPTS-PSYSPT-SPNYSPTS-PS 139

Query:   229 YDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ 288
             Y P   P Y PT  P Y     S Y +   P+Y     PSY P     Y     P+Y   
Sbjct:   140 YSPTS-PSYSPTS-PSYSPTSPS-Y-SPTSPSYS-PTSPSYSPTSP-SYS-PTSPSYS-P 191

Query:   289 RGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPS 344
               P Y     P Y     P Y +  +PSY P   P Y       Y    +PSY P+
Sbjct:   192 TSPSYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPT 241

 Score = 121 (47.7 bits), Expect = 0.00023, P = 0.00023
 Identities = 63/225 (28%), Positives = 80/225 (35%)

Query:   163 PPPSATTAGVVGAGPNTSTSAYAATQSG-TPMRAAYDIPRGPGYEASKGPGYDAS----- 216
             P  S T+       PN     Y  T    +P   +Y  P  P Y  +  P Y  S     
Sbjct:   257 PSYSPTSPSYSPTSPN-----YTPTSPNYSPTSPSYS-PTSPSYSPTS-PSYSPSSPRYT 309

Query:   217 -KAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL 275
              ++P+Y P+  PSY P+  P Y PT  P Y     S Y     P Y     P Y P    
Sbjct:   310 PQSPTYTPSS-PSYSPSS-PSYSPTS-PKYTPTSPS-YSPS-SPEYT-PTSPKYSPTSPK 363

Query:   276 GYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDM 335
              Y     P Y     P Y +   P Y     P Y +  +P Y P   P Y       Y  
Sbjct:   364 -YS-PTSPKYS-PTSPTY-SPTTPKYS-PTSPTY-SPTSPVYTPT-SPKYS-PTSPTYSP 415

Query:   336 RRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARS 380
               +P Y P+  T    +P+G+      P      P  S T PA S
Sbjct:   416 T-SPKYSPTSPTYSPTSPKGSTYSPTSPGYSPTSPTYSLTSPAIS 459


>UNIPROTKB|A4II09 [details] [associations]
            symbol:eif3a "Eukaryotic translation initiation factor 3
            subunit A" species:8364 "Xenopus (Silurana) tropicalis" [GO:0001732
            "formation of translation initiation complex" evidence=ISS]
            [GO:0005852 "eukaryotic translation initiation factor 3 complex"
            evidence=ISS] [GO:0003743 "translation initiation factor activity"
            evidence=ISS] InterPro:IPR000717 Pfam:PF01399 SMART:SM00088
            GO:GO:0003743 GO:GO:0005852 eggNOG:NOG236708 HOGENOM:HOG000246822
            KO:K03254 HAMAP:MF_03000 HOVERGEN:HBG006128 GO:GO:0001732 CTD:8661
            EMBL:BC135790 RefSeq:NP_001096173.1 UniGene:Str.55518 STRING:A4II09
            PRIDE:A4II09 GeneID:100124719 KEGG:xtr:100124719
            Xenbase:XB-GENE-994394 Uniprot:A4II09
        Length = 1391

 Score = 186 (70.5 bits), Expect = 8.8e-11, Sum P(2) = 8.8e-11
 Identities = 68/224 (30%), Positives = 101/224 (45%)

Query:   156 GVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGP--GYEASKGP-- 211
             G+ +  GP      AG    G           +     R  +D  RGP  G++  +GP  
Sbjct:   981 GLEEDRGPRRGIDDAGP-RRGFEEDRGPRRGIEDDRAPRRGFDDDRGPRRGFDDDRGPRR 1039

Query:   212 GYDASKAP--SYDPTKGP--SYDPAKGP--GYDPTKGP--GYDAQKGSN--YDAQRGPN- 260
             G+D  + P    D  +GP   +D  + P  G+D  +GP  G+D  +G    +D  RGP  
Sbjct:  1040 GFDEDRGPRRGIDDDRGPRRGFDEDRTPRRGFDDDRGPRRGFDDDRGPRRGFDEDRGPRR 1099

Query:   261 -YDIHRGP--SYDPQRG--LGYDMQRGPN--YDMQRGP--GYETQRVP--GYDVQRGPVY 309
              ++  RGP   ++  RG   G++  RGP   ++  RGP  G+E  R P  G+D  RGP  
Sbjct:  1100 GFEDDRGPRRGFEDDRGPRRGFEDDRGPRRGFEDDRGPRRGFEDDRGPRRGFDEDRGP-- 1157

Query:   310 EAQRAPSYIPQRGP--GYDLQRG--QGYDMRRAP--SYDPSRGT 347
               +R   +   RGP  G+D  R   +G+D  R P    D  RG+
Sbjct:  1158 --RRG--FEDDRGPRRGFDEDRTPRRGFDDDRGPRRGLDEDRGS 1197

 Score = 183 (69.5 bits), Expect = 1.9e-10, Sum P(2) = 1.9e-10
 Identities = 65/191 (34%), Positives = 92/191 (48%)

Query:   194 RAAYDIPRGP--GYEASKGP--GYDASKAP--SYDPTKGP--SYDPAKGP--GYDPTKGP 243
             R  ++  RGP  G E  + P  G+D  + P   +D  +GP   +D  +GP  G D  +GP
Sbjct:   998 RRGFEEDRGPRRGIEDDRAPRRGFDDDRGPRRGFDDDRGPRRGFDEDRGPRRGIDDDRGP 1057

Query:   244 GYDAQKGSNYDAQRGPN--YDIHRGP--SYDPQRG--LGYDMQRGPN--YDMQRGP--GY 293
                 ++G  +D  R P   +D  RGP   +D  RG   G+D  RGP   ++  RGP  G+
Sbjct:  1058 ----RRG--FDEDRTPRRGFDDDRGPRRGFDDDRGPRRGFDEDRGPRRGFEDDRGPRRGF 1111

Query:   294 ETQRVP--GYDVQRGPV--YEAQRAP--SYIPQRGP--GYDLQRG--QGYDMRRAPSYDP 343
             E  R P  G++  RGP   +E  R P   +   RGP  G+D  RG  +G++  R P    
Sbjct:  1112 EDDRGPRRGFEDDRGPRRGFEDDRGPRRGFEDDRGPRRGFDEDRGPRRGFEDDRGPR--- 1168

Query:   344 SRGTGFDGAPR 354
              RG   D  PR
Sbjct:  1169 -RGFDEDRTPR 1178

 Score = 167 (63.8 bits), Expect = 1.2e-08, Sum P(2) = 1.2e-08
 Identities = 71/225 (31%), Positives = 103/225 (45%)

Query:   200 PRGPGYEASKGP--GYDASKAP--SYDPTKGP--SYDPAKGP--GYDPTKGP--GYDAQK 249
             PR  G++  + P  G+D  + P   +D  +GP   +D  +GP  G++  +GP  G++  +
Sbjct:  1057 PRR-GFDEDRTPRRGFDDDRGPRRGFDDDRGPRRGFDEDRGPRRGFEDDRGPRRGFEDDR 1115

Query:   250 GSN--YDAQRGPN--YDIHRGP--SYDPQRG--LGYDMQRGPN--YDMQRGP--GYETQR 297
             G    ++  RGP   ++  RGP   ++  RG   G+D  RGP   ++  RGP  G++  R
Sbjct:  1116 GPRRGFEDDRGPRRGFEDDRGPRRGFEDDRGPRRGFDEDRGPRRGFEDDRGPRRGFDEDR 1175

Query:   298 VP--GYDVQRGPV--YEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 353
              P  G+D  RGP    +  R  S+   RG G D+ R +G D  R P     RG   D  P
Sbjct:  1176 TPRRGFDDDRGPRRGLDEDRG-SW---RG-GDDVPR-RGADDDRGPR----RGADDDRGP 1225

Query:   354 RGAAPHGQVP--PPLNNVPYG-SATPPARSGS-GQPRGGN-PARR 393
             R      Q P  P   + P G      AR  S G PR    P  R
Sbjct:  1226 RRGEDRDQTPWKPMAASRPGGWREREKAREDSWGPPRDSQAPEER 1270

 Score = 150 (57.9 bits), Expect = 8.8e-07, Sum P(2) = 8.8e-07
 Identities = 82/301 (27%), Positives = 122/301 (40%)

Query:    82 EYEKKFYNDHLESLQVMEKNYITMATEVE---KLRAELMNAPNVDRRADGSYGGATGNSE 138
             E E++ Y + L+ L+  E+       E+E   + R E     +   R D S  G     E
Sbjct:   838 EAEQRDYQERLKKLEEQERKKRQRELEIEERERKREEERRGGDDTFRKDSSRWG-----E 892

Query:   139 NETSGRPVGQNAYEDGYGVPQG---HGPPPSATTAGVVGAG-PNTSTSAYAATQSGTPMR 194
              E SG   G +  E     P+     G P S        +       +A    +     R
Sbjct:   893 REESGWRRGADPDERKQVPPERDWRRGGPDSKPVINEDASNREEDENAALRKDEEQVSSR 952

Query:   195 AAYDIPRGPGYEASKGPGY-DASKAPS--YDPTKGP--SYDPAKGP--GYDPTKGPGYDA 247
             A  +    P  +  KG  + D  + P    +  +GP    D A GP  G++  +GP    
Sbjct:   953 AFEEKVSLPDADEEKGGSWRDEDRGPKRGLEEDRGPRRGIDDA-GPRRGFEEDRGP---- 1007

Query:   248 QKGSNYDAQRGPNYDIHRGP--SYDPQRG--LGYDMQRGPN--YDMQRGP--GYETQRVP 299
             ++G   D      +D  RGP   +D  RG   G+D  RGP    D  RGP  G++  R P
Sbjct:  1008 RRGIEDDRAPRRGFDDDRGPRRGFDDDRGPRRGFDEDRGPRRGIDDDRGPRRGFDEDRTP 1067

Query:   300 --GYDVQRGPVYEAQRAPSYIPQRGP--GYDLQRG--QGYDMRRAP--SYDPSRGT--GF 349
               G+D  RGP    +R   +   RGP  G+D  RG  +G++  R P   ++  RG   GF
Sbjct:  1068 RRGFDDDRGP----RRG--FDDDRGPRRGFDEDRGPRRGFEDDRGPRRGFEDDRGPRRGF 1121

Query:   350 D 350
             +
Sbjct:  1122 E 1122

 Score = 46 (21.3 bits), Expect = 8.8e-11, Sum P(2) = 8.8e-11
 Identities = 10/35 (28%), Positives = 21/35 (60%)

Query:    75 HLCR-GTYEYEKKFYNDHLESLQVMEKNYITMATE 108
             HL + G Y+Y+      +++SL+ + + Y+ +A E
Sbjct:    65 HLAKEGLYQYKNICQQVNIKSLEDVVRAYLKLAEE 99


>UNIPROTKB|A2VD00 [details] [associations]
            symbol:eif3a "Eukaryotic translation initiation factor 3
            subunit A" species:8355 "Xenopus laevis" [GO:0001732 "formation of
            translation initiation complex" evidence=ISS] [GO:0005852
            "eukaryotic translation initiation factor 3 complex" evidence=ISS]
            [GO:0003743 "translation initiation factor activity" evidence=ISS]
            InterPro:IPR000717 Pfam:PF01399 SMART:SM00088 GO:GO:0003743
            GO:GO:0005852 KO:K03254 HAMAP:MF_03000 HOVERGEN:HBG006128
            GO:GO:0001732 EMBL:BC129055 RefSeq:NP_001085285.1 UniGene:Xl.57279
            PRIDE:A2VD00 GeneID:443632 KEGG:xla:443632 Uniprot:A2VD00
        Length = 1424

 Score = 184 (69.8 bits), Expect = 1.6e-10, Sum P(2) = 1.6e-10
 Identities = 66/197 (33%), Positives = 90/197 (45%)

Query:   194 RAAYDIPRGP--GYEASKGP--GYDASKAP--SYDPTKGP--SYDPAKGP--GYDPTKGP 243
             R   D  RGP  G +  +GP  G D  + P    D  +GP   +D  +GP  G+D  +GP
Sbjct:  1030 RRGLDEDRGPRRGLDEDRGPRRGLDEDRGPRRGLDEDRGPRRGFDEDRGPRRGFDEDRGP 1089

Query:   244 GYDAQKGSNYDAQRGPN--YDIHRGP--SYDPQRG--LGYDMQRGPN--YDMQRGP--GY 293
               D      +D  RGP   +D  RGP   +D  RG   G+D  RGP   +D  RGP  G+
Sbjct:  1090 RRD------FDEDRGPRRGFDEDRGPRRGFDEDRGPRRGFDEDRGPRRGFDDDRGPRRGF 1143

Query:   294 ETQRVP--GYDVQRGPV--YEAQRAP--SYIPQRGP--GYDLQRG--QGYDMRRAP--SY 341
             +  R P  G++  RGP   +E  R P   +   RGP  G+D  R   +G++  R P    
Sbjct:  1144 DDDRGPRRGFEDDRGPRRGFEDDRGPRRGFEDDRGPRRGFDEDRTPRRGFEDDRGPRRGM 1203

Query:   342 DPSRGTGFDGAPRGAAP 358
             D  R +   GA     P
Sbjct:  1204 DEERVSWRGGAEEDRGP 1220

 Score = 184 (69.8 bits), Expect = 1.6e-10, Sum P(2) = 1.6e-10
 Identities = 73/232 (31%), Positives = 104/232 (44%)

Query:   152 EDGYGVPQGHGPPPSATT-AGVVGAGPNTSTSAYAATQSG---TP-MRAAYDIPRGP--G 204
             +D   V +G G    A +  G    GP  S       + G    P  R  ++  +GP  G
Sbjct:   943 KDEEQVARGDGDEERAASWRGTDDRGPKRSVEEDGGPRRGFNDEPGPRRGFEDDQGPRRG 1002

Query:   205 YEASKGP--GYDASKAP--SYDPTKGP--SYDPAKGP--GYDPTKGP--GYDAQKGSN-- 252
              +  +GP  G D  + P    D  +GP    D  +GP  G D  +GP  G D  +G    
Sbjct:  1003 LDEDRGPRRGLDEDRGPRRGLDEDRGPRRGLDEDRGPRRGLDEDRGPRRGLDEDRGPRRG 1062

Query:   253 YDAQRGPN--YDIHRGPSYDPQRGLGYDMQRGP--NYDMQRGP--GYETQRVP--GYDVQ 304
              D  RGP   +D  RGP    +RG  +D  RGP  ++D  RGP  G++  R P  G+D  
Sbjct:  1063 LDEDRGPRRGFDEDRGP----RRG--FDEDRGPRRDFDEDRGPRRGFDEDRGPRRGFDED 1116

Query:   305 RGPV--YEAQRAP--SYIPQRGP--GYDLQRG--QGYDMRRAP--SYDPSRG 346
             RGP   ++  R P   +   RGP  G+D  RG  +G++  R P   ++  RG
Sbjct:  1117 RGPRRGFDEDRGPRRGFDDDRGPRRGFDDDRGPRRGFEDDRGPRRGFEDDRG 1168

 Score = 159 (61.0 bits), Expect = 9.5e-08, Sum P(2) = 9.5e-08
 Identities = 61/197 (30%), Positives = 91/197 (46%)

Query:   194 RAAYDIPRGP--GYEASKGP--GYDASKAP--SYDPTKGP--SYDPAKGP--GYDPTKGP 243
             R  +D  RGP   ++  +GP  G+D  + P   +D  +GP   +D  +GP  G+D  +GP
Sbjct:  1080 RRGFDEDRGPRRDFDEDRGPRRGFDEDRGPRRGFDEDRGPRRGFDEDRGPRRGFDDDRGP 1139

Query:   244 GYDAQKGSNYDAQRGPN--YDIHRGPSYDPQRGLGYDMQRGPN--YDMQRGP--GYETQR 297
                 ++G  +D  RGP   ++  RGP    +RG  ++  RGP   ++  RGP  G++  R
Sbjct:  1140 ----RRG--FDDDRGPRRGFEDDRGP----RRG--FEDDRGPRRGFEDDRGPRRGFDEDR 1187

Query:   298 VP--GYDVQRGPV--YEAQRAP---SYIPQRGPGYDLQRGQGYDMRRAPSYD--PSRGTG 348
              P  G++  RGP    + +R          RGP    +  +G   RR    D  P RG  
Sbjct:  1188 TPRRGFEDDRGPRRGMDEERVSWRGGAEEDRGPRRGAEEDRG--PRRGAEEDRGPRRGAE 1245

Query:   349 FDGAPRGAAPH--GQVP 363
              D  PR  A    GQ P
Sbjct:  1246 EDRGPRRGAEEDRGQTP 1262

 Score = 145 (56.1 bits), Expect = 3.3e-06, Sum P(2) = 3.3e-06
 Identities = 83/298 (27%), Positives = 119/298 (39%)

Query:    82 EYEKKFYNDHLESLQVMEKNYITMATEVE---KLRAELMNAPNVDRRADGSYGGATGNSE 138
             E E++ Y + L+ L+  E+       E+E   K R E    P+   R   +    +   +
Sbjct:   838 EAEQREYQERLKKLEEQERKKRLRELEIEEREKKRDEERRGPDDSFRKQDT---PSRWGD 894

Query:   139 NETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYD 198
              E SG   G +  E      +   PP     +G   + P          +     +    
Sbjct:   895 REESGWRRGADPDE------RKQAPPERDWRSGGQDSKP-VKDEDREGDEDSVLRKDEEQ 947

Query:   199 IPRGPGYE--ASKGPGYDASKAP--SYDPTKGP--SYDPAKGP--GYDPTKGP--GYDAQ 248
             + RG G E  A+   G D  + P  S +   GP   ++   GP  G++  +GP  G D  
Sbjct:   948 VARGDGDEERAASWRGTD-DRGPKRSVEEDGGPRRGFNDEPGPRRGFEDDQGPRRGLDED 1006

Query:   249 KGSN--YDAQRGPNYDIHRGPSYD--PQRGLGYDMQRGPN--YDMQRGP--GYETQRVP- 299
             +G     D  RGP     RG   D  P+RGL  D  RGP    D  RGP  G +  R P 
Sbjct:  1007 RGPRRGLDEDRGPR----RGLDEDRGPRRGL--DEDRGPRRGLDEDRGPRRGLDEDRGPR 1060

Query:   300 -GYDVQRGPV--YEAQRAP--SYIPQRGP--GYDLQRG--QGYDMRRAP--SYDPSRG 346
              G D  RGP   ++  R P   +   RGP   +D  RG  +G+D  R P   +D  RG
Sbjct:  1061 RGLDEDRGPRRGFDEDRGPRRGFDEDRGPRRDFDEDRGPRRGFDEDRGPRRGFDEDRG 1118

 Score = 46 (21.3 bits), Expect = 1.6e-10, Sum P(2) = 1.6e-10
 Identities = 10/35 (28%), Positives = 21/35 (60%)

Query:    75 HLCR-GTYEYEKKFYNDHLESLQVMEKNYITMATE 108
             HL + G Y+Y+      +++SL+ + + Y+ +A E
Sbjct:    65 HLAKEGLYQYKNICQQVNIKSLEDVVRAYLKLAEE 99


>WB|WBGene00020550 [details] [associations]
            symbol:T17H7.1 species:6239 "Caenorhabditis elegans"
            [GO:0019915 "lipid storage" evidence=IMP] [GO:0009792 "embryo
            development ending in birth or egg hatching" evidence=IMP]
            GO:GO:0009792 GO:GO:0019915 InterPro:IPR003677 Pfam:PF02520
            EMBL:FO080638 PIR:T28899 RefSeq:NP_497250.1
            ProteinModelPortal:Q22537 PaxDb:Q22537 EnsemblMetazoa:T17H7.1
            GeneID:175228 KEGG:cel:CELE_T17H7.1 UCSC:T17H7.1 CTD:175228
            WormBase:T17H7.1 eggNOG:NOG271901 GeneTree:ENSGT00700000104820
            HOGENOM:HOG000020548 InParanoid:Q22537 OMA:GRGQGPD NextBio:887312
            Uniprot:Q22537
        Length = 682

 Score = 172 (65.6 bits), Expect = 9.4e-10, P = 9.4e-10
 Identities = 75/273 (27%), Positives = 101/273 (36%)

Query:   125 RADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAY 184
             R DG  G   G  +N   G   G+      +G P  +    +  +      GP++  S  
Sbjct:   229 RGDGP-GFVPGTQDNNQRGS--GERGQRQNFG-PSDNLTNGNQFSKKQFARGPSSMNSDL 284

Query:   185 AATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKG-PGYDPTKGP 243
             +     +   + +D PRGPG    +G G D          +GP + P    PG   + GP
Sbjct:   285 SENSQHSDSNSQFDFPRGPGGRGGRGQGPDFGPGGQGGRGQGPDFGPQDDFPGRRGSGGP 344

Query:   244 GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRG-LGYDMQRGPNYDM--QRG---PGYETQR 297
             G    +G   D +   ++   RG     +RG  G     GP  D   +RG   PG    R
Sbjct:   345 GGRGGRGQGPDFEPQDDFPGRRGSGGPGRRGGRGQGPDFGPQDDFPGRRGSGGPGGRGGR 404

Query:   298 VPGYDVQRGPVYEAQRAPSYIPQRGPGYDL--QRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
               G D   GP  +  R     P  GP  D   +RG G      P     RG   D  P G
Sbjct:   405 GQGPDF--GPGRQGGRGQG--PDFGPQDDFSGRRGSG-----GPGGRGGRGQEPDFGPGG 455

Query:   356 AAPHGQVPP--PLNNVP--YGSATPPARSGSGQ 384
                 GQ P   P ++ P   GS  P  R G GQ
Sbjct:   456 QGGRGQGPDFGPQDDFPGRRGSGGPEGRDGRGQ 488

 Score = 139 (54.0 bits), Expect = 4.1e-06, P = 4.1e-06
 Identities = 76/265 (28%), Positives = 93/265 (35%)

Query:   131 GGATGNSENETSGRPVGQNAYEDG--YGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQ 188
             GG  G  +    G P GQ     G  +G PQ   P    +  G  G G       +   Q
Sbjct:   304 GGRGGRGQGPDFG-PGGQGGRGQGPDFG-PQDDFPGRRGS-GGPGGRGGRGQGPDFEP-Q 359

Query:   189 SGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP-GYDPTKGPGYDA 247
                P R       GPG    +G G D      +   +G      +G  G  P  GPG   
Sbjct:   360 DDFPGRRGSG---GPGRRGGRGQGPDFGPQDDFPGRRGSGGPGGRGGRGQGPDFGPGRQG 416

Query:   248 QKGSNYDAQRGPNYDI--HRGPSYDPQRG-LGYDMQRGPNYDMQRG--PGYETQR-VPGY 301
              +G   D   GP  D    RG      RG  G +   GP     RG  P +  Q   PG 
Sbjct:   417 GRGQGPDF--GPQDDFSGRRGSGGPGGRGGRGQEPDFGPGGQGGRGQGPDFGPQDDFPGR 474

Query:   302 DVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQ 361
                 GP  E +      P  GPG    RGQ  D     ++   RG+G  G  RG  P   
Sbjct:   475 RGSGGP--EGRDGRGQGPDFGPGSQGGRGQDSDSGSQDAFPGRRGSGGPGG-RGQGPDFG 531

Query:   362 VPPPLNNVP--YGSATPPARSGSGQ 384
                P ++ P   GS  P  R G GQ
Sbjct:   532 ---PQDDFPGRRGSGGPEGRDGRGQ 553

 Score = 120 (47.3 bits), Expect = 0.00051, P = 0.00051
 Identities = 72/265 (27%), Positives = 94/265 (35%)

Query:   124 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVP-----QGHGPPPSATTAGVVGAGPN 178
             RR  G  G   G  +    G P        G G P     +G GP       G  G GP+
Sbjct:   365 RRGSGGPGRRGGRGQGPDFG-PQDDFPGRRGSGGPGGRGGRGQGPDFGPGRQGGRGQGPD 423

Query:   179 TSTSA-YAATQ-SGTPM-RAA--YDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAK 233
                   ++  + SG P  R     +   GPG +  +G G D      +   +G      +
Sbjct:   424 FGPQDDFSGRRGSGGPGGRGGRGQEPDFGPGGQGGRGQGPDFGPQDDFPGRRGSGGPEGR 483

Query:   234 -GPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDM--QRG 290
              G G  P  GPG    +G + D+     +   RG      RG G D   GP  D   +RG
Sbjct:   484 DGRGQGPDFGPGSQGGRGQDSDSGSQDAFPGRRGSGGPGGRGQGPDF--GPQDDFPGRRG 541

Query:   291 PGYETQRV---------PGYDVQRGPVYEAQRAPSYIPQRGPGYD--LQ-RGQGYDMRRA 338
              G    R          PG    RG   ++    ++  +RGPG    L  RGQG D    
Sbjct:   542 SGGPEGRDGRGQGPDFGPGSQGGRGQDSDSGSQDAFPGRRGPGGPGGLGGRGQGPDF--G 599

Query:   339 PSYDPSRGTGFDGAPRGAAPHGQVP 363
             P     RG G D   R     GQ P
Sbjct:   600 PGGQGDRGQGPDFGARSQGNRGQGP 624

 Score = 118 (46.6 bits), Expect = 0.00084, P = 0.00084
 Identities = 62/240 (25%), Positives = 86/240 (35%)

Query:   124 RRADGSYGGATGNSENETSGRPVGQNAYEDG--YGVPQ-------GHGPPPSATTAGV-V 173
             RR  G  GG  G  +    G P GQ     G  +G PQ       G G P      G   
Sbjct:   433 RRGSGGPGGRGGRGQEPDFG-PGGQGGRGQGPDFG-PQDDFPGRRGSGGPEGRDGRGQGP 490

Query:   174 GAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAK 233
               GP +       + SG+  + A+   RG G    +G G D      +   +G      +
Sbjct:   491 DFGPGSQGGRGQDSDSGS--QDAFPGRRGSGGPGGRGQGPDFGPQDDFPGRRGSGGPEGR 548

Query:   234 -GPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQ----RGLGYDMQRGPNYDMQ 288
              G G  P  GPG    +G + D+     +   RGP   P     RG G D   G   D  
Sbjct:   549 DGRGQGPDFGPGSQGGRGQDSDSGSQDAFPGRRGPG-GPGGLGGRGQGPDFGPGGQGDRG 607

Query:   289 RGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTG 348
             +GP +   R  G   Q GP +E+++   +          Q+     M     +D S G G
Sbjct:   608 QGPDFGA-RSQGNRGQ-GPAFESRQPRQFDDADTSSAPSQKYFQRQMNSGMRFDQSSGFG 665


>FB|FBgn0028573 [details] [associations]
            symbol:prc "pericardin" species:7227 "Drosophila
            melanogaster" [GO:0005605 "basal lamina" evidence=NAS] [GO:0007507
            "heart development" evidence=IMP;TAS] [GO:0005578 "proteinaceous
            extracellular matrix" evidence=IDA] [GO:0035088 "establishment or
            maintenance of apical/basal cell polarity" evidence=TAS]
            [GO:0016477 "cell migration" evidence=TAS] [GO:0002009
            "morphogenesis of an epithelium" evidence=TAS] GO:GO:0002009
            GO:GO:0007507 GO:GO:0005578 FlyBase:FBgn0028573 InterPro:IPR009765
            Pfam:PF07054 EMBL:AF203342 STRING:Q9U617 PRIDE:Q9U617
            InParanoid:Q9U617 ArrayExpress:Q9U617 Bgee:Q9U617 Uniprot:Q9U617
        Length = 1729

 Score = 171 (65.3 bits), Expect = 4.0e-09, P = 4.0e-09
 Identities = 81/274 (29%), Positives = 98/274 (35%)

Query:   130 YGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGV---VGAG-PNTSTSAYA 185
             YG   G      +G+P G    + G G   G G P   T  G+    GAG P   T    
Sbjct:   417 YGTQPGIGGQTGAGQP-GYGT-QPGIGAQTGAGQPGYGTQPGIGGQTGAGQPGYGTQPGI 474

Query:   186 ATQSGTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKG-PSYDPAKGPGYDPTKG- 242
               Q+G   +  Y    G G +   G PGY +          G P Y    G G     G 
Sbjct:   475 GVQTGAG-QPGYGSQPGIGAQTGAGQPGYGSQPGIGGQTGAGQPGYGSQPGIGGQTGAGQ 533

Query:   243 PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRGPGYETQR-VPG 300
             PGY  Q G    AQ G        P Y  Q G+G     G P Y  Q G G +T    PG
Sbjct:   534 PGYGTQPGIG--AQTGAGQ-----PGYGSQPGIGGQTGAGQPGYGSQPGIGGQTGAGQPG 586

Query:   301 YDVQRGPVYEAQRA-PSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGT-GFDGAPRGAA 357
             Y  Q G   +     P Y  Q G G  +  GQ GY  +         G  G+   P    
Sbjct:   587 YGTQPGVGAQTGTGQPGYGSQPGVGTQIGAGQPGYGSQPGIGGQTGAGQPGYGSQPGIGG 646

Query:   358 PHGQVPPPLNNVPYGSATPPARSGSGQPR-GGNP 390
               G   P     P G     A++G+GQP  G  P
Sbjct:   647 QTGAAQPGYGTQP-GVG---AQTGTGQPGYGAQP 676

 Score = 169 (64.5 bits), Expect = 6.7e-09, P = 6.7e-09
 Identities = 86/271 (31%), Positives = 99/271 (36%)

Query:   130 YGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPS--ATTAGVVGAGPNTSTSAYAAT 187
             YGG  G S     G+P G        G+P G+G  P   A TA V G      T      
Sbjct:   876 YGGQPGISGQTGGGQP-GYGGQATISGLP-GYGTQPGIGALTA-VPGGHYGYETQPGIGG 932

Query:   188 QSGTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYD 246
             Q+GT        P G G +   G PGY     P      G S    + PGY    G G  
Sbjct:   933 QTGTNQPGFGGQP-GIGGQTGAGQPGYGFIGQPGIGGQTGTS---GRQPGYGTQPGIGGQ 988

Query:   247 AQKGS-NYDAQRGPNYDIHRG-PSYDPQRGLGYDMQRG-PNYDMQRGPGYETQR-VPGYD 302
                G   Y +Q G       G P Y  Q G+G  +  G P Y  Q G G +T    PGY 
Sbjct:   989 TAAGQPGYGSQPGIGGQTGAGQPGYGSQTGVGGQIGAGQPGYGSQPGIGGQTGAGQPGYG 1048

Query:   303 VQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQV 362
              Q G  +  Q  P Y  Q  PG   Q G G      P Y    G G  G      P   V
Sbjct:  1049 AQPG--FGGQ--PGYGNQ--PGVGGQTGAGQ-----PGYGSQPGVG--GQTGAGQPGYGV 1095

Query:   363 PPPLNNVP-YGSATPPARSG-SGQPR-GGNP 390
              P     P  G  T   + G  GQP  GG+P
Sbjct:  1096 IPGFGGQPGIGGQTAAGKPGYGGQPGIGGSP 1126

 Score = 164 (62.8 bits), Expect = 2.4e-08, P = 2.4e-08
 Identities = 84/270 (31%), Positives = 99/270 (36%)

Query:   131 GGATGNSENETSGRPV--GQN-AYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAAT 187
             GG TG  +     +P   GQ  A + GYG   G G     T AG  G G  T     A T
Sbjct:   390 GGQTGPGQPGYGSQPGIGGQTGAGQPGYGTQPGIG---GQTGAGQPGYG--TQPGIGAQT 444

Query:   188 QSGTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKG-PSYDPAKGPGYDPTKG-PG 244
              +G P    Y    G G +   G PGY            G P Y    G G     G PG
Sbjct:   445 GAGQP---GYGTQPGIGGQTGAGQPGYGTQPGIGVQTGAGQPGYGSQPGIGAQTGAGQPG 501

Query:   245 YDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRGPGYETQR-VPGYD 302
             Y +Q G     Q G        P Y  Q G+G     G P Y  Q G G +T    PGY 
Sbjct:   502 YGSQPGIG--GQTGAGQ-----PGYGSQPGIGGQTGAGQPGYGTQPGIGAQTGAGQPGYG 554

Query:   303 VQRGPVYEAQRA-PSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAAPHG 360
              Q G   +     P Y  Q G G     GQ GY  +  P      GTG  G   G+ P  
Sbjct:   555 SQPGIGGQTGAGQPGYGSQPGIGGQTGAGQPGYGTQ--PGVGAQTGTGQPGY--GSQPGV 610

Query:   361 QVPPPLNNVPYGSATP-PARSGSGQPRGGN 389
                       YGS      ++G+GQP  G+
Sbjct:   611 GTQIGAGQPGYGSQPGIGGQTGAGQPGYGS 640

 Score = 154 (59.3 bits), Expect = 3.0e-07, P = 3.0e-07
 Identities = 78/247 (31%), Positives = 90/247 (36%)

Query:   131 GGATGNSENETS-G-RPV--GQNAY-EDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYA 185
             GG TG S  +   G +P   GQ A  + GYG   G G     T AG  G G  T      
Sbjct:   967 GGQTGTSGRQPGYGTQPGIGGQTAAGQPGYGSQPGIG---GQTGAGQPGYGSQTGVGGQI 1023

Query:   186 ATQSGTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPG 244
                +G P    Y    G G +   G PGY A   P +    G    P  G G      PG
Sbjct:  1024 G--AGQP---GYGSQPGIGGQTGAGQPGYGAQ--PGFGGQPGYGNQPGVG-GQTGAGQPG 1075

Query:   245 YDAQKGSNYDAQRG-PNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRG----PGYETQRV 298
             Y +Q G       G P Y +   P +  Q G+G     G P Y  Q G    P Y TQ+ 
Sbjct:  1076 YGSQPGVGGQTGAGQPGYGVI--PGFGGQPGIGGQTAAGKPGYGGQPGIGGSPVYGTQQG 1133

Query:   299 PG--YDVQRG-PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRA-PSYDPSRGTGFDGAP- 353
              G    +  G P Y  Q  P       PGY    G G       P Y P    G  GAP 
Sbjct:  1134 TGGPSGISGGQPGYGTQ--PGQTGAGQPGYGSLPGTGGQATAGQPGYGPGSQPGIGGAPV 1191

Query:   354 RGAAPHG 360
              G  P G
Sbjct:  1192 YGTQPGG 1198

 Score = 154 (59.3 bits), Expect = 3.0e-07, P = 3.0e-07
 Identities = 86/280 (30%), Positives = 100/280 (35%)

Query:   131 GGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAG-PNTSTSAYAATQS 189
             GG TG  +     +P G    + G G P G+G  P     G  G G P   T      Q+
Sbjct:   339 GGQTGAGQPGYGTQP-GIGG-QTGAGQP-GYGTQPGI--GGQTGPGQPGYGTQPGIGGQT 393

Query:   190 GTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKG-PSYDPAKGPGYDPTKG-PGYD 246
             G P +  Y    G G +   G PGY            G P Y    G G     G PGY 
Sbjct:   394 G-PGQPGYGSQPGIGGQTGAGQPGYGTQPGIGGQTGAGQPGYGTQPGIGAQTGAGQPGYG 452

Query:   247 AQKGSNYDAQRG-PNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRG-PGYETQRVPGYDV 303
              Q G       G P Y    G       G  GY  Q G       G PGY +Q  PG   
Sbjct:   453 TQPGIGGQTGAGQPGYGTQPGIGVQTGAGQPGYGSQPGIGAQTGAGQPGYGSQ--PGIGG 510

Query:   304 QRG---PVYEAQRAPSYIPQRG---PGYDLQRGQGYDMRRA-PSYDPSRGTGFD-GAPRG 355
             Q G   P Y +Q  P    Q G   PGY  Q G G       P Y    G G   GA  G
Sbjct:   511 QTGAGQPGYGSQ--PGIGGQTGAGQPGYGTQPGIGAQTGAGQPGYGSQPGIGGQTGA--G 566

Query:   356 AAPHGQVPPPLNNVPYGS---ATPP---ARSGSGQPRGGN 389
                +G  P        G     T P   A++G+GQP  G+
Sbjct:   567 QPGYGSQPGIGGQTGAGQPGYGTQPGVGAQTGTGQPGYGS 606

 Score = 151 (58.2 bits), Expect = 6.3e-07, P = 6.3e-07
 Identities = 85/282 (30%), Positives = 100/282 (35%)

Query:   120 PNVDRRADGSYGGATGNSENETS--GRPVGQN-AYEDGYGVPQGHGPPPSATTAGVVGAG 176
             P+  R  D S  G  G  ++  S  G   GQ  A + GYG   G G     T  G  G G
Sbjct:   107 PSSGRILDASGSGGIGRPDSIISLPGGVGGQTGAGQPGYGSQPGIG---GQTATGQPGYG 163

Query:   177 PNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKG-PSYDPAKG 234
                   A A   +G P    Y    G G +   G PGY +          G P Y    G
Sbjct:   164 SQLGVGAQAG--AGQP---GYGAQPGVGAQTGAGQPGYGSQTGIGGQTGAGQPGYGSQPG 218

Query:   235 PGYDPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRGPG 292
              G     G PGY +Q G     Q G        P Y  Q G+G     G P Y  Q G G
Sbjct:   219 IGGQTGAGQPGYGSQPGIG--GQTGAGQ-----PGYGSQPGIGGQTGAGQPGYGSQPGIG 271

Query:   293 YETQR-VPGYDVQRGPVYEAQRA-PSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGTGF 349
              +T    PGY  Q G   +     P Y  Q G G     GQ GY  +  P      G G 
Sbjct:   272 GQTGAGQPGYGSQPGIGGQTGAGQPGYGTQPGIGGQTGAGQPGYGSQ--PGIGGQTGAGQ 329

Query:   350 DGAPRGAAPHGQVPPPLNNVPYGSATPPA---RSGSGQPRGG 388
              G        GQ         YG  T P    ++G+GQP  G
Sbjct:   330 PGYGSQPGIGGQTGA--GQPGYG--TQPGIGGQTGAGQPGYG 367

 Score = 142 (55.0 bits), Expect = 6.1e-06, P = 6.1e-06
 Identities = 85/297 (28%), Positives = 102/297 (34%)

Query:   120 PNVDRRADGS---YGGATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGA 175
             P +  +  G    YGG    S     G   G  A      VP GH G        G  G 
Sbjct:   880 PGISGQTGGGQPGYGGQATISGLPGYGTQPGIGALT---AVPGGHYGYETQPGIGGQTGT 936

Query:   176 G-PNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKG 234
               P          Q+G   +  Y     PG     G    + + P Y    G     A G
Sbjct:   937 NQPGFGGQPGIGGQTGAG-QPGYGFIGQPGIGGQTGT---SGRQPGYGTQPGIGGQTAAG 992

Query:   235 -PGYDPTKGPGYDAQKGS-NYDAQRGPNYDIHRG-PSYDPQRGLGYDMQRG-PNYDMQRG 290
              PGY    G G     G   Y +Q G    I  G P Y  Q G+G     G P Y  Q G
Sbjct:   993 QPGYGSQPGIGGQTGAGQPGYGSQTGVGGQIGAGQPGYGSQPGIGGQTGAGQPGYGAQPG 1052

Query:   291 ----PGYETQRVPGYDVQRG---PVYEAQRAPSYIPQRG---PGYDL------QRGQGYD 334
                 PGY  Q  PG   Q G   P Y +Q  P    Q G   PGY +      Q G G  
Sbjct:  1053 FGGQPGYGNQ--PGVGGQTGAGQPGYGSQ--PGVGGQTGAGQPGYGVIPGFGGQPGIGGQ 1108

Query:   335 MRRA-PSYDPSRGTGFDGAPRGAAPHGQVPPP-LNNVPYGSATPPARSGSGQPRGGN 389
                  P Y    G G  G+P      G   P  ++    G  T P ++G+GQP  G+
Sbjct:  1109 TAAGKPGYGGQPGIG--GSPVYGTQQGTGGPSGISGGQPGYGTQPGQTGAGQPGYGS 1163

 Score = 123 (48.4 bits), Expect = 0.00072, P = 0.00072
 Identities = 59/188 (31%), Positives = 68/188 (36%)

Query:   212 GYDASKAPSYDPTKGPSYDPAKG-PGYDPTKGPGYDAQKGS-NYDAQRGPNYDIHRG-PS 268
             G   +  P Y    G     A G PGY    G G  A  G   Y AQ G       G P 
Sbjct:   136 GQTGAGQPGYGSQPGIGGQTATGQPGYGSQLGVGAQAGAGQPGYGAQPGVGAQTGAGQPG 195

Query:   269 YDPQRGLGYDMQRG-PNYDMQRGPGYETQR-VPGYDVQRGPVYEAQRA-PSYIPQRGPGY 325
             Y  Q G+G     G P Y  Q G G +T    PGY  Q G   +     P Y  Q G G 
Sbjct:   196 YGSQTGIGGQTGAGQPGYGSQPGIGGQTGAGQPGYGSQPGIGGQTGAGQPGYGSQPGIGG 255

Query:   326 DLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPA---RSG 381
                 GQ GY  +  P      G G  G        GQ        P G  T P    ++G
Sbjct:   256 QTGAGQPGYGSQ--PGIGGQTGAGQPGYGSQPGIGGQTGA---GQP-GYGTQPGIGGQTG 309

Query:   382 SGQPRGGN 389
             +GQP  G+
Sbjct:   310 AGQPGYGS 317


>ZFIN|ZDB-GENE-041008-78 [details] [associations]
            symbol:polr2a "polymerase (RNA) II (DNA directed)
            polypeptide A" species:7955 "Danio rerio" [GO:0003677 "DNA binding"
            evidence=IEA] [GO:0003899 "DNA-directed RNA polymerase activity"
            evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0005665 "DNA-directed RNA polymerase II, core
            complex" evidence=IEA] [GO:0006366 "transcription from RNA
            polymerase II promoter" evidence=IEA] [GO:0016740 "transferase
            activity" evidence=IEA] [GO:0016779 "nucleotidyltransferase
            activity" evidence=IEA] InterPro:IPR000684 InterPro:IPR000722
            InterPro:IPR006592 InterPro:IPR007066 InterPro:IPR007073
            InterPro:IPR007075 InterPro:IPR007080 InterPro:IPR007081
            InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983 Pfam:PF04990
            Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000 Pfam:PF05001
            PROSITE:PS00115 SMART:SM00663 ZFIN:ZDB-GENE-041008-78 GO:GO:0003677
            GO:GO:0006366 Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899
            GO:GO:0005665 GeneTree:ENSGT00700000104490 EMBL:AL929346
            IPI:IPI00608319 Ensembl:ENSDART00000077495 Bgee:F1Q9K4
            Uniprot:F1Q9K4
        Length = 1965

 Score = 170 (64.9 bits), Expect = 6.0e-09, P = 6.0e-09
 Identities = 65/200 (32%), Positives = 82/200 (41%)

Query:   149 NAYEDGYGVPQGHGPPPSATTAGVVGAGPNTS--TSAYAATQSGTPMRAAYDIPRGPG-- 204
             +A  D  G   G+ P  S T       GP +    S  A + + +P   AY+ PR PG  
Sbjct:  1546 SAASDASGFSPGYSPAWSPTPGSPGSPGPASPYIPSPGALSPNYSPTSPAYE-PRSPGGG 1604

Query:   205 YEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIH 264
             Y   + PGY  + +PSY PT  PSY P   P Y PT  P Y     S Y +   P+Y   
Sbjct:  1605 Y-TPQSPGYSPT-SPSYSPTS-PSYSPTS-PNYSPTS-PSYSPTSPS-Y-SPTSPSYS-P 1656

Query:   265 RGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPG 324
               PSY P     Y     P+Y     P Y     P Y     P Y +  +PSY P   P 
Sbjct:  1657 TSPSYSPTSP-SYS-PTSPSYS-PTSPSYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPS 1709

Query:   325 YDLQRGQGYDMRRAPSYDPS 344
             Y       Y    +PSY P+
Sbjct:  1710 YS-PTSPSYSPT-SPSYSPT 1727

 Score = 131 (51.2 bits), Expect = 0.00011, P = 0.00011
 Identities = 67/234 (28%), Positives = 87/234 (37%)

Query:   160 GHGPPPSATTAGVVGAGPNTSTSAYAATQ----SG-TPMRAAYDIPRGPGYEASKGPGYD 214
             G  P P +  +  +      +T AY A      SG TP  A +  P      +   PGY 
Sbjct:  1501 GSAPSPMSGMSPAMTPWNTGATPAYGAWSPSVGSGMTPGAAGFS-PSAASDASGFSPGYS 1559

Query:   215 A--SKAPSYDPTKGPS--YDPAKG---PGYDPTKGPGYDAQK-GSNYDAQRGPNYDIHRG 266
                S  P    + GP+  Y P+ G   P Y PT  P Y+ +  G  Y  Q  P Y     
Sbjct:  1560 PAWSPTPGSPGSPGPASPYIPSPGALSPNYSPTS-PAYEPRSPGGGYTPQ-SPGYS-PTS 1616

Query:   267 PSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYD 326
             PSY P     Y     PNY     P Y     P Y     P Y +  +PSY P   P Y 
Sbjct:  1617 PSYSPTSP-SYS-PTSPNYS-PTSPSYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS 1669

Query:   327 LQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARS 380
                   Y    +PSY P+  +    +P   +P      P +  P  S T P+ S
Sbjct:  1670 -PTSPSYSPT-SPSYSPTSPSYSPTSP-SYSPTSPSYSPTS--PSYSPTSPSYS 1718


>UNIPROTKB|P71590 [details] [associations]
            symbol:fhaA "FHA domain-containing protein FhaA"
            species:1773 "Mycobacterium tuberculosis" [GO:0005618 "cell wall"
            evidence=IDA] [GO:0005829 "cytosol" evidence=IDA]
            InterPro:IPR000253 InterPro:IPR008984 Pfam:PF00498 PROSITE:PS50006
            SMART:SM00240 GO:GO:0005829 GO:GO:0005618 GenomeReviews:AL123456_GR
            EMBL:BX842572 Gene3D:2.60.200.20 SUPFAM:SSF49879 PIR:B70700
            RefSeq:NP_214534.1 RefSeq:YP_006513334.1 PDB:2LC0 PDB:2LC1 PDB:3OUN
            PDB:3PO8 PDB:3POA PDBsum:2LC0 PDBsum:2LC1 PDBsum:3OUN PDBsum:3PO8
            PDBsum:3POA ProteinModelPortal:P71590 SMR:P71590 DIP:DIP-59047N
            PhosSite:P12071703 PRIDE:P71590 EnsemblBacteria:EBMYCT00000001781
            GeneID:13315997 GeneID:887067 KEGG:mtu:Rv0020c KEGG:mtv:RVBD_0020c
            PATRIC:18148538 TubercuList:Rv0020c HOGENOM:HOG000235804
            OMA:DQGYGQP ProtClustDB:CLSK790198 EvolutionaryTrace:P71590
            InterPro:IPR022128 Pfam:PF12401 Uniprot:P71590
        Length = 527

 Score = 162 (62.1 bits), Expect = 8.0e-09, P = 8.0e-09
 Identities = 84/244 (34%), Positives = 98/244 (40%)

Query:   164 PPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKA-PSYD 222
             P   T   V+      S  A+ A     PM        G G +      YD   A P  D
Sbjct:   127 PDVETHPPVIDCARPQSNHAFGAEPGVAPMSDNSSYRGGQG-QGRPDEYYDDRYARPQED 185

Query:   223 PTKGPSYDPAKGP--GYDPTKGPGYDAQKGSNYDAQRGPNY-DIHRGPSYDPQRGLGYDM 279
             P  GP       P  GY P  G GY  Q G  Y   R P+  D      Y P +G GY  
Sbjct:   186 PRGGPDPQGGSDPRGGYPPETG-GYPPQPG--YPRPRHPDQGDYPEQIGY-PDQG-GYPE 240

Query:   280 QRGPNYDMQRG-P---GYETQRVPGY-DVQRG---PVYEAQRAP-SYIPQRG---PGYDL 327
             QRG  Y  QRG P   GY+ Q   GY D  +G   P YE QR P S  P  G   PGYD 
Sbjct:   241 QRG--YPEQRGYPDQRGYQDQG-RGYPDQGQGGYPPPYE-QRPPVSPGPAAGYGAPGYD- 295

Query:   328 QRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNN---VPYGSATPPARSGSGQ 384
                QGY  R++  Y PS G G  G   G   +G+ P        VP G   PP +  +  
Sbjct:   296 ---QGY--RQSGGYGPSPGGGQPGYG-GYGEYGRGPARHEEGSYVPSGPPGPPEQRPAYP 349

Query:   385 PRGG 388
              +GG
Sbjct:   350 DQGG 353

 Score = 120 (47.3 bits), Expect = 0.00036, P = 0.00036
 Identities = 92/303 (30%), Positives = 111/303 (36%)

Query:   120 PNVDRRADGS-YGGATGNSENETSGRPVGQNAYEDGYGVPQGH---GPPPSATTAGVVGA 175
             P V   +D S Y G  G       GRP     Y+D Y  PQ     GP P   +    G 
Sbjct:   151 PGVAPMSDNSSYRGGQGQ------GRP--DEYYDDRYARPQEDPRGGPDPQGGSDPRGGY 202

Query:   176 GPNTSTSAYAATQSGTPMRAAY----DIPRGPGYEASKG-P---GYDASKAPSYDPTKGP 227
              P T    Y   Q G P R  +    D P   GY    G P   GY   +   Y   +G 
Sbjct:   203 PPETG--GYPP-QPGYP-RPRHPDQGDYPEQIGYPDQGGYPEQRGYPEQRG--YPDQRG- 255

Query:   228 SYDPAKGPGYDPTKGPG-YDAQKGSNYDAQRGPNYDIHRGPSYDP---QRGLGYDMQRG- 282
              Y   +G GY P +G G Y            GP    +  P YD    Q G GY    G 
Sbjct:   256 -YQD-QGRGY-PDQGQGGYPPPYEQRPPVSPGPAAG-YGAPGYDQGYRQSG-GYGPSPGG 310

Query:   283 --PNY----DMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMR 336
               P Y    +  RGP    +   G  V  GP    ++ P+Y P +G GYD    QG    
Sbjct:   311 GQPGYGGYGEYGRGPARHEE---GSYVPSGPPGPPEQRPAY-PDQG-GYDQGYQQGATTY 365

Query:   337 RAPSYDPSRG-TGFDGAPR--GAAPHG--QVPPPLNNVPYG-SATP----PARSG-SGQP 385
                 Y      T +  +PR  G AP G     P   +  YG S  P    PA  G SG  
Sbjct:   366 GRQDYGGGADYTRYTESPRVPGYAPQGGGYAEPAGRDYDYGQSGAPDYGQPAPGGYSGYG 425

Query:   386 RGG 388
             +GG
Sbjct:   426 QGG 428


>UNIPROTKB|Q92804 [details] [associations]
            symbol:TAF15 "TATA-binding protein-associated factor 2N"
            species:9606 "Homo sapiens" [GO:0000166 "nucleotide binding"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0003677 "DNA binding" evidence=IEA] [GO:0003723 "RNA binding"
            evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005634 "nucleus" evidence=IDA] [GO:0005730 "nucleolus"
            evidence=IDA] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0045893
            "positive regulation of transcription, DNA-dependent" evidence=TAS]
            InterPro:IPR000504 InterPro:IPR001876 InterPro:IPR012677
            Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102
            PROSITE:PS50199 SMART:SM00360 SMART:SM00547 GO:GO:0005634
            GO:GO:0005737 GO:GO:0045893 GO:GO:0000166 GO:GO:0046872
            GO:GO:0003677 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003723
            EMBL:CH471147 eggNOG:NOG240581 HOGENOM:HOG000038010 EMBL:AC015849
            EMBL:U51334 EMBL:X98893 EMBL:AB010067 EMBL:AY197697 EMBL:AK313223
            IPI:IPI00020194 IPI:IPI00294426 PIR:S71954 RefSeq:NP_003478.1
            RefSeq:NP_631961.1 UniGene:Hs.402752 ProteinModelPortal:Q92804
            SMR:Q92804 IntAct:Q92804 STRING:Q92804 PhosphoSite:Q92804
            DMDM:8928305 PaxDb:Q92804 PRIDE:Q92804 DNASU:8148
            Ensembl:ENST00000311979 GeneID:8148 KEGG:hsa:8148 UCSC:uc002hkc.3
            UCSC:uc002hkd.3 CTD:8148 GeneCards:GC17P034136 HGNC:HGNC:11547
            HPA:HPA052059 MIM:601574 neXtProt:NX_Q92804 PharmGKB:PA36322
            HOVERGEN:HBG005755 InParanoid:Q92804 KO:K14651 OMA:YGNQGSQ
            OrthoDB:EOG4MW872 PhylomeDB:Q92804 ChiTaRS:TAF15 GenomeRNAi:8148
            NextBio:30819 PMAP-CutDB:Q92804 ArrayExpress:Q92804 Bgee:Q92804
            CleanEx:HS_TAF15 Genevestigator:Q92804 GermOnline:ENSG00000172660
            Uniprot:Q92804
        Length = 592

 Score = 160 (61.4 bits), Expect = 1.6e-08, P = 1.6e-08
 Identities = 67/206 (32%), Positives = 79/206 (38%)

Query:   136 NSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNTSTSAYAATQSGTPMR 194
             N       RP G +    GYG  +G+ G        G  G G + S   Y   +S     
Sbjct:   380 NEPRPEDSRPSGGDFRGRGYGGERGYRGRGGRGGDRG--GYGGDRSGGGYGGDRSSG--- 434

Query:   195 AAYDIPR-GPGYEASK-GPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSN 252
               Y   R G GY   + G GY   +   Y   +G  Y   +G GY   +G GY   +G  
Sbjct:   435 GGYSGDRSGGGYGGDRSGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRG-GYGGDRGGG 493

Query:   253 YDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNY--DMQRGPGYETQRV--PGYDVQRGPV 308
             Y   RG  Y   RG  Y   RG GY   RG  Y  D  RG GY   R    GY   R   
Sbjct:   494 YGGDRG-GYGGDRG-GYGGDRG-GYGGDRG-GYGGDRSRG-GYGGDRGGGSGYGGDRSGG 548

Query:   309 YEAQRAPS-YIPQRGPGYDLQRGQGY 333
             Y   R+   Y   RG GY   RG GY
Sbjct:   549 YGGDRSGGGYGGDRGGGYGGDRG-GY 573

 Score = 159 (61.0 bits), Expect = 2.1e-08, P = 2.1e-08
 Identities = 68/220 (30%), Positives = 83/220 (37%)

Query:   120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNT 179
             P   R + G + G     E    GR  G+     GYG  +  G      ++G  G   + 
Sbjct:   384 PEDSRPSGGDFRGRGYGGERGYRGRG-GRGGDRGGYGGDRSGGGYGGDRSSGG-GYSGDR 441

Query:   180 STSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDP 239
             S   Y   +SG      Y   RG GY   +G GY   +   Y   +G  Y   +G GY  
Sbjct:   442 SGGGYGGDRSG----GGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRG-GYGGDRGGGYGG 496

Query:   240 TKGPGYDAQKGSNYDAQRGPNYDIHRGPSY--DPQRGLGYDMQRGPNYDMQRGPGYETQR 297
              +G GY   +G  Y   RG  Y   RG  Y  D  RG GY   RG       G GY   R
Sbjct:   497 DRG-GYGGDRGG-YGGDRG-GYGGDRG-GYGGDRSRG-GYGGDRGG------GSGYGGDR 545

Query:   298 VPGYDVQR-GPVYEAQRAPSYIPQRGPGYDLQRGQGYDMR 336
               GY   R G  Y   R   Y   RG GY  + G   D R
Sbjct:   546 SGGYGGDRSGGGYGGDRGGGYGGDRG-GYGGKMGGRNDYR 584

 Score = 153 (58.9 bits), Expect = 8.8e-09, Sum P(2) = 8.8e-09
 Identities = 60/164 (36%), Positives = 68/164 (41%)

Query:   201 RGPGYEASK-GPGY--DASKAPSYDPTK-GPSYDPAK-GPGYDPTKGPGYDAQKGSNYDA 255
             RG GY   + G GY  D S    Y   + G  Y   + G GY   +G GY   +G  Y  
Sbjct:   415 RG-GYGGDRSGGGYGGDRSSGGGYSGDRSGGGYGGDRSGGGYGGDRGGGYGGDRGGGYGG 473

Query:   256 QRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAP 315
              RG  Y   RG  Y   RG GY   RG  Y   RG GY   R  GY   RG  Y   R+ 
Sbjct:   474 DRGGGYGGDRG-GYGGDRGGGYGGDRG-GYGGDRG-GYGGDR-GGYGGDRGG-YGGDRSR 528

Query:   316 S-YIPQRG--PGYDLQRGQGYDMRRAPS-YDPSRGTGFDGAPRG 355
               Y   RG   GY   R  GY   R+   Y   RG G+ G  RG
Sbjct:   529 GGYGGDRGGGSGYGGDRSGGYGGDRSGGGYGGDRGGGY-GGDRG 571

 Score = 53 (23.7 bits), Expect = 8.8e-09, Sum P(2) = 8.8e-09
 Identities = 21/96 (21%), Positives = 40/96 (41%)

Query:    78 RGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRRADGSYGGATGNS 137
             +  Y+ +   Y+ + +S     +NY   +   +  R ++      +R   GS GG  G  
Sbjct:   132 QSNYDQQHDSYSQNQQSYHSQRENY---SHHTQDDRRDVSRYGEDNRGYGGSQGGGRGRG 188

Query:   138 ENETSGR-PV-GQNAYEDG----YGVPQGHGPPPSA 167
               +  GR P+ G +  + G    +G  + +GP   A
Sbjct:   189 GYDKDGRGPMTGSSGGDRGGFKNFGGHRDYGPRTDA 224


>UNIPROTKB|F1PB61 [details] [associations]
            symbol:TAF15 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0005622 "intracellular" evidence=IEA] [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0000166 "nucleotide binding"
            evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358
            PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360 SMART:SM00547
            GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
            GO:GO:0005622 GeneTree:ENSGT00530000063105 CTD:8148 KO:K14651
            OMA:YGNQGSQ EMBL:AAEX03006620 EMBL:AAEX03006619 RefSeq:XP_548255.2
            ProteinModelPortal:F1PB61 Ensembl:ENSCAFT00000028877 GeneID:491135
            KEGG:cfa:491135 Uniprot:F1PB61
        Length = 571

 Score = 160 (61.4 bits), Expect = 1.5e-08, P = 1.5e-08
 Identities = 70/240 (29%), Positives = 87/240 (36%)

Query:   124 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQ-GHGPPPSATTAGVVGAGPNTSTS 182
             RR +   GG +G       G   G+  ++   G P+ G    P+ +   +  A  N+   
Sbjct:   319 RRPEFMRGGGSGGGRRGRGGYR-GRGGFQGRGGDPKSGDWVCPNPSCGNMNFARRNSCNQ 377

Query:   183 AYAAT-QSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGY--DP 239
                   +   P    +   RG GY   +G  Y        D   G   D + G GY  D 
Sbjct:   378 CNEPRPEDSRPSGGDF---RGRGYGGERG--YRGRGGRGGD-RGGYGADRSSG-GYGGDR 430

Query:   240 TKGPGYDAQK-GSNYDAQR-GPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQR 297
             + G GY   + G  Y   R G  Y   RG  Y   RG GY   RG  Y   RG GY   R
Sbjct:   431 SGGGGYGGDRSGGGYGGDRSGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDR 490

Query:   298 VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQG--YDMRRAPSYDPSRGTGFDGAPRG 355
               GY   RG  Y   R      + G GY   RG G  Y   R   Y   R  G  G  RG
Sbjct:   491 GGGYGGDRGGGYGGDRGGYGGDRSGGGYGGDRGGGGGYGGDRGGGYGGDRSGGGYGGDRG 550

 Score = 145 (56.1 bits), Expect = 1.9e-08, Sum P(2) = 1.9e-08
 Identities = 52/152 (34%), Positives = 61/152 (40%)

Query:   194 RAAYDIPR---GPGYEASKGPGYDASKAPS-YDPTK-GPSYDPAKGPGYDPTKGPGYDAQ 248
             R  Y   R   G G + S G GY   ++   Y   + G  Y   +G GY   +G GY   
Sbjct:   414 RGGYGADRSSGGYGGDRSGGGGYGGDRSGGGYGGDRSGGGYGGDRGGGYGGDRGGGYGGD 473

Query:   249 KGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQR-GPGYETQRVPG--YDVQR 305
             +G  Y   RG  Y   RG  Y   RG GY   RG  Y   R G GY   R  G  Y   R
Sbjct:   474 RGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRG-GYGGDRSGGGYGGDRGGGGGYGGDR 532

Query:   306 GPVYEAQRAPS-YIPQRGPGYDLQRGQGYDMR 336
             G  Y   R+   Y   RG GY  + G   D R
Sbjct:   533 GGGYGGDRSGGGYGGDRG-GYGGKMGGRNDYR 563

 Score = 139 (54.0 bits), Expect = 8.9e-08, Sum P(2) = 8.9e-08
 Identities = 68/219 (31%), Positives = 76/219 (34%)

Query:   147 GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYE 206
             G+  Y  G G  QG G  P +     V   P+     +A   S             P   
Sbjct:   335 GRGGYR-GRGGFQGRGGDPKS--GDWVCPNPSCGNMNFARRNSCNQCNEPRPEDSRPSGG 391

Query:   207 ASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGY--DAQKGSNYDAQR-GPNYDI 263
               +G GY   +   Y    G   D   G G D + G GY  D   G  Y   R G  Y  
Sbjct:   392 DFRGRGYGGERG--YRGRGGRGGDRG-GYGADRSSG-GYGGDRSGGGGYGGDRSGGGYGG 447

Query:   264 HR-GPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG 322
              R G  Y   RG GY   RG  Y   RG GY   R  GY   RG  Y   R   Y   RG
Sbjct:   448 DRSGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRG 507

Query:   323 PGYDLQR-GQGYDMRRAPSYDPSRGTGFDGAPRGAAPHG 360
              GY   R G GY        D   G G+ G  RG    G
Sbjct:   508 -GYGGDRSGGGY------GGDRGGGGGY-GGDRGGGYGG 538

 Score = 121 (47.7 bits), Expect = 0.00031, P = 0.00031
 Identities = 48/167 (28%), Positives = 62/167 (37%)

Query:   125 RADGSYGGATGN-SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSA 183
             R  G  GG  G    + +SG   G  +   GYG  +  G      + G  G G +     
Sbjct:   405 RGRGGRGGDRGGYGADRSSGGYGGDRSGGGGYGGDRSGGGYGGDRSGG--GYGGDRG-GG 461

Query:   184 YAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK-G 242
             Y   + G      Y   RG GY   +G GY   +   Y   +G  Y   +G GY   + G
Sbjct:   462 YGGDRGG-----GYGGDRGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRG-GYGGDRSG 515

Query:   243 PGYDAQKGSN--YDAQRGPNYDIHR-GPSYDPQRGLGYDMQRGPNYD 286
              GY   +G    Y   RG  Y   R G  Y   RG GY  + G   D
Sbjct:   516 GGYGGDRGGGGGYGGDRGGGYGGDRSGGGYGGDRG-GYGGKMGGRND 561

 Score = 58 (25.5 bits), Expect = 1.9e-08, Sum P(2) = 1.9e-08
 Identities = 23/96 (23%), Positives = 39/96 (40%)

Query:    78 RGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRRADGSYGGATGNS 137
             +  Y  +   YN + +S      NY   +   +  R ++      +R   GS GG  G  
Sbjct:   131 QSNYGPQHDSYNQNQQSYHSQRDNY---SHHTQDDRRDVSRYGEDNRGYGGSQGGGRGRG 187

Query:   138 ENETSGR-PV-GQNAYEDG----YGVPQGHGPPPSA 167
               +  GR P+ G +  + G    +G  + +GP P A
Sbjct:   188 GYDKDGRGPMTGSSGGDRGGFKNFGGHRDYGPRPDA 223


>WB|WBGene00044109 [details] [associations]
            symbol:K02E11.10 species:6239 "Caenorhabditis elegans"
            [GO:0016021 "integral to membrane" evidence=IEA] EMBL:Z77665
            RefSeq:NP_001024024.1 ProteinModelPortal:Q5FC49
            EnsemblMetazoa:K02E11.10 GeneID:259661 KEGG:cel:CELE_K02E11.10
            UCSC:K02E11.10 CTD:259661 WormBase:K02E11.10
            GeneTree:ENSGT00530000065030 InParanoid:Q5FC49 OMA:VQASGYQ
            NextBio:952394 Uniprot:Q5FC49
        Length = 360

 Score = 154 (59.3 bits), Expect = 2.9e-08, P = 2.9e-08
 Identities = 69/224 (30%), Positives = 91/224 (40%)

Query:   154 GYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPG 212
             G+G   G    P A   G+ G  G      A+     G     A     G G     G G
Sbjct:    81 GFGGAGGSYAAP-ALGGGLGGFGGAPAPAPAFGGLGGGYQAAPALGGGLGGGLGGGPGGG 139

Query:   213 YDASKAPSYDPTKGPSYDPA---KGPGYD--PTKGPGYDAQKGSNYDAQRGP---NYDIH 264
             Y A+ A        P+  PA    G GY   PT G G  AQ G+ Y  Q+GP    +   
Sbjct:   140 YQAAPALQLPGLGAPA--PAFGGLGGGYQGAPTLGGG-QAQGGAGY--QQGPAQGRFVAQ 194

Query:   265 RGPSYDPQRGLGYDMQRGP---NYDMQRGPGYETQRVPGYDVQRGPV---YEAQRAPSYI 318
             +G +   Q G GY  Q+GP    +  Q+GP    Q   GY  Q+GP    + AQ+ P+  
Sbjct:   195 QGSAQGVQGGAGY--QQGPAQGGFTAQQGPAQVVQGGAGY--QQGPAQGGFVAQQGPAPA 250

Query:   319 PQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG-AAPHGQ 361
              Q G GY     QG     A     ++G G+  A +G +AP  Q
Sbjct:   251 AQGGAGYQQGSTQGGFEAVAQQGQVAQGAGYQSAAQGQSAPVSQ 294


>DICTYBASE|DDB_G0277909 [details] [associations]
            symbol:cbpP "calcium-binding protein" species:44689
            "Dictyostelium discoideum" [GO:0005509 "calcium ion binding"
            evidence=IEA] [GO:0046872 "metal ion binding" evidence=IEA]
            [GO:0044351 "macropinocytosis" evidence=RCA] InterPro:IPR002048
            InterPro:IPR011992 Pfam:PF13499 PROSITE:PS50222 SMART:SM00054
            dictyBase:DDB_G0277909 Prosite:PS00018 GenomeReviews:CM000152_GR
            EMBL:AAFI02000023 GO:GO:0005509 Gene3D:1.10.238.10
            InterPro:IPR018247 EMBL:U03413 RefSeq:XP_642080.1
            ProteinModelPortal:P35085 PRIDE:P35085 EnsemblProtists:DDB0214957
            GeneID:8621293 KEGG:ddi:DDB_G0277909 eggNOG:NOG135385 OMA:MGAYPPQ
            ProtClustDB:CLSZ2846833 Uniprot:P35085
        Length = 467

 Score = 155 (59.6 bits), Expect = 3.9e-08, P = 3.9e-08
 Identities = 73/247 (29%), Positives = 89/247 (36%)

Query:   158 PQGHGPPPSATTAGVVGAGPNT--STSAYAATQS--GTPMRAAYDIPRGPGYEASKGPGY 213
             PQ   PPP+ + A      P     T     +QS  G P       P+ PG   S  P Y
Sbjct:     4 PQN--PPPAGSAADFYSQMPVKVMGTPGAPGSQSTPGAPGAPGQYPPQQPGAPGSNLPPY 61

Query:   214 DASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRG-PNYDIHRGPSYDPQ 272
               ++ P      G  Y P + PG  P + PG   Q       Q G P     +   Y PQ
Sbjct:    62 PGTQQPGAPGAPG-QYPPQQ-PGQYPPQQPGAPGQYPPQQPGQPGYPPQQPGQSGQYPPQ 119

Query:   273 R-GL-GYDMQR--GPN-YDMQRG-PGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYD 326
             + G  GY  Q+   P  Y  Q+G PG    + PG   Q  P  + Q  P    Q G    
Sbjct:   120 QPGQPGYPPQQPGAPGQYPPQQGQPGQYPPQQPGQPGQYPPQQQGQYPPQQPGQPGAYPP 179

Query:   327 LQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPP---ARSGSG 383
              Q GQ        +Y P +G     A  GA     VPPP    P     PP   A  G  
Sbjct:   180 QQSGQ------PGAYPPQQGVQNTLAKTGAPGQPGVPPPQGAYPGQPGVPPQQGAYPGQQ 233

Query:   384 QPRGGNP 390
              P G  P
Sbjct:   234 PPMGAYP 240

 Score = 139 (54.0 bits), Expect = 2.3e-06, P = 2.3e-06
 Identities = 79/251 (31%), Positives = 98/251 (39%)

Query:   162 GPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSY 221
             G P S +T G  GA P      Y   Q G P     ++P  PG +    PG      P  
Sbjct:    29 GAPGSQSTPGAPGA-PGQ----YPPQQPGAP---GSNLPPYPGTQQPGAPGAPGQYPPQ- 79

Query:   222 DPTKGPSYDPAKGPG-YDPTK-G-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQR-GL-G 276
              P + P   P   PG Y P + G PGY  Q+      Q  P       P Y PQ+ G  G
Sbjct:    80 QPGQYPPQQPG-APGQYPPQQPGQPGYPPQQPGQ-SGQYPPQQPGQ--PGYPPQQPGAPG 135

Query:   277 -YDMQRG-PNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG--PG-YDLQRGQ 331
              Y  Q+G P     + PG   Q  P    Q  P    Q   +Y PQ+   PG Y  Q+G 
Sbjct:   136 QYPPQQGQPGQYPPQQPGQPGQYPPQQQGQYPPQQPGQPG-AYPPQQSGQPGAYPPQQGV 194

Query:   332 GYDMRRA-----PSYDPSRGT--GFDGAP--RGAAPHGQVPPPLNNVPYGSATPPARSGS 382
                + +      P   P +G   G  G P  +GA P GQ PP     P G   P A    
Sbjct:   195 QNTLAKTGAPGQPGVPPPQGAYPGQPGVPPQQGAYP-GQQPPMGAYPPQGQ--PGAYPPQ 251

Query:   383 GQPRGGNPARR 393
             GQP G  P ++
Sbjct:   252 GQP-GAYPPQQ 261

 Score = 133 (51.9 bits), Expect = 1.1e-05, P = 1.1e-05
 Identities = 83/276 (30%), Positives = 101/276 (36%)

Query:   132 GATGNSENETSGRPVGQNAYEDGY-GVPQGHGPP-PSATTAGVVGA-G--PNTSTSAYAA 186
             GA G+    T G P     Y     G P  + PP P     G  GA G  P      Y  
Sbjct:    29 GAPGSQS--TPGAPGAPGQYPPQQPGAPGSNLPPYPGTQQPGAPGAPGQYPPQQPGQYPP 86

Query:   187 TQSGTPMRAAYDIPRGPGYEASKGPG----YDASKA--PSYDPTK--GPS-YDPAKG-PG 236
              Q G P +     P  PGY   + PG    Y   +   P Y P +   P  Y P +G PG
Sbjct:    87 QQPGAPGQYPPQQPGQPGYPPQQ-PGQSGQYPPQQPGQPGYPPQQPGAPGQYPPQQGQPG 145

Query:   237 YDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQR-GL--GYDMQRGPNYDMQRGPGY 293
               P + PG   Q       Q  P      G +Y PQ+ G    Y  Q+G    + +  G 
Sbjct:   146 QYPPQQPGQPGQYPPQQQGQYPPQQPGQPG-AYPPQQSGQPGAYPPQQGVQNTLAK-TGA 203

Query:   294 ETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGA- 352
               Q  PG    +G  Y  Q  P   PQ+G  Y    GQ   M    +Y P    G  GA 
Sbjct:   204 PGQ--PGVPPPQG-AYPGQ--PGVPPQQG-AYP---GQQPPMG---AYPPQ---GQPGAY 248

Query:   353 PRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 388
             P    P G  PP    V Y    PP   G+  P+ G
Sbjct:   249 PPQGQP-GAYPPQQQQVAYPGQQPPM--GAYPPQQG 281


>FB|FBgn0050203 [details] [associations]
            symbol:CG30203 species:7227 "Drosophila melanogaster"
            [GO:0004867 "serine-type endopeptidase inhibitor activity"
            evidence=IEA] InterPro:IPR002223 Pfam:PF00014 PROSITE:PS50279
            SMART:SM00131 EMBL:AE013599 GO:GO:0004867 Gene3D:4.10.410.10
            SUPFAM:SSF57362 InterPro:IPR000884 Pfam:PF00090 SMART:SM00209
            SUPFAM:SSF82895 PROSITE:PS50092 InterPro:IPR002861 Pfam:PF02014
            PROSITE:PS51019 GeneTree:ENSGT00640000091268 InterPro:IPR009465
            Pfam:PF06468 PROSITE:PS51020 EMBL:BT023853 RefSeq:NP_725128.2
            UniGene:Dm.23753 SMR:Q3ZAL6 EnsemblMetazoa:FBtr0273303
            GeneID:246514 KEGG:dme:Dmel_CG30203 FlyBase:FBgn0050203
            eggNOG:NOG244582 OMA:KWARNTH OrthoDB:EOG43R22N GenomeRNAi:246514
            NextBio:842774 Uniprot:Q3ZAL6
        Length = 924

 Score = 157 (60.3 bits), Expect = 6.5e-08, P = 6.5e-08
 Identities = 39/105 (37%), Positives = 49/105 (46%)

Query:   194 RAAYDIP--RGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGS 251
             R +YD    RG  Y+ + G  Y  ++  SYD   G SYD   G  Y  T G  YD  +  
Sbjct:   793 RRSYDQKQTRGRPYDQTSGIAYGQTEGRSYDQKVGKSYDRTFGRSYKQTGGGSYDQPEDR 852

Query:   252 NYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY-ET 295
             +YD   G +Y      SYD  RG  YD   G +YD+  G  Y ET
Sbjct:   853 SYDLSTGRSYVQPEDRSYDLSRGRSYDQPVGRSYDLAGGRSYGET 897

 Score = 153 (58.9 bits), Expect = 1.8e-07, P = 1.8e-07
 Identities = 46/148 (31%), Positives = 60/148 (40%)

Query:   206 EASKGPGYDASKAPSYDP--TKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDI 263
             E S+    D     SYD   T+G  YD   G  Y  T+G  YD + G +YD   G +Y  
Sbjct:   781 ERSENDAMDLYGRRSYDQKQTRGRPYDQTSGIAYGQTEGRSYDQKVGKSYDRTFGRSYKQ 840

Query:   264 HRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGP 323
               G SYD      YD+  G +Y       Y+  R   YD   G  Y+     SY      
Sbjct:   841 TGGGSYDQPEDRSYDLSTGRSYVQPEDRSYDLSRGRSYDQPVGRSYDLAGGRSYGETSEA 900

Query:   324 GYDLQRGQGYDMRRAPSYDPSRGTGFDG 351
             G D+  G+     R+  YD SR   + G
Sbjct:   901 G-DI--GEPMSQTRS-RYDTSRRGRYGG 924

 Score = 134 (52.2 bits), Expect = 2.2e-05, P = 2.2e-05
 Identities = 36/111 (32%), Positives = 45/111 (40%)

Query:   245 YDAQ--KGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYD 302
             YD +  +G  YD   G  Y    G SYD + G  YD   G +Y    G  Y+      YD
Sbjct:   796 YDQKQTRGRPYDQTSGIAYGQTEGRSYDQKVGKSYDRTFGRSYKQTGGGSYDQPEDRSYD 855

Query:   303 VQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 353
             +  G  Y      SY   RG  YD   G+ YD+    SY  +   G  G P
Sbjct:   856 LSTGRSYVQPEDRSYDLSRGRSYDQPVGRSYDLAGGRSYGETSEAGDIGEP 906

 Score = 123 (48.4 bits), Expect = 0.00035, P = 0.00035
 Identities = 38/119 (31%), Positives = 52/119 (43%)

Query:   179 TSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYD 238
             TS  AY  T+       +YD   G  Y+ + G  Y  +   SYD  +  SYD + G  Y 
Sbjct:   809 TSGIAYGQTEG-----RSYDQKVGKSYDRTFGRSYKQTGGGSYDQPEDRSYDLSTGRSYV 863

Query:   239 PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDP--QRG-LGYDM-QRGPNYDMQRGPGY 293
               +   YD  +G +YD   G +YD+  G SY    + G +G  M Q    YD  R   Y
Sbjct:   864 QPEDRSYDLSRGRSYDQPVGRSYDLAGGRSYGETSEAGDIGEPMSQTRSRYDTSRRGRY 922


>WB|WBGene00005015 [details] [associations]
            symbol:spt-5 species:6239 "Caenorhabditis elegans"
            [GO:0032968 "positive regulation of transcription elongation from
            RNA polymerase II promoter" evidence=IEA] [GO:0006357 "regulation
            of transcription from RNA polymerase II promoter" evidence=IEA]
            [GO:0032784 "regulation of DNA-dependent transcription, elongation"
            evidence=IEA] [GO:0009792 "embryo development ending in birth or
            egg hatching" evidence=IMP] [GO:0040007 "growth" evidence=IMP]
            [GO:0002119 "nematode larval development" evidence=IMP] [GO:0040035
            "hermaphrodite genitalia development" evidence=IMP] [GO:0000003
            "reproduction" evidence=IMP] InterPro:IPR006645 InterPro:IPR017071
            InterPro:IPR024945 PIRSF:PIRSF036945 SMART:SM00738 SMART:SM01104
            Pfam:PF00467 GO:GO:0009792 GO:GO:0040007 GO:GO:0002119
            GO:GO:0006351 GO:GO:0040035 GO:GO:0032968 EMBL:Z68752
            InterPro:IPR008991 SUPFAM:SSF50104 InterPro:IPR005824 SMART:SM00739
            eggNOG:COG0250 GO:GO:0032044 HOGENOM:HOG000038564 KO:K15172
            InterPro:IPR022581 InterPro:IPR005100 PANTHER:PTHR11125:SF7
            Pfam:PF03439 Pfam:PF11942 EMBL:Z68316 PIR:T23467 RefSeq:NP_502283.1
            ProteinModelPortal:Q21338 SMR:Q21338 STRING:Q21338 PaxDb:Q21338
            EnsemblMetazoa:K08E4.1 GeneID:178143 KEGG:cel:CELE_K08E4.1
            UCSC:K08E4.1 CTD:178143 WormBase:K08E4.1
            GeneTree:ENSGT00440000037640 InParanoid:Q21338 OMA:PAYGNES
            NextBio:899898 Uniprot:Q21338
        Length = 1208

 Score = 158 (60.7 bits), Expect = 7.1e-08, P = 7.1e-08
 Identities = 60/182 (32%), Positives = 76/182 (41%)

Query:   179 TSTSAYAA-TQSGTPMRAAYDIPRGPGY-EASKGPGYDASKAPSYDPTKGPSYDPAKGPG 236
             + T  Y A T     M  AYD  R P Y E  + P Y  SK P+Y      S       G
Sbjct:   813 SKTPMYGAQTPMYGSMTPAYDGGRTPAYGEGGRTPAY-GSKTPAYGDLDEHSSSRTPAYG 871

Query:   237 YDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ-RGPGYET 295
              D ++ P Y    GS  D  R P Y    G    P  G   D  R P YD   R PGYE+
Sbjct:   872 NDSSRTPAY----GSA-DGARTPAYGSTEG-GRTPAYG-SMDNSRTPAYDDSGRTPGYES 924

Query:   296 Q--RVPGYDVQ-RGPVY-EAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDG 351
                R P YD   + P Y E++ +      R P Y+      YD+  +P+Y+P     +D 
Sbjct:   925 MPSRTPNYDSSSKTPAYPESEHSA-----RTPAYN----NDYDIPLSPAYEPDAPEAYDN 975

Query:   352 AP 353
             AP
Sbjct:   976 AP 977

 Score = 143 (55.4 bits), Expect = 3.1e-06, P = 3.1e-06
 Identities = 73/253 (28%), Positives = 95/253 (37%)

Query:   126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYA 185
             A GS   A G+ +  +S R     AY +       +G    A T    G+     T AY 
Sbjct:   848 AYGSKTPAYGDLDEHSSSRTP---AYGNDSSRTPAYGSADGARTPAY-GSTEGGRTPAYG 903

Query:   186 ATQ-SGTPMRAAYDIP-RGPGYEA--SKGPGYDAS-KAPSYDPTKGPSYDPAKGPGYDPT 240
             +   S TP   AYD   R PGYE+  S+ P YD+S K P+Y  ++  +  PA    YD  
Sbjct:   904 SMDNSRTP---AYDDSGRTPGYESMPSRTPNYDSSSKTPAYPESEHSARTPAYNNDYDIP 960

Query:   241 KGPGYDAQKGSNYD---------AQRGPNYDIHRG--PSYDPQRGLGYDMQRG----PNY 285
               P Y+      YD           R P YD +    P+Y+P      +   G    P Y
Sbjct:   961 LSPAYEPDAPEAYDNAPARTPAFVSRTPGYDTYENSSPTYEPDAATKVEEDIGDTSSPTY 1020

Query:   286 DMQRGPGYETQRVPGYDVQRG-P-VYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS--- 340
             D    P       PG  +    P  Y     P +     PG     G  YD   APS   
Sbjct:  1021 DSP--PHSYVVPTPGAMLNPATPGAYHVD-TPGFAAPMTPG----SGGAYDQYVAPSPFA 1073

Query:   341 -YDPSRGTGFDGA 352
              YD +     DGA
Sbjct:  1074 GYDSNNYNNADGA 1086

 Score = 133 (51.9 bits), Expect = 3.9e-05, P = 3.9e-05
 Identities = 67/218 (30%), Positives = 84/218 (38%)

Query:   194 RAAYDIPRGPGYEASKGPG---YDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKG 250
             RA   +    G  A  G G   Y +SK P  D  K P Y  +K P Y   + P Y +   
Sbjct:   773 RARVMVVGDTGITAGSGGGSSFYSSSKTPMRDSGKTPMYG-SKTPMYG-AQTPMYGSMTP 830

Query:   251 SNYDAQRGPNY-DIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY--ETQRVPGY---DVQ 304
             + YD  R P Y +  R P+Y  +     D+     +   R P Y  ++ R P Y   D  
Sbjct:   831 A-YDGGRTPAYGEGGRTPAYGSKTPAYGDLDE---HSSSRTPAYGNDSSRTPAYGSADGA 886

Query:   305 RGPVY---EAQRAPSYIPQ---RGPGYDLQ-RGQGYDMR--RAPSYDPSRGTGFDGAPRG 355
             R P Y   E  R P+Y      R P YD   R  GY+    R P+YD S  T     P  
Sbjct:   887 RTPAYGSTEGGRTPAYGSMDNSRTPAYDDSGRTPGYESMPSRTPNYDSSSKT--PAYPE- 943

Query:   356 AAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGN-PAR 392
              + H    P  NN  Y     PA          N PAR
Sbjct:   944 -SEHSARTPAYNN-DYDIPLSPAYEPDAPEAYDNAPAR 979


>UNIPROTKB|Q21338 [details] [associations]
            symbol:spt-5 "Transcription elongation factor SPT5"
            species:6239 "Caenorhabditis elegans" [GO:0032044 "DSIF complex"
            evidence=ISS] InterPro:IPR006645 InterPro:IPR017071
            InterPro:IPR024945 PIRSF:PIRSF036945 SMART:SM00738 SMART:SM01104
            Pfam:PF00467 GO:GO:0009792 GO:GO:0040007 GO:GO:0002119
            GO:GO:0006351 GO:GO:0040035 GO:GO:0032968 EMBL:Z68752
            InterPro:IPR008991 SUPFAM:SSF50104 InterPro:IPR005824 SMART:SM00739
            eggNOG:COG0250 GO:GO:0032044 HOGENOM:HOG000038564 KO:K15172
            InterPro:IPR022581 InterPro:IPR005100 PANTHER:PTHR11125:SF7
            Pfam:PF03439 Pfam:PF11942 EMBL:Z68316 PIR:T23467 RefSeq:NP_502283.1
            ProteinModelPortal:Q21338 SMR:Q21338 STRING:Q21338 PaxDb:Q21338
            EnsemblMetazoa:K08E4.1 GeneID:178143 KEGG:cel:CELE_K08E4.1
            UCSC:K08E4.1 CTD:178143 WormBase:K08E4.1
            GeneTree:ENSGT00440000037640 InParanoid:Q21338 OMA:PAYGNES
            NextBio:899898 Uniprot:Q21338
        Length = 1208

 Score = 158 (60.7 bits), Expect = 7.1e-08, P = 7.1e-08
 Identities = 60/182 (32%), Positives = 76/182 (41%)

Query:   179 TSTSAYAA-TQSGTPMRAAYDIPRGPGY-EASKGPGYDASKAPSYDPTKGPSYDPAKGPG 236
             + T  Y A T     M  AYD  R P Y E  + P Y  SK P+Y      S       G
Sbjct:   813 SKTPMYGAQTPMYGSMTPAYDGGRTPAYGEGGRTPAY-GSKTPAYGDLDEHSSSRTPAYG 871

Query:   237 YDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ-RGPGYET 295
              D ++ P Y    GS  D  R P Y    G    P  G   D  R P YD   R PGYE+
Sbjct:   872 NDSSRTPAY----GSA-DGARTPAYGSTEG-GRTPAYG-SMDNSRTPAYDDSGRTPGYES 924

Query:   296 Q--RVPGYDVQ-RGPVY-EAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDG 351
                R P YD   + P Y E++ +      R P Y+      YD+  +P+Y+P     +D 
Sbjct:   925 MPSRTPNYDSSSKTPAYPESEHSA-----RTPAYN----NDYDIPLSPAYEPDAPEAYDN 975

Query:   352 AP 353
             AP
Sbjct:   976 AP 977

 Score = 143 (55.4 bits), Expect = 3.1e-06, P = 3.1e-06
 Identities = 73/253 (28%), Positives = 95/253 (37%)

Query:   126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYA 185
             A GS   A G+ +  +S R     AY +       +G    A T    G+     T AY 
Sbjct:   848 AYGSKTPAYGDLDEHSSSRTP---AYGNDSSRTPAYGSADGARTPAY-GSTEGGRTPAYG 903

Query:   186 ATQ-SGTPMRAAYDIP-RGPGYEA--SKGPGYDAS-KAPSYDPTKGPSYDPAKGPGYDPT 240
             +   S TP   AYD   R PGYE+  S+ P YD+S K P+Y  ++  +  PA    YD  
Sbjct:   904 SMDNSRTP---AYDDSGRTPGYESMPSRTPNYDSSSKTPAYPESEHSARTPAYNNDYDIP 960

Query:   241 KGPGYDAQKGSNYD---------AQRGPNYDIHRG--PSYDPQRGLGYDMQRG----PNY 285
               P Y+      YD           R P YD +    P+Y+P      +   G    P Y
Sbjct:   961 LSPAYEPDAPEAYDNAPARTPAFVSRTPGYDTYENSSPTYEPDAATKVEEDIGDTSSPTY 1020

Query:   286 DMQRGPGYETQRVPGYDVQRG-P-VYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS--- 340
             D    P       PG  +    P  Y     P +     PG     G  YD   APS   
Sbjct:  1021 DSP--PHSYVVPTPGAMLNPATPGAYHVD-TPGFAAPMTPG----SGGAYDQYVAPSPFA 1073

Query:   341 -YDPSRGTGFDGA 352
              YD +     DGA
Sbjct:  1074 GYDSNNYNNADGA 1086

 Score = 133 (51.9 bits), Expect = 3.9e-05, P = 3.9e-05
 Identities = 67/218 (30%), Positives = 84/218 (38%)

Query:   194 RAAYDIPRGPGYEASKGPG---YDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKG 250
             RA   +    G  A  G G   Y +SK P  D  K P Y  +K P Y   + P Y +   
Sbjct:   773 RARVMVVGDTGITAGSGGGSSFYSSSKTPMRDSGKTPMYG-SKTPMYG-AQTPMYGSMTP 830

Query:   251 SNYDAQRGPNY-DIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY--ETQRVPGY---DVQ 304
             + YD  R P Y +  R P+Y  +     D+     +   R P Y  ++ R P Y   D  
Sbjct:   831 A-YDGGRTPAYGEGGRTPAYGSKTPAYGDLDE---HSSSRTPAYGNDSSRTPAYGSADGA 886

Query:   305 RGPVY---EAQRAPSYIPQ---RGPGYDLQ-RGQGYDMR--RAPSYDPSRGTGFDGAPRG 355
             R P Y   E  R P+Y      R P YD   R  GY+    R P+YD S  T     P  
Sbjct:   887 RTPAYGSTEGGRTPAYGSMDNSRTPAYDDSGRTPGYESMPSRTPNYDSSSKT--PAYPE- 943

Query:   356 AAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGN-PAR 392
              + H    P  NN  Y     PA          N PAR
Sbjct:   944 -SEHSARTPAYNN-DYDIPLSPAYEPDAPEAYDNAPAR 979


>SGD|S000002299 [details] [associations]
            symbol:RPO21 "RNA polymerase II largest subunit B220"
            species:4932 "Saccharomyces cerevisiae" [GO:0016740 "transferase
            activity" evidence=IEA] [GO:0016779 "nucleotidyltransferase
            activity" evidence=IEA] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0006366 "transcription from RNA polymerase II
            promoter" evidence=IEA;IMP] [GO:0003899 "DNA-directed RNA
            polymerase activity" evidence=IEA;IDA] [GO:0005739 "mitochondrion"
            evidence=IDA] [GO:0005634 "nucleus" evidence=IEA;IDA] [GO:0005665
            "DNA-directed RNA polymerase II, core complex" evidence=IEA;IDA]
            [GO:0003677 "DNA binding" evidence=IEA] [GO:0003968 "RNA-directed
            RNA polymerase activity" evidence=IDA] [GO:0006351 "transcription,
            DNA-dependent" evidence=IEA] InterPro:IPR000684 InterPro:IPR000722
            InterPro:IPR006592 InterPro:IPR007066 InterPro:IPR007073
            InterPro:IPR007075 InterPro:IPR007080 InterPro:IPR007081
            InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983 Pfam:PF04990
            Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000 Pfam:PF05001
            PROSITE:PS00115 SMART:SM00663 SGD:S000002299 GO:GO:0005739
            GO:GO:0046872 GO:GO:0003677 EMBL:BK006938 GO:GO:0006366
            Gene3D:2.40.40.20 InterPro:IPR009010 EMBL:X96876 EMBL:U27182
            GO:GO:0003899 PDB:4GWQ PDBsum:4GWQ PDB:2LO6 PDBsum:2LO6
            eggNOG:COG0086 GO:GO:0005665 PDB:1I3Q PDB:1I50 PDB:1I6H PDB:1K83
            PDB:1NIK PDB:1NT9 PDB:1PQV PDB:1R5U PDB:1R9S PDB:1R9T PDB:1SFO
            PDB:1TWA PDB:1TWC PDB:1TWF PDB:1TWG PDB:1TWH PDB:1WCM PDB:1Y1V
            PDB:1Y1W PDB:1Y1Y PDB:1Y77 PDB:2B63 PDB:2B8K PDB:2E2H PDB:2E2I
            PDB:2E2J PDB:2JA5 PDB:2JA6 PDB:2JA7 PDB:2JA8 PDB:2NVQ PDB:2NVT
            PDB:2NVX PDB:2NVY PDB:2NVZ PDB:2R7Z PDB:2R92 PDB:2R93 PDB:2VUM
            PDB:2YU9 PDB:3CQZ PDB:3FKI PDB:3GTG PDB:3GTJ PDB:3GTK PDB:3GTL
            PDB:3GTM PDB:3GTO PDB:3GTP PDB:3GTQ PDB:3H3V PDB:3HOU PDB:3HOV
            PDB:3HOW PDB:3HOX PDB:3HOY PDB:3HOZ PDB:3I4M PDB:3I4N PDB:3K1F
            PDB:3K7A PDB:3M3Y PDB:3M4O PDB:3PO2 PDB:3PO3 PDB:3QT1 PDB:3RZD
            PDB:3RZO PDB:3S14 PDB:3S15 PDB:3S16 PDB:3S17 PDB:3S1M PDB:3S1N
            PDB:3S1Q PDB:3S1R PDB:3S2D PDB:3S2H PDB:4A3B PDB:4A3C PDB:4A3D
            PDB:4A3E PDB:4A3F PDB:4A3G PDB:4A3I PDB:4A3J PDB:4A3K PDB:4A3L
            PDB:4A3M PDB:4A93 PDB:4BBR PDB:4BBS PDBsum:1I3Q PDBsum:1I50
            PDBsum:1I6H PDBsum:1K83 PDBsum:1NIK PDBsum:1NT9 PDBsum:1PQV
            PDBsum:1R5U PDBsum:1R9S PDBsum:1R9T PDBsum:1SFO PDBsum:1TWA
            PDBsum:1TWC PDBsum:1TWF PDBsum:1TWG PDBsum:1TWH PDBsum:1WCM
            PDBsum:1Y1V PDBsum:1Y1W PDBsum:1Y1Y PDBsum:1Y77 PDBsum:2B63
            PDBsum:2B8K PDBsum:2E2H PDBsum:2E2I PDBsum:2E2J PDBsum:2JA5
            PDBsum:2JA6 PDBsum:2JA7 PDBsum:2JA8 PDBsum:2NVQ PDBsum:2NVT
            PDBsum:2NVX PDBsum:2NVY PDBsum:2NVZ PDBsum:2R7Z PDBsum:2R92
            PDBsum:2R93 PDBsum:2VUM PDBsum:2YU9 PDBsum:3CQZ PDBsum:3FKI
            PDBsum:3GTG PDBsum:3GTJ PDBsum:3GTK PDBsum:3GTL PDBsum:3GTM
            PDBsum:3GTO PDBsum:3GTP PDBsum:3GTQ PDBsum:3H3V PDBsum:3HOU
            PDBsum:3HOV PDBsum:3HOW PDBsum:3HOX PDBsum:3HOY PDBsum:3HOZ
            PDBsum:3I4M PDBsum:3I4N PDBsum:3K1F PDBsum:3K7A PDBsum:3M3Y
            PDBsum:3M4O PDBsum:3PO2 PDBsum:3PO3 PDBsum:3QT1 PDBsum:3RZD
            PDBsum:3RZO PDBsum:3S14 PDBsum:3S15 PDBsum:3S16 PDBsum:3S17
            PDBsum:3S1M PDBsum:3S1N PDBsum:3S1Q PDBsum:3S1R PDBsum:3S2D
            PDBsum:3S2H PDBsum:4A3B PDBsum:4A3C PDBsum:4A3D PDBsum:4A3E
            PDBsum:4A3F PDBsum:4A3G PDBsum:4A3I PDBsum:4A3J PDBsum:4A3K
            PDBsum:4A3L PDBsum:4A3M PDBsum:4A93 PDBsum:4BBR PDBsum:4BBS
            HOGENOM:HOG000222975 OMA:KVLPWST KO:K03006 OrthoDB:EOG4J14H5
            EMBL:X03128 EMBL:Z74188 PIR:S67686 RefSeq:NP_010141.1 PDB:2L0I
            PDBsum:2L0I ProteinModelPortal:P04050 SMR:P04050 DIP:DIP-611N
            IntAct:P04050 MINT:MINT-432838 STRING:P04050 PaxDb:P04050
            PeptideAtlas:P04050 EnsemblFungi:YDL140C GeneID:851415
            KEGG:sce:YDL140C CYGD:YDL140c GeneTree:ENSGT00700000105212
            EvolutionaryTrace:P04050 NextBio:968606 ArrayExpress:P04050
            Genevestigator:P04050 GermOnline:YDL140C Uniprot:P04050
        Length = 1733

 Score = 159 (61.0 bits), Expect = 8.4e-08, P = 8.4e-08
 Identities = 67/218 (30%), Positives = 90/218 (41%)

Query:   112 LRAELMNAPNVDRRA-DGSYGGAT--GNSENETSGRPVGQNAYEDGYGVPQGHGPPPSAT 168
             ++ ELM +P VD  + D   GG T  G ++   +  P G  AY          G  P++ 
Sbjct:  1486 VKDELMFSPLVDSGSNDAMAGGFTAYGGADYGEATSPFG--AY----------GEAPTSP 1533

Query:   169 TAGVVGAGPNTSTSAYAATQSG-TPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGP 227
               GV   G + ++  Y+ T    +P   +Y  P  P Y  +  P Y  + +PSY PT  P
Sbjct:  1534 GFGVSSPGFSPTSPTYSPTSPAYSPTSPSYS-PTSPSYSPTS-PSYSPT-SPSYSPTS-P 1589

Query:   228 SYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDM 287
             SY P   P Y PT  P Y     S Y +   P+Y     PSY P     Y     P+Y  
Sbjct:  1590 SYSPTS-PSYSPTS-PSYSPTSPS-Y-SPTSPSYS-PTSPSYSPTSP-SYS-PTSPSYS- 1641

Query:   288 QRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGY 325
                P Y     P Y     P Y +  +PSY P   P Y
Sbjct:  1642 PTSPSYSPTS-PSYS-PTSPAY-SPTSPSYSPT-SPSY 1675


>MGI|MGI:1330280 [details] [associations]
            symbol:Krtap6-2 "keratin associated protein 6-2"
            species:10090 "Mus musculus" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0005882 "intermediate filament" evidence=IEA] [GO:0008150
            "biological_process" evidence=ND] MGI:MGI:1330280 GO:GO:0005882
            CTD:337967 EMBL:D89902 IPI:IPI00116464 RefSeq:NP_034803.2
            UniGene:Mm.3524 PRIDE:O08884 DNASU:16701 GeneID:16701
            KEGG:mmu:16701 UCSC:uc007zvp.1 NextBio:290464 Genevestigator:O08884
            Uniprot:O08884
        Length = 159

 Score = 128 (50.1 bits), Expect = 1.3e-07, P = 1.3e-07
 Identities = 38/124 (30%), Positives = 40/124 (32%)

Query:   202 GPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNY 261
             G GY +  G GY       Y    G  Y    G GY    G GY    GS Y    G  Y
Sbjct:    13 GCGYGSGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGCGYGSGY 72

Query:   262 DIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQR 321
                 G  Y    G GY    G  Y    G GY      GY    G  Y +     Y    
Sbjct:    73 GCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGSGYGSGC 132

Query:   322 GPGY 325
             G GY
Sbjct:   133 GCGY 136

 Score = 126 (49.4 bits), Expect = 2.2e-07, P = 2.2e-07
 Identities = 39/130 (30%), Positives = 40/130 (30%)

Query:   204 GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDI 263
             G     G GY +     Y    G  Y    G GY    G GY    GS Y    G  Y  
Sbjct:     7 GNSCGYGCGYGSGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGC 66

Query:   264 HRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGP 323
               G  Y    G GY    G  Y    G GY      GY    G  Y       Y    G 
Sbjct:    67 GYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGS 126

Query:   324 GYDLQRGQGY 333
             GY    G GY
Sbjct:   127 GYGSGCGCGY 136

 Score = 125 (49.1 bits), Expect = 2.8e-07, P = 2.8e-07
 Identities = 40/136 (29%), Positives = 42/136 (30%)

Query:   226 GPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNY 285
             G  Y    G GY    G GY    GS Y    G  Y    G  Y    G GY    G  Y
Sbjct:    13 GCGYGSGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGCGYGSGY 72

Query:   286 DMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSR 345
                 G GY      GY    G  Y       Y    G GY    G GY       Y    
Sbjct:    73 GCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGSGYGSGC 132

Query:   346 GTGFDGAPR-GAAPHG 360
             G G+    R G   +G
Sbjct:   133 GCGYGSYYRSGCCGYG 148

 Score = 124 (48.7 bits), Expect = 3.6e-07, P = 3.6e-07
 Identities = 34/112 (30%), Positives = 37/112 (33%)

Query:   190 GTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQK 249
             G+   + Y    G GY    G GY       Y    G  Y    G GY    G GY    
Sbjct:    17 GSGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGY 76

Query:   250 GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGY 301
             GS Y    G  Y    G  Y    G GY    G  Y    G GY +    GY
Sbjct:    77 GSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGSGY 128

 Score = 118 (46.6 bits), Expect = 1.6e-06, P = 1.6e-06
 Identities = 33/107 (30%), Positives = 35/107 (32%)

Query:   195 AAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYD 254
             + Y    G GY    G GY       Y    G  Y    G GY    G GY    GS Y 
Sbjct:    30 SGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYG 89

Query:   255 AQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGY 301
                G  Y    G  Y    G GY    G  Y    G GY +    GY
Sbjct:    90 CGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGSGYGSGCGCGY 136

 Score = 118 (46.6 bits), Expect = 1.6e-06, P = 1.6e-06
 Identities = 34/120 (28%), Positives = 39/120 (32%)

Query:   174 GAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAK 233
             G+G  +     + +  G    + Y    G GY    G GY       Y    G  Y    
Sbjct:    17 GSGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGY 76

Query:   234 GPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY 293
             G GY    G GY    GS Y    G  Y    G  Y    G GY    G  Y    G GY
Sbjct:    77 GSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGSGYGSGCGCGY 136

 Score = 111 (44.1 bits), Expect = 7.4e-05, P = 7.4e-05
 Identities = 35/127 (27%), Positives = 40/127 (31%)

Query:   151 YEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKG 210
             Y  GYG   G+G      +    G G  +       +  G    + Y    G GY    G
Sbjct:    12 YGCGYG--SGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGCGYG 69

Query:   211 PGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYD 270
              GY       Y    G  Y    G GY    G GY    GS Y    G  Y    G  Y 
Sbjct:    70 SGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGSGYG 129

Query:   271 PQRGLGY 277
                G GY
Sbjct:   130 SGCGCGY 136


>WB|WBGene00002280 [details] [associations]
            symbol:let-2 species:6239 "Caenorhabditis elegans"
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA] [GO:0005581 "collagen" evidence=IEA] [GO:0040007
            "growth" evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0009792 "embryo development ending in birth or
            egg hatching" evidence=IMP] [GO:0000003 "reproduction"
            evidence=IMP] [GO:0040011 "locomotion" evidence=IMP] [GO:0040039
            "inductive cell migration" evidence=IMP] [GO:0040018 "positive
            regulation of multicellular organism growth" evidence=IMP]
            [GO:0005604 "basement membrane" evidence=IDA] [GO:0005198
            "structural molecule activity" evidence=IDA] InterPro:IPR001442
            Pfam:PF01413 PROSITE:PS51403 SMART:SM00111 GO:GO:0009792
            GO:GO:0040007 GO:GO:0002119 InterPro:IPR016187 SUPFAM:SSF56436
            GO:GO:0040018 GO:GO:0000003 GO:GO:0040039 InterPro:IPR008160
            Pfam:PF01391 eggNOG:NOG12793 GO:GO:0030020 HOGENOM:HOG000085652
            GO:GO:0005587 Gene3D:2.170.240.10 KO:K06237 EMBL:Z22964 EMBL:U22327
            EMBL:FO081065 EMBL:J05066 PIR:A34476 PIR:T29350 PIR:T29351
            RefSeq:NP_510663.1 RefSeq:NP_510664.1 ProteinModelPortal:P17140
            SMR:P17140 STRING:P17140 PaxDb:P17140 PRIDE:P17140 GeneID:181708
            KEGG:cel:CELE_F01G12.5 UCSC:F01G12.5b.1 CTD:181708
            WormBase:F01G12.5a WormBase:F01G12.5b InParanoid:P17140
            NextBio:915032 GO:GO:0016043 Uniprot:P17140
        Length = 1758

 Score = 157 (60.3 bits), Expect = 1.4e-07, P = 1.4e-07
 Identities = 82/261 (31%), Positives = 95/261 (36%)

Query:   124 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQG-HGPPPSATTAGVVGAGPNTSTS 182
             ++ +  Y G  G   N     P G   + DG   P G  G P +    G  G  P     
Sbjct:   335 QKGEAGYPGRDGPKGNSGPPGPPGGGTFNDGAPGPPGLPGRPGNPGPPGTDGY-PGAPGP 393

Query:   183 AYAATQSGTPMRAAYDIPRG-PGYEASKGPGYDASKAPSYDPTKG-PSYDPAKG-PGYDP 239
             A     +G P    Y    G PG +  KG G     AP      G P     KG PGY  
Sbjct:   394 AGPIGNTGGPGLPGYPGNEGLPGPKGDKGDG-GIPGAPGVSGPSGIPGLPGPKGEPGYRG 452

Query:   240 TKG------PGYDAQKG-SNYDAQRGPN-YDIHRGPSYDPQRGL-GYDMQRG---PN-YD 286
             T G      PG D + G      ++G N     RGP  D   GL G   QRG   PN YD
Sbjct:   453 TPGQSIPGLPGKDGKPGLDGAPGRKGENGLPGVRGPPGDSLNGLPGAPGQRGAPGPNGYD 512

Query:   287 MQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYD-PS 344
              + G        PG    RG    A  AP    ++G PGY  Q G   D R  P    P 
Sbjct:   513 GRDGVN-GLPGAPGTKGDRGGTCSAC-APGTKGEKGLPGYSGQPGPQGD-RGLPGMPGPV 569

Query:   345 RGTGFDGAPRGAAPHGQVPPP 365
                G DG P  A   G   PP
Sbjct:   570 GDAGDDGLPGPAGRPGSPGPP 590


>UNIPROTKB|P17140 [details] [associations]
            symbol:let-2 "Collagen alpha-2(IV) chain" species:6239
            "Caenorhabditis elegans" [GO:0016043 "cellular component
            organization" evidence=NAS] [GO:0030020 "extracellular matrix
            structural constituent conferring tensile strength" evidence=IMP]
            [GO:0005587 "collagen type IV" evidence=IMP] InterPro:IPR001442
            Pfam:PF01413 PROSITE:PS51403 SMART:SM00111 GO:GO:0009792
            GO:GO:0040007 GO:GO:0002119 InterPro:IPR016187 SUPFAM:SSF56436
            GO:GO:0040018 GO:GO:0000003 GO:GO:0040039 InterPro:IPR008160
            Pfam:PF01391 eggNOG:NOG12793 GO:GO:0030020 HOGENOM:HOG000085652
            GO:GO:0005587 Gene3D:2.170.240.10 KO:K06237 EMBL:Z22964 EMBL:U22327
            EMBL:FO081065 EMBL:J05066 PIR:A34476 PIR:T29350 PIR:T29351
            RefSeq:NP_510663.1 RefSeq:NP_510664.1 ProteinModelPortal:P17140
            SMR:P17140 STRING:P17140 PaxDb:P17140 PRIDE:P17140 GeneID:181708
            KEGG:cel:CELE_F01G12.5 UCSC:F01G12.5b.1 CTD:181708
            WormBase:F01G12.5a WormBase:F01G12.5b InParanoid:P17140
            NextBio:915032 GO:GO:0016043 Uniprot:P17140
        Length = 1758

 Score = 157 (60.3 bits), Expect = 1.4e-07, P = 1.4e-07
 Identities = 82/261 (31%), Positives = 95/261 (36%)

Query:   124 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQG-HGPPPSATTAGVVGAGPNTSTS 182
             ++ +  Y G  G   N     P G   + DG   P G  G P +    G  G  P     
Sbjct:   335 QKGEAGYPGRDGPKGNSGPPGPPGGGTFNDGAPGPPGLPGRPGNPGPPGTDGY-PGAPGP 393

Query:   183 AYAATQSGTPMRAAYDIPRG-PGYEASKGPGYDASKAPSYDPTKG-PSYDPAKG-PGYDP 239
             A     +G P    Y    G PG +  KG G     AP      G P     KG PGY  
Sbjct:   394 AGPIGNTGGPGLPGYPGNEGLPGPKGDKGDG-GIPGAPGVSGPSGIPGLPGPKGEPGYRG 452

Query:   240 TKG------PGYDAQKG-SNYDAQRGPN-YDIHRGPSYDPQRGL-GYDMQRG---PN-YD 286
             T G      PG D + G      ++G N     RGP  D   GL G   QRG   PN YD
Sbjct:   453 TPGQSIPGLPGKDGKPGLDGAPGRKGENGLPGVRGPPGDSLNGLPGAPGQRGAPGPNGYD 512

Query:   287 MQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYD-PS 344
              + G        PG    RG    A  AP    ++G PGY  Q G   D R  P    P 
Sbjct:   513 GRDGVN-GLPGAPGTKGDRGGTCSAC-APGTKGEKGLPGYSGQPGPQGD-RGLPGMPGPV 569

Query:   345 RGTGFDGAPRGAAPHGQVPPP 365
                G DG P  A   G   PP
Sbjct:   570 GDAGDDGLPGPAGRPGSPGPP 590


>ZFIN|ZDB-GENE-030131-5725 [details] [associations]
            symbol:arid1ab "AT rich interactive domain 1Ab
            (SWI-like)" species:7955 "Danio rerio" [GO:0003677 "DNA binding"
            evidence=IEA] [GO:0005622 "intracellular" evidence=IEA]
            InterPro:IPR001606 Pfam:PF01388 PROSITE:PS51011 SMART:SM00501
            ZFIN:ZDB-GENE-030131-5725 GO:GO:0003677 GO:GO:0005622
            Gene3D:1.10.150.60 InterPro:IPR021906 Pfam:PF12031 SUPFAM:SSF46774
            GeneTree:ENSGT00550000074575 EMBL:CABZ01050711 EMBL:CT027837
            IPI:IPI00485842 Ensembl:ENSDART00000084272 Bgee:F1RE50
            Uniprot:F1RE50
        Length = 2135

 Score = 157 (60.3 bits), Expect = 1.5e-07, Sum P(2) = 1.5e-07
 Identities = 78/257 (30%), Positives = 104/257 (40%)

Query:   128 GSYGGATGNSENETSGRPVGQNAYEDGYGVP-QGHGPP-PSATTAGVVGAGPNTSTSAYA 185
             G + GA GN  ++  G P      + G   P QG+GPP P     G+ G    TS +  +
Sbjct:   312 GQHYGA-GNPYSQQQGPPPSS---QQGPPYPGQGYGPPGPQRYPMGMQG---RTSGNL-S 363

Query:   186 ATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGP---SY--DPAKGPGYDP- 239
               Q G  M   Y    GPG       GY   + PS  P  GP   SY   P+ GPG  P 
Sbjct:   364 GIQYGQQM--GYG-QHGPGGYGQNQAGYYGQQGPS--PHGGPQQSSYPQQPSTGPGSQPP 418

Query:   240 -TKGPGYD--AQKGSNYDAQRGPNYDIHRGPSYD--PQRGLG---YDMQRGPNYDMQRGP 291
              ++ P      Q G++Y   +GP+      P Y   PQ   G   +   +GP        
Sbjct:   419 YSQQPSGTPHGQSGTSYGQPQGPHVPNQGQPPYSQTPQSQSGQSPFPQSQGPTQSQGPSQ 478

Query:   292 GYE-TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSY---DPSRGT 347
             G + +Q  PGY     P    Q A     Q+GP    Q+ QG    + PS     PS+ T
Sbjct:   479 GQQGSQSQPGYT--HPPSGSGQPAQ----QQGPS---QQQQGPPQSQTPSSAPPQPSQQT 529

Query:   348 GFDGAPRGAAPHGQVPP 364
                G P   +P+ Q PP
Sbjct:   530 SGQGQP---SPYSQTPP 543

 Score = 125 (49.1 bits), Expect = 0.00055, P = 0.00055
 Identities = 79/298 (26%), Positives = 109/298 (36%)

Query:   115 ELMNAPNVDRRADGSYGGATGNSENETSGR-PVGQNA-YEDGYGVPQ--GHGPPPSATTA 170
             +L+ +P+  R          G  E    G   +G ++ Y  G+   Q   H PPP +   
Sbjct:   232 QLLTSPSSTRSYQNYPASEYGGQEGAAKGPGDMGSSSQYGGGHPAWQQRSHHPPPMSP-- 289

Query:   171 GVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYD 230
             G  G    T        Q G      Y    G  Y   +GP   + + P Y P +G  Y 
Sbjct:   290 GNTGQANRTQPPG-PMDQVGKIRGQHYGA--GNPYSQQQGPPPSSQQGPPY-PGQG--YG 343

Query:   231 PAKGPGYDPTKGPGYDAQK--GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPN---- 284
             P  GP   P    G  +    G  Y  Q G  Y  H GP    Q   GY  Q+GP+    
Sbjct:   344 PP-GPQRYPMGMQGRTSGNLSGIQYGQQMG--YGQH-GPGGYGQNQAGYYGQQGPSPHGG 399

Query:   285 -----YDMQ--RGPGYE---TQRVPGYDV-QRGPVYEAQRAPSYIPQRG-PGYDLQRGQG 332
                  Y  Q   GPG +   +Q+  G    Q G  Y   + P ++P +G P Y  Q  Q 
Sbjct:   400 PQQSSYPQQPSTGPGSQPPYSQQPSGTPHGQSGTSYGQPQGP-HVPNQGQPPYS-QTPQS 457

Query:   333 YDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNP 390
                 ++P +  S+G      P       Q  P   + P GS  P  + G  Q + G P
Sbjct:   458 QS-GQSP-FPQSQGPTQSQGPSQGQQGSQSQPGYTHPPSGSGQPAQQQGPSQQQQGPP 513

 Score = 50 (22.7 bits), Expect = 1.5e-07, Sum P(2) = 1.5e-07
 Identities = 13/33 (39%), Positives = 17/33 (51%)

Query:   360 GQVPPP--LNNVP---YGSATPPARSGSGQPRG 387
             G+ PPP   NN P        PP+ +GSG  +G
Sbjct:  1061 GEDPPPDFFNNDPKKNQAKVQPPSPAGSGSLQG 1093


>WB|WBGene00000123 [details] [associations]
            symbol:ama-1 species:6239 "Caenorhabditis elegans"
            [GO:0003899 "DNA-directed RNA polymerase activity"
            evidence=IEA;ISS] [GO:0003677 "DNA binding" evidence=IEA]
            [GO:0005665 "DNA-directed RNA polymerase II, core complex"
            evidence=IEA] [GO:0006366 "transcription from RNA polymerase II
            promoter" evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA;IMP] [GO:0009792 "embryo development ending in birth
            or egg hatching" evidence=IMP] [GO:0000003 "reproduction"
            evidence=IMP] [GO:0040010 "positive regulation of growth rate"
            evidence=IMP] [GO:0007052 "mitotic spindle organization"
            evidence=IMP] [GO:0010458 "exit from mitosis" evidence=IMP]
            [GO:0008356 "asymmetric cell division" evidence=IMP] [GO:0032502
            "developmental process" evidence=IMP] [GO:0006479 "protein
            methylation" evidence=IMP] [GO:0007369 "gastrulation" evidence=IMP]
            [GO:0005634 "nucleus" evidence=IDA] [GO:0001055 "RNA polymerase II
            activity" evidence=IMP] [GO:0042789 "mRNA transcription from RNA
            polymerase II promoter" evidence=IMP] InterPro:IPR000684
            InterPro:IPR000722 InterPro:IPR006592 InterPro:IPR007066
            InterPro:IPR007073 InterPro:IPR007075 InterPro:IPR007080
            InterPro:IPR007081 InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983
            Pfam:PF04990 Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000
            Pfam:PF05001 PROSITE:PS00115 SMART:SM00663 GO:GO:0005634
            GO:GO:0009792 GO:GO:0040010 GO:GO:0007052 GO:GO:0010458
            GO:GO:0046872 GO:GO:0003677 GO:GO:0000003 Gene3D:2.40.40.20
            InterPro:IPR009010 GO:GO:0006479 GO:GO:0008356 GO:GO:0007369
            GO:GO:0042789 EMBL:FO081153 eggNOG:COG0086 GO:GO:0005665
            EMBL:M29235 PIR:A34092 PIR:T29959 RefSeq:NP_500523.4 IntAct:P16356
            STRING:P16356 PaxDb:P16356 EnsemblMetazoa:F36A4.7.1
            EnsemblMetazoa:F36A4.7.2 GeneID:177190 KEGG:cel:CELE_F36A4.7
            UCSC:F36A4.7 CTD:247749 WormBase:F36A4.7
            GeneTree:ENSGT00700000104490 HOGENOM:HOG000222975 InParanoid:P16356
            OMA:KVLPWST NextBio:895720 GO:GO:0001055 Uniprot:P16356
        Length = 1856

 Score = 157 (60.3 bits), Expect = 1.5e-07, P = 1.5e-07
 Identities = 68/254 (26%), Positives = 93/254 (36%)

Query:   128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAAT 187
             G   GA  +    T G   G + + +G   P   G P  A +      G   S   Y+ +
Sbjct:  1527 GMSPGAGFSPAGNTDG---GASPFNEGGWSPASPGDPLGALSPRTPSYG-GMSPGVYSPS 1582

Query:   188 QSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDA 247
                  M + +  P  P Y  +      +  +PSY PT  PSY P   P Y PT  P Y  
Sbjct:  1583 SPQFSMTSPHYSPTSPSYSPTSPAAGQSPVSPSYSPTS-PSYSPTS-PSYSPTS-PSYSP 1639

Query:   248 QKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGP 307
                S Y +   P+Y     PSY P     Y     P+Y     P Y     P Y     P
Sbjct:  1640 TSPS-Y-SPTSPSYS-PTSPSYSPSSP-SYSPS-SPSYSPS-SPRYSPTS-PTYS-PTSP 1691

Query:   308 VYEAQRAPSYIPQRGPGYD-----LQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQV 362
              Y +  +P+Y P   P Y       + G GY    +P Y PS  T    +P  +    Q 
Sbjct:  1692 TY-SPTSPTYSPT-SPTYSPTSPSYESGGGYSPS-SPKYSPSSPTYSPTSPSYSPTSPQY 1748

Query:   363 PPPLNNVPYGSATP 376
              P   +  Y  ++P
Sbjct:  1749 SP--TSPQYSPSSP 1760

 Score = 154 (59.3 bits), Expect = 3.2e-07, P = 3.2e-07
 Identities = 65/219 (29%), Positives = 87/219 (39%)

Query:   164 PPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDP 223
             P  + T+   G  P  S S    + S +P   +Y  P  P Y  +  P Y  + +PSY P
Sbjct:  1598 PSYSPTSPAAGQSP-VSPSYSPTSPSYSPTSPSYS-PTSPSYSPTS-PSYSPT-SPSYSP 1653

Query:   224 TKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGP 283
             T  PSY P+  P Y P+  P Y +     Y +   P Y     P+Y P     Y     P
Sbjct:  1654 TS-PSYSPSS-PSYSPSS-PSY-SPSSPRY-SPTSPTYS-PTSPTYSPTSPT-YS-PTSP 1705

Query:   284 NYD-----MQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRA 338
              Y       + G GY     P Y     P Y +  +PSY P   P Y     Q Y    +
Sbjct:  1706 TYSPTSPSYESGGGYSPSS-PKYSPS-SPTY-SPTSPSYSPT-SPQYSPTSPQ-YSPS-S 1759

Query:   339 PSYDPSRGTGFDGAPRG-AAPHGQVPPPLNNVPYGSATP 376
             P+Y PS  T    +PRG ++P      P  +    S TP
Sbjct:  1760 PTYTPSSPTYNPTSPRGFSSPQYSPTSPTYSPTSPSYTP 1798


>UNIPROTKB|P16356 [details] [associations]
            symbol:ama-1 "DNA-directed RNA polymerase II subunit RPB1"
            species:6239 "Caenorhabditis elegans" [GO:0005515 "protein binding"
            evidence=IPI] InterPro:IPR000684 InterPro:IPR000722
            InterPro:IPR006592 InterPro:IPR007066 InterPro:IPR007073
            InterPro:IPR007075 InterPro:IPR007080 InterPro:IPR007081
            InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983 Pfam:PF04990
            Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000 Pfam:PF05001
            PROSITE:PS00115 SMART:SM00663 GO:GO:0005634 GO:GO:0009792
            GO:GO:0040010 GO:GO:0007052 GO:GO:0010458 GO:GO:0046872
            GO:GO:0003677 GO:GO:0000003 Gene3D:2.40.40.20 InterPro:IPR009010
            GO:GO:0006479 GO:GO:0008356 GO:GO:0007369 GO:GO:0042789
            EMBL:FO081153 eggNOG:COG0086 GO:GO:0005665 EMBL:M29235 PIR:A34092
            PIR:T29959 RefSeq:NP_500523.4 IntAct:P16356 STRING:P16356
            PaxDb:P16356 EnsemblMetazoa:F36A4.7.1 EnsemblMetazoa:F36A4.7.2
            GeneID:177190 KEGG:cel:CELE_F36A4.7 UCSC:F36A4.7 CTD:247749
            WormBase:F36A4.7 GeneTree:ENSGT00700000104490 HOGENOM:HOG000222975
            InParanoid:P16356 OMA:KVLPWST NextBio:895720 GO:GO:0001055
            Uniprot:P16356
        Length = 1856

 Score = 157 (60.3 bits), Expect = 1.5e-07, P = 1.5e-07
 Identities = 68/254 (26%), Positives = 93/254 (36%)

Query:   128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAAT 187
             G   GA  +    T G   G + + +G   P   G P  A +      G   S   Y+ +
Sbjct:  1527 GMSPGAGFSPAGNTDG---GASPFNEGGWSPASPGDPLGALSPRTPSYG-GMSPGVYSPS 1582

Query:   188 QSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDA 247
                  M + +  P  P Y  +      +  +PSY PT  PSY P   P Y PT  P Y  
Sbjct:  1583 SPQFSMTSPHYSPTSPSYSPTSPAAGQSPVSPSYSPTS-PSYSPTS-PSYSPTS-PSYSP 1639

Query:   248 QKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGP 307
                S Y +   P+Y     PSY P     Y     P+Y     P Y     P Y     P
Sbjct:  1640 TSPS-Y-SPTSPSYS-PTSPSYSPSSP-SYSPS-SPSYSPS-SPRYSPTS-PTYS-PTSP 1691

Query:   308 VYEAQRAPSYIPQRGPGYD-----LQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQV 362
              Y +  +P+Y P   P Y       + G GY    +P Y PS  T    +P  +    Q 
Sbjct:  1692 TY-SPTSPTYSPT-SPTYSPTSPSYESGGGYSPS-SPKYSPSSPTYSPTSPSYSPTSPQY 1748

Query:   363 PPPLNNVPYGSATP 376
              P   +  Y  ++P
Sbjct:  1749 SP--TSPQYSPSSP 1760

 Score = 154 (59.3 bits), Expect = 3.2e-07, P = 3.2e-07
 Identities = 65/219 (29%), Positives = 87/219 (39%)

Query:   164 PPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDP 223
             P  + T+   G  P  S S    + S +P   +Y  P  P Y  +  P Y  + +PSY P
Sbjct:  1598 PSYSPTSPAAGQSP-VSPSYSPTSPSYSPTSPSYS-PTSPSYSPTS-PSYSPT-SPSYSP 1653

Query:   224 TKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGP 283
             T  PSY P+  P Y P+  P Y +     Y +   P Y     P+Y P     Y     P
Sbjct:  1654 TS-PSYSPSS-PSYSPSS-PSY-SPSSPRY-SPTSPTYS-PTSPTYSPTSPT-YS-PTSP 1705

Query:   284 NYD-----MQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRA 338
              Y       + G GY     P Y     P Y +  +PSY P   P Y     Q Y    +
Sbjct:  1706 TYSPTSPSYESGGGYSPSS-PKYSPS-SPTY-SPTSPSYSPT-SPQYSPTSPQ-YSPS-S 1759

Query:   339 PSYDPSRGTGFDGAPRG-AAPHGQVPPPLNNVPYGSATP 376
             P+Y PS  T    +PRG ++P      P  +    S TP
Sbjct:  1760 PTYTPSSPTYNPTSPRGFSSPQYSPTSPTYSPTSPSYTP 1798


>UNIPROTKB|J9P0I3 [details] [associations]
            symbol:ZNF768 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0005622 "intracellular" evidence=IEA] [GO:0003676 "nucleic acid
            binding" evidence=IEA] Pfam:PF00096 InterPro:IPR007087
            InterPro:IPR013087 InterPro:IPR015880 PROSITE:PS00028
            PROSITE:PS50157 SMART:SM00355 GO:GO:0008270 GO:GO:0003676
            GO:GO:0005622 Gene3D:3.30.160.60 GeneTree:ENSGT00700000104520
            KO:K09228 CTD:79724 OMA:SRYESQN EMBL:AAEX03004391
            RefSeq:XP_547025.2 Ensembl:ENSCAFT00000045233 GeneID:489906
            KEGG:cfa:489906 Uniprot:J9P0I3
        Length = 554

 Score = 148 (57.2 bits), Expect = 3.1e-07, P = 3.1e-07
 Identities = 48/170 (28%), Positives = 77/170 (45%)

Query:   127 DGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAA 186
             +GS  G    +E E   +  G   YE    +P   G  P +        G  + +  +  
Sbjct:    25 EGSLKGNMSENEEEEMSQQEGTGDYEVEE-IP--FGLDPQSPGFEPQSPGFESQSPRFEP 81

Query:   187 TQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYD 246
                G   R+   +P  P + A + P  D S++P ++P + P Y+P + PGY+P + PGY+
Sbjct:    82 ESPGFESRSPGFVPPSPEF-APRSPDSD-SQSPEFEP-QSPRYEP-QSPGYEP-RSPGYE 136

Query:   247 AQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQ 296
               K   Y+  + P Y+  R P Y+ Q   GY+ Q  P +  Q  P +E Q
Sbjct:   137 P-KSPGYEP-KSPGYE-PRSPGYESQSP-GYEPQN-PEFKTQ-SPEFEAQ 180


>FB|FBgn0035872 [details] [associations]
            symbol:CG7185 species:7227 "Drosophila melanogaster"
            [GO:0003729 "mRNA binding" evidence=ISS] [GO:0006379 "mRNA
            cleavage" evidence=ISS] [GO:0005849 "mRNA cleavage factor complex"
            evidence=ISS] [GO:0000166 "nucleotide binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=IC] [GO:0000381 "regulation of alternative mRNA
            splicing, via spliceosome" evidence=IMP] InterPro:IPR000504
            InterPro:IPR012677 PROSITE:PS50102 SMART:SM00360 EMBL:AE014296
            GO:GO:0000166 GO:GO:0003729 Gene3D:3.30.70.330 GO:GO:0000381
            GO:GO:0006379 GO:GO:0005849 eggNOG:NOG313287 KO:K14398
            GeneTree:ENSGT00690000101901 EMBL:AY058563 RefSeq:NP_648206.1
            UniGene:Dm.887 ProteinModelPortal:Q9VSH4 SMR:Q9VSH4 IntAct:Q9VSH4
            MINT:MINT-1562127 STRING:Q9VSH4 PaxDb:Q9VSH4
            EnsemblMetazoa:FBtr0076710 GeneID:38937 KEGG:dme:Dmel_CG7185
            UCSC:CG7185-RA FlyBase:FBgn0035872 InParanoid:Q9VSH4 OMA:PYERGDY
            OrthoDB:EOG4S1RQ4 PhylomeDB:Q9VSH4 ChiTaRS:CG7185 GenomeRNAi:38937
            NextBio:811101 Bgee:Q9VSH4 Uniprot:Q9VSH4
        Length = 652

 Score = 141 (54.7 bits), Expect = 3.2e-07, Sum P(2) = 3.2e-07
 Identities = 63/199 (31%), Positives = 79/199 (39%)

Query:   200 PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPG-YDAQKGSNYDAQRG 258
             PRGP    S G G   +  P      GP   P +G   +    PG Y  Q  S      G
Sbjct:   197 PRGPA-PPSMGGGPMPTGHPGGPQGGGPPGHPPRG--MNSIMQPGQYRPQHMSQVPQVGG 253

Query:   259 PNYDIHR-GPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSY 317
             PN    R  P   PQ GL  + Q  P Y   +G  +  QR PG   + GP     + P +
Sbjct:   254 PNSGPPRMQPPMHPQGGLMGNQQPPPRYPSAQGQ-WPGQR-PG-GPRPGPPNGPPQRPMF 310

Query:   318 IPQRGP-GYDLQRGQGYDMRRAPSYD--PSRGT--GFDGAPRGAAPHGQVPPPLNNVPYG 372
               Q GP G  ++   G D RR P +   P +G   G   AP    PHG   P +N   + 
Sbjct:   311 --QGGPMGMPVRGPAGPDWRRPPMHGGFPPQGPPRGLPPAPGPGGPHGAPAPHVNPAFFN 368

Query:   373 SATPPARS-GSGQPRGGNP 390
                 PA+  G G P  G P
Sbjct:   369 QPGGPAQHPGMGGPPHGAP 387

 Score = 112 (44.5 bits), Expect = 0.00049, Sum P(2) = 0.00049
 Identities = 53/171 (30%), Positives = 61/171 (35%)

Query:   223 PTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG 282
             P +GP+  P+ G G  PT  PG     G      RG N  +  G  Y PQ         G
Sbjct:   196 PPRGPA-PPSMGGGPMPTGHPGGPQGGGPPGHPPRGMNSIMQPG-QYRPQHMSQVPQVGG 253

Query:   283 PNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG--PGYDLQRGQGYDMRRAPS 340
             PN     GP    +  P    Q G +   Q  P Y   +G  PG   QR  G   R  P 
Sbjct:   254 PN----SGP---PRMQPPMHPQGGLMGNQQPPPRYPSAQGQWPG---QRPGG--PRPGPP 301

Query:   341 YDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPA 391
               P +   F G P G    G   P     P     PP     G PRG  PA
Sbjct:   302 NGPPQRPMFQGGPMGMPVRGPAGPDWRRPPMHGGFPP----QGPPRGLPPA 348

 Score = 52 (23.4 bits), Expect = 3.2e-07, Sum P(2) = 3.2e-07
 Identities = 24/76 (31%), Positives = 30/76 (39%)

Query:   135 GNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMR 194
             G +++E  G   G + Y+D  G     GP  SA + G  G G   S    A   SG P  
Sbjct:    19 GQAQDEFGGD--GVDLYDD-IG-----GPTESAASGG--GGGGTPSADGAAGPGSGEPGE 68

Query:   195 AAYDIPRGPGYEASKG 210
                  P G  Y  S G
Sbjct:    69 RNSGGPNGV-YHQSSG 83

 Score = 41 (19.5 bits), Expect = 4.3e-06, Sum P(2) = 4.3e-06
 Identities = 9/22 (40%), Positives = 11/22 (50%)

Query:   126 ADGSYGGATGNSENETSGRPVG 147
             ADG+ G  +G      SG P G
Sbjct:    55 ADGAAGPGSGEPGERNSGGPNG 76


>UNIPROTKB|J3KNM7 [details] [associations]
            symbol:COL4A4 "Collagen alpha-4(IV) chain" species:9606
            "Homo sapiens" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
            EMBL:CH471063 InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 Gene3D:2.170.240.10
            EMBL:AC079235 EMBL:AC073149 UniGene:Hs.591645 HGNC:HGNC:2206
            ChiTaRS:COL4A4 ProteinModelPortal:J3KNM7 Ensembl:ENST00000329662
            Uniprot:J3KNM7
        Length = 1687

 Score = 153 (58.9 bits), Expect = 3.7e-07, P = 3.7e-07
 Identities = 81/253 (32%), Positives = 101/253 (39%)

Query:   151 YEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKG 210
             Y   +G P   GPP      G  GA P  S S     + GTP  A  +IP  PG+    G
Sbjct:   672 YPGRHGPPGFDGPPGPKGFPGPQGA-PGLSGSDGHKGRPGTPGTA--EIPGPPGFRGDMG 728

Query:   211 -PGYDASKAPSYDPTKGPSYDPA----KG-PGYDPTKGP-GYDAQKG-SNYDAQRGPNYD 262
              PG+   K  S     GP   P     KG PG DP  G  G   ++G S     +GP  D
Sbjct:   729 DPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPG-DPAFGHLGPPGKRGLSGVPGIKGPRGD 787

Query:   263 IHRG-PSYDPQRGL-GYDMQRGPN-YDMQRG-PGYETQRVPGYDVQRG-PVYEAQRA-PS 316
                G P  +   G+ G+   +GP   +   G PG      PG+  +RG P    Q   P 
Sbjct:   788 --PGCPGAEGPAGIPGFLGLKGPKGREGHAGFPGVPGP--PGHSCERGAPGIPGQPGLPG 843

Query:   317 YIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVP--YGSA 374
             Y P   PG    +GQ  D+   P   P+   G  G P     HG  PP L  +P  +G  
Sbjct:   844 Y-PG-SPGAPGGKGQPGDV--GPP-GPAGMKGLPGLPGRPGAHG--PPGLPGIPGPFGDD 896

Query:   375 TPPARSGSGQPRG 387
               P   G   PRG
Sbjct:   897 GLPGPPGPKGPRG 909

 Score = 130 (50.8 bits), Expect = 0.00012, P = 0.00012
 Identities = 81/260 (31%), Positives = 104/260 (40%)

Query:   152 EDGY-GVPQGHGPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEAS 208
             E G+ GVP GH  P      G+ G  G   S +     + G P    +D P GP G+   
Sbjct:   640 ERGHPGVP-GH--PGVRGPDGLKGQKGDTISCNVTYPGRHGPP---GFDGPPGPKGFPGP 693

Query:   209 KG-PGYDASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGP----NY 261
             +G PG   S      P T G +  P   PG+    G PG+  +KGS+     GP      
Sbjct:   694 QGAPGLSGSDGHKGRPGTPGTAEIPGP-PGFRGDMGDPGFGGEKGSSPVGPPGPPGSPGV 752

Query:   262 DIHRGPSYDPQRG-LGYDMQRG----PNYDMQRG----PGYETQR-VPGYDVQRGPVYEA 311
             +  +G   DP  G LG   +RG    P     RG    PG E    +PG+   +GP    
Sbjct:   753 NGQKGIPGDPAFGHLGPPGKRGLSGVPGIKGPRGDPGCPGAEGPAGIPGFLGLKGPKGRE 812

Query:   312 QRA--PSYIPQRGPGYDLQRGQ-GYDMRRA-PSYDPSRGTGFDGAPRGAAPHGQVPPPLN 367
               A  P  +P   PG+  +RG  G   +   P Y P    G  GAP G    G V PP  
Sbjct:   813 GHAGFPG-VPGP-PGHSCERGAPGIPGQPGLPGY-P----GSPGAPGGKGQPGDVGPPGP 865

Query:   368 NVPYGSATPPARSGSGQPRG 387
                 G    P R G+  P G
Sbjct:   866 AGMKGLPGLPGRPGAHGPPG 885

 Score = 123 (48.4 bits), Expect = 0.00070, P = 0.00070
 Identities = 81/280 (28%), Positives = 104/280 (37%)

Query:   132 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGT 191
             GA+G  +    G PVG    +   G P   G  P     G  G  P    S+     +G 
Sbjct:  1190 GASGLHDVGPPG-PVGIPGLKGERGDPGSPGISPPGPR-GKKGP-PGPPGSSGPPGPAGA 1246

Query:   192 PMRAAYDIPRGPGYEASKGP-GYDASK-AP-------SYDPTKGPSYD-----PAKGPGY 237
               RA  DIP  PG    +GP G D  + AP       S D  +G   D     P   PG 
Sbjct:  1247 TGRAPKDIP-DPGPPGDQGPPGPDGPRGAPGPPGLPGSVDLLRGEPGDCGLPGPPGPPG- 1304

Query:   238 DPTKGPGYDAQKGSN-YDAQRGP-NYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRGP-GY 293
              P   PGY    G +  D Q+GP  +   +GP   P    G   ++G P    ++GP G 
Sbjct:  1305 -PPGPPGYKGFPGCDGKDGQKGPVGFPGPQGPHGFP----GPPGEKGLPGPPGRKGPTGL 1359

Query:   294 ETQRVPGYDVQRGP-VYEAQRAPSYI-PQRGPGYDLQRG-QGYDMRRAPSYDPSRGTGFD 350
               +  P  DV   P +     AP    P+   G    RG  G   +  P  D  RG   D
Sbjct:  1360 PGEPGPPADVDDCPRIPGLPGAPGMRGPEGAMGLPGMRGPSGPGCKGEPGLDGRRGV--D 1417

Query:   351 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNP 390
             G P    P G+      +   G   PP   G   P+G  P
Sbjct:  1418 GVPGSPGPPGRKGDTGEDGYPGGPGPPGPIGDPGPKGFGP 1457


>UNIPROTKB|P53420 [details] [associations]
            symbol:COL4A4 "Collagen alpha-4(IV) chain" species:9606
            "Homo sapiens" [GO:0005587 "collagen type IV" evidence=IDA]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IMP] [GO:0032836 "glomerular basement membrane
            development" evidence=IMP] [GO:0005605 "basal lamina" evidence=IDA]
            [GO:0005576 "extracellular region" evidence=TAS] [GO:0005788
            "endoplasmic reticulum lumen" evidence=TAS] [GO:0007411 "axon
            guidance" evidence=TAS] [GO:0030198 "extracellular matrix
            organization" evidence=TAS] InterPro:IPR001442 Pfam:PF01413
            PROSITE:PS51403 SMART:SM00111 Reactome:REACT_118779
            Reactome:REACT_111045 Reactome:REACT_111102 GO:GO:0007411
            InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005788 GO:GO:0005605
            InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793 GO:GO:0005201
            HOVERGEN:HBG004933 HOGENOM:HOG000085652 GO:GO:0005587
            Gene3D:2.170.240.10 KO:K06237 OrthoDB:EOG4XGZZF EMBL:AC079235
            EMBL:AB008496 MIM:141200 MIM:203780 Orphanet:88919 Orphanet:97562
            GO:GO:0032836 EMBL:X81053 EMBL:Y17397 EMBL:Y17398 EMBL:Y17399
            EMBL:Y17400 EMBL:Y17401 EMBL:Y17402 EMBL:Y17403 EMBL:Y17404
            EMBL:Y17405 EMBL:Y17406 EMBL:Y17407 EMBL:Y17408 EMBL:Y17409
            EMBL:Y17410 EMBL:Y17411 EMBL:Y17412 EMBL:Y17413 EMBL:Y17427
            EMBL:Y17426 EMBL:Y17414 EMBL:Y17415 EMBL:Y17416 EMBL:Y17417
            EMBL:Y17418 EMBL:Y17419 EMBL:Y17420 EMBL:Y17443 EMBL:Y17442
            EMBL:Y17441 EMBL:Y17440 EMBL:Y17439 EMBL:Y17438 EMBL:Y17437
            EMBL:Y17436 EMBL:Y17435 EMBL:Y17434 EMBL:Y17433 EMBL:Y17432
            EMBL:Y17431 EMBL:Y17430 EMBL:Y17429 EMBL:Y17428 EMBL:Y17421
            EMBL:Y17422 EMBL:Y17423 EMBL:Y17424 EMBL:Y17425 EMBL:AC073149
            EMBL:D17391 IPI:IPI00478572 PIR:A55360 RefSeq:NP_000083.3
            UniGene:Hs.591645 ProteinModelPortal:P53420 SMR:P53420
            IntAct:P53420 STRING:P53420 PhosphoSite:P53420 DMDM:259016360
            PaxDb:P53420 PRIDE:P53420 Ensembl:ENST00000396625 GeneID:1286
            KEGG:hsa:1286 UCSC:uc021vxr.1 CTD:1286 GeneCards:GC02M227867
            H-InvDB:HIX0030014 HGNC:HGNC:2206 MIM:120131 neXtProt:NX_P53420
            PharmGKB:PA26721 InParanoid:P53420 OMA:FRGDMGD ChiTaRS:COL4A4
            GenomeRNAi:1286 NextBio:5201 Bgee:P53420 CleanEx:HS_COL4A4
            Genevestigator:P53420 GermOnline:ENSG00000081052 Uniprot:P53420
        Length = 1690

 Score = 153 (58.9 bits), Expect = 3.7e-07, P = 3.7e-07
 Identities = 81/253 (32%), Positives = 101/253 (39%)

Query:   151 YEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKG 210
             Y   +G P   GPP      G  GA P  S S     + GTP  A  +IP  PG+    G
Sbjct:   672 YPGRHGPPGFDGPPGPKGFPGPQGA-PGLSGSDGHKGRPGTPGTA--EIPGPPGFRGDMG 728

Query:   211 -PGYDASKAPSYDPTKGPSYDPA----KG-PGYDPTKGP-GYDAQKG-SNYDAQRGPNYD 262
              PG+   K  S     GP   P     KG PG DP  G  G   ++G S     +GP  D
Sbjct:   729 DPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPG-DPAFGHLGPPGKRGLSGVPGIKGPRGD 787

Query:   263 IHRG-PSYDPQRGL-GYDMQRGPN-YDMQRG-PGYETQRVPGYDVQRG-PVYEAQRA-PS 316
                G P  +   G+ G+   +GP   +   G PG      PG+  +RG P    Q   P 
Sbjct:   788 --PGCPGAEGPAGIPGFLGLKGPKGREGHAGFPGVPGP--PGHSCERGAPGIPGQPGLPG 843

Query:   317 YIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVP--YGSA 374
             Y P   PG    +GQ  D+   P   P+   G  G P     HG  PP L  +P  +G  
Sbjct:   844 Y-PG-SPGAPGGKGQPGDV--GPP-GPAGMKGLPGLPGRPGAHG--PPGLPGIPGPFGDD 896

Query:   375 TPPARSGSGQPRG 387
               P   G   PRG
Sbjct:   897 GLPGPPGPKGPRG 909

 Score = 130 (50.8 bits), Expect = 0.00012, P = 0.00012
 Identities = 81/260 (31%), Positives = 104/260 (40%)

Query:   152 EDGY-GVPQGHGPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEAS 208
             E G+ GVP GH  P      G+ G  G   S +     + G P    +D P GP G+   
Sbjct:   640 ERGHPGVP-GH--PGVRGPDGLKGQKGDTISCNVTYPGRHGPP---GFDGPPGPKGFPGP 693

Query:   209 KG-PGYDASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGP----NY 261
             +G PG   S      P T G +  P   PG+    G PG+  +KGS+     GP      
Sbjct:   694 QGAPGLSGSDGHKGRPGTPGTAEIPGP-PGFRGDMGDPGFGGEKGSSPVGPPGPPGSPGV 752

Query:   262 DIHRGPSYDPQRG-LGYDMQRG----PNYDMQRG----PGYETQR-VPGYDVQRGPVYEA 311
             +  +G   DP  G LG   +RG    P     RG    PG E    +PG+   +GP    
Sbjct:   753 NGQKGIPGDPAFGHLGPPGKRGLSGVPGIKGPRGDPGCPGAEGPAGIPGFLGLKGPKGRE 812

Query:   312 QRA--PSYIPQRGPGYDLQRGQ-GYDMRRA-PSYDPSRGTGFDGAPRGAAPHGQVPPPLN 367
               A  P  +P   PG+  +RG  G   +   P Y P    G  GAP G    G V PP  
Sbjct:   813 GHAGFPG-VPGP-PGHSCERGAPGIPGQPGLPGY-P----GSPGAPGGKGQPGDVGPPGP 865

Query:   368 NVPYGSATPPARSGSGQPRG 387
                 G    P R G+  P G
Sbjct:   866 AGMKGLPGLPGRPGAHGPPG 885

 Score = 122 (48.0 bits), Expect = 0.00090, P = 0.00090
 Identities = 74/257 (28%), Positives = 89/257 (34%)

Query:   145 PVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG-P 203
             P G    +   G P   GPP  A   G  G  P            G P     D PRG P
Sbjct:  1222 PPGPRGKKGPPGPPGSSGPPGPA---GATGRAPKDIPDPGPPGDQGPP---GPDGPRGAP 1275

Query:   204 GYEASKGPGYDASKAPSYD-PTKGPSYDPAK-GP-GYDPTKG-PGYDAQKGS-NYDAQRG 258
             G     G   D  +    D    GP   P   GP GY    G  G D QKG   +   +G
Sbjct:  1276 GPPGLPG-SVDLLRGEPGDCGLPGPPGPPGPPGPPGYKGFPGCDGKDGQKGPVGFPGPQG 1334

Query:   259 PNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRG-PGYETQRVPGYDVQRGP-VYEAQRAP 315
             P    H  P    ++GL G   ++GP      G PG   +  P  DV   P +     AP
Sbjct:  1335 P----HGFPGPPGEKGLPGPPGRKGPT-----GLPGPRGEPGPPADVDDCPRIPGLPGAP 1385

Query:   316 SYI-PQRGPGYDLQRG-QGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGS 373
                 P+   G    RG  G   +  P  D  RG   DG P    P G+      +   G 
Sbjct:  1386 GMRGPEGAMGLPGMRGPSGPGCKGEPGLDGRRGV--DGVPGSPGPPGRKGDTGEDGYPGG 1443

Query:   374 ATPPARSGSGQPRGGNP 390
               PP   G   P+G  P
Sbjct:  1444 PGPPGPIGDPGPKGFGP 1460


>UNIPROTKB|D4ADB1 [details] [associations]
            symbol:D4ADB1 "Uncharacterized protein" species:10116
            "Rattus norvegicus" [GO:0008270 "zinc ion binding" evidence=IEA]
            Pfam:PF00595 Pfam:PF00412 InterPro:IPR001478 InterPro:IPR001781
            PROSITE:PS00478 PROSITE:PS50023 PROSITE:PS50106 SMART:SM00132
            SMART:SM00228 GO:GO:0046872 GO:GO:0008270 Gene3D:2.10.110.10
            SUPFAM:SSF50156 InterPro:IPR006643 SMART:SM00735 IPI:IPI00951885
            PRIDE:D4ADB1 Ensembl:ENSRNOT00000043713 ArrayExpress:D4ADB1
            Uniprot:D4ADB1
        Length = 684

 Score = 148 (57.2 bits), Expect = 4.3e-07, P = 4.3e-07
 Identities = 50/182 (27%), Positives = 70/182 (38%)

Query:   141 TSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIP 200
             TS  P    +Y +G   P    P P   T   +   P+      A+  S +P  A Y  P
Sbjct:   331 TSPAPAAHTSYSEGPAAP---APKPRVVTTASIR--PSVYQPVPASPYSPSP-GANYS-P 383

Query:   201 RGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPN 260
               P Y  S  P Y  S AP+Y P+  P+Y P+  P Y P+  P Y+    + Y    GP+
Sbjct:   384 T-P-YTPSPAPAYTPSPAPTYTPSPAPTYSPSPAPAYTPSPAPNYNPTPSAAYSG--GPS 439

Query:   261 YDIHRGP-----SYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVPGYDVQRGPVYEAQRA 314
                 R P     S+  +   G          + RG P Y         + RG    A+R 
Sbjct:   440 ESASRPPWVTDDSFSQKFAPGKSTTSVSKQTLPRGAPAYNPTGPQVTPLARGTFQRAERF 499

Query:   315 PS 316
             P+
Sbjct:   500 PA 501


>UNIPROTKB|P02457 [details] [associations]
            symbol:COL1A1 "Collagen alpha-1(I) chain" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            [GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000885
            InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410 ProDom:PD002078
            PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038
            SMART:SM00214 GO:GO:0046872 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 HOVERGEN:HBG004933 EMBL:M17839
            EMBL:M17838 EMBL:V00401 EMBL:M10571 EMBL:M17607 IPI:IPI00572548
            PIR:A27179 PIR:A90458 PIR:I50629 PIR:S07234 UniGene:Gga.2073
            UniGene:Gga.43371 IntAct:P02457 PRIDE:P02457 Uniprot:P02457
        Length = 1453

 Score = 149 (57.5 bits), Expect = 8.6e-07, P = 8.6e-07
 Identities = 90/285 (31%), Positives = 109/285 (38%)

Query:   126 ADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNTSTS 182
             ADG  G  G TG++  +    P G  A   G   P G  G P      G   AGP  +T 
Sbjct:   808 ADGQPGAKGETGDAGAKGDAGPPGP-AGPTGAPGPAGZVGAPGPKGARG--SAGPPGATG 864

Query:   183 AYAATQSGTPMRAAYDI----PRGP-GYEASKGPGYDASKA--PSYDPTKGPSYDPA-KG 234
                A     P   + +I    P GP G + SKGP  +   A  P      GP   P  KG
Sbjct:   865 FPGAAGRVGPPGPSGNIGLPGPPGPAGKZGSKGPRGETGPAGRPGEPGPAGPPGPPGEKG 924

Query:   235 -PGYD-P-----TKGP-GYDAQKGS-NYDAQRGPN-YDIHRGPSYDP-QRG-LGYDMQRG 282
              PG D P     T GP G   Q+G      QRG   +    GPS +P ++G  G   +RG
Sbjct:   925 SPGADGPIGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQGPSGASGERG 984

Query:   283 PNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYD 342
             P   M  GP       PG     GP  EA R  +   +  PG D   G   D        
Sbjct:   985 PPGPM--GP-------PGL---AGPPGEAGREGAPGAEGAPGRDGAAGPKGDRGETGPAG 1032

Query:   343 PSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 387
             P    G  GAP    P G+        P G A PP  +G+  P G
Sbjct:  1033 PPGAPGAPGAPGPVGPAGKNGDRGETGPAGPAGPPGPAGARGPAG 1077


>UNIPROTKB|G4N3H5 [details] [associations]
            symbol:MGG_04961 "Uncharacterized protein" species:242507
            "Magnaporthe oryzae 70-15" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0008150 "biological_process" evidence=ND] EMBL:CM001233
            RefSeq:XP_003712457.1 EnsemblFungi:MGG_04961T0 GeneID:2675293
            KEGG:mgr:MGG_04961 Uniprot:G4N3H5
        Length = 616

 Score = 144 (55.7 bits), Expect = 1.0e-06, P = 1.0e-06
 Identities = 61/185 (32%), Positives = 80/185 (43%)

Query:   120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNT 179
             P   R   G       + ++ +SGR     +     G P G   PP + TA +   GP+ 
Sbjct:   445 PGYQRNQPGGPPSRFDSYDDYSSGRASPAPSMYPSRG-PGGPNMPPRSATAPIPPRGPD- 502

Query:   180 STSAYAATQSG--TPMRAAYDIPRGPGYEASKGPGYDASKAPS-YDPTKGPSYDPAKGPG 236
                AY    +G  +P  + Y  PRGPG     GP   AS APS Y+P + P    A GP 
Sbjct:   503 ---AYDDYSNGRASPAPSMYP-PRGPG-----GPNGRASPAPSMYNPPRAPPQRSATGPM 553

Query:   237 YDPTKGPGYDAQKGSNYDAQRGPN--YDIHRGP----SYDPQRGLGYDMQRGPNYDM--Q 288
               P +GPG+  Q+     A  GP+  YD +  P    S  P RG       G N D+  Q
Sbjct:   554 --PPRGPGFPPQRNMTAPAP-GPDDPYDYNTRPPTSSSQAPPRGA---FGNGWNSDLENQ 607

Query:   289 RG-PG 292
             RG PG
Sbjct:   608 RGGPG 612

 Score = 128 (50.1 bits), Expect = 5.8e-05, P = 5.8e-05
 Identities = 81/289 (28%), Positives = 97/289 (33%)

Query:   113 RAELMNA--PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTA 170
             RA+ M    P   R   G+ G    NS ++    P  Q      Y   Q     P    A
Sbjct:   332 RADTMTTLPPYASR--PGTPGSIELNSLDQKRPMPSRQGTMNSSYSSRQ-----PLVGAA 384

Query:   171 GVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYD 230
                G   + + S  +   SG        + R     +S    Y AS AP    T  P+  
Sbjct:   385 AEFGRSASPAPSIPSTNYSGRTYGGQPPMSRMQSNASSMSRAYTASPAPFSSDTV-PAL- 442

Query:   231 PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 290
             P   PGY   + PG    +  +YD            PS  P RG G     GPN   +  
Sbjct:   443 PR--PGYQRNQ-PGGPPSRFDSYDDYSSGRAS--PAPSMYPSRGPG-----GPNMPPRSA 492

Query:   291 PGYETQRVP-GYD-VQRGPVYEAQRAPSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGT 347
                   R P  YD    G    A  APS  P RGPG    R      M   P   P R  
Sbjct:   493 TAPIPPRGPDAYDDYSNG---RASPAPSMYPPRGPGGPNGRASPAPSMYNPPRAPPQRSA 549

Query:   348 GFDGAPRGAA--PHGQV--PPPLNNVPYGSAT-PPARSGSGQPRG--GN 389
                  PRG    P   +  P P  + PY   T PP  S    PRG  GN
Sbjct:   550 TGPMPPRGPGFPPQRNMTAPAPGPDDPYDYNTRPPTSSSQAPPRGAFGN 598


>WB|WBGene00004203 [details] [associations]
            symbol:swsn-1 species:6239 "Caenorhabditis elegans"
            [GO:0003682 "chromatin binding" evidence=IEA] [GO:0000003
            "reproduction" evidence=IGI;IMP] [GO:0040035 "hermaphrodite
            genitalia development" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IGI;IMP] [GO:0009792 "embryo development ending in birth
            or egg hatching" evidence=IGI;IMP] [GO:0040018 "positive regulation
            of multicellular organism growth" evidence=IGI;IMP] [GO:0040010
            "positive regulation of growth rate" evidence=IMP] [GO:0040027
            "negative regulation of vulval development" evidence=IMP]
            [GO:0046662 "regulation of oviposition" evidence=IMP] [GO:0002009
            "morphogenesis of an epithelium" evidence=IMP] [GO:0035262 "gonad
            morphogenesis" evidence=IMP] InterPro:IPR001005 InterPro:IPR007526
            InterPro:IPR009057 Pfam:PF00249 Pfam:PF04433 PROSITE:PS50934
            SMART:SM00717 GO:GO:0005634 GO:GO:0009792 GO:GO:0002009
            GO:GO:0040007 GO:GO:0040010 GO:GO:0002119 GO:GO:0003677
            GO:GO:0040018 Gene3D:1.10.10.10 InterPro:IPR011991 GO:GO:0003682
            Gene3D:1.10.10.60 SUPFAM:SSF46689 GO:GO:0046662 GO:GO:0040035
            InterPro:IPR017884 PROSITE:PS51293 GO:GO:0040027 GO:GO:0035262
            EMBL:AL110477 KO:K11649 UniGene:Cel.7072 GeneID:180324
            KEGG:cel:CELE_Y113G7B.23 CTD:180324 RefSeq:NP_001256907.1
            ProteinModelPortal:H8ESF3 SMR:H8ESF3 WormBase:Y113G7B.23c
            Uniprot:H8ESF3
        Length = 792

 Score = 145 (56.1 bits), Expect = 1.1e-06, P = 1.1e-06
 Identities = 86/316 (27%), Positives = 123/316 (38%)

Query:    91 HLESL-QVMEKNYITMATEVEKLRAELMNAPNVDRRADGSYGGATGNSENETSGRPVGQN 149
             H + L Q+M+K   ++  +  +L  E   A ++D+     Y      +++E   R     
Sbjct:   493 HFDELEQIMDKERESLEYQRHQLILE-RQAFHMDQL---KY--LENRAKHEAHSRMTSSG 546

Query:   150 AYEDGYGVPQGH---GPP---PSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGP 203
             A   G  +P G    GPP   P    +    A P    ++ AAT +  P  +    P+ P
Sbjct:   547 ALPAG--LPPGFEVTGPPQPTPQVQISAQEAAIPEKMDTSEAATAARPP--STPQAPQAP 602

Query:   204 GYEASKGP--GYDASKAP--SYDPTKGPSYDPAKGPGYDPTKGPGYDA----QKGSNYDA 255
               +A+  P     A +AP  +Y    GP   P +   Y P +G  Y      Q+   + A
Sbjct:   603 PVQAAPAPVQAPQAPQAPPQAYQGYGGPGGPPQQAYRYPPQQGQQYSPYPPPQQQQQHQA 662

Query:   256 QRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRG-PVYEAQRA 314
             Q+  +   H GP    Q G     Q    Y     PG       GY  Q+  P Y+AQ  
Sbjct:   663 QQAQS-QAHYGPPGGGQ-GPPPPPQGQQYYGGPPPPGQPYGPPGGYPPQQQRPPYQAQPY 720

Query:   315 PSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSA 374
             P   P   P    QRG GY     P   P     F G P    P+GQ+PPP    P+G  
Sbjct:   721 PG--P---PPPQQQRGYGYP----PPPQPV----FSGHPY-QQPYGQMPPP----PHGQY 762

Query:   375 TPPARSGSGQ-PRGGN 389
              P  + G    P GG+
Sbjct:   763 QPQQQQGGPMGPPGGH 778


>TAIR|locus:2012713 [details] [associations]
            symbol:AT1G33680 "AT1G33680" species:3702 "Arabidopsis
            thaliana" [GO:0003676 "nucleic acid binding" evidence=ISS]
            [GO:0003723 "RNA binding" evidence=IEA] [GO:0005634 "nucleus"
            evidence=ISM] [GO:0008150 "biological_process" evidence=ND]
            [GO:0005829 "cytosol" evidence=IDA] InterPro:IPR004087
            InterPro:IPR004088 Pfam:PF13014 PROSITE:PS50084 SMART:SM00322
            EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0005829 GO:GO:0003723
            eggNOG:NOG300923 KO:K13210 UniGene:At.39892 UniGene:At.71035
            HOGENOM:HOG000242545 EMBL:AK229850 EMBL:AK229909 EMBL:AK230055
            IPI:IPI00786006 RefSeq:NP_174629.3 ProteinModelPortal:Q0WLY0
            SMR:Q0WLY0 STRING:Q0WLY0 PaxDb:Q0WLY0 PRIDE:Q0WLY0
            EnsemblPlants:AT1G33680.1 GeneID:840259 KEGG:ath:AT1G33680
            TAIR:At1g33680 InParanoid:Q0WLY0 OMA:PSYGSTP PhylomeDB:Q0WLY0
            ProtClustDB:CLSN2690290 Genevestigator:Q0WLY0 Uniprot:Q0WLY0
        Length = 763

 Score = 144 (55.7 bits), Expect = 1.4e-06, P = 1.4e-06
 Identities = 65/233 (27%), Positives = 82/233 (35%)

Query:   130 YGGATGNSENETSGRPVG-QNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQ 188
             Y  A G  + +   RP G Q + E GYG P+   PP      G   A P+  ++  AA+ 
Sbjct:   537 YPSAGGQHQMQQPSRPYGMQGSAEQGYGPPRPAAPPGDVPYQGPTPAAPSYGSTPAAASY 596

Query:   189 SGTPMRAAY-DIPRGPGYEASKGP----GYDASKAPSYDPTKGPSYDPAK-GPGYD---- 238
               TP   +Y   P  P Y ++       GY AS AP+      PSY  A    GY+    
Sbjct:   597 GSTPAAPSYGSTPAAPSYGSNMAQQQQYGY-ASSAPTQQTY--PSYSSAAPSDGYNGTQP 653

Query:   239 PTKGPGYD---AQKGSNYDAQRG------PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQR 289
             P   P Y+   AQ  S      G      P       PS  P  G     Q   NY    
Sbjct:   654 PAVAPAYEQHGAQPASGVQQTSGGYGQVPPTGGYSSYPSTQPAYG-NTPAQSNGNY---- 708

Query:   290 GPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGP---GYDLQRGQGYDMRRAP 339
               GY   + P Y       Y A    +   Q  P   GY+    Q      AP
Sbjct:   709 --GYIGSQYPSYGGGNASAYAAPTGQTAYSQTAPPQAGYEQSATQSAGYAAAP 759


>UNIPROTKB|Q96QC0 [details] [associations]
            symbol:PPP1R10 "Serine/threonine-protein phosphatase 1
            regulatory subunit 10" species:9606 "Homo sapiens" [GO:0006351
            "transcription, DNA-dependent" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA]
            [GO:0003723 "RNA binding" evidence=IEA] [GO:0004864 "protein
            phosphatase inhibitor activity" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0072357 "PTW/PP1 phosphatase complex"
            evidence=ISS] [GO:0000785 "chromatin" evidence=ISS] [GO:0006606
            "protein import into nucleus" evidence=TAS] InterPro:IPR000571
            InterPro:IPR003617 InterPro:IPR017923 Pfam:PF00642 Pfam:PF08711
            PROSITE:PS50103 PROSITE:PS51319 SMART:SM00356 SMART:SM00509
            GO:GO:0005634 EMBL:BA000025 GO:GO:0046872 GO:GO:0003677
            GO:GO:0008270 GO:GO:0000785 GO:GO:0006351 GO:GO:0003723
            EMBL:AL662800 EMBL:AL662825 GO:GO:0000790 GO:GO:0006606
            GO:GO:0004864 Gene3D:1.20.930.10 SUPFAM:SSF47676 GO:GO:0072357
            EMBL:Y13247 EMBL:AJ544537 EMBL:AB088097 EMBL:BX248507
            IPI:IPI00298731 PIR:JE0291 RefSeq:NP_002705.2 UniGene:Hs.106019
            ProteinModelPortal:Q96QC0 SMR:Q96QC0 DIP:DIP-39343N IntAct:Q96QC0
            MINT:MINT-1197376 STRING:Q96QC0 PhosphoSite:Q96QC0 DMDM:61214507
            PaxDb:Q96QC0 PeptideAtlas:Q96QC0 PRIDE:Q96QC0
            Ensembl:ENST00000376511 Ensembl:ENST00000383586
            Ensembl:ENST00000420949 Ensembl:ENST00000424446
            Ensembl:ENST00000426299 Ensembl:ENST00000429597
            Ensembl:ENST00000449113 GeneID:5514 KEGG:hsa:5514 UCSC:uc003nqn.1
            CTD:5514 GeneCards:GC06M030568 H-InvDB:HIX0165052
            H-InvDB:HIX0166290 H-InvDB:HIX0166579 H-InvDB:HIX0166833
            H-InvDB:HIX0167082 H-InvDB:HIX0167322 H-InvDB:HIX0167569
            HGNC:HGNC:9284 HPA:CAB025501 MIM:603771 neXtProt:NX_Q96QC0
            PharmGKB:PA33612 eggNOG:NOG69306 HOGENOM:HOG000049285
            HOVERGEN:HBG053646 InParanoid:Q96QC0 OMA:PPPHEHR OrthoDB:EOG451DQK
            PhylomeDB:Q96QC0 ChiTaRS:PPP1R10 GenomeRNAi:5514 NextBio:21326
            ArrayExpress:Q96QC0 Bgee:Q96QC0 CleanEx:HS_PPP1R10
            Genevestigator:Q96QC0 GermOnline:ENSG00000204569 Uniprot:Q96QC0
        Length = 940

 Score = 145 (56.1 bits), Expect = 1.4e-06, P = 1.4e-06
 Identities = 63/248 (25%), Positives = 83/248 (33%)

Query:   128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQ-------GHGPPPSATTAGVVGAGPNTS 180
             G  GG  G         P G + + DG G P        G GP P     G  G G N  
Sbjct:   656 GGPGGPVGPRLLGPPPPPRGGDPFWDGPGDPMRGGPMRGGPGPGPGPYHRGRGGRGGNEP 715

Query:   181 TSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPT 240
                    +     R+    P G G     GPG        + P +GP        G+ P 
Sbjct:   716 PPPPPPFRGARGGRSGGGPPNGRG-----GPGGGMVGGGGHRPHEGPGGGMGNSSGHRPH 770

Query:   241 KGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPG 300
             +GPG     GS +    GP   +  G  + P  G G  +  G  +    GPG       G
Sbjct:   771 EGPG--GGMGSGHRPHEGPGGSMGGGGGHRPHEGPGGGISGGSGHRPHEGPGGGMGAGGG 828

Query:   301 YDVQRGPVYEAQRAPSYIPQRGPGY---------DLQRGQGYDMRRAPSYDPSRGTGFDG 351
             +    GP      +  + P  GPG+         D+   +G+D R  P   P    G DG
Sbjct:   829 HRPHEGPGGSMGGSGGHRPHEGPGHGGPHGHRPHDVPGHRGHDHRGPP---PHEHRGHDG 885

Query:   352 APRGAAPH 359
                G   H
Sbjct:   886 PGHGGGGH 893

 Score = 144 (55.7 bits), Expect = 1.8e-06, P = 1.8e-06
 Identities = 71/268 (26%), Positives = 90/268 (33%)

Query:   143 GRPVGQNAYEDGYG--VPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIP 200
             G P G   +  G G  +P  HG P       ++G  P            G PMR    + 
Sbjct:   635 GGPKGMQHFPPGPGGPMPGPHGGPGGPVGPRLLGPPPPPRGGDPFWDGPGDPMRGG-PMR 693

Query:   201 RGPGYEASKGPG-YD---ASKAPSYDPTKGPSYDPAKG--PGYDPTKG---PGYDAQKGS 251
              GPG     GPG Y      +  +  P   P +  A+G   G  P  G   PG     G 
Sbjct:   694 GGPG----PGPGPYHRGRGGRGGNEPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGG 749

Query:   252 NYDAQRGP-----NYDIHRGPSYDPQRGLG--YDMQRGPNYDMQRGPGYETQRVPGYDVQ 304
              +    GP     N   HR P   P  G+G  +    GP   M  G G+     PG  + 
Sbjct:   750 GHRPHEGPGGGMGNSSGHR-PHEGPGGGMGSGHRPHEGPGGSMGGGGGHRPHEGPGGGIS 808

Query:   305 RGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPP 364
              G  +     P      G G+    G G  M  +  + P  G G  G P G  PH  VP 
Sbjct:   809 GGSGHRPHEGPGGGMGAGGGHRPHEGPGGSMGGSGGHRPHEGPGH-GGPHGHRPH-DVPG 866

Query:   365 PLNNVPYGSATPPARSGSGQPRGGNPAR 392
                +   G      R   G   GG   R
Sbjct:   867 HRGHDHRGPPPHEHRGHDGPGHGGGGHR 894

 Score = 130 (50.8 bits), Expect = 6.1e-05, P = 6.1e-05
 Identities = 53/213 (24%), Positives = 72/213 (33%)

Query:   132 GATGNSENETSGRPV-GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSG 190
             G  G +E      P  G      G G P G G P      G  G  P+          SG
Sbjct:   708 GGRGGNEPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGG-GHRPHEGPGGGMGNSSG 766

Query:   191 TPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKG 250
                        G G+   +GPG        + P +GP    + G G+ P +GPG     G
Sbjct:   767 HRPHEGPGGGMGSGHRPHEGPGGSMGGGGGHRPHEGPGGGISGGSGHRPHEGPGGGMGAG 826

Query:   251 SNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPN-YDMQRGPGYETQRVPGYDVQRGPVY 309
               +    GP   +     + P  G G+    G   +D+   PG+      G+D  RGP  
Sbjct:   827 GGHRPHEGPGGSMGGSGGHRPHEGPGHGGPHGHRPHDV---PGHR-----GHD-HRGPPP 877

Query:   310 EAQRA---PSYIPQRGPGYDLQRGQGYDMRRAP 339
                R    P +      G+D     G DM   P
Sbjct:   878 HEHRGHDGPGHGGGGHRGHDGGHSHGGDMSNRP 910


>UNIPROTKB|G1RSL2 [details] [associations]
            symbol:COL4A4 "Uncharacterized protein" species:61853
            "Nomascus leucogenys" [GO:0005201 "extracellular matrix structural
            constituent" evidence=ISS] [GO:0005587 "collagen type IV"
            evidence=ISS] [GO:0005605 "basal lamina" evidence=ISS] [GO:0032836
            "glomerular basement membrane development" evidence=ISS]
            InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
            InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005605 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 GO:GO:0005587 Gene3D:2.170.240.10
            GeneTree:ENSGT00690000101772 GO:GO:0032836 OMA:FRGDMGD
            EMBL:ADFV01083072 EMBL:ADFV01083073 EMBL:ADFV01083074
            EMBL:ADFV01083075 EMBL:ADFV01083076 EMBL:ADFV01083077
            EMBL:ADFV01083078 Ensembl:ENSNLET00000017067 Uniprot:G1RSL2
        Length = 1690

 Score = 147 (56.8 bits), Expect = 1.7e-06, P = 1.7e-06
 Identities = 79/253 (31%), Positives = 99/253 (39%)

Query:   151 YEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKG 210
             Y    G P   G P      G  GA P  S S     + GTP     +IP  PG+    G
Sbjct:   671 YPGRQGPPGFDGLPGPKGFPGPQGA-PGLSGSDGHKGRPGTP--GTSEIPGPPGFRGDMG 727

Query:   211 -PGYDASKAPSYDPTKGPSYDPA----KG-PGYDPTKGP-GYDAQKG-SNYDAQRGPNYD 262
              PG+   +  S     GP   P     KG PG DP  GP G   ++G S     +GP  D
Sbjct:   728 DPGFGGERGSSPVGPPGPPGSPGVNGQKGIPG-DPAFGPLGPPGKRGLSGVPGIKGPRGD 786

Query:   263 IHRG-PSYDPQRGL-GYDMQRGPN-YDMQRG-PGYETQRVPGYDVQRG-PVYEAQRA-PS 316
                G P  +   G+ G+   +GP   +   G PG      PG+  +RG P    Q   P 
Sbjct:   787 --PGCPGAEGPAGIPGFPGLKGPKGREGHAGFPGVPGP--PGHSCERGAPGIPGQPGLPG 842

Query:   317 YIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVP--YGSA 374
             Y P   PG    +GQ  D+   P   P+   G  G P     HG  PP L  +P  +G  
Sbjct:   843 Y-PG-SPGAPGGKGQPGDV--GPP-GPAGMKGLPGLPGRPGAHG--PPGLPGIPGPFGDD 895

Query:   375 TPPARSGSGQPRG 387
               P   G   PRG
Sbjct:   896 GLPGPPGPKGPRG 908

 Score = 123 (48.4 bits), Expect = 0.00070, P = 0.00070
 Identities = 76/253 (30%), Positives = 97/253 (38%)

Query:   159 QGH-GPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKG-PGYD 214
             +GH G P      G  G  G    T +   T  G      +D   GP G+   +G PG  
Sbjct:   640 RGHPGVPGRPGVRGPDGLKGQKGDTISCNVTYPGRQGPPGFDGLPGPKGFPGPQGAPGLS 699

Query:   215 ASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGP----NYDIHRGPS 268
              S      P T G S  P   PG+    G PG+  ++GS+     GP      +  +G  
Sbjct:   700 GSDGHKGRPGTPGTSEIPGP-PGFRGDMGDPGFGGERGSSPVGPPGPPGSPGVNGQKGIP 758

Query:   269 YDPQRG-LGYDMQRG----PNYDMQRG----PGYETQR-VPGYDVQRGPVYEAQRA--PS 316
              DP  G LG   +RG    P     RG    PG E    +PG+   +GP      A  P 
Sbjct:   759 GDPAFGPLGPPGKRGLSGVPGIKGPRGDPGCPGAEGPAGIPGFPGLKGPKGREGHAGFPG 818

Query:   317 YIPQRGPGYDLQRGQ-GYDMRRA-PSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSA 374
              +P   PG+  +RG  G   +   P Y P    G  GAP G    G V PP      G  
Sbjct:   819 -VPGP-PGHSCERGAPGIPGQPGLPGY-P----GSPGAPGGKGQPGDVGPPGPAGMKGLP 871

Query:   375 TPPARSGSGQPRG 387
               P R G+  P G
Sbjct:   872 GLPGRPGAHGPPG 884


>FB|FBgn0261885 [details] [associations]
            symbol:osa "osa" species:7227 "Drosophila melanogaster"
            [GO:0046530 "photoreceptor cell differentiation" evidence=IMP]
            [GO:0005634 "nucleus" evidence=NAS;IDA] [GO:0016055 "Wnt receptor
            signaling pathway" evidence=IMP] [GO:0008587 "imaginal disc-derived
            wing margin morphogenesis" evidence=IMP] [GO:0007379 "segment
            specification" evidence=IMP] [GO:0003677 "DNA binding"
            evidence=ISS;IDA;NAS] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IDA;IMP] [GO:0045893 "positive regulation
            of transcription, DNA-dependent" evidence=IDA] [GO:0035060 "brahma
            complex" evidence=IDA;TAS] [GO:0003713 "transcription coactivator
            activity" evidence=IC] [GO:0007476 "imaginal disc-derived wing
            morphogenesis" evidence=IMP] [GO:0048190 "wing disc dorsal/ventral
            pattern formation" evidence=IGI] [GO:0042058 "regulation of
            epidermal growth factor receptor signaling pathway" evidence=IMP]
            [GO:0007480 "imaginal disc-derived leg morphogenesis" evidence=IMP]
            [GO:0008586 "imaginal disc-derived wing vein morphogenesis"
            evidence=IMP] [GO:0022008 "neurogenesis" evidence=IMP]
            InterPro:IPR001606 Pfam:PF01388 PROSITE:PS51011 SMART:SM00501
            EMBL:AE014297 GO:GO:0048190 GO:GO:0045893 GO:GO:0016055
            GO:GO:0003677 GO:GO:0008586 GO:GO:0006351 GO:GO:0016568
            eggNOG:NOG12793 GO:GO:0007379 GO:GO:0007480 KO:K11653
            Gene3D:1.10.150.60 InterPro:IPR021906 Pfam:PF12031 SUPFAM:SSF46774
            GeneTree:ENSGT00550000074575 GO:GO:0046530 GO:GO:0008587
            GO:GO:0035060 GO:GO:0042058 EMBL:AF053091 PIR:T13049
            RefSeq:NP_001163639.1 RefSeq:NP_524392.2 RefSeq:NP_732263.1
            UniGene:Dm.2989 ProteinModelPortal:Q8IN94 SMR:Q8IN94 DIP:DIP-20699N
            IntAct:Q8IN94 MINT:MINT-297379 STRING:Q8IN94 PaxDb:Q8IN94
            PRIDE:Q8IN94 EnsemblMetazoa:FBtr0089581 EnsemblMetazoa:FBtr0301487
            GeneID:42130 KEGG:dme:Dmel_CG7467 CTD:42130 FlyBase:FBgn0261885
            InParanoid:Q8IN94 OMA:SQMGQGP OrthoDB:EOG4MCVF9 PhylomeDB:Q8IN94
            ChiTaRS:osa GenomeRNAi:42130 NextBio:827314 Bgee:Q8IN94
            GermOnline:CG7467 Uniprot:Q8IN94
        Length = 2716

 Score = 148 (57.2 bits), Expect = 2.2e-06, P = 2.2e-06
 Identities = 86/333 (25%), Positives = 126/333 (37%)

Query:    75 HLCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRRADGSYGGAT 134
             H  +    +E  F    ++ L ++++  +   ++ +  +A  + +P      D     +T
Sbjct:  1078 HYTKNLLTFECHFDRGDIDPLPIIQQ--VEAGSKKKTAKAASVPSPG-GGHLDAGTTNST 1134

Query:   135 GNSENETS-GRPVGQ--NAYEDGY-GVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSG 190
             G+S ++ S   P G   NA  DGY G P G  P P A+     G  P+ +T   A     
Sbjct:  1135 GSSNSQDSFPAPPGSAPNAAIDGYPGYPGG-SPYPVAS-----GPQPDYAT---AGQMQR 1185

Query:   191 TPMRAAYDIPRGPGYEASKGPGYDASKA-PSYDPTK---GPSYDPAKGPGYDPTKGPGYD 246
              P +     P  PG  A+   G + S + P  DP     GP      GPG  P  GPG  
Sbjct:  1186 PPSQNNPQTPH-PGAAAAVAAGDNISVSNPFEDPIAAGGGPGSGTGPGPGQGP--GPGA- 1241

Query:   247 AQKGSNYDAQRGPNYDIHRGPSYDP----QRGLGYDMQRGPNYDMQRGPGYET-QRVPGY 301
             A  G+      G     H  P + P    Q+  G   Q+ P +     PG    Q+  G 
Sbjct:  1242 ASGGAGAVGAVGGGPQPHPPPPHSPHTAAQQAAGQHQQQHPQHQHPGLPGPPPPQQQQGQ 1301

Query:   302 DVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQ 361
               Q+ P       P    Q GPG      Q +    A +  P  G+G+   P    P   
Sbjct:  1302 QGQQPPPSVGGGPPPAPQQHGPGQVPPSPQQHVRPAAGAPYPPGGSGYP-TPVSRTPGSP 1360

Query:   362 VPP-PLNNVPYGSATPPARSGS-GQPRGGNPAR 392
              P  P     YGS+     +G  GQP G  P +
Sbjct:  1361 YPSQPGAYGQYGSSDQYNATGPPGQPFGQGPGQ 1393

 Score = 132 (51.5 bits), Expect = 0.00012, P = 0.00012
 Identities = 79/277 (28%), Positives = 101/277 (36%)

Query:   131 GGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGP----NTSTSAYAA 186
             GGA G   +  S  P G+ + +D Y  P    P P      +  + P    N     Y A
Sbjct:  1449 GGAPGAPPS--SAYPTGRPSQQDYYQPPPDQSPQPRRHPDFIKDSQPYPGYNARPQIYGA 1506

Query:   187 TQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYD 246
              QSGT     Y     P Y +S  P  +   AP   P +G +  P   P   P + P   
Sbjct:  1507 WQSGTQQ---YR----PQYPSSPAP-QNWGGAP---P-RGAAPPPG-APHGPPIQQPAGV 1553

Query:   247 AQKGSN-YDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP-GYETQRVPGYDVQ 304
             AQ   + Y  Q+GP       P    Q+      Q+ P Y    GP G +  + P    Q
Sbjct:  1554 AQWDQHRYPPQQGP-------PPPPQQQQQPQQQQQQPPYQQVAGPPGQQPPQAPPQWAQ 1606

Query:   305 RGPVYEAQR--APSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQV 362
               P   AQ   AP   P R P    Q+ +   M         +G G    P   A HG V
Sbjct:  1607 MNPGQTAQSGIAPPGSPLRPPSGPGQQNRMPGMPAQQQQSQQQG-GVPQPPPQQASHGGV 1665

Query:   363 PPP-LNNV--------PYGSATPPARSGSGQPRGGNP 390
             P P L  V        PY    PP++ G GQ  G  P
Sbjct:  1666 PSPGLPQVGPGGMVKPPYAMPPPPSQ-GVGQQVGQGP 1701


>UNIPROTKB|Q5TM61 [details] [associations]
            symbol:PPP1R10 "Serine/threonine-protein phosphatase 1
            regulatory subunit 10" species:9544 "Macaca mulatta" [GO:0000785
            "chromatin" evidence=ISS] [GO:0072357 "PTW/PP1 phosphatase complex"
            evidence=ISS] InterPro:IPR000571 InterPro:IPR003617
            InterPro:IPR017923 Pfam:PF00642 Pfam:PF08711 PROSITE:PS50103
            PROSITE:PS51319 SMART:SM00356 SMART:SM00509 GO:GO:0005634
            GO:GO:0046872 GO:GO:0003677 GO:GO:0008270 GO:GO:0000785
            GO:GO:0006351 GO:GO:0003723 EMBL:AB128049 GO:GO:0004864
            Gene3D:1.20.930.10 SUPFAM:SSF47676 GO:GO:0072357 CTD:5514
            eggNOG:NOG69306 HOVERGEN:HBG053646 RefSeq:NP_001108416.1
            UniGene:Mmu.17467 ProteinModelPortal:Q5TM61 GeneID:711949
            KEGG:mcc:711949 NextBio:19975847 Uniprot:Q5TM61
        Length = 940

 Score = 143 (55.4 bits), Expect = 2.3e-06, P = 2.3e-06
 Identities = 73/271 (26%), Positives = 93/271 (34%)

Query:   143 GRPVGQNAYEDGYG--VPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIP 200
             G P G   +  G G  +P  HG P       ++G  P            G PMR    + 
Sbjct:   635 GGPKGMQHFPPGPGGPMPGPHGGPGGPVGPRLLGPPPPPRGGDPFWDGPGDPMRGG-PMR 693

Query:   201 RGPG-----YEASKGPGYDASKAPSYDPTKGPSYDPAKG--PGYDPTKG---PGYDAQKG 250
              GPG     Y   +G G   ++ P   P   P +  A+G   G  P  G   PG     G
Sbjct:   694 GGPGPGPGPYHRGRG-GRGGNEPP---PPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGG 749

Query:   251 SNYDAQRGP-----NYDIHRGPSYDPQRGLG--YDMQRGPNYDMQRGPGYETQRVPGYDV 303
               +    GP     N   HR P   P  G+G  +    GP   M  G G+     PG  +
Sbjct:   750 GGHRPHEGPGGGMGNSSGHR-PHEGPGSGMGSGHRPHEGPGSSMGGGGGHRPHEGPGGGI 808

Query:   304 QRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVP 363
               G  +     P      G G+    G G  M  +  + P  G G  G P G  PH  VP
Sbjct:   809 SGGSGHRPHEGPGGGMGAGGGHRPHEGPGGSMGGSGGHRPHEGPGH-GGPHGHRPH-DVP 866

Query:   364 PPLNNVPYGSATPPA--RSGSGQPRGGNPAR 392
                 +   G   PP   R   G   GG   R
Sbjct:   867 GHRGHDHRG---PPHEHRGHDGPGHGGGGHR 894

 Score = 142 (55.0 bits), Expect = 3.0e-06, P = 3.0e-06
 Identities = 54/213 (25%), Positives = 73/213 (34%)

Query:   131 GGATGNSENETSGRPV-GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQS 189
             GG  GN        P  G      G G P G G P      G  G  P+          S
Sbjct:   708 GGRGGNEPPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGG-GHRPHEGPGGGMGNSS 766

Query:   190 GTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQK 249
             G           G G+   +GPG        + P +GP    + G G+ P +GPG     
Sbjct:   767 GHRPHEGPGSGMGSGHRPHEGPGSSMGGGGGHRPHEGPGGGISGGSGHRPHEGPGGGMGA 826

Query:   250 GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPN-YDMQRGPGYETQRVPGYDVQRGPV 308
             G  +    GP   +     + P  G G+    G   +D+   PG+      G+D  RGP 
Sbjct:   827 GGGHRPHEGPGGSMGGSGGHRPHEGPGHGGPHGHRPHDV---PGHR-----GHD-HRGPP 877

Query:   309 YE--AQRAPSYIPQRGPGYDLQRGQGYDMRRAP 339
             +E      P +      G+D     G DM   P
Sbjct:   878 HEHRGHDGPGHGGGGHRGHDGGHSHGGDMSNRP 910

 Score = 140 (54.3 bits), Expect = 4.9e-06, P = 4.9e-06
 Identities = 62/245 (25%), Positives = 83/245 (33%)

Query:   128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQ-------GHGPPPSATTAGVVGAGPNTS 180
             G  GG  G         P G + + DG G P        G GP P     G  G G N  
Sbjct:   656 GGPGGPVGPRLLGPPPPPRGGDPFWDGPGDPMRGGPMRGGPGPGPGPYHRGRGGRGGNEP 715

Query:   181 TSAYAATQSGTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDP 239
                        P R A     G G    +G PG        + P +GP        G+ P
Sbjct:   716 PPP-----PPPPFRGARGGRSGGGPPNGRGGPGGGMVGGGGHRPHEGPGGGMGNSSGHRP 770

Query:   240 TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVP 299
              +GPG  +  GS +    GP   +  G  + P  G G  +  G  +    GPG       
Sbjct:   771 HEGPG--SGMGSGHRPHEGPGSSMGGGGGHRPHEGPGGGISGGSGHRPHEGPGGGMGAGG 828

Query:   300 GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG-QGYDMRRAPSYD----PSRGTGFDGAPR 354
             G+    GP      +  + P  GPG+    G + +D+     +D    P    G DG   
Sbjct:   829 GHRPHEGPGGSMGGSGGHRPHEGPGHGGPHGHRPHDVPGHRGHDHRGPPHEHRGHDGPGH 888

Query:   355 GAAPH 359
             G   H
Sbjct:   889 GGGGH 893


>UNIPROTKB|Q7YR38 [details] [associations]
            symbol:PPP1R10 "Serine/threonine-protein phosphatase 1
            regulatory subunit 10" species:9598 "Pan troglodytes" [GO:0000785
            "chromatin" evidence=ISS] [GO:0072357 "PTW/PP1 phosphatase complex"
            evidence=ISS] InterPro:IPR000571 InterPro:IPR003617
            InterPro:IPR017923 Pfam:PF00642 Pfam:PF08711 PROSITE:PS50103
            PROSITE:PS51319 SMART:SM00356 SMART:SM00509 GO:GO:0005634
            GO:GO:0046872 GO:GO:0003677 GO:GO:0008270 GO:GO:0000785
            GO:GO:0006351 GO:GO:0003723 EMBL:BA000041 GO:GO:0004864
            Gene3D:1.20.930.10 SUPFAM:SSF47676 GO:GO:0072357 CTD:5514
            eggNOG:NOG69306 HOGENOM:HOG000049285 HOVERGEN:HBG053646 OMA:PPPHEHR
            GeneTree:ENSGT00530000063820 EMBL:AB210175 EMBL:AB210176
            RefSeq:NP_001038965.1 UniGene:Ptr.6270 ProteinModelPortal:Q7YR38
            Ensembl:ENSPTRT00000033108 GeneID:462544 KEGG:ptr:462544
            NextBio:20841794 Uniprot:Q7YR38
        Length = 940

 Score = 143 (55.4 bits), Expect = 2.3e-06, P = 2.3e-06
 Identities = 63/248 (25%), Positives = 83/248 (33%)

Query:   128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQ-------GHGPPPSATTAGVVGAGPNTS 180
             G  GG  G         P G + + DG G P        G GP P     G  G G N  
Sbjct:   656 GGPGGPVGPRLLGPPPPPRGGDPFWDGPGDPMRGGPMRGGPGPGPGPYHRGRGGRGGNEP 715

Query:   181 TSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPT 240
                    +     R+    P G G     GPG        + P +GP        G+ P 
Sbjct:   716 PPPPPPFRGARGGRSGGGPPNGRG-----GPGGGMVGGGGHRPHEGPGGGMGNNSGHRPH 770

Query:   241 KGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPG 300
             +GPG     GS +    GP   +  G  + P  G G  +  G  +    GPG       G
Sbjct:   771 EGPG--GGMGSGHRPHEGPAGSMGGGGGHRPHEGPGGGISGGSGHRPHEGPGGGMGAGGG 828

Query:   301 YDVQRGPVYEAQRAPSYIPQRGPGY---------DLQRGQGYDMRRAPSYDPSRGTGFDG 351
             +    GP      +  + P  GPG+         D+   +G+D R  P   P    G DG
Sbjct:   829 HRPHEGPGGSMGGSGGHRPHEGPGHGGPHGHRPHDVPGHRGHDHRGPP---PHEHRGHDG 885

Query:   352 APRGAAPH 359
                G   H
Sbjct:   886 PGHGGGGH 893

 Score = 142 (55.0 bits), Expect = 3.0e-06, P = 3.0e-06
 Identities = 71/268 (26%), Positives = 90/268 (33%)

Query:   143 GRPVGQNAYEDGYG--VPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIP 200
             G P G   +  G G  +P  HG P       ++G  P            G PMR    + 
Sbjct:   635 GGPKGMQHFPPGPGGPMPGPHGGPGGPVGPRLLGPPPPPRGGDPFWDGPGDPMRGG-PMR 693

Query:   201 RGPGYEASKGPG-YD---ASKAPSYDPTKGPSYDPAKG--PGYDPTKG---PGYDAQKGS 251
              GPG     GPG Y      +  +  P   P +  A+G   G  P  G   PG     G 
Sbjct:   694 GGPG----PGPGPYHRGRGGRGGNEPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGG 749

Query:   252 NYDAQRGP-----NYDIHRGPSYDPQRGLG--YDMQRGPNYDMQRGPGYETQRVPGYDVQ 304
              +    GP     N   HR P   P  G+G  +    GP   M  G G+     PG  + 
Sbjct:   750 GHRPHEGPGGGMGNNSGHR-PHEGPGGGMGSGHRPHEGPAGSMGGGGGHRPHEGPGGGIS 808

Query:   305 RGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPP 364
              G  +     P      G G+    G G  M  +  + P  G G  G P G  PH  VP 
Sbjct:   809 GGSGHRPHEGPGGGMGAGGGHRPHEGPGGSMGGSGGHRPHEGPGH-GGPHGHRPH-DVPG 866

Query:   365 PLNNVPYGSATPPARSGSGQPRGGNPAR 392
                +   G      R   G   GG   R
Sbjct:   867 HRGHDHRGPPPHEHRGHDGPGHGGGGHR 894

 Score = 132 (51.5 bits), Expect = 3.7e-05, P = 3.7e-05
 Identities = 54/214 (25%), Positives = 72/214 (33%)

Query:   147 GQNAYEDGYGVPQGHGPPPS-----ATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 201
             G   Y  G G   G+ PPP          G  G GP            G      ++ P 
Sbjct:   699 GPGPYHRGRGGRGGNEPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGGGHRPHEGPG 758

Query:   202 G-----PGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQ 256
             G      G+   +GPG        + P +GP+     G G+ P +GPG     GS +   
Sbjct:   759 GGMGNNSGHRPHEGPG--GGMGSGHRPHEGPAGSMGGGGGHRPHEGPGGGISGGSGHRPH 816

Query:   257 RGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY------ETQRVPGY--DVQRGPV 308
              GP   +  G  + P  G G  M     +    GPG+          VPG+     RGP 
Sbjct:   817 EGPGGGMGAGGGHRPHEGPGGSMGGSGGHRPHEGPGHGGPHGHRPHDVPGHRGHDHRGPP 876

Query:   309 YEAQRA---PSYIPQRGPGYDLQRGQGYDMRRAP 339
                 R    P +      G+D     G DM   P
Sbjct:   877 PHEHRGHDGPGHGGGGHRGHDGGHSHGGDMSNRP 910


>UNIPROTKB|C9JGE3 [details] [associations]
            symbol:EWSR1 "Ewing sarcoma breakpoint region 1, isoform
            CRA_e" species:9606 "Homo sapiens" [GO:0000166 "nucleotide binding"
            evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0005622 "intracellular" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102
            PROSITE:PS50199 SMART:SM00360 SMART:SM00547 GO:GO:0000166
            EMBL:CH471095 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
            GO:GO:0005622 EMBL:AC002059 EMBL:AL031186 EMBL:AC000026
            UniGene:Hs.374477 HGNC:HGNC:3508 HOGENOM:HOG000038010 ChiTaRS:EWSR1
            IPI:IPI00953325 SMR:C9JGE3 STRING:C9JGE3 Ensembl:ENST00000332050
            UCSC:uc003aez.3 Uniprot:C9JGE3
        Length = 583

 Score = 127 (49.8 bits), Expect = 2.6e-06, Sum P(2) = 2.6e-06
 Identities = 68/254 (26%), Positives = 95/254 (37%)

Query:   128 GSYGGATGNSENETSGRPV-GQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 180
             G+YG  T  S  +       GQ AY   YG P  G+  P  P A +  V G G    +T+
Sbjct:    42 GTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTT 101

Query:   181 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPS--YDPTKGPSYDPAKGPGY 237
             T+    TQ+    ++AY   P  P Y   + P   A   P     PT+      + G GY
Sbjct:   102 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPTRPQDGNKPTETSQPQSSTG-GY 158

Query:   238 D-PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE 294
             + P+ G G   Q   +Y    G  P   +   PSY P     Y   +  +YD      Y 
Sbjct:   159 NQPSLGYG---QSNYSYPQVPGSYPMQPVTAPPSYPPT---SYSSTQPTSYDQS---SYS 209

Query:   295 TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGF-DGAP 353
              Q   G     G      +  SY  Q    Y  Q G  Y   +APS    + + +    P
Sbjct:   210 QQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGS-YS--QAPSQYSQQSSSYGQQRP 266

Query:   354 RGAAPHGQVPPPLN 367
                 P   + PP++
Sbjct:   267 MDEGPDLDLGPPVD 280

 Score = 57 (25.1 bits), Expect = 2.6e-06, Sum P(2) = 2.6e-06
 Identities = 19/46 (41%), Positives = 21/46 (45%)

Query:   354 RGAAP--HGQ-VPPPLNNVPYGSATP--PA--RSGSGQPRGGNPAR 392
             RG  P   G+ +PPPL   P G   P  P     G G  RGG P R
Sbjct:   382 RGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPR 427


>UNIPROTKB|P12105 [details] [associations]
            symbol:COL3A1 "Collagen alpha-1(III) chain" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            [GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000885
            InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410 ProDom:PD002078
            PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038
            SMART:SM00214 GO:GO:0046872 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 eggNOG:NOG12793 GO:GO:0005201 HOVERGEN:HBG004933
            EMBL:U07973 EMBL:X00822 EMBL:X00823 EMBL:X00826 EMBL:X00825
            EMBL:X00827 EMBL:X00828 EMBL:X00830 EMBL:X00831 EMBL:K02302
            EMBL:K02301 EMBL:V00391 EMBL:V00392 EMBL:M36662 IPI:IPI00590578
            PIR:A05269 PIR:I50694 UniGene:Gga.42140 ProteinModelPortal:P12105
            STRING:P12105 Uniprot:P12105
        Length = 1262

 Score = 144 (55.7 bits), Expect = 2.6e-06, P = 2.6e-06
 Identities = 84/280 (30%), Positives = 109/280 (38%)

Query:   132 GATGN-SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP-NTSTSAYAATQ 188
             GA G   +N   G P G+       G+P  +G P     AG  G+ GP   S  A    Q
Sbjct:   467 GANGEPGQNGVPGTP-GERGSPGFRGLPGSNGLPGEKGPAGERGSPGPPGPSGPAGDRGQ 525

Query:   189 SGTP----MRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP- 243
              G P    MR    IP  PG +   GP  +  + P      GP+  P   PG     GP 
Sbjct:   526 DGGPGLPGMRGLPGIPGSPGSDGKPGPPGNQGE-PGRSGPPGPA-GPRGQPGVMGFPGPK 583

Query:   244 GYDAQKGSNYDAQRGPNYDIHR-GPS-YDPQRGL-GYDMQRGPNYDM-QRGPGYET--QR 297
             G +   G N   +RGP       GP+  +   GL G     GP  D  + GP      Q 
Sbjct:   584 GNEGAPGKN--GERGPGGPPGTPGPAGKNGDVGLPGPPGPAGPAGDRGEPGPSGSPGLQG 641

Query:   298 VPGYDVQRGPVYEAQRAPSYIPQR---GPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAP 353
             +PG     GP  E  +     P+    GPG+   +G+ G    R P   P   TG  G P
Sbjct:   642 LPGGP---GPAGENGKPGEPGPKGDIGGPGFPGPKGENGIPGERGPQGPPGP-TGARGGP 697

Query:   354 RGAAPHG-QVPPPLNNVPYGSATPPARSGSGQPRG--GNP 390
               A   G + PP     P G+  P  +   G+ RG  G+P
Sbjct:   698 GPAGSEGAKGPPGPPGAPGGTGLPGLQGMPGE-RGASGSP 736

 Score = 128 (50.1 bits), Expect = 0.00014, P = 0.00014
 Identities = 87/281 (30%), Positives = 107/281 (38%)

Query:   131 GGATGNSENETSGRPVGQNAY-EDGY-GVPQGHGPPPSATTAGVVGAGPNTSTSAYA--- 185
             GG TG  E    G P G  A+ +DG  G     GPP    TAG  G+ P     A     
Sbjct:   301 GGPTG--ERGRPGNPGGPGAHGKDGAPGTAGPLGPPGPPGTAGFPGS-PGFKGEAGPPGP 357

Query:   186 ATQSGTPMRAAYDIPRGP-GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG-P 243
             A  SG P       P+G  G    +GP   A  +P      GPS  P  GPG    +G P
Sbjct:   358 AGASGNPGERGEPGPQGQAGPPGPQGPPGRAG-SPGGKGEMGPSGIPG-GPGPPGGRGLP 415

Query:   244 GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDM-QRGPGYETQRVPGYD 302
             G     G N  A+  P      G   DP    G   +RG N     RGP       PG +
Sbjct:   416 GPPGTSG-NPGAKGTPGEPGKNGAKGDP----GPKGERGENGTPGARGP-------PGEE 463

Query:   303 VQRGPVYEAQR--APSYIPQRG-PGY-DLQRGQGYDMRRAPSYDPSRGTGFDGAPRG-AA 357
              +RG   E  +   P    +RG PG+  L    G    + P+ +  RG+     P G A 
Sbjct:   464 GKRGANGEPGQNGVPGTPGERGSPGFRGLPGSNGLPGEKGPAGE--RGSPGPPGPSGPAG 521

Query:   358 PHGQV--P--PPLNNVPYGSATPPARSGSGQPRG--GNPAR 392
               GQ   P  P +  +P G    P   G   P G  G P R
Sbjct:   522 DRGQDGGPGLPGMRGLP-GIPGSPGSDGKPGPPGNQGEPGR 561

 Score = 127 (49.8 bits), Expect = 0.00018, P = 0.00018
 Identities = 78/276 (28%), Positives = 97/276 (35%)

Query:   132 GATGN-SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSG 190
             G  G   +N   G P G        G P   GPP      G  G  P  +       + G
Sbjct:   428 GTPGEPGKNGAKGDP-GPKGERGENGTPGARGPPGEEGKRGANGE-PGQNGVPGTPGERG 485

Query:   191 TPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAK--GPGYDPTKG-PGYD 246
             +P      +P   G    KGP G   S  P   P+ GP+ D  +  GPG    +G PG  
Sbjct:   486 SP--GFRGLPGSNGLPGEKGPAGERGSPGPP-GPS-GPAGDRGQDGGPGLPGMRGLPGIP 541

Query:   247 AQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYD--MQRGPGYE-TQRVPGYDV 303
                GS  D + GP      G   +P R  G     GP     +   PG +  +  PG + 
Sbjct:   542 GSPGS--DGKPGPP-----GNQGEPGRS-GPPGPAGPRGQPGVMGFPGPKGNEGAPGKNG 593

Query:   304 QRGPVYEAQRAPSYIPQRG----PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPH 359
             +RGP       P    + G    PG     G   D R  P   PS   G  G P G  P 
Sbjct:   594 ERGPG-GPPGTPGPAGKNGDVGLPGPPGPAGPAGD-RGEPG--PSGSPGLQGLPGGPGPA 649

Query:   360 GQVPPPLNNVPYGSATPPARSGSGQPRGGN--PARR 393
             G+   P    P G    P   G   P+G N  P  R
Sbjct:   650 GENGKPGEPGPKGDIGGPGFPG---PKGENGIPGER 682

 Score = 125 (49.1 bits), Expect = 0.00031, P = 0.00031
 Identities = 74/259 (28%), Positives = 95/259 (36%)

Query:   145 PVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRG- 202
             P G N Y+   G P   GP      AG++G AGP          + G P R   +  RG 
Sbjct:   192 PPGSNGYQGPPGEPGQPGPSGPPGPAGMIGPAGP--------PGKDGEPGRPGRNGDRGI 243

Query:   203 PGYEASKG-PGYDASKAPSYDPTKG-PSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRG 258
             PG    KG PG      P     +G    D AKG    P  GP G   Q G+N    Q G
Sbjct:   244 PGLPGHKGHPGMPGM--PGMKGARGFDGKDGAKGDSGAP--GPKGEAGQPGANGSPGQPG 299

Query:   259 PNYDI-HRGPSYDPQRGLGYDMQRGPNYDMQRGP-GYE-TQRVPGYDVQRGPVYEAQRAP 315
             P      RG   +P     +     P      GP G   T   PG      P ++ +  P
Sbjct:   300 PGGPTGERGRPGNPGGPGAHGKDGAPGTAGPLGPPGPPGTAGFPG-----SPGFKGEAGP 354

Query:   316 SYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSAT 375
                P    G   +RG+     +A    P    G  G+P G    G++ P  + +P G   
Sbjct:   355 PG-PAGASGNPGERGEPGPQGQAGPPGPQGPPGRAGSPGGK---GEMGP--SGIPGGPGP 408

Query:   376 PPARSGSGQP-RGGNPARR 393
             P  R   G P   GNP  +
Sbjct:   409 PGGRGLPGPPGTSGNPGAK 427


>TAIR|locus:2012788 [details] [associations]
            symbol:AT1G10390 "AT1G10390" species:3702 "Arabidopsis
            thaliana" [GO:0005215 "transporter activity" evidence=ISS]
            [GO:0005634 "nucleus" evidence=ISM] [GO:0005643 "nuclear pore"
            evidence=IEA] [GO:0006810 "transport" evidence=IEA] [GO:0005515
            "protein binding" evidence=IPI] [GO:0005635 "nuclear envelope"
            evidence=IDA] InterPro:IPR007230 Pfam:PF04096 PROSITE:PS51434
            EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0005635 GO:GO:0006810
            GO:GO:0005643 eggNOG:NOG12793 SUPFAM:SSF82215 KO:K14297 HSSP:Q9Y6J4
            EMBL:AY078948 EMBL:BT003030 EMBL:AK226964 IPI:IPI00523265
            RefSeq:NP_001031018.1 RefSeq:NP_172510.2 UniGene:At.27877
            ProteinModelPortal:Q8RY25 SMR:Q8RY25 STRING:Q8RY25 MEROPS:S59.A02
            PaxDb:Q8RY25 PRIDE:Q8RY25 EnsemblPlants:AT1G10390.1
            EnsemblPlants:AT1G10390.2 GeneID:837579 KEGG:ath:AT1G10390
            TAIR:At1g10390 HOGENOM:HOG000085153 InParanoid:Q8RY25 OMA:ESISAMP
            PhylomeDB:Q8RY25 ProtClustDB:CLSN2713828 Genevestigator:Q8RY25
            Uniprot:Q8RY25
        Length = 1041

 Score = 143 (55.4 bits), Expect = 2.6e-06, P = 2.6e-06
 Identities = 52/263 (19%), Positives = 89/263 (33%)

Query:   132 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP---SATTAGVVGAGPNTSTSAYAATQ 188
             GA+ +     S    G +     +G   G G  P   S   +   G     S  A+  T 
Sbjct:    80 GASSSPAFGNSTPAFGASPASSPFGGSSGFGQKPLGFSTPQSNPFGNSTQQSQPAFGNTS 139

Query:   189 SG--TPMRA----AYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG 242
              G  TP  A    A+  P  P + A+  P + AS  P++  T  P++  +  P +  T  
Sbjct:   140 FGSSTPFGATNTPAFGAPSTPSFGATSTPSFGASSTPAFGATNTPAFGASNSPSFGATNT 199

Query:   243 PGYDAQKGSNYDAQRGP--NYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPG 300
             P + A     + +      N     G ++       +     P +     P +     P 
Sbjct:   200 PAFGASPTPAFGSTGTTFGNTGFGSGGAFGASNTPAFGASGTPAFGASGTPAFGASSTPA 259

Query:   301 YDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHG 360
             +     P + A   P++     P +       +    +P++  S  + F     G++  G
Sbjct:   260 FGASSTPAFGASSTPAFGGSSTPSFGASNTSSFSFGSSPAFGQST-SAF-----GSSAFG 313

Query:   361 QVPPPLNNVPYGSATPPARSGSG 383
               P P        A+ P   GSG
Sbjct:   314 STPSPFGGA---QASTPTFGGSG 333


>MGI|MGI:1344412 [details] [associations]
            symbol:Ldb3 "LIM domain binding 3" species:10090 "Mus
            musculus" [GO:0005080 "protein kinase C binding" evidence=IDA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005737 "cytoplasm"
            evidence=IEA] [GO:0005856 "cytoskeleton" evidence=ISO] [GO:0008092
            "cytoskeletal protein binding" evidence=ISO] [GO:0008270 "zinc ion
            binding" evidence=IEA] [GO:0030018 "Z disc" evidence=ISO;IDA]
            [GO:0042995 "cell projection" evidence=IEA] [GO:0045214 "sarcomere
            organization" evidence=IMP] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0051371 "muscle alpha-actinin binding"
            evidence=IDA;IPI] Pfam:PF00595 Pfam:PF00412 InterPro:IPR001478
            InterPro:IPR001781 PROSITE:PS00478 PROSITE:PS50023 PROSITE:PS50106
            SMART:SM00132 SMART:SM00228 MGI:MGI:1344412 GO:GO:0048471
            GO:GO:0005080 GO:GO:0030018 GO:GO:0005856 GO:GO:0046872
            GO:GO:0008270 GO:GO:0031143 Gene3D:2.10.110.10 SUPFAM:SSF50156
            CTD:11155 eggNOG:NOG286537 HOVERGEN:HBG051478 OMA:CTSQATT
            OrthoDB:EOG4GTKDQ InterPro:IPR006643 SMART:SM00735 EMBL:AF114378
            EMBL:AF114379 EMBL:AJ005621 EMBL:AF228057 EMBL:AF228058
            EMBL:AY206011 EMBL:AY206012 EMBL:AY206013 EMBL:AY206015
            EMBL:AK172980 EMBL:AK004020 EMBL:AK137181 EMBL:AK142292
            EMBL:BC099596 EMBL:BC138793 EMBL:BC145420 IPI:IPI00123369
            IPI:IPI00323030 IPI:IPI00403041 IPI:IPI00621572 IPI:IPI00625287
            IPI:IPI00656173 RefSeq:NP_001034160.1 RefSeq:NP_001034161.1
            RefSeq:NP_001034162.1 RefSeq:NP_001034163.1 RefSeq:NP_001034164.1
            RefSeq:NP_001034165.1 RefSeq:NP_036048.3 UniGene:Mm.29733 PDB:1WJL
            PDBsum:1WJL ProteinModelPortal:Q9JKS4 SMR:Q9JKS4 IntAct:Q9JKS4
            MINT:MINT-97840 STRING:Q9JKS4 PhosphoSite:Q9JKS4 PaxDb:Q9JKS4
            PRIDE:Q9JKS4 Ensembl:ENSMUST00000022327 Ensembl:ENSMUST00000022328
            Ensembl:ENSMUST00000022330 Ensembl:ENSMUST00000090040 GeneID:24131
            KEGG:mmu:24131 UCSC:uc007taz.1 UCSC:uc007tba.1 UCSC:uc007tbc.1
            UCSC:uc007tbd.1 UCSC:uc007tbe.1 UCSC:uc007tbf.1
            GeneTree:ENSGT00700000104411 InParanoid:B2RSB0
            EvolutionaryTrace:Q9JKS4 NextBio:304169 Bgee:Q9JKS4 CleanEx:MM_LDB3
            Genevestigator:Q9JKS4 GermOnline:ENSMUSG00000021798 Uniprot:Q9JKS4
        Length = 723

 Score = 141 (54.7 bits), Expect = 2.7e-06, P = 2.7e-06
 Identities = 49/181 (27%), Positives = 69/181 (38%)

Query:   142 SGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 201
             S  P    +Y +G   P    P P   T   +   P+      A++ S +P  A Y  P 
Sbjct:   371 SPAPSAHTSYSEGPAAP---APKPRVVTTASIR--PSVYQPVPASSYSPSP-GANYS-PT 423

Query:   202 GPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNY 261
              P Y  S  P Y  S AP+Y P+  P+Y P+  P Y P+  P Y     + Y    GP+ 
Sbjct:   424 -P-YTPSPAPAYTPSPAPTYTPSPAPTYSPSPAPAYTPSPAPNYTPTPSAAYSG--GPSE 479

Query:   262 DIHRGP-----SYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVPGYDVQRGPVYEAQRAP 315
                R P     S+  +   G          + RG P Y         + RG    A+R P
Sbjct:   480 SASRPPWVTDDSFSQKFAPGKSTTTVSKQTLPRGAPAYNPTGPQVTPLARGTFQRAERFP 539

Query:   316 S 316
             +
Sbjct:   540 A 540

 Score = 135 (52.6 bits), Expect = 1.2e-05, P = 1.2e-05
 Identities = 55/192 (28%), Positives = 70/192 (36%)

Query:   155 YGVPQGHGPPPSATTAGVVGAG-----PNTSTSAYAATQSGTPMRAAYDIPRGPGYEASK 209
             Y       P PSA T+   G       P   T+A        P+ A+   P  PG   S 
Sbjct:   364 YSPAAAASPAPSAHTSYSEGPAAPAPKPRVVTTASIRPSVYQPVPASSYSP-SPGANYSP 422

Query:   210 GPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSY 269
              P Y  S AP+Y P+  P+Y P+  P Y P+  P Y      NY       Y    GPS 
Sbjct:   423 TP-YTPSPAPAYTPSPAPTYTPSPAPTYSPSPAPAYTPSPAPNYTPTPSAAYS--GGPSE 479

Query:   270 DPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYD-LQ 328
                R          ++  +  PG  T  V    + RG       AP+Y P  GP    L 
Sbjct:   480 SASRP---PWVTDDSFSQKFAPGKSTTTVSKQTLPRG-------APAYNPT-GPQVTPLA 528

Query:   329 RGQGYDMRRAPS 340
             RG      R P+
Sbjct:   529 RGTFQRAERFPA 540

 Score = 132 (51.5 bits), Expect = 2.6e-05, P = 2.6e-05
 Identities = 56/213 (26%), Positives = 74/213 (34%)

Query:   166 SATTAGVVGA---GPNTSTSAYAATQSGTPMRAAY-DIPRGPGYEASKGPGY--DASKAP 219
             +A+ AG   +    P    SAY+   + +P  +A+     GP   A K P     AS  P
Sbjct:   343 AASAAGPAASPVENPRPQASAYSPAAAASPAPSAHTSYSEGPAAPAPK-PRVVTTASIRP 401

Query:   220 S-YDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYD 278
             S Y P    SY P+ G  Y PT  P Y       Y     P Y     P+Y P     Y 
Sbjct:   402 SVYQPVPASSYSPSPGANYSPT--P-YTPSPAPAYTPSPAPTYTPSPAPTYSPSPAPAYT 458

Query:   279 MQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR- 337
                 PNY       Y     P     R P        S+  +  PG          + R 
Sbjct:   459 PSPAPNYTPTPSAAYSGG--PSESASRPPWVTDD---SFSQKFAPGKSTTTVSKQTLPRG 513

Query:   338 APSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVP 370
             AP+Y+P+ G       RG     +  P  +  P
Sbjct:   514 APAYNPT-GPQVTPLARGTFQRAERFPASSRTP 545


>UNIPROTKB|O75112 [details] [associations]
            symbol:LDB3 "LIM domain-binding protein 3" species:9606
            "Homo sapiens" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0005080 "protein kinase C binding" evidence=IEA] [GO:0031143
            "pseudopodium" evidence=IEA] [GO:0048471 "perinuclear region of
            cytoplasm" evidence=IEA] [GO:0005515 "protein binding"
            evidence=IPI] [GO:0005856 "cytoskeleton" evidence=IDA] [GO:0008092
            "cytoskeletal protein binding" evidence=IPI] [GO:0030018 "Z disc"
            evidence=IDA] Pfam:PF00595 Pfam:PF00412 InterPro:IPR001478
            InterPro:IPR001781 PROSITE:PS00478 PROSITE:PS50023 PROSITE:PS50106
            SMART:SM00132 SMART:SM00228 GO:GO:0048471 GO:GO:0030018
            GO:GO:0005856 GO:GO:0046872 GO:GO:0008270 Orphanet:154
            GO:GO:0031143 Gene3D:2.10.110.10 Orphanet:54260 SUPFAM:SSF50156
            EMBL:AJ133766 EMBL:AJ133767 EMBL:AJ133768 EMBL:AF276807
            EMBL:AF276808 EMBL:AF276809 EMBL:AB014513 EMBL:AK304760
            EMBL:EF179181 EMBL:AC067750 EMBL:BC010929 IPI:IPI00165263
            IPI:IPI00294958 IPI:IPI00294959 IPI:IPI00514458 IPI:IPI00552865
            IPI:IPI00654766 IPI:IPI00909817 RefSeq:NP_001073583.1
            RefSeq:NP_001073584.1 RefSeq:NP_001073585.1 RefSeq:NP_001165081.1
            RefSeq:NP_001165082.1 RefSeq:NP_009009.1 UniGene:Hs.657271 PDB:1RGW
            PDBsum:1RGW ProteinModelPortal:O75112 SMR:O75112 IntAct:O75112
            STRING:O75112 PhosphoSite:O75112 UCD-2DPAGE:O75112
            UCD-2DPAGE:Q9Y4Z5 PaxDb:O75112 PRIDE:O75112 DNASU:11155
            Ensembl:ENST00000263066 Ensembl:ENST00000310944
            Ensembl:ENST00000352360 Ensembl:ENST00000361373
            Ensembl:ENST00000372056 Ensembl:ENST00000372066
            Ensembl:ENST00000429277 Ensembl:ENST00000458213
            Ensembl:ENST00000542786 GeneID:11155 KEGG:hsa:11155 UCSC:uc001kdr.3
            UCSC:uc001kds.3 UCSC:uc001kdu.3 UCSC:uc001kdv.3 UCSC:uc009xsy.3
            UCSC:uc009xsz.3 CTD:11155 GeneCards:GC10P088426 HGNC:HGNC:15710
            HPA:HPA048955 MIM:601493 MIM:605906 MIM:609452 neXtProt:NX_O75112
            Orphanet:247 Orphanet:609 Orphanet:98912 PharmGKB:PA30318
            eggNOG:NOG286537 HOGENOM:HOG000220936 HOVERGEN:HBG051478
            InParanoid:O75112 OMA:CTSQATT OrthoDB:EOG4GTKDQ ChiTaRS:LDB3
            EvolutionaryTrace:O75112 GenomeRNAi:11155 NextBio:42413
            ArrayExpress:O75112 Bgee:O75112 Genevestigator:O75112
            GermOnline:ENSG00000122367 InterPro:IPR006643 SMART:SM00735
            Uniprot:O75112
        Length = 727

 Score = 141 (54.7 bits), Expect = 2.7e-06, P = 2.7e-06
 Identities = 53/183 (28%), Positives = 72/183 (39%)

Query:   142 SGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 201
             S  P    +Y +G   P    P P   T   +   P+      A+T S +P  A Y  P 
Sbjct:   375 SSAPATHTSYSEGPAAP---APKPRVVTTASIR--PSVYQPVPASTYSPSP-GANYS-PT 427

Query:   202 GPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNY 261
              P Y  S  P Y  S AP+Y P+  P+Y P+  P Y P+  P Y+      Y    GP  
Sbjct:   428 -P-YTPSPAPAYTPSPAPAYTPSPVPTYTPSPAPAYTPSPAPNYNPAPSVAYSG--GPAE 483

Query:   262 DIHRGP-----SYDPQRGLGYDMQRGPNYDMQRG-PGYETQ--RVPGYDVQRGPVYEAQR 313
                R P     S+  +   G          + RG P Y     +VP   + RG V  A+R
Sbjct:   484 PASRPPWVTDDSFSQKFAPGKSTTSISKQTLPRGGPAYTPAGPQVP--PLARGTVQRAER 541

Query:   314 APS 316
              P+
Sbjct:   542 FPA 544


>UNIPROTKB|G7N928 [details] [associations]
            symbol:EGK_04858 "Putative uncharacterized protein"
            species:9544 "Macaca mulatta" [GO:0005201 "extracellular matrix
            structural constituent" evidence=ISS] [GO:0005587 "collagen type
            IV" evidence=ISS] [GO:0005605 "basal lamina" evidence=ISS]
            [GO:0032836 "glomerular basement membrane development"
            evidence=ISS] InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403
            SMART:SM00111 InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005605
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 GO:GO:0005587
            Gene3D:2.170.240.10 GO:GO:0032836 EMBL:CM001264 Uniprot:G7N928
        Length = 1692

 Score = 145 (56.1 bits), Expect = 2.8e-06, P = 2.8e-06
 Identities = 81/261 (31%), Positives = 100/261 (38%)

Query:   143 GRPVGQNA-YEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 201
             G  V  N  Y    G P   GPP      G  GA P  S S     + GTP     +IP 
Sbjct:   663 GDTVSCNVTYPGRQGPPGFDGPPGPKGFPGPQGA-PGLSGSDGHKGRPGTP--GTSEIPG 719

Query:   202 GPGYEASKG-PGYDASKAPSYDPTKGPSYDPA----KG-PGYDPTKGP-GYDAQKG-SNY 253
              PG+    G PG+   K  S     GP   P     KG PG DP  G  G   ++G S  
Sbjct:   720 PPGFRGDIGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPG-DPAFGHLGPPGKRGLSGV 778

Query:   254 DAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRG-PGYETQRVPGYDVQRG-PVY 309
                +GP  D    P  +   G+ G+   +GP   +   G PG      PG+  +RG P  
Sbjct:   779 PGIKGPRGDPGY-PGAEGPAGIPGFPGLKGPKGREGHAGFPGVPGP--PGHSCERGAPGI 835

Query:   310 EAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNN 368
               Q  P      G PG    +GQ  D+   P   P+   G  G P     HG  PP L  
Sbjct:   836 PGQ--PGLPGDPGSPGAPGGKGQPGDV--GPP-GPAGMKGLPGLPGRPGAHG--PPGLPG 888

Query:   369 VP--YGSATPPARSGSGQPRG 387
             +P  +G    P   G   PRG
Sbjct:   889 IPGPFGDDGLPGPPGPKGPRG 909

 Score = 141 (54.7 bits), Expect = 7.6e-06, P = 7.6e-06
 Identities = 77/252 (30%), Positives = 97/252 (38%)

Query:   159 QGH-GPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKG-PGYD 214
             +GH G P      G  G  G    T +   T  G      +D P GP G+   +G PG  
Sbjct:   641 RGHPGVPGRPGVRGPDGLKGQKGDTVSCNVTYPGRQGPPGFDGPPGPKGFPGPQGAPGLS 700

Query:   215 ASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGP----NYDIHRGPS 268
              S      P T G S  P   PG+    G PG+  +KGS+     GP      +  +G  
Sbjct:   701 GSDGHKGRPGTPGTSEIPGP-PGFRGDIGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIP 759

Query:   269 YDPQRG-LGYDMQRG----PNYDMQRG-PGYETQR----VPGYDVQRGPVYEAQRA--PS 316
              DP  G LG   +RG    P     RG PGY        +PG+   +GP      A  P 
Sbjct:   760 GDPAFGHLGPPGKRGLSGVPGIKGPRGDPGYPGAEGPAGIPGFPGLKGPKGREGHAGFPG 819

Query:   317 YIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSAT 375
              +P   PG+  +RG  G   +     DP    G  GAP G    G V PP      G   
Sbjct:   820 -VPGP-PGHSCERGAPGIPGQPGLPGDP----GSPGAPGGKGQPGDVGPPGPAGMKGLPG 873

Query:   376 PPARSGSGQPRG 387
              P R G+  P G
Sbjct:   874 LPGRPGAHGPPG 885

 Score = 124 (48.7 bits), Expect = 0.00055, P = 0.00055
 Identities = 81/259 (31%), Positives = 100/259 (38%)

Query:   145 PVGQNAYEDGY-GVP--QGH-GPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDI 199
             PVG      G+ G P  +GH G P      G  G  G    T +   T  G      +D 
Sbjct:   626 PVGPPGL--GFPGPPGERGHPGVPGRPGVRGPDGLKGQKGDTVSCNVTYPGRQGPPGFDG 683

Query:   200 PRGP-GYEASKG-PGYDASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDA 255
             P GP G+   +G PG   S      P T G S  P   PG+    G PG+  +KGS+   
Sbjct:   684 PPGPKGFPGPQGAPGLSGSDGHKGRPGTPGTSEIPGP-PGFRGDIGDPGFGGEKGSSPVG 742

Query:   256 QRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE-TQRVPGYDVQRG-PVYEAQR 313
               GP       P  + Q+G+  D    P +     PG      VPG    RG P Y    
Sbjct:   743 PPGPP----GSPGVNGQKGIPGD----PAFGHLGPPGKRGLSGVPGIKGPRGDPGYPGAE 794

Query:   314 APSYIPQRGPGYDLQRG-QGYDMRRAPSYDPSRGTGFD-GAPRGAAPHGQVPPPLNNVPY 371
              P+ IP   PG    +G +G+     P      G   + GAP    P GQ  P L   P 
Sbjct:   795 GPAGIPGF-PGLKGPKGREGH--AGFPGVPGPPGHSCERGAP--GIP-GQ--PGLPGDP- 845

Query:   372 GSATPPARSGSGQPRGGNP 390
             GS  P A  G GQP    P
Sbjct:   846 GS--PGAPGGKGQPGDVGP 862


>UNIPROTKB|G7PK77 [details] [associations]
            symbol:EGM_04376 "Putative uncharacterized protein"
            species:9541 "Macaca fascicularis" [GO:0005201 "extracellular
            matrix structural constituent" evidence=ISS] [GO:0005587 "collagen
            type IV" evidence=ISS] [GO:0005605 "basal lamina" evidence=ISS]
            [GO:0032836 "glomerular basement membrane development"
            evidence=ISS] InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403
            SMART:SM00111 InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005605
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 GO:GO:0005587
            Gene3D:2.170.240.10 GO:GO:0032836 EMBL:CM001287 Uniprot:G7PK77
        Length = 1695

 Score = 145 (56.1 bits), Expect = 2.8e-06, P = 2.8e-06
 Identities = 81/261 (31%), Positives = 100/261 (38%)

Query:   143 GRPVGQNA-YEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 201
             G  V  N  Y    G P   GPP      G  GA P  S S     + GTP     +IP 
Sbjct:   663 GDTVSCNVTYPGRQGPPGFDGPPGPKGFPGPQGA-PGLSGSDGHKGRPGTP--GTSEIPG 719

Query:   202 GPGYEASKG-PGYDASKAPSYDPTKGPSYDPA----KG-PGYDPTKGP-GYDAQKG-SNY 253
              PG+    G PG+   K  S     GP   P     KG PG DP  G  G   ++G S  
Sbjct:   720 PPGFRGDIGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPG-DPAFGHLGPPGKRGLSGV 778

Query:   254 DAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRG-PGYETQRVPGYDVQRG-PVY 309
                +GP  D    P  +   G+ G+   +GP   +   G PG      PG+  +RG P  
Sbjct:   779 PGIKGPRGDPGY-PGAEGPAGIPGFPGLKGPKGREGHAGFPGVPGP--PGHSCERGAPGI 835

Query:   310 EAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNN 368
               Q  P      G PG    +GQ  D+   P   P+   G  G P     HG  PP L  
Sbjct:   836 PGQ--PGLPGDPGSPGAPGGKGQPGDV--GPP-GPAGMKGLPGLPGRPGAHG--PPGLPG 888

Query:   369 VP--YGSATPPARSGSGQPRG 387
             +P  +G    P   G   PRG
Sbjct:   889 IPGPFGDDGLPGPPGPKGPRG 909

 Score = 141 (54.7 bits), Expect = 7.6e-06, P = 7.6e-06
 Identities = 77/252 (30%), Positives = 97/252 (38%)

Query:   159 QGH-GPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKG-PGYD 214
             +GH G P      G  G  G    T +   T  G      +D P GP G+   +G PG  
Sbjct:   641 RGHPGVPGRPGVRGPDGLKGQKGDTVSCNVTYPGRQGPPGFDGPPGPKGFPGPQGAPGLS 700

Query:   215 ASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGP----NYDIHRGPS 268
              S      P T G S  P   PG+    G PG+  +KGS+     GP      +  +G  
Sbjct:   701 GSDGHKGRPGTPGTSEIPGP-PGFRGDIGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIP 759

Query:   269 YDPQRG-LGYDMQRG----PNYDMQRG-PGYETQR----VPGYDVQRGPVYEAQRA--PS 316
              DP  G LG   +RG    P     RG PGY        +PG+   +GP      A  P 
Sbjct:   760 GDPAFGHLGPPGKRGLSGVPGIKGPRGDPGYPGAEGPAGIPGFPGLKGPKGREGHAGFPG 819

Query:   317 YIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSAT 375
              +P   PG+  +RG  G   +     DP    G  GAP G    G V PP      G   
Sbjct:   820 -VPGP-PGHSCERGAPGIPGQPGLPGDP----GSPGAPGGKGQPGDVGPPGPAGMKGLPG 873

Query:   376 PPARSGSGQPRG 387
              P R G+  P G
Sbjct:   874 LPGRPGAHGPPG 885

 Score = 124 (48.7 bits), Expect = 0.00055, P = 0.00055
 Identities = 81/259 (31%), Positives = 100/259 (38%)

Query:   145 PVGQNAYEDGY-GVP--QGH-GPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDI 199
             PVG      G+ G P  +GH G P      G  G  G    T +   T  G      +D 
Sbjct:   626 PVGPPGL--GFPGPPGERGHPGVPGRPGVRGPDGLKGQKGDTVSCNVTYPGRQGPPGFDG 683

Query:   200 PRGP-GYEASKG-PGYDASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDA 255
             P GP G+   +G PG   S      P T G S  P   PG+    G PG+  +KGS+   
Sbjct:   684 PPGPKGFPGPQGAPGLSGSDGHKGRPGTPGTSEIPGP-PGFRGDIGDPGFGGEKGSSPVG 742

Query:   256 QRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE-TQRVPGYDVQRG-PVYEAQR 313
               GP       P  + Q+G+  D    P +     PG      VPG    RG P Y    
Sbjct:   743 PPGPP----GSPGVNGQKGIPGD----PAFGHLGPPGKRGLSGVPGIKGPRGDPGYPGAE 794

Query:   314 APSYIPQRGPGYDLQRG-QGYDMRRAPSYDPSRGTGFD-GAPRGAAPHGQVPPPLNNVPY 371
              P+ IP   PG    +G +G+     P      G   + GAP    P GQ  P L   P 
Sbjct:   795 GPAGIPGF-PGLKGPKGREGH--AGFPGVPGPPGHSCERGAP--GIP-GQ--PGLPGDP- 845

Query:   372 GSATPPARSGSGQPRGGNP 390
             GS  P A  G GQP    P
Sbjct:   846 GS--PGAPGGKGQPGDVGP 862


>TAIR|locus:2043530 [details] [associations]
            symbol:AT2G25970 "AT2G25970" species:3702 "Arabidopsis
            thaliana" [GO:0003676 "nucleic acid binding" evidence=ISS]
            [GO:0003723 "RNA binding" evidence=IEA] [GO:0005829 "cytosol"
            evidence=IDA] [GO:0006606 "protein import into nucleus"
            evidence=RCA] InterPro:IPR004087 InterPro:IPR004088 Pfam:PF00013
            PROSITE:PS50084 SMART:SM00322 GO:GO:0005829 EMBL:CP002685
            GenomeReviews:CT485783_GR GO:GO:0003723 EMBL:AC004747 EMBL:AC005395
            eggNOG:NOG300923 KO:K13210 HSSP:Q9UNW9 EMBL:AY078954 EMBL:AK226845
            IPI:IPI00540360 PIR:T02627 RefSeq:NP_180167.1 UniGene:At.21555
            ProteinModelPortal:O82762 SMR:O82762 STRING:O82762 PaxDb:O82762
            PRIDE:O82762 ProMEX:O82762 EnsemblPlants:AT2G25970.1 GeneID:817137
            KEGG:ath:AT2G25970 TAIR:At2g25970 HOGENOM:HOG000242545
            InParanoid:O82762 OMA:AANSTQD PhylomeDB:O82762
            ProtClustDB:CLSN2913011 ArrayExpress:O82762 Genevestigator:O82762
            Uniprot:O82762
        Length = 632

 Score = 140 (54.3 bits), Expect = 2.9e-06, P = 2.9e-06
 Identities = 76/283 (26%), Positives = 100/283 (35%)

Query:   120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYE---DGYGVPQGHGPPPSATTAGVVGAG 176
             P   +   GSY   T     + S  P  Q + +   D YG  Q   P    ++A      
Sbjct:   355 PQYGQSPYGSYPQQTSAGYYDQSSVPPSQQSAQGEYDYYGQQQSQQPSSGGSSA------ 408

Query:   177 PNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG 236
             P T T+ Y   Q  +    A     G GY+      Y+AS+   Y    G  YD  +G G
Sbjct:   409 PPTDTTGYNYYQHASGYGQA-----GQGYQQDGYGAYNASQQSGYGQAAG--YDQ-QG-G 459

Query:   237 YDPTKGPGYD---AQKGSNYDAQRGP-NYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPG 292
             Y  T  P  +   +Q      AQ G   Y    G     Q   G   Q G         G
Sbjct:   460 YGSTTNPSQEEDASQAAPPSSAQSGQAGYGT-TGQQPPAQGSTG---QAGYGAPPTSQAG 515

Query:   293 YETQRVPGYDVQRGPVYEAQRAPSY-IPQRGPGYDLQRGQ--GYDMRRAPSYDPSRGTGF 349
             Y +Q    Y+   G    A + P+Y   Q+ PG     G   GY    A  Y      G+
Sbjct:   516 YSSQPAAAYNSGYGAPPPASKPPTYGQSQQSPGAPGSYGSQSGYAQPAASGYGQPPAYGY 575

Query:   350 DGAPRGAAPHGQVPPPLNNVPYGS-ATPPARSGSGQPRGGNPA 391
               AP+G   +G    P     Y S  +  A +G G   GG PA
Sbjct:   576 GQAPQGYGSYGGYTQPAAGGGYSSDGSAGATAGGG---GGTPA 615

 Score = 123 (48.4 bits), Expect = 0.00021, P = 0.00021
 Identities = 69/265 (26%), Positives = 89/265 (33%)

Query:   136 NSENETSGRPVGQN-AYEDGYGV-PQGHGPPPSATTAGVVGAGPNTSTSAYAAT-QSGTP 192
             + EN      +G     + GY   P     PP    A   G G      AY    Q G  
Sbjct:   302 SGENRMRNSAMGGGYPQQGGYQARPPSSWAPPGGPPAQP-GYGGYMQPGAYPGPPQYGQS 360

Query:   193 MRAAYDIPRGPGY-EASKGPGYDASKAPSYDPT-KGPSYDPAKG-PGYDPTKGPGYDA-Q 248
                +Y      GY + S  P    S    YD   +  S  P+ G     PT   GY+  Q
Sbjct:   361 PYGSYPQQTSAGYYDQSSVPPSQQSAQGEYDYYGQQQSQQPSSGGSSAPPTDTTGYNYYQ 420

Query:   249 KGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPV 308
               S Y  Q G  Y      +Y+  +  GY    G  YD Q G G  T   P  +      
Sbjct:   421 HASGY-GQAGQGYQQDGYGAYNASQQSGYGQAAG--YDQQGGYGSTTN--PSQEEDA--- 472

Query:   309 YEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNN 368
               +Q AP    Q G     Q G G   ++ P+   +   G+   P   A +   P    N
Sbjct:   473 --SQAAPPSSAQSG-----QAGYGTTGQQPPAQGSTGQAGYGAPPTSQAGYSSQPAAAYN 525

Query:   369 VPYGSATP---PARSGSGQPRGGNP 390
               YG+  P   P   G  Q   G P
Sbjct:   526 SGYGAPPPASKPPTYGQSQQSPGAP 550

 Score = 107 (42.7 bits), Expect = 0.00012, Sum P(2) = 0.00012
 Identities = 57/201 (28%), Positives = 76/201 (37%)

Query:   202 GPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNY 261
             G GY   +G GY A    S+ P  GP   PA+ PGY      GY  Q G+ Y     P Y
Sbjct:   313 GGGYP-QQG-GYQARPPSSWAPPGGP---PAQ-PGYG-----GY-MQPGA-YPGP--PQY 357

Query:   262 DIHRGPSYDPQRGLGYDMQRG--PNYDMQRGP----GYETQRVPGYDVQRGPVYEAQRAP 315
                   SY  Q   GY  Q    P+    +G     G +  + P       P  +     
Sbjct:   358 GQSPYGSYPQQTSAGYYDQSSVPPSQQSAQGEYDYYGQQQSQQPSSGGSSAPPTDTTGYN 417

Query:   316 SYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG---AAPHGQVPPPLNNVPYG 372
              Y  Q   GY  Q GQGY      +Y+ S+ +G+ G   G      +G    P       
Sbjct:   418 YY--QHASGYG-QAGQGYQQDGYGAYNASQQSGY-GQAAGYDQQGGYGSTTNPSQEEDAS 473

Query:   373 SATPPARSGSGQPRGGNPARR 393
              A PP+ + SGQ   G   ++
Sbjct:   474 QAAPPSSAQSGQAGYGTTGQQ 494

 Score = 63 (27.2 bits), Expect = 0.00012, Sum P(2) = 0.00012
 Identities = 26/107 (24%), Positives = 43/107 (40%)

Query:   108 EVEKLRAELMNA-----PNVDRRADGSYGGATGNSENETSGRPVG---QNAYEDGYGVPQ 159
             + +++ A L+N+     P VD  A   YG   G S   + G+ +     ++    YG  Q
Sbjct:    73 KAQEIAARLLNSADAKRPRVDNGASYDYGDNKGFSSYPSEGKQMSGTVPSSIPVSYGSFQ 132

Query:   160 GHGPP---PSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGP 203
             G       P+     ++G G  T    Y   QSG  ++   D+   P
Sbjct:   133 GTTKKIDIPNMRVGVIIGKGGETIK--YLQLQSGAKIQVTRDMDADP 177


>ZFIN|ZDB-GENE-050809-108 [details] [associations]
            symbol:pygo2 "pygopus homolog 2 (Drosophila)"
            species:7955 "Danio rerio" [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR001965
            InterPro:IPR019787 Pfam:PF00628 PROSITE:PS50016 SMART:SM00249
            ZFIN:ZDB-GENE-050809-108 GO:GO:0046872 GO:GO:0008270
            Gene3D:3.30.40.10 InterPro:IPR011011 InterPro:IPR013083
            SUPFAM:SSF57903 InterPro:IPR019786 PROSITE:PS01359
            GeneTree:ENSGT00530000063948 CTD:90780 OrthoDB:EOG4QZ7MB
            EMBL:CR628394 IPI:IPI00650328 RefSeq:NP_001028283.2
            UniGene:Dr.159286 SMR:Q1L8T6 Ensembl:ENSDART00000131324
            GeneID:613247 KEGG:dre:613247 InParanoid:Q1L8T6 OMA:RFGMPPQ
            NextBio:20898499 Uniprot:Q1L8T6
        Length = 571

 Score = 139 (54.0 bits), Expect = 3.2e-06, P = 3.2e-06
 Identities = 83/301 (27%), Positives = 103/301 (34%)

Query:   117 MNAPNVDRRADGSYGGATGN-SENETSGRP-----VGQNAYEDGYGVPQ---GHGPPPSA 167
             M +P   +R   S G A  + SE      P     V  N ++D +G P    G G P  A
Sbjct:    16 MKSPEKKKRKSNSQGAAFSHLSEFAPPPTPMVDHLVASNPFDDDFGPPSRSAGGGGPGGA 75

Query:   168 TTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGP 227
             T     GAG       Y     G  M        GPG   S  PG      P   P  GP
Sbjct:    76 TFLPSPGAGGG----GYGGP--GR-MGGGMGFMGGPGGPGSGQPGRRPPFGPP-TPNTGP 127

Query:   228 SYDPAKG--PGYDPTKGPGYDA----QKGSNYDAQRGPNYD--IHRGPSYDPQRGLGYDM 279
              +    G  PG+    G G         G        PN+   +H G  ++P    G  M
Sbjct:   128 HHPLGFGGMPGFGGGGGGGGGGGGGFPPGGPSQFNMPPNFSPPMHPGQGFNPMLSPGA-M 186

Query:   280 QRGPNYDMQRGPGYET----QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQR---GQG 332
               GP      GP +      Q+ P +  Q G  + +   P     RGP +       G G
Sbjct:   187 GGGPGGG--GGPPHPRFGMPQQQPPHG-QGGHPFNSPPLPGGPGPRGPPHGPMNPMGGMG 243

Query:   333 YDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPY-GSATPPARSGS--GQPRGGN 389
               M          G    G   G  P GQ PPP +  PY GS+ P    G   G P GG 
Sbjct:   244 GGMNMMGMGGGGGGGNMVGGHPGMPPQGQFPPPQDG-PYPGSSPPVGEEGKNFGGPGGGP 302

Query:   390 P 390
             P
Sbjct:   303 P 303


>UNIPROTKB|P04258 [details] [associations]
            symbol:COL3A1 "Collagen alpha-1(III) chain" species:9913
            "Bos taurus" [GO:0005581 "collagen" evidence=IEA] PROSITE:PS01208
            GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793
            HOGENOM:HOG000085654 HOVERGEN:HBG004933 IPI:IPI00731432 PIR:A02862
            UniGene:Bt.64714 STRING:P04258 PRIDE:P04258 Uniprot:P04258
        Length = 1049

 Score = 142 (55.0 bits), Expect = 3.4e-06, P = 3.4e-06
 Identities = 82/262 (31%), Positives = 97/262 (37%)

Query:   142 SGRPVGQNAYEDGYGVPQ---GHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYD 198
             SG P G+       G P    G GPP      G  G  P    SA      G P      
Sbjct:   521 SGAP-GERGPPGAGGPPGPRGGAGPPGPEGGKGAAGP-PGPPGSAGTPGLQGMPGERGG- 577

Query:   199 IPRGPGYEASKG-PGYDA-SKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQ 256
              P GPG +  KG PG      AP  D  +GP+  P   PG  P   PG   + G+     
Sbjct:   578 -PGGPGPKGDKGEPGSSGVDGAPGKDGPRGPT-GPIGPPG--PAGQPGDKGESGA----- 628

Query:   257 RGPNYDIHRGPSYDP-QRG-LGYDMQRG-PNYDMQRG-PGYETQR-VPGYDVQRGPVYEA 311
               P      GP   P +RG  G     G P    Q G PG + +R  PG   + GP   A
Sbjct:   629 --PGVPGIAGPRGGPGERGEQGPPGPAGFPGAPGQNGEPGAKGERGAPGEKGEGGPPGAA 686

Query:   312 QRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPY 371
               A    P   PG    +G+    R +P      G G  G P G  P G  PP  N  P 
Sbjct:   687 GPAGGSGPAGPPGPQGVKGE----RGSPG-----GPGAAGFPGGRGPPG--PPGSNGNPG 735

Query:   372 --GSATPPARSGSGQPRGGNPA 391
               GS+  P + G   P G N A
Sbjct:   736 PPGSSGAPGKDGPPGPPGSNGA 757

 Score = 139 (54.0 bits), Expect = 7.2e-06, P = 7.2e-06
 Identities = 86/286 (30%), Positives = 103/286 (36%)

Query:   126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTSAY 184
             A G   G  G +       P G + +    G P   GPP     AG  G  GP  +    
Sbjct:    12 AGGGIAGYPGPAGPPGPPGPPGTSGHPGAPGAPGYQGPPGEPGQAGPAGPPGPPGAIGPS 71

Query:   185 AAT-QSGTPMRAAYDIPRG-PGYEASKGP----GYDASKAP-SYDPTKGPSYDPAKGPGY 237
                 +SG P R     PRG PG    KGP    G+   K    +D   G   +P   PG 
Sbjct:    72 GKDGESGRPGRPG---PRGFPGPPGMKGPAGMPGFPGMKGHRGFDGRNGEKGEPG-APGL 127

Query:   238 DPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGP-GYE 294
                 G PG D   G      RG   +  R P      G  G D  RG   D Q GP G  
Sbjct:   128 KGENGVPGEDGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQPGPPGPP 182

Query:   295 -TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 353
              T   PG    +G V  A    S      PG   QRG+      A +  P    G DG+P
Sbjct:   183 GTAGFPGSPGAKGEVGPAGSPGS---SGAPG---QRGEPGPQGHAGAPGPPGPPGSDGSP 236

Query:   354 RGAAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RG--GNPAR 392
              G    G  P  +   P   G+  PP   G+ G P  RG  G P +
Sbjct:   237 GGKGEMG--PAGIPGAPGLIGARGPPGPPGTNGVPGQRGAAGEPGK 280

 Score = 122 (48.0 bits), Expect = 0.00052, P = 0.00052
 Identities = 84/289 (29%), Positives = 101/289 (34%)

Query:   120 PNVDRRADGSYG--GATG----NSENETSG-R-PVGQNAYEDGYGVPQGHGPPPSATTAG 171
             P  +   DGS G  GA G      E    G R P G N      G P   G P  A   G
Sbjct:   304 PKGEDGKDGSPGEPGANGLPGAAGERGVPGFRGPAGANGLPGEKGPPGDRGGPGPAGPRG 363

Query:   172 VVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDP 231
             V G  P  +         G  +R     P GPG     GP     +     P   P   P
Sbjct:   364 VAGE-PGRN-----GLPGGPGLRGIPGSPGGPGSNGKPGPPGSQGETGRPGPPGSPG--P 415

Query:   232 AKGPGYDPTKGP-GYDAQKGSNYDAQRGPNYDIHRGPS-YDPQRG-LGYDMQRGPNYDM- 287
                PG     GP G D   G N + + GP     +GP+  + + G  G     GP+ D  
Sbjct:   416 RGQPGVMGFPGPKGNDGAPGKNGE-RGGPGGPGPQGPAGKNGETGPQGPPGPTGPSGDKG 474

Query:   288 QRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYD-LQRGQGYDMRRAPSYDPSRG 346
               GP    Q + G     GP  E  +     P+   G   +  G+G D   AP     RG
Sbjct:   475 DTGPP-GPQGLQGLPGTSGPPGENGKPGEPGPKGEAGAPGIPGGKG-DSG-APG---ERG 528

Query:   347 TGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNPARR 393
                 G P G  P G   PP      G+A PP   GS    G  G P  R
Sbjct:   529 PPGAGGPPG--PRGGAGPPGPEGGKGAAGPPGPPGSAGTPGLQGMPGER 575


>UNIPROTKB|E2R2K8 [details] [associations]
            symbol:PPP1R10 "Uncharacterized protein" species:9615
            "Canis lupus familiaris" [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0003677 "DNA
            binding" evidence=IEA] InterPro:IPR000571 InterPro:IPR003617
            InterPro:IPR017923 Pfam:PF00642 Pfam:PF08711 PROSITE:PS50103
            PROSITE:PS51319 SMART:SM00356 SMART:SM00509 GO:GO:0005634
            GO:GO:0003677 GO:GO:0008270 GO:GO:0006351 Gene3D:1.20.930.10
            SUPFAM:SSF47676 CTD:5514 OMA:PPPHEHR GeneTree:ENSGT00530000063820
            EMBL:AAEX03008197 RefSeq:XP_848400.1 Ensembl:ENSCAFT00000000645
            Ensembl:ENSCAFT00000048295 GeneID:481705 KEGG:cfa:481705
            NextBio:20856447 Uniprot:E2R2K8
        Length = 940

 Score = 141 (54.7 bits), Expect = 3.8e-06, P = 3.8e-06
 Identities = 68/268 (25%), Positives = 87/268 (32%)

Query:   128 GSYGGATGNSENETSGRPV---GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAY 184
             G +GG  G+      G P    G + + DG G P   GP       G  G GP       
Sbjct:   653 GPHGGPGGSVGPRLLGPPPPPRGGDPFWDGPGDPMRGGP-----MRGGPGPGPGPYHRGR 707

Query:   185 AATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPG 244
                    P       P  P +  ++G G      P+     GP      G G+ P +GPG
Sbjct:   708 GGRGGNEPP------PPPPPFRGARG-GRSGGGPPN--GRGGPGGGMVGGGGHRPHEGPG 758

Query:   245 YDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQ 304
                   S +    GP   +  G  + P  G G  M  G  +    GPG       G+   
Sbjct:   759 GGMNSSSGHRPHEGPGGGM--GGGHRPHEGPGSSMGGGGGHRPHEGPGGGMGSGSGHRPH 816

Query:   305 RGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPP 364
              GP         + P  GPG  +  G G+         P  G G  G P G  PH  VP 
Sbjct:   817 EGPGSGMGGGSGHRPHEGPGGGMGAGGGHR--------PHEGPGH-GGPHGHRPH-DVPS 866

Query:   365 PLNNVPYGSATPPARSGSGQPRGGNPAR 392
                +   G      R   G   GG   R
Sbjct:   867 HRGHDHRGPPPHEHRGHDGPGHGGGGHR 894

 Score = 139 (54.0 bits), Expect = 6.3e-06, P = 6.3e-06
 Identities = 56/215 (26%), Positives = 74/215 (34%)

Query:   132 GATGNSENETSGRPV-GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSG 190
             G  G +E      P  G      G G P G G P      G  G  P+        + SG
Sbjct:   708 GGRGGNEPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGG-GHRPHEGPGGGMNSSSG 766

Query:   191 TPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKG 250
                        G G+   +GPG        + P +GP      G G+ P +GPG     G
Sbjct:   767 HRPHEGPGGGMGGGHRPHEGPGSSMGGGGGHRPHEGPGGGMGSGSGHRPHEGPGSGMGGG 826

Query:   251 SNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVP---GYDVQRGP 307
             S +    GP   +  G  + P  G G+    GP+       G+    VP   G+D  RGP
Sbjct:   827 SGHRPHEGPGGGMGAGGGHRPHEGPGHG---GPH-------GHRPHDVPSHRGHD-HRGP 875

Query:   308 VYEAQRA---PSYIPQRGPGYDLQRGQGYDMRRAP 339
                  R    P +      G+D     G DM   P
Sbjct:   876 PPHEHRGHDGPGHGGGGHRGHDGGHSHGGDMSNRP 910


>ZFIN|ZDB-GENE-030131-1600 [details] [associations]
            symbol:ewsr1b "Ewing sarcoma breakpoint region 1b"
            species:7955 "Danio rerio" [GO:0005622 "intracellular"
            evidence=IEA] [GO:0000166 "nucleotide binding" evidence=IEA]
            [GO:0008270 "zinc ion binding" evidence=IEA] [GO:0003676 "nucleic
            acid binding" evidence=IEA] [GO:0021954 "central nervous system
            neuron development" evidence=IMP] [GO:0007067 "mitosis"
            evidence=IMP] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358
            PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360 SMART:SM00547
            ZFIN:ZDB-GENE-030131-1600 GO:GO:0007067 GO:GO:0000166 GO:GO:0008270
            Gene3D:3.30.70.330 GO:GO:0003676 GO:GO:0005622 GO:GO:0021954
            GeneTree:ENSGT00530000063105 HOGENOM:HOG000038010
            HOVERGEN:HBG000970 EMBL:BX664747 EMBL:BC097019 UniGene:Dr.76923
            SMR:Q4QRG0 STRING:Q4QRG0 Ensembl:ENSDART00000003998 OMA:PVINIYL
            Uniprot:Q4QRG0
        Length = 579

 Score = 142 (55.0 bits), Expect = 3.8e-06, Sum P(2) = 3.8e-06
 Identities = 73/255 (28%), Positives = 96/255 (37%)

Query:   119 APNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPN 178
             AP+    A   YG   G +    +  P         YG PQ   P      A   GA   
Sbjct:    61 APSAGAYAQQQYGSTYGQAAATAAAAPAA-------YGTPQ---PGAYTQPAQSYGASSY 110

Query:   179 TSTSAYAATQSGTPMRAAYDI-PRGPGYE---ASKGP-GYDASKAPSYDPTKGPSYDPAK 233
             T ++A  A Q+    +  Y   P   GY    A+  P  Y AS  P+Y+ +   +Y    
Sbjct:   111 TGSTAAPAAQASYGSQPGYSTQPAYSGYSQQPAASAPQSYSASSQPAYNQS---AYSQPA 167

Query:   234 G---PGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGP--SYDPQRGLGYDMQRGPNYDMQ 288
             G   PGY   + PGY  Q+ S Y  Q  P     +GP  +Y PQ    Y   +   Y  Q
Sbjct:   168 GYSQPGYQAQQ-PGYGQQQQSAY-GQGQPPQQHQQGPPAAYPPQGSSSYAQTQ---YGQQ 222

Query:   289 RGPGYETQRVPGYDVQRGPV---YEAQRAPSYIPQRGPGYDL--QRGQGYDMRRAPSYDP 343
               P  + Q+ P     +G V   Y   +   Y      GYD    RG+G   R       
Sbjct:   223 SAPQNDYQQNPYNSYSQGGVSGGYPGSQRGGYQDGGRDGYDRGGPRGRGMG-RGGMGIAG 281

Query:   344 SRGTGFD--GAP-RG 355
              RG GF+  G P RG
Sbjct:   282 DRG-GFNKPGGPMRG 295

 Score = 139 (54.0 bits), Expect = 8.2e-06, Sum P(2) = 8.2e-06
 Identities = 78/283 (27%), Positives = 100/283 (35%)

Query:   126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYA 185
             A  SYG  T     +T G+   Q   +  Y     +  P +A  A    A P  S  AYA
Sbjct:    15 AQQSYGSYTAPPA-QTYGQTAQQGYTQQDYS---SYAQPAAAPEATYSQAAP--SAGAYA 68

Query:   186 ATQSGTPM-RAAYDIPRGPGYEASKGPGYDASKAPSYDPTK--GPSYDPAKGPGYDPTKG 242
               Q G+   +AA      P    +  PG     A SY  +   G +  PA    Y     
Sbjct:    69 QQQYGSTYGQAAATAAAAPAAYGTPQPGAYTQPAQSYGASSYTGSTAAPAAQASYGSQ-- 126

Query:   243 PGYDAQKG-SNYDAQ---RGP-NYDIHRGPSYDPQRGLGYDMQRG---PNYDMQRGPGYE 294
             PGY  Q   S Y  Q     P +Y     P+Y+      Y    G   P Y  Q+ PGY 
Sbjct:   127 PGYSTQPAYSGYSQQPAASAPQSYSASSQPAYNQS---AYSQPAGYSQPGYQAQQ-PGYG 182

Query:   295 TQRVPGYDVQRGPVYEAQRAPS-YIPQRGPGY-DLQRGQGY----DMRRAPSYDPSRGT- 347
              Q+   Y   + P    Q  P+ Y PQ    Y   Q GQ      D ++ P    S+G  
Sbjct:   183 QQQQSAYGQGQPPQQHQQGPPAAYPPQGSSSYAQTQYGQQSAPQNDYQQNPYNSYSQGGV 242

Query:   348 --GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 388
               G+ G+ RG    G         P G        G    RGG
Sbjct:   243 SGGYPGSQRGGYQDGGRDGYDRGGPRGRGMGRGGMGIAGDRGG 285

 Score = 39 (18.8 bits), Expect = 3.8e-06, Sum P(2) = 3.8e-06
 Identities = 8/16 (50%), Positives = 8/16 (50%)

Query:   377 PARSGSGQPRGGNPAR 392
             P R G G  RGG   R
Sbjct:   410 PMRGGPGMDRGGMMGR 425


>ZFIN|ZDB-GENE-040426-1010 [details] [associations]
            symbol:fus "fusion (involved in t(12;16) in
            malignant liposarcoma)" species:7955 "Danio rerio" [GO:0000166
            "nucleotide binding" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0005622 "intracellular" evidence=IEA] InterPro:IPR000504
            InterPro:IPR001876 InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641
            PROSITE:PS01358 PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360
            SMART:SM00547 ZFIN:ZDB-GENE-040426-1010 GO:GO:0000166 GO:GO:0008270
            Gene3D:3.30.70.330 GO:GO:0003676 GO:GO:0005622
            GeneTree:ENSGT00530000063105 KO:K13098 CTD:2521 EMBL:BX571714
            IPI:IPI00785727 RefSeq:NP_957377.2 UniGene:Dr.114403
            Ensembl:ENSDART00000055340 GeneID:394058 KEGG:dre:394058
            NextBio:20815017 Bgee:F1R0M4 Uniprot:F1R0M4
        Length = 541

 Score = 137 (53.3 bits), Expect = 4.9e-06, P = 4.9e-06
 Identities = 64/250 (25%), Positives = 91/250 (36%)

Query:   120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNT 179
             P+    +  SYGG   N  +E+S  P  Q  Y   YG  Q  G    A + G   +  + 
Sbjct:    28 PSAQNYSQQSYGGY--NQSSESSSAPYNQGGYSSNYGQSQSGGYGSQAPSQGYSQSSQSY 85

Query:   180 STSAYAATQSGTPMRAAYDIPRGPGY-EASKGPGYDASKAPSYDPTKGPSYDPAKGPGYD 238
             S+  Y+ T    P ++        GY + S   GY+ S +P+  P    S   + G G  
Sbjct:    86 SSGGYSNTSQPPPAQSG-------GYSQQSSYSGYNQS-SPASAPGGYSSSSQSSGYGQQ 137

Query:   239 PTK-GPGYDAQKGSN--YDAQRGPNYDIH-RGPSYDPQRGLGYDMQRGPNYDMQRGPGYE 294
               + G GY    G +  Y +  G +      G  +   +  G      PNY       Y 
Sbjct:   138 QQQSGGGYGGSGGQSGGYGSSGGQSSGFGGSGGQHQSSQSGGGSYSPSPNYSSPPPQSYG 197

Query:   295 TQRV---PGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDG 351
              Q      GY+    P+        Y  Q G GY  Q G+G    R   +      GFD 
Sbjct:   198 QQSQYGQGGYNQDSPPMSGGGGGGGYGGQDG-GYS-QDGRG-GRGRGGGFGGRGAGGFDR 254

Query:   352 APRGAAPHGQ 361
               RG  P G+
Sbjct:   255 GGRGG-PRGR 263


>UNIPROTKB|I3LQ53 [details] [associations]
            symbol:I3LQ53 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0006366 "transcription from RNA polymerase II promoter"
            evidence=IEA] [GO:0005665 "DNA-directed RNA polymerase II, core
            complex" evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA]
            InterPro:IPR000684 Pfam:PF05001 PROSITE:PS00115 GO:GO:0003677
            GO:GO:0006366 GO:GO:0005665 GeneTree:ENSGT00700000104490
            EMBL:FP565284 Ensembl:ENSSSCT00000030016 OMA:YAESDYL Uniprot:I3LQ53
        Length = 543

 Score = 137 (53.3 bits), Expect = 5.0e-06, P = 5.0e-06
 Identities = 71/265 (26%), Positives = 95/265 (35%)

Query:   118 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP--SATTAGVVGA 175
             N P +   A G  G   G++ +   G       +  G     G   P   S  T G  G 
Sbjct:    62 NIPGLG--AAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGF 119

Query:   176 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP 235
              P+ ++ A   +   +P  A    P  PG      PG  +   PS      PSY P   P
Sbjct:   120 SPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGAMSPSYSPTS-P 171

Query:   236 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
              Y+P    GY  Q  S Y +   P+Y     PSY P     Y     P+Y     P Y  
Sbjct:   172 AYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPSYS-PTSPSYSP 225

Query:   296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
                P Y     P Y +  +PSY P   P Y       Y    +PSY P+  +    +P  
Sbjct:   226 TS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPTSPSYSPTSP-S 278

Query:   356 AAPHGQVPPPLNNVPYGSATPPARS 380
              +P      P +  P  S T P+ S
Sbjct:   279 YSPTSPSYSPTS--PSYSPTSPSYS 301

 Score = 121 (47.7 bits), Expect = 0.00029, P = 0.00029
 Identities = 63/225 (28%), Positives = 80/225 (35%)

Query:   163 PPPSATTAGVVGAGPNTSTSAYAATQSG-TPMRAAYDIPRGPGYEASKGPGYDAS----- 216
             P  S T+       PN     Y  T    +P   +Y  P  P Y  +  P Y  S     
Sbjct:   333 PSYSPTSPSYSPTSPN-----YTPTSPNYSPTSPSYS-PTSPSYSPTS-PSYSPSSPRYT 385

Query:   217 -KAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL 275
              ++P+Y P+  PSY P+  P Y PT  P Y     S Y     P Y     P Y P    
Sbjct:   386 PQSPTYTPSS-PSYSPSS-PSYSPTS-PKYTPTSPS-YSPS-SPEYT-PTSPKYSPTSPK 439

Query:   276 GYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDM 335
              Y     P Y     P Y +   P Y     P Y +  +P Y P   P Y       Y  
Sbjct:   440 -YS-PTSPKYS-PTSPTY-SPTTPKYS-PTSPTY-SPTSPVYTPT-SPKYS-PTSPTYSP 491

Query:   336 RRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARS 380
               +P Y P+  T    +P+G+      P      P  S T PA S
Sbjct:   492 T-SPKYSPTSPTYSPTSPKGSTYSPTSPGYSPTSPTYSLTSPAIS 535


>UNIPROTKB|F1MXS8 [details] [associations]
            symbol:COL3A1 "Collagen alpha-1(III) chain" species:9913
            "Bos taurus" [GO:0071230 "cellular response to amino acid stimulus"
            evidence=IEA] [GO:0050777 "negative regulation of immune response"
            evidence=IEA] [GO:0048565 "digestive tract development"
            evidence=IEA] [GO:0048407 "platelet-derived growth factor binding"
            evidence=IEA] [GO:0046332 "SMAD binding" evidence=IEA] [GO:0043588
            "skin development" evidence=IEA] [GO:0043206 "extracellular fibril
            organization" evidence=IEA] [GO:0042060 "wound healing"
            evidence=IEA] [GO:0034097 "response to cytokine stimulus"
            evidence=IEA] [GO:0032964 "collagen biosynthetic process"
            evidence=IEA] [GO:0030199 "collagen fibril organization"
            evidence=IEA] [GO:0018149 "peptide cross-linking" evidence=IEA]
            [GO:0009314 "response to radiation" evidence=IEA] [GO:0007507
            "heart development" evidence=IEA] [GO:0007229 "integrin-mediated
            signaling pathway" evidence=IEA] [GO:0007179 "transforming growth
            factor beta receptor signaling pathway" evidence=IEA] [GO:0007160
            "cell-matrix adhesion" evidence=IEA] [GO:0005615 "extracellular
            space" evidence=IEA] [GO:0005586 "collagen type III" evidence=IEA]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA] [GO:0001568 "blood vessel development" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0043588 GO:GO:0005615
            GO:GO:0007507 GO:GO:0034097 GO:GO:0030199 GO:GO:0007179
            GO:GO:0007229 GO:GO:0007160 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0042060 GO:GO:0001568 GO:GO:0048565 GO:GO:0050777
            GO:GO:0009314 GO:GO:0018149 GO:GO:0032964 GO:GO:0071230
            GO:GO:0043206 GO:GO:0005201 GeneTree:ENSGT00660000095287
            IPI:IPI00731432 OMA:EGSPGHP GO:GO:0005586 EMBL:DAAA02003919
            EMBL:DAAA02003920 Ensembl:ENSBTAT00000028617 ArrayExpress:F1MXS8
            Uniprot:F1MXS8
        Length = 1466

 Score = 142 (55.0 bits), Expect = 5.1e-06, P = 5.1e-06
 Identities = 82/262 (31%), Positives = 97/262 (37%)

Query:   142 SGRPVGQNAYEDGYGVPQ---GHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYD 198
             SG P G+       G P    G GPP      G  G  P    SA      G P      
Sbjct:   677 SGAP-GERGPPGAGGPPGPRGGAGPPGPEGGKGAAGP-PGPPGSAGTPGLQGMPGERGG- 733

Query:   199 IPRGPGYEASKG-PGYDA-SKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQ 256
              P GPG +  KG PG      AP  D  +GP+  P   PG  P   PG   + G+     
Sbjct:   734 -PGGPGPKGDKGEPGSSGVDGAPGKDGPRGPT-GPIGPPG--PAGQPGDKGESGA----- 784

Query:   257 RGPNYDIHRGPSYDP-QRG-LGYDMQRG-PNYDMQRG-PGYETQR-VPGYDVQRGPVYEA 311
               P      GP   P +RG  G     G P    Q G PG + +R  PG   + GP   A
Sbjct:   785 --PGVPGIAGPRGGPGERGEQGPPGPAGFPGAPGQNGEPGAKGERGAPGEKGEGGPPGAA 842

Query:   312 QRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPY 371
               A    P   PG    +G+    R +P      G G  G P G  P G  PP  N  P 
Sbjct:   843 GPAGGSGPAGPPGPQGVKGE----RGSPG-----GPGAAGFPGGRGPPG--PPGSNGNPG 891

Query:   372 --GSATPPARSGSGQPRGGNPA 391
               GS+  P + G   P G N A
Sbjct:   892 PPGSSGAPGKDGPPGPPGSNGA 913

 Score = 129 (50.5 bits), Expect = 0.00013, P = 0.00013
 Identities = 78/257 (30%), Positives = 104/257 (40%)

Query:   156 GVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMR--AAYDIPRGP----GYEASK 209
             G P   GPP    +    G   +    AY   +SG      A Y  P GP    G   + 
Sbjct:   130 GSPGSPGPPGICESCPTGGQNYSPQYEAYDV-KSGVAGGGIAGYPGPAGPPGPPGPPGTS 188

Query:   210 G-PGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP----GYDAQKGS-NYDAQRG-PNYD 262
             G PG    KA    P +  SY P   PG     GP    G D + G      +RG P   
Sbjct:   189 GHPGAPHLKAWQKPPQQSTSYSPIGPPGPPGAIGPSGPAGKDGESGRPGRPGERGFPGPP 248

Query:   263 IHRGPSYDP----QRG-LGYDMQRGPNYDMQRGPGYETQR-VPGYDVQRGPVYEAQRAPS 316
               +GP+  P     +G  G+D + G   +    PG + +  VPG +   GP+   + AP 
Sbjct:   249 GMKGPAGMPGFPGMKGHRGFDGRNGEKGETG-APGLKGENGVPGENGAPGPM-GPRGAPG 306

Query:   317 YIPQRG-PGYDLQRG----QGYDMRRAPSYDPSRGT-GFDGAPRGAAPHGQVPPPLNNVP 370
                + G PG    RG    +G D +  P   P  GT GF G+P GA   G+V P     P
Sbjct:   307 ERGRPGLPGAAGARGNDGARGSDGQPGPPGPP--GTAGFPGSP-GAK--GEVGPA--GSP 359

Query:   371 YGSATPPARSGSGQPRG 387
              GS+  P + G   P+G
Sbjct:   360 -GSSGAPGQRGEPGPQG 375

 Score = 123 (48.4 bits), Expect = 0.00060, P = 0.00060
 Identities = 84/289 (29%), Positives = 101/289 (34%)

Query:   120 PNVDRRADGSYG--GATG----NSENETSG-R-PVGQNAYEDGYGVPQGHGPPPSATTAG 171
             P  +   DGS G  GA G      E    G R P G N      G P   G P  A   G
Sbjct:   460 PKGEDGKDGSPGEPGANGLPGAAGERGVPGFRGPAGANGLPGEKGPPGDRGGPGPAGPRG 519

Query:   172 VVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDP 231
             V G  P            G  +R     P GPG +   GP     +     P   P   P
Sbjct:   520 VAGE-PGRD-----GLPGGPGLRGIPGSPGGPGSDGKPGPPGSQGETGRPGPPGSPG--P 571

Query:   232 AKGPGYDPTKGP-GYDAQKGSNYDAQRGPNYDIHRGPS-YDPQRG-LGYDMQRGPNYDM- 287
                PG     GP G D   G N + + GP     +GP+  + + G  G     GP+ D  
Sbjct:   572 RGQPGVMGFPGPKGNDGAPGKNGE-RGGPGGPGPQGPAGKNGETGPQGPPGPTGPSGDKG 630

Query:   288 QRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYD-LQRGQGYDMRRAPSYDPSRG 346
               GP    Q + G     GP  E  +     P+   G   +  G+G D   AP     RG
Sbjct:   631 DTGPP-GPQGLQGLPGTSGPPGENGKPGEPGPKGEAGAPGIPGGKG-DSG-APG---ERG 684

Query:   347 TGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNPARR 393
                 G P G  P G   PP      G+A PP   GS    G  G P  R
Sbjct:   685 PPGAGGPPG--PRGGAGPPGPEGGKGAAGPPGPPGSAGTPGLQGMPGER 731


>UNIPROTKB|J9P8F7 [details] [associations]
            symbol:COL5A1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0005201 GeneTree:ENSGT00700000104155 EMBL:AAEX03006798
            EMBL:AAEX03006799 EMBL:AAEX03006800 Ensembl:ENSCAFT00000044143
            Uniprot:J9P8F7
        Length = 1405

 Score = 141 (54.7 bits), Expect = 6.2e-06, P = 6.2e-06
 Identities = 77/254 (30%), Positives = 100/254 (39%)

Query:   145 PVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG-P 203
             PVG    +   G P   GP  S    G  GA            Q G P  A     +G P
Sbjct:   634 PVGALGLKGSEGPPGPPGPAGSPGERGPAGAAGPIGIPGRPGPQ-GPPGPAG---EKGAP 689

Query:   204 GYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRGPN 260
             G +  +GP G D  + P   P  GP+  P   PG D  KG  G   QKGS  D  ++GP 
Sbjct:   690 GEKGPQGPAGRDGLQGPVGLP--GPA-GPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGPP 746

Query:   261 YDIHRGPSYDPQRGLGYDMQRGPN-YDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIP 319
                  GP+  PQ  +G   Q GP+  D + GP  + Q + G     GP       P  + 
Sbjct:   747 -----GPT-GPQGPIG---QPGPSGADGEPGPRGQ-QGLFGQKGDEGP-RGFPGPPGPVG 795

Query:   320 QRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNV-PYGSATPP 377
              +G PG   ++G+  D+ +     P    G  GAP    P G  P  + N    G    P
Sbjct:   796 LQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGAPGADGPQGP-PGGIGNPGAVGEKGEP 854

Query:   378 ARSGS-GQPRGGNP 390
               +G  G P  G P
Sbjct:   855 GEAGEPGLPGEGGP 868


>UNIPROTKB|E1C0T1 [details] [associations]
            symbol:TFG "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0004871 "signal transducer activity" evidence=IEA]
            [GO:0042802 "identical protein binding" evidence=IEA] [GO:0043123
            "positive regulation of I-kappaB kinase/NF-kappaB cascade"
            evidence=IEA] InterPro:IPR000270 Pfam:PF00564 SMART:SM00666
            GO:GO:0043123 GO:GO:0004871 CTD:10342 KO:K09292 OMA:YTTQTSQ
            GeneTree:ENSGT00510000047809 EMBL:AADN02032793 IPI:IPI00599103
            RefSeq:XP_416608.1 UniGene:Gga.1550 PRIDE:E1C0T1
            Ensembl:ENSGALT00000024692 GeneID:418391 KEGG:gga:418391
            NextBio:20821576 Uniprot:E1C0T1
        Length = 395

 Score = 134 (52.2 bits), Expect = 6.3e-06, P = 6.3e-06
 Identities = 57/210 (27%), Positives = 81/210 (38%)

Query:   175 AGPNTSTSAYAATQSGTP--MRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPA 232
             AGP    SA A  +SGTP  + ++      PG +  + P Y  ++  +    +G  Y   
Sbjct:   194 AGP---PSAPAEERSGTPDSIASSSSAAHPPGVQPQQAP-YPGAQPQTGQQVEGQMYQQY 249

Query:   233 KGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPG 292
             + PGY P + P   AQ    Y  Q    Y   +  S   Q+   Y  Q  P      G G
Sbjct:   250 QQPGY-PAQQP--QAQPQQQYGVQYPAGYSPQQAASQPTQQFPAYSQQPAPAAAFP-GQG 305

Query:   293 YETQRVPGYDVQRGPV--YEAQ----RAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRG 346
              + Q++P    Q+ P   +  Q    +A    P  GP    Q   G    R P + P  G
Sbjct:   306 -QAQQLPAQQPQQYPAGSFPPQPYTTQASQPAPYSGPP-GAQAAPGTFQPR-PGFTPPPG 362

Query:   347 TGFDGAPRGAAPHGQVPPPLNNVPYGSATP 376
             +     P G  P+ +  PP    P G A P
Sbjct:   363 STMTPPPSGPNPYARTRPPFG--PQGYAQP 390

 Score = 133 (51.9 bits), Expect = 8.1e-06, P = 8.1e-06
 Identities = 54/197 (27%), Positives = 70/197 (35%)

Query:   200 PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGP 259
             P  P  E S  P   AS + +  P   P   P + P Y     PG   Q G   + Q   
Sbjct:   197 PSAPAEERSGTPDSIASSSSAAHP---PGVQPQQAP-Y-----PGAQPQTGQQVEGQM-- 245

Query:   260 NYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSY-I 318
              Y  ++ P Y  Q+      Q+   Y +Q   GY  Q+      Q+ P Y  Q AP+   
Sbjct:   246 -YQQYQQPGYPAQQPQAQPQQQ---YGVQYPAGYSPQQAASQPTQQFPAYSQQPAPAAAF 301

Query:   319 PQRGPGYDL--QRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATP 376
             P +G    L  Q+ Q Y     P   P        AP    P  Q  P       G   P
Sbjct:   302 PGQGQAQQLPAQQPQQYPAGSFPP-QPYTTQASQPAPYSGPPGAQAAPGTFQPRPGFTPP 360

Query:   377 PARSGSGQPRGGNPARR 393
             P  + +  P G NP  R
Sbjct:   361 PGSTMTPPPSGPNPYAR 377


>UNIPROTKB|F1LLX1 [details] [associations]
            symbol:Col11a1 "Collagen alpha-1(XI) chain" species:10116
            "Rattus norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 Gene3D:2.60.120.200 InterPro:IPR008985
            InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 InterPro:IPR001791 SMART:SM00282 Pfam:PF02210
            GO:GO:0005201 SMART:SM00210 GeneTree:ENSGT00700000104155
            OMA:HPGKEGQ IPI:IPI00949317 Ensembl:ENSRNOT00000024138
            ArrayExpress:F1LLX1 Uniprot:F1LLX1
        Length = 1803

 Score = 142 (55.0 bits), Expect = 6.4e-06, P = 6.4e-06
 Identities = 87/280 (31%), Positives = 107/280 (38%)

Query:   132 GATGNSENE-TSGR--PVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTST----SA 183
             GA G+   +  SG+  P G   +    G+P   G P      G  G  GP  S     SA
Sbjct:  1003 GAKGDPGPQGISGKDGPAGIRGFPGERGLPGAQGAPGLKGGEGPQGPQGPIGSPGERGSA 1062

Query:   184 YAATQSGTPMRAAYDIPRGP----GYEASKGP----GYDASKAPSYDPTKGPSYDPAKGP 235
               A   G P R     P GP    G    KGP    G D  + P   P  GP+  PA  P
Sbjct:  1063 GTAGPIGLPGRPGPQGPPGPAGEKGAPGEKGPQGPAGRDGVQGPVGLP--GPA-GPAGSP 1119

Query:   236 GYDPTKGP-GYDAQKGSNYD-AQRGPNYDIH-RGPSYDPQRGL-GYDMQRGPN--YDMQR 289
             G D  KG  G   QKGS  D  + GP      +GP   P  G+ G D + GP     M  
Sbjct:  1120 GEDGDKGEIGEPGQKGSKGDKGENGPPGPPGLQGPVGAP--GIAGGDGEAGPRGQQGMFG 1177

Query:   290 GPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDL-QRGQ-GYDMRRAPSYDPSRGT 347
               G E  R  G+    GP+   Q  P    ++G   D+   G  G    R P   P+   
Sbjct:  1178 QKGDEGAR--GFPGPPGPI-GLQGLPGPPGEKGENGDVGPMGPPGPPGPRGPQ-GPNGAD 1233

Query:   348 GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 387
             G  G P      G V         G+  PP  +GSG P+G
Sbjct:  1234 GPQGPPGSIGSVGGVGEKGEPGEAGNPGPPGEAGSGGPKG 1273


>RGD|2372 [details] [associations]
            symbol:Col11a1 "collagen, type XI, alpha 1" species:10116 "Rattus
          norvegicus" [GO:0001502 "cartilage condensation" evidence=ISO]
          [GO:0001503 "ossification" evidence=IEP] [GO:0002063 "chondrocyte
          development" evidence=ISO] [GO:0003007 "heart morphogenesis"
          evidence=ISO] [GO:0005201 "extracellular matrix structural
          constituent" evidence=TAS] [GO:0005581 "collagen" evidence=ISO]
          [GO:0005592 "collagen type XI" evidence=ISO] [GO:0006029
          "proteoglycan metabolic process" evidence=ISO] [GO:0007601 "visual
          perception" evidence=ISO] [GO:0007605 "sensory perception of sound"
          evidence=ISO] [GO:0030199 "collagen fibril organization"
          evidence=ISO;TAS] [GO:0031012 "extracellular matrix"
          evidence=ISO;IDA] [GO:0035989 "tendon development" evidence=ISO]
          [GO:0042472 "inner ear morphogenesis" evidence=ISO] [GO:0048704
          "embryonic skeletal system morphogenesis" evidence=ISO] [GO:0048705
          "skeletal system morphogenesis" evidence=ISO] [GO:0050910 "detection
          of mechanical stimulus involved in sensory perception of sound"
          evidence=ISO] [GO:0051216 "cartilage development" evidence=ISO]
          [GO:0055010 "ventricular cardiac muscle tissue morphogenesis"
          evidence=ISO] InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078
          PROSITE:PS51461 SMART:SM00038 RGD:2372 GO:GO:0046872 GO:GO:0007601
          GO:GO:0030199 Gene3D:2.60.120.200 InterPro:IPR008985
          InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0042472 GO:GO:0050910
          GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793
          InterPro:IPR001791 SMART:SM00282 PROSITE:PS50025 GO:GO:0001502
          GO:GO:0048704 GO:GO:0006029 GO:GO:0055010 Pfam:PF02210 GO:GO:0005201
          GO:GO:0002063 HOGENOM:HOG000085654 KO:K06236 HOVERGEN:HBG103137
          OrthoDB:EOG49GKHM SMART:SM00210 GeneTree:ENSGT00700000104155 CTD:1301
          EMBL:AABR03012126 EMBL:AABR03013126 EMBL:AABR03014171
          EMBL:AABR03015382 EMBL:AABR03015832 EMBL:AABR03016562
          EMBL:AABR03017847 EMBL:AABR03017951 EMBL:AABR03018245
          EMBL:AABR03019675 EMBL:AABR03023874 EMBL:U20116 EMBL:U20118
          EMBL:U20121 IPI:IPI00189470 IPI:IPI00189494 IPI:IPI00325589
          IPI:IPI00949317 IPI:IPI00959233 PIR:B31795 RefSeq:NP_037249.1
          UniGene:Rn.260 IntAct:P20909 STRING:P20909 PhosphoSite:P20909
          PRIDE:P20909 Ensembl:ENSRNOT00000023693 Ensembl:ENSRNOT00000068413
          GeneID:25654 KEGG:rno:25654 UCSC:RGD:2372 InParanoid:P20909
          NextBio:607535 ArrayExpress:P20909 Genevestigator:P20909
          GermOnline:ENSRNOG00000023148 Uniprot:P20909
        Length = 1804

 Score = 142 (55.0 bits), Expect = 6.4e-06, P = 6.4e-06
 Identities = 87/280 (31%), Positives = 107/280 (38%)

Query:   132 GATGNSENE-TSGR--PVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTST----SA 183
             GA G+   +  SG+  P G   +    G+P   G P      G  G  GP  S     SA
Sbjct:  1004 GAKGDPGPQGISGKDGPAGIRGFPGERGLPGAQGAPGLKGGEGPQGPQGPIGSPGERGSA 1063

Query:   184 YAATQSGTPMRAAYDIPRGP----GYEASKGP----GYDASKAPSYDPTKGPSYDPAKGP 235
               A   G P R     P GP    G    KGP    G D  + P   P  GP+  PA  P
Sbjct:  1064 GTAGPIGLPGRPGPQGPPGPAGEKGAPGEKGPQGPAGRDGVQGPVGLP--GPA-GPAGSP 1120

Query:   236 GYDPTKGP-GYDAQKGSNYD-AQRGPNYDIH-RGPSYDPQRGL-GYDMQRGPN--YDMQR 289
             G D  KG  G   QKGS  D  + GP      +GP   P  G+ G D + GP     M  
Sbjct:  1121 GEDGDKGEIGEPGQKGSKGDKGENGPPGPPGLQGPVGAP--GIAGGDGEAGPRGQQGMFG 1178

Query:   290 GPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDL-QRGQ-GYDMRRAPSYDPSRGT 347
               G E  R  G+    GP+   Q  P    ++G   D+   G  G    R P   P+   
Sbjct:  1179 QKGDEGAR--GFPGPPGPI-GLQGLPGPPGEKGENGDVGPMGPPGPPGPRGPQ-GPNGAD 1234

Query:   348 GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 387
             G  G P      G V         G+  PP  +GSG P+G
Sbjct:  1235 GPQGPPGSIGSVGGVGEKGEPGEAGNPGPPGEAGSGGPKG 1274


>UNIPROTKB|P20909 [details] [associations]
            symbol:Col11a1 "Collagen alpha-1(XI) chain" species:10116
            "Rattus norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            [GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000885
            Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461 SMART:SM00038 RGD:2372
            GO:GO:0046872 GO:GO:0007601 GO:GO:0030199 Gene3D:2.60.120.200
            InterPro:IPR008985 InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0042472
            GO:GO:0050910 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            eggNOG:NOG12793 InterPro:IPR001791 SMART:SM00282 PROSITE:PS50025
            GO:GO:0001502 GO:GO:0048704 GO:GO:0006029 GO:GO:0055010
            Pfam:PF02210 GO:GO:0005201 GO:GO:0002063 HOGENOM:HOG000085654
            KO:K06236 HOVERGEN:HBG103137 OrthoDB:EOG49GKHM SMART:SM00210
            GeneTree:ENSGT00700000104155 CTD:1301 EMBL:AABR03012126
            EMBL:AABR03013126 EMBL:AABR03014171 EMBL:AABR03015382
            EMBL:AABR03015832 EMBL:AABR03016562 EMBL:AABR03017847
            EMBL:AABR03017951 EMBL:AABR03018245 EMBL:AABR03019675
            EMBL:AABR03023874 EMBL:U20116 EMBL:U20118 EMBL:U20121
            IPI:IPI00189470 IPI:IPI00189494 IPI:IPI00325589 IPI:IPI00949317
            IPI:IPI00959233 PIR:B31795 RefSeq:NP_037249.1 UniGene:Rn.260
            IntAct:P20909 STRING:P20909 PhosphoSite:P20909 PRIDE:P20909
            Ensembl:ENSRNOT00000023693 Ensembl:ENSRNOT00000068413 GeneID:25654
            KEGG:rno:25654 UCSC:RGD:2372 InParanoid:P20909 NextBio:607535
            ArrayExpress:P20909 Genevestigator:P20909
            GermOnline:ENSRNOG00000023148 Uniprot:P20909
        Length = 1804

 Score = 142 (55.0 bits), Expect = 6.4e-06, P = 6.4e-06
 Identities = 87/280 (31%), Positives = 107/280 (38%)

Query:   132 GATGNSENE-TSGR--PVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTST----SA 183
             GA G+   +  SG+  P G   +    G+P   G P      G  G  GP  S     SA
Sbjct:  1004 GAKGDPGPQGISGKDGPAGIRGFPGERGLPGAQGAPGLKGGEGPQGPQGPIGSPGERGSA 1063

Query:   184 YAATQSGTPMRAAYDIPRGP----GYEASKGP----GYDASKAPSYDPTKGPSYDPAKGP 235
               A   G P R     P GP    G    KGP    G D  + P   P  GP+  PA  P
Sbjct:  1064 GTAGPIGLPGRPGPQGPPGPAGEKGAPGEKGPQGPAGRDGVQGPVGLP--GPA-GPAGSP 1120

Query:   236 GYDPTKGP-GYDAQKGSNYD-AQRGPNYDIH-RGPSYDPQRGL-GYDMQRGPN--YDMQR 289
             G D  KG  G   QKGS  D  + GP      +GP   P  G+ G D + GP     M  
Sbjct:  1121 GEDGDKGEIGEPGQKGSKGDKGENGPPGPPGLQGPVGAP--GIAGGDGEAGPRGQQGMFG 1178

Query:   290 GPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDL-QRGQ-GYDMRRAPSYDPSRGT 347
               G E  R  G+    GP+   Q  P    ++G   D+   G  G    R P   P+   
Sbjct:  1179 QKGDEGAR--GFPGPPGPI-GLQGLPGPPGEKGENGDVGPMGPPGPPGPRGPQ-GPNGAD 1234

Query:   348 GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 387
             G  G P      G V         G+  PP  +GSG P+G
Sbjct:  1235 GPQGPPGSIGSVGGVGEKGEPGEAGNPGPPGEAGSGGPKG 1274


>TAIR|locus:2077547 [details] [associations]
            symbol:AT3G07030 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005829
            "cytosol" evidence=IDA] InterPro:IPR002775 Pfam:PF01918
            GO:GO:0005829 EMBL:CP002686 GO:GO:0003676 IPI:IPI00519674
            RefSeq:NP_187359.2 UniGene:At.74527 ProteinModelPortal:F4JD88
            SMR:F4JD88 PRIDE:F4JD88 EnsemblPlants:AT3G07030.1 GeneID:3768790
            KEGG:ath:AT3G07030 OMA:ERRNDGY Uniprot:F4JD88
        Length = 405

 Score = 134 (52.2 bits), Expect = 6.6e-06, P = 6.6e-06
 Identities = 57/209 (27%), Positives = 72/209 (34%)

Query:   149 NAY-EDGYGVPQGHGPPP--SATTAGVVGAGPNTSTSAYAATQS-GTPMRA-AYDI-PRG 202
             NAY E+G  V +G         TT GV+      +      T   G   RA A D+    
Sbjct:   150 NAYGEEGEVVAEGEAGEEVDMETTKGVMKEKTKGTIKKIIKTMKVGIQTRAEAVDVVDEA 209

Query:   203 PGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYD 262
                   +G GY   +   Y   +   Y   +  GY   +   Y   +   Y   R   Y 
Sbjct:   210 MAIVGGRG-GYGGGRDGGYGGGRDDGYGERRNDGYGERRNDRYGGGRDDGYGGGRDDGYG 268

Query:   263 IHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG 322
               R   Y  +RG G+   RG   D   G G       G    +G  Y   R   Y   RG
Sbjct:   269 GGRNDGYGGRRG-GFRGGRGGGRDEGYGGG--RGGYGGRSGGQGDGYGGGRGDGYGGGRG 325

Query:   323 PGYDLQRGQGYDMRRAPSYDPSRGTGFDG 351
              GY   RG GY   R   YD  R  G+ G
Sbjct:   326 DGYGGGRGDGYGGGRVDRYDGGRRDGYGG 354

 Score = 125 (49.1 bits), Expect = 6.6e-05, P = 6.6e-05
 Identities = 50/158 (31%), Positives = 59/158 (37%)

Query:   201 RGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPN 260
             R  GY   +  GY   +   Y   +G  +   +G G D     GY   +G  Y  + G  
Sbjct:   255 RDDGYGGGRDDGYGGGRNDGYGGRRG-GFRGGRGGGRDE----GYGGGRGG-YGGRSGG- 307

Query:   261 YDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQ 320
                 +G  Y   RG GY   RG  Y   RG GY   RV  YD  R   Y   R   Y   
Sbjct:   308 ----QGDGYGGGRGDGYGGGRGDGYGGGRGDGYGGGRVDRYDGGRRDGYGGGRYDGYGGG 363

Query:   321 RGPGYDLQRGQGYDMRRAPSYDPSRGT-GFDGAPRGAA 357
             +  GY   RG GY   R   Y   RG  G  G  R  A
Sbjct:   364 KSDGYGGGRG-GYRGGRG-GYGRGRGRMGNGGRSRDGA 399

 Score = 122 (48.0 bits), Expect = 0.00014, P = 0.00014
 Identities = 52/181 (28%), Positives = 63/181 (34%)

Query:   127 DGSYGGATGNSENETSGRPVGQNAYED-GYGVPQGHGPPPSATTAGVVGAGPNTSTSAYA 185
             DG YGG   +   E      G+   +  G G   G+G        G  G G N     Y 
Sbjct:   224 DGGYGGGRDDGYGERRNDGYGERRNDRYGGGRDDGYG---GGRDDGY-GGGRN---DGYG 276

Query:   186 ATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGY 245
               + G   R      R  GY   +G GY           +G  Y   +G GY   +G GY
Sbjct:   277 GRRGG--FRGGRGGGRDEGYGGGRG-GYGGRSGG-----QGDGYGGGRGDGYGGGRGDGY 328

Query:   246 DAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQR 305
                +G  Y   R   YD  R   Y   R  GY   +   Y   RG GY   R  GY   R
Sbjct:   329 GGGRGDGYGGGRVDRYDGGRRDGYGGGRYDGYGGGKSDGYGGGRG-GYRGGR-GGYGRGR 386

Query:   306 G 306
             G
Sbjct:   387 G 387


>UNIPROTKB|Q8WML4 [details] [associations]
            symbol:MUC1 "Mucin-1" species:9913 "Bos taurus" [GO:0016324
            "apical plasma membrane" evidence=IBA] [GO:0009986 "cell surface"
            evidence=IBA] [GO:0005737 "cytoplasm" evidence=IBA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] PANTHER:PTHR10006 GO:GO:0016021 GO:GO:0005634
            GO:GO:0005737 GO:GO:0009986 GO:GO:0016324 InterPro:IPR000082
            Pfam:PF01390 SMART:SM00200 PROSITE:PS50024 EMBL:AJ400824
            EMBL:AF399757 IPI:IPI00706283 RefSeq:NP_776540.1 UniGene:Bt.9561
            HSSP:Q16615 ProteinModelPortal:Q8WML4 SMR:Q8WML4 STRING:Q8WML4
            MEROPS:S71.001 Ensembl:ENSBTAT00000014051 GeneID:281333
            KEGG:bta:281333 CTD:4582 eggNOG:NOG77744
            GeneTree:ENSGT00700000104548 HOGENOM:HOG000290201
            HOVERGEN:HBG003075 InParanoid:Q8WML4 KO:K06568 OMA:PPAHGVT
            OrthoDB:EOG4NGGNM NextBio:20805343 PMAP-CutDB:Q8WML4
            ArrayExpress:Q8WML4 InterPro:IPR023217 Uniprot:Q8WML4
        Length = 580

 Score = 136 (52.9 bits), Expect = 7.0e-06, P = 7.0e-06
 Identities = 59/261 (22%), Positives = 99/261 (37%)

Query:   137 SENETSGRPVGQNAYEDGYGVPQGHGPPPS-ATTAGVVGAGPNTSTSAYAATQSGTPMRA 195
             +++  +  P  + ++     +     P PS A + G  GA  +T TS+ A + + +P   
Sbjct:    44 TQSSPTSSPTKETSWSTTTTLLTASSPAPSPAASPGHDGA--STPTSSPAPSPAASPGHD 101

Query:   196 AYDIPRG-PGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYD 254
                 P   P    +  PG+D +  P+  P   P+  P       PT  P         ++
Sbjct:   102 GASTPTSSPAPSPAASPGHDGASTPTSSPAPSPAASPGHDGASTPTSSPAPSPAASPGHN 161

Query:   255 AQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRA 314
                 P       P+  P    G+D    P       P       PG++    P      A
Sbjct:   162 GTSSPT----GSPAPSPAASPGHDGASTPTSSPAPSPAAS----PGHNGTSSPT--GSPA 211

Query:   315 PSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDG--APRGA-APHGQVPPPLNNVPY 371
             PS  P   PG+D   G       +P+  P+   G +G  +P G+ AP     P  ++ P 
Sbjct:   212 PS--PAASPGHD---GASTPTS-SPAPSPAASPGHNGTSSPTGSPAPSPTASPGHDSAPS 265

Query:   372 GSATP-PARSGS-GQPRGGNP 390
              +++P P+ + S GQ    +P
Sbjct:   266 LTSSPAPSPTASPGQHGASSP 286

 Score = 121 (47.7 bits), Expect = 0.00032, P = 0.00032
 Identities = 59/236 (25%), Positives = 82/236 (34%)

Query:   165 PSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASK----GPGYDASKAPS 220
             P +TT     + P   TS    T   T    A      PG++ +      P    + +P 
Sbjct:    40 PVSTTQSSPTSSPTKETSWSTTTTLLTASSPAPSPAASPGHDGASTPTSSPAPSPAASPG 99

Query:   221 YD----PTKGPSYDPAKGPGYD----PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQ 272
             +D    PT  P+  PA  PG+D    PT  P         +D    P       P+  P 
Sbjct:   100 HDGASTPTSSPAPSPAASPGHDGASTPTSSPAPSPAASPGHDGASTPT----SSPAPSPA 155

Query:   273 RGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQG 332
                G++    P       P       PG+D    P   +  APS  P   PG++   G  
Sbjct:   156 ASPGHNGTSSPTGSPAPSPAAS----PGHDGASTPT--SSPAPS--PAASPGHN---GTS 204

Query:   333 YDMRRAPSYDPSRGTGFDGA--PRGA-APHGQVPPPLNNV--PYGSATPPARSGSG 383
                  +P+  P+   G DGA  P  + AP     P  N    P GS  P   +  G
Sbjct:   205 -SPTGSPAPSPAASPGHDGASTPTSSPAPSPAASPGHNGTSSPTGSPAPSPTASPG 259

 Score = 121 (47.7 bits), Expect = 0.00032, P = 0.00032
 Identities = 55/234 (23%), Positives = 80/234 (34%)

Query:   164 PPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDP 223
             P   T      + P +S +   +  + T +  A   P  P   AS  PG+D +  P+  P
Sbjct:    35 PRRTTPVSTTQSSPTSSPTKETSWSTTTTLLTASS-P-APSPAAS--PGHDGASTPTSSP 90

Query:   224 TKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGP 283
                P+  P       PT  P         +D    P       P+  P    G+D    P
Sbjct:    91 APSPAASPGHDGASTPTSSPAPSPAASPGHDGASTPT----SSPAPSPAASPGHDGASTP 146

Query:   284 NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDP 343
                    P       PG++    P      APS  P   PG+D           +P+  P
Sbjct:   147 TSSPAPSPAAS----PGHNGTSSPT--GSPAPS--PAASPGHDGASTPTSSPAPSPAASP 198

Query:   344 SR-GTGFD-GAPR---GAAP-HGQVPPPLNNVPYGSATPPARSGSGQPRGGNPA 391
                GT    G+P     A+P H     P ++     A  P  +G+  P G +PA
Sbjct:   199 GHNGTSSPTGSPAPSPAASPGHDGASTPTSSPAPSPAASPGHNGTSSPTG-SPA 251


>CGD|CAL0000919 [details] [associations]
            symbol:RPO21 species:5476 "Candida albicans" [GO:0005665
            "DNA-directed RNA polymerase II, core complex" evidence=IEA]
            [GO:0005829 "cytosol" evidence=IEA] [GO:0005739 "mitochondrion"
            evidence=IEA] [GO:0030447 "filamentous growth" evidence=IMP]
            [GO:0006366 "transcription from RNA polymerase II promoter"
            evidence=IEA] [GO:0009267 "cellular response to starvation"
            evidence=IMP] [GO:0036170 "filamentous growth of a population of
            unicellular organisms in response to starvation" evidence=IMP]
            [GO:0036180 "filamentous growth of a population of unicellular
            organisms in response to biotic stimulus" evidence=IMP] [GO:0071216
            "cellular response to biotic stimulus" evidence=IMP] [GO:0003899
            "DNA-directed RNA polymerase activity" evidence=IEA] [GO:0003677
            "DNA binding" evidence=IEA] [GO:0003968 "RNA-directed RNA
            polymerase activity" evidence=IEA] InterPro:IPR000684
            InterPro:IPR000722 InterPro:IPR006592 InterPro:IPR007066
            InterPro:IPR007073 InterPro:IPR007075 InterPro:IPR007080
            InterPro:IPR007081 InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983
            Pfam:PF04990 Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000
            Pfam:PF05001 PROSITE:PS00115 SMART:SM00663 CGD:CAL0000919
            GO:GO:0071216 GO:GO:0036180 GO:GO:0003677 GO:GO:0006366
            GO:GO:0009267 Gene3D:2.40.40.20 InterPro:IPR009010
            EMBL:AACQ01000032 GO:GO:0036170 GO:GO:0003899 eggNOG:COG0086
            GO:GO:0005665 KO:K03006 RefSeq:XP_719414.1 STRING:Q5ACI7
            GeneID:3638991 KEGG:cal:CaO19.7655 Uniprot:Q5ACI7
        Length = 1728

 Score = 141 (54.7 bits), Expect = 7.8e-06, P = 7.8e-06
 Identities = 72/234 (30%), Positives = 91/234 (38%)

Query:   116 LMNAPN---VDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGV 172
             L  AP+   +D  ADG  GGAT   + E        NA ++   +  G G  P       
Sbjct:  1452 LQKAPSSMAMDDIADG--GGATPYKDYE--------NARDENIDIDAGAGFSPIHIAQMN 1501

Query:   173 VG-AGPNTSTSAYAATQSGT-PMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYD 230
              G  G  TS      + + T P    Y+    PGY  S G GY  + +PSY PT  PSY 
Sbjct:  1502 EGNIGGLTSYGGQPTSPAATSPFSYGYNSITSPGY-TSPGYGYSPT-SPSYSPTS-PSYA 1558

Query:   231 PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 290
             P   P Y PT  P Y A     Y +   P+Y     P+Y P     Y     P+Y     
Sbjct:  1559 PTS-PAYSPTS-PSY-APTSPAY-SPTSPSY-APTSPAYSPTSP-AYS-PTSPSYS-PTS 1610

Query:   291 PGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPS 344
             P Y     P Y     P Y +  +PSY P   P Y            +PSY P+
Sbjct:  1611 PQYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPAYS---------PTSPSYSPT 1651


>UNIPROTKB|Q5ACI7 [details] [associations]
            symbol:RPO21 "DNA-directed RNA polymerase" species:237561
            "Candida albicans SC5314" [GO:0009267 "cellular response to
            starvation" evidence=IMP] [GO:0030447 "filamentous growth"
            evidence=IMP] [GO:0036170 "filamentous growth of a population of
            unicellular organisms in response to starvation" evidence=IMP]
            [GO:0036180 "filamentous growth of a population of unicellular
            organisms in response to biotic stimulus" evidence=IMP] [GO:0071216
            "cellular response to biotic stimulus" evidence=IMP]
            InterPro:IPR000684 InterPro:IPR000722 InterPro:IPR006592
            InterPro:IPR007066 InterPro:IPR007073 InterPro:IPR007075
            InterPro:IPR007080 InterPro:IPR007081 InterPro:IPR007083
            Pfam:PF00623 Pfam:PF04983 Pfam:PF04990 Pfam:PF04992 Pfam:PF04997
            Pfam:PF04998 Pfam:PF05000 Pfam:PF05001 PROSITE:PS00115
            SMART:SM00663 CGD:CAL0000919 GO:GO:0071216 GO:GO:0036180
            GO:GO:0003677 GO:GO:0006366 GO:GO:0009267 Gene3D:2.40.40.20
            InterPro:IPR009010 EMBL:AACQ01000032 GO:GO:0036170 GO:GO:0003899
            eggNOG:COG0086 GO:GO:0005665 KO:K03006 RefSeq:XP_719414.1
            STRING:Q5ACI7 GeneID:3638991 KEGG:cal:CaO19.7655 Uniprot:Q5ACI7
        Length = 1728

 Score = 141 (54.7 bits), Expect = 7.8e-06, P = 7.8e-06
 Identities = 72/234 (30%), Positives = 91/234 (38%)

Query:   116 LMNAPN---VDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGV 172
             L  AP+   +D  ADG  GGAT   + E        NA ++   +  G G  P       
Sbjct:  1452 LQKAPSSMAMDDIADG--GGATPYKDYE--------NARDENIDIDAGAGFSPIHIAQMN 1501

Query:   173 VG-AGPNTSTSAYAATQSGT-PMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYD 230
              G  G  TS      + + T P    Y+    PGY  S G GY  + +PSY PT  PSY 
Sbjct:  1502 EGNIGGLTSYGGQPTSPAATSPFSYGYNSITSPGY-TSPGYGYSPT-SPSYSPTS-PSYA 1558

Query:   231 PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 290
             P   P Y PT  P Y A     Y +   P+Y     P+Y P     Y     P+Y     
Sbjct:  1559 PTS-PAYSPTS-PSY-APTSPAY-SPTSPSY-APTSPAYSPTSP-AYS-PTSPSYS-PTS 1610

Query:   291 PGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPS 344
             P Y     P Y     P Y +  +PSY P   P Y            +PSY P+
Sbjct:  1611 PQYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPAYS---------PTSPSYSPT 1651


>UNIPROTKB|F1P555 [details] [associations]
            symbol:SFPQ "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0000166 "nucleotide binding" evidence=IEA] [GO:0003676
            "nucleic acid binding" evidence=IEA] [GO:0000380 "alternative mRNA
            splicing, via spliceosome" evidence=IEA] [GO:0016363 "nuclear
            matrix" evidence=IEA] [GO:0042382 "paraspeckles" evidence=IEA]
            InterPro:IPR000504 InterPro:IPR012677 Pfam:PF00076 PROSITE:PS50102
            SMART:SM00360 GO:GO:0000166 Gene3D:3.30.70.330 GO:GO:0003676
            GO:GO:0016363 GO:GO:0000380 GO:GO:0042382 InterPro:IPR012975
            Pfam:PF08075 GeneTree:ENSGT00390000005004 OMA:APGGHPK
            EMBL:AADN02043825 EMBL:AADN02043826 IPI:IPI00574618
            Ensembl:ENSGALT00000003963 ArrayExpress:F1P555 Uniprot:F1P555
        Length = 647

 Score = 136 (52.9 bits), Expect = 8.2e-06, P = 8.2e-06
 Identities = 62/219 (28%), Positives = 89/219 (40%)

Query:   124 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSA-------TTAGVVGAG 176
             R   G  GG   +  +   G  +GQN    G G PQG G PP                A 
Sbjct:    19 RGGGGGRGGPNHDFRSPPPGMGMGQNRGPMGGG-PQGPGGPPGGGPKSEPPKPPASTSAP 77

Query:   177 PNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDAS-KAPSYDPTKGPSYDPAKGP 235
             P++S+S+ A T      ++    P      A + P   A   APS  P+ GP       P
Sbjct:    78 PSSSSSSSATTAGPAGSQSGPGAPPPSALPAGQPPQQQAQGSAPSSAPS-GPGGQQQPQP 136

Query:   236 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
                P+  P    +KG       GP     +GP   PQ+G G   + GP +  + GPG E+
Sbjct:   137 KPSPSPTPAGGPKKGQGQSPGGGP-----KGPG-GPQQGPGGPHKGGPGH--RGGPGGES 188

Query:   296 QRVPGYDVQRGPVYEAQRAPSYIPQRGP-GYDLQRGQGY 333
             +   G    RG  ++ Q++ S   Q+GP G D    +G+
Sbjct:   189 R---G----RGQQHQGQQSLSL--QQGPAGGDQLSDEGF 218


>UNIPROTKB|F1PHX8 [details] [associations]
            symbol:COL5A1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 Gene3D:2.60.120.200 InterPro:IPR008985
            InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 InterPro:IPR001791 Pfam:PF02210 GO:GO:0005201
            OMA:TIYEGIG SMART:SM00210 GeneTree:ENSGT00700000104155
            EMBL:AAEX03006798 EMBL:AAEX03006799 EMBL:AAEX03006800
            Ensembl:ENSCAFT00000031582 Uniprot:F1PHX8
        Length = 1814

 Score = 141 (54.7 bits), Expect = 8.3e-06, P = 8.3e-06
 Identities = 77/254 (30%), Positives = 100/254 (39%)

Query:   145 PVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG-P 203
             PVG    +   G P   GP  S    G  GA            Q G P  A     +G P
Sbjct:  1043 PVGALGLKGSEGPPGPPGPAGSPGERGPAGAAGPIGIPGRPGPQ-GPPGPAG---EKGAP 1098

Query:   204 GYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRGPN 260
             G +  +GP G D  + P   P  GP+  P   PG D  KG  G   QKGS  D  ++GP 
Sbjct:  1099 GEKGPQGPAGRDGLQGPVGLP--GPA-GPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGPP 1155

Query:   261 YDIHRGPSYDPQRGLGYDMQRGPN-YDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIP 319
                  GP+  PQ  +G   Q GP+  D + GP  + Q + G     GP       P  + 
Sbjct:  1156 -----GPT-GPQGPIG---QPGPSGADGEPGPRGQ-QGLFGQKGDEGP-RGFPGPPGPVG 1204

Query:   320 QRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNV-PYGSATPP 377
              +G PG   ++G+  D+ +     P    G  GAP    P G  P  + N    G    P
Sbjct:  1205 LQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGAPGADGPQGP-PGGIGNPGAVGEKGEP 1263

Query:   378 ARSGS-GQPRGGNP 390
               +G  G P  G P
Sbjct:  1264 GEAGEPGLPGEGGP 1277


>MGI|MGI:2384582 [details] [associations]
            symbol:Zfp768 "zinc finger protein 768" species:10090 "Mus
            musculus" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0003677 "DNA binding" evidence=IEA] [GO:0005622 "intracellular"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0006351
            "transcription, DNA-dependent" evidence=IEA] [GO:0006355
            "regulation of transcription, DNA-dependent" evidence=IEA]
            [GO:0008150 "biological_process" evidence=ND] [GO:0008270 "zinc ion
            binding" evidence=IEA] [GO:0046872 "metal ion binding"
            evidence=IEA] Pfam:PF00096 InterPro:IPR007087 InterPro:IPR013087
            InterPro:IPR015880 PROSITE:PS00028 PROSITE:PS50157 SMART:SM00355
            MGI:MGI:2384582 GO:GO:0005634 GO:GO:0006355 GO:GO:0046872
            GO:GO:0003677 GO:GO:0008270 GO:GO:0006351 eggNOG:COG5048
            Gene3D:3.30.160.60 HOGENOM:HOG000234617
            GeneTree:ENSGT00700000104520 KO:K09228 HSSP:P17028
            HOVERGEN:HBG105926 OMA:SRYESQN OrthoDB:EOG4CNQQT EMBL:AK155155
            EMBL:BC026432 IPI:IPI00153270 RefSeq:NP_666314.1 UniGene:Mm.23031
            ProteinModelPortal:Q8R0T2 SMR:Q8R0T2 IntAct:Q8R0T2 STRING:Q8R0T2
            PhosphoSite:Q8R0T2 PRIDE:Q8R0T2 Ensembl:ENSMUST00000060783
            GeneID:233890 KEGG:mmu:233890 UCSC:uc009jvc.1 CTD:233890
            InParanoid:Q8R0T2 NextBio:381919 Bgee:Q8R0T2 CleanEx:MM_ZFP768
            Genevestigator:Q8R0T2 Uniprot:Q8R0T2
        Length = 568

 Score = 135 (52.6 bits), Expect = 8.8e-06, P = 8.8e-06
 Identities = 70/278 (25%), Positives = 107/278 (38%)

Query:   119 APNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPN 178
             A N     +G      GN + E    P G       +  PQ       +        G  
Sbjct:    32 AGNTSENEEGEISQREGNGDYEVEEIPFGLEPQSPEFE-PQSPEFESQSPRFEPESPGFE 90

Query:   179 TSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYD 238
             + +  +         R+    P+ P +E S+ P Y+  ++P   P + P  +P   P Y+
Sbjct:    91 SRSPGFVPPSPEFAPRSPESDPQSPEFE-SQSPKYEP-RSPGCHP-RSPGCEPGS-PRYE 146

Query:   239 PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ-RGPGYETQR 297
             P K PGY + K   +++Q  P Y+  + P Y+PQ   G  +Q   N + +   P +ETQ 
Sbjct:   147 P-KSPGYGS-KSPEFESQ-SPGYE-SQSPGYEPQNS-GDGVQ---NSEFKTHSPEFETQS 198

Query:   298 VPGYDVQRGPVYEAQRAPSYIPQRGPGYD-LQRGQGYDMRRA-PSYD-PSRGTGFDGAPR 354
                 +    P+   ++ P  I       D   +G G     A P +D PS      GA  
Sbjct:   199 SKFQEGAEMPLSPEEKNPLSISLGVHPLDSFTQGFGEQPTGALPPFDMPS------GALL 252

Query:   355 GAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 392
              A     +  PLN    G+   P R G G+ RGG   R
Sbjct:   253 AAPQFEMLQNPLNLT--GTLRGPGRRG-GRARGGQGPR 287


>MGI|MGI:2157767 [details] [associations]
            symbol:Krtap21-1 "keratin associated protein 21-1"
            species:10090 "Mus musculus" [GO:0001942 "hair follicle
            development" evidence=IMP] [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0005882 "intermediate filament" evidence=IEA] [GO:0007165
            "signal transduction" evidence=IMP] [GO:0008283 "cell
            proliferation" evidence=IMP] [GO:0022405 "hair cycle process"
            evidence=IMP] [GO:0031077 "post-embryonic camera-type eye
            development" evidence=IMP] [GO:0042640 "anagen" evidence=IMP]
            [GO:0043480 "pigment accumulation in tissues" evidence=IMP]
            [GO:0043588 "skin development" evidence=IMP] [GO:0048589
            "developmental growth" evidence=IMP] [GO:0051726 "regulation of
            cell cycle" evidence=IMP] MGI:MGI:2157767 GO:GO:0007165
            GO:GO:0043588 GO:GO:0008283 GO:GO:0005882 GO:GO:0051726
            GO:GO:0042640 GO:GO:0031077 EMBL:AF345297 EMBL:AK003736
            IPI:IPI00126890 UniGene:Mm.46109 HSSP:P10969 Genevestigator:Q925H4
            GO:GO:0043480 Uniprot:Q925H4
        Length = 128

 Score = 111 (44.1 bits), Expect = 9.3e-06, P = 9.3e-06
 Identities = 32/103 (31%), Positives = 32/103 (31%)

Query:   190 GTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQK 249
             G   R  Y    G GY    G GY       Y    G  Y    G GY    G GY    
Sbjct:    14 GYGSRYGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGY 73

Query:   250 GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPG 292
             GS Y    G  Y    G  Y    G GY    G  Y    G G
Sbjct:    74 GSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSRYGCGYGSG 116

 Score = 103 (41.3 bits), Expect = 6.8e-05, P = 6.8e-05
 Identities = 31/98 (31%), Positives = 33/98 (33%)

Query:   204 GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDI 263
             GY    G GY       Y    G  Y    G GY    G GY    GS Y    G  Y  
Sbjct:    20 GYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGC 79

Query:   264 HRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGY 301
               G  Y    G GY    G  Y    G GY ++   GY
Sbjct:    80 GYGSGY----GCGYGSGYGCGYGSGYGCGYGSRYGCGY 113

 Score = 93 (37.8 bits), Expect = 0.00082, P = 0.00082
 Identities = 31/98 (31%), Positives = 32/98 (32%)

Query:   236 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
             GY    G GY    GS Y    G  Y    G  Y    G GY    G  Y    G GY  
Sbjct:    20 GYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGC 79

Query:   296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGY 333
                 GY    G  Y       Y    G GY  + G GY
Sbjct:    80 GYGSGYGCGYGSGYGCGYGSGY----GCGYGSRYGCGY 113


>UNIPROTKB|F1N474 [details] [associations]
            symbol:COL4A5 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0031594 "neuromuscular junction" evidence=IEA]
            [GO:0007528 "neuromuscular junction development" evidence=IEA]
            [GO:0005605 "basal lamina" evidence=IEA] [GO:0005587 "collagen type
            IV" evidence=IEA] [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] InterPro:IPR001442 Pfam:PF01413
            PROSITE:PS51403 SMART:SM00111 GO:GO:0007528 InterPro:IPR016187
            SUPFAM:SSF56436 GO:GO:0031594 GO:GO:0005605 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 GO:GO:0005587 Gene3D:2.170.240.10
            GeneTree:ENSGT00690000101772 EMBL:DAAA02071513 EMBL:DAAA02071512
            IPI:IPI00729819 Ensembl:ENSBTAT00000019400 OMA:MPMNMEP
            Uniprot:F1N474
        Length = 1688

 Score = 140 (54.3 bits), Expect = 9.8e-06, P = 9.8e-06
 Identities = 62/203 (30%), Positives = 76/203 (37%)

Query:   200 PRGPGYEASKGP--GYDASKAPSYDPTK-G-PSYDPAKG-PGYDPTKG-PGYDAQKGSNY 253
             P  PG     GP  G    K    +P K G P  D   G PG     G PGY  + G   
Sbjct:   266 PGPPGIRGPPGPPGGVKGEKGEQGEPGKRGKPGKDGENGQPGIPGLPGDPGYPGEPGR-- 323

Query:   254 DAQRGPNYDIHR-GPS--YDPQRGLGYDMQRGPNYDMQRGPGYETQR-VPGYDVQRGPVY 309
             D ++G   D    GP     P+ G G  +    N  +   PG +  R  PG  +Q  P  
Sbjct:   324 DGEKGQKGDTGLPGPPGLVIPRPGTGVTVGEKGNIGLPGLPGDKGDRGFPG--IQGPPGL 381

Query:   310 EAQRAPSYI-PQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNN 368
                  P+ I P   PG+  +RGQ  D    P        G DG P    P G   PP  +
Sbjct:   382 PGPPGPAVIGPPGPPGFPGERGQKGD-EGPPGISIPGSPGLDGQPGAPGPPGPPGPPGPH 440

Query:   369 VPYGS----ATPPARSGSGQPRG 387
             +P       A PP   GS   RG
Sbjct:   441 IPPSDKICEAGPPGPPGSPGDRG 463


>FB|FBgn0003277 [details] [associations]
            symbol:RpII215 "RNA polymerase II 215kD subunit" species:7227
            "Drosophila melanogaster" [GO:0005665 "DNA-directed RNA polymerase
            II, core complex" evidence=ISS;NAS;IDA] [GO:0005703 "polytene
            chromosome puff" evidence=IDA] [GO:0006366 "transcription from RNA
            polymerase II promoter" evidence=ISS;NAS] [GO:0003899 "DNA-directed
            RNA polymerase activity" evidence=ISS;NAS] [GO:0005634 "nucleus"
            evidence=IDA] [GO:0003677 "DNA binding" evidence=IEA] [GO:0007095
            "mitotic G2 DNA damage checkpoint" evidence=IGI] [GO:0005700
            "polytene chromosome" evidence=IDA] InterPro:IPR000684
            InterPro:IPR000722 InterPro:IPR006592 InterPro:IPR007066
            InterPro:IPR007073 InterPro:IPR007075 InterPro:IPR007080
            InterPro:IPR007081 InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983
            Pfam:PF04990 Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000
            PROSITE:PS00115 SMART:SM00663 GO:GO:0007095 GO:GO:0046872
            GO:GO:0003677 EMBL:AE014298 GO:GO:0006366 Gene3D:2.40.40.20
            InterPro:IPR009010 GO:GO:0005703 GO:GO:0003899 eggNOG:COG0086
            GO:GO:0005665 GeneTree:ENSGT00700000104490 OMA:KVLPWST KO:K03006
            EMBL:M27431 EMBL:M14203 EMBL:M11798 EMBL:M19537 PIR:S04457
            RefSeq:NP_511124.1 UniGene:Dm.2925 ProteinModelPortal:P04052
            SMR:P04052 DIP:DIP-22282N IntAct:P04052 MINT:MINT-970158
            STRING:P04052 PaxDb:P04052 EnsemblMetazoa:FBtr0073542 GeneID:32100
            KEGG:dme:Dmel_CG1554 CTD:32100 FlyBase:FBgn0003277
            InParanoid:P04052 OrthoDB:EOG4QRFJV PhylomeDB:P04052
            GenomeRNAi:32100 NextBio:776837 Bgee:P04052 GermOnline:CG1554
            Uniprot:P04052
        Length = 1887

 Score = 140 (54.3 bits), Expect = 1.1e-05, P = 1.1e-05
 Identities = 65/240 (27%), Positives = 90/240 (37%)

Query:   119 APNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPS-ATTAGVVGAGP 177
             +P     A   Y   T N   +++G     + Y     V   + P     ++    G+G 
Sbjct:  1606 SPTSPLYASPRYASTTPNFNPQSTGYSPSSSGYSPTSPV---YSPTVQFQSSPSFAGSGS 1662

Query:   178 NTST--SAYAATQSG-TPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKG 234
             N  +  +AY+ + S  +P   +Y  P  P Y  S  P Y  + +P Y PT  PSY P   
Sbjct:  1663 NIYSPGNAYSPSSSNYSPNSPSYS-PTSPSYSPSS-PSYSPT-SPCYSPTS-PSYSPTS- 1717

Query:   235 PGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQR------GPNYDMQ 288
             P Y P   P Y +    NY A   P Y     P+Y  Q G+ Y           P+YD  
Sbjct:  1718 PNYTPVT-PSY-SPTSPNYSAS--PQYS-PASPAYS-QTGVKYSPTSPTYSPPSPSYDGS 1771

Query:   289 RGPGYETQRVPGYDVQRGPVYEAQRAPSYIP---QRGPGYDLQ-RGQGYDMRRAPSYDPS 344
              G    T   P Y     P Y +  +P Y P   Q  P       G  Y    +P Y P+
Sbjct:  1772 PGSPQYTPGSPQYS-PASPKY-SPTSPLYSPSSPQHSPSNQYSPTGSTYSAT-SPRYSPN 1828


>TAIR|locus:2035751 [details] [associations]
            symbol:AT1G55170 "AT1G55170" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] EMBL:CP002684 GenomeReviews:CT485782_GR EMBL:AC073944
            EMBL:AY084916 EMBL:BT006117 EMBL:AK118721 IPI:IPI00529305
            RefSeq:NP_564678.1 UniGene:At.37108 ProteinModelPortal:Q9C717
            SMR:Q9C717 PaxDb:Q9C717 PRIDE:Q9C717 EnsemblPlants:AT1G55170.1
            GeneID:841960 KEGG:ath:AT1G55170 TAIR:At1g55170 eggNOG:NOG306311
            InParanoid:Q9C717 OMA:ELHRMNL PhylomeDB:Q9C717
            ProtClustDB:CLSN2688822 ArrayExpress:Q9C717 Genevestigator:Q9C717
            Uniprot:Q9C717
        Length = 283

 Score = 129 (50.5 bits), Expect = 1.1e-05, P = 1.1e-05
 Identities = 35/78 (44%), Positives = 42/78 (53%)

Query:    78 RGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRRADGSYGGATGNS 137
             R   EYEKK   + +E  Q MEKN ++MA EVEKLRAEL     VD R  G +GG+ G +
Sbjct:   185 RDAIEYEKKEKFELMEQRQTMEKNMVSMAREVEKLRAELAT---VDSRPWG-FGGSYGMN 240

Query:   138 ENETSGRPVGQNAYEDGY 155
              N   G   G     D Y
Sbjct:   241 YNNMDGTFRGSYGENDTY 258


>UNIPROTKB|K7EKB2 [details] [associations]
            symbol:TAF15 "TATA-binding protein-associated factor 2N"
            species:9606 "Homo sapiens" [GO:0005622 "intracellular"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            InterPro:IPR001876 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50199
            SMART:SM00547 EMBL:AC015849 HGNC:HGNC:11547 Ensembl:ENST00000585577
            Uniprot:K7EKB2
        Length = 214

 Score = 125 (49.1 bits), Expect = 1.2e-05, P = 1.2e-05
 Identities = 48/140 (34%), Positives = 52/140 (37%)

Query:   204 GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK-GPGYDAQK-GSNYDAQRGPNY 261
             GY    G G D           G   D + G GY   + G GY   + G  Y   RG  Y
Sbjct:    69 GYRGRGGRGGDRGGYGGDRSGGGYGGDRSSGGGYSGDRSGGGYGGDRSGGGYGGDRGGGY 128

Query:   262 DIHRGPSYDPQRGLGY--DMQRGPNYDMQRG--PGYETQRVPGYDVQR-GPVYEAQRAPS 316
                RG  Y   RG GY  D  RG  Y   RG   GY   R  GY   R G  Y   R   
Sbjct:   129 GGDRGGGYGGDRGGGYGGDRSRG-GYGGDRGGGSGYGGDRSGGYGGDRSGGGYGGDRGGG 187

Query:   317 YIPQRGPGYDLQRGQGYDMR 336
             Y   RG GY  + G   D R
Sbjct:   188 YGGDRG-GYGGKMGGRNDYR 206

 Score = 120 (47.3 bits), Expect = 5.0e-05, P = 5.0e-05
 Identities = 47/155 (30%), Positives = 59/155 (38%)

Query:   136 NSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNTSTSAYAATQSGTPMR 194
             N       RP G +    GYG  +G+ G        G  G G + S   Y   +S     
Sbjct:    45 NEPRPEDSRPSGGDFRGRGYGGERGYRGRGGRGGDRG--GYGGDRSGGGYGGDRSSG--- 99

Query:   195 AAYDIPR-GPGYEASK-GPGYDASKAPSYDPTKGPSYDPAKGPGY--DPTKGPGYDAQKG 250
               Y   R G GY   + G GY   +   Y   +G  Y   +G GY  D ++G GY   +G
Sbjct:   100 GGYSGDRSGGGYGGDRSGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRSRG-GYGGDRG 158

Query:   251 --SNYDAQRGPNYDIHR-GPSYDPQRGLGYDMQRG 282
               S Y   R   Y   R G  Y   RG GY   RG
Sbjct:   159 GGSGYGGDRSGGYGGDRSGGGYGGDRGGGYGGDRG 193

 Score = 120 (47.3 bits), Expect = 5.0e-05, P = 5.0e-05
 Identities = 48/170 (28%), Positives = 62/170 (36%)

Query:   120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNT 179
             P   R + G + G     E    GR  G+     GYG  +  G      ++G  G   + 
Sbjct:    49 PEDSRPSGGDFRGRGYGGERGYRGRG-GRGGDRGGYGGDRSGGGYGGDRSSGG-GYSGDR 106

Query:   180 STSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSY--DPTKGPSYDPAKGPGY 237
             S   Y   +SG      Y   RG GY   +G GY   +   Y  D ++G       G G 
Sbjct:   107 SGGGYGGDRSG----GGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRSRG-------GYGG 155

Query:   238 DPTKGPGYDAQKGSNYDAQR-GPNYDIHRGPSYDPQRGLGYDMQRGPNYD 286
             D   G GY   +   Y   R G  Y   RG  Y   RG GY  + G   D
Sbjct:   156 DRGGGSGYGGDRSGGYGGDRSGGGYGGDRGGGYGGDRG-GYGGKMGGRND 204


>UNIPROTKB|F1RFI8 [details] [associations]
            symbol:EWSR1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005634 "nucleus" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0000166 "nucleotide binding" evidence=IEA]
            InterPro:IPR000504 InterPro:IPR001876 InterPro:IPR012677
            Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102
            PROSITE:PS50199 SMART:SM00360 SMART:SM00547 GO:GO:0005634
            GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
            GeneTree:ENSGT00530000063105 OMA:EGTSTGY EMBL:CU640468
            EMBL:CT737304 Ensembl:ENSSSCT00000010930 Uniprot:F1RFI8
        Length = 606

 Score = 121 (47.7 bits), Expect = 1.3e-05, Sum P(2) = 1.3e-05
 Identities = 54/178 (30%), Positives = 75/178 (42%)

Query:   128 GSYGGATGNSENETSGRPV-GQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 180
             G+YG  T  S  +       GQ AY   YG P  G+  P  P A +  V G G    +T+
Sbjct:    43 GTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYSTPTAPQAYSQPVQGYGTGAYDTT 102

Query:   181 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAP-SYDPTKGPSYDPAKGPGYD 238
             T+    TQ+    ++AY   P  P Y   + P   A+ AP SY  T+  SYD +     +
Sbjct:   103 TATVTTTQASYAAQSAYGTQPAYPAY--GQQP---AATAPASYSSTQPTSYDQSSYSQQN 157

Query:   239 PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQ 296
                 P    Q+ S+Y  Q   +Y      SY PQ G  Y   + P+   Q+   Y  Q
Sbjct:   158 TYGQPSSYGQQ-SSYGQQS--SYGQQPPTSYPPQTG-SYS--QAPSQYSQQSSSYGQQ 209

 Score = 57 (25.1 bits), Expect = 1.3e-05, Sum P(2) = 1.3e-05
 Identities = 19/46 (41%), Positives = 21/46 (45%)

Query:   354 RGAAP--HGQ-VPPPLNNVPYGSATP--PA--RSGSGQPRGGNPAR 392
             RG  P   G+ +PPPL   P G   P  P     G G  RGG P R
Sbjct:   404 RGGMPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPR 449

 Score = 49 (22.3 bits), Expect = 8.5e-05, Sum P(2) = 8.5e-05
 Identities = 25/86 (29%), Positives = 33/86 (38%)

Query:   311 AQRAPSYIPQRGPGYDLQRGQGYD--MRRAPSYDPSRGTGFDGAPRG-----AAPHGQVP 363
             A++ P     RG G   + G+G    +R  P      G G  G P G         G  P
Sbjct:   394 ARKKPPMNSMRG-GMPPREGRGMPPPLRGGPG-----GPGGPGGPMGRMGGRGGDRGGFP 447

Query:   364 PPLNNVPYGSATPPARSGSGQPRGGN 389
             P     P GS   P+  G+ Q R G+
Sbjct:   448 P---RGPRGSRGNPSGGGNVQHRAGD 470


>UNIPROTKB|E2RS29 [details] [associations]
            symbol:E2RS29 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0000166 "nucleotide binding" evidence=IEA] InterPro:IPR000504
            InterPro:IPR012677 Pfam:PF00076 PROSITE:PS50102 SMART:SM00360
            GO:GO:0000166 Gene3D:3.30.70.330 GO:GO:0003676
            GeneTree:ENSGT00530000063105 EMBL:AAEX03026460
            Ensembl:ENSCAFT00000019701 Uniprot:E2RS29
        Length = 538

 Score = 133 (51.9 bits), Expect = 1.3e-05, P = 1.3e-05
 Identities = 80/314 (25%), Positives = 115/314 (36%)

Query:    99 EKNYITMATEVEKLRAELMNAPNVDRRADGSYGGATGNSENETSGRPV-GQNAYEDGYGV 157
             ++ Y    T+  +  A+   A    +++ G+YG  T  S  +       GQ AY   YG 
Sbjct:    15 QQGYSAYTTQPTQGYAQTTQA--YGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQ 72

Query:   158 PQ-GHGPP--PSATTAGVVG--AGP-NTSTSAYAATQSGTPMRAAYDI-PRGPGY---EA 207
             P  G+  P  P A +  V G   G  +T+T+    TQ+    ++AY   P  P Y    A
Sbjct:    73 PPAGYTTPTAPQAYSQPVQGYSTGAYDTTTATVTTTQASYEAQSAYGTQPAYPAYGQQPA 132

Query:   208 SKGPG--YDASK-APSYDP--TKGPSYDPAKGPG---YDPTKGPG-YDAQKGSNYDAQRG 258
             +  P    D +K A +  P  + G    P+ G G   Y   + PG Y  Q  +   +   
Sbjct:   133 ATAPARPQDGNKPAETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPP 192

Query:   259 PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYI 318
              +Y   +  SYD Q   G     G      +   Y  Q    Y  Q G  Y   +APS  
Sbjct:   193 TSYSSTQPTSYDQQNTYGQPSSYGQQSSYGQQSSYGQQLPTSYPPQTGS-YS--QAPSQY 249

Query:   319 PQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPA 378
              Q+   Y  Q     D  R+         GF       +  G   P      +      +
Sbjct:   250 SQQSSSYGQQSSFQQDHPRSMGVYGQESGGFSRPGENRSMSGPDNPGRGRGGFDRGDM-S 308

Query:   379 RSGSGQPRGGNPAR 392
             R G G  RGG  AR
Sbjct:   309 RGGRGGGRGGMGAR 322


>UNIPROTKB|F1RYI8 [details] [associations]
            symbol:COL3A1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0071230 "cellular response to amino acid stimulus"
            evidence=IEA] [GO:0050777 "negative regulation of immune response"
            evidence=IEA] [GO:0048565 "digestive tract development"
            evidence=IEA] [GO:0048407 "platelet-derived growth factor binding"
            evidence=IEA] [GO:0046332 "SMAD binding" evidence=IEA] [GO:0043588
            "skin development" evidence=IEA] [GO:0043206 "extracellular fibril
            organization" evidence=IEA] [GO:0042060 "wound healing"
            evidence=IEA] [GO:0034097 "response to cytokine stimulus"
            evidence=IEA] [GO:0032964 "collagen biosynthetic process"
            evidence=IEA] [GO:0030199 "collagen fibril organization"
            evidence=IEA] [GO:0018149 "peptide cross-linking" evidence=IEA]
            [GO:0009314 "response to radiation" evidence=IEA] [GO:0007507
            "heart development" evidence=IEA] [GO:0007229 "integrin-mediated
            signaling pathway" evidence=IEA] [GO:0007179 "transforming growth
            factor beta receptor signaling pathway" evidence=IEA] [GO:0007160
            "cell-matrix adhesion" evidence=IEA] [GO:0005615 "extracellular
            space" evidence=IEA] [GO:0005586 "collagen type III" evidence=IEA]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA] [GO:0001568 "blood vessel development" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0043588 GO:GO:0005615
            GO:GO:0007507 GO:GO:0034097 GO:GO:0030199 GO:GO:0007179
            GO:GO:0007229 GO:GO:0007160 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0042060 GO:GO:0001568 GO:GO:0048565 GO:GO:0050777
            GO:GO:0009314 GO:GO:0018149 GO:GO:0032964 GO:GO:0071230
            GO:GO:0043206 GO:GO:0005201 GeneTree:ENSGT00660000095287 KO:K06236
            CTD:1281 OMA:EGSPGHP GO:GO:0005586 EMBL:CU467671
            RefSeq:NP_001230226.1 UniGene:Ssc.24309 UniGene:Ssc.97562
            Ensembl:ENSSSCT00000017459 GeneID:100152001 KEGG:ssc:100152001
            Uniprot:F1RYI8
        Length = 1466

 Score = 138 (53.6 bits), Expect = 1.4e-05, P = 1.4e-05
 Identities = 85/286 (29%), Positives = 105/286 (36%)

Query:   126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP----NTS 180
             A G  GG  G +       P G + +    G P   GPP     AG  G  GP      S
Sbjct:   166 AGGGIGGYPGPAGPPGPPGPPGVSGHPGAPGSPGYQGPPGEPGQAGPAGPPGPPGAIGPS 225

Query:   181 TSAYAATQSGTPMRAAY-DIPRGPGYEASKG-PGYDASKAP-SYDPTKGPSYDPAKGPGY 237
               A    +SG P R     +P  PG +   G PG+   K    +D   G   D    PG 
Sbjct:   226 GPAGKDGESGRPGRPGERGLPGPPGLKGPAGMPGFPGMKGHRGFDGRNGEKGDTG-APGL 284

Query:   238 DPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGP-GYE 294
                 G PG +   G      RG   +  R P      G  G D  RG   D Q GP G  
Sbjct:   285 KGENGLPGENGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQPGPPGPP 339

Query:   295 -TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 353
              T   PG    +G V  A  +P   P   PG   QRG+      A +  P    G +G+P
Sbjct:   340 GTAGFPGSPGAKGEVGPAG-SPG--PSGSPG---QRGEPGPQGHAGAAGPPGPPGSNGSP 393

Query:   354 RGAAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RG--GNPAR 392
              G    G  P  +   P   G+  PP   G+ G P  RG  G P +
Sbjct:   394 GGKGEMG--PAGIPGAPGLMGARGPPGPPGTNGAPGQRGAAGEPGK 437


>UNIPROTKB|F1NI73 [details] [associations]
            symbol:COL3A1 "Collagen alpha-1(III) chain" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 GeneTree:ENSGT00660000095287
            EMBL:AADN02034558 EMBL:AADN02034559 IPI:IPI01017330
            Ensembl:ENSGALT00000004032 ArrayExpress:F1NI73 Uniprot:F1NI73
        Length = 1260

 Score = 137 (53.3 bits), Expect = 1.5e-05, P = 1.5e-05
 Identities = 83/280 (29%), Positives = 109/280 (38%)

Query:   132 GATGN-SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP-NTSTSAYAATQ 188
             GA G   +N   G P G+       G+P  +G P     AG  G+ GP   S  A    Q
Sbjct:   465 GANGEPGQNGVPGTP-GERGSPGFRGLPGSNGLPGEKGPAGERGSPGPPGPSGPAGDRGQ 523

Query:   189 SGTP----MRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP- 243
              G P    MR    IP  PG +   GP  +  + P      GP+  P   PG     GP 
Sbjct:   524 DGGPGLPGMRGLPGIPGSPGSDGKPGPPGNQGE-PGRSGPPGPA-GPRGQPGVMGFPGPK 581

Query:   244 GYDAQKGSNYDAQRGPNYDIHR-GPS-YDPQRGL-GYDMQRGPNYDM-QRGPGYET--QR 297
             G +   G N   +RGP       GP+  +   GL G     GP  D  + GP      Q 
Sbjct:   582 GNEGAPGKN--GERGPGGPPGTPGPAGKNGDVGLPGPPGPAGPAGDRGEPGPSGSPGLQG 639

Query:   298 VPGYDVQRGPVYEAQRAPSYIPQR---GPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAP 353
             +PG     GP  E  +     P+    GPG+   +G+ G    R  +  P   TG  G P
Sbjct:   640 LPGGP---GPAGENGKPGEPGPKGDIGGPGFPGPKGENGIPGERG-AQGPPGPTGARGGP 695

Query:   354 RGAAPHG-QVPPPLNNVPYGSATPPARSGSGQPRG--GNP 390
               A   G + PP     P G+  P  +   G+ RG  G+P
Sbjct:   696 GPAGSEGAKGPPGPPGAPGGTGLPGLQGMPGE-RGASGSP 734

 Score = 123 (48.4 bits), Expect = 0.00051, P = 0.00051
 Identities = 84/275 (30%), Positives = 104/275 (37%)

Query:   142 SGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 201
             +G P G        G+P   G P      G+ G  P TS +  A    G P +       
Sbjct:   386 AGSP-GNKGEMGPSGIPGAPGLPGGR---GLPGP-PGTSGNPGAKGTPGEPGKNGAKGDP 440

Query:   202 GP-GYEASKG-PGYDASKAPSYDPTKGPSYDPAKG--PGYDPTKG-PGYDAQKGSN-YDA 255
             GP G     G PG  A   P  +  +G + +P +   PG    +G PG+    GSN    
Sbjct:   441 GPKGERGENGTPG--APGPPGEEGKRGANGEPGQNGVPGTPGERGSPGFRGLPGSNGLPG 498

Query:   256 QRGPNYDIHR----GPSYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVPGYDVQRGPVYE 310
             ++GP  +       GPS  P    G D   GP     RG PG      PG D + GP   
Sbjct:   499 EKGPAGERGSPGPPGPS-GPAGDRGQD--GGPGLPGMRGLPGIPGS--PGSDGKPGPPGN 553

Query:   311 AQRAPSYIPQRGP-GYDLQRG-QGYDMRR----APSYDPSRGTGFD-GAPRGAAPHGQV- 362
              Q  P      GP G   Q G  G+   +    AP  +  RG G   G P  A  +G V 
Sbjct:   554 -QGEPGRSGPPGPAGPRGQPGVMGFPGPKGNEGAPGKNGERGPGGPPGTPGPAGKNGDVG 612

Query:   363 -P-PPLNNVPYGSATPPARSGS----GQPRGGNPA 391
              P PP    P G    P  SGS    G P G  PA
Sbjct:   613 LPGPPGPAGPAGDRGEPGPSGSPGLQGLPGGPGPA 647

 Score = 122 (48.0 bits), Expect = 0.00065, P = 0.00065
 Identities = 80/269 (29%), Positives = 105/269 (39%)

Query:   145 PVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAA-YDIPRG 202
             P G N Y+   G P   GP      AG++G AGP          + G P R     IP  
Sbjct:   190 PPGSNGYQGPPGEPGQPGPSGPPGPAGMIGPAGPPGKDG-----EPGRPGRNGDRGIPGL 244

Query:   203 PGYEASKG-PGYDASK-APSYDPTKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGP 259
             PG++   G PG    K A  +D   G   D    PG     G PG +   G      RGP
Sbjct:   245 PGHKGHPGMPGMPGMKGARGFDGKDGAKGDSG-APGPKGEAGQPGANGSPGQ--PGPRGP 301

Query:   260 NYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP-------GYETQRVPGYDVQRGPVYEAQ 312
               +  RG   +P   + Y         + +GP       G+     PG+  + GP   A 
Sbjct:   302 TGE--RGRPGNPGGPVTYRCDIVVFLSLFKGPPGPPGTAGFPGS--PGFKGEAGPPGPAG 357

Query:   313 RAPSYIP-QRG-PGYDLQRG----QGYDMRR-APSYDPSRG-TGFDGAPRGAAPHGQ-VP 363
              + S  P +RG PG   Q G    QG   R  +P      G +G  GAP    P G+ +P
Sbjct:   358 ASGS--PGERGEPGPQGQAGPPGPQGPPGRAGSPGNKGEMGPSGIPGAP--GLPGGRGLP 413

Query:   364 PPLNNVPYGSATPPARSGSGQPRGGNPAR 392
              P    P  S  P A+   G+P G N A+
Sbjct:   414 GP----PGTSGNPGAKGTPGEP-GKNGAK 437


>WB|WBGene00000628 [details] [associations]
            symbol:col-51 species:6239 "Caenorhabditis elegans"
            [GO:0042302 "structural constituent of cuticle" evidence=IEA]
            [GO:0016021 "integral to membrane" evidence=IEA] InterPro:IPR002486
            Pfam:PF01484 SMART:SM01088 GO:GO:0042302 HOGENOM:HOG000085656
            GeneTree:ENSGT00530000064217 EMBL:FO080999 RefSeq:NP_491195.1
            UniGene:Cel.29694 ProteinModelPortal:Q7Z152 MINT:MINT-3384184
            STRING:Q7Z152 EnsemblMetazoa:T28F2.8 GeneID:189052
            KEGG:cel:CELE_T28F2.8 UCSC:T28F2.8 CTD:189052 WormBase:T28F2.8
            eggNOG:NOG245561 InParanoid:Q7Z152 OMA:MMASRRI NextBio:941036
            Uniprot:Q7Z152
        Length = 435

 Score = 131 (51.2 bits), Expect = 1.6e-05, P = 1.6e-05
 Identities = 90/299 (30%), Positives = 102/299 (34%)

Query:   110 EKLRAE-LMNAPNVDRRADGSYGG--ATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPP 165
             EK+  E L  A      A G  GG  A G       G   G +      G P G  GPP 
Sbjct:    84 EKVAFEGLFRAKRQYATAAGGGGGYAAGGGGGGGGGGGGGGCHCAAQASGCPAGPPGPPG 143

Query:   166 SATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRGP----GYEASKGP-GYDASKAP 219
              A T G  G AG +          SG+  +A    P GP    G + + GP G      P
Sbjct:   144 EAGTDGEPGQAGQDGQPGQAGQADSGSSGQACITCPAGPPGPPGPDGNAGPAGAPGVPGP 203

Query:   220 SYD----PTKGPSYDPAKGPGYDPTKG-PGYDAQKGS----NYDAQRGPNYDIHRGPSYD 270
               D    P  GP   P   PG D   G PG D Q G+      ++  GP      GP   
Sbjct:   204 DGDAGSPPPPGPPGPPGP-PGNDGQPGAPGQDGQPGAPGTNTVNSPGGPGPAGPPGPPGP 262

Query:   271 P-QRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQR 329
             P Q G G   Q GP       PG      PG D Q G        P   P  GPG D   
Sbjct:   263 PGQDGSGGAAQPGP-------PG--PPGPPGNDGQPG-------GPGQ-PG-GPGQD--G 302

Query:   330 GQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 388
             G G D    P   P R       P G    G  P       Y +     R+ SG   GG
Sbjct:   303 GPGTDAAYCPC--PPR------TPAGGGGGGDFPAGGGGGGYSTGGGGGRADSGGAAGG 353

 Score = 115 (45.5 bits), Expect = 0.00095, P = 0.00095
 Identities = 76/270 (28%), Positives = 84/270 (31%)

Query:   132 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGT 191
             G  G +    SG   GQ       G P   GP  +A  AG  G  P     A +    G 
Sbjct:   158 GQPGQAGQADSGSS-GQACITCPAGPPGPPGPDGNAGPAGAPGV-PGPDGDAGSPPPPGP 215

Query:   192 PMRAAYDIPRGPGYEASKG-PGYDASK-APSYDPTKGPSYDPAKGPG-YDPTKGPGYDAQ 248
             P       P  PG +   G PG D    AP  +    P      GPG   P   PG   Q
Sbjct:   216 P-----GPPGPPGNDGQPGAPGQDGQPGAPGTNTVNSPG-----GPGPAGPPGPPGPPGQ 265

Query:   249 KGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQ------RVPGYD 302
              GS   AQ GP       P  D Q G G     GP  D   GPG +        R P   
Sbjct:   266 DGSGGAAQPGPPGP-PGPPGNDGQPG-GPGQPGGPGQD--GGPGTDAAYCPCPPRTPAGG 321

Query:   303 VQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG----AAP 358
                G          Y    G G     G       A  Y    G G  GA  G    A  
Sbjct:   322 GGGGDFPAGGGGGGYSTGGGGGRADSGGAAGGAGGAGGYSGGGGGGGGGAAAGGGYNAGG 381

Query:   359 HGQVPPPLNNVPYGSATPPARSGSGQPRGG 388
              G   P     P  +  P A +G G   GG
Sbjct:   382 GGGGAPQAAPAPQAAPAPAAPAGGGYNAGG 411


>UNIPROTKB|Q28009 [details] [associations]
            symbol:FUS "RNA-binding protein FUS" species:9913 "Bos
            taurus" [GO:0045944 "positive regulation of transcription from RNA
            polymerase II promoter" evidence=ISS] [GO:0045893 "positive
            regulation of transcription, DNA-dependent" evidence=ISS]
            [GO:0005737 "cytoplasm" evidence=ISS] [GO:0005634 "nucleus"
            evidence=ISS] [GO:0003723 "RNA binding" evidence=IEA] [GO:0003677
            "DNA binding" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0000166 "nucleotide binding" evidence=IEA]
            InterPro:IPR000504 InterPro:IPR001876 InterPro:IPR012677
            Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102
            PROSITE:PS50199 SMART:SM00360 SMART:SM00547 GO:GO:0005634
            GO:GO:0005737 GO:GO:0000166 GO:GO:0046872 GO:GO:0003677
            GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0045944 GO:GO:0003723
            eggNOG:NOG240581 GeneTree:ENSGT00530000063105 KO:K13098
            HOGENOM:HOG000038010 CTD:2521 EMBL:U26024 EMBL:BC119965
            IPI:IPI00705463 RefSeq:NP_776337.1 UniGene:Bt.2474
            ProteinModelPortal:Q28009 STRING:Q28009 PRIDE:Q28009
            Ensembl:ENSBTAT00000007571 GeneID:280796 KEGG:bta:280796
            InParanoid:Q28009 OrthoDB:EOG4DV5NH NextBio:20804952 Uniprot:Q28009
        Length = 513

 Score = 132 (51.5 bits), Expect = 1.6e-05, P = 1.6e-05
 Identities = 67/237 (28%), Positives = 93/237 (39%)

Query:   128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAAT 187
             G+Y    G   ++ S +P GQ +Y  GYG          ++ +G  G   NT  S  +A 
Sbjct:    15 GAYPTQPGQGYSQQSNQPYGQQSY-GGYGQSTDTSGYGQSSYSGSYGQTQNTGYSTQSAP 73

Query:   188 QSGTPMRAAYDIPRGP--GY-EASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPG 244
             Q G      Y   +     Y + S  PGY    APS   T G     ++  GY   +G G
Sbjct:    74 Q-GYSSAGGYGSSQSSQSSYGQQSSYPGYGQQPAPS--GTSGSYGSSSQSSGYGQPQGGG 130

Query:   245 YDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPG--YETQRVPGYD 302
             Y  Q G  Y  Q+  +Y   +  SY+P +G G   Q   +     G G  Y   +     
Sbjct:   131 YGQQSG--YGGQQ-QSYGQQQ--SYNPPQGYGQQSQYNSSGGGGGGGGGSYGQDQPSMSS 185

Query:   303 VQRGPVYEAQ-RAPSY---IPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
                G  Y  Q ++  Y      RG G     G GY+ R +  Y+P RG G     RG
Sbjct:   186 GGGGGGYGNQDQSGGYGGGQQDRG-GRGRGGGGGYN-RSSGGYEP-RGRGGGRGGRG 239


>ZFIN|ZDB-GENE-070912-607 [details] [associations]
            symbol:col11a1b "collagen, type XI, alpha 1b"
            species:7955 "Danio rerio" [GO:0005201 "extracellular matrix
            structural constituent" evidence=IEA] [GO:0005581 "collagen"
            evidence=IEA] InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078
            PROSITE:PS51461 SMART:SM00038 ZFIN:ZDB-GENE-070912-607
            Gene3D:2.60.120.200 InterPro:IPR008985 InterPro:IPR013320
            SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            InterPro:IPR001791 SMART:SM00282 Pfam:PF02210 GO:GO:0005201
            HOGENOM:HOG000085654 SMART:SM00210 GeneTree:ENSGT00700000104155
            UniGene:Dr.3536 EMBL:BX510342 EMBL:BX547933 EMBL:CT583637
            EMBL:GQ485665 IPI:IPI00511026 RefSeq:NP_001171883.1
            UniGene:Dr.42128 Ensembl:ENSDART00000049589 GeneID:555202
            KEGG:dre:555202 CTD:555202 NextBio:20880850 Uniprot:D6MUD3
        Length = 1815

 Score = 138 (53.6 bits), Expect = 1.8e-05, P = 1.8e-05
 Identities = 71/250 (28%), Positives = 100/250 (40%)

Query:   156 GVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDA 215
             G P  HG P      G  G          + T    P RA  +  +GP   A +     A
Sbjct:   469 GSPGLHGDPGERGPPGRPGLPGGDGAPGPSGTILMLPFRAGGESSKGPVVSAQEAQA-QA 527

Query:   216 SKAPSYDPTKGPSYDPAKGPGYD-PTKGPGYDAQKGSNYDA-QRGPNYDIHRGPSYDP-- 271
               A +    +GP   P    G   P  GPG    KG + D+  +GP     +GP+  P  
Sbjct:   528 ILAQARLTMRGPP-GPMGLTGRSGPVGGPGAPGAKGESGDSGPQGPRG--LQGPTGSPGK 584

Query:   272 --QRGL-GYDMQRG-PNYDMQRGP-GYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYD 326
               +RG  G D  RG P     +G  G++   +PG   ++G  +  ++ P  +P   PG D
Sbjct:   585 PGKRGRNGADGARGIPGESGAKGDRGFDG--LPGLPGEKG--HRGEQGPIGLPG-SPGED 639

Query:   327 LQRGQGYDM--RRAPSYDPSRGT-GFDGAPRGAAPHGQV----PP-PLNNV-PYGSATPP 377
               RG+  ++  R  P     RG  G  G+P  A   G      PP P  N+ P G   PP
Sbjct:   640 GPRGEDGEIGQRGMPGESGPRGLLGPRGSPGTAGQRGLTGLDGPPGPKGNMGPQGEPGPP 699

Query:   378 ARSGSGQPRG 387
              + G+  P G
Sbjct:   700 GQQGNTGPHG 709


>WB|WBGene00000251 [details] [associations]
            symbol:bli-1 species:6239 "Caenorhabditis elegans"
            [GO:0042302 "structural constituent of cuticle" evidence=IEA]
            [GO:0016021 "integral to membrane" evidence=IEA] [GO:0009792
            "embryo development ending in birth or egg hatching" evidence=IMP]
            [GO:0000003 "reproduction" evidence=IMP] [GO:0040002 "collagen and
            cuticulin-based cuticle development" evidence=IMP] [GO:0040011
            "locomotion" evidence=IMP] [GO:0002119 "nematode larval
            development" evidence=IMP] [GO:0018996 "molting cycle, collagen and
            cuticulin-based cuticle" evidence=IMP] [GO:0005578 "proteinaceous
            extracellular matrix" evidence=ISS] [GO:0042329 "structural
            constituent of collagen and cuticulin-based cuticle" evidence=ISS]
            InterPro:IPR002486 InterPro:IPR012613 Pfam:PF01484 Pfam:PF08175
            SMART:SM01088 GO:GO:0009792 GO:GO:0002119 GO:GO:0018996
            GO:GO:0005578 GO:GO:0040011 GO:GO:0000003 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793 GO:GO:0040002
            EMBL:Z46791 PIR:T19140 RefSeq:NP_496311.2 ProteinModelPortal:Q09457
            STRING:Q09457 PaxDb:Q09457 EnsemblMetazoa:C09G5.6 GeneID:174653
            KEGG:cel:CELE_C09G5.6 UCSC:C09G5.6 CTD:174653 WormBase:C09G5.6
            GeneTree:ENSGT00690000102663 HOGENOM:HOG000016778 InParanoid:Q09457
            OMA:WEEHRKS NextBio:884926 GO:GO:0042601 GO:GO:0042329
            GO:GO:0030436 Uniprot:Q09457
        Length = 948

 Score = 135 (52.6 bits), Expect = 1.8e-05, P = 1.8e-05
 Identities = 89/338 (26%), Positives = 120/338 (35%)

Query:    87 FYNDHLESLQVMEK--NYITMATEVEKLRAELMNAPNVDRRA-----DGSYGGATGNSEN 139
             FY++  E L   +   N I      E    E+  A + DR       +G Y   T     
Sbjct:    36 FYSEAQEELVEFKDIANNIWEEMVFELTPEEMREAEDNDREKRSYEPEGPYQSETTTPST 95

Query:   140 ETSGRPVGQNAYED--GYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAY 197
              TS       A ED  GY     +GPP S          P T     A   + T   + Y
Sbjct:    96 TTSTAATTTEAAEDESGYDFVNDNGPPSSRPRKPEPPTMPRTIQGFRAPPPAAT---STY 152

Query:   198 DIPRGPGYEASKGPGYDASKAPSYDPTKGPSYD-PAKGPG-----YDPTKGP--GYDAQK 249
               P G  Y+ + G    +S+ P Y P + PS   P   P      Y+P   P  GY    
Sbjct:   153 RPPHGSNYD-NYGREPASSRRP-YPPQQPPSTSAPHSSPNNRTSLYNPQPPPKTGYPTNP 210

Query:   250 GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP--GYET--QRVPG----Y 301
                Y+  + PNY   R P+Y       Y   R PN    R P  GY++  Q  P     Y
Sbjct:   211 RVPYNPPQ-PNYT--RQPTYPEDNRAPYKPTRSPNTPPPRQPSGGYDSDGQTPPSSPRIY 267

Query:   302 DVQR----GPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAA 357
             + +R    GP Y   + P+  P   PG   QR      R  P+   +R       P    
Sbjct:   268 NTRRPNNHGPGYPEDQVPTAPPV--PGQ--QRVPPTQTRNPPNPTNTRQPSRPVPPTSDG 323

Query:   358 PHGQVPPPLN-NVPYGSATPPARSGSG--QPRGGNPAR 392
              H +   P N +  Y +    +  G G  +PR G   R
Sbjct:   324 -HIEATTPYNPSAQYPTGKRGSHPGFGPQRPRPGTRPR 360

 Score = 131 (51.2 bits), Expect = 4.8e-05, P = 4.8e-05
 Identities = 76/266 (28%), Positives = 102/266 (38%)

Query:   145 PVGQNAYEDGYGVPQGHG----PPPSATTAGVVGAGPNTSTSAY---AATQSGTPM--RA 195
             P G N Y D YG          PP    +     + PN  TS Y      ++G P   R 
Sbjct:   155 PHGSN-Y-DNYGREPASSRRPYPPQQPPSTSAPHSSPNNRTSLYNPQPPPKTGYPTNPRV 212

Query:   196 AYDIPRGPGYEASKGPGY-DASKAPSYDPTKGPSYDPAKGP--GYD-----PTKGPG-YD 246
              Y+ P+ P Y  ++ P Y + ++AP Y PT+ P+  P + P  GYD     P   P  Y+
Sbjct:   213 PYNPPQ-PNY--TRQPTYPEDNRAP-YKPTRSPNTPPPRQPSGGYDSDGQTPPSSPRIYN 268

Query:   247 AQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRG 306
              ++ +N+    GP Y   + P+  P  G     QR P    +  P     R P   V   
Sbjct:   269 TRRPNNH----GPGYPEDQVPTAPPVPG----QQRVPPTQTRNPPNPTNTRQPSRPVPPT 320

Query:   307 PVYEAQRAPSYIPQRGPGYDL-QRGQ--GYDMRRA-PSYDPSRGTGFDGAPRGAAP-HGQ 361
                  +    Y P     Y   +RG   G+  +R  P   P RG   D     A P H  
Sbjct:   321 SDGHIEATTPYNPSAQ--YPTGKRGSHPGFGPQRPRPGTRP-RGNPCDQC--SAQPNHCP 375

Query:   362 VPPPLNNVPYGSATPPARSGSGQPRG 387
               PP    P G   PP   G   PRG
Sbjct:   376 SGPP---GPRGRPGPPGFPGQDGPRG 398

 Score = 130 (50.8 bits), Expect = 6.2e-05, P = 6.2e-05
 Identities = 76/265 (28%), Positives = 97/265 (36%)

Query:   120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPN 178
             P  +R  DG+  G  G    +      GQ+      G P  HG   S  T G  G  G N
Sbjct:   427 PPGERGPDGT-PGVPGEDGIDGEQGVNGQDGQPGAPGAPGYHGMNGSPGTPGKPGLPGRN 485

Query:   179 TSTSAYAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKA----PSYDPTKGPSYDPA- 232
               +        G P      +P   G   + G  G D S      P  D T GP   P  
Sbjct:   486 GQSCKSIPGPPGQP--GVMGVPGRDGDPGTDGEHGQDGSPGIQGPPGRDGTSGPDGQPGV 543

Query:   233 KGPGYDPTKGPGYDA--QKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 290
               PG   T G GY    ++ S +D     N D  RG   +  R  GYD +R      +  
Sbjct:   544 SAPGAPGTDG-GYCPCPKRSSKFDFNDAYNDDEKRG--LEEHRPRGYDSERAE----EPR 596

Query:   291 PGYETQRVPGYDVQRGPVYEAQRAPSY------IPQRGPGY-DLQRGQGYDMRRAPSYDP 343
             P  +T R   YD   G   E QR P+Y       P R   Y D +R +    +R P   P
Sbjct:   597 PR-QTVRTNTYDENSGA--EHQRRPNYEPSAEVAPPRQDRYEDEERVREPPPKRPPP--P 651

Query:   344 SRGTGFDGAPRGAAPHGQVPPPLNN 368
              R T  +  P    P+ + PPP  N
Sbjct:   652 HRQTPHELYPE-EQPYVRRPPPPQN 675

 Score = 122 (48.0 bits), Expect = 0.00046, P = 0.00046
 Identities = 71/243 (29%), Positives = 88/243 (36%)

Query:   163 PPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYD 222
             P P  +   +    P   ++ Y   + G+        PR PG      P    S  P++ 
Sbjct:   316 PVPPTSDGHIEATTPYNPSAQYPTGKRGSHPGFGPQRPR-PGTRPRGNPCDQCSAQPNHC 374

Query:   223 PTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPN--YDIHRGPSYDPQRG-----L 275
             P+ GP   P   PG  P   PG D  +G      RG N  Y   +  SYDP  G     +
Sbjct:   375 PS-GPP-GPRGRPG--PPGFPGQDGPRGL-----RGLNGGYSGVQPSSYDPVIGCVQCPI 425

Query:   276 GYDMQRGPNYDMQRG-PGYE----TQRVPGYDVQRG----PVYEAQRAPSYIPQRGPGYD 326
             G   +RGP  D   G PG +     Q V G D Q G    P Y         P + PG  
Sbjct:   426 GPPGERGP--DGTPGVPGEDGIDGEQGVNGQDGQPGAPGAPGYHGMNGSPGTPGK-PGLP 482

Query:   327 LQRGQGYDMRRAPSYDPS-RGT-GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQ 384
              + GQ       P   P   G  G DG P     HGQ   P      G   PP R G+  
Sbjct:   483 GRNGQSCKSIPGPPGQPGVMGVPGRDGDPGTDGEHGQDGSP------GIQGPPGRDGTSG 536

Query:   385 PRG 387
             P G
Sbjct:   537 PDG 539


>UNIPROTKB|J9P0L0 [details] [associations]
            symbol:COL3A1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 GeneTree:ENSGT00660000095287 KO:K06236
            CTD:1281 EMBL:AAEX03017880 RefSeq:XP_851009.1
            Ensembl:ENSCAFT00000047312 GeneID:478835 KEGG:cfa:478835
            Uniprot:J9P0L0
        Length = 1465

 Score = 137 (53.3 bits), Expect = 1.8e-05, P = 1.8e-05
 Identities = 83/284 (29%), Positives = 105/284 (36%)

Query:   127 DGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYA 185
             +G  G      E+ + G P G+    D  G P   GPP +A   G  G AGP        
Sbjct:   653 NGKPGEPGPKGESGSPGVPGGKG---DS-GAPGERGPPGAAGPMGPRGGAGPPGPEGGKG 708

Query:   186 AT-------QSGTP----MRAAYDIPRGPGYEASKG-PGY-DASKAPSYDPTKGPSYDPA 232
             A         +GTP    M      P GPG +  KG PG   A  AP  D  +GP+  P 
Sbjct:   709 AAGPPGPPGSAGTPGLQGMPGERGGPGGPGPKGDKGEPGSAGADGAPGKDGPRGPT-GPI 767

Query:   233 KGPGYDPTKGPGYDAQKGS-NYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP 291
               PG  P   PG   + G+       GP         + P    G+    G N +    P
Sbjct:   768 GPPG--PAGQPGDKGEGGAPGLPGIAGPRGGPGERGEHGPPGPAGFPGAPGQNGE----P 821

Query:   292 GYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFD 350
             G + +R  PG   + GP   A       P   PG    +G+    R +P      G G  
Sbjct:   822 GAKGERGAPGEKGEGGPPGVAGPPGGAGPAGPPGPQGVKGE----RGSPG-----GPGAA 872

Query:   351 GAPRGAAPHGQVPPPLNNV---PYGSATPPARSGSGQPRGGNPA 391
             G P G    G   PP NN    P GS+  P + G   P G N A
Sbjct:   873 GFPGGRGLPG---PPGNNGNPGPPGSSGAPGKDGPPGPPGNNGA 913

 Score = 132 (51.5 bits), Expect = 6.3e-05, P = 6.3e-05
 Identities = 83/280 (29%), Positives = 101/280 (36%)

Query:   126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP----NTS 180
             A G  GG  G +       P G + +    G P   GPP     AG  G  GP      S
Sbjct:   165 AGGGIGGYPGPAGPPGPPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPAGPPGPPGAMGPS 224

Query:   181 TSAYAATQSGTPMRAAY-DIPRGPGYEASKG-PGYDASKAP-SYDPTKGPSYDPAKGPGY 237
               A    +SG P R     +P  PG +   G PG+   K    +D   G   D    PG 
Sbjct:   225 GPAGKDGESGRPGRPGERGLPGPPGMKGPAGMPGFPGMKGHRGFDGRNGEKGDTG-APGL 283

Query:   238 DPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGP-GYE 294
                 G PG +   G      RG   +  R P      G  G D  RG   D Q GP G  
Sbjct:   284 KGENGLPGENGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQPGPPGPP 338

Query:   295 -TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 353
              T   PG    +G V  A    S      PG   QRG+      A +  P    G +G+P
Sbjct:   339 GTAGFPGSPGAKGEVGPAGSPGS---NGSPG---QRGEPGPQGHAGAPGPPGPPGSNGSP 392

Query:   354 RGAAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RGG 388
              G    G  P  +   P   G+  PP   G+ G P  RGG
Sbjct:   393 GGKGEMG--PAGIPGAPGLIGARGPPGPPGTNGAPGQRGG 430

 Score = 123 (48.4 bits), Expect = 0.00060, P = 0.00060
 Identities = 78/261 (29%), Positives = 98/261 (37%)

Query:   147 GQNAYEDGYGVPQGHGPPPSATTAGVVGA----GPNTSTSAYAAT-QSGTPMRAAY-DIP 200
             G +      G P   GPP +A   G  GA    GP  S  +  +  Q G P    +   P
Sbjct:   321 GNDGARGSDGQPGPPGPPGTAGFPGSPGAKGEVGPAGSPGSNGSPGQRGEPGPQGHAGAP 380

Query:   201 RGPGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDP-TKG-PGYDAQKGS-NYDAQ 256
               PG   S G PG      P+  P   P    A+GP   P T G PG     G    +  
Sbjct:   381 GPPGPPGSNGSPGGKGEMGPAGIPG-APGLIGARGPPGPPGTNGAPGQRGGAGEPGKNGA 439

Query:   257 RG-PNYDIHRGPSYDPQ-RG-LGYDMQRG-PNYDMQRG-PGYETQR-VPGYDVQRGPV-- 308
             +G P     RG +  P   G  G D + G P      G PG   +R  PG+   RGP   
Sbjct:   440 KGEPGPRGERGEAGSPGIPGPKGEDGKDGSPGEPGANGLPGAAGERGAPGF---RGPAGA 496

Query:   309 --YEAQRAPSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPP 365
                  ++ P+   + GPG    RG  G   R      P    G  G+P G    G+  PP
Sbjct:   497 NGLPGEKGPAG-ERGGPGPAGPRGAPGEPGRDGVPGGPGM-RGMPGSPGGPGSDGKPGPP 554

Query:   366 LNNVPYGSATPPARSGS-GQP 385
              +    G   PP  SG  GQP
Sbjct:   555 GSQGESGRPGPPGPSGPRGQP 575


>UNIPROTKB|F1N7Q7 [details] [associations]
            symbol:COL4A2 "Collagen alpha-2(IV) chain" species:9913
            "Bos taurus" [GO:0071560 "cellular response to transforming growth
            factor beta stimulus" evidence=IEA] [GO:0016525 "negative
            regulation of angiogenesis" evidence=IEA] [GO:0006351
            "transcription, DNA-dependent" evidence=IEA] [GO:0005587 "collagen
            type IV" evidence=IEA] [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] InterPro:IPR001442 Pfam:PF01413
            PROSITE:PS51403 SMART:SM00111 GO:GO:0071560 InterPro:IPR016187
            SUPFAM:SSF56436 GO:GO:0006351 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0016525 GO:GO:0005201 GO:GO:0005587 Gene3D:2.170.240.10
            GeneTree:ENSGT00690000101772 EMBL:DAAA02034911 IPI:IPI00712524
            Ensembl:ENSBTAT00000005916 OMA:QETIQPG Uniprot:F1N7Q7
        Length = 1650

 Score = 137 (53.3 bits), Expect = 2.0e-05, P = 2.0e-05
 Identities = 75/251 (29%), Positives = 98/251 (39%)

Query:   116 LMNAPNVD-RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVV 173
             L   P +  R+ D    GA G +  +    P G + +    G+P GH G        G  
Sbjct:    18 LQGFPGLQGRKGDKGQRGAPGITGPKGDVGPRGVSGFPGADGIP-GHPGQGGPRGPPGYD 76

Query:   174 GAGPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGPGYDASKAPSYDPTKGPSYDPA 232
             G       S YA    G P    +  PRGP G +  KG  Y A  +   D  +G   +P 
Sbjct:    77 GCNGTVGDSGYA----GPPGPGGFLGPRGPQGPKGQKGEPY-ALSSEDRDKYRGEPGEPG 131

Query:   233 KGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDP-QRGLGYDMQRGPNYDMQ-RG 290
                   P   PG   Q G    A   P      GP   P  RGLG+  ++G   DM  +G
Sbjct:   132 LVGLQGPPGRPGPVGQMGP-VGAPGRPGPPGPPGPKGQPGNRGLGFYGEKGEKGDMGLQG 190

Query:   291 PGYETQRVP---GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG-QGYDMRRAPSYDPSRG 346
             PG     +P   GY  +  PVYE       +P++  G   ++G QG   R   S     G
Sbjct:   191 PG----GIPPDNGYVEKPTPVYEL------LPEQYKG---EKGSQGEPGRIGVSLKGEEG 237

Query:   347 T-GFDGAPRGA 356
               GF G PRGA
Sbjct:   238 VVGFSG-PRGA 247


>UNIPROTKB|F1LRJ1 [details] [associations]
            symbol:Col4a3 "Protein Col4a3" species:10116 "Rattus
            norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
            RGD:71085 GO:GO:0006917 GO:GO:0008283 InterPro:IPR016187
            SUPFAM:SSF56436 GO:GO:0006919 GO:GO:0007166 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0016525 GO:GO:0005201 GO:GO:0005587
            Gene3D:2.170.240.10 GeneTree:ENSGT00690000101772 KO:K06237 CTD:1285
            GO:GO:0032836 IPI:IPI00367109 RefSeq:NP_001129231.1
            UniGene:Rn.121139 Ensembl:ENSRNOT00000020669 GeneID:363265
            KEGG:rno:363265 NextBio:683046 ArrayExpress:F1LRJ1 Uniprot:F1LRJ1
        Length = 1670

 Score = 137 (53.3 bits), Expect = 2.1e-05, P = 2.1e-05
 Identities = 93/289 (32%), Positives = 106/289 (36%)

Query:   127 DGSYGGATGNSENETSGRPV--GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAY 184
             DGS GG          G P   G+   +   G P   GPP  A  AG  G GP       
Sbjct:   568 DGSPGGPGAKGPRGPRGEPALSGRKGDQGPPGAPGSPGPPGPAGPAGPPGYGPQGEPGPK 627

Query:   185 AATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAK-GP-GYDPTKG 242
              A   G P   A     GP  EA    G  ++  P   P  GP   P + GP G     G
Sbjct:   628 GA--QGVP--GAL----GPPGEAGL-KGESSASIPVLGPP-GPPGPPGQAGPRGLPGLPG 677

Query:   243 PGYDAQKGS-NYDAQRG-PNYDIH--RGPSYDPQRGLGYDMQRG-PNYDMQRGPGYETQR 297
             P      G    D + G P       RGP  D     G+    G P Y     PG ET R
Sbjct:   678 PVGTCDPGHPGPDGEPGIPEVGFPGARGPKGDQ----GFPGTIGLPGY-----PG-ETGR 727

Query:   298 VPGYDVQRGPVYEAQRAPSY-IP-QRG-PGYDLQRGQGYDMRRA--PSYDPSRGT----G 348
              PGY  + G V  A+  PS   P + G PG+  +RG   +      P      GT    G
Sbjct:   728 -PGYPGEMG-VPGAKGEPSVGRPGEPGKPGFPGERGNSGENGDIGLPGLPGPPGTPGKDG 785

Query:   349 FDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQP--RG--GNPARR 393
             FDG P    P GQ  PP    P G   P  R   G P   G  G P RR
Sbjct:   786 FDGPP--GDP-GQSGPPGAKGPPGRCIPGPRGTQGLPGLNGLKGQPGRR 831


>UNIPROTKB|J9NW09 [details] [associations]
            symbol:POLR2A "DNA-directed RNA polymerase" species:9615
            "Canis lupus familiaris" [GO:0003899 "DNA-directed RNA polymerase
            activity" evidence=IEA] [GO:0006366 "transcription from RNA
            polymerase II promoter" evidence=IEA] [GO:0005665 "DNA-directed RNA
            polymerase II, core complex" evidence=IEA] [GO:0003677 "DNA
            binding" evidence=IEA] InterPro:IPR000684 InterPro:IPR000722
            InterPro:IPR006592 InterPro:IPR007066 InterPro:IPR007073
            InterPro:IPR007075 InterPro:IPR007080 InterPro:IPR007081
            InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983 Pfam:PF04990
            Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000 Pfam:PF05001
            PROSITE:PS00115 SMART:SM00663 GO:GO:0003677 GO:GO:0006366
            Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899 GO:GO:0005665
            GeneTree:ENSGT00700000104490 EMBL:AAEX03003616 EMBL:AAEX03003617
            Ensembl:ENSCAFT00000050029 Uniprot:J9NW09
        Length = 1789

 Score = 137 (53.3 bits), Expect = 2.2e-05, P = 2.2e-05
 Identities = 71/265 (26%), Positives = 95/265 (35%)

Query:   118 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP--SATTAGVVGA 175
             N P +   A G  G   G++ +   G       +  G     G   P   S  T G  G 
Sbjct:  1489 NIPGLG--AAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGF 1546

Query:   176 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP 235
              P+ ++ A   +   +P  A    P  PG      PG  +   PS      PSY P   P
Sbjct:  1547 SPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGAMSPSYSPTS-P 1598

Query:   236 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
              Y+P    GY  Q  S Y +   P+Y     PSY P     Y     P+Y     P Y  
Sbjct:  1599 AYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPSYS-PTSPSYSP 1652

Query:   296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
                P Y     P Y +  +PSY P   P Y       Y    +PSY P+  +    +P  
Sbjct:  1653 TS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPTSPSYSPTSP-S 1705

Query:   356 AAPHGQVPPPLNNVPYGSATPPARS 380
              +P      P +  P  S T P+ S
Sbjct:  1706 YSPTSPSYSPTS--PSYSPTSPSYS 1728


>MGI|MGI:88453 [details] [associations]
            symbol:Col3a1 "collagen, type III, alpha 1" species:10090 "Mus
            musculus" [GO:0001568 "blood vessel development" evidence=IMP]
            [GO:0005178 "integrin binding" evidence=ISO] [GO:0005201
            "extracellular matrix structural constituent" evidence=ISO]
            [GO:0005576 "extracellular region" evidence=IEA] [GO:0005578
            "proteinaceous extracellular matrix" evidence=IEA] [GO:0005581
            "collagen" evidence=IDA] [GO:0005586 "collagen type III"
            evidence=ISO;IDA] [GO:0005615 "extracellular space" evidence=ISO]
            [GO:0007160 "cell-matrix adhesion" evidence=ISO] [GO:0007179
            "transforming growth factor beta receptor signaling pathway"
            evidence=ISO] [GO:0007229 "integrin-mediated signaling pathway"
            evidence=ISO] [GO:0007507 "heart development" evidence=ISO]
            [GO:0009314 "response to radiation" evidence=ISO] [GO:0018149
            "peptide cross-linking" evidence=ISO] [GO:0030199 "collagen fibril
            organization" evidence=ISO;IMP] [GO:0031012 "extracellular matrix"
            evidence=ISO;IDA] [GO:0032964 "collagen biosynthetic process"
            evidence=ISO] [GO:0034097 "response to cytokine stimulus"
            evidence=ISO] [GO:0042060 "wound healing" evidence=ISO] [GO:0043206
            "extracellular fibril organization" evidence=ISO] [GO:0043588 "skin
            development" evidence=ISO] [GO:0046332 "SMAD binding" evidence=IPI]
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0048407
            "platelet-derived growth factor binding" evidence=ISO] [GO:0048565
            "digestive tract development" evidence=IMP] [GO:0050777 "negative
            regulation of immune response" evidence=ISO] [GO:0071230 "cellular
            response to amino acid stimulus" evidence=IDA] InterPro:IPR000885
            InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410 ProDom:PD002078
            PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038
            SMART:SM00214 MGI:MGI:88453 GO:GO:0043588 GO:GO:0005615
            GO:GO:0007507 GO:GO:0046872 GO:GO:0034097 GO:GO:0030199
            GO:GO:0001501 GO:GO:0007179 GO:GO:0007229 GO:GO:0007160
            InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793 GO:GO:0042060
            GO:GO:0001568 GO:GO:0048565 GO:GO:0050777 GO:GO:0009314
            GO:GO:0018149 GO:GO:0032964 GO:GO:0071230 GO:GO:0043206
            GO:GO:0005201 HOVERGEN:HBG004933 KO:K06236 OrthoDB:EOG4FTW1C
            CTD:1281 OMA:EGSPGHP ChiTaRS:COL3A1 GO:GO:0005586 EMBL:X52046
            EMBL:BC043089 EMBL:BC058724 EMBL:M18933 EMBL:K03037 EMBL:AK019448
            EMBL:X57983 IPI:IPI00129571 PIR:A27353 PIR:S59856
            RefSeq:NP_034060.2 UniGene:Mm.249555 ProteinModelPortal:P08121
            SMR:P08121 STRING:P08121 PhosphoSite:P08121 PaxDb:P08121
            PRIDE:P08121 Ensembl:ENSMUST00000087883 GeneID:12825 KEGG:mmu:12825
            InParanoid:P08121 NextBio:282310 Bgee:P08121 CleanEx:MM_COL3A1
            Genevestigator:P08121 Uniprot:P08121
        Length = 1464

 Score = 136 (52.9 bits), Expect = 2.3e-05, P = 2.3e-05
 Identities = 86/285 (30%), Positives = 101/285 (35%)

Query:   120 PNVDRRADGSYGGATGNSENETSGR--PVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGP 177
             P  +   DGS G    N     +G   P G        G+P   GPP      G  G   
Sbjct:   459 PKGEDGKDGSPGEPGANGLPGAAGERGPSGFRGPAGPNGIPGEKGPPGERGGPGPAGPRG 518

Query:   178 NTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDP-TKGPSYDPAKGPG 236
                      T  G  +R     P GPG +   GP    S+  S  P   GPS  P   PG
Sbjct:   519 VAGEPGRDGTPGGPGIRGMPGSPGGPGNDGKPGP--PGSQGESGRPGPPGPS-GPRGQPG 575

Query:   237 YDPTKGP-GYDAQKGSNYDAQRGPNYDIHRGPS-YDPQRG-LGYDMQRGPNYDM-QRGP- 291
                  GP G D   G N + + GP      GP+  + + G  G     GP  D    GP 
Sbjct:   576 VMGFPGPKGNDGAPGKNGE-RGGPGGPGLPGPAGKNGETGPQGPPGPTGPAGDKGDSGPP 634

Query:   292 GYE-TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGT-GF 349
             G +  Q +PG     GP  E  +     P+   G     G G     AP      GT G 
Sbjct:   635 GPQGLQGIPGTG---GPPGENGKPGEPGPKGEVGAPGAPG-GKGDSGAPGERGPPGTAGI 690

Query:   350 DGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS----GQP--RGG 388
              GA  GA P G   P     P G   PP  SGS    G P  RGG
Sbjct:   691 PGARGGAGPPG---PEGGKGPAGPPGPPGASGSPGLQGMPGERGG 732

 Score = 121 (47.7 bits), Expect = 0.00099, P = 0.00099
 Identities = 81/284 (28%), Positives = 103/284 (36%)

Query:   128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP----NTSTS 182
             G  GG  G +       P G + +    G P   GPP     AG  G  GP      +  
Sbjct:   166 GGMGGYPGPAGPPGPPGPPGSSGHPGSPGSPGYQGPPGEPGQAGPAGPPGPPGALGPAGP 225

Query:   183 AYAATQSGTPMRAAY-DIPRGPGYEASKG-PGYDASKAP-SYDPTKGPSYDPAKGPGYDP 239
             A    +SG P R     +P  PG +   G PG+   K    +D   G   +    PG   
Sbjct:   226 AGKDGESGRPGRPGERGLPGPPGIKGPAGMPGFPGMKGHRGFDGRNGEKGETG-APGLKG 284

Query:   240 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGP-GYE-T 295
               G PG +   G      RG   +  R P      G  G D  RG   D Q GP G   T
Sbjct:   285 ENGLPGDNGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQPGPPGPPGT 339

Query:   296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
                PG    +G V  A    S      PG   QRG+      A +  P    G +G+P G
Sbjct:   340 AGFPGSPGAKGEVGPAGSPGS---NGSPG---QRGEPGPQGHAGAQGPPGPPGNNGSPGG 393

Query:   356 AAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RG--GNPAR 392
                 G  P  +   P   G+  PP  +G+ G P  RG  G P +
Sbjct:   394 KGEMG--PAGIPGAPGLIGARGPPGPAGTNGIPGTRGPSGEPGK 435


>FB|FBgn0262126 [details] [associations]
            symbol:gho "ghost" species:7227 "Drosophila melanogaster"
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0006888 "ER to Golgi vesicle-mediated transport" evidence=IEA]
            [GO:0006886 "intracellular protein transport" evidence=IEA]
            [GO:0008270 "zinc ion binding" evidence=IEA] [GO:0030127 "COPII
            vesicle coat" evidence=IEA] [GO:0005811 "lipid particle"
            evidence=IDA] [GO:0035158 "regulation of tube diameter, open
            tracheal system" evidence=IMP] [GO:0009306 "protein secretion"
            evidence=IMP] [GO:0035151 "regulation of tube size, open tracheal
            system" evidence=IMP] [GO:0070971 "endoplasmic reticulum exit site"
            evidence=IDA] [GO:0003331 "positive regulation of extracellular
            matrix constituent secretion" evidence=IMP] [GO:0007029
            "endoplasmic reticulum organization" evidence=IMP] [GO:0048081
            "positive regulation of cuticle pigmentation" evidence=IMP]
            [GO:0030011 "maintenance of cell polarity" evidence=IMP]
            [GO:0007030 "Golgi organization" evidence=IMP] [GO:0016203 "muscle
            attachment" evidence=IMP] [GO:0035149 "lumen formation, open
            tracheal system" evidence=IMP] [GO:0034394 "protein localization to
            cell surface" evidence=IMP] [GO:0040003 "chitin-based cuticle
            development" evidence=IMP] [GO:0022409 "positive regulation of
            cell-cell adhesion" evidence=IMP] [GO:0008360 "regulation of cell
            shape" evidence=IMP] [GO:0071711 "basement membrane organization"
            evidence=IMP] [GO:0000902 "cell morphogenesis" evidence=IMP]
            InterPro:IPR006895 InterPro:IPR006896 InterPro:IPR006900
            Pfam:PF04810 Pfam:PF04811 Pfam:PF04815 GO:GO:0006886 EMBL:AE014134
            GO:GO:0008360 GO:GO:0005811 GO:GO:0008270 GO:GO:0009306
            GO:GO:0016787 GO:GO:0016203 GO:GO:0000902 InterPro:IPR007123
            Pfam:PF00626 GO:GO:0006888 GO:GO:0040003 GO:GO:0034394
            GO:GO:0003331 GO:GO:0071711 GO:GO:0007030 GO:GO:0007029
            GO:GO:0030011 GO:GO:0035158 GO:GO:0022409 GO:GO:0035149
            GO:GO:0030127 SUPFAM:SSF82919 GO:GO:0070971 InterPro:IPR012990
            Pfam:PF08033 SUPFAM:SSF81811 eggNOG:COG5028 KO:K14007
            GeneTree:ENSGT00590000082962 HSSP:P40482 OMA:QDQGNCN GO:GO:0048081
            EMBL:AY052042 RefSeq:NP_608664.2 UniGene:Dm.269 SMR:Q9VQ94
            IntAct:Q9VQ94 MINT:MINT-283494 STRING:Q9VQ94
            EnsemblMetazoa:FBtr0077810 EnsemblMetazoa:FBtr0329964 GeneID:33409
            KEGG:dme:Dmel_CG10882 UCSC:CG10882-RA CTD:33409 FlyBase:FBgn0262126
            InParanoid:Q9VQ94 OrthoDB:EOG4CVDNW GenomeRNAi:33409 NextBio:783418
            Uniprot:Q9VQ94
        Length = 1193

 Score = 135 (52.6 bits), Expect = 2.3e-05, P = 2.3e-05
 Identities = 65/231 (28%), Positives = 84/231 (36%)

Query:   156 GVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIP---RGPGYEASKGPG 212
             G P   G PP +    +  + P  S     +++ G P       P     PG    +  G
Sbjct:   211 GQPPLPGQPPFS--GQIPTSQPAPSPYGVPSSRPGQPQLPPGATPPTYTQPGLPPQQQQG 268

Query:   213 YDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQ 272
                 + P   P + P + P + PG  P   PG   Q G+ Y A +   Y    G  +  Q
Sbjct:   269 IPPLQQPGI-PQQQPGFPPQQ-PGLPPLSQPGLPPQPGAPYGAPQQGGYS---G-GFPGQ 322

Query:   273 RGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG----PGYDLQ 328
                G+     P       PG +    P +   + P Y  Q+ P Y PQ G    PGY  Q
Sbjct:   323 APGGFPGAPPPL------PGQQAAAPPQFGAPQ-PGYPGQQ-PGYPPQPGQQPMPGYPPQ 374

Query:   329 RGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPAR 379
              GQ       P Y P  G GF G P G     Q P P     Y  A P AR
Sbjct:   375 PGQQLG---GPGYPPQPGAGFPGQP-GRPGFNQPPMPGAGNMYQQA-PQAR 420

 Score = 127 (49.8 bits), Expect = 0.00017, P = 0.00017
 Identities = 75/283 (26%), Positives = 100/283 (35%)

Query:   128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHG-PPPSATTAGVVGAGPNTSTSAY-- 184
             G  GGA         G P        G+     +  PPP+       GA P T   +Y  
Sbjct:    90 GGVGGANPLKPPLPQGAPAAAAPPPTGFNQFNSNAAPPPTNNNNAAFGAPPPTQAGSYVN 149

Query:   185 -AATQSGTPMRAAYDIPRGPGYEASKG--PGYDASKAPSYDPTKGPSYDPAKG------- 234
              A   S TP   A  I +     A+    P     KA +     G    PA G       
Sbjct:   150 GALPPSSTPQSVASGINQMSLNSATLAGLPHMPPPKAATPGAAPGQPPIPAAGSTSQPPL 209

Query:   235 PGYDPTKGPGYDAQKGSNYDAQRGPN-YDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY 293
             PG  P   PG     G    +Q  P+ Y +       PQ   G      P Y     P  
Sbjct:   210 PGQPPL--PGQPPFSGQIPTSQPAPSPYGVPSSRPGQPQLPPG---ATPPTYTQPGLPPQ 264

Query:   294 ETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYD-PSRGTGFDGA 352
             + Q +P   +Q+  +   Q+ P + PQ+ PG       G   +    Y  P +G G+ G 
Sbjct:   265 QQQGIP--PLQQPGI--PQQQPGFPPQQ-PGLPPLSQPGLPPQPGAPYGAPQQG-GYSGG 318

Query:   353 PRGAAPHG--QVPPPLNNVPYGSATPPARSGSGQPRGGNPARR 393
               G AP G    PPPL   P   A  P + G+ QP  G P ++
Sbjct:   319 FPGQAPGGFPGAPPPL---PGQQAAAPPQFGAPQP--GYPGQQ 356


>WB|WBGene00001215 [details] [associations]
            symbol:ego-2 species:6239 "Caenorhabditis elegans"
            [GO:0040002 "collagen and cuticulin-based cuticle development"
            evidence=IMP] [GO:0002009 "morphogenesis of an epithelium"
            evidence=IMP] [GO:0040007 "growth" evidence=IMP] [GO:0002119
            "nematode larval development" evidence=IMP] [GO:0040035
            "hermaphrodite genitalia development" evidence=IMP] [GO:0009792
            "embryo development ending in birth or egg hatching" evidence=IMP]
            [GO:0045747 "positive regulation of Notch signaling pathway"
            evidence=IGI] InterPro:IPR025304 Pfam:PF13949 GO:GO:0009792
            GO:GO:0002009 GO:GO:0040007 GO:GO:0002119 GO:GO:0045747
            GO:GO:0040035 Gene3D:1.25.40.280 InterPro:IPR004328 Pfam:PF03097
            SMART:SM01041 PROSITE:PS51180 GO:GO:0040002 EMBL:AL117201
            UniGene:Cel.16377 GeneID:190251 KEGG:cel:CELE_Y53H1C.2 CTD:190251
            RefSeq:NP_001251634.1 ProteinModelPortal:H8ESG1 WormBase:Y53H1C.2c
            Uniprot:H8ESG1
        Length = 1494

 Score = 136 (52.9 bits), Expect = 2.3e-05, P = 2.3e-05
 Identities = 79/280 (28%), Positives = 107/280 (38%)

Query:   129 SYGGATGNSENETSGRPVGQNAYEDGYGVPQG-----HGPPPSATTAGVVGAGPNTSTSA 183
             SYG  T      + G   G + Y++G   P G      GPP +   A    A P TS   
Sbjct:  1050 SYGAPT--PPQASYGPAPGAHGYQNGAQGPPGAEVGAQGPPGAHFGAHGASAPPPTS--- 1104

Query:   184 YAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP-GYDP--- 239
             Y A     P +A+Y     PG +   G  ++A  A +  PT   +  P +GP G  P   
Sbjct:  1105 YGAPTPQRPPQASYGA--APGAQGPPGGQFEAHGAAALPPTSHGAPTP-QGPFGAAPGAQ 1161

Query:   240 --TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQR 297
                +GP Y  Q+G+ Y+AQ+ P   I   P   PQ    +  Q G        PG +   
Sbjct:  1162 FGAQGP-Y-GQQGARYEAQKSPGAAIFGAPGAPPQHQGSFGAQFGVPPPQNSAPGAQFGA 1219

Query:   298 VPGYDVQRGPVYEAQRAPSY-IPQRGPGYDL-QRG-QGYDMRRAP---SYD-----P-SR 345
              P       P    Q  PSY  P   P   + Q   QG  +   P   S+      P +R
Sbjct:  1220 KPEAS-SHAPTPPPQPHPSYQAPAPPPALSVFQHSPQGAPITAPPPASSHHEHIAAPQAR 1278

Query:   346 GTGFDGAPRG--AAPHG-QVPPPLNNVPYGSATPPARSGS 382
              T   GAP    A P   +   P N  P   A P A++ +
Sbjct:  1279 FTPTPGAPSPWHATPAELKFQTPWNTTPQYHAPPGAQAAA 1318


>UNIPROTKB|F1SKM1 [details] [associations]
            symbol:COL7A1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0031012 "extracellular matrix" evidence=IDA]
            [GO:0004867 "serine-type endopeptidase inhibitor activity"
            evidence=IEA] InterPro:IPR002035 InterPro:IPR002223
            InterPro:IPR003961 Pfam:PF00014 Pfam:PF00041 Pfam:PF00092
            PRINTS:PR00759 PROSITE:PS50234 PROSITE:PS50279 PROSITE:PS50853
            SMART:SM00060 SMART:SM00327 Gene3D:2.60.40.10 InterPro:IPR013783
            GO:GO:0004867 SUPFAM:SSF49265 Gene3D:4.10.410.10 InterPro:IPR020901
            SUPFAM:SSF57362 PROSITE:PS00280 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0005604 OMA:RRVCTTA GeneTree:ENSGT00700000104301
            EMBL:CU633242 Ensembl:ENSSSCT00000012432 ArrayExpress:F1SKM1
            Uniprot:F1SKM1
        Length = 2939

 Score = 148 (57.2 bits), Expect = 2.4e-05, Sum P(2) = 2.4e-05
 Identities = 82/272 (30%), Positives = 105/272 (38%)

Query:   145 PVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG-P 203
             P G        G P   GPP SA   G  G  P    S  +  + GTP  +    P+G P
Sbjct:  1270 PPGPPGLPGRIGAPGPPGPPGSAIAKGERGF-PGADGSPGSPGRPGTPGTSG---PKGSP 1325

Query:   204 GYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGPNY 261
             G+   +G PG    + P  +P +       +GPG    KG PG     GS     RGP+ 
Sbjct:  1326 GWPGPRGEPGERGPRGPKGEPGEPGRVIGGEGPGLPGQKGDPGLPGPPGS-----RGPSG 1380

Query:   262 DIH-RGPSYDPQRGL----GYDMQRGPNY--DMQRGPGYE-TQRVPGYDVQRGPV----Y 309
             D   RGP   P   +    G   +RGP    D    PG      +PG    +GPV     
Sbjct:  1381 DPGPRGPPGFPGTAVKGEKGDRGERGPPGPGDGTAAPGDPGLPGLPGSPGPQGPVGPPGE 1440

Query:   310 EAQRAPSYIPQRG----PGYDLQRG-QGYDMRRAPSYDPSRG-TGFDGAPRGAAPHGQVP 363
             + ++  S     G    PG   +RG +G+     P  D  RG TG  G P      G  P
Sbjct:  1441 KGEKGDSEDGAPGLPGQPGVPGERGLRGFPGDTGPKGD--RGLTGAVGEPGEKGERGS-P 1497

Query:   364 PPLNNVPYGSATPPARSGSGQPRG--GNPARR 393
              P+   P G    P R G+  P G  G   RR
Sbjct:  1498 GPVG--PQGPPGVPGRPGAEGPEGPPGPTGRR 1527

 Score = 41 (19.5 bits), Expect = 2.4e-05, Sum P(2) = 2.4e-05
 Identities = 8/24 (33%), Positives = 15/24 (62%)

Query:     2 PKVGA-HKLEIRCTLIFTCTLDFL 24
             P+V A H+  + CT +++  + FL
Sbjct:    19 PRVRAQHRERVTCTRLYSADIVFL 42


>UNIPROTKB|F1NRH2 [details] [associations]
            symbol:LOC100858979 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0005581 "collagen" evidence=IEA] [GO:0005938
            "cell cortex" evidence=IEA] InterPro:IPR008983 GO:GO:0005938
            GO:GO:0005581 Gene3D:2.60.120.40 InterPro:IPR001073
            InterPro:IPR008160 Pfam:PF00386 Pfam:PF01391 PRINTS:PR00007
            SMART:SM00110 SUPFAM:SSF49842 PROSITE:PS50871
            GeneTree:ENSGT00700000104270 OMA:IKGPPPN EMBL:AC147437
            IPI:IPI01017314 RefSeq:XP_003641055.1 Ensembl:ENSGALT00000024133
            GeneID:100858979 KEGG:gga:100858979 Uniprot:F1NRH2
        Length = 674

 Score = 132 (51.5 bits), Expect = 2.4e-05, P = 2.4e-05
 Identities = 87/283 (30%), Positives = 107/283 (37%)

Query:   125 RADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTSA 183
             + D    GA G +       P G+   E G G P   GPP  A   G  G  GP      
Sbjct:   227 KGDRGLPGARGEAGIPGPQGPPGEPG-EVGIGKPGPMGPPGPAGIPGAKGLPGP------ 279

Query:   184 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAK-GP-GYDPT 240
               A   G+P    +  P  PG +  +GP G      P  D  +GP+  P + GP G    
Sbjct:   280 --AGLPGSPGLPGFGKPGLPGMKGHRGPEGPPGFPGPKGD--QGPAGVPGEPGPAGPQGN 335

Query:   241 KGP-GYDAQKGSNYDAQRGPNYDIHR-GPSYDP----QRGL-GYDMQRGPNYDMQRG-PG 292
              GP G     G N     GP  D+   GP+  P    +RGL G D +  P Y  ++G PG
Sbjct:   336 MGPQGLKGLPGEN--GLPGPKGDMGPVGPAGFPGAKGERGLPGLDGK--PGYPGEQGLPG 391

Query:   293 YETQRVPGYDVQRGPVYEAQRAPSYIPQR-GP-GYDLQRG-QGYDMRRAPSYDPS-RGT- 347
              +    PG   Q+G    A   P  +P   GP G     G  G    R PS  P  RG  
Sbjct:   392 PKGH--PGLPGQKGDTGHA--GPPGLPGPVGPQGVKGVPGINGEPGPRGPSGIPGIRGPI 447

Query:   348 GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNP 390
             G  G P      G+   P    P G AT   R   G P    P
Sbjct:   448 GPPGMPGAPGAKGEAGAPGLPGPAGIATKGLRGPMGPPGPPGP 490


>UNIPROTKB|F1PGS0 [details] [associations]
            symbol:POLR2A "DNA-directed RNA polymerase" species:9615
            "Canis lupus familiaris" [GO:0003899 "DNA-directed RNA polymerase
            activity" evidence=IEA] [GO:0006366 "transcription from RNA
            polymerase II promoter" evidence=IEA] [GO:0005665 "DNA-directed RNA
            polymerase II, core complex" evidence=IEA] [GO:0003677 "DNA
            binding" evidence=IEA] InterPro:IPR000684 InterPro:IPR000722
            InterPro:IPR006592 InterPro:IPR007066 InterPro:IPR007073
            InterPro:IPR007075 InterPro:IPR007080 InterPro:IPR007081
            InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983 Pfam:PF04990
            Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000 Pfam:PF05001
            PROSITE:PS00115 SMART:SM00663 GO:GO:0003677 GO:GO:0006366
            Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899 GO:GO:0005665
            GeneTree:ENSGT00700000104490 OMA:KVLPWST EMBL:AAEX03003616
            EMBL:AAEX03003617 Ensembl:ENSCAFT00000026237 Uniprot:F1PGS0
        Length = 1969

 Score = 137 (53.3 bits), Expect = 2.5e-05, P = 2.5e-05
 Identities = 71/265 (26%), Positives = 95/265 (35%)

Query:   118 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP--SATTAGVVGA 175
             N P +   A G  G   G++ +   G       +  G     G   P   S  T G  G 
Sbjct:  1489 NIPGLG--AAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGF 1546

Query:   176 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP 235
              P+ ++ A   +   +P  A    P  PG      PG  +   PS      PSY P   P
Sbjct:  1547 SPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGAMSPSYSPTS-P 1598

Query:   236 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
              Y+P    GY  Q  S Y +   P+Y     PSY P     Y     P+Y     P Y  
Sbjct:  1599 AYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPSYS-PTSPSYSP 1652

Query:   296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
                P Y     P Y +  +PSY P   P Y       Y    +PSY P+  +    +P  
Sbjct:  1653 TS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPTSPSYSPTSP-S 1705

Query:   356 AAPHGQVPPPLNNVPYGSATPPARS 380
              +P      P +  P  S T P+ S
Sbjct:  1706 YSPTSPSYSPTS--PSYSPTSPSYS 1728


>UNIPROTKB|G3MZY8 [details] [associations]
            symbol:POLR2A "DNA-directed RNA polymerase" species:9913
            "Bos taurus" [GO:0031625 "ubiquitin protein ligase binding"
            evidence=IEA] [GO:0005665 "DNA-directed RNA polymerase II, core
            complex" evidence=IEA] [GO:0004672 "protein kinase activity"
            evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA] [GO:0003899
            "DNA-directed RNA polymerase activity" evidence=IEA] [GO:0006366
            "transcription from RNA polymerase II promoter" evidence=IEA]
            InterPro:IPR000684 InterPro:IPR000722 InterPro:IPR006592
            InterPro:IPR007066 InterPro:IPR007073 InterPro:IPR007075
            InterPro:IPR007080 InterPro:IPR007081 InterPro:IPR007083
            Pfam:PF00623 Pfam:PF04983 Pfam:PF04990 Pfam:PF04992 Pfam:PF04997
            Pfam:PF04998 Pfam:PF05000 Pfam:PF05001 PROSITE:PS00115
            SMART:SM00663 GO:GO:0003677 GO:GO:0004672 GO:GO:0006366
            Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899 GO:GO:0005665
            GeneTree:ENSGT00700000104490 OMA:KVLPWST EMBL:DAAA02048777
            EMBL:DAAA02048778 EMBL:DAAA02048779 EMBL:DAAA02048780
            EMBL:DAAA02048781 Ensembl:ENSBTAT00000064788 Uniprot:G3MZY8
        Length = 1970

 Score = 137 (53.3 bits), Expect = 2.5e-05, P = 2.5e-05
 Identities = 71/265 (26%), Positives = 95/265 (35%)

Query:   118 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP--SATTAGVVGA 175
             N P +   A G  G   G++ +   G       +  G     G   P   S  T G  G 
Sbjct:  1490 NIPGLG--AAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGF 1547

Query:   176 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP 235
              P+ ++ A   +   +P  A    P  PG      PG  +   PS      PSY P   P
Sbjct:  1548 SPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGAMSPSYSPTS-P 1599

Query:   236 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
              Y+P    GY  Q  S Y +   P+Y     PSY P     Y     P+Y     P Y  
Sbjct:  1600 AYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPSYS-PTSPSYSP 1653

Query:   296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
                P Y     P Y +  +PSY P   P Y       Y    +PSY P+  +    +P  
Sbjct:  1654 TS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPTSPSYSPTSP-S 1706

Query:   356 AAPHGQVPPPLNNVPYGSATPPARS 380
              +P      P +  P  S T P+ S
Sbjct:  1707 YSPTSPSYSPTS--PSYSPTSPSYS 1729


>UNIPROTKB|P24928 [details] [associations]
            symbol:POLR2A "DNA-directed RNA polymerase II subunit RPB1"
            species:9606 "Homo sapiens" [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0003968 "RNA-directed RNA polymerase activity"
            evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0003677 "DNA binding" evidence=NAS] [GO:0003899 "DNA-directed
            RNA polymerase activity" evidence=NAS] [GO:0006355 "regulation of
            transcription, DNA-dependent" evidence=NAS] [GO:0006366
            "transcription from RNA polymerase II promoter"
            evidence=IDA;NAS;TAS] [GO:0005634 "nucleus" evidence=IDA;NAS]
            [GO:0005665 "DNA-directed RNA polymerase II, core complex"
            evidence=IDA] [GO:0004672 "protein kinase activity" evidence=IDA]
            [GO:0005730 "nucleolus" evidence=IDA] [GO:0000398 "mRNA splicing,
            via spliceosome" evidence=TAS] [GO:0005654 "nucleoplasm"
            evidence=TAS] [GO:0006281 "DNA repair" evidence=TAS] [GO:0006283
            "transcription-coupled nucleotide-excision repair" evidence=TAS]
            [GO:0006289 "nucleotide-excision repair" evidence=TAS] [GO:0006367
            "transcription initiation from RNA polymerase II promoter"
            evidence=TAS] [GO:0006368 "transcription elongation from RNA
            polymerase II promoter" evidence=TAS] [GO:0006370
            "7-methylguanosine mRNA capping" evidence=TAS] [GO:0008380 "RNA
            splicing" evidence=TAS] [GO:0010467 "gene expression" evidence=TAS]
            [GO:0016032 "viral reproduction" evidence=TAS] [GO:0050434
            "positive regulation of viral transcription" evidence=TAS]
            [GO:0031625 "ubiquitin protein ligase binding" evidence=IPI]
            [GO:0006468 "protein phosphorylation" evidence=IDA]
            Reactome:REACT_216 Reactome:REACT_71 InterPro:IPR000684
            InterPro:IPR000722 InterPro:IPR006592 InterPro:IPR007066
            InterPro:IPR007073 InterPro:IPR007075 InterPro:IPR007080
            InterPro:IPR007081 InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983
            Pfam:PF04990 Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000
            Pfam:PF05001 PROSITE:PS00115 SMART:SM00663 Reactome:REACT_116125
            EMBL:CH471108 GO:GO:0016032 GO:GO:0006355 GO:GO:0046872
            GO:GO:0003677 Reactome:REACT_1675 GO:GO:0006468 GO:GO:0006368
            Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0006367 GO:GO:0000398
            Reactome:REACT_1788 GO:GO:0006370 GO:GO:0050434 GO:GO:0006283
            Reactome:REACT_1892 EMBL:AC113189 GO:GO:0003899 PDB:2GHQ PDB:2GHT
            PDBsum:2GHQ PDBsum:2GHT eggNOG:COG0086 GO:GO:0003968 GO:GO:0005665
            HOGENOM:HOG000222975 OMA:KVLPWST KO:K03006 EMBL:X63564 EMBL:X74874
            EMBL:X74873 EMBL:X74872 EMBL:X74871 EMBL:X74870 EMBL:BC137231
            IPI:IPI00031627 PIR:I38186 PIR:S21054 RefSeq:NP_000928.1
            UniGene:Hs.270017 PDB:2LTO PDBsum:2LTO ProteinModelPortal:P24928
            SMR:P24928 DIP:DIP-29011N IntAct:P24928 MINT:MINT-156582
            STRING:P24928 PhosphoSite:P24928 DMDM:281185484 PaxDb:P24928
            PRIDE:P24928 Ensembl:ENST00000322644 GeneID:5430 KEGG:hsa:5430
            UCSC:uc002ghf.4 CTD:5430 GeneCards:GC17P007387 H-InvDB:HIX0173727
            HGNC:HGNC:9187 HPA:CAB012226 HPA:CAB016388 HPA:CAB022311
            HPA:HPA021563 MIM:180660 neXtProt:NX_P24928 PharmGKB:PA33507
            HOVERGEN:HBG004339 InParanoid:P24928 OrthoDB:EOG4JWVCM
            BindingDB:P24928 ChEMBL:CHEMBL1641353 ChiTaRS:POLR2A
            EvolutionaryTrace:P24928 GenomeRNAi:5430 NextBio:21009
            ArrayExpress:P24928 Bgee:P24928 CleanEx:HS_POLR2A
            Genevestigator:P24928 GermOnline:ENSG00000181222 Uniprot:P24928
        Length = 1970

 Score = 137 (53.3 bits), Expect = 2.5e-05, P = 2.5e-05
 Identities = 71/265 (26%), Positives = 95/265 (35%)

Query:   118 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP--SATTAGVVGA 175
             N P +   A G  G   G++ +   G       +  G     G   P   S  T G  G 
Sbjct:  1489 NIPGLG--AAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGF 1546

Query:   176 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP 235
              P+ ++ A   +   +P  A    P  PG      PG  +   PS      PSY P   P
Sbjct:  1547 SPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGAMSPSYSPTS-P 1598

Query:   236 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
              Y+P    GY  Q  S Y +   P+Y     PSY P     Y     P+Y     P Y  
Sbjct:  1599 AYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPSYS-PTSPSYSP 1652

Query:   296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
                P Y     P Y +  +PSY P   P Y       Y    +PSY P+  +    +P  
Sbjct:  1653 TS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPTSPSYSPTSP-S 1705

Query:   356 AAPHGQVPPPLNNVPYGSATPPARS 380
              +P      P +  P  S T P+ S
Sbjct:  1706 YSPTSPSYSPTS--PSYSPTSPSYS 1728


>MGI|MGI:98086 [details] [associations]
            symbol:Polr2a "polymerase (RNA) II (DNA directed) polypeptide
            A" species:10090 "Mus musculus" [GO:0003677 "DNA binding"
            evidence=IDA] [GO:0003899 "DNA-directed RNA polymerase activity"
            evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005634 "nucleus" evidence=ISO] [GO:0005665 "DNA-directed RNA
            polymerase II, core complex" evidence=ISO] [GO:0005730 "nucleolus"
            evidence=ISO] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0006366 "transcription from RNA polymerase II
            promoter" evidence=ISO] [GO:0006468 "protein phosphorylation"
            evidence=ISO] [GO:0016740 "transferase activity" evidence=IEA]
            [GO:0016779 "nucleotidyltransferase activity" evidence=IEA]
            [GO:0031625 "ubiquitin protein ligase binding" evidence=ISO]
            [GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000684
            InterPro:IPR000722 InterPro:IPR006592 InterPro:IPR007066
            InterPro:IPR007073 InterPro:IPR007075 InterPro:IPR007080
            InterPro:IPR007081 InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983
            Pfam:PF04990 Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000
            Pfam:PF05001 PROSITE:PS00115 SMART:SM00663 MGI:MGI:98086
            GO:GO:0046872 GO:GO:0003677 GO:GO:0004672 GO:GO:0006366
            EMBL:AL603707 Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899
            eggNOG:COG0086 GO:GO:0005665 GeneTree:ENSGT00700000104490
            HOGENOM:HOG000222975 OMA:KVLPWST KO:K03006 CTD:5430
            HOVERGEN:HBG004339 OrthoDB:EOG4JWVCM ChiTaRS:POLR2A EMBL:M12130
            EMBL:M14101 IPI:IPI00136207 PIR:A28490 RefSeq:NP_033115.1
            UniGene:Mm.16533 DisProt:DP00181 ProteinModelPortal:P08775
            SMR:P08775 DIP:DIP-46369N IntAct:P08775 STRING:P08775
            PhosphoSite:P08775 PaxDb:P08775 PRIDE:P08775
            Ensembl:ENSMUST00000058470 Ensembl:ENSMUST00000071213 GeneID:20020
            KEGG:mmu:20020 UCSC:uc007jrj.1 InParanoid:Q5F298 NextBio:297535
            Bgee:P08775 CleanEx:MM_POLR2A Genevestigator:P08775
            GermOnline:ENSMUSG00000005198 Uniprot:P08775
        Length = 1970

 Score = 137 (53.3 bits), Expect = 2.5e-05, P = 2.5e-05
 Identities = 71/265 (26%), Positives = 95/265 (35%)

Query:   118 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP--SATTAGVVGA 175
             N P +   A G  G   G++ +   G       +  G     G   P   S  T G  G 
Sbjct:  1489 NIPGLG--AAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGF 1546

Query:   176 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP 235
              P+ ++ A   +   +P  A    P  PG      PG  +   PS      PSY P   P
Sbjct:  1547 SPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGAMSPSYSPTS-P 1598

Query:   236 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
              Y+P    GY  Q  S Y +   P+Y     PSY P     Y     P+Y     P Y  
Sbjct:  1599 AYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPSYS-PTSPSYSP 1652

Query:   296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
                P Y     P Y +  +PSY P   P Y       Y    +PSY P+  +    +P  
Sbjct:  1653 TS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPTSPSYSPTSP-S 1705

Query:   356 AAPHGQVPPPLNNVPYGSATPPARS 380
              +P      P +  P  S T P+ S
Sbjct:  1706 YSPTSPSYSPTS--PSYSPTSPSYS 1728


>RGD|1587326 [details] [associations]
            symbol:Polr2a "polymerase (RNA) II (DNA directed) polypeptide A"
            species:10116 "Rattus norvegicus" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0003677 "DNA binding" evidence=IEA;ISO]
            [GO:0003899 "DNA-directed RNA polymerase activity" evidence=IEA]
            [GO:0004672 "protein kinase activity" evidence=IEA;ISO] [GO:0005575
            "cellular_component" evidence=ND] [GO:0005634 "nucleus"
            evidence=ISO] [GO:0005665 "DNA-directed RNA polymerase II, core
            complex" evidence=IEA;ISO] [GO:0006366 "transcription from RNA
            polymerase II promoter" evidence=IEA;ISO] [GO:0006468 "protein
            phosphorylation" evidence=ISO] [GO:0008150 "biological_process"
            evidence=ND] [GO:0031625 "ubiquitin protein ligase binding"
            evidence=IEA;ISO] [GO:0005730 "nucleolus" evidence=ISO]
            InterPro:IPR000684 InterPro:IPR000722 InterPro:IPR006592
            InterPro:IPR007066 InterPro:IPR007073 InterPro:IPR007075
            InterPro:IPR007080 InterPro:IPR007081 InterPro:IPR007083
            Pfam:PF00623 Pfam:PF04983 Pfam:PF04990 Pfam:PF04992 Pfam:PF04997
            Pfam:PF04998 Pfam:PF05000 Pfam:PF05001 PROSITE:PS00115
            SMART:SM00663 RGD:1587326 GO:GO:0003677 GO:GO:0004672 GO:GO:0006366
            Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899 GO:GO:0005665
            GeneTree:ENSGT00700000104490 KO:K03006 CTD:5430 OrthoDB:EOG4JWVCM
            IPI:IPI00952328 RefSeq:XP_001079162.1 RefSeq:XP_343923.3
            UniGene:Rn.163136 Ensembl:ENSRNOT00000068013 GeneID:363633
            KEGG:rno:363633 UCSC:RGD:1587326 NextBio:683839 ArrayExpress:D4A5A6
            Uniprot:D4A5A6
        Length = 1970

 Score = 137 (53.3 bits), Expect = 2.5e-05, P = 2.5e-05
 Identities = 71/265 (26%), Positives = 95/265 (35%)

Query:   118 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP--SATTAGVVGA 175
             N P +   A G  G   G++ +   G       +  G     G   P   S  T G  G 
Sbjct:  1489 NIPGLG--AAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGF 1546

Query:   176 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP 235
              P+ ++ A   +   +P  A    P  PG      PG  +   PS      PSY P   P
Sbjct:  1547 SPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGAMSPSYSPTS-P 1598

Query:   236 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
              Y+P    GY  Q  S Y +   P+Y     PSY P     Y     P+Y     P Y  
Sbjct:  1599 AYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPSYS-PTSPSYSP 1652

Query:   296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
                P Y     P Y +  +PSY P   P Y       Y    +PSY P+  +    +P  
Sbjct:  1653 TS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPTSPSYSPTSP-S 1705

Query:   356 AAPHGQVPPPLNNVPYGSATPPARS 380
              +P      P +  P  S T P+ S
Sbjct:  1706 YSPTSPSYSPTS--PSYSPTSPSYS 1728


>UNIPROTKB|F1RXW0 [details] [associations]
            symbol:COL5A2 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0071230 "cellular response to amino acid stimulus"
            evidence=IEA] [GO:0048592 "eye morphogenesis" evidence=IEA]
            [GO:0046332 "SMAD binding" evidence=IEA] [GO:0043588 "skin
            development" evidence=IEA] [GO:0030199 "collagen fibril
            organization" evidence=IEA] [GO:0005588 "collagen type V"
            evidence=IEA] [GO:0001501 "skeletal system development"
            evidence=IEA] [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] InterPro:IPR000885 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS51461 SMART:SM00038 GO:GO:0043588
            GO:GO:0030199 GO:GO:0001501 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0071230 GO:GO:0005201 GO:GO:0048592
            GeneTree:ENSGT00660000095287 GO:GO:0005588 OMA:PDHKPVW
            EMBL:CU467671 Ensembl:ENSSSCT00000017460 ArrayExpress:F1RXW0
            Uniprot:F1RXW0
        Length = 1269

 Score = 135 (52.6 bits), Expect = 2.5e-05, P = 2.5e-05
 Identities = 87/293 (29%), Positives = 109/293 (37%)

Query:   123 DRRADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNT 179
             ++ A+G+ G  GA G         P G    E G   P+G  GPP S    G  G    T
Sbjct:   554 EKGAEGTAGNDGARGLPGPLGPPGPAGPTG-EKGEPGPRGLVGPPGSRGNPGSRGENGPT 612

Query:   180 STSAYAATQS--GTP-MRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGP 235
                 +A  Q   G P ++     P   G   S GP G   S  P + P   P     +G 
Sbjct:   613 GAVGFAGPQGPDGQPGVKGEPGEPGQKGDAGSPGPQGLAGSPGP-HGPNGVPGLKGGRGT 671

Query:   236 GYDP--TKGPGYDAQKGS-NYDAQRGPNYDIHRGPSYDPQRGLGYDM-------QRGPNY 285
                P  T  PG   + G        GP   +   P  +   GL  D         RGP  
Sbjct:   672 QGPPGATGFPGSAGRVGPPGPTGAPGPAGPLGE-PGKEGPPGLRGDPGSHGRVGDRGPA- 729

Query:   286 DMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYD 342
                 GPG +    PG D Q GP  +    P+    QRG  G   QRG+ G      P+  
Sbjct:   730 GPPGGPGDKGD--PGEDGQPGP--DGPPGPAGTTGQRGIVGMPGQRGERGMPGLPGPAGT 785

Query:   343 PSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP-RGGNPARR 393
             P +  G  GAP    P G V PP +N P G   P   +G+ G P R G    R
Sbjct:   786 PGK-VGPTGAPGDKGPPGPVGPPGSNGPVGEPGPEGPAGNDGTPGRDGAVGER 837

 Score = 121 (47.7 bits), Expect = 0.00084, P = 0.00084
 Identities = 83/283 (29%), Positives = 103/283 (36%)

Query:   126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGV---PQGHGPPPSATTAGVVG-AGPNTST 181
             A G  G      E    G PVG        G    P   GP  S  T+G  G AGP  S 
Sbjct:   164 ARGPEGPQGQRGETGPPG-PVGSQGLPGAVGTDGTPGAKGPTGSPGTSGPPGLAGPPGSP 222

Query:   182 SAYAATQSGTP-MRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDP- 239
                 +T  G P +R     P  PG++   GP  +        P   P  +  +GP  DP 
Sbjct:   223 GPQGST--GPPGIRGQPGDPGVPGFKGEAGPKGEPGPHGIQGPIGPPGEEGKRGPRGDPG 280

Query:   240 TKGP-GYDAQKGSNYDAQRG-PNYDIHRGPS-YDPQRG-LGYDMQRGPNYDMQR-G-PGY 293
             T GP G   ++G+     RG P  D   GP     +RG +G    +G   D  R G PG 
Sbjct:   281 TVGPPGPMGERGA--PGNRGFPGSDGLPGPKGAQGERGPVGSSGPKGGQGDPGRPGEPGL 338

Query:   294 ETQR-VPGYDVQRGPVYEAQRAPSYIP-QRG----PGYDLQRGQGYDMRRAPSYDPSRGT 347
                R + G    +GP  E +  P   P + G    PG    RGQ   M        S   
Sbjct:   339 PGARGLTGNPGVQGP--EGKLGPLGAPGEDGRPGPPGSIGIRGQPGSMGLPGPKGSSGDP 396

Query:   348 GFDGAPRGAAPHGQ--VPPPLNNV-PYGSATPPARSGSGQPRG 387
             G  G    A   GQ   P     V P G   PP  +G    +G
Sbjct:   397 GKPGEAGNAGVPGQRGAPGKDGEVGPSGPVGPPGLAGERGEQG 439


>TAIR|locus:2089616 [details] [associations]
            symbol:AT3G14750 "AT3G14750" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0048573 "photoperiodism, flowering" evidence=RCA]
            EMBL:CP002686 EMBL:AY035083 EMBL:AY051034 IPI:IPI00544941
            RefSeq:NP_566492.1 UniGene:At.20367 ProteinModelPortal:Q93V84
            SMR:Q93V84 PaxDb:Q93V84 PRIDE:Q93V84 EnsemblPlants:AT3G14750.1
            GeneID:820703 KEGG:ath:AT3G14750 TAIR:At3g14750 eggNOG:NOG236769
            HOGENOM:HOG000242815 InParanoid:Q93V84 OMA:YAENYEH PhylomeDB:Q93V84
            ProtClustDB:CLSN2688383 ArrayExpress:Q93V84 Genevestigator:Q93V84
            Uniprot:Q93V84
        Length = 331

 Score = 127 (49.8 bits), Expect = 2.7e-05, P = 2.7e-05
 Identities = 40/111 (36%), Positives = 55/111 (49%)

Query:    78 RGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRRADG--------S 129
             R   +YEKK Y ++ E  ++ME   + MA E+EKLRAE+ N+      A+G        +
Sbjct:   207 RAAIDYEKKGYAENYEHGKIMEHKLVAMARELEKLRAEIANS-ETSAYANGPVGNPGGVA 265

Query:   130 YGGATGNSENETSGRPVGQNAYEDGYGV-PQ-----GHGPPPSATTAGVVG 174
             YGG  GN E   +G PV  N Y+  Y + P      G+ PPP    A   G
Sbjct:   266 YGGGYGNPE---AGYPV--NPYQPNYTMNPAQTGVVGYYPPPYGPQAAWAG 311


>UNIPROTKB|I3LSV6 [details] [associations]
            symbol:COL2A1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0071773 "cellular response to BMP stimulus"
            evidence=IEA] [GO:0071599 "otic vesicle development" evidence=IEA]
            [GO:0060351 "cartilage development involved in endochondral bone
            morphogenesis" evidence=IEA] [GO:0060272 "embryonic skeletal joint
            morphogenesis" evidence=IEA] [GO:0060174 "limb bud formation"
            evidence=IEA] [GO:0060021 "palate development" evidence=IEA]
            [GO:0048407 "platelet-derived growth factor binding" evidence=IEA]
            [GO:0043066 "negative regulation of apoptotic process"
            evidence=IEA] [GO:0042802 "identical protein binding" evidence=IEA]
            [GO:0042472 "inner ear morphogenesis" evidence=IEA] [GO:0030903
            "notochord development" evidence=IEA] [GO:0030199 "collagen fibril
            organization" evidence=IEA] [GO:0010468 "regulation of gene
            expression" evidence=IEA] [GO:0007605 "sensory perception of sound"
            evidence=IEA] [GO:0007601 "visual perception" evidence=IEA]
            [GO:0007417 "central nervous system development" evidence=IEA]
            [GO:0006029 "proteoglycan metabolic process" evidence=IEA]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005615 "extracellular
            space" evidence=IEA] [GO:0005604 "basement membrane" evidence=IEA]
            [GO:0005585 "collagen type II" evidence=IEA] [GO:0003007 "heart
            morphogenesis" evidence=IEA] [GO:0002062 "chondrocyte
            differentiation" evidence=IEA] [GO:0001958 "endochondral
            ossification" evidence=IEA] [GO:0001894 "tissue homeostasis"
            evidence=IEA] [GO:0001502 "cartilage condensation" evidence=IEA]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA] InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078
            PROSITE:PS51461 SMART:SM00038 GO:GO:0005737 GO:GO:0043066
            GO:GO:0005615 GO:GO:0003007 GO:GO:0007601 GO:GO:0030199
            GO:GO:0007417 GO:GO:0042472 GO:GO:0001894 GO:GO:0007605
            GO:GO:0071773 InterPro:IPR008160 Pfam:PF01391 GO:GO:0071599
            GO:GO:0005604 GO:GO:0001502 GO:GO:0060021 GO:GO:0002062
            GO:GO:0010468 GO:GO:0060272 GO:GO:0006029 GO:GO:0001958
            GO:GO:0060351 GO:GO:0005201 GeneTree:ENSGT00660000095287
            GO:GO:0005585 GO:GO:0060174 GO:GO:0030903 OMA:CPICPTE
            Ensembl:ENSSSCT00000031054 Uniprot:I3LSV6
        Length = 1365

 Score = 135 (52.6 bits), Expect = 2.7e-05, P = 2.7e-05
 Identities = 80/273 (29%), Positives = 107/273 (39%)

Query:   127 DGSYGGATGNSENETSGRPVGQNAYEDGYGVP--QGH-GPPPSATTAGVVGAGPNTSTSA 183
             DG  G    + E    G P G   +    G+P  +GH G P      G  GA P     +
Sbjct:   156 DGEAGKPGKSGERGPPG-PQGARGFPGTPGLPGVKGHRGYPGLDGAKGEAGA-PGVKGES 213

Query:   184 YAATQSGTPMRAAYDIPRG-PGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG 242
              +  ++G+P       PRG PG     GP   A+ A   D   GP+  P  GP   P  G
Sbjct:   214 GSPGENGSPGPMG---PRGLPGERGRTGPA-GAAGARGNDGQPGPAGPP--GP-VGPAGG 266

Query:   243 PGYDAQKGSNYDAQRGPNYDIHRGP--SYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPG 300
             PG+    G+  +A  GP     RGP  +  P+   G     GP       PG  T  +PG
Sbjct:   267 PGFPGAPGAKGEA--GPTGA--RGPEGAQGPRGEPGNPGSPGPA-GASGNPG--TDGIPG 319

Query:   301 YDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHG 360
                  G    A  AP +   RGP    Q   G    +  + +P    GF G  +G  P G
Sbjct:   320 AKGSAGAPGIAG-APGFPGPRGPPGP-QGATGPLGPKGQTGEPGIA-GFKGE-QG--PKG 373

Query:   361 QVPPPLNNV-PYGSATPPARSGS-GQPRGGNPA 391
             +   P   + P G A    + G+ G+P G  PA
Sbjct:   374 EPAVPGAELQPGGPAGEEGKRGARGEPGGAGPA 406

 Score = 123 (48.4 bits), Expect = 0.00055, P = 0.00055
 Identities = 89/295 (30%), Positives = 111/295 (37%)

Query:   120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATT--AGVVGAGP 177
             P  DR  D    GA G    +  G P G        G P   GPP       A + G   
Sbjct:    35 PRGDR-GDKGEKGAPGPRGRD--GEP-GTPGNPGPPGPPGPPGPPGLGGNFAAQMAGGFD 90

Query:   178 NTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAKG 234
               +  A      G PM      PRGP G   + GP G+      +  P   GP   P +G
Sbjct:    91 EKAGGAQMGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGRVEDNSLPKATGPM-GP-RG 145

Query:   235 PGYDPTKGPGYDAQKGS-NYDAQRGP-NYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQRG 290
             P   P K PG D + G      +RGP      RG    P  GL G    RG P  D  +G
Sbjct:   146 PPGPPGK-PGDDGEAGKPGKSGERGPPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAKG 202

Query:   291 ----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS----- 340
                 PG + +   PG +   GP+   +  P    + GP       +G D +  P+     
Sbjct:   203 EAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPGP 260

Query:   341 YDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATP---PARSGSGQPRG--GNP 390
               P+ G GF GAP GA   G+  P     P G+  P   P   GS  P G  GNP
Sbjct:   261 VGPAGGPGFPGAP-GAK--GEAGPTGARGPEGAQGPRGEPGNPGSPGPAGASGNP 312


>TAIR|locus:4010713902 [details] [associations]
            symbol:AT4G22505 species:3702 "Arabidopsis thaliana"
            [GO:0006869 "lipid transport" evidence=IEA] EMBL:CP002687
            GO:GO:0006869 InterPro:IPR016140 SUPFAM:SSF47699 UniGene:At.22887
            UniGene:At.74604 IPI:IPI00938995 RefSeq:NP_001154263.1 PRIDE:F4JLV7
            EnsemblPlants:AT4G22505.1 GeneID:5008157 KEGG:ath:AT4G22505
            OMA:GSEMAGM Uniprot:F4JLV7
        Length = 530

 Score = 130 (50.8 bits), Expect = 2.8e-05, P = 2.8e-05
 Identities = 54/229 (23%), Positives = 67/229 (29%)

Query:   158 PQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASK 217
             P+   PPP  T      A P T   +        P       P+ P     K P     +
Sbjct:    74 PRTPPPPPPRTPRTPPTAPPRTPPVSPRIPPILPPKTPPTAPPQTPPVSPPKSPPNSPPR 133

Query:   218 APSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGY 277
             AP   P + P   P + P   P + P     +       R P+    R P   P R    
Sbjct:   134 APPLSPPRTPPTSPPRVPPLSPPRTPPTSPPRAPPIPPPRTPSTSPPRAPPLSPPRTPPT 193

Query:   278 DMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR 337
                R P       P     R P     R P     R P   P R P     R       R
Sbjct:   194 SPPRAPPVPPPNTPPTSPPRAPPLSPPRTPPNSPPRTPPTSPPRAPPVPPPRISPTAPPR 253

Query:   338 APSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPR 386
             AP   P R T     PR       + PP +       +PP    +  PR
Sbjct:   254 APPLSPPR-TPPTSPPRTPPLSPPITPPTSPPRAPPLSPPRTPPTSPPR 301

 Score = 121 (47.7 bits), Expect = 0.00028, P = 0.00028
 Identities = 58/231 (25%), Positives = 69/231 (29%)

Query:   158 PQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASK 217
             P+   PPP  T        P T  +   A     P+      PR P     K P     +
Sbjct:    63 PRTPPPPPPRTPRTPPPPPPRTPRTPPTAPPRTPPVS-----PRIPPILPPKTPPTAPPQ 117

Query:   218 APSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGY 277
              P   P K P   P + P   P + P     +       R P     R P   P R    
Sbjct:   118 TPPVSPPKSPPNSPPRAPPLSPPRTPPTSPPRVPPLSPPRTPPTSPPRAPPIPPPRTPST 177

Query:   278 DMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR 337
                R P     R P     R P       P     RAP   P R P     R       R
Sbjct:   178 SPPRAPPLSPPRTPPTSPPRAPPVPPPNTPPTSPPRAPPLSPPRTPPNSPPRTPPTSPPR 237

Query:   338 APSYDPSRGTGFDGAPRGAAPHGQVP--PPLNNVPYGSATPPARSGSGQPR 386
             AP   P R +     PR  AP    P  PP +       +PP    +  PR
Sbjct:   238 APPVPPPRISP-TAPPR--APPLSPPRTPPTSPPRTPPLSPPITPPTSPPR 285

 Score = 117 (46.2 bits), Expect = 0.00077, P = 0.00077
 Identities = 53/224 (23%), Positives = 63/224 (28%)

Query:   163 PPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYD 222
             PP S   A  +      STS   A     P       PR P       P     +AP   
Sbjct:   159 PPTSPPRAPPIPPPRTPSTSPPRAPPLSPPRTPPTSPPRAPPVPPPNTPPTSPPRAPPLS 218

Query:   223 PTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG 282
             P + P   P + P   P + P     + S     R P     R P   P R         
Sbjct:   219 PPRTPPNSPPRTPPTSPPRAPPVPPPRISPTAPPRAPPLSPPRTPPTSPPRTPPLSPPIT 278

Query:   283 PNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYD 342
             P     R P     R P     R P     R P   P R P     R        +P   
Sbjct:   279 PPTSPPRAPPLSPPRTPPTSPPRAPPISPPRTPPSSPPRAPPMPPPRTPPTSPPLSPLSP 338

Query:   343 PSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPR 386
             P R       P    P   V PP +       TPP    +  P+
Sbjct:   339 PPRSPPMP--PTRTPP---VSPPTSPSRTPPVTPPRAPPTAPPQ 377


>UNIPROTKB|F1PG69 [details] [associations]
            symbol:COL3A1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 GeneTree:ENSGT00660000095287 OMA:EGSPGHP
            EMBL:AAEX03017880 Ensembl:ENSCAFT00000023503 Uniprot:F1PG69
        Length = 1467

 Score = 135 (52.6 bits), Expect = 3.0e-05, P = 3.0e-05
 Identities = 85/274 (31%), Positives = 106/274 (38%)

Query:   142 SGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIP 200
             +G+P G+ +++   G P   GPP +A   G  G AGP           SG  +R    I 
Sbjct:   653 NGKP-GEPSHQGDSGAPGERGPPGAAGPMGPRGGAGP---PGPEGGKVSGGDLRPP--IS 706

Query:   201 RGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG-PGYDAQKGS-NYDAQRG 258
              G G     GP   A   P      G    P  GPG    KG PG     G+   D  RG
Sbjct:   707 AGAGAAGPPGPPGSAG-TPGLQGMPGERGGPG-GPGPKGDKGEPGSAGADGAPGKDGPRG 764

Query:   259 PNYDIHR-GPSYDP-QRGLG--------YDMQRGPNYDMQRGPGYETQRVPGYDVQRG-P 307
             P   I   GP+  P  +G G           + GP    + GP       PG   Q G P
Sbjct:   765 PTGPIGPPGPAGQPGDKGEGGAPGLPGIAGPRGGPGERGEHGPPGPAG-FPGAPGQNGEP 823

Query:   308 VYEAQR-APSYIPQRGP-GYDLQRGQGYDMRRAPSYDPSRGT-GFDGAPRGAA--PHGQ- 361
               + +R AP    + GP G     G G      P     +G  G  G P GAA  P G+ 
Sbjct:   824 GAKGERGAPGEKGEGGPPGVAGPPG-GAGPAGPPGPQGVKGERGSPGGP-GAAGFPGGRG 881

Query:   362 VP-PPLNNV---PYGSATPPARSGSGQPRGGNPA 391
             +P PP NN    P GS+  P + G   P G N A
Sbjct:   882 LPGPPGNNGNPGPPGSSGAPGKDGPPGPPGNNGA 915

 Score = 132 (51.5 bits), Expect = 6.3e-05, P = 6.3e-05
 Identities = 83/280 (29%), Positives = 101/280 (36%)

Query:   126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP----NTS 180
             A G  GG  G +       P G + +    G P   GPP     AG  G  GP      S
Sbjct:   165 AGGGIGGYPGPAGPPGPPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPAGPPGPPGAMGPS 224

Query:   181 TSAYAATQSGTPMRAAY-DIPRGPGYEASKG-PGYDASKAP-SYDPTKGPSYDPAKGPGY 237
               A    +SG P R     +P  PG +   G PG+   K    +D   G   D    PG 
Sbjct:   225 GPAGKDGESGRPGRPGERGLPGPPGMKGPAGMPGFPGMKGHRGFDGRNGEKGDTG-APGL 283

Query:   238 DPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGP-GYE 294
                 G PG +   G      RG   +  R P      G  G D  RG   D Q GP G  
Sbjct:   284 KGENGLPGENGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQPGPPGPP 338

Query:   295 -TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 353
              T   PG    +G V  A    S      PG   QRG+      A +  P    G +G+P
Sbjct:   339 GTAGFPGSPGAKGEVGPAGSPGS---NGSPG---QRGEPGPQGHAGAPGPPGPPGSNGSP 392

Query:   354 RGAAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RGG 388
              G    G  P  +   P   G+  PP   G+ G P  RGG
Sbjct:   393 GGKGEMG--PAGIPGAPGLIGARGPPGPPGTNGAPGQRGG 430

 Score = 123 (48.4 bits), Expect = 0.00060, P = 0.00060
 Identities = 78/261 (29%), Positives = 98/261 (37%)

Query:   147 GQNAYEDGYGVPQGHGPPPSATTAGVVGA----GPNTSTSAYAAT-QSGTPMRAAY-DIP 200
             G +      G P   GPP +A   G  GA    GP  S  +  +  Q G P    +   P
Sbjct:   321 GNDGARGSDGQPGPPGPPGTAGFPGSPGAKGEVGPAGSPGSNGSPGQRGEPGPQGHAGAP 380

Query:   201 RGPGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDP-TKG-PGYDAQKGS-NYDAQ 256
               PG   S G PG      P+  P   P    A+GP   P T G PG     G    +  
Sbjct:   381 GPPGPPGSNGSPGGKGEMGPAGIPG-APGLIGARGPPGPPGTNGAPGQRGGAGEPGKNGA 439

Query:   257 RG-PNYDIHRGPSYDPQ-RG-LGYDMQRG-PNYDMQRG-PGYETQR-VPGYDVQRGPV-- 308
             +G P     RG +  P   G  G D + G P      G PG   +R  PG+   RGP   
Sbjct:   440 KGEPGPRGERGEAGSPGIPGPKGEDGKDGSPGEPGANGLPGAAGERGAPGF---RGPAGA 496

Query:   309 --YEAQRAPSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPP 365
                  ++ P+   + GPG    RG  G   R      P    G  G+P G    G+  PP
Sbjct:   497 NGLPGEKGPAG-ERGGPGPAGPRGAPGEPGRDGVPGGPGM-RGMPGSPGGPGSDGKPGPP 554

Query:   366 LNNVPYGSATPPARSGS-GQP 385
              +    G   PP  SG  GQP
Sbjct:   555 GSQGESGRPGPPGPSGPRGQP 575


>UNIPROTKB|F1N2Y2 [details] [associations]
            symbol:COL5A2 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0071230 "cellular response to amino acid stimulus"
            evidence=IEA] [GO:0048592 "eye morphogenesis" evidence=IEA]
            [GO:0046332 "SMAD binding" evidence=IEA] [GO:0043588 "skin
            development" evidence=IEA] [GO:0030199 "collagen fibril
            organization" evidence=IEA] [GO:0005588 "collagen type V"
            evidence=IEA] [GO:0001501 "skeletal system development"
            evidence=IEA] [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] InterPro:IPR000885 InterPro:IPR001007
            Pfam:PF00093 Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208
            PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038 SMART:SM00214
            GO:GO:0043588 GO:GO:0030199 GO:GO:0001501 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0071230 GO:GO:0005201 GO:GO:0048592
            GeneTree:ENSGT00660000095287 GO:GO:0005588 OMA:PDHKPVW
            EMBL:DAAA02003915 EMBL:DAAA02003916 EMBL:DAAA02003917
            EMBL:DAAA02003918 IPI:IPI00826022 Ensembl:ENSBTAT00000038684
            Uniprot:F1N2Y2
        Length = 1491

 Score = 135 (52.6 bits), Expect = 3.0e-05, P = 3.0e-05
 Identities = 88/293 (30%), Positives = 110/293 (37%)

Query:   123 DRRADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNT 179
             ++ A+G+ G  GA G         P G    E G   P+G  GPP S    G  G    T
Sbjct:   785 EKGAEGTAGNDGARGLPGPLGPPGPSGPTG-EKGEPGPRGLVGPPGSRGNPGSRGENGPT 843

Query:   180 STSAYAATQS--GTP-MRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGP 235
                 +A  Q   G P ++     P   G   S GP G   S  P + P   P     +G 
Sbjct:   844 GAVGFAGPQGPDGQPGVKGEPGEPGQKGDAGSPGPQGLAGSPGP-HGPNGVPGLKGGRGT 902

Query:   236 GYDP--TKGPGYDAQKGSNYDAQR-GPNYDIHRGPSYDPQRGLGYDM-------QRGPNY 285
                P  T  PG   + G    A   GP   +   P  +   GL  D         RGP  
Sbjct:   903 QGPPGATGFPGSAGRVGPPGPAGAPGPAGPLGE-PGKEGPPGLRGDPGSHGRVGDRGPA- 960

Query:   286 DMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYD 342
                 GPG +    PG D Q GP  +    P+    QRG  G   QRG+ G      P+  
Sbjct:   961 GPPGGPGDKGD--PGEDGQPGP--DGPPGPAGTTGQRGIVGMPGQRGERGMPGLPGPAGT 1016

Query:   343 PSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP-RGGNPARR 393
             P +  G  GAP    P G V PP +N P G   P   +G+ G P R G    R
Sbjct:  1017 PGK-VGPTGAPGDKGPPGPVGPPGSNGPVGEPGPEGPAGNDGTPGRDGAVGER 1068


>UNIPROTKB|F1PG08 [details] [associations]
            symbol:COL5A2 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 GeneTree:ENSGT00660000095287
            EMBL:AAEX03017882 EMBL:AAEX03017883 EMBL:AAEX03017884
            Ensembl:ENSCAFT00000023545 OMA:ETCNGLD Uniprot:F1PG08
        Length = 1499

 Score = 135 (52.6 bits), Expect = 3.0e-05, P = 3.0e-05
 Identities = 87/293 (29%), Positives = 109/293 (37%)

Query:   123 DRRADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNT 179
             ++ A+G+ G  GA G         P G    E G   P+G  GPP S    G  G    T
Sbjct:   784 EKGAEGTAGNDGARGLPGPLGPPGPAGPTG-EKGEPGPRGLVGPPGSRGNPGSRGENGPT 842

Query:   180 STSAYAATQS--GTP-MRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGP 235
                 +A  Q   G P ++     P   G   S GP G   S  P + P   P     +G 
Sbjct:   843 GAVGFAGPQGPDGQPGVKGEPGEPGQKGDAGSPGPQGLAGSPGP-HGPNGVPGLKGGRGT 901

Query:   236 GYDP--TKGPGYDAQKGS-NYDAQRGPNYDIHRGPSYDPQRGLGYDM-------QRGPNY 285
                P  T  PG   + G        GP   +   P  +   GL  D         RGP  
Sbjct:   902 QGPPGATGFPGSAGRVGPPGPPGAPGPAGPLGE-PGKEGPPGLRGDPGSHGRVGDRGPA- 959

Query:   286 DMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYD 342
                 GPG +    PG D Q GP  +    P+    QRG  G   QRG+ G      P+  
Sbjct:   960 GPPGGPGDKGD--PGEDGQPGP--DGPPGPAGTTGQRGIVGMPGQRGERGMPGLPGPAGT 1015

Query:   343 PSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP-RGGNPARR 393
             P +  G  GAP    P G V PP +N P G   P   +G+ G P R G    R
Sbjct:  1016 PGK-VGPTGAPGDKGPPGPVGPPGSNGPVGEPGPEGPAGNDGTPGRDGAVGER 1067


>FB|FBgn0052685 [details] [associations]
            symbol:ZAP3 species:7227 "Drosophila melanogaster"
            [GO:0005634 "nucleus" evidence=IEA] [GO:0008157 "protein
            phosphatase 1 binding" evidence=IPI] [GO:0048812 "neuron projection
            morphogenesis" evidence=IMP] InterPro:IPR026314 GO:GO:0005634
            EMBL:AE014298 PANTHER:PTHR13413 GeneTree:ENSGT00440000039837
            FlyBase:FBgn0052685 RefSeq:NP_727393.1 UniGene:Dm.10734
            ProteinModelPortal:Q9W2Y5 SMR:Q9W2Y5 IntAct:Q9W2Y5 MINT:MINT-741898
            STRING:Q9W2Y5 EnsemblMetazoa:FBtr0071489 GeneID:31942
            KEGG:dme:Dmel_CG32685 UCSC:CG32685-RC InParanoid:Q9W2Y5
            PhylomeDB:Q9W2Y5 GenomeRNAi:31942 NextBio:776058
            ArrayExpress:Q9W2Y5 Bgee:Q9W2Y5 Uniprot:Q9W2Y5
        Length = 1884

 Score = 136 (52.9 bits), Expect = 3.0e-05, P = 3.0e-05
 Identities = 77/285 (27%), Positives = 109/285 (38%)

Query:   118 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPS-ATTAGVVGAG 176
             N+ N ++  D     +T N E   +  P G     +G G   G GP  +  +   V G  
Sbjct:   994 NSGNENKSQDAGDSVSTNNGEKPDNNGPPGGFGPGNGPGGGPGSGPGQNDGSRFDVFGPN 1053

Query:   177 PNTSTSAYAATQSGTPMRAAYDI---PRGPGYEASKGPGYDASKAPSYD--PTKGPSYDP 231
               +  +      +G P          P GPG   + GP +  +  P     P   P+  P
Sbjct:  1054 QVSGNNFIDLDNNGPPGFGPPGRNFGPNGPGPRGNFGPNFGHNFGPRGPGGPFIRPN-GP 1112

Query:   232 AKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP 291
               GPG  P  GP +    G N+    GPN+    GP++ P+ G      RGP+     GP
Sbjct:  1113 LPGPG--PNFGPHF-RPNGPNF----GPNF----GPNFGPRPGSRNFGPRGPD-----GP 1156

Query:   292 -GYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAP--SYDPSRGTG 348
              G      PG D   GP +   R P   P  GPG++++   G  +   P        G G
Sbjct:  1157 FG------PGRDDFGGPPFGGPR-PHMGPN-GPGHNMRGFNGGPISDNPFRRQGGPPGPG 1208

Query:   349 FDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPARR 393
             F     GA P  + P    N  +G+   P   G G   GGN  R+
Sbjct:  1209 FGNDDLGAGPP-RGPRNFGN-RFGN---PGGGGGGGGGGGNNNRK 1248


>UNIPROTKB|P08125 [details] [associations]
            symbol:COL10A1 "Collagen alpha-1(X) chain" species:9031
            "Gallus gallus" [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR008983 HOGENOM:HOG000085653 HOVERGEN:HBG108220
            GO:GO:0005581 Gene3D:2.60.120.40 InterPro:IPR001073
            InterPro:IPR008160 Pfam:PF00386 Pfam:PF01391 PRINTS:PR00007
            SMART:SM00110 SUPFAM:SSF49842 PROSITE:PS50871 eggNOG:NOG114228
            OrthoDB:EOG4FFD29 EMBL:M13496 EMBL:J04194 IPI:IPI00600819
            PIR:S23297 ProteinModelPortal:P08125 SMR:P08125 STRING:P08125
            InParanoid:P08125 Reactome:REACT_132934 PMAP-CutDB:P08125
            Uniprot:P08125
        Length = 674

 Score = 131 (51.2 bits), Expect = 3.1e-05, P = 3.1e-05
 Identities = 91/293 (31%), Positives = 116/293 (39%)

Query:   125 RADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTSA 183
             + D    GA G +       P G+   E G G P   GPP  A   G  G  GP      
Sbjct:   227 KGDRGLPGARGEAGIPGPQGPPGEPG-EVGIGKPGPMGPPGPAGIPGAKGLPGP------ 279

Query:   184 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAK-GP-GYDPT 240
               A   G+P    +  P  PG +  +GP G      P  D  +GP+  P + GP G    
Sbjct:   280 --AGLPGSPGLPGFGKPGLPGMKGHRGPEGPPGFPGPKGD--QGPAGVPGELGPAGPQGN 335

Query:   241 KGP-GYDAQKGSNYDAQRGPNYDIHR-GPSYDP----QRGL-GYDMQRGPNYDMQRG-PG 292
              GP G     G N     GP  D+   GP+  P    +RGL G D +  P Y  ++G PG
Sbjct:   336 MGPQGLKGLPGEN--GLPGPKGDMGPVGPAGFPGAKGERGLPGLDGK--PGYPGEQGLPG 391

Query:   293 YETQRVPGYDVQRGPVYEAQRA--PSYI-PQ--RG-PGYDLQRGQGYDMRRAPSYDPS-R 345
              +    PG   Q+G    A     P  + PQ  +G PG + + G      R PS  P  R
Sbjct:   392 PKGH--PGLPGQKGDTGHAGHPGLPGPVGPQGVKGVPGINGEPGP-----RGPSGIPGVR 444

Query:   346 GT----GFDGAP--RGAAPHGQVPPPLNNV------PYGSATPPARSG-SGQP 385
             G     G  GAP  +G A    +P P   V      P G   PP   G SG+P
Sbjct:   445 GPIGPPGMPGAPGAKGEAGAPGLPGPAGIVTKGLRGPMGPLGPPGPKGNSGEP 497


>UNIPROTKB|G5EF87 [details] [associations]
            symbol:swsn-1 "SWI3-like protein" species:6239
            "Caenorhabditis elegans" [GO:0042802 "identical protein binding"
            evidence=IPI] [GO:0005515 "protein binding" evidence=IPI]
            InterPro:IPR001005 InterPro:IPR007526 InterPro:IPR009057
            Pfam:PF00249 Pfam:PF04433 PROSITE:PS50934 SMART:SM00717
            GO:GO:0005634 GO:GO:0009792 GO:GO:0002009 GO:GO:0040007
            GO:GO:0040010 GO:GO:0002119 GO:GO:0003677 GO:GO:0040018
            Gene3D:1.10.10.10 InterPro:IPR011991 GO:GO:0003682
            Gene3D:1.10.10.60 SUPFAM:SSF46689 GO:GO:0046662 GO:GO:0040035
            InterPro:IPR017884 PROSITE:PS51293 GO:GO:0040027 GO:GO:0035262
            EMBL:AL110477 KO:K11649 GeneTree:ENSGT00390000018166 EMBL:AF230279
            PIR:T26449 RefSeq:NP_001256906.1 UniGene:Cel.7072 SMR:G5EF87
            IntAct:G5EF87 EnsemblMetazoa:Y113G7B.23 GeneID:180324
            KEGG:cel:CELE_Y113G7B.23 CTD:180324 WormBase:Y113G7B.23a
            OMA:HFDELEQ NextBio:908892 Uniprot:G5EF87
        Length = 789

 Score = 131 (51.2 bits), Expect = 3.8e-05, P = 3.8e-05
 Identities = 71/248 (28%), Positives = 92/248 (37%)

Query:   156 GVPQGH---GPP---PSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASK 209
             G+P G    GPP   P    +    A P    ++ AAT +  P  +    P+ P  +A+ 
Sbjct:   551 GLPPGFEVTGPPQPTPQVQISAQEAAIPEKMDTSEAATAARPP--STPQAPQAPPVQAAP 608

Query:   210 GPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDI-HRGPS 268
              P   A +AP   P    +Y    GPG  P +   Y  Q+G  Y     P     H+   
Sbjct:   609 AP-VQAPQAPQAPPQ---AYQGYGGPGGPPQQAYRYPPQQGQQYSPYPPPQQQQQHQAQQ 664

Query:   269 YDPQRGLGYDMQ-RGPNYDMQRGPGYETQRVPG--YDVQRGPVYEAQRAPSYIPQRGPGY 325
                Q   G     +GP    Q    Y     PG  Y    G   + QR P Y  Q  PG 
Sbjct:   665 AQSQAHYGPPGGGQGPPPPPQGQQYYGGPPPPGQPYGPPGGYPPQQQRPP-YQAQPYPGP 723

Query:   326 ---DLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS 382
                  QRG GY     P   P       G P    P+GQ+PPP    P+G   P  + G 
Sbjct:   724 PPPQQQRGYGYP----PPPQP-------GHPY-QQPYGQMPPP----PHGQYQPQQQQGG 767

Query:   383 GQ-PRGGN 389
                P GG+
Sbjct:   768 PMGPPGGH 775


>WB|WBGene00000677 [details] [associations]
            symbol:col-103 species:6239 "Caenorhabditis elegans"
            [GO:0042302 "structural constituent of cuticle" evidence=IEA]
            [GO:0016021 "integral to membrane" evidence=IEA] [GO:0040011
            "locomotion" evidence=IMP] InterPro:IPR002486 Pfam:PF01484
            SMART:SM01088 GO:GO:0040011 GeneTree:ENSGT00690000102663
            GO:GO:0042302 HOGENOM:HOG000085656 EMBL:FO081484 PIR:E88633
            RefSeq:NP_499982.1 ProteinModelPortal:O45114 STRING:O45114
            EnsemblMetazoa:F56B3.1 GeneID:176901 KEGG:cel:CELE_F56B3.1
            UCSC:F56B3.1 CTD:176901 WormBase:F56B3.1 eggNOG:NOG301529
            InParanoid:O45114 OMA:SNTCPPG NextBio:894512 Uniprot:O45114
        Length = 371

 Score = 126 (49.4 bits), Expect = 4.4e-05, P = 4.4e-05
 Identities = 87/287 (30%), Positives = 103/287 (35%)

Query:   119 APNVDRRA------DGSYGGATGNSE-NETSGRPVGQNA---YEDGYGVPQGHGPPPSAT 168
             APN ++R        G YGG  G +      G  VG      Y  G+G   GHG      
Sbjct:    63 APNREKRGYAQYGGGGGYGGGHGGAAVGGGYGGAVGGGGGGGYGGGHG--GGHGGAVGGG 120

Query:   169 TAGVVGAGPNTSTSAYAAT----QSGTPMRAAYD-IPRGPGYEASKGPGYDASKAPSYDP 223
               G  G G     S  + T      G P +A  D +P  PG   S G     S   S   
Sbjct:   121 YGGGGGGGGGCQCSPSSNTCPPGPRGPPGQAGLDGLPGAPGQPGSNGGA--GSNGASEGS 178

Query:   224 TKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGP 283
               G    PA  PG  P  GP   A +  N D Q G        PS+    G+G     GP
Sbjct:   179 AGGCKTCPAGPPG--PP-GPAGQAGRPGN-DGQPG-------APSFGG--GVGAPGAPGP 225

Query:   284 NYDM-QRG-PGYETQRV-PGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS 340
               D    G PG   Q   PG + Q G        P+  P   PG +   G GY +   P 
Sbjct:   226 AGDAGSPGQPGAPGQPGRPGKNAQGGSSRPGPPGPAG-PPGPPGNNGAPGGGYGV--GPP 282

Query:   341 YDPSRGTGFDGAPRGAAPHGQVPPPLNN-VPYGSAT--P-PARSGSG 383
               P   +G  GAP    P GQ   P N+  P   A   P P R G G
Sbjct:   283 GPPGP-SGRPGAPGQPGPDGQPGAPGNDGTPGTDAAYCPCPGRGGGG 328


>RGD|628797 [details] [associations]
            symbol:Prpmp5 "proline-rich protein MP5" species:10116 "Rattus
            norvegicus" [GO:0005576 "extracellular region" evidence=IEA]
            RGD:628797 GO:GO:0005576 InterPro:IPR026086 PANTHER:PTHR23203
            CTD:5542 KO:K13911 EMBL:L17318 EMBL:M11899 IPI:IPI00187926
            PIR:B48013 RefSeq:NP_742062.1 UniGene:Rn.29950 GeneID:257651
            KEGG:rno:257651 UCSC:RGD:628797 NextBio:624204
            Genevestigator:P10165 Uniprot:P10165
        Length = 295

 Score = 124 (48.7 bits), Expect = 4.5e-05, P = 4.5e-05
 Identities = 63/200 (31%), Positives = 77/200 (38%)

Query:   200 PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQR-- 257
             P   G +    PG  + + P   P  GP   P +GP   P  GP    Q GS        
Sbjct:   101 PPAAGPQRPPQPG--SPQGPP--PPGGPQQRPPQGP--PPQGGPQRPPQPGSPQGPPPPG 154

Query:   258 GPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP---GYETQRVP-GYDVQRGPVYEAQR 313
             GP     +GP   PQ G     QR P     +GP   G   QR P G   Q GP    QR
Sbjct:   155 GPQQRPPQGPP--PQGG----PQRPPQPGSPQGPPPPGGPQQRAPQGPPPQGGP----QR 204

Query:   314 APSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPP--PLNNVPY 371
              P     +GP        G   +R P   P +G G    P+  +P G  PP  P    P 
Sbjct:   205 PPQPGSPQGPP-----PPGGPQQRPPQGPPPQG-GPQRPPQPGSPQGPPPPGGPQQRPPQ 258

Query:   372 GSATPPARSGSGQP-RGGNP 390
             G   PP + G  +P + GNP
Sbjct:   259 G---PPPQGGPQRPPQPGNP 275


>UNIPROTKB|E2RA07 [details] [associations]
            symbol:EWSR1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0005622 "intracellular" evidence=IEA] [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0000166 "nucleotide binding"
            evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358
            PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360 SMART:SM00547
            GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
            GO:GO:0005622 GeneTree:ENSGT00530000063105 OMA:EGTSTGY
            EMBL:AAEX03014786 EMBL:AAEX03014787 Ensembl:ENSCAFT00000019384
            Uniprot:E2RA07
        Length = 671

 Score = 117 (46.2 bits), Expect = 4.6e-05, Sum P(2) = 4.6e-05
 Identities = 63/238 (26%), Positives = 87/238 (36%)

Query:   128 GSYGGATGNSENETSGRPV-GQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 180
             G+YG  T  S  +       GQ AY   YG P  G+  P  P A +  V G G    +T+
Sbjct:    43 GTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPAGYTTPTAPQAYSQPVQGYGTGAYDTT 102

Query:   181 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG-YD 238
             T+    TQ+    ++AY   P  P Y   + P   A   P        +  P    G Y+
Sbjct:   103 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPARPQDGNKPAETSQPQSSTGGYN 160

Query:   239 -PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
              P+ G G   Q   +Y    G  P   +   PSY P R   ++      Y   R   Y +
Sbjct:   161 QPSLGYG---QSNYSYPQVPGSYPMQPVTAPPSYPPTR---FNSSSLKLYHYSRS--YSS 212

Query:   296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 353
              +   YD            PS   Q+   Y  Q    Y  +   SY P  G+ +  AP
Sbjct:   213 TQPTSYDQSSYSQQNTYGQPSSYGQQS-SYGQQ--SSYGQQPPTSYPPQTGS-YSQAP 266

 Score = 57 (25.1 bits), Expect = 4.6e-05, Sum P(2) = 4.6e-05
 Identities = 19/46 (41%), Positives = 21/46 (45%)

Query:   354 RGAAP--HGQ-VPPPLNNVPYGSATP--PA--RSGSGQPRGGNPAR 392
             RG  P   G+ +PPPL   P G   P  P     G G  RGG P R
Sbjct:   470 RGGMPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPR 515


>ZFIN|ZDB-GENE-030131-8373 [details] [associations]
            symbol:col10a1 "collagen, type X, alpha 1"
            species:7955 "Danio rerio" [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR008983 ZFIN:ZDB-GENE-030131-8373 GO:GO:0005581
            Gene3D:2.60.120.40 InterPro:IPR001073 InterPro:IPR008160
            Pfam:PF00386 Pfam:PF01391 PRINTS:PR00007 SMART:SM00110
            SUPFAM:SSF49842 PROSITE:PS50871 GeneTree:ENSGT00700000104270
            OMA:KPGHGSP EMBL:CU306817 IPI:IPI00491103
            Ensembl:ENSDART00000091021 ArrayExpress:F1QXD5 Bgee:F1QXD5
            Uniprot:F1QXD5
        Length = 655

 Score = 129 (50.5 bits), Expect = 5.0e-05, P = 5.0e-05
 Identities = 81/269 (30%), Positives = 107/269 (39%)

Query:   145 PVGQNAYEDGYGVPQGHGPP----PSATTA-GVVGA--GPNTSTSAYAATQSGTPMRAAY 197
             P G  A +DG G+P   GPP    P+  +A G  G+  GP    +  A    G       
Sbjct:    64 PPGP-AGQDGEGLPGPQGPPGAPGPAGYSAPGKPGSPGGPGKPGATGAPGLKGDTGAPGL 122

Query:   198 DIPRG-PGYEASKGP-GYDASKAPSYDPTKGPSYDP-AKGP-GYDPTKG----PGYDAQK 249
               PRG PG   S GP G  A+  P      GP+  P A GP G    KG    PG   QK
Sbjct:   123 QGPRGMPGPSGSPGPAGISATGKP------GPAGLPGAMGPRGEQGFKGHPGIPGLPGQK 176

Query:   250 GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG-PGYETQR-VPGYDVQRGP 307
             G      +GP  +  RGP+  P    G     G     + G PG   +   PG D + GP
Sbjct:   177 GEMGVGVQGPAGE--RGPT-GPVGPSGKPGAPGVGLPGKPGAPGEAGKSGSPGRDGESGP 233

Query:   308 VY-EAQRAPSYIPQRG-PGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPP 364
             +  + Q+  +  P  G PG   + G  G      P   P   +G  GAP G   +G+  P
Sbjct:   234 MGPQGQKGQTGAPGVGIPGKPGENGAPGMPGPTGPK-GPQGASGAPGAP-GVPGYGK--P 289

Query:   365 PLNNVPYGSATPPARSGSGQPRGGNPARR 393
               N +      P +   +GQ   G P  +
Sbjct:   290 GENGLKGDRGVPGSPGTTGQK--GEPGAK 316


>UNIPROTKB|Q04118 [details] [associations]
            symbol:PRB3 "Basic salivary proline-rich protein 3"
            species:9606 "Homo sapiens" [GO:0005576 "extracellular region"
            evidence=NAS] [GO:0051636 "Gram-negative bacterial cell surface
            binding" evidence=NAS] [GO:0008150 "biological_process"
            evidence=ND] GO:GO:0005576 GO:GO:0051636 InterPro:IPR026086
            PANTHER:PTHR23203 EMBL:X07637 EMBL:X07881 EMBL:BC096209
            EMBL:BC096210 EMBL:BC096211 IPI:IPI00006699 PIR:A36298 PIR:B36298
            PIR:S10889 RefSeq:NP_006240.4 UniGene:Hs.73031 STRING:Q04118
            DMDM:229462763 PaxDb:Q04118 PRIDE:Q04118 Ensembl:ENST00000381842
            GeneID:5544 KEGG:hsa:5544 CTD:5544 GeneCards:GC12M011418
            H-InvDB:HIX0201930 HGNC:HGNC:9339 MIM:168840 neXtProt:NX_Q04118
            PharmGKB:PA33701 HOGENOM:HOG000060075 GenomeRNAi:5544 NextBio:21478
            ArrayExpress:Q04118 Bgee:Q04118 CleanEx:HS_PRB3
            Genevestigator:Q04118 GermOnline:ENSG00000197870 Uniprot:Q04118
        Length = 309

 Score = 124 (48.7 bits), Expect = 5.0e-05, P = 5.0e-05
 Identities = 79/271 (29%), Positives = 99/271 (36%)

Query:   137 SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAA 196
             S +  SG+P G+     G   PQ   PPP     G    G N S         G P R  
Sbjct:    28 SPSVISGKPEGRRP--QGGNQPQ-RTPPPPGKPEGRPPQGGNQS--------QGPPPRPG 76

Query:   197 YDIPRGP---GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNY 253
                P GP   G   S+GP     K P   P +G +   ++GP   P K  G   Q G N 
Sbjct:    77 K--PEGPPPQGGNQSQGPPPRPGK-PEGQPPQGGNQ--SQGPPPRPGKPEGPPPQ-GGNQ 130

Query:   254 DAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP----GYETQRVPGYDVQ-RGPV 308
                  P      GP   P +G        P+     GP    G ++Q  P    +  GP 
Sbjct:   131 SQGPPPRPGKPEGP---PPQGGNQSQGPPPHPGKPEGPPPQGGNQSQGPPPRPGKPEGPP 187

Query:   309 YEAQRAPSYIPQRGPGY-DLQRGQGYDMRRAPSYDPSR--GTGFDGA--PRGAAPH-G-- 360
              +        P R PG  +    QG +  + P   P +  G+   G   P+G  PH G  
Sbjct:   188 PQGGNQSQGPPPR-PGKPEGPPPQGGNQSQGPPPRPGKPEGSPSQGGNKPQGPPPHPGKP 246

Query:   361 QVPPPLN-NVPYGSATPPARSGSGQPRGGNP 390
             Q PPP   N P     PP R     P GGNP
Sbjct:   247 QGPPPQEGNKPQ-RPPPPGRPQGPPPPGGNP 276


>TAIR|locus:2204400 [details] [associations]
            symbol:AT1G76010 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005576
            "extracellular region" evidence=ISM] [GO:0008150
            "biological_process" evidence=ND] [GO:0005829 "cytosol"
            evidence=IDA] InterPro:IPR002775 Pfam:PF01918 EMBL:CP002684
            GO:GO:0005829 GO:GO:0003676 EMBL:AF412102 EMBL:AY054208
            EMBL:AF428441 EMBL:AY124847 IPI:IPI00531013 RefSeq:NP_565124.1
            UniGene:At.24580 UniGene:At.67776 UniGene:At.75066 HSSP:P60849
            ProteinModelPortal:Q93VA8 SMR:Q93VA8 STRING:Q93VA8 PRIDE:Q93VA8
            EnsemblPlants:AT1G76010.1 GeneID:843932 KEGG:ath:AT1G76010
            TAIR:At1g76010 HOGENOM:HOG000240806 InParanoid:Q93VA8 OMA:YDGPPQG
            PhylomeDB:Q93VA8 ProtClustDB:CLSN2917456 Genevestigator:Q93VA8
            Uniprot:Q93VA8
        Length = 350

 Score = 125 (49.1 bits), Expect = 5.0e-05, P = 5.0e-05
 Identities = 70/207 (33%), Positives = 88/207 (42%)

Query:   144 RPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQ---SGTPMRAAYDIP 200
             +P+G   YE   G P G G           G G     +AY   +    G     +Y   
Sbjct:   134 KPMGDIDYEGREGSPGGRGRGRGRGRGR--GRGRGGRGNAYVNVEHEDGGWEREQSYGRG 191

Query:   201 RGPGY-EASKGPGYDASKAP--SYDPTK--GPSYD-PAKGPGYDPTKGPGYDA--QKGSN 252
             RG G   +S+G G      P   YD  +  G  YD P +  GYD  +G GYDA  Q    
Sbjct:   192 RGRGRGRSSRGRGRGGYNGPPNEYDAPQDGGYGYDAPHEHRGYDD-RG-GYDAPPQGRGG 249

Query:   253 YDAQRGPN-YDIHRGP-SYD--PQ-RGLGYDMQRGPNYDMQRGPGYE--TQRVPGYDVQR 305
             YD  +G   YD  +G   YD  PQ RG GYD   GP+    RG GY+  +Q   GYD   
Sbjct:   250 YDGPQGRGGYDGPQGRRGYDGPPQGRG-GYD---GPSQG--RG-GYDGPSQGRGGYD--- 299

Query:   306 GPVYEAQRAPSYIPQRGPGYDLQRGQG 332
             GP   +Q    Y   +G G    RG+G
Sbjct:   300 GP---SQGRGGYDGPQGRGRGRGRGRG 323


>UNIPROTKB|F1RZK4 [details] [associations]
            symbol:COL10A1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005938 "cell cortex" evidence=IEA] [GO:0005581
            "collagen" evidence=IEA] InterPro:IPR008983 GO:GO:0005938
            GO:GO:0005581 Gene3D:2.60.120.40 InterPro:IPR001073
            InterPro:IPR008160 Pfam:PF00386 Pfam:PF01391 PRINTS:PR00007
            SMART:SM00110 SUPFAM:SSF49842 PROSITE:PS50871
            GeneTree:ENSGT00700000104270 OMA:IKGPPPN EMBL:CU062641
            Ensembl:ENSSSCT00000004901 Uniprot:F1RZK4
        Length = 675

 Score = 129 (50.5 bits), Expect = 5.2e-05, P = 5.2e-05
 Identities = 88/296 (29%), Positives = 113/296 (38%)

Query:   123 DRRADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG--AGPN 178
             ++ A G  G  G  G +     GRP G        G P G   PP     G  G    P 
Sbjct:   176 EKGAPGVPGINGQKGETGYGAPGRP-GDRGLPGPQG-PMGPPGPPGVGKRGENGFPGQPG 233

Query:   179 TSTSAYAATQSGTPMRAAYDIPRGP-GYEASKG---PGYD-ASKAPSYDPTKG----PSY 229
                      +SG P  A    P+GP G +  +G   PG   A+  P    TKG    P  
Sbjct:   234 IKGDRGFPGESG-P--AGPPGPQGPPGEQGREGIGKPGAPGAAGQPGLPGTKGHPGAPGM 290

Query:   230 -DPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDP-QRGL-GYDMQRGPNYD 286
               P   PG+     PG   Q+G        P     +GP+  P + GL G    RGP   
Sbjct:   291 AGPPGAPGFGKPGLPGLKGQRGP-IGLPGAPGAKGEQGPAGHPGEPGLTGPPGSRGP--- 346

Query:   287 MQRGPGYETQRVPGYDVQRGPVYEAQRA-PSYIP----QRGP-GYDLQRGQ-GYDMRRAP 339
               +GP    + +PG +   GP  E   A P+  P    +RGP G D + G  G      P
Sbjct:   347 --QGP----KGIPGNNGVPGPKGEIGLAGPAGFPGAKGERGPSGLDGKPGYPGEPGLNGP 400

Query:   340 SYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNPARR 393
               +P    G  G P    P G +P P+   P G+   P  +G G PRG  G P  R
Sbjct:   401 KGNPGL-PGPKGDPGIGGPPG-LPGPVG--PAGAKGVPGHNGEGGPRGAPGIPGTR 452


>ZFIN|ZDB-GENE-030131-2281 [details] [associations]
            symbol:col4a5 "collagen, type IV, alpha 5 (Alport
            syndrome)" species:7955 "Danio rerio" [GO:0005201 "extracellular
            matrix structural constituent" evidence=IEA] [GO:0005581 "collagen"
            evidence=IEA] [GO:0031290 "retinal ganglion cell axon guidance"
            evidence=IMP] [GO:0007412 "axon target recognition" evidence=IMP]
            [GO:0030198 "extracellular matrix organization" evidence=IMP]
            InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
            ZFIN:ZDB-GENE-030131-2281 InterPro:IPR016187 SUPFAM:SSF56436
            GO:GO:0030198 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            eggNOG:NOG12793 GO:GO:0007412 GO:GO:0031290 GO:GO:0005201
            HOVERGEN:HBG004933 HOGENOM:HOG000085652 OrthoDB:EOG45DWPF
            Gene3D:2.170.240.10 GeneTree:ENSGT00690000101772 KO:K06237 CTD:1287
            OMA:MPMNMEP EMBL:CR354588 EMBL:CR936978 IPI:IPI00835382
            RefSeq:NP_001116702.1 UniGene:Dr.77841 SMR:B0UXF7
            Ensembl:ENSDART00000073827 GeneID:323561 KEGG:dre:323561
            NextBio:20808319 Uniprot:B0UXF7
        Length = 1659

 Score = 133 (51.9 bits), Expect = 5.6e-05, P = 5.6e-05
 Identities = 83/294 (28%), Positives = 100/294 (34%)

Query:   117 MNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPP--PSATTAGVVG 174
             M  P V  R      G  G+        P GQ  +    G+P   G P  P     G  G
Sbjct:   652 MTVPEVGERGPPGQDGDPGSQGRPGDSGPPGQPGFP---GLPGSKGEPGLPGIGLPGPPG 708

Query:   175 AGPNTSTSAYAATQSGTPMRAAYD-IPRGPGYEASKG-PGYDASKAPSYDPTKGPSYDPA 232
             A       A +    G P R   D +P  PG   SKG PGY     P   PT  P     
Sbjct:   709 A-KGFPGIAGSPGGPGIPGRPGLDGLPGQPGLPGSKGDPGYGLPGPPG--PTGSPGIKGG 765

Query:   233 KGPGYDPTKGPGYDAQKGS-NYDAQRGPNYD--IHRGPS-YDPQRGLGYDMQRGPNYDMQ 288
              GP  D +  PG   Q G    D   GP  D     GP    P     + +Q  P     
Sbjct:   766 PGPKGD-SGFPGSPGQPGRPGLDGAPGPKGDAGFPGGPGPRGPPGAPAFGLQGPPG--PP 822

Query:   289 RGPG-YETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG-QGYDMRRAPSYDPS-R 345
               PG   +  VPG + ++G      R P  +    PG+   RG  G      P   P   
Sbjct:   823 GAPGSIGSPGVPGANGEKG-----DRGPPGLST--PGFQGDRGISGLPGPPGPVGPPGVP 875

Query:   346 GT-GFDGAPRGAAPHGQV----PPPLNNVPYGSATP--PARSGS-GQP-RGGNP 390
             G  G DG P      G++    PP     P     P  P   G  G P + GNP
Sbjct:   876 GRPGQDGLPGLPGSKGEMGSMGPPGSKGNPGNPGAPGFPGPKGDDGVPGQSGNP 929

 Score = 126 (49.4 bits), Expect = 0.00033, P = 0.00032
 Identities = 82/275 (29%), Positives = 97/275 (35%)

Query:   132 GATGNSENETSGR-PVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-G-PNTSTSAYAATQ 188
             G  G +  E   R P GQ+      G P   GPP      G+ G+ G P           
Sbjct:   648 GEPGMTVPEVGERGPPGQDGDPGSQGRPGDSGPPGQPGFPGLPGSKGEPGLPGIGLPGPP 707

Query:   189 SGTPMRAAYDIPRGPGYEASKGPGYDASKA-PSYDPTKG-PSYDPAKGPGYDPTKGPGYD 246
                        P GPG      PG D     P    +KG P Y     PG  PT  PG  
Sbjct:   708 GAKGFPGIAGSPGGPGIPGR--PGLDGLPGQPGLPGSKGDPGYGLPGPPG--PTGSPGI- 762

Query:   247 AQKGSNYDAQRGPNYDI-HRGPSYDPQRGLGYDMQRGPNYD--MQRGPGYE-TQRVPGYD 302
               KG       GP  D    G    P R  G D   GP  D     GPG       P + 
Sbjct:   763 --KGGP-----GPKGDSGFPGSPGQPGRP-GLDGAPGPKGDAGFPGGPGPRGPPGAPAFG 814

Query:   303 VQRGPVYEAQRAPSYIPQRG-PGYDLQRG-QGYDMRRAPSYDPSRG-TGFDGAPRGAAPH 359
             +Q GP      AP  I   G PG + ++G +G      P +   RG +G  G P    P 
Sbjct:   815 LQ-GPP-GPPGAPGSIGSPGVPGANGEKGDRGPPGLSTPGFQGDRGISGLPGPPGPVGPP 872

Query:   360 GQVP--PPLNNVPYGSATPPARSGSGQPRG--GNP 390
             G VP  P  + +P G        GS  P G  GNP
Sbjct:   873 G-VPGRPGQDGLP-GLPGSKGEMGSMGPPGSKGNP 905


>UNIPROTKB|G3N3C9 [details] [associations]
            symbol:LDB3 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0030018 "Z disc" evidence=IEA] [GO:0008092
            "cytoskeletal protein binding" evidence=IEA] [GO:0005856
            "cytoskeleton" evidence=IEA] [GO:0005080 "protein kinase C binding"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            Pfam:PF00595 Pfam:PF00412 InterPro:IPR001478 InterPro:IPR001781
            PROSITE:PS00478 PROSITE:PS50023 PROSITE:PS50106 SMART:SM00132
            SMART:SM00228 GO:GO:0030018 GO:GO:0005856 GO:GO:0046872
            GO:GO:0008270 Gene3D:2.10.110.10 SUPFAM:SSF50156 OMA:CTSQATT
            InterPro:IPR006643 SMART:SM00735 GeneTree:ENSGT00700000104411
            EMBL:DAAA02062163 Ensembl:ENSBTAT00000065403 Uniprot:G3N3C9
        Length = 730

 Score = 129 (50.5 bits), Expect = 5.7e-05, P = 5.7e-05
 Identities = 54/206 (26%), Positives = 76/206 (36%)

Query:   115 ELMNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG 174
             E M  P+ +         +T +    TS  P   + Y +    P    P P   T   + 
Sbjct:   353 EYMQDPDEEALRRSRPQASTYSPAVATSPAPAA-HTYSEAPAAP---APKPRVVTTASIR 408

Query:   175 AGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKG 234
               P+      A+T S +P  A Y  P  P Y  S  P Y  S  P+Y P+  P+Y P+  
Sbjct:   409 --PSVYQPVPASTYSPSP-GANYS-PT-P-YTPSPAPAYTPSPTPAYTPSPAPTYSPSPA 462

Query:   235 PGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG-PGY 293
             P Y P+  P Y+    S   A+           S+  +   G          + RG P Y
Sbjct:   463 PAYTPSPAPSYNPTLYSGGPAESASRPPWVTDDSFSQKFAPGKTTTTVSKQSLPRGAPAY 522

Query:   294 ETQRVPGYDVQ---RGPVYEAQRAPS 316
              T   P   V    RG V  A+R P+
Sbjct:   523 -TPPPPAPQVSPLARGTVQRAERFPA 547


>UNIPROTKB|G8ENL4 [details] [associations]
            symbol:FUS "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0000166
            "nucleotide binding" evidence=IEA] InterPro:IPR000504
            InterPro:IPR001876 InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641
            PROSITE:PS01358 PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360
            SMART:SM00547 GO:GO:0005634 GO:GO:0005737 GO:GO:0000166
            GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
            GeneTree:ENSGT00530000063105 EMBL:CU464163 EMBL:JF940526
            Ensembl:ENSSSCT00000036326 Uniprot:G8ENL4
        Length = 517

 Score = 127 (49.8 bits), Expect = 5.8e-05, P = 5.8e-05
 Identities = 68/240 (28%), Positives = 93/240 (38%)

Query:   128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAAT 187
             G+Y    G   ++ S +P GQ +Y  GYG          ++     G   NT   A +A 
Sbjct:    15 GAYPTQPGQGYSQQSNQPYGQQSYS-GYGQSADTSGYGQSSYGSSYGQTQNTGYGAQSAP 73

Query:   188 QSGTPMRAAYDIPRGP--GY-EASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPG 244
             Q G      Y   +G    Y + S  PGY    APS   T G     ++  GY   +  G
Sbjct:    74 Q-GYGSTGGYGSGQGSQSSYGQQSSYPGYGQQPAPS--STSGSYGTSSQSSGYGQPQSGG 130

Query:   245 YDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET--QRVPGYD 302
             Y  Q G  Y  Q+  +Y   +  SY+P +G G   Q   +     G G  +  Q  P   
Sbjct:   131 YGQQSG--YGGQQ-QSYGQQQ--SYNPPQGYGQQNQYNSSSGGGGGGGGGSYGQDQPSMS 185

Query:   303 VQRGPVYEAQ-RAPSYI--PQ----RGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
                G  Y  Q ++  Y    Q    RG G     G GY+ R +  Y+P RG G     RG
Sbjct:   186 GGGGGGYGNQDQSGGYGGGQQDRGGRGRGGGSGGGGGYN-RSSGGYEP-RGRGGGRGGRG 243

 Score = 117 (46.2 bits), Expect = 0.00074, P = 0.00074
 Identities = 63/218 (28%), Positives = 80/218 (36%)

Query:   187 TQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPS-YDPAK-GPGYDPTKGPG 244
             TQ  T    AY    G GY       Y       Y  +   S Y  +  G  Y  T+  G
Sbjct:     7 TQQATQSYGAYPTQPGQGYSQQSNQPYGQQSYSGYGQSADTSGYGQSSYGSSYGQTQNTG 66

Query:   245 YDAQKG-SNYDAQRGPNYDIHRGP--SYDPQRGL-GYDMQRGPN-----YDMQ-RGPGYE 294
             Y AQ     Y +  G  Y   +G   SY  Q    GY  Q  P+     Y    +  GY 
Sbjct:    67 YGAQSAPQGYGSTGG--YGSGQGSQSSYGQQSSYPGYGQQPAPSSTSGSYGTSSQSSGYG 124

Query:   295 TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPR 354
               +  GY  Q G  Y  Q+  SY  Q+   Y+    QGY  +    Y+ S G G  G   
Sbjct:   125 QPQSGGYGQQSG--YGGQQQ-SYGQQQS--YNPP--QGYGQQN--QYNSSSGGGGGG--- 172

Query:   355 GAAPHGQVPPPLNNVP---YGSATPPARSGSGQP-RGG 388
             G   +GQ  P ++      YG+       G GQ  RGG
Sbjct:   173 GGGSYGQDQPSMSGGGGGGYGNQDQSGGYGGGQQDRGG 210


>RGD|71029 [details] [associations]
            symbol:Col3a1 "collagen, type III, alpha 1" species:10116 "Rattus
           norvegicus" [GO:0001501 "skeletal system development" evidence=IEP]
           [GO:0001568 "blood vessel development" evidence=IEA;ISO] [GO:0005201
           "extracellular matrix structural constituent" evidence=IEA]
           [GO:0005581 "collagen" evidence=ISO] [GO:0005586 "collagen type III"
           evidence=ISO;TAS] [GO:0005615 "extracellular space" evidence=IEA]
           [GO:0007160 "cell-matrix adhesion" evidence=IEA] [GO:0007179
           "transforming growth factor beta receptor signaling pathway"
           evidence=IEA] [GO:0007229 "integrin-mediated signaling pathway"
           evidence=IEA] [GO:0007507 "heart development" evidence=IEA]
           [GO:0009314 "response to radiation" evidence=IEA] [GO:0018149
           "peptide cross-linking" evidence=IEA] [GO:0030199 "collagen fibril
           organization" evidence=IEA;ISO] [GO:0031012 "extracellular matrix"
           evidence=ISO] [GO:0032964 "collagen biosynthetic process"
           evidence=IEA] [GO:0034097 "response to cytokine stimulus"
           evidence=IEA] [GO:0042060 "wound healing" evidence=IEA] [GO:0043206
           "extracellular fibril organization" evidence=IEA] [GO:0043588 "skin
           development" evidence=IEA] [GO:0046332 "SMAD binding"
           evidence=IEA;ISO] [GO:0046872 "metal ion binding" evidence=IEA]
           [GO:0048407 "platelet-derived growth factor binding" evidence=IEA]
           [GO:0048565 "digestive tract development" evidence=IEA;ISO]
           [GO:0050777 "negative regulation of immune response" evidence=IEA]
           [GO:0071230 "cellular response to amino acid stimulus"
           evidence=IEA;ISO] InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093
           Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184
           PROSITE:PS51461 SMART:SM00038 SMART:SM00214 RGD:71029 GO:GO:0043588
           GO:GO:0005615 GO:GO:0007507 GO:GO:0046872 GO:GO:0034097
           GO:GO:0030199 GO:GO:0001501 GO:GO:0007179 GO:GO:0007229
           GO:GO:0007160 InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793
           GO:GO:0042060 GO:GO:0001568 GO:GO:0048565 GO:GO:0050777
           GO:GO:0009314 GO:GO:0018149 GO:GO:0032964 GO:GO:0071230
           GO:GO:0043206 GO:GO:0005201 GeneTree:ENSGT00660000095287
           HOGENOM:HOG000085654 HOVERGEN:HBG004933 KO:K06236 OrthoDB:EOG4FTW1C
           CTD:1281 OMA:EGSPGHP GO:GO:0005586 EMBL:BC087039 EMBL:X70369
           EMBL:AJ005395 EMBL:M21354 IPI:IPI00366944 PIR:S41067
           RefSeq:NP_114474.1 UniGene:Rn.3247 ProteinModelPortal:P13941
           IntAct:P13941 STRING:P13941 PRIDE:P13941 Ensembl:ENSRNOT00000004956
           GeneID:84032 KEGG:rno:84032 UCSC:RGD:71029 InParanoid:P13941
           NextBio:616623 Genevestigator:P13941 GermOnline:ENSRNOG00000003357
           Uniprot:P13941
        Length = 1463

 Score = 132 (51.5 bits), Expect = 6.3e-05, P = 6.3e-05
 Identities = 76/261 (29%), Positives = 102/261 (39%)

Query:   147 GQNAYEDGYGVPQGHGPPPSATTAGVVGA----GPNTSTSAYAAT-QSGTPMRAAYDIPR 201
             G +      G P   GPP +A   G  GA    GP  S  +  +  Q G P    +   +
Sbjct:   320 GNDGARGSDGQPGPPGPPGTAGFPGSPGAKGEVGPAGSPGSNGSPGQRGEPGPQGHAGAQ 379

Query:   202 GP-GYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGP 259
             GP G   + G PG      P+  P   P    A+GP   P    G   Q+G +   + G 
Sbjct:   380 GPPGPPGNNGSPGGKGEMGPAGIPG-APGLLGARGPP-GPAGANGAPGQRGPS--GEPGK 435

Query:   260 NYDIHRGPSYDPQRG-LGYDMQRGPN-YDMQRG-PGYE-TQRVPGYDVQRG-PVYEAQRA 314
             N      P    +RG  G     GP   D + G PG      VPG   +RG P +     
Sbjct:   436 N-GAKGEPGARGERGEAGSPGIPGPKGEDGKDGSPGEPGANGVPGNPGERGAPGFRGPAG 494

Query:   315 PSYIP-QRGPGYDLQRGQGYDMRRAPSYDPSR-GT-------GFDGAPRGAAPHGQVPPP 365
             P+  P ++GP  + + G G    R  + +P R GT       G  G+P G    G+  PP
Sbjct:   495 PNGAPGEKGPAGE-RGGPGPAGPRGVAGEPGRDGTPGGPGIRGMPGSPGGPGNDGKPGPP 553

Query:   366 LNNVPYGSATPPARSGS-GQP 385
              +    G   PP  SG  GQP
Sbjct:   554 GSQGESGRPGPPGPSGPRGQP 574

 Score = 126 (49.4 bits), Expect = 0.00028, P = 0.00028
 Identities = 82/284 (28%), Positives = 103/284 (36%)

Query:   128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP----NTSTS 182
             G  GG  G +       P G + +    G P   GPP     AG  G  GP      S  
Sbjct:   166 GGMGGYPGPAGPPGPPGPPGSSGHPGSPGSPGYQGPPGEPGQAGPAGPPGPPGAIGPSGP 225

Query:   183 AYAATQSGTPMRAAY-DIPRGPGYEASKG-PGYDASKAP-SYDPTKGPSYDPAKGPGYDP 239
             A    +SG P R     +P  PG +   G PG+   K    +D   G   +    PG   
Sbjct:   226 AGKDGESGRPGRPGERGLPGPPGIKGPAGIPGFPGMKGHRGFDGRNGEKGETG-APGLKG 284

Query:   240 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGP-GYE-T 295
               G PG +   G      RG   +  R P      G  G D  RG   D Q GP G   T
Sbjct:   285 ENGLPGDNGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQPGPPGPPGT 339

Query:   296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
                PG    +G V  A    S      PG   QRG+      A +  P    G +G+P G
Sbjct:   340 AGFPGSPGAKGEVGPAGSPGS---NGSPG---QRGEPGPQGHAGAQGPPGPPGNNGSPGG 393

Query:   356 AAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RG--GNPAR 392
                 G  P  +   P   G+  PP  +G+ G P  RG  G P +
Sbjct:   394 KGEMG--PAGIPGAPGLLGARGPPGPAGANGAPGQRGPSGEPGK 435

 Score = 122 (48.0 bits), Expect = 0.00077, P = 0.00077
 Identities = 80/272 (29%), Positives = 99/272 (36%)

Query:   125 RADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAY 184
             + +G   GA G         P G    +   G P G G        G+ G  P  + +  
Sbjct:   832 KGEGGPPGAAGPPGGSGPAGPPGPQGVKGERGSPGGPGAAGFPGGRGLPGP-PGNNGNPG 890

Query:   185 AATQSGTPMRAAYDIPRGP-GYEASKG-PGYDASKAPSYDP-TKGPSYDPAKGPGYDPTK 241
                 SG P +   D P GP G   S G PG    K  +  P  KGP    A+GP   P  
Sbjct:   891 PPGPSGAPGK---DGPPGPAGNSGSPGNPGVAGPKGDAGQPGEKGPP--GAQGPPGSP-- 943

Query:   242 GP-GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPN-YDMQRGPGYETQRVP 299
             GP G     G+   A   P     RG S  PQ   G   + G + ++ +RGP    Q +P
Sbjct:   944 GPLGIAGLTGARGLAGP-PGMPGPRG-SPGPQGIKGESGKPGASGHNGERGPP-GPQGLP 1000

Query:   300 GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPH 359
             G   Q G   E  R  +      PG D   G   D  R  +  P    G  GAP    P 
Sbjct:  1001 G---QPGTAGEPGRDGNPGSDGQPGRDGSPGGKGD--RGENGSP----GAPGAPGHPGPP 1051

Query:   360 GQVPPPLNNVPYGSATPPARSGSGQPRGGNPA 391
             G V P   N   G   P   SG+  P G   A
Sbjct:  1052 GPVGPSGKNGDRGETGPAGPSGAPGPAGARGA 1083


>ZFIN|ZDB-GENE-050302-9 [details] [associations]
            symbol:col2a1b "collagen type II, alpha-1b"
            species:7955 "Danio rerio" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0033333 "fin development" evidence=IMP]
            [GO:0033334 "fin morphogenesis" evidence=IMP] [GO:0005581
            "collagen" evidence=IEA] EMBL:HF563615 EMBL:HF563616 EMBL:HF563617
            Uniprot:L0S5L0
        Length = 1493

 Score = 132 (51.5 bits), Expect = 6.4e-05, P = 6.4e-05
 Identities = 82/282 (29%), Positives = 99/282 (35%)

Query:   123 DRRADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNT 179
             +R   G  G  GA GN        P G        G P   G P +   AG  GA GP  
Sbjct:   337 ERGRPGPSGASGARGNDGLPGGAGPPGPVGTAGSPGFP---GSPGAKGEAGPTGARGPEG 393

Query:   180 STSAYAATQSGTPMRAAYDIPRG-PGYEASKG-PGYDASK-APSYDPTKG-PSYDPAKGP 235
             +       +SG P  +    P G  G   S G PG   S  AP      G P   P   P
Sbjct:   394 AQGPRG--ESGVPGASG---PSGVSGNPGSDGMPGAKGSVGAPGIGGAPGFPG--PRGPP 446

Query:   236 GYDPTKGP-GYDAQKGSN----YDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 290
             G     GP G   Q G +    +  + GP  +I            G + +RGP  +    
Sbjct:   447 GPQGATGPLGPKGQSGDSGLAGFKGEAGPKGEIGNAGLQGAPGPAGEEGKRGPRGEPGAA 506

Query:   291 --PGYETQR-VPGYDVQRG-PVYEAQRAPSYIP-QRGP-GYDLQRGQGYDMRRAPSYDPS 344
               PG   +R  PG    RG P  +    P   P +RGP G    +G G D  R       
Sbjct:   507 GPPGPTGERGTPG---NRGFPGQDGLAGPKGAPGERGPAGVSGPKGAGGDPGRPGEPGLP 563

Query:   345 RGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSG-SGQP 385
                G  G P  A P G+V P       G   PP   G  GQP
Sbjct:   564 GARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGVRGQP 605

 Score = 131 (51.2 bits), Expect = 8.2e-05, P = 8.2e-05
 Identities = 88/298 (29%), Positives = 112/298 (37%)

Query:   126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYA 185
             A G  G A    E    G+P G + ++   G+P   GPP      G  G  P  +  A A
Sbjct:   646 AAGPPGPAGSAGERGEQGQP-GPSGFQ---GLPGPPGPPGEGGKPGDQGV-PGEAGGAGA 700

Query:   186 AT---QSGTPMRAAYDIPRG-PGYEASKG-PGYDASKAPSYDP--TKGPSYDPA-KG-PG 236
                  + G P       P+G  G     G PG D  K     P  T G    P  +G PG
Sbjct:   701 TGPRGERGFPGERGGAGPQGLQGPRGLPGTPGTDGPKG-GVGPAGTAGAQGPPGLQGMPG 759

Query:   237 YDPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL----GYDMQRGPNYDM-QRG 290
                T G PG    +G N D  +GP       P  D  RGL    G     GPN +  + G
Sbjct:   760 ERGTSGNPGPKGDRGDNGD--KGPE----GAPGKDGSRGLTGPIGPTGPAGPNGEKGESG 813

Query:   291 P----GYE-TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSR 345
             P    G   T+ VPG   + GP   A  A        PG   ++G+G     A +  P  
Sbjct:   814 PAGPSGVAGTRGVPGDRGETGPPGPAGFAGPPGADGQPGVKGEQGEGGQKGDAGAPGPQG 873

Query:   346 GTGFDG--APRGAA-PHG----QVPPPLNNVP--YGSATPPARSGSGQPRG--GNPAR 392
              +G  G   P G + P G    Q PP     P   G   PP  +G+  P G  G P +
Sbjct:   874 PSGAPGPQGPTGVSGPKGARGAQGPPGATGFPGAAGRVGPPGPNGNPGPAGPAGPPGK 931

 Score = 124 (48.7 bits), Expect = 0.00048, P = 0.00048
 Identities = 78/259 (30%), Positives = 90/259 (34%)

Query:   147 GQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAAT-QSGTPMRAAYDIPRGPG 204
             G+   +   G P   GP  +    G  G +GP  +  A      +G P  A    P GP 
Sbjct:   858 GEGGQKGDAGAPGPQGPSGAPGPQGPTGVSGPKGARGAQGPPGATGFPGAAGRVGPPGPN 917

Query:   205 YEASKGPGYDASKAPSYDPTKGPSYD--PAKGPGYDPTKGP-GYDAQKGS-NYDAQRGPN 260
                + GP   A   P  D  KG   D  P   PG    +G  G   +KG    D   GP 
Sbjct:   918 --GNPGPAGPAGP-PGKDGPKGVRGDGGPPGRPGDAGLRGSAGPAGEKGDPGEDGPHGP- 973

Query:   261 YDIHRGPS-YDPQRGL-GYDMQRGPN-YDMQRGPGYET--QRVPGYDVQRGPVYEAQRAP 315
              D   GP     QRG+ G   QRG   +    GP  E   Q  PG    RGP      AP
Sbjct:   974 -DGPAGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPGGPGDRGPPGPVG-AP 1031

Query:   316 SYIPQRG-PGYDLQRGQGYDMRRAPS--YDPSRG----TGFDGAPRGAAPHGQVPPPLNN 368
                   G PG +   G      R  S      RG     G  GAP G    G V P    
Sbjct:  1032 GLTGAAGEPGREGNPGSDGPPGRDGSAGIKGDRGDTGPAGAPGAPGGPGAPGPVGPTGKQ 1091

Query:   369 VPYGSATPPARSGSGQPRG 387
                G A P   SG   P G
Sbjct:  1092 GDRGEAGPHGPSGPPGPAG 1110

 Score = 123 (48.4 bits), Expect = 0.00061, P = 0.00061
 Identities = 79/280 (28%), Positives = 96/280 (34%)

Query:   123 DRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQG-HGPPPSATTAGVVGAGPNTST 181
             D+   G  GGA         G P G+     G G PQG  GP     T G  G       
Sbjct:   688 DQGVPGEAGGAGATGPRGERGFP-GERG---GAG-PQGLQGPRGLPGTPGTDGPKGGVGP 742

Query:   182 SAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKG-PG-YDP 239
             +  A  Q G P      +P   G   + GP  D        P   P  D ++G  G   P
Sbjct:   743 AGTAGAQ-GPP--GLQGMPGERGTSGNPGPKGDRGDNGDKGPEGAPGKDGSRGLTGPIGP 799

Query:   240 TKGPGYDAQKGSNYDAQRGPNYDIHRGPS-YDPQRGLGYDM-QRGPN--YDMQRGPGYET 295
             T   G + +KG +     GP      GPS     RG+  D  + GP         PG + 
Sbjct:   800 TGPAGPNGEKGES-----GP-----AGPSGVAGTRGVPGDRGETGPPGPAGFAGPPGADG 849

Query:   296 QR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPR 354
             Q  V G   + G   +A       P   PG     G         +  P   TGF GA  
Sbjct:   850 QPGVKGEQGEGGQKGDAGAPGPQGPSGAPGPQGPTGVSGPKGARGAQGPPGATGFPGAAG 909

Query:   355 GAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPARR 393
                P G   P  N  P G A PP + G    RG G P  R
Sbjct:   910 RVGPPG---PNGNPGPAGPAGPPGKDGPKGVRGDGGPPGR 946


>FB|FBgn0038642 [details] [associations]
            symbol:Muc91C "Mucin 91C" species:7227 "Drosophila
            melanogaster" [GO:0005201 "extracellular matrix structural
            constituent" evidence=ISM] [GO:0031012 "extracellular matrix"
            evidence=ISM] [GO:0022008 "neurogenesis" evidence=IMP]
            EMBL:AE014297 GO:GO:0022008 eggNOG:NOG12793 GO:GO:0031012
            GO:GO:0005201 GeneTree:ENSGT00700000104744 RefSeq:NP_650744.1
            UniGene:Dm.10760 EnsemblMetazoa:FBtr0083687 GeneID:42246
            KEGG:dme:Dmel_CG7709 UCSC:CG7709-RA CTD:42246 FlyBase:FBgn0038642
            InParanoid:Q9VE45 OMA:GPYPSAP PhylomeDB:Q9VE45 GenomeRNAi:42246
            NextBio:827869 ArrayExpress:Q9VE45 Bgee:Q9VE45 Uniprot:Q9VE45
        Length = 950

 Score = 129 (50.5 bits), Expect = 8.0e-05, P = 8.0e-05
 Identities = 72/281 (25%), Positives = 98/281 (34%)

Query:   124 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVP--QGHGPPPSATTAGVVGAGPNTST 181
             RR   SYG       +++ G P   + Y      P  Q +G P  A  +   G   +  +
Sbjct:   222 RRPSSSYGAPRPAPPSQSYGAPPSAS-YGPPKSAPPSQSYGAP--APPSSKYGPPKSAPS 278

Query:   182 SAYAATQSGTPMRAAYDIPRGPG--YEASKGPG--YDASKAPS--YDPTKGPSYDPAKGP 235
             S+Y A +   P  ++Y  P  P   Y A   P   Y A  APS  Y     PS   + G 
Sbjct:   279 SSYGAPRPAAPS-SSYGAPAPPSSSYGAPAAPSSSYGAPAAPSSSYGAPAAPS--SSYGA 335

Query:   236 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQR--GPGY 293
                P+K  G  A   S+Y A   P+     G    P    G       +Y       P Y
Sbjct:   336 PAPPSKSYGAPAPPSSSYGAPAAPSKSY--GAPAPPSSSYGAPAPPSSSYGAPAPPSPSY 393

Query:   294 ETQRVPGYDVQR-GPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGA 352
                  P        P   +  AP+  P +  G        Y    AP+  PS   G   A
Sbjct:   394 GAPAPPSKSYGAPAPPSSSYGAPA-APSKSYGAPAPPSSSYG---APA-PPSSSYGAPSA 448

Query:   353 PRGA-APHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 392
             P  +  P    P P ++  YG A P A   S  P    P++
Sbjct:   449 PSSSYGPPKPAPAPPSS-SYG-APPQAPVSSYLPPASRPSK 487

 Score = 127 (49.8 bits), Expect = 0.00013, P = 0.00013
 Identities = 67/265 (25%), Positives = 96/265 (36%)

Query:   128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTS--TSAYA 185
             GS GG+  ++ + +   P    +   G   P       SA ++     GP  S  +S+Y+
Sbjct:   589 GSSGGSFQSAPSSSYSAPSA--SANSGGSYPSAPSSSYSAPSSSSSSGGPYASAPSSSYS 646

Query:   186 ATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGY 245
             A  SG+     Y  P  P    S  P   A+   SY      SY  A  PG + + GP Y
Sbjct:   647 APSSGSNSGGPY--PAAPSSSYS-APSASANSGGSYPSAPSSSYS-APSPGSN-SGGP-Y 700

Query:   246 DAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY-----ETQRVPG 300
              A   S+Y A   P+   + G  Y       Y     P+     G  Y      +   P 
Sbjct:   701 PAAPSSSYSA---PSPSANSGGPYASAPSSSYS---APSSSSNSGGPYAAAPSSSYSAPS 754

Query:   301 YDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHG 360
                  G  Y +  + SY     P   L  G  Y    + SY     +   G P  AAP  
Sbjct:   755 SSSSSGGPYPSAPSSSY---SAPSSSLSSGGPYPSAPSSSYAAPSPSSNSGGPYPAAPSN 811

Query:   361 QVPPPLN--NVPYGS-ATPPARSGS 382
                 P+   +  YG+ A+ P+ S S
Sbjct:   812 SYSAPIAPPSSSYGAPASGPSPSFS 836

 Score = 124 (48.7 bits), Expect = 0.00028, P = 0.00028
 Identities = 63/264 (23%), Positives = 91/264 (34%)

Query:   132 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPP--PSATTAGVVGAGPNTSTSAYAATQS 189
             G++G S   +S            YG P     P  P +++ G   +G  +S+ +++A  S
Sbjct:   521 GSSGYSSGPSSSYEAPVAPPSSSYGAPSSSFQPISPPSSSYGAPSSGSGSSSGSFSAAPS 580

Query:   190 GTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYD-PTK-----GP 243
                  A      G  ++++    Y A   PS     G SY  A    Y  P+      GP
Sbjct:   581 SL-YSAPSKGSSGGSFQSAPSSSYSA---PSASANSGGSYPSAPSSSYSAPSSSSSSGGP 636

Query:   244 GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDM-----QRGPNYDMQRGPGYETQRV 298
              Y +   S+Y A   P+   + G  Y       Y         G +Y       Y     
Sbjct:   637 -YASAPSSSYSA---PSSGSNSGGPYPAAPSSSYSAPSASANSGGSYPSAPSSSYSAPS- 691

Query:   299 PGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAP 358
             PG +   GP Y A  + SY     P      G  Y    + SY     +   G P  AAP
Sbjct:   692 PGSN-SGGP-YPAAPSSSY---SAPSPSANSGGPYASAPSSSYSAPSSSSNSGGPYAAAP 746

Query:   359 HGQVPPPLNNVPYGSATPPARSGS 382
                   P ++   G   P A S S
Sbjct:   747 SSSYSAPSSSSSSGGPYPSAPSSS 770


>UNIPROTKB|F1SN69 [details] [associations]
            symbol:F1SN69 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 InterPro:IPR008985 SUPFAM:SSF49899 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 InterPro:IPR001791 GO:GO:0005201
            SMART:SM00210 GeneTree:ENSGT00700000104301 OMA:YSYPDRL
            EMBL:CU618340 EMBL:CU606988 EMBL:CU861519
            Ensembl:ENSSSCT00000006033 Uniprot:F1SN69
        Length = 1869

 Score = 132 (51.5 bits), Expect = 8.2e-05, P = 8.2e-05
 Identities = 74/250 (29%), Positives = 98/250 (39%)

Query:   156 GVPQGHGPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRG-------PGYEA 207
             GVP   GPP +    G  G+ GP  +     A   G P  A YD  +G       PG + 
Sbjct:  1274 GVPGDPGPPGTPGPKGSRGSLGPTGAPGRMGA--QGEPGLAGYDGHKGIMGPLGPPGPKG 1331

Query:   208 SKGP-GYDA-SKAPSYDP-TKGPSYDPAKGPGYDPTKGPGYDAQKG-----SNYDAQRGP 259
              KG  G D  ++ P   P  +GP  D  +G   +P   PGY  Q+G      N   Q  P
Sbjct:  1332 EKGEQGEDGKAEGPPGPPGDRGPVGD--RGDRGEPGD-PGYPGQEGVQGLRGNPGQQGQP 1388

Query:   260 NYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE-TQRVPGYDVQRGPVYEAQRAPSYI 318
              +   RG    P+   G +  +G        PG   TQ +PG    RG V   ++ P  +
Sbjct:  1389 GHPGPRGRP-GPKGSKGEEGPKGKQ-GKAGAPGRRGTQGLPGLPGPRGVV--GRQGPEGV 1444

Query:   319 --PQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAA-PHGQVPPPL---NNVPYG 372
               P   PG D Q GQ  +        P    G  G P  A  P  Q PP     + +P G
Sbjct:  1445 AGPDGLPGLDGQAGQQGEQGDDGDPGPLGPAGKRGNPGVAGLPGAQGPPGFKGESGLP-G 1503

Query:   373 SATPPARSGS 382
                PP + G+
Sbjct:  1504 QLGPPGKRGT 1513


>WB|WBGene00000694 [details] [associations]
            symbol:col-120 species:6239 "Caenorhabditis elegans"
            [GO:0042302 "structural constituent of cuticle" evidence=IEA]
            [GO:0016021 "integral to membrane" evidence=IEA] InterPro:IPR002486
            Pfam:PF01484 SMART:SM01088 GO:GO:0042302 HOGENOM:HOG000085656
            GeneTree:ENSGT00610000086159 EMBL:AL032632 PIR:T26465
            RefSeq:NP_501617.1 ProteinModelPortal:Q9XWR2 DIP:DIP-26936N
            IntAct:Q9XWR2 MINT:MINT-1070946 STRING:Q9XWR2
            EnsemblMetazoa:Y11D7A.11 GeneID:177748 KEGG:cel:CELE_Y11D7A.11
            UCSC:Y11D7A.11 CTD:177748 WormBase:Y11D7A.11 eggNOG:NOG265281
            InParanoid:Q9XWR2 OMA:HWELLED NextBio:898216 Uniprot:Q9XWR2
        Length = 313

 Score = 122 (48.0 bits), Expect = 8.7e-05, P = 8.7e-05
 Identities = 77/268 (28%), Positives = 97/268 (36%)

Query:   136 NSENE-TSGRPVGQNAY--EDGYGV--PQ---GHGPPPSATTAGVVGAGPNTSTSAYAAT 187
             N EN   S + VG      + GYG   P    G  P PS   A    A  ++S+S+ +  
Sbjct:    64 NLENMYESTKAVGSGPVKRQAGYGASSPSRASGSHPAPSPYDA----ASTSSSSSSDSCC 119

Query:   188 QSGTPMRAAYDIPRGPGYEASKGP----GYDASKAPSYDPTKGPSYD---PAKGPGYDPT 240
               G  +      P  PG +   GP    G D         + G   +   PA  PG  P 
Sbjct:   120 SCGIGLAGPAGFPGRPGRDGIDGPAGKPGRDGQDLDGESSSDGSQIELDCPAGPPG--PP 177

Query:   241 KGPGYDAQKGS-NYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGPGYETQRV 298
               PG     G    D   G N    R P    +RG  G D + G   D    PG     +
Sbjct:   178 GNPGPQGNSGRPGMDGMPGRNGRCGR-PGEQGERGPNGEDGRPGRRGD-DGMPG-TVNEI 234

Query:   299 PGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAA 357
             PG   Q GP    + AP     +GP     RG  G    + P+  P    GFDGAP G  
Sbjct:   235 PG---QAGPP-GLRGAPGATGSQGP-----RGNDGRPGNKGPAGPPG-DQGFDGAPGGPG 284

Query:   358 PHGQ--VPPPLNNVPYGSATPPARSGSG 383
               G+     PL      S  PP R+  G
Sbjct:   285 ADGEPGAQGPLGAKGECSHCPPPRTAPG 312


>DICTYBASE|DDB_G0286613 [details] [associations]
            symbol:DDB_G0286613 "14-3-3 family protein"
            species:44689 "Dictyostelium discoideum" [GO:0019904 "protein
            domain specific binding" evidence=IEA] [GO:0006366 "transcription
            from RNA polymerase II promoter" evidence=IEA] [GO:0005665
            "DNA-directed RNA polymerase II, core complex" evidence=IEA]
            [GO:0003950 "NAD+ ADP-ribosyltransferase activity" evidence=IEA]
            [GO:0003677 "DNA binding" evidence=IEA] InterPro:IPR000308
            InterPro:IPR000684 InterPro:IPR002035 InterPro:IPR012317
            Pfam:PF00644 PRINTS:PR00305 PROSITE:PS00115 PROSITE:PS50234
            PROSITE:PS51059 SMART:SM00327 InterPro:IPR001357
            dictyBase:DDB_G0286613 Pfam:PF00533 eggNOG:COG5040
            Gene3D:1.20.190.20 InterPro:IPR023410 Pfam:PF00244 SMART:SM00101
            SUPFAM:SSF48445 GO:GO:0003677 EMBL:AAFI02000089 GO:GO:0006366
            SMART:SM00292 SUPFAM:SSF52113 PROSITE:PS50172 GO:GO:0003950
            InterPro:IPR013694 Pfam:PF08487 PROSITE:PS51468 KO:K10798
            GO:GO:0005665 RefSeq:XP_637567.1 ProteinModelPortal:Q54LJ4
            EnsemblProtists:DDB0232950 GeneID:8625707 KEGG:ddi:DDB_G0286613
            InParanoid:Q54LJ4 OMA:THTKATI Uniprot:Q54LJ4
        Length = 2563

 Score = 133 (51.9 bits), Expect = 9.0e-05, P = 9.0e-05
 Identities = 45/137 (32%), Positives = 62/137 (45%)

Query:   125 RADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAY 184
             R   S GG+ G+S     G  +G  A+  G G P    PPP  +T+  +G G    +  +
Sbjct:  1787 RGGSSRGGSIGSSRGGRGGN-IG-TAF--GRGAPPPPQPPPPPSTS--LGRGAPPPSLFF 1840

Query:   185 AATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPG 244
              A+Q  +P    Y IP  P Y  +  P Y  + +PSY PT  PSY P   P Y  +  P 
Sbjct:  1841 QASQPYSPTSPFY-IPTSPSYSPTS-PSYSPT-SPSYSPTS-PSYSPTS-PSYSTS--PL 1893

Query:   245 YDAQKGSNYDAQRGPNY 261
             Y A    +Y     P+Y
Sbjct:  1894 Y-ASTSQSYSPV-SPSY 1908


>UNIPROTKB|F1NCR0 [details] [associations]
            symbol:COL1A2 "Collagen alpha-2(I) chain" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0001501 "skeletal system
            development" evidence=IEA] [GO:0001568 "blood vessel development"
            evidence=IEA] [GO:0005584 "collagen type I" evidence=IEA]
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0007179
            "transforming growth factor beta receptor signaling pathway"
            evidence=IEA] [GO:0007266 "Rho protein signal transduction"
            evidence=IEA] [GO:0008217 "regulation of blood pressure"
            evidence=IEA] [GO:0030199 "collagen fibril organization"
            evidence=IEA] [GO:0030674 "protein binding, bridging" evidence=IEA]
            [GO:0042802 "identical protein binding" evidence=IEA] [GO:0043589
            "skin morphogenesis" evidence=IEA] [GO:0046332 "SMAD binding"
            evidence=IEA] [GO:0048407 "platelet-derived growth factor binding"
            evidence=IEA] [GO:0070208 "protein heterotrimerization"
            evidence=IEA] [GO:0071230 "cellular response to amino acid
            stimulus" evidence=IEA] InterPro:IPR000885 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS51461 SMART:SM00038 GO:GO:0005615
            GO:GO:0030199 GO:GO:0007179 GO:GO:0007266 GO:GO:0070208
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0071230 GO:GO:0005201
            GeneTree:ENSGT00660000095287 GO:GO:0005584 EMBL:AADN02000724
            IPI:IPI00821202 Ensembl:ENSGALT00000015706 ArrayExpress:F1NCR0
            Uniprot:F1NCR0
        Length = 1318

 Score = 130 (50.8 bits), Expect = 9.1e-05, P = 9.1e-05
 Identities = 81/262 (30%), Positives = 97/262 (37%)

Query:   145 PVGQNAYEDGYGVPQGH----GPPPSATTAGVVGA-GPNTSTSAYAATQSGTP-MRAAYD 198
             PVG+   E G   P G     GP   A  AG  G  GP     A      G P  R    
Sbjct:   781 PVGRTG-EQGIAGPPGFAGEKGPSGEAGAAGPPGTPGPQGILGAPGIL--GLPGSRGERG 837

Query:   199 IPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYDAQR 257
             +P   G     GP    S  P     +GPS  P   PG +   G  G D   G++    R
Sbjct:   838 LPGIAGATGEPGP-LGVSGPPG---ARGPS-GPVGSPGPNGAPGEAGRDGNPGNDGPPGR 892

Query:   258 --GPNYDIHRG-PSYD-PQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQR 313
                P +   RG P    P   LG     GP+   Q GP  +    PG     GPV     
Sbjct:   893 DGAPGFKGERGAPGNPGPSGALGAP---GPHG--QVGPSGK----PGNRGDPGPVGPVGP 943

Query:   314 APSYIPQ--RGP-GYDLQRGQGYDM--RRAPSYDPSRG-TGFDGAPRGAAPHGQVPPPLN 367
             A ++ P+   GP G   ++G+  D   R  P     +G  G  G P  A  HG   PP N
Sbjct:   944 AGAFGPRGLAGPQGPRGEKGEPGDKGHRGLPGL---KGHNGLQGLPGLAGQHGDQGPPGN 1000

Query:   368 NVPYGSATPPARSGSGQPRGGN 389
             N P G   PP  SG     G N
Sbjct:  1001 NGPAGPRGPPGPSGPPGKDGRN 1022


>UNIPROTKB|F1M6Q3 [details] [associations]
            symbol:Col4a2 "Protein Col4a2" species:10116 "Rattus
            norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
            GO:GO:0071560 InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0006351
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0016525 GO:GO:0005201
            GO:GO:0005587 Gene3D:2.170.240.10 GeneTree:ENSGT00690000101772
            IPI:IPI00778948 Ensembl:ENSRNOT00000057461 Uniprot:F1M6Q3
        Length = 1647

 Score = 131 (51.2 bits), Expect = 9.2e-05, P = 9.2e-05
 Identities = 90/302 (29%), Positives = 112/302 (37%)

Query:   119 APNVDRRADGSYGGATGN----SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG 174
             +P VD   D  + G TG+     E  T   PVG    +   G+P   GP  S    G  G
Sbjct:  1145 SPGVDAHGDPGFPGPTGDRGDRGEANTLPGPVGAPGQKGEQGIPGERGPVGSPGLQGFPG 1204

Query:   175 AGPNTSTSAYAATQSGTPM---RAAYDIPRGP-GYEASKGPGYD--ASKAPSYDPTKGPS 228
               P ++ S       G P       Y  P GP G  A  G   D  +S A  +   KG  
Sbjct:  1205 ISPPSNISGLPG-DVGAPGIFGLQGYQGPPGPPGPNALPGIKGDEGSSGAAGFPGEKGWV 1263

Query:   229 YDPAKGPGYDP-TKG-PGYDAQKGSN-YDAQRGPNYDI-HRGPSYDPQRGLGYDMQRGPN 284
              DP  GP   P   G PG    KG   +    GP+  +  RGP   P+   G+    G  
Sbjct:  1264 GDP--GPQGQPGVHGLPGEKGPKGEQGFMGNTGPSGAVGDRGPK-GPKGDQGFPGAPGS- 1319

Query:   285 YDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDP 343
                   PG     +PG   Q+  V      P    +RG PG   + G      + P  DP
Sbjct:  1320 ---MGSPG-----IPGIP-QKIAVQPGTMGPQ--GRRGLPGALGEMGP-----QGPPGDP 1363

Query:   344 SRGTGFDGAPRGAAPHGQ-----VPP------PLNNV-PYGSATPPARSGS-GQPRGGNP 390
                 GF GAP  A P G+     VP       P+ +  P G    P R GS G P  G P
Sbjct:  1364 ----GFRGAPGKAGPQGRGGVSAVPGFRGDQGPMGHQGPIGQEGEPGRPGSPGLP--GMP 1417

Query:   391 AR 392
              R
Sbjct:  1418 GR 1419


>UNIPROTKB|P02467 [details] [associations]
            symbol:COL1A2 "Collagen alpha-2(I) chain" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0005583 "fibrillar collagen" evidence=IDA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 GO:GO:0046872 InterPro:IPR008160 Pfam:PF01391
            eggNOG:NOG12793 GO:GO:0005201 HOVERGEN:HBG004933 EMBL:M25963
            EMBL:M25956 EMBL:M25959 EMBL:M25961 EMBL:M25962 EMBL:M25965
            EMBL:M25964 EMBL:M25984 EMBL:M25957 EMBL:M25966 EMBL:M25967
            EMBL:M25969 EMBL:M25970 EMBL:M25971 EMBL:M25972 EMBL:M25973
            EMBL:M25974 EMBL:M25976 EMBL:M25977 EMBL:M25978 EMBL:M25979
            EMBL:M25980 EMBL:M25981 EMBL:M25982 EMBL:M25983 EMBL:J00826
            EMBL:J00821 EMBL:K00792 EMBL:J00830 EMBL:J00829 EMBL:J00837
            EMBL:J00812 EMBL:J00811 EMBL:J00814 EMBL:J00815 EMBL:X02657
            EMBL:K00794 EMBL:V00390 EMBL:M17608 EMBL:M10581 EMBL:M10540
            EMBL:J00828 EMBL:J00827 EMBL:J00832 EMBL:J00831 EMBL:J00833
            EMBL:J00822 IPI:IPI00914483 PIR:I50173 PIR:I50206 PIR:S10847
            UniGene:Gga.5097 STRING:P02467 PRIDE:P02467 InParanoid:P02467
            PMAP-CutDB:P02467 GO:GO:0005583 Uniprot:P02467
        Length = 1362

 Score = 130 (50.8 bits), Expect = 9.5e-05, P = 9.5e-05
 Identities = 81/262 (30%), Positives = 97/262 (37%)

Query:   145 PVGQNAYEDGYGVPQGH----GPPPSATTAGVVGA-GPNTSTSAYAATQSGTP-MRAAYD 198
             PVG+   E G   P G     GP   A  AG  G  GP     A      G P  R    
Sbjct:   825 PVGRTG-EQGIAGPPGFAGEKGPSGEAGAAGPPGTPGPQGILGAPGIL--GLPGSRGERG 881

Query:   199 IPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYDAQR 257
             +P   G     GP    S  P     +GPS  P   PG +   G  G D   G++    R
Sbjct:   882 LPGIAGATGEPGP-LGVSGPPG---ARGPS-GPVGSPGPNGAPGEAGRDGNPGNDGPPGR 936

Query:   258 --GPNYDIHRG-PSYD-PQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQR 313
                P +   RG P    P   LG     GP+   Q GP  +    PG     GPV     
Sbjct:   937 DGAPGFKGERGAPGNPGPSGALGAP---GPHG--QVGPSGK----PGNRGDPGPVGPVGP 987

Query:   314 APSYIPQ--RGP-GYDLQRGQGYDM--RRAPSYDPSRG-TGFDGAPRGAAPHGQVPPPLN 367
             A ++ P+   GP G   ++G+  D   R  P     +G  G  G P  A  HG   PP N
Sbjct:   988 AGAFGPRGLAGPQGPRGEKGEPGDKGHRGLPGL---KGHNGLQGLPGLAGQHGDQGPPGN 1044

Query:   368 NVPYGSATPPARSGSGQPRGGN 389
             N P G   PP  SG     G N
Sbjct:  1045 NGPAGPRGPPGPSGPPGKDGRN 1066


>UNIPROTKB|F1P0H9 [details] [associations]
            symbol:COL1A2 "Collagen alpha-2(I) chain" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0001501 "skeletal system
            development" evidence=IEA] [GO:0001568 "blood vessel development"
            evidence=IEA] [GO:0005584 "collagen type I" evidence=IEA]
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0007179
            "transforming growth factor beta receptor signaling pathway"
            evidence=IEA] [GO:0007266 "Rho protein signal transduction"
            evidence=IEA] [GO:0008217 "regulation of blood pressure"
            evidence=IEA] [GO:0030199 "collagen fibril organization"
            evidence=IEA] [GO:0030674 "protein binding, bridging" evidence=IEA]
            [GO:0042802 "identical protein binding" evidence=IEA] [GO:0043589
            "skin morphogenesis" evidence=IEA] [GO:0046332 "SMAD binding"
            evidence=IEA] [GO:0048407 "platelet-derived growth factor binding"
            evidence=IEA] [GO:0070208 "protein heterotrimerization"
            evidence=IEA] [GO:0071230 "cellular response to amino acid
            stimulus" evidence=IEA] InterPro:IPR000885 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS51461 SMART:SM00038 GO:GO:0005615
            GO:GO:0030199 GO:GO:0007179 GO:GO:0007266 GO:GO:0070208
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0071230 GO:GO:0005201
            GeneTree:ENSGT00660000095287 KO:K06236 GO:GO:0005584 CTD:1278
            IPI:IPI00914483 UniGene:Gga.5097 EMBL:AADN02000724
            RefSeq:NP_001073182.2 PRIDE:F1P0H9 Ensembl:ENSGALT00000015703
            GeneID:396243 KEGG:gga:396243 OMA:IGMPGAR NextBio:20816295
            ArrayExpress:F1P0H9 Uniprot:F1P0H9
        Length = 1363

 Score = 130 (50.8 bits), Expect = 9.5e-05, P = 9.5e-05
 Identities = 81/262 (30%), Positives = 97/262 (37%)

Query:   145 PVGQNAYEDGYGVPQGH----GPPPSATTAGVVGA-GPNTSTSAYAATQSGTP-MRAAYD 198
             PVG+   E G   P G     GP   A  AG  G  GP     A      G P  R    
Sbjct:   826 PVGRTG-EQGIAGPPGFAGEKGPSGEAGAAGPPGTPGPQGILGAPGIL--GLPGSRGERG 882

Query:   199 IPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYDAQR 257
             +P   G     GP    S  P     +GPS  P   PG +   G  G D   G++    R
Sbjct:   883 LPGIAGATGEPGP-LGVSGPPG---ARGPS-GPVGSPGPNGAPGEAGRDGNPGNDGPPGR 937

Query:   258 --GPNYDIHRG-PSYD-PQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQR 313
                P +   RG P    P   LG     GP+   Q GP  +    PG     GPV     
Sbjct:   938 DGAPGFKGERGAPGNPGPSGALGAP---GPHG--QVGPSGK----PGNRGDPGPVGPVGP 988

Query:   314 APSYIPQ--RGP-GYDLQRGQGYDM--RRAPSYDPSRG-TGFDGAPRGAAPHGQVPPPLN 367
             A ++ P+   GP G   ++G+  D   R  P     +G  G  G P  A  HG   PP N
Sbjct:   989 AGAFGPRGLAGPQGPRGEKGEPGDKGHRGLPGL---KGHNGLQGLPGLAGQHGDQGPPGN 1045

Query:   368 NVPYGSATPPARSGSGQPRGGN 389
             N P G   PP  SG     G N
Sbjct:  1046 NGPAGPRGPPGPSGPPGKDGRN 1067


>UNIPROTKB|F1SNP1 [details] [associations]
            symbol:COL4A4 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0032836 "glomerular basement membrane development"
            evidence=IEA] [GO:0005605 "basal lamina" evidence=IEA] [GO:0005587
            "collagen type IV" evidence=IEA] [GO:0005201 "extracellular matrix
            structural constituent" evidence=IEA] InterPro:IPR001442
            Pfam:PF01413 PROSITE:PS51403 SMART:SM00111 InterPro:IPR016187
            SUPFAM:SSF56436 GO:GO:0005605 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0005201 GO:GO:0005587 Gene3D:2.170.240.10
            GeneTree:ENSGT00690000101772 GO:GO:0032836 OMA:FRGDMGD
            EMBL:CU466451 EMBL:FP690341 Ensembl:ENSSSCT00000017688
            Uniprot:F1SNP1
        Length = 1711

 Score = 131 (51.2 bits), Expect = 9.6e-05, P = 9.6e-05
 Identities = 76/260 (29%), Positives = 89/260 (34%)

Query:   143 GRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG 202
             G P G    E   G+P   GPP      G  G               G P         G
Sbjct:  1207 GVP-GPRGPEGSMGLPGQRGPP-GPECKGEPGPDGRRGEDGLPGPP-GPPGHKGDMGEAG 1263

Query:   203 -PGYEASKG-PGYDASKAPSYDPTKGPSYDPAKG--PGYDPTKGPGYDAQKGSNYDAQRG 258
              PG    KG PG   +  PS    +G + DP  G   G  P   PG     G N   QRG
Sbjct:  1264 CPGAPGPKGFPGRRGTPGPSLIGFRGDTGDPGFGGEKGSSPIGPPGSPGSPGMN--GQRG 1321

Query:   259 PNYDIHRG-PSYDPQRGL-GYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRA-- 314
             P  D   G P    +RGL G    +G   D  R        +PG+   +GP     RA  
Sbjct:  1322 PPGDPALGYPGPPGKRGLFGSPGSKGLRGDPGRPGATGPAGMPGFPGLKGPKGREGRAGF 1381

Query:   315 PSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSA 374
             P  +P   PG+  + G     R  P   P    G  GAP      G + PP      G  
Sbjct:  1382 PG-VPGP-PGHSCESGA--PGRPGPPGLPG-APGSPGAPGWKGQRGDMGPPGPAGMKGVP 1436

Query:   375 TPPARSGSGQPRG--GNPAR 392
               P R G   P G  G P R
Sbjct:  1437 GVPGRPGPDGPPGPPGVPGR 1456


>TAIR|locus:2079502 [details] [associations]
            symbol:RS31 "arginine/serine-rich splicing factor 31"
            species:3702 "Arabidopsis thaliana" [GO:0000166 "nucleotide
            binding" evidence=IEA] [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0003723 "RNA binding" evidence=ISS] [GO:0005737
            "cytoplasm" evidence=ISM] [GO:0016607 "nuclear speck" evidence=IDA]
            [GO:0008380 "RNA splicing" evidence=NAS] [GO:0000398 "mRNA
            splicing, via spliceosome" evidence=IDA;RCA] [GO:0006355
            "regulation of transcription, DNA-dependent" evidence=RCA]
            [GO:0030422 "production of siRNA involved in RNA interference"
            evidence=RCA] [GO:0035196 "production of miRNAs involved in gene
            silencing by miRNA" evidence=RCA] [GO:0043687 "post-translational
            protein modification" evidence=RCA] [GO:0045893 "positive
            regulation of transcription, DNA-dependent" evidence=RCA]
            [GO:0005681 "spliceosomal complex" evidence=TAS] InterPro:IPR000504
            InterPro:IPR012677 Pfam:PF00076 PROSITE:PS50102 SMART:SM00360
            EMBL:CP002686 GenomeReviews:BA000014_GR GO:GO:0000166 GO:GO:0016607
            Gene3D:3.30.70.330 GO:GO:0005681 GO:GO:0003723 GO:GO:0000398
            EMBL:AL138642 HOGENOM:HOG000276234 KO:K12893 EMBL:X99435
            EMBL:AF439831 EMBL:AY125565 IPI:IPI00530595 PIR:T47978 PIR:T51304
            RefSeq:NP_567120.1 UniGene:At.24231 ProteinModelPortal:P92964
            SMR:P92964 IntAct:P92964 STRING:P92964 PaxDb:P92964 PRIDE:P92964
            EnsemblPlants:AT3G61860.1 GeneID:825359 KEGG:ath:AT3G61860
            TAIR:At3g61860 eggNOG:NOG277933 InParanoid:P92964 OMA:FEYETRQ
            PhylomeDB:P92964 ProtClustDB:CLSN2917489 Genevestigator:P92964
            GermOnline:AT3G61860 Uniprot:P92964
        Length = 264

 Score = 120 (47.3 bits), Expect = 9.9e-05, P = 9.9e-05
 Identities = 30/88 (34%), Positives = 41/88 (46%)

Query:   191 TPMRAAYDIPR---GPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG-YDPTKGPGYD 246
             +P R+   + R    P Y     PG     +P Y   + P YD  KGP  Y+  + P Y 
Sbjct:   177 SPRRSLSPVYRRRPSPDYGRRPSPGQGRRPSPDYGRARSPEYDRYKGPAAYERRRSPDY- 235

Query:   247 AQKGSNYDAQRGPNYDIHRGPSYDPQRG 274
              ++ S+Y  QR P YD +R  S  P RG
Sbjct:   236 GRRSSDYGRQRSPGYDRYRSRSPVP-RG 262


>UNIPROTKB|F1MSR8 [details] [associations]
            symbol:COL2A1 "Collagen alpha-1(II) chain" species:9913
            "Bos taurus" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0005201 GeneTree:ENSGT00660000095287 KO:K06236
            UniGene:Bt.21390 GeneID:407142 KEGG:bta:407142 CTD:1280
            NextBio:20818406 EMBL:DAAA02012985 EMBL:DAAA02012986
            IPI:IPI00786510 RefSeq:NP_001106695.1 PRIDE:F1MSR8
            Ensembl:ENSBTAT00000017509 Uniprot:F1MSR8
        Length = 1418

 Score = 130 (50.8 bits), Expect = 9.9e-05, P = 9.9e-05
 Identities = 89/295 (30%), Positives = 112/295 (37%)

Query:   120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATT--AGVVGAGP 177
             P  DR  D    GA G    +  G P G        G P   GPP       A + G   
Sbjct:    64 PRGDR-GDKGEKGAPGPRGRD--GEP-GTPGNPGPPGPPGPPGPPGLGGNFAAQMAGGFD 119

Query:   178 NTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAKG 234
               +  A      G PM      PRGP G   + GP G+  +     +P   GP   P +G
Sbjct:   120 EKAGGAQMGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GP-RG 174

Query:   235 PGYDPTKGPGYDAQKGS-NYDAQRGP-NYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQRG 290
             P   P K PG D + G      +RGP      RG    P  GL G    RG P  D  +G
Sbjct:   175 PPGPPGK-PGDDGEAGKPGKSGERGPPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAKG 231

Query:   291 ----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS----- 340
                 PG + +   PG +   GP+   +  P    + GP       +G D +  P+     
Sbjct:   232 EAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPGP 289

Query:   341 YDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATP---PARSGSGQPRG--GNP 390
               P+ G GF GAP GA   G+  P     P G+  P   P   GS  P G  GNP
Sbjct:   290 VGPAGGPGFPGAP-GAK--GEAGPTGARGPEGAQGPRGEPGTPGSPGPAGAAGNP 341

 Score = 128 (50.1 bits), Expect = 0.00016, P = 0.00016
 Identities = 88/282 (31%), Positives = 102/282 (36%)

Query:   128 GSYGGATGNSENETSGRPVGQNAYEDGYGVP--QGH-GPPPSATTAGVVGAG--PNTSTS 182
             G  G A  N E    G P G        G P  +G  GPP  A  AG  GA   P     
Sbjct:   723 GPPGPAGANGEKGEVGPP-GPAGTAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGE 781

Query:   183 AYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAP--SYDPTKGPSYDPAKG---- 234
                A Q G    A    P+GP G    +GP G    K    +  P     +  A G    
Sbjct:   782 QGEAGQKGD---AGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGP 838

Query:   235 PGYDPTKGP-GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY 293
             PG +   GP G     G   D  +G      RG S  P R     +Q GP      GP  
Sbjct:   839 PGSNGNPGPPGPPGPSGK--DGPKGA-----RGDSGPPGRAGDPGLQ-GP-----AGPPG 885

Query:   294 ETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYDPSRGTGFD 350
             E    PG D   GP  +    P  +  QRG  G   QRG+ G+     PS +P +  G  
Sbjct:   886 EKGE-PGDDGPSGP--DGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGK-QGAP 941

Query:   351 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 392
             GA     P G V PP    P G    P R GS     G P R
Sbjct:   942 GASGDRGPPGPVGPPGLTGPAGE---PGREGS-PGADGPPGR 979


>MGI|MGI:88467 [details] [associations]
            symbol:Col1a1 "collagen, type I, alpha 1" species:10090 "Mus
            musculus" [GO:0001501 "skeletal system development"
            evidence=ISO;IMP] [GO:0001568 "blood vessel development"
            evidence=ISO;IMP] [GO:0001957 "intramembranous ossification"
            evidence=IGI] [GO:0001958 "endochondral ossification" evidence=IMP]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IDA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005576 "extracellular region" evidence=IEA] [GO:0005578
            "proteinaceous extracellular matrix" evidence=IDA] [GO:0005581
            "collagen" evidence=IMP;IDA] [GO:0005584 "collagen type I"
            evidence=ISO;IMP;IDA] [GO:0005615 "extracellular space"
            evidence=ISO] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0007601
            "visual perception" evidence=ISO] [GO:0007605 "sensory perception
            of sound" evidence=ISO] [GO:0010718 "positive regulation of
            epithelial to mesenchymal transition" evidence=ISO] [GO:0010812
            "negative regulation of cell-substrate adhesion" evidence=IDA]
            [GO:0015031 "protein transport" evidence=IMP] [GO:0030199 "collagen
            fibril organization" evidence=ISO] [GO:0030335 "positive regulation
            of cell migration" evidence=ISO] [GO:0031012 "extracellular matrix"
            evidence=IDA] [GO:0032964 "collagen biosynthetic process"
            evidence=ISO] [GO:0034504 "protein localization to nucleus"
            evidence=ISO] [GO:0034505 "tooth mineralization" evidence=ISO]
            [GO:0042060 "wound healing" evidence=ISO] [GO:0042802 "identical
            protein binding" evidence=ISO] [GO:0043588 "skin development"
            evidence=IMP] [GO:0043589 "skin morphogenesis" evidence=ISO]
            [GO:0045893 "positive regulation of transcription, DNA-dependent"
            evidence=ISO] [GO:0046872 "metal ion binding" evidence=IEA]
            [GO:0048407 "platelet-derived growth factor binding" evidence=ISO]
            [GO:0048705 "skeletal system morphogenesis" evidence=IGI]
            [GO:0048706 "embryonic skeletal system development" evidence=ISO]
            [GO:0060325 "face morphogenesis" evidence=IGI] [GO:0060346 "bone
            trabecula formation" evidence=IGI] [GO:0060351 "cartilage
            development involved in endochondral bone morphogenesis"
            evidence=IMP] [GO:0070208 "protein heterotrimerization"
            evidence=IDA] [GO:0071230 "cellular response to amino acid
            stimulus" evidence=IDA] [GO:0090263 "positive regulation of
            canonical Wnt receptor signaling pathway" evidence=ISO]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 MGI:MGI:88467 GO:GO:0005737
            GO:GO:0045893 GO:GO:0043588 GO:GO:0005615 GO:GO:0071363
            GO:GO:0046872 GO:GO:0015031 GO:GO:0007601 GO:GO:0071300
            GO:GO:0043434 GO:GO:0030199 GO:GO:0007584 GO:GO:0010035
            GO:GO:0007605 GO:GO:0010718 GO:GO:0030335 GO:GO:0042542
            GO:GO:0070208 InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793
            GO:GO:0071260 GO:GO:0001568 GO:GO:0001649 GO:GO:0051591
            GO:GO:0034505 GO:GO:0090263 GO:GO:0010812 GO:GO:0060325
            GO:GO:0032964 GO:GO:0071230 GO:GO:0048706 GO:GO:0001957
            GO:GO:0034504 GO:GO:0001958 GO:GO:0060351 GO:GO:0005201
            GO:GO:0043589 CTD:1277 GeneTree:ENSGT00660000095287
            HOVERGEN:HBG004933 KO:K06236 OMA:VAYMDQQ OrthoDB:EOG4S4PHP
            GO:GO:0005584 GO:GO:0060346 ChiTaRS:COL1A1 GO:GO:0031960
            EMBL:U08020 EMBL:AL662790 EMBL:AL606480 EMBL:BC050014 EMBL:BC059281
            EMBL:K01688 EMBL:S67530 EMBL:S67482 EMBL:X54876 EMBL:M14423
            EMBL:M17491 EMBL:K03036 EMBL:K03029 EMBL:K03030 EMBL:K03031
            EMBL:K03032 EMBL:K03033 EMBL:K03034 EMBL:K03035 EMBL:X06753
            EMBL:X15896 EMBL:X57981 IPI:IPI00329872 IPI:IPI00623191 PIR:I49558
            PIR:S57243 RefSeq:NP_031768.2 UniGene:Mm.277735 UniGene:Mm.458212
            ProteinModelPortal:P11087 SMR:P11087 IntAct:P11087 STRING:P11087
            PhosphoSite:P11087 PaxDb:P11087 PRIDE:P11087
            Ensembl:ENSMUST00000001547 GeneID:12842 KEGG:mmu:12842
            UCSC:uc007kzn.1 InParanoid:P11087 NextBio:282376 PMAP-CutDB:P11087
            Bgee:P11087 CleanEx:MM_COL1A1 Genevestigator:P11087
            GermOnline:ENSMUSG00000001506 Uniprot:P11087
        Length = 1453

 Score = 130 (50.8 bits), Expect = 0.00010, P = 0.00010
 Identities = 79/254 (31%), Positives = 95/254 (37%)

Query:   145 PVGQNAYEDGYGVPQGHGPPPSAT----TAGVVGAGPNTSTSAYAATQSGTPMRAAYDIP 200
             P+G N    G   P+G   PP AT     AG VG  P  S +A      G   +     P
Sbjct:   841 PIG-NVGAPGPKGPRGAAGPPGATGFPGAAGRVGP-PGPSGNAGPPGPPGPVGKEGGKGP 898

Query:   201 RGPGYEASKGPGYDASKAPSYDPTKGPSYDP-AKGPGYDP-TKGP-GYDAQKGS-NYDAQ 256
             RG    A + PG      P   P  G    P A GP   P T GP G   Q+G      Q
Sbjct:   899 RGETGPAGR-PGEVGPPGPP-GPA-GEKGSPGADGPAGSPGTPGPQGIAGQRGVVGLPGQ 955

Query:   257 RGPN-YDIHRGPSYDP-QRG-LGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQR 313
             RG   +    GPS +P ++G  G   +RGP   M  GP       PG     GP  E+ R
Sbjct:   956 RGERGFPGLPGPSGEPGKQGPSGSSGERGPPGPM--GP-------PGL---AGPPGESGR 1003

Query:   314 APSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGS 373
               S   +  PG D   G   D        P    G  GAP    P G+        P G 
Sbjct:  1004 EGSPGAEGSPGRDGAPGAKGDRGETGPAGPPGAPGAPGAPGPVGPAGKNGDRGETGPAGP 1063

Query:   374 ATPPARSGSGQPRG 387
             A P   +G+  P G
Sbjct:  1064 AGPIGPAGARGPAG 1077


>UNIPROTKB|P04280 [details] [associations]
            symbol:PRB1 "Basic salivary proline-rich protein 1"
            species:9606 "Homo sapiens" [GO:0008150 "biological_process"
            evidence=ND] [GO:0003674 "molecular_function" evidence=ND]
            [GO:0005576 "extracellular region" evidence=NAS] GO:GO:0005576
            PIR:B40750 InterPro:IPR026086 PANTHER:PTHR23203 EMBL:K03204
            EMBL:K03205 EMBL:K03206 EMBL:S52986 EMBL:M97220 EMBL:K02575
            EMBL:K02576 EMBL:X07516 EMBL:X07517 EMBL:S62928 EMBL:S62941
            IPI:IPI00023038 PIR:C38355 PIR:D40750 RefSeq:NP_005030.2
            RefSeq:NP_955385.1 RefSeq:NP_955386.1 UniGene:Hs.631726
            ProteinModelPortal:P04280 STRING:P04280 PhosphoSite:P04280
            DMDM:52001469 PRIDE:P04280 GeneID:5542 KEGG:hsa:5542 CTD:5542
            GeneCards:GC12M011504 HGNC:HGNC:9337 MIM:180989 neXtProt:NX_P04280
            PharmGKB:PA33699 KO:K13911 GenomeRNAi:5542 NextBio:21470
            ArrayExpress:P04280 CleanEx:HS_PRB1 Genevestigator:P04280
            Uniprot:P04280
        Length = 392

 Score = 123 (48.4 bits), Expect = 0.00010, P = 0.00010
 Identities = 76/279 (27%), Positives = 94/279 (33%)

Query:   131 GGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNT-STSAYAATQS 189
             GG          G+P G      G   PQG  PPP     G    G  + S  +      
Sbjct:    43 GGNKPQGPPPPPGKPQGPPP--QGGNKPQG--PPPPGKPQGPPPQGDKSRSPRSPPGKPQ 98

Query:   190 GTPMRAAYDIPRGPGYEASK--GPGYDASKAPSYDPTKG------PSYDPAKGPGYDPTK 241
             G P +     P+GP     K  GP       P   P  G      P  D ++ P   P K
Sbjct:    99 GPPPQGGNQ-PQGPPPPPGKPQGPPPQGGNKPQGPPPPGKPQGPPPQGDKSQSPRSPPGK 157

Query:   242 GPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP---GYETQ-- 296
               G   Q G N      P     +GP   P +G G   Q  P     +GP   G ++Q  
Sbjct:   158 PQGPPPQ-GGNQPQGPPPPPGKPQGP---PPQG-GNKPQGPPPPGKPQGPPPQGDKSQSP 212

Query:   297 RVP-----GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDG 351
             R P     G   Q G   +    P   PQ  P     R QG      P   P +G     
Sbjct:   213 RSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPQQGGNRPQGPPPPGKPQGPPPQGDK-SR 271

Query:   352 APRGAAPHGQVPPPLN-NVPYGSATPPARSGSGQPRGGN 389
             +P+      Q PPP   N P G   PP +     P+GGN
Sbjct:   272 SPQSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPPQGGN 310


>UNIPROTKB|P02459 [details] [associations]
            symbol:COL2A1 "Collagen alpha-1(II) chain" species:9913
            "Bos taurus" [GO:0071773 "cellular response to BMP stimulus"
            evidence=IEA] [GO:0071599 "otic vesicle development" evidence=IEA]
            [GO:0060351 "cartilage development involved in endochondral bone
            morphogenesis" evidence=IEA] [GO:0060272 "embryonic skeletal joint
            morphogenesis" evidence=IEA] [GO:0060174 "limb bud formation"
            evidence=IEA] [GO:0060021 "palate development" evidence=IEA]
            [GO:0048407 "platelet-derived growth factor binding" evidence=IEA]
            [GO:0043066 "negative regulation of apoptotic process"
            evidence=IEA] [GO:0042802 "identical protein binding" evidence=IEA]
            [GO:0042472 "inner ear morphogenesis" evidence=IEA] [GO:0030903
            "notochord development" evidence=IEA] [GO:0030199 "collagen fibril
            organization" evidence=IEA] [GO:0010468 "regulation of gene
            expression" evidence=IEA] [GO:0007605 "sensory perception of sound"
            evidence=IEA] [GO:0007601 "visual perception" evidence=IEA]
            [GO:0007417 "central nervous system development" evidence=IEA]
            [GO:0006029 "proteoglycan metabolic process" evidence=IEA]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005615 "extracellular
            space" evidence=IEA] [GO:0005604 "basement membrane" evidence=IEA]
            [GO:0005585 "collagen type II" evidence=IEA] [GO:0003007 "heart
            morphogenesis" evidence=IEA] [GO:0002062 "chondrocyte
            differentiation" evidence=IEA] [GO:0001958 "endochondral
            ossification" evidence=IEA] [GO:0001894 "tissue homeostasis"
            evidence=IEA] [GO:0001502 "cartilage condensation" evidence=IEA]
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0005737 GO:GO:0043066
            GO:GO:0005615 GO:GO:0046872 GO:GO:0003007 GO:GO:0007601
            GO:GO:0030199 GO:GO:0007417 GO:GO:0042472 GO:GO:0001894
            GO:GO:0007605 GO:GO:0071773 InterPro:IPR008160 Pfam:PF01391
            eggNOG:NOG12793 GO:GO:0071599 GO:GO:0005604 GO:GO:0001502
            GO:GO:0060021 GO:GO:0002062 GO:GO:0010468 GO:GO:0060272
            GO:GO:0006029 GO:GO:0001958 GO:GO:0060351 GO:GO:0005201
            GeneTree:ENSGT00660000095287 HOGENOM:HOG000085654
            HOVERGEN:HBG004933 KO:K06236 EMBL:AAFC03017082 EMBL:AAFC03017085
            EMBL:AAFC03056593 EMBL:L28918 EMBL:AF138883 EMBL:AF138957
            EMBL:X02420 IPI:IPI01028216 PIR:A90369 PIR:I45876
            RefSeq:NP_001001135.2 UniGene:Bt.21390 IntAct:P02459 STRING:P02459
            PRIDE:P02459 Ensembl:ENSBTAT00000017505 GeneID:407142
            KEGG:bta:407142 CTD:1280 InParanoid:Q9XT25 OMA:SSCRICV
            Reactome:REACT_133391 NextBio:20818406 PMAP-CutDB:P02459
            ArrayExpress:P02459 GO:GO:0005585 GO:GO:0060174 GO:GO:0030903
            Uniprot:P02459
        Length = 1487

 Score = 130 (50.8 bits), Expect = 0.00010, P = 0.00010
 Identities = 89/295 (30%), Positives = 112/295 (37%)

Query:   120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATT--AGVVGAGP 177
             P  DR  D    GA G    +  G P G        G P   GPP       A + G   
Sbjct:   133 PRGDR-GDKGEKGAPGPRGRD--GEP-GTPGNPGPPGPPGPPGPPGLGGNFAAQMAGGFD 188

Query:   178 NTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAKG 234
               +  A      G PM      PRGP G   + GP G+  +     +P   GP   P +G
Sbjct:   189 EKAGGAQMGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GP-RG 243

Query:   235 PGYDPTKGPGYDAQKGS-NYDAQRGP-NYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQRG 290
             P   P K PG D + G      +RGP      RG    P  GL G    RG P  D  +G
Sbjct:   244 PPGPPGK-PGDDGEAGKPGKSGERGPPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAKG 300

Query:   291 ----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS----- 340
                 PG + +   PG +   GP+   +  P    + GP       +G D +  P+     
Sbjct:   301 EAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPGP 358

Query:   341 YDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATP---PARSGSGQPRG--GNP 390
               P+ G GF GAP GA   G+  P     P G+  P   P   GS  P G  GNP
Sbjct:   359 VGPAGGPGFPGAP-GAK--GEAGPTGARGPEGAQGPRGEPGTPGSPGPAGAAGNP 410

 Score = 128 (50.1 bits), Expect = 0.00017, P = 0.00017
 Identities = 88/282 (31%), Positives = 102/282 (36%)

Query:   128 GSYGGATGNSENETSGRPVGQNAYEDGYGVP--QGH-GPPPSATTAGVVGAG--PNTSTS 182
             G  G A  N E    G P G        G P  +G  GPP  A  AG  GA   P     
Sbjct:   792 GPPGPAGANGEKGEVGPP-GPAGTAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGE 850

Query:   183 AYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAP--SYDPTKGPSYDPAKG---- 234
                A Q G    A    P+GP G    +GP G    K    +  P     +  A G    
Sbjct:   851 QGEAGQKGD---AGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGP 907

Query:   235 PGYDPTKGP-GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY 293
             PG +   GP G     G   D  +G      RG S  P R     +Q GP      GP  
Sbjct:   908 PGSNGNPGPPGPPGPSGK--DGPKGA-----RGDSGPPGRAGDPGLQ-GP-----AGPPG 954

Query:   294 ETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYDPSRGTGFD 350
             E    PG D   GP  +    P  +  QRG  G   QRG+ G+     PS +P +  G  
Sbjct:   955 EKGE-PGDDGPSGP--DGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGK-QGAP 1010

Query:   351 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 392
             GA     P G V PP    P G    P R GS     G P R
Sbjct:  1011 GASGDRGPPGPVGPPGLTGPAGE---PGREGS-PGADGPPGR 1048


>UNIPROTKB|P02458 [details] [associations]
            symbol:COL2A1 "Collagen alpha-1(II) chain" species:9606
            "Homo sapiens" [GO:0046872 "metal ion binding" evidence=IEA]
            [GO:0001502 "cartilage condensation" evidence=IEA] [GO:0001894
            "tissue homeostasis" evidence=IEA] [GO:0001958 "endochondral
            ossification" evidence=IEA] [GO:0002062 "chondrocyte
            differentiation" evidence=IEA] [GO:0003007 "heart morphogenesis"
            evidence=IEA] [GO:0005604 "basement membrane" evidence=IEA]
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0006029
            "proteoglycan metabolic process" evidence=IEA] [GO:0007417 "central
            nervous system development" evidence=IEA] [GO:0010468 "regulation
            of gene expression" evidence=IEA] [GO:0030903 "notochord
            development" evidence=IEA] [GO:0042472 "inner ear morphogenesis"
            evidence=IEA] [GO:0043066 "negative regulation of apoptotic
            process" evidence=IEA] [GO:0060021 "palate development"
            evidence=IEA] [GO:0060174 "limb bud formation" evidence=IEA]
            [GO:0060351 "cartilage development involved in endochondral bone
            morphogenesis" evidence=IEA] [GO:0071599 "otic vesicle development"
            evidence=IEA] [GO:0071773 "cellular response to BMP stimulus"
            evidence=IEA] [GO:0007601 "visual perception" evidence=IMP]
            [GO:0048407 "platelet-derived growth factor binding" evidence=IDA]
            [GO:0005576 "extracellular region" evidence=TAS] [GO:0005788
            "endoplasmic reticulum lumen" evidence=TAS] [GO:0007411 "axon
            guidance" evidence=TAS] [GO:0030198 "extracellular matrix
            organization" evidence=TAS] [GO:0042802 "identical protein binding"
            evidence=NAS] [GO:0001501 "skeletal system development"
            evidence=IMP] [GO:0007605 "sensory perception of sound"
            evidence=IMP] [GO:0060272 "embryonic skeletal joint morphogenesis"
            evidence=IMP] [GO:0051216 "cartilage development" evidence=TAS]
            [GO:0030199 "collagen fibril organization" evidence=IMP]
            [GO:0005585 "collagen type II" evidence=IDA] [GO:0030020
            "extracellular matrix structural constituent conferring tensile
            strength" evidence=IC] InterPro:IPR000885 InterPro:IPR001007
            Pfam:PF00093 Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208
            PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038 SMART:SM00214
            Reactome:REACT_118779 Reactome:REACT_111045 Reactome:REACT_111102
            GO:GO:0007411 GO:GO:0043066 GO:GO:0005615 PDB:2FSE PDBsum:2FSE
            PDB:2SEB PDBsum:2SEB GO:GO:0046872 GO:GO:0003007 GO:GO:0007601
            GO:GO:0030199 GO:GO:0007417 GO:GO:0005788 GO:GO:0042472
            GO:GO:0001894 GO:GO:0042802 GO:GO:0007605 GO:GO:0071773
            GO:GO:0051216 InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793
            GO:GO:0071599 GO:GO:0005604 GO:GO:0001502 GO:GO:0060021
            GO:GO:0002062 GO:GO:0010468 GO:GO:0060272 GO:GO:0006029
            GO:GO:0001958 GO:GO:0060351 HOVERGEN:HBG004933 KO:K06236
            DrugBank:DB00048 GO:GO:0048407 CTD:1280 OMA:SSCRICV GO:GO:0005585
            GO:GO:0060174 GO:GO:0030903 OrthoDB:EOG4FTW1C EMBL:X16468
            EMBL:L10347 EMBL:BT007205 EMBL:AC004801 EMBL:BC007252 EMBL:BC116449
            EMBL:X16711 EMBL:M25730 EMBL:M32168 EMBL:M25655 EMBL:M25656
            EMBL:M64345 EMBL:M60299 EMBL:M25698 EMBL:X58709 EMBL:X57010
            EMBL:U15195 EMBL:X13783 EMBL:M25728 EMBL:X02371 EMBL:X02372
            EMBL:X02373 EMBL:X02374 EMBL:X02375 EMBL:X02376 EMBL:X02377
            EMBL:X02378 EMBL:X16158 EMBL:J00116 EMBL:L00977 EMBL:M63281
            EMBL:M27468 EMBL:X06268 EMBL:X00339 EMBL:M12048 IPI:IPI00186460
            IPI:IPI00748487 IPI:IPI00936892 PIR:A38513 RefSeq:NP_001835.3
            RefSeq:NP_149162.2 UniGene:Hs.408182 PDB:1U5M PDBsum:1U5M
            ProteinModelPortal:P02458 SMR:P02458 IntAct:P02458
            MINT:MINT-6796075 STRING:P02458 PhosphoSite:P02458 DMDM:124056489
            PaxDb:P02458 PRIDE:P02458 DNASU:1280 Ensembl:ENST00000337299
            Ensembl:ENST00000380518 GeneID:1280 KEGG:hsa:1280 UCSC:uc001rqt.3
            UCSC:uc001rqu.3 UCSC:uc001rqv.3 GeneCards:GC12M048266
            HGNC:HGNC:2200 HPA:CAB002214 MIM:108300 MIM:120140 MIM:132450
            MIM:150600 MIM:151210 MIM:156550 MIM:183900 MIM:184250 MIM:200610
            MIM:271700 MIM:604864 MIM:608805 MIM:609162 MIM:609508
            neXtProt:NX_P02458 Orphanet:93296 Orphanet:209867 Orphanet:137678
            Orphanet:86820 Orphanet:93297 Orphanet:485 Orphanet:2380
            Orphanet:93279 Orphanet:166011 Orphanet:1427 Orphanet:85166
            Orphanet:93346 Orphanet:94068 Orphanet:93315 Orphanet:1856
            Orphanet:90653 PharmGKB:PA26715 ChiTaRS:COL2A1
            EvolutionaryTrace:P02458 GenomeRNAi:1280 NextBio:5171
            PMAP-CutDB:P02458 Bgee:P02458 Genevestigator:P02458
            GermOnline:ENSG00000139219 GO:GO:0030020 Uniprot:P02458
        Length = 1487

 Score = 130 (50.8 bits), Expect = 0.00010, P = 0.00010
 Identities = 90/295 (30%), Positives = 113/295 (38%)

Query:   120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATT--AGVVGAGP 177
             P  DR  D    GA G    +  G P G        G P   GPP       A + G   
Sbjct:   133 PRGDR-GDKGEKGAPGPRGRD--GEP-GTPGNPGPPGPPGPPGPPGLGGNFAAQMAGGFD 188

Query:   178 NTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAKG 234
               +  A      G PM      PRGP G   + GP G+  +     +P   GP   P +G
Sbjct:   189 EKAGGAQLGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GP-RG 243

Query:   235 PGYDPTKGPGYDAQKGSNYDA-QRGP-NYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQRG 290
             P   P K PG D + G    A +RGP      RG    P  GL G    RG P  D  +G
Sbjct:   244 PPGPPGK-PGDDGEAGKPGKAGERGPPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAKG 300

Query:   291 ----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS----- 340
                 PG + +   PG +   GP+   +  P    + GP       +G D +  P+     
Sbjct:   301 EAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPGP 358

Query:   341 YDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATP---PARSGSGQPRG--GNP 390
               P+ G GF GAP GA   G+  P     P G+  P   P   GS  P G  GNP
Sbjct:   359 VGPAGGPGFPGAP-GAK--GEAGPTGARGPEGAQGPRGEPGTPGSPGPAGASGNP 410

 Score = 124 (48.7 bits), Expect = 0.00047, P = 0.00047
 Identities = 88/282 (31%), Positives = 101/282 (35%)

Query:   128 GSYGGATGNSENETSGRPVGQNAYEDGYGVP--QGH-GPPPSATTAGVVGAG--PNTSTS 182
             G  G A  N E    G P G        G P  +G  GPP  A  AG  GA   P     
Sbjct:   792 GPPGPAGANGEKGEVGPP-GPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGE 850

Query:   183 AYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAP--SYDPTKGPSYDPAKG---- 234
                A Q G    A    P+GP G    +GP G    K    +  P     +  A G    
Sbjct:   851 QGEAGQKGD---AGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGP 907

Query:   235 PGYDPTKGP-GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY 293
             PG +   GP G     G   D  +G      RG S  P R  G    +GP      GP  
Sbjct:   908 PGSNGNPGPPGPPGPSGK--DGPKGA-----RGDSGPPGRA-GEPGLQGP-----AGPPG 954

Query:   294 ETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYDPSRGTGFD 350
             E    PG D   G   E    P  +  QRG  G   QRG+ G+     PS +P +  G  
Sbjct:   955 EKGE-PGDDGPSGA--EGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGK-QGAP 1010

Query:   351 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 392
             GA     P G V PP    P G    P R GS     G P R
Sbjct:  1011 GASGDRGPPGPVGPPGLTGPAGE---PGREGS-PGADGPPGR 1048


>UNIPROTKB|E2RRS5 [details] [associations]
            symbol:RBM12B "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0000166 "nucleotide binding" evidence=IEA] InterPro:IPR000504
            InterPro:IPR012677 PROSITE:PS50102 SMART:SM00360 GO:GO:0000166
            Gene3D:3.30.70.330 GO:GO:0003676 GeneTree:ENSGT00620000087828
            OMA:EHFRRPP CTD:389677 EMBL:AAEX03015951 RefSeq:XP_544177.3
            Ensembl:ENSCAFT00000014490 GeneID:487048 KEGG:cfa:487048
            NextBio:20860720 Uniprot:E2RRS5
        Length = 994

 Score = 124 (48.7 bits), Expect = 0.00011, Sum P(2) = 0.00011
 Identities = 45/174 (25%), Positives = 71/174 (40%)

Query:   192 PMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGS 251
             P    +  PR   +   +    D  + P  D  + P  D  + P  D  + P  D ++  
Sbjct:   591 PWEEGFRYPREEDFRYPREE--DWRRPPEEDFRRPPKDDFRRPPEEDWRRLPEGDFRRPP 648

Query:   252 NYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEA 311
               D +R P  D  R P  + +R    D +R P  D +R P  + +R+P  D +R P  + 
Sbjct:   649 EEDWRRPPEDDFRRLPQGEWRRPPEEDFRRPPEEDFRRLPEEDFRRLPEEDFRRPPEEDF 708

Query:   312 QRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPP 365
             +R+P    +R P  D +R      RR P  +  R    +   R    H + PPP
Sbjct:   709 RRSPEEDFRRSPEEDFRRPPPEHFRRPPP-EHLRRPPPEHFRRPPPEHFRRPPP 761

 Score = 50 (22.7 bits), Expect = 0.00011, Sum P(2) = 0.00011
 Identities = 14/57 (24%), Positives = 24/57 (42%)

Query:   102 YITMATEVEKLRAELMNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVP 158
             +++   E++K   E+     + R   GS  GA+G          + + A   GYG P
Sbjct:    72 FLSSKAEMQKT-IEMRRTDRIGRERPGS--GASGAGSLSNFVEAIKEEASNSGYGSP 125


>UNIPROTKB|A7E348 [details] [associations]
            symbol:PYGO2 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0060070 "canonical Wnt receptor signaling pathway"
            evidence=IEA] [GO:0060021 "palate development" evidence=IEA]
            [GO:0051569 "regulation of histone H3-K4 methylation" evidence=IEA]
            [GO:0048589 "developmental growth" evidence=IEA] [GO:0042393
            "histone binding" evidence=IEA] [GO:0033599 "regulation of mammary
            gland epithelial cell proliferation" evidence=IEA] [GO:0030879
            "mammary gland development" evidence=IEA] [GO:0009791
            "post-embryonic development" evidence=IEA] [GO:0007420 "brain
            development" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            [GO:0002088 "lens development in camera-type eye" evidence=IEA]
            [GO:0001822 "kidney development" evidence=IEA] [GO:0001701 "in
            utero embryonic development" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] InterPro:IPR001965 InterPro:IPR019787
            Pfam:PF00628 PROSITE:PS50016 SMART:SM00249 GO:GO:0005634
            GO:GO:0007420 GO:GO:0046872 GO:GO:0008270 GO:GO:0001701
            GO:GO:0009791 GO:GO:0001822 Gene3D:3.30.40.10 InterPro:IPR011011
            InterPro:IPR013083 SUPFAM:SSF57903 GO:GO:0048589 InterPro:IPR019786
            PROSITE:PS01359 GO:GO:0060021 GO:GO:0060070 GO:GO:0030879
            GO:GO:0033599 GO:GO:0051569 GO:GO:0002088 eggNOG:NOG72798
            HOGENOM:HOG000001580 HOVERGEN:HBG053774
            GeneTree:ENSGT00530000063948 CTD:90780 OMA:PGLVYPC
            OrthoDB:EOG4QZ7MB EMBL:DAAA02007156 EMBL:BC151715 IPI:IPI00866934
            RefSeq:NP_001095712.1 UniGene:Bt.102068 SMR:A7E348
            Ensembl:ENSBTAT00000005670 GeneID:540401 KEGG:bta:540401
            InParanoid:A7E348 NextBio:20878610 Uniprot:A7E348
        Length = 405

 Score = 123 (48.4 bits), Expect = 0.00011, P = 0.00011
 Identities = 78/298 (26%), Positives = 111/298 (37%)

Query:   117 MNAPNVDRRADGSYGGATGN-SENETSGRP-----VGQNAYEDGYGVPQGHGPPPSATTA 170
             M +P   RR   + G A  + +E      P     V  N +ED +G P+  G  P    +
Sbjct:    38 MKSPEKKRRKSNTQGPAYSHLTEFAPPPTPMVDHLVASNPFEDDFGAPKVGGAAPPFLGS 97

Query:   171 GVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTK-GPSY 229
              V   G           Q G     A  +P G G     GP     + P + P+  GP++
Sbjct:    98 PVPFGG--------FRVQGGM----AGQVPPGYGTGGGGGPQPLRRQPPPFPPSPMGPAF 145

Query:   230 D-PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYD-PQRGLGY----DMQRGP 283
             + P +GPGY P     + +Q    ++   G N+    G     P  G G      M + P
Sbjct:   146 NMPPQGPGYPPPGNMNFPSQP---FNQPLGQNFSPPGGQMMPGPVGGFGPMISPTMGQPP 202

Query:   284 NYDMQRGPGYETQRVPGYDVQRGPVYE--AQRAPSYIPQRGP--GYDLQ-RGQGYDMRRA 338
               ++  GP    QR        GP  +   Q  PS  P   P  G D    G G +    
Sbjct:   203 RGEL--GPPSLPQRFAQPGAPFGPSLQRPGQGLPSLPPNTSPFPGPDPGFPGPGGEDGGK 260

Query:   339 PSYDPSRGTGFDGAPRGAAP----HGQVPP-PLNNVPYGSATPPARSGS--GQPRGGN 389
             P  +P   T F   P   +P    +G  P  P N+   G  TP A S +  G+  GG+
Sbjct:   261 P-LNPPAATAFPQEPHSGSPAAAVNGNQPSFPPNSSGRGGGTPDANSLAPPGKAGGGS 317


>UNIPROTKB|Q5T171 [details] [associations]
            symbol:PYGO2 "Pygopus homolog 2" species:9606 "Homo
            sapiens" [GO:0008270 "zinc ion binding" evidence=IEA] [GO:0001701
            "in utero embryonic development" evidence=IEA] [GO:0001822 "kidney
            development" evidence=IEA] [GO:0002088 "lens development in
            camera-type eye" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            [GO:0007420 "brain development" evidence=IEA] [GO:0009791
            "post-embryonic development" evidence=IEA] [GO:0030879 "mammary
            gland development" evidence=IEA] [GO:0033599 "regulation of mammary
            gland epithelial cell proliferation" evidence=IEA] [GO:0042393
            "histone binding" evidence=IEA] [GO:0048589 "developmental growth"
            evidence=IEA] [GO:0051569 "regulation of histone H3-K4 methylation"
            evidence=IEA] [GO:0060021 "palate development" evidence=IEA]
            [GO:0060070 "canonical Wnt receptor signaling pathway"
            evidence=IEA] InterPro:IPR001965 InterPro:IPR019787 Pfam:PF00628
            PROSITE:PS50016 SMART:SM00249 GO:GO:0005634 GO:GO:0007420
            GO:GO:0046872 GO:GO:0008270 GO:GO:0001701 GO:GO:0009791
            GO:GO:0001822 EMBL:AL451085 Gene3D:3.30.40.10 InterPro:IPR011011
            InterPro:IPR013083 SUPFAM:SSF57903 GO:GO:0048589 InterPro:IPR019786
            PROSITE:PS01359 GO:GO:0060021 EMBL:CH471121 GO:GO:0060070
            GO:GO:0030879 GO:GO:0033599 GO:GO:0051569 GO:GO:0002088
            HOGENOM:HOG000001580 HOVERGEN:HBG053774 UniGene:Hs.533597
            HGNC:HGNC:30257 IPI:IPI00642524 SMR:Q5T171 STRING:Q5T171
            Ensembl:ENST00000368456 Uniprot:Q5T171
        Length = 369

 Score = 122 (48.0 bits), Expect = 0.00012, P = 0.00012
 Identities = 80/302 (26%), Positives = 113/302 (37%)

Query:   117 MNAPNVDRRADGSYGGATGN-SENETSGRP-----VGQNAYEDGYGVPQ-GHGPPPSATT 169
             M +P   RR   + G A  + +E      P     V  N +ED +G P+ G   PP   +
Sbjct:     1 MKSPEKKRRKSNTQGPAYSHLTEFAPPPTPMVDHLVASNPFEDDFGAPKVGVAAPPFLGS 60

Query:   170 AGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKA--PSYDPTK-G 226
                 G             Q G     A  +P  PGY    G G    +   P + P   G
Sbjct:    61 PVPFGG---------FRVQGGM----AGQVP--PGYSTGGGGGPQPLRRQPPPFPPNPMG 105

Query:   227 PSYD-PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYD-PQRGLGY----DMQ 280
             P+++ P +GPGY P     + +Q    ++   G N+    G     P  G G      M 
Sbjct:   106 PAFNMPPQGPGYPPPGNMNFPSQP---FNQPLGQNFSPPSGQMMPGPVGGFGPMISPTMG 162

Query:   281 RGPNYDMQRGPGYETQRV--PGYDVQRGPVYE-AQRAPSYIPQRGP--GYDLQ-RGQGYD 334
             + P  ++  GP   +QR   PG      P+    Q  PS  P   P  G D    G G +
Sbjct:   163 QPPRAEL--GPPSLSQRFAQPGAPFGPSPLQRPGQGLPSLPPNTSPFPGPDPGFPGPGGE 220

Query:   335 MRRAPSYDPSRGTGFDGAPRGAAP----HGQVPP-PLNNVPYGSATPPARSGS--GQPRG 387
                 P  +P   T F   P   +P    +G  P  P N+   G  TP A S +  G+  G
Sbjct:   221 DGGKP-LNPPASTAFPQEPHSGSPAAAVNGNQPSFPPNSSGRGGGTPDANSLAPPGKAGG 279

Query:   388 GN 389
             G+
Sbjct:   280 GS 281


>TAIR|locus:2140513 [details] [associations]
            symbol:AT4G10070 "AT4G10070" species:3702 "Arabidopsis
            thaliana" [GO:0003676 "nucleic acid binding" evidence=ISS]
            [GO:0003723 "RNA binding" evidence=IEA] [GO:0005634 "nucleus"
            evidence=ISM] [GO:0008150 "biological_process" evidence=ND]
            [GO:0000956 "nuclear-transcribed mRNA catabolic process"
            evidence=RCA] [GO:0009688 "abscisic acid biosynthetic process"
            evidence=RCA] InterPro:IPR004087 InterPro:IPR004088 Pfam:PF00013
            PROSITE:PS50084 SMART:SM00322 EMBL:CP002687 GO:GO:0003723
            UniGene:At.33655 IPI:IPI01020077 RefSeq:NP_192745.2
            ProteinModelPortal:F4JLJ3 SMR:F4JLJ3 EnsemblPlants:AT4G10070.1
            GeneID:826598 KEGG:ath:AT4G10070 OMA:PSTHAIG ArrayExpress:F4JLJ3
            Uniprot:F4JLJ3
        Length = 725

 Score = 126 (49.4 bits), Expect = 0.00012, P = 0.00012
 Identities = 70/253 (27%), Positives = 87/253 (34%)

Query:   160 GHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGPGYDASKA 218
             G  PPPS         GP  S  +Y   QS  P    +  P    GY+ + G  Y+  K 
Sbjct:   442 GPVPPPSGPVPSPAFGGPPLSQVSYGYGQSHGP-EYGHAAPYSQTGYQQTYGQTYEQPKY 500

Query:   219 ---PSYDPTKGPSYDPAKG--PGYDPTKGPG---YDAQKG---SNYD----AQRGPNYDI 263
                P   P  G SY PA G   GY   + PG   Y  Q+G     Y     A    + D+
Sbjct:   501 DSNPPMQPPYGGSYPPAGGGQSGYYQMQQPGVRPYGMQQGPVQQGYGPPQPAAAASSGDV 560

Query:   264 -HRG-----PSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSY 317
              ++G     PSY          Q G  Y    GP  + Q  P Y     P  +A    + 
Sbjct:   561 PYQGATPAAPSYGSTNMAPQQQQYG--YTSSDGP-VQQQTYPSYS--SAPPSDAYNNGTQ 615

Query:   318 IPQRGPGYDLQRGQG----YDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNN-VPYG 372
              P  GP Y  Q  Q     YD   A     + G G   AP G   +    P   +   Y 
Sbjct:   616 TPATGPAYQQQSVQPASSTYDQTGAQQA-AAAGYGGQVAPTGGYTYPTSQPAYGSQAAYS 674

Query:   373 SATPPARSGSGQP 385
              A P       QP
Sbjct:   675 QAAPTQTGYEQQP 687


>MGI|MGI:88452 [details] [associations]
            symbol:Col2a1 "collagen, type II, alpha 1" species:10090 "Mus
            musculus" [GO:0001501 "skeletal system development" evidence=ISO]
            [GO:0001502 "cartilage condensation" evidence=IMP] [GO:0001894
            "tissue homeostasis" evidence=IMP] [GO:0001958 "endochondral
            ossification" evidence=IMP] [GO:0002062 "chondrocyte
            differentiation" evidence=IMP] [GO:0003007 "heart morphogenesis"
            evidence=IMP] [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0005578 "proteinaceous extracellular matrix"
            evidence=IEA] [GO:0005581 "collagen" evidence=IDA] [GO:0005585
            "collagen type II" evidence=ISO;IDA;IMP] [GO:0005604 "basement
            membrane" evidence=IDA] [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0006029
            "proteoglycan metabolic process" evidence=IMP] [GO:0007601 "visual
            perception" evidence=ISO] [GO:0007605 "sensory perception of sound"
            evidence=ISO] [GO:0010468 "regulation of gene expression"
            evidence=IMP] [GO:0030199 "collagen fibril organization"
            evidence=ISO;IMP] [GO:0031012 "extracellular matrix" evidence=IDA]
            [GO:0035108 "limb morphogenesis" evidence=IMP] [GO:0042472 "inner
            ear morphogenesis" evidence=IMP] [GO:0042802 "identical protein
            binding" evidence=IPI] [GO:0043066 "negative regulation of
            apoptotic process" evidence=IMP] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0048407 "platelet-derived growth factor binding"
            evidence=ISO] [GO:0048705 "skeletal system morphogenesis"
            evidence=IMP] [GO:0048839 "inner ear development" evidence=IMP]
            [GO:0051216 "cartilage development" evidence=IMP] [GO:0060021
            "palate development" evidence=IMP] [GO:0060272 "embryonic skeletal
            joint morphogenesis" evidence=ISO] [GO:0060348 "bone development"
            evidence=IMP] [GO:0060351 "cartilage development involved in
            endochondral bone morphogenesis" evidence=IMP] [GO:0071773
            "cellular response to BMP stimulus" evidence=IDA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 MGI:MGI:88452 GO:GO:0005737
            GO:GO:0043066 GO:GO:0005615 GO:GO:0046872 GO:GO:0003007
            GO:GO:0007601 GO:GO:0030199 GO:GO:0007417 GO:GO:0042472
            GO:GO:0001894 GO:GO:0007605 GO:GO:0071773 InterPro:IPR008160
            Pfam:PF01391 eggNOG:NOG12793 GO:GO:0071599 GO:GO:0005604
            GO:GO:0001502 GO:GO:0060021 GO:GO:0002062 GO:GO:0010468
            GO:GO:0060272 GO:GO:0006029 GO:GO:0001958 GO:GO:0060351
            GO:GO:0005201 GeneTree:ENSGT00660000095287 HOVERGEN:HBG004933
            KO:K06236 CTD:1280 OMA:SSCRICV GO:GO:0005585 GO:GO:0060174
            GO:GO:0030903 OrthoDB:EOG4FTW1C ChiTaRS:COL2A1 EMBL:M65161
            EMBL:BC030913 EMBL:BC051383 EMBL:BC052326 EMBL:BC082331 EMBL:S63190
            EMBL:M63708 EMBL:M63709 EMBL:M63710 EMBL:AK028295 EMBL:X57982
            IPI:IPI00471183 IPI:IPI00621255 IPI:IPI00622890 IPI:IPI00623625
            IPI:IPI00828467 IPI:IPI00828653 IPI:IPI00828753 PIR:A41182
            PIR:B41182 RefSeq:NP_001106987.2 RefSeq:NP_112440.2 UniGene:Mm.2423
            PDB:2W65 PDBsum:2W65 ProteinModelPortal:P28481 SMR:P28481
            IntAct:P28481 STRING:P28481 PhosphoSite:P28481 PRIDE:P28481
            Ensembl:ENSMUST00000023123 Ensembl:ENSMUST00000088355 GeneID:12824
            KEGG:mmu:12824 UCSC:uc007xlp.2 UCSC:uc007xlq.2 InParanoid:P28481
            EvolutionaryTrace:P28481 NextBio:282306 Bgee:P28481
            CleanEx:MM_COL2A1 Genevestigator:P28481
            GermOnline:ENSMUSG00000022483 Uniprot:P28481
        Length = 1487

 Score = 129 (50.5 bits), Expect = 0.00014, P = 0.00014
 Identities = 88/296 (29%), Positives = 110/296 (37%)

Query:   120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSAT---TAGVVGAG 176
             P  DR  D    GA G    +  G P G        G P   GPP  +     A + G  
Sbjct:   132 PRGDR-GDKGEKGAPGPRGRD--GEP-GTPGNPGPAGPPGPPGPPGLSAGNFAAQMAGGY 187

Query:   177 PNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAK 233
                +  A      G PM      PRGP G   + GP G+  +     +P   GP   P  
Sbjct:   188 DEKAGGAQMGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GPRG 243

Query:   234 GPGYDPTKGPGYDAQKGS-NYDAQRG-PNYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQR 289
              PG  P   PG D + G      +RG P     RG    P  GL G    RG P  D  +
Sbjct:   244 PPG--PAGKPGDDGEAGKPGKSGERGLPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAK 299

Query:   290 G----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS---- 340
             G    PG + +   PG +   GP+   +  P    + GP       +G D +  P+    
Sbjct:   300 GEAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPG 357

Query:   341 -YDPSRGTGFDGAP--RG-AAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNP 390
                P+ G GF GAP  +G A P G   P       GS   P   GS  P G  GNP
Sbjct:   358 PVGPAGGPGFPGAPGAKGEAGPTGARGP---EGAQGSRGEPGNPGSPGPAGASGNP 410


>UNIPROTKB|P05997 [details] [associations]
            symbol:COL5A2 "Collagen alpha-2(V) chain" species:9606
            "Homo sapiens" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0001501 "skeletal system development"
            evidence=IEA] [GO:0046332 "SMAD binding" evidence=IEA] [GO:0071230
            "cellular response to amino acid stimulus" evidence=IEA]
            [GO:0030199 "collagen fibril organization" evidence=ISS;IMP]
            [GO:0043588 "skin development" evidence=ISS;IMP] [GO:0031012
            "extracellular matrix" evidence=NAS] [GO:0003674
            "molecular_function" evidence=ND] [GO:0048592 "eye morphogenesis"
            evidence=IMP] [GO:0005588 "collagen type V" evidence=IMP]
            [GO:0005576 "extracellular region" evidence=TAS] [GO:0005788
            "endoplasmic reticulum lumen" evidence=TAS] [GO:0007411 "axon
            guidance" evidence=TAS] [GO:0030198 "extracellular matrix
            organization" evidence=TAS] InterPro:IPR000885 InterPro:IPR001007
            Pfam:PF00093 Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208
            PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038 SMART:SM00214
            Reactome:REACT_118779 Reactome:REACT_111045 Reactome:REACT_111102
            GO:GO:0007411 GO:GO:0043588 GO:GO:0046872 GO:GO:0030199
            GO:GO:0005788 GO:GO:0001501 InterPro:IPR008160 Pfam:PF01391
            eggNOG:NOG12793 GO:GO:0071230 GO:GO:0005201 GO:GO:0048592
            HOVERGEN:HBG004933 KO:K06236 MIM:130000 Orphanet:90309
            EMBL:AY016295 PDB:1A9A PDBsum:1A9A MIM:130010 Orphanet:90318
            GO:GO:0005588 EMBL:Y14690 EMBL:AB209045 EMBL:AC064833 EMBL:AC133106
            EMBL:J04478 EMBL:AY016288 EMBL:AY016287 EMBL:AY016289 EMBL:AY016290
            EMBL:AY016291 EMBL:AY016292 EMBL:AY016293 EMBL:AY016294 EMBL:M58529
            EMBL:X04758 EMBL:BC043613 EMBL:M10956 EMBL:M11135 EMBL:M11718
            EMBL:J03051 IPI:IPI00739099 PIR:A31427 RefSeq:NP_000384.2
            UniGene:Hs.445827 ProteinModelPortal:P05997 SMR:P05997
            STRING:P05997 PhosphoSite:P05997 DMDM:143811378 PaxDb:P05997
            PRIDE:P05997 Ensembl:ENST00000374866 GeneID:1290 KEGG:hsa:1290
            UCSC:uc002uqk.3 CTD:1290 GeneCards:GC02M189861 HGNC:HGNC:2210
            MIM:120190 neXtProt:NX_P05997 PharmGKB:PA26725 InParanoid:P05997
            OMA:PDHKPVW OrthoDB:EOG4K0QMS ChiTaRS:COL5A2 GenomeRNAi:1290
            NextBio:5223 PMAP-CutDB:P05997 ArrayExpress:P05997 Bgee:P05997
            Genevestigator:P05997 GermOnline:ENSG00000204262 Uniprot:P05997
        Length = 1499

 Score = 129 (50.5 bits), Expect = 0.00014, P = 0.00014
 Identities = 87/293 (29%), Positives = 109/293 (37%)

Query:   123 DRRADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNT 179
             ++ A+G+ G  GA G         P G    E G   P+G  GPP S    G  G    T
Sbjct:   784 EKGAEGTAGNDGARGLPGPLGPPGPAGPTG-EKGEPGPRGLVGPPGSRGNPGSRGENGPT 842

Query:   180 STSAYAATQS--GTP-MRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGP 235
                 +A  Q   G P ++     P   G   S GP G   S  P + P   P     +G 
Sbjct:   843 GAVGFAGPQGPDGQPGVKGEPGEPGQKGDAGSPGPQGLAGSPGP-HGPNGVPGLKGGRGT 901

Query:   236 GYDP--TKGPGYDAQKGSNYDAQR-GPNYDIHRGPSYDPQRGLGYDM-------QRGPNY 285
                P  T  PG   + G    A   GP   +   P  +   GL  D         RGP  
Sbjct:   902 QGPPGATGFPGSAGRVGPPGPAGAPGPAGPLGE-PGKEGPPGLRGDPGSHGRVGDRGPA- 959

Query:   286 DMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYD 342
                 GPG +    PG D Q GP  +    P+    QRG  G   QRG+ G      P+  
Sbjct:   960 GPPGGPGDKGD--PGEDGQPGP--DGPPGPAGTTGQRGIVGMPGQRGERGMPGLPGPAGT 1015

Query:   343 PSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP-RGGNPARR 393
             P +  G  GA     P G V PP +N P G   P   +G+ G P R G    R
Sbjct:  1016 PGK-VGPTGATGDKGPPGPVGPPGSNGPVGEPGPEGPAGNDGTPGRDGAVGER 1067


>UNIPROTKB|D3ZZM1 [details] [associations]
            symbol:Taf15 "Protein Taf15" species:10116 "Rattus
            norvegicus" [GO:0000166 "nucleotide binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005622
            "intracellular" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358
            PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360 SMART:SM00547
            RGD:1309595 GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330
            GO:GO:0003676 GO:GO:0005622 EMBL:AC119615 IPI:IPI00950003
            ProteinModelPortal:D3ZZM1 Ensembl:ENSRNOT00000064396
            ArrayExpress:D3ZZM1 Uniprot:D3ZZM1
        Length = 558

 Score = 124 (48.7 bits), Expect = 0.00014, P = 0.00014
 Identities = 67/238 (28%), Positives = 89/238 (37%)

Query:   124 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQ-GHGPPPSATTAGVVGAGPNTSTS 182
             RR +   GG +G       G   G+  ++   G P+ G    P+ +   +  A  N+   
Sbjct:   318 RRPEFMRGGGSGGGRRGRGGYR-GRGGFQGRGGDPKNGDWVCPNPSCGNMNFARRNSCNQ 376

Query:   183 AYAAT-QSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK 241
                   +   P    +   RG GY   +G  +        D  +G       G GY   +
Sbjct:   377 CNEPRPEDSRPSGGDF---RGRGYGGERG--FRGRGGRGGD--RGGYGADRSGGGYGGDR 429

Query:   242 GPG-YDAQK-GSNYDAQR-GPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRV 298
               G Y A + G  Y   R G  Y   RG  Y   RG GY   RG +Y   RG GY   R 
Sbjct:   430 SGGSYGADRSGGGYGGDRSGGGYGGDRGGGYGGDRG-GYGGDRGGSYGGDRG-GYGGDR- 486

Query:   299 PGYDVQRGPVYEAQRAP-SYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
              GY   RG  Y   R+  +Y   RG G       GY   R+  Y   RG G+ G  RG
Sbjct:   487 GGYGGDRGG-YGGDRSRGAYGGDRGGG-----SGGYGGDRSGGYGGDRGGGY-GGDRG 537


>UNIPROTKB|Q9BRQ0 [details] [associations]
            symbol:PYGO2 "Pygopus homolog 2" species:9606 "Homo
            sapiens" [GO:0008270 "zinc ion binding" evidence=IEA] [GO:0001701
            "in utero embryonic development" evidence=IEA] [GO:0001822 "kidney
            development" evidence=IEA] [GO:0002088 "lens development in
            camera-type eye" evidence=IEA] [GO:0007420 "brain development"
            evidence=IEA] [GO:0009791 "post-embryonic development"
            evidence=IEA] [GO:0030879 "mammary gland development" evidence=IEA]
            [GO:0033599 "regulation of mammary gland epithelial cell
            proliferation" evidence=IEA] [GO:0042393 "histone binding"
            evidence=IEA] [GO:0048589 "developmental growth" evidence=IEA]
            [GO:0051569 "regulation of histone H3-K4 methylation" evidence=IEA]
            [GO:0060021 "palate development" evidence=IEA] [GO:0060070
            "canonical Wnt receptor signaling pathway" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR001965
            InterPro:IPR019787 Pfam:PF00628 PROSITE:PS50016 SMART:SM00249
            GO:GO:0005634 GO:GO:0007420 GO:GO:0046872 GO:GO:0008270
            GO:GO:0001701 GO:GO:0009791 GO:GO:0001822 Gene3D:3.30.40.10
            InterPro:IPR011011 InterPro:IPR013083 SUPFAM:SSF57903 GO:GO:0048589
            InterPro:IPR019786 PROSITE:PS01359 GO:GO:0060021 GO:GO:0060070
            GO:GO:0030879 GO:GO:0033599 PDB:2XB1 PDBsum:2XB1 GO:GO:0051569
            GO:GO:0002088 eggNOG:NOG72798 HOGENOM:HOG000001580
            HOVERGEN:HBG053774 EMBL:AF457208 EMBL:BC006132 EMBL:BC013725
            EMBL:BC032099 EMBL:AF289598 IPI:IPI00042099 RefSeq:NP_612157.1
            UniGene:Hs.533597 ProteinModelPortal:Q9BRQ0 SMR:Q9BRQ0
            IntAct:Q9BRQ0 STRING:Q9BRQ0 PhosphoSite:Q9BRQ0 DMDM:23396825
            PaxDb:Q9BRQ0 PRIDE:Q9BRQ0 DNASU:90780 Ensembl:ENST00000368457
            GeneID:90780 KEGG:hsa:90780 UCSC:uc001fft.3 CTD:90780
            GeneCards:GC01M154929 HGNC:HGNC:30257 HPA:HPA023689 MIM:606903
            neXtProt:NX_Q9BRQ0 PharmGKB:PA134881185 InParanoid:Q9BRQ0
            OMA:PGLVYPC OrthoDB:EOG4QZ7MB PhylomeDB:Q9BRQ0 GenomeRNAi:90780
            NextBio:76956 ArrayExpress:Q9BRQ0 Bgee:Q9BRQ0 CleanEx:HS_PYGO2
            Genevestigator:Q9BRQ0 GermOnline:ENSG00000163348 Uniprot:Q9BRQ0
        Length = 406

 Score = 122 (48.0 bits), Expect = 0.00014, P = 0.00014
 Identities = 80/302 (26%), Positives = 113/302 (37%)

Query:   117 MNAPNVDRRADGSYGGATGN-SENETSGRP-----VGQNAYEDGYGVPQ-GHGPPPSATT 169
             M +P   RR   + G A  + +E      P     V  N +ED +G P+ G   PP   +
Sbjct:    38 MKSPEKKRRKSNTQGPAYSHLTEFAPPPTPMVDHLVASNPFEDDFGAPKVGVAAPPFLGS 97

Query:   170 AGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKA--PSYDPTK-G 226
                 G             Q G     A  +P  PGY    G G    +   P + P   G
Sbjct:    98 PVPFGG---------FRVQGGM----AGQVP--PGYSTGGGGGPQPLRRQPPPFPPNPMG 142

Query:   227 PSYD-PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYD-PQRGLGY----DMQ 280
             P+++ P +GPGY P     + +Q    ++   G N+    G     P  G G      M 
Sbjct:   143 PAFNMPPQGPGYPPPGNMNFPSQP---FNQPLGQNFSPPSGQMMPGPVGGFGPMISPTMG 199

Query:   281 RGPNYDMQRGPGYETQRV--PGYDVQRGPVYE-AQRAPSYIPQRGP--GYDLQ-RGQGYD 334
             + P  ++  GP   +QR   PG      P+    Q  PS  P   P  G D    G G +
Sbjct:   200 QPPRAEL--GPPSLSQRFAQPGAPFGPSPLQRPGQGLPSLPPNTSPFPGPDPGFPGPGGE 257

Query:   335 MRRAPSYDPSRGTGFDGAPRGAAP----HGQVPP-PLNNVPYGSATPPARSGS--GQPRG 387
                 P  +P   T F   P   +P    +G  P  P N+   G  TP A S +  G+  G
Sbjct:   258 DGGKP-LNPPASTAFPQEPHSGSPAAAVNGNQPSFPPNSSGRGGGTPDANSLAPPGKAGG 316

Query:   388 GN 389
             G+
Sbjct:   317 GS 318


>RGD|1311417 [details] [associations]
            symbol:Col7a1 "collagen, type VII, alpha 1" species:10116
            "Rattus norvegicus" [GO:0003674 "molecular_function" evidence=ND]
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005604 "basement
            membrane" evidence=ISO] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR002035 InterPro:IPR003961 Pfam:PF00041
            Pfam:PF00092 PROSITE:PS50234 PROSITE:PS50853 SMART:SM00060
            SMART:SM00327 RGD:1311417 Gene3D:2.60.40.10 InterPro:IPR013783
            SUPFAM:SSF49265 InterPro:IPR008160 Pfam:PF01391 IPI:IPI00951759
            Ensembl:ENSRNOT00000066518 UCSC:RGD:1311417 ArrayExpress:D3ZQ14
            Uniprot:D3ZQ14
        Length = 2585

 Score = 131 (51.2 bits), Expect = 0.00015, P = 0.00015
 Identities = 75/262 (28%), Positives = 96/262 (36%)

Query:   143 GRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG 202
             G P G    +   G P   GPP S    GV G+     +  ++  +     R     P+G
Sbjct:  1285 GAP-GSTQAKGERGFPGPEGPPGSPGLPGVPGSPGVKGSPGWSGPRGDRGERGPQG-PKG 1342

Query:   203 ----PGYEASKG-PGYDASKAPSYDPTKGPSYDPAK-GPGYDPTKGP-GYDAQKGSNYDA 255
                 PG     G PG    K    DP  GPS  P   GP  DP  GP G     G++   
Sbjct:  1343 EPGEPGQVIGGGRPGLPGKKG---DP--GPSGPPGPHGPLGDP--GPRGPPGLPGTSVKG 1395

Query:   256 QRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVPGYDVQRGPVYEAQRA 314
              +G   +  RGP   P  G G   Q  P      G PG   Q  PG   ++G   + +  
Sbjct:  1396 DKGDRGE--RGP---PGPGTGASEQGSPGLPGLPGSPG--PQGPPGRTGEKGEKGDCEDG 1448

Query:   315 PSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGT-GFDGAPRGAAPHGQVPPPLNNVPYGS 373
                +P + PG   + G    +R AP     +G  G  G P      G+  PP    P G 
Sbjct:  1449 GPGLPGQ-PGVPGEPG----LRGAPGVTGPKGDRGLTGTPGEPGEKGERGPPGPVGPQGL 1503

Query:   374 ATPPARSGSGQPRG--GNPARR 393
                  R G   P G  G P RR
Sbjct:  1504 PGAAGRPGVEGPEGPPGPPGRR 1525


>ZFIN|ZDB-GENE-030516-3 [details] [associations]
            symbol:col18a1 "collagen type XVIII, alpha 1"
            species:7955 "Danio rerio" [GO:0031012 "extracellular matrix"
            evidence=IEA] [GO:0005198 "structural molecule activity"
            evidence=IEA] [GO:0007155 "cell adhesion" evidence=IEA] [GO:0005581
            "collagen" evidence=IEA] InterPro:IPR010515 InterPro:IPR020067
            Pfam:PF01392 Pfam:PF06482 PROSITE:PS50038 ZFIN:ZDB-GENE-030516-3
            GO:GO:0005198 Gene3D:3.10.100.10 InterPro:IPR016186
            InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0007155 InterPro:IPR008985
            SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            InterPro:IPR001791 SMART:SM00282 Gene3D:1.10.2000.10
            SUPFAM:SSF63501 SMART:SM00210 GeneTree:ENSGT00700000104250
            HOGENOM:HOG000231591 HOVERGEN:HBG053241 EMBL:BX927363 EMBL:CT030212
            IPI:IPI00616856 UniGene:Dr.52833 SMR:B0S8G4
            Ensembl:ENSDART00000130434 OMA:DRFNRYD Uniprot:B0S8G4
        Length = 1645

 Score = 129 (50.5 bits), Expect = 0.00015, P = 0.00015
 Identities = 73/277 (26%), Positives = 99/277 (35%)

Query:   125 RADGSYGGATGNSENETSGRPVGQNAYE--DGYGVPQGHGP--PPSATTAGVVGA-GPNT 179
             + D   G  +G       G P G+   +   G+G P   G   PP     G  G  GP  
Sbjct:   609 KGDVGSGSVSGGGSKGDKGVP-GEKGMKGTSGFGYPGSKGDRGPP-----GPPGPPGPQG 662

Query:   180 STSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGY 237
              ++       G+ ++     PRGP G +   GP G +       +  K     P+  PG 
Sbjct:   663 PSAEVEVRGDGSVVQKVTG-PRGPPGPQGPPGPPGPEGEPGDPGEDGKAGQVGPSGFPGN 721

Query:   238 DPTKGP-GYDAQKGSNYDAQRGP-NYDIHRGPSYDPQRGLGYDMQRGPNYDMQ--RG-PG 292
                 GP G    +G +    RGP       GPS    R    DM+ G  +DM   R  PG
Sbjct:   722 PGNPGPKGDKGDRGESQPGPRGPPGPPGPPGPSSGFDRPTFVDME-GSGFDMDSVRAVPG 780

Query:   293 YETQRVPGYDVQRGPVYEAQRAPS-YIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDG 351
                   PG     GP   A      + P   PG +   GQ   +   P  D   G     
Sbjct:   781 LPGP--PGPPGPPGPPGSASSGSGGFGPPGPPGQNGAPGQP-GLSGVPGADGKPGLPGPK 837

Query:   352 APRGAAPHGQVPPPLNNV-PYGSATPPARSGSGQPRG 387
               +G A    +P P+      GS+ PP  +G G P G
Sbjct:   838 GEKGDAGELGLPGPVGEKGAKGSSGPPGTTGIGGPAG 874


>UNIPROTKB|O46392 [details] [associations]
            symbol:COL1A2 "Collagen alpha-2(I) chain" species:9615
            "Canis lupus familiaris" [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 GO:GO:0046872 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 eggNOG:NOG12793 GO:GO:0005201 HOGENOM:HOG000085654
            HOVERGEN:HBG004933 KO:K06236 CTD:1278 EMBL:AF035120
            RefSeq:NP_001003187.1 UniGene:Cfa.1262 STRING:O46392 GeneID:403824
            KEGG:cfa:403824 NextBio:20817320 Uniprot:O46392
        Length = 1366

 Score = 128 (50.1 bits), Expect = 0.00016, P = 0.00016
 Identities = 86/283 (30%), Positives = 105/283 (37%)

Query:   132 GATG-NSENETSGRP--VGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQ 188
             GA G       +G P   G        G+P   G   +    G+VG  P  + S   +  
Sbjct:   301 GANGLTGAKGAAGLPGVAGAPGLPGPRGIPGPVGAAGATGARGIVGE-PGPAGSKGESGN 359

Query:   189 SGTPMRAAYDIPRGP-GYEASKGPGYDASKA-PSYDPTKGPSYDP-AKG-PGYDPTKGP- 243
              G P  A    P GP G E  +GP  +A  A PS  P  G    P ++G PG D   G  
Sbjct:   360 KGEPGSAGAQGPPGPSGEEGKRGPNGEAGSAGPSGPP--GLRGSPGSRGLPGADGPAGVM 417

Query:   244 GYDAQKGSNYDAQ-RGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRGP-GYE-TQRVP 299
             G    +G+   A  RGPN D  R P  +P    G    RG P      GP G E    +P
Sbjct:   418 GPPGPRGATGPAGVRGPNGDSGR-PG-EP----GLMGPRGFPGAPGNVGPAGKEGPMGLP 471

Query:   300 GYDVQRGPVYEA--QRAPSYIPQRGP-GYDLQRGQGYDMRRAPSYDPSRGT-GFDGAPRG 355
             G D + GP+  A  +  P  I   GP G     G+  D   A     +RG  G DG    
Sbjct:   472 GIDGRPGPIGPAGARGEPGNIGFPGPKGPTGDPGKNGDKGHA-GLAGARGAPGPDGNNGA 530

Query:   356 AAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-----GNPARR 393
               P G           G A PP   G   P G     G P  R
Sbjct:   531 QGPPGPQGVQGGKGEQGPAGPPGFQGLPGPAGTAGEVGKPGER 573


>UNIPROTKB|F1KQQ4 [details] [associations]
            symbol:F1KQQ4 "Collagen alpha-1(IV) chain" species:6253
            "Ascaris suum" [GO:0031012 "extracellular matrix" evidence=IDA]
            InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
            InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 Gene3D:2.170.240.10 EMBL:JI164326
            Uniprot:F1KQQ4
        Length = 1759

 Score = 129 (50.5 bits), Expect = 0.00016, P = 0.00016
 Identities = 86/285 (30%), Positives = 105/285 (36%)

Query:   128 GSYGGATGNSENETSGRP--VGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYA 185
             G  G A  N      G P   G+   +  +G P   GP  +   +G+ GA P        
Sbjct:  1164 GIPGDAGFNGRAGLPGLPGIKGERGQDGQHGYPGEPGPVGAHGESGLTGA-PGLQGEPGL 1222

Query:   186 ATQSGTPMR----AAYDIPRGPGYEASKG----PGYDASKA-PSYD--PTKGPSYDPAKG 234
               + G P +     A   P  PG E   G     G D     P  D  P +GP  D A  
Sbjct:  1223 PGRMGLPGQPGELGAPGFPGAPGLEGIPGIRGERGDDGLPGLPGIDGIPIQGPEGD-AGY 1281

Query:   235 PGYDPTKG-PGYDAQKGSNYDAQRG-PNYDIHRG----PSYDPQRGL-GYDMQRGPNYDM 287
             PG D   G PG   Q+G   D   G P     RG    P Y  +RGL G D +RGP  D 
Sbjct:  1282 PGRDGNDGLPGLPGQRGD--DGLPGLPGLIGERGDDGLPGYPGERGLRGIDGKRGP--DG 1337

Query:   288 QRG-PGYE-TQRVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPS 344
              RG PG       PG   +RG        P +  + G PGY  +RG+       P     
Sbjct:  1338 ARGLPGPPGLDGYPGAPGERG----MDGLPGFPGKDGIPGYPGERGEV----GLPGLPGM 1389

Query:   345 RGT-GFDGAPRGAAPHG-QVPPPLNNVPYGSATPPARSGSGQPRG 387
             RG  G  G P  A   G +    L  +P G   P    G   P G
Sbjct:  1390 RGEDGLPGLPGLAGQKGARGDDGLPGLP-GLPGPVGARGRPGPPG 1433


>UNIPROTKB|F1LNY9 [details] [associations]
            symbol:Col2a1 "Collagen alpha-1(II) chain" species:10116
            "Rattus norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 RGD:2375 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 IPI:IPI00558825
            Ensembl:ENSRNOT00000049994 ArrayExpress:F1LNY9 Uniprot:F1LNY9
        Length = 1441

 Score = 128 (50.1 bits), Expect = 0.00017, P = 0.00017
 Identities = 81/280 (28%), Positives = 105/280 (37%)

Query:   124 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS 182
             R       GA GN        P G      G G P   G P +   AG  GA GP  +  
Sbjct:   288 RTGPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFP---GAPGAKGEAGPTGARGPEGAQG 344

Query:   183 AYAATQS-GTPMRAAYDIPRG----PGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGP- 235
             +     + G+P  A      G    PG + S G PG   + AP +   +GP   P  GP 
Sbjct:   345 SRGEPGNPGSPGPAGASGNPGTDGIPGAKGSAGAPGI--AGAPGFPGPRGP---P--GPQ 397

Query:   236 GYDPTKGP-GYDAQKG-SNYDAQRGPNYDIH-RGPSYDPQRGLGYDMQRGPNYDMQ-RGP 291
             G     GP G   + G + +  ++GP  +    GP   P    G + +RG   +    GP
Sbjct:   398 GATGPLGPKGQTGEPGIAGFKGEQGPKGETGPAGPQGAPGPA-GEEGKRGARGEPGGAGP 456

Query:   292 -GYETQR-VPGYDVQRG-PVYEAQRAPSYIP-QRGP-GYDLQRGQGYDMRRAPSYDPSRG 346
              G   +R  PG    RG P  +    P   P +RGP G    +G   D  R         
Sbjct:   457 IGPPGERGAPG---NRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEPGLPGA 513

Query:   347 TGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP 385
              G  G P  A P G+V P       G   PP   G+ GQP
Sbjct:   514 RGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQP 553

 Score = 125 (49.1 bits), Expect = 0.00036, P = 0.00036
 Identities = 87/281 (30%), Positives = 99/281 (35%)

Query:   126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 183
             ADG  G      E    G   G    +   G P   GP       G  GA GP  +T   
Sbjct:   795 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 853

Query:   184 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 239
              AA + G P       P GP      GP G D  K    D    G + DP  +GP   P 
Sbjct:   854 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 908

Query:   240 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 295
              KG PG D   GS  D   GP     +G +   QRG+ G   QRG   +    GP  E  
Sbjct:   909 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 959

Query:   296 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 351
              Q  PG    RGP           P   PG +   G      R  A      RG TG  G
Sbjct:   960 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 1019

Query:   352 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 391
             AP    P G  P P    P G       +G+  P G   PA
Sbjct:  1020 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1057


>UNIPROTKB|F1LQ06 [details] [associations]
            symbol:Col2a1 "Collagen alpha-1(II) chain" species:10116
            "Rattus norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 RGD:2375 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 IPI:IPI00949996
            Ensembl:ENSRNOT00000066385 ArrayExpress:F1LQ06 Uniprot:F1LQ06
        Length = 1441

 Score = 128 (50.1 bits), Expect = 0.00017, P = 0.00017
 Identities = 81/280 (28%), Positives = 105/280 (37%)

Query:   124 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS 182
             R       GA GN        P G      G G P   G P +   AG  GA GP  +  
Sbjct:   288 RTGPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFP---GAPGAKGEAGPTGARGPEGAQG 344

Query:   183 AYAATQS-GTPMRAAYDIPRG----PGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGP- 235
             +     + G+P  A      G    PG + S G PG   + AP +   +GP   P  GP 
Sbjct:   345 SRGEPGNPGSPGPAGASGNPGTDGIPGAKGSAGAPGI--AGAPGFPGPRGP---P--GPQ 397

Query:   236 GYDPTKGP-GYDAQKG-SNYDAQRGPNYDIH-RGPSYDPQRGLGYDMQRGPNYDMQ-RGP 291
             G     GP G   + G + +  ++GP  +    GP   P    G + +RG   +    GP
Sbjct:   398 GATGPLGPKGQTGEPGIAGFKGEQGPKGETGPAGPQGAPGPA-GEEGKRGARGEPGGAGP 456

Query:   292 -GYETQR-VPGYDVQRG-PVYEAQRAPSYIP-QRGP-GYDLQRGQGYDMRRAPSYDPSRG 346
              G   +R  PG    RG P  +    P   P +RGP G    +G   D  R         
Sbjct:   457 IGPPGERGAPG---NRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEPGLPGA 513

Query:   347 TGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP 385
              G  G P  A P G+V P       G   PP   G+ GQP
Sbjct:   514 RGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQP 553

 Score = 125 (49.1 bits), Expect = 0.00036, P = 0.00036
 Identities = 87/281 (30%), Positives = 99/281 (35%)

Query:   126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 183
             ADG  G      E    G   G    +   G P   GP       G  GA GP  +T   
Sbjct:   795 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 853

Query:   184 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 239
              AA + G P       P GP      GP G D  K    D    G + DP  +GP   P 
Sbjct:   854 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 908

Query:   240 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 295
              KG PG D   GS  D   GP     +G +   QRG+ G   QRG   +    GP  E  
Sbjct:   909 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 959

Query:   296 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 351
              Q  PG    RGP           P   PG +   G      R  A      RG TG  G
Sbjct:   960 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 1019

Query:   352 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 391
             AP    P G  P P    P G       +G+  P G   PA
Sbjct:  1020 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1057


>UNIPROTKB|F1M8G1 [details] [associations]
            symbol:Col2a1 "Collagen alpha-1(II) chain" species:10116
            "Rattus norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 RGD:2375 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 IPI:IPI00475975
            Ensembl:ENSRNOT00000050833 ArrayExpress:F1M8G1 Uniprot:F1M8G1
        Length = 1458

 Score = 128 (50.1 bits), Expect = 0.00017, P = 0.00017
 Identities = 81/280 (28%), Positives = 105/280 (37%)

Query:   124 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS 182
             R       GA GN        P G      G G P   G P +   AG  GA GP  +  
Sbjct:   305 RTGPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFP---GAPGAKGEAGPTGARGPEGAQG 361

Query:   183 AYAATQS-GTPMRAAYDIPRG----PGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGP- 235
             +     + G+P  A      G    PG + S G PG   + AP +   +GP   P  GP 
Sbjct:   362 SRGEPGNPGSPGPAGASGNPGTDGIPGAKGSAGAPGI--AGAPGFPGPRGP---P--GPQ 414

Query:   236 GYDPTKGP-GYDAQKG-SNYDAQRGPNYDIH-RGPSYDPQRGLGYDMQRGPNYDMQ-RGP 291
             G     GP G   + G + +  ++GP  +    GP   P    G + +RG   +    GP
Sbjct:   415 GATGPLGPKGQTGEPGIAGFKGEQGPKGETGPAGPQGAPGPA-GEEGKRGARGEPGGAGP 473

Query:   292 -GYETQR-VPGYDVQRG-PVYEAQRAPSYIP-QRGP-GYDLQRGQGYDMRRAPSYDPSRG 346
              G   +R  PG    RG P  +    P   P +RGP G    +G   D  R         
Sbjct:   474 IGPPGERGAPG---NRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEPGLPGA 530

Query:   347 TGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP 385
              G  G P  A P G+V P       G   PP   G+ GQP
Sbjct:   531 RGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQP 570

 Score = 125 (49.1 bits), Expect = 0.00036, P = 0.00036
 Identities = 87/281 (30%), Positives = 99/281 (35%)

Query:   126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 183
             ADG  G      E    G   G    +   G P   GP       G  GA GP  +T   
Sbjct:   812 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 870

Query:   184 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 239
              AA + G P       P GP      GP G D  K    D    G + DP  +GP   P 
Sbjct:   871 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 925

Query:   240 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 295
              KG PG D   GS  D   GP     +G +   QRG+ G   QRG   +    GP  E  
Sbjct:   926 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 976

Query:   296 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 351
              Q  PG    RGP           P   PG +   G      R  A      RG TG  G
Sbjct:   977 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 1036

Query:   352 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 391
             AP    P G  P P    P G       +G+  P G   PA
Sbjct:  1037 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1074

 Score = 123 (48.4 bits), Expect = 0.00060, P = 0.00060
 Identities = 87/286 (30%), Positives = 109/286 (38%)

Query:   127 DGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAA 186
             DG+ G    + E  T G P G        G P G G    A  A + G     +  A   
Sbjct:   113 DGAPGPRGRDGEPGTPGNP-GPPGPPGPPG-PPGLGGGNFA--AQMAGGFDEKAGGAQMG 168

Query:   187 TQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAKGPGYDPTKGP 243
                G PM      PRGP G   + GP G+  +     +P   GP   P   PG  P   P
Sbjct:   169 VMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GPRGPPG--PAGKP 222

Query:   244 GYDAQKGSNYDA-QRG-PNYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQRG----PGYET 295
             G D + G    A +RG P     RG    P  GL G    RG P  D  +G    PG + 
Sbjct:   223 GDDGEAGKPGKAGERGLPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAKGEAGAPGVKG 280

Query:   296 QR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS-----YDPSRGTGF 349
             +   PG +   GP+   +  P    + GP       +G D +  P+       P+ G GF
Sbjct:   281 ESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPGPVGPAGGPGF 338

Query:   350 DGAP--RG-AAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNP 390
              GAP  +G A P G   P       GS   P   GS  P G  GNP
Sbjct:   339 PGAPGAKGEAGPTGARGP---EGAQGSRGEPGNPGSPGPAGASGNP 381


>UNIPROTKB|Q9XSK0 [details] [associations]
            symbol:CRX "Cone-rod homeobox protein" species:9913 "Bos
            taurus" [GO:0060041 "retina development in camera-type eye"
            evidence=IEA] [GO:0045944 "positive regulation of transcription
            from RNA polymerase II promoter" evidence=IEA] [GO:0043522 "leucine
            zipper domain binding" evidence=IEA] [GO:0005667 "transcription
            factor complex" evidence=IEA] [GO:0003700 "sequence-specific DNA
            binding transcription factor activity" evidence=IEA] [GO:0003682
            "chromatin binding" evidence=IEA] [GO:0006351 "transcription,
            DNA-dependent" evidence=IEA] [GO:0043565 "sequence-specific DNA
            binding" evidence=IEA] InterPro:IPR001356 InterPro:IPR009057
            InterPro:IPR013851 InterPro:IPR017970 Pfam:PF00046 Pfam:PF03529
            PROSITE:PS00027 PROSITE:PS50071 SMART:SM00389 GO:GO:0043565
            GO:GO:0045944 GO:GO:0003700 GO:GO:0006351 GO:GO:0003682
            Gene3D:1.10.10.60 SUPFAM:SSF46689 GO:GO:0005667 GO:GO:0060041
            EMBL:AF154123 IPI:IPI00695402 RefSeq:NP_776329.1 UniGene:Bt.283
            ProteinModelPortal:Q9XSK0 SMR:Q9XSK0 STRING:Q9XSK0 PRIDE:Q9XSK0
            Ensembl:ENSBTAT00000028232 GeneID:280756 KEGG:bta:280756 CTD:1406
            eggNOG:NOG324074 GeneTree:ENSGT00700000104128 HOGENOM:HOG000082677
            HOVERGEN:HBG004028 InParanoid:Q9XSK0 KO:K09337 OMA:QTKARPA
            OrthoDB:EOG4NKBWG NextBio:20804923 Uniprot:Q9XSK0
        Length = 299

 Score = 119 (46.9 bits), Expect = 0.00017, P = 0.00017
 Identities = 29/96 (30%), Positives = 42/96 (43%)

Query:   158 PQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASK 217
             P    P P A  AG+V +GP+ +++ YA T +  P  A    P   G  +S   G D   
Sbjct:   165 PASESPLPEAQRAGLVASGPSLTSAPYAMTYA--PASAFCSSPSAYGSPSSYFSGLDPYL 222

Query:   218 APSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNY 253
             +P   P  GP+  P  GP   P+      +  G +Y
Sbjct:   223 SPMVPPLGGPALSPLSGPSVGPSLTQSPTSLSGQSY 258


>UNIPROTKB|J9P2F0 [details] [associations]
            symbol:ZNF768 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0005622 "intracellular" evidence=IEA] [GO:0003676 "nucleic acid
            binding" evidence=IEA] Pfam:PF00096 InterPro:IPR007087
            InterPro:IPR013087 InterPro:IPR015880 PROSITE:PS00028
            PROSITE:PS50157 SMART:SM00355 GO:GO:0008270 GO:GO:0003676
            GO:GO:0005622 Gene3D:3.30.160.60 GeneTree:ENSGT00700000104520
            EMBL:AAEX03004391 Ensembl:ENSCAFT00000043076 Uniprot:J9P2F0
        Length = 540

 Score = 123 (48.4 bits), Expect = 0.00017, P = 0.00017
 Identities = 39/146 (26%), Positives = 66/146 (45%)

Query:   127 DGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAA 186
             +GS  G    +E E   +  G   YE    +P   G  P +        G  + +  +  
Sbjct:    25 EGSLKGNMSENEEEEMSQQEGTGDYEVEE-IP--FGLDPQSPGFEPQSPGFESQSPRFEP 81

Query:   187 TQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYD 246
                G   R+   +P  P + A + P  D S++P ++P + P Y+P + PGY+P + PGY+
Sbjct:    82 ESPGFESRSPGFVPPSPEF-APRSPDSD-SQSPEFEP-QSPRYEP-QSPGYEP-RSPGYE 136

Query:   247 AQKGSNYDAQRGPNYDIHRGPSYDPQ 272
               K   Y++Q  P Y+  + P +  Q
Sbjct:   137 P-KSPGYESQ-SPGYE-PQNPEFKTQ 159


>UNIPROTKB|F1PS24 [details] [associations]
            symbol:COL2A1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0071773 "cellular response to BMP stimulus"
            evidence=IEA] [GO:0071599 "otic vesicle development" evidence=IEA]
            [GO:0060351 "cartilage development involved in endochondral bone
            morphogenesis" evidence=IEA] [GO:0060272 "embryonic skeletal joint
            morphogenesis" evidence=IEA] [GO:0060174 "limb bud formation"
            evidence=IEA] [GO:0060021 "palate development" evidence=IEA]
            [GO:0048407 "platelet-derived growth factor binding" evidence=IEA]
            [GO:0043066 "negative regulation of apoptotic process"
            evidence=IEA] [GO:0042802 "identical protein binding" evidence=IEA]
            [GO:0042472 "inner ear morphogenesis" evidence=IEA] [GO:0030903
            "notochord development" evidence=IEA] [GO:0030199 "collagen fibril
            organization" evidence=IEA] [GO:0010468 "regulation of gene
            expression" evidence=IEA] [GO:0007605 "sensory perception of sound"
            evidence=IEA] [GO:0007601 "visual perception" evidence=IEA]
            [GO:0007417 "central nervous system development" evidence=IEA]
            [GO:0006029 "proteoglycan metabolic process" evidence=IEA]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005615 "extracellular
            space" evidence=IEA] [GO:0005604 "basement membrane" evidence=IEA]
            [GO:0005585 "collagen type II" evidence=IEA] [GO:0003007 "heart
            morphogenesis" evidence=IEA] [GO:0002062 "chondrocyte
            differentiation" evidence=IEA] [GO:0001958 "endochondral
            ossification" evidence=IEA] [GO:0001894 "tissue homeostasis"
            evidence=IEA] [GO:0001502 "cartilage condensation" evidence=IEA]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA] InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093
            Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184
            PROSITE:PS51461 SMART:SM00038 SMART:SM00214 GO:GO:0005737
            GO:GO:0043066 GO:GO:0005615 GO:GO:0003007 GO:GO:0007601
            GO:GO:0030199 GO:GO:0007417 GO:GO:0042472 GO:GO:0001894
            GO:GO:0007605 GO:GO:0071773 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0071599 GO:GO:0005604 GO:GO:0001502 GO:GO:0060021
            GO:GO:0002062 GO:GO:0010468 GO:GO:0060272 GO:GO:0006029
            GO:GO:0001958 GO:GO:0060351 GO:GO:0005201
            GeneTree:ENSGT00660000095287 GO:GO:0005585 GO:GO:0060174
            GO:GO:0030903 EMBL:AAEX03015088 EMBL:AAEX03015089
            Ensembl:ENSCAFT00000014414 OMA:CPICPTE Uniprot:F1PS24
        Length = 1489

 Score = 128 (50.1 bits), Expect = 0.00017, P = 0.00017
 Identities = 88/282 (31%), Positives = 102/282 (36%)

Query:   128 GSYGGATGNSENETSGRPVGQNAYEDGYGVP--QGH-GPPPSATTAGVVGAG--PNTSTS 182
             G  G A  N E    G P G        G P  +G  GPP  A  AG  GA   P     
Sbjct:   794 GPPGPAGANGEKGEVGPP-GPAGTAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGE 852

Query:   183 AYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAP--SYDPTKGPSYDPAKG---- 234
                A Q G    A    P+GP G    +GP G    K    +  P     +  A G    
Sbjct:   853 QGEAGQKGD---AGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGP 909

Query:   235 PGYDPTKGP-GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY 293
             PG +   GP G     G   D  +G      RG S  P R     +Q GP      GP  
Sbjct:   910 PGSNGNPGPPGPPGPSGK--DGPKGA-----RGDSGPPGRAGDPGLQ-GP-----AGPPG 956

Query:   294 ETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYDPSRGTGFD 350
             E    PG D   GP  +    P  +  QRG  G   QRG+ G+     PS +P +  G  
Sbjct:   957 EKGE-PGDDGPSGP--DGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGK-QGAP 1012

Query:   351 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 392
             GA     P G V PP    P G    P R GS     G P R
Sbjct:  1013 GASGDRGPPGPVGPPGLTGPSGE---PGREGS-PGADGPPGR 1050

 Score = 125 (49.1 bits), Expect = 0.00037, P = 0.00037
 Identities = 72/271 (26%), Positives = 92/271 (33%)

Query:   128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAA- 186
             G  G      +    G P G    +   G P   GPP      G  G G N +       
Sbjct:   130 GEQGPRGDRGDKGEKGAP-GPRGRDGEPGTPGNPGPPGPPGPPGPPGLGGNFAAQMAGGF 188

Query:   187 -TQSGTPMRAAYDIPRGP-GYEASKGPGYDASKAPSYDPTKGPSYDPA-KGPGYDPTKGP 243
               ++G         P GP G     GP   A     +    G   +P   GP   P   P
Sbjct:   189 DEKAGGAQMGVMQGPMGPMGPRGPPGPA-GAPGPQGFQGNPGEPGEPGVSGP-MGPRGPP 246

Query:   244 GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRG-PGYETQR---- 297
             G   + G + +A + P     RGP   PQ   G+    G P     RG PG +  +    
Sbjct:   247 GPPGKPGDDGEAGK-PGKSGERGPP-GPQGARGFPGTPGLPGVKGHRGYPGLDGAKGEAG 304

Query:   298 VPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGA 356
              PG   + G   E   +P  +  RG PG   +RG     R  P+   +   G DG P  A
Sbjct:   305 APGVKGESGSPGE-NGSPGPMGPRGLPG---ERG-----RTGPA-GAAGARGNDGQPGPA 354

Query:   357 APHGQVPPPLNNVPYGSATPPARSGSGQPRG 387
              P G V P     P     P A  G   P G
Sbjct:   355 GPPGPVSPA--GGPGFPGAPGASQGEAGPTG 383

 Score = 123 (48.4 bits), Expect = 0.00061, P = 0.00061
 Identities = 82/280 (29%), Positives = 105/280 (37%)

Query:   124 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS 182
             R       GA GN        P G  +   G G P G  P  S   AG  GA GP  +  
Sbjct:   335 RTGPAGAAGARGNDGQPGPAGPPGPVSPAGGPGFP-G-APGASQGEAGPTGARGPEGAQG 392

Query:   183 AYAATQS-GTPMRAAYDIPRG----PGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGP- 235
                   + G+P  A      G    PG + S G PG   + AP +   +GP   P  GP 
Sbjct:   393 PRGEPGTPGSPGPAGASGNPGTDGIPGAKGSAGAPGI--AGAPGFPGPRGP---P--GPQ 445

Query:   236 GYDPTKGP-GYDAQKG-SNYDAQRGPNYDIH-RGPSYDPQRGLGYDMQRGPNYDMQ-RGP 291
             G     GP G   + G + +  ++GP  +    GP   P    G + +RG   +    GP
Sbjct:   446 GATGPLGPKGQTGEPGIAGFKGEQGPKGEPGPAGPQGAPGPA-GEEGKRGARGEPGGAGP 504

Query:   292 -GYETQR-VPGYDVQRG-PVYEAQRAPSYIP-QRGP-GYDLQRGQGYDMRRAPSYDPSRG 346
              G   +R  PG    RG P  +    P   P +RGP G    +G   D  R         
Sbjct:   505 VGPPGERGAPG---NRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEPGLPGA 561

Query:   347 TGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP 385
              G  G P  A P G+V P       G   PP   G+ GQP
Sbjct:   562 RGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQP 601

 Score = 122 (48.0 bits), Expect = 0.00078, P = 0.00078
 Identities = 83/282 (29%), Positives = 106/282 (37%)

Query:   130 YGGATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNTSTSAYAATQ 188
             + GA G S+ E    P G    E   G P+G  G P S   AG  G  P T     A   
Sbjct:   368 FPGAPGASQGEAG--PTGARGPEGAQG-PRGEPGTPGSPGPAGASG-NPGTDGIPGAKGS 423

Query:   189 SGTPMRAA---YDIPRGP-GYEASKGP----GYDASKA-PSYDPTKGPSYDPAKGPGYDP 239
             +G P  A    +  PRGP G + + GP    G         +   +GP  +P  GP   P
Sbjct:   424 AGAPGIAGAPGFPGPRGPPGPQGATGPLGPKGQTGEPGIAGFKGEQGPKGEP--GPA-GP 480

Query:   240 TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDP-QRGLGYDMQRG-PNYDMQRGP-GYETQ 296
                PG   ++G    A+  P      GP   P +RG   +  RG P  D   GP G   +
Sbjct:   481 QGAPGPAGEEGKR-GARGEPG---GAGPVGPPGERGAPGN--RGFPGQDGLAGPKGAPGE 534

Query:   297 RVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
             R P      GP   A   P    + G PG     G+  D        PS   G DG P  
Sbjct:   535 RGPSG--LAGPK-GANGDPGRPGEPGLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGP 591

Query:   356 AAPHG-QVPPPLNNVP--YGSATPPARSGS-GQPRGGNPARR 393
               P G +  P +   P   G+   P ++G  G P  G P  R
Sbjct:   592 PGPQGARGQPGVMGFPGPKGANGEPGKAGEKGLP--GAPGLR 631


>RGD|1309595 [details] [associations]
            symbol:Taf15 "TAF15 RNA polymerase II, TATA box binding protein
            (TBP)-associated factor" species:10116 "Rattus norvegicus"
            [GO:0000166 "nucleotide binding" evidence=IEA] [GO:0003674
            "molecular_function" evidence=ND] [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0005622 "intracellular" evidence=IEA] [GO:0008150
            "biological_process" evidence=ND] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358
            PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360 SMART:SM00547
            RGD:1309595 GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330
            GO:GO:0003676 GO:GO:0005622 EMBL:AC119615 IPI:IPI00950713
            PRIDE:F1M8P1 Ensembl:ENSRNOT00000014438 ArrayExpress:F1M8P1
            Uniprot:F1M8P1
        Length = 554

 Score = 123 (48.4 bits), Expect = 0.00018, P = 0.00018
 Identities = 72/237 (30%), Positives = 86/237 (36%)

Query:   124 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSA 183
             RR +   GG +G       GR  G+  Y  G G  QG G  P       V   P+     
Sbjct:   318 RRPEFMRGGGSGG------GRR-GRGGYR-GRGGFQGRGGDPK--NGDWVCPNPSCGNMN 367

Query:   184 YAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP 243
             +A   S             P     +G GY   +   +    G   D   G G D + G 
Sbjct:   368 FARRNSCNQCNEPRPEDSRPSGGDFRGRGYGGERG--FRGRGGRGGDRG-GYGADRSGG- 423

Query:   244 GYDAQK-GSNYDAQR-GPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGY 301
             GY   + G +Y A R G  Y   R   Y   RG GY   RG +Y   RG GY   R  GY
Sbjct:   424 GYGGDRSGGSYGADRSGGGYGGDRS-GYGGDRG-GYGGDRGGSYGGDRG-GYGGDR-GGY 479

Query:   302 DVQRGPVYEAQRAPSYIPQRGPGYDLQRGQG---YDMRRAPSYDPSRGTGFDGAPRG 355
                RG  Y   R   Y   R   Y   RG G   Y   R+  Y   RG G+ G  RG
Sbjct:   480 GGDRGG-YGGDRG-GYGGDRRGAYGGDRGGGSGGYGGDRSGGYGGDRGGGY-GGDRG 533


>UNIPROTKB|F1SEN8 [details] [associations]
            symbol:LDB3 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0030018 "Z disc" evidence=IEA] [GO:0008092
            "cytoskeletal protein binding" evidence=IEA] [GO:0005856
            "cytoskeleton" evidence=IEA] [GO:0005080 "protein kinase C binding"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            Pfam:PF00595 Pfam:PF00412 InterPro:IPR001478 InterPro:IPR001781
            PROSITE:PS00478 PROSITE:PS50023 PROSITE:PS50106 SMART:SM00132
            SMART:SM00228 GO:GO:0030018 GO:GO:0005856 GO:GO:0046872
            GO:GO:0008270 Gene3D:2.10.110.10 SUPFAM:SSF50156 CTD:11155
            OMA:CTSQATT InterPro:IPR006643 SMART:SM00735
            GeneTree:ENSGT00700000104411 EMBL:CU468409 RefSeq:XP_003359314.1
            UniGene:Ssc.97236 Ensembl:ENSSSCT00000011341 GeneID:100151883
            KEGG:ssc:100151883 Uniprot:F1SEN8
        Length = 715

 Score = 124 (48.7 bits), Expect = 0.00020, P = 0.00020
 Identities = 50/192 (26%), Positives = 69/192 (35%)

Query:   133 ATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGP----NTSTSAYAATQ 188
             AT ++    S            Y       P P+A T     A P       T+A     
Sbjct:   344 ATASAAAPASSPADSPRPQASAYSPAVATSPAPAAHTYSEAPAAPAPKPRVVTTASIRPS 403

Query:   189 SGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQ 248
                P+ A+   P  PG   S  P Y  S AP+Y P+  P+Y P+  P Y P+  P Y+  
Sbjct:   404 VYQPVPASTYSP-SPGANYSPTP-YTPSPAPAYTPSPAPTYSPSPAPAYTPSPAPSYNPT 461

Query:   249 KGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVPGYDVQ--- 304
               S   A+           S+  +   G          + RG P Y T  + G  V    
Sbjct:   462 PYSGGPAESASRPPWVTDDSFSQKFAPGKSTTSISKQSLPRGAPAY-TPPLQGPQVSPLA 520

Query:   305 RGPVYEAQRAPS 316
             RG V  A+R P+
Sbjct:   521 RGTVQRAERFPA 532


>RGD|1311620 [details] [associations]
            symbol:Zmiz1 "zinc finger, MIZ-type containing 1" species:10116
            "Rattus norvegicus" [GO:0001570 "vasculogenesis" evidence=IEA;ISO]
            [GO:0001701 "in utero embryonic development" evidence=IEA;ISO]
            [GO:0003007 "heart morphogenesis" evidence=IEA;ISO] [GO:0007296
            "vitellogenesis" evidence=IEA;ISO] [GO:0007569 "cell aging"
            evidence=IEA;ISO] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0045944 "positive regulation of transcription from RNA
            polymerase II promoter" evidence=IEA;ISO] [GO:0048146 "positive
            regulation of fibroblast proliferation" evidence=IEA;ISO]
            [GO:0048589 "developmental growth" evidence=IEA;ISO] [GO:0048844
            "artery morphogenesis" evidence=IEA;ISO] InterPro:IPR004181
            Pfam:PF02891 PROSITE:PS51044 RGD:1311620 GO:GO:0008270
            Gene3D:3.30.40.10 InterPro:IPR013083 GeneTree:ENSGT00550000074410
            CTD:57178 OMA:MNQYGPM OrthoDB:EOG45MN70 EMBL:CH474067
            IPI:IPI00364462 RefSeq:NP_001101863.1 UniGene:Rn.1712
            Ensembl:ENSRNOT00000014004 GeneID:361103 KEGG:rno:361103
            UCSC:RGD:1311620 NextBio:675228 Uniprot:D4AE97
        Length = 1072

 Score = 126 (49.4 bits), Expect = 0.00020, P = 0.00020
 Identities = 66/233 (28%), Positives = 87/233 (37%)

Query:   176 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYD-ASKAPSYDPT--KGP--SYD 230
             GP  S+     TQ+          PRGP   AS G   + AS A    P+   GP    +
Sbjct:   318 GPVCSSFQMGPTQAYNSQFMNQPGPRGP---ASMGGSMNPASMAAGMTPSGMSGPPMGMN 374

Query:   231 PAKGPGYDP--TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ 288
               + PG  P  T G     Q       Q  P   I R    +P  G   + Q GPN    
Sbjct:   375 QPRPPGISPFGTHGQRMPQQTYPGPRPQSLPIQSIKRSYPGEPNYG---NQQYGPNSQFP 431

Query:   289 RGPG-YETQRVPGYDVQRGPVYEAQRAPSYIPQRG--PGYDLQRGQGYDMRRAPSYDP-- 343
               PG Y T   P       P Y  QR PS  P  G  P   +  GQ Y   +    +   
Sbjct:   432 TQPGQYPTPNPPR--PLTSPNYPGQRMPSQ-PSTGQYPPPTVNMGQYYKPEQFNGQNNTF 488

Query:   344 SRGTGFDGAPRGAAPHGQVPPPLNNVPY----GSATPPARSGSGQPRGGNPAR 392
             S G+ +    +G+      P P+ N P+    G+ TPP   GS  P   +P++
Sbjct:   489 SSGSSYSSYSQGSVNRPPRPVPVANYPHSPVPGNPTPPMTPGSSIPPYLSPSQ 541


>UNIPROTKB|F1NI79 [details] [associations]
            symbol:COL5A1 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 Gene3D:2.60.120.200 InterPro:IPR008985
            InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 InterPro:IPR001791 Pfam:PF02210 GO:GO:0005201
            SMART:SM00210 GeneTree:ENSGT00700000104155 EMBL:AADN02026433
            EMBL:AADN02026434 EMBL:AADN02026427 EMBL:AADN02026428
            EMBL:AADN02026429 EMBL:AADN02026430 EMBL:AADN02026431
            EMBL:AADN02026432 IPI:IPI00602965 Ensembl:ENSGALT00000004020
            ArrayExpress:F1NI79 Uniprot:F1NI79
        Length = 1702

 Score = 128 (50.1 bits), Expect = 0.00020, P = 0.00020
 Identities = 75/249 (30%), Positives = 96/249 (38%)

Query:   145 PVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRG- 202
             P+G    +   G P   GP  S    G  G AGP          Q G P  A     +G 
Sbjct:   930 PIGSPGLKGNEGPPGPPGPAGSPGERGPAGSAGP-IGLPGRPGPQ-GPPGPAG---EKGA 984

Query:   203 PGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRGP 259
             PG +  +GP G D  + P   P  GP+  P   PG D  KG  G   QKGS  D  ++GP
Sbjct:   985 PGEKGPQGPAGRDGIQGPVGLP--GPA-GPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGP 1041

Query:   260 NYDIHRGPSYDPQRGLGYDMQRGP-NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYI 318
                   GP+  PQ  +G   Q GP   D + GP  + Q + G     GP       P  +
Sbjct:  1042 P-----GPT-GPQGPIG---QPGPAGADGEPGPRGQ-QGLFGQKGDEGP-RGFPGPPGPV 1090

Query:   319 PQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPP 377
               +G PG   ++G+  D+ +     P    G  G P    P G      N    G    P
Sbjct:  1091 GLQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGPPGADGPQGPAGGIGNPGAVGEKGEP 1150

Query:   378 ARSGS-GQP 385
               SG  G P
Sbjct:  1151 GESGEPGLP 1159


>UNIPROTKB|E1BF96 [details] [associations]
            symbol:PPP1R10 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0072357 "PTW/PP1 phosphatase complex" evidence=IEA]
            [GO:0000785 "chromatin" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0003677 "DNA
            binding" evidence=IEA] InterPro:IPR000571 InterPro:IPR003617
            InterPro:IPR017923 Pfam:PF00642 Pfam:PF08711 PROSITE:PS50103
            PROSITE:PS51319 SMART:SM00356 SMART:SM00509 GO:GO:0005634
            GO:GO:0003677 GO:GO:0008270 GO:GO:0000785 GO:GO:0006351
            Gene3D:1.20.930.10 SUPFAM:SSF47676 GO:GO:0072357 CTD:5514
            OMA:PPPHEHR GeneTree:ENSGT00530000063820 EMBL:DAAA02055402
            IPI:IPI00698425 RefSeq:NP_001137335.1 UniGene:Bt.27784
            Ensembl:ENSBTAT00000009104 GeneID:510825 KEGG:bta:510825
            NextBio:20869636 Uniprot:E1BF96
        Length = 924

 Score = 125 (49.1 bits), Expect = 0.00021, P = 0.00021
 Identities = 71/271 (26%), Positives = 87/271 (32%)

Query:   128 GSYGGATGNSENETS-GRPV-GQNAYEDGYGVPQGH---GPPPSATTAGVVGAGPNTSTS 182
             G  GG  G        G P+ G +    G G P G    GPPP          GP     
Sbjct:   631 GGPGGPKGMQHFPPGPGGPMPGPHGGPGGPGGPVGPRLLGPPPPPRGGDPFWDGPGDP-- 688

Query:   183 AYAATQSGTPMRAAYDIPRGPG-YEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK 241
                    G PMR     P GPG Y   +G        P   P +G     + G   +   
Sbjct:   689 -----MRGGPMRGGPG-P-GPGPYHRGRGGRGGNEPPPPPPPFRGARGGRSGGGPPNGRG 741

Query:   242 GPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGY 301
             GPG     G  +    GP   ++ G  + P  G G  M  G  +    GPG       G+
Sbjct:   742 GPGGGMVGGGGHRPHEGPGGGMNSGSGHRPHEGPGSGM--GGGHRPHEGPGGSMGG--GH 797

Query:   302 DVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQ 361
                 GP         + P  GPG  +  G G+         P  G G  G P G  PH  
Sbjct:   798 RPHEGPGGGMGGGSGHRPHEGPGGGMGAGGGHR--------PHEGPGH-GGPHGHRPH-D 847

Query:   362 VPPPLNNVPYGSATPPARSGSGQPRGGNPAR 392
             VP    +   G      R   G   GG   R
Sbjct:   848 VPGHRGHDHRGPPPHEHRGHDGPGHGGGGHR 878

 Score = 121 (47.7 bits), Expect = 0.00058, P = 0.00058
 Identities = 49/192 (25%), Positives = 68/192 (35%)

Query:   132 GATGNSENETSGRPV-GQNAYEDGYGVPQGHGPPPSATTAGV-------VGAGPNTSTSA 183
             G  G +E      P  G      G G P G G P      G         G G N+ +  
Sbjct:   710 GGRGGNEPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGGGHRPHEGPGGGMNSGSGH 769

Query:   184 YAATQSGTPMRAAYDIPRGPG------YEASKGPGYDASKAPSYDPTKGPSYDPAKGPGY 237
                   G+ M   +    GPG      +   +GPG        + P +GP      G G+
Sbjct:   770 RPHEGPGSGMGGGHRPHEGPGGSMGGGHRPHEGPGGGMGGGSGHRPHEGPGGGMGAGGGH 829

Query:   238 DPTKGPGYDAQKGSN-YDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQ 296
              P +GPG+    G   +D      +D HRGP   P    G+D   GP +      G++  
Sbjct:   830 RPHEGPGHGGPHGHRPHDVPGHRGHD-HRGPP--PHEHRGHD---GPGHGGGGHRGHDGG 883

Query:   297 RVPGYDVQRGPV 308
                G D+   PV
Sbjct:   884 HSHGGDMSNRPV 895


>UNIPROTKB|F1NR01 [details] [associations]
            symbol:COL5A1 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 Gene3D:2.60.120.200 InterPro:IPR008985
            InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 InterPro:IPR001791 SMART:SM00282 Pfam:PF02210
            GO:GO:0005201 SMART:SM00210 GeneTree:ENSGT00700000104155
            EMBL:AADN02026433 EMBL:AADN02026434 EMBL:AADN02026427
            EMBL:AADN02026428 EMBL:AADN02026429 EMBL:AADN02026430
            EMBL:AADN02026431 EMBL:AADN02026432 IPI:IPI00822317
            Ensembl:ENSGALT00000039037 ArrayExpress:F1NR01 Uniprot:F1NR01
        Length = 1773

 Score = 128 (50.1 bits), Expect = 0.00021, P = 0.00021
 Identities = 75/249 (30%), Positives = 96/249 (38%)

Query:   145 PVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRG- 202
             P+G    +   G P   GP  S    G  G AGP          Q G P  A     +G 
Sbjct:  1001 PIGSPGLKGNEGPPGPPGPAGSPGERGPAGSAGP-IGLPGRPGPQ-GPPGPAG---EKGA 1055

Query:   203 PGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRGP 259
             PG +  +GP G D  + P   P  GP+  P   PG D  KG  G   QKGS  D  ++GP
Sbjct:  1056 PGEKGPQGPAGRDGIQGPVGLP--GPA-GPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGP 1112

Query:   260 NYDIHRGPSYDPQRGLGYDMQRGP-NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYI 318
                   GP+  PQ  +G   Q GP   D + GP  + Q + G     GP       P  +
Sbjct:  1113 P-----GPT-GPQGPIG---QPGPAGADGEPGPRGQ-QGLFGQKGDEGP-RGFPGPPGPV 1161

Query:   319 PQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPP 377
               +G PG   ++G+  D+ +     P    G  G P    P G      N    G    P
Sbjct:  1162 GLQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGPPGADGPQGPAGGIGNPGAVGEKGEP 1221

Query:   378 ARSGS-GQP 385
               SG  G P
Sbjct:  1222 GESGEPGLP 1230


>ZFIN|ZDB-GENE-030707-4 [details] [associations]
            symbol:anxa11a "annexin A11a" species:7955 "Danio
            rerio" [GO:0005509 "calcium ion binding" evidence=IEA] [GO:0005544
            "calcium-dependent phospholipid binding" evidence=IEA]
            InterPro:IPR001464 InterPro:IPR008157 InterPro:IPR018252
            InterPro:IPR018502 Pfam:PF00191 PRINTS:PR00196 PROSITE:PS00223
            SMART:SM00335 ZFIN:ZDB-GENE-030707-4 GO:GO:0005509 eggNOG:NOG267770
            GO:GO:0005544 Gene3D:1.10.220.10 PANTHER:PTHR10502 SUPFAM:SSF47874
            HOVERGEN:HBG061815 PANTHER:PTHR10502:SF29 HSSP:P79134 EMBL:AY178801
            IPI:IPI00498021 UniGene:Dr.77310 ProteinModelPortal:Q804G4
            SMR:Q804G4 PRIDE:Q804G4 InParanoid:Q804G4 NextBio:20812811
            ArrayExpress:Q804G4 Bgee:Q804G4 Uniprot:Q804G4
        Length = 526

 Score = 122 (48.0 bits), Expect = 0.00021, P = 0.00021
 Identities = 58/201 (28%), Positives = 73/201 (36%)

Query:   190 GTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQK 249
             G P ++ Y  P+G GY     PG     A  Y P  G  Y P  G GY P  G  Y  Q 
Sbjct:     5 GYPPQSGYP-PQGGGYPPQ--PGAYPPAAGGYPPQPG-MYPPQAG-GYPPQPG-AYPPQP 58

Query:   250 GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVY 309
             G+ +  Q G    +  G    P   +G D    P ++     G   Q          P  
Sbjct:    59 GA-FPGQPGQYPSVPSGGWGAP---IGLDNLPNPGFNASNIQGMANQFAADGGFAPNPSM 114

Query:   310 EAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNV 369
              +   P   PQ G    +   Q Y M   P     +  G  G P G  P GQ  P   N+
Sbjct:   115 FSGGYPG--PQPGGPPAVSPNQPYGMYPQPGGGMPQNPGM-GYP-GGPPPGQQMPSYPNI 170

Query:   370 PYGSATPPARSGSGQPRGGNP 390
             P  + TP   SG   PR  +P
Sbjct:   171 P--APTP---SGPSYPRAPSP 186

 Score = 116 (45.9 bits), Expect = 0.00098, P = 0.00098
 Identities = 63/215 (29%), Positives = 77/215 (35%)

Query:   156 GVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDA 215
             G P   G PP        G G      AY     G P +     P+  GY    G     
Sbjct:     5 GYPPQSGYPPQ-------GGGYPPQPGAYPPAAGGYPPQPGMYPPQAGGYPPQPGAYPPQ 57

Query:   216 SKAPSYDPTKGPSYDPAKG---P-GYDPTKGPGYDAQK----GSNYDAQRG--PNYDIHR 265
               A    P + PS  P+ G   P G D    PG++A       + + A  G  PN  +  
Sbjct:    58 PGAFPGQPGQYPSV-PSGGWGAPIGLDNLPNPGFNASNIQGMANQFAADGGFAPNPSMFS 116

Query:   266 GPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSY--IPQRGP 323
             G    PQ G    +     Y M   PG    + PG     GP    Q+ PSY  IP   P
Sbjct:   117 GGYPGPQPGGPPAVSPNQPYGMYPQPGGGMPQNPGMGYPGGPP-PGQQMPSYPNIPAPTP 175

Query:   324 GYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAP 358
                   G  Y   RAPS +PS   G+ G   G AP
Sbjct:   176 S-----GPSYP--RAPSPNPSM-PGYGGGYGGGAP 202


>UNIPROTKB|F1NR03 [details] [associations]
            symbol:COL5A1 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 Gene3D:2.60.120.200 InterPro:IPR008985
            InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 InterPro:IPR001791 SMART:SM00282 Pfam:PF02210
            GO:GO:0005201 SMART:SM00210 GeneTree:ENSGT00700000104155
            EMBL:AADN02026433 EMBL:AADN02026434 EMBL:AADN02026427
            EMBL:AADN02026428 EMBL:AADN02026429 EMBL:AADN02026430
            EMBL:AADN02026431 EMBL:AADN02026432 IPI:IPI00818113
            Ensembl:ENSGALT00000039034 ArrayExpress:F1NR03 Uniprot:F1NR03
        Length = 1804

 Score = 128 (50.1 bits), Expect = 0.00022, P = 0.00022
 Identities = 75/249 (30%), Positives = 96/249 (38%)

Query:   145 PVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRG- 202
             P+G    +   G P   GP  S    G  G AGP          Q G P  A     +G 
Sbjct:  1032 PIGSPGLKGNEGPPGPPGPAGSPGERGPAGSAGP-IGLPGRPGPQ-GPPGPAG---EKGA 1086

Query:   203 PGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRGP 259
             PG +  +GP G D  + P   P  GP+  P   PG D  KG  G   QKGS  D  ++GP
Sbjct:  1087 PGEKGPQGPAGRDGIQGPVGLP--GPA-GPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGP 1143

Query:   260 NYDIHRGPSYDPQRGLGYDMQRGP-NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYI 318
                   GP+  PQ  +G   Q GP   D + GP  + Q + G     GP       P  +
Sbjct:  1144 P-----GPT-GPQGPIG---QPGPAGADGEPGPRGQ-QGLFGQKGDEGP-RGFPGPPGPV 1192

Query:   319 PQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPP 377
               +G PG   ++G+  D+ +     P    G  G P    P G      N    G    P
Sbjct:  1193 GLQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGPPGADGPQGPAGGIGNPGAVGEKGEP 1252

Query:   378 ARSGS-GQP 385
               SG  G P
Sbjct:  1253 GESGEPGLP 1261


>UNIPROTKB|F1NR02 [details] [associations]
            symbol:COL5A1 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0001568 "blood vessel development"
            evidence=IEA] [GO:0003007 "heart morphogenesis" evidence=IEA]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA] [GO:0005588 "collagen type V" evidence=IEA]
            [GO:0005604 "basement membrane" evidence=IEA] [GO:0007155 "cell
            adhesion" evidence=IEA] [GO:0008201 "heparin binding" evidence=IEA]
            [GO:0030199 "collagen fibril organization" evidence=IEA]
            [GO:0032964 "collagen biosynthetic process" evidence=IEA]
            [GO:0035313 "wound healing, spreading of epidermal cells"
            evidence=IEA] [GO:0043206 "extracellular fibril organization"
            evidence=IEA] [GO:0043394 "proteoglycan binding" evidence=IEA]
            [GO:0043588 "skin development" evidence=IEA] [GO:0045112 "integrin
            biosynthetic process" evidence=IEA] [GO:0048407 "platelet-derived
            growth factor binding" evidence=IEA] [GO:0048592 "eye
            morphogenesis" evidence=IEA] [GO:0051128 "regulation of cellular
            component organization" evidence=IEA] InterPro:IPR000885
            Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461 SMART:SM00038
            GO:GO:0030199 GO:GO:0008201 GO:GO:0007155 Gene3D:2.60.120.200
            InterPro:IPR008985 InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0035313
            InterPro:IPR008160 Pfam:PF01391 InterPro:IPR001791 SMART:SM00282
            GO:GO:0005604 GO:GO:0043206 Pfam:PF02210 GO:GO:0005201 OMA:TIYEGIG
            GO:GO:0005588 GO:GO:0045112 GO:GO:0051128 SMART:SM00210
            GeneTree:ENSGT00700000104155 EMBL:AADN02026433 EMBL:AADN02026434
            EMBL:AADN02026427 EMBL:AADN02026428 EMBL:AADN02026429
            EMBL:AADN02026430 EMBL:AADN02026431 EMBL:AADN02026432
            IPI:IPI00821684 Ensembl:ENSGALT00000039035 ArrayExpress:F1NR02
            Uniprot:F1NR02
        Length = 1815

 Score = 128 (50.1 bits), Expect = 0.00022, P = 0.00022
 Identities = 75/249 (30%), Positives = 96/249 (38%)

Query:   145 PVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRG- 202
             P+G    +   G P   GP  S    G  G AGP          Q G P  A     +G 
Sbjct:  1043 PIGSPGLKGNEGPPGPPGPAGSPGERGPAGSAGP-IGLPGRPGPQ-GPPGPAG---EKGA 1097

Query:   203 PGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRGP 259
             PG +  +GP G D  + P   P  GP+  P   PG D  KG  G   QKGS  D  ++GP
Sbjct:  1098 PGEKGPQGPAGRDGIQGPVGLP--GPA-GPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGP 1154

Query:   260 NYDIHRGPSYDPQRGLGYDMQRGP-NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYI 318
                   GP+  PQ  +G   Q GP   D + GP  + Q + G     GP       P  +
Sbjct:  1155 P-----GPT-GPQGPIG---QPGPAGADGEPGPRGQ-QGLFGQKGDEGP-RGFPGPPGPV 1203

Query:   319 PQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPP 377
               +G PG   ++G+  D+ +     P    G  G P    P G      N    G    P
Sbjct:  1204 GLQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGPPGADGPQGPAGGIGNPGAVGEKGEP 1263

Query:   378 ARSGS-GQP 385
               SG  G P
Sbjct:  1264 GESGEPGLP 1272


>UNIPROTKB|E9PQW6 [details] [associations]
            symbol:ARID1A "AT-rich interactive domain-containing
            protein 1A" species:9606 "Homo sapiens" [GO:0006325 "chromatin
            organization" evidence=IEA] [GO:0016514 "SWI/SNF complex"
            evidence=IEA] [GO:0071564 "npBAF complex" evidence=IEA] [GO:0071565
            "nBAF complex" evidence=IEA] EMBL:AL034380 GO:GO:0016514
            EMBL:AL512408 HGNC:HGNC:11110 ChiTaRS:ARID1A GO:GO:0006325
            IPI:IPI00979164 Ensembl:ENST00000524572 ArrayExpress:E9PQW6
            Bgee:E9PQW6 Uniprot:E9PQW6
        Length = 123

 Score = 98 (39.6 bits), Expect = 0.00024, P = 0.00024
 Identities = 36/108 (33%), Positives = 47/108 (43%)

Query:   229 YDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ 288
             Y   +GP   P +G GY  Q   +   QR P     +G +     GL Y  Q  P Y  Q
Sbjct:    18 YSQQQGPPSGPQQGHGYPGQPYGSQTPQRYPM--TMQGRAQSAMGGLSYTQQIPP-YG-Q 73

Query:   289 RGP-GYETQ-RVPGYDVQ------RGPVYEAQRAPSYIPQRGPGYDLQ 328
             +GP GY  Q + P Y+ Q      + P Y +Q+ PS  P   P Y  Q
Sbjct:    74 QGPSGYGQQGQTPYYNQQSPHPQQQQPPY-SQQPPSQTPHAQPSYQQQ 120


>ZFIN|ZDB-GENE-030707-5 [details] [associations]
            symbol:anxa11b "annexin A11b" species:7955 "Danio
            rerio" [GO:0005509 "calcium ion binding" evidence=IEA] [GO:0005544
            "calcium-dependent phospholipid binding" evidence=IEA]
            InterPro:IPR001464 InterPro:IPR008157 InterPro:IPR018252
            InterPro:IPR018502 Pfam:PF00191 PRINTS:PR00196 PROSITE:PS00223
            SMART:SM00335 ZFIN:ZDB-GENE-030707-5 GO:GO:0005509 eggNOG:NOG267770
            GO:GO:0005544 Gene3D:1.10.220.10 PANTHER:PTHR10502 SUPFAM:SSF47874
            HOGENOM:HOG000158803 HOVERGEN:HBG061815 PANTHER:PTHR10502:SF29
            OrthoDB:EOG4Z0B60 InterPro:IPR013286 PRINTS:PR01871 HSSP:P79134
            EMBL:BC068366 EMBL:AY178802 IPI:IPI00484212 RefSeq:NP_861431.1
            UniGene:Dr.76267 SMR:Q804G3 STRING:Q804G3 GeneID:353365
            KEGG:dre:353365 CTD:353365 NextBio:20812741 Uniprot:Q804G3
        Length = 485

 Score = 121 (47.7 bits), Expect = 0.00024, P = 0.00024
 Identities = 59/175 (33%), Positives = 71/175 (40%)

Query:   219 PSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYD 278
             P Y P  G SY PA GP   P  G  Y  Q G+ Y  Q G  Y    G ++ PQ G  + 
Sbjct:     4 PGYPPAGG-SYPPASGPYQQPAAG--YPPQPGA-YPPQAG-YYPPQPG-AFPPQPG-AFP 56

Query:   279 MQRG--P---NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSY--IPQRG-----PGYD 326
              Q G  P    Y  Q G GY      G+  Q G  Y A +  +Y  +P  G     PG+ 
Sbjct:    57 PQPGAFPPGAGYPPQAG-GYPAAPGGGFPPQAGG-YPAAQPGAYPNMPAAGGWGGHPGFG 114

Query:   327 LQRG---QGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPA 378
                G   QGY    AP   P     + GAP    P+  +P      P G  TPPA
Sbjct:   115 APAGGMPQGYPGVPAPGQQPM--PAYPGAP---VPNPGMPGYGGGAPTGP-TPPA 163


>UNIPROTKB|P02812 [details] [associations]
            symbol:PRB2 "Basic salivary proline-rich protein 2"
            species:9606 "Homo sapiens" [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0003674 "molecular_function" evidence=ND]
            [GO:0005575 "cellular_component" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] GO:GO:0005576 EMBL:AC078950
            EMBL:BX484538 EMBL:S80905 EMBL:K03208 IPI:IPI00552432 PIR:B40750
            PIR:E25372 UniGene:Hs.654486 STRING:P02812 DMDM:160409933
            PaxDb:P02812 PRIDE:P02812 Ensembl:ENST00000389362 UCSC:uc010shk.1
            GeneCards:GC12M011544 HGNC:HGNC:9338 MIM:168810 neXtProt:NX_P02812
            ArrayExpress:P02812 Bgee:P02812 CleanEx:HS_PRB2
            Genevestigator:P02812 GermOnline:ENSG00000173342 InterPro:IPR026086
            PANTHER:PTHR23203 Uniprot:P02812
        Length = 416

 Score = 120 (47.3 bits), Expect = 0.00025, P = 0.00025
 Identities = 69/257 (26%), Positives = 88/257 (34%)

Query:   142 SGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYA--ATQSGTPMRAAYDI 199
             +G P  Q A   G   PQG  P P     G    G N             G P +   + 
Sbjct:    33 AGNP--QGAPPQGGNKPQGP-PSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPPQGG-NK 88

Query:   200 PRGP---GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQ 256
             P+GP   G      P  D S++P   P K P   P +G G  P +GP     K      Q
Sbjct:    89 PQGPPPPGKPQGPPPQGDKSRSPRSPPGK-PQGPPPQG-GNQP-QGPPPPPGKPQGPPPQ 145

Query:   257 RGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPS 316
              G      +GP   P +  G   Q        R P  + Q  P    Q G   +    P 
Sbjct:   146 GGNK---PQGPP-PPGKPQGPPPQGDNKSRSSRSPPGKPQGPPP---QGGNQPQGPPPPP 198

Query:   317 YIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLN-NVPYGSAT 375
               PQ  P     + QG      P   P +G     + R      Q PPP   N P G   
Sbjct:   199 GKPQGPPPQGGNKPQGPPPPGKPQGPPPQGDNKSQSARSPPGKPQGPPPQGGNQPQGPPP 258

Query:   376 PPARSGSGQPRGGNPAR 392
             PP +     P+GGN ++
Sbjct:   259 PPGKPQGPPPQGGNKSQ 275

 Score = 118 (46.6 bits), Expect = 0.00041, P = 0.00041
 Identities = 76/272 (27%), Positives = 99/272 (36%)

Query:   135 GNSENETSGRPVG--QNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTP 192
             G++++ +S  P G  Q     G   PQG  PPP        G  P            G P
Sbjct:   166 GDNKSRSSRSPPGKPQGPPPQGGNQPQGPPPPPGKPQ----GPPPQGGNKPQGPPPPGKP 221

Query:   193 MRAAYDIPRGPGY-EASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGS 251
                    P+G    ++++ P     K P   P +G +  P +GP   P K  G   Q G+
Sbjct:   222 QGPP---PQGDNKSQSARSP---PGK-PQGPPPQGGN-QP-QGPPPPPGKPQGPPPQGGN 272

Query:   252 NYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP---GYETQRVPGYDVQ-RGP 307
                +Q  P     +GP   PQ G      R P    Q  P   G + Q  P    + +GP
Sbjct:   273 K--SQGPPPPGKPQGPP--PQGGSKSRSSRSPPGKPQGPPPQGGNQPQGPPPPPGKPQGP 328

Query:   308 VYEAQRAPSYIPQRG-P-GYDLQRGQGYDMRRAPSYDPSRGTGFDGAPR--GAAPHGQVP 363
               +    P   P  G P G   Q G      R+P   P       G P+  G  P G  P
Sbjct:   329 PPQGGNKPQGPPPPGKPQGPPPQGGSKSRSARSPPGKPQ------GPPQQEGNNPQGP-P 381

Query:   364 PPLNNVPYGSATPPARSGSGQPR---GGNPAR 392
             PP    P     PPA    G PR   GG P+R
Sbjct:   382 PPAGGNPQQPQAPPAGQPQGPPRPPQGGRPSR 413


>DICTYBASE|DDB_G0279193 [details] [associations]
            symbol:rpb1 "RNA polymerase II core subunit"
            species:44689 "Dictyostelium discoideum" [GO:0005634 "nucleus"
            evidence=IEA;IDA] [GO:0006366 "transcription from RNA polymerase II
            promoter" evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0005665 "DNA-directed RNA polymerase II, core
            complex" evidence=IEA;ISS] [GO:0003899 "DNA-directed RNA polymerase
            activity" evidence=IEA;ISS] [GO:0003677 "DNA binding" evidence=IEA]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=ISS] [GO:0046872 "metal ion binding" evidence=IEA]
            [GO:0016779 "nucleotidyltransferase activity" evidence=IEA]
            [GO:0016740 "transferase activity" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000684
            InterPro:IPR000722 InterPro:IPR006592 InterPro:IPR007066
            InterPro:IPR007073 InterPro:IPR007075 InterPro:IPR007080
            InterPro:IPR007081 InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983
            Pfam:PF04990 Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000
            Pfam:PF05001 PROSITE:PS00115 SMART:SM00663 dictyBase:DDB_G0279193
            GO:GO:0006355 GenomeReviews:CM000152_GR GO:GO:0046872 GO:GO:0003677
            GO:GO:0006366 Gene3D:2.40.40.20 InterPro:IPR009010
            EMBL:AAFI02000030 GO:GO:0003899 eggNOG:COG0086 GO:GO:0005665
            OMA:KVLPWST EMBL:S52651 PIR:A56823 RefSeq:XP_641735.1 STRING:P35084
            PRIDE:P35084 EnsemblProtists:DDB0215406 GeneID:8621932
            KEGG:ddi:DDB_G0279193 KO:K03006 ProtClustDB:CLSZ2428993
            Uniprot:P35084
        Length = 1727

 Score = 135 (52.6 bits), Expect = 0.00025, Sum P(2) = 0.00025
 Identities = 65/219 (29%), Positives = 85/219 (38%)

Query:   177 PNTSTSAYA-ATQSGTPMRAAYDIPRGPGYEASKG---------PGYDASKA--PSYDP- 223
             P + T +Y+    S TP    YD P  P  E  +G         PGY+A+K+   SY   
Sbjct:  1488 PGSQTPSYSYGDGSTTPFHNPYDAPLSPFNETFRGDFSPSAMNSPGYNANKSYGSSYQYF 1547

Query:   224 TKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGP 283
              + P+Y P   P Y PT  P Y     S Y +   P+Y     PSY P     Y     P
Sbjct:  1548 PQSPTYSPTS-PSYSPTS-PSYSPTSPS-Y-SPTSPSYS-PTSPSYSPTSP-SYS-PTSP 1600

Query:   284 NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDP 343
              Y     P Y     P Y     P Y +  +PSY P   P Y       Y    +PSY P
Sbjct:  1601 FYS-PTSPSYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSP 1653

Query:   344 SRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS 382
             +  +    +P   +P      P +  P  S T P+ S S
Sbjct:  1654 TSPSYSPTSP-SYSPTSPSYSPTS--PSYSPTSPSYSPS 1689

 Score = 40 (19.1 bits), Expect = 0.00025, Sum P(2) = 0.00025
 Identities = 12/43 (27%), Positives = 20/43 (46%)

Query:    85 KKFYNDHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRRAD 127
             +K +N  ++  +V + N   +  E+EKL A L      D   D
Sbjct:   978 QKLFN--IDIRRVSDLNPAVVVLEIEKLVARLKIIATADTTED 1018


>UNIPROTKB|F1Q0F7 [details] [associations]
            symbol:COL4A5 "Collagen alpha-5(IV) chain" species:9615
            "Canis lupus familiaris" [GO:0005581 "collagen" evidence=IEA]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA] InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403
            SMART:SM00111 InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 Gene3D:2.170.240.10
            GeneTree:ENSGT00690000101772 EMBL:AAEX03026757 EMBL:AAEX03026761
            EMBL:AAEX03026758 EMBL:AAEX03026759 EMBL:AAEX03026760
            Ensembl:ENSCAFT00000018078 Uniprot:F1Q0F7
        Length = 1678

 Score = 127 (49.8 bits), Expect = 0.00026, P = 0.00026
 Identities = 59/197 (29%), Positives = 72/197 (36%)

Query:   200 PRGPGYEASKGP--GYDASKAPSYDPTK-G-PSYDPAKG-PGYDPTKG-PGYDAQKGSNY 253
             P  PG     GP  G    K    +P K G P  D   G PG     G PGY  + G   
Sbjct:   269 PGPPGIRGPPGPPGGMKGEKGEQGEPGKRGKPGKDGENGQPGIPGLPGDPGYPGEPGR-- 326

Query:   254 DAQRGPNYDI-HRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQR-VPGYDVQRG-PVYE 310
             D ++G   DI   GP    + G G  +    N  +   PG + +R  PG     G P   
Sbjct:   327 DGEKGQKGDIGSTGPPGLSKPGTGVTVGEKGNMGLPGLPGEKGERGFPGIQGPPGLPGPP 386

Query:   311 AQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVP 370
                     P   PG+  +RGQ  D    P        G DG P      G   PP    P
Sbjct:   387 VLGTAVMGPPGPPGFPGERGQKGD-EGPPGISIPGFPGLDGQPGAPGLRGPPGPP---GP 442

Query:   371 YGSATPPARSGSGQPRG 387
             + S +PP   GS   RG
Sbjct:   443 HISPSPPGPPGSPGDRG 459

 Score = 122 (48.0 bits), Expect = 0.00090, P = 0.00090
 Identities = 80/270 (29%), Positives = 98/270 (36%)

Query:   132 GATG-NSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTSAYAATQS 189
             G  G N      G P G+     G G P   GPP      G  G  GP        A Q 
Sbjct:  1131 GPKGINGPPGNPGLP-GEPGPVGGGGRPGPPGPPGEKGNPGQDGIPGP--------AGQK 1181

Query:   190 GTPMRAAYDIPRGPGYEASKGPGYDASKA--PSYDPTKGPSYDPAKGPGYDPTKGPGYDA 247
             G P +  + IP  PG     G   D      P      GP  +P    G+   +GP    
Sbjct:  1182 GEPGQPGFGIPGPPGLPGLSGQKGDGGLPGIPGNPGLPGPKGEPGF-QGFPGVQGP--PG 1238

Query:   248 QKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRG-PG---YETQRV-PGY 301
               GS   A  GP  +   GP   P R + Y  QRG P  +  RG PG    + +R  PG 
Sbjct:  1239 PPGSPGPALEGPKGN--PGPQGPPGRPV-YTFQRGLPGPEGPRGLPGNGGIKGERGNPGQ 1295

Query:   302 DVQRG-PVYEAQRAPSYIPQRGPGYDLQRGQGYD--MRRAPSYDPSRGTGFDGAPRGAAP 358
               Q G P  +  + P  I Q  PG     G   D  +   P +   +G    G P  A P
Sbjct:  1296 PGQPGLPGLKGDQGPPGI-QGNPGRPGLNGMKGDPGLPGVPGFPGMKGPS--GVPGSAGP 1352

Query:   359 HGQ---VPPPLNNVPYGSATPPARSG-SGQ 384
              G    V PP+    +    PP   G SGQ
Sbjct:  1353 EGDPGLVGPPV--CMFCILGPPGLPGPSGQ 1380


>UNIPROTKB|F1PHY1 [details] [associations]
            symbol:COL1A2 "Collagen alpha-2(I) chain" species:9615
            "Canis lupus familiaris" [GO:0071230 "cellular response to amino
            acid stimulus" evidence=IEA] [GO:0070208 "protein
            heterotrimerization" evidence=IEA] [GO:0048407 "platelet-derived
            growth factor binding" evidence=IEA] [GO:0046332 "SMAD binding"
            evidence=IEA] [GO:0043589 "skin morphogenesis" evidence=IEA]
            [GO:0042802 "identical protein binding" evidence=IEA] [GO:0030674
            "protein binding, bridging" evidence=IEA] [GO:0030199 "collagen
            fibril organization" evidence=IEA] [GO:0008217 "regulation of blood
            pressure" evidence=IEA] [GO:0007266 "Rho protein signal
            transduction" evidence=IEA] [GO:0007179 "transforming growth factor
            beta receptor signaling pathway" evidence=IEA] [GO:0005615
            "extracellular space" evidence=IEA] [GO:0005584 "collagen type I"
            evidence=IEA] [GO:0001568 "blood vessel development" evidence=IEA]
            [GO:0001501 "skeletal system development" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 GO:GO:0005615 GO:GO:0030199 GO:GO:0001501
            GO:GO:0008217 GO:GO:0007179 GO:GO:0007266 GO:GO:0070208
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0001568 GO:GO:0071230
            GO:GO:0005201 GO:GO:0043589 GeneTree:ENSGT00660000095287
            GO:GO:0005584 OMA:TGPIGSA EMBL:AAEX03009315
            Ensembl:ENSCAFT00000031580 Uniprot:F1PHY1
        Length = 1366

 Score = 126 (49.4 bits), Expect = 0.00026, P = 0.00026
 Identities = 83/261 (31%), Positives = 99/261 (37%)

Query:   156 GVPQGHGPPPSATTAGVVGA----G-PNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASK 209
             G+P   G P     AG  GA    G P  + S   +   G P  A    P GP G E  +
Sbjct:   322 GLPGPRGIPGPVGAAGATGARGLVGEPGPAGSKGESGNKGEPGSAGAQGPPGPSGEEGKR 381

Query:   210 GPGYDASKA-PSYDPTKGPSYDP-AKG-PGYDPTKGP-GYDAQKGSNYDAQ-RGPNYDIH 264
             GP  +A  A PS  P  G    P ++G PG D   G  G    +G+   A  RGPN D  
Sbjct:   382 GPNGEAGSAGPSGPP--GLRGSPGSRGLPGADGRAGVMGPPGPRGATGPAGVRGPNGDSG 439

Query:   265 RGPSYDPQRGLGYDMQRG-PNYDMQRGP-GYE-TQRVPGYDVQRGPVYEA--QRAPSYIP 319
             R P  +P    G    RG P      GP G E    +PG D + GP+  A  +  P  I 
Sbjct:   440 R-PG-EP----GLMGPRGFPGAPGNVGPAGKEGPMGLPGIDGRPGPIGPAGARGEPGNIG 493

Query:   320 QRGP-GYDLQRGQGYDMRRAPSYDPSRGT-GFDGAPRGAAPHGQVPPPLNNVPYGSATPP 377
               GP G     G+  D   A     +RG  G DG      P G           G A PP
Sbjct:   494 FPGPKGPTGDPGKNGDKGHA-GLAGARGAPGPDGNNGAQGPPGPQGVQGGKGEQGPAGPP 552

Query:   378 ARSGSGQPRG-----GNPARR 393
                G   P G     G P  R
Sbjct:   553 GFQGLPGPAGTAGEVGKPGER 573


>RGD|61817 [details] [associations]
            symbol:Col1a1 "collagen, type I, alpha 1" species:10116 "Rattus
           norvegicus" [GO:0001501 "skeletal system development" evidence=ISO]
           [GO:0001503 "ossification" evidence=IEP] [GO:0001568 "blood vessel
           development" evidence=IEA;ISO] [GO:0001649 "osteoblast
           differentiation" evidence=IEA] [GO:0001957 "intramembranous
           ossification" evidence=IEA;ISO] [GO:0001958 "endochondral
           ossification" evidence=IEA;ISO] [GO:0003674 "molecular_function"
           evidence=ND] [GO:0005201 "extracellular matrix structural
           constituent" evidence=IEA;ISO] [GO:0005578 "proteinaceous
           extracellular matrix" evidence=ISO] [GO:0005581 "collagen"
           evidence=ISO] [GO:0005584 "collagen type I" evidence=IEA;ISO]
           [GO:0005615 "extracellular space" evidence=ISO;IDA] [GO:0005737
           "cytoplasm" evidence=IEA;ISO] [GO:0007584 "response to nutrient"
           evidence=IEP] [GO:0007601 "visual perception" evidence=IEA;ISO]
           [GO:0007605 "sensory perception of sound" evidence=IEA;ISO]
           [GO:0009612 "response to mechanical stimulus" evidence=IEP]
           [GO:0010035 "response to inorganic substance" evidence=IEP]
           [GO:0010718 "positive regulation of epithelial to mesenchymal
           transition" evidence=IEA;ISO] [GO:0010812 "negative regulation of
           cell-substrate adhesion" evidence=IEA;ISO] [GO:0015031 "protein
           transport" evidence=IEA;ISO] [GO:0030199 "collagen fibril
           organization" evidence=IEA;ISO] [GO:0030335 "positive regulation of
           cell migration" evidence=IEA;ISO] [GO:0031012 "extracellular matrix"
           evidence=ISO] [GO:0031960 "response to corticosteroid stimulus"
           evidence=IEP] [GO:0032964 "collagen biosynthetic process"
           evidence=IEA;ISO] [GO:0034504 "protein localization to nucleus"
           evidence=IEA;ISO] [GO:0034505 "tooth mineralization"
           evidence=IEA;ISO] [GO:0042060 "wound healing" evidence=IMP]
           [GO:0042542 "response to hydrogen peroxide" evidence=IEP]
           [GO:0042802 "identical protein binding" evidence=IEA;ISO]
           [GO:0043434 "response to peptide hormone stimulus" evidence=IEP]
           [GO:0043588 "skin development" evidence=ISO] [GO:0043589 "skin
           morphogenesis" evidence=IEA;ISO] [GO:0045893 "positive regulation of
           transcription, DNA-dependent" evidence=IEA;ISO] [GO:0046872 "metal
           ion binding" evidence=IEA] [GO:0048407 "platelet-derived growth
           factor binding" evidence=IEA;ISO] [GO:0048705 "skeletal system
           morphogenesis" evidence=ISO] [GO:0048706 "embryonic skeletal system
           development" evidence=IEA;ISO] [GO:0051591 "response to cAMP"
           evidence=IEP] [GO:0060325 "face morphogenesis" evidence=IEA;ISO]
           [GO:0060346 "bone trabecula formation" evidence=IEA;ISO] [GO:0060351
           "cartilage development involved in endochondral bone morphogenesis"
           evidence=IEA;ISO] [GO:0070208 "protein heterotrimerization"
           evidence=IEA;ISO] [GO:0071230 "cellular response to amino acid
           stimulus" evidence=IEA;ISO] [GO:0071260 "cellular response to
           mechanical stimulus" evidence=IEA] [GO:0071300 "cellular response to
           retinoic acid" evidence=IEP] [GO:0071363 "cellular response to
           growth factor stimulus" evidence=IEP] [GO:0071560 "cellular response
           to transforming growth factor beta stimulus" evidence=IEP]
           [GO:0090263 "positive regulation of canonical Wnt receptor signaling
           pathway" evidence=IEA;ISO] InterPro:IPR000885 InterPro:IPR001007
           Pfam:PF00093 Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208
           PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038 SMART:SM00214
           RGD:61817 GO:GO:0005737 GO:GO:0045893 GO:GO:0005615 GO:GO:0009612
           GO:GO:0071560 GO:GO:0046872 GO:GO:0015031 GO:GO:0007601
           GO:GO:0071300 GO:GO:0043434 GO:GO:0030199 GO:GO:0007584
           GO:GO:0010035 GO:GO:0007605 GO:GO:0010718 GO:GO:0030335
           GO:GO:0042542 GO:GO:0070208 InterPro:IPR008160 Pfam:PF01391
           eggNOG:NOG12793 GO:GO:0042060 GO:GO:0071260 GO:GO:0001568
           GO:GO:0001649 GO:GO:0051591 GO:GO:0034505 GO:GO:0090263
           GO:GO:0001503 GO:GO:0010812 GO:GO:0060325 EMBL:CH473948
           GO:GO:0032964 GO:GO:0071230 GO:GO:0048706 GO:GO:0001957
           GO:GO:0034504 GO:GO:0001958 GO:GO:0060351 GO:GO:0005201
           GO:GO:0043589 CTD:1277 GeneTree:ENSGT00660000095287
           HOGENOM:HOG000085654 HOVERGEN:HBG004933 KO:K06236 OMA:VAYMDQQ
           GO:GO:0005584 GO:GO:0060346 GO:GO:0031960 EMBL:Z78279 EMBL:BC133728
           EMBL:M11432 IPI:IPI00188909 PIR:A90559 RefSeq:NP_445756.1
           UniGene:Rn.2953 PDB:3HQV PDB:3HR2 PDBsum:3HQV PDBsum:3HR2
           ProteinModelPortal:P02454 IntAct:P02454 STRING:P02454 PRIDE:P02454
           Ensembl:ENSRNOT00000005311 GeneID:29393 KEGG:rno:29393
           UCSC:RGD:61817 InParanoid:A3KNA1 Reactome:REACT_150387
           EvolutionaryTrace:P02454 NextBio:609017 ArrayExpress:P02454
           Genevestigator:P02454 GermOnline:ENSRNOG00000003897 Uniprot:P02454
        Length = 1453

 Score = 126 (49.4 bits), Expect = 0.00028, P = 0.00028
 Identities = 88/285 (30%), Positives = 108/285 (37%)

Query:   126 ADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNTSTS 182
             ADG  G  G  G++  +    P G  A   G   P G+ G P    + G  G  P  +  
Sbjct:   808 ADGQPGAKGEPGDTGVKGDAGPPGP-AGPAGPPGPIGNVGAPGPKGSRGAAGP-PGATGF 865

Query:   183 AYAATQSGTPMRAAYDIPRGP----GYEASKGPGYDASKA--PSYDPTKGPSYDPA--KG 234
               AA + G P  +    P GP    G E  KGP  +   A  P      GP   PA  KG
Sbjct:   866 PGAAGRVGPPGPSGNAGPPGPPGPVGKEGGKGPRGETGPAGRPGEVGPPGPP-GPAGEKG 924

Query:   235 -PGYD-P-----TKGP-GYDAQKGS-NYDAQRGPN-YDIHRGPSYDP-QRG-LGYDMQRG 282
              PG D P     T GP G   Q+G      QRG   +    GPS +P ++G  G   +RG
Sbjct:   925 SPGADGPAGSPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQGPSGASGERG 984

Query:   283 PNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYD 342
             P   M  GP       PG     GP  E+ R  S   +  PG D   G   D        
Sbjct:   985 PPGPM--GP-------PGL---AGPPGESGREGSPGAEGSPGRDGAPGAKGDRGETGPAG 1032

Query:   343 PSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 387
             P    G  GAP    P G+        P G A P   +G+  P G
Sbjct:  1033 PPGAPGAPGAPGPVGPAGKNGDRGETGPAGPAGPIGPAGARGPAG 1077


>UNIPROTKB|F1LQ00 [details] [associations]
            symbol:Col5a2 "Protein Col5a2" species:10116 "Rattus
            norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 RGD:70921 GO:GO:0043588 GO:GO:0030199
            GO:GO:0001501 InterPro:IPR008160 Pfam:PF01391 GO:GO:0071230
            GO:GO:0005201 GO:GO:0048592 GeneTree:ENSGT00660000095287
            GO:GO:0005588 IPI:IPI00366945 Ensembl:ENSRNOT00000005073
            Uniprot:F1LQ00
        Length = 1467

 Score = 126 (49.4 bits), Expect = 0.00028, P = 0.00028
 Identities = 87/290 (30%), Positives = 109/290 (37%)

Query:   123 DRRADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNT 179
             ++ A+G+ G  GA G   +     P G    E G   P+G  GPP S    G  G    T
Sbjct:   752 EKGAEGTAGNDGARGLPGSLGPPGPAGPTG-EKGEPGPRGLVGPPGSRGNPGSRGENGPT 810

Query:   180 STSAYAATQS--GTP-MRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGP 235
                 +A  Q   G P ++     P   G   S GP G   S  P + P   P     +G 
Sbjct:   811 GAVGFAGPQGPDGQPGVKGEPGEPGQKGDAGSPGPQGLAGSPGP-HGPHGVPGLKGGRGT 869

Query:   236 GYDP--TKGPGYDAQKGSNYDAQR-GPNYDIHRGPSYDPQRGLGYDM-QRGPNYDM-QRG 290
                P  T  PG   + G    A   GP   I   P  +   GL  D    G   D    G
Sbjct:   870 QGPPGATGFPGSAGRVGPPGPAGAPGPAGPIGE-PGKEGPPGLRGDPGSHGRVGDRGPAG 928

Query:   291 P-GYETQRV-PGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYDPSR 345
             P G    +  PG D Q GP  +    P+    QRG  G   QRG+ G      P+  P +
Sbjct:   929 PPGSPGDKGDPGEDGQPGP--DGPPGPAGTTGQRGIVGMPGQRGERGMPGLPGPAGTPGK 986

Query:   346 GTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP-RGGNPARR 393
               G  GA     P G V PP +N P G   P   +G+ G P R G    R
Sbjct:   987 -VGPTGATGDKGPPGPVGPPGSNGPVGEPGPEGPAGNDGTPGRDGAVGER 1035


>ZFIN|ZDB-GENE-980526-192 [details] [associations]
            symbol:col2a1a "collagen type II, alpha-1a"
            species:7955 "Danio rerio" [GO:0005581 "collagen" evidence=IEA;ISS]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA] [GO:0030903 "notochord development" evidence=IGI]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 ZFIN:ZDB-GENE-980526-192 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201
            GeneTree:ENSGT00660000095287 HOVERGEN:HBG004933 KO:K06236
            GO:GO:0030903 EMBL:BX927144 EMBL:DQ335127 IPI:IPI00505438
            RefSeq:NP_571367.1 UniGene:Dr.75057 SMR:Q2LDA1 STRING:Q2LDA1
            Ensembl:ENSDART00000100234 GeneID:562496 KEGG:dre:562496 CTD:562496
            InParanoid:Q2LDA1 NextBio:20884441 Uniprot:Q2LDA1
        Length = 1491

 Score = 126 (49.4 bits), Expect = 0.00029, P = 0.00029
 Identities = 83/270 (30%), Positives = 96/270 (35%)

Query:   132 GATGNSENETSGRPVGQNAYEDGYGVPQG-HGPPPSATTAGVVGA-GPNTSTSAYAATQ- 188
             GA G   N+      GQ   + G   PQG  G P      GV G  G   +  A  AT  
Sbjct:   844 GADGQPGNKGEQGESGQKG-DSGAPGPQGPSGAPGPVGPTGVTGPKGARGAQGAPGATGF 902

Query:   189 SGTPMRAAYDIPRG-PGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGP-GYDPTKGP 243
              G   R     P G PG     GP G D  K    D    G + D   +GP G    KG 
Sbjct:   903 PGAAGRVGPPGPNGNPGAAGPAGPSGKDGPKGVRGDAGPPGRAGDAGLRGPPGAPGEKGE 962

Query:   244 -GYDAQKGSNYDAQRGP-NYDIHRGPSYDP-QRG-LGYDMQRGPNYD--MQRGPGYETQR 297
              G D   G   D   GP      RG    P QRG  G+    GP+ +   Q  PG    R
Sbjct:   963 AGEDGPPGP--DGPSGPAGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPGGSGDR 1020

Query:   298 VP----GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG-QGYDMRRAPSYDPSRGTGFDGA 352
              P    G     GP  E  R  +      PG D   G +G      P   P    G  GA
Sbjct:  1021 GPPGPVGPPGLTGPAGETGREGNPGSDGPPGRDGAAGVKGERGNTGPIGAPG-APGAPGA 1079

Query:   353 PRGAAPHGQVPPPLNNVPYGSATPPARSGS 382
             P    P G+      N P G A PP  +G+
Sbjct:  1080 PGSVGPIGKQGDRGENGPQGPAGPPGPAGA 1109


>WB|WBGene00001076 [details] [associations]
            symbol:dpy-17 species:6239 "Caenorhabditis elegans"
            [GO:0042302 "structural constituent of cuticle" evidence=IEA]
            [GO:0016021 "integral to membrane" evidence=IEA] [GO:0010171 "body
            morphogenesis" evidence=IMP] [GO:0040002 "collagen and
            cuticulin-based cuticle development" evidence=IMP] [GO:0040035
            "hermaphrodite genitalia development" evidence=IMP] [GO:0040007
            "growth" evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] InterPro:IPR002486 Pfam:PF01484 SMART:SM01088
            GO:GO:0040007 GO:GO:0002119 GO:GO:0010171 GO:GO:0040035
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0040002 EMBL:FO080874
            GO:GO:0042302 HOGENOM:HOG000085656 GeneTree:ENSGT00390000012316
            RefSeq:NP_498086.1 ProteinModelPortal:Q20778 SMR:Q20778
            DIP:DIP-26150N MINT:MINT-1080630 STRING:Q20778 PaxDb:Q20778
            EnsemblMetazoa:F54D8.1.1 EnsemblMetazoa:F54D8.1.2 GeneID:175696
            KEGG:cel:CELE_F54D8.1 UCSC:F54D8.1.1 CTD:175696 WormBase:F54D8.1
            eggNOG:NOG253878 InParanoid:Q20778 OMA:TEMEAWR NextBio:889252
            Uniprot:Q20778
        Length = 352

 Score = 118 (46.6 bits), Expect = 0.00031, P = 0.00031
 Identities = 74/296 (25%), Positives = 104/296 (35%)

Query:   108 EVEKLRAELMNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGY-GVPQGHGPPPS 166
             E +++  ++     V R+A G YGG  G      SG P G +    G+ G PQGH P  +
Sbjct:    48 ESDQIYMDMQKFGRVRRQA-GGYGGYGGYGSGP-SG-PSGPSGPHGGFPGGPQGHFPGNT 104

Query:   167 ATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKG 226
              ++      G      +      G+P+        GPG + +          P+  P   
Sbjct:   105 GSSNTPTLPGVIGVPPSVTGHPGGSPINPDGSPSAGPGDKCNCNTENSCPAGPA-GPKGT 163

Query:   227 PSYDPAKG-PGYDPTKGPGYDAQKGSNYDAQRGPNYD----IHRGPSYDP-QRGL-GYDM 279
             P +D   G PG      PG D +   +  AQ    YD       GP   P  +G  G   
Sbjct:   164 PGHDGPDGIPGV-----PGVDGEDADDAKAQT-QQYDGCFTCPAGPQGPPGSQGKPGARG 217

Query:   280 QRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYD-MRRA 338
              RG        PG +    PG     GP+     A    P   PG D++   G    +  
Sbjct:   218 MRGARGQAAM-PGRDGS--PGMPGSLGPIGPPGAAGEEGPTGEPGADVEHQIGLPGAKGT 274

Query:   339 PSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP-RGGNPAR 392
             P      G   +   RGA   G   PP    P G       +G+ G P   G P +
Sbjct:   275 PGAPGESGDQGEQGDRGAT--GIAGPPGERGPQGEKGDDGPNGAAGSPGEEGEPGQ 328


>UNIPROTKB|G4MYW7 [details] [associations]
            symbol:MGG_10829 "Uncharacterized protein" species:242507
            "Magnaporthe oryzae 70-15" [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR000571 PROSITE:PS50103 GO:GO:0008270 GO:GO:0003676
            EMBL:CM001232 InterPro:IPR019496 Pfam:PF10453 RefSeq:XP_003713435.1
            EnsemblFungi:MGG_10829T0 GeneID:2676344 KEGG:mgr:MGG_10829
            Uniprot:G4MYW7
        Length = 600

 Score = 121 (47.7 bits), Expect = 0.00033, P = 0.00033
 Identities = 61/238 (25%), Positives = 82/238 (34%)

Query:   160 GHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYD--IPRGPGYEASKGPGYDASK 217
             G+GPPP        GA P      Y   Q        +    PRG G  A  G G     
Sbjct:     5 GYGPPPPPPA----GAPPQAYQQQYGQYQQPPATGHVHGGHAPRG-GRGAHSGRGDFHGS 59

Query:   218 APSYDPTKGPSYDPA-KGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLG 276
              PSY     P   P+  GP + P   P +      NY     P +  ++ P Y  Q+   
Sbjct:    60 PPSYPYNNQPQPPPSYTGPHHAPP--PPHTPLAPQNYHPNYAPQH--YQQPQYAHQQQYP 115

Query:   277 YDMQRGPNYDMQRGPGYETQRVPGY-DVQRGPVYEAQRAPSYIPQR--GPG-YDLQRGQG 332
             +   + P    Q+ P Y     P Y      P ++    P+    +  GP  Y   RG+G
Sbjct:   116 HQQPQQPPQPPQQAP-Y-AHHYPSYPQAPNAPPHQPWGGPATAGHQPAGPAHYGSGRGRG 173

Query:   333 YDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNP 390
                     + P+   G      G    G  PP L  V   +  PP     G P+GG P
Sbjct:   174 GHQGDRGGHKPAAAMG-PPLRMGFDNRGPEPPAL--VSSATVYPP--QPFGPPQGGAP 226


>ZFIN|ZDB-GENE-041221-2 [details] [associations]
            symbol:prnpb "prion protein b" species:7955 "Danio
            rerio" [GO:0051260 "protein homooligomerization" evidence=IEA]
            [GO:0016020 "membrane" evidence=IEA] [GO:0003674
            "molecular_function" evidence=ND] [GO:0016338 "calcium-independent
            cell-cell adhesion" evidence=IMP] [GO:0007156 "homophilic cell
            adhesion" evidence=IDA] [GO:0055113 "epiboly involved in
            gastrulation with mouth forming second" evidence=IGI;IMP]
            [GO:2000047 "regulation of cell-cell adhesion mediated by cadherin"
            evidence=IMP] [GO:0005886 "plasma membrane" evidence=IDA]
            [GO:0007417 "central nervous system development" evidence=IGI]
            [GO:0009986 "cell surface" evidence=IDA] InterPro:IPR022416
            ZFIN:ZDB-GENE-041221-2 GO:GO:0005886 GO:GO:0009986 GO:GO:0051260
            GO:GO:0007156 GO:GO:0055113 GO:GO:0016338 Gene3D:1.10.790.10
            SUPFAM:SSF54098 EMBL:AJ850286 IPI:IPI00485089 UniGene:Dr.90045
            ProteinModelPortal:Q5K0E1 PRIDE:Q5K0E1 HOVERGEN:HBG056090
            InParanoid:Q5K0E1 Bgee:Q5K0E1 GO:GO:2000047 Uniprot:Q5K0E1
        Length = 606

 Score = 121 (47.7 bits), Expect = 0.00034, P = 0.00034
 Identities = 89/287 (31%), Positives = 108/287 (37%)

Query:   126 ADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAG--PNTST 181
             A GSY   G  G+S      +  G  +Y  G   P   G P      G    G  PN + 
Sbjct:    94 AGGSYPYPGRGGSSPGGYPNQNPGAGSYPSGGSYPSAGGNPNQYPGRGGYNPGGYPNQNP 153

Query:   182 SAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK 241
              A +    G+   A  +  + PG   +   GY     P+ +P  G SY PA G  Y    
Sbjct:   154 GAGSYPAGGSYPSAGGNPNQYPGRGGTSPAGY-----PNQNPGAG-SY-PAGG-SYPSAG 205

Query:   242 G-PG-YDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRG---PGYET 295
             G P  Y  + GSN      PN +   G SY P  G  Y    G PN    RG   PG   
Sbjct:   206 GNPNQYPGRGGSNPGGY--PNQNPGAG-SY-PAGG-SYPSAGGNPNQYPGRGGSSPGGNP 260

Query:   296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQR-GQ-GYDMRRAP---SYDPSRGTGFD 350
              + PG     G  Y     P+  P  G GY  Q  G+ GY     P   SY P R  G  
Sbjct:   261 NQNPGAGTYAGGGY-----PNQYPGGG-GYSNQNPGRSGYSPGGYPGAGSY-PVRNAGQP 313

Query:   351 GAPRGAAPH--GQVPP--PLNNV--P-YGSATPPARSGSGQPRGGNP 390
             G   GA P   G  P   P N +  P YG +      G G   GG+P
Sbjct:   314 GVYPGAHPSAGGGYPNWNPNNQILSPRYGGSF----GGGGFGTGGSP 356


>WB|WBGene00001263 [details] [associations]
            symbol:emb-9 species:6239 "Caenorhabditis elegans"
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA;TAS] [GO:0005581 "collagen" evidence=IEA] [GO:0040010
            "positive regulation of growth rate" evidence=IMP] [GO:0008340
            "determination of adult lifespan" evidence=IMP] [GO:0000003
            "reproduction" evidence=IMP] [GO:0009792 "embryo development ending
            in birth or egg hatching" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0040011 "locomotion" evidence=IMP] [GO:0006898
            "receptor-mediated endocytosis" evidence=IMP] [GO:0040039
            "inductive cell migration" evidence=IMP] [GO:0030198 "extracellular
            matrix organization" evidence=IMP] [GO:0009790 "embryo development"
            evidence=IMP] [GO:0050714 "positive regulation of protein
            secretion" evidence=IMP] [GO:0007517 "muscle organ development"
            evidence=IMP] [GO:0005604 "basement membrane" evidence=IDA]
            InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
            GO:GO:0008340 GO:GO:0009792 GO:GO:0006898 GO:GO:0040007
            GO:GO:0040010 GO:GO:0002119 InterPro:IPR016187 SUPFAM:SSF56436
            GO:GO:0030198 GO:GO:0000003 GO:GO:0050714 GO:GO:0007517
            GO:GO:0040039 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            eggNOG:NOG12793 GO:GO:0005604 GO:GO:0005201 HOGENOM:HOG000085652
            Gene3D:2.170.240.10 EMBL:X56979 EMBL:Z27078 EMBL:J05067 PIR:S40991
            RefSeq:NP_001022662.1 RefSeq:NP_001022663.1
            ProteinModelPortal:P17139 SMR:P17139 IntAct:P17139
            MINT:MINT-1091171 STRING:P17139 PaxDb:P17139 PRIDE:P17139
            EnsemblMetazoa:K04H4.1a GeneID:176314 KEGG:cel:CELE_K04H4.1
            UCSC:K04H4.1b CTD:176314 WormBase:K04H4.1a WormBase:K04H4.1b
            GeneTree:ENSGT00690000101772 InParanoid:P17139 OMA:EEGIPGC
            NextBio:892048 Uniprot:P17139
        Length = 1759

 Score = 126 (49.4 bits), Expect = 0.00035, P = 0.00035
 Identities = 79/282 (28%), Positives = 100/282 (35%)

Query:   128 GSYGGATGNSENETSGRP----VGQNAYEDGY-GVP--QGHGPPPSATTAGVVGAGPNTS 180
             G+YG      E    G P        A E GY G P  +G   P      G   AGP+  
Sbjct:   315 GNYGEKGSQGEQGLGGTPGYPGTKGGAGEPGYPGRPGFEGDCGPEGPLGEGTGEAGPH-G 373

Query:   181 TSAYAATQSGTPMRAAYDIPRGP-GYEASKG-PGYDASKAPSYDPTKGPSYDPAKGP-GY 237
                +   Q G  +     +P GP G     G PG  A   P  D   G +    +G  GY
Sbjct:   374 AQGFDGVQGGKGLPGHDGLP-GPVGPRGPVGAPG--APGQPGIDGMPGYTEKGDRGEDGY 430

Query:   238 DPTKG-PGYDAQKGS-NYDAQRG-PNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGPGY 293
                 G PG   + G   Y  + G P YDI   P  D Q G  G+    G   D    PGY
Sbjct:   431 PGFAGEPGLPGEPGDCGYPGEDGLPGYDIQGPPGLDGQSGRDGFPGIPGDIGD----PGY 486

Query:   294 ETQR-VPGYDVQR-GP--VYEAQRAPSYIPQR-G-PGYDLQRGQGYDMRRAPSYDPSRGT 347
               ++  PG  V + GP  +      P  +P R G  GY    G   +      Y P    
Sbjct:   487 SGEKGFPGTGVNKVGPPGMTGLPGEPG-MPGRIGVDGYPGPPGNNGERGEDCGYCPDGVP 545

Query:   348 GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGN 389
             G  G P     +G   PP  N  +G    P   G  +  G +
Sbjct:   546 GNAGDPGFPGMNGYPGPPGPNGDHGDCGMPGAPGKPRSAGSD 587


>UNIPROTKB|F1LRM7 [details] [associations]
            symbol:Col2a1 "Collagen alpha-1(II) chain" species:10116
            "Rattus norvegicus" [GO:0001502 "cartilage condensation"
            evidence=IEA] [GO:0001894 "tissue homeostasis" evidence=IEA]
            [GO:0001958 "endochondral ossification" evidence=IEA] [GO:0002062
            "chondrocyte differentiation" evidence=IEA] [GO:0003007 "heart
            morphogenesis" evidence=IEA] [GO:0005201 "extracellular matrix
            structural constituent" evidence=IEA] [GO:0005585 "collagen type
            II" evidence=IEA] [GO:0005604 "basement membrane" evidence=IEA]
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0005737
            "cytoplasm" evidence=IEA] [GO:0006029 "proteoglycan metabolic
            process" evidence=IEA] [GO:0007417 "central nervous system
            development" evidence=IEA] [GO:0007601 "visual perception"
            evidence=IEA] [GO:0007605 "sensory perception of sound"
            evidence=IEA] [GO:0010468 "regulation of gene expression"
            evidence=IEA] [GO:0030199 "collagen fibril organization"
            evidence=IEA] [GO:0030903 "notochord development" evidence=IEA]
            [GO:0042472 "inner ear morphogenesis" evidence=IEA] [GO:0042802
            "identical protein binding" evidence=IEA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IEA] [GO:0048407
            "platelet-derived growth factor binding" evidence=IEA] [GO:0060021
            "palate development" evidence=IEA] [GO:0060174 "limb bud formation"
            evidence=IEA] [GO:0060272 "embryonic skeletal joint morphogenesis"
            evidence=IEA] [GO:0060351 "cartilage development involved in
            endochondral bone morphogenesis" evidence=IEA] [GO:0071599 "otic
            vesicle development" evidence=IEA] [GO:0071773 "cellular response
            to BMP stimulus" evidence=IEA] InterPro:IPR000885 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS51461 SMART:SM00038 RGD:2375
            GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201
            GeneTree:ENSGT00660000095287 IPI:IPI00394380
            Ensembl:ENSRNOT00000016044 ArrayExpress:F1LRM7 Uniprot:F1LRM7
        Length = 1419

 Score = 125 (49.1 bits), Expect = 0.00035, P = 0.00035
 Identities = 87/281 (30%), Positives = 99/281 (35%)

Query:   126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 183
             ADG  G      E    G   G    +   G P   GP       G  GA GP  +T   
Sbjct:   773 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 831

Query:   184 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 239
              AA + G P       P GP      GP G D  K    D    G + DP  +GP   P 
Sbjct:   832 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 886

Query:   240 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 295
              KG PG D   GS  D   GP     +G +   QRG+ G   QRG   +    GP  E  
Sbjct:   887 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 937

Query:   296 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 351
              Q  PG    RGP           P   PG +   G      R  A      RG TG  G
Sbjct:   938 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 997

Query:   352 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 391
             AP    P G  P P    P G       +G+  P G   PA
Sbjct:   998 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1035

 Score = 125 (49.1 bits), Expect = 0.00035, P = 0.00035
 Identities = 89/296 (30%), Positives = 110/296 (37%)

Query:   120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSAT---TAGVVGAG 176
             P  DR  D    GA G    +  G P G        G P   GPP        A + G  
Sbjct:    64 PRGDR-GDKGERGAPGPRGRD--GEP-GTPGNPGPPGPPGPPGPPGLGGGNFAAQMAGGF 119

Query:   177 PNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAK 233
                +  A      G PM      PRGP G   + GP G+  +     +P   GP   P  
Sbjct:   120 DEKAGGAQMGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GPRG 175

Query:   234 GPGYDPTKGPGYDAQKGSNYDA-QRG-PNYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQR 289
              PG  P   PG D + G    A +RG P     RG    P  GL G    RG P  D  +
Sbjct:   176 PPG--PAGKPGDDGEAGKPGKAGERGLPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAK 231

Query:   290 G----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS---- 340
             G    PG + +   PG +   GP+   +  P    + GP       +G D +  P+    
Sbjct:   232 GEAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPG 289

Query:   341 -YDPSRGTGFDGAP--RG-AAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNP 390
                P+ G GF GAP  +G A P G   P       GS   P   GS  P G  GNP
Sbjct:   290 PVGPAGGPGFPGAPGAKGEAGPTGARGP---EGAQGSRGEPGNPGSPGPAGASGNP 342


>RGD|2375 [details] [associations]
            symbol:Col2a1 "collagen, type II, alpha 1" species:10116 "Rattus
          norvegicus" [GO:0001501 "skeletal system development" evidence=ISO]
          [GO:0001502 "cartilage condensation" evidence=ISO] [GO:0001894
          "tissue homeostasis" evidence=ISO] [GO:0001958 "endochondral
          ossification" evidence=ISO] [GO:0002062 "chondrocyte differentiation"
          evidence=ISO] [GO:0003007 "heart morphogenesis" evidence=ISO]
          [GO:0005201 "extracellular matrix structural constituent"
          evidence=TAS] [GO:0005581 "collagen" evidence=ISO] [GO:0005585
          "collagen type II" evidence=ISO;TAS] [GO:0005604 "basement membrane"
          evidence=ISO] [GO:0005615 "extracellular space" evidence=ISO]
          [GO:0005737 "cytoplasm" evidence=ISO] [GO:0006029 "proteoglycan
          metabolic process" evidence=ISO] [GO:0007601 "visual perception"
          evidence=ISO] [GO:0007605 "sensory perception of sound" evidence=ISO]
          [GO:0010468 "regulation of gene expression" evidence=ISO] [GO:0030199
          "collagen fibril organization" evidence=ISO] [GO:0031012
          "extracellular matrix" evidence=ISO] [GO:0035108 "limb morphogenesis"
          evidence=ISO] [GO:0042472 "inner ear morphogenesis" evidence=ISO]
          [GO:0042802 "identical protein binding" evidence=ISO] [GO:0043066
          "negative regulation of apoptotic process" evidence=ISO] [GO:0046872
          "metal ion binding" evidence=IEA] [GO:0048407 "platelet-derived
          growth factor binding" evidence=ISO] [GO:0048705 "skeletal system
          morphogenesis" evidence=ISO] [GO:0048839 "inner ear development"
          evidence=ISO] [GO:0051216 "cartilage development" evidence=IEP;ISO]
          [GO:0060021 "palate development" evidence=ISO] [GO:0060272 "embryonic
          skeletal joint morphogenesis" evidence=ISO] [GO:0060348 "bone
          development" evidence=ISO] [GO:0060351 "cartilage development
          involved in endochondral bone morphogenesis" evidence=ISO]
          [GO:0071773 "cellular response to BMP stimulus" evidence=ISO]
          InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
          SMART:SM00038 RGD:2375 GO:GO:0046872 GO:GO:0051216 InterPro:IPR008160
          Pfam:PF01391 eggNOG:NOG12793 GO:GO:0005201 HOGENOM:HOG000085654
          HOVERGEN:HBG004933 KO:K06236 CTD:1280 Reactome:REACT_133391
          GO:GO:0005585 EMBL:L48440 EMBL:K02804 EMBL:M10613 EMBL:X79816
          IPI:IPI00394380 PIR:A05152 PIR:I60384 RefSeq:NP_037061.1
          UniGene:Rn.10124 IntAct:P05539 STRING:P05539 PRIDE:P05539
          GeneID:25412 KEGG:rno:25412 UCSC:RGD:2375 NextBio:606543
          ArrayExpress:P05539 Genevestigator:P05539
          GermOnline:ENSRNOG00000022282 Uniprot:P05539
        Length = 1419

 Score = 125 (49.1 bits), Expect = 0.00035, P = 0.00035
 Identities = 87/281 (30%), Positives = 99/281 (35%)

Query:   126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 183
             ADG  G      E    G   G    +   G P   GP       G  GA GP  +T   
Sbjct:   773 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 831

Query:   184 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 239
              AA + G P       P GP      GP G D  K    D    G + DP  +GP   P 
Sbjct:   832 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 886

Query:   240 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 295
              KG PG D   GS  D   GP     +G +   QRG+ G   QRG   +    GP  E  
Sbjct:   887 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 937

Query:   296 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 351
              Q  PG    RGP           P   PG +   G      R  A      RG TG  G
Sbjct:   938 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 997

Query:   352 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 391
             AP    P G  P P    P G       +G+  P G   PA
Sbjct:   998 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1035

 Score = 122 (48.0 bits), Expect = 0.00074, P = 0.00074
 Identities = 89/296 (30%), Positives = 110/296 (37%)

Query:   120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSAT---TAGVVGAG 176
             P  DR  D    GA G    +  G P G        G P   GPP        A + G  
Sbjct:    64 PRGDR-GDKGERGAPGPRGRD--GEP-GTPGNPGPPGPPGPPGPPGLGGGNFAAQMAGGF 119

Query:   177 PNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAK 233
                +  A      G PM      PRGP G   + GP G+  +     +P   GP   P  
Sbjct:   120 DEKAGGAQMGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGP-IGPRG 175

Query:   234 GPGYDPTKGPGYDAQKGSNYDA-QRG-PNYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQR 289
              PG  P   PG D + G    A +RG P     RG    P  GL G    RG P  D  +
Sbjct:   176 PPG--PAGKPGDDGEAGKPGKAGERGLPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAK 231

Query:   290 G----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS---- 340
             G    PG + +   PG +   GP+   +  P    + GP       +G D +  P+    
Sbjct:   232 GEAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPG 289

Query:   341 -YDPSRGTGFDGAP--RG-AAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNP 390
                P+ G GF GAP  +G A P G   P       GS   P   GS  P G  GNP
Sbjct:   290 PVGPAGGPGFLGAPGAKGEAGPTGARGP---EGAQGSRGEPGNPGSPGPAGASGNP 342


>UNIPROTKB|E1BT66 [details] [associations]
            symbol:TAF15 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0000166 "nucleotide binding" evidence=IEA] [GO:0003676
            "nucleic acid binding" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0005737
            "cytoplasm" evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358
            PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360 SMART:SM00547
            GO:GO:0005634 GO:GO:0005737 GO:GO:0000166 GO:GO:0008270
            Gene3D:3.30.70.330 GO:GO:0003676 GeneTree:ENSGT00530000063105
            OMA:YGNQGSQ EMBL:AADN02025953 EMBL:AADN02025954 IPI:IPI00575015
            ProteinModelPortal:E1BT66 Ensembl:ENSGALT00000003204 Uniprot:E1BT66
        Length = 443

 Score = 119 (46.9 bits), Expect = 0.00035, P = 0.00035
 Identities = 70/232 (30%), Positives = 89/232 (38%)

Query:   137 SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAA 196
             S++ + G+  GQ +Y   YG     G      T G  G G +   S+Y   QS       
Sbjct:     3 SDSGSYGQSGGQQSYSS-YG---NQGNQSYGQTQGYSGYGQSGDNSSYG--QSYGNYHGN 56

Query:   197 YDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP--GYDAQKGSNYD 254
             Y      GY      GYD     SYD     SY+          KG   G      S+YD
Sbjct:    57 YG-QNQTGY-GQDSHGYDDES--SYDNQNQSSYNQQSYSNQGQQKGSSRGGRGSYSSSYD 112

Query:   255 AQRGPNYDIHRGPSYDPQRGLG----YDMQRGPNYDMQRGPGYETQRVPGYDVQRGPV-Y 309
              Q G  Y  H+G SYD Q G G    YD + G N   Q   G+  Q    Y  Q+G   +
Sbjct:   113 QQSG--YG-HQG-SYDQQSGYGHQSSYDQKSGYNQH-QSSYGHSQQ---SYQSQKGSYSH 164

Query:   310 EAQ---RAPSYIPQRGPGYDLQRGQGYDMRRAPSYD-PSRG--TGFDGAPRG 355
              +Q   R  S   +   GY   +G G    R   YD   RG  +G+ G  RG
Sbjct:   165 NSQDDRREKSRYGEDNRGYGGSQGGG----RG-GYDMDGRGHMSGYSGGDRG 211


>UNIPROTKB|E7ENY8 [details] [associations]
            symbol:COL3A1 "Collagen alpha-1(III) chain" species:9606
            "Homo sapiens" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 EMBL:AC066694 HGNC:HGNC:2201
            ChiTaRS:COL3A1 IPI:IPI00981037 PDB:4GYX PDBsum:4GYX
            ProteinModelPortal:E7ENY8 SMR:E7ENY8 PRIDE:E7ENY8
            Ensembl:ENST00000317840 ArrayExpress:E7ENY8 Bgee:E7ENY8
            Uniprot:E7ENY8
        Length = 1163

 Score = 124 (48.7 bits), Expect = 0.00036, P = 0.00036
 Identities = 81/280 (28%), Positives = 101/280 (36%)

Query:   126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP----NTS 180
             A G   G  G +       P G + +    G P   GPP     AG  G  GP      S
Sbjct:   165 AVGGLAGYPGPAGPPGPPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPSGPPGPPGAIGPS 224

Query:   181 TSAYAATQSGTPMRAAY-DIPRGPGYEASKG-PGYDASKAP-SYDPTKGPSYDPAKGPGY 237
               A    +SG P R     +P  PG +   G PG+   K    +D   G   +    PG 
Sbjct:   225 GPAGKDGESGRPGRPGERGLPGPPGIKGPAGIPGFPGMKGHRGFDGRNGEKGETG-APGL 283

Query:   238 DPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGP-GYE 294
                 G PG +   G      RG   +  R P      G  G D  RG   D Q GP G  
Sbjct:   284 KGENGLPGENGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQPGPPGPP 338

Query:   295 -TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 353
              T   PG    +G V  A    S      PG   QRG+      A +  P    G +G+P
Sbjct:   339 GTAGFPGSPGAKGEVGPAGSPGS---NGAPG---QRGEPGPQGHAGAQGPPGPPGINGSP 392

Query:   354 RGAAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RGG 388
              G    G  P  +   P   G+  PP  +G+ G P  RGG
Sbjct:   393 GGKGEMG--PAGIPGAPGLMGARGPPGPAGANGAPGLRGG 430

 Score = 123 (48.4 bits), Expect = 0.00046, P = 0.00046
 Identities = 85/284 (29%), Positives = 101/284 (35%)

Query:   126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-G-PNTSTSA 183
             A G  GGA    +N   G P G        G+P   G P +    G  G+ G P  +   
Sbjct:   424 APGLRGGAGEPGKNGAKGEP-GPRGERGEAGIP---GVPGAKGEDGKDGSPGEPGANGLP 479

Query:   184 YAATQSGTPMRAAYDIPRGP-GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG 242
              AA + G P    +  P GP G    KGP  +   AP   P  GP    A  PG D   G
Sbjct:   480 GAAGERGAP---GFRGPAGPNGIPGEKGPAGERG-APG--PA-GPR-GAAGEPGRDGVPG 531

Query:   243 -PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDM-QRG----PGYE-T 295
              PG     GS      GP  D   GP    Q   G     GP+    Q G    PG +  
Sbjct:   532 GPGMRGMPGS----PGGPGSDGKPGPP-GSQGESGRPGPPGPSGPRGQPGVMGFPGPKGN 586

Query:   296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG-QGYDMRRAPSYD-----PSRGTGF 349
                PG + +RG        P   PQ  PG + + G QG      P  D     P    G 
Sbjct:   587 DGAPGKNGERG----GPGGPG--PQGPPGKNGETGPQGPPGPTGPGGDKGDTGPPGPQGL 640

Query:   350 DGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPARR 393
              G P    P G+   P    P G A  P   G G+   G P  R
Sbjct:   641 QGLPGTGGPPGENGKPGEPGPKGDAGAPGAPG-GKGDAGAPGER 683


>UNIPROTKB|F1LP41 [details] [associations]
            symbol:Col2a1 "Collagen alpha-1(II) chain" species:10116
            "Rattus norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 RGD:2375 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 IPI:IPI00205809
            Ensembl:ENSRNOT00000012441 ArrayExpress:F1LP41 Uniprot:F1LP41
        Length = 1458

 Score = 125 (49.1 bits), Expect = 0.00036, P = 0.00036
 Identities = 87/281 (30%), Positives = 99/281 (35%)

Query:   126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 183
             ADG  G      E    G   G    +   G P   GP       G  GA GP  +T   
Sbjct:   812 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 870

Query:   184 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 239
              AA + G P       P GP      GP G D  K    D    G + DP  +GP   P 
Sbjct:   871 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 925

Query:   240 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 295
              KG PG D   GS  D   GP     +G +   QRG+ G   QRG   +    GP  E  
Sbjct:   926 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 976

Query:   296 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 351
              Q  PG    RGP           P   PG +   G      R  A      RG TG  G
Sbjct:   977 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 1036

Query:   352 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 391
             AP    P G  P P    P G       +G+  P G   PA
Sbjct:  1037 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1074

 Score = 122 (48.0 bits), Expect = 0.00077, P = 0.00077
 Identities = 91/308 (29%), Positives = 117/308 (37%)

Query:   105 MATEVEKLRAELMNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPP 164
             +AT   KL  +++  P       G+ G    + E  T G P G        G P G G  
Sbjct:    92 LATASGKLGPKIIG-PKGPPGPQGAPGPRGRDGEPGTPGNP-GPPGPPGPPG-PPGLGGG 148

Query:   165 PSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYD 222
               A  A + G     +  A      G PM      PRGP G   + GP G+  +     +
Sbjct:   149 NFA--AQMAGGFDEKAGGAQMGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGE 203

Query:   223 P-TKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDA-QRG-PNYDIHRGPSYDPQRGL-GYD 278
             P   GP   P   PG  P   PG D + G    A +RG P     RG    P  GL G  
Sbjct:   204 PGVSGPM-GPRGPPG--PAGKPGDDGEAGKPGKAGERGLPGPQGARGFPGTP--GLPGVK 258

Query:   279 MQRG-PNYDMQRG----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQG 332
               RG P  D  +G    PG + +   PG +   GP+   +  P    + GP       +G
Sbjct:   259 GHRGYPGLDGAKGEAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARG 316

Query:   333 YDMRRAPS-----YDPSRGTGFDGAP--RG-AAPHGQVPPPLNNVPYGSATPPARSGSGQ 384
              D +  P+       P+ G GF GAP  +G A P G   P       GS   P   GS  
Sbjct:   317 NDGQPGPAGPPGPVGPAGGPGFPGAPGAKGEAGPTGARGP---EGAQGSRGEPGNPGSPG 373

Query:   385 PRG--GNP 390
             P G  GNP
Sbjct:   374 PAGASGNP 381


>UNIPROTKB|P02453 [details] [associations]
            symbol:COL1A1 "Collagen alpha-1(I) chain" species:9913 "Bos
            taurus" [GO:0090263 "positive regulation of canonical Wnt receptor
            signaling pathway" evidence=IEA] [GO:0071260 "cellular response to
            mechanical stimulus" evidence=IEA] [GO:0071230 "cellular response
            to amino acid stimulus" evidence=IEA] [GO:0070208 "protein
            heterotrimerization" evidence=IEA] [GO:0060351 "cartilage
            development involved in endochondral bone morphogenesis"
            evidence=IEA] [GO:0060346 "bone trabecula formation" evidence=IEA]
            [GO:0060325 "face morphogenesis" evidence=IEA] [GO:0048706
            "embryonic skeletal system development" evidence=IEA] [GO:0048407
            "platelet-derived growth factor binding" evidence=IEA] [GO:0045893
            "positive regulation of transcription, DNA-dependent" evidence=IEA]
            [GO:0043589 "skin morphogenesis" evidence=IEA] [GO:0042802
            "identical protein binding" evidence=IEA] [GO:0034505 "tooth
            mineralization" evidence=IEA] [GO:0034504 "protein localization to
            nucleus" evidence=IEA] [GO:0032964 "collagen biosynthetic process"
            evidence=IEA] [GO:0030335 "positive regulation of cell migration"
            evidence=IEA] [GO:0030199 "collagen fibril organization"
            evidence=IEA] [GO:0015031 "protein transport" evidence=IEA]
            [GO:0010812 "negative regulation of cell-substrate adhesion"
            evidence=IEA] [GO:0010718 "positive regulation of epithelial to
            mesenchymal transition" evidence=IEA] [GO:0007605 "sensory
            perception of sound" evidence=IEA] [GO:0007601 "visual perception"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005615
            "extracellular space" evidence=IEA] [GO:0005584 "collagen type I"
            evidence=IEA] [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0001958 "endochondral ossification"
            evidence=IEA] [GO:0001957 "intramembranous ossification"
            evidence=IEA] [GO:0001649 "osteoblast differentiation"
            evidence=IEA] [GO:0001568 "blood vessel development" evidence=IEA]
            [GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000885
            InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410 ProDom:PD002078
            PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038
            SMART:SM00214 GO:GO:0005737 GO:GO:0045893 GO:GO:0005615
            GO:GO:0046872 GO:GO:0015031 GO:GO:0007601 GO:GO:0030199
            GO:GO:0007605 GO:GO:0010718 GO:GO:0030335 GO:GO:0070208
            InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793 GO:GO:0071260
            GO:GO:0001568 GO:GO:0001649 GO:GO:0034505 GO:GO:0090263
            GO:GO:0010812 GO:GO:0060325 GO:GO:0032964 GO:GO:0071230
            GO:GO:0048706 GO:GO:0001957 GO:GO:0034504 GO:GO:0001958
            GO:GO:0060351 GO:GO:0005201 GO:GO:0043589 EMBL:BC105184
            IPI:IPI00707857 PIR:A91193 RefSeq:NP_001029211.1 UniGene:Bt.23316
            IntAct:P02453 STRING:P02453 PRIDE:P02453 Ensembl:ENSBTAT00000017420
            GeneID:282187 KEGG:bta:282187 CTD:1277 GeneTree:ENSGT00660000095287
            HOGENOM:HOG000085654 HOVERGEN:HBG004933 InParanoid:P02453 KO:K06236
            OMA:VAYMDQQ OrthoDB:EOG4S4PHP NextBio:20806015 PMAP-CutDB:P02453
            ArrayExpress:P02453 GO:GO:0005584 GO:GO:0060346 Uniprot:P02453
        Length = 1463

 Score = 125 (49.1 bits), Expect = 0.00036, P = 0.00036
 Identities = 90/286 (31%), Positives = 109/286 (38%)

Query:   126 ADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNTSTS 182
             ADG  G  G  G++  +    P G  A   G   P G+ G P      G   AGP  +T 
Sbjct:   818 ADGQPGAKGEPGDAGAKGDAGPPGP-AGPAGPPGPIGNVGAPGPKGARG--SAGPPGATG 874

Query:   183 -AYAATQSGTPMRAAYDIPRGP----GYEASKGPGYDASKA--PSYDPTKGPSYDPA--K 233
                AA + G P  +    P GP    G E SKGP  +   A  P      GP   PA  K
Sbjct:   875 FPGAAGRVGPPGPSGNAGPPGPPGPAGKEGSKGPRGETGPAGRPGEVGPPGPP-GPAGEK 933

Query:   234 G-PGYD-P-----TKGP-GYDAQKGS-NYDAQRGPN-YDIHRGPSYDP-QRG-LGYDMQR 281
             G PG D P     T GP G   Q+G      QRG   +    GPS +P ++G  G   +R
Sbjct:   934 GAPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQGPSGASGER 993

Query:   282 GPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSY 341
             GP   M  GP       PG     GP  E+ R  +   +  PG D   G   D       
Sbjct:   994 GPPGPM--GP-------PGL---AGPPGESGREGAPGAEGSPGRDGSPGAKGDRGETGPA 1041

Query:   342 DPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 387
              P    G  GAP    P G+        P G A P    G+  P G
Sbjct:  1042 GPPGAPGAPGAPGPVGPAGKSGDRGETGPAGPAGPIGPVGARGPAG 1087

 Score = 124 (48.7 bits), Expect = 0.00046, P = 0.00046
 Identities = 82/275 (29%), Positives = 108/275 (39%)

Query:   130 YGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQS 189
             + GA G ++ E  G P G    E   GV    GPP  A  AG  G  P       A   +
Sbjct:   344 FPGAVG-AKGE--GGPQGPRGSEGPQGVRGEPGPPGPAGAAGPAG-NPGADGQPGAKGAN 399

Query:   190 GTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDP-TKGPSYDPAKGPGYDPTKGP-GYDA 247
             G P      I   PG+  ++GP     + PS  P  KG S +P   PG   +KG  G   
Sbjct:   400 GAP-----GIAGAPGFPGARGPS--GPQGPSGPPGPKGNSGEPG-APG---SKGDTGAKG 448

Query:   248 QKG-SNYDAQRGP-NYDIHRGPSYDP-QRGL-GYDMQRGPNYDMQRGPGYETQRVPGYDV 303
             + G +      GP   +  RG   +P   GL G   +RG       GPG  ++  PG D 
Sbjct:   449 EPGPTGIQGPPGPAGEEGKRGARGEPGPAGLPGPPGERG-------GPG--SRGFPGADG 499

Query:   304 QRGPVYEA-QR-APSYI-PQRGPGYDLQRGQGYDMRRAPSYDPSRGT-GFDGAPRGAAPH 359
               GP   A +R AP    P+  PG   + G+   +  A     S G+ G DG      P 
Sbjct:   500 VAGPKGPAGERGAPGPAGPKGSPGEAGRPGEA-GLPGAKGLTGSPGSPGPDGKTGPPGPA 558

Query:   360 GQVPPPLNNVPYGSATPPARSGSGQPRG--GNPAR 392
             GQ   P    P G+       G   P+G  G P +
Sbjct:   559 GQDGRPGPPGPPGARGQAGVMGFPGPKGAAGEPGK 593

 Score = 121 (47.7 bits), Expect = 0.00099, P = 0.00099
 Identities = 80/272 (29%), Positives = 99/272 (36%)

Query:   126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTS--A 183
             A G  G A    E    G P G   ++   G+P   GPP  A   G  G   +      +
Sbjct:   617 AQGPPGPAGPAGERGEQG-PAGSPGFQ---GLPGPAGPPGEAGKPGEQGVPGDLGAPGPS 672

Query:   184 YAATQSGTPMRAAYDIPRGP-GYEASKG-PGYDASKAPSYDPTKGPSYDPAKG-PGYDPT 240
              A  + G P       P GP G   + G PG D +K  +  P   P    A G  G    
Sbjct:   673 GARGERGFPGERGVQGPPGPAGPRGANGAPGNDGAKGDAGAPG-APGSQGAPGLQGMPGE 731

Query:   241 KGP-GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVP 299
             +G  G    KG   DA  GP       P  D  RGL   +  GP       PG + +  P
Sbjct:   732 RGAAGLPGPKGDRGDA--GPK-GADGAPGKDGVRGLTGPI--GPP-GPAGAPGDKGEAGP 785

Query:   300 GYDVQRGPVYEAQRAPSYIPQRGP-GYDLQRGQ-GYDMRRAPSYDPS-RGTGFDGAPRG- 355
                   GP   A+ AP    + GP G     G  G D +     +P   G   D  P G 
Sbjct:   786 SGPA--GPT-GARGAPGDRGEPGPPGPAGFAGPPGADGQPGAKGEPGDAGAKGDAGPPGP 842

Query:   356 AAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 387
             A P G  P P+ NV  G+  P    GS  P G
Sbjct:   843 AGPAGP-PGPIGNV--GAPGPKGARGSAGPPG 871


>UNIPROTKB|F1LN37 [details] [associations]
            symbol:Col2a1 "Collagen alpha-1(II) chain" species:10116
            "Rattus norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 RGD:2375 GO:GO:0005737 GO:GO:0043066
            GO:GO:0005615 GO:GO:0003007 GO:GO:0007601 GO:GO:0030199
            GO:GO:0007417 GO:GO:0042472 GO:GO:0001894 GO:GO:0007605
            GO:GO:0071773 InterPro:IPR008160 Pfam:PF01391 GO:GO:0071599
            GO:GO:0005604 GO:GO:0001502 GO:GO:0060021 GO:GO:0002062
            GO:GO:0010468 GO:GO:0060272 GO:GO:0006029 GO:GO:0001958
            GO:GO:0060351 GO:GO:0005201 GO:GO:0005585 GO:GO:0060174
            GO:GO:0030903 IPI:IPI00388575 Ensembl:ENSRNOT00000037840
            ArrayExpress:F1LN37 Uniprot:F1LN37
        Length = 1487

 Score = 125 (49.1 bits), Expect = 0.00037, P = 0.00037
 Identities = 87/281 (30%), Positives = 99/281 (35%)

Query:   126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 183
             ADG  G      E    G   G    +   G P   GP       G  GA GP  +T   
Sbjct:   841 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 899

Query:   184 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 239
              AA + G P       P GP      GP G D  K    D    G + DP  +GP   P 
Sbjct:   900 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 954

Query:   240 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 295
              KG PG D   GS  D   GP     +G +   QRG+ G   QRG   +    GP  E  
Sbjct:   955 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 1005

Query:   296 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 351
              Q  PG    RGP           P   PG +   G      R  A      RG TG  G
Sbjct:  1006 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 1065

Query:   352 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 391
             AP    P G  P P    P G       +G+  P G   PA
Sbjct:  1066 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1103

 Score = 125 (49.1 bits), Expect = 0.00037, P = 0.00037
 Identities = 89/296 (30%), Positives = 110/296 (37%)

Query:   120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSAT---TAGVVGAG 176
             P  DR  D    GA G    +  G P G        G P   GPP        A + G  
Sbjct:   132 PRGDR-GDKGERGAPGPRGRD--GEP-GTPGNPGPPGPPGPPGPPGLGGGNFAAQMAGGF 187

Query:   177 PNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAK 233
                +  A      G PM      PRGP G   + GP G+  +     +P   GP   P  
Sbjct:   188 DEKAGGAQMGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GPRG 243

Query:   234 GPGYDPTKGPGYDAQKGSNYDA-QRG-PNYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQR 289
              PG  P   PG D + G    A +RG P     RG    P  GL G    RG P  D  +
Sbjct:   244 PPG--PAGKPGDDGEAGKPGKAGERGLPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAK 299

Query:   290 G----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS---- 340
             G    PG + +   PG +   GP+   +  P    + GP       +G D +  P+    
Sbjct:   300 GEAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPG 357

Query:   341 -YDPSRGTGFDGAP--RG-AAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNP 390
                P+ G GF GAP  +G A P G   P       GS   P   GS  P G  GNP
Sbjct:   358 PVGPAGGPGFPGAPGAKGEAGPTGARGP---EGAQGSRGEPGNPGSPGPAGASGNP 410


>UNIPROTKB|E1BLD0 [details] [associations]
            symbol:LOC100847165 "Uncharacterized protein" species:9913
            "Bos taurus" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0005622 "intracellular" evidence=IEA] [GO:0003676 "nucleic acid
            binding" evidence=IEA] Pfam:PF00096 InterPro:IPR007087
            InterPro:IPR013087 InterPro:IPR015880 PROSITE:PS00028
            PROSITE:PS50157 SMART:SM00355 GO:GO:0008270 GO:GO:0003676
            GO:GO:0005622 Gene3D:3.30.160.60 GeneTree:ENSGT00700000104520
            OMA:SRYESQN EMBL:DAAA02057905 IPI:IPI00717370
            Ensembl:ENSBTAT00000061583 Uniprot:E1BLD0
        Length = 540

 Score = 120 (47.3 bits), Expect = 0.00037, P = 0.00037
 Identities = 40/160 (25%), Positives = 70/160 (43%)

Query:   117 MNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYED---GYGV-PQGHGPPPSATTAGV 172
             M +P+     +GS  G    +E E   +  G   YE     +G+ PQ  G  P +     
Sbjct:    15 MQSPDEMGSPEGSLKGNMSENEEEEISQQEGTGDYEVEEIAFGLEPQSPGFGPQSPEFEP 74

Query:   173 VGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPA 232
                     +  + +   G    +    PR P  + S+ P ++  ++P Y+P + P Y+P 
Sbjct:    75 QSPRFEPESPGFESRSPGFVPPSPEFAPRSPESD-SQSPDFEP-QSPRYEP-QSPGYEP- 130

Query:   233 KGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQ 272
             K PGY+P + PGY+  K   Y+ Q  P +   + P ++ +
Sbjct:   131 KSPGYEP-RSPGYEP-KSPGYEPQN-PEFKT-QSPEFEAE 166


>UNIPROTKB|F1NI72 [details] [associations]
            symbol:COL3A1 "Collagen alpha-1(III) chain" species:9031
            "Gallus gallus" [GO:0001568 "blood vessel development"
            evidence=IEA] [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005586 "collagen type III"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0007160 "cell-matrix adhesion" evidence=IEA] [GO:0007179
            "transforming growth factor beta receptor signaling pathway"
            evidence=IEA] [GO:0007229 "integrin-mediated signaling pathway"
            evidence=IEA] [GO:0007507 "heart development" evidence=IEA]
            [GO:0009314 "response to radiation" evidence=IEA] [GO:0018149
            "peptide cross-linking" evidence=IEA] [GO:0030199 "collagen fibril
            organization" evidence=IEA] [GO:0032964 "collagen biosynthetic
            process" evidence=IEA] [GO:0034097 "response to cytokine stimulus"
            evidence=IEA] [GO:0042060 "wound healing" evidence=IEA] [GO:0043206
            "extracellular fibril organization" evidence=IEA] [GO:0043588 "skin
            development" evidence=IEA] [GO:0046332 "SMAD binding" evidence=IEA]
            [GO:0048407 "platelet-derived growth factor binding" evidence=IEA]
            [GO:0048565 "digestive tract development" evidence=IEA] [GO:0050777
            "negative regulation of immune response" evidence=IEA] [GO:0071230
            "cellular response to amino acid stimulus" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0005615 GO:GO:0034097
            GO:GO:0030199 GO:GO:0007179 GO:GO:0007229 GO:GO:0007160
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0042060 GO:GO:0050777
            GO:GO:0009314 GO:GO:0018149 GO:GO:0071230 GO:GO:0043206
            GO:GO:0005201 GeneTree:ENSGT00660000095287 GO:GO:0005586
            EMBL:AADN02034558 EMBL:AADN02034559 IPI:IPI00589264
            Ensembl:ENSGALT00000004033 OMA:ETCLSAN ArrayExpress:F1NI72
            Uniprot:F1NI72
        Length = 1498

 Score = 125 (49.1 bits), Expect = 0.00037, P = 0.00037
 Identities = 78/276 (28%), Positives = 97/276 (35%)

Query:   132 GATGN-SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSG 190
             G  G   +N   G P G        G P   GPP      G  G  P  +       + G
Sbjct:   464 GTPGEPGKNGAKGDP-GPKGERGENGTPGAPGPPGEEGKRGANGE-PGQNGVPGTPGERG 521

Query:   191 TPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAK--GPGYDPTKG-PGYD 246
             +P      +P   G    KGP G   S  P   P+ GP+ D  +  GPG    +G PG  
Sbjct:   522 SP--GFRGLPGSNGLPGEKGPAGERGSPGPP-GPS-GPAGDRGQDGGPGLPGMRGLPGIP 577

Query:   247 AQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYD--MQRGPGYE-TQRVPGYDV 303
                GS  D + GP      G   +P R  G     GP     +   PG +  +  PG + 
Sbjct:   578 GSPGS--DGKPGPP-----GNQGEPGRS-GPPGPAGPRGQPGVMGFPGPKGNEGAPGKNG 629

Query:   304 QRGPVYEAQRAPSYIPQRG----PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPH 359
             +RGP       P    + G    PG     G   D R  P   PS   G  G P G  P 
Sbjct:   630 ERGPG-GPPGTPGPAGKNGDVGLPGPPGPAGPAGD-RGEPG--PSGSPGLQGLPGGPGPA 685

Query:   360 GQVPPPLNNVPYGSATPPARSGSGQPRGGN--PARR 393
             G+   P    P G    P   G   P+G N  P  R
Sbjct:   686 GENGKPGEPGPKGDIGGPGFPG---PKGENGIPGER 718

 Score = 123 (48.4 bits), Expect = 0.00062, P = 0.00062
 Identities = 84/275 (30%), Positives = 104/275 (37%)

Query:   142 SGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 201
             +G P G        G+P   G P      G+ G  P TS +  A    G P +       
Sbjct:   424 AGSP-GNKGEMGPSGIPGAPGLPGGR---GLPGP-PGTSGNPGAKGTPGEPGKNGAKGDP 478

Query:   202 GP-GYEASKG-PGYDASKAPSYDPTKGPSYDPAKG--PGYDPTKG-PGYDAQKGSN-YDA 255
             GP G     G PG  A   P  +  +G + +P +   PG    +G PG+    GSN    
Sbjct:   479 GPKGERGENGTPG--APGPPGEEGKRGANGEPGQNGVPGTPGERGSPGFRGLPGSNGLPG 536

Query:   256 QRGPNYDIHR----GPSYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVPGYDVQRGPVYE 310
             ++GP  +       GPS  P    G D   GP     RG PG      PG D + GP   
Sbjct:   537 EKGPAGERGSPGPPGPS-GPAGDRGQD--GGPGLPGMRGLPGIPGS--PGSDGKPGPPGN 591

Query:   311 AQRAPSYIPQRGP-GYDLQRG-QGYDMRR----APSYDPSRGTGFD-GAPRGAAPHGQV- 362
              Q  P      GP G   Q G  G+   +    AP  +  RG G   G P  A  +G V 
Sbjct:   592 -QGEPGRSGPPGPAGPRGQPGVMGFPGPKGNEGAPGKNGERGPGGPPGTPGPAGKNGDVG 650

Query:   363 -P-PPLNNVPYGSATPPARSGS----GQPRGGNPA 391
              P PP    P G    P  SGS    G P G  PA
Sbjct:   651 LPGPPGPAGPAGDRGEPGPSGSPGLQGLPGGPGPA 685


>UNIPROTKB|O43186 [details] [associations]
            symbol:CRX "Cone-rod homeobox protein" species:9606 "Homo
            sapiens" [GO:0043565 "sequence-specific DNA binding" evidence=IEA]
            [GO:0006351 "transcription, DNA-dependent" evidence=IEA]
            [GO:0007601 "visual perception" evidence=IEA] [GO:0050896 "response
            to stimulus" evidence=IEA] [GO:0003682 "chromatin binding"
            evidence=IEA] [GO:0003700 "sequence-specific DNA binding
            transcription factor activity" evidence=IEA] [GO:0005667
            "transcription factor complex" evidence=IEA] [GO:0045944 "positive
            regulation of transcription from RNA polymerase II promoter"
            evidence=IEA] [GO:0060041 "retina development in camera-type eye"
            evidence=IEA] [GO:0043522 "leucine zipper domain binding"
            evidence=IPI] [GO:0009887 "organ morphogenesis" evidence=TAS]
            InterPro:IPR001356 InterPro:IPR009057 InterPro:IPR013851
            InterPro:IPR017970 Pfam:PF00046 Pfam:PF03529 PROSITE:PS00027
            PROSITE:PS50071 SMART:SM00389 GO:GO:0007601 GO:GO:0043565
            GO:GO:0045944 GO:GO:0003700 GO:GO:0006351 GO:GO:0003682
            Orphanet:1872 Orphanet:791 GO:GO:0050896 Gene3D:1.10.10.60
            SUPFAM:SSF46689 GO:GO:0005667 GO:GO:0009887 GO:GO:0060041
            Orphanet:65 MIM:268000 CTD:1406 eggNOG:NOG324074
            HOGENOM:HOG000082677 HOVERGEN:HBG004028 KO:K09337 OMA:QTKARPA
            OrthoDB:EOG4NKBWG EMBL:AF024711 EMBL:BT007364 EMBL:AC008745
            EMBL:BC016664 EMBL:BC053672 IPI:IPI00011226 RefSeq:NP_000545.1
            UniGene:Hs.617342 UniGene:Hs.633434 UniGene:Hs.639114
            ProteinModelPortal:O43186 SMR:O43186 IntAct:O43186
            MINT:MINT-1442706 STRING:O43186 PhosphoSite:O43186 PRIDE:O43186
            DNASU:1406 Ensembl:ENST00000221996 Ensembl:ENST00000539067
            Ensembl:ENST00000556900 Ensembl:ENST00000557738 GeneID:1406
            KEGG:hsa:1406 UCSC:uc002phq.4 GeneCards:GC19P048327 HGNC:HGNC:2383
            HPA:HPA036762 HPA:HPA036763 MIM:120970 MIM:602225 MIM:613829
            neXtProt:NX_O43186 PharmGKB:PA26903 InParanoid:O43186
            PhylomeDB:O43186 ChiTaRS:CRX GenomeRNAi:1406 NextBio:5749
            ArrayExpress:O43186 Bgee:O43186 CleanEx:HS_CRX
            Genevestigator:O43186 GermOnline:ENSG00000105392 Uniprot:O43186
        Length = 299

 Score = 116 (45.9 bits), Expect = 0.00037, P = 0.00037
 Identities = 29/98 (29%), Positives = 42/98 (42%)

Query:   158 PQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASK 217
             P    P P A  AG+V +GP+ +++ YA T +  P  A    P   G  +S   G D   
Sbjct:   165 PASESPLPEAQRAGLVASGPSLTSAPYAMTYA--PASAFCSSPSAYGSPSSYFSGLDPYL 222

Query:   218 APSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDA 255
             +P      GP+  P  GP   P+      +  G +Y A
Sbjct:   223 SPMVPQLGGPALSPLSGPSVGPSLAQSPTSLSGQSYGA 260


>ZFIN|ZDB-GENE-030131-4487 [details] [associations]
            symbol:sec24c "SEC24 family, member C (S.
            cerevisiae)" species:7955 "Danio rerio" [GO:0030127 "COPII vesicle
            coat" evidence=IEA] [GO:0006886 "intracellular protein transport"
            evidence=IEA] [GO:0006888 "ER to Golgi vesicle-mediated transport"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0006810 "transport" evidence=IEA] [GO:0015031 "protein
            transport" evidence=IEA] InterPro:IPR006895 InterPro:IPR006896
            InterPro:IPR006900 Pfam:PF04810 Pfam:PF04811 Pfam:PF04815
            ZFIN:ZDB-GENE-030131-4487 GO:GO:0006886 GO:GO:0008270
            InterPro:IPR007123 Pfam:PF00626 GO:GO:0006888 GO:GO:0030127
            SUPFAM:SSF82919 InterPro:IPR012990 Pfam:PF08033 SUPFAM:SSF81811
            GeneTree:ENSGT00590000082962 EMBL:CU469520 EMBL:CU694198
            IPI:IPI00972073 Ensembl:ENSDART00000085476 ArrayExpress:F1R9P2
            Bgee:F1R9P2 Uniprot:F1R9P2
        Length = 1241

 Score = 124 (48.7 bits), Expect = 0.00038, P = 0.00038
 Identities = 82/291 (28%), Positives = 110/291 (37%)

Query:   131 GGATGNSENETSGRPV--GQNAYED-GYGVPQGHGPPPS-ATTAGVVGAGPNTSTSAYAA 186
             G   G  E  TSG P   G  +Y   G G  Q +GPPP  A   G + + P+T  +   +
Sbjct:    70 GPPQGMREPPTSGTPPVSGAQSYSQFGQGETQ-NGPPPMVAPPQGTLVSQPHTPNAVSLS 128

Query:   187 TQSGTPMRAAYDIPR-GPGYEASKGPGYDA-SKAPSYDPTKGPSYDP---AKGP---GYD 238
               +  P    +  P  G     ++       S APS  P  GP Y P   A+ P    Y 
Sbjct:   129 GPTQPPYGQQFGSPPIGMQQMTNQMASMQVGSTAPS--PA-GPGYAPPSTAQAPISAAYT 185

Query:   239 PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDM---QRGPGYE 294
             P+  P +     S+  +Q  P   + + P   P  G     Q+  PN        GP  +
Sbjct:   186 PSAPPTFPPT--SSAPSQPPPTEAVAQAPP-QPYYGAPPPAQQPFPNAVSTFSSAGPT-Q 241

Query:   295 TQRVPGYDVQRGPVYEAQRAPSY--IPQRGP----GYDLQRGQGYDMRRAPSYDPSRGTG 348
              Q  P    Q  P   A   P +   P  GP    G  L   Q    +RAP      G  
Sbjct:   242 PQAPPSVSQQSFPQAPAVSQPPFSTAPPPGPSQSYGGPLPPTQP-SFQRAPLPTSQPGV- 299

Query:   349 FDGAPRGAAPHGQVP------PPLNNV-PYGSATPPARSGSGQPRGGNPAR 392
             F G P   + H Q+P      PP++   PY S  PP  + S  P+ G P R
Sbjct:   300 FPGGPPPTSTHSQLPGPMPPQPPVSQPSPYYSEPPPT-TASFPPQVGAPPR 349


>UNIPROTKB|P15941 [details] [associations]
            symbol:MUC1 "Mucin-1" species:9606 "Homo sapiens"
            [GO:0005576 "extracellular region" evidence=IEA] [GO:0005737
            "cytoplasm" evidence=IBA] [GO:0009986 "cell surface" evidence=IBA]
            [GO:0016324 "apical plasma membrane" evidence=IBA] [GO:0005887
            "integral to plasma membrane" evidence=TAS] [GO:0005796 "Golgi
            lumen" evidence=TAS] [GO:0016266 "O-glycan processing"
            evidence=TAS] [GO:0043687 "post-translational protein modification"
            evidence=TAS] [GO:0044267 "cellular protein metabolic process"
            evidence=TAS] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0002039 "p53 binding" evidence=IPI] [GO:0006977 "DNA damage
            response, signal transduction by p53 class mediator resulting in
            cell cycle arrest" evidence=IDA] [GO:0000790 "nuclear chromatin"
            evidence=IDA] [GO:0090240 "positive regulation of histone H4
            acetylation" evidence=IDA] [GO:0000978 "RNA polymerase II core
            promoter proximal region sequence-specific DNA binding"
            evidence=IDA] [GO:0043618 "regulation of transcription from RNA
            polymerase II promoter in response to stress" evidence=IDA]
            [GO:0006978 "DNA damage response, signal transduction by p53 class
            mediator resulting in transcription of p21 class mediator"
            evidence=IDA] [GO:0010944 "negative regulation of transcription by
            competitive promoter binding" evidence=IDA] [GO:0003712
            "transcription cofactor activity" evidence=IDA] [GO:0036003
            "positive regulation of transcription from RNA polymerase II
            promoter in response to stress" evidence=IDA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IDA] Reactome:REACT_17015
            PANTHER:PTHR10006 GO:GO:0043066 GO:GO:0005576 GO:GO:0009986
            GO:GO:0005887 GO:GO:0006977 GO:GO:0016324 GO:GO:0000978
            GO:GO:0000790 GO:GO:0003712 GO:GO:0043687 InterPro:IPR000082
            Pfam:PF01390 SMART:SM00200 PROSITE:PS50024 GO:GO:0005796
            EMBL:CH471121 GO:GO:0010944 GO:GO:0090240 PDB:2FO4 PDBsum:2FO4
            GO:GO:0016266 GO:GO:0006978 EMBL:AL713999 GO:GO:0036003
            MEROPS:S71.001 CTD:4582 eggNOG:NOG77744 KO:K06568
            InterPro:IPR023217 EMBL:J05582 EMBL:M32738 EMBL:M32739 EMBL:M34089
            EMBL:M34088 EMBL:J05581 EMBL:M61170 EMBL:X52229 EMBL:X52228
            EMBL:M35093 EMBL:X80761 EMBL:U60259 EMBL:U60260 EMBL:U60261
            EMBL:AF125525 EMBL:AF348143 EMBL:AY327582 EMBL:AY463543
            EMBL:BC120974 EMBL:Z17324 EMBL:Z17325 EMBL:M31823 EMBL:S81781
            EMBL:S81736 EMBL:M21868 IPI:IPI00013955 IPI:IPI00218163
            IPI:IPI00218164 IPI:IPI00218165 IPI:IPI00218166 IPI:IPI00218168
            IPI:IPI00218169 IPI:IPI00607673 IPI:IPI00902840 IPI:IPI00978078
            PIR:A35175 RefSeq:NP_001018016.1 RefSeq:NP_001018017.1
            RefSeq:NP_001037855.1 RefSeq:NP_001037856.1 RefSeq:NP_001037857.1
            RefSeq:NP_001037858.1 RefSeq:NP_001191214.1 RefSeq:NP_001191215.1
            RefSeq:NP_001191216.1 RefSeq:NP_001191217.1 RefSeq:NP_001191218.1
            RefSeq:NP_001191219.1 RefSeq:NP_001191220.1 RefSeq:NP_001191221.1
            RefSeq:NP_001191222.1 RefSeq:NP_001191223.1 RefSeq:NP_001191224.1
            RefSeq:NP_001191225.1 RefSeq:NP_001191226.1 RefSeq:NP_002447.4
            UniGene:Hs.89603 PDB:2ACM PDBsum:2ACM ProteinModelPortal:P15941
            SMR:P15941 IntAct:P15941 MINT:MINT-156679 STRING:P15941
            GlycoSuiteDB:P15941 PhosphoSite:P15941 DMDM:296439295 PaxDb:P15941
            PRIDE:P15941 DNASU:4582 Ensembl:ENST00000337604
            Ensembl:ENST00000343256 Ensembl:ENST00000368389
            Ensembl:ENST00000368390 Ensembl:ENST00000368398 GeneID:4582
            KEGG:hsa:4582 UCSC:uc001fib.3 GeneCards:GC01M155158 HGNC:HGNC:7508
            HPA:CAB000036 HPA:CAB001986 HPA:HPA004179 HPA:HPA007235
            HPA:HPA008855 MIM:113720 MIM:158340 neXtProt:NX_P15941
            PharmGKB:PA31309 ChiTaRS:MUC1 EvolutionaryTrace:P15941
            GenomeRNAi:4582 NextBio:17597 Bgee:P15941 Genevestigator:P15941
            GermOnline:ENSG00000185499 Uniprot:P15941
        Length = 1255

 Score = 124 (48.7 bits), Expect = 0.00039, P = 0.00039
 Identities = 65/275 (23%), Positives = 91/275 (33%)

Query:   126 ADGSYGGATGNSENETSGRPVG--QNAYEDGYGVPQGHGPPP-SATTAGV-VGAGPNTST 181
             A  + GG    S  + S  P    +NA      V   H P   S+TT G  V   P T  
Sbjct:    27 ASSTPGGEKETSATQRSSVPSSTEKNAVSMTSSVLSSHSPGSGSSTTQGQDVTLAPATEP 86

Query:   182 SAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK 241
             ++ +A   G  + +   + R P   ++  P +D + AP   P  G +  PA G    P  
Sbjct:    87 ASGSAATWGQDVTSV-PVTR-PALGSTTPPAHDVTSAPDNKPAPGSTAPPAHGVTSAPDT 144

Query:   242 GPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL--GYDMQRGPNYDMQRGPGY----ET 295
              P   +     +     P+     G +  P  G+    D +  P        G     +T
Sbjct:   145 RPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDT 204

Query:   296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG--QGYDMRRAPSYDPSRGTGFDGAP 353
             +  PG      P +    AP   P  G       G     D R AP        G   AP
Sbjct:   205 RPAPGSTAP--PAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAP 262

Query:   354 RGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 388
                   G   PP + V     T PA   +  P  G
Sbjct:   263 DTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHG 297


>RGD|1308535 [details] [associations]
            symbol:Pygo2 "pygopus 2" species:10116 "Rattus norvegicus"
            [GO:0001701 "in utero embryonic development" evidence=IEA;ISO]
            [GO:0001822 "kidney development" evidence=IEA;ISO] [GO:0002088
            "lens development in camera-type eye" evidence=IEA;ISO] [GO:0005634
            "nucleus" evidence=IEA;ISO] [GO:0007420 "brain development"
            evidence=IEA;ISO] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0009791 "post-embryonic development" evidence=IEA;ISO]
            [GO:0016055 "Wnt receptor signaling pathway" evidence=ISO]
            [GO:0030879 "mammary gland development" evidence=IEA;ISO]
            [GO:0033599 "regulation of mammary gland epithelial cell
            proliferation" evidence=IEA;ISO] [GO:0042393 "histone binding"
            evidence=IEA;ISO] [GO:0048589 "developmental growth"
            evidence=IEA;ISO] [GO:0051569 "regulation of histone H3-K4
            methylation" evidence=IEA;ISO] [GO:0060021 "palate development"
            evidence=IEA;ISO] [GO:0060070 "canonical Wnt receptor signaling
            pathway" evidence=IEA;ISO] InterPro:IPR001965 InterPro:IPR019787
            Pfam:PF00628 PROSITE:PS50016 SMART:SM00249 RGD:1308535
            GO:GO:0005634 GO:GO:0007420 GO:GO:0046872 GO:GO:0008270
            GO:GO:0001701 GO:GO:0009791 GO:GO:0001822 Gene3D:3.30.40.10
            InterPro:IPR011011 InterPro:IPR013083 SUPFAM:SSF57903 GO:GO:0048589
            InterPro:IPR019786 PROSITE:PS01359 GO:GO:0060021 GO:GO:0060070
            GO:GO:0030879 GO:GO:0033599 GO:GO:0051569 GO:GO:0002088
            EMBL:CH473976 eggNOG:NOG72798 HOGENOM:HOG000001580
            HOVERGEN:HBG053774 GeneTree:ENSGT00530000063948 CTD:90780
            OMA:PGLVYPC OrthoDB:EOG4QZ7MB EMBL:BC169054 IPI:IPI00368626
            RefSeq:NP_001099917.1 UniGene:Rn.24988 STRING:B5DFG8
            Ensembl:ENSRNOT00000028052 GeneID:295251 KEGG:rno:295251
            UCSC:RGD:1308535 NextBio:639221 Genevestigator:B5DFG8
            Uniprot:B5DFG8
        Length = 405

 Score = 118 (46.6 bits), Expect = 0.00040, P = 0.00040
 Identities = 79/294 (26%), Positives = 110/294 (37%)

Query:   117 MNAPNVDRRADGSYGGATGN-SENETSGRP-----VGQNAYEDGYGVPQ--GHGPPPSAT 168
             M +P   RR   + G A  + +E      P     V  N +ED +G P+  G GPP    
Sbjct:    38 MKSPEKKRRKSNTQGPAYSHLTEFAPPPTPMVDHLVASNPFEDDFGAPKMGGAGPP---- 93

Query:   169 TAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTK-GP 227
                 +G+ P      +   Q G     A  +P G G     GP     + P + P   GP
Sbjct:    94 ---FLGS-P-VPFGGFRV-QGGM----AGQVPPGYGTGGGGGPQPLRRQPPPFPPNPMGP 143

Query:   228 SYD-PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYD-PQRGLGY----DMQR 281
             +++ P +GPGY P     + +Q    ++   G N+    G     P  G G      M +
Sbjct:   144 AFNMPPQGPGYPPPGNMNFPSQP---FNQSLGQNFSPPGGQMIPGPVGGFGPMISPTMGQ 200

Query:   282 GPNYDMQRGPGYETQRVPGYDVQRGPVYE--AQRAPSYIPQRGP--GYDLQ-RGQGYDMR 336
              P  ++  GP    QR        GP  +   Q  PS  P   P  G D    G G +  
Sbjct:   201 PPRGEL--GPPPLPQRFTQPGAPFGPSLQRPGQGLPSLPPNTSPFPGPDPGFPGPGGEDG 258

Query:   337 RAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNP 390
               P  +P   T F   P   +P   V     N P   + PP+ SG G   GG P
Sbjct:   259 GKP-LNPPAPTAFPQEPHSGSPAAAVN---GNQP---SFPPSSSGRG---GGTP 302


>UNIPROTKB|F1LNH3 [details] [associations]
            symbol:Col6a2 "Protein Col6a2" species:10116 "Rattus
            norvegicus" [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0031012 "extracellular matrix" evidence=IEA] [GO:0042383
            "sarcolemma" evidence=IEA] [GO:0043234 "protein complex"
            evidence=IEA] [GO:0070208 "protein heterotrimerization"
            evidence=IEA] InterPro:IPR002035 Pfam:PF00092 PROSITE:PS50234
            SMART:SM00327 RGD:1305585 GO:GO:0005615 GO:GO:0043234 GO:GO:0042383
            GO:GO:0070208 InterPro:IPR008160 Pfam:PF01391 GO:GO:0031012
            GeneTree:ENSGT00530000063022 OMA:RALCNHD IPI:IPI00372839
            Ensembl:ENSRNOT00000001695 ArrayExpress:F1LNH3 Uniprot:F1LNH3
        Length = 1025

 Score = 123 (48.4 bits), Expect = 0.00040, P = 0.00040
 Identities = 88/284 (30%), Positives = 99/284 (34%)

Query:   126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AG-PNTSTSA 183
             +DG  G      +N T G    Q       G P   G P S    G  G AG P      
Sbjct:   320 SDGRKGAPGLAGKNGTDG----QKGKLGRIGPPGCKGDPGSRGPDGYPGEAGSPGEQGDQ 375

Query:   184 YAATQSGTPMRAAYDIPRGPGYEASKGPGYDA-SKAPSYDPTKGPSYDPA----KG-PGY 237
              A   SG P R     P  PG + SKG  Y   S AP     KG    P     KG PG 
Sbjct:   376 GAKGDSGRPGRRGP--PGNPGDKGSKG--YRGNSGAPGSPGVKGGKGGPGPRGPKGEPGR 431

Query:   238 --DP-TKG-PGYDAQKGSNYD-AQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPG 292
               DP TKG PG D  KG   D    GP        S   +   G    RGP   +   PG
Sbjct:   432 RGDPGTKGGPGSDGPKGEKGDPGPEGPRGLAGEIGSKGAKGDRGLPGPRGPQGALGE-PG 490

Query:   293 YETQRV-PGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRA----PSYDPSRGT 347
              +  R  PG    RG     Q  P   P R PG+     +G    +     P  +  RG 
Sbjct:   491 KQGSRGDPGDAGPRGD--SGQPGPKGDPGR-PGFSYPGPRGTPGEKGEPGPPGPEGGRGD 547

Query:   348 -GFDGAPRGAAPHGQV--P-PPLNNVPYGSATPPARSGSGQPRG 387
              G  GAP      G+   P PP    P G    P   G   P G
Sbjct:   548 FGLKGAPGRKGEKGEPADPGPPGEPGPRGPRGIPGPEGEPGPPG 591


>FB|FBgn0003980 [details] [associations]
            symbol:Vm26Ab "Vitelline membrane 26Ab" species:7227
            "Drosophila melanogaster" [GO:0007304 "chorion-containing eggshell
            formation" evidence=IMP] [GO:0007305 "vitelline membrane formation
            involved in chorion-containing eggshell formation" evidence=NAS]
            [GO:0008316 "structural constituent of vitelline membrane"
            evidence=NAS] [GO:0007343 "egg activation" evidence=IMP]
            [GO:0060388 "vitelline envelope" evidence=IDA] GO:GO:0005576
            EMBL:AE014134 GO:GO:0007304 GO:GO:0007343 eggNOG:NOG295326
            PROSITE:PS51137 GeneTree:ENSGT00540000073505 GO:GO:0060388
            InterPro:IPR013135 Pfam:PF10542 EMBL:M20936 EMBL:EF441676
            PIR:A45943 RefSeq:NP_476784.1 UniGene:Dm.26740 DIP:DIP-19185N
            IntAct:P13238 MINT:MINT-1563965 STRING:P13238
            EnsemblMetazoa:FBtr0079171 GeneID:33827 KEGG:dme:Dmel_CG9046
            CTD:33827 FlyBase:FBgn0003980 InParanoid:P13238 OMA:RAAYGGY
            PhylomeDB:P13238 GenomeRNAi:33827 NextBio:785460 Bgee:P13238
            GermOnline:CG9046 Uniprot:P13238
        Length = 168

 Score = 108 (43.1 bits), Expect = 0.00041, P = 0.00041
 Identities = 28/92 (30%), Positives = 35/92 (38%)

Query:   166 SATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTK 225
             S    G  GA P  +  +Y+A  +  P   AY  P  P Y A   P Y A  AP+Y    
Sbjct:    45 SRAAYGGYGAAP--AAPSYSAPAA--PAAQAYSAPAAPAYSAPAAPAYSAPAAPAYSAPA 100

Query:   226 GPSYDPAKGPGYD-PTKGPGYDAQKGSNYDAQ 256
              P+Y     P Y  P   P     K   +  Q
Sbjct:   101 APAYSAPAAPAYSAPASIPSPPCPKNYLFSCQ 132


>UNIPROTKB|I3L781 [details] [associations]
            symbol:I3L781 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0005201 GeneTree:ENSGT00660000095287
            Ensembl:ENSSSCT00000024528 OMA:EVSMPEI Uniprot:I3L781
        Length = 1087

 Score = 123 (48.4 bits), Expect = 0.00043, P = 0.00043
 Identities = 83/271 (30%), Positives = 99/271 (36%)

Query:   132 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPP-----PS-ATTAGVVGAGPNTSTSAYA 185
             GA G   N  +  P G    + G G     GPP     P  A TAG VG           
Sbjct:   518 GAPGPDGNNGAQGPPGPQGVQGGKGEQGPAGPPGFQGLPGPAGTAGEVGKPGERGIPG-- 575

Query:   186 ATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDPTKGPSYDPAKG-PGYDPTKG 242
               + G P  A     RGP G   + GP G   S+ PS  P  GP  D  KG PG      
Sbjct:   576 --EFGLPGPAGPRGERGPPGESGAAGPAGPIGSRGPSGPP--GP--DGNKGEPGV--LGA 627

Query:   243 PGYDAQKG-SNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVP- 299
             PG     G S    +RG    I  G     + GL  D+   P  D  RG PG      P 
Sbjct:   628 PGTAGPSGPSGLPGERGAA-GIPGGKGEKGETGLRGDVG-SPGRDGARGAPGAVGAPGPA 685

Query:   300 GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP--RGA- 356
             G +  RG    A  A    P+  PG   +RG+           P+   G  GA   RG  
Sbjct:   686 GANGDRGEAGPAGPAGPAGPRGSPG---ERGEVGPAGPNGFAGPAGAAGQPGAKGERGTK 742

Query:   357 APHGQVPPPLNNVPYGSATPPARSGSGQPRG 387
              P G+  P     P G+A P   +G   P G
Sbjct:   743 GPKGENGPVGPTGPVGAAGPAGPNGPPGPAG 773


>UNIPROTKB|P08123 [details] [associations]
            symbol:COL1A2 "Collagen alpha-2(I) chain" species:9606
            "Homo sapiens" [GO:0046872 "metal ion binding" evidence=IEA]
            [GO:0046332 "SMAD binding" evidence=IEA] [GO:0070208 "protein
            heterotrimerization" evidence=IEA] [GO:0071230 "cellular response
            to amino acid stimulus" evidence=IEA] [GO:0005584 "collagen type I"
            evidence=IDA;IMP;TAS] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0001501 "skeletal system development" evidence=IMP] [GO:0042476
            "odontogenesis" evidence=NAS] [GO:0008217 "regulation of blood
            pressure" evidence=IMP] [GO:0007179 "transforming growth factor
            beta receptor signaling pathway" evidence=IDA] [GO:0005201
            "extracellular matrix structural constituent" evidence=NAS]
            [GO:0042802 "identical protein binding" evidence=IDA] [GO:0005615
            "extracellular space" evidence=IDA] [GO:0030674 "protein binding,
            bridging" evidence=IMP] [GO:0030199 "collagen fibril organization"
            evidence=IMP] [GO:0007266 "Rho protein signal transduction"
            evidence=IDA] [GO:0043589 "skin morphogenesis" evidence=IMP]
            [GO:0001568 "blood vessel development" evidence=IMP] [GO:0070062
            "extracellular vesicular exosome" evidence=IDA] [GO:0048407
            "platelet-derived growth factor binding" evidence=IDA] [GO:0005576
            "extracellular region" evidence=TAS] [GO:0005788 "endoplasmic
            reticulum lumen" evidence=TAS] [GO:0007411 "axon guidance"
            evidence=TAS] [GO:0007596 "blood coagulation" evidence=TAS]
            [GO:0030168 "platelet activation" evidence=TAS] [GO:0030198
            "extracellular matrix organization" evidence=TAS] [GO:0050900
            "leukocyte migration" evidence=TAS] [GO:0031012 "extracellular
            matrix" evidence=IDA] Reactome:REACT_604 InterPro:IPR000885
            Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461 SMART:SM00038
            Reactome:REACT_118779 Reactome:REACT_111045 Reactome:REACT_111102
            GO:GO:0007411 GO:GO:0005615 GO:GO:0030168 GO:GO:0046872
            GO:GO:0050900 GO:GO:0070062 GO:GO:0030199 GO:GO:0030674
            GO:GO:0005788 GO:GO:0042802 GO:GO:0001501 GO:GO:0008217
            GO:GO:0007179 GO:GO:0007266
            Pathway_Interaction_DB:endothelinpathway GO:GO:0070208
            InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793 GO:GO:0001568
            Pathway_Interaction_DB:il4_2pathway
            Pathway_Interaction_DB:smad2_3nuclearpathway
            Pathway_Interaction_DB:lymphangiogenesis_pathway GO:GO:0042476
            GO:GO:0071230 Orphanet:216812 EMBL:AC002528 GO:GO:0005201
            GO:GO:0043589 HOVERGEN:HBG004933 KO:K06236 GO:GO:0005584 MIM:130060
            MIM:166200 MIM:166210 MIM:166220 MIM:259420 Orphanet:230857
            Orphanet:216796 Orphanet:216804 Orphanet:216820 DrugBank:DB00048
            GO:GO:0048407 CTD:1278 OrthoDB:EOG412M65 EMBL:J03464 EMBL:Z74616
            EMBL:AF004877 EMBL:BC042586 EMBL:BC054498 EMBL:Y00724 EMBL:X02488
            EMBL:AB004317 EMBL:M35391 EMBL:S98904 EMBL:M21671 EMBL:S41099
            EMBL:M21353 EMBL:M28985 EMBL:V00503 EMBL:S96821 EMBL:L47668
            EMBL:X55525 EMBL:J00114 EMBL:M22816 EMBL:M22817 EMBL:K01078
            EMBL:K02568 IPI:IPI00304962 PIR:A28500 RefSeq:NP_000080.2
            UniGene:Hs.489142 ProteinModelPortal:P08123 SMR:P08123
            DIP:DIP-36079N IntAct:P08123 MINT:MINT-4791958 STRING:P08123
            PhosphoSite:P08123 DMDM:296439507 PaxDb:P08123 PRIDE:P08123
            Ensembl:ENST00000297268 GeneID:1278 KEGG:hsa:1278 UCSC:uc003ung.1
            GeneCards:GC07P094023 H-InvDB:HIX0006854 HGNC:HGNC:2198
            HPA:CAB032650 MIM:120160 MIM:225320 neXtProt:NX_P08123
            Orphanet:99876 Orphanet:230851 PharmGKB:PA35042 ChEMBL:CHEMBL2685
            ChiTaRS:COL1A2 GenomeRNAi:1278 NextBio:5165 ArrayExpress:P08123
            Bgee:P08123 Genevestigator:P08123 GermOnline:ENSG00000164692
            Uniprot:P08123
        Length = 1366

 Score = 124 (48.7 bits), Expect = 0.00043, P = 0.00043
 Identities = 79/261 (30%), Positives = 99/261 (37%)

Query:   156 GVPQGHGPPPSATTAGVVGA----G-PNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASK 209
             G+P   G P     AG  GA    G P  + S   +   G P  A    P GP G E  +
Sbjct:   322 GLPGPRGIPGPVGAAGATGARGLVGEPGPAGSKGESGNKGEPGSAGPQGPPGPSGEEGKR 381

Query:   210 GPGYDASKAPSYDPTKGPSYDP-AKG-PGYDPTKGP-GYDAQKGSNYDAQ-RGPNYDIHR 265
             GP  +A  A    P  G    P ++G PG D   G  G    +G++  A  RGPN D  R
Sbjct:   382 GPNGEAGSAGPPGPP-GLRGSPGSRGLPGADGRAGVMGPPGSRGASGPAGVRGPNGDAGR 440

Query:   266 -G-PSYDPQRGL-GYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEA--QRAPSYI-- 318
              G P     RGL G     GP    + GP      +PG D + GP+  A  +  P  I  
Sbjct:   441 PGEPGLMGPRGLPGSPGNIGPAG--KEGP----VGLPGIDGRPGPIGPAGARGEPGNIGF 494

Query:   319 -----PQRGPGYDLQRGQG--YDMRRAPSYDPSRGT----GFDGAPRGAAPHGQVPPP-L 366
                  P   PG +  +G       R AP  D + G     G  G   G    G   PP  
Sbjct:   495 PGPKGPTGDPGKNGDKGHAGLAGARGAPGPDGNNGAQGPPGPQGVQGGKGEQGPPGPPGF 554

Query:   367 NNVPYGSATPPARSGSGQPRG 387
               +P G + P    G    RG
Sbjct:   555 QGLP-GPSGPAGEVGKPGERG 574


>UNIPROTKB|Q51MB1 [details] [associations]
            symbol:RIM9 "pH-response regulator protein palI/RIM9"
            species:242507 "Magnaporthe oryzae 70-15" [GO:0003674
            "molecular_function" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] Pfam:PF06687 GO:GO:0016021 GO:GO:0005886
            eggNOG:NOG12793 EMBL:CM000230 EMBL:CM001237 OrthoDB:EOG4DBXQ8
            InterPro:IPR009571 RefSeq:XP_003721159.1 EnsemblFungi:MGG_02630T0
            GeneID:2682829 KEGG:mgr:MGG_02630 Uniprot:Q51MB1
        Length = 736

 Score = 121 (47.7 bits), Expect = 0.00043, P = 0.00043
 Identities = 56/176 (31%), Positives = 69/176 (39%)

Query:   116 LMNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQG----HGPPPSATTAG 171
             +  AP+ +R   G+ GG  G         P G+  Y  GYG P G    +GPP      G
Sbjct:   303 VQRAPSAERMNPGARGGYRGRGYG-----PPGRGGY--GYGPPPGSRGGYGPPGR----G 351

Query:   172 VVGAGPNTSTSAYAATQSGTPMRAAYDIP-RG----PGYEASK-GPGYDASKAPSYDPTK 225
               G GPN     Y     G P R  Y  P RG    PGY+  + G   +A   P   P +
Sbjct:   352 GYGPGPN-GRGGY-----GPPPRGGYGPPMRGRAPPPGYQYDRRGSPAEAYGPP---PGQ 402

Query:   226 GPSYDPAKGPGYDPTKGPGYDAQKGSN-------YDAQRGPNYDIHRGPSYDPQRG 274
             GP     + PG  P   PGY    GS        Y  Q  P+ D+ R  S  P  G
Sbjct:   403 GPYGQRQQSPG--PPSAPGY-GMNGSTPTVSSAAYGHQHTPSDDLPRAESPPPLPG 455


>UNIPROTKB|B0QYK0 [details] [associations]
            symbol:EWSR1 "RNA-binding protein EWS" species:9606 "Homo
            sapiens" [GO:0000166 "nucleotide binding" evidence=IEA] [GO:0003676
            "nucleic acid binding" evidence=IEA] [GO:0005622 "intracellular"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            InterPro:IPR000504 InterPro:IPR001876 InterPro:IPR012677
            Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102 PROSITE:PS50199
            SMART:SM00360 SMART:SM00547 GO:GO:0000166 GO:GO:0008270
            Gene3D:3.30.70.330 GO:GO:0003676 GO:GO:0005622 EMBL:AC002059
            EMBL:AL031186 EMBL:AC000026 UniGene:Hs.374477 HGNC:HGNC:3508
            HOGENOM:HOG000038010 HOVERGEN:HBG000970 ChiTaRS:EWSR1
            IPI:IPI00879242 SMR:B0QYK0 STRING:B0QYK0 Ensembl:ENST00000331029
            Uniprot:B0QYK0
        Length = 618

 Score = 120 (47.3 bits), Expect = 0.00045, P = 0.00045
 Identities = 75/279 (26%), Positives = 102/279 (36%)

Query:   128 GSYGGATGNSENETSGRPV-GQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 180
             G+YG  T  S  +       GQ AY   YG P  G+  P  P A +  V G G    +T+
Sbjct:    42 GTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTT 101

Query:   181 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPS--YDPTKGPSYDPAKGPGY 237
             T+    TQ+    ++AY   P  P Y   + P   A   P     PT+      + G GY
Sbjct:   102 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPTRPQDGNKPTETSQPQSSTG-GY 158

Query:   238 D-PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE 294
             + P+ G G   Q   +Y    G  P   +   PSY P     Y   +  +YD        
Sbjct:   159 NQPSLGYG---QSNYSYPQVPGSYPMQPVTAPPSYPPT---SYSSTQPTSYDQSSYSQQN 212

Query:   295 TQRVPG-YDVQRG----PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGF 349
             T   P  Y  Q        Y  Q   SY PQ G  Y     Q Y  +++ SY        
Sbjct:   213 TYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTG-SYSQAPSQ-YS-QQSSSYGQQSSFRQ 269

Query:   350 DGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 388
             D  P     +GQ     +  P  + +       G+ RGG
Sbjct:   270 DH-PSSMGVYGQESGGFSG-PGENRSMSGPDNRGRGRGG 306


>UNIPROTKB|D4A458 [details] [associations]
            symbol:Ewsr1 "Protein Ewsr1" species:10116 "Rattus
            norvegicus" [GO:0000166 "nucleotide binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005622
            "intracellular" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102
            PROSITE:PS50199 SMART:SM00360 SMART:SM00547 RGD:1307258
            GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
            GO:GO:0005622 IPI:IPI00767290 Ensembl:ENSRNOT00000057377
            ArrayExpress:D4A458 Uniprot:D4A458
        Length = 618

 Score = 120 (47.3 bits), Expect = 0.00045, P = 0.00045
 Identities = 74/278 (26%), Positives = 100/278 (35%)

Query:   128 GSYGGATGNSENET-SGRPVGQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 180
             G+YG  T  S  +  S    GQ AY   YG P  G+  P  P A +  V G G    +T+
Sbjct:    42 GTYGQPTDVSYTQAQSTATXGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTT 101

Query:   181 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG-YD 238
             T+    TQ+    ++AY   P  P Y   + P   A   P        +  P    G Y+
Sbjct:   102 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPTRPQDGNKPAETSQPQSSTGGYN 159

Query:   239 -PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
              P+ G G   Q   +Y    G  P   +   PSY P     Y   +  +YD        T
Sbjct:   160 QPSLGYG---QSNYSYPQVPGSYPMQPVTAPPSYPPT---SYSSSQPTSYDQSSYSQQNT 213

Query:   296 QRVPG-YDVQRG----PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFD 350
                P  Y  Q        Y  Q   SY PQ G  Y     Q Y  +++ SY        D
Sbjct:   214 YGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTG-SYSQAPSQ-YS-QQSSSYGQQSSFRQD 270

Query:   351 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 388
               P     +GQ     +  P  + +       G+ RGG
Sbjct:   271 H-PSSMGVYGQESGGFSG-PGENRSLSGPDNRGRGRGG 306


>UNIPROTKB|P02461 [details] [associations]
            symbol:COL3A1 "Collagen alpha-1(III) chain" species:9606
            "Homo sapiens" [GO:0046872 "metal ion binding" evidence=IEA]
            [GO:0001501 "skeletal system development" evidence=IEA] [GO:0001568
            "blood vessel development" evidence=IEA] [GO:0046332 "SMAD binding"
            evidence=IEA] [GO:0048565 "digestive tract development"
            evidence=IEA] [GO:0071230 "cellular response to amino acid
            stimulus" evidence=IEA] [GO:0031012 "extracellular matrix"
            evidence=IMP;TAS] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0007160 "cell-matrix adhesion" evidence=IDA] [GO:0018149
            "peptide cross-linking" evidence=IDA] [GO:0050777 "negative
            regulation of immune response" evidence=IMP] [GO:0005178 "integrin
            binding" evidence=NAS;IMP] [GO:0030168 "platelet activation"
            evidence=NAS] [GO:0007179 "transforming growth factor beta receptor
            signaling pathway" evidence=IDA] [GO:0034097 "response to cytokine
            stimulus" evidence=IDA] [GO:0009314 "response to radiation"
            evidence=IDA] [GO:0042060 "wound healing" evidence=IDA;NAS]
            [GO:0043206 "extracellular fibril organization" evidence=IMP]
            [GO:0030199 "collagen fibril organization" evidence=NAS;IMP]
            [GO:0007507 "heart development" evidence=IMP] [GO:0032964 "collagen
            biosynthetic process" evidence=IMP;TAS] [GO:0005615 "extracellular
            space" evidence=IDA;NAS] [GO:0043588 "skin development"
            evidence=IMP] [GO:0005201 "extracellular matrix structural
            constituent" evidence=IMP] [GO:0007229 "integrin-mediated signaling
            pathway" evidence=IMP] [GO:0005586 "collagen type III"
            evidence=NAS;IMP] [GO:0048407 "platelet-derived growth factor
            binding" evidence=IDA] [GO:0005576 "extracellular region"
            evidence=TAS] [GO:0005788 "endoplasmic reticulum lumen"
            evidence=TAS] [GO:0007411 "axon guidance" evidence=TAS] [GO:0030198
            "extracellular matrix organization" evidence=TAS]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 Reactome:REACT_118779
            Reactome:REACT_111045 Reactome:REACT_111102 GO:GO:0007411
            GO:GO:0043588 GO:GO:0005615 GO:GO:0030168 GO:GO:0007507
            GO:GO:0046872 GO:GO:0034097 GO:GO:0030199 GO:GO:0005788
            GO:GO:0001501 EMBL:CH471058 GO:GO:0005178 GO:GO:0007179
            GO:GO:0007229 GO:GO:0007160
            Pathway_Interaction_DB:endothelinpathway InterPro:IPR008160
            Pfam:PF01391 eggNOG:NOG12793 GO:GO:0001568 GO:GO:0048565
            GO:GO:0050777 GO:GO:0009314 GO:GO:0018149 GO:GO:0032964
            GO:GO:0071230 GO:GO:0043206 GO:GO:0005201 HOVERGEN:HBG004933
            KO:K06236 DrugBank:DB00048 DrugBank:DB00039 GO:GO:0048407
            OrthoDB:EOG4FTW1C EMBL:X14420 EMBL:AY054301 EMBL:AY016295
            EMBL:AC066694 EMBL:BC028178 EMBL:M26939 EMBL:X07240 EMBL:X15332
            EMBL:S62925 EMBL:S79877 EMBL:M59312 EMBL:M59227 EMBL:M55603
            EMBL:X06700 EMBL:X01655 EMBL:X01742 EMBL:M13146 EMBL:M11134
            IPI:IPI00021033 IPI:IPI00167087 PIR:S05272 RefSeq:NP_000081.1
            UniGene:Hs.443625 PDB:2V53 PDB:3DMW PDB:4AE2 PDB:4AEJ PDB:4AK3
            PDBsum:2V53 PDBsum:3DMW PDBsum:4AE2 PDBsum:4AEJ PDBsum:4AK3
            ProteinModelPortal:P02461 SMR:P02461 DIP:DIP-57177N IntAct:P02461
            STRING:P02461 PhosphoSite:P02461 DMDM:124056490 PaxDb:P02461
            PRIDE:P02461 Ensembl:ENST00000304636 GeneID:1281 KEGG:hsa:1281
            UCSC:uc002uqj.1 CTD:1281 GeneCards:GC02P189803 HGNC:HGNC:2201
            HPA:CAB016766 HPA:HPA007583 MIM:100070 MIM:120180 MIM:130020
            MIM:130050 neXtProt:NX_P02461 Orphanet:2500 Orphanet:285
            Orphanet:286 Orphanet:86 PharmGKB:PA26716 InParanoid:P02461
            OMA:EGSPGHP PhylomeDB:P02461 ChiTaRS:COL3A1
            EvolutionaryTrace:P02461 GenomeRNAi:1281 NextBio:5177
            ArrayExpress:P02461 Bgee:P02461 Genevestigator:P02461
            GermOnline:ENSG00000168542 GO:GO:0005586 Uniprot:P02461
        Length = 1466

 Score = 124 (48.7 bits), Expect = 0.00047, P = 0.00047
 Identities = 81/280 (28%), Positives = 101/280 (36%)

Query:   126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP----NTS 180
             A G   G  G +       P G + +    G P   GPP     AG  G  GP      S
Sbjct:   165 AVGGLAGYPGPAGPPGPPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPSGPPGPPGAIGPS 224

Query:   181 TSAYAATQSGTPMRAAY-DIPRGPGYEASKG-PGYDASKAP-SYDPTKGPSYDPAKGPGY 237
               A    +SG P R     +P  PG +   G PG+   K    +D   G   +    PG 
Sbjct:   225 GPAGKDGESGRPGRPGERGLPGPPGIKGPAGIPGFPGMKGHRGFDGRNGEKGETG-APGL 283

Query:   238 DPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGP-GYE 294
                 G PG +   G      RG   +  R P      G  G D  RG   D Q GP G  
Sbjct:   284 KGENGLPGENGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQPGPPGPP 338

Query:   295 -TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 353
              T   PG    +G V  A    S      PG   QRG+      A +  P    G +G+P
Sbjct:   339 GTAGFPGSPGAKGEVGPAGSPGS---NGAPG---QRGEPGPQGHAGAQGPPGPPGINGSP 392

Query:   354 RGAAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RGG 388
              G    G  P  +   P   G+  PP  +G+ G P  RGG
Sbjct:   393 GGKGEMG--PAGIPGAPGLMGARGPPGPAGANGAPGLRGG 430

 Score = 123 (48.4 bits), Expect = 0.00060, P = 0.00060
 Identities = 85/284 (29%), Positives = 101/284 (35%)

Query:   126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-G-PNTSTSA 183
             A G  GGA    +N   G P G        G+P   G P +    G  G+ G P  +   
Sbjct:   424 APGLRGGAGEPGKNGAKGEP-GPRGERGEAGIP---GVPGAKGEDGKDGSPGEPGANGLP 479

Query:   184 YAATQSGTPMRAAYDIPRGP-GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG 242
              AA + G P    +  P GP G    KGP  +   AP   P  GP    A  PG D   G
Sbjct:   480 GAAGERGAP---GFRGPAGPNGIPGEKGPAGERG-APG--PA-GPR-GAAGEPGRDGVPG 531

Query:   243 -PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDM-QRG----PGYE-T 295
              PG     GS      GP  D   GP    Q   G     GP+    Q G    PG +  
Sbjct:   532 GPGMRGMPGS----PGGPGSDGKPGPP-GSQGESGRPGPPGPSGPRGQPGVMGFPGPKGN 586

Query:   296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG-QGYDMRRAPSYD-----PSRGTGF 349
                PG + +RG        P   PQ  PG + + G QG      P  D     P    G 
Sbjct:   587 DGAPGKNGERG----GPGGPG--PQGPPGKNGETGPQGPPGPTGPGGDKGDTGPPGPQGL 640

Query:   350 DGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPARR 393
              G P    P G+   P    P G A  P   G G+   G P  R
Sbjct:   641 QGLPGTGGPPGENGKPGEPGPKGDAGAPGAPG-GKGDAGAPGER 683


>MGI|MGI:88462 [details] [associations]
            symbol:Col7a1 "collagen, type VII, alpha 1" species:10090 "Mus
            musculus" [GO:0004867 "serine-type endopeptidase inhibitor
            activity" evidence=IEA] [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0005578 "proteinaceous extracellular matrix"
            evidence=IEA] [GO:0005581 "collagen" evidence=IEA] [GO:0005604
            "basement membrane" evidence=IDA] [GO:0007155 "cell adhesion"
            evidence=IEA] [GO:0010466 "negative regulation of peptidase
            activity" evidence=IEA] [GO:0030414 "peptidase inhibitor activity"
            evidence=IEA] InterPro:IPR002035 InterPro:IPR002223
            InterPro:IPR003961 Pfam:PF00014 Pfam:PF00041 Pfam:PF00092
            PRINTS:PR00759 PROSITE:PS50234 PROSITE:PS50279 PROSITE:PS50853
            SMART:SM00060 SMART:SM00327 MGI:MGI:88462 Gene3D:2.60.40.10
            InterPro:IPR013783 GO:GO:0004867 SUPFAM:SSF49265 GO:GO:0007155
            Gene3D:4.10.410.10 InterPro:IPR020901 SUPFAM:SSF57362
            PROSITE:PS00280 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            eggNOG:NOG12793 GO:GO:0005604 EMBL:AC174646 MEROPS:I02.967 CTD:1294
            HOGENOM:HOG000111866 HOVERGEN:HBG051053 KO:K16628 OMA:RRVCTTA
            OrthoDB:EOG4J117P EMBL:U32107 EMBL:S63654 IPI:IPI00134652
            PIR:A45748 RefSeq:NP_031764.2 UniGene:Mm.6200 HSSP:P12111
            ProteinModelPortal:Q63870 SMR:Q63870 STRING:Q63870
            PhosphoSite:Q63870 PaxDb:Q63870 PRIDE:Q63870
            Ensembl:ENSMUST00000026740 Ensembl:ENSMUST00000112070 GeneID:12836
            KEGG:mmu:12836 UCSC:uc009rrh.1 GeneTree:ENSGT00700000104250
            InParanoid:Q63870 NextBio:282356 Bgee:Q63870 CleanEx:MM_COL7A1
            Genevestigator:Q63870 Uniprot:Q63870
        Length = 2944

 Score = 127 (49.8 bits), Expect = 0.00047, P = 0.00047
 Identities = 86/270 (31%), Positives = 103/270 (38%)

Query:   145 PVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG-- 202
             P G    +   G P   GPP S    GV G+ P    S       G         P+G  
Sbjct:  1289 PPGSTQAKGERGFPGPEGPPGSPGLPGVPGS-PGIKGSTGRPGPRGEQGERGPQGPKGEP 1347

Query:   203 --PGY-EASKGPGYDASKAPSYDPTKGPSYDP-AKGPGYDP-TKGP-GYD--AQKGSNYD 254
               PG      GPG+   K    DP  GPS  P ++GP  DP  +GP G    + KG   D
Sbjct:  1348 GEPGQITGGGGPGFPGKKG---DP--GPSGPPGSRGPVGDPGPRGPPGLPGISVKGDKGD 1402

Query:   255 -AQRGP-NYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEA 311
               +RGP    I      DP  GL G     GP     R PG + ++  G     GP    
Sbjct:  1403 RGERGPPGPGIGASEQGDP--GLPGLPGSPGPQGPAGR-PGEKGEK--GDCEDGGPGLPG 1457

Query:   312 QRAPSYIPQ-RG-PGYDLQRG-QGYDMRRA-PSYDPSRG----TGFDGAPRGAAPHGQVP 363
             Q  P   P  RG PG    +G +G       P     RG     G  G P GAA H    
Sbjct:  1458 QPGPPGEPGLRGAPGMTGPKGDRGLTGTPGEPGVKGERGHPGPVGPQGLP-GAAGH---- 1512

Query:   364 PPLNNVPYGSATPPARSGS-GQP-RGGNPA 391
             P +   P G   P  R G  G+P R G+PA
Sbjct:  1513 PGVEG-PEGPPGPTGRRGEKGEPGRPGDPA 1541


>UNIPROTKB|B4DR34 [details] [associations]
            symbol:SS18 "Protein SSXT" species:9606 "Homo sapiens"
            [GO:0000226 "microtubule cytoskeleton organization" evidence=IEA]
            [GO:0000902 "cell morphogenesis" evidence=IEA] [GO:0005881
            "cytoplasmic microtubule" evidence=IEA] [GO:0007243 "intracellular
            protein kinase cascade" evidence=IEA] [GO:0042493 "response to
            drug" evidence=IEA] [GO:0048013 "ephrin receptor signaling pathway"
            evidence=IEA] GO:GO:0000226 GO:GO:0042493 GO:GO:0007243
            GO:GO:0000902 GO:GO:0048013 GO:GO:0005881 HOVERGEN:HBG003892
            InterPro:IPR007726 PANTHER:PTHR23107 UniGene:Hs.129261
            EMBL:AC091021 HGNC:HGNC:11340 ChiTaRS:SS18 EMBL:AK299082
            IPI:IPI01015658 STRING:B4DR34 Ensembl:ENST00000539849
            Uniprot:B4DR34
        Length = 336

 Score = 116 (45.9 bits), Expect = 0.00047, P = 0.00047
 Identities = 66/236 (27%), Positives = 88/236 (37%)

Query:   128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQG---H--GPPPSATTAGVVGAGPNT--- 179
             G+YG     S     G  + Q      Y +PQG   H  G  P     G V  G +    
Sbjct:   106 GNYGPRPNMSMQPNQGPMMHQQPPSQQYNMPQGGGQHYQGQQPPMGMMGQVNQGNHMMGQ 165

Query:   180 -STSAYAATQSGTPMR-AAYDIPRGPGYE-ASKGP--GYDASKAPSYDPTKG---PSYDP 231
                  Y   Q G P + +  +   G  Y    +GP  G +    P      G   PSY P
Sbjct:   166 RQIPPYRPPQQGPPQQYSGQEDYYGDQYSHGGQGPPEGMNQQYYPDGHNDYGYQQPSY-P 224

Query:   232 AKGPGYD-PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 290
              +G  YD P +       +G N  +Q G   D ++GP   PQ+G     Q+ P      G
Sbjct:   225 EQG--YDRPYEDSSQHYYEGGN--SQYGQQQDAYQGPP--PQQGYPPQQQQYPGQQGYPG 278

Query:   291 P--GY-ETQRVPGYDVQRGPVYEAQRAPSYIP-QRGPGYD-LQRGQGYDMRRAPSY 341
                GY  +Q  PG      P  + Q+   Y P Q GP     QR  GYD  +  +Y
Sbjct:   279 QQQGYGPSQGGPGPQYPNYPQGQGQQYGGYRPTQPGPPQPPQQRPYGYDQGQYGNY 334


>UNIPROTKB|A8E651 [details] [associations]
            symbol:EWSR1 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0005634 "nucleus" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0000166 "nucleotide binding" evidence=IEA]
            InterPro:IPR000504 InterPro:IPR001876 InterPro:IPR012677
            Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102 PROSITE:PS50199
            SMART:SM00360 SMART:SM00547 GO:GO:0005634 GO:GO:0000166
            GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676 eggNOG:NOG240581
            GeneTree:ENSGT00530000063105 CTD:2130 HOGENOM:HOG000038010
            HOVERGEN:HBG000970 KO:K13209 OMA:EGTSTGY OrthoDB:EOG42NJ15
            EMBL:DAAA02045602 EMBL:BC153844 IPI:IPI00871084
            RefSeq:NP_001103270.1 UniGene:Bt.33949 SMR:A8E651 STRING:A8E651
            Ensembl:ENSBTAT00000023612 GeneID:534073 KEGG:bta:534073
            InParanoid:A8E651 NextBio:20876260 Uniprot:A8E651
        Length = 655

 Score = 120 (47.3 bits), Expect = 0.00048, P = 0.00048
 Identities = 73/278 (26%), Positives = 99/278 (35%)

Query:   128 GSYGGATGNSENETSGRPV-GQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 180
             G+YG  T  S  +       GQ AY   YG P  G+  P  P A +  V G G    +T+
Sbjct:    42 GTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYSTPAAPQAYSQPVQGYGTGAYDTT 101

Query:   181 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG-YD 238
             T+    TQ+    ++AY   P  P Y   + P   A   P        +  P    G Y+
Sbjct:   102 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPARPQDGNKPAETSQPQSSTGGYN 159

Query:   239 -PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
              P+ G G   Q   +Y    G  P   +   PSY P     Y   +  +YD        T
Sbjct:   160 QPSLGYG---QSNYSYPQVPGSYPMQPVSAPPSYPPT---SYSSTQPTSYDQSSYSQQNT 213

Query:   296 QRVPG-YDVQRG----PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFD 350
                P  Y  Q        Y  Q   SY PQ G  Y     Q Y  +++ SY        D
Sbjct:   214 YGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTG-SYSQAPSQ-YS-QQSSSYGQQSSFRQD 270

Query:   351 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 388
               P     +GQ     +  P  + +       G+ RGG
Sbjct:   271 H-PSSMGVYGQESGGFSG-PGENRSMSGPDNRGRGRGG 306


>UNIPROTKB|Q01844 [details] [associations]
            symbol:EWSR1 "RNA-binding protein EWS" species:9606 "Homo
            sapiens" [GO:0000166 "nucleotide binding" evidence=IEA] [GO:0008270
            "zinc ion binding" evidence=IEA] [GO:0003723 "RNA binding"
            evidence=IEA] [GO:0005516 "calmodulin binding" evidence=IEA]
            [GO:0006351 "transcription, DNA-dependent" evidence=IEA]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0005737
            "cytoplasm" evidence=IEA] [GO:0005886 "plasma membrane"
            evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            InterPro:IPR000504 InterPro:IPR001876 InterPro:IPR012677
            Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50096 PROSITE:PS50102
            PROSITE:PS50199 SMART:SM00360 SMART:SM00547 GO:GO:0005886
            GO:GO:0005634 GO:GO:0005737 GO:GO:0006355 GO:GO:0000166
            GO:GO:0046872 EMBL:CH471095 GO:GO:0008270 Gene3D:3.30.70.330
            GO:GO:0006351 GO:GO:0003723 EMBL:AC002059 MIM:612160 Orphanet:97338
            Pathway_Interaction_DB:bard1pathway eggNOG:NOG240581 EMBL:AL031186
            MIM:612219 Orphanet:319 EMBL:X66899 EMBL:X72990 EMBL:X72991
            EMBL:X72992 EMBL:X72993 EMBL:X72994 EMBL:X72995 EMBL:X72996
            EMBL:X72997 EMBL:X72998 EMBL:X72999 EMBL:X73000 EMBL:X73001
            EMBL:X73002 EMBL:X73003 EMBL:X73004 EMBL:Y07848 EMBL:CR456490
            EMBL:AK056309 EMBL:AK056681 EMBL:AC000026 EMBL:BC000527
            EMBL:BC004817 EMBL:BC011048 EMBL:BC072442 EMBL:Y08806 EMBL:AB016435
            IPI:IPI00065554 IPI:IPI00293254 IPI:IPI00335961 IPI:IPI00872855
            IPI:IPI00879259 PIR:A49358 RefSeq:NP_001156757.1
            RefSeq:NP_001156759.1 RefSeq:NP_005234.1 RefSeq:NP_053733.2
            UniGene:Hs.374477 PDB:2CPE PDBsum:2CPE ProteinModelPortal:Q01844
            SMR:Q01844 IntAct:Q01844 MINT:MINT-2858561 STRING:Q01844
            PhosphoSite:Q01844 DMDM:544261 PaxDb:Q01844 PRIDE:Q01844 DNASU:2130
            Ensembl:ENST00000332035 Ensembl:ENST00000333395
            Ensembl:ENST00000397938 Ensembl:ENST00000406548
            Ensembl:ENST00000414183 GeneID:2130 KEGG:hsa:2130 UCSC:uc003aet.3
            CTD:2130 GeneCards:GC22P029663 HGNC:HGNC:3508 HPA:CAB004230
            MIM:133450 neXtProt:NX_Q01844 Orphanet:83469 PharmGKB:PA27921
            HOGENOM:HOG000038010 HOVERGEN:HBG000970 KO:K13209 OMA:EGTSTGY
            OrthoDB:EOG42NJ15 PhylomeDB:Q01844 ChiTaRS:EWSR1
            EvolutionaryTrace:Q01844 GenomeRNAi:2130 NextBio:8605
            ArrayExpress:Q01844 Bgee:Q01844 CleanEx:HS_EWSR1
            Genevestigator:Q01844 GermOnline:ENSG00000182944 Uniprot:Q01844
        Length = 656

 Score = 120 (47.3 bits), Expect = 0.00048, P = 0.00048
 Identities = 75/279 (26%), Positives = 102/279 (36%)

Query:   128 GSYGGATGNSENETSGRPV-GQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 180
             G+YG  T  S  +       GQ AY   YG P  G+  P  P A +  V G G    +T+
Sbjct:    42 GTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTT 101

Query:   181 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPS--YDPTKGPSYDPAKGPGY 237
             T+    TQ+    ++AY   P  P Y   + P   A   P     PT+      + G GY
Sbjct:   102 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPTRPQDGNKPTETSQPQSSTG-GY 158

Query:   238 D-PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE 294
             + P+ G G   Q   +Y    G  P   +   PSY P     Y   +  +YD        
Sbjct:   159 NQPSLGYG---QSNYSYPQVPGSYPMQPVTAPPSYPPT---SYSSTQPTSYDQSSYSQQN 212

Query:   295 TQRVPG-YDVQRG----PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGF 349
             T   P  Y  Q        Y  Q   SY PQ G  Y     Q Y  +++ SY        
Sbjct:   213 TYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTG-SYSQAPSQ-YS-QQSSSYGQQSSFRQ 269

Query:   350 DGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 388
             D  P     +GQ     +  P  + +       G+ RGG
Sbjct:   270 DH-PSSMGVYGQESGGFSG-PGENRSMSGPDNRGRGRGG 306


>UNIPROTKB|F1LN98 [details] [associations]
            symbol:Ewsr1 "Protein Ewsr1" species:10116 "Rattus
            norvegicus" [GO:0000166 "nucleotide binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005622
            "intracellular" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102
            PROSITE:PS50199 SMART:SM00360 SMART:SM00547 RGD:1307258
            GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
            GO:GO:0005622 GeneTree:ENSGT00530000063105 IPI:IPI00364603
            Ensembl:ENSRNOT00000012634 ArrayExpress:F1LN98 Uniprot:F1LN98
        Length = 656

 Score = 120 (47.3 bits), Expect = 0.00048, P = 0.00048
 Identities = 74/278 (26%), Positives = 100/278 (35%)

Query:   128 GSYGGATGNSENET-SGRPVGQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 180
             G+YG  T  S  +  S    GQ AY   YG P  G+  P  P A +  V G G    +T+
Sbjct:    42 GTYGQPTDVSYTQAQSTATXGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTT 101

Query:   181 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG-YD 238
             T+    TQ+    ++AY   P  P Y   + P   A   P        +  P    G Y+
Sbjct:   102 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPTRPQDGNKPAETSQPQSSTGGYN 159

Query:   239 -PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
              P+ G G   Q   +Y    G  P   +   PSY P     Y   +  +YD        T
Sbjct:   160 QPSLGYG---QSNYSYPQVPGSYPMQPVTAPPSYPPT---SYSSSQPTSYDQSSYSQQNT 213

Query:   296 QRVPG-YDVQRG----PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFD 350
                P  Y  Q        Y  Q   SY PQ G  Y     Q Y  +++ SY        D
Sbjct:   214 YGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTG-SYSQAPSQ-YS-QQSSSYGQQSSFRQD 270

Query:   351 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 388
               P     +GQ     +  P  + +       G+ RGG
Sbjct:   271 H-PSSMGVYGQESGGFSG-PGENRSLSGPDNRGRGRGG 306


>UNIPROTKB|I3LNI2 [details] [associations]
            symbol:TFG "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0043123 "positive regulation of I-kappaB
            kinase/NF-kappaB cascade" evidence=IEA] [GO:0042802 "identical
            protein binding" evidence=IEA] [GO:0004871 "signal transducer
            activity" evidence=IEA] GO:GO:0043123 GO:GO:0004871 OMA:YTTQTSQ
            GeneTree:ENSGT00510000047809 EMBL:CU928320 EMBL:AEMK01189642
            Ensembl:ENSSSCT00000026186 Uniprot:I3LNI2
        Length = 340

 Score = 116 (45.9 bits), Expect = 0.00048, P = 0.00048
 Identities = 76/301 (25%), Positives = 114/301 (37%)

Query:   106 ATEVEKLRAELMNAPN-VDRRAD-----GSYGGATGNSENET-SGRPVGQNAYEDGYGVP 158
             +++V+ LR EL+   N V+R  D     G  G +T  +EN+T  GR   + A  D  G  
Sbjct:    38 SSQVKYLRRELIELRNKVNRLLDSLEPPGEPGPSTNITENDTVDGREE-KPAASDSSGKQ 96

Query:   159 QGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKA 218
                    S +    +      + +  +A   G         P  P  + S  P   AS +
Sbjct:    97 STQVMAASMSAFDPLKNQDEINKNVMSAF--GLTDDQVSGPPSAPAEDRSGTPDSIASSS 154

Query:   219 PSYDPTKGPSYDPAKGPGYDPTKGPGY-DAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGY 277
              +  P   P   P + P        G  + Q    Y  Q G      + P   PQ+  G 
Sbjct:   155 SAAHP---PGVQPQQPPYTGALTQAGQSEGQMYQQYPQQAGYGTQQPQAPPQPPQQS-GS 210

Query:   278 DMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYI--PQRGPGYDLQRGQGYDM 335
              + +G  Y  Q GP  + Q+  GY  Q  P  +A  AP++   PQ+ P    Q+ Q    
Sbjct:   211 SLSKG--YSQQTGP-QQPQQFQGYGQQ--PTSQAP-APAFSGQPQQMPAQPPQQYQASSY 264

Query:   336 R-RAPSYDPSRGTGFDGAPRGAAPHGQVP--PPLNNVPYGSATPPARSGSGQPRGGNPAR 392
               +  +   S+ T +  AP  A+  G  P  P       G   PP  + +  P G NP  
Sbjct:   265 PPQTYTTQTSQPTNYTVAP--ASQPGMAPSQPGAYQPRPGFTPPPGSTMTPLPSGSNPYA 322

Query:   393 R 393
             R
Sbjct:   323 R 323


>UNIPROTKB|F1RY40 [details] [associations]
            symbol:RBM12B "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0000166 "nucleotide binding" evidence=IEA] InterPro:IPR000504
            InterPro:IPR012677 PROSITE:PS50102 SMART:SM00360 GO:GO:0000166
            Gene3D:3.30.70.330 GO:GO:0003676 GeneTree:ENSGT00620000087828
            OMA:EHFRRPP CTD:389677 EMBL:CU633952 RefSeq:XP_003125614.1
            UniGene:Ssc.32661 Ensembl:ENSSSCT00000006702 GeneID:100514101
            KEGG:ssc:100514101 Uniprot:F1RY40
        Length = 986

 Score = 122 (48.0 bits), Expect = 0.00049, P = 0.00049
 Identities = 42/150 (28%), Positives = 65/150 (43%)

Query:   217 KAPSYDPTKGPSYDPAKGPGYDPTKGPGY-DAQKGSNYDAQRGPNYDIHRGPSYDPQRGL 275
             + P  D  + P  +  + P  +  + P   D ++    D +R P  D  R P  D +R  
Sbjct:   581 RRPPEDDFRRPWEEDFRYPREEDFRYPREEDWRRPPEEDFRRPPKDDFRRPPEEDWRRPP 640

Query:   276 GYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDM 335
               D +R P  D +R P  + +R P  + +R P  + +R P    +R P  D +R    D 
Sbjct:   641 EGDFRRPPEEDWRRPPEEDFRRPPPGEWRRPPEEDFRRPPEEDFRRLPEEDFRRPHEEDF 700

Query:   336 RRAPSYDPSRGTGFDGAPRGAAPHGQVPPP 365
             RR+P  D  R +  D   R    H + PPP
Sbjct:   701 RRSPEED-FRHSPEDDFRRPPPEHFRRPPP 729


>ZFIN|ZDB-GENE-041221-3 [details] [associations]
            symbol:prnprs3 "prion protein, related sequence 3"
            species:7955 "Danio rerio" [GO:0005509 "calcium ion binding"
            evidence=IEA] [GO:0005544 "calcium-dependent phospholipid binding"
            evidence=IEA] [GO:0051260 "protein homooligomerization"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0048854
            "brain morphogenesis" evidence=IMP] [GO:0005886 "plasma membrane"
            evidence=IDA] [GO:0007156 "homophilic cell adhesion" evidence=IDA]
            [GO:0021731 "trigeminal motor nucleus development" evidence=IMP]
            [GO:0042981 "regulation of apoptotic process" evidence=IMP]
            InterPro:IPR001464 InterPro:IPR022416 ZFIN:ZDB-GENE-041221-3
            GO:GO:0005886 GO:GO:0042981 GO:GO:0051260 GO:GO:0005509
            GO:GO:0007156 GO:GO:0005544 PANTHER:PTHR10502 GO:GO:0048854
            Gene3D:1.10.790.10 SUPFAM:SSF54098 HOVERGEN:HBG056090 EMBL:AJ620614
            IPI:IPI00679275 RefSeq:NP_001013316.1 UniGene:Dr.162496
            UniGene:Dr.84038 ProteinModelPortal:Q5K4F8 GeneID:503702
            KEGG:dre:503702 CTD:503702 InParanoid:Q5K4F8 NextBio:20866258
            ArrayExpress:Q5K4F8 GO:GO:0021731 Uniprot:Q5K4F8
        Length = 567

 Score = 119 (46.9 bits), Expect = 0.00051, P = 0.00051
 Identities = 70/224 (31%), Positives = 94/224 (41%)

Query:   118 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYG----VPQ--GHGPPPSATTAG 171
             ++ N    + G+ GG++ +S + +S +    +      G     PQ     PPP     G
Sbjct:    36 SSSNKGGSSSGNKGGSSSSSSSSSSSKGTSSHGTHTSPGNYPRQPQVPNQNPPPYP---G 92

Query:   172 VVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDP 231
               G  P       A +  G P + +Y  P   GY  ++G GY A     Y P +G  Y P
Sbjct:    93 AGGGYPGQGRYPPAGSNPGYPNQGSY--PGRAGYP-NQG-GYPAQGG--Y-PAQG-GY-P 143

Query:   232 AKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRG 290
             A+G GY P +G GY AQ G  Y AQ G     + G S  P +G GY  Q G P      G
Sbjct:   144 AQG-GY-PAQG-GYPAQGG--YPAQGGYPQGNYPGRSGYPGQG-GYPAQGGYPGGASYPG 197

Query:   291 PGYET--QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQG 332
              G  +   R PG +    PV  +   P Y P RG     Q G G
Sbjct:   198 AGAGSYPNRYPGGNPY--PVGGSY--PGY-PVRGGSSPNQFGGG 236


>RGD|1565398 [details] [associations]
            symbol:Col6a1 "collagen, type VI, alpha 1" species:10116 "Rattus
            norvegicus" [GO:0003674 "molecular_function" evidence=ND]
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005576
            "extracellular region" evidence=IEA;ISO] [GO:0008150
            "biological_process" evidence=ND] [GO:0031012 "extracellular
            matrix" evidence=IEA;ISO] [GO:0042383 "sarcolemma"
            evidence=IEA;ISO] [GO:0043234 "protein complex" evidence=IEA;ISO]
            [GO:0048407 "platelet-derived growth factor binding"
            evidence=IEA;ISO] [GO:0070208 "protein heterotrimerization"
            evidence=IEA;ISO] [GO:0071230 "cellular response to amino acid
            stimulus" evidence=IEA;ISO] InterPro:IPR002035 Pfam:PF00092
            PROSITE:PS50234 SMART:SM00327 RGD:1565398 GO:GO:0005576
            GO:GO:0043234 GO:GO:0042383 GO:GO:0070208 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0031012 GO:GO:0071230 OrthoDB:EOG4WWRHX
            OMA:VKENYAE GeneTree:ENSGT00530000063022 IPI:IPI00371853
            PRIDE:D3ZUL3 Ensembl:ENSRNOT00000001679 Uniprot:D3ZUL3
        Length = 1025

 Score = 122 (48.0 bits), Expect = 0.00051, P = 0.00051
 Identities = 85/262 (32%), Positives = 103/262 (39%)

Query:   120 PNVDRRADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AG 176
             P  D  A G  G  G  G +E E +GRP G +      G P   GPP     AG  G AG
Sbjct:   359 PKGDAGAFGLKGEKGEAG-AEGE-AGRP-GNSGPPGDEGEPGEPGPPGEKGEAGDEGNAG 415

Query:   177 PNTSTSAYAATQSGTPMRAAYDIPRG-PGYEASKGPGYDASKA-PSYDPTK-GPSYDPAK 233
             P+      A  + G P       PRG PG    +GP  D  +A P  D  + GP   P  
Sbjct:   416 PDG-----APGERGGPGERG---PRGTPGV---RGPRGDPGEAGPQGDQGREGPVGIPGD 464

Query:   234 GPGYDPTKGP-GYDAQKGS-NYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRG 290
              PG     GP GY   +G    +  RG    +  GP  DP  GL G   + GP  +   G
Sbjct:   465 -PGESGPIGPKGYRGDEGPPGPEGLRGAPGPV--GPPGDP--GLMGERGEDGPPGNGTEG 519

Query:   291 -PGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPSRGT- 347
              PG+     PGY   RGP       P     +G PG     G+  D     +    RG  
Sbjct:   520 FPGF-----PGYPGNRGP-------PGINGTKGYPGLKGDEGEAGDPGEDNNDVSPRGVK 567

Query:   348 ---GFDGAPRGA-APHGQVPPP 365
                G+ G P G   P G V PP
Sbjct:   568 GAKGYRG-PEGPQGPPGHVGPP 588


>MGI|MGI:1932491 [details] [associations]
            symbol:Prp2 "proline rich protein 2" species:10090 "Mus
            musculus" [GO:0003674 "molecular_function" evidence=ND] [GO:0005575
            "cellular_component" evidence=ND] [GO:0005576 "extracellular
            region" evidence=IEA] [GO:0008150 "biological_process" evidence=ND]
            MGI:MGI:1932491 GO:GO:0005576 InterPro:IPR026086 PANTHER:PTHR23203
            UniGene:Mm.425348 UniGene:Mm.484054 CleanEx:MM_PRH1 EMBL:M23236
            EMBL:M12100 EMBL:M19419 IPI:IPI00474263 IPI:IPI00855123 PIR:A28996
            PIR:D29149 UniGene:Mm.333439 Genevestigator:P05143
            GermOnline:ENSMUSG00000058295 Uniprot:P05143
        Length = 317

 Score = 115 (45.5 bits), Expect = 0.00055, P = 0.00055
 Identities = 67/242 (27%), Positives = 77/242 (31%)

Query:   156 GVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGP-GYD 214
             G P   GP P          GP            G   R     P  PG    + P G  
Sbjct:    79 GPPPPGGPQPRPPQGPPPPGGPQPRPPQGPPPPGGPQPRPPQG-PPPPGGPQPRPPQGPP 137

Query:   215 ASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRG 274
                 P   P +GP   P  GP   P +GP   A  G      +GP      GP   P +G
Sbjct:   138 PPGGPQQRPPQGPP--PPGGPQPRPPQGPPPPA--GPQPRPPQGPPPPA--GPHLRPTQG 191

Query:   275 ---LGYDMQRGPNYDMQRGPGYETQRVP-GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG 330
                 G   QR P       PG    R P G     GP     + P   P  GP    +  
Sbjct:   192 PPPTGGPQQRYPQSPPP--PGGPQPRPPQGPPPPGGPHPRPTQGP---PPTGP--QPRPT 244

Query:   331 QGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQ--PRGG 388
             QG      P   P +G    G P+   P G  PPP    P  +  P    G  Q  P  G
Sbjct:   245 QGPPPTGGPQQRPPQGPPPPGGPQPRPPQGP-PPPTGPQPRPTQGPHPTGGPQQTPPLAG 303

Query:   389 NP 390
             NP
Sbjct:   304 NP 305


>MGI|MGI:88455 [details] [associations]
            symbol:Col4a2 "collagen, type IV, alpha 2" species:10090 "Mus
            musculus" [GO:0001525 "angiogenesis" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            [GO:0005576 "extracellular region" evidence=IEA] [GO:0005578
            "proteinaceous extracellular matrix" evidence=IEA] [GO:0005581
            "collagen" evidence=IEA] [GO:0005587 "collagen type IV"
            evidence=IDA] [GO:0005604 "basement membrane" evidence=IDA]
            [GO:0006351 "transcription, DNA-dependent" evidence=IDA]
            [GO:0016525 "negative regulation of angiogenesis" evidence=ISO]
            InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
            MGI:MGI:88455 GO:GO:0071560 InterPro:IPR016187 SUPFAM:SSF56436
            GO:GO:0006351 GO:GO:0001525 InterPro:IPR008160 Pfam:PF01391
            eggNOG:NOG12793 GO:GO:0016525 GO:GO:0005201 HOVERGEN:HBG004933
            GO:GO:0005587 Gene3D:2.170.240.10 GeneTree:ENSGT00690000101772
            KO:K06237 EMBL:J04448 EMBL:M23333 OrthoDB:EOG4XGZZF CTD:1284
            OMA:TTIPEQN ChiTaRS:COL4A2 EMBL:J04695 EMBL:AK053858 EMBL:AK075619
            EMBL:AK164096 EMBL:BC013560 EMBL:BC080789 EMBL:BC107685 EMBL:M23334
            EMBL:X02896 EMBL:X02897 EMBL:X02898 EMBL:X02899 EMBL:X04410
            EMBL:X04647 EMBL:M15833 EMBL:AY375463 EMBL:AY502946 EMBL:AY502947
            IPI:IPI00338452 PIR:A33526 RefSeq:NP_034062.3 UniGene:Mm.181021
            ProteinModelPortal:P08122 SMR:P08122 STRING:P08122
            PhosphoSite:P08122 PaxDb:P08122 PRIDE:P08122
            Ensembl:ENSMUST00000033899 GeneID:12827 KEGG:mmu:12827
            InParanoid:P08122 NextBio:282318 Bgee:P08122 CleanEx:MM_COL4A2
            Genevestigator:P08122 GermOnline:ENSMUSG00000031503 Uniprot:P08122
        Length = 1707

 Score = 124 (48.7 bits), Expect = 0.00055, P = 0.00055
 Identities = 91/301 (30%), Positives = 110/301 (36%)

Query:   119 APNVDRRADGSYGGATGN----SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG 174
             +P VD   D  + G TG+     E  T   PVG    +   G P   GP  S    G  G
Sbjct:  1205 SPGVDAHGDPGFPGPTGDRGDRGEANTLPGPVGVPGQKGERGTPGERGPAGSPGLQGFPG 1264

Query:   175 AGP--NTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGPGYD--ASKAPSYDPTKGPSY 229
               P  N S S       G      Y  P GP G  A  G   D  +S A  +   KG   
Sbjct:  1265 ISPPSNISGSPGDVGAPGIFGLQGYQGPPGPPGPNALPGIKGDEGSSGAAGFPGQKGWVG 1324

Query:   230 DPAKGPGYDP-TKG-PGYDAQKGSN-YDAQRGPNYDI-HRGPSYDPQRGLGYDMQRGPNY 285
             DP  GP   P   G PG    KG   +    GP+  +  RGP   P+   G+    G   
Sbjct:  1325 DP--GPQGQPGVLGLPGEKGPKGEQGFMGNTGPSGAVGDRGPK-GPKGDQGFPGAPGS-- 1379

Query:   286 DMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPS 344
                  PG     +PG   Q+  V      P    +RG PG   + G      + P  DP 
Sbjct:  1380 --MGSPG-----IPGIP-QKIAVQPGTLGPQ--GRRGLPGALGEIGP-----QGPPGDP- 1423

Query:   345 RGTGFDGAPRGAAPHGQ-----VPP------PLNNV-PYGSATPPARSGS-GQPRGGNPA 391
                GF GAP  A P G+     VP       P+ +  P G    P R GS G P  G P 
Sbjct:  1424 ---GFRGAPGKAGPQGRGGVSAVPGFRGDQGPMGHQGPVGQEGEPGRPGSPGLP--GMPG 1478

Query:   392 R 392
             R
Sbjct:  1479 R 1479


>UNIPROTKB|Q96P44 [details] [associations]
            symbol:COL21A1 "Collagen alpha-1(XXI) chain" species:9606
            "Homo sapiens" [GO:0005581 "collagen" evidence=IEA] [GO:0005576
            "extracellular region" evidence=TAS] [GO:0005788 "endoplasmic
            reticulum lumen" evidence=TAS] [GO:0030198 "extracellular matrix
            organization" evidence=TAS] [GO:0031012 "extracellular matrix"
            evidence=IDA] InterPro:IPR002035 Pfam:PF00092 PROSITE:PS50234
            SMART:SM00327 Reactome:REACT_118779 GO:GO:0005576 GO:GO:0030198
            Gene3D:2.60.120.200 InterPro:IPR008985 InterPro:IPR013320
            SUPFAM:SSF49899 GO:GO:0005788 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 eggNOG:NOG12793 InterPro:IPR001791 PROSITE:PS50025
            SMART:SM00210 EMBL:AF414088 EMBL:AF330693 EMBL:AF438327
            EMBL:AL136624 EMBL:AF370383 EMBL:AK313398 EMBL:AL513530
            EMBL:AL031782 EMBL:AL034452 EMBL:BC045597 EMBL:BC126108
            IPI:IPI00102435 IPI:IPI00435960 IPI:IPI00644733 RefSeq:NP_110447.2
            UniGene:Hs.47629 HSSP:P18614 ProteinModelPortal:Q96P44 SMR:Q96P44
            STRING:Q96P44 DMDM:74752071 PaxDb:Q96P44 PRIDE:Q96P44
            Ensembl:ENST00000244728 Ensembl:ENST00000370808
            Ensembl:ENST00000370819 GeneID:81578 KEGG:hsa:81578 UCSC:uc003pcs.3
            UCSC:uc003pcu.1 UCSC:uc010jzz.3 CTD:81578 GeneCards:GC06M055968
            HGNC:HGNC:17025 HPA:HPA031210 HPA:HPA031212 HPA:HPA031213
            MIM:610002 neXtProt:NX_Q96P44 PharmGKB:PA26714 HOVERGEN:HBG106599
            InParanoid:Q96P44 KO:K16629 OMA:NGRQGIP OrthoDB:EOG4KH2TF
            GenomeRNAi:81578 NextBio:71896 ArrayExpress:Q96P44 Bgee:Q96P44
            CleanEx:HS_COL21A1 Genevestigator:Q96P44 Uniprot:Q96P44
        Length = 957

 Score = 121 (47.7 bits), Expect = 0.00060, P = 0.00060
 Identities = 58/205 (28%), Positives = 78/205 (38%)

Query:   199 IPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYD-AQ 256
             +P  PGY     PG D    P Y    G    P   PG    +G PGY  + G + D   
Sbjct:   462 LPGNPGYPGQ--PGQDGK--PGYQGIAGTPGVPGS-PGIQGARGLPGYKGEPGRDGDKGD 516

Query:   257 RG-PNYD-IHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRA 314
             RG P +  +H  P    + G   D +  P +  ++G   E     G+    GP  E  R 
Sbjct:   517 RGLPGFPGLHGMPGSKGEMGAKGD-KGSPGFYGKKGAKGEKGNA-GFPGLPGPAGEPGRH 574

Query:   315 PSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGT-GFDGAP--RGA-APHGQVPPPLNNVP 370
                     PG+  + G       AP  D +RG  G  G P  RG     G++ PP     
Sbjct:   575 GKDGLMGSPGFKGEAGSP----GAPGQDGTRGEPGIPGFPGNRGLMGQKGEIGPPGQQGK 630

Query:   371 YGSATPPARSGS-GQP-RGGNPARR 393
              G+   P   GS G P + G P  +
Sbjct:   631 KGAPGMPGLMGSNGSPGQPGTPGSK 655


>UNIPROTKB|B4DLD3 [details] [associations]
            symbol:SS18 "cDNA FLJ58120, highly similar to SSXT protein"
            species:9606 "Homo sapiens" [GO:0000226 "microtubule cytoskeleton
            organization" evidence=IEA] [GO:0000902 "cell morphogenesis"
            evidence=IEA] [GO:0005881 "cytoplasmic microtubule" evidence=IEA]
            [GO:0007243 "intracellular protein kinase cascade" evidence=IEA]
            [GO:0042493 "response to drug" evidence=IEA] [GO:0048013 "ephrin
            receptor signaling pathway" evidence=IEA] GO:GO:0000226
            GO:GO:0042493 GO:GO:0007243 GO:GO:0000902 GO:GO:0048013
            GO:GO:0005881 HOVERGEN:HBG003892 InterPro:IPR007726
            PANTHER:PTHR23107 Pfam:PF05030 UniGene:Hs.129261 EMBL:AC091021
            HGNC:HGNC:11340 ChiTaRS:SS18 EMBL:AK296949 IPI:IPI01011245
            STRING:B4DLD3 Ensembl:ENST00000542420 Uniprot:B4DLD3
        Length = 395

 Score = 116 (45.9 bits), Expect = 0.00063, P = 0.00063
 Identities = 66/236 (27%), Positives = 88/236 (37%)

Query:   128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQG---H--GPPPSATTAGVVGAGPNT--- 179
             G+YG     S     G  + Q      Y +PQG   H  G  P     G V  G +    
Sbjct:   165 GNYGPRPNMSMQPNQGPMMHQQPPSQQYNMPQGGGQHYQGQQPPMGMMGQVNQGNHMMGQ 224

Query:   180 -STSAYAATQSGTPMR-AAYDIPRGPGYE-ASKGP--GYDASKAPSYDPTKG---PSYDP 231
                  Y   Q G P + +  +   G  Y    +GP  G +    P      G   PSY P
Sbjct:   225 RQIPPYRPPQQGPPQQYSGQEDYYGDQYSHGGQGPPEGMNQQYYPDGHNDYGYQQPSY-P 283

Query:   232 AKGPGYD-PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 290
              +G  YD P +       +G N  +Q G   D ++GP   PQ+G     Q+ P      G
Sbjct:   284 EQG--YDRPYEDSSQHYYEGGN--SQYGQQQDAYQGPP--PQQGYPPQQQQYPGQQGYPG 337

Query:   291 P--GY-ETQRVPGYDVQRGPVYEAQRAPSYIP-QRGPGYD-LQRGQGYDMRRAPSY 341
                GY  +Q  PG      P  + Q+   Y P Q GP     QR  GYD  +  +Y
Sbjct:   338 QQQGYGPSQGGPGPQYPNYPQGQGQQYGGYRPTQPGPPQPPQQRPYGYDQGQYGNY 393


>UNIPROTKB|F1NNB3 [details] [associations]
            symbol:PRNP "Major prion protein" species:9031 "Gallus
            gallus" [GO:0051260 "protein homooligomerization" evidence=IEA]
            [GO:0001933 "negative regulation of protein phosphorylation"
            evidence=IEA] [GO:0005730 "nucleolus" evidence=IEA] [GO:0005783
            "endoplasmic reticulum" evidence=IEA] [GO:0005794 "Golgi apparatus"
            evidence=IEA] [GO:0005886 "plasma membrane" evidence=IEA]
            [GO:0006979 "response to oxidative stress" evidence=IEA]
            [GO:0008017 "microtubule binding" evidence=IEA] [GO:0032689
            "negative regulation of interferon-gamma production" evidence=IEA]
            [GO:0032700 "negative regulation of interleukin-17 production"
            evidence=IEA] [GO:0032703 "negative regulation of interleukin-2
            production" evidence=IEA] [GO:0042802 "identical protein binding"
            evidence=IEA] [GO:0043066 "negative regulation of apoptotic
            process" evidence=IEA] [GO:0043433 "negative regulation of
            sequence-specific DNA binding transcription factor activity"
            evidence=IEA] [GO:0046007 "negative regulation of activated T cell
            proliferation" evidence=IEA] [GO:0050860 "negative regulation of T
            cell receptor signaling pathway" evidence=IEA] [GO:0070885
            "negative regulation of calcineurin-NFAT signaling cascade"
            evidence=IEA] InterPro:IPR000817 InterPro:IPR022416 PROSITE:PS00291
            GO:GO:0005783 GO:GO:0005886 GO:GO:0005794 GO:GO:0043066
            GO:GO:0006979 GO:GO:0005730 GO:GO:0032689 GO:GO:0051260
            GO:GO:0005507 GO:GO:0043433 GO:GO:0001933 GO:GO:0046007
            GO:GO:0050860 GO:GO:0070885 GO:GO:0032703 GO:GO:0032700
            Gene3D:1.10.790.10 PANTHER:PTHR11522 GeneTree:ENSGT00510000049083
            EMBL:AADN02055483 IPI:IPI00819942 Ensembl:ENSGALT00000041079
            ArrayExpress:F1NNB3 Uniprot:F1NNB3
        Length = 125

 Score = 94 (38.1 bits), Expect = 0.00064, P = 0.00064
 Identities = 37/103 (35%), Positives = 46/103 (44%)

Query:   159 QGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKG----PGYD 214
             +G G P    + G  GAG +   S Y   Q G P    Y  P  PGY  + G    PGY 
Sbjct:     3 KGKGKP----SGGGWGAGSHRQPS-YPR-QPGYPHNPGY--PHNPGYPHNPGYPHNPGYP 54

Query:   215 ASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQR 257
              +  P Y P + P Y P   PGY P  G GY+   G +Y  Q+
Sbjct:    55 HN--PGY-P-QNPGY-P-HNPGY-PGWGQGYNPSSGGSYHNQK 90


>UNIPROTKB|Q03692 [details] [associations]
            symbol:COL10A1 "Collagen alpha-1(X) chain" species:9606
            "Homo sapiens" [GO:0046872 "metal ion binding" evidence=IEA]
            [GO:0005581 "collagen" evidence=IEA] [GO:0005938 "cell cortex"
            evidence=IEA] [GO:0001501 "skeletal system development"
            evidence=TAS] [GO:0005576 "extracellular region" evidence=TAS]
            [GO:0005788 "endoplasmic reticulum lumen" evidence=TAS] [GO:0030198
            "extracellular matrix organization" evidence=TAS]
            InterPro:IPR008983 Reactome:REACT_118779 GO:GO:0005938
            GO:GO:0046872 EMBL:CH471051 GO:GO:0030198 GO:GO:0005788
            GO:GO:0001501 HOGENOM:HOG000085653 HOVERGEN:HBG108220 GO:GO:0005581
            Gene3D:2.60.120.40 InterPro:IPR001073 InterPro:IPR008160
            Pfam:PF00386 Pfam:PF01391 PRINTS:PR00007 SMART:SM00110
            SUPFAM:SSF49842 PROSITE:PS50871 eggNOG:NOG114228 CTD:1300
            OrthoDB:EOG4FFD29 EMBL:X60382 EMBL:X72579 EMBL:X72580 EMBL:X98568
            EMBL:AL121963 EMBL:BC130621 EMBL:BC130623 EMBL:X65120 EMBL:X58879
            EMBL:M74050 EMBL:S68531 IPI:IPI00011685 PIR:S26396
            RefSeq:NP_000484.2 UniGene:Hs.520339 PDB:1GR3 PDBsum:1GR3
            ProteinModelPortal:Q03692 SMR:Q03692 MINT:MINT-101719 STRING:Q03692
            DMDM:2506306 PaxDb:Q03692 PRIDE:Q03692 Ensembl:ENST00000243222
            Ensembl:ENST00000327673 GeneID:1300 KEGG:hsa:1300 UCSC:uc003pwm.3
            GeneCards:GC06M116440 HGNC:HGNC:2185 MIM:120110 MIM:156500
            neXtProt:NX_Q03692 Orphanet:174 PharmGKB:PA26701 InParanoid:Q03692
            OMA:IKGPPPN PhylomeDB:Q03692 EvolutionaryTrace:Q03692
            GenomeRNAi:1300 NextBio:5279 ArrayExpress:Q03692 Bgee:Q03692
            CleanEx:HS_COL10A1 Genevestigator:Q03692 GermOnline:ENSG00000123500
            Uniprot:Q03692
        Length = 680

 Score = 119 (46.9 bits), Expect = 0.00065, P = 0.00065
 Identities = 76/279 (27%), Positives = 99/279 (35%)

Query:   120 PNVDRRADGSYGGATG-NSENETSGR--PVGQNAYEDGYGV--PQGHGPPPSATTAGVVG 174
             P V +R +    G  G   +    G   P+G    +   G   P+G G P +A   G  G
Sbjct:   217 PGVGKRGENGVPGQPGIKGDRGFPGEMGPIGPPGPQGPPGERGPEGIGKPGAAGAPGQPG 276

Query:   175 AGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKG 234
               P T     A   +G P    +  P  PG +  +GP       P     +GP+  P K 
Sbjct:   277 I-PGTKGLPGAPGIAGPPGPPGFGKPGLPGLKGERGPA-GLPGGPGAKGEQGPAGLPGK- 333

Query:   235 PGYD-PTKGPGYDAQKG-SNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRGP 291
             PG   P    G    KG        GP  +   GP+  P    G   +RG P  D +  P
Sbjct:   334 PGLTGPPGNMGPQGPKGIPGSHGLPGPKGET--GPA-GPAGYPGAKGERGSPGSDGK--P 388

Query:   292 GYETQRVPGYDVQRGPVYEAQRAPSYIPQRG--PGYDLQRGQGYDMRRAPSYDPSRGT-G 348
             GY  +  PG D  +G        P   P  G  PG     G     +  P ++   G  G
Sbjct:   389 GYPGK--PGLDGPKGN--PGLPGPKGDPGVGGPPGLPGPVGPA-GAKGMPGHNGEAGPRG 443

Query:   349 FDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 387
               G P    P G  PP +   P GS   P   G   P G
Sbjct:   444 APGIPGTRGPIG--PPGIPGFP-GSKGDPGSPGPPGPAG 479


>WB|WBGene00000653 [details] [associations]
            symbol:col-77 species:6239 "Caenorhabditis elegans"
            [GO:0042302 "structural constituent of cuticle" evidence=IEA]
            [GO:0016021 "integral to membrane" evidence=IEA] InterPro:IPR002486
            Pfam:PF01484 SMART:SM01088 InterPro:IPR008160 Pfam:PF01391
            EMBL:Z66498 GO:GO:0042302 HOGENOM:HOG000085656
            GeneTree:ENSGT00610000086159 PIR:T23801 RefSeq:NP_495759.1
            ProteinModelPortal:Q21562 DIP:DIP-26119N MINT:MINT-1050309
            STRING:Q21562 EnsemblMetazoa:M195.1 GeneID:174336
            KEGG:cel:CELE_M195.1 UCSC:M195.1 CTD:174336 WormBase:M195.1
            eggNOG:NOG315089 InParanoid:Q21562 OMA:IAFFGIC NextBio:883606
            Uniprot:Q21562
        Length = 304

 Score = 114 (45.2 bits), Expect = 0.00065, P = 0.00065
 Identities = 71/238 (29%), Positives = 87/238 (36%)

Query:   154 GYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG-PGYEASKGPG 212
             GYG P  +    + +  G    G   S  +  A   GTP     D   G PG +   G  
Sbjct:    85 GYGAPAEYSTDAAVSAGGSEAGGQCCSCGSGPAGPPGTPGEDGRDGNDGQPGPDGQPGSD 144

Query:   213 YDASKAPSYDPTKGPSYD-PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDP 271
               A   P+ D      +D PA  PG     GP     KG+  +A   P  D   G    P
Sbjct:   145 APAEAIPTADDF---CFDCPAGPPGPAGNAGP-----KGAPGNAG-APGNDGQAGAPGAP 195

Query:   272 QRGLGYDMQRGP-NYDMQRG-PGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQR 329
                 G D  +GP   D   G PG + Q  PG  V+   V      P   PQ  PG D Q 
Sbjct:   196 ----GNDGPQGPPGQDGAAGQPGPDGQ--PGV-VEEVAVPAGPPGPPG-PQGAPGTDGQP 247

Query:   330 GQ-GYDMRRAPSYDPSRGTGFDGAP--RGAA-PHGQVPPPLNNVPYGSATPPARSGSG 383
             G  G   +  P   P+   G DGAP   GAA   G+   P          PP R+  G
Sbjct:   248 GSAGQPGQDGPQ-GPAGDAGTDGAPGQAGAAGEQGEAGQPGEGGGCDHCPPP-RTAPG 303


>UNIPROTKB|E2RQK9 [details] [associations]
            symbol:PYGO2 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0060070 "canonical Wnt receptor signaling
            pathway" evidence=IEA] [GO:0060021 "palate development"
            evidence=IEA] [GO:0051569 "regulation of histone H3-K4 methylation"
            evidence=IEA] [GO:0048589 "developmental growth" evidence=IEA]
            [GO:0042393 "histone binding" evidence=IEA] [GO:0033599 "regulation
            of mammary gland epithelial cell proliferation" evidence=IEA]
            [GO:0030879 "mammary gland development" evidence=IEA] [GO:0009791
            "post-embryonic development" evidence=IEA] [GO:0007420 "brain
            development" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            [GO:0002088 "lens development in camera-type eye" evidence=IEA]
            [GO:0001822 "kidney development" evidence=IEA] [GO:0001701 "in
            utero embryonic development" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] InterPro:IPR001965 InterPro:IPR019787
            Pfam:PF00628 PROSITE:PS50016 SMART:SM00249 GO:GO:0005634
            GO:GO:0007420 GO:GO:0046872 GO:GO:0008270 GO:GO:0001701
            GO:GO:0009791 GO:GO:0001822 Gene3D:3.30.40.10 InterPro:IPR011011
            InterPro:IPR013083 SUPFAM:SSF57903 GO:GO:0048589 InterPro:IPR019786
            PROSITE:PS01359 GO:GO:0060021 GO:GO:0060070 GO:GO:0030879
            GO:GO:0033599 GO:GO:0051569 GO:GO:0002088
            GeneTree:ENSGT00530000063948 CTD:90780 OMA:PGLVYPC
            EMBL:AAEX03005346 RefSeq:XP_547562.2 Ensembl:ENSCAFT00000027172
            GeneID:490440 KEGG:cfa:490440 NextBio:20863469 Uniprot:E2RQK9
        Length = 405

 Score = 116 (45.9 bits), Expect = 0.00066, P = 0.00065
 Identities = 80/294 (27%), Positives = 106/294 (36%)

Query:   117 MNAPNVDRRADGSYGGATGN-SENETSGRP-----VGQNAYEDGYGVPQGHGPPPSATTA 170
             M +P   RR   + G A  + +E      P     V  N +ED +G P+  G  P    +
Sbjct:    38 MKSPEKKRRKSNTQGPAYSHLTEFAPPPTPMVDHLVASNPFEDDFGAPKVGGAAPPFLGS 97

Query:   171 GVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTK-GPSY 229
              V   G           Q G     A  +P G G     GP     + P + P   GP++
Sbjct:    98 PVPFGG--------FRVQGGM----AGQVPPGYGTGGGGGPQPLRRQPPPFPPNPMGPAF 145

Query:   230 D-PAKGPGYDPTKGPGYDAQK-----GSNYDAQRG---PNYDIHRGPSYDPQRGLGYDMQ 280
             + P +GPGY P     + +Q      G N+    G   P      GP   P  G     +
Sbjct:   146 NMPPQGPGYPPPGNMNFPSQPFNQPLGQNFSPPGGQMMPGPVGGFGPMISPTMGQPPRGE 205

Query:   281 RGPNYDMQRGPGYETQRVP-GYDVQRGPVYEAQRAPSYIPQRGP--GYDLQ-RGQGYDMR 336
              GP+   QR   +     P G  +QR P    Q  PS  P   P  G D    G G +  
Sbjct:   206 LGPHSLPQR---FAQPGAPFGPSLQR-P---GQGLPSLPPNTSPFPGPDPGFPGPGGEDG 258

Query:   337 RAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNP 390
               P  +P   T F   P   +P   V     N P   + PP  SG G   GG P
Sbjct:   259 GKP-LNPPAPTAFPQEPHSGSPAAAVN---GNQP---SFPPNSSGRG---GGTP 302


>UNIPROTKB|Q2KFJ6 [details] [associations]
            symbol:MGCH7_ch7g689 "Putative uncharacterized protein"
            species:242507 "Magnaporthe oryzae 70-15" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            EMBL:CM000230 Uniprot:Q2KFJ6
        Length = 356

 Score = 115 (45.5 bits), Expect = 0.00068, P = 0.00068
 Identities = 72/273 (26%), Positives = 97/273 (35%)

Query:    69 KNAITFHLCRGTYEYEKKFYNDHLE-SLQVMEKNYITM-----ATEVEKLRAELMNAPNV 122
             +  IT  +CR     +      HLE   +V++  YIT      +  +E L+ +       
Sbjct:    38 REVITADICRYLGN-DALVRPGHLERDGRVVQGYYITAYRNLTSAMIESLKEDSQKWVEE 96

Query:   123 DRRADGSYGGAT--GNSEN---ETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGP 177
              RRA G+ GG    G S N     S  P  Q  Y D   +   +      T  GV    P
Sbjct:    97 KRRAQGAQGGTKYPGGSANCSARKSNSPTAQMRYMDS-SLRNPNAVSQHMT--GVARDYP 153

Query:   178 NTSTSAYAATQSGTPMRAAYDIP-RGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG 236
             + S +AY+ +            P R  GY A+  PG    + P Y   + P    A+   
Sbjct:   154 D-SQAAYSESYGAGGQGGFGQYPSRDQGY-AAPPPGSFPPREPVYADRQDPYGAQARATS 211

Query:   237 YDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYD-MQRGPGYET 295
                    GY  Q    Y A  G N      P+  PQ+  G  MQ  P+Y    +G  Y  
Sbjct:   212 QQYVSA-GYGQQADGPYHAT-GMNRQYAAPPA--PQQAYGDPMQITPSYPPTSQGGAYSP 267

Query:   296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQ 328
             Q    Y     P      AP Y PQ  P    Q
Sbjct:   268 QAQQPYYAGAAP---PPGAPRYDPQGVPATSAQ 297


>MGI|MGI:3040693 [details] [associations]
            symbol:Zmiz1 "zinc finger, MIZ-type containing 1"
            species:10090 "Mus musculus" [GO:0001570 "vasculogenesis"
            evidence=IMP] [GO:0001701 "in utero embryonic development"
            evidence=IMP] [GO:0003007 "heart morphogenesis" evidence=IMP]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0005737 "cytoplasm"
            evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0007296 "vitellogenesis"
            evidence=IMP] [GO:0007569 "cell aging" evidence=IDA] [GO:0008270
            "zinc ion binding" evidence=IEA] [GO:0045944 "positive regulation
            of transcription from RNA polymerase II promoter" evidence=IMP]
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0048146 "positive
            regulation of fibroblast proliferation" evidence=IMP] [GO:0048589
            "developmental growth" evidence=IMP] [GO:0048844 "artery
            morphogenesis" evidence=IMP] InterPro:IPR004181 Pfam:PF02891
            PROSITE:PS51044 MGI:MGI:3040693 GO:GO:0005737 GO:GO:0046872
            GO:GO:0016607 GO:GO:0003007 GO:GO:0008270 GO:GO:0001701
            GO:GO:0045944 GO:GO:0006351 Gene3D:3.30.40.10 InterPro:IPR013083
            GO:GO:0048589 GO:GO:0001570 GO:GO:0048146 GO:GO:0048844
            GO:GO:0007569 GO:GO:0007296 GeneTree:ENSGT00550000074410 CTD:57178
            eggNOG:NOG237400 HOGENOM:HOG000253014 HOVERGEN:HBG056252
            OMA:MNQYGPM OrthoDB:EOG45MN70 ChiTaRS:ZMIZ1 EMBL:BC057691
            EMBL:BC058646 EMBL:BC065120 EMBL:AK054366 IPI:IPI00226072
            IPI:IPI00480418 RefSeq:NP_899031.2 UniGene:Mm.227484
            UniGene:Mm.486339 UniGene:Mm.489608 ProteinModelPortal:Q6P1E1
            SMR:Q6P1E1 IntAct:Q6P1E1 STRING:Q6P1E1 PhosphoSite:Q6P1E1
            PaxDb:Q6P1E1 PRIDE:Q6P1E1 Ensembl:ENSMUST00000007961
            Ensembl:ENSMUST00000162645 GeneID:328365 KEGG:mmu:328365
            UCSC:uc007srn.1 UCSC:uc007sro.1 InParanoid:Q6P1E1 NextBio:398259
            Bgee:Q6P1E1 CleanEx:MM_ZMIZ1 Genevestigator:Q6P1E1
            GermOnline:ENSMUSG00000007817 Uniprot:Q6P1E1
        Length = 1072

 Score = 121 (47.7 bits), Expect = 0.00069, P = 0.00069
 Identities = 65/232 (28%), Positives = 84/232 (36%)

Query:   176 GPNTSTSAYAATQSGTPMRAAYDIPRGPG-YEASKGP-GYDASKAPSYDPTKGP--SYDP 231
             GP  S+     TQ+          PRGP     S  P G  A   PS     GP    + 
Sbjct:   318 GPVCSSFQMGPTQAYNSQFMNQPGPRGPASMGGSLNPAGMAAGMTPS--GMSGPPMGMNQ 375

Query:   232 AKGPGYDP--TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQR 289
              + PG  P  T G     Q       Q  P   I R    +P  G   + Q GPN     
Sbjct:   376 PRPPGISPFGTHGQRMPQQTYPGPRPQSLPIQSIKRPYPGEPNYG---NQQYGPNSQFPT 432

Query:   290 GPG-YETQRVPGYDVQRGPVYEAQRAPSYIPQRG--PGYDLQRGQGYDMRRAPSYDP--S 344
              PG Y T   P       P Y  QR PS  P  G  P   +  GQ Y   +    +   S
Sbjct:   433 QPGQYPTPNPPR--PLTSPNYPGQRMPSQ-PSTGQYPPPTVNMGQYYKPEQFNGQNNTFS 489

Query:   345 RGTGFDGAPRGAAPHGQVPPPLNNVPY----GSATPPARSGSGQPRGGNPAR 392
              G+ +    +G+      P P+ N P+    G+ TPP   GS  P   +P++
Sbjct:   490 SGSSYSSYSQGSVNRPPRPVPVANYPHSPVPGNPTPPMTPGSSIPPYLSPSQ 541


>UNIPROTKB|Q15532 [details] [associations]
            symbol:SS18 "Protein SSXT" species:9606 "Homo sapiens"
            [GO:0006351 "transcription, DNA-dependent" evidence=IEA]
            [GO:0000226 "microtubule cytoskeleton organization" evidence=IEA]
            [GO:0000902 "cell morphogenesis" evidence=IEA] [GO:0005881
            "cytoplasmic microtubule" evidence=IEA] [GO:0007243 "intracellular
            protein kinase cascade" evidence=IEA] [GO:0042493 "response to
            drug" evidence=IEA] [GO:0048013 "ephrin receptor signaling pathway"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0030374
            "ligand-dependent nuclear receptor transcription coactivator
            activity" evidence=IDA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0045944 "positive regulation of transcription from RNA
            polymerase II promoter" evidence=IDA] GO:GO:0005634 GO:GO:0000226
            GO:GO:0042493 GO:GO:0045944 GO:GO:0007243 GO:GO:0006351
            EMBL:CH471088 GO:GO:0000902 Orphanet:3273 GO:GO:0048013
            GO:GO:0005881 GO:GO:0030374 HOVERGEN:HBG003892 InterPro:IPR007726
            PANTHER:PTHR23107 Pfam:PF05030 UniGene:Hs.129261 EMBL:X79200
            EMBL:S79894 EMBL:X79201 EMBL:AF343880 EMBL:EF445031 EMBL:BC096223
            IPI:IPI00452919 IPI:IPI00940186 PIR:S46269 RefSeq:NP_001007560.1
            RefSeq:NP_005628.2 ProteinModelPortal:Q15532 IntAct:Q15532
            STRING:Q15532 PhosphoSite:Q15532 DMDM:20141795 PaxDb:Q15532
            PRIDE:Q15532 DNASU:6760 Ensembl:ENST00000269137
            Ensembl:ENST00000415083 GeneID:6760 KEGG:hsa:6760 UCSC:uc002kvm.3
            CTD:6760 GeneCards:GC18M023596 HGNC:HGNC:11340 MIM:600192
            neXtProt:NX_Q15532 PharmGKB:PA36164 eggNOG:NOG274014
            InParanoid:Q15532 KO:K15623 OrthoDB:EOG4RFKTH PhylomeDB:Q15532
            ChiTaRS:SS18 GenomeRNAi:6760 NextBio:26388 ArrayExpress:Q15532
            Bgee:Q15532 CleanEx:HS_SS18 Genevestigator:Q15532
            GermOnline:ENSG00000141380 Uniprot:Q15532
        Length = 418

 Score = 116 (45.9 bits), Expect = 0.00069, P = 0.00069
 Identities = 66/236 (27%), Positives = 88/236 (37%)

Query:   128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQG---H--GPPPSATTAGVVGAGPNT--- 179
             G+YG     S     G  + Q      Y +PQG   H  G  P     G V  G +    
Sbjct:   188 GNYGPRPNMSMQPNQGPMMHQQPPSQQYNMPQGGGQHYQGQQPPMGMMGQVNQGNHMMGQ 247

Query:   180 -STSAYAATQSGTPMR-AAYDIPRGPGYE-ASKGP--GYDASKAPSYDPTKG---PSYDP 231
                  Y   Q G P + +  +   G  Y    +GP  G +    P      G   PSY P
Sbjct:   248 RQIPPYRPPQQGPPQQYSGQEDYYGDQYSHGGQGPPEGMNQQYYPDGHNDYGYQQPSY-P 306

Query:   232 AKGPGYD-PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 290
              +G  YD P +       +G N  +Q G   D ++GP   PQ+G     Q+ P      G
Sbjct:   307 EQG--YDRPYEDSSQHYYEGGN--SQYGQQQDAYQGPP--PQQGYPPQQQQYPGQQGYPG 360

Query:   291 P--GY-ETQRVPGYDVQRGPVYEAQRAPSYIP-QRGPGYD-LQRGQGYDMRRAPSY 341
                GY  +Q  PG      P  + Q+   Y P Q GP     QR  GYD  +  +Y
Sbjct:   361 QQQGYGPSQGGPGPQYPNYPQGQGQQYGGYRPTQPGPPQPPQQRPYGYDQGQYGNY 416


>WB|WBGene00000627 [details] [associations]
            symbol:col-50 species:6239 "Caenorhabditis elegans"
            [GO:0042302 "structural constituent of cuticle" evidence=IEA]
            [GO:0016021 "integral to membrane" evidence=IEA] InterPro:IPR002486
            Pfam:PF01484 SMART:SM01088 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0042302 HOGENOM:HOG000085656 GeneTree:ENSGT00530000064217
            EMBL:FO080999 PIR:T15142 RefSeq:NP_491194.1 UniGene:Cel.16665
            ProteinModelPortal:O01662 EnsemblMetazoa:T28F2.6 GeneID:189050
            KEGG:cel:CELE_T28F2.6 UCSC:T28F2.6 CTD:189050 WormBase:T28F2.6
            eggNOG:NOG279371 InParanoid:O01662 OMA:AGNCITC NextBio:941028
            Uniprot:O01662
        Length = 418

 Score = 116 (45.9 bits), Expect = 0.00069, P = 0.00069
 Identities = 79/285 (27%), Positives = 95/285 (33%)

Query:   120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNT 179
             P  +  A+G+ GG       + SG P G        G     G P  A   G  G   + 
Sbjct:    96 PAKEGYAEGAGGGGGCQCAAQASGCPAGPPGPPGEAGAD---GEPGEAGQDGAAGEAGSA 152

Query:   180 STSAYAATQSGTPMRAAYDIPRGP-GYEASKGP----GYDASKAPSYDPTKGPSYDPAKG 234
              T A AA    T   A    P GP G     GP    G D   A   +P  GP+  PA  
Sbjct:   153 DTYAGAAGNCIT-CPAGPPGPPGPDGNAGPAGPAGAAGPDGEGAGYAEP--GPA-GPAGP 208

Query:   235 PGYDPTKG-PGYDAQKGSNYDAQRG-PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP- 291
             PG D   G PG D Q G+        P      GP   P    G D    P+     GP 
Sbjct:   209 PGPDGQPGAPGPDGQPGAGGTTSTNQPGPPGPAGPP-GPAGPAGEDAYAQPSPAGTPGPP 267

Query:   292 ---GYETQR-------VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSY 341
                G + +         PG D   GP  +A   P      G G   + G       A  Y
Sbjct:   268 GPPGKDGEAGPDGPAGAPGTDGAPGP--DAAYCPCPPRTLGAGAYPEGGDAAAAAPAGGY 325

Query:   342 DPSRGTGFDGAPRGAAPHGQVPPPLNNV-PYGSATPPARSGSGQP 385
             D   G   + AP  AA     P P     P G     A +G+  P
Sbjct:   326 DGGAGAAPEAAPAAAAAPQPAPAPAAAAAPAGGYQGGAAAGAAAP 370


>ZFIN|ZDB-GENE-040407-1 [details] [associations]
            symbol:cherp "calcium homeostasis endoplasmic
            reticulum protein" species:7955 "Danio rerio" [GO:0003723 "RNA
            binding" evidence=IEA] [GO:0006396 "RNA processing" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] InterPro:IPR000061
            InterPro:IPR000467 Pfam:PF01585 Pfam:PF01805 PROSITE:PS50128
            PROSITE:PS50174 SMART:SM00443 SMART:SM00648 ZFIN:ZDB-GENE-040407-1
            GO:GO:0003723 GO:GO:0006396 Gene3D:1.25.40.90 InterPro:IPR008942
            SUPFAM:SSF48464 HOGENOM:HOG000010294 HOVERGEN:HBG052716
            InterPro:IPR006903 InterPro:IPR006569 Pfam:PF04818 SUPFAM:SSF109905
            PROSITE:PS51391 EMBL:BC171627 IPI:IPI00490676 UniGene:Dr.75231
            ArrayExpress:B7ZVL5 Bgee:B7ZVL5 Uniprot:B7ZVL5
        Length = 910

 Score = 120 (47.3 bits), Expect = 0.00073, P = 0.00073
 Identities = 61/221 (27%), Positives = 93/221 (42%)

Query:   190 GTPMRAAYDIPR-GPGYEASKGPG-YDAS-KAPSYDPTKGPSYDPAKGPGYDPTKGPGYD 246
             G+  R++   P   P   +SK PG +D     P+++P + P +DP++ P   P   P ++
Sbjct:   383 GSQNRSSDSNPALSPEMSSSK-PGWFDPQHNMPAWNPQQPPPFDPSQAP--PPC--PPWN 437

Query:   247 AQKGSNYDAQRGPNYDIHR--GP---SYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGY 301
             + +G  ++ QR PN+   R  GP     DP        ++ P    Q  P    QR P +
Sbjct:   438 SHEGL-WNEQRDPNWSDPRDGGPWSGQNDPPPSWSGQYEQPPWSSQQDQPPPWGQREPPF 496

Query:   302 DVQRGPVYEAQRAPSYIP----QRGPGYDLQRGQGYDMR---------RAPSYDPSRGTG 348
              +QR P +     P   P    Q  P ++  R     M+           P Y P     
Sbjct:   497 RMQRPPHFRGPFPPHQQPPPFNQPPPPHNFGRFPPRFMQDDFPPRHHFERPPYPPHH--- 553

Query:   349 FDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGN 389
             FD  P+G  P G++ PP ++ P     PP  S    P GGN
Sbjct:   554 FD-YPQGDFP-GEIGPPPHHHPNQRIPPPGLSDP-PPWGGN 591


>UNIPROTKB|Q767K9 [details] [associations]
            symbol:PPP1R10 "Serine/threonine-protein phosphatase 1
            regulatory subunit 10" species:9823 "Sus scrofa" [GO:0000785
            "chromatin" evidence=ISS] [GO:0072357 "PTW/PP1 phosphatase complex"
            evidence=ISS] [GO:0005634 "nucleus" evidence=IEA] [GO:0004864
            "protein phosphatase inhibitor activity" evidence=IEA] [GO:0003723
            "RNA binding" evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA]
            [GO:0008270 "zinc ion binding" evidence=IEA] [GO:0006351
            "transcription, DNA-dependent" evidence=IEA] InterPro:IPR000571
            InterPro:IPR003617 InterPro:IPR017923 Pfam:PF00642 Pfam:PF08711
            PROSITE:PS50103 PROSITE:PS51319 SMART:SM00356 SMART:SM00509
            GO:GO:0005634 GO:GO:0046872 GO:GO:0003677 GO:GO:0008270
            GO:GO:0000785 GO:GO:0006351 GO:GO:0003723 EMBL:AB113357
            GO:GO:0004864 Gene3D:1.20.930.10 SUPFAM:SSF47676 GO:GO:0072357
            CTD:5514 eggNOG:NOG69306 HOGENOM:HOG000049285 HOVERGEN:HBG053646
            OMA:PPPHEHR OrthoDB:EOG451DQK GeneTree:ENSGT00530000063820
            RefSeq:NP_001116637.1 UniGene:Ssc.39454 ProteinModelPortal:Q767K9
            Ensembl:ENSSSCT00000001463 Ensembl:ENSSSCT00000034462
            GeneID:100144450 KEGG:ssc:100144450 ArrayExpress:Q767K9
            Uniprot:Q767K9
        Length = 925

 Score = 120 (47.3 bits), Expect = 0.00075, P = 0.00075
 Identities = 71/271 (26%), Positives = 86/271 (31%)

Query:   128 GSYGGATGNSENETS-GRPV-GQNAYEDGYGVPQGH---GPPPSATTAGVVGAGPNTSTS 182
             G  GG  G        G P+ G +    G G P G    GPPP          GP     
Sbjct:   632 GGPGGPKGMQHFPPGPGGPMPGPHGGPGGPGGPVGPRLLGPPPPPRGGDPFWDGPGDP-- 689

Query:   183 AYAATQSGTPMRAAYDIPRGPG-YEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK 241
                    G PMR     P GPG Y   +G        P   P +G     + G   +   
Sbjct:   690 -----MRGGPMRGGPG-P-GPGPYHRGRGGRGGNEPPPPPPPFRGARGGRSGGGPPNGRG 742

Query:   242 GPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGY 301
             GPG     G  +    GP   +  G  + P  G G  M  G  +    GPG       G+
Sbjct:   743 GPGGGMVGGGGHRPHEGPGGGMSSGSGHRPHEGPGGGM--GGGHRPHEGPGGGMGG--GH 798

Query:   302 DVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQ 361
                 GP         + P  GPG  +  G G+         P  G G  G P G  PH  
Sbjct:   799 RPHEGPGGGMGGGSGHRPHEGPGGGMGAGGGHR--------PHEGPGH-GGPHGHRPH-D 848

Query:   362 VPPPLNNVPYGSATPPARSGSGQPRGGNPAR 392
             VP    +   G      R   G   GG   R
Sbjct:   849 VPGHRGHDHRGPPPHEHRGHDGPGHGGGGHR 879


>UNIPROTKB|P12107 [details] [associations]
            symbol:COL11A1 "Collagen alpha-1(XI) chain" species:9606
            "Homo sapiens" [GO:0046872 "metal ion binding" evidence=IEA]
            [GO:0001502 "cartilage condensation" evidence=IEA] [GO:0002063
            "chondrocyte development" evidence=IEA] [GO:0006029 "proteoglycan
            metabolic process" evidence=IEA] [GO:0042472 "inner ear
            morphogenesis" evidence=IEA] [GO:0048704 "embryonic skeletal system
            morphogenesis" evidence=IEA] [GO:0055010 "ventricular cardiac
            muscle tissue morphogenesis" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=NAS]
            [GO:0005592 "collagen type XI" evidence=IDA;NAS] [GO:0030198
            "extracellular matrix organization" evidence=NAS;TAS] [GO:0030674
            "protein binding, bridging" evidence=NAS] [GO:0007601 "visual
            perception" evidence=IMP] [GO:0007605 "sensory perception of sound"
            evidence=IMP] [GO:0050910 "detection of mechanical stimulus
            involved in sensory perception of sound" evidence=IMP] [GO:0030199
            "collagen fibril organization" evidence=NAS] [GO:0050840
            "extracellular matrix binding" evidence=NAS] [GO:0005576
            "extracellular region" evidence=TAS] [GO:0005788 "endoplasmic
            reticulum lumen" evidence=TAS] InterPro:IPR000885 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS51461 SMART:SM00038 Reactome:REACT_118779
            GO:GO:0046872 GO:GO:0007601 GO:GO:0030199 GO:GO:0030674
            EMBL:CH471097 Gene3D:2.60.120.200 InterPro:IPR008985
            InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0005788 GO:GO:0042472
            GO:GO:0050910 InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793
            InterPro:IPR001791 SMART:SM00282 PROSITE:PS50025 GO:GO:0001502
            GO:GO:0048704 GO:GO:0001503 GO:GO:0050840 GO:GO:0006029
            GO:GO:0055010 Pfam:PF02210 GO:GO:0005201 GO:GO:0002063
            HOVERGEN:HBG004933 KO:K06236 OrthoDB:EOG49GKHM SMART:SM00210
            EMBL:J04177 EMBL:AF101112 EMBL:AF101079 EMBL:AF101080 EMBL:AF101081
            EMBL:AF101082 EMBL:AF101083 EMBL:AF101084 EMBL:AF101085
            EMBL:AF101086 EMBL:AF101087 EMBL:AF101088 EMBL:AF101089
            EMBL:AF101090 EMBL:AF101091 EMBL:AF101092 EMBL:AF101093
            EMBL:AF101094 EMBL:AF101095 EMBL:AF101096 EMBL:AF101097
            EMBL:AF101098 EMBL:AF101099 EMBL:AF101100 EMBL:AF101101
            EMBL:AF101102 EMBL:AF101103 EMBL:AF101104 EMBL:AF101105
            EMBL:AF101106 EMBL:AF101107 EMBL:AF101108 EMBL:AF101109
            EMBL:AF101110 EMBL:AF101111 EMBL:AL627203 EMBL:AC093150
            EMBL:AC099567 EMBL:L38956 IPI:IPI00218539 IPI:IPI00218540
            IPI:IPI00295575 PIR:A35239 RefSeq:NP_001177638.1 RefSeq:NP_001845.3
            RefSeq:NP_542196.2 UniGene:Hs.523446 ProteinModelPortal:P12107
            SMR:P12107 STRING:P12107 PhosphoSite:P12107 DMDM:215274245
            PaxDb:P12107 PRIDE:P12107 Ensembl:ENST00000353414
            Ensembl:ENST00000358392 Ensembl:ENST00000370096 GeneID:1301
            KEGG:hsa:1301 UCSC:uc001dul.3 UCSC:uc001dum.3 UCSC:uc001dun.3
            CTD:1301 GeneCards:GC01M103342 H-InvDB:HIX0028847 HGNC:HGNC:2186
            MIM:120280 MIM:154780 MIM:228520 MIM:604841 neXtProt:NX_P12107
            Orphanet:2021 Orphanet:560 Orphanet:90654 PharmGKB:PA26702
            OMA:HPGKEGQ GenomeRNAi:1301 NextBio:5283 PMAP-CutDB:B1ASK7
            ArrayExpress:P12107 Bgee:P12107 CleanEx:HS_COL11A1
            Genevestigator:P12107 GermOnline:ENSG00000060718 GO:GO:0005592
            Uniprot:P12107
        Length = 1806

 Score = 123 (48.4 bits), Expect = 0.00076, P = 0.00076
 Identities = 74/250 (29%), Positives = 96/250 (38%)

Query:   156 GVPQGHGPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYD 214
             G P   GPP      G  G  GP         T    P R   D  +GP   A +     
Sbjct:   466 GDPGDRGPPGRPGLPGADGLPGP-------PGTMLMLPFRYGGDGSKGPTISAQEAQA-Q 517

Query:   215 ASKAPSYDPTKGPSYDPA-KG-PGYDPTKGPGYDAQKGSNYD-AQRGPN-YDIHRGPSYD 270
             A    +    +GP       G PG  P  GPG    KG + D   +GP       GP+  
Sbjct:   518 AILQQARIALRGPPGPMGLTGRPG--PVGGPGSSGAKGESGDPGPQGPRGVQGPPGPTGK 575

Query:   271 P-QRGL-GYDMQRG-PNYDMQRGP-GYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYD 326
             P +RG  G D  RG P     +G  G++   +PG    +G  +  +R P   P   PG D
Sbjct:   576 PGKRGRPGADGGRGMPGEPGAKGDRGFDG--LPGLPGDKG--HRGERGPQG-PPGPPGDD 630

Query:   327 LQRGQGYDM--RRAPSYDPSRGT----GFDGAPR--GAAPHGQVPPPLNNV-PYGSATPP 377
               RG+  ++  R  P     RG     G  GAP   G A     P P  N+ P G   PP
Sbjct:   631 GMRGEDGEIGPRGLPGEAGPRGLLGPRGTPGAPGQPGMAGVDGPPGPKGNMGPQGEPGPP 690

Query:   378 ARSGSGQPRG 387
              + G+  P+G
Sbjct:   691 GQQGNPGPQG 700


>WB|WBGene00001734 [details] [associations]
            symbol:grl-25 species:6239 "Caenorhabditis elegans"
            [GO:0016021 "integral to membrane" evidence=IEA] [GO:0040010
            "positive regulation of growth rate" evidence=IMP] [GO:0009792
            "embryo development ending in birth or egg hatching" evidence=IMP]
            [GO:0000003 "reproduction" evidence=IMP] GO:GO:0009792
            GO:GO:0040010 GO:GO:0000003 EMBL:Z11126
            GeneTree:ENSGT00570000079107 EMBL:Z12018 RefSeq:NP_001023025.1
            ProteinModelPortal:G5EDQ6 EnsemblMetazoa:ZK643.8 GeneID:176265
            KEGG:cel:CELE_ZK643.8 CTD:176265 WormBase:ZK643.8 OMA:QYLGAYA
            NextBio:891834 Uniprot:G5EDQ6
        Length = 774

 Score = 119 (46.9 bits), Expect = 0.00077, P = 0.00077
 Identities = 70/278 (25%), Positives = 101/278 (36%)

Query:   126 ADGSYGGATGNSENETSGRPV----GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTST 181
             +  S GG +G  E+ +SG       G ++   G G   G     S++++G    G ++S+
Sbjct:   343 SSSSGGGYSGGGESSSSGGSSYSSGGDSSSSSGGGYSSGGDSSSSSSSSGGYSGGSDSSS 402

Query:   182 SAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGP----SYDPAKGPGY 237
             S+  ++ SG       D     G E+S   GY  S +   + + G     S +PA  P  
Sbjct:   403 SS--SSSSGGYSSGGGDAGASSGGESSSAGGYSGSSSSGGEASSGGYSGGSSEPAPAPEA 460

Query:   238 DPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQR 297
              P    GY    GS    +  P       PS     G     +  P        G E   
Sbjct:   461 APASSGGYSG--GSEAAPEAAP-----AAPS-GGYSGSEAAPEAAPAAPSGGYSGSEAAP 512

Query:   298 VPGYDVQRGPVYEAQRAPSYIPQR-GPGYDLQRGQGYDMRRAPSYDPSRG-TGFDGAPRG 355
                     G    ++ AP   P     GY    G       AP+  PS G +G + AP  
Sbjct:   513 EAAPAAPSGGYSGSEAAPEAAPAAPSGGYS---GSEAAPEAAPAA-PSGGYSGSEAAPEA 568

Query:   356 A--APHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPA 391
             A  AP G      ++ P  +A  PA S  G   GG  A
Sbjct:   569 APAAPSGGYSGSESSAP--AAPEPAPSSGGYSGGGGDA 604


>WB|WBGene00000639 [details] [associations]
            symbol:col-63 species:6239 "Caenorhabditis elegans"
            [GO:0042302 "structural constituent of cuticle" evidence=IEA]
            [GO:0016021 "integral to membrane" evidence=IEA] InterPro:IPR002486
            Pfam:PF01484 SMART:SM01088 GO:GO:0042302 HOGENOM:HOG000085656
            GeneTree:ENSGT00530000064217 EMBL:Z81143 PIR:T27806
            RefSeq:NP_492245.1 ProteinModelPortal:Q94399 STRING:Q94399
            EnsemblMetazoa:ZK265.2 GeneID:172607 KEGG:cel:CELE_ZK265.2
            UCSC:ZK265.2 CTD:172607 WormBase:ZK265.2 eggNOG:NOG289407
            InParanoid:Q94399 OMA:ENGQDGQ NextBio:876231 Uniprot:Q94399
        Length = 381

 Score = 115 (45.5 bits), Expect = 0.00077, P = 0.00077
 Identities = 82/282 (29%), Positives = 103/282 (36%)

Query:   132 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSG 190
             G  G + N  S    G    +    V    GPP      G  G  GP+           G
Sbjct:   122 GPAGRAGNPGSDSTEGDRMADFNKDVKCPAGPPGPPGPNGFPGHPGPDGDFGV-----DG 176

Query:   191 TPMRAAYDIPRGP-GYEASKG-PGYDASKAP-SYDPTKGPSYD-PAKGPGYDPTKGPGYD 246
             T  +     P GP G E + G PG      P   + T+G     P   PG  P  GPG D
Sbjct:   177 TNGKDGEPGPDGPEGDEGTPGLPGPPGEDGPVGQNGTRGQGQPGPVGAPGA-PG-GPGRD 234

Query:   247 AQKGSN-YDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRGPGYETQRV--PGYD 302
              + G N  D Q+GP      GP+       G D Q G P  D   GP  +   V  PG D
Sbjct:   235 GEPGENGQDGQQGPE-----GPA-------GADGQPGHPGPD---GPSGDVGEVGAPGAD 279

Query:   303 VQRGPV--YEAQRAPSYIPQRGPG-YDLQRG---QGYDMRRAPSYDPSRGTGFDGAPRGA 356
                 P     A+ A +      P  Y+       +GYD   +P+  P+   G+D AP   
Sbjct:   280 AAYCPCPPRSAEMAATGSSDSQPASYEAPAPAATKGYD---SPA--PAAPKGYD-APAPT 333

Query:   357 APHGQVPPPLNNVP-----YGSATPPARSGSGQPRGGNPARR 393
             APH   PPP    P     Y S  P A +    P    P +R
Sbjct:   334 APHP--PPPAPVAPPKLHDYESPAPVADAHDAAP-AAQPYKR 372


>ZFIN|ZDB-GENE-070501-8 [details] [associations]
            symbol:col6a3 "collagen, type VI, alpha 3"
            species:7955 "Danio rerio" [GO:0004867 "serine-type endopeptidase
            inhibitor activity" evidence=IEA] InterPro:IPR002035
            InterPro:IPR002223 InterPro:IPR003961 Pfam:PF00014 Pfam:PF00092
            PRINTS:PR00759 PROSITE:PS50234 PROSITE:PS50279 PROSITE:PS50853
            SMART:SM00131 SMART:SM00327 ZFIN:ZDB-GENE-070501-8 GO:GO:0004867
            Gene3D:4.10.410.10 InterPro:IPR020901 SUPFAM:SSF57362
            PROSITE:PS00280 InterPro:IPR008160 Pfam:PF01391
            GeneTree:ENSGT00530000063022 EMBL:CR545476 IPI:IPI01023461
            Ensembl:ENSDART00000138754 ArrayExpress:F1QKE8 Uniprot:F1QKE8
        Length = 3733

 Score = 126 (49.4 bits), Expect = 0.00078, P = 0.00078
 Identities = 68/203 (33%), Positives = 82/203 (40%)

Query:   204 GYEASKG-PGYD-ASKAPSYDPTKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGPN 260
             G     G PG D   + P  DP  G +  PA  PG D  KG PG   ++GS  D +RGP 
Sbjct:  2701 GIRGDPGTPGRDNTQRGPKGDP--GDA-GPAGEPGVDGNKGGPGEPGRRGS--DGRRGPP 2755

Query:   261 YDIHRG--PSYDPQRGL-GYDMQRGPNYDMQ----RGP-GYETQRVPGYDVQRGPVYEAQ 312
                     P  D   G  G    RGP   +     RG  G    R PG   Q GP  E  
Sbjct:  2756 GQAGAAGRPGSDGLAGEPGIGGSRGPAGPIGAPGVRGEDGNPGPRGPGG--QPGPAGEKG 2813

Query:   313 RAPSYIPQRG-PGYDLQRG-QG-YDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNV 369
             R  + + ++G PG    +G  G +  R  P  D   G GF G P+G    G    P    
Sbjct:  2814 RRGA-VGRKGEPGEPGPKGVTGPFGPRGEPGEDGRDGFGFPG-PKGRK--GDEGFPGFPG 2869

Query:   370 PYGSATPPARSGSGQPRGGNPAR 392
             P G A  P  +G   PRG N  R
Sbjct:  2870 PKGEAGDPGTNGGPGPRGNNGQR 2892


>MGI|MGI:3645678 [details] [associations]
            symbol:Flg2 "filaggrin family member 2" species:10090 "Mus
            musculus" [GO:0005509 "calcium ion binding" evidence=IEA]
            [GO:0005575 "cellular_component" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] [GO:0046872 "metal ion binding"
            evidence=IEA] InterPro:IPR001751 InterPro:IPR002048
            InterPro:IPR011992 PROSITE:PS00303 PROSITE:PS50222 Prosite:PS00018
            MGI:MGI:3645678 GO:GO:0005509 Gene3D:1.10.238.10 InterPro:IPR018247
            eggNOG:NOG12793 InterPro:IPR013787 Pfam:PF01023 CTD:388698
            HOGENOM:HOG000112590 KO:K10384 OrthoDB:EOG4RJG10 EMBL:DQ118292
            EMBL:AK036878 IPI:IPI00406870 RefSeq:NP_001013826.1
            UniGene:Mm.10755 HSSP:P24480 ProteinModelPortal:Q2VIS4 SMR:Q2VIS4
            STRING:Q2VIS4 PhosphoSite:Q2VIS4 PaxDb:Q2VIS4 PRIDE:Q2VIS4
            GeneID:229574 KEGG:mmu:229574 UCSC:uc008qfe.1 InParanoid:Q2VIS4
            NextBio:379521 Genevestigator:Q2VIS4 Uniprot:Q2VIS4
        Length = 2362

 Score = 124 (48.7 bits), Expect = 0.00079, P = 0.00079
 Identities = 64/247 (25%), Positives = 95/247 (38%)

Query:   128 GSYGGATGNSENET----SGRPVGQNAYEDGYGVPQ-GHGPPPSATTAGVVGAGPNTSTS 182
             G   G  G+ + E+     GRP G +  +D    PQ G G P  + +    G  P    S
Sbjct:  1773 GQGQGQAGHQQRESVHGQRGRPQGPS--QDSSRQPQAGQGQPSQSGSGRSPGRSPVHPES 1830

Query:   183 AYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG 242
             +     S  P R +     G G+   +G      +  S    +G    P++     P  G
Sbjct:  1831 SEGEEHSVVPQRHSES---GHGHGQGQGQAGHQQRE-SVHGQRGRPQGPSQDSSRQPQAG 1886

Query:   243 PGYDAQKGSNYDAQRGPNY-DIHRGPSYD--PQRGLGYDMQRGPNYDMQRGPGYETQR-- 297
              G  +Q GS    +R P + +   G  +   PQR  G     G  +   +G     QR  
Sbjct:  1887 QGQPSQSGSGRSPRRSPVHPESSEGEEHSVVPQRHSG----SGHGHGQGQGQAGHQQRES 1942

Query:   298 VPGYDVQ-RGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSY-DPSRGTGFDGAPR- 354
             V G  V+ +GP   +Q + S  PQ   G   Q G G   RR+P + + S G      P+ 
Sbjct:  1943 VHGQPVRPQGP---SQDSSSQ-PQASQGQPSQSGSGRSPRRSPVHPESSEGEEHSVVPQR 1998

Query:   355 -GAAPHG 360
                + HG
Sbjct:  1999 HSGSGHG 2005

 Score = 124 (48.7 bits), Expect = 0.00079, P = 0.00079
 Identities = 61/245 (24%), Positives = 89/245 (36%)

Query:   128 GSYGGATGNSENET----SGRPVGQNAYEDGYGVPQ-GHGPPPSATTAGVVGAGPNTSTS 182
             G   G  G+ + E+     GRP G    +D    PQ G G P  + +       P    S
Sbjct:  1459 GQGQGQAGHQQRESVHGQRGRPQGPT--QDSSRQPQAGQGQPSQSGSGRSPRRSPVHPES 1516

Query:   183 AYAATQSGTPMRAAYDIPRGPGYEASKGPGYDA-SKAPSYDPTKGPSYDPAKGPGYDPTK 241
             +     S  P R +     G G+   +G G     +  S    +G    P++     P  
Sbjct:  1517 SEGEEHSVVPQRHSGS---GHGHGHGQGQGQAGHQQRESVHGQRGRPQGPSQDSSRQPQA 1573

Query:   242 GPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGY 301
             G G  +Q GS    +R P   +H   S   +  +      G  +   +G     QR   +
Sbjct:  1574 GQGQPSQSGSGRSPRRSP---VHPESSEGEEHSVVPQRYSGSGHGHGQGQAGHQQRESVH 1630

Query:   302 DVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSY-DPSRGTGFDGAPR---GAA 357
               QRG      +  S  PQ G G   Q G G   RR+P + + S G      P+   G+ 
Sbjct:  1631 G-QRGRPQGPSQDSSRQPQAGQGQPSQSGSGRSPRRSPVHPESSEGEEHSVIPQRHSGSG 1689

Query:   358 -PHGQ 361
               HGQ
Sbjct:  1690 HSHGQ 1694


>UNIPROTKB|F1P7J0 [details] [associations]
            symbol:SFPQ "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0000166 "nucleotide binding" evidence=IEA] InterPro:IPR000504
            InterPro:IPR012677 Pfam:PF00076 PROSITE:PS50102 SMART:SM00360
            GO:GO:0000166 Gene3D:3.30.70.330 GO:GO:0003676 InterPro:IPR012975
            Pfam:PF08075 GeneTree:ENSGT00390000005004 OMA:APGGHPK
            EMBL:AAEX03009659 EMBL:AAEX03009658 Ensembl:ENSCAFT00000005784
            Uniprot:F1P7J0
        Length = 659

 Score = 118 (46.6 bits), Expect = 0.00081, P = 0.00081
 Identities = 55/163 (33%), Positives = 57/163 (34%)

Query:   158 PQGHGPPPSATTAGVVGAGPNTSTSAYAATQ--SGTPMRAAYDIPRGPGYEASKGPGYDA 215
             PQG GP P       VG+ P  S SA  AT   SG P       P  P    S  PG   
Sbjct:    68 PQGPGPAPG------VGSAPPASGSAPPATPPTSGAPAGPG-PTPTPPPAVTSAPPGAPP 120

Query:   216 SKAPSYD-PTK-----GPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQR----GPNYDIHR 265
                PS   PT      GP   PA GPG  P +GPG    KG           GP      
Sbjct:   121 PAPPSSGVPTTPPQAGGPPPPPAGGPGPGPKQGPGPGGPKGGKMPGGPKPGGGPGLSTPG 180

Query:   266 GPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPV 308
             G    P RG G      P    Q  P Y  Q   G     GPV
Sbjct:   181 GHPKPPHRGGGE-----PRGGRQHHPPYHQQHHQG-PPPGGPV 217


>UNIPROTKB|I3LJD1 [details] [associations]
            symbol:ZMIZ1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008270 "zinc ion binding" evidence=IEA]
            InterPro:IPR004181 Pfam:PF02891 PROSITE:PS51044 GO:GO:0008270
            Gene3D:3.30.40.10 InterPro:IPR013083 GeneTree:ENSGT00550000074410
            EMBL:CT827949 EMBL:CT827837 Ensembl:ENSSSCT00000025452
            Uniprot:I3LJD1
        Length = 1021

 Score = 120 (47.3 bits), Expect = 0.00084, P = 0.00084
 Identities = 65/232 (28%), Positives = 86/232 (37%)

Query:   176 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYD-ASKAPSYDPT--KGP--SYD 230
             GP  S+     TQ+          PRGP   AS G   + AS A    P+   GP    +
Sbjct:   265 GPVCSSFQMGPTQAYNSQFMNQPGPRGP---ASMGGSMNPASMAAGMTPSGMSGPPMGMN 321

Query:   231 PAKGPGYDP--TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ 288
               + PG  P  T G     Q       Q  P   I R    +P  G   + Q GPN    
Sbjct:   322 QPRPPGISPFGTHGQRMPQQTYPGPRPQSLPIQSIKRPYPGEPNYG---NQQYGPNSQFP 378

Query:   289 RGPG-YETQRVPGYDVQRGPVYEAQRAPSYIPQRG--PGYDLQRGQGYDMRRAPSYDPS- 344
               PG Y T   P       P Y  QR PS  P  G  P   +  GQ Y   +    + + 
Sbjct:   379 TQPGQYPTPNPPR--PLTSPNYPGQRMPSQ-PSTGQYPPPTVNMGQYYKPEQFNGQNNTF 435

Query:   345 RGTGFDGAPRGAAPHGQVPPPLNNVPY----GSATPPARSGSGQPRGGNPAR 392
              G+ +    +G       P P+ N P+    G+ TPP   GS  P   +P++
Sbjct:   436 SGSSYSNYSQGNVNRPPRPVPVANYPHSPVPGNPTPPMTPGSSIPPYLSPSQ 487


>UNIPROTKB|E1BI98 [details] [associations]
            symbol:COL6A1 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0071230 "cellular response to amino acid stimulus"
            evidence=IEA] [GO:0070208 "protein heterotrimerization"
            evidence=IEA] [GO:0048407 "platelet-derived growth factor binding"
            evidence=IEA] [GO:0043234 "protein complex" evidence=IEA]
            [GO:0042383 "sarcolemma" evidence=IEA] [GO:0031012 "extracellular
            matrix" evidence=IEA] [GO:0005576 "extracellular region"
            evidence=IEA] InterPro:IPR002035 Pfam:PF00092 PROSITE:PS50234
            SMART:SM00327 GO:GO:0005576 GO:GO:0043234 GO:GO:0042383
            GO:GO:0070208 InterPro:IPR008160 Pfam:PF01391 GO:GO:0031012
            GO:GO:0071230 CTD:1291 KO:K06238 OMA:VKENYAE
            GeneTree:ENSGT00530000063022 EMBL:DAAA02003502 IPI:IPI00713573
            RefSeq:NP_001137337.1 UniGene:Bt.23508 PRIDE:E1BI98
            Ensembl:ENSBTAT00000015668 GeneID:511422 KEGG:bta:511422
            NextBio:20869920 Uniprot:E1BI98
        Length = 1027

 Score = 120 (47.3 bits), Expect = 0.00085, P = 0.00084
 Identities = 67/206 (32%), Positives = 85/206 (41%)

Query:   200 PRG-PGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYDAQ 256
             PRG PGYE  +G PG    K  + DP +     P    GY   KG  G   +KGS     
Sbjct:   261 PRGDPGYEGERGKPGLPGEKGEAGDPGRPGDLGPV---GYQGMKGEKGSRGEKGS----- 312

Query:   257 RGPN-YDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRA 314
             RGP  Y   +G     +RG+ G D  +G        PG +    PG+D  +GP       
Sbjct:   313 RGPKGYKGEKG-----KRGMDGVDGMKGET-GFPGLPGCKGS--PGFDGIQGP------- 357

Query:   315 PSYIPQRGPG-YDL--QRGQ-GYDMR--RAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNN 368
             P   P+  PG + L  Q+G+ G D    R  S  P    G  G P      G+     N 
Sbjct:   358 PG--PKGDPGAFGLKGQKGEPGADGEPGRPGSTGPPGDEGEPGEPGPPGEKGEAGDEGNA 415

Query:   369 VPYGSATPPARSGSGQ--PRGGNPAR 392
              P G+  P  R G G+  PRG   AR
Sbjct:   416 GPDGA--PGERGGPGERGPRGTPGAR 439


>DICTYBASE|DDB_G0292870 [details] [associations]
            symbol:pex13 "peroxin 13" species:44689
            "Dictyostelium discoideum" [GO:0016560 "protein import into
            peroxisome matrix, docking" evidence=IEA;ISS] [GO:0016021 "integral
            to membrane" evidence=IEA] [GO:0005777 "peroxisome" evidence=IEA]
            [GO:0003674 "molecular_function" evidence=ND] [GO:0005779 "integral
            to peroxisomal membrane" evidence=ISS] [GO:0016020 "membrane"
            evidence=IEA] [GO:0015031 "protein transport" evidence=IEA]
            [GO:0006810 "transport" evidence=IEA] [GO:0005778 "peroxisomal
            membrane" evidence=IEA] Pfam:PF00018 InterPro:IPR001452
            InterPro:IPR007223 Pfam:PF04088 PROSITE:PS50002 SMART:SM00326
            dictyBase:DDB_G0292870 GenomeReviews:CM000155_GR EMBL:AAFI02000197
            GO:GO:0005779 SUPFAM:SSF50044 HSSP:Q64010 GO:GO:0016560
            RefSeq:XP_629403.1 ProteinModelPortal:Q54CL3
            EnsemblProtists:DDB0238077 GeneID:8628922 KEGG:ddi:DDB_G0292870
            eggNOG:NOG312130 OMA:SWMEALH Uniprot:Q54CL3
        Length = 570

 Score = 117 (46.2 bits), Expect = 0.00085, P = 0.00085
 Identities = 44/135 (32%), Positives = 55/135 (40%)

Query:   129 SYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQ 188
             S+GG  G S    S    G   Y D Y    G+G     ++ G  G+G    +S Y    
Sbjct:    90 SFGGGVGGSSGYRSSYGGG---YRDSYS-SGGYGSSGYGSSYGSGGSG-GYGSSLYGG-- 142

Query:   189 SGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPS-YDPAKGPGYDPTKGPGYDA 247
              G      Y    G GY +S G GY +S    Y  + G S Y  + G GY  + G GY  
Sbjct:   143 -GGYSSGGYG---GSGYGSSYGGGYGSSYGSGYGSSYGGSGYGSSYGGGYGSSYGGGYGG 198

Query:   248 QKGSNYDAQRGPNYD 262
               G  Y  QRG  YD
Sbjct:   199 GYGGGY-GQRG--YD 210


>UNIPROTKB|F1RGP4 [details] [associations]
            symbol:PYGO2 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0060070 "canonical Wnt receptor signaling pathway"
            evidence=IEA] [GO:0060021 "palate development" evidence=IEA]
            [GO:0051569 "regulation of histone H3-K4 methylation" evidence=IEA]
            [GO:0048589 "developmental growth" evidence=IEA] [GO:0042393
            "histone binding" evidence=IEA] [GO:0033599 "regulation of mammary
            gland epithelial cell proliferation" evidence=IEA] [GO:0030879
            "mammary gland development" evidence=IEA] [GO:0009791
            "post-embryonic development" evidence=IEA] [GO:0007420 "brain
            development" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            [GO:0002088 "lens development in camera-type eye" evidence=IEA]
            [GO:0001822 "kidney development" evidence=IEA] [GO:0001701 "in
            utero embryonic development" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] InterPro:IPR001965 InterPro:IPR019787
            Pfam:PF00628 PROSITE:PS50016 SMART:SM00249 GO:GO:0005634
            GO:GO:0007420 GO:GO:0046872 GO:GO:0008270 GO:GO:0001701
            GO:GO:0009791 GO:GO:0001822 Gene3D:3.30.40.10 InterPro:IPR011011
            InterPro:IPR013083 SUPFAM:SSF57903 GO:GO:0048589 InterPro:IPR019786
            PROSITE:PS01359 GO:GO:0060021 GO:GO:0060070 GO:GO:0030879
            GO:GO:0033599 GO:GO:0051569 GO:GO:0002088
            GeneTree:ENSGT00530000063948 CTD:90780 OMA:PGLVYPC EMBL:CU207227
            RefSeq:NP_001172104.1 UniGene:Ssc.4680 Ensembl:ENSSSCT00000007162
            GeneID:100157530 KEGG:ssc:100157530 Uniprot:F1RGP4
        Length = 406

 Score = 115 (45.5 bits), Expect = 0.00085, P = 0.00085
 Identities = 77/294 (26%), Positives = 107/294 (36%)

Query:   117 MNAPNVDRRADGSYGGATGN-SENETSGRP-----VGQNAYEDGYGVPQGHGPPPSATTA 170
             M +P   RR   + G A  + +E      P     V  N +ED +G P+  G  P    +
Sbjct:    38 MKSPEKKRRKSNTQGPAYSHLTEFAPPPTPMVDHLVASNPFEDDFGAPKVGGAAPPFLGS 97

Query:   171 GVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTK-GPSY 229
              +   G           Q G     A  +P G G     GP     + P + P   GP++
Sbjct:    98 PIPFGG--------FRVQGGM----AGQVPPGYGTAGGGGPQPLRRQPPPFPPNPMGPAF 145

Query:   230 D-PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYD-PQRGLGY----DMQRGP 283
             + P +GPGY P     + +Q    ++   G N+    G     P  G G      M + P
Sbjct:   146 NMPPQGPGYPPPGNMNFPSQP---FNQPLGQNFSPPGGQMMPGPVGGFGPMISPTMGQPP 202

Query:   284 NYDMQRGPGYETQRV--PGYDVQRGPVYE-AQRAPSYIPQRGP--GYD--LQRGQGYDMR 336
               ++  GP    QR   PG      P+    Q  PS  P   P  G D       G D  
Sbjct:   203 RGEL--GPPSLPQRFAQPGAPFGPSPLQRPGQGLPSLPPNTSPFPGPDPGFPAPGGEDGG 260

Query:   337 RAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNP 390
             + P  +P   T F   P   +P   V     N P   + PP+ SG G   GG P
Sbjct:   261 K-P-LNPPAPTAFPQEPHSGSPAAAVN---GNQP---SFPPSSSGRG---GGTP 303


>TAIR|locus:2036224 [details] [associations]
            symbol:AT1G15830 "AT1G15830" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] EMBL:CP002684
            GenomeReviews:CT485782_GR EMBL:AC034256 IPI:IPI00522201
            RefSeq:NP_173035.1 UniGene:At.41914 PaxDb:Q3EDB7 PRIDE:Q3EDB7
            EnsemblPlants:AT1G15830.1 GeneID:838153 KEGG:ath:AT1G15830
            TAIR:At1g15830 eggNOG:NOG303006 HOGENOM:HOG000131777
            InParanoid:Q3EDB7 OMA:VMQGCGG ProtClustDB:CLSN2912688
            Genevestigator:Q3EDB7 Uniprot:Q3EDB7
        Length = 483

 Score = 116 (45.9 bits), Expect = 0.00086, P = 0.00086
 Identities = 72/251 (28%), Positives = 89/251 (35%)

Query:   124 RRADGSYGG---ATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAG---VVGAGP 177
             RR  G  GG        EN  SG   G +A   G G P   G PP     G   + GA P
Sbjct:    65 RRKTGDGGGDPVVISGGENHASGGMGGTSATRGGGGEPVIPGAPPPNRGGGETVIPGAPP 124

Query:   178 NTSTSAYAATQSGTP--MRAAYDIPRGPGYEASKGPGYDASKAPSYDPTK-GPSYDPAKG 234
                         G P   R     P  PG    K  G      P   P K G   +P   
Sbjct:   125 PIRGGGGEPAIPGAPPPKRGGGGEPVIPGAPPPKRGGGGEPVIPGAPPPKRGGGGEPVI- 183

Query:   235 PGYDPTK--GPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDM-QRGP 291
             PG  P K  G G     G+    + G    +  G    P+RG G +    P   + +RG 
Sbjct:   184 PGAPPPKRGGGGEPVIPGAPPPKRGGGGEPVIPGAP-PPKRGGGGEPVI-PGAPLPKRGG 241

Query:   292 GYETQRVPGYDVQR--GPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGF 349
             G E+  VPG    +  G V       +  P RG G D   G+G + R     D   G G 
Sbjct:   242 GGESV-VPGAPPPKRGGGVIVNGGCETVPPGRGGGGDKTNGRGGEGREE---DNGGGRGA 297

Query:   350 DGAPRGAAPHG 360
             +G  RG+   G
Sbjct:   298 EGGGRGSTGEG 308


>UNIPROTKB|F1NDF5 [details] [associations]
            symbol:COL4A5 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005587 "collagen type IV"
            evidence=IEA] [GO:0005605 "basal lamina" evidence=IEA] [GO:0007528
            "neuromuscular junction development" evidence=IEA] [GO:0031594
            "neuromuscular junction" evidence=IEA] InterPro:IPR001442
            Pfam:PF01413 PROSITE:PS51403 SMART:SM00111 InterPro:IPR016187
            SUPFAM:SSF56436 GO:GO:0031594 GO:GO:0005605 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 GO:GO:0005587 Gene3D:2.170.240.10
            GeneTree:ENSGT00690000101772 OMA:MPMNMEP EMBL:AADN02013568
            EMBL:AADN02013569 EMBL:AADN02013570 EMBL:AADN02013571
            EMBL:AADN02013572 IPI:IPI00583230 Ensembl:ENSGALT00000013221
            ArrayExpress:F1NDF5 Uniprot:F1NDF5
        Length = 1658

 Score = 122 (48.0 bits), Expect = 0.00088, P = 0.00088
 Identities = 80/270 (29%), Positives = 101/270 (37%)

Query:   132 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGT 191
             G  G+    + G P G    +   G+P   G P S   AG  G  P    S  A  + G 
Sbjct:  1030 GEKGDPGLSSIGIP-GLPGPKGDLGLPGYPGSPGSKGIAGNPGL-PGLPGSPGAKGEPGL 1087

Query:   192 P-MRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTK-GPSYDPA-KG-PGYDPTKGP-GY 245
             P       IP   G E   G PG      P  D  + GP   P  KG PG D   GP G 
Sbjct:  1088 PGFPGTPGIPGPKGIEGPPGNPGLPGPPGPVGDTGRPGPPGPPGEKGQPGRDGIPGPAGQ 1147

Query:   246 DAQKGSNYDAQRGPNYDIHRG-PSYDPQRG-LGYDMQRGP-NYDMQRG-PGYE----TQR 297
               + G     + GP      G P    Q+G LG     GP      +G PG++     Q 
Sbjct:  1148 KGEPGLPGFGRPGPP-----GLPGLSGQKGELGLPGPPGPPGLPGLKGEPGFQGFPGLQG 1202

Query:   298 VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAA 357
              PG     GP  E  +  S  P   PG   + G+G  +  +P  +  RG    G  +G  
Sbjct:  1203 PPGPPGLPGPPLEGPKG-SPGPPGVPG---RPGKGM-IHGSPGPEGPRGPPGSGGLKGEK 1257

Query:   358 PH-GQVPPPLNNVPYGSATPPARSGS-GQP 385
              + GQ  PP      G   PP R G  G+P
Sbjct:  1258 GNPGQPGPPGLTGQKGDQGPPGRQGDPGRP 1287


>UNIPROTKB|E2RGZ0 [details] [associations]
            symbol:ZMIZ1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0048844 "artery morphogenesis" evidence=IEA]
            [GO:0048589 "developmental growth" evidence=IEA] [GO:0048146
            "positive regulation of fibroblast proliferation" evidence=IEA]
            [GO:0045944 "positive regulation of transcription from RNA
            polymerase II promoter" evidence=IEA] [GO:0007569 "cell aging"
            evidence=IEA] [GO:0007296 "vitellogenesis" evidence=IEA]
            [GO:0003007 "heart morphogenesis" evidence=IEA] [GO:0001701 "in
            utero embryonic development" evidence=IEA] [GO:0001570
            "vasculogenesis" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR004181 Pfam:PF02891 PROSITE:PS51044
            GO:GO:0008270 Gene3D:3.30.40.10 InterPro:IPR013083
            GeneTree:ENSGT00550000074410 OMA:MNQYGPM EMBL:AAEX03002865
            EMBL:AAEX03002866 EMBL:AAEX03002867 EMBL:AAEX03002868
            EMBL:AAEX03002869 EMBL:AAEX03002870 EMBL:AAEX03002871
            EMBL:AAEX03002872 Ensembl:ENSCAFT00000024855 NextBio:20862292
            Uniprot:E2RGZ0
        Length = 1072

 Score = 120 (47.3 bits), Expect = 0.00089, P = 0.00089
 Identities = 65/232 (28%), Positives = 86/232 (37%)

Query:   176 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYD-ASKAPSYDPT--KGP--SYD 230
             GP  S+     TQ+          PRGP   AS G   + AS A    P+   GP    +
Sbjct:   318 GPVCSSFQMGPTQAYNSQFMNQPGPRGP---ASMGGSMNPASMAAGMTPSGMSGPPMGMN 374

Query:   231 PAKGPGYDP--TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ 288
               + PG  P  T G     Q       Q  P   I R    +P  G   + Q GPN    
Sbjct:   375 QPRPPGISPFGTHGQRMPQQTYPGPRPQSLPIQSIKRPYPGEPNYG---NQQYGPNSQFP 431

Query:   289 RGPG-YETQRVPGYDVQRGPVYEAQRAPSYIPQRG--PGYDLQRGQGYDMRRAPSYDPS- 344
               PG Y T   P       P Y  QR PS  P  G  P   +  GQ Y   +    + + 
Sbjct:   432 TQPGQYPTPNPPR--PLTSPNYPGQRMPSQ-PSTGQYPPPTVNMGQYYKPEQFNGQNNTF 488

Query:   345 RGTGFDGAPRGAAPHGQVPPPLNNVPY----GSATPPARSGSGQPRGGNPAR 392
              G+ +    +G       P P+ N P+    G+ TPP   GS  P   +P++
Sbjct:   489 SGSSYSNYSQGNVNRPPRPVPVANYPHSPVPGNPTPPMTPGSSIPPYLSPSQ 540


>UNIPROTKB|F1S2E4 [details] [associations]
            symbol:ZMIZ1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008270 "zinc ion binding" evidence=IEA]
            InterPro:IPR004181 Pfam:PF02891 PROSITE:PS51044 GO:GO:0008270
            Gene3D:3.30.40.10 InterPro:IPR013083 GeneTree:ENSGT00550000074410
            OMA:MNQYGPM EMBL:CT827949 EMBL:CT827837 Ensembl:ENSSSCT00000011307
            Uniprot:F1S2E4
        Length = 1072

 Score = 120 (47.3 bits), Expect = 0.00089, P = 0.00089
 Identities = 65/232 (28%), Positives = 86/232 (37%)

Query:   176 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYD-ASKAPSYDPT--KGP--SYD 230
             GP  S+     TQ+          PRGP   AS G   + AS A    P+   GP    +
Sbjct:   318 GPVCSSFQMGPTQAYNSQFMNQPGPRGP---ASMGGSMNPASMAAGMTPSGMSGPPMGMN 374

Query:   231 PAKGPGYDP--TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ 288
               + PG  P  T G     Q       Q  P   I R    +P  G   + Q GPN    
Sbjct:   375 QPRPPGISPFGTHGQRMPQQTYPGPRPQSLPIQSIKRPYPGEPNYG---NQQYGPNSQFP 431

Query:   289 RGPG-YETQRVPGYDVQRGPVYEAQRAPSYIPQRG--PGYDLQRGQGYDMRRAPSYDPS- 344
               PG Y T   P       P Y  QR PS  P  G  P   +  GQ Y   +    + + 
Sbjct:   432 TQPGQYPTPNPPR--PLTSPNYPGQRMPSQ-PSTGQYPPPTVNMGQYYKPEQFNGQNNTF 488

Query:   345 RGTGFDGAPRGAAPHGQVPPPLNNVPY----GSATPPARSGSGQPRGGNPAR 392
              G+ +    +G       P P+ N P+    G+ TPP   GS  P   +P++
Sbjct:   489 SGSSYSNYSQGNVNRPPRPVPVANYPHSPVPGNPTPPMTPGSSIPPYLSPSQ 540


>ZFIN|ZDB-GENE-030131-8415 [details] [associations]
            symbol:col1a2 "collagen, type I, alpha 2"
            species:7955 "Danio rerio" [GO:0005201 "extracellular matrix
            structural constituent" evidence=IEA] [GO:0005581 "collagen"
            evidence=IEA] InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078
            PROSITE:PS51461 SMART:SM00038 ZFIN:ZDB-GENE-030131-8415
            GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201
            HOVERGEN:HBG004933 EMBL:AJ318213 IPI:IPI00502653 UniGene:Dr.75575
            STRING:Q90YJ0 PRIDE:Q90YJ0 InParanoid:Q90YJ0 ArrayExpress:Q90YJ0
            Bgee:Q90YJ0 Uniprot:Q90YJ0
        Length = 1352

 Score = 121 (47.7 bits), Expect = 0.00090, P = 0.00090
 Identities = 86/282 (30%), Positives = 107/282 (37%)

Query:   128 GSYGGATGNSENETSGR--PVGQNAYEDGYGVPQGHGPPPSATTAG---VVGA-G-PNTS 180
             G  G A        +GR  P+G      G G P   GPP  A  AG   +VGA G P + 
Sbjct:   387 GPRGAAGTRGLPGLAGRSGPMGMPGPRGGVGAPGARGPPGDAGRAGEAGLVGARGLPGSP 446

Query:   181 TSAYAATQSGTPMRAAYDIPRGP-GYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYD 238
              S+    + G    A  D   GP G    +G PG      P     KGPS +  K PG  
Sbjct:   447 GSSGPPGKEGPSGAAGQDGRTGPPGPTGPRGQPGNIGFPGP-----KGPSGEAGK-PG-- 498

Query:   239 PTKGP-GYDAQKGS-NYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQ 296
               KGP G    +GS   D   GP   +  G +  P    G   ++GP+      PG+  Q
Sbjct:   499 -EKGPVGPTGLRGSPGPDGNNGPAGPV--GLAGAP----GEKGEQGPS----GAPGF--Q 545

Query:   297 RVPGYDVQRGPVYEAQRAPSY-IPQ----RGP-GYDLQRGQGYDMRRAPSYDPSRGTGFD 350
              +PG     GPV EA +     IP      GP G   +RG       A +  P    G  
Sbjct:   546 GLPG---PAGPVGEAGKPGDRGIPGDQGVSGPAGVKGERGNPGPAGAAGAQGPIGARGPS 602

Query:   351 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNP 390
             G P    P G    P    P G+  P   +G    RG  G P
Sbjct:   603 GTP---GPDGNKGEPGAVGPAGAPGPQGAAGMPGERGAAGTP 641


>WB|WBGene00000656 [details] [associations]
            symbol:col-80 species:6239 "Caenorhabditis elegans"
            [GO:0042302 "structural constituent of cuticle" evidence=IEA]
            [GO:0016021 "integral to membrane" evidence=IEA] [GO:0040002
            "collagen and cuticulin-based cuticle development" evidence=IMP]
            InterPro:IPR002486 Pfam:PF01484 SMART:SM01088 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0040002 EMBL:Z46791
            GO:GO:0042302 HOGENOM:HOG000085656 GeneTree:ENSGT00530000064217
            PIR:T19143 RefSeq:NP_496310.1 ProteinModelPortal:Q09456
            DIP:DIP-27389N MINT:MINT-1079432 STRING:Q09456
            EnsemblMetazoa:C09G5.5 GeneID:174652 KEGG:cel:CELE_C09G5.5
            UCSC:C09G5.5 CTD:174652 WormBase:C09G5.5 eggNOG:NOG285871
            InParanoid:Q09456 OMA:VEIHTHH NextBio:884922 Uniprot:Q09456
        Length = 317

 Score = 113 (44.8 bits), Expect = 0.00091, P = 0.00091
 Identities = 41/125 (32%), Positives = 54/125 (43%)

Query:   132 GATGNSENE-TSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQS- 189
             GA GN   +  +G P G  A+  G G P   GP   A + G  GA  N      +  +S 
Sbjct:   143 GAPGNPGPQGPNGNP-GAPAHGGGQGPPGPPGPAGDAGSPGQAGAPGNPGRPGQSGQRSR 201

Query:   190 GTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDA 247
             G P  +    P+GP   A   PG  ++  P+  P   GP+  P   PG D   G PG D 
Sbjct:   202 GLPGPSGRPGPQGPP-GAPGQPGSGSTPGPAGPPGPPGPNGQPGH-PGQDGQPGAPGNDG 259

Query:   248 QKGSN 252
               GS+
Sbjct:   260 APGSD 264


>RGD|621351 [details] [associations]
            symbol:Col1a2 "collagen, type I, alpha 2" species:10116 "Rattus
            norvegicus" [GO:0001501 "skeletal system development" evidence=ISO]
            [GO:0001568 "blood vessel development" evidence=ISO] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            [GO:0005581 "collagen" evidence=IEA;ISO] [GO:0005584 "collagen type
            I" evidence=ISO] [GO:0005615 "extracellular space" evidence=ISO]
            [GO:0007179 "transforming growth factor beta receptor signaling
            pathway" evidence=ISO] [GO:0007266 "Rho protein signal
            transduction" evidence=ISO] [GO:0008217 "regulation of blood
            pressure" evidence=ISO] [GO:0030199 "collagen fibril organization"
            evidence=ISO] [GO:0030674 "protein binding, bridging" evidence=ISO]
            [GO:0042802 "identical protein binding" evidence=ISO] [GO:0043589
            "skin morphogenesis" evidence=ISO] [GO:0046332 "SMAD binding"
            evidence=ISO] [GO:0046872 "metal ion binding" evidence=IEA]
            [GO:0048407 "platelet-derived growth factor binding" evidence=ISO]
            [GO:0070062 "extracellular vesicular exosome" evidence=ISO]
            [GO:0070208 "protein heterotrimerization" evidence=ISO] [GO:0071230
            "cellular response to amino acid stimulus" evidence=ISO]
            [GO:0031012 "extracellular matrix" evidence=ISO] InterPro:IPR000885
            Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461 SMART:SM00038
            RGD:621351 GO:GO:0005615 GO:GO:0046872 GO:GO:0030199 GO:GO:0001501
            GO:GO:0008217 GO:GO:0007179 GO:GO:0007266 GO:GO:0070208
            InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793 GO:GO:0001568
            GO:GO:0071230 GO:GO:0005201 GO:GO:0043589 HOGENOM:HOG000085654
            HOVERGEN:HBG004933 KO:K06236 GO:GO:0005584 PDB:3HQV PDB:3HR2
            PDBsum:3HQV PDBsum:3HR2 Reactome:REACT_150387 CTD:1278
            OrthoDB:EOG412M65 EMBL:AF121217 IPI:IPI00188921 RefSeq:NP_445808.1
            UniGene:Rn.107239 IntAct:P02466 STRING:P02466 PRIDE:P02466
            GeneID:84352 KEGG:rno:84352 UCSC:RGD:621351 InParanoid:P02466
            EvolutionaryTrace:P02466 NextBio:616663 PMAP-CutDB:P02466
            ArrayExpress:P02466 Genevestigator:P02466
            GermOnline:ENSRNOG00000011292 Uniprot:P02466
        Length = 1372

 Score = 121 (47.7 bits), Expect = 0.00092, P = 0.00092
 Identities = 85/284 (29%), Positives = 105/284 (36%)

Query:   132 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTSAYAATQSG 190
             G TG+        P G     DG   P G   PP A   G  G  GP   T  +AA  S 
Sbjct:    33 GPTGDRGPRGQRGPAGPRG-RDGVDGPVGPPGPPGAP--GPPGPPGPPGLTGNFAAQYSD 89

Query:   191 TPMRAAYDI-----PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPA--KGPGYDPTKGP 243
               + A         PRGP   A   PG    + P+ +P +     PA  +GP   P K  
Sbjct:    90 KGVSAGPGPMGLMGPRGPP-GAVGAPGPQGFQGPAGEPGEPGQTGPAGSRGPAGPPGKA- 147

Query:   244 GYDAQKGS-NYDAQRGP-NYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRG-PGYE-TQR 297
             G D   G      +RG       RG    P  GL G+   RG N  D  +G PG +  + 
Sbjct:   148 GEDGHPGKPGRPGERGVVGPQGARGFPGTP--GLPGFKGIRGHNGLDGLKGQPGAQGVKG 205

Query:   298 VPGYDVQRGPVYEAQRAPSYIP-QRG----PGYDLQRGQGYDMRRAPSYDPSRGTGFDGA 352
              PG   + G     Q     +P +RG    PG    RG    +       P    G  G 
Sbjct:   206 EPGAPGENGT--PGQAGARGLPGERGRVGAPGPAGARGSDGSVGPVGPAGPIGSAGPPGF 263

Query:   353 PRGAAPHGQVPPPLNNVPYGSATPPARSG----SGQPRG--GNP 390
             P    P G++ P  N  P G A P   +G    SG P G  GNP
Sbjct:   264 PGAPGPKGELGPVGNPGPAGPAGPRGEAGLPGLSG-PVGPPGNP 306


>UNIPROTKB|F1LS40 [details] [associations]
            symbol:Col1a2 "Collagen alpha-2(I) chain" species:10116
            "Rattus norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 RGD:621351 GO:GO:0005615 GO:GO:0030199 GO:GO:0001501
            GO:GO:0008217 GO:GO:0007179 GO:GO:0007266 GO:GO:0070208
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0001568 GO:GO:0071230
            GO:GO:0005201 GO:GO:0043589 GeneTree:ENSGT00660000095287 KO:K06236
            GO:GO:0005584 IPI:IPI00188921 EMBL:AC107447 RefSeq:XP_003749738.1
            Ensembl:ENSRNOT00000016423 GeneID:100911218 KEGG:rno:100911218
            ArrayExpress:F1LS40 Uniprot:F1LS40
        Length = 1372

 Score = 121 (47.7 bits), Expect = 0.00092, P = 0.00092
 Identities = 85/284 (29%), Positives = 105/284 (36%)

Query:   132 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTSAYAATQSG 190
             G TG+        P G     DG   P G   PP A   G  G  GP   T  +AA  S 
Sbjct:    33 GPTGDRGPRGQRGPAGPRG-RDGVDGPVGPPGPPGAP--GPPGPPGPPGLTGNFAAQYSD 89

Query:   191 TPMRAAYDI-----PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPA--KGPGYDPTKGP 243
               + A         PRGP   A   PG    + P+ +P +     PA  +GP   P K  
Sbjct:    90 KGVSAGPGPMGLMGPRGPP-GAVGAPGPQGFQGPAGEPGEPGQTGPAGSRGPAGPPGKA- 147

Query:   244 GYDAQKGS-NYDAQRGP-NYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRG-PGYE-TQR 297
             G D   G      +RG       RG    P  GL G+   RG N  D  +G PG +  + 
Sbjct:   148 GEDGHPGKPGRPGERGVVGPQGARGFPGTP--GLPGFKGIRGHNGLDGLKGQPGAQGVKG 205

Query:   298 VPGYDVQRGPVYEAQRAPSYIP-QRG----PGYDLQRGQGYDMRRAPSYDPSRGTGFDGA 352
              PG   + G     Q     +P +RG    PG    RG    +       P    G  G 
Sbjct:   206 EPGAPGENGT--PGQAGARGLPGERGRVGAPGPAGARGSDGSVGPVGPAGPIGSAGPPGF 263

Query:   353 PRGAAPHGQVPPPLNNVPYGSATPPARSG----SGQPRG--GNP 390
             P    P G++ P  N  P G A P   +G    SG P G  GNP
Sbjct:   264 PGAPGPKGELGPVGNPGPAGPAGPRGEAGLPGLSG-PVGPPGNP 306


>UNIPROTKB|G4MTN4 [details] [associations]
            symbol:MGG_07193 "Uncharacterized protein" species:242507
            "Magnaporthe oryzae 70-15" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR001202 Pfam:PF00397 PROSITE:PS01159 PROSITE:PS50020
            SMART:SM00456 Gene3D:2.20.70.10 SUPFAM:SSF51045 EMBL:CM001232
            RefSeq:XP_003715399.1 ProteinModelPortal:G4MTN4
            EnsemblFungi:MGG_07193T0 GeneID:2683176 KEGG:mgr:MGG_07193
            Uniprot:G4MTN4
        Length = 366

 Score = 114 (45.2 bits), Expect = 0.00092, P = 0.00092
 Identities = 70/248 (28%), Positives = 92/248 (37%)

Query:   163 PPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYD 222
             PPP  T  G    GP  S  A A++ + TP     D+ + P YE    P    + A    
Sbjct:    53 PPPPPTGDGAPD-GPPPSYQASASSATATPT----DVKKNP-YETE--PAASPNPAGVGG 104

Query:   223 PTKGPSYDPAKGP--GYDPTKGPG-YDAQKGSNYDAQ-RGPNYDIHRGPSYDPQRGLGYD 278
              + GP+  P   P  G  P        AQ  +  DA+ RG   + + G    P  G G+ 
Sbjct:   105 SSSGPAPPPVNSPRPGDPPVSDDAKLAAQMQAEEDARARGSGGNPNYGGG-SPAPGQGFP 163

Query:   279 MQRGPNYDMQR--------GPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG 330
              Q     +  +        G G ++Q    Y    GP+ + Q      PQ   GY    G
Sbjct:   164 NQLPARQERSKSGILGKLFGKGKQSQTAH-YGA--GPLPQQQYQQ---PQHQQGYP---G 214

Query:   331 QGYDMRRAPSYDPSRGTGFD-GAPRGAAP-HG---QVP---PPLNNVPYGSATPPARSGS 382
              GY      S  P  G G+  GAP    P +G   Q P   PP    PYG      + G 
Sbjct:   215 AGYQQGAPYSPQPGYGGGYQQGAPYSPQPGYGGGYQQPGYGPPPG--PYGQPGYGPQPGY 272

Query:   383 GQPRGGNP 390
             G P  G P
Sbjct:   273 GHPPYGQP 280


>WB|WBGene00000618 [details] [associations]
            symbol:col-41 species:6239 "Caenorhabditis elegans"
            [GO:0042302 "structural constituent of cuticle" evidence=IEA]
            [GO:0016021 "integral to membrane" evidence=IEA] InterPro:IPR002486
            Pfam:PF01484 SMART:SM01088 InterPro:IPR008160 Pfam:PF01391
            eggNOG:NOG12793 GO:GO:0042302 HOGENOM:HOG000085656
            GeneTree:ENSGT00530000064674 EMBL:Z72514 PIR:T24769
            RefSeq:NP_510522.1 ProteinModelPortal:Q22369 IntAct:Q22369
            MINT:MINT-213826 STRING:Q22369 PaxDb:Q22369 EnsemblMetazoa:T10B10.1
            GeneID:181610 KEGG:cel:CELE_T10B10.1 UCSC:T10B10.1 CTD:181610
            WormBase:T10B10.1 InParanoid:Q22369 OMA:CSIGHIV NextBio:914648
            Uniprot:Q22369
        Length = 428

 Score = 115 (45.5 bits), Expect = 0.00093, P = 0.00093
 Identities = 80/292 (27%), Positives = 100/292 (34%)

Query:   120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNT 179
             P++ R     Y G   N ++ + G P G        G     G P      G  G    T
Sbjct:    70 PSLLRNKRFVYPGMC-NCDSNSQGCPAGAPGPPGNPGKRGDEGHPGDEGRRGASGISLAT 128

Query:   180 STSAYAAT------QSGTPMRAAYDIPRG-PGYEASKGP-GYDASKAPSYDPTKGPSYDP 231
             +              +G P       P G PG +   GP G D   AP  +   G   + 
Sbjct:   129 THDIPGGCIKCPEGPAGPPGPDGDSGPEGFPGLQGQSGPSGEDG--APGQEGAPGDQGE- 185

Query:   232 AKGP-GYDPTKGPGYDAQKGSNY-DAQRG-PNYDIHRG-PSYDPQRGL-GYDMQRGPNYD 286
              +GP GYD T GP  D Q G+ Y   Q G P      G P    Q G  G D + GP   
Sbjct:   186 -QGPKGYDGTDGP--DGQPGTTYFPGQAGQPGEPGWLGEPGLPGQHGEPGKDGEEGP--- 239

Query:   287 MQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS------ 340
              Q  PG  T    G+D   G   +A + P   P +   Y     Q  D R  PS      
Sbjct:   240 -QGAPG--TPGNAGHDAFPGTPGQAGK-PG-APGKDANY-CPCPQRQDDRTPPSSGTSAP 293

Query:   341 YDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 392
               P RG+    AP   AP     PP    P  +   P  +    P    P R
Sbjct:   294 QPPPRGS--TAAPGTRAPPATRAPPATRAPPATTRAPPATTRPAPASQPPVR 343


>TAIR|locus:2154679 [details] [associations]
            symbol:ENODL1 "early nodulin-like protein 1" species:3702
            "Arabidopsis thaliana" [GO:0005507 "copper ion binding"
            evidence=IEA;ISS] [GO:0005886 "plasma membrane" evidence=ISM;IDA]
            [GO:0009055 "electron carrier activity" evidence=IEA] [GO:0031225
            "anchored to membrane" evidence=TAS] InterPro:IPR003245
            Pfam:PF02298 ProDom:PD003122 PROSITE:PS51485 GO:GO:0005886
            EMBL:CP002688 GO:GO:0009055 GO:GO:0031225 GO:GO:0005507
            EMBL:AB007644 Gene3D:2.60.40.420 InterPro:IPR008972 SUPFAM:SSF49503
            ProtClustDB:CLSN2915882 HSSP:P29602 EMBL:BT026028 IPI:IPI00530005
            RefSeq:NP_200198.1 UniGene:At.49170 ProteinModelPortal:Q9FN39
            SMR:Q9FN39 PRIDE:Q9FN39 EnsemblPlants:AT5G53870.1 GeneID:835468
            KEGG:ath:AT5G53870 TAIR:At5g53870 InParanoid:Q9FN39 OMA:AHAPSHS
            Genevestigator:Q9FN39 Uniprot:Q9FN39
        Length = 370

 Score = 114 (45.2 bits), Expect = 0.00094, P = 0.00094
 Identities = 30/114 (26%), Positives = 47/114 (41%)

Query:   158 PQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASK 217
             P    PP S+ +     A P  S+S  + T + +P  A    P  P   + K P    S 
Sbjct:   166 PSKSQPPRSSVSP----AQPPKSSSPISHTPALSPSHATSHSPATPS-PSPKSPS-PVSH 219

Query:   218 APSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDP 271
             +PS+ P   PS+ PA  P + P   P +      ++     P++     PS+ P
Sbjct:   220 SPSHSPAHTPSHSPAHTPSHSPAHAPSHSPAHAPSHSPAHAPSHSPAHSPSHSP 273


>UNIPROTKB|E2RA46 [details] [associations]
            symbol:EWSR1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0005622 "intracellular" evidence=IEA] [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0000166 "nucleotide binding"
            evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358
            PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360 SMART:SM00547
            GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
            GO:GO:0005622 GeneTree:ENSGT00530000063105 EMBL:AAEX03014786
            EMBL:AAEX03014787 Ensembl:ENSCAFT00000019364 Uniprot:E2RA46
        Length = 619

 Score = 117 (46.2 bits), Expect = 0.00095, P = 0.00095
 Identities = 73/278 (26%), Positives = 99/278 (35%)

Query:   128 GSYGGATGNSENETSGRPV-GQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 180
             G+YG  T  S  +       GQ AY   YG P  G+  P  P A +  V G G    +T+
Sbjct:    43 GTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPAGYTTPTAPQAYSQPVQGYGTGAYDTT 102

Query:   181 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG-YD 238
             T+    TQ+    ++AY   P  P Y   + P   A   P        +  P    G Y+
Sbjct:   103 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPARPQDGNKPAETSQPQSSTGGYN 160

Query:   239 -PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
              P+ G G   Q   +Y    G  P   +   PSY P     Y   +  +YD        T
Sbjct:   161 QPSLGYG---QSNYSYPQVPGSYPMQPVTAPPSYPPT---SYSSTQPTSYDQSSYSQQNT 214

Query:   296 QRVPG-YDVQRG----PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFD 350
                P  Y  Q        Y  Q   SY PQ G  Y     Q Y  +++ SY        D
Sbjct:   215 YGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTG-SYSQAPSQ-YS-QQSSSYGQQSSFRQD 271

Query:   351 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 388
               P     +GQ     +  P  + +       G+ RGG
Sbjct:   272 H-PSSMGVYGQESGGFSG-PGENRSMSGPDNRGRGRGG 307


>UNIPROTKB|F1PBJ4 [details] [associations]
            symbol:FUS "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0000166 "nucleotide binding" evidence=IEA] InterPro:IPR000504
            InterPro:IPR001876 InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641
            PROSITE:PS01358 PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360
            SMART:SM00547 GO:GO:0005634 GO:GO:0005737 GO:GO:0000166
            GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
            GeneTree:ENSGT00530000063105 OMA:YGTQSTP EMBL:AAEX03004378
            EMBL:AAEX03004379 Ensembl:ENSCAFT00000026694 Uniprot:F1PBJ4
        Length = 517

 Score = 116 (45.9 bits), Expect = 0.00095, P = 0.00095
 Identities = 46/168 (27%), Positives = 64/168 (38%)

Query:   128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAAT 187
             G+Y    G   ++ S +P GQ +Y  GYG          ++     G   NT     +  
Sbjct:    15 GAYPTQPGQGYSQQSNQPYGQQSYS-GYGQSADTSGYGQSSYGSSYGQTQNTGYGTQSTP 73

Query:   188 QSGTPMRAAYDIPRGP--GY-EASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPG 244
             Q G      Y   +G    Y + S  PGY    APS   T G     ++  GY   +  G
Sbjct:    74 Q-GYGSTGGYGSSQGSQSSYGQQSSYPGYGQQPAPS--STSGSYGSGSQSSGYGQPQSGG 130

Query:   245 YDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPG 292
             Y  Q G  Y  Q+   Y   +  SY+P +G G   Q    Y+   G G
Sbjct:   131 YGQQSG--YSGQQ-QGYGQQQS-SYNPPQGYGQQNQ----YNSSSGGG 170


>ZFIN|ZDB-GENE-040426-2801 [details] [associations]
            symbol:ssbp3b "single stranded DNA binding protein
            3b" species:7955 "Danio rerio" [GO:0003677 "DNA binding"
            evidence=IEA] [GO:0003697 "single-stranded DNA binding"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            InterPro:IPR006594 InterPro:IPR007591 InterPro:IPR008116
            Pfam:PF04503 PRINTS:PR01743 PROSITE:PS50896 SMART:SM00667
            ZFIN:ZDB-GENE-040426-2801 GO:GO:0005634 GO:GO:0003697
            eggNOG:NOG245801 HOGENOM:HOG000037785 PANTHER:PTHR12610
            GeneTree:ENSGT00390000009187 EMBL:CR847832 EMBL:GQ903695
            IPI:IPI00920092 UniGene:Dr.77852 STRING:D0EWT5
            Ensembl:ENSDART00000121984 Uniprot:D0EWT5
        Length = 373

 Score = 114 (45.2 bits), Expect = 0.00095, P = 0.00095
 Identities = 60/197 (30%), Positives = 81/197 (41%)

Query:   206 EASKGPGYDASKAPSYDPTKGPSYDPAKG-PGYDPTKGPGY-DAQKGSNYDAQRGPNYDI 263
             EA     Y A+ APS  P  G +  P  G PG  P   PG+     GS       P  + 
Sbjct:    85 EAKAFHDYSAAAAPS--PVLG-NMPPGDGMPG-GPMP-PGFFQGPPGSQASPHAPPPPNS 139

Query:   264 HRGPSYDPQRGLGYDMQRGPNYDMQRG---PG---YETQRVPGYDVQ-RGPVYEAQRAPS 316
               GP   P     +    GP   ++ G   PG        +P  D + +GP+ +    P 
Sbjct:   140 MMGPHGQPFMSPRFG--GGPRPPIRMGNQPPGGVPAAQPMLPNMDPRLQGPM-QRMNVPR 196

Query:   317 YIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVP---PPLNNVPYGS 373
              +   GPG    +G G  MR  P ++ S G G  G   G   +G+ P   P  NN+PY S
Sbjct:   197 GMGPMGPG---PQGFGGGMR--PPHN-SMGPGMPGVNMGPG-NGRPPWPNPNANNMPYSS 249

Query:   374 ATPPARSGSGQPRGGNP 390
              +P A  G   P+GG P
Sbjct:   250 PSPGAYGG---PQGGGP 263


>UNIPROTKB|Q9XSJ7 [details] [associations]
            symbol:COL1A1 "Collagen alpha-1(I) chain" species:9615
            "Canis lupus familiaris" [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0046872 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793 GO:GO:0005201
            CTD:1277 HOGENOM:HOG000085654 HOVERGEN:HBG004933 KO:K06236
            OrthoDB:EOG4S4PHP EMBL:AF153062 RefSeq:NP_001003090.1
            UniGene:Cfa.100 STRING:Q9XSJ7 GeneID:403651 KEGG:cfa:403651
            InParanoid:Q9XSJ7 NextBio:20817156 Uniprot:Q9XSJ7
        Length = 1460

 Score = 121 (47.7 bits), Expect = 0.00098, P = 0.00098
 Identities = 88/285 (30%), Positives = 107/285 (37%)

Query:   126 ADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNTSTS 182
             ADG  G  G  G++  +    P G  A   G   P G+ G P      G   AGP  +T 
Sbjct:   815 ADGQPGAKGEPGDAGAKGDAGPPGP-AGPTGPPGPIGNVGAPGPKGARG--SAGPPGATG 871

Query:   183 -AYAATQSGTPMRAAYDIPRGPGYEASK--GPGYDASKAPSYDPTK-GPSYDPA----KG 234
                AA + G P  +    P GP   A K  G G      P+  P + GP   P     KG
Sbjct:   872 FPGAAGRVGPPGPSGNAGPPGPPGPAGKEGGKGARGETGPAGRPGEVGPPGPPGPAGEKG 931

Query:   235 -PGYD-P-----TKGP-GYDAQKGS-NYDAQRGPN-YDIHRGPSYDP-QRG-LGYDMQRG 282
              PG D P     T GP G   Q+G      QRG   +    GPS +P ++G  G   +RG
Sbjct:   932 SPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQGPSGTSGERG 991

Query:   283 PNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYD 342
             P   M  GP       PG     GP  E+ R  S   +  PG D   G   D        
Sbjct:   992 PPGPM--GP-------PGL---AGPPGESGREGSPGAEGSPGRDGSPGPKGDRGETGPAG 1039

Query:   343 PSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 387
             P    G  GAP    P G+        P G A P    G+  P G
Sbjct:  1040 PPGAPGAPGAPGPVGPAGKNGDRGETGPAGPAGPIGPVGARGPAG 1084

 Score = 121 (47.7 bits), Expect = 0.00098, P = 0.00098
 Identities = 81/290 (27%), Positives = 101/290 (34%)

Query:   119 APNVDRRADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAG 176
             A   D+   G  G  G TG         P G++      G+P   GPP      G  G G
Sbjct:    96 ASPTDQETTGVEGPKGDTGPRGPRGPAGPPGRDGIPGQPGLP---GPPGPPGPPGPPGLG 152

Query:   177 PNTSTS-AYAATQSGTPMRAAYDIPRGP-GYEASKG-PGYDASKAPSYDPTKGPSYDPAK 233
              N +   +Y   +  T       +P GP G    +G PG     AP     +GP  +P +
Sbjct:   153 GNFAPQMSYGYDEKST---GGISVP-GPMGPSGPRGLPGPPG--APGPQGFQGPPGEPGE 206

Query:   234 GPGYDPTKGP-GYDAQKGSNYD-AQRG-PNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQR 289
              PG     GP G     G N D  + G P     RGP   PQ   G     G P     R
Sbjct:   207 -PGASGPMGPRGPPGPPGKNGDDGEAGKPGRPGERGPP-GPQGARGLPGTAGLPGMKGHR 264

Query:   290 GPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPSRGTG 348
             G       + G     GP    +  P    + G PG    RG   +  R  +  P+   G
Sbjct:   265 G----FSGLDGAKGDAGPA-GPKGEPGSPGENGAPGQMGPRGLPGERGRPGAPGPAGARG 319

Query:   349 FDGAPRGAAPHGQV----PPPLNNV--PYGSATPPARSGSGQPRG--GNP 390
              DGA   A P G      PP         G A P    GS  P+G  G P
Sbjct:   320 NDGATGAAGPPGPTGPAGPPGFPGAVGAKGEAGPQGARGSEGPQGVRGEP 369


>UNIPROTKB|F1Q3I5 [details] [associations]
            symbol:COL1A1 "Collagen alpha-1(I) chain" species:9615
            "Canis lupus familiaris" [GO:0005581 "collagen" evidence=IEA]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA] InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093
            Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184
            PROSITE:PS51461 SMART:SM00038 SMART:SM00214 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201
            GeneTree:ENSGT00660000095287 OMA:VAYMDQQ EMBL:AAEX03006535
            Ensembl:ENSCAFT00000026953 Uniprot:F1Q3I5
        Length = 1464

 Score = 121 (47.7 bits), Expect = 0.00099, P = 0.00099
 Identities = 88/285 (30%), Positives = 107/285 (37%)

Query:   126 ADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNTSTS 182
             ADG  G  G  G++  +    P G  A   G   P G+ G P      G   AGP  +T 
Sbjct:   819 ADGQPGAKGEPGDAGAKGDAGPPGP-AGPTGPPGPIGNVGAPGPKGARG--SAGPPGATG 875

Query:   183 -AYAATQSGTPMRAAYDIPRGPGYEASK--GPGYDASKAPSYDPTK-GPSYDPA----KG 234
                AA + G P  +    P GP   A K  G G      P+  P + GP   P     KG
Sbjct:   876 FPGAAGRVGPPGPSGNAGPPGPPGPAGKEGGKGARGETGPAGRPGEVGPPGPPGPAGEKG 935

Query:   235 -PGYD-P-----TKGP-GYDAQKGS-NYDAQRGPN-YDIHRGPSYDP-QRG-LGYDMQRG 282
              PG D P     T GP G   Q+G      QRG   +    GPS +P ++G  G   +RG
Sbjct:   936 SPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQGPSGASGERG 995

Query:   283 PNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYD 342
             P   M  GP       PG     GP  E+ R  S   +  PG D   G   D        
Sbjct:   996 PPGPM--GP-------PGL---AGPPGESGREGSPGAEGSPGRDGSPGPKGDRGETGPAG 1043

Query:   343 PSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 387
             P    G  GAP    P G+        P G A P    G+  P G
Sbjct:  1044 PPGAPGAPGAPGPVGPAGKNGDRGETGPAGPAGPIGPVGARGPAG 1088

 Score = 121 (47.7 bits), Expect = 0.00099, P = 0.00099
 Identities = 81/290 (27%), Positives = 101/290 (34%)

Query:   119 APNVDRRADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAG 176
             A   D+   G  G  G TG         P G++      G+P   GPP      G  G G
Sbjct:   100 ASPTDQETTGVEGPKGDTGPRGPRGPAGPPGRDGIPGQPGLP---GPPGPPGPPGPPGLG 156

Query:   177 PNTSTS-AYAATQSGTPMRAAYDIPRGP-GYEASKG-PGYDASKAPSYDPTKGPSYDPAK 233
              N +   +Y   +  T       +P GP G    +G PG     AP     +GP  +P +
Sbjct:   157 GNFAPQMSYGYDEKST---GGISVP-GPMGPSGPRGLPGPPG--APGPQGFQGPPGEPGE 210

Query:   234 GPGYDPTKGP-GYDAQKGSNYD-AQRG-PNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQR 289
              PG     GP G     G N D  + G P     RGP   PQ   G     G P     R
Sbjct:   211 -PGASGPMGPRGPPGPPGKNGDDGEAGKPGRPGERGPP-GPQGARGLPGTAGLPGMKGHR 268

Query:   290 GPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPSRGTG 348
             G       + G     GP    +  P    + G PG    RG   +  R  +  P+   G
Sbjct:   269 G----FSGLDGAKGDAGPA-GPKGEPGSPGENGAPGQMGPRGLPGERGRPGAPGPAGARG 323

Query:   349 FDGAPRGAAPHGQV----PPPLNNV--PYGSATPPARSGSGQPRG--GNP 390
              DGA   A P G      PP         G A P    GS  P+G  G P
Sbjct:   324 NDGATGAAGPPGPTGPAGPPGFPGAVGAKGEAGPQGARGSEGPQGVRGEP 373


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.315   0.136   0.431    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      393       393   0.00095  117 3  11 23  0.47    34
                                                     34  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  239
  No. of states in DFA:  603 (64 KB)
  Total size of DFA:  266 KB (2137 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  45.18u 0.11s 45.29t   Elapsed:  00:00:02
  Total cpu time:  45.25u 0.11s 45.36t   Elapsed:  00:00:02
  Start:  Tue May 21 05:03:37 2013   End:  Tue May 21 05:03:39 2013
WARNINGS ISSUED:  1

Back to top