BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>010679
MGSKGRIPPPHLRRPPPGPGMMHPDPFVSGMRPPMPGAFPPFDMMPPPEVMEQKIASQHV
EMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQMRNLTEKIAKMEA
ELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQQIPALLSELESL
RQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRRAADGS
YGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQS
GTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQK
GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVY
EAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNV
PYGSATPPARSGSGQPRGGNPARR

High Scoring Gene Products

Symbol | Full name | Information | P value
AT1G67170 |  | protein from Arabidopsis thaliana | 9.9e-87
AT3G14750 |  | protein from Arabidopsis thaliana | 2.9e-39
AT1G55170 |  | protein from Arabidopsis thaliana | 6.9e-31
AT5G61920 |  | protein from Arabidopsis thaliana | 2.2e-25
Vml | Vitelline membrane-like | protein from Drosophila melanogaster | 3.4e-22
AT2G30120 |  | protein from Arabidopsis thaliana | 1.6e-17
eif3a | Eukaryotic translation initiation factor 3 subunit A | protein from Xenopus laevis | 9.1e-15
eif3a | Eukaryotic translation initiation factor 3 subunit A | protein from Xenopus (Silurana) tropicalis | 1.3e-14
LOC100518332 | Uncharacterized protein | protein from Sus scrofa | 1.4e-13
POLR2A | DNA-directed RNA polymerase II subunit RPB1 | protein from Cricetulus griseus | 6.0e-10
RPO21 | RNA polymerase II largest subunit B220 | gene from Saccharomyces cerevisiae | 7.6e-10
TAF15 | Uncharacterized protein | protein from Canis lupus familiaris | 1.2e-09
prc | pericardin | protein from Drosophila melanogaster | 3.8e-09
T17H7.1 |  | gene from Caenorhabditis elegans | 4.1e-09
fhaA | FHA domain-containing protein FhaA | protein from Mycobacterium tuberculosis | 1.2e-08
TAF15 | TATA-binding protein-associated factor 2N | protein from Homo sapiens | 1.5e-08
K02E11.10 |  | gene from Caenorhabditis elegans | 4.4e-08
cbpP | calcium-binding protein | gene from Dictyostelium discoideum | 5.8e-08
CG30203 |  | protein from Drosophila melanogaster | 9.9e-08
spt-5 |  | gene from Caenorhabditis elegans | 1.1e-07
spt-5 | Transcription elongation factor SPT5 | protein from Caenorhabditis elegans | 1.1e-07
let-2 |  | gene from Caenorhabditis elegans | 1.3e-07
let-2 | Collagen alpha-2(IV) chain | protein from Caenorhabditis elegans | 1.3e-07
Krtap6-2 | keratin associated protein 6-2 | protein from Mus musculus | 1.9e-07
ama-1 |  | gene from Caenorhabditis elegans | 2.3e-07
ama-1 | DNA-directed RNA polymerase II subunit RPB1 | protein from Caenorhabditis elegans | 2.3e-07
ego-2 |  | gene from Caenorhabditis elegans | 2.4e-07
arid1ab | AT rich interactive domain 1Ab (SWI-like) | gene_product from Danio rerio | 3.4e-07
COL4A4 | Collagen alpha-4(IV) chain | protein from Homo sapiens | 5.5e-07
COL4A4 | Collagen alpha-4(IV) chain | protein from Homo sapiens | 5.6e-07
swsn-1 |  | gene from Caenorhabditis elegans | 6.0e-07
CG7185 |  | protein from Drosophila melanogaster | 6.6e-07
AT1G33680 |  | protein from Arabidopsis thaliana | 9.7e-07
I3LQ53 | Uncharacterized protein | protein from Sus scrofa | 1.2e-06
COL1A1 | Collagen alpha-1(I) chain | protein from Gallus gallus | 1.3e-06
D3ZZM1 | Uncharacterized protein | protein from Rattus norvegicus | 1.5e-06
AT1G10390 |  | protein from Arabidopsis thaliana | 1.8e-06
Taf15 | TAF15 RNA polymerase II, TATA box binding protein (TBP)-associated factor | gene from Rattus norvegicus | 2.0e-06
PPP1R10 | Serine/threonine-protein phosphatase 1 regulatory subunit 10 | protein from Homo sapiens | 2.0e-06
SFPQ | Uncharacterized protein | protein from Gallus gallus | 2.0e-06
ewsr1b | Ewing sarcoma breakpoint region 1b | gene_product from Danio rerio | 2.3e-06
COL3A1 | Collagen alpha-1(III) chain | protein from Gallus gallus | 2.3e-06
zgc:172323 |  | gene_product from Danio rerio | 2.3e-06
COL4A4 | Uncharacterized protein | protein from Nomascus leucogenys | 2.5e-06
gho | ghost | protein from Drosophila melanogaster | 3.2e-06
PPP1R10 | Serine/threonine-protein phosphatase 1 regulatory subunit 10 | protein from Pan troglodytes | 3.4e-06
PPP1R10 | Serine/threonine-protein phosphatase 1 regulatory subunit 10 | protein from Macaca mulatta | 3.4e-06
COL7A1 | Uncharacterized protein | protein from Sus scrofa | 3.6e-06
Ldb3 | LIM domain binding 3 | protein from Mus musculus | 3.9e-06
LDB3 | LIM domain-binding protein 3 | protein from Homo sapiens | 4.0e-06
EGK_04858 | Putative uncharacterized protein | protein from Macaca mulatta | 4.1e-06
EGM_04376 | Putative uncharacterized protein | protein from Macaca fascicularis | 4.1e-06
COL3A1 | Collagen alpha-1(III) chain | protein from Bos taurus | 5.0e-06
EWSR1 | Ewing sarcoma breakpoint region 1, isoform CRA_e | protein from Homo sapiens | 5.0e-06
MGG_04961 | Uncharacterized protein | protein from Magnaporthe oryzae 70-15 | 5.2e-06
PPP1R10 | Uncharacterized protein | protein from Canis lupus familiaris | 5.5e-06
osa |  | protein from Drosophila melanogaster | 6.2e-06
COL3A1 | Collagen alpha-1(III) chain | protein from Bos taurus | 7.4e-06
COL5A1 | Uncharacterized protein | protein from Canis lupus familiaris | 9.0e-06
TFG | Uncharacterized protein | protein from Gallus gallus | 9.0e-06
Col11a1 | collagen, type XI, alpha 1 | gene from Rattus norvegicus | 9.3e-06
Col11a1 | Collagen alpha-1(XI) chain | protein from Rattus norvegicus | 9.3e-06
AT3G07030 |  | protein from Arabidopsis thaliana | 9.4e-06
AT2G25970 |  | protein from Arabidopsis thaliana | 1.1e-05
COL4A2 | Collagen alpha-2(IV) chain | protein from Bos taurus | 1.1e-05
COL5A1 | Uncharacterized protein | protein from Canis lupus familiaris | 1.2e-05
Krtap21-1 | keratin associated protein 21-1 | protein from Mus musculus | 1.3e-05
eif3s10 | eukaryotic translation initiation factor 3, subunit 10 (theta) | gene_product from Danio rerio | 1.3e-05
COL4A5 | Uncharacterized protein | protein from Bos taurus | 1.4e-05
COL3A1 | Uncharacterized protein | protein from Sus scrofa | 1.6e-05
TAF15 | TATA-binding protein-associated factor 2N | protein from Homo sapiens | 1.7e-05
col-51 |  | gene from Caenorhabditis elegans | 1.8e-05
bli-1 |  | gene from Caenorhabditis elegans | 2.0e-05
COL3A1 | Uncharacterized protein | protein from Canis lupus familiaris | 2.0e-05
COL3A1 | Collagen alpha-1(III) chain | protein from Gallus gallus | 2.2e-05
COL5A2 | Uncharacterized protein | protein from Sus scrofa | 2.2e-05
FUS | RNA-binding protein FUS | protein from Bos taurus | 2.3e-05
EWSR1 | Uncharacterized protein | protein from Sus scrofa | 2.5e-05
E2RS29 | Uncharacterized protein | protein from Canis lupus familiaris | 2.5e-05
CTAGE8 | Cutaneous T-cell lymphoma-associated antigen 8 | protein from Homo sapiens | 2.5e-05
fus | fusion (involved in t(12;16) in malignant liposarcoma) | gene_product from Danio rerio | 2.5e-05
col11a1b | collagen, type XI, alpha 1b | gene_product from Danio rerio | 2.5e-05
COL5A2 | Uncharacterized protein | protein from Bos taurus | 2.6e-05
COL5A2 | Uncharacterized protein | protein from Canis lupus familiaris | 2.6e-05
pygo2 | pygopus homolog 2 (Drosophila) | gene_product from Danio rerio | 2.7e-05
CROCC | Uncharacterized protein | protein from Canis lupus familiaris | 3.1e-05
CROCC | Uncharacterized protein | protein from Canis lupus familiaris | 3.1e-05
PPP1R10 | Serine/threonine-protein phosphatase 1 regulatory subunit 10 | protein from Sus scrofa | 3.1e-05
EIF3A | Uncharacterized protein | protein from Sus scrofa | 3.3e-05
col-103 |  | gene from Caenorhabditis elegans | 3.7e-05
RPO21 |  | gene_product from Candida albicans | 4.0e-05
RPO21 | DNA-directed RNA polymerase | protein from Candida albicans SC5314 | 4.0e-05
Col3a1 | collagen, type III, alpha 1 | protein from Mus musculus | 4.2e-05
COL3A1 | Uncharacterized protein | protein from Canis lupus familiaris | 4.2e-05
swsn-1 | SWI3-like protein | protein from Caenorhabditis elegans | 5.4e-05
Ccdc88b | coiled-coil domain containing 88B | protein from Mus musculus | 5.5e-05

The BLAST search returned 4 gene products which did not match your query constraints. Please see the full BLAST report below for the details.

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  010679
        (504 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2033681 - symbol:AT1G67170 "AT1G67170" species...   867  9.9e-87   1
TAIR|locus:2089616 - symbol:AT3G14750 "AT3G14750" species...   419  2.9e-39   1
TAIR|locus:2035751 - symbol:AT1G55170 "AT1G55170" species...   340  6.9e-31   1
TAIR|locus:2156146 - symbol:AT5G61920 "AT5G61920" species...   292  2.2e-25   1
FB|FBgn0085362 - symbol:Vml "Vitelline membrane-like" spe...   284  3.4e-22   1
TAIR|locus:2060848 - symbol:AT2G30120 species:3702 "Arabi...   221  1.6e-17   1
UNIPROTKB|A2VD00 - symbol:eif3a "Eukaryotic translation i...   195  9.1e-15   2
UNIPROTKB|A4II09 - symbol:eif3a "Eukaryotic translation i...   186  1.3e-14   2
UNIPROTKB|F1S187 - symbol:LOC100518332 "Uncharacterized p...   204  1.4e-13   1
UNIPROTKB|P11414 - symbol:POLR2A "DNA-directed RNA polyme...   173  6.0e-10   1
SGD|S000002299 - symbol:RPO21 "RNA polymerase II largest ...   191  7.6e-10   2
UNIPROTKB|F1PB61 - symbol:TAF15 "Uncharacterized protein"...   155  1.2e-09   2
FB|FBgn0028573 - symbol:prc "pericardin" species:7227 "Dr...   173  3.8e-09   1
WB|WBGene00020550 - symbol:T17H7.1 species:6239 "Caenorha...   168  4.1e-09   1
UNIPROTKB|P71590 - symbol:fhaA "FHA domain-containing pro...   162  1.2e-08   1
UNIPROTKB|Q92804 - symbol:TAF15 "TATA-binding protein-ass...   162  1.5e-08   1
WB|WBGene00044109 - symbol:K02E11.10 species:6239 "Caenor...   154  4.4e-08   1
DICTYBASE|DDB_G0277909 - symbol:cbpP "calcium-binding pro...   155  5.8e-08   1
FB|FBgn0050203 - symbol:CG30203 species:7227 "Drosophila ...   157  9.9e-08   1
WB|WBGene00005015 - symbol:spt-5 species:6239 "Caenorhabd...   158  1.1e-07   1
UNIPROTKB|Q21338 - symbol:spt-5 "Transcription elongation...   158  1.1e-07   1
WB|WBGene00002280 - symbol:let-2 species:6239 "Caenorhabd...   159  1.3e-07   1
UNIPROTKB|P17140 - symbol:let-2 "Collagen alpha-2(IV) cha...   159  1.3e-07   1
MGI|MGI:1330280 - symbol:Krtap6-2 "keratin associated pro...   128  1.9e-07   1
WB|WBGene00000123 - symbol:ama-1 species:6239 "Caenorhabd...   157  2.3e-07   1
UNIPROTKB|P16356 - symbol:ama-1 "DNA-directed RNA polymer...   157  2.3e-07   1
WB|WBGene00001215 - symbol:ego-2 species:6239 "Caenorhabd...   136  2.4e-07   2
ZFIN|ZDB-GENE-030131-5725 - symbol:arid1ab "AT rich inter...   157  3.4e-07   2
UNIPROTKB|J3KNM7 - symbol:COL4A4 "Collagen alpha-4(IV) ch...   153  5.5e-07   1
UNIPROTKB|P53420 - symbol:COL4A4 "Collagen alpha-4(IV) ch...   153  5.6e-07   1
WB|WBGene00004203 - symbol:swsn-1 species:6239 "Caenorhab...   149  6.0e-07   1
UNIPROTKB|D4ADB1 - symbol:D4ADB1 "Uncharacterized protein...   148  6.3e-07   1
FB|FBgn0035872 - symbol:CG7185 species:7227 "Drosophila m...   141  6.6e-07   2
TAIR|locus:2012713 - symbol:AT1G33680 "AT1G33680" species...   144  9.7e-07   2
UNIPROTKB|I3LQ53 - symbol:I3LQ53 "Uncharacterized protein...   144  1.2e-06   1
UNIPROTKB|P02457 - symbol:COL1A1 "Collagen alpha-1(I) cha...   149  1.3e-06   1
UNIPROTKB|D3ZZM1 - symbol:Taf15 "Protein Taf15" species:1...   119  1.5e-06   2
TAIR|locus:2012788 - symbol:AT1G10390 "AT1G10390" species...   146  1.8e-06   1
RGD|1309595 - symbol:Taf15 "TAF15 RNA polymerase II, TATA...   118  2.0e-06   2
UNIPROTKB|Q96QC0 - symbol:PPP1R10 "Serine/threonine-prote...   145  2.0e-06   1
UNIPROTKB|F1P555 - symbol:SFPQ "Uncharacterized protein" ...   143  2.0e-06   1
ZFIN|ZDB-GENE-030131-1600 - symbol:ewsr1b "Ewing sarcoma ...   147  2.3e-06   2
UNIPROTKB|P12105 - symbol:COL3A1 "Collagen alpha-1(III) c...   146  2.3e-06   1
ZFIN|ZDB-GENE-080204-113 - symbol:zgc:172323 "zgc:172323"...   144  2.3e-06   1
UNIPROTKB|G1RSL2 - symbol:COL4A4 "Uncharacterized protein...   147  2.5e-06   1
FB|FBgn0262126 - symbol:gho "ghost" species:7227 "Drosoph...   135  3.2e-06   2
UNIPROTKB|Q7YR38 - symbol:PPP1R10 "Serine/threonine-prote...   143  3.4e-06   1
UNIPROTKB|Q5TM61 - symbol:PPP1R10 "Serine/threonine-prote...   143  3.4e-06   1
UNIPROTKB|F1SKM1 - symbol:COL7A1 "Uncharacterized protein...   148  3.6e-06   1
MGI|MGI:1344412 - symbol:Ldb3 "LIM domain binding 3" spec...   141  3.9e-06   1
UNIPROTKB|O75112 - symbol:LDB3 "LIM domain-binding protei...   141  4.0e-06   1
UNIPROTKB|G7N928 - symbol:EGK_04858 "Putative uncharacter...   145  4.1e-06   1
UNIPROTKB|G7PK77 - symbol:EGM_04376 "Putative uncharacter...   145  4.1e-06   1
UNIPROTKB|P04258 - symbol:COL3A1 "Collagen alpha-1(III) c...   142  5.0e-06   1
UNIPROTKB|C9JGE3 - symbol:EWSR1 "Ewing sarcoma breakpoint...   127  5.0e-06   2
UNIPROTKB|G4N3H5 - symbol:MGG_04961 "Uncharacterized prot...   139  5.2e-06   1
UNIPROTKB|E2R2K8 - symbol:PPP1R10 "Uncharacterized protei...   141  5.5e-06   1
FB|FBgn0261885 - symbol:osa "osa" species:7227 "Drosophil...   153  6.2e-06   2
UNIPROTKB|F1MXS8 - symbol:COL3A1 "Collagen alpha-1(III) c...   142  7.4e-06   1
UNIPROTKB|F1LRJ1 - symbol:Col4a3 "Protein Col4a3" species...   142  8.6e-06   1
UNIPROTKB|J9P8F7 - symbol:COL5A1 "Uncharacterized protein...   141  9.0e-06   1
UNIPROTKB|E1C0T1 - symbol:TFG "Uncharacterized protein" s...   134  9.0e-06   1
UNIPROTKB|F1LLX1 - symbol:Col11a1 "Collagen alpha-1(XI) c...   142  9.3e-06   1
RGD|2372 - symbol:Col11a1 "collagen, type XI, alpha 1" sp...   142  9.3e-06   1
UNIPROTKB|P20909 - symbol:Col11a1 "Collagen alpha-1(XI) c...   142  9.3e-06   1
TAIR|locus:2077547 - symbol:AT3G07030 species:3702 "Arabi...   134  9.4e-06   1
TAIR|locus:2043530 - symbol:AT2G25970 "AT2G25970" species...   139  1.1e-05   2
UNIPROTKB|F1N7Q7 - symbol:COL4A2 "Collagen alpha-2(IV) ch...   141  1.1e-05   1
UNIPROTKB|F1PHX8 - symbol:COL5A1 "Uncharacterized protein...   141  1.2e-05   1
MGI|MGI:2157767 - symbol:Krtap21-1 "keratin associated pr...   111  1.3e-05   1
ZFIN|ZDB-GENE-030131-5726 - symbol:eif3s10 "eukaryotic tr...   139  1.3e-05   1
UNIPROTKB|F1N474 - symbol:COL4A5 "Uncharacterized protein...   140  1.4e-05   1
UNIPROTKB|F1RYI8 - symbol:COL3A1 "Uncharacterized protein...   139  1.6e-05   1
UNIPROTKB|K7EKB2 - symbol:TAF15 "TATA-binding protein-ass...   125  1.7e-05   1
WB|WBGene00000628 - symbol:col-51 species:6239 "Caenorhab...   132  1.8e-05   1
WB|WBGene00000251 - symbol:bli-1 species:6239 "Caenorhabd...   136  2.0e-05   1
UNIPROTKB|J9P0L0 - symbol:COL3A1 "Uncharacterized protein...   138  2.0e-05   1
UNIPROTKB|F1NI73 - symbol:COL3A1 "Collagen alpha-1(III) c...   137  2.2e-05   1
UNIPROTKB|F1RXW0 - symbol:COL5A2 "Uncharacterized protein...   137  2.2e-05   1
UNIPROTKB|Q28009 - symbol:FUS "RNA-binding protein FUS" s...   132  2.3e-05   1
UNIPROTKB|F1RFI8 - symbol:EWSR1 "Uncharacterized protein"...   121  2.5e-05   2
UNIPROTKB|E2RS29 - symbol:E2RS29 "Uncharacterized protein...   132  2.5e-05   1
UNIPROTKB|P0CG41 - symbol:CTAGE8 "Cutaneous T-cell lympho...   134  2.5e-05   1
ZFIN|ZDB-GENE-040426-1010 - symbol:fus "fusion (involved ...   132  2.5e-05   1
ZFIN|ZDB-GENE-070912-607 - symbol:col11a1b "collagen, typ...   138  2.5e-05   1
UNIPROTKB|F1N2Y2 - symbol:COL5A2 "Uncharacterized protein...   137  2.6e-05   1
UNIPROTKB|F1PG08 - symbol:COL5A2 "Uncharacterized protein...   137  2.6e-05   1
ZFIN|ZDB-GENE-050809-108 - symbol:pygo2 "pygopus homolog ...   132  2.7e-05   1
UNIPROTKB|J9P8I1 - symbol:CROCC "Uncharacterized protein"...   116  3.1e-05   2
UNIPROTKB|F1Q2C0 - symbol:CROCC "Uncharacterized protein"...   116  3.1e-05   2
UNIPROTKB|Q767K9 - symbol:PPP1R10 "Serine/threonine-prote...   134  3.1e-05   1
UNIPROTKB|F1S4P6 - symbol:EIF3A "Uncharacterized protein"...   101  3.3e-05   2
WB|WBGene00000677 - symbol:col-103 species:6239 "Caenorha...   128  3.7e-05   1
CGD|CAL0000919 - symbol:RPO21 species:5476 "Candida albic...   136  4.0e-05   1
UNIPROTKB|Q5ACI7 - symbol:RPO21 "DNA-directed RNA polymer...   136  4.0e-05   1
TAIR|locus:4010713902 - symbol:AT4G22505 species:3702 "Ar...   130  4.0e-05   1
MGI|MGI:88453 - symbol:Col3a1 "collagen, type III, alpha ...   135  4.2e-05   1
UNIPROTKB|F1PG69 - symbol:COL3A1 "Uncharacterized protein...   135  4.2e-05   1
UNIPROTKB|G5EF87 - symbol:swsn-1 "SWI3-like protein" spec...   131  5.4e-05   1
MGI|MGI:1925567 - symbol:Ccdc88b "coiled-coil domain cont...   134  5.5e-05   1

WARNING:  Descriptions of 141 database sequences were not reported due to the
          limiting value of parameter V = 100.
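
The one-line hit summaries above share a layout: a database identifier, a truncated description, then three trailing columns (high score, smallest sum probability P(N), and N). The following is a minimal sketch for extracting those fields, assuming the last three whitespace-separated tokens are always score, P(N), and N, and that the identifier is separated from the description by " - ". The function name `parse_hit_line` is hypothetical and is not part of any BLAST tool.

```python
def parse_hit_line(line):
    """Split one hit-summary line into its fields.

    Assumes the layout seen in this report:
    '<db_id> - <description>   <score>  <P(N)>   <N>'.
    Returns None for lines that do not fit (headers, warnings, blanks).
    """
    parts = line.rsplit(None, 3)  # peel off score, P(N), N from the right
    if len(parts) != 4:
        return None
    head, score, p_value, n = parts
    if " - " not in head:
        return None
    db_id, description = head.split(" - ", 1)
    try:
        return {
            "db_id": db_id.strip(),
            "description": description.strip(),
            "score": int(score),
            "p_value": float(p_value),
            "n": int(n),
        }
    except ValueError:
        # Trailing tokens were not numeric, so this is not a hit line.
        return None


if __name__ == "__main__":
    example = ('TAIR|locus:2033681 - symbol:AT1G67170 "AT1G67170" species...'
               '   867  9.9e-87   1')
    print(parse_hit_line(example))
```

Splitting from the right keeps the sketch independent of the description text, which is truncated with "..." in the report and may itself contain spaces and commas.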


>TAIR|locus:2033681
            symbol:AT1G67170 "AT1G67170" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] EMBL:CP002684 EMBL:BT005883
            EMBL:AK228253 IPI:IPI00547288 RefSeq:NP_176888.2 UniGene:At.35681
            ProteinModelPortal:Q84TD8 SMR:Q84TD8 IntAct:Q84TD8 PRIDE:Q84TD8
            EnsemblPlants:AT1G67170.1 GeneID:843037 KEGG:ath:AT1G67170
            TAIR:At1g67170 HOGENOM:HOG000005883 InParanoid:Q84TD8 OMA:MESKGRI
            PhylomeDB:Q84TD8 ProtClustDB:CLSN2918424 Genevestigator:Q84TD8
            Uniprot:Q84TD8
        Length = 359

 Score = 867 (310.3 bits), Expect = 9.9e-87, P = 9.9e-87
 Identities = 189/333 (56%), Positives = 229/333 (68%)

Query:    30 GMRPPMP--GAFPPFDMMPPPEVMEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQ 87
             G  PP    G +P F+M+PPPEVMEQK  +QH E+Q+LA ENQRL  THG+LRQELAAAQ
Sbjct:    35 GAIPPSAAQGVYPSFNMLPPPEVMEQKFVAQHGELQRLAIENQRLGGTHGSLRQELAAAQ 94

Query:    88 HELQILHGQIGGMKSERELQMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVARE 147
             HE+Q+LH QIG MKSERE +M  L EK+AKME EL+ +E VKLE Q+++ EA++LVVARE
Sbjct:    95 HEIQMLHAQIGSMKSEREQRMMGLAEKVAKMETELQKSEAVKLEMQQARAEARSLVVARE 154

Query:   148 ELIAKVHQLTQDLQRAHTDVQQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQV 207
             EL++KVHQLTQ+LQ++ +DVQQIPAL+SELE+LRQEY  CR TY+YEKKFYNDHLESLQ 
Sbjct:   155 ELMSKVHQLTQELQKSRSDVQQIPALMSELENLRQEYQQCRATYDYEKKFYNDHLESLQA 214

Query:   208 MEKNYITMATEVEKLRAELMNAPNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYG 267
             MEKNY+TMA EVEKL+A+LMN  N DRRA  G YG    N+E + SG   G   YED +G
Sbjct:   215 MEKNYMTMAREVEKLQAQLMNNANSDRRAG-GPYGNNI-NAEIDASGHQSGNGYYEDAFG 272

Query:   268 VPQGHGPPPSATTAGVVGAGPNTSTSA----Y-AATQSGT-PMRAAYDIPRGPGYEASKG 321
              PQG+ P P A  A     GPN+   A    Y   TQ G  P R  Y+ PRGP    S  
Sbjct:   273 -PQGYIPQPVAGNA----TGPNSVVGAAQYPYQGVTQPGYFPQRPGYNFPRGP--PGSYD 325

Query:   322 PGYDASKAPSYDP-TKGPSYD-PAKGPGYDPTK 352
             P       P   P   GPS + P  G   +P++
Sbjct:   326 PTTRLPTGPYGAPFPPGPSNNTPYAGTHGNPSR 358
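
Each alignment opens with a "Score = ..." line and an "Identities = ..." line. The sketch below reads both and recomputes the percentages from the raw counts. It assumes the exact wording of those two lines as they appear in this report, and the name `parse_hsp_header` is hypothetical. In the alignments shown here the parenthesised percentages match the floor of the ratio (189/333 is reported as 56%, not 57%), so the sketch uses integer division. Whether the program truncates in every case is an assumption.

```python
import re

# Patterns follow the wording of the two header lines in this report.
SCORE_RE = re.compile(
    r"Score = (\d+) \(([\d.]+) bits\), "
    r"Expect = ([0-9.eE+-]+), P = ([0-9.eE+-]+)"
)
COUNT_RE = re.compile(r"(Identities|Positives) = (\d+)/(\d+) \((\d+)%\)")


def parse_hsp_header(score_line, counts_line):
    """Return score statistics and identity/positive counts for one HSP."""
    m = SCORE_RE.search(score_line)
    if m is None:
        raise ValueError("not a Score line: %r" % score_line)
    hsp = {
        "score": int(m.group(1)),
        "bits": float(m.group(2)),
        "expect": float(m.group(3)),
        "p_value": float(m.group(4)),
    }
    for label, matched, length, percent in COUNT_RE.findall(counts_line):
        matched, length = int(matched), int(length)
        hsp[label.lower()] = {
            "matched": matched,
            "length": length,
            "reported_percent": int(percent),
            # Floor of the ratio, computed with integer arithmetic.
            "floor_percent": 100 * matched // length,
        }
    return hsp


if __name__ == "__main__":
    print(parse_hsp_header(
        " Score = 867 (310.3 bits), Expect = 9.9e-87, P = 9.9e-87",
        " Identities = 189/333 (56%), Positives = 229/333 (68%)",
    ))
```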


>TAIR|locus:2089616
            symbol:AT3G14750 "AT3G14750" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] [GO:0048573 "photoperiodism, flowering" evidence=RCA]
            EMBL:CP002686 EMBL:AY035083 EMBL:AY051034 IPI:IPI00544941
            RefSeq:NP_566492.1 UniGene:At.20367 ProteinModelPortal:Q93V84
            SMR:Q93V84 PaxDb:Q93V84 PRIDE:Q93V84 EnsemblPlants:AT3G14750.1
            GeneID:820703 KEGG:ath:AT3G14750 TAIR:At3g14750 eggNOG:NOG236769
            HOGENOM:HOG000242815 InParanoid:Q93V84 OMA:YAENYEH PhylomeDB:Q93V84
            ProtClustDB:CLSN2688383 ArrayExpress:Q93V84 Genevestigator:Q93V84
            Uniprot:Q93V84
        Length = 331

 Score = 419 (152.6 bits), Expect = 2.9e-39, P = 2.9e-39
 Identities = 96/252 (38%), Positives = 146/252 (57%)

Query:    45 MPPP-EVMEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQ-ILHGQIGGMKS 102
             +PP   ++E ++A+Q+ ++Q L  +NQRLAATH  L+QEL  AQHELQ I+H  I  +++
Sbjct:    63 LPPQFSILEDRLAAQNQDVQGLLADNQRLAATHVALKQELEVAQHELQRIMH-YIDSLRA 121

Query:   103 ERELQMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQR 162
             E E+ MR + +K  + E EL+  + ++ E QK + + +     R+EL ++VH +TQDL R
Sbjct:   122 EEEIMMREMYDKSMRSEMELREVDAMRAEIQKIRADIKEFTSGRQELTSQVHLMTQDLAR 181

Query:   163 AHTDVQQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKL 222
                D+QQIP L +E+E+ +QE    R   +YEKK Y ++ E  ++ME   + MA E+EKL
Sbjct:   182 LTADLQQIPTLTAEIENTKQELQRARAAIDYEKKGYAENYEHGKIMEHKLVAMARELEKL 241

Query:   223 RAELMNAP-NVDRRAADGSYGGAT--GNSENETSGRPVGQNAYEDGYGV-PQ-----GHG 273
             RAE+ N+  +       G+ GG    G   N  +G PV  N Y+  Y + P      G+ 
Sbjct:   242 RAEIANSETSAYANGPVGNPGGVAYGGGYGNPEAGYPV--NPYQPNYTMNPAQTGVVGYY 299

Query:   274 PPPSATTAGVVG 285
             PPP    A   G
Sbjct:   300 PPPYGPQAAWAG 311


>TAIR|locus:2035751
            symbol:AT1G55170 "AT1G55170" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
            "nucleus" evidence=ISM] [GO:0008150 "biological_process"
            evidence=ND] EMBL:CP002684 GenomeReviews:CT485782_GR EMBL:AC073944
            EMBL:AY084916 EMBL:BT006117 EMBL:AK118721 IPI:IPI00529305
            RefSeq:NP_564678.1 UniGene:At.37108 ProteinModelPortal:Q9C717
            SMR:Q9C717 PaxDb:Q9C717 PRIDE:Q9C717 EnsemblPlants:AT1G55170.1
            GeneID:841960 KEGG:ath:AT1G55170 TAIR:At1g55170 eggNOG:NOG306311
            InParanoid:Q9C717 OMA:ELHRMNL PhylomeDB:Q9C717
            ProtClustDB:CLSN2688822 ArrayExpress:Q9C717 Genevestigator:Q9C717
            Uniprot:Q9C717
        Length = 283

 Score = 340 (124.7 bits), Expect = 6.9e-31, P = 6.9e-31
 Identities = 86/241 (35%), Positives = 130/241 (53%)

Query:    32 RPPMPGAFPPFDMMPPPEVMEQ------KIASQHVEMQKLATENQRLAATHGTLRQELAA 85
             RP + G  PP    PPP ++E       +I  Q  E+++L ++N  LA     L +EL A
Sbjct:    25 RPFLRG--PPLLQPPPPSLLEDLQIQEGEIRRQDAEIRRLLSDNHGLADDRMVLERELVA 82

Query:    86 AQHELQILHGQIGGMKSERELQMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVA 145
             A+ EL  ++  I  +++E++LQ+R  +EK  K+E +++  E  K E  + + E Q L   
Sbjct:    83 AKEELHRMNLMISDLRAEQDLQLREFSEKRHKLEGDVRAMESYKKEASQLRGEVQKLDEI 142

Query:   146 REELIAKVHQLTQDLQRAHTDVQQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESL 205
             + EL   V  L +DL +  +D +QIP + +E++ L++E  H R   EYEKK   + +E  
Sbjct:   143 KRELSGNVQLLRKDLAKLQSDNKQIPGMRAEVKDLQKELMHARDAIEYEKKEKFELMEQR 202

Query:   206 QVMEKNYITMATEVEKLRAELMNAPNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDG 265
             Q MEKN ++MA EVEKLRAEL     VD R     +GG+ G + N   G   G     D 
Sbjct:   203 QTMEKNMVSMAREVEKLRAELAT---VDSRP--WGFGGSYGMNYNNMDGTFRGSYGENDT 257

Query:   266 Y 266
             Y
Sbjct:   258 Y 258
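
The score lines report both an Expect value and a P value, and in every alignment shown here the two are printed as the same number. That fits the usual Poisson relation between them, P = 1 - exp(-E), under which P and E are nearly equal whenever E is very small. The sketch below illustrates that relation only. It assumes the Poisson form applies to these values, and the helper name `p_from_expect` is hypothetical.

```python
import math


def p_from_expect(expect):
    """Convert an Expect value to a P value via P = 1 - exp(-E).

    math.expm1 keeps precision for tiny E, where computing
    1 - math.exp(-E) directly would round to 0.0.
    """
    return -math.expm1(-expect)


if __name__ == "__main__":
    for e in (6.9e-31, 1e-3, 1.0, 10.0):
        print("E = %g  ->  P = %g" % (e, p_from_expect(e)))
```

For the values in this report (all below 1e-4) the difference between E and P falls well below the two significant digits that are printed.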


>TAIR|locus:2156146
            symbol:AT5G61920 "AT5G61920" species:3702 "Arabidopsis
            thaliana" [GO:0008150 "biological_process" evidence=ND]
            EMBL:CP002688 GenomeReviews:BA000015_GR EMBL:AB022212
            UniGene:At.55672 EMBL:DQ447104 IPI:IPI00520542
            RefSeq:NP_001119474.1 RefSeq:NP_200998.1 PRIDE:Q9FH51
            EnsemblPlants:AT5G61920.1 EnsemblPlants:AT5G61920.2 GeneID:836313
            KEGG:ath:AT5G61920 TAIR:At5g61920 eggNOG:NOG265125
            HOGENOM:HOG000090683 InParanoid:Q9FH51 OMA:KAHIRSI PhylomeDB:Q9FH51
            ProtClustDB:CLSN2686951 Genevestigator:Q9FH51 Uniprot:Q9FH51
        Length = 238

 Score = 292 (107.8 bits), Expect = 2.2e-25, P = 2.2e-25
 Identities = 64/183 (34%), Positives = 107/183 (58%)

Query:    49 EVMEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQM 108
             +++E KIA Q  E+ +L+ +N++LA+++  L+++L  A  E+Q L   I   +++ E+Q+
Sbjct:    51 DILENKIAVQAAEIDRLSNDNRKLASSYVALKEDLTVADREVQGLRAHIRKTETDHEIQI 110

Query:   109 RNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQ 168
             R+  EKIAKME  +K  E ++ E Q +  EA  L   REEL +KV    +DL++   + +
Sbjct:   111 RSTLEKIAKMEGMVKNRENIRREVQSAHIEAHRLAREREELASKVKLGMKDLKKVCLEAE 170

Query:   169 QIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMN 228
              + A   ELE L++E+   R  +E EK    + L  L+ ME+  I     +EKLR+E+  
Sbjct:   171 SLEASSQELERLKEEHQRLRKEFEEEKSGNVEKLAQLKGMERKIIGAVKAIEKLRSEIST 230

Query:   229 APN 231
             A N
Sbjct:   231 ARN 233


>FB|FBgn0085362
            symbol:Vml "Vitelline membrane-like" species:7227 "Drosophila
            melanogaster" [GO:0009950 "dorsal/ventral axis specification"
            evidence=IGI] [GO:0060388 "vitelline envelope" evidence=IDA]
            [GO:0007305 "vitelline membrane formation involved in
            chorion-containing eggshell formation" evidence=ISM] [GO:0008316
            "structural constituent of vitelline membrane" evidence=ISM]
            [GO:0035805 "egg coat" evidence=ISM] EMBL:AE014298 GO:GO:0009950
            GeneTree:ENSGT00700000104744 PROSITE:PS51137 GO:GO:0060388
            InterPro:IPR013135 RefSeq:NP_001096866.1 UniGene:Dm.32785
            STRING:A8JUV4 EnsemblMetazoa:FBtr0112535 GeneID:5740271
            KEGG:dme:Dmel_CG34333 UCSC:CG34333-RA CTD:5740271
            FlyBase:FBgn0085362 eggNOG:NOG284187 InParanoid:A8JUV4 OMA:ISKYETI
            OrthoDB:EOG4KPRTT GenomeRNAi:5740271 NextBio:20891311 Bgee:A8JUV4
            Uniprot:A8JUV4
        Length = 578

 Score = 284 (105.0 bits), Expect = 3.4e-22, P = 3.4e-22
 Identities = 84/284 (29%), Positives = 100/284 (35%)

Query:   229 APNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQG---HGPPPSATTAGVVG 285
             AP+    AA  SY      S +  S  P         Y  P     H P   A++     
Sbjct:   198 APSYSAPAAP-SYSAPAAPSYSAPSA-PSYSAQKTSSYSAPAAPSYHAPAAPASSYSAP- 254

Query:   286 AGPNTSTSA---YAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDP 342
             AGP+ S  A   Y+A     P  ++Y   + P Y A   P Y A  APSY  +  PSY  
Sbjct:   255 AGPSYSAPAAPSYSAPSYSAPA-SSYSALKAPSYSAPAAPSYSAPAAPSYSSSASPSYSS 313

Query:   343 AKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP 402
                  Y     P Y A K  +Y A   P+Y     PSY       Y     P+Y     P
Sbjct:   314 PASSSYSAPAAPTYSAPKAQSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAP 373

Query:   403 GYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDG 462
              Y     P Y       Y A  APSY     P Y       Y    APSY       +  
Sbjct:   374 SYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYS- 432

Query:   463 APRGAAPHGQVPP-PLNNVPYGSATPPARS---GSGQPRGGNPA 502
             AP  AAP    P  P  + P  S    AR+   GS  P  G  A
Sbjct:   433 AP--AAPSYSAPAAPSYSAPASSGYSAARAYSAGSAAPASGYSA 474

 Score = 274 (101.5 bits), Expect = 4.7e-21, P = 4.7e-21
 Identities = 80/271 (29%), Positives = 97/271 (35%)

Query:   244 ATGNSENETSGRPVGQNAYEDGYG--VP-QGHGPP------PSATTAGVVG-AGPNTSTS 293
             AT N E +  G P  +  YE+ +   +P Q + PP       S + A   G + P     
Sbjct:    24 ATRNEEFD-DGFPESEFDYEERHTREIPAQAYAPPIVYNSQSSYSPAKDQGYSAPAAPVY 82

Query:   294 AYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG 353
             + AA     P   +Y  P  P Y A   P Y A  APSY     PSY       Y     
Sbjct:    83 SPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPASSSYSAPAA 142

Query:   354 PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYD 413
             P Y A    +Y A   P+Y      SY       Y     P+Y     P Y     P Y 
Sbjct:   143 PSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYS 202

Query:   414 VQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYD-PSRGTGFDGAPRGAAPHGQ 472
                 P Y A  APSY     P Y  Q+   Y    APSY  P+       AP G  P   
Sbjct:   203 APAAPSYSAPAAPSYSAPSAPSYSAQKTSSYSAPAAPSYHAPAAPASSYSAPAG--PSYS 260

Query:   473 VPP-PLNNVPYGSATPPARSGSGQPRGGNPA 502
              P  P  + P  SA   + S    P    PA
Sbjct:   261 APAAPSYSAPSYSAPASSYSALKAPSYSAPA 291

 Score = 262 (97.3 bits), Expect = 1.1e-19, P = 1.1e-19
 Identities = 69/246 (28%), Positives = 83/246 (33%)

Query:   266 YGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGY 324
             Y  P  +    S + A   G + P     + AA     P   +Y  P  P Y A   P Y
Sbjct:    54 YAPPIVYNSQSSYSPAKDQGYSAPAAPVYSPAAPSYSAPAAPSYSAPAAPSYSAPAAPSY 113

Query:   325 DASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQR 384
              A  APSY     PSY       Y     P Y A    +Y A   P+Y      SY    
Sbjct:   114 SAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPAAPSYSAPASSSYSAPA 173

Query:   385 GLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGY 444
                Y     P+Y     P Y     P Y     P Y A  APSY     P Y  Q+   Y
Sbjct:   174 APSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPSAPSYSAQKTSSY 233

Query:   445 DMRRAPSYD-PSRGTGFDGAPRGAAPHGQVPP----PLNNVP---YGSATPPARSGSGQP 496
                 APSY  P+       AP G +      P    P  + P   Y +   P+ S    P
Sbjct:   234 SAPAAPSYHAPAAPASSYSAPAGPSYSAPAAPSYSAPSYSAPASSYSALKAPSYSAPAAP 293

Query:   497 RGGNPA 502
                 PA
Sbjct:   294 SYSAPA 299

 Score = 259 (96.2 bits), Expect = 2.4e-19, P = 2.4e-19
 Identities = 66/241 (27%), Positives = 84/241 (34%)

Query:   266 YGVPQG--HGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPG 323
             Y  P G  +  P + + +    + P +S SA  A     P   +Y  P  P Y +S  P 
Sbjct:   251 YSAPAGPSYSAPAAPSYSAPSYSAPASSYSALKAPSYSAPAAPSYSAPAAPSYSSSASPS 310

Query:   324 YD--------ASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIH 375
             Y         A  AP+Y   K  SY     P Y     P Y A   S+Y A   P+Y   
Sbjct:   311 YSSPASSSYSAPAAPTYSAPKAQSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAP 370

Query:   376 RGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPG 435
               PSY       Y      +Y     P Y     P Y       Y A  APSY     P 
Sbjct:   371 AAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPS 430

Query:   436 YDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQ 495
             Y       Y    APSY     +G+  A R  +     P    + P  S+   A + SG 
Sbjct:   431 YSAPAAPSYSAPAAPSYSAPASSGYSAA-RAYSAGSAAPASGYSAPKTSSGYSAPASSGS 489

Query:   496 P 496
             P
Sbjct:   490 P 490

 Score = 252 (93.8 bits), Expect = 1.5e-18, P = 1.5e-18
 Identities = 74/278 (26%), Positives = 92/278 (33%)

Query:   229 APNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGP 288
             AP+    AA  SY      S +  +       A    Y  P         T++    A P
Sbjct:   182 APSYSAPAAP-SYSAPAAPSYSAPAAPSYSAPA-APSYSAPSAPSYSAQKTSSYSAPAAP 239

Query:   289 NTSTSAYAATQSGTPMRAAYDIPRGPGYEASK--GPG--YDASKAPSYDPTKGPSYDPAK 344
             +    A  A+    P   +Y  P  P Y A     P   Y A KAPSY     PSY    
Sbjct:   240 SYHAPAAPASSYSAPAGPSYSAPAAPSYSAPSYSAPASSYSALKAPSYSAPAAPSYSAPA 299

Query:   345 GPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY 404
              P Y  +  P Y +   S+Y A   P Y   +  SY       Y     P+Y       Y
Sbjct:   300 APSYSSSASPSYSSPASSSYSAPAAPTYSAPKAQSYSAPAAPSYSAPAAPSYSAPASSSY 359

Query:   405 ETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 464
                  P Y     P Y A  APSY       Y       Y    APSY     + +  AP
Sbjct:   360 SAPAAPSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPASSSYS-AP 418

Query:   465 RGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPA 502
               AAP    P   +   Y +   P+ S    P    PA
Sbjct:   419 --AAPSYSAPAAPS---YSAPAAPSYSAPAAPSYSAPA 451

 Score = 218 (81.8 bits), Expect = 9.3e-15, P = 9.3e-15
 Identities = 81/279 (29%), Positives = 95/279 (34%)

Query:   227 MNAPNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA 286
             + AP+    AA  SY      S + +S  P   +     Y  P    P  SA  A    A
Sbjct:   282 LKAPSYSAPAAP-SYSAPAAPSYS-SSASPSYSSPASSSYSAPAA--PTYSAPKAQSYSA 337

Query:   287 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP 346
                 S SA AA     P  ++Y  P  P Y A   P Y A  APSY      SY     P
Sbjct:   338 PAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAP 397

Query:   347 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 406
              Y     P Y A   S+Y A   P+Y     PSY       Y     P+Y      GY  
Sbjct:   398 SYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPASSGYSA 457

Query:   407 QRVPGYDVQRGPVYEAQRAPSY-IPQRGPGYDLQRGQGYDMRRAPSYD-PSRGTGFDG-- 462
              R   Y    G    A  A  Y  P+   GY      G     A SY  P+  T   G  
Sbjct:   458 ARA--YSA--G---SAAPASGYSAPKTSSGYSAPASSGSPA--ASSYSAPASSTASSGYS 508

Query:   463 AP--------RGAAPHGQVPPPLNNVPYGSATPPARSGS 493
             AP        R    H  +        YGSA P A  G+
Sbjct:   509 APASKSSGYARSEMDHQILGMARTAGGYGSAAPSAAYGA 547


>TAIR|locus:2060848
            symbol:AT2G30120 species:3702 "Arabidopsis thaliana"
            [GO:0009507 "chloroplast" evidence=ISM] EMBL:CP002685
            IPI:IPI00938894 RefSeq:NP_001154541.1 UniGene:At.19562
            EnsemblPlants:AT2G30120.2 GeneID:817564 KEGG:ath:AT2G30120
            OMA:PEANGTH Uniprot:F4IMQ0
        Length = 288

 Score = 221 (82.9 bits), Expect = 1.6e-17, P = 1.6e-17
 Identities = 65/235 (27%), Positives = 114/235 (48%)

Query:    50 VMEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQMR 109
             ++E +IA QH E+Q L  +NQRLA  H  L+ +L  A+ EL+ L      +K+E E ++R
Sbjct:    38 ILEDRIAIQHREIQSLLNDNQRLAVAHIGLKDQLNVAKRELERLLETAVKVKAEGEAKVR 97

Query:   110 NLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQQ 169
              + +   +MEAE +  + +  E  + +++ Q L   R+EL  ++     ++ +A  +  +
Sbjct:    98 EVYQNALRMEAEARVIDGLGAELGQVRSDVQRLGSDRQELATELAMFDDEMAKAKPNSDR 157

Query:   170 IPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNA 229
                +  E+E LR E    R   E EKK    +L   + MEK    +  E+ KL  EL++ 
Sbjct:   158 AIEVKLEIEILRGEIRKGRAALELEKKTRASNLHHERGMEKTIDHLNREIVKLEEELVD- 216

Query:   230 PNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVV 284
               ++ +A + +       + +       G N  +D YG  QG   P +  T  +V
Sbjct:   217 --LETKAREANAAAEAAPTPSPGLAASYGNNT-DDIYG-GQGRQYPEANGTHELV 267


>UNIPROTKB|A2VD00
            symbol:eif3a "Eukaryotic translation initiation factor 3
            subunit A" species:8355 "Xenopus laevis" [GO:0001732 "formation of
            translation initiation complex" evidence=ISS] [GO:0005852
            "eukaryotic translation initiation factor 3 complex" evidence=ISS]
            [GO:0003743 "translation initiation factor activity" evidence=ISS]
            InterPro:IPR000717 Pfam:PF01399 SMART:SM00088 GO:GO:0003743
            GO:GO:0005852 KO:K03254 HAMAP:MF_03000 HOVERGEN:HBG006128
            GO:GO:0001732 EMBL:BC129055 RefSeq:NP_001085285.1 UniGene:Xl.57279
            PRIDE:A2VD00 GeneID:443632 KEGG:xla:443632 Uniprot:A2VD00
        Length = 1424

 Score = 195 (73.7 bits), Expect = 1.2e-11, P = 1.2e-11
 Identities = 119/453 (26%), Positives = 178/453 (39%)

Query:    50 VMEQKIASQHVEMQKLATENQRLAATHGT-LRQELAAAQHELQILHGQIGGMKSER--EL 106
             V   K + Q V   KL    +RLA      L +     + E +I + +    + +R  E 
Sbjct:   761 VSNLKASRQSVYDAKLKQFQERLAEEKRVRLEERKRQRKEERRISYYRDKEEEEQRLIEE 820

Query:   107 QMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTD 166
             Q++   E   K+E E + AE  + + +  K E Q       EL  +  +  +D +R   D
Sbjct:   821 QLKQEREDREKIENEKREAEQREYQERLKKLEEQERKKRLRELEIEEREKKRDEERRGPD 880

Query:   167 ----VQQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKL 222
                  Q  P+   +    R+E    RG    E+K      +     + +      + E  
Sbjct:   881 DSFRKQDTPSRWGD----REESGWRRGADPDERKQAPPERDWRSGGQDSKPVKDEDREGD 936

Query:   223 RAELMNAPNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAG 282
                ++          DG    A   S   T  R   ++  EDG G  +G    P      
Sbjct:   937 EDSVLRKDEEQVARGDGDEERAA--SWRGTDDRGPKRSVEEDG-GPRRGFNDEPGPRRGF 993

Query:   283 VVGAGPNTSTSAYAATQSGTPMRAAYDIPRGP--GYEASKGP--GYDASKAP--SYDPTK 336
                 GP          +   P R   D  RGP  G +  +GP  G D  + P    D  +
Sbjct:   994 EDDQGPRRGLD-----EDRGPRRGL-DEDRGPRRGLDEDRGPRRGLDEDRGPRRGLDEDR 1047

Query:   337 GP--SYDPAKGP--GYDPTKGP--GYDAQKGSN--YDAQRGP--NYDIHRGPSYDPQRGL 386
             GP    D  +GP  G D  +GP  G+D  +G    +D  RGP  ++D  RGP    +RG 
Sbjct:  1048 GPRRGLDEDRGPRRGLDEDRGPRRGFDEDRGPRRGFDEDRGPRRDFDEDRGP----RRG- 1102

Query:   387 GYDMQRGPN--YDMQRGP--GYETQRVP--GYDVQRGPV--YEAQRAP--SYIPQRGP-- 434
              +D  RGP   +D  RGP  G++  R P  G+D  RGP   ++  R P   +   RGP  
Sbjct:  1103 -FDEDRGPRRGFDEDRGPRRGFDEDRGPRRGFDDDRGPRRGFDDDRGPRRGFEDDRGPRR 1161

Query:   435 GYDLQRG--QGYDMRRAPSYDPSRGTGFDGAPR 465
             G++  RG  +G++  R P     RG   D  PR
Sbjct:  1162 GFEDDRGPRRGFEDDRGPR----RGFDEDRTPR 1190

 Score = 184 (69.8 bits), Expect = 9.1e-15, Sum P(2) = 9.1e-15
 Identities = 66/197 (33%), Positives = 90/197 (45%)

Query:   305 RAAYDIPRGP--GYEASKGP--GYDASKAP--SYDPTKGP--SYDPAKGP--GYDPTKGP 354
             R   D  RGP  G +  +GP  G D  + P    D  +GP   +D  +GP  G+D  +GP
Sbjct:  1030 RRGLDEDRGPRRGLDEDRGPRRGLDEDRGPRRGLDEDRGPRRGFDEDRGPRRGFDEDRGP 1089

Query:   355 GYDAQKGSNYDAQRGPN--YDIHRGP--SYDPQRG--LGYDMQRGPN--YDMQRGP--GY 404
               D      +D  RGP   +D  RGP   +D  RG   G+D  RGP   +D  RGP  G+
Sbjct:  1090 RRD------FDEDRGPRRGFDEDRGPRRGFDEDRGPRRGFDEDRGPRRGFDDDRGPRRGF 1143

Query:   405 ETQRVP--GYDVQRGPV--YEAQRAP--SYIPQRGP--GYDLQRG--QGYDMRRAP--SY 452
             +  R P  G++  RGP   +E  R P   +   RGP  G+D  R   +G++  R P    
Sbjct:  1144 DDDRGPRRGFEDDRGPRRGFEDDRGPRRGFEDDRGPRRGFDEDRTPRRGFEDDRGPRRGM 1203

Query:   453 DPSRGTGFDGAPRGAAP 469
             D  R +   GA     P
Sbjct:  1204 DEERVSWRGGAEEDRGP 1220

 Score = 159 (61.0 bits), Expect = 4.9e-12, Sum P(2) = 4.9e-12
 Identities = 61/197 (30%), Positives = 91/197 (46%)

Query:   305 RAAYDIPRGP--GYEASKGP--GYDASKAP--SYDPTKGP--SYDPAKGP--GYDPTKGP 354
             R  +D  RGP   ++  +GP  G+D  + P   +D  +GP   +D  +GP  G+D  +GP
Sbjct:  1080 RRGFDEDRGPRRDFDEDRGPRRGFDEDRGPRRGFDEDRGPRRGFDEDRGPRRGFDDDRGP 1139

Query:   355 GYDAQKGSNYDAQRGPN--YDIHRGPSYDPQRGLGYDMQRGPN--YDMQRGP--GYETQR 408
                 ++G  +D  RGP   ++  RGP    +RG  ++  RGP   ++  RGP  G++  R
Sbjct:  1140 ----RRG--FDDDRGPRRGFEDDRGP----RRG--FEDDRGPRRGFEDDRGPRRGFDEDR 1187

Query:   409 VP--GYDVQRGPV--YEAQRAP---SYIPQRGPGYDLQRGQGYDMRRAPSYD--PSRGTG 459
              P  G++  RGP    + +R          RGP    +  +G   RR    D  P RG  
Sbjct:  1188 TPRRGFEDDRGPRRGMDEERVSWRGGAEEDRGPRRGAEEDRG--PRRGAEEDRGPRRGAE 1245

Query:   460 FDGAPRGAAPH--GQVP 474
              D  PR  A    GQ P
Sbjct:  1246 EDRGPRRGAEEDRGQTP 1262

 Score = 91 (37.1 bits), Expect = 9.1e-15, Sum P(2) = 9.1e-15
 Identities = 42/178 (23%), Positives = 84/178 (47%)

Query:    53 QKIASQHVEMQ--KLATENQRLAAT--HGTLRQELAAAQHELQILHGQIGGMKSERELQM 108
             Q + S+H+  Q   ++T   +  AT     + QE    QH++ I   Q    K  + +  
Sbjct:   512 QNMPSEHIRNQLTAMSTVLSKAVATIKPAHVLQE-KEEQHQIAISAYQKNSRKEHQRILT 570

Query:   109 RNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQ 168
             R  T +  K   E    +  K E ++ + E Q +  A EE + +  +  ++ +R   + +
Sbjct:   571 RRQTIEERKERLENLNIQREKEEHEQREAELQKVRKAEEERLRQEAK-EREKERILQEHE 629

Query:   169 QIPALLSELESLRQEYHHCRGTYEYEKKFYND-HLESLQVMEKNYITMATEVEKLRAE 225
             QI     + +++R+     + T E+  K + D  +E+L+ ++ ++I MA +VE+L  E
Sbjct:   630 QI-----KKKTVRERLEQIKKT-EFGAKAFKDIDIENLEELDPDFI-MAKQVEQLEKE 680

 Score = 54 (24.1 bits), Expect = 6.2e-11, Sum P(2) = 6.2e-11
 Identities = 17/62 (27%), Positives = 32/62 (51%)

Query:   162 RAHTDVQQI--PALLSELE---SLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMA 216
             + H   Q+I  P +L  LE    LR+ +    G Y+Y+      +++SL+ + + Y+ +A
Sbjct:    38 KKHRTWQKIHEPIMLKYLELCVDLRKSHLAKEGLYQYKNICQQVNIKSLEDVVRAYLKLA 97

Query:   217 TE 218
              E
Sbjct:    98 EE 99


>UNIPROTKB|A4II09
            symbol:eif3a "Eukaryotic translation initiation factor 3
            subunit A" species:8364 "Xenopus (Silurana) tropicalis" [GO:0001732
            "formation of translation initiation complex" evidence=ISS]
            [GO:0005852 "eukaryotic translation initiation factor 3 complex"
            evidence=ISS] [GO:0003743 "translation initiation factor activity"
            evidence=ISS] InterPro:IPR000717 Pfam:PF01399 SMART:SM00088
            GO:GO:0003743 GO:GO:0005852 eggNOG:NOG236708 HOGENOM:HOG000246822
            KO:K03254 HAMAP:MF_03000 HOVERGEN:HBG006128 GO:GO:0001732 CTD:8661
            EMBL:BC135790 RefSeq:NP_001096173.1 UniGene:Str.55518 STRING:A4II09
            PRIDE:A4II09 GeneID:100124719 KEGG:xtr:100124719
            Xenbase:XB-GENE-994394 Uniprot:A4II09
        Length = 1391

 Score = 186 (70.5 bits), Expect = 1.3e-14, Sum P(2) = 1.3e-14
 Identities = 68/224 (30%), Positives = 101/224 (45%)

Query:   267 GVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGP--GYEASKGP-- 322
             G+ +  GP      AG    G           +     R  +D  RGP  G++  +GP  
Sbjct:   981 GLEEDRGPRRGIDDAGP-RRGFEEDRGPRRGIEDDRAPRRGFDDDRGPRRGFDDDRGPRR 1039

Query:   323 GYDASKAP--SYDPTKGP--SYDPAKGP--GYDPTKGP--GYDAQKGSN--YDAQRGPN- 371
             G+D  + P    D  +GP   +D  + P  G+D  +GP  G+D  +G    +D  RGP  
Sbjct:  1040 GFDEDRGPRRGIDDDRGPRRGFDEDRTPRRGFDDDRGPRRGFDDDRGPRRGFDEDRGPRR 1099

Query:   372 -YDIHRGP--SYDPQRG--LGYDMQRGPN--YDMQRGP--GYETQRVP--GYDVQRGPVY 420
              ++  RGP   ++  RG   G++  RGP   ++  RGP  G+E  R P  G+D  RGP  
Sbjct:  1100 GFEDDRGPRRGFEDDRGPRRGFEDDRGPRRGFEDDRGPRRGFEDDRGPRRGFDEDRGP-- 1157

Query:   421 EAQRAPSYIPQRGP--GYDLQRG--QGYDMRRAP--SYDPSRGT 458
               +R   +   RGP  G+D  R   +G+D  R P    D  RG+
Sbjct:  1158 --RRG--FEDDRGPRRGFDEDRTPRRGFDDDRGPRRGLDEDRGS 1197

 Score = 183 (69.5 bits), Expect = 2.9e-14, Sum P(2) = 2.9e-14
 Identities = 65/191 (34%), Positives = 92/191 (48%)

Query:   305 RAAYDIPRGP--GYEASKGP--GYDASKAP--SYDPTKGP--SYDPAKGP--GYDPTKGP 354
             R  ++  RGP  G E  + P  G+D  + P   +D  +GP   +D  +GP  G D  +GP
Sbjct:   998 RRGFEEDRGPRRGIEDDRAPRRGFDDDRGPRRGFDDDRGPRRGFDEDRGPRRGIDDDRGP 1057

Query:   355 GYDAQKGSNYDAQRGPN--YDIHRGP--SYDPQRG--LGYDMQRGPN--YDMQRGP--GY 404
                 ++G  +D  R P   +D  RGP   +D  RG   G+D  RGP   ++  RGP  G+
Sbjct:  1058 ----RRG--FDEDRTPRRGFDDDRGPRRGFDDDRGPRRGFDEDRGPRRGFEDDRGPRRGF 1111

Query:   405 ETQRVP--GYDVQRGPV--YEAQRAP--SYIPQRGP--GYDLQRG--QGYDMRRAPSYDP 454
             E  R P  G++  RGP   +E  R P   +   RGP  G+D  RG  +G++  R P    
Sbjct:  1112 EDDRGPRRGFEDDRGPRRGFEDDRGPRRGFEDDRGPRRGFDEDRGPRRGFEDDRGPR--- 1168

Query:   455 SRGTGFDGAPR 465
              RG   D  PR
Sbjct:  1169 -RGFDEDRTPR 1178

 Score = 167 (63.8 bits), Expect = 1.6e-12, Sum P(2) = 1.6e-12
 Identities = 71/225 (31%), Positives = 103/225 (45%)

Query:   311 PRGPGYEASKGP--GYDASKAP--SYDPTKGP--SYDPAKGP--GYDPTKGP--GYDAQK 360
             PR  G++  + P  G+D  + P   +D  +GP   +D  +GP  G++  +GP  G++  +
Sbjct:  1057 PRR-GFDEDRTPRRGFDDDRGPRRGFDDDRGPRRGFDEDRGPRRGFEDDRGPRRGFEDDR 1115

Query:   361 GSN--YDAQRGPN--YDIHRGP--SYDPQRG--LGYDMQRGPN--YDMQRGP--GYETQR 408
             G    ++  RGP   ++  RGP   ++  RG   G+D  RGP   ++  RGP  G++  R
Sbjct:  1116 GPRRGFEDDRGPRRGFEDDRGPRRGFEDDRGPRRGFDEDRGPRRGFEDDRGPRRGFDEDR 1175

Query:   409 VP--GYDVQRGPV--YEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 464
              P  G+D  RGP    +  R  S+   RG G D+ R +G D  R P     RG   D  P
Sbjct:  1176 TPRRGFDDDRGPRRGLDEDRG-SW---RG-GDDVPR-RGADDDRGPR----RGADDDRGP 1225

Query:   465 RGAAPHGQVP--PPLNNVPYG-SATPPARSGS-GQPRGGN-PARR 504
             R      Q P  P   + P G      AR  S G PR    P  R
Sbjct:  1226 RRGEDRDQTPWKPMAASRPGGWREREKAREDSWGPPRDSQAPEER 1270

 Score = 165 (63.1 bits), Expect = 2.2e-08, P = 2.2e-08
 Identities = 105/441 (23%), Positives = 176/441 (39%)

Query:    58 QHVEMQKLATENQRLAATH-GTLRQELAAA----QHELQILH-GQIGGMKSERELQMRNL 111
             + + + K A E QR+       L++E   +    + E  + H  ++  M  ++EL +  L
Sbjct:   705 EEIPLLKKAYEEQRINDMELWELQEEERISTLLLEREKAVEHKNRMSRMVEDKELFVSKL 764

Query:   112 -TEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQQI 170
                + +  EA+LK  +    E + ++ E +      E  +       ++ +R   +  Q+
Sbjct:   765 KASRQSLYEAKLKQFQERLAEEKAARLEERKRERKEERRVNYYRDKEEEEERLREE--QL 822

Query:   171 PALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNAP 230
                  E E +  E        E E++ Y + L+ L+  E+       E+E    E     
Sbjct:   823 KQEREEQEKVENEKR------EAEQRDYQERLKKLEEQERKKRQRELEIE----ERERKR 872

Query:   231 NVDRRAADGSYGGATGN-SENETSGRPVGQNAYEDGYGVPQG---HGPPPSATTAGVVGA 286
               +RR  D ++   +    E E SG   G +  E     P+     G P S        +
Sbjct:   873 EEERRGGDDTFRKDSSRWGEREESGWRRGADPDERKQVPPERDWRRGGPDSKPVINEDAS 932

Query:   287 G-PNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGY-DASKAPS--YDPTKGP--SY 340
                    +A    +     RA  +    P  +  KG  + D  + P    +  +GP    
Sbjct:   933 NREEDENAALRKDEEQVSSRAFEEKVSLPDADEEKGGSWRDEDRGPKRGLEEDRGPRRGI 992

Query:   341 DPAKGP--GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGP--SYDPQRG--LGYDMQRGP 394
             D A GP  G++  +GP    ++G   D      +D  RGP   +D  RG   G+D  RGP
Sbjct:   993 DDA-GPRRGFEEDRGP----RRGIEDDRAPRRGFDDDRGPRRGFDDDRGPRRGFDEDRGP 1047

Query:   395 N--YDMQRGP--GYETQRVP--GYDVQRGPVYEAQRAPSYIPQRGP--GYDLQRG--QGY 444
                 D  RGP  G++  R P  G+D  RGP    +R   +   RGP  G+D  RG  +G+
Sbjct:  1048 RRGIDDDRGPRRGFDEDRTPRRGFDDDRGP----RRG--FDDDRGPRRGFDEDRGPRRGF 1101

Query:   445 DMRRAP--SYDPSRGT--GFD 461
             +  R P   ++  RG   GF+
Sbjct:  1102 EDDRGPRRGFEDDRGPRRGFE 1122

 Score = 87 (35.7 bits), Expect = 1.3e-14, Sum P(2) = 1.3e-14
 Identities = 40/187 (21%), Positives = 81/187 (43%)

Query:    39 FPPFDMMPPPEVMEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIG 98
             F PF    P E +  ++ +    + K       +   H    +E    QH++ I   Q  
Sbjct:   507 FGPFLQNMPSEQIRNQLTAMSCVLSKAVGA---IKPAHVLQEKE---EQHQIAITAYQKN 560

Query:    99 GMKSERELQMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQ 158
               K  + +  R  T +  K   E    +  K E ++ + E Q +  A EE + +  +  +
Sbjct:   561 SRKEHQRILARRQTIEERKERLENLNIQREKEEMEQKEAELQKVRKAEEERLRQEAK-ER 619

Query:   159 DLQRAHTDVQQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATE 218
             + +R   + +QI     + +++R+     + T    K F +  +E+L+ ++ ++I MA +
Sbjct:   620 EKERILQEHEQI-----KKKTVRERLEQIKKTELGAKAFKDIDIENLEELDPDFI-MAKQ 673

Query:   219 VEKLRAE 225
             VE+L  E
Sbjct:   674 VEQLEKE 680

 Score = 54 (24.1 bits), Expect = 3.5e-11, Sum P(2) = 3.5e-11
 Identities = 17/62 (27%), Positives = 32/62 (51%)

Query:   162 RAHTDVQQI--PALLSELE---SLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMA 216
             + H   Q+I  P +L  LE    LR+ +    G Y+Y+      +++SL+ + + Y+ +A
Sbjct:    38 KKHRTWQKIHEPIMLKYLELCVDLRKSHLAKEGLYQYKNICQQVNIKSLEDVVRAYLKLA 97

Query:   217 TE 218
              E
Sbjct:    98 EE 99

 Score = 51 (23.0 bits), Expect = 7.2e-11, Sum P(2) = 7.2e-11
 Identities = 27/120 (22%), Positives = 51/120 (42%)

Query:    50 VMEQKIASQHVEMQKLATENQRLAATHGT-LRQELAAAQHELQILHGQIGGMKSER--EL 106
             V + K + Q +   KL    +RLA      L +     + E ++ + +    + ER  E 
Sbjct:   761 VSKLKASRQSLYEAKLKQFQERLAEEKAARLEERKRERKEERRVNYYRDKEEEEERLREE 820

Query:   107 QMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTD 166
             Q++   E+  K+E E + AE    + +  K E Q     + EL  +  +  ++ +R   D
Sbjct:   821 QLKQEREEQEKVENEKREAEQRDYQERLKKLEEQERKKRQRELEIEERERKREEERRGGD 880


>UNIPROTKB|F1S187
            symbol:LOC100518332 "Uncharacterized protein" species:9823
            "Sus scrofa" [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0000166 "nucleotide binding" evidence=IEA] InterPro:IPR000504
            InterPro:IPR001876 InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641
            PROSITE:PS01358 PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360
            SMART:SM00547 GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330
            GO:GO:0003676 GO:GO:0005622 GeneTree:ENSGT00530000063105
            EMBL:CU896616 Ensembl:ENSSSCT00000019273 OMA:TESSSGX Uniprot:F1S187
        Length = 406

 Score = 204 (76.9 bits), Expect = 1.4e-13, P = 1.4e-13
 Identities = 71/225 (31%), Positives = 86/225 (38%)

Query:   228 NAPNV-DRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA 286
             N P   D R + G + G     E    GR  G+     GYG  +  G      + G  G 
Sbjct:   183 NEPRPEDSRPSGGDFRGRGYGGERGYRGRG-GRGGDRGGYGGDRSGGGYGGDRSGGG-GY 240

Query:   287 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP 346
             G + S   Y   +SG      Y   RG GY   +G GY   +   Y   +   Y   +G 
Sbjct:   241 GGDRSGGGYGGDRSG----GGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRSGGYGGDRGG 296

Query:   347 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQR-GPNY--DMQRGPG 403
             GY   +G GY   +G  Y   RG  Y   RG  Y   RG GY   R G  Y  D   G G
Sbjct:   297 GYGGDRGGGYGGDRGGGYGGDRGGGYGGDRG-GYGGDRG-GYGGDRSGGGYGGDRGSGSG 354

Query:   404 YETQRVPGYDVQR-GPVYEAQRAPSYIPQRGPGYDLQRGQGYDMR 447
             Y   R  GY   R G  Y   R+  Y   RG GY  + G   D R
Sbjct:   355 YGGDRSGGYGGDRSGGGYGGDRSGGYGGDRG-GYGGKMGGRNDYR 398

 Score = 170 (64.9 bits), Expect = 9.5e-10, P = 9.5e-10
 Identities = 57/163 (34%), Positives = 65/163 (39%)

Query:   312 RGPGYEASK-GPGY--DASKAPSYDPTK-GPSYDPAK-GPGYDPTKGPGYDAQKGSNYDA 366
             RG GY   + G GY  D S    Y   + G  Y   + G GY   +G GY   +G  Y  
Sbjct:   218 RG-GYGGDRSGGGYGGDRSGGGGYGGDRSGGGYGGDRSGGGYGGDRGGGYGGDRGGGYGG 276

Query:   367 QRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAP 426
              RG  Y   R   Y   RG GY   RG  Y   RG GY   R  GY   RG  Y   R  
Sbjct:   277 DRGGGYGGDRSGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRGG-YGGDRGG 335

Query:   427 SYIPQRGPGY--DLQRGQGYDMRRAPSYDPSR-GTGFDGAPRG 466
                 + G GY  D   G GY   R+  Y   R G G+ G   G
Sbjct:   336 YGGDRSGGGYGGDRGSGSGYGGDRSGGYGGDRSGGGYGGDRSG 378

 Score = 141 (54.7 bits), Expect = 1.6e-06, P = 1.6e-06
 Identities = 50/137 (36%), Positives = 55/137 (40%)

Query:   338 PSYDPAKGPGYDPTKG-PGYDAQKGSN--YDAQR-GPNY--DIHRGPSYDPQR-GLGYDM 390
             PS    +G GY   +G  G   + G    Y   R G  Y  D   G  Y   R G GY  
Sbjct:   192 PSGGDFRGRGYGGERGYRGRGGRGGDRGGYGGDRSGGGYGGDRSGGGGYGGDRSGGGYGG 251

Query:   391 QR-GPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRA 449
              R G  Y   RG GY   R  GY   RG  Y   R+  Y   RG GY   RG GY   R 
Sbjct:   252 DRSGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRSGGYGGDRGGGYGGDRGGGYGGDRG 311

Query:   450 PSYDPSRGTGFDGAPRG 466
               Y   RG G+ G  RG
Sbjct:   312 GGYGGDRGGGY-GGDRG 327


>UNIPROTKB|P11414
            symbol:POLR2A "DNA-directed RNA polymerase II subunit RPB1"
            species:10029 "Cricetulus griseus" [GO:0005634 "nucleus"
            evidence=ISS] [GO:0005665 "DNA-directed RNA polymerase II, core
            complex" evidence=ISS] [GO:0006366 "transcription from RNA
            polymerase II promoter" evidence=ISS] [GO:0006468 "protein
            phosphorylation" evidence=ISS] [GO:0004672 "protein kinase
            activity" evidence=ISS] InterPro:IPR000684 Pfam:PF05001
            PROSITE:PS00115 GO:GO:0003677 GO:GO:0006468 GO:GO:0006366
            GO:GO:0003899 GO:GO:0005665 EMBL:M19538 PIR:A27677
            ProteinModelPortal:P11414 Uniprot:P11414
        Length = 467

 Score = 173 (66.0 bits), Expect = 6.0e-10, P = 6.0e-10
 Identities = 69/236 (29%), Positives = 93/236 (39%)

Query:   228 NAPNVDRRAADGSYGGATG---NSENETSGRPVGQN-AYEDGYGVPQGHGP--PPSATTA 281
             N P +      G   GA G   ++ ++ SG   G + A+    G P   GP  P   +  
Sbjct:    24 NIPGLGAAGRSGMTPGAAGFSPSAASDASGFSPGYSPAWSPTPGSPGSPGPSSPYIPSPG 83

Query:   282 GVVGAGPNTSTSAYAATQSG--TPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPS 339
             G +    + ++ AY     G  TP   +Y  P  P Y  +  P Y  + +P+Y PT  PS
Sbjct:    84 GAMSPSYSPTSPAYEPRSPGGYTPQSPSYS-PTSPSYSPTS-PSYSPT-SPNYSPTS-PS 139

Query:   340 YDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ 399
             Y P   P Y PT  P Y     S Y +   P+Y     PSY P     Y     P+Y   
Sbjct:   140 YSPTS-PSYSPTS-PSYSPTSPS-Y-SPTSPSYS-PTSPSYSPTSP-SYS-PTSPSYS-P 191

Query:   400 RGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPS 455
               P Y     P Y     P Y +  +PSY P   P Y       Y    +PSY P+
Sbjct:   192 TSPSYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPT 241

 Score = 132 (51.5 bits), Expect = 2.0e-05, P = 2.0e-05
 Identities = 62/205 (30%), Positives = 79/205 (38%)

Query:   296 AATQSG-TPMRAAYDIPRGPGYEASKGPGYDA--SKAPSYDPTKGPS--YDPAKG----P 346
             AA +SG TP  A +  P      +   PGY    S  P    + GPS  Y P+ G    P
Sbjct:    30 AAGRSGMTPGAAGFS-PSAASDASGFSPGYSPAWSPTPGSPGSPGPSSPYIPSPGGAMSP 88

Query:   347 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 406
              Y PT  P Y+ +    Y  Q  P+Y     PSY P     Y     PNY     P Y  
Sbjct:    89 SYSPTS-PAYEPRSPGGYTPQ-SPSYS-PTSPSYSPTSP-SYS-PTSPNYS-PTSPSYSP 142

Query:   407 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 466
                P Y     P Y +  +PSY P   P Y       Y    +PSY P+  +    +P  
Sbjct:   143 TS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPTSPSYSPTSP-S 195

Query:   467 AAPHGQVPPPLNNVPYGSATPPARS 491
              +P      P +  P  S T P+ S
Sbjct:   196 YSPTSPSYSPTS--PSYSPTSPSYS 218

 Score = 131 (51.2 bits), Expect = 2.6e-05, P = 2.6e-05
 Identities = 61/215 (28%), Positives = 80/215 (37%)

Query:   277 SATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTK 336
             S  T G  G  P+ ++ A   +   +P  A    P  PG      PG  +   PS     
Sbjct:    34 SGMTPGAAGFSPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGAM 86

Query:   337 GPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNY 396
              PSY P   P Y+P    GY  Q  S Y +   P+Y     PSY P     Y     P+Y
Sbjct:    87 SPSYSPTS-PAYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPSY 140

Query:   397 DMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSR 456
                  P Y     P Y     P Y +  +PSY P   P Y       Y    +PSY P+ 
Sbjct:   141 S-PTSPSYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPTS 193

Query:   457 GTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARS 491
              +    +P   +P      P +  P  S T P+ S
Sbjct:   194 PSYSPTSP-SYSPTSPSYSPTS--PSYSPTSPSYS 225

 Score = 121 (47.7 bits), Expect = 0.00032, P = 0.00032
 Identities = 63/225 (28%), Positives = 80/225 (35%)

Query:   274 PPPSATTAGVVGAGPNTSTSAYAATQSG-TPMRAAYDIPRGPGYEASKGPGYDAS----- 327
             P  S T+       PN     Y  T    +P   +Y  P  P Y  +  P Y  S     
Sbjct:   257 PSYSPTSPSYSPTSPN-----YTPTSPNYSPTSPSYS-PTSPSYSPTS-PSYSPSSPRYT 309

Query:   328 -KAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL 386
              ++P+Y P+  PSY P+  P Y PT  P Y     S Y     P Y     P Y P    
Sbjct:   310 PQSPTYTPSS-PSYSPSS-PSYSPTS-PKYTPTSPS-YSPS-SPEYT-PTSPKYSPTSPK 363

Query:   387 GYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDM 446
              Y     P Y     P Y +   P Y     P Y +  +P Y P   P Y       Y  
Sbjct:   364 -YS-PTSPKYS-PTSPTY-SPTTPKYS-PTSPTY-SPTSPVYTPT-SPKYS-PTSPTYSP 415

Query:   447 RRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARS 491
               +P Y P+  T    +P+G+      P      P  S T PA S
Sbjct:   416 T-SPKYSPTSPTYSPTSPKGSTYSPTSPGYSPTSPTYSLTSPAIS 459


>SGD|S000002299
            symbol:RPO21 "RNA polymerase II largest subunit B220"
            species:4932 "Saccharomyces cerevisiae" [GO:0016740 "transferase
            activity" evidence=IEA] [GO:0016779 "nucleotidyltransferase
            activity" evidence=IEA] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0006366 "transcription from RNA polymerase II
            promoter" evidence=IEA;IMP] [GO:0003899 "DNA-directed RNA
            polymerase activity" evidence=IEA;IDA] [GO:0005739 "mitochondrion"
            evidence=IDA] [GO:0005634 "nucleus" evidence=IEA;IDA] [GO:0005665
            "DNA-directed RNA polymerase II, core complex" evidence=IEA;IDA]
            [GO:0003677 "DNA binding" evidence=IEA] [GO:0003968 "RNA-directed
            RNA polymerase activity" evidence=IDA] [GO:0006351 "transcription,
            DNA-dependent" evidence=IEA] InterPro:IPR000684 InterPro:IPR000722
            InterPro:IPR006592 InterPro:IPR007066 InterPro:IPR007073
            InterPro:IPR007075 InterPro:IPR007080 InterPro:IPR007081
            InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983 Pfam:PF04990
            Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000 Pfam:PF05001
            PROSITE:PS00115 SMART:SM00663 SGD:S000002299 GO:GO:0005739
            GO:GO:0046872 GO:GO:0003677 EMBL:BK006938 GO:GO:0006366
            Gene3D:2.40.40.20 InterPro:IPR009010 EMBL:X96876 EMBL:U27182
            GO:GO:0003899 PDB:4GWQ PDBsum:4GWQ PDB:2LO6 PDBsum:2LO6
            eggNOG:COG0086 GO:GO:0005665 PDB:1I3Q PDB:1I50 PDB:1I6H PDB:1K83
            PDB:1NIK PDB:1NT9 PDB:1PQV PDB:1R5U PDB:1R9S PDB:1R9T PDB:1SFO
            PDB:1TWA PDB:1TWC PDB:1TWF PDB:1TWG PDB:1TWH PDB:1WCM PDB:1Y1V
            PDB:1Y1W PDB:1Y1Y PDB:1Y77 PDB:2B63 PDB:2B8K PDB:2E2H PDB:2E2I
            PDB:2E2J PDB:2JA5 PDB:2JA6 PDB:2JA7 PDB:2JA8 PDB:2NVQ PDB:2NVT
            PDB:2NVX PDB:2NVY PDB:2NVZ PDB:2R7Z PDB:2R92 PDB:2R93 PDB:2VUM
            PDB:2YU9 PDB:3CQZ PDB:3FKI PDB:3GTG PDB:3GTJ PDB:3GTK PDB:3GTL
            PDB:3GTM PDB:3GTO PDB:3GTP PDB:3GTQ PDB:3H3V PDB:3HOU PDB:3HOV
            PDB:3HOW PDB:3HOX PDB:3HOY PDB:3HOZ PDB:3I4M PDB:3I4N PDB:3K1F
            PDB:3K7A PDB:3M3Y PDB:3M4O PDB:3PO2 PDB:3PO3 PDB:3QT1 PDB:3RZD
            PDB:3RZO PDB:3S14 PDB:3S15 PDB:3S16 PDB:3S17 PDB:3S1M PDB:3S1N
            PDB:3S1Q PDB:3S1R PDB:3S2D PDB:3S2H PDB:4A3B PDB:4A3C PDB:4A3D
            PDB:4A3E PDB:4A3F PDB:4A3G PDB:4A3I PDB:4A3J PDB:4A3K PDB:4A3L
            PDB:4A3M PDB:4A93 PDB:4BBR PDB:4BBS PDBsum:1I3Q PDBsum:1I50
            PDBsum:1I6H PDBsum:1K83 PDBsum:1NIK PDBsum:1NT9 PDBsum:1PQV
            PDBsum:1R5U PDBsum:1R9S PDBsum:1R9T PDBsum:1SFO PDBsum:1TWA
            PDBsum:1TWC PDBsum:1TWF PDBsum:1TWG PDBsum:1TWH PDBsum:1WCM
            PDBsum:1Y1V PDBsum:1Y1W PDBsum:1Y1Y PDBsum:1Y77 PDBsum:2B63
            PDBsum:2B8K PDBsum:2E2H PDBsum:2E2I PDBsum:2E2J PDBsum:2JA5
            PDBsum:2JA6 PDBsum:2JA7 PDBsum:2JA8 PDBsum:2NVQ PDBsum:2NVT
            PDBsum:2NVX PDBsum:2NVY PDBsum:2NVZ PDBsum:2R7Z PDBsum:2R92
            PDBsum:2R93 PDBsum:2VUM PDBsum:2YU9 PDBsum:3CQZ PDBsum:3FKI
            PDBsum:3GTG PDBsum:3GTJ PDBsum:3GTK PDBsum:3GTL PDBsum:3GTM
            PDBsum:3GTO PDBsum:3GTP PDBsum:3GTQ PDBsum:3H3V PDBsum:3HOU
            PDBsum:3HOV PDBsum:3HOW PDBsum:3HOX PDBsum:3HOY PDBsum:3HOZ
            PDBsum:3I4M PDBsum:3I4N PDBsum:3K1F PDBsum:3K7A PDBsum:3M3Y
            PDBsum:3M4O PDBsum:3PO2 PDBsum:3PO3 PDBsum:3QT1 PDBsum:3RZD
            PDBsum:3RZO PDBsum:3S14 PDBsum:3S15 PDBsum:3S16 PDBsum:3S17
            PDBsum:3S1M PDBsum:3S1N PDBsum:3S1Q PDBsum:3S1R PDBsum:3S2D
            PDBsum:3S2H PDBsum:4A3B PDBsum:4A3C PDBsum:4A3D PDBsum:4A3E
            PDBsum:4A3F PDBsum:4A3G PDBsum:4A3I PDBsum:4A3J PDBsum:4A3K
            PDBsum:4A3L PDBsum:4A3M PDBsum:4A93 PDBsum:4BBR PDBsum:4BBS
            HOGENOM:HOG000222975 OMA:KVLPWST KO:K03006 OrthoDB:EOG4J14H5
            EMBL:X03128 EMBL:Z74188 PIR:S67686 RefSeq:NP_010141.1 PDB:2L0I
            PDBsum:2L0I ProteinModelPortal:P04050 SMR:P04050 DIP:DIP-611N
            IntAct:P04050 MINT:MINT-432838 STRING:P04050 PaxDb:P04050
            PeptideAtlas:P04050 EnsemblFungi:YDL140C GeneID:851415
            KEGG:sce:YDL140C CYGD:YDL140c GeneTree:ENSGT00700000105212
            EvolutionaryTrace:P04050 NextBio:968606 ArrayExpress:P04050
            Genevestigator:P04050 GermOnline:YDL140C Uniprot:P04050
        Length = 1733

 Score = 191 (72.3 bits), Expect = 7.6e-10, Sum P(2) = 7.6e-10
 Identities = 79/251 (31%), Positives = 104/251 (41%)

Query:   222 LRAELMNAPNVDRRAADGSYGGAT--GNSENETSGRPVGQNAYED-----GYGVPQGHGP 274
             ++ ELM +P VD  + D   GG T  G ++   +  P G  AY +     G+GV      
Sbjct:  1486 VKDELMFSPLVDSGSNDAMAGGFTAYGGADYGEATSPFG--AYGEAPTSPGFGVSSPGFS 1543

Query:   275 PPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDP 334
             P S T +    A   TS S    + S +P   +Y  P  P Y  +  P Y  + +PSY P
Sbjct:  1544 PTSPTYSPTSPAYSPTSPSYSPTSPSYSPTSPSYS-PTSPSYSPTS-PSYSPT-SPSYSP 1600

Query:   335 TKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGP 394
             T  PSY P   P Y PT  P Y     S Y +   P+Y     PSY P     Y     P
Sbjct:  1601 TS-PSYSPTS-PSYSPTS-PSYSPTSPS-Y-SPTSPSYS-PTSPSYSPTSP-SYS-PTSP 1652

Query:   395 NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDP 454
             +Y     P Y     P Y     P Y +  +PSY P   P Y       Y    +PSY P
Sbjct:  1653 SYS-PTSPAYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPNYSPT-SPSYSP 1705

Query:   455 SRGTGFD-GAP 464
             +   G+  G+P
Sbjct:  1706 T-SPGYSPGSP 1715

 Score = 38 (18.4 bits), Expect = 7.6e-10, Sum P(2) = 7.6e-10
 Identities = 12/39 (30%), Positives = 16/39 (41%)

Query:    52 EQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHEL 90
             E  + + H+E Q L T     AA     R +L    H L
Sbjct:   870 EDGMDAAHIEKQSLDTIGGSDAAFEKRYRVDLLNTDHTL 908


>UNIPROTKB|F1PB61
            symbol:TAF15 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0005622 "intracellular" evidence=IEA] [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0000166 "nucleotide binding"
            evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358
            PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360 SMART:SM00547
            GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
            GO:GO:0005622 GeneTree:ENSGT00530000063105 CTD:8148 KO:K14651
            OMA:YGNQGSQ EMBL:AAEX03006620 EMBL:AAEX03006619 RefSeq:XP_548255.2
            ProteinModelPortal:F1PB61 Ensembl:ENSCAFT00000028877 GeneID:491135
            KEGG:cfa:491135 Uniprot:F1PB61
        Length = 571

 Score = 155 (59.6 bits), Expect = 1.2e-09, Sum P(2) = 1.2e-09
 Identities = 69/221 (31%), Positives = 78/221 (35%)

Query:   258 GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYE 317
             G+  Y  G G  QG G  P +     V   P+     +A   S             P   
Sbjct:   335 GRGGYR-GRGGFQGRGGDPKS--GDWVCPNPSCGNMNFARRNSCNQCNEPRPEDSRPSGG 391

Query:   318 ASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGY--DAQKGSNYDAQR-GPNYDI 374
               +G GY   +   Y    G   D   G G D + G GY  D   G  Y   R G  Y  
Sbjct:   392 DFRGRGYGGERG--YRGRGGRGGDRG-GYGADRSSG-GYGGDRSGGGGYGGDRSGGGYGG 447

Query:   375 HR-GPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG 433
              R G  Y   RG GY   RG  Y   RG GY   R  GY   RG  Y   R   Y   RG
Sbjct:   448 DRSGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRG 507

Query:   434 PGYDLQR-GQGY--DMRRAPSYDPSRGTGFDGAPRGAAPHG 471
              GY   R G GY  D      Y   RG G+ G  R    +G
Sbjct:   508 -GYGGDRSGGGYGGDRGGGGGYGGDRGGGY-GGDRSGGGYG 546

 Score = 155 (59.6 bits), Expect = 8.2e-08, P = 8.2e-08
 Identities = 68/233 (29%), Positives = 84/233 (36%)

Query:   242 GGATGNSENETSGRPVGQNAYEDGYGVPQ-GHGPPPSATTAGVVGAGPNTSTSAYAAT-Q 299
             GG +G       G   G+  ++   G P+ G    P+ +   +  A  N+         +
Sbjct:   326 GGGSGGGRRGRGGYR-GRGGFQGRGGDPKSGDWVCPNPSCGNMNFARRNSCNQCNEPRPE 384

Query:   300 SGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGY--DPTKGPGYD 357
                P    +   RG GY   +G  Y        D   G   D + G GY  D + G GY 
Sbjct:   385 DSRPSGGDF---RGRGYGGERG--YRGRGGRGGD-RGGYGADRSSG-GYGGDRSGGGGYG 437

Query:   358 AQK-GSNYDAQR-GPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQ 415
               + G  Y   R G  Y   RG  Y   RG GY   RG  Y   RG GY   R  GY   
Sbjct:   438 GDRSGGGYGGDRSGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGD 497

Query:   416 RGPVYEAQRAPSYIPQRGPGYDLQRGQG--YDMRRAPSYDPSRGTGFDGAPRG 466
             RG  Y   R      + G GY   RG G  Y   R   Y   R  G  G  RG
Sbjct:   498 RGGGYGGDRGGYGGDRSGGGYGGDRGGGGGYGGDRGGGYGGDRSGGGYGGDRG 550

 Score = 145 (56.1 bits), Expect = 1.6e-08, Sum P(2) = 1.6e-08
 Identities = 52/152 (34%), Positives = 61/152 (40%)

Query:   305 RAAYDIPR---GPGYEASKGPGYDASKAPS-YDPTK-GPSYDPAKGPGYDPTKGPGYDAQ 359
             R  Y   R   G G + S G GY   ++   Y   + G  Y   +G GY   +G GY   
Sbjct:   414 RGGYGADRSSGGYGGDRSGGGGYGGDRSGGGYGGDRSGGGYGGDRGGGYGGDRGGGYGGD 473

Query:   360 KGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQR-GPGYETQRVPG--YDVQR 416
             +G  Y   RG  Y   RG  Y   RG GY   RG  Y   R G GY   R  G  Y   R
Sbjct:   474 RGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRG-GYGGDRSGGGYGGDRGGGGGYGGDR 532

Query:   417 GPVYEAQRAPS-YIPQRGPGYDLQRGQGYDMR 447
             G  Y   R+   Y   RG GY  + G   D R
Sbjct:   533 GGGYGGDRSGGGYGGDRG-GYGGKMGGRNDYR 563

 Score = 119 (46.9 bits), Expect = 0.00072, P = 0.00072
 Identities = 47/167 (28%), Positives = 61/167 (36%)

Query:   235 RAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSA 294
             R   G  G   G   + +SG   G  +   GYG  +  G      + G  G G +     
Sbjct:   405 RGRGGRGGDRGGYGADRSSGGYGGDRSGGGGYGGDRSGGGYGGDRSGG--GYGGDRG-GG 461

Query:   295 YAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK-G 353
             Y   + G      Y   RG GY   +G GY   +   Y   +G  Y   +G GY   + G
Sbjct:   462 YGGDRGG-----GYGGDRGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRG-GYGGDRSG 515

Query:   354 PGYDAQKGSN--YDAQRGPNYDIHR-GPSYDPQRGLGYDMQRGPNYD 397
              GY   +G    Y   RG  Y   R G  Y   RG GY  + G   D
Sbjct:   516 GGYGGDRGGGGGYGGDRGGGYGGDRSGGGYGGDRG-GYGGKMGGRND 561

 Score = 62 (26.9 bits), Expect = 1.2e-09, Sum P(2) = 1.2e-09
 Identities = 27/107 (25%), Positives = 46/107 (42%)

Query:   184 YHHCRGTYEYEKKF------YNDHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRRAA 237
             Y   +G+Y+ +  +      YN + +S      NY +  T+ +  R ++      D R  
Sbjct:   121 YDQHQGSYDEQSNYGPQHDSYNQNQQSYHSQRDNY-SHHTQDD--RRDVSRYGE-DNRGY 176

Query:   238 DGSYGGATGNSENETSGR-PV-GQNAYEDG----YGVPQGHGPPPSA 278
              GS GG  G    +  GR P+ G +  + G    +G  + +GP P A
Sbjct:   177 GGSQGGGRGRGGYDKDGRGPMTGSSGGDRGGFKNFGGHRDYGPRPDA 223


>FB|FBgn0028573
            symbol:prc "pericardin" species:7227 "Drosophila
            melanogaster" [GO:0005605 "basal lamina" evidence=NAS] [GO:0007507
            "heart development" evidence=IMP;TAS] [GO:0005578 "proteinaceous
            extracellular matrix" evidence=IDA] [GO:0035088 "establishment or
            maintenance of apical/basal cell polarity" evidence=TAS]
            [GO:0016477 "cell migration" evidence=TAS] [GO:0002009
            "morphogenesis of an epithelium" evidence=TAS] GO:GO:0002009
            GO:GO:0007507 GO:GO:0005578 FlyBase:FBgn0028573 InterPro:IPR009765
            Pfam:PF07054 EMBL:AF203342 STRING:Q9U617 PRIDE:Q9U617
            InParanoid:Q9U617 ArrayExpress:Q9U617 Bgee:Q9U617 Uniprot:Q9U617
        Length = 1729

 Score = 173 (66.0 bits), Expect = 3.8e-09, P = 3.8e-09
 Identities = 87/276 (31%), Positives = 100/276 (36%)

Query:   236 AADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPS--ATTAGVVGAGPNTSTS 293
             A    YGG  G S     G+P G        G+P G+G  P   A TA V G      T 
Sbjct:   871 AGQSGYGGQPGISGQTGGGQP-GYGGQATISGLP-GYGTQPGIGALTA-VPGGHYGYETQ 927

Query:   294 AYAATQSGTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDPTK 352
                  Q+GT        P G G +   G PGY     P      G S    + PGY    
Sbjct:   928 PGIGGQTGTNQPGFGGQP-GIGGQTGAGQPGYGFIGQPGIGGQTGTS---GRQPGYGTQP 983

Query:   353 GPGYDAQKGS-NYDAQRGPNYDIHRG-PSYDPQRGLGYDMQRG-PNYDMQRGPGYETQR- 408
             G G     G   Y +Q G       G P Y  Q G+G  +  G P Y  Q G G +T   
Sbjct:   984 GIGGQTAAGQPGYGSQPGIGGQTGAGQPGYGSQTGVGGQIGAGQPGYGSQPGIGGQTGAG 1043

Query:   409 VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAA 468
              PGY  Q G  +  Q  P Y  Q  PG   Q G G      P Y    G G  G      
Sbjct:  1044 QPGYGAQPG--FGGQ--PGYGNQ--PGVGGQTGAGQ-----PGYGSQPGVG--GQTGAGQ 1090

Query:   469 PHGQVPPPLNNVP-YGSATPPARSG-SGQPR-GGNP 501
             P   V P     P  G  T   + G  GQP  GG+P
Sbjct:  1091 PGYGVIPGFGGQPGIGGQTAAGKPGYGGQPGIGGSP 1126

 Score = 173 (66.0 bits), Expect = 3.8e-09, P = 3.8e-09
 Identities = 82/279 (29%), Positives = 99/279 (35%)

Query:   236 AADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGV---VGAG-PNTS 291
             A    YG   G      +G+P G    + G G   G G P   T  G+    GAG P   
Sbjct:   412 AGQPGYGTQPGIGGQTGAGQP-GYGT-QPGIGAQTGAGQPGYGTQPGIGGQTGAGQPGYG 469

Query:   292 TSAYAATQSGTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKG-PSYDPAKGPGYD 349
             T      Q+G   +  Y    G G +   G PGY +          G P Y    G G  
Sbjct:   470 TQPGIGVQTGAG-QPGYGSQPGIGAQTGAGQPGYGSQPGIGGQTGAGQPGYGSQPGIGGQ 528

Query:   350 PTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRGPGYETQ 407
                G PGY  Q G    AQ G        P Y  Q G+G     G P Y  Q G G +T 
Sbjct:   529 TGAGQPGYGTQPGIG--AQTGAGQ-----PGYGSQPGIGGQTGAGQPGYGSQPGIGGQTG 581

Query:   408 R-VPGYDVQRGPVYEAQRA-PSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGT-GFDGA 463
                PGY  Q G   +     P Y  Q G G  +  GQ GY  +         G  G+   
Sbjct:   582 AGQPGYGTQPGVGAQTGTGQPGYGSQPGVGTQIGAGQPGYGSQPGIGGQTGAGQPGYGSQ 641

Query:   464 PRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPR-GGNP 501
             P      G   P     P G     A++G+GQP  G  P
Sbjct:   642 PGIGGQTGAAQPGYGTQP-GVG---AQTGTGQPGYGAQP 676

 Score = 165 (63.1 bits), Expect = 2.8e-08, P = 2.8e-08
 Identities = 83/284 (29%), Positives = 103/284 (36%)

Query:   230 PNVDRRAADGS--YGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGV---V 284
             P +  + A G   YG   G      +G+P G  A + G G   G G P   +  G+    
Sbjct:   149 PGIGGQTATGQPGYGSQLGVGAQAGAGQP-GYGA-QPGVGAQTGAGQPGYGSQTGIGGQT 206

Query:   285 GAG-PNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKG-PSYD 341
             GAG P   +      Q+G   +  Y    G G +   G PGY +          G P Y 
Sbjct:   207 GAGQPGYGSQPGIGGQTGAG-QPGYGSQPGIGGQTGAGQPGYGSQPGIGGQTGAGQPGYG 265

Query:   342 PAKGPGYDPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQ 399
                G G     G PGY +Q G     Q G        P Y  Q G+G     G P Y  Q
Sbjct:   266 SQPGIGGQTGAGQPGYGSQPGIG--GQTGAGQ-----PGYGTQPGIGGQTGAGQPGYGSQ 318

Query:   400 RGPGYETQR-VPGYDVQRGPVYEAQRA-PSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSR 456
              G G +T    PGY  Q G   +     P Y  Q G G     GQ GY  +  P      
Sbjct:   319 PGIGGQTGAGQPGYGSQPGIGGQTGAGQPGYGTQPGIGGQTGAGQPGYGTQ--PGIGGQT 376

Query:   457 GTGFDGAPRGAAPHGQVPPPLNNVPYGSATP-PARSGSGQPRGG 499
             G G  G        GQ  P      YGS      ++G+GQP  G
Sbjct:   377 GPGQPGYGTQPGIGGQTGP--GQPGYGSQPGIGGQTGAGQPGYG 418

 Score = 165 (63.1 bits), Expect = 2.8e-08, P = 2.8e-08
 Identities = 83/285 (29%), Positives = 102/285 (35%)

Query:   230 PNVDRRAADGS--YGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGV---V 284
             P +  +   G   YG   G       G+P G  + + G G   G G P   T  G+    
Sbjct:   370 PGIGGQTGPGQPGYGTQPGIGGQTGPGQP-GYGS-QPGIGGQTGAGQPGYGTQPGIGGQT 427

Query:   285 GAG-PNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKG-PSYD 341
             GAG P   T      Q+G   +  Y    G G +   G PGY            G P Y 
Sbjct:   428 GAGQPGYGTQPGIGAQTGAG-QPGYGTQPGIGGQTGAGQPGYGTQPGIGVQTGAGQPGYG 486

Query:   342 PAKGPGYDPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQ 399
                G G     G PGY +Q G     Q G        P Y  Q G+G     G P Y  Q
Sbjct:   487 SQPGIGAQTGAGQPGYGSQPGIG--GQTGAGQ-----PGYGSQPGIGGQTGAGQPGYGTQ 539

Query:   400 RGPGYETQR-VPGYDVQRGPVYEAQRA-PSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSR 456
              G G +T    PGY  Q G   +     P Y  Q G G     GQ GY  +  P      
Sbjct:   540 PGIGAQTGAGQPGYGSQPGIGGQTGAGQPGYGSQPGIGGQTGAGQPGYGTQ--PGVGAQT 597

Query:   457 GTGFDGAPRGAAPHGQVPPPLNNVPYGSATP-PARSGSGQPRGGN 500
             GTG  G   G+ P            YGS      ++G+GQP  G+
Sbjct:   598 GTGQPGY--GSQPGVGTQIGAGQPGYGSQPGIGGQTGAGQPGYGS 640

 Score = 158 (60.7 bits), Expect = 1.6e-07, P = 1.6e-07
 Identities = 85/283 (30%), Positives = 101/283 (35%)

Query:   236 AADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGV---VGAG-PNTS 291
             A    YG   G      +G+P G    + G GV  G G P   +  G+    GAG P   
Sbjct:   446 AGQPGYGTQPGIGGQTGAGQP-GYGT-QPGIGVQTGAGQPGYGSQPGIGAQTGAGQPGYG 503

Query:   292 TSAYAATQSGTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKG-PSYDPAKGPGYD 349
             +      Q+G   +  Y    G G +   G PGY            G P Y    G G  
Sbjct:   504 SQPGIGGQTGAG-QPGYGSQPGIGGQTGAGQPGYGTQPGIGAQTGAGQPGYGSQPGIGGQ 562

Query:   350 PTKG-PGYDAQKGSNYDAQRG-PNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRG-PGYE 405
                G PGY +Q G       G P Y    G       G  GY  Q G    +  G PGY 
Sbjct:   563 TGAGQPGYGSQPGIGGQTGAGQPGYGTQPGVGAQTGTGQPGYGSQPGVGTQIGAGQPGYG 622

Query:   406 TQRVPGYDVQRG---PVYEAQRAPSYIPQRG---PGYDLQRGQGYDMRRA-PSYDPSRGT 458
             +Q  PG   Q G   P Y +Q  P    Q G   PGY  Q G G       P Y    G 
Sbjct:   623 SQ--PGIGGQTGAGQPGYGSQ--PGIGGQTGAAQPGYGTQPGVGAQTGTGQPGYGAQPGI 678

Query:   459 GFD-GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSG-QPRGG 499
             G   GA  G   +G+ P        G  T P + G G QP  G
Sbjct:   679 GGQTGA--GQPGYGRQPG------IGGQTGPGQPGYGTQPGVG 713

 Score = 154 (59.3 bits), Expect = 4.5e-07, P = 4.5e-07
 Identities = 78/247 (31%), Positives = 90/247 (36%)

Query:   242 GGATGNSENETS-G-RPV--GQNAY-EDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYA 296
             GG TG S  +   G +P   GQ A  + GYG   G G     T AG  G G  T      
Sbjct:   967 GGQTGTSGRQPGYGTQPGIGGQTAAGQPGYGSQPGIG---GQTGAGQPGYGSQTGVGGQI 1023

Query:   297 ATQSGTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPG 355
                +G P    Y    G G +   G PGY A   P +    G    P  G G      PG
Sbjct:  1024 G--AGQP---GYGSQPGIGGQTGAGQPGYGAQ--PGFGGQPGYGNQPGVG-GQTGAGQPG 1075

Query:   356 YDAQKGSNYDAQRG-PNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRG----PGYETQRV 409
             Y +Q G       G P Y +   P +  Q G+G     G P Y  Q G    P Y TQ+ 
Sbjct:  1076 YGSQPGVGGQTGAGQPGYGVI--PGFGGQPGIGGQTAAGKPGYGGQPGIGGSPVYGTQQG 1133

Query:   410 PG--YDVQRG-PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRA-PSYDPSRGTGFDGAP- 464
              G    +  G P Y  Q  P       PGY    G G       P Y P    G  GAP 
Sbjct:  1134 TGGPSGISGGQPGYGTQ--PGQTGAGQPGYGSLPGTGGQATAGQPGYGPGSQPGIGGAPV 1191

Query:   465 RGAAPHG 471
              G  P G
Sbjct:  1192 YGTQPGG 1198

 Score = 150 (57.9 bits), Expect = 1.2e-06, P = 1.2e-06
 Identities = 85/304 (27%), Positives = 114/304 (37%)

Query:   209 EKNYITMATEVEKLRAELMNAPNVDRRAADGSYGGATGNSENE-TSGRPVGQNAYEDGYG 267
             +  Y+      E  +AE     N  R    G++G   G ++++ +SG   G + Y+  Y 
Sbjct:    50 QSQYLNFGKPEEDGKAEAEATENGSRSTVSGTHG--MGQAQSQFSSGDCNGCSGYQTDY- 106

Query:   268 VPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKG-PGYDA 326
              P   G    A+ +G +G  P++  S       G    A       PGY +  G  G  A
Sbjct:   107 -PSS-GRILDASGSGGIGR-PDSIISLPGGV--GGQTGAGQ-----PGYGSQPGIGGQTA 156

Query:   327 SKAPSYDPTKGPSYDPAKG-PGYDPTKGPGYDAQKGSN---YDAQRGPNYDIHRG-PSYD 381
             +  P Y    G       G PGY     PG  AQ G+    Y +Q G       G P Y 
Sbjct:   157 TGQPGYGSQLGVGAQAGAGQPGYGAQ--PGVGAQTGAGQPGYGSQTGIGGQTGAGQPGYG 214

Query:   382 PQRGLGYDMQRG-PNYDMQRGPGYETQR-VPGYDVQRGPVYEAQRA-PSYIPQRGPGYDL 438
              Q G+G     G P Y  Q G G +T    PGY  Q G   +     P Y  Q G G   
Sbjct:   215 SQPGIGGQTGAGQPGYGSQPGIGGQTGAGQPGYGSQPGIGGQTGAGQPGYGSQPGIGGQT 274

Query:   439 QRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATP-PARSGSGQP 496
               GQ GY  +  P      G G  G        GQ         YGS      ++G+GQP
Sbjct:   275 GAGQPGYGSQ--PGIGGQTGAGQPGYGTQPGIGGQTGA--GQPGYGSQPGIGGQTGAGQP 330

Query:   497 RGGN 500
               G+
Sbjct:   331 GYGS 334

 Score = 148 (57.2 bits), Expect = 2.0e-06, P = 2.0e-06
 Identities = 84/278 (30%), Positives = 98/278 (35%)

Query:   235 RAADGSYGGATGNSENETS--GRPVGQN-AYEDGYGVPQGHGPPPSATTAGVVGAGPNTS 291
             R  D S  G  G  ++  S  G   GQ  A + GYG   G G     T  G  G G    
Sbjct:   111 RILDASGSGGIGRPDSIISLPGGVGGQTGAGQPGYGSQPGIG---GQTATGQPGYGSQLG 167

Query:   292 TSAYAATQSGTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKG-PSYDPAKGPGYD 349
               A A   +G P    Y    G G +   G PGY +          G P Y    G G  
Sbjct:   168 VGAQAG--AGQP---GYGAQPGVGAQTGAGQPGYGSQTGIGGQTGAGQPGYGSQPGIGGQ 222

Query:   350 PTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRGPGYETQ 407
                G PGY +Q G     Q G        P Y  Q G+G     G P Y  Q G G +T 
Sbjct:   223 TGAGQPGYGSQPGIG--GQTGAGQ-----PGYGSQPGIGGQTGAGQPGYGSQPGIGGQTG 275

Query:   408 R-VPGYDVQRGPVYEAQRA-PSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAP 464
                PGY  Q G   +     P Y  Q G G     GQ GY  +  P      G G  G  
Sbjct:   276 AGQPGYGSQPGIGGQTGAGQPGYGTQPGIGGQTGAGQPGYGSQ--PGIGGQTGAGQPGYG 333

Query:   465 RGAAPHGQVPPPLNNVPYGSATPPA---RSGSGQPRGG 499
                   GQ         YG  T P    ++G+GQP  G
Sbjct:   334 SQPGIGGQTGA--GQPGYG--TQPGIGGQTGAGQPGYG 367

 Score = 144 (55.7 bits), Expect = 5.4e-06, P = 5.4e-06
 Identities = 85/297 (28%), Positives = 102/297 (34%)

Query:   230 PNVDRRAADGS--YGGATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGA 286
             P +  +   G   YGG    S     G   G  A      VP GH G        G  G 
Sbjct:   880 PGISGQTGGGQPGYGGQATISGLPGYGTQPGIGALT---AVPGGHYGYETQPGIGGQTGT 936

Query:   287 G-PNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKG 345
               P          Q+G   +  Y     PG     G    + + P Y    G     A G
Sbjct:   937 NQPGFGGQPGIGGQTGAG-QPGYGFIGQPGIGGQTGT---SGRQPGYGTQPGIGGQTAAG 992

Query:   346 -PGYDPTKGPGYDAQKGS-NYDAQRGPNYDIHRG-PSYDPQRGLGYDMQRG-PNYDMQRG 401
              PGY    G G     G   Y +Q G    I  G P Y  Q G+G     G P Y  Q G
Sbjct:   993 QPGYGSQPGIGGQTGAGQPGYGSQTGVGGQIGAGQPGYGSQPGIGGQTGAGQPGYGAQPG 1052

Query:   402 ----PGYETQRVPGYDVQRG---PVYEAQRAPSYIPQRG---PGYDL------QRGQGYD 445
                 PGY  Q  PG   Q G   P Y +Q  P    Q G   PGY +      Q G G  
Sbjct:  1053 FGGQPGYGNQ--PGVGGQTGAGQPGYGSQ--PGVGGQTGAGQPGYGVIPGFGGQPGIGGQ 1108

Query:   446 MRRA-PSYDPSRGTGFDGAPRGAAPHGQVPPP-LNNVPYGSATPPARSGSGQPRGGN 500
                  P Y    G G  G+P      G   P  ++    G  T P ++G+GQP  G+
Sbjct:  1109 TAAGKPGYGGQPGIG--GSPVYGTQQGTGGPSGISGGQPGYGTQPGQTGAGQPGYGS 1163

 Score = 125 (49.1 bits), Expect = 0.00062, P = 0.00062
 Identities = 72/250 (28%), Positives = 89/250 (35%)

Query:   236 AADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGV---VGAG-PNTS 291
             AA   YG   G      +G+P G  A + G G   G G P      G+    G G P   
Sbjct:   650 AAQPGYGTQPGVGAQTGTGQP-GYGA-QPGIGGQTGAGQPGYGRQPGIGGQTGPGQPGYG 707

Query:   292 TSAYAATQSGTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKG-PSYDPAKGPGYD 349
             T     TQ+GT  +  Y    G G ++  G PGY +          G P Y    G G  
Sbjct:   708 TQPGVGTQTGTG-QPGYGAQPGIGGQSGAGQPGYGSQPGIGGQTGGGQPGYGSQIG-GQT 765

Query:   350 PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRG----PGY 404
                 P Y +Q G    AQ G        P Y  +  +G     G P Y  Q G    PG+
Sbjct:   766 GAGQPSYGSQPGVG--AQNGGGQ-----PGYGTRPVIGGQTGAGQPGYGGQTGVGGSPGF 818

Query:   405 ETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 464
              TQ  PG     GP+   +          PGY  Q G G   R    Y    G G D   
Sbjct:   819 LTQ--PGIGGISGPI-GGKVGGGQSEAAKPGYWAQPGIGGPSR----YGSQPGIG-DQTG 870

Query:   465 RGAAPHGQVP 474
              G + +G  P
Sbjct:   871 AGQSGYGGQP 880


>WB|WBGene00020550
            symbol:T17H7.1 species:6239 "Caenorhabditis elegans"
            [GO:0019915 "lipid storage" evidence=IMP] [GO:0009792 "embryo
            development ending in birth or egg hatching" evidence=IMP]
            GO:GO:0009792 GO:GO:0019915 InterPro:IPR003677 Pfam:PF02520
            EMBL:FO080638 PIR:T28899 RefSeq:NP_497250.1
            ProteinModelPortal:Q22537 PaxDb:Q22537 EnsemblMetazoa:T17H7.1
            GeneID:175228 KEGG:cel:CELE_T17H7.1 UCSC:T17H7.1 CTD:175228
            WormBase:T17H7.1 eggNOG:NOG271901 GeneTree:ENSGT00700000104820
            HOGENOM:HOG000020548 InParanoid:Q22537 OMA:GRGQGPD NextBio:887312
            Uniprot:Q22537
        Length = 682

 Score = 168 (64.2 bits), Expect = 4.1e-09, P = 4.1e-09
 Identities = 75/276 (27%), Positives = 101/276 (36%)

Query:   234 RRAADGSYGG-ATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTST 292
             RR   G   G   G  +N   G   G+      +G P  +    +  +      GP++  
Sbjct:   225 RRGGRGDGPGFVPGTQDNNQRGS--GERGQRQNFG-PSDNLTNGNQFSKKQFARGPSSMN 281

Query:   293 SAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKG-PGYDPT 351
             S  +     +   + +D PRGPG    +G G D          +GP + P    PG   +
Sbjct:   282 SDLSENSQHSDSNSQFDFPRGPGGRGGRGQGPDFGPGGQGGRGQGPDFGPQDDFPGRRGS 341

Query:   352 KGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRG-LGYDMQRGPNYDM--QRG---PGYE 405
              GPG    +G   D +   ++   RG     +RG  G     GP  D   +RG   PG  
Sbjct:   342 GGPGGRGGRGQGPDFEPQDDFPGRRGSGGPGRRGGRGQGPDFGPQDDFPGRRGSGGPGGR 401

Query:   406 TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDL--QRGQGYDMRRAPSYDPSRGTGFDGA 463
               R  G D   GP  +  R     P  GP  D   +RG G      P     RG   D  
Sbjct:   402 GGRGQGPDF--GPGRQGGRGQG--PDFGPQDDFSGRRGSG-----GPGGRGGRGQEPDFG 452

Query:   464 PRGAAPHGQVPP--PLNNVP--YGSATPPARSGSGQ 495
             P G    GQ P   P ++ P   GS  P  R G GQ
Sbjct:   453 PGGQGGRGQGPDFGPQDDFPGRRGSGGPEGRDGRGQ 488

 Score = 139 (54.0 bits), Expect = 6.0e-06, P = 6.0e-06
 Identities = 76/265 (28%), Positives = 93/265 (35%)

Query:   242 GGATGNSENETSGRPVGQNAYEDG--YGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQ 299
             GG  G  +    G P GQ     G  +G PQ   P    +  G  G G       +   Q
Sbjct:   304 GGRGGRGQGPDFG-PGGQGGRGQGPDFG-PQDDFPGRRGS-GGPGGRGGRGQGPDFEP-Q 359

Query:   300 SGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP-GYDPTKGPGYDA 358
                P R       GPG    +G G D      +   +G      +G  G  P  GPG   
Sbjct:   360 DDFPGRRGSG---GPGRRGGRGQGPDFGPQDDFPGRRGSGGPGGRGGRGQGPDFGPGRQG 416

Query:   359 QKGSNYDAQRGPNYDI--HRGPSYDPQRG-LGYDMQRGPNYDMQRG--PGYETQR-VPGY 412
              +G   D   GP  D    RG      RG  G +   GP     RG  P +  Q   PG 
Sbjct:   417 GRGQGPDF--GPQDDFSGRRGSGGPGGRGGRGQEPDFGPGGQGGRGQGPDFGPQDDFPGR 474

Query:   413 DVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQ 472
                 GP  E +      P  GPG    RGQ  D     ++   RG+G  G  RG  P   
Sbjct:   475 RGSGGP--EGRDGRGQGPDFGPGSQGGRGQDSDSGSQDAFPGRRGSGGPGG-RGQGPDFG 531

Query:   473 VPPPLNNVP--YGSATPPARSGSGQ 495
                P ++ P   GS  P  R G GQ
Sbjct:   532 ---PQDDFPGRRGSGGPEGRDGRGQ 553


>UNIPROTKB|P71590
            symbol:fhaA "FHA domain-containing protein FhaA"
            species:1773 "Mycobacterium tuberculosis" [GO:0005618 "cell wall"
            evidence=IDA] [GO:0005829 "cytosol" evidence=IDA]
            InterPro:IPR000253 InterPro:IPR008984 Pfam:PF00498 PROSITE:PS50006
            SMART:SM00240 GO:GO:0005829 GO:GO:0005618 GenomeReviews:AL123456_GR
            EMBL:BX842572 Gene3D:2.60.200.20 SUPFAM:SSF49879 PIR:B70700
            RefSeq:NP_214534.1 RefSeq:YP_006513334.1 PDB:2LC0 PDB:2LC1 PDB:3OUN
            PDB:3PO8 PDB:3POA PDBsum:2LC0 PDBsum:2LC1 PDBsum:3OUN PDBsum:3PO8
            PDBsum:3POA ProteinModelPortal:P71590 SMR:P71590 DIP:DIP-59047N
            PhosSite:P12071703 PRIDE:P71590 EnsemblBacteria:EBMYCT00000001781
            GeneID:13315997 GeneID:887067 KEGG:mtu:Rv0020c KEGG:mtv:RVBD_0020c
            PATRIC:18148538 TubercuList:Rv0020c HOGENOM:HOG000235804
            OMA:DQGYGQP ProtClustDB:CLSK790198 EvolutionaryTrace:P71590
            InterPro:IPR022128 Pfam:PF12401 Uniprot:P71590
        Length = 527

 Score = 162 (62.1 bits), Expect = 1.2e-08, P = 1.2e-08
 Identities = 84/244 (34%), Positives = 98/244 (40%)

Query:   275 PPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKA-PSYD 333
             P   T   V+      S  A+ A     PM        G G +      YD   A P  D
Sbjct:   127 PDVETHPPVIDCARPQSNHAFGAEPGVAPMSDNSSYRGGQG-QGRPDEYYDDRYARPQED 185

Query:   334 PTKGPSYDPAKGP--GYDPTKGPGYDAQKGSNYDAQRGPNY-DIHRGPSYDPQRGLGYDM 390
             P  GP       P  GY P  G GY  Q G  Y   R P+  D      Y P +G GY  
Sbjct:   186 PRGGPDPQGGSDPRGGYPPETG-GYPPQPG--YPRPRHPDQGDYPEQIGY-PDQG-GYPE 240

Query:   391 QRGPNYDMQRG-P---GYETQRVPGY-DVQRG---PVYEAQRAP-SYIPQRG---PGYDL 438
             QRG  Y  QRG P   GY+ Q   GY D  +G   P YE QR P S  P  G   PGYD 
Sbjct:   241 QRG--YPEQRGYPDQRGYQDQG-RGYPDQGQGGYPPPYE-QRPPVSPGPAAGYGAPGYD- 295

Query:   439 QRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNN---VPYGSATPPARSGSGQ 495
                QGY  R++  Y PS G G  G   G   +G+ P        VP G   PP +  +  
Sbjct:   296 ---QGY--RQSGGYGPSPGGGQPGYG-GYGEYGRGPARHEEGSYVPSGPPGPPEQRPAYP 349

Query:   496 PRGG 499
              +GG
Sbjct:   350 DQGG 353

 Score = 122 (48.0 bits), Expect = 0.00030, P = 0.00030
 Identities = 91/303 (30%), Positives = 111/303 (36%)

Query:   230 PNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGH---GPPPSATTAGVVGA 286
             P V   + + SY G  G       GRP     Y+D Y  PQ     GP P   +    G 
Sbjct:   151 PGVAPMSDNSSYRGGQGQ------GRP--DEYYDDRYARPQEDPRGGPDPQGGSDPRGGY 202

Query:   287 GPNTSTSAYAATQSGTPMRAAY----DIPRGPGYEASKG-P---GYDASKAPSYDPTKGP 338
              P T    Y   Q G P R  +    D P   GY    G P   GY   +   Y   +G 
Sbjct:   203 PPETG--GYPP-QPGYP-RPRHPDQGDYPEQIGYPDQGGYPEQRGYPEQRG--YPDQRG- 255

Query:   339 SYDPAKGPGYDPTKGPG-YDAQKGSNYDAQRGPNYDIHRGPSYDP---QRGLGYDMQRG- 393
              Y   +G GY P +G G Y            GP    +  P YD    Q G GY    G 
Sbjct:   256 -YQD-QGRGY-PDQGQGGYPPPYEQRPPVSPGPAAG-YGAPGYDQGYRQSG-GYGPSPGG 310

Query:   394 --PNY----DMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMR 447
               P Y    +  RGP    +   G  V  GP    ++ P+Y P +G GYD    QG    
Sbjct:   311 GQPGYGGYGEYGRGPARHEE---GSYVPSGPPGPPEQRPAY-PDQG-GYDQGYQQGATTY 365

Query:   448 RAPSYDPSRG-TGFDGAPR--GAAPHG--QVPPPLNNVPYG-SATP----PARSG-SGQP 496
                 Y      T +  +PR  G AP G     P   +  YG S  P    PA  G SG  
Sbjct:   366 GRQDYGGGADYTRYTESPRVPGYAPQGGGYAEPAGRDYDYGQSGAPDYGQPAPGGYSGYG 425

Query:   497 RGG 499
             +GG
Sbjct:   426 QGG 428


>UNIPROTKB|Q92804
            symbol:TAF15 "TATA-binding protein-associated factor 2N"
            species:9606 "Homo sapiens" [GO:0000166 "nucleotide binding"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0003677 "DNA binding" evidence=IEA] [GO:0003723 "RNA binding"
            evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005634 "nucleus" evidence=IDA] [GO:0005730 "nucleolus"
            evidence=IDA] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0045893
            "positive regulation of transcription, DNA-dependent" evidence=TAS]
            InterPro:IPR000504 InterPro:IPR001876 InterPro:IPR012677
            Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102
            PROSITE:PS50199 SMART:SM00360 SMART:SM00547 GO:GO:0005634
            GO:GO:0005737 GO:GO:0045893 GO:GO:0000166 GO:GO:0046872
            GO:GO:0003677 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003723
            EMBL:CH471147 eggNOG:NOG240581 HOGENOM:HOG000038010 EMBL:AC015849
            EMBL:U51334 EMBL:X98893 EMBL:AB010067 EMBL:AY197697 EMBL:AK313223
            IPI:IPI00020194 IPI:IPI00294426 PIR:S71954 RefSeq:NP_003478.1
            RefSeq:NP_631961.1 UniGene:Hs.402752 ProteinModelPortal:Q92804
            SMR:Q92804 IntAct:Q92804 STRING:Q92804 PhosphoSite:Q92804
            DMDM:8928305 PaxDb:Q92804 PRIDE:Q92804 DNASU:8148
            Ensembl:ENST00000311979 GeneID:8148 KEGG:hsa:8148 UCSC:uc002hkc.3
            UCSC:uc002hkd.3 CTD:8148 GeneCards:GC17P034136 HGNC:HGNC:11547
            HPA:HPA052059 MIM:601574 neXtProt:NX_Q92804 PharmGKB:PA36322
            HOVERGEN:HBG005755 InParanoid:Q92804 KO:K14651 OMA:YGNQGSQ
            OrthoDB:EOG4MW872 PhylomeDB:Q92804 ChiTaRS:TAF15 GenomeRNAi:8148
            NextBio:30819 PMAP-CutDB:Q92804 ArrayExpress:Q92804 Bgee:Q92804
            CleanEx:HS_TAF15 Genevestigator:Q92804 GermOnline:ENSG00000172660
            Uniprot:Q92804
        Length = 592

 Score = 162 (62.1 bits), Expect = 1.5e-08, P = 1.5e-08
 Identities = 70/224 (31%), Positives = 85/224 (37%)

Query:   228 NAPNV-DRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA 286
             N P   D R + G + G     E    GR  G+     GYG  +  G      ++G  G 
Sbjct:   380 NEPRPEDSRPSGGDFRGRGYGGERGYRGRG-GRGGDRGGYGGDRSGGGYGGDRSSGG-GY 437

Query:   287 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP 346
               + S   Y   +SG      Y   RG GY   +G GY   +   Y   +G  Y   +G 
Sbjct:   438 SGDRSGGGYGGDRSG----GGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRG-GYGGDRGG 492

Query:   347 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSY--DPQRGLGYDMQRGPNYDMQRGPGY 404
             GY   +G GY   +G  Y   RG  Y   RG  Y  D  RG GY   RG       G GY
Sbjct:   493 GYGGDRG-GYGGDRGG-YGGDRG-GYGGDRG-GYGGDRSRG-GYGGDRGG------GSGY 541

Query:   405 ETQRVPGYDVQR-GPVYEAQRAPSYIPQRGPGYDLQRGQGYDMR 447
                R  GY   R G  Y   R   Y   RG GY  + G   D R
Sbjct:   542 GGDRSGGYGGDRSGGGYGGDRGGGYGGDRG-GYGGKMGGRNDYR 584

 Score = 153 (58.9 bits), Expect = 1.5e-08, Sum P(2) = 1.5e-08
 Identities = 60/164 (36%), Positives = 68/164 (41%)

Query:   312 RGPGYEASK-GPGY--DASKAPSYDPTK-GPSYDPAK-GPGYDPTKGPGYDAQKGSNYDA 366
             RG GY   + G GY  D S    Y   + G  Y   + G GY   +G GY   +G  Y  
Sbjct:   415 RG-GYGGDRSGGGYGGDRSSGGGYSGDRSGGGYGGDRSGGGYGGDRGGGYGGDRGGGYGG 473

Query:   367 QRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAP 426
              RG  Y   RG  Y   RG GY   RG  Y   RG GY   R  GY   RG  Y   R+ 
Sbjct:   474 DRGGGYGGDRG-GYGGDRGGGYGGDRG-GYGGDRG-GYGGDR-GGYGGDRGG-YGGDRSR 528

Query:   427 S-YIPQRG--PGYDLQRGQGYDMRRAPS-YDPSRGTGFDGAPRG 466
               Y   RG   GY   R  GY   R+   Y   RG G+ G  RG
Sbjct:   529 GGYGGDRGGGSGYGGDRSGGYGGDRSGGGYGGDRGGGY-GGDRG 571

 Score = 54 (24.1 bits), Expect = 1.5e-08, Sum P(2) = 1.5e-08
 Identities = 23/97 (23%), Positives = 42/97 (43%)

Query:   188 RGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRRAADGSYGGATGN 247
             +  Y+ +   Y+ + +S     +NY +  T+ +  R ++      D R   GS GG  G 
Sbjct:   132 QSNYDQQHDSYSQNQQSYHSQRENY-SHHTQDD--RRDVSRYGE-DNRGYGGSQGGGRGR 187

Query:   248 SENETSGR-PV-GQNAYEDG----YGVPQGHGPPPSA 278
                +  GR P+ G +  + G    +G  + +GP   A
Sbjct:   188 GGYDKDGRGPMTGSSGGDRGGFKNFGGHRDYGPRTDA 224


>WB|WBGene00044109
            symbol:K02E11.10 species:6239 "Caenorhabditis elegans"
            [GO:0016021 "integral to membrane" evidence=IEA] EMBL:Z77665
            RefSeq:NP_001024024.1 ProteinModelPortal:Q5FC49
            EnsemblMetazoa:K02E11.10 GeneID:259661 KEGG:cel:CELE_K02E11.10
            UCSC:K02E11.10 CTD:259661 WormBase:K02E11.10
            GeneTree:ENSGT00530000065030 InParanoid:Q5FC49 OMA:VQASGYQ
            NextBio:952394 Uniprot:Q5FC49
        Length = 360

 Score = 154 (59.3 bits), Expect = 4.4e-08, P = 4.4e-08
 Identities = 69/224 (30%), Positives = 91/224 (40%)

Query:   265 GYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPG 323
             G+G   G    P A   G+ G  G      A+     G     A     G G     G G
Sbjct:    81 GFGGAGGSYAAP-ALGGGLGGFGGAPAPAPAFGGLGGGYQAAPALGGGLGGGLGGGPGGG 139

Query:   324 YDASKAPSYDPTKGPSYDPA---KGPGYD--PTKGPGYDAQKGSNYDAQRGP---NYDIH 375
             Y A+ A        P+  PA    G GY   PT G G  AQ G+ Y  Q+GP    +   
Sbjct:   140 YQAAPALQLPGLGAPA--PAFGGLGGGYQGAPTLGGG-QAQGGAGY--QQGPAQGRFVAQ 194

Query:   376 RGPSYDPQRGLGYDMQRGP---NYDMQRGPGYETQRVPGYDVQRGPV---YEAQRAPSYI 429
             +G +   Q G GY  Q+GP    +  Q+GP    Q   GY  Q+GP    + AQ+ P+  
Sbjct:   195 QGSAQGVQGGAGY--QQGPAQGGFTAQQGPAQVVQGGAGY--QQGPAQGGFVAQQGPAPA 250

Query:   430 PQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG-AAPHGQ 472
              Q G GY     QG     A     ++G G+  A +G +AP  Q
Sbjct:   251 AQGGAGYQQGSTQGGFEAVAQQGQVAQGAGYQSAAQGQSAPVSQ 294


>DICTYBASE|DDB_G0277909
            symbol:cbpP "calcium-binding protein" species:44689
            "Dictyostelium discoideum" [GO:0005509 "calcium ion binding"
            evidence=IEA] [GO:0046872 "metal ion binding" evidence=IEA]
            [GO:0044351 "macropinocytosis" evidence=RCA] InterPro:IPR002048
            InterPro:IPR011992 Pfam:PF13499 PROSITE:PS50222 SMART:SM00054
            dictyBase:DDB_G0277909 Prosite:PS00018 GenomeReviews:CM000152_GR
            EMBL:AAFI02000023 GO:GO:0005509 Gene3D:1.10.238.10
            InterPro:IPR018247 EMBL:U03413 RefSeq:XP_642080.1
            ProteinModelPortal:P35085 PRIDE:P35085 EnsemblProtists:DDB0214957
            GeneID:8621293 KEGG:ddi:DDB_G0277909 eggNOG:NOG135385 OMA:MGAYPPQ
            ProtClustDB:CLSZ2846833 Uniprot:P35085
        Length = 467

 Score = 155 (59.6 bits), Expect = 5.8e-08, P = 5.8e-08
 Identities = 73/247 (29%), Positives = 89/247 (36%)

Query:   269 PQGHGPPPSATTAGVVGAGPNT--STSAYAATQS--GTPMRAAYDIPRGPGYEASKGPGY 324
             PQ   PPP+ + A      P     T     +QS  G P       P+ PG   S  P Y
Sbjct:     4 PQN--PPPAGSAADFYSQMPVKVMGTPGAPGSQSTPGAPGAPGQYPPQQPGAPGSNLPPY 61

Query:   325 DASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRG-PNYDIHRGPSYDPQ 383
               ++ P      G  Y P + PG  P + PG   Q       Q G P     +   Y PQ
Sbjct:    62 PGTQQPGAPGAPG-QYPPQQ-PGQYPPQQPGAPGQYPPQQPGQPGYPPQQPGQSGQYPPQ 119

Query:   384 R-GL-GYDMQR--GPN-YDMQRG-PGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYD 437
             + G  GY  Q+   P  Y  Q+G PG    + PG   Q  P  + Q  P    Q G    
Sbjct:   120 QPGQPGYPPQQPGAPGQYPPQQGQPGQYPPQQPGQPGQYPPQQQGQYPPQQPGQPGAYPP 179

Query:   438 LQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPP---ARSGSG 494
              Q GQ        +Y P +G     A  GA     VPPP    P     PP   A  G  
Sbjct:   180 QQSGQ------PGAYPPQQGVQNTLAKTGAPGQPGVPPPQGAYPGQPGVPPQQGAYPGQQ 233

Query:   495 QPRGGNP 501
              P G  P
Sbjct:   234 PPMGAYP 240

 Score = 139 (54.0 bits), Expect = 3.4e-06, P = 3.4e-06
 Identities = 79/251 (31%), Positives = 98/251 (39%)

Query:   273 GPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSY 332
             G P S +T G  GA P      Y   Q G P     ++P  PG +    PG      P  
Sbjct:    29 GAPGSQSTPGAPGA-PGQ----YPPQQPGAP---GSNLPPYPGTQQPGAPGAPGQYPPQ- 79

Query:   333 DPTKGPSYDPAKGPG-YDPTK-G-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQR-GL-G 387
              P + P   P   PG Y P + G PGY  Q+      Q  P       P Y PQ+ G  G
Sbjct:    80 QPGQYPPQQPG-APGQYPPQQPGQPGYPPQQPGQ-SGQYPPQQPGQ--PGYPPQQPGAPG 135

Query:   388 -YDMQRG-PNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG--PG-YDLQRGQ 442
              Y  Q+G P     + PG   Q  P    Q  P    Q   +Y PQ+   PG Y  Q+G 
Sbjct:   136 QYPPQQGQPGQYPPQQPGQPGQYPPQQQGQYPPQQPGQPG-AYPPQQSGQPGAYPPQQGV 194

Query:   443 GYDMRRA-----PSYDPSRGT--GFDGAP--RGAAPHGQVPPPLNNVPYGSATPPARSGS 493
                + +      P   P +G   G  G P  +GA P GQ PP     P G   P A    
Sbjct:   195 QNTLAKTGAPGQPGVPPPQGAYPGQPGVPPQQGAYP-GQQPPMGAYPPQGQ--PGAYPPQ 251

Query:   494 GQPRGGNPARR 504
             GQP G  P ++
Sbjct:   252 GQP-GAYPPQQ 261

 Score = 133 (51.9 bits), Expect = 1.6e-05, P = 1.6e-05
 Identities = 83/276 (30%), Positives = 101/276 (36%)

Query:   243 GATGNSENETSGRPVGQNAYEDGY-GVPQGHGPP-PSATTAGVVGA-G--PNTSTSAYAA 297
             GA G+    T G P     Y     G P  + PP P     G  GA G  P      Y  
Sbjct:    29 GAPGSQS--TPGAPGAPGQYPPQQPGAPGSNLPPYPGTQQPGAPGAPGQYPPQQPGQYPP 86

Query:   298 TQSGTPMRAAYDIPRGPGYEASKGPG----YDASKA--PSYDPTK--GPS-YDPAKG-PG 347
              Q G P +     P  PGY   + PG    Y   +   P Y P +   P  Y P +G PG
Sbjct:    87 QQPGAPGQYPPQQPGQPGYPPQQ-PGQSGQYPPQQPGQPGYPPQQPGAPGQYPPQQGQPG 145

Query:   348 YDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQR-GL--GYDMQRGPNYDMQRGPGY 404
               P + PG   Q       Q  P      G +Y PQ+ G    Y  Q+G    + +  G 
Sbjct:   146 QYPPQQPGQPGQYPPQQQGQYPPQQPGQPG-AYPPQQSGQPGAYPPQQGVQNTLAK-TGA 203

Query:   405 ETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGA- 463
               Q  PG    +G  Y  Q  P   PQ+G  Y    GQ   M    +Y P    G  GA 
Sbjct:   204 PGQ--PGVPPPQG-AYPGQ--PGVPPQQG-AYP---GQQPPMG---AYPPQ---GQPGAY 248

Query:   464 PRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 499
             P    P G  PP    V Y    PP   G+  P+ G
Sbjct:   249 PPQGQP-GAYPPQQQQVAYPGQQPPM--GAYPPQQG 281


>FB|FBgn0050203
            symbol:CG30203 species:7227 "Drosophila melanogaster"
            [GO:0004867 "serine-type endopeptidase inhibitor activity"
            evidence=IEA] InterPro:IPR002223 Pfam:PF00014 PROSITE:PS50279
            SMART:SM00131 EMBL:AE013599 GO:GO:0004867 Gene3D:4.10.410.10
            SUPFAM:SSF57362 InterPro:IPR000884 Pfam:PF00090 SMART:SM00209
            SUPFAM:SSF82895 PROSITE:PS50092 InterPro:IPR002861 Pfam:PF02014
            PROSITE:PS51019 GeneTree:ENSGT00640000091268 InterPro:IPR009465
            Pfam:PF06468 PROSITE:PS51020 EMBL:BT023853 RefSeq:NP_725128.2
            UniGene:Dm.23753 SMR:Q3ZAL6 EnsemblMetazoa:FBtr0273303
            GeneID:246514 KEGG:dme:Dmel_CG30203 FlyBase:FBgn0050203
            eggNOG:NOG244582 OMA:KWARNTH OrthoDB:EOG43R22N GenomeRNAi:246514
            NextBio:842774 Uniprot:Q3ZAL6
        Length = 924

 Score = 157 (60.3 bits), Expect = 9.9e-08, P = 9.9e-08
 Identities = 39/105 (37%), Positives = 49/105 (46%)

Query:   305 RAAYDIP--RGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGS 362
             R +YD    RG  Y+ + G  Y  ++  SYD   G SYD   G  Y  T G  YD  +  
Sbjct:   793 RRSYDQKQTRGRPYDQTSGIAYGQTEGRSYDQKVGKSYDRTFGRSYKQTGGGSYDQPEDR 852

Query:   363 NYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY-ET 406
             +YD   G +Y      SYD  RG  YD   G +YD+  G  Y ET
Sbjct:   853 SYDLSTGRSYVQPEDRSYDLSRGRSYDQPVGRSYDLAGGRSYGET 897

 Score = 153 (58.9 bits), Expect = 2.7e-07, P = 2.7e-07
 Identities = 46/148 (31%), Positives = 60/148 (40%)

Query:   317 EASKGPGYDASKAPSYDP--TKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDI 374
             E S+    D     SYD   T+G  YD   G  Y  T+G  YD + G +YD   G +Y  
Sbjct:   781 ERSENDAMDLYGRRSYDQKQTRGRPYDQTSGIAYGQTEGRSYDQKVGKSYDRTFGRSYKQ 840

Query:   375 HRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGP 434
               G SYD      YD+  G +Y       Y+  R   YD   G  Y+     SY      
Sbjct:   841 TGGGSYDQPEDRSYDLSTGRSYVQPEDRSYDLSRGRSYDQPVGRSYDLAGGRSYGETSEA 900

Query:   435 GYDLQRGQGYDMRRAPSYDPSRGTGFDG 462
             G D+  G+     R+  YD SR   + G
Sbjct:   901 G-DI--GEPMSQTRS-RYDTSRRGRYGG 924

 Score = 134 (52.2 bits), Expect = 3.1e-05, P = 3.1e-05
 Identities = 36/111 (32%), Positives = 45/111 (40%)

Query:   356 YDAQ--KGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYD 413
             YD +  +G  YD   G  Y    G SYD + G  YD   G +Y    G  Y+      YD
Sbjct:   796 YDQKQTRGRPYDQTSGIAYGQTEGRSYDQKVGKSYDRTFGRSYKQTGGGSYDQPEDRSYD 855

Query:   414 VQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 464
             +  G  Y      SY   RG  YD   G+ YD+    SY  +   G  G P
Sbjct:   856 LSTGRSYVQPEDRSYDLSRGRSYDQPVGRSYDLAGGRSYGETSEAGDIGEP 906

 Score = 123 (48.4 bits), Expect = 0.00049, P = 0.00049
 Identities = 38/119 (31%), Positives = 52/119 (43%)

Query:   290 TSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYD 349
             TS  AY  T+       +YD   G  Y+ + G  Y  +   SYD  +  SYD + G  Y 
Sbjct:   809 TSGIAYGQTEG-----RSYDQKVGKSYDRTFGRSYKQTGGGSYDQPEDRSYDLSTGRSYV 863

Query:   350 PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDP--QRG-LGYDM-QRGPNYDMQRGPGY 404
               +   YD  +G +YD   G +YD+  G SY    + G +G  M Q    YD  R   Y
Sbjct:   864 QPEDRSYDLSRGRSYDQPVGRSYDLAGGRSYGETSEAGDIGEPMSQTRSRYDTSRRGRY 922


>WB|WBGene00005015
            symbol:spt-5 species:6239 "Caenorhabditis elegans"
            [GO:0032968 "positive regulation of transcription elongation from
            RNA polymerase II promoter" evidence=IEA] [GO:0006357 "regulation
            of transcription from RNA polymerase II promoter" evidence=IEA]
            [GO:0032784 "regulation of DNA-dependent transcription, elongation"
            evidence=IEA] [GO:0009792 "embryo development ending in birth or
            egg hatching" evidence=IMP] [GO:0040007 "growth" evidence=IMP]
            [GO:0002119 "nematode larval development" evidence=IMP] [GO:0040035
            "hermaphrodite genitalia development" evidence=IMP] [GO:0000003
            "reproduction" evidence=IMP] InterPro:IPR006645 InterPro:IPR017071
            InterPro:IPR024945 PIRSF:PIRSF036945 SMART:SM00738 SMART:SM01104
            Pfam:PF00467 GO:GO:0009792 GO:GO:0040007 GO:GO:0002119
            GO:GO:0006351 GO:GO:0040035 GO:GO:0032968 EMBL:Z68752
            InterPro:IPR008991 SUPFAM:SSF50104 InterPro:IPR005824 SMART:SM00739
            eggNOG:COG0250 GO:GO:0032044 HOGENOM:HOG000038564 KO:K15172
            InterPro:IPR022581 InterPro:IPR005100 PANTHER:PTHR11125:SF7
            Pfam:PF03439 Pfam:PF11942 EMBL:Z68316 PIR:T23467 RefSeq:NP_502283.1
            ProteinModelPortal:Q21338 SMR:Q21338 STRING:Q21338 PaxDb:Q21338
            EnsemblMetazoa:K08E4.1 GeneID:178143 KEGG:cel:CELE_K08E4.1
            UCSC:K08E4.1 CTD:178143 WormBase:K08E4.1
            GeneTree:ENSGT00440000037640 InParanoid:Q21338 OMA:PAYGNES
            NextBio:899898 Uniprot:Q21338
        Length = 1208

 Score = 158 (60.7 bits), Expect = 1.1e-07, P = 1.1e-07
 Identities = 60/182 (32%), Positives = 76/182 (41%)

Query:   290 TSTSAYAA-TQSGTPMRAAYDIPRGPGY-EASKGPGYDASKAPSYDPTKGPSYDPAKGPG 347
             + T  Y A T     M  AYD  R P Y E  + P Y  SK P+Y      S       G
Sbjct:   813 SKTPMYGAQTPMYGSMTPAYDGGRTPAYGEGGRTPAY-GSKTPAYGDLDEHSSSRTPAYG 871

Query:   348 YDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ-RGPGYET 406
              D ++ P Y    GS  D  R P Y    G    P  G   D  R P YD   R PGYE+
Sbjct:   872 NDSSRTPAY----GSA-DGARTPAYGSTEG-GRTPAYG-SMDNSRTPAYDDSGRTPGYES 924

Query:   407 Q--RVPGYDVQ-RGPVY-EAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDG 462
                R P YD   + P Y E++ +      R P Y+      YD+  +P+Y+P     +D 
Sbjct:   925 MPSRTPNYDSSSKTPAYPESEHSA-----RTPAYN----NDYDIPLSPAYEPDAPEAYDN 975

Query:   463 AP 464
             AP
Sbjct:   976 AP 977

 Score = 146 (56.5 bits), Expect = 2.2e-06, P = 2.2e-06
 Identities = 74/256 (28%), Positives = 96/256 (37%)

Query:   234 RRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTS 293
             R  A GS   A G+ +  +S R     AY +       +G    A T    G+     T 
Sbjct:   845 RTPAYGSKTPAYGDLDEHSSSRTP---AYGNDSSRTPAYGSADGARTPAY-GSTEGGRTP 900

Query:   294 AYAATQ-SGTPMRAAYDIP-RGPGYEA--SKGPGYDAS-KAPSYDPTKGPSYDPAKGPGY 348
             AY +   S TP   AYD   R PGYE+  S+ P YD+S K P+Y  ++  +  PA    Y
Sbjct:   901 AYGSMDNSRTP---AYDDSGRTPGYESMPSRTPNYDSSSKTPAYPESEHSARTPAYNNDY 957

Query:   349 DPTKGPGYDAQKGSNYD---------AQRGPNYDIHRG--PSYDPQRGLGYDMQRG---- 393
             D    P Y+      YD           R P YD +    P+Y+P      +   G    
Sbjct:   958 DIPLSPAYEPDAPEAYDNAPARTPAFVSRTPGYDTYENSSPTYEPDAATKVEEDIGDTSS 1017

Query:   394 PNYDMQRGPGYETQRVPGYDVQRG-P-VYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS 451
             P YD    P       PG  +    P  Y     P +     PG     G  YD   APS
Sbjct:  1018 PTYDSP--PHSYVVPTPGAMLNPATPGAYHVD-TPGFAAPMTPG----SGGAYDQYVAPS 1070

Query:   452 ----YDPSRGTGFDGA 463
                 YD +     DGA
Sbjct:  1071 PFAGYDSNNYNNADGA 1086


>UNIPROTKB|Q21338
            symbol:spt-5 "Transcription elongation factor SPT5"
            species:6239 "Caenorhabditis elegans" [GO:0032044 "DSIF complex"
            evidence=ISS] InterPro:IPR006645 InterPro:IPR017071
            InterPro:IPR024945 PIRSF:PIRSF036945 SMART:SM00738 SMART:SM01104
            Pfam:PF00467 GO:GO:0009792 GO:GO:0040007 GO:GO:0002119
            GO:GO:0006351 GO:GO:0040035 GO:GO:0032968 EMBL:Z68752
            InterPro:IPR008991 SUPFAM:SSF50104 InterPro:IPR005824 SMART:SM00739
            eggNOG:COG0250 GO:GO:0032044 HOGENOM:HOG000038564 KO:K15172
            InterPro:IPR022581 InterPro:IPR005100 PANTHER:PTHR11125:SF7
            Pfam:PF03439 Pfam:PF11942 EMBL:Z68316 PIR:T23467 RefSeq:NP_502283.1
            ProteinModelPortal:Q21338 SMR:Q21338 STRING:Q21338 PaxDb:Q21338
            EnsemblMetazoa:K08E4.1 GeneID:178143 KEGG:cel:CELE_K08E4.1
            UCSC:K08E4.1 CTD:178143 WormBase:K08E4.1
            GeneTree:ENSGT00440000037640 InParanoid:Q21338 OMA:PAYGNES
            NextBio:899898 Uniprot:Q21338
        Length = 1208

 Score = 158 (60.7 bits), Expect = 1.1e-07, P = 1.1e-07
 Identities = 60/182 (32%), Positives = 76/182 (41%)

Query:   290 TSTSAYAA-TQSGTPMRAAYDIPRGPGY-EASKGPGYDASKAPSYDPTKGPSYDPAKGPG 347
             + T  Y A T     M  AYD  R P Y E  + P Y  SK P+Y      S       G
Sbjct:   813 SKTPMYGAQTPMYGSMTPAYDGGRTPAYGEGGRTPAY-GSKTPAYGDLDEHSSSRTPAYG 871

Query:   348 YDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ-RGPGYET 406
              D ++ P Y    GS  D  R P Y    G    P  G   D  R P YD   R PGYE+
Sbjct:   872 NDSSRTPAY----GSA-DGARTPAYGSTEG-GRTPAYG-SMDNSRTPAYDDSGRTPGYES 924

Query:   407 Q--RVPGYDVQ-RGPVY-EAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDG 462
                R P YD   + P Y E++ +      R P Y+      YD+  +P+Y+P     +D 
Sbjct:   925 MPSRTPNYDSSSKTPAYPESEHSA-----RTPAYN----NDYDIPLSPAYEPDAPEAYDN 975

Query:   463 AP 464
             AP
Sbjct:   976 AP 977

 Score = 146 (56.5 bits), Expect = 2.2e-06, P = 2.2e-06
 Identities = 74/256 (28%), Positives = 96/256 (37%)

Query:   234 RRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTS 293
             R  A GS   A G+ +  +S R     AY +       +G    A T    G+     T 
Sbjct:   845 RTPAYGSKTPAYGDLDEHSSSRTP---AYGNDSSRTPAYGSADGARTPAY-GSTEGGRTP 900

Query:   294 AYAATQ-SGTPMRAAYDIP-RGPGYEA--SKGPGYDAS-KAPSYDPTKGPSYDPAKGPGY 348
             AY +   S TP   AYD   R PGYE+  S+ P YD+S K P+Y  ++  +  PA    Y
Sbjct:   901 AYGSMDNSRTP---AYDDSGRTPGYESMPSRTPNYDSSSKTPAYPESEHSARTPAYNNDY 957

Query:   349 DPTKGPGYDAQKGSNYD---------AQRGPNYDIHRG--PSYDPQRGLGYDMQRG---- 393
             D    P Y+      YD           R P YD +    P+Y+P      +   G    
Sbjct:   958 DIPLSPAYEPDAPEAYDNAPARTPAFVSRTPGYDTYENSSPTYEPDAATKVEEDIGDTSS 1017

Query:   394 PNYDMQRGPGYETQRVPGYDVQRG-P-VYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS 451
             P YD    P       PG  +    P  Y     P +     PG     G  YD   APS
Sbjct:  1018 PTYDSP--PHSYVVPTPGAMLNPATPGAYHVD-TPGFAAPMTPG----SGGAYDQYVAPS 1070

Query:   452 ----YDPSRGTGFDGA 463
                 YD +     DGA
Sbjct:  1071 PFAGYDSNNYNNADGA 1086


>WB|WBGene00002280
            symbol:let-2 species:6239 "Caenorhabditis elegans"
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA] [GO:0005581 "collagen" evidence=IEA] [GO:0040007
            "growth" evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0009792 "embryo development ending in birth or
            egg hatching" evidence=IMP] [GO:0000003 "reproduction"
            evidence=IMP] [GO:0040011 "locomotion" evidence=IMP] [GO:0040039
            "inductive cell migration" evidence=IMP] [GO:0040018 "positive
            regulation of multicellular organism growth" evidence=IMP]
            [GO:0005604 "basement membrane" evidence=IDA] [GO:0005198
            "structural molecule activity" evidence=IDA] InterPro:IPR001442
            Pfam:PF01413 PROSITE:PS51403 SMART:SM00111 GO:GO:0009792
            GO:GO:0040007 GO:GO:0002119 InterPro:IPR016187 SUPFAM:SSF56436
            GO:GO:0040018 GO:GO:0000003 GO:GO:0040039 InterPro:IPR008160
            Pfam:PF01391 eggNOG:NOG12793 GO:GO:0030020 HOGENOM:HOG000085652
            GO:GO:0005587 Gene3D:2.170.240.10 KO:K06237 EMBL:Z22964 EMBL:U22327
            EMBL:FO081065 EMBL:J05066 PIR:A34476 PIR:T29350 PIR:T29351
            RefSeq:NP_510663.1 RefSeq:NP_510664.1 ProteinModelPortal:P17140
            SMR:P17140 STRING:P17140 PaxDb:P17140 PRIDE:P17140 GeneID:181708
            KEGG:cel:CELE_F01G12.5 UCSC:F01G12.5b.1 CTD:181708
            WormBase:F01G12.5a WormBase:F01G12.5b InParanoid:P17140
            NextBio:915032 GO:GO:0016043 Uniprot:P17140
        Length = 1758

 Score = 159 (61.0 bits), Expect = 1.3e-07, P = 1.3e-07
 Identities = 83/266 (31%), Positives = 96/266 (36%)

Query:   230 PNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQG-HGPPPSATTAGVVGAGP 288
             P +  +  +  Y G  G   N     P G   + DG   P G  G P +    G  G  P
Sbjct:   330 PGLPGQKGEAGYPGRDGPKGNSGPPGPPGGGTFNDGAPGPPGLPGRPGNPGPPGTDGY-P 388

Query:   289 NTSTSAYAATQSGTPMRAAYDIPRG-PGYEASKGPGYDASKAPSYDPTKG-PSYDPAKG- 345
                  A     +G P    Y    G PG +  KG G     AP      G P     KG 
Sbjct:   389 GAPGPAGPIGNTGGPGLPGYPGNEGLPGPKGDKGDG-GIPGAPGVSGPSGIPGLPGPKGE 447

Query:   346 PGYDPTKG------PGYDAQKG-SNYDAQRGPN-YDIHRGPSYDPQRGL-GYDMQRG--- 393
             PGY  T G      PG D + G      ++G N     RGP  D   GL G   QRG   
Sbjct:   448 PGYRGTPGQSIPGLPGKDGKPGLDGAPGRKGENGLPGVRGPPGDSLNGLPGAPGQRGAPG 507

Query:   394 PN-YDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPS 451
             PN YD + G        PG    RG    A  AP    ++G PGY  Q G   D R  P 
Sbjct:   508 PNGYDGRDGVN-GLPGAPGTKGDRGGTCSAC-APGTKGEKGLPGYSGQPGPQGD-RGLPG 564

Query:   452 YD-PSRGTGFDGAPRGAAPHGQVPPP 476
                P    G DG P  A   G   PP
Sbjct:   565 MPGPVGDAGDDGLPGPAGRPGSPGPP 590


>UNIPROTKB|P17140
            symbol:let-2 "Collagen alpha-2(IV) chain" species:6239
            "Caenorhabditis elegans" [GO:0016043 "cellular component
            organization" evidence=NAS] [GO:0030020 "extracellular matrix
            structural constituent conferring tensile strength" evidence=IMP]
            [GO:0005587 "collagen type IV" evidence=IMP] InterPro:IPR001442
            Pfam:PF01413 PROSITE:PS51403 SMART:SM00111 GO:GO:0009792
            GO:GO:0040007 GO:GO:0002119 InterPro:IPR016187 SUPFAM:SSF56436
            GO:GO:0040018 GO:GO:0000003 GO:GO:0040039 InterPro:IPR008160
            Pfam:PF01391 eggNOG:NOG12793 GO:GO:0030020 HOGENOM:HOG000085652
            GO:GO:0005587 Gene3D:2.170.240.10 KO:K06237 EMBL:Z22964 EMBL:U22327
            EMBL:FO081065 EMBL:J05066 PIR:A34476 PIR:T29350 PIR:T29351
            RefSeq:NP_510663.1 RefSeq:NP_510664.1 ProteinModelPortal:P17140
            SMR:P17140 STRING:P17140 PaxDb:P17140 PRIDE:P17140 GeneID:181708
            KEGG:cel:CELE_F01G12.5 UCSC:F01G12.5b.1 CTD:181708
            WormBase:F01G12.5a WormBase:F01G12.5b InParanoid:P17140
            NextBio:915032 GO:GO:0016043 Uniprot:P17140
        Length = 1758

 Score = 159 (61.0 bits), Expect = 1.3e-07, P = 1.3e-07
 Identities = 83/266 (31%), Positives = 96/266 (36%)

Query:   230 PNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQG-HGPPPSATTAGVVGAGP 288
             P +  +  +  Y G  G   N     P G   + DG   P G  G P +    G  G  P
Sbjct:   330 PGLPGQKGEAGYPGRDGPKGNSGPPGPPGGGTFNDGAPGPPGLPGRPGNPGPPGTDGY-P 388

Query:   289 NTSTSAYAATQSGTPMRAAYDIPRG-PGYEASKGPGYDASKAPSYDPTKG-PSYDPAKG- 345
                  A     +G P    Y    G PG +  KG G     AP      G P     KG 
Sbjct:   389 GAPGPAGPIGNTGGPGLPGYPGNEGLPGPKGDKGDG-GIPGAPGVSGPSGIPGLPGPKGE 447

Query:   346 PGYDPTKG------PGYDAQKG-SNYDAQRGPN-YDIHRGPSYDPQRGL-GYDMQRG--- 393
             PGY  T G      PG D + G      ++G N     RGP  D   GL G   QRG   
Sbjct:   448 PGYRGTPGQSIPGLPGKDGKPGLDGAPGRKGENGLPGVRGPPGDSLNGLPGAPGQRGAPG 507

Query:   394 PN-YDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPS 451
             PN YD + G        PG    RG    A  AP    ++G PGY  Q G   D R  P 
Sbjct:   508 PNGYDGRDGVN-GLPGAPGTKGDRGGTCSAC-APGTKGEKGLPGYSGQPGPQGD-RGLPG 564

Query:   452 YD-PSRGTGFDGAPRGAAPHGQVPPP 476
                P    G DG P  A   G   PP
Sbjct:   565 MPGPVGDAGDDGLPGPAGRPGSPGPP 590


>MGI|MGI:1330280
            symbol:Krtap6-2 "keratin associated protein 6-2"
            species:10090 "Mus musculus" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0005882 "intermediate filament" evidence=IEA] [GO:0008150
            "biological_process" evidence=ND] MGI:MGI:1330280 GO:GO:0005882
            CTD:337967 EMBL:D89902 IPI:IPI00116464 RefSeq:NP_034803.2
            UniGene:Mm.3524 PRIDE:O08884 DNASU:16701 GeneID:16701
            KEGG:mmu:16701 UCSC:uc007zvp.1 NextBio:290464 Genevestigator:O08884
            Uniprot:O08884
        Length = 159

 Score = 128 (50.1 bits), Expect = 1.9e-07, P = 1.9e-07
 Identities = 38/124 (30%), Positives = 40/124 (32%)

Query:   313 GPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNY 372
             G GY +  G GY       Y    G  Y    G GY    G GY    GS Y    G  Y
Sbjct:    13 GCGYGSGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGCGYGSGY 72

Query:   373 DIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQR 432
                 G  Y    G GY    G  Y    G GY      GY    G  Y +     Y    
Sbjct:    73 GCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGSGYGSGC 132

Query:   433 GPGY 436
             G GY
Sbjct:   133 GCGY 136

 Score = 126 (49.4 bits), Expect = 3.1e-07, P = 3.1e-07
 Identities = 39/130 (30%), Positives = 40/130 (30%)

Query:   315 GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDI 374
             G     G GY +     Y    G  Y    G GY    G GY    GS Y    G  Y  
Sbjct:     7 GNSCGYGCGYGSGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGC 66

Query:   375 HRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGP 434
               G  Y    G GY    G  Y    G GY      GY    G  Y       Y    G 
Sbjct:    67 GYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGS 126

Query:   435 GYDLQRGQGY 444
             GY    G GY
Sbjct:   127 GYGSGCGCGY 136

 Score = 125 (49.1 bits), Expect = 4.0e-07, P = 4.0e-07
 Identities = 40/136 (29%), Positives = 42/136 (30%)

Query:   337 GPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNY 396
             G  Y    G GY    G GY    GS Y    G  Y    G  Y    G GY    G  Y
Sbjct:    13 GCGYGSGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGCGYGSGY 72

Query:   397 DMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSR 456
                 G GY      GY    G  Y       Y    G GY    G GY       Y    
Sbjct:    73 GCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGSGYGSGC 132

Query:   457 GTGFDGAPR-GAAPHG 471
             G G+    R G   +G
Sbjct:   133 GCGYGSYYRSGCCGYG 148

 Score = 124 (48.7 bits), Expect = 5.1e-07, P = 5.1e-07
 Identities = 34/112 (30%), Positives = 37/112 (33%)

Query:   301 GTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQK 360
             G+   + Y    G GY    G GY       Y    G  Y    G GY    G GY    
Sbjct:    17 GSGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGY 76

Query:   361 GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGY 412
             GS Y    G  Y    G  Y    G GY    G  Y    G GY +    GY
Sbjct:    77 GSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGSGY 128

 Score = 118 (46.6 bits), Expect = 2.3e-06, P = 2.3e-06
 Identities = 33/107 (30%), Positives = 35/107 (32%)

Query:   306 AAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYD 365
             + Y    G GY    G GY       Y    G  Y    G GY    G GY    GS Y 
Sbjct:    30 SGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYG 89

Query:   366 AQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGY 412
                G  Y    G  Y    G GY    G  Y    G GY +    GY
Sbjct:    90 CGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGSGYGSGCGCGY 136

 Score = 118 (46.6 bits), Expect = 2.3e-06, P = 2.3e-06
 Identities = 34/120 (28%), Positives = 39/120 (32%)

Query:   285 GAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAK 344
             G+G  +     + +  G    + Y    G GY    G GY       Y    G  Y    
Sbjct:    17 GSGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGY 76

Query:   345 GPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY 404
             G GY    G GY    GS Y    G  Y    G  Y    G GY    G  Y    G GY
Sbjct:    77 GSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGSGYGSGCGCGY 136

 Score = 111 (44.1 bits), Expect = 0.00010, P = 0.00010
 Identities = 35/127 (27%), Positives = 40/127 (31%)

Query:   262 YEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKG 321
             Y  GYG   G+G      +    G G  +       +  G    + Y    G GY    G
Sbjct:    12 YGCGYG--SGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGCGYG 69

Query:   322 PGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYD 381
              GY       Y    G  Y    G GY    G GY    GS Y    G  Y    G  Y 
Sbjct:    70 SGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGSGYG 129

Query:   382 PQRGLGY 388
                G GY
Sbjct:   130 SGCGCGY 136


>WB|WBGene00000123
            symbol:ama-1 species:6239 "Caenorhabditis elegans"
            [GO:0003899 "DNA-directed RNA polymerase activity"
            evidence=IEA;ISS] [GO:0003677 "DNA binding" evidence=IEA]
            [GO:0005665 "DNA-directed RNA polymerase II, core complex"
            evidence=IEA] [GO:0006366 "transcription from RNA polymerase II
            promoter" evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA;IMP] [GO:0009792 "embryo development ending in birth
            or egg hatching" evidence=IMP] [GO:0000003 "reproduction"
            evidence=IMP] [GO:0040010 "positive regulation of growth rate"
            evidence=IMP] [GO:0007052 "mitotic spindle organization"
            evidence=IMP] [GO:0010458 "exit from mitosis" evidence=IMP]
            [GO:0008356 "asymmetric cell division" evidence=IMP] [GO:0032502
            "developmental process" evidence=IMP] [GO:0006479 "protein
            methylation" evidence=IMP] [GO:0007369 "gastrulation" evidence=IMP]
            [GO:0005634 "nucleus" evidence=IDA] [GO:0001055 "RNA polymerase II
            activity" evidence=IMP] [GO:0042789 "mRNA transcription from RNA
            polymerase II promoter" evidence=IMP] InterPro:IPR000684
            InterPro:IPR000722 InterPro:IPR006592 InterPro:IPR007066
            InterPro:IPR007073 InterPro:IPR007075 InterPro:IPR007080
            InterPro:IPR007081 InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983
            Pfam:PF04990 Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000
            Pfam:PF05001 PROSITE:PS00115 SMART:SM00663 GO:GO:0005634
            GO:GO:0009792 GO:GO:0040010 GO:GO:0007052 GO:GO:0010458
            GO:GO:0046872 GO:GO:0003677 GO:GO:0000003 Gene3D:2.40.40.20
            InterPro:IPR009010 GO:GO:0006479 GO:GO:0008356 GO:GO:0007369
            GO:GO:0042789 EMBL:FO081153 eggNOG:COG0086 GO:GO:0005665
            EMBL:M29235 PIR:A34092 PIR:T29959 RefSeq:NP_500523.4 IntAct:P16356
            STRING:P16356 PaxDb:P16356 EnsemblMetazoa:F36A4.7.1
            EnsemblMetazoa:F36A4.7.2 GeneID:177190 KEGG:cel:CELE_F36A4.7
            UCSC:F36A4.7 CTD:247749 WormBase:F36A4.7
            GeneTree:ENSGT00700000104490 HOGENOM:HOG000222975 InParanoid:P16356
            OMA:KVLPWST NextBio:895720 GO:GO:0001055 Uniprot:P16356
        Length = 1856

 Score = 157 (60.3 bits), Expect = 2.3e-07, P = 2.3e-07
 Identities = 68/254 (26%), Positives = 93/254 (36%)

Query:   239 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAAT 298
             G   GA  +    T G   G + + +G   P   G P  A +      G   S   Y+ +
Sbjct:  1527 GMSPGAGFSPAGNTDG---GASPFNEGGWSPASPGDPLGALSPRTPSYG-GMSPGVYSPS 1582

Query:   299 QSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDA 358
                  M + +  P  P Y  +      +  +PSY PT  PSY P   P Y PT  P Y  
Sbjct:  1583 SPQFSMTSPHYSPTSPSYSPTSPAAGQSPVSPSYSPTS-PSYSPTS-PSYSPTS-PSYSP 1639

Query:   359 QKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGP 418
                S Y +   P+Y     PSY P     Y     P+Y     P Y     P Y     P
Sbjct:  1640 TSPS-Y-SPTSPSYS-PTSPSYSPSSP-SYSPS-SPSYSPS-SPRYSPTS-PTYS-PTSP 1691

Query:   419 VYEAQRAPSYIPQRGPGYD-----LQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQV 473
              Y +  +P+Y P   P Y       + G GY    +P Y PS  T    +P  +    Q 
Sbjct:  1692 TY-SPTSPTYSPT-SPTYSPTSPSYESGGGYSPS-SPKYSPSSPTYSPTSPSYSPTSPQY 1748

Query:   474 PPPLNNVPYGSATP 487
              P   +  Y  ++P
Sbjct:  1749 SP--TSPQYSPSSP 1760

 Score = 154 (59.3 bits), Expect = 4.8e-07, P = 4.8e-07
 Identities = 65/219 (29%), Positives = 87/219 (39%)

Query:   275 PPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDP 334
             P  + T+   G  P  S S    + S +P   +Y  P  P Y  +  P Y  + +PSY P
Sbjct:  1598 PSYSPTSPAAGQSP-VSPSYSPTSPSYSPTSPSYS-PTSPSYSPTS-PSYSPT-SPSYSP 1653

Query:   335 TKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGP 394
             T  PSY P+  P Y P+  P Y +     Y +   P Y     P+Y P     Y     P
Sbjct:  1654 TS-PSYSPSS-PSYSPSS-PSY-SPSSPRY-SPTSPTYS-PTSPTYSPTSPT-YS-PTSP 1705

Query:   395 NYD-----MQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRA 449
              Y       + G GY     P Y     P Y +  +PSY P   P Y     Q Y    +
Sbjct:  1706 TYSPTSPSYESGGGYSPSS-PKYSPS-SPTY-SPTSPSYSPT-SPQYSPTSPQ-YSPS-S 1759

Query:   450 PSYDPSRGTGFDGAPRG-AAPHGQVPPPLNNVPYGSATP 487
             P+Y PS  T    +PRG ++P      P  +    S TP
Sbjct:  1760 PTYTPSSPTYNPTSPRGFSSPQYSPTSPTYSPTSPSYTP 1798


>UNIPROTKB|P16356
            symbol:ama-1 "DNA-directed RNA polymerase II subunit RPB1"
            species:6239 "Caenorhabditis elegans" [GO:0005515 "protein binding"
            evidence=IPI] InterPro:IPR000684 InterPro:IPR000722
            InterPro:IPR006592 InterPro:IPR007066 InterPro:IPR007073
            InterPro:IPR007075 InterPro:IPR007080 InterPro:IPR007081
            InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983 Pfam:PF04990
            Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000 Pfam:PF05001
            PROSITE:PS00115 SMART:SM00663 GO:GO:0005634 GO:GO:0009792
            GO:GO:0040010 GO:GO:0007052 GO:GO:0010458 GO:GO:0046872
            GO:GO:0003677 GO:GO:0000003 Gene3D:2.40.40.20 InterPro:IPR009010
            GO:GO:0006479 GO:GO:0008356 GO:GO:0007369 GO:GO:0042789
            EMBL:FO081153 eggNOG:COG0086 GO:GO:0005665 EMBL:M29235 PIR:A34092
            PIR:T29959 RefSeq:NP_500523.4 IntAct:P16356 STRING:P16356
            PaxDb:P16356 EnsemblMetazoa:F36A4.7.1 EnsemblMetazoa:F36A4.7.2
            GeneID:177190 KEGG:cel:CELE_F36A4.7 UCSC:F36A4.7 CTD:247749
            WormBase:F36A4.7 GeneTree:ENSGT00700000104490 HOGENOM:HOG000222975
            InParanoid:P16356 OMA:KVLPWST NextBio:895720 GO:GO:0001055
            Uniprot:P16356
        Length = 1856

 Score = 157 (60.3 bits), Expect = 2.3e-07, P = 2.3e-07
 Identities = 68/254 (26%), Positives = 93/254 (36%)

Query:   239 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAAT 298
             G   GA  +    T G   G + + +G   P   G P  A +      G   S   Y+ +
Sbjct:  1527 GMSPGAGFSPAGNTDG---GASPFNEGGWSPASPGDPLGALSPRTPSYG-GMSPGVYSPS 1582

Query:   299 QSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDA 358
                  M + +  P  P Y  +      +  +PSY PT  PSY P   P Y PT  P Y  
Sbjct:  1583 SPQFSMTSPHYSPTSPSYSPTSPAAGQSPVSPSYSPTS-PSYSPTS-PSYSPTS-PSYSP 1639

Query:   359 QKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGP 418
                S Y +   P+Y     PSY P     Y     P+Y     P Y     P Y     P
Sbjct:  1640 TSPS-Y-SPTSPSYS-PTSPSYSPSSP-SYSPS-SPSYSPS-SPRYSPTS-PTYS-PTSP 1691

Query:   419 VYEAQRAPSYIPQRGPGYD-----LQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQV 473
              Y +  +P+Y P   P Y       + G GY    +P Y PS  T    +P  +    Q 
Sbjct:  1692 TY-SPTSPTYSPT-SPTYSPTSPSYESGGGYSPS-SPKYSPSSPTYSPTSPSYSPTSPQY 1748

Query:   474 PPPLNNVPYGSATP 487
              P   +  Y  ++P
Sbjct:  1749 SP--TSPQYSPSSP 1760

 Score = 154 (59.3 bits), Expect = 4.8e-07, P = 4.8e-07
 Identities = 65/219 (29%), Positives = 87/219 (39%)

Query:   275 PPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDP 334
             P  + T+   G  P  S S    + S +P   +Y  P  P Y  +  P Y  + +PSY P
Sbjct:  1598 PSYSPTSPAAGQSP-VSPSYSPTSPSYSPTSPSYS-PTSPSYSPTS-PSYSPT-SPSYSP 1653

Query:   335 TKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGP 394
             T  PSY P+  P Y P+  P Y +     Y +   P Y     P+Y P     Y     P
Sbjct:  1654 TS-PSYSPSS-PSYSPSS-PSY-SPSSPRY-SPTSPTYS-PTSPTYSPTSPT-YS-PTSP 1705

Query:   395 NYD-----MQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRA 449
              Y       + G GY     P Y     P Y +  +PSY P   P Y     Q Y    +
Sbjct:  1706 TYSPTSPSYESGGGYSPSS-PKYSPS-SPTY-SPTSPSYSPT-SPQYSPTSPQ-YSPS-S 1759

Query:   450 PSYDPSRGTGFDGAPRG-AAPHGQVPPPLNNVPYGSATP 487
             P+Y PS  T    +PRG ++P      P  +    S TP
Sbjct:  1760 PTYTPSSPTYNPTSPRGFSSPQYSPTSPTYSPTSPSYTP 1798


>WB|WBGene00001215
            symbol:ego-2 species:6239 "Caenorhabditis elegans"
            [GO:0040002 "collagen and cuticulin-based cuticle development"
            evidence=IMP] [GO:0002009 "morphogenesis of an epithelium"
            evidence=IMP] [GO:0040007 "growth" evidence=IMP] [GO:0002119
            "nematode larval development" evidence=IMP] [GO:0040035
            "hermaphrodite genitalia development" evidence=IMP] [GO:0009792
            "embryo development ending in birth or egg hatching" evidence=IMP]
            [GO:0045747 "positive regulation of Notch signaling pathway"
            evidence=IGI] InterPro:IPR025304 Pfam:PF13949 GO:GO:0009792
            GO:GO:0002009 GO:GO:0040007 GO:GO:0002119 GO:GO:0045747
            GO:GO:0040035 Gene3D:1.25.40.280 InterPro:IPR004328 Pfam:PF03097
            SMART:SM01041 PROSITE:PS51180 GO:GO:0040002 EMBL:AL117201
            UniGene:Cel.16377 GeneID:190251 KEGG:cel:CELE_Y53H1C.2 CTD:190251
            RefSeq:NP_001251634.1 ProteinModelPortal:H8ESG1 WormBase:Y53H1C.2c
            Uniprot:H8ESG1
        Length = 1494

 Score = 136 (52.9 bits), Expect = 2.4e-07, Sum P(2) = 2.4e-07
 Identities = 79/280 (28%), Positives = 107/280 (38%)

Query:   240 SYGGATGNSENETSGRPVGQNAYEDGYGVPQG-----HGPPPSATTAGVVGAGPNTSTSA 294
             SYG  T      + G   G + Y++G   P G      GPP +   A    A P TS   
Sbjct:  1050 SYGAPT--PPQASYGPAPGAHGYQNGAQGPPGAEVGAQGPPGAHFGAHGASAPPPTS--- 1104

Query:   295 YAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP-GYDP--- 350
             Y A     P +A+Y     PG +   G  ++A  A +  PT   +  P +GP G  P   
Sbjct:  1105 YGAPTPQRPPQASYGA--APGAQGPPGGQFEAHGAAALPPTSHGAPTP-QGPFGAAPGAQ 1161

Query:   351 --TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQR 408
                +GP Y  Q+G+ Y+AQ+ P   I   P   PQ    +  Q G        PG +   
Sbjct:  1162 FGAQGP-Y-GQQGARYEAQKSPGAAIFGAPGAPPQHQGSFGAQFGVPPPQNSAPGAQFGA 1219

Query:   409 VPGYDVQRGPVYEAQRAPSY-IPQRGPGYDL-QRG-QGYDMRRAP---SYD-----P-SR 456
              P       P    Q  PSY  P   P   + Q   QG  +   P   S+      P +R
Sbjct:  1220 KPEAS-SHAPTPPPQPHPSYQAPAPPPALSVFQHSPQGAPITAPPPASSHHEHIAAPQAR 1278

Query:   457 GTGFDGAPRG--AAPHG-QVPPPLNNVPYGSATPPARSGS 493
              T   GAP    A P   +   P N  P   A P A++ +
Sbjct:  1279 FTPTPGAPSPWHATPAELKFQTPWNTTPQYHAPPGAQAAA 1318

 Score = 126 (49.4 bits), Expect = 2.9e-06, Sum P(2) = 2.9e-06
 Identities = 70/267 (26%), Positives = 94/267 (35%)

Query:   235 RAADGSYGGATGNSENETSGRPVG-QNAYEDGYGVPQGHGPPPS---ATTAGVVGAGPNT 290
             + A G++ GA G       G   G Q A    +G     GPPP+   A T      GP  
Sbjct:  1011 QGAQGAHFGAQG-----AQGAHFGAQGAQGTQFGAQGAQGPPPASYGAPTPPQASYGPAP 1065

Query:   291 STSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDP 350
                 Y     G P     ++    G +   G  + A  A +  PT   +  P + P    
Sbjct:  1066 GAHGYQNGAQGPP---GAEV----GAQGPPGAHFGAHGASAPPPTSYGAPTPQRPPQASY 1118

Query:   351 TKGPGYDAQKGSNYDAQRGPNYD--IHRGPSYDPQRGLGYDMQ---RGPNYDMQRGPGYE 405
                PG     G  ++A          H  P+     G     Q   +GP Y  Q+G  YE
Sbjct:  1119 GAAPGAQGPPGGQFEAHGAAALPPTSHGAPTPQGPFGAAPGAQFGAQGP-YG-QQGARYE 1176

Query:   406 TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPR 465
              Q+ PG       ++ A  AP   PQ    +  Q G       AP      G  F   P 
Sbjct:  1177 AQKSPG-----AAIFGAPGAP---PQHQGSFGAQFGVPPPQNSAP------GAQFGAKPE 1222

Query:   466 GAAPHGQVPPPLNNVPYGS-ATPPARS 491
              A+ H   PPP  +  Y + A PPA S
Sbjct:  1223 -ASSHAPTPPPQPHPSYQAPAPPPALS 1248

 Score = 70 (29.7 bits), Expect = 2.4e-07, Sum P(2) = 2.4e-07
 Identities = 30/122 (24%), Positives = 58/122 (47%)

Query:    57 SQHVEMQKLATENQRLA-ATHGTLRQELAAAQHEL--QIL----HGQIGGMKSERELQMR 109
             ++H+E  K    +   A A H    Q L     E+  +I+     G++    S  ELQ+R
Sbjct:   520 AEHLEQAKAHNVSLNKAIAQHSANLQLLTLPCREMWMKIVPPEQQGEMRNGSSPEELQVR 579

Query:   110 NLTEKIAKMEAEL-KTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQ 168
              + EK+ +M+A+  K  E  + +  K+   +  L+   E    ++  +  +L + HT++Q
Sbjct:   580 KMIEKVMEMQAQRRKLVEQFEADL-KADNISNKLMGTNERGAEEI--MKSELTK-HTNIQ 635

Query:   169 QI 170
             Q+
Sbjct:   636 QL 637


>ZFIN|ZDB-GENE-030131-5725
            symbol:arid1ab "AT rich interactive domain 1Ab
            (SWI-like)" species:7955 "Danio rerio" [GO:0003677 "DNA binding"
            evidence=IEA] [GO:0005622 "intracellular" evidence=IEA]
            InterPro:IPR001606 Pfam:PF01388 PROSITE:PS51011 SMART:SM00501
            ZFIN:ZDB-GENE-030131-5725 GO:GO:0003677 GO:GO:0005622
            Gene3D:1.10.150.60 InterPro:IPR021906 Pfam:PF12031 SUPFAM:SSF46774
            GeneTree:ENSGT00550000074575 EMBL:CABZ01050711 EMBL:CT027837
            IPI:IPI00485842 Ensembl:ENSDART00000084272 Bgee:F1RE50
            Uniprot:F1RE50
        Length = 2135

 Score = 157 (60.3 bits), Expect = 3.4e-07, Sum P(2) = 3.4e-07
 Identities = 78/257 (30%), Positives = 104/257 (40%)

Query:   239 GSYGGATGNSENETSGRPVGQNAYEDGYGVP-QGHGPP-PSATTAGVVGAGPNTSTSAYA 296
             G + GA GN  ++  G P      + G   P QG+GPP P     G+ G    TS +  +
Sbjct:   312 GQHYGA-GNPYSQQQGPPPSS---QQGPPYPGQGYGPPGPQRYPMGMQG---RTSGNL-S 363

Query:   297 ATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGP---SY--DPAKGPGYDP- 350
               Q G  M   Y    GPG       GY   + PS  P  GP   SY   P+ GPG  P 
Sbjct:   364 GIQYGQQM--GYG-QHGPGGYGQNQAGYYGQQGPS--PHGGPQQSSYPQQPSTGPGSQPP 418

Query:   351 -TKGPGYD--AQKGSNYDAQRGPNYDIHRGPSYD--PQRGLG---YDMQRGPNYDMQRGP 402
              ++ P      Q G++Y   +GP+      P Y   PQ   G   +   +GP        
Sbjct:   419 YSQQPSGTPHGQSGTSYGQPQGPHVPNQGQPPYSQTPQSQSGQSPFPQSQGPTQSQGPSQ 478

Query:   403 GYE-TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSY---DPSRGT 458
             G + +Q  PGY     P    Q A     Q+GP    Q+ QG    + PS     PS+ T
Sbjct:   479 GQQGSQSQPGYT--HPPSGSGQPAQ----QQGPS---QQQQGPPQSQTPSSAPPQPSQQT 529

Query:   459 GFDGAPRGAAPHGQVPP 475
                G P   +P+ Q PP
Sbjct:   530 SGQGQP---SPYSQTPP 543

 Score = 126 (49.4 bits), Expect = 0.00068, Sum P(2) = 0.00068
 Identities = 80/300 (26%), Positives = 111/300 (37%)

Query:   225 ELMNAPNVDRRAAD---GSYGGATGNSENETSGRPVGQNAYEDGYGVPQ--GHGPPPSAT 279
             +L+ +P+  R   +     YGG  G ++    G     + Y  G+   Q   H PPP + 
Sbjct:   232 QLLTSPSSTRSYQNYPASEYGGQEGAAKGP--GDMGSSSQYGGGHPAWQQRSHHPPPMSP 289

Query:   280 TAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPS 339
               G  G    T        Q G      Y    G  Y   +GP   + + P Y P +G  
Sbjct:   290 --GNTGQANRTQPPG-PMDQVGKIRGQHYGA--GNPYSQQQGPPPSSQQGPPY-PGQG-- 341

Query:   340 YDPAKGPGYDPTKGPGYDAQK--GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPN-- 395
             Y P  GP   P    G  +    G  Y  Q G  Y  H GP    Q   GY  Q+GP+  
Sbjct:   342 YGPP-GPQRYPMGMQGRTSGNLSGIQYGQQMG--YGQH-GPGGYGQNQAGYYGQQGPSPH 397

Query:   396 -------YDMQ--RGPGYE---TQRVPGYDV-QRGPVYEAQRAPSYIPQRG-PGYDLQRG 441
                    Y  Q   GPG +   +Q+  G    Q G  Y   + P ++P +G P Y  Q  
Sbjct:   398 GGPQQSSYPQQPSTGPGSQPPYSQQPSGTPHGQSGTSYGQPQGP-HVPNQGQPPYS-QTP 455

Query:   442 QGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNP 501
             Q     ++P +  S+G      P       Q  P   + P GS  P  + G  Q + G P
Sbjct:   456 QSQS-GQSP-FPQSQGPTQSQGPSQGQQGSQSQPGYTHPPSGSGQPAQQQGPSQQQQGPP 513

 Score = 50 (22.7 bits), Expect = 3.4e-07, Sum P(2) = 3.4e-07
 Identities = 9/12 (75%), Positives = 9/12 (75%)

Query:    30 GMRPPMPGAFPP 41
             GM P  PGAFPP
Sbjct:   101 GMAPHHPGAFPP 112


>UNIPROTKB|J3KNM7
            symbol:COL4A4 "Collagen alpha-4(IV) chain" species:9606
            "Homo sapiens" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
            EMBL:CH471063 InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 Gene3D:2.170.240.10
            EMBL:AC079235 EMBL:AC073149 UniGene:Hs.591645 HGNC:HGNC:2206
            ChiTaRS:COL4A4 ProteinModelPortal:J3KNM7 Ensembl:ENST00000329662
            Uniprot:J3KNM7
        Length = 1687

 Score = 153 (58.9 bits), Expect = 5.5e-07, P = 5.5e-07
 Identities = 81/253 (32%), Positives = 101/253 (39%)

Query:   262 YEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKG 321
             Y   +G P   GPP      G  GA P  S S     + GTP  A  +IP  PG+    G
Sbjct:   672 YPGRHGPPGFDGPPGPKGFPGPQGA-PGLSGSDGHKGRPGTPGTA--EIPGPPGFRGDMG 728

Query:   322 -PGYDASKAPSYDPTKGPSYDPA----KG-PGYDPTKGP-GYDAQKG-SNYDAQRGPNYD 373
              PG+   K  S     GP   P     KG PG DP  G  G   ++G S     +GP  D
Sbjct:   729 DPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPG-DPAFGHLGPPGKRGLSGVPGIKGPRGD 787

Query:   374 IHRG-PSYDPQRGL-GYDMQRGPN-YDMQRG-PGYETQRVPGYDVQRG-PVYEAQRA-PS 427
                G P  +   G+ G+   +GP   +   G PG      PG+  +RG P    Q   P 
Sbjct:   788 --PGCPGAEGPAGIPGFLGLKGPKGREGHAGFPGVPGP--PGHSCERGAPGIPGQPGLPG 843

Query:   428 YIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVP--YGSA 485
             Y P   PG    +GQ  D+   P   P+   G  G P     HG  PP L  +P  +G  
Sbjct:   844 Y-PG-SPGAPGGKGQPGDV--GPP-GPAGMKGLPGLPGRPGAHG--PPGLPGIPGPFGDD 896

Query:   486 TPPARSGSGQPRG 498
               P   G   PRG
Sbjct:   897 GLPGPPGPKGPRG 909

 Score = 130 (50.8 bits), Expect = 0.00017, P = 0.00017
 Identities = 81/260 (31%), Positives = 104/260 (40%)

Query:   263 EDGY-GVPQGHGPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEAS 319
             E G+ GVP GH  P      G+ G  G   S +     + G P    +D P GP G+   
Sbjct:   640 ERGHPGVP-GH--PGVRGPDGLKGQKGDTISCNVTYPGRHGPP---GFDGPPGPKGFPGP 693

Query:   320 KG-PGYDASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGP----NY 372
             +G PG   S      P T G +  P   PG+    G PG+  +KGS+     GP      
Sbjct:   694 QGAPGLSGSDGHKGRPGTPGTAEIPGP-PGFRGDMGDPGFGGEKGSSPVGPPGPPGSPGV 752

Query:   373 DIHRGPSYDPQRG-LGYDMQRG----PNYDMQRG----PGYETQR-VPGYDVQRGPVYEA 422
             +  +G   DP  G LG   +RG    P     RG    PG E    +PG+   +GP    
Sbjct:   753 NGQKGIPGDPAFGHLGPPGKRGLSGVPGIKGPRGDPGCPGAEGPAGIPGFLGLKGPKGRE 812

Query:   423 QRA--PSYIPQRGPGYDLQRGQ-GYDMRRA-PSYDPSRGTGFDGAPRGAAPHGQVPPPLN 478
               A  P  +P   PG+  +RG  G   +   P Y P    G  GAP G    G V PP  
Sbjct:   813 GHAGFPG-VPGP-PGHSCERGAPGIPGQPGLPGY-P----GSPGAPGGKGQPGDVGPPGP 865

Query:   479 NVPYGSATPPARSGSGQPRG 498
                 G    P R G+  P G
Sbjct:   866 AGMKGLPGLPGRPGAHGPPG 885

 Score = 123 (48.4 bits), Expect = 0.00099, P = 0.00099
 Identities = 81/280 (28%), Positives = 104/280 (37%)

Query:   243 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGT 302
             GA+G  +    G PVG    +   G P   G  P     G  G  P    S+     +G 
Sbjct:  1190 GASGLHDVGPPG-PVGIPGLKGERGDPGSPGISPPGPR-GKKGP-PGPPGSSGPPGPAGA 1246

Query:   303 PMRAAYDIPRGPGYEASKGP-GYDASK-AP-------SYDPTKGPSYD-----PAKGPGY 348
               RA  DIP  PG    +GP G D  + AP       S D  +G   D     P   PG 
Sbjct:  1247 TGRAPKDIP-DPGPPGDQGPPGPDGPRGAPGPPGLPGSVDLLRGEPGDCGLPGPPGPPG- 1304

Query:   349 DPTKGPGYDAQKGSN-YDAQRGP-NYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRGP-GY 404
              P   PGY    G +  D Q+GP  +   +GP   P    G   ++G P    ++GP G 
Sbjct:  1305 -PPGPPGYKGFPGCDGKDGQKGPVGFPGPQGPHGFP----GPPGEKGLPGPPGRKGPTGL 1359

Query:   405 ETQRVPGYDVQRGP-VYEAQRAPSYI-PQRGPGYDLQRG-QGYDMRRAPSYDPSRGTGFD 461
               +  P  DV   P +     AP    P+   G    RG  G   +  P  D  RG   D
Sbjct:  1360 PGEPGPPADVDDCPRIPGLPGAPGMRGPEGAMGLPGMRGPSGPGCKGEPGLDGRRGV--D 1417

Query:   462 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNP 501
             G P    P G+      +   G   PP   G   P+G  P
Sbjct:  1418 GVPGSPGPPGRKGDTGEDGYPGGPGPPGPIGDPGPKGFGP 1457


>UNIPROTKB|P53420
            symbol:COL4A4 "Collagen alpha-4(IV) chain" species:9606
            "Homo sapiens" [GO:0005587 "collagen type IV" evidence=IDA]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IMP] [GO:0032836 "glomerular basement membrane
            development" evidence=IMP] [GO:0005605 "basal lamina" evidence=IDA]
            [GO:0005576 "extracellular region" evidence=TAS] [GO:0005788
            "endoplasmic reticulum lumen" evidence=TAS] [GO:0007411 "axon
            guidance" evidence=TAS] [GO:0030198 "extracellular matrix
            organization" evidence=TAS] InterPro:IPR001442 Pfam:PF01413
            PROSITE:PS51403 SMART:SM00111 Reactome:REACT_118779
            Reactome:REACT_111045 Reactome:REACT_111102 GO:GO:0007411
            InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005788 GO:GO:0005605
            InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793 GO:GO:0005201
            HOVERGEN:HBG004933 HOGENOM:HOG000085652 GO:GO:0005587
            Gene3D:2.170.240.10 KO:K06237 OrthoDB:EOG4XGZZF EMBL:AC079235
            EMBL:AB008496 MIM:141200 MIM:203780 Orphanet:88919 Orphanet:97562
            GO:GO:0032836 EMBL:X81053 EMBL:Y17397 EMBL:Y17398 EMBL:Y17399
            EMBL:Y17400 EMBL:Y17401 EMBL:Y17402 EMBL:Y17403 EMBL:Y17404
            EMBL:Y17405 EMBL:Y17406 EMBL:Y17407 EMBL:Y17408 EMBL:Y17409
            EMBL:Y17410 EMBL:Y17411 EMBL:Y17412 EMBL:Y17413 EMBL:Y17427
            EMBL:Y17426 EMBL:Y17414 EMBL:Y17415 EMBL:Y17416 EMBL:Y17417
            EMBL:Y17418 EMBL:Y17419 EMBL:Y17420 EMBL:Y17443 EMBL:Y17442
            EMBL:Y17441 EMBL:Y17440 EMBL:Y17439 EMBL:Y17438 EMBL:Y17437
            EMBL:Y17436 EMBL:Y17435 EMBL:Y17434 EMBL:Y17433 EMBL:Y17432
            EMBL:Y17431 EMBL:Y17430 EMBL:Y17429 EMBL:Y17428 EMBL:Y17421
            EMBL:Y17422 EMBL:Y17423 EMBL:Y17424 EMBL:Y17425 EMBL:AC073149
            EMBL:D17391 IPI:IPI00478572 PIR:A55360 RefSeq:NP_000083.3
            UniGene:Hs.591645 ProteinModelPortal:P53420 SMR:P53420
            IntAct:P53420 STRING:P53420 PhosphoSite:P53420 DMDM:259016360
            PaxDb:P53420 PRIDE:P53420 Ensembl:ENST00000396625 GeneID:1286
            KEGG:hsa:1286 UCSC:uc021vxr.1 CTD:1286 GeneCards:GC02M227867
            H-InvDB:HIX0030014 HGNC:HGNC:2206 MIM:120131 neXtProt:NX_P53420
            PharmGKB:PA26721 InParanoid:P53420 OMA:FRGDMGD ChiTaRS:COL4A4
            GenomeRNAi:1286 NextBio:5201 Bgee:P53420 CleanEx:HS_COL4A4
            Genevestigator:P53420 GermOnline:ENSG00000081052 Uniprot:P53420
        Length = 1690

 Score = 153 (58.9 bits), Expect = 5.6e-07, P = 5.6e-07
 Identities = 81/253 (32%), Positives = 101/253 (39%)

Query:   262 YEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKG 321
             Y   +G P   GPP      G  GA P  S S     + GTP  A  +IP  PG+    G
Sbjct:   672 YPGRHGPPGFDGPPGPKGFPGPQGA-PGLSGSDGHKGRPGTPGTA--EIPGPPGFRGDMG 728

Query:   322 -PGYDASKAPSYDPTKGPSYDPA----KG-PGYDPTKGP-GYDAQKG-SNYDAQRGPNYD 373
              PG+   K  S     GP   P     KG PG DP  G  G   ++G S     +GP  D
Sbjct:   729 DPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPG-DPAFGHLGPPGKRGLSGVPGIKGPRGD 787

Query:   374 IHRG-PSYDPQRGL-GYDMQRGPN-YDMQRG-PGYETQRVPGYDVQRG-PVYEAQRA-PS 427
                G P  +   G+ G+   +GP   +   G PG      PG+  +RG P    Q   P 
Sbjct:   788 --PGCPGAEGPAGIPGFLGLKGPKGREGHAGFPGVPGP--PGHSCERGAPGIPGQPGLPG 843

Query:   428 YIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVP--YGSA 485
             Y P   PG    +GQ  D+   P   P+   G  G P     HG  PP L  +P  +G  
Sbjct:   844 Y-PG-SPGAPGGKGQPGDV--GPP-GPAGMKGLPGLPGRPGAHG--PPGLPGIPGPFGDD 896

Query:   486 TPPARSGSGQPRG 498
               P   G   PRG
Sbjct:   897 GLPGPPGPKGPRG 909

 Score = 130 (50.8 bits), Expect = 0.00017, P = 0.00017
 Identities = 81/260 (31%), Positives = 104/260 (40%)

Query:   263 EDGY-GVPQGHGPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEAS 319
             E G+ GVP GH  P      G+ G  G   S +     + G P    +D P GP G+   
Sbjct:   640 ERGHPGVP-GH--PGVRGPDGLKGQKGDTISCNVTYPGRHGPP---GFDGPPGPKGFPGP 693

Query:   320 KG-PGYDASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGP----NY 372
             +G PG   S      P T G +  P   PG+    G PG+  +KGS+     GP      
Sbjct:   694 QGAPGLSGSDGHKGRPGTPGTAEIPGP-PGFRGDMGDPGFGGEKGSSPVGPPGPPGSPGV 752

Query:   373 DIHRGPSYDPQRG-LGYDMQRG----PNYDMQRG----PGYETQR-VPGYDVQRGPVYEA 422
             +  +G   DP  G LG   +RG    P     RG    PG E    +PG+   +GP    
Sbjct:   753 NGQKGIPGDPAFGHLGPPGKRGLSGVPGIKGPRGDPGCPGAEGPAGIPGFLGLKGPKGRE 812

Query:   423 QRA--PSYIPQRGPGYDLQRGQ-GYDMRRA-PSYDPSRGTGFDGAPRGAAPHGQVPPPLN 478
               A  P  +P   PG+  +RG  G   +   P Y P    G  GAP G    G V PP  
Sbjct:   813 GHAGFPG-VPGP-PGHSCERGAPGIPGQPGLPGY-P----GSPGAPGGKGQPGDVGPPGP 865

Query:   479 NVPYGSATPPARSGSGQPRG 498
                 G    P R G+  P G
Sbjct:   866 AGMKGLPGLPGRPGAHGPPG 885


>WB|WBGene00004203
            symbol:swsn-1 species:6239 "Caenorhabditis elegans"
            [GO:0003682 "chromatin binding" evidence=IEA] [GO:0000003
            "reproduction" evidence=IGI;IMP] [GO:0040035 "hermaphrodite
            genitalia development" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IGI;IMP] [GO:0009792 "embryo development ending in birth
            or egg hatching" evidence=IGI;IMP] [GO:0040018 "positive regulation
            of multicellular organism growth" evidence=IGI;IMP] [GO:0040010
            "positive regulation of growth rate" evidence=IMP] [GO:0040027
            "negative regulation of vulval development" evidence=IMP]
            [GO:0046662 "regulation of oviposition" evidence=IMP] [GO:0002009
            "morphogenesis of an epithelium" evidence=IMP] [GO:0035262 "gonad
            morphogenesis" evidence=IMP] InterPro:IPR001005 InterPro:IPR007526
            InterPro:IPR009057 Pfam:PF00249 Pfam:PF04433 PROSITE:PS50934
            SMART:SM00717 GO:GO:0005634 GO:GO:0009792 GO:GO:0002009
            GO:GO:0040007 GO:GO:0040010 GO:GO:0002119 GO:GO:0003677
            GO:GO:0040018 Gene3D:1.10.10.10 InterPro:IPR011991 GO:GO:0003682
            Gene3D:1.10.10.60 SUPFAM:SSF46689 GO:GO:0046662 GO:GO:0040035
            InterPro:IPR017884 PROSITE:PS51293 GO:GO:0040027 GO:GO:0035262
            EMBL:AL110477 KO:K11649 UniGene:Cel.7072 GeneID:180324
            KEGG:cel:CELE_Y113G7B.23 CTD:180324 RefSeq:NP_001256907.1
            ProteinModelPortal:H8ESF3 SMR:H8ESF3 WormBase:Y113G7B.23c
            Uniprot:H8ESF3
        Length = 792

 Score = 149 (57.5 bits), Expect = 6.0e-07, P = 6.0e-07
 Identities = 85/312 (27%), Positives = 122/312 (39%)

Query:   201 HLESL-QVMEKNYITMATEVEKLRAELMNAPNVDRRAADGSYGGATGNSENETSGR-PVG 258
             H + L Q+M+K   ++  +  +L  E   A ++D+     +      +S   +SG  P G
Sbjct:   493 HFDELEQIMDKERESLEYQRHQLILE-RQAFHMDQLKYLENRAKHEAHSRMTSSGALPAG 551

Query:   259 QNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEA 318
                  +  G PQ   P P    +    A P    ++ AAT +  P  +    P+ P  +A
Sbjct:   552 LPPGFEVTGPPQ---PTPQVQISAQEAAIPEKMDTSEAATAARPP--STPQAPQAPPVQA 606

Query:   319 SKGP--GYDASKAP--SYDPTKGPSYDPAKGPGYDPTKGPGYDA----QKGSNYDAQRGP 370
             +  P     A +AP  +Y    GP   P +   Y P +G  Y      Q+   + AQ+  
Sbjct:   607 APAPVQAPQAPQAPPQAYQGYGGPGGPPQQAYRYPPQQGQQYSPYPPPQQQQQHQAQQAQ 666

Query:   371 NYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRG-PVYEAQRAPSYI 429
             +   H GP    Q G     Q    Y     PG       GY  Q+  P Y+AQ  P   
Sbjct:   667 S-QAHYGPPGGGQ-GPPPPPQGQQYYGGPPPPGQPYGPPGGYPPQQQRPPYQAQPYPG-- 722

Query:   430 PQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPA 489
             P   P    QRG GY     P   P     F G P    P+GQ+PPP    P+G   P  
Sbjct:   723 P---PPPQQQRGYGYP----PPPQPV----FSGHPY-QQPYGQMPPP----PHGQYQPQQ 766

Query:   490 RSGSGQ-PRGGN 500
             + G    P GG+
Sbjct:   767 QQGGPMGPPGGH 778


>UNIPROTKB|D4ADB1
            symbol:D4ADB1 "Uncharacterized protein" species:10116
            "Rattus norvegicus" [GO:0008270 "zinc ion binding" evidence=IEA]
            Pfam:PF00595 Pfam:PF00412 InterPro:IPR001478 InterPro:IPR001781
            PROSITE:PS00478 PROSITE:PS50023 PROSITE:PS50106 SMART:SM00132
            SMART:SM00228 GO:GO:0046872 GO:GO:0008270 Gene3D:2.10.110.10
            SUPFAM:SSF50156 InterPro:IPR006643 SMART:SM00735 IPI:IPI00951885
            PRIDE:D4ADB1 Ensembl:ENSRNOT00000043713 ArrayExpress:D4ADB1
            Uniprot:D4ADB1
        Length = 684

 Score = 148 (57.2 bits), Expect = 6.3e-07, P = 6.3e-07
 Identities = 50/182 (27%), Positives = 70/182 (38%)

Query:   252 TSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIP 311
             TS  P    +Y +G   P    P P   T   +   P+      A+  S +P  A Y  P
Sbjct:   331 TSPAPAAHTSYSEGPAAP---APKPRVVTTASIR--PSVYQPVPASPYSPSP-GANYS-P 383

Query:   312 RGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPN 371
               P Y  S  P Y  S AP+Y P+  P+Y P+  P Y P+  P Y+    + Y    GP+
Sbjct:   384 T-P-YTPSPAPAYTPSPAPTYTPSPAPTYSPSPAPAYTPSPAPNYNPTPSAAYSG--GPS 439

Query:   372 YDIHRGP-----SYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVPGYDVQRGPVYEAQRA 425
                 R P     S+  +   G          + RG P Y         + RG    A+R 
Sbjct:   440 ESASRPPWVTDDSFSQKFAPGKSTTSVSKQTLPRGAPAYNPTGPQVTPLARGTFQRAERF 499

Query:   426 PS 427
             P+
Sbjct:   500 PA 501


>FB|FBgn0035872
            symbol:CG7185 species:7227 "Drosophila melanogaster"
            [GO:0003729 "mRNA binding" evidence=ISS] [GO:0006379 "mRNA
            cleavage" evidence=ISS] [GO:0005849 "mRNA cleavage factor complex"
            evidence=ISS] [GO:0000166 "nucleotide binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=IC] [GO:0000381 "regulation of alternative mRNA
            splicing, via spliceosome" evidence=IMP] InterPro:IPR000504
            InterPro:IPR012677 PROSITE:PS50102 SMART:SM00360 EMBL:AE014296
            GO:GO:0000166 GO:GO:0003729 Gene3D:3.30.70.330 GO:GO:0000381
            GO:GO:0006379 GO:GO:0005849 eggNOG:NOG313287 KO:K14398
            GeneTree:ENSGT00690000101901 EMBL:AY058563 RefSeq:NP_648206.1
            UniGene:Dm.887 ProteinModelPortal:Q9VSH4 SMR:Q9VSH4 IntAct:Q9VSH4
            MINT:MINT-1562127 STRING:Q9VSH4 PaxDb:Q9VSH4
            EnsemblMetazoa:FBtr0076710 GeneID:38937 KEGG:dme:Dmel_CG7185
            UCSC:CG7185-RA FlyBase:FBgn0035872 InParanoid:Q9VSH4 OMA:PYERGDY
            OrthoDB:EOG4S1RQ4 PhylomeDB:Q9VSH4 ChiTaRS:CG7185 GenomeRNAi:38937
            NextBio:811101 Bgee:Q9VSH4 Uniprot:Q9VSH4
        Length = 652

 Score = 141 (54.7 bits), Expect = 6.6e-07, Sum P(2) = 6.6e-07
 Identities = 63/199 (31%), Positives = 79/199 (39%)

Query:   311 PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPG-YDAQKGSNYDAQRG 369
             PRGP    S G G   +  P      GP   P +G   +    PG Y  Q  S      G
Sbjct:   197 PRGPA-PPSMGGGPMPTGHPGGPQGGGPPGHPPRG--MNSIMQPGQYRPQHMSQVPQVGG 253

Query:   370 PNYDIHR-GPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSY 428
             PN    R  P   PQ GL  + Q  P Y   +G  +  QR PG   + GP     + P +
Sbjct:   254 PNSGPPRMQPPMHPQGGLMGNQQPPPRYPSAQGQ-WPGQR-PG-GPRPGPPNGPPQRPMF 310

Query:   429 IPQRGP-GYDLQRGQGYDMRRAPSYD--PSRGT--GFDGAPRGAAPHGQVPPPLNNVPYG 483
               Q GP G  ++   G D RR P +   P +G   G   AP    PHG   P +N   + 
Sbjct:   311 --QGGPMGMPVRGPAGPDWRRPPMHGGFPPQGPPRGLPPAPGPGGPHGAPAPHVNPAFFN 368

Query:   484 SATPPARS-GSGQPRGGNP 501
                 PA+  G G P  G P
Sbjct:   369 QPGGPAQHPGMGGPPHGAP 387

 Score = 112 (44.5 bits), Expect = 0.00091, Sum P(2) = 0.00091
 Identities = 53/171 (30%), Positives = 61/171 (35%)

Query:   334 PTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG 393
             P +GP+  P+ G G  PT  PG     G      RG N  +  G  Y PQ         G
Sbjct:   196 PPRGPA-PPSMGGGPMPTGHPGGPQGGGPPGHPPRGMNSIMQPG-QYRPQHMSQVPQVGG 253

Query:   394 PNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG--PGYDLQRGQGYDMRRAPS 451
             PN     GP    +  P    Q G +   Q  P Y   +G  PG   QR  G   R  P 
Sbjct:   254 PN----SGP---PRMQPPMHPQGGLMGNQQPPPRYPSAQGQWPG---QRPGG--PRPGPP 301

Query:   452 YDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPA 502
               P +   F G P G    G   P     P     PP     G PRG  PA
Sbjct:   302 NGPPQRPMFQGGPMGMPVRGPAGPDWRRPPMHGGFPP----QGPPRGLPPA 348

 Score = 52 (23.4 bits), Expect = 6.6e-07, Sum P(2) = 6.6e-07
 Identities = 24/76 (31%), Positives = 30/76 (39%)

Query:   246 GNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMR 305
             G +++E  G   G + Y+D  G     GP  SA + G  G G   S    A   SG P  
Sbjct:    19 GQAQDEFGGD--GVDLYDD-IG-----GPTESAASGG--GGGGTPSADGAAGPGSGEPGE 68

Query:   306 AAYDIPRGPGYEASKG 321
                  P G  Y  S G
Sbjct:    69 RNSGGPNGV-YHQSSG 83

 Score = 42 (19.8 bits), Expect = 7.0e-06, Sum P(2) = 7.0e-06
 Identities = 9/23 (39%), Positives = 12/23 (52%)

Query:   236 AADGSYGGATGNSENETSGRPVG 258
             +ADG+ G  +G      SG P G
Sbjct:    54 SADGAAGPGSGEPGERNSGGPNG 76


>TAIR|locus:2012713
            symbol:AT1G33680 "AT1G33680" species:3702 "Arabidopsis
            thaliana" [GO:0003676 "nucleic acid binding" evidence=ISS]
            [GO:0003723 "RNA binding" evidence=IEA] [GO:0005634 "nucleus"
            evidence=ISM] [GO:0008150 "biological_process" evidence=ND]
            [GO:0005829 "cytosol" evidence=IDA] InterPro:IPR004087
            InterPro:IPR004088 Pfam:PF13014 PROSITE:PS50084 SMART:SM00322
            EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0005829 GO:GO:0003723
            eggNOG:NOG300923 KO:K13210 UniGene:At.39892 UniGene:At.71035
            HOGENOM:HOG000242545 EMBL:AK229850 EMBL:AK229909 EMBL:AK230055
            IPI:IPI00786006 RefSeq:NP_174629.3 ProteinModelPortal:Q0WLY0
            SMR:Q0WLY0 STRING:Q0WLY0 PaxDb:Q0WLY0 PRIDE:Q0WLY0
            EnsemblPlants:AT1G33680.1 GeneID:840259 KEGG:ath:AT1G33680
            TAIR:At1g33680 InParanoid:Q0WLY0 OMA:PSYGSTP PhylomeDB:Q0WLY0
            ProtClustDB:CLSN2690290 Genevestigator:Q0WLY0 Uniprot:Q0WLY0
        Length = 763

 Score = 144 (55.7 bits), Expect = 9.7e-07, Sum P(2) = 9.7e-07
 Identities = 65/233 (27%), Positives = 82/233 (35%)

Query:   241 YGGATGNSENETSGRPVG-QNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQ 299
             Y  A G  + +   RP G Q + E GYG P+   PP      G   A P+  ++  AA+ 
Sbjct:   537 YPSAGGQHQMQQPSRPYGMQGSAEQGYGPPRPAAPPGDVPYQGPTPAAPSYGSTPAAASY 596

Query:   300 SGTPMRAAY-DIPRGPGYEASKGP----GYDASKAPSYDPTKGPSYDPAK-GPGYD---- 349
               TP   +Y   P  P Y ++       GY AS AP+      PSY  A    GY+    
Sbjct:   597 GSTPAAPSYGSTPAAPSYGSNMAQQQQYGY-ASSAPTQQTY--PSYSSAAPSDGYNGTQP 653

Query:   350 PTKGPGYD---AQKGSNYDAQRG------PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQR 400
             P   P Y+   AQ  S      G      P       PS  P  G     Q   NY    
Sbjct:   654 PAVAPAYEQHGAQPASGVQQTSGGYGQVPPTGGYSSYPSTQPAYG-NTPAQSNGNY---- 708

Query:   401 GPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGP---GYDLQRGQGYDMRRAP 450
               GY   + P Y       Y A    +   Q  P   GY+    Q      AP
Sbjct:   709 --GYIGSQYPSYGGGNASAYAAPTGQTAYSQTAPPQAGYEQSATQSAGYAAAP 759

 Score = 49 (22.3 bits), Expect = 9.7e-07, Sum P(2) = 9.7e-07
 Identities = 10/19 (52%), Positives = 11/19 (57%)

Query:    29 SGMRPPMPGAFPPFDMMPP 47
             S  RPP  G +PP   MPP
Sbjct:   444 SHFRPPNSGGYPP-QHMPP 461


>UNIPROTKB|I3LQ53
            symbol:I3LQ53 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0006366 "transcription from RNA polymerase II promoter"
            evidence=IEA] [GO:0005665 "DNA-directed RNA polymerase II, core
            complex" evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA]
            InterPro:IPR000684 Pfam:PF05001 PROSITE:PS00115 GO:GO:0003677
            GO:GO:0006366 GO:GO:0005665 GeneTree:ENSGT00700000104490
            EMBL:FP565284 Ensembl:ENSSSCT00000030016 OMA:YAESDYL Uniprot:I3LQ53
        Length = 543

 Score = 144 (55.7 bits), Expect = 1.2e-06, P = 1.2e-06
 Identities = 72/276 (26%), Positives = 99/276 (35%)

Query:   218 EVEKLRAELMNAPNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP- 276
             + EK +  +    N+    A G  G   G++ +   G       +  G     G   P  
Sbjct:    49 DAEKCKYGMEIPTNIPGLGAAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSV 108

Query:   277 -SATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPT 335
              S  T G  G  P+ ++ A   +   +P  A    P  PG      PG  +   PS    
Sbjct:   109 GSGMTPGAAGFSPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGA 161

Query:   336 KGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPN 395
               PSY P   P Y+P    GY  Q  S Y +   P+Y     PSY P     Y     P+
Sbjct:   162 MSPSYSPTS-PAYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPS 215

Query:   396 YDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPS 455
             Y     P Y     P Y     P Y +  +PSY P   P Y       Y    +PSY P+
Sbjct:   216 YS-PTSPSYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPT 268

Query:   456 RGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARS 491
               +    +P   +P      P +  P  S T P+ S
Sbjct:   269 SPSYSPTSP-SYSPTSPSYSPTS--PSYSPTSPSYS 301

 Score = 121 (47.7 bits), Expect = 0.00040, P = 0.00040
 Identities = 63/225 (28%), Positives = 80/225 (35%)

Query:   274 PPPSATTAGVVGAGPNTSTSAYAATQSG-TPMRAAYDIPRGPGYEASKGPGYDAS----- 327
             P  S T+       PN     Y  T    +P   +Y  P  P Y  +  P Y  S     
Sbjct:   333 PSYSPTSPSYSPTSPN-----YTPTSPNYSPTSPSYS-PTSPSYSPTS-PSYSPSSPRYT 385

Query:   328 -KAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL 386
              ++P+Y P+  PSY P+  P Y PT  P Y     S Y     P Y     P Y P    
Sbjct:   386 PQSPTYTPSS-PSYSPSS-PSYSPTS-PKYTPTSPS-YSPS-SPEYT-PTSPKYSPTSPK 439

Query:   387 GYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDM 446
              Y     P Y     P Y +   P Y     P Y +  +P Y P   P Y       Y  
Sbjct:   440 -YS-PTSPKYS-PTSPTY-SPTTPKYS-PTSPTY-SPTSPVYTPT-SPKYS-PTSPTYSP 491

Query:   447 RRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARS 491
               +P Y P+  T    +P+G+      P      P  S T PA S
Sbjct:   492 T-SPKYSPTSPTYSPTSPKGSTYSPTSPGYSPTSPTYSLTSPAIS 535


>UNIPROTKB|P02457
            symbol:COL1A1 "Collagen alpha-1(I) chain" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            [GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000885
            InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410 ProDom:PD002078
            PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038
            SMART:SM00214 GO:GO:0046872 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 HOVERGEN:HBG004933 EMBL:M17839
            EMBL:M17838 EMBL:V00401 EMBL:M10571 EMBL:M17607 IPI:IPI00572548
            PIR:A27179 PIR:A90458 PIR:I50629 PIR:S07234 UniGene:Gga.2073
            UniGene:Gga.43371 IntAct:P02457 PRIDE:P02457 Uniprot:P02457
        Length = 1453

 Score = 149 (57.5 bits), Expect = 1.3e-06, P = 1.3e-06
 Identities = 90/285 (31%), Positives = 109/285 (38%)

Query:   237 ADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNTSTS 293
             ADG  G  G TG++  +    P G  A   G   P G  G P      G   AGP  +T 
Sbjct:   808 ADGQPGAKGETGDAGAKGDAGPPGP-AGPTGAPGPAGZVGAPGPKGARG--SAGPPGATG 864

Query:   294 AYAATQSGTPMRAAYDI----PRGP-GYEASKGPGYDASKA--PSYDPTKGPSYDPA-KG 345
                A     P   + +I    P GP G + SKGP  +   A  P      GP   P  KG
Sbjct:   865 FPGAAGRVGPPGPSGNIGLPGPPGPAGKZGSKGPRGETGPAGRPGEPGPAGPPGPPGEKG 924

Query:   346 -PGYD-P-----TKGP-GYDAQKGS-NYDAQRGPN-YDIHRGPSYDP-QRG-LGYDMQRG 393
              PG D P     T GP G   Q+G      QRG   +    GPS +P ++G  G   +RG
Sbjct:   925 SPGADGPIGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQGPSGASGERG 984

Query:   394 PNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYD 453
             P   M  GP       PG     GP  EA R  +   +  PG D   G   D        
Sbjct:   985 PPGPM--GP-------PGL---AGPPGEAGREGAPGAEGAPGRDGAAGPKGDRGETGPAG 1032

Query:   454 PSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 498
             P    G  GAP    P G+        P G A PP  +G+  P G
Sbjct:  1033 PPGAPGAPGAPGPVGPAGKNGDRGETGPAGPAGPPGPAGARGPAG 1077


>UNIPROTKB|D3ZZM1
            symbol:Taf15 "Protein Taf15" species:10116 "Rattus
            norvegicus" [GO:0000166 "nucleotide binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005622
            "intracellular" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358
            PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360 SMART:SM00547
            RGD:1309595 GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330
            GO:GO:0003676 GO:GO:0005622 EMBL:AC119615 IPI:IPI00950003
            ProteinModelPortal:D3ZZM1 Ensembl:ENSRNOT00000064396
            ArrayExpress:D3ZZM1 Uniprot:D3ZZM1
        Length = 558

 Score = 119 (46.9 bits), Expect = 1.5e-06, Sum P(2) = 1.5e-06
 Identities = 66/216 (30%), Positives = 77/216 (35%)

Query:   258 GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYE 317
             G+  Y  G G  QG G  P       V   P+     +A   S             P   
Sbjct:   334 GRGGYR-GRGGFQGRGGDPK--NGDWVCPNPSCGNMNFARRNSCNQCNEPRPEDSRPSGG 390

Query:   318 ASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQK-GSNYDAQR-GPNYDIH 375
               +G GY   +   +    G   D   G G D + G GY   + G +Y A R G  Y   
Sbjct:   391 DFRGRGYGGERG--FRGRGGRGGDRG-GYGADRSGG-GYGGDRSGGSYGADRSGGGYGGD 446

Query:   376 R-GPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGP 434
             R G  Y   RG GY   RG  Y   RG  Y   R  GY   RG  Y   R   Y   R  
Sbjct:   447 RSGGGYGGDRGGGYGGDRG-GYGGDRGGSYGGDR-GGYGGDRGG-YGGDRG-GYGGDRSR 502

Query:   435 G-YDLQRGQG---YDMRRAPSYDPSRGTGFDGAPRG 466
             G Y   RG G   Y   R+  Y   RG G+ G  RG
Sbjct:   503 GAYGGDRGGGSGGYGGDRSGGYGGDRGGGY-GGDRG 537

 Score = 70 (29.7 bits), Expect = 1.5e-06, Sum P(2) = 1.5e-06
 Identities = 28/106 (26%), Positives = 48/106 (45%)

Query:   184 YHHCRGTYEYEKKF-----YNDHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRRAAD 238
             Y   +G+Y+ +  +     YN + +S     +NY +  T+ +  R + M+    D R   
Sbjct:   121 YDQHQGSYDEQSNYQQHDSYNQNQQSYHPQRENY-SHHTQDD--RRD-MSRYGEDNRGYG 176

Query:   239 GSYGGATGNSENETSGR-PV-GQNAYEDG----YGVPQGHGPPPSA 278
             GS GG  G    +  GR P+ G +  + G    +G  + +GP P A
Sbjct:   177 GSQGGGRGRGGYDKDGRGPMTGSSGGDRGGFKNFGGHRDYGPRPDA 222


>TAIR|locus:2012788
            symbol:AT1G10390 "AT1G10390" species:3702 "Arabidopsis
            thaliana" [GO:0005215 "transporter activity" evidence=ISS]
            [GO:0005634 "nucleus" evidence=ISM] [GO:0005643 "nuclear pore"
            evidence=IEA] [GO:0006810 "transport" evidence=IEA] [GO:0005515
            "protein binding" evidence=IPI] [GO:0005635 "nuclear envelope"
            evidence=IDA] InterPro:IPR007230 Pfam:PF04096 PROSITE:PS51434
            EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0005635 GO:GO:0006810
            GO:GO:0005643 eggNOG:NOG12793 SUPFAM:SSF82215 KO:K14297 HSSP:Q9Y6J4
            EMBL:AY078948 EMBL:BT003030 EMBL:AK226964 IPI:IPI00523265
            RefSeq:NP_001031018.1 RefSeq:NP_172510.2 UniGene:At.27877
            ProteinModelPortal:Q8RY25 SMR:Q8RY25 STRING:Q8RY25 MEROPS:S59.A02
            PaxDb:Q8RY25 PRIDE:Q8RY25 EnsemblPlants:AT1G10390.1
            EnsemblPlants:AT1G10390.2 GeneID:837579 KEGG:ath:AT1G10390
            TAIR:At1g10390 HOGENOM:HOG000085153 InParanoid:Q8RY25 OMA:ESISAMP
            PhylomeDB:Q8RY25 ProtClustDB:CLSN2713828 Genevestigator:Q8RY25
            Uniprot:Q8RY25
        Length = 1041

 Score = 146 (56.5 bits), Expect = 1.8e-06, P = 1.8e-06
 Identities = 55/280 (19%), Positives = 93/280 (33%)

Query:   226 LMNAPNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP---SATTAG 282
             +  AP      A     GA+ +     S    G +     +G   G G  P   S   + 
Sbjct:    63 VFGAPQTSSPFASTPTFGASSSPAFGNSTPAFGASPASSPFGGSSGFGQKPLGFSTPQSN 122

Query:   283 VVGAGPNTSTSAYAATQSG--TPMRA----AYDIPRGPGYEASKGPGYDASKAPSYDPTK 336
               G     S  A+  T  G  TP  A    A+  P  P + A+  P + AS  P++  T 
Sbjct:   123 PFGNSTQQSQPAFGNTSFGSSTPFGATNTPAFGAPSTPSFGATSTPSFGASSTPAFGATN 182

Query:   337 GPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGP--NYDIHRGPSYDPQRGLGYDMQRGP 394
              P++  +  P +  T  P + A     + +      N     G ++       +     P
Sbjct:   183 TPAFGASNSPSFGATNTPAFGASPTPAFGSTGTTFGNTGFGSGGAFGASNTPAFGASGTP 242

Query:   395 NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDP 454
              +     P +     P +     P + A   P++     P +       +    +P++  
Sbjct:   243 AFGASGTPAFGASSTPAFGASSTPAFGASSTPAFGGSSTPSFGASNTSSFSFGSSPAFGQ 302

Query:   455 SRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSG 494
             S  + F     G++  G  P P        A+ P   GSG
Sbjct:   303 ST-SAF-----GSSAFGSTPSPFGGA---QASTPTFGGSG 333


>RGD|1309595
            symbol:Taf15 "TAF15 RNA polymerase II, TATA box binding protein
            (TBP)-associated factor" species:10116 "Rattus norvegicus"
            [GO:0000166 "nucleotide binding" evidence=IEA] [GO:0003674
            "molecular_function" evidence=ND] [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0005622 "intracellular" evidence=IEA] [GO:0008150
            "biological_process" evidence=ND] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358
            PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360 SMART:SM00547
            RGD:1309595 GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330
            GO:GO:0003676 GO:GO:0005622 EMBL:AC119615 IPI:IPI00950713
            PRIDE:F1M8P1 Ensembl:ENSRNOT00000014438 ArrayExpress:F1M8P1
            Uniprot:F1M8P1
        Length = 554

 Score = 118 (46.6 bits), Expect = 2.0e-06, Sum P(2) = 2.0e-06
 Identities = 65/214 (30%), Positives = 77/214 (35%)

Query:   258 GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYE 317
             G+  Y  G G  QG G  P       V   P+     +A   S             P   
Sbjct:   334 GRGGYR-GRGGFQGRGGDPK--NGDWVCPNPSCGNMNFARRNSCNQCNEPRPEDSRPSGG 390

Query:   318 ASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQK-GSNYDAQR-GPNYDIH 375
               +G GY   +   +    G   D   G G D + G GY   + G +Y A R G  Y   
Sbjct:   391 DFRGRGYGGERG--FRGRGGRGGDRG-GYGADRSGG-GYGGDRSGGSYGADRSGGGYGGD 446

Query:   376 RGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPG 435
             R   Y   RG GY   RG +Y   RG GY   R  GY   RG  Y   R   Y   R   
Sbjct:   447 RS-GYGGDRG-GYGGDRGGSYGGDRG-GYGGDR-GGYGGDRGG-YGGDRG-GYGGDRRGA 500

Query:   436 YDLQRGQG---YDMRRAPSYDPSRGTGFDGAPRG 466
             Y   RG G   Y   R+  Y   RG G+ G  RG
Sbjct:   501 YGGDRGGGSGGYGGDRSGGYGGDRGGGY-GGDRG 533

 Score = 70 (29.7 bits), Expect = 2.0e-06, Sum P(2) = 2.0e-06
 Identities = 28/106 (26%), Positives = 48/106 (45%)

Query:   184 YHHCRGTYEYEKKF-----YNDHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRRAAD 238
             Y   +G+Y+ +  +     YN + +S     +NY +  T+ +  R + M+    D R   
Sbjct:   121 YDQHQGSYDEQSNYQQHDSYNQNQQSYHPQRENY-SHHTQDD--RRD-MSRYGEDNRGYG 176

Query:   239 GSYGGATGNSENETSGR-PV-GQNAYEDG----YGVPQGHGPPPSA 278
             GS GG  G    +  GR P+ G +  + G    +G  + +GP P A
Sbjct:   177 GSQGGGRGRGGYDKDGRGPMTGSSGGDRGGFKNFGGHRDYGPRPDA 222


>UNIPROTKB|Q96QC0
            symbol:PPP1R10 "Serine/threonine-protein phosphatase 1
            regulatory subunit 10" species:9606 "Homo sapiens" [GO:0006351
            "transcription, DNA-dependent" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA]
            [GO:0003723 "RNA binding" evidence=IEA] [GO:0004864 "protein
            phosphatase inhibitor activity" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0072357 "PTW/PP1 phosphatase complex"
            evidence=ISS] [GO:0000785 "chromatin" evidence=ISS] [GO:0006606
            "protein import into nucleus" evidence=TAS] InterPro:IPR000571
            InterPro:IPR003617 InterPro:IPR017923 Pfam:PF00642 Pfam:PF08711
            PROSITE:PS50103 PROSITE:PS51319 SMART:SM00356 SMART:SM00509
            GO:GO:0005634 EMBL:BA000025 GO:GO:0046872 GO:GO:0003677
            GO:GO:0008270 GO:GO:0000785 GO:GO:0006351 GO:GO:0003723
            EMBL:AL662800 EMBL:AL662825 GO:GO:0000790 GO:GO:0006606
            GO:GO:0004864 Gene3D:1.20.930.10 SUPFAM:SSF47676 GO:GO:0072357
            EMBL:Y13247 EMBL:AJ544537 EMBL:AB088097 EMBL:BX248507
            IPI:IPI00298731 PIR:JE0291 RefSeq:NP_002705.2 UniGene:Hs.106019
            ProteinModelPortal:Q96QC0 SMR:Q96QC0 DIP:DIP-39343N IntAct:Q96QC0
            MINT:MINT-1197376 STRING:Q96QC0 PhosphoSite:Q96QC0 DMDM:61214507
            PaxDb:Q96QC0 PeptideAtlas:Q96QC0 PRIDE:Q96QC0
            Ensembl:ENST00000376511 Ensembl:ENST00000383586
            Ensembl:ENST00000420949 Ensembl:ENST00000424446
            Ensembl:ENST00000426299 Ensembl:ENST00000429597
            Ensembl:ENST00000449113 GeneID:5514 KEGG:hsa:5514 UCSC:uc003nqn.1
            CTD:5514 GeneCards:GC06M030568 H-InvDB:HIX0165052
            H-InvDB:HIX0166290 H-InvDB:HIX0166579 H-InvDB:HIX0166833
            H-InvDB:HIX0167082 H-InvDB:HIX0167322 H-InvDB:HIX0167569
            HGNC:HGNC:9284 HPA:CAB025501 MIM:603771 neXtProt:NX_Q96QC0
            PharmGKB:PA33612 eggNOG:NOG69306 HOGENOM:HOG000049285
            HOVERGEN:HBG053646 InParanoid:Q96QC0 OMA:PPPHEHR OrthoDB:EOG451DQK
            PhylomeDB:Q96QC0 ChiTaRS:PPP1R10 GenomeRNAi:5514 NextBio:21326
            ArrayExpress:Q96QC0 Bgee:Q96QC0 CleanEx:HS_PPP1R10
            Genevestigator:Q96QC0 GermOnline:ENSG00000204569 Uniprot:Q96QC0
        Length = 940

 Score = 145 (56.1 bits), Expect = 2.0e-06, P = 2.0e-06
 Identities = 63/248 (25%), Positives = 83/248 (33%)

Query:   239 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQ-------GHGPPPSATTAGVVGAGPNTS 291
             G  GG  G         P G + + DG G P        G GP P     G  G G N  
Sbjct:   656 GGPGGPVGPRLLGPPPPPRGGDPFWDGPGDPMRGGPMRGGPGPGPGPYHRGRGGRGGNEP 715

Query:   292 TSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPT 351
                    +     R+    P G G     GPG        + P +GP        G+ P 
Sbjct:   716 PPPPPPFRGARGGRSGGGPPNGRG-----GPGGGMVGGGGHRPHEGPGGGMGNSSGHRPH 770

Query:   352 KGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPG 411
             +GPG     GS +    GP   +  G  + P  G G  +  G  +    GPG       G
Sbjct:   771 EGPG--GGMGSGHRPHEGPGGSMGGGGGHRPHEGPGGGISGGSGHRPHEGPGGGMGAGGG 828

Query:   412 YDVQRGPVYEAQRAPSYIPQRGPGY---------DLQRGQGYDMRRAPSYDPSRGTGFDG 462
             +    GP      +  + P  GPG+         D+   +G+D R  P   P    G DG
Sbjct:   829 HRPHEGPGGSMGGSGGHRPHEGPGHGGPHGHRPHDVPGHRGHDHRGPP---PHEHRGHDG 885

Query:   463 APRGAAPH 470
                G   H
Sbjct:   886 PGHGGGGH 893

 Score = 144 (55.7 bits), Expect = 2.6e-06, P = 2.6e-06
 Identities = 71/268 (26%), Positives = 90/268 (33%)

Query:   254 GRPVGQNAYEDGYG--VPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIP 311
             G P G   +  G G  +P  HG P       ++G  P            G PMR    + 
Sbjct:   635 GGPKGMQHFPPGPGGPMPGPHGGPGGPVGPRLLGPPPPPRGGDPFWDGPGDPMRGG-PMR 693

Query:   312 RGPGYEASKGPG-YD---ASKAPSYDPTKGPSYDPAKG--PGYDPTKG---PGYDAQKGS 362
              GPG     GPG Y      +  +  P   P +  A+G   G  P  G   PG     G 
Sbjct:   694 GGPG----PGPGPYHRGRGGRGGNEPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGG 749

Query:   363 NYDAQRGP-----NYDIHRGPSYDPQRGLG--YDMQRGPNYDMQRGPGYETQRVPGYDVQ 415
              +    GP     N   HR P   P  G+G  +    GP   M  G G+     PG  + 
Sbjct:   750 GHRPHEGPGGGMGNSSGHR-PHEGPGGGMGSGHRPHEGPGGSMGGGGGHRPHEGPGGGIS 808

Query:   416 RGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPP 475
              G  +     P      G G+    G G  M  +  + P  G G  G P G  PH  VP 
Sbjct:   809 GGSGHRPHEGPGGGMGAGGGHRPHEGPGGSMGGSGGHRPHEGPGH-GGPHGHRPH-DVPG 866

Query:   476 PLNNVPYGSATPPARSGSGQPRGGNPAR 503
                +   G      R   G   GG   R
Sbjct:   867 HRGHDHRGPPPHEHRGHDGPGHGGGGHR 894

 Score = 130 (50.8 bits), Expect = 8.7e-05, P = 8.7e-05
 Identities = 53/213 (24%), Positives = 72/213 (33%)

Query:   243 GATGNSENETSGRPV-GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSG 301
             G  G +E      P  G      G G P G G P      G  G  P+          SG
Sbjct:   708 GGRGGNEPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGG-GHRPHEGPGGGMGNSSG 766

Query:   302 TPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKG 361
                        G G+   +GPG        + P +GP    + G G+ P +GPG     G
Sbjct:   767 HRPHEGPGGGMGSGHRPHEGPGGSMGGGGGHRPHEGPGGGISGGSGHRPHEGPGGGMGAG 826

Query:   362 SNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPN-YDMQRGPGYETQRVPGYDVQRGPVY 420
               +    GP   +     + P  G G+    G   +D+   PG+      G+D  RGP  
Sbjct:   827 GGHRPHEGPGGSMGGSGGHRPHEGPGHGGPHGHRPHDV---PGHR-----GHD-HRGPPP 877

Query:   421 EAQRA---PSYIPQRGPGYDLQRGQGYDMRRAP 450
                R    P +      G+D     G DM   P
Sbjct:   878 HEHRGHDGPGHGGGGHRGHDGGHSHGGDMSNRP 910

 Score = 122 (48.0 bits), Expect = 0.00064, P = 0.00064
 Identities = 50/195 (25%), Positives = 69/195 (35%)

Query:   234 RRAADGSYGGATGNSENETSGRPVGQNAYED----GYGVPQGHGPPPSATTAGVVGAG-- 287
             R A  G  GG   N      G  VG   +      G G+    G  P     G +G+G  
Sbjct:   723 RGARGGRSGGGPPNGRGGPGGGMVGGGGHRPHEGPGGGMGNSSGHRPHEGPGGGMGSGHR 782

Query:   288 PNTSTSAYAATQSG-TPMRA-AYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKG 345
             P+           G  P       I  G G+   +GPG        + P +GP       
Sbjct:   783 PHEGPGGSMGGGGGHRPHEGPGGGISGGSGHRPHEGPGGGMGAGGGHRPHEGPGGSMGGS 842

Query:   346 PGYDPTKGPGYDAQKGSN-YDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY 404
              G+ P +GPG+    G   +D      +D HRGP   P    G+D   GP +      G+
Sbjct:   843 GGHRPHEGPGHGGPHGHRPHDVPGHRGHD-HRGPP--PHEHRGHD---GPGHGGGGHRGH 896

Query:   405 ETQRVPGYDVQRGPV 419
             +     G D+   PV
Sbjct:   897 DGGHSHGGDMSNRPV 911


>UNIPROTKB|F1P555
            symbol:SFPQ "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0000166 "nucleotide binding" evidence=IEA] [GO:0003676
            "nucleic acid binding" evidence=IEA] [GO:0000380 "alternative mRNA
            splicing, via spliceosome" evidence=IEA] [GO:0016363 "nuclear
            matrix" evidence=IEA] [GO:0042382 "paraspeckles" evidence=IEA]
            InterPro:IPR000504 InterPro:IPR012677 Pfam:PF00076 PROSITE:PS50102
            SMART:SM00360 GO:GO:0000166 Gene3D:3.30.70.330 GO:GO:0003676
            GO:GO:0016363 GO:GO:0000380 GO:GO:0042382 InterPro:IPR012975
            Pfam:PF08075 GeneTree:ENSGT00390000005004 OMA:APGGHPK
            EMBL:AADN02043825 EMBL:AADN02043826 IPI:IPI00574618
            Ensembl:ENSGALT00000003963 ArrayExpress:F1P555 Uniprot:F1P555
        Length = 647

 Score = 143 (55.4 bits), Expect = 2.0e-06, P = 2.0e-06
 Identities = 63/220 (28%), Positives = 90/220 (40%)

Query:   234 RRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSA-------TTAGVVGA 286
             RR   G  GG   +  +   G  +GQN    G G PQG G PP                A
Sbjct:    18 RRGGGGGRGGPNHDFRSPPPGMGMGQNRGPMGGG-PQGPGGPPGGGPKSEPPKPPASTSA 76

Query:   287 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDAS-KAPSYDPTKGPSYDPAKG 345
              P++S+S+ A T      ++    P      A + P   A   APS  P+ GP       
Sbjct:    77 PPSSSSSSSATTAGPAGSQSGPGAPPPSALPAGQPPQQQAQGSAPSSAPS-GPGGQQQPQ 135

Query:   346 PGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE 405
             P   P+  P    +KG       GP     +GP   PQ+G G   + GP +  + GPG E
Sbjct:   136 PKPSPSPTPAGGPKKGQGQSPGGGP-----KGPG-GPQQGPGGPHKGGPGH--RGGPGGE 187

Query:   406 TQRVPGYDVQRGPVYEAQRAPSYIPQRGP-GYDLQRGQGY 444
             ++   G    RG  ++ Q++ S   Q+GP G D    +G+
Sbjct:   188 SR---G----RGQQHQGQQSLSL--QQGPAGGDQLSDEGF 218

 Score = 130 (50.8 bits), Expect = 5.4e-05, P = 5.4e-05
 Identities = 53/184 (28%), Positives = 67/184 (36%)

Query:   230 PNVDRRAADGSYG-----GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVV 284
             PN D R+     G     G  G       G P G    E          PP S++++   
Sbjct:    28 PNHDFRSPPPGMGMGQNRGPMGGGPQGPGGPPGGGPKSEPPKPPASTSAPPSSSSSSSAT 87

Query:   285 GAGPNTSTSAYAATQ-SGTPM-RAAYDIPRGPG-YEASKGPGYDASKAPSYDPTKGPSYD 341
              AGP  S S   A   S  P  +      +G     A  GPG      P   P+  P+  
Sbjct:    88 TAGPAGSQSGPGAPPPSALPAGQPPQQQAQGSAPSSAPSGPGGQQQPQPKPSPSPTPAGG 147

Query:   342 PAKGPGYDP---TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDM 398
             P KG G  P    KGPG   Q+G     + GP    HRG      RG G   Q   +  +
Sbjct:   148 PKKGQGQSPGGGPKGPG-GPQQGPGGPHKGGPG---HRGGPGGESRGRGQQHQGQQSLSL 203

Query:   399 QRGP 402
             Q+GP
Sbjct:   204 QQGP 207


>ZFIN|ZDB-GENE-030131-1600
            symbol:ewsr1b "Ewing sarcoma breakpoint region 1b"
            species:7955 "Danio rerio" [GO:0005622 "intracellular"
            evidence=IEA] [GO:0000166 "nucleotide binding" evidence=IEA]
            [GO:0008270 "zinc ion binding" evidence=IEA] [GO:0003676 "nucleic
            acid binding" evidence=IEA] [GO:0021954 "central nervous system
            neuron development" evidence=IMP] [GO:0007067 "mitosis"
            evidence=IMP] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358
            PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360 SMART:SM00547
            ZFIN:ZDB-GENE-030131-1600 GO:GO:0007067 GO:GO:0000166 GO:GO:0008270
            Gene3D:3.30.70.330 GO:GO:0003676 GO:GO:0005622 GO:GO:0021954
            GeneTree:ENSGT00530000063105 HOGENOM:HOG000038010
            HOVERGEN:HBG000970 EMBL:BX664747 EMBL:BC097019 UniGene:Dr.76923
            SMR:Q4QRG0 STRING:Q4QRG0 Ensembl:ENSDART00000003998 OMA:PVINIYL
            Uniprot:Q4QRG0
        Length = 579

 Score = 147 (56.8 bits), Expect = 2.3e-06, Sum P(2) = 2.3e-06
 Identities = 80/296 (27%), Positives = 106/296 (35%)

Query:   224 AELMNAPNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGV 283
             A + N  + ++  A  SYG  T     +T G+   Q   +  Y     +  P +A  A  
Sbjct:     2 ASVTNYSSYNQAGAQQSYGSYTAPPA-QTYGQTAQQGYTQQDYS---SYAQPAAAPEATY 57

Query:   284 VGAGPNTSTSAYAATQSGTPM-RAAYDIPRGPGYEASKGPGYDASKAPSYDPTK--GPSY 340
               A P  S  AYA  Q G+   +AA      P    +  PG     A SY  +   G + 
Sbjct:    58 SQAAP--SAGAYAQQQYGSTYGQAAATAAAAPAAYGTPQPGAYTQPAQSYGASSYTGSTA 115

Query:   341 DPAKGPGYDPTKGPGYDAQKG-SNYDAQ---RGP-NYDIHRGPSYDPQRGLGYDMQRG-- 393
              PA    Y     PGY  Q   S Y  Q     P +Y     P+Y+      Y    G  
Sbjct:   116 APAAQASYGSQ--PGYSTQPAYSGYSQQPAASAPQSYSASSQPAYNQS---AYSQPAGYS 170

Query:   394 -PNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPS-YIPQRGPGY-DLQRGQGY----DM 446
              P Y  Q+ PGY  Q+   Y   + P    Q  P+ Y PQ    Y   Q GQ      D 
Sbjct:   171 QPGYQAQQ-PGYGQQQQSAYGQGQPPQQHQQGPPAAYPPQGSSSYAQTQYGQQSAPQNDY 229

Query:   447 RRAPSYDPSRGT---GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 499
             ++ P    S+G    G+ G+ RG    G         P G        G    RGG
Sbjct:   230 QQNPYNSYSQGGVSGGYPGSQRGGYQDGGRDGYDRGGPRGRGMGRGGMGIAGDRGG 285

 Score = 39 (18.8 bits), Expect = 2.3e-06, Sum P(2) = 2.3e-06
 Identities = 8/16 (50%), Positives = 8/16 (50%)

Query:   488 PARSGSGQPRGGNPAR 503
             P R G G  RGG   R
Sbjct:   410 PMRGGPGMDRGGMMGR 425


>UNIPROTKB|P12105
            symbol:COL3A1 "Collagen alpha-1(III) chain" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            [GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000885
            InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410 ProDom:PD002078
            PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038
            SMART:SM00214 GO:GO:0046872 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 eggNOG:NOG12793 GO:GO:0005201 HOVERGEN:HBG004933
            EMBL:U07973 EMBL:X00822 EMBL:X00823 EMBL:X00826 EMBL:X00825
            EMBL:X00827 EMBL:X00828 EMBL:X00830 EMBL:X00831 EMBL:K02302
            EMBL:K02301 EMBL:V00391 EMBL:V00392 EMBL:M36662 IPI:IPI00590578
            PIR:A05269 PIR:I50694 UniGene:Gga.42140 ProteinModelPortal:P12105
            STRING:P12105 Uniprot:P12105
        Length = 1262

 Score = 146 (56.5 bits), Expect = 2.3e-06, P = 2.3e-06
 Identities = 87/290 (30%), Positives = 112/290 (38%)

Query:   235 RAADGSYG--GATGN-SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP-N 289
             R   G  G  GA G   +N   G P G+       G+P  +G P     AG  G+ GP  
Sbjct:   457 RGPPGEEGKRGANGEPGQNGVPGTP-GERGSPGFRGLPGSNGLPGEKGPAGERGSPGPPG 515

Query:   290 TSTSAYAATQSGTP----MRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKG 345
              S  A    Q G P    MR    IP  PG +   GP  +  + P      GP+  P   
Sbjct:   516 PSGPAGDRGQDGGPGLPGMRGLPGIPGSPGSDGKPGPPGNQGE-PGRSGPPGPA-GPRGQ 573

Query:   346 PGYDPTKGP-GYDAQKGSNYDAQRGPNYDIHR-GPS-YDPQRGL-GYDMQRGPNYDM-QR 400
             PG     GP G +   G N   +RGP       GP+  +   GL G     GP  D  + 
Sbjct:   574 PGVMGFPGPKGNEGAPGKN--GERGPGGPPGTPGPAGKNGDVGLPGPPGPAGPAGDRGEP 631

Query:   401 GPGYET--QRVPGYDVQRGPVYEAQRAPSYIPQR---GPGYDLQRGQ-GYDMRRAPSYDP 454
             GP      Q +PG     GP  E  +     P+    GPG+   +G+ G    R P   P
Sbjct:   632 GPSGSPGLQGLPGGP---GPAGENGKPGEPGPKGDIGGPGFPGPKGENGIPGERGPQGPP 688

Query:   455 SRGTGFDGAPRGAAPHG-QVPPPLNNVPYGSATPPARSGSGQPRG--GNP 501
                TG  G P  A   G + PP     P G+  P  +   G+ RG  G+P
Sbjct:   689 GP-TGARGGPGPAGSEGAKGPPGPPGAPGGTGLPGLQGMPGE-RGASGSP 736

 Score = 128 (50.1 bits), Expect = 0.00020, P = 0.00020
 Identities = 87/281 (30%), Positives = 107/281 (38%)

Query:   242 GGATGNSENETSGRPVGQNAY-EDGY-GVPQGHGPPPSATTAGVVGAGPNTSTSAYA--- 296
             GG TG  E    G P G  A+ +DG  G     GPP    TAG  G+ P     A     
Sbjct:   301 GGPTG--ERGRPGNPGGPGAHGKDGAPGTAGPLGPPGPPGTAGFPGS-PGFKGEAGPPGP 357

Query:   297 ATQSGTPMRAAYDIPRGP-GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG-P 354
             A  SG P       P+G  G    +GP   A  +P      GPS  P  GPG    +G P
Sbjct:   358 AGASGNPGERGEPGPQGQAGPPGPQGPPGRAG-SPGGKGEMGPSGIPG-GPGPPGGRGLP 415

Query:   355 GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDM-QRGPGYETQRVPGYD 413
             G     G N  A+  P      G   DP    G   +RG N     RGP       PG +
Sbjct:   416 GPPGTSG-NPGAKGTPGEPGKNGAKGDP----GPKGERGENGTPGARGP-------PGEE 463

Query:   414 VQRGPVYEAQR--APSYIPQRG-PGY-DLQRGQGYDMRRAPSYDPSRGTGFDGAPRG-AA 468
              +RG   E  +   P    +RG PG+  L    G    + P+ +  RG+     P G A 
Sbjct:   464 GKRGANGEPGQNGVPGTPGERGSPGFRGLPGSNGLPGEKGPAGE--RGSPGPPGPSGPAG 521

Query:   469 PHGQV--P--PPLNNVPYGSATPPARSGSGQPRG--GNPAR 503
               GQ   P  P +  +P G    P   G   P G  G P R
Sbjct:   522 DRGQDGGPGLPGMRGLP-GIPGSPGSDGKPGPPGNQGEPGR 561

 Score = 127 (49.8 bits), Expect = 0.00026, P = 0.00026
 Identities = 78/276 (28%), Positives = 97/276 (35%)

Query:   243 GATGN-SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSG 301
             G  G   +N   G P G        G P   GPP      G  G  P  +       + G
Sbjct:   428 GTPGEPGKNGAKGDP-GPKGERGENGTPGARGPPGEEGKRGANGE-PGQNGVPGTPGERG 485

Query:   302 TPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAK--GPGYDPTKG-PGYD 357
             +P      +P   G    KGP G   S  P   P+ GP+ D  +  GPG    +G PG  
Sbjct:   486 SP--GFRGLPGSNGLPGEKGPAGERGSPGPP-GPS-GPAGDRGQDGGPGLPGMRGLPGIP 541

Query:   358 AQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYD--MQRGPGYE-TQRVPGYDV 414
                GS  D + GP      G   +P R  G     GP     +   PG +  +  PG + 
Sbjct:   542 GSPGS--DGKPGPP-----GNQGEPGRS-GPPGPAGPRGQPGVMGFPGPKGNEGAPGKNG 593

Query:   415 QRGPVYEAQRAPSYIPQRG----PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPH 470
             +RGP       P    + G    PG     G   D R  P   PS   G  G P G  P 
Sbjct:   594 ERGPG-GPPGTPGPAGKNGDVGLPGPPGPAGPAGD-RGEPG--PSGSPGLQGLPGGPGPA 649

Query:   471 GQVPPPLNNVPYGSATPPARSGSGQPRGGN--PARR 504
             G+   P    P G    P   G   P+G N  P  R
Sbjct:   650 GENGKPGEPGPKGDIGGPGFPG---PKGENGIPGER 682

 Score = 125 (49.1 bits), Expect = 0.00043, P = 0.00043
 Identities = 74/259 (28%), Positives = 95/259 (36%)

Query:   256 PVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRG- 313
             P G N Y+   G P   GP      AG++G AGP          + G P R   +  RG 
Sbjct:   192 PPGSNGYQGPPGEPGQPGPSGPPGPAGMIGPAGP--------PGKDGEPGRPGRNGDRGI 243

Query:   314 PGYEASKG-PGYDASKAPSYDPTKG-PSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRG 369
             PG    KG PG      P     +G    D AKG    P  GP G   Q G+N    Q G
Sbjct:   244 PGLPGHKGHPGMPGM--PGMKGARGFDGKDGAKGDSGAP--GPKGEAGQPGANGSPGQPG 299

Query:   370 PNYDI-HRGPSYDPQRGLGYDMQRGPNYDMQRGP-GYE-TQRVPGYDVQRGPVYEAQRAP 426
             P      RG   +P     +     P      GP G   T   PG      P ++ +  P
Sbjct:   300 PGGPTGERGRPGNPGGPGAHGKDGAPGTAGPLGPPGPPGTAGFPG-----SPGFKGEAGP 354

Query:   427 SYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSAT 486
                P    G   +RG+     +A    P    G  G+P G    G++ P  + +P G   
Sbjct:   355 PG-PAGASGNPGERGEPGPQGQAGPPGPQGPPGRAGSPGGK---GEMGP--SGIPGGPGP 408

Query:   487 PPARSGSGQP-RGGNPARR 504
             P  R   G P   GNP  +
Sbjct:   409 PGGRGLPGPPGTSGNPGAK 427


>ZFIN|ZDB-GENE-080204-113
            symbol:zgc:172323 "zgc:172323" species:7955 "Danio
            rerio" [GO:0005882 "intermediate filament" evidence=IEA]
            [GO:0005198 "structural molecule activity" evidence=IEA]
            [GO:0008150 "biological_process" evidence=ND] InterPro:IPR001664
            InterPro:IPR006821 Pfam:PF04732 ZFIN:ZDB-GENE-080204-113
            GO:GO:0005198 GO:GO:0005882 HOVERGEN:HBG013015 InterPro:IPR016044
            PANTHER:PTHR23239 Pfam:PF00038 GeneTree:ENSGT00560000076873
            EMBL:CR848819 EMBL:BC155653 IPI:IPI00492297 RefSeq:NP_001107899.1
            UniGene:Dr.18713 SMR:A9JRG7 Ensembl:ENSDART00000075191
            GeneID:564165 KEGG:dre:564165 eggNOG:NOG147695 HOGENOM:HOG000207709
            NextBio:20885253 Uniprot:A9JRG7
        Length = 847

 Score = 144 (55.7 bits), Expect = 2.3e-06, P = 2.3e-06
 Identities = 107/460 (23%), Positives = 167/460 (36%)

Query:    58 QHVEMQKLATEN-QRLAATHGTLRQEL--AAAQHELQILHGQIGGMKSERELQ-----MR 109
             QH +   +A +N Q + + +     +L    ++H  Q+ H + G   +++++Q     M 
Sbjct:   281 QH-QYDDIAAKNLQEMDSWYKNKFDDLNNKTSKHVDQVRHVREGIASAKKDIQNKERDMD 339

Query:   110 NLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQQ 169
             ++  K   +EA+++  +    +++K   + Q  + A +  +    Q T  L R + D   
Sbjct:   340 SMNTKNEALEAQIRDTQD---KYRKELEDLQARIEALQLELKSSKQRTALLLREYQD--- 393

Query:   170 IPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNA 229
                LL+   SL  E    R   E E       ++S+Q M    ++ +T V  + A    A
Sbjct:   394 ---LLNVKMSLEIEITTYRKLIEGEDSRLTSMVQSMQTM--TLMSGSTSVHTVAA---GA 445

Query:   230 PNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGH---GPPPSATTAGVVGA 286
              N   R   G  GG  G       G P        G G+       G       A  VG 
Sbjct:   446 ANRGGRGLAGGLGGDVGLEFAGGLGGPATGLERGVGRGLDGSATVLGESVGGDAARGVGG 505

Query:   287 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP 346
             GP T    +     G  + +   I  G G     GP    +     DP KG       GP
Sbjct:   506 GPTTVLGGHVDGGLGGGIGSGPAIGLGGG--VGSGPATGFAGGVGGDPAKGLPGGVGGGP 563

Query:   347 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 406
                   G G D  KG       GP   +  G   DP +GL  D+   P   +  G G + 
Sbjct:   564 ATGLGGGVGGDPAKGLPGGVGGGPATGLTGGVGGDPGKGLS-DVGGVPATSLAGGVGGDP 622

Query:   407 QR-VPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPSRGT--GFDG 462
              + +PG  V  GP           P +G PG  +  G    +      D ++G   G  G
Sbjct:   623 AKGLPG-GVGGGPATGLAGGVGVDPAKGLPG-GVSGGPASGLAGGVGGDTAKGLPGGVGG 680

Query:   463 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPA 502
              P      G    P+  +  G    P++   G   GG PA
Sbjct:   681 GPATGLAGGVGGVPVTGLAGGVGGDPSKGLPGGV-GGGPA 719


>UNIPROTKB|G1RSL2
            symbol:COL4A4 "Uncharacterized protein" species:61853
            "Nomascus leucogenys" [GO:0005201 "extracellular matrix structural
            constituent" evidence=ISS] [GO:0005587 "collagen type IV"
            evidence=ISS] [GO:0005605 "basal lamina" evidence=ISS] [GO:0032836
            "glomerular basement membrane development" evidence=ISS]
            InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
            InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005605 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 GO:GO:0005587 Gene3D:2.170.240.10
            GeneTree:ENSGT00690000101772 GO:GO:0032836 OMA:FRGDMGD
            EMBL:ADFV01083072 EMBL:ADFV01083073 EMBL:ADFV01083074
            EMBL:ADFV01083075 EMBL:ADFV01083076 EMBL:ADFV01083077
            EMBL:ADFV01083078 Ensembl:ENSNLET00000017067 Uniprot:G1RSL2
        Length = 1690

 Score = 147 (56.8 bits), Expect = 2.5e-06, P = 2.5e-06
 Identities = 79/253 (31%), Positives = 99/253 (39%)

Query:   262 YEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKG 321
             Y    G P   G P      G  GA P  S S     + GTP     +IP  PG+    G
Sbjct:   671 YPGRQGPPGFDGLPGPKGFPGPQGA-PGLSGSDGHKGRPGTP--GTSEIPGPPGFRGDMG 727

Query:   322 -PGYDASKAPSYDPTKGPSYDPA----KG-PGYDPTKGP-GYDAQKG-SNYDAQRGPNYD 373
              PG+   +  S     GP   P     KG PG DP  GP G   ++G S     +GP  D
Sbjct:   728 DPGFGGERGSSPVGPPGPPGSPGVNGQKGIPG-DPAFGPLGPPGKRGLSGVPGIKGPRGD 786

Query:   374 IHRG-PSYDPQRGL-GYDMQRGPN-YDMQRG-PGYETQRVPGYDVQRG-PVYEAQRA-PS 427
                G P  +   G+ G+   +GP   +   G PG      PG+  +RG P    Q   P 
Sbjct:   787 --PGCPGAEGPAGIPGFPGLKGPKGREGHAGFPGVPGP--PGHSCERGAPGIPGQPGLPG 842

Query:   428 YIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVP--YGSA 485
             Y P   PG    +GQ  D+   P   P+   G  G P     HG  PP L  +P  +G  
Sbjct:   843 Y-PG-SPGAPGGKGQPGDV--GPP-GPAGMKGLPGLPGRPGAHG--PPGLPGIPGPFGDD 895

Query:   486 TPPARSGSGQPRG 498
               P   G   PRG
Sbjct:   896 GLPGPPGPKGPRG 908

 Score = 123 (48.4 bits), Expect = 0.00099, P = 0.00099
 Identities = 76/253 (30%), Positives = 97/253 (38%)

Query:   270 QGH-GPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKG-PGYD 325
             +GH G P      G  G  G    T +   T  G      +D   GP G+   +G PG  
Sbjct:   640 RGHPGVPGRPGVRGPDGLKGQKGDTISCNVTYPGRQGPPGFDGLPGPKGFPGPQGAPGLS 699

Query:   326 ASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGP----NYDIHRGPS 379
              S      P T G S  P   PG+    G PG+  ++GS+     GP      +  +G  
Sbjct:   700 GSDGHKGRPGTPGTSEIPGP-PGFRGDMGDPGFGGERGSSPVGPPGPPGSPGVNGQKGIP 758

Query:   380 YDPQRG-LGYDMQRG----PNYDMQRG----PGYETQR-VPGYDVQRGPVYEAQRA--PS 427
              DP  G LG   +RG    P     RG    PG E    +PG+   +GP      A  P 
Sbjct:   759 GDPAFGPLGPPGKRGLSGVPGIKGPRGDPGCPGAEGPAGIPGFPGLKGPKGREGHAGFPG 818

Query:   428 YIPQRGPGYDLQRGQ-GYDMRRA-PSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSA 485
              +P   PG+  +RG  G   +   P Y P    G  GAP G    G V PP      G  
Sbjct:   819 -VPGP-PGHSCERGAPGIPGQPGLPGY-P----GSPGAPGGKGQPGDVGPPGPAGMKGLP 871

Query:   486 TPPARSGSGQPRG 498
               P R G+  P G
Sbjct:   872 GLPGRPGAHGPPG 884


>FB|FBgn0262126
            symbol:gho "ghost" species:7227 "Drosophila melanogaster"
            [GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
            [GO:0006888 "ER to Golgi vesicle-mediated transport" evidence=IEA]
            [GO:0006886 "intracellular protein transport" evidence=IEA]
            [GO:0008270 "zinc ion binding" evidence=IEA] [GO:0030127 "COPII
            vesicle coat" evidence=IEA] [GO:0005811 "lipid particle"
            evidence=IDA] [GO:0035158 "regulation of tube diameter, open
            tracheal system" evidence=IMP] [GO:0009306 "protein secretion"
            evidence=IMP] [GO:0035151 "regulation of tube size, open tracheal
            system" evidence=IMP] [GO:0070971 "endoplasmic reticulum exit site"
            evidence=IDA] [GO:0003331 "positive regulation of extracellular
            matrix constituent secretion" evidence=IMP] [GO:0007029
            "endoplasmic reticulum organization" evidence=IMP] [GO:0048081
            "positive regulation of cuticle pigmentation" evidence=IMP]
            [GO:0030011 "maintenance of cell polarity" evidence=IMP]
            [GO:0007030 "Golgi organization" evidence=IMP] [GO:0016203 "muscle
            attachment" evidence=IMP] [GO:0035149 "lumen formation, open
            tracheal system" evidence=IMP] [GO:0034394 "protein localization to
            cell surface" evidence=IMP] [GO:0040003 "chitin-based cuticle
            development" evidence=IMP] [GO:0022409 "positive regulation of
            cell-cell adhesion" evidence=IMP] [GO:0008360 "regulation of cell
            shape" evidence=IMP] [GO:0071711 "basement membrane organization"
            evidence=IMP] [GO:0000902 "cell morphogenesis" evidence=IMP]
            InterPro:IPR006895 InterPro:IPR006896 InterPro:IPR006900
            Pfam:PF04810 Pfam:PF04811 Pfam:PF04815 GO:GO:0006886 EMBL:AE014134
            GO:GO:0008360 GO:GO:0005811 GO:GO:0008270 GO:GO:0009306
            GO:GO:0016787 GO:GO:0016203 GO:GO:0000902 InterPro:IPR007123
            Pfam:PF00626 GO:GO:0006888 GO:GO:0040003 GO:GO:0034394
            GO:GO:0003331 GO:GO:0071711 GO:GO:0007030 GO:GO:0007029
            GO:GO:0030011 GO:GO:0035158 GO:GO:0022409 GO:GO:0035149
            GO:GO:0030127 SUPFAM:SSF82919 GO:GO:0070971 InterPro:IPR012990
            Pfam:PF08033 SUPFAM:SSF81811 eggNOG:COG5028 KO:K14007
            GeneTree:ENSGT00590000082962 HSSP:P40482 OMA:QDQGNCN GO:GO:0048081
            EMBL:AY052042 RefSeq:NP_608664.2 UniGene:Dm.269 SMR:Q9VQ94
            IntAct:Q9VQ94 MINT:MINT-283494 STRING:Q9VQ94
            EnsemblMetazoa:FBtr0077810 EnsemblMetazoa:FBtr0329964 GeneID:33409
            KEGG:dme:Dmel_CG10882 UCSC:CG10882-RA CTD:33409 FlyBase:FBgn0262126
            InParanoid:Q9VQ94 OrthoDB:EOG4CVDNW GenomeRNAi:33409 NextBio:783418
            Uniprot:Q9VQ94
        Length = 1193

 Score = 135 (52.6 bits), Expect = 3.2e-06, Sum P(2) = 3.2e-06
 Identities = 65/231 (28%), Positives = 84/231 (36%)

Query:   267 GVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIP---RGPGYEASKGPG 323
             G P   G PP +    +  + P  S     +++ G P       P     PG    +  G
Sbjct:   211 GQPPLPGQPPFS--GQIPTSQPAPSPYGVPSSRPGQPQLPPGATPPTYTQPGLPPQQQQG 268

Query:   324 YDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQ 383
                 + P   P + P + P + PG  P   PG   Q G+ Y A +   Y    G  +  Q
Sbjct:   269 IPPLQQPGI-PQQQPGFPPQQ-PGLPPLSQPGLPPQPGAPYGAPQQGGYS---G-GFPGQ 322

Query:   384 RGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG----PGYDLQ 439
                G+     P       PG +    P +   + P Y  Q+ P Y PQ G    PGY  Q
Sbjct:   323 APGGFPGAPPPL------PGQQAAAPPQFGAPQ-PGYPGQQ-PGYPPQPGQQPMPGYPPQ 374

Query:   440 RGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPAR 490
              GQ       P Y P  G GF G P G     Q P P     Y  A P AR
Sbjct:   375 PGQQLG---GPGYPPQPGAGFPGQP-GRPGFNQPPMPGAGNMYQQA-PQAR 420

 Score = 127 (49.8 bits), Expect = 2.3e-05, Sum P(2) = 2.3e-05
 Identities = 75/283 (26%), Positives = 100/283 (35%)

Query:   239 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHG-PPPSATTAGVVGAGPNTSTSAY-- 295
             G  GGA         G P        G+     +  PPP+       GA P T   +Y  
Sbjct:    90 GGVGGANPLKPPLPQGAPAAAAPPPTGFNQFNSNAAPPPTNNNNAAFGAPPPTQAGSYVN 149

Query:   296 -AATQSGTPMRAAYDIPRGPGYEASKG--PGYDASKAPSYDPTKGPSYDPAKG------- 345
              A   S TP   A  I +     A+    P     KA +     G    PA G       
Sbjct:   150 GALPPSSTPQSVASGINQMSLNSATLAGLPHMPPPKAATPGAAPGQPPIPAAGSTSQPPL 209

Query:   346 PGYDPTKGPGYDAQKGSNYDAQRGPN-YDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY 404
             PG  P   PG     G    +Q  P+ Y +       PQ   G      P Y     P  
Sbjct:   210 PGQPPL--PGQPPFSGQIPTSQPAPSPYGVPSSRPGQPQLPPG---ATPPTYTQPGLPPQ 264

Query:   405 ETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYD-PSRGTGFDGA 463
             + Q +P   +Q+  +   Q+ P + PQ+ PG       G   +    Y  P +G G+ G 
Sbjct:   265 QQQGIP--PLQQPGI--PQQQPGFPPQQ-PGLPPLSQPGLPPQPGAPYGAPQQG-GYSGG 318

Query:   464 PRGAAPHG--QVPPPLNNVPYGSATPPARSGSGQPRGGNPARR 504
               G AP G    PPPL   P   A  P + G+ QP  G P ++
Sbjct:   319 FPGQAPGGFPGAPPPL---PGQQAAAPPQFGAPQP--GYPGQQ 356

 Score = 116 (45.9 bits), Expect = 0.00034, Sum P(2) = 0.00034
 Identities = 69/272 (25%), Positives = 96/272 (35%)

Query:   239 GSY--GGATGNSENETSGRPVGQNAYEDGY--GVPQGHGPPPSATTAGVVGAGPNTSTSA 294
             GSY  G    +S  ++    + Q +       G+P  H PPP A T G   A P      
Sbjct:   145 GSYVNGALPPSSTPQSVASGINQMSLNSATLAGLP--HMPPPKAATPG---AAPGQPPIP 199

Query:   295 YAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK-- 352
              A + S  P+     +P  P + + + P    + +P   P+  P   P   PG  P    
Sbjct:   200 AAGSTSQPPLPGQPPLPGQPPF-SGQIPTSQPAPSPYGVPSSRPG-QPQLPPGATPPTYT 257

Query:   353 GPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGY 412
              PG   Q+       + P     + P + PQ+  G      P    Q G  Y   +  GY
Sbjct:   258 QPGLPPQQQQGIPPLQQPGIP-QQQPGFPPQQP-GLPPLSQPGLPPQPGAPYGAPQQGGY 315

Query:   413 DVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYD-PSRGTGFDGAPRGAAPHG 471
                 G  +  Q AP   P   P    Q+        AP    P +  G+   P G  P  
Sbjct:   316 S---GG-FPGQ-APGGFPGAPPPLPGQQAAAPPQFGAPQPGYPGQQPGYPPQP-GQQPMP 369

Query:   472 QVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 503
               PP       G   PP + G+G P  G P R
Sbjct:   370 GYPPQPGQQLGGPGYPP-QPGAGFP--GQPGR 398

 Score = 58 (25.5 bits), Expect = 3.2e-06, Sum P(2) = 3.2e-06
 Identities = 20/76 (26%), Positives = 35/76 (46%)

Query:    30 GMRPPMPGAFPPFDM-MPPPEVMEQKIASQHVEMQ-KLATENQRLAAT----HGTLRQEL 83
             G  PP  G +PP    +P  +  +Q++  Q  + Q +        AA+    +G  +Q+L
Sbjct:    20 GAPPPNSGGWPPQQQQLPQQQPPQQQLPPQQQQQQPQYGAPPPTSAASQPYLNGNYQQQL 79

Query:    84 AAAQHELQILHGQIGG 99
             A +   L +  G +GG
Sbjct:    80 ATSMGGLSV-GGGVGG 94

 Score = 40 (19.1 bits), Expect = 0.00022, Sum P(2) = 0.00022
 Identities = 10/24 (41%), Positives = 13/24 (54%)

Query:    31 MRPPMP-GAFPPFDMMPPPEVMEQ 53
             ++PP+P GA  P    PPP    Q
Sbjct:    98 LKPPLPQGA--PAAAAPPPTGFNQ 119


>UNIPROTKB|Q7YR38
            symbol:PPP1R10 "Serine/threonine-protein phosphatase 1
            regulatory subunit 10" species:9598 "Pan troglodytes" [GO:0000785
            "chromatin" evidence=ISS] [GO:0072357 "PTW/PP1 phosphatase complex"
            evidence=ISS] InterPro:IPR000571 InterPro:IPR003617
            InterPro:IPR017923 Pfam:PF00642 Pfam:PF08711 PROSITE:PS50103
            PROSITE:PS51319 SMART:SM00356 SMART:SM00509 GO:GO:0005634
            GO:GO:0046872 GO:GO:0003677 GO:GO:0008270 GO:GO:0000785
            GO:GO:0006351 GO:GO:0003723 EMBL:BA000041 GO:GO:0004864
            Gene3D:1.20.930.10 SUPFAM:SSF47676 GO:GO:0072357 CTD:5514
            eggNOG:NOG69306 HOGENOM:HOG000049285 HOVERGEN:HBG053646 OMA:PPPHEHR
            GeneTree:ENSGT00530000063820 EMBL:AB210175 EMBL:AB210176
            RefSeq:NP_001038965.1 UniGene:Ptr.6270 ProteinModelPortal:Q7YR38
            Ensembl:ENSPTRT00000033108 GeneID:462544 KEGG:ptr:462544
            NextBio:20841794 Uniprot:Q7YR38
        Length = 940

 Score = 143 (55.4 bits), Expect = 3.4e-06, P = 3.4e-06
 Identities = 63/248 (25%), Positives = 83/248 (33%)

Query:   239 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQ-------GHGPPPSATTAGVVGAGPNTS 291
             G  GG  G         P G + + DG G P        G GP P     G  G G N  
Sbjct:   656 GGPGGPVGPRLLGPPPPPRGGDPFWDGPGDPMRGGPMRGGPGPGPGPYHRGRGGRGGNEP 715

Query:   292 TSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPT 351
                    +     R+    P G G     GPG        + P +GP        G+ P 
Sbjct:   716 PPPPPPFRGARGGRSGGGPPNGRG-----GPGGGMVGGGGHRPHEGPGGGMGNNSGHRPH 770

Query:   352 KGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPG 411
             +GPG     GS +    GP   +  G  + P  G G  +  G  +    GPG       G
Sbjct:   771 EGPG--GGMGSGHRPHEGPAGSMGGGGGHRPHEGPGGGISGGSGHRPHEGPGGGMGAGGG 828

Query:   412 YDVQRGPVYEAQRAPSYIPQRGPGY---------DLQRGQGYDMRRAPSYDPSRGTGFDG 462
             +    GP      +  + P  GPG+         D+   +G+D R  P   P    G DG
Sbjct:   829 HRPHEGPGGSMGGSGGHRPHEGPGHGGPHGHRPHDVPGHRGHDHRGPP---PHEHRGHDG 885

Query:   463 APRGAAPH 470
                G   H
Sbjct:   886 PGHGGGGH 893

 Score = 142 (55.0 bits), Expect = 4.3e-06, P = 4.3e-06
 Identities = 71/268 (26%), Positives = 90/268 (33%)

Query:   254 GRPVGQNAYEDGYG--VPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIP 311
             G P G   +  G G  +P  HG P       ++G  P            G PMR    + 
Sbjct:   635 GGPKGMQHFPPGPGGPMPGPHGGPGGPVGPRLLGPPPPPRGGDPFWDGPGDPMRGG-PMR 693

Query:   312 RGPGYEASKGPG-YD---ASKAPSYDPTKGPSYDPAKG--PGYDPTKG---PGYDAQKGS 362
              GPG     GPG Y      +  +  P   P +  A+G   G  P  G   PG     G 
Sbjct:   694 GGPG----PGPGPYHRGRGGRGGNEPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGG 749

Query:   363 NYDAQRGP-----NYDIHRGPSYDPQRGLG--YDMQRGPNYDMQRGPGYETQRVPGYDVQ 415
              +    GP     N   HR P   P  G+G  +    GP   M  G G+     PG  + 
Sbjct:   750 GHRPHEGPGGGMGNNSGHR-PHEGPGGGMGSGHRPHEGPAGSMGGGGGHRPHEGPGGGIS 808

Query:   416 RGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPP 475
              G  +     P      G G+    G G  M  +  + P  G G  G P G  PH  VP 
Sbjct:   809 GGSGHRPHEGPGGGMGAGGGHRPHEGPGGSMGGSGGHRPHEGPGH-GGPHGHRPH-DVPG 866

Query:   476 PLNNVPYGSATPPARSGSGQPRGGNPAR 503
                +   G      R   G   GG   R
Sbjct:   867 HRGHDHRGPPPHEHRGHDGPGHGGGGHR 894

 Score = 132 (51.5 bits), Expect = 5.3e-05, P = 5.3e-05
 Identities = 54/214 (25%), Positives = 72/214 (33%)

Query:   258 GQNAYEDGYGVPQGHGPPPS-----ATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 312
             G   Y  G G   G+ PPP          G  G GP            G      ++ P 
Sbjct:   699 GPGPYHRGRGGRGGNEPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGGGHRPHEGPG 758

Query:   313 G-----PGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQ 367
             G      G+   +GPG        + P +GP+     G G+ P +GPG     GS +   
Sbjct:   759 GGMGNNSGHRPHEGPG--GGMGSGHRPHEGPAGSMGGGGGHRPHEGPGGGISGGSGHRPH 816

Query:   368 RGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY------ETQRVPGY--DVQRGPV 419
              GP   +  G  + P  G G  M     +    GPG+          VPG+     RGP 
Sbjct:   817 EGPGGGMGAGGGHRPHEGPGGSMGGSGGHRPHEGPGHGGPHGHRPHDVPGHRGHDHRGPP 876

Query:   420 YEAQRA---PSYIPQRGPGYDLQRGQGYDMRRAP 450
                 R    P +      G+D     G DM   P
Sbjct:   877 PHEHRGHDGPGHGGGGHRGHDGGHSHGGDMSNRP 910

 Score = 123 (48.4 bits), Expect = 0.00050, P = 0.00050
 Identities = 50/195 (25%), Positives = 70/195 (35%)

Query:   234 RRAADGSYGGATGNSENETSGRPVGQNAYED----GYGVPQGHGPPPSATTAGVVGAG-- 287
             R A  G  GG   N      G  VG   +      G G+    G  P     G +G+G  
Sbjct:   723 RGARGGRSGGGPPNGRGGPGGGMVGGGGHRPHEGPGGGMGNNSGHRPHEGPGGGMGSGHR 782

Query:   288 PNTSTSAYAATQSG-TPMRA-AYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKG 345
             P+   +       G  P       I  G G+   +GPG        + P +GP       
Sbjct:   783 PHEGPAGSMGGGGGHRPHEGPGGGISGGSGHRPHEGPGGGMGAGGGHRPHEGPGGSMGGS 842

Query:   346 PGYDPTKGPGYDAQKGSN-YDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY 404
              G+ P +GPG+    G   +D      +D HRGP   P    G+D   GP +      G+
Sbjct:   843 GGHRPHEGPGHGGPHGHRPHDVPGHRGHD-HRGPP--PHEHRGHD---GPGHGGGGHRGH 896

Query:   405 ETQRVPGYDVQRGPV 419
             +     G D+   PV
Sbjct:   897 DGGHSHGGDMSNRPV 911


>UNIPROTKB|Q5TM61
            symbol:PPP1R10 "Serine/threonine-protein phosphatase 1
            regulatory subunit 10" species:9544 "Macaca mulatta" [GO:0000785
            "chromatin" evidence=ISS] [GO:0072357 "PTW/PP1 phosphatase complex"
            evidence=ISS] InterPro:IPR000571 InterPro:IPR003617
            InterPro:IPR017923 Pfam:PF00642 Pfam:PF08711 PROSITE:PS50103
            PROSITE:PS51319 SMART:SM00356 SMART:SM00509 GO:GO:0005634
            GO:GO:0046872 GO:GO:0003677 GO:GO:0008270 GO:GO:0000785
            GO:GO:0006351 GO:GO:0003723 EMBL:AB128049 GO:GO:0004864
            Gene3D:1.20.930.10 SUPFAM:SSF47676 GO:GO:0072357 CTD:5514
            eggNOG:NOG69306 HOVERGEN:HBG053646 RefSeq:NP_001108416.1
            UniGene:Mmu.17467 ProteinModelPortal:Q5TM61 GeneID:711949
            KEGG:mcc:711949 NextBio:19975847 Uniprot:Q5TM61
        Length = 940

 Score = 143 (55.4 bits), Expect = 3.4e-06, P = 3.4e-06
 Identities = 73/271 (26%), Positives = 93/271 (34%)

Query:   254 GRPVGQNAYEDGYG--VPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIP 311
             G P G   +  G G  +P  HG P       ++G  P            G PMR    + 
Sbjct:   635 GGPKGMQHFPPGPGGPMPGPHGGPGGPVGPRLLGPPPPPRGGDPFWDGPGDPMRGG-PMR 693

Query:   312 RGPG-----YEASKGPGYDASKAPSYDPTKGPSYDPAKG--PGYDPTKG---PGYDAQKG 361
              GPG     Y   +G G   ++ P   P   P +  A+G   G  P  G   PG     G
Sbjct:   694 GGPGPGPGPYHRGRG-GRGGNEPP---PPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGG 749

Query:   362 SNYDAQRGP-----NYDIHRGPSYDPQRGLG--YDMQRGPNYDMQRGPGYETQRVPGYDV 414
               +    GP     N   HR P   P  G+G  +    GP   M  G G+     PG  +
Sbjct:   750 GGHRPHEGPGGGMGNSSGHR-PHEGPGSGMGSGHRPHEGPGSSMGGGGGHRPHEGPGGGI 808

Query:   415 QRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVP 474
               G  +     P      G G+    G G  M  +  + P  G G  G P G  PH  VP
Sbjct:   809 SGGSGHRPHEGPGGGMGAGGGHRPHEGPGGSMGGSGGHRPHEGPGH-GGPHGHRPH-DVP 866

Query:   475 PPLNNVPYGSATPPA--RSGSGQPRGGNPAR 503
                 +   G   PP   R   G   GG   R
Sbjct:   867 GHRGHDHRG---PPHEHRGHDGPGHGGGGHR 894

 Score = 142 (55.0 bits), Expect = 4.3e-06, P = 4.3e-06
 Identities = 54/213 (25%), Positives = 73/213 (34%)

Query:   242 GGATGNSENETSGRPV-GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQS 300
             GG  GN        P  G      G G P G G P      G  G  P+          S
Sbjct:   708 GGRGGNEPPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGG-GHRPHEGPGGGMGNSS 766

Query:   301 GTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQK 360
             G           G G+   +GPG        + P +GP    + G G+ P +GPG     
Sbjct:   767 GHRPHEGPGSGMGSGHRPHEGPGSSMGGGGGHRPHEGPGGGISGGSGHRPHEGPGGGMGA 826

Query:   361 GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPN-YDMQRGPGYETQRVPGYDVQRGPV 419
             G  +    GP   +     + P  G G+    G   +D+   PG+      G+D  RGP 
Sbjct:   827 GGGHRPHEGPGGSMGGSGGHRPHEGPGHGGPHGHRPHDV---PGHR-----GHD-HRGPP 877

Query:   420 YE--AQRAPSYIPQRGPGYDLQRGQGYDMRRAP 450
             +E      P +      G+D     G DM   P
Sbjct:   878 HEHRGHDGPGHGGGGHRGHDGGHSHGGDMSNRP 910

 Score = 140 (54.3 bits), Expect = 7.1e-06, P = 7.1e-06
 Identities = 62/245 (25%), Positives = 83/245 (33%)

Query:   239 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQ-------GHGPPPSATTAGVVGAGPNTS 291
             G  GG  G         P G + + DG G P        G GP P     G  G G N  
Sbjct:   656 GGPGGPVGPRLLGPPPPPRGGDPFWDGPGDPMRGGPMRGGPGPGPGPYHRGRGGRGGNEP 715

Query:   292 TSAYAATQSGTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDP 350
                        P R A     G G    +G PG        + P +GP        G+ P
Sbjct:   716 PPP-----PPPPFRGARGGRSGGGPPNGRGGPGGGMVGGGGHRPHEGPGGGMGNSSGHRP 770

Query:   351 TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVP 410
              +GPG  +  GS +    GP   +  G  + P  G G  +  G  +    GPG       
Sbjct:   771 HEGPG--SGMGSGHRPHEGPGSSMGGGGGHRPHEGPGGGISGGSGHRPHEGPGGGMGAGG 828

Query:   411 GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG-QGYDMRRAPSYD----PSRGTGFDGAPR 465
             G+    GP      +  + P  GPG+    G + +D+     +D    P    G DG   
Sbjct:   829 GHRPHEGPGGSMGGSGGHRPHEGPGHGGPHGHRPHDVPGHRGHDHRGPPHEHRGHDGPGH 888

Query:   466 GAAPH 470
             G   H
Sbjct:   889 GGGGH 893


>UNIPROTKB|F1SKM1
            symbol:COL7A1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0031012 "extracellular matrix" evidence=IDA]
            [GO:0004867 "serine-type endopeptidase inhibitor activity"
            evidence=IEA] InterPro:IPR002035 InterPro:IPR002223
            InterPro:IPR003961 Pfam:PF00014 Pfam:PF00041 Pfam:PF00092
            PRINTS:PR00759 PROSITE:PS50234 PROSITE:PS50279 PROSITE:PS50853
            SMART:SM00060 SMART:SM00327 Gene3D:2.60.40.10 InterPro:IPR013783
            GO:GO:0004867 SUPFAM:SSF49265 Gene3D:4.10.410.10 InterPro:IPR020901
            SUPFAM:SSF57362 PROSITE:PS00280 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0005604 OMA:RRVCTTA GeneTree:ENSGT00700000104301
            EMBL:CU633242 Ensembl:ENSSSCT00000012432 ArrayExpress:F1SKM1
            Uniprot:F1SKM1
        Length = 2939

 Score = 148 (57.2 bits), Expect = 3.6e-06, P = 3.6e-06
 Identities = 82/272 (30%), Positives = 105/272 (38%)

Query:   256 PVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG-P 314
             P G        G P   GPP SA   G  G  P    S  +  + GTP  +    P+G P
Sbjct:  1270 PPGPPGLPGRIGAPGPPGPPGSAIAKGERGF-PGADGSPGSPGRPGTPGTSG---PKGSP 1325

Query:   315 GYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGPNY 372
             G+   +G PG    + P  +P +       +GPG    KG PG     GS     RGP+ 
Sbjct:  1326 GWPGPRGEPGERGPRGPKGEPGEPGRVIGGEGPGLPGQKGDPGLPGPPGS-----RGPSG 1380

Query:   373 DIH-RGPSYDPQRGL----GYDMQRGPNY--DMQRGPGYE-TQRVPGYDVQRGPV----Y 420
             D   RGP   P   +    G   +RGP    D    PG      +PG    +GPV     
Sbjct:  1381 DPGPRGPPGFPGTAVKGEKGDRGERGPPGPGDGTAAPGDPGLPGLPGSPGPQGPVGPPGE 1440

Query:   421 EAQRAPSYIPQRG----PGYDLQRG-QGYDMRRAPSYDPSRG-TGFDGAPRGAAPHGQVP 474
             + ++  S     G    PG   +RG +G+     P  D  RG TG  G P      G  P
Sbjct:  1441 KGEKGDSEDGAPGLPGQPGVPGERGLRGFPGDTGPKGD--RGLTGAVGEPGEKGERGS-P 1497

Query:   475 PPLNNVPYGSATPPARSGSGQPRG--GNPARR 504
              P+   P G    P R G+  P G  G   RR
Sbjct:  1498 GPVG--PQGPPGVPGRPGAEGPEGPPGPTGRR 1527

 Score = 127 (49.8 bits), Expect = 0.00067, P = 0.00067
 Identities = 78/280 (27%), Positives = 100/280 (35%)

Query:   235 RAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSA 294
             R   G  G      E    GR +G     +G G+P   G P      G    GP+     
Sbjct:  1331 RGEPGERGPRGPKGEPGEPGRVIGG----EGPGLPGQKGDPGLPGPPG--SRGPSGDPGP 1384

Query:   295 YAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP 354
                   G P  A        G     GPG D + AP  DP   P    + GP   P   P
Sbjct:  1385 RGPP--GFPGTAVKGEKGDRGERGPPGPG-DGTAAPG-DPGL-PGLPGSPGP-QGPVGPP 1438

Query:   355 GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRG-------PGYET 406
             G   +KG + D   G    +   P    +RGL G+    GP  D  RG       PG + 
Sbjct:  1439 GEKGEKGDSEDGAPG----LPGQPGVPGERGLRGFPGDTGPKGD--RGLTGAVGEPGEKG 1492

Query:   407 QR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPR 465
             +R  PG    +GP     R  +  P+  PG   +RG+  +  R P  DP+ G G  GA  
Sbjct:  1493 ERGSPGPVGPQGPPGVPGRPGAEGPEGPPGPTGRRGEKGEPGR-PG-DPAVGPGGAGAKG 1550

Query:   466 GAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPARR 504
                  G   P       G   PP  +  G P   G+P  R
Sbjct:  1551 EKGDMGPTGPKGATGTKGERGPPGLALPGDPGPKGDPGER 1590


>MGI|MGI:1344412
            symbol:Ldb3 "LIM domain binding 3" species:10090 "Mus
            musculus" [GO:0005080 "protein kinase C binding" evidence=IDA]
            [GO:0005515 "protein binding" evidence=IPI] [GO:0005737 "cytoplasm"
            evidence=IEA] [GO:0005856 "cytoskeleton" evidence=ISO] [GO:0008092
            "cytoskeletal protein binding" evidence=ISO] [GO:0008270 "zinc ion
            binding" evidence=IEA] [GO:0030018 "Z disc" evidence=ISO;IDA]
            [GO:0042995 "cell projection" evidence=IEA] [GO:0045214 "sarcomere
            organization" evidence=IMP] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0051371 "muscle alpha-actinin binding"
            evidence=IDA;IPI] Pfam:PF00595 Pfam:PF00412 InterPro:IPR001478
            InterPro:IPR001781 PROSITE:PS00478 PROSITE:PS50023 PROSITE:PS50106
            SMART:SM00132 SMART:SM00228 MGI:MGI:1344412 GO:GO:0048471
            GO:GO:0005080 GO:GO:0030018 GO:GO:0005856 GO:GO:0046872
            GO:GO:0008270 GO:GO:0031143 Gene3D:2.10.110.10 SUPFAM:SSF50156
            CTD:11155 eggNOG:NOG286537 HOVERGEN:HBG051478 OMA:CTSQATT
            OrthoDB:EOG4GTKDQ InterPro:IPR006643 SMART:SM00735 EMBL:AF114378
            EMBL:AF114379 EMBL:AJ005621 EMBL:AF228057 EMBL:AF228058
            EMBL:AY206011 EMBL:AY206012 EMBL:AY206013 EMBL:AY206015
            EMBL:AK172980 EMBL:AK004020 EMBL:AK137181 EMBL:AK142292
            EMBL:BC099596 EMBL:BC138793 EMBL:BC145420 IPI:IPI00123369
            IPI:IPI00323030 IPI:IPI00403041 IPI:IPI00621572 IPI:IPI00625287
            IPI:IPI00656173 RefSeq:NP_001034160.1 RefSeq:NP_001034161.1
            RefSeq:NP_001034162.1 RefSeq:NP_001034163.1 RefSeq:NP_001034164.1
            RefSeq:NP_001034165.1 RefSeq:NP_036048.3 UniGene:Mm.29733 PDB:1WJL
            PDBsum:1WJL ProteinModelPortal:Q9JKS4 SMR:Q9JKS4 IntAct:Q9JKS4
            MINT:MINT-97840 STRING:Q9JKS4 PhosphoSite:Q9JKS4 PaxDb:Q9JKS4
            PRIDE:Q9JKS4 Ensembl:ENSMUST00000022327 Ensembl:ENSMUST00000022328
            Ensembl:ENSMUST00000022330 Ensembl:ENSMUST00000090040 GeneID:24131
            KEGG:mmu:24131 UCSC:uc007taz.1 UCSC:uc007tba.1 UCSC:uc007tbc.1
            UCSC:uc007tbd.1 UCSC:uc007tbe.1 UCSC:uc007tbf.1
            GeneTree:ENSGT00700000104411 InParanoid:B2RSB0
            EvolutionaryTrace:Q9JKS4 NextBio:304169 Bgee:Q9JKS4 CleanEx:MM_LDB3
            Genevestigator:Q9JKS4 GermOnline:ENSMUSG00000021798 Uniprot:Q9JKS4
        Length = 723

 Score = 141 (54.7 bits), Expect = 3.9e-06, P = 3.9e-06
 Identities = 49/181 (27%), Positives = 69/181 (38%)

Query:   253 SGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 312
             S  P    +Y +G   P    P P   T   +   P+      A++ S +P  A Y  P 
Sbjct:   371 SPAPSAHTSYSEGPAAP---APKPRVVTTASIR--PSVYQPVPASSYSPSP-GANYS-PT 423

Query:   313 GPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNY 372
              P Y  S  P Y  S AP+Y P+  P+Y P+  P Y P+  P Y     + Y    GP+ 
Sbjct:   424 -P-YTPSPAPAYTPSPAPTYTPSPAPTYSPSPAPAYTPSPAPNYTPTPSAAYSG--GPSE 479

Query:   373 DIHRGP-----SYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVPGYDVQRGPVYEAQRAP 426
                R P     S+  +   G          + RG P Y         + RG    A+R P
Sbjct:   480 SASRPPWVTDDSFSQKFAPGKSTTTVSKQTLPRGAPAYNPTGPQVTPLARGTFQRAERFP 539

Query:   427 S 427
             +
Sbjct:   540 A 540

 Score = 135 (52.6 bits), Expect = 1.8e-05, P = 1.8e-05
 Identities = 55/192 (28%), Positives = 70/192 (36%)

Query:   266 YGVPQGHGPPPSATTAGVVGAG-----PNTSTSAYAATQSGTPMRAAYDIPRGPGYEASK 320
             Y       P PSA T+   G       P   T+A        P+ A+   P  PG   S 
Sbjct:   364 YSPAAAASPAPSAHTSYSEGPAAPAPKPRVVTTASIRPSVYQPVPASSYSP-SPGANYSP 422

Query:   321 GPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSY 380
              P Y  S AP+Y P+  P+Y P+  P Y P+  P Y      NY       Y    GPS 
Sbjct:   423 TP-YTPSPAPAYTPSPAPTYTPSPAPTYSPSPAPAYTPSPAPNYTPTPSAAYS--GGPSE 479

Query:   381 DPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYD-LQ 439
                R          ++  +  PG  T  V    + RG       AP+Y P  GP    L 
Sbjct:   480 SASRP---PWVTDDSFSQKFAPGKSTTTVSKQTLPRG-------APAYNPT-GPQVTPLA 528

Query:   440 RGQGYDMRRAPS 451
             RG      R P+
Sbjct:   529 RGTFQRAERFPA 540

 Score = 132 (51.5 bits), Expect = 3.8e-05, P = 3.8e-05
 Identities = 56/213 (26%), Positives = 74/213 (34%)

Query:   277 SATTAGVVGA---GPNTSTSAYAATQSGTPMRAAY-DIPRGPGYEASKGPGY--DASKAP 330
             +A+ AG   +    P    SAY+   + +P  +A+     GP   A K P     AS  P
Sbjct:   343 AASAAGPAASPVENPRPQASAYSPAAAASPAPSAHTSYSEGPAAPAPK-PRVVTTASIRP 401

Query:   331 S-YDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYD 389
             S Y P    SY P+ G  Y PT  P Y       Y     P Y     P+Y P     Y 
Sbjct:   402 SVYQPVPASSYSPSPGANYSPT--P-YTPSPAPAYTPSPAPTYTPSPAPTYSPSPAPAYT 458

Query:   390 MQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR- 448
                 PNY       Y     P     R P        S+  +  PG          + R 
Sbjct:   459 PSPAPNYTPTPSAAYSGG--PSESASRPPWVTDD---SFSQKFAPGKSTTTVSKQTLPRG 513

Query:   449 APSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVP 481
             AP+Y+P+ G       RG     +  P  +  P
Sbjct:   514 APAYNPT-GPQVTPLARGTFQRAERFPASSRTP 545


>UNIPROTKB|O75112
            symbol:LDB3 "LIM domain-binding protein 3" species:9606
            "Homo sapiens" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0005080 "protein kinase C binding" evidence=IEA] [GO:0031143
            "pseudopodium" evidence=IEA] [GO:0048471 "perinuclear region of
            cytoplasm" evidence=IEA] [GO:0005515 "protein binding"
            evidence=IPI] [GO:0005856 "cytoskeleton" evidence=IDA] [GO:0008092
            "cytoskeletal protein binding" evidence=IPI] [GO:0030018 "Z disc"
            evidence=IDA] Pfam:PF00595 Pfam:PF00412 InterPro:IPR001478
            InterPro:IPR001781 PROSITE:PS00478 PROSITE:PS50023 PROSITE:PS50106
            SMART:SM00132 SMART:SM00228 GO:GO:0048471 GO:GO:0030018
            GO:GO:0005856 GO:GO:0046872 GO:GO:0008270 Orphanet:154
            GO:GO:0031143 Gene3D:2.10.110.10 Orphanet:54260 SUPFAM:SSF50156
            EMBL:AJ133766 EMBL:AJ133767 EMBL:AJ133768 EMBL:AF276807
            EMBL:AF276808 EMBL:AF276809 EMBL:AB014513 EMBL:AK304760
            EMBL:EF179181 EMBL:AC067750 EMBL:BC010929 IPI:IPI00165263
            IPI:IPI00294958 IPI:IPI00294959 IPI:IPI00514458 IPI:IPI00552865
            IPI:IPI00654766 IPI:IPI00909817 RefSeq:NP_001073583.1
            RefSeq:NP_001073584.1 RefSeq:NP_001073585.1 RefSeq:NP_001165081.1
            RefSeq:NP_001165082.1 RefSeq:NP_009009.1 UniGene:Hs.657271 PDB:1RGW
            PDBsum:1RGW ProteinModelPortal:O75112 SMR:O75112 IntAct:O75112
            STRING:O75112 PhosphoSite:O75112 UCD-2DPAGE:O75112
            UCD-2DPAGE:Q9Y4Z5 PaxDb:O75112 PRIDE:O75112 DNASU:11155
            Ensembl:ENST00000263066 Ensembl:ENST00000310944
            Ensembl:ENST00000352360 Ensembl:ENST00000361373
            Ensembl:ENST00000372056 Ensembl:ENST00000372066
            Ensembl:ENST00000429277 Ensembl:ENST00000458213
            Ensembl:ENST00000542786 GeneID:11155 KEGG:hsa:11155 UCSC:uc001kdr.3
            UCSC:uc001kds.3 UCSC:uc001kdu.3 UCSC:uc001kdv.3 UCSC:uc009xsy.3
            UCSC:uc009xsz.3 CTD:11155 GeneCards:GC10P088426 HGNC:HGNC:15710
            HPA:HPA048955 MIM:601493 MIM:605906 MIM:609452 neXtProt:NX_O75112
            Orphanet:247 Orphanet:609 Orphanet:98912 PharmGKB:PA30318
            eggNOG:NOG286537 HOGENOM:HOG000220936 HOVERGEN:HBG051478
            InParanoid:O75112 OMA:CTSQATT OrthoDB:EOG4GTKDQ ChiTaRS:LDB3
            EvolutionaryTrace:O75112 GenomeRNAi:11155 NextBio:42413
            ArrayExpress:O75112 Bgee:O75112 Genevestigator:O75112
            GermOnline:ENSG00000122367 InterPro:IPR006643 SMART:SM00735
            Uniprot:O75112
        Length = 727

 Score = 141 (54.7 bits), Expect = 4.0e-06, P = 4.0e-06
 Identities = 53/183 (28%), Positives = 72/183 (39%)

Query:   253 SGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 312
             S  P    +Y +G   P    P P   T   +   P+      A+T S +P  A Y  P 
Sbjct:   375 SSAPATHTSYSEGPAAP---APKPRVVTTASIR--PSVYQPVPASTYSPSP-GANYS-PT 427

Query:   313 GPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNY 372
              P Y  S  P Y  S AP+Y P+  P+Y P+  P Y P+  P Y+      Y    GP  
Sbjct:   428 -P-YTPSPAPAYTPSPAPAYTPSPVPTYTPSPAPAYTPSPAPNYNPAPSVAYSG--GPAE 483

Query:   373 DIHRGP-----SYDPQRGLGYDMQRGPNYDMQRG-PGYETQ--RVPGYDVQRGPVYEAQR 424
                R P     S+  +   G          + RG P Y     +VP   + RG V  A+R
Sbjct:   484 PASRPPWVTDDSFSQKFAPGKSTTSISKQTLPRGGPAYTPAGPQVP--PLARGTVQRAER 541

Query:   425 APS 427
              P+
Sbjct:   542 FPA 544


>UNIPROTKB|G7N928
            symbol:EGK_04858 "Putative uncharacterized protein"
            species:9544 "Macaca mulatta" [GO:0005201 "extracellular matrix
            structural constituent" evidence=ISS] [GO:0005587 "collagen type
            IV" evidence=ISS] [GO:0005605 "basal lamina" evidence=ISS]
            [GO:0032836 "glomerular basement membrane development"
            evidence=ISS] InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403
            SMART:SM00111 InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005605
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 GO:GO:0005587
            Gene3D:2.170.240.10 GO:GO:0032836 EMBL:CM001264 Uniprot:G7N928
        Length = 1692

 Score = 145 (56.1 bits), Expect = 4.1e-06, P = 4.1e-06
 Identities = 81/261 (31%), Positives = 100/261 (38%)

Query:   254 GRPVGQNA-YEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 312
             G  V  N  Y    G P   GPP      G  GA P  S S     + GTP     +IP 
Sbjct:   663 GDTVSCNVTYPGRQGPPGFDGPPGPKGFPGPQGA-PGLSGSDGHKGRPGTP--GTSEIPG 719

Query:   313 GPGYEASKG-PGYDASKAPSYDPTKGPSYDPA----KG-PGYDPTKGP-GYDAQKG-SNY 364
              PG+    G PG+   K  S     GP   P     KG PG DP  G  G   ++G S  
Sbjct:   720 PPGFRGDIGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPG-DPAFGHLGPPGKRGLSGV 778

Query:   365 DAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRG-PGYETQRVPGYDVQRG-PVY 420
                +GP  D    P  +   G+ G+   +GP   +   G PG      PG+  +RG P  
Sbjct:   779 PGIKGPRGDPGY-PGAEGPAGIPGFPGLKGPKGREGHAGFPGVPGP--PGHSCERGAPGI 835

Query:   421 EAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNN 479
               Q  P      G PG    +GQ  D+   P   P+   G  G P     HG  PP L  
Sbjct:   836 PGQ--PGLPGDPGSPGAPGGKGQPGDV--GPP-GPAGMKGLPGLPGRPGAHG--PPGLPG 888

Query:   480 VP--YGSATPPARSGSGQPRG 498
             +P  +G    P   G   PRG
Sbjct:   889 IPGPFGDDGLPGPPGPKGPRG 909

 Score = 141 (54.7 bits), Expect = 1.1e-05, P = 1.1e-05
 Identities = 77/252 (30%), Positives = 97/252 (38%)

Query:   270 QGH-GPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKG-PGYD 325
             +GH G P      G  G  G    T +   T  G      +D P GP G+   +G PG  
Sbjct:   641 RGHPGVPGRPGVRGPDGLKGQKGDTVSCNVTYPGRQGPPGFDGPPGPKGFPGPQGAPGLS 700

Query:   326 ASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGP----NYDIHRGPS 379
              S      P T G S  P   PG+    G PG+  +KGS+     GP      +  +G  
Sbjct:   701 GSDGHKGRPGTPGTSEIPGP-PGFRGDIGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIP 759

Query:   380 YDPQRG-LGYDMQRG----PNYDMQRG-PGYETQR----VPGYDVQRGPVYEAQRA--PS 427
              DP  G LG   +RG    P     RG PGY        +PG+   +GP      A  P 
Sbjct:   760 GDPAFGHLGPPGKRGLSGVPGIKGPRGDPGYPGAEGPAGIPGFPGLKGPKGREGHAGFPG 819

Query:   428 YIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSAT 486
              +P   PG+  +RG  G   +     DP    G  GAP G    G V PP      G   
Sbjct:   820 -VPGP-PGHSCERGAPGIPGQPGLPGDP----GSPGAPGGKGQPGDVGPPGPAGMKGLPG 873

Query:   487 PPARSGSGQPRG 498
              P R G+  P G
Sbjct:   874 LPGRPGAHGPPG 885

 Score = 124 (48.7 bits), Expect = 0.00077, P = 0.00077
 Identities = 81/259 (31%), Positives = 100/259 (38%)

Query:   256 PVGQNAYEDGY-GVP--QGH-GPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDI 310
             PVG      G+ G P  +GH G P      G  G  G    T +   T  G      +D 
Sbjct:   626 PVGPPGL--GFPGPPGERGHPGVPGRPGVRGPDGLKGQKGDTVSCNVTYPGRQGPPGFDG 683

Query:   311 PRGP-GYEASKG-PGYDASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDA 366
             P GP G+   +G PG   S      P T G S  P   PG+    G PG+  +KGS+   
Sbjct:   684 PPGPKGFPGPQGAPGLSGSDGHKGRPGTPGTSEIPGP-PGFRGDIGDPGFGGEKGSSPVG 742

Query:   367 QRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE-TQRVPGYDVQRG-PVYEAQR 424
               GP       P  + Q+G+  D    P +     PG      VPG    RG P Y    
Sbjct:   743 PPGPP----GSPGVNGQKGIPGD----PAFGHLGPPGKRGLSGVPGIKGPRGDPGYPGAE 794

Query:   425 APSYIPQRGPGYDLQRG-QGYDMRRAPSYDPSRGTGFD-GAPRGAAPHGQVPPPLNNVPY 482
              P+ IP   PG    +G +G+     P      G   + GAP    P GQ  P L   P 
Sbjct:   795 GPAGIPGF-PGLKGPKGREGH--AGFPGVPGPPGHSCERGAP--GIP-GQ--PGLPGDP- 845

Query:   483 GSATPPARSGSGQPRGGNP 501
             GS  P A  G GQP    P
Sbjct:   846 GS--PGAPGGKGQPGDVGP 862


>UNIPROTKB|G7PK77
            symbol:EGM_04376 "Putative uncharacterized protein"
            species:9541 "Macaca fascicularis" [GO:0005201 "extracellular
            matrix structural constituent" evidence=ISS] [GO:0005587 "collagen
            type IV" evidence=ISS] [GO:0005605 "basal lamina" evidence=ISS]
            [GO:0032836 "glomerular basement membrane development"
            evidence=ISS] InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403
            SMART:SM00111 InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005605
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 GO:GO:0005587
            Gene3D:2.170.240.10 GO:GO:0032836 EMBL:CM001287 Uniprot:G7PK77
        Length = 1695

 Score = 145 (56.1 bits), Expect = 4.1e-06, P = 4.1e-06
 Identities = 81/261 (31%), Positives = 100/261 (38%)

Query:   254 GRPVGQNA-YEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 312
             G  V  N  Y    G P   GPP      G  GA P  S S     + GTP     +IP 
Sbjct:   663 GDTVSCNVTYPGRQGPPGFDGPPGPKGFPGPQGA-PGLSGSDGHKGRPGTP--GTSEIPG 719

Query:   313 GPGYEASKG-PGYDASKAPSYDPTKGPSYDPA----KG-PGYDPTKGP-GYDAQKG-SNY 364
              PG+    G PG+   K  S     GP   P     KG PG DP  G  G   ++G S  
Sbjct:   720 PPGFRGDIGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPG-DPAFGHLGPPGKRGLSGV 778

Query:   365 DAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRG-PGYETQRVPGYDVQRG-PVY 420
                +GP  D    P  +   G+ G+   +GP   +   G PG      PG+  +RG P  
Sbjct:   779 PGIKGPRGDPGY-PGAEGPAGIPGFPGLKGPKGREGHAGFPGVPGP--PGHSCERGAPGI 835

Query:   421 EAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNN 479
               Q  P      G PG    +GQ  D+   P   P+   G  G P     HG  PP L  
Sbjct:   836 PGQ--PGLPGDPGSPGAPGGKGQPGDV--GPP-GPAGMKGLPGLPGRPGAHG--PPGLPG 888

Query:   480 VP--YGSATPPARSGSGQPRG 498
             +P  +G    P   G   PRG
Sbjct:   889 IPGPFGDDGLPGPPGPKGPRG 909

 Score = 141 (54.7 bits), Expect = 1.1e-05, P = 1.1e-05
 Identities = 77/252 (30%), Positives = 97/252 (38%)

Query:   270 QGH-GPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKG-PGYD 325
             +GH G P      G  G  G    T +   T  G      +D P GP G+   +G PG  
Sbjct:   641 RGHPGVPGRPGVRGPDGLKGQKGDTVSCNVTYPGRQGPPGFDGPPGPKGFPGPQGAPGLS 700

Query:   326 ASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGP----NYDIHRGPS 379
              S      P T G S  P   PG+    G PG+  +KGS+     GP      +  +G  
Sbjct:   701 GSDGHKGRPGTPGTSEIPGP-PGFRGDIGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIP 759

Query:   380 YDPQRG-LGYDMQRG----PNYDMQRG-PGYETQR----VPGYDVQRGPVYEAQRA--PS 427
              DP  G LG   +RG    P     RG PGY        +PG+   +GP      A  P 
Sbjct:   760 GDPAFGHLGPPGKRGLSGVPGIKGPRGDPGYPGAEGPAGIPGFPGLKGPKGREGHAGFPG 819

Query:   428 YIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSAT 486
              +P   PG+  +RG  G   +     DP    G  GAP G    G V PP      G   
Sbjct:   820 -VPGP-PGHSCERGAPGIPGQPGLPGDP----GSPGAPGGKGQPGDVGPPGPAGMKGLPG 873

Query:   487 PPARSGSGQPRG 498
              P R G+  P G
Sbjct:   874 LPGRPGAHGPPG 885

 Score = 124 (48.7 bits), Expect = 0.00077, P = 0.00077
 Identities = 81/259 (31%), Positives = 100/259 (38%)

Query:   256 PVGQNAYEDGY-GVP--QGH-GPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDI 310
             PVG      G+ G P  +GH G P      G  G  G    T +   T  G      +D 
Sbjct:   626 PVGPPGL--GFPGPPGERGHPGVPGRPGVRGPDGLKGQKGDTVSCNVTYPGRQGPPGFDG 683

Query:   311 PRGP-GYEASKG-PGYDASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDA 366
             P GP G+   +G PG   S      P T G S  P   PG+    G PG+  +KGS+   
Sbjct:   684 PPGPKGFPGPQGAPGLSGSDGHKGRPGTPGTSEIPGP-PGFRGDIGDPGFGGEKGSSPVG 742

Query:   367 QRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE-TQRVPGYDVQRG-PVYEAQR 424
               GP       P  + Q+G+  D    P +     PG      VPG    RG P Y    
Sbjct:   743 PPGPP----GSPGVNGQKGIPGD----PAFGHLGPPGKRGLSGVPGIKGPRGDPGYPGAE 794

Query:   425 APSYIPQRGPGYDLQRG-QGYDMRRAPSYDPSRGTGFD-GAPRGAAPHGQVPPPLNNVPY 482
              P+ IP   PG    +G +G+     P      G   + GAP    P GQ  P L   P 
Sbjct:   795 GPAGIPGF-PGLKGPKGREGH--AGFPGVPGPPGHSCERGAP--GIP-GQ--PGLPGDP- 845

Query:   483 GSATPPARSGSGQPRGGNP 501
             GS  P A  G GQP    P
Sbjct:   846 GS--PGAPGGKGQPGDVGP 862


>UNIPROTKB|P04258
            symbol:COL3A1 "Collagen alpha-1(III) chain" species:9913
            "Bos taurus" [GO:0005581 "collagen" evidence=IEA] PROSITE:PS01208
            GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793
            HOGENOM:HOG000085654 HOVERGEN:HBG004933 IPI:IPI00731432 PIR:A02862
            UniGene:Bt.64714 STRING:P04258 PRIDE:P04258 Uniprot:P04258
        Length = 1049

 Score = 142 (55.0 bits), Expect = 5.0e-06, P = 5.0e-06
 Identities = 82/262 (31%), Positives = 97/262 (37%)

Query:   253 SGRPVGQNAYEDGYGVPQ---GHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYD 309
             SG P G+       G P    G GPP      G  G  P    SA      G P      
Sbjct:   521 SGAP-GERGPPGAGGPPGPRGGAGPPGPEGGKGAAGP-PGPPGSAGTPGLQGMPGERGG- 577

Query:   310 IPRGPGYEASKG-PGYDA-SKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQ 367
              P GPG +  KG PG      AP  D  +GP+  P   PG  P   PG   + G+     
Sbjct:   578 -PGGPGPKGDKGEPGSSGVDGAPGKDGPRGPT-GPIGPPG--PAGQPGDKGESGA----- 628

Query:   368 RGPNYDIHRGPSYDP-QRG-LGYDMQRG-PNYDMQRG-PGYETQR-VPGYDVQRGPVYEA 422
               P      GP   P +RG  G     G P    Q G PG + +R  PG   + GP   A
Sbjct:   629 --PGVPGIAGPRGGPGERGEQGPPGPAGFPGAPGQNGEPGAKGERGAPGEKGEGGPPGAA 686

Query:   423 QRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPY 482
               A    P   PG    +G+    R +P      G G  G P G  P G  PP  N  P 
Sbjct:   687 GPAGGSGPAGPPGPQGVKGE----RGSPG-----GPGAAGFPGGRGPPG--PPGSNGNPG 735

Query:   483 --GSATPPARSGSGQPRGGNPA 502
               GS+  P + G   P G N A
Sbjct:   736 PPGSSGAPGKDGPPGPPGSNGA 757

 Score = 141 (54.7 bits), Expect = 6.4e-06, P = 6.4e-06
 Identities = 88/294 (29%), Positives = 106/294 (36%)

Query:   229 APNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-G 287
             A +V    A G   G  G +       P G + +    G P   GPP     AG  G  G
Sbjct:     4 AYDVKSGVAGGGIAGYPGPAGPPGPPGPPGTSGHPGAPGAPGYQGPPGEPGQAGPAGPPG 63

Query:   288 PNTSTSAYAAT-QSGTPMRAAYDIPRG-PGYEASKGP----GYDASKAP-SYDPTKGPSY 340
             P  +        +SG P R     PRG PG    KGP    G+   K    +D   G   
Sbjct:    64 PPGAIGPSGKDGESGRPGRPG---PRGFPGPPGMKGPAGMPGFPGMKGHRGFDGRNGEKG 120

Query:   341 DPAKGPGYDPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDM 398
             +P   PG     G PG D   G      RG   +  R P      G  G D  RG   D 
Sbjct:   121 EPG-APGLKGENGVPGEDGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DG 174

Query:   399 QRGP-GYE-TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSR 456
             Q GP G   T   PG    +G V  A    S      PG   QRG+      A +  P  
Sbjct:   175 QPGPPGPPGTAGFPGSPGAKGEVGPAGSPGS---SGAPG---QRGEPGPQGHAGAPGPPG 228

Query:   457 GTGFDGAPRGAAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RG--GNPAR 503
               G DG+P G    G  P  +   P   G+  PP   G+ G P  RG  G P +
Sbjct:   229 PPGSDGSPGGKGEMG--PAGIPGAPGLIGARGPPGPPGTNGVPGQRGAAGEPGK 280

 Score = 121 (47.7 bits), Expect = 0.00094, P = 0.00094
 Identities = 83/282 (29%), Positives = 99/282 (35%)

Query:   238 DGSYG--GATG----NSENETSG-R-PVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPN 289
             DGS G  GA G      E    G R P G N      G P   G P  A   GV G  P 
Sbjct:   311 DGSPGEPGANGLPGAAGERGVPGFRGPAGANGLPGEKGPPGDRGGPGPAGPRGVAGE-PG 369

Query:   290 TSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYD 349
              +         G  +R     P GPG     GP     +     P   P   P   PG  
Sbjct:   370 RN-----GLPGGPGLRGIPGSPGGPGSNGKPGPPGSQGETGRPGPPGSPG--PRGQPGVM 422

Query:   350 PTKGP-GYDAQKGSNYDAQRGPNYDIHRGPS-YDPQRG-LGYDMQRGPNYDM-QRGPGYE 405
                GP G D   G N + + GP     +GP+  + + G  G     GP+ D    GP   
Sbjct:   423 GFPGPKGNDGAPGKNGE-RGGPGGPGPQGPAGKNGETGPQGPPGPTGPSGDKGDTGPP-G 480

Query:   406 TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYD-LQRGQGYDMRRAPSYDPSRGTGFDGAP 464
              Q + G     GP  E  +     P+   G   +  G+G D   AP     RG    G P
Sbjct:   481 PQGLQGLPGTSGPPGENGKPGEPGPKGEAGAPGIPGGKG-DSG-APG---ERGPPGAGGP 535

Query:   465 RGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNPARR 504
              G  P G   PP      G+A PP   GS    G  G P  R
Sbjct:   536 PG--PRGGAGPPGPEGGKGAAGPPGPPGSAGTPGLQGMPGER 575


>UNIPROTKB|C9JGE3
            symbol:EWSR1 "Ewing sarcoma breakpoint region 1, isoform
            CRA_e" species:9606 "Homo sapiens" [GO:0000166 "nucleotide binding"
            evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0005622 "intracellular" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102
            PROSITE:PS50199 SMART:SM00360 SMART:SM00547 GO:GO:0000166
            EMBL:CH471095 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
            GO:GO:0005622 EMBL:AC002059 EMBL:AL031186 EMBL:AC000026
            UniGene:Hs.374477 HGNC:HGNC:3508 HOGENOM:HOG000038010 ChiTaRS:EWSR1
            IPI:IPI00953325 SMR:C9JGE3 STRING:C9JGE3 Ensembl:ENST00000332050
            UCSC:uc003aez.3 Uniprot:C9JGE3
        Length = 583

 Score = 127 (49.8 bits), Expect = 5.0e-06, Sum P(2) = 5.0e-06
 Identities = 68/254 (26%), Positives = 95/254 (37%)

Query:   239 GSYGGATGNSENETSGRPV-GQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 291
             G+YG  T  S  +       GQ AY   YG P  G+  P  P A +  V G G    +T+
Sbjct:    42 GTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTT 101

Query:   292 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPS--YDPTKGPSYDPAKGPGY 348
             T+    TQ+    ++AY   P  P Y   + P   A   P     PT+      + G GY
Sbjct:   102 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPTRPQDGNKPTETSQPQSSTG-GY 158

Query:   349 D-PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE 405
             + P+ G G   Q   +Y    G  P   +   PSY P     Y   +  +YD      Y 
Sbjct:   159 NQPSLGYG---QSNYSYPQVPGSYPMQPVTAPPSYPPT---SYSSTQPTSYDQS---SYS 209

Query:   406 TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGF-DGAP 464
              Q   G     G      +  SY  Q    Y  Q G  Y   +APS    + + +    P
Sbjct:   210 QQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGS-YS--QAPSQYSQQSSSYGQQRP 266

Query:   465 RGAAPHGQVPPPLN 478
                 P   + PP++
Sbjct:   267 MDEGPDLDLGPPVD 280

 Score = 57 (25.1 bits), Expect = 5.0e-06, Sum P(2) = 5.0e-06
 Identities = 19/46 (41%), Positives = 21/46 (45%)

Query:   465 RGAAP--HGQ-VPPPLNNVPYGSATP--PA--RSGSGQPRGGNPAR 503
             RG  P   G+ +PPPL   P G   P  P     G G  RGG P R
Sbjct:   382 RGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPR 427


>UNIPROTKB|G4N3H5
            symbol:MGG_04961 "Uncharacterized protein" species:242507
            "Magnaporthe oryzae 70-15" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0008150 "biological_process" evidence=ND] EMBL:CM001233
            RefSeq:XP_003712457.1 EnsemblFungi:MGG_04961T0 GeneID:2675293
            KEGG:mgr:MGG_04961 Uniprot:G4N3H5
        Length = 616

 Score = 139 (54.0 bits), Expect = 5.2e-06, P = 5.2e-06
 Identities = 58/167 (34%), Positives = 76/167 (45%)

Query:   249 ENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSG--TPMRA 306
             ++ +SGR     +     G P G   PP + TA +   GP+    AY    +G  +P  +
Sbjct:   463 DDYSSGRASPAPSMYPSRG-PGGPNMPPRSATAPIPPRGPD----AYDDYSNGRASPAPS 517

Query:   307 AYDIPRGPGYEASKGPGYDASKAPS-YDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYD 365
              Y  PRGPG     GP   AS APS Y+P + P    A GP   P +GPG+  Q+     
Sbjct:   518 MYP-PRGPG-----GPNGRASPAPSMYNPPRAPPQRSATGPM--PPRGPGFPPQRNMTAP 569

Query:   366 AQRGPN--YDIHRGP----SYDPQRGLGYDMQRGPNYDM--QRG-PG 403
             A  GP+  YD +  P    S  P RG       G N D+  QRG PG
Sbjct:   570 AP-GPDDPYDYNTRPPTSSSQAPPRGA---FGNGWNSDLENQRGGPG 612

 Score = 126 (49.4 bits), Expect = 0.00014, P = 0.00014
 Identities = 76/272 (27%), Positives = 91/272 (33%)

Query:   239 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAAT 298
             G+ G    NS ++    P  Q      Y   Q     P    A   G   + + S  +  
Sbjct:   347 GTPGSIELNSLDQKRPMPSRQGTMNSSYSSRQ-----PLVGAAAEFGRSASPAPSIPSTN 401

Query:   299 QSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDA 358
              SG        + R     +S    Y AS AP    T  P+  P   PGY   + PG   
Sbjct:   402 YSGRTYGGQPPMSRMQSNASSMSRAYTASPAPFSSDTV-PAL-PR--PGYQRNQ-PGGPP 456

Query:   359 QKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVP-GYD-VQR 416
              +  +YD            PS  P RG G     GPN   +        R P  YD    
Sbjct:   457 SRFDSYDDYSSGRAS--PAPSMYPSRGPG-----GPNMPPRSATAPIPPRGPDAYDDYSN 509

Query:   417 GPVYEAQRAPSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAA--PHGQV 473
             G    A  APS  P RGPG    R      M   P   P R       PRG    P   +
Sbjct:   510 G---RASPAPSMYPPRGPGGPNGRASPAPSMYNPPRAPPQRSATGPMPPRGPGFPPQRNM 566

Query:   474 --PPPLNNVPYGSAT-PPARSGSGQPRG--GN 500
               P P  + PY   T PP  S    PRG  GN
Sbjct:   567 TAPAPGPDDPYDYNTRPPTSSSQAPPRGAFGN 598


>UNIPROTKB|E2R2K8
            symbol:PPP1R10 "Uncharacterized protein" species:9615
            "Canis lupus familiaris" [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0003677 "DNA
            binding" evidence=IEA] InterPro:IPR000571 InterPro:IPR003617
            InterPro:IPR017923 Pfam:PF00642 Pfam:PF08711 PROSITE:PS50103
            PROSITE:PS51319 SMART:SM00356 SMART:SM00509 GO:GO:0005634
            GO:GO:0003677 GO:GO:0008270 GO:GO:0006351 Gene3D:1.20.930.10
            SUPFAM:SSF47676 CTD:5514 OMA:PPPHEHR GeneTree:ENSGT00530000063820
            EMBL:AAEX03008197 RefSeq:XP_848400.1 Ensembl:ENSCAFT00000000645
            Ensembl:ENSCAFT00000048295 GeneID:481705 KEGG:cfa:481705
            NextBio:20856447 Uniprot:E2R2K8
        Length = 940

 Score = 141 (54.7 bits), Expect = 5.5e-06, P = 5.5e-06
 Identities = 68/268 (25%), Positives = 87/268 (32%)

Query:   239 GSYGGATGNSENETSGRPV---GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAY 295
             G +GG  G+      G P    G + + DG G P   GP       G  G GP       
Sbjct:   653 GPHGGPGGSVGPRLLGPPPPPRGGDPFWDGPGDPMRGGP-----MRGGPGPGPGPYHRGR 707

Query:   296 AATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPG 355
                    P       P  P +  ++G G      P+     GP      G G+ P +GPG
Sbjct:   708 GGRGGNEPP------PPPPPFRGARG-GRSGGGPPN--GRGGPGGGMVGGGGHRPHEGPG 758

Query:   356 YDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQ 415
                   S +    GP   +  G  + P  G G  M  G  +    GPG       G+   
Sbjct:   759 GGMNSSSGHRPHEGPGGGM--GGGHRPHEGPGSSMGGGGGHRPHEGPGGGMGSGSGHRPH 816

Query:   416 RGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPP 475
              GP         + P  GPG  +  G G+         P  G G  G P G  PH  VP 
Sbjct:   817 EGPGSGMGGGSGHRPHEGPGGGMGAGGGHR--------PHEGPGH-GGPHGHRPH-DVPS 866

Query:   476 PLNNVPYGSATPPARSGSGQPRGGNPAR 503
                +   G      R   G   GG   R
Sbjct:   867 HRGHDHRGPPPHEHRGHDGPGHGGGGHR 894

 Score = 139 (54.0 bits), Expect = 9.2e-06, P = 9.2e-06
 Identities = 56/215 (26%), Positives = 74/215 (34%)

Query:   243 GATGNSENETSGRPV-GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSG 301
             G  G +E      P  G      G G P G G P      G  G  P+        + SG
Sbjct:   708 GGRGGNEPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGG-GHRPHEGPGGGMNSSSG 766

Query:   302 TPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKG 361
                        G G+   +GPG        + P +GP      G G+ P +GPG     G
Sbjct:   767 HRPHEGPGGGMGGGHRPHEGPGSSMGGGGGHRPHEGPGGGMGSGSGHRPHEGPGSGMGGG 826

Query:   362 SNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVP---GYDVQRGP 418
             S +    GP   +  G  + P  G G+    GP+       G+    VP   G+D  RGP
Sbjct:   827 SGHRPHEGPGGGMGAGGGHRPHEGPGHG---GPH-------GHRPHDVPSHRGHD-HRGP 875

Query:   419 VYEAQRA---PSYIPQRGPGYDLQRGQGYDMRRAP 450
                  R    P +      G+D     G DM   P
Sbjct:   876 PPHEHRGHDGPGHGGGGHRGHDGGHSHGGDMSNRP 910


>FB|FBgn0261885
            symbol:osa "osa" species:7227 "Drosophila melanogaster"
            [GO:0046530 "photoreceptor cell differentiation" evidence=IMP]
            [GO:0005634 "nucleus" evidence=NAS;IDA] [GO:0016055 "Wnt receptor
            signaling pathway" evidence=IMP] [GO:0008587 "imaginal disc-derived
            wing margin morphogenesis" evidence=IMP] [GO:0007379 "segment
            specification" evidence=IMP] [GO:0003677 "DNA binding"
            evidence=ISS;IDA;NAS] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IDA;IMP] [GO:0045893 "positive regulation
            of transcription, DNA-dependent" evidence=IDA] [GO:0035060 "brahma
            complex" evidence=IDA;TAS] [GO:0003713 "transcription coactivator
            activity" evidence=IC] [GO:0007476 "imaginal disc-derived wing
            morphogenesis" evidence=IMP] [GO:0048190 "wing disc dorsal/ventral
            pattern formation" evidence=IGI] [GO:0042058 "regulation of
            epidermal growth factor receptor signaling pathway" evidence=IMP]
            [GO:0007480 "imaginal disc-derived leg morphogenesis" evidence=IMP]
            [GO:0008586 "imaginal disc-derived wing vein morphogenesis"
            evidence=IMP] [GO:0022008 "neurogenesis" evidence=IMP]
            InterPro:IPR001606 Pfam:PF01388 PROSITE:PS51011 SMART:SM00501
            EMBL:AE014297 GO:GO:0048190 GO:GO:0045893 GO:GO:0016055
            GO:GO:0003677 GO:GO:0008586 GO:GO:0006351 GO:GO:0016568
            eggNOG:NOG12793 GO:GO:0007379 GO:GO:0007480 KO:K11653
            Gene3D:1.10.150.60 InterPro:IPR021906 Pfam:PF12031 SUPFAM:SSF46774
            GeneTree:ENSGT00550000074575 GO:GO:0046530 GO:GO:0008587
            GO:GO:0035060 GO:GO:0042058 EMBL:AF053091 PIR:T13049
            RefSeq:NP_001163639.1 RefSeq:NP_524392.2 RefSeq:NP_732263.1
            UniGene:Dm.2989 ProteinModelPortal:Q8IN94 SMR:Q8IN94 DIP:DIP-20699N
            IntAct:Q8IN94 MINT:MINT-297379 STRING:Q8IN94 PaxDb:Q8IN94
            PRIDE:Q8IN94 EnsemblMetazoa:FBtr0089581 EnsemblMetazoa:FBtr0301487
            GeneID:42130 KEGG:dme:Dmel_CG7467 CTD:42130 FlyBase:FBgn0261885
            InParanoid:Q8IN94 OMA:SQMGQGP OrthoDB:EOG4MCVF9 PhylomeDB:Q8IN94
            ChiTaRS:osa GenomeRNAi:42130 NextBio:827314 Bgee:Q8IN94
            GermOnline:CG7467 Uniprot:Q8IN94
        Length = 2716

 Score = 153 (58.9 bits), Expect = 6.2e-06, Sum P(2) = 6.2e-06
 Identities = 92/349 (26%), Positives = 133/349 (38%)

Query:   170 IPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNA 229
             I A  S   +LR+  H+ +    +E  F    ++ L ++++    +    +K  A+  + 
Sbjct:  1065 IGASSSAAYTLRK--HYTKNLLTFECHFDRGDIDPLPIIQQ----VEAGSKKKTAKAASV 1118

Query:   230 PNVDRRAADGSYGGATGNSENETS-GRPVGQ--NAYEDGY-GVPQGHGPPPSATTAGVVG 285
             P+      D     +TG+S ++ S   P G   NA  DGY G P G  P P A+     G
Sbjct:  1119 PSPGGGHLDAGTTNSTGSSNSQDSFPAPPGSAPNAAIDGYPGYPGG-SPYPVAS-----G 1172

Query:   286 AGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKA-PSYDPTK---GPSYD 341
               P+ +T   A      P +     P  PG  A+   G + S + P  DP     GP   
Sbjct:  1173 PQPDYAT---AGQMQRPPSQNNPQTPH-PGAAAAVAAGDNISVSNPFEDPIAAGGGPGSG 1228

Query:   342 PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDP----QRGLGYDMQRGPNYD 397
                GPG  P  GPG  A  G+      G     H  P + P    Q+  G   Q+ P + 
Sbjct:  1229 TGPGPGQGP--GPGA-ASGGAGAVGAVGGGPQPHPPPPHSPHTAAQQAAGQHQQQHPQHQ 1285

Query:   398 MQRGPGYET-QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSR 456
                 PG    Q+  G   Q+ P       P    Q GPG      Q +    A +  P  
Sbjct:  1286 HPGLPGPPPPQQQQGQQGQQPPPSVGGGPPPAPQQHGPGQVPPSPQQHVRPAAGAPYPPG 1345

Query:   457 GTGFDGAPRGAAPHGQVPP-PLNNVPYGSATPPARSGS-GQPRGGNPAR 503
             G+G+   P    P    P  P     YGS+     +G  GQP G  P +
Sbjct:  1346 GSGYP-TPVSRTPGSPYPSQPGAYGQYGSSDQYNATGPPGQPFGQGPGQ 1393

 Score = 44 (20.5 bits), Expect = 6.2e-06, Sum P(2) = 6.2e-06
 Identities = 7/16 (43%), Positives = 9/16 (56%)

Query:    34 PMPGAFPPFDMMPPPE 49
             P+PG  PP    P P+
Sbjct:   166 PLPGGKPPQQQQPHPQ 181

 Score = 43 (20.2 bits), Expect = 7.8e-06, Sum P(2) = 7.8e-06
 Identities = 8/18 (44%), Positives = 11/18 (61%)

Query:    30 GMRPPMPGAFPPFDMMPP 47
             GM P   G +PP+  +PP
Sbjct:   706 GM-PNHTGQYPPYQWVPP 722

 Score = 43 (20.2 bits), Expect = 7.8e-06, Sum P(2) = 7.8e-06
 Identities = 9/23 (39%), Positives = 13/23 (56%)

Query:    29 SGMRP--PMPGAFPPFDMMPPPE 49
             +G +P  P+PG  PP     PP+
Sbjct:   344 AGQQPGGPVPGGPPPGTGQQPPQ 366

 Score = 42 (19.8 bits), Expect = 9.9e-06, Sum P(2) = 9.9e-06
 Identities = 7/16 (43%), Positives = 7/16 (43%)

Query:    33 PPMPGAFPPFDMMPPP 48
             P  P   PP    PPP
Sbjct:   427 PASPHHVPPLQQQPPP 442

 Score = 39 (18.8 bits), Expect = 2.0e-05, Sum P(2) = 2.0e-05
 Identities = 7/20 (35%), Positives = 10/20 (50%)

Query:    30 GMRPPMPGAFPPFDMMPPPE 49
             G  P  P  +PP +  P P+
Sbjct:   648 GYPPQQPQQYPPGNYPPRPQ 667

 Score = 38 (18.4 bits), Expect = 2.5e-05, Sum P(2) = 2.5e-05
 Identities = 9/20 (45%), Positives = 9/20 (45%)

Query:    28 VSGMRPPMPGAFPPFDMMPP 47
             V G  PP  G  PP    PP
Sbjct:   352 VPGGPPPGTGQQPPQQNTPP 371

 Score = 37 (18.1 bits), Expect = 3.2e-05, Sum P(2) = 3.2e-05
 Identities = 6/12 (50%), Positives = 7/12 (58%)

Query:    36 PGAFPPFDMMPP 47
             PG +PP    PP
Sbjct:   659 PGNYPPRPQYPP 670


>UNIPROTKB|F1MXS8
            symbol:COL3A1 "Collagen alpha-1(III) chain" species:9913
            "Bos taurus" [GO:0071230 "cellular response to amino acid stimulus"
            evidence=IEA] [GO:0050777 "negative regulation of immune response"
            evidence=IEA] [GO:0048565 "digestive tract development"
            evidence=IEA] [GO:0048407 "platelet-derived growth factor binding"
            evidence=IEA] [GO:0046332 "SMAD binding" evidence=IEA] [GO:0043588
            "skin development" evidence=IEA] [GO:0043206 "extracellular fibril
            organization" evidence=IEA] [GO:0042060 "wound healing"
            evidence=IEA] [GO:0034097 "response to cytokine stimulus"
            evidence=IEA] [GO:0032964 "collagen biosynthetic process"
            evidence=IEA] [GO:0030199 "collagen fibril organization"
            evidence=IEA] [GO:0018149 "peptide cross-linking" evidence=IEA]
            [GO:0009314 "response to radiation" evidence=IEA] [GO:0007507
            "heart development" evidence=IEA] [GO:0007229 "integrin-mediated
            signaling pathway" evidence=IEA] [GO:0007179 "transforming growth
            factor beta receptor signaling pathway" evidence=IEA] [GO:0007160
            "cell-matrix adhesion" evidence=IEA] [GO:0005615 "extracellular
            space" evidence=IEA] [GO:0005586 "collagen type III" evidence=IEA]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA] [GO:0001568 "blood vessel development" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0043588 GO:GO:0005615
            GO:GO:0007507 GO:GO:0034097 GO:GO:0030199 GO:GO:0007179
            GO:GO:0007229 GO:GO:0007160 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0042060 GO:GO:0001568 GO:GO:0048565 GO:GO:0050777
            GO:GO:0009314 GO:GO:0018149 GO:GO:0032964 GO:GO:0071230
            GO:GO:0043206 GO:GO:0005201 GeneTree:ENSGT00660000095287
            IPI:IPI00731432 OMA:EGSPGHP GO:GO:0005586 EMBL:DAAA02003919
            EMBL:DAAA02003920 Ensembl:ENSBTAT00000028617 ArrayExpress:F1MXS8
            Uniprot:F1MXS8
        Length = 1466

 Score = 142 (55.0 bits), Expect = 7.4e-06, P = 7.4e-06
 Identities = 82/262 (31%), Positives = 97/262 (37%)

Query:   253 SGRPVGQNAYEDGYGVPQ---GHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYD 309
             SG P G+       G P    G GPP      G  G  P    SA      G P      
Sbjct:   677 SGAP-GERGPPGAGGPPGPRGGAGPPGPEGGKGAAGP-PGPPGSAGTPGLQGMPGERGG- 733

Query:   310 IPRGPGYEASKG-PGYDA-SKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQ 367
              P GPG +  KG PG      AP  D  +GP+  P   PG  P   PG   + G+     
Sbjct:   734 -PGGPGPKGDKGEPGSSGVDGAPGKDGPRGPT-GPIGPPG--PAGQPGDKGESGA----- 784

Query:   368 RGPNYDIHRGPSYDP-QRG-LGYDMQRG-PNYDMQRG-PGYETQR-VPGYDVQRGPVYEA 422
               P      GP   P +RG  G     G P    Q G PG + +R  PG   + GP   A
Sbjct:   785 --PGVPGIAGPRGGPGERGEQGPPGPAGFPGAPGQNGEPGAKGERGAPGEKGEGGPPGAA 842

Query:   423 QRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPY 482
               A    P   PG    +G+    R +P      G G  G P G  P G  PP  N  P 
Sbjct:   843 GPAGGSGPAGPPGPQGVKGE----RGSPG-----GPGAAGFPGGRGPPG--PPGSNGNPG 891

Query:   483 --GSATPPARSGSGQPRGGNPA 502
               GS+  P + G   P G N A
Sbjct:   892 PPGSSGAPGKDGPPGPPGSNGA 913

 Score = 129 (50.5 bits), Expect = 0.00019, P = 0.00019
 Identities = 78/257 (30%), Positives = 104/257 (40%)

Query:   267 GVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMR--AAYDIPRGP----GYEASK 320
             G P   GPP    +    G   +    AY   +SG      A Y  P GP    G   + 
Sbjct:   130 GSPGSPGPPGICESCPTGGQNYSPQYEAYDV-KSGVAGGGIAGYPGPAGPPGPPGPPGTS 188

Query:   321 G-PGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP----GYDAQKGS-NYDAQRG-PNYD 373
             G PG    KA    P +  SY P   PG     GP    G D + G      +RG P   
Sbjct:   189 GHPGAPHLKAWQKPPQQSTSYSPIGPPGPPGAIGPSGPAGKDGESGRPGRPGERGFPGPP 248

Query:   374 IHRGPSYDP----QRG-LGYDMQRGPNYDMQRGPGYETQR-VPGYDVQRGPVYEAQRAPS 427
               +GP+  P     +G  G+D + G   +    PG + +  VPG +   GP+   + AP 
Sbjct:   249 GMKGPAGMPGFPGMKGHRGFDGRNGEKGETG-APGLKGENGVPGENGAPGPM-GPRGAPG 306

Query:   428 YIPQRG-PGYDLQRG----QGYDMRRAPSYDPSRGT-GFDGAPRGAAPHGQVPPPLNNVP 481
                + G PG    RG    +G D +  P   P  GT GF G+P GA   G+V P     P
Sbjct:   307 ERGRPGLPGAAGARGNDGARGSDGQPGPPGPP--GTAGFPGSP-GAK--GEVGPA--GSP 359

Query:   482 YGSATPPARSGSGQPRG 498
              GS+  P + G   P+G
Sbjct:   360 -GSSGAPGQRGEPGPQG 375


>UNIPROTKB|F1LRJ1
            symbol:Col4a3 "Protein Col4a3" species:10116 "Rattus
            norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
            RGD:71085 GO:GO:0006917 GO:GO:0008283 InterPro:IPR016187
            SUPFAM:SSF56436 GO:GO:0006919 GO:GO:0007166 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0016525 GO:GO:0005201 GO:GO:0005587
            Gene3D:2.170.240.10 GeneTree:ENSGT00690000101772 KO:K06237 CTD:1285
            GO:GO:0032836 IPI:IPI00367109 RefSeq:NP_001129231.1
            UniGene:Rn.121139 Ensembl:ENSRNOT00000020669 GeneID:363265
            KEGG:rno:363265 NextBio:683046 ArrayExpress:F1LRJ1 Uniprot:F1LRJ1
        Length = 1670

 Score = 142 (55.0 bits), Expect = 8.6e-06, P = 8.6e-06
 Identities = 94/293 (32%), Positives = 108/293 (36%)

Query:   234 RRAADGSYGGATGNSENETSGRPV--GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTS 291
             R+  DGS GG          G P   G+   +   G P   GPP  A  AG  G GP   
Sbjct:   564 RKGFDGSPGGPGAKGPRGPRGEPALSGRKGDQGPPGAPGSPGPPGPAGPAGPPGYGPQGE 623

Query:   292 TSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAK-GP-GYD 349
                  A   G P   A     GP  EA    G  ++  P   P  GP   P + GP G  
Sbjct:   624 PGPKGA--QGVP--GAL----GPPGEAGL-KGESSASIPVLGPP-GPPGPPGQAGPRGLP 673

Query:   350 PTKGPGYDAQKGS-NYDAQRG-PNYDIH--RGPSYDPQRGLGYDMQRG-PNYDMQRGPGY 404
                GP      G    D + G P       RGP  D     G+    G P Y     PG 
Sbjct:   674 GLPGPVGTCDPGHPGPDGEPGIPEVGFPGARGPKGDQ----GFPGTIGLPGY-----PG- 723

Query:   405 ETQRVPGYDVQRGPVYEAQRAPSY-IP-QRG-PGYDLQRGQGYDMRRA--PSYDPSRGT- 458
             ET R PGY  + G V  A+  PS   P + G PG+  +RG   +      P      GT 
Sbjct:   724 ETGR-PGYPGEMG-VPGAKGEPSVGRPGEPGKPGFPGERGNSGENGDIGLPGLPGPPGTP 781

Query:   459 ---GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQP--RG--GNPARR 504
                GFDG P    P GQ  PP    P G   P  R   G P   G  G P RR
Sbjct:   782 GKDGFDGPP--GDP-GQSGPPGAKGPPGRCIPGPRGTQGLPGLNGLKGQPGRR 831


>UNIPROTKB|J9P8F7
            symbol:COL5A1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0005201 GeneTree:ENSGT00700000104155 EMBL:AAEX03006798
            EMBL:AAEX03006799 EMBL:AAEX03006800 Ensembl:ENSCAFT00000044143
            Uniprot:J9P8F7
        Length = 1405

 Score = 141 (54.7 bits), Expect = 9.0e-06, P = 9.0e-06
 Identities = 77/254 (30%), Positives = 100/254 (39%)

Query:   256 PVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG-P 314
             PVG    +   G P   GP  S    G  GA            Q G P  A     +G P
Sbjct:   634 PVGALGLKGSEGPPGPPGPAGSPGERGPAGAAGPIGIPGRPGPQ-GPPGPAG---EKGAP 689

Query:   315 GYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRGPN 371
             G +  +GP G D  + P   P  GP+  P   PG D  KG  G   QKGS  D  ++GP 
Sbjct:   690 GEKGPQGPAGRDGLQGPVGLP--GPA-GPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGPP 746

Query:   372 YDIHRGPSYDPQRGLGYDMQRGPN-YDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIP 430
                  GP+  PQ  +G   Q GP+  D + GP  + Q + G     GP       P  + 
Sbjct:   747 -----GPT-GPQGPIG---QPGPSGADGEPGPRGQ-QGLFGQKGDEGP-RGFPGPPGPVG 795

Query:   431 QRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNV-PYGSATPP 488
              +G PG   ++G+  D+ +     P    G  GAP    P G  P  + N    G    P
Sbjct:   796 LQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGAPGADGPQGP-PGGIGNPGAVGEKGEP 854

Query:   489 ARSGS-GQPRGGNP 501
               +G  G P  G P
Sbjct:   855 GEAGEPGLPGEGGP 868


>UNIPROTKB|E1C0T1
            symbol:TFG "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0004871 "signal transducer activity" evidence=IEA]
            [GO:0042802 "identical protein binding" evidence=IEA] [GO:0043123
            "positive regulation of I-kappaB kinase/NF-kappaB cascade"
            evidence=IEA] InterPro:IPR000270 Pfam:PF00564 SMART:SM00666
            GO:GO:0043123 GO:GO:0004871 CTD:10342 KO:K09292 OMA:YTTQTSQ
            GeneTree:ENSGT00510000047809 EMBL:AADN02032793 IPI:IPI00599103
            RefSeq:XP_416608.1 UniGene:Gga.1550 PRIDE:E1C0T1
            Ensembl:ENSGALT00000024692 GeneID:418391 KEGG:gga:418391
            NextBio:20821576 Uniprot:E1C0T1
        Length = 395

 Score = 134 (52.2 bits), Expect = 9.0e-06, P = 9.0e-06
 Identities = 57/210 (27%), Positives = 81/210 (38%)

Query:   286 AGPNTSTSAYAATQSGTP--MRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPA 343
             AGP    SA A  +SGTP  + ++      PG +  + P Y  ++  +    +G  Y   
Sbjct:   194 AGP---PSAPAEERSGTPDSIASSSSAAHPPGVQPQQAP-YPGAQPQTGQQVEGQMYQQY 249

Query:   344 KGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPG 403
             + PGY P + P   AQ    Y  Q    Y   +  S   Q+   Y  Q  P      G G
Sbjct:   250 QQPGY-PAQQP--QAQPQQQYGVQYPAGYSPQQAASQPTQQFPAYSQQPAPAAAFP-GQG 305

Query:   404 YETQRVPGYDVQRGPV--YEAQ----RAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRG 457
              + Q++P    Q+ P   +  Q    +A    P  GP    Q   G    R P + P  G
Sbjct:   306 -QAQQLPAQQPQQYPAGSFPPQPYTTQASQPAPYSGPP-GAQAAPGTFQPR-PGFTPPPG 362

Query:   458 TGFDGAPRGAAPHGQVPPPLNNVPYGSATP 487
             +     P G  P+ +  PP    P G A P
Sbjct:   363 STMTPPPSGPNPYARTRPPFG--PQGYAQP 390

 Score = 133 (51.9 bits), Expect = 1.2e-05, P = 1.2e-05
 Identities = 54/197 (27%), Positives = 70/197 (35%)

Query:   311 PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGP 370
             P  P  E S  P   AS + +  P   P   P + P Y     PG   Q G   + Q   
Sbjct:   197 PSAPAEERSGTPDSIASSSSAAHP---PGVQPQQAP-Y-----PGAQPQTGQQVEGQM-- 245

Query:   371 NYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSY-I 429
              Y  ++ P Y  Q+      Q+   Y +Q   GY  Q+      Q+ P Y  Q AP+   
Sbjct:   246 -YQQYQQPGYPAQQPQAQPQQQ---YGVQYPAGYSPQQAASQPTQQFPAYSQQPAPAAAF 301

Query:   430 PQRGPGYDL--QRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATP 487
             P +G    L  Q+ Q Y     P   P        AP    P  Q  P       G   P
Sbjct:   302 PGQGQAQQLPAQQPQQYPAGSFPP-QPYTTQASQPAPYSGPPGAQAAPGTFQPRPGFTPP 360

Query:   488 PARSGSGQPRGGNPARR 504
             P  + +  P G NP  R
Sbjct:   361 PGSTMTPPPSGPNPYAR 377


>UNIPROTKB|F1LLX1
            symbol:Col11a1 "Collagen alpha-1(XI) chain" species:10116
            "Rattus norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 Gene3D:2.60.120.200 InterPro:IPR008985
            InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 InterPro:IPR001791 SMART:SM00282 Pfam:PF02210
            GO:GO:0005201 SMART:SM00210 GeneTree:ENSGT00700000104155
            OMA:HPGKEGQ IPI:IPI00949317 Ensembl:ENSRNOT00000024138
            ArrayExpress:F1LLX1 Uniprot:F1LLX1
        Length = 1803

 Score = 142 (55.0 bits), Expect = 9.3e-06, P = 9.3e-06
 Identities = 87/280 (31%), Positives = 107/280 (38%)

Query:   243 GATGNSENE-TSGR--PVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTST----SA 294
             GA G+   +  SG+  P G   +    G+P   G P      G  G  GP  S     SA
Sbjct:  1003 GAKGDPGPQGISGKDGPAGIRGFPGERGLPGAQGAPGLKGGEGPQGPQGPIGSPGERGSA 1062

Query:   295 YAATQSGTPMRAAYDIPRGP----GYEASKGP----GYDASKAPSYDPTKGPSYDPAKGP 346
               A   G P R     P GP    G    KGP    G D  + P   P  GP+  PA  P
Sbjct:  1063 GTAGPIGLPGRPGPQGPPGPAGEKGAPGEKGPQGPAGRDGVQGPVGLP--GPA-GPAGSP 1119

Query:   347 GYDPTKGP-GYDAQKGSNYD-AQRGPNYDIH-RGPSYDPQRGL-GYDMQRGPN--YDMQR 400
             G D  KG  G   QKGS  D  + GP      +GP   P  G+ G D + GP     M  
Sbjct:  1120 GEDGDKGEIGEPGQKGSKGDKGENGPPGPPGLQGPVGAP--GIAGGDGEAGPRGQQGMFG 1177

Query:   401 GPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDL-QRGQ-GYDMRRAPSYDPSRGT 458
               G E  R  G+    GP+   Q  P    ++G   D+   G  G    R P   P+   
Sbjct:  1178 QKGDEGAR--GFPGPPGPI-GLQGLPGPPGEKGENGDVGPMGPPGPPGPRGPQ-GPNGAD 1233

Query:   459 GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 498
             G  G P      G V         G+  PP  +GSG P+G
Sbjct:  1234 GPQGPPGSIGSVGGVGEKGEPGEAGNPGPPGEAGSGGPKG 1273


>RGD|2372
            symbol:Col11a1 "collagen, type XI, alpha 1" species:10116 "Rattus
          norvegicus" [GO:0001502 "cartilage condensation" evidence=ISO]
          [GO:0001503 "ossification" evidence=IEP] [GO:0002063 "chondrocyte
          development" evidence=ISO] [GO:0003007 "heart morphogenesis"
          evidence=ISO] [GO:0005201 "extracellular matrix structural
          constituent" evidence=TAS] [GO:0005581 "collagen" evidence=ISO]
          [GO:0005592 "collagen type XI" evidence=ISO] [GO:0006029
          "proteoglycan metabolic process" evidence=ISO] [GO:0007601 "visual
          perception" evidence=ISO] [GO:0007605 "sensory perception of sound"
          evidence=ISO] [GO:0030199 "collagen fibril organization"
          evidence=ISO;TAS] [GO:0031012 "extracellular matrix"
          evidence=ISO;IDA] [GO:0035989 "tendon development" evidence=ISO]
          [GO:0042472 "inner ear morphogenesis" evidence=ISO] [GO:0048704
          "embryonic skeletal system morphogenesis" evidence=ISO] [GO:0048705
          "skeletal system morphogenesis" evidence=ISO] [GO:0050910 "detection
          of mechanical stimulus involved in sensory perception of sound"
          evidence=ISO] [GO:0051216 "cartilage development" evidence=ISO]
          [GO:0055010 "ventricular cardiac muscle tissue morphogenesis"
          evidence=ISO] InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078
          PROSITE:PS51461 SMART:SM00038 RGD:2372 GO:GO:0046872 GO:GO:0007601
          GO:GO:0030199 Gene3D:2.60.120.200 InterPro:IPR008985
          InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0042472 GO:GO:0050910
          GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793
          InterPro:IPR001791 SMART:SM00282 PROSITE:PS50025 GO:GO:0001502
          GO:GO:0048704 GO:GO:0006029 GO:GO:0055010 Pfam:PF02210 GO:GO:0005201
          GO:GO:0002063 HOGENOM:HOG000085654 KO:K06236 HOVERGEN:HBG103137
          OrthoDB:EOG49GKHM SMART:SM00210 GeneTree:ENSGT00700000104155 CTD:1301
          EMBL:AABR03012126 EMBL:AABR03013126 EMBL:AABR03014171
          EMBL:AABR03015382 EMBL:AABR03015832 EMBL:AABR03016562
          EMBL:AABR03017847 EMBL:AABR03017951 EMBL:AABR03018245
          EMBL:AABR03019675 EMBL:AABR03023874 EMBL:U20116 EMBL:U20118
          EMBL:U20121 IPI:IPI00189470 IPI:IPI00189494 IPI:IPI00325589
          IPI:IPI00949317 IPI:IPI00959233 PIR:B31795 RefSeq:NP_037249.1
          UniGene:Rn.260 IntAct:P20909 STRING:P20909 PhosphoSite:P20909
          PRIDE:P20909 Ensembl:ENSRNOT00000023693 Ensembl:ENSRNOT00000068413
          GeneID:25654 KEGG:rno:25654 UCSC:RGD:2372 InParanoid:P20909
          NextBio:607535 ArrayExpress:P20909 Genevestigator:P20909
          GermOnline:ENSRNOG00000023148 Uniprot:P20909
        Length = 1804

 Score = 142 (55.0 bits), Expect = 9.3e-06, P = 9.3e-06
 Identities = 87/280 (31%), Positives = 107/280 (38%)

Query:   243 GATGNSENE-TSGR--PVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTST----SA 294
             GA G+   +  SG+  P G   +    G+P   G P      G  G  GP  S     SA
Sbjct:  1004 GAKGDPGPQGISGKDGPAGIRGFPGERGLPGAQGAPGLKGGEGPQGPQGPIGSPGERGSA 1063

Query:   295 YAATQSGTPMRAAYDIPRGP----GYEASKGP----GYDASKAPSYDPTKGPSYDPAKGP 346
               A   G P R     P GP    G    KGP    G D  + P   P  GP+  PA  P
Sbjct:  1064 GTAGPIGLPGRPGPQGPPGPAGEKGAPGEKGPQGPAGRDGVQGPVGLP--GPA-GPAGSP 1120

Query:   347 GYDPTKGP-GYDAQKGSNYD-AQRGPNYDIH-RGPSYDPQRGL-GYDMQRGPN--YDMQR 400
             G D  KG  G   QKGS  D  + GP      +GP   P  G+ G D + GP     M  
Sbjct:  1121 GEDGDKGEIGEPGQKGSKGDKGENGPPGPPGLQGPVGAP--GIAGGDGEAGPRGQQGMFG 1178

Query:   401 GPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDL-QRGQ-GYDMRRAPSYDPSRGT 458
               G E  R  G+    GP+   Q  P    ++G   D+   G  G    R P   P+   
Sbjct:  1179 QKGDEGAR--GFPGPPGPI-GLQGLPGPPGEKGENGDVGPMGPPGPPGPRGPQ-GPNGAD 1234

Query:   459 GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 498
             G  G P      G V         G+  PP  +GSG P+G
Sbjct:  1235 GPQGPPGSIGSVGGVGEKGEPGEAGNPGPPGEAGSGGPKG 1274


>UNIPROTKB|P20909
            symbol:Col11a1 "Collagen alpha-1(XI) chain" species:10116
            "Rattus norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            [GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000885
            Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461 SMART:SM00038 RGD:2372
            GO:GO:0046872 GO:GO:0007601 GO:GO:0030199 Gene3D:2.60.120.200
            InterPro:IPR008985 InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0042472
            GO:GO:0050910 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            eggNOG:NOG12793 InterPro:IPR001791 SMART:SM00282 PROSITE:PS50025
            GO:GO:0001502 GO:GO:0048704 GO:GO:0006029 GO:GO:0055010
            Pfam:PF02210 GO:GO:0005201 GO:GO:0002063 HOGENOM:HOG000085654
            KO:K06236 HOVERGEN:HBG103137 OrthoDB:EOG49GKHM SMART:SM00210
            GeneTree:ENSGT00700000104155 CTD:1301 EMBL:AABR03012126
            EMBL:AABR03013126 EMBL:AABR03014171 EMBL:AABR03015382
            EMBL:AABR03015832 EMBL:AABR03016562 EMBL:AABR03017847
            EMBL:AABR03017951 EMBL:AABR03018245 EMBL:AABR03019675
            EMBL:AABR03023874 EMBL:U20116 EMBL:U20118 EMBL:U20121
            IPI:IPI00189470 IPI:IPI00189494 IPI:IPI00325589 IPI:IPI00949317
            IPI:IPI00959233 PIR:B31795 RefSeq:NP_037249.1 UniGene:Rn.260
            IntAct:P20909 STRING:P20909 PhosphoSite:P20909 PRIDE:P20909
            Ensembl:ENSRNOT00000023693 Ensembl:ENSRNOT00000068413 GeneID:25654
            KEGG:rno:25654 UCSC:RGD:2372 InParanoid:P20909 NextBio:607535
            ArrayExpress:P20909 Genevestigator:P20909
            GermOnline:ENSRNOG00000023148 Uniprot:P20909
        Length = 1804

 Score = 142 (55.0 bits), Expect = 9.3e-06, P = 9.3e-06
 Identities = 87/280 (31%), Positives = 107/280 (38%)

Query:   243 GATGNSENE-TSGR--PVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTST----SA 294
             GA G+   +  SG+  P G   +    G+P   G P      G  G  GP  S     SA
Sbjct:  1004 GAKGDPGPQGISGKDGPAGIRGFPGERGLPGAQGAPGLKGGEGPQGPQGPIGSPGERGSA 1063

Query:   295 YAATQSGTPMRAAYDIPRGP----GYEASKGP----GYDASKAPSYDPTKGPSYDPAKGP 346
               A   G P R     P GP    G    KGP    G D  + P   P  GP+  PA  P
Sbjct:  1064 GTAGPIGLPGRPGPQGPPGPAGEKGAPGEKGPQGPAGRDGVQGPVGLP--GPA-GPAGSP 1120

Query:   347 GYDPTKGP-GYDAQKGSNYD-AQRGPNYDIH-RGPSYDPQRGL-GYDMQRGPN--YDMQR 400
             G D  KG  G   QKGS  D  + GP      +GP   P  G+ G D + GP     M  
Sbjct:  1121 GEDGDKGEIGEPGQKGSKGDKGENGPPGPPGLQGPVGAP--GIAGGDGEAGPRGQQGMFG 1178

Query:   401 GPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDL-QRGQ-GYDMRRAPSYDPSRGT 458
               G E  R  G+    GP+   Q  P    ++G   D+   G  G    R P   P+   
Sbjct:  1179 QKGDEGAR--GFPGPPGPI-GLQGLPGPPGEKGENGDVGPMGPPGPPGPRGPQ-GPNGAD 1234

Query:   459 GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 498
             G  G P      G V         G+  PP  +GSG P+G
Sbjct:  1235 GPQGPPGSIGSVGGVGEKGEPGEAGNPGPPGEAGSGGPKG 1274


>TAIR|locus:2077547
            symbol:AT3G07030 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005829
            "cytosol" evidence=IDA] InterPro:IPR002775 Pfam:PF01918
            GO:GO:0005829 EMBL:CP002686 GO:GO:0003676 IPI:IPI00519674
            RefSeq:NP_187359.2 UniGene:At.74527 ProteinModelPortal:F4JD88
            SMR:F4JD88 PRIDE:F4JD88 EnsemblPlants:AT3G07030.1 GeneID:3768790
            KEGG:ath:AT3G07030 OMA:ERRNDGY Uniprot:F4JD88
        Length = 405

 Score = 134 (52.2 bits), Expect = 9.4e-06, P = 9.4e-06
 Identities = 57/209 (27%), Positives = 72/209 (34%)

Query:   260 NAY-EDGYGVPQGHGPPP--SATTAGVVGAGPNTSTSAYAATQS-GTPMRA-AYDI-PRG 313
             NAY E+G  V +G         TT GV+      +      T   G   RA A D+    
Sbjct:   150 NAYGEEGEVVAEGEAGEEVDMETTKGVMKEKTKGTIKKIIKTMKVGIQTRAEAVDVVDEA 209

Query:   314 PGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYD 373
                   +G GY   +   Y   +   Y   +  GY   +   Y   +   Y   R   Y 
Sbjct:   210 MAIVGGRG-GYGGGRDGGYGGGRDDGYGERRNDGYGERRNDRYGGGRDDGYGGGRDDGYG 268

Query:   374 IHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG 433
               R   Y  +RG G+   RG   D   G G       G    +G  Y   R   Y   RG
Sbjct:   269 GGRNDGYGGRRG-GFRGGRGGGRDEGYGGG--RGGYGGRSGGQGDGYGGGRGDGYGGGRG 325

Query:   434 PGYDLQRGQGYDMRRAPSYDPSRGTGFDG 462
              GY   RG GY   R   YD  R  G+ G
Sbjct:   326 DGYGGGRGDGYGGGRVDRYDGGRRDGYGG 354

 Score = 125 (49.1 bits), Expect = 9.3e-05, P = 9.3e-05
 Identities = 50/158 (31%), Positives = 59/158 (37%)

Query:   312 RGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPN 371
             R  GY   +  GY   +   Y   +G  +   +G G D     GY   +G  Y  + G  
Sbjct:   255 RDDGYGGGRDDGYGGGRNDGYGGRRG-GFRGGRGGGRDE----GYGGGRGG-YGGRSGG- 307

Query:   372 YDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQ 431
                 +G  Y   RG GY   RG  Y   RG GY   RV  YD  R   Y   R   Y   
Sbjct:   308 ----QGDGYGGGRGDGYGGGRGDGYGGGRGDGYGGGRVDRYDGGRRDGYGGGRYDGYGGG 363

Query:   432 RGPGYDLQRGQGYDMRRAPSYDPSRGT-GFDGAPRGAA 468
             +  GY   RG GY   R   Y   RG  G  G  R  A
Sbjct:   364 KSDGYGGGRG-GYRGGRG-GYGRGRGRMGNGGRSRDGA 399


>TAIR|locus:2043530
            symbol:AT2G25970 "AT2G25970" species:3702 "Arabidopsis
            thaliana" [GO:0003676 "nucleic acid binding" evidence=ISS]
            [GO:0003723 "RNA binding" evidence=IEA] [GO:0005829 "cytosol"
            evidence=IDA] [GO:0006606 "protein import into nucleus"
            evidence=RCA] InterPro:IPR004087 InterPro:IPR004088 Pfam:PF00013
            PROSITE:PS50084 SMART:SM00322 GO:GO:0005829 EMBL:CP002685
            GenomeReviews:CT485783_GR GO:GO:0003723 EMBL:AC004747 EMBL:AC005395
            eggNOG:NOG300923 KO:K13210 HSSP:Q9UNW9 EMBL:AY078954 EMBL:AK226845
            IPI:IPI00540360 PIR:T02627 RefSeq:NP_180167.1 UniGene:At.21555
            ProteinModelPortal:O82762 SMR:O82762 STRING:O82762 PaxDb:O82762
            PRIDE:O82762 ProMEX:O82762 EnsemblPlants:AT2G25970.1 GeneID:817137
            KEGG:ath:AT2G25970 TAIR:At2g25970 HOGENOM:HOG000242545
            InParanoid:O82762 OMA:AANSTQD PhylomeDB:O82762
            ProtClustDB:CLSN2913011 ArrayExpress:O82762 Genevestigator:O82762
            Uniprot:O82762
        Length = 632

 Score = 139 (54.0 bits), Expect = 1.1e-05, Sum P(2) = 1.1e-05
 Identities = 75/275 (27%), Positives = 98/275 (35%)

Query:   239 GSYGGATGNSENETSGRPVGQNAYE---DGYGVPQGHGPPPSATTAGVVGAGPNTSTSAY 295
             GSY   T     + S  P  Q + +   D YG  Q   P    ++A      P T T+ Y
Sbjct:   363 GSYPQQTSAGYYDQSSVPPSQQSAQGEYDYYGQQQSQQPSSGGSSA------PPTDTTGY 416

Query:   296 AATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPG 355
                Q  +    A     G GY+      Y+AS+   Y    G  YD  +G GY  T  P 
Sbjct:   417 NYYQHASGYGQA-----GQGYQQDGYGAYNASQQSGYGQAAG--YDQ-QG-GYGSTTNPS 467

Query:   356 YD---AQKGSNYDAQRGP-NYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPG 411
              +   +Q      AQ G   Y    G     Q   G   Q G         GY +Q    
Sbjct:   468 QEEDASQAAPPSSAQSGQAGYGT-TGQQPPAQGSTG---QAGYGAPPTSQAGYSSQPAAA 523

Query:   412 YDVQRGPVYEAQRAPSY-IPQRGPGYDLQRGQ--GYDMRRAPSYDPSRGTGFDGAPRGAA 468
             Y+   G    A + P+Y   Q+ PG     G   GY    A  Y      G+  AP+G  
Sbjct:   524 YNSGYGAPPPASKPPTYGQSQQSPGAPGSYGSQSGYAQPAASGYGQPPAYGYGQAPQGYG 583

Query:   469 PHGQVPPPLNNVPYGS-ATPPARSGSGQPRGGNPA 502
              +G    P     Y S  +  A +G G   GG PA
Sbjct:   584 SYGGYTQPAAGGGYSSDGSAGATAGGG---GGTPA 615

 Score = 123 (48.4 bits), Expect = 0.00057, Sum P(2) = 0.00057
 Identities = 69/265 (26%), Positives = 89/265 (33%)

Query:   247 NSENETSGRPVGQN-AYEDGYGV-PQGHGPPPSATTAGVVGAGPNTSTSAYAAT-QSGTP 303
             + EN      +G     + GY   P     PP    A   G G      AY    Q G  
Sbjct:   302 SGENRMRNSAMGGGYPQQGGYQARPPSSWAPPGGPPAQP-GYGGYMQPGAYPGPPQYGQS 360

Query:   304 MRAAYDIPRGPGY-EASKGPGYDASKAPSYDPT-KGPSYDPAKG-PGYDPTKGPGYDA-Q 359
                +Y      GY + S  P    S    YD   +  S  P+ G     PT   GY+  Q
Sbjct:   361 PYGSYPQQTSAGYYDQSSVPPSQQSAQGEYDYYGQQQSQQPSSGGSSAPPTDTTGYNYYQ 420

Query:   360 KGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPV 419
               S Y  Q G  Y      +Y+  +  GY    G  YD Q G G  T   P  +      
Sbjct:   421 HASGY-GQAGQGYQQDGYGAYNASQQSGYGQAAG--YDQQGGYGSTTN--PSQEEDA--- 472

Query:   420 YEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNN 479
               +Q AP    Q G     Q G G   ++ P+   +   G+   P   A +   P    N
Sbjct:   473 --SQAAPPSSAQSG-----QAGYGTTGQQPPAQGSTGQAGYGAPPTSQAGYSSQPAAAYN 525

Query:   480 VPYGSATP---PARSGSGQPRGGNP 501
               YG+  P   P   G  Q   G P
Sbjct:   526 SGYGAPPPASKPPTYGQSQQSPGAP 550

 Score = 42 (19.8 bits), Expect = 1.1e-05, Sum P(2) = 1.1e-05
 Identities = 13/40 (32%), Positives = 20/40 (50%)

Query:    78 TLRQELAAAQHELQI--LHGQIGGMKSERELQMRNLTEKI 115
             T++   A     +Q+  LH   G    ER LQ+  +TE+I
Sbjct:   251 TIKSMQAKTGARIQVIPLHLPPGDPTPERTLQIDGITEQI 290


>UNIPROTKB|F1N7Q7
            symbol:COL4A2 "Collagen alpha-2(IV) chain" species:9913
            "Bos taurus" [GO:0071560 "cellular response to transforming growth
            factor beta stimulus" evidence=IEA] [GO:0016525 "negative
            regulation of angiogenesis" evidence=IEA] [GO:0006351
            "transcription, DNA-dependent" evidence=IEA] [GO:0005587 "collagen
            type IV" evidence=IEA] [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] InterPro:IPR001442 Pfam:PF01413
            PROSITE:PS51403 SMART:SM00111 GO:GO:0071560 InterPro:IPR016187
            SUPFAM:SSF56436 GO:GO:0006351 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0016525 GO:GO:0005201 GO:GO:0005587 Gene3D:2.170.240.10
            GeneTree:ENSGT00690000101772 EMBL:DAAA02034911 IPI:IPI00712524
            Ensembl:ENSBTAT00000005916 OMA:QETIQPG Uniprot:F1N7Q7
        Length = 1650

 Score = 141 (54.7 bits), Expect = 1.1e-05, P = 1.1e-05
 Identities = 75/251 (29%), Positives = 97/251 (38%)

Query:   226 LMNAPNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVV 284
             L   P +  R  D    GA G +  +    P G + +    G+P GH G        G  
Sbjct:    18 LQGFPGLQGRKGDKGQRGAPGITGPKGDVGPRGVSGFPGADGIP-GHPGQGGPRGPPGYD 76

Query:   285 GAGPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGPGYDASKAPSYDPTKGPSYDPA 343
             G       S YA    G P    +  PRGP G +  KG  Y A  +   D  +G   +P 
Sbjct:    77 GCNGTVGDSGYA----GPPGPGGFLGPRGPQGPKGQKGEPY-ALSSEDRDKYRGEPGEPG 131

Query:   344 KGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDP-QRGLGYDMQRGPNYDMQ-RG 401
                   P   PG   Q G    A   P      GP   P  RGLG+  ++G   DM  +G
Sbjct:   132 LVGLQGPPGRPGPVGQMGP-VGAPGRPGPPGPPGPKGQPGNRGLGFYGEKGEKGDMGLQG 190

Query:   402 PGYETQRVP---GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG-QGYDMRRAPSYDPSRG 457
             PG     +P   GY  +  PVYE       +P++  G   ++G QG   R   S     G
Sbjct:   191 PG----GIPPDNGYVEKPTPVYEL------LPEQYKG---EKGSQGEPGRIGVSLKGEEG 237

Query:   458 T-GFDGAPRGA 467
               GF G PRGA
Sbjct:   238 VVGFSG-PRGA 247


>UNIPROTKB|F1PHX8
            symbol:COL5A1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 Gene3D:2.60.120.200 InterPro:IPR008985
            InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 InterPro:IPR001791 Pfam:PF02210 GO:GO:0005201
            OMA:TIYEGIG SMART:SM00210 GeneTree:ENSGT00700000104155
            EMBL:AAEX03006798 EMBL:AAEX03006799 EMBL:AAEX03006800
            Ensembl:ENSCAFT00000031582 Uniprot:F1PHX8
        Length = 1814

 Score = 141 (54.7 bits), Expect = 1.2e-05, P = 1.2e-05
 Identities = 77/254 (30%), Positives = 100/254 (39%)

Query:   256 PVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG-P 314
             PVG    +   G P   GP  S    G  GA            Q G P  A     +G P
Sbjct:  1043 PVGALGLKGSEGPPGPPGPAGSPGERGPAGAAGPIGIPGRPGPQ-GPPGPAG---EKGAP 1098

Query:   315 GYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRGPN 371
             G +  +GP G D  + P   P  GP+  P   PG D  KG  G   QKGS  D  ++GP 
Sbjct:  1099 GEKGPQGPAGRDGLQGPVGLP--GPA-GPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGPP 1155

Query:   372 YDIHRGPSYDPQRGLGYDMQRGPN-YDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIP 430
                  GP+  PQ  +G   Q GP+  D + GP  + Q + G     GP       P  + 
Sbjct:  1156 -----GPT-GPQGPIG---QPGPSGADGEPGPRGQ-QGLFGQKGDEGP-RGFPGPPGPVG 1204

Query:   431 QRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNV-PYGSATPP 488
              +G PG   ++G+  D+ +     P    G  GAP    P G  P  + N    G    P
Sbjct:  1205 LQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGAPGADGPQGP-PGGIGNPGAVGEKGEP 1263

Query:   489 ARSGS-GQPRGGNP 501
               +G  G P  G P
Sbjct:  1264 GEAGEPGLPGEGGP 1277


>MGI|MGI:2157767
            symbol:Krtap21-1 "keratin associated protein 21-1"
            species:10090 "Mus musculus" [GO:0001942 "hair follicle
            development" evidence=IMP] [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0005882 "intermediate filament" evidence=IEA] [GO:0007165
            "signal transduction" evidence=IMP] [GO:0008283 "cell
            proliferation" evidence=IMP] [GO:0022405 "hair cycle process"
            evidence=IMP] [GO:0031077 "post-embryonic camera-type eye
            development" evidence=IMP] [GO:0042640 "anagen" evidence=IMP]
            [GO:0043480 "pigment accumulation in tissues" evidence=IMP]
            [GO:0043588 "skin development" evidence=IMP] [GO:0048589
            "developmental growth" evidence=IMP] [GO:0051726 "regulation of
            cell cycle" evidence=IMP] MGI:MGI:2157767 GO:GO:0007165
            GO:GO:0043588 GO:GO:0008283 GO:GO:0005882 GO:GO:0051726
            GO:GO:0042640 GO:GO:0031077 EMBL:AF345297 EMBL:AK003736
            IPI:IPI00126890 UniGene:Mm.46109 HSSP:P10969 Genevestigator:Q925H4
            GO:GO:0043480 Uniprot:Q925H4
        Length = 128

 Score = 111 (44.1 bits), Expect = 1.3e-05, P = 1.3e-05
 Identities = 32/103 (31%), Positives = 32/103 (31%)

Query:   301 GTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQK 360
             G   R  Y    G GY    G GY       Y    G  Y    G GY    G GY    
Sbjct:    14 GYGSRYGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGY 73

Query:   361 GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPG 403
             GS Y    G  Y    G  Y    G GY    G  Y    G G
Sbjct:    74 GSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSRYGCGYGSG 116

 Score = 103 (41.3 bits), Expect = 9.3e-05, P = 9.3e-05
 Identities = 31/98 (31%), Positives = 33/98 (33%)

Query:   315 GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDI 374
             GY    G GY       Y    G  Y    G GY    G GY    GS Y    G  Y  
Sbjct:    20 GYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGC 79

Query:   375 HRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGY 412
               G  Y    G GY    G  Y    G GY ++   GY
Sbjct:    80 GYGSGY----GCGYGSGYGCGYGSGYGCGYGSRYGCGY 113


>ZFIN|ZDB-GENE-030131-5726
            symbol:eif3s10 "eukaryotic translation initiation
            factor 3, subunit 10 (theta)" species:7955 "Danio rerio"
            [GO:0001732 "formation of translation initiation complex"
            evidence=ISS] [GO:0005852 "eukaryotic translation initiation factor
            3 complex" evidence=ISS] [GO:0003743 "translation initiation factor
            activity" evidence=IEA;ISS] [GO:0006413 "translational initiation"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0006412
            "translation" evidence=IEA] InterPro:IPR000717 Pfam:PF01399
            SMART:SM00088 ZFIN:ZDB-GENE-030131-5726 GO:GO:0003743 GO:GO:0005852
            HOGENOM:HOG000246822 KO:K03254 HAMAP:MF_03000
            GeneTree:ENSGT00690000102108 EMBL:BC059196 EMBL:BC066670
            IPI:IPI00489212 RefSeq:NP_956114.2 UniGene:Dr.132282
            ProteinModelPortal:Q6PCR7 STRING:Q6PCR7 PRIDE:Q6PCR7
            Ensembl:ENSDART00000111462 GeneID:327515 KEGG:dre:327515 CTD:327515
            eggNOG:NOG123880 HOVERGEN:HBG006128 InParanoid:Q6PCR7
            NextBio:20810067 Bgee:Q6PCR7 GO:GO:0001732 Uniprot:Q6PCR7
        Length = 1267

 Score = 139 (54.0 bits), Expect = 1.3e-05, P = 1.3e-05
 Identities = 109/437 (24%), Positives = 177/437 (40%)

Query:    58 QHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKS-ERELQMRNLTEKIA 116
             + + + K A E QR+         EL   Q E +I + ++   K+ E + +M  + E   
Sbjct:   705 EEIPLIKKAYEEQRIKD------MELWELQEEERITNMKMEREKALEHKQRMSRMMEDKE 758

Query:   117 KMEAELKTAEPVKLEFQKSKTEAQNLVVAREE-LIAKVHQLTQDLQRA--HTDVQQIPAL 173
                +++K A     E +K K   + LV  R++ L  +  Q  +D ++A  H   ++   +
Sbjct:   759 NFLSKIKAARSFIYE-EKLKQFQERLVEERKKRLEERKKQRKEDRRKAFYHQKEEEAQRI 817

Query:   174 LSE-LESLRQEYHHCRGTY-EYEKKFYNDHLESLQVMEKNYITMATEVE---KLRAELMN 228
               E L+  R+E         E E++ Y + L  L+  E+       E+E   + + E   
Sbjct:   818 REEQLKKEREERERLEQEQREEEEREYQERLRKLEEQERKQRARQQEIEERERRKEEERR 877

Query:   229 APNV--DRRAADGSYGGATGNSENETSGR-PVGQNAY-EDGYGVPQGHGPPPSATTAGVV 284
             AP    ++  A+    G     E E+  R PV    + ++G    +G   P         
Sbjct:   878 APEEKPNKEWAEREESGWRKRGEGESEWRRPVPDRDWRQEGR---EGREEPDREDRDLPF 934

Query:   285 GAGPNTSTSAYAATQSGTPMRAAYDIPRGP--GYEASKGP--GYDASKAP--SYDPTKGP 338
               G  ++    A+ + G  +R   D  RGP  G +  + P  G+D  +     +D  +G 
Sbjct:   935 RRGGESARRG-ASDEKG--LRRGCDDDRGPRRGGDDERPPRRGFDDDRGTRRGFDDDRGQ 991

Query:   339 SY-DPAKGP--GYDPTKGPG--YDAQKGSNY-DAQRGPN--YDIHRGPSYDPQRGLGYDM 390
                D  +GP  G D  +GP    D  +G    D  RGP   +D  RGP    +RG+  D 
Sbjct:   992 RRGDDDRGPRRGMDDDRGPRRPIDDDRGPRRSDDDRGPRRGFDDDRGP----RRGM--DE 1045

Query:   391 QRGPNY--DMQRGP--GYETQRVP--GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGY 444
              RGP    D   GP  G + +R    G D   GP       P + P   PG    R +  
Sbjct:  1046 PRGPRRGADDDWGPRRGGDDERGGRRGMD-DSGPRRGEDSRP-WKPLGRPGAGGWRER-- 1101

Query:   445 DMRRAPSYDPSRGTGFD 461
             +  R  S+ P R +G D
Sbjct:  1102 EKAREESWGPPRDSGHD 1118

 Score = 97 (39.2 bits), Expect = 0.00095, Sum P(2) = 0.00095
 Identities = 57/193 (29%), Positives = 78/193 (40%)

Query:   303 PMRAAYDIPRGP--GYEASKGPGY-DASKAP--SYDPTKGPS--YDPAKGPGY-DPTKGP 354
             P R  +D  RG   G++  +G    D  + P    D  +GP    D  +GP   D  +GP
Sbjct:   970 PPRRGFDDDRGTRRGFDDDRGQRRGDDDRGPRRGMDDDRGPRRPIDDDRGPRRSDDDRGP 1029

Query:   355 --GYDAQKGSN--YDAQRGPNYDIHRGPSYD--PQRGLGYDMQRGPNYDMQ-RGPGYETQ 407
               G+D  +G     D  RGP     RG   D  P+RG G D +RG    M   GP     
Sbjct:  1030 RRGFDDDRGPRRGMDEPRGPR----RGADDDWGPRRG-GDD-ERGGRRGMDDSGPRRGED 1083

Query:   408 RVPGYDVQR---GPVYEAQRA--PSYIPQRGPGYDLQRGQ--GYDMRRAPSYDPSRGTGF 460
               P   + R   G   E ++A   S+ P R  G+D   G+  G D R    +   R    
Sbjct:  1084 SRPWKPLGRPGAGGWREREKAREESWGPPRDSGHDDDGGERDGDDQREGERFRERRSARE 1143

Query:   461 DGAP--RGAAPHG 471
             +G+   RG    G
Sbjct:  1144 EGSAWRRGGGGGG 1156

 Score = 74 (31.1 bits), Expect = 0.00095, Sum P(2) = 0.00095
 Identities = 35/164 (21%), Positives = 74/164 (45%)

Query:    68 ENQRLAATHGTLRQELAAAQHELQILHGQIGGMKSER--ELQMRNLTEKIAKMEAEL--- 122
             E QR  A    L+   A  +H+  +   Q    + ER   L ++   E++ + EAEL   
Sbjct:   547 EEQRQQAITAYLKN--ARKEHQRILARRQTIEERKERLESLNIQREKEELEQREAELQKV 604

Query:   123 KTAEPVKLEFQKSKTEAQNLVVAREELIAK-VHQLTQDLQRAHTDVQQIPAL-LSELESL 180
             + AE  +L  +  + E + ++   E++  K V +  + +++     +    + + +LE L
Sbjct:   605 RKAEEERLRQEAKEREKERIMQEHEQIKKKTVRERLEQIKKTELGAKAFKDIDIEDLEEL 664

Query:   181 RQEYHHCRGTYEYEKKFYNDHLESLQVMEK--NYITMATEVEKL 222
               ++   +   + EK+   +  E L+  EK  +Y   A  +E++
Sbjct:   665 DPDFIMAKQVEQLEKE-KKELQERLKNQEKKIDYFERAKRLEEI 707


>UNIPROTKB|F1N474
            symbol:COL4A5 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0031594 "neuromuscular junction" evidence=IEA]
            [GO:0007528 "neuromuscular junction development" evidence=IEA]
            [GO:0005605 "basal lamina" evidence=IEA] [GO:0005587 "collagen type
            IV" evidence=IEA] [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] InterPro:IPR001442 Pfam:PF01413
            PROSITE:PS51403 SMART:SM00111 GO:GO:0007528 InterPro:IPR016187
            SUPFAM:SSF56436 GO:GO:0031594 GO:GO:0005605 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 GO:GO:0005587 Gene3D:2.170.240.10
            GeneTree:ENSGT00690000101772 EMBL:DAAA02071513 EMBL:DAAA02071512
            IPI:IPI00729819 Ensembl:ENSBTAT00000019400 OMA:MPMNMEP
            Uniprot:F1N474
        Length = 1688

 Score = 140 (54.3 bits), Expect = 1.4e-05, P = 1.4e-05
 Identities = 62/203 (30%), Positives = 76/203 (37%)

Query:   311 PRGPGYEASKGP--GYDASKAPSYDPTK-G-PSYDPAKG-PGYDPTKG-PGYDAQKGSNY 364
             P  PG     GP  G    K    +P K G P  D   G PG     G PGY  + G   
Sbjct:   266 PGPPGIRGPPGPPGGVKGEKGEQGEPGKRGKPGKDGENGQPGIPGLPGDPGYPGEPGR-- 323

Query:   365 DAQRGPNYDIHR-GPS--YDPQRGLGYDMQRGPNYDMQRGPGYETQR-VPGYDVQRGPVY 420
             D ++G   D    GP     P+ G G  +    N  +   PG +  R  PG  +Q  P  
Sbjct:   324 DGEKGQKGDTGLPGPPGLVIPRPGTGVTVGEKGNIGLPGLPGDKGDRGFPG--IQGPPGL 381

Query:   421 EAQRAPSYI-PQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNN 479
                  P+ I P   PG+  +RGQ  D    P        G DG P    P G   PP  +
Sbjct:   382 PGPPGPAVIGPPGPPGFPGERGQKGD-EGPPGISIPGSPGLDGQPGAPGPPGPPGPPGPH 440

Query:   480 VPYGS----ATPPARSGSGQPRG 498
             +P       A PP   GS   RG
Sbjct:   441 IPPSDKICEAGPPGPPGSPGDRG 463


>UNIPROTKB|F1RYI8
            symbol:COL3A1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0071230 "cellular response to amino acid stimulus"
            evidence=IEA] [GO:0050777 "negative regulation of immune response"
            evidence=IEA] [GO:0048565 "digestive tract development"
            evidence=IEA] [GO:0048407 "platelet-derived growth factor binding"
            evidence=IEA] [GO:0046332 "SMAD binding" evidence=IEA] [GO:0043588
            "skin development" evidence=IEA] [GO:0043206 "extracellular fibril
            organization" evidence=IEA] [GO:0042060 "wound healing"
            evidence=IEA] [GO:0034097 "response to cytokine stimulus"
            evidence=IEA] [GO:0032964 "collagen biosynthetic process"
            evidence=IEA] [GO:0030199 "collagen fibril organization"
            evidence=IEA] [GO:0018149 "peptide cross-linking" evidence=IEA]
            [GO:0009314 "response to radiation" evidence=IEA] [GO:0007507
            "heart development" evidence=IEA] [GO:0007229 "integrin-mediated
            signaling pathway" evidence=IEA] [GO:0007179 "transforming growth
            factor beta receptor signaling pathway" evidence=IEA] [GO:0007160
            "cell-matrix adhesion" evidence=IEA] [GO:0005615 "extracellular
            space" evidence=IEA] [GO:0005586 "collagen type III" evidence=IEA]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA] [GO:0001568 "blood vessel development" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0043588 GO:GO:0005615
            GO:GO:0007507 GO:GO:0034097 GO:GO:0030199 GO:GO:0007179
            GO:GO:0007229 GO:GO:0007160 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0042060 GO:GO:0001568 GO:GO:0048565 GO:GO:0050777
            GO:GO:0009314 GO:GO:0018149 GO:GO:0032964 GO:GO:0071230
            GO:GO:0043206 GO:GO:0005201 GeneTree:ENSGT00660000095287 KO:K06236
            CTD:1281 OMA:EGSPGHP GO:GO:0005586 EMBL:CU467671
            RefSeq:NP_001230226.1 UniGene:Ssc.24309 UniGene:Ssc.97562
            Ensembl:ENSSSCT00000017459 GeneID:100152001 KEGG:ssc:100152001
            Uniprot:F1RYI8
        Length = 1466

 Score = 139 (54.0 bits), Expect = 1.6e-05, P = 1.6e-05
 Identities = 86/292 (29%), Positives = 107/292 (36%)

Query:   231 NVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP- 288
             +V    A G  GG  G +       P G + +    G P   GPP     AG  G  GP 
Sbjct:   160 DVKAGVAGGGIGGYPGPAGPPGPPGPPGVSGHPGAPGSPGYQGPPGEPGQAGPAGPPGPP 219

Query:   289 ---NTSTSAYAATQSGTPMRAAY-DIPRGPGYEASKG-PGYDASKAP-SYDPTKGPSYDP 342
                  S  A    +SG P R     +P  PG +   G PG+   K    +D   G   D 
Sbjct:   220 GAIGPSGPAGKDGESGRPGRPGERGLPGPPGLKGPAGMPGFPGMKGHRGFDGRNGEKGDT 279

Query:   343 AKGPGYDPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQR 400
                PG     G PG +   G      RG   +  R P      G  G D  RG   D Q 
Sbjct:   280 G-APGLKGENGLPGENGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQP 333

Query:   401 GP-GYE-TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGT 458
             GP G   T   PG    +G V  A  +P   P   PG   QRG+      A +  P    
Sbjct:   334 GPPGPPGTAGFPGSPGAKGEVGPAG-SPG--PSGSPG---QRGEPGPQGHAGAAGPPGPP 387

Query:   459 GFDGAPRGAAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RG--GNPAR 503
             G +G+P G    G  P  +   P   G+  PP   G+ G P  RG  G P +
Sbjct:   388 GSNGSPGGKGEMG--PAGIPGAPGLMGARGPPGPPGTNGAPGQRGAAGEPGK 437


>UNIPROTKB|K7EKB2
            symbol:TAF15 "TATA-binding protein-associated factor 2N"
            species:9606 "Homo sapiens" [GO:0005622 "intracellular"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            InterPro:IPR001876 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50199
            SMART:SM00547 EMBL:AC015849 HGNC:HGNC:11547 Ensembl:ENST00000585577
            Uniprot:K7EKB2
        Length = 214

 Score = 125 (49.1 bits), Expect = 1.7e-05, P = 1.7e-05
 Identities = 48/140 (34%), Positives = 52/140 (37%)

Query:   315 GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK-GPGYDAQK-GSNYDAQRGPNY 372
             GY    G G D           G   D + G GY   + G GY   + G  Y   RG  Y
Sbjct:    69 GYRGRGGRGGDRGGYGGDRSGGGYGGDRSSGGGYSGDRSGGGYGGDRSGGGYGGDRGGGY 128

Query:   373 DIHRGPSYDPQRGLGY--DMQRGPNYDMQRG--PGYETQRVPGYDVQR-GPVYEAQRAPS 427
                RG  Y   RG GY  D  RG  Y   RG   GY   R  GY   R G  Y   R   
Sbjct:   129 GGDRGGGYGGDRGGGYGGDRSRG-GYGGDRGGGSGYGGDRSGGYGGDRSGGGYGGDRGGG 187

Query:   428 YIPQRGPGYDLQRGQGYDMR 447
             Y   RG GY  + G   D R
Sbjct:   188 YGGDRG-GYGGKMGGRNDYR 206

 Score = 123 (48.4 bits), Expect = 3.1e-05, P = 3.1e-05
 Identities = 50/174 (28%), Positives = 64/174 (36%)

Query:   228 NAPNV-DRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA 286
             N P   D R + G + G     E    GR  G+     GYG  +  G      ++G  G 
Sbjct:    45 NEPRPEDSRPSGGDFRGRGYGGERGYRGRG-GRGGDRGGYGGDRSGGGYGGDRSSGG-GY 102

Query:   287 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSY--DPTKGPSYDPAK 344
               + S   Y   +SG      Y   RG GY   +G GY   +   Y  D ++G       
Sbjct:   103 SGDRSGGGYGGDRSG----GGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRSRG------- 151

Query:   345 GPGYDPTKGPGYDAQKGSNYDAQR-GPNYDIHRGPSYDPQRGLGYDMQRGPNYD 397
             G G D   G GY   +   Y   R G  Y   RG  Y   RG GY  + G   D
Sbjct:   152 GYGGDRGGGSGYGGDRSGGYGGDRSGGGYGGDRGGGYGGDRG-GYGGKMGGRND 204


>WB|WBGene00000628
            symbol:col-51 species:6239 "Caenorhabditis elegans"
            [GO:0042302 "structural constituent of cuticle" evidence=IEA]
            [GO:0016021 "integral to membrane" evidence=IEA] InterPro:IPR002486
            Pfam:PF01484 SMART:SM01088 GO:GO:0042302 HOGENOM:HOG000085656
            GeneTree:ENSGT00530000064217 EMBL:FO080999 RefSeq:NP_491195.1
            UniGene:Cel.29694 ProteinModelPortal:Q7Z152 MINT:MINT-3384184
            STRING:Q7Z152 EnsemblMetazoa:T28F2.8 GeneID:189052
            KEGG:cel:CELE_T28F2.8 UCSC:T28F2.8 CTD:189052 WormBase:T28F2.8
            eggNOG:NOG245561 InParanoid:Q7Z152 OMA:MMASRRI NextBio:941036
            Uniprot:Q7Z152
        Length = 435

 Score = 132 (51.5 bits), Expect = 1.8e-05, P = 1.8e-05
 Identities = 90/299 (30%), Positives = 102/299 (34%)

Query:   220 EKLRAE-LMNAPNVDRRAADGSYGGATGNSENETSGRPVGQ-NAYEDGYGVPQGH-GPPP 276
             EK+  E L  A      AA G  G A G       G   G  +      G P G  GPP 
Sbjct:    84 EKVAFEGLFRAKRQYATAAGGGGGYAAGGGGGGGGGGGGGGCHCAAQASGCPAGPPGPPG 143

Query:   277 SATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRGP----GYEASKGP-GYDASKAP 330
              A T G  G AG +          SG+  +A    P GP    G + + GP G      P
Sbjct:   144 EAGTDGEPGQAGQDGQPGQAGQADSGSSGQACITCPAGPPGPPGPDGNAGPAGAPGVPGP 203

Query:   331 SYD----PTKGPSYDPAKGPGYDPTKG-PGYDAQKGS----NYDAQRGPNYDIHRGPSYD 381
               D    P  GP   P   PG D   G PG D Q G+      ++  GP      GP   
Sbjct:   204 DGDAGSPPPPGPPGPPGP-PGNDGQPGAPGQDGQPGAPGTNTVNSPGGPGPAGPPGPPGP 262

Query:   382 P-QRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQR 440
             P Q G G   Q GP       PG      PG D Q G        P   P  GPG D   
Sbjct:   263 PGQDGSGGAAQPGP-------PG--PPGPPGNDGQPG-------GPGQ-PG-GPGQD--G 302

Query:   441 GQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 499
             G G D    P   P R       P G    G  P       Y +     R+ SG   GG
Sbjct:   303 GPGTDAAYCPC--PPR------TPAGGGGGGDFPAGGGGGGYSTGGGGGRADSGGAAGG 353


>WB|WBGene00000251
            symbol:bli-1 species:6239 "Caenorhabditis elegans"
            [GO:0042302 "structural constituent of cuticle" evidence=IEA]
            [GO:0016021 "integral to membrane" evidence=IEA] [GO:0009792
            "embryo development ending in birth or egg hatching" evidence=IMP]
            [GO:0000003 "reproduction" evidence=IMP] [GO:0040002 "collagen and
            cuticulin-based cuticle development" evidence=IMP] [GO:0040011
            "locomotion" evidence=IMP] [GO:0002119 "nematode larval
            development" evidence=IMP] [GO:0018996 "molting cycle, collagen and
            cuticulin-based cuticle" evidence=IMP] [GO:0005578 "proteinaceous
            extracellular matrix" evidence=ISS] [GO:0042329 "structural
            constituent of collagen and cuticulin-based cuticle" evidence=ISS]
            InterPro:IPR002486 InterPro:IPR012613 Pfam:PF01484 Pfam:PF08175
            SMART:SM01088 GO:GO:0009792 GO:GO:0002119 GO:GO:0018996
            GO:GO:0005578 GO:GO:0040011 GO:GO:0000003 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793 GO:GO:0040002
            EMBL:Z46791 PIR:T19140 RefSeq:NP_496311.2 ProteinModelPortal:Q09457
            STRING:Q09457 PaxDb:Q09457 EnsemblMetazoa:C09G5.6 GeneID:174653
            KEGG:cel:CELE_C09G5.6 UCSC:C09G5.6 CTD:174653 WormBase:C09G5.6
            GeneTree:ENSGT00690000102663 HOGENOM:HOG000016778 InParanoid:Q09457
            OMA:WEEHRKS NextBio:884926 GO:GO:0042601 GO:GO:0042329
            GO:GO:0030436 Uniprot:Q09457
        Length = 948

 Score = 136 (52.9 bits), Expect = 2.0e-05, P = 2.0e-05
 Identities = 89/338 (26%), Positives = 120/338 (35%)

Query:   197 FYNDHLESLQVMEK--NYITMATEVEKLRAELMNAPNVDRRAA----DGSYGGATGNSEN 250
             FY++  E L   +   N I      E    E+  A + DR       +G Y   T     
Sbjct:    36 FYSEAQEELVEFKDIANNIWEEMVFELTPEEMREAEDNDREKRSYEPEGPYQSETTTPST 95

Query:   251 ETSGRPVGQNAYED--GYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAY 308
              TS       A ED  GY     +GPP S          P T     A   + T   + Y
Sbjct:    96 TTSTAATTTEAAEDESGYDFVNDNGPPSSRPRKPEPPTMPRTIQGFRAPPPAAT---STY 152

Query:   309 DIPRGPGYEASKGPGYDASKAPSYDPTKGPSYD-PAKGPG-----YDPTKGP--GYDAQK 360
               P G  Y+ + G    +S+ P Y P + PS   P   P      Y+P   P  GY    
Sbjct:   153 RPPHGSNYD-NYGREPASSRRP-YPPQQPPSTSAPHSSPNNRTSLYNPQPPPKTGYPTNP 210

Query:   361 GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP--GYET--QRVPG----Y 412
                Y+  + PNY   R P+Y       Y   R PN    R P  GY++  Q  P     Y
Sbjct:   211 RVPYNPPQ-PNYT--RQPTYPEDNRAPYKPTRSPNTPPPRQPSGGYDSDGQTPPSSPRIY 267

Query:   413 DVQR----GPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAA 468
             + +R    GP Y   + P+  P   PG   QR      R  P+   +R       P    
Sbjct:   268 NTRRPNNHGPGYPEDQVPTAPPV--PGQ--QRVPPTQTRNPPNPTNTRQPSRPVPPTSDG 323

Query:   469 PHGQVPPPLN-NVPYGSATPPARSGSG--QPRGGNPAR 503
              H +   P N +  Y +    +  G G  +PR G   R
Sbjct:   324 -HIEATTPYNPSAQYPTGKRGSHPGFGPQRPRPGTRPR 360

 Score = 131 (51.2 bits), Expect = 6.9e-05, P = 6.9e-05
 Identities = 76/266 (28%), Positives = 102/266 (38%)

Query:   256 PVGQNAYEDGYGVPQGHG----PPPSATTAGVVGAGPNTSTSAY---AATQSGTPM--RA 306
             P G N Y D YG          PP    +     + PN  TS Y      ++G P   R 
Sbjct:   155 PHGSN-Y-DNYGREPASSRRPYPPQQPPSTSAPHSSPNNRTSLYNPQPPPKTGYPTNPRV 212

Query:   307 AYDIPRGPGYEASKGPGY-DASKAPSYDPTKGPSYDPAKGP--GYD-----PTKGPG-YD 357
              Y+ P+ P Y  ++ P Y + ++AP Y PT+ P+  P + P  GYD     P   P  Y+
Sbjct:   213 PYNPPQ-PNY--TRQPTYPEDNRAP-YKPTRSPNTPPPRQPSGGYDSDGQTPPSSPRIYN 268

Query:   358 AQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRG 417
              ++ +N+    GP Y   + P+  P  G     QR P    +  P     R P   V   
Sbjct:   269 TRRPNNH----GPGYPEDQVPTAPPVPG----QQRVPPTQTRNPPNPTNTRQPSRPVPPT 320

Query:   418 PVYEAQRAPSYIPQRGPGYDL-QRGQ--GYDMRRA-PSYDPSRGTGFDGAPRGAAP-HGQ 472
                  +    Y P     Y   +RG   G+  +R  P   P RG   D     A P H  
Sbjct:   321 SDGHIEATTPYNPSAQ--YPTGKRGSHPGFGPQRPRPGTRP-RGNPCDQC--SAQPNHCP 375

Query:   473 VPPPLNNVPYGSATPPARSGSGQPRG 498
               PP    P G   PP   G   PRG
Sbjct:   376 SGPP---GPRGRPGPPGFPGQDGPRG 398

 Score = 128 (50.1 bits), Expect = 0.00015, P = 0.00015
 Identities = 75/261 (28%), Positives = 95/261 (36%)

Query:   235 RAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS 293
             R  DG+  G  G    +      GQ+      G P  HG   S  T G  G  G N  + 
Sbjct:   431 RGPDGT-PGVPGEDGIDGEQGVNGQDGQPGAPGAPGYHGMNGSPGTPGKPGLPGRNGQSC 489

Query:   294 AYAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKA----PSYDPTKGPSYDPA-KGPG 347
                    G P      +P   G   + G  G D S      P  D T GP   P    PG
Sbjct:   490 KSIPGPPGQP--GVMGVPGRDGDPGTDGEHGQDGSPGIQGPPGRDGTSGPDGQPGVSAPG 547

Query:   348 YDPTKGPGYDA--QKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE 405
                T G GY    ++ S +D     N D  RG   +  R  GYD +R      +  P  +
Sbjct:   548 APGTDG-GYCPCPKRSSKFDFNDAYNDDEKRG--LEEHRPRGYDSERAE----EPRPR-Q 599

Query:   406 TQRVPGYDVQRGPVYEAQRAPSY------IPQRGPGY-DLQRGQGYDMRRAPSYDPSRGT 458
             T R   YD   G   E QR P+Y       P R   Y D +R +    +R P   P R T
Sbjct:   600 TVRTNTYDENSGA--EHQRRPNYEPSAEVAPPRQDRYEDEERVREPPPKRPPP--PHRQT 655

Query:   459 GFDGAPRGAAPHGQVPPPLNN 479
               +  P    P+ + PPP  N
Sbjct:   656 PHELYPE-EQPYVRRPPPPQN 675

 Score = 122 (48.0 bits), Expect = 0.00065, P = 0.00065
 Identities = 71/243 (29%), Positives = 88/243 (36%)

Query:   274 PPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYD 333
             P P  +   +    P   ++ Y   + G+        PR PG      P    S  P++ 
Sbjct:   316 PVPPTSDGHIEATTPYNPSAQYPTGKRGSHPGFGPQRPR-PGTRPRGNPCDQCSAQPNHC 374

Query:   334 PTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPN--YDIHRGPSYDPQRG-----L 386
             P+ GP   P   PG  P   PG D  +G      RG N  Y   +  SYDP  G     +
Sbjct:   375 PS-GPP-GPRGRPG--PPGFPGQDGPRGL-----RGLNGGYSGVQPSSYDPVIGCVQCPI 425

Query:   387 GYDMQRGPNYDMQRG-PGYE----TQRVPGYDVQRG----PVYEAQRAPSYIPQRGPGYD 437
             G   +RGP  D   G PG +     Q V G D Q G    P Y         P + PG  
Sbjct:   426 GPPGERGP--DGTPGVPGEDGIDGEQGVNGQDGQPGAPGAPGYHGMNGSPGTPGK-PGLP 482

Query:   438 LQRGQGYDMRRAPSYDPS-RGT-GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQ 495
              + GQ       P   P   G  G DG P     HGQ   P      G   PP R G+  
Sbjct:   483 GRNGQSCKSIPGPPGQPGVMGVPGRDGDPGTDGEHGQDGSP------GIQGPPGRDGTSG 536

Query:   496 PRG 498
             P G
Sbjct:   537 PDG 539


>UNIPROTKB|J9P0L0
            symbol:COL3A1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 GeneTree:ENSGT00660000095287 KO:K06236
            CTD:1281 EMBL:AAEX03017880 RefSeq:XP_851009.1
            Ensembl:ENSCAFT00000047312 GeneID:478835 KEGG:cfa:478835
            Uniprot:J9P0L0
        Length = 1465

 Score = 138 (53.6 bits), Expect = 2.0e-05, P = 2.0e-05
 Identities = 85/297 (28%), Positives = 107/297 (36%)

Query:   226 LMNAPNVDRRAADGSYGGATG-NSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVV 284
             L   P       +    G  G   E+ + G P G+    D  G P   GPP +A   G  
Sbjct:   640 LQGLPGTSGPPGENGKPGEPGPKGESGSPGVPGGKG---DS-GAPGERGPPGAAGPMGPR 695

Query:   285 G-AGPNTSTSAYAAT-------QSGTP----MRAAYDIPRGPGYEASKG-PGY-DASKAP 330
             G AGP        A         +GTP    M      P GPG +  KG PG   A  AP
Sbjct:   696 GGAGPPGPEGGKGAAGPPGPPGSAGTPGLQGMPGERGGPGGPGPKGDKGEPGSAGADGAP 755

Query:   331 SYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGS-NYDAQRGPNYDIHRGPSYDPQRGLGYD 389
               D  +GP+  P   PG  P   PG   + G+       GP         + P    G+ 
Sbjct:   756 GKDGPRGPT-GPIGPPG--PAGQPGDKGEGGAPGLPGIAGPRGGPGERGEHGPPGPAGFP 812

Query:   390 MQRGPNYDMQRGPGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR 448
                G N +    PG + +R  PG   + GP   A       P   PG    +G+    R 
Sbjct:   813 GAPGQNGE----PGAKGERGAPGEKGEGGPPGVAGPPGGAGPAGPPGPQGVKGE----RG 864

Query:   449 APSYDPSRGTGFDGAPRGAAPHGQVPPPLNNV---PYGSATPPARSGSGQPRGGNPA 502
             +P      G G  G P G    G   PP NN    P GS+  P + G   P G N A
Sbjct:   865 SPG-----GPGAAGFPGGRGLPG---PPGNNGNPGPPGSSGAPGKDGPPGPPGNNGA 913

 Score = 133 (51.9 bits), Expect = 7.0e-05, P = 7.0e-05
 Identities = 84/286 (29%), Positives = 103/286 (36%)

Query:   231 NVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP- 288
             +V    A G  GG  G +       P G + +    G P   GPP     AG  G  GP 
Sbjct:   159 DVKAGVAGGGIGGYPGPAGPPGPPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPAGPPGPP 218

Query:   289 ---NTSTSAYAATQSGTPMRAAY-DIPRGPGYEASKG-PGYDASKAP-SYDPTKGPSYDP 342
                  S  A    +SG P R     +P  PG +   G PG+   K    +D   G   D 
Sbjct:   219 GAMGPSGPAGKDGESGRPGRPGERGLPGPPGMKGPAGMPGFPGMKGHRGFDGRNGEKGDT 278

Query:   343 AKGPGYDPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQR 400
                PG     G PG +   G      RG   +  R P      G  G D  RG   D Q 
Sbjct:   279 G-APGLKGENGLPGENGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQP 332

Query:   401 GP-GYE-TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGT 458
             GP G   T   PG    +G V  A    S      PG   QRG+      A +  P    
Sbjct:   333 GPPGPPGTAGFPGSPGAKGEVGPAGSPGS---NGSPG---QRGEPGPQGHAGAPGPPGPP 386

Query:   459 GFDGAPRGAAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RGG 499
             G +G+P G    G  P  +   P   G+  PP   G+ G P  RGG
Sbjct:   387 GSNGSPGGKGEMG--PAGIPGAPGLIGARGPPGPPGTNGAPGQRGG 430

 Score = 123 (48.4 bits), Expect = 0.00084, P = 0.00084
 Identities = 78/261 (29%), Positives = 98/261 (37%)

Query:   258 GQNAYEDGYGVPQGHGPPPSATTAGVVGA----GPNTSTSAYAAT-QSGTPMRAAY-DIP 311
             G +      G P   GPP +A   G  GA    GP  S  +  +  Q G P    +   P
Sbjct:   321 GNDGARGSDGQPGPPGPPGTAGFPGSPGAKGEVGPAGSPGSNGSPGQRGEPGPQGHAGAP 380

Query:   312 RGPGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDP-TKG-PGYDAQKGS-NYDAQ 367
               PG   S G PG      P+  P   P    A+GP   P T G PG     G    +  
Sbjct:   381 GPPGPPGSNGSPGGKGEMGPAGIPG-APGLIGARGPPGPPGTNGAPGQRGGAGEPGKNGA 439

Query:   368 RG-PNYDIHRGPSYDPQ-RG-LGYDMQRG-PNYDMQRG-PGYETQR-VPGYDVQRGPV-- 419
             +G P     RG +  P   G  G D + G P      G PG   +R  PG+   RGP   
Sbjct:   440 KGEPGPRGERGEAGSPGIPGPKGEDGKDGSPGEPGANGLPGAAGERGAPGF---RGPAGA 496

Query:   420 --YEAQRAPSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPP 476
                  ++ P+   + GPG    RG  G   R      P    G  G+P G    G+  PP
Sbjct:   497 NGLPGEKGPAG-ERGGPGPAGPRGAPGEPGRDGVPGGPGM-RGMPGSPGGPGSDGKPGPP 554

Query:   477 LNNVPYGSATPPARSGS-GQP 496
              +    G   PP  SG  GQP
Sbjct:   555 GSQGESGRPGPPGPSGPRGQP 575


>UNIPROTKB|F1NI73
            symbol:COL3A1 "Collagen alpha-1(III) chain" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 GeneTree:ENSGT00660000095287
            EMBL:AADN02034558 EMBL:AADN02034559 IPI:IPI01017330
            Ensembl:ENSGALT00000004032 ArrayExpress:F1NI73 Uniprot:F1NI73
        Length = 1260

 Score = 137 (53.3 bits), Expect = 2.2e-05, P = 2.2e-05
 Identities = 83/280 (29%), Positives = 109/280 (38%)

Query:   243 GATGN-SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP-NTSTSAYAATQ 299
             GA G   +N   G P G+       G+P  +G P     AG  G+ GP   S  A    Q
Sbjct:   465 GANGEPGQNGVPGTP-GERGSPGFRGLPGSNGLPGEKGPAGERGSPGPPGPSGPAGDRGQ 523

Query:   300 SGTP----MRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP- 354
              G P    MR    IP  PG +   GP  +  + P      GP+  P   PG     GP 
Sbjct:   524 DGGPGLPGMRGLPGIPGSPGSDGKPGPPGNQGE-PGRSGPPGPA-GPRGQPGVMGFPGPK 581

Query:   355 GYDAQKGSNYDAQRGPNYDIHR-GPS-YDPQRGL-GYDMQRGPNYDM-QRGPGYET--QR 408
             G +   G N   +RGP       GP+  +   GL G     GP  D  + GP      Q 
Sbjct:   582 GNEGAPGKN--GERGPGGPPGTPGPAGKNGDVGLPGPPGPAGPAGDRGEPGPSGSPGLQG 639

Query:   409 VPGYDVQRGPVYEAQRAPSYIPQR---GPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAP 464
             +PG     GP  E  +     P+    GPG+   +G+ G    R  +  P   TG  G P
Sbjct:   640 LPGGP---GPAGENGKPGEPGPKGDIGGPGFPGPKGENGIPGERG-AQGPPGPTGARGGP 695

Query:   465 RGAAPHG-QVPPPLNNVPYGSATPPARSGSGQPRG--GNP 501
               A   G + PP     P G+  P  +   G+ RG  G+P
Sbjct:   696 GPAGSEGAKGPPGPPGAPGGTGLPGLQGMPGE-RGASGSP 734

 Score = 123 (48.4 bits), Expect = 0.00071, P = 0.00071
 Identities = 84/275 (30%), Positives = 104/275 (37%)

Query:   253 SGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 312
             +G P G        G+P   G P      G+ G  P TS +  A    G P +       
Sbjct:   386 AGSP-GNKGEMGPSGIPGAPGLPGGR---GLPGP-PGTSGNPGAKGTPGEPGKNGAKGDP 440

Query:   313 GP-GYEASKG-PGYDASKAPSYDPTKGPSYDPAKG--PGYDPTKG-PGYDAQKGSN-YDA 366
             GP G     G PG  A   P  +  +G + +P +   PG    +G PG+    GSN    
Sbjct:   441 GPKGERGENGTPG--APGPPGEEGKRGANGEPGQNGVPGTPGERGSPGFRGLPGSNGLPG 498

Query:   367 QRGPNYDIHR----GPSYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVPGYDVQRGPVYE 421
             ++GP  +       GPS  P    G D   GP     RG PG      PG D + GP   
Sbjct:   499 EKGPAGERGSPGPPGPS-GPAGDRGQD--GGPGLPGMRGLPGIPGS--PGSDGKPGPPGN 553

Query:   422 AQRAPSYIPQRGP-GYDLQRG-QGYDMRR----APSYDPSRGTGFD-GAPRGAAPHGQV- 473
              Q  P      GP G   Q G  G+   +    AP  +  RG G   G P  A  +G V 
Sbjct:   554 -QGEPGRSGPPGPAGPRGQPGVMGFPGPKGNEGAPGKNGERGPGGPPGTPGPAGKNGDVG 612

Query:   474 -P-PPLNNVPYGSATPPARSGS----GQPRGGNPA 502
              P PP    P G    P  SGS    G P G  PA
Sbjct:   613 LPGPPGPAGPAGDRGEPGPSGSPGLQGLPGGPGPA 647

 Score = 122 (48.0 bits), Expect = 0.00091, P = 0.00091
 Identities = 80/269 (29%), Positives = 105/269 (39%)

Query:   256 PVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAA-YDIPRG 313
             P G N Y+   G P   GP      AG++G AGP          + G P R     IP  
Sbjct:   190 PPGSNGYQGPPGEPGQPGPSGPPGPAGMIGPAGPPGKDG-----EPGRPGRNGDRGIPGL 244

Query:   314 PGYEASKG-PGYDASK-APSYDPTKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGP 370
             PG++   G PG    K A  +D   G   D    PG     G PG +   G      RGP
Sbjct:   245 PGHKGHPGMPGMPGMKGARGFDGKDGAKGDSG-APGPKGEAGQPGANGSPGQ--PGPRGP 301

Query:   371 NYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP-------GYETQRVPGYDVQRGPVYEAQ 423
               +  RG   +P   + Y         + +GP       G+     PG+  + GP   A 
Sbjct:   302 TGE--RGRPGNPGGPVTYRCDIVVFLSLFKGPPGPPGTAGFPGS--PGFKGEAGPPGPAG 357

Query:   424 RAPSYIP-QRG-PGYDLQRG----QGYDMRR-APSYDPSRG-TGFDGAPRGAAPHGQ-VP 474
              + S  P +RG PG   Q G    QG   R  +P      G +G  GAP    P G+ +P
Sbjct:   358 ASGS--PGERGEPGPQGQAGPPGPQGPPGRAGSPGNKGEMGPSGIPGAP--GLPGGRGLP 413

Query:   475 PPLNNVPYGSATPPARSGSGQPRGGNPAR 503
              P    P  S  P A+   G+P G N A+
Sbjct:   414 GP----PGTSGNPGAKGTPGEP-GKNGAK 437


>UNIPROTKB|F1RXW0
            symbol:COL5A2 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0071230 "cellular response to amino acid stimulus"
            evidence=IEA] [GO:0048592 "eye morphogenesis" evidence=IEA]
            [GO:0046332 "SMAD binding" evidence=IEA] [GO:0043588 "skin
            development" evidence=IEA] [GO:0030199 "collagen fibril
            organization" evidence=IEA] [GO:0005588 "collagen type V"
            evidence=IEA] [GO:0001501 "skeletal system development"
            evidence=IEA] [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] InterPro:IPR000885 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS51461 SMART:SM00038 GO:GO:0043588
            GO:GO:0030199 GO:GO:0001501 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0071230 GO:GO:0005201 GO:GO:0048592
            GeneTree:ENSGT00660000095287 GO:GO:0005588 OMA:PDHKPVW
            EMBL:CU467671 Ensembl:ENSSSCT00000017460 ArrayExpress:F1RXW0
            Uniprot:F1RXW0
        Length = 1269

 Score = 137 (53.3 bits), Expect = 2.2e-05, P = 2.2e-05
 Identities = 87/295 (29%), Positives = 109/295 (36%)

Query:   232 VDRRAADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGP 288
             +  + A+G+ G  GA G         P G    E G   P+G  GPP S    G  G   
Sbjct:   552 IGEKGAEGTAGNDGARGLPGPLGPPGPAGPTG-EKGEPGPRGLVGPPGSRGNPGSRGENG 610

Query:   289 NTSTSAYAATQS--GTP-MRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAK 344
              T    +A  Q   G P ++     P   G   S GP G   S  P + P   P     +
Sbjct:   611 PTGAVGFAGPQGPDGQPGVKGEPGEPGQKGDAGSPGPQGLAGSPGP-HGPNGVPGLKGGR 669

Query:   345 GPGYDP--TKGPGYDAQKGS-NYDAQRGPNYDIHRGPSYDPQRGLGYDM-------QRGP 394
             G    P  T  PG   + G        GP   +   P  +   GL  D         RGP
Sbjct:   670 GTQGPPGATGFPGSAGRVGPPGPTGAPGPAGPLGE-PGKEGPPGLRGDPGSHGRVGDRGP 728

Query:   395 NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPS 451
                   GPG +    PG D Q GP  +    P+    QRG  G   QRG+ G      P+
Sbjct:   729 A-GPPGGPGDKGD--PGEDGQPGP--DGPPGPAGTTGQRGIVGMPGQRGERGMPGLPGPA 783

Query:   452 YDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP-RGGNPARR 504
               P +  G  GAP    P G V PP +N P G   P   +G+ G P R G    R
Sbjct:   784 GTPGK-VGPTGAPGDKGPPGPVGPPGSNGPVGEPGPEGPAGNDGTPGRDGAVGER 837


>UNIPROTKB|Q28009
            symbol:FUS "RNA-binding protein FUS" species:9913 "Bos
            taurus" [GO:0045944 "positive regulation of transcription from RNA
            polymerase II promoter" evidence=ISS] [GO:0045893 "positive
            regulation of transcription, DNA-dependent" evidence=ISS]
            [GO:0005737 "cytoplasm" evidence=ISS] [GO:0005634 "nucleus"
            evidence=ISS] [GO:0003723 "RNA binding" evidence=IEA] [GO:0003677
            "DNA binding" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0000166 "nucleotide binding" evidence=IEA]
            InterPro:IPR000504 InterPro:IPR001876 InterPro:IPR012677
            Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102
            PROSITE:PS50199 SMART:SM00360 SMART:SM00547 GO:GO:0005634
            GO:GO:0005737 GO:GO:0000166 GO:GO:0046872 GO:GO:0003677
            GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0045944 GO:GO:0003723
            eggNOG:NOG240581 GeneTree:ENSGT00530000063105 KO:K13098
            HOGENOM:HOG000038010 CTD:2521 EMBL:U26024 EMBL:BC119965
            IPI:IPI00705463 RefSeq:NP_776337.1 UniGene:Bt.2474
            ProteinModelPortal:Q28009 STRING:Q28009 PRIDE:Q28009
            Ensembl:ENSBTAT00000007571 GeneID:280796 KEGG:bta:280796
            InParanoid:Q28009 OrthoDB:EOG4DV5NH NextBio:20804952 Uniprot:Q28009
        Length = 513

 Score = 132 (51.5 bits), Expect = 2.3e-05, P = 2.3e-05
 Identities = 67/237 (28%), Positives = 93/237 (39%)

Query:   239 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAAT 298
             G+Y    G   ++ S +P GQ +Y  GYG          ++ +G  G   NT  S  +A 
Sbjct:    15 GAYPTQPGQGYSQQSNQPYGQQSY-GGYGQSTDTSGYGQSSYSGSYGQTQNTGYSTQSAP 73

Query:   299 QSGTPMRAAYDIPRGP--GY-EASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPG 355
             Q G      Y   +     Y + S  PGY    APS   T G     ++  GY   +G G
Sbjct:    74 Q-GYSSAGGYGSSQSSQSSYGQQSSYPGYGQQPAPS--GTSGSYGSSSQSSGYGQPQGGG 130

Query:   356 YDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPG--YETQRVPGYD 413
             Y  Q G  Y  Q+  +Y   +  SY+P +G G   Q   +     G G  Y   +     
Sbjct:   131 YGQQSG--YGGQQ-QSYGQQQ--SYNPPQGYGQQSQYNSSGGGGGGGGGSYGQDQPSMSS 185

Query:   414 VQRGPVYEAQ-RAPSY---IPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 466
                G  Y  Q ++  Y      RG G     G GY+ R +  Y+P RG G     RG
Sbjct:   186 GGGGGGYGNQDQSGGYGGGQQDRG-GRGRGGGGGYN-RSSGGYEP-RGRGGGRGGRG 239


>UNIPROTKB|F1RFI8
            symbol:EWSR1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005634 "nucleus" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0000166 "nucleotide binding" evidence=IEA]
            InterPro:IPR000504 InterPro:IPR001876 InterPro:IPR012677
            Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102
            PROSITE:PS50199 SMART:SM00360 SMART:SM00547 GO:GO:0005634
            GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
            GeneTree:ENSGT00530000063105 OMA:EGTSTGY EMBL:CU640468
            EMBL:CT737304 Ensembl:ENSSSCT00000010930 Uniprot:F1RFI8
        Length = 606

 Score = 121 (47.7 bits), Expect = 2.5e-05, Sum P(2) = 2.5e-05
 Identities = 54/178 (30%), Positives = 75/178 (42%)

Query:   239 GSYGGATGNSENETSGRPV-GQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 291
             G+YG  T  S  +       GQ AY   YG P  G+  P  P A +  V G G    +T+
Sbjct:    43 GTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYSTPTAPQAYSQPVQGYGTGAYDTT 102

Query:   292 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAP-SYDPTKGPSYDPAKGPGYD 349
             T+    TQ+    ++AY   P  P Y   + P   A+ AP SY  T+  SYD +     +
Sbjct:   103 TATVTTTQASYAAQSAYGTQPAYPAY--GQQP---AATAPASYSSTQPTSYDQSSYSQQN 157

Query:   350 PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQ 407
                 P    Q+ S+Y  Q   +Y      SY PQ G  Y   + P+   Q+   Y  Q
Sbjct:   158 TYGQPSSYGQQ-SSYGQQS--SYGQQPPTSYPPQTG-SYS--QAPSQYSQQSSSYGQQ 209

 Score = 57 (25.1 bits), Expect = 2.5e-05, Sum P(2) = 2.5e-05
 Identities = 19/46 (41%), Positives = 21/46 (45%)

Query:   465 RGAAP--HGQ-VPPPLNNVPYGSATP--PA--RSGSGQPRGGNPAR 503
             RG  P   G+ +PPPL   P G   P  P     G G  RGG P R
Sbjct:   404 RGGMPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPR 449

 Score = 49 (22.3 bits), Expect = 0.00016, Sum P(2) = 0.00016
 Identities = 25/86 (29%), Positives = 33/86 (38%)

Query:   422 AQRAPSYIPQRGPGYDLQRGQGYD--MRRAPSYDPSRGTGFDGAPRG-----AAPHGQVP 474
             A++ P     RG G   + G+G    +R  P      G G  G P G         G  P
Sbjct:   394 ARKKPPMNSMRG-GMPPREGRGMPPPLRGGPG-----GPGGPGGPMGRMGGRGGDRGGFP 447

Query:   475 PPLNNVPYGSATPPARSGSGQPRGGN 500
             P     P GS   P+  G+ Q R G+
Sbjct:   448 P---RGPRGSRGNPSGGGNVQHRAGD 470


>UNIPROTKB|E2RS29
            symbol:E2RS29 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0000166 "nucleotide binding" evidence=IEA] InterPro:IPR000504
            InterPro:IPR012677 Pfam:PF00076 PROSITE:PS50102 SMART:SM00360
            GO:GO:0000166 Gene3D:3.30.70.330 GO:GO:0003676
            GeneTree:ENSGT00530000063105 EMBL:AAEX03026460
            Ensembl:ENSCAFT00000019701 Uniprot:E2RS29
        Length = 538

 Score = 132 (51.5 bits), Expect = 2.5e-05, P = 2.5e-05
 Identities = 76/285 (26%), Positives = 103/285 (36%)

Query:   239 GSYGGATGNSENETSGRPV-GQNAYEDGYGVPQ-GHGPP--PSATTAGVVG--AGP-NTS 291
             G+YG  T  S  +       GQ AY   YG P  G+  P  P A +  V G   G  +T+
Sbjct:    42 GTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPAGYTTPTAPQAYSQPVQGYSTGAYDTT 101

Query:   292 TSAYAATQSGTPMRAAYDI-PRGPGY---EASKGPG--YDASK-APSYDP--TKGPSYDP 342
             T+    TQ+    ++AY   P  P Y    A+  P    D +K A +  P  + G    P
Sbjct:   102 TATVTTTQASYEAQSAYGTQPAYPAYGQQPAATAPARPQDGNKPAETSQPQSSTGGYNQP 161

Query:   343 AKGPG---YDPTKGPG-YDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDM 398
             + G G   Y   + PG Y  Q  +   +    +Y   +  SYD Q   G     G     
Sbjct:   162 SLGYGQSNYSYPQVPGSYPMQPVTAPPSYPPTSYSSTQPTSYDQQNTYGQPSSYGQQSSY 221

Query:   399 QRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGT 458
              +   Y  Q    Y  Q G  Y   +APS   Q+   Y  Q     D  R+         
Sbjct:   222 GQQSSYGQQLPTSYPPQTGS-YS--QAPSQYSQQSSSYGQQSSFQQDHPRSMGVYGQESG 278

Query:   459 GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 503
             GF       +  G   P      +      +R G G  RGG  AR
Sbjct:   279 GFSRPGENRSMSGPDNPGRGRGGFDRGDM-SRGGRGGGRGGMGAR 322


>UNIPROTKB|P0CG41
            symbol:CTAGE8 "Cutaneous T-cell lymphoma-associated antigen
            8" species:9606 "Homo sapiens" [GO:0016021 "integral to membrane"
            evidence=IEA] GO:GO:0016021 HPA:HPA000387 HPA:HPA000922
            EMBL:AC004889 UniGene:Hs.661442 IPI:IPI00969223
            ProteinModelPortal:P0CG41 PhosphoSite:P0CG41 DMDM:300680906
            PRIDE:P0CG41 Ensembl:ENST00000487179 GeneCards:GC07M143963
            HGNC:HGNC:37294 neXtProt:NX_P0CG41 OMA:LERELMV ArrayExpress:P0CG41
            Bgee:P0CG41 Uniprot:P0CG41
        Length = 777

 Score = 134 (52.2 bits), Expect = 2.5e-05, P = 2.5e-05
 Identities = 107/459 (23%), Positives = 179/459 (38%)

Query:    56 ASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKSER---ELQMRNLT 112
             A  +V ++ L  E   +      + +        ++ L  Q   ++SE    E + + L 
Sbjct:   322 AKLNVSLKSLEGERNHIIIQLSEVDKTKEELTEHIKNLQTQQASLQSENIYFESENQKLQ 381

Query:   113 EKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQQIPA 172
             +K+ K+  E      +KL ++K   E +N  +  EE +++V +    + RA   ++    
Sbjct:   382 QKL-KIMTEFYQENEMKL-YRKLTVE-ENYRIEEEEKLSRVEE---KISRATEGLETYRK 435

Query:   173 LLSELES-LRQEYH-HCRGTYEYEKKFYNDHLESLQVMEKNYITMATE----VEKL-RAE 225
             L  +LE  L +  H + +    YEK+ +++ L + +  E+N   +  E     +KL   E
Sbjct:   436 LAKDLEEELERTVHFYQKQVISYEKRGHDNWLAA-RTAERNLSDLRKENAHNKQKLTETE 494

Query:   226 L-MNAPNVDRRAADGSYGGATGNSENETSGRPVGQNAYED-GYGVPQGHGPPPSATTAGV 283
             L       D  A D S   A G   +  S  P+G+ + E   +  PQ     P   +  +
Sbjct:   495 LKFELLEKDPNALDVS-NTAFGREHSPCSPSPLGRPSSETRAFPSPQTLLEDPLRLSPVL 553

Query:   284 VGAGPNTSTSAYAATQSGTPMRAAYDIPRG-PGYEA----SKGPGYDASKAPSYDPTKGP 338
              G G    +S       G P+       RG P Y+      + P    S +   +  +  
Sbjct:   554 PGGGGRGPSSP------GNPLDHQITNERGEPSYDRLIDPHRAPSDTGSLSSPVEQDRRM 607

Query:   339 SYDPAKGPGY-DPTKGPGYDAQKGSNYDAQRGPNYD-IHRGPSYDPQ-RGLGYDMQRGPN 395
              + P  G  Y D T  P  + +  SN +   GP      +  S D   R +  +M+   N
Sbjct:   608 MFPPP-GQSYPDSTLPPQREDRFYSNSERLSGPAEPRSFKMTSLDKMDRSMPSEMESSRN 666

Query:   396 YDMQRGPG---YETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSY 452
              D +   G        +P  +   GP +     P   P RGP + +   +G  MRR P +
Sbjct:   667 -DAKDDLGNLNVPDSSLPAENEATGPGFIP---PPLAPVRGPLFPVDT-RGPFMRRGPPF 721

Query:   453 DPSR-GTGFDGAPRGAAPHGQVPPPLNNVPYGSAT--PP 488
              P   GT F GA RG  P    P P  + P+      PP
Sbjct:   722 PPPPPGTMF-GASRGYFPPRDFPGP-PHAPFAMRNIYPP 758


>ZFIN|ZDB-GENE-040426-1010
            symbol:fus "fusion (involved in t(12;16) in
            malignant liposarcoma)" species:7955 "Danio rerio" [GO:0000166
            "nucleotide binding" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
            [GO:0005622 "intracellular" evidence=IEA] InterPro:IPR000504
            InterPro:IPR001876 InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641
            PROSITE:PS01358 PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360
            SMART:SM00547 ZFIN:ZDB-GENE-040426-1010 GO:GO:0000166 GO:GO:0008270
            Gene3D:3.30.70.330 GO:GO:0003676 GO:GO:0005622
            GeneTree:ENSGT00530000063105 KO:K13098 CTD:2521 EMBL:BX571714
            IPI:IPI00785727 RefSeq:NP_957377.2 UniGene:Dr.114403
            Ensembl:ENSDART00000055340 GeneID:394058 KEGG:dre:394058
            NextBio:20815017 Bgee:F1R0M4 Uniprot:F1R0M4
        Length = 541

 Score = 132 (51.5 bits), Expect = 2.5e-05, P = 2.5e-05
 Identities = 63/241 (26%), Positives = 88/241 (36%)

Query:   240 SYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQ 299
             SYGG   N  +E+S  P  Q  Y   YG  Q  G    A + G   +  + S+  Y+ T 
Sbjct:    37 SYGGY--NQSSESSSAPYNQGGYSSNYGQSQSGGYGSQAPSQGYSQSSQSYSSGGYSNTS 94

Query:   300 SGTPMRAAYDIPRGPGY-EASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK-GPGYD 357
                P ++        GY + S   GY+ S +P+  P    S   + G G    + G GY 
Sbjct:    95 QPPPAQSG-------GYSQQSSYSGYNQS-SPASAPGGYSSSSQSSGYGQQQQQSGGGYG 146

Query:   358 AQKGSN--YDAQRGPNYDIH-RGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRV---PG 411
                G +  Y +  G +      G  +   +  G      PNY       Y  Q      G
Sbjct:   147 GSGGQSGGYGSSGGQSSGFGGSGGQHQSSQSGGGSYSPSPNYSSPPPQSYGQQSQYGQGG 206

Query:   412 YDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHG 471
             Y+    P+        Y  Q G GY  Q G+G    R   +      GFD   RG  P G
Sbjct:   207 YNQDSPPMSGGGGGGGYGGQDG-GYS-QDGRG-GRGRGGGFGGRGAGGFDRGGRGG-PRG 262

Query:   472 Q 472
             +
Sbjct:   263 R 263


>ZFIN|ZDB-GENE-070912-607
            symbol:col11a1b "collagen, type XI, alpha 1b"
            species:7955 "Danio rerio" [GO:0005201 "extracellular matrix
            structural constituent" evidence=IEA] [GO:0005581 "collagen"
            evidence=IEA] InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078
            PROSITE:PS51461 SMART:SM00038 ZFIN:ZDB-GENE-070912-607
            Gene3D:2.60.120.200 InterPro:IPR008985 InterPro:IPR013320
            SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            InterPro:IPR001791 SMART:SM00282 Pfam:PF02210 GO:GO:0005201
            HOGENOM:HOG000085654 SMART:SM00210 GeneTree:ENSGT00700000104155
            UniGene:Dr.3536 EMBL:BX510342 EMBL:BX547933 EMBL:CT583637
            EMBL:GQ485665 IPI:IPI00511026 RefSeq:NP_001171883.1
            UniGene:Dr.42128 Ensembl:ENSDART00000049589 GeneID:555202
            KEGG:dre:555202 CTD:555202 NextBio:20880850 Uniprot:D6MUD3
        Length = 1815

 Score = 138 (53.6 bits), Expect = 2.5e-05, P = 2.5e-05
 Identities = 71/250 (28%), Positives = 100/250 (40%)

Query:   267 GVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDA 326
             G P  HG P      G  G          + T    P RA  +  +GP   A +     A
Sbjct:   469 GSPGLHGDPGERGPPGRPGLPGGDGAPGPSGTILMLPFRAGGESSKGPVVSAQEAQA-QA 527

Query:   327 SKAPSYDPTKGPSYDPAKGPGYD-PTKGPGYDAQKGSNYDA-QRGPNYDIHRGPSYDP-- 382
               A +    +GP   P    G   P  GPG    KG + D+  +GP     +GP+  P  
Sbjct:   528 ILAQARLTMRGPP-GPMGLTGRSGPVGGPGAPGAKGESGDSGPQGPRG--LQGPTGSPGK 584

Query:   383 --QRGL-GYDMQRG-PNYDMQRGP-GYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYD 437
               +RG  G D  RG P     +G  G++   +PG   ++G  +  ++ P  +P   PG D
Sbjct:   585 PGKRGRNGADGARGIPGESGAKGDRGFDG--LPGLPGEKG--HRGEQGPIGLPG-SPGED 639

Query:   438 LQRGQGYDM--RRAPSYDPSRGT-GFDGAPRGAAPHGQV----PP-PLNNV-PYGSATPP 488
               RG+  ++  R  P     RG  G  G+P  A   G      PP P  N+ P G   PP
Sbjct:   640 GPRGEDGEIGQRGMPGESGPRGLLGPRGSPGTAGQRGLTGLDGPPGPKGNMGPQGEPGPP 699

Query:   489 ARSGSGQPRG 498
              + G+  P G
Sbjct:   700 GQQGNTGPHG 709


>UNIPROTKB|F1N2Y2
            symbol:COL5A2 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0071230 "cellular response to amino acid stimulus"
            evidence=IEA] [GO:0048592 "eye morphogenesis" evidence=IEA]
            [GO:0046332 "SMAD binding" evidence=IEA] [GO:0043588 "skin
            development" evidence=IEA] [GO:0030199 "collagen fibril
            organization" evidence=IEA] [GO:0005588 "collagen type V"
            evidence=IEA] [GO:0001501 "skeletal system development"
            evidence=IEA] [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] InterPro:IPR000885 InterPro:IPR001007
            Pfam:PF00093 Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208
            PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038 SMART:SM00214
            GO:GO:0043588 GO:GO:0030199 GO:GO:0001501 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0071230 GO:GO:0005201 GO:GO:0048592
            GeneTree:ENSGT00660000095287 GO:GO:0005588 OMA:PDHKPVW
            EMBL:DAAA02003915 EMBL:DAAA02003916 EMBL:DAAA02003917
            EMBL:DAAA02003918 IPI:IPI00826022 Ensembl:ENSBTAT00000038684
            Uniprot:F1N2Y2
        Length = 1491

 Score = 137 (53.3 bits), Expect = 2.6e-05, P = 2.6e-05
 Identities = 88/295 (29%), Positives = 110/295 (37%)

Query:   232 VDRRAADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGP 288
             +  + A+G+ G  GA G         P G    E G   P+G  GPP S    G  G   
Sbjct:   783 IGEKGAEGTAGNDGARGLPGPLGPPGPSGPTG-EKGEPGPRGLVGPPGSRGNPGSRGENG 841

Query:   289 NTSTSAYAATQS--GTP-MRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAK 344
              T    +A  Q   G P ++     P   G   S GP G   S  P + P   P     +
Sbjct:   842 PTGAVGFAGPQGPDGQPGVKGEPGEPGQKGDAGSPGPQGLAGSPGP-HGPNGVPGLKGGR 900

Query:   345 GPGYDP--TKGPGYDAQKGSNYDAQR-GPNYDIHRGPSYDPQRGLGYDM-------QRGP 394
             G    P  T  PG   + G    A   GP   +   P  +   GL  D         RGP
Sbjct:   901 GTQGPPGATGFPGSAGRVGPPGPAGAPGPAGPLGE-PGKEGPPGLRGDPGSHGRVGDRGP 959

Query:   395 NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPS 451
                   GPG +    PG D Q GP  +    P+    QRG  G   QRG+ G      P+
Sbjct:   960 A-GPPGGPGDKGD--PGEDGQPGP--DGPPGPAGTTGQRGIVGMPGQRGERGMPGLPGPA 1014

Query:   452 YDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP-RGGNPARR 504
               P +  G  GAP    P G V PP +N P G   P   +G+ G P R G    R
Sbjct:  1015 GTPGK-VGPTGAPGDKGPPGPVGPPGSNGPVGEPGPEGPAGNDGTPGRDGAVGER 1068


>UNIPROTKB|F1PG08
            symbol:COL5A2 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 GeneTree:ENSGT00660000095287
            EMBL:AAEX03017882 EMBL:AAEX03017883 EMBL:AAEX03017884
            Ensembl:ENSCAFT00000023545 OMA:ETCNGLD Uniprot:F1PG08
        Length = 1499

 Score = 137 (53.3 bits), Expect = 2.6e-05, P = 2.6e-05
 Identities = 87/295 (29%), Positives = 109/295 (36%)

Query:   232 VDRRAADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGP 288
             +  + A+G+ G  GA G         P G    E G   P+G  GPP S    G  G   
Sbjct:   782 IGEKGAEGTAGNDGARGLPGPLGPPGPAGPTG-EKGEPGPRGLVGPPGSRGNPGSRGENG 840

Query:   289 NTSTSAYAATQS--GTP-MRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAK 344
              T    +A  Q   G P ++     P   G   S GP G   S  P + P   P     +
Sbjct:   841 PTGAVGFAGPQGPDGQPGVKGEPGEPGQKGDAGSPGPQGLAGSPGP-HGPNGVPGLKGGR 899

Query:   345 GPGYDP--TKGPGYDAQKGS-NYDAQRGPNYDIHRGPSYDPQRGLGYDM-------QRGP 394
             G    P  T  PG   + G        GP   +   P  +   GL  D         RGP
Sbjct:   900 GTQGPPGATGFPGSAGRVGPPGPPGAPGPAGPLGE-PGKEGPPGLRGDPGSHGRVGDRGP 958

Query:   395 NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPS 451
                   GPG +    PG D Q GP  +    P+    QRG  G   QRG+ G      P+
Sbjct:   959 A-GPPGGPGDKGD--PGEDGQPGP--DGPPGPAGTTGQRGIVGMPGQRGERGMPGLPGPA 1013

Query:   452 YDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP-RGGNPARR 504
               P +  G  GAP    P G V PP +N P G   P   +G+ G P R G    R
Sbjct:  1014 GTPGK-VGPTGAPGDKGPPGPVGPPGSNGPVGEPGPEGPAGNDGTPGRDGAVGER 1067


>ZFIN|ZDB-GENE-050809-108
            symbol:pygo2 "pygopus homolog 2 (Drosophila)"
            species:7955 "Danio rerio" [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR001965
            InterPro:IPR019787 Pfam:PF00628 PROSITE:PS50016 SMART:SM00249
            ZFIN:ZDB-GENE-050809-108 GO:GO:0046872 GO:GO:0008270
            Gene3D:3.30.40.10 InterPro:IPR011011 InterPro:IPR013083
            SUPFAM:SSF57903 InterPro:IPR019786 PROSITE:PS01359
            GeneTree:ENSGT00530000063948 CTD:90780 OrthoDB:EOG4QZ7MB
            EMBL:CR628394 IPI:IPI00650328 RefSeq:NP_001028283.2
            UniGene:Dr.159286 SMR:Q1L8T6 Ensembl:ENSDART00000131324
            GeneID:613247 KEGG:dre:613247 InParanoid:Q1L8T6 OMA:RFGMPPQ
            NextBio:20898499 Uniprot:Q1L8T6
        Length = 571

 Score = 132 (51.5 bits), Expect = 2.7e-05, P = 2.7e-05
 Identities = 83/302 (27%), Positives = 105/302 (34%)

Query:   227 MNAPNVDRRAADGSYGGATGN-SENETSGRP-----VGQNAYEDGYGVPQ---GHGPPPS 277
             M +P   +R ++ S G A  + SE      P     V  N ++D +G P    G G P  
Sbjct:    16 MKSPEKKKRKSN-SQGAAFSHLSEFAPPPTPMVDHLVASNPFDDDFGPPSRSAGGGGPGG 74

Query:   278 ATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKG 337
             AT     GAG       Y     G  M        GPG   S  PG      P   P  G
Sbjct:    75 ATFLPSPGAGGG----GYGGP--GR-MGGGMGFMGGPGGPGSGQPGRRPPFGPP-TPNTG 126

Query:   338 PSYDPAKG--PGYDPTKGPGYDA----QKGSNYDAQRGPNYD--IHRGPSYDPQRGLGYD 389
             P +    G  PG+    G G         G        PN+   +H G  ++P    G  
Sbjct:   127 PHHPLGFGGMPGFGGGGGGGGGGGGGFPPGGPSQFNMPPNFSPPMHPGQGFNPMLSPGA- 185

Query:   390 MQRGPNYDMQRGPGYET----QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQR---GQ 442
             M  GP      GP +      Q+ P +  Q G  + +   P     RGP +       G 
Sbjct:   186 MGGGPGGG--GGPPHPRFGMPQQQPPHG-QGGHPFNSPPLPGGPGPRGPPHGPMNPMGGM 242

Query:   443 GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPY-GSATPPARSGS--GQPRGG 499
             G  M          G    G   G  P GQ PPP +  PY GS+ P    G   G P GG
Sbjct:   243 GGGMNMMGMGGGGGGGNMVGGHPGMPPQGQFPPPQDG-PYPGSSPPVGEEGKNFGGPGGG 301

Query:   500 NP 501
              P
Sbjct:   302 PP 303


>UNIPROTKB|J9P8I1
            symbol:CROCC "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0051297 "centrosome organization"
            evidence=IEA] [GO:0035253 "ciliary rootlet" evidence=IEA]
            InterPro:IPR026733 GO:GO:0051297 GO:GO:0035253
            GeneTree:ENSGT00700000104019 PANTHER:PTHR23159:SF5
            EMBL:AAEX03001849 Ensembl:ENSCAFT00000047339 Uniprot:J9P8I1
        Length = 2015

 Score = 116 (45.9 bits), Expect = 3.1e-05, Sum P(2) = 3.1e-05
 Identities = 38/135 (28%), Positives = 69/135 (51%)

Query:    51 MEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIG--GMKSERELQM 108
             +E++  S   E ++L T+ + L      LR+EL  AQ   Q+  GQ G    + E     
Sbjct:  1147 LERQRESSTREAEELRTQLRLLEDARDGLRRELLEAQR--QVREGQDGREAQRQEASELR 1204

Query:   109 RNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQ 168
             R+L+E + + EA  +T E ++   +K+++E  +L +A E+   K+  LT+       + +
Sbjct:  1205 RSLSEGVQEREALRRTNEELRAAVKKAESERISLKLANEDKEQKLALLTEARVAVGKEAE 1264

Query:   169 QIPALLSELESLRQE 183
             ++ A L E+E  R E
Sbjct:  1265 ELRAGLQEVERSRLE 1279

 Score = 106 (42.4 bits), Expect = 0.00036, Sum P(2) = 0.00036
 Identities = 52/200 (26%), Positives = 90/200 (45%)

Query:    43 DMMPPPEVMEQKIASQHVEMQKLATEN--QRLAATHGTLRQELAAAQHELQI-LHGQIGG 99
             ++M    V  ++ A   +  Q++A E   QRL        +EL A + +LQ  L  +   
Sbjct:   969 ELMAQKLVQAEREAQASLREQRVAHEEDLQRLQQEKEAAWRELEAERAQLQSQLQREREE 1028

Query:   100 MKSERELQMRNLTEKIAKMEAE----LKTAEPVK---LEFQKSKTEA--QNLVVAREELI 150
             + +  E +   L+E+IA ++ E    L  AE  K   L  ++S+  A  + L+  +  L 
Sbjct:  1029 LLARLEAEKEELSEEIAALQQERDEGLLLAESEKQQALSLKESEKTALSEKLMGTQHSLA 1088

Query:   151 A---KVHQLTQDLQ-RAHTDVQQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQ 206
             A   ++ +  +D Q R   D   + AL+SEL  LR +      T+  E K   +   +L 
Sbjct:  1089 AISLEMERQKRDAQSRQEQDRSTVNALMSELRDLRAQLEEAADTHAQEVKRLQEQARNL- 1147

Query:   207 VMEKNYITMATEVEKLRAEL 226
               E+   +   E E+LR +L
Sbjct:  1148 --ERQRESSTREAEELRTQL 1165

 Score = 73 (30.8 bits), Expect = 3.1e-05, Sum P(2) = 3.1e-05
 Identities = 29/92 (31%), Positives = 39/92 (42%)

Query:   408 RVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFD--GAPR 465
             R  G + +   V EAQR    +   G    L+RG G  + R+PS  P   T F    AP 
Sbjct:  1413 RGQGLEAELARV-EAQRRADEVQLGGLRSALRRGLG--LGRSPSPPPLPSTSFPTGSAPA 1469

Query:   466 GAAPHG-QVPPPLNNVPYGSATPPARSGSGQP 496
             G +  G + P PL   P     PP+   +  P
Sbjct:  1470 GGSGEGLRSPSPLERSP--GCEPPSPGPTTSP 1499


>UNIPROTKB|F1Q2C0
            symbol:CROCC "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0051297 "centrosome organization"
            evidence=IEA] [GO:0035253 "ciliary rootlet" evidence=IEA]
            InterPro:IPR026733 GO:GO:0051297 GO:GO:0035253
            GeneTree:ENSGT00700000104019 PANTHER:PTHR23159:SF5
            EMBL:AAEX03001849 Ensembl:ENSCAFT00000025161 OMA:SDWRREE
            Uniprot:F1Q2C0
        Length = 2018

 Score = 116 (45.9 bits), Expect = 3.1e-05, Sum P(2) = 3.1e-05
 Identities = 38/135 (28%), Positives = 69/135 (51%)

Query:    51 MEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIG--GMKSERELQM 108
             +E++  S   E ++L T+ + L      LR+EL  AQ   Q+  GQ G    + E     
Sbjct:  1147 LERQRESSTREAEELRTQLRLLEDARDGLRRELLEAQR--QVREGQDGREAQRQEASELR 1204

Query:   109 RNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQ 168
             R+L+E + + EA  +T E ++   +K+++E  +L +A E+   K+  LT+       + +
Sbjct:  1205 RSLSEGVQEREALRRTNEELRAAVKKAESERISLKLANEDKEQKLALLTEARVAVGKEAE 1264

Query:   169 QIPALLSELESLRQE 183
             ++ A L E+E  R E
Sbjct:  1265 ELRAGLQEVERSRLE 1279

 Score = 106 (42.4 bits), Expect = 0.00036, Sum P(2) = 0.00036
 Identities = 52/200 (26%), Positives = 90/200 (45%)

Query:    43 DMMPPPEVMEQKIASQHVEMQKLATEN--QRLAATHGTLRQELAAAQHELQI-LHGQIGG 99
             ++M    V  ++ A   +  Q++A E   QRL        +EL A + +LQ  L  +   
Sbjct:   969 ELMAQKLVQAEREAQASLREQRVAHEEDLQRLQQEKEAAWRELEAERAQLQSQLQREREE 1028

Query:   100 MKSERELQMRNLTEKIAKMEAE----LKTAEPVK---LEFQKSKTEA--QNLVVAREELI 150
             + +  E +   L+E+IA ++ E    L  AE  K   L  ++S+  A  + L+  +  L 
Sbjct:  1029 LLARLEAEKEELSEEIAALQQERDEGLLLAESEKQQALSLKESEKTALSEKLMGTQHSLA 1088

Query:   151 A---KVHQLTQDLQ-RAHTDVQQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQ 206
             A   ++ +  +D Q R   D   + AL+SEL  LR +      T+  E K   +   +L 
Sbjct:  1089 AISLEMERQKRDAQSRQEQDRSTVNALMSELRDLRAQLEEAADTHAQEVKRLQEQARNL- 1147

Query:   207 VMEKNYITMATEVEKLRAEL 226
               E+   +   E E+LR +L
Sbjct:  1148 --ERQRESSTREAEELRTQL 1165

 Score = 73 (30.8 bits), Expect = 3.1e-05, Sum P(2) = 3.1e-05
 Identities = 29/92 (31%), Positives = 39/92 (42%)

Query:   408 RVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFD--GAPR 465
             R  G + +   V EAQR    +   G    L+RG G  + R+PS  P   T F    AP 
Sbjct:  1413 RGQGLEAELARV-EAQRRADEVQLGGLRSALRRGLG--LGRSPSPPPLPSTSFPTGSAPA 1469

Query:   466 GAAPHG-QVPPPLNNVPYGSATPPARSGSGQP 496
             G +  G + P PL   P     PP+   +  P
Sbjct:  1470 GGSGEGLRSPSPLERSP--GCEPPSPGPTTSP 1499


>UNIPROTKB|Q767K9
            symbol:PPP1R10 "Serine/threonine-protein phosphatase 1
            regulatory subunit 10" species:9823 "Sus scrofa" [GO:0000785
            "chromatin" evidence=ISS] [GO:0072357 "PTW/PP1 phosphatase complex"
            evidence=ISS] [GO:0005634 "nucleus" evidence=IEA] [GO:0004864
            "protein phosphatase inhibitor activity" evidence=IEA] [GO:0003723
            "RNA binding" evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA]
            [GO:0008270 "zinc ion binding" evidence=IEA] [GO:0006351
            "transcription, DNA-dependent" evidence=IEA] InterPro:IPR000571
            InterPro:IPR003617 InterPro:IPR017923 Pfam:PF00642 Pfam:PF08711
            PROSITE:PS50103 PROSITE:PS51319 SMART:SM00356 SMART:SM00509
            GO:GO:0005634 GO:GO:0046872 GO:GO:0003677 GO:GO:0008270
            GO:GO:0000785 GO:GO:0006351 GO:GO:0003723 EMBL:AB113357
            GO:GO:0004864 Gene3D:1.20.930.10 SUPFAM:SSF47676 GO:GO:0072357
            CTD:5514 eggNOG:NOG69306 HOGENOM:HOG000049285 HOVERGEN:HBG053646
            OMA:PPPHEHR OrthoDB:EOG451DQK GeneTree:ENSGT00530000063820
            RefSeq:NP_001116637.1 UniGene:Ssc.39454 ProteinModelPortal:Q767K9
            Ensembl:ENSSSCT00000001463 Ensembl:ENSSSCT00000034462
            GeneID:100144450 KEGG:ssc:100144450 ArrayExpress:Q767K9
            Uniprot:Q767K9
        Length = 925

 Score = 134 (52.2 bits), Expect = 3.1e-05, P = 3.1e-05
 Identities = 60/237 (25%), Positives = 80/237 (33%)

Query:   239 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAAT 298
             G  GG  G         P G + + DG G P   GP       G  G GP          
Sbjct:   659 GGPGGPVGPRLLGPPPPPRGGDPFWDGPGDPMRGGP-----MRGGPGPGPGPYHRGRGGR 713

Query:   299 QSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDA 358
                 P       P  P +  ++G G      P+     GP      G G+ P +GPG   
Sbjct:   714 GGNEPP------PPPPPFRGARG-GRSGGGPPN--GRGGPGGGMVGGGGHRPHEGPGGGM 764

Query:   359 QKGSNYDAQRGPNYDIHRG--PSYDPQRGLG--YDMQRGPNYDMQRGPGYETQRVPGYDV 414
               GS +    GP   +  G  P   P  G+G  +    GP   M  G G+     PG  +
Sbjct:   765 SSGSGHRPHEGPGGGMGGGHRPHEGPGGGMGGGHRPHEGPGGGMGGGSGHRPHEGPGGGM 824

Query:   415 QRGPVYEAQRAPSYIPQRGPG-YDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPH 470
               G  +     P +    G   +D+   +G+D R  P   P    G DG   G   H
Sbjct:   825 GAGGGHRPHEGPGHGGPHGHRPHDVPGHRGHDHRGPP---PHEHRGHDGPGHGGGGH 878


>UNIPROTKB|F1S4P6
            symbol:EIF3A "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005852 "eukaryotic translation initiation factor 3
            complex" evidence=IEA] [GO:0005730 "nucleolus" evidence=IEA]
            [GO:0003743 "translation initiation factor activity" evidence=IEA]
            [GO:0001732 "formation of translation initiation complex"
            evidence=IEA] InterPro:IPR000717 Pfam:PF01399 SMART:SM00088
            GO:GO:0005730 GO:GO:0003743 GO:GO:0005852 OMA:QDRDEND
            GeneTree:ENSGT00690000102108 GO:GO:0001732 EMBL:CU407047
            Ensembl:ENSSSCT00000011680 Uniprot:F1S4P6
        Length = 1378

 Score = 101 (40.6 bits), Expect = 3.3e-05, Sum P(2) = 3.3e-05
 Identities = 64/224 (28%), Positives = 80/224 (35%)

Query:   247 NSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRA 306
             + E E+S RP        G  +    GP            G +    ++  T    P R 
Sbjct:   943 DEERESSLRPDEDRGPRRG--MDDDRGPRRGLDEDRFSRRGADDDRPSWRNTDDDRPPRR 1000

Query:   307 AYDIPRGPGYEASKGPGYDASKAPSYDPTKGP--SYDPAKGP--GYDPTKGP---GYDAQ 359
               D  RG    A      D       D  +G   + D  +GP  G D  +GP   G D +
Sbjct:  1001 IGDEDRGSWRHADD----DRPPRRGLDEDRGSWRTADEDRGPRRGMDEDRGPRRGGVDDE 1056

Query:   360 KGS--NYDAQRGPN-YDIHRGP--SYDPQRG--LGYDMQRGPN--YDMQRGPGYETQ--R 408
             + S  N D  R     D  RGP    D  RG   G D  RGP    D  RGP   T   R
Sbjct:  1057 RSSWRNADDDRPRRGMDDDRGPRRGMDDDRGPRRGMDDDRGPRRGLDDDRGPWRNTDDDR 1116

Query:   409 VP--GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAP 450
             +   G D  RGP          IP+RG    + R +G D R  P
Sbjct:  1117 ISRRGADDDRGPWRNMD--DDRIPRRGDDDRIPR-RGDDSRPGP 1157

 Score = 85 (35.0 bits), Expect = 3.3e-05, Sum P(2) = 3.3e-05
 Identities = 42/186 (22%), Positives = 80/186 (43%)

Query:    40 PPFDMMPPPEVMEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGG 99
             P    MP  ++  Q  A   V    LA   + +   H  + QE    QH+L +       
Sbjct:   509 PHLQSMPSEQIRNQLTAMSSV----LAKALEVIKPAH--ILQE-KEEQHQLAVTAYLKNS 561

Query:   100 MKSERELQMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQD 159
              K  + +  R  T +  K   E    +  K E ++ + E Q +  A EE + +  +  ++
Sbjct:   562 RKEHQRILARRQTIEERKERLESLNIQREKEELEQREAELQKVRKAEEERLRQEAK-ERE 620

Query:   160 LQRAHTDVQQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEV 219
              +R   + +QI     + +++R+     + T    K F +  +E L+ ++ ++I MA +V
Sbjct:   621 KERILQEHEQI-----KKKTVRERLEQIKKTELGAKAFKDIDIEDLEELDPDFI-MAKQV 674

Query:   220 EKLRAE 225
             E+L  E
Sbjct:   675 EQLEKE 680

 Score = 72 (30.4 bits), Expect = 0.00069, Sum P(2) = 0.00069
 Identities = 31/113 (27%), Positives = 48/113 (42%)

Query:    50 VMEQKIASQHVEMQKLATENQRLAAT-HGTLRQELAAAQHELQILHGQIGGMKSERELQM 108
             VM  K A Q V  +KL    +RLA   H  L +     + E +I +      + + E + 
Sbjct:   761 VMRLKAARQSVYEEKLKQFEERLAEERHNRLEERKRQRKEERRITY-----YREKEEEEQ 815

Query:   109 RNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQ 161
             R   E++ K   E + AE  K E  +   E Q  V   EE+  K  Q   +++
Sbjct:   816 RRAEEQMLKEREERERAERAKRE--EELREYQERVKKLEEVERKKRQRELEIE 866


>WB|WBGene00000677
            symbol:col-103 species:6239 "Caenorhabditis elegans"
            [GO:0042302 "structural constituent of cuticle" evidence=IEA]
            [GO:0016021 "integral to membrane" evidence=IEA] [GO:0040011
            "locomotion" evidence=IMP] InterPro:IPR002486 Pfam:PF01484
            SMART:SM01088 GO:GO:0040011 GeneTree:ENSGT00690000102663
            GO:GO:0042302 HOGENOM:HOG000085656 EMBL:FO081484 PIR:E88633
            RefSeq:NP_499982.1 ProteinModelPortal:O45114 STRING:O45114
            EnsemblMetazoa:F56B3.1 GeneID:176901 KEGG:cel:CELE_F56B3.1
            UCSC:F56B3.1 CTD:176901 WormBase:F56B3.1 eggNOG:NOG301529
            InParanoid:O45114 OMA:SNTCPPG NextBio:894512 Uniprot:O45114
        Length = 371

 Score = 128 (50.1 bits), Expect = 3.7e-05, P = 3.7e-05
 Identities = 87/287 (30%), Positives = 103/287 (35%)

Query:   229 APNVDRRA-----ADGSYGGATGNSE-NETSGRPVGQNA---YEDGYGVPQGHGPPPSAT 279
             APN ++R        G YGG  G +      G  VG      Y  G+G   GHG      
Sbjct:    63 APNREKRGYAQYGGGGGYGGGHGGAAVGGGYGGAVGGGGGGGYGGGHG--GGHGGAVGGG 120

Query:   280 TAGVVGAGPNTSTSAYAAT----QSGTPMRAAYD-IPRGPGYEASKGPGYDASKAPSYDP 334
               G  G G     S  + T      G P +A  D +P  PG   S G     S   S   
Sbjct:   121 YGGGGGGGGGCQCSPSSNTCPPGPRGPPGQAGLDGLPGAPGQPGSNGGA--GSNGASEGS 178

Query:   335 TKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGP 394
               G    PA  PG  P  GP   A +  N D Q G        PS+    G+G     GP
Sbjct:   179 AGGCKTCPAGPPG--PP-GPAGQAGRPGN-DGQPG-------APSFGG--GVGAPGAPGP 225

Query:   395 NYDM-QRG-PGYETQRV-PGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS 451
               D    G PG   Q   PG + Q G        P+  P   PG +   G GY +   P 
Sbjct:   226 AGDAGSPGQPGAPGQPGRPGKNAQGGSSRPGPPGPAG-PPGPPGNNGAPGGGYGV--GPP 282

Query:   452 YDPSRGTGFDGAPRGAAPHGQVPPPLNN-VPYGSAT--P-PARSGSG 494
               P   +G  GAP    P GQ   P N+  P   A   P P R G G
Sbjct:   283 GPPGP-SGRPGAPGQPGPDGQPGAPGNDGTPGTDAAYCPCPGRGGGG 328


>CGD|CAL0000919
            symbol:RPO21 species:5476 "Candida albicans" [GO:0005665
            "DNA-directed RNA polymerase II, core complex" evidence=IEA]
            [GO:0005829 "cytosol" evidence=IEA] [GO:0005739 "mitochondrion"
            evidence=IEA] [GO:0030447 "filamentous growth" evidence=IMP]
            [GO:0006366 "transcription from RNA polymerase II promoter"
            evidence=IEA] [GO:0009267 "cellular response to starvation"
            evidence=IMP] [GO:0036170 "filamentous growth of a population of
            unicellular organisms in response to starvation" evidence=IMP]
            [GO:0036180 "filamentous growth of a population of unicellular
            organisms in response to biotic stimulus" evidence=IMP] [GO:0071216
            "cellular response to biotic stimulus" evidence=IMP] [GO:0003899
            "DNA-directed RNA polymerase activity" evidence=IEA] [GO:0003677
            "DNA binding" evidence=IEA] [GO:0003968 "RNA-directed RNA
            polymerase activity" evidence=IEA] InterPro:IPR000684
            InterPro:IPR000722 InterPro:IPR006592 InterPro:IPR007066
            InterPro:IPR007073 InterPro:IPR007075 InterPro:IPR007080
            InterPro:IPR007081 InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983
            Pfam:PF04990 Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000
            Pfam:PF05001 PROSITE:PS00115 SMART:SM00663 CGD:CAL0000919
            GO:GO:0071216 GO:GO:0036180 GO:GO:0003677 GO:GO:0006366
            GO:GO:0009267 Gene3D:2.40.40.20 InterPro:IPR009010
            EMBL:AACQ01000032 GO:GO:0036170 GO:GO:0003899 eggNOG:COG0086
            GO:GO:0005665 KO:K03006 RefSeq:XP_719414.1 STRING:Q5ACI7
            GeneID:3638991 KEGG:cal:CaO19.7655 Uniprot:Q5ACI7
        Length = 1728

 Score = 136 (52.9 bits), Expect = 4.0e-05, P = 4.0e-05
 Identities = 68/221 (30%), Positives = 85/221 (38%)

Query:   237 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAY 295
             ADG  GGAT   + E        NA ++   +  G G  P        G  G  TS    
Sbjct:  1465 ADG--GGATPYKDYE--------NARDENIDIDAGAGFSPIHIAQMNEGNIGGLTSYGGQ 1514

Query:   296 AATQSGT-PMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP 354
               + + T P    Y+    PGY  S G GY  + +PSY PT  PSY P   P Y PT  P
Sbjct:  1515 PTSPAATSPFSYGYNSITSPGY-TSPGYGYSPT-SPSYSPTS-PSYAPTS-PAYSPTS-P 1569

Query:   355 GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDV 414
              Y A     Y +   P+Y     P+Y P     Y     P+Y     P Y     P Y  
Sbjct:  1570 SY-APTSPAY-SPTSPSY-APTSPAYSPTSP-AYS-PTSPSYS-PTSPQYSPTS-PSYS- 1621

Query:   415 QRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPS 455
                P Y +  +PSY P   P Y            +PSY P+
Sbjct:  1622 PTSPSY-SPTSPSYSPT-SPAYS---------PTSPSYSPT 1651


>UNIPROTKB|Q5ACI7
            symbol:RPO21 "DNA-directed RNA polymerase" species:237561
            "Candida albicans SC5314" [GO:0009267 "cellular response to
            starvation" evidence=IMP] [GO:0030447 "filamentous growth"
            evidence=IMP] [GO:0036170 "filamentous growth of a population of
            unicellular organisms in response to starvation" evidence=IMP]
            [GO:0036180 "filamentous growth of a population of unicellular
            organisms in response to biotic stimulus" evidence=IMP] [GO:0071216
            "cellular response to biotic stimulus" evidence=IMP]
            InterPro:IPR000684 InterPro:IPR000722 InterPro:IPR006592
            InterPro:IPR007066 InterPro:IPR007073 InterPro:IPR007075
            InterPro:IPR007080 InterPro:IPR007081 InterPro:IPR007083
            Pfam:PF00623 Pfam:PF04983 Pfam:PF04990 Pfam:PF04992 Pfam:PF04997
            Pfam:PF04998 Pfam:PF05000 Pfam:PF05001 PROSITE:PS00115
            SMART:SM00663 CGD:CAL0000919 GO:GO:0071216 GO:GO:0036180
            GO:GO:0003677 GO:GO:0006366 GO:GO:0009267 Gene3D:2.40.40.20
            InterPro:IPR009010 EMBL:AACQ01000032 GO:GO:0036170 GO:GO:0003899
            eggNOG:COG0086 GO:GO:0005665 KO:K03006 RefSeq:XP_719414.1
            STRING:Q5ACI7 GeneID:3638991 KEGG:cal:CaO19.7655 Uniprot:Q5ACI7
        Length = 1728

 Score = 136 (52.9 bits), Expect = 4.0e-05, P = 4.0e-05
 Identities = 68/221 (30%), Positives = 85/221 (38%)

Query:   237 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAY 295
             ADG  GGAT   + E        NA ++   +  G G  P        G  G  TS    
Sbjct:  1465 ADG--GGATPYKDYE--------NARDENIDIDAGAGFSPIHIAQMNEGNIGGLTSYGGQ 1514

Query:   296 AATQSGT-PMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP 354
               + + T P    Y+    PGY  S G GY  + +PSY PT  PSY P   P Y PT  P
Sbjct:  1515 PTSPAATSPFSYGYNSITSPGY-TSPGYGYSPT-SPSYSPTS-PSYAPTS-PAYSPTS-P 1569

Query:   355 GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDV 414
              Y A     Y +   P+Y     P+Y P     Y     P+Y     P Y     P Y  
Sbjct:  1570 SY-APTSPAY-SPTSPSY-APTSPAYSPTSP-AYS-PTSPSYS-PTSPQYSPTS-PSYS- 1621

Query:   415 QRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPS 455
                P Y +  +PSY P   P Y            +PSY P+
Sbjct:  1622 PTSPSY-SPTSPSYSPT-SPAYS---------PTSPSYSPT 1651


>TAIR|locus:4010713902
            symbol:AT4G22505 species:3702 "Arabidopsis thaliana"
            [GO:0006869 "lipid transport" evidence=IEA] EMBL:CP002687
            GO:GO:0006869 InterPro:IPR016140 SUPFAM:SSF47699 UniGene:At.22887
            UniGene:At.74604 IPI:IPI00938995 RefSeq:NP_001154263.1 PRIDE:F4JLV7
            EnsemblPlants:AT4G22505.1 GeneID:5008157 KEGG:ath:AT4G22505
            OMA:GSEMAGM Uniprot:F4JLV7
        Length = 530

 Score = 130 (50.8 bits), Expect = 4.0e-05, P = 4.0e-05
 Identities = 54/229 (23%), Positives = 67/229 (29%)

Query:   269 PQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASK 328
             P+   PPP  T      A P T   +        P       P+ P     K P     +
Sbjct:    74 PRTPPPPPPRTPRTPPTAPPRTPPVSPRIPPILPPKTPPTAPPQTPPVSPPKSPPNSPPR 133

Query:   329 APSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGY 388
             AP   P + P   P + P   P + P     +       R P+    R P   P R    
Sbjct:   134 APPLSPPRTPPTSPPRVPPLSPPRTPPTSPPRAPPIPPPRTPSTSPPRAPPLSPPRTPPT 193

Query:   389 DMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR 448
                R P       P     R P     R P     R P   P R P     R       R
Sbjct:   194 SPPRAPPVPPPNTPPTSPPRAPPLSPPRTPPNSPPRTPPTSPPRAPPVPPPRISPTAPPR 253

Query:   449 APSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPR 497
             AP   P R T     PR       + PP +       +PP    +  PR
Sbjct:   254 APPLSPPR-TPPTSPPRTPPLSPPITPPTSPPRAPPLSPPRTPPTSPPR 301

 Score = 121 (47.7 bits), Expect = 0.00039, P = 0.00039
 Identities = 58/231 (25%), Positives = 69/231 (29%)

Query:   269 PQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASK 328
             P+   PPP  T        P T  +   A     P+      PR P     K P     +
Sbjct:    63 PRTPPPPPPRTPRTPPPPPPRTPRTPPTAPPRTPPVS-----PRIPPILPPKTPPTAPPQ 117

Query:   329 APSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGY 388
              P   P K P   P + P   P + P     +       R P     R P   P R    
Sbjct:   118 TPPVSPPKSPPNSPPRAPPLSPPRTPPTSPPRVPPLSPPRTPPTSPPRAPPIPPPRTPST 177

Query:   389 DMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR 448
                R P     R P     R P       P     RAP   P R P     R       R
Sbjct:   178 SPPRAPPLSPPRTPPTSPPRAPPVPPPNTPPTSPPRAPPLSPPRTPPNSPPRTPPTSPPR 237

Query:   449 APSYDPSRGTGFDGAPRGAAPHGQVP--PPLNNVPYGSATPPARSGSGQPR 497
             AP   P R +     PR  AP    P  PP +       +PP    +  PR
Sbjct:   238 APPVPPPRISP-TAPPR--APPLSPPRTPPTSPPRTPPLSPPITPPTSPPR 285


>MGI|MGI:88453
            symbol:Col3a1 "collagen, type III, alpha 1" species:10090 "Mus
            musculus" [GO:0001568 "blood vessel development" evidence=IMP]
            [GO:0005178 "integrin binding" evidence=ISO] [GO:0005201
            "extracellular matrix structural constituent" evidence=ISO]
            [GO:0005576 "extracellular region" evidence=IEA] [GO:0005578
            "proteinaceous extracellular matrix" evidence=IEA] [GO:0005581
            "collagen" evidence=IDA] [GO:0005586 "collagen type III"
            evidence=ISO;IDA] [GO:0005615 "extracellular space" evidence=ISO]
            [GO:0007160 "cell-matrix adhesion" evidence=ISO] [GO:0007179
            "transforming growth factor beta receptor signaling pathway"
            evidence=ISO] [GO:0007229 "integrin-mediated signaling pathway"
            evidence=ISO] [GO:0007507 "heart development" evidence=ISO]
            [GO:0009314 "response to radiation" evidence=ISO] [GO:0018149
            "peptide cross-linking" evidence=ISO] [GO:0030199 "collagen fibril
            organization" evidence=ISO;IMP] [GO:0031012 "extracellular matrix"
            evidence=ISO;IDA] [GO:0032964 "collagen biosynthetic process"
            evidence=ISO] [GO:0034097 "response to cytokine stimulus"
            evidence=ISO] [GO:0042060 "wound healing" evidence=ISO] [GO:0043206
            "extracellular fibril organization" evidence=ISO] [GO:0043588 "skin
            development" evidence=ISO] [GO:0046332 "SMAD binding" evidence=IPI]
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0048407
            "platelet-derived growth factor binding" evidence=ISO] [GO:0048565
            "digestive tract development" evidence=IMP] [GO:0050777 "negative
            regulation of immune response" evidence=ISO] [GO:0071230 "cellular
            response to amino acid stimulus" evidence=IDA] InterPro:IPR000885
            InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410 ProDom:PD002078
            PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038
            SMART:SM00214 MGI:MGI:88453 GO:GO:0043588 GO:GO:0005615
            GO:GO:0007507 GO:GO:0046872 GO:GO:0034097 GO:GO:0030199
            GO:GO:0001501 GO:GO:0007179 GO:GO:0007229 GO:GO:0007160
            InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793 GO:GO:0042060
            GO:GO:0001568 GO:GO:0048565 GO:GO:0050777 GO:GO:0009314
            GO:GO:0018149 GO:GO:0032964 GO:GO:0071230 GO:GO:0043206
            GO:GO:0005201 HOVERGEN:HBG004933 KO:K06236 OrthoDB:EOG4FTW1C
            CTD:1281 OMA:EGSPGHP ChiTaRS:COL3A1 GO:GO:0005586 EMBL:X52046
            EMBL:BC043089 EMBL:BC058724 EMBL:M18933 EMBL:K03037 EMBL:AK019448
            EMBL:X57983 IPI:IPI00129571 PIR:A27353 PIR:S59856
            RefSeq:NP_034060.2 UniGene:Mm.249555 ProteinModelPortal:P08121
            SMR:P08121 STRING:P08121 PhosphoSite:P08121 PaxDb:P08121
            PRIDE:P08121 Ensembl:ENSMUST00000087883 GeneID:12825 KEGG:mmu:12825
            InParanoid:P08121 NextBio:282310 Bgee:P08121 CleanEx:MM_COL3A1
            Genevestigator:P08121 Uniprot:P08121
        Length = 1464

 Score = 135 (52.6 bits), Expect = 4.2e-05, P = 4.2e-05
 Identities = 85/278 (30%), Positives = 99/278 (35%)

Query:   238 DGSYGGATGNSENETSGR--PVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAY 295
             DGS G    N     +G   P G        G+P   GPP      G  G          
Sbjct:   466 DGSPGEPGANGLPGAAGERGPSGFRGPAGPNGIPGEKGPPGERGGPGPAGPRGVAGEPGR 525

Query:   296 AATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDP-TKGPSYDPAKGPGYDPTKGP 354
               T  G  +R     P GPG +   GP    S+  S  P   GPS  P   PG     GP
Sbjct:   526 DGTPGGPGIRGMPGSPGGPGNDGKPGP--PGSQGESGRPGPPGPS-GPRGQPGVMGFPGP 582

Query:   355 -GYDAQKGSNYDAQRGPNYDIHRGPS-YDPQRG-LGYDMQRGPNYDM-QRGP-GYE-TQR 408
              G D   G N + + GP      GP+  + + G  G     GP  D    GP G +  Q 
Sbjct:   583 KGNDGAPGKNGE-RGGPGGPGLPGPAGKNGETGPQGPPGPTGPAGDKGDSGPPGPQGLQG 641

Query:   409 VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGT-GFDGAPRGA 467
             +PG     GP  E  +     P+   G     G G     AP      GT G  GA  GA
Sbjct:   642 IPGTG---GPPGENGKPGEPGPKGEVGAPGAPG-GKGDSGAPGERGPPGTAGIPGARGGA 697

Query:   468 APHGQVPPPLNNVPYGSATPPARSGS----GQP--RGG 499
              P G   P     P G   PP  SGS    G P  RGG
Sbjct:   698 GPPG---PEGGKGPAGPPGPPGASGSPGLQGMPGERGG 732

 Score = 124 (48.7 bits), Expect = 0.00066, P = 0.00066
 Identities = 82/290 (28%), Positives = 106/290 (36%)

Query:   233 DRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP--- 288
             D ++  G  GG  G +       P G + +    G P   GPP     AG  G  GP   
Sbjct:   160 DVKSGVGGMGGYPGPAGPPGPPGPPGSSGHPGSPGSPGYQGPPGEPGQAGPAGPPGPPGA 219

Query:   289 -NTSTSAYAATQSGTPMRAAY-DIPRGPGYEASKG-PGYDASKAP-SYDPTKGPSYDPAK 344
                +  A    +SG P R     +P  PG +   G PG+   K    +D   G   +   
Sbjct:   220 LGPAGPAGKDGESGRPGRPGERGLPGPPGIKGPAGMPGFPGMKGHRGFDGRNGEKGETG- 278

Query:   345 GPGYDPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGP 402
              PG     G PG +   G      RG   +  R P      G  G D  RG   D Q GP
Sbjct:   279 APGLKGENGLPGDNGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQPGP 333

Query:   403 -GYE-TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGF 460
              G   T   PG    +G V  A    S      PG   QRG+      A +  P    G 
Sbjct:   334 PGPPGTAGFPGSPGAKGEVGPAGSPGS---NGSPG---QRGEPGPQGHAGAQGPPGPPGN 387

Query:   461 DGAPRGAAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RG--GNPAR 503
             +G+P G    G  P  +   P   G+  PP  +G+ G P  RG  G P +
Sbjct:   388 NGSPGGKGEMG--PAGIPGAPGLIGARGPPGPAGTNGIPGTRGPSGEPGK 435


>UNIPROTKB|F1PG69
            symbol:COL3A1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 GeneTree:ENSGT00660000095287 OMA:EGSPGHP
            EMBL:AAEX03017880 Ensembl:ENSCAFT00000023503 Uniprot:F1PG69
        Length = 1467

 Score = 135 (52.6 bits), Expect = 4.2e-05, P = 4.2e-05
 Identities = 85/274 (31%), Positives = 106/274 (38%)

Query:   253 SGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIP 311
             +G+P G+ +++   G P   GPP +A   G  G AGP           SG  +R    I 
Sbjct:   653 NGKP-GEPSHQGDSGAPGERGPPGAAGPMGPRGGAGP---PGPEGGKVSGGDLRPP--IS 706

Query:   312 RGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG-PGYDAQKGS-NYDAQRG 369
              G G     GP   A   P      G    P  GPG    KG PG     G+   D  RG
Sbjct:   707 AGAGAAGPPGPPGSAG-TPGLQGMPGERGGPG-GPGPKGDKGEPGSAGADGAPGKDGPRG 764

Query:   370 PNYDIHR-GPSYDP-QRGLG--------YDMQRGPNYDMQRGPGYETQRVPGYDVQRG-P 418
             P   I   GP+  P  +G G           + GP    + GP       PG   Q G P
Sbjct:   765 PTGPIGPPGPAGQPGDKGEGGAPGLPGIAGPRGGPGERGEHGPPGPAG-FPGAPGQNGEP 823

Query:   419 VYEAQR-APSYIPQRGP-GYDLQRGQGYDMRRAPSYDPSRGT-GFDGAPRGAA--PHGQ- 472
               + +R AP    + GP G     G G      P     +G  G  G P GAA  P G+ 
Sbjct:   824 GAKGERGAPGEKGEGGPPGVAGPPG-GAGPAGPPGPQGVKGERGSPGGP-GAAGFPGGRG 881

Query:   473 VP-PPLNNV---PYGSATPPARSGSGQPRGGNPA 502
             +P PP NN    P GS+  P + G   P G N A
Sbjct:   882 LPGPPGNNGNPGPPGSSGAPGKDGPPGPPGNNGA 915

 Score = 133 (51.9 bits), Expect = 7.0e-05, P = 7.0e-05
 Identities = 84/286 (29%), Positives = 103/286 (36%)

Query:   231 NVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP- 288
             +V    A G  GG  G +       P G + +    G P   GPP     AG  G  GP 
Sbjct:   159 DVKAGVAGGGIGGYPGPAGPPGPPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPAGPPGPP 218

Query:   289 ---NTSTSAYAATQSGTPMRAAY-DIPRGPGYEASKG-PGYDASKAP-SYDPTKGPSYDP 342
                  S  A    +SG P R     +P  PG +   G PG+   K    +D   G   D 
Sbjct:   219 GAMGPSGPAGKDGESGRPGRPGERGLPGPPGMKGPAGMPGFPGMKGHRGFDGRNGEKGDT 278

Query:   343 AKGPGYDPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQR 400
                PG     G PG +   G      RG   +  R P      G  G D  RG   D Q 
Sbjct:   279 G-APGLKGENGLPGENGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQP 332

Query:   401 GP-GYE-TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGT 458
             GP G   T   PG    +G V  A    S      PG   QRG+      A +  P    
Sbjct:   333 GPPGPPGTAGFPGSPGAKGEVGPAGSPGS---NGSPG---QRGEPGPQGHAGAPGPPGPP 386

Query:   459 GFDGAPRGAAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RGG 499
             G +G+P G    G  P  +   P   G+  PP   G+ G P  RGG
Sbjct:   387 GSNGSPGGKGEMG--PAGIPGAPGLIGARGPPGPPGTNGAPGQRGG 430

 Score = 123 (48.4 bits), Expect = 0.00085, P = 0.00085
 Identities = 78/261 (29%), Positives = 98/261 (37%)

Query:   258 GQNAYEDGYGVPQGHGPPPSATTAGVVGA----GPNTSTSAYAAT-QSGTPMRAAY-DIP 311
             G +      G P   GPP +A   G  GA    GP  S  +  +  Q G P    +   P
Sbjct:   321 GNDGARGSDGQPGPPGPPGTAGFPGSPGAKGEVGPAGSPGSNGSPGQRGEPGPQGHAGAP 380

Query:   312 RGPGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDP-TKG-PGYDAQKGS-NYDAQ 367
               PG   S G PG      P+  P   P    A+GP   P T G PG     G    +  
Sbjct:   381 GPPGPPGSNGSPGGKGEMGPAGIPG-APGLIGARGPPGPPGTNGAPGQRGGAGEPGKNGA 439

Query:   368 RG-PNYDIHRGPSYDPQ-RG-LGYDMQRG-PNYDMQRG-PGYETQR-VPGYDVQRGPV-- 419
             +G P     RG +  P   G  G D + G P      G PG   +R  PG+   RGP   
Sbjct:   440 KGEPGPRGERGEAGSPGIPGPKGEDGKDGSPGEPGANGLPGAAGERGAPGF---RGPAGA 496

Query:   420 --YEAQRAPSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPP 476
                  ++ P+   + GPG    RG  G   R      P    G  G+P G    G+  PP
Sbjct:   497 NGLPGEKGPAG-ERGGPGPAGPRGAPGEPGRDGVPGGPGM-RGMPGSPGGPGSDGKPGPP 554

Query:   477 LNNVPYGSATPPARSGS-GQP 496
              +    G   PP  SG  GQP
Sbjct:   555 GSQGESGRPGPPGPSGPRGQP 575


>UNIPROTKB|G5EF87
            symbol:swsn-1 "SWI3-like protein" species:6239
            "Caenorhabditis elegans" [GO:0042802 "identical protein binding"
            evidence=IPI] [GO:0005515 "protein binding" evidence=IPI]
            InterPro:IPR001005 InterPro:IPR007526 InterPro:IPR009057
            Pfam:PF00249 Pfam:PF04433 PROSITE:PS50934 SMART:SM00717
            GO:GO:0005634 GO:GO:0009792 GO:GO:0002009 GO:GO:0040007
            GO:GO:0040010 GO:GO:0002119 GO:GO:0003677 GO:GO:0040018
            Gene3D:1.10.10.10 InterPro:IPR011991 GO:GO:0003682
            Gene3D:1.10.10.60 SUPFAM:SSF46689 GO:GO:0046662 GO:GO:0040035
            InterPro:IPR017884 PROSITE:PS51293 GO:GO:0040027 GO:GO:0035262
            EMBL:AL110477 KO:K11649 GeneTree:ENSGT00390000018166 EMBL:AF230279
            PIR:T26449 RefSeq:NP_001256906.1 UniGene:Cel.7072 SMR:G5EF87
            IntAct:G5EF87 EnsemblMetazoa:Y113G7B.23 GeneID:180324
            KEGG:cel:CELE_Y113G7B.23 CTD:180324 WormBase:Y113G7B.23a
            OMA:HFDELEQ NextBio:908892 Uniprot:G5EF87
        Length = 789

 Score = 131 (51.2 bits), Expect = 5.4e-05, P = 5.4e-05
 Identities = 71/248 (28%), Positives = 92/248 (37%)

Query:   267 GVPQGH---GPP---PSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASK 320
             G+P G    GPP   P    +    A P    ++ AAT +  P  +    P+ P  +A+ 
Sbjct:   551 GLPPGFEVTGPPQPTPQVQISAQEAAIPEKMDTSEAATAARPP--STPQAPQAPPVQAAP 608

Query:   321 GPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDI-HRGPS 379
              P   A +AP   P    +Y    GPG  P +   Y  Q+G  Y     P     H+   
Sbjct:   609 AP-VQAPQAPQAPPQ---AYQGYGGPGGPPQQAYRYPPQQGQQYSPYPPPQQQQQHQAQQ 664

Query:   380 YDPQRGLGYDMQ-RGPNYDMQRGPGYETQRVPG--YDVQRGPVYEAQRAPSYIPQRGPGY 436
                Q   G     +GP    Q    Y     PG  Y    G   + QR P Y  Q  PG 
Sbjct:   665 AQSQAHYGPPGGGQGPPPPPQGQQYYGGPPPPGQPYGPPGGYPPQQQRPP-YQAQPYPGP 723

Query:   437 ---DLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS 493
                  QRG GY     P   P       G P    P+GQ+PPP    P+G   P  + G 
Sbjct:   724 PPPQQQRGYGYP----PPPQP-------GHPY-QQPYGQMPPP----PHGQYQPQQQQGG 767

Query:   494 GQ-PRGGN 500
                P GG+
Sbjct:   768 PMGPPGGH 775

 Score = 120 (47.3 bits), Expect = 0.00086, P = 0.00086
 Identities = 105/465 (22%), Positives = 166/465 (35%)

Query:    36 PGAF-P-PFDMMPPPEVMEQKIASQHVEMQKLATENQRLAATHGTLRQE---LAAAQHEL 90
             P AF P PF     P +      +  V+ Q  A   +      G L++E   L A  HE 
Sbjct:   328 PLAFQPVPFSQSGNPVMSTVAFLASVVDPQVAAAATKAAMEEFGKLKEEIPPLVAEAHEK 387

Query:    91 QILH-----GQIGGMKSERELQMRNLTEKIAKMEAEL--KTAEPVKLEFQKSKTEAQNLV 143
              +       GQ+ G     +  ++   E     + ++   T + V    +      + + 
Sbjct:   388 NVAAMAEKTGQVDGAVGLTKSGLKPAEEAAGDSDEKMDTNTNDDVPSTTEAKSAIDKGVQ 447

Query:   144 VAREELIAKVHQLTQDLQRAHTDVQQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLE 203
              A    +A      + L  A  + ++I +L+++L   + +        E + + + D LE
Sbjct:   448 AAAASCLAAAAVKAKHL--AQIEERRIKSLVAQLVETQMK------KLEMKLRHF-DELE 498

Query:   204 SLQVMEKNYITMATEVEKLRAELMNAPNVDRRAADGSYGGATGNSENETSGR-PVGQNAY 262
               Q+M+K   ++  +  +L  E   A ++D+     +      +S   +SG  P G    
Sbjct:   499 --QIMDKERESLEYQRHQLILE-RQAFHMDQLKYLENRAKHEAHSRMTSSGALPAGLPPG 555

Query:   263 EDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRA-AYDIPRGPG-YEASK 320
              +  G PQ       +     +    +TS +A AA    TP    A  +   P   +A +
Sbjct:   556 FEVTGPPQPTPQVQISAQEAAIPEKMDTSEAATAARPPSTPQAPQAPPVQAAPAPVQAPQ 615

Query:   321 GP-----GYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIH 375
              P      Y     P   P +   Y P +G  Y P   P    Q  +   AQ   +Y   
Sbjct:   616 APQAPPQAYQGYGGPGGPPQQAYRYPPQQGQQYSPYPPPQQQQQHQAQ-QAQSQAHYGPP 674

Query:   376 RGPSYDPQRGLGYDMQRGPNYDMQR-GP--GYETQRV-PGYDVQR--GPVY-EAQRAPSY 428
              G    P    G     GP    Q  GP  GY  Q+  P Y  Q   GP   + QR   Y
Sbjct:   675 GGGQGPPPPPQGQQYYGGPPPPGQPYGPPGGYPPQQQRPPYQAQPYPGPPPPQQQRGYGY 734

Query:   429 IPQRGPGYDLQRGQGYDMRRAP--SYDPSRGTGFDGAPRGAAPHG 471
              P   PG+  Q+  G  M   P   Y P +  G    P G    G
Sbjct:   735 PPPPQPGHPYQQPYG-QMPPPPHGQYQPQQQQGGPMGPPGGHHEG 778


>MGI|MGI:1925567
            symbol:Ccdc88b "coiled-coil domain containing 88B"
            species:10090 "Mus musculus" [GO:0000226 "microtubule cytoskeleton
            organization" evidence=IEA] [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0008017 "microtubule
            binding" evidence=IEA] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR008636 Pfam:PF05622 MGI:MGI:1925567
            GO:GO:0005737 GO:GO:0000226 CTD:283234 eggNOG:NOG287357
            HOVERGEN:HBG104809 OMA:EGLEVQE OrthoDB:EOG4NS39S EMBL:AC120557
            EMBL:BC076600 EMBL:BC151001 EMBL:BC151009 IPI:IPI00608004
            IPI:IPI00874526 RefSeq:NP_001074760.1 UniGene:Mm.329596 HSSP:Q09013
            ProteinModelPortal:Q4QRL3 SMR:Q4QRL3 PhosphoSite:Q4QRL3
            PaxDb:Q4QRL3 PRIDE:Q4QRL3 Ensembl:ENSMUST00000113440 GeneID:78317
            KEGG:mmu:78317 UCSC:uc008gjb.1 GeneTree:ENSGT00690000101702
            HOGENOM:HOG000060297 InParanoid:B2RX63 NextBio:348677 Bgee:Q4QRL3
            CleanEx:MM_CCDC88B Genevestigator:Q4QRL3 Uniprot:Q4QRL3
        Length = 1481

 Score = 134 (52.2 bits), Expect = 5.5e-05, P = 5.5e-05
 Identities = 51/189 (26%), Positives = 92/189 (48%)

Query:    51 MEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQMRN 110
             +E ++ S     Q+L  ++QR       L+ E +  + + Q LH ++G ++ E     R 
Sbjct:  1009 LEGQLGSLQGRAQELLLQSQRAQEHSSRLQAEKSMMEMQGQELHRKLGVLEEEVRAARRA 1068

Query:   111 LTEKIAKMEAELKTAEP-VKLEFQKSKTEAQNLVVAREELIAKVHQLT---QDLQ----- 161
               E   + +A L+  E  V+L+ ++ +TE + L+V   +L A +  L    ++LQ     
Sbjct:  1069 QEETRGQQQALLRDHEALVQLQ-RRQETELEGLLVRHRDLKANMRALELAHRELQGRHEQ 1127

Query:   162 ----RAHTDVQQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMAT 217
                 RA+ + Q++ ALL+E E L Q+ H  RG  E  ++  N+H E  Q++         
Sbjct:  1128 LQAQRANVEAQEV-ALLAERERLMQDGHRQRGLEEELRRLQNEH-ERAQMLLAEVSRERG 1185

Query:   218 EVEKLRAEL 226
             E++  R EL
Sbjct:  1186 ELQGERGEL 1194


>ZFIN|ZDB-GENE-050302-9
            symbol:col2a1b "collagen type II, alpha-1b"
            species:7955 "Danio rerio" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0033333 "fin development" evidence=IMP]
            [GO:0033334 "fin morphogenesis" evidence=IMP] [GO:0005581
            "collagen" evidence=IEA] EMBL:HF563615 EMBL:HF563616 EMBL:HF563617
            Uniprot:L0S5L0
        Length = 1493

 Score = 134 (52.2 bits), Expect = 5.5e-05, P = 5.5e-05
 Identities = 91/310 (29%), Positives = 115/310 (37%)

Query:   226 LMNAPNVD-RRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVV 284
             L   P  D    A G  G A    E    G+P G + ++   G+P   GPP      G  
Sbjct:   634 LRGLPGKDGETGAAGPPGPAGSAGERGEQGQP-GPSGFQ---GLPGPPGPPGEGGKPGDQ 689

Query:   285 GAGPNTSTSAYAAT---QSGTPMRAAYDIPRG-PGYEASKG-PGYDASKAPSYDP--TKG 337
             G  P  +  A A     + G P       P+G  G     G PG D  K     P  T G
Sbjct:   690 GV-PGEAGGAGATGPRGERGFPGERGGAGPQGLQGPRGLPGTPGTDGPKG-GVGPAGTAG 747

Query:   338 PSYDPA-KG-PGYDPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL----GYDM 390
                 P  +G PG   T G PG    +G N D  +GP       P  D  RGL    G   
Sbjct:   748 AQGPPGLQGMPGERGTSGNPGPKGDRGDNGD--KGPE----GAPGKDGSRGLTGPIGPTG 801

Query:   391 QRGPNYDM-QRGP----GYE-TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGY 444
               GPN +  + GP    G   T+ VPG   + GP   A  A        PG   ++G+G 
Sbjct:   802 PAGPNGEKGESGPAGPSGVAGTRGVPGDRGETGPPGPAGFAGPPGADGQPGVKGEQGEGG 861

Query:   445 DMRRAPSYDPSRGTGFDG--APRGAA-PHG----QVPPPLNNVP--YGSATPPARSGSGQ 495
                 A +  P   +G  G   P G + P G    Q PP     P   G   PP  +G+  
Sbjct:   862 QKGDAGAPGPQGPSGAPGPQGPTGVSGPKGARGAQGPPGATGFPGAAGRVGPPGPNGNPG 921

Query:   496 PRG--GNPAR 503
             P G  G P +
Sbjct:   922 PAGPAGPPGK 931

 Score = 133 (51.9 bits), Expect = 7.1e-05, P = 7.1e-05
 Identities = 81/284 (28%), Positives = 97/284 (34%)

Query:   230 PNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP 288
             P    R       GA GN        P G        G P   G P +   AG  GA GP
Sbjct:   335 PGERGRPGPSGASGARGNDGLPGGAGPPGPVGTAGSPGFP---GSPGAKGEAGPTGARGP 391

Query:   289 NTSTSAYAATQSGTPMRAAYDIPRG-PGYEASKG-PGYDASK-APSYDPTKG-PSYDPAK 344
               +       +SG P  +    P G  G   S G PG   S  AP      G P   P  
Sbjct:   392 EGAQGPRG--ESGVPGASG---PSGVSGNPGSDGMPGAKGSVGAPGIGGAPGFPG--PRG 444

Query:   345 GPGYDPTKGP-GYDAQKGSN----YDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ 399
              PG     GP G   Q G +    +  + GP  +I            G + +RGP  +  
Sbjct:   445 PPGPQGATGPLGPKGQSGDSGLAGFKGEAGPKGEIGNAGLQGAPGPAGEEGKRGPRGEPG 504

Query:   400 RG--PGYETQR-VPGYDVQRG-PVYEAQRAPSYIP-QRGP-GYDLQRGQGYDMRRAPSYD 453
                 PG   +R  PG    RG P  +    P   P +RGP G    +G G D  R     
Sbjct:   505 AAGPPGPTGERGTPG---NRGFPGQDGLAGPKGAPGERGPAGVSGPKGAGGDPGRPGEPG 561

Query:   454 PSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSG-SGQP 496
                  G  G P  A P G+V P       G   PP   G  GQP
Sbjct:   562 LPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGVRGQP 605

 Score = 124 (48.7 bits), Expect = 0.00067, P = 0.00067
 Identities = 78/259 (30%), Positives = 90/259 (34%)

Query:   258 GQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAAT-QSGTPMRAAYDIPRGPG 315
             G+   +   G P   GP  +    G  G +GP  +  A      +G P  A    P GP 
Sbjct:   858 GEGGQKGDAGAPGPQGPSGAPGPQGPTGVSGPKGARGAQGPPGATGFPGAAGRVGPPGPN 917

Query:   316 YEASKGPGYDASKAPSYDPTKGPSYD--PAKGPGYDPTKGP-GYDAQKGS-NYDAQRGPN 371
                + GP   A   P  D  KG   D  P   PG    +G  G   +KG    D   GP 
Sbjct:   918 --GNPGPAGPAGP-PGKDGPKGVRGDGGPPGRPGDAGLRGSAGPAGEKGDPGEDGPHGP- 973

Query:   372 YDIHRGPS-YDPQRGL-GYDMQRGPN-YDMQRGPGYET--QRVPGYDVQRGPVYEAQRAP 426
              D   GP     QRG+ G   QRG   +    GP  E   Q  PG    RGP      AP
Sbjct:   974 -DGPAGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPGGPGDRGPPGPVG-AP 1031

Query:   427 SYIPQRG-PGYDLQRGQGYDMRRAPS--YDPSRG----TGFDGAPRGAAPHGQVPPPLNN 479
                   G PG +   G      R  S      RG     G  GAP G    G V P    
Sbjct:  1032 GLTGAAGEPGREGNPGSDGPPGRDGSAGIKGDRGDTGPAGAPGAPGGPGAPGPVGPTGKQ 1091

Query:   480 VPYGSATPPARSGSGQPRG 498
                G A P   SG   P G
Sbjct:  1092 GDRGEAGPHGPSGPPGPAG 1110


>UNIPROTKB|F1NRH2
            symbol:LOC100858979 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0005581 "collagen" evidence=IEA] [GO:0005938
            "cell cortex" evidence=IEA] InterPro:IPR008983 GO:GO:0005938
            GO:GO:0005581 Gene3D:2.60.120.40 InterPro:IPR001073
            InterPro:IPR008160 Pfam:PF00386 Pfam:PF01391 PRINTS:PR00007
            SMART:SM00110 SUPFAM:SSF49842 PROSITE:PS50871
            GeneTree:ENSGT00700000104270 OMA:IKGPPPN EMBL:AC147437
            IPI:IPI01017314 RefSeq:XP_003641055.1 Ensembl:ENSGALT00000024133
            GeneID:100858979 KEGG:gga:100858979 Uniprot:F1NRH2
        Length = 674

 Score = 130 (50.8 bits), Expect = 5.7e-05, P = 5.7e-05
 Identities = 87/281 (30%), Positives = 106/281 (37%)

Query:   238 DGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTSAYA 296
             D    GA G +       P G+   E G G P   GPP  A   G  G  GP        
Sbjct:   229 DRGLPGARGEAGIPGPQGPPGEPG-EVGIGKPGPMGPPGPAGIPGAKGLPGP-------- 279

Query:   297 ATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAK-GP-GYDPTKG 353
             A   G+P    +  P  PG +  +GP G      P  D  +GP+  P + GP G     G
Sbjct:   280 AGLPGSPGLPGFGKPGLPGMKGHRGPEGPPGFPGPKGD--QGPAGVPGEPGPAGPQGNMG 337

Query:   354 P-GYDAQKGSNYDAQRGPNYDIHR-GPSYDP----QRGL-GYDMQRGPNYDMQRG-PGYE 405
             P G     G N     GP  D+   GP+  P    +RGL G D +  P Y  ++G PG +
Sbjct:   338 PQGLKGLPGEN--GLPGPKGDMGPVGPAGFPGAKGERGLPGLDGK--PGYPGEQGLPGPK 393

Query:   406 TQRVPGYDVQRGPVYEAQRAPSYIPQR-GP-GYDLQRG-QGYDMRRAPSYDPS-RGT-GF 460
                 PG   Q+G    A   P  +P   GP G     G  G    R PS  P  RG  G 
Sbjct:   394 GH--PGLPGQKGDTGHA--GPPGLPGPVGPQGVKGVPGINGEPGPRGPSGIPGIRGPIGP 449

Query:   461 DGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNP 501
              G P      G+   P    P G AT   R   G P    P
Sbjct:   450 PGMPGAPGAKGEAGAPGLPGPAGIATKGLRGPMGPPGPPGP 490


>UNIPROTKB|H7BZW9
            symbol:SLMAP "Sarcolemmal membrane-associated protein"
            species:9606 "Homo sapiens" [GO:0006457 "protein folding"
            evidence=IEA] [GO:0016272 "prefoldin complex" evidence=IEA]
            [GO:0051082 "unfolded protein binding" evidence=IEA]
            InterPro:IPR002777 Pfam:PF01920 GO:GO:0006457 GO:GO:0016272
            EMBL:AC114480 HGNC:HGNC:16643 ChiTaRS:SLMAP EMBL:AC099777
            ProteinModelPortal:H7BZW9 PRIDE:H7BZW9 Ensembl:ENST00000438794
            Uniprot:H7BZW9
        Length = 316

 Score = 125 (49.1 bits), Expect = 5.8e-05, P = 5.8e-05
 Identities = 51/196 (26%), Positives = 91/196 (46%)

Query:    49 EVMEQKIASQHVEMQKLATE-NQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQ 107
             +V++ ++   H++ + L  E +  + +T    R EL +A+ E+ +LH     + SER+  
Sbjct:    84 QVLQAQLQRLHIDTENLREEKDSEITST----RDELLSARDEILLLHQAAAKVASERDTD 139

Query:   108 MRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDV 167
             + +L E++ K+ AEL+       E++K  T  QN    R        Q  +D QR     
Sbjct:   140 IASLQEELKKVRAELERWRKAASEYEKEITSLQNSFQLR-------CQQCEDQQR----- 187

Query:   168 QQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLES-LQVMEKNYITMATEVEKLRAEL 226
             ++   L  ELE LR+E++    T  +  K  N  L S LQ  EK       +  +L ++L
Sbjct:   188 EEATRLQGELEKLRKEWNALE-TECHSLKRENVLLSSELQRQEKELHNSQKQSLELTSDL 246

Query:   227 MNAPNVDRRAADGSYG 242
              +   + R+  +   G
Sbjct:   247 -SILQMSRKELENQVG 261


>UNIPROTKB|B7Z863
            symbol:SLMAP "cDNA FLJ54742, highly similar to Mus musculus
            sarcolemma associated protein (Slmap), mRNA" species:9606 "Homo
            sapiens" [GO:0006457 "protein folding" evidence=IEA] [GO:0016272
            "prefoldin complex" evidence=IEA] [GO:0051082 "unfolded protein
            binding" evidence=IEA] InterPro:IPR002777 Pfam:PF01920
            GO:GO:0006457 GO:GO:0016272 EMBL:AC114480 UniGene:Hs.476432
            HGNC:HGNC:16643 ChiTaRS:SLMAP HOGENOM:HOG000072709 EMBL:AC099777
            EMBL:AK302934 IPI:IPI00945565 STRING:B7Z863 Ensembl:ENST00000494088
            UCSC:uc011bfc.1 HOVERGEN:HBG087998 Uniprot:B7Z863
        Length = 318

 Score = 125 (49.1 bits), Expect = 5.9e-05, P = 5.9e-05
 Identities = 51/196 (26%), Positives = 91/196 (46%)

Query:    49 EVMEQKIASQHVEMQKLATE-NQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQ 107
             +V++ ++   H++ + L  E +  + +T    R EL +A+ E+ +LH     + SER+  
Sbjct:    39 QVLQAQLQRLHIDTENLREEKDSEITST----RDELLSARDEILLLHQAAAKVASERDTD 94

Query:   108 MRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDV 167
             + +L E++ K+ AEL+       E++K  T  QN    R        Q  +D QR     
Sbjct:    95 IASLQEELKKVRAELERWRKAASEYEKEITSLQNSFQLR-------CQQCEDQQR----- 142

Query:   168 QQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLES-LQVMEKNYITMATEVEKLRAEL 226
             ++   L  ELE LR+E++    T  +  K  N  L S LQ  EK       +  +L ++L
Sbjct:   143 EEATRLQGELEKLRKEWNALE-TECHSLKRENVLLSSELQRQEKELHNSQKQSLELTSDL 201

Query:   227 MNAPNVDRRAADGSYG 242
              +   + R+  +   G
Sbjct:   202 -SILQMSRKELENQVG 216


>RGD|628797
            symbol:Prpmp5 "proline-rich protein MP5" species:10116 "Rattus
            norvegicus" [GO:0005576 "extracellular region" evidence=IEA]
            RGD:628797 GO:GO:0005576 InterPro:IPR026086 PANTHER:PTHR23203
            CTD:5542 KO:K13911 EMBL:L17318 EMBL:M11899 IPI:IPI00187926
            PIR:B48013 RefSeq:NP_742062.1 UniGene:Rn.29950 GeneID:257651
            KEGG:rno:257651 UCSC:RGD:628797 NextBio:624204
            Genevestigator:P10165 Uniprot:P10165
        Length = 295

 Score = 124 (48.7 bits), Expect = 6.4e-05, P = 6.4e-05
 Identities = 63/200 (31%), Positives = 77/200 (38%)

Query:   311 PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQR-- 368
             P   G +    PG  + + P   P  GP   P +GP   P  GP    Q GS        
Sbjct:   101 PPAAGPQRPPQPG--SPQGPP--PPGGPQQRPPQGP--PPQGGPQRPPQPGSPQGPPPPG 154

Query:   369 GPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP---GYETQRVP-GYDVQRGPVYEAQR 424
             GP     +GP   PQ G     QR P     +GP   G   QR P G   Q GP    QR
Sbjct:   155 GPQQRPPQGPP--PQGG----PQRPPQPGSPQGPPPPGGPQQRAPQGPPPQGGP----QR 204

Query:   425 APSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPP--PLNNVPY 482
              P     +GP        G   +R P   P +G G    P+  +P G  PP  P    P 
Sbjct:   205 PPQPGSPQGPP-----PPGGPQQRPPQGPPPQG-GPQRPPQPGSPQGPPPPGGPQQRPPQ 258

Query:   483 GSATPPARSGSGQP-RGGNP 501
             G   PP + G  +P + GNP
Sbjct:   259 G---PPPQGGPQRPPQPGNP 275


>ZFIN|ZDB-GENE-030131-8373
            symbol:col10a1 "collagen, type X, alpha 1"
            species:7955 "Danio rerio" [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR008983 ZFIN:ZDB-GENE-030131-8373 GO:GO:0005581
            Gene3D:2.60.120.40 InterPro:IPR001073 InterPro:IPR008160
            Pfam:PF00386 Pfam:PF01391 PRINTS:PR00007 SMART:SM00110
            SUPFAM:SSF49842 PROSITE:PS50871 GeneTree:ENSGT00700000104270
            OMA:KPGHGSP EMBL:CU306817 IPI:IPI00491103
            Ensembl:ENSDART00000091021 ArrayExpress:F1QXD5 Bgee:F1QXD5
            Uniprot:F1QXD5
        Length = 655

 Score = 129 (50.5 bits), Expect = 7.0e-05, P = 7.0e-05
 Identities = 81/269 (30%), Positives = 107/269 (39%)

Query:   256 PVGQNAYEDGYGVPQGHGPP----PSATTA-GVVGA--GPNTSTSAYAATQSGTPMRAAY 308
             P G  A +DG G+P   GPP    P+  +A G  G+  GP    +  A    G       
Sbjct:    64 PPGP-AGQDGEGLPGPQGPPGAPGPAGYSAPGKPGSPGGPGKPGATGAPGLKGDTGAPGL 122

Query:   309 DIPRG-PGYEASKGP-GYDASKAPSYDPTKGPSYDP-AKGP-GYDPTKG----PGYDAQK 360
               PRG PG   S GP G  A+  P      GP+  P A GP G    KG    PG   QK
Sbjct:   123 QGPRGMPGPSGSPGPAGISATGKP------GPAGLPGAMGPRGEQGFKGHPGIPGLPGQK 176

Query:   361 GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG-PGYETQR-VPGYDVQRGP 418
             G      +GP  +  RGP+  P    G     G     + G PG   +   PG D + GP
Sbjct:   177 GEMGVGVQGPAGE--RGPT-GPVGPSGKPGAPGVGLPGKPGAPGEAGKSGSPGRDGESGP 233

Query:   419 VY-EAQRAPSYIPQRG-PGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPP 475
             +  + Q+  +  P  G PG   + G  G      P   P   +G  GAP G   +G+  P
Sbjct:   234 MGPQGQKGQTGAPGVGIPGKPGENGAPGMPGPTGPK-GPQGASGAPGAP-GVPGYGK--P 289

Query:   476 PLNNVPYGSATPPARSGSGQPRGGNPARR 504
               N +      P +   +GQ   G P  +
Sbjct:   290 GENGLKGDRGVPGSPGTTGQK--GEPGAK 316


>UNIPROTKB|Q04118
            symbol:PRB3 "Basic salivary proline-rich protein 3"
            species:9606 "Homo sapiens" [GO:0005576 "extracellular region"
            evidence=NAS] [GO:0051636 "Gram-negative bacterial cell surface
            binding" evidence=NAS] [GO:0008150 "biological_process"
            evidence=ND] GO:GO:0005576 GO:GO:0051636 InterPro:IPR026086
            PANTHER:PTHR23203 EMBL:X07637 EMBL:X07881 EMBL:BC096209
            EMBL:BC096210 EMBL:BC096211 IPI:IPI00006699 PIR:A36298 PIR:B36298
            PIR:S10889 RefSeq:NP_006240.4 UniGene:Hs.73031 STRING:Q04118
            DMDM:229462763 PaxDb:Q04118 PRIDE:Q04118 Ensembl:ENST00000381842
            GeneID:5544 KEGG:hsa:5544 CTD:5544 GeneCards:GC12M011418
            H-InvDB:HIX0201930 HGNC:HGNC:9339 MIM:168840 neXtProt:NX_Q04118
            PharmGKB:PA33701 HOGENOM:HOG000060075 GenomeRNAi:5544 NextBio:21478
            ArrayExpress:Q04118 Bgee:Q04118 CleanEx:HS_PRB3
            Genevestigator:Q04118 GermOnline:ENSG00000197870 Uniprot:Q04118
        Length = 309

 Score = 124 (48.7 bits), Expect = 7.1e-05, P = 7.1e-05
 Identities = 79/271 (29%), Positives = 99/271 (36%)

Query:   248 SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAA 307
             S +  SG+P G+     G   PQ   PPP     G    G N S         G P R  
Sbjct:    28 SPSVISGKPEGRRP--QGGNQPQ-RTPPPPGKPEGRPPQGGNQS--------QGPPPRPG 76

Query:   308 YDIPRGP---GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNY 364
                P GP   G   S+GP     K P   P +G +   ++GP   P K  G   Q G N 
Sbjct:    77 K--PEGPPPQGGNQSQGPPPRPGK-PEGQPPQGGNQ--SQGPPPRPGKPEGPPPQ-GGNQ 130

Query:   365 DAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP----GYETQRVPGYDVQ-RGPV 419
                  P      GP   P +G        P+     GP    G ++Q  P    +  GP 
Sbjct:   131 SQGPPPRPGKPEGP---PPQGGNQSQGPPPHPGKPEGPPPQGGNQSQGPPPRPGKPEGPP 187

Query:   420 YEAQRAPSYIPQRGPGY-DLQRGQGYDMRRAPSYDPSR--GTGFDGA--PRGAAPH-G-- 471
              +        P R PG  +    QG +  + P   P +  G+   G   P+G  PH G  
Sbjct:   188 PQGGNQSQGPPPR-PGKPEGPPPQGGNQSQGPPPRPGKPEGSPSQGGNKPQGPPPHPGKP 246

Query:   472 QVPPPLN-NVPYGSATPPARSGSGQPRGGNP 501
             Q PPP   N P     PP R     P GGNP
Sbjct:   247 QGPPPQEGNKPQ-RPPPPGRPQGPPPPGGNP 276


>TAIR|locus:2204400
            symbol:AT1G76010 species:3702 "Arabidopsis thaliana"
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005576
            "extracellular region" evidence=ISM] [GO:0008150
            "biological_process" evidence=ND] [GO:0005829 "cytosol"
            evidence=IDA] InterPro:IPR002775 Pfam:PF01918 EMBL:CP002684
            GO:GO:0005829 GO:GO:0003676 EMBL:AF412102 EMBL:AY054208
            EMBL:AF428441 EMBL:AY124847 IPI:IPI00531013 RefSeq:NP_565124.1
            UniGene:At.24580 UniGene:At.67776 UniGene:At.75066 HSSP:P60849
            ProteinModelPortal:Q93VA8 SMR:Q93VA8 STRING:Q93VA8 PRIDE:Q93VA8
            EnsemblPlants:AT1G76010.1 GeneID:843932 KEGG:ath:AT1G76010
            TAIR:At1g76010 HOGENOM:HOG000240806 InParanoid:Q93VA8 OMA:YDGPPQG
            PhylomeDB:Q93VA8 ProtClustDB:CLSN2917456 Genevestigator:Q93VA8
            Uniprot:Q93VA8
        Length = 350

 Score = 125 (49.1 bits), Expect = 7.1e-05, P = 7.1e-05
 Identities = 70/207 (33%), Positives = 88/207 (42%)

Query:   255 RPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQ---SGTPMRAAYDIP 311
             +P+G   YE   G P G G           G G     +AY   +    G     +Y   
Sbjct:   134 KPMGDIDYEGREGSPGGRGRGRGRGRGR--GRGRGGRGNAYVNVEHEDGGWEREQSYGRG 191

Query:   312 RGPGY-EASKGPGYDASKAP--SYDPTK--GPSYD-PAKGPGYDPTKGPGYDA--QKGSN 363
             RG G   +S+G G      P   YD  +  G  YD P +  GYD  +G GYDA  Q    
Sbjct:   192 RGRGRGRSSRGRGRGGYNGPPNEYDAPQDGGYGYDAPHEHRGYDD-RG-GYDAPPQGRGG 249

Query:   364 YDAQRGPN-YDIHRGP-SYD--PQ-RGLGYDMQRGPNYDMQRGPGYE--TQRVPGYDVQR 416
             YD  +G   YD  +G   YD  PQ RG GYD   GP+    RG GY+  +Q   GYD   
Sbjct:   250 YDGPQGRGGYDGPQGRRGYDGPPQGRG-GYD---GPSQG--RG-GYDGPSQGRGGYD--- 299

Query:   417 GPVYEAQRAPSYIPQRGPGYDLQRGQG 443
             GP   +Q    Y   +G G    RG+G
Sbjct:   300 GP---SQGRGGYDGPQGRGRGRGRGRG 323


>UNIPROTKB|P08125
            symbol:COL10A1 "Collagen alpha-1(X) chain" species:9031
            "Gallus gallus" [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR008983 HOGENOM:HOG000085653 HOVERGEN:HBG108220
            GO:GO:0005581 Gene3D:2.60.120.40 InterPro:IPR001073
            InterPro:IPR008160 Pfam:PF00386 Pfam:PF01391 PRINTS:PR00007
            SMART:SM00110 SUPFAM:SSF49842 PROSITE:PS50871 eggNOG:NOG114228
            OrthoDB:EOG4FFD29 EMBL:M13496 EMBL:J04194 IPI:IPI00600819
            PIR:S23297 ProteinModelPortal:P08125 SMR:P08125 STRING:P08125
            InParanoid:P08125 Reactome:REACT_132934 PMAP-CutDB:P08125
            Uniprot:P08125
        Length = 674

 Score = 129 (50.5 bits), Expect = 7.3e-05, P = 7.3e-05
 Identities = 91/291 (31%), Positives = 115/291 (39%)

Query:   238 DGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTSAYA 296
             D    GA G +       P G+   E G G P   GPP  A   G  G  GP        
Sbjct:   229 DRGLPGARGEAGIPGPQGPPGEPG-EVGIGKPGPMGPPGPAGIPGAKGLPGP-------- 279

Query:   297 ATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAK-GP-GYDPTKG 353
             A   G+P    +  P  PG +  +GP G      P  D  +GP+  P + GP G     G
Sbjct:   280 AGLPGSPGLPGFGKPGLPGMKGHRGPEGPPGFPGPKGD--QGPAGVPGELGPAGPQGNMG 337

Query:   354 P-GYDAQKGSNYDAQRGPNYDIHR-GPSYDP----QRGL-GYDMQRGPNYDMQRG-PGYE 405
             P G     G N     GP  D+   GP+  P    +RGL G D +  P Y  ++G PG +
Sbjct:   338 PQGLKGLPGEN--GLPGPKGDMGPVGPAGFPGAKGERGLPGLDGK--PGYPGEQGLPGPK 393

Query:   406 TQRVPGYDVQRGPVYEAQRA--PSYI-PQ--RG-PGYDLQRGQGYDMRRAPSYDPS-RGT 458
                 PG   Q+G    A     P  + PQ  +G PG + + G      R PS  P  RG 
Sbjct:   394 GH--PGLPGQKGDTGHAGHPGLPGPVGPQGVKGVPGINGEPGP-----RGPSGIPGVRGP 446

Query:   459 ----GFDGAP--RGAAPHGQVPPPLNNV------PYGSATPPARSG-SGQP 496
                 G  GAP  +G A    +P P   V      P G   PP   G SG+P
Sbjct:   447 IGPPGMPGAPGAKGEAGAPGLPGPAGIVTKGLRGPMGPLGPPGPKGNSGEP 497


>UNIPROTKB|F1RZK4
            symbol:COL10A1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005938 "cell cortex" evidence=IEA] [GO:0005581
            "collagen" evidence=IEA] InterPro:IPR008983 GO:GO:0005938
            GO:GO:0005581 Gene3D:2.60.120.40 InterPro:IPR001073
            InterPro:IPR008160 Pfam:PF00386 Pfam:PF01391 PRINTS:PR00007
            SMART:SM00110 SUPFAM:SSF49842 PROSITE:PS50871
            GeneTree:ENSGT00700000104270 OMA:IKGPPPN EMBL:CU062641
            Ensembl:ENSSSCT00000004901 Uniprot:F1RZK4
        Length = 675

 Score = 129 (50.5 bits), Expect = 7.3e-05, P = 7.3e-05
 Identities = 88/295 (29%), Positives = 112/295 (37%)

Query:   235 RAADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG--AGPNT 290
             + A G  G  G  G +     GRP G        G P G   PP     G  G    P  
Sbjct:   177 KGAPGVPGINGQKGETGYGAPGRP-GDRGLPGPQG-PMGPPGPPGVGKRGENGFPGQPGI 234

Query:   291 STSAYAATQSGTPMRAAYDIPRGP-GYEASKG---PGYD-ASKAPSYDPTKG----PSY- 340
                     +SG P  A    P+GP G +  +G   PG   A+  P    TKG    P   
Sbjct:   235 KGDRGFPGESG-P--AGPPGPQGPPGEQGREGIGKPGAPGAAGQPGLPGTKGHPGAPGMA 291

Query:   341 DPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDP-QRGL-GYDMQRGPNYDM 398
              P   PG+     PG   Q+G        P     +GP+  P + GL G    RGP    
Sbjct:   292 GPPGAPGFGKPGLPGLKGQRGP-IGLPGAPGAKGEQGPAGHPGEPGLTGPPGSRGP---- 346

Query:   399 QRGPGYETQRVPGYDVQRGPVYEAQRA-PSYIP----QRGP-GYDLQRGQ-GYDMRRAPS 451
              +GP    + +PG +   GP  E   A P+  P    +RGP G D + G  G      P 
Sbjct:   347 -QGP----KGIPGNNGVPGPKGEIGLAGPAGFPGAKGERGPSGLDGKPGYPGEPGLNGPK 401

Query:   452 YDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNPARR 504
              +P    G  G P    P G +P P+   P G+   P  +G G PRG  G P  R
Sbjct:   402 GNPGL-PGPKGDPGIGGPPG-LPGPVG--PAGAKGVPGHNGEGGPRGAPGIPGTR 452


>UNIPROTKB|B7Z964
            symbol:SLMAP "cDNA, FLJ79335, highly similar to Homo
            sapiens sarcolemma associated protein (SLMAP), mRNA" species:9606
            "Homo sapiens" [GO:0006457 "protein folding" evidence=IEA]
            [GO:0016272 "prefoldin complex" evidence=IEA] [GO:0051082 "unfolded
            protein binding" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] InterPro:IPR002777 Pfam:PF01920 GO:GO:0016021
            GO:GO:0006457 GO:GO:0016272 EMBL:AC114480 UniGene:Hs.476432
            HGNC:HGNC:16643 ChiTaRS:SLMAP HOGENOM:HOG000072709 EMBL:AC099777
            HOVERGEN:HBG087998 EMBL:AK304493 EMBL:AK316436 IPI:IPI00946123
            STRING:B7Z964 Ensembl:ENST00000495364 UCSC:uc011bfa.1
            Uniprot:B7Z964
        Length = 362

 Score = 125 (49.1 bits), Expect = 7.6e-05, P = 7.6e-05
 Identities = 51/196 (26%), Positives = 91/196 (46%)

Query:    49 EVMEQKIASQHVEMQKLATE-NQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQ 107
             +V++ ++   H++ + L  E +  + +T    R EL +A+ E+ +LH     + SER+  
Sbjct:    80 QVLQAQLQRLHIDTENLREEKDSEITST----RDELLSARDEILLLHQAAAKVASERDTD 135

Query:   108 MRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDV 167
             + +L E++ K+ AEL+       E++K  T  QN    R        Q  +D QR     
Sbjct:   136 IASLQEELKKVRAELERWRKAASEYEKEITSLQNSFQLR-------CQQCEDQQR----- 183

Query:   168 QQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLES-LQVMEKNYITMATEVEKLRAEL 226
             ++   L  ELE LR+E++    T  +  K  N  L S LQ  EK       +  +L ++L
Sbjct:   184 EEATRLQGELEKLRKEWNALE-TECHSLKRENVLLSSELQRQEKELHNSQKQSLELTSDL 242

Query:   227 MNAPNVDRRAADGSYG 242
              +   + R+  +   G
Sbjct:   243 -SILQMSRKELENQVG 257


>UNIPROTKB|G8ENL4
            symbol:FUS "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0000166
            "nucleotide binding" evidence=IEA] InterPro:IPR000504
            InterPro:IPR001876 InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641
            PROSITE:PS01358 PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360
            SMART:SM00547 GO:GO:0005634 GO:GO:0005737 GO:GO:0000166
            GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
            GeneTree:ENSGT00530000063105 EMBL:CU464163 EMBL:JF940526
            Ensembl:ENSSSCT00000036326 Uniprot:G8ENL4
        Length = 517

 Score = 127 (49.8 bits), Expect = 8.3e-05, P = 8.3e-05
 Identities = 68/240 (28%), Positives = 93/240 (38%)

Query:   239 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAAT 298
             G+Y    G   ++ S +P GQ +Y  GYG          ++     G   NT   A +A 
Sbjct:    15 GAYPTQPGQGYSQQSNQPYGQQSYS-GYGQSADTSGYGQSSYGSSYGQTQNTGYGAQSAP 73

Query:   299 QSGTPMRAAYDIPRGP--GY-EASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPG 355
             Q G      Y   +G    Y + S  PGY    APS   T G     ++  GY   +  G
Sbjct:    74 Q-GYGSTGGYGSGQGSQSSYGQQSSYPGYGQQPAPS--STSGSYGTSSQSSGYGQPQSGG 130

Query:   356 YDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET--QRVPGYD 413
             Y  Q G  Y  Q+  +Y   +  SY+P +G G   Q   +     G G  +  Q  P   
Sbjct:   131 YGQQSG--YGGQQ-QSYGQQQ--SYNPPQGYGQQNQYNSSSGGGGGGGGGSYGQDQPSMS 185

Query:   414 VQRGPVYEAQ-RAPSYI--PQ----RGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 466
                G  Y  Q ++  Y    Q    RG G     G GY+ R +  Y+P RG G     RG
Sbjct:   186 GGGGGGYGNQDQSGGYGGGQQDRGGRGRGGGSGGGGGYN-RSSGGYEP-RGRGGGRGGRG 243


>UNIPROTKB|E2RA07
            symbol:EWSR1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0005622 "intracellular" evidence=IEA] [GO:0003676 "nucleic acid
            binding" evidence=IEA] [GO:0000166 "nucleotide binding"
            evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358
            PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360 SMART:SM00547
            GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
            GO:GO:0005622 GeneTree:ENSGT00530000063105 OMA:EGTSTGY
            EMBL:AAEX03014786 EMBL:AAEX03014787 Ensembl:ENSCAFT00000019384
            Uniprot:E2RA07
        Length = 671

 Score = 117 (46.2 bits), Expect = 8.8e-05, Sum P(2) = 8.8e-05
 Identities = 63/238 (26%), Positives = 87/238 (36%)

Query:   239 GSYGGATGNSENETSGRPV-GQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 291
             G+YG  T  S  +       GQ AY   YG P  G+  P  P A +  V G G    +T+
Sbjct:    43 GTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPAGYTTPTAPQAYSQPVQGYGTGAYDTT 102

Query:   292 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG-YD 349
             T+    TQ+    ++AY   P  P Y   + P   A   P        +  P    G Y+
Sbjct:   103 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPARPQDGNKPAETSQPQSSTGGYN 160

Query:   350 -PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 406
              P+ G G   Q   +Y    G  P   +   PSY P R   ++      Y   R   Y +
Sbjct:   161 QPSLGYG---QSNYSYPQVPGSYPMQPVTAPPSYPPTR---FNSSSLKLYHYSRS--YSS 212

Query:   407 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 464
              +   YD            PS   Q+   Y  Q    Y  +   SY P  G+ +  AP
Sbjct:   213 TQPTSYDQSSYSQQNTYGQPSSYGQQS-SYGQQ--SSYGQQPPTSYPPQTGS-YSQAP 266

 Score = 57 (25.1 bits), Expect = 8.8e-05, Sum P(2) = 8.8e-05
 Identities = 19/46 (41%), Positives = 21/46 (45%)

Query:   465 RGAAP--HGQ-VPPPLNNVPYGSATP--PA--RSGSGQPRGGNPAR 503
             RG  P   G+ +PPPL   P G   P  P     G G  RGG P R
Sbjct:   470 RGGMPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPR 515


>RGD|71029
            symbol:Col3a1 "collagen, type III, alpha 1" species:10116 "Rattus
           norvegicus" [GO:0001501 "skeletal system development" evidence=IEP]
           [GO:0001568 "blood vessel development" evidence=IEA;ISO] [GO:0005201
           "extracellular matrix structural constituent" evidence=IEA]
           [GO:0005581 "collagen" evidence=ISO] [GO:0005586 "collagen type III"
           evidence=ISO;TAS] [GO:0005615 "extracellular space" evidence=IEA]
           [GO:0007160 "cell-matrix adhesion" evidence=IEA] [GO:0007179
           "transforming growth factor beta receptor signaling pathway"
           evidence=IEA] [GO:0007229 "integrin-mediated signaling pathway"
           evidence=IEA] [GO:0007507 "heart development" evidence=IEA]
           [GO:0009314 "response to radiation" evidence=IEA] [GO:0018149
           "peptide cross-linking" evidence=IEA] [GO:0030199 "collagen fibril
           organization" evidence=IEA;ISO] [GO:0031012 "extracellular matrix"
           evidence=ISO] [GO:0032964 "collagen biosynthetic process"
           evidence=IEA] [GO:0034097 "response to cytokine stimulus"
           evidence=IEA] [GO:0042060 "wound healing" evidence=IEA] [GO:0043206
           "extracellular fibril organization" evidence=IEA] [GO:0043588 "skin
           development" evidence=IEA] [GO:0046332 "SMAD binding"
           evidence=IEA;ISO] [GO:0046872 "metal ion binding" evidence=IEA]
           [GO:0048407 "platelet-derived growth factor binding" evidence=IEA]
           [GO:0048565 "digestive tract development" evidence=IEA;ISO]
           [GO:0050777 "negative regulation of immune response" evidence=IEA]
           [GO:0071230 "cellular response to amino acid stimulus"
           evidence=IEA;ISO] InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093
           Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184
           PROSITE:PS51461 SMART:SM00038 SMART:SM00214 RGD:71029 GO:GO:0043588
           GO:GO:0005615 GO:GO:0007507 GO:GO:0046872 GO:GO:0034097
           GO:GO:0030199 GO:GO:0001501 GO:GO:0007179 GO:GO:0007229
           GO:GO:0007160 InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793
           GO:GO:0042060 GO:GO:0001568 GO:GO:0048565 GO:GO:0050777
           GO:GO:0009314 GO:GO:0018149 GO:GO:0032964 GO:GO:0071230
           GO:GO:0043206 GO:GO:0005201 GeneTree:ENSGT00660000095287
           HOGENOM:HOG000085654 HOVERGEN:HBG004933 KO:K06236 OrthoDB:EOG4FTW1C
           CTD:1281 OMA:EGSPGHP GO:GO:0005586 EMBL:BC087039 EMBL:X70369
           EMBL:AJ005395 EMBL:M21354 IPI:IPI00366944 PIR:S41067
           RefSeq:NP_114474.1 UniGene:Rn.3247 ProteinModelPortal:P13941
           IntAct:P13941 STRING:P13941 PRIDE:P13941 Ensembl:ENSRNOT00000004956
           GeneID:84032 KEGG:rno:84032 UCSC:RGD:71029 InParanoid:P13941
           NextBio:616623 Genevestigator:P13941 GermOnline:ENSRNOG00000003357
           Uniprot:P13941
        Length = 1463

 Score = 132 (51.5 bits), Expect = 8.9e-05, P = 8.9e-05
 Identities = 76/261 (29%), Positives = 102/261 (39%)

Query:   258 GQNAYEDGYGVPQGHGPPPSATTAGVVGA----GPNTSTSAYAAT-QSGTPMRAAYDIPR 312
             G +      G P   GPP +A   G  GA    GP  S  +  +  Q G P    +   +
Sbjct:   320 GNDGARGSDGQPGPPGPPGTAGFPGSPGAKGEVGPAGSPGSNGSPGQRGEPGPQGHAGAQ 379

Query:   313 GP-GYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGP 370
             GP G   + G PG      P+  P   P    A+GP   P    G   Q+G +   + G 
Sbjct:   380 GPPGPPGNNGSPGGKGEMGPAGIPG-APGLLGARGPP-GPAGANGAPGQRGPS--GEPGK 435

Query:   371 NYDIHRGPSYDPQRG-LGYDMQRGPN-YDMQRG-PGYE-TQRVPGYDVQRG-PVYEAQRA 425
             N      P    +RG  G     GP   D + G PG      VPG   +RG P +     
Sbjct:   436 N-GAKGEPGARGERGEAGSPGIPGPKGEDGKDGSPGEPGANGVPGNPGERGAPGFRGPAG 494

Query:   426 PSYIP-QRGPGYDLQRGQGYDMRRAPSYDPSR-GT-------GFDGAPRGAAPHGQVPPP 476
             P+  P ++GP  + + G G    R  + +P R GT       G  G+P G    G+  PP
Sbjct:   495 PNGAPGEKGPAGE-RGGPGPAGPRGVAGEPGRDGTPGGPGIRGMPGSPGGPGNDGKPGPP 553

Query:   477 LNNVPYGSATPPARSGS-GQP 496
              +    G   PP  SG  GQP
Sbjct:   554 GSQGESGRPGPPGPSGPRGQP 574

 Score = 129 (50.5 bits), Expect = 0.00019, P = 0.00019
 Identities = 83/290 (28%), Positives = 106/290 (36%)

Query:   233 DRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP--- 288
             D ++  G  GG  G +       P G + +    G P   GPP     AG  G  GP   
Sbjct:   160 DVKSGVGGMGGYPGPAGPPGPPGPPGSSGHPGSPGSPGYQGPPGEPGQAGPAGPPGPPGA 219

Query:   289 -NTSTSAYAATQSGTPMRAAY-DIPRGPGYEASKG-PGYDASKAP-SYDPTKGPSYDPAK 344
                S  A    +SG P R     +P  PG +   G PG+   K    +D   G   +   
Sbjct:   220 IGPSGPAGKDGESGRPGRPGERGLPGPPGIKGPAGIPGFPGMKGHRGFDGRNGEKGETG- 278

Query:   345 GPGYDPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGP 402
              PG     G PG +   G      RG   +  R P      G  G D  RG   D Q GP
Sbjct:   279 APGLKGENGLPGDNGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQPGP 333

Query:   403 -GYE-TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGF 460
              G   T   PG    +G V  A    S      PG   QRG+      A +  P    G 
Sbjct:   334 PGPPGTAGFPGSPGAKGEVGPAGSPGS---NGSPG---QRGEPGPQGHAGAQGPPGPPGN 387

Query:   461 DGAPRGAAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RG--GNPAR 503
             +G+P G    G  P  +   P   G+  PP  +G+ G P  RG  G P +
Sbjct:   388 NGSPGGKGEMG--PAGIPGAPGLLGARGPPGPAGANGAPGQRGPSGEPGK 435


>UNIPROTKB|Q92734
            symbol:TFG "Protein TFG" species:9606 "Homo sapiens"
            [GO:0008219 "cell death" evidence=IEA] [GO:0042802 "identical
            protein binding" evidence=IEA] [GO:0005737 "cytoplasm"
            evidence=NAS] [GO:0043123 "positive regulation of I-kappaB
            kinase/NF-kappaB cascade" evidence=IMP] [GO:0004871 "signal
            transducer activity" evidence=IMP] [GO:0005515 "protein binding"
            evidence=IPI] [GO:0007165 "signal transduction" evidence=IMP]
            InterPro:IPR000270 Pfam:PF00564 SMART:SM00666 GO:GO:0005737
            GO:GO:0008219 GO:GO:0043123 EMBL:CH471052 GO:GO:0004871 MIM:188550
            Orphanet:146 EMBL:X85960 EMBL:Y07968 EMBL:AK093456 EMBL:BT007428
            EMBL:CR456781 EMBL:AC068763 EMBL:BC001483 EMBL:BC009241
            EMBL:BC023599 IPI:IPI00294619 RefSeq:NP_001007566.1
            RefSeq:NP_001182407.1 RefSeq:NP_001182408.1 RefSeq:NP_006061.2
            UniGene:Hs.518123 ProteinModelPortal:Q92734 SMR:Q92734
            IntAct:Q92734 MINT:MINT-1156489 STRING:Q92734 PhosphoSite:Q92734
            DMDM:223634676 PaxDb:Q92734 PRIDE:Q92734 DNASU:10342
            Ensembl:ENST00000240851 Ensembl:ENST00000490574 GeneID:10342
            KEGG:hsa:10342 UCSC:uc003due.3 CTD:10342 GeneCards:GC03P100428
            H-InvDB:HIX0003505 HGNC:HGNC:11758 HPA:HPA019473 MIM:602498
            MIM:604484 neXtProt:NX_Q92734 PharmGKB:PA36473 eggNOG:NOG85275
            HOGENOM:HOG000132915 HOVERGEN:HBG009087 InParanoid:Q92734 KO:K09292
            OMA:YTTQTSQ OrthoDB:EOG40K80D PhylomeDB:Q92734 ChiTaRS:TFG
            GenomeRNAi:10342 NextBio:39217 ArrayExpress:Q92734 Bgee:Q92734
            CleanEx:HS_TFG Genevestigator:Q92734 GermOnline:ENSG00000114354
            Uniprot:Q92734
        Length = 400

 Score = 125 (49.1 bits), Expect = 9.1e-05, P = 9.1e-05
 Identities = 84/306 (27%), Positives = 109/306 (35%)

Query:   216 ATEVEKLRAELMNAPNVDRRAAD-----GSYGGATGNSENET-SGRPVGQNAYEDGYGVP 269
             +++V+ LR EL+   N   R  D     G  G +T   EN+T  GR     +   G    
Sbjct:    99 SSQVKYLRRELIELRNKVNRLLDSLEPPGEPGPSTNIPENDTVDGREEKSASDSSGKQST 158

Query:   270 QGHGPPPSATTAGVVGAGPNTST-SAYAATQ---SGTPMRAAYDIPRGPGYEASKGPGYD 325
             Q      SA          N +  SA+  T    SG P   A D    P   AS      
Sbjct:   159 QVMAASMSAFDPLKNQDEINKNVMSAFGLTDDQVSGPPSAPAEDRSGTPDSIASSS---S 215

Query:   326 ASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDA-QKGSNYDAQRGPNYDIHRGPSYDPQR 384
             A+  P   P + P Y  A+       +G  Y   Q+ + Y AQ+ P     +   Y  Q 
Sbjct:   216 AAHPPGVQPQQ-PPYTGAQTQA-GQIEGQMYQQYQQQAGYGAQQ-PQAPPQQPQQYGIQY 272

Query:   385 GLGYDMQRGPNYDMQ-RGPGYE-TQRVPGYDVQRGPVY-EAQRAPSYIPQRGPG--YDLQ 439
                Y  Q GP    Q +G G + T + P       P    AQ    Y     P   Y  Q
Sbjct:   273 SASYSQQTGPQQPQQFQGYGQQPTSQAPAPAFSGQPQQLPAQPPQQYQASNYPAQTYTAQ 332

Query:   440 RGQGYDMRRAPSYDPSRGTGFDGA--PR-G--AAPHGQV-PPPLNNVPYGSATPPARSGS 493
               Q  +   AP+  P       GA  PR G  + P   + PPP    PY    PP   G 
Sbjct:   333 TSQPTNYTVAPASQPGMAPSQPGAYQPRPGFTSLPGSTMTPPPSGPNPYARNRPPFGQGY 392

Query:   494 GQPRGG 499
              QP  G
Sbjct:   393 TQPGPG 398


>UNIPROTKB|H7C3M8
            symbol:SLMAP "Sarcolemmal membrane-associated protein"
            species:9606 "Homo sapiens" [GO:0006457 "protein folding"
            evidence=IEA] [GO:0016272 "prefoldin complex" evidence=IEA]
            [GO:0051082 "unfolded protein binding" evidence=IEA]
            InterPro:IPR002777 Pfam:PF01920 GO:GO:0006457 GO:GO:0016272
            EMBL:AC114480 HGNC:HGNC:16643 ChiTaRS:SLMAP EMBL:AC099777
            ProteinModelPortal:H7C3M8 PRIDE:H7C3M8 Ensembl:ENST00000417128
            Uniprot:H7C3M8
        Length = 409

 Score = 125 (49.1 bits), Expect = 9.5e-05, P = 9.5e-05
 Identities = 51/196 (26%), Positives = 91/196 (46%)

Query:    49 EVMEQKIASQHVEMQKLATE-NQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQ 107
             +V++ ++   H++ + L  E +  + +T    R EL +A+ E+ +LH     + SER+  
Sbjct:   130 QVLQAQLQRLHIDTENLREEKDSEITST----RDELLSARDEILLLHQAAAKVASERDTD 185

Query:   108 MRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDV 167
             + +L E++ K+ AEL+       E++K  T  QN    R        Q  +D QR     
Sbjct:   186 IASLQEELKKVRAELERWRKAASEYEKEITSLQNSFQLR-------CQQCEDQQR----- 233

Query:   168 QQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLES-LQVMEKNYITMATEVEKLRAEL 226
             ++   L  ELE LR+E++    T  +  K  N  L S LQ  EK       +  +L ++L
Sbjct:   234 EEATRLQGELEKLRKEWNALE-TECHSLKRENVLLSSELQRQEKELHNSQKQSLELTSDL 292

Query:   227 MNAPNVDRRAADGSYG 242
              +   + R+  +   G
Sbjct:   293 -SILQMSRKELENQVG 307


>UNIPROTKB|Q8WML4
            symbol:MUC1 "Mucin-1" species:9913 "Bos taurus" [GO:0016324
            "apical plasma membrane" evidence=IBA] [GO:0009986 "cell surface"
            evidence=IBA] [GO:0005737 "cytoplasm" evidence=IBA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] PANTHER:PTHR10006 GO:GO:0016021 GO:GO:0005634
            GO:GO:0005737 GO:GO:0009986 GO:GO:0016324 InterPro:IPR000082
            Pfam:PF01390 SMART:SM00200 PROSITE:PS50024 EMBL:AJ400824
            EMBL:AF399757 IPI:IPI00706283 RefSeq:NP_776540.1 UniGene:Bt.9561
            HSSP:Q16615 ProteinModelPortal:Q8WML4 SMR:Q8WML4 STRING:Q8WML4
            MEROPS:S71.001 Ensembl:ENSBTAT00000014051 GeneID:281333
            KEGG:bta:281333 CTD:4582 eggNOG:NOG77744
            GeneTree:ENSGT00700000104548 HOGENOM:HOG000290201
            HOVERGEN:HBG003075 InParanoid:Q8WML4 KO:K06568 OMA:PPAHGVT
            OrthoDB:EOG4NGGNM NextBio:20805343 PMAP-CutDB:Q8WML4
            ArrayExpress:Q8WML4 InterPro:IPR023217 Uniprot:Q8WML4
        Length = 580

 Score = 127 (49.8 bits), Expect = 9.8e-05, P = 9.8e-05
 Identities = 49/202 (24%), Positives = 71/202 (35%)

Query:   264 DGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG-PGYEASKGP 322
             DG   P     P  A + G  GA  +T TS+ A + + +P       P   P    +  P
Sbjct:    81 DGASTPTSSPAPSPAASPGHDGA--STPTSSPAPSPAASPGHDGASTPTSSPAPSPAASP 138

Query:   323 GYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDP 382
             G+D +  P+  P   P+  P       PT  P         +D    P       P+  P
Sbjct:   139 GHDGASTPTSSPAPSPAASPGHNGTSSPTGSPAPSPAASPGHDGASTPT----SSPAPSP 194

Query:   383 QRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQ 442
                 G++    P       P       PG+D    P   +  APS  P   PG++   G 
Sbjct:   195 AASPGHNGTSSPTGSPAPSPAAS----PGHDGASTPT--SSPAPS--PAASPGHN---GT 243

Query:   443 GYDMRRAPSYDPSRGTGFDGAP 464
                   +P+  P+   G D AP
Sbjct:   244 S-SPTGSPAPSPTASPGHDSAP 264

 Score = 121 (47.7 bits), Expect = 0.00044, P = 0.00044
 Identities = 59/236 (25%), Positives = 82/236 (34%)

Query:   276 PSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASK----GPGYDASKAPS 331
             P +TT     + P   TS    T   T    A      PG++ +      P    + +P 
Sbjct:    40 PVSTTQSSPTSSPTKETSWSTTTTLLTASSPAPSPAASPGHDGASTPTSSPAPSPAASPG 99

Query:   332 YD----PTKGPSYDPAKGPGYD----PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQ 383
             +D    PT  P+  PA  PG+D    PT  P         +D    P       P+  P 
Sbjct:   100 HDGASTPTSSPAPSPAASPGHDGASTPTSSPAPSPAASPGHDGASTPT----SSPAPSPA 155

Query:   384 RGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQG 443
                G++    P       P       PG+D    P   +  APS  P   PG++   G  
Sbjct:   156 ASPGHNGTSSPTGSPAPSPAAS----PGHDGASTPT--SSPAPS--PAASPGHN---GTS 204

Query:   444 YDMRRAPSYDPSRGTGFDGA--PRGA-APHGQVPPPLNNV--PYGSATPPARSGSG 494
                  +P+  P+   G DGA  P  + AP     P  N    P GS  P   +  G
Sbjct:   205 -SPTGSPAPSPAASPGHDGASTPTSSPAPSPAASPGHNGTSSPTGSPAPSPTASPG 259

 Score = 121 (47.7 bits), Expect = 0.00044, P = 0.00044
 Identities = 55/234 (23%), Positives = 80/234 (34%)

Query:   275 PPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDP 334
             P   T      + P +S +   +  + T +  A   P  P   AS  PG+D +  P+  P
Sbjct:    35 PRRTTPVSTTQSSPTSSPTKETSWSTTTTLLTASS-P-APSPAAS--PGHDGASTPTSSP 90

Query:   335 TKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGP 394
                P+  P       PT  P         +D    P       P+  P    G+D    P
Sbjct:    91 APSPAASPGHDGASTPTSSPAPSPAASPGHDGASTPT----SSPAPSPAASPGHDGASTP 146

Query:   395 NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDP 454
                    P       PG++    P      APS  P   PG+D           +P+  P
Sbjct:   147 TSSPAPSPAAS----PGHNGTSSPT--GSPAPS--PAASPGHDGASTPTSSPAPSPAASP 198

Query:   455 SR-GTGFD-GAPR---GAAP-HGQVPPPLNNVPYGSATPPARSGSGQPRGGNPA 502
                GT    G+P     A+P H     P ++     A  P  +G+  P G +PA
Sbjct:   199 GHNGTSSPTGSPAPSPAASPGHDGASTPTSSPAPSPAASPGHNGTSSPTG-SPA 251


>UNIPROTKB|H7BZK0
            symbol:SLMAP "Sarcolemmal membrane-associated protein"
            species:9606 "Homo sapiens" [GO:0006457 "protein folding"
            evidence=IEA] [GO:0016272 "prefoldin complex" evidence=IEA]
            [GO:0051082 "unfolded protein binding" evidence=IEA]
            InterPro:IPR002777 Pfam:PF01920 GO:GO:0006457 GO:GO:0016272
            EMBL:AC114480 HGNC:HGNC:16643 ChiTaRS:SLMAP EMBL:AC099777
            ProteinModelPortal:H7BZK0 PRIDE:H7BZK0 Ensembl:ENST00000416658
            Uniprot:H7BZK0
        Length = 433

 Score = 125 (49.1 bits), Expect = 0.00010, P = 0.00010
 Identities = 51/196 (26%), Positives = 91/196 (46%)

Query:    49 EVMEQKIASQHVEMQKLATE-NQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQ 107
             +V++ ++   H++ + L  E +  + +T    R EL +A+ E+ +LH     + SER+  
Sbjct:   154 QVLQAQLQRLHIDTENLREEKDSEITST----RDELLSARDEILLLHQAAAKVASERDTD 209

Query:   108 MRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDV 167
             + +L E++ K+ AEL+       E++K  T  QN    R        Q  +D QR     
Sbjct:   210 IASLQEELKKVRAELERWRKAASEYEKEITSLQNSFQLR-------CQQCEDQQR----- 257

Query:   168 QQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLES-LQVMEKNYITMATEVEKLRAEL 226
             ++   L  ELE LR+E++    T  +  K  N  L S LQ  EK       +  +L ++L
Sbjct:   258 EEATRLQGELEKLRKEWNALE-TECHSLKRENVLLSSELQRQEKELHNSQKQSLELTSDL 316

Query:   227 MNAPNVDRRAADGSYG 242
              +   + R+  +   G
Sbjct:   317 -SILQMSRKELENQVG 331


>UNIPROTKB|G3N3C9
            symbol:LDB3 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0030018 "Z disc" evidence=IEA] [GO:0008092
            "cytoskeletal protein binding" evidence=IEA] [GO:0005856
            "cytoskeleton" evidence=IEA] [GO:0005080 "protein kinase C binding"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            Pfam:PF00595 Pfam:PF00412 InterPro:IPR001478 InterPro:IPR001781
            PROSITE:PS00478 PROSITE:PS50023 PROSITE:PS50106 SMART:SM00132
            SMART:SM00228 GO:GO:0030018 GO:GO:0005856 GO:GO:0046872
            GO:GO:0008270 Gene3D:2.10.110.10 SUPFAM:SSF50156 OMA:CTSQATT
            InterPro:IPR006643 SMART:SM00735 GeneTree:ENSGT00700000104411
            EMBL:DAAA02062163 Ensembl:ENSBTAT00000065403 Uniprot:G3N3C9
        Length = 730

 Score = 128 (50.1 bits), Expect = 0.00010, P = 0.00010
 Identities = 50/180 (27%), Positives = 68/180 (37%)

Query:   252 TSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIP 311
             TS  P   + Y +    P    P P   T   +   P+      A+T S +P  A Y  P
Sbjct:   379 TSPAPAA-HTYSEAPAAP---APKPRVVTTASIR--PSVYQPVPASTYSPSP-GANYS-P 430

Query:   312 RGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPN 371
               P Y  S  P Y  S  P+Y P+  P+Y P+  P Y P+  P Y+    S   A+    
Sbjct:   431 T-P-YTPSPAPAYTPSPTPAYTPSPAPTYSPSPAPAYTPSPAPSYNPTLYSGGPAESASR 488

Query:   372 YDIHRGPSYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVPGYDVQ---RGPVYEAQRAPS 427
                    S+  +   G          + RG P Y T   P   V    RG V  A+R P+
Sbjct:   489 PPWVTDDSFSQKFAPGKTTTTVSKQSLPRGAPAY-TPPPPAPQVSPLARGTVQRAERFPA 547


>UNIPROTKB|F1NFF0
            symbol:Gga.41084 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0005634 "nucleus" evidence=IEA] [GO:0006351
            "transcription, DNA-dependent" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] InterPro:IPR001965 InterPro:IPR003618
            InterPro:IPR017890 InterPro:IPR019787 Pfam:PF00628 Pfam:PF07500
            PROSITE:PS50016 PROSITE:PS51321 SMART:SM00249 SMART:SM00510
            GO:GO:0005634 GO:GO:0046872 GO:GO:0008270 GO:GO:0006351
            Gene3D:3.30.40.10 InterPro:IPR011011 InterPro:IPR013083
            SUPFAM:SSF57903 InterPro:IPR019786 PROSITE:PS01359
            Gene3D:1.10.472.30 InterPro:IPR012921 Pfam:PF07744 SUPFAM:SSF46942
            GeneTree:ENSGT00530000063844 EMBL:AADN02019222 EMBL:AADN02019223
            IPI:IPI00821338 Ensembl:ENSGALT00000039659 ArrayExpress:F1NFF0
            Uniprot:F1NFF0
        Length = 2253

 Score = 129 (50.5 bits), Expect = 0.00011, Sum P(2) = 0.00011
 Identities = 86/304 (28%), Positives = 116/304 (38%)

Query:   228 NAPNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG-A 286
             + PN  +   D     A G+ E E   R +    YE+    PQ H       +  + G  
Sbjct:  1703 STPNDSQSLQDMHQENAGGHYEPE---RGLSSLQYEEQRN-PQPHQFVEQTESTSIQGEG 1758

Query:   287 GP-------NTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKG-P 338
             GP       N    +    + G P    ++ P GP      GP +    AP +    G P
Sbjct:  1759 GPLPQHFEENRLPFSLPGQKGGPPQPPIHNAPGGP-----HGPNF-RGPAPQFSEEHGSP 1812

Query:   339 SYDPAKGPG---YDPTKGPG---YDAQKGSN-YDAQRGP--NYDIHRG--PSYDPQRGLG 387
             + D  +GP    + P KGP    + +Q GS  +   RGP  +Y + RG  PS        
Sbjct:  1813 NSDGQRGPAPGRFGPQKGPIPSLFSSQHGSPLFGDNRGPAPSYGVPRGMSPSQFEDHMEP 1872

Query:   388 YDMQR---GPNY-DMQRGPG-YETQRVPGYDVQRGPV-YEAQRAP---SYIPQRG-PGYD 437
             +  QR      Y +M R PG +E    P +   RGP  +  QR P    +  QRG P + 
Sbjct:  1873 HMEQREFSDSQYNEMIRPPGQFEGSDQPQFMGNRGPFPFGGQRRPPPAQFKGQRGGPQFG 1932

Query:   438 LQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAP-HGQVPPPLNNVPYGSATPPARSGSGQP 496
               RG        P   P     F+G  RG AP HG  P  L   P+       R GS  P
Sbjct:  1933 GPRGPAPGHFGGPR-GPHTNQ-FEGQ-RGPAPNHGPGPRGLLPQPFEE-----RRGSPPP 1984

Query:   497 RGGN 500
             R  N
Sbjct:  1985 RFAN 1988

 Score = 55 (24.4 bits), Expect = 0.00011, Sum P(2) = 0.00011
 Identities = 30/123 (24%), Positives = 50/123 (40%)

Query:    30 GMRPPMPGAFP-PFDMMPPPEVME--QKIASQHVEMQK---LATENQRLAATHGTLRQEL 83
             G  PP P   P P      P V++    I S           AT +  + ATH +  +  
Sbjct:  1288 GSPPPPPPPLPVPESSAVTPSVLKILSSIKSGSTTTAPPPVSATASPSVTATHSSSSKTA 1347

Query:    84 AAAQHELQILHGQIGGMK-SERELQMRNLTEKIAKMEAE-----LKTAEPVKLEF-QKSK 136
                +H LQ L G+    + + +E +    + + A+  AE     +   +P+  +F Q SK
Sbjct:  1348 TPLEHILQTLFGKKKSFEPAAKESETAQSSNQKAQAAAEGGMAAVPLLDPIVQQFGQMSK 1407

Query:   137 TEA 139
              +A
Sbjct:  1408 DKA 1410


>UNIPROTKB|A4FU28
            symbol:CTAGE9 "Cutaneous T-cell lymphoma-associated antigen
            9" species:9606 "Homo sapiens" [GO:0016021 "integral to membrane"
            evidence=IEA] GO:GO:0016021 HOVERGEN:HBG051216 HOGENOM:HOG000112043
            OrthoDB:EOG4WSWC5 EMBL:AC005587 EMBL:BC101322 IPI:IPI00740858
            RefSeq:NP_001139131.1 UniGene:Hs.632613 ProteinModelPortal:A4FU28
            PhosphoSite:A4FU28 PRIDE:A4FU28 Ensembl:ENST00000314099
            GeneID:643854 KEGG:hsa:643854 UCSC:uc011ece.2 CTD:643854
            GeneCards:GC06M132030 HGNC:HGNC:37275 neXtProt:NX_A4FU28
            PharmGKB:PA165617886 OMA:CEGLESS PhylomeDB:A4FU28 GenomeRNAi:643854
            NextBio:115484 Bgee:A4FU28 Uniprot:A4FU28
        Length = 777

 Score = 128 (50.1 bits), Expect = 0.00011, P = 0.00011
 Identities = 108/471 (22%), Positives = 183/471 (38%)

Query:    46 PPPEVMEQKI--ASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKSE 103
             PP   +++ I  A  +V ++ L  E   +      + +        ++ L  Q   ++SE
Sbjct:   310 PPKGALKKLIHAAKLNVSLKSLEGERNHIIIQLSEVDKTKEELTEHIKNLQTQQASLQSE 369

Query:   104 R---ELQMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDL 160
                 E + + L +K+ K+  E      +KL ++K   E +N  +  EE +++V +    +
Sbjct:   370 NIYFESENQKLQQKL-KIMTEFYQENEMKL-YRKLTVE-ENYRIEEEEKLSRVEE---KI 423

Query:   161 QRAHTDVQQIPALLSELES-LRQEYH-HCRGTYEYEKKFYNDHLESLQVMEKNYITMATE 218
               A  +++    L  +LE  L +  H + +    YEK+ +++ L + +  E+N   +  E
Sbjct:   424 SHATEELETYRKLAKDLEEELERTVHFYQKQVISYEKRGHDNWLAA-RTAERNLSDLRKE 482

Query:   219 ----VEKL-RAEL-MNAPNVDRRAADGSYGGATGNSENETSGRPVGQNAYED-GYGVPQG 271
                  +KL   EL       D  A D S   A G   +  S  P+G+ + E   +  PQ 
Sbjct:   483 NAHNKQKLTERELKFELLEKDPNALDVS-NTAFGREHSPCSPSPLGRPSSETRAFPSPQT 541

Query:   272 HGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG-PGYEA----SKGPGYDA 326
                 P   +  + G G    +S       G P+       RG P Y+      + P    
Sbjct:   542 LLEDPLRLSPVLPGGGGRGPSSP------GNPLDHQITNERGEPSYDRLIDPHRAPSDTG 595

Query:   327 SKAPSYDPTKGPSYDPAKGPGY-DPTKGPGYDAQKGSNYDAQRGPNYD-IHRGPSYDPQ- 383
             S +   +  +   + P  G  Y D T  P  + +  SN +   GP      +  S D   
Sbjct:   596 SLSSPVEQDRRMMFPPP-GQSYPDSTLPPQREDRFYSNSERLSGPAEPRSFKMTSLDKMD 654

Query:   384 RGLGYDMQRGPNYDMQRGPG---YETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQR 440
             R +  +M+   N D +   G        +P  +   GP       P   P  GP + +  
Sbjct:   655 RSMPSEMESSRN-DAKDDLGNLNVPDSSLPAENEATGP---GLIPPPLAPISGPLFPVDT 710

Query:   441 GQGYDMRRAPSYDPSR-GTGFDGAPRGAAPHGQVPPPLNNVPYGSAT--PP 488
              +G  MRR P + P   GT F GA RG  P    P P  + P+      PP
Sbjct:   711 -RGPFMRRGPPFPPPPPGTMF-GASRGYFPPRDFPGP-PHAPFAMRNIYPP 758


>UNIPROTKB|F1NGH5
            symbol:Gga.41084 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0006915 "apoptotic process"
            evidence=IEA] InterPro:IPR001965 InterPro:IPR003618
            InterPro:IPR017890 InterPro:IPR019787 Pfam:PF00628 Pfam:PF07500
            PROSITE:PS50016 PROSITE:PS51321 SMART:SM00249 SMART:SM00510
            GO:GO:0005634 GO:GO:0006915 GO:GO:0046872 GO:GO:0008270
            GO:GO:0006351 Gene3D:3.30.40.10 InterPro:IPR011011
            InterPro:IPR013083 SUPFAM:SSF57903 InterPro:IPR019786
            PROSITE:PS01359 Gene3D:1.10.472.30 InterPro:IPR012921 Pfam:PF07744
            SUPFAM:SSF46942 OMA:PNRMCAD GeneTree:ENSGT00530000063844
            EMBL:AADN02019222 EMBL:AADN02019223 IPI:IPI00577866
            Ensembl:ENSGALT00000009066 ArrayExpress:F1NGH5 Uniprot:F1NGH5
        Length = 2287

 Score = 129 (50.5 bits), Expect = 0.00012, Sum P(2) = 0.00012
 Identities = 86/304 (28%), Positives = 116/304 (38%)

Query:   228 NAPNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG-A 286
             + PN  +   D     A G+ E E   R +    YE+    PQ H       +  + G  
Sbjct:  1731 STPNDSQSLQDMHQENAGGHYEPE---RGLSSLQYEEQRN-PQPHQFVEQTESTSIQGEG 1786

Query:   287 GP-------NTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKG-P 338
             GP       N    +    + G P    ++ P GP      GP +    AP +    G P
Sbjct:  1787 GPLPQHFEENRLPFSLPGQKGGPPQPPIHNAPGGP-----HGPNF-RGPAPQFSEEHGSP 1840

Query:   339 SYDPAKGPG---YDPTKGPG---YDAQKGSN-YDAQRGP--NYDIHRG--PSYDPQRGLG 387
             + D  +GP    + P KGP    + +Q GS  +   RGP  +Y + RG  PS        
Sbjct:  1841 NSDGQRGPAPGRFGPQKGPIPSLFSSQHGSPLFGDNRGPAPSYGVPRGMSPSQFEDHMEP 1900

Query:   388 YDMQR---GPNY-DMQRGPG-YETQRVPGYDVQRGPV-YEAQRAP---SYIPQRG-PGYD 437
             +  QR      Y +M R PG +E    P +   RGP  +  QR P    +  QRG P + 
Sbjct:  1901 HMEQREFSDSQYNEMIRPPGQFEGSDQPQFMGNRGPFPFGGQRRPPPAQFKGQRGGPQFG 1960

Query:   438 LQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAP-HGQVPPPLNNVPYGSATPPARSGSGQP 496
               RG        P   P     F+G  RG AP HG  P  L   P+       R GS  P
Sbjct:  1961 GPRGPAPGHFGGPR-GPHTNQ-FEGQ-RGPAPNHGPGPRGLLPQPFEE-----RRGSPPP 2012

Query:   497 RGGN 500
             R  N
Sbjct:  2013 RFAN 2016

 Score = 55 (24.4 bits), Expect = 0.00012, Sum P(2) = 0.00012
 Identities = 30/123 (24%), Positives = 50/123 (40%)

Query:    30 GMRPPMPGAFP-PFDMMPPPEVME--QKIASQHVEMQK---LATENQRLAATHGTLRQEL 83
             G  PP P   P P      P V++    I S           AT +  + ATH +  +  
Sbjct:  1288 GSPPPPPPPLPVPESSAVTPSVLKILSSIKSGSTTTAPPPVSATASPSVTATHSSSSKTA 1347

Query:    84 AAAQHELQILHGQIGGMK-SERELQMRNLTEKIAKMEAE-----LKTAEPVKLEF-QKSK 136
                +H LQ L G+    + + +E +    + + A+  AE     +   +P+  +F Q SK
Sbjct:  1348 TPLEHILQTLFGKKKSFEPAAKESETAQSSNQKAQAAAEGGMAAVPLLDPIVQQFGQMSK 1407

Query:   137 TEA 139
              +A
Sbjct:  1408 DKA 1410


>UNIPROTKB|F1SN69
            symbol:F1SN69 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 InterPro:IPR008985 SUPFAM:SSF49899 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 InterPro:IPR001791 GO:GO:0005201
            SMART:SM00210 GeneTree:ENSGT00700000104301 OMA:YSYPDRL
            EMBL:CU618340 EMBL:CU606988 EMBL:CU861519
            Ensembl:ENSSSCT00000006033 Uniprot:F1SN69
        Length = 1869

 Score = 132 (51.5 bits), Expect = 0.00012, P = 0.00012
 Identities = 74/250 (29%), Positives = 98/250 (39%)

Query:   267 GVPQGHGPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRG-------PGYEA 318
             GVP   GPP +    G  G+ GP  +     A   G P  A YD  +G       PG + 
Sbjct:  1274 GVPGDPGPPGTPGPKGSRGSLGPTGAPGRMGA--QGEPGLAGYDGHKGIMGPLGPPGPKG 1331

Query:   319 SKGP-GYDA-SKAPSYDP-TKGPSYDPAKGPGYDPTKGPGYDAQKG-----SNYDAQRGP 370
              KG  G D  ++ P   P  +GP  D  +G   +P   PGY  Q+G      N   Q  P
Sbjct:  1332 EKGEQGEDGKAEGPPGPPGDRGPVGD--RGDRGEPGD-PGYPGQEGVQGLRGNPGQQGQP 1388

Query:   371 NYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE-TQRVPGYDVQRGPVYEAQRAPSYI 429
              +   RG    P+   G +  +G        PG   TQ +PG    RG V   ++ P  +
Sbjct:  1389 GHPGPRGRP-GPKGSKGEEGPKGKQ-GKAGAPGRRGTQGLPGLPGPRGVV--GRQGPEGV 1444

Query:   430 --PQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAA-PHGQVPPPL---NNVPYG 483
               P   PG D Q GQ  +        P    G  G P  A  P  Q PP     + +P G
Sbjct:  1445 AGPDGLPGLDGQAGQQGEQGDDGDPGPLGPAGKRGNPGVAGLPGAQGPPGFKGESGLP-G 1503

Query:   484 SATPPARSGS 493
                PP + G+
Sbjct:  1504 QLGPPGKRGT 1513


>UNIPROTKB|P05997
            symbol:COL5A2 "Collagen alpha-2(V) chain" species:9606
            "Homo sapiens" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0001501 "skeletal system development"
            evidence=IEA] [GO:0046332 "SMAD binding" evidence=IEA] [GO:0071230
            "cellular response to amino acid stimulus" evidence=IEA]
            [GO:0030199 "collagen fibril organization" evidence=ISS;IMP]
            [GO:0043588 "skin development" evidence=ISS;IMP] [GO:0031012
            "extracellular matrix" evidence=NAS] [GO:0003674
            "molecular_function" evidence=ND] [GO:0048592 "eye morphogenesis"
            evidence=IMP] [GO:0005588 "collagen type V" evidence=IMP]
            [GO:0005576 "extracellular region" evidence=TAS] [GO:0005788
            "endoplasmic reticulum lumen" evidence=TAS] [GO:0007411 "axon
            guidance" evidence=TAS] [GO:0030198 "extracellular matrix
            organization" evidence=TAS] InterPro:IPR000885 InterPro:IPR001007
            Pfam:PF00093 Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208
            PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038 SMART:SM00214
            Reactome:REACT_118779 Reactome:REACT_111045 Reactome:REACT_111102
            GO:GO:0007411 GO:GO:0043588 GO:GO:0046872 GO:GO:0030199
            GO:GO:0005788 GO:GO:0001501 InterPro:IPR008160 Pfam:PF01391
            eggNOG:NOG12793 GO:GO:0071230 GO:GO:0005201 GO:GO:0048592
            HOVERGEN:HBG004933 KO:K06236 MIM:130000 Orphanet:90309
            EMBL:AY016295 PDB:1A9A PDBsum:1A9A MIM:130010 Orphanet:90318
            GO:GO:0005588 EMBL:Y14690 EMBL:AB209045 EMBL:AC064833 EMBL:AC133106
            EMBL:J04478 EMBL:AY016288 EMBL:AY016287 EMBL:AY016289 EMBL:AY016290
            EMBL:AY016291 EMBL:AY016292 EMBL:AY016293 EMBL:AY016294 EMBL:M58529
            EMBL:X04758 EMBL:BC043613 EMBL:M10956 EMBL:M11135 EMBL:M11718
            EMBL:J03051 IPI:IPI00739099 PIR:A31427 RefSeq:NP_000384.2
            UniGene:Hs.445827 ProteinModelPortal:P05997 SMR:P05997
            STRING:P05997 PhosphoSite:P05997 DMDM:143811378 PaxDb:P05997
            PRIDE:P05997 Ensembl:ENST00000374866 GeneID:1290 KEGG:hsa:1290
            UCSC:uc002uqk.3 CTD:1290 GeneCards:GC02M189861 HGNC:HGNC:2210
            MIM:120190 neXtProt:NX_P05997 PharmGKB:PA26725 InParanoid:P05997
            OMA:PDHKPVW OrthoDB:EOG4K0QMS ChiTaRS:COL5A2 GenomeRNAi:1290
            NextBio:5223 PMAP-CutDB:P05997 ArrayExpress:P05997 Bgee:P05997
            Genevestigator:P05997 GermOnline:ENSG00000204262 Uniprot:P05997
        Length = 1499

 Score = 131 (51.2 bits), Expect = 0.00012, P = 0.00012
 Identities = 87/295 (29%), Positives = 109/295 (36%)

Query:   232 VDRRAADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGP 288
             +  + A+G+ G  GA G         P G    E G   P+G  GPP S    G  G   
Sbjct:   782 IGEKGAEGTAGNDGARGLPGPLGPPGPAGPTG-EKGEPGPRGLVGPPGSRGNPGSRGENG 840

Query:   289 NTSTSAYAATQS--GTP-MRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAK 344
              T    +A  Q   G P ++     P   G   S GP G   S  P + P   P     +
Sbjct:   841 PTGAVGFAGPQGPDGQPGVKGEPGEPGQKGDAGSPGPQGLAGSPGP-HGPNGVPGLKGGR 899

Query:   345 GPGYDP--TKGPGYDAQKGSNYDAQR-GPNYDIHRGPSYDPQRGLGYDM-------QRGP 394
             G    P  T  PG   + G    A   GP   +   P  +   GL  D         RGP
Sbjct:   900 GTQGPPGATGFPGSAGRVGPPGPAGAPGPAGPLGE-PGKEGPPGLRGDPGSHGRVGDRGP 958

Query:   395 NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPS 451
                   GPG +    PG D Q GP  +    P+    QRG  G   QRG+ G      P+
Sbjct:   959 A-GPPGGPGDKGD--PGEDGQPGP--DGPPGPAGTTGQRGIVGMPGQRGERGMPGLPGPA 1013

Query:   452 YDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP-RGGNPARR 504
               P +  G  GA     P G V PP +N P G   P   +G+ G P R G    R
Sbjct:  1014 GTPGK-VGPTGATGDKGPPGPVGPPGSNGPVGEPGPEGPAGNDGTPGRDGAVGER 1067


>UNIPROTKB|J9NW09
            symbol:POLR2A "DNA-directed RNA polymerase" species:9615
            "Canis lupus familiaris" [GO:0003899 "DNA-directed RNA polymerase
            activity" evidence=IEA] [GO:0006366 "transcription from RNA
            polymerase II promoter" evidence=IEA] [GO:0005665 "DNA-directed RNA
            polymerase II, core complex" evidence=IEA] [GO:0003677 "DNA
            binding" evidence=IEA] InterPro:IPR000684 InterPro:IPR000722
            InterPro:IPR006592 InterPro:IPR007066 InterPro:IPR007073
            InterPro:IPR007075 InterPro:IPR007080 InterPro:IPR007081
            InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983 Pfam:PF04990
            Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000 Pfam:PF05001
            PROSITE:PS00115 SMART:SM00663 GO:GO:0003677 GO:GO:0006366
            Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899 GO:GO:0005665
            GeneTree:ENSGT00700000104490 EMBL:AAEX03003616 EMBL:AAEX03003617
            Ensembl:ENSCAFT00000050029 Uniprot:J9NW09
        Length = 1789

 Score = 144 (55.7 bits), Expect = 0.00012, Sum P(2) = 0.00012
 Identities = 72/276 (26%), Positives = 99/276 (35%)

Query:   218 EVEKLRAELMNAPNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP- 276
             + EK +  +    N+    A G  G   G++ +   G       +  G     G   P  
Sbjct:  1476 DAEKCKYGMEIPTNIPGLGAAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSV 1535

Query:   277 -SATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPT 335
              S  T G  G  P+ ++ A   +   +P  A    P  PG      PG  +   PS    
Sbjct:  1536 GSGMTPGAAGFSPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGA 1588

Query:   336 KGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPN 395
               PSY P   P Y+P    GY  Q  S Y +   P+Y     PSY P     Y     P+
Sbjct:  1589 MSPSYSPTS-PAYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPS 1642

Query:   396 YDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPS 455
             Y     P Y     P Y     P Y +  +PSY P   P Y       Y    +PSY P+
Sbjct:  1643 YS-PTSPSYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPT 1695

Query:   456 RGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARS 491
               +    +P   +P      P +  P  S T P+ S
Sbjct:  1696 SPSYSPTSP-SYSPTSPSYSPTS--PSYSPTSPSYS 1728

 Score = 37 (18.1 bits), Expect = 0.00012, Sum P(2) = 0.00012
 Identities = 8/16 (50%), Positives = 10/16 (62%)

Query:    52 EQKIASQHVEMQKLAT 67
             E  +A + VE Q LAT
Sbjct:   893 EDGLAGESVEFQNLAT 908


>WB|WBGene00000694
            symbol:col-120 species:6239 "Caenorhabditis elegans"
            [GO:0042302 "structural constituent of cuticle" evidence=IEA]
            [GO:0016021 "integral to membrane" evidence=IEA] InterPro:IPR002486
            Pfam:PF01484 SMART:SM01088 GO:GO:0042302 HOGENOM:HOG000085656
            GeneTree:ENSGT00610000086159 EMBL:AL032632 PIR:T26465
            RefSeq:NP_501617.1 ProteinModelPortal:Q9XWR2 DIP:DIP-26936N
            IntAct:Q9XWR2 MINT:MINT-1070946 STRING:Q9XWR2
            EnsemblMetazoa:Y11D7A.11 GeneID:177748 KEGG:cel:CELE_Y11D7A.11
            UCSC:Y11D7A.11 CTD:177748 WormBase:Y11D7A.11 eggNOG:NOG265281
            InParanoid:Q9XWR2 OMA:HWELLED NextBio:898216 Uniprot:Q9XWR2
        Length = 313

 Score = 122 (48.0 bits), Expect = 0.00012, P = 0.00012
 Identities = 77/268 (28%), Positives = 97/268 (36%)

Query:   247 NSENE-TSGRPVGQNAY--EDGYGV--PQ---GHGPPPSATTAGVVGAGPNTSTSAYAAT 298
             N EN   S + VG      + GYG   P    G  P PS   A    A  ++S+S+ +  
Sbjct:    64 NLENMYESTKAVGSGPVKRQAGYGASSPSRASGSHPAPSPYDA----ASTSSSSSSDSCC 119

Query:   299 QSGTPMRAAYDIPRGPGYEASKGP----GYDASKAPSYDPTKGPSYD---PAKGPGYDPT 351
               G  +      P  PG +   GP    G D         + G   +   PA  PG  P 
Sbjct:   120 SCGIGLAGPAGFPGRPGRDGIDGPAGKPGRDGQDLDGESSSDGSQIELDCPAGPPG--PP 177

Query:   352 KGPGYDAQKGS-NYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGPGYETQRV 409
               PG     G    D   G N    R P    +RG  G D + G   D    PG     +
Sbjct:   178 GNPGPQGNSGRPGMDGMPGRNGRCGR-PGEQGERGPNGEDGRPGRRGD-DGMPG-TVNEI 234

Query:   410 PGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAA 468
             PG   Q GP    + AP     +GP     RG  G    + P+  P    GFDGAP G  
Sbjct:   235 PG---QAGPP-GLRGAPGATGSQGP-----RGNDGRPGNKGPAGPPG-DQGFDGAPGGPG 284

Query:   469 PHGQ--VPPPLNNVPYGSATPPARSGSG 494
               G+     PL      S  PP R+  G
Sbjct:   285 ADGEPGAQGPLGAKGECSHCPPPRTAPG 312


>UNIPROTKB|F1NCR0
            symbol:COL1A2 "Collagen alpha-2(I) chain" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0001501 "skeletal system
            development" evidence=IEA] [GO:0001568 "blood vessel development"
            evidence=IEA] [GO:0005584 "collagen type I" evidence=IEA]
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0007179
            "transforming growth factor beta receptor signaling pathway"
            evidence=IEA] [GO:0007266 "Rho protein signal transduction"
            evidence=IEA] [GO:0008217 "regulation of blood pressure"
            evidence=IEA] [GO:0030199 "collagen fibril organization"
            evidence=IEA] [GO:0030674 "protein binding, bridging" evidence=IEA]
            [GO:0042802 "identical protein binding" evidence=IEA] [GO:0043589
            "skin morphogenesis" evidence=IEA] [GO:0046332 "SMAD binding"
            evidence=IEA] [GO:0048407 "platelet-derived growth factor binding"
            evidence=IEA] [GO:0070208 "protein heterotrimerization"
            evidence=IEA] [GO:0071230 "cellular response to amino acid
            stimulus" evidence=IEA] InterPro:IPR000885 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS51461 SMART:SM00038 GO:GO:0005615
            GO:GO:0030199 GO:GO:0007179 GO:GO:0007266 GO:GO:0070208
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0071230 GO:GO:0005201
            GeneTree:ENSGT00660000095287 GO:GO:0005584 EMBL:AADN02000724
            IPI:IPI00821202 Ensembl:ENSGALT00000015706 ArrayExpress:F1NCR0
            Uniprot:F1NCR0
        Length = 1318

 Score = 130 (50.8 bits), Expect = 0.00013, P = 0.00013
 Identities = 81/262 (30%), Positives = 97/262 (37%)

Query:   256 PVGQNAYEDGYGVPQGH----GPPPSATTAGVVGA-GPNTSTSAYAATQSGTP-MRAAYD 309
             PVG+   E G   P G     GP   A  AG  G  GP     A      G P  R    
Sbjct:   781 PVGRTG-EQGIAGPPGFAGEKGPSGEAGAAGPPGTPGPQGILGAPGIL--GLPGSRGERG 837

Query:   310 IPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYDAQR 368
             +P   G     GP    S  P     +GPS  P   PG +   G  G D   G++    R
Sbjct:   838 LPGIAGATGEPGP-LGVSGPPG---ARGPS-GPVGSPGPNGAPGEAGRDGNPGNDGPPGR 892

Query:   369 --GPNYDIHRG-PSYD-PQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQR 424
                P +   RG P    P   LG     GP+   Q GP  +    PG     GPV     
Sbjct:   893 DGAPGFKGERGAPGNPGPSGALGAP---GPHG--QVGPSGK----PGNRGDPGPVGPVGP 943

Query:   425 APSYIPQ--RGP-GYDLQRGQGYDM--RRAPSYDPSRG-TGFDGAPRGAAPHGQVPPPLN 478
             A ++ P+   GP G   ++G+  D   R  P     +G  G  G P  A  HG   PP N
Sbjct:   944 AGAFGPRGLAGPQGPRGEKGEPGDKGHRGLPGL---KGHNGLQGLPGLAGQHGDQGPPGN 1000

Query:   479 NVPYGSATPPARSGSGQPRGGN 500
             N P G   PP  SG     G N
Sbjct:  1001 NGPAGPRGPPGPSGPPGKDGRN 1022


>UNIPROTKB|F6UV28
            symbol:TPR "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0006606 "protein import into nucleus"
            evidence=IEA] [GO:0006434 "seryl-tRNA aminoacylation" evidence=IEA]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005643 "nuclear pore"
            evidence=IEA] [GO:0005524 "ATP binding" evidence=IEA] [GO:0004828
            "serine-tRNA ligase activity" evidence=IEA] InterPro:IPR012929
            InterPro:IPR015866 Pfam:PF07926 GO:GO:0005524 GO:GO:0005737
            GO:GO:0005643 GO:GO:0006606 InterPro:IPR009053 SUPFAM:SSF46579
            GeneTree:ENSGT00700000104019 GO:GO:0004828 GO:GO:0006434
            Gene3D:1.10.287.40 OMA:RFIRREK EMBL:AAEX03005165
            Ensembl:ENSCAFT00000021777 Uniprot:F6UV28
        Length = 2127

 Score = 126 (49.4 bits), Expect = 0.00013, Sum P(2) = 0.00013
 Identities = 41/186 (22%), Positives = 87/186 (46%)

Query:    48 PEVME-QKIASQ---HVE-MQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKS 102
             P+  E +K+ S+   H + +Q+L  E  RL A        L   Q+ +Q L   +  +++
Sbjct:  1113 PDTEEYRKLLSEKEVHTKRIQQLTEEIGRLKAEIARSNASLTNNQNLIQSLKEDLNKVRT 1172

Query:   103 ERELQMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQR 162
             E+E   ++L  KI  ++ ++KT   VK   ++ KT+ + L   +++++    Q + D Q 
Sbjct:  1173 EKETIQKDLDAKIIDIQEKVKTITQVKKIGRRYKTQYEELKAQQDKVMETSAQSSGDHQE 1232

Query:   163 AHTDVQQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKL 222
              H  VQ++  L   L     +        E  +K  ++     + +++  + +  E+ +L
Sbjct:  1233 QHVSVQEMQELKETLNQAETKAKSLESQVENLQKTLSEKEAEARNLQEQTVQLQCELSRL 1292

Query:   223 RAELMN 228
             R +L +
Sbjct:  1293 RQDLQD 1298

 Score = 57 (25.1 bits), Expect = 0.00013, Sum P(2) = 0.00013
 Identities = 17/54 (31%), Positives = 22/54 (40%)

Query:   243 GATGNSENETSGRPVGQNAYE--DGYGVPQGHGPPPSATTAGVVGAGPNTSTSA 294
             G  G   NE +G   G + YE  D  G   G G  P   T   +G G +   +A
Sbjct:  1746 GDEGEDSNEGTGSADGNDGYEADDAEG---GDGTDPGTETEESMGGGESNQRAA 1796


>UNIPROTKB|P02467
            symbol:COL1A2 "Collagen alpha-2(I) chain" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0005583 "fibrillar collagen" evidence=IDA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 GO:GO:0046872 InterPro:IPR008160 Pfam:PF01391
            eggNOG:NOG12793 GO:GO:0005201 HOVERGEN:HBG004933 EMBL:M25963
            EMBL:M25956 EMBL:M25959 EMBL:M25961 EMBL:M25962 EMBL:M25965
            EMBL:M25964 EMBL:M25984 EMBL:M25957 EMBL:M25966 EMBL:M25967
            EMBL:M25969 EMBL:M25970 EMBL:M25971 EMBL:M25972 EMBL:M25973
            EMBL:M25974 EMBL:M25976 EMBL:M25977 EMBL:M25978 EMBL:M25979
            EMBL:M25980 EMBL:M25981 EMBL:M25982 EMBL:M25983 EMBL:J00826
            EMBL:J00821 EMBL:K00792 EMBL:J00830 EMBL:J00829 EMBL:J00837
            EMBL:J00812 EMBL:J00811 EMBL:J00814 EMBL:J00815 EMBL:X02657
            EMBL:K00794 EMBL:V00390 EMBL:M17608 EMBL:M10581 EMBL:M10540
            EMBL:J00828 EMBL:J00827 EMBL:J00832 EMBL:J00831 EMBL:J00833
            EMBL:J00822 IPI:IPI00914483 PIR:I50173 PIR:I50206 PIR:S10847
            UniGene:Gga.5097 STRING:P02467 PRIDE:P02467 InParanoid:P02467
            PMAP-CutDB:P02467 GO:GO:0005583 Uniprot:P02467
        Length = 1362

 Score = 130 (50.8 bits), Expect = 0.00014, P = 0.00014
 Identities = 81/262 (30%), Positives = 97/262 (37%)

Query:   256 PVGQNAYEDGYGVPQGH----GPPPSATTAGVVGA-GPNTSTSAYAATQSGTP-MRAAYD 309
             PVG+   E G   P G     GP   A  AG  G  GP     A      G P  R    
Sbjct:   825 PVGRTG-EQGIAGPPGFAGEKGPSGEAGAAGPPGTPGPQGILGAPGIL--GLPGSRGERG 881

Query:   310 IPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYDAQR 368
             +P   G     GP    S  P     +GPS  P   PG +   G  G D   G++    R
Sbjct:   882 LPGIAGATGEPGP-LGVSGPPG---ARGPS-GPVGSPGPNGAPGEAGRDGNPGNDGPPGR 936

Query:   369 --GPNYDIHRG-PSYD-PQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQR 424
                P +   RG P    P   LG     GP+   Q GP  +    PG     GPV     
Sbjct:   937 DGAPGFKGERGAPGNPGPSGALGAP---GPHG--QVGPSGK----PGNRGDPGPVGPVGP 987

Query:   425 APSYIPQ--RGP-GYDLQRGQGYDM--RRAPSYDPSRG-TGFDGAPRGAAPHGQVPPPLN 478
             A ++ P+   GP G   ++G+  D   R  P     +G  G  G P  A  HG   PP N
Sbjct:   988 AGAFGPRGLAGPQGPRGEKGEPGDKGHRGLPGL---KGHNGLQGLPGLAGQHGDQGPPGN 1044

Query:   479 NVPYGSATPPARSGSGQPRGGN 500
             N P G   PP  SG     G N
Sbjct:  1045 NGPAGPRGPPGPSGPPGKDGRN 1066


>UNIPROTKB|F1P0H9
            symbol:COL1A2 "Collagen alpha-2(I) chain" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0001501 "skeletal system
            development" evidence=IEA] [GO:0001568 "blood vessel development"
            evidence=IEA] [GO:0005584 "collagen type I" evidence=IEA]
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0007179
            "transforming growth factor beta receptor signaling pathway"
            evidence=IEA] [GO:0007266 "Rho protein signal transduction"
            evidence=IEA] [GO:0008217 "regulation of blood pressure"
            evidence=IEA] [GO:0030199 "collagen fibril organization"
            evidence=IEA] [GO:0030674 "protein binding, bridging" evidence=IEA]
            [GO:0042802 "identical protein binding" evidence=IEA] [GO:0043589
            "skin morphogenesis" evidence=IEA] [GO:0046332 "SMAD binding"
            evidence=IEA] [GO:0048407 "platelet-derived growth factor binding"
            evidence=IEA] [GO:0070208 "protein heterotrimerization"
            evidence=IEA] [GO:0071230 "cellular response to amino acid
            stimulus" evidence=IEA] InterPro:IPR000885 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS51461 SMART:SM00038 GO:GO:0005615
            GO:GO:0030199 GO:GO:0007179 GO:GO:0007266 GO:GO:0070208
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0071230 GO:GO:0005201
            GeneTree:ENSGT00660000095287 KO:K06236 GO:GO:0005584 CTD:1278
            IPI:IPI00914483 UniGene:Gga.5097 EMBL:AADN02000724
            RefSeq:NP_001073182.2 PRIDE:F1P0H9 Ensembl:ENSGALT00000015703
            GeneID:396243 KEGG:gga:396243 OMA:IGMPGAR NextBio:20816295
            ArrayExpress:F1P0H9 Uniprot:F1P0H9
        Length = 1363

 Score = 130 (50.8 bits), Expect = 0.00014, P = 0.00014
 Identities = 81/262 (30%), Positives = 97/262 (37%)

Query:   256 PVGQNAYEDGYGVPQGH----GPPPSATTAGVVGA-GPNTSTSAYAATQSGTP-MRAAYD 309
             PVG+   E G   P G     GP   A  AG  G  GP     A      G P  R    
Sbjct:   826 PVGRTG-EQGIAGPPGFAGEKGPSGEAGAAGPPGTPGPQGILGAPGIL--GLPGSRGERG 882

Query:   310 IPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYDAQR 368
             +P   G     GP    S  P     +GPS  P   PG +   G  G D   G++    R
Sbjct:   883 LPGIAGATGEPGP-LGVSGPPG---ARGPS-GPVGSPGPNGAPGEAGRDGNPGNDGPPGR 937

Query:   369 --GPNYDIHRG-PSYD-PQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQR 424
                P +   RG P    P   LG     GP+   Q GP  +    PG     GPV     
Sbjct:   938 DGAPGFKGERGAPGNPGPSGALGAP---GPHG--QVGPSGK----PGNRGDPGPVGPVGP 988

Query:   425 APSYIPQ--RGP-GYDLQRGQGYDM--RRAPSYDPSRG-TGFDGAPRGAAPHGQVPPPLN 478
             A ++ P+   GP G   ++G+  D   R  P     +G  G  G P  A  HG   PP N
Sbjct:   989 AGAFGPRGLAGPQGPRGEKGEPGDKGHRGLPGL---KGHNGLQGLPGLAGQHGDQGPPGN 1045

Query:   479 NVPYGSATPPARSGSGQPRGGN 500
             N P G   PP  SG     G N
Sbjct:  1046 NGPAGPRGPPGPSGPPGKDGRN 1067


>UNIPROTKB|F1SNP1
            symbol:COL4A4 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0032836 "glomerular basement membrane development"
            evidence=IEA] [GO:0005605 "basal lamina" evidence=IEA] [GO:0005587
            "collagen type IV" evidence=IEA] [GO:0005201 "extracellular matrix
            structural constituent" evidence=IEA] InterPro:IPR001442
            Pfam:PF01413 PROSITE:PS51403 SMART:SM00111 InterPro:IPR016187
            SUPFAM:SSF56436 GO:GO:0005605 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0005201 GO:GO:0005587 Gene3D:2.170.240.10
            GeneTree:ENSGT00690000101772 GO:GO:0032836 OMA:FRGDMGD
            EMBL:CU466451 EMBL:FP690341 Ensembl:ENSSSCT00000017688
            Uniprot:F1SNP1
        Length = 1711

 Score = 131 (51.2 bits), Expect = 0.00014, P = 0.00014
 Identities = 76/260 (29%), Positives = 89/260 (34%)

Query:   254 GRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG 313
             G P G    E   G+P   GPP      G  G               G P         G
Sbjct:  1207 GVP-GPRGPEGSMGLPGQRGPP-GPECKGEPGPDGRRGEDGLPGPP-GPPGHKGDMGEAG 1263

Query:   314 -PGYEASKG-PGYDASKAPSYDPTKGPSYDPAKG--PGYDPTKGPGYDAQKGSNYDAQRG 369
              PG    KG PG   +  PS    +G + DP  G   G  P   PG     G N   QRG
Sbjct:  1264 CPGAPGPKGFPGRRGTPGPSLIGFRGDTGDPGFGGEKGSSPIGPPGSPGSPGMN--GQRG 1321

Query:   370 PNYDIHRG-PSYDPQRGL-GYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRA-- 425
             P  D   G P    +RGL G    +G   D  R        +PG+   +GP     RA  
Sbjct:  1322 PPGDPALGYPGPPGKRGLFGSPGSKGLRGDPGRPGATGPAGMPGFPGLKGPKGREGRAGF 1381

Query:   426 PSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSA 485
             P  +P   PG+  + G     R  P   P    G  GAP      G + PP      G  
Sbjct:  1382 PG-VPGP-PGHSCESGA--PGRPGPPGLPG-APGSPGAPGWKGQRGDMGPPGPAGMKGVP 1436

Query:   486 TPPARSGSGQPRG--GNPAR 503
               P R G   P G  G P R
Sbjct:  1437 GVPGRPGPDGPPGPPGVPGR 1456


>TAIR|locus:2079502
            symbol:RS31 "arginine/serine-rich splicing factor 31"
            species:3702 "Arabidopsis thaliana" [GO:0000166 "nucleotide
            binding" evidence=IEA] [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0003723 "RNA binding" evidence=ISS] [GO:0005737
            "cytoplasm" evidence=ISM] [GO:0016607 "nuclear speck" evidence=IDA]
            [GO:0008380 "RNA splicing" evidence=NAS] [GO:0000398 "mRNA
            splicing, via spliceosome" evidence=IDA;RCA] [GO:0006355
            "regulation of transcription, DNA-dependent" evidence=RCA]
            [GO:0030422 "production of siRNA involved in RNA interference"
            evidence=RCA] [GO:0035196 "production of miRNAs involved in gene
            silencing by miRNA" evidence=RCA] [GO:0043687 "post-translational
            protein modification" evidence=RCA] [GO:0045893 "positive
            regulation of transcription, DNA-dependent" evidence=RCA]
            [GO:0005681 "spliceosomal complex" evidence=TAS] InterPro:IPR000504
            InterPro:IPR012677 Pfam:PF00076 PROSITE:PS50102 SMART:SM00360
            EMBL:CP002686 GenomeReviews:BA000014_GR GO:GO:0000166 GO:GO:0016607
            Gene3D:3.30.70.330 GO:GO:0005681 GO:GO:0003723 GO:GO:0000398
            EMBL:AL138642 HOGENOM:HOG000276234 KO:K12893 EMBL:X99435
            EMBL:AF439831 EMBL:AY125565 IPI:IPI00530595 PIR:T47978 PIR:T51304
            RefSeq:NP_567120.1 UniGene:At.24231 ProteinModelPortal:P92964
            SMR:P92964 IntAct:P92964 STRING:P92964 PaxDb:P92964 PRIDE:P92964
            EnsemblPlants:AT3G61860.1 GeneID:825359 KEGG:ath:AT3G61860
            TAIR:At3g61860 eggNOG:NOG277933 InParanoid:P92964 OMA:FEYETRQ
            PhylomeDB:P92964 ProtClustDB:CLSN2917489 Genevestigator:P92964
            GermOnline:AT3G61860 Uniprot:P92964
        Length = 264

 Score = 120 (47.3 bits), Expect = 0.00014, P = 0.00014
 Identities = 30/88 (34%), Positives = 41/88 (46%)

Query:   302 TPMRAAYDIPR---GPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG-YDPTKGPGYD 357
             +P R+   + R    P Y     PG     +P Y   + P YD  KGP  Y+  + P Y 
Sbjct:   177 SPRRSLSPVYRRRPSPDYGRRPSPGQGRRPSPDYGRARSPEYDRYKGPAAYERRRSPDY- 235

Query:   358 AQKGSNYDAQRGPNYDIHRGPSYDPQRG 385
              ++ S+Y  QR P YD +R  S  P RG
Sbjct:   236 GRRSSDYGRQRSPGYDRYRSRSPVP-RG 262


>UNIPROTKB|F1KQQ4
            symbol:F1KQQ4 "Collagen alpha-1(IV) chain" species:6253
            "Ascaris suum" [GO:0031012 "extracellular matrix" evidence=IDA]
            InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
            InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 Gene3D:2.170.240.10 EMBL:JI164326
            Uniprot:F1KQQ4
        Length = 1759

 Score = 131 (51.2 bits), Expect = 0.00014, P = 0.00014
 Identities = 88/302 (29%), Positives = 111/302 (36%)

Query:   226 LMNAPNVDRRAADGSYGGATGNSE-NETSGRP-----VGQNAYEDGYGVPQGHGPPPSAT 279
             L   P  D       + G  G++  N  +G P      G+   +  +G P   GP  +  
Sbjct:  1147 LQGPPGFDGLQGQKGHRGIPGDAGFNGRAGLPGLPGIKGERGQDGQHGYPGEPGPVGAHG 1206

Query:   280 TAGVVGAGPNTSTSAYAATQSGTPMR----AAYDIPRGPGYEASKG----PGYDASKA-P 330
              +G+ GA P          + G P +     A   P  PG E   G     G D     P
Sbjct:  1207 ESGLTGA-PGLQGEPGLPGRMGLPGQPGELGAPGFPGAPGLEGIPGIRGERGDDGLPGLP 1265

Query:   331 SYD--PTKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRG-PNYDIHRG----PSYDP 382
               D  P +GP  D A  PG D   G PG   Q+G   D   G P     RG    P Y  
Sbjct:  1266 GIDGIPIQGPEGD-AGYPGRDGNDGLPGLPGQRGD--DGLPGLPGLIGERGDDGLPGYPG 1322

Query:   383 QRGL-GYDMQRGPNYDMQRG-PGYE-TQRVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDL 438
             +RGL G D +RGP  D  RG PG       PG   +RG        P +  + G PGY  
Sbjct:  1323 ERGLRGIDGKRGP--DGARGLPGPPGLDGYPGAPGERG----MDGLPGFPGKDGIPGYPG 1376

Query:   439 QRGQGYDMRRAPSYDPSRGT-GFDGAPRGAAPHG-QVPPPLNNVPYGSATPPARSGSGQP 496
             +RG+       P     RG  G  G P  A   G +    L  +P G   P    G   P
Sbjct:  1377 ERGEV----GLPGLPGMRGEDGLPGLPGLAGQKGARGDDGLPGLP-GLPGPVGARGRPGP 1431

Query:   497 RG 498
              G
Sbjct:  1432 PG 1433


>UNIPROTKB|F1LNY9
            symbol:Col2a1 "Collagen alpha-1(II) chain" species:10116
            "Rattus norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 RGD:2375 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 IPI:IPI00558825
            Ensembl:ENSRNOT00000049994 ArrayExpress:F1LNY9 Uniprot:F1LNY9
        Length = 1441

 Score = 130 (50.8 bits), Expect = 0.00014, P = 0.00014
 Identities = 82/285 (28%), Positives = 106/285 (37%)

Query:   230 PNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP 288
             P    R       GA GN        P G      G G P   G P +   AG  GA GP
Sbjct:   283 PGERGRTGPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFP---GAPGAKGEAGPTGARGP 339

Query:   289 NTSTSAYAATQS-GTPMRAAYDIPRG----PGYEASKG-PGYDASKAPSYDPTKGPSYDP 342
               +  +     + G+P  A      G    PG + S G PG   + AP +   +GP   P
Sbjct:   340 EGAQGSRGEPGNPGSPGPAGASGNPGTDGIPGAKGSAGAPGI--AGAPGFPGPRGP---P 394

Query:   343 AKGP-GYDPTKGP-GYDAQKG-SNYDAQRGPNYDIH-RGPSYDPQRGLGYDMQRGPNYDM 398
               GP G     GP G   + G + +  ++GP  +    GP   P    G + +RG   + 
Sbjct:   395 --GPQGATGPLGPKGQTGEPGIAGFKGEQGPKGETGPAGPQGAPGPA-GEEGKRGARGEP 451

Query:   399 Q-RGP-GYETQR-VPGYDVQRG-PVYEAQRAPSYIP-QRGP-GYDLQRGQGYDMRRAPSY 452
                GP G   +R  PG    RG P  +    P   P +RGP G    +G   D  R    
Sbjct:   452 GGAGPIGPPGERGAPG---NRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEP 508

Query:   453 DPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP 496
                   G  G P  A P G+V P       G   PP   G+ GQP
Sbjct:   509 GLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQP 553

 Score = 126 (49.4 bits), Expect = 0.00039, P = 0.00039
 Identities = 89/301 (29%), Positives = 117/301 (38%)

Query:   235 RAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPP-PSATTA--GVVGA-GPNT 290
             R A G   G  G+  +     P G    + G G+    GPP P+      G VG  GP+ 
Sbjct:   709 RGAAG-IAGPKGDRGDVGEKGPEGAPGKDGGRGLTGPIGPPGPAGANGEKGEVGPPGPSG 767

Query:   291 STSAYAAT----QSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPA 343
             ST A  A     ++G P  A +  P G  G   +KG  G    K  +  P  +GPS  P 
Sbjct:   768 STGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGDQGEAGQKGDAGAPGPQGPSGAPG 827

Query:   344 -KGP-GYDPTKGP-GYDAQKGSN-YDAQRG----PNYDIHRGPSYDPQRGLGYDMQRGPN 395
              +GP G    KG  G     G+  +    G    P  + + GP+  P    G D  +G  
Sbjct:   828 PQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGPPGSNGNPGPA-GPPGPAGKDGPKGAR 886

Query:   396 YDM----QRG-PGYETQR-VPGYDVQRG---PV-YEAQRAPSYIP-QRG-PGYDLQRGQ- 442
              D     + G PG +     PG   + G   P   +    P  +  QRG  G   QRG+ 
Sbjct:   887 GDTGAPGRAGDPGLQGPAGAPGEKGEPGDDGPSGSDGPPGPQGLAGQRGIVGLPGQRGER 946

Query:   443 GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPA 502
             G+     PS +P +  G  GA     P G V PP    P G    P R GS     G P 
Sbjct:   947 GFPGLPGPSGEPGK-QGAPGASGDRGPPGPVGPPGLTGPAGE---PGREGS-PGADGPPG 1001

Query:   503 R 503
             R
Sbjct:  1002 R 1002

 Score = 125 (49.1 bits), Expect = 0.00050, P = 0.00050
 Identities = 87/281 (30%), Positives = 99/281 (35%)

Query:   237 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 294
             ADG  G      E    G   G    +   G P   GP       G  GA GP  +T   
Sbjct:   795 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 853

Query:   295 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 350
              AA + G P       P GP      GP G D  K    D    G + DP  +GP   P 
Sbjct:   854 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 908

Query:   351 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 406
              KG PG D   GS  D   GP     +G +   QRG+ G   QRG   +    GP  E  
Sbjct:   909 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 959

Query:   407 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 462
              Q  PG    RGP           P   PG +   G      R  A      RG TG  G
Sbjct:   960 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 1019

Query:   463 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 502
             AP    P G  P P    P G       +G+  P G   PA
Sbjct:  1020 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1057


>UNIPROTKB|F1LQ06
            symbol:Col2a1 "Collagen alpha-1(II) chain" species:10116
            "Rattus norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 RGD:2375 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 IPI:IPI00949996
            Ensembl:ENSRNOT00000066385 ArrayExpress:F1LQ06 Uniprot:F1LQ06
        Length = 1441

 Score = 130 (50.8 bits), Expect = 0.00014, P = 0.00014
 Identities = 82/285 (28%), Positives = 106/285 (37%)

Query:   230 PNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP 288
             P    R       GA GN        P G      G G P   G P +   AG  GA GP
Sbjct:   283 PGERGRTGPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFP---GAPGAKGEAGPTGARGP 339

Query:   289 NTSTSAYAATQS-GTPMRAAYDIPRG----PGYEASKG-PGYDASKAPSYDPTKGPSYDP 342
               +  +     + G+P  A      G    PG + S G PG   + AP +   +GP   P
Sbjct:   340 EGAQGSRGEPGNPGSPGPAGASGNPGTDGIPGAKGSAGAPGI--AGAPGFPGPRGP---P 394

Query:   343 AKGP-GYDPTKGP-GYDAQKG-SNYDAQRGPNYDIH-RGPSYDPQRGLGYDMQRGPNYDM 398
               GP G     GP G   + G + +  ++GP  +    GP   P    G + +RG   + 
Sbjct:   395 --GPQGATGPLGPKGQTGEPGIAGFKGEQGPKGETGPAGPQGAPGPA-GEEGKRGARGEP 451

Query:   399 Q-RGP-GYETQR-VPGYDVQRG-PVYEAQRAPSYIP-QRGP-GYDLQRGQGYDMRRAPSY 452
                GP G   +R  PG    RG P  +    P   P +RGP G    +G   D  R    
Sbjct:   452 GGAGPIGPPGERGAPG---NRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEP 508

Query:   453 DPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP 496
                   G  G P  A P G+V P       G   PP   G+ GQP
Sbjct:   509 GLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQP 553

 Score = 126 (49.4 bits), Expect = 0.00039, P = 0.00039
 Identities = 89/301 (29%), Positives = 117/301 (38%)

Query:   235 RAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPP-PSATTA--GVVGA-GPNT 290
             R A G   G  G+  +     P G    + G G+    GPP P+      G VG  GP+ 
Sbjct:   709 RGAAG-IAGPKGDRGDVGEKGPEGAPGKDGGRGLTGPIGPPGPAGANGEKGEVGPPGPSG 767

Query:   291 STSAYAAT----QSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPA 343
             ST A  A     ++G P  A +  P G  G   +KG  G    K  +  P  +GPS  P 
Sbjct:   768 STGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGDQGEAGQKGDAGAPGPQGPSGAPG 827

Query:   344 -KGP-GYDPTKGP-GYDAQKGSN-YDAQRG----PNYDIHRGPSYDPQRGLGYDMQRGPN 395
              +GP G    KG  G     G+  +    G    P  + + GP+  P    G D  +G  
Sbjct:   828 PQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGPPGSNGNPGPA-GPPGPAGKDGPKGAR 886

Query:   396 YDM----QRG-PGYETQR-VPGYDVQRG---PV-YEAQRAPSYIP-QRG-PGYDLQRGQ- 442
              D     + G PG +     PG   + G   P   +    P  +  QRG  G   QRG+ 
Sbjct:   887 GDTGAPGRAGDPGLQGPAGAPGEKGEPGDDGPSGSDGPPGPQGLAGQRGIVGLPGQRGER 946

Query:   443 GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPA 502
             G+     PS +P +  G  GA     P G V PP    P G    P R GS     G P 
Sbjct:   947 GFPGLPGPSGEPGK-QGAPGASGDRGPPGPVGPPGLTGPAGE---PGREGS-PGADGPPG 1001

Query:   503 R 503
             R
Sbjct:  1002 R 1002

 Score = 125 (49.1 bits), Expect = 0.00050, P = 0.00050
 Identities = 87/281 (30%), Positives = 99/281 (35%)

Query:   237 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 294
             ADG  G      E    G   G    +   G P   GP       G  GA GP  +T   
Sbjct:   795 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 853

Query:   295 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 350
              AA + G P       P GP      GP G D  K    D    G + DP  +GP   P 
Sbjct:   854 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 908

Query:   351 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 406
              KG PG D   GS  D   GP     +G +   QRG+ G   QRG   +    GP  E  
Sbjct:   909 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 959

Query:   407 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 462
              Q  PG    RGP           P   PG +   G      R  A      RG TG  G
Sbjct:   960 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 1019

Query:   463 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 502
             AP    P G  P P    P G       +G+  P G   PA
Sbjct:  1020 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1057


>MGI|MGI:88467
            symbol:Col1a1 "collagen, type I, alpha 1" species:10090 "Mus
            musculus" [GO:0001501 "skeletal system development"
            evidence=ISO;IMP] [GO:0001568 "blood vessel development"
            evidence=ISO;IMP] [GO:0001957 "intramembranous ossification"
            evidence=IGI] [GO:0001958 "endochondral ossification" evidence=IMP]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IDA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005576 "extracellular region" evidence=IEA] [GO:0005578
            "proteinaceous extracellular matrix" evidence=IDA] [GO:0005581
            "collagen" evidence=IMP;IDA] [GO:0005584 "collagen type I"
            evidence=ISO;IMP;IDA] [GO:0005615 "extracellular space"
            evidence=ISO] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0007601
            "visual perception" evidence=ISO] [GO:0007605 "sensory perception
            of sound" evidence=ISO] [GO:0010718 "positive regulation of
            epithelial to mesenchymal transition" evidence=ISO] [GO:0010812
            "negative regulation of cell-substrate adhesion" evidence=IDA]
            [GO:0015031 "protein transport" evidence=IMP] [GO:0030199 "collagen
            fibril organization" evidence=ISO] [GO:0030335 "positive regulation
            of cell migration" evidence=ISO] [GO:0031012 "extracellular matrix"
            evidence=IDA] [GO:0032964 "collagen biosynthetic process"
            evidence=ISO] [GO:0034504 "protein localization to nucleus"
            evidence=ISO] [GO:0034505 "tooth mineralization" evidence=ISO]
            [GO:0042060 "wound healing" evidence=ISO] [GO:0042802 "identical
            protein binding" evidence=ISO] [GO:0043588 "skin development"
            evidence=IMP] [GO:0043589 "skin morphogenesis" evidence=ISO]
            [GO:0045893 "positive regulation of transcription, DNA-dependent"
            evidence=ISO] [GO:0046872 "metal ion binding" evidence=IEA]
            [GO:0048407 "platelet-derived growth factor binding" evidence=ISO]
            [GO:0048705 "skeletal system morphogenesis" evidence=IGI]
            [GO:0048706 "embryonic skeletal system development" evidence=ISO]
            [GO:0060325 "face morphogenesis" evidence=IGI] [GO:0060346 "bone
            trabecula formation" evidence=IGI] [GO:0060351 "cartilage
            development involved in endochondral bone morphogenesis"
            evidence=IMP] [GO:0070208 "protein heterotrimerization"
            evidence=IDA] [GO:0071230 "cellular response to amino acid
            stimulus" evidence=IDA] [GO:0090263 "positive regulation of
            canonical Wnt receptor signaling pathway" evidence=ISO]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 MGI:MGI:88467 GO:GO:0005737
            GO:GO:0045893 GO:GO:0043588 GO:GO:0005615 GO:GO:0071363
            GO:GO:0046872 GO:GO:0015031 GO:GO:0007601 GO:GO:0071300
            GO:GO:0043434 GO:GO:0030199 GO:GO:0007584 GO:GO:0010035
            GO:GO:0007605 GO:GO:0010718 GO:GO:0030335 GO:GO:0042542
            GO:GO:0070208 InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793
            GO:GO:0071260 GO:GO:0001568 GO:GO:0001649 GO:GO:0051591
            GO:GO:0034505 GO:GO:0090263 GO:GO:0010812 GO:GO:0060325
            GO:GO:0032964 GO:GO:0071230 GO:GO:0048706 GO:GO:0001957
            GO:GO:0034504 GO:GO:0001958 GO:GO:0060351 GO:GO:0005201
            GO:GO:0043589 CTD:1277 GeneTree:ENSGT00660000095287
            HOVERGEN:HBG004933 KO:K06236 OMA:VAYMDQQ OrthoDB:EOG4S4PHP
            GO:GO:0005584 GO:GO:0060346 ChiTaRS:COL1A1 GO:GO:0031960
            EMBL:U08020 EMBL:AL662790 EMBL:AL606480 EMBL:BC050014 EMBL:BC059281
            EMBL:K01688 EMBL:S67530 EMBL:S67482 EMBL:X54876 EMBL:M14423
            EMBL:M17491 EMBL:K03036 EMBL:K03029 EMBL:K03030 EMBL:K03031
            EMBL:K03032 EMBL:K03033 EMBL:K03034 EMBL:K03035 EMBL:X06753
            EMBL:X15896 EMBL:X57981 IPI:IPI00329872 IPI:IPI00623191 PIR:I49558
            PIR:S57243 RefSeq:NP_031768.2 UniGene:Mm.277735 UniGene:Mm.458212
            ProteinModelPortal:P11087 SMR:P11087 IntAct:P11087 STRING:P11087
            PhosphoSite:P11087 PaxDb:P11087 PRIDE:P11087
            Ensembl:ENSMUST00000001547 GeneID:12842 KEGG:mmu:12842
            UCSC:uc007kzn.1 InParanoid:P11087 NextBio:282376 PMAP-CutDB:P11087
            Bgee:P11087 CleanEx:MM_COL1A1 Genevestigator:P11087
            GermOnline:ENSMUSG00000001506 Uniprot:P11087
        Length = 1453

 Score = 130 (50.8 bits), Expect = 0.00015, P = 0.00015
 Identities = 79/254 (31%), Positives = 95/254 (37%)

Query:   256 PVGQNAYEDGYGVPQGHGPPPSAT----TAGVVGAGPNTSTSAYAATQSGTPMRAAYDIP 311
             P+G N    G   P+G   PP AT     AG VG  P  S +A      G   +     P
Sbjct:   841 PIG-NVGAPGPKGPRGAAGPPGATGFPGAAGRVGP-PGPSGNAGPPGPPGPVGKEGGKGP 898

Query:   312 RGPGYEASKGPGYDASKAPSYDPTKGPSYDP-AKGPGYDP-TKGP-GYDAQKGS-NYDAQ 367
             RG    A + PG      P   P  G    P A GP   P T GP G   Q+G      Q
Sbjct:   899 RGETGPAGR-PGEVGPPGPP-GPA-GEKGSPGADGPAGSPGTPGPQGIAGQRGVVGLPGQ 955

Query:   368 RGPN-YDIHRGPSYDP-QRG-LGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQR 424
             RG   +    GPS +P ++G  G   +RGP   M  GP       PG     GP  E+ R
Sbjct:   956 RGERGFPGLPGPSGEPGKQGPSGSSGERGPPGPM--GP-------PGL---AGPPGESGR 1003

Query:   425 APSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGS 484
               S   +  PG D   G   D        P    G  GAP    P G+        P G 
Sbjct:  1004 EGSPGAEGSPGRDGAPGAKGDRGETGPAGPPGAPGAPGAPGPVGPAGKNGDRGETGPAGP 1063

Query:   485 ATPPARSGSGQPRG 498
             A P   +G+  P G
Sbjct:  1064 AGPIGPAGARGPAG 1077


>UNIPROTKB|F1M8G1
            symbol:Col2a1 "Collagen alpha-1(II) chain" species:10116
            "Rattus norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 RGD:2375 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 IPI:IPI00475975
            Ensembl:ENSRNOT00000050833 ArrayExpress:F1M8G1 Uniprot:F1M8G1
        Length = 1458

 Score = 130 (50.8 bits), Expect = 0.00015, P = 0.00015
 Identities = 82/285 (28%), Positives = 106/285 (37%)

Query:   230 PNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP 288
             P    R       GA GN        P G      G G P   G P +   AG  GA GP
Sbjct:   300 PGERGRTGPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFP---GAPGAKGEAGPTGARGP 356

Query:   289 NTSTSAYAATQS-GTPMRAAYDIPRG----PGYEASKG-PGYDASKAPSYDPTKGPSYDP 342
               +  +     + G+P  A      G    PG + S G PG   + AP +   +GP   P
Sbjct:   357 EGAQGSRGEPGNPGSPGPAGASGNPGTDGIPGAKGSAGAPGI--AGAPGFPGPRGP---P 411

Query:   343 AKGP-GYDPTKGP-GYDAQKG-SNYDAQRGPNYDIH-RGPSYDPQRGLGYDMQRGPNYDM 398
               GP G     GP G   + G + +  ++GP  +    GP   P    G + +RG   + 
Sbjct:   412 --GPQGATGPLGPKGQTGEPGIAGFKGEQGPKGETGPAGPQGAPGPA-GEEGKRGARGEP 468

Query:   399 Q-RGP-GYETQR-VPGYDVQRG-PVYEAQRAPSYIP-QRGP-GYDLQRGQGYDMRRAPSY 452
                GP G   +R  PG    RG P  +    P   P +RGP G    +G   D  R    
Sbjct:   469 GGAGPIGPPGERGAPG---NRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEP 525

Query:   453 DPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP 496
                   G  G P  A P G+V P       G   PP   G+ GQP
Sbjct:   526 GLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQP 570

 Score = 126 (49.4 bits), Expect = 0.00040, P = 0.00040
 Identities = 89/301 (29%), Positives = 117/301 (38%)

Query:   235 RAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPP-PSATTA--GVVGA-GPNT 290
             R A G   G  G+  +     P G    + G G+    GPP P+      G VG  GP+ 
Sbjct:   726 RGAAG-IAGPKGDRGDVGEKGPEGAPGKDGGRGLTGPIGPPGPAGANGEKGEVGPPGPSG 784

Query:   291 STSAYAAT----QSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPA 343
             ST A  A     ++G P  A +  P G  G   +KG  G    K  +  P  +GPS  P 
Sbjct:   785 STGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGDQGEAGQKGDAGAPGPQGPSGAPG 844

Query:   344 -KGP-GYDPTKGP-GYDAQKGSN-YDAQRG----PNYDIHRGPSYDPQRGLGYDMQRGPN 395
              +GP G    KG  G     G+  +    G    P  + + GP+  P    G D  +G  
Sbjct:   845 PQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGPPGSNGNPGPA-GPPGPAGKDGPKGAR 903

Query:   396 YDM----QRG-PGYETQR-VPGYDVQRG---PV-YEAQRAPSYIP-QRG-PGYDLQRGQ- 442
              D     + G PG +     PG   + G   P   +    P  +  QRG  G   QRG+ 
Sbjct:   904 GDTGAPGRAGDPGLQGPAGAPGEKGEPGDDGPSGSDGPPGPQGLAGQRGIVGLPGQRGER 963

Query:   443 GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPA 502
             G+     PS +P +  G  GA     P G V PP    P G    P R GS     G P 
Sbjct:   964 GFPGLPGPSGEPGK-QGAPGASGDRGPPGPVGPPGLTGPAGE---PGREGS-PGADGPPG 1018

Query:   503 R 503
             R
Sbjct:  1019 R 1019

 Score = 125 (49.1 bits), Expect = 0.00051, P = 0.00051
 Identities = 87/281 (30%), Positives = 99/281 (35%)

Query:   237 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 294
             ADG  G      E    G   G    +   G P   GP       G  GA GP  +T   
Sbjct:   812 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 870

Query:   295 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 350
              AA + G P       P GP      GP G D  K    D    G + DP  +GP   P 
Sbjct:   871 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 925

Query:   351 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 406
              KG PG D   GS  D   GP     +G +   QRG+ G   QRG   +    GP  E  
Sbjct:   926 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 976

Query:   407 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 462
              Q  PG    RGP           P   PG +   G      R  A      RG TG  G
Sbjct:   977 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 1036

Query:   463 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 502
             AP    P G  P P    P G       +G+  P G   PA
Sbjct:  1037 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1074

 Score = 123 (48.4 bits), Expect = 0.00084, P = 0.00084
 Identities = 87/286 (30%), Positives = 109/286 (38%)

Query:   238 DGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAA 297
             DG+ G    + E  T G P G        G P G G    A  A + G     +  A   
Sbjct:   113 DGAPGPRGRDGEPGTPGNP-GPPGPPGPPG-PPGLGGGNFA--AQMAGGFDEKAGGAQMG 168

Query:   298 TQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAKGPGYDPTKGP 354
                G PM      PRGP G   + GP G+  +     +P   GP   P   PG  P   P
Sbjct:   169 VMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GPRGPPG--PAGKP 222

Query:   355 GYDAQKGSNYDA-QRG-PNYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQRG----PGYET 406
             G D + G    A +RG P     RG    P  GL G    RG P  D  +G    PG + 
Sbjct:   223 GDDGEAGKPGKAGERGLPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAKGEAGAPGVKG 280

Query:   407 QR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS-----YDPSRGTGF 460
             +   PG +   GP+   +  P    + GP       +G D +  P+       P+ G GF
Sbjct:   281 ESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPGPVGPAGGPGF 338

Query:   461 DGAP--RG-AAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNP 501
              GAP  +G A P G   P       GS   P   GS  P G  GNP
Sbjct:   339 PGAPGAKGEAGPTGARGP---EGAQGSRGEPGNPGSPGPAGASGNP 381


>UNIPROTKB|F1LP41
            symbol:Col2a1 "Collagen alpha-1(II) chain" species:10116
            "Rattus norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 RGD:2375 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 IPI:IPI00205809
            Ensembl:ENSRNOT00000012441 ArrayExpress:F1LP41 Uniprot:F1LP41
        Length = 1458

 Score = 130 (50.8 bits), Expect = 0.00015, P = 0.00015
 Identities = 82/285 (28%), Positives = 106/285 (37%)

Query:   230 PNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP 288
             P    R       GA GN        P G      G G P   G P +   AG  GA GP
Sbjct:   300 PGERGRTGPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFP---GAPGAKGEAGPTGARGP 356

Query:   289 NTSTSAYAATQS-GTPMRAAYDIPRG----PGYEASKG-PGYDASKAPSYDPTKGPSYDP 342
               +  +     + G+P  A      G    PG + S G PG   + AP +   +GP   P
Sbjct:   357 EGAQGSRGEPGNPGSPGPAGASGNPGTDGIPGAKGSAGAPGI--AGAPGFPGPRGP---P 411

Query:   343 AKGP-GYDPTKGP-GYDAQKG-SNYDAQRGPNYDIH-RGPSYDPQRGLGYDMQRGPNYDM 398
               GP G     GP G   + G + +  ++GP  +    GP   P    G + +RG   + 
Sbjct:   412 --GPQGATGPLGPKGQTGEPGIAGFKGEQGPKGETGPAGPQGAPGPA-GEEGKRGARGEP 468

Query:   399 Q-RGP-GYETQR-VPGYDVQRG-PVYEAQRAPSYIP-QRGP-GYDLQRGQGYDMRRAPSY 452
                GP G   +R  PG    RG P  +    P   P +RGP G    +G   D  R    
Sbjct:   469 GGAGPIGPPGERGAPG---NRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEP 525

Query:   453 DPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP 496
                   G  G P  A P G+V P       G   PP   G+ GQP
Sbjct:   526 GLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQP 570

 Score = 126 (49.4 bits), Expect = 0.00040, P = 0.00040
 Identities = 89/301 (29%), Positives = 117/301 (38%)

Query:   235 RAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPP-PSATTA--GVVGA-GPNT 290
             R A G   G  G+  +     P G    + G G+    GPP P+      G VG  GP+ 
Sbjct:   726 RGAAG-IAGPKGDRGDVGEKGPEGAPGKDGGRGLTGPIGPPGPAGANGEKGEVGPPGPSG 784

Query:   291 STSAYAAT----QSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPA 343
             ST A  A     ++G P  A +  P G  G   +KG  G    K  +  P  +GPS  P 
Sbjct:   785 STGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGDQGEAGQKGDAGAPGPQGPSGAPG 844

Query:   344 -KGP-GYDPTKGP-GYDAQKGSN-YDAQRG----PNYDIHRGPSYDPQRGLGYDMQRGPN 395
              +GP G    KG  G     G+  +    G    P  + + GP+  P    G D  +G  
Sbjct:   845 PQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGPPGSNGNPGPA-GPPGPAGKDGPKGAR 903

Query:   396 YDM----QRG-PGYETQR-VPGYDVQRG---PV-YEAQRAPSYIP-QRG-PGYDLQRGQ- 442
              D     + G PG +     PG   + G   P   +    P  +  QRG  G   QRG+ 
Sbjct:   904 GDTGAPGRAGDPGLQGPAGAPGEKGEPGDDGPSGSDGPPGPQGLAGQRGIVGLPGQRGER 963

Query:   443 GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPA 502
             G+     PS +P +  G  GA     P G V PP    P G    P R GS     G P 
Sbjct:   964 GFPGLPGPSGEPGK-QGAPGASGDRGPPGPVGPPGLTGPAGE---PGREGS-PGADGPPG 1018

Query:   503 R 503
             R
Sbjct:  1019 R 1019

 Score = 125 (49.1 bits), Expect = 0.00051, P = 0.00051
 Identities = 87/281 (30%), Positives = 99/281 (35%)

Query:   237 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 294
             ADG  G      E    G   G    +   G P   GP       G  GA GP  +T   
Sbjct:   812 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 870

Query:   295 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 350
              AA + G P       P GP      GP G D  K    D    G + DP  +GP   P 
Sbjct:   871 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 925

Query:   351 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 406
              KG PG D   GS  D   GP     +G +   QRG+ G   QRG   +    GP  E  
Sbjct:   926 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 976

Query:   407 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 462
              Q  PG    RGP           P   PG +   G      R  A      RG TG  G
Sbjct:   977 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 1036

Query:   463 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 502
             AP    P G  P P    P G       +G+  P G   PA
Sbjct:  1037 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1074


>UNIPROTKB|P04280
            symbol:PRB1 "Basic salivary proline-rich protein 1"
            species:9606 "Homo sapiens" [GO:0008150 "biological_process"
            evidence=ND] [GO:0003674 "molecular_function" evidence=ND]
            [GO:0005576 "extracellular region" evidence=NAS] GO:GO:0005576
            PIR:B40750 InterPro:IPR026086 PANTHER:PTHR23203 EMBL:K03204
            EMBL:K03205 EMBL:K03206 EMBL:S52986 EMBL:M97220 EMBL:K02575
            EMBL:K02576 EMBL:X07516 EMBL:X07517 EMBL:S62928 EMBL:S62941
            IPI:IPI00023038 PIR:C38355 PIR:D40750 RefSeq:NP_005030.2
            RefSeq:NP_955385.1 RefSeq:NP_955386.1 UniGene:Hs.631726
            ProteinModelPortal:P04280 STRING:P04280 PhosphoSite:P04280
            DMDM:52001469 PRIDE:P04280 GeneID:5542 KEGG:hsa:5542 CTD:5542
            GeneCards:GC12M011504 HGNC:HGNC:9337 MIM:180989 neXtProt:NX_P04280
            PharmGKB:PA33699 KO:K13911 GenomeRNAi:5542 NextBio:21470
            ArrayExpress:P04280 CleanEx:HS_PRB1 Genevestigator:P04280
            Uniprot:P04280
        Length = 392

 Score = 123 (48.4 bits), Expect = 0.00015, P = 0.00015
 Identities = 76/279 (27%), Positives = 94/279 (33%)

Query:   242 GGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNT-STSAYAATQS 300
             GG          G+P G      G   PQG  PPP     G    G  + S  +      
Sbjct:    43 GGNKPQGPPPPPGKPQGPPP--QGGNKPQG--PPPPGKPQGPPPQGDKSRSPRSPPGKPQ 98

Query:   301 GTPMRAAYDIPRGPGYEASK--GPGYDASKAPSYDPTKG------PSYDPAKGPGYDPTK 352
             G P +     P+GP     K  GP       P   P  G      P  D ++ P   P K
Sbjct:    99 GPPPQGGNQ-PQGPPPPPGKPQGPPPQGGNKPQGPPPPGKPQGPPPQGDKSQSPRSPPGK 157

Query:   353 GPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP---GYETQ-- 407
               G   Q G N      P     +GP   P +G G   Q  P     +GP   G ++Q  
Sbjct:   158 PQGPPPQ-GGNQPQGPPPPPGKPQGP---PPQG-GNKPQGPPPPGKPQGPPPQGDKSQSP 212

Query:   408 RVP-----GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDG 462
             R P     G   Q G   +    P   PQ  P     R QG      P   P +G     
Sbjct:   213 RSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPQQGGNRPQGPPPPGKPQGPPPQGDK-SR 271

Query:   463 APRGAAPHGQVPPPLN-NVPYGSATPPARSGSGQPRGGN 500
             +P+      Q PPP   N P G   PP +     P+GGN
Sbjct:   272 SPQSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPPQGGN 310


>UNIPROTKB|F1PGS0
            symbol:POLR2A "DNA-directed RNA polymerase" species:9615
            "Canis lupus familiaris" [GO:0003899 "DNA-directed RNA polymerase
            activity" evidence=IEA] [GO:0006366 "transcription from RNA
            polymerase II promoter" evidence=IEA] [GO:0005665 "DNA-directed RNA
            polymerase II, core complex" evidence=IEA] [GO:0003677 "DNA
            binding" evidence=IEA] InterPro:IPR000684 InterPro:IPR000722
            InterPro:IPR006592 InterPro:IPR007066 InterPro:IPR007073
            InterPro:IPR007075 InterPro:IPR007080 InterPro:IPR007081
            InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983 Pfam:PF04990
            Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000 Pfam:PF05001
            PROSITE:PS00115 SMART:SM00663 GO:GO:0003677 GO:GO:0006366
            Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899 GO:GO:0005665
            GeneTree:ENSGT00700000104490 OMA:KVLPWST EMBL:AAEX03003616
            EMBL:AAEX03003617 Ensembl:ENSCAFT00000026237 Uniprot:F1PGS0
        Length = 1969

 Score = 144 (55.7 bits), Expect = 0.00015, Sum P(2) = 0.00015
 Identities = 72/276 (26%), Positives = 99/276 (35%)

Query:   218 EVEKLRAELMNAPNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP- 276
             + EK +  +    N+    A G  G   G++ +   G       +  G     G   P  
Sbjct:  1476 DAEKCKYGMEIPTNIPGLGAAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSV 1535

Query:   277 -SATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPT 335
              S  T G  G  P+ ++ A   +   +P  A    P  PG      PG  +   PS    
Sbjct:  1536 GSGMTPGAAGFSPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGA 1588

Query:   336 KGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPN 395
               PSY P   P Y+P    GY  Q  S Y +   P+Y     PSY P     Y     P+
Sbjct:  1589 MSPSYSPTS-PAYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPS 1642

Query:   396 YDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPS 455
             Y     P Y     P Y     P Y +  +PSY P   P Y       Y    +PSY P+
Sbjct:  1643 YS-PTSPSYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPT 1695

Query:   456 RGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARS 491
               +    +P   +P      P +  P  S T P+ S
Sbjct:  1696 SPSYSPTSP-SYSPTSPSYSPTS--PSYSPTSPSYS 1728

 Score = 37 (18.1 bits), Expect = 0.00015, Sum P(2) = 0.00015
 Identities = 8/16 (50%), Positives = 10/16 (62%)

Query:    52 EQKIASQHVEMQKLAT 67
             E  +A + VE Q LAT
Sbjct:   893 EDGLAGESVEFQNLAT 908


>UNIPROTKB|G3MZY8
            symbol:POLR2A "DNA-directed RNA polymerase" species:9913
            "Bos taurus" [GO:0031625 "ubiquitin protein ligase binding"
            evidence=IEA] [GO:0005665 "DNA-directed RNA polymerase II, core
            complex" evidence=IEA] [GO:0004672 "protein kinase activity"
            evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA] [GO:0003899
            "DNA-directed RNA polymerase activity" evidence=IEA] [GO:0006366
            "transcription from RNA polymerase II promoter" evidence=IEA]
            InterPro:IPR000684 InterPro:IPR000722 InterPro:IPR006592
            InterPro:IPR007066 InterPro:IPR007073 InterPro:IPR007075
            InterPro:IPR007080 InterPro:IPR007081 InterPro:IPR007083
            Pfam:PF00623 Pfam:PF04983 Pfam:PF04990 Pfam:PF04992 Pfam:PF04997
            Pfam:PF04998 Pfam:PF05000 Pfam:PF05001 PROSITE:PS00115
            SMART:SM00663 GO:GO:0003677 GO:GO:0004672 GO:GO:0006366
            Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899 GO:GO:0005665
            GeneTree:ENSGT00700000104490 OMA:KVLPWST EMBL:DAAA02048777
            EMBL:DAAA02048778 EMBL:DAAA02048779 EMBL:DAAA02048780
            EMBL:DAAA02048781 Ensembl:ENSBTAT00000064788 Uniprot:G3MZY8
        Length = 1970

 Score = 144 (55.7 bits), Expect = 0.00015, Sum P(2) = 0.00015
 Identities = 72/276 (26%), Positives = 99/276 (35%)

Query:   218 EVEKLRAELMNAPNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP- 276
             + EK +  +    N+    A G  G   G++ +   G       +  G     G   P  
Sbjct:  1477 DAEKCKYGMEIPTNIPGLGAAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSV 1536

Query:   277 -SATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPT 335
              S  T G  G  P+ ++ A   +   +P  A    P  PG      PG  +   PS    
Sbjct:  1537 GSGMTPGAAGFSPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGA 1589

Query:   336 KGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPN 395
               PSY P   P Y+P    GY  Q  S Y +   P+Y     PSY P     Y     P+
Sbjct:  1590 MSPSYSPTS-PAYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPS 1643

Query:   396 YDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPS 455
             Y     P Y     P Y     P Y +  +PSY P   P Y       Y    +PSY P+
Sbjct:  1644 YS-PTSPSYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPT 1696

Query:   456 RGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARS 491
               +    +P   +P      P +  P  S T P+ S
Sbjct:  1697 SPSYSPTSP-SYSPTSPSYSPTS--PSYSPTSPSYS 1729

 Score = 37 (18.1 bits), Expect = 0.00015, Sum P(2) = 0.00015
 Identities = 8/16 (50%), Positives = 10/16 (62%)

Query:    52 EQKIASQHVEMQKLAT 67
             E  +A + VE Q LAT
Sbjct:   893 EDGLAGESVEFQNLAT 908


>UNIPROTKB|P24928
            symbol:POLR2A "DNA-directed RNA polymerase II subunit RPB1"
            species:9606 "Homo sapiens" [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0003968 "RNA-directed RNA polymerase activity"
            evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0003677 "DNA binding" evidence=NAS] [GO:0003899 "DNA-directed
            RNA polymerase activity" evidence=NAS] [GO:0006355 "regulation of
            transcription, DNA-dependent" evidence=NAS] [GO:0006366
            "transcription from RNA polymerase II promoter"
            evidence=IDA;NAS;TAS] [GO:0005634 "nucleus" evidence=IDA;NAS]
            [GO:0005665 "DNA-directed RNA polymerase II, core complex"
            evidence=IDA] [GO:0004672 "protein kinase activity" evidence=IDA]
            [GO:0005730 "nucleolus" evidence=IDA] [GO:0000398 "mRNA splicing,
            via spliceosome" evidence=TAS] [GO:0005654 "nucleoplasm"
            evidence=TAS] [GO:0006281 "DNA repair" evidence=TAS] [GO:0006283
            "transcription-coupled nucleotide-excision repair" evidence=TAS]
            [GO:0006289 "nucleotide-excision repair" evidence=TAS] [GO:0006367
            "transcription initiation from RNA polymerase II promoter"
            evidence=TAS] [GO:0006368 "transcription elongation from RNA
            polymerase II promoter" evidence=TAS] [GO:0006370
            "7-methylguanosine mRNA capping" evidence=TAS] [GO:0008380 "RNA
            splicing" evidence=TAS] [GO:0010467 "gene expression" evidence=TAS]
            [GO:0016032 "viral reproduction" evidence=TAS] [GO:0050434
            "positive regulation of viral transcription" evidence=TAS]
            [GO:0031625 "ubiquitin protein ligase binding" evidence=IPI]
            [GO:0006468 "protein phosphorylation" evidence=IDA]
            Reactome:REACT_216 Reactome:REACT_71 InterPro:IPR000684
            InterPro:IPR000722 InterPro:IPR006592 InterPro:IPR007066
            InterPro:IPR007073 InterPro:IPR007075 InterPro:IPR007080
            InterPro:IPR007081 InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983
            Pfam:PF04990 Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000
            Pfam:PF05001 PROSITE:PS00115 SMART:SM00663 Reactome:REACT_116125
            EMBL:CH471108 GO:GO:0016032 GO:GO:0006355 GO:GO:0046872
            GO:GO:0003677 Reactome:REACT_1675 GO:GO:0006468 GO:GO:0006368
            Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0006367 GO:GO:0000398
            Reactome:REACT_1788 GO:GO:0006370 GO:GO:0050434 GO:GO:0006283
            Reactome:REACT_1892 EMBL:AC113189 GO:GO:0003899 PDB:2GHQ PDB:2GHT
            PDBsum:2GHQ PDBsum:2GHT eggNOG:COG0086 GO:GO:0003968 GO:GO:0005665
            HOGENOM:HOG000222975 OMA:KVLPWST KO:K03006 EMBL:X63564 EMBL:X74874
            EMBL:X74873 EMBL:X74872 EMBL:X74871 EMBL:X74870 EMBL:BC137231
            IPI:IPI00031627 PIR:I38186 PIR:S21054 RefSeq:NP_000928.1
            UniGene:Hs.270017 PDB:2LTO PDBsum:2LTO ProteinModelPortal:P24928
            SMR:P24928 DIP:DIP-29011N IntAct:P24928 MINT:MINT-156582
            STRING:P24928 PhosphoSite:P24928 DMDM:281185484 PaxDb:P24928
            PRIDE:P24928 Ensembl:ENST00000322644 GeneID:5430 KEGG:hsa:5430
            UCSC:uc002ghf.4 CTD:5430 GeneCards:GC17P007387 H-InvDB:HIX0173727
            HGNC:HGNC:9187 HPA:CAB012226 HPA:CAB016388 HPA:CAB022311
            HPA:HPA021563 MIM:180660 neXtProt:NX_P24928 PharmGKB:PA33507
            HOVERGEN:HBG004339 InParanoid:P24928 OrthoDB:EOG4JWVCM
            BindingDB:P24928 ChEMBL:CHEMBL1641353 ChiTaRS:POLR2A
            EvolutionaryTrace:P24928 GenomeRNAi:5430 NextBio:21009
            ArrayExpress:P24928 Bgee:P24928 CleanEx:HS_POLR2A
            Genevestigator:P24928 GermOnline:ENSG00000181222 Uniprot:P24928
        Length = 1970

 Score = 144 (55.7 bits), Expect = 0.00015, Sum P(2) = 0.00015
 Identities = 72/276 (26%), Positives = 99/276 (35%)

Query:   218 EVEKLRAELMNAPNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP- 276
             + EK +  +    N+    A G  G   G++ +   G       +  G     G   P  
Sbjct:  1476 DAEKCKYGMEIPTNIPGLGAAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSV 1535

Query:   277 -SATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPT 335
              S  T G  G  P+ ++ A   +   +P  A    P  PG      PG  +   PS    
Sbjct:  1536 GSGMTPGAAGFSPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGA 1588

Query:   336 KGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPN 395
               PSY P   P Y+P    GY  Q  S Y +   P+Y     PSY P     Y     P+
Sbjct:  1589 MSPSYSPTS-PAYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPS 1642

Query:   396 YDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPS 455
             Y     P Y     P Y     P Y +  +PSY P   P Y       Y    +PSY P+
Sbjct:  1643 YS-PTSPSYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPT 1695

Query:   456 RGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARS 491
               +    +P   +P      P +  P  S T P+ S
Sbjct:  1696 SPSYSPTSP-SYSPTSPSYSPTS--PSYSPTSPSYS 1728

 Score = 37 (18.1 bits), Expect = 0.00015, Sum P(2) = 0.00015
 Identities = 8/16 (50%), Positives = 10/16 (62%)

Query:    52 EQKIASQHVEMQKLAT 67
             E  +A + VE Q LAT
Sbjct:   893 EDGLAGESVEFQNLAT 908


>MGI|MGI:98086
            symbol:Polr2a "polymerase (RNA) II (DNA directed) polypeptide
            A" species:10090 "Mus musculus" [GO:0003677 "DNA binding"
            evidence=IDA] [GO:0003899 "DNA-directed RNA polymerase activity"
            evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0005634 "nucleus" evidence=ISO] [GO:0005665 "DNA-directed RNA
            polymerase II, core complex" evidence=ISO] [GO:0005730 "nucleolus"
            evidence=ISO] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0006366 "transcription from RNA polymerase II
            promoter" evidence=ISO] [GO:0006468 "protein phosphorylation"
            evidence=ISO] [GO:0016740 "transferase activity" evidence=IEA]
            [GO:0016779 "nucleotidyltransferase activity" evidence=IEA]
            [GO:0031625 "ubiquitin protein ligase binding" evidence=ISO]
            [GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000684
            InterPro:IPR000722 InterPro:IPR006592 InterPro:IPR007066
            InterPro:IPR007073 InterPro:IPR007075 InterPro:IPR007080
            InterPro:IPR007081 InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983
            Pfam:PF04990 Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000
            Pfam:PF05001 PROSITE:PS00115 SMART:SM00663 MGI:MGI:98086
            GO:GO:0046872 GO:GO:0003677 GO:GO:0004672 GO:GO:0006366
            EMBL:AL603707 Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899
            eggNOG:COG0086 GO:GO:0005665 GeneTree:ENSGT00700000104490
            HOGENOM:HOG000222975 OMA:KVLPWST KO:K03006 CTD:5430
            HOVERGEN:HBG004339 OrthoDB:EOG4JWVCM ChiTaRS:POLR2A EMBL:M12130
            EMBL:M14101 IPI:IPI00136207 PIR:A28490 RefSeq:NP_033115.1
            UniGene:Mm.16533 DisProt:DP00181 ProteinModelPortal:P08775
            SMR:P08775 DIP:DIP-46369N IntAct:P08775 STRING:P08775
            PhosphoSite:P08775 PaxDb:P08775 PRIDE:P08775
            Ensembl:ENSMUST00000058470 Ensembl:ENSMUST00000071213 GeneID:20020
            KEGG:mmu:20020 UCSC:uc007jrj.1 InParanoid:Q5F298 NextBio:297535
            Bgee:P08775 CleanEx:MM_POLR2A Genevestigator:P08775
            GermOnline:ENSMUSG00000005198 Uniprot:P08775
        Length = 1970

 Score = 144 (55.7 bits), Expect = 0.00015, Sum P(2) = 0.00015
 Identities = 72/276 (26%), Positives = 99/276 (35%)

Query:   218 EVEKLRAELMNAPNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP- 276
             + EK +  +    N+    A G  G   G++ +   G       +  G     G   P  
Sbjct:  1476 DAEKCKYGMEIPTNIPGLGAAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSV 1535

Query:   277 -SATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPT 335
              S  T G  G  P+ ++ A   +   +P  A    P  PG      PG  +   PS    
Sbjct:  1536 GSGMTPGAAGFSPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGA 1588

Query:   336 KGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPN 395
               PSY P   P Y+P    GY  Q  S Y +   P+Y     PSY P     Y     P+
Sbjct:  1589 MSPSYSPTS-PAYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPS 1642

Query:   396 YDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPS 455
             Y     P Y     P Y     P Y +  +PSY P   P Y       Y    +PSY P+
Sbjct:  1643 YS-PTSPSYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPT 1695

Query:   456 RGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARS 491
               +    +P   +P      P +  P  S T P+ S
Sbjct:  1696 SPSYSPTSP-SYSPTSPSYSPTS--PSYSPTSPSYS 1728

 Score = 37 (18.1 bits), Expect = 0.00015, Sum P(2) = 0.00015
 Identities = 8/16 (50%), Positives = 10/16 (62%)

Query:    52 EQKIASQHVEMQKLAT 67
             E  +A + VE Q LAT
Sbjct:   893 EDGLAGESVEFQNLAT 908


>RGD|1587326
            symbol:Polr2a "polymerase (RNA) II (DNA directed) polypeptide A"
            species:10116 "Rattus norvegicus" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0003677 "DNA binding" evidence=IEA;ISO]
            [GO:0003899 "DNA-directed RNA polymerase activity" evidence=IEA]
            [GO:0004672 "protein kinase activity" evidence=IEA;ISO] [GO:0005575
            "cellular_component" evidence=ND] [GO:0005634 "nucleus"
            evidence=ISO] [GO:0005665 "DNA-directed RNA polymerase II, core
            complex" evidence=IEA;ISO] [GO:0006366 "transcription from RNA
            polymerase II promoter" evidence=IEA;ISO] [GO:0006468 "protein
            phosphorylation" evidence=ISO] [GO:0008150 "biological_process"
            evidence=ND] [GO:0031625 "ubiquitin protein ligase binding"
            evidence=IEA;ISO] [GO:0005730 "nucleolus" evidence=ISO]
            InterPro:IPR000684 InterPro:IPR000722 InterPro:IPR006592
            InterPro:IPR007066 InterPro:IPR007073 InterPro:IPR007075
            InterPro:IPR007080 InterPro:IPR007081 InterPro:IPR007083
            Pfam:PF00623 Pfam:PF04983 Pfam:PF04990 Pfam:PF04992 Pfam:PF04997
            Pfam:PF04998 Pfam:PF05000 Pfam:PF05001 PROSITE:PS00115
            SMART:SM00663 RGD:1587326 GO:GO:0003677 GO:GO:0004672 GO:GO:0006366
            Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899 GO:GO:0005665
            GeneTree:ENSGT00700000104490 KO:K03006 CTD:5430 OrthoDB:EOG4JWVCM
            IPI:IPI00952328 RefSeq:XP_001079162.1 RefSeq:XP_343923.3
            UniGene:Rn.163136 Ensembl:ENSRNOT00000068013 GeneID:363633
            KEGG:rno:363633 UCSC:RGD:1587326 NextBio:683839 ArrayExpress:D4A5A6
            Uniprot:D4A5A6
        Length = 1970

 Score = 144 (55.7 bits), Expect = 0.00015, Sum P(2) = 0.00015
 Identities = 72/276 (26%), Positives = 99/276 (35%)

Query:   218 EVEKLRAELMNAPNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP- 276
             + EK +  +    N+    A G  G   G++ +   G       +  G     G   P  
Sbjct:  1476 DAEKCKYGMEIPTNIPGLGAAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSV 1535

Query:   277 -SATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPT 335
              S  T G  G  P+ ++ A   +   +P  A    P  PG      PG  +   PS    
Sbjct:  1536 GSGMTPGAAGFSPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGA 1588

Query:   336 KGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPN 395
               PSY P   P Y+P    GY  Q  S Y +   P+Y     PSY P     Y     P+
Sbjct:  1589 MSPSYSPTS-PAYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPS 1642

Query:   396 YDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPS 455
             Y     P Y     P Y     P Y +  +PSY P   P Y       Y    +PSY P+
Sbjct:  1643 YS-PTSPSYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPT 1695

Query:   456 RGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARS 491
               +    +P   +P      P +  P  S T P+ S
Sbjct:  1696 SPSYSPTSP-SYSPTSPSYSPTS--PSYSPTSPSYS 1728

 Score = 37 (18.1 bits), Expect = 0.00015, Sum P(2) = 0.00015
 Identities = 8/16 (50%), Positives = 10/16 (62%)

Query:    52 EQKIASQHVEMQKLAT 67
             E  +A + VE Q LAT
Sbjct:   893 EDGLAGESVEFQNLAT 908


>UNIPROTKB|P12270
            symbol:TPR "Nucleoprotein TPR" species:9606 "Homo sapiens"
            [GO:0004828 "serine-tRNA ligase activity" evidence=IEA] [GO:0005524
            "ATP binding" evidence=IEA] [GO:0006434 "seryl-tRNA aminoacylation"
            evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0031965 "nuclear membrane" evidence=IDA] [GO:0005643 "nuclear
            pore" evidence=IDA] [GO:0007094 "mitotic spindle assembly
            checkpoint" evidence=IMP] [GO:0000776 "kinetochore" evidence=IDA]
            [GO:0006404 "RNA import into nucleus" evidence=IDA] [GO:0006606
            "protein import into nucleus" evidence=IMP;IDA] [GO:0005635
            "nuclear envelope" evidence=IDA] [GO:0034399 "nuclear periphery"
            evidence=IDA] [GO:0042803 "protein homodimerization activity"
            evidence=IDA] [GO:0042405 "nuclear inclusion body" evidence=IDA]
            [GO:0090267 "positive regulation of mitotic cell cycle spindle
            assembly checkpoint" evidence=IMP] [GO:0090316 "positive regulation
            of intracellular protein transport" evidence=IMP] [GO:1901673
            "regulation of spindle assembly involved in mitosis" evidence=IMP]
            [GO:0035457 "cellular response to interferon-alpha" evidence=ISS]
            [GO:0005634 "nucleus" evidence=IDA] [GO:0000122 "negative
            regulation of transcription from RNA polymerase II promoter"
            evidence=IMP] [GO:0046832 "negative regulation of RNA export from
            nucleus" evidence=IDA;IMP] [GO:0045947 "negative regulation of
            translational initiation" evidence=IMP] [GO:0031647 "regulation of
            protein stability" evidence=IMP] [GO:0010793 "regulation of mRNA
            export from nucleus" evidence=IMP] [GO:0042306 "regulation of
            protein import into nucleus" evidence=IMP] [GO:0046825 "regulation
            of protein export from nucleus" evidence=IMP] [GO:0005487
            "nucleocytoplasmic transporter activity" evidence=IDA] [GO:0031453
            "positive regulation of heterochromatin assembly" evidence=IMP]
            [GO:0044615 "nuclear pore nuclear basket" evidence=IDA] [GO:0005737
            "cytoplasm" evidence=IDA] [GO:0019898 "extrinsic to membrane"
            evidence=IDA] [GO:0043495 "protein anchor" evidence=IMP]
            [GO:0051019 "mitogen-activated protein kinase binding"
            evidence=IDA] [GO:0070849 "response to epidermal growth factor
            stimulus" evidence=IDA] [GO:0000189 "MAPK import into nucleus"
            evidence=IMP] [GO:0042307 "positive regulation of protein import
            into nucleus" evidence=IMP] [GO:0070840 "dynein complex binding"
            evidence=IDA] [GO:0005868 "cytoplasmic dynein complex"
            evidence=IDA] [GO:0015631 "tubulin binding" evidence=IDA]
            [GO:0072686 "mitotic spindle" evidence=IDA] [GO:0010965 "regulation
            of mitotic sister chromatid separation" evidence=IMP] [GO:0046827
            "positive regulation of protein export from nucleus" evidence=ISS]
            [GO:0031990 "mRNA export from nucleus in response to heat stress"
            evidence=IDA] [GO:0031072 "heat shock protein binding"
            evidence=IDA] [GO:0034605 "cellular response to heat" evidence=IDA]
            [GO:0003682 "chromatin binding" evidence=IDA] [GO:0003729 "mRNA
            binding" evidence=IDA] [GO:0006999 "nuclear pore organization"
            evidence=IMP] [GO:0043578 "nuclear matrix organization"
            evidence=IMP] [GO:0006611 "protein export from nucleus"
            evidence=IMP] [GO:0005215 "transporter activity" evidence=IMP]
            [GO:0006405 "RNA export from nucleus" evidence=IMP] [GO:0051292
            "nuclear pore complex assembly" evidence=IMP] [GO:0005654
            "nucleoplasm" evidence=TAS] [GO:0005975 "carbohydrate metabolic
            process" evidence=TAS] [GO:0008645 "hexose transport" evidence=TAS]
            [GO:0010827 "regulation of glucose transport" evidence=TAS]
            [GO:0015758 "glucose transport" evidence=TAS] [GO:0016032 "viral
            reproduction" evidence=TAS] [GO:0019221 "cytokine-mediated
            signaling pathway" evidence=TAS] [GO:0044281 "small molecule
            metabolic process" evidence=TAS] [GO:0055085 "transmembrane
            transport" evidence=TAS] Reactome:REACT_111217 Reactome:REACT_15518
            InterPro:IPR012929 InterPro:IPR015866 Pfam:PF07926 GO:GO:0005524
            GO:GO:0005737 Reactome:REACT_116125 Reactome:REACT_6900
            GO:GO:0005654 GO:GO:0016032 GO:GO:0007094 GO:GO:0044281
            GO:GO:0005975 GO:GO:0031965 EMBL:CH471067 GO:GO:0005643
            GO:GO:0019221 GO:GO:0015758 GO:GO:0010827 GO:GO:0055085
            GO:GO:0006606 eggNOG:NOG12793 KO:K09291 GO:GO:0051028 GO:GO:0000777
            InterPro:IPR009053 SUPFAM:SSF46579 MIM:188550 Orphanet:146
            EMBL:AL133553 EMBL:X62947 PIR:S23741 EMBL:AL596220 GO:GO:0004828
            GO:GO:0006434 Gene3D:1.10.287.40 EMBL:X66397 EMBL:Y00672
            IPI:IPI00742682 RefSeq:NP_003283.2 UniGene:Hs.279640
            ProteinModelPortal:P12270 IntAct:P12270 MINT:MINT-1144652
            STRING:P12270 PhosphoSite:P12270 DMDM:215274208 PaxDb:P12270
            PRIDE:P12270 Ensembl:ENST00000367478 GeneID:7175 KEGG:hsa:7175
            UCSC:uc001grv.3 CTD:7175 GeneCards:GC01M186281 HGNC:HGNC:12017
            HPA:HPA019661 HPA:HPA019663 HPA:HPA024336 MIM:189940
            neXtProt:NX_P12270 PharmGKB:PA36696 HOGENOM:HOG000139431
            HOVERGEN:HBG009158 InParanoid:P12270 OMA:RFIRREK OrthoDB:EOG42RD6D
            GenomeRNAi:7175 NextBio:28128 PMAP-CutDB:P12270 ArrayExpress:P12270
            Bgee:P12270 CleanEx:HS_TPR Genevestigator:P12270
            GermOnline:ENSG00000047410 Uniprot:P12270
        Length = 2363

 Score = 128 (50.1 bits), Expect = 0.00016, Sum P(2) = 0.00016
 Identities = 41/186 (22%), Positives = 88/186 (47%)

Query:    48 PEVME-QKIASQ---HVE-MQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKS 102
             P+  E +K+ S+   H + +Q+L  E  RL A        L   Q+ +Q L   +  +++
Sbjct:  1349 PDTEEYRKLLSEKEVHTKRIQQLTEEIGRLKAEIARSNASLTNNQNLIQSLKEDLNKVRT 1408

Query:   103 ERELQMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQR 162
             E+E   ++L  KI  ++ ++KT   VK   ++ KT+ + L   +++++    Q + D Q 
Sbjct:  1409 EKETIQKDLDAKIIDIQEKVKTITQVKKIGRRYKTQYEELKAQQDKVMETSAQSSGDHQE 1468

Query:   163 AHTDVQQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKL 222
              H  VQ++  L   L     +        E  +K  ++     + +++  + + +E+ +L
Sbjct:  1469 QHVSVQEMQELKETLNQAETKSKSLESQVENLQKTLSEKETEARNLQEQTVQLQSELSRL 1528

Query:   223 RAELMN 228
             R +L +
Sbjct:  1529 RQDLQD 1534

 Score = 55 (24.4 bits), Expect = 0.00016, Sum P(2) = 0.00016
 Identities = 17/54 (31%), Positives = 21/54 (38%)

Query:   243 GATGNSENETSGRPVGQNAYE--DGYGVPQGHGPPPSATTAGVVGAGPNTSTSA 294
             G  G   NE +G   G + YE  D  G   G G  P   T   +G G     +A
Sbjct:  1982 GDEGEDSNEGTGSADGNDGYEADDAEG---GDGTDPGTETEESMGGGEGNHRAA 2032


>UNIPROTKB|F1S300
            symbol:TPR "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0031965 "nuclear membrane" evidence=IEA] [GO:0007094
            "mitotic spindle assembly checkpoint" evidence=IEA] [GO:0005643
            "nuclear pore" evidence=IEA] [GO:0000776 "kinetochore"
            evidence=IEA] [GO:0006606 "protein import into nucleus"
            evidence=IEA] [GO:0006434 "seryl-tRNA aminoacylation" evidence=IEA]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005524 "ATP binding"
            evidence=IEA] [GO:0004828 "serine-tRNA ligase activity"
            evidence=IEA] InterPro:IPR012929 InterPro:IPR015866 Pfam:PF07926
            GO:GO:0005524 GO:GO:0005737 GO:GO:0000776 GO:GO:0007094
            GO:GO:0031965 GO:GO:0005643 GO:GO:0006606 InterPro:IPR009053
            SUPFAM:SSF46579 GeneTree:ENSGT00700000104019 GO:GO:0004828
            GO:GO:0006434 Gene3D:1.10.287.40 OMA:RFIRREK EMBL:CU657929
            EMBL:FP340191 Ensembl:ENSSSCT00000016969 Uniprot:F1S300
        Length = 2365

 Score = 128 (50.1 bits), Expect = 0.00016, Sum P(2) = 0.00016
 Identities = 43/187 (22%), Positives = 88/187 (47%)

Query:    48 PEVME-QKIASQ---HVE-MQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKS 102
             P+  E +K+ S+   H + +Q+L  E  RL A        L   Q+ +Q L   +  +++
Sbjct:  1349 PDTEEYRKLLSEKEVHTKRIQQLTEEIGRLKAEIARSNASLTNNQNLIQSLKEDLNKVRT 1408

Query:   103 ERELQMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQR 162
             E+E   ++L  KI  ++ ++KT   VK   ++ KT+ + L   +++++    Q + D Q 
Sbjct:  1409 EKESIQKDLDAKIIDIQEKVKTITQVKKIGRRYKTQYEELKAQQDKVMETSAQSSGDHQE 1468

Query:   163 AHTDVQQIPALLSELESLRQEYHHCRGTYE-YEKKFYNDHLESLQVMEKNYITMATEVEK 221
              H  VQ++  L   L     +        E  +K      +E+  + E+  + + +E+ +
Sbjct:  1469 QHVSVQEMQELKEALNQAEAKSKSLESQVENLQKTLSEKEMEARNLQEQT-VQLQSELSR 1527

Query:   222 LRAELMN 228
             LR +L +
Sbjct:  1528 LRQDLQD 1534

 Score = 55 (24.4 bits), Expect = 0.00016, Sum P(2) = 0.00016
 Identities = 17/54 (31%), Positives = 22/54 (40%)

Query:   243 GATGNSENETSGRPVGQNAYE--DGYGVPQGHGPPPSATTAGVVGAGPNTSTSA 294
             G  G   NE +G   G + YE  D  G   G G  P   T   +G G +   +A
Sbjct:  1984 GDEGEVSNEGTGSADGNDGYEADDAEG---GDGTDPGTETEESMGGGESNQRAA 2034


>ZFIN|ZDB-GENE-041008-78
            symbol:polr2a "polymerase (RNA) II (DNA directed)
            polypeptide A" species:7955 "Danio rerio" [GO:0003677 "DNA binding"
            evidence=IEA] [GO:0003899 "DNA-directed RNA polymerase activity"
            evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0005665 "DNA-directed RNA polymerase II, core
            complex" evidence=IEA] [GO:0006366 "transcription from RNA
            polymerase II promoter" evidence=IEA] [GO:0016740 "transferase
            activity" evidence=IEA] [GO:0016779 "nucleotidyltransferase
            activity" evidence=IEA] InterPro:IPR000684 InterPro:IPR000722
            InterPro:IPR006592 InterPro:IPR007066 InterPro:IPR007073
            InterPro:IPR007075 InterPro:IPR007080 InterPro:IPR007081
            InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983 Pfam:PF04990
            Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000 Pfam:PF05001
            PROSITE:PS00115 SMART:SM00663 ZFIN:ZDB-GENE-041008-78 GO:GO:0003677
            GO:GO:0006366 Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899
            GO:GO:0005665 GeneTree:ENSGT00700000104490 EMBL:AL929346
            IPI:IPI00608319 Ensembl:ENSDART00000077495 Bgee:F1Q9K4
            Uniprot:F1Q9K4
        Length = 1965

 Score = 131 (51.2 bits), Expect = 0.00016, P = 0.00016
 Identities = 67/234 (28%), Positives = 87/234 (37%)

Query:   271 GHGPPPSATTAGVVGAGPNTSTSAYAATQ----SG-TPMRAAYDIPRGPGYEASKGPGYD 325
             G  P P +  +  +      +T AY A      SG TP  A +  P      +   PGY 
Sbjct:  1501 GSAPSPMSGMSPAMTPWNTGATPAYGAWSPSVGSGMTPGAAGFS-PSAASDASGFSPGYS 1559

Query:   326 A--SKAPSYDPTKGPS--YDPAKG---PGYDPTKGPGYDAQK-GSNYDAQRGPNYDIHRG 377
                S  P    + GP+  Y P+ G   P Y PT  P Y+ +  G  Y  Q  P Y     
Sbjct:  1560 PAWSPTPGSPGSPGPASPYIPSPGALSPNYSPTS-PAYEPRSPGGGYTPQ-SPGYS-PTS 1616

Query:   378 PSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYD 437
             PSY P     Y     PNY     P Y     P Y     P Y +  +PSY P   P Y 
Sbjct:  1617 PSYSPTSP-SYS-PTSPNYS-PTSPSYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS 1669

Query:   438 LQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARS 491
                   Y    +PSY P+  +    +P   +P      P +  P  S T P+ S
Sbjct:  1670 -PTSPSYSPT-SPSYSPTSPSYSPTSP-SYSPTSPSYSPTS--PSYSPTSPSYS 1718


>ZFIN|ZDB-GENE-040426-2678
            symbol:pdcd6ip "programmed cell death 6
            interacting protein" species:7955 "Danio rerio" [GO:0003674
            "molecular_function" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] InterPro:IPR025304 Pfam:PF13949
            ZFIN:ZDB-GENE-040426-2678 Gene3D:1.25.40.280 InterPro:IPR004328
            Pfam:PF03097 SMART:SM01041 PROSITE:PS51180
            GeneTree:ENSGT00670000098017 EMBL:CU469582 IPI:IPI00503522
            Ensembl:ENSDART00000028592 ArrayExpress:F1Q5T7 Bgee:F1Q5T7
            Uniprot:F1Q5T7
        Length = 873

 Score = 127 (49.8 bits), Expect = 0.00017, P = 0.00017
 Identities = 74/329 (22%), Positives = 123/329 (37%)

Query:    79 LRQELAA---AQHELQILHGQIGGMKSERELQMRNLTEKIAKMEAELKTAEPVKLEFQKS 135
             LR +LA     + E ++L G++  +  +   +      +   +  E+ T+  +   +   
Sbjct:   556 LRSQLAQLDEVKREREVLEGEVKSVTFDLTAKFLTALAQDGAINEEVMTSSELDARYGSH 615

Query:   136 KTEAQNLVVAREELIAKV---HQLTQDLQRAHTDVQQIPALLSELESLRQEYHHCRGTYE 192
                 Q  +  +EEL++++   HQ    L++++++      +L +L S    Y       +
Sbjct:   616 NQRVQQNLRRQEELLSQIQVSHQEFSALKQSNSEANTREDVLKKLASAHDSYIEISSNIK 675

Query:   193 YEKKFYNDHLESLQVMEKNY--ITMA--TEVEKLRAELMNAPNVDRRAADGSYGGATGNS 248
                KFYND  E L   +     I  A  TE ++L  EL  +   +  A   S      N+
Sbjct:   676 EGTKFYNDLTEILLKFQNKCSDIVFARKTERDELLKELQQSIAREPSAPSFSVPSYQSNT 735

Query:   249 ENETSG-RPVGQNAY--EDGYGVPQ--GHGPPPSATTAGVVGAGPNTSTSAYAATQSGTP 303
                  G  P  +  +  +     PQ     PPPS        A P    SA  A  S  P
Sbjct:   736 PAPAGGPTPAPRTVFSQQQPQAKPQPPARPPPPSIAPQAASAAVP---VSAPMAPGSSNP 792

Query:   304 MRAAYDIPRGPGYEASKGPGYDASKA-PSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGS 362
                A   P GP    ++GP Y + +  P Y      +Y+P     Y+    P Y AQ  +
Sbjct:   793 PPVA---PTGPSQ--AQGPPYPSYQGYPGYYQMP-MAYNPYAYGQYNMPYMP-YQAQGQA 845

Query:   363 NYDAQRGPNYDIHRGPSYDPQRGLGYDMQ 391
              Y          +  P   PQ+   Y  Q
Sbjct:   846 GYPGAPATQQP-YPYPQQPPQQQPYYPQQ 873


>UNIPROTKB|O46392
            symbol:COL1A2 "Collagen alpha-2(I) chain" species:9615
            "Canis lupus familiaris" [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 GO:GO:0046872 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 eggNOG:NOG12793 GO:GO:0005201 HOGENOM:HOG000085654
            HOVERGEN:HBG004933 KO:K06236 CTD:1278 EMBL:AF035120
            RefSeq:NP_001003187.1 UniGene:Cfa.1262 STRING:O46392 GeneID:403824
            KEGG:cfa:403824 NextBio:20817320 Uniprot:O46392
        Length = 1366

 Score = 129 (50.5 bits), Expect = 0.00017, P = 0.00017
 Identities = 90/303 (29%), Positives = 110/303 (36%)

Query:   223 RAELMNAPNVDRRAADGSYGGATG-NSENETSGRP--VGQNAYEDGYGVPQGHGPPPSAT 279
             R E+   P V          GA G       +G P   G        G+P   G   +  
Sbjct:   282 RGEV-GLPGVSGPVGPPGNPGANGLTGAKGAAGLPGVAGAPGLPGPRGIPGPVGAAGATG 340

Query:   280 TAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGPGYDASKA-PSYDPTKG 337
               G+VG  P  + S   +   G P  A    P GP G E  +GP  +A  A PS  P  G
Sbjct:   341 ARGIVGE-PGPAGSKGESGNKGEPGSAGAQGPPGPSGEEGKRGPNGEAGSAGPSGPP--G 397

Query:   338 PSYDP-AKG-PGYDPTKGP-GYDAQKGSNYDAQ-RGPNYDIHRGPSYDPQRGLGYDMQRG 393
                 P ++G PG D   G  G    +G+   A  RGPN D  R P  +P    G    RG
Sbjct:   398 LRGSPGSRGLPGADGPAGVMGPPGPRGATGPAGVRGPNGDSGR-PG-EP----GLMGPRG 451

Query:   394 -PNYDMQRGP-GYE-TQRVPGYDVQRGPVYEA--QRAPSYIPQRGP-GYDLQRGQGYDMR 447
              P      GP G E    +PG D + GP+  A  +  P  I   GP G     G+  D  
Sbjct:   452 FPGAPGNVGPAGKEGPMGLPGIDGRPGPIGPAGARGEPGNIGFPGPKGPTGDPGKNGDKG 511

Query:   448 RAPSYDPSRGT-GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-----GNP 501
              A     +RG  G DG      P G           G A PP   G   P G     G P
Sbjct:   512 HA-GLAGARGAPGPDGNNGAQGPPGPQGVQGGKGEQGPAGPPGFQGLPGPAGTAGEVGKP 570

Query:   502 ARR 504
               R
Sbjct:   571 GER 573


>UNIPROTKB|F1LQ00
            symbol:Col5a2 "Protein Col5a2" species:10116 "Rattus
            norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 RGD:70921 GO:GO:0043588 GO:GO:0030199
            GO:GO:0001501 InterPro:IPR008160 Pfam:PF01391 GO:GO:0071230
            GO:GO:0005201 GO:GO:0048592 GeneTree:ENSGT00660000095287
            GO:GO:0005588 IPI:IPI00366945 Ensembl:ENSRNOT00000005073
            Uniprot:F1LQ00
        Length = 1467

 Score = 129 (50.5 bits), Expect = 0.00019, P = 0.00019
 Identities = 88/292 (30%), Positives = 109/292 (37%)

Query:   232 VDRRAADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGP 288
             V  + A+G+ G  GA G   +     P G    E G   P+G  GPP S    G  G   
Sbjct:   750 VGEKGAEGTAGNDGARGLPGSLGPPGPAGPTG-EKGEPGPRGLVGPPGSRGNPGSRGENG 808

Query:   289 NTSTSAYAATQS--GTP-MRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAK 344
              T    +A  Q   G P ++     P   G   S GP G   S  P + P   P     +
Sbjct:   809 PTGAVGFAGPQGPDGQPGVKGEPGEPGQKGDAGSPGPQGLAGSPGP-HGPHGVPGLKGGR 867

Query:   345 GPGYDP--TKGPGYDAQKGSNYDAQR-GPNYDIHRGPSYDPQRGLGYDM-QRGPNYDM-Q 399
             G    P  T  PG   + G    A   GP   I   P  +   GL  D    G   D   
Sbjct:   868 GTQGPPGATGFPGSAGRVGPPGPAGAPGPAGPIGE-PGKEGPPGLRGDPGSHGRVGDRGP 926

Query:   400 RGP-GYETQRV-PGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYDP 454
              GP G    +  PG D Q GP  +    P+    QRG  G   QRG+ G      P+  P
Sbjct:   927 AGPPGSPGDKGDPGEDGQPGP--DGPPGPAGTTGQRGIVGMPGQRGERGMPGLPGPAGTP 984

Query:   455 SRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP-RGGNPARR 504
              +  G  GA     P G V PP +N P G   P   +G+ G P R G    R
Sbjct:   985 GK-VGPTGATGDKGPPGPVGPPGSNGPVGEPGPEGPAGNDGTPGRDGAVGER 1035


>RGD|1311417
            symbol:Col7a1 "collagen, type VII, alpha 1" species:10116
            "Rattus norvegicus" [GO:0003674 "molecular_function" evidence=ND]
            [GO:0005575 "cellular_component" evidence=ND] [GO:0005604 "basement
            membrane" evidence=ISO] [GO:0008150 "biological_process"
            evidence=ND] InterPro:IPR002035 InterPro:IPR003961 Pfam:PF00041
            Pfam:PF00092 PROSITE:PS50234 PROSITE:PS50853 SMART:SM00060
            SMART:SM00327 RGD:1311417 Gene3D:2.60.40.10 InterPro:IPR013783
            SUPFAM:SSF49265 InterPro:IPR008160 Pfam:PF01391 IPI:IPI00951759
            Ensembl:ENSRNOT00000066518 UCSC:RGD:1311417 ArrayExpress:D3ZQ14
            Uniprot:D3ZQ14
        Length = 2585

 Score = 131 (51.2 bits), Expect = 0.00021, P = 0.00021
 Identities = 75/262 (28%), Positives = 96/262 (36%)

Query:   254 GRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG 313
             G P G    +   G P   GPP S    GV G+     +  ++  +     R     P+G
Sbjct:  1285 GAP-GSTQAKGERGFPGPEGPPGSPGLPGVPGSPGVKGSPGWSGPRGDRGERGPQG-PKG 1342

Query:   314 ----PGYEASKG-PGYDASKAPSYDPTKGPSYDPAK-GPGYDPTKGP-GYDAQKGSNYDA 366
                 PG     G PG    K    DP  GPS  P   GP  DP  GP G     G++   
Sbjct:  1343 EPGEPGQVIGGGRPGLPGKKG---DP--GPSGPPGPHGPLGDP--GPRGPPGLPGTSVKG 1395

Query:   367 QRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVPGYDVQRGPVYEAQRA 425
              +G   +  RGP   P  G G   Q  P      G PG   Q  PG   ++G   + +  
Sbjct:  1396 DKGDRGE--RGP---PGPGTGASEQGSPGLPGLPGSPG--PQGPPGRTGEKGEKGDCEDG 1448

Query:   426 PSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGT-GFDGAPRGAAPHGQVPPPLNNVPYGS 484
                +P + PG   + G    +R AP     +G  G  G P      G+  PP    P G 
Sbjct:  1449 GPGLPGQ-PGVPGEPG----LRGAPGVTGPKGDRGLTGTPGEPGEKGERGPPGPVGPQGL 1503

Query:   485 ATPPARSGSGQPRG--GNPARR 504
                  R G   P G  G P RR
Sbjct:  1504 PGAAGRPGVEGPEGPPGPPGRR 1525


>WB|WBGene00001263
            symbol:emb-9 species:6239 "Caenorhabditis elegans"
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA;TAS] [GO:0005581 "collagen" evidence=IEA] [GO:0040010
            "positive regulation of growth rate" evidence=IMP] [GO:0008340
            "determination of adult lifespan" evidence=IMP] [GO:0000003
            "reproduction" evidence=IMP] [GO:0009792 "embryo development ending
            in birth or egg hatching" evidence=IMP] [GO:0040007 "growth"
            evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] [GO:0040011 "locomotion" evidence=IMP] [GO:0006898
            "receptor-mediated endocytosis" evidence=IMP] [GO:0040039
            "inductive cell migration" evidence=IMP] [GO:0030198 "extracellular
            matrix organization" evidence=IMP] [GO:0009790 "embryo development"
            evidence=IMP] [GO:0050714 "positive regulation of protein
            secretion" evidence=IMP] [GO:0007517 "muscle organ development"
            evidence=IMP] [GO:0005604 "basement membrane" evidence=IDA]
            InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
            GO:GO:0008340 GO:GO:0009792 GO:GO:0006898 GO:GO:0040007
            GO:GO:0040010 GO:GO:0002119 InterPro:IPR016187 SUPFAM:SSF56436
            GO:GO:0030198 GO:GO:0000003 GO:GO:0050714 GO:GO:0007517
            GO:GO:0040039 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            eggNOG:NOG12793 GO:GO:0005604 GO:GO:0005201 HOGENOM:HOG000085652
            Gene3D:2.170.240.10 EMBL:X56979 EMBL:Z27078 EMBL:J05067 PIR:S40991
            RefSeq:NP_001022662.1 RefSeq:NP_001022663.1
            ProteinModelPortal:P17139 SMR:P17139 IntAct:P17139
            MINT:MINT-1091171 STRING:P17139 PaxDb:P17139 PRIDE:P17139
            EnsemblMetazoa:K04H4.1a GeneID:176314 KEGG:cel:CELE_K04H4.1
            UCSC:K04H4.1b CTD:176314 WormBase:K04H4.1a WormBase:K04H4.1b
            GeneTree:ENSGT00690000101772 InParanoid:P17139 OMA:EEGIPGC
            NextBio:892048 Uniprot:P17139
        Length = 1759

 Score = 129 (50.5 bits), Expect = 0.00023, P = 0.00023
 Identities = 83/301 (27%), Positives = 108/301 (35%)

Query:   223 RAELMNAPNVDRRAADG---SYGGATGNSENETSGRP----VGQNAYEDGY-GVP--QGH 272
             + +L +A    +R  DG   +YG      E    G P        A E GY G P  +G 
Sbjct:   296 KGDLGDAGQRGKRGKDGVPGNYGEKGSQGEQGLGGTPGYPGTKGGAGEPGYPGRPGFEGD 355

Query:   273 GPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKG-PGYDASKAP 330
               P      G   AGP+     +   Q G  +     +P GP G     G PG  A   P
Sbjct:   356 CGPEGPLGEGTGEAGPH-GAQGFDGVQGGKGLPGHDGLP-GPVGPRGPVGAPG--APGQP 411

Query:   331 SYDPTKGPSYDPAKGP-GYDPTKG-PGYDAQKGS-NYDAQRG-PNYDIHRGPSYDPQRGL 386
               D   G +    +G  GY    G PG   + G   Y  + G P YDI   P  D Q G 
Sbjct:   412 GIDGMPGYTEKGDRGEDGYPGFAGEPGLPGEPGDCGYPGEDGLPGYDIQGPPGLDGQSGR 471

Query:   387 -GYDMQRGPNYDMQRGPGYETQR-VPGYDVQR-GP--VYEAQRAPSYIPQR-G-PGYDLQ 439
              G+    G   D    PGY  ++  PG  V + GP  +      P  +P R G  GY   
Sbjct:   472 DGFPGIPGDIGD----PGYSGEKGFPGTGVNKVGPPGMTGLPGEPG-MPGRIGVDGYPGP 526

Query:   440 RGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 499
              G   +      Y P    G  G P     +G   PP  N  +G    P   G  +  G 
Sbjct:   527 PGNNGERGEDCGYCPDGVPGNAGDPGFPGMNGYPGPPGPNGDHGDCGMPGAPGKPRSAGS 586

Query:   500 N 500
             +
Sbjct:   587 D 587


>UNIPROTKB|F1MSR8
            symbol:COL2A1 "Collagen alpha-1(II) chain" species:9913
            "Bos taurus" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0005201 GeneTree:ENSGT00660000095287 KO:K06236
            UniGene:Bt.21390 GeneID:407142 KEGG:bta:407142 CTD:1280
            NextBio:20818406 EMBL:DAAA02012985 EMBL:DAAA02012986
            IPI:IPI00786510 RefSeq:NP_001106695.1 PRIDE:F1MSR8
            Ensembl:ENSBTAT00000017509 Uniprot:F1MSR8
        Length = 1418

 Score = 128 (50.1 bits), Expect = 0.00023, P = 0.00023
 Identities = 88/282 (31%), Positives = 102/282 (36%)

Query:   239 GSYGGATGNSENETSGRPVGQNAYEDGYGVP--QGH-GPPPSATTAGVVGAG--PNTSTS 293
             G  G A  N E    G P G        G P  +G  GPP  A  AG  GA   P     
Sbjct:   723 GPPGPAGANGEKGEVGPP-GPAGTAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGE 781

Query:   294 AYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAP--SYDPTKGPSYDPAKG---- 345
                A Q G    A    P+GP G    +GP G    K    +  P     +  A G    
Sbjct:   782 QGEAGQKGD---AGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGP 838

Query:   346 PGYDPTKGP-GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY 404
             PG +   GP G     G   D  +G      RG S  P R     +Q GP      GP  
Sbjct:   839 PGSNGNPGPPGPPGPSGK--DGPKGA-----RGDSGPPGRAGDPGLQ-GP-----AGPPG 885

Query:   405 ETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYDPSRGTGFD 461
             E    PG D   GP  +    P  +  QRG  G   QRG+ G+     PS +P +  G  
Sbjct:   886 EKGE-PGDDGPSGP--DGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGK-QGAP 941

Query:   462 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 503
             GA     P G V PP    P G    P R GS     G P R
Sbjct:   942 GASGDRGPPGPVGPPGLTGPAGE---PGREGS-PGADGPPGR 979

 Score = 128 (50.1 bits), Expect = 0.00023, P = 0.00023
 Identities = 89/296 (30%), Positives = 112/296 (37%)

Query:   230 PNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATT--AGVVGAG 287
             P  DR   D    GA G    +  G P G        G P   GPP       A + G  
Sbjct:    64 PRGDR--GDKGEKGAPGPRGRD--GEP-GTPGNPGPPGPPGPPGPPGLGGNFAAQMAGGF 118

Query:   288 PNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAK 344
                +  A      G PM      PRGP G   + GP G+  +     +P   GP   P +
Sbjct:   119 DEKAGGAQMGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GP-R 173

Query:   345 GPGYDPTKGPGYDAQKGS-NYDAQRGP-NYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQR 400
             GP   P K PG D + G      +RGP      RG    P  GL G    RG P  D  +
Sbjct:   174 GPPGPPGK-PGDDGEAGKPGKSGERGPPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAK 230

Query:   401 G----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS---- 451
             G    PG + +   PG +   GP+   +  P    + GP       +G D +  P+    
Sbjct:   231 GEAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPG 288

Query:   452 -YDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATP---PARSGSGQPRG--GNP 501
                P+ G GF GAP GA   G+  P     P G+  P   P   GS  P G  GNP
Sbjct:   289 PVGPAGGPGFPGAP-GAK--GEAGPTGARGPEGAQGPRGEPGTPGSPGPAGAAGNP 341

 Score = 125 (49.1 bits), Expect = 0.00049, P = 0.00049
 Identities = 82/285 (28%), Positives = 105/285 (36%)

Query:   230 PNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP 288
             P    R       GA GN        P G      G G P   G P +   AG  GA GP
Sbjct:   260 PGERGRTGPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFP---GAPGAKGEAGPTGARGP 316

Query:   289 NTSTSAYAATQS-GTPMRA-AYDIPRGPGYEASKG----PGYDASKAPSYDPTKGPSYDP 342
               +        + G+P  A A   P   G   +KG    PG   + AP +   +GP   P
Sbjct:   317 EGAQGPRGEPGTPGSPGPAGAAGNPGTDGIPGAKGSAGAPGI--AGAPGFPGPRGP---P 371

Query:   343 AKGP-GYDPTKGP-GYDAQKG-SNYDAQRGPNYDIH-RGPSYDPQRGLGYDMQRGPNYDM 398
               GP G     GP G   + G + +  ++GP  +    GP   P    G + +RG   + 
Sbjct:   372 --GPQGATGPLGPKGQTGEPGIAGFKGEQGPKGEPGPAGPQGAPGPA-GEEGKRGARGEP 428

Query:   399 Q-RGP-GYETQR-VPGYDVQRG-PVYEAQRAPSYIP-QRGP-GYDLQRGQGYDMRRAPSY 452
                GP G   +R  PG    RG P  +    P   P +RGP G    +G   D  R    
Sbjct:   429 GGAGPAGPPGERGAPG---NRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEP 485

Query:   453 DPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP 496
                   G  G P  A P G+V P       G   PP   G+ GQP
Sbjct:   486 GLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQP 530


>UNIPROTKB|C9JPE6
            symbol:SLMAP "Sarcolemmal membrane-associated protein"
            species:9606 "Homo sapiens" [GO:0006457 "protein folding"
            evidence=IEA] [GO:0016272 "prefoldin complex" evidence=IEA]
            [GO:0051082 "unfolded protein binding" evidence=IEA]
            InterPro:IPR002777 Pfam:PF01920 GO:GO:0006457 GO:GO:0016272
            EMBL:AC114480 HGNC:HGNC:16643 ChiTaRS:SLMAP HOGENOM:HOG000072709
            EMBL:AC099777 IPI:IPI01019103 ProteinModelPortal:C9JPE6
            STRING:C9JPE6 Ensembl:ENST00000442599 UCSC:uc011bez.1
            ArrayExpress:C9JPE6 Bgee:C9JPE6 Uniprot:C9JPE6
        Length = 296

 Score = 119 (46.9 bits), Expect = 0.00023, P = 0.00023
 Identities = 50/195 (25%), Positives = 90/195 (46%)

Query:    50 VMEQKIASQHVEMQKLATE-NQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQM 108
             +++ ++   H++ + L  E +  + +T    R EL +A+ E+ +LH     + SER+  +
Sbjct:    15 LLKAQLQRLHIDTENLREEKDSEITST----RDELLSARDEILLLHQAAAKVASERDTDI 70

Query:   109 RNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQ 168
              +L E++ K+ AEL+       E++K  T  QN    R        Q  +D QR     +
Sbjct:    71 ASLQEELKKVRAELERWRKAASEYEKEITSLQNSFQLR-------CQQCEDQQR-----E 118

Query:   169 QIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLES-LQVMEKNYITMATEVEKLRAELM 227
             +   L  ELE LR+E++    T  +  K  N  L S LQ  EK       +  +L ++L 
Sbjct:   119 EATRLQGELEKLRKEWNALE-TECHSLKRENVLLSSELQRQEKELHNSQKQSLELTSDL- 176

Query:   228 NAPNVDRRAADGSYG 242
             +   + R+  +   G
Sbjct:   177 SILQMSRKELENQVG 191


>UNIPROTKB|Q8IX94
            symbol:CTAGE4 "Cutaneous T-cell lymphoma-associated antigen
            4" species:9606 "Homo sapiens" [GO:0016021 "integral to membrane"
            evidence=IEA] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0008150 "biological_process" evidence=ND] GO:GO:0016021
            HPA:HPA000387 HPA:HPA000922 HOVERGEN:HBG051216 EMBL:AK292236
            EMBL:AC004889 EMBL:AF338232 IPI:IPI00923407 RefSeq:NP_940897.2
            UniGene:Hs.645204 UniGene:Hs.661442 UniGene:Hs.720693
            ProteinModelPortal:Q8IX94 SMR:Q8IX94 PhosphoSite:Q8IX94
            DMDM:229462987 PaxDb:Q8IX94 PRIDE:Q8IX94 Ensembl:ENST00000486333
            GeneID:100128553 KEGG:hsa:100128553 UCSC:uc010lpc.3 CTD:100128553
            GeneCards:GC07P143880 H-InvDB:HIX0033556 H-InvDB:HIX0151950
            H-InvDB:HIX0167845 H-InvDB:HIX0167846 HGNC:HGNC:24772 MIM:608910
            neXtProt:NX_Q8IX94 PharmGKB:PA134946406 eggNOG:NOG133684
            HOGENOM:HOG000112043 OMA:LERTINF OrthoDB:EOG4WSWC5
            GenomeRNAi:100128553 NextBio:20789174 CleanEx:HS_CTAGE4
            Genevestigator:Q8IX94 GermOnline:ENSG00000196130 Uniprot:Q8IX94
        Length = 777

 Score = 125 (49.1 bits), Expect = 0.00024, P = 0.00024
 Identities = 106/459 (23%), Positives = 177/459 (38%)

Query:    56 ASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKSER---ELQMRNLT 112
             A  +V ++ L  E   +      + +        ++ L  Q   ++SE    E + + L 
Sbjct:   322 AKLNVSLKSLEGERNHIIIQLSEVDKTKEELTEHIKNLQTQQASLQSENIYFESENQKLQ 381

Query:   113 EKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDVQQIPA 172
             +K+ K+  E      +KL ++K   E +N  +  EE +++V +    + RA   ++    
Sbjct:   382 QKL-KIMTEFYQENEMKL-YRKLTVE-ENYRIEEEEKLSRVEE---KISRATEGLETYRK 435

Query:   173 LLSELES-LRQEYH-HCRGTYEYEKKFYNDHLESLQVMEKNYITMATE----VEKL-RAE 225
             L  +LE  L +  H + +    YEK+ +++ L + +  E+N   +  E     +KL   E
Sbjct:   436 LAKDLEEELERTVHFYQKQVISYEKRGHDNWLAA-RTAERNLSDLRKENAHNKQKLTETE 494

Query:   226 L-MNAPNVDRRAADGSYGGATGNSENETSGRPVGQNAYED-GYGVPQGHGPPPSATTAGV 283
             L       D  A D S   A G   +  S  P+G+ + E   +  PQ     P   +  +
Sbjct:   495 LKFELLEKDPNALDVS-NTAFGREHSPCSPSPLGRPSSETRAFPSPQTLLEDPLRLSPVL 553

Query:   284 VGAGPNTSTSAYAATQSGTPMRAAYDIPRG-PGYEA----SKGPGYDASKAPSYDPTKGP 338
              G G    +S       G P+       RG P Y+      + P    S +   +  +  
Sbjct:   554 PGGGGRGPSSP------GNPLDHQITNERGEPSYDRLIDPHRAPSDTGSLSSPVEQDRRM 607

Query:   339 SYDPAKGPGY-DPTKGPGYDAQKGSNYDAQRGPNYD-IHRGPSYDPQ-RGLGYDMQRGPN 395
              + P  G  Y D T  P  + +  SN +   GP      +  S D   R +  +M+   N
Sbjct:   608 MFPPP-GQSYPDSTLPPQREDRFYSNSERLSGPAEPRSFKMTSLDKMDRSMPSEMESSRN 666

Query:   396 YDMQRGPG---YETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSY 452
              D +   G        +P  +   GP       P   P  GP + +   +G  MRR P +
Sbjct:   667 -DAKDDLGNLNVPDSSLPAENEATGP---GLIPPPLAPISGPLFPVDT-RGPFMRRGPPF 721

Query:   453 DPSR-GTGFDGAPRGAAPHGQVPPPLNNVPYGSAT--PP 488
              P   GT F GA RG  P    P P  + P+      PP
Sbjct:   722 PPPPPGTMF-GASRGYFPPRDFPGP-PHAPFAMRNIYPP 758


>UNIPROTKB|Q9XSK0
            symbol:CRX "Cone-rod homeobox protein" species:9913 "Bos
            taurus" [GO:0060041 "retina development in camera-type eye"
            evidence=IEA] [GO:0045944 "positive regulation of transcription
            from RNA polymerase II promoter" evidence=IEA] [GO:0043522 "leucine
            zipper domain binding" evidence=IEA] [GO:0005667 "transcription
            factor complex" evidence=IEA] [GO:0003700 "sequence-specific DNA
            binding transcription factor activity" evidence=IEA] [GO:0003682
            "chromatin binding" evidence=IEA] [GO:0006351 "transcription,
            DNA-dependent" evidence=IEA] [GO:0043565 "sequence-specific DNA
            binding" evidence=IEA] InterPro:IPR001356 InterPro:IPR009057
            InterPro:IPR013851 InterPro:IPR017970 Pfam:PF00046 Pfam:PF03529
            PROSITE:PS00027 PROSITE:PS50071 SMART:SM00389 GO:GO:0043565
            GO:GO:0045944 GO:GO:0003700 GO:GO:0006351 GO:GO:0003682
            Gene3D:1.10.10.60 SUPFAM:SSF46689 GO:GO:0005667 GO:GO:0060041
            EMBL:AF154123 IPI:IPI00695402 RefSeq:NP_776329.1 UniGene:Bt.283
            ProteinModelPortal:Q9XSK0 SMR:Q9XSK0 STRING:Q9XSK0 PRIDE:Q9XSK0
            Ensembl:ENSBTAT00000028232 GeneID:280756 KEGG:bta:280756 CTD:1406
            eggNOG:NOG324074 GeneTree:ENSGT00700000104128 HOGENOM:HOG000082677
            HOVERGEN:HBG004028 InParanoid:Q9XSK0 KO:K09337 OMA:QTKARPA
            OrthoDB:EOG4NKBWG NextBio:20804923 Uniprot:Q9XSK0
        Length = 299

 Score = 119 (46.9 bits), Expect = 0.00024, P = 0.00024
 Identities = 29/96 (30%), Positives = 42/96 (43%)

Query:   269 PQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASK 328
             P    P P A  AG+V +GP+ +++ YA T +  P  A    P   G  +S   G D   
Sbjct:   165 PASESPLPEAQRAGLVASGPSLTSAPYAMTYA--PASAFCSSPSAYGSPSSYFSGLDPYL 222

Query:   329 APSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNY 364
             +P   P  GP+  P  GP   P+      +  G +Y
Sbjct:   223 SPMVPPLGGPALSPLSGPSVGPSLTQSPTSLSGQSY 258


>UNIPROTKB|P02459
            symbol:COL2A1 "Collagen alpha-1(II) chain" species:9913
            "Bos taurus" [GO:0071773 "cellular response to BMP stimulus"
            evidence=IEA] [GO:0071599 "otic vesicle development" evidence=IEA]
            [GO:0060351 "cartilage development involved in endochondral bone
            morphogenesis" evidence=IEA] [GO:0060272 "embryonic skeletal joint
            morphogenesis" evidence=IEA] [GO:0060174 "limb bud formation"
            evidence=IEA] [GO:0060021 "palate development" evidence=IEA]
            [GO:0048407 "platelet-derived growth factor binding" evidence=IEA]
            [GO:0043066 "negative regulation of apoptotic process"
            evidence=IEA] [GO:0042802 "identical protein binding" evidence=IEA]
            [GO:0042472 "inner ear morphogenesis" evidence=IEA] [GO:0030903
            "notochord development" evidence=IEA] [GO:0030199 "collagen fibril
            organization" evidence=IEA] [GO:0010468 "regulation of gene
            expression" evidence=IEA] [GO:0007605 "sensory perception of sound"
            evidence=IEA] [GO:0007601 "visual perception" evidence=IEA]
            [GO:0007417 "central nervous system development" evidence=IEA]
            [GO:0006029 "proteoglycan metabolic process" evidence=IEA]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005615 "extracellular
            space" evidence=IEA] [GO:0005604 "basement membrane" evidence=IEA]
            [GO:0005585 "collagen type II" evidence=IEA] [GO:0003007 "heart
            morphogenesis" evidence=IEA] [GO:0002062 "chondrocyte
            differentiation" evidence=IEA] [GO:0001958 "endochondral
            ossification" evidence=IEA] [GO:0001894 "tissue homeostasis"
            evidence=IEA] [GO:0001502 "cartilage condensation" evidence=IEA]
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0005737 GO:GO:0043066
            GO:GO:0005615 GO:GO:0046872 GO:GO:0003007 GO:GO:0007601
            GO:GO:0030199 GO:GO:0007417 GO:GO:0042472 GO:GO:0001894
            GO:GO:0007605 GO:GO:0071773 InterPro:IPR008160 Pfam:PF01391
            eggNOG:NOG12793 GO:GO:0071599 GO:GO:0005604 GO:GO:0001502
            GO:GO:0060021 GO:GO:0002062 GO:GO:0010468 GO:GO:0060272
            GO:GO:0006029 GO:GO:0001958 GO:GO:0060351 GO:GO:0005201
            GeneTree:ENSGT00660000095287 HOGENOM:HOG000085654
            HOVERGEN:HBG004933 KO:K06236 EMBL:AAFC03017082 EMBL:AAFC03017085
            EMBL:AAFC03056593 EMBL:L28918 EMBL:AF138883 EMBL:AF138957
            EMBL:X02420 IPI:IPI01028216 PIR:A90369 PIR:I45876
            RefSeq:NP_001001135.2 UniGene:Bt.21390 IntAct:P02459 STRING:P02459
            PRIDE:P02459 Ensembl:ENSBTAT00000017505 GeneID:407142
            KEGG:bta:407142 CTD:1280 InParanoid:Q9XT25 OMA:SSCRICV
            Reactome:REACT_133391 NextBio:20818406 PMAP-CutDB:P02459
            ArrayExpress:P02459 GO:GO:0005585 GO:GO:0060174 GO:GO:0030903
            Uniprot:P02459
        Length = 1487

 Score = 128 (50.1 bits), Expect = 0.00025, P = 0.00025
 Identities = 88/282 (31%), Positives = 102/282 (36%)

Query:   239 GSYGGATGNSENETSGRPVGQNAYEDGYGVP--QGH-GPPPSATTAGVVGAG--PNTSTS 293
             G  G A  N E    G P G        G P  +G  GPP  A  AG  GA   P     
Sbjct:   792 GPPGPAGANGEKGEVGPP-GPAGTAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGE 850

Query:   294 AYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAP--SYDPTKGPSYDPAKG---- 345
                A Q G    A    P+GP G    +GP G    K    +  P     +  A G    
Sbjct:   851 QGEAGQKGD---AGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGP 907

Query:   346 PGYDPTKGP-GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY 404
             PG +   GP G     G   D  +G      RG S  P R     +Q GP      GP  
Sbjct:   908 PGSNGNPGPPGPPGPSGK--DGPKGA-----RGDSGPPGRAGDPGLQ-GP-----AGPPG 954

Query:   405 ETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYDPSRGTGFD 461
             E    PG D   GP  +    P  +  QRG  G   QRG+ G+     PS +P +  G  
Sbjct:   955 EKGE-PGDDGPSGP--DGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGK-QGAP 1010

Query:   462 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 503
             GA     P G V PP    P G    P R GS     G P R
Sbjct:  1011 GASGDRGPPGPVGPPGLTGPAGE---PGREGS-PGADGPPGR 1048

 Score = 128 (50.1 bits), Expect = 0.00025, P = 0.00025
 Identities = 89/296 (30%), Positives = 112/296 (37%)

Query:   230 PNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATT--AGVVGAG 287
             P  DR   D    GA G    +  G P G        G P   GPP       A + G  
Sbjct:   133 PRGDR--GDKGEKGAPGPRGRD--GEP-GTPGNPGPPGPPGPPGPPGLGGNFAAQMAGGF 187

Query:   288 PNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAK 344
                +  A      G PM      PRGP G   + GP G+  +     +P   GP   P +
Sbjct:   188 DEKAGGAQMGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GP-R 242

Query:   345 GPGYDPTKGPGYDAQKGS-NYDAQRGP-NYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQR 400
             GP   P K PG D + G      +RGP      RG    P  GL G    RG P  D  +
Sbjct:   243 GPPGPPGK-PGDDGEAGKPGKSGERGPPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAK 299

Query:   401 G----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS---- 451
             G    PG + +   PG +   GP+   +  P    + GP       +G D +  P+    
Sbjct:   300 GEAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPG 357

Query:   452 -YDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATP---PARSGSGQPRG--GNP 501
                P+ G GF GAP GA   G+  P     P G+  P   P   GS  P G  GNP
Sbjct:   358 PVGPAGGPGFPGAP-GAK--GEAGPTGARGPEGAQGPRGEPGTPGSPGPAGAAGNP 410

 Score = 125 (49.1 bits), Expect = 0.00052, P = 0.00052
 Identities = 82/285 (28%), Positives = 105/285 (36%)

Query:   230 PNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP 288
             P    R       GA GN        P G      G G P   G P +   AG  GA GP
Sbjct:   329 PGERGRTGPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFP---GAPGAKGEAGPTGARGP 385

Query:   289 NTSTSAYAATQS-GTPMRA-AYDIPRGPGYEASKG----PGYDASKAPSYDPTKGPSYDP 342
               +        + G+P  A A   P   G   +KG    PG   + AP +   +GP   P
Sbjct:   386 EGAQGPRGEPGTPGSPGPAGAAGNPGTDGIPGAKGSAGAPGI--AGAPGFPGPRGP---P 440

Query:   343 AKGP-GYDPTKGP-GYDAQKG-SNYDAQRGPNYDIH-RGPSYDPQRGLGYDMQRGPNYDM 398
               GP G     GP G   + G + +  ++GP  +    GP   P    G + +RG   + 
Sbjct:   441 --GPQGATGPLGPKGQTGEPGIAGFKGEQGPKGEPGPAGPQGAPGPA-GEEGKRGARGEP 497

Query:   399 Q-RGP-GYETQR-VPGYDVQRG-PVYEAQRAPSYIP-QRGP-GYDLQRGQGYDMRRAPSY 452
                GP G   +R  PG    RG P  +    P   P +RGP G    +G   D  R    
Sbjct:   498 GGAGPAGPPGERGAPG---NRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEP 554

Query:   453 DPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP 496
                   G  G P  A P G+V P       G   PP   G+ GQP
Sbjct:   555 GLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQP 599


>MGI|MGI:88452
            symbol:Col2a1 "collagen, type II, alpha 1" species:10090 "Mus
            musculus" [GO:0001501 "skeletal system development" evidence=ISO]
            [GO:0001502 "cartilage condensation" evidence=IMP] [GO:0001894
            "tissue homeostasis" evidence=IMP] [GO:0001958 "endochondral
            ossification" evidence=IMP] [GO:0002062 "chondrocyte
            differentiation" evidence=IMP] [GO:0003007 "heart morphogenesis"
            evidence=IMP] [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0005578 "proteinaceous extracellular matrix"
            evidence=IEA] [GO:0005581 "collagen" evidence=IDA] [GO:0005585
            "collagen type II" evidence=ISO;IDA;IMP] [GO:0005604 "basement
            membrane" evidence=IDA] [GO:0005615 "extracellular space"
            evidence=IDA] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0006029
            "proteoglycan metabolic process" evidence=IMP] [GO:0007601 "visual
            perception" evidence=ISO] [GO:0007605 "sensory perception of sound"
            evidence=ISO] [GO:0010468 "regulation of gene expression"
            evidence=IMP] [GO:0030199 "collagen fibril organization"
            evidence=ISO;IMP] [GO:0031012 "extracellular matrix" evidence=IDA]
            [GO:0035108 "limb morphogenesis" evidence=IMP] [GO:0042472 "inner
            ear morphogenesis" evidence=IMP] [GO:0042802 "identical protein
            binding" evidence=IPI] [GO:0043066 "negative regulation of
            apoptotic process" evidence=IMP] [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0048407 "platelet-derived growth factor binding"
            evidence=ISO] [GO:0048705 "skeletal system morphogenesis"
            evidence=IMP] [GO:0048839 "inner ear development" evidence=IMP]
            [GO:0051216 "cartilage development" evidence=IMP] [GO:0060021
            "palate development" evidence=IMP] [GO:0060272 "embryonic skeletal
            joint morphogenesis" evidence=ISO] [GO:0060348 "bone development"
            evidence=IMP] [GO:0060351 "cartilage development involved in
            endochondral bone morphogenesis" evidence=IMP] [GO:0071773
            "cellular response to BMP stimulus" evidence=IDA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 MGI:MGI:88452 GO:GO:0005737
            GO:GO:0043066 GO:GO:0005615 GO:GO:0046872 GO:GO:0003007
            GO:GO:0007601 GO:GO:0030199 GO:GO:0007417 GO:GO:0042472
            GO:GO:0001894 GO:GO:0007605 GO:GO:0071773 InterPro:IPR008160
            Pfam:PF01391 eggNOG:NOG12793 GO:GO:0071599 GO:GO:0005604
            GO:GO:0001502 GO:GO:0060021 GO:GO:0002062 GO:GO:0010468
            GO:GO:0060272 GO:GO:0006029 GO:GO:0001958 GO:GO:0060351
            GO:GO:0005201 GeneTree:ENSGT00660000095287 HOVERGEN:HBG004933
            KO:K06236 CTD:1280 OMA:SSCRICV GO:GO:0005585 GO:GO:0060174
            GO:GO:0030903 OrthoDB:EOG4FTW1C ChiTaRS:COL2A1 EMBL:M65161
            EMBL:BC030913 EMBL:BC051383 EMBL:BC052326 EMBL:BC082331 EMBL:S63190
            EMBL:M63708 EMBL:M63709 EMBL:M63710 EMBL:AK028295 EMBL:X57982
            IPI:IPI00471183 IPI:IPI00621255 IPI:IPI00622890 IPI:IPI00623625
            IPI:IPI00828467 IPI:IPI00828653 IPI:IPI00828753 PIR:A41182
            PIR:B41182 RefSeq:NP_001106987.2 RefSeq:NP_112440.2 UniGene:Mm.2423
            PDB:2W65 PDBsum:2W65 ProteinModelPortal:P28481 SMR:P28481
            IntAct:P28481 STRING:P28481 PhosphoSite:P28481 PRIDE:P28481
            Ensembl:ENSMUST00000023123 Ensembl:ENSMUST00000088355 GeneID:12824
            KEGG:mmu:12824 UCSC:uc007xlp.2 UCSC:uc007xlq.2 InParanoid:P28481
            EvolutionaryTrace:P28481 NextBio:282306 Bgee:P28481
            CleanEx:MM_COL2A1 Genevestigator:P28481
            GermOnline:ENSMUSG00000022483 Uniprot:P28481
        Length = 1487

 Score = 128 (50.1 bits), Expect = 0.00025, P = 0.00025
 Identities = 82/285 (28%), Positives = 105/285 (36%)

Query:   230 PNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP 288
             P    R       GA GN        P G      G G P   G P +   AG  GA GP
Sbjct:   329 PGERGRTGPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFP---GAPGAKGEAGPTGARGP 385

Query:   289 NTSTSAYAATQS-GTPMRAAYDIPRG----PGYEASKG-PGYDASKAPSYDPTKGPSYDP 342
               +  +     + G+P  A      G    PG + S G PG   + AP +   +GP   P
Sbjct:   386 EGAQGSRGEPGNPGSPGPAGASGNPGTDGIPGAKGSAGAPGI--AGAPGFPGPRGP---P 440

Query:   343 AKGP-GYDPTKGP-GYDAQKG-SNYDAQRGPNYDIH-RGPSYDPQRGLGYDMQRGPNYDM 398
               GP G     GP G   + G + +   +GP  +    GP   P    G + +RG   + 
Sbjct:   441 --GPQGATGPLGPKGQAGEPGIAGFKGDQGPKGETGPAGPQGAPGPA-GEEGKRGARGEP 497

Query:   399 Q-RGP-GYETQR-VPGYDVQRG-PVYEAQRAPSYIP-QRGP-GYDLQRGQGYDMRRAPSY 452
                GP G   +R  PG    RG P  +    P   P +RGP G    +G   D  R    
Sbjct:   498 GGAGPIGPPGERGAPG---NRGFPGQDGLAGPKGAPGERGPSGLTGPKGANGDPGRPGEP 554

Query:   453 DPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP 496
                   G  G P  A P G+V P       G   PP   G+ GQP
Sbjct:   555 GLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQP 599

 Score = 127 (49.8 bits), Expect = 0.00032, P = 0.00032
 Identities = 84/290 (28%), Positives = 110/290 (37%)

Query:   243 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPP----PSATTAG-----VVGAGPNTSTS 293
             G  G+   + +  P G++      G P   GPP    P   +AG     + G     +  
Sbjct:   134 GDRGDKGEKGAPGPRGRDGEPGTPGNPGPAGPPGPPGPPGLSAGNFAAQMAGGYDEKAGG 193

Query:   294 AYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAKGPGYDP 350
             A      G PM      PRGP G   + GP G+  +     +P   GP   P   PG  P
Sbjct:   194 AQMGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GPRGPPG--P 247

Query:   351 TKGPGYDAQKGS-NYDAQRG-PNYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQRG----P 402
                PG D + G      +RG P     RG    P  GL G    RG P  D  +G    P
Sbjct:   248 AGKPGDDGEAGKPGKSGERGLPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAKGEAGAP 305

Query:   403 GYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS-----YDPSR 456
             G + +   PG +   GP+   +  P    + GP       +G D +  P+       P+ 
Sbjct:   306 GVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPGPVGPAG 363

Query:   457 GTGFDGAP--RG-AAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNP 501
             G GF GAP  +G A P G   P       GS   P   GS  P G  GNP
Sbjct:   364 GPGFPGAPGAKGEAGPTGARGP---EGAQGSRGEPGNPGSPGPAGASGNP 410

 Score = 126 (49.4 bits), Expect = 0.00041, P = 0.00041
 Identities = 89/301 (29%), Positives = 117/301 (38%)

Query:   235 RAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPP-PSATTA--GVVGA-GPNT 290
             R A G   G  G+  +     P G    + G G+    GPP P+      G VG  GP+ 
Sbjct:   755 RGAAG-IAGPKGDRGDVGEKGPEGAPGKDGGRGLTGPIGPPGPAGANGEKGEVGPPGPSG 813

Query:   291 STSAYAAT----QSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPA 343
             ST A  A     ++G P  A +  P G  G   +KG  G    K  +  P  +GPS  P 
Sbjct:   814 STGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGDQGEAGQKGDAGAPGPQGPSGAPG 873

Query:   344 -KGP-GYDPTKGP-GYDAQKGSN-YDAQRG----PNYDIHRGPSYDPQRGLGYDMQRGPN 395
              +GP G    KG  G     G+  +    G    P  + + GP+  P    G D  +G  
Sbjct:   874 PQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGPPGANGNPGPA-GPPGPAGKDGPKGVR 932

Query:   396 YDM----QRG-PGYETQR-VPGYDVQRG---PV-YEAQRAPSYIP-QRG-PGYDLQRGQ- 442
              D     + G PG +     PG   + G   P   +    P  +  QRG  G   QRG+ 
Sbjct:   933 GDSGPPGRAGDPGLQGPAGAPGEKGEPGDDGPSGLDGPPGPQGLAGQRGIVGLPGQRGER 992

Query:   443 GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPA 502
             G+     PS +P +  G  GA     P G V PP    P G    P R GS     G P 
Sbjct:   993 GFPGLPGPSGEPGK-QGAPGASGDRGPPGPVGPPGLTGPAGE---PGREGS-PGADGPPG 1047

Query:   503 R 503
             R
Sbjct:  1048 R 1048


>UNIPROTKB|P02458
            symbol:COL2A1 "Collagen alpha-1(II) chain" species:9606
            "Homo sapiens" [GO:0046872 "metal ion binding" evidence=IEA]
            [GO:0001502 "cartilage condensation" evidence=IEA] [GO:0001894
            "tissue homeostasis" evidence=IEA] [GO:0001958 "endochondral
            ossification" evidence=IEA] [GO:0002062 "chondrocyte
            differentiation" evidence=IEA] [GO:0003007 "heart morphogenesis"
            evidence=IEA] [GO:0005604 "basement membrane" evidence=IEA]
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0006029
            "proteoglycan metabolic process" evidence=IEA] [GO:0007417 "central
            nervous system development" evidence=IEA] [GO:0010468 "regulation
            of gene expression" evidence=IEA] [GO:0030903 "notochord
            development" evidence=IEA] [GO:0042472 "inner ear morphogenesis"
            evidence=IEA] [GO:0043066 "negative regulation of apoptotic
            process" evidence=IEA] [GO:0060021 "palate development"
            evidence=IEA] [GO:0060174 "limb bud formation" evidence=IEA]
            [GO:0060351 "cartilage development involved in endochondral bone
            morphogenesis" evidence=IEA] [GO:0071599 "otic vesicle development"
            evidence=IEA] [GO:0071773 "cellular response to BMP stimulus"
            evidence=IEA] [GO:0007601 "visual perception" evidence=IMP]
            [GO:0048407 "platelet-derived growth factor binding" evidence=IDA]
            [GO:0005576 "extracellular region" evidence=TAS] [GO:0005788
            "endoplasmic reticulum lumen" evidence=TAS] [GO:0007411 "axon
            guidance" evidence=TAS] [GO:0030198 "extracellular matrix
            organization" evidence=TAS] [GO:0042802 "identical protein binding"
            evidence=NAS] [GO:0001501 "skeletal system development"
            evidence=IMP] [GO:0007605 "sensory perception of sound"
            evidence=IMP] [GO:0060272 "embryonic skeletal joint morphogenesis"
            evidence=IMP] [GO:0051216 "cartilage development" evidence=TAS]
            [GO:0030199 "collagen fibril organization" evidence=IMP]
            [GO:0005585 "collagen type II" evidence=IDA] [GO:0030020
            "extracellular matrix structural constituent conferring tensile
            strength" evidence=IC] InterPro:IPR000885 InterPro:IPR001007
            Pfam:PF00093 Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208
            PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038 SMART:SM00214
            Reactome:REACT_118779 Reactome:REACT_111045 Reactome:REACT_111102
            GO:GO:0007411 GO:GO:0043066 GO:GO:0005615 PDB:2FSE PDBsum:2FSE
            PDB:2SEB PDBsum:2SEB GO:GO:0046872 GO:GO:0003007 GO:GO:0007601
            GO:GO:0030199 GO:GO:0007417 GO:GO:0005788 GO:GO:0042472
            GO:GO:0001894 GO:GO:0042802 GO:GO:0007605 GO:GO:0071773
            GO:GO:0051216 InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793
            GO:GO:0071599 GO:GO:0005604 GO:GO:0001502 GO:GO:0060021
            GO:GO:0002062 GO:GO:0010468 GO:GO:0060272 GO:GO:0006029
            GO:GO:0001958 GO:GO:0060351 HOVERGEN:HBG004933 KO:K06236
            DrugBank:DB00048 GO:GO:0048407 CTD:1280 OMA:SSCRICV GO:GO:0005585
            GO:GO:0060174 GO:GO:0030903 OrthoDB:EOG4FTW1C EMBL:X16468
            EMBL:L10347 EMBL:BT007205 EMBL:AC004801 EMBL:BC007252 EMBL:BC116449
            EMBL:X16711 EMBL:M25730 EMBL:M32168 EMBL:M25655 EMBL:M25656
            EMBL:M64345 EMBL:M60299 EMBL:M25698 EMBL:X58709 EMBL:X57010
            EMBL:U15195 EMBL:X13783 EMBL:M25728 EMBL:X02371 EMBL:X02372
            EMBL:X02373 EMBL:X02374 EMBL:X02375 EMBL:X02376 EMBL:X02377
            EMBL:X02378 EMBL:X16158 EMBL:J00116 EMBL:L00977 EMBL:M63281
            EMBL:M27468 EMBL:X06268 EMBL:X00339 EMBL:M12048 IPI:IPI00186460
            IPI:IPI00748487 IPI:IPI00936892 PIR:A38513 RefSeq:NP_001835.3
            RefSeq:NP_149162.2 UniGene:Hs.408182 PDB:1U5M PDBsum:1U5M
            ProteinModelPortal:P02458 SMR:P02458 IntAct:P02458
            MINT:MINT-6796075 STRING:P02458 PhosphoSite:P02458 DMDM:124056489
            PaxDb:P02458 PRIDE:P02458 DNASU:1280 Ensembl:ENST00000337299
            Ensembl:ENST00000380518 GeneID:1280 KEGG:hsa:1280 UCSC:uc001rqt.3
            UCSC:uc001rqu.3 UCSC:uc001rqv.3 GeneCards:GC12M048266
            HGNC:HGNC:2200 HPA:CAB002214 MIM:108300 MIM:120140 MIM:132450
            MIM:150600 MIM:151210 MIM:156550 MIM:183900 MIM:184250 MIM:200610
            MIM:271700 MIM:604864 MIM:608805 MIM:609162 MIM:609508
            neXtProt:NX_P02458 Orphanet:93296 Orphanet:209867 Orphanet:137678
            Orphanet:86820 Orphanet:93297 Orphanet:485 Orphanet:2380
            Orphanet:93279 Orphanet:166011 Orphanet:1427 Orphanet:85166
            Orphanet:93346 Orphanet:94068 Orphanet:93315 Orphanet:1856
            Orphanet:90653 PharmGKB:PA26715 ChiTaRS:COL2A1
            EvolutionaryTrace:P02458 GenomeRNAi:1280 NextBio:5171
            PMAP-CutDB:P02458 Bgee:P02458 Genevestigator:P02458
            GermOnline:ENSG00000139219 GO:GO:0030020 Uniprot:P02458
        Length = 1487

 Score = 128 (50.1 bits), Expect = 0.00025, P = 0.00025
 Identities = 90/296 (30%), Positives = 113/296 (38%)

Query:   230 PNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATT--AGVVGAG 287
             P  DR   D    GA G    +  G P G        G P   GPP       A + G  
Sbjct:   133 PRGDR--GDKGEKGAPGPRGRD--GEP-GTPGNPGPPGPPGPPGPPGLGGNFAAQMAGGF 187

Query:   288 PNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAK 344
                +  A      G PM      PRGP G   + GP G+  +     +P   GP   P +
Sbjct:   188 DEKAGGAQLGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GP-R 242

Query:   345 GPGYDPTKGPGYDAQKGSNYDA-QRGP-NYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQR 400
             GP   P K PG D + G    A +RGP      RG    P  GL G    RG P  D  +
Sbjct:   243 GPPGPPGK-PGDDGEAGKPGKAGERGPPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAK 299

Query:   401 G----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS---- 451
             G    PG + +   PG +   GP+   +  P    + GP       +G D +  P+    
Sbjct:   300 GEAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPG 357

Query:   452 -YDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATP---PARSGSGQPRG--GNP 501
                P+ G GF GAP GA   G+  P     P G+  P   P   GS  P G  GNP
Sbjct:   358 PVGPAGGPGFPGAP-GAK--GEAGPTGARGPEGAQGPRGEPGTPGSPGPAGASGNP 410

 Score = 125 (49.1 bits), Expect = 0.00052, P = 0.00052
 Identities = 82/285 (28%), Positives = 105/285 (36%)

Query:   230 PNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP 288
             P    R       GA GN        P G      G G P   G P +   AG  GA GP
Sbjct:   329 PGERGRTGPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFP---GAPGAKGEAGPTGARGP 385

Query:   289 NTSTSAYAATQS-GTPMRAAYDIPRG----PGYEASKG-PGYDASKAPSYDPTKGPSYDP 342
               +        + G+P  A      G    PG + S G PG   + AP +   +GP   P
Sbjct:   386 EGAQGPRGEPGTPGSPGPAGASGNPGTDGIPGAKGSAGAPGI--AGAPGFPGPRGP---P 440

Query:   343 AKGP-GYDPTKGP-GYDAQKG-SNYDAQRGPNYDIH-RGPSYDPQRGLGYDMQRGPNYDM 398
               GP G     GP G   + G + +  ++GP  +    GP   P    G + +RG   + 
Sbjct:   441 --GPQGATGPLGPKGQTGEPGIAGFKGEQGPKGEPGPAGPQGAPGPA-GEEGKRGARGEP 497

Query:   399 QR-GP-GYETQR-VPGYDVQRG-PVYEAQRAPSYIP-QRGP-GYDLQRGQGYDMRRAPSY 452
                GP G   +R  PG    RG P  +    P   P +RGP G    +G   D  R    
Sbjct:   498 GGVGPIGPPGERGAPG---NRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEP 554

Query:   453 DPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP 496
                   G  G P  A P G+V P       G   PP   G+ GQP
Sbjct:   555 GLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQP 599

 Score = 124 (48.7 bits), Expect = 0.00067, P = 0.00067
 Identities = 88/282 (31%), Positives = 101/282 (35%)

Query:   239 GSYGGATGNSENETSGRPVGQNAYEDGYGVP--QGH-GPPPSATTAGVVGAG--PNTSTS 293
             G  G A  N E    G P G        G P  +G  GPP  A  AG  GA   P     
Sbjct:   792 GPPGPAGANGEKGEVGPP-GPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGE 850

Query:   294 AYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAP--SYDPTKGPSYDPAKG---- 345
                A Q G    A    P+GP G    +GP G    K    +  P     +  A G    
Sbjct:   851 QGEAGQKGD---AGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGP 907

Query:   346 PGYDPTKGP-GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY 404
             PG +   GP G     G   D  +G      RG S  P R  G    +GP      GP  
Sbjct:   908 PGSNGNPGPPGPPGPSGK--DGPKGA-----RGDSGPPGRA-GEPGLQGP-----AGPPG 954

Query:   405 ETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYDPSRGTGFD 461
             E    PG D   G   E    P  +  QRG  G   QRG+ G+     PS +P +  G  
Sbjct:   955 EKGE-PGDDGPSGA--EGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGK-QGAP 1010

Query:   462 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 503
             GA     P G V PP    P G    P R GS     G P R
Sbjct:  1011 GASGDRGPPGPVGPPGLTGPAGE---PGREGS-PGADGPPGR 1048


>UNIPROTKB|F1PS24
            symbol:COL2A1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0071773 "cellular response to BMP stimulus"
            evidence=IEA] [GO:0071599 "otic vesicle development" evidence=IEA]
            [GO:0060351 "cartilage development involved in endochondral bone
            morphogenesis" evidence=IEA] [GO:0060272 "embryonic skeletal joint
            morphogenesis" evidence=IEA] [GO:0060174 "limb bud formation"
            evidence=IEA] [GO:0060021 "palate development" evidence=IEA]
            [GO:0048407 "platelet-derived growth factor binding" evidence=IEA]
            [GO:0043066 "negative regulation of apoptotic process"
            evidence=IEA] [GO:0042802 "identical protein binding" evidence=IEA]
            [GO:0042472 "inner ear morphogenesis" evidence=IEA] [GO:0030903
            "notochord development" evidence=IEA] [GO:0030199 "collagen fibril
            organization" evidence=IEA] [GO:0010468 "regulation of gene
            expression" evidence=IEA] [GO:0007605 "sensory perception of sound"
            evidence=IEA] [GO:0007601 "visual perception" evidence=IEA]
            [GO:0007417 "central nervous system development" evidence=IEA]
            [GO:0006029 "proteoglycan metabolic process" evidence=IEA]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005615 "extracellular
            space" evidence=IEA] [GO:0005604 "basement membrane" evidence=IEA]
            [GO:0005585 "collagen type II" evidence=IEA] [GO:0003007 "heart
            morphogenesis" evidence=IEA] [GO:0002062 "chondrocyte
            differentiation" evidence=IEA] [GO:0001958 "endochondral
            ossification" evidence=IEA] [GO:0001894 "tissue homeostasis"
            evidence=IEA] [GO:0001502 "cartilage condensation" evidence=IEA]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA] InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093
            Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184
            PROSITE:PS51461 SMART:SM00038 SMART:SM00214 GO:GO:0005737
            GO:GO:0043066 GO:GO:0005615 GO:GO:0003007 GO:GO:0007601
            GO:GO:0030199 GO:GO:0007417 GO:GO:0042472 GO:GO:0001894
            GO:GO:0007605 GO:GO:0071773 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0071599 GO:GO:0005604 GO:GO:0001502 GO:GO:0060021
            GO:GO:0002062 GO:GO:0010468 GO:GO:0060272 GO:GO:0006029
            GO:GO:0001958 GO:GO:0060351 GO:GO:0005201
            GeneTree:ENSGT00660000095287 GO:GO:0005585 GO:GO:0060174
            GO:GO:0030903 EMBL:AAEX03015088 EMBL:AAEX03015089
            Ensembl:ENSCAFT00000014414 OMA:CPICPTE Uniprot:F1PS24
        Length = 1489

 Score = 128 (50.1 bits), Expect = 0.00025, P = 0.00025
 Identities = 88/282 (31%), Positives = 102/282 (36%)

Query:   239 GSYGGATGNSENETSGRPVGQNAYEDGYGVP--QGH-GPPPSATTAGVVGAG--PNTSTS 293
             G  G A  N E    G P G        G P  +G  GPP  A  AG  GA   P     
Sbjct:   794 GPPGPAGANGEKGEVGPP-GPAGTAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGE 852

Query:   294 AYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAP--SYDPTKGPSYDPAKG---- 345
                A Q G    A    P+GP G    +GP G    K    +  P     +  A G    
Sbjct:   853 QGEAGQKGD---AGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGP 909

Query:   346 PGYDPTKGP-GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY 404
             PG +   GP G     G   D  +G      RG S  P R     +Q GP      GP  
Sbjct:   910 PGSNGNPGPPGPPGPSGK--DGPKGA-----RGDSGPPGRAGDPGLQ-GP-----AGPPG 956

Query:   405 ETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYDPSRGTGFD 461
             E    PG D   GP  +    P  +  QRG  G   QRG+ G+     PS +P +  G  
Sbjct:   957 EKGE-PGDDGPSGP--DGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGK-QGAP 1012

Query:   462 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 503
             GA     P G V PP    P G    P R GS     G P R
Sbjct:  1013 GASGDRGPPGPVGPPGLTGPSGE---PGREGS-PGADGPPGR 1050

 Score = 125 (49.1 bits), Expect = 0.00052, P = 0.00052
 Identities = 72/271 (26%), Positives = 92/271 (33%)

Query:   239 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAA- 297
             G  G      +    G P G    +   G P   GPP      G  G G N +       
Sbjct:   130 GEQGPRGDRGDKGEKGAP-GPRGRDGEPGTPGNPGPPGPPGPPGPPGLGGNFAAQMAGGF 188

Query:   298 -TQSGTPMRAAYDIPRGP-GYEASKGPGYDASKAPSYDPTKGPSYDPA-KGPGYDPTKGP 354
               ++G         P GP G     GP   A     +    G   +P   GP   P   P
Sbjct:   189 DEKAGGAQMGVMQGPMGPMGPRGPPGPA-GAPGPQGFQGNPGEPGEPGVSGP-MGPRGPP 246

Query:   355 GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRG-PGYETQR---- 408
             G   + G + +A + P     RGP   PQ   G+    G P     RG PG +  +    
Sbjct:   247 GPPGKPGDDGEAGK-PGKSGERGPP-GPQGARGFPGTPGLPGVKGHRGYPGLDGAKGEAG 304

Query:   409 VPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGA 467
              PG   + G   E   +P  +  RG PG   +RG     R  P+   +   G DG P  A
Sbjct:   305 APGVKGESGSPGE-NGSPGPMGPRGLPG---ERG-----RTGPA-GAAGARGNDGQPGPA 354

Query:   468 APHGQVPPPLNNVPYGSATPPARSGSGQPRG 498
              P G V P     P     P A  G   P G
Sbjct:   355 GPPGPVSPA--GGPGFPGAPGASQGEAGPTG 383

 Score = 125 (49.1 bits), Expect = 0.00052, P = 0.00052
 Identities = 83/285 (29%), Positives = 106/285 (37%)

Query:   230 PNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP 288
             P    R       GA GN        P G  +   G G P G  P  S   AG  GA GP
Sbjct:   330 PGERGRTGPAGAAGARGNDGQPGPAGPPGPVSPAGGPGFP-G-APGASQGEAGPTGARGP 387

Query:   289 NTSTSAYAATQS-GTPMRAAYDIPRG----PGYEASKG-PGYDASKAPSYDPTKGPSYDP 342
               +        + G+P  A      G    PG + S G PG   + AP +   +GP   P
Sbjct:   388 EGAQGPRGEPGTPGSPGPAGASGNPGTDGIPGAKGSAGAPGI--AGAPGFPGPRGP---P 442

Query:   343 AKGP-GYDPTKGP-GYDAQKG-SNYDAQRGPNYDIH-RGPSYDPQRGLGYDMQRGPNYDM 398
               GP G     GP G   + G + +  ++GP  +    GP   P    G + +RG   + 
Sbjct:   443 --GPQGATGPLGPKGQTGEPGIAGFKGEQGPKGEPGPAGPQGAPGPA-GEEGKRGARGEP 499

Query:   399 Q-RGP-GYETQR-VPGYDVQRG-PVYEAQRAPSYIP-QRGP-GYDLQRGQGYDMRRAPSY 452
                GP G   +R  PG    RG P  +    P   P +RGP G    +G   D  R    
Sbjct:   500 GGAGPVGPPGERGAPG---NRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEP 556

Query:   453 DPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP 496
                   G  G P  A P G+V P       G   PP   G+ GQP
Sbjct:   557 GLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQP 601

 Score = 123 (48.4 bits), Expect = 0.00086, P = 0.00086
 Identities = 84/287 (29%), Positives = 107/287 (37%)

Query:   236 AADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNTSTSA 294
             A    + GA G S+ E    P G    E   G P+G  G P S   AG  G  P T    
Sbjct:   363 AGGPGFPGAPGASQGEAG--PTGARGPEGAQG-PRGEPGTPGSPGPAGASG-NPGTDGIP 418

Query:   295 YAATQSGTPMRAA---YDIPRGP-GYEASKGP----GYDASKA-PSYDPTKGPSYDPAKG 345
              A   +G P  A    +  PRGP G + + GP    G         +   +GP  +P  G
Sbjct:   419 GAKGSAGAPGIAGAPGFPGPRGPPGPQGATGPLGPKGQTGEPGIAGFKGEQGPKGEP--G 476

Query:   346 PGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDP-QRGLGYDMQRG-PNYDMQRGP- 402
             P   P   PG   ++G    A+  P      GP   P +RG   +  RG P  D   GP 
Sbjct:   477 PA-GPQGAPGPAGEEGKR-GARGEPG---GAGPVGPPGERGAPGN--RGFPGQDGLAGPK 529

Query:   403 GYETQRVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFD 461
             G   +R P      GP   A   P    + G PG     G+  D        PS   G D
Sbjct:   530 GAPGERGPSG--LAGPK-GANGDPGRPGEPGLPGARGLTGRPGDAGPQGKVGPSGAPGED 586

Query:   462 GAPRGAAPHG-QVPPPLNNVP--YGSATPPARSGS-GQPRGGNPARR 504
             G P    P G +  P +   P   G+   P ++G  G P  G P  R
Sbjct:   587 GRPGPPGPQGARGQPGVMGFPGPKGANGEPGKAGEKGLP--GAPGLR 631


>UNIPROTKB|Q14BN4
            symbol:SLMAP "Sarcolemmal membrane-associated protein"
            species:9606 "Homo sapiens" [GO:0006457 "protein folding"
            evidence=IEA] [GO:0016272 "prefoldin complex" evidence=IEA]
            [GO:0051082 "unfolded protein binding" evidence=IEA] [GO:0005815
            "microtubule organizing center" evidence=IEA] [GO:0042383
            "sarcolemma" evidence=IEA] [GO:0005790 "smooth endoplasmic
            reticulum" evidence=TAS] [GO:0005887 "integral to plasma membrane"
            evidence=TAS] [GO:0006936 "muscle contraction" evidence=TAS]
            InterPro:IPR000253 InterPro:IPR002777 InterPro:IPR008984
            Pfam:PF00498 Pfam:PF01920 PROSITE:PS50006 SMART:SM00240
            GO:GO:0006457 GO:GO:0005887 Gene3D:2.60.200.20 SUPFAM:SSF49879
            GO:GO:0005815 GO:GO:0042383 GO:GO:0006936 GO:GO:0016272
            GO:GO:0005790 eggNOG:COG1716 EMBL:AF304450 EMBL:AF100750
            EMBL:AY358410 EMBL:AK124200 EMBL:AL834538 EMBL:CR627321
            EMBL:BC114627 EMBL:BC115701 EMBL:AB046821 IPI:IPI00026691
            IPI:IPI00030531 IPI:IPI00432472 IPI:IPI00446339 IPI:IPI00791574
            IPI:IPI00794462 IPI:IPI00794566 IPI:IPI00795406 RefSeq:NP_009090.2
            UniGene:Hs.476432 ProteinModelPortal:Q14BN4 SMR:Q14BN4
            IntAct:Q14BN4 STRING:Q14BN4 PhosphoSite:Q14BN4 DMDM:118597508
            PaxDb:Q14BN4 PRIDE:Q14BN4 Ensembl:ENST00000295951
            Ensembl:ENST00000295952 Ensembl:ENST00000383718
            Ensembl:ENST00000416870 Ensembl:ENST00000428312
            Ensembl:ENST00000449503 GeneID:7871 KEGG:hsa:7871 UCSC:uc003djc.1
            UCSC:uc003djd.1 UCSC:uc003dje.1 UCSC:uc003djf.1 UCSC:uc003djg.1
            UCSC:uc003djh.3 UCSC:uc003dji.1 CTD:7871 GeneCards:GC03P057802
            H-InvDB:HIX0003396 HGNC:HGNC:16643 HPA:HPA002357 HPA:HPA002358
            MIM:602701 neXtProt:NX_Q14BN4 PharmGKB:PA38179 HOVERGEN:HBG082442
            OMA:RTSKQKC ChiTaRS:SLMAP GenomeRNAi:7871 NextBio:30324
            ArrayExpress:Q14BN4 Bgee:Q14BN4 Genevestigator:Q14BN4
            Uniprot:Q14BN4
        Length = 828

 Score = 125 (49.1 bits), Expect = 0.00026, P = 0.00026
 Identities = 51/196 (26%), Positives = 91/196 (46%)

Query:    49 EVMEQKIASQHVEMQKLATE-NQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQ 107
             +V++ ++   H++ + L  E +  + +T    R EL +A+ E+ +LH     + SER+  
Sbjct:   546 QVLQAQLQRLHIDTENLREEKDSEITST----RDELLSARDEILLLHQAAAKVASERDTD 601

Query:   108 MRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAHTDV 167
             + +L E++ K+ AEL+       E++K  T  QN    R        Q  +D QR     
Sbjct:   602 IASLQEELKKVRAELERWRKAASEYEKEITSLQNSFQLR-------CQQCEDQQR----- 649

Query:   168 QQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLES-LQVMEKNYITMATEVEKLRAEL 226
             ++   L  ELE LR+E++    T  +  K  N  L S LQ  EK       +  +L ++L
Sbjct:   650 EEATRLQGELEKLRKEWNALE-TECHSLKRENVLLSSELQRQEKELHNSQKQSLELTSDL 708

Query:   227 MNAPNVDRRAADGSYG 242
              +   + R+  +   G
Sbjct:   709 -SILQMSRKELENQVG 723


>UNIPROTKB|E1BF47
            symbol:TPR "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0031965 "nuclear membrane" evidence=IEA] [GO:0007094
            "mitotic spindle assembly checkpoint" evidence=IEA] [GO:0005643
            "nuclear pore" evidence=IEA] [GO:0000776 "kinetochore"
            evidence=IEA] [GO:0006606 "protein import into nucleus"
            evidence=IEA] [GO:0006434 "seryl-tRNA aminoacylation" evidence=IEA]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005524 "ATP binding"
            evidence=IEA] [GO:0004828 "serine-tRNA ligase activity"
            evidence=IEA] InterPro:IPR012929 InterPro:IPR015866 Pfam:PF07926
            GO:GO:0005524 GO:GO:0005737 GO:GO:0000776 GO:GO:0007094
            GO:GO:0031965 GO:GO:0005643 GO:GO:0006606 KO:K09291
            InterPro:IPR009053 SUPFAM:SSF46579 GeneTree:ENSGT00700000104019
            GO:GO:0004828 GO:GO:0006434 Gene3D:1.10.287.40 CTD:7175 OMA:RFIRREK
            EMBL:DAAA02043627 IPI:IPI00694835 RefSeq:NP_001192552.1
            UniGene:Bt.1386 Ensembl:ENSBTAT00000015848 GeneID:507869
            KEGG:bta:507869 NextBio:20868255 Uniprot:E1BF47
        Length = 2360

 Score = 124 (48.7 bits), Expect = 0.00026, Sum P(2) = 0.00026
 Identities = 43/187 (22%), Positives = 87/187 (46%)

Query:    48 PEVME-QKIASQ---HVE-MQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKS 102
             P+  E +K+ S+   H + +Q+L  E  RL A        L   Q+ +Q L   +  +++
Sbjct:  1347 PDTEEYRKLLSEKEVHTKRIQQLTEELGRLKAEIARSNASLTNNQNLIQSLKEDLNKVRT 1406

Query:   103 ERELQMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQR 162
             E+E   + L  KI  ++ ++KT   VK   ++ KT+ + L   +++++    Q + D Q 
Sbjct:  1407 EKENIQKELDAKIIDIQEKVKTITQVKKIGRRYKTQYEELKAQQDKVMETSAQSSGDHQE 1466

Query:   163 AHTDVQQIPALLSELESLRQEYHHCRGTYE-YEKKFYNDHLESLQVMEKNYITMATEVEK 221
              H  VQ++  L   L     +        E  +K      +E+  + E+  + + +E+ +
Sbjct:  1467 QHVSVQEMQELKETLSQAETKSKSLENQVENLQKTLSEKEIEARSLQEQT-LELQSELAR 1525

Query:   222 LRAELMN 228
             LR +L +
Sbjct:  1526 LRQDLQD 1532

 Score = 57 (25.1 bits), Expect = 0.00026, Sum P(2) = 0.00026
 Identities = 17/54 (31%), Positives = 22/54 (40%)

Query:   243 GATGNSENETSGRPVGQNAYE--DGYGVPQGHGPPPSATTAGVVGAGPNTSTSA 294
             G  G   NE +G   G + YE  D  G   G G  P   T   +G G +   +A
Sbjct:  1979 GDEGEDSNEGTGSADGNDGYEADDAEG---GDGTDPGTETEESMGGGESNQRAA 2029


>UNIPROTKB|F1SEN8
            symbol:LDB3 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0030018 "Z disc" evidence=IEA] [GO:0008092
            "cytoskeletal protein binding" evidence=IEA] [GO:0005856
            "cytoskeleton" evidence=IEA] [GO:0005080 "protein kinase C binding"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            Pfam:PF00595 Pfam:PF00412 InterPro:IPR001478 InterPro:IPR001781
            PROSITE:PS00478 PROSITE:PS50023 PROSITE:PS50106 SMART:SM00132
            SMART:SM00228 GO:GO:0030018 GO:GO:0005856 GO:GO:0046872
            GO:GO:0008270 Gene3D:2.10.110.10 SUPFAM:SSF50156 CTD:11155
            OMA:CTSQATT InterPro:IPR006643 SMART:SM00735
            GeneTree:ENSGT00700000104411 EMBL:CU468409 RefSeq:XP_003359314.1
            UniGene:Ssc.97236 Ensembl:ENSSSCT00000011341 GeneID:100151883
            KEGG:ssc:100151883 Uniprot:F1SEN8
        Length = 715

 Score = 124 (48.7 bits), Expect = 0.00028, P = 0.00028
 Identities = 50/192 (26%), Positives = 69/192 (35%)

Query:   244 ATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGP----NTSTSAYAATQ 299
             AT ++    S            Y       P P+A T     A P       T+A     
Sbjct:   344 ATASAAAPASSPADSPRPQASAYSPAVATSPAPAAHTYSEAPAAPAPKPRVVTTASIRPS 403

Query:   300 SGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQ 359
                P+ A+   P  PG   S  P Y  S AP+Y P+  P+Y P+  P Y P+  P Y+  
Sbjct:   404 VYQPVPASTYSP-SPGANYSPTP-YTPSPAPAYTPSPAPTYSPSPAPAYTPSPAPSYNPT 461

Query:   360 KGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVPGYDVQ--- 415
               S   A+           S+  +   G          + RG P Y T  + G  V    
Sbjct:   462 PYSGGPAESASRPPWVTDDSFSQKFAPGKSTTSISKQSLPRGAPAY-TPPLQGPQVSPLA 520

Query:   416 RGPVYEAQRAPS 427
             RG V  A+R P+
Sbjct:   521 RGTVQRAERFPA 532


>RGD|1311620
            symbol:Zmiz1 "zinc finger, MIZ-type containing 1" species:10116
            "Rattus norvegicus" [GO:0001570 "vasculogenesis" evidence=IEA;ISO]
            [GO:0001701 "in utero embryonic development" evidence=IEA;ISO]
            [GO:0003007 "heart morphogenesis" evidence=IEA;ISO] [GO:0007296
            "vitellogenesis" evidence=IEA;ISO] [GO:0007569 "cell aging"
            evidence=IEA;ISO] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0045944 "positive regulation of transcription from RNA
            polymerase II promoter" evidence=IEA;ISO] [GO:0048146 "positive
            regulation of fibroblast proliferation" evidence=IEA;ISO]
            [GO:0048589 "developmental growth" evidence=IEA;ISO] [GO:0048844
            "artery morphogenesis" evidence=IEA;ISO] InterPro:IPR004181
            Pfam:PF02891 PROSITE:PS51044 RGD:1311620 GO:GO:0008270
            Gene3D:3.30.40.10 InterPro:IPR013083 GeneTree:ENSGT00550000074410
            CTD:57178 OMA:MNQYGPM OrthoDB:EOG45MN70 EMBL:CH474067
            IPI:IPI00364462 RefSeq:NP_001101863.1 UniGene:Rn.1712
            Ensembl:ENSRNOT00000014004 GeneID:361103 KEGG:rno:361103
            UCSC:RGD:1311620 NextBio:675228 Uniprot:D4AE97
        Length = 1072

 Score = 126 (49.4 bits), Expect = 0.00028, P = 0.00028
 Identities = 66/233 (28%), Positives = 87/233 (37%)

Query:   287 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYD-ASKAPSYDPT--KGP--SYD 341
             GP  S+     TQ+          PRGP   AS G   + AS A    P+   GP    +
Sbjct:   318 GPVCSSFQMGPTQAYNSQFMNQPGPRGP---ASMGGSMNPASMAAGMTPSGMSGPPMGMN 374

Query:   342 PAKGPGYDP--TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ 399
               + PG  P  T G     Q       Q  P   I R    +P  G   + Q GPN    
Sbjct:   375 QPRPPGISPFGTHGQRMPQQTYPGPRPQSLPIQSIKRSYPGEPNYG---NQQYGPNSQFP 431

Query:   400 RGPG-YETQRVPGYDVQRGPVYEAQRAPSYIPQRG--PGYDLQRGQGYDMRRAPSYDP-- 454
               PG Y T   P       P Y  QR PS  P  G  P   +  GQ Y   +    +   
Sbjct:   432 TQPGQYPTPNPPR--PLTSPNYPGQRMPSQ-PSTGQYPPPTVNMGQYYKPEQFNGQNNTF 488

Query:   455 SRGTGFDGAPRGAAPHGQVPPPLNNVPY----GSATPPARSGSGQPRGGNPAR 503
             S G+ +    +G+      P P+ N P+    G+ TPP   GS  P   +P++
Sbjct:   489 SSGSSYSSYSQGSVNRPPRPVPVANYPHSPVPGNPTPPMTPGSSIPPYLSPSQ 541


>ZFIN|ZDB-GENE-030131-2281
            symbol:col4a5 "collagen, type IV, alpha 5 (Alport
            syndrome)" species:7955 "Danio rerio" [GO:0005201 "extracellular
            matrix structural constituent" evidence=IEA] [GO:0005581 "collagen"
            evidence=IEA] [GO:0031290 "retinal ganglion cell axon guidance"
            evidence=IMP] [GO:0007412 "axon target recognition" evidence=IMP]
            [GO:0030198 "extracellular matrix organization" evidence=IMP]
            InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
            ZFIN:ZDB-GENE-030131-2281 InterPro:IPR016187 SUPFAM:SSF56436
            GO:GO:0030198 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            eggNOG:NOG12793 GO:GO:0007412 GO:GO:0031290 GO:GO:0005201
            HOVERGEN:HBG004933 HOGENOM:HOG000085652 OrthoDB:EOG45DWPF
            Gene3D:2.170.240.10 GeneTree:ENSGT00690000101772 KO:K06237 CTD:1287
            OMA:MPMNMEP EMBL:CR354588 EMBL:CR936978 IPI:IPI00835382
            RefSeq:NP_001116702.1 UniGene:Dr.77841 SMR:B0UXF7
            Ensembl:ENSDART00000073827 GeneID:323561 KEGG:dre:323561
            NextBio:20808319 Uniprot:B0UXF7
        Length = 1659

 Score = 128 (50.1 bits), Expect = 0.00028, P = 0.00028
 Identities = 86/286 (30%), Positives = 101/286 (35%)

Query:   234 RRAADGSYG--GATGNSENETSGR-PVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-G-P 288
             R  A G  G  G  G +  E   R P GQ+      G P   GPP      G+ G+ G P
Sbjct:   637 RPGAPGLPGQKGEPGMTVPEVGERGPPGQDGDPGSQGRPGDSGPPGQPGFPGLPGSKGEP 696

Query:   289 NTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKA-PSYDPTKG-PSYDPAKGP 346
                                   P GPG      PG D     P    +KG P Y     P
Sbjct:   697 GLPGIGLPGPPGAKGFPGIAGSPGGPGIPGR--PGLDGLPGQPGLPGSKGDPGYGLPGPP 754

Query:   347 GYDPTKGPGYDAQKGSNYDAQRGPNYDI-HRGPSYDPQRGLGYDMQRGPNYD--MQRGPG 403
             G  PT  PG    KG       GP  D    G    P R  G D   GP  D     GPG
Sbjct:   755 G--PTGSPGI---KGGP-----GPKGDSGFPGSPGQPGRP-GLDGAPGPKGDAGFPGGPG 803

Query:   404 YE-TQRVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRG-QGYDMRRAPSYDPSRG-TG 459
                    P + +Q GP      AP  I   G PG + ++G +G      P +   RG +G
Sbjct:   804 PRGPPGAPAFGLQ-GPP-GPPGAPGSIGSPGVPGANGEKGDRGPPGLSTPGFQGDRGISG 861

Query:   460 FDGAPRGAAPHGQVP--PPLNNVPYGSATPPARSGSGQPRG--GNP 501
               G P    P G VP  P  + +P G        GS  P G  GNP
Sbjct:   862 LPGPPGPVGPPG-VPGRPGQDGLP-GLPGSKGEMGSMGPPGSKGNP 905


>ZFIN|ZDB-GENE-041221-2
            symbol:prnpb "prion protein b" species:7955 "Danio
            rerio" [GO:0051260 "protein homooligomerization" evidence=IEA]
            [GO:0016020 "membrane" evidence=IEA] [GO:0003674
            "molecular_function" evidence=ND] [GO:0016338 "calcium-independent
            cell-cell adhesion" evidence=IMP] [GO:0007156 "homophilic cell
            adhesion" evidence=IDA] [GO:0055113 "epiboly involved in
            gastrulation with mouth forming second" evidence=IGI;IMP]
            [GO:2000047 "regulation of cell-cell adhesion mediated by cadherin"
            evidence=IMP] [GO:0005886 "plasma membrane" evidence=IDA]
            [GO:0007417 "central nervous system development" evidence=IGI]
            [GO:0009986 "cell surface" evidence=IDA] InterPro:IPR022416
            ZFIN:ZDB-GENE-041221-2 GO:GO:0005886 GO:GO:0009986 GO:GO:0051260
            GO:GO:0007156 GO:GO:0055113 GO:GO:0016338 Gene3D:1.10.790.10
            SUPFAM:SSF54098 EMBL:AJ850286 IPI:IPI00485089 UniGene:Dr.90045
            ProteinModelPortal:Q5K0E1 PRIDE:Q5K0E1 HOVERGEN:HBG056090
            InParanoid:Q5K0E1 Bgee:Q5K0E1 GO:GO:2000047 Uniprot:Q5K0E1
        Length = 606

 Score = 123 (48.4 bits), Expect = 0.00029, P = 0.00029
 Identities = 90/294 (30%), Positives = 109/294 (37%)

Query:   230 PNVDRRAADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAG 287
             P      A GSY   G  G+S      +  G  +Y  G   P   G P      G    G
Sbjct:    87 PGAGSYPAGGSYPYPGRGGSSPGGYPNQNPGAGSYPSGGSYPSAGGNPNQYPGRGGYNPG 146

Query:   288 --PNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKG 345
               PN +  A +    G+   A  +  + PG   +   GY     P+ +P  G SY PA G
Sbjct:   147 GYPNQNPGAGSYPAGGSYPSAGGNPNQYPGRGGTSPAGY-----PNQNPGAG-SY-PAGG 199

Query:   346 PGYDPTKG-PG-YDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRG- 401
               Y    G P  Y  + GSN      PN +   G SY P  G  Y    G PN    RG 
Sbjct:   200 -SYPSAGGNPNQYPGRGGSNPGGY--PNQNPGAG-SY-PAGG-SYPSAGGNPNQYPGRGG 253

Query:   402 --PGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQR-GQ-GYDMRRAP---SYDP 454
               PG    + PG     G  Y     P+  P  G GY  Q  G+ GY     P   SY P
Sbjct:   254 SSPGGNPNQNPGAGTYAGGGY-----PNQYPGGG-GYSNQNPGRSGYSPGGYPGAGSY-P 306

Query:   455 SRGTGFDGAPRGAAPH--GQVPP--PLNNV--P-YGSATPPARSGSGQPRGGNP 501
              R  G  G   GA P   G  P   P N +  P YG +      G G   GG+P
Sbjct:   307 VRNAGQPGVYPGAHPSAGGGYPNWNPNNQILSPRYGGSF----GGGGFGTGGSP 356


>UNIPROTKB|F1PHY1
            symbol:COL1A2 "Collagen alpha-2(I) chain" species:9615
            "Canis lupus familiaris" [GO:0071230 "cellular response to amino
            acid stimulus" evidence=IEA] [GO:0070208 "protein
            heterotrimerization" evidence=IEA] [GO:0048407 "platelet-derived
            growth factor binding" evidence=IEA] [GO:0046332 "SMAD binding"
            evidence=IEA] [GO:0043589 "skin morphogenesis" evidence=IEA]
            [GO:0042802 "identical protein binding" evidence=IEA] [GO:0030674
            "protein binding, bridging" evidence=IEA] [GO:0030199 "collagen
            fibril organization" evidence=IEA] [GO:0008217 "regulation of blood
            pressure" evidence=IEA] [GO:0007266 "Rho protein signal
            transduction" evidence=IEA] [GO:0007179 "transforming growth factor
            beta receptor signaling pathway" evidence=IEA] [GO:0005615
            "extracellular space" evidence=IEA] [GO:0005584 "collagen type I"
            evidence=IEA] [GO:0001568 "blood vessel development" evidence=IEA]
            [GO:0001501 "skeletal system development" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 GO:GO:0005615 GO:GO:0030199 GO:GO:0001501
            GO:GO:0008217 GO:GO:0007179 GO:GO:0007266 GO:GO:0070208
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0001568 GO:GO:0071230
            GO:GO:0005201 GO:GO:0043589 GeneTree:ENSGT00660000095287
            GO:GO:0005584 OMA:TGPIGSA EMBL:AAEX03009315
            Ensembl:ENSCAFT00000031580 Uniprot:F1PHY1
        Length = 1366

 Score = 127 (49.8 bits), Expect = 0.00029, P = 0.00029
 Identities = 90/303 (29%), Positives = 110/303 (36%)

Query:   223 RAELMNAPNVDRRAADGSYGGATG-NSENETSGRP--VGQNAYEDGYGVPQGHGPPPSAT 279
             R E+   P V          GA G       +G P   G        G+P   G   +  
Sbjct:   282 RGEV-GLPGVSGPVGPPGNPGANGLTGAKGAAGLPGVAGAPGLPGPRGIPGPVGAAGATG 340

Query:   280 TAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGPGYDASKA-PSYDPTKG 337
               G+VG  P  + S   +   G P  A    P GP G E  +GP  +A  A PS  P  G
Sbjct:   341 ARGLVGE-PGPAGSKGESGNKGEPGSAGAQGPPGPSGEEGKRGPNGEAGSAGPSGPP--G 397

Query:   338 PSYDP-AKG-PGYDPTKGP-GYDAQKGSNYDAQ-RGPNYDIHRGPSYDPQRGLGYDMQRG 393
                 P ++G PG D   G  G    +G+   A  RGPN D  R P  +P    G    RG
Sbjct:   398 LRGSPGSRGLPGADGRAGVMGPPGPRGATGPAGVRGPNGDSGR-PG-EP----GLMGPRG 451

Query:   394 -PNYDMQRGP-GYE-TQRVPGYDVQRGPVYEA--QRAPSYIPQRGP-GYDLQRGQGYDMR 447
              P      GP G E    +PG D + GP+  A  +  P  I   GP G     G+  D  
Sbjct:   452 FPGAPGNVGPAGKEGPMGLPGIDGRPGPIGPAGARGEPGNIGFPGPKGPTGDPGKNGDKG 511

Query:   448 RAPSYDPSRGT-GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-----GNP 501
              A     +RG  G DG      P G           G A PP   G   P G     G P
Sbjct:   512 HA-GLAGARGAPGPDGNNGAQGPPGPQGVQGGKGEQGPAGPPGFQGLPGPAGTAGEVGKP 570

Query:   502 ARR 504
               R
Sbjct:   571 GER 573


>UNIPROTKB|F1NI79
            symbol:COL5A1 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 Gene3D:2.60.120.200 InterPro:IPR008985
            InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 InterPro:IPR001791 Pfam:PF02210 GO:GO:0005201
            SMART:SM00210 GeneTree:ENSGT00700000104155 EMBL:AADN02026433
            EMBL:AADN02026434 EMBL:AADN02026427 EMBL:AADN02026428
            EMBL:AADN02026429 EMBL:AADN02026430 EMBL:AADN02026431
            EMBL:AADN02026432 IPI:IPI00602965 Ensembl:ENSGALT00000004020
            ArrayExpress:F1NI79 Uniprot:F1NI79
        Length = 1702

 Score = 128 (50.1 bits), Expect = 0.00029, P = 0.00029
 Identities = 75/249 (30%), Positives = 96/249 (38%)

Query:   256 PVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRG- 313
             P+G    +   G P   GP  S    G  G AGP          Q G P  A     +G 
Sbjct:   930 PIGSPGLKGNEGPPGPPGPAGSPGERGPAGSAGP-IGLPGRPGPQ-GPPGPAG---EKGA 984

Query:   314 PGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRGP 370
             PG +  +GP G D  + P   P  GP+  P   PG D  KG  G   QKGS  D  ++GP
Sbjct:   985 PGEKGPQGPAGRDGIQGPVGLP--GPA-GPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGP 1041

Query:   371 NYDIHRGPSYDPQRGLGYDMQRGP-NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYI 429
                   GP+  PQ  +G   Q GP   D + GP  + Q + G     GP       P  +
Sbjct:  1042 P-----GPT-GPQGPIG---QPGPAGADGEPGPRGQ-QGLFGQKGDEGP-RGFPGPPGPV 1090

Query:   430 PQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPP 488
               +G PG   ++G+  D+ +     P    G  G P    P G      N    G    P
Sbjct:  1091 GLQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGPPGADGPQGPAGGIGNPGAVGEKGEP 1150

Query:   489 ARSGS-GQP 496
               SG  G P
Sbjct:  1151 GESGEPGLP 1159


>UNIPROTKB|E1BF96
            symbol:PPP1R10 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0072357 "PTW/PP1 phosphatase complex" evidence=IEA]
            [GO:0000785 "chromatin" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0003677 "DNA
            binding" evidence=IEA] InterPro:IPR000571 InterPro:IPR003617
            InterPro:IPR017923 Pfam:PF00642 Pfam:PF08711 PROSITE:PS50103
            PROSITE:PS51319 SMART:SM00356 SMART:SM00509 GO:GO:0005634
            GO:GO:0003677 GO:GO:0008270 GO:GO:0000785 GO:GO:0006351
            Gene3D:1.20.930.10 SUPFAM:SSF47676 GO:GO:0072357 CTD:5514
            OMA:PPPHEHR GeneTree:ENSGT00530000063820 EMBL:DAAA02055402
            IPI:IPI00698425 RefSeq:NP_001137335.1 UniGene:Bt.27784
            Ensembl:ENSBTAT00000009104 GeneID:510825 KEGG:bta:510825
            NextBio:20869636 Uniprot:E1BF96
        Length = 924

 Score = 125 (49.1 bits), Expect = 0.00030, P = 0.00030
 Identities = 71/271 (26%), Positives = 87/271 (32%)

Query:   239 GSYGGATGNSENETS-GRPV-GQNAYEDGYGVPQGH---GPPPSATTAGVVGAGPNTSTS 293
             G  GG  G        G P+ G +    G G P G    GPPP          GP     
Sbjct:   631 GGPGGPKGMQHFPPGPGGPMPGPHGGPGGPGGPVGPRLLGPPPPPRGGDPFWDGPGDP-- 688

Query:   294 AYAATQSGTPMRAAYDIPRGPG-YEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK 352
                    G PMR     P GPG Y   +G        P   P +G     + G   +   
Sbjct:   689 -----MRGGPMRGGPG-P-GPGPYHRGRGGRGGNEPPPPPPPFRGARGGRSGGGPPNGRG 741

Query:   353 GPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGY 412
             GPG     G  +    GP   ++ G  + P  G G  M  G  +    GPG       G+
Sbjct:   742 GPGGGMVGGGGHRPHEGPGGGMNSGSGHRPHEGPGSGM--GGGHRPHEGPGGSMGG--GH 797

Query:   413 DVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQ 472
                 GP         + P  GPG  +  G G+         P  G G  G P G  PH  
Sbjct:   798 RPHEGPGGGMGGGSGHRPHEGPGGGMGAGGGHR--------PHEGPGH-GGPHGHRPH-D 847

Query:   473 VPPPLNNVPYGSATPPARSGSGQPRGGNPAR 503
             VP    +   G      R   G   GG   R
Sbjct:   848 VPGHRGHDHRGPPPHEHRGHDGPGHGGGGHR 878

 Score = 121 (47.7 bits), Expect = 0.00081, P = 0.00081
 Identities = 49/192 (25%), Positives = 68/192 (35%)

Query:   243 GATGNSENETSGRPV-GQNAYEDGYGVPQGHGPPPSATTAGV-------VGAGPNTSTSA 294
             G  G +E      P  G      G G P G G P      G         G G N+ +  
Sbjct:   710 GGRGGNEPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGGGHRPHEGPGGGMNSGSGH 769

Query:   295 YAATQSGTPMRAAYDIPRGPG------YEASKGPGYDASKAPSYDPTKGPSYDPAKGPGY 348
                   G+ M   +    GPG      +   +GPG        + P +GP      G G+
Sbjct:   770 RPHEGPGSGMGGGHRPHEGPGGSMGGGHRPHEGPGGGMGGGSGHRPHEGPGGGMGAGGGH 829

Query:   349 DPTKGPGYDAQKGSN-YDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQ 407
              P +GPG+    G   +D      +D HRGP   P    G+D   GP +      G++  
Sbjct:   830 RPHEGPGHGGPHGHRPHDVPGHRGHD-HRGPP--PHEHRGHD---GPGHGGGGHRGHDGG 883

Query:   408 RVPGYDVQRGPV 419
                G D+   PV
Sbjct:   884 HSHGGDMSNRPV 895


>ZFIN|ZDB-GENE-030707-4
            symbol:anxa11a "annexin A11a" species:7955 "Danio
            rerio" [GO:0005509 "calcium ion binding" evidence=IEA] [GO:0005544
            "calcium-dependent phospholipid binding" evidence=IEA]
            InterPro:IPR001464 InterPro:IPR008157 InterPro:IPR018252
            InterPro:IPR018502 Pfam:PF00191 PRINTS:PR00196 PROSITE:PS00223
            SMART:SM00335 ZFIN:ZDB-GENE-030707-4 GO:GO:0005509 eggNOG:NOG267770
            GO:GO:0005544 Gene3D:1.10.220.10 PANTHER:PTHR10502 SUPFAM:SSF47874
            HOVERGEN:HBG061815 PANTHER:PTHR10502:SF29 HSSP:P79134 EMBL:AY178801
            IPI:IPI00498021 UniGene:Dr.77310 ProteinModelPortal:Q804G4
            SMR:Q804G4 PRIDE:Q804G4 InParanoid:Q804G4 NextBio:20812811
            ArrayExpress:Q804G4 Bgee:Q804G4 Uniprot:Q804G4
        Length = 526

 Score = 122 (48.0 bits), Expect = 0.00030, P = 0.00030
 Identities = 58/201 (28%), Positives = 73/201 (36%)

Query:   301 GTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQK 360
             G P ++ Y  P+G GY     PG     A  Y P  G  Y P  G GY P  G  Y  Q 
Sbjct:     5 GYPPQSGYP-PQGGGYPPQ--PGAYPPAAGGYPPQPG-MYPPQAG-GYPPQPG-AYPPQP 58

Query:   361 GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVY 420
             G+ +  Q G    +  G    P   +G D    P ++     G   Q          P  
Sbjct:    59 GA-FPGQPGQYPSVPSGGWGAP---IGLDNLPNPGFNASNIQGMANQFAADGGFAPNPSM 114

Query:   421 EAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNV 480
              +   P   PQ G    +   Q Y M   P     +  G  G P G  P GQ  P   N+
Sbjct:   115 FSGGYPG--PQPGGPPAVSPNQPYGMYPQPGGGMPQNPGM-GYP-GGPPPGQQMPSYPNI 170

Query:   481 PYGSATPPARSGSGQPRGGNP 501
             P  + TP   SG   PR  +P
Sbjct:   171 P--APTP---SGPSYPRAPSP 186


>UNIPROTKB|F1NR01
            symbol:COL5A1 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 Gene3D:2.60.120.200 InterPro:IPR008985
            InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 InterPro:IPR001791 SMART:SM00282 Pfam:PF02210
            GO:GO:0005201 SMART:SM00210 GeneTree:ENSGT00700000104155
            EMBL:AADN02026433 EMBL:AADN02026434 EMBL:AADN02026427
            EMBL:AADN02026428 EMBL:AADN02026429 EMBL:AADN02026430
            EMBL:AADN02026431 EMBL:AADN02026432 IPI:IPI00822317
            Ensembl:ENSGALT00000039037 ArrayExpress:F1NR01 Uniprot:F1NR01
        Length = 1773

 Score = 128 (50.1 bits), Expect = 0.00030, P = 0.00030
 Identities = 75/249 (30%), Positives = 96/249 (38%)

Query:   256 PVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRG- 313
             P+G    +   G P   GP  S    G  G AGP          Q G P  A     +G 
Sbjct:  1001 PIGSPGLKGNEGPPGPPGPAGSPGERGPAGSAGP-IGLPGRPGPQ-GPPGPAG---EKGA 1055

Query:   314 PGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRGP 370
             PG +  +GP G D  + P   P  GP+  P   PG D  KG  G   QKGS  D  ++GP
Sbjct:  1056 PGEKGPQGPAGRDGIQGPVGLP--GPA-GPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGP 1112

Query:   371 NYDIHRGPSYDPQRGLGYDMQRGP-NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYI 429
                   GP+  PQ  +G   Q GP   D + GP  + Q + G     GP       P  +
Sbjct:  1113 P-----GPT-GPQGPIG---QPGPAGADGEPGPRGQ-QGLFGQKGDEGP-RGFPGPPGPV 1161

Query:   430 PQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPP 488
               +G PG   ++G+  D+ +     P    G  G P    P G      N    G    P
Sbjct:  1162 GLQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGPPGADGPQGPAGGIGNPGAVGEKGEP 1221

Query:   489 ARSGS-GQP 496
               SG  G P
Sbjct:  1222 GESGEPGLP 1230


>UNIPROTKB|F1NR03
            symbol:COL5A1 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 Gene3D:2.60.120.200 InterPro:IPR008985
            InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 InterPro:IPR001791 SMART:SM00282 Pfam:PF02210
            GO:GO:0005201 SMART:SM00210 GeneTree:ENSGT00700000104155
            EMBL:AADN02026433 EMBL:AADN02026434 EMBL:AADN02026427
            EMBL:AADN02026428 EMBL:AADN02026429 EMBL:AADN02026430
            EMBL:AADN02026431 EMBL:AADN02026432 IPI:IPI00818113
            Ensembl:ENSGALT00000039034 ArrayExpress:F1NR03 Uniprot:F1NR03
        Length = 1804

 Score = 128 (50.1 bits), Expect = 0.00031, P = 0.00031
 Identities = 75/249 (30%), Positives = 96/249 (38%)

Query:   256 PVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRG- 313
             P+G    +   G P   GP  S    G  G AGP          Q G P  A     +G 
Sbjct:  1032 PIGSPGLKGNEGPPGPPGPAGSPGERGPAGSAGP-IGLPGRPGPQ-GPPGPAG---EKGA 1086

Query:   314 PGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRGP 370
             PG +  +GP G D  + P   P  GP+  P   PG D  KG  G   QKGS  D  ++GP
Sbjct:  1087 PGEKGPQGPAGRDGIQGPVGLP--GPA-GPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGP 1143

Query:   371 NYDIHRGPSYDPQRGLGYDMQRGP-NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYI 429
                   GP+  PQ  +G   Q GP   D + GP  + Q + G     GP       P  +
Sbjct:  1144 P-----GPT-GPQGPIG---QPGPAGADGEPGPRGQ-QGLFGQKGDEGP-RGFPGPPGPV 1192

Query:   430 PQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPP 488
               +G PG   ++G+  D+ +     P    G  G P    P G      N    G    P
Sbjct:  1193 GLQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGPPGADGPQGPAGGIGNPGAVGEKGEP 1252

Query:   489 ARSGS-GQP 496
               SG  G P
Sbjct:  1253 GESGEPGLP 1261


>UNIPROTKB|F1NR02
            symbol:COL5A1 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0001568 "blood vessel development"
            evidence=IEA] [GO:0003007 "heart morphogenesis" evidence=IEA]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA] [GO:0005588 "collagen type V" evidence=IEA]
            [GO:0005604 "basement membrane" evidence=IEA] [GO:0007155 "cell
            adhesion" evidence=IEA] [GO:0008201 "heparin binding" evidence=IEA]
            [GO:0030199 "collagen fibril organization" evidence=IEA]
            [GO:0032964 "collagen biosynthetic process" evidence=IEA]
            [GO:0035313 "wound healing, spreading of epidermal cells"
            evidence=IEA] [GO:0043206 "extracellular fibril organization"
            evidence=IEA] [GO:0043394 "proteoglycan binding" evidence=IEA]
            [GO:0043588 "skin development" evidence=IEA] [GO:0045112 "integrin
            biosynthetic process" evidence=IEA] [GO:0048407 "platelet-derived
            growth factor binding" evidence=IEA] [GO:0048592 "eye
            morphogenesis" evidence=IEA] [GO:0051128 "regulation of cellular
            component organization" evidence=IEA] InterPro:IPR000885
            Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461 SMART:SM00038
            GO:GO:0030199 GO:GO:0008201 GO:GO:0007155 Gene3D:2.60.120.200
            InterPro:IPR008985 InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0035313
            InterPro:IPR008160 Pfam:PF01391 InterPro:IPR001791 SMART:SM00282
            GO:GO:0005604 GO:GO:0043206 Pfam:PF02210 GO:GO:0005201 OMA:TIYEGIG
            GO:GO:0005588 GO:GO:0045112 GO:GO:0051128 SMART:SM00210
            GeneTree:ENSGT00700000104155 EMBL:AADN02026433 EMBL:AADN02026434
            EMBL:AADN02026427 EMBL:AADN02026428 EMBL:AADN02026429
            EMBL:AADN02026430 EMBL:AADN02026431 EMBL:AADN02026432
            IPI:IPI00821684 Ensembl:ENSGALT00000039035 ArrayExpress:F1NR02
            Uniprot:F1NR02
        Length = 1815

 Score = 128 (50.1 bits), Expect = 0.00031, P = 0.00031
 Identities = 75/249 (30%), Positives = 96/249 (38%)

Query:   256 PVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRG- 313
             P+G    +   G P   GP  S    G  G AGP          Q G P  A     +G 
Sbjct:  1043 PIGSPGLKGNEGPPGPPGPAGSPGERGPAGSAGP-IGLPGRPGPQ-GPPGPAG---EKGA 1097

Query:   314 PGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRGP 370
             PG +  +GP G D  + P   P  GP+  P   PG D  KG  G   QKGS  D  ++GP
Sbjct:  1098 PGEKGPQGPAGRDGIQGPVGLP--GPA-GPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGP 1154

Query:   371 NYDIHRGPSYDPQRGLGYDMQRGP-NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYI 429
                   GP+  PQ  +G   Q GP   D + GP  + Q + G     GP       P  +
Sbjct:  1155 P-----GPT-GPQGPIG---QPGPAGADGEPGPRGQ-QGLFGQKGDEGP-RGFPGPPGPV 1203

Query:   430 PQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPP 488
               +G PG   ++G+  D+ +     P    G  G P    P G      N    G    P
Sbjct:  1204 GLQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGPPGADGPQGPAGGIGNPGAVGEKGEP 1263

Query:   489 ARSGS-GQP 496
               SG  G P
Sbjct:  1264 GESGEPGLP 1272


>DICTYBASE|DDB_G0279193
            symbol:rpb1 "RNA polymerase II core subunit"
            species:44689 "Dictyostelium discoideum" [GO:0005634 "nucleus"
            evidence=IEA;IDA] [GO:0006366 "transcription from RNA polymerase II
            promoter" evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0005665 "DNA-directed RNA polymerase II, core
            complex" evidence=IEA;ISS] [GO:0003899 "DNA-directed RNA polymerase
            activity" evidence=IEA;ISS] [GO:0003677 "DNA binding" evidence=IEA]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=ISS] [GO:0046872 "metal ion binding" evidence=IEA]
            [GO:0016779 "nucleotidyltransferase activity" evidence=IEA]
            [GO:0016740 "transferase activity" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR000684
            InterPro:IPR000722 InterPro:IPR006592 InterPro:IPR007066
            InterPro:IPR007073 InterPro:IPR007075 InterPro:IPR007080
            InterPro:IPR007081 InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983
            Pfam:PF04990 Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000
            Pfam:PF05001 PROSITE:PS00115 SMART:SM00663 dictyBase:DDB_G0279193
            GO:GO:0006355 GenomeReviews:CM000152_GR GO:GO:0046872 GO:GO:0003677
            GO:GO:0006366 Gene3D:2.40.40.20 InterPro:IPR009010
            EMBL:AAFI02000030 GO:GO:0003899 eggNOG:COG0086 GO:GO:0005665
            OMA:KVLPWST EMBL:S52651 PIR:A56823 RefSeq:XP_641735.1 STRING:P35084
            PRIDE:P35084 EnsemblProtists:DDB0215406 GeneID:8621932
            KEGG:ddi:DDB_G0279193 KO:K03006 ProtClustDB:CLSZ2428993
            Uniprot:P35084
        Length = 1727

 Score = 135 (52.6 bits), Expect = 0.00031, Sum P(2) = 0.00031
 Identities = 65/219 (29%), Positives = 85/219 (38%)

Query:   288 PNTSTSAYA-ATQSGTPMRAAYDIPRGPGYEASKG---------PGYDASKA--PSYDP- 334
             P + T +Y+    S TP    YD P  P  E  +G         PGY+A+K+   SY   
Sbjct:  1488 PGSQTPSYSYGDGSTTPFHNPYDAPLSPFNETFRGDFSPSAMNSPGYNANKSYGSSYQYF 1547

Query:   335 TKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGP 394
              + P+Y P   P Y PT  P Y     S Y +   P+Y     PSY P     Y     P
Sbjct:  1548 PQSPTYSPTS-PSYSPTS-PSYSPTSPS-Y-SPTSPSYS-PTSPSYSPTSP-SYS-PTSP 1600

Query:   395 NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDP 454
              Y     P Y     P Y     P Y +  +PSY P   P Y       Y    +PSY P
Sbjct:  1601 FYS-PTSPSYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSP 1653

Query:   455 SRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS 493
             +  +    +P   +P      P +  P  S T P+ S S
Sbjct:  1654 TSPSYSPTSP-SYSPTSPSYSPTS--PSYSPTSPSYSPS 1689

 Score = 42 (19.8 bits), Expect = 0.00031, Sum P(2) = 0.00031
 Identities = 13/50 (26%), Positives = 23/50 (46%)

Query:   195 KKFYNDHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRRAADGSYGGA 244
             +K +N  ++  +V + N   +  E+EKL A L      D    D ++  A
Sbjct:   978 QKLFN--IDIRRVSDLNPAVVVLEIEKLVARLKIIATADTTEDDENFNRA 1025


>UNIPROTKB|E9PQW6
            symbol:ARID1A "AT-rich interactive domain-containing
            protein 1A" species:9606 "Homo sapiens" [GO:0006325 "chromatin
            organization" evidence=IEA] [GO:0016514 "SWI/SNF complex"
            evidence=IEA] [GO:0071564 "npBAF complex" evidence=IEA] [GO:0071565
            "nBAF complex" evidence=IEA] EMBL:AL034380 GO:GO:0016514
            EMBL:AL512408 HGNC:HGNC:11110 ChiTaRS:ARID1A GO:GO:0006325
            IPI:IPI00979164 Ensembl:ENST00000524572 ArrayExpress:E9PQW6
            Bgee:E9PQW6 Uniprot:E9PQW6
        Length = 123

 Score = 98 (39.6 bits), Expect = 0.00032, P = 0.00032
 Identities = 36/108 (33%), Positives = 47/108 (43%)

Query:   340 YDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ 399
             Y   +GP   P +G GY  Q   +   QR P     +G +     GL Y  Q  P Y  Q
Sbjct:    18 YSQQQGPPSGPQQGHGYPGQPYGSQTPQRYPM--TMQGRAQSAMGGLSYTQQIPP-YG-Q 73

Query:   400 RGP-GYETQ-RVPGYDVQ------RGPVYEAQRAPSYIPQRGPGYDLQ 439
             +GP GY  Q + P Y+ Q      + P Y +Q+ PS  P   P Y  Q
Sbjct:    74 QGPSGYGQQGQTPYYNQQSPHPQQQQPPY-SQQPPSQTPHAQPSYQQQ 120


>ZFIN|ZDB-GENE-030707-5
            symbol:anxa11b "annexin A11b" species:7955 "Danio
            rerio" [GO:0005509 "calcium ion binding" evidence=IEA] [GO:0005544
            "calcium-dependent phospholipid binding" evidence=IEA]
            InterPro:IPR001464 InterPro:IPR008157 InterPro:IPR018252
            InterPro:IPR018502 Pfam:PF00191 PRINTS:PR00196 PROSITE:PS00223
            SMART:SM00335 ZFIN:ZDB-GENE-030707-5 GO:GO:0005509 eggNOG:NOG267770
            GO:GO:0005544 Gene3D:1.10.220.10 PANTHER:PTHR10502 SUPFAM:SSF47874
            HOGENOM:HOG000158803 HOVERGEN:HBG061815 PANTHER:PTHR10502:SF29
            OrthoDB:EOG4Z0B60 InterPro:IPR013286 PRINTS:PR01871 HSSP:P79134
            EMBL:BC068366 EMBL:AY178802 IPI:IPI00484212 RefSeq:NP_861431.1
            UniGene:Dr.76267 SMR:Q804G3 STRING:Q804G3 GeneID:353365
            KEGG:dre:353365 CTD:353365 NextBio:20812741 Uniprot:Q804G3
        Length = 485

 Score = 121 (47.7 bits), Expect = 0.00034, P = 0.00034
 Identities = 59/175 (33%), Positives = 71/175 (40%)

Query:   330 PSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYD 389
             P Y P  G SY PA GP   P  G  Y  Q G+ Y  Q G  Y    G ++ PQ G  + 
Sbjct:     4 PGYPPAGG-SYPPASGPYQQPAAG--YPPQPGA-YPPQAG-YYPPQPG-AFPPQPG-AFP 56

Query:   390 MQRG--P---NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSY--IPQRG-----PGYD 437
              Q G  P    Y  Q G GY      G+  Q G  Y A +  +Y  +P  G     PG+ 
Sbjct:    57 PQPGAFPPGAGYPPQAG-GYPAAPGGGFPPQAGG-YPAAQPGAYPNMPAAGGWGGHPGFG 114

Query:   438 LQRG---QGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPA 489
                G   QGY    AP   P     + GAP    P+  +P      P G  TPPA
Sbjct:   115 APAGGMPQGYPGVPAPGQQPM--PAYPGAP---VPNPGMPGYGGGAPTGP-TPPA 163


>UNIPROTKB|P02812
            symbol:PRB2 "Basic salivary proline-rich protein 2"
            species:9606 "Homo sapiens" [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0003674 "molecular_function" evidence=ND]
            [GO:0005575 "cellular_component" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] GO:GO:0005576 EMBL:AC078950
            EMBL:BX484538 EMBL:S80905 EMBL:K03208 IPI:IPI00552432 PIR:B40750
            PIR:E25372 UniGene:Hs.654486 STRING:P02812 DMDM:160409933
            PaxDb:P02812 PRIDE:P02812 Ensembl:ENST00000389362 UCSC:uc010shk.1
            GeneCards:GC12M011544 HGNC:HGNC:9338 MIM:168810 neXtProt:NX_P02812
            ArrayExpress:P02812 Bgee:P02812 CleanEx:HS_PRB2
            Genevestigator:P02812 GermOnline:ENSG00000173342 InterPro:IPR026086
            PANTHER:PTHR23203 Uniprot:P02812
        Length = 416

 Score = 120 (47.3 bits), Expect = 0.00035, P = 0.00035
 Identities = 69/257 (26%), Positives = 88/257 (34%)

Query:   253 SGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYA--ATQSGTPMRAAYDI 310
             +G P  Q A   G   PQG  P P     G    G N             G P +   + 
Sbjct:    33 AGNP--QGAPPQGGNKPQGP-PSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPPQGG-NK 88

Query:   311 PRGP---GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQ 367
             P+GP   G      P  D S++P   P K P   P +G G  P +GP     K      Q
Sbjct:    89 PQGPPPPGKPQGPPPQGDKSRSPRSPPGK-PQGPPPQG-GNQP-QGPPPPPGKPQGPPPQ 145

Query:   368 RGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPS 427
              G      +GP   P +  G   Q        R P  + Q  P    Q G   +    P 
Sbjct:   146 GGNK---PQGPP-PPGKPQGPPPQGDNKSRSSRSPPGKPQGPPP---QGGNQPQGPPPPP 198

Query:   428 YIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLN-NVPYGSAT 486
               PQ  P     + QG      P   P +G     + R      Q PPP   N P G   
Sbjct:   199 GKPQGPPPQGGNKPQGPPPPGKPQGPPPQGDNKSQSARSPPGKPQGPPPQGGNQPQGPPP 258

Query:   487 PPARSGSGQPRGGNPAR 503
             PP +     P+GGN ++
Sbjct:   259 PPGKPQGPPPQGGNKSQ 275

 Score = 118 (46.6 bits), Expect = 0.00058, P = 0.00058
 Identities = 76/272 (27%), Positives = 99/272 (36%)

Query:   246 GNSENETSGRPVG--QNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTP 303
             G++++ +S  P G  Q     G   PQG  PPP        G  P            G P
Sbjct:   166 GDNKSRSSRSPPGKPQGPPPQGGNQPQGPPPPPGKPQ----GPPPQGGNKPQGPPPPGKP 221

Query:   304 MRAAYDIPRGPGY-EASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGS 362
                    P+G    ++++ P     K P   P +G +  P +GP   P K  G   Q G+
Sbjct:   222 QGPP---PQGDNKSQSARSP---PGK-PQGPPPQGGN-QP-QGPPPPPGKPQGPPPQGGN 272

Query:   363 NYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP---GYETQRVPGYDVQ-RGP 418
                +Q  P     +GP   PQ G      R P    Q  P   G + Q  P    + +GP
Sbjct:   273 K--SQGPPPPGKPQGPP--PQGGSKSRSSRSPPGKPQGPPPQGGNQPQGPPPPPGKPQGP 328

Query:   419 VYEAQRAPSYIPQRG-P-GYDLQRGQGYDMRRAPSYDPSRGTGFDGAPR--GAAPHGQVP 474
               +    P   P  G P G   Q G      R+P   P       G P+  G  P G  P
Sbjct:   329 PPQGGNKPQGPPPPGKPQGPPPQGGSKSRSARSPPGKPQ------GPPQQEGNNPQGP-P 381

Query:   475 PPLNNVPYGSATPPARSGSGQPR---GGNPAR 503
             PP    P     PPA    G PR   GG P+R
Sbjct:   382 PPAGGNPQQPQAPPAGQPQGPPRPPQGGRPSR 413


>ZFIN|ZDB-GENE-030516-3
            symbol:col18a1 "collagen type XVIII, alpha 1"
            species:7955 "Danio rerio" [GO:0031012 "extracellular matrix"
            evidence=IEA] [GO:0005198 "structural molecule activity"
            evidence=IEA] [GO:0007155 "cell adhesion" evidence=IEA] [GO:0005581
            "collagen" evidence=IEA] InterPro:IPR010515 InterPro:IPR020067
            Pfam:PF01392 Pfam:PF06482 PROSITE:PS50038 ZFIN:ZDB-GENE-030516-3
            GO:GO:0005198 Gene3D:3.10.100.10 InterPro:IPR016186
            InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0007155 InterPro:IPR008985
            SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            InterPro:IPR001791 SMART:SM00282 Gene3D:1.10.2000.10
            SUPFAM:SSF63501 SMART:SM00210 GeneTree:ENSGT00700000104250
            HOGENOM:HOG000231591 HOVERGEN:HBG053241 EMBL:BX927363 EMBL:CT030212
            IPI:IPI00616856 UniGene:Dr.52833 SMR:B0S8G4
            Ensembl:ENSDART00000130434 OMA:DRFNRYD Uniprot:B0S8G4
        Length = 1645

 Score = 127 (49.8 bits), Expect = 0.00035, P = 0.00035
 Identities = 74/274 (27%), Positives = 99/274 (36%)

Query:   239 GSYGGATGNSENETSGRPVGQNAYE--DGYGVPQGHGP--PPSATTAGVVGA-GPNTSTS 293
             GS G  +G       G P G+   +   G+G P   G   PP     G  G  GP   ++
Sbjct:   613 GS-GSVSGGGSKGDKGVP-GEKGMKGTSGFGYPGSKGDRGPP-----GPPGPPGPQGPSA 665

Query:   294 AYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGYDPT 351
                    G+ ++     PRGP G +   GP G +       +  K     P+  PG    
Sbjct:   666 EVEVRGDGSVVQKVTG-PRGPPGPQGPPGPPGPEGEPGDPGEDGKAGQVGPSGFPGNPGN 724

Query:   352 KGP-GYDAQKGSNYDAQRGP-NYDIHRGPSYDPQRGLGYDMQRGPNYDMQ--RG-PGYET 406
              GP G    +G +    RGP       GPS    R    DM+ G  +DM   R  PG   
Sbjct:   725 PGPKGDKGDRGESQPGPRGPPGPPGPPGPSSGFDRPTFVDME-GSGFDMDSVRAVPGLPG 783

Query:   407 QRVPGYDVQRGPVYEAQRAPS-YIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPR 465
                PG     GP   A      + P   PG +   GQ   +   P  D   G       +
Sbjct:   784 P--PGPPGPPGPPGSASSGSGGFGPPGPPGQNGAPGQP-GLSGVPGADGKPGLPGPKGEK 840

Query:   466 GAAPHGQVPPPLNNV-PYGSATPPARSGSGQPRG 498
             G A    +P P+      GS+ PP  +G G P G
Sbjct:   841 GDAGELGLPGPVGEKGAKGSSGPPGTTGIGGPAG 874


>UNIPROTKB|F1Q0F7
            symbol:COL4A5 "Collagen alpha-5(IV) chain" species:9615
            "Canis lupus familiaris" [GO:0005581 "collagen" evidence=IEA]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA] InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403
            SMART:SM00111 InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 Gene3D:2.170.240.10
            GeneTree:ENSGT00690000101772 EMBL:AAEX03026757 EMBL:AAEX03026761
            EMBL:AAEX03026758 EMBL:AAEX03026759 EMBL:AAEX03026760
            Ensembl:ENSCAFT00000018078 Uniprot:F1Q0F7
        Length = 1678

 Score = 127 (49.8 bits), Expect = 0.00036, P = 0.00036
 Identities = 59/197 (29%), Positives = 72/197 (36%)

Query:   311 PRGPGYEASKGP--GYDASKAPSYDPTK-G-PSYDPAKG-PGYDPTKG-PGYDAQKGSNY 364
             P  PG     GP  G    K    +P K G P  D   G PG     G PGY  + G   
Sbjct:   269 PGPPGIRGPPGPPGGMKGEKGEQGEPGKRGKPGKDGENGQPGIPGLPGDPGYPGEPGR-- 326

Query:   365 DAQRGPNYDI-HRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQR-VPGYDVQRG-PVYE 421
             D ++G   DI   GP    + G G  +    N  +   PG + +R  PG     G P   
Sbjct:   327 DGEKGQKGDIGSTGPPGLSKPGTGVTVGEKGNMGLPGLPGEKGERGFPGIQGPPGLPGPP 386

Query:   422 AQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVP 481
                     P   PG+  +RGQ  D    P        G DG P      G   PP    P
Sbjct:   387 VLGTAVMGPPGPPGFPGERGQKGD-EGPPGISIPGFPGLDGQPGAPGLRGPPGPP---GP 442

Query:   482 YGSATPPARSGSGQPRG 498
             + S +PP   GS   RG
Sbjct:   443 HISPSPPGPPGSPGDRG 459


>UNIPROTKB|E1BC70
            symbol:VPS37C "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR009851
            Pfam:PF07200 PROSITE:PS51314 KO:K12185 GeneTree:ENSGT00530000063123
            CTD:55048 OMA:VERCQEQ EMBL:DAAA02063396 IPI:IPI00692039
            RefSeq:NP_001193079.1 UniGene:Bt.105953 Ensembl:ENSBTAT00000010607
            GeneID:613817 KEGG:bta:613817 NextBio:20898788 Uniprot:E1BC70
        Length = 350

 Score = 91 (37.1 bits), Expect = 0.00037, Sum P(2) = 0.00037
 Identities = 61/196 (31%), Positives = 71/196 (36%)

Query:   326 ASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRG 385
             AS  P+ D T  P   P   PG   T  P  DAQ      +   P Y +   P Y P  G
Sbjct:   162 ASLEPAGD-TPPPRPPPPLHPGPQTTPPPAEDAQPQPPQPSVVPP-YPL---P-YSPSPG 215

Query:   386 LGYDMQRGPNYDMQRGPG-YETQRVPG--YDVQRGPVYEAQ----RAPS---YIPQRG-- 433
                 M  GP       P  +     P   Y    GP Y A     RAPS   + PQR   
Sbjct:   216 ----MPVGPTAHGALPPAPFPVVSQPSFSYSGPLGPPYAAAQPGTRAPSGYSWSPQRSMP 271

Query:   434 --PGYDLQ----RGQGYDM--RRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSA 485
               PGY +      G GY +   RAPS  P    G+   P   +  G+ P P    P G  
Sbjct:   272 PRPGYPVAPTGASGPGYPVVGGRAPS--P----GYPQQPPYLSTGGKPPYPTQPQPSGPL 325

Query:   486 TPPARSGSGQPRGGNP 501
              PP   G   P G  P
Sbjct:   326 QPPYPPGPAPPYGFPP 341

 Score = 71 (30.1 bits), Expect = 0.00037, Sum P(2) = 0.00037
 Identities = 31/144 (21%), Positives = 66/144 (45%)

Query:    43 DMMPPPEVMEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKS 102
             +M   PE ++ ++A    E+Q L  E +   AT+ +L +     Q  L+I    +    S
Sbjct:    14 EMQNDPEAID-RLAQDSPEVQDLQLEREMALATNRSLAERNLEFQGPLEISRSNL----S 68

Query:   103 ERELQMRNLTEKIAKMEAEL-KTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQ 161
             ++  ++R L E+  + +A+L K +  ++L       + +++ +  EE  A   +  +   
Sbjct:    69 DKYQELRKLVERYQEQKAKLEKFSSALQLGTLLDLLQIESMKI-EEESEAMAEKFLEGEV 127

Query:   162 RAHTDVQQIPAL--LSELESLRQE 183
                T ++   ++  LS L  +R E
Sbjct:   128 PLDTFLENFSSMRTLSHLRRVRVE 151


>UNIPROTKB|F1LRM7
            symbol:Col2a1 "Collagen alpha-1(II) chain" species:10116
            "Rattus norvegicus" [GO:0001502 "cartilage condensation"
            evidence=IEA] [GO:0001894 "tissue homeostasis" evidence=IEA]
            [GO:0001958 "endochondral ossification" evidence=IEA] [GO:0002062
            "chondrocyte differentiation" evidence=IEA] [GO:0003007 "heart
            morphogenesis" evidence=IEA] [GO:0005201 "extracellular matrix
            structural constituent" evidence=IEA] [GO:0005585 "collagen type
            II" evidence=IEA] [GO:0005604 "basement membrane" evidence=IEA]
            [GO:0005615 "extracellular space" evidence=IEA] [GO:0005737
            "cytoplasm" evidence=IEA] [GO:0006029 "proteoglycan metabolic
            process" evidence=IEA] [GO:0007417 "central nervous system
            development" evidence=IEA] [GO:0007601 "visual perception"
            evidence=IEA] [GO:0007605 "sensory perception of sound"
            evidence=IEA] [GO:0010468 "regulation of gene expression"
            evidence=IEA] [GO:0030199 "collagen fibril organization"
            evidence=IEA] [GO:0030903 "notochord development" evidence=IEA]
            [GO:0042472 "inner ear morphogenesis" evidence=IEA] [GO:0042802
            "identical protein binding" evidence=IEA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IEA] [GO:0048407
            "platelet-derived growth factor binding" evidence=IEA] [GO:0060021
            "palate development" evidence=IEA] [GO:0060174 "limb bud formation"
            evidence=IEA] [GO:0060272 "embryonic skeletal joint morphogenesis"
            evidence=IEA] [GO:0060351 "cartilage development involved in
            endochondral bone morphogenesis" evidence=IEA] [GO:0071599 "otic
            vesicle development" evidence=IEA] [GO:0071773 "cellular response
            to BMP stimulus" evidence=IEA] InterPro:IPR000885 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS51461 SMART:SM00038 RGD:2375
            GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201
            GeneTree:ENSGT00660000095287 IPI:IPI00394380
            Ensembl:ENSRNOT00000016044 ArrayExpress:F1LRM7 Uniprot:F1LRM7
        Length = 1419

 Score = 126 (49.4 bits), Expect = 0.00039, P = 0.00039
 Identities = 89/301 (29%), Positives = 117/301 (38%)

Query:   235 RAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPP-PSATTA--GVVGA-GPNT 290
             R A G   G  G+  +     P G    + G G+    GPP P+      G VG  GP+ 
Sbjct:   687 RGAAG-IAGPKGDRGDVGEKGPEGAPGKDGGRGLTGPIGPPGPAGANGEKGEVGPPGPSG 745

Query:   291 STSAYAAT----QSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPA 343
             ST A  A     ++G P  A +  P G  G   +KG  G    K  +  P  +GPS  P 
Sbjct:   746 STGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGDQGEAGQKGDAGAPGPQGPSGAPG 805

Query:   344 -KGP-GYDPTKGP-GYDAQKGSN-YDAQRG----PNYDIHRGPSYDPQRGLGYDMQRGPN 395
              +GP G    KG  G     G+  +    G    P  + + GP+  P    G D  +G  
Sbjct:   806 PQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGPPGSNGNPGPA-GPPGPAGKDGPKGAR 864

Query:   396 YDM----QRG-PGYETQR-VPGYDVQRG---PV-YEAQRAPSYIP-QRG-PGYDLQRGQ- 442
              D     + G PG +     PG   + G   P   +    P  +  QRG  G   QRG+ 
Sbjct:   865 GDTGAPGRAGDPGLQGPAGAPGEKGEPGDDGPSGSDGPPGPQGLAGQRGIVGLPGQRGER 924

Query:   443 GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPA 502
             G+     PS +P +  G  GA     P G V PP    P G    P R GS     G P 
Sbjct:   925 GFPGLPGPSGEPGK-QGAPGASGDRGPPGPVGPPGLTGPAGE---PGREGS-PGADGPPG 979

Query:   503 R 503
             R
Sbjct:   980 R 980

 Score = 125 (49.1 bits), Expect = 0.00049, P = 0.00049
 Identities = 87/281 (30%), Positives = 99/281 (35%)

Query:   237 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 294
             ADG  G      E    G   G    +   G P   GP       G  GA GP  +T   
Sbjct:   773 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 831

Query:   295 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 350
              AA + G P       P GP      GP G D  K    D    G + DP  +GP   P 
Sbjct:   832 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 886

Query:   351 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 406
              KG PG D   GS  D   GP     +G +   QRG+ G   QRG   +    GP  E  
Sbjct:   887 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 937

Query:   407 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 462
              Q  PG    RGP           P   PG +   G      R  A      RG TG  G
Sbjct:   938 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 997

Query:   463 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 502
             AP    P G  P P    P G       +G+  P G   PA
Sbjct:   998 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1035

 Score = 123 (48.4 bits), Expect = 0.00081, P = 0.00081
 Identities = 89/297 (29%), Positives = 110/297 (37%)

Query:   230 PNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSAT---TAGVVGA 286
             P  DR   D    GA G    +  G P G        G P   GPP        A + G 
Sbjct:    64 PRGDR--GDKGERGAPGPRGRD--GEP-GTPGNPGPPGPPGPPGPPGLGGGNFAAQMAGG 118

Query:   287 GPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPA 343
                 +  A      G PM      PRGP G   + GP G+  +     +P   GP   P 
Sbjct:   119 FDEKAGGAQMGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GPR 174

Query:   344 KGPGYDPTKGPGYDAQKGSNYDA-QRG-PNYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQ 399
               PG  P   PG D + G    A +RG P     RG    P  GL G    RG P  D  
Sbjct:   175 GPPG--PAGKPGDDGEAGKPGKAGERGLPGPQGARGFPGTP--GLPGVKGHRGYPGLDGA 230

Query:   400 RG----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS--- 451
             +G    PG + +   PG +   GP+   +  P    + GP       +G D +  P+   
Sbjct:   231 KGEAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPP 288

Query:   452 --YDPSRGTGFDGAP--RG-AAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNP 501
                 P+ G GF GAP  +G A P G   P       GS   P   GS  P G  GNP
Sbjct:   289 GPVGPAGGPGFPGAPGAKGEAGPTGARGP---EGAQGSRGEPGNPGSPGPAGASGNP 342


>RGD|2375
            symbol:Col2a1 "collagen, type II, alpha 1" species:10116 "Rattus
          norvegicus" [GO:0001501 "skeletal system development" evidence=ISO]
          [GO:0001502 "cartilage condensation" evidence=ISO] [GO:0001894
          "tissue homeostasis" evidence=ISO] [GO:0001958 "endochondral
          ossification" evidence=ISO] [GO:0002062 "chondrocyte differentiation"
          evidence=ISO] [GO:0003007 "heart morphogenesis" evidence=ISO]
          [GO:0005201 "extracellular matrix structural constituent"
          evidence=TAS] [GO:0005581 "collagen" evidence=ISO] [GO:0005585
          "collagen type II" evidence=ISO;TAS] [GO:0005604 "basement membrane"
          evidence=ISO] [GO:0005615 "extracellular space" evidence=ISO]
          [GO:0005737 "cytoplasm" evidence=ISO] [GO:0006029 "proteoglycan
          metabolic process" evidence=ISO] [GO:0007601 "visual perception"
          evidence=ISO] [GO:0007605 "sensory perception of sound" evidence=ISO]
          [GO:0010468 "regulation of gene expression" evidence=ISO] [GO:0030199
          "collagen fibril organization" evidence=ISO] [GO:0031012
          "extracellular matrix" evidence=ISO] [GO:0035108 "limb morphogenesis"
          evidence=ISO] [GO:0042472 "inner ear morphogenesis" evidence=ISO]
          [GO:0042802 "identical protein binding" evidence=ISO] [GO:0043066
          "negative regulation of apoptotic process" evidence=ISO] [GO:0046872
          "metal ion binding" evidence=IEA] [GO:0048407 "platelet-derived
          growth factor binding" evidence=ISO] [GO:0048705 "skeletal system
          morphogenesis" evidence=ISO] [GO:0048839 "inner ear development"
          evidence=ISO] [GO:0051216 "cartilage development" evidence=IEP;ISO]
          [GO:0060021 "palate development" evidence=ISO] [GO:0060272 "embryonic
          skeletal joint morphogenesis" evidence=ISO] [GO:0060348 "bone
          development" evidence=ISO] [GO:0060351 "cartilage development
          involved in endochondral bone morphogenesis" evidence=ISO]
          [GO:0071773 "cellular response to BMP stimulus" evidence=ISO]
          InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
          SMART:SM00038 RGD:2375 GO:GO:0046872 GO:GO:0051216 InterPro:IPR008160
          Pfam:PF01391 eggNOG:NOG12793 GO:GO:0005201 HOGENOM:HOG000085654
          HOVERGEN:HBG004933 KO:K06236 CTD:1280 Reactome:REACT_133391
          GO:GO:0005585 EMBL:L48440 EMBL:K02804 EMBL:M10613 EMBL:X79816
          IPI:IPI00394380 PIR:A05152 PIR:I60384 RefSeq:NP_037061.1
          UniGene:Rn.10124 IntAct:P05539 STRING:P05539 PRIDE:P05539
          GeneID:25412 KEGG:rno:25412 UCSC:RGD:2375 NextBio:606543
          ArrayExpress:P05539 Genevestigator:P05539
          GermOnline:ENSRNOG00000022282 Uniprot:P05539
        Length = 1419

 Score = 126 (49.4 bits), Expect = 0.00039, P = 0.00039
 Identities = 89/301 (29%), Positives = 117/301 (38%)

Query:   235 RAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPP-PSATTA--GVVGA-GPNT 290
             R A G   G  G+  +     P G    + G G+    GPP P+      G VG  GP+ 
Sbjct:   687 RGAAG-IAGPKGDRGDVGEKGPEGAPGKDGGRGLTGPIGPPGPAGANGEKGEVGPPGPSG 745

Query:   291 STSAYAAT----QSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPA 343
             ST A  A     ++G P  A +  P G  G   +KG  G    K  +  P  +GPS  P 
Sbjct:   746 STGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGDQGEAGQKGDAGAPGPQGPSGAPG 805

Query:   344 -KGP-GYDPTKGP-GYDAQKGSN-YDAQRG----PNYDIHRGPSYDPQRGLGYDMQRGPN 395
              +GP G    KG  G     G+  +    G    P  + + GP+  P    G D  +G  
Sbjct:   806 PQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGPPGSNGNPGPA-GPPGPAGKDGPKGAR 864

Query:   396 YDM----QRG-PGYETQR-VPGYDVQRG---PV-YEAQRAPSYIP-QRG-PGYDLQRGQ- 442
              D     + G PG +     PG   + G   P   +    P  +  QRG  G   QRG+ 
Sbjct:   865 GDTGAPGRAGDPGLQGPAGAPGEKGEPGDDGPSGSDGPPGPQGLAGQRGIVGLPGQRGER 924

Query:   443 GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPA 502
             G+     PS +P +  G  GA     P G V PP    P G    P R GS     G P 
Sbjct:   925 GFPGLPGPSGEPGK-QGAPGASGDRGPPGPVGPPGLTGPAGE---PGREGS-PGADGPPG 979

Query:   503 R 503
             R
Sbjct:   980 R 980

 Score = 125 (49.1 bits), Expect = 0.00049, P = 0.00049
 Identities = 87/281 (30%), Positives = 99/281 (35%)

Query:   237 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 294
             ADG  G      E    G   G    +   G P   GP       G  GA GP  +T   
Sbjct:   773 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 831

Query:   295 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 350
              AA + G P       P GP      GP G D  K    D    G + DP  +GP   P 
Sbjct:   832 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 886

Query:   351 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 406
              KG PG D   GS  D   GP     +G +   QRG+ G   QRG   +    GP  E  
Sbjct:   887 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 937

Query:   407 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 462
              Q  PG    RGP           P   PG +   G      R  A      RG TG  G
Sbjct:   938 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 997

Query:   463 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 502
             AP    P G  P P    P G       +G+  P G   PA
Sbjct:   998 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1035


>UNIPROTKB|E7ENY8
            symbol:COL3A1 "Collagen alpha-1(III) chain" species:9606
            "Homo sapiens" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0005581 InterPro:IPR008160
            Pfam:PF01391 GO:GO:0005201 EMBL:AC066694 HGNC:HGNC:2201
            ChiTaRS:COL3A1 IPI:IPI00981037 PDB:4GYX PDBsum:4GYX
            ProteinModelPortal:E7ENY8 SMR:E7ENY8 PRIDE:E7ENY8
            Ensembl:ENST00000317840 ArrayExpress:E7ENY8 Bgee:E7ENY8
            Uniprot:E7ENY8
        Length = 1163

 Score = 125 (49.1 bits), Expect = 0.00039, P = 0.00039
 Identities = 82/286 (28%), Positives = 103/286 (36%)

Query:   231 NVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP- 288
             +V    A G   G  G +       P G + +    G P   GPP     AG  G  GP 
Sbjct:   159 DVKSGVAVGGLAGYPGPAGPPGPPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPSGPPGPP 218

Query:   289 ---NTSTSAYAATQSGTPMRAAY-DIPRGPGYEASKG-PGYDASKAP-SYDPTKGPSYDP 342
                  S  A    +SG P R     +P  PG +   G PG+   K    +D   G   + 
Sbjct:   219 GAIGPSGPAGKDGESGRPGRPGERGLPGPPGIKGPAGIPGFPGMKGHRGFDGRNGEKGET 278

Query:   343 AKGPGYDPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQR 400
                PG     G PG +   G      RG   +  R P      G  G D  RG   D Q 
Sbjct:   279 G-APGLKGENGLPGENGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQP 332

Query:   401 GP-GYE-TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGT 458
             GP G   T   PG    +G V  A    S      PG   QRG+      A +  P    
Sbjct:   333 GPPGPPGTAGFPGSPGAKGEVGPAGSPGS---NGAPG---QRGEPGPQGHAGAQGPPGPP 386

Query:   459 GFDGAPRGAAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RGG 499
             G +G+P G    G  P  +   P   G+  PP  +G+ G P  RGG
Sbjct:   387 GINGSPGGKGEMG--PAGIPGAPGLMGARGPPGPAGANGAPGLRGG 430


>WB|WBGene00001734
            symbol:grl-25 species:6239 "Caenorhabditis elegans"
            [GO:0016021 "integral to membrane" evidence=IEA] [GO:0040010
            "positive regulation of growth rate" evidence=IMP] [GO:0009792
            "embryo development ending in birth or egg hatching" evidence=IMP]
            [GO:0000003 "reproduction" evidence=IMP] GO:GO:0009792
            GO:GO:0040010 GO:GO:0000003 EMBL:Z11126
            GeneTree:ENSGT00570000079107 EMBL:Z12018 RefSeq:NP_001023025.1
            ProteinModelPortal:G5EDQ6 EnsemblMetazoa:ZK643.8 GeneID:176265
            KEGG:cel:CELE_ZK643.8 CTD:176265 WormBase:ZK643.8 OMA:QYLGAYA
            NextBio:891834 Uniprot:G5EDQ6
        Length = 774

 Score = 123 (48.4 bits), Expect = 0.00039, P = 0.00039
 Identities = 71/279 (25%), Positives = 102/279 (36%)

Query:   236 AADGSYGGATGNSENETSGRPV----GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTS 291
             A+  S GG +G  E+ +SG       G ++   G G   G     S++++G    G ++S
Sbjct:   342 ASSSSGGGYSGGGESSSSGGSSYSSGGDSSSSSGGGYSSGGDSSSSSSSSGGYSGGSDSS 401

Query:   292 TSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGP----SYDPAKGPG 347
             +S+  ++ SG       D     G E+S   GY  S +   + + G     S +PA  P 
Sbjct:   402 SSS--SSSSGGYSSGGGDAGASSGGESSSAGGYSGSSSSGGEASSGGYSGGSSEPAPAPE 459

Query:   348 YDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQ 407
               P    GY    GS    +  P       PS     G     +  P        G E  
Sbjct:   460 AAPASSGGYSG--GSEAAPEAAP-----AAPS-GGYSGSEAAPEAAPAAPSGGYSGSEAA 511

Query:   408 RVPGYDVQRGPVYEAQRAPSYIPQR-GPGYDLQRGQGYDMRRAPSYDPSRG-TGFDGAPR 465
                      G    ++ AP   P     GY    G       AP+  PS G +G + AP 
Sbjct:   512 PEAAPAAPSGGYSGSEAAPEAAPAAPSGGYS---GSEAAPEAAPAA-PSGGYSGSEAAPE 567

Query:   466 GA--APHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPA 502
              A  AP G      ++ P  +A  PA S  G   GG  A
Sbjct:   568 AAPAAPSGGYSGSESSAP--AAPEPAPSSGGYSGGGGDA 604


>RGD|61817 [details] [associations]
            symbol:Col1a1 "collagen, type I, alpha 1" species:10116 "Rattus
           norvegicus" [GO:0001501 "skeletal system development" evidence=ISO]
           [GO:0001503 "ossification" evidence=IEP] [GO:0001568 "blood vessel
           development" evidence=IEA;ISO] [GO:0001649 "osteoblast
           differentiation" evidence=IEA] [GO:0001957 "intramembranous
           ossification" evidence=IEA;ISO] [GO:0001958 "endochondral
           ossification" evidence=IEA;ISO] [GO:0003674 "molecular_function"
           evidence=ND] [GO:0005201 "extracellular matrix structural
           constituent" evidence=IEA;ISO] [GO:0005578 "proteinaceous
           extracellular matrix" evidence=ISO] [GO:0005581 "collagen"
           evidence=ISO] [GO:0005584 "collagen type I" evidence=IEA;ISO]
           [GO:0005615 "extracellular space" evidence=ISO;IDA] [GO:0005737
           "cytoplasm" evidence=IEA;ISO] [GO:0007584 "response to nutrient"
           evidence=IEP] [GO:0007601 "visual perception" evidence=IEA;ISO]
           [GO:0007605 "sensory perception of sound" evidence=IEA;ISO]
           [GO:0009612 "response to mechanical stimulus" evidence=IEP]
           [GO:0010035 "response to inorganic substance" evidence=IEP]
           [GO:0010718 "positive regulation of epithelial to mesenchymal
           transition" evidence=IEA;ISO] [GO:0010812 "negative regulation of
           cell-substrate adhesion" evidence=IEA;ISO] [GO:0015031 "protein
           transport" evidence=IEA;ISO] [GO:0030199 "collagen fibril
           organization" evidence=IEA;ISO] [GO:0030335 "positive regulation of
           cell migration" evidence=IEA;ISO] [GO:0031012 "extracellular matrix"
           evidence=ISO] [GO:0031960 "response to corticosteroid stimulus"
           evidence=IEP] [GO:0032964 "collagen biosynthetic process"
           evidence=IEA;ISO] [GO:0034504 "protein localization to nucleus"
           evidence=IEA;ISO] [GO:0034505 "tooth mineralization"
           evidence=IEA;ISO] [GO:0042060 "wound healing" evidence=IMP]
           [GO:0042542 "response to hydrogen peroxide" evidence=IEP]
           [GO:0042802 "identical protein binding" evidence=IEA;ISO]
           [GO:0043434 "response to peptide hormone stimulus" evidence=IEP]
           [GO:0043588 "skin development" evidence=ISO] [GO:0043589 "skin
           morphogenesis" evidence=IEA;ISO] [GO:0045893 "positive regulation of
           transcription, DNA-dependent" evidence=IEA;ISO] [GO:0046872 "metal
           ion binding" evidence=IEA] [GO:0048407 "platelet-derived growth
           factor binding" evidence=IEA;ISO] [GO:0048705 "skeletal system
           morphogenesis" evidence=ISO] [GO:0048706 "embryonic skeletal system
           development" evidence=IEA;ISO] [GO:0051591 "response to cAMP"
           evidence=IEP] [GO:0060325 "face morphogenesis" evidence=IEA;ISO]
           [GO:0060346 "bone trabecula formation" evidence=IEA;ISO] [GO:0060351
           "cartilage development involved in endochondral bone morphogenesis"
           evidence=IEA;ISO] [GO:0070208 "protein heterotrimerization"
           evidence=IEA;ISO] [GO:0071230 "cellular response to amino acid
           stimulus" evidence=IEA;ISO] [GO:0071260 "cellular response to
           mechanical stimulus" evidence=IEA] [GO:0071300 "cellular response to
           retinoic acid" evidence=IEP] [GO:0071363 "cellular response to
           growth factor stimulus" evidence=IEP] [GO:0071560 "cellular response
           to transforming growth factor beta stimulus" evidence=IEP]
           [GO:0090263 "positive regulation of canonical Wnt receptor signaling
           pathway" evidence=IEA;ISO] InterPro:IPR000885 InterPro:IPR001007
           Pfam:PF00093 Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208
           PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038 SMART:SM00214
           RGD:61817 GO:GO:0005737 GO:GO:0045893 GO:GO:0005615 GO:GO:0009612
           GO:GO:0071560 GO:GO:0046872 GO:GO:0015031 GO:GO:0007601
           GO:GO:0071300 GO:GO:0043434 GO:GO:0030199 GO:GO:0007584
           GO:GO:0010035 GO:GO:0007605 GO:GO:0010718 GO:GO:0030335
           GO:GO:0042542 GO:GO:0070208 InterPro:IPR008160 Pfam:PF01391
           eggNOG:NOG12793 GO:GO:0042060 GO:GO:0071260 GO:GO:0001568
           GO:GO:0001649 GO:GO:0051591 GO:GO:0034505 GO:GO:0090263
           GO:GO:0001503 GO:GO:0010812 GO:GO:0060325 EMBL:CH473948
           GO:GO:0032964 GO:GO:0071230 GO:GO:0048706 GO:GO:0001957
           GO:GO:0034504 GO:GO:0001958 GO:GO:0060351 GO:GO:0005201
           GO:GO:0043589 CTD:1277 GeneTree:ENSGT00660000095287
           HOGENOM:HOG000085654 HOVERGEN:HBG004933 KO:K06236 OMA:VAYMDQQ
           GO:GO:0005584 GO:GO:0060346 GO:GO:0031960 EMBL:Z78279 EMBL:BC133728
           EMBL:M11432 IPI:IPI00188909 PIR:A90559 RefSeq:NP_445756.1
           UniGene:Rn.2953 PDB:3HQV PDB:3HR2 PDBsum:3HQV PDBsum:3HR2
           ProteinModelPortal:P02454 IntAct:P02454 STRING:P02454 PRIDE:P02454
           Ensembl:ENSRNOT00000005311 GeneID:29393 KEGG:rno:29393
           UCSC:RGD:61817 InParanoid:A3KNA1 Reactome:REACT_150387
           EvolutionaryTrace:P02454 NextBio:609017 ArrayExpress:P02454
           Genevestigator:P02454 GermOnline:ENSRNOG00000003897 Uniprot:P02454
        Length = 1453

 Score = 126 (49.4 bits), Expect = 0.00040, P = 0.00040
 Identities = 88/285 (30%), Positives = 108/285 (37%)

Query:   237 ADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNTSTS 293
             ADG  G  G  G++  +    P G  A   G   P G+ G P    + G  G  P  +  
Sbjct:   808 ADGQPGAKGEPGDTGVKGDAGPPGP-AGPAGPPGPIGNVGAPGPKGSRGAAGP-PGATGF 865

Query:   294 AYAATQSGTPMRAAYDIPRGP----GYEASKGPGYDASKA--PSYDPTKGPSYDPA--KG 345
               AA + G P  +    P GP    G E  KGP  +   A  P      GP   PA  KG
Sbjct:   866 PGAAGRVGPPGPSGNAGPPGPPGPVGKEGGKGPRGETGPAGRPGEVGPPGPP-GPAGEKG 924

Query:   346 -PGYD-P-----TKGP-GYDAQKGS-NYDAQRGPN-YDIHRGPSYDP-QRG-LGYDMQRG 393
              PG D P     T GP G   Q+G      QRG   +    GPS +P ++G  G   +RG
Sbjct:   925 SPGADGPAGSPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQGPSGASGERG 984

Query:   394 PNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYD 453
             P   M  GP       PG     GP  E+ R  S   +  PG D   G   D        
Sbjct:   985 PPGPM--GP-------PGL---AGPPGESGREGSPGAEGSPGRDGAPGAKGDRGETGPAG 1032

Query:   454 PSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 498
             P    G  GAP    P G+        P G A P   +G+  P G
Sbjct:  1033 PPGAPGAPGAPGPVGPAGKNGDRGETGPAGPAGPIGPAGARGPAG 1077


>UNIPROTKB|Q02388 [details] [associations]
            symbol:COL7A1 "Collagen alpha-1(VII) chain" species:9606
            "Homo sapiens" [GO:0004867 "serine-type endopeptidase inhibitor
            activity" evidence=IEA] [GO:0007155 "cell adhesion" evidence=IEA]
            [GO:0005604 "basement membrane" evidence=IEA] [GO:0005515 "protein
            binding" evidence=IPI] [GO:0005590 "collagen type VII"
            evidence=TAS] [GO:0008544 "epidermis development" evidence=TAS]
            [GO:0005576 "extracellular region" evidence=TAS] [GO:0005788
            "endoplasmic reticulum lumen" evidence=TAS] [GO:0030198
            "extracellular matrix organization" evidence=TAS] [GO:0031012
            "extracellular matrix" evidence=ISS] InterPro:IPR002035
            InterPro:IPR002223 InterPro:IPR003961 Pfam:PF00014 Pfam:PF00041
            Pfam:PF00092 PRINTS:PR00759 PROSITE:PS50234 PROSITE:PS50279
            PROSITE:PS50853 SMART:SM00060 SMART:SM00327 Reactome:REACT_118779
            Gene3D:2.60.40.10 InterPro:IPR013783 GO:GO:0004867 SUPFAM:SSF49265
            GO:GO:0030198 GO:GO:0007155 Gene3D:4.10.410.10 InterPro:IPR020901
            SUPFAM:SSF57362 PROSITE:PS00280 GO:GO:0005788 InterPro:IPR008160
            Pfam:PF01391 eggNOG:NOG12793 GO:GO:0008544 GO:GO:0005604
            EMBL:L23982 EMBL:L02870 EMBL:D13694 EMBL:M96984 EMBL:S51236
            EMBL:M65158 EMBL:L06862 IPI:IPI00025418 IPI:IPI00795118 PIR:A54849
            RefSeq:NP_000085.1 UniGene:Hs.476218 ProteinModelPortal:Q02388
            SMR:Q02388 IntAct:Q02388 MINT:MINT-1390694 STRING:Q02388
            MEROPS:I02.967 PhosphoSite:Q02388 DMDM:1345650 PaxDb:Q02388
            PRIDE:Q02388 Ensembl:ENST00000328333 Ensembl:ENST00000454817
            GeneID:1294 KEGG:hsa:1294 UCSC:uc003ctz.2 CTD:1294
            GeneCards:GC03M048576 HGNC:HGNC:2214 HPA:CAB016357 MIM:120120
            MIM:131705 MIM:131750 MIM:131850 MIM:132000 MIM:226600 MIM:604129
            MIM:607523 neXtProt:NX_Q02388 Orphanet:158673 Orphanet:79407
            Orphanet:216989 Orphanet:79408 Orphanet:89842 Orphanet:89841
            Orphanet:79409 Orphanet:89839 Orphanet:158676 Orphanet:79410
            Orphanet:89843 Orphanet:79411 PharmGKB:PA26730 HOGENOM:HOG000111866
            HOVERGEN:HBG051053 InParanoid:Q02388 KO:K16628 OMA:RRVCTTA
            OrthoDB:EOG4J117P PhylomeDB:Q02388 ChiTaRS:COL7A1 GenomeRNAi:1294
            NextBio:5251 ArrayExpress:Q02388 Bgee:Q02388 CleanEx:HS_COL7A1
            Genevestigator:Q02388 GermOnline:ENSG00000114270 GO:GO:0005590
            Uniprot:Q02388
        Length = 2944

 Score = 129 (50.5 bits), Expect = 0.00041, P = 0.00041
 Identities = 83/269 (30%), Positives = 99/269 (36%)

Query:   256 PVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPG 315
             P G        G P   GPP SAT  G  G  P       A  + G+P RA    P  PG
Sbjct:  1270 PPGDPGLPGRTGAPGPQGPPGSATAKGERGF-PG------ADGRPGSPGRAGN--PGTPG 1320

Query:   316 YEASKG-PGYDASKA-PSYDPTKGPSYDPAKGPGYD-PTKGPGYDAQKG----SNYDAQR 368
                 KG PG    +  P     +GP  +P   PG     +GPG   +KG    S     R
Sbjct:  1321 APGLKGSPGLPGPRGDPGERGPRGPKGEPG-APGQVIGGEGPGLPGRKGDPGPSGPPGPR 1379

Query:   369 GPNYDIHRGPSYDPQRGLGYDMQRGPNYDM-QRGP-GY-ETQRVPGYDVQRG-PVYEAQR 424
             GP  D   GP   P  GL     +G   D  +RGP G  E    PG   + G P      
Sbjct:  1380 GPLGD--PGPRGPP--GLPGTAMKGDKGDRGERGPPGPGEGGIAPG---EPGLPGLPGSP 1432

Query:   425 APSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGA----APHGQ--VPPPLN 478
              P   P   PG   ++G   D   AP      G+  +  PRG      P G    P PL 
Sbjct:  1433 GPQG-PVGPPGKKGEKGDSED--GAPGLPGQPGSPGEQGPRGPPGAIGPKGDRGFPGPLG 1489

Query:   479 NV-PYGSATPPARSGS-GQPR-GGNPARR 504
                  G   PP  +GS G P   G P  +
Sbjct:  1490 EAGEKGERGPPGPAGSRGLPGVAGRPGAK 1518


>UNIPROTKB|F1LN37 [details] [associations]
            symbol:Col2a1 "Collagen alpha-1(II) chain" species:10116
            "Rattus norvegicus" [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 RGD:2375 GO:GO:0005737 GO:GO:0043066
            GO:GO:0005615 GO:GO:0003007 GO:GO:0007601 GO:GO:0030199
            GO:GO:0007417 GO:GO:0042472 GO:GO:0001894 GO:GO:0007605
            GO:GO:0071773 InterPro:IPR008160 Pfam:PF01391 GO:GO:0071599
            GO:GO:0005604 GO:GO:0001502 GO:GO:0060021 GO:GO:0002062
            GO:GO:0010468 GO:GO:0060272 GO:GO:0006029 GO:GO:0001958
            GO:GO:0060351 GO:GO:0005201 GO:GO:0005585 GO:GO:0060174
            GO:GO:0030903 IPI:IPI00388575 Ensembl:ENSRNOT00000037840
            ArrayExpress:F1LN37 Uniprot:F1LN37
        Length = 1487

 Score = 126 (49.4 bits), Expect = 0.00041, P = 0.00041
 Identities = 89/301 (29%), Positives = 117/301 (38%)

Query:   235 RAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPP-PSATTA--GVVGA-GPNT 290
             R A G   G  G+  +     P G    + G G+    GPP P+      G VG  GP+ 
Sbjct:   755 RGAAG-IAGPKGDRGDVGEKGPEGAPGKDGGRGLTGPIGPPGPAGANGEKGEVGPPGPSG 813

Query:   291 STSAYAAT----QSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPA 343
             ST A  A     ++G P  A +  P G  G   +KG  G    K  +  P  +GPS  P 
Sbjct:   814 STGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGDQGEAGQKGDAGAPGPQGPSGAPG 873

Query:   344 -KGP-GYDPTKGP-GYDAQKGSN-YDAQRG----PNYDIHRGPSYDPQRGLGYDMQRGPN 395
              +GP G    KG  G     G+  +    G    P  + + GP+  P    G D  +G  
Sbjct:   874 PQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGPPGSNGNPGPA-GPPGPAGKDGPKGAR 932

Query:   396 YDM----QRG-PGYETQR-VPGYDVQRG---PV-YEAQRAPSYIP-QRG-PGYDLQRGQ- 442
              D     + G PG +     PG   + G   P   +    P  +  QRG  G   QRG+ 
Sbjct:   933 GDTGAPGRAGDPGLQGPAGAPGEKGEPGDDGPSGSDGPPGPQGLAGQRGIVGLPGQRGER 992

Query:   443 GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPA 502
             G+     PS +P +  G  GA     P G V PP    P G    P R GS     G P 
Sbjct:   993 GFPGLPGPSGEPGK-QGAPGASGDRGPPGPVGPPGLTGPAGE---PGREGS-PGADGPPG 1047

Query:   503 R 503
             R
Sbjct:  1048 R 1048

 Score = 125 (49.1 bits), Expect = 0.00052, P = 0.00052
 Identities = 87/281 (30%), Positives = 99/281 (35%)

Query:   237 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 294
             ADG  G      E    G   G    +   G P   GP       G  GA GP  +T   
Sbjct:   841 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 899

Query:   295 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 350
              AA + G P       P GP      GP G D  K    D    G + DP  +GP   P 
Sbjct:   900 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 954

Query:   351 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 406
              KG PG D   GS  D   GP     +G +   QRG+ G   QRG   +    GP  E  
Sbjct:   955 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 1005

Query:   407 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 462
              Q  PG    RGP           P   PG +   G      R  A      RG TG  G
Sbjct:  1006 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 1065

Query:   463 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 502
             AP    P G  P P    P G       +G+  P G   PA
Sbjct:  1066 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1103

 Score = 123 (48.4 bits), Expect = 0.00086, P = 0.00086
 Identities = 89/297 (29%), Positives = 110/297 (37%)

Query:   230 PNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSAT---TAGVVGA 286
             P  DR   D    GA G    +  G P G        G P   GPP        A + G 
Sbjct:   132 PRGDR--GDKGERGAPGPRGRD--GEP-GTPGNPGPPGPPGPPGPPGLGGGNFAAQMAGG 186

Query:   287 GPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPA 343
                 +  A      G PM      PRGP G   + GP G+  +     +P   GP   P 
Sbjct:   187 FDEKAGGAQMGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GPR 242

Query:   344 KGPGYDPTKGPGYDAQKGSNYDA-QRG-PNYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQ 399
               PG  P   PG D + G    A +RG P     RG    P  GL G    RG P  D  
Sbjct:   243 GPPG--PAGKPGDDGEAGKPGKAGERGLPGPQGARGFPGTP--GLPGVKGHRGYPGLDGA 298

Query:   400 RG----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS--- 451
             +G    PG + +   PG +   GP+   +  P    + GP       +G D +  P+   
Sbjct:   299 KGEAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPP 356

Query:   452 --YDPSRGTGFDGAP--RG-AAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNP 501
                 P+ G GF GAP  +G A P G   P       GS   P   GS  P G  GNP
Sbjct:   357 GPVGPAGGPGFPGAPGAKGEAGPTGARGP---EGAQGSRGEPGNPGSPGPAGASGNP 410


>ZFIN|ZDB-GENE-980526-192 [details] [associations]
            symbol:col2a1a "collagen type II, alpha-1a"
            species:7955 "Danio rerio" [GO:0005581 "collagen" evidence=IEA;ISS]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA] [GO:0030903 "notochord development" evidence=IGI]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 ZFIN:ZDB-GENE-980526-192 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201
            GeneTree:ENSGT00660000095287 HOVERGEN:HBG004933 KO:K06236
            GO:GO:0030903 EMBL:BX927144 EMBL:DQ335127 IPI:IPI00505438
            RefSeq:NP_571367.1 UniGene:Dr.75057 SMR:Q2LDA1 STRING:Q2LDA1
            Ensembl:ENSDART00000100234 GeneID:562496 KEGG:dre:562496 CTD:562496
            InParanoid:Q2LDA1 NextBio:20884441 Uniprot:Q2LDA1
        Length = 1491

 Score = 126 (49.4 bits), Expect = 0.00041, P = 0.00041
 Identities = 83/270 (30%), Positives = 96/270 (35%)

Query:   243 GATGNSENETSGRPVGQNAYEDGYGVPQG-HGPPPSATTAGVVGA-GPNTSTSAYAATQ- 299
             GA G   N+      GQ   + G   PQG  G P      GV G  G   +  A  AT  
Sbjct:   844 GADGQPGNKGEQGESGQKG-DSGAPGPQGPSGAPGPVGPTGVTGPKGARGAQGAPGATGF 902

Query:   300 SGTPMRAAYDIPRG-PGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGP-GYDPTKGP 354
              G   R     P G PG     GP G D  K    D    G + D   +GP G    KG 
Sbjct:   903 PGAAGRVGPPGPNGNPGAAGPAGPSGKDGPKGVRGDAGPPGRAGDAGLRGPPGAPGEKGE 962

Query:   355 -GYDAQKGSNYDAQRGP-NYDIHRGPSYDP-QRG-LGYDMQRGPNYD--MQRGPGYETQR 408
              G D   G   D   GP      RG    P QRG  G+    GP+ +   Q  PG    R
Sbjct:   963 AGEDGPPGP--DGPSGPAGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPGGSGDR 1020

Query:   409 VP----GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG-QGYDMRRAPSYDPSRGTGFDGA 463
              P    G     GP  E  R  +      PG D   G +G      P   P    G  GA
Sbjct:  1021 GPPGPVGPPGLTGPAGETGREGNPGSDGPPGRDGAAGVKGERGNTGPIGAPG-APGAPGA 1079

Query:   464 PRGAAPHGQVPPPLNNVPYGSATPPARSGS 493
             P    P G+      N P G A PP  +G+
Sbjct:  1080 PGSVGPIGKQGDRGENGPQGPAGPPGPAGA 1109


>UNIPROTKB|A7E348 [details] [associations]
            symbol:PYGO2 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0060070 "canonical Wnt receptor signaling pathway"
            evidence=IEA] [GO:0060021 "palate development" evidence=IEA]
            [GO:0051569 "regulation of histone H3-K4 methylation" evidence=IEA]
            [GO:0048589 "developmental growth" evidence=IEA] [GO:0042393
            "histone binding" evidence=IEA] [GO:0033599 "regulation of mammary
            gland epithelial cell proliferation" evidence=IEA] [GO:0030879
            "mammary gland development" evidence=IEA] [GO:0009791
            "post-embryonic development" evidence=IEA] [GO:0007420 "brain
            development" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            [GO:0002088 "lens development in camera-type eye" evidence=IEA]
            [GO:0001822 "kidney development" evidence=IEA] [GO:0001701 "in
            utero embryonic development" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] InterPro:IPR001965 InterPro:IPR019787
            Pfam:PF00628 PROSITE:PS50016 SMART:SM00249 GO:GO:0005634
            GO:GO:0007420 GO:GO:0046872 GO:GO:0008270 GO:GO:0001701
            GO:GO:0009791 GO:GO:0001822 Gene3D:3.30.40.10 InterPro:IPR011011
            InterPro:IPR013083 SUPFAM:SSF57903 GO:GO:0048589 InterPro:IPR019786
            PROSITE:PS01359 GO:GO:0060021 GO:GO:0060070 GO:GO:0030879
            GO:GO:0033599 GO:GO:0051569 GO:GO:0002088 eggNOG:NOG72798
            HOGENOM:HOG000001580 HOVERGEN:HBG053774
            GeneTree:ENSGT00530000063948 CTD:90780 OMA:PGLVYPC
            OrthoDB:EOG4QZ7MB EMBL:DAAA02007156 EMBL:BC151715 IPI:IPI00866934
            RefSeq:NP_001095712.1 UniGene:Bt.102068 SMR:A7E348
            Ensembl:ENSBTAT00000005670 GeneID:540401 KEGG:bta:540401
            InParanoid:A7E348 NextBio:20878610 Uniprot:A7E348
        Length = 405

 Score = 119 (46.9 bits), Expect = 0.00043, P = 0.00043
 Identities = 70/263 (26%), Positives = 99/263 (37%)

Query:   257 VGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGY 316
             V  N +ED +G P+  G  P    + V   G           Q G     A  +P G G 
Sbjct:    73 VASNPFEDDFGAPKVGGAAPPFLGSPVPFGG--------FRVQGGM----AGQVPPGYGT 120

Query:   317 EASKGPGYDASKAPSYDPTK-GPSYD-PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDI 374
                 GP     + P + P+  GP+++ P +GPGY P     + +Q    ++   G N+  
Sbjct:   121 GGGGGPQPLRRQPPPFPPSPMGPAFNMPPQGPGYPPPGNMNFPSQP---FNQPLGQNFSP 177

Query:   375 HRGPSYD-PQRGLGY----DMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYE--AQRAPS 427
               G     P  G G      M + P  ++  GP    QR        GP  +   Q  PS
Sbjct:   178 PGGQMMPGPVGGFGPMISPTMGQPPRGEL--GPPSLPQRFAQPGAPFGPSLQRPGQGLPS 235

Query:   428 YIPQRGP--GYDLQ-RGQGYDMRRAPSYDPSRGTGFDGAPRGAAP----HGQVPP-PLNN 479
               P   P  G D    G G +    P  +P   T F   P   +P    +G  P  P N+
Sbjct:   236 LPPNTSPFPGPDPGFPGPGGEDGGKP-LNPPAATAFPQEPHSGSPAAAVNGNQPSFPPNS 294

Query:   480 VPYGSATPPARSGS--GQPRGGN 500
                G  TP A S +  G+  GG+
Sbjct:   295 SGRGGGTPDANSLAPPGKAGGGS 317


>UNIPROTKB|G4MYW7 [details] [associations]
            symbol:MGG_10829 "Uncharacterized protein" species:242507
            "Magnaporthe oryzae 70-15" [GO:0005575 "cellular_component"
            evidence=ND] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR000571 PROSITE:PS50103 GO:GO:0008270 GO:GO:0003676
            EMBL:CM001232 InterPro:IPR019496 Pfam:PF10453 RefSeq:XP_003713435.1
            EnsemblFungi:MGG_10829T0 GeneID:2676344 KEGG:mgr:MGG_10829
            Uniprot:G4MYW7
        Length = 600

 Score = 121 (47.7 bits), Expect = 0.00046, P = 0.00046
 Identities = 61/238 (25%), Positives = 82/238 (34%)

Query:   271 GHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYD--IPRGPGYEASKGPGYDASK 328
             G+GPPP        GA P      Y   Q        +    PRG G  A  G G     
Sbjct:     5 GYGPPPPPPA----GAPPQAYQQQYGQYQQPPATGHVHGGHAPRG-GRGAHSGRGDFHGS 59

Query:   329 APSYDPTKGPSYDPA-KGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLG 387
              PSY     P   P+  GP + P   P +      NY     P +  ++ P Y  Q+   
Sbjct:    60 PPSYPYNNQPQPPPSYTGPHHAPP--PPHTPLAPQNYHPNYAPQH--YQQPQYAHQQQYP 115

Query:   388 YDMQRGPNYDMQRGPGYETQRVPGY-DVQRGPVYEAQRAPSYIPQR--GPG-YDLQRGQG 443
             +   + P    Q+ P Y     P Y      P ++    P+    +  GP  Y   RG+G
Sbjct:   116 HQQPQQPPQPPQQAP-Y-AHHYPSYPQAPNAPPHQPWGGPATAGHQPAGPAHYGSGRGRG 173

Query:   444 YDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNP 501
                     + P+   G      G    G  PP L  V   +  PP     G P+GG P
Sbjct:   174 GHQGDRGGHKPAAAMG-PPLRMGFDNRGPEPPAL--VSSATVYPP--QPFGPPQGGAP 226


>UNIPROTKB|Q5T171 [details] [associations]
            symbol:PYGO2 "Pygopus homolog 2" species:9606 "Homo
            sapiens" [GO:0008270 "zinc ion binding" evidence=IEA] [GO:0001701
            "in utero embryonic development" evidence=IEA] [GO:0001822 "kidney
            development" evidence=IEA] [GO:0002088 "lens development in
            camera-type eye" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            [GO:0007420 "brain development" evidence=IEA] [GO:0009791
            "post-embryonic development" evidence=IEA] [GO:0030879 "mammary
            gland development" evidence=IEA] [GO:0033599 "regulation of mammary
            gland epithelial cell proliferation" evidence=IEA] [GO:0042393
            "histone binding" evidence=IEA] [GO:0048589 "developmental growth"
            evidence=IEA] [GO:0051569 "regulation of histone H3-K4 methylation"
            evidence=IEA] [GO:0060021 "palate development" evidence=IEA]
            [GO:0060070 "canonical Wnt receptor signaling pathway"
            evidence=IEA] InterPro:IPR001965 InterPro:IPR019787 Pfam:PF00628
            PROSITE:PS50016 SMART:SM00249 GO:GO:0005634 GO:GO:0007420
            GO:GO:0046872 GO:GO:0008270 GO:GO:0001701 GO:GO:0009791
            GO:GO:0001822 EMBL:AL451085 Gene3D:3.30.40.10 InterPro:IPR011011
            InterPro:IPR013083 SUPFAM:SSF57903 GO:GO:0048589 InterPro:IPR019786
            PROSITE:PS01359 GO:GO:0060021 EMBL:CH471121 GO:GO:0060070
            GO:GO:0030879 GO:GO:0033599 GO:GO:0051569 GO:GO:0002088
            HOGENOM:HOG000001580 HOVERGEN:HBG053774 UniGene:Hs.533597
            HGNC:HGNC:30257 IPI:IPI00642524 SMR:Q5T171 STRING:Q5T171
            Ensembl:ENST00000368456 Uniprot:Q5T171
        Length = 369

 Score = 118 (46.6 bits), Expect = 0.00047, P = 0.00047
 Identities = 72/267 (26%), Positives = 101/267 (37%)

Query:   257 VGQNAYEDGYGVPQ-GHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPG 315
             V  N +ED +G P+ G   PP   +    G             Q G     A  +P  PG
Sbjct:    36 VASNPFEDDFGAPKVGVAAPPFLGSPVPFGG---------FRVQGGM----AGQVP--PG 80

Query:   316 YEASKGPGYDASKA--PSYDPTK-GPSYD-PAKGPGYDPTKGPGYDAQKGSNYDAQRGPN 371
             Y    G G    +   P + P   GP+++ P +GPGY P     + +Q    ++   G N
Sbjct:    81 YSTGGGGGPQPLRRQPPPFPPNPMGPAFNMPPQGPGYPPPGNMNFPSQP---FNQPLGQN 137

Query:   372 YDIHRGPSYD-PQRGLGY----DMQRGPNYDMQRGPGYETQRV--PGYDVQRGPVYE-AQ 423
             +    G     P  G G      M + P  ++  GP   +QR   PG      P+    Q
Sbjct:   138 FSPPSGQMMPGPVGGFGPMISPTMGQPPRAEL--GPPSLSQRFAQPGAPFGPSPLQRPGQ 195

Query:   424 RAPSYIPQRGP--GYDLQ-RGQGYDMRRAPSYDPSRGTGFDGAPRGAAP----HGQVPP- 475
               PS  P   P  G D    G G +    P  +P   T F   P   +P    +G  P  
Sbjct:   196 GLPSLPPNTSPFPGPDPGFPGPGGEDGGKP-LNPPASTAFPQEPHSGSPAAAVNGNQPSF 254

Query:   476 PLNNVPYGSATPPARSGS--GQPRGGN 500
             P N+   G  TP A S +  G+  GG+
Sbjct:   255 PPNSSGRGGGTPDANSLAPPGKAGGGS 281


>WB|WBGene00010306 [details] [associations]
            symbol:F59A2.6 species:6239 "Caenorhabditis elegans"
            [GO:0000042 "protein targeting to Golgi" evidence=IEA]
            InterPro:IPR000237 Pfam:PF01465 PROSITE:PS50913 SMART:SM00755
            EMBL:Z34801 GeneTree:ENSGT00700000104373 GO:GO:0000042
            Gene3D:1.10.220.60 SUPFAM:SSF101283 EMBL:Z66514 PIR:T22976
            RefSeq:NP_497706.1 UniGene:Cel.10377 HSSP:Q13439
            ProteinModelPortal:G5EEK2 SMR:G5EEK2 IntAct:G5EEK2
            EnsemblMetazoa:F59A2.6 GeneID:175445 KEGG:cel:CELE_F59A2.6
            CTD:175445 WormBase:F59A2.6 OMA:QTRKDID NextBio:888194
            Uniprot:G5EEK2
        Length = 1133

 Score = 124 (48.7 bits), Expect = 0.00049, P = 0.00049
 Identities = 51/194 (26%), Positives = 90/194 (46%)

Query:    51 MEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQMRN 110
             +E   +   +  QKL T ++ L A   T +    A   E+ +L   +   K ++  Q++N
Sbjct:   458 LENSNSETEILKQKLETLDKELQARQQTEK----ALTEEINVLTTSLAE-KEQQTAQIQN 512

Query:   111 LTEKIAKMEAELKT-AEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQRAH-TDVQ 168
             L  +I +ME E +   E VK++ Q++   AQ+   A E L A++ QL   L+       +
Sbjct:   513 LQTQIYQMEVEKEEKVELVKVQLQQA---AQSSSSAEEALRAEIEQLEAKLKAVEQAKAE 569

Query:   169 QIPALLSELESLRQEYHHCRGTYEYEK-KFYNDHLE-SLQVMEKNYITMATEVEKLRAEL 226
              + +LL+E E L+ + H   G  + EK +     L+ + Q        +  E+EKL A+L
Sbjct:   570 ALNSLLAEKEHLQAQLHQL-GVEKEEKLEMVKVQLQQAAQSSSSVEQALRAEIEKLEAKL 628

Query:   227 MNAPNVDRRAADGS 240
                    + A + S
Sbjct:   629 QEIEEEKKNALNAS 642


>UNIPROTKB|E1BT66 [details] [associations]
            symbol:TAF15 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0000166 "nucleotide binding" evidence=IEA] [GO:0003676
            "nucleic acid binding" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0005737
            "cytoplasm" evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358
            PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360 SMART:SM00547
            GO:GO:0005634 GO:GO:0005737 GO:GO:0000166 GO:GO:0008270
            Gene3D:3.30.70.330 GO:GO:0003676 GeneTree:ENSGT00530000063105
            OMA:YGNQGSQ EMBL:AADN02025953 EMBL:AADN02025954 IPI:IPI00575015
            ProteinModelPortal:E1BT66 Ensembl:ENSGALT00000003204 Uniprot:E1BT66
        Length = 443

 Score = 119 (46.9 bits), Expect = 0.00049, P = 0.00049
 Identities = 70/232 (30%), Positives = 89/232 (38%)

Query:   248 SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAA 307
             S++ + G+  GQ +Y   YG     G      T G  G G +   S+Y   QS       
Sbjct:     3 SDSGSYGQSGGQQSYSS-YG---NQGNQSYGQTQGYSGYGQSGDNSSYG--QSYGNYHGN 56

Query:   308 YDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP--GYDAQKGSNYD 365
             Y      GY      GYD     SYD     SY+          KG   G      S+YD
Sbjct:    57 YG-QNQTGY-GQDSHGYDDES--SYDNQNQSSYNQQSYSNQGQQKGSSRGGRGSYSSSYD 112

Query:   366 AQRGPNYDIHRGPSYDPQRGLG----YDMQRGPNYDMQRGPGYETQRVPGYDVQRGPV-Y 420
              Q G  Y  H+G SYD Q G G    YD + G N   Q   G+  Q    Y  Q+G   +
Sbjct:   113 QQSG--YG-HQG-SYDQQSGYGHQSSYDQKSGYNQH-QSSYGHSQQ---SYQSQKGSYSH 164

Query:   421 EAQ---RAPSYIPQRGPGYDLQRGQGYDMRRAPSYD-PSRG--TGFDGAPRG 466
              +Q   R  S   +   GY   +G G    R   YD   RG  +G+ G  RG
Sbjct:   165 NSQDDRREKSRYGEDNRGYGGSQGGG----RG-GYDMDGRGHMSGYSGGDRG 211


>UNIPROTKB|F1RIA5 [details] [associations]
            symbol:VPS37C "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR009851
            Pfam:PF07200 PROSITE:PS51314 KO:K12185 GeneTree:ENSGT00530000063123
            OMA:VERCQEQ EMBL:CU914270 RefSeq:XP_003122720.1
            Ensembl:ENSSSCT00000032280 GeneID:100511491 KEGG:ssc:100511491
            Uniprot:F1RIA5
        Length = 358

 Score = 91 (37.1 bits), Expect = 0.00050, Sum P(2) = 0.00050
 Identities = 38/117 (32%), Positives = 48/117 (41%)

Query:   273 GPP-PSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDA--SK 328
             GPP PSA        GP  S     + Q  TP R  Y + P G     + GPGY     +
Sbjct:   251 GPPYPSAQP------GPRASAGYSWSPQRSTPPRPGYPVAPTG-----ASGPGYPVVGGR 299

Query:   329 APSYD-PTKGPSYDPAKGPGYDPTKG--PGYDAQKGSNYDAQRGPNYDIH--RGPSY 380
             APS   P + P   P   P Y PT+   PG+  Q    Y     P Y     +GP++
Sbjct:   300 APSPGYPQQPPYLSPGGKPPY-PTQPQPPGFAGQPQPPYPPGPAPPYGFPPPQGPTW 355

 Score = 89 (36.4 bits), Expect = 0.00083, Sum P(2) = 0.00083
 Identities = 53/188 (28%), Positives = 64/188 (34%)

Query:   311 PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGP 370
             PR P  +A+     D    P   P   PS  P     Y P+  PG      ++   Q  P
Sbjct:   179 PR-PSPQATPPVAEDRQPPPPLPPPPQPSVVPPYPLPYSPS--PGMSVGPTAHGALQPAP 235

Query:   371 NYDIHRGPSYDPQRGLG--Y-DMQRGPNYDM--QRGPGYETQRVPGYDVQ----RGPVYE 421
              + +   PS+     LG  Y   Q GP         P   T   PGY V      GP Y 
Sbjct:   236 -FPVVSQPSFSYSGPLGPPYPSAQPGPRASAGYSWSPQRSTPPRPGYPVAPTGASGPGYP 294

Query:   422 AQ--RAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNN 479
                 RAPS      PGY  Q        + P     +  GF G P+   P G  PP    
Sbjct:   295 VVGGRAPS------PGYPQQPPYLSPGGKPPYPTQPQPPGFAGQPQPPYPPGPAPPYGFP 348

Query:   480 VPYGSATP 487
              P G   P
Sbjct:   349 PPQGPTWP 356

 Score = 70 (29.7 bits), Expect = 0.00050, Sum P(2) = 0.00050
 Identities = 31/143 (21%), Positives = 65/143 (45%)

Query:    44 MMPPPEVMEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKSE 103
             M   PE ++ ++A +  E+Q L  E +   AT+ +L +     Q  L+I    +    S+
Sbjct:    15 MQNDPEAID-RLAQESPEVQDLQLEREMALATNRSLAERNLEFQGPLEISRSNL----SD 69

Query:   104 RELQMRNLTEKIAKMEAEL-KTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQR 162
             +  ++R L E+  + +A+L K +  ++L       + + + +  EE  A   +  +    
Sbjct:    70 KYQELRKLVERCQEQKAKLEKFSSALQLGTLLDLLQIEGMKI-EEESEAMAEKFLEGEVP 128

Query:   163 AHTDVQQIPAL--LSELESLRQE 183
               T ++   ++  LS L  +R E
Sbjct:   129 LETFLETFSSMRMLSHLRRVRVE 151


>UNIPROTKB|P02453
            symbol:COL1A1 "Collagen alpha-1(I) chain" species:9913 "Bos
            taurus" [GO:0090263 "positive regulation of canonical Wnt receptor
            signaling pathway" evidence=IEA] [GO:0071260 "cellular response to
            mechanical stimulus" evidence=IEA] [GO:0071230 "cellular response
            to amino acid stimulus" evidence=IEA] [GO:0070208 "protein
            heterotrimerization" evidence=IEA] [GO:0060351 "cartilage
            development involved in endochondral bone morphogenesis"
            evidence=IEA] [GO:0060346 "bone trabecula formation" evidence=IEA]
            [GO:0060325 "face morphogenesis" evidence=IEA] [GO:0048706
            "embryonic skeletal system development" evidence=IEA] [GO:0048407
            "platelet-derived growth factor binding" evidence=IEA] [GO:0045893
            "positive regulation of transcription, DNA-dependent" evidence=IEA]
            [GO:0043589 "skin morphogenesis" evidence=IEA] [GO:0042802
            "identical protein binding" evidence=IEA] [GO:0034505 "tooth
            mineralization" evidence=IEA] [GO:0034504 "protein localization to
            nucleus" evidence=IEA] [GO:0032964 "collagen biosynthetic process"
            evidence=IEA] [GO:0030335 "positive regulation of cell migration"
            evidence=IEA] [GO:0030199 "collagen fibril organization"
            evidence=IEA] [GO:0015031 "protein transport" evidence=IEA]
            [GO:0010812 "negative regulation of cell-substrate adhesion"
            evidence=IEA] [GO:0010718 "positive regulation of epithelial to
            mesenchymal transition" evidence=IEA] [GO:0007605 "sensory
            perception of sound" evidence=IEA] [GO:0007601 "visual perception"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005615
            "extracellular space" evidence=IEA] [GO:0005584 "collagen type I"
            evidence=IEA] [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0001958 "endochondral ossification"
            evidence=IEA] [GO:0001957 "intramembranous ossification"
            evidence=IEA] [GO:0001649 "osteoblast differentiation"
            evidence=IEA] [GO:0001568 "blood vessel development" evidence=IEA]
            [GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000885
            InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410 ProDom:PD002078
            PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038
            SMART:SM00214 GO:GO:0005737 GO:GO:0045893 GO:GO:0005615
            GO:GO:0046872 GO:GO:0015031 GO:GO:0007601 GO:GO:0030199
            GO:GO:0007605 GO:GO:0010718 GO:GO:0030335 GO:GO:0070208
            InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793 GO:GO:0071260
            GO:GO:0001568 GO:GO:0001649 GO:GO:0034505 GO:GO:0090263
            GO:GO:0010812 GO:GO:0060325 GO:GO:0032964 GO:GO:0071230
            GO:GO:0048706 GO:GO:0001957 GO:GO:0034504 GO:GO:0001958
            GO:GO:0060351 GO:GO:0005201 GO:GO:0043589 EMBL:BC105184
            IPI:IPI00707857 PIR:A91193 RefSeq:NP_001029211.1 UniGene:Bt.23316
            IntAct:P02453 STRING:P02453 PRIDE:P02453 Ensembl:ENSBTAT00000017420
            GeneID:282187 KEGG:bta:282187 CTD:1277 GeneTree:ENSGT00660000095287
            HOGENOM:HOG000085654 HOVERGEN:HBG004933 InParanoid:P02453 KO:K06236
            OMA:VAYMDQQ OrthoDB:EOG4S4PHP NextBio:20806015 PMAP-CutDB:P02453
            ArrayExpress:P02453 GO:GO:0005584 GO:GO:0060346 Uniprot:P02453
        Length = 1463

 Score = 125 (49.1 bits), Expect = 0.00051, P = 0.00051
 Identities = 83/280 (29%), Positives = 109/280 (38%)

Query:   236 AADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAY 295
             A    + GA G ++ E  G P G    E   GV    GPP  A  AG  G  P       
Sbjct:   339 AGPPGFPGAVG-AKGE--GGPQGPRGSEGPQGVRGEPGPPGPAGAAGPAG-NPGADGQPG 394

Query:   296 AATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDP-TKGPSYDPAKGPGYDPTKGP 354
             A   +G P      I   PG+  ++GP     + PS  P  KG S +P   PG   +KG 
Sbjct:   395 AKGANGAP-----GIAGAPGFPGARGPS--GPQGPSGPPGPKGNSGEPG-APG---SKGD 443

Query:   355 -GYDAQKG-SNYDAQRGP-NYDIHRGPSYDP-QRGL-GYDMQRGPNYDMQRGPGYETQRV 409
              G   + G +      GP   +  RG   +P   GL G   +RG       GPG  ++  
Sbjct:   444 TGAKGEPGPTGIQGPPGPAGEEGKRGARGEPGPAGLPGPPGERG-------GPG--SRGF 494

Query:   410 PGYDVQRGPVYEA-QR-APSYI-PQRGPGYDLQRGQGYDMRRAPSYDPSRGT-GFDGAPR 465
             PG D   GP   A +R AP    P+  PG   + G+   +  A     S G+ G DG   
Sbjct:   495 PGADGVAGPKGPAGERGAPGPAGPKGSPGEAGRPGEA-GLPGAKGLTGSPGSPGPDGKTG 553

Query:   466 GAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNPAR 503
                P GQ   P    P G+       G   P+G  G P +
Sbjct:   554 PPGPAGQDGRPGPPGPPGARGQAGVMGFPGPKGAAGEPGK 593

 Score = 125 (49.1 bits), Expect = 0.00051, P = 0.00051
 Identities = 90/286 (31%), Positives = 109/286 (38%)

Query:   237 ADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNTSTS 293
             ADG  G  G  G++  +    P G  A   G   P G+ G P      G   AGP  +T 
Sbjct:   818 ADGQPGAKGEPGDAGAKGDAGPPGP-AGPAGPPGPIGNVGAPGPKGARG--SAGPPGATG 874

Query:   294 -AYAATQSGTPMRAAYDIPRGP----GYEASKGPGYDASKA--PSYDPTKGPSYDPA--K 344
                AA + G P  +    P GP    G E SKGP  +   A  P      GP   PA  K
Sbjct:   875 FPGAAGRVGPPGPSGNAGPPGPPGPAGKEGSKGPRGETGPAGRPGEVGPPGPP-GPAGEK 933

Query:   345 G-PGYD-P-----TKGP-GYDAQKGS-NYDAQRGPN-YDIHRGPSYDP-QRG-LGYDMQR 392
             G PG D P     T GP G   Q+G      QRG   +    GPS +P ++G  G   +R
Sbjct:   934 GAPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQGPSGASGER 993

Query:   393 GPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSY 452
             GP   M  GP       PG     GP  E+ R  +   +  PG D   G   D       
Sbjct:   994 GPPGPM--GP-------PGL---AGPPGESGREGAPGAEGSPGRDGSPGAKGDRGETGPA 1041

Query:   453 DPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 498
              P    G  GAP    P G+        P G A P    G+  P G
Sbjct:  1042 GPPGAPGAPGAPGPVGPAGKSGDRGETGPAGPAGPIGPVGARGPAG 1087


>UNIPROTKB|P02461
            symbol:COL3A1 "Collagen alpha-1(III) chain" species:9606
            "Homo sapiens" [GO:0046872 "metal ion binding" evidence=IEA]
            [GO:0001501 "skeletal system development" evidence=IEA] [GO:0001568
            "blood vessel development" evidence=IEA] [GO:0046332 "SMAD binding"
            evidence=IEA] [GO:0048565 "digestive tract development"
            evidence=IEA] [GO:0071230 "cellular response to amino acid
            stimulus" evidence=IEA] [GO:0031012 "extracellular matrix"
            evidence=IMP;TAS] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0007160 "cell-matrix adhesion" evidence=IDA] [GO:0018149
            "peptide cross-linking" evidence=IDA] [GO:0050777 "negative
            regulation of immune response" evidence=IMP] [GO:0005178 "integrin
            binding" evidence=NAS;IMP] [GO:0030168 "platelet activation"
            evidence=NAS] [GO:0007179 "transforming growth factor beta receptor
            signaling pathway" evidence=IDA] [GO:0034097 "response to cytokine
            stimulus" evidence=IDA] [GO:0009314 "response to radiation"
            evidence=IDA] [GO:0042060 "wound healing" evidence=IDA;NAS]
            [GO:0043206 "extracellular fibril organization" evidence=IMP]
            [GO:0030199 "collagen fibril organization" evidence=NAS;IMP]
            [GO:0007507 "heart development" evidence=IMP] [GO:0032964 "collagen
            biosynthetic process" evidence=IMP;TAS] [GO:0005615 "extracellular
            space" evidence=IDA;NAS] [GO:0043588 "skin development"
            evidence=IMP] [GO:0005201 "extracellular matrix structural
            constituent" evidence=IMP] [GO:0007229 "integrin-mediated signaling
            pathway" evidence=IMP] [GO:0005586 "collagen type III"
            evidence=NAS;IMP] [GO:0048407 "platelet-derived growth factor
            binding" evidence=IDA] [GO:0005576 "extracellular region"
            evidence=TAS] [GO:0005788 "endoplasmic reticulum lumen"
            evidence=TAS] [GO:0007411 "axon guidance" evidence=TAS] [GO:0030198
            "extracellular matrix organization" evidence=TAS]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 Reactome:REACT_118779
            Reactome:REACT_111045 Reactome:REACT_111102 GO:GO:0007411
            GO:GO:0043588 GO:GO:0005615 GO:GO:0030168 GO:GO:0007507
            GO:GO:0046872 GO:GO:0034097 GO:GO:0030199 GO:GO:0005788
            GO:GO:0001501 EMBL:CH471058 GO:GO:0005178 GO:GO:0007179
            GO:GO:0007229 GO:GO:0007160
            Pathway_Interaction_DB:endothelinpathway InterPro:IPR008160
            Pfam:PF01391 eggNOG:NOG12793 GO:GO:0001568 GO:GO:0048565
            GO:GO:0050777 GO:GO:0009314 GO:GO:0018149 GO:GO:0032964
            GO:GO:0071230 GO:GO:0043206 GO:GO:0005201 HOVERGEN:HBG004933
            KO:K06236 DrugBank:DB00048 DrugBank:DB00039 GO:GO:0048407
            OrthoDB:EOG4FTW1C EMBL:X14420 EMBL:AY054301 EMBL:AY016295
            EMBL:AC066694 EMBL:BC028178 EMBL:M26939 EMBL:X07240 EMBL:X15332
            EMBL:S62925 EMBL:S79877 EMBL:M59312 EMBL:M59227 EMBL:M55603
            EMBL:X06700 EMBL:X01655 EMBL:X01742 EMBL:M13146 EMBL:M11134
            IPI:IPI00021033 IPI:IPI00167087 PIR:S05272 RefSeq:NP_000081.1
            UniGene:Hs.443625 PDB:2V53 PDB:3DMW PDB:4AE2 PDB:4AEJ PDB:4AK3
            PDBsum:2V53 PDBsum:3DMW PDBsum:4AE2 PDBsum:4AEJ PDBsum:4AK3
            ProteinModelPortal:P02461 SMR:P02461 DIP:DIP-57177N IntAct:P02461
            STRING:P02461 PhosphoSite:P02461 DMDM:124056490 PaxDb:P02461
            PRIDE:P02461 Ensembl:ENST00000304636 GeneID:1281 KEGG:hsa:1281
            UCSC:uc002uqj.1 CTD:1281 GeneCards:GC02P189803 HGNC:HGNC:2201
            HPA:CAB016766 HPA:HPA007583 MIM:100070 MIM:120180 MIM:130020
            MIM:130050 neXtProt:NX_P02461 Orphanet:2500 Orphanet:285
            Orphanet:286 Orphanet:86 PharmGKB:PA26716 InParanoid:P02461
            OMA:EGSPGHP PhylomeDB:P02461 ChiTaRS:COL3A1
            EvolutionaryTrace:P02461 GenomeRNAi:1281 NextBio:5177
            ArrayExpress:P02461 Bgee:P02461 Genevestigator:P02461
            GermOnline:ENSG00000168542 GO:GO:0005586 Uniprot:P02461
        Length = 1466

 Score = 125 (49.1 bits), Expect = 0.00051, P = 0.00051
 Identities = 82/286 (28%), Positives = 103/286 (36%)

Query:   231 NVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP- 288
             +V    A G   G  G +       P G + +    G P   GPP     AG  G  GP 
Sbjct:   159 DVKSGVAVGGLAGYPGPAGPPGPPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPSGPPGPP 218

Query:   289 ---NTSTSAYAATQSGTPMRAAY-DIPRGPGYEASKG-PGYDASKAP-SYDPTKGPSYDP 342
                  S  A    +SG P R     +P  PG +   G PG+   K    +D   G   + 
Sbjct:   219 GAIGPSGPAGKDGESGRPGRPGERGLPGPPGIKGPAGIPGFPGMKGHRGFDGRNGEKGET 278

Query:   343 AKGPGYDPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQR 400
                PG     G PG +   G      RG   +  R P      G  G D  RG   D Q 
Sbjct:   279 G-APGLKGENGLPGENGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQP 332

Query:   401 GP-GYE-TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGT 458
             GP G   T   PG    +G V  A    S      PG   QRG+      A +  P    
Sbjct:   333 GPPGPPGTAGFPGSPGAKGEVGPAGSPGS---NGAPG---QRGEPGPQGHAGAQGPPGPP 386

Query:   459 GFDGAPRGAAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RGG 499
             G +G+P G    G  P  +   P   G+  PP  +G+ G P  RGG
Sbjct:   387 GINGSPGGKGEMG--PAGIPGAPGLMGARGPPGPAGANGAPGLRGG 430


>UNIPROTKB|O43186
            symbol:CRX "Cone-rod homeobox protein" species:9606 "Homo
            sapiens" [GO:0043565 "sequence-specific DNA binding" evidence=IEA]
            [GO:0006351 "transcription, DNA-dependent" evidence=IEA]
            [GO:0007601 "visual perception" evidence=IEA] [GO:0050896 "response
            to stimulus" evidence=IEA] [GO:0003682 "chromatin binding"
            evidence=IEA] [GO:0003700 "sequence-specific DNA binding
            transcription factor activity" evidence=IEA] [GO:0005667
            "transcription factor complex" evidence=IEA] [GO:0045944 "positive
            regulation of transcription from RNA polymerase II promoter"
            evidence=IEA] [GO:0060041 "retina development in camera-type eye"
            evidence=IEA] [GO:0043522 "leucine zipper domain binding"
            evidence=IPI] [GO:0009887 "organ morphogenesis" evidence=TAS]
            InterPro:IPR001356 InterPro:IPR009057 InterPro:IPR013851
            InterPro:IPR017970 Pfam:PF00046 Pfam:PF03529 PROSITE:PS00027
            PROSITE:PS50071 SMART:SM00389 GO:GO:0007601 GO:GO:0043565
            GO:GO:0045944 GO:GO:0003700 GO:GO:0006351 GO:GO:0003682
            Orphanet:1872 Orphanet:791 GO:GO:0050896 Gene3D:1.10.10.60
            SUPFAM:SSF46689 GO:GO:0005667 GO:GO:0009887 GO:GO:0060041
            Orphanet:65 MIM:268000 CTD:1406 eggNOG:NOG324074
            HOGENOM:HOG000082677 HOVERGEN:HBG004028 KO:K09337 OMA:QTKARPA
            OrthoDB:EOG4NKBWG EMBL:AF024711 EMBL:BT007364 EMBL:AC008745
            EMBL:BC016664 EMBL:BC053672 IPI:IPI00011226 RefSeq:NP_000545.1
            UniGene:Hs.617342 UniGene:Hs.633434 UniGene:Hs.639114
            ProteinModelPortal:O43186 SMR:O43186 IntAct:O43186
            MINT:MINT-1442706 STRING:O43186 PhosphoSite:O43186 PRIDE:O43186
            DNASU:1406 Ensembl:ENST00000221996 Ensembl:ENST00000539067
            Ensembl:ENST00000556900 Ensembl:ENST00000557738 GeneID:1406
            KEGG:hsa:1406 UCSC:uc002phq.4 GeneCards:GC19P048327 HGNC:HGNC:2383
            HPA:HPA036762 HPA:HPA036763 MIM:120970 MIM:602225 MIM:613829
            neXtProt:NX_O43186 PharmGKB:PA26903 InParanoid:O43186
            PhylomeDB:O43186 ChiTaRS:CRX GenomeRNAi:1406 NextBio:5749
            ArrayExpress:O43186 Bgee:O43186 CleanEx:HS_CRX
            Genevestigator:O43186 GermOnline:ENSG00000105392 Uniprot:O43186
        Length = 299

 Score = 116 (45.9 bits), Expect = 0.00052, P = 0.00052
 Identities = 29/98 (29%), Positives = 42/98 (42%)

Query:   269 PQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASK 328
             P    P P A  AG+V +GP+ +++ YA T +  P  A    P   G  +S   G D   
Sbjct:   165 PASESPLPEAQRAGLVASGPSLTSAPYAMTYA--PASAFCSSPSAYGSPSSYFSGLDPYL 222

Query:   329 APSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDA 366
             +P      GP+  P  GP   P+      +  G +Y A
Sbjct:   223 SPMVPQLGGPALSPLSGPSVGPSLAQSPTSLSGQSYGA 260


>FB|FBgn0052685
            symbol:ZAP3 species:7227 "Drosophila melanogaster"
            [GO:0005634 "nucleus" evidence=IEA] [GO:0008157 "protein
            phosphatase 1 binding" evidence=IPI] [GO:0048812 "neuron projection
            morphogenesis" evidence=IMP] InterPro:IPR026314 GO:GO:0005634
            EMBL:AE014298 PANTHER:PTHR13413 GeneTree:ENSGT00440000039837
            FlyBase:FBgn0052685 RefSeq:NP_727393.1 UniGene:Dm.10734
            ProteinModelPortal:Q9W2Y5 SMR:Q9W2Y5 IntAct:Q9W2Y5 MINT:MINT-741898
            STRING:Q9W2Y5 EnsemblMetazoa:FBtr0071489 GeneID:31942
            KEGG:dme:Dmel_CG32685 UCSC:CG32685-RC InParanoid:Q9W2Y5
            PhylomeDB:Q9W2Y5 GenomeRNAi:31942 NextBio:776058
            ArrayExpress:Q9W2Y5 Bgee:Q9W2Y5 Uniprot:Q9W2Y5
        Length = 1884

 Score = 133 (51.9 bits), Expect = 0.00053, Sum P(3) = 0.00053
 Identities = 76/276 (27%), Positives = 107/276 (38%)

Query:   247 NSENETSGRPVGQNAYEDGYGVPQGHGPP----PSATTAGVVGAGPNTSTSA----YAAT 298
             NS NE   +  G +   +    P  +GPP    P     G  G+GP  +  +    +   
Sbjct:   994 NSGNENKSQDAGDSVSTNNGEKPDNNGPPGGFGPGNGPGGGPGSGPGQNDGSRFDVFGPN 1053

Query:   299 Q-SGTP-MRAAYDIPRG---PGYE-ASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK 352
             Q SG   +    + P G   PG      GPG   +  P++    GP   P  GP   P  
Sbjct:  1054 QVSGNNFIDLDNNGPPGFGPPGRNFGPNGPGPRGNFGPNFGHNFGPR-GPG-GPFIRPNG 1111

Query:   353 G-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP-GYETQRVP 410
               PG     G ++    GPN+  + GP++ P+ G      RGP+     GP G      P
Sbjct:  1112 PLPGPGPNFGPHF-RPNGPNFGPNFGPNFGPRPGSRNFGPRGPD-----GPFG------P 1159

Query:   411 GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAP--SYDPSRGTGFDGAPRGAA 468
             G D   GP +   R P   P  GPG++++   G  +   P        G GF     GA 
Sbjct:  1160 GRDDFGGPPFGGPR-PHMGPN-GPGHNMRGFNGGPISDNPFRRQGGPPGPGFGNDDLGAG 1217

Query:   469 PHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPARR 504
             P  + P    N  +G+   P   G G   GGN  R+
Sbjct:  1218 PP-RGPRNFGN-RFGN---PGGGGGGGGGGGNNNRK 1248

 Score = 47 (21.6 bits), Expect = 0.00053, Sum P(3) = 0.00053
 Identities = 8/16 (50%), Positives = 8/16 (50%)

Query:    33 PPMPGAFPPFDMMPPP 48
             PP P   PP    PPP
Sbjct:    18 PPQPSVPPPLPDAPPP 33

 Score = 40 (19.1 bits), Expect = 0.00053, Sum P(3) = 0.00053
 Identities = 14/61 (22%), Positives = 25/61 (40%)

Query:   192 EYEKKFYNDHLESLQVMEKN-YITMATEVEKLRAELMNAPNVDRRAADGSYGGATGNSEN 250
             ++E++F    L +    +++ Y     E EK R  +       RR      GG+ G S  
Sbjct:   601 QWEQQFEEWKLANANHPDRDEYRRYEEEFEKQRRRIAERREQMRRRRQLQMGGSAGGSTT 660

Query:   251 E 251
             +
Sbjct:   661 D 661


>UNIPROTKB|F1NI72
            symbol:COL3A1 "Collagen alpha-1(III) chain" species:9031
            "Gallus gallus" [GO:0001568 "blood vessel development"
            evidence=IEA] [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] [GO:0005586 "collagen type III"
            evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0007160 "cell-matrix adhesion" evidence=IEA] [GO:0007179
            "transforming growth factor beta receptor signaling pathway"
            evidence=IEA] [GO:0007229 "integrin-mediated signaling pathway"
            evidence=IEA] [GO:0007507 "heart development" evidence=IEA]
            [GO:0009314 "response to radiation" evidence=IEA] [GO:0018149
            "peptide cross-linking" evidence=IEA] [GO:0030199 "collagen fibril
            organization" evidence=IEA] [GO:0032964 "collagen biosynthetic
            process" evidence=IEA] [GO:0034097 "response to cytokine stimulus"
            evidence=IEA] [GO:0042060 "wound healing" evidence=IEA] [GO:0043206
            "extracellular fibril organization" evidence=IEA] [GO:0043588 "skin
            development" evidence=IEA] [GO:0046332 "SMAD binding" evidence=IEA]
            [GO:0048407 "platelet-derived growth factor binding" evidence=IEA]
            [GO:0048565 "digestive tract development" evidence=IEA] [GO:0050777
            "negative regulation of immune response" evidence=IEA] [GO:0071230
            "cellular response to amino acid stimulus" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0005615 GO:GO:0034097
            GO:GO:0030199 GO:GO:0007179 GO:GO:0007229 GO:GO:0007160
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0042060 GO:GO:0050777
            GO:GO:0009314 GO:GO:0018149 GO:GO:0071230 GO:GO:0043206
            GO:GO:0005201 GeneTree:ENSGT00660000095287 GO:GO:0005586
            EMBL:AADN02034558 EMBL:AADN02034559 IPI:IPI00589264
            Ensembl:ENSGALT00000004033 OMA:ETCLSAN ArrayExpress:F1NI72
            Uniprot:F1NI72
        Length = 1498

 Score = 125 (49.1 bits), Expect = 0.00053, P = 0.00053
 Identities = 78/276 (28%), Positives = 97/276 (35%)

Query:   243 GATGN-SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSG 301
             G  G   +N   G P G        G P   GPP      G  G  P  +       + G
Sbjct:   464 GTPGEPGKNGAKGDP-GPKGERGENGTPGAPGPPGEEGKRGANGE-PGQNGVPGTPGERG 521

Query:   302 TPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAK--GPGYDPTKG-PGYD 357
             +P      +P   G    KGP G   S  P   P+ GP+ D  +  GPG    +G PG  
Sbjct:   522 SP--GFRGLPGSNGLPGEKGPAGERGSPGPP-GPS-GPAGDRGQDGGPGLPGMRGLPGIP 577

Query:   358 AQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYD--MQRGPGYE-TQRVPGYDV 414
                GS  D + GP      G   +P R  G     GP     +   PG +  +  PG + 
Sbjct:   578 GSPGS--DGKPGPP-----GNQGEPGRS-GPPGPAGPRGQPGVMGFPGPKGNEGAPGKNG 629

Query:   415 QRGPVYEAQRAPSYIPQRG----PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPH 470
             +RGP       P    + G    PG     G   D R  P   PS   G  G P G  P 
Sbjct:   630 ERGPG-GPPGTPGPAGKNGDVGLPGPPGPAGPAGD-RGEPG--PSGSPGLQGLPGGPGPA 685

Query:   471 GQVPPPLNNVPYGSATPPARSGSGQPRGGN--PARR 504
             G+   P    P G    P   G   P+G N  P  R
Sbjct:   686 GENGKPGEPGPKGDIGGPGFPG---PKGENGIPGER 718

 Score = 123 (48.4 bits), Expect = 0.00087, P = 0.00087
 Identities = 84/275 (30%), Positives = 104/275 (37%)

Query:   253 SGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 312
             +G P G        G+P   G P      G+ G  P TS +  A    G P +       
Sbjct:   424 AGSP-GNKGEMGPSGIPGAPGLPGGR---GLPGP-PGTSGNPGAKGTPGEPGKNGAKGDP 478

Query:   313 GP-GYEASKG-PGYDASKAPSYDPTKGPSYDPAKG--PGYDPTKG-PGYDAQKGSN-YDA 366
             GP G     G PG  A   P  +  +G + +P +   PG    +G PG+    GSN    
Sbjct:   479 GPKGERGENGTPG--APGPPGEEGKRGANGEPGQNGVPGTPGERGSPGFRGLPGSNGLPG 536

Query:   367 QRGPNYDIHR----GPSYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVPGYDVQRGPVYE 421
             ++GP  +       GPS  P    G D   GP     RG PG      PG D + GP   
Sbjct:   537 EKGPAGERGSPGPPGPS-GPAGDRGQD--GGPGLPGMRGLPGIPGS--PGSDGKPGPPGN 591

Query:   422 AQRAPSYIPQRGP-GYDLQRG-QGYDMRR----APSYDPSRGTGFD-GAPRGAAPHGQV- 473
              Q  P      GP G   Q G  G+   +    AP  +  RG G   G P  A  +G V 
Sbjct:   592 -QGEPGRSGPPGPAGPRGQPGVMGFPGPKGNEGAPGKNGERGPGGPPGTPGPAGKNGDVG 650

Query:   474 -P-PPLNNVPYGSATPPARSGS----GQPRGGNPA 502
              P PP    P G    P  SGS    G P G  PA
Sbjct:   651 LPGPPGPAGPAGDRGEPGPSGSPGLQGLPGGPGPA 685

 Score = 123 (48.4 bits), Expect = 0.00087, P = 0.00087
 Identities = 87/288 (30%), Positives = 109/288 (37%)

Query:   237 ADGSYG--GATG-NSENETSGRPVGQNAY-EDGY-GVPQGHGPPPSATTAGVVGAGPNTS 291
             A+GS G  G  G   E    G P G  A+ +DG  G     GPP    TAG  G+ P   
Sbjct:   327 ANGSPGQPGPRGPTGERGRPGNPGGPGAHGKDGAPGAAGPPGPPGPPGTAGFPGS-PGFK 385

Query:   292 TSAYA---ATQSGTPMRAAYDIPRGP-GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG 347
               A     A  SG+P       P+G  G    +GP   A  +P      GPS  P   PG
Sbjct:   386 GEAGPPGPAGASGSPGERGEPGPQGQAGPPGPQGPPGRAG-SPGNKGEMGPSGIPG-APG 443

Query:   348 YDPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 406
                 +G PG     G N  A+  P      G   DP    G   +RG N      PG   
Sbjct:   444 LPGGRGLPGPPGTSG-NPGAKGTPGEPGKNGAKGDP----GPKGERGEN-GTPGAPG--- 494

Query:   407 QRVPGYDVQRGPVYEAQR--APSYIPQRG-PGY-DLQRGQGYDMRRAPSYDPSRGTGFDG 462
                PG + +RG   E  +   P    +RG PG+  L    G    + P+ +  RG+    
Sbjct:   495 --PPGEEGKRGANGEPGQNGVPGTPGERGSPGFRGLPGSNGLPGEKGPAGE--RGSPGPP 550

Query:   463 APRG-AAPHGQV--P--PPLNNVPYGSATPPARSGSGQPRG--GNPAR 503
              P G A   GQ   P  P +  +P G    P   G   P G  G P R
Sbjct:   551 GPSGPAGDRGQDGGPGLPGMRGLP-GIPGSPGSDGKPGPPGNQGEPGR 597


>UNIPROTKB|E2QSE6
            symbol:TPR "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0006606 "protein import into nucleus"
            evidence=IEA] [GO:0006434 "seryl-tRNA aminoacylation" evidence=IEA]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005643 "nuclear pore"
            evidence=IEA] [GO:0005524 "ATP binding" evidence=IEA] [GO:0004828
            "serine-tRNA ligase activity" evidence=IEA] InterPro:IPR012929
            InterPro:IPR015866 Pfam:PF07926 GO:GO:0005524 GO:GO:0005737
            GO:GO:0005643 GO:GO:0006606 InterPro:IPR009053 SUPFAM:SSF46579
            GO:GO:0004828 GO:GO:0006434 Gene3D:1.10.287.40
            Ensembl:ENSCAFT00000021777 Uniprot:E2QSE6
        Length = 2366

 Score = 127 (49.8 bits), Expect = 0.00053, P = 0.00053
 Identities = 42/187 (22%), Positives = 88/187 (47%)

Query:    48 PEVME-QKIASQ---HVE-MQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKS 102
             P+  E +K+ S+   H + +Q+L  E  RL A        L   Q+ +Q L   +  +++
Sbjct:  1351 PDTEEYRKLLSEKEVHTKRIQQLTEEIGRLKAEIARSNASLTNNQNLIQSLKEDLNKVRT 1410

Query:   103 ERELQMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQR 162
             E+E   ++L  KI  ++ ++KT   VK   ++ KT+ + L   +++++    Q + D Q 
Sbjct:  1411 EKETIQKDLDAKIIDIQEKVKTITQVKKIGRRYKTQYEELKAQQDKVMETSAQSSGDHQE 1470

Query:   163 AHTDVQQIPALLSELESLRQEYHHCRGTYE-YEKKFYNDHLESLQVMEKNYITMATEVEK 221
              H  VQ++  L   L     +        E  +KK  ++     + +++  + +  E+ +
Sbjct:  1471 QHVSVQEMQELKETLNQAETKAKSLESQVENLQKKTLSEKEAEARNLQEQTVQLQCELSR 1530

Query:   222 LRAELMN 228
             LR +L +
Sbjct:  1531 LRQDLQD 1537


>ZFIN|ZDB-GENE-030131-4487
            symbol:sec24c "SEC24 family, member C (S.
            cerevisiae)" species:7955 "Danio rerio" [GO:0030127 "COPII vesicle
            coat" evidence=IEA] [GO:0006886 "intracellular protein transport"
            evidence=IEA] [GO:0006888 "ER to Golgi vesicle-mediated transport"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0006810 "transport" evidence=IEA] [GO:0015031 "protein
            transport" evidence=IEA] InterPro:IPR006895 InterPro:IPR006896
            InterPro:IPR006900 Pfam:PF04810 Pfam:PF04811 Pfam:PF04815
            ZFIN:ZDB-GENE-030131-4487 GO:GO:0006886 GO:GO:0008270
            InterPro:IPR007123 Pfam:PF00626 GO:GO:0006888 GO:GO:0030127
            SUPFAM:SSF82919 InterPro:IPR012990 Pfam:PF08033 SUPFAM:SSF81811
            GeneTree:ENSGT00590000082962 EMBL:CU469520 EMBL:CU694198
            IPI:IPI00972073 Ensembl:ENSDART00000085476 ArrayExpress:F1R9P2
            Bgee:F1R9P2 Uniprot:F1R9P2
        Length = 1241

 Score = 124 (48.7 bits), Expect = 0.00054, P = 0.00054
 Identities = 82/291 (28%), Positives = 110/291 (37%)

Query:   242 GGATGNSENETSGRPV--GQNAYED-GYGVPQGHGPPPS-ATTAGVVGAGPNTSTSAYAA 297
             G   G  E  TSG P   G  +Y   G G  Q +GPPP  A   G + + P+T  +   +
Sbjct:    70 GPPQGMREPPTSGTPPVSGAQSYSQFGQGETQ-NGPPPMVAPPQGTLVSQPHTPNAVSLS 128

Query:   298 TQSGTPMRAAYDIPR-GPGYEASKGPGYDA-SKAPSYDPTKGPSYDP---AKGP---GYD 349
               +  P    +  P  G     ++       S APS  P  GP Y P   A+ P    Y 
Sbjct:   129 GPTQPPYGQQFGSPPIGMQQMTNQMASMQVGSTAPS--PA-GPGYAPPSTAQAPISAAYT 185

Query:   350 PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDM---QRGPGYE 405
             P+  P +     S+  +Q  P   + + P   P  G     Q+  PN        GP  +
Sbjct:   186 PSAPPTFPPT--SSAPSQPPPTEAVAQAPP-QPYYGAPPPAQQPFPNAVSTFSSAGPT-Q 241

Query:   406 TQRVPGYDVQRGPVYEAQRAPSY--IPQRGP----GYDLQRGQGYDMRRAPSYDPSRGTG 459
              Q  P    Q  P   A   P +   P  GP    G  L   Q    +RAP      G  
Sbjct:   242 PQAPPSVSQQSFPQAPAVSQPPFSTAPPPGPSQSYGGPLPPTQP-SFQRAPLPTSQPGV- 299

Query:   460 FDGAPRGAAPHGQVP------PPLNNV-PYGSATPPARSGSGQPRGGNPAR 503
             F G P   + H Q+P      PP++   PY S  PP  + S  P+ G P R
Sbjct:   300 FPGGPPPTSTHSQLPGPMPPQPPVSQPSPYYSEPPPT-TASFPPQVGAPPR 349


>UNIPROTKB|P15941
            symbol:MUC1 "Mucin-1" species:9606 "Homo sapiens"
            [GO:0005576 "extracellular region" evidence=IEA] [GO:0005737
            "cytoplasm" evidence=IBA] [GO:0009986 "cell surface" evidence=IBA]
            [GO:0016324 "apical plasma membrane" evidence=IBA] [GO:0005887
            "integral to plasma membrane" evidence=TAS] [GO:0005796 "Golgi
            lumen" evidence=TAS] [GO:0016266 "O-glycan processing"
            evidence=TAS] [GO:0043687 "post-translational protein modification"
            evidence=TAS] [GO:0044267 "cellular protein metabolic process"
            evidence=TAS] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0002039 "p53 binding" evidence=IPI] [GO:0006977 "DNA damage
            response, signal transduction by p53 class mediator resulting in
            cell cycle arrest" evidence=IDA] [GO:0000790 "nuclear chromatin"
            evidence=IDA] [GO:0090240 "positive regulation of histone H4
            acetylation" evidence=IDA] [GO:0000978 "RNA polymerase II core
            promoter proximal region sequence-specific DNA binding"
            evidence=IDA] [GO:0043618 "regulation of transcription from RNA
            polymerase II promoter in response to stress" evidence=IDA]
            [GO:0006978 "DNA damage response, signal transduction by p53 class
            mediator resulting in transcription of p21 class mediator"
            evidence=IDA] [GO:0010944 "negative regulation of transcription by
            competitive promoter binding" evidence=IDA] [GO:0003712
            "transcription cofactor activity" evidence=IDA] [GO:0036003
            "positive regulation of transcription from RNA polymerase II
            promoter in response to stress" evidence=IDA] [GO:0043066 "negative
            regulation of apoptotic process" evidence=IDA] Reactome:REACT_17015
            PANTHER:PTHR10006 GO:GO:0043066 GO:GO:0005576 GO:GO:0009986
            GO:GO:0005887 GO:GO:0006977 GO:GO:0016324 GO:GO:0000978
            GO:GO:0000790 GO:GO:0003712 GO:GO:0043687 InterPro:IPR000082
            Pfam:PF01390 SMART:SM00200 PROSITE:PS50024 GO:GO:0005796
            EMBL:CH471121 GO:GO:0010944 GO:GO:0090240 PDB:2FO4 PDBsum:2FO4
            GO:GO:0016266 GO:GO:0006978 EMBL:AL713999 GO:GO:0036003
            MEROPS:S71.001 CTD:4582 eggNOG:NOG77744 KO:K06568
            InterPro:IPR023217 EMBL:J05582 EMBL:M32738 EMBL:M32739 EMBL:M34089
            EMBL:M34088 EMBL:J05581 EMBL:M61170 EMBL:X52229 EMBL:X52228
            EMBL:M35093 EMBL:X80761 EMBL:U60259 EMBL:U60260 EMBL:U60261
            EMBL:AF125525 EMBL:AF348143 EMBL:AY327582 EMBL:AY463543
            EMBL:BC120974 EMBL:Z17324 EMBL:Z17325 EMBL:M31823 EMBL:S81781
            EMBL:S81736 EMBL:M21868 IPI:IPI00013955 IPI:IPI00218163
            IPI:IPI00218164 IPI:IPI00218165 IPI:IPI00218166 IPI:IPI00218168
            IPI:IPI00218169 IPI:IPI00607673 IPI:IPI00902840 IPI:IPI00978078
            PIR:A35175 RefSeq:NP_001018016.1 RefSeq:NP_001018017.1
            RefSeq:NP_001037855.1 RefSeq:NP_001037856.1 RefSeq:NP_001037857.1
            RefSeq:NP_001037858.1 RefSeq:NP_001191214.1 RefSeq:NP_001191215.1
            RefSeq:NP_001191216.1 RefSeq:NP_001191217.1 RefSeq:NP_001191218.1
            RefSeq:NP_001191219.1 RefSeq:NP_001191220.1 RefSeq:NP_001191221.1
            RefSeq:NP_001191222.1 RefSeq:NP_001191223.1 RefSeq:NP_001191224.1
            RefSeq:NP_001191225.1 RefSeq:NP_001191226.1 RefSeq:NP_002447.4
            UniGene:Hs.89603 PDB:2ACM PDBsum:2ACM ProteinModelPortal:P15941
            SMR:P15941 IntAct:P15941 MINT:MINT-156679 STRING:P15941
            GlycoSuiteDB:P15941 PhosphoSite:P15941 DMDM:296439295 PaxDb:P15941
            PRIDE:P15941 DNASU:4582 Ensembl:ENST00000337604
            Ensembl:ENST00000343256 Ensembl:ENST00000368389
            Ensembl:ENST00000368390 Ensembl:ENST00000368398 GeneID:4582
            KEGG:hsa:4582 UCSC:uc001fib.3 GeneCards:GC01M155158 HGNC:HGNC:7508
            HPA:CAB000036 HPA:CAB001986 HPA:HPA004179 HPA:HPA007235
            HPA:HPA008855 MIM:113720 MIM:158340 neXtProt:NX_P15941
            PharmGKB:PA31309 ChiTaRS:MUC1 EvolutionaryTrace:P15941
            GenomeRNAi:4582 NextBio:17597 Bgee:P15941 Genevestigator:P15941
            GermOnline:ENSG00000185499 Uniprot:P15941
        Length = 1255

 Score = 124 (48.7 bits), Expect = 0.00055, P = 0.00055
 Identities = 65/275 (23%), Positives = 91/275 (33%)

Query:   237 ADGSYGGATGNSENETSGRPVG--QNAYEDGYGVPQGHGPPP-SATTAGV-VGAGPNTST 292
             A  + GG    S  + S  P    +NA      V   H P   S+TT G  V   P T  
Sbjct:    27 ASSTPGGEKETSATQRSSVPSSTEKNAVSMTSSVLSSHSPGSGSSTTQGQDVTLAPATEP 86

Query:   293 SAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK 352
             ++ +A   G  + +   + R P   ++  P +D + AP   P  G +  PA G    P  
Sbjct:    87 ASGSAATWGQDVTSV-PVTR-PALGSTTPPAHDVTSAPDNKPAPGSTAPPAHGVTSAPDT 144

Query:   353 GPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL--GYDMQRGPNYDMQRGPGY----ET 406
              P   +     +     P+     G +  P  G+    D +  P        G     +T
Sbjct:   145 RPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDT 204

Query:   407 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG--QGYDMRRAPSYDPSRGTGFDGAP 464
             +  PG      P +    AP   P  G       G     D R AP        G   AP
Sbjct:   205 RPAPGSTAP--PAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAP 262

Query:   465 RGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 499
                   G   PP + V     T PA   +  P  G
Sbjct:   263 DTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHG 297


>UNIPROTKB|Q9BRQ0
            symbol:PYGO2 "Pygopus homolog 2" species:9606 "Homo
            sapiens" [GO:0008270 "zinc ion binding" evidence=IEA] [GO:0001701
            "in utero embryonic development" evidence=IEA] [GO:0001822 "kidney
            development" evidence=IEA] [GO:0002088 "lens development in
            camera-type eye" evidence=IEA] [GO:0007420 "brain development"
            evidence=IEA] [GO:0009791 "post-embryonic development"
            evidence=IEA] [GO:0030879 "mammary gland development" evidence=IEA]
            [GO:0033599 "regulation of mammary gland epithelial cell
            proliferation" evidence=IEA] [GO:0042393 "histone binding"
            evidence=IEA] [GO:0048589 "developmental growth" evidence=IEA]
            [GO:0051569 "regulation of histone H3-K4 methylation" evidence=IEA]
            [GO:0060021 "palate development" evidence=IEA] [GO:0060070
            "canonical Wnt receptor signaling pathway" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] InterPro:IPR001965
            InterPro:IPR019787 Pfam:PF00628 PROSITE:PS50016 SMART:SM00249
            GO:GO:0005634 GO:GO:0007420 GO:GO:0046872 GO:GO:0008270
            GO:GO:0001701 GO:GO:0009791 GO:GO:0001822 Gene3D:3.30.40.10
            InterPro:IPR011011 InterPro:IPR013083 SUPFAM:SSF57903 GO:GO:0048589
            InterPro:IPR019786 PROSITE:PS01359 GO:GO:0060021 GO:GO:0060070
            GO:GO:0030879 GO:GO:0033599 PDB:2XB1 PDBsum:2XB1 GO:GO:0051569
            GO:GO:0002088 eggNOG:NOG72798 HOGENOM:HOG000001580
            HOVERGEN:HBG053774 EMBL:AF457208 EMBL:BC006132 EMBL:BC013725
            EMBL:BC032099 EMBL:AF289598 IPI:IPI00042099 RefSeq:NP_612157.1
            UniGene:Hs.533597 ProteinModelPortal:Q9BRQ0 SMR:Q9BRQ0
            IntAct:Q9BRQ0 STRING:Q9BRQ0 PhosphoSite:Q9BRQ0 DMDM:23396825
            PaxDb:Q9BRQ0 PRIDE:Q9BRQ0 DNASU:90780 Ensembl:ENST00000368457
            GeneID:90780 KEGG:hsa:90780 UCSC:uc001fft.3 CTD:90780
            GeneCards:GC01M154929 HGNC:HGNC:30257 HPA:HPA023689 MIM:606903
            neXtProt:NX_Q9BRQ0 PharmGKB:PA134881185 InParanoid:Q9BRQ0
            OMA:PGLVYPC OrthoDB:EOG4QZ7MB PhylomeDB:Q9BRQ0 GenomeRNAi:90780
            NextBio:76956 ArrayExpress:Q9BRQ0 Bgee:Q9BRQ0 CleanEx:HS_PYGO2
            Genevestigator:Q9BRQ0 GermOnline:ENSG00000163348 Uniprot:Q9BRQ0
        Length = 406

 Score = 118 (46.6 bits), Expect = 0.00055, P = 0.00055
 Identities = 72/267 (26%), Positives = 101/267 (37%)

Query:   257 VGQNAYEDGYGVPQ-GHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPG 315
             V  N +ED +G P+ G   PP   +    G             Q G     A  +P  PG
Sbjct:    73 VASNPFEDDFGAPKVGVAAPPFLGSPVPFGG---------FRVQGGM----AGQVP--PG 117

Query:   316 YEASKGPGYDASKA--PSYDPTK-GPSYD-PAKGPGYDPTKGPGYDAQKGSNYDAQRGPN 371
             Y    G G    +   P + P   GP+++ P +GPGY P     + +Q    ++   G N
Sbjct:   118 YSTGGGGGPQPLRRQPPPFPPNPMGPAFNMPPQGPGYPPPGNMNFPSQP---FNQPLGQN 174

Query:   372 YDIHRGPSYD-PQRGLGY----DMQRGPNYDMQRGPGYETQRV--PGYDVQRGPVYE-AQ 423
             +    G     P  G G      M + P  ++  GP   +QR   PG      P+    Q
Sbjct:   175 FSPPSGQMMPGPVGGFGPMISPTMGQPPRAEL--GPPSLSQRFAQPGAPFGPSPLQRPGQ 232

Query:   424 RAPSYIPQRGP--GYDLQ-RGQGYDMRRAPSYDPSRGTGFDGAPRGAAP----HGQVPP- 475
               PS  P   P  G D    G G +    P  +P   T F   P   +P    +G  P  
Sbjct:   233 GLPSLPPNTSPFPGPDPGFPGPGGEDGGKP-LNPPASTAFPQEPHSGSPAAAVNGNQPSF 291

Query:   476 PLNNVPYGSATPPARSGS--GQPRGGN 500
             P N+   G  TP A S +  G+  GG+
Sbjct:   292 PPNSSGRGGGTPDANSLAPPGKAGGGS 318


>WB|WBGene00001076
            symbol:dpy-17 species:6239 "Caenorhabditis elegans"
            [GO:0042302 "structural constituent of cuticle" evidence=IEA]
            [GO:0016021 "integral to membrane" evidence=IEA] [GO:0010171 "body
            morphogenesis" evidence=IMP] [GO:0040002 "collagen and
            cuticulin-based cuticle development" evidence=IMP] [GO:0040035
            "hermaphrodite genitalia development" evidence=IMP] [GO:0040007
            "growth" evidence=IMP] [GO:0002119 "nematode larval development"
            evidence=IMP] InterPro:IPR002486 Pfam:PF01484 SMART:SM01088
            GO:GO:0040007 GO:GO:0002119 GO:GO:0010171 GO:GO:0040035
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0040002 EMBL:FO080874
            GO:GO:0042302 HOGENOM:HOG000085656 GeneTree:ENSGT00390000012316
            RefSeq:NP_498086.1 ProteinModelPortal:Q20778 SMR:Q20778
            DIP:DIP-26150N MINT:MINT-1080630 STRING:Q20778 PaxDb:Q20778
            EnsemblMetazoa:F54D8.1.1 EnsemblMetazoa:F54D8.1.2 GeneID:175696
            KEGG:cel:CELE_F54D8.1 UCSC:F54D8.1.1 CTD:175696 WormBase:F54D8.1
            eggNOG:NOG253878 InParanoid:Q20778 OMA:TEMEAWR NextBio:889252
            Uniprot:Q20778
        Length = 352

 Score = 117 (46.2 bits), Expect = 0.00056, P = 0.00056
 Identities = 78/305 (25%), Positives = 105/305 (34%)

Query:   215 MATEVEKLRAE----LMNAPNVDR-RAADGSYGGATGNSENETSGRPVGQNAYEDGY-GV 268
             + TE+E  R E     M+     R R   G YGG  G      SG P G +    G+ G 
Sbjct:    38 LTTEMEAWRLESDQIYMDMQKFGRVRRQAGGYGGYGGYGSGP-SG-PSGPSGPHGGFPGG 95

Query:   269 PQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASK 328
             PQGH P  + ++      G      +      G+P+        GPG + +         
Sbjct:    96 PQGHFPGNTGSSNTPTLPGVIGVPPSVTGHPGGSPINPDGSPSAGPGDKCNCNTENSCPA 155

Query:   329 APSYDPTKGPSYDPAKG-PGYDPTKGPGYDAQKGSNYDAQRGPNYD----IHRGPSYDP- 382
              P+  P   P +D   G PG      PG D +   +  AQ    YD       GP   P 
Sbjct:   156 GPA-GPKGTPGHDGPDGIPGV-----PGVDGEDADDAKAQT-QQYDGCFTCPAGPQGPPG 208

Query:   383 QRGL-GYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG 441
              +G  G    RG        PG +    PG     GP+     A    P   PG D++  
Sbjct:   209 SQGKPGARGMRGARGQAAM-PGRDGS--PGMPGSLGPIGPPGAAGEEGPTGEPGADVEHQ 265

Query:   442 QGYD-MRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP-RG 498
              G    +  P      G   +   RGA   G   PP    P G       +G+ G P   
Sbjct:   266 IGLPGAKGTPGAPGESGDQGEQGDRGAT--GIAGPPGERGPQGEKGDDGPNGAAGSPGEE 323

Query:   499 GNPAR 503
             G P +
Sbjct:   324 GEPGQ 328


>UNIPROTKB|F1LNH3
            symbol:Col6a2 "Protein Col6a2" species:10116 "Rattus
            norvegicus" [GO:0005615 "extracellular space" evidence=IEA]
            [GO:0031012 "extracellular matrix" evidence=IEA] [GO:0042383
            "sarcolemma" evidence=IEA] [GO:0043234 "protein complex"
            evidence=IEA] [GO:0070208 "protein heterotrimerization"
            evidence=IEA] InterPro:IPR002035 Pfam:PF00092 PROSITE:PS50234
            SMART:SM00327 RGD:1305585 GO:GO:0005615 GO:GO:0043234 GO:GO:0042383
            GO:GO:0070208 InterPro:IPR008160 Pfam:PF01391 GO:GO:0031012
            GeneTree:ENSGT00530000063022 OMA:RALCNHD IPI:IPI00372839
            Ensembl:ENSRNOT00000001695 ArrayExpress:F1LNH3 Uniprot:F1LNH3
        Length = 1025

 Score = 123 (48.4 bits), Expect = 0.00056, P = 0.00056
 Identities = 88/284 (30%), Positives = 99/284 (34%)

Query:   237 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AG-PNTSTSA 294
             +DG  G      +N T G    Q       G P   G P S    G  G AG P      
Sbjct:   320 SDGRKGAPGLAGKNGTDG----QKGKLGRIGPPGCKGDPGSRGPDGYPGEAGSPGEQGDQ 375

Query:   295 YAATQSGTPMRAAYDIPRGPGYEASKGPGYDA-SKAPSYDPTKGPSYDPA----KG-PGY 348
              A   SG P R     P  PG + SKG  Y   S AP     KG    P     KG PG 
Sbjct:   376 GAKGDSGRPGRRGP--PGNPGDKGSKG--YRGNSGAPGSPGVKGGKGGPGPRGPKGEPGR 431

Query:   349 --DP-TKG-PGYDAQKGSNYD-AQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPG 403
               DP TKG PG D  KG   D    GP        S   +   G    RGP   +   PG
Sbjct:   432 RGDPGTKGGPGSDGPKGEKGDPGPEGPRGLAGEIGSKGAKGDRGLPGPRGPQGALGE-PG 490

Query:   404 YETQRV-PGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRA----PSYDPSRGT 458
              +  R  PG    RG     Q  P   P R PG+     +G    +     P  +  RG 
Sbjct:   491 KQGSRGDPGDAGPRGD--SGQPGPKGDPGR-PGFSYPGPRGTPGEKGEPGPPGPEGGRGD 547

Query:   459 -GFDGAPRGAAPHGQV--P-PPLNNVPYGSATPPARSGSGQPRG 498
              G  GAP      G+   P PP    P G    P   G   P G
Sbjct:   548 FGLKGAPGRKGEKGEPADPGPPGEPGPRGPRGIPGPEGEPGPPG 591


>FB|FBgn0003980
            symbol:Vm26Ab "Vitelline membrane 26Ab" species:7227
            "Drosophila melanogaster" [GO:0007304 "chorion-containing eggshell
            formation" evidence=IMP] [GO:0007305 "vitelline membrane formation
            involved in chorion-containing eggshell formation" evidence=NAS]
            [GO:0008316 "structural constituent of vitelline membrane"
            evidence=NAS] [GO:0007343 "egg activation" evidence=IMP]
            [GO:0060388 "vitelline envelope" evidence=IDA] GO:GO:0005576
            EMBL:AE014134 GO:GO:0007304 GO:GO:0007343 eggNOG:NOG295326
            PROSITE:PS51137 GeneTree:ENSGT00540000073505 GO:GO:0060388
            InterPro:IPR013135 Pfam:PF10542 EMBL:M20936 EMBL:EF441676
            PIR:A45943 RefSeq:NP_476784.1 UniGene:Dm.26740 DIP:DIP-19185N
            IntAct:P13238 MINT:MINT-1563965 STRING:P13238
            EnsemblMetazoa:FBtr0079171 GeneID:33827 KEGG:dme:Dmel_CG9046
            CTD:33827 FlyBase:FBgn0003980 InParanoid:P13238 OMA:RAAYGGY
            PhylomeDB:P13238 GenomeRNAi:33827 NextBio:785460 Bgee:P13238
            GermOnline:CG9046 Uniprot:P13238
        Length = 168

 Score = 108 (43.1 bits), Expect = 0.00056, P = 0.00056
 Identities = 28/92 (30%), Positives = 35/92 (38%)

Query:   277 SATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTK 336
             S    G  GA P  +  +Y+A  +  P   AY  P  P Y A   P Y A  AP+Y    
Sbjct:    45 SRAAYGGYGAAP--AAPSYSAPAA--PAAQAYSAPAAPAYSAPAAPAYSAPAAPAYSAPA 100

Query:   337 GPSYDPAKGPGYD-PTKGPGYDAQKGSNYDAQ 367
              P+Y     P Y  P   P     K   +  Q
Sbjct:   101 APAYSAPAAPAYSAPASIPSPPCPKNYLFSCQ 132


>UNIPROTKB|Q749V3
            symbol:GSU2639 "Uncharacterized protein" species:243231
            "Geobacter sulfurreducens PCA" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0008150 "biological_process" evidence=ND] EMBL:AE017180
            GenomeReviews:AE017180_GR RefSeq:NP_953684.1
            ProteinModelPortal:Q749V3 GeneID:2686067 KEGG:gsu:GSU2639
            PATRIC:22028131 HOGENOM:HOG000131095 OMA:VAWSLTA
            ProtClustDB:CLSK828924 BioCyc:GSUL243231:GH27-2644-MONOMER
            Uniprot:Q749V3
        Length = 699

 Score = 121 (47.7 bits), Expect = 0.00057, P = 0.00057
 Identities = 60/234 (25%), Positives = 97/234 (41%)

Query:    34 PMPGAFPPFDMMPPPEVMEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQIL 93
             P P   P      P E M +K  +  V+ + L  E +R+ A +  L +E  A    L+ L
Sbjct:   154 PEPPRAPSPPSPQPAEPMPKKTPA--VDTKALEAEARRMEAENTALEREGNA---RLEEL 208

Query:    94 HGQIGGMKSERELQMRNLTEKIAKMEAELKTAEPVKLEFQKS---KTEAQNLVVAREELI 150
             + +I  +  ERE   R L  K   +EAE+   E  K E ++    +T A +  VAR  L 
Sbjct:   209 NARIAELGREREEIRRALAGKEKSLEAEIAQLERRKAEDEQELAKRTAAHDAEVAR--LK 266

Query:   151 AKVHQLTQDLQRAHT-DVQQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESL---- 205
             A + QL  +L R  + +  ++  L +EL  L  E      + E  +    D L  L    
Sbjct:   267 ADIDQLGAELARLGSGESTEVAELTAELSRLTAERERAESSAESAESELRDRLARLSAEW 326

Query:   206 ----QVMEKNYITMATEVEKLRAELMNAPNV-DRRAADGSYGGATGNSENETSG 254
                 +  E+   T+A ++ ++  E   A  +   R  +       G  E E +G
Sbjct:   327 EEVRKASEERQETLARDIARMAREKEEAERLGSERVRELEQAVRRGEEEREAAG 380


>TIGR_CMR|GSU_2639
            symbol:GSU_2639 "hypothetical protein" species:243231
            "Geobacter sulfurreducens PCA" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0008150 "biological_process" evidence=ND] EMBL:AE017180
            GenomeReviews:AE017180_GR RefSeq:NP_953684.1
            ProteinModelPortal:Q749V3 GeneID:2686067 KEGG:gsu:GSU2639
            PATRIC:22028131 HOGENOM:HOG000131095 OMA:VAWSLTA
            ProtClustDB:CLSK828924 BioCyc:GSUL243231:GH27-2644-MONOMER
            Uniprot:Q749V3
        Length = 699

 Score = 121 (47.7 bits), Expect = 0.00057, P = 0.00057
 Identities = 60/234 (25%), Positives = 97/234 (41%)

Query:    34 PMPGAFPPFDMMPPPEVMEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQIL 93
             P P   P      P E M +K  +  V+ + L  E +R+ A +  L +E  A    L+ L
Sbjct:   154 PEPPRAPSPPSPQPAEPMPKKTPA--VDTKALEAEARRMEAENTALEREGNA---RLEEL 208

Query:    94 HGQIGGMKSERELQMRNLTEKIAKMEAELKTAEPVKLEFQKS---KTEAQNLVVAREELI 150
             + +I  +  ERE   R L  K   +EAE+   E  K E ++    +T A +  VAR  L 
Sbjct:   209 NARIAELGREREEIRRALAGKEKSLEAEIAQLERRKAEDEQELAKRTAAHDAEVAR--LK 266

Query:   151 AKVHQLTQDLQRAHT-DVQQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESL---- 205
             A + QL  +L R  + +  ++  L +EL  L  E      + E  +    D L  L    
Sbjct:   267 ADIDQLGAELARLGSGESTEVAELTAELSRLTAERERAESSAESAESELRDRLARLSAEW 326

Query:   206 ----QVMEKNYITMATEVEKLRAELMNAPNV-DRRAADGSYGGATGNSENETSG 254
                 +  E+   T+A ++ ++  E   A  +   R  +       G  E E +G
Sbjct:   327 EEVRKASEERQETLARDIARMAREKEEAERLGSERVRELEQAVRRGEEEREAAG 380


>UNIPROTKB|I3L781
            symbol:I3L781 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0005201 GeneTree:ENSGT00660000095287
            Ensembl:ENSSSCT00000024528 OMA:EVSMPEI Uniprot:I3L781
        Length = 1087

 Score = 123 (48.4 bits), Expect = 0.00060, P = 0.00060
 Identities = 83/271 (30%), Positives = 99/271 (36%)

Query:   243 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPP-----PS-ATTAGVVGAGPNTSTSAYA 296
             GA G   N  +  P G    + G G     GPP     P  A TAG VG           
Sbjct:   518 GAPGPDGNNGAQGPPGPQGVQGGKGEQGPAGPPGFQGLPGPAGTAGEVGKPGERGIPG-- 575

Query:   297 ATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDPTKGPSYDPAKG-PGYDPTKG 353
               + G P  A     RGP G   + GP G   S+ PS  P  GP  D  KG PG      
Sbjct:   576 --EFGLPGPAGPRGERGPPGESGAAGPAGPIGSRGPSGPP--GP--DGNKGEPGV--LGA 627

Query:   354 PGYDAQKG-SNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVP- 410
             PG     G S    +RG    I  G     + GL  D+   P  D  RG PG      P 
Sbjct:   628 PGTAGPSGPSGLPGERGAA-GIPGGKGEKGETGLRGDVG-SPGRDGARGAPGAVGAPGPA 685

Query:   411 GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP--RGA- 467
             G +  RG    A  A    P+  PG   +RG+           P+   G  GA   RG  
Sbjct:   686 GANGDRGEAGPAGPAGPAGPRGSPG---ERGEVGPAGPNGFAGPAGAAGQPGAKGERGTK 742

Query:   468 APHGQVPPPLNNVPYGSATPPARSGSGQPRG 498
              P G+  P     P G+A P   +G   P G
Sbjct:   743 GPKGENGPVGPTGPVGAAGPAGPNGPPGPAG 773


>UNIPROTKB|B0QYK0
            symbol:EWSR1 "RNA-binding protein EWS" species:9606 "Homo
            sapiens" [GO:0000166 "nucleotide binding" evidence=IEA] [GO:0003676
            "nucleic acid binding" evidence=IEA] [GO:0005622 "intracellular"
            evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
            InterPro:IPR000504 InterPro:IPR001876 InterPro:IPR012677
            Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102 PROSITE:PS50199
            SMART:SM00360 SMART:SM00547 GO:GO:0000166 GO:GO:0008270
            Gene3D:3.30.70.330 GO:GO:0003676 GO:GO:0005622 EMBL:AC002059
            EMBL:AL031186 EMBL:AC000026 UniGene:Hs.374477 HGNC:HGNC:3508
            HOGENOM:HOG000038010 HOVERGEN:HBG000970 ChiTaRS:EWSR1
            IPI:IPI00879242 SMR:B0QYK0 STRING:B0QYK0 Ensembl:ENST00000331029
            Uniprot:B0QYK0
        Length = 618

 Score = 120 (47.3 bits), Expect = 0.00062, P = 0.00062
 Identities = 75/279 (26%), Positives = 102/279 (36%)

Query:   239 GSYGGATGNSENETSGRPV-GQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 291
             G+YG  T  S  +       GQ AY   YG P  G+  P  P A +  V G G    +T+
Sbjct:    42 GTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTT 101

Query:   292 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPS--YDPTKGPSYDPAKGPGY 348
             T+    TQ+    ++AY   P  P Y   + P   A   P     PT+      + G GY
Sbjct:   102 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPTRPQDGNKPTETSQPQSSTG-GY 158

Query:   349 D-PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE 405
             + P+ G G   Q   +Y    G  P   +   PSY P     Y   +  +YD        
Sbjct:   159 NQPSLGYG---QSNYSYPQVPGSYPMQPVTAPPSYPPT---SYSSTQPTSYDQSSYSQQN 212

Query:   406 TQRVPG-YDVQRG----PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGF 460
             T   P  Y  Q        Y  Q   SY PQ G  Y     Q Y  +++ SY        
Sbjct:   213 TYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTG-SYSQAPSQ-YS-QQSSSYGQQSSFRQ 269

Query:   461 DGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 499
             D  P     +GQ     +  P  + +       G+ RGG
Sbjct:   270 DH-PSSMGVYGQESGGFSG-PGENRSMSGPDNRGRGRGG 306


>UNIPROTKB|D4A458
            symbol:Ewsr1 "Protein Ewsr1" species:10116 "Rattus
            norvegicus" [GO:0000166 "nucleotide binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005622
            "intracellular" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102
            PROSITE:PS50199 SMART:SM00360 SMART:SM00547 RGD:1307258
            GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
            GO:GO:0005622 IPI:IPI00767290 Ensembl:ENSRNOT00000057377
            ArrayExpress:D4A458 Uniprot:D4A458
        Length = 618

 Score = 120 (47.3 bits), Expect = 0.00062, P = 0.00062
 Identities = 74/278 (26%), Positives = 100/278 (35%)

Query:   239 GSYGGATGNSENET-SGRPVGQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 291
             G+YG  T  S  +  S    GQ AY   YG P  G+  P  P A +  V G G    +T+
Sbjct:    42 GTYGQPTDVSYTQAQSTATXGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTT 101

Query:   292 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG-YD 349
             T+    TQ+    ++AY   P  P Y   + P   A   P        +  P    G Y+
Sbjct:   102 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPTRPQDGNKPAETSQPQSSTGGYN 159

Query:   350 -PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 406
              P+ G G   Q   +Y    G  P   +   PSY P     Y   +  +YD        T
Sbjct:   160 QPSLGYG---QSNYSYPQVPGSYPMQPVTAPPSYPPT---SYSSSQPTSYDQSSYSQQNT 213

Query:   407 QRVPG-YDVQRG----PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFD 461
                P  Y  Q        Y  Q   SY PQ G  Y     Q Y  +++ SY        D
Sbjct:   214 YGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTG-SYSQAPSQ-YS-QQSSSYGQQSSFRQD 270

Query:   462 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 499
               P     +GQ     +  P  + +       G+ RGG
Sbjct:   271 H-PSSMGVYGQESGGFSG-PGENRSLSGPDNRGRGRGG 306


>MGI|MGI:1346056
            symbol:Ctage5 "CTAGE family, member 5" species:10090 "Mus
            musculus" [GO:0003674 "molecular_function" evidence=ND] [GO:0005575
            "cellular_component" evidence=ND] [GO:0008150 "biological_process"
            evidence=ND] [GO:0016020 "membrane" evidence=IEA] [GO:0016021
            "integral to membrane" evidence=IEA] MGI:MGI:1346056 GO:GO:0016021
            HOVERGEN:HBG051216 eggNOG:NOG133684 HOGENOM:HOG000112043
            EMBL:BC024076 EMBL:BC026864 IPI:IPI00153633 UniGene:Mm.244118
            UniGene:Mm.438038 ProteinModelPortal:Q8R311 SMR:Q8R311
            PhosphoSite:Q8R311 PaxDb:Q8R311 PRIDE:Q8R311 UCSC:uc007nqf.2
            InParanoid:Q8R311 ChiTaRS:CTAGE5 CleanEx:MM_CTAGE5
            Genevestigator:Q8R311 GermOnline:ENSMUSG00000021000 Uniprot:Q8R311
        Length = 779

 Score = 121 (47.7 bits), Expect = 0.00065, P = 0.00065
 Identities = 99/425 (23%), Positives = 162/425 (38%)

Query:    79 LRQELAAAQHELQILHGQIGGMKSERELQMRNLTEKIAKMEAELKTAEPVKLEFQKSKTE 138
             ++++L      L+     +   K+E E + + L +K+ K+  EL     +KL   +  T 
Sbjct:   324 VKEDLTEHIKSLESKQASLQSEKTEFESESQKLQQKL-KVITELYQENEMKLH--RKLTV 380

Query:   139 AQNLVVAREELIAKVHQ-LTQDLQRAHTDVQQIPALLSELESLRQEYHHCRG-TYEYEKK 196
              +N  + +EE ++KV + ++   +   T  Q+   L  ELE   +  H  +G    +EKK
Sbjct:   381 EENYRLEKEEKLSKVDEKISHATEELETCRQRAKDLEEELE---RTIHSYQGQVISHEKK 437

Query:   197 FYNDHLESLQVMEKNYITMATEVEKLRAELMNAP------NVDRRAADGSYGGATGNSEN 250
              +++ L + + +E+N   +  E    R +L            D  A D     A G   +
Sbjct:   438 AHDNWLAA-RTLERNLNDLRKENAHNRQKLTETEFKFELLEKDPYALDVP-NTAFGREHS 495

Query:   251 ETSGRPVGQNAYED-GYGVPQG--HGP---PPSATTAGVVGA-GPNTSTSAYAATQSG-T 302
                  P+G+   E   +  P     GP    P     G  G+ GP         T+ G +
Sbjct:   496 PYGPSPLGRPPSETRAFLSPPTLLEGPLRLSPLLPGGGGRGSRGPENLLDHQMNTERGES 555

Query:   303 PMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSY-DPA--KGPGYDPTKGPGYDAQ 359
                   D PR P  + S  P ++  +  +  P  G  Y DPA  +   + P  G      
Sbjct:   556 SYDRLSDAPRAPS-DRSLSPPWEQDRRMTAHPPPGQPYSDPALQRQDRFYPNSGRLSGPA 614

Query:   360 KGSNYDAQRGPNYDIHRGP--SYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRG 417
             +  +Y+    P+ D   GP  S     G G     G N ++   P      +P      G
Sbjct:   615 ELRSYNM---PSLDKVDGPVPSEMESSGNGTKDNLG-NSNVPDSP------IPAECEAAG 664

Query:   418 PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVP-PP 476
               +     P + P R P + +     + MRR PS+ P        APR   P    P PP
Sbjct:   665 RGFFP---PPFPPVRDPLFPVDPRSQF-MRRGPSFPPPPPGSIYAAPRDYFPPRDFPGPP 720

Query:   477 LNNVP 481
             L   P
Sbjct:   721 LPPFP 725


>UNIPROTKB|B4DR34
            symbol:SS18 "Protein SSXT" species:9606 "Homo sapiens"
            [GO:0000226 "microtubule cytoskeleton organization" evidence=IEA]
            [GO:0000902 "cell morphogenesis" evidence=IEA] [GO:0005881
            "cytoplasmic microtubule" evidence=IEA] [GO:0007243 "intracellular
            protein kinase cascade" evidence=IEA] [GO:0042493 "response to
            drug" evidence=IEA] [GO:0048013 "ephrin receptor signaling pathway"
            evidence=IEA] GO:GO:0000226 GO:GO:0042493 GO:GO:0007243
            GO:GO:0000902 GO:GO:0048013 GO:GO:0005881 HOVERGEN:HBG003892
            InterPro:IPR007726 PANTHER:PTHR23107 UniGene:Hs.129261
            EMBL:AC091021 HGNC:HGNC:11340 ChiTaRS:SS18 EMBL:AK299082
            IPI:IPI01015658 STRING:B4DR34 Ensembl:ENST00000539849
            Uniprot:B4DR34
        Length = 336

 Score = 116 (45.9 bits), Expect = 0.00066, P = 0.00066
 Identities = 66/236 (27%), Positives = 88/236 (37%)

Query:   239 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQG---H--GPPPSATTAGVVGAGPNT--- 290
             G+YG     S     G  + Q      Y +PQG   H  G  P     G V  G +    
Sbjct:   106 GNYGPRPNMSMQPNQGPMMHQQPPSQQYNMPQGGGQHYQGQQPPMGMMGQVNQGNHMMGQ 165

Query:   291 -STSAYAATQSGTPMR-AAYDIPRGPGYE-ASKGP--GYDASKAPSYDPTKG---PSYDP 342
                  Y   Q G P + +  +   G  Y    +GP  G +    P      G   PSY P
Sbjct:   166 RQIPPYRPPQQGPPQQYSGQEDYYGDQYSHGGQGPPEGMNQQYYPDGHNDYGYQQPSY-P 224

Query:   343 AKGPGYD-PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 401
              +G  YD P +       +G N  +Q G   D ++GP   PQ+G     Q+ P      G
Sbjct:   225 EQG--YDRPYEDSSQHYYEGGN--SQYGQQQDAYQGPP--PQQGYPPQQQQYPGQQGYPG 278

Query:   402 P--GY-ETQRVPGYDVQRGPVYEAQRAPSYIP-QRGPGYD-LQRGQGYDMRRAPSY 452
                GY  +Q  PG      P  + Q+   Y P Q GP     QR  GYD  +  +Y
Sbjct:   279 QQQGYGPSQGGPGPQYPNYPQGQGQQYGGYRPTQPGPPQPPQQRPYGYDQGQYGNY 334


>UNIPROTKB|O18740
            symbol:KRT9 "Keratin, type I cytoskeletal 9" species:9615
            "Canis lupus familiaris" [GO:0045109 "intermediate filament
            organization" evidence=ISS] [GO:0043588 "skin development"
            evidence=ISS] [GO:0005882 "intermediate filament" evidence=IEA]
            [GO:0005198 "structural molecule activity" evidence=IEA]
            InterPro:IPR001664 InterPro:IPR002957 PRINTS:PR01248 GO:GO:0043588
            GO:GO:0005198 GO:GO:0005882 GO:GO:0045109 HOVERGEN:HBG013015
            InterPro:IPR016044 PANTHER:PTHR23239 Pfam:PF00038 PROSITE:PS00226
            HSSP:P08670 eggNOG:NOG148410 KO:K07604 EMBL:AF000949
            RefSeq:NP_001014307.1 UniGene:Cfa.38225 ProteinModelPortal:O18740
            PRIDE:O18740 GeneID:490980 KEGG:cfa:490980 CTD:3857
            InParanoid:O18740 OrthoDB:EOG4BCDNC NextBio:20863900 Uniprot:O18740
        Length = 786

 Score = 121 (47.7 bits), Expect = 0.00066, P = 0.00066
 Identities = 95/451 (21%), Positives = 158/451 (35%)

Query:    62 MQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQMRNLTEKIAKMEAE 121
             +  +  E +R++A +   R+++   Q+E Q+   +   M S +E++  +  +++ ++   
Sbjct:   307 LNDMREEYERISAKN---RKDIEE-QYETQMSQMEQEVMSSGQEMESNH--KEVTQLRHS 360

Query:   122 LKTAEPVKLEFQKSKTEA--QNLVVAREELIAKVHQLTQDLQRAHTDVQQIPALLSELES 179
             ++  E ++L+ Q SK  A  ++L   +     ++ QL + ++       QI  +  E+E 
Sbjct:   361 IQEME-IELQSQLSKKSALEKSLEDTKNHYCGQLQQLQEQIRSLEG---QITEIRGEIEC 416

Query:   180 LRQEYH---HCRGTYEYEKKFYNDHLES----LQVMEKNYITMATEVEKLRAELMNAPNV 232
               QEY    + +   E E K Y   LE      +  E       +   + +  +  +   
Sbjct:   417 QNQEYSLLLNIKTRLEQEIKTYRSLLEGGQEDFESHESGQSHFGSGGSRQQGGIGGSHGR 476

Query:   233 DRRAADG-SYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTS 291
               R   G SYGG + +S   + G   G +    G G   G G      + G  G G  + 
Sbjct:   477 GSRGGSGGSYGGGS-SSGGGSGGSHGGGSGGSYGGGSSSGGGSGGRGGSGGSYGGGSGSG 535

Query:   292 TSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPT 351
               +  +   G+          G G   S G G  +          G SY    G G    
Sbjct:   536 GGSSGSYGGGSSSGGGSGGSHGGGSGGSYGGGSSSGGGSGGRGGSGGSYGGGSGSG---- 591

Query:   352 KGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPG 411
              G G   ++GS    + G +Y    G         G     G       G G  +    G
Sbjct:   592 GGRGGGCEEGSGSGGRSGGSYGGGSGSGGGSSCSYGGGSSSGGGSGGSYGGGSSSGGGSG 651

Query:   412 YDVQRGPVYE--AQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAP 469
                  G  Y   +          G G    RG G     A SY    G+G  G   G   
Sbjct:   652 GKGGSGCSYSGGSSSGGGSGGSYGGGSSSGRGSGGRGGSAGSYGGGSGSG--GGRGGGCE 709

Query:   470 HGQVPPPLNNVPYGSATPPARSGSGQPRGGN 500
              G      +   YG       SGSG   GG+
Sbjct:   710 EGSGSGGRSGGSYGGG-----SGSGGRSGGS 735


>MGI|MGI:88462
            symbol:Col7a1 "collagen, type VII, alpha 1" species:10090 "Mus
            musculus" [GO:0004867 "serine-type endopeptidase inhibitor
            activity" evidence=IEA] [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0005578 "proteinaceous extracellular matrix"
            evidence=IEA] [GO:0005581 "collagen" evidence=IEA] [GO:0005604
            "basement membrane" evidence=IDA] [GO:0007155 "cell adhesion"
            evidence=IEA] [GO:0010466 "negative regulation of peptidase
            activity" evidence=IEA] [GO:0030414 "peptidase inhibitor activity"
            evidence=IEA] InterPro:IPR002035 InterPro:IPR002223
            InterPro:IPR003961 Pfam:PF00014 Pfam:PF00041 Pfam:PF00092
            PRINTS:PR00759 PROSITE:PS50234 PROSITE:PS50279 PROSITE:PS50853
            SMART:SM00060 SMART:SM00327 MGI:MGI:88462 Gene3D:2.60.40.10
            InterPro:IPR013783 GO:GO:0004867 SUPFAM:SSF49265 GO:GO:0007155
            Gene3D:4.10.410.10 InterPro:IPR020901 SUPFAM:SSF57362
            PROSITE:PS00280 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            eggNOG:NOG12793 GO:GO:0005604 EMBL:AC174646 MEROPS:I02.967 CTD:1294
            HOGENOM:HOG000111866 HOVERGEN:HBG051053 KO:K16628 OMA:RRVCTTA
            OrthoDB:EOG4J117P EMBL:U32107 EMBL:S63654 IPI:IPI00134652
            PIR:A45748 RefSeq:NP_031764.2 UniGene:Mm.6200 HSSP:P12111
            ProteinModelPortal:Q63870 SMR:Q63870 STRING:Q63870
            PhosphoSite:Q63870 PaxDb:Q63870 PRIDE:Q63870
            Ensembl:ENSMUST00000026740 Ensembl:ENSMUST00000112070 GeneID:12836
            KEGG:mmu:12836 UCSC:uc009rrh.1 GeneTree:ENSGT00700000104250
            InParanoid:Q63870 NextBio:282356 Bgee:Q63870 CleanEx:MM_COL7A1
            Genevestigator:Q63870 Uniprot:Q63870
        Length = 2944

 Score = 127 (49.8 bits), Expect = 0.00067, P = 0.00067
 Identities = 86/270 (31%), Positives = 103/270 (38%)

Query:   256 PVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG-- 313
             P G    +   G P   GPP S    GV G+ P    S       G         P+G  
Sbjct:  1289 PPGSTQAKGERGFPGPEGPPGSPGLPGVPGS-PGIKGSTGRPGPRGEQGERGPQGPKGEP 1347

Query:   314 --PGY-EASKGPGYDASKAPSYDPTKGPSYDP-AKGPGYDP-TKGP-GYD--AQKGSNYD 365
               PG      GPG+   K    DP  GPS  P ++GP  DP  +GP G    + KG   D
Sbjct:  1348 GEPGQITGGGGPGFPGKKG---DP--GPSGPPGSRGPVGDPGPRGPPGLPGISVKGDKGD 1402

Query:   366 -AQRGP-NYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEA 422
               +RGP    I      DP  GL G     GP     R PG + ++  G     GP    
Sbjct:  1403 RGERGPPGPGIGASEQGDP--GLPGLPGSPGPQGPAGR-PGEKGEK--GDCEDGGPGLPG 1457

Query:   423 QRAPSYIPQ-RG-PGYDLQRG-QGYDMRRA-PSYDPSRG----TGFDGAPRGAAPHGQVP 474
             Q  P   P  RG PG    +G +G       P     RG     G  G P GAA H    
Sbjct:  1458 QPGPPGEPGLRGAPGMTGPKGDRGLTGTPGEPGVKGERGHPGPVGPQGLP-GAAGH---- 1512

Query:   475 PPLNNVPYGSATPPARSGS-GQP-RGGNPA 502
             P +   P G   P  R G  G+P R G+PA
Sbjct:  1513 PGVEG-PEGPPGPTGRRGEKGEPGRPGDPA 1541


>UNIPROTKB|F1MA98
            symbol:Tpr "Protein Tpr" species:10116 "Rattus norvegicus"
            [GO:0000122 "negative regulation of transcription from RNA
            polymerase II promoter" evidence=ISS] [GO:0000189 "MAPK import into
            nucleus" evidence=ISS] [GO:0000776 "kinetochore" evidence=ISS]
            [GO:0003682 "chromatin binding" evidence=ISS] [GO:0003729 "mRNA
            binding" evidence=ISS] [GO:0004828 "serine-tRNA ligase activity"
            evidence=IEA] [GO:0005487 "nucleocytoplasmic transporter activity"
            evidence=ISS] [GO:0005524 "ATP binding" evidence=IEA] [GO:0005635
            "nuclear envelope" evidence=ISS] [GO:0005643 "nuclear pore"
            evidence=ISS] [GO:0005737 "cytoplasm" evidence=ISS] [GO:0005868
            "cytoplasmic dynein complex" evidence=ISS] [GO:0006404 "RNA import
            into nucleus" evidence=ISS] [GO:0006405 "RNA export from nucleus"
            evidence=ISS] [GO:0006434 "seryl-tRNA aminoacylation" evidence=IEA]
            [GO:0006999 "nuclear pore organization" evidence=ISS] [GO:0007094
            "mitotic spindle assembly checkpoint" evidence=ISS] [GO:0010965
            "regulation of mitotic sister chromatid separation" evidence=ISS]
            [GO:0019898 "extrinsic to membrane" evidence=ISS] [GO:0031072 "heat
            shock protein binding" evidence=ISS] [GO:0031453 "positive
            regulation of heterochromatin assembly" evidence=ISS] [GO:0031965
            "nuclear membrane" evidence=IDA] [GO:0031990 "mRNA export from
            nucleus in response to heat stress" evidence=ISS] [GO:0034399
            "nuclear periphery" evidence=IDA] [GO:0034605 "cellular response to
            heat" evidence=ISS] [GO:0035457 "cellular response to
            interferon-alpha" evidence=ISS] [GO:0042307 "positive regulation of
            protein import into nucleus" evidence=ISS] [GO:0042405 "nuclear
            inclusion body" evidence=IDA] [GO:0042803 "protein homodimerization
            activity" evidence=ISS] [GO:0044615 "nuclear pore nuclear basket"
            evidence=IDA] [GO:0045947 "negative regulation of translational
            initiation" evidence=ISS] [GO:0046827 "positive regulation of
            protein export from nucleus" evidence=IMP] [GO:0046832 "negative
            regulation of RNA export from nucleus" evidence=ISS] [GO:0051019
            "mitogen-activated protein kinase binding" evidence=ISS]
            [GO:0070849 "response to epidermal growth factor stimulus"
            evidence=ISS] [GO:0072686 "mitotic spindle" evidence=ISS]
            [GO:0090267 "positive regulation of mitotic cell cycle spindle
            assembly checkpoint" evidence=ISS] [GO:0090316 "positive regulation
            of intracellular protein transport" evidence=ISS] [GO:1901673
            "regulation of spindle assembly involved in mitosis" evidence=ISS]
            [GO:0005215 "transporter activity" evidence=ISS] [GO:0006606
            "protein import into nucleus" evidence=ISS] [GO:0006611 "protein
            export from nucleus" evidence=ISS] [GO:0031647 "regulation of
            protein stability" evidence=ISS] [GO:0042306 "regulation of protein
            import into nucleus" evidence=IMP] [GO:0043495 "protein anchor"
            evidence=ISS] [GO:0043578 "nuclear matrix organization"
            evidence=ISS] [GO:0051292 "nuclear pore complex assembly"
            evidence=IMP] InterPro:IPR012929 InterPro:IPR015866 Pfam:PF07926
            RGD:1310664 GO:GO:0005524 GO:GO:0005737 GO:GO:0005643 GO:GO:0006606
            KO:K09291 InterPro:IPR009053 SUPFAM:SSF46579
            GeneTree:ENSGT00700000104019 GO:GO:0004828 GO:GO:0006434
            Gene3D:1.10.287.40 CTD:7175 IPI:IPI00950468 RefSeq:NP_001100655.1
            UniGene:Rn.58980 Ensembl:ENSRNOT00000063833 GeneID:304862
            KEGG:rno:304862 NextBio:653738 ArrayExpress:F1MA98 Uniprot:F1MA98
        Length = 2360

 Score = 124 (48.7 bits), Expect = 0.00067, Sum P(2) = 0.00067
 Identities = 44/186 (23%), Positives = 88/186 (47%)

Query:    48 PEVME-QKIASQ---HVE-MQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKS 102
             P+  E +K+ S+   H + +Q+L  E  RL A        L   Q+ +Q L   +  +++
Sbjct:  1349 PDTEEYRKLLSEKEIHTKRIQQLNEEVGRLKAEIARSNASLTNNQNLIQSLKEDLSKVRT 1408

Query:   103 ERELQMRNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQLTQDLQR 162
             E+E   ++L  KI  ++ ++KT   VK   ++ KT+ + L  A+++ +    Q + D Q 
Sbjct:  1409 EKESIQKDLDAKIIDIQEKVKTITQVKKIGRRYKTQFEELK-AQQKAMETSTQSSGDHQE 1467

Query:   163 AHTDVQQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKL 222
              H  VQ++  L   L     +     G  E  +K  ++     + +++    + +E+ +L
Sbjct:  1468 QHISVQEMQELKDNLSQSETKTKSLEGQVENLQKTLSEKETEARSLQEQTAQLQSELSRL 1527

Query:   223 RAELMN 228
             R EL +
Sbjct:  1528 RQELQD 1533

 Score = 53 (23.7 bits), Expect = 0.00067, Sum P(2) = 0.00067
 Identities = 19/61 (31%), Positives = 25/61 (40%)

Query:   243 GATGNSENETSGRPVGQNAYE--DGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQS 300
             G  G   NE +G   G + YE  D  G   G G  P   T   +G G  ++  A  +  S
Sbjct:  1979 GDEGEDSNEGTGSADGNDGYEADDAEG---GDGTDPGTETEESMG-GAESNQRAADSQNS 2034

Query:   301 G 301
             G
Sbjct:  2035 G 2035


>UNIPROTKB|I3LNI2
            symbol:TFG "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0043123 "positive regulation of I-kappaB
            kinase/NF-kappaB cascade" evidence=IEA] [GO:0042802 "identical
            protein binding" evidence=IEA] [GO:0004871 "signal transducer
            activity" evidence=IEA] GO:GO:0043123 GO:GO:0004871 OMA:YTTQTSQ
            GeneTree:ENSGT00510000047809 EMBL:CU928320 EMBL:AEMK01189642
            Ensembl:ENSSSCT00000026186 Uniprot:I3LNI2
        Length = 340

 Score = 116 (45.9 bits), Expect = 0.00067, P = 0.00067
 Identities = 75/301 (24%), Positives = 112/301 (37%)

Query:   216 ATEVEKLRAELMNAPNVDRRAAD-----GSYGGATGNSENET-SGRPVGQNAYEDGYGVP 269
             +++V+ LR EL+   N   R  D     G  G +T  +EN+T  GR   + A  D  G  
Sbjct:    38 SSQVKYLRRELIELRNKVNRLLDSLEPPGEPGPSTNITENDTVDGREE-KPAASDSSGKQ 96

Query:   270 QGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKA 329
                    S +    +      + +  +A   G         P  P  + S  P   AS +
Sbjct:    97 STQVMAASMSAFDPLKNQDEINKNVMSAF--GLTDDQVSGPPSAPAEDRSGTPDSIASSS 154

Query:   330 PSYDPTKGPSYDPAKGPGYDPTKGPGY-DAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGY 388
              +  P   P   P + P        G  + Q    Y  Q G      + P   PQ+  G 
Sbjct:   155 SAAHP---PGVQPQQPPYTGALTQAGQSEGQMYQQYPQQAGYGTQQPQAPPQPPQQS-GS 210

Query:   389 DMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYI--PQRGPGYDLQRGQGYDM 446
              + +G  Y  Q GP  + Q+  GY  Q  P  +A  AP++   PQ+ P    Q+ Q    
Sbjct:   211 SLSKG--YSQQTGP-QQPQQFQGYGQQ--PTSQAP-APAFSGQPQQMPAQPPQQYQASSY 264

Query:   447 R-RAPSYDPSRGTGFDGAPRGAAPHGQVP--PPLNNVPYGSATPPARSGSGQPRGGNPAR 503
               +  +   S+ T +  AP  A+  G  P  P       G   PP  + +  P G NP  
Sbjct:   265 PPQTYTTQTSQPTNYTVAP--ASQPGMAPSQPGAYQPRPGFTPPPGSTMTPLPSGSNPYA 322

Query:   504 R 504
             R
Sbjct:   323 R 323


>UNIPROTKB|A8E651
            symbol:EWSR1 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0005634 "nucleus" evidence=IEA] [GO:0008270 "zinc ion
            binding" evidence=IEA] [GO:0003676 "nucleic acid binding"
            evidence=IEA] [GO:0000166 "nucleotide binding" evidence=IEA]
            InterPro:IPR000504 InterPro:IPR001876 InterPro:IPR012677
            Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102 PROSITE:PS50199
            SMART:SM00360 SMART:SM00547 GO:GO:0005634 GO:GO:0000166
            GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676 eggNOG:NOG240581
            GeneTree:ENSGT00530000063105 CTD:2130 HOGENOM:HOG000038010
            HOVERGEN:HBG000970 KO:K13209 OMA:EGTSTGY OrthoDB:EOG42NJ15
            EMBL:DAAA02045602 EMBL:BC153844 IPI:IPI00871084
            RefSeq:NP_001103270.1 UniGene:Bt.33949 SMR:A8E651 STRING:A8E651
            Ensembl:ENSBTAT00000023612 GeneID:534073 KEGG:bta:534073
            InParanoid:A8E651 NextBio:20876260 Uniprot:A8E651
        Length = 655

 Score = 120 (47.3 bits), Expect = 0.00067, P = 0.00067
 Identities = 73/278 (26%), Positives = 99/278 (35%)

Query:   239 GSYGGATGNSENETSGRPV-GQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 291
             G+YG  T  S  +       GQ AY   YG P  G+  P  P A +  V G G    +T+
Sbjct:    42 GTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYSTPAAPQAYSQPVQGYGTGAYDTT 101

Query:   292 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG-YD 349
             T+    TQ+    ++AY   P  P Y   + P   A   P        +  P    G Y+
Sbjct:   102 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPARPQDGNKPAETSQPQSSTGGYN 159

Query:   350 -PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 406
              P+ G G   Q   +Y    G  P   +   PSY P     Y   +  +YD        T
Sbjct:   160 QPSLGYG---QSNYSYPQVPGSYPMQPVSAPPSYPPT---SYSSTQPTSYDQSSYSQQNT 213

Query:   407 QRVPG-YDVQRG----PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFD 461
                P  Y  Q        Y  Q   SY PQ G  Y     Q Y  +++ SY        D
Sbjct:   214 YGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTG-SYSQAPSQ-YS-QQSSSYGQQSSFRQD 270

Query:   462 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 499
               P     +GQ     +  P  + +       G+ RGG
Sbjct:   271 H-PSSMGVYGQESGGFSG-PGENRSMSGPDNRGRGRGG 306


>UNIPROTKB|Q01844
            symbol:EWSR1 "RNA-binding protein EWS" species:9606 "Homo
            sapiens" [GO:0000166 "nucleotide binding" evidence=IEA] [GO:0008270
            "zinc ion binding" evidence=IEA] [GO:0003723 "RNA binding"
            evidence=IEA] [GO:0005516 "calmodulin binding" evidence=IEA]
            [GO:0006351 "transcription, DNA-dependent" evidence=IEA]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0005737
            "cytoplasm" evidence=IEA] [GO:0005886 "plasma membrane"
            evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
            InterPro:IPR000504 InterPro:IPR001876 InterPro:IPR012677
            Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50096 PROSITE:PS50102
            PROSITE:PS50199 SMART:SM00360 SMART:SM00547 GO:GO:0005886
            GO:GO:0005634 GO:GO:0005737 GO:GO:0006355 GO:GO:0000166
            GO:GO:0046872 EMBL:CH471095 GO:GO:0008270 Gene3D:3.30.70.330
            GO:GO:0006351 GO:GO:0003723 EMBL:AC002059 MIM:612160 Orphanet:97338
            Pathway_Interaction_DB:bard1pathway eggNOG:NOG240581 EMBL:AL031186
            MIM:612219 Orphanet:319 EMBL:X66899 EMBL:X72990 EMBL:X72991
            EMBL:X72992 EMBL:X72993 EMBL:X72994 EMBL:X72995 EMBL:X72996
            EMBL:X72997 EMBL:X72998 EMBL:X72999 EMBL:X73000 EMBL:X73001
            EMBL:X73002 EMBL:X73003 EMBL:X73004 EMBL:Y07848 EMBL:CR456490
            EMBL:AK056309 EMBL:AK056681 EMBL:AC000026 EMBL:BC000527
            EMBL:BC004817 EMBL:BC011048 EMBL:BC072442 EMBL:Y08806 EMBL:AB016435
            IPI:IPI00065554 IPI:IPI00293254 IPI:IPI00335961 IPI:IPI00872855
            IPI:IPI00879259 PIR:A49358 RefSeq:NP_001156757.1
            RefSeq:NP_001156759.1 RefSeq:NP_005234.1 RefSeq:NP_053733.2
            UniGene:Hs.374477 PDB:2CPE PDBsum:2CPE ProteinModelPortal:Q01844
            SMR:Q01844 IntAct:Q01844 MINT:MINT-2858561 STRING:Q01844
            PhosphoSite:Q01844 DMDM:544261 PaxDb:Q01844 PRIDE:Q01844 DNASU:2130
            Ensembl:ENST00000332035 Ensembl:ENST00000333395
            Ensembl:ENST00000397938 Ensembl:ENST00000406548
            Ensembl:ENST00000414183 GeneID:2130 KEGG:hsa:2130 UCSC:uc003aet.3
            CTD:2130 GeneCards:GC22P029663 HGNC:HGNC:3508 HPA:CAB004230
            MIM:133450 neXtProt:NX_Q01844 Orphanet:83469 PharmGKB:PA27921
            HOGENOM:HOG000038010 HOVERGEN:HBG000970 KO:K13209 OMA:EGTSTGY
            OrthoDB:EOG42NJ15 PhylomeDB:Q01844 ChiTaRS:EWSR1
            EvolutionaryTrace:Q01844 GenomeRNAi:2130 NextBio:8605
            ArrayExpress:Q01844 Bgee:Q01844 CleanEx:HS_EWSR1
            Genevestigator:Q01844 GermOnline:ENSG00000182944 Uniprot:Q01844
        Length = 656

 Score = 120 (47.3 bits), Expect = 0.00068, P = 0.00067
 Identities = 75/279 (26%), Positives = 102/279 (36%)

Query:   239 GSYGGATGNSENETSGRPV-GQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 291
             G+YG  T  S  +       GQ AY   YG P  G+  P  P A +  V G G    +T+
Sbjct:    42 GTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTT 101

Query:   292 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPS--YDPTKGPSYDPAKGPGY 348
             T+    TQ+    ++AY   P  P Y   + P   A   P     PT+      + G GY
Sbjct:   102 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPTRPQDGNKPTETSQPQSSTG-GY 158

Query:   349 D-PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE 405
             + P+ G G   Q   +Y    G  P   +   PSY P     Y   +  +YD        
Sbjct:   159 NQPSLGYG---QSNYSYPQVPGSYPMQPVTAPPSYPPT---SYSSTQPTSYDQSSYSQQN 212

Query:   406 TQRVPG-YDVQRG----PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGF 460
             T   P  Y  Q        Y  Q   SY PQ G  Y     Q Y  +++ SY        
Sbjct:   213 TYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTG-SYSQAPSQ-YS-QQSSSYGQQSSFRQ 269

Query:   461 DGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 499
             D  P     +GQ     +  P  + +       G+ RGG
Sbjct:   270 DH-PSSMGVYGQESGGFSG-PGENRSMSGPDNRGRGRGG 306


>UNIPROTKB|F1LN98
            symbol:Ewsr1 "Protein Ewsr1" species:10116 "Rattus
            norvegicus" [GO:0000166 "nucleotide binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005622
            "intracellular" evidence=IEA] [GO:0008270 "zinc ion binding"
            evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
            InterPro:IPR012677 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102
            PROSITE:PS50199 SMART:SM00360 SMART:SM00547 RGD:1307258
            GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
            GO:GO:0005622 GeneTree:ENSGT00530000063105 IPI:IPI00364603
            Ensembl:ENSRNOT00000012634 ArrayExpress:F1LN98 Uniprot:F1LN98
        Length = 656

 Score = 120 (47.3 bits), Expect = 0.00068, P = 0.00067
 Identities = 74/278 (26%), Positives = 100/278 (35%)

Query:   239 GSYGGATGNSENET-SGRPVGQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 291
             G+YG  T  S  +  S    GQ AY   YG P  G+  P  P A +  V G G    +T+
Sbjct:    42 GTYGQPTDVSYTQAQSTATXGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTT 101

Query:   292 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG-YD 349
             T+    TQ+    ++AY   P  P Y   + P   A   P        +  P    G Y+
Sbjct:   102 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPTRPQDGNKPAETSQPQSSTGGYN 159

Query:   350 -PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 406
              P+ G G   Q   +Y    G  P   +   PSY P     Y   +  +YD        T
Sbjct:   160 QPSLGYG---QSNYSYPQVPGSYPMQPVTAPPSYPPT---SYSSSQPTSYDQSSYSQQNT 213

Query:   407 QRVPG-YDVQRG----PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFD 461
                P  Y  Q        Y  Q   SY PQ G  Y     Q Y  +++ SY        D
Sbjct:   214 YGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTG-SYSQAPSQ-YS-QQSSSYGQQSSFRQD 270

Query:   462 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 499
               P     +GQ     +  P  + +       G+ RGG
Sbjct:   271 H-PSSMGVYGQESGGFSG-PGENRSLSGPDNRGRGRGG 306


>ZFIN|ZDB-GENE-030131-6410
            symbol:tprb "translocated promoter region b (to
            activated MET oncogene)" species:7955 "Danio rerio" [GO:0006606
            "protein import into nucleus" evidence=IEA] [GO:0005643 "nuclear
            pore" evidence=IEA] InterPro:IPR012929 Pfam:PF07926
            ZFIN:ZDB-GENE-030131-6410 GO:GO:0005643 GO:GO:0006606 KO:K09291
            EMBL:BX323056 GeneTree:ENSGT00700000104019 HOGENOM:HOG000139431
            HOVERGEN:HBG009158 IPI:IPI00507729 RefSeq:NP_001025294.1
            UniGene:Dr.52426 Ensembl:ENSDART00000017941 GeneID:558883
            KEGG:dre:558883 CTD:558883 InParanoid:Q5RI09 OMA:RVSWEEQ
            NextBio:20882676 Uniprot:Q5RI09
        Length = 2352

 Score = 125 (49.1 bits), Expect = 0.00071, Sum P(4) = 0.00071
 Identities = 41/179 (22%), Positives = 75/179 (41%)

Query:    59 HVE-MQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQMRNLTEKIAK 117
             H++ +Q+L  E  RL A        L   Q ++Q L   +G +  ER+   ++   KI  
Sbjct:  1367 HLKRIQQLVEETGRLKADAARSSGSLTTLQSQVQNLRENLGKVMVERDNLKKDQEAKILD 1426

Query:   118 MEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVHQL-TQDLQRAHTDVQQIPALLSE 176
             ++ ++KT   VK   ++ KT+ + L V  E+L+A       QD +      Q++  L   
Sbjct:  1427 IQEKIKTITQVKKIGRRYKTQYEELKVEYEKLVAAAASAPAQDQEAQQASAQELQNLKES 1486

Query:   177 LESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRR 235
             L           G  E   +   +     +  ++    + TE+ +LR EL    + + R
Sbjct:  1487 LNQSETRIRELEGQLENLNRTVGEREMEARSAQEQASRLQTELTRLRQELQEKSSQEER 1545

 Score = 48 (22.0 bits), Expect = 0.00071, Sum P(4) = 0.00071
 Identities = 18/60 (30%), Positives = 27/60 (45%)

Query:   276 PSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPT 335
             P +T+ G+  A P+TS+   A+  S +P  A    PR    E S      +   P+  PT
Sbjct:  1810 PLSTSTGLWSATPSTSS---ASAVSASPGSALSKRPREEEQE-SMSADTQSQDEPNDSPT 1865

 Score = 46 (21.3 bits), Expect = 0.00071, Sum P(4) = 0.00071
 Identities = 13/37 (35%), Positives = 18/37 (48%)

Query:   469 PHGQVPPPLNNVPYGSATPP-ARSGSGQPRGGNPARR 504
             P     P  ++    S+ PP ARSGSG+   G+   R
Sbjct:  2297 PSTSQEPSSSSADTSSSQPPKARSGSGRQWTGSRGSR 2333

 Score = 45 (20.9 bits), Expect = 0.00071, Sum P(4) = 0.00071
 Identities = 37/153 (24%), Positives = 55/153 (35%)

Query:   325 DASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQK--GSNYDAQR---GPNYDIHRGPS 379
             D S   S D  +    D  +GP  DPT  PG + ++  G+    QR     +++ +    
Sbjct:  2004 DESNEESRDDNEAYEGDDTEGP--DPTD-PGTETEESLGATDSTQRMADSQSFESNTLEM 2060

Query:   380 YD-PQRGLGYDMQRGPNYDMQR-GPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYD 437
             ++ P         + P        P       P  ++  GP  + QR P+     G G  
Sbjct:  2061 FEVPVTSSAPRPPQSPRRPQHPLPPRLNILAAPAQEL--GPPAQVQRLPARRQSVGRGLQ 2118

Query:   438 LQRG-----QGY---DMRRAPSYD--PSRGTGF 460
             L  G     Q +   D R  PS    P R  GF
Sbjct:  2119 LASGMASSAQPFFEDDDRMVPSTPTLPLRSDGF 2151


>FB|FBgn0038642
            symbol:Muc91C "Mucin 91C" species:7227 "Drosophila
            melanogaster" [GO:0005201 "extracellular matrix structural
            constituent" evidence=ISM] [GO:0031012 "extracellular matrix"
            evidence=ISM] [GO:0022008 "neurogenesis" evidence=IMP]
            EMBL:AE014297 GO:GO:0022008 eggNOG:NOG12793 GO:GO:0031012
            GO:GO:0005201 GeneTree:ENSGT00700000104744 RefSeq:NP_650744.1
            UniGene:Dm.10760 EnsemblMetazoa:FBtr0083687 GeneID:42246
            KEGG:dme:Dmel_CG7709 UCSC:CG7709-RA CTD:42246 FlyBase:FBgn0038642
            InParanoid:Q9VE45 OMA:GPYPSAP PhylomeDB:Q9VE45 GenomeRNAi:42246
            NextBio:827869 ArrayExpress:Q9VE45 Bgee:Q9VE45 Uniprot:Q9VE45
        Length = 950

 Score = 130 (50.8 bits), Expect = 0.00071, Sum P(2) = 0.00071
 Identities = 68/278 (24%), Positives = 105/278 (37%)

Query:   224 AELMNAPNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGV 283
             + L +AP+  + ++ GS+  A  +S +  S       +Y         +  P S++++G 
Sbjct:   580 SSLYSAPS--KGSSGGSFQSAPSSSYSAPSASANSGGSYPSAPS--SSYSAPSSSSSSG- 634

Query:   284 VGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPA 343
              G   +  +S+Y+A  SG+     Y  P  P    S  P   A+   SY      SY  A
Sbjct:   635 -GPYASAPSSSYSAPSSGSNSGGPY--PAAPSSSYS-APSASANSGGSYPSAPSSSYS-A 689

Query:   344 KGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPG 403
               PG + + GP Y A   S+Y A   P+   + G  Y       Y     P+     G  
Sbjct:   690 PSPGSN-SGGP-YPAAPSSSYSA---PSPSANSGGPYASAPSSSYS---APSSSSNSGGP 741

Query:   404 Y-----ETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGT 458
             Y      +   P      G  Y +  + SY     P   L  G  Y    + SY     +
Sbjct:   742 YAAAPSSSYSAPSSSSSSGGPYPSAPSSSY---SAPSSSLSSGGPYPSAPSSSYAAPSPS 798

Query:   459 GFDGAPRGAAPHGQVPPPLN--NVPYGS-ATPPARSGS 493
                G P  AAP      P+   +  YG+ A+ P+ S S
Sbjct:   799 SNSGGPYPAAPSNSYSAPIAPPSSSYGAPASGPSPSFS 836

 Score = 38 (18.4 bits), Expect = 0.00071, Sum P(2) = 0.00071
 Identities = 8/19 (42%), Positives = 9/19 (47%)

Query:    28 VSGMRPPMPGAFPPFDMMP 46
             VS   PP  G  P F+  P
Sbjct:   142 VSSYLPPASGPAPSFNSAP 160


>ZFIN|ZDB-GENE-041221-3
            symbol:prnprs3 "prion protein, related sequence 3"
            species:7955 "Danio rerio" [GO:0005509 "calcium ion binding"
            evidence=IEA] [GO:0005544 "calcium-dependent phospholipid binding"
            evidence=IEA] [GO:0051260 "protein homooligomerization"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0048854
            "brain morphogenesis" evidence=IMP] [GO:0005886 "plasma membrane"
            evidence=IDA] [GO:0007156 "homophilic cell adhesion" evidence=IDA]
            [GO:0021731 "trigeminal motor nucleus development" evidence=IMP]
            [GO:0042981 "regulation of apoptotic process" evidence=IMP]
            InterPro:IPR001464 InterPro:IPR022416 ZFIN:ZDB-GENE-041221-3
            GO:GO:0005886 GO:GO:0042981 GO:GO:0051260 GO:GO:0005509
            GO:GO:0007156 GO:GO:0005544 PANTHER:PTHR10502 GO:GO:0048854
            Gene3D:1.10.790.10 SUPFAM:SSF54098 HOVERGEN:HBG056090 EMBL:AJ620614
            IPI:IPI00679275 RefSeq:NP_001013316.1 UniGene:Dr.162496
            UniGene:Dr.84038 ProteinModelPortal:Q5K4F8 GeneID:503702
            KEGG:dre:503702 CTD:503702 InParanoid:Q5K4F8 NextBio:20866258
            ArrayExpress:Q5K4F8 GO:GO:0021731 Uniprot:Q5K4F8
        Length = 567

 Score = 119 (46.9 bits), Expect = 0.00071, P = 0.00071
 Identities = 69/217 (31%), Positives = 92/217 (42%)

Query:   236 AADGSYGGATGNSENETSGRPVGQNAYEDGYG----VPQ--GHGPPPSATTAGVVGAGPN 289
             ++ G+ GG++ +S + +S +    +      G     PQ     PPP     G  G  P 
Sbjct:    43 SSSGNKGGSSSSSSSSSSSKGTSSHGTHTSPGNYPRQPQVPNQNPPPYP---GAGGGYPG 99

Query:   290 TSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYD 349
                   A +  G P + +Y  P   GY  ++G GY A     Y P +G  Y PA+G GY 
Sbjct:   100 QGRYPPAGSNPGYPNQGSY--PGRAGYP-NQG-GYPAQGG--Y-PAQG-GY-PAQG-GY- 148

Query:   350 PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRGPGYET-- 406
             P +G GY AQ G  Y AQ G     + G S  P +G GY  Q G P      G G  +  
Sbjct:   149 PAQG-GYPAQGG--YPAQGGYPQGNYPGRSGYPGQG-GYPAQGGYPGGASYPGAGAGSYP 204

Query:   407 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQG 443
              R PG +    PV  +   P Y P RG     Q G G
Sbjct:   205 NRYPGGNPY--PVGGSY--PGY-PVRGGSSPNQFGGG 236


>UNIPROTKB|F1NL02
            symbol:COL22A1 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0005198 "structural molecule activity"
            evidence=IEA] [GO:0005587 "collagen type IV" evidence=IEA]
            [GO:0030198 "extracellular matrix organization" evidence=IEA]
            [GO:0071230 "cellular response to amino acid stimulus"
            evidence=IEA] InterPro:IPR002035 Pfam:PF00092 PROSITE:PS50234
            SMART:SM00327 InterPro:IPR008985 SUPFAM:SSF49899 InterPro:IPR008160
            Pfam:PF01391 InterPro:IPR001791 SMART:SM00210
            GeneTree:ENSGT00700000104250 OMA:KRENGAQ EMBL:AADN02037495
            EMBL:AADN02037496 EMBL:AADN02037497 EMBL:AADN02037498
            IPI:IPI00577055 Ensembl:ENSGALT00000026109 Uniprot:F1NL02
        Length = 1588

 Score = 124 (48.7 bits), Expect = 0.00072, P = 0.00072
 Identities = 78/265 (29%), Positives = 99/265 (37%)

Query:   256 PVGQNAYEDGYGVPQGHGPPPSAT-TAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRG 313
             P G    E G   P G G PP      G +G  GP          ++G P  A    P G
Sbjct:  1248 PPGPRG-EPGATGPAGRGGPPGKDGDTGPIGPQGPRGLRGQPG--KNGLPGSAGEPGPAG 1304

Query:   314 -PGYEASKG-------PGYDASKAPSYDP-TKGP-SYDPAKG-PGYDPTKG----PGYDA 358
              PG + +KG       PG+   + P  DP  KGP   + A G PG   +KG    PG   
Sbjct:  1305 NPGPKGNKGENGSPGLPGFIGPRGPPGDPGEKGPPGKEGAPGKPGETGSKGERGEPGIKG 1364

Query:   359 QKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDM-QRGP-GYETQRVPGYDVQ 415
             +KG     Q+GP  +    P     +G  G     GP  D  Q GP G   Q  PG+   
Sbjct:  1365 EKGPQ--GQKGPPGE----PGIPGHKGHPGLMGPHGPPGDTGQVGPPGPPGQ--PGFPGP 1416

Query:   416 RG--PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQV 473
             RG  P  E  R    + Q      L     Y + + P   P+      G P    P G+ 
Sbjct:  1417 RGEPPSLETLRR---LIQEELAKQLDAKLAYLLAQIP---PAHVKASHGRPGPPGPPGKE 1470

Query:   474 PPPLNNVPYGSATPPARSGSGQPRG 498
               P    P G    P ++GS  P G
Sbjct:  1471 GLPGRTGPPGEPGRPGQTGSEGPPG 1495


>UNIPROTKB|P10163
            symbol:PRB4 "Basic salivary proline-rich protein 4"
            species:9606 "Homo sapiens" [GO:0005576 "extracellular region"
            evidence=NAS] [GO:0003674 "molecular_function" evidence=ND]
            [GO:0008150 "biological_process" evidence=ND] GO:GO:0005576
            InterPro:IPR026086 PANTHER:PTHR23203 EMBL:K03207 EMBL:X07882
            EMBL:X07715 EMBL:AC010176 EMBL:BC130386 EMBL:S80916 EMBL:X07704
            IPI:IPI00019482 PIR:S03176 RefSeq:NP_001248328.1 RefSeq:NP_002714.2
            UniGene:Hs.528651 DisProt:DP00119 STRING:P10163 DMDM:158517854
            PaxDb:P10163 PRIDE:P10163 GeneID:5545 KEGG:hsa:5545 CTD:5545
            GeneCards:GC12M011460 H-InvDB:HIX0079490 HGNC:HGNC:9340 MIM:180990
            neXtProt:NX_P10163 PharmGKB:PA33702 GenomeRNAi:5545 NextBio:21484
            CleanEx:HS_PRB4 Genevestigator:P10163 GermOnline:ENSG00000121335
            Uniprot:P10163
        Length = 310

 Score = 115 (45.5 bits), Expect = 0.00073, P = 0.00073
 Identities = 72/250 (28%), Positives = 92/250 (36%)

Query:   267 GVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGP-GYD 325
             G P+G  PP     +   G  P+         Q G   +     P  PG   S+ P G  
Sbjct:    76 GKPEGR-PPQGGNQSQ--GPPPHPGKPERPPPQGGNQSQGP---PPHPGKPESRPPQGGH 129

Query:   326 ASKAPSYDPTKGPSYDPAKG----PGYDPTKG-PGYDAQKGSNYDAQRGPNYDIHRG-PS 379
              S+ P   P K P   P +G     G  P  G P     +G N    +GP    H G P 
Sbjct:   130 QSQGPPPTPGK-PEGPPPQGGNQSQGTPPPPGKPEGRPPQGGNQS--QGP--PPHPGKPE 184

Query:   380 YDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQ 439
               P +G G    R P       PG + +R P    Q G   ++Q  P + P +  G   Q
Sbjct:   185 RPPPQG-GNQSHRPPP-----PPG-KPERPPP---QGGN--QSQGPPPH-PGKPEGPPPQ 231

Query:   440 RGQGYDMRRAPSYDPSRGTGFDG-APRGAAPHG--QVPPPLNNVPYGSATPPARSGSGQP 496
              G      R+P   P      +G  P+G  P G  Q PPP    P     PPA    G P
Sbjct:   232 EGNKSRSARSPPGKPQGPPQQEGNKPQGPPPPGKPQGPPPPGGNPQQPQAPPAGKPQGPP 291

Query:   497 ---RGGNPAR 503
                +GG P R
Sbjct:   292 PPPQGGRPPR 301


>MGI|MGI:1932491
            symbol:Prp2 "proline rich protein 2" species:10090 "Mus
            musculus" [GO:0003674 "molecular_function" evidence=ND] [GO:0005575
            "cellular_component" evidence=ND] [GO:0005576 "extracellular
            region" evidence=IEA] [GO:0008150 "biological_process" evidence=ND]
            MGI:MGI:1932491 GO:GO:0005576 InterPro:IPR026086 PANTHER:PTHR23203
            UniGene:Mm.425348 UniGene:Mm.484054 CleanEx:MM_PRH1 EMBL:M23236
            EMBL:M12100 EMBL:M19419 IPI:IPI00474263 IPI:IPI00855123 PIR:A28996
            PIR:D29149 UniGene:Mm.333439 Genevestigator:P05143
            GermOnline:ENSMUSG00000058295 Uniprot:P05143
        Length = 317

 Score = 115 (45.5 bits), Expect = 0.00076, P = 0.00076
 Identities = 67/242 (27%), Positives = 77/242 (31%)

Query:   267 GVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGP-GYD 325
             G P   GP P          GP            G   R     P  PG    + P G  
Sbjct:    79 GPPPPGGPQPRPPQGPPPPGGPQPRPPQGPPPPGGPQPRPPQG-PPPPGGPQPRPPQGPP 137

Query:   326 ASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRG 385
                 P   P +GP   P  GP   P +GP   A  G      +GP      GP   P +G
Sbjct:   138 PPGGPQQRPPQGPP--PPGGPQPRPPQGPPPPA--GPQPRPPQGPPPPA--GPHLRPTQG 191

Query:   386 ---LGYDMQRGPNYDMQRGPGYETQRVP-GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG 441
                 G   QR P       PG    R P G     GP     + P   P  GP    +  
Sbjct:   192 PPPTGGPQQRYPQSPPP--PGGPQPRPPQGPPPPGGPHPRPTQGP---PPTGP--QPRPT 244

Query:   442 QGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQ--PRGG 499
             QG      P   P +G    G P+   P G  PPP    P  +  P    G  Q  P  G
Sbjct:   245 QGPPPTGGPQQRPPQGPPPPGGPQPRPPQGP-PPPTGPQPRPTQGPHPTGGPQQTPPLAG 303

Query:   500 NP 501
             NP
Sbjct:   304 NP 305


>UNIPROTKB|F1SFA7
            symbol:COL1A2 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
            SMART:SM00038 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0005201 GeneTree:ENSGT00660000095287 EMBL:CU915372
            Ensembl:ENSSSCT00000016699 OMA:KGETGNK ArrayExpress:F1SFA7
            Uniprot:F1SFA7
        Length = 1366

 Score = 123 (48.4 bits), Expect = 0.00078, P = 0.00078
 Identities = 83/271 (30%), Positives = 99/271 (36%)

Query:   243 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPP-----PS-ATTAGVVGAGPNTSTSAYA 296
             GA G   N  +  P G    + G G     GPP     P  A TAG VG           
Sbjct:   520 GAPGPDGNNGAQGPPGPQGVQGGKGEQGPAGPPGFQGLPGPAGTAGEVGKPGERGIPG-- 577

Query:   297 ATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDPTKGPSYDPAKG-PGYDPTKG 353
               + G P  A     RGP G   + GP G   S+ PS  P  GP  D  KG PG      
Sbjct:   578 --EFGLPGPAGPRGERGPPGESGAAGPAGPIGSRGPSGPP--GP--DGNKGEPGV--LGA 629

Query:   354 PGYDAQKG-SNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVP- 410
             PG     G S    +RG    I  G     + GL  D+   P  D  RG PG      P 
Sbjct:   630 PGTAGPSGPSGLPGERGAA-GIPGGKGEKGETGLRGDVG-SPGRDGARGAPGAVGAPGPA 687

Query:   411 GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP--RGA- 467
             G +  RG    A  A    P+  PG   +RG+           P+   G  GA   RG  
Sbjct:   688 GANGDRGEAGPAGPAGPAGPRGSPG---ERGEVGPAGPNGFAGPAGAAGQPGAKGERGTK 744

Query:   468 APHGQVPPPLNNVPYGSATPPARSGSGQPRG 498
              P G+  P     P G+A P   +G   P G
Sbjct:   745 GPKGENGPVGPTGPVGAAGPAGPNGPPGPAG 775


>UNIPROTKB|Q9XSJ7
            symbol:COL1A1 "Collagen alpha-1(I) chain" species:9615
            "Canis lupus familiaris" [GO:0046872 "metal ion binding"
            evidence=IEA] [GO:0005581 "collagen" evidence=IEA] [GO:0005201
            "extracellular matrix structural constituent" evidence=IEA]
            InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
            SMART:SM00038 SMART:SM00214 GO:GO:0046872 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793 GO:GO:0005201
            CTD:1277 HOGENOM:HOG000085654 HOVERGEN:HBG004933 KO:K06236
            OrthoDB:EOG4S4PHP EMBL:AF153062 RefSeq:NP_001003090.1
            UniGene:Cfa.100 STRING:Q9XSJ7 GeneID:403651 KEGG:cfa:403651
            InParanoid:Q9XSJ7 NextBio:20817156 Uniprot:Q9XSJ7
        Length = 1460

 Score = 123 (48.4 bits), Expect = 0.00084, P = 0.00084
 Identities = 86/288 (29%), Positives = 101/288 (35%)

Query:   230 PNVDRR-AADGSYG--GATGNSENETSGRPVGQNAYEDGYGVP-----QGHGPPPSATT- 280
             P  D +  A G  G  GA G++       P G        G P     +G   PP AT  
Sbjct:   813 PGADGQPGAKGEPGDAGAKGDAGPPGPAGPTGPPGPIGNVGAPGPKGARGSAGPPGATGF 872

Query:   281 AGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKG 337
              G  G  GP    S  A    G P  A  +   G G     GP G      P   P   G
Sbjct:   873 PGAAGRVGP-PGPSGNAGPP-GPPGPAGKE--GGKGARGETGPAGRPGEVGPPGPPGPAG 928

Query:   338 PSYDP-AKGPGYDP-TKGP-GYDAQKGS-NYDAQRGPN-YDIHRGPSYDP-QRG-LGYDM 390
                 P A GP   P T GP G   Q+G      QRG   +    GPS +P ++G  G   
Sbjct:   929 EKGSPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQGPSGTSG 988

Query:   391 QRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAP 450
             +RGP   M  GP       PG     GP  E+ R  S   +  PG D   G   D     
Sbjct:   989 ERGPPGPM--GP-------PGL---AGPPGESGREGSPGAEGSPGRDGSPGPKGDRGETG 1036

Query:   451 SYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 498
                P    G  GAP    P G+        P G A P    G+  P G
Sbjct:  1037 PAGPPGAPGAPGAPGPVGPAGKNGDRGETGPAGPAGPIGPVGARGPAG 1084


>UNIPROTKB|F1Q3I5
            symbol:COL1A1 "Collagen alpha-1(I) chain" species:9615
            "Canis lupus familiaris" [GO:0005581 "collagen" evidence=IEA]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA] InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093
            Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184
            PROSITE:PS51461 SMART:SM00038 SMART:SM00214 GO:GO:0005581
            InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201
            GeneTree:ENSGT00660000095287 OMA:VAYMDQQ EMBL:AAEX03006535
            Ensembl:ENSCAFT00000026953 Uniprot:F1Q3I5
        Length = 1464

 Score = 123 (48.4 bits), Expect = 0.00084, P = 0.00084
 Identities = 86/288 (29%), Positives = 101/288 (35%)

Query:   230 PNVDRR-AADGSYG--GATGNSENETSGRPVGQNAYEDGYGVP-----QGHGPPPSATT- 280
             P  D +  A G  G  GA G++       P G        G P     +G   PP AT  
Sbjct:   817 PGADGQPGAKGEPGDAGAKGDAGPPGPAGPTGPPGPIGNVGAPGPKGARGSAGPPGATGF 876

Query:   281 AGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKG 337
              G  G  GP    S  A    G P  A  +   G G     GP G      P   P   G
Sbjct:   877 PGAAGRVGP-PGPSGNAGPP-GPPGPAGKE--GGKGARGETGPAGRPGEVGPPGPPGPAG 932

Query:   338 PSYDP-AKGPGYDP-TKGP-GYDAQKGS-NYDAQRGPN-YDIHRGPSYDP-QRG-LGYDM 390
                 P A GP   P T GP G   Q+G      QRG   +    GPS +P ++G  G   
Sbjct:   933 EKGSPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQGPSGASG 992

Query:   391 QRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAP 450
             +RGP   M  GP       PG     GP  E+ R  S   +  PG D   G   D     
Sbjct:   993 ERGPPGPM--GP-------PGL---AGPPGESGREGSPGAEGSPGRDGSPGPKGDRGETG 1040

Query:   451 SYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 498
                P    G  GAP    P G+        P G A P    G+  P G
Sbjct:  1041 PAGPPGAPGAPGAPGPVGPAGKNGDRGETGPAGPAGPIGPVGARGPAG 1088


>UNIPROTKB|E1B7H2
            symbol:COL7A1 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0004867 "serine-type endopeptidase inhibitor activity"
            evidence=IEA] InterPro:IPR002035 InterPro:IPR002223
            InterPro:IPR003961 Pfam:PF00014 Pfam:PF00041 Pfam:PF00092
            PRINTS:PR00759 PROSITE:PS50234 PROSITE:PS50279 PROSITE:PS50853
            SMART:SM00060 SMART:SM00327 Gene3D:2.60.40.10 InterPro:IPR013783
            GO:GO:0004867 SUPFAM:SSF49265 Gene3D:4.10.410.10 InterPro:IPR020901
            SUPFAM:SSF57362 PROSITE:PS00280 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0005604 OMA:RRVCTTA GeneTree:ENSGT00700000104301
            EMBL:DAAA02054431 IPI:IPI00706167 Ensembl:ENSBTAT00000025418
            Uniprot:E1B7H2
        Length = 2934

 Score = 126 (49.4 bits), Expect = 0.00085, P = 0.00085
 Identities = 83/296 (28%), Positives = 107/296 (36%)

Query:   230 PNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPN 289
             P +  +A +    GA G           G    E   G P   GP       G  GA   
Sbjct:  1642 PGLPGKAGERGLRGAPGARGPVGEKGDQGDPGEEGRNGSPGPSGPKGDRGEPGPPGAPGR 1701

Query:   290 TSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPS-YDPTKGPSYDPA-KGP- 346
                    A + G P     + PRGP  +    PG    +  S      GP  DP  +GP 
Sbjct:  1702 LVDVGLGAGEKGEPGDRGQEGPRGPKGDPGP-PGASGERGVSGLRGPPGPQGDPGVRGPA 1760

Query:   347 GYDPTKGP-GYDAQKGSN----YDAQRGPNYDIHRGPSYDPQR-GL-GYDMQRGPNYDMQ 399
             G    +GP G D + G +         GPN  +  G + DP R GL G   ++GP     
Sbjct:  1761 GEKGDRGPPGLDGRSGLDGKPGASGPPGPNGAM--GKAGDPGRDGLPGLRGEQGP--PGP 1816

Query:   400 RGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYD----PS 455
              GP     + PG D + G +      P       PG D ++G+  D   AP  +    P 
Sbjct:  1817 AGPPGAPGK-PGEDGKPG-LNGKNGEPG-----DPGEDGRKGEKGDSG-APGREGRDGPK 1868

Query:   456 RGTGFDGAPRGAAPHG---QVPPPLNNVPY--GSATPPARSGSGQPRG--GNPARR 504
                G  G+P    P G   QV PP    P   GS+ P    G   PRG  G P  R
Sbjct:  1869 GERGAPGSPGLQGPPGLPGQVGPPGQGFPGVPGSSGPKGDRGETGPRGEQGLPGER 1924


>MGI|MGI:88458
            symbol:Col5a2 "collagen, type V, alpha 2" species:10090 "Mus
            musculus" [GO:0001501 "skeletal system development" evidence=IMP]
            [GO:0005201 "extracellular matrix structural constituent"
            evidence=IEA] [GO:0005576 "extracellular region" evidence=IEA]
            [GO:0005578 "proteinaceous extracellular matrix" evidence=IEA]
            [GO:0005581 "collagen" evidence=IDA] [GO:0005588 "collagen type V"
            evidence=ISO] [GO:0030199 "collagen fibril organization"
            evidence=ISO;IMP] [GO:0043588 "skin development" evidence=ISO;IMP]
            [GO:0046332 "SMAD binding" evidence=IPI] [GO:0046872 "metal ion
            binding" evidence=IEA] [GO:0048592 "eye morphogenesis"
            evidence=ISO] [GO:0071230 "cellular response to amino acid
            stimulus" evidence=IDA] InterPro:IPR000885 InterPro:IPR001007
            Pfam:PF00093 Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208
            PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038 SMART:SM00214
            MGI:MGI:88458 GO:GO:0043588 GO:GO:0046872 GO:GO:0030199
            GO:GO:0001501 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
            eggNOG:NOG12793 GO:GO:0071230 GO:GO:0005201 GO:GO:0048592
            GeneTree:ENSGT00660000095287 HOVERGEN:HBG004933 KO:K06236
            GO:GO:0005588 CTD:1290 OMA:PDHKPVW OrthoDB:EOG4K0QMS ChiTaRS:COL5A2
            EMBL:L02918 EMBL:AK132413 EMBL:AK139130 EMBL:AK147220 EMBL:AK147328
            EMBL:AK151929 EMBL:AK160008 EMBL:BC043696 EMBL:BC055077
            IPI:IPI00121120 PIR:I49607 RefSeq:NP_031763.2 UniGene:Mm.10299
            ProteinModelPortal:Q3U962 SMR:Q3U962 STRING:Q3U962
            PhosphoSite:Q3U962 PRIDE:Q3U962 Ensembl:ENSMUST00000086430
            GeneID:12832 KEGG:mmu:12832 UCSC:uc007awr.1 InParanoid:Q3U962
            NextBio:282338 Bgee:Q3U962 CleanEx:MM_COL5A2 Genevestigator:Q3U962
            Uniprot:Q3U962
        Length = 1497

 Score = 123 (48.4 bits), Expect = 0.00087, P = 0.00086
 Identities = 88/296 (29%), Positives = 111/296 (37%)

Query:   232 VDRRAADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGP 288
             +  + A+G+ G  GA G         P G    E G   P+G  GPP S    G  G   
Sbjct:   780 IGEKGAEGTAGNDGARGLPGPLGPPGPAGPTG-EKGEPGPRGLVGPPGSRGNPGSRGENG 838

Query:   289 NTSTSAYAATQS--GTP-MRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAK 344
              T    +A  Q   G P ++     P   G   S GP G   S  P + P   P     +
Sbjct:   839 PTGAVGFAGPQGPDGQPGVKGEPGEPGQKGDAGSPGPQGLAGSPGP-HGPHGVPGLKGGR 897

Query:   345 GPGYDP--TKGPGYDAQKGSNYDAQR-GPNYDIHRGPSYDPQR----GLGYDM-QRGPNY 396
             G    P  T  PG   + G    A   GP      GP+ +P +    GL  D    G   
Sbjct:   898 GTQGPPGATGFPGSAGRVGPPGPAGAPGP-----AGPAGEPGKEGPPGLRGDPGSHGRVG 952

Query:   397 DM-QRGP-GYETQRV-PGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAP 450
             D    GP G    +  PG D Q GP  +    P+    QRG  G   QRG+ G      P
Sbjct:   953 DRGPAGPPGSPGDKGDPGEDGQPGP--DGPPGPAGTTGQRGIVGMPGQRGERGMPGLPGP 1010

Query:   451 SYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP-RGGNPARR 504
             +  P +  G  GA     P G V PP +N P G   P   +G+ G P R G    R
Sbjct:  1011 AGTPGK-VGPTGATGDKGPPGPVGPPGSNGPVGEPGPEGPAGNDGTPGRDGAVGER 1065


>UNIPROTKB|B4DLD3
            symbol:SS18 "cDNA FLJ58120, highly similar to SSXT protein"
            species:9606 "Homo sapiens" [GO:0000226 "microtubule cytoskeleton
            organization" evidence=IEA] [GO:0000902 "cell morphogenesis"
            evidence=IEA] [GO:0005881 "cytoplasmic microtubule" evidence=IEA]
            [GO:0007243 "intracellular protein kinase cascade" evidence=IEA]
            [GO:0042493 "response to drug" evidence=IEA] [GO:0048013 "ephrin
            receptor signaling pathway" evidence=IEA] GO:GO:0000226
            GO:GO:0042493 GO:GO:0007243 GO:GO:0000902 GO:GO:0048013
            GO:GO:0005881 HOVERGEN:HBG003892 InterPro:IPR007726
            PANTHER:PTHR23107 Pfam:PF05030 UniGene:Hs.129261 EMBL:AC091021
            HGNC:HGNC:11340 ChiTaRS:SS18 EMBL:AK296949 IPI:IPI01011245
            STRING:B4DLD3 Ensembl:ENST00000542420 Uniprot:B4DLD3
        Length = 395

 Score = 116 (45.9 bits), Expect = 0.00087, P = 0.00087
 Identities = 66/236 (27%), Positives = 88/236 (37%)

Query:   239 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQG---H--GPPPSATTAGVVGAGPNT--- 290
             G+YG     S     G  + Q      Y +PQG   H  G  P     G V  G +    
Sbjct:   165 GNYGPRPNMSMQPNQGPMMHQQPPSQQYNMPQGGGQHYQGQQPPMGMMGQVNQGNHMMGQ 224

Query:   291 -STSAYAATQSGTPMR-AAYDIPRGPGYE-ASKGP--GYDASKAPSYDPTKG---PSYDP 342
                  Y   Q G P + +  +   G  Y    +GP  G +    P      G   PSY P
Sbjct:   225 RQIPPYRPPQQGPPQQYSGQEDYYGDQYSHGGQGPPEGMNQQYYPDGHNDYGYQQPSY-P 283

Query:   343 AKGPGYD-PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 401
              +G  YD P +       +G N  +Q G   D ++GP   PQ+G     Q+ P      G
Sbjct:   284 EQG--YDRPYEDSSQHYYEGGN--SQYGQQQDAYQGPP--PQQGYPPQQQQYPGQQGYPG 337

Query:   402 P--GY-ETQRVPGYDVQRGPVYEAQRAPSYIP-QRGPGYD-LQRGQGYDMRRAPSY 452
                GY  +Q  PG      P  + Q+   Y P Q GP     QR  GYD  +  +Y
Sbjct:   338 QQQGYGPSQGGPGPQYPNYPQGQGQQYGGYRPTQPGPPQPPQQRPYGYDQGQYGNY 393


>UNIPROTKB|J9P0I3
            symbol:ZNF768 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0005622 "intracellular" evidence=IEA] [GO:0003676 "nucleic acid
            binding" evidence=IEA] Pfam:PF00096 InterPro:IPR007087
            InterPro:IPR013087 InterPro:IPR015880 PROSITE:PS00028
            PROSITE:PS50157 SMART:SM00355 GO:GO:0008270 GO:GO:0003676
            GO:GO:0005622 Gene3D:3.30.160.60 GeneTree:ENSGT00700000104520
            KO:K09228 CTD:79724 OMA:SRYESQN EMBL:AAEX03004391
            RefSeq:XP_547025.2 Ensembl:ENSCAFT00000045233 GeneID:489906
            KEGG:cfa:489906 Uniprot:J9P0I3
        Length = 554

 Score = 118 (46.6 bits), Expect = 0.00089, P = 0.00089
 Identities = 27/71 (38%), Positives = 42/71 (59%)

Query:   303 PMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQ--- 359
             P    Y+ P+ PGYE  + PGY+  K+P Y+P K P Y+P + PGY+ ++ PGY+ Q   
Sbjct:   116 PQSPRYE-PQSPGYEP-RSPGYEP-KSPGYEP-KSPGYEP-RSPGYE-SQSPGYEPQNPE 169

Query:   360 ---KGSNYDAQ 367
                +   ++AQ
Sbjct:   170 FKTQSPEFEAQ 180


>UNIPROTKB|E1BYQ6
            symbol:TPR "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0006606 "protein import into nucleus" evidence=IEA]
            [GO:0000776 "kinetochore" evidence=IEA] [GO:0005643 "nuclear pore"
            evidence=IEA] [GO:0007094 "mitotic spindle assembly checkpoint"
            evidence=IEA] [GO:0031965 "nuclear membrane" evidence=IEA]
            InterPro:IPR012929 Pfam:PF07926 GO:GO:0000776 GO:GO:0007094
            GO:GO:0031965 GO:GO:0005643 GO:GO:0006606 KO:K09291
            GeneTree:ENSGT00700000104019 CTD:7175 OMA:RFIRREK EMBL:AADN02061595
            IPI:IPI00591857 RefSeq:XP_422300.2 UniGene:Gga.14251
            Ensembl:ENSGALT00000008185 GeneID:424457 KEGG:gga:424457
            NextBio:20826784 Uniprot:E1BYQ6
        Length = 2368

 Score = 119 (46.9 bits), Expect = 0.00089, Sum P(2) = 0.00089
 Identities = 36/179 (20%), Positives = 83/179 (46%)

Query:    49 EVMEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKSERELQM 108
             +++ +K A+    +Q+++ E  RL A        L  +Q+ LQ L  ++  +++E+E   
Sbjct:  1359 KLLSEKEANTK-RIQQMSEETGRLKAEIARTTASLTTSQNLLQNLKDEVAKIRTEKETLQ 1417

Query:   109 RNLTEKIAKMEAELKTAEPVKLEFQKSKTEAQNLVVAREELIAKVH-QLTQDLQRAHTDV 167
             + L  K+A ++ ++KT   VK   ++ KT+ + L    ++++A+   Q   + Q     V
Sbjct:  1418 KELDAKVADIQEKVKTITQVKKIGRRYKTQYEELKAQHDKMVAEAATQSFVEQQEEQVSV 1477

Query:   168 QQIPALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAEL 226
             Q++  L   L     +        E  +K   +     + +++    + +E+ + R +L
Sbjct:  1478 QEVQELKDSLSQAEGKTKTLENQVENLQKTVAEKETEARNLQEQISQLQSELARFRQDL 1536

 Score = 57 (25.1 bits), Expect = 0.00089, Sum P(2) = 0.00089
 Identities = 26/103 (25%), Positives = 37/103 (35%)

Query:   243 GATGNSENETSGRPVGQNAYE--DGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQ- 299
             G  G+  NE +G   G + YE  D  G     G  P   T   +G G +   +A +    
Sbjct:  1986 GDEGDDSNEGTGSADGNDGYEADDAEGAD---GTDPGTETEESLGGGESNQRAADSQNSC 2042

Query:   300 SGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDP 342
              G+   A    P     E    P   AS+  +  P + P   P
Sbjct:  2043 EGSTSTAESTFPHESSREQQ--PS-SASERQAPRPPQSPRRPP 2082


>WB|WBGene00000653
            symbol:col-77 species:6239 "Caenorhabditis elegans"
            [GO:0042302 "structural constituent of cuticle" evidence=IEA]
            [GO:0016021 "integral to membrane" evidence=IEA] InterPro:IPR002486
            Pfam:PF01484 SMART:SM01088 InterPro:IPR008160 Pfam:PF01391
            EMBL:Z66498 GO:GO:0042302 HOGENOM:HOG000085656
            GeneTree:ENSGT00610000086159 PIR:T23801 RefSeq:NP_495759.1
            ProteinModelPortal:Q21562 DIP:DIP-26119N MINT:MINT-1050309
            STRING:Q21562 EnsemblMetazoa:M195.1 GeneID:174336
            KEGG:cel:CELE_M195.1 UCSC:M195.1 CTD:174336 WormBase:M195.1
            eggNOG:NOG315089 InParanoid:Q21562 OMA:IAFFGIC NextBio:883606
            Uniprot:Q21562
        Length = 304

 Score = 114 (45.2 bits), Expect = 0.00090, P = 0.00090
 Identities = 71/238 (29%), Positives = 87/238 (36%)

Query:   265 GYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG-PGYEASKGPG 323
             GYG P  +    + +  G    G   S  +  A   GTP     D   G PG +   G  
Sbjct:    85 GYGAPAEYSTDAAVSAGGSEAGGQCCSCGSGPAGPPGTPGEDGRDGNDGQPGPDGQPGSD 144

Query:   324 YDASKAPSYDPTKGPSYD-PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDP 382
               A   P+ D      +D PA  PG     GP     KG+  +A   P  D   G    P
Sbjct:   145 APAEAIPTADDF---CFDCPAGPPGPAGNAGP-----KGAPGNAG-APGNDGQAGAPGAP 195

Query:   383 QRGLGYDMQRGP-NYDMQRG-PGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQR 440
                 G D  +GP   D   G PG + Q  PG  V+   V      P   PQ  PG D Q 
Sbjct:   196 ----GNDGPQGPPGQDGAAGQPGPDGQ--PGV-VEEVAVPAGPPGPPG-PQGAPGTDGQP 247

Query:   441 GQ-GYDMRRAPSYDPSRGTGFDGAP--RGAA-PHGQVPPPLNNVPYGSATPPARSGSG 494
             G  G   +  P   P+   G DGAP   GAA   G+   P          PP R+  G
Sbjct:   248 GSAGQPGQDGPQ-GPAGDAGTDGAPGQAGAAGEQGEAGQPGEGGGCDHCPPP-RTAPG 303


>UNIPROTKB|Q15532
            symbol:SS18 "Protein SSXT" species:9606 "Homo sapiens"
            [GO:0006351 "transcription, DNA-dependent" evidence=IEA]
            [GO:0000226 "microtubule cytoskeleton organization" evidence=IEA]
            [GO:0000902 "cell morphogenesis" evidence=IEA] [GO:0005881
            "cytoplasmic microtubule" evidence=IEA] [GO:0007243 "intracellular
            protein kinase cascade" evidence=IEA] [GO:0042493 "response to
            drug" evidence=IEA] [GO:0048013 "ephrin receptor signaling pathway"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0030374
            "ligand-dependent nuclear receptor transcription coactivator
            activity" evidence=IDA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0045944 "positive regulation of transcription from RNA
            polymerase II promoter" evidence=IDA] GO:GO:0005634 GO:GO:0000226
            GO:GO:0042493 GO:GO:0045944 GO:GO:0007243 GO:GO:0006351
            EMBL:CH471088 GO:GO:0000902 Orphanet:3273 GO:GO:0048013
            GO:GO:0005881 GO:GO:0030374 HOVERGEN:HBG003892 InterPro:IPR007726
            PANTHER:PTHR23107 Pfam:PF05030 UniGene:Hs.129261 EMBL:X79200
            EMBL:S79894 EMBL:X79201 EMBL:AF343880 EMBL:EF445031 EMBL:BC096223
            IPI:IPI00452919 IPI:IPI00940186 PIR:S46269 RefSeq:NP_001007560.1
            RefSeq:NP_005628.2 ProteinModelPortal:Q15532 IntAct:Q15532
            STRING:Q15532 PhosphoSite:Q15532 DMDM:20141795 PaxDb:Q15532
            PRIDE:Q15532 DNASU:6760 Ensembl:ENST00000269137
            Ensembl:ENST00000415083 GeneID:6760 KEGG:hsa:6760 UCSC:uc002kvm.3
            CTD:6760 GeneCards:GC18M023596 HGNC:HGNC:11340 MIM:600192
            neXtProt:NX_Q15532 PharmGKB:PA36164 eggNOG:NOG274014
            InParanoid:Q15532 KO:K15623 OrthoDB:EOG4RFKTH PhylomeDB:Q15532
            ChiTaRS:SS18 GenomeRNAi:6760 NextBio:26388 ArrayExpress:Q15532
            Bgee:Q15532 CleanEx:HS_SS18 Genevestigator:Q15532
            GermOnline:ENSG00000141380 Uniprot:Q15532
        Length = 418

 Score = 116 (45.9 bits), Expect = 0.00096, P = 0.00096
 Identities = 66/236 (27%), Positives = 88/236 (37%)

Query:   239 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQG---H--GPPPSATTAGVVGAGPNT--- 290
             G+YG     S     G  + Q      Y +PQG   H  G  P     G V  G +    
Sbjct:   188 GNYGPRPNMSMQPNQGPMMHQQPPSQQYNMPQGGGQHYQGQQPPMGMMGQVNQGNHMMGQ 247

Query:   291 -STSAYAATQSGTPMR-AAYDIPRGPGYE-ASKGP--GYDASKAPSYDPTKG---PSYDP 342
                  Y   Q G P + +  +   G  Y    +GP  G +    P      G   PSY P
Sbjct:   248 RQIPPYRPPQQGPPQQYSGQEDYYGDQYSHGGQGPPEGMNQQYYPDGHNDYGYQQPSY-P 306

Query:   343 AKGPGYD-PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 401
              +G  YD P +       +G N  +Q G   D ++GP   PQ+G     Q+ P      G
Sbjct:   307 EQG--YDRPYEDSSQHYYEGGN--SQYGQQQDAYQGPP--PQQGYPPQQQQYPGQQGYPG 360

Query:   402 P--GY-ETQRVPGYDVQRGPVYEAQRAPSYIP-QRGPGYD-LQRGQGYDMRRAPSY 452
                GY  +Q  PG      P  + Q+   Y P Q GP     QR  GYD  +  +Y
Sbjct:   361 QQQGYGPSQGGPGPQYPNYPQGQGQQYGGYRPTQPGPPQPPQQRPYGYDQGQYGNY 416


>WB|WBGene00000627
            symbol:col-50 species:6239 "Caenorhabditis elegans"
            [GO:0042302 "structural constituent of cuticle" evidence=IEA]
            [GO:0016021 "integral to membrane" evidence=IEA] InterPro:IPR002486
            Pfam:PF01484 SMART:SM01088 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0042302 HOGENOM:HOG000085656 GeneTree:ENSGT00530000064217
            EMBL:FO080999 PIR:T15142 RefSeq:NP_491194.1 UniGene:Cel.16665
            ProteinModelPortal:O01662 EnsemblMetazoa:T28F2.6 GeneID:189050
            KEGG:cel:CELE_T28F2.6 UCSC:T28F2.6 CTD:189050 WormBase:T28F2.6
            eggNOG:NOG279371 InParanoid:O01662 OMA:AGNCITC NextBio:941028
            Uniprot:O01662
        Length = 418

 Score = 116 (45.9 bits), Expect = 0.00096, P = 0.00096
 Identities = 89/355 (25%), Positives = 124/355 (34%)

Query:   172 ALLSELESLRQEYHHCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNAPN 231
             +++ +L+S   ++      ++++    ND    + V  +    + +  E+L  E +    
Sbjct:    29 SIIGDLQSFETDFVDDMSAFKHKA---NDAWSQMMVARR----VESPTERLAFEGLFG-R 80

Query:   232 VDRRAADG-SYGGATGNSEN--ETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVG-- 285
             V R+ A G +   AT   E   E +G   G        G P G  GPP  A   G  G  
Sbjct:    81 VKRQYAGGDAAAAATPAKEGYAEGAGGGGGCQCAAQASGCPAGPPGPPGEAGADGEPGEA 140

Query:   286 -----AGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKA-PSYDPT---- 335
                  AG   S   YA   +G  +      P  PG + + GP   A  A P  +      
Sbjct:   141 GQDGAAGEAGSADTYAGA-AGNCITCPAGPPGPPGPDGNAGPAGPAGAAGPDGEGAGYAE 199

Query:   336 KGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRG-PNYDIHRGPSYDPQRGLGYDMQRG 393
              GP+  PA  PG D   G PG D Q G+        P      GP   P    G D    
Sbjct:   200 PGPA-GPAGPPGPDGQPGAPGPDGQPGAGGTTSTNQPGPPGPAGPP-GPAGPAGEDAYAQ 257

Query:   394 PNYDMQRGP----GYETQR-------VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQ 442
             P+     GP    G + +         PG D   GP  +A   P      G G   + G 
Sbjct:   258 PSPAGTPGPPGPPGKDGEAGPDGPAGAPGTDGAPGP--DAAYCPCPPRTLGAGAYPEGGD 315

Query:   443 GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNV-PYGSATPPARSGSGQP 496
                   A  YD   G   + AP  AA     P P     P G     A +G+  P
Sbjct:   316 AAAAAPAGGYDGGAGAAPEAAPAAAAAPQPAPAPAAAAAPAGGYQGGAAAGAAAP 370


>MGI|MGI:3040693
            symbol:Zmiz1 "zinc finger, MIZ-type containing 1"
            species:10090 "Mus musculus" [GO:0001570 "vasculogenesis"
            evidence=IMP] [GO:0001701 "in utero embryonic development"
            evidence=IMP] [GO:0003007 "heart morphogenesis" evidence=IMP]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0005737 "cytoplasm"
            evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0007296 "vitellogenesis"
            evidence=IMP] [GO:0007569 "cell aging" evidence=IDA] [GO:0008270
            "zinc ion binding" evidence=IEA] [GO:0045944 "positive regulation
            of transcription from RNA polymerase II promoter" evidence=IMP]
            [GO:0046872 "metal ion binding" evidence=IEA] [GO:0048146 "positive
            regulation of fibroblast proliferation" evidence=IMP] [GO:0048589
            "developmental growth" evidence=IMP] [GO:0048844 "artery
            morphogenesis" evidence=IMP] InterPro:IPR004181 Pfam:PF02891
            PROSITE:PS51044 MGI:MGI:3040693 GO:GO:0005737 GO:GO:0046872
            GO:GO:0016607 GO:GO:0003007 GO:GO:0008270 GO:GO:0001701
            GO:GO:0045944 GO:GO:0006351 Gene3D:3.30.40.10 InterPro:IPR013083
            GO:GO:0048589 GO:GO:0001570 GO:GO:0048146 GO:GO:0048844
            GO:GO:0007569 GO:GO:0007296 GeneTree:ENSGT00550000074410 CTD:57178
            eggNOG:NOG237400 HOGENOM:HOG000253014 HOVERGEN:HBG056252
            OMA:MNQYGPM OrthoDB:EOG45MN70 ChiTaRS:ZMIZ1 EMBL:BC057691
            EMBL:BC058646 EMBL:BC065120 EMBL:AK054366 IPI:IPI00226072
            IPI:IPI00480418 RefSeq:NP_899031.2 UniGene:Mm.227484
            UniGene:Mm.486339 UniGene:Mm.489608 ProteinModelPortal:Q6P1E1
            SMR:Q6P1E1 IntAct:Q6P1E1 STRING:Q6P1E1 PhosphoSite:Q6P1E1
            PaxDb:Q6P1E1 PRIDE:Q6P1E1 Ensembl:ENSMUST00000007961
            Ensembl:ENSMUST00000162645 GeneID:328365 KEGG:mmu:328365
            UCSC:uc007srn.1 UCSC:uc007sro.1 InParanoid:Q6P1E1 NextBio:398259
            Bgee:Q6P1E1 CleanEx:MM_ZMIZ1 Genevestigator:Q6P1E1
            GermOnline:ENSMUSG00000007817 Uniprot:Q6P1E1
        Length = 1072

 Score = 121 (47.7 bits), Expect = 0.00097, P = 0.00097
 Identities = 65/232 (28%), Positives = 84/232 (36%)

Query:   287 GPNTSTSAYAATQSGTPMRAAYDIPRGPG-YEASKGP-GYDASKAPSYDPTKGP--SYDP 342
             GP  S+     TQ+          PRGP     S  P G  A   PS     GP    + 
Sbjct:   318 GPVCSSFQMGPTQAYNSQFMNQPGPRGPASMGGSLNPAGMAAGMTPS--GMSGPPMGMNQ 375

Query:   343 AKGPGYDP--TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQR 400
              + PG  P  T G     Q       Q  P   I R    +P  G   + Q GPN     
Sbjct:   376 PRPPGISPFGTHGQRMPQQTYPGPRPQSLPIQSIKRPYPGEPNYG---NQQYGPNSQFPT 432

Query:   401 GPG-YETQRVPGYDVQRGPVYEAQRAPSYIPQRG--PGYDLQRGQGYDMRRAPSYDP--S 455
              PG Y T   P       P Y  QR PS  P  G  P   +  GQ Y   +    +   S
Sbjct:   433 QPGQYPTPNPPR--PLTSPNYPGQRMPSQ-PSTGQYPPPTVNMGQYYKPEQFNGQNNTFS 489

Query:   456 RGTGFDGAPRGAAPHGQVPPPLNNVPY----GSATPPARSGSGQPRGGNPAR 503
              G+ +    +G+      P P+ N P+    G+ TPP   GS  P   +P++
Sbjct:   490 SGSSYSSYSQGSVNRPPRPVPVANYPHSPVPGNPTPPMTPGSSIPPYLSPSQ 541


>UNIPROTKB|F1RLL9
            symbol:COL4A2 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0031012 "extracellular matrix" evidence=IDA]
            [GO:0071560 "cellular response to transforming growth factor beta
            stimulus" evidence=IEA] [GO:0016525 "negative regulation of
            angiogenesis" evidence=IEA] [GO:0006351 "transcription,
            DNA-dependent" evidence=IEA] [GO:0005587 "collagen type IV"
            evidence=IEA] [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] InterPro:IPR001442 Pfam:PF01413
            PROSITE:PS51403 SMART:SM00111 GO:GO:0071560 InterPro:IPR016187
            SUPFAM:SSF56436 GO:GO:0006351 InterPro:IPR008160 Pfam:PF01391
            GO:GO:0016525 GO:GO:0005201 GO:GO:0005587 Gene3D:2.170.240.10
            GeneTree:ENSGT00690000101772 OMA:TTIPEQN EMBL:CT954227
            EMBL:CU041289 Ensembl:ENSSSCT00000010463 Uniprot:F1RLL9
        Length = 1654

 Score = 123 (48.4 bits), Expect = 0.00097, P = 0.00097
 Identities = 81/291 (27%), Positives = 106/291 (36%)

Query:   230 PNVDRRAADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVG--- 285
             P  D        GG  G   ++     VG   Y    G P G  GPP      G  G   
Sbjct:    55 PGADGIPGHPGQGGPRGRPGSDGCNGTVGDTGYAGPVG-PDGFLGPPGPQGPKGQKGEPY 113

Query:   286 AGPNTSTSAYAATQSGTPMRAAYDIPRG-PGYEASKGPGYDASKAPSYDPTKGPSYDPAK 344
             A        Y   + G P    +  P G PG     GP      AP      GP   P  
Sbjct:   114 ALSREDRDKYRG-EPGEPGLVGFQGPPGRPGPVGQMGP----VGAPGRPGPPGPP-GPKG 167

Query:   345 GPGYDPTKGPGYDAQKGSNYDA-QRGPN---YDIHRGPSYDPQRGLGY-DMQRGPNYDMQ 399
              PG    +G G+  +KG   D  Q GPN    D H  P   P R   Y D  +G     +
Sbjct:   168 QPG---NRGLGFYGEKGEKGDVGQPGPNGIPSDNHH-PIIGPTRETIYLDQYKGEK-GSE 222

Query:   400 RGPGYETQRVPGYDVQRGPVYEAQRA-PSYIPQRG-PGYDLQRG-QGYDMRRAPS-YDPS 455
               PG +   + G +   G  +   R  P +  ++G PG    RG  GY+    P  Y   
Sbjct:   223 GEPGRKGISLKGEEGIMG--FSGSRGVPGFDGEKGSPGQKGSRGLDGYE---GPDGYPGP 277

Query:   456 RGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP--RG--GNP 501
             +G   D  P GA  +   P P  ++  G+   P   G+ G+P  RG  G+P
Sbjct:   278 KGERGDPGPPGAPAYS--PHP--SLAKGARGEPGFPGALGEPGARGEPGDP 324


>MGI|MGI:2147661
            symbol:Vps37c "vacuolar protein sorting 37C (yeast)"
            species:10090 "Mus musculus" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005768 "endosome" evidence=IEA] [GO:0006810
            "transport" evidence=IEA] [GO:0015031 "protein transport"
            evidence=IEA] [GO:0016020 "membrane" evidence=IEA] MGI:MGI:2147661
            GO:GO:0031902 GO:GO:0015031 InterPro:IPR009851 Pfam:PF07200
            PROSITE:PS51314 KO:K12185 GeneTree:ENSGT00530000063123
            HOGENOM:HOG000234744 HOVERGEN:HBG073355 CTD:55048 eggNOG:NOG311749
            OMA:VERCQEQ OrthoDB:EOG4B2SZG EMBL:AK158833 EMBL:AK159309
            EMBL:BC025865 IPI:IPI00153241 IPI:IPI00877200 RefSeq:NP_852068.1
            UniGene:Mm.19091 ProteinModelPortal:Q8R105 IntAct:Q8R105
            STRING:Q8R105 PhosphoSite:Q8R105 PaxDb:Q8R105 PRIDE:Q8R105
            Ensembl:ENSMUST00000087951 GeneID:107305 KEGG:mmu:107305
            UCSC:uc008gqr.1 UCSC:uc008gqs.1 InParanoid:Q8R105 NextBio:358674
            Bgee:Q8R105 CleanEx:MM_VPS37C Genevestigator:Q8R105 Uniprot:Q8R105
        Length = 352

 Score = 90 (36.7 bits), Expect = 0.00099, Sum P(2) = 0.00099
 Identities = 46/178 (25%), Positives = 60/178 (33%)

Query:   268 VPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPG-YDA 326
             VP    PPP                     T    P   +  +P GP  + +  P  +  
Sbjct:   170 VPPKRPPPPRPVPQATPPETEEQPPQPSVVTPYPLPYSPSPGLPVGPTAQGALQPAPFPV 229

Query:   327 SKAPS-YDPTKGPSYDPAKGP----GYDPTKGPGYDAQKG--SNYDAQRGPNYDIHRGPS 379
                PS Y    GP   P  GP    GY  +       Q G  +   +  GP Y +  G +
Sbjct:   230 VAQPSSYGGPLGPYPSPHPGPRAMVGYSWSPQRSGPPQPGYPTAPTSTSGPGYPLVGGRT 289

Query:   380 YDPQRGLGYDMQRGPNYDMQRGPGYETQ-RVPGYDVQRGPVYEAQRAPSYIPQRGPGY 436
               P    GY  Q+ P       P Y TQ ++PG+  Q  P    Q  P Y P   P Y
Sbjct:   290 PGP----GYP-QQSPYLPSGNKPPYPTQPQLPGFPGQPQPPVPPQ--PPYPPGTTPSY 340

 Score = 68 (29.0 bits), Expect = 0.00099, Sum P(2) = 0.00099
 Identities = 20/81 (24%), Positives = 42/81 (51%)

Query:    43 DMMPPPEVMEQKIASQHVEMQKLATENQRLAATHGTLRQELAAAQHELQILHGQIGGMKS 102
             +M   PE +  ++A +  E+Q L  E +   AT+ +L ++    Q  L+I    +    S
Sbjct:    14 EMQNDPEAIA-RLALESPEVQDLQLEREMALATNRSLAEQNLEFQGPLEISRSNL----S 68

Query:   103 ERELQMRNLTEKIAKMEAELK 123
             ++  ++R L E+  + +A+L+
Sbjct:    69 DKYQELRKLVERCQEQKAKLE 89


>UNIPROTKB|P02465
            symbol:COL1A2 "Collagen alpha-2(I) chain" species:9913 "Bos
            taurus" [GO:0071230 "cellular response to amino acid stimulus"
            evidence=IEA] [GO:0070208 "protein heterotrimerization"
            evidence=IEA] [GO:0048407 "platelet-derived growth factor binding"
            evidence=IEA] [GO:0046332 "SMAD binding" evidence=IEA] [GO:0043589
            "skin morphogenesis" evidence=IEA] [GO:0042802 "identical protein
            binding" evidence=IEA] [GO:0030674 "protein binding, bridging"
            evidence=IEA] [GO:0030199 "collagen fibril organization"
            evidence=IEA] [GO:0008217 "regulation of blood pressure"
            evidence=IEA] [GO:0007266 "Rho protein signal transduction"
            evidence=IEA] [GO:0007179 "transforming growth factor beta receptor
            signaling pathway" evidence=IEA] [GO:0005615 "extracellular space"
            evidence=IEA] [GO:0005584 "collagen type I" evidence=IEA]
            [GO:0001568 "blood vessel development" evidence=IEA] [GO:0001501
            "skeletal system development" evidence=IEA] [GO:0046872 "metal ion
            binding" evidence=IEA] [GO:0005201 "extracellular matrix structural
            constituent" evidence=IEA] InterPro:IPR000885 Pfam:PF01410
            ProDom:PD002078 PROSITE:PS51461 SMART:SM00038 GO:GO:0005615
            GO:GO:0046872 GO:GO:0030199 GO:GO:0001501 GO:GO:0008217
            GO:GO:0007179 GO:GO:0007266 GO:GO:0070208 InterPro:IPR008160
            Pfam:PF01391 eggNOG:NOG12793 GO:GO:0001568 GO:GO:0071230
            GO:GO:0005201 GO:GO:0043589 GeneTree:ENSGT00660000095287
            HOGENOM:HOG000085654 HOVERGEN:HBG004933 KO:K06236 GO:GO:0005584
            EMBL:AB008683 EMBL:BC149095 IPI:IPI00708244 PIR:A90596
            RefSeq:NP_776945.1 UniGene:Bt.53485 IntAct:P02465 MINT:MINT-1346303
            STRING:P02465 Allergome:3550 Allergome:896 PRIDE:P02465
            Ensembl:ENSBTAT00000033863 GeneID:282188 KEGG:bta:282188 CTD:1278
            InParanoid:P02465 OMA:TGPIGSA OrthoDB:EOG412M65 NextBio:20806016
            PMAP-CutDB:P02465 Uniprot:P02465
        Length = 1364

 Score = 122 (48.0 bits), Expect = 0.0010, P = 0.0010
 Identities = 78/267 (29%), Positives = 94/267 (35%)

Query:   243 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAAT--QS 300
             GA G   N  +  P G    + G G     GPP      G  G               + 
Sbjct:   518 GAPGPDGNNGAQGPPGLQGVQGGKGEQGPAGPPGFQGLPGPAGTAGEAGKPGERGIPGEF 577

Query:   301 GTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDPTKGPSYDPAKG-PGYDPTKGPGYD 357
             G P  A     RGP G   + GP G   S+ PS  P  GP  D  KG PG      PG  
Sbjct:   578 GLPGPAGARGERGPPGESGAAGPTGPIGSRGPSGPP--GP--DGNKGEPGV--VGAPGTA 631

Query:   358 AQKG-SNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVP-GYDV 414
                G S    +RG    I  G     + GL  D+   P  D  RG PG      P G + 
Sbjct:   632 GPSGPSGLPGERGAA-GIPGGKGEKGETGLRGDIG-SPGRDGARGAPGAIGAPGPAGANG 689

Query:   415 QRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP--RGA-APHG 471
              RG    A  A    P+  PG   +RG+           P+   G  GA   RG   P G
Sbjct:   690 DRGEAGPAGPAGPAGPRGSPG---ERGEVGPAGPNGFAGPAGAAGQPGAKGERGTKGPKG 746

Query:   472 QVPPPLNNVPYGSATPPARSGSGQPRG 498
             +  P     P G+A P   +G   P G
Sbjct:   747 ENGPVGPTGPVGAAGPSGPNGPPGPAG 773


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.311   0.132   0.391    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      504       485   0.00081  119 3  11 23  0.36    35
                                                     35  0.45    37
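
The bit scores printed in parentheses next to each raw score can be reproduced from the table above. The following is a minimal sketch, assuming that the Q=9,R=2 row (Lambda = 0.244, K = 0.0300) holds the parameters applied to gapped alignments and that the standard Karlin-Altschul normalization, bits = (Lambda*S - ln K) / ln 2, is in use. The function name is illustrative and is not part of BLAST.

```python
import math

# Values copied from the "As Used" columns of the Q=9,R=2 row above.
# Treating that row as the gapped-alignment parameters is an assumption.
LAMBDA_GAPPED = 0.244
K_GAPPED = 0.0300


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


# Raw scores taken from hit reports in this section.
for raw in (123, 126, 116):
    print(raw, round(bit_score(raw), 1))
```

Under these assumptions a raw score of 123 gives 48.4 bits, 126 gives 49.4 bits, and 116 gives 45.9 bits, which matches the values printed in the hit reports above.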


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  241
  No. of states in DFA:  586 (62 KB)
  Total size of DFA:  256 KB (2132 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  58.90u 0.09s 58.99t   Elapsed:  00:00:04
  Total cpu time:  59.00u 0.10s 59.10t   Elapsed:  00:00:04
  Start:  Sat May 11 02:18:22 2013   End:  Sat May 11 02:18:26 2013
WARNINGS ISSUED:  1
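
Each hit report lists an Expect value and a P value that are nearly identical at this threshold. The sketch below shows the usual Poisson relationship between the two, P = 1 - exp(-E). That relationship is assumed here rather than stated anywhere in this report, and the helper name is hypothetical.

```python
import math


def p_from_expect(expect):
    """Probability of at least one chance hit when `expect` hits are expected."""
    # expm1 keeps precision for the small E values seen in this report.
    return -math.expm1(-expect)


# Two values near the report's threshold, plus E = 1 for contrast.
for e in (0.00084, 0.001, 1.0):
    print(e, p_from_expect(e))
```

For E well below 1 the two numbers agree to the precision printed, which is why Expect and P look the same throughout this report. They diverge only as E approaches 1.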
