Your job contains 1 sequence.
>016220
MPKVGAHKLEIRCTLIFTCTLDFLFRQVYSKALHFGHPWICESSSPQYYFLHLAFQHCYC
AIFLKIWSKNAITFHLCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNAP
NVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTS
TSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPT
KGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPG
YDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHG
QVPPPLNNVPYGSATPPARSGSGQPRGGNPARR
The BLAST search returned 4 gene products which did not match your query constraints. Please see the full BLAST report below for the details.
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 016220
(393 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
Smallest
Sum
High Probability
Sequences producing High-scoring Segment Pairs: Score P(N) N
TAIR|locus:2033681 - symbol:AT1G67170 "AT1G67170" species... 335 2.3e-30 1
FB|FBgn0085362 - symbol:Vml "Vitelline membrane-like" spe... 286 1.0e-23 1
UNIPROTKB|F1S187 - symbol:LOC100518332 "Uncharacterized p... 201 1.8e-13 1
UNIPROTKB|P11414 - symbol:POLR2A "DNA-directed RNA polyme... 184 2.2e-11 1
UNIPROTKB|A4II09 - symbol:eif3a "Eukaryotic translation i... 186 8.8e-11 2
UNIPROTKB|A2VD00 - symbol:eif3a "Eukaryotic translation i... 184 1.6e-10 2
WB|WBGene00020550 - symbol:T17H7.1 species:6239 "Caenorha... 172 9.4e-10 1
FB|FBgn0028573 - symbol:prc "pericardin" species:7227 "Dr... 171 4.0e-09 1
ZFIN|ZDB-GENE-041008-78 - symbol:polr2a "polymerase (RNA)... 170 6.0e-09 1
UNIPROTKB|P71590 - symbol:fhaA "FHA domain-containing pro... 162 8.0e-09 1
UNIPROTKB|Q92804 - symbol:TAF15 "TATA-binding protein-ass... 160 8.8e-09 2
UNIPROTKB|F1PB61 - symbol:TAF15 "Uncharacterized protein"... 160 1.5e-08 1
WB|WBGene00044109 - symbol:K02E11.10 species:6239 "Caenor... 154 2.9e-08 1
DICTYBASE|DDB_G0277909 - symbol:cbpP "calcium-binding pro... 155 3.9e-08 1
FB|FBgn0050203 - symbol:CG30203 species:7227 "Drosophila ... 157 6.5e-08 1
WB|WBGene00005015 - symbol:spt-5 species:6239 "Caenorhabd... 158 7.1e-08 1
UNIPROTKB|Q21338 - symbol:spt-5 "Transcription elongation... 158 7.1e-08 1
SGD|S000002299 - symbol:RPO21 "RNA polymerase II largest ... 159 8.4e-08 1
MGI|MGI:1330280 - symbol:Krtap6-2 "keratin associated pro... 128 1.3e-07 1
WB|WBGene00002280 - symbol:let-2 species:6239 "Caenorhabd... 157 1.4e-07 1
UNIPROTKB|P17140 - symbol:let-2 "Collagen alpha-2(IV) cha... 157 1.4e-07 1
ZFIN|ZDB-GENE-030131-5725 - symbol:arid1ab "AT rich inter... 157 1.5e-07 2
WB|WBGene00000123 - symbol:ama-1 species:6239 "Caenorhabd... 157 1.5e-07 1
UNIPROTKB|P16356 - symbol:ama-1 "DNA-directed RNA polymer... 157 1.5e-07 1
UNIPROTKB|J9P0I3 - symbol:ZNF768 "Uncharacterized protein... 148 3.1e-07 1
FB|FBgn0035872 - symbol:CG7185 species:7227 "Drosophila m... 141 3.2e-07 2
UNIPROTKB|J3KNM7 - symbol:COL4A4 "Collagen alpha-4(IV) ch... 153 3.7e-07 1
UNIPROTKB|P53420 - symbol:COL4A4 "Collagen alpha-4(IV) ch... 153 3.7e-07 1
UNIPROTKB|D4ADB1 - symbol:D4ADB1 "Uncharacterized protein... 148 4.3e-07 1
UNIPROTKB|P02457 - symbol:COL1A1 "Collagen alpha-1(I) cha... 149 8.6e-07 1
UNIPROTKB|G4N3H5 - symbol:MGG_04961 "Uncharacterized prot... 144 1.0e-06 1
WB|WBGene00004203 - symbol:swsn-1 species:6239 "Caenorhab... 145 1.1e-06 1
TAIR|locus:2012713 - symbol:AT1G33680 "AT1G33680" species... 144 1.4e-06 1
UNIPROTKB|Q96QC0 - symbol:PPP1R10 "Serine/threonine-prote... 145 1.4e-06 1
UNIPROTKB|G1RSL2 - symbol:COL4A4 "Uncharacterized protein... 147 1.7e-06 1
FB|FBgn0261885 - symbol:osa "osa" species:7227 "Drosophil... 148 2.2e-06 1
UNIPROTKB|Q5TM61 - symbol:PPP1R10 "Serine/threonine-prote... 143 2.3e-06 1
UNIPROTKB|Q7YR38 - symbol:PPP1R10 "Serine/threonine-prote... 143 2.3e-06 1
UNIPROTKB|C9JGE3 - symbol:EWSR1 "Ewing sarcoma breakpoint... 127 2.6e-06 2
UNIPROTKB|P12105 - symbol:COL3A1 "Collagen alpha-1(III) c... 144 2.6e-06 1
TAIR|locus:2012788 - symbol:AT1G10390 "AT1G10390" species... 143 2.6e-06 1
MGI|MGI:1344412 - symbol:Ldb3 "LIM domain binding 3" spec... 141 2.7e-06 1
UNIPROTKB|O75112 - symbol:LDB3 "LIM domain-binding protei... 141 2.7e-06 1
UNIPROTKB|G7N928 - symbol:EGK_04858 "Putative uncharacter... 145 2.8e-06 1
UNIPROTKB|G7PK77 - symbol:EGM_04376 "Putative uncharacter... 145 2.8e-06 1
TAIR|locus:2043530 - symbol:AT2G25970 "AT2G25970" species... 140 2.9e-06 1
ZFIN|ZDB-GENE-050809-108 - symbol:pygo2 "pygopus homolog ... 139 3.2e-06 1
UNIPROTKB|P04258 - symbol:COL3A1 "Collagen alpha-1(III) c... 142 3.4e-06 1
UNIPROTKB|E2R2K8 - symbol:PPP1R10 "Uncharacterized protei... 141 3.8e-06 1
ZFIN|ZDB-GENE-030131-1600 - symbol:ewsr1b "Ewing sarcoma ... 142 3.8e-06 2
ZFIN|ZDB-GENE-040426-1010 - symbol:fus "fusion (involved ... 137 4.9e-06 1
UNIPROTKB|I3LQ53 - symbol:I3LQ53 "Uncharacterized protein... 137 5.0e-06 1
UNIPROTKB|F1MXS8 - symbol:COL3A1 "Collagen alpha-1(III) c... 142 5.1e-06 1
UNIPROTKB|J9P8F7 - symbol:COL5A1 "Uncharacterized protein... 141 6.2e-06 1
UNIPROTKB|E1C0T1 - symbol:TFG "Uncharacterized protein" s... 134 6.3e-06 1
UNIPROTKB|F1LLX1 - symbol:Col11a1 "Collagen alpha-1(XI) c... 142 6.4e-06 1
RGD|2372 - symbol:Col11a1 "collagen, type XI, alpha 1" sp... 142 6.4e-06 1
UNIPROTKB|P20909 - symbol:Col11a1 "Collagen alpha-1(XI) c... 142 6.4e-06 1
TAIR|locus:2077547 - symbol:AT3G07030 species:3702 "Arabi... 134 6.6e-06 1
UNIPROTKB|Q8WML4 - symbol:MUC1 "Mucin-1" species:9913 "Bo... 136 7.0e-06 1
CGD|CAL0000919 - symbol:RPO21 species:5476 "Candida albic... 141 7.8e-06 1
UNIPROTKB|Q5ACI7 - symbol:RPO21 "DNA-directed RNA polymer... 141 7.8e-06 1
UNIPROTKB|F1P555 - symbol:SFPQ "Uncharacterized protein" ... 136 8.2e-06 1
UNIPROTKB|F1PHX8 - symbol:COL5A1 "Uncharacterized protein... 141 8.3e-06 1
MGI|MGI:2384582 - symbol:Zfp768 "zinc finger protein 768"... 135 8.8e-06 1
MGI|MGI:2157767 - symbol:Krtap21-1 "keratin associated pr... 111 9.3e-06 1
UNIPROTKB|F1N474 - symbol:COL4A5 "Uncharacterized protein... 140 9.8e-06 1
FB|FBgn0003277 - symbol:RpII215 "RNA polymerase II 215kD ... 140 1.1e-05 1
TAIR|locus:2035751 - symbol:AT1G55170 "AT1G55170" species... 129 1.1e-05 1
UNIPROTKB|K7EKB2 - symbol:TAF15 "TATA-binding protein-ass... 125 1.2e-05 1
UNIPROTKB|F1RFI8 - symbol:EWSR1 "Uncharacterized protein"... 121 1.3e-05 2
UNIPROTKB|E2RS29 - symbol:E2RS29 "Uncharacterized protein... 133 1.3e-05 1
UNIPROTKB|F1RYI8 - symbol:COL3A1 "Uncharacterized protein... 138 1.4e-05 1
UNIPROTKB|F1NI73 - symbol:COL3A1 "Collagen alpha-1(III) c... 137 1.5e-05 1
WB|WBGene00000628 - symbol:col-51 species:6239 "Caenorhab... 131 1.6e-05 1
UNIPROTKB|Q28009 - symbol:FUS "RNA-binding protein FUS" s... 132 1.6e-05 1
ZFIN|ZDB-GENE-070912-607 - symbol:col11a1b "collagen, typ... 138 1.8e-05 1
WB|WBGene00000251 - symbol:bli-1 species:6239 "Caenorhabd... 135 1.8e-05 1
UNIPROTKB|J9P0L0 - symbol:COL3A1 "Uncharacterized protein... 137 1.8e-05 1
UNIPROTKB|F1N7Q7 - symbol:COL4A2 "Collagen alpha-2(IV) ch... 137 2.0e-05 1
UNIPROTKB|F1LRJ1 - symbol:Col4a3 "Protein Col4a3" species... 137 2.1e-05 1
UNIPROTKB|J9NW09 - symbol:POLR2A "DNA-directed RNA polyme... 137 2.2e-05 1
MGI|MGI:88453 - symbol:Col3a1 "collagen, type III, alpha ... 136 2.3e-05 1
FB|FBgn0262126 - symbol:gho "ghost" species:7227 "Drosoph... 135 2.3e-05 1
WB|WBGene00001215 - symbol:ego-2 species:6239 "Caenorhabd... 136 2.3e-05 1
UNIPROTKB|F1SKM1 - symbol:COL7A1 "Uncharacterized protein... 148 2.4e-05 2
UNIPROTKB|F1NRH2 - symbol:LOC100858979 "Uncharacterized p... 132 2.4e-05 1
UNIPROTKB|F1PGS0 - symbol:POLR2A "DNA-directed RNA polyme... 137 2.5e-05 1
UNIPROTKB|G3MZY8 - symbol:POLR2A "DNA-directed RNA polyme... 137 2.5e-05 1
UNIPROTKB|P24928 - symbol:POLR2A "DNA-directed RNA polyme... 137 2.5e-05 1
MGI|MGI:98086 - symbol:Polr2a "polymerase (RNA) II (DNA d... 137 2.5e-05 1
RGD|1587326 - symbol:Polr2a "polymerase (RNA) II (DNA dir... 137 2.5e-05 1
UNIPROTKB|F1RXW0 - symbol:COL5A2 "Uncharacterized protein... 135 2.5e-05 1
TAIR|locus:2089616 - symbol:AT3G14750 "AT3G14750" species... 127 2.7e-05 1
UNIPROTKB|I3LSV6 - symbol:COL2A1 "Uncharacterized protein... 135 2.7e-05 1
TAIR|locus:4010713902 - symbol:AT4G22505 species:3702 "Ar... 130 2.8e-05 1
UNIPROTKB|F1PG69 - symbol:COL3A1 "Uncharacterized protein... 135 3.0e-05 1
UNIPROTKB|F1N2Y2 - symbol:COL5A2 "Uncharacterized protein... 135 3.0e-05 1
UNIPROTKB|F1PG08 - symbol:COL5A2 "Uncharacterized protein... 135 3.0e-05 1
FB|FBgn0052685 - symbol:ZAP3 species:7227 "Drosophila mel... 136 3.0e-05 1
WARNING: Descriptions of 139 database sequences were not reported due to the
limiting value of parameter V = 100.
>TAIR|locus:2033681 [details] [associations]
symbol:AT1G67170 "AT1G67170" species:3702 "Arabidopsis
thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
"biological_process" evidence=ND] EMBL:CP002684 EMBL:BT005883
EMBL:AK228253 IPI:IPI00547288 RefSeq:NP_176888.2 UniGene:At.35681
ProteinModelPortal:Q84TD8 SMR:Q84TD8 IntAct:Q84TD8 PRIDE:Q84TD8
EnsemblPlants:AT1G67170.1 GeneID:843037 KEGG:ath:AT1G67170
TAIR:At1g67170 HOGENOM:HOG000005883 InParanoid:Q84TD8 OMA:MESKGRI
PhylomeDB:Q84TD8 ProtClustDB:CLSN2918424 Genevestigator:Q84TD8
Uniprot:Q84TD8
Length = 359
Score = 335 (123.0 bits), Expect = 2.3e-30, P = 2.3e-30
Identities = 84/176 (47%), Positives = 99/176 (56%)
Query: 74 FHLCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRRADGSYGGA 133
+ CR TY+YEKKFYNDHLESLQ MEKNY+TMA EVEKL+A+LMN N DRRA G YG
Sbjct: 191 YQQCRATYDYEKKFYNDHLESLQAMEKNYMTMAREVEKLQAQLMNNANSDRRAGGPYGNN 250
Query: 134 TGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSA--Y---AATQ 188
N+E + SG G YED +G PQG+ P P A A GPN+ A Y TQ
Sbjct: 251 I-NAEIDASGHQSGNGYYEDAFG-PQGYIPQPVAGNA----TGPNSVVGAAQYPYQGVTQ 304
Query: 189 SGT-PMRAAYDIPRGPGYEASKGPGYDASKAPSYDP-TKGPSYD-PAKGPGYDPTK 241
G P R Y+ PRGP S P P P GPS + P G +P++
Sbjct: 305 PGYFPQRPGYNFPRGP--PGSYDPTTRLPTGPYGAPFPPGPSNNTPYAGTHGNPSR 358
>FB|FBgn0085362 [details] [associations]
symbol:Vml "Vitelline membrane-like" species:7227 "Drosophila
melanogaster" [GO:0009950 "dorsal/ventral axis specification"
evidence=IGI] [GO:0060388 "vitelline envelope" evidence=IDA]
[GO:0007305 "vitelline membrane formation involved in
chorion-containing eggshell formation" evidence=ISM] [GO:0008316
"structural constituent of vitelline membrane" evidence=ISM]
[GO:0035805 "egg coat" evidence=ISM] EMBL:AE014298 GO:GO:0009950
GeneTree:ENSGT00700000104744 PROSITE:PS51137 GO:GO:0060388
InterPro:IPR013135 RefSeq:NP_001096866.1 UniGene:Dm.32785
STRING:A8JUV4 EnsemblMetazoa:FBtr0112535 GeneID:5740271
KEGG:dme:Dmel_CG34333 UCSC:CG34333-RA CTD:5740271
FlyBase:FBgn0085362 eggNOG:NOG284187 InParanoid:A8JUV4 OMA:ISKYETI
OrthoDB:EOG4KPRTT GenomeRNAi:5740271 NextBio:20891311 Bgee:A8JUV4
Uniprot:A8JUV4
Length = 578
Score = 286 (105.7 bits), Expect = 1.0e-23, P = 1.0e-23
Identities = 83/283 (29%), Positives = 99/283 (34%)
Query: 119 APNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQG---HGPPPSATTAGVVGA 175
AP+ A SY S + S P Y P H P A++ A
Sbjct: 198 APSYSAPAAPSYSAPAAPSYSAPSA-PSYSAQKTSSYSAPAAPSYHAPAAPASSYSAP-A 255
Query: 176 GPNTSTSA---YAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPA 232
GP+ S A Y+A P ++Y + P Y A P Y A APSY + PSY
Sbjct: 256 GPSYSAPAAPSYSAPSYSAPA-SSYSALKAPSYSAPAAPSYSAPAAPSYSSSASPSYSSP 314
Query: 233 KGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPG 292
Y P Y A K +Y A P+Y PSY Y P+Y P
Sbjct: 315 ASSSYSAPAAPTYSAPKAQSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPS 374
Query: 293 YETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGA 352
Y P Y Y A APSY P Y Y APSY + A
Sbjct: 375 YSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYS-A 433
Query: 353 PRGAAPHGQVPP-PLNNVPYGSATPPARS---GSGQPRGGNPA 391
P AAP P P + P S AR+ GS P G A
Sbjct: 434 P--AAPSYSAPAAPSYSAPASSGYSAARAYSAGSAAPASGYSA 474
Score = 274 (101.5 bits), Expect = 8.8e-22, P = 8.8e-22
Identities = 80/271 (29%), Positives = 97/271 (35%)
Query: 133 ATGNSENETSGRPVGQNAYEDGYG--VP-QGHGPP------PSATTAGVVG-AGPNTSTS 182
AT N E + G P + YE+ + +P Q + PP S + A G + P
Sbjct: 24 ATRNEEFD-DGFPESEFDYEERHTREIPAQAYAPPIVYNSQSSYSPAKDQGYSAPAAPVY 82
Query: 183 AYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG 242
+ AA P +Y P P Y A P Y A APSY PSY Y
Sbjct: 83 SPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPASSSYSAPAA 142
Query: 243 PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYD 302
P Y A +Y A P+Y SY Y P+Y P Y P Y
Sbjct: 143 PSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYS 202
Query: 303 VQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYD-PSRGTGFDGAPRGAAPHGQ 361
P Y A APSY P Y Q+ Y APSY P+ AP G P
Sbjct: 203 APAAPSYSAPAAPSYSAPSAPSYSAQKTSSYSAPAAPSYHAPAAPASSYSAPAG--PSYS 260
Query: 362 VPP-PLNNVPYGSATPPARSGSGQPRGGNPA 391
P P + P SA + S P PA
Sbjct: 261 APAAPSYSAPSYSAPASSYSALKAPSYSAPA 291
Score = 262 (97.3 bits), Expect = 3.1e-20, P = 3.1e-20
Identities = 69/246 (28%), Positives = 83/246 (33%)
Query: 155 YGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGY 213
Y P + S + A G + P + AA P +Y P P Y A P Y
Sbjct: 54 YAPPIVYNSQSSYSPAKDQGYSAPAAPVYSPAAPSYSAPAAPSYSAPAAPSYSAPAAPSY 113
Query: 214 DASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQR 273
A APSY PSY Y P Y A +Y A P+Y SY
Sbjct: 114 SAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPAAPSYSAPASSSYSAPA 173
Query: 274 GLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGY 333
Y P+Y P Y P Y P Y A APSY P Y Q+ Y
Sbjct: 174 APSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPSAPSYSAQKTSSY 233
Query: 334 DMRRAPSYD-PSRGTGFDGAPRGAAPHGQVPP----PLNNVP---YGSATPPARSGSGQP 385
APSY P+ AP G + P P + P Y + P+ S P
Sbjct: 234 SAPAAPSYHAPAAPASSYSAPAGPSYSAPAAPSYSAPSYSAPASSYSALKAPSYSAPAAP 293
Query: 386 RGGNPA 391
PA
Sbjct: 294 SYSAPA 299
Score = 259 (96.2 bits), Expect = 7.3e-20, P = 7.3e-20
Identities = 66/241 (27%), Positives = 84/241 (34%)
Query: 155 YGVPQG--HGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPG 212
Y P G + P + + + + P +S SA A P +Y P P Y +S P
Sbjct: 251 YSAPAGPSYSAPAAPSYSAPSYSAPASSYSALKAPSYSAPAAPSYSAPAAPSYSSSASPS 310
Query: 213 YD--------ASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIH 264
Y A AP+Y K SY P Y P Y A S+Y A P+Y
Sbjct: 311 YSSPASSSYSAPAAPTYSAPKAQSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAP 370
Query: 265 RGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPG 324
PSY Y +Y P Y P Y Y A APSY P
Sbjct: 371 AAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPS 430
Query: 325 YDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQ 384
Y Y APSY +G+ A R + P + P S+ A + SG
Sbjct: 431 YSAPAAPSYSAPAAPSYSAPASSGYSAA-RAYSAGSAAPASGYSAPKTSSGYSAPASSGS 489
Query: 385 P 385
P
Sbjct: 490 P 490
Score = 254 (94.5 bits), Expect = 3.0e-19, P = 3.0e-19
Identities = 73/277 (26%), Positives = 91/277 (32%)
Query: 119 APNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPN 178
AP+ A SY S + + A Y P T++ A P+
Sbjct: 182 APSYSAPAAPSYSAPAAPSYSAPAAPSYSAPA-APSYSAPSAPSYSAQKTSSYSAPAAPS 240
Query: 179 TSTSAYAATQSGTPMRAAYDIPRGPGYEASK--GPG--YDASKAPSYDPTKGPSYDPAKG 234
A A+ P +Y P P Y A P Y A KAPSY PSY
Sbjct: 241 YHAPAAPASSYSAPAGPSYSAPAAPSYSAPSYSAPASSYSALKAPSYSAPAAPSYSAPAA 300
Query: 235 PGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE 294
P Y + P Y + S+Y A P Y + SY Y P+Y Y
Sbjct: 301 PSYSSSASPSYSSPASSSYSAPAAPTYSAPKAQSYSAPAAPSYSAPAAPSYSAPASSSYS 360
Query: 295 TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPR 354
P Y P Y A APSY Y Y APSY + + AP
Sbjct: 361 APAAPSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPASSSYS-AP- 418
Query: 355 GAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPA 391
AAP P + Y + P+ S P PA
Sbjct: 419 -AAPSYSAPAAPS---YSAPAAPSYSAPAAPSYSAPA 451
Score = 220 (82.5 bits), Expect = 2.8e-15, P = 2.8e-15
Identities = 80/278 (28%), Positives = 94/278 (33%)
Query: 117 MNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAG 176
+ AP+ A SY S + +S P + Y P P SA A A
Sbjct: 282 LKAPSYSAPAAPSYSAPAAPSYS-SSASPSYSSPASSSYSAPAA--PTYSAPKAQSYSAP 338
Query: 177 PNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG 236
S SA AA P ++Y P P Y A P Y A APSY SY P
Sbjct: 339 AAPSYSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPAAPSYSAPASSSYSAPAAPS 398
Query: 237 YDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQ 296
Y P Y A S+Y A P+Y PSY Y P+Y GY
Sbjct: 399 YSAPAAPSYSAPASSSYSAPAAPSYSAPAAPSYSAPAAPSYSAPAAPSYSAPASSGYSAA 458
Query: 297 RVPGYDVQRGPVYEAQRAPSY-IPQRGPGYDLQRGQGYDMRRAPSYD-PSRGTGFDG--A 352
R Y G A A Y P+ GY G A SY P+ T G A
Sbjct: 459 RA--YSA--G---SAAPASGYSAPKTSSGYSAPASSGSPA--ASSYSAPASSTASSGYSA 509
Query: 353 P--------RGAAPHGQVPPPLNNVPYGSATPPARSGS 382
P R H + YGSA P A G+
Sbjct: 510 PASKSSGYARSEMDHQILGMARTAGGYGSAAPSAAYGA 547
>UNIPROTKB|F1S187 [details] [associations]
symbol:LOC100518332 "Uncharacterized protein" species:9823
"Sus scrofa" [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005634
"nucleus" evidence=IEA] [GO:0008270 "zinc ion binding"
evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
[GO:0000166 "nucleotide binding" evidence=IEA] InterPro:IPR000504
InterPro:IPR001876 InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641
PROSITE:PS01358 PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360
SMART:SM00547 GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330
GO:GO:0003676 GO:GO:0005622 GeneTree:ENSGT00530000063105
EMBL:CU896616 Ensembl:ENSSSCT00000019273 OMA:TESSSGX Uniprot:F1S187
Length = 406
Score = 201 (75.8 bits), Expect = 1.8e-13, P = 1.8e-13
Identities = 69/221 (31%), Positives = 84/221 (38%)
Query: 120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNT 179
P R + G + G E GR G+ GYG + G + G G G +
Sbjct: 187 PEDSRPSGGDFRGRGYGGERGYRGRG-GRGGDRGGYGGDRSGGGYGGDRSGGG-GYGGDR 244
Query: 180 STSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDP 239
S Y +SG Y RG GY +G GY + Y + Y +G GY
Sbjct: 245 SGGGYGGDRSG----GGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRSGGYGGDRGGGYGG 300
Query: 240 TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQR-GPNY--DMQRGPGYETQ 296
+G GY +G Y RG Y RG Y RG GY R G Y D G GY
Sbjct: 301 DRGGGYGGDRGGGYGGDRGGGYGGDRG-GYGGDRG-GYGGDRSGGGYGGDRGSGSGYGGD 358
Query: 297 RVPGYDVQR-GPVYEAQRAPSYIPQRGPGYDLQRGQGYDMR 336
R GY R G Y R+ Y RG GY + G D R
Sbjct: 359 RSGGYGGDRSGGGYGGDRSGGYGGDRG-GYGGKMGGRNDYR 398
Score = 190 (71.9 bits), Expect = 3.2e-12, P = 3.2e-12
Identities = 71/225 (31%), Positives = 84/225 (37%)
Query: 136 NSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNTSTSAYAATQSGTPMR 194
N RP G + GYG +G+ G G G G + S Y +SG
Sbjct: 183 NEPRPEDSRPSGGDFRGRGYGGERGYRGRGGRGGDRG--GYGGDRSGGGYGGDRSGG--- 237
Query: 195 AAYDIPR-GPGYEASK-GPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSN 252
Y R G GY + G GY + Y +G Y +G GY + GY +G
Sbjct: 238 GGYGGDRSGGGYGGDRSGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRSGGYGGDRGGG 297
Query: 253 YDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPG-YDVQRGPVYEA 311
Y RG Y RG Y RG GY RG Y RG GY R G Y RG
Sbjct: 298 YGGDRGGGYGGDRGGGYGGDRGGGYGGDRG-GYGGDRG-GYGGDRSGGGYGGDRGS---- 351
Query: 312 QRAPSYIPQRGPGYDLQR-GQGYDMRRAPSYDPSRGTGFDGAPRG 355
Y R GY R G GY R+ Y RG G+ G G
Sbjct: 352 --GSGYGGDRSGGYGGDRSGGGYGGDRSGGYGGDRG-GYGGKMGG 393
>UNIPROTKB|P11414 [details] [associations]
symbol:POLR2A "DNA-directed RNA polymerase II subunit RPB1"
species:10029 "Cricetulus griseus" [GO:0005634 "nucleus"
evidence=ISS] [GO:0005665 "DNA-directed RNA polymerase II, core
complex" evidence=ISS] [GO:0006366 "transcription from RNA
polymerase II promoter" evidence=ISS] [GO:0006468 "protein
phosphorylation" evidence=ISS] [GO:0004672 "protein kinase
activity" evidence=ISS] InterPro:IPR000684 Pfam:PF05001
PROSITE:PS00115 GO:GO:0003677 GO:GO:0006468 GO:GO:0006366
GO:GO:0003899 GO:GO:0005665 EMBL:M19538 PIR:A27677
ProteinModelPortal:P11414 Uniprot:P11414
Length = 467
Score = 184 (69.8 bits), Expect = 2.2e-11, P = 2.2e-11
Identities = 77/263 (29%), Positives = 101/263 (38%)
Query: 132 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTS---TSAYAATQ 188
GA G S +A D G G+ P S T GP++ + A +
Sbjct: 29 GAAGRSGMTPGAAGFSPSAASDASGFSPGYSPAWSPTPGSPGSPGPSSPYIPSPGGAMSP 88
Query: 189 SGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQ 248
S +P AY+ PR PG + P Y + +PSY PT PSY P P Y PT P Y
Sbjct: 89 SYSPTSPAYE-PRSPGGYTPQSPSYSPT-SPSYSPTS-PSYSPTS-PNYSPTS-PSYSPT 143
Query: 249 KGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPV 308
S Y + P+Y PSY P Y P+Y P Y P Y P
Sbjct: 144 SPS-Y-SPTSPSYS-PTSPSYSPTSP-SYS-PTSPSYS-PTSPSYSPTS-PSYS-PTSPS 195
Query: 309 YEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGA--APHGQVPPPL 366
Y + +PSY P P Y Y +PSY P+ + +P + +P+ P
Sbjct: 196 Y-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPTSPSYSPTSPSYSPTSPNYSPTSP- 250
Query: 367 NNVPYGSATPPARSGSGQPRGGN 389
N P + P S S P N
Sbjct: 251 NYTPTSPSYSPT-SPSYSPTSPN 272
Score = 165 (63.1 bits), Expect = 3.0e-09, P = 3.0e-09
Identities = 69/236 (29%), Positives = 93/236 (39%)
Query: 118 NAPNVDRRA-DGSYGGATG---NSENETSGRPVGQN-AYEDGYGVPQGHGP--PPSATTA 170
N P + G GA G ++ ++ SG G + A+ G P GP P +
Sbjct: 24 NIPGLGAAGRSGMTPGAAGFSPSAASDASGFSPGYSPAWSPTPGSPGSPGPSSPYIPSPG 83
Query: 171 GVVGAGPNTSTSAYAATQSG--TPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPS 228
G + + ++ AY G TP +Y P P Y + P Y + +P+Y PT PS
Sbjct: 84 GAMSPSYSPTSPAYEPRSPGGYTPQSPSYS-PTSPSYSPTS-PSYSPT-SPNYSPTS-PS 139
Query: 229 YDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ 288
Y P P Y PT P Y S Y + P+Y PSY P Y P+Y
Sbjct: 140 YSPTS-PSYSPTS-PSYSPTSPS-Y-SPTSPSYS-PTSPSYSPTSP-SYS-PTSPSYS-P 191
Query: 289 RGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPS 344
P Y P Y P Y + +PSY P P Y Y +PSY P+
Sbjct: 192 TSPSYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPT 241
Score = 121 (47.7 bits), Expect = 0.00023, P = 0.00023
Identities = 63/225 (28%), Positives = 80/225 (35%)
Query: 163 PPPSATTAGVVGAGPNTSTSAYAATQSG-TPMRAAYDIPRGPGYEASKGPGYDAS----- 216
P S T+ PN Y T +P +Y P P Y + P Y S
Sbjct: 257 PSYSPTSPSYSPTSPN-----YTPTSPNYSPTSPSYS-PTSPSYSPTS-PSYSPSSPRYT 309
Query: 217 -KAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL 275
++P+Y P+ PSY P+ P Y PT P Y S Y P Y P Y P
Sbjct: 310 PQSPTYTPSS-PSYSPSS-PSYSPTS-PKYTPTSPS-YSPS-SPEYT-PTSPKYSPTSPK 363
Query: 276 GYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDM 335
Y P Y P Y + P Y P Y + +P Y P P Y Y
Sbjct: 364 -YS-PTSPKYS-PTSPTY-SPTTPKYS-PTSPTY-SPTSPVYTPT-SPKYS-PTSPTYSP 415
Query: 336 RRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARS 380
+P Y P+ T +P+G+ P P S T PA S
Sbjct: 416 T-SPKYSPTSPTYSPTSPKGSTYSPTSPGYSPTSPTYSLTSPAIS 459
>UNIPROTKB|A4II09 [details] [associations]
symbol:eif3a "Eukaryotic translation initiation factor 3
subunit A" species:8364 "Xenopus (Silurana) tropicalis" [GO:0001732
"formation of translation initiation complex" evidence=ISS]
[GO:0005852 "eukaryotic translation initiation factor 3 complex"
evidence=ISS] [GO:0003743 "translation initiation factor activity"
evidence=ISS] InterPro:IPR000717 Pfam:PF01399 SMART:SM00088
GO:GO:0003743 GO:GO:0005852 eggNOG:NOG236708 HOGENOM:HOG000246822
KO:K03254 HAMAP:MF_03000 HOVERGEN:HBG006128 GO:GO:0001732 CTD:8661
EMBL:BC135790 RefSeq:NP_001096173.1 UniGene:Str.55518 STRING:A4II09
PRIDE:A4II09 GeneID:100124719 KEGG:xtr:100124719
Xenbase:XB-GENE-994394 Uniprot:A4II09
Length = 1391
Score = 186 (70.5 bits), Expect = 8.8e-11, Sum P(2) = 8.8e-11
Identities = 68/224 (30%), Positives = 101/224 (45%)
Query: 156 GVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGP--GYEASKGP-- 211
G+ + GP AG G + R +D RGP G++ +GP
Sbjct: 981 GLEEDRGPRRGIDDAGP-RRGFEEDRGPRRGIEDDRAPRRGFDDDRGPRRGFDDDRGPRR 1039
Query: 212 GYDASKAP--SYDPTKGP--SYDPAKGP--GYDPTKGP--GYDAQKGSN--YDAQRGPN- 260
G+D + P D +GP +D + P G+D +GP G+D +G +D RGP
Sbjct: 1040 GFDEDRGPRRGIDDDRGPRRGFDEDRTPRRGFDDDRGPRRGFDDDRGPRRGFDEDRGPRR 1099
Query: 261 -YDIHRGP--SYDPQRG--LGYDMQRGPN--YDMQRGP--GYETQRVP--GYDVQRGPVY 309
++ RGP ++ RG G++ RGP ++ RGP G+E R P G+D RGP
Sbjct: 1100 GFEDDRGPRRGFEDDRGPRRGFEDDRGPRRGFEDDRGPRRGFEDDRGPRRGFDEDRGP-- 1157
Query: 310 EAQRAPSYIPQRGP--GYDLQRG--QGYDMRRAP--SYDPSRGT 347
+R + RGP G+D R +G+D R P D RG+
Sbjct: 1158 --RRG--FEDDRGPRRGFDEDRTPRRGFDDDRGPRRGLDEDRGS 1197
Score = 183 (69.5 bits), Expect = 1.9e-10, Sum P(2) = 1.9e-10
Identities = 65/191 (34%), Positives = 92/191 (48%)
Query: 194 RAAYDIPRGP--GYEASKGP--GYDASKAP--SYDPTKGP--SYDPAKGP--GYDPTKGP 243
R ++ RGP G E + P G+D + P +D +GP +D +GP G D +GP
Sbjct: 998 RRGFEEDRGPRRGIEDDRAPRRGFDDDRGPRRGFDDDRGPRRGFDEDRGPRRGIDDDRGP 1057
Query: 244 GYDAQKGSNYDAQRGPN--YDIHRGP--SYDPQRG--LGYDMQRGPN--YDMQRGP--GY 293
++G +D R P +D RGP +D RG G+D RGP ++ RGP G+
Sbjct: 1058 ----RRG--FDEDRTPRRGFDDDRGPRRGFDDDRGPRRGFDEDRGPRRGFEDDRGPRRGF 1111
Query: 294 ETQRVP--GYDVQRGPV--YEAQRAP--SYIPQRGP--GYDLQRG--QGYDMRRAPSYDP 343
E R P G++ RGP +E R P + RGP G+D RG +G++ R P
Sbjct: 1112 EDDRGPRRGFEDDRGPRRGFEDDRGPRRGFEDDRGPRRGFDEDRGPRRGFEDDRGPR--- 1168
Query: 344 SRGTGFDGAPR 354
RG D PR
Sbjct: 1169 -RGFDEDRTPR 1178
Score = 167 (63.8 bits), Expect = 1.2e-08, Sum P(2) = 1.2e-08
Identities = 71/225 (31%), Positives = 103/225 (45%)
Query: 200 PRGPGYEASKGP--GYDASKAP--SYDPTKGP--SYDPAKGP--GYDPTKGP--GYDAQK 249
PR G++ + P G+D + P +D +GP +D +GP G++ +GP G++ +
Sbjct: 1057 PRR-GFDEDRTPRRGFDDDRGPRRGFDDDRGPRRGFDEDRGPRRGFEDDRGPRRGFEDDR 1115
Query: 250 GSN--YDAQRGPN--YDIHRGP--SYDPQRG--LGYDMQRGPN--YDMQRGP--GYETQR 297
G ++ RGP ++ RGP ++ RG G+D RGP ++ RGP G++ R
Sbjct: 1116 GPRRGFEDDRGPRRGFEDDRGPRRGFEDDRGPRRGFDEDRGPRRGFEDDRGPRRGFDEDR 1175
Query: 298 VP--GYDVQRGPV--YEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 353
P G+D RGP + R S+ RG G D+ R +G D R P RG D P
Sbjct: 1176 TPRRGFDDDRGPRRGLDEDRG-SW---RG-GDDVPR-RGADDDRGPR----RGADDDRGP 1225
Query: 354 RGAAPHGQVP--PPLNNVPYG-SATPPARSGS-GQPRGGN-PARR 393
R Q P P + P G AR S G PR P R
Sbjct: 1226 RRGEDRDQTPWKPMAASRPGGWREREKAREDSWGPPRDSQAPEER 1270
Score = 150 (57.9 bits), Expect = 8.8e-07, Sum P(2) = 8.8e-07
Identities = 82/301 (27%), Positives = 122/301 (40%)
Query: 82 EYEKKFYNDHLESLQVMEKNYITMATEVE---KLRAELMNAPNVDRRADGSYGGATGNSE 138
E E++ Y + L+ L+ E+ E+E + R E + R D S G E
Sbjct: 838 EAEQRDYQERLKKLEEQERKKRQRELEIEERERKREEERRGGDDTFRKDSSRWG-----E 892
Query: 139 NETSGRPVGQNAYEDGYGVPQG---HGPPPSATTAGVVGAG-PNTSTSAYAATQSGTPMR 194
E SG G + E P+ G P S + +A + R
Sbjct: 893 REESGWRRGADPDERKQVPPERDWRRGGPDSKPVINEDASNREEDENAALRKDEEQVSSR 952
Query: 195 AAYDIPRGPGYEASKGPGY-DASKAPS--YDPTKGP--SYDPAKGP--GYDPTKGPGYDA 247
A + P + KG + D + P + +GP D A GP G++ +GP
Sbjct: 953 AFEEKVSLPDADEEKGGSWRDEDRGPKRGLEEDRGPRRGIDDA-GPRRGFEEDRGP---- 1007
Query: 248 QKGSNYDAQRGPNYDIHRGP--SYDPQRG--LGYDMQRGPN--YDMQRGP--GYETQRVP 299
++G D +D RGP +D RG G+D RGP D RGP G++ R P
Sbjct: 1008 RRGIEDDRAPRRGFDDDRGPRRGFDDDRGPRRGFDEDRGPRRGIDDDRGPRRGFDEDRTP 1067
Query: 300 --GYDVQRGPVYEAQRAPSYIPQRGP--GYDLQRG--QGYDMRRAP--SYDPSRGT--GF 349
G+D RGP +R + RGP G+D RG +G++ R P ++ RG GF
Sbjct: 1068 RRGFDDDRGP----RRG--FDDDRGPRRGFDEDRGPRRGFEDDRGPRRGFEDDRGPRRGF 1121
Query: 350 D 350
+
Sbjct: 1122 E 1122
Score = 46 (21.3 bits), Expect = 8.8e-11, Sum P(2) = 8.8e-11
Identities = 10/35 (28%), Positives = 21/35 (60%)
Query: 75 HLCR-GTYEYEKKFYNDHLESLQVMEKNYITMATE 108
HL + G Y+Y+ +++SL+ + + Y+ +A E
Sbjct: 65 HLAKEGLYQYKNICQQVNIKSLEDVVRAYLKLAEE 99
>UNIPROTKB|A2VD00 [details] [associations]
symbol:eif3a "Eukaryotic translation initiation factor 3
subunit A" species:8355 "Xenopus laevis" [GO:0001732 "formation of
translation initiation complex" evidence=ISS] [GO:0005852
"eukaryotic translation initiation factor 3 complex" evidence=ISS]
[GO:0003743 "translation initiation factor activity" evidence=ISS]
InterPro:IPR000717 Pfam:PF01399 SMART:SM00088 GO:GO:0003743
GO:GO:0005852 KO:K03254 HAMAP:MF_03000 HOVERGEN:HBG006128
GO:GO:0001732 EMBL:BC129055 RefSeq:NP_001085285.1 UniGene:Xl.57279
PRIDE:A2VD00 GeneID:443632 KEGG:xla:443632 Uniprot:A2VD00
Length = 1424
Score = 184 (69.8 bits), Expect = 1.6e-10, Sum P(2) = 1.6e-10
Identities = 66/197 (33%), Positives = 90/197 (45%)
Query: 194 RAAYDIPRGP--GYEASKGP--GYDASKAP--SYDPTKGP--SYDPAKGP--GYDPTKGP 243
R D RGP G + +GP G D + P D +GP +D +GP G+D +GP
Sbjct: 1030 RRGLDEDRGPRRGLDEDRGPRRGLDEDRGPRRGLDEDRGPRRGFDEDRGPRRGFDEDRGP 1089
Query: 244 GYDAQKGSNYDAQRGPN--YDIHRGP--SYDPQRG--LGYDMQRGPN--YDMQRGP--GY 293
D +D RGP +D RGP +D RG G+D RGP +D RGP G+
Sbjct: 1090 RRD------FDEDRGPRRGFDEDRGPRRGFDEDRGPRRGFDEDRGPRRGFDDDRGPRRGF 1143
Query: 294 ETQRVP--GYDVQRGPV--YEAQRAP--SYIPQRGP--GYDLQRG--QGYDMRRAP--SY 341
+ R P G++ RGP +E R P + RGP G+D R +G++ R P
Sbjct: 1144 DDDRGPRRGFEDDRGPRRGFEDDRGPRRGFEDDRGPRRGFDEDRTPRRGFEDDRGPRRGM 1203
Query: 342 DPSRGTGFDGAPRGAAP 358
D R + GA P
Sbjct: 1204 DEERVSWRGGAEEDRGP 1220
Score = 184 (69.8 bits), Expect = 1.6e-10, Sum P(2) = 1.6e-10
Identities = 73/232 (31%), Positives = 104/232 (44%)
Query: 152 EDGYGVPQGHGPPPSATT-AGVVGAGPNTSTSAYAATQSG---TP-MRAAYDIPRGP--G 204
+D V +G G A + G GP S + G P R ++ +GP G
Sbjct: 943 KDEEQVARGDGDEERAASWRGTDDRGPKRSVEEDGGPRRGFNDEPGPRRGFEDDQGPRRG 1002
Query: 205 YEASKGP--GYDASKAP--SYDPTKGP--SYDPAKGP--GYDPTKGP--GYDAQKGSN-- 252
+ +GP G D + P D +GP D +GP G D +GP G D +G
Sbjct: 1003 LDEDRGPRRGLDEDRGPRRGLDEDRGPRRGLDEDRGPRRGLDEDRGPRRGLDEDRGPRRG 1062
Query: 253 YDAQRGPN--YDIHRGPSYDPQRGLGYDMQRGP--NYDMQRGP--GYETQRVP--GYDVQ 304
D RGP +D RGP +RG +D RGP ++D RGP G++ R P G+D
Sbjct: 1063 LDEDRGPRRGFDEDRGP----RRG--FDEDRGPRRDFDEDRGPRRGFDEDRGPRRGFDED 1116
Query: 305 RGPV--YEAQRAP--SYIPQRGP--GYDLQRG--QGYDMRRAP--SYDPSRG 346
RGP ++ R P + RGP G+D RG +G++ R P ++ RG
Sbjct: 1117 RGPRRGFDEDRGPRRGFDDDRGPRRGFDDDRGPRRGFEDDRGPRRGFEDDRG 1168
Score = 159 (61.0 bits), Expect = 9.5e-08, Sum P(2) = 9.5e-08
Identities = 61/197 (30%), Positives = 91/197 (46%)
Query: 194 RAAYDIPRGP--GYEASKGP--GYDASKAP--SYDPTKGP--SYDPAKGP--GYDPTKGP 243
R +D RGP ++ +GP G+D + P +D +GP +D +GP G+D +GP
Sbjct: 1080 RRGFDEDRGPRRDFDEDRGPRRGFDEDRGPRRGFDEDRGPRRGFDEDRGPRRGFDDDRGP 1139
Query: 244 GYDAQKGSNYDAQRGPN--YDIHRGPSYDPQRGLGYDMQRGPN--YDMQRGP--GYETQR 297
++G +D RGP ++ RGP +RG ++ RGP ++ RGP G++ R
Sbjct: 1140 ----RRG--FDDDRGPRRGFEDDRGP----RRG--FEDDRGPRRGFEDDRGPRRGFDEDR 1187
Query: 298 VP--GYDVQRGPV--YEAQRAP---SYIPQRGPGYDLQRGQGYDMRRAPSYD--PSRGTG 348
P G++ RGP + +R RGP + +G RR D P RG
Sbjct: 1188 TPRRGFEDDRGPRRGMDEERVSWRGGAEEDRGPRRGAEEDRG--PRRGAEEDRGPRRGAE 1245
Query: 349 FDGAPRGAAPH--GQVP 363
D PR A GQ P
Sbjct: 1246 EDRGPRRGAEEDRGQTP 1262
Score = 145 (56.1 bits), Expect = 3.3e-06, Sum P(2) = 3.3e-06
Identities = 83/298 (27%), Positives = 119/298 (39%)
Query: 82 EYEKKFYNDHLESLQVMEKNYITMATEVE---KLRAELMNAPNVDRRADGSYGGATGNSE 138
E E++ Y + L+ L+ E+ E+E K R E P+ R + + +
Sbjct: 838 EAEQREYQERLKKLEEQERKKRLRELEIEEREKKRDEERRGPDDSFRKQDT---PSRWGD 894
Query: 139 NETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYD 198
E SG G + E + PP +G + P + +
Sbjct: 895 REESGWRRGADPDE------RKQAPPERDWRSGGQDSKP-VKDEDREGDEDSVLRKDEEQ 947
Query: 199 IPRGPGYE--ASKGPGYDASKAP--SYDPTKGP--SYDPAKGP--GYDPTKGP--GYDAQ 248
+ RG G E A+ G D + P S + GP ++ GP G++ +GP G D
Sbjct: 948 VARGDGDEERAASWRGTD-DRGPKRSVEEDGGPRRGFNDEPGPRRGFEDDQGPRRGLDED 1006
Query: 249 KGSN--YDAQRGPNYDIHRGPSYD--PQRGLGYDMQRGPN--YDMQRGP--GYETQRVP- 299
+G D RGP RG D P+RGL D RGP D RGP G + R P
Sbjct: 1007 RGPRRGLDEDRGPR----RGLDEDRGPRRGL--DEDRGPRRGLDEDRGPRRGLDEDRGPR 1060
Query: 300 -GYDVQRGPV--YEAQRAP--SYIPQRGP--GYDLQRG--QGYDMRRAP--SYDPSRG 346
G D RGP ++ R P + RGP +D RG +G+D R P +D RG
Sbjct: 1061 RGLDEDRGPRRGFDEDRGPRRGFDEDRGPRRDFDEDRGPRRGFDEDRGPRRGFDEDRG 1118
Score = 46 (21.3 bits), Expect = 1.6e-10, Sum P(2) = 1.6e-10
Identities = 10/35 (28%), Positives = 21/35 (60%)
Query: 75 HLCR-GTYEYEKKFYNDHLESLQVMEKNYITMATE 108
HL + G Y+Y+ +++SL+ + + Y+ +A E
Sbjct: 65 HLAKEGLYQYKNICQQVNIKSLEDVVRAYLKLAEE 99
>WB|WBGene00020550 [details] [associations]
symbol:T17H7.1 species:6239 "Caenorhabditis elegans"
[GO:0019915 "lipid storage" evidence=IMP] [GO:0009792 "embryo
development ending in birth or egg hatching" evidence=IMP]
GO:GO:0009792 GO:GO:0019915 InterPro:IPR003677 Pfam:PF02520
EMBL:FO080638 PIR:T28899 RefSeq:NP_497250.1
ProteinModelPortal:Q22537 PaxDb:Q22537 EnsemblMetazoa:T17H7.1
GeneID:175228 KEGG:cel:CELE_T17H7.1 UCSC:T17H7.1 CTD:175228
WormBase:T17H7.1 eggNOG:NOG271901 GeneTree:ENSGT00700000104820
HOGENOM:HOG000020548 InParanoid:Q22537 OMA:GRGQGPD NextBio:887312
Uniprot:Q22537
Length = 682
Score = 172 (65.6 bits), Expect = 9.4e-10, P = 9.4e-10
Identities = 75/273 (27%), Positives = 101/273 (36%)
Query: 125 RADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAY 184
R DG G G +N G G+ +G P + + + GP++ S
Sbjct: 229 RGDGP-GFVPGTQDNNQRGS--GERGQRQNFG-PSDNLTNGNQFSKKQFARGPSSMNSDL 284
Query: 185 AATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKG-PGYDPTKGP 243
+ + + +D PRGPG +G G D +GP + P PG + GP
Sbjct: 285 SENSQHSDSNSQFDFPRGPGGRGGRGQGPDFGPGGQGGRGQGPDFGPQDDFPGRRGSGGP 344
Query: 244 GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRG-LGYDMQRGPNYDM--QRG---PGYETQR 297
G +G D + ++ RG +RG G GP D +RG PG R
Sbjct: 345 GGRGGRGQGPDFEPQDDFPGRRGSGGPGRRGGRGQGPDFGPQDDFPGRRGSGGPGGRGGR 404
Query: 298 VPGYDVQRGPVYEAQRAPSYIPQRGPGYDL--QRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
G D GP + R P GP D +RG G P RG D P G
Sbjct: 405 GQGPDF--GPGRQGGRGQG--PDFGPQDDFSGRRGSG-----GPGGRGGRGQEPDFGPGG 455
Query: 356 AAPHGQVPP--PLNNVP--YGSATPPARSGSGQ 384
GQ P P ++ P GS P R G GQ
Sbjct: 456 QGGRGQGPDFGPQDDFPGRRGSGGPEGRDGRGQ 488
Score = 139 (54.0 bits), Expect = 4.1e-06, P = 4.1e-06
Identities = 76/265 (28%), Positives = 93/265 (35%)
Query: 131 GGATGNSENETSGRPVGQNAYEDG--YGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQ 188
GG G + G P GQ G +G PQ P + G G G + Q
Sbjct: 304 GGRGGRGQGPDFG-PGGQGGRGQGPDFG-PQDDFPGRRGS-GGPGGRGGRGQGPDFEP-Q 359
Query: 189 SGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP-GYDPTKGPGYDA 247
P R GPG +G G D + +G +G G P GPG
Sbjct: 360 DDFPGRRGSG---GPGRRGGRGQGPDFGPQDDFPGRRGSGGPGGRGGRGQGPDFGPGRQG 416
Query: 248 QKGSNYDAQRGPNYDI--HRGPSYDPQRG-LGYDMQRGPNYDMQRG--PGYETQR-VPGY 301
+G D GP D RG RG G + GP RG P + Q PG
Sbjct: 417 GRGQGPDF--GPQDDFSGRRGSGGPGGRGGRGQEPDFGPGGQGGRGQGPDFGPQDDFPGR 474
Query: 302 DVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQ 361
GP E + P GPG RGQ D ++ RG+G G RG P
Sbjct: 475 RGSGGP--EGRDGRGQGPDFGPGSQGGRGQDSDSGSQDAFPGRRGSGGPGG-RGQGPDFG 531
Query: 362 VPPPLNNVP--YGSATPPARSGSGQ 384
P ++ P GS P R G GQ
Sbjct: 532 ---PQDDFPGRRGSGGPEGRDGRGQ 553
Score = 120 (47.3 bits), Expect = 0.00051, P = 0.00051
Identities = 72/265 (27%), Positives = 94/265 (35%)
Query: 124 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVP-----QGHGPPPSATTAGVVGAGPN 178
RR G G G + G P G G P +G GP G G GP+
Sbjct: 365 RRGSGGPGRRGGRGQGPDFG-PQDDFPGRRGSGGPGGRGGRGQGPDFGPGRQGGRGQGPD 423
Query: 179 TSTSA-YAATQ-SGTPM-RAA--YDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAK 233
++ + SG P R + GPG + +G G D + +G +
Sbjct: 424 FGPQDDFSGRRGSGGPGGRGGRGQEPDFGPGGQGGRGQGPDFGPQDDFPGRRGSGGPEGR 483
Query: 234 -GPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDM--QRG 290
G G P GPG +G + D+ + RG RG G D GP D +RG
Sbjct: 484 DGRGQGPDFGPGSQGGRGQDSDSGSQDAFPGRRGSGGPGGRGQGPDF--GPQDDFPGRRG 541
Query: 291 PGYETQRV---------PGYDVQRGPVYEAQRAPSYIPQRGPGYD--LQ-RGQGYDMRRA 338
G R PG RG ++ ++ +RGPG L RGQG D
Sbjct: 542 SGGPEGRDGRGQGPDFGPGSQGGRGQDSDSGSQDAFPGRRGPGGPGGLGGRGQGPDF--G 599
Query: 339 PSYDPSRGTGFDGAPRGAAPHGQVP 363
P RG G D R GQ P
Sbjct: 600 PGGQGDRGQGPDFGARSQGNRGQGP 624
Score = 118 (46.6 bits), Expect = 0.00084, P = 0.00084
Identities = 62/240 (25%), Positives = 86/240 (35%)
Query: 124 RRADGSYGGATGNSENETSGRPVGQNAYEDG--YGVPQ-------GHGPPPSATTAGV-V 173
RR G GG G + G P GQ G +G PQ G G P G
Sbjct: 433 RRGSGGPGGRGGRGQEPDFG-PGGQGGRGQGPDFG-PQDDFPGRRGSGGPEGRDGRGQGP 490
Query: 174 GAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAK 233
GP + + SG+ + A+ RG G +G G D + +G +
Sbjct: 491 DFGPGSQGGRGQDSDSGS--QDAFPGRRGSGGPGGRGQGPDFGPQDDFPGRRGSGGPEGR 548
Query: 234 -GPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQ----RGLGYDMQRGPNYDMQ 288
G G P GPG +G + D+ + RGP P RG G D G D
Sbjct: 549 DGRGQGPDFGPGSQGGRGQDSDSGSQDAFPGRRGPG-GPGGLGGRGQGPDFGPGGQGDRG 607
Query: 289 RGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTG 348
+GP + R G Q GP +E+++ + Q+ M +D S G G
Sbjct: 608 QGPDFGA-RSQGNRGQ-GPAFESRQPRQFDDADTSSAPSQKYFQRQMNSGMRFDQSSGFG 665
>FB|FBgn0028573 [details] [associations]
symbol:prc "pericardin" species:7227 "Drosophila
melanogaster" [GO:0005605 "basal lamina" evidence=NAS] [GO:0007507
"heart development" evidence=IMP;TAS] [GO:0005578 "proteinaceous
extracellular matrix" evidence=IDA] [GO:0035088 "establishment or
maintenance of apical/basal cell polarity" evidence=TAS]
[GO:0016477 "cell migration" evidence=TAS] [GO:0002009
"morphogenesis of an epithelium" evidence=TAS] GO:GO:0002009
GO:GO:0007507 GO:GO:0005578 FlyBase:FBgn0028573 InterPro:IPR009765
Pfam:PF07054 EMBL:AF203342 STRING:Q9U617 PRIDE:Q9U617
InParanoid:Q9U617 ArrayExpress:Q9U617 Bgee:Q9U617 Uniprot:Q9U617
Length = 1729
Score = 171 (65.3 bits), Expect = 4.0e-09, P = 4.0e-09
Identities = 81/274 (29%), Positives = 98/274 (35%)
Query: 130 YGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGV---VGAG-PNTSTSAYA 185
YG G +G+P G + G G G G P T G+ GAG P T
Sbjct: 417 YGTQPGIGGQTGAGQP-GYGT-QPGIGAQTGAGQPGYGTQPGIGGQTGAGQPGYGTQPGI 474
Query: 186 ATQSGTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKG-PSYDPAKGPGYDPTKG- 242
Q+G + Y G G + G PGY + G P Y G G G
Sbjct: 475 GVQTGAG-QPGYGSQPGIGAQTGAGQPGYGSQPGIGGQTGAGQPGYGSQPGIGGQTGAGQ 533
Query: 243 PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRGPGYETQR-VPG 300
PGY Q G AQ G P Y Q G+G G P Y Q G G +T PG
Sbjct: 534 PGYGTQPGIG--AQTGAGQ-----PGYGSQPGIGGQTGAGQPGYGSQPGIGGQTGAGQPG 586
Query: 301 YDVQRGPVYEAQRA-PSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGT-GFDGAPRGAA 357
Y Q G + P Y Q G G + GQ GY + G G+ P
Sbjct: 587 YGTQPGVGAQTGTGQPGYGSQPGVGTQIGAGQPGYGSQPGIGGQTGAGQPGYGSQPGIGG 646
Query: 358 PHGQVPPPLNNVPYGSATPPARSGSGQPR-GGNP 390
G P P G A++G+GQP G P
Sbjct: 647 QTGAAQPGYGTQP-GVG---AQTGTGQPGYGAQP 676
Score = 169 (64.5 bits), Expect = 6.7e-09, P = 6.7e-09
Identities = 86/271 (31%), Positives = 99/271 (36%)
Query: 130 YGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPS--ATTAGVVGAGPNTSTSAYAAT 187
YGG G S G+P G G+P G+G P A TA V G T
Sbjct: 876 YGGQPGISGQTGGGQP-GYGGQATISGLP-GYGTQPGIGALTA-VPGGHYGYETQPGIGG 932
Query: 188 QSGTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYD 246
Q+GT P G G + G PGY P G S + PGY G G
Sbjct: 933 QTGTNQPGFGGQP-GIGGQTGAGQPGYGFIGQPGIGGQTGTS---GRQPGYGTQPGIGGQ 988
Query: 247 AQKGS-NYDAQRGPNYDIHRG-PSYDPQRGLGYDMQRG-PNYDMQRGPGYETQR-VPGYD 302
G Y +Q G G P Y Q G+G + G P Y Q G G +T PGY
Sbjct: 989 TAAGQPGYGSQPGIGGQTGAGQPGYGSQTGVGGQIGAGQPGYGSQPGIGGQTGAGQPGYG 1048
Query: 303 VQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQV 362
Q G + Q P Y Q PG Q G G P Y G G G P V
Sbjct: 1049 AQPG--FGGQ--PGYGNQ--PGVGGQTGAGQ-----PGYGSQPGVG--GQTGAGQPGYGV 1095
Query: 363 PPPLNNVP-YGSATPPARSG-SGQPR-GGNP 390
P P G T + G GQP GG+P
Sbjct: 1096 IPGFGGQPGIGGQTAAGKPGYGGQPGIGGSP 1126
Score = 164 (62.8 bits), Expect = 2.4e-08, P = 2.4e-08
Identities = 84/270 (31%), Positives = 99/270 (36%)
Query: 131 GGATGNSENETSGRPV--GQN-AYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAAT 187
GG TG + +P GQ A + GYG G G T AG G G T A T
Sbjct: 390 GGQTGPGQPGYGSQPGIGGQTGAGQPGYGTQPGIG---GQTGAGQPGYG--TQPGIGAQT 444
Query: 188 QSGTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKG-PSYDPAKGPGYDPTKG-PG 244
+G P Y G G + G PGY G P Y G G G PG
Sbjct: 445 GAGQP---GYGTQPGIGGQTGAGQPGYGTQPGIGVQTGAGQPGYGSQPGIGAQTGAGQPG 501
Query: 245 YDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRGPGYETQR-VPGYD 302
Y +Q G Q G P Y Q G+G G P Y Q G G +T PGY
Sbjct: 502 YGSQPGIG--GQTGAGQ-----PGYGSQPGIGGQTGAGQPGYGTQPGIGAQTGAGQPGYG 554
Query: 303 VQRGPVYEAQRA-PSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAAPHG 360
Q G + P Y Q G G GQ GY + P GTG G G+ P
Sbjct: 555 SQPGIGGQTGAGQPGYGSQPGIGGQTGAGQPGYGTQ--PGVGAQTGTGQPGY--GSQPGV 610
Query: 361 QVPPPLNNVPYGSATP-PARSGSGQPRGGN 389
YGS ++G+GQP G+
Sbjct: 611 GTQIGAGQPGYGSQPGIGGQTGAGQPGYGS 640
Score = 154 (59.3 bits), Expect = 3.0e-07, P = 3.0e-07
Identities = 78/247 (31%), Positives = 90/247 (36%)
Query: 131 GGATGNSENETS-G-RPV--GQNAY-EDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYA 185
GG TG S + G +P GQ A + GYG G G T AG G G T
Sbjct: 967 GGQTGTSGRQPGYGTQPGIGGQTAAGQPGYGSQPGIG---GQTGAGQPGYGSQTGVGGQI 1023
Query: 186 ATQSGTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPG 244
+G P Y G G + G PGY A P + G P G G PG
Sbjct: 1024 G--AGQP---GYGSQPGIGGQTGAGQPGYGAQ--PGFGGQPGYGNQPGVG-GQTGAGQPG 1075
Query: 245 YDAQKGSNYDAQRG-PNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRG----PGYETQRV 298
Y +Q G G P Y + P + Q G+G G P Y Q G P Y TQ+
Sbjct: 1076 YGSQPGVGGQTGAGQPGYGVI--PGFGGQPGIGGQTAAGKPGYGGQPGIGGSPVYGTQQG 1133
Query: 299 PG--YDVQRG-PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRA-PSYDPSRGTGFDGAP- 353
G + G P Y Q P PGY G G P Y P G GAP
Sbjct: 1134 TGGPSGISGGQPGYGTQ--PGQTGAGQPGYGSLPGTGGQATAGQPGYGPGSQPGIGGAPV 1191
Query: 354 RGAAPHG 360
G P G
Sbjct: 1192 YGTQPGG 1198
Score = 154 (59.3 bits), Expect = 3.0e-07, P = 3.0e-07
Identities = 86/280 (30%), Positives = 100/280 (35%)
Query: 131 GGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAG-PNTSTSAYAATQS 189
GG TG + +P G + G G P G+G P G G G P T Q+
Sbjct: 339 GGQTGAGQPGYGTQP-GIGG-QTGAGQP-GYGTQPGI--GGQTGPGQPGYGTQPGIGGQT 393
Query: 190 GTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKG-PSYDPAKGPGYDPTKG-PGYD 246
G P + Y G G + G PGY G P Y G G G PGY
Sbjct: 394 G-PGQPGYGSQPGIGGQTGAGQPGYGTQPGIGGQTGAGQPGYGTQPGIGAQTGAGQPGYG 452
Query: 247 AQKGSNYDAQRG-PNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRG-PGYETQRVPGYDV 303
Q G G P Y G G GY Q G G PGY +Q PG
Sbjct: 453 TQPGIGGQTGAGQPGYGTQPGIGVQTGAGQPGYGSQPGIGAQTGAGQPGYGSQ--PGIGG 510
Query: 304 QRG---PVYEAQRAPSYIPQRG---PGYDLQRGQGYDMRRA-PSYDPSRGTGFD-GAPRG 355
Q G P Y +Q P Q G PGY Q G G P Y G G GA G
Sbjct: 511 QTGAGQPGYGSQ--PGIGGQTGAGQPGYGTQPGIGAQTGAGQPGYGSQPGIGGQTGA--G 566
Query: 356 AAPHGQVPPPLNNVPYGS---ATPP---ARSGSGQPRGGN 389
+G P G T P A++G+GQP G+
Sbjct: 567 QPGYGSQPGIGGQTGAGQPGYGTQPGVGAQTGTGQPGYGS 606
Score = 151 (58.2 bits), Expect = 6.3e-07, P = 6.3e-07
Identities = 85/282 (30%), Positives = 100/282 (35%)
Query: 120 PNVDRRADGSYGGATGNSENETS--GRPVGQN-AYEDGYGVPQGHGPPPSATTAGVVGAG 176
P+ R D S G G ++ S G GQ A + GYG G G T G G G
Sbjct: 107 PSSGRILDASGSGGIGRPDSIISLPGGVGGQTGAGQPGYGSQPGIG---GQTATGQPGYG 163
Query: 177 PNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKG-PSYDPAKG 234
A A +G P Y G G + G PGY + G P Y G
Sbjct: 164 SQLGVGAQAG--AGQP---GYGAQPGVGAQTGAGQPGYGSQTGIGGQTGAGQPGYGSQPG 218
Query: 235 PGYDPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRGPG 292
G G PGY +Q G Q G P Y Q G+G G P Y Q G G
Sbjct: 219 IGGQTGAGQPGYGSQPGIG--GQTGAGQ-----PGYGSQPGIGGQTGAGQPGYGSQPGIG 271
Query: 293 YETQR-VPGYDVQRGPVYEAQRA-PSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGTGF 349
+T PGY Q G + P Y Q G G GQ GY + P G G
Sbjct: 272 GQTGAGQPGYGSQPGIGGQTGAGQPGYGTQPGIGGQTGAGQPGYGSQ--PGIGGQTGAGQ 329
Query: 350 DGAPRGAAPHGQVPPPLNNVPYGSATPPA---RSGSGQPRGG 388
G GQ YG T P ++G+GQP G
Sbjct: 330 PGYGSQPGIGGQTGA--GQPGYG--TQPGIGGQTGAGQPGYG 367
Score = 142 (55.0 bits), Expect = 6.1e-06, P = 6.1e-06
Identities = 85/297 (28%), Positives = 102/297 (34%)
Query: 120 PNVDRRADGS---YGGATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGA 175
P + + G YGG S G G A VP GH G G G
Sbjct: 880 PGISGQTGGGQPGYGGQATISGLPGYGTQPGIGALT---AVPGGHYGYETQPGIGGQTGT 936
Query: 176 G-PNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKG 234
P Q+G + Y PG G + + P Y G A G
Sbjct: 937 NQPGFGGQPGIGGQTGAG-QPGYGFIGQPGIGGQTGT---SGRQPGYGTQPGIGGQTAAG 992
Query: 235 -PGYDPTKGPGYDAQKGS-NYDAQRGPNYDIHRG-PSYDPQRGLGYDMQRG-PNYDMQRG 290
PGY G G G Y +Q G I G P Y Q G+G G P Y Q G
Sbjct: 993 QPGYGSQPGIGGQTGAGQPGYGSQTGVGGQIGAGQPGYGSQPGIGGQTGAGQPGYGAQPG 1052
Query: 291 ----PGYETQRVPGYDVQRG---PVYEAQRAPSYIPQRG---PGYDL------QRGQGYD 334
PGY Q PG Q G P Y +Q P Q G PGY + Q G G
Sbjct: 1053 FGGQPGYGNQ--PGVGGQTGAGQPGYGSQ--PGVGGQTGAGQPGYGVIPGFGGQPGIGGQ 1108
Query: 335 MRRA-PSYDPSRGTGFDGAPRGAAPHGQVPPP-LNNVPYGSATPPARSGSGQPRGGN 389
P Y G G G+P G P ++ G T P ++G+GQP G+
Sbjct: 1109 TAAGKPGYGGQPGIG--GSPVYGTQQGTGGPSGISGGQPGYGTQPGQTGAGQPGYGS 1163
Score = 123 (48.4 bits), Expect = 0.00072, P = 0.00072
Identities = 59/188 (31%), Positives = 68/188 (36%)
Query: 212 GYDASKAPSYDPTKGPSYDPAKG-PGYDPTKGPGYDAQKGS-NYDAQRGPNYDIHRG-PS 268
G + P Y G A G PGY G G A G Y AQ G G P
Sbjct: 136 GQTGAGQPGYGSQPGIGGQTATGQPGYGSQLGVGAQAGAGQPGYGAQPGVGAQTGAGQPG 195
Query: 269 YDPQRGLGYDMQRG-PNYDMQRGPGYETQR-VPGYDVQRGPVYEAQRA-PSYIPQRGPGY 325
Y Q G+G G P Y Q G G +T PGY Q G + P Y Q G G
Sbjct: 196 YGSQTGIGGQTGAGQPGYGSQPGIGGQTGAGQPGYGSQPGIGGQTGAGQPGYGSQPGIGG 255
Query: 326 DLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPA---RSG 381
GQ GY + P G G G GQ P G T P ++G
Sbjct: 256 QTGAGQPGYGSQ--PGIGGQTGAGQPGYGSQPGIGGQTGA---GQP-GYGTQPGIGGQTG 309
Query: 382 SGQPRGGN 389
+GQP G+
Sbjct: 310 AGQPGYGS 317
>ZFIN|ZDB-GENE-041008-78 [details] [associations]
symbol:polr2a "polymerase (RNA) II (DNA directed)
polypeptide A" species:7955 "Danio rerio" [GO:0003677 "DNA binding"
evidence=IEA] [GO:0003899 "DNA-directed RNA polymerase activity"
evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
evidence=IEA] [GO:0005665 "DNA-directed RNA polymerase II, core
complex" evidence=IEA] [GO:0006366 "transcription from RNA
polymerase II promoter" evidence=IEA] [GO:0016740 "transferase
activity" evidence=IEA] [GO:0016779 "nucleotidyltransferase
activity" evidence=IEA] InterPro:IPR000684 InterPro:IPR000722
InterPro:IPR006592 InterPro:IPR007066 InterPro:IPR007073
InterPro:IPR007075 InterPro:IPR007080 InterPro:IPR007081
InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983 Pfam:PF04990
Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000 Pfam:PF05001
PROSITE:PS00115 SMART:SM00663 ZFIN:ZDB-GENE-041008-78 GO:GO:0003677
GO:GO:0006366 Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899
GO:GO:0005665 GeneTree:ENSGT00700000104490 EMBL:AL929346
IPI:IPI00608319 Ensembl:ENSDART00000077495 Bgee:F1Q9K4
Uniprot:F1Q9K4
Length = 1965
Score = 170 (64.9 bits), Expect = 6.0e-09, P = 6.0e-09
Identities = 65/200 (32%), Positives = 82/200 (41%)
Query: 149 NAYEDGYGVPQGHGPPPSATTAGVVGAGPNTS--TSAYAATQSGTPMRAAYDIPRGPG-- 204
+A D G G+ P S T GP + S A + + +P AY+ PR PG
Sbjct: 1546 SAASDASGFSPGYSPAWSPTPGSPGSPGPASPYIPSPGALSPNYSPTSPAYE-PRSPGGG 1604
Query: 205 YEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIH 264
Y + PGY + +PSY PT PSY P P Y PT P Y S Y + P+Y
Sbjct: 1605 Y-TPQSPGYSPT-SPSYSPTS-PSYSPTS-PNYSPTS-PSYSPTSPS-Y-SPTSPSYS-P 1656
Query: 265 RGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPG 324
PSY P Y P+Y P Y P Y P Y + +PSY P P
Sbjct: 1657 TSPSYSPTSP-SYS-PTSPSYS-PTSPSYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPS 1709
Query: 325 YDLQRGQGYDMRRAPSYDPS 344
Y Y +PSY P+
Sbjct: 1710 YS-PTSPSYSPT-SPSYSPT 1727
Score = 131 (51.2 bits), Expect = 0.00011, P = 0.00011
Identities = 67/234 (28%), Positives = 87/234 (37%)
Query: 160 GHGPPPSATTAGVVGAGPNTSTSAYAATQ----SG-TPMRAAYDIPRGPGYEASKGPGYD 214
G P P + + + +T AY A SG TP A + P + PGY
Sbjct: 1501 GSAPSPMSGMSPAMTPWNTGATPAYGAWSPSVGSGMTPGAAGFS-PSAASDASGFSPGYS 1559
Query: 215 A--SKAPSYDPTKGPS--YDPAKG---PGYDPTKGPGYDAQK-GSNYDAQRGPNYDIHRG 266
S P + GP+ Y P+ G P Y PT P Y+ + G Y Q P Y
Sbjct: 1560 PAWSPTPGSPGSPGPASPYIPSPGALSPNYSPTS-PAYEPRSPGGGYTPQ-SPGYS-PTS 1616
Query: 267 PSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYD 326
PSY P Y PNY P Y P Y P Y + +PSY P P Y
Sbjct: 1617 PSYSPTSP-SYS-PTSPNYS-PTSPSYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS 1669
Query: 327 LQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARS 380
Y +PSY P+ + +P +P P + P S T P+ S
Sbjct: 1670 -PTSPSYSPT-SPSYSPTSPSYSPTSP-SYSPTSPSYSPTS--PSYSPTSPSYS 1718
>UNIPROTKB|P71590 [details] [associations]
symbol:fhaA "FHA domain-containing protein FhaA"
species:1773 "Mycobacterium tuberculosis" [GO:0005618 "cell wall"
evidence=IDA] [GO:0005829 "cytosol" evidence=IDA]
InterPro:IPR000253 InterPro:IPR008984 Pfam:PF00498 PROSITE:PS50006
SMART:SM00240 GO:GO:0005829 GO:GO:0005618 GenomeReviews:AL123456_GR
EMBL:BX842572 Gene3D:2.60.200.20 SUPFAM:SSF49879 PIR:B70700
RefSeq:NP_214534.1 RefSeq:YP_006513334.1 PDB:2LC0 PDB:2LC1 PDB:3OUN
PDB:3PO8 PDB:3POA PDBsum:2LC0 PDBsum:2LC1 PDBsum:3OUN PDBsum:3PO8
PDBsum:3POA ProteinModelPortal:P71590 SMR:P71590 DIP:DIP-59047N
PhosSite:P12071703 PRIDE:P71590 EnsemblBacteria:EBMYCT00000001781
GeneID:13315997 GeneID:887067 KEGG:mtu:Rv0020c KEGG:mtv:RVBD_0020c
PATRIC:18148538 TubercuList:Rv0020c HOGENOM:HOG000235804
OMA:DQGYGQP ProtClustDB:CLSK790198 EvolutionaryTrace:P71590
InterPro:IPR022128 Pfam:PF12401 Uniprot:P71590
Length = 527
Score = 162 (62.1 bits), Expect = 8.0e-09, P = 8.0e-09
Identities = 84/244 (34%), Positives = 98/244 (40%)
Query: 164 PPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKA-PSYD 222
P T V+ S A+ A PM G G + YD A P D
Sbjct: 127 PDVETHPPVIDCARPQSNHAFGAEPGVAPMSDNSSYRGGQG-QGRPDEYYDDRYARPQED 185
Query: 223 PTKGPSYDPAKGP--GYDPTKGPGYDAQKGSNYDAQRGPNY-DIHRGPSYDPQRGLGYDM 279
P GP P GY P G GY Q G Y R P+ D Y P +G GY
Sbjct: 186 PRGGPDPQGGSDPRGGYPPETG-GYPPQPG--YPRPRHPDQGDYPEQIGY-PDQG-GYPE 240
Query: 280 QRGPNYDMQRG-P---GYETQRVPGY-DVQRG---PVYEAQRAP-SYIPQRG---PGYDL 327
QRG Y QRG P GY+ Q GY D +G P YE QR P S P G PGYD
Sbjct: 241 QRG--YPEQRGYPDQRGYQDQG-RGYPDQGQGGYPPPYE-QRPPVSPGPAAGYGAPGYD- 295
Query: 328 QRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNN---VPYGSATPPARSGSGQ 384
QGY R++ Y PS G G G G +G+ P VP G PP + +
Sbjct: 296 ---QGY--RQSGGYGPSPGGGQPGYG-GYGEYGRGPARHEEGSYVPSGPPGPPEQRPAYP 349
Query: 385 PRGG 388
+GG
Sbjct: 350 DQGG 353
Score = 120 (47.3 bits), Expect = 0.00036, P = 0.00036
Identities = 92/303 (30%), Positives = 111/303 (36%)
Query: 120 PNVDRRADGS-YGGATGNSENETSGRPVGQNAYEDGYGVPQGH---GPPPSATTAGVVGA 175
P V +D S Y G G GRP Y+D Y PQ GP P + G
Sbjct: 151 PGVAPMSDNSSYRGGQGQ------GRP--DEYYDDRYARPQEDPRGGPDPQGGSDPRGGY 202
Query: 176 GPNTSTSAYAATQSGTPMRAAY----DIPRGPGYEASKG-P---GYDASKAPSYDPTKGP 227
P T Y Q G P R + D P GY G P GY + Y +G
Sbjct: 203 PPETG--GYPP-QPGYP-RPRHPDQGDYPEQIGYPDQGGYPEQRGYPEQRG--YPDQRG- 255
Query: 228 SYDPAKGPGYDPTKGPG-YDAQKGSNYDAQRGPNYDIHRGPSYDP---QRGLGYDMQRG- 282
Y +G GY P +G G Y GP + P YD Q G GY G
Sbjct: 256 -YQD-QGRGY-PDQGQGGYPPPYEQRPPVSPGPAAG-YGAPGYDQGYRQSG-GYGPSPGG 310
Query: 283 --PNY----DMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMR 336
P Y + RGP + G V GP ++ P+Y P +G GYD QG
Sbjct: 311 GQPGYGGYGEYGRGPARHEE---GSYVPSGPPGPPEQRPAY-PDQG-GYDQGYQQGATTY 365
Query: 337 RAPSYDPSRG-TGFDGAPR--GAAPHG--QVPPPLNNVPYG-SATP----PARSG-SGQP 385
Y T + +PR G AP G P + YG S P PA G SG
Sbjct: 366 GRQDYGGGADYTRYTESPRVPGYAPQGGGYAEPAGRDYDYGQSGAPDYGQPAPGGYSGYG 425
Query: 386 RGG 388
+GG
Sbjct: 426 QGG 428
>UNIPROTKB|Q92804 [details] [associations]
symbol:TAF15 "TATA-binding protein-associated factor 2N"
species:9606 "Homo sapiens" [GO:0000166 "nucleotide binding"
evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
[GO:0003677 "DNA binding" evidence=IEA] [GO:0003723 "RNA binding"
evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
[GO:0005634 "nucleus" evidence=IDA] [GO:0005730 "nucleolus"
evidence=IDA] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0045893
"positive regulation of transcription, DNA-dependent" evidence=TAS]
InterPro:IPR000504 InterPro:IPR001876 InterPro:IPR012677
Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102
PROSITE:PS50199 SMART:SM00360 SMART:SM00547 GO:GO:0005634
GO:GO:0005737 GO:GO:0045893 GO:GO:0000166 GO:GO:0046872
GO:GO:0003677 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003723
EMBL:CH471147 eggNOG:NOG240581 HOGENOM:HOG000038010 EMBL:AC015849
EMBL:U51334 EMBL:X98893 EMBL:AB010067 EMBL:AY197697 EMBL:AK313223
IPI:IPI00020194 IPI:IPI00294426 PIR:S71954 RefSeq:NP_003478.1
RefSeq:NP_631961.1 UniGene:Hs.402752 ProteinModelPortal:Q92804
SMR:Q92804 IntAct:Q92804 STRING:Q92804 PhosphoSite:Q92804
DMDM:8928305 PaxDb:Q92804 PRIDE:Q92804 DNASU:8148
Ensembl:ENST00000311979 GeneID:8148 KEGG:hsa:8148 UCSC:uc002hkc.3
UCSC:uc002hkd.3 CTD:8148 GeneCards:GC17P034136 HGNC:HGNC:11547
HPA:HPA052059 MIM:601574 neXtProt:NX_Q92804 PharmGKB:PA36322
HOVERGEN:HBG005755 InParanoid:Q92804 KO:K14651 OMA:YGNQGSQ
OrthoDB:EOG4MW872 PhylomeDB:Q92804 ChiTaRS:TAF15 GenomeRNAi:8148
NextBio:30819 PMAP-CutDB:Q92804 ArrayExpress:Q92804 Bgee:Q92804
CleanEx:HS_TAF15 Genevestigator:Q92804 GermOnline:ENSG00000172660
Uniprot:Q92804
Length = 592
Score = 160 (61.4 bits), Expect = 1.6e-08, P = 1.6e-08
Identities = 67/206 (32%), Positives = 79/206 (38%)
Query: 136 NSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNTSTSAYAATQSGTPMR 194
N RP G + GYG +G+ G G G G + S Y +S
Sbjct: 380 NEPRPEDSRPSGGDFRGRGYGGERGYRGRGGRGGDRG--GYGGDRSGGGYGGDRSSG--- 434
Query: 195 AAYDIPR-GPGYEASK-GPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSN 252
Y R G GY + G GY + Y +G Y +G GY +G GY +G
Sbjct: 435 GGYSGDRSGGGYGGDRSGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRG-GYGGDRGGG 493
Query: 253 YDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNY--DMQRGPGYETQRV--PGYDVQRGPV 308
Y RG Y RG Y RG GY RG Y D RG GY R GY R
Sbjct: 494 YGGDRG-GYGGDRG-GYGGDRG-GYGGDRG-GYGGDRSRG-GYGGDRGGGSGYGGDRSGG 548
Query: 309 YEAQRAPS-YIPQRGPGYDLQRGQGY 333
Y R+ Y RG GY RG GY
Sbjct: 549 YGGDRSGGGYGGDRGGGYGGDRG-GY 573
Score = 159 (61.0 bits), Expect = 2.1e-08, P = 2.1e-08
Identities = 68/220 (30%), Positives = 83/220 (37%)
Query: 120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNT 179
P R + G + G E GR G+ GYG + G ++G G +
Sbjct: 384 PEDSRPSGGDFRGRGYGGERGYRGRG-GRGGDRGGYGGDRSGGGYGGDRSSGG-GYSGDR 441
Query: 180 STSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDP 239
S Y +SG Y RG GY +G GY + Y +G Y +G GY
Sbjct: 442 SGGGYGGDRSG----GGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRG-GYGGDRGGGYGG 496
Query: 240 TKGPGYDAQKGSNYDAQRGPNYDIHRGPSY--DPQRGLGYDMQRGPNYDMQRGPGYETQR 297
+G GY +G Y RG Y RG Y D RG GY RG G GY R
Sbjct: 497 DRG-GYGGDRGG-YGGDRG-GYGGDRG-GYGGDRSRG-GYGGDRGG------GSGYGGDR 545
Query: 298 VPGYDVQR-GPVYEAQRAPSYIPQRGPGYDLQRGQGYDMR 336
GY R G Y R Y RG GY + G D R
Sbjct: 546 SGGYGGDRSGGGYGGDRGGGYGGDRG-GYGGKMGGRNDYR 584
Score = 153 (58.9 bits), Expect = 8.8e-09, Sum P(2) = 8.8e-09
Identities = 60/164 (36%), Positives = 68/164 (41%)
Query: 201 RGPGYEASK-GPGY--DASKAPSYDPTK-GPSYDPAK-GPGYDPTKGPGYDAQKGSNYDA 255
RG GY + G GY D S Y + G Y + G GY +G GY +G Y
Sbjct: 415 RG-GYGGDRSGGGYGGDRSSGGGYSGDRSGGGYGGDRSGGGYGGDRGGGYGGDRGGGYGG 473
Query: 256 QRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAP 315
RG Y RG Y RG GY RG Y RG GY R GY RG Y R+
Sbjct: 474 DRGGGYGGDRG-GYGGDRGGGYGGDRG-GYGGDRG-GYGGDR-GGYGGDRGG-YGGDRSR 528
Query: 316 S-YIPQRG--PGYDLQRGQGYDMRRAPS-YDPSRGTGFDGAPRG 355
Y RG GY R GY R+ Y RG G+ G RG
Sbjct: 529 GGYGGDRGGGSGYGGDRSGGYGGDRSGGGYGGDRGGGY-GGDRG 571
Score = 53 (23.7 bits), Expect = 8.8e-09, Sum P(2) = 8.8e-09
Identities = 21/96 (21%), Positives = 40/96 (41%)
Query: 78 RGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRRADGSYGGATGNS 137
+ Y+ + Y+ + +S +NY + + R ++ +R GS GG G
Sbjct: 132 QSNYDQQHDSYSQNQQSYHSQRENY---SHHTQDDRRDVSRYGEDNRGYGGSQGGGRGRG 188
Query: 138 ENETSGR-PV-GQNAYEDG----YGVPQGHGPPPSA 167
+ GR P+ G + + G +G + +GP A
Sbjct: 189 GYDKDGRGPMTGSSGGDRGGFKNFGGHRDYGPRTDA 224
>UNIPROTKB|F1PB61 [details] [associations]
symbol:TAF15 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0008270 "zinc ion binding" evidence=IEA]
[GO:0005622 "intracellular" evidence=IEA] [GO:0003676 "nucleic acid
binding" evidence=IEA] [GO:0000166 "nucleotide binding"
evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358
PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360 SMART:SM00547
GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
GO:GO:0005622 GeneTree:ENSGT00530000063105 CTD:8148 KO:K14651
OMA:YGNQGSQ EMBL:AAEX03006620 EMBL:AAEX03006619 RefSeq:XP_548255.2
ProteinModelPortal:F1PB61 Ensembl:ENSCAFT00000028877 GeneID:491135
KEGG:cfa:491135 Uniprot:F1PB61
Length = 571
Score = 160 (61.4 bits), Expect = 1.5e-08, P = 1.5e-08
Identities = 70/240 (29%), Positives = 87/240 (36%)
Query: 124 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQ-GHGPPPSATTAGVVGAGPNTSTS 182
RR + GG +G G G+ ++ G P+ G P+ + + A N+
Sbjct: 319 RRPEFMRGGGSGGGRRGRGGYR-GRGGFQGRGGDPKSGDWVCPNPSCGNMNFARRNSCNQ 377
Query: 183 AYAAT-QSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGY--DP 239
+ P + RG GY +G Y D G D + G GY D
Sbjct: 378 CNEPRPEDSRPSGGDF---RGRGYGGERG--YRGRGGRGGD-RGGYGADRSSG-GYGGDR 430
Query: 240 TKGPGYDAQK-GSNYDAQR-GPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQR 297
+ G GY + G Y R G Y RG Y RG GY RG Y RG GY R
Sbjct: 431 SGGGGYGGDRSGGGYGGDRSGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDR 490
Query: 298 VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQG--YDMRRAPSYDPSRGTGFDGAPRG 355
GY RG Y R + G GY RG G Y R Y R G G RG
Sbjct: 491 GGGYGGDRGGGYGGDRGGYGGDRSGGGYGGDRGGGGGYGGDRGGGYGGDRSGGGYGGDRG 550
Score = 145 (56.1 bits), Expect = 1.9e-08, Sum P(2) = 1.9e-08
Identities = 52/152 (34%), Positives = 61/152 (40%)
Query: 194 RAAYDIPR---GPGYEASKGPGYDASKAPS-YDPTK-GPSYDPAKGPGYDPTKGPGYDAQ 248
R Y R G G + S G GY ++ Y + G Y +G GY +G GY
Sbjct: 414 RGGYGADRSSGGYGGDRSGGGGYGGDRSGGGYGGDRSGGGYGGDRGGGYGGDRGGGYGGD 473
Query: 249 KGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQR-GPGYETQRVPG--YDVQR 305
+G Y RG Y RG Y RG GY RG Y R G GY R G Y R
Sbjct: 474 RGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRG-GYGGDRSGGGYGGDRGGGGGYGGDR 532
Query: 306 GPVYEAQRAPS-YIPQRGPGYDLQRGQGYDMR 336
G Y R+ Y RG GY + G D R
Sbjct: 533 GGGYGGDRSGGGYGGDRG-GYGGKMGGRNDYR 563
Score = 139 (54.0 bits), Expect = 8.9e-08, Sum P(2) = 8.9e-08
Identities = 68/219 (31%), Positives = 76/219 (34%)
Query: 147 GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYE 206
G+ Y G G QG G P + V P+ +A S P
Sbjct: 335 GRGGYR-GRGGFQGRGGDPKS--GDWVCPNPSCGNMNFARRNSCNQCNEPRPEDSRPSGG 391
Query: 207 ASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGY--DAQKGSNYDAQR-GPNYDI 263
+G GY + Y G D G G D + G GY D G Y R G Y
Sbjct: 392 DFRGRGYGGERG--YRGRGGRGGDRG-GYGADRSSG-GYGGDRSGGGGYGGDRSGGGYGG 447
Query: 264 HR-GPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG 322
R G Y RG GY RG Y RG GY R GY RG Y R Y RG
Sbjct: 448 DRSGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRG 507
Query: 323 PGYDLQR-GQGYDMRRAPSYDPSRGTGFDGAPRGAAPHG 360
GY R G GY D G G+ G RG G
Sbjct: 508 -GYGGDRSGGGY------GGDRGGGGGY-GGDRGGGYGG 538
Score = 121 (47.7 bits), Expect = 0.00031, P = 0.00031
Identities = 48/167 (28%), Positives = 62/167 (37%)
Query: 125 RADGSYGGATGN-SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSA 183
R G GG G + +SG G + GYG + G + G G G +
Sbjct: 405 RGRGGRGGDRGGYGADRSSGGYGGDRSGGGGYGGDRSGGGYGGDRSGG--GYGGDRG-GG 461
Query: 184 YAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK-G 242
Y + G Y RG GY +G GY + Y +G Y +G GY + G
Sbjct: 462 YGGDRGG-----GYGGDRGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRG-GYGGDRSG 515
Query: 243 PGYDAQKGSN--YDAQRGPNYDIHR-GPSYDPQRGLGYDMQRGPNYD 286
GY +G Y RG Y R G Y RG GY + G D
Sbjct: 516 GGYGGDRGGGGGYGGDRGGGYGGDRSGGGYGGDRG-GYGGKMGGRND 561
Score = 58 (25.5 bits), Expect = 1.9e-08, Sum P(2) = 1.9e-08
Identities = 23/96 (23%), Positives = 39/96 (40%)
Query: 78 RGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRRADGSYGGATGNS 137
+ Y + YN + +S NY + + R ++ +R GS GG G
Sbjct: 131 QSNYGPQHDSYNQNQQSYHSQRDNY---SHHTQDDRRDVSRYGEDNRGYGGSQGGGRGRG 187
Query: 138 ENETSGR-PV-GQNAYEDG----YGVPQGHGPPPSA 167
+ GR P+ G + + G +G + +GP P A
Sbjct: 188 GYDKDGRGPMTGSSGGDRGGFKNFGGHRDYGPRPDA 223
>WB|WBGene00044109 [details] [associations]
symbol:K02E11.10 species:6239 "Caenorhabditis elegans"
[GO:0016021 "integral to membrane" evidence=IEA] EMBL:Z77665
RefSeq:NP_001024024.1 ProteinModelPortal:Q5FC49
EnsemblMetazoa:K02E11.10 GeneID:259661 KEGG:cel:CELE_K02E11.10
UCSC:K02E11.10 CTD:259661 WormBase:K02E11.10
GeneTree:ENSGT00530000065030 InParanoid:Q5FC49 OMA:VQASGYQ
NextBio:952394 Uniprot:Q5FC49
Length = 360
Score = 154 (59.3 bits), Expect = 2.9e-08, P = 2.9e-08
Identities = 69/224 (30%), Positives = 91/224 (40%)
Query: 154 GYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPG 212
G+G G P A G+ G G A+ G A G G G G
Sbjct: 81 GFGGAGGSYAAP-ALGGGLGGFGGAPAPAPAFGGLGGGYQAAPALGGGLGGGLGGGPGGG 139
Query: 213 YDASKAPSYDPTKGPSYDPA---KGPGYD--PTKGPGYDAQKGSNYDAQRGP---NYDIH 264
Y A+ A P+ PA G GY PT G G AQ G+ Y Q+GP +
Sbjct: 140 YQAAPALQLPGLGAPA--PAFGGLGGGYQGAPTLGGG-QAQGGAGY--QQGPAQGRFVAQ 194
Query: 265 RGPSYDPQRGLGYDMQRGP---NYDMQRGPGYETQRVPGYDVQRGPV---YEAQRAPSYI 318
+G + Q G GY Q+GP + Q+GP Q GY Q+GP + AQ+ P+
Sbjct: 195 QGSAQGVQGGAGY--QQGPAQGGFTAQQGPAQVVQGGAGY--QQGPAQGGFVAQQGPAPA 250
Query: 319 PQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG-AAPHGQ 361
Q G GY QG A ++G G+ A +G +AP Q
Sbjct: 251 AQGGAGYQQGSTQGGFEAVAQQGQVAQGAGYQSAAQGQSAPVSQ 294
>DICTYBASE|DDB_G0277909 [details] [associations]
symbol:cbpP "calcium-binding protein" species:44689
"Dictyostelium discoideum" [GO:0005509 "calcium ion binding"
evidence=IEA] [GO:0046872 "metal ion binding" evidence=IEA]
[GO:0044351 "macropinocytosis" evidence=RCA] InterPro:IPR002048
InterPro:IPR011992 Pfam:PF13499 PROSITE:PS50222 SMART:SM00054
dictyBase:DDB_G0277909 Prosite:PS00018 GenomeReviews:CM000152_GR
EMBL:AAFI02000023 GO:GO:0005509 Gene3D:1.10.238.10
InterPro:IPR018247 EMBL:U03413 RefSeq:XP_642080.1
ProteinModelPortal:P35085 PRIDE:P35085 EnsemblProtists:DDB0214957
GeneID:8621293 KEGG:ddi:DDB_G0277909 eggNOG:NOG135385 OMA:MGAYPPQ
ProtClustDB:CLSZ2846833 Uniprot:P35085
Length = 467
Score = 155 (59.6 bits), Expect = 3.9e-08, P = 3.9e-08
Identities = 73/247 (29%), Positives = 89/247 (36%)
Query: 158 PQGHGPPPSATTAGVVGAGPNT--STSAYAATQS--GTPMRAAYDIPRGPGYEASKGPGY 213
PQ PPP+ + A P T +QS G P P+ PG S P Y
Sbjct: 4 PQN--PPPAGSAADFYSQMPVKVMGTPGAPGSQSTPGAPGAPGQYPPQQPGAPGSNLPPY 61
Query: 214 DASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRG-PNYDIHRGPSYDPQ 272
++ P G Y P + PG P + PG Q Q G P + Y PQ
Sbjct: 62 PGTQQPGAPGAPG-QYPPQQ-PGQYPPQQPGAPGQYPPQQPGQPGYPPQQPGQSGQYPPQ 119
Query: 273 R-GL-GYDMQR--GPN-YDMQRG-PGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYD 326
+ G GY Q+ P Y Q+G PG + PG Q P + Q P Q G
Sbjct: 120 QPGQPGYPPQQPGAPGQYPPQQGQPGQYPPQQPGQPGQYPPQQQGQYPPQQPGQPGAYPP 179
Query: 327 LQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPP---ARSGSG 383
Q GQ +Y P +G A GA VPPP P PP A G
Sbjct: 180 QQSGQ------PGAYPPQQGVQNTLAKTGAPGQPGVPPPQGAYPGQPGVPPQQGAYPGQQ 233
Query: 384 QPRGGNP 390
P G P
Sbjct: 234 PPMGAYP 240
Score = 139 (54.0 bits), Expect = 2.3e-06, P = 2.3e-06
Identities = 79/251 (31%), Positives = 98/251 (39%)
Query: 162 GPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSY 221
G P S +T G GA P Y Q G P ++P PG + PG P
Sbjct: 29 GAPGSQSTPGAPGA-PGQ----YPPQQPGAP---GSNLPPYPGTQQPGAPGAPGQYPPQ- 79
Query: 222 DPTKGPSYDPAKGPG-YDPTK-G-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQR-GL-G 276
P + P P PG Y P + G PGY Q+ Q P P Y PQ+ G G
Sbjct: 80 QPGQYPPQQPG-APGQYPPQQPGQPGYPPQQPGQ-SGQYPPQQPGQ--PGYPPQQPGAPG 135
Query: 277 -YDMQRG-PNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG--PG-YDLQRGQ 331
Y Q+G P + PG Q P Q P Q +Y PQ+ PG Y Q+G
Sbjct: 136 QYPPQQGQPGQYPPQQPGQPGQYPPQQQGQYPPQQPGQPG-AYPPQQSGQPGAYPPQQGV 194
Query: 332 GYDMRRA-----PSYDPSRGT--GFDGAP--RGAAPHGQVPPPLNNVPYGSATPPARSGS 382
+ + P P +G G G P +GA P GQ PP P G P A
Sbjct: 195 QNTLAKTGAPGQPGVPPPQGAYPGQPGVPPQQGAYP-GQQPPMGAYPPQGQ--PGAYPPQ 251
Query: 383 GQPRGGNPARR 393
GQP G P ++
Sbjct: 252 GQP-GAYPPQQ 261
Score = 133 (51.9 bits), Expect = 1.1e-05, P = 1.1e-05
Identities = 83/276 (30%), Positives = 101/276 (36%)
Query: 132 GATGNSENETSGRPVGQNAYEDGY-GVPQGHGPP-PSATTAGVVGA-G--PNTSTSAYAA 186
GA G+ T G P Y G P + PP P G GA G P Y
Sbjct: 29 GAPGSQS--TPGAPGAPGQYPPQQPGAPGSNLPPYPGTQQPGAPGAPGQYPPQQPGQYPP 86
Query: 187 TQSGTPMRAAYDIPRGPGYEASKGPG----YDASKA--PSYDPTK--GPS-YDPAKG-PG 236
Q G P + P PGY + PG Y + P Y P + P Y P +G PG
Sbjct: 87 QQPGAPGQYPPQQPGQPGYPPQQ-PGQSGQYPPQQPGQPGYPPQQPGAPGQYPPQQGQPG 145
Query: 237 YDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQR-GL--GYDMQRGPNYDMQRGPGY 293
P + PG Q Q P G +Y PQ+ G Y Q+G + + G
Sbjct: 146 QYPPQQPGQPGQYPPQQQGQYPPQQPGQPG-AYPPQQSGQPGAYPPQQGVQNTLAK-TGA 203
Query: 294 ETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGA- 352
Q PG +G Y Q P PQ+G Y GQ M +Y P G GA
Sbjct: 204 PGQ--PGVPPPQG-AYPGQ--PGVPPQQG-AYP---GQQPPMG---AYPPQ---GQPGAY 248
Query: 353 PRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 388
P P G PP V Y PP G+ P+ G
Sbjct: 249 PPQGQP-GAYPPQQQQVAYPGQQPPM--GAYPPQQG 281
>FB|FBgn0050203 [details] [associations]
symbol:CG30203 species:7227 "Drosophila melanogaster"
[GO:0004867 "serine-type endopeptidase inhibitor activity"
evidence=IEA] InterPro:IPR002223 Pfam:PF00014 PROSITE:PS50279
SMART:SM00131 EMBL:AE013599 GO:GO:0004867 Gene3D:4.10.410.10
SUPFAM:SSF57362 InterPro:IPR000884 Pfam:PF00090 SMART:SM00209
SUPFAM:SSF82895 PROSITE:PS50092 InterPro:IPR002861 Pfam:PF02014
PROSITE:PS51019 GeneTree:ENSGT00640000091268 InterPro:IPR009465
Pfam:PF06468 PROSITE:PS51020 EMBL:BT023853 RefSeq:NP_725128.2
UniGene:Dm.23753 SMR:Q3ZAL6 EnsemblMetazoa:FBtr0273303
GeneID:246514 KEGG:dme:Dmel_CG30203 FlyBase:FBgn0050203
eggNOG:NOG244582 OMA:KWARNTH OrthoDB:EOG43R22N GenomeRNAi:246514
NextBio:842774 Uniprot:Q3ZAL6
Length = 924
Score = 157 (60.3 bits), Expect = 6.5e-08, P = 6.5e-08
Identities = 39/105 (37%), Positives = 49/105 (46%)
Query: 194 RAAYDIP--RGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGS 251
R +YD RG Y+ + G Y ++ SYD G SYD G Y T G YD +
Sbjct: 793 RRSYDQKQTRGRPYDQTSGIAYGQTEGRSYDQKVGKSYDRTFGRSYKQTGGGSYDQPEDR 852
Query: 252 NYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY-ET 295
+YD G +Y SYD RG YD G +YD+ G Y ET
Sbjct: 853 SYDLSTGRSYVQPEDRSYDLSRGRSYDQPVGRSYDLAGGRSYGET 897
Score = 153 (58.9 bits), Expect = 1.8e-07, P = 1.8e-07
Identities = 46/148 (31%), Positives = 60/148 (40%)
Query: 206 EASKGPGYDASKAPSYDP--TKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDI 263
E S+ D SYD T+G YD G Y T+G YD + G +YD G +Y
Sbjct: 781 ERSENDAMDLYGRRSYDQKQTRGRPYDQTSGIAYGQTEGRSYDQKVGKSYDRTFGRSYKQ 840
Query: 264 HRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGP 323
G SYD YD+ G +Y Y+ R YD G Y+ SY
Sbjct: 841 TGGGSYDQPEDRSYDLSTGRSYVQPEDRSYDLSRGRSYDQPVGRSYDLAGGRSYGETSEA 900
Query: 324 GYDLQRGQGYDMRRAPSYDPSRGTGFDG 351
G D+ G+ R+ YD SR + G
Sbjct: 901 G-DI--GEPMSQTRS-RYDTSRRGRYGG 924
Score = 134 (52.2 bits), Expect = 2.2e-05, P = 2.2e-05
Identities = 36/111 (32%), Positives = 45/111 (40%)
Query: 245 YDAQ--KGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYD 302
YD + +G YD G Y G SYD + G YD G +Y G Y+ YD
Sbjct: 796 YDQKQTRGRPYDQTSGIAYGQTEGRSYDQKVGKSYDRTFGRSYKQTGGGSYDQPEDRSYD 855
Query: 303 VQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 353
+ G Y SY RG YD G+ YD+ SY + G G P
Sbjct: 856 LSTGRSYVQPEDRSYDLSRGRSYDQPVGRSYDLAGGRSYGETSEAGDIGEP 906
Score = 123 (48.4 bits), Expect = 0.00035, P = 0.00035
Identities = 38/119 (31%), Positives = 52/119 (43%)
Query: 179 TSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYD 238
TS AY T+ +YD G Y+ + G Y + SYD + SYD + G Y
Sbjct: 809 TSGIAYGQTEG-----RSYDQKVGKSYDRTFGRSYKQTGGGSYDQPEDRSYDLSTGRSYV 863
Query: 239 PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDP--QRG-LGYDM-QRGPNYDMQRGPGY 293
+ YD +G +YD G +YD+ G SY + G +G M Q YD R Y
Sbjct: 864 QPEDRSYDLSRGRSYDQPVGRSYDLAGGRSYGETSEAGDIGEPMSQTRSRYDTSRRGRY 922
>WB|WBGene00005015 [details] [associations]
symbol:spt-5 species:6239 "Caenorhabditis elegans"
[GO:0032968 "positive regulation of transcription elongation from
RNA polymerase II promoter" evidence=IEA] [GO:0006357 "regulation
of transcription from RNA polymerase II promoter" evidence=IEA]
[GO:0032784 "regulation of DNA-dependent transcription, elongation"
evidence=IEA] [GO:0009792 "embryo development ending in birth or
egg hatching" evidence=IMP] [GO:0040007 "growth" evidence=IMP]
[GO:0002119 "nematode larval development" evidence=IMP] [GO:0040035
"hermaphrodite genitalia development" evidence=IMP] [GO:0000003
"reproduction" evidence=IMP] InterPro:IPR006645 InterPro:IPR017071
InterPro:IPR024945 PIRSF:PIRSF036945 SMART:SM00738 SMART:SM01104
Pfam:PF00467 GO:GO:0009792 GO:GO:0040007 GO:GO:0002119
GO:GO:0006351 GO:GO:0040035 GO:GO:0032968 EMBL:Z68752
InterPro:IPR008991 SUPFAM:SSF50104 InterPro:IPR005824 SMART:SM00739
eggNOG:COG0250 GO:GO:0032044 HOGENOM:HOG000038564 KO:K15172
InterPro:IPR022581 InterPro:IPR005100 PANTHER:PTHR11125:SF7
Pfam:PF03439 Pfam:PF11942 EMBL:Z68316 PIR:T23467 RefSeq:NP_502283.1
ProteinModelPortal:Q21338 SMR:Q21338 STRING:Q21338 PaxDb:Q21338
EnsemblMetazoa:K08E4.1 GeneID:178143 KEGG:cel:CELE_K08E4.1
UCSC:K08E4.1 CTD:178143 WormBase:K08E4.1
GeneTree:ENSGT00440000037640 InParanoid:Q21338 OMA:PAYGNES
NextBio:899898 Uniprot:Q21338
Length = 1208
Score = 158 (60.7 bits), Expect = 7.1e-08, P = 7.1e-08
Identities = 60/182 (32%), Positives = 76/182 (41%)
Query: 179 TSTSAYAA-TQSGTPMRAAYDIPRGPGY-EASKGPGYDASKAPSYDPTKGPSYDPAKGPG 236
+ T Y A T M AYD R P Y E + P Y SK P+Y S G
Sbjct: 813 SKTPMYGAQTPMYGSMTPAYDGGRTPAYGEGGRTPAY-GSKTPAYGDLDEHSSSRTPAYG 871
Query: 237 YDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ-RGPGYET 295
D ++ P Y GS D R P Y G P G D R P YD R PGYE+
Sbjct: 872 NDSSRTPAY----GSA-DGARTPAYGSTEG-GRTPAYG-SMDNSRTPAYDDSGRTPGYES 924
Query: 296 Q--RVPGYDVQ-RGPVY-EAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDG 351
R P YD + P Y E++ + R P Y+ YD+ +P+Y+P +D
Sbjct: 925 MPSRTPNYDSSSKTPAYPESEHSA-----RTPAYN----NDYDIPLSPAYEPDAPEAYDN 975
Query: 352 AP 353
AP
Sbjct: 976 AP 977
Score = 143 (55.4 bits), Expect = 3.1e-06, P = 3.1e-06
Identities = 73/253 (28%), Positives = 95/253 (37%)
Query: 126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYA 185
A GS A G+ + +S R AY + +G A T G+ T AY
Sbjct: 848 AYGSKTPAYGDLDEHSSSRTP---AYGNDSSRTPAYGSADGARTPAY-GSTEGGRTPAYG 903
Query: 186 ATQ-SGTPMRAAYDIP-RGPGYEA--SKGPGYDAS-KAPSYDPTKGPSYDPAKGPGYDPT 240
+ S TP AYD R PGYE+ S+ P YD+S K P+Y ++ + PA YD
Sbjct: 904 SMDNSRTP---AYDDSGRTPGYESMPSRTPNYDSSSKTPAYPESEHSARTPAYNNDYDIP 960
Query: 241 KGPGYDAQKGSNYD---------AQRGPNYDIHRG--PSYDPQRGLGYDMQRG----PNY 285
P Y+ YD R P YD + P+Y+P + G P Y
Sbjct: 961 LSPAYEPDAPEAYDNAPARTPAFVSRTPGYDTYENSSPTYEPDAATKVEEDIGDTSSPTY 1020
Query: 286 DMQRGPGYETQRVPGYDVQRG-P-VYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS--- 340
D P PG + P Y P + PG G YD APS
Sbjct: 1021 DSP--PHSYVVPTPGAMLNPATPGAYHVD-TPGFAAPMTPG----SGGAYDQYVAPSPFA 1073
Query: 341 -YDPSRGTGFDGA 352
YD + DGA
Sbjct: 1074 GYDSNNYNNADGA 1086
Score = 133 (51.9 bits), Expect = 3.9e-05, P = 3.9e-05
Identities = 67/218 (30%), Positives = 84/218 (38%)
Query: 194 RAAYDIPRGPGYEASKGPG---YDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKG 250
RA + G A G G Y +SK P D K P Y +K P Y + P Y +
Sbjct: 773 RARVMVVGDTGITAGSGGGSSFYSSSKTPMRDSGKTPMYG-SKTPMYG-AQTPMYGSMTP 830
Query: 251 SNYDAQRGPNY-DIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY--ETQRVPGY---DVQ 304
+ YD R P Y + R P+Y + D+ + R P Y ++ R P Y D
Sbjct: 831 A-YDGGRTPAYGEGGRTPAYGSKTPAYGDLDE---HSSSRTPAYGNDSSRTPAYGSADGA 886
Query: 305 RGPVY---EAQRAPSYIPQ---RGPGYDLQ-RGQGYDMR--RAPSYDPSRGTGFDGAPRG 355
R P Y E R P+Y R P YD R GY+ R P+YD S T P
Sbjct: 887 RTPAYGSTEGGRTPAYGSMDNSRTPAYDDSGRTPGYESMPSRTPNYDSSSKT--PAYPE- 943
Query: 356 AAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGN-PAR 392
+ H P NN Y PA N PAR
Sbjct: 944 -SEHSARTPAYNN-DYDIPLSPAYEPDAPEAYDNAPAR 979
>UNIPROTKB|Q21338 [details] [associations]
symbol:spt-5 "Transcription elongation factor SPT5"
species:6239 "Caenorhabditis elegans" [GO:0032044 "DSIF complex"
evidence=ISS] InterPro:IPR006645 InterPro:IPR017071
InterPro:IPR024945 PIRSF:PIRSF036945 SMART:SM00738 SMART:SM01104
Pfam:PF00467 GO:GO:0009792 GO:GO:0040007 GO:GO:0002119
GO:GO:0006351 GO:GO:0040035 GO:GO:0032968 EMBL:Z68752
InterPro:IPR008991 SUPFAM:SSF50104 InterPro:IPR005824 SMART:SM00739
eggNOG:COG0250 GO:GO:0032044 HOGENOM:HOG000038564 KO:K15172
InterPro:IPR022581 InterPro:IPR005100 PANTHER:PTHR11125:SF7
Pfam:PF03439 Pfam:PF11942 EMBL:Z68316 PIR:T23467 RefSeq:NP_502283.1
ProteinModelPortal:Q21338 SMR:Q21338 STRING:Q21338 PaxDb:Q21338
EnsemblMetazoa:K08E4.1 GeneID:178143 KEGG:cel:CELE_K08E4.1
UCSC:K08E4.1 CTD:178143 WormBase:K08E4.1
GeneTree:ENSGT00440000037640 InParanoid:Q21338 OMA:PAYGNES
NextBio:899898 Uniprot:Q21338
Length = 1208
Score = 158 (60.7 bits), Expect = 7.1e-08, P = 7.1e-08
Identities = 60/182 (32%), Positives = 76/182 (41%)
Query: 179 TSTSAYAA-TQSGTPMRAAYDIPRGPGY-EASKGPGYDASKAPSYDPTKGPSYDPAKGPG 236
+ T Y A T M AYD R P Y E + P Y SK P+Y S G
Sbjct: 813 SKTPMYGAQTPMYGSMTPAYDGGRTPAYGEGGRTPAY-GSKTPAYGDLDEHSSSRTPAYG 871
Query: 237 YDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ-RGPGYET 295
D ++ P Y GS D R P Y G P G D R P YD R PGYE+
Sbjct: 872 NDSSRTPAY----GSA-DGARTPAYGSTEG-GRTPAYG-SMDNSRTPAYDDSGRTPGYES 924
Query: 296 Q--RVPGYDVQ-RGPVY-EAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDG 351
R P YD + P Y E++ + R P Y+ YD+ +P+Y+P +D
Sbjct: 925 MPSRTPNYDSSSKTPAYPESEHSA-----RTPAYN----NDYDIPLSPAYEPDAPEAYDN 975
Query: 352 AP 353
AP
Sbjct: 976 AP 977
Score = 143 (55.4 bits), Expect = 3.1e-06, P = 3.1e-06
Identities = 73/253 (28%), Positives = 95/253 (37%)
Query: 126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYA 185
A GS A G+ + +S R AY + +G A T G+ T AY
Sbjct: 848 AYGSKTPAYGDLDEHSSSRTP---AYGNDSSRTPAYGSADGARTPAY-GSTEGGRTPAYG 903
Query: 186 ATQ-SGTPMRAAYDIP-RGPGYEA--SKGPGYDAS-KAPSYDPTKGPSYDPAKGPGYDPT 240
+ S TP AYD R PGYE+ S+ P YD+S K P+Y ++ + PA YD
Sbjct: 904 SMDNSRTP---AYDDSGRTPGYESMPSRTPNYDSSSKTPAYPESEHSARTPAYNNDYDIP 960
Query: 241 KGPGYDAQKGSNYD---------AQRGPNYDIHRG--PSYDPQRGLGYDMQRG----PNY 285
P Y+ YD R P YD + P+Y+P + G P Y
Sbjct: 961 LSPAYEPDAPEAYDNAPARTPAFVSRTPGYDTYENSSPTYEPDAATKVEEDIGDTSSPTY 1020
Query: 286 DMQRGPGYETQRVPGYDVQRG-P-VYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS--- 340
D P PG + P Y P + PG G YD APS
Sbjct: 1021 DSP--PHSYVVPTPGAMLNPATPGAYHVD-TPGFAAPMTPG----SGGAYDQYVAPSPFA 1073
Query: 341 -YDPSRGTGFDGA 352
YD + DGA
Sbjct: 1074 GYDSNNYNNADGA 1086
Score = 133 (51.9 bits), Expect = 3.9e-05, P = 3.9e-05
Identities = 67/218 (30%), Positives = 84/218 (38%)
Query: 194 RAAYDIPRGPGYEASKGPG---YDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKG 250
RA + G A G G Y +SK P D K P Y +K P Y + P Y +
Sbjct: 773 RARVMVVGDTGITAGSGGGSSFYSSSKTPMRDSGKTPMYG-SKTPMYG-AQTPMYGSMTP 830
Query: 251 SNYDAQRGPNY-DIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY--ETQRVPGY---DVQ 304
+ YD R P Y + R P+Y + D+ + R P Y ++ R P Y D
Sbjct: 831 A-YDGGRTPAYGEGGRTPAYGSKTPAYGDLDE---HSSSRTPAYGNDSSRTPAYGSADGA 886
Query: 305 RGPVY---EAQRAPSYIPQ---RGPGYDLQ-RGQGYDMR--RAPSYDPSRGTGFDGAPRG 355
R P Y E R P+Y R P YD R GY+ R P+YD S T P
Sbjct: 887 RTPAYGSTEGGRTPAYGSMDNSRTPAYDDSGRTPGYESMPSRTPNYDSSSKT--PAYPE- 943
Query: 356 AAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGN-PAR 392
+ H P NN Y PA N PAR
Sbjct: 944 -SEHSARTPAYNN-DYDIPLSPAYEPDAPEAYDNAPAR 979
>SGD|S000002299 [details] [associations]
symbol:RPO21 "RNA polymerase II largest subunit B220"
species:4932 "Saccharomyces cerevisiae" [GO:0016740 "transferase
activity" evidence=IEA] [GO:0016779 "nucleotidyltransferase
activity" evidence=IEA] [GO:0046872 "metal ion binding"
evidence=IEA] [GO:0006366 "transcription from RNA polymerase II
promoter" evidence=IEA;IMP] [GO:0003899 "DNA-directed RNA
polymerase activity" evidence=IEA;IDA] [GO:0005739 "mitochondrion"
evidence=IDA] [GO:0005634 "nucleus" evidence=IEA;IDA] [GO:0005665
"DNA-directed RNA polymerase II, core complex" evidence=IEA;IDA]
[GO:0003677 "DNA binding" evidence=IEA] [GO:0003968 "RNA-directed
RNA polymerase activity" evidence=IDA] [GO:0006351 "transcription,
DNA-dependent" evidence=IEA] InterPro:IPR000684 InterPro:IPR000722
InterPro:IPR006592 InterPro:IPR007066 InterPro:IPR007073
InterPro:IPR007075 InterPro:IPR007080 InterPro:IPR007081
InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983 Pfam:PF04990
Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000 Pfam:PF05001
PROSITE:PS00115 SMART:SM00663 SGD:S000002299 GO:GO:0005739
GO:GO:0046872 GO:GO:0003677 EMBL:BK006938 GO:GO:0006366
Gene3D:2.40.40.20 InterPro:IPR009010 EMBL:X96876 EMBL:U27182
GO:GO:0003899 PDB:4GWQ PDBsum:4GWQ PDB:2LO6 PDBsum:2LO6
eggNOG:COG0086 GO:GO:0005665 PDB:1I3Q PDB:1I50 PDB:1I6H PDB:1K83
PDB:1NIK PDB:1NT9 PDB:1PQV PDB:1R5U PDB:1R9S PDB:1R9T PDB:1SFO
PDB:1TWA PDB:1TWC PDB:1TWF PDB:1TWG PDB:1TWH PDB:1WCM PDB:1Y1V
PDB:1Y1W PDB:1Y1Y PDB:1Y77 PDB:2B63 PDB:2B8K PDB:2E2H PDB:2E2I
PDB:2E2J PDB:2JA5 PDB:2JA6 PDB:2JA7 PDB:2JA8 PDB:2NVQ PDB:2NVT
PDB:2NVX PDB:2NVY PDB:2NVZ PDB:2R7Z PDB:2R92 PDB:2R93 PDB:2VUM
PDB:2YU9 PDB:3CQZ PDB:3FKI PDB:3GTG PDB:3GTJ PDB:3GTK PDB:3GTL
PDB:3GTM PDB:3GTO PDB:3GTP PDB:3GTQ PDB:3H3V PDB:3HOU PDB:3HOV
PDB:3HOW PDB:3HOX PDB:3HOY PDB:3HOZ PDB:3I4M PDB:3I4N PDB:3K1F
PDB:3K7A PDB:3M3Y PDB:3M4O PDB:3PO2 PDB:3PO3 PDB:3QT1 PDB:3RZD
PDB:3RZO PDB:3S14 PDB:3S15 PDB:3S16 PDB:3S17 PDB:3S1M PDB:3S1N
PDB:3S1Q PDB:3S1R PDB:3S2D PDB:3S2H PDB:4A3B PDB:4A3C PDB:4A3D
PDB:4A3E PDB:4A3F PDB:4A3G PDB:4A3I PDB:4A3J PDB:4A3K PDB:4A3L
PDB:4A3M PDB:4A93 PDB:4BBR PDB:4BBS PDBsum:1I3Q PDBsum:1I50
PDBsum:1I6H PDBsum:1K83 PDBsum:1NIK PDBsum:1NT9 PDBsum:1PQV
PDBsum:1R5U PDBsum:1R9S PDBsum:1R9T PDBsum:1SFO PDBsum:1TWA
PDBsum:1TWC PDBsum:1TWF PDBsum:1TWG PDBsum:1TWH PDBsum:1WCM
PDBsum:1Y1V PDBsum:1Y1W PDBsum:1Y1Y PDBsum:1Y77 PDBsum:2B63
PDBsum:2B8K PDBsum:2E2H PDBsum:2E2I PDBsum:2E2J PDBsum:2JA5
PDBsum:2JA6 PDBsum:2JA7 PDBsum:2JA8 PDBsum:2NVQ PDBsum:2NVT
PDBsum:2NVX PDBsum:2NVY PDBsum:2NVZ PDBsum:2R7Z PDBsum:2R92
PDBsum:2R93 PDBsum:2VUM PDBsum:2YU9 PDBsum:3CQZ PDBsum:3FKI
PDBsum:3GTG PDBsum:3GTJ PDBsum:3GTK PDBsum:3GTL PDBsum:3GTM
PDBsum:3GTO PDBsum:3GTP PDBsum:3GTQ PDBsum:3H3V PDBsum:3HOU
PDBsum:3HOV PDBsum:3HOW PDBsum:3HOX PDBsum:3HOY PDBsum:3HOZ
PDBsum:3I4M PDBsum:3I4N PDBsum:3K1F PDBsum:3K7A PDBsum:3M3Y
PDBsum:3M4O PDBsum:3PO2 PDBsum:3PO3 PDBsum:3QT1 PDBsum:3RZD
PDBsum:3RZO PDBsum:3S14 PDBsum:3S15 PDBsum:3S16 PDBsum:3S17
PDBsum:3S1M PDBsum:3S1N PDBsum:3S1Q PDBsum:3S1R PDBsum:3S2D
PDBsum:3S2H PDBsum:4A3B PDBsum:4A3C PDBsum:4A3D PDBsum:4A3E
PDBsum:4A3F PDBsum:4A3G PDBsum:4A3I PDBsum:4A3J PDBsum:4A3K
PDBsum:4A3L PDBsum:4A3M PDBsum:4A93 PDBsum:4BBR PDBsum:4BBS
HOGENOM:HOG000222975 OMA:KVLPWST KO:K03006 OrthoDB:EOG4J14H5
EMBL:X03128 EMBL:Z74188 PIR:S67686 RefSeq:NP_010141.1 PDB:2L0I
PDBsum:2L0I ProteinModelPortal:P04050 SMR:P04050 DIP:DIP-611N
IntAct:P04050 MINT:MINT-432838 STRING:P04050 PaxDb:P04050
PeptideAtlas:P04050 EnsemblFungi:YDL140C GeneID:851415
KEGG:sce:YDL140C CYGD:YDL140c GeneTree:ENSGT00700000105212
EvolutionaryTrace:P04050 NextBio:968606 ArrayExpress:P04050
Genevestigator:P04050 GermOnline:YDL140C Uniprot:P04050
Length = 1733
Score = 159 (61.0 bits), Expect = 8.4e-08, P = 8.4e-08
Identities = 67/218 (30%), Positives = 90/218 (41%)
Query: 112 LRAELMNAPNVDRRA-DGSYGGAT--GNSENETSGRPVGQNAYEDGYGVPQGHGPPPSAT 168
++ ELM +P VD + D GG T G ++ + P G AY G P++
Sbjct: 1486 VKDELMFSPLVDSGSNDAMAGGFTAYGGADYGEATSPFG--AY----------GEAPTSP 1533
Query: 169 TAGVVGAGPNTSTSAYAATQSG-TPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGP 227
GV G + ++ Y+ T +P +Y P P Y + P Y + +PSY PT P
Sbjct: 1534 GFGVSSPGFSPTSPTYSPTSPAYSPTSPSYS-PTSPSYSPTS-PSYSPT-SPSYSPTS-P 1589
Query: 228 SYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDM 287
SY P P Y PT P Y S Y + P+Y PSY P Y P+Y
Sbjct: 1590 SYSPTS-PSYSPTS-PSYSPTSPS-Y-SPTSPSYS-PTSPSYSPTSP-SYS-PTSPSYS- 1641
Query: 288 QRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGY 325
P Y P Y P Y + +PSY P P Y
Sbjct: 1642 PTSPSYSPTS-PSYS-PTSPAY-SPTSPSYSPT-SPSY 1675
>MGI|MGI:1330280 [details] [associations]
symbol:Krtap6-2 "keratin associated protein 6-2"
species:10090 "Mus musculus" [GO:0003674 "molecular_function"
evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
[GO:0005882 "intermediate filament" evidence=IEA] [GO:0008150
"biological_process" evidence=ND] MGI:MGI:1330280 GO:GO:0005882
CTD:337967 EMBL:D89902 IPI:IPI00116464 RefSeq:NP_034803.2
UniGene:Mm.3524 PRIDE:O08884 DNASU:16701 GeneID:16701
KEGG:mmu:16701 UCSC:uc007zvp.1 NextBio:290464 Genevestigator:O08884
Uniprot:O08884
Length = 159
Score = 128 (50.1 bits), Expect = 1.3e-07, P = 1.3e-07
Identities = 38/124 (30%), Positives = 40/124 (32%)
Query: 202 GPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNY 261
G GY + G GY Y G Y G GY G GY GS Y G Y
Sbjct: 13 GCGYGSGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGCGYGSGY 72
Query: 262 DIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQR 321
G Y G GY G Y G GY GY G Y + Y
Sbjct: 73 GCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGSGYGSGC 132
Query: 322 GPGY 325
G GY
Sbjct: 133 GCGY 136
Score = 126 (49.4 bits), Expect = 2.2e-07, P = 2.2e-07
Identities = 39/130 (30%), Positives = 40/130 (30%)
Query: 204 GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDI 263
G G GY + Y G Y G GY G GY GS Y G Y
Sbjct: 7 GNSCGYGCGYGSGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGC 66
Query: 264 HRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGP 323
G Y G GY G Y G GY GY G Y Y G
Sbjct: 67 GYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGS 126
Query: 324 GYDLQRGQGY 333
GY G GY
Sbjct: 127 GYGSGCGCGY 136
Score = 125 (49.1 bits), Expect = 2.8e-07, P = 2.8e-07
Identities = 40/136 (29%), Positives = 42/136 (30%)
Query: 226 GPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNY 285
G Y G GY G GY GS Y G Y G Y G GY G Y
Sbjct: 13 GCGYGSGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGCGYGSGY 72
Query: 286 DMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSR 345
G GY GY G Y Y G GY G GY Y
Sbjct: 73 GCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGSGYGSGC 132
Query: 346 GTGFDGAPR-GAAPHG 360
G G+ R G +G
Sbjct: 133 GCGYGSYYRSGCCGYG 148
Score = 124 (48.7 bits), Expect = 3.6e-07, P = 3.6e-07
Identities = 34/112 (30%), Positives = 37/112 (33%)
Query: 190 GTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQK 249
G+ + Y G GY G GY Y G Y G GY G GY
Sbjct: 17 GSGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGY 76
Query: 250 GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGY 301
GS Y G Y G Y G GY G Y G GY + GY
Sbjct: 77 GSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGSGY 128
Score = 118 (46.6 bits), Expect = 1.6e-06, P = 1.6e-06
Identities = 33/107 (30%), Positives = 35/107 (32%)
Query: 195 AAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYD 254
+ Y G GY G GY Y G Y G GY G GY GS Y
Sbjct: 30 SGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYG 89
Query: 255 AQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGY 301
G Y G Y G GY G Y G GY + GY
Sbjct: 90 CGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGSGYGSGCGCGY 136
Score = 118 (46.6 bits), Expect = 1.6e-06, P = 1.6e-06
Identities = 34/120 (28%), Positives = 39/120 (32%)
Query: 174 GAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAK 233
G+G + + + G + Y G GY G GY Y G Y
Sbjct: 17 GSGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGY 76
Query: 234 GPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY 293
G GY G GY GS Y G Y G Y G GY G Y G GY
Sbjct: 77 GSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGSGYGSGCGCGY 136
Score = 111 (44.1 bits), Expect = 7.4e-05, P = 7.4e-05
Identities = 35/127 (27%), Positives = 40/127 (31%)
Query: 151 YEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKG 210
Y GYG G+G + G G + + G + Y G GY G
Sbjct: 12 YGCGYG--SGYGSGYGCGSGSGYGCGYGSGYGCGYGSGYGCGSGSGYGCGYGSGYGCGYG 69
Query: 211 PGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYD 270
GY Y G Y G GY G GY GS Y G Y G Y
Sbjct: 70 SGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGSGYGSGYG 129
Query: 271 PQRGLGY 277
G GY
Sbjct: 130 SGCGCGY 136
>WB|WBGene00002280 [details] [associations]
symbol:let-2 species:6239 "Caenorhabditis elegans"
[GO:0005201 "extracellular matrix structural constituent"
evidence=IEA] [GO:0005581 "collagen" evidence=IEA] [GO:0040007
"growth" evidence=IMP] [GO:0002119 "nematode larval development"
evidence=IMP] [GO:0009792 "embryo development ending in birth or
egg hatching" evidence=IMP] [GO:0000003 "reproduction"
evidence=IMP] [GO:0040011 "locomotion" evidence=IMP] [GO:0040039
"inductive cell migration" evidence=IMP] [GO:0040018 "positive
regulation of multicellular organism growth" evidence=IMP]
[GO:0005604 "basement membrane" evidence=IDA] [GO:0005198
"structural molecule activity" evidence=IDA] InterPro:IPR001442
Pfam:PF01413 PROSITE:PS51403 SMART:SM00111 GO:GO:0009792
GO:GO:0040007 GO:GO:0002119 InterPro:IPR016187 SUPFAM:SSF56436
GO:GO:0040018 GO:GO:0000003 GO:GO:0040039 InterPro:IPR008160
Pfam:PF01391 eggNOG:NOG12793 GO:GO:0030020 HOGENOM:HOG000085652
GO:GO:0005587 Gene3D:2.170.240.10 KO:K06237 EMBL:Z22964 EMBL:U22327
EMBL:FO081065 EMBL:J05066 PIR:A34476 PIR:T29350 PIR:T29351
RefSeq:NP_510663.1 RefSeq:NP_510664.1 ProteinModelPortal:P17140
SMR:P17140 STRING:P17140 PaxDb:P17140 PRIDE:P17140 GeneID:181708
KEGG:cel:CELE_F01G12.5 UCSC:F01G12.5b.1 CTD:181708
WormBase:F01G12.5a WormBase:F01G12.5b InParanoid:P17140
NextBio:915032 GO:GO:0016043 Uniprot:P17140
Length = 1758
Score = 157 (60.3 bits), Expect = 1.4e-07, P = 1.4e-07
Identities = 82/261 (31%), Positives = 95/261 (36%)
Query: 124 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQG-HGPPPSATTAGVVGAGPNTSTS 182
++ + Y G G N P G + DG P G G P + G G P
Sbjct: 335 QKGEAGYPGRDGPKGNSGPPGPPGGGTFNDGAPGPPGLPGRPGNPGPPGTDGY-PGAPGP 393
Query: 183 AYAATQSGTPMRAAYDIPRG-PGYEASKGPGYDASKAPSYDPTKG-PSYDPAKG-PGYDP 239
A +G P Y G PG + KG G AP G P KG PGY
Sbjct: 394 AGPIGNTGGPGLPGYPGNEGLPGPKGDKGDG-GIPGAPGVSGPSGIPGLPGPKGEPGYRG 452
Query: 240 TKG------PGYDAQKG-SNYDAQRGPN-YDIHRGPSYDPQRGL-GYDMQRG---PN-YD 286
T G PG D + G ++G N RGP D GL G QRG PN YD
Sbjct: 453 TPGQSIPGLPGKDGKPGLDGAPGRKGENGLPGVRGPPGDSLNGLPGAPGQRGAPGPNGYD 512
Query: 287 MQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYD-PS 344
+ G PG RG A AP ++G PGY Q G D R P P
Sbjct: 513 GRDGVN-GLPGAPGTKGDRGGTCSAC-APGTKGEKGLPGYSGQPGPQGD-RGLPGMPGPV 569
Query: 345 RGTGFDGAPRGAAPHGQVPPP 365
G DG P A G PP
Sbjct: 570 GDAGDDGLPGPAGRPGSPGPP 590
>UNIPROTKB|P17140 [details] [associations]
symbol:let-2 "Collagen alpha-2(IV) chain" species:6239
"Caenorhabditis elegans" [GO:0016043 "cellular component
organization" evidence=NAS] [GO:0030020 "extracellular matrix
structural constituent conferring tensile strength" evidence=IMP]
[GO:0005587 "collagen type IV" evidence=IMP] InterPro:IPR001442
Pfam:PF01413 PROSITE:PS51403 SMART:SM00111 GO:GO:0009792
GO:GO:0040007 GO:GO:0002119 InterPro:IPR016187 SUPFAM:SSF56436
GO:GO:0040018 GO:GO:0000003 GO:GO:0040039 InterPro:IPR008160
Pfam:PF01391 eggNOG:NOG12793 GO:GO:0030020 HOGENOM:HOG000085652
GO:GO:0005587 Gene3D:2.170.240.10 KO:K06237 EMBL:Z22964 EMBL:U22327
EMBL:FO081065 EMBL:J05066 PIR:A34476 PIR:T29350 PIR:T29351
RefSeq:NP_510663.1 RefSeq:NP_510664.1 ProteinModelPortal:P17140
SMR:P17140 STRING:P17140 PaxDb:P17140 PRIDE:P17140 GeneID:181708
KEGG:cel:CELE_F01G12.5 UCSC:F01G12.5b.1 CTD:181708
WormBase:F01G12.5a WormBase:F01G12.5b InParanoid:P17140
NextBio:915032 GO:GO:0016043 Uniprot:P17140
Length = 1758
Score = 157 (60.3 bits), Expect = 1.4e-07, P = 1.4e-07
Identities = 82/261 (31%), Positives = 95/261 (36%)
Query: 124 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQG-HGPPPSATTAGVVGAGPNTSTS 182
++ + Y G G N P G + DG P G G P + G G P
Sbjct: 335 QKGEAGYPGRDGPKGNSGPPGPPGGGTFNDGAPGPPGLPGRPGNPGPPGTDGY-PGAPGP 393
Query: 183 AYAATQSGTPMRAAYDIPRG-PGYEASKGPGYDASKAPSYDPTKG-PSYDPAKG-PGYDP 239
A +G P Y G PG + KG G AP G P KG PGY
Sbjct: 394 AGPIGNTGGPGLPGYPGNEGLPGPKGDKGDG-GIPGAPGVSGPSGIPGLPGPKGEPGYRG 452
Query: 240 TKG------PGYDAQKG-SNYDAQRGPN-YDIHRGPSYDPQRGL-GYDMQRG---PN-YD 286
T G PG D + G ++G N RGP D GL G QRG PN YD
Sbjct: 453 TPGQSIPGLPGKDGKPGLDGAPGRKGENGLPGVRGPPGDSLNGLPGAPGQRGAPGPNGYD 512
Query: 287 MQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYD-PS 344
+ G PG RG A AP ++G PGY Q G D R P P
Sbjct: 513 GRDGVN-GLPGAPGTKGDRGGTCSAC-APGTKGEKGLPGYSGQPGPQGD-RGLPGMPGPV 569
Query: 345 RGTGFDGAPRGAAPHGQVPPP 365
G DG P A G PP
Sbjct: 570 GDAGDDGLPGPAGRPGSPGPP 590
>ZFIN|ZDB-GENE-030131-5725 [details] [associations]
symbol:arid1ab "AT rich interactive domain 1Ab
(SWI-like)" species:7955 "Danio rerio" [GO:0003677 "DNA binding"
evidence=IEA] [GO:0005622 "intracellular" evidence=IEA]
InterPro:IPR001606 Pfam:PF01388 PROSITE:PS51011 SMART:SM00501
ZFIN:ZDB-GENE-030131-5725 GO:GO:0003677 GO:GO:0005622
Gene3D:1.10.150.60 InterPro:IPR021906 Pfam:PF12031 SUPFAM:SSF46774
GeneTree:ENSGT00550000074575 EMBL:CABZ01050711 EMBL:CT027837
IPI:IPI00485842 Ensembl:ENSDART00000084272 Bgee:F1RE50
Uniprot:F1RE50
Length = 2135
Score = 157 (60.3 bits), Expect = 1.5e-07, Sum P(2) = 1.5e-07
Identities = 78/257 (30%), Positives = 104/257 (40%)
Query: 128 GSYGGATGNSENETSGRPVGQNAYEDGYGVP-QGHGPP-PSATTAGVVGAGPNTSTSAYA 185
G + GA GN ++ G P + G P QG+GPP P G+ G TS + +
Sbjct: 312 GQHYGA-GNPYSQQQGPPPSS---QQGPPYPGQGYGPPGPQRYPMGMQG---RTSGNL-S 363
Query: 186 ATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGP---SY--DPAKGPGYDP- 239
Q G M Y GPG GY + PS P GP SY P+ GPG P
Sbjct: 364 GIQYGQQM--GYG-QHGPGGYGQNQAGYYGQQGPS--PHGGPQQSSYPQQPSTGPGSQPP 418
Query: 240 -TKGPGYD--AQKGSNYDAQRGPNYDIHRGPSYD--PQRGLG---YDMQRGPNYDMQRGP 291
++ P Q G++Y +GP+ P Y PQ G + +GP
Sbjct: 419 YSQQPSGTPHGQSGTSYGQPQGPHVPNQGQPPYSQTPQSQSGQSPFPQSQGPTQSQGPSQ 478
Query: 292 GYE-TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSY---DPSRGT 347
G + +Q PGY P Q A Q+GP Q+ QG + PS PS+ T
Sbjct: 479 GQQGSQSQPGYT--HPPSGSGQPAQ----QQGPS---QQQQGPPQSQTPSSAPPQPSQQT 529
Query: 348 GFDGAPRGAAPHGQVPP 364
G P +P+ Q PP
Sbjct: 530 SGQGQP---SPYSQTPP 543
Score = 125 (49.1 bits), Expect = 0.00055, P = 0.00055
Identities = 79/298 (26%), Positives = 109/298 (36%)
Query: 115 ELMNAPNVDRRADGSYGGATGNSENETSGR-PVGQNA-YEDGYGVPQ--GHGPPPSATTA 170
+L+ +P+ R G E G +G ++ Y G+ Q H PPP +
Sbjct: 232 QLLTSPSSTRSYQNYPASEYGGQEGAAKGPGDMGSSSQYGGGHPAWQQRSHHPPPMSP-- 289
Query: 171 GVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYD 230
G G T Q G Y G Y +GP + + P Y P +G Y
Sbjct: 290 GNTGQANRTQPPG-PMDQVGKIRGQHYGA--GNPYSQQQGPPPSSQQGPPY-PGQG--YG 343
Query: 231 PAKGPGYDPTKGPGYDAQK--GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPN---- 284
P GP P G + G Y Q G Y H GP Q GY Q+GP+
Sbjct: 344 PP-GPQRYPMGMQGRTSGNLSGIQYGQQMG--YGQH-GPGGYGQNQAGYYGQQGPSPHGG 399
Query: 285 -----YDMQ--RGPGYE---TQRVPGYDV-QRGPVYEAQRAPSYIPQRG-PGYDLQRGQG 332
Y Q GPG + +Q+ G Q G Y + P ++P +G P Y Q Q
Sbjct: 400 PQQSSYPQQPSTGPGSQPPYSQQPSGTPHGQSGTSYGQPQGP-HVPNQGQPPYS-QTPQS 457
Query: 333 YDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNP 390
++P + S+G P Q P + P GS P + G Q + G P
Sbjct: 458 QS-GQSP-FPQSQGPTQSQGPSQGQQGSQSQPGYTHPPSGSGQPAQQQGPSQQQQGPP 513
Score = 50 (22.7 bits), Expect = 1.5e-07, Sum P(2) = 1.5e-07
Identities = 13/33 (39%), Positives = 17/33 (51%)
Query: 360 GQVPPP--LNNVP---YGSATPPARSGSGQPRG 387
G+ PPP NN P PP+ +GSG +G
Sbjct: 1061 GEDPPPDFFNNDPKKNQAKVQPPSPAGSGSLQG 1093
>WB|WBGene00000123 [details] [associations]
symbol:ama-1 species:6239 "Caenorhabditis elegans"
[GO:0003899 "DNA-directed RNA polymerase activity"
evidence=IEA;ISS] [GO:0003677 "DNA binding" evidence=IEA]
[GO:0005665 "DNA-directed RNA polymerase II, core complex"
evidence=IEA] [GO:0006366 "transcription from RNA polymerase II
promoter" evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
evidence=IEA;IMP] [GO:0009792 "embryo development ending in birth
or egg hatching" evidence=IMP] [GO:0000003 "reproduction"
evidence=IMP] [GO:0040010 "positive regulation of growth rate"
evidence=IMP] [GO:0007052 "mitotic spindle organization"
evidence=IMP] [GO:0010458 "exit from mitosis" evidence=IMP]
[GO:0008356 "asymmetric cell division" evidence=IMP] [GO:0032502
"developmental process" evidence=IMP] [GO:0006479 "protein
methylation" evidence=IMP] [GO:0007369 "gastrulation" evidence=IMP]
[GO:0005634 "nucleus" evidence=IDA] [GO:0001055 "RNA polymerase II
activity" evidence=IMP] [GO:0042789 "mRNA transcription from RNA
polymerase II promoter" evidence=IMP] InterPro:IPR000684
InterPro:IPR000722 InterPro:IPR006592 InterPro:IPR007066
InterPro:IPR007073 InterPro:IPR007075 InterPro:IPR007080
InterPro:IPR007081 InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983
Pfam:PF04990 Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000
Pfam:PF05001 PROSITE:PS00115 SMART:SM00663 GO:GO:0005634
GO:GO:0009792 GO:GO:0040010 GO:GO:0007052 GO:GO:0010458
GO:GO:0046872 GO:GO:0003677 GO:GO:0000003 Gene3D:2.40.40.20
InterPro:IPR009010 GO:GO:0006479 GO:GO:0008356 GO:GO:0007369
GO:GO:0042789 EMBL:FO081153 eggNOG:COG0086 GO:GO:0005665
EMBL:M29235 PIR:A34092 PIR:T29959 RefSeq:NP_500523.4 IntAct:P16356
STRING:P16356 PaxDb:P16356 EnsemblMetazoa:F36A4.7.1
EnsemblMetazoa:F36A4.7.2 GeneID:177190 KEGG:cel:CELE_F36A4.7
UCSC:F36A4.7 CTD:247749 WormBase:F36A4.7
GeneTree:ENSGT00700000104490 HOGENOM:HOG000222975 InParanoid:P16356
OMA:KVLPWST NextBio:895720 GO:GO:0001055 Uniprot:P16356
Length = 1856
Score = 157 (60.3 bits), Expect = 1.5e-07, P = 1.5e-07
Identities = 68/254 (26%), Positives = 93/254 (36%)
Query: 128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAAT 187
G GA + T G G + + +G P G P A + G S Y+ +
Sbjct: 1527 GMSPGAGFSPAGNTDG---GASPFNEGGWSPASPGDPLGALSPRTPSYG-GMSPGVYSPS 1582
Query: 188 QSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDA 247
M + + P P Y + + +PSY PT PSY P P Y PT P Y
Sbjct: 1583 SPQFSMTSPHYSPTSPSYSPTSPAAGQSPVSPSYSPTS-PSYSPTS-PSYSPTS-PSYSP 1639
Query: 248 QKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGP 307
S Y + P+Y PSY P Y P+Y P Y P Y P
Sbjct: 1640 TSPS-Y-SPTSPSYS-PTSPSYSPSSP-SYSPS-SPSYSPS-SPRYSPTS-PTYS-PTSP 1691
Query: 308 VYEAQRAPSYIPQRGPGYD-----LQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQV 362
Y + +P+Y P P Y + G GY +P Y PS T +P + Q
Sbjct: 1692 TY-SPTSPTYSPT-SPTYSPTSPSYESGGGYSPS-SPKYSPSSPTYSPTSPSYSPTSPQY 1748
Query: 363 PPPLNNVPYGSATP 376
P + Y ++P
Sbjct: 1749 SP--TSPQYSPSSP 1760
Score = 154 (59.3 bits), Expect = 3.2e-07, P = 3.2e-07
Identities = 65/219 (29%), Positives = 87/219 (39%)
Query: 164 PPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDP 223
P + T+ G P S S + S +P +Y P P Y + P Y + +PSY P
Sbjct: 1598 PSYSPTSPAAGQSP-VSPSYSPTSPSYSPTSPSYS-PTSPSYSPTS-PSYSPT-SPSYSP 1653
Query: 224 TKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGP 283
T PSY P+ P Y P+ P Y + Y + P Y P+Y P Y P
Sbjct: 1654 TS-PSYSPSS-PSYSPSS-PSY-SPSSPRY-SPTSPTYS-PTSPTYSPTSPT-YS-PTSP 1705
Query: 284 NYD-----MQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRA 338
Y + G GY P Y P Y + +PSY P P Y Q Y +
Sbjct: 1706 TYSPTSPSYESGGGYSPSS-PKYSPS-SPTY-SPTSPSYSPT-SPQYSPTSPQ-YSPS-S 1759
Query: 339 PSYDPSRGTGFDGAPRG-AAPHGQVPPPLNNVPYGSATP 376
P+Y PS T +PRG ++P P + S TP
Sbjct: 1760 PTYTPSSPTYNPTSPRGFSSPQYSPTSPTYSPTSPSYTP 1798
>UNIPROTKB|P16356 [details] [associations]
symbol:ama-1 "DNA-directed RNA polymerase II subunit RPB1"
species:6239 "Caenorhabditis elegans" [GO:0005515 "protein binding"
evidence=IPI] InterPro:IPR000684 InterPro:IPR000722
InterPro:IPR006592 InterPro:IPR007066 InterPro:IPR007073
InterPro:IPR007075 InterPro:IPR007080 InterPro:IPR007081
InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983 Pfam:PF04990
Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000 Pfam:PF05001
PROSITE:PS00115 SMART:SM00663 GO:GO:0005634 GO:GO:0009792
GO:GO:0040010 GO:GO:0007052 GO:GO:0010458 GO:GO:0046872
GO:GO:0003677 GO:GO:0000003 Gene3D:2.40.40.20 InterPro:IPR009010
GO:GO:0006479 GO:GO:0008356 GO:GO:0007369 GO:GO:0042789
EMBL:FO081153 eggNOG:COG0086 GO:GO:0005665 EMBL:M29235 PIR:A34092
PIR:T29959 RefSeq:NP_500523.4 IntAct:P16356 STRING:P16356
PaxDb:P16356 EnsemblMetazoa:F36A4.7.1 EnsemblMetazoa:F36A4.7.2
GeneID:177190 KEGG:cel:CELE_F36A4.7 UCSC:F36A4.7 CTD:247749
WormBase:F36A4.7 GeneTree:ENSGT00700000104490 HOGENOM:HOG000222975
InParanoid:P16356 OMA:KVLPWST NextBio:895720 GO:GO:0001055
Uniprot:P16356
Length = 1856
Score = 157 (60.3 bits), Expect = 1.5e-07, P = 1.5e-07
Identities = 68/254 (26%), Positives = 93/254 (36%)
Query: 128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAAT 187
G GA + T G G + + +G P G P A + G S Y+ +
Sbjct: 1527 GMSPGAGFSPAGNTDG---GASPFNEGGWSPASPGDPLGALSPRTPSYG-GMSPGVYSPS 1582
Query: 188 QSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDA 247
M + + P P Y + + +PSY PT PSY P P Y PT P Y
Sbjct: 1583 SPQFSMTSPHYSPTSPSYSPTSPAAGQSPVSPSYSPTS-PSYSPTS-PSYSPTS-PSYSP 1639
Query: 248 QKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGP 307
S Y + P+Y PSY P Y P+Y P Y P Y P
Sbjct: 1640 TSPS-Y-SPTSPSYS-PTSPSYSPSSP-SYSPS-SPSYSPS-SPRYSPTS-PTYS-PTSP 1691
Query: 308 VYEAQRAPSYIPQRGPGYD-----LQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQV 362
Y + +P+Y P P Y + G GY +P Y PS T +P + Q
Sbjct: 1692 TY-SPTSPTYSPT-SPTYSPTSPSYESGGGYSPS-SPKYSPSSPTYSPTSPSYSPTSPQY 1748
Query: 363 PPPLNNVPYGSATP 376
P + Y ++P
Sbjct: 1749 SP--TSPQYSPSSP 1760
Score = 154 (59.3 bits), Expect = 3.2e-07, P = 3.2e-07
Identities = 65/219 (29%), Positives = 87/219 (39%)
Query: 164 PPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDP 223
P + T+ G P S S + S +P +Y P P Y + P Y + +PSY P
Sbjct: 1598 PSYSPTSPAAGQSP-VSPSYSPTSPSYSPTSPSYS-PTSPSYSPTS-PSYSPT-SPSYSP 1653
Query: 224 TKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGP 283
T PSY P+ P Y P+ P Y + Y + P Y P+Y P Y P
Sbjct: 1654 TS-PSYSPSS-PSYSPSS-PSY-SPSSPRY-SPTSPTYS-PTSPTYSPTSPT-YS-PTSP 1705
Query: 284 NYD-----MQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRA 338
Y + G GY P Y P Y + +PSY P P Y Q Y +
Sbjct: 1706 TYSPTSPSYESGGGYSPSS-PKYSPS-SPTY-SPTSPSYSPT-SPQYSPTSPQ-YSPS-S 1759
Query: 339 PSYDPSRGTGFDGAPRG-AAPHGQVPPPLNNVPYGSATP 376
P+Y PS T +PRG ++P P + S TP
Sbjct: 1760 PTYTPSSPTYNPTSPRGFSSPQYSPTSPTYSPTSPSYTP 1798
>UNIPROTKB|J9P0I3 [details] [associations]
symbol:ZNF768 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0008270 "zinc ion binding" evidence=IEA]
[GO:0005622 "intracellular" evidence=IEA] [GO:0003676 "nucleic acid
binding" evidence=IEA] Pfam:PF00096 InterPro:IPR007087
InterPro:IPR013087 InterPro:IPR015880 PROSITE:PS00028
PROSITE:PS50157 SMART:SM00355 GO:GO:0008270 GO:GO:0003676
GO:GO:0005622 Gene3D:3.30.160.60 GeneTree:ENSGT00700000104520
KO:K09228 CTD:79724 OMA:SRYESQN EMBL:AAEX03004391
RefSeq:XP_547025.2 Ensembl:ENSCAFT00000045233 GeneID:489906
KEGG:cfa:489906 Uniprot:J9P0I3
Length = 554
Score = 148 (57.2 bits), Expect = 3.1e-07, P = 3.1e-07
Identities = 48/170 (28%), Positives = 77/170 (45%)
Query: 127 DGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAA 186
+GS G +E E + G YE +P G P + G + + +
Sbjct: 25 EGSLKGNMSENEEEEMSQQEGTGDYEVEE-IP--FGLDPQSPGFEPQSPGFESQSPRFEP 81
Query: 187 TQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYD 246
G R+ +P P + A + P D S++P ++P + P Y+P + PGY+P + PGY+
Sbjct: 82 ESPGFESRSPGFVPPSPEF-APRSPDSD-SQSPEFEP-QSPRYEP-QSPGYEP-RSPGYE 136
Query: 247 AQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQ 296
K Y+ + P Y+ R P Y+ Q GY+ Q P + Q P +E Q
Sbjct: 137 P-KSPGYEP-KSPGYE-PRSPGYESQSP-GYEPQN-PEFKTQ-SPEFEAQ 180
>FB|FBgn0035872 [details] [associations]
symbol:CG7185 species:7227 "Drosophila melanogaster"
[GO:0003729 "mRNA binding" evidence=ISS] [GO:0006379 "mRNA
cleavage" evidence=ISS] [GO:0005849 "mRNA cleavage factor complex"
evidence=ISS] [GO:0000166 "nucleotide binding" evidence=IEA]
[GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005634
"nucleus" evidence=IC] [GO:0000381 "regulation of alternative mRNA
splicing, via spliceosome" evidence=IMP] InterPro:IPR000504
InterPro:IPR012677 PROSITE:PS50102 SMART:SM00360 EMBL:AE014296
GO:GO:0000166 GO:GO:0003729 Gene3D:3.30.70.330 GO:GO:0000381
GO:GO:0006379 GO:GO:0005849 eggNOG:NOG313287 KO:K14398
GeneTree:ENSGT00690000101901 EMBL:AY058563 RefSeq:NP_648206.1
UniGene:Dm.887 ProteinModelPortal:Q9VSH4 SMR:Q9VSH4 IntAct:Q9VSH4
MINT:MINT-1562127 STRING:Q9VSH4 PaxDb:Q9VSH4
EnsemblMetazoa:FBtr0076710 GeneID:38937 KEGG:dme:Dmel_CG7185
UCSC:CG7185-RA FlyBase:FBgn0035872 InParanoid:Q9VSH4 OMA:PYERGDY
OrthoDB:EOG4S1RQ4 PhylomeDB:Q9VSH4 ChiTaRS:CG7185 GenomeRNAi:38937
NextBio:811101 Bgee:Q9VSH4 Uniprot:Q9VSH4
Length = 652
Score = 141 (54.7 bits), Expect = 3.2e-07, Sum P(2) = 3.2e-07
Identities = 63/199 (31%), Positives = 79/199 (39%)
Query: 200 PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPG-YDAQKGSNYDAQRG 258
PRGP S G G + P GP P +G + PG Y Q S G
Sbjct: 197 PRGPA-PPSMGGGPMPTGHPGGPQGGGPPGHPPRG--MNSIMQPGQYRPQHMSQVPQVGG 253
Query: 259 PNYDIHR-GPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSY 317
PN R P PQ GL + Q P Y +G + QR PG + GP + P +
Sbjct: 254 PNSGPPRMQPPMHPQGGLMGNQQPPPRYPSAQGQ-WPGQR-PG-GPRPGPPNGPPQRPMF 310
Query: 318 IPQRGP-GYDLQRGQGYDMRRAPSYD--PSRGT--GFDGAPRGAAPHGQVPPPLNNVPYG 372
Q GP G ++ G D RR P + P +G G AP PHG P +N +
Sbjct: 311 --QGGPMGMPVRGPAGPDWRRPPMHGGFPPQGPPRGLPPAPGPGGPHGAPAPHVNPAFFN 368
Query: 373 SATPPARS-GSGQPRGGNP 390
PA+ G G P G P
Sbjct: 369 QPGGPAQHPGMGGPPHGAP 387
Score = 112 (44.5 bits), Expect = 0.00049, Sum P(2) = 0.00049
Identities = 53/171 (30%), Positives = 61/171 (35%)
Query: 223 PTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG 282
P +GP+ P+ G G PT PG G RG N + G Y PQ G
Sbjct: 196 PPRGPA-PPSMGGGPMPTGHPGGPQGGGPPGHPPRGMNSIMQPG-QYRPQHMSQVPQVGG 253
Query: 283 PNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG--PGYDLQRGQGYDMRRAPS 340
PN GP + P Q G + Q P Y +G PG QR G R P
Sbjct: 254 PN----SGP---PRMQPPMHPQGGLMGNQQPPPRYPSAQGQWPG---QRPGG--PRPGPP 301
Query: 341 YDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPA 391
P + F G P G G P P PP G PRG PA
Sbjct: 302 NGPPQRPMFQGGPMGMPVRGPAGPDWRRPPMHGGFPP----QGPPRGLPPA 348
Score = 52 (23.4 bits), Expect = 3.2e-07, Sum P(2) = 3.2e-07
Identities = 24/76 (31%), Positives = 30/76 (39%)
Query: 135 GNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMR 194
G +++E G G + Y+D G GP SA + G G G S A SG P
Sbjct: 19 GQAQDEFGGD--GVDLYDD-IG-----GPTESAASGG--GGGGTPSADGAAGPGSGEPGE 68
Query: 195 AAYDIPRGPGYEASKG 210
P G Y S G
Sbjct: 69 RNSGGPNGV-YHQSSG 83
Score = 41 (19.5 bits), Expect = 4.3e-06, Sum P(2) = 4.3e-06
Identities = 9/22 (40%), Positives = 11/22 (50%)
Query: 126 ADGSYGGATGNSENETSGRPVG 147
ADG+ G +G SG P G
Sbjct: 55 ADGAAGPGSGEPGERNSGGPNG 76
>UNIPROTKB|J3KNM7 [details] [associations]
symbol:COL4A4 "Collagen alpha-4(IV) chain" species:9606
"Homo sapiens" [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
EMBL:CH471063 InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005581
InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 Gene3D:2.170.240.10
EMBL:AC079235 EMBL:AC073149 UniGene:Hs.591645 HGNC:HGNC:2206
ChiTaRS:COL4A4 ProteinModelPortal:J3KNM7 Ensembl:ENST00000329662
Uniprot:J3KNM7
Length = 1687
Score = 153 (58.9 bits), Expect = 3.7e-07, P = 3.7e-07
Identities = 81/253 (32%), Positives = 101/253 (39%)
Query: 151 YEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKG 210
Y +G P GPP G GA P S S + GTP A +IP PG+ G
Sbjct: 672 YPGRHGPPGFDGPPGPKGFPGPQGA-PGLSGSDGHKGRPGTPGTA--EIPGPPGFRGDMG 728
Query: 211 -PGYDASKAPSYDPTKGPSYDPA----KG-PGYDPTKGP-GYDAQKG-SNYDAQRGPNYD 262
PG+ K S GP P KG PG DP G G ++G S +GP D
Sbjct: 729 DPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPG-DPAFGHLGPPGKRGLSGVPGIKGPRGD 787
Query: 263 IHRG-PSYDPQRGL-GYDMQRGPN-YDMQRG-PGYETQRVPGYDVQRG-PVYEAQRA-PS 316
G P + G+ G+ +GP + G PG PG+ +RG P Q P
Sbjct: 788 --PGCPGAEGPAGIPGFLGLKGPKGREGHAGFPGVPGP--PGHSCERGAPGIPGQPGLPG 843
Query: 317 YIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVP--YGSA 374
Y P PG +GQ D+ P P+ G G P HG PP L +P +G
Sbjct: 844 Y-PG-SPGAPGGKGQPGDV--GPP-GPAGMKGLPGLPGRPGAHG--PPGLPGIPGPFGDD 896
Query: 375 TPPARSGSGQPRG 387
P G PRG
Sbjct: 897 GLPGPPGPKGPRG 909
Score = 130 (50.8 bits), Expect = 0.00012, P = 0.00012
Identities = 81/260 (31%), Positives = 104/260 (40%)
Query: 152 EDGY-GVPQGHGPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEAS 208
E G+ GVP GH P G+ G G S + + G P +D P GP G+
Sbjct: 640 ERGHPGVP-GH--PGVRGPDGLKGQKGDTISCNVTYPGRHGPP---GFDGPPGPKGFPGP 693
Query: 209 KG-PGYDASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGP----NY 261
+G PG S P T G + P PG+ G PG+ +KGS+ GP
Sbjct: 694 QGAPGLSGSDGHKGRPGTPGTAEIPGP-PGFRGDMGDPGFGGEKGSSPVGPPGPPGSPGV 752
Query: 262 DIHRGPSYDPQRG-LGYDMQRG----PNYDMQRG----PGYETQR-VPGYDVQRGPVYEA 311
+ +G DP G LG +RG P RG PG E +PG+ +GP
Sbjct: 753 NGQKGIPGDPAFGHLGPPGKRGLSGVPGIKGPRGDPGCPGAEGPAGIPGFLGLKGPKGRE 812
Query: 312 QRA--PSYIPQRGPGYDLQRGQ-GYDMRRA-PSYDPSRGTGFDGAPRGAAPHGQVPPPLN 367
A P +P PG+ +RG G + P Y P G GAP G G V PP
Sbjct: 813 GHAGFPG-VPGP-PGHSCERGAPGIPGQPGLPGY-P----GSPGAPGGKGQPGDVGPPGP 865
Query: 368 NVPYGSATPPARSGSGQPRG 387
G P R G+ P G
Sbjct: 866 AGMKGLPGLPGRPGAHGPPG 885
Score = 123 (48.4 bits), Expect = 0.00070, P = 0.00070
Identities = 81/280 (28%), Positives = 104/280 (37%)
Query: 132 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGT 191
GA+G + G PVG + G P G P G G P S+ +G
Sbjct: 1190 GASGLHDVGPPG-PVGIPGLKGERGDPGSPGISPPGPR-GKKGP-PGPPGSSGPPGPAGA 1246
Query: 192 PMRAAYDIPRGPGYEASKGP-GYDASK-AP-------SYDPTKGPSYD-----PAKGPGY 237
RA DIP PG +GP G D + AP S D +G D P PG
Sbjct: 1247 TGRAPKDIP-DPGPPGDQGPPGPDGPRGAPGPPGLPGSVDLLRGEPGDCGLPGPPGPPG- 1304
Query: 238 DPTKGPGYDAQKGSN-YDAQRGP-NYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRGP-GY 293
P PGY G + D Q+GP + +GP P G ++G P ++GP G
Sbjct: 1305 -PPGPPGYKGFPGCDGKDGQKGPVGFPGPQGPHGFP----GPPGEKGLPGPPGRKGPTGL 1359
Query: 294 ETQRVPGYDVQRGP-VYEAQRAPSYI-PQRGPGYDLQRG-QGYDMRRAPSYDPSRGTGFD 350
+ P DV P + AP P+ G RG G + P D RG D
Sbjct: 1360 PGEPGPPADVDDCPRIPGLPGAPGMRGPEGAMGLPGMRGPSGPGCKGEPGLDGRRGV--D 1417
Query: 351 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNP 390
G P P G+ + G PP G P+G P
Sbjct: 1418 GVPGSPGPPGRKGDTGEDGYPGGPGPPGPIGDPGPKGFGP 1457
>UNIPROTKB|P53420 [details] [associations]
symbol:COL4A4 "Collagen alpha-4(IV) chain" species:9606
"Homo sapiens" [GO:0005587 "collagen type IV" evidence=IDA]
[GO:0005201 "extracellular matrix structural constituent"
evidence=IMP] [GO:0032836 "glomerular basement membrane
development" evidence=IMP] [GO:0005605 "basal lamina" evidence=IDA]
[GO:0005576 "extracellular region" evidence=TAS] [GO:0005788
"endoplasmic reticulum lumen" evidence=TAS] [GO:0007411 "axon
guidance" evidence=TAS] [GO:0030198 "extracellular matrix
organization" evidence=TAS] InterPro:IPR001442 Pfam:PF01413
PROSITE:PS51403 SMART:SM00111 Reactome:REACT_118779
Reactome:REACT_111045 Reactome:REACT_111102 GO:GO:0007411
InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005788 GO:GO:0005605
InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793 GO:GO:0005201
HOVERGEN:HBG004933 HOGENOM:HOG000085652 GO:GO:0005587
Gene3D:2.170.240.10 KO:K06237 OrthoDB:EOG4XGZZF EMBL:AC079235
EMBL:AB008496 MIM:141200 MIM:203780 Orphanet:88919 Orphanet:97562
GO:GO:0032836 EMBL:X81053 EMBL:Y17397 EMBL:Y17398 EMBL:Y17399
EMBL:Y17400 EMBL:Y17401 EMBL:Y17402 EMBL:Y17403 EMBL:Y17404
EMBL:Y17405 EMBL:Y17406 EMBL:Y17407 EMBL:Y17408 EMBL:Y17409
EMBL:Y17410 EMBL:Y17411 EMBL:Y17412 EMBL:Y17413 EMBL:Y17427
EMBL:Y17426 EMBL:Y17414 EMBL:Y17415 EMBL:Y17416 EMBL:Y17417
EMBL:Y17418 EMBL:Y17419 EMBL:Y17420 EMBL:Y17443 EMBL:Y17442
EMBL:Y17441 EMBL:Y17440 EMBL:Y17439 EMBL:Y17438 EMBL:Y17437
EMBL:Y17436 EMBL:Y17435 EMBL:Y17434 EMBL:Y17433 EMBL:Y17432
EMBL:Y17431 EMBL:Y17430 EMBL:Y17429 EMBL:Y17428 EMBL:Y17421
EMBL:Y17422 EMBL:Y17423 EMBL:Y17424 EMBL:Y17425 EMBL:AC073149
EMBL:D17391 IPI:IPI00478572 PIR:A55360 RefSeq:NP_000083.3
UniGene:Hs.591645 ProteinModelPortal:P53420 SMR:P53420
IntAct:P53420 STRING:P53420 PhosphoSite:P53420 DMDM:259016360
PaxDb:P53420 PRIDE:P53420 Ensembl:ENST00000396625 GeneID:1286
KEGG:hsa:1286 UCSC:uc021vxr.1 CTD:1286 GeneCards:GC02M227867
H-InvDB:HIX0030014 HGNC:HGNC:2206 MIM:120131 neXtProt:NX_P53420
PharmGKB:PA26721 InParanoid:P53420 OMA:FRGDMGD ChiTaRS:COL4A4
GenomeRNAi:1286 NextBio:5201 Bgee:P53420 CleanEx:HS_COL4A4
Genevestigator:P53420 GermOnline:ENSG00000081052 Uniprot:P53420
Length = 1690
Score = 153 (58.9 bits), Expect = 3.7e-07, P = 3.7e-07
Identities = 81/253 (32%), Positives = 101/253 (39%)
Query: 151 YEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKG 210
Y +G P GPP G GA P S S + GTP A +IP PG+ G
Sbjct: 672 YPGRHGPPGFDGPPGPKGFPGPQGA-PGLSGSDGHKGRPGTPGTA--EIPGPPGFRGDMG 728
Query: 211 -PGYDASKAPSYDPTKGPSYDPA----KG-PGYDPTKGP-GYDAQKG-SNYDAQRGPNYD 262
PG+ K S GP P KG PG DP G G ++G S +GP D
Sbjct: 729 DPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPG-DPAFGHLGPPGKRGLSGVPGIKGPRGD 787
Query: 263 IHRG-PSYDPQRGL-GYDMQRGPN-YDMQRG-PGYETQRVPGYDVQRG-PVYEAQRA-PS 316
G P + G+ G+ +GP + G PG PG+ +RG P Q P
Sbjct: 788 --PGCPGAEGPAGIPGFLGLKGPKGREGHAGFPGVPGP--PGHSCERGAPGIPGQPGLPG 843
Query: 317 YIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVP--YGSA 374
Y P PG +GQ D+ P P+ G G P HG PP L +P +G
Sbjct: 844 Y-PG-SPGAPGGKGQPGDV--GPP-GPAGMKGLPGLPGRPGAHG--PPGLPGIPGPFGDD 896
Query: 375 TPPARSGSGQPRG 387
P G PRG
Sbjct: 897 GLPGPPGPKGPRG 909
Score = 130 (50.8 bits), Expect = 0.00012, P = 0.00012
Identities = 81/260 (31%), Positives = 104/260 (40%)
Query: 152 EDGY-GVPQGHGPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEAS 208
E G+ GVP GH P G+ G G S + + G P +D P GP G+
Sbjct: 640 ERGHPGVP-GH--PGVRGPDGLKGQKGDTISCNVTYPGRHGPP---GFDGPPGPKGFPGP 693
Query: 209 KG-PGYDASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGP----NY 261
+G PG S P T G + P PG+ G PG+ +KGS+ GP
Sbjct: 694 QGAPGLSGSDGHKGRPGTPGTAEIPGP-PGFRGDMGDPGFGGEKGSSPVGPPGPPGSPGV 752
Query: 262 DIHRGPSYDPQRG-LGYDMQRG----PNYDMQRG----PGYETQR-VPGYDVQRGPVYEA 311
+ +G DP G LG +RG P RG PG E +PG+ +GP
Sbjct: 753 NGQKGIPGDPAFGHLGPPGKRGLSGVPGIKGPRGDPGCPGAEGPAGIPGFLGLKGPKGRE 812
Query: 312 QRA--PSYIPQRGPGYDLQRGQ-GYDMRRA-PSYDPSRGTGFDGAPRGAAPHGQVPPPLN 367
A P +P PG+ +RG G + P Y P G GAP G G V PP
Sbjct: 813 GHAGFPG-VPGP-PGHSCERGAPGIPGQPGLPGY-P----GSPGAPGGKGQPGDVGPPGP 865
Query: 368 NVPYGSATPPARSGSGQPRG 387
G P R G+ P G
Sbjct: 866 AGMKGLPGLPGRPGAHGPPG 885
Score = 122 (48.0 bits), Expect = 0.00090, P = 0.00090
Identities = 74/257 (28%), Positives = 89/257 (34%)
Query: 145 PVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG-P 203
P G + G P GPP A G G P G P D PRG P
Sbjct: 1222 PPGPRGKKGPPGPPGSSGPPGPA---GATGRAPKDIPDPGPPGDQGPP---GPDGPRGAP 1275
Query: 204 GYEASKGPGYDASKAPSYD-PTKGPSYDPAK-GP-GYDPTKG-PGYDAQKGS-NYDAQRG 258
G G D + D GP P GP GY G G D QKG + +G
Sbjct: 1276 GPPGLPG-SVDLLRGEPGDCGLPGPPGPPGPPGPPGYKGFPGCDGKDGQKGPVGFPGPQG 1334
Query: 259 PNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRG-PGYETQRVPGYDVQRGP-VYEAQRAP 315
P H P ++GL G ++GP G PG + P DV P + AP
Sbjct: 1335 P----HGFPGPPGEKGLPGPPGRKGPT-----GLPGPRGEPGPPADVDDCPRIPGLPGAP 1385
Query: 316 SYI-PQRGPGYDLQRG-QGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGS 373
P+ G RG G + P D RG DG P P G+ + G
Sbjct: 1386 GMRGPEGAMGLPGMRGPSGPGCKGEPGLDGRRGV--DGVPGSPGPPGRKGDTGEDGYPGG 1443
Query: 374 ATPPARSGSGQPRGGNP 390
PP G P+G P
Sbjct: 1444 PGPPGPIGDPGPKGFGP 1460
>UNIPROTKB|D4ADB1 [details] [associations]
symbol:D4ADB1 "Uncharacterized protein" species:10116
"Rattus norvegicus" [GO:0008270 "zinc ion binding" evidence=IEA]
Pfam:PF00595 Pfam:PF00412 InterPro:IPR001478 InterPro:IPR001781
PROSITE:PS00478 PROSITE:PS50023 PROSITE:PS50106 SMART:SM00132
SMART:SM00228 GO:GO:0046872 GO:GO:0008270 Gene3D:2.10.110.10
SUPFAM:SSF50156 InterPro:IPR006643 SMART:SM00735 IPI:IPI00951885
PRIDE:D4ADB1 Ensembl:ENSRNOT00000043713 ArrayExpress:D4ADB1
Uniprot:D4ADB1
Length = 684
Score = 148 (57.2 bits), Expect = 4.3e-07, P = 4.3e-07
Identities = 50/182 (27%), Positives = 70/182 (38%)
Query: 141 TSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIP 200
TS P +Y +G P P P T + P+ A+ S +P A Y P
Sbjct: 331 TSPAPAAHTSYSEGPAAP---APKPRVVTTASIR--PSVYQPVPASPYSPSP-GANYS-P 383
Query: 201 RGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPN 260
P Y S P Y S AP+Y P+ P+Y P+ P Y P+ P Y+ + Y GP+
Sbjct: 384 T-P-YTPSPAPAYTPSPAPTYTPSPAPTYSPSPAPAYTPSPAPNYNPTPSAAYSG--GPS 439
Query: 261 YDIHRGP-----SYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVPGYDVQRGPVYEAQRA 314
R P S+ + G + RG P Y + RG A+R
Sbjct: 440 ESASRPPWVTDDSFSQKFAPGKSTTSVSKQTLPRGAPAYNPTGPQVTPLARGTFQRAERF 499
Query: 315 PS 316
P+
Sbjct: 500 PA 501
>UNIPROTKB|P02457 [details] [associations]
symbol:COL1A1 "Collagen alpha-1(I) chain" species:9031
"Gallus gallus" [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
[GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000885
InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410 ProDom:PD002078
PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038
SMART:SM00214 GO:GO:0046872 GO:GO:0005581 InterPro:IPR008160
Pfam:PF01391 GO:GO:0005201 HOVERGEN:HBG004933 EMBL:M17839
EMBL:M17838 EMBL:V00401 EMBL:M10571 EMBL:M17607 IPI:IPI00572548
PIR:A27179 PIR:A90458 PIR:I50629 PIR:S07234 UniGene:Gga.2073
UniGene:Gga.43371 IntAct:P02457 PRIDE:P02457 Uniprot:P02457
Length = 1453
Score = 149 (57.5 bits), Expect = 8.6e-07, P = 8.6e-07
Identities = 90/285 (31%), Positives = 109/285 (38%)
Query: 126 ADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNTSTS 182
ADG G G TG++ + P G A G P G G P G AGP +T
Sbjct: 808 ADGQPGAKGETGDAGAKGDAGPPGP-AGPTGAPGPAGZVGAPGPKGARG--SAGPPGATG 864
Query: 183 AYAATQSGTPMRAAYDI----PRGP-GYEASKGPGYDASKA--PSYDPTKGPSYDPA-KG 234
A P + +I P GP G + SKGP + A P GP P KG
Sbjct: 865 FPGAAGRVGPPGPSGNIGLPGPPGPAGKZGSKGPRGETGPAGRPGEPGPAGPPGPPGEKG 924
Query: 235 -PGYD-P-----TKGP-GYDAQKGS-NYDAQRGPN-YDIHRGPSYDP-QRG-LGYDMQRG 282
PG D P T GP G Q+G QRG + GPS +P ++G G +RG
Sbjct: 925 SPGADGPIGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQGPSGASGERG 984
Query: 283 PNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYD 342
P M GP PG GP EA R + + PG D G D
Sbjct: 985 PPGPM--GP-------PGL---AGPPGEAGREGAPGAEGAPGRDGAAGPKGDRGETGPAG 1032
Query: 343 PSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 387
P G GAP P G+ P G A PP +G+ P G
Sbjct: 1033 PPGAPGAPGAPGPVGPAGKNGDRGETGPAGPAGPPGPAGARGPAG 1077
>UNIPROTKB|G4N3H5 [details] [associations]
symbol:MGG_04961 "Uncharacterized protein" species:242507
"Magnaporthe oryzae 70-15" [GO:0003674 "molecular_function"
evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
[GO:0008150 "biological_process" evidence=ND] EMBL:CM001233
RefSeq:XP_003712457.1 EnsemblFungi:MGG_04961T0 GeneID:2675293
KEGG:mgr:MGG_04961 Uniprot:G4N3H5
Length = 616
Score = 144 (55.7 bits), Expect = 1.0e-06, P = 1.0e-06
Identities = 61/185 (32%), Positives = 80/185 (43%)
Query: 120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNT 179
P R G + ++ +SGR + G P G PP + TA + GP+
Sbjct: 445 PGYQRNQPGGPPSRFDSYDDYSSGRASPAPSMYPSRG-PGGPNMPPRSATAPIPPRGPD- 502
Query: 180 STSAYAATQSG--TPMRAAYDIPRGPGYEASKGPGYDASKAPS-YDPTKGPSYDPAKGPG 236
AY +G +P + Y PRGPG GP AS APS Y+P + P A GP
Sbjct: 503 ---AYDDYSNGRASPAPSMYP-PRGPG-----GPNGRASPAPSMYNPPRAPPQRSATGPM 553
Query: 237 YDPTKGPGYDAQKGSNYDAQRGPN--YDIHRGP----SYDPQRGLGYDMQRGPNYDM--Q 288
P +GPG+ Q+ A GP+ YD + P S P RG G N D+ Q
Sbjct: 554 --PPRGPGFPPQRNMTAPAP-GPDDPYDYNTRPPTSSSQAPPRGA---FGNGWNSDLENQ 607
Query: 289 RG-PG 292
RG PG
Sbjct: 608 RGGPG 612
Score = 128 (50.1 bits), Expect = 5.8e-05, P = 5.8e-05
Identities = 81/289 (28%), Positives = 97/289 (33%)
Query: 113 RAELMNA--PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTA 170
RA+ M P R G+ G NS ++ P Q Y Q P A
Sbjct: 332 RADTMTTLPPYASR--PGTPGSIELNSLDQKRPMPSRQGTMNSSYSSRQ-----PLVGAA 384
Query: 171 GVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYD 230
G + + S + SG + R +S Y AS AP T P+
Sbjct: 385 AEFGRSASPAPSIPSTNYSGRTYGGQPPMSRMQSNASSMSRAYTASPAPFSSDTV-PAL- 442
Query: 231 PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 290
P PGY + PG + +YD PS P RG G GPN +
Sbjct: 443 PR--PGYQRNQ-PGGPPSRFDSYDDYSSGRAS--PAPSMYPSRGPG-----GPNMPPRSA 492
Query: 291 PGYETQRVP-GYD-VQRGPVYEAQRAPSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGT 347
R P YD G A APS P RGPG R M P P R
Sbjct: 493 TAPIPPRGPDAYDDYSNG---RASPAPSMYPPRGPGGPNGRASPAPSMYNPPRAPPQRSA 549
Query: 348 GFDGAPRGAA--PHGQV--PPPLNNVPYGSAT-PPARSGSGQPRG--GN 389
PRG P + P P + PY T PP S PRG GN
Sbjct: 550 TGPMPPRGPGFPPQRNMTAPAPGPDDPYDYNTRPPTSSSQAPPRGAFGN 598
>WB|WBGene00004203 [details] [associations]
symbol:swsn-1 species:6239 "Caenorhabditis elegans"
[GO:0003682 "chromatin binding" evidence=IEA] [GO:0000003
"reproduction" evidence=IGI;IMP] [GO:0040035 "hermaphrodite
genitalia development" evidence=IMP] [GO:0040007 "growth"
evidence=IMP] [GO:0002119 "nematode larval development"
evidence=IGI;IMP] [GO:0009792 "embryo development ending in birth
or egg hatching" evidence=IGI;IMP] [GO:0040018 "positive regulation
of multicellular organism growth" evidence=IGI;IMP] [GO:0040010
"positive regulation of growth rate" evidence=IMP] [GO:0040027
"negative regulation of vulval development" evidence=IMP]
[GO:0046662 "regulation of oviposition" evidence=IMP] [GO:0002009
"morphogenesis of an epithelium" evidence=IMP] [GO:0035262 "gonad
morphogenesis" evidence=IMP] InterPro:IPR001005 InterPro:IPR007526
InterPro:IPR009057 Pfam:PF00249 Pfam:PF04433 PROSITE:PS50934
SMART:SM00717 GO:GO:0005634 GO:GO:0009792 GO:GO:0002009
GO:GO:0040007 GO:GO:0040010 GO:GO:0002119 GO:GO:0003677
GO:GO:0040018 Gene3D:1.10.10.10 InterPro:IPR011991 GO:GO:0003682
Gene3D:1.10.10.60 SUPFAM:SSF46689 GO:GO:0046662 GO:GO:0040035
InterPro:IPR017884 PROSITE:PS51293 GO:GO:0040027 GO:GO:0035262
EMBL:AL110477 KO:K11649 UniGene:Cel.7072 GeneID:180324
KEGG:cel:CELE_Y113G7B.23 CTD:180324 RefSeq:NP_001256907.1
ProteinModelPortal:H8ESF3 SMR:H8ESF3 WormBase:Y113G7B.23c
Uniprot:H8ESF3
Length = 792
Score = 145 (56.1 bits), Expect = 1.1e-06, P = 1.1e-06
Identities = 86/316 (27%), Positives = 123/316 (38%)
Query: 91 HLESL-QVMEKNYITMATEVEKLRAELMNAPNVDRRADGSYGGATGNSENETSGRPVGQN 149
H + L Q+M+K ++ + +L E A ++D+ Y +++E R
Sbjct: 493 HFDELEQIMDKERESLEYQRHQLILE-RQAFHMDQL---KY--LENRAKHEAHSRMTSSG 546
Query: 150 AYEDGYGVPQGH---GPP---PSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGP 203
A G +P G GPP P + A P ++ AAT + P + P+ P
Sbjct: 547 ALPAG--LPPGFEVTGPPQPTPQVQISAQEAAIPEKMDTSEAATAARPP--STPQAPQAP 602
Query: 204 GYEASKGP--GYDASKAP--SYDPTKGPSYDPAKGPGYDPTKGPGYDA----QKGSNYDA 255
+A+ P A +AP +Y GP P + Y P +G Y Q+ + A
Sbjct: 603 PVQAAPAPVQAPQAPQAPPQAYQGYGGPGGPPQQAYRYPPQQGQQYSPYPPPQQQQQHQA 662
Query: 256 QRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRG-PVYEAQRA 314
Q+ + H GP Q G Q Y PG GY Q+ P Y+AQ
Sbjct: 663 QQAQS-QAHYGPPGGGQ-GPPPPPQGQQYYGGPPPPGQPYGPPGGYPPQQQRPPYQAQPY 720
Query: 315 PSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSA 374
P P P QRG GY P P F G P P+GQ+PPP P+G
Sbjct: 721 PG--P---PPPQQQRGYGYP----PPPQPV----FSGHPY-QQPYGQMPPP----PHGQY 762
Query: 375 TPPARSGSGQ-PRGGN 389
P + G P GG+
Sbjct: 763 QPQQQQGGPMGPPGGH 778
>TAIR|locus:2012713 [details] [associations]
symbol:AT1G33680 "AT1G33680" species:3702 "Arabidopsis
thaliana" [GO:0003676 "nucleic acid binding" evidence=ISS]
[GO:0003723 "RNA binding" evidence=IEA] [GO:0005634 "nucleus"
evidence=ISM] [GO:0008150 "biological_process" evidence=ND]
[GO:0005829 "cytosol" evidence=IDA] InterPro:IPR004087
InterPro:IPR004088 Pfam:PF13014 PROSITE:PS50084 SMART:SM00322
EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0005829 GO:GO:0003723
eggNOG:NOG300923 KO:K13210 UniGene:At.39892 UniGene:At.71035
HOGENOM:HOG000242545 EMBL:AK229850 EMBL:AK229909 EMBL:AK230055
IPI:IPI00786006 RefSeq:NP_174629.3 ProteinModelPortal:Q0WLY0
SMR:Q0WLY0 STRING:Q0WLY0 PaxDb:Q0WLY0 PRIDE:Q0WLY0
EnsemblPlants:AT1G33680.1 GeneID:840259 KEGG:ath:AT1G33680
TAIR:At1g33680 InParanoid:Q0WLY0 OMA:PSYGSTP PhylomeDB:Q0WLY0
ProtClustDB:CLSN2690290 Genevestigator:Q0WLY0 Uniprot:Q0WLY0
Length = 763
Score = 144 (55.7 bits), Expect = 1.4e-06, P = 1.4e-06
Identities = 65/233 (27%), Positives = 82/233 (35%)
Query: 130 YGGATGNSENETSGRPVG-QNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQ 188
Y A G + + RP G Q + E GYG P+ PP G A P+ ++ AA+
Sbjct: 537 YPSAGGQHQMQQPSRPYGMQGSAEQGYGPPRPAAPPGDVPYQGPTPAAPSYGSTPAAASY 596
Query: 189 SGTPMRAAY-DIPRGPGYEASKGP----GYDASKAPSYDPTKGPSYDPAK-GPGYD---- 238
TP +Y P P Y ++ GY AS AP+ PSY A GY+
Sbjct: 597 GSTPAAPSYGSTPAAPSYGSNMAQQQQYGY-ASSAPTQQTY--PSYSSAAPSDGYNGTQP 653
Query: 239 PTKGPGYD---AQKGSNYDAQRG------PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQR 289
P P Y+ AQ S G P PS P G Q NY
Sbjct: 654 PAVAPAYEQHGAQPASGVQQTSGGYGQVPPTGGYSSYPSTQPAYG-NTPAQSNGNY---- 708
Query: 290 GPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGP---GYDLQRGQGYDMRRAP 339
GY + P Y Y A + Q P GY+ Q AP
Sbjct: 709 --GYIGSQYPSYGGGNASAYAAPTGQTAYSQTAPPQAGYEQSATQSAGYAAAP 759
>UNIPROTKB|Q96QC0 [details] [associations]
symbol:PPP1R10 "Serine/threonine-protein phosphatase 1
regulatory subunit 10" species:9606 "Homo sapiens" [GO:0006351
"transcription, DNA-dependent" evidence=IEA] [GO:0008270 "zinc ion
binding" evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA]
[GO:0003723 "RNA binding" evidence=IEA] [GO:0004864 "protein
phosphatase inhibitor activity" evidence=IEA] [GO:0005634 "nucleus"
evidence=IEA] [GO:0072357 "PTW/PP1 phosphatase complex"
evidence=ISS] [GO:0000785 "chromatin" evidence=ISS] [GO:0006606
"protein import into nucleus" evidence=TAS] InterPro:IPR000571
InterPro:IPR003617 InterPro:IPR017923 Pfam:PF00642 Pfam:PF08711
PROSITE:PS50103 PROSITE:PS51319 SMART:SM00356 SMART:SM00509
GO:GO:0005634 EMBL:BA000025 GO:GO:0046872 GO:GO:0003677
GO:GO:0008270 GO:GO:0000785 GO:GO:0006351 GO:GO:0003723
EMBL:AL662800 EMBL:AL662825 GO:GO:0000790 GO:GO:0006606
GO:GO:0004864 Gene3D:1.20.930.10 SUPFAM:SSF47676 GO:GO:0072357
EMBL:Y13247 EMBL:AJ544537 EMBL:AB088097 EMBL:BX248507
IPI:IPI00298731 PIR:JE0291 RefSeq:NP_002705.2 UniGene:Hs.106019
ProteinModelPortal:Q96QC0 SMR:Q96QC0 DIP:DIP-39343N IntAct:Q96QC0
MINT:MINT-1197376 STRING:Q96QC0 PhosphoSite:Q96QC0 DMDM:61214507
PaxDb:Q96QC0 PeptideAtlas:Q96QC0 PRIDE:Q96QC0
Ensembl:ENST00000376511 Ensembl:ENST00000383586
Ensembl:ENST00000420949 Ensembl:ENST00000424446
Ensembl:ENST00000426299 Ensembl:ENST00000429597
Ensembl:ENST00000449113 GeneID:5514 KEGG:hsa:5514 UCSC:uc003nqn.1
CTD:5514 GeneCards:GC06M030568 H-InvDB:HIX0165052
H-InvDB:HIX0166290 H-InvDB:HIX0166579 H-InvDB:HIX0166833
H-InvDB:HIX0167082 H-InvDB:HIX0167322 H-InvDB:HIX0167569
HGNC:HGNC:9284 HPA:CAB025501 MIM:603771 neXtProt:NX_Q96QC0
PharmGKB:PA33612 eggNOG:NOG69306 HOGENOM:HOG000049285
HOVERGEN:HBG053646 InParanoid:Q96QC0 OMA:PPPHEHR OrthoDB:EOG451DQK
PhylomeDB:Q96QC0 ChiTaRS:PPP1R10 GenomeRNAi:5514 NextBio:21326
ArrayExpress:Q96QC0 Bgee:Q96QC0 CleanEx:HS_PPP1R10
Genevestigator:Q96QC0 GermOnline:ENSG00000204569 Uniprot:Q96QC0
Length = 940
Score = 145 (56.1 bits), Expect = 1.4e-06, P = 1.4e-06
Identities = 63/248 (25%), Positives = 83/248 (33%)
Query: 128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQ-------GHGPPPSATTAGVVGAGPNTS 180
G GG G P G + + DG G P G GP P G G G N
Sbjct: 656 GGPGGPVGPRLLGPPPPPRGGDPFWDGPGDPMRGGPMRGGPGPGPGPYHRGRGGRGGNEP 715
Query: 181 TSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPT 240
+ R+ P G G GPG + P +GP G+ P
Sbjct: 716 PPPPPPFRGARGGRSGGGPPNGRG-----GPGGGMVGGGGHRPHEGPGGGMGNSSGHRPH 770
Query: 241 KGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPG 300
+GPG GS + GP + G + P G G + G + GPG G
Sbjct: 771 EGPG--GGMGSGHRPHEGPGGSMGGGGGHRPHEGPGGGISGGSGHRPHEGPGGGMGAGGG 828
Query: 301 YDVQRGPVYEAQRAPSYIPQRGPGY---------DLQRGQGYDMRRAPSYDPSRGTGFDG 351
+ GP + + P GPG+ D+ +G+D R P P G DG
Sbjct: 829 HRPHEGPGGSMGGSGGHRPHEGPGHGGPHGHRPHDVPGHRGHDHRGPP---PHEHRGHDG 885
Query: 352 APRGAAPH 359
G H
Sbjct: 886 PGHGGGGH 893
Score = 144 (55.7 bits), Expect = 1.8e-06, P = 1.8e-06
Identities = 71/268 (26%), Positives = 90/268 (33%)
Query: 143 GRPVGQNAYEDGYG--VPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIP 200
G P G + G G +P HG P ++G P G PMR +
Sbjct: 635 GGPKGMQHFPPGPGGPMPGPHGGPGGPVGPRLLGPPPPPRGGDPFWDGPGDPMRGG-PMR 693
Query: 201 RGPGYEASKGPG-YD---ASKAPSYDPTKGPSYDPAKG--PGYDPTKG---PGYDAQKGS 251
GPG GPG Y + + P P + A+G G P G PG G
Sbjct: 694 GGPG----PGPGPYHRGRGGRGGNEPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGG 749
Query: 252 NYDAQRGP-----NYDIHRGPSYDPQRGLG--YDMQRGPNYDMQRGPGYETQRVPGYDVQ 304
+ GP N HR P P G+G + GP M G G+ PG +
Sbjct: 750 GHRPHEGPGGGMGNSSGHR-PHEGPGGGMGSGHRPHEGPGGSMGGGGGHRPHEGPGGGIS 808
Query: 305 RGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPP 364
G + P G G+ G G M + + P G G G P G PH VP
Sbjct: 809 GGSGHRPHEGPGGGMGAGGGHRPHEGPGGSMGGSGGHRPHEGPGH-GGPHGHRPH-DVPG 866
Query: 365 PLNNVPYGSATPPARSGSGQPRGGNPAR 392
+ G R G GG R
Sbjct: 867 HRGHDHRGPPPHEHRGHDGPGHGGGGHR 894
Score = 130 (50.8 bits), Expect = 6.1e-05, P = 6.1e-05
Identities = 53/213 (24%), Positives = 72/213 (33%)
Query: 132 GATGNSENETSGRPV-GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSG 190
G G +E P G G G P G G P G G P+ SG
Sbjct: 708 GGRGGNEPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGG-GHRPHEGPGGGMGNSSG 766
Query: 191 TPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKG 250
G G+ +GPG + P +GP + G G+ P +GPG G
Sbjct: 767 HRPHEGPGGGMGSGHRPHEGPGGSMGGGGGHRPHEGPGGGISGGSGHRPHEGPGGGMGAG 826
Query: 251 SNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPN-YDMQRGPGYETQRVPGYDVQRGPVY 309
+ GP + + P G G+ G +D+ PG+ G+D RGP
Sbjct: 827 GGHRPHEGPGGSMGGSGGHRPHEGPGHGGPHGHRPHDV---PGHR-----GHD-HRGPPP 877
Query: 310 EAQRA---PSYIPQRGPGYDLQRGQGYDMRRAP 339
R P + G+D G DM P
Sbjct: 878 HEHRGHDGPGHGGGGHRGHDGGHSHGGDMSNRP 910
>UNIPROTKB|G1RSL2 [details] [associations]
symbol:COL4A4 "Uncharacterized protein" species:61853
"Nomascus leucogenys" [GO:0005201 "extracellular matrix structural
constituent" evidence=ISS] [GO:0005587 "collagen type IV"
evidence=ISS] [GO:0005605 "basal lamina" evidence=ISS] [GO:0032836
"glomerular basement membrane development" evidence=ISS]
InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005605 InterPro:IPR008160
Pfam:PF01391 GO:GO:0005201 GO:GO:0005587 Gene3D:2.170.240.10
GeneTree:ENSGT00690000101772 GO:GO:0032836 OMA:FRGDMGD
EMBL:ADFV01083072 EMBL:ADFV01083073 EMBL:ADFV01083074
EMBL:ADFV01083075 EMBL:ADFV01083076 EMBL:ADFV01083077
EMBL:ADFV01083078 Ensembl:ENSNLET00000017067 Uniprot:G1RSL2
Length = 1690
Score = 147 (56.8 bits), Expect = 1.7e-06, P = 1.7e-06
Identities = 79/253 (31%), Positives = 99/253 (39%)
Query: 151 YEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKG 210
Y G P G P G GA P S S + GTP +IP PG+ G
Sbjct: 671 YPGRQGPPGFDGLPGPKGFPGPQGA-PGLSGSDGHKGRPGTP--GTSEIPGPPGFRGDMG 727
Query: 211 -PGYDASKAPSYDPTKGPSYDPA----KG-PGYDPTKGP-GYDAQKG-SNYDAQRGPNYD 262
PG+ + S GP P KG PG DP GP G ++G S +GP D
Sbjct: 728 DPGFGGERGSSPVGPPGPPGSPGVNGQKGIPG-DPAFGPLGPPGKRGLSGVPGIKGPRGD 786
Query: 263 IHRG-PSYDPQRGL-GYDMQRGPN-YDMQRG-PGYETQRVPGYDVQRG-PVYEAQRA-PS 316
G P + G+ G+ +GP + G PG PG+ +RG P Q P
Sbjct: 787 --PGCPGAEGPAGIPGFPGLKGPKGREGHAGFPGVPGP--PGHSCERGAPGIPGQPGLPG 842
Query: 317 YIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVP--YGSA 374
Y P PG +GQ D+ P P+ G G P HG PP L +P +G
Sbjct: 843 Y-PG-SPGAPGGKGQPGDV--GPP-GPAGMKGLPGLPGRPGAHG--PPGLPGIPGPFGDD 895
Query: 375 TPPARSGSGQPRG 387
P G PRG
Sbjct: 896 GLPGPPGPKGPRG 908
Score = 123 (48.4 bits), Expect = 0.00070, P = 0.00070
Identities = 76/253 (30%), Positives = 97/253 (38%)
Query: 159 QGH-GPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKG-PGYD 214
+GH G P G G G T + T G +D GP G+ +G PG
Sbjct: 640 RGHPGVPGRPGVRGPDGLKGQKGDTISCNVTYPGRQGPPGFDGLPGPKGFPGPQGAPGLS 699
Query: 215 ASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGP----NYDIHRGPS 268
S P T G S P PG+ G PG+ ++GS+ GP + +G
Sbjct: 700 GSDGHKGRPGTPGTSEIPGP-PGFRGDMGDPGFGGERGSSPVGPPGPPGSPGVNGQKGIP 758
Query: 269 YDPQRG-LGYDMQRG----PNYDMQRG----PGYETQR-VPGYDVQRGPVYEAQRA--PS 316
DP G LG +RG P RG PG E +PG+ +GP A P
Sbjct: 759 GDPAFGPLGPPGKRGLSGVPGIKGPRGDPGCPGAEGPAGIPGFPGLKGPKGREGHAGFPG 818
Query: 317 YIPQRGPGYDLQRGQ-GYDMRRA-PSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSA 374
+P PG+ +RG G + P Y P G GAP G G V PP G
Sbjct: 819 -VPGP-PGHSCERGAPGIPGQPGLPGY-P----GSPGAPGGKGQPGDVGPPGPAGMKGLP 871
Query: 375 TPPARSGSGQPRG 387
P R G+ P G
Sbjct: 872 GLPGRPGAHGPPG 884
>FB|FBgn0261885 [details] [associations]
symbol:osa "osa" species:7227 "Drosophila melanogaster"
[GO:0046530 "photoreceptor cell differentiation" evidence=IMP]
[GO:0005634 "nucleus" evidence=NAS;IDA] [GO:0016055 "Wnt receptor
signaling pathway" evidence=IMP] [GO:0008587 "imaginal disc-derived
wing margin morphogenesis" evidence=IMP] [GO:0007379 "segment
specification" evidence=IMP] [GO:0003677 "DNA binding"
evidence=ISS;IDA;NAS] [GO:0006355 "regulation of transcription,
DNA-dependent" evidence=IDA;IMP] [GO:0045893 "positive regulation
of transcription, DNA-dependent" evidence=IDA] [GO:0035060 "brahma
complex" evidence=IDA;TAS] [GO:0003713 "transcription coactivator
activity" evidence=IC] [GO:0007476 "imaginal disc-derived wing
morphogenesis" evidence=IMP] [GO:0048190 "wing disc dorsal/ventral
pattern formation" evidence=IGI] [GO:0042058 "regulation of
epidermal growth factor receptor signaling pathway" evidence=IMP]
[GO:0007480 "imaginal disc-derived leg morphogenesis" evidence=IMP]
[GO:0008586 "imaginal disc-derived wing vein morphogenesis"
evidence=IMP] [GO:0022008 "neurogenesis" evidence=IMP]
InterPro:IPR001606 Pfam:PF01388 PROSITE:PS51011 SMART:SM00501
EMBL:AE014297 GO:GO:0048190 GO:GO:0045893 GO:GO:0016055
GO:GO:0003677 GO:GO:0008586 GO:GO:0006351 GO:GO:0016568
eggNOG:NOG12793 GO:GO:0007379 GO:GO:0007480 KO:K11653
Gene3D:1.10.150.60 InterPro:IPR021906 Pfam:PF12031 SUPFAM:SSF46774
GeneTree:ENSGT00550000074575 GO:GO:0046530 GO:GO:0008587
GO:GO:0035060 GO:GO:0042058 EMBL:AF053091 PIR:T13049
RefSeq:NP_001163639.1 RefSeq:NP_524392.2 RefSeq:NP_732263.1
UniGene:Dm.2989 ProteinModelPortal:Q8IN94 SMR:Q8IN94 DIP:DIP-20699N
IntAct:Q8IN94 MINT:MINT-297379 STRING:Q8IN94 PaxDb:Q8IN94
PRIDE:Q8IN94 EnsemblMetazoa:FBtr0089581 EnsemblMetazoa:FBtr0301487
GeneID:42130 KEGG:dme:Dmel_CG7467 CTD:42130 FlyBase:FBgn0261885
InParanoid:Q8IN94 OMA:SQMGQGP OrthoDB:EOG4MCVF9 PhylomeDB:Q8IN94
ChiTaRS:osa GenomeRNAi:42130 NextBio:827314 Bgee:Q8IN94
GermOnline:CG7467 Uniprot:Q8IN94
Length = 2716
Score = 148 (57.2 bits), Expect = 2.2e-06, P = 2.2e-06
Identities = 86/333 (25%), Positives = 126/333 (37%)
Query: 75 HLCRGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRRADGSYGGAT 134
H + +E F ++ L ++++ + ++ + +A + +P D +T
Sbjct: 1078 HYTKNLLTFECHFDRGDIDPLPIIQQ--VEAGSKKKTAKAASVPSPG-GGHLDAGTTNST 1134
Query: 135 GNSENETS-GRPVGQ--NAYEDGY-GVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSG 190
G+S ++ S P G NA DGY G P G P P A+ G P+ +T A
Sbjct: 1135 GSSNSQDSFPAPPGSAPNAAIDGYPGYPGG-SPYPVAS-----GPQPDYAT---AGQMQR 1185
Query: 191 TPMRAAYDIPRGPGYEASKGPGYDASKA-PSYDPTK---GPSYDPAKGPGYDPTKGPGYD 246
P + P PG A+ G + S + P DP GP GPG P GPG
Sbjct: 1186 PPSQNNPQTPH-PGAAAAVAAGDNISVSNPFEDPIAAGGGPGSGTGPGPGQGP--GPGA- 1241
Query: 247 AQKGSNYDAQRGPNYDIHRGPSYDP----QRGLGYDMQRGPNYDMQRGPGYET-QRVPGY 301
A G+ G H P + P Q+ G Q+ P + PG Q+ G
Sbjct: 1242 ASGGAGAVGAVGGGPQPHPPPPHSPHTAAQQAAGQHQQQHPQHQHPGLPGPPPPQQQQGQ 1301
Query: 302 DVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQ 361
Q+ P P Q GPG Q + A + P G+G+ P P
Sbjct: 1302 QGQQPPPSVGGGPPPAPQQHGPGQVPPSPQQHVRPAAGAPYPPGGSGYP-TPVSRTPGSP 1360
Query: 362 VPP-PLNNVPYGSATPPARSGS-GQPRGGNPAR 392
P P YGS+ +G GQP G P +
Sbjct: 1361 YPSQPGAYGQYGSSDQYNATGPPGQPFGQGPGQ 1393
Score = 132 (51.5 bits), Expect = 0.00012, P = 0.00012
Identities = 79/277 (28%), Positives = 101/277 (36%)
Query: 131 GGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGP----NTSTSAYAA 186
GGA G + S P G+ + +D Y P P P + + P N Y A
Sbjct: 1449 GGAPGAPPS--SAYPTGRPSQQDYYQPPPDQSPQPRRHPDFIKDSQPYPGYNARPQIYGA 1506
Query: 187 TQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYD 246
QSGT Y P Y +S P + AP P +G + P P P + P
Sbjct: 1507 WQSGTQQ---YR----PQYPSSPAP-QNWGGAP---P-RGAAPPPG-APHGPPIQQPAGV 1553
Query: 247 AQKGSN-YDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP-GYETQRVPGYDVQ 304
AQ + Y Q+GP P Q+ Q+ P Y GP G + + P Q
Sbjct: 1554 AQWDQHRYPPQQGP-------PPPPQQQQQPQQQQQQPPYQQVAGPPGQQPPQAPPQWAQ 1606
Query: 305 RGPVYEAQR--APSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQV 362
P AQ AP P R P Q+ + M +G G P A HG V
Sbjct: 1607 MNPGQTAQSGIAPPGSPLRPPSGPGQQNRMPGMPAQQQQSQQQG-GVPQPPPQQASHGGV 1665
Query: 363 PPP-LNNV--------PYGSATPPARSGSGQPRGGNP 390
P P L V PY PP++ G GQ G P
Sbjct: 1666 PSPGLPQVGPGGMVKPPYAMPPPPSQ-GVGQQVGQGP 1701
>UNIPROTKB|Q5TM61 [details] [associations]
symbol:PPP1R10 "Serine/threonine-protein phosphatase 1
regulatory subunit 10" species:9544 "Macaca mulatta" [GO:0000785
"chromatin" evidence=ISS] [GO:0072357 "PTW/PP1 phosphatase complex"
evidence=ISS] InterPro:IPR000571 InterPro:IPR003617
InterPro:IPR017923 Pfam:PF00642 Pfam:PF08711 PROSITE:PS50103
PROSITE:PS51319 SMART:SM00356 SMART:SM00509 GO:GO:0005634
GO:GO:0046872 GO:GO:0003677 GO:GO:0008270 GO:GO:0000785
GO:GO:0006351 GO:GO:0003723 EMBL:AB128049 GO:GO:0004864
Gene3D:1.20.930.10 SUPFAM:SSF47676 GO:GO:0072357 CTD:5514
eggNOG:NOG69306 HOVERGEN:HBG053646 RefSeq:NP_001108416.1
UniGene:Mmu.17467 ProteinModelPortal:Q5TM61 GeneID:711949
KEGG:mcc:711949 NextBio:19975847 Uniprot:Q5TM61
Length = 940
Score = 143 (55.4 bits), Expect = 2.3e-06, P = 2.3e-06
Identities = 73/271 (26%), Positives = 93/271 (34%)
Query: 143 GRPVGQNAYEDGYG--VPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIP 200
G P G + G G +P HG P ++G P G PMR +
Sbjct: 635 GGPKGMQHFPPGPGGPMPGPHGGPGGPVGPRLLGPPPPPRGGDPFWDGPGDPMRGG-PMR 693
Query: 201 RGPG-----YEASKGPGYDASKAPSYDPTKGPSYDPAKG--PGYDPTKG---PGYDAQKG 250
GPG Y +G G ++ P P P + A+G G P G PG G
Sbjct: 694 GGPGPGPGPYHRGRG-GRGGNEPP---PPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGG 749
Query: 251 SNYDAQRGP-----NYDIHRGPSYDPQRGLG--YDMQRGPNYDMQRGPGYETQRVPGYDV 303
+ GP N HR P P G+G + GP M G G+ PG +
Sbjct: 750 GGHRPHEGPGGGMGNSSGHR-PHEGPGSGMGSGHRPHEGPGSSMGGGGGHRPHEGPGGGI 808
Query: 304 QRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVP 363
G + P G G+ G G M + + P G G G P G PH VP
Sbjct: 809 SGGSGHRPHEGPGGGMGAGGGHRPHEGPGGSMGGSGGHRPHEGPGH-GGPHGHRPH-DVP 866
Query: 364 PPLNNVPYGSATPPA--RSGSGQPRGGNPAR 392
+ G PP R G GG R
Sbjct: 867 GHRGHDHRG---PPHEHRGHDGPGHGGGGHR 894
Score = 142 (55.0 bits), Expect = 3.0e-06, P = 3.0e-06
Identities = 54/213 (25%), Positives = 73/213 (34%)
Query: 131 GGATGNSENETSGRPV-GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQS 189
GG GN P G G G P G G P G G P+ S
Sbjct: 708 GGRGGNEPPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGG-GHRPHEGPGGGMGNSS 766
Query: 190 GTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQK 249
G G G+ +GPG + P +GP + G G+ P +GPG
Sbjct: 767 GHRPHEGPGSGMGSGHRPHEGPGSSMGGGGGHRPHEGPGGGISGGSGHRPHEGPGGGMGA 826
Query: 250 GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPN-YDMQRGPGYETQRVPGYDVQRGPV 308
G + GP + + P G G+ G +D+ PG+ G+D RGP
Sbjct: 827 GGGHRPHEGPGGSMGGSGGHRPHEGPGHGGPHGHRPHDV---PGHR-----GHD-HRGPP 877
Query: 309 YE--AQRAPSYIPQRGPGYDLQRGQGYDMRRAP 339
+E P + G+D G DM P
Sbjct: 878 HEHRGHDGPGHGGGGHRGHDGGHSHGGDMSNRP 910
Score = 140 (54.3 bits), Expect = 4.9e-06, P = 4.9e-06
Identities = 62/245 (25%), Positives = 83/245 (33%)
Query: 128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQ-------GHGPPPSATTAGVVGAGPNTS 180
G GG G P G + + DG G P G GP P G G G N
Sbjct: 656 GGPGGPVGPRLLGPPPPPRGGDPFWDGPGDPMRGGPMRGGPGPGPGPYHRGRGGRGGNEP 715
Query: 181 TSAYAATQSGTPMRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDP 239
P R A G G +G PG + P +GP G+ P
Sbjct: 716 PPP-----PPPPFRGARGGRSGGGPPNGRGGPGGGMVGGGGHRPHEGPGGGMGNSSGHRP 770
Query: 240 TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVP 299
+GPG + GS + GP + G + P G G + G + GPG
Sbjct: 771 HEGPG--SGMGSGHRPHEGPGSSMGGGGGHRPHEGPGGGISGGSGHRPHEGPGGGMGAGG 828
Query: 300 GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG-QGYDMRRAPSYD----PSRGTGFDGAPR 354
G+ GP + + P GPG+ G + +D+ +D P G DG
Sbjct: 829 GHRPHEGPGGSMGGSGGHRPHEGPGHGGPHGHRPHDVPGHRGHDHRGPPHEHRGHDGPGH 888
Query: 355 GAAPH 359
G H
Sbjct: 889 GGGGH 893
>UNIPROTKB|Q7YR38 [details] [associations]
symbol:PPP1R10 "Serine/threonine-protein phosphatase 1
regulatory subunit 10" species:9598 "Pan troglodytes" [GO:0000785
"chromatin" evidence=ISS] [GO:0072357 "PTW/PP1 phosphatase complex"
evidence=ISS] InterPro:IPR000571 InterPro:IPR003617
InterPro:IPR017923 Pfam:PF00642 Pfam:PF08711 PROSITE:PS50103
PROSITE:PS51319 SMART:SM00356 SMART:SM00509 GO:GO:0005634
GO:GO:0046872 GO:GO:0003677 GO:GO:0008270 GO:GO:0000785
GO:GO:0006351 GO:GO:0003723 EMBL:BA000041 GO:GO:0004864
Gene3D:1.20.930.10 SUPFAM:SSF47676 GO:GO:0072357 CTD:5514
eggNOG:NOG69306 HOGENOM:HOG000049285 HOVERGEN:HBG053646 OMA:PPPHEHR
GeneTree:ENSGT00530000063820 EMBL:AB210175 EMBL:AB210176
RefSeq:NP_001038965.1 UniGene:Ptr.6270 ProteinModelPortal:Q7YR38
Ensembl:ENSPTRT00000033108 GeneID:462544 KEGG:ptr:462544
NextBio:20841794 Uniprot:Q7YR38
Length = 940
Score = 143 (55.4 bits), Expect = 2.3e-06, P = 2.3e-06
Identities = 63/248 (25%), Positives = 83/248 (33%)
Query: 128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQ-------GHGPPPSATTAGVVGAGPNTS 180
G GG G P G + + DG G P G GP P G G G N
Sbjct: 656 GGPGGPVGPRLLGPPPPPRGGDPFWDGPGDPMRGGPMRGGPGPGPGPYHRGRGGRGGNEP 715
Query: 181 TSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPT 240
+ R+ P G G GPG + P +GP G+ P
Sbjct: 716 PPPPPPFRGARGGRSGGGPPNGRG-----GPGGGMVGGGGHRPHEGPGGGMGNNSGHRPH 770
Query: 241 KGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPG 300
+GPG GS + GP + G + P G G + G + GPG G
Sbjct: 771 EGPG--GGMGSGHRPHEGPAGSMGGGGGHRPHEGPGGGISGGSGHRPHEGPGGGMGAGGG 828
Query: 301 YDVQRGPVYEAQRAPSYIPQRGPGY---------DLQRGQGYDMRRAPSYDPSRGTGFDG 351
+ GP + + P GPG+ D+ +G+D R P P G DG
Sbjct: 829 HRPHEGPGGSMGGSGGHRPHEGPGHGGPHGHRPHDVPGHRGHDHRGPP---PHEHRGHDG 885
Query: 352 APRGAAPH 359
G H
Sbjct: 886 PGHGGGGH 893
Score = 142 (55.0 bits), Expect = 3.0e-06, P = 3.0e-06
Identities = 71/268 (26%), Positives = 90/268 (33%)
Query: 143 GRPVGQNAYEDGYG--VPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIP 200
G P G + G G +P HG P ++G P G PMR +
Sbjct: 635 GGPKGMQHFPPGPGGPMPGPHGGPGGPVGPRLLGPPPPPRGGDPFWDGPGDPMRGG-PMR 693
Query: 201 RGPGYEASKGPG-YD---ASKAPSYDPTKGPSYDPAKG--PGYDPTKG---PGYDAQKGS 251
GPG GPG Y + + P P + A+G G P G PG G
Sbjct: 694 GGPG----PGPGPYHRGRGGRGGNEPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGG 749
Query: 252 NYDAQRGP-----NYDIHRGPSYDPQRGLG--YDMQRGPNYDMQRGPGYETQRVPGYDVQ 304
+ GP N HR P P G+G + GP M G G+ PG +
Sbjct: 750 GHRPHEGPGGGMGNNSGHR-PHEGPGGGMGSGHRPHEGPAGSMGGGGGHRPHEGPGGGIS 808
Query: 305 RGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPP 364
G + P G G+ G G M + + P G G G P G PH VP
Sbjct: 809 GGSGHRPHEGPGGGMGAGGGHRPHEGPGGSMGGSGGHRPHEGPGH-GGPHGHRPH-DVPG 866
Query: 365 PLNNVPYGSATPPARSGSGQPRGGNPAR 392
+ G R G GG R
Sbjct: 867 HRGHDHRGPPPHEHRGHDGPGHGGGGHR 894
Score = 132 (51.5 bits), Expect = 3.7e-05, P = 3.7e-05
Identities = 54/214 (25%), Positives = 72/214 (33%)
Query: 147 GQNAYEDGYGVPQGHGPPPS-----ATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 201
G Y G G G+ PPP G G GP G ++ P
Sbjct: 699 GPGPYHRGRGGRGGNEPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGGGHRPHEGPG 758
Query: 202 G-----PGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQ 256
G G+ +GPG + P +GP+ G G+ P +GPG GS +
Sbjct: 759 GGMGNNSGHRPHEGPG--GGMGSGHRPHEGPAGSMGGGGGHRPHEGPGGGISGGSGHRPH 816
Query: 257 RGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY------ETQRVPGY--DVQRGPV 308
GP + G + P G G M + GPG+ VPG+ RGP
Sbjct: 817 EGPGGGMGAGGGHRPHEGPGGSMGGSGGHRPHEGPGHGGPHGHRPHDVPGHRGHDHRGPP 876
Query: 309 YEAQRA---PSYIPQRGPGYDLQRGQGYDMRRAP 339
R P + G+D G DM P
Sbjct: 877 PHEHRGHDGPGHGGGGHRGHDGGHSHGGDMSNRP 910
>UNIPROTKB|C9JGE3 [details] [associations]
symbol:EWSR1 "Ewing sarcoma breakpoint region 1, isoform
CRA_e" species:9606 "Homo sapiens" [GO:0000166 "nucleotide binding"
evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
[GO:0005622 "intracellular" evidence=IEA] [GO:0008270 "zinc ion
binding" evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
InterPro:IPR012677 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102
PROSITE:PS50199 SMART:SM00360 SMART:SM00547 GO:GO:0000166
EMBL:CH471095 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
GO:GO:0005622 EMBL:AC002059 EMBL:AL031186 EMBL:AC000026
UniGene:Hs.374477 HGNC:HGNC:3508 HOGENOM:HOG000038010 ChiTaRS:EWSR1
IPI:IPI00953325 SMR:C9JGE3 STRING:C9JGE3 Ensembl:ENST00000332050
UCSC:uc003aez.3 Uniprot:C9JGE3
Length = 583
Score = 127 (49.8 bits), Expect = 2.6e-06, Sum P(2) = 2.6e-06
Identities = 68/254 (26%), Positives = 95/254 (37%)
Query: 128 GSYGGATGNSENETSGRPV-GQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 180
G+YG T S + GQ AY YG P G+ P P A + V G G +T+
Sbjct: 42 GTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTT 101
Query: 181 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPS--YDPTKGPSYDPAKGPGY 237
T+ TQ+ ++AY P P Y + P A P PT+ + G GY
Sbjct: 102 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPTRPQDGNKPTETSQPQSSTG-GY 158
Query: 238 D-PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE 294
+ P+ G G Q +Y G P + PSY P Y + +YD Y
Sbjct: 159 NQPSLGYG---QSNYSYPQVPGSYPMQPVTAPPSYPPT---SYSSTQPTSYDQS---SYS 209
Query: 295 TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGF-DGAP 353
Q G G + SY Q Y Q G Y +APS + + + P
Sbjct: 210 QQNTYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTGS-YS--QAPSQYSQQSSSYGQQRP 266
Query: 354 RGAAPHGQVPPPLN 367
P + PP++
Sbjct: 267 MDEGPDLDLGPPVD 280
Score = 57 (25.1 bits), Expect = 2.6e-06, Sum P(2) = 2.6e-06
Identities = 19/46 (41%), Positives = 21/46 (45%)
Query: 354 RGAAP--HGQ-VPPPLNNVPYGSATP--PA--RSGSGQPRGGNPAR 392
RG P G+ +PPPL P G P P G G RGG P R
Sbjct: 382 RGGLPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPR 427
>UNIPROTKB|P12105 [details] [associations]
symbol:COL3A1 "Collagen alpha-1(III) chain" species:9031
"Gallus gallus" [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
[GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000885
InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410 ProDom:PD002078
PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038
SMART:SM00214 GO:GO:0046872 GO:GO:0005581 InterPro:IPR008160
Pfam:PF01391 eggNOG:NOG12793 GO:GO:0005201 HOVERGEN:HBG004933
EMBL:U07973 EMBL:X00822 EMBL:X00823 EMBL:X00826 EMBL:X00825
EMBL:X00827 EMBL:X00828 EMBL:X00830 EMBL:X00831 EMBL:K02302
EMBL:K02301 EMBL:V00391 EMBL:V00392 EMBL:M36662 IPI:IPI00590578
PIR:A05269 PIR:I50694 UniGene:Gga.42140 ProteinModelPortal:P12105
STRING:P12105 Uniprot:P12105
Length = 1262
Score = 144 (55.7 bits), Expect = 2.6e-06, P = 2.6e-06
Identities = 84/280 (30%), Positives = 109/280 (38%)
Query: 132 GATGN-SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP-NTSTSAYAATQ 188
GA G +N G P G+ G+P +G P AG G+ GP S A Q
Sbjct: 467 GANGEPGQNGVPGTP-GERGSPGFRGLPGSNGLPGEKGPAGERGSPGPPGPSGPAGDRGQ 525
Query: 189 SGTP----MRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP- 243
G P MR IP PG + GP + + P GP+ P PG GP
Sbjct: 526 DGGPGLPGMRGLPGIPGSPGSDGKPGPPGNQGE-PGRSGPPGPA-GPRGQPGVMGFPGPK 583
Query: 244 GYDAQKGSNYDAQRGPNYDIHR-GPS-YDPQRGL-GYDMQRGPNYDM-QRGPGYET--QR 297
G + G N +RGP GP+ + GL G GP D + GP Q
Sbjct: 584 GNEGAPGKN--GERGPGGPPGTPGPAGKNGDVGLPGPPGPAGPAGDRGEPGPSGSPGLQG 641
Query: 298 VPGYDVQRGPVYEAQRAPSYIPQR---GPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAP 353
+PG GP E + P+ GPG+ +G+ G R P P TG G P
Sbjct: 642 LPGGP---GPAGENGKPGEPGPKGDIGGPGFPGPKGENGIPGERGPQGPPGP-TGARGGP 697
Query: 354 RGAAPHG-QVPPPLNNVPYGSATPPARSGSGQPRG--GNP 390
A G + PP P G+ P + G+ RG G+P
Sbjct: 698 GPAGSEGAKGPPGPPGAPGGTGLPGLQGMPGE-RGASGSP 736
Score = 128 (50.1 bits), Expect = 0.00014, P = 0.00014
Identities = 87/281 (30%), Positives = 107/281 (38%)
Query: 131 GGATGNSENETSGRPVGQNAY-EDGY-GVPQGHGPPPSATTAGVVGAGPNTSTSAYA--- 185
GG TG E G P G A+ +DG G GPP TAG G+ P A
Sbjct: 301 GGPTG--ERGRPGNPGGPGAHGKDGAPGTAGPLGPPGPPGTAGFPGS-PGFKGEAGPPGP 357
Query: 186 ATQSGTPMRAAYDIPRGP-GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG-P 243
A SG P P+G G +GP A +P GPS P GPG +G P
Sbjct: 358 AGASGNPGERGEPGPQGQAGPPGPQGPPGRAG-SPGGKGEMGPSGIPG-GPGPPGGRGLP 415
Query: 244 GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDM-QRGPGYETQRVPGYD 302
G G N A+ P G DP G +RG N RGP PG +
Sbjct: 416 GPPGTSG-NPGAKGTPGEPGKNGAKGDP----GPKGERGENGTPGARGP-------PGEE 463
Query: 303 VQRGPVYEAQR--APSYIPQRG-PGY-DLQRGQGYDMRRAPSYDPSRGTGFDGAPRG-AA 357
+RG E + P +RG PG+ L G + P+ + RG+ P G A
Sbjct: 464 GKRGANGEPGQNGVPGTPGERGSPGFRGLPGSNGLPGEKGPAGE--RGSPGPPGPSGPAG 521
Query: 358 PHGQV--P--PPLNNVPYGSATPPARSGSGQPRG--GNPAR 392
GQ P P + +P G P G P G G P R
Sbjct: 522 DRGQDGGPGLPGMRGLP-GIPGSPGSDGKPGPPGNQGEPGR 561
Score = 127 (49.8 bits), Expect = 0.00018, P = 0.00018
Identities = 78/276 (28%), Positives = 97/276 (35%)
Query: 132 GATGN-SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSG 190
G G +N G P G G P GPP G G P + + G
Sbjct: 428 GTPGEPGKNGAKGDP-GPKGERGENGTPGARGPPGEEGKRGANGE-PGQNGVPGTPGERG 485
Query: 191 TPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAK--GPGYDPTKG-PGYD 246
+P +P G KGP G S P P+ GP+ D + GPG +G PG
Sbjct: 486 SP--GFRGLPGSNGLPGEKGPAGERGSPGPP-GPS-GPAGDRGQDGGPGLPGMRGLPGIP 541
Query: 247 AQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYD--MQRGPGYE-TQRVPGYDV 303
GS D + GP G +P R G GP + PG + + PG +
Sbjct: 542 GSPGS--DGKPGPP-----GNQGEPGRS-GPPGPAGPRGQPGVMGFPGPKGNEGAPGKNG 593
Query: 304 QRGPVYEAQRAPSYIPQRG----PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPH 359
+RGP P + G PG G D R P PS G G P G P
Sbjct: 594 ERGPG-GPPGTPGPAGKNGDVGLPGPPGPAGPAGD-RGEPG--PSGSPGLQGLPGGPGPA 649
Query: 360 GQVPPPLNNVPYGSATPPARSGSGQPRGGN--PARR 393
G+ P P G P G P+G N P R
Sbjct: 650 GENGKPGEPGPKGDIGGPGFPG---PKGENGIPGER 682
Score = 125 (49.1 bits), Expect = 0.00031, P = 0.00031
Identities = 74/259 (28%), Positives = 95/259 (36%)
Query: 145 PVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRG- 202
P G N Y+ G P GP AG++G AGP + G P R + RG
Sbjct: 192 PPGSNGYQGPPGEPGQPGPSGPPGPAGMIGPAGP--------PGKDGEPGRPGRNGDRGI 243
Query: 203 PGYEASKG-PGYDASKAPSYDPTKG-PSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRG 258
PG KG PG P +G D AKG P GP G Q G+N Q G
Sbjct: 244 PGLPGHKGHPGMPGM--PGMKGARGFDGKDGAKGDSGAP--GPKGEAGQPGANGSPGQPG 299
Query: 259 PNYDI-HRGPSYDPQRGLGYDMQRGPNYDMQRGP-GYE-TQRVPGYDVQRGPVYEAQRAP 315
P RG +P + P GP G T PG P ++ + P
Sbjct: 300 PGGPTGERGRPGNPGGPGAHGKDGAPGTAGPLGPPGPPGTAGFPG-----SPGFKGEAGP 354
Query: 316 SYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSAT 375
P G +RG+ +A P G G+P G G++ P + +P G
Sbjct: 355 PG-PAGASGNPGERGEPGPQGQAGPPGPQGPPGRAGSPGGK---GEMGP--SGIPGGPGP 408
Query: 376 PPARSGSGQP-RGGNPARR 393
P R G P GNP +
Sbjct: 409 PGGRGLPGPPGTSGNPGAK 427
>TAIR|locus:2012788 [details] [associations]
symbol:AT1G10390 "AT1G10390" species:3702 "Arabidopsis
thaliana" [GO:0005215 "transporter activity" evidence=ISS]
[GO:0005634 "nucleus" evidence=ISM] [GO:0005643 "nuclear pore"
evidence=IEA] [GO:0006810 "transport" evidence=IEA] [GO:0005515
"protein binding" evidence=IPI] [GO:0005635 "nuclear envelope"
evidence=IDA] InterPro:IPR007230 Pfam:PF04096 PROSITE:PS51434
EMBL:CP002684 GenomeReviews:CT485782_GR GO:GO:0005635 GO:GO:0006810
GO:GO:0005643 eggNOG:NOG12793 SUPFAM:SSF82215 KO:K14297 HSSP:Q9Y6J4
EMBL:AY078948 EMBL:BT003030 EMBL:AK226964 IPI:IPI00523265
RefSeq:NP_001031018.1 RefSeq:NP_172510.2 UniGene:At.27877
ProteinModelPortal:Q8RY25 SMR:Q8RY25 STRING:Q8RY25 MEROPS:S59.A02
PaxDb:Q8RY25 PRIDE:Q8RY25 EnsemblPlants:AT1G10390.1
EnsemblPlants:AT1G10390.2 GeneID:837579 KEGG:ath:AT1G10390
TAIR:At1g10390 HOGENOM:HOG000085153 InParanoid:Q8RY25 OMA:ESISAMP
PhylomeDB:Q8RY25 ProtClustDB:CLSN2713828 Genevestigator:Q8RY25
Uniprot:Q8RY25
Length = 1041
Score = 143 (55.4 bits), Expect = 2.6e-06, P = 2.6e-06
Identities = 52/263 (19%), Positives = 89/263 (33%)
Query: 132 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP---SATTAGVVGAGPNTSTSAYAATQ 188
GA+ + S G + +G G G P S + G S A+ T
Sbjct: 80 GASSSPAFGNSTPAFGASPASSPFGGSSGFGQKPLGFSTPQSNPFGNSTQQSQPAFGNTS 139
Query: 189 SG--TPMRA----AYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG 242
G TP A A+ P P + A+ P + AS P++ T P++ + P + T
Sbjct: 140 FGSSTPFGATNTPAFGAPSTPSFGATSTPSFGASSTPAFGATNTPAFGASNSPSFGATNT 199
Query: 243 PGYDAQKGSNYDAQRGP--NYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPG 300
P + A + + N G ++ + P + P + P
Sbjct: 200 PAFGASPTPAFGSTGTTFGNTGFGSGGAFGASNTPAFGASGTPAFGASGTPAFGASSTPA 259
Query: 301 YDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHG 360
+ P + A P++ P + + +P++ S + F G++ G
Sbjct: 260 FGASSTPAFGASSTPAFGGSSTPSFGASNTSSFSFGSSPAFGQST-SAF-----GSSAFG 313
Query: 361 QVPPPLNNVPYGSATPPARSGSG 383
P P A+ P GSG
Sbjct: 314 STPSPFGGA---QASTPTFGGSG 333
>MGI|MGI:1344412 [details] [associations]
symbol:Ldb3 "LIM domain binding 3" species:10090 "Mus
musculus" [GO:0005080 "protein kinase C binding" evidence=IDA]
[GO:0005515 "protein binding" evidence=IPI] [GO:0005737 "cytoplasm"
evidence=IEA] [GO:0005856 "cytoskeleton" evidence=ISO] [GO:0008092
"cytoskeletal protein binding" evidence=ISO] [GO:0008270 "zinc ion
binding" evidence=IEA] [GO:0030018 "Z disc" evidence=ISO;IDA]
[GO:0042995 "cell projection" evidence=IEA] [GO:0045214 "sarcomere
organization" evidence=IMP] [GO:0046872 "metal ion binding"
evidence=IEA] [GO:0051371 "muscle alpha-actinin binding"
evidence=IDA;IPI] Pfam:PF00595 Pfam:PF00412 InterPro:IPR001478
InterPro:IPR001781 PROSITE:PS00478 PROSITE:PS50023 PROSITE:PS50106
SMART:SM00132 SMART:SM00228 MGI:MGI:1344412 GO:GO:0048471
GO:GO:0005080 GO:GO:0030018 GO:GO:0005856 GO:GO:0046872
GO:GO:0008270 GO:GO:0031143 Gene3D:2.10.110.10 SUPFAM:SSF50156
CTD:11155 eggNOG:NOG286537 HOVERGEN:HBG051478 OMA:CTSQATT
OrthoDB:EOG4GTKDQ InterPro:IPR006643 SMART:SM00735 EMBL:AF114378
EMBL:AF114379 EMBL:AJ005621 EMBL:AF228057 EMBL:AF228058
EMBL:AY206011 EMBL:AY206012 EMBL:AY206013 EMBL:AY206015
EMBL:AK172980 EMBL:AK004020 EMBL:AK137181 EMBL:AK142292
EMBL:BC099596 EMBL:BC138793 EMBL:BC145420 IPI:IPI00123369
IPI:IPI00323030 IPI:IPI00403041 IPI:IPI00621572 IPI:IPI00625287
IPI:IPI00656173 RefSeq:NP_001034160.1 RefSeq:NP_001034161.1
RefSeq:NP_001034162.1 RefSeq:NP_001034163.1 RefSeq:NP_001034164.1
RefSeq:NP_001034165.1 RefSeq:NP_036048.3 UniGene:Mm.29733 PDB:1WJL
PDBsum:1WJL ProteinModelPortal:Q9JKS4 SMR:Q9JKS4 IntAct:Q9JKS4
MINT:MINT-97840 STRING:Q9JKS4 PhosphoSite:Q9JKS4 PaxDb:Q9JKS4
PRIDE:Q9JKS4 Ensembl:ENSMUST00000022327 Ensembl:ENSMUST00000022328
Ensembl:ENSMUST00000022330 Ensembl:ENSMUST00000090040 GeneID:24131
KEGG:mmu:24131 UCSC:uc007taz.1 UCSC:uc007tba.1 UCSC:uc007tbc.1
UCSC:uc007tbd.1 UCSC:uc007tbe.1 UCSC:uc007tbf.1
GeneTree:ENSGT00700000104411 InParanoid:B2RSB0
EvolutionaryTrace:Q9JKS4 NextBio:304169 Bgee:Q9JKS4 CleanEx:MM_LDB3
Genevestigator:Q9JKS4 GermOnline:ENSMUSG00000021798 Uniprot:Q9JKS4
Length = 723
Score = 141 (54.7 bits), Expect = 2.7e-06, P = 2.7e-06
Identities = 49/181 (27%), Positives = 69/181 (38%)
Query: 142 SGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 201
S P +Y +G P P P T + P+ A++ S +P A Y P
Sbjct: 371 SPAPSAHTSYSEGPAAP---APKPRVVTTASIR--PSVYQPVPASSYSPSP-GANYS-PT 423
Query: 202 GPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNY 261
P Y S P Y S AP+Y P+ P+Y P+ P Y P+ P Y + Y GP+
Sbjct: 424 -P-YTPSPAPAYTPSPAPTYTPSPAPTYSPSPAPAYTPSPAPNYTPTPSAAYSG--GPSE 479
Query: 262 DIHRGP-----SYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVPGYDVQRGPVYEAQRAP 315
R P S+ + G + RG P Y + RG A+R P
Sbjct: 480 SASRPPWVTDDSFSQKFAPGKSTTTVSKQTLPRGAPAYNPTGPQVTPLARGTFQRAERFP 539
Query: 316 S 316
+
Sbjct: 540 A 540
Score = 135 (52.6 bits), Expect = 1.2e-05, P = 1.2e-05
Identities = 55/192 (28%), Positives = 70/192 (36%)
Query: 155 YGVPQGHGPPPSATTAGVVGAG-----PNTSTSAYAATQSGTPMRAAYDIPRGPGYEASK 209
Y P PSA T+ G P T+A P+ A+ P PG S
Sbjct: 364 YSPAAAASPAPSAHTSYSEGPAAPAPKPRVVTTASIRPSVYQPVPASSYSP-SPGANYSP 422
Query: 210 GPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSY 269
P Y S AP+Y P+ P+Y P+ P Y P+ P Y NY Y GPS
Sbjct: 423 TP-YTPSPAPAYTPSPAPTYTPSPAPTYSPSPAPAYTPSPAPNYTPTPSAAYS--GGPSE 479
Query: 270 DPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYD-LQ 328
R ++ + PG T V + RG AP+Y P GP L
Sbjct: 480 SASRP---PWVTDDSFSQKFAPGKSTTTVSKQTLPRG-------APAYNPT-GPQVTPLA 528
Query: 329 RGQGYDMRRAPS 340
RG R P+
Sbjct: 529 RGTFQRAERFPA 540
Score = 132 (51.5 bits), Expect = 2.6e-05, P = 2.6e-05
Identities = 56/213 (26%), Positives = 74/213 (34%)
Query: 166 SATTAGVVGA---GPNTSTSAYAATQSGTPMRAAY-DIPRGPGYEASKGPGY--DASKAP 219
+A+ AG + P SAY+ + +P +A+ GP A K P AS P
Sbjct: 343 AASAAGPAASPVENPRPQASAYSPAAAASPAPSAHTSYSEGPAAPAPK-PRVVTTASIRP 401
Query: 220 S-YDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYD 278
S Y P SY P+ G Y PT P Y Y P Y P+Y P Y
Sbjct: 402 SVYQPVPASSYSPSPGANYSPT--P-YTPSPAPAYTPSPAPTYTPSPAPTYSPSPAPAYT 458
Query: 279 MQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR- 337
PNY Y P R P S+ + PG + R
Sbjct: 459 PSPAPNYTPTPSAAYSGG--PSESASRPPWVTDD---SFSQKFAPGKSTTTVSKQTLPRG 513
Query: 338 APSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVP 370
AP+Y+P+ G RG + P + P
Sbjct: 514 APAYNPT-GPQVTPLARGTFQRAERFPASSRTP 545
>UNIPROTKB|O75112 [details] [associations]
symbol:LDB3 "LIM domain-binding protein 3" species:9606
"Homo sapiens" [GO:0008270 "zinc ion binding" evidence=IEA]
[GO:0005080 "protein kinase C binding" evidence=IEA] [GO:0031143
"pseudopodium" evidence=IEA] [GO:0048471 "perinuclear region of
cytoplasm" evidence=IEA] [GO:0005515 "protein binding"
evidence=IPI] [GO:0005856 "cytoskeleton" evidence=IDA] [GO:0008092
"cytoskeletal protein binding" evidence=IPI] [GO:0030018 "Z disc"
evidence=IDA] Pfam:PF00595 Pfam:PF00412 InterPro:IPR001478
InterPro:IPR001781 PROSITE:PS00478 PROSITE:PS50023 PROSITE:PS50106
SMART:SM00132 SMART:SM00228 GO:GO:0048471 GO:GO:0030018
GO:GO:0005856 GO:GO:0046872 GO:GO:0008270 Orphanet:154
GO:GO:0031143 Gene3D:2.10.110.10 Orphanet:54260 SUPFAM:SSF50156
EMBL:AJ133766 EMBL:AJ133767 EMBL:AJ133768 EMBL:AF276807
EMBL:AF276808 EMBL:AF276809 EMBL:AB014513 EMBL:AK304760
EMBL:EF179181 EMBL:AC067750 EMBL:BC010929 IPI:IPI00165263
IPI:IPI00294958 IPI:IPI00294959 IPI:IPI00514458 IPI:IPI00552865
IPI:IPI00654766 IPI:IPI00909817 RefSeq:NP_001073583.1
RefSeq:NP_001073584.1 RefSeq:NP_001073585.1 RefSeq:NP_001165081.1
RefSeq:NP_001165082.1 RefSeq:NP_009009.1 UniGene:Hs.657271 PDB:1RGW
PDBsum:1RGW ProteinModelPortal:O75112 SMR:O75112 IntAct:O75112
STRING:O75112 PhosphoSite:O75112 UCD-2DPAGE:O75112
UCD-2DPAGE:Q9Y4Z5 PaxDb:O75112 PRIDE:O75112 DNASU:11155
Ensembl:ENST00000263066 Ensembl:ENST00000310944
Ensembl:ENST00000352360 Ensembl:ENST00000361373
Ensembl:ENST00000372056 Ensembl:ENST00000372066
Ensembl:ENST00000429277 Ensembl:ENST00000458213
Ensembl:ENST00000542786 GeneID:11155 KEGG:hsa:11155 UCSC:uc001kdr.3
UCSC:uc001kds.3 UCSC:uc001kdu.3 UCSC:uc001kdv.3 UCSC:uc009xsy.3
UCSC:uc009xsz.3 CTD:11155 GeneCards:GC10P088426 HGNC:HGNC:15710
HPA:HPA048955 MIM:601493 MIM:605906 MIM:609452 neXtProt:NX_O75112
Orphanet:247 Orphanet:609 Orphanet:98912 PharmGKB:PA30318
eggNOG:NOG286537 HOGENOM:HOG000220936 HOVERGEN:HBG051478
InParanoid:O75112 OMA:CTSQATT OrthoDB:EOG4GTKDQ ChiTaRS:LDB3
EvolutionaryTrace:O75112 GenomeRNAi:11155 NextBio:42413
ArrayExpress:O75112 Bgee:O75112 Genevestigator:O75112
GermOnline:ENSG00000122367 InterPro:IPR006643 SMART:SM00735
Uniprot:O75112
Length = 727
Score = 141 (54.7 bits), Expect = 2.7e-06, P = 2.7e-06
Identities = 53/183 (28%), Positives = 72/183 (39%)
Query: 142 SGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 201
S P +Y +G P P P T + P+ A+T S +P A Y P
Sbjct: 375 SSAPATHTSYSEGPAAP---APKPRVVTTASIR--PSVYQPVPASTYSPSP-GANYS-PT 427
Query: 202 GPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNY 261
P Y S P Y S AP+Y P+ P+Y P+ P Y P+ P Y+ Y GP
Sbjct: 428 -P-YTPSPAPAYTPSPAPAYTPSPVPTYTPSPAPAYTPSPAPNYNPAPSVAYSG--GPAE 483
Query: 262 DIHRGP-----SYDPQRGLGYDMQRGPNYDMQRG-PGYETQ--RVPGYDVQRGPVYEAQR 313
R P S+ + G + RG P Y +VP + RG V A+R
Sbjct: 484 PASRPPWVTDDSFSQKFAPGKSTTSISKQTLPRGGPAYTPAGPQVP--PLARGTVQRAER 541
Query: 314 APS 316
P+
Sbjct: 542 FPA 544
>UNIPROTKB|G7N928 [details] [associations]
symbol:EGK_04858 "Putative uncharacterized protein"
species:9544 "Macaca mulatta" [GO:0005201 "extracellular matrix
structural constituent" evidence=ISS] [GO:0005587 "collagen type
IV" evidence=ISS] [GO:0005605 "basal lamina" evidence=ISS]
[GO:0032836 "glomerular basement membrane development"
evidence=ISS] InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403
SMART:SM00111 InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005605
InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 GO:GO:0005587
Gene3D:2.170.240.10 GO:GO:0032836 EMBL:CM001264 Uniprot:G7N928
Length = 1692
Score = 145 (56.1 bits), Expect = 2.8e-06, P = 2.8e-06
Identities = 81/261 (31%), Positives = 100/261 (38%)
Query: 143 GRPVGQNA-YEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 201
G V N Y G P GPP G GA P S S + GTP +IP
Sbjct: 663 GDTVSCNVTYPGRQGPPGFDGPPGPKGFPGPQGA-PGLSGSDGHKGRPGTP--GTSEIPG 719
Query: 202 GPGYEASKG-PGYDASKAPSYDPTKGPSYDPA----KG-PGYDPTKGP-GYDAQKG-SNY 253
PG+ G PG+ K S GP P KG PG DP G G ++G S
Sbjct: 720 PPGFRGDIGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPG-DPAFGHLGPPGKRGLSGV 778
Query: 254 DAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRG-PGYETQRVPGYDVQRG-PVY 309
+GP D P + G+ G+ +GP + G PG PG+ +RG P
Sbjct: 779 PGIKGPRGDPGY-PGAEGPAGIPGFPGLKGPKGREGHAGFPGVPGP--PGHSCERGAPGI 835
Query: 310 EAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNN 368
Q P G PG +GQ D+ P P+ G G P HG PP L
Sbjct: 836 PGQ--PGLPGDPGSPGAPGGKGQPGDV--GPP-GPAGMKGLPGLPGRPGAHG--PPGLPG 888
Query: 369 VP--YGSATPPARSGSGQPRG 387
+P +G P G PRG
Sbjct: 889 IPGPFGDDGLPGPPGPKGPRG 909
Score = 141 (54.7 bits), Expect = 7.6e-06, P = 7.6e-06
Identities = 77/252 (30%), Positives = 97/252 (38%)
Query: 159 QGH-GPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKG-PGYD 214
+GH G P G G G T + T G +D P GP G+ +G PG
Sbjct: 641 RGHPGVPGRPGVRGPDGLKGQKGDTVSCNVTYPGRQGPPGFDGPPGPKGFPGPQGAPGLS 700
Query: 215 ASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGP----NYDIHRGPS 268
S P T G S P PG+ G PG+ +KGS+ GP + +G
Sbjct: 701 GSDGHKGRPGTPGTSEIPGP-PGFRGDIGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIP 759
Query: 269 YDPQRG-LGYDMQRG----PNYDMQRG-PGYETQR----VPGYDVQRGPVYEAQRA--PS 316
DP G LG +RG P RG PGY +PG+ +GP A P
Sbjct: 760 GDPAFGHLGPPGKRGLSGVPGIKGPRGDPGYPGAEGPAGIPGFPGLKGPKGREGHAGFPG 819
Query: 317 YIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSAT 375
+P PG+ +RG G + DP G GAP G G V PP G
Sbjct: 820 -VPGP-PGHSCERGAPGIPGQPGLPGDP----GSPGAPGGKGQPGDVGPPGPAGMKGLPG 873
Query: 376 PPARSGSGQPRG 387
P R G+ P G
Sbjct: 874 LPGRPGAHGPPG 885
Score = 124 (48.7 bits), Expect = 0.00055, P = 0.00055
Identities = 81/259 (31%), Positives = 100/259 (38%)
Query: 145 PVGQNAYEDGY-GVP--QGH-GPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDI 199
PVG G+ G P +GH G P G G G T + T G +D
Sbjct: 626 PVGPPGL--GFPGPPGERGHPGVPGRPGVRGPDGLKGQKGDTVSCNVTYPGRQGPPGFDG 683
Query: 200 PRGP-GYEASKG-PGYDASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDA 255
P GP G+ +G PG S P T G S P PG+ G PG+ +KGS+
Sbjct: 684 PPGPKGFPGPQGAPGLSGSDGHKGRPGTPGTSEIPGP-PGFRGDIGDPGFGGEKGSSPVG 742
Query: 256 QRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE-TQRVPGYDVQRG-PVYEAQR 313
GP P + Q+G+ D P + PG VPG RG P Y
Sbjct: 743 PPGPP----GSPGVNGQKGIPGD----PAFGHLGPPGKRGLSGVPGIKGPRGDPGYPGAE 794
Query: 314 APSYIPQRGPGYDLQRG-QGYDMRRAPSYDPSRGTGFD-GAPRGAAPHGQVPPPLNNVPY 371
P+ IP PG +G +G+ P G + GAP P GQ P L P
Sbjct: 795 GPAGIPGF-PGLKGPKGREGH--AGFPGVPGPPGHSCERGAP--GIP-GQ--PGLPGDP- 845
Query: 372 GSATPPARSGSGQPRGGNP 390
GS P A G GQP P
Sbjct: 846 GS--PGAPGGKGQPGDVGP 862
>UNIPROTKB|G7PK77 [details] [associations]
symbol:EGM_04376 "Putative uncharacterized protein"
species:9541 "Macaca fascicularis" [GO:0005201 "extracellular
matrix structural constituent" evidence=ISS] [GO:0005587 "collagen
type IV" evidence=ISS] [GO:0005605 "basal lamina" evidence=ISS]
[GO:0032836 "glomerular basement membrane development"
evidence=ISS] InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403
SMART:SM00111 InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005605
InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 GO:GO:0005587
Gene3D:2.170.240.10 GO:GO:0032836 EMBL:CM001287 Uniprot:G7PK77
Length = 1695
Score = 145 (56.1 bits), Expect = 2.8e-06, P = 2.8e-06
Identities = 81/261 (31%), Positives = 100/261 (38%)
Query: 143 GRPVGQNA-YEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 201
G V N Y G P GPP G GA P S S + GTP +IP
Sbjct: 663 GDTVSCNVTYPGRQGPPGFDGPPGPKGFPGPQGA-PGLSGSDGHKGRPGTP--GTSEIPG 719
Query: 202 GPGYEASKG-PGYDASKAPSYDPTKGPSYDPA----KG-PGYDPTKGP-GYDAQKG-SNY 253
PG+ G PG+ K S GP P KG PG DP G G ++G S
Sbjct: 720 PPGFRGDIGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIPG-DPAFGHLGPPGKRGLSGV 778
Query: 254 DAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRG-PGYETQRVPGYDVQRG-PVY 309
+GP D P + G+ G+ +GP + G PG PG+ +RG P
Sbjct: 779 PGIKGPRGDPGY-PGAEGPAGIPGFPGLKGPKGREGHAGFPGVPGP--PGHSCERGAPGI 835
Query: 310 EAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNN 368
Q P G PG +GQ D+ P P+ G G P HG PP L
Sbjct: 836 PGQ--PGLPGDPGSPGAPGGKGQPGDV--GPP-GPAGMKGLPGLPGRPGAHG--PPGLPG 888
Query: 369 VP--YGSATPPARSGSGQPRG 387
+P +G P G PRG
Sbjct: 889 IPGPFGDDGLPGPPGPKGPRG 909
Score = 141 (54.7 bits), Expect = 7.6e-06, P = 7.6e-06
Identities = 77/252 (30%), Positives = 97/252 (38%)
Query: 159 QGH-GPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKG-PGYD 214
+GH G P G G G T + T G +D P GP G+ +G PG
Sbjct: 641 RGHPGVPGRPGVRGPDGLKGQKGDTVSCNVTYPGRQGPPGFDGPPGPKGFPGPQGAPGLS 700
Query: 215 ASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGP----NYDIHRGPS 268
S P T G S P PG+ G PG+ +KGS+ GP + +G
Sbjct: 701 GSDGHKGRPGTPGTSEIPGP-PGFRGDIGDPGFGGEKGSSPVGPPGPPGSPGVNGQKGIP 759
Query: 269 YDPQRG-LGYDMQRG----PNYDMQRG-PGYETQR----VPGYDVQRGPVYEAQRA--PS 316
DP G LG +RG P RG PGY +PG+ +GP A P
Sbjct: 760 GDPAFGHLGPPGKRGLSGVPGIKGPRGDPGYPGAEGPAGIPGFPGLKGPKGREGHAGFPG 819
Query: 317 YIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSAT 375
+P PG+ +RG G + DP G GAP G G V PP G
Sbjct: 820 -VPGP-PGHSCERGAPGIPGQPGLPGDP----GSPGAPGGKGQPGDVGPPGPAGMKGLPG 873
Query: 376 PPARSGSGQPRG 387
P R G+ P G
Sbjct: 874 LPGRPGAHGPPG 885
Score = 124 (48.7 bits), Expect = 0.00055, P = 0.00055
Identities = 81/259 (31%), Positives = 100/259 (38%)
Query: 145 PVGQNAYEDGY-GVP--QGH-GPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDI 199
PVG G+ G P +GH G P G G G T + T G +D
Sbjct: 626 PVGPPGL--GFPGPPGERGHPGVPGRPGVRGPDGLKGQKGDTVSCNVTYPGRQGPPGFDG 683
Query: 200 PRGP-GYEASKG-PGYDASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDA 255
P GP G+ +G PG S P T G S P PG+ G PG+ +KGS+
Sbjct: 684 PPGPKGFPGPQGAPGLSGSDGHKGRPGTPGTSEIPGP-PGFRGDIGDPGFGGEKGSSPVG 742
Query: 256 QRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE-TQRVPGYDVQRG-PVYEAQR 313
GP P + Q+G+ D P + PG VPG RG P Y
Sbjct: 743 PPGPP----GSPGVNGQKGIPGD----PAFGHLGPPGKRGLSGVPGIKGPRGDPGYPGAE 794
Query: 314 APSYIPQRGPGYDLQRG-QGYDMRRAPSYDPSRGTGFD-GAPRGAAPHGQVPPPLNNVPY 371
P+ IP PG +G +G+ P G + GAP P GQ P L P
Sbjct: 795 GPAGIPGF-PGLKGPKGREGH--AGFPGVPGPPGHSCERGAP--GIP-GQ--PGLPGDP- 845
Query: 372 GSATPPARSGSGQPRGGNP 390
GS P A G GQP P
Sbjct: 846 GS--PGAPGGKGQPGDVGP 862
>TAIR|locus:2043530 [details] [associations]
symbol:AT2G25970 "AT2G25970" species:3702 "Arabidopsis
thaliana" [GO:0003676 "nucleic acid binding" evidence=ISS]
[GO:0003723 "RNA binding" evidence=IEA] [GO:0005829 "cytosol"
evidence=IDA] [GO:0006606 "protein import into nucleus"
evidence=RCA] InterPro:IPR004087 InterPro:IPR004088 Pfam:PF00013
PROSITE:PS50084 SMART:SM00322 GO:GO:0005829 EMBL:CP002685
GenomeReviews:CT485783_GR GO:GO:0003723 EMBL:AC004747 EMBL:AC005395
eggNOG:NOG300923 KO:K13210 HSSP:Q9UNW9 EMBL:AY078954 EMBL:AK226845
IPI:IPI00540360 PIR:T02627 RefSeq:NP_180167.1 UniGene:At.21555
ProteinModelPortal:O82762 SMR:O82762 STRING:O82762 PaxDb:O82762
PRIDE:O82762 ProMEX:O82762 EnsemblPlants:AT2G25970.1 GeneID:817137
KEGG:ath:AT2G25970 TAIR:At2g25970 HOGENOM:HOG000242545
InParanoid:O82762 OMA:AANSTQD PhylomeDB:O82762
ProtClustDB:CLSN2913011 ArrayExpress:O82762 Genevestigator:O82762
Uniprot:O82762
Length = 632
Score = 140 (54.3 bits), Expect = 2.9e-06, P = 2.9e-06
Identities = 76/283 (26%), Positives = 100/283 (35%)
Query: 120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYE---DGYGVPQGHGPPPSATTAGVVGAG 176
P + GSY T + S P Q + + D YG Q P ++A
Sbjct: 355 PQYGQSPYGSYPQQTSAGYYDQSSVPPSQQSAQGEYDYYGQQQSQQPSSGGSSA------ 408
Query: 177 PNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG 236
P T T+ Y Q + A G GY+ Y+AS+ Y G YD +G G
Sbjct: 409 PPTDTTGYNYYQHASGYGQA-----GQGYQQDGYGAYNASQQSGYGQAAG--YDQ-QG-G 459
Query: 237 YDPTKGPGYD---AQKGSNYDAQRGP-NYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPG 292
Y T P + +Q AQ G Y G Q G Q G G
Sbjct: 460 YGSTTNPSQEEDASQAAPPSSAQSGQAGYGT-TGQQPPAQGSTG---QAGYGAPPTSQAG 515
Query: 293 YETQRVPGYDVQRGPVYEAQRAPSY-IPQRGPGYDLQRGQ--GYDMRRAPSYDPSRGTGF 349
Y +Q Y+ G A + P+Y Q+ PG G GY A Y G+
Sbjct: 516 YSSQPAAAYNSGYGAPPPASKPPTYGQSQQSPGAPGSYGSQSGYAQPAASGYGQPPAYGY 575
Query: 350 DGAPRGAAPHGQVPPPLNNVPYGS-ATPPARSGSGQPRGGNPA 391
AP+G +G P Y S + A +G G GG PA
Sbjct: 576 GQAPQGYGSYGGYTQPAAGGGYSSDGSAGATAGGG---GGTPA 615
Score = 123 (48.4 bits), Expect = 0.00021, P = 0.00021
Identities = 69/265 (26%), Positives = 89/265 (33%)
Query: 136 NSENETSGRPVGQN-AYEDGYGV-PQGHGPPPSATTAGVVGAGPNTSTSAYAAT-QSGTP 192
+ EN +G + GY P PP A G G AY Q G
Sbjct: 302 SGENRMRNSAMGGGYPQQGGYQARPPSSWAPPGGPPAQP-GYGGYMQPGAYPGPPQYGQS 360
Query: 193 MRAAYDIPRGPGY-EASKGPGYDASKAPSYDPT-KGPSYDPAKG-PGYDPTKGPGYDA-Q 248
+Y GY + S P S YD + S P+ G PT GY+ Q
Sbjct: 361 PYGSYPQQTSAGYYDQSSVPPSQQSAQGEYDYYGQQQSQQPSSGGSSAPPTDTTGYNYYQ 420
Query: 249 KGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPV 308
S Y Q G Y +Y+ + GY G YD Q G G T P +
Sbjct: 421 HASGY-GQAGQGYQQDGYGAYNASQQSGYGQAAG--YDQQGGYGSTTN--PSQEEDA--- 472
Query: 309 YEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNN 368
+Q AP Q G Q G G ++ P+ + G+ P A + P N
Sbjct: 473 --SQAAPPSSAQSG-----QAGYGTTGQQPPAQGSTGQAGYGAPPTSQAGYSSQPAAAYN 525
Query: 369 VPYGSATP---PARSGSGQPRGGNP 390
YG+ P P G Q G P
Sbjct: 526 SGYGAPPPASKPPTYGQSQQSPGAP 550
Score = 107 (42.7 bits), Expect = 0.00012, Sum P(2) = 0.00012
Identities = 57/201 (28%), Positives = 76/201 (37%)
Query: 202 GPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNY 261
G GY +G GY A S+ P GP PA+ PGY GY Q G+ Y P Y
Sbjct: 313 GGGYP-QQG-GYQARPPSSWAPPGGP---PAQ-PGYG-----GY-MQPGA-YPGP--PQY 357
Query: 262 DIHRGPSYDPQRGLGYDMQRG--PNYDMQRGP----GYETQRVPGYDVQRGPVYEAQRAP 315
SY Q GY Q P+ +G G + + P P +
Sbjct: 358 GQSPYGSYPQQTSAGYYDQSSVPPSQQSAQGEYDYYGQQQSQQPSSGGSSAPPTDTTGYN 417
Query: 316 SYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG---AAPHGQVPPPLNNVPYG 372
Y Q GY Q GQGY +Y+ S+ +G+ G G +G P
Sbjct: 418 YY--QHASGYG-QAGQGYQQDGYGAYNASQQSGY-GQAAGYDQQGGYGSTTNPSQEEDAS 473
Query: 373 SATPPARSGSGQPRGGNPARR 393
A PP+ + SGQ G ++
Sbjct: 474 QAAPPSSAQSGQAGYGTTGQQ 494
Score = 63 (27.2 bits), Expect = 0.00012, Sum P(2) = 0.00012
Identities = 26/107 (24%), Positives = 43/107 (40%)
Query: 108 EVEKLRAELMNA-----PNVDRRADGSYGGATGNSENETSGRPVG---QNAYEDGYGVPQ 159
+ +++ A L+N+ P VD A YG G S + G+ + ++ YG Q
Sbjct: 73 KAQEIAARLLNSADAKRPRVDNGASYDYGDNKGFSSYPSEGKQMSGTVPSSIPVSYGSFQ 132
Query: 160 GHGPP---PSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGP 203
G P+ ++G G T Y QSG ++ D+ P
Sbjct: 133 GTTKKIDIPNMRVGVIIGKGGETIK--YLQLQSGAKIQVTRDMDADP 177
>ZFIN|ZDB-GENE-050809-108 [details] [associations]
symbol:pygo2 "pygopus homolog 2 (Drosophila)"
species:7955 "Danio rerio" [GO:0008270 "zinc ion binding"
evidence=IEA] [GO:0005575 "cellular_component" evidence=ND]
[GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR001965
InterPro:IPR019787 Pfam:PF00628 PROSITE:PS50016 SMART:SM00249
ZFIN:ZDB-GENE-050809-108 GO:GO:0046872 GO:GO:0008270
Gene3D:3.30.40.10 InterPro:IPR011011 InterPro:IPR013083
SUPFAM:SSF57903 InterPro:IPR019786 PROSITE:PS01359
GeneTree:ENSGT00530000063948 CTD:90780 OrthoDB:EOG4QZ7MB
EMBL:CR628394 IPI:IPI00650328 RefSeq:NP_001028283.2
UniGene:Dr.159286 SMR:Q1L8T6 Ensembl:ENSDART00000131324
GeneID:613247 KEGG:dre:613247 InParanoid:Q1L8T6 OMA:RFGMPPQ
NextBio:20898499 Uniprot:Q1L8T6
Length = 571
Score = 139 (54.0 bits), Expect = 3.2e-06, P = 3.2e-06
Identities = 83/301 (27%), Positives = 103/301 (34%)
Query: 117 MNAPNVDRRADGSYGGATGN-SENETSGRP-----VGQNAYEDGYGVPQ---GHGPPPSA 167
M +P +R S G A + SE P V N ++D +G P G G P A
Sbjct: 16 MKSPEKKKRKSNSQGAAFSHLSEFAPPPTPMVDHLVASNPFDDDFGPPSRSAGGGGPGGA 75
Query: 168 TTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGP 227
T GAG Y G M GPG S PG P P GP
Sbjct: 76 TFLPSPGAGGG----GYGGP--GR-MGGGMGFMGGPGGPGSGQPGRRPPFGPP-TPNTGP 127
Query: 228 SYDPAKG--PGYDPTKGPGYDA----QKGSNYDAQRGPNYD--IHRGPSYDPQRGLGYDM 279
+ G PG+ G G G PN+ +H G ++P G M
Sbjct: 128 HHPLGFGGMPGFGGGGGGGGGGGGGFPPGGPSQFNMPPNFSPPMHPGQGFNPMLSPGA-M 186
Query: 280 QRGPNYDMQRGPGYET----QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQR---GQG 332
GP GP + Q+ P + Q G + + P RGP + G G
Sbjct: 187 GGGPGGG--GGPPHPRFGMPQQQPPHG-QGGHPFNSPPLPGGPGPRGPPHGPMNPMGGMG 243
Query: 333 YDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPY-GSATPPARSGS--GQPRGGN 389
M G G G P GQ PPP + PY GS+ P G G P GG
Sbjct: 244 GGMNMMGMGGGGGGGNMVGGHPGMPPQGQFPPPQDG-PYPGSSPPVGEEGKNFGGPGGGP 302
Query: 390 P 390
P
Sbjct: 303 P 303
>UNIPROTKB|P04258 [details] [associations]
symbol:COL3A1 "Collagen alpha-1(III) chain" species:9913
"Bos taurus" [GO:0005581 "collagen" evidence=IEA] PROSITE:PS01208
GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793
HOGENOM:HOG000085654 HOVERGEN:HBG004933 IPI:IPI00731432 PIR:A02862
UniGene:Bt.64714 STRING:P04258 PRIDE:P04258 Uniprot:P04258
Length = 1049
Score = 142 (55.0 bits), Expect = 3.4e-06, P = 3.4e-06
Identities = 82/262 (31%), Positives = 97/262 (37%)
Query: 142 SGRPVGQNAYEDGYGVPQ---GHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYD 198
SG P G+ G P G GPP G G P SA G P
Sbjct: 521 SGAP-GERGPPGAGGPPGPRGGAGPPGPEGGKGAAGP-PGPPGSAGTPGLQGMPGERGG- 577
Query: 199 IPRGPGYEASKG-PGYDA-SKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQ 256
P GPG + KG PG AP D +GP+ P PG P PG + G+
Sbjct: 578 -PGGPGPKGDKGEPGSSGVDGAPGKDGPRGPT-GPIGPPG--PAGQPGDKGESGA----- 628
Query: 257 RGPNYDIHRGPSYDP-QRG-LGYDMQRG-PNYDMQRG-PGYETQR-VPGYDVQRGPVYEA 311
P GP P +RG G G P Q G PG + +R PG + GP A
Sbjct: 629 --PGVPGIAGPRGGPGERGEQGPPGPAGFPGAPGQNGEPGAKGERGAPGEKGEGGPPGAA 686
Query: 312 QRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPY 371
A P PG +G+ R +P G G G P G P G PP N P
Sbjct: 687 GPAGGSGPAGPPGPQGVKGE----RGSPG-----GPGAAGFPGGRGPPG--PPGSNGNPG 735
Query: 372 --GSATPPARSGSGQPRGGNPA 391
GS+ P + G P G N A
Sbjct: 736 PPGSSGAPGKDGPPGPPGSNGA 757
Score = 139 (54.0 bits), Expect = 7.2e-06, P = 7.2e-06
Identities = 86/286 (30%), Positives = 103/286 (36%)
Query: 126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTSAY 184
A G G G + P G + + G P GPP AG G GP +
Sbjct: 12 AGGGIAGYPGPAGPPGPPGPPGTSGHPGAPGAPGYQGPPGEPGQAGPAGPPGPPGAIGPS 71
Query: 185 AAT-QSGTPMRAAYDIPRG-PGYEASKGP----GYDASKAP-SYDPTKGPSYDPAKGPGY 237
+SG P R PRG PG KGP G+ K +D G +P PG
Sbjct: 72 GKDGESGRPGRPG---PRGFPGPPGMKGPAGMPGFPGMKGHRGFDGRNGEKGEPG-APGL 127
Query: 238 DPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGP-GYE 294
G PG D G RG + R P G G D RG D Q GP G
Sbjct: 128 KGENGVPGEDGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQPGPPGPP 182
Query: 295 -TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 353
T PG +G V A S PG QRG+ A + P G DG+P
Sbjct: 183 GTAGFPGSPGAKGEVGPAGSPGS---SGAPG---QRGEPGPQGHAGAPGPPGPPGSDGSP 236
Query: 354 RGAAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RG--GNPAR 392
G G P + P G+ PP G+ G P RG G P +
Sbjct: 237 GGKGEMG--PAGIPGAPGLIGARGPPGPPGTNGVPGQRGAAGEPGK 280
Score = 122 (48.0 bits), Expect = 0.00052, P = 0.00052
Identities = 84/289 (29%), Positives = 101/289 (34%)
Query: 120 PNVDRRADGSYG--GATG----NSENETSG-R-PVGQNAYEDGYGVPQGHGPPPSATTAG 171
P + DGS G GA G E G R P G N G P G P A G
Sbjct: 304 PKGEDGKDGSPGEPGANGLPGAAGERGVPGFRGPAGANGLPGEKGPPGDRGGPGPAGPRG 363
Query: 172 VVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDP 231
V G P + G +R P GPG GP + P P P
Sbjct: 364 VAGE-PGRN-----GLPGGPGLRGIPGSPGGPGSNGKPGPPGSQGETGRPGPPGSPG--P 415
Query: 232 AKGPGYDPTKGP-GYDAQKGSNYDAQRGPNYDIHRGPS-YDPQRG-LGYDMQRGPNYDM- 287
PG GP G D G N + + GP +GP+ + + G G GP+ D
Sbjct: 416 RGQPGVMGFPGPKGNDGAPGKNGE-RGGPGGPGPQGPAGKNGETGPQGPPGPTGPSGDKG 474
Query: 288 QRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYD-LQRGQGYDMRRAPSYDPSRG 346
GP Q + G GP E + P+ G + G+G D AP RG
Sbjct: 475 DTGPP-GPQGLQGLPGTSGPPGENGKPGEPGPKGEAGAPGIPGGKG-DSG-APG---ERG 528
Query: 347 TGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNPARR 393
G P G P G PP G+A PP GS G G P R
Sbjct: 529 PPGAGGPPG--PRGGAGPPGPEGGKGAAGPPGPPGSAGTPGLQGMPGER 575
>UNIPROTKB|E2R2K8 [details] [associations]
symbol:PPP1R10 "Uncharacterized protein" species:9615
"Canis lupus familiaris" [GO:0008270 "zinc ion binding"
evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0003677 "DNA
binding" evidence=IEA] InterPro:IPR000571 InterPro:IPR003617
InterPro:IPR017923 Pfam:PF00642 Pfam:PF08711 PROSITE:PS50103
PROSITE:PS51319 SMART:SM00356 SMART:SM00509 GO:GO:0005634
GO:GO:0003677 GO:GO:0008270 GO:GO:0006351 Gene3D:1.20.930.10
SUPFAM:SSF47676 CTD:5514 OMA:PPPHEHR GeneTree:ENSGT00530000063820
EMBL:AAEX03008197 RefSeq:XP_848400.1 Ensembl:ENSCAFT00000000645
Ensembl:ENSCAFT00000048295 GeneID:481705 KEGG:cfa:481705
NextBio:20856447 Uniprot:E2R2K8
Length = 940
Score = 141 (54.7 bits), Expect = 3.8e-06, P = 3.8e-06
Identities = 68/268 (25%), Positives = 87/268 (32%)
Query: 128 GSYGGATGNSENETSGRPV---GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAY 184
G +GG G+ G P G + + DG G P GP G G GP
Sbjct: 653 GPHGGPGGSVGPRLLGPPPPPRGGDPFWDGPGDPMRGGP-----MRGGPGPGPGPYHRGR 707
Query: 185 AATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPG 244
P P P + ++G G P+ GP G G+ P +GPG
Sbjct: 708 GGRGGNEPP------PPPPPFRGARG-GRSGGGPPN--GRGGPGGGMVGGGGHRPHEGPG 758
Query: 245 YDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQ 304
S + GP + G + P G G M G + GPG G+
Sbjct: 759 GGMNSSSGHRPHEGPGGGM--GGGHRPHEGPGSSMGGGGGHRPHEGPGGGMGSGSGHRPH 816
Query: 305 RGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPP 364
GP + P GPG + G G+ P G G G P G PH VP
Sbjct: 817 EGPGSGMGGGSGHRPHEGPGGGMGAGGGHR--------PHEGPGH-GGPHGHRPH-DVPS 866
Query: 365 PLNNVPYGSATPPARSGSGQPRGGNPAR 392
+ G R G GG R
Sbjct: 867 HRGHDHRGPPPHEHRGHDGPGHGGGGHR 894
Score = 139 (54.0 bits), Expect = 6.3e-06, P = 6.3e-06
Identities = 56/215 (26%), Positives = 74/215 (34%)
Query: 132 GATGNSENETSGRPV-GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSG 190
G G +E P G G G P G G P G G P+ + SG
Sbjct: 708 GGRGGNEPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGG-GHRPHEGPGGGMNSSSG 766
Query: 191 TPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKG 250
G G+ +GPG + P +GP G G+ P +GPG G
Sbjct: 767 HRPHEGPGGGMGGGHRPHEGPGSSMGGGGGHRPHEGPGGGMGSGSGHRPHEGPGSGMGGG 826
Query: 251 SNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVP---GYDVQRGP 307
S + GP + G + P G G+ GP+ G+ VP G+D RGP
Sbjct: 827 SGHRPHEGPGGGMGAGGGHRPHEGPGHG---GPH-------GHRPHDVPSHRGHD-HRGP 875
Query: 308 VYEAQRA---PSYIPQRGPGYDLQRGQGYDMRRAP 339
R P + G+D G DM P
Sbjct: 876 PPHEHRGHDGPGHGGGGHRGHDGGHSHGGDMSNRP 910
>ZFIN|ZDB-GENE-030131-1600 [details] [associations]
symbol:ewsr1b "Ewing sarcoma breakpoint region 1b"
species:7955 "Danio rerio" [GO:0005622 "intracellular"
evidence=IEA] [GO:0000166 "nucleotide binding" evidence=IEA]
[GO:0008270 "zinc ion binding" evidence=IEA] [GO:0003676 "nucleic
acid binding" evidence=IEA] [GO:0021954 "central nervous system
neuron development" evidence=IMP] [GO:0007067 "mitosis"
evidence=IMP] InterPro:IPR000504 InterPro:IPR001876
InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358
PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360 SMART:SM00547
ZFIN:ZDB-GENE-030131-1600 GO:GO:0007067 GO:GO:0000166 GO:GO:0008270
Gene3D:3.30.70.330 GO:GO:0003676 GO:GO:0005622 GO:GO:0021954
GeneTree:ENSGT00530000063105 HOGENOM:HOG000038010
HOVERGEN:HBG000970 EMBL:BX664747 EMBL:BC097019 UniGene:Dr.76923
SMR:Q4QRG0 STRING:Q4QRG0 Ensembl:ENSDART00000003998 OMA:PVINIYL
Uniprot:Q4QRG0
Length = 579
Score = 142 (55.0 bits), Expect = 3.8e-06, Sum P(2) = 3.8e-06
Identities = 73/255 (28%), Positives = 96/255 (37%)
Query: 119 APNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPN 178
AP+ A YG G + + P YG PQ P A GA
Sbjct: 61 APSAGAYAQQQYGSTYGQAAATAAAAPAA-------YGTPQ---PGAYTQPAQSYGASSY 110
Query: 179 TSTSAYAATQSGTPMRAAYDI-PRGPGYE---ASKGP-GYDASKAPSYDPTKGPSYDPAK 233
T ++A A Q+ + Y P GY A+ P Y AS P+Y+ + +Y
Sbjct: 111 TGSTAAPAAQASYGSQPGYSTQPAYSGYSQQPAASAPQSYSASSQPAYNQS---AYSQPA 167
Query: 234 G---PGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGP--SYDPQRGLGYDMQRGPNYDMQ 288
G PGY + PGY Q+ S Y Q P +GP +Y PQ Y + Y Q
Sbjct: 168 GYSQPGYQAQQ-PGYGQQQQSAY-GQGQPPQQHQQGPPAAYPPQGSSSYAQTQ---YGQQ 222
Query: 289 RGPGYETQRVPGYDVQRGPV---YEAQRAPSYIPQRGPGYDL--QRGQGYDMRRAPSYDP 343
P + Q+ P +G V Y + Y GYD RG+G R
Sbjct: 223 SAPQNDYQQNPYNSYSQGGVSGGYPGSQRGGYQDGGRDGYDRGGPRGRGMG-RGGMGIAG 281
Query: 344 SRGTGFD--GAP-RG 355
RG GF+ G P RG
Sbjct: 282 DRG-GFNKPGGPMRG 295
Score = 139 (54.0 bits), Expect = 8.2e-06, Sum P(2) = 8.2e-06
Identities = 78/283 (27%), Positives = 100/283 (35%)
Query: 126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYA 185
A SYG T +T G+ Q + Y + P +A A A P S AYA
Sbjct: 15 AQQSYGSYTAPPA-QTYGQTAQQGYTQQDYS---SYAQPAAAPEATYSQAAP--SAGAYA 68
Query: 186 ATQSGTPM-RAAYDIPRGPGYEASKGPGYDASKAPSYDPTK--GPSYDPAKGPGYDPTKG 242
Q G+ +AA P + PG A SY + G + PA Y
Sbjct: 69 QQQYGSTYGQAAATAAAAPAAYGTPQPGAYTQPAQSYGASSYTGSTAAPAAQASYGSQ-- 126
Query: 243 PGYDAQKG-SNYDAQ---RGP-NYDIHRGPSYDPQRGLGYDMQRG---PNYDMQRGPGYE 294
PGY Q S Y Q P +Y P+Y+ Y G P Y Q+ PGY
Sbjct: 127 PGYSTQPAYSGYSQQPAASAPQSYSASSQPAYNQS---AYSQPAGYSQPGYQAQQ-PGYG 182
Query: 295 TQRVPGYDVQRGPVYEAQRAPS-YIPQRGPGY-DLQRGQGY----DMRRAPSYDPSRGT- 347
Q+ Y + P Q P+ Y PQ Y Q GQ D ++ P S+G
Sbjct: 183 QQQQSAYGQGQPPQQHQQGPPAAYPPQGSSSYAQTQYGQQSAPQNDYQQNPYNSYSQGGV 242
Query: 348 --GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 388
G+ G+ RG G P G G RGG
Sbjct: 243 SGGYPGSQRGGYQDGGRDGYDRGGPRGRGMGRGGMGIAGDRGG 285
Score = 39 (18.8 bits), Expect = 3.8e-06, Sum P(2) = 3.8e-06
Identities = 8/16 (50%), Positives = 8/16 (50%)
Query: 377 PARSGSGQPRGGNPAR 392
P R G G RGG R
Sbjct: 410 PMRGGPGMDRGGMMGR 425
>ZFIN|ZDB-GENE-040426-1010 [details] [associations]
symbol:fus "fusion (involved in t(12;16) in
malignant liposarcoma)" species:7955 "Danio rerio" [GO:0000166
"nucleotide binding" evidence=IEA] [GO:0008270 "zinc ion binding"
evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
[GO:0005622 "intracellular" evidence=IEA] InterPro:IPR000504
InterPro:IPR001876 InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641
PROSITE:PS01358 PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360
SMART:SM00547 ZFIN:ZDB-GENE-040426-1010 GO:GO:0000166 GO:GO:0008270
Gene3D:3.30.70.330 GO:GO:0003676 GO:GO:0005622
GeneTree:ENSGT00530000063105 KO:K13098 CTD:2521 EMBL:BX571714
IPI:IPI00785727 RefSeq:NP_957377.2 UniGene:Dr.114403
Ensembl:ENSDART00000055340 GeneID:394058 KEGG:dre:394058
NextBio:20815017 Bgee:F1R0M4 Uniprot:F1R0M4
Length = 541
Score = 137 (53.3 bits), Expect = 4.9e-06, P = 4.9e-06
Identities = 64/250 (25%), Positives = 91/250 (36%)
Query: 120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNT 179
P+ + SYGG N +E+S P Q Y YG Q G A + G + +
Sbjct: 28 PSAQNYSQQSYGGY--NQSSESSSAPYNQGGYSSNYGQSQSGGYGSQAPSQGYSQSSQSY 85
Query: 180 STSAYAATQSGTPMRAAYDIPRGPGY-EASKGPGYDASKAPSYDPTKGPSYDPAKGPGYD 238
S+ Y+ T P ++ GY + S GY+ S +P+ P S + G G
Sbjct: 86 SSGGYSNTSQPPPAQSG-------GYSQQSSYSGYNQS-SPASAPGGYSSSSQSSGYGQQ 137
Query: 239 PTK-GPGYDAQKGSN--YDAQRGPNYDIH-RGPSYDPQRGLGYDMQRGPNYDMQRGPGYE 294
+ G GY G + Y + G + G + + G PNY Y
Sbjct: 138 QQQSGGGYGGSGGQSGGYGSSGGQSSGFGGSGGQHQSSQSGGGSYSPSPNYSSPPPQSYG 197
Query: 295 TQRV---PGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDG 351
Q GY+ P+ Y Q G GY Q G+G R + GFD
Sbjct: 198 QQSQYGQGGYNQDSPPMSGGGGGGGYGGQDG-GYS-QDGRG-GRGRGGGFGGRGAGGFDR 254
Query: 352 APRGAAPHGQ 361
RG P G+
Sbjct: 255 GGRGG-PRGR 263
>UNIPROTKB|I3LQ53 [details] [associations]
symbol:I3LQ53 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0006366 "transcription from RNA polymerase II promoter"
evidence=IEA] [GO:0005665 "DNA-directed RNA polymerase II, core
complex" evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA]
InterPro:IPR000684 Pfam:PF05001 PROSITE:PS00115 GO:GO:0003677
GO:GO:0006366 GO:GO:0005665 GeneTree:ENSGT00700000104490
EMBL:FP565284 Ensembl:ENSSSCT00000030016 OMA:YAESDYL Uniprot:I3LQ53
Length = 543
Score = 137 (53.3 bits), Expect = 5.0e-06, P = 5.0e-06
Identities = 71/265 (26%), Positives = 95/265 (35%)
Query: 118 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP--SATTAGVVGA 175
N P + A G G G++ + G + G G P S T G G
Sbjct: 62 NIPGLG--AAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGF 119
Query: 176 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP 235
P+ ++ A + +P A P PG PG + PS PSY P P
Sbjct: 120 SPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGAMSPSYSPTS-P 171
Query: 236 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
Y+P GY Q S Y + P+Y PSY P Y P+Y P Y
Sbjct: 172 AYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPSYS-PTSPSYSP 225
Query: 296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
P Y P Y + +PSY P P Y Y +PSY P+ + +P
Sbjct: 226 TS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPTSPSYSPTSP-S 278
Query: 356 AAPHGQVPPPLNNVPYGSATPPARS 380
+P P + P S T P+ S
Sbjct: 279 YSPTSPSYSPTS--PSYSPTSPSYS 301
Score = 121 (47.7 bits), Expect = 0.00029, P = 0.00029
Identities = 63/225 (28%), Positives = 80/225 (35%)
Query: 163 PPPSATTAGVVGAGPNTSTSAYAATQSG-TPMRAAYDIPRGPGYEASKGPGYDAS----- 216
P S T+ PN Y T +P +Y P P Y + P Y S
Sbjct: 333 PSYSPTSPSYSPTSPN-----YTPTSPNYSPTSPSYS-PTSPSYSPTS-PSYSPSSPRYT 385
Query: 217 -KAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL 275
++P+Y P+ PSY P+ P Y PT P Y S Y P Y P Y P
Sbjct: 386 PQSPTYTPSS-PSYSPSS-PSYSPTS-PKYTPTSPS-YSPS-SPEYT-PTSPKYSPTSPK 439
Query: 276 GYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDM 335
Y P Y P Y + P Y P Y + +P Y P P Y Y
Sbjct: 440 -YS-PTSPKYS-PTSPTY-SPTTPKYS-PTSPTY-SPTSPVYTPT-SPKYS-PTSPTYSP 491
Query: 336 RRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARS 380
+P Y P+ T +P+G+ P P S T PA S
Sbjct: 492 T-SPKYSPTSPTYSPTSPKGSTYSPTSPGYSPTSPTYSLTSPAIS 535
>UNIPROTKB|F1MXS8 [details] [associations]
symbol:COL3A1 "Collagen alpha-1(III) chain" species:9913
"Bos taurus" [GO:0071230 "cellular response to amino acid stimulus"
evidence=IEA] [GO:0050777 "negative regulation of immune response"
evidence=IEA] [GO:0048565 "digestive tract development"
evidence=IEA] [GO:0048407 "platelet-derived growth factor binding"
evidence=IEA] [GO:0046332 "SMAD binding" evidence=IEA] [GO:0043588
"skin development" evidence=IEA] [GO:0043206 "extracellular fibril
organization" evidence=IEA] [GO:0042060 "wound healing"
evidence=IEA] [GO:0034097 "response to cytokine stimulus"
evidence=IEA] [GO:0032964 "collagen biosynthetic process"
evidence=IEA] [GO:0030199 "collagen fibril organization"
evidence=IEA] [GO:0018149 "peptide cross-linking" evidence=IEA]
[GO:0009314 "response to radiation" evidence=IEA] [GO:0007507
"heart development" evidence=IEA] [GO:0007229 "integrin-mediated
signaling pathway" evidence=IEA] [GO:0007179 "transforming growth
factor beta receptor signaling pathway" evidence=IEA] [GO:0007160
"cell-matrix adhesion" evidence=IEA] [GO:0005615 "extracellular
space" evidence=IEA] [GO:0005586 "collagen type III" evidence=IEA]
[GO:0005201 "extracellular matrix structural constituent"
evidence=IEA] [GO:0001568 "blood vessel development" evidence=IEA]
InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
SMART:SM00038 SMART:SM00214 GO:GO:0043588 GO:GO:0005615
GO:GO:0007507 GO:GO:0034097 GO:GO:0030199 GO:GO:0007179
GO:GO:0007229 GO:GO:0007160 InterPro:IPR008160 Pfam:PF01391
GO:GO:0042060 GO:GO:0001568 GO:GO:0048565 GO:GO:0050777
GO:GO:0009314 GO:GO:0018149 GO:GO:0032964 GO:GO:0071230
GO:GO:0043206 GO:GO:0005201 GeneTree:ENSGT00660000095287
IPI:IPI00731432 OMA:EGSPGHP GO:GO:0005586 EMBL:DAAA02003919
EMBL:DAAA02003920 Ensembl:ENSBTAT00000028617 ArrayExpress:F1MXS8
Uniprot:F1MXS8
Length = 1466
Score = 142 (55.0 bits), Expect = 5.1e-06, P = 5.1e-06
Identities = 82/262 (31%), Positives = 97/262 (37%)
Query: 142 SGRPVGQNAYEDGYGVPQ---GHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYD 198
SG P G+ G P G GPP G G P SA G P
Sbjct: 677 SGAP-GERGPPGAGGPPGPRGGAGPPGPEGGKGAAGP-PGPPGSAGTPGLQGMPGERGG- 733
Query: 199 IPRGPGYEASKG-PGYDA-SKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQ 256
P GPG + KG PG AP D +GP+ P PG P PG + G+
Sbjct: 734 -PGGPGPKGDKGEPGSSGVDGAPGKDGPRGPT-GPIGPPG--PAGQPGDKGESGA----- 784
Query: 257 RGPNYDIHRGPSYDP-QRG-LGYDMQRG-PNYDMQRG-PGYETQR-VPGYDVQRGPVYEA 311
P GP P +RG G G P Q G PG + +R PG + GP A
Sbjct: 785 --PGVPGIAGPRGGPGERGEQGPPGPAGFPGAPGQNGEPGAKGERGAPGEKGEGGPPGAA 842
Query: 312 QRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPY 371
A P PG +G+ R +P G G G P G P G PP N P
Sbjct: 843 GPAGGSGPAGPPGPQGVKGE----RGSPG-----GPGAAGFPGGRGPPG--PPGSNGNPG 891
Query: 372 --GSATPPARSGSGQPRGGNPA 391
GS+ P + G P G N A
Sbjct: 892 PPGSSGAPGKDGPPGPPGSNGA 913
Score = 129 (50.5 bits), Expect = 0.00013, P = 0.00013
Identities = 78/257 (30%), Positives = 104/257 (40%)
Query: 156 GVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMR--AAYDIPRGP----GYEASK 209
G P GPP + G + AY +SG A Y P GP G +
Sbjct: 130 GSPGSPGPPGICESCPTGGQNYSPQYEAYDV-KSGVAGGGIAGYPGPAGPPGPPGPPGTS 188
Query: 210 G-PGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP----GYDAQKGS-NYDAQRG-PNYD 262
G PG KA P + SY P PG GP G D + G +RG P
Sbjct: 189 GHPGAPHLKAWQKPPQQSTSYSPIGPPGPPGAIGPSGPAGKDGESGRPGRPGERGFPGPP 248
Query: 263 IHRGPSYDP----QRG-LGYDMQRGPNYDMQRGPGYETQR-VPGYDVQRGPVYEAQRAPS 316
+GP+ P +G G+D + G + PG + + VPG + GP+ + AP
Sbjct: 249 GMKGPAGMPGFPGMKGHRGFDGRNGEKGETG-APGLKGENGVPGENGAPGPM-GPRGAPG 306
Query: 317 YIPQRG-PGYDLQRG----QGYDMRRAPSYDPSRGT-GFDGAPRGAAPHGQVPPPLNNVP 370
+ G PG RG +G D + P P GT GF G+P GA G+V P P
Sbjct: 307 ERGRPGLPGAAGARGNDGARGSDGQPGPPGPP--GTAGFPGSP-GAK--GEVGPA--GSP 359
Query: 371 YGSATPPARSGSGQPRG 387
GS+ P + G P+G
Sbjct: 360 -GSSGAPGQRGEPGPQG 375
Score = 123 (48.4 bits), Expect = 0.00060, P = 0.00060
Identities = 84/289 (29%), Positives = 101/289 (34%)
Query: 120 PNVDRRADGSYG--GATG----NSENETSG-R-PVGQNAYEDGYGVPQGHGPPPSATTAG 171
P + DGS G GA G E G R P G N G P G P A G
Sbjct: 460 PKGEDGKDGSPGEPGANGLPGAAGERGVPGFRGPAGANGLPGEKGPPGDRGGPGPAGPRG 519
Query: 172 VVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDP 231
V G P G +R P GPG + GP + P P P
Sbjct: 520 VAGE-PGRD-----GLPGGPGLRGIPGSPGGPGSDGKPGPPGSQGETGRPGPPGSPG--P 571
Query: 232 AKGPGYDPTKGP-GYDAQKGSNYDAQRGPNYDIHRGPS-YDPQRG-LGYDMQRGPNYDM- 287
PG GP G D G N + + GP +GP+ + + G G GP+ D
Sbjct: 572 RGQPGVMGFPGPKGNDGAPGKNGE-RGGPGGPGPQGPAGKNGETGPQGPPGPTGPSGDKG 630
Query: 288 QRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYD-LQRGQGYDMRRAPSYDPSRG 346
GP Q + G GP E + P+ G + G+G D AP RG
Sbjct: 631 DTGPP-GPQGLQGLPGTSGPPGENGKPGEPGPKGEAGAPGIPGGKG-DSG-APG---ERG 684
Query: 347 TGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNPARR 393
G P G P G PP G+A PP GS G G P R
Sbjct: 685 PPGAGGPPG--PRGGAGPPGPEGGKGAAGPPGPPGSAGTPGLQGMPGER 731
>UNIPROTKB|J9P8F7 [details] [associations]
symbol:COL5A1 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
"extracellular matrix structural constituent" evidence=IEA]
InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
SMART:SM00038 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
GO:GO:0005201 GeneTree:ENSGT00700000104155 EMBL:AAEX03006798
EMBL:AAEX03006799 EMBL:AAEX03006800 Ensembl:ENSCAFT00000044143
Uniprot:J9P8F7
Length = 1405
Score = 141 (54.7 bits), Expect = 6.2e-06, P = 6.2e-06
Identities = 77/254 (30%), Positives = 100/254 (39%)
Query: 145 PVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG-P 203
PVG + G P GP S G GA Q G P A +G P
Sbjct: 634 PVGALGLKGSEGPPGPPGPAGSPGERGPAGAAGPIGIPGRPGPQ-GPPGPAG---EKGAP 689
Query: 204 GYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRGPN 260
G + +GP G D + P P GP+ P PG D KG G QKGS D ++GP
Sbjct: 690 GEKGPQGPAGRDGLQGPVGLP--GPA-GPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGPP 746
Query: 261 YDIHRGPSYDPQRGLGYDMQRGPN-YDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIP 319
GP+ PQ +G Q GP+ D + GP + Q + G GP P +
Sbjct: 747 -----GPT-GPQGPIG---QPGPSGADGEPGPRGQ-QGLFGQKGDEGP-RGFPGPPGPVG 795
Query: 320 QRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNV-PYGSATPP 377
+G PG ++G+ D+ + P G GAP P G P + N G P
Sbjct: 796 LQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGAPGADGPQGP-PGGIGNPGAVGEKGEP 854
Query: 378 ARSGS-GQPRGGNP 390
+G G P G P
Sbjct: 855 GEAGEPGLPGEGGP 868
>UNIPROTKB|E1C0T1 [details] [associations]
symbol:TFG "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0004871 "signal transducer activity" evidence=IEA]
[GO:0042802 "identical protein binding" evidence=IEA] [GO:0043123
"positive regulation of I-kappaB kinase/NF-kappaB cascade"
evidence=IEA] InterPro:IPR000270 Pfam:PF00564 SMART:SM00666
GO:GO:0043123 GO:GO:0004871 CTD:10342 KO:K09292 OMA:YTTQTSQ
GeneTree:ENSGT00510000047809 EMBL:AADN02032793 IPI:IPI00599103
RefSeq:XP_416608.1 UniGene:Gga.1550 PRIDE:E1C0T1
Ensembl:ENSGALT00000024692 GeneID:418391 KEGG:gga:418391
NextBio:20821576 Uniprot:E1C0T1
Length = 395
Score = 134 (52.2 bits), Expect = 6.3e-06, P = 6.3e-06
Identities = 57/210 (27%), Positives = 81/210 (38%)
Query: 175 AGPNTSTSAYAATQSGTP--MRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPA 232
AGP SA A +SGTP + ++ PG + + P Y ++ + +G Y
Sbjct: 194 AGP---PSAPAEERSGTPDSIASSSSAAHPPGVQPQQAP-YPGAQPQTGQQVEGQMYQQY 249
Query: 233 KGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPG 292
+ PGY P + P AQ Y Q Y + S Q+ Y Q P G G
Sbjct: 250 QQPGY-PAQQP--QAQPQQQYGVQYPAGYSPQQAASQPTQQFPAYSQQPAPAAAFP-GQG 305
Query: 293 YETQRVPGYDVQRGPV--YEAQ----RAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRG 346
+ Q++P Q+ P + Q +A P GP Q G R P + P G
Sbjct: 306 -QAQQLPAQQPQQYPAGSFPPQPYTTQASQPAPYSGPP-GAQAAPGTFQPR-PGFTPPPG 362
Query: 347 TGFDGAPRGAAPHGQVPPPLNNVPYGSATP 376
+ P G P+ + PP P G A P
Sbjct: 363 STMTPPPSGPNPYARTRPPFG--PQGYAQP 390
Score = 133 (51.9 bits), Expect = 8.1e-06, P = 8.1e-06
Identities = 54/197 (27%), Positives = 70/197 (35%)
Query: 200 PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGP 259
P P E S P AS + + P P P + P Y PG Q G + Q
Sbjct: 197 PSAPAEERSGTPDSIASSSSAAHP---PGVQPQQAP-Y-----PGAQPQTGQQVEGQM-- 245
Query: 260 NYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSY-I 318
Y ++ P Y Q+ Q+ Y +Q GY Q+ Q+ P Y Q AP+
Sbjct: 246 -YQQYQQPGYPAQQPQAQPQQQ---YGVQYPAGYSPQQAASQPTQQFPAYSQQPAPAAAF 301
Query: 319 PQRGPGYDL--QRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATP 376
P +G L Q+ Q Y P P AP P Q P G P
Sbjct: 302 PGQGQAQQLPAQQPQQYPAGSFPP-QPYTTQASQPAPYSGPPGAQAAPGTFQPRPGFTPP 360
Query: 377 PARSGSGQPRGGNPARR 393
P + + P G NP R
Sbjct: 361 PGSTMTPPPSGPNPYAR 377
>UNIPROTKB|F1LLX1 [details] [associations]
symbol:Col11a1 "Collagen alpha-1(XI) chain" species:10116
"Rattus norvegicus" [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
SMART:SM00038 Gene3D:2.60.120.200 InterPro:IPR008985
InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160
Pfam:PF01391 InterPro:IPR001791 SMART:SM00282 Pfam:PF02210
GO:GO:0005201 SMART:SM00210 GeneTree:ENSGT00700000104155
OMA:HPGKEGQ IPI:IPI00949317 Ensembl:ENSRNOT00000024138
ArrayExpress:F1LLX1 Uniprot:F1LLX1
Length = 1803
Score = 142 (55.0 bits), Expect = 6.4e-06, P = 6.4e-06
Identities = 87/280 (31%), Positives = 107/280 (38%)
Query: 132 GATGNSENE-TSGR--PVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTST----SA 183
GA G+ + SG+ P G + G+P G P G G GP S SA
Sbjct: 1003 GAKGDPGPQGISGKDGPAGIRGFPGERGLPGAQGAPGLKGGEGPQGPQGPIGSPGERGSA 1062
Query: 184 YAATQSGTPMRAAYDIPRGP----GYEASKGP----GYDASKAPSYDPTKGPSYDPAKGP 235
A G P R P GP G KGP G D + P P GP+ PA P
Sbjct: 1063 GTAGPIGLPGRPGPQGPPGPAGEKGAPGEKGPQGPAGRDGVQGPVGLP--GPA-GPAGSP 1119
Query: 236 GYDPTKGP-GYDAQKGSNYD-AQRGPNYDIH-RGPSYDPQRGL-GYDMQRGPN--YDMQR 289
G D KG G QKGS D + GP +GP P G+ G D + GP M
Sbjct: 1120 GEDGDKGEIGEPGQKGSKGDKGENGPPGPPGLQGPVGAP--GIAGGDGEAGPRGQQGMFG 1177
Query: 290 GPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDL-QRGQ-GYDMRRAPSYDPSRGT 347
G E R G+ GP+ Q P ++G D+ G G R P P+
Sbjct: 1178 QKGDEGAR--GFPGPPGPI-GLQGLPGPPGEKGENGDVGPMGPPGPPGPRGPQ-GPNGAD 1233
Query: 348 GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 387
G G P G V G+ PP +GSG P+G
Sbjct: 1234 GPQGPPGSIGSVGGVGEKGEPGEAGNPGPPGEAGSGGPKG 1273
>RGD|2372 [details] [associations]
symbol:Col11a1 "collagen, type XI, alpha 1" species:10116 "Rattus
norvegicus" [GO:0001502 "cartilage condensation" evidence=ISO]
[GO:0001503 "ossification" evidence=IEP] [GO:0002063 "chondrocyte
development" evidence=ISO] [GO:0003007 "heart morphogenesis"
evidence=ISO] [GO:0005201 "extracellular matrix structural
constituent" evidence=TAS] [GO:0005581 "collagen" evidence=ISO]
[GO:0005592 "collagen type XI" evidence=ISO] [GO:0006029
"proteoglycan metabolic process" evidence=ISO] [GO:0007601 "visual
perception" evidence=ISO] [GO:0007605 "sensory perception of sound"
evidence=ISO] [GO:0030199 "collagen fibril organization"
evidence=ISO;TAS] [GO:0031012 "extracellular matrix"
evidence=ISO;IDA] [GO:0035989 "tendon development" evidence=ISO]
[GO:0042472 "inner ear morphogenesis" evidence=ISO] [GO:0048704
"embryonic skeletal system morphogenesis" evidence=ISO] [GO:0048705
"skeletal system morphogenesis" evidence=ISO] [GO:0050910 "detection
of mechanical stimulus involved in sensory perception of sound"
evidence=ISO] [GO:0051216 "cartilage development" evidence=ISO]
[GO:0055010 "ventricular cardiac muscle tissue morphogenesis"
evidence=ISO] InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078
PROSITE:PS51461 SMART:SM00038 RGD:2372 GO:GO:0046872 GO:GO:0007601
GO:GO:0030199 Gene3D:2.60.120.200 InterPro:IPR008985
InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0042472 GO:GO:0050910
GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793
InterPro:IPR001791 SMART:SM00282 PROSITE:PS50025 GO:GO:0001502
GO:GO:0048704 GO:GO:0006029 GO:GO:0055010 Pfam:PF02210 GO:GO:0005201
GO:GO:0002063 HOGENOM:HOG000085654 KO:K06236 HOVERGEN:HBG103137
OrthoDB:EOG49GKHM SMART:SM00210 GeneTree:ENSGT00700000104155 CTD:1301
EMBL:AABR03012126 EMBL:AABR03013126 EMBL:AABR03014171
EMBL:AABR03015382 EMBL:AABR03015832 EMBL:AABR03016562
EMBL:AABR03017847 EMBL:AABR03017951 EMBL:AABR03018245
EMBL:AABR03019675 EMBL:AABR03023874 EMBL:U20116 EMBL:U20118
EMBL:U20121 IPI:IPI00189470 IPI:IPI00189494 IPI:IPI00325589
IPI:IPI00949317 IPI:IPI00959233 PIR:B31795 RefSeq:NP_037249.1
UniGene:Rn.260 IntAct:P20909 STRING:P20909 PhosphoSite:P20909
PRIDE:P20909 Ensembl:ENSRNOT00000023693 Ensembl:ENSRNOT00000068413
GeneID:25654 KEGG:rno:25654 UCSC:RGD:2372 InParanoid:P20909
NextBio:607535 ArrayExpress:P20909 Genevestigator:P20909
GermOnline:ENSRNOG00000023148 Uniprot:P20909
Length = 1804
Score = 142 (55.0 bits), Expect = 6.4e-06, P = 6.4e-06
Identities = 87/280 (31%), Positives = 107/280 (38%)
Query: 132 GATGNSENE-TSGR--PVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTST----SA 183
GA G+ + SG+ P G + G+P G P G G GP S SA
Sbjct: 1004 GAKGDPGPQGISGKDGPAGIRGFPGERGLPGAQGAPGLKGGEGPQGPQGPIGSPGERGSA 1063
Query: 184 YAATQSGTPMRAAYDIPRGP----GYEASKGP----GYDASKAPSYDPTKGPSYDPAKGP 235
A G P R P GP G KGP G D + P P GP+ PA P
Sbjct: 1064 GTAGPIGLPGRPGPQGPPGPAGEKGAPGEKGPQGPAGRDGVQGPVGLP--GPA-GPAGSP 1120
Query: 236 GYDPTKGP-GYDAQKGSNYD-AQRGPNYDIH-RGPSYDPQRGL-GYDMQRGPN--YDMQR 289
G D KG G QKGS D + GP +GP P G+ G D + GP M
Sbjct: 1121 GEDGDKGEIGEPGQKGSKGDKGENGPPGPPGLQGPVGAP--GIAGGDGEAGPRGQQGMFG 1178
Query: 290 GPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDL-QRGQ-GYDMRRAPSYDPSRGT 347
G E R G+ GP+ Q P ++G D+ G G R P P+
Sbjct: 1179 QKGDEGAR--GFPGPPGPI-GLQGLPGPPGEKGENGDVGPMGPPGPPGPRGPQ-GPNGAD 1234
Query: 348 GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 387
G G P G V G+ PP +GSG P+G
Sbjct: 1235 GPQGPPGSIGSVGGVGEKGEPGEAGNPGPPGEAGSGGPKG 1274
>UNIPROTKB|P20909 [details] [associations]
symbol:Col11a1 "Collagen alpha-1(XI) chain" species:10116
"Rattus norvegicus" [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
[GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000885
Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461 SMART:SM00038 RGD:2372
GO:GO:0046872 GO:GO:0007601 GO:GO:0030199 Gene3D:2.60.120.200
InterPro:IPR008985 InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0042472
GO:GO:0050910 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
eggNOG:NOG12793 InterPro:IPR001791 SMART:SM00282 PROSITE:PS50025
GO:GO:0001502 GO:GO:0048704 GO:GO:0006029 GO:GO:0055010
Pfam:PF02210 GO:GO:0005201 GO:GO:0002063 HOGENOM:HOG000085654
KO:K06236 HOVERGEN:HBG103137 OrthoDB:EOG49GKHM SMART:SM00210
GeneTree:ENSGT00700000104155 CTD:1301 EMBL:AABR03012126
EMBL:AABR03013126 EMBL:AABR03014171 EMBL:AABR03015382
EMBL:AABR03015832 EMBL:AABR03016562 EMBL:AABR03017847
EMBL:AABR03017951 EMBL:AABR03018245 EMBL:AABR03019675
EMBL:AABR03023874 EMBL:U20116 EMBL:U20118 EMBL:U20121
IPI:IPI00189470 IPI:IPI00189494 IPI:IPI00325589 IPI:IPI00949317
IPI:IPI00959233 PIR:B31795 RefSeq:NP_037249.1 UniGene:Rn.260
IntAct:P20909 STRING:P20909 PhosphoSite:P20909 PRIDE:P20909
Ensembl:ENSRNOT00000023693 Ensembl:ENSRNOT00000068413 GeneID:25654
KEGG:rno:25654 UCSC:RGD:2372 InParanoid:P20909 NextBio:607535
ArrayExpress:P20909 Genevestigator:P20909
GermOnline:ENSRNOG00000023148 Uniprot:P20909
Length = 1804
Score = 142 (55.0 bits), Expect = 6.4e-06, P = 6.4e-06
Identities = 87/280 (31%), Positives = 107/280 (38%)
Query: 132 GATGNSENE-TSGR--PVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTST----SA 183
GA G+ + SG+ P G + G+P G P G G GP S SA
Sbjct: 1004 GAKGDPGPQGISGKDGPAGIRGFPGERGLPGAQGAPGLKGGEGPQGPQGPIGSPGERGSA 1063
Query: 184 YAATQSGTPMRAAYDIPRGP----GYEASKGP----GYDASKAPSYDPTKGPSYDPAKGP 235
A G P R P GP G KGP G D + P P GP+ PA P
Sbjct: 1064 GTAGPIGLPGRPGPQGPPGPAGEKGAPGEKGPQGPAGRDGVQGPVGLP--GPA-GPAGSP 1120
Query: 236 GYDPTKGP-GYDAQKGSNYD-AQRGPNYDIH-RGPSYDPQRGL-GYDMQRGPN--YDMQR 289
G D KG G QKGS D + GP +GP P G+ G D + GP M
Sbjct: 1121 GEDGDKGEIGEPGQKGSKGDKGENGPPGPPGLQGPVGAP--GIAGGDGEAGPRGQQGMFG 1178
Query: 290 GPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDL-QRGQ-GYDMRRAPSYDPSRGT 347
G E R G+ GP+ Q P ++G D+ G G R P P+
Sbjct: 1179 QKGDEGAR--GFPGPPGPI-GLQGLPGPPGEKGENGDVGPMGPPGPPGPRGPQ-GPNGAD 1234
Query: 348 GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 387
G G P G V G+ PP +GSG P+G
Sbjct: 1235 GPQGPPGSIGSVGGVGEKGEPGEAGNPGPPGEAGSGGPKG 1274
>TAIR|locus:2077547 [details] [associations]
symbol:AT3G07030 species:3702 "Arabidopsis thaliana"
[GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005829
"cytosol" evidence=IDA] InterPro:IPR002775 Pfam:PF01918
GO:GO:0005829 EMBL:CP002686 GO:GO:0003676 IPI:IPI00519674
RefSeq:NP_187359.2 UniGene:At.74527 ProteinModelPortal:F4JD88
SMR:F4JD88 PRIDE:F4JD88 EnsemblPlants:AT3G07030.1 GeneID:3768790
KEGG:ath:AT3G07030 OMA:ERRNDGY Uniprot:F4JD88
Length = 405
Score = 134 (52.2 bits), Expect = 6.6e-06, P = 6.6e-06
Identities = 57/209 (27%), Positives = 72/209 (34%)
Query: 149 NAY-EDGYGVPQGHGPPP--SATTAGVVGAGPNTSTSAYAATQS-GTPMRA-AYDI-PRG 202
NAY E+G V +G TT GV+ + T G RA A D+
Sbjct: 150 NAYGEEGEVVAEGEAGEEVDMETTKGVMKEKTKGTIKKIIKTMKVGIQTRAEAVDVVDEA 209
Query: 203 PGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYD 262
+G GY + Y + Y + GY + Y + Y R Y
Sbjct: 210 MAIVGGRG-GYGGGRDGGYGGGRDDGYGERRNDGYGERRNDRYGGGRDDGYGGGRDDGYG 268
Query: 263 IHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG 322
R Y +RG G+ RG D G G G +G Y R Y RG
Sbjct: 269 GGRNDGYGGRRG-GFRGGRGGGRDEGYGGG--RGGYGGRSGGQGDGYGGGRGDGYGGGRG 325
Query: 323 PGYDLQRGQGYDMRRAPSYDPSRGTGFDG 351
GY RG GY R YD R G+ G
Sbjct: 326 DGYGGGRGDGYGGGRVDRYDGGRRDGYGG 354
Score = 125 (49.1 bits), Expect = 6.6e-05, P = 6.6e-05
Identities = 50/158 (31%), Positives = 59/158 (37%)
Query: 201 RGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPN 260
R GY + GY + Y +G + +G G D GY +G Y + G
Sbjct: 255 RDDGYGGGRDDGYGGGRNDGYGGRRG-GFRGGRGGGRDE----GYGGGRGG-YGGRSGG- 307
Query: 261 YDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQ 320
+G Y RG GY RG Y RG GY RV YD R Y R Y
Sbjct: 308 ----QGDGYGGGRGDGYGGGRGDGYGGGRGDGYGGGRVDRYDGGRRDGYGGGRYDGYGGG 363
Query: 321 RGPGYDLQRGQGYDMRRAPSYDPSRGT-GFDGAPRGAA 357
+ GY RG GY R Y RG G G R A
Sbjct: 364 KSDGYGGGRG-GYRGGRG-GYGRGRGRMGNGGRSRDGA 399
Score = 122 (48.0 bits), Expect = 0.00014, P = 0.00014
Identities = 52/181 (28%), Positives = 63/181 (34%)
Query: 127 DGSYGGATGNSENETSGRPVGQNAYED-GYGVPQGHGPPPSATTAGVVGAGPNTSTSAYA 185
DG YGG + E G+ + G G G+G G G G N Y
Sbjct: 224 DGGYGGGRDDGYGERRNDGYGERRNDRYGGGRDDGYG---GGRDDGY-GGGRN---DGYG 276
Query: 186 ATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGY 245
+ G R R GY +G GY +G Y +G GY +G GY
Sbjct: 277 GRRGG--FRGGRGGGRDEGYGGGRG-GYGGRSGG-----QGDGYGGGRGDGYGGGRGDGY 328
Query: 246 DAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQR 305
+G Y R YD R Y R GY + Y RG GY R GY R
Sbjct: 329 GGGRGDGYGGGRVDRYDGGRRDGYGGGRYDGYGGGKSDGYGGGRG-GYRGGR-GGYGRGR 386
Query: 306 G 306
G
Sbjct: 387 G 387
>UNIPROTKB|Q8WML4 [details] [associations]
symbol:MUC1 "Mucin-1" species:9913 "Bos taurus" [GO:0016324
"apical plasma membrane" evidence=IBA] [GO:0009986 "cell surface"
evidence=IBA] [GO:0005737 "cytoplasm" evidence=IBA] [GO:0005634
"nucleus" evidence=IEA] [GO:0016021 "integral to membrane"
evidence=IEA] PANTHER:PTHR10006 GO:GO:0016021 GO:GO:0005634
GO:GO:0005737 GO:GO:0009986 GO:GO:0016324 InterPro:IPR000082
Pfam:PF01390 SMART:SM00200 PROSITE:PS50024 EMBL:AJ400824
EMBL:AF399757 IPI:IPI00706283 RefSeq:NP_776540.1 UniGene:Bt.9561
HSSP:Q16615 ProteinModelPortal:Q8WML4 SMR:Q8WML4 STRING:Q8WML4
MEROPS:S71.001 Ensembl:ENSBTAT00000014051 GeneID:281333
KEGG:bta:281333 CTD:4582 eggNOG:NOG77744
GeneTree:ENSGT00700000104548 HOGENOM:HOG000290201
HOVERGEN:HBG003075 InParanoid:Q8WML4 KO:K06568 OMA:PPAHGVT
OrthoDB:EOG4NGGNM NextBio:20805343 PMAP-CutDB:Q8WML4
ArrayExpress:Q8WML4 InterPro:IPR023217 Uniprot:Q8WML4
Length = 580
Score = 136 (52.9 bits), Expect = 7.0e-06, P = 7.0e-06
Identities = 59/261 (22%), Positives = 99/261 (37%)
Query: 137 SENETSGRPVGQNAYEDGYGVPQGHGPPPS-ATTAGVVGAGPNTSTSAYAATQSGTPMRA 195
+++ + P + ++ + P PS A + G GA +T TS+ A + + +P
Sbjct: 44 TQSSPTSSPTKETSWSTTTTLLTASSPAPSPAASPGHDGA--STPTSSPAPSPAASPGHD 101
Query: 196 AYDIPRG-PGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYD 254
P P + PG+D + P+ P P+ P PT P ++
Sbjct: 102 GASTPTSSPAPSPAASPGHDGASTPTSSPAPSPAASPGHDGASTPTSSPAPSPAASPGHN 161
Query: 255 AQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRA 314
P P+ P G+D P P PG++ P A
Sbjct: 162 GTSSPT----GSPAPSPAASPGHDGASTPTSSPAPSPAAS----PGHNGTSSPT--GSPA 211
Query: 315 PSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDG--APRGA-APHGQVPPPLNNVPY 371
PS P PG+D G +P+ P+ G +G +P G+ AP P ++ P
Sbjct: 212 PS--PAASPGHD---GASTPTS-SPAPSPAASPGHNGTSSPTGSPAPSPTASPGHDSAPS 265
Query: 372 GSATP-PARSGS-GQPRGGNP 390
+++P P+ + S GQ +P
Sbjct: 266 LTSSPAPSPTASPGQHGASSP 286
Score = 121 (47.7 bits), Expect = 0.00032, P = 0.00032
Identities = 59/236 (25%), Positives = 82/236 (34%)
Query: 165 PSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASK----GPGYDASKAPS 220
P +TT + P TS T T A PG++ + P + +P
Sbjct: 40 PVSTTQSSPTSSPTKETSWSTTTTLLTASSPAPSPAASPGHDGASTPTSSPAPSPAASPG 99
Query: 221 YD----PTKGPSYDPAKGPGYD----PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQ 272
+D PT P+ PA PG+D PT P +D P P+ P
Sbjct: 100 HDGASTPTSSPAPSPAASPGHDGASTPTSSPAPSPAASPGHDGASTPT----SSPAPSPA 155
Query: 273 RGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQG 332
G++ P P PG+D P + APS P PG++ G
Sbjct: 156 ASPGHNGTSSPTGSPAPSPAAS----PGHDGASTPT--SSPAPS--PAASPGHN---GTS 204
Query: 333 YDMRRAPSYDPSRGTGFDGA--PRGA-APHGQVPPPLNNV--PYGSATPPARSGSG 383
+P+ P+ G DGA P + AP P N P GS P + G
Sbjct: 205 -SPTGSPAPSPAASPGHDGASTPTSSPAPSPAASPGHNGTSSPTGSPAPSPTASPG 259
Score = 121 (47.7 bits), Expect = 0.00032, P = 0.00032
Identities = 55/234 (23%), Positives = 80/234 (34%)
Query: 164 PPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDP 223
P T + P +S + + + T + A P P AS PG+D + P+ P
Sbjct: 35 PRRTTPVSTTQSSPTSSPTKETSWSTTTTLLTASS-P-APSPAAS--PGHDGASTPTSSP 90
Query: 224 TKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGP 283
P+ P PT P +D P P+ P G+D P
Sbjct: 91 APSPAASPGHDGASTPTSSPAPSPAASPGHDGASTPT----SSPAPSPAASPGHDGASTP 146
Query: 284 NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDP 343
P PG++ P APS P PG+D +P+ P
Sbjct: 147 TSSPAPSPAAS----PGHNGTSSPT--GSPAPS--PAASPGHDGASTPTSSPAPSPAASP 198
Query: 344 SR-GTGFD-GAPR---GAAP-HGQVPPPLNNVPYGSATPPARSGSGQPRGGNPA 391
GT G+P A+P H P ++ A P +G+ P G +PA
Sbjct: 199 GHNGTSSPTGSPAPSPAASPGHDGASTPTSSPAPSPAASPGHNGTSSPTG-SPA 251
>CGD|CAL0000919 [details] [associations]
symbol:RPO21 species:5476 "Candida albicans" [GO:0005665
"DNA-directed RNA polymerase II, core complex" evidence=IEA]
[GO:0005829 "cytosol" evidence=IEA] [GO:0005739 "mitochondrion"
evidence=IEA] [GO:0030447 "filamentous growth" evidence=IMP]
[GO:0006366 "transcription from RNA polymerase II promoter"
evidence=IEA] [GO:0009267 "cellular response to starvation"
evidence=IMP] [GO:0036170 "filamentous growth of a population of
unicellular organisms in response to starvation" evidence=IMP]
[GO:0036180 "filamentous growth of a population of unicellular
organisms in response to biotic stimulus" evidence=IMP] [GO:0071216
"cellular response to biotic stimulus" evidence=IMP] [GO:0003899
"DNA-directed RNA polymerase activity" evidence=IEA] [GO:0003677
"DNA binding" evidence=IEA] [GO:0003968 "RNA-directed RNA
polymerase activity" evidence=IEA] InterPro:IPR000684
InterPro:IPR000722 InterPro:IPR006592 InterPro:IPR007066
InterPro:IPR007073 InterPro:IPR007075 InterPro:IPR007080
InterPro:IPR007081 InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983
Pfam:PF04990 Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000
Pfam:PF05001 PROSITE:PS00115 SMART:SM00663 CGD:CAL0000919
GO:GO:0071216 GO:GO:0036180 GO:GO:0003677 GO:GO:0006366
GO:GO:0009267 Gene3D:2.40.40.20 InterPro:IPR009010
EMBL:AACQ01000032 GO:GO:0036170 GO:GO:0003899 eggNOG:COG0086
GO:GO:0005665 KO:K03006 RefSeq:XP_719414.1 STRING:Q5ACI7
GeneID:3638991 KEGG:cal:CaO19.7655 Uniprot:Q5ACI7
Length = 1728
Score = 141 (54.7 bits), Expect = 7.8e-06, P = 7.8e-06
Identities = 72/234 (30%), Positives = 91/234 (38%)
Query: 116 LMNAPN---VDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGV 172
L AP+ +D ADG GGAT + E NA ++ + G G P
Sbjct: 1452 LQKAPSSMAMDDIADG--GGATPYKDYE--------NARDENIDIDAGAGFSPIHIAQMN 1501
Query: 173 VG-AGPNTSTSAYAATQSGT-PMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYD 230
G G TS + + T P Y+ PGY S G GY + +PSY PT PSY
Sbjct: 1502 EGNIGGLTSYGGQPTSPAATSPFSYGYNSITSPGY-TSPGYGYSPT-SPSYSPTS-PSYA 1558
Query: 231 PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 290
P P Y PT P Y A Y + P+Y P+Y P Y P+Y
Sbjct: 1559 PTS-PAYSPTS-PSY-APTSPAY-SPTSPSY-APTSPAYSPTSP-AYS-PTSPSYS-PTS 1610
Query: 291 PGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPS 344
P Y P Y P Y + +PSY P P Y +PSY P+
Sbjct: 1611 PQYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPAYS---------PTSPSYSPT 1651
>UNIPROTKB|Q5ACI7 [details] [associations]
symbol:RPO21 "DNA-directed RNA polymerase" species:237561
"Candida albicans SC5314" [GO:0009267 "cellular response to
starvation" evidence=IMP] [GO:0030447 "filamentous growth"
evidence=IMP] [GO:0036170 "filamentous growth of a population of
unicellular organisms in response to starvation" evidence=IMP]
[GO:0036180 "filamentous growth of a population of unicellular
organisms in response to biotic stimulus" evidence=IMP] [GO:0071216
"cellular response to biotic stimulus" evidence=IMP]
InterPro:IPR000684 InterPro:IPR000722 InterPro:IPR006592
InterPro:IPR007066 InterPro:IPR007073 InterPro:IPR007075
InterPro:IPR007080 InterPro:IPR007081 InterPro:IPR007083
Pfam:PF00623 Pfam:PF04983 Pfam:PF04990 Pfam:PF04992 Pfam:PF04997
Pfam:PF04998 Pfam:PF05000 Pfam:PF05001 PROSITE:PS00115
SMART:SM00663 CGD:CAL0000919 GO:GO:0071216 GO:GO:0036180
GO:GO:0003677 GO:GO:0006366 GO:GO:0009267 Gene3D:2.40.40.20
InterPro:IPR009010 EMBL:AACQ01000032 GO:GO:0036170 GO:GO:0003899
eggNOG:COG0086 GO:GO:0005665 KO:K03006 RefSeq:XP_719414.1
STRING:Q5ACI7 GeneID:3638991 KEGG:cal:CaO19.7655 Uniprot:Q5ACI7
Length = 1728
Score = 141 (54.7 bits), Expect = 7.8e-06, P = 7.8e-06
Identities = 72/234 (30%), Positives = 91/234 (38%)
Query: 116 LMNAPN---VDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGV 172
L AP+ +D ADG GGAT + E NA ++ + G G P
Sbjct: 1452 LQKAPSSMAMDDIADG--GGATPYKDYE--------NARDENIDIDAGAGFSPIHIAQMN 1501
Query: 173 VG-AGPNTSTSAYAATQSGT-PMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYD 230
G G TS + + T P Y+ PGY S G GY + +PSY PT PSY
Sbjct: 1502 EGNIGGLTSYGGQPTSPAATSPFSYGYNSITSPGY-TSPGYGYSPT-SPSYSPTS-PSYA 1558
Query: 231 PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 290
P P Y PT P Y A Y + P+Y P+Y P Y P+Y
Sbjct: 1559 PTS-PAYSPTS-PSY-APTSPAY-SPTSPSY-APTSPAYSPTSP-AYS-PTSPSYS-PTS 1610
Query: 291 PGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPS 344
P Y P Y P Y + +PSY P P Y +PSY P+
Sbjct: 1611 PQYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPAYS---------PTSPSYSPT 1651
>UNIPROTKB|F1P555 [details] [associations]
symbol:SFPQ "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0000166 "nucleotide binding" evidence=IEA] [GO:0003676
"nucleic acid binding" evidence=IEA] [GO:0000380 "alternative mRNA
splicing, via spliceosome" evidence=IEA] [GO:0016363 "nuclear
matrix" evidence=IEA] [GO:0042382 "paraspeckles" evidence=IEA]
InterPro:IPR000504 InterPro:IPR012677 Pfam:PF00076 PROSITE:PS50102
SMART:SM00360 GO:GO:0000166 Gene3D:3.30.70.330 GO:GO:0003676
GO:GO:0016363 GO:GO:0000380 GO:GO:0042382 InterPro:IPR012975
Pfam:PF08075 GeneTree:ENSGT00390000005004 OMA:APGGHPK
EMBL:AADN02043825 EMBL:AADN02043826 IPI:IPI00574618
Ensembl:ENSGALT00000003963 ArrayExpress:F1P555 Uniprot:F1P555
Length = 647
Score = 136 (52.9 bits), Expect = 8.2e-06, P = 8.2e-06
Identities = 62/219 (28%), Positives = 89/219 (40%)
Query: 124 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSA-------TTAGVVGAG 176
R G GG + + G +GQN G G PQG G PP A
Sbjct: 19 RGGGGGRGGPNHDFRSPPPGMGMGQNRGPMGGG-PQGPGGPPGGGPKSEPPKPPASTSAP 77
Query: 177 PNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDAS-KAPSYDPTKGPSYDPAKGP 235
P++S+S+ A T ++ P A + P A APS P+ GP P
Sbjct: 78 PSSSSSSSATTAGPAGSQSGPGAPPPSALPAGQPPQQQAQGSAPSSAPS-GPGGQQQPQP 136
Query: 236 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
P+ P +KG GP +GP PQ+G G + GP + + GPG E+
Sbjct: 137 KPSPSPTPAGGPKKGQGQSPGGGP-----KGPG-GPQQGPGGPHKGGPGH--RGGPGGES 188
Query: 296 QRVPGYDVQRGPVYEAQRAPSYIPQRGP-GYDLQRGQGY 333
+ G RG ++ Q++ S Q+GP G D +G+
Sbjct: 189 R---G----RGQQHQGQQSLSL--QQGPAGGDQLSDEGF 218
>UNIPROTKB|F1PHX8 [details] [associations]
symbol:COL5A1 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
"extracellular matrix structural constituent" evidence=IEA]
InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
SMART:SM00038 Gene3D:2.60.120.200 InterPro:IPR008985
InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160
Pfam:PF01391 InterPro:IPR001791 Pfam:PF02210 GO:GO:0005201
OMA:TIYEGIG SMART:SM00210 GeneTree:ENSGT00700000104155
EMBL:AAEX03006798 EMBL:AAEX03006799 EMBL:AAEX03006800
Ensembl:ENSCAFT00000031582 Uniprot:F1PHX8
Length = 1814
Score = 141 (54.7 bits), Expect = 8.3e-06, P = 8.3e-06
Identities = 77/254 (30%), Positives = 100/254 (39%)
Query: 145 PVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG-P 203
PVG + G P GP S G GA Q G P A +G P
Sbjct: 1043 PVGALGLKGSEGPPGPPGPAGSPGERGPAGAAGPIGIPGRPGPQ-GPPGPAG---EKGAP 1098
Query: 204 GYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRGPN 260
G + +GP G D + P P GP+ P PG D KG G QKGS D ++GP
Sbjct: 1099 GEKGPQGPAGRDGLQGPVGLP--GPA-GPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGPP 1155
Query: 261 YDIHRGPSYDPQRGLGYDMQRGPN-YDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIP 319
GP+ PQ +G Q GP+ D + GP + Q + G GP P +
Sbjct: 1156 -----GPT-GPQGPIG---QPGPSGADGEPGPRGQ-QGLFGQKGDEGP-RGFPGPPGPVG 1204
Query: 320 QRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNV-PYGSATPP 377
+G PG ++G+ D+ + P G GAP P G P + N G P
Sbjct: 1205 LQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGAPGADGPQGP-PGGIGNPGAVGEKGEP 1263
Query: 378 ARSGS-GQPRGGNP 390
+G G P G P
Sbjct: 1264 GEAGEPGLPGEGGP 1277
>MGI|MGI:2384582 [details] [associations]
symbol:Zfp768 "zinc finger protein 768" species:10090 "Mus
musculus" [GO:0003676 "nucleic acid binding" evidence=IEA]
[GO:0003677 "DNA binding" evidence=IEA] [GO:0005622 "intracellular"
evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0006351
"transcription, DNA-dependent" evidence=IEA] [GO:0006355
"regulation of transcription, DNA-dependent" evidence=IEA]
[GO:0008150 "biological_process" evidence=ND] [GO:0008270 "zinc ion
binding" evidence=IEA] [GO:0046872 "metal ion binding"
evidence=IEA] Pfam:PF00096 InterPro:IPR007087 InterPro:IPR013087
InterPro:IPR015880 PROSITE:PS00028 PROSITE:PS50157 SMART:SM00355
MGI:MGI:2384582 GO:GO:0005634 GO:GO:0006355 GO:GO:0046872
GO:GO:0003677 GO:GO:0008270 GO:GO:0006351 eggNOG:COG5048
Gene3D:3.30.160.60 HOGENOM:HOG000234617
GeneTree:ENSGT00700000104520 KO:K09228 HSSP:P17028
HOVERGEN:HBG105926 OMA:SRYESQN OrthoDB:EOG4CNQQT EMBL:AK155155
EMBL:BC026432 IPI:IPI00153270 RefSeq:NP_666314.1 UniGene:Mm.23031
ProteinModelPortal:Q8R0T2 SMR:Q8R0T2 IntAct:Q8R0T2 STRING:Q8R0T2
PhosphoSite:Q8R0T2 PRIDE:Q8R0T2 Ensembl:ENSMUST00000060783
GeneID:233890 KEGG:mmu:233890 UCSC:uc009jvc.1 CTD:233890
InParanoid:Q8R0T2 NextBio:381919 Bgee:Q8R0T2 CleanEx:MM_ZFP768
Genevestigator:Q8R0T2 Uniprot:Q8R0T2
Length = 568
Score = 135 (52.6 bits), Expect = 8.8e-06, P = 8.8e-06
Identities = 70/278 (25%), Positives = 107/278 (38%)
Query: 119 APNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPN 178
A N +G GN + E P G + PQ + G
Sbjct: 32 AGNTSENEEGEISQREGNGDYEVEEIPFGLEPQSPEFE-PQSPEFESQSPRFEPESPGFE 90
Query: 179 TSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYD 238
+ + + R+ P+ P +E S+ P Y+ ++P P + P +P P Y+
Sbjct: 91 SRSPGFVPPSPEFAPRSPESDPQSPEFE-SQSPKYEP-RSPGCHP-RSPGCEPGS-PRYE 146
Query: 239 PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ-RGPGYETQR 297
P K PGY + K +++Q P Y+ + P Y+PQ G +Q N + + P +ETQ
Sbjct: 147 P-KSPGYGS-KSPEFESQ-SPGYE-SQSPGYEPQNS-GDGVQ---NSEFKTHSPEFETQS 198
Query: 298 VPGYDVQRGPVYEAQRAPSYIPQRGPGYD-LQRGQGYDMRRA-PSYD-PSRGTGFDGAPR 354
+ P+ ++ P I D +G G A P +D PS GA
Sbjct: 199 SKFQEGAEMPLSPEEKNPLSISLGVHPLDSFTQGFGEQPTGALPPFDMPS------GALL 252
Query: 355 GAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 392
A + PLN G+ P R G G+ RGG R
Sbjct: 253 AAPQFEMLQNPLNLT--GTLRGPGRRG-GRARGGQGPR 287
>MGI|MGI:2157767 [details] [associations]
symbol:Krtap21-1 "keratin associated protein 21-1"
species:10090 "Mus musculus" [GO:0001942 "hair follicle
development" evidence=IMP] [GO:0003674 "molecular_function"
evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
[GO:0005882 "intermediate filament" evidence=IEA] [GO:0007165
"signal transduction" evidence=IMP] [GO:0008283 "cell
proliferation" evidence=IMP] [GO:0022405 "hair cycle process"
evidence=IMP] [GO:0031077 "post-embryonic camera-type eye
development" evidence=IMP] [GO:0042640 "anagen" evidence=IMP]
[GO:0043480 "pigment accumulation in tissues" evidence=IMP]
[GO:0043588 "skin development" evidence=IMP] [GO:0048589
"developmental growth" evidence=IMP] [GO:0051726 "regulation of
cell cycle" evidence=IMP] MGI:MGI:2157767 GO:GO:0007165
GO:GO:0043588 GO:GO:0008283 GO:GO:0005882 GO:GO:0051726
GO:GO:0042640 GO:GO:0031077 EMBL:AF345297 EMBL:AK003736
IPI:IPI00126890 UniGene:Mm.46109 HSSP:P10969 Genevestigator:Q925H4
GO:GO:0043480 Uniprot:Q925H4
Length = 128
Score = 111 (44.1 bits), Expect = 9.3e-06, P = 9.3e-06
Identities = 32/103 (31%), Positives = 32/103 (31%)
Query: 190 GTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQK 249
G R Y G GY G GY Y G Y G GY G GY
Sbjct: 14 GYGSRYGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGY 73
Query: 250 GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPG 292
GS Y G Y G Y G GY G Y G G
Sbjct: 74 GSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSRYGCGYGSG 116
Score = 103 (41.3 bits), Expect = 6.8e-05, P = 6.8e-05
Identities = 31/98 (31%), Positives = 33/98 (33%)
Query: 204 GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDI 263
GY G GY Y G Y G GY G GY GS Y G Y
Sbjct: 20 GYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGC 79
Query: 264 HRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGY 301
G Y G GY G Y G GY ++ GY
Sbjct: 80 GYGSGY----GCGYGSGYGCGYGSGYGCGYGSRYGCGY 113
Score = 93 (37.8 bits), Expect = 0.00082, P = 0.00082
Identities = 31/98 (31%), Positives = 32/98 (32%)
Query: 236 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
GY G GY GS Y G Y G Y G GY G Y G GY
Sbjct: 20 GYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGCGYGSGYGC 79
Query: 296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGY 333
GY G Y Y G GY + G GY
Sbjct: 80 GYGSGYGCGYGSGYGCGYGSGY----GCGYGSRYGCGY 113
>UNIPROTKB|F1N474 [details] [associations]
symbol:COL4A5 "Uncharacterized protein" species:9913 "Bos
taurus" [GO:0031594 "neuromuscular junction" evidence=IEA]
[GO:0007528 "neuromuscular junction development" evidence=IEA]
[GO:0005605 "basal lamina" evidence=IEA] [GO:0005587 "collagen type
IV" evidence=IEA] [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] InterPro:IPR001442 Pfam:PF01413
PROSITE:PS51403 SMART:SM00111 GO:GO:0007528 InterPro:IPR016187
SUPFAM:SSF56436 GO:GO:0031594 GO:GO:0005605 InterPro:IPR008160
Pfam:PF01391 GO:GO:0005201 GO:GO:0005587 Gene3D:2.170.240.10
GeneTree:ENSGT00690000101772 EMBL:DAAA02071513 EMBL:DAAA02071512
IPI:IPI00729819 Ensembl:ENSBTAT00000019400 OMA:MPMNMEP
Uniprot:F1N474
Length = 1688
Score = 140 (54.3 bits), Expect = 9.8e-06, P = 9.8e-06
Identities = 62/203 (30%), Positives = 76/203 (37%)
Query: 200 PRGPGYEASKGP--GYDASKAPSYDPTK-G-PSYDPAKG-PGYDPTKG-PGYDAQKGSNY 253
P PG GP G K +P K G P D G PG G PGY + G
Sbjct: 266 PGPPGIRGPPGPPGGVKGEKGEQGEPGKRGKPGKDGENGQPGIPGLPGDPGYPGEPGR-- 323
Query: 254 DAQRGPNYDIHR-GPS--YDPQRGLGYDMQRGPNYDMQRGPGYETQR-VPGYDVQRGPVY 309
D ++G D GP P+ G G + N + PG + R PG +Q P
Sbjct: 324 DGEKGQKGDTGLPGPPGLVIPRPGTGVTVGEKGNIGLPGLPGDKGDRGFPG--IQGPPGL 381
Query: 310 EAQRAPSYI-PQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNN 368
P+ I P PG+ +RGQ D P G DG P P G PP +
Sbjct: 382 PGPPGPAVIGPPGPPGFPGERGQKGD-EGPPGISIPGSPGLDGQPGAPGPPGPPGPPGPH 440
Query: 369 VPYGS----ATPPARSGSGQPRG 387
+P A PP GS RG
Sbjct: 441 IPPSDKICEAGPPGPPGSPGDRG 463
>FB|FBgn0003277 [details] [associations]
symbol:RpII215 "RNA polymerase II 215kD subunit" species:7227
"Drosophila melanogaster" [GO:0005665 "DNA-directed RNA polymerase
II, core complex" evidence=ISS;NAS;IDA] [GO:0005703 "polytene
chromosome puff" evidence=IDA] [GO:0006366 "transcription from RNA
polymerase II promoter" evidence=ISS;NAS] [GO:0003899 "DNA-directed
RNA polymerase activity" evidence=ISS;NAS] [GO:0005634 "nucleus"
evidence=IDA] [GO:0003677 "DNA binding" evidence=IEA] [GO:0007095
"mitotic G2 DNA damage checkpoint" evidence=IGI] [GO:0005700
"polytene chromosome" evidence=IDA] InterPro:IPR000684
InterPro:IPR000722 InterPro:IPR006592 InterPro:IPR007066
InterPro:IPR007073 InterPro:IPR007075 InterPro:IPR007080
InterPro:IPR007081 InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983
Pfam:PF04990 Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000
PROSITE:PS00115 SMART:SM00663 GO:GO:0007095 GO:GO:0046872
GO:GO:0003677 EMBL:AE014298 GO:GO:0006366 Gene3D:2.40.40.20
InterPro:IPR009010 GO:GO:0005703 GO:GO:0003899 eggNOG:COG0086
GO:GO:0005665 GeneTree:ENSGT00700000104490 OMA:KVLPWST KO:K03006
EMBL:M27431 EMBL:M14203 EMBL:M11798 EMBL:M19537 PIR:S04457
RefSeq:NP_511124.1 UniGene:Dm.2925 ProteinModelPortal:P04052
SMR:P04052 DIP:DIP-22282N IntAct:P04052 MINT:MINT-970158
STRING:P04052 PaxDb:P04052 EnsemblMetazoa:FBtr0073542 GeneID:32100
KEGG:dme:Dmel_CG1554 CTD:32100 FlyBase:FBgn0003277
InParanoid:P04052 OrthoDB:EOG4QRFJV PhylomeDB:P04052
GenomeRNAi:32100 NextBio:776837 Bgee:P04052 GermOnline:CG1554
Uniprot:P04052
Length = 1887
Score = 140 (54.3 bits), Expect = 1.1e-05, P = 1.1e-05
Identities = 65/240 (27%), Positives = 90/240 (37%)
Query: 119 APNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPS-ATTAGVVGAGP 177
+P A Y T N +++G + Y V + P ++ G+G
Sbjct: 1606 SPTSPLYASPRYASTTPNFNPQSTGYSPSSSGYSPTSPV---YSPTVQFQSSPSFAGSGS 1662
Query: 178 NTST--SAYAATQSG-TPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKG 234
N + +AY+ + S +P +Y P P Y S P Y + +P Y PT PSY P
Sbjct: 1663 NIYSPGNAYSPSSSNYSPNSPSYS-PTSPSYSPSS-PSYSPT-SPCYSPTS-PSYSPTS- 1717
Query: 235 PGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQR------GPNYDMQ 288
P Y P P Y + NY A P Y P+Y Q G+ Y P+YD
Sbjct: 1718 PNYTPVT-PSY-SPTSPNYSAS--PQYS-PASPAYS-QTGVKYSPTSPTYSPPSPSYDGS 1771
Query: 289 RGPGYETQRVPGYDVQRGPVYEAQRAPSYIP---QRGPGYDLQ-RGQGYDMRRAPSYDPS 344
G T P Y P Y + +P Y P Q P G Y +P Y P+
Sbjct: 1772 PGSPQYTPGSPQYS-PASPKY-SPTSPLYSPSSPQHSPSNQYSPTGSTYSAT-SPRYSPN 1828
>TAIR|locus:2035751 [details] [associations]
symbol:AT1G55170 "AT1G55170" species:3702 "Arabidopsis
thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
"nucleus" evidence=ISM] [GO:0008150 "biological_process"
evidence=ND] EMBL:CP002684 GenomeReviews:CT485782_GR EMBL:AC073944
EMBL:AY084916 EMBL:BT006117 EMBL:AK118721 IPI:IPI00529305
RefSeq:NP_564678.1 UniGene:At.37108 ProteinModelPortal:Q9C717
SMR:Q9C717 PaxDb:Q9C717 PRIDE:Q9C717 EnsemblPlants:AT1G55170.1
GeneID:841960 KEGG:ath:AT1G55170 TAIR:At1g55170 eggNOG:NOG306311
InParanoid:Q9C717 OMA:ELHRMNL PhylomeDB:Q9C717
ProtClustDB:CLSN2688822 ArrayExpress:Q9C717 Genevestigator:Q9C717
Uniprot:Q9C717
Length = 283
Score = 129 (50.5 bits), Expect = 1.1e-05, P = 1.1e-05
Identities = 35/78 (44%), Positives = 42/78 (53%)
Query: 78 RGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRRADGSYGGATGNS 137
R EYEKK + +E Q MEKN ++MA EVEKLRAEL VD R G +GG+ G +
Sbjct: 185 RDAIEYEKKEKFELMEQRQTMEKNMVSMAREVEKLRAELAT---VDSRPWG-FGGSYGMN 240
Query: 138 ENETSGRPVGQNAYEDGY 155
N G G D Y
Sbjct: 241 YNNMDGTFRGSYGENDTY 258
>UNIPROTKB|K7EKB2 [details] [associations]
symbol:TAF15 "TATA-binding protein-associated factor 2N"
species:9606 "Homo sapiens" [GO:0005622 "intracellular"
evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
InterPro:IPR001876 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50199
SMART:SM00547 EMBL:AC015849 HGNC:HGNC:11547 Ensembl:ENST00000585577
Uniprot:K7EKB2
Length = 214
Score = 125 (49.1 bits), Expect = 1.2e-05, P = 1.2e-05
Identities = 48/140 (34%), Positives = 52/140 (37%)
Query: 204 GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK-GPGYDAQK-GSNYDAQRGPNY 261
GY G G D G D + G GY + G GY + G Y RG Y
Sbjct: 69 GYRGRGGRGGDRGGYGGDRSGGGYGGDRSSGGGYSGDRSGGGYGGDRSGGGYGGDRGGGY 128
Query: 262 DIHRGPSYDPQRGLGY--DMQRGPNYDMQRG--PGYETQRVPGYDVQR-GPVYEAQRAPS 316
RG Y RG GY D RG Y RG GY R GY R G Y R
Sbjct: 129 GGDRGGGYGGDRGGGYGGDRSRG-GYGGDRGGGSGYGGDRSGGYGGDRSGGGYGGDRGGG 187
Query: 317 YIPQRGPGYDLQRGQGYDMR 336
Y RG GY + G D R
Sbjct: 188 YGGDRG-GYGGKMGGRNDYR 206
Score = 120 (47.3 bits), Expect = 5.0e-05, P = 5.0e-05
Identities = 47/155 (30%), Positives = 59/155 (38%)
Query: 136 NSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNTSTSAYAATQSGTPMR 194
N RP G + GYG +G+ G G G G + S Y +S
Sbjct: 45 NEPRPEDSRPSGGDFRGRGYGGERGYRGRGGRGGDRG--GYGGDRSGGGYGGDRSSG--- 99
Query: 195 AAYDIPR-GPGYEASK-GPGYDASKAPSYDPTKGPSYDPAKGPGY--DPTKGPGYDAQKG 250
Y R G GY + G GY + Y +G Y +G GY D ++G GY +G
Sbjct: 100 GGYSGDRSGGGYGGDRSGGGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRSRG-GYGGDRG 158
Query: 251 --SNYDAQRGPNYDIHR-GPSYDPQRGLGYDMQRG 282
S Y R Y R G Y RG GY RG
Sbjct: 159 GGSGYGGDRSGGYGGDRSGGGYGGDRGGGYGGDRG 193
Score = 120 (47.3 bits), Expect = 5.0e-05, P = 5.0e-05
Identities = 48/170 (28%), Positives = 62/170 (36%)
Query: 120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNT 179
P R + G + G E GR G+ GYG + G ++G G +
Sbjct: 49 PEDSRPSGGDFRGRGYGGERGYRGRG-GRGGDRGGYGGDRSGGGYGGDRSSGG-GYSGDR 106
Query: 180 STSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSY--DPTKGPSYDPAKGPGY 237
S Y +SG Y RG GY +G GY + Y D ++G G G
Sbjct: 107 SGGGYGGDRSG----GGYGGDRGGGYGGDRGGGYGGDRGGGYGGDRSRG-------GYGG 155
Query: 238 DPTKGPGYDAQKGSNYDAQR-GPNYDIHRGPSYDPQRGLGYDMQRGPNYD 286
D G GY + Y R G Y RG Y RG GY + G D
Sbjct: 156 DRGGGSGYGGDRSGGYGGDRSGGGYGGDRGGGYGGDRG-GYGGKMGGRND 204
>UNIPROTKB|F1RFI8 [details] [associations]
symbol:EWSR1 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0005634 "nucleus" evidence=IEA] [GO:0008270 "zinc ion
binding" evidence=IEA] [GO:0003676 "nucleic acid binding"
evidence=IEA] [GO:0000166 "nucleotide binding" evidence=IEA]
InterPro:IPR000504 InterPro:IPR001876 InterPro:IPR012677
Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102
PROSITE:PS50199 SMART:SM00360 SMART:SM00547 GO:GO:0005634
GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
GeneTree:ENSGT00530000063105 OMA:EGTSTGY EMBL:CU640468
EMBL:CT737304 Ensembl:ENSSSCT00000010930 Uniprot:F1RFI8
Length = 606
Score = 121 (47.7 bits), Expect = 1.3e-05, Sum P(2) = 1.3e-05
Identities = 54/178 (30%), Positives = 75/178 (42%)
Query: 128 GSYGGATGNSENETSGRPV-GQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 180
G+YG T S + GQ AY YG P G+ P P A + V G G +T+
Sbjct: 43 GTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYSTPTAPQAYSQPVQGYGTGAYDTT 102
Query: 181 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAP-SYDPTKGPSYDPAKGPGYD 238
T+ TQ+ ++AY P P Y + P A+ AP SY T+ SYD + +
Sbjct: 103 TATVTTTQASYAAQSAYGTQPAYPAY--GQQP---AATAPASYSSTQPTSYDQSSYSQQN 157
Query: 239 PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQ 296
P Q+ S+Y Q +Y SY PQ G Y + P+ Q+ Y Q
Sbjct: 158 TYGQPSSYGQQ-SSYGQQS--SYGQQPPTSYPPQTG-SYS--QAPSQYSQQSSSYGQQ 209
Score = 57 (25.1 bits), Expect = 1.3e-05, Sum P(2) = 1.3e-05
Identities = 19/46 (41%), Positives = 21/46 (45%)
Query: 354 RGAAP--HGQ-VPPPLNNVPYGSATP--PA--RSGSGQPRGGNPAR 392
RG P G+ +PPPL P G P P G G RGG P R
Sbjct: 404 RGGMPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPR 449
Score = 49 (22.3 bits), Expect = 8.5e-05, Sum P(2) = 8.5e-05
Identities = 25/86 (29%), Positives = 33/86 (38%)
Query: 311 AQRAPSYIPQRGPGYDLQRGQGYD--MRRAPSYDPSRGTGFDGAPRG-----AAPHGQVP 363
A++ P RG G + G+G +R P G G G P G G P
Sbjct: 394 ARKKPPMNSMRG-GMPPREGRGMPPPLRGGPG-----GPGGPGGPMGRMGGRGGDRGGFP 447
Query: 364 PPLNNVPYGSATPPARSGSGQPRGGN 389
P P GS P+ G+ Q R G+
Sbjct: 448 P---RGPRGSRGNPSGGGNVQHRAGD 470
>UNIPROTKB|E2RS29 [details] [associations]
symbol:E2RS29 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0003676 "nucleic acid binding" evidence=IEA]
[GO:0000166 "nucleotide binding" evidence=IEA] InterPro:IPR000504
InterPro:IPR012677 Pfam:PF00076 PROSITE:PS50102 SMART:SM00360
GO:GO:0000166 Gene3D:3.30.70.330 GO:GO:0003676
GeneTree:ENSGT00530000063105 EMBL:AAEX03026460
Ensembl:ENSCAFT00000019701 Uniprot:E2RS29
Length = 538
Score = 133 (51.9 bits), Expect = 1.3e-05, P = 1.3e-05
Identities = 80/314 (25%), Positives = 115/314 (36%)
Query: 99 EKNYITMATEVEKLRAELMNAPNVDRRADGSYGGATGNSENETSGRPV-GQNAYEDGYGV 157
++ Y T+ + A+ A +++ G+YG T S + GQ AY YG
Sbjct: 15 QQGYSAYTTQPTQGYAQTTQA--YGQQSYGTYGQPTDVSYTQAQTTATYGQTAYATSYGQ 72
Query: 158 PQ-GHGPP--PSATTAGVVG--AGP-NTSTSAYAATQSGTPMRAAYDI-PRGPGY---EA 207
P G+ P P A + V G G +T+T+ TQ+ ++AY P P Y A
Sbjct: 73 PPAGYTTPTAPQAYSQPVQGYSTGAYDTTTATVTTTQASYEAQSAYGTQPAYPAYGQQPA 132
Query: 208 SKGPG--YDASK-APSYDP--TKGPSYDPAKGPG---YDPTKGPG-YDAQKGSNYDAQRG 258
+ P D +K A + P + G P+ G G Y + PG Y Q + +
Sbjct: 133 ATAPARPQDGNKPAETSQPQSSTGGYNQPSLGYGQSNYSYPQVPGSYPMQPVTAPPSYPP 192
Query: 259 PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYI 318
+Y + SYD Q G G + Y Q Y Q G Y +APS
Sbjct: 193 TSYSSTQPTSYDQQNTYGQPSSYGQQSSYGQQSSYGQQLPTSYPPQTGS-YS--QAPSQY 249
Query: 319 PQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPA 378
Q+ Y Q D R+ GF + G P + +
Sbjct: 250 SQQSSSYGQQSSFQQDHPRSMGVYGQESGGFSRPGENRSMSGPDNPGRGRGGFDRGDM-S 308
Query: 379 RSGSGQPRGGNPAR 392
R G G RGG AR
Sbjct: 309 RGGRGGGRGGMGAR 322
>UNIPROTKB|F1RYI8 [details] [associations]
symbol:COL3A1 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0071230 "cellular response to amino acid stimulus"
evidence=IEA] [GO:0050777 "negative regulation of immune response"
evidence=IEA] [GO:0048565 "digestive tract development"
evidence=IEA] [GO:0048407 "platelet-derived growth factor binding"
evidence=IEA] [GO:0046332 "SMAD binding" evidence=IEA] [GO:0043588
"skin development" evidence=IEA] [GO:0043206 "extracellular fibril
organization" evidence=IEA] [GO:0042060 "wound healing"
evidence=IEA] [GO:0034097 "response to cytokine stimulus"
evidence=IEA] [GO:0032964 "collagen biosynthetic process"
evidence=IEA] [GO:0030199 "collagen fibril organization"
evidence=IEA] [GO:0018149 "peptide cross-linking" evidence=IEA]
[GO:0009314 "response to radiation" evidence=IEA] [GO:0007507
"heart development" evidence=IEA] [GO:0007229 "integrin-mediated
signaling pathway" evidence=IEA] [GO:0007179 "transforming growth
factor beta receptor signaling pathway" evidence=IEA] [GO:0007160
"cell-matrix adhesion" evidence=IEA] [GO:0005615 "extracellular
space" evidence=IEA] [GO:0005586 "collagen type III" evidence=IEA]
[GO:0005201 "extracellular matrix structural constituent"
evidence=IEA] [GO:0001568 "blood vessel development" evidence=IEA]
InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
SMART:SM00038 SMART:SM00214 GO:GO:0043588 GO:GO:0005615
GO:GO:0007507 GO:GO:0034097 GO:GO:0030199 GO:GO:0007179
GO:GO:0007229 GO:GO:0007160 InterPro:IPR008160 Pfam:PF01391
GO:GO:0042060 GO:GO:0001568 GO:GO:0048565 GO:GO:0050777
GO:GO:0009314 GO:GO:0018149 GO:GO:0032964 GO:GO:0071230
GO:GO:0043206 GO:GO:0005201 GeneTree:ENSGT00660000095287 KO:K06236
CTD:1281 OMA:EGSPGHP GO:GO:0005586 EMBL:CU467671
RefSeq:NP_001230226.1 UniGene:Ssc.24309 UniGene:Ssc.97562
Ensembl:ENSSSCT00000017459 GeneID:100152001 KEGG:ssc:100152001
Uniprot:F1RYI8
Length = 1466
Score = 138 (53.6 bits), Expect = 1.4e-05, P = 1.4e-05
Identities = 85/286 (29%), Positives = 105/286 (36%)
Query: 126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP----NTS 180
A G GG G + P G + + G P GPP AG G GP S
Sbjct: 166 AGGGIGGYPGPAGPPGPPGPPGVSGHPGAPGSPGYQGPPGEPGQAGPAGPPGPPGAIGPS 225
Query: 181 TSAYAATQSGTPMRAAY-DIPRGPGYEASKG-PGYDASKAP-SYDPTKGPSYDPAKGPGY 237
A +SG P R +P PG + G PG+ K +D G D PG
Sbjct: 226 GPAGKDGESGRPGRPGERGLPGPPGLKGPAGMPGFPGMKGHRGFDGRNGEKGDTG-APGL 284
Query: 238 DPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGP-GYE 294
G PG + G RG + R P G G D RG D Q GP G
Sbjct: 285 KGENGLPGENGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQPGPPGPP 339
Query: 295 -TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 353
T PG +G V A +P P PG QRG+ A + P G +G+P
Sbjct: 340 GTAGFPGSPGAKGEVGPAG-SPG--PSGSPG---QRGEPGPQGHAGAAGPPGPPGSNGSP 393
Query: 354 RGAAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RG--GNPAR 392
G G P + P G+ PP G+ G P RG G P +
Sbjct: 394 GGKGEMG--PAGIPGAPGLMGARGPPGPPGTNGAPGQRGAAGEPGK 437
>UNIPROTKB|F1NI73 [details] [associations]
symbol:COL3A1 "Collagen alpha-1(III) chain" species:9031
"Gallus gallus" [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
SMART:SM00038 SMART:SM00214 GO:GO:0005581 InterPro:IPR008160
Pfam:PF01391 GO:GO:0005201 GeneTree:ENSGT00660000095287
EMBL:AADN02034558 EMBL:AADN02034559 IPI:IPI01017330
Ensembl:ENSGALT00000004032 ArrayExpress:F1NI73 Uniprot:F1NI73
Length = 1260
Score = 137 (53.3 bits), Expect = 1.5e-05, P = 1.5e-05
Identities = 83/280 (29%), Positives = 109/280 (38%)
Query: 132 GATGN-SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP-NTSTSAYAATQ 188
GA G +N G P G+ G+P +G P AG G+ GP S A Q
Sbjct: 465 GANGEPGQNGVPGTP-GERGSPGFRGLPGSNGLPGEKGPAGERGSPGPPGPSGPAGDRGQ 523
Query: 189 SGTP----MRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP- 243
G P MR IP PG + GP + + P GP+ P PG GP
Sbjct: 524 DGGPGLPGMRGLPGIPGSPGSDGKPGPPGNQGE-PGRSGPPGPA-GPRGQPGVMGFPGPK 581
Query: 244 GYDAQKGSNYDAQRGPNYDIHR-GPS-YDPQRGL-GYDMQRGPNYDM-QRGPGYET--QR 297
G + G N +RGP GP+ + GL G GP D + GP Q
Sbjct: 582 GNEGAPGKN--GERGPGGPPGTPGPAGKNGDVGLPGPPGPAGPAGDRGEPGPSGSPGLQG 639
Query: 298 VPGYDVQRGPVYEAQRAPSYIPQR---GPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAP 353
+PG GP E + P+ GPG+ +G+ G R + P TG G P
Sbjct: 640 LPGGP---GPAGENGKPGEPGPKGDIGGPGFPGPKGENGIPGERG-AQGPPGPTGARGGP 695
Query: 354 RGAAPHG-QVPPPLNNVPYGSATPPARSGSGQPRG--GNP 390
A G + PP P G+ P + G+ RG G+P
Sbjct: 696 GPAGSEGAKGPPGPPGAPGGTGLPGLQGMPGE-RGASGSP 734
Score = 123 (48.4 bits), Expect = 0.00051, P = 0.00051
Identities = 84/275 (30%), Positives = 104/275 (37%)
Query: 142 SGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 201
+G P G G+P G P G+ G P TS + A G P +
Sbjct: 386 AGSP-GNKGEMGPSGIPGAPGLPGGR---GLPGP-PGTSGNPGAKGTPGEPGKNGAKGDP 440
Query: 202 GP-GYEASKG-PGYDASKAPSYDPTKGPSYDPAKG--PGYDPTKG-PGYDAQKGSN-YDA 255
GP G G PG A P + +G + +P + PG +G PG+ GSN
Sbjct: 441 GPKGERGENGTPG--APGPPGEEGKRGANGEPGQNGVPGTPGERGSPGFRGLPGSNGLPG 498
Query: 256 QRGPNYDIHR----GPSYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVPGYDVQRGPVYE 310
++GP + GPS P G D GP RG PG PG D + GP
Sbjct: 499 EKGPAGERGSPGPPGPS-GPAGDRGQD--GGPGLPGMRGLPGIPGS--PGSDGKPGPPGN 553
Query: 311 AQRAPSYIPQRGP-GYDLQRG-QGYDMRR----APSYDPSRGTGFD-GAPRGAAPHGQV- 362
Q P GP G Q G G+ + AP + RG G G P A +G V
Sbjct: 554 -QGEPGRSGPPGPAGPRGQPGVMGFPGPKGNEGAPGKNGERGPGGPPGTPGPAGKNGDVG 612
Query: 363 -P-PPLNNVPYGSATPPARSGS----GQPRGGNPA 391
P PP P G P SGS G P G PA
Sbjct: 613 LPGPPGPAGPAGDRGEPGPSGSPGLQGLPGGPGPA 647
Score = 122 (48.0 bits), Expect = 0.00065, P = 0.00065
Identities = 80/269 (29%), Positives = 105/269 (39%)
Query: 145 PVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAA-YDIPRG 202
P G N Y+ G P GP AG++G AGP + G P R IP
Sbjct: 190 PPGSNGYQGPPGEPGQPGPSGPPGPAGMIGPAGPPGKDG-----EPGRPGRNGDRGIPGL 244
Query: 203 PGYEASKG-PGYDASK-APSYDPTKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGP 259
PG++ G PG K A +D G D PG G PG + G RGP
Sbjct: 245 PGHKGHPGMPGMPGMKGARGFDGKDGAKGDSG-APGPKGEAGQPGANGSPGQ--PGPRGP 301
Query: 260 NYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP-------GYETQRVPGYDVQRGPVYEAQ 312
+ RG +P + Y + +GP G+ PG+ + GP A
Sbjct: 302 TGE--RGRPGNPGGPVTYRCDIVVFLSLFKGPPGPPGTAGFPGS--PGFKGEAGPPGPAG 357
Query: 313 RAPSYIP-QRG-PGYDLQRG----QGYDMRR-APSYDPSRG-TGFDGAPRGAAPHGQ-VP 363
+ S P +RG PG Q G QG R +P G +G GAP P G+ +P
Sbjct: 358 ASGS--PGERGEPGPQGQAGPPGPQGPPGRAGSPGNKGEMGPSGIPGAP--GLPGGRGLP 413
Query: 364 PPLNNVPYGSATPPARSGSGQPRGGNPAR 392
P P S P A+ G+P G N A+
Sbjct: 414 GP----PGTSGNPGAKGTPGEP-GKNGAK 437
>WB|WBGene00000628 [details] [associations]
symbol:col-51 species:6239 "Caenorhabditis elegans"
[GO:0042302 "structural constituent of cuticle" evidence=IEA]
[GO:0016021 "integral to membrane" evidence=IEA] InterPro:IPR002486
Pfam:PF01484 SMART:SM01088 GO:GO:0042302 HOGENOM:HOG000085656
GeneTree:ENSGT00530000064217 EMBL:FO080999 RefSeq:NP_491195.1
UniGene:Cel.29694 ProteinModelPortal:Q7Z152 MINT:MINT-3384184
STRING:Q7Z152 EnsemblMetazoa:T28F2.8 GeneID:189052
KEGG:cel:CELE_T28F2.8 UCSC:T28F2.8 CTD:189052 WormBase:T28F2.8
eggNOG:NOG245561 InParanoid:Q7Z152 OMA:MMASRRI NextBio:941036
Uniprot:Q7Z152
Length = 435
Score = 131 (51.2 bits), Expect = 1.6e-05, P = 1.6e-05
Identities = 90/299 (30%), Positives = 102/299 (34%)
Query: 110 EKLRAE-LMNAPNVDRRADGSYGG--ATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPP 165
EK+ E L A A G GG A G G G + G P G GPP
Sbjct: 84 EKVAFEGLFRAKRQYATAAGGGGGYAAGGGGGGGGGGGGGGCHCAAQASGCPAGPPGPPG 143
Query: 166 SATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRGP----GYEASKGP-GYDASKAP 219
A T G G AG + SG+ +A P GP G + + GP G P
Sbjct: 144 EAGTDGEPGQAGQDGQPGQAGQADSGSSGQACITCPAGPPGPPGPDGNAGPAGAPGVPGP 203
Query: 220 SYD----PTKGPSYDPAKGPGYDPTKG-PGYDAQKGS----NYDAQRGPNYDIHRGPSYD 270
D P GP P PG D G PG D Q G+ ++ GP GP
Sbjct: 204 DGDAGSPPPPGPPGPPGP-PGNDGQPGAPGQDGQPGAPGTNTVNSPGGPGPAGPPGPPGP 262
Query: 271 P-QRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQR 329
P Q G G Q GP PG PG D Q G P P GPG D
Sbjct: 263 PGQDGSGGAAQPGP-------PG--PPGPPGNDGQPG-------GPGQ-PG-GPGQD--G 302
Query: 330 GQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 388
G G D P P R P G G P Y + R+ SG GG
Sbjct: 303 GPGTDAAYCPC--PPR------TPAGGGGGGDFPAGGGGGGYSTGGGGGRADSGGAAGG 353
Score = 115 (45.5 bits), Expect = 0.00095, P = 0.00095
Identities = 76/270 (28%), Positives = 84/270 (31%)
Query: 132 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGT 191
G G + SG GQ G P GP +A AG G P A + G
Sbjct: 158 GQPGQAGQADSGSS-GQACITCPAGPPGPPGPDGNAGPAGAPGV-PGPDGDAGSPPPPGP 215
Query: 192 PMRAAYDIPRGPGYEASKG-PGYDASK-APSYDPTKGPSYDPAKGPG-YDPTKGPGYDAQ 248
P P PG + G PG D AP + P GPG P PG Q
Sbjct: 216 P-----GPPGPPGNDGQPGAPGQDGQPGAPGTNTVNSPG-----GPGPAGPPGPPGPPGQ 265
Query: 249 KGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQ------RVPGYD 302
GS AQ GP P D Q G G GP D GPG + R P
Sbjct: 266 DGSGGAAQPGPPGP-PGPPGNDGQPG-GPGQPGGPGQD--GGPGTDAAYCPCPPRTPAGG 321
Query: 303 VQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG----AAP 358
G Y G G G A Y G G GA G A
Sbjct: 322 GGGGDFPAGGGGGGYSTGGGGGRADSGGAAGGAGGAGGYSGGGGGGGGGAAAGGGYNAGG 381
Query: 359 HGQVPPPLNNVPYGSATPPARSGSGQPRGG 388
G P P + P A +G G GG
Sbjct: 382 GGGGAPQAAPAPQAAPAPAAPAGGGYNAGG 411
>UNIPROTKB|Q28009 [details] [associations]
symbol:FUS "RNA-binding protein FUS" species:9913 "Bos
taurus" [GO:0045944 "positive regulation of transcription from RNA
polymerase II promoter" evidence=ISS] [GO:0045893 "positive
regulation of transcription, DNA-dependent" evidence=ISS]
[GO:0005737 "cytoplasm" evidence=ISS] [GO:0005634 "nucleus"
evidence=ISS] [GO:0003723 "RNA binding" evidence=IEA] [GO:0003677
"DNA binding" evidence=IEA] [GO:0008270 "zinc ion binding"
evidence=IEA] [GO:0000166 "nucleotide binding" evidence=IEA]
InterPro:IPR000504 InterPro:IPR001876 InterPro:IPR012677
Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102
PROSITE:PS50199 SMART:SM00360 SMART:SM00547 GO:GO:0005634
GO:GO:0005737 GO:GO:0000166 GO:GO:0046872 GO:GO:0003677
GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0045944 GO:GO:0003723
eggNOG:NOG240581 GeneTree:ENSGT00530000063105 KO:K13098
HOGENOM:HOG000038010 CTD:2521 EMBL:U26024 EMBL:BC119965
IPI:IPI00705463 RefSeq:NP_776337.1 UniGene:Bt.2474
ProteinModelPortal:Q28009 STRING:Q28009 PRIDE:Q28009
Ensembl:ENSBTAT00000007571 GeneID:280796 KEGG:bta:280796
InParanoid:Q28009 OrthoDB:EOG4DV5NH NextBio:20804952 Uniprot:Q28009
Length = 513
Score = 132 (51.5 bits), Expect = 1.6e-05, P = 1.6e-05
Identities = 67/237 (28%), Positives = 93/237 (39%)
Query: 128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAAT 187
G+Y G ++ S +P GQ +Y GYG ++ +G G NT S +A
Sbjct: 15 GAYPTQPGQGYSQQSNQPYGQQSY-GGYGQSTDTSGYGQSSYSGSYGQTQNTGYSTQSAP 73
Query: 188 QSGTPMRAAYDIPRGP--GY-EASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPG 244
Q G Y + Y + S PGY APS T G ++ GY +G G
Sbjct: 74 Q-GYSSAGGYGSSQSSQSSYGQQSSYPGYGQQPAPS--GTSGSYGSSSQSSGYGQPQGGG 130
Query: 245 YDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPG--YETQRVPGYD 302
Y Q G Y Q+ +Y + SY+P +G G Q + G G Y +
Sbjct: 131 YGQQSG--YGGQQ-QSYGQQQ--SYNPPQGYGQQSQYNSSGGGGGGGGGSYGQDQPSMSS 185
Query: 303 VQRGPVYEAQ-RAPSY---IPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
G Y Q ++ Y RG G G GY+ R + Y+P RG G RG
Sbjct: 186 GGGGGGYGNQDQSGGYGGGQQDRG-GRGRGGGGGYN-RSSGGYEP-RGRGGGRGGRG 239
>ZFIN|ZDB-GENE-070912-607 [details] [associations]
symbol:col11a1b "collagen, type XI, alpha 1b"
species:7955 "Danio rerio" [GO:0005201 "extracellular matrix
structural constituent" evidence=IEA] [GO:0005581 "collagen"
evidence=IEA] InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078
PROSITE:PS51461 SMART:SM00038 ZFIN:ZDB-GENE-070912-607
Gene3D:2.60.120.200 InterPro:IPR008985 InterPro:IPR013320
SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
InterPro:IPR001791 SMART:SM00282 Pfam:PF02210 GO:GO:0005201
HOGENOM:HOG000085654 SMART:SM00210 GeneTree:ENSGT00700000104155
UniGene:Dr.3536 EMBL:BX510342 EMBL:BX547933 EMBL:CT583637
EMBL:GQ485665 IPI:IPI00511026 RefSeq:NP_001171883.1
UniGene:Dr.42128 Ensembl:ENSDART00000049589 GeneID:555202
KEGG:dre:555202 CTD:555202 NextBio:20880850 Uniprot:D6MUD3
Length = 1815
Score = 138 (53.6 bits), Expect = 1.8e-05, P = 1.8e-05
Identities = 71/250 (28%), Positives = 100/250 (40%)
Query: 156 GVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDA 215
G P HG P G G + T P RA + +GP A + A
Sbjct: 469 GSPGLHGDPGERGPPGRPGLPGGDGAPGPSGTILMLPFRAGGESSKGPVVSAQEAQA-QA 527
Query: 216 SKAPSYDPTKGPSYDPAKGPGYD-PTKGPGYDAQKGSNYDA-QRGPNYDIHRGPSYDP-- 271
A + +GP P G P GPG KG + D+ +GP +GP+ P
Sbjct: 528 ILAQARLTMRGPP-GPMGLTGRSGPVGGPGAPGAKGESGDSGPQGPRG--LQGPTGSPGK 584
Query: 272 --QRGL-GYDMQRG-PNYDMQRGP-GYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYD 326
+RG G D RG P +G G++ +PG ++G + ++ P +P PG D
Sbjct: 585 PGKRGRNGADGARGIPGESGAKGDRGFDG--LPGLPGEKG--HRGEQGPIGLPG-SPGED 639
Query: 327 LQRGQGYDM--RRAPSYDPSRGT-GFDGAPRGAAPHGQV----PP-PLNNV-PYGSATPP 377
RG+ ++ R P RG G G+P A G PP P N+ P G PP
Sbjct: 640 GPRGEDGEIGQRGMPGESGPRGLLGPRGSPGTAGQRGLTGLDGPPGPKGNMGPQGEPGPP 699
Query: 378 ARSGSGQPRG 387
+ G+ P G
Sbjct: 700 GQQGNTGPHG 709
>WB|WBGene00000251 [details] [associations]
symbol:bli-1 species:6239 "Caenorhabditis elegans"
[GO:0042302 "structural constituent of cuticle" evidence=IEA]
[GO:0016021 "integral to membrane" evidence=IEA] [GO:0009792
"embryo development ending in birth or egg hatching" evidence=IMP]
[GO:0000003 "reproduction" evidence=IMP] [GO:0040002 "collagen and
cuticulin-based cuticle development" evidence=IMP] [GO:0040011
"locomotion" evidence=IMP] [GO:0002119 "nematode larval
development" evidence=IMP] [GO:0018996 "molting cycle, collagen and
cuticulin-based cuticle" evidence=IMP] [GO:0005578 "proteinaceous
extracellular matrix" evidence=ISS] [GO:0042329 "structural
constituent of collagen and cuticulin-based cuticle" evidence=ISS]
InterPro:IPR002486 InterPro:IPR012613 Pfam:PF01484 Pfam:PF08175
SMART:SM01088 GO:GO:0009792 GO:GO:0002119 GO:GO:0018996
GO:GO:0005578 GO:GO:0040011 GO:GO:0000003 GO:GO:0005581
InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793 GO:GO:0040002
EMBL:Z46791 PIR:T19140 RefSeq:NP_496311.2 ProteinModelPortal:Q09457
STRING:Q09457 PaxDb:Q09457 EnsemblMetazoa:C09G5.6 GeneID:174653
KEGG:cel:CELE_C09G5.6 UCSC:C09G5.6 CTD:174653 WormBase:C09G5.6
GeneTree:ENSGT00690000102663 HOGENOM:HOG000016778 InParanoid:Q09457
OMA:WEEHRKS NextBio:884926 GO:GO:0042601 GO:GO:0042329
GO:GO:0030436 Uniprot:Q09457
Length = 948
Score = 135 (52.6 bits), Expect = 1.8e-05, P = 1.8e-05
Identities = 89/338 (26%), Positives = 120/338 (35%)
Query: 87 FYNDHLESLQVMEK--NYITMATEVEKLRAELMNAPNVDRRA-----DGSYGGATGNSEN 139
FY++ E L + N I E E+ A + DR +G Y T
Sbjct: 36 FYSEAQEELVEFKDIANNIWEEMVFELTPEEMREAEDNDREKRSYEPEGPYQSETTTPST 95
Query: 140 ETSGRPVGQNAYED--GYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAY 197
TS A ED GY +GPP S P T A + T + Y
Sbjct: 96 TTSTAATTTEAAEDESGYDFVNDNGPPSSRPRKPEPPTMPRTIQGFRAPPPAAT---STY 152
Query: 198 DIPRGPGYEASKGPGYDASKAPSYDPTKGPSYD-PAKGPG-----YDPTKGP--GYDAQK 249
P G Y+ + G +S+ P Y P + PS P P Y+P P GY
Sbjct: 153 RPPHGSNYD-NYGREPASSRRP-YPPQQPPSTSAPHSSPNNRTSLYNPQPPPKTGYPTNP 210
Query: 250 GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP--GYET--QRVPG----Y 301
Y+ + PNY R P+Y Y R PN R P GY++ Q P Y
Sbjct: 211 RVPYNPPQ-PNYT--RQPTYPEDNRAPYKPTRSPNTPPPRQPSGGYDSDGQTPPSSPRIY 267
Query: 302 DVQR----GPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAA 357
+ +R GP Y + P+ P PG QR R P+ +R P
Sbjct: 268 NTRRPNNHGPGYPEDQVPTAPPV--PGQ--QRVPPTQTRNPPNPTNTRQPSRPVPPTSDG 323
Query: 358 PHGQVPPPLN-NVPYGSATPPARSGSG--QPRGGNPAR 392
H + P N + Y + + G G +PR G R
Sbjct: 324 -HIEATTPYNPSAQYPTGKRGSHPGFGPQRPRPGTRPR 360
Score = 131 (51.2 bits), Expect = 4.8e-05, P = 4.8e-05
Identities = 76/266 (28%), Positives = 102/266 (38%)
Query: 145 PVGQNAYEDGYGVPQGHG----PPPSATTAGVVGAGPNTSTSAY---AATQSGTPM--RA 195
P G N Y D YG PP + + PN TS Y ++G P R
Sbjct: 155 PHGSN-Y-DNYGREPASSRRPYPPQQPPSTSAPHSSPNNRTSLYNPQPPPKTGYPTNPRV 212
Query: 196 AYDIPRGPGYEASKGPGY-DASKAPSYDPTKGPSYDPAKGP--GYD-----PTKGPG-YD 246
Y+ P+ P Y ++ P Y + ++AP Y PT+ P+ P + P GYD P P Y+
Sbjct: 213 PYNPPQ-PNY--TRQPTYPEDNRAP-YKPTRSPNTPPPRQPSGGYDSDGQTPPSSPRIYN 268
Query: 247 AQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRG 306
++ +N+ GP Y + P+ P G QR P + P R P V
Sbjct: 269 TRRPNNH----GPGYPEDQVPTAPPVPG----QQRVPPTQTRNPPNPTNTRQPSRPVPPT 320
Query: 307 PVYEAQRAPSYIPQRGPGYDL-QRGQ--GYDMRRA-PSYDPSRGTGFDGAPRGAAP-HGQ 361
+ Y P Y +RG G+ +R P P RG D A P H
Sbjct: 321 SDGHIEATTPYNPSAQ--YPTGKRGSHPGFGPQRPRPGTRP-RGNPCDQC--SAQPNHCP 375
Query: 362 VPPPLNNVPYGSATPPARSGSGQPRG 387
PP P G PP G PRG
Sbjct: 376 SGPP---GPRGRPGPPGFPGQDGPRG 398
Score = 130 (50.8 bits), Expect = 6.2e-05, P = 6.2e-05
Identities = 76/265 (28%), Positives = 97/265 (36%)
Query: 120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPN 178
P +R DG+ G G + GQ+ G P HG S T G G G N
Sbjct: 427 PPGERGPDGT-PGVPGEDGIDGEQGVNGQDGQPGAPGAPGYHGMNGSPGTPGKPGLPGRN 485
Query: 179 TSTSAYAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKA----PSYDPTKGPSYDPA- 232
+ G P +P G + G G D S P D T GP P
Sbjct: 486 GQSCKSIPGPPGQP--GVMGVPGRDGDPGTDGEHGQDGSPGIQGPPGRDGTSGPDGQPGV 543
Query: 233 KGPGYDPTKGPGYDA--QKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 290
PG T G GY ++ S +D N D RG + R GYD +R +
Sbjct: 544 SAPGAPGTDG-GYCPCPKRSSKFDFNDAYNDDEKRG--LEEHRPRGYDSERAE----EPR 596
Query: 291 PGYETQRVPGYDVQRGPVYEAQRAPSY------IPQRGPGY-DLQRGQGYDMRRAPSYDP 343
P +T R YD G E QR P+Y P R Y D +R + +R P P
Sbjct: 597 PR-QTVRTNTYDENSGA--EHQRRPNYEPSAEVAPPRQDRYEDEERVREPPPKRPPP--P 651
Query: 344 SRGTGFDGAPRGAAPHGQVPPPLNN 368
R T + P P+ + PPP N
Sbjct: 652 HRQTPHELYPE-EQPYVRRPPPPQN 675
Score = 122 (48.0 bits), Expect = 0.00046, P = 0.00046
Identities = 71/243 (29%), Positives = 88/243 (36%)
Query: 163 PPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYD 222
P P + + P ++ Y + G+ PR PG P S P++
Sbjct: 316 PVPPTSDGHIEATTPYNPSAQYPTGKRGSHPGFGPQRPR-PGTRPRGNPCDQCSAQPNHC 374
Query: 223 PTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPN--YDIHRGPSYDPQRG-----L 275
P+ GP P PG P PG D +G RG N Y + SYDP G +
Sbjct: 375 PS-GPP-GPRGRPG--PPGFPGQDGPRGL-----RGLNGGYSGVQPSSYDPVIGCVQCPI 425
Query: 276 GYDMQRGPNYDMQRG-PGYE----TQRVPGYDVQRG----PVYEAQRAPSYIPQRGPGYD 326
G +RGP D G PG + Q V G D Q G P Y P + PG
Sbjct: 426 GPPGERGP--DGTPGVPGEDGIDGEQGVNGQDGQPGAPGAPGYHGMNGSPGTPGK-PGLP 482
Query: 327 LQRGQGYDMRRAPSYDPS-RGT-GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQ 384
+ GQ P P G G DG P HGQ P G PP R G+
Sbjct: 483 GRNGQSCKSIPGPPGQPGVMGVPGRDGDPGTDGEHGQDGSP------GIQGPPGRDGTSG 536
Query: 385 PRG 387
P G
Sbjct: 537 PDG 539
>UNIPROTKB|J9P0L0 [details] [associations]
symbol:COL3A1 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
"extracellular matrix structural constituent" evidence=IEA]
InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
SMART:SM00038 SMART:SM00214 GO:GO:0005581 InterPro:IPR008160
Pfam:PF01391 GO:GO:0005201 GeneTree:ENSGT00660000095287 KO:K06236
CTD:1281 EMBL:AAEX03017880 RefSeq:XP_851009.1
Ensembl:ENSCAFT00000047312 GeneID:478835 KEGG:cfa:478835
Uniprot:J9P0L0
Length = 1465
Score = 137 (53.3 bits), Expect = 1.8e-05, P = 1.8e-05
Identities = 83/284 (29%), Positives = 105/284 (36%)
Query: 127 DGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYA 185
+G G E+ + G P G+ D G P GPP +A G G AGP
Sbjct: 653 NGKPGEPGPKGESGSPGVPGGKG---DS-GAPGERGPPGAAGPMGPRGGAGPPGPEGGKG 708
Query: 186 AT-------QSGTP----MRAAYDIPRGPGYEASKG-PGY-DASKAPSYDPTKGPSYDPA 232
A +GTP M P GPG + KG PG A AP D +GP+ P
Sbjct: 709 AAGPPGPPGSAGTPGLQGMPGERGGPGGPGPKGDKGEPGSAGADGAPGKDGPRGPT-GPI 767
Query: 233 KGPGYDPTKGPGYDAQKGS-NYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP 291
PG P PG + G+ GP + P G+ G N + P
Sbjct: 768 GPPG--PAGQPGDKGEGGAPGLPGIAGPRGGPGERGEHGPPGPAGFPGAPGQNGE----P 821
Query: 292 GYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFD 350
G + +R PG + GP A P PG +G+ R +P G G
Sbjct: 822 GAKGERGAPGEKGEGGPPGVAGPPGGAGPAGPPGPQGVKGE----RGSPG-----GPGAA 872
Query: 351 GAPRGAAPHGQVPPPLNNV---PYGSATPPARSGSGQPRGGNPA 391
G P G G PP NN P GS+ P + G P G N A
Sbjct: 873 GFPGGRGLPG---PPGNNGNPGPPGSSGAPGKDGPPGPPGNNGA 913
Score = 132 (51.5 bits), Expect = 6.3e-05, P = 6.3e-05
Identities = 83/280 (29%), Positives = 101/280 (36%)
Query: 126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP----NTS 180
A G GG G + P G + + G P GPP AG G GP S
Sbjct: 165 AGGGIGGYPGPAGPPGPPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPAGPPGPPGAMGPS 224
Query: 181 TSAYAATQSGTPMRAAY-DIPRGPGYEASKG-PGYDASKAP-SYDPTKGPSYDPAKGPGY 237
A +SG P R +P PG + G PG+ K +D G D PG
Sbjct: 225 GPAGKDGESGRPGRPGERGLPGPPGMKGPAGMPGFPGMKGHRGFDGRNGEKGDTG-APGL 283
Query: 238 DPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGP-GYE 294
G PG + G RG + R P G G D RG D Q GP G
Sbjct: 284 KGENGLPGENGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQPGPPGPP 338
Query: 295 -TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 353
T PG +G V A S PG QRG+ A + P G +G+P
Sbjct: 339 GTAGFPGSPGAKGEVGPAGSPGS---NGSPG---QRGEPGPQGHAGAPGPPGPPGSNGSP 392
Query: 354 RGAAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RGG 388
G G P + P G+ PP G+ G P RGG
Sbjct: 393 GGKGEMG--PAGIPGAPGLIGARGPPGPPGTNGAPGQRGG 430
Score = 123 (48.4 bits), Expect = 0.00060, P = 0.00060
Identities = 78/261 (29%), Positives = 98/261 (37%)
Query: 147 GQNAYEDGYGVPQGHGPPPSATTAGVVGA----GPNTSTSAYAAT-QSGTPMRAAY-DIP 200
G + G P GPP +A G GA GP S + + Q G P + P
Sbjct: 321 GNDGARGSDGQPGPPGPPGTAGFPGSPGAKGEVGPAGSPGSNGSPGQRGEPGPQGHAGAP 380
Query: 201 RGPGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDP-TKG-PGYDAQKGS-NYDAQ 256
PG S G PG P+ P P A+GP P T G PG G +
Sbjct: 381 GPPGPPGSNGSPGGKGEMGPAGIPG-APGLIGARGPPGPPGTNGAPGQRGGAGEPGKNGA 439
Query: 257 RG-PNYDIHRGPSYDPQ-RG-LGYDMQRG-PNYDMQRG-PGYETQR-VPGYDVQRGPV-- 308
+G P RG + P G G D + G P G PG +R PG+ RGP
Sbjct: 440 KGEPGPRGERGEAGSPGIPGPKGEDGKDGSPGEPGANGLPGAAGERGAPGF---RGPAGA 496
Query: 309 --YEAQRAPSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPP 365
++ P+ + GPG RG G R P G G+P G G+ PP
Sbjct: 497 NGLPGEKGPAG-ERGGPGPAGPRGAPGEPGRDGVPGGPGM-RGMPGSPGGPGSDGKPGPP 554
Query: 366 LNNVPYGSATPPARSGS-GQP 385
+ G PP SG GQP
Sbjct: 555 GSQGESGRPGPPGPSGPRGQP 575
>UNIPROTKB|F1N7Q7 [details] [associations]
symbol:COL4A2 "Collagen alpha-2(IV) chain" species:9913
"Bos taurus" [GO:0071560 "cellular response to transforming growth
factor beta stimulus" evidence=IEA] [GO:0016525 "negative
regulation of angiogenesis" evidence=IEA] [GO:0006351
"transcription, DNA-dependent" evidence=IEA] [GO:0005587 "collagen
type IV" evidence=IEA] [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] InterPro:IPR001442 Pfam:PF01413
PROSITE:PS51403 SMART:SM00111 GO:GO:0071560 InterPro:IPR016187
SUPFAM:SSF56436 GO:GO:0006351 InterPro:IPR008160 Pfam:PF01391
GO:GO:0016525 GO:GO:0005201 GO:GO:0005587 Gene3D:2.170.240.10
GeneTree:ENSGT00690000101772 EMBL:DAAA02034911 IPI:IPI00712524
Ensembl:ENSBTAT00000005916 OMA:QETIQPG Uniprot:F1N7Q7
Length = 1650
Score = 137 (53.3 bits), Expect = 2.0e-05, P = 2.0e-05
Identities = 75/251 (29%), Positives = 98/251 (39%)
Query: 116 LMNAPNVD-RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVV 173
L P + R+ D GA G + + P G + + G+P GH G G
Sbjct: 18 LQGFPGLQGRKGDKGQRGAPGITGPKGDVGPRGVSGFPGADGIP-GHPGQGGPRGPPGYD 76
Query: 174 GAGPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGPGYDASKAPSYDPTKGPSYDPA 232
G S YA G P + PRGP G + KG Y A + D +G +P
Sbjct: 77 GCNGTVGDSGYA----GPPGPGGFLGPRGPQGPKGQKGEPY-ALSSEDRDKYRGEPGEPG 131
Query: 233 KGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDP-QRGLGYDMQRGPNYDMQ-RG 290
P PG Q G A P GP P RGLG+ ++G DM +G
Sbjct: 132 LVGLQGPPGRPGPVGQMGP-VGAPGRPGPPGPPGPKGQPGNRGLGFYGEKGEKGDMGLQG 190
Query: 291 PGYETQRVP---GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG-QGYDMRRAPSYDPSRG 346
PG +P GY + PVYE +P++ G ++G QG R S G
Sbjct: 191 PG----GIPPDNGYVEKPTPVYEL------LPEQYKG---EKGSQGEPGRIGVSLKGEEG 237
Query: 347 T-GFDGAPRGA 356
GF G PRGA
Sbjct: 238 VVGFSG-PRGA 247
>UNIPROTKB|F1LRJ1 [details] [associations]
symbol:Col4a3 "Protein Col4a3" species:10116 "Rattus
norvegicus" [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
RGD:71085 GO:GO:0006917 GO:GO:0008283 InterPro:IPR016187
SUPFAM:SSF56436 GO:GO:0006919 GO:GO:0007166 InterPro:IPR008160
Pfam:PF01391 GO:GO:0016525 GO:GO:0005201 GO:GO:0005587
Gene3D:2.170.240.10 GeneTree:ENSGT00690000101772 KO:K06237 CTD:1285
GO:GO:0032836 IPI:IPI00367109 RefSeq:NP_001129231.1
UniGene:Rn.121139 Ensembl:ENSRNOT00000020669 GeneID:363265
KEGG:rno:363265 NextBio:683046 ArrayExpress:F1LRJ1 Uniprot:F1LRJ1
Length = 1670
Score = 137 (53.3 bits), Expect = 2.1e-05, P = 2.1e-05
Identities = 93/289 (32%), Positives = 106/289 (36%)
Query: 127 DGSYGGATGNSENETSGRPV--GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAY 184
DGS GG G P G+ + G P GPP A AG G GP
Sbjct: 568 DGSPGGPGAKGPRGPRGEPALSGRKGDQGPPGAPGSPGPPGPAGPAGPPGYGPQGEPGPK 627
Query: 185 AATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAK-GP-GYDPTKG 242
A G P A GP EA G ++ P P GP P + GP G G
Sbjct: 628 GA--QGVP--GAL----GPPGEAGL-KGESSASIPVLGPP-GPPGPPGQAGPRGLPGLPG 677
Query: 243 PGYDAQKGS-NYDAQRG-PNYDIH--RGPSYDPQRGLGYDMQRG-PNYDMQRGPGYETQR 297
P G D + G P RGP D G+ G P Y PG ET R
Sbjct: 678 PVGTCDPGHPGPDGEPGIPEVGFPGARGPKGDQ----GFPGTIGLPGY-----PG-ETGR 727
Query: 298 VPGYDVQRGPVYEAQRAPSY-IP-QRG-PGYDLQRGQGYDMRRA--PSYDPSRGT----G 348
PGY + G V A+ PS P + G PG+ +RG + P GT G
Sbjct: 728 -PGYPGEMG-VPGAKGEPSVGRPGEPGKPGFPGERGNSGENGDIGLPGLPGPPGTPGKDG 785
Query: 349 FDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQP--RG--GNPARR 393
FDG P P GQ PP P G P R G P G G P RR
Sbjct: 786 FDGPP--GDP-GQSGPPGAKGPPGRCIPGPRGTQGLPGLNGLKGQPGRR 831
>UNIPROTKB|J9NW09 [details] [associations]
symbol:POLR2A "DNA-directed RNA polymerase" species:9615
"Canis lupus familiaris" [GO:0003899 "DNA-directed RNA polymerase
activity" evidence=IEA] [GO:0006366 "transcription from RNA
polymerase II promoter" evidence=IEA] [GO:0005665 "DNA-directed RNA
polymerase II, core complex" evidence=IEA] [GO:0003677 "DNA
binding" evidence=IEA] InterPro:IPR000684 InterPro:IPR000722
InterPro:IPR006592 InterPro:IPR007066 InterPro:IPR007073
InterPro:IPR007075 InterPro:IPR007080 InterPro:IPR007081
InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983 Pfam:PF04990
Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000 Pfam:PF05001
PROSITE:PS00115 SMART:SM00663 GO:GO:0003677 GO:GO:0006366
Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899 GO:GO:0005665
GeneTree:ENSGT00700000104490 EMBL:AAEX03003616 EMBL:AAEX03003617
Ensembl:ENSCAFT00000050029 Uniprot:J9NW09
Length = 1789
Score = 137 (53.3 bits), Expect = 2.2e-05, P = 2.2e-05
Identities = 71/265 (26%), Positives = 95/265 (35%)
Query: 118 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP--SATTAGVVGA 175
N P + A G G G++ + G + G G P S T G G
Sbjct: 1489 NIPGLG--AAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGF 1546
Query: 176 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP 235
P+ ++ A + +P A P PG PG + PS PSY P P
Sbjct: 1547 SPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGAMSPSYSPTS-P 1598
Query: 236 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
Y+P GY Q S Y + P+Y PSY P Y P+Y P Y
Sbjct: 1599 AYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPSYS-PTSPSYSP 1652
Query: 296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
P Y P Y + +PSY P P Y Y +PSY P+ + +P
Sbjct: 1653 TS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPTSPSYSPTSP-S 1705
Query: 356 AAPHGQVPPPLNNVPYGSATPPARS 380
+P P + P S T P+ S
Sbjct: 1706 YSPTSPSYSPTS--PSYSPTSPSYS 1728
>MGI|MGI:88453 [details] [associations]
symbol:Col3a1 "collagen, type III, alpha 1" species:10090 "Mus
musculus" [GO:0001568 "blood vessel development" evidence=IMP]
[GO:0005178 "integrin binding" evidence=ISO] [GO:0005201
"extracellular matrix structural constituent" evidence=ISO]
[GO:0005576 "extracellular region" evidence=IEA] [GO:0005578
"proteinaceous extracellular matrix" evidence=IEA] [GO:0005581
"collagen" evidence=IDA] [GO:0005586 "collagen type III"
evidence=ISO;IDA] [GO:0005615 "extracellular space" evidence=ISO]
[GO:0007160 "cell-matrix adhesion" evidence=ISO] [GO:0007179
"transforming growth factor beta receptor signaling pathway"
evidence=ISO] [GO:0007229 "integrin-mediated signaling pathway"
evidence=ISO] [GO:0007507 "heart development" evidence=ISO]
[GO:0009314 "response to radiation" evidence=ISO] [GO:0018149
"peptide cross-linking" evidence=ISO] [GO:0030199 "collagen fibril
organization" evidence=ISO;IMP] [GO:0031012 "extracellular matrix"
evidence=ISO;IDA] [GO:0032964 "collagen biosynthetic process"
evidence=ISO] [GO:0034097 "response to cytokine stimulus"
evidence=ISO] [GO:0042060 "wound healing" evidence=ISO] [GO:0043206
"extracellular fibril organization" evidence=ISO] [GO:0043588 "skin
development" evidence=ISO] [GO:0046332 "SMAD binding" evidence=IPI]
[GO:0046872 "metal ion binding" evidence=IEA] [GO:0048407
"platelet-derived growth factor binding" evidence=ISO] [GO:0048565
"digestive tract development" evidence=IMP] [GO:0050777 "negative
regulation of immune response" evidence=ISO] [GO:0071230 "cellular
response to amino acid stimulus" evidence=IDA] InterPro:IPR000885
InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410 ProDom:PD002078
PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038
SMART:SM00214 MGI:MGI:88453 GO:GO:0043588 GO:GO:0005615
GO:GO:0007507 GO:GO:0046872 GO:GO:0034097 GO:GO:0030199
GO:GO:0001501 GO:GO:0007179 GO:GO:0007229 GO:GO:0007160
InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793 GO:GO:0042060
GO:GO:0001568 GO:GO:0048565 GO:GO:0050777 GO:GO:0009314
GO:GO:0018149 GO:GO:0032964 GO:GO:0071230 GO:GO:0043206
GO:GO:0005201 HOVERGEN:HBG004933 KO:K06236 OrthoDB:EOG4FTW1C
CTD:1281 OMA:EGSPGHP ChiTaRS:COL3A1 GO:GO:0005586 EMBL:X52046
EMBL:BC043089 EMBL:BC058724 EMBL:M18933 EMBL:K03037 EMBL:AK019448
EMBL:X57983 IPI:IPI00129571 PIR:A27353 PIR:S59856
RefSeq:NP_034060.2 UniGene:Mm.249555 ProteinModelPortal:P08121
SMR:P08121 STRING:P08121 PhosphoSite:P08121 PaxDb:P08121
PRIDE:P08121 Ensembl:ENSMUST00000087883 GeneID:12825 KEGG:mmu:12825
InParanoid:P08121 NextBio:282310 Bgee:P08121 CleanEx:MM_COL3A1
Genevestigator:P08121 Uniprot:P08121
Length = 1464
Score = 136 (52.9 bits), Expect = 2.3e-05, P = 2.3e-05
Identities = 86/285 (30%), Positives = 101/285 (35%)
Query: 120 PNVDRRADGSYGGATGNSENETSGR--PVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGP 177
P + DGS G N +G P G G+P GPP G G
Sbjct: 459 PKGEDGKDGSPGEPGANGLPGAAGERGPSGFRGPAGPNGIPGEKGPPGERGGPGPAGPRG 518
Query: 178 NTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDP-TKGPSYDPAKGPG 236
T G +R P GPG + GP S+ S P GPS P PG
Sbjct: 519 VAGEPGRDGTPGGPGIRGMPGSPGGPGNDGKPGP--PGSQGESGRPGPPGPS-GPRGQPG 575
Query: 237 YDPTKGP-GYDAQKGSNYDAQRGPNYDIHRGPS-YDPQRG-LGYDMQRGPNYDM-QRGP- 291
GP G D G N + + GP GP+ + + G G GP D GP
Sbjct: 576 VMGFPGPKGNDGAPGKNGE-RGGPGGPGLPGPAGKNGETGPQGPPGPTGPAGDKGDSGPP 634
Query: 292 GYE-TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGT-GF 349
G + Q +PG GP E + P+ G G G AP GT G
Sbjct: 635 GPQGLQGIPGTG---GPPGENGKPGEPGPKGEVGAPGAPG-GKGDSGAPGERGPPGTAGI 690
Query: 350 DGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS----GQP--RGG 388
GA GA P G P P G PP SGS G P RGG
Sbjct: 691 PGARGGAGPPG---PEGGKGPAGPPGPPGASGSPGLQGMPGERGG 732
Score = 121 (47.7 bits), Expect = 0.00099, P = 0.00099
Identities = 81/284 (28%), Positives = 103/284 (36%)
Query: 128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP----NTSTS 182
G GG G + P G + + G P GPP AG G GP +
Sbjct: 166 GGMGGYPGPAGPPGPPGPPGSSGHPGSPGSPGYQGPPGEPGQAGPAGPPGPPGALGPAGP 225
Query: 183 AYAATQSGTPMRAAY-DIPRGPGYEASKG-PGYDASKAP-SYDPTKGPSYDPAKGPGYDP 239
A +SG P R +P PG + G PG+ K +D G + PG
Sbjct: 226 AGKDGESGRPGRPGERGLPGPPGIKGPAGMPGFPGMKGHRGFDGRNGEKGETG-APGLKG 284
Query: 240 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGP-GYE-T 295
G PG + G RG + R P G G D RG D Q GP G T
Sbjct: 285 ENGLPGDNGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQPGPPGPPGT 339
Query: 296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
PG +G V A S PG QRG+ A + P G +G+P G
Sbjct: 340 AGFPGSPGAKGEVGPAGSPGS---NGSPG---QRGEPGPQGHAGAQGPPGPPGNNGSPGG 393
Query: 356 AAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RG--GNPAR 392
G P + P G+ PP +G+ G P RG G P +
Sbjct: 394 KGEMG--PAGIPGAPGLIGARGPPGPAGTNGIPGTRGPSGEPGK 435
>FB|FBgn0262126 [details] [associations]
symbol:gho "ghost" species:7227 "Drosophila melanogaster"
[GO:0004252 "serine-type endopeptidase activity" evidence=ISS]
[GO:0006888 "ER to Golgi vesicle-mediated transport" evidence=IEA]
[GO:0006886 "intracellular protein transport" evidence=IEA]
[GO:0008270 "zinc ion binding" evidence=IEA] [GO:0030127 "COPII
vesicle coat" evidence=IEA] [GO:0005811 "lipid particle"
evidence=IDA] [GO:0035158 "regulation of tube diameter, open
tracheal system" evidence=IMP] [GO:0009306 "protein secretion"
evidence=IMP] [GO:0035151 "regulation of tube size, open tracheal
system" evidence=IMP] [GO:0070971 "endoplasmic reticulum exit site"
evidence=IDA] [GO:0003331 "positive regulation of extracellular
matrix constituent secretion" evidence=IMP] [GO:0007029
"endoplasmic reticulum organization" evidence=IMP] [GO:0048081
"positive regulation of cuticle pigmentation" evidence=IMP]
[GO:0030011 "maintenance of cell polarity" evidence=IMP]
[GO:0007030 "Golgi organization" evidence=IMP] [GO:0016203 "muscle
attachment" evidence=IMP] [GO:0035149 "lumen formation, open
tracheal system" evidence=IMP] [GO:0034394 "protein localization to
cell surface" evidence=IMP] [GO:0040003 "chitin-based cuticle
development" evidence=IMP] [GO:0022409 "positive regulation of
cell-cell adhesion" evidence=IMP] [GO:0008360 "regulation of cell
shape" evidence=IMP] [GO:0071711 "basement membrane organization"
evidence=IMP] [GO:0000902 "cell morphogenesis" evidence=IMP]
InterPro:IPR006895 InterPro:IPR006896 InterPro:IPR006900
Pfam:PF04810 Pfam:PF04811 Pfam:PF04815 GO:GO:0006886 EMBL:AE014134
GO:GO:0008360 GO:GO:0005811 GO:GO:0008270 GO:GO:0009306
GO:GO:0016787 GO:GO:0016203 GO:GO:0000902 InterPro:IPR007123
Pfam:PF00626 GO:GO:0006888 GO:GO:0040003 GO:GO:0034394
GO:GO:0003331 GO:GO:0071711 GO:GO:0007030 GO:GO:0007029
GO:GO:0030011 GO:GO:0035158 GO:GO:0022409 GO:GO:0035149
GO:GO:0030127 SUPFAM:SSF82919 GO:GO:0070971 InterPro:IPR012990
Pfam:PF08033 SUPFAM:SSF81811 eggNOG:COG5028 KO:K14007
GeneTree:ENSGT00590000082962 HSSP:P40482 OMA:QDQGNCN GO:GO:0048081
EMBL:AY052042 RefSeq:NP_608664.2 UniGene:Dm.269 SMR:Q9VQ94
IntAct:Q9VQ94 MINT:MINT-283494 STRING:Q9VQ94
EnsemblMetazoa:FBtr0077810 EnsemblMetazoa:FBtr0329964 GeneID:33409
KEGG:dme:Dmel_CG10882 UCSC:CG10882-RA CTD:33409 FlyBase:FBgn0262126
InParanoid:Q9VQ94 OrthoDB:EOG4CVDNW GenomeRNAi:33409 NextBio:783418
Uniprot:Q9VQ94
Length = 1193
Score = 135 (52.6 bits), Expect = 2.3e-05, P = 2.3e-05
Identities = 65/231 (28%), Positives = 84/231 (36%)
Query: 156 GVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIP---RGPGYEASKGPG 212
G P G PP + + + P S +++ G P P PG + G
Sbjct: 211 GQPPLPGQPPFS--GQIPTSQPAPSPYGVPSSRPGQPQLPPGATPPTYTQPGLPPQQQQG 268
Query: 213 YDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQ 272
+ P P + P + P + PG P PG Q G+ Y A + Y G + Q
Sbjct: 269 IPPLQQPGI-PQQQPGFPPQQ-PGLPPLSQPGLPPQPGAPYGAPQQGGYS---G-GFPGQ 322
Query: 273 RGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG----PGYDLQ 328
G+ P PG + P + + P Y Q+ P Y PQ G PGY Q
Sbjct: 323 APGGFPGAPPPL------PGQQAAAPPQFGAPQ-PGYPGQQ-PGYPPQPGQQPMPGYPPQ 374
Query: 329 RGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPAR 379
GQ P Y P G GF G P G Q P P Y A P AR
Sbjct: 375 PGQQLG---GPGYPPQPGAGFPGQP-GRPGFNQPPMPGAGNMYQQA-PQAR 420
Score = 127 (49.8 bits), Expect = 0.00017, P = 0.00017
Identities = 75/283 (26%), Positives = 100/283 (35%)
Query: 128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHG-PPPSATTAGVVGAGPNTSTSAY-- 184
G GGA G P G+ + PPP+ GA P T +Y
Sbjct: 90 GGVGGANPLKPPLPQGAPAAAAPPPTGFNQFNSNAAPPPTNNNNAAFGAPPPTQAGSYVN 149
Query: 185 -AATQSGTPMRAAYDIPRGPGYEASKG--PGYDASKAPSYDPTKGPSYDPAKG------- 234
A S TP A I + A+ P KA + G PA G
Sbjct: 150 GALPPSSTPQSVASGINQMSLNSATLAGLPHMPPPKAATPGAAPGQPPIPAAGSTSQPPL 209
Query: 235 PGYDPTKGPGYDAQKGSNYDAQRGPN-YDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY 293
PG P PG G +Q P+ Y + PQ G P Y P
Sbjct: 210 PGQPPL--PGQPPFSGQIPTSQPAPSPYGVPSSRPGQPQLPPG---ATPPTYTQPGLPPQ 264
Query: 294 ETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYD-PSRGTGFDGA 352
+ Q +P +Q+ + Q+ P + PQ+ PG G + Y P +G G+ G
Sbjct: 265 QQQGIP--PLQQPGI--PQQQPGFPPQQ-PGLPPLSQPGLPPQPGAPYGAPQQG-GYSGG 318
Query: 353 PRGAAPHG--QVPPPLNNVPYGSATPPARSGSGQPRGGNPARR 393
G AP G PPPL P A P + G+ QP G P ++
Sbjct: 319 FPGQAPGGFPGAPPPL---PGQQAAAPPQFGAPQP--GYPGQQ 356
>WB|WBGene00001215 [details] [associations]
symbol:ego-2 species:6239 "Caenorhabditis elegans"
[GO:0040002 "collagen and cuticulin-based cuticle development"
evidence=IMP] [GO:0002009 "morphogenesis of an epithelium"
evidence=IMP] [GO:0040007 "growth" evidence=IMP] [GO:0002119
"nematode larval development" evidence=IMP] [GO:0040035
"hermaphrodite genitalia development" evidence=IMP] [GO:0009792
"embryo development ending in birth or egg hatching" evidence=IMP]
[GO:0045747 "positive regulation of Notch signaling pathway"
evidence=IGI] InterPro:IPR025304 Pfam:PF13949 GO:GO:0009792
GO:GO:0002009 GO:GO:0040007 GO:GO:0002119 GO:GO:0045747
GO:GO:0040035 Gene3D:1.25.40.280 InterPro:IPR004328 Pfam:PF03097
SMART:SM01041 PROSITE:PS51180 GO:GO:0040002 EMBL:AL117201
UniGene:Cel.16377 GeneID:190251 KEGG:cel:CELE_Y53H1C.2 CTD:190251
RefSeq:NP_001251634.1 ProteinModelPortal:H8ESG1 WormBase:Y53H1C.2c
Uniprot:H8ESG1
Length = 1494
Score = 136 (52.9 bits), Expect = 2.3e-05, P = 2.3e-05
Identities = 79/280 (28%), Positives = 107/280 (38%)
Query: 129 SYGGATGNSENETSGRPVGQNAYEDGYGVPQG-----HGPPPSATTAGVVGAGPNTSTSA 183
SYG T + G G + Y++G P G GPP + A A P TS
Sbjct: 1050 SYGAPT--PPQASYGPAPGAHGYQNGAQGPPGAEVGAQGPPGAHFGAHGASAPPPTS--- 1104
Query: 184 YAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP-GYDP--- 239
Y A P +A+Y PG + G ++A A + PT + P +GP G P
Sbjct: 1105 YGAPTPQRPPQASYGA--APGAQGPPGGQFEAHGAAALPPTSHGAPTP-QGPFGAAPGAQ 1161
Query: 240 --TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQR 297
+GP Y Q+G+ Y+AQ+ P I P PQ + Q G PG +
Sbjct: 1162 FGAQGP-Y-GQQGARYEAQKSPGAAIFGAPGAPPQHQGSFGAQFGVPPPQNSAPGAQFGA 1219
Query: 298 VPGYDVQRGPVYEAQRAPSY-IPQRGPGYDL-QRG-QGYDMRRAP---SYD-----P-SR 345
P P Q PSY P P + Q QG + P S+ P +R
Sbjct: 1220 KPEAS-SHAPTPPPQPHPSYQAPAPPPALSVFQHSPQGAPITAPPPASSHHEHIAAPQAR 1278
Query: 346 GTGFDGAPRG--AAPHG-QVPPPLNNVPYGSATPPARSGS 382
T GAP A P + P N P A P A++ +
Sbjct: 1279 FTPTPGAPSPWHATPAELKFQTPWNTTPQYHAPPGAQAAA 1318
>UNIPROTKB|F1SKM1 [details] [associations]
symbol:COL7A1 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0031012 "extracellular matrix" evidence=IDA]
[GO:0004867 "serine-type endopeptidase inhibitor activity"
evidence=IEA] InterPro:IPR002035 InterPro:IPR002223
InterPro:IPR003961 Pfam:PF00014 Pfam:PF00041 Pfam:PF00092
PRINTS:PR00759 PROSITE:PS50234 PROSITE:PS50279 PROSITE:PS50853
SMART:SM00060 SMART:SM00327 Gene3D:2.60.40.10 InterPro:IPR013783
GO:GO:0004867 SUPFAM:SSF49265 Gene3D:4.10.410.10 InterPro:IPR020901
SUPFAM:SSF57362 PROSITE:PS00280 InterPro:IPR008160 Pfam:PF01391
GO:GO:0005604 OMA:RRVCTTA GeneTree:ENSGT00700000104301
EMBL:CU633242 Ensembl:ENSSSCT00000012432 ArrayExpress:F1SKM1
Uniprot:F1SKM1
Length = 2939
Score = 148 (57.2 bits), Expect = 2.4e-05, Sum P(2) = 2.4e-05
Identities = 82/272 (30%), Positives = 105/272 (38%)
Query: 145 PVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG-P 203
P G G P GPP SA G G P S + + GTP + P+G P
Sbjct: 1270 PPGPPGLPGRIGAPGPPGPPGSAIAKGERGF-PGADGSPGSPGRPGTPGTSG---PKGSP 1325
Query: 204 GYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGPNY 261
G+ +G PG + P +P + +GPG KG PG GS RGP+
Sbjct: 1326 GWPGPRGEPGERGPRGPKGEPGEPGRVIGGEGPGLPGQKGDPGLPGPPGS-----RGPSG 1380
Query: 262 DIH-RGPSYDPQRGL----GYDMQRGPNY--DMQRGPGYE-TQRVPGYDVQRGPV----Y 309
D RGP P + G +RGP D PG +PG +GPV
Sbjct: 1381 DPGPRGPPGFPGTAVKGEKGDRGERGPPGPGDGTAAPGDPGLPGLPGSPGPQGPVGPPGE 1440
Query: 310 EAQRAPSYIPQRG----PGYDLQRG-QGYDMRRAPSYDPSRG-TGFDGAPRGAAPHGQVP 363
+ ++ S G PG +RG +G+ P D RG TG G P G P
Sbjct: 1441 KGEKGDSEDGAPGLPGQPGVPGERGLRGFPGDTGPKGD--RGLTGAVGEPGEKGERGS-P 1497
Query: 364 PPLNNVPYGSATPPARSGSGQPRG--GNPARR 393
P+ P G P R G+ P G G RR
Sbjct: 1498 GPVG--PQGPPGVPGRPGAEGPEGPPGPTGRR 1527
Score = 41 (19.5 bits), Expect = 2.4e-05, Sum P(2) = 2.4e-05
Identities = 8/24 (33%), Positives = 15/24 (62%)
Query: 2 PKVGA-HKLEIRCTLIFTCTLDFL 24
P+V A H+ + CT +++ + FL
Sbjct: 19 PRVRAQHRERVTCTRLYSADIVFL 42
>UNIPROTKB|F1NRH2 [details] [associations]
symbol:LOC100858979 "Uncharacterized protein" species:9031
"Gallus gallus" [GO:0005581 "collagen" evidence=IEA] [GO:0005938
"cell cortex" evidence=IEA] InterPro:IPR008983 GO:GO:0005938
GO:GO:0005581 Gene3D:2.60.120.40 InterPro:IPR001073
InterPro:IPR008160 Pfam:PF00386 Pfam:PF01391 PRINTS:PR00007
SMART:SM00110 SUPFAM:SSF49842 PROSITE:PS50871
GeneTree:ENSGT00700000104270 OMA:IKGPPPN EMBL:AC147437
IPI:IPI01017314 RefSeq:XP_003641055.1 Ensembl:ENSGALT00000024133
GeneID:100858979 KEGG:gga:100858979 Uniprot:F1NRH2
Length = 674
Score = 132 (51.5 bits), Expect = 2.4e-05, P = 2.4e-05
Identities = 87/283 (30%), Positives = 107/283 (37%)
Query: 125 RADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTSA 183
+ D GA G + P G+ E G G P GPP A G G GP
Sbjct: 227 KGDRGLPGARGEAGIPGPQGPPGEPG-EVGIGKPGPMGPPGPAGIPGAKGLPGP------ 279
Query: 184 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAK-GP-GYDPT 240
A G+P + P PG + +GP G P D +GP+ P + GP G
Sbjct: 280 --AGLPGSPGLPGFGKPGLPGMKGHRGPEGPPGFPGPKGD--QGPAGVPGEPGPAGPQGN 335
Query: 241 KGP-GYDAQKGSNYDAQRGPNYDIHR-GPSYDP----QRGL-GYDMQRGPNYDMQRG-PG 292
GP G G N GP D+ GP+ P +RGL G D + P Y ++G PG
Sbjct: 336 MGPQGLKGLPGEN--GLPGPKGDMGPVGPAGFPGAKGERGLPGLDGK--PGYPGEQGLPG 391
Query: 293 YETQRVPGYDVQRGPVYEAQRAPSYIPQR-GP-GYDLQRG-QGYDMRRAPSYDPS-RGT- 347
+ PG Q+G A P +P GP G G G R PS P RG
Sbjct: 392 PKGH--PGLPGQKGDTGHA--GPPGLPGPVGPQGVKGVPGINGEPGPRGPSGIPGIRGPI 447
Query: 348 GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNP 390
G G P G+ P P G AT R G P P
Sbjct: 448 GPPGMPGAPGAKGEAGAPGLPGPAGIATKGLRGPMGPPGPPGP 490
>UNIPROTKB|F1PGS0 [details] [associations]
symbol:POLR2A "DNA-directed RNA polymerase" species:9615
"Canis lupus familiaris" [GO:0003899 "DNA-directed RNA polymerase
activity" evidence=IEA] [GO:0006366 "transcription from RNA
polymerase II promoter" evidence=IEA] [GO:0005665 "DNA-directed RNA
polymerase II, core complex" evidence=IEA] [GO:0003677 "DNA
binding" evidence=IEA] InterPro:IPR000684 InterPro:IPR000722
InterPro:IPR006592 InterPro:IPR007066 InterPro:IPR007073
InterPro:IPR007075 InterPro:IPR007080 InterPro:IPR007081
InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983 Pfam:PF04990
Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000 Pfam:PF05001
PROSITE:PS00115 SMART:SM00663 GO:GO:0003677 GO:GO:0006366
Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899 GO:GO:0005665
GeneTree:ENSGT00700000104490 OMA:KVLPWST EMBL:AAEX03003616
EMBL:AAEX03003617 Ensembl:ENSCAFT00000026237 Uniprot:F1PGS0
Length = 1969
Score = 137 (53.3 bits), Expect = 2.5e-05, P = 2.5e-05
Identities = 71/265 (26%), Positives = 95/265 (35%)
Query: 118 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP--SATTAGVVGA 175
N P + A G G G++ + G + G G P S T G G
Sbjct: 1489 NIPGLG--AAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGF 1546
Query: 176 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP 235
P+ ++ A + +P A P PG PG + PS PSY P P
Sbjct: 1547 SPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGAMSPSYSPTS-P 1598
Query: 236 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
Y+P GY Q S Y + P+Y PSY P Y P+Y P Y
Sbjct: 1599 AYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPSYS-PTSPSYSP 1652
Query: 296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
P Y P Y + +PSY P P Y Y +PSY P+ + +P
Sbjct: 1653 TS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPTSPSYSPTSP-S 1705
Query: 356 AAPHGQVPPPLNNVPYGSATPPARS 380
+P P + P S T P+ S
Sbjct: 1706 YSPTSPSYSPTS--PSYSPTSPSYS 1728
>UNIPROTKB|G3MZY8 [details] [associations]
symbol:POLR2A "DNA-directed RNA polymerase" species:9913
"Bos taurus" [GO:0031625 "ubiquitin protein ligase binding"
evidence=IEA] [GO:0005665 "DNA-directed RNA polymerase II, core
complex" evidence=IEA] [GO:0004672 "protein kinase activity"
evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA] [GO:0003899
"DNA-directed RNA polymerase activity" evidence=IEA] [GO:0006366
"transcription from RNA polymerase II promoter" evidence=IEA]
InterPro:IPR000684 InterPro:IPR000722 InterPro:IPR006592
InterPro:IPR007066 InterPro:IPR007073 InterPro:IPR007075
InterPro:IPR007080 InterPro:IPR007081 InterPro:IPR007083
Pfam:PF00623 Pfam:PF04983 Pfam:PF04990 Pfam:PF04992 Pfam:PF04997
Pfam:PF04998 Pfam:PF05000 Pfam:PF05001 PROSITE:PS00115
SMART:SM00663 GO:GO:0003677 GO:GO:0004672 GO:GO:0006366
Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899 GO:GO:0005665
GeneTree:ENSGT00700000104490 OMA:KVLPWST EMBL:DAAA02048777
EMBL:DAAA02048778 EMBL:DAAA02048779 EMBL:DAAA02048780
EMBL:DAAA02048781 Ensembl:ENSBTAT00000064788 Uniprot:G3MZY8
Length = 1970
Score = 137 (53.3 bits), Expect = 2.5e-05, P = 2.5e-05
Identities = 71/265 (26%), Positives = 95/265 (35%)
Query: 118 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP--SATTAGVVGA 175
N P + A G G G++ + G + G G P S T G G
Sbjct: 1490 NIPGLG--AAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGF 1547
Query: 176 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP 235
P+ ++ A + +P A P PG PG + PS PSY P P
Sbjct: 1548 SPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGAMSPSYSPTS-P 1599
Query: 236 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
Y+P GY Q S Y + P+Y PSY P Y P+Y P Y
Sbjct: 1600 AYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPSYS-PTSPSYSP 1653
Query: 296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
P Y P Y + +PSY P P Y Y +PSY P+ + +P
Sbjct: 1654 TS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPTSPSYSPTSP-S 1706
Query: 356 AAPHGQVPPPLNNVPYGSATPPARS 380
+P P + P S T P+ S
Sbjct: 1707 YSPTSPSYSPTS--PSYSPTSPSYS 1729
>UNIPROTKB|P24928 [details] [associations]
symbol:POLR2A "DNA-directed RNA polymerase II subunit RPB1"
species:9606 "Homo sapiens" [GO:0046872 "metal ion binding"
evidence=IEA] [GO:0003968 "RNA-directed RNA polymerase activity"
evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
[GO:0003677 "DNA binding" evidence=NAS] [GO:0003899 "DNA-directed
RNA polymerase activity" evidence=NAS] [GO:0006355 "regulation of
transcription, DNA-dependent" evidence=NAS] [GO:0006366
"transcription from RNA polymerase II promoter"
evidence=IDA;NAS;TAS] [GO:0005634 "nucleus" evidence=IDA;NAS]
[GO:0005665 "DNA-directed RNA polymerase II, core complex"
evidence=IDA] [GO:0004672 "protein kinase activity" evidence=IDA]
[GO:0005730 "nucleolus" evidence=IDA] [GO:0000398 "mRNA splicing,
via spliceosome" evidence=TAS] [GO:0005654 "nucleoplasm"
evidence=TAS] [GO:0006281 "DNA repair" evidence=TAS] [GO:0006283
"transcription-coupled nucleotide-excision repair" evidence=TAS]
[GO:0006289 "nucleotide-excision repair" evidence=TAS] [GO:0006367
"transcription initiation from RNA polymerase II promoter"
evidence=TAS] [GO:0006368 "transcription elongation from RNA
polymerase II promoter" evidence=TAS] [GO:0006370
"7-methylguanosine mRNA capping" evidence=TAS] [GO:0008380 "RNA
splicing" evidence=TAS] [GO:0010467 "gene expression" evidence=TAS]
[GO:0016032 "viral reproduction" evidence=TAS] [GO:0050434
"positive regulation of viral transcription" evidence=TAS]
[GO:0031625 "ubiquitin protein ligase binding" evidence=IPI]
[GO:0006468 "protein phosphorylation" evidence=IDA]
Reactome:REACT_216 Reactome:REACT_71 InterPro:IPR000684
InterPro:IPR000722 InterPro:IPR006592 InterPro:IPR007066
InterPro:IPR007073 InterPro:IPR007075 InterPro:IPR007080
InterPro:IPR007081 InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983
Pfam:PF04990 Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000
Pfam:PF05001 PROSITE:PS00115 SMART:SM00663 Reactome:REACT_116125
EMBL:CH471108 GO:GO:0016032 GO:GO:0006355 GO:GO:0046872
GO:GO:0003677 Reactome:REACT_1675 GO:GO:0006468 GO:GO:0006368
Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0006367 GO:GO:0000398
Reactome:REACT_1788 GO:GO:0006370 GO:GO:0050434 GO:GO:0006283
Reactome:REACT_1892 EMBL:AC113189 GO:GO:0003899 PDB:2GHQ PDB:2GHT
PDBsum:2GHQ PDBsum:2GHT eggNOG:COG0086 GO:GO:0003968 GO:GO:0005665
HOGENOM:HOG000222975 OMA:KVLPWST KO:K03006 EMBL:X63564 EMBL:X74874
EMBL:X74873 EMBL:X74872 EMBL:X74871 EMBL:X74870 EMBL:BC137231
IPI:IPI00031627 PIR:I38186 PIR:S21054 RefSeq:NP_000928.1
UniGene:Hs.270017 PDB:2LTO PDBsum:2LTO ProteinModelPortal:P24928
SMR:P24928 DIP:DIP-29011N IntAct:P24928 MINT:MINT-156582
STRING:P24928 PhosphoSite:P24928 DMDM:281185484 PaxDb:P24928
PRIDE:P24928 Ensembl:ENST00000322644 GeneID:5430 KEGG:hsa:5430
UCSC:uc002ghf.4 CTD:5430 GeneCards:GC17P007387 H-InvDB:HIX0173727
HGNC:HGNC:9187 HPA:CAB012226 HPA:CAB016388 HPA:CAB022311
HPA:HPA021563 MIM:180660 neXtProt:NX_P24928 PharmGKB:PA33507
HOVERGEN:HBG004339 InParanoid:P24928 OrthoDB:EOG4JWVCM
BindingDB:P24928 ChEMBL:CHEMBL1641353 ChiTaRS:POLR2A
EvolutionaryTrace:P24928 GenomeRNAi:5430 NextBio:21009
ArrayExpress:P24928 Bgee:P24928 CleanEx:HS_POLR2A
Genevestigator:P24928 GermOnline:ENSG00000181222 Uniprot:P24928
Length = 1970
Score = 137 (53.3 bits), Expect = 2.5e-05, P = 2.5e-05
Identities = 71/265 (26%), Positives = 95/265 (35%)
Query: 118 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP--SATTAGVVGA 175
N P + A G G G++ + G + G G P S T G G
Sbjct: 1489 NIPGLG--AAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGF 1546
Query: 176 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP 235
P+ ++ A + +P A P PG PG + PS PSY P P
Sbjct: 1547 SPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGAMSPSYSPTS-P 1598
Query: 236 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
Y+P GY Q S Y + P+Y PSY P Y P+Y P Y
Sbjct: 1599 AYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPSYS-PTSPSYSP 1652
Query: 296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
P Y P Y + +PSY P P Y Y +PSY P+ + +P
Sbjct: 1653 TS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPTSPSYSPTSP-S 1705
Query: 356 AAPHGQVPPPLNNVPYGSATPPARS 380
+P P + P S T P+ S
Sbjct: 1706 YSPTSPSYSPTS--PSYSPTSPSYS 1728
>MGI|MGI:98086 [details] [associations]
symbol:Polr2a "polymerase (RNA) II (DNA directed) polypeptide
A" species:10090 "Mus musculus" [GO:0003677 "DNA binding"
evidence=IDA] [GO:0003899 "DNA-directed RNA polymerase activity"
evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
[GO:0005634 "nucleus" evidence=ISO] [GO:0005665 "DNA-directed RNA
polymerase II, core complex" evidence=ISO] [GO:0005730 "nucleolus"
evidence=ISO] [GO:0006351 "transcription, DNA-dependent"
evidence=IEA] [GO:0006366 "transcription from RNA polymerase II
promoter" evidence=ISO] [GO:0006468 "protein phosphorylation"
evidence=ISO] [GO:0016740 "transferase activity" evidence=IEA]
[GO:0016779 "nucleotidyltransferase activity" evidence=IEA]
[GO:0031625 "ubiquitin protein ligase binding" evidence=ISO]
[GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000684
InterPro:IPR000722 InterPro:IPR006592 InterPro:IPR007066
InterPro:IPR007073 InterPro:IPR007075 InterPro:IPR007080
InterPro:IPR007081 InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983
Pfam:PF04990 Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000
Pfam:PF05001 PROSITE:PS00115 SMART:SM00663 MGI:MGI:98086
GO:GO:0046872 GO:GO:0003677 GO:GO:0004672 GO:GO:0006366
EMBL:AL603707 Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899
eggNOG:COG0086 GO:GO:0005665 GeneTree:ENSGT00700000104490
HOGENOM:HOG000222975 OMA:KVLPWST KO:K03006 CTD:5430
HOVERGEN:HBG004339 OrthoDB:EOG4JWVCM ChiTaRS:POLR2A EMBL:M12130
EMBL:M14101 IPI:IPI00136207 PIR:A28490 RefSeq:NP_033115.1
UniGene:Mm.16533 DisProt:DP00181 ProteinModelPortal:P08775
SMR:P08775 DIP:DIP-46369N IntAct:P08775 STRING:P08775
PhosphoSite:P08775 PaxDb:P08775 PRIDE:P08775
Ensembl:ENSMUST00000058470 Ensembl:ENSMUST00000071213 GeneID:20020
KEGG:mmu:20020 UCSC:uc007jrj.1 InParanoid:Q5F298 NextBio:297535
Bgee:P08775 CleanEx:MM_POLR2A Genevestigator:P08775
GermOnline:ENSMUSG00000005198 Uniprot:P08775
Length = 1970
Score = 137 (53.3 bits), Expect = 2.5e-05, P = 2.5e-05
Identities = 71/265 (26%), Positives = 95/265 (35%)
Query: 118 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP--SATTAGVVGA 175
N P + A G G G++ + G + G G P S T G G
Sbjct: 1489 NIPGLG--AAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGF 1546
Query: 176 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP 235
P+ ++ A + +P A P PG PG + PS PSY P P
Sbjct: 1547 SPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGAMSPSYSPTS-P 1598
Query: 236 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
Y+P GY Q S Y + P+Y PSY P Y P+Y P Y
Sbjct: 1599 AYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPSYS-PTSPSYSP 1652
Query: 296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
P Y P Y + +PSY P P Y Y +PSY P+ + +P
Sbjct: 1653 TS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPTSPSYSPTSP-S 1705
Query: 356 AAPHGQVPPPLNNVPYGSATPPARS 380
+P P + P S T P+ S
Sbjct: 1706 YSPTSPSYSPTS--PSYSPTSPSYS 1728
>RGD|1587326 [details] [associations]
symbol:Polr2a "polymerase (RNA) II (DNA directed) polypeptide A"
species:10116 "Rattus norvegicus" [GO:0003674 "molecular_function"
evidence=ND] [GO:0003677 "DNA binding" evidence=IEA;ISO]
[GO:0003899 "DNA-directed RNA polymerase activity" evidence=IEA]
[GO:0004672 "protein kinase activity" evidence=IEA;ISO] [GO:0005575
"cellular_component" evidence=ND] [GO:0005634 "nucleus"
evidence=ISO] [GO:0005665 "DNA-directed RNA polymerase II, core
complex" evidence=IEA;ISO] [GO:0006366 "transcription from RNA
polymerase II promoter" evidence=IEA;ISO] [GO:0006468 "protein
phosphorylation" evidence=ISO] [GO:0008150 "biological_process"
evidence=ND] [GO:0031625 "ubiquitin protein ligase binding"
evidence=IEA;ISO] [GO:0005730 "nucleolus" evidence=ISO]
InterPro:IPR000684 InterPro:IPR000722 InterPro:IPR006592
InterPro:IPR007066 InterPro:IPR007073 InterPro:IPR007075
InterPro:IPR007080 InterPro:IPR007081 InterPro:IPR007083
Pfam:PF00623 Pfam:PF04983 Pfam:PF04990 Pfam:PF04992 Pfam:PF04997
Pfam:PF04998 Pfam:PF05000 Pfam:PF05001 PROSITE:PS00115
SMART:SM00663 RGD:1587326 GO:GO:0003677 GO:GO:0004672 GO:GO:0006366
Gene3D:2.40.40.20 InterPro:IPR009010 GO:GO:0003899 GO:GO:0005665
GeneTree:ENSGT00700000104490 KO:K03006 CTD:5430 OrthoDB:EOG4JWVCM
IPI:IPI00952328 RefSeq:XP_001079162.1 RefSeq:XP_343923.3
UniGene:Rn.163136 Ensembl:ENSRNOT00000068013 GeneID:363633
KEGG:rno:363633 UCSC:RGD:1587326 NextBio:683839 ArrayExpress:D4A5A6
Uniprot:D4A5A6
Length = 1970
Score = 137 (53.3 bits), Expect = 2.5e-05, P = 2.5e-05
Identities = 71/265 (26%), Positives = 95/265 (35%)
Query: 118 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPP--SATTAGVVGA 175
N P + A G G G++ + G + G G P S T G G
Sbjct: 1489 NIPGLG--AAGPTGMFFGSAPSPMGGISPAMTPWNQGATPAYGAWSPSVGSGMTPGAAGF 1546
Query: 176 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGP 235
P+ ++ A + +P A P PG PG + PS PSY P P
Sbjct: 1547 SPSAASDASGFSPGYSP--AWSPTPGSPG-----SPGPSSPYIPSPGGAMSPSYSPTS-P 1598
Query: 236 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
Y+P GY Q S Y + P+Y PSY P Y P+Y P Y
Sbjct: 1599 AYEPRSPGGYTPQSPS-Y-SPTSPSYS-PTSPSYSPTSP-NYS-PTSPSYS-PTSPSYSP 1652
Query: 296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
P Y P Y + +PSY P P Y Y +PSY P+ + +P
Sbjct: 1653 TS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSPTSPSYSPTSP-S 1705
Query: 356 AAPHGQVPPPLNNVPYGSATPPARS 380
+P P + P S T P+ S
Sbjct: 1706 YSPTSPSYSPTS--PSYSPTSPSYS 1728
>UNIPROTKB|F1RXW0 [details] [associations]
symbol:COL5A2 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0071230 "cellular response to amino acid stimulus"
evidence=IEA] [GO:0048592 "eye morphogenesis" evidence=IEA]
[GO:0046332 "SMAD binding" evidence=IEA] [GO:0043588 "skin
development" evidence=IEA] [GO:0030199 "collagen fibril
organization" evidence=IEA] [GO:0005588 "collagen type V"
evidence=IEA] [GO:0001501 "skeletal system development"
evidence=IEA] [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] InterPro:IPR000885 Pfam:PF01410
ProDom:PD002078 PROSITE:PS51461 SMART:SM00038 GO:GO:0043588
GO:GO:0030199 GO:GO:0001501 InterPro:IPR008160 Pfam:PF01391
GO:GO:0071230 GO:GO:0005201 GO:GO:0048592
GeneTree:ENSGT00660000095287 GO:GO:0005588 OMA:PDHKPVW
EMBL:CU467671 Ensembl:ENSSSCT00000017460 ArrayExpress:F1RXW0
Uniprot:F1RXW0
Length = 1269
Score = 135 (52.6 bits), Expect = 2.5e-05, P = 2.5e-05
Identities = 87/293 (29%), Positives = 109/293 (37%)
Query: 123 DRRADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNT 179
++ A+G+ G GA G P G E G P+G GPP S G G T
Sbjct: 554 EKGAEGTAGNDGARGLPGPLGPPGPAGPTG-EKGEPGPRGLVGPPGSRGNPGSRGENGPT 612
Query: 180 STSAYAATQS--GTP-MRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGP 235
+A Q G P ++ P G S GP G S P + P P +G
Sbjct: 613 GAVGFAGPQGPDGQPGVKGEPGEPGQKGDAGSPGPQGLAGSPGP-HGPNGVPGLKGGRGT 671
Query: 236 GYDP--TKGPGYDAQKGS-NYDAQRGPNYDIHRGPSYDPQRGLGYDM-------QRGPNY 285
P T PG + G GP + P + GL D RGP
Sbjct: 672 QGPPGATGFPGSAGRVGPPGPTGAPGPAGPLGE-PGKEGPPGLRGDPGSHGRVGDRGPA- 729
Query: 286 DMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYD 342
GPG + PG D Q GP + P+ QRG G QRG+ G P+
Sbjct: 730 GPPGGPGDKGD--PGEDGQPGP--DGPPGPAGTTGQRGIVGMPGQRGERGMPGLPGPAGT 785
Query: 343 PSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP-RGGNPARR 393
P + G GAP P G V PP +N P G P +G+ G P R G R
Sbjct: 786 PGK-VGPTGAPGDKGPPGPVGPPGSNGPVGEPGPEGPAGNDGTPGRDGAVGER 837
Score = 121 (47.7 bits), Expect = 0.00084, P = 0.00084
Identities = 83/283 (29%), Positives = 103/283 (36%)
Query: 126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGV---PQGHGPPPSATTAGVVG-AGPNTST 181
A G G E G PVG G P GP S T+G G AGP S
Sbjct: 164 ARGPEGPQGQRGETGPPG-PVGSQGLPGAVGTDGTPGAKGPTGSPGTSGPPGLAGPPGSP 222
Query: 182 SAYAATQSGTP-MRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDP- 239
+T G P +R P PG++ GP + P P + +GP DP
Sbjct: 223 GPQGST--GPPGIRGQPGDPGVPGFKGEAGPKGEPGPHGIQGPIGPPGEEGKRGPRGDPG 280
Query: 240 TKGP-GYDAQKGSNYDAQRG-PNYDIHRGPS-YDPQRG-LGYDMQRGPNYDMQR-G-PGY 293
T GP G ++G+ RG P D GP +RG +G +G D R G PG
Sbjct: 281 TVGPPGPMGERGA--PGNRGFPGSDGLPGPKGAQGERGPVGSSGPKGGQGDPGRPGEPGL 338
Query: 294 ETQR-VPGYDVQRGPVYEAQRAPSYIP-QRG----PGYDLQRGQGYDMRRAPSYDPSRGT 347
R + G +GP E + P P + G PG RGQ M S
Sbjct: 339 PGARGLTGNPGVQGP--EGKLGPLGAPGEDGRPGPPGSIGIRGQPGSMGLPGPKGSSGDP 396
Query: 348 GFDGAPRGAAPHGQ--VPPPLNNV-PYGSATPPARSGSGQPRG 387
G G A GQ P V P G PP +G +G
Sbjct: 397 GKPGEAGNAGVPGQRGAPGKDGEVGPSGPVGPPGLAGERGEQG 439
>TAIR|locus:2089616 [details] [associations]
symbol:AT3G14750 "AT3G14750" species:3702 "Arabidopsis
thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0005634
"nucleus" evidence=ISM] [GO:0008150 "biological_process"
evidence=ND] [GO:0048573 "photoperiodism, flowering" evidence=RCA]
EMBL:CP002686 EMBL:AY035083 EMBL:AY051034 IPI:IPI00544941
RefSeq:NP_566492.1 UniGene:At.20367 ProteinModelPortal:Q93V84
SMR:Q93V84 PaxDb:Q93V84 PRIDE:Q93V84 EnsemblPlants:AT3G14750.1
GeneID:820703 KEGG:ath:AT3G14750 TAIR:At3g14750 eggNOG:NOG236769
HOGENOM:HOG000242815 InParanoid:Q93V84 OMA:YAENYEH PhylomeDB:Q93V84
ProtClustDB:CLSN2688383 ArrayExpress:Q93V84 Genevestigator:Q93V84
Uniprot:Q93V84
Length = 331
Score = 127 (49.8 bits), Expect = 2.7e-05, P = 2.7e-05
Identities = 40/111 (36%), Positives = 55/111 (49%)
Query: 78 RGTYEYEKKFYNDHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRRADG--------S 129
R +YEKK Y ++ E ++ME + MA E+EKLRAE+ N+ A+G +
Sbjct: 207 RAAIDYEKKGYAENYEHGKIMEHKLVAMARELEKLRAEIANS-ETSAYANGPVGNPGGVA 265
Query: 130 YGGATGNSENETSGRPVGQNAYEDGYGV-PQ-----GHGPPPSATTAGVVG 174
YGG GN E +G PV N Y+ Y + P G+ PPP A G
Sbjct: 266 YGGGYGNPE---AGYPV--NPYQPNYTMNPAQTGVVGYYPPPYGPQAAWAG 311
>UNIPROTKB|I3LSV6 [details] [associations]
symbol:COL2A1 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0071773 "cellular response to BMP stimulus"
evidence=IEA] [GO:0071599 "otic vesicle development" evidence=IEA]
[GO:0060351 "cartilage development involved in endochondral bone
morphogenesis" evidence=IEA] [GO:0060272 "embryonic skeletal joint
morphogenesis" evidence=IEA] [GO:0060174 "limb bud formation"
evidence=IEA] [GO:0060021 "palate development" evidence=IEA]
[GO:0048407 "platelet-derived growth factor binding" evidence=IEA]
[GO:0043066 "negative regulation of apoptotic process"
evidence=IEA] [GO:0042802 "identical protein binding" evidence=IEA]
[GO:0042472 "inner ear morphogenesis" evidence=IEA] [GO:0030903
"notochord development" evidence=IEA] [GO:0030199 "collagen fibril
organization" evidence=IEA] [GO:0010468 "regulation of gene
expression" evidence=IEA] [GO:0007605 "sensory perception of sound"
evidence=IEA] [GO:0007601 "visual perception" evidence=IEA]
[GO:0007417 "central nervous system development" evidence=IEA]
[GO:0006029 "proteoglycan metabolic process" evidence=IEA]
[GO:0005737 "cytoplasm" evidence=IEA] [GO:0005615 "extracellular
space" evidence=IEA] [GO:0005604 "basement membrane" evidence=IEA]
[GO:0005585 "collagen type II" evidence=IEA] [GO:0003007 "heart
morphogenesis" evidence=IEA] [GO:0002062 "chondrocyte
differentiation" evidence=IEA] [GO:0001958 "endochondral
ossification" evidence=IEA] [GO:0001894 "tissue homeostasis"
evidence=IEA] [GO:0001502 "cartilage condensation" evidence=IEA]
[GO:0005201 "extracellular matrix structural constituent"
evidence=IEA] InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078
PROSITE:PS51461 SMART:SM00038 GO:GO:0005737 GO:GO:0043066
GO:GO:0005615 GO:GO:0003007 GO:GO:0007601 GO:GO:0030199
GO:GO:0007417 GO:GO:0042472 GO:GO:0001894 GO:GO:0007605
GO:GO:0071773 InterPro:IPR008160 Pfam:PF01391 GO:GO:0071599
GO:GO:0005604 GO:GO:0001502 GO:GO:0060021 GO:GO:0002062
GO:GO:0010468 GO:GO:0060272 GO:GO:0006029 GO:GO:0001958
GO:GO:0060351 GO:GO:0005201 GeneTree:ENSGT00660000095287
GO:GO:0005585 GO:GO:0060174 GO:GO:0030903 OMA:CPICPTE
Ensembl:ENSSSCT00000031054 Uniprot:I3LSV6
Length = 1365
Score = 135 (52.6 bits), Expect = 2.7e-05, P = 2.7e-05
Identities = 80/273 (29%), Positives = 107/273 (39%)
Query: 127 DGSYGGATGNSENETSGRPVGQNAYEDGYGVP--QGH-GPPPSATTAGVVGAGPNTSTSA 183
DG G + E G P G + G+P +GH G P G GA P +
Sbjct: 156 DGEAGKPGKSGERGPPG-PQGARGFPGTPGLPGVKGHRGYPGLDGAKGEAGA-PGVKGES 213
Query: 184 YAATQSGTPMRAAYDIPRG-PGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG 242
+ ++G+P PRG PG GP A+ A D GP+ P GP P G
Sbjct: 214 GSPGENGSPGPMG---PRGLPGERGRTGPA-GAAGARGNDGQPGPAGPP--GP-VGPAGG 266
Query: 243 PGYDAQKGSNYDAQRGPNYDIHRGP--SYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPG 300
PG+ G+ +A GP RGP + P+ G GP PG T +PG
Sbjct: 267 PGFPGAPGAKGEA--GPTGA--RGPEGAQGPRGEPGNPGSPGPA-GASGNPG--TDGIPG 319
Query: 301 YDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHG 360
G A AP + RGP Q G + + +P GF G +G P G
Sbjct: 320 AKGSAGAPGIAG-APGFPGPRGPPGP-QGATGPLGPKGQTGEPGIA-GFKGE-QG--PKG 373
Query: 361 QVPPPLNNV-PYGSATPPARSGS-GQPRGGNPA 391
+ P + P G A + G+ G+P G PA
Sbjct: 374 EPAVPGAELQPGGPAGEEGKRGARGEPGGAGPA 406
Score = 123 (48.4 bits), Expect = 0.00055, P = 0.00055
Identities = 89/295 (30%), Positives = 111/295 (37%)
Query: 120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATT--AGVVGAGP 177
P DR D GA G + G P G G P GPP A + G
Sbjct: 35 PRGDR-GDKGEKGAPGPRGRD--GEP-GTPGNPGPPGPPGPPGPPGLGGNFAAQMAGGFD 90
Query: 178 NTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAKG 234
+ A G PM PRGP G + GP G+ + P GP P +G
Sbjct: 91 EKAGGAQMGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGRVEDNSLPKATGPM-GP-RG 145
Query: 235 PGYDPTKGPGYDAQKGS-NYDAQRGP-NYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQRG 290
P P K PG D + G +RGP RG P GL G RG P D +G
Sbjct: 146 PPGPPGK-PGDDGEAGKPGKSGERGPPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAKG 202
Query: 291 ----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS----- 340
PG + + PG + GP+ + P + GP +G D + P+
Sbjct: 203 EAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPGP 260
Query: 341 YDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATP---PARSGSGQPRG--GNP 390
P+ G GF GAP GA G+ P P G+ P P GS P G GNP
Sbjct: 261 VGPAGGPGFPGAP-GAK--GEAGPTGARGPEGAQGPRGEPGNPGSPGPAGASGNP 312
>TAIR|locus:4010713902 [details] [associations]
symbol:AT4G22505 species:3702 "Arabidopsis thaliana"
[GO:0006869 "lipid transport" evidence=IEA] EMBL:CP002687
GO:GO:0006869 InterPro:IPR016140 SUPFAM:SSF47699 UniGene:At.22887
UniGene:At.74604 IPI:IPI00938995 RefSeq:NP_001154263.1 PRIDE:F4JLV7
EnsemblPlants:AT4G22505.1 GeneID:5008157 KEGG:ath:AT4G22505
OMA:GSEMAGM Uniprot:F4JLV7
Length = 530
Score = 130 (50.8 bits), Expect = 2.8e-05, P = 2.8e-05
Identities = 54/229 (23%), Positives = 67/229 (29%)
Query: 158 PQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASK 217
P+ PPP T A P T + P P+ P K P +
Sbjct: 74 PRTPPPPPPRTPRTPPTAPPRTPPVSPRIPPILPPKTPPTAPPQTPPVSPPKSPPNSPPR 133
Query: 218 APSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGY 277
AP P + P P + P P + P + R P+ R P P R
Sbjct: 134 APPLSPPRTPPTSPPRVPPLSPPRTPPTSPPRAPPIPPPRTPSTSPPRAPPLSPPRTPPT 193
Query: 278 DMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR 337
R P P R P R P R P P R P R R
Sbjct: 194 SPPRAPPVPPPNTPPTSPPRAPPLSPPRTPPNSPPRTPPTSPPRAPPVPPPRISPTAPPR 253
Query: 338 APSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPR 386
AP P R T PR + PP + +PP + PR
Sbjct: 254 APPLSPPR-TPPTSPPRTPPLSPPITPPTSPPRAPPLSPPRTPPTSPPR 301
Score = 121 (47.7 bits), Expect = 0.00028, P = 0.00028
Identities = 58/231 (25%), Positives = 69/231 (29%)
Query: 158 PQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASK 217
P+ PPP T P T + A P+ PR P K P +
Sbjct: 63 PRTPPPPPPRTPRTPPPPPPRTPRTPPTAPPRTPPVS-----PRIPPILPPKTPPTAPPQ 117
Query: 218 APSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGY 277
P P K P P + P P + P + R P R P P R
Sbjct: 118 TPPVSPPKSPPNSPPRAPPLSPPRTPPTSPPRVPPLSPPRTPPTSPPRAPPIPPPRTPST 177
Query: 278 DMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR 337
R P R P R P P RAP P R P R R
Sbjct: 178 SPPRAPPLSPPRTPPTSPPRAPPVPPPNTPPTSPPRAPPLSPPRTPPNSPPRTPPTSPPR 237
Query: 338 APSYDPSRGTGFDGAPRGAAPHGQVP--PPLNNVPYGSATPPARSGSGQPR 386
AP P R + PR AP P PP + +PP + PR
Sbjct: 238 APPVPPPRISP-TAPPR--APPLSPPRTPPTSPPRTPPLSPPITPPTSPPR 285
Score = 117 (46.2 bits), Expect = 0.00077, P = 0.00077
Identities = 53/224 (23%), Positives = 63/224 (28%)
Query: 163 PPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYD 222
PP S A + STS A P PR P P +AP
Sbjct: 159 PPTSPPRAPPIPPPRTPSTSPPRAPPLSPPRTPPTSPPRAPPVPPPNTPPTSPPRAPPLS 218
Query: 223 PTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG 282
P + P P + P P + P + S R P R P P R
Sbjct: 219 PPRTPPNSPPRTPPTSPPRAPPVPPPRISPTAPPRAPPLSPPRTPPTSPPRTPPLSPPIT 278
Query: 283 PNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYD 342
P R P R P R P R P P R P R +P
Sbjct: 279 PPTSPPRAPPLSPPRTPPTSPPRAPPISPPRTPPSSPPRAPPMPPPRTPPTSPPLSPLSP 338
Query: 343 PSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPR 386
P R P P V PP + TPP + P+
Sbjct: 339 PPRSPPMP--PTRTPP---VSPPTSPSRTPPVTPPRAPPTAPPQ 377
>UNIPROTKB|F1PG69 [details] [associations]
symbol:COL3A1 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
"extracellular matrix structural constituent" evidence=IEA]
InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
SMART:SM00038 SMART:SM00214 GO:GO:0005581 InterPro:IPR008160
Pfam:PF01391 GO:GO:0005201 GeneTree:ENSGT00660000095287 OMA:EGSPGHP
EMBL:AAEX03017880 Ensembl:ENSCAFT00000023503 Uniprot:F1PG69
Length = 1467
Score = 135 (52.6 bits), Expect = 3.0e-05, P = 3.0e-05
Identities = 85/274 (31%), Positives = 106/274 (38%)
Query: 142 SGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIP 200
+G+P G+ +++ G P GPP +A G G AGP SG +R I
Sbjct: 653 NGKP-GEPSHQGDSGAPGERGPPGAAGPMGPRGGAGP---PGPEGGKVSGGDLRPP--IS 706
Query: 201 RGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG-PGYDAQKGS-NYDAQRG 258
G G GP A P G P GPG KG PG G+ D RG
Sbjct: 707 AGAGAAGPPGPPGSAG-TPGLQGMPGERGGPG-GPGPKGDKGEPGSAGADGAPGKDGPRG 764
Query: 259 PNYDIHR-GPSYDP-QRGLG--------YDMQRGPNYDMQRGPGYETQRVPGYDVQRG-P 307
P I GP+ P +G G + GP + GP PG Q G P
Sbjct: 765 PTGPIGPPGPAGQPGDKGEGGAPGLPGIAGPRGGPGERGEHGPPGPAG-FPGAPGQNGEP 823
Query: 308 VYEAQR-APSYIPQRGP-GYDLQRGQGYDMRRAPSYDPSRGT-GFDGAPRGAA--PHGQ- 361
+ +R AP + GP G G G P +G G G P GAA P G+
Sbjct: 824 GAKGERGAPGEKGEGGPPGVAGPPG-GAGPAGPPGPQGVKGERGSPGGP-GAAGFPGGRG 881
Query: 362 VP-PPLNNV---PYGSATPPARSGSGQPRGGNPA 391
+P PP NN P GS+ P + G P G N A
Sbjct: 882 LPGPPGNNGNPGPPGSSGAPGKDGPPGPPGNNGA 915
Score = 132 (51.5 bits), Expect = 6.3e-05, P = 6.3e-05
Identities = 83/280 (29%), Positives = 101/280 (36%)
Query: 126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP----NTS 180
A G GG G + P G + + G P GPP AG G GP S
Sbjct: 165 AGGGIGGYPGPAGPPGPPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPAGPPGPPGAMGPS 224
Query: 181 TSAYAATQSGTPMRAAY-DIPRGPGYEASKG-PGYDASKAP-SYDPTKGPSYDPAKGPGY 237
A +SG P R +P PG + G PG+ K +D G D PG
Sbjct: 225 GPAGKDGESGRPGRPGERGLPGPPGMKGPAGMPGFPGMKGHRGFDGRNGEKGDTG-APGL 283
Query: 238 DPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGP-GYE 294
G PG + G RG + R P G G D RG D Q GP G
Sbjct: 284 KGENGLPGENGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQPGPPGPP 338
Query: 295 -TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 353
T PG +G V A S PG QRG+ A + P G +G+P
Sbjct: 339 GTAGFPGSPGAKGEVGPAGSPGS---NGSPG---QRGEPGPQGHAGAPGPPGPPGSNGSP 392
Query: 354 RGAAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RGG 388
G G P + P G+ PP G+ G P RGG
Sbjct: 393 GGKGEMG--PAGIPGAPGLIGARGPPGPPGTNGAPGQRGG 430
Score = 123 (48.4 bits), Expect = 0.00060, P = 0.00060
Identities = 78/261 (29%), Positives = 98/261 (37%)
Query: 147 GQNAYEDGYGVPQGHGPPPSATTAGVVGA----GPNTSTSAYAAT-QSGTPMRAAY-DIP 200
G + G P GPP +A G GA GP S + + Q G P + P
Sbjct: 321 GNDGARGSDGQPGPPGPPGTAGFPGSPGAKGEVGPAGSPGSNGSPGQRGEPGPQGHAGAP 380
Query: 201 RGPGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDP-TKG-PGYDAQKGS-NYDAQ 256
PG S G PG P+ P P A+GP P T G PG G +
Sbjct: 381 GPPGPPGSNGSPGGKGEMGPAGIPG-APGLIGARGPPGPPGTNGAPGQRGGAGEPGKNGA 439
Query: 257 RG-PNYDIHRGPSYDPQ-RG-LGYDMQRG-PNYDMQRG-PGYETQR-VPGYDVQRGPV-- 308
+G P RG + P G G D + G P G PG +R PG+ RGP
Sbjct: 440 KGEPGPRGERGEAGSPGIPGPKGEDGKDGSPGEPGANGLPGAAGERGAPGF---RGPAGA 496
Query: 309 --YEAQRAPSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPP 365
++ P+ + GPG RG G R P G G+P G G+ PP
Sbjct: 497 NGLPGEKGPAG-ERGGPGPAGPRGAPGEPGRDGVPGGPGM-RGMPGSPGGPGSDGKPGPP 554
Query: 366 LNNVPYGSATPPARSGS-GQP 385
+ G PP SG GQP
Sbjct: 555 GSQGESGRPGPPGPSGPRGQP 575
>UNIPROTKB|F1N2Y2 [details] [associations]
symbol:COL5A2 "Uncharacterized protein" species:9913 "Bos
taurus" [GO:0071230 "cellular response to amino acid stimulus"
evidence=IEA] [GO:0048592 "eye morphogenesis" evidence=IEA]
[GO:0046332 "SMAD binding" evidence=IEA] [GO:0043588 "skin
development" evidence=IEA] [GO:0030199 "collagen fibril
organization" evidence=IEA] [GO:0005588 "collagen type V"
evidence=IEA] [GO:0001501 "skeletal system development"
evidence=IEA] [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] InterPro:IPR000885 InterPro:IPR001007
Pfam:PF00093 Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208
PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038 SMART:SM00214
GO:GO:0043588 GO:GO:0030199 GO:GO:0001501 InterPro:IPR008160
Pfam:PF01391 GO:GO:0071230 GO:GO:0005201 GO:GO:0048592
GeneTree:ENSGT00660000095287 GO:GO:0005588 OMA:PDHKPVW
EMBL:DAAA02003915 EMBL:DAAA02003916 EMBL:DAAA02003917
EMBL:DAAA02003918 IPI:IPI00826022 Ensembl:ENSBTAT00000038684
Uniprot:F1N2Y2
Length = 1491
Score = 135 (52.6 bits), Expect = 3.0e-05, P = 3.0e-05
Identities = 88/293 (30%), Positives = 110/293 (37%)
Query: 123 DRRADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNT 179
++ A+G+ G GA G P G E G P+G GPP S G G T
Sbjct: 785 EKGAEGTAGNDGARGLPGPLGPPGPSGPTG-EKGEPGPRGLVGPPGSRGNPGSRGENGPT 843
Query: 180 STSAYAATQS--GTP-MRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGP 235
+A Q G P ++ P G S GP G S P + P P +G
Sbjct: 844 GAVGFAGPQGPDGQPGVKGEPGEPGQKGDAGSPGPQGLAGSPGP-HGPNGVPGLKGGRGT 902
Query: 236 GYDP--TKGPGYDAQKGSNYDAQR-GPNYDIHRGPSYDPQRGLGYDM-------QRGPNY 285
P T PG + G A GP + P + GL D RGP
Sbjct: 903 QGPPGATGFPGSAGRVGPPGPAGAPGPAGPLGE-PGKEGPPGLRGDPGSHGRVGDRGPA- 960
Query: 286 DMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYD 342
GPG + PG D Q GP + P+ QRG G QRG+ G P+
Sbjct: 961 GPPGGPGDKGD--PGEDGQPGP--DGPPGPAGTTGQRGIVGMPGQRGERGMPGLPGPAGT 1016
Query: 343 PSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP-RGGNPARR 393
P + G GAP P G V PP +N P G P +G+ G P R G R
Sbjct: 1017 PGK-VGPTGAPGDKGPPGPVGPPGSNGPVGEPGPEGPAGNDGTPGRDGAVGER 1068
>UNIPROTKB|F1PG08 [details] [associations]
symbol:COL5A2 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
"extracellular matrix structural constituent" evidence=IEA]
InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
SMART:SM00038 SMART:SM00214 GO:GO:0005581 InterPro:IPR008160
Pfam:PF01391 GO:GO:0005201 GeneTree:ENSGT00660000095287
EMBL:AAEX03017882 EMBL:AAEX03017883 EMBL:AAEX03017884
Ensembl:ENSCAFT00000023545 OMA:ETCNGLD Uniprot:F1PG08
Length = 1499
Score = 135 (52.6 bits), Expect = 3.0e-05, P = 3.0e-05
Identities = 87/293 (29%), Positives = 109/293 (37%)
Query: 123 DRRADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNT 179
++ A+G+ G GA G P G E G P+G GPP S G G T
Sbjct: 784 EKGAEGTAGNDGARGLPGPLGPPGPAGPTG-EKGEPGPRGLVGPPGSRGNPGSRGENGPT 842
Query: 180 STSAYAATQS--GTP-MRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGP 235
+A Q G P ++ P G S GP G S P + P P +G
Sbjct: 843 GAVGFAGPQGPDGQPGVKGEPGEPGQKGDAGSPGPQGLAGSPGP-HGPNGVPGLKGGRGT 901
Query: 236 GYDP--TKGPGYDAQKGS-NYDAQRGPNYDIHRGPSYDPQRGLGYDM-------QRGPNY 285
P T PG + G GP + P + GL D RGP
Sbjct: 902 QGPPGATGFPGSAGRVGPPGPPGAPGPAGPLGE-PGKEGPPGLRGDPGSHGRVGDRGPA- 959
Query: 286 DMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYD 342
GPG + PG D Q GP + P+ QRG G QRG+ G P+
Sbjct: 960 GPPGGPGDKGD--PGEDGQPGP--DGPPGPAGTTGQRGIVGMPGQRGERGMPGLPGPAGT 1015
Query: 343 PSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP-RGGNPARR 393
P + G GAP P G V PP +N P G P +G+ G P R G R
Sbjct: 1016 PGK-VGPTGAPGDKGPPGPVGPPGSNGPVGEPGPEGPAGNDGTPGRDGAVGER 1067
>FB|FBgn0052685 [details] [associations]
symbol:ZAP3 species:7227 "Drosophila melanogaster"
[GO:0005634 "nucleus" evidence=IEA] [GO:0008157 "protein
phosphatase 1 binding" evidence=IPI] [GO:0048812 "neuron projection
morphogenesis" evidence=IMP] InterPro:IPR026314 GO:GO:0005634
EMBL:AE014298 PANTHER:PTHR13413 GeneTree:ENSGT00440000039837
FlyBase:FBgn0052685 RefSeq:NP_727393.1 UniGene:Dm.10734
ProteinModelPortal:Q9W2Y5 SMR:Q9W2Y5 IntAct:Q9W2Y5 MINT:MINT-741898
STRING:Q9W2Y5 EnsemblMetazoa:FBtr0071489 GeneID:31942
KEGG:dme:Dmel_CG32685 UCSC:CG32685-RC InParanoid:Q9W2Y5
PhylomeDB:Q9W2Y5 GenomeRNAi:31942 NextBio:776058
ArrayExpress:Q9W2Y5 Bgee:Q9W2Y5 Uniprot:Q9W2Y5
Length = 1884
Score = 136 (52.9 bits), Expect = 3.0e-05, P = 3.0e-05
Identities = 77/285 (27%), Positives = 109/285 (38%)
Query: 118 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPS-ATTAGVVGAG 176
N+ N ++ D +T N E + P G +G G G GP + + V G
Sbjct: 994 NSGNENKSQDAGDSVSTNNGEKPDNNGPPGGFGPGNGPGGGPGSGPGQNDGSRFDVFGPN 1053
Query: 177 PNTSTSAYAATQSGTPMRAAYDI---PRGPGYEASKGPGYDASKAPSYD--PTKGPSYDP 231
+ + +G P P GPG + GP + + P P P+ P
Sbjct: 1054 QVSGNNFIDLDNNGPPGFGPPGRNFGPNGPGPRGNFGPNFGHNFGPRGPGGPFIRPN-GP 1112
Query: 232 AKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP 291
GPG P GP + G N+ GPN+ GP++ P+ G RGP+ GP
Sbjct: 1113 LPGPG--PNFGPHF-RPNGPNF----GPNF----GPNFGPRPGSRNFGPRGPD-----GP 1156
Query: 292 -GYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAP--SYDPSRGTG 348
G PG D GP + R P P GPG++++ G + P G G
Sbjct: 1157 FG------PGRDDFGGPPFGGPR-PHMGPN-GPGHNMRGFNGGPISDNPFRRQGGPPGPG 1208
Query: 349 FDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPARR 393
F GA P + P N +G+ P G G GGN R+
Sbjct: 1209 FGNDDLGAGPP-RGPRNFGN-RFGN---PGGGGGGGGGGGNNNRK 1248
>UNIPROTKB|P08125 [details] [associations]
symbol:COL10A1 "Collagen alpha-1(X) chain" species:9031
"Gallus gallus" [GO:0005581 "collagen" evidence=IEA]
InterPro:IPR008983 HOGENOM:HOG000085653 HOVERGEN:HBG108220
GO:GO:0005581 Gene3D:2.60.120.40 InterPro:IPR001073
InterPro:IPR008160 Pfam:PF00386 Pfam:PF01391 PRINTS:PR00007
SMART:SM00110 SUPFAM:SSF49842 PROSITE:PS50871 eggNOG:NOG114228
OrthoDB:EOG4FFD29 EMBL:M13496 EMBL:J04194 IPI:IPI00600819
PIR:S23297 ProteinModelPortal:P08125 SMR:P08125 STRING:P08125
InParanoid:P08125 Reactome:REACT_132934 PMAP-CutDB:P08125
Uniprot:P08125
Length = 674
Score = 131 (51.2 bits), Expect = 3.1e-05, P = 3.1e-05
Identities = 91/293 (31%), Positives = 116/293 (39%)
Query: 125 RADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTSA 183
+ D GA G + P G+ E G G P GPP A G G GP
Sbjct: 227 KGDRGLPGARGEAGIPGPQGPPGEPG-EVGIGKPGPMGPPGPAGIPGAKGLPGP------ 279
Query: 184 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAK-GP-GYDPT 240
A G+P + P PG + +GP G P D +GP+ P + GP G
Sbjct: 280 --AGLPGSPGLPGFGKPGLPGMKGHRGPEGPPGFPGPKGD--QGPAGVPGELGPAGPQGN 335
Query: 241 KGP-GYDAQKGSNYDAQRGPNYDIHR-GPSYDP----QRGL-GYDMQRGPNYDMQRG-PG 292
GP G G N GP D+ GP+ P +RGL G D + P Y ++G PG
Sbjct: 336 MGPQGLKGLPGEN--GLPGPKGDMGPVGPAGFPGAKGERGLPGLDGK--PGYPGEQGLPG 391
Query: 293 YETQRVPGYDVQRGPVYEAQRA--PSYI-PQ--RG-PGYDLQRGQGYDMRRAPSYDPS-R 345
+ PG Q+G A P + PQ +G PG + + G R PS P R
Sbjct: 392 PKGH--PGLPGQKGDTGHAGHPGLPGPVGPQGVKGVPGINGEPGP-----RGPSGIPGVR 444
Query: 346 GT----GFDGAP--RGAAPHGQVPPPLNNV------PYGSATPPARSG-SGQP 385
G G GAP +G A +P P V P G PP G SG+P
Sbjct: 445 GPIGPPGMPGAPGAKGEAGAPGLPGPAGIVTKGLRGPMGPLGPPGPKGNSGEP 497
>UNIPROTKB|G5EF87 [details] [associations]
symbol:swsn-1 "SWI3-like protein" species:6239
"Caenorhabditis elegans" [GO:0042802 "identical protein binding"
evidence=IPI] [GO:0005515 "protein binding" evidence=IPI]
InterPro:IPR001005 InterPro:IPR007526 InterPro:IPR009057
Pfam:PF00249 Pfam:PF04433 PROSITE:PS50934 SMART:SM00717
GO:GO:0005634 GO:GO:0009792 GO:GO:0002009 GO:GO:0040007
GO:GO:0040010 GO:GO:0002119 GO:GO:0003677 GO:GO:0040018
Gene3D:1.10.10.10 InterPro:IPR011991 GO:GO:0003682
Gene3D:1.10.10.60 SUPFAM:SSF46689 GO:GO:0046662 GO:GO:0040035
InterPro:IPR017884 PROSITE:PS51293 GO:GO:0040027 GO:GO:0035262
EMBL:AL110477 KO:K11649 GeneTree:ENSGT00390000018166 EMBL:AF230279
PIR:T26449 RefSeq:NP_001256906.1 UniGene:Cel.7072 SMR:G5EF87
IntAct:G5EF87 EnsemblMetazoa:Y113G7B.23 GeneID:180324
KEGG:cel:CELE_Y113G7B.23 CTD:180324 WormBase:Y113G7B.23a
OMA:HFDELEQ NextBio:908892 Uniprot:G5EF87
Length = 789
Score = 131 (51.2 bits), Expect = 3.8e-05, P = 3.8e-05
Identities = 71/248 (28%), Positives = 92/248 (37%)
Query: 156 GVPQGH---GPP---PSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASK 209
G+P G GPP P + A P ++ AAT + P + P+ P +A+
Sbjct: 551 GLPPGFEVTGPPQPTPQVQISAQEAAIPEKMDTSEAATAARPP--STPQAPQAPPVQAAP 608
Query: 210 GPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDI-HRGPS 268
P A +AP P +Y GPG P + Y Q+G Y P H+
Sbjct: 609 AP-VQAPQAPQAPPQ---AYQGYGGPGGPPQQAYRYPPQQGQQYSPYPPPQQQQQHQAQQ 664
Query: 269 YDPQRGLGYDMQ-RGPNYDMQRGPGYETQRVPG--YDVQRGPVYEAQRAPSYIPQRGPGY 325
Q G +GP Q Y PG Y G + QR P Y Q PG
Sbjct: 665 AQSQAHYGPPGGGQGPPPPPQGQQYYGGPPPPGQPYGPPGGYPPQQQRPP-YQAQPYPGP 723
Query: 326 ---DLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS 382
QRG GY P P G P P+GQ+PPP P+G P + G
Sbjct: 724 PPPQQQRGYGYP----PPPQP-------GHPY-QQPYGQMPPP----PHGQYQPQQQQGG 767
Query: 383 GQ-PRGGN 389
P GG+
Sbjct: 768 PMGPPGGH 775
>WB|WBGene00000677 [details] [associations]
symbol:col-103 species:6239 "Caenorhabditis elegans"
[GO:0042302 "structural constituent of cuticle" evidence=IEA]
[GO:0016021 "integral to membrane" evidence=IEA] [GO:0040011
"locomotion" evidence=IMP] InterPro:IPR002486 Pfam:PF01484
SMART:SM01088 GO:GO:0040011 GeneTree:ENSGT00690000102663
GO:GO:0042302 HOGENOM:HOG000085656 EMBL:FO081484 PIR:E88633
RefSeq:NP_499982.1 ProteinModelPortal:O45114 STRING:O45114
EnsemblMetazoa:F56B3.1 GeneID:176901 KEGG:cel:CELE_F56B3.1
UCSC:F56B3.1 CTD:176901 WormBase:F56B3.1 eggNOG:NOG301529
InParanoid:O45114 OMA:SNTCPPG NextBio:894512 Uniprot:O45114
Length = 371
Score = 126 (49.4 bits), Expect = 4.4e-05, P = 4.4e-05
Identities = 87/287 (30%), Positives = 103/287 (35%)
Query: 119 APNVDRRA------DGSYGGATGNSE-NETSGRPVGQNA---YEDGYGVPQGHGPPPSAT 168
APN ++R G YGG G + G VG Y G+G GHG
Sbjct: 63 APNREKRGYAQYGGGGGYGGGHGGAAVGGGYGGAVGGGGGGGYGGGHG--GGHGGAVGGG 120
Query: 169 TAGVVGAGPNTSTSAYAAT----QSGTPMRAAYD-IPRGPGYEASKGPGYDASKAPSYDP 223
G G G S + T G P +A D +P PG S G S S
Sbjct: 121 YGGGGGGGGGCQCSPSSNTCPPGPRGPPGQAGLDGLPGAPGQPGSNGGA--GSNGASEGS 178
Query: 224 TKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGP 283
G PA PG P GP A + N D Q G PS+ G+G GP
Sbjct: 179 AGGCKTCPAGPPG--PP-GPAGQAGRPGN-DGQPG-------APSFGG--GVGAPGAPGP 225
Query: 284 NYDM-QRG-PGYETQRV-PGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS 340
D G PG Q PG + Q G P+ P PG + G GY + P
Sbjct: 226 AGDAGSPGQPGAPGQPGRPGKNAQGGSSRPGPPGPAG-PPGPPGNNGAPGGGYGV--GPP 282
Query: 341 YDPSRGTGFDGAPRGAAPHGQVPPPLNN-VPYGSAT--P-PARSGSG 383
P +G GAP P GQ P N+ P A P P R G G
Sbjct: 283 GPPGP-SGRPGAPGQPGPDGQPGAPGNDGTPGTDAAYCPCPGRGGGG 328
>RGD|628797 [details] [associations]
symbol:Prpmp5 "proline-rich protein MP5" species:10116 "Rattus
norvegicus" [GO:0005576 "extracellular region" evidence=IEA]
RGD:628797 GO:GO:0005576 InterPro:IPR026086 PANTHER:PTHR23203
CTD:5542 KO:K13911 EMBL:L17318 EMBL:M11899 IPI:IPI00187926
PIR:B48013 RefSeq:NP_742062.1 UniGene:Rn.29950 GeneID:257651
KEGG:rno:257651 UCSC:RGD:628797 NextBio:624204
Genevestigator:P10165 Uniprot:P10165
Length = 295
Score = 124 (48.7 bits), Expect = 4.5e-05, P = 4.5e-05
Identities = 63/200 (31%), Positives = 77/200 (38%)
Query: 200 PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQR-- 257
P G + PG + + P P GP P +GP P GP Q GS
Sbjct: 101 PPAAGPQRPPQPG--SPQGPP--PPGGPQQRPPQGP--PPQGGPQRPPQPGSPQGPPPPG 154
Query: 258 GPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP---GYETQRVP-GYDVQRGPVYEAQR 313
GP +GP PQ G QR P +GP G QR P G Q GP QR
Sbjct: 155 GPQQRPPQGPP--PQGG----PQRPPQPGSPQGPPPPGGPQQRAPQGPPPQGGP----QR 204
Query: 314 APSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPP--PLNNVPY 371
P +GP G +R P P +G G P+ +P G PP P P
Sbjct: 205 PPQPGSPQGPP-----PPGGPQQRPPQGPPPQG-GPQRPPQPGSPQGPPPPGGPQQRPPQ 258
Query: 372 GSATPPARSGSGQP-RGGNP 390
G PP + G +P + GNP
Sbjct: 259 G---PPPQGGPQRPPQPGNP 275
>UNIPROTKB|E2RA07 [details] [associations]
symbol:EWSR1 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0008270 "zinc ion binding" evidence=IEA]
[GO:0005622 "intracellular" evidence=IEA] [GO:0003676 "nucleic acid
binding" evidence=IEA] [GO:0000166 "nucleotide binding"
evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358
PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360 SMART:SM00547
GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
GO:GO:0005622 GeneTree:ENSGT00530000063105 OMA:EGTSTGY
EMBL:AAEX03014786 EMBL:AAEX03014787 Ensembl:ENSCAFT00000019384
Uniprot:E2RA07
Length = 671
Score = 117 (46.2 bits), Expect = 4.6e-05, Sum P(2) = 4.6e-05
Identities = 63/238 (26%), Positives = 87/238 (36%)
Query: 128 GSYGGATGNSENETSGRPV-GQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 180
G+YG T S + GQ AY YG P G+ P P A + V G G +T+
Sbjct: 43 GTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPAGYTTPTAPQAYSQPVQGYGTGAYDTT 102
Query: 181 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG-YD 238
T+ TQ+ ++AY P P Y + P A P + P G Y+
Sbjct: 103 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPARPQDGNKPAETSQPQSSTGGYN 160
Query: 239 -PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
P+ G G Q +Y G P + PSY P R ++ Y R Y +
Sbjct: 161 QPSLGYG---QSNYSYPQVPGSYPMQPVTAPPSYPPTR---FNSSSLKLYHYSRS--YSS 212
Query: 296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 353
+ YD PS Q+ Y Q Y + SY P G+ + AP
Sbjct: 213 TQPTSYDQSSYSQQNTYGQPSSYGQQS-SYGQQ--SSYGQQPPTSYPPQTGS-YSQAP 266
Score = 57 (25.1 bits), Expect = 4.6e-05, Sum P(2) = 4.6e-05
Identities = 19/46 (41%), Positives = 21/46 (45%)
Query: 354 RGAAP--HGQ-VPPPLNNVPYGSATP--PA--RSGSGQPRGGNPAR 392
RG P G+ +PPPL P G P P G G RGG P R
Sbjct: 470 RGGMPPREGRGMPPPLRGGPGGPGGPGGPMGRMGGRGGDRGGFPPR 515
>ZFIN|ZDB-GENE-030131-8373 [details] [associations]
symbol:col10a1 "collagen, type X, alpha 1"
species:7955 "Danio rerio" [GO:0005576 "extracellular region"
evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
InterPro:IPR008983 ZFIN:ZDB-GENE-030131-8373 GO:GO:0005581
Gene3D:2.60.120.40 InterPro:IPR001073 InterPro:IPR008160
Pfam:PF00386 Pfam:PF01391 PRINTS:PR00007 SMART:SM00110
SUPFAM:SSF49842 PROSITE:PS50871 GeneTree:ENSGT00700000104270
OMA:KPGHGSP EMBL:CU306817 IPI:IPI00491103
Ensembl:ENSDART00000091021 ArrayExpress:F1QXD5 Bgee:F1QXD5
Uniprot:F1QXD5
Length = 655
Score = 129 (50.5 bits), Expect = 5.0e-05, P = 5.0e-05
Identities = 81/269 (30%), Positives = 107/269 (39%)
Query: 145 PVGQNAYEDGYGVPQGHGPP----PSATTA-GVVGA--GPNTSTSAYAATQSGTPMRAAY 197
P G A +DG G+P GPP P+ +A G G+ GP + A G
Sbjct: 64 PPGP-AGQDGEGLPGPQGPPGAPGPAGYSAPGKPGSPGGPGKPGATGAPGLKGDTGAPGL 122
Query: 198 DIPRG-PGYEASKGP-GYDASKAPSYDPTKGPSYDP-AKGP-GYDPTKG----PGYDAQK 249
PRG PG S GP G A+ P GP+ P A GP G KG PG QK
Sbjct: 123 QGPRGMPGPSGSPGPAGISATGKP------GPAGLPGAMGPRGEQGFKGHPGIPGLPGQK 176
Query: 250 GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG-PGYETQR-VPGYDVQRGP 307
G +GP + RGP+ P G G + G PG + PG D + GP
Sbjct: 177 GEMGVGVQGPAGE--RGPT-GPVGPSGKPGAPGVGLPGKPGAPGEAGKSGSPGRDGESGP 233
Query: 308 VY-EAQRAPSYIPQRG-PGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPP 364
+ + Q+ + P G PG + G G P P +G GAP G +G+ P
Sbjct: 234 MGPQGQKGQTGAPGVGIPGKPGENGAPGMPGPTGPK-GPQGASGAPGAP-GVPGYGK--P 289
Query: 365 PLNNVPYGSATPPARSGSGQPRGGNPARR 393
N + P + +GQ G P +
Sbjct: 290 GENGLKGDRGVPGSPGTTGQK--GEPGAK 316
>UNIPROTKB|Q04118 [details] [associations]
symbol:PRB3 "Basic salivary proline-rich protein 3"
species:9606 "Homo sapiens" [GO:0005576 "extracellular region"
evidence=NAS] [GO:0051636 "Gram-negative bacterial cell surface
binding" evidence=NAS] [GO:0008150 "biological_process"
evidence=ND] GO:GO:0005576 GO:GO:0051636 InterPro:IPR026086
PANTHER:PTHR23203 EMBL:X07637 EMBL:X07881 EMBL:BC096209
EMBL:BC096210 EMBL:BC096211 IPI:IPI00006699 PIR:A36298 PIR:B36298
PIR:S10889 RefSeq:NP_006240.4 UniGene:Hs.73031 STRING:Q04118
DMDM:229462763 PaxDb:Q04118 PRIDE:Q04118 Ensembl:ENST00000381842
GeneID:5544 KEGG:hsa:5544 CTD:5544 GeneCards:GC12M011418
H-InvDB:HIX0201930 HGNC:HGNC:9339 MIM:168840 neXtProt:NX_Q04118
PharmGKB:PA33701 HOGENOM:HOG000060075 GenomeRNAi:5544 NextBio:21478
ArrayExpress:Q04118 Bgee:Q04118 CleanEx:HS_PRB3
Genevestigator:Q04118 GermOnline:ENSG00000197870 Uniprot:Q04118
Length = 309
Score = 124 (48.7 bits), Expect = 5.0e-05, P = 5.0e-05
Identities = 79/271 (29%), Positives = 99/271 (36%)
Query: 137 SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAA 196
S + SG+P G+ G PQ PPP G G N S G P R
Sbjct: 28 SPSVISGKPEGRRP--QGGNQPQ-RTPPPPGKPEGRPPQGGNQS--------QGPPPRPG 76
Query: 197 YDIPRGP---GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNY 253
P GP G S+GP K P P +G + ++GP P K G Q G N
Sbjct: 77 K--PEGPPPQGGNQSQGPPPRPGK-PEGQPPQGGNQ--SQGPPPRPGKPEGPPPQ-GGNQ 130
Query: 254 DAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP----GYETQRVPGYDVQ-RGPV 308
P GP P +G P+ GP G ++Q P + GP
Sbjct: 131 SQGPPPRPGKPEGP---PPQGGNQSQGPPPHPGKPEGPPPQGGNQSQGPPPRPGKPEGPP 187
Query: 309 YEAQRAPSYIPQRGPGY-DLQRGQGYDMRRAPSYDPSR--GTGFDGA--PRGAAPH-G-- 360
+ P R PG + QG + + P P + G+ G P+G PH G
Sbjct: 188 PQGGNQSQGPPPR-PGKPEGPPPQGGNQSQGPPPRPGKPEGSPSQGGNKPQGPPPHPGKP 246
Query: 361 QVPPPLN-NVPYGSATPPARSGSGQPRGGNP 390
Q PPP N P PP R P GGNP
Sbjct: 247 QGPPPQEGNKPQ-RPPPPGRPQGPPPPGGNP 276
>TAIR|locus:2204400 [details] [associations]
symbol:AT1G76010 species:3702 "Arabidopsis thaliana"
[GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005576
"extracellular region" evidence=ISM] [GO:0008150
"biological_process" evidence=ND] [GO:0005829 "cytosol"
evidence=IDA] InterPro:IPR002775 Pfam:PF01918 EMBL:CP002684
GO:GO:0005829 GO:GO:0003676 EMBL:AF412102 EMBL:AY054208
EMBL:AF428441 EMBL:AY124847 IPI:IPI00531013 RefSeq:NP_565124.1
UniGene:At.24580 UniGene:At.67776 UniGene:At.75066 HSSP:P60849
ProteinModelPortal:Q93VA8 SMR:Q93VA8 STRING:Q93VA8 PRIDE:Q93VA8
EnsemblPlants:AT1G76010.1 GeneID:843932 KEGG:ath:AT1G76010
TAIR:At1g76010 HOGENOM:HOG000240806 InParanoid:Q93VA8 OMA:YDGPPQG
PhylomeDB:Q93VA8 ProtClustDB:CLSN2917456 Genevestigator:Q93VA8
Uniprot:Q93VA8
Length = 350
Score = 125 (49.1 bits), Expect = 5.0e-05, P = 5.0e-05
Identities = 70/207 (33%), Positives = 88/207 (42%)
Query: 144 RPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQ---SGTPMRAAYDIP 200
+P+G YE G P G G G G +AY + G +Y
Sbjct: 134 KPMGDIDYEGREGSPGGRGRGRGRGRGR--GRGRGGRGNAYVNVEHEDGGWEREQSYGRG 191
Query: 201 RGPGY-EASKGPGYDASKAP--SYDPTK--GPSYD-PAKGPGYDPTKGPGYDA--QKGSN 252
RG G +S+G G P YD + G YD P + GYD +G GYDA Q
Sbjct: 192 RGRGRGRSSRGRGRGGYNGPPNEYDAPQDGGYGYDAPHEHRGYDD-RG-GYDAPPQGRGG 249
Query: 253 YDAQRGPN-YDIHRGP-SYD--PQ-RGLGYDMQRGPNYDMQRGPGYE--TQRVPGYDVQR 305
YD +G YD +G YD PQ RG GYD GP+ RG GY+ +Q GYD
Sbjct: 250 YDGPQGRGGYDGPQGRRGYDGPPQGRG-GYD---GPSQG--RG-GYDGPSQGRGGYD--- 299
Query: 306 GPVYEAQRAPSYIPQRGPGYDLQRGQG 332
GP +Q Y +G G RG+G
Sbjct: 300 GP---SQGRGGYDGPQGRGRGRGRGRG 323
>UNIPROTKB|F1RZK4 [details] [associations]
symbol:COL10A1 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0005938 "cell cortex" evidence=IEA] [GO:0005581
"collagen" evidence=IEA] InterPro:IPR008983 GO:GO:0005938
GO:GO:0005581 Gene3D:2.60.120.40 InterPro:IPR001073
InterPro:IPR008160 Pfam:PF00386 Pfam:PF01391 PRINTS:PR00007
SMART:SM00110 SUPFAM:SSF49842 PROSITE:PS50871
GeneTree:ENSGT00700000104270 OMA:IKGPPPN EMBL:CU062641
Ensembl:ENSSSCT00000004901 Uniprot:F1RZK4
Length = 675
Score = 129 (50.5 bits), Expect = 5.2e-05, P = 5.2e-05
Identities = 88/296 (29%), Positives = 113/296 (38%)
Query: 123 DRRADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG--AGPN 178
++ A G G G G + GRP G G P G PP G G P
Sbjct: 176 EKGAPGVPGINGQKGETGYGAPGRP-GDRGLPGPQG-PMGPPGPPGVGKRGENGFPGQPG 233
Query: 179 TSTSAYAATQSGTPMRAAYDIPRGP-GYEASKG---PGYD-ASKAPSYDPTKG----PSY 229
+SG P A P+GP G + +G PG A+ P TKG P
Sbjct: 234 IKGDRGFPGESG-P--AGPPGPQGPPGEQGREGIGKPGAPGAAGQPGLPGTKGHPGAPGM 290
Query: 230 -DPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDP-QRGL-GYDMQRGPNYD 286
P PG+ PG Q+G P +GP+ P + GL G RGP
Sbjct: 291 AGPPGAPGFGKPGLPGLKGQRGP-IGLPGAPGAKGEQGPAGHPGEPGLTGPPGSRGP--- 346
Query: 287 MQRGPGYETQRVPGYDVQRGPVYEAQRA-PSYIP----QRGP-GYDLQRGQ-GYDMRRAP 339
+GP + +PG + GP E A P+ P +RGP G D + G G P
Sbjct: 347 --QGP----KGIPGNNGVPGPKGEIGLAGPAGFPGAKGERGPSGLDGKPGYPGEPGLNGP 400
Query: 340 SYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNPARR 393
+P G G P P G +P P+ P G+ P +G G PRG G P R
Sbjct: 401 KGNPGL-PGPKGDPGIGGPPG-LPGPVG--PAGAKGVPGHNGEGGPRGAPGIPGTR 452
>ZFIN|ZDB-GENE-030131-2281 [details] [associations]
symbol:col4a5 "collagen, type IV, alpha 5 (Alport
syndrome)" species:7955 "Danio rerio" [GO:0005201 "extracellular
matrix structural constituent" evidence=IEA] [GO:0005581 "collagen"
evidence=IEA] [GO:0031290 "retinal ganglion cell axon guidance"
evidence=IMP] [GO:0007412 "axon target recognition" evidence=IMP]
[GO:0030198 "extracellular matrix organization" evidence=IMP]
InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
ZFIN:ZDB-GENE-030131-2281 InterPro:IPR016187 SUPFAM:SSF56436
GO:GO:0030198 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
eggNOG:NOG12793 GO:GO:0007412 GO:GO:0031290 GO:GO:0005201
HOVERGEN:HBG004933 HOGENOM:HOG000085652 OrthoDB:EOG45DWPF
Gene3D:2.170.240.10 GeneTree:ENSGT00690000101772 KO:K06237 CTD:1287
OMA:MPMNMEP EMBL:CR354588 EMBL:CR936978 IPI:IPI00835382
RefSeq:NP_001116702.1 UniGene:Dr.77841 SMR:B0UXF7
Ensembl:ENSDART00000073827 GeneID:323561 KEGG:dre:323561
NextBio:20808319 Uniprot:B0UXF7
Length = 1659
Score = 133 (51.9 bits), Expect = 5.6e-05, P = 5.6e-05
Identities = 83/294 (28%), Positives = 100/294 (34%)
Query: 117 MNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPP--PSATTAGVVG 174
M P V R G G+ P GQ + G+P G P P G G
Sbjct: 652 MTVPEVGERGPPGQDGDPGSQGRPGDSGPPGQPGFP---GLPGSKGEPGLPGIGLPGPPG 708
Query: 175 AGPNTSTSAYAATQSGTPMRAAYD-IPRGPGYEASKG-PGYDASKAPSYDPTKGPSYDPA 232
A A + G P R D +P PG SKG PGY P PT P
Sbjct: 709 A-KGFPGIAGSPGGPGIPGRPGLDGLPGQPGLPGSKGDPGYGLPGPPG--PTGSPGIKGG 765
Query: 233 KGPGYDPTKGPGYDAQKGS-NYDAQRGPNYD--IHRGPS-YDPQRGLGYDMQRGPNYDMQ 288
GP D + PG Q G D GP D GP P + +Q P
Sbjct: 766 PGPKGD-SGFPGSPGQPGRPGLDGAPGPKGDAGFPGGPGPRGPPGAPAFGLQGPPG--PP 822
Query: 289 RGPG-YETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG-QGYDMRRAPSYDPS-R 345
PG + VPG + ++G R P + PG+ RG G P P
Sbjct: 823 GAPGSIGSPGVPGANGEKG-----DRGPPGLST--PGFQGDRGISGLPGPPGPVGPPGVP 875
Query: 346 GT-GFDGAPRGAAPHGQV----PPPLNNVPYGSATP--PARSGS-GQP-RGGNP 390
G G DG P G++ PP P P P G G P + GNP
Sbjct: 876 GRPGQDGLPGLPGSKGEMGSMGPPGSKGNPGNPGAPGFPGPKGDDGVPGQSGNP 929
Score = 126 (49.4 bits), Expect = 0.00033, P = 0.00032
Identities = 82/275 (29%), Positives = 97/275 (35%)
Query: 132 GATGNSENETSGR-PVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-G-PNTSTSAYAATQ 188
G G + E R P GQ+ G P GPP G+ G+ G P
Sbjct: 648 GEPGMTVPEVGERGPPGQDGDPGSQGRPGDSGPPGQPGFPGLPGSKGEPGLPGIGLPGPP 707
Query: 189 SGTPMRAAYDIPRGPGYEASKGPGYDASKA-PSYDPTKG-PSYDPAKGPGYDPTKGPGYD 246
P GPG PG D P +KG P Y PG PT PG
Sbjct: 708 GAKGFPGIAGSPGGPGIPGR--PGLDGLPGQPGLPGSKGDPGYGLPGPPG--PTGSPGI- 762
Query: 247 AQKGSNYDAQRGPNYDI-HRGPSYDPQRGLGYDMQRGPNYD--MQRGPGYE-TQRVPGYD 302
KG GP D G P R G D GP D GPG P +
Sbjct: 763 --KGGP-----GPKGDSGFPGSPGQPGRP-GLDGAPGPKGDAGFPGGPGPRGPPGAPAFG 814
Query: 303 VQRGPVYEAQRAPSYIPQRG-PGYDLQRG-QGYDMRRAPSYDPSRG-TGFDGAPRGAAPH 359
+Q GP AP I G PG + ++G +G P + RG +G G P P
Sbjct: 815 LQ-GPP-GPPGAPGSIGSPGVPGANGEKGDRGPPGLSTPGFQGDRGISGLPGPPGPVGPP 872
Query: 360 GQVP--PPLNNVPYGSATPPARSGSGQPRG--GNP 390
G VP P + +P G GS P G GNP
Sbjct: 873 G-VPGRPGQDGLP-GLPGSKGEMGSMGPPGSKGNP 905
>UNIPROTKB|G3N3C9 [details] [associations]
symbol:LDB3 "Uncharacterized protein" species:9913 "Bos
taurus" [GO:0030018 "Z disc" evidence=IEA] [GO:0008092
"cytoskeletal protein binding" evidence=IEA] [GO:0005856
"cytoskeleton" evidence=IEA] [GO:0005080 "protein kinase C binding"
evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
Pfam:PF00595 Pfam:PF00412 InterPro:IPR001478 InterPro:IPR001781
PROSITE:PS00478 PROSITE:PS50023 PROSITE:PS50106 SMART:SM00132
SMART:SM00228 GO:GO:0030018 GO:GO:0005856 GO:GO:0046872
GO:GO:0008270 Gene3D:2.10.110.10 SUPFAM:SSF50156 OMA:CTSQATT
InterPro:IPR006643 SMART:SM00735 GeneTree:ENSGT00700000104411
EMBL:DAAA02062163 Ensembl:ENSBTAT00000065403 Uniprot:G3N3C9
Length = 730
Score = 129 (50.5 bits), Expect = 5.7e-05, P = 5.7e-05
Identities = 54/206 (26%), Positives = 76/206 (36%)
Query: 115 ELMNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG 174
E M P+ + +T + TS P + Y + P P P T +
Sbjct: 353 EYMQDPDEEALRRSRPQASTYSPAVATSPAPAA-HTYSEAPAAP---APKPRVVTTASIR 408
Query: 175 AGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKG 234
P+ A+T S +P A Y P P Y S P Y S P+Y P+ P+Y P+
Sbjct: 409 --PSVYQPVPASTYSPSP-GANYS-PT-P-YTPSPAPAYTPSPTPAYTPSPAPTYSPSPA 462
Query: 235 PGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG-PGY 293
P Y P+ P Y+ S A+ S+ + G + RG P Y
Sbjct: 463 PAYTPSPAPSYNPTLYSGGPAESASRPPWVTDDSFSQKFAPGKTTTTVSKQSLPRGAPAY 522
Query: 294 ETQRVPGYDVQ---RGPVYEAQRAPS 316
T P V RG V A+R P+
Sbjct: 523 -TPPPPAPQVSPLARGTVQRAERFPA 547
>UNIPROTKB|G8ENL4 [details] [associations]
symbol:FUS "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005634 "nucleus"
evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
[GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0000166
"nucleotide binding" evidence=IEA] InterPro:IPR000504
InterPro:IPR001876 InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641
PROSITE:PS01358 PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360
SMART:SM00547 GO:GO:0005634 GO:GO:0005737 GO:GO:0000166
GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
GeneTree:ENSGT00530000063105 EMBL:CU464163 EMBL:JF940526
Ensembl:ENSSSCT00000036326 Uniprot:G8ENL4
Length = 517
Score = 127 (49.8 bits), Expect = 5.8e-05, P = 5.8e-05
Identities = 68/240 (28%), Positives = 93/240 (38%)
Query: 128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAAT 187
G+Y G ++ S +P GQ +Y GYG ++ G NT A +A
Sbjct: 15 GAYPTQPGQGYSQQSNQPYGQQSYS-GYGQSADTSGYGQSSYGSSYGQTQNTGYGAQSAP 73
Query: 188 QSGTPMRAAYDIPRGP--GY-EASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPG 244
Q G Y +G Y + S PGY APS T G ++ GY + G
Sbjct: 74 Q-GYGSTGGYGSGQGSQSSYGQQSSYPGYGQQPAPS--STSGSYGTSSQSSGYGQPQSGG 130
Query: 245 YDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET--QRVPGYD 302
Y Q G Y Q+ +Y + SY+P +G G Q + G G + Q P
Sbjct: 131 YGQQSG--YGGQQ-QSYGQQQ--SYNPPQGYGQQNQYNSSSGGGGGGGGGSYGQDQPSMS 185
Query: 303 VQRGPVYEAQ-RAPSYI--PQ----RGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
G Y Q ++ Y Q RG G G GY+ R + Y+P RG G RG
Sbjct: 186 GGGGGGYGNQDQSGGYGGGQQDRGGRGRGGGSGGGGGYN-RSSGGYEP-RGRGGGRGGRG 243
Score = 117 (46.2 bits), Expect = 0.00074, P = 0.00074
Identities = 63/218 (28%), Positives = 80/218 (36%)
Query: 187 TQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPS-YDPAK-GPGYDPTKGPG 244
TQ T AY G GY Y Y + S Y + G Y T+ G
Sbjct: 7 TQQATQSYGAYPTQPGQGYSQQSNQPYGQQSYSGYGQSADTSGYGQSSYGSSYGQTQNTG 66
Query: 245 YDAQKG-SNYDAQRGPNYDIHRGP--SYDPQRGL-GYDMQRGPN-----YDMQ-RGPGYE 294
Y AQ Y + G Y +G SY Q GY Q P+ Y + GY
Sbjct: 67 YGAQSAPQGYGSTGG--YGSGQGSQSSYGQQSSYPGYGQQPAPSSTSGSYGTSSQSSGYG 124
Query: 295 TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPR 354
+ GY Q G Y Q+ SY Q+ Y+ QGY + Y+ S G G G
Sbjct: 125 QPQSGGYGQQSG--YGGQQQ-SYGQQQS--YNPP--QGYGQQN--QYNSSSGGGGGG--- 172
Query: 355 GAAPHGQVPPPLNNVP---YGSATPPARSGSGQP-RGG 388
G +GQ P ++ YG+ G GQ RGG
Sbjct: 173 GGGSYGQDQPSMSGGGGGGYGNQDQSGGYGGGQQDRGG 210
>RGD|71029 [details] [associations]
symbol:Col3a1 "collagen, type III, alpha 1" species:10116 "Rattus
norvegicus" [GO:0001501 "skeletal system development" evidence=IEP]
[GO:0001568 "blood vessel development" evidence=IEA;ISO] [GO:0005201
"extracellular matrix structural constituent" evidence=IEA]
[GO:0005581 "collagen" evidence=ISO] [GO:0005586 "collagen type III"
evidence=ISO;TAS] [GO:0005615 "extracellular space" evidence=IEA]
[GO:0007160 "cell-matrix adhesion" evidence=IEA] [GO:0007179
"transforming growth factor beta receptor signaling pathway"
evidence=IEA] [GO:0007229 "integrin-mediated signaling pathway"
evidence=IEA] [GO:0007507 "heart development" evidence=IEA]
[GO:0009314 "response to radiation" evidence=IEA] [GO:0018149
"peptide cross-linking" evidence=IEA] [GO:0030199 "collagen fibril
organization" evidence=IEA;ISO] [GO:0031012 "extracellular matrix"
evidence=ISO] [GO:0032964 "collagen biosynthetic process"
evidence=IEA] [GO:0034097 "response to cytokine stimulus"
evidence=IEA] [GO:0042060 "wound healing" evidence=IEA] [GO:0043206
"extracellular fibril organization" evidence=IEA] [GO:0043588 "skin
development" evidence=IEA] [GO:0046332 "SMAD binding"
evidence=IEA;ISO] [GO:0046872 "metal ion binding" evidence=IEA]
[GO:0048407 "platelet-derived growth factor binding" evidence=IEA]
[GO:0048565 "digestive tract development" evidence=IEA;ISO]
[GO:0050777 "negative regulation of immune response" evidence=IEA]
[GO:0071230 "cellular response to amino acid stimulus"
evidence=IEA;ISO] InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093
Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184
PROSITE:PS51461 SMART:SM00038 SMART:SM00214 RGD:71029 GO:GO:0043588
GO:GO:0005615 GO:GO:0007507 GO:GO:0046872 GO:GO:0034097
GO:GO:0030199 GO:GO:0001501 GO:GO:0007179 GO:GO:0007229
GO:GO:0007160 InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793
GO:GO:0042060 GO:GO:0001568 GO:GO:0048565 GO:GO:0050777
GO:GO:0009314 GO:GO:0018149 GO:GO:0032964 GO:GO:0071230
GO:GO:0043206 GO:GO:0005201 GeneTree:ENSGT00660000095287
HOGENOM:HOG000085654 HOVERGEN:HBG004933 KO:K06236 OrthoDB:EOG4FTW1C
CTD:1281 OMA:EGSPGHP GO:GO:0005586 EMBL:BC087039 EMBL:X70369
EMBL:AJ005395 EMBL:M21354 IPI:IPI00366944 PIR:S41067
RefSeq:NP_114474.1 UniGene:Rn.3247 ProteinModelPortal:P13941
IntAct:P13941 STRING:P13941 PRIDE:P13941 Ensembl:ENSRNOT00000004956
GeneID:84032 KEGG:rno:84032 UCSC:RGD:71029 InParanoid:P13941
NextBio:616623 Genevestigator:P13941 GermOnline:ENSRNOG00000003357
Uniprot:P13941
Length = 1463
Score = 132 (51.5 bits), Expect = 6.3e-05, P = 6.3e-05
Identities = 76/261 (29%), Positives = 102/261 (39%)
Query: 147 GQNAYEDGYGVPQGHGPPPSATTAGVVGA----GPNTSTSAYAAT-QSGTPMRAAYDIPR 201
G + G P GPP +A G GA GP S + + Q G P + +
Sbjct: 320 GNDGARGSDGQPGPPGPPGTAGFPGSPGAKGEVGPAGSPGSNGSPGQRGEPGPQGHAGAQ 379
Query: 202 GP-GYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGP 259
GP G + G PG P+ P P A+GP P G Q+G + + G
Sbjct: 380 GPPGPPGNNGSPGGKGEMGPAGIPG-APGLLGARGPP-GPAGANGAPGQRGPS--GEPGK 435
Query: 260 NYDIHRGPSYDPQRG-LGYDMQRGPN-YDMQRG-PGYE-TQRVPGYDVQRG-PVYEAQRA 314
N P +RG G GP D + G PG VPG +RG P +
Sbjct: 436 N-GAKGEPGARGERGEAGSPGIPGPKGEDGKDGSPGEPGANGVPGNPGERGAPGFRGPAG 494
Query: 315 PSYIP-QRGPGYDLQRGQGYDMRRAPSYDPSR-GT-------GFDGAPRGAAPHGQVPPP 365
P+ P ++GP + + G G R + +P R GT G G+P G G+ PP
Sbjct: 495 PNGAPGEKGPAGE-RGGPGPAGPRGVAGEPGRDGTPGGPGIRGMPGSPGGPGNDGKPGPP 553
Query: 366 LNNVPYGSATPPARSGS-GQP 385
+ G PP SG GQP
Sbjct: 554 GSQGESGRPGPPGPSGPRGQP 574
Score = 126 (49.4 bits), Expect = 0.00028, P = 0.00028
Identities = 82/284 (28%), Positives = 103/284 (36%)
Query: 128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP----NTSTS 182
G GG G + P G + + G P GPP AG G GP S
Sbjct: 166 GGMGGYPGPAGPPGPPGPPGSSGHPGSPGSPGYQGPPGEPGQAGPAGPPGPPGAIGPSGP 225
Query: 183 AYAATQSGTPMRAAY-DIPRGPGYEASKG-PGYDASKAP-SYDPTKGPSYDPAKGPGYDP 239
A +SG P R +P PG + G PG+ K +D G + PG
Sbjct: 226 AGKDGESGRPGRPGERGLPGPPGIKGPAGIPGFPGMKGHRGFDGRNGEKGETG-APGLKG 284
Query: 240 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGP-GYE-T 295
G PG + G RG + R P G G D RG D Q GP G T
Sbjct: 285 ENGLPGDNGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQPGPPGPPGT 339
Query: 296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
PG +G V A S PG QRG+ A + P G +G+P G
Sbjct: 340 AGFPGSPGAKGEVGPAGSPGS---NGSPG---QRGEPGPQGHAGAQGPPGPPGNNGSPGG 393
Query: 356 AAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RG--GNPAR 392
G P + P G+ PP +G+ G P RG G P +
Sbjct: 394 KGEMG--PAGIPGAPGLLGARGPPGPAGANGAPGQRGPSGEPGK 435
Score = 122 (48.0 bits), Expect = 0.00077, P = 0.00077
Identities = 80/272 (29%), Positives = 99/272 (36%)
Query: 125 RADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAY 184
+ +G GA G P G + G P G G G+ G P + +
Sbjct: 832 KGEGGPPGAAGPPGGSGPAGPPGPQGVKGERGSPGGPGAAGFPGGRGLPGP-PGNNGNPG 890
Query: 185 AATQSGTPMRAAYDIPRGP-GYEASKG-PGYDASKAPSYDP-TKGPSYDPAKGPGYDPTK 241
SG P + D P GP G S G PG K + P KGP A+GP P
Sbjct: 891 PPGPSGAPGK---DGPPGPAGNSGSPGNPGVAGPKGDAGQPGEKGPP--GAQGPPGSP-- 943
Query: 242 GP-GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPN-YDMQRGPGYETQRVP 299
GP G G+ A P RG S PQ G + G + ++ +RGP Q +P
Sbjct: 944 GPLGIAGLTGARGLAGP-PGMPGPRG-SPGPQGIKGESGKPGASGHNGERGPP-GPQGLP 1000
Query: 300 GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPH 359
G Q G E R + PG D G D R + P G GAP P
Sbjct: 1001 G---QPGTAGEPGRDGNPGSDGQPGRDGSPGGKGD--RGENGSP----GAPGAPGHPGPP 1051
Query: 360 GQVPPPLNNVPYGSATPPARSGSGQPRGGNPA 391
G V P N G P SG+ P G A
Sbjct: 1052 GPVGPSGKNGDRGETGPAGPSGAPGPAGARGA 1083
>ZFIN|ZDB-GENE-050302-9 [details] [associations]
symbol:col2a1b "collagen type II, alpha-1b"
species:7955 "Danio rerio" [GO:0003674 "molecular_function"
evidence=ND] [GO:0033333 "fin development" evidence=IMP]
[GO:0033334 "fin morphogenesis" evidence=IMP] [GO:0005581
"collagen" evidence=IEA] EMBL:HF563615 EMBL:HF563616 EMBL:HF563617
Uniprot:L0S5L0
Length = 1493
Score = 132 (51.5 bits), Expect = 6.4e-05, P = 6.4e-05
Identities = 82/282 (29%), Positives = 99/282 (35%)
Query: 123 DRRADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNT 179
+R G G GA GN P G G P G P + AG GA GP
Sbjct: 337 ERGRPGPSGASGARGNDGLPGGAGPPGPVGTAGSPGFP---GSPGAKGEAGPTGARGPEG 393
Query: 180 STSAYAATQSGTPMRAAYDIPRG-PGYEASKG-PGYDASK-APSYDPTKG-PSYDPAKGP 235
+ +SG P + P G G S G PG S AP G P P P
Sbjct: 394 AQGPRG--ESGVPGASG---PSGVSGNPGSDGMPGAKGSVGAPGIGGAPGFPG--PRGPP 446
Query: 236 GYDPTKGP-GYDAQKGSN----YDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 290
G GP G Q G + + + GP +I G + +RGP +
Sbjct: 447 GPQGATGPLGPKGQSGDSGLAGFKGEAGPKGEIGNAGLQGAPGPAGEEGKRGPRGEPGAA 506
Query: 291 --PGYETQR-VPGYDVQRG-PVYEAQRAPSYIP-QRGP-GYDLQRGQGYDMRRAPSYDPS 344
PG +R PG RG P + P P +RGP G +G G D R
Sbjct: 507 GPPGPTGERGTPG---NRGFPGQDGLAGPKGAPGERGPAGVSGPKGAGGDPGRPGEPGLP 563
Query: 345 RGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSG-SGQP 385
G G P A P G+V P G PP G GQP
Sbjct: 564 GARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGVRGQP 605
Score = 131 (51.2 bits), Expect = 8.2e-05, P = 8.2e-05
Identities = 88/298 (29%), Positives = 112/298 (37%)
Query: 126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYA 185
A G G A E G+P G + ++ G+P GPP G G P + A A
Sbjct: 646 AAGPPGPAGSAGERGEQGQP-GPSGFQ---GLPGPPGPPGEGGKPGDQGV-PGEAGGAGA 700
Query: 186 AT---QSGTPMRAAYDIPRG-PGYEASKG-PGYDASKAPSYDP--TKGPSYDPA-KG-PG 236
+ G P P+G G G PG D K P T G P +G PG
Sbjct: 701 TGPRGERGFPGERGGAGPQGLQGPRGLPGTPGTDGPKG-GVGPAGTAGAQGPPGLQGMPG 759
Query: 237 YDPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL----GYDMQRGPNYDM-QRG 290
T G PG +G N D +GP P D RGL G GPN + + G
Sbjct: 760 ERGTSGNPGPKGDRGDNGD--KGPE----GAPGKDGSRGLTGPIGPTGPAGPNGEKGESG 813
Query: 291 P----GYE-TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSR 345
P G T+ VPG + GP A A PG ++G+G A + P
Sbjct: 814 PAGPSGVAGTRGVPGDRGETGPPGPAGFAGPPGADGQPGVKGEQGEGGQKGDAGAPGPQG 873
Query: 346 GTGFDG--APRGAA-PHG----QVPPPLNNVP--YGSATPPARSGSGQPRG--GNPAR 392
+G G P G + P G Q PP P G PP +G+ P G G P +
Sbjct: 874 PSGAPGPQGPTGVSGPKGARGAQGPPGATGFPGAAGRVGPPGPNGNPGPAGPAGPPGK 931
Score = 124 (48.7 bits), Expect = 0.00048, P = 0.00048
Identities = 78/259 (30%), Positives = 90/259 (34%)
Query: 147 GQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAAT-QSGTPMRAAYDIPRGPG 204
G+ + G P GP + G G +GP + A +G P A P GP
Sbjct: 858 GEGGQKGDAGAPGPQGPSGAPGPQGPTGVSGPKGARGAQGPPGATGFPGAAGRVGPPGPN 917
Query: 205 YEASKGPGYDASKAPSYDPTKGPSYD--PAKGPGYDPTKGP-GYDAQKGS-NYDAQRGPN 260
+ GP A P D KG D P PG +G G +KG D GP
Sbjct: 918 --GNPGPAGPAGP-PGKDGPKGVRGDGGPPGRPGDAGLRGSAGPAGEKGDPGEDGPHGP- 973
Query: 261 YDIHRGPS-YDPQRGL-GYDMQRGPN-YDMQRGPGYET--QRVPGYDVQRGPVYEAQRAP 315
D GP QRG+ G QRG + GP E Q PG RGP AP
Sbjct: 974 -DGPAGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPGGPGDRGPPGPVG-AP 1031
Query: 316 SYIPQRG-PGYDLQRGQGYDMRRAPS--YDPSRG----TGFDGAPRGAAPHGQVPPPLNN 368
G PG + G R S RG G GAP G G V P
Sbjct: 1032 GLTGAAGEPGREGNPGSDGPPGRDGSAGIKGDRGDTGPAGAPGAPGGPGAPGPVGPTGKQ 1091
Query: 369 VPYGSATPPARSGSGQPRG 387
G A P SG P G
Sbjct: 1092 GDRGEAGPHGPSGPPGPAG 1110
Score = 123 (48.4 bits), Expect = 0.00061, P = 0.00061
Identities = 79/280 (28%), Positives = 96/280 (34%)
Query: 123 DRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQG-HGPPPSATTAGVVGAGPNTST 181
D+ G GGA G P G+ G G PQG GP T G G
Sbjct: 688 DQGVPGEAGGAGATGPRGERGFP-GERG---GAG-PQGLQGPRGLPGTPGTDGPKGGVGP 742
Query: 182 SAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKG-PG-YDP 239
+ A Q G P +P G + GP D P P D ++G G P
Sbjct: 743 AGTAGAQ-GPP--GLQGMPGERGTSGNPGPKGDRGDNGDKGPEGAPGKDGSRGLTGPIGP 799
Query: 240 TKGPGYDAQKGSNYDAQRGPNYDIHRGPS-YDPQRGLGYDM-QRGPN--YDMQRGPGYET 295
T G + +KG + GP GPS RG+ D + GP PG +
Sbjct: 800 TGPAGPNGEKGES-----GP-----AGPSGVAGTRGVPGDRGETGPPGPAGFAGPPGADG 849
Query: 296 QR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPR 354
Q V G + G +A P PG G + P TGF GA
Sbjct: 850 QPGVKGEQGEGGQKGDAGAPGPQGPSGAPGPQGPTGVSGPKGARGAQGPPGATGFPGAAG 909
Query: 355 GAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPARR 393
P G P N P G A PP + G RG G P R
Sbjct: 910 RVGPPG---PNGNPGPAGPAGPPGKDGPKGVRGDGGPPGR 946
>FB|FBgn0038642 [details] [associations]
symbol:Muc91C "Mucin 91C" species:7227 "Drosophila
melanogaster" [GO:0005201 "extracellular matrix structural
constituent" evidence=ISM] [GO:0031012 "extracellular matrix"
evidence=ISM] [GO:0022008 "neurogenesis" evidence=IMP]
EMBL:AE014297 GO:GO:0022008 eggNOG:NOG12793 GO:GO:0031012
GO:GO:0005201 GeneTree:ENSGT00700000104744 RefSeq:NP_650744.1
UniGene:Dm.10760 EnsemblMetazoa:FBtr0083687 GeneID:42246
KEGG:dme:Dmel_CG7709 UCSC:CG7709-RA CTD:42246 FlyBase:FBgn0038642
InParanoid:Q9VE45 OMA:GPYPSAP PhylomeDB:Q9VE45 GenomeRNAi:42246
NextBio:827869 ArrayExpress:Q9VE45 Bgee:Q9VE45 Uniprot:Q9VE45
Length = 950
Score = 129 (50.5 bits), Expect = 8.0e-05, P = 8.0e-05
Identities = 72/281 (25%), Positives = 98/281 (34%)
Query: 124 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVP--QGHGPPPSATTAGVVGAGPNTST 181
RR SYG +++ G P + Y P Q +G P A + G + +
Sbjct: 222 RRPSSSYGAPRPAPPSQSYGAPPSAS-YGPPKSAPPSQSYGAP--APPSSKYGPPKSAPS 278
Query: 182 SAYAATQSGTPMRAAYDIPRGPG--YEASKGPG--YDASKAPS--YDPTKGPSYDPAKGP 235
S+Y A + P ++Y P P Y A P Y A APS Y PS + G
Sbjct: 279 SSYGAPRPAAPS-SSYGAPAPPSSSYGAPAAPSSSYGAPAAPSSSYGAPAAPS--SSYGA 335
Query: 236 GYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQR--GPGY 293
P+K G A S+Y A P+ G P G +Y P Y
Sbjct: 336 PAPPSKSYGAPAPPSSSYGAPAAPSKSY--GAPAPPSSSYGAPAPPSSSYGAPAPPSPSY 393
Query: 294 ETQRVPGYDVQR-GPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGA 352
P P + AP+ P + G Y AP+ PS G A
Sbjct: 394 GAPAPPSKSYGAPAPPSSSYGAPA-APSKSYGAPAPPSSSYG---APA-PPSSSYGAPSA 448
Query: 353 PRGA-APHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 392
P + P P P ++ YG A P A S P P++
Sbjct: 449 PSSSYGPPKPAPAPPSS-SYG-APPQAPVSSYLPPASRPSK 487
Score = 127 (49.8 bits), Expect = 0.00013, P = 0.00013
Identities = 67/265 (25%), Positives = 96/265 (36%)
Query: 128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTS--TSAYA 185
GS GG+ ++ + + P + G P SA ++ GP S +S+Y+
Sbjct: 589 GSSGGSFQSAPSSSYSAPSA--SANSGGSYPSAPSSSYSAPSSSSSSGGPYASAPSSSYS 646
Query: 186 ATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGY 245
A SG+ Y P P S P A+ SY SY A PG + + GP Y
Sbjct: 647 APSSGSNSGGPY--PAAPSSSYS-APSASANSGGSYPSAPSSSYS-APSPGSN-SGGP-Y 700
Query: 246 DAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY-----ETQRVPG 300
A S+Y A P+ + G Y Y P+ G Y + P
Sbjct: 701 PAAPSSSYSA---PSPSANSGGPYASAPSSSYS---APSSSSNSGGPYAAAPSSSYSAPS 754
Query: 301 YDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHG 360
G Y + + SY P L G Y + SY + G P AAP
Sbjct: 755 SSSSSGGPYPSAPSSSY---SAPSSSLSSGGPYPSAPSSSYAAPSPSSNSGGPYPAAPSN 811
Query: 361 QVPPPLN--NVPYGS-ATPPARSGS 382
P+ + YG+ A+ P+ S S
Sbjct: 812 SYSAPIAPPSSSYGAPASGPSPSFS 836
Score = 124 (48.7 bits), Expect = 0.00028, P = 0.00028
Identities = 63/264 (23%), Positives = 91/264 (34%)
Query: 132 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPP--PSATTAGVVGAGPNTSTSAYAATQS 189
G++G S +S YG P P P +++ G +G +S+ +++A S
Sbjct: 521 GSSGYSSGPSSSYEAPVAPPSSSYGAPSSSFQPISPPSSSYGAPSSGSGSSSGSFSAAPS 580
Query: 190 GTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYD-PTK-----GP 243
A G ++++ Y A PS G SY A Y P+ GP
Sbjct: 581 SL-YSAPSKGSSGGSFQSAPSSSYSA---PSASANSGGSYPSAPSSSYSAPSSSSSSGGP 636
Query: 244 GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDM-----QRGPNYDMQRGPGYETQRV 298
Y + S+Y A P+ + G Y Y G +Y Y
Sbjct: 637 -YASAPSSSYSA---PSSGSNSGGPYPAAPSSSYSAPSASANSGGSYPSAPSSSYSAPS- 691
Query: 299 PGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAP 358
PG + GP Y A + SY P G Y + SY + G P AAP
Sbjct: 692 PGSN-SGGP-YPAAPSSSY---SAPSPSANSGGPYASAPSSSYSAPSSSSNSGGPYAAAP 746
Query: 359 HGQVPPPLNNVPYGSATPPARSGS 382
P ++ G P A S S
Sbjct: 747 SSSYSAPSSSSSSGGPYPSAPSSS 770
>UNIPROTKB|F1SN69 [details] [associations]
symbol:F1SN69 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
"extracellular matrix structural constituent" evidence=IEA]
InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
SMART:SM00038 InterPro:IPR008985 SUPFAM:SSF49899 GO:GO:0005581
InterPro:IPR008160 Pfam:PF01391 InterPro:IPR001791 GO:GO:0005201
SMART:SM00210 GeneTree:ENSGT00700000104301 OMA:YSYPDRL
EMBL:CU618340 EMBL:CU606988 EMBL:CU861519
Ensembl:ENSSSCT00000006033 Uniprot:F1SN69
Length = 1869
Score = 132 (51.5 bits), Expect = 8.2e-05, P = 8.2e-05
Identities = 74/250 (29%), Positives = 98/250 (39%)
Query: 156 GVPQGHGPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRG-------PGYEA 207
GVP GPP + G G+ GP + A G P A YD +G PG +
Sbjct: 1274 GVPGDPGPPGTPGPKGSRGSLGPTGAPGRMGA--QGEPGLAGYDGHKGIMGPLGPPGPKG 1331
Query: 208 SKGP-GYDA-SKAPSYDP-TKGPSYDPAKGPGYDPTKGPGYDAQKG-----SNYDAQRGP 259
KG G D ++ P P +GP D +G +P PGY Q+G N Q P
Sbjct: 1332 EKGEQGEDGKAEGPPGPPGDRGPVGD--RGDRGEPGD-PGYPGQEGVQGLRGNPGQQGQP 1388
Query: 260 NYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE-TQRVPGYDVQRGPVYEAQRAPSYI 318
+ RG P+ G + +G PG TQ +PG RG V ++ P +
Sbjct: 1389 GHPGPRGRP-GPKGSKGEEGPKGKQ-GKAGAPGRRGTQGLPGLPGPRGVV--GRQGPEGV 1444
Query: 319 --PQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAA-PHGQVPPPL---NNVPYG 372
P PG D Q GQ + P G G P A P Q PP + +P G
Sbjct: 1445 AGPDGLPGLDGQAGQQGEQGDDGDPGPLGPAGKRGNPGVAGLPGAQGPPGFKGESGLP-G 1503
Query: 373 SATPPARSGS 382
PP + G+
Sbjct: 1504 QLGPPGKRGT 1513
>WB|WBGene00000694 [details] [associations]
symbol:col-120 species:6239 "Caenorhabditis elegans"
[GO:0042302 "structural constituent of cuticle" evidence=IEA]
[GO:0016021 "integral to membrane" evidence=IEA] InterPro:IPR002486
Pfam:PF01484 SMART:SM01088 GO:GO:0042302 HOGENOM:HOG000085656
GeneTree:ENSGT00610000086159 EMBL:AL032632 PIR:T26465
RefSeq:NP_501617.1 ProteinModelPortal:Q9XWR2 DIP:DIP-26936N
IntAct:Q9XWR2 MINT:MINT-1070946 STRING:Q9XWR2
EnsemblMetazoa:Y11D7A.11 GeneID:177748 KEGG:cel:CELE_Y11D7A.11
UCSC:Y11D7A.11 CTD:177748 WormBase:Y11D7A.11 eggNOG:NOG265281
InParanoid:Q9XWR2 OMA:HWELLED NextBio:898216 Uniprot:Q9XWR2
Length = 313
Score = 122 (48.0 bits), Expect = 8.7e-05, P = 8.7e-05
Identities = 77/268 (28%), Positives = 97/268 (36%)
Query: 136 NSENE-TSGRPVGQNAY--EDGYGV--PQ---GHGPPPSATTAGVVGAGPNTSTSAYAAT 187
N EN S + VG + GYG P G P PS A A ++S+S+ +
Sbjct: 64 NLENMYESTKAVGSGPVKRQAGYGASSPSRASGSHPAPSPYDA----ASTSSSSSSDSCC 119
Query: 188 QSGTPMRAAYDIPRGPGYEASKGP----GYDASKAPSYDPTKGPSYD---PAKGPGYDPT 240
G + P PG + GP G D + G + PA PG P
Sbjct: 120 SCGIGLAGPAGFPGRPGRDGIDGPAGKPGRDGQDLDGESSSDGSQIELDCPAGPPG--PP 177
Query: 241 KGPGYDAQKGS-NYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGPGYETQRV 298
PG G D G N R P +RG G D + G D PG +
Sbjct: 178 GNPGPQGNSGRPGMDGMPGRNGRCGR-PGEQGERGPNGEDGRPGRRGD-DGMPG-TVNEI 234
Query: 299 PGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQ-GYDMRRAPSYDPSRGTGFDGAPRGAA 357
PG Q GP + AP +GP RG G + P+ P GFDGAP G
Sbjct: 235 PG---QAGPP-GLRGAPGATGSQGP-----RGNDGRPGNKGPAGPPG-DQGFDGAPGGPG 284
Query: 358 PHGQ--VPPPLNNVPYGSATPPARSGSG 383
G+ PL S PP R+ G
Sbjct: 285 ADGEPGAQGPLGAKGECSHCPPPRTAPG 312
>DICTYBASE|DDB_G0286613 [details] [associations]
symbol:DDB_G0286613 "14-3-3 family protein"
species:44689 "Dictyostelium discoideum" [GO:0019904 "protein
domain specific binding" evidence=IEA] [GO:0006366 "transcription
from RNA polymerase II promoter" evidence=IEA] [GO:0005665
"DNA-directed RNA polymerase II, core complex" evidence=IEA]
[GO:0003950 "NAD+ ADP-ribosyltransferase activity" evidence=IEA]
[GO:0003677 "DNA binding" evidence=IEA] InterPro:IPR000308
InterPro:IPR000684 InterPro:IPR002035 InterPro:IPR012317
Pfam:PF00644 PRINTS:PR00305 PROSITE:PS00115 PROSITE:PS50234
PROSITE:PS51059 SMART:SM00327 InterPro:IPR001357
dictyBase:DDB_G0286613 Pfam:PF00533 eggNOG:COG5040
Gene3D:1.20.190.20 InterPro:IPR023410 Pfam:PF00244 SMART:SM00101
SUPFAM:SSF48445 GO:GO:0003677 EMBL:AAFI02000089 GO:GO:0006366
SMART:SM00292 SUPFAM:SSF52113 PROSITE:PS50172 GO:GO:0003950
InterPro:IPR013694 Pfam:PF08487 PROSITE:PS51468 KO:K10798
GO:GO:0005665 RefSeq:XP_637567.1 ProteinModelPortal:Q54LJ4
EnsemblProtists:DDB0232950 GeneID:8625707 KEGG:ddi:DDB_G0286613
InParanoid:Q54LJ4 OMA:THTKATI Uniprot:Q54LJ4
Length = 2563
Score = 133 (51.9 bits), Expect = 9.0e-05, P = 9.0e-05
Identities = 45/137 (32%), Positives = 62/137 (45%)
Query: 125 RADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAY 184
R S GG+ G+S G +G A+ G G P PPP +T+ +G G + +
Sbjct: 1787 RGGSSRGGSIGSSRGGRGGN-IG-TAF--GRGAPPPPQPPPPPSTS--LGRGAPPPSLFF 1840
Query: 185 AATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPG 244
A+Q +P Y IP P Y + P Y + +PSY PT PSY P P Y + P
Sbjct: 1841 QASQPYSPTSPFY-IPTSPSYSPTS-PSYSPT-SPSYSPTS-PSYSPTS-PSYSTS--PL 1893
Query: 245 YDAQKGSNYDAQRGPNY 261
Y A +Y P+Y
Sbjct: 1894 Y-ASTSQSYSPV-SPSY 1908
>UNIPROTKB|F1NCR0 [details] [associations]
symbol:COL1A2 "Collagen alpha-2(I) chain" species:9031
"Gallus gallus" [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] [GO:0001501 "skeletal system
development" evidence=IEA] [GO:0001568 "blood vessel development"
evidence=IEA] [GO:0005584 "collagen type I" evidence=IEA]
[GO:0005615 "extracellular space" evidence=IEA] [GO:0007179
"transforming growth factor beta receptor signaling pathway"
evidence=IEA] [GO:0007266 "Rho protein signal transduction"
evidence=IEA] [GO:0008217 "regulation of blood pressure"
evidence=IEA] [GO:0030199 "collagen fibril organization"
evidence=IEA] [GO:0030674 "protein binding, bridging" evidence=IEA]
[GO:0042802 "identical protein binding" evidence=IEA] [GO:0043589
"skin morphogenesis" evidence=IEA] [GO:0046332 "SMAD binding"
evidence=IEA] [GO:0048407 "platelet-derived growth factor binding"
evidence=IEA] [GO:0070208 "protein heterotrimerization"
evidence=IEA] [GO:0071230 "cellular response to amino acid
stimulus" evidence=IEA] InterPro:IPR000885 Pfam:PF01410
ProDom:PD002078 PROSITE:PS51461 SMART:SM00038 GO:GO:0005615
GO:GO:0030199 GO:GO:0007179 GO:GO:0007266 GO:GO:0070208
InterPro:IPR008160 Pfam:PF01391 GO:GO:0071230 GO:GO:0005201
GeneTree:ENSGT00660000095287 GO:GO:0005584 EMBL:AADN02000724
IPI:IPI00821202 Ensembl:ENSGALT00000015706 ArrayExpress:F1NCR0
Uniprot:F1NCR0
Length = 1318
Score = 130 (50.8 bits), Expect = 9.1e-05, P = 9.1e-05
Identities = 81/262 (30%), Positives = 97/262 (37%)
Query: 145 PVGQNAYEDGYGVPQGH----GPPPSATTAGVVGA-GPNTSTSAYAATQSGTP-MRAAYD 198
PVG+ E G P G GP A AG G GP A G P R
Sbjct: 781 PVGRTG-EQGIAGPPGFAGEKGPSGEAGAAGPPGTPGPQGILGAPGIL--GLPGSRGERG 837
Query: 199 IPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYDAQR 257
+P G GP S P +GPS P PG + G G D G++ R
Sbjct: 838 LPGIAGATGEPGP-LGVSGPPG---ARGPS-GPVGSPGPNGAPGEAGRDGNPGNDGPPGR 892
Query: 258 --GPNYDIHRG-PSYD-PQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQR 313
P + RG P P LG GP+ Q GP + PG GPV
Sbjct: 893 DGAPGFKGERGAPGNPGPSGALGAP---GPHG--QVGPSGK----PGNRGDPGPVGPVGP 943
Query: 314 APSYIPQ--RGP-GYDLQRGQGYDM--RRAPSYDPSRG-TGFDGAPRGAAPHGQVPPPLN 367
A ++ P+ GP G ++G+ D R P +G G G P A HG PP N
Sbjct: 944 AGAFGPRGLAGPQGPRGEKGEPGDKGHRGLPGL---KGHNGLQGLPGLAGQHGDQGPPGN 1000
Query: 368 NVPYGSATPPARSGSGQPRGGN 389
N P G PP SG G N
Sbjct: 1001 NGPAGPRGPPGPSGPPGKDGRN 1022
>UNIPROTKB|F1M6Q3 [details] [associations]
symbol:Col4a2 "Protein Col4a2" species:10116 "Rattus
norvegicus" [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
GO:GO:0071560 InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0006351
InterPro:IPR008160 Pfam:PF01391 GO:GO:0016525 GO:GO:0005201
GO:GO:0005587 Gene3D:2.170.240.10 GeneTree:ENSGT00690000101772
IPI:IPI00778948 Ensembl:ENSRNOT00000057461 Uniprot:F1M6Q3
Length = 1647
Score = 131 (51.2 bits), Expect = 9.2e-05, P = 9.2e-05
Identities = 90/302 (29%), Positives = 112/302 (37%)
Query: 119 APNVDRRADGSYGGATGN----SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG 174
+P VD D + G TG+ E T PVG + G+P GP S G G
Sbjct: 1145 SPGVDAHGDPGFPGPTGDRGDRGEANTLPGPVGAPGQKGEQGIPGERGPVGSPGLQGFPG 1204
Query: 175 AGPNTSTSAYAATQSGTPM---RAAYDIPRGP-GYEASKGPGYD--ASKAPSYDPTKGPS 228
P ++ S G P Y P GP G A G D +S A + KG
Sbjct: 1205 ISPPSNISGLPG-DVGAPGIFGLQGYQGPPGPPGPNALPGIKGDEGSSGAAGFPGEKGWV 1263
Query: 229 YDPAKGPGYDP-TKG-PGYDAQKGSN-YDAQRGPNYDI-HRGPSYDPQRGLGYDMQRGPN 284
DP GP P G PG KG + GP+ + RGP P+ G+ G
Sbjct: 1264 GDP--GPQGQPGVHGLPGEKGPKGEQGFMGNTGPSGAVGDRGPK-GPKGDQGFPGAPGS- 1319
Query: 285 YDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDP 343
PG +PG Q+ V P +RG PG + G + P DP
Sbjct: 1320 ---MGSPG-----IPGIP-QKIAVQPGTMGPQ--GRRGLPGALGEMGP-----QGPPGDP 1363
Query: 344 SRGTGFDGAPRGAAPHGQ-----VPP------PLNNV-PYGSATPPARSGS-GQPRGGNP 390
GF GAP A P G+ VP P+ + P G P R GS G P G P
Sbjct: 1364 ----GFRGAPGKAGPQGRGGVSAVPGFRGDQGPMGHQGPIGQEGEPGRPGSPGLP--GMP 1417
Query: 391 AR 392
R
Sbjct: 1418 GR 1419
>UNIPROTKB|P02467 [details] [associations]
symbol:COL1A2 "Collagen alpha-2(I) chain" species:9031
"Gallus gallus" [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] [GO:0046872 "metal ion binding"
evidence=IEA] [GO:0005583 "fibrillar collagen" evidence=IDA]
InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
SMART:SM00038 GO:GO:0046872 InterPro:IPR008160 Pfam:PF01391
eggNOG:NOG12793 GO:GO:0005201 HOVERGEN:HBG004933 EMBL:M25963
EMBL:M25956 EMBL:M25959 EMBL:M25961 EMBL:M25962 EMBL:M25965
EMBL:M25964 EMBL:M25984 EMBL:M25957 EMBL:M25966 EMBL:M25967
EMBL:M25969 EMBL:M25970 EMBL:M25971 EMBL:M25972 EMBL:M25973
EMBL:M25974 EMBL:M25976 EMBL:M25977 EMBL:M25978 EMBL:M25979
EMBL:M25980 EMBL:M25981 EMBL:M25982 EMBL:M25983 EMBL:J00826
EMBL:J00821 EMBL:K00792 EMBL:J00830 EMBL:J00829 EMBL:J00837
EMBL:J00812 EMBL:J00811 EMBL:J00814 EMBL:J00815 EMBL:X02657
EMBL:K00794 EMBL:V00390 EMBL:M17608 EMBL:M10581 EMBL:M10540
EMBL:J00828 EMBL:J00827 EMBL:J00832 EMBL:J00831 EMBL:J00833
EMBL:J00822 IPI:IPI00914483 PIR:I50173 PIR:I50206 PIR:S10847
UniGene:Gga.5097 STRING:P02467 PRIDE:P02467 InParanoid:P02467
PMAP-CutDB:P02467 GO:GO:0005583 Uniprot:P02467
Length = 1362
Score = 130 (50.8 bits), Expect = 9.5e-05, P = 9.5e-05
Identities = 81/262 (30%), Positives = 97/262 (37%)
Query: 145 PVGQNAYEDGYGVPQGH----GPPPSATTAGVVGA-GPNTSTSAYAATQSGTP-MRAAYD 198
PVG+ E G P G GP A AG G GP A G P R
Sbjct: 825 PVGRTG-EQGIAGPPGFAGEKGPSGEAGAAGPPGTPGPQGILGAPGIL--GLPGSRGERG 881
Query: 199 IPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYDAQR 257
+P G GP S P +GPS P PG + G G D G++ R
Sbjct: 882 LPGIAGATGEPGP-LGVSGPPG---ARGPS-GPVGSPGPNGAPGEAGRDGNPGNDGPPGR 936
Query: 258 --GPNYDIHRG-PSYD-PQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQR 313
P + RG P P LG GP+ Q GP + PG GPV
Sbjct: 937 DGAPGFKGERGAPGNPGPSGALGAP---GPHG--QVGPSGK----PGNRGDPGPVGPVGP 987
Query: 314 APSYIPQ--RGP-GYDLQRGQGYDM--RRAPSYDPSRG-TGFDGAPRGAAPHGQVPPPLN 367
A ++ P+ GP G ++G+ D R P +G G G P A HG PP N
Sbjct: 988 AGAFGPRGLAGPQGPRGEKGEPGDKGHRGLPGL---KGHNGLQGLPGLAGQHGDQGPPGN 1044
Query: 368 NVPYGSATPPARSGSGQPRGGN 389
N P G PP SG G N
Sbjct: 1045 NGPAGPRGPPGPSGPPGKDGRN 1066
>UNIPROTKB|F1P0H9 [details] [associations]
symbol:COL1A2 "Collagen alpha-2(I) chain" species:9031
"Gallus gallus" [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] [GO:0001501 "skeletal system
development" evidence=IEA] [GO:0001568 "blood vessel development"
evidence=IEA] [GO:0005584 "collagen type I" evidence=IEA]
[GO:0005615 "extracellular space" evidence=IEA] [GO:0007179
"transforming growth factor beta receptor signaling pathway"
evidence=IEA] [GO:0007266 "Rho protein signal transduction"
evidence=IEA] [GO:0008217 "regulation of blood pressure"
evidence=IEA] [GO:0030199 "collagen fibril organization"
evidence=IEA] [GO:0030674 "protein binding, bridging" evidence=IEA]
[GO:0042802 "identical protein binding" evidence=IEA] [GO:0043589
"skin morphogenesis" evidence=IEA] [GO:0046332 "SMAD binding"
evidence=IEA] [GO:0048407 "platelet-derived growth factor binding"
evidence=IEA] [GO:0070208 "protein heterotrimerization"
evidence=IEA] [GO:0071230 "cellular response to amino acid
stimulus" evidence=IEA] InterPro:IPR000885 Pfam:PF01410
ProDom:PD002078 PROSITE:PS51461 SMART:SM00038 GO:GO:0005615
GO:GO:0030199 GO:GO:0007179 GO:GO:0007266 GO:GO:0070208
InterPro:IPR008160 Pfam:PF01391 GO:GO:0071230 GO:GO:0005201
GeneTree:ENSGT00660000095287 KO:K06236 GO:GO:0005584 CTD:1278
IPI:IPI00914483 UniGene:Gga.5097 EMBL:AADN02000724
RefSeq:NP_001073182.2 PRIDE:F1P0H9 Ensembl:ENSGALT00000015703
GeneID:396243 KEGG:gga:396243 OMA:IGMPGAR NextBio:20816295
ArrayExpress:F1P0H9 Uniprot:F1P0H9
Length = 1363
Score = 130 (50.8 bits), Expect = 9.5e-05, P = 9.5e-05
Identities = 81/262 (30%), Positives = 97/262 (37%)
Query: 145 PVGQNAYEDGYGVPQGH----GPPPSATTAGVVGA-GPNTSTSAYAATQSGTP-MRAAYD 198
PVG+ E G P G GP A AG G GP A G P R
Sbjct: 826 PVGRTG-EQGIAGPPGFAGEKGPSGEAGAAGPPGTPGPQGILGAPGIL--GLPGSRGERG 882
Query: 199 IPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYDAQR 257
+P G GP S P +GPS P PG + G G D G++ R
Sbjct: 883 LPGIAGATGEPGP-LGVSGPPG---ARGPS-GPVGSPGPNGAPGEAGRDGNPGNDGPPGR 937
Query: 258 --GPNYDIHRG-PSYD-PQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQR 313
P + RG P P LG GP+ Q GP + PG GPV
Sbjct: 938 DGAPGFKGERGAPGNPGPSGALGAP---GPHG--QVGPSGK----PGNRGDPGPVGPVGP 988
Query: 314 APSYIPQ--RGP-GYDLQRGQGYDM--RRAPSYDPSRG-TGFDGAPRGAAPHGQVPPPLN 367
A ++ P+ GP G ++G+ D R P +G G G P A HG PP N
Sbjct: 989 AGAFGPRGLAGPQGPRGEKGEPGDKGHRGLPGL---KGHNGLQGLPGLAGQHGDQGPPGN 1045
Query: 368 NVPYGSATPPARSGSGQPRGGN 389
N P G PP SG G N
Sbjct: 1046 NGPAGPRGPPGPSGPPGKDGRN 1067
>UNIPROTKB|F1SNP1 [details] [associations]
symbol:COL4A4 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0032836 "glomerular basement membrane development"
evidence=IEA] [GO:0005605 "basal lamina" evidence=IEA] [GO:0005587
"collagen type IV" evidence=IEA] [GO:0005201 "extracellular matrix
structural constituent" evidence=IEA] InterPro:IPR001442
Pfam:PF01413 PROSITE:PS51403 SMART:SM00111 InterPro:IPR016187
SUPFAM:SSF56436 GO:GO:0005605 InterPro:IPR008160 Pfam:PF01391
GO:GO:0005201 GO:GO:0005587 Gene3D:2.170.240.10
GeneTree:ENSGT00690000101772 GO:GO:0032836 OMA:FRGDMGD
EMBL:CU466451 EMBL:FP690341 Ensembl:ENSSSCT00000017688
Uniprot:F1SNP1
Length = 1711
Score = 131 (51.2 bits), Expect = 9.6e-05, P = 9.6e-05
Identities = 76/260 (29%), Positives = 89/260 (34%)
Query: 143 GRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG 202
G P G E G+P GPP G G G P G
Sbjct: 1207 GVP-GPRGPEGSMGLPGQRGPP-GPECKGEPGPDGRRGEDGLPGPP-GPPGHKGDMGEAG 1263
Query: 203 -PGYEASKG-PGYDASKAPSYDPTKGPSYDPAKG--PGYDPTKGPGYDAQKGSNYDAQRG 258
PG KG PG + PS +G + DP G G P PG G N QRG
Sbjct: 1264 CPGAPGPKGFPGRRGTPGPSLIGFRGDTGDPGFGGEKGSSPIGPPGSPGSPGMN--GQRG 1321
Query: 259 PNYDIHRG-PSYDPQRGL-GYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRA-- 314
P D G P +RGL G +G D R +PG+ +GP RA
Sbjct: 1322 PPGDPALGYPGPPGKRGLFGSPGSKGLRGDPGRPGATGPAGMPGFPGLKGPKGREGRAGF 1381
Query: 315 PSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSA 374
P +P PG+ + G R P P G GAP G + PP G
Sbjct: 1382 PG-VPGP-PGHSCESGA--PGRPGPPGLPG-APGSPGAPGWKGQRGDMGPPGPAGMKGVP 1436
Query: 375 TPPARSGSGQPRG--GNPAR 392
P R G P G G P R
Sbjct: 1437 GVPGRPGPDGPPGPPGVPGR 1456
>TAIR|locus:2079502 [details] [associations]
symbol:RS31 "arginine/serine-rich splicing factor 31"
species:3702 "Arabidopsis thaliana" [GO:0000166 "nucleotide
binding" evidence=IEA] [GO:0003676 "nucleic acid binding"
evidence=IEA] [GO:0003723 "RNA binding" evidence=ISS] [GO:0005737
"cytoplasm" evidence=ISM] [GO:0016607 "nuclear speck" evidence=IDA]
[GO:0008380 "RNA splicing" evidence=NAS] [GO:0000398 "mRNA
splicing, via spliceosome" evidence=IDA;RCA] [GO:0006355
"regulation of transcription, DNA-dependent" evidence=RCA]
[GO:0030422 "production of siRNA involved in RNA interference"
evidence=RCA] [GO:0035196 "production of miRNAs involved in gene
silencing by miRNA" evidence=RCA] [GO:0043687 "post-translational
protein modification" evidence=RCA] [GO:0045893 "positive
regulation of transcription, DNA-dependent" evidence=RCA]
[GO:0005681 "spliceosomal complex" evidence=TAS] InterPro:IPR000504
InterPro:IPR012677 Pfam:PF00076 PROSITE:PS50102 SMART:SM00360
EMBL:CP002686 GenomeReviews:BA000014_GR GO:GO:0000166 GO:GO:0016607
Gene3D:3.30.70.330 GO:GO:0005681 GO:GO:0003723 GO:GO:0000398
EMBL:AL138642 HOGENOM:HOG000276234 KO:K12893 EMBL:X99435
EMBL:AF439831 EMBL:AY125565 IPI:IPI00530595 PIR:T47978 PIR:T51304
RefSeq:NP_567120.1 UniGene:At.24231 ProteinModelPortal:P92964
SMR:P92964 IntAct:P92964 STRING:P92964 PaxDb:P92964 PRIDE:P92964
EnsemblPlants:AT3G61860.1 GeneID:825359 KEGG:ath:AT3G61860
TAIR:At3g61860 eggNOG:NOG277933 InParanoid:P92964 OMA:FEYETRQ
PhylomeDB:P92964 ProtClustDB:CLSN2917489 Genevestigator:P92964
GermOnline:AT3G61860 Uniprot:P92964
Length = 264
Score = 120 (47.3 bits), Expect = 9.9e-05, P = 9.9e-05
Identities = 30/88 (34%), Positives = 41/88 (46%)
Query: 191 TPMRAAYDIPR---GPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG-YDPTKGPGYD 246
+P R+ + R P Y PG +P Y + P YD KGP Y+ + P Y
Sbjct: 177 SPRRSLSPVYRRRPSPDYGRRPSPGQGRRPSPDYGRARSPEYDRYKGPAAYERRRSPDY- 235
Query: 247 AQKGSNYDAQRGPNYDIHRGPSYDPQRG 274
++ S+Y QR P YD +R S P RG
Sbjct: 236 GRRSSDYGRQRSPGYDRYRSRSPVP-RG 262
>UNIPROTKB|F1MSR8 [details] [associations]
symbol:COL2A1 "Collagen alpha-1(II) chain" species:9913
"Bos taurus" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
"extracellular matrix structural constituent" evidence=IEA]
InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
SMART:SM00038 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
GO:GO:0005201 GeneTree:ENSGT00660000095287 KO:K06236
UniGene:Bt.21390 GeneID:407142 KEGG:bta:407142 CTD:1280
NextBio:20818406 EMBL:DAAA02012985 EMBL:DAAA02012986
IPI:IPI00786510 RefSeq:NP_001106695.1 PRIDE:F1MSR8
Ensembl:ENSBTAT00000017509 Uniprot:F1MSR8
Length = 1418
Score = 130 (50.8 bits), Expect = 9.9e-05, P = 9.9e-05
Identities = 89/295 (30%), Positives = 112/295 (37%)
Query: 120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATT--AGVVGAGP 177
P DR D GA G + G P G G P GPP A + G
Sbjct: 64 PRGDR-GDKGEKGAPGPRGRD--GEP-GTPGNPGPPGPPGPPGPPGLGGNFAAQMAGGFD 119
Query: 178 NTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAKG 234
+ A G PM PRGP G + GP G+ + +P GP P +G
Sbjct: 120 EKAGGAQMGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GP-RG 174
Query: 235 PGYDPTKGPGYDAQKGS-NYDAQRGP-NYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQRG 290
P P K PG D + G +RGP RG P GL G RG P D +G
Sbjct: 175 PPGPPGK-PGDDGEAGKPGKSGERGPPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAKG 231
Query: 291 ----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS----- 340
PG + + PG + GP+ + P + GP +G D + P+
Sbjct: 232 EAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPGP 289
Query: 341 YDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATP---PARSGSGQPRG--GNP 390
P+ G GF GAP GA G+ P P G+ P P GS P G GNP
Sbjct: 290 VGPAGGPGFPGAP-GAK--GEAGPTGARGPEGAQGPRGEPGTPGSPGPAGAAGNP 341
Score = 128 (50.1 bits), Expect = 0.00016, P = 0.00016
Identities = 88/282 (31%), Positives = 102/282 (36%)
Query: 128 GSYGGATGNSENETSGRPVGQNAYEDGYGVP--QGH-GPPPSATTAGVVGAG--PNTSTS 182
G G A N E G P G G P +G GPP A AG GA P
Sbjct: 723 GPPGPAGANGEKGEVGPP-GPAGTAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGE 781
Query: 183 AYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAP--SYDPTKGPSYDPAKG---- 234
A Q G A P+GP G +GP G K + P + A G
Sbjct: 782 QGEAGQKGD---AGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGP 838
Query: 235 PGYDPTKGP-GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY 293
PG + GP G G D +G RG S P R +Q GP GP
Sbjct: 839 PGSNGNPGPPGPPGPSGK--DGPKGA-----RGDSGPPGRAGDPGLQ-GP-----AGPPG 885
Query: 294 ETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYDPSRGTGFD 350
E PG D GP + P + QRG G QRG+ G+ PS +P + G
Sbjct: 886 EKGE-PGDDGPSGP--DGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGK-QGAP 941
Query: 351 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 392
GA P G V PP P G P R GS G P R
Sbjct: 942 GASGDRGPPGPVGPPGLTGPAGE---PGREGS-PGADGPPGR 979
>MGI|MGI:88467 [details] [associations]
symbol:Col1a1 "collagen, type I, alpha 1" species:10090 "Mus
musculus" [GO:0001501 "skeletal system development"
evidence=ISO;IMP] [GO:0001568 "blood vessel development"
evidence=ISO;IMP] [GO:0001957 "intramembranous ossification"
evidence=IGI] [GO:0001958 "endochondral ossification" evidence=IMP]
[GO:0005201 "extracellular matrix structural constituent"
evidence=IDA] [GO:0005515 "protein binding" evidence=IPI]
[GO:0005576 "extracellular region" evidence=IEA] [GO:0005578
"proteinaceous extracellular matrix" evidence=IDA] [GO:0005581
"collagen" evidence=IMP;IDA] [GO:0005584 "collagen type I"
evidence=ISO;IMP;IDA] [GO:0005615 "extracellular space"
evidence=ISO] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0007601
"visual perception" evidence=ISO] [GO:0007605 "sensory perception
of sound" evidence=ISO] [GO:0010718 "positive regulation of
epithelial to mesenchymal transition" evidence=ISO] [GO:0010812
"negative regulation of cell-substrate adhesion" evidence=IDA]
[GO:0015031 "protein transport" evidence=IMP] [GO:0030199 "collagen
fibril organization" evidence=ISO] [GO:0030335 "positive regulation
of cell migration" evidence=ISO] [GO:0031012 "extracellular matrix"
evidence=IDA] [GO:0032964 "collagen biosynthetic process"
evidence=ISO] [GO:0034504 "protein localization to nucleus"
evidence=ISO] [GO:0034505 "tooth mineralization" evidence=ISO]
[GO:0042060 "wound healing" evidence=ISO] [GO:0042802 "identical
protein binding" evidence=ISO] [GO:0043588 "skin development"
evidence=IMP] [GO:0043589 "skin morphogenesis" evidence=ISO]
[GO:0045893 "positive regulation of transcription, DNA-dependent"
evidence=ISO] [GO:0046872 "metal ion binding" evidence=IEA]
[GO:0048407 "platelet-derived growth factor binding" evidence=ISO]
[GO:0048705 "skeletal system morphogenesis" evidence=IGI]
[GO:0048706 "embryonic skeletal system development" evidence=ISO]
[GO:0060325 "face morphogenesis" evidence=IGI] [GO:0060346 "bone
trabecula formation" evidence=IGI] [GO:0060351 "cartilage
development involved in endochondral bone morphogenesis"
evidence=IMP] [GO:0070208 "protein heterotrimerization"
evidence=IDA] [GO:0071230 "cellular response to amino acid
stimulus" evidence=IDA] [GO:0090263 "positive regulation of
canonical Wnt receptor signaling pathway" evidence=ISO]
InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
SMART:SM00038 SMART:SM00214 MGI:MGI:88467 GO:GO:0005737
GO:GO:0045893 GO:GO:0043588 GO:GO:0005615 GO:GO:0071363
GO:GO:0046872 GO:GO:0015031 GO:GO:0007601 GO:GO:0071300
GO:GO:0043434 GO:GO:0030199 GO:GO:0007584 GO:GO:0010035
GO:GO:0007605 GO:GO:0010718 GO:GO:0030335 GO:GO:0042542
GO:GO:0070208 InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793
GO:GO:0071260 GO:GO:0001568 GO:GO:0001649 GO:GO:0051591
GO:GO:0034505 GO:GO:0090263 GO:GO:0010812 GO:GO:0060325
GO:GO:0032964 GO:GO:0071230 GO:GO:0048706 GO:GO:0001957
GO:GO:0034504 GO:GO:0001958 GO:GO:0060351 GO:GO:0005201
GO:GO:0043589 CTD:1277 GeneTree:ENSGT00660000095287
HOVERGEN:HBG004933 KO:K06236 OMA:VAYMDQQ OrthoDB:EOG4S4PHP
GO:GO:0005584 GO:GO:0060346 ChiTaRS:COL1A1 GO:GO:0031960
EMBL:U08020 EMBL:AL662790 EMBL:AL606480 EMBL:BC050014 EMBL:BC059281
EMBL:K01688 EMBL:S67530 EMBL:S67482 EMBL:X54876 EMBL:M14423
EMBL:M17491 EMBL:K03036 EMBL:K03029 EMBL:K03030 EMBL:K03031
EMBL:K03032 EMBL:K03033 EMBL:K03034 EMBL:K03035 EMBL:X06753
EMBL:X15896 EMBL:X57981 IPI:IPI00329872 IPI:IPI00623191 PIR:I49558
PIR:S57243 RefSeq:NP_031768.2 UniGene:Mm.277735 UniGene:Mm.458212
ProteinModelPortal:P11087 SMR:P11087 IntAct:P11087 STRING:P11087
PhosphoSite:P11087 PaxDb:P11087 PRIDE:P11087
Ensembl:ENSMUST00000001547 GeneID:12842 KEGG:mmu:12842
UCSC:uc007kzn.1 InParanoid:P11087 NextBio:282376 PMAP-CutDB:P11087
Bgee:P11087 CleanEx:MM_COL1A1 Genevestigator:P11087
GermOnline:ENSMUSG00000001506 Uniprot:P11087
Length = 1453
Score = 130 (50.8 bits), Expect = 0.00010, P = 0.00010
Identities = 79/254 (31%), Positives = 95/254 (37%)
Query: 145 PVGQNAYEDGYGVPQGHGPPPSAT----TAGVVGAGPNTSTSAYAATQSGTPMRAAYDIP 200
P+G N G P+G PP AT AG VG P S +A G + P
Sbjct: 841 PIG-NVGAPGPKGPRGAAGPPGATGFPGAAGRVGP-PGPSGNAGPPGPPGPVGKEGGKGP 898
Query: 201 RGPGYEASKGPGYDASKAPSYDPTKGPSYDP-AKGPGYDP-TKGP-GYDAQKGS-NYDAQ 256
RG A + PG P P G P A GP P T GP G Q+G Q
Sbjct: 899 RGETGPAGR-PGEVGPPGPP-GPA-GEKGSPGADGPAGSPGTPGPQGIAGQRGVVGLPGQ 955
Query: 257 RGPN-YDIHRGPSYDP-QRG-LGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQR 313
RG + GPS +P ++G G +RGP M GP PG GP E+ R
Sbjct: 956 RGERGFPGLPGPSGEPGKQGPSGSSGERGPPGPM--GP-------PGL---AGPPGESGR 1003
Query: 314 APSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGS 373
S + PG D G D P G GAP P G+ P G
Sbjct: 1004 EGSPGAEGSPGRDGAPGAKGDRGETGPAGPPGAPGAPGAPGPVGPAGKNGDRGETGPAGP 1063
Query: 374 ATPPARSGSGQPRG 387
A P +G+ P G
Sbjct: 1064 AGPIGPAGARGPAG 1077
>UNIPROTKB|P04280 [details] [associations]
symbol:PRB1 "Basic salivary proline-rich protein 1"
species:9606 "Homo sapiens" [GO:0008150 "biological_process"
evidence=ND] [GO:0003674 "molecular_function" evidence=ND]
[GO:0005576 "extracellular region" evidence=NAS] GO:GO:0005576
PIR:B40750 InterPro:IPR026086 PANTHER:PTHR23203 EMBL:K03204
EMBL:K03205 EMBL:K03206 EMBL:S52986 EMBL:M97220 EMBL:K02575
EMBL:K02576 EMBL:X07516 EMBL:X07517 EMBL:S62928 EMBL:S62941
IPI:IPI00023038 PIR:C38355 PIR:D40750 RefSeq:NP_005030.2
RefSeq:NP_955385.1 RefSeq:NP_955386.1 UniGene:Hs.631726
ProteinModelPortal:P04280 STRING:P04280 PhosphoSite:P04280
DMDM:52001469 PRIDE:P04280 GeneID:5542 KEGG:hsa:5542 CTD:5542
GeneCards:GC12M011504 HGNC:HGNC:9337 MIM:180989 neXtProt:NX_P04280
PharmGKB:PA33699 KO:K13911 GenomeRNAi:5542 NextBio:21470
ArrayExpress:P04280 CleanEx:HS_PRB1 Genevestigator:P04280
Uniprot:P04280
Length = 392
Score = 123 (48.4 bits), Expect = 0.00010, P = 0.00010
Identities = 76/279 (27%), Positives = 94/279 (33%)
Query: 131 GGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNT-STSAYAATQS 189
GG G+P G G PQG PPP G G + S +
Sbjct: 43 GGNKPQGPPPPPGKPQGPPP--QGGNKPQG--PPPPGKPQGPPPQGDKSRSPRSPPGKPQ 98
Query: 190 GTPMRAAYDIPRGPGYEASK--GPGYDASKAPSYDPTKG------PSYDPAKGPGYDPTK 241
G P + P+GP K GP P P G P D ++ P P K
Sbjct: 99 GPPPQGGNQ-PQGPPPPPGKPQGPPPQGGNKPQGPPPPGKPQGPPPQGDKSQSPRSPPGK 157
Query: 242 GPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP---GYETQ-- 296
G Q G N P +GP P +G G Q P +GP G ++Q
Sbjct: 158 PQGPPPQ-GGNQPQGPPPPPGKPQGP---PPQG-GNKPQGPPPPGKPQGPPPQGDKSQSP 212
Query: 297 RVP-----GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDG 351
R P G Q G + P PQ P R QG P P +G
Sbjct: 213 RSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPQQGGNRPQGPPPPGKPQGPPPQGDK-SR 271
Query: 352 APRGAAPHGQVPPPLN-NVPYGSATPPARSGSGQPRGGN 389
+P+ Q PPP N P G PP + P+GGN
Sbjct: 272 SPQSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPPQGGN 310
>UNIPROTKB|P02459 [details] [associations]
symbol:COL2A1 "Collagen alpha-1(II) chain" species:9913
"Bos taurus" [GO:0071773 "cellular response to BMP stimulus"
evidence=IEA] [GO:0071599 "otic vesicle development" evidence=IEA]
[GO:0060351 "cartilage development involved in endochondral bone
morphogenesis" evidence=IEA] [GO:0060272 "embryonic skeletal joint
morphogenesis" evidence=IEA] [GO:0060174 "limb bud formation"
evidence=IEA] [GO:0060021 "palate development" evidence=IEA]
[GO:0048407 "platelet-derived growth factor binding" evidence=IEA]
[GO:0043066 "negative regulation of apoptotic process"
evidence=IEA] [GO:0042802 "identical protein binding" evidence=IEA]
[GO:0042472 "inner ear morphogenesis" evidence=IEA] [GO:0030903
"notochord development" evidence=IEA] [GO:0030199 "collagen fibril
organization" evidence=IEA] [GO:0010468 "regulation of gene
expression" evidence=IEA] [GO:0007605 "sensory perception of sound"
evidence=IEA] [GO:0007601 "visual perception" evidence=IEA]
[GO:0007417 "central nervous system development" evidence=IEA]
[GO:0006029 "proteoglycan metabolic process" evidence=IEA]
[GO:0005737 "cytoplasm" evidence=IEA] [GO:0005615 "extracellular
space" evidence=IEA] [GO:0005604 "basement membrane" evidence=IEA]
[GO:0005585 "collagen type II" evidence=IEA] [GO:0003007 "heart
morphogenesis" evidence=IEA] [GO:0002062 "chondrocyte
differentiation" evidence=IEA] [GO:0001958 "endochondral
ossification" evidence=IEA] [GO:0001894 "tissue homeostasis"
evidence=IEA] [GO:0001502 "cartilage condensation" evidence=IEA]
[GO:0046872 "metal ion binding" evidence=IEA] [GO:0005201
"extracellular matrix structural constituent" evidence=IEA]
InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
SMART:SM00038 SMART:SM00214 GO:GO:0005737 GO:GO:0043066
GO:GO:0005615 GO:GO:0046872 GO:GO:0003007 GO:GO:0007601
GO:GO:0030199 GO:GO:0007417 GO:GO:0042472 GO:GO:0001894
GO:GO:0007605 GO:GO:0071773 InterPro:IPR008160 Pfam:PF01391
eggNOG:NOG12793 GO:GO:0071599 GO:GO:0005604 GO:GO:0001502
GO:GO:0060021 GO:GO:0002062 GO:GO:0010468 GO:GO:0060272
GO:GO:0006029 GO:GO:0001958 GO:GO:0060351 GO:GO:0005201
GeneTree:ENSGT00660000095287 HOGENOM:HOG000085654
HOVERGEN:HBG004933 KO:K06236 EMBL:AAFC03017082 EMBL:AAFC03017085
EMBL:AAFC03056593 EMBL:L28918 EMBL:AF138883 EMBL:AF138957
EMBL:X02420 IPI:IPI01028216 PIR:A90369 PIR:I45876
RefSeq:NP_001001135.2 UniGene:Bt.21390 IntAct:P02459 STRING:P02459
PRIDE:P02459 Ensembl:ENSBTAT00000017505 GeneID:407142
KEGG:bta:407142 CTD:1280 InParanoid:Q9XT25 OMA:SSCRICV
Reactome:REACT_133391 NextBio:20818406 PMAP-CutDB:P02459
ArrayExpress:P02459 GO:GO:0005585 GO:GO:0060174 GO:GO:0030903
Uniprot:P02459
Length = 1487
Score = 130 (50.8 bits), Expect = 0.00010, P = 0.00010
Identities = 89/295 (30%), Positives = 112/295 (37%)
Query: 120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATT--AGVVGAGP 177
P DR D GA G + G P G G P GPP A + G
Sbjct: 133 PRGDR-GDKGEKGAPGPRGRD--GEP-GTPGNPGPPGPPGPPGPPGLGGNFAAQMAGGFD 188
Query: 178 NTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAKG 234
+ A G PM PRGP G + GP G+ + +P GP P +G
Sbjct: 189 EKAGGAQMGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GP-RG 243
Query: 235 PGYDPTKGPGYDAQKGS-NYDAQRGP-NYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQRG 290
P P K PG D + G +RGP RG P GL G RG P D +G
Sbjct: 244 PPGPPGK-PGDDGEAGKPGKSGERGPPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAKG 300
Query: 291 ----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS----- 340
PG + + PG + GP+ + P + GP +G D + P+
Sbjct: 301 EAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPGP 358
Query: 341 YDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATP---PARSGSGQPRG--GNP 390
P+ G GF GAP GA G+ P P G+ P P GS P G GNP
Sbjct: 359 VGPAGGPGFPGAP-GAK--GEAGPTGARGPEGAQGPRGEPGTPGSPGPAGAAGNP 410
Score = 128 (50.1 bits), Expect = 0.00017, P = 0.00017
Identities = 88/282 (31%), Positives = 102/282 (36%)
Query: 128 GSYGGATGNSENETSGRPVGQNAYEDGYGVP--QGH-GPPPSATTAGVVGAG--PNTSTS 182
G G A N E G P G G P +G GPP A AG GA P
Sbjct: 792 GPPGPAGANGEKGEVGPP-GPAGTAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGE 850
Query: 183 AYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAP--SYDPTKGPSYDPAKG---- 234
A Q G A P+GP G +GP G K + P + A G
Sbjct: 851 QGEAGQKGD---AGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGP 907
Query: 235 PGYDPTKGP-GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY 293
PG + GP G G D +G RG S P R +Q GP GP
Sbjct: 908 PGSNGNPGPPGPPGPSGK--DGPKGA-----RGDSGPPGRAGDPGLQ-GP-----AGPPG 954
Query: 294 ETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYDPSRGTGFD 350
E PG D GP + P + QRG G QRG+ G+ PS +P + G
Sbjct: 955 EKGE-PGDDGPSGP--DGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGK-QGAP 1010
Query: 351 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 392
GA P G V PP P G P R GS G P R
Sbjct: 1011 GASGDRGPPGPVGPPGLTGPAGE---PGREGS-PGADGPPGR 1048
>UNIPROTKB|P02458 [details] [associations]
symbol:COL2A1 "Collagen alpha-1(II) chain" species:9606
"Homo sapiens" [GO:0046872 "metal ion binding" evidence=IEA]
[GO:0001502 "cartilage condensation" evidence=IEA] [GO:0001894
"tissue homeostasis" evidence=IEA] [GO:0001958 "endochondral
ossification" evidence=IEA] [GO:0002062 "chondrocyte
differentiation" evidence=IEA] [GO:0003007 "heart morphogenesis"
evidence=IEA] [GO:0005604 "basement membrane" evidence=IEA]
[GO:0005615 "extracellular space" evidence=IEA] [GO:0006029
"proteoglycan metabolic process" evidence=IEA] [GO:0007417 "central
nervous system development" evidence=IEA] [GO:0010468 "regulation
of gene expression" evidence=IEA] [GO:0030903 "notochord
development" evidence=IEA] [GO:0042472 "inner ear morphogenesis"
evidence=IEA] [GO:0043066 "negative regulation of apoptotic
process" evidence=IEA] [GO:0060021 "palate development"
evidence=IEA] [GO:0060174 "limb bud formation" evidence=IEA]
[GO:0060351 "cartilage development involved in endochondral bone
morphogenesis" evidence=IEA] [GO:0071599 "otic vesicle development"
evidence=IEA] [GO:0071773 "cellular response to BMP stimulus"
evidence=IEA] [GO:0007601 "visual perception" evidence=IMP]
[GO:0048407 "platelet-derived growth factor binding" evidence=IDA]
[GO:0005576 "extracellular region" evidence=TAS] [GO:0005788
"endoplasmic reticulum lumen" evidence=TAS] [GO:0007411 "axon
guidance" evidence=TAS] [GO:0030198 "extracellular matrix
organization" evidence=TAS] [GO:0042802 "identical protein binding"
evidence=NAS] [GO:0001501 "skeletal system development"
evidence=IMP] [GO:0007605 "sensory perception of sound"
evidence=IMP] [GO:0060272 "embryonic skeletal joint morphogenesis"
evidence=IMP] [GO:0051216 "cartilage development" evidence=TAS]
[GO:0030199 "collagen fibril organization" evidence=IMP]
[GO:0005585 "collagen type II" evidence=IDA] [GO:0030020
"extracellular matrix structural constituent conferring tensile
strength" evidence=IC] InterPro:IPR000885 InterPro:IPR001007
Pfam:PF00093 Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208
PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038 SMART:SM00214
Reactome:REACT_118779 Reactome:REACT_111045 Reactome:REACT_111102
GO:GO:0007411 GO:GO:0043066 GO:GO:0005615 PDB:2FSE PDBsum:2FSE
PDB:2SEB PDBsum:2SEB GO:GO:0046872 GO:GO:0003007 GO:GO:0007601
GO:GO:0030199 GO:GO:0007417 GO:GO:0005788 GO:GO:0042472
GO:GO:0001894 GO:GO:0042802 GO:GO:0007605 GO:GO:0071773
GO:GO:0051216 InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793
GO:GO:0071599 GO:GO:0005604 GO:GO:0001502 GO:GO:0060021
GO:GO:0002062 GO:GO:0010468 GO:GO:0060272 GO:GO:0006029
GO:GO:0001958 GO:GO:0060351 HOVERGEN:HBG004933 KO:K06236
DrugBank:DB00048 GO:GO:0048407 CTD:1280 OMA:SSCRICV GO:GO:0005585
GO:GO:0060174 GO:GO:0030903 OrthoDB:EOG4FTW1C EMBL:X16468
EMBL:L10347 EMBL:BT007205 EMBL:AC004801 EMBL:BC007252 EMBL:BC116449
EMBL:X16711 EMBL:M25730 EMBL:M32168 EMBL:M25655 EMBL:M25656
EMBL:M64345 EMBL:M60299 EMBL:M25698 EMBL:X58709 EMBL:X57010
EMBL:U15195 EMBL:X13783 EMBL:M25728 EMBL:X02371 EMBL:X02372
EMBL:X02373 EMBL:X02374 EMBL:X02375 EMBL:X02376 EMBL:X02377
EMBL:X02378 EMBL:X16158 EMBL:J00116 EMBL:L00977 EMBL:M63281
EMBL:M27468 EMBL:X06268 EMBL:X00339 EMBL:M12048 IPI:IPI00186460
IPI:IPI00748487 IPI:IPI00936892 PIR:A38513 RefSeq:NP_001835.3
RefSeq:NP_149162.2 UniGene:Hs.408182 PDB:1U5M PDBsum:1U5M
ProteinModelPortal:P02458 SMR:P02458 IntAct:P02458
MINT:MINT-6796075 STRING:P02458 PhosphoSite:P02458 DMDM:124056489
PaxDb:P02458 PRIDE:P02458 DNASU:1280 Ensembl:ENST00000337299
Ensembl:ENST00000380518 GeneID:1280 KEGG:hsa:1280 UCSC:uc001rqt.3
UCSC:uc001rqu.3 UCSC:uc001rqv.3 GeneCards:GC12M048266
HGNC:HGNC:2200 HPA:CAB002214 MIM:108300 MIM:120140 MIM:132450
MIM:150600 MIM:151210 MIM:156550 MIM:183900 MIM:184250 MIM:200610
MIM:271700 MIM:604864 MIM:608805 MIM:609162 MIM:609508
neXtProt:NX_P02458 Orphanet:93296 Orphanet:209867 Orphanet:137678
Orphanet:86820 Orphanet:93297 Orphanet:485 Orphanet:2380
Orphanet:93279 Orphanet:166011 Orphanet:1427 Orphanet:85166
Orphanet:93346 Orphanet:94068 Orphanet:93315 Orphanet:1856
Orphanet:90653 PharmGKB:PA26715 ChiTaRS:COL2A1
EvolutionaryTrace:P02458 GenomeRNAi:1280 NextBio:5171
PMAP-CutDB:P02458 Bgee:P02458 Genevestigator:P02458
GermOnline:ENSG00000139219 GO:GO:0030020 Uniprot:P02458
Length = 1487
Score = 130 (50.8 bits), Expect = 0.00010, P = 0.00010
Identities = 90/295 (30%), Positives = 113/295 (38%)
Query: 120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATT--AGVVGAGP 177
P DR D GA G + G P G G P GPP A + G
Sbjct: 133 PRGDR-GDKGEKGAPGPRGRD--GEP-GTPGNPGPPGPPGPPGPPGLGGNFAAQMAGGFD 188
Query: 178 NTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAKG 234
+ A G PM PRGP G + GP G+ + +P GP P +G
Sbjct: 189 EKAGGAQLGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GP-RG 243
Query: 235 PGYDPTKGPGYDAQKGSNYDA-QRGP-NYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQRG 290
P P K PG D + G A +RGP RG P GL G RG P D +G
Sbjct: 244 PPGPPGK-PGDDGEAGKPGKAGERGPPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAKG 300
Query: 291 ----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS----- 340
PG + + PG + GP+ + P + GP +G D + P+
Sbjct: 301 EAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPGP 358
Query: 341 YDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATP---PARSGSGQPRG--GNP 390
P+ G GF GAP GA G+ P P G+ P P GS P G GNP
Sbjct: 359 VGPAGGPGFPGAP-GAK--GEAGPTGARGPEGAQGPRGEPGTPGSPGPAGASGNP 410
Score = 124 (48.7 bits), Expect = 0.00047, P = 0.00047
Identities = 88/282 (31%), Positives = 101/282 (35%)
Query: 128 GSYGGATGNSENETSGRPVGQNAYEDGYGVP--QGH-GPPPSATTAGVVGAG--PNTSTS 182
G G A N E G P G G P +G GPP A AG GA P
Sbjct: 792 GPPGPAGANGEKGEVGPP-GPAGSAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGE 850
Query: 183 AYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAP--SYDPTKGPSYDPAKG---- 234
A Q G A P+GP G +GP G K + P + A G
Sbjct: 851 QGEAGQKGD---AGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGP 907
Query: 235 PGYDPTKGP-GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY 293
PG + GP G G D +G RG S P R G +GP GP
Sbjct: 908 PGSNGNPGPPGPPGPSGK--DGPKGA-----RGDSGPPGRA-GEPGLQGP-----AGPPG 954
Query: 294 ETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYDPSRGTGFD 350
E PG D G E P + QRG G QRG+ G+ PS +P + G
Sbjct: 955 EKGE-PGDDGPSGA--EGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGK-QGAP 1010
Query: 351 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 392
GA P G V PP P G P R GS G P R
Sbjct: 1011 GASGDRGPPGPVGPPGLTGPAGE---PGREGS-PGADGPPGR 1048
>UNIPROTKB|E2RRS5 [details] [associations]
symbol:RBM12B "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0003676 "nucleic acid binding" evidence=IEA]
[GO:0000166 "nucleotide binding" evidence=IEA] InterPro:IPR000504
InterPro:IPR012677 PROSITE:PS50102 SMART:SM00360 GO:GO:0000166
Gene3D:3.30.70.330 GO:GO:0003676 GeneTree:ENSGT00620000087828
OMA:EHFRRPP CTD:389677 EMBL:AAEX03015951 RefSeq:XP_544177.3
Ensembl:ENSCAFT00000014490 GeneID:487048 KEGG:cfa:487048
NextBio:20860720 Uniprot:E2RRS5
Length = 994
Score = 124 (48.7 bits), Expect = 0.00011, Sum P(2) = 0.00011
Identities = 45/174 (25%), Positives = 71/174 (40%)
Query: 192 PMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGS 251
P + PR + + D + P D + P D + P D + P D ++
Sbjct: 591 PWEEGFRYPREEDFRYPREE--DWRRPPEEDFRRPPKDDFRRPPEEDWRRLPEGDFRRPP 648
Query: 252 NYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEA 311
D +R P D R P + +R D +R P D +R P + +R+P D +R P +
Sbjct: 649 EEDWRRPPEDDFRRLPQGEWRRPPEEDFRRPPEEDFRRLPEEDFRRLPEEDFRRPPEEDF 708
Query: 312 QRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPP 365
+R+P +R P D +R RR P + R + R H + PPP
Sbjct: 709 RRSPEEDFRRSPEEDFRRPPPEHFRRPPP-EHLRRPPPEHFRRPPPEHFRRPPP 761
Score = 50 (22.7 bits), Expect = 0.00011, Sum P(2) = 0.00011
Identities = 14/57 (24%), Positives = 24/57 (42%)
Query: 102 YITMATEVEKLRAELMNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVP 158
+++ E++K E+ + R GS GA+G + + A GYG P
Sbjct: 72 FLSSKAEMQKT-IEMRRTDRIGRERPGS--GASGAGSLSNFVEAIKEEASNSGYGSP 125
>UNIPROTKB|A7E348 [details] [associations]
symbol:PYGO2 "Uncharacterized protein" species:9913 "Bos
taurus" [GO:0060070 "canonical Wnt receptor signaling pathway"
evidence=IEA] [GO:0060021 "palate development" evidence=IEA]
[GO:0051569 "regulation of histone H3-K4 methylation" evidence=IEA]
[GO:0048589 "developmental growth" evidence=IEA] [GO:0042393
"histone binding" evidence=IEA] [GO:0033599 "regulation of mammary
gland epithelial cell proliferation" evidence=IEA] [GO:0030879
"mammary gland development" evidence=IEA] [GO:0009791
"post-embryonic development" evidence=IEA] [GO:0007420 "brain
development" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
[GO:0002088 "lens development in camera-type eye" evidence=IEA]
[GO:0001822 "kidney development" evidence=IEA] [GO:0001701 "in
utero embryonic development" evidence=IEA] [GO:0008270 "zinc ion
binding" evidence=IEA] InterPro:IPR001965 InterPro:IPR019787
Pfam:PF00628 PROSITE:PS50016 SMART:SM00249 GO:GO:0005634
GO:GO:0007420 GO:GO:0046872 GO:GO:0008270 GO:GO:0001701
GO:GO:0009791 GO:GO:0001822 Gene3D:3.30.40.10 InterPro:IPR011011
InterPro:IPR013083 SUPFAM:SSF57903 GO:GO:0048589 InterPro:IPR019786
PROSITE:PS01359 GO:GO:0060021 GO:GO:0060070 GO:GO:0030879
GO:GO:0033599 GO:GO:0051569 GO:GO:0002088 eggNOG:NOG72798
HOGENOM:HOG000001580 HOVERGEN:HBG053774
GeneTree:ENSGT00530000063948 CTD:90780 OMA:PGLVYPC
OrthoDB:EOG4QZ7MB EMBL:DAAA02007156 EMBL:BC151715 IPI:IPI00866934
RefSeq:NP_001095712.1 UniGene:Bt.102068 SMR:A7E348
Ensembl:ENSBTAT00000005670 GeneID:540401 KEGG:bta:540401
InParanoid:A7E348 NextBio:20878610 Uniprot:A7E348
Length = 405
Score = 123 (48.4 bits), Expect = 0.00011, P = 0.00011
Identities = 78/298 (26%), Positives = 111/298 (37%)
Query: 117 MNAPNVDRRADGSYGGATGN-SENETSGRP-----VGQNAYEDGYGVPQGHGPPPSATTA 170
M +P RR + G A + +E P V N +ED +G P+ G P +
Sbjct: 38 MKSPEKKRRKSNTQGPAYSHLTEFAPPPTPMVDHLVASNPFEDDFGAPKVGGAAPPFLGS 97
Query: 171 GVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTK-GPSY 229
V G Q G A +P G G GP + P + P+ GP++
Sbjct: 98 PVPFGG--------FRVQGGM----AGQVPPGYGTGGGGGPQPLRRQPPPFPPSPMGPAF 145
Query: 230 D-PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYD-PQRGLGY----DMQRGP 283
+ P +GPGY P + +Q ++ G N+ G P G G M + P
Sbjct: 146 NMPPQGPGYPPPGNMNFPSQP---FNQPLGQNFSPPGGQMMPGPVGGFGPMISPTMGQPP 202
Query: 284 NYDMQRGPGYETQRVPGYDVQRGPVYE--AQRAPSYIPQRGP--GYDLQ-RGQGYDMRRA 338
++ GP QR GP + Q PS P P G D G G +
Sbjct: 203 RGEL--GPPSLPQRFAQPGAPFGPSLQRPGQGLPSLPPNTSPFPGPDPGFPGPGGEDGGK 260
Query: 339 PSYDPSRGTGFDGAPRGAAP----HGQVPP-PLNNVPYGSATPPARSGS--GQPRGGN 389
P +P T F P +P +G P P N+ G TP A S + G+ GG+
Sbjct: 261 P-LNPPAATAFPQEPHSGSPAAAVNGNQPSFPPNSSGRGGGTPDANSLAPPGKAGGGS 317
>UNIPROTKB|Q5T171 [details] [associations]
symbol:PYGO2 "Pygopus homolog 2" species:9606 "Homo
sapiens" [GO:0008270 "zinc ion binding" evidence=IEA] [GO:0001701
"in utero embryonic development" evidence=IEA] [GO:0001822 "kidney
development" evidence=IEA] [GO:0002088 "lens development in
camera-type eye" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
[GO:0007420 "brain development" evidence=IEA] [GO:0009791
"post-embryonic development" evidence=IEA] [GO:0030879 "mammary
gland development" evidence=IEA] [GO:0033599 "regulation of mammary
gland epithelial cell proliferation" evidence=IEA] [GO:0042393
"histone binding" evidence=IEA] [GO:0048589 "developmental growth"
evidence=IEA] [GO:0051569 "regulation of histone H3-K4 methylation"
evidence=IEA] [GO:0060021 "palate development" evidence=IEA]
[GO:0060070 "canonical Wnt receptor signaling pathway"
evidence=IEA] InterPro:IPR001965 InterPro:IPR019787 Pfam:PF00628
PROSITE:PS50016 SMART:SM00249 GO:GO:0005634 GO:GO:0007420
GO:GO:0046872 GO:GO:0008270 GO:GO:0001701 GO:GO:0009791
GO:GO:0001822 EMBL:AL451085 Gene3D:3.30.40.10 InterPro:IPR011011
InterPro:IPR013083 SUPFAM:SSF57903 GO:GO:0048589 InterPro:IPR019786
PROSITE:PS01359 GO:GO:0060021 EMBL:CH471121 GO:GO:0060070
GO:GO:0030879 GO:GO:0033599 GO:GO:0051569 GO:GO:0002088
HOGENOM:HOG000001580 HOVERGEN:HBG053774 UniGene:Hs.533597
HGNC:HGNC:30257 IPI:IPI00642524 SMR:Q5T171 STRING:Q5T171
Ensembl:ENST00000368456 Uniprot:Q5T171
Length = 369
Score = 122 (48.0 bits), Expect = 0.00012, P = 0.00012
Identities = 80/302 (26%), Positives = 113/302 (37%)
Query: 117 MNAPNVDRRADGSYGGATGN-SENETSGRP-----VGQNAYEDGYGVPQ-GHGPPPSATT 169
M +P RR + G A + +E P V N +ED +G P+ G PP +
Sbjct: 1 MKSPEKKRRKSNTQGPAYSHLTEFAPPPTPMVDHLVASNPFEDDFGAPKVGVAAPPFLGS 60
Query: 170 AGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKA--PSYDPTK-G 226
G Q G A +P PGY G G + P + P G
Sbjct: 61 PVPFGG---------FRVQGGM----AGQVP--PGYSTGGGGGPQPLRRQPPPFPPNPMG 105
Query: 227 PSYD-PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYD-PQRGLGY----DMQ 280
P+++ P +GPGY P + +Q ++ G N+ G P G G M
Sbjct: 106 PAFNMPPQGPGYPPPGNMNFPSQP---FNQPLGQNFSPPSGQMMPGPVGGFGPMISPTMG 162
Query: 281 RGPNYDMQRGPGYETQRV--PGYDVQRGPVYE-AQRAPSYIPQRGP--GYDLQ-RGQGYD 334
+ P ++ GP +QR PG P+ Q PS P P G D G G +
Sbjct: 163 QPPRAEL--GPPSLSQRFAQPGAPFGPSPLQRPGQGLPSLPPNTSPFPGPDPGFPGPGGE 220
Query: 335 MRRAPSYDPSRGTGFDGAPRGAAP----HGQVPP-PLNNVPYGSATPPARSGS--GQPRG 387
P +P T F P +P +G P P N+ G TP A S + G+ G
Sbjct: 221 DGGKP-LNPPASTAFPQEPHSGSPAAAVNGNQPSFPPNSSGRGGGTPDANSLAPPGKAGG 279
Query: 388 GN 389
G+
Sbjct: 280 GS 281
>TAIR|locus:2140513 [details] [associations]
symbol:AT4G10070 "AT4G10070" species:3702 "Arabidopsis
thaliana" [GO:0003676 "nucleic acid binding" evidence=ISS]
[GO:0003723 "RNA binding" evidence=IEA] [GO:0005634 "nucleus"
evidence=ISM] [GO:0008150 "biological_process" evidence=ND]
[GO:0000956 "nuclear-transcribed mRNA catabolic process"
evidence=RCA] [GO:0009688 "abscisic acid biosynthetic process"
evidence=RCA] InterPro:IPR004087 InterPro:IPR004088 Pfam:PF00013
PROSITE:PS50084 SMART:SM00322 EMBL:CP002687 GO:GO:0003723
UniGene:At.33655 IPI:IPI01020077 RefSeq:NP_192745.2
ProteinModelPortal:F4JLJ3 SMR:F4JLJ3 EnsemblPlants:AT4G10070.1
GeneID:826598 KEGG:ath:AT4G10070 OMA:PSTHAIG ArrayExpress:F4JLJ3
Uniprot:F4JLJ3
Length = 725
Score = 126 (49.4 bits), Expect = 0.00012, P = 0.00012
Identities = 70/253 (27%), Positives = 87/253 (34%)
Query: 160 GHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGPGYDASKA 218
G PPPS GP S +Y QS P + P GY+ + G Y+ K
Sbjct: 442 GPVPPPSGPVPSPAFGGPPLSQVSYGYGQSHGP-EYGHAAPYSQTGYQQTYGQTYEQPKY 500
Query: 219 ---PSYDPTKGPSYDPAKG--PGYDPTKGPG---YDAQKG---SNYD----AQRGPNYDI 263
P P G SY PA G GY + PG Y Q+G Y A + D+
Sbjct: 501 DSNPPMQPPYGGSYPPAGGGQSGYYQMQQPGVRPYGMQQGPVQQGYGPPQPAAAASSGDV 560
Query: 264 -HRG-----PSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSY 317
++G PSY Q G Y GP + Q P Y P +A +
Sbjct: 561 PYQGATPAAPSYGSTNMAPQQQQYG--YTSSDGP-VQQQTYPSYS--SAPPSDAYNNGTQ 615
Query: 318 IPQRGPGYDLQRGQG----YDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNN-VPYG 372
P GP Y Q Q YD A + G G AP G + P + Y
Sbjct: 616 TPATGPAYQQQSVQPASSTYDQTGAQQA-AAAGYGGQVAPTGGYTYPTSQPAYGSQAAYS 674
Query: 373 SATPPARSGSGQP 385
A P QP
Sbjct: 675 QAAPTQTGYEQQP 687
>MGI|MGI:88452 [details] [associations]
symbol:Col2a1 "collagen, type II, alpha 1" species:10090 "Mus
musculus" [GO:0001501 "skeletal system development" evidence=ISO]
[GO:0001502 "cartilage condensation" evidence=IMP] [GO:0001894
"tissue homeostasis" evidence=IMP] [GO:0001958 "endochondral
ossification" evidence=IMP] [GO:0002062 "chondrocyte
differentiation" evidence=IMP] [GO:0003007 "heart morphogenesis"
evidence=IMP] [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] [GO:0005576 "extracellular region"
evidence=IEA] [GO:0005578 "proteinaceous extracellular matrix"
evidence=IEA] [GO:0005581 "collagen" evidence=IDA] [GO:0005585
"collagen type II" evidence=ISO;IDA;IMP] [GO:0005604 "basement
membrane" evidence=IDA] [GO:0005615 "extracellular space"
evidence=IDA] [GO:0005737 "cytoplasm" evidence=IDA] [GO:0006029
"proteoglycan metabolic process" evidence=IMP] [GO:0007601 "visual
perception" evidence=ISO] [GO:0007605 "sensory perception of sound"
evidence=ISO] [GO:0010468 "regulation of gene expression"
evidence=IMP] [GO:0030199 "collagen fibril organization"
evidence=ISO;IMP] [GO:0031012 "extracellular matrix" evidence=IDA]
[GO:0035108 "limb morphogenesis" evidence=IMP] [GO:0042472 "inner
ear morphogenesis" evidence=IMP] [GO:0042802 "identical protein
binding" evidence=IPI] [GO:0043066 "negative regulation of
apoptotic process" evidence=IMP] [GO:0046872 "metal ion binding"
evidence=IEA] [GO:0048407 "platelet-derived growth factor binding"
evidence=ISO] [GO:0048705 "skeletal system morphogenesis"
evidence=IMP] [GO:0048839 "inner ear development" evidence=IMP]
[GO:0051216 "cartilage development" evidence=IMP] [GO:0060021
"palate development" evidence=IMP] [GO:0060272 "embryonic skeletal
joint morphogenesis" evidence=ISO] [GO:0060348 "bone development"
evidence=IMP] [GO:0060351 "cartilage development involved in
endochondral bone morphogenesis" evidence=IMP] [GO:0071773
"cellular response to BMP stimulus" evidence=IDA]
InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
SMART:SM00038 SMART:SM00214 MGI:MGI:88452 GO:GO:0005737
GO:GO:0043066 GO:GO:0005615 GO:GO:0046872 GO:GO:0003007
GO:GO:0007601 GO:GO:0030199 GO:GO:0007417 GO:GO:0042472
GO:GO:0001894 GO:GO:0007605 GO:GO:0071773 InterPro:IPR008160
Pfam:PF01391 eggNOG:NOG12793 GO:GO:0071599 GO:GO:0005604
GO:GO:0001502 GO:GO:0060021 GO:GO:0002062 GO:GO:0010468
GO:GO:0060272 GO:GO:0006029 GO:GO:0001958 GO:GO:0060351
GO:GO:0005201 GeneTree:ENSGT00660000095287 HOVERGEN:HBG004933
KO:K06236 CTD:1280 OMA:SSCRICV GO:GO:0005585 GO:GO:0060174
GO:GO:0030903 OrthoDB:EOG4FTW1C ChiTaRS:COL2A1 EMBL:M65161
EMBL:BC030913 EMBL:BC051383 EMBL:BC052326 EMBL:BC082331 EMBL:S63190
EMBL:M63708 EMBL:M63709 EMBL:M63710 EMBL:AK028295 EMBL:X57982
IPI:IPI00471183 IPI:IPI00621255 IPI:IPI00622890 IPI:IPI00623625
IPI:IPI00828467 IPI:IPI00828653 IPI:IPI00828753 PIR:A41182
PIR:B41182 RefSeq:NP_001106987.2 RefSeq:NP_112440.2 UniGene:Mm.2423
PDB:2W65 PDBsum:2W65 ProteinModelPortal:P28481 SMR:P28481
IntAct:P28481 STRING:P28481 PhosphoSite:P28481 PRIDE:P28481
Ensembl:ENSMUST00000023123 Ensembl:ENSMUST00000088355 GeneID:12824
KEGG:mmu:12824 UCSC:uc007xlp.2 UCSC:uc007xlq.2 InParanoid:P28481
EvolutionaryTrace:P28481 NextBio:282306 Bgee:P28481
CleanEx:MM_COL2A1 Genevestigator:P28481
GermOnline:ENSMUSG00000022483 Uniprot:P28481
Length = 1487
Score = 129 (50.5 bits), Expect = 0.00014, P = 0.00014
Identities = 88/296 (29%), Positives = 110/296 (37%)
Query: 120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSAT---TAGVVGAG 176
P DR D GA G + G P G G P GPP + A + G
Sbjct: 132 PRGDR-GDKGEKGAPGPRGRD--GEP-GTPGNPGPAGPPGPPGPPGLSAGNFAAQMAGGY 187
Query: 177 PNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAK 233
+ A G PM PRGP G + GP G+ + +P GP P
Sbjct: 188 DEKAGGAQMGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GPRG 243
Query: 234 GPGYDPTKGPGYDAQKGS-NYDAQRG-PNYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQR 289
PG P PG D + G +RG P RG P GL G RG P D +
Sbjct: 244 PPG--PAGKPGDDGEAGKPGKSGERGLPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAK 299
Query: 290 G----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS---- 340
G PG + + PG + GP+ + P + GP +G D + P+
Sbjct: 300 GEAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPG 357
Query: 341 -YDPSRGTGFDGAP--RG-AAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNP 390
P+ G GF GAP +G A P G P GS P GS P G GNP
Sbjct: 358 PVGPAGGPGFPGAPGAKGEAGPTGARGP---EGAQGSRGEPGNPGSPGPAGASGNP 410
>UNIPROTKB|P05997 [details] [associations]
symbol:COL5A2 "Collagen alpha-2(V) chain" species:9606
"Homo sapiens" [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] [GO:0046872 "metal ion binding"
evidence=IEA] [GO:0001501 "skeletal system development"
evidence=IEA] [GO:0046332 "SMAD binding" evidence=IEA] [GO:0071230
"cellular response to amino acid stimulus" evidence=IEA]
[GO:0030199 "collagen fibril organization" evidence=ISS;IMP]
[GO:0043588 "skin development" evidence=ISS;IMP] [GO:0031012
"extracellular matrix" evidence=NAS] [GO:0003674
"molecular_function" evidence=ND] [GO:0048592 "eye morphogenesis"
evidence=IMP] [GO:0005588 "collagen type V" evidence=IMP]
[GO:0005576 "extracellular region" evidence=TAS] [GO:0005788
"endoplasmic reticulum lumen" evidence=TAS] [GO:0007411 "axon
guidance" evidence=TAS] [GO:0030198 "extracellular matrix
organization" evidence=TAS] InterPro:IPR000885 InterPro:IPR001007
Pfam:PF00093 Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208
PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038 SMART:SM00214
Reactome:REACT_118779 Reactome:REACT_111045 Reactome:REACT_111102
GO:GO:0007411 GO:GO:0043588 GO:GO:0046872 GO:GO:0030199
GO:GO:0005788 GO:GO:0001501 InterPro:IPR008160 Pfam:PF01391
eggNOG:NOG12793 GO:GO:0071230 GO:GO:0005201 GO:GO:0048592
HOVERGEN:HBG004933 KO:K06236 MIM:130000 Orphanet:90309
EMBL:AY016295 PDB:1A9A PDBsum:1A9A MIM:130010 Orphanet:90318
GO:GO:0005588 EMBL:Y14690 EMBL:AB209045 EMBL:AC064833 EMBL:AC133106
EMBL:J04478 EMBL:AY016288 EMBL:AY016287 EMBL:AY016289 EMBL:AY016290
EMBL:AY016291 EMBL:AY016292 EMBL:AY016293 EMBL:AY016294 EMBL:M58529
EMBL:X04758 EMBL:BC043613 EMBL:M10956 EMBL:M11135 EMBL:M11718
EMBL:J03051 IPI:IPI00739099 PIR:A31427 RefSeq:NP_000384.2
UniGene:Hs.445827 ProteinModelPortal:P05997 SMR:P05997
STRING:P05997 PhosphoSite:P05997 DMDM:143811378 PaxDb:P05997
PRIDE:P05997 Ensembl:ENST00000374866 GeneID:1290 KEGG:hsa:1290
UCSC:uc002uqk.3 CTD:1290 GeneCards:GC02M189861 HGNC:HGNC:2210
MIM:120190 neXtProt:NX_P05997 PharmGKB:PA26725 InParanoid:P05997
OMA:PDHKPVW OrthoDB:EOG4K0QMS ChiTaRS:COL5A2 GenomeRNAi:1290
NextBio:5223 PMAP-CutDB:P05997 ArrayExpress:P05997 Bgee:P05997
Genevestigator:P05997 GermOnline:ENSG00000204262 Uniprot:P05997
Length = 1499
Score = 129 (50.5 bits), Expect = 0.00014, P = 0.00014
Identities = 87/293 (29%), Positives = 109/293 (37%)
Query: 123 DRRADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNT 179
++ A+G+ G GA G P G E G P+G GPP S G G T
Sbjct: 784 EKGAEGTAGNDGARGLPGPLGPPGPAGPTG-EKGEPGPRGLVGPPGSRGNPGSRGENGPT 842
Query: 180 STSAYAATQS--GTP-MRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGP 235
+A Q G P ++ P G S GP G S P + P P +G
Sbjct: 843 GAVGFAGPQGPDGQPGVKGEPGEPGQKGDAGSPGPQGLAGSPGP-HGPNGVPGLKGGRGT 901
Query: 236 GYDP--TKGPGYDAQKGSNYDAQR-GPNYDIHRGPSYDPQRGLGYDM-------QRGPNY 285
P T PG + G A GP + P + GL D RGP
Sbjct: 902 QGPPGATGFPGSAGRVGPPGPAGAPGPAGPLGE-PGKEGPPGLRGDPGSHGRVGDRGPA- 959
Query: 286 DMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYD 342
GPG + PG D Q GP + P+ QRG G QRG+ G P+
Sbjct: 960 GPPGGPGDKGD--PGEDGQPGP--DGPPGPAGTTGQRGIVGMPGQRGERGMPGLPGPAGT 1015
Query: 343 PSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP-RGGNPARR 393
P + G GA P G V PP +N P G P +G+ G P R G R
Sbjct: 1016 PGK-VGPTGATGDKGPPGPVGPPGSNGPVGEPGPEGPAGNDGTPGRDGAVGER 1067
>UNIPROTKB|D3ZZM1 [details] [associations]
symbol:Taf15 "Protein Taf15" species:10116 "Rattus
norvegicus" [GO:0000166 "nucleotide binding" evidence=IEA]
[GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005622
"intracellular" evidence=IEA] [GO:0008270 "zinc ion binding"
evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358
PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360 SMART:SM00547
RGD:1309595 GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330
GO:GO:0003676 GO:GO:0005622 EMBL:AC119615 IPI:IPI00950003
ProteinModelPortal:D3ZZM1 Ensembl:ENSRNOT00000064396
ArrayExpress:D3ZZM1 Uniprot:D3ZZM1
Length = 558
Score = 124 (48.7 bits), Expect = 0.00014, P = 0.00014
Identities = 67/238 (28%), Positives = 89/238 (37%)
Query: 124 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQ-GHGPPPSATTAGVVGAGPNTSTS 182
RR + GG +G G G+ ++ G P+ G P+ + + A N+
Sbjct: 318 RRPEFMRGGGSGGGRRGRGGYR-GRGGFQGRGGDPKNGDWVCPNPSCGNMNFARRNSCNQ 376
Query: 183 AYAAT-QSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK 241
+ P + RG GY +G + D +G G GY +
Sbjct: 377 CNEPRPEDSRPSGGDF---RGRGYGGERG--FRGRGGRGGD--RGGYGADRSGGGYGGDR 429
Query: 242 GPG-YDAQK-GSNYDAQR-GPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRV 298
G Y A + G Y R G Y RG Y RG GY RG +Y RG GY R
Sbjct: 430 SGGSYGADRSGGGYGGDRSGGGYGGDRGGGYGGDRG-GYGGDRGGSYGGDRG-GYGGDR- 486
Query: 299 PGYDVQRGPVYEAQRAP-SYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
GY RG Y R+ +Y RG G GY R+ Y RG G+ G RG
Sbjct: 487 GGYGGDRGG-YGGDRSRGAYGGDRGGG-----SGGYGGDRSGGYGGDRGGGY-GGDRG 537
>UNIPROTKB|Q9BRQ0 [details] [associations]
symbol:PYGO2 "Pygopus homolog 2" species:9606 "Homo
sapiens" [GO:0008270 "zinc ion binding" evidence=IEA] [GO:0001701
"in utero embryonic development" evidence=IEA] [GO:0001822 "kidney
development" evidence=IEA] [GO:0002088 "lens development in
camera-type eye" evidence=IEA] [GO:0007420 "brain development"
evidence=IEA] [GO:0009791 "post-embryonic development"
evidence=IEA] [GO:0030879 "mammary gland development" evidence=IEA]
[GO:0033599 "regulation of mammary gland epithelial cell
proliferation" evidence=IEA] [GO:0042393 "histone binding"
evidence=IEA] [GO:0048589 "developmental growth" evidence=IEA]
[GO:0051569 "regulation of histone H3-K4 methylation" evidence=IEA]
[GO:0060021 "palate development" evidence=IEA] [GO:0060070
"canonical Wnt receptor signaling pathway" evidence=IEA]
[GO:0005634 "nucleus" evidence=IEA] InterPro:IPR001965
InterPro:IPR019787 Pfam:PF00628 PROSITE:PS50016 SMART:SM00249
GO:GO:0005634 GO:GO:0007420 GO:GO:0046872 GO:GO:0008270
GO:GO:0001701 GO:GO:0009791 GO:GO:0001822 Gene3D:3.30.40.10
InterPro:IPR011011 InterPro:IPR013083 SUPFAM:SSF57903 GO:GO:0048589
InterPro:IPR019786 PROSITE:PS01359 GO:GO:0060021 GO:GO:0060070
GO:GO:0030879 GO:GO:0033599 PDB:2XB1 PDBsum:2XB1 GO:GO:0051569
GO:GO:0002088 eggNOG:NOG72798 HOGENOM:HOG000001580
HOVERGEN:HBG053774 EMBL:AF457208 EMBL:BC006132 EMBL:BC013725
EMBL:BC032099 EMBL:AF289598 IPI:IPI00042099 RefSeq:NP_612157.1
UniGene:Hs.533597 ProteinModelPortal:Q9BRQ0 SMR:Q9BRQ0
IntAct:Q9BRQ0 STRING:Q9BRQ0 PhosphoSite:Q9BRQ0 DMDM:23396825
PaxDb:Q9BRQ0 PRIDE:Q9BRQ0 DNASU:90780 Ensembl:ENST00000368457
GeneID:90780 KEGG:hsa:90780 UCSC:uc001fft.3 CTD:90780
GeneCards:GC01M154929 HGNC:HGNC:30257 HPA:HPA023689 MIM:606903
neXtProt:NX_Q9BRQ0 PharmGKB:PA134881185 InParanoid:Q9BRQ0
OMA:PGLVYPC OrthoDB:EOG4QZ7MB PhylomeDB:Q9BRQ0 GenomeRNAi:90780
NextBio:76956 ArrayExpress:Q9BRQ0 Bgee:Q9BRQ0 CleanEx:HS_PYGO2
Genevestigator:Q9BRQ0 GermOnline:ENSG00000163348 Uniprot:Q9BRQ0
Length = 406
Score = 122 (48.0 bits), Expect = 0.00014, P = 0.00014
Identities = 80/302 (26%), Positives = 113/302 (37%)
Query: 117 MNAPNVDRRADGSYGGATGN-SENETSGRP-----VGQNAYEDGYGVPQ-GHGPPPSATT 169
M +P RR + G A + +E P V N +ED +G P+ G PP +
Sbjct: 38 MKSPEKKRRKSNTQGPAYSHLTEFAPPPTPMVDHLVASNPFEDDFGAPKVGVAAPPFLGS 97
Query: 170 AGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKA--PSYDPTK-G 226
G Q G A +P PGY G G + P + P G
Sbjct: 98 PVPFGG---------FRVQGGM----AGQVP--PGYSTGGGGGPQPLRRQPPPFPPNPMG 142
Query: 227 PSYD-PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYD-PQRGLGY----DMQ 280
P+++ P +GPGY P + +Q ++ G N+ G P G G M
Sbjct: 143 PAFNMPPQGPGYPPPGNMNFPSQP---FNQPLGQNFSPPSGQMMPGPVGGFGPMISPTMG 199
Query: 281 RGPNYDMQRGPGYETQRV--PGYDVQRGPVYE-AQRAPSYIPQRGP--GYDLQ-RGQGYD 334
+ P ++ GP +QR PG P+ Q PS P P G D G G +
Sbjct: 200 QPPRAEL--GPPSLSQRFAQPGAPFGPSPLQRPGQGLPSLPPNTSPFPGPDPGFPGPGGE 257
Query: 335 MRRAPSYDPSRGTGFDGAPRGAAP----HGQVPP-PLNNVPYGSATPPARSGS--GQPRG 387
P +P T F P +P +G P P N+ G TP A S + G+ G
Sbjct: 258 DGGKP-LNPPASTAFPQEPHSGSPAAAVNGNQPSFPPNSSGRGGGTPDANSLAPPGKAGG 316
Query: 388 GN 389
G+
Sbjct: 317 GS 318
>RGD|1311417 [details] [associations]
symbol:Col7a1 "collagen, type VII, alpha 1" species:10116
"Rattus norvegicus" [GO:0003674 "molecular_function" evidence=ND]
[GO:0005575 "cellular_component" evidence=ND] [GO:0005604 "basement
membrane" evidence=ISO] [GO:0008150 "biological_process"
evidence=ND] InterPro:IPR002035 InterPro:IPR003961 Pfam:PF00041
Pfam:PF00092 PROSITE:PS50234 PROSITE:PS50853 SMART:SM00060
SMART:SM00327 RGD:1311417 Gene3D:2.60.40.10 InterPro:IPR013783
SUPFAM:SSF49265 InterPro:IPR008160 Pfam:PF01391 IPI:IPI00951759
Ensembl:ENSRNOT00000066518 UCSC:RGD:1311417 ArrayExpress:D3ZQ14
Uniprot:D3ZQ14
Length = 2585
Score = 131 (51.2 bits), Expect = 0.00015, P = 0.00015
Identities = 75/262 (28%), Positives = 96/262 (36%)
Query: 143 GRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG 202
G P G + G P GPP S GV G+ + ++ + R P+G
Sbjct: 1285 GAP-GSTQAKGERGFPGPEGPPGSPGLPGVPGSPGVKGSPGWSGPRGDRGERGPQG-PKG 1342
Query: 203 ----PGYEASKG-PGYDASKAPSYDPTKGPSYDPAK-GPGYDPTKGP-GYDAQKGSNYDA 255
PG G PG K DP GPS P GP DP GP G G++
Sbjct: 1343 EPGEPGQVIGGGRPGLPGKKG---DP--GPSGPPGPHGPLGDP--GPRGPPGLPGTSVKG 1395
Query: 256 QRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVPGYDVQRGPVYEAQRA 314
+G + RGP P G G Q P G PG Q PG ++G + +
Sbjct: 1396 DKGDRGE--RGP---PGPGTGASEQGSPGLPGLPGSPG--PQGPPGRTGEKGEKGDCEDG 1448
Query: 315 PSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGT-GFDGAPRGAAPHGQVPPPLNNVPYGS 373
+P + PG + G +R AP +G G G P G+ PP P G
Sbjct: 1449 GPGLPGQ-PGVPGEPG----LRGAPGVTGPKGDRGLTGTPGEPGEKGERGPPGPVGPQGL 1503
Query: 374 ATPPARSGSGQPRG--GNPARR 393
R G P G G P RR
Sbjct: 1504 PGAAGRPGVEGPEGPPGPPGRR 1525
>ZFIN|ZDB-GENE-030516-3 [details] [associations]
symbol:col18a1 "collagen type XVIII, alpha 1"
species:7955 "Danio rerio" [GO:0031012 "extracellular matrix"
evidence=IEA] [GO:0005198 "structural molecule activity"
evidence=IEA] [GO:0007155 "cell adhesion" evidence=IEA] [GO:0005581
"collagen" evidence=IEA] InterPro:IPR010515 InterPro:IPR020067
Pfam:PF01392 Pfam:PF06482 PROSITE:PS50038 ZFIN:ZDB-GENE-030516-3
GO:GO:0005198 Gene3D:3.10.100.10 InterPro:IPR016186
InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0007155 InterPro:IPR008985
SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
InterPro:IPR001791 SMART:SM00282 Gene3D:1.10.2000.10
SUPFAM:SSF63501 SMART:SM00210 GeneTree:ENSGT00700000104250
HOGENOM:HOG000231591 HOVERGEN:HBG053241 EMBL:BX927363 EMBL:CT030212
IPI:IPI00616856 UniGene:Dr.52833 SMR:B0S8G4
Ensembl:ENSDART00000130434 OMA:DRFNRYD Uniprot:B0S8G4
Length = 1645
Score = 129 (50.5 bits), Expect = 0.00015, P = 0.00015
Identities = 73/277 (26%), Positives = 99/277 (35%)
Query: 125 RADGSYGGATGNSENETSGRPVGQNAYE--DGYGVPQGHGP--PPSATTAGVVGA-GPNT 179
+ D G +G G P G+ + G+G P G PP G G GP
Sbjct: 609 KGDVGSGSVSGGGSKGDKGVP-GEKGMKGTSGFGYPGSKGDRGPP-----GPPGPPGPQG 662
Query: 180 STSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGY 237
++ G+ ++ PRGP G + GP G + + K P+ PG
Sbjct: 663 PSAEVEVRGDGSVVQKVTG-PRGPPGPQGPPGPPGPEGEPGDPGEDGKAGQVGPSGFPGN 721
Query: 238 DPTKGP-GYDAQKGSNYDAQRGP-NYDIHRGPSYDPQRGLGYDMQRGPNYDMQ--RG-PG 292
GP G +G + RGP GPS R DM+ G +DM R PG
Sbjct: 722 PGNPGPKGDKGDRGESQPGPRGPPGPPGPPGPSSGFDRPTFVDME-GSGFDMDSVRAVPG 780
Query: 293 YETQRVPGYDVQRGPVYEAQRAPS-YIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDG 351
PG GP A + P PG + GQ + P D G
Sbjct: 781 LPGP--PGPPGPPGPPGSASSGSGGFGPPGPPGQNGAPGQP-GLSGVPGADGKPGLPGPK 837
Query: 352 APRGAAPHGQVPPPLNNV-PYGSATPPARSGSGQPRG 387
+G A +P P+ GS+ PP +G G P G
Sbjct: 838 GEKGDAGELGLPGPVGEKGAKGSSGPPGTTGIGGPAG 874
>UNIPROTKB|O46392 [details] [associations]
symbol:COL1A2 "Collagen alpha-2(I) chain" species:9615
"Canis lupus familiaris" [GO:0046872 "metal ion binding"
evidence=IEA] [GO:0005581 "collagen" evidence=IEA] [GO:0005201
"extracellular matrix structural constituent" evidence=IEA]
InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
SMART:SM00038 GO:GO:0046872 GO:GO:0005581 InterPro:IPR008160
Pfam:PF01391 eggNOG:NOG12793 GO:GO:0005201 HOGENOM:HOG000085654
HOVERGEN:HBG004933 KO:K06236 CTD:1278 EMBL:AF035120
RefSeq:NP_001003187.1 UniGene:Cfa.1262 STRING:O46392 GeneID:403824
KEGG:cfa:403824 NextBio:20817320 Uniprot:O46392
Length = 1366
Score = 128 (50.1 bits), Expect = 0.00016, P = 0.00016
Identities = 86/283 (30%), Positives = 105/283 (37%)
Query: 132 GATG-NSENETSGRP--VGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQ 188
GA G +G P G G+P G + G+VG P + S +
Sbjct: 301 GANGLTGAKGAAGLPGVAGAPGLPGPRGIPGPVGAAGATGARGIVGE-PGPAGSKGESGN 359
Query: 189 SGTPMRAAYDIPRGP-GYEASKGPGYDASKA-PSYDPTKGPSYDP-AKG-PGYDPTKGP- 243
G P A P GP G E +GP +A A PS P G P ++G PG D G
Sbjct: 360 KGEPGSAGAQGPPGPSGEEGKRGPNGEAGSAGPSGPP--GLRGSPGSRGLPGADGPAGVM 417
Query: 244 GYDAQKGSNYDAQ-RGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRGP-GYE-TQRVP 299
G +G+ A RGPN D R P +P G RG P GP G E +P
Sbjct: 418 GPPGPRGATGPAGVRGPNGDSGR-PG-EP----GLMGPRGFPGAPGNVGPAGKEGPMGLP 471
Query: 300 GYDVQRGPVYEA--QRAPSYIPQRGP-GYDLQRGQGYDMRRAPSYDPSRGT-GFDGAPRG 355
G D + GP+ A + P I GP G G+ D A +RG G DG
Sbjct: 472 GIDGRPGPIGPAGARGEPGNIGFPGPKGPTGDPGKNGDKGHA-GLAGARGAPGPDGNNGA 530
Query: 356 AAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-----GNPARR 393
P G G A PP G P G G P R
Sbjct: 531 QGPPGPQGVQGGKGEQGPAGPPGFQGLPGPAGTAGEVGKPGER 573
>UNIPROTKB|F1KQQ4 [details] [associations]
symbol:F1KQQ4 "Collagen alpha-1(IV) chain" species:6253
"Ascaris suum" [GO:0031012 "extracellular matrix" evidence=IDA]
InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005581 InterPro:IPR008160
Pfam:PF01391 GO:GO:0005201 Gene3D:2.170.240.10 EMBL:JI164326
Uniprot:F1KQQ4
Length = 1759
Score = 129 (50.5 bits), Expect = 0.00016, P = 0.00016
Identities = 86/285 (30%), Positives = 105/285 (36%)
Query: 128 GSYGGATGNSENETSGRP--VGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYA 185
G G A N G P G+ + +G P GP + +G+ GA P
Sbjct: 1164 GIPGDAGFNGRAGLPGLPGIKGERGQDGQHGYPGEPGPVGAHGESGLTGA-PGLQGEPGL 1222
Query: 186 ATQSGTPMR----AAYDIPRGPGYEASKG----PGYDASKA-PSYD--PTKGPSYDPAKG 234
+ G P + A P PG E G G D P D P +GP D A
Sbjct: 1223 PGRMGLPGQPGELGAPGFPGAPGLEGIPGIRGERGDDGLPGLPGIDGIPIQGPEGD-AGY 1281
Query: 235 PGYDPTKG-PGYDAQKGSNYDAQRG-PNYDIHRG----PSYDPQRGL-GYDMQRGPNYDM 287
PG D G PG Q+G D G P RG P Y +RGL G D +RGP D
Sbjct: 1282 PGRDGNDGLPGLPGQRGD--DGLPGLPGLIGERGDDGLPGYPGERGLRGIDGKRGP--DG 1337
Query: 288 QRG-PGYE-TQRVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPS 344
RG PG PG +RG P + + G PGY +RG+ P
Sbjct: 1338 ARGLPGPPGLDGYPGAPGERG----MDGLPGFPGKDGIPGYPGERGEV----GLPGLPGM 1389
Query: 345 RGT-GFDGAPRGAAPHG-QVPPPLNNVPYGSATPPARSGSGQPRG 387
RG G G P A G + L +P G P G P G
Sbjct: 1390 RGEDGLPGLPGLAGQKGARGDDGLPGLP-GLPGPVGARGRPGPPG 1433
>UNIPROTKB|F1LNY9 [details] [associations]
symbol:Col2a1 "Collagen alpha-1(II) chain" species:10116
"Rattus norvegicus" [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
SMART:SM00038 SMART:SM00214 RGD:2375 GO:GO:0005581
InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 IPI:IPI00558825
Ensembl:ENSRNOT00000049994 ArrayExpress:F1LNY9 Uniprot:F1LNY9
Length = 1441
Score = 128 (50.1 bits), Expect = 0.00017, P = 0.00017
Identities = 81/280 (28%), Positives = 105/280 (37%)
Query: 124 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS 182
R GA GN P G G G P G P + AG GA GP +
Sbjct: 288 RTGPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFP---GAPGAKGEAGPTGARGPEGAQG 344
Query: 183 AYAATQS-GTPMRAAYDIPRG----PGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGP- 235
+ + G+P A G PG + S G PG + AP + +GP P GP
Sbjct: 345 SRGEPGNPGSPGPAGASGNPGTDGIPGAKGSAGAPGI--AGAPGFPGPRGP---P--GPQ 397
Query: 236 GYDPTKGP-GYDAQKG-SNYDAQRGPNYDIH-RGPSYDPQRGLGYDMQRGPNYDMQ-RGP 291
G GP G + G + + ++GP + GP P G + +RG + GP
Sbjct: 398 GATGPLGPKGQTGEPGIAGFKGEQGPKGETGPAGPQGAPGPA-GEEGKRGARGEPGGAGP 456
Query: 292 -GYETQR-VPGYDVQRG-PVYEAQRAPSYIP-QRGP-GYDLQRGQGYDMRRAPSYDPSRG 346
G +R PG RG P + P P +RGP G +G D R
Sbjct: 457 IGPPGERGAPG---NRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEPGLPGA 513
Query: 347 TGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP 385
G G P A P G+V P G PP G+ GQP
Sbjct: 514 RGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQP 553
Score = 125 (49.1 bits), Expect = 0.00036, P = 0.00036
Identities = 87/281 (30%), Positives = 99/281 (35%)
Query: 126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 183
ADG G E G G + G P GP G GA GP +T
Sbjct: 795 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 853
Query: 184 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 239
AA + G P P GP GP G D K D G + DP +GP P
Sbjct: 854 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 908
Query: 240 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 295
KG PG D GS D GP +G + QRG+ G QRG + GP E
Sbjct: 909 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 959
Query: 296 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 351
Q PG RGP P PG + G R A RG TG G
Sbjct: 960 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 1019
Query: 352 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 391
AP P G P P P G +G+ P G PA
Sbjct: 1020 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1057
>UNIPROTKB|F1LQ06 [details] [associations]
symbol:Col2a1 "Collagen alpha-1(II) chain" species:10116
"Rattus norvegicus" [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
SMART:SM00038 SMART:SM00214 RGD:2375 GO:GO:0005581
InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 IPI:IPI00949996
Ensembl:ENSRNOT00000066385 ArrayExpress:F1LQ06 Uniprot:F1LQ06
Length = 1441
Score = 128 (50.1 bits), Expect = 0.00017, P = 0.00017
Identities = 81/280 (28%), Positives = 105/280 (37%)
Query: 124 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS 182
R GA GN P G G G P G P + AG GA GP +
Sbjct: 288 RTGPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFP---GAPGAKGEAGPTGARGPEGAQG 344
Query: 183 AYAATQS-GTPMRAAYDIPRG----PGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGP- 235
+ + G+P A G PG + S G PG + AP + +GP P GP
Sbjct: 345 SRGEPGNPGSPGPAGASGNPGTDGIPGAKGSAGAPGI--AGAPGFPGPRGP---P--GPQ 397
Query: 236 GYDPTKGP-GYDAQKG-SNYDAQRGPNYDIH-RGPSYDPQRGLGYDMQRGPNYDMQ-RGP 291
G GP G + G + + ++GP + GP P G + +RG + GP
Sbjct: 398 GATGPLGPKGQTGEPGIAGFKGEQGPKGETGPAGPQGAPGPA-GEEGKRGARGEPGGAGP 456
Query: 292 -GYETQR-VPGYDVQRG-PVYEAQRAPSYIP-QRGP-GYDLQRGQGYDMRRAPSYDPSRG 346
G +R PG RG P + P P +RGP G +G D R
Sbjct: 457 IGPPGERGAPG---NRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEPGLPGA 513
Query: 347 TGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP 385
G G P A P G+V P G PP G+ GQP
Sbjct: 514 RGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQP 553
Score = 125 (49.1 bits), Expect = 0.00036, P = 0.00036
Identities = 87/281 (30%), Positives = 99/281 (35%)
Query: 126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 183
ADG G E G G + G P GP G GA GP +T
Sbjct: 795 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 853
Query: 184 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 239
AA + G P P GP GP G D K D G + DP +GP P
Sbjct: 854 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 908
Query: 240 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 295
KG PG D GS D GP +G + QRG+ G QRG + GP E
Sbjct: 909 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 959
Query: 296 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 351
Q PG RGP P PG + G R A RG TG G
Sbjct: 960 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 1019
Query: 352 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 391
AP P G P P P G +G+ P G PA
Sbjct: 1020 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1057
>UNIPROTKB|F1M8G1 [details] [associations]
symbol:Col2a1 "Collagen alpha-1(II) chain" species:10116
"Rattus norvegicus" [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
SMART:SM00038 SMART:SM00214 RGD:2375 GO:GO:0005581
InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 IPI:IPI00475975
Ensembl:ENSRNOT00000050833 ArrayExpress:F1M8G1 Uniprot:F1M8G1
Length = 1458
Score = 128 (50.1 bits), Expect = 0.00017, P = 0.00017
Identities = 81/280 (28%), Positives = 105/280 (37%)
Query: 124 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS 182
R GA GN P G G G P G P + AG GA GP +
Sbjct: 305 RTGPAGAAGARGNDGQPGPAGPPGPVGPAGGPGFP---GAPGAKGEAGPTGARGPEGAQG 361
Query: 183 AYAATQS-GTPMRAAYDIPRG----PGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGP- 235
+ + G+P A G PG + S G PG + AP + +GP P GP
Sbjct: 362 SRGEPGNPGSPGPAGASGNPGTDGIPGAKGSAGAPGI--AGAPGFPGPRGP---P--GPQ 414
Query: 236 GYDPTKGP-GYDAQKG-SNYDAQRGPNYDIH-RGPSYDPQRGLGYDMQRGPNYDMQ-RGP 291
G GP G + G + + ++GP + GP P G + +RG + GP
Sbjct: 415 GATGPLGPKGQTGEPGIAGFKGEQGPKGETGPAGPQGAPGPA-GEEGKRGARGEPGGAGP 473
Query: 292 -GYETQR-VPGYDVQRG-PVYEAQRAPSYIP-QRGP-GYDLQRGQGYDMRRAPSYDPSRG 346
G +R PG RG P + P P +RGP G +G D R
Sbjct: 474 IGPPGERGAPG---NRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEPGLPGA 530
Query: 347 TGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP 385
G G P A P G+V P G PP G+ GQP
Sbjct: 531 RGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQP 570
Score = 125 (49.1 bits), Expect = 0.00036, P = 0.00036
Identities = 87/281 (30%), Positives = 99/281 (35%)
Query: 126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 183
ADG G E G G + G P GP G GA GP +T
Sbjct: 812 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 870
Query: 184 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 239
AA + G P P GP GP G D K D G + DP +GP P
Sbjct: 871 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 925
Query: 240 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 295
KG PG D GS D GP +G + QRG+ G QRG + GP E
Sbjct: 926 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 976
Query: 296 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 351
Q PG RGP P PG + G R A RG TG G
Sbjct: 977 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 1036
Query: 352 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 391
AP P G P P P G +G+ P G PA
Sbjct: 1037 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1074
Score = 123 (48.4 bits), Expect = 0.00060, P = 0.00060
Identities = 87/286 (30%), Positives = 109/286 (38%)
Query: 127 DGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAA 186
DG+ G + E T G P G G P G G A A + G + A
Sbjct: 113 DGAPGPRGRDGEPGTPGNP-GPPGPPGPPG-PPGLGGGNFA--AQMAGGFDEKAGGAQMG 168
Query: 187 TQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAKGPGYDPTKGP 243
G PM PRGP G + GP G+ + +P GP P PG P P
Sbjct: 169 VMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GPRGPPG--PAGKP 222
Query: 244 GYDAQKGSNYDA-QRG-PNYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQRG----PGYET 295
G D + G A +RG P RG P GL G RG P D +G PG +
Sbjct: 223 GDDGEAGKPGKAGERGLPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAKGEAGAPGVKG 280
Query: 296 QR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS-----YDPSRGTGF 349
+ PG + GP+ + P + GP +G D + P+ P+ G GF
Sbjct: 281 ESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPGPVGPAGGPGF 338
Query: 350 DGAP--RG-AAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNP 390
GAP +G A P G P GS P GS P G GNP
Sbjct: 339 PGAPGAKGEAGPTGARGP---EGAQGSRGEPGNPGSPGPAGASGNP 381
>UNIPROTKB|Q9XSK0 [details] [associations]
symbol:CRX "Cone-rod homeobox protein" species:9913 "Bos
taurus" [GO:0060041 "retina development in camera-type eye"
evidence=IEA] [GO:0045944 "positive regulation of transcription
from RNA polymerase II promoter" evidence=IEA] [GO:0043522 "leucine
zipper domain binding" evidence=IEA] [GO:0005667 "transcription
factor complex" evidence=IEA] [GO:0003700 "sequence-specific DNA
binding transcription factor activity" evidence=IEA] [GO:0003682
"chromatin binding" evidence=IEA] [GO:0006351 "transcription,
DNA-dependent" evidence=IEA] [GO:0043565 "sequence-specific DNA
binding" evidence=IEA] InterPro:IPR001356 InterPro:IPR009057
InterPro:IPR013851 InterPro:IPR017970 Pfam:PF00046 Pfam:PF03529
PROSITE:PS00027 PROSITE:PS50071 SMART:SM00389 GO:GO:0043565
GO:GO:0045944 GO:GO:0003700 GO:GO:0006351 GO:GO:0003682
Gene3D:1.10.10.60 SUPFAM:SSF46689 GO:GO:0005667 GO:GO:0060041
EMBL:AF154123 IPI:IPI00695402 RefSeq:NP_776329.1 UniGene:Bt.283
ProteinModelPortal:Q9XSK0 SMR:Q9XSK0 STRING:Q9XSK0 PRIDE:Q9XSK0
Ensembl:ENSBTAT00000028232 GeneID:280756 KEGG:bta:280756 CTD:1406
eggNOG:NOG324074 GeneTree:ENSGT00700000104128 HOGENOM:HOG000082677
HOVERGEN:HBG004028 InParanoid:Q9XSK0 KO:K09337 OMA:QTKARPA
OrthoDB:EOG4NKBWG NextBio:20804923 Uniprot:Q9XSK0
Length = 299
Score = 119 (46.9 bits), Expect = 0.00017, P = 0.00017
Identities = 29/96 (30%), Positives = 42/96 (43%)
Query: 158 PQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASK 217
P P P A AG+V +GP+ +++ YA T + P A P G +S G D
Sbjct: 165 PASESPLPEAQRAGLVASGPSLTSAPYAMTYA--PASAFCSSPSAYGSPSSYFSGLDPYL 222
Query: 218 APSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNY 253
+P P GP+ P GP P+ + G +Y
Sbjct: 223 SPMVPPLGGPALSPLSGPSVGPSLTQSPTSLSGQSY 258
>UNIPROTKB|J9P2F0 [details] [associations]
symbol:ZNF768 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0008270 "zinc ion binding" evidence=IEA]
[GO:0005622 "intracellular" evidence=IEA] [GO:0003676 "nucleic acid
binding" evidence=IEA] Pfam:PF00096 InterPro:IPR007087
InterPro:IPR013087 InterPro:IPR015880 PROSITE:PS00028
PROSITE:PS50157 SMART:SM00355 GO:GO:0008270 GO:GO:0003676
GO:GO:0005622 Gene3D:3.30.160.60 GeneTree:ENSGT00700000104520
EMBL:AAEX03004391 Ensembl:ENSCAFT00000043076 Uniprot:J9P2F0
Length = 540
Score = 123 (48.4 bits), Expect = 0.00017, P = 0.00017
Identities = 39/146 (26%), Positives = 66/146 (45%)
Query: 127 DGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAA 186
+GS G +E E + G YE +P G P + G + + +
Sbjct: 25 EGSLKGNMSENEEEEMSQQEGTGDYEVEE-IP--FGLDPQSPGFEPQSPGFESQSPRFEP 81
Query: 187 TQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYD 246
G R+ +P P + A + P D S++P ++P + P Y+P + PGY+P + PGY+
Sbjct: 82 ESPGFESRSPGFVPPSPEF-APRSPDSD-SQSPEFEP-QSPRYEP-QSPGYEP-RSPGYE 136
Query: 247 AQKGSNYDAQRGPNYDIHRGPSYDPQ 272
K Y++Q P Y+ + P + Q
Sbjct: 137 P-KSPGYESQ-SPGYE-PQNPEFKTQ 159
>UNIPROTKB|F1PS24 [details] [associations]
symbol:COL2A1 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0071773 "cellular response to BMP stimulus"
evidence=IEA] [GO:0071599 "otic vesicle development" evidence=IEA]
[GO:0060351 "cartilage development involved in endochondral bone
morphogenesis" evidence=IEA] [GO:0060272 "embryonic skeletal joint
morphogenesis" evidence=IEA] [GO:0060174 "limb bud formation"
evidence=IEA] [GO:0060021 "palate development" evidence=IEA]
[GO:0048407 "platelet-derived growth factor binding" evidence=IEA]
[GO:0043066 "negative regulation of apoptotic process"
evidence=IEA] [GO:0042802 "identical protein binding" evidence=IEA]
[GO:0042472 "inner ear morphogenesis" evidence=IEA] [GO:0030903
"notochord development" evidence=IEA] [GO:0030199 "collagen fibril
organization" evidence=IEA] [GO:0010468 "regulation of gene
expression" evidence=IEA] [GO:0007605 "sensory perception of sound"
evidence=IEA] [GO:0007601 "visual perception" evidence=IEA]
[GO:0007417 "central nervous system development" evidence=IEA]
[GO:0006029 "proteoglycan metabolic process" evidence=IEA]
[GO:0005737 "cytoplasm" evidence=IEA] [GO:0005615 "extracellular
space" evidence=IEA] [GO:0005604 "basement membrane" evidence=IEA]
[GO:0005585 "collagen type II" evidence=IEA] [GO:0003007 "heart
morphogenesis" evidence=IEA] [GO:0002062 "chondrocyte
differentiation" evidence=IEA] [GO:0001958 "endochondral
ossification" evidence=IEA] [GO:0001894 "tissue homeostasis"
evidence=IEA] [GO:0001502 "cartilage condensation" evidence=IEA]
[GO:0005201 "extracellular matrix structural constituent"
evidence=IEA] InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093
Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184
PROSITE:PS51461 SMART:SM00038 SMART:SM00214 GO:GO:0005737
GO:GO:0043066 GO:GO:0005615 GO:GO:0003007 GO:GO:0007601
GO:GO:0030199 GO:GO:0007417 GO:GO:0042472 GO:GO:0001894
GO:GO:0007605 GO:GO:0071773 InterPro:IPR008160 Pfam:PF01391
GO:GO:0071599 GO:GO:0005604 GO:GO:0001502 GO:GO:0060021
GO:GO:0002062 GO:GO:0010468 GO:GO:0060272 GO:GO:0006029
GO:GO:0001958 GO:GO:0060351 GO:GO:0005201
GeneTree:ENSGT00660000095287 GO:GO:0005585 GO:GO:0060174
GO:GO:0030903 EMBL:AAEX03015088 EMBL:AAEX03015089
Ensembl:ENSCAFT00000014414 OMA:CPICPTE Uniprot:F1PS24
Length = 1489
Score = 128 (50.1 bits), Expect = 0.00017, P = 0.00017
Identities = 88/282 (31%), Positives = 102/282 (36%)
Query: 128 GSYGGATGNSENETSGRPVGQNAYEDGYGVP--QGH-GPPPSATTAGVVGAG--PNTSTS 182
G G A N E G P G G P +G GPP A AG GA P
Sbjct: 794 GPPGPAGANGEKGEVGPP-GPAGTAGARGAPGERGETGPPGPAGFAGPPGADGQPGAKGE 852
Query: 183 AYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAP--SYDPTKGPSYDPAKG---- 234
A Q G A P+GP G +GP G K + P + A G
Sbjct: 853 QGEAGQKGD---AGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFPGAAGRVGP 909
Query: 235 PGYDPTKGP-GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGY 293
PG + GP G G D +G RG S P R +Q GP GP
Sbjct: 910 PGSNGNPGPPGPPGPSGK--DGPKGA-----RGDSGPPGRAGDPGLQ-GP-----AGPPG 956
Query: 294 ETQRVPGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYDPSRGTGFD 350
E PG D GP + P + QRG G QRG+ G+ PS +P + G
Sbjct: 957 EKGE-PGDDGPSGP--DGPPGPQGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGK-QGAP 1012
Query: 351 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 392
GA P G V PP P G P R GS G P R
Sbjct: 1013 GASGDRGPPGPVGPPGLTGPSGE---PGREGS-PGADGPPGR 1050
Score = 125 (49.1 bits), Expect = 0.00037, P = 0.00037
Identities = 72/271 (26%), Positives = 92/271 (33%)
Query: 128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAA- 186
G G + G P G + G P GPP G G G N +
Sbjct: 130 GEQGPRGDRGDKGEKGAP-GPRGRDGEPGTPGNPGPPGPPGPPGPPGLGGNFAAQMAGGF 188
Query: 187 -TQSGTPMRAAYDIPRGP-GYEASKGPGYDASKAPSYDPTKGPSYDPA-KGPGYDPTKGP 243
++G P GP G GP A + G +P GP P P
Sbjct: 189 DEKAGGAQMGVMQGPMGPMGPRGPPGPA-GAPGPQGFQGNPGEPGEPGVSGP-MGPRGPP 246
Query: 244 GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRG-PGYETQR---- 297
G + G + +A + P RGP PQ G+ G P RG PG + +
Sbjct: 247 GPPGKPGDDGEAGK-PGKSGERGPP-GPQGARGFPGTPGLPGVKGHRGYPGLDGAKGEAG 304
Query: 298 VPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGA 356
PG + G E +P + RG PG +RG R P+ + G DG P A
Sbjct: 305 APGVKGESGSPGE-NGSPGPMGPRGLPG---ERG-----RTGPA-GAAGARGNDGQPGPA 354
Query: 357 APHGQVPPPLNNVPYGSATPPARSGSGQPRG 387
P G V P P P A G P G
Sbjct: 355 GPPGPVSPA--GGPGFPGAPGASQGEAGPTG 383
Score = 123 (48.4 bits), Expect = 0.00061, P = 0.00061
Identities = 82/280 (29%), Positives = 105/280 (37%)
Query: 124 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS 182
R GA GN P G + G G P G P S AG GA GP +
Sbjct: 335 RTGPAGAAGARGNDGQPGPAGPPGPVSPAGGPGFP-G-APGASQGEAGPTGARGPEGAQG 392
Query: 183 AYAATQS-GTPMRAAYDIPRG----PGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGP- 235
+ G+P A G PG + S G PG + AP + +GP P GP
Sbjct: 393 PRGEPGTPGSPGPAGASGNPGTDGIPGAKGSAGAPGI--AGAPGFPGPRGP---P--GPQ 445
Query: 236 GYDPTKGP-GYDAQKG-SNYDAQRGPNYDIH-RGPSYDPQRGLGYDMQRGPNYDMQ-RGP 291
G GP G + G + + ++GP + GP P G + +RG + GP
Sbjct: 446 GATGPLGPKGQTGEPGIAGFKGEQGPKGEPGPAGPQGAPGPA-GEEGKRGARGEPGGAGP 504
Query: 292 -GYETQR-VPGYDVQRG-PVYEAQRAPSYIP-QRGP-GYDLQRGQGYDMRRAPSYDPSRG 346
G +R PG RG P + P P +RGP G +G D R
Sbjct: 505 VGPPGERGAPG---NRGFPGQDGLAGPKGAPGERGPSGLAGPKGANGDPGRPGEPGLPGA 561
Query: 347 TGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP 385
G G P A P G+V P G PP G+ GQP
Sbjct: 562 RGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQP 601
Score = 122 (48.0 bits), Expect = 0.00078, P = 0.00078
Identities = 83/282 (29%), Positives = 106/282 (37%)
Query: 130 YGGATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNTSTSAYAATQ 188
+ GA G S+ E P G E G P+G G P S AG G P T A
Sbjct: 368 FPGAPGASQGEAG--PTGARGPEGAQG-PRGEPGTPGSPGPAGASG-NPGTDGIPGAKGS 423
Query: 189 SGTPMRAA---YDIPRGP-GYEASKGP----GYDASKA-PSYDPTKGPSYDPAKGPGYDP 239
+G P A + PRGP G + + GP G + +GP +P GP P
Sbjct: 424 AGAPGIAGAPGFPGPRGPPGPQGATGPLGPKGQTGEPGIAGFKGEQGPKGEP--GPA-GP 480
Query: 240 TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDP-QRGLGYDMQRG-PNYDMQRGP-GYETQ 296
PG ++G A+ P GP P +RG + RG P D GP G +
Sbjct: 481 QGAPGPAGEEGKR-GARGEPG---GAGPVGPPGERGAPGN--RGFPGQDGLAGPKGAPGE 534
Query: 297 RVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRG 355
R P GP A P + G PG G+ D PS G DG P
Sbjct: 535 RGPSG--LAGPK-GANGDPGRPGEPGLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGP 591
Query: 356 AAPHG-QVPPPLNNVP--YGSATPPARSGS-GQPRGGNPARR 393
P G + P + P G+ P ++G G P G P R
Sbjct: 592 PGPQGARGQPGVMGFPGPKGANGEPGKAGEKGLP--GAPGLR 631
>RGD|1309595 [details] [associations]
symbol:Taf15 "TAF15 RNA polymerase II, TATA box binding protein
(TBP)-associated factor" species:10116 "Rattus norvegicus"
[GO:0000166 "nucleotide binding" evidence=IEA] [GO:0003674
"molecular_function" evidence=ND] [GO:0003676 "nucleic acid
binding" evidence=IEA] [GO:0005575 "cellular_component"
evidence=ND] [GO:0005622 "intracellular" evidence=IEA] [GO:0008150
"biological_process" evidence=ND] [GO:0008270 "zinc ion binding"
evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358
PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360 SMART:SM00547
RGD:1309595 GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330
GO:GO:0003676 GO:GO:0005622 EMBL:AC119615 IPI:IPI00950713
PRIDE:F1M8P1 Ensembl:ENSRNOT00000014438 ArrayExpress:F1M8P1
Uniprot:F1M8P1
Length = 554
Score = 123 (48.4 bits), Expect = 0.00018, P = 0.00018
Identities = 72/237 (30%), Positives = 86/237 (36%)
Query: 124 RRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSA 183
RR + GG +G GR G+ Y G G QG G P V P+
Sbjct: 318 RRPEFMRGGGSGG------GRR-GRGGYR-GRGGFQGRGGDPK--NGDWVCPNPSCGNMN 367
Query: 184 YAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP 243
+A S P +G GY + + G D G G D + G
Sbjct: 368 FARRNSCNQCNEPRPEDSRPSGGDFRGRGYGGERG--FRGRGGRGGDRG-GYGADRSGG- 423
Query: 244 GYDAQK-GSNYDAQR-GPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGY 301
GY + G +Y A R G Y R Y RG GY RG +Y RG GY R GY
Sbjct: 424 GYGGDRSGGSYGADRSGGGYGGDRS-GYGGDRG-GYGGDRGGSYGGDRG-GYGGDR-GGY 479
Query: 302 DVQRGPVYEAQRAPSYIPQRGPGYDLQRGQG---YDMRRAPSYDPSRGTGFDGAPRG 355
RG Y R Y R Y RG G Y R+ Y RG G+ G RG
Sbjct: 480 GGDRGG-YGGDRG-GYGGDRRGAYGGDRGGGSGGYGGDRSGGYGGDRGGGY-GGDRG 533
>UNIPROTKB|F1SEN8 [details] [associations]
symbol:LDB3 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0030018 "Z disc" evidence=IEA] [GO:0008092
"cytoskeletal protein binding" evidence=IEA] [GO:0005856
"cytoskeleton" evidence=IEA] [GO:0005080 "protein kinase C binding"
evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
Pfam:PF00595 Pfam:PF00412 InterPro:IPR001478 InterPro:IPR001781
PROSITE:PS00478 PROSITE:PS50023 PROSITE:PS50106 SMART:SM00132
SMART:SM00228 GO:GO:0030018 GO:GO:0005856 GO:GO:0046872
GO:GO:0008270 Gene3D:2.10.110.10 SUPFAM:SSF50156 CTD:11155
OMA:CTSQATT InterPro:IPR006643 SMART:SM00735
GeneTree:ENSGT00700000104411 EMBL:CU468409 RefSeq:XP_003359314.1
UniGene:Ssc.97236 Ensembl:ENSSSCT00000011341 GeneID:100151883
KEGG:ssc:100151883 Uniprot:F1SEN8
Length = 715
Score = 124 (48.7 bits), Expect = 0.00020, P = 0.00020
Identities = 50/192 (26%), Positives = 69/192 (35%)
Query: 133 ATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGP----NTSTSAYAATQ 188
AT ++ S Y P P+A T A P T+A
Sbjct: 344 ATASAAAPASSPADSPRPQASAYSPAVATSPAPAAHTYSEAPAAPAPKPRVVTTASIRPS 403
Query: 189 SGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQ 248
P+ A+ P PG S P Y S AP+Y P+ P+Y P+ P Y P+ P Y+
Sbjct: 404 VYQPVPASTYSP-SPGANYSPTP-YTPSPAPAYTPSPAPTYSPSPAPAYTPSPAPSYNPT 461
Query: 249 KGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVPGYDVQ--- 304
S A+ S+ + G + RG P Y T + G V
Sbjct: 462 PYSGGPAESASRPPWVTDDSFSQKFAPGKSTTSISKQSLPRGAPAY-TPPLQGPQVSPLA 520
Query: 305 RGPVYEAQRAPS 316
RG V A+R P+
Sbjct: 521 RGTVQRAERFPA 532
>RGD|1311620 [details] [associations]
symbol:Zmiz1 "zinc finger, MIZ-type containing 1" species:10116
"Rattus norvegicus" [GO:0001570 "vasculogenesis" evidence=IEA;ISO]
[GO:0001701 "in utero embryonic development" evidence=IEA;ISO]
[GO:0003007 "heart morphogenesis" evidence=IEA;ISO] [GO:0007296
"vitellogenesis" evidence=IEA;ISO] [GO:0007569 "cell aging"
evidence=IEA;ISO] [GO:0008270 "zinc ion binding" evidence=IEA]
[GO:0045944 "positive regulation of transcription from RNA
polymerase II promoter" evidence=IEA;ISO] [GO:0048146 "positive
regulation of fibroblast proliferation" evidence=IEA;ISO]
[GO:0048589 "developmental growth" evidence=IEA;ISO] [GO:0048844
"artery morphogenesis" evidence=IEA;ISO] InterPro:IPR004181
Pfam:PF02891 PROSITE:PS51044 RGD:1311620 GO:GO:0008270
Gene3D:3.30.40.10 InterPro:IPR013083 GeneTree:ENSGT00550000074410
CTD:57178 OMA:MNQYGPM OrthoDB:EOG45MN70 EMBL:CH474067
IPI:IPI00364462 RefSeq:NP_001101863.1 UniGene:Rn.1712
Ensembl:ENSRNOT00000014004 GeneID:361103 KEGG:rno:361103
UCSC:RGD:1311620 NextBio:675228 Uniprot:D4AE97
Length = 1072
Score = 126 (49.4 bits), Expect = 0.00020, P = 0.00020
Identities = 66/233 (28%), Positives = 87/233 (37%)
Query: 176 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYD-ASKAPSYDPT--KGP--SYD 230
GP S+ TQ+ PRGP AS G + AS A P+ GP +
Sbjct: 318 GPVCSSFQMGPTQAYNSQFMNQPGPRGP---ASMGGSMNPASMAAGMTPSGMSGPPMGMN 374
Query: 231 PAKGPGYDP--TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ 288
+ PG P T G Q Q P I R +P G + Q GPN
Sbjct: 375 QPRPPGISPFGTHGQRMPQQTYPGPRPQSLPIQSIKRSYPGEPNYG---NQQYGPNSQFP 431
Query: 289 RGPG-YETQRVPGYDVQRGPVYEAQRAPSYIPQRG--PGYDLQRGQGYDMRRAPSYDP-- 343
PG Y T P P Y QR PS P G P + GQ Y + +
Sbjct: 432 TQPGQYPTPNPPR--PLTSPNYPGQRMPSQ-PSTGQYPPPTVNMGQYYKPEQFNGQNNTF 488
Query: 344 SRGTGFDGAPRGAAPHGQVPPPLNNVPY----GSATPPARSGSGQPRGGNPAR 392
S G+ + +G+ P P+ N P+ G+ TPP GS P +P++
Sbjct: 489 SSGSSYSSYSQGSVNRPPRPVPVANYPHSPVPGNPTPPMTPGSSIPPYLSPSQ 541
>UNIPROTKB|F1NI79 [details] [associations]
symbol:COL5A1 "Uncharacterized protein" species:9031
"Gallus gallus" [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
SMART:SM00038 Gene3D:2.60.120.200 InterPro:IPR008985
InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160
Pfam:PF01391 InterPro:IPR001791 Pfam:PF02210 GO:GO:0005201
SMART:SM00210 GeneTree:ENSGT00700000104155 EMBL:AADN02026433
EMBL:AADN02026434 EMBL:AADN02026427 EMBL:AADN02026428
EMBL:AADN02026429 EMBL:AADN02026430 EMBL:AADN02026431
EMBL:AADN02026432 IPI:IPI00602965 Ensembl:ENSGALT00000004020
ArrayExpress:F1NI79 Uniprot:F1NI79
Length = 1702
Score = 128 (50.1 bits), Expect = 0.00020, P = 0.00020
Identities = 75/249 (30%), Positives = 96/249 (38%)
Query: 145 PVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRG- 202
P+G + G P GP S G G AGP Q G P A +G
Sbjct: 930 PIGSPGLKGNEGPPGPPGPAGSPGERGPAGSAGP-IGLPGRPGPQ-GPPGPAG---EKGA 984
Query: 203 PGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRGP 259
PG + +GP G D + P P GP+ P PG D KG G QKGS D ++GP
Sbjct: 985 PGEKGPQGPAGRDGIQGPVGLP--GPA-GPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGP 1041
Query: 260 NYDIHRGPSYDPQRGLGYDMQRGP-NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYI 318
GP+ PQ +G Q GP D + GP + Q + G GP P +
Sbjct: 1042 P-----GPT-GPQGPIG---QPGPAGADGEPGPRGQ-QGLFGQKGDEGP-RGFPGPPGPV 1090
Query: 319 PQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPP 377
+G PG ++G+ D+ + P G G P P G N G P
Sbjct: 1091 GLQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGPPGADGPQGPAGGIGNPGAVGEKGEP 1150
Query: 378 ARSGS-GQP 385
SG G P
Sbjct: 1151 GESGEPGLP 1159
>UNIPROTKB|E1BF96 [details] [associations]
symbol:PPP1R10 "Uncharacterized protein" species:9913 "Bos
taurus" [GO:0072357 "PTW/PP1 phosphatase complex" evidence=IEA]
[GO:0000785 "chromatin" evidence=IEA] [GO:0008270 "zinc ion
binding" evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0003677 "DNA
binding" evidence=IEA] InterPro:IPR000571 InterPro:IPR003617
InterPro:IPR017923 Pfam:PF00642 Pfam:PF08711 PROSITE:PS50103
PROSITE:PS51319 SMART:SM00356 SMART:SM00509 GO:GO:0005634
GO:GO:0003677 GO:GO:0008270 GO:GO:0000785 GO:GO:0006351
Gene3D:1.20.930.10 SUPFAM:SSF47676 GO:GO:0072357 CTD:5514
OMA:PPPHEHR GeneTree:ENSGT00530000063820 EMBL:DAAA02055402
IPI:IPI00698425 RefSeq:NP_001137335.1 UniGene:Bt.27784
Ensembl:ENSBTAT00000009104 GeneID:510825 KEGG:bta:510825
NextBio:20869636 Uniprot:E1BF96
Length = 924
Score = 125 (49.1 bits), Expect = 0.00021, P = 0.00021
Identities = 71/271 (26%), Positives = 87/271 (32%)
Query: 128 GSYGGATGNSENETS-GRPV-GQNAYEDGYGVPQGH---GPPPSATTAGVVGAGPNTSTS 182
G GG G G P+ G + G G P G GPPP GP
Sbjct: 631 GGPGGPKGMQHFPPGPGGPMPGPHGGPGGPGGPVGPRLLGPPPPPRGGDPFWDGPGDP-- 688
Query: 183 AYAATQSGTPMRAAYDIPRGPG-YEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK 241
G PMR P GPG Y +G P P +G + G +
Sbjct: 689 -----MRGGPMRGGPG-P-GPGPYHRGRGGRGGNEPPPPPPPFRGARGGRSGGGPPNGRG 741
Query: 242 GPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGY 301
GPG G + GP ++ G + P G G M G + GPG G+
Sbjct: 742 GPGGGMVGGGGHRPHEGPGGGMNSGSGHRPHEGPGSGM--GGGHRPHEGPGGSMGG--GH 797
Query: 302 DVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQ 361
GP + P GPG + G G+ P G G G P G PH
Sbjct: 798 RPHEGPGGGMGGGSGHRPHEGPGGGMGAGGGHR--------PHEGPGH-GGPHGHRPH-D 847
Query: 362 VPPPLNNVPYGSATPPARSGSGQPRGGNPAR 392
VP + G R G GG R
Sbjct: 848 VPGHRGHDHRGPPPHEHRGHDGPGHGGGGHR 878
Score = 121 (47.7 bits), Expect = 0.00058, P = 0.00058
Identities = 49/192 (25%), Positives = 68/192 (35%)
Query: 132 GATGNSENETSGRPV-GQNAYEDGYGVPQGHGPPPSATTAGV-------VGAGPNTSTSA 183
G G +E P G G G P G G P G G G N+ +
Sbjct: 710 GGRGGNEPPPPPPPFRGARGGRSGGGPPNGRGGPGGGMVGGGGHRPHEGPGGGMNSGSGH 769
Query: 184 YAATQSGTPMRAAYDIPRGPG------YEASKGPGYDASKAPSYDPTKGPSYDPAKGPGY 237
G+ M + GPG + +GPG + P +GP G G+
Sbjct: 770 RPHEGPGSGMGGGHRPHEGPGGSMGGGHRPHEGPGGGMGGGSGHRPHEGPGGGMGAGGGH 829
Query: 238 DPTKGPGYDAQKGSN-YDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQ 296
P +GPG+ G +D +D HRGP P G+D GP + G++
Sbjct: 830 RPHEGPGHGGPHGHRPHDVPGHRGHD-HRGPP--PHEHRGHD---GPGHGGGGHRGHDGG 883
Query: 297 RVPGYDVQRGPV 308
G D+ PV
Sbjct: 884 HSHGGDMSNRPV 895
>UNIPROTKB|F1NR01 [details] [associations]
symbol:COL5A1 "Uncharacterized protein" species:9031
"Gallus gallus" [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
SMART:SM00038 Gene3D:2.60.120.200 InterPro:IPR008985
InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160
Pfam:PF01391 InterPro:IPR001791 SMART:SM00282 Pfam:PF02210
GO:GO:0005201 SMART:SM00210 GeneTree:ENSGT00700000104155
EMBL:AADN02026433 EMBL:AADN02026434 EMBL:AADN02026427
EMBL:AADN02026428 EMBL:AADN02026429 EMBL:AADN02026430
EMBL:AADN02026431 EMBL:AADN02026432 IPI:IPI00822317
Ensembl:ENSGALT00000039037 ArrayExpress:F1NR01 Uniprot:F1NR01
Length = 1773
Score = 128 (50.1 bits), Expect = 0.00021, P = 0.00021
Identities = 75/249 (30%), Positives = 96/249 (38%)
Query: 145 PVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRG- 202
P+G + G P GP S G G AGP Q G P A +G
Sbjct: 1001 PIGSPGLKGNEGPPGPPGPAGSPGERGPAGSAGP-IGLPGRPGPQ-GPPGPAG---EKGA 1055
Query: 203 PGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRGP 259
PG + +GP G D + P P GP+ P PG D KG G QKGS D ++GP
Sbjct: 1056 PGEKGPQGPAGRDGIQGPVGLP--GPA-GPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGP 1112
Query: 260 NYDIHRGPSYDPQRGLGYDMQRGP-NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYI 318
GP+ PQ +G Q GP D + GP + Q + G GP P +
Sbjct: 1113 P-----GPT-GPQGPIG---QPGPAGADGEPGPRGQ-QGLFGQKGDEGP-RGFPGPPGPV 1161
Query: 319 PQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPP 377
+G PG ++G+ D+ + P G G P P G N G P
Sbjct: 1162 GLQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGPPGADGPQGPAGGIGNPGAVGEKGEP 1221
Query: 378 ARSGS-GQP 385
SG G P
Sbjct: 1222 GESGEPGLP 1230
>ZFIN|ZDB-GENE-030707-4 [details] [associations]
symbol:anxa11a "annexin A11a" species:7955 "Danio
rerio" [GO:0005509 "calcium ion binding" evidence=IEA] [GO:0005544
"calcium-dependent phospholipid binding" evidence=IEA]
InterPro:IPR001464 InterPro:IPR008157 InterPro:IPR018252
InterPro:IPR018502 Pfam:PF00191 PRINTS:PR00196 PROSITE:PS00223
SMART:SM00335 ZFIN:ZDB-GENE-030707-4 GO:GO:0005509 eggNOG:NOG267770
GO:GO:0005544 Gene3D:1.10.220.10 PANTHER:PTHR10502 SUPFAM:SSF47874
HOVERGEN:HBG061815 PANTHER:PTHR10502:SF29 HSSP:P79134 EMBL:AY178801
IPI:IPI00498021 UniGene:Dr.77310 ProteinModelPortal:Q804G4
SMR:Q804G4 PRIDE:Q804G4 InParanoid:Q804G4 NextBio:20812811
ArrayExpress:Q804G4 Bgee:Q804G4 Uniprot:Q804G4
Length = 526
Score = 122 (48.0 bits), Expect = 0.00021, P = 0.00021
Identities = 58/201 (28%), Positives = 73/201 (36%)
Query: 190 GTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQK 249
G P ++ Y P+G GY PG A Y P G Y P G GY P G Y Q
Sbjct: 5 GYPPQSGYP-PQGGGYPPQ--PGAYPPAAGGYPPQPG-MYPPQAG-GYPPQPG-AYPPQP 58
Query: 250 GSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVY 309
G+ + Q G + G P +G D P ++ G Q P
Sbjct: 59 GA-FPGQPGQYPSVPSGGWGAP---IGLDNLPNPGFNASNIQGMANQFAADGGFAPNPSM 114
Query: 310 EAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNV 369
+ P PQ G + Q Y M P + G G P G P GQ P N+
Sbjct: 115 FSGGYPG--PQPGGPPAVSPNQPYGMYPQPGGGMPQNPGM-GYP-GGPPPGQQMPSYPNI 170
Query: 370 PYGSATPPARSGSGQPRGGNP 390
P + TP SG PR +P
Sbjct: 171 P--APTP---SGPSYPRAPSP 186
Score = 116 (45.9 bits), Expect = 0.00098, P = 0.00098
Identities = 63/215 (29%), Positives = 77/215 (35%)
Query: 156 GVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDA 215
G P G PP G G AY G P + P+ GY G
Sbjct: 5 GYPPQSGYPPQ-------GGGYPPQPGAYPPAAGGYPPQPGMYPPQAGGYPPQPGAYPPQ 57
Query: 216 SKAPSYDPTKGPSYDPAKG---P-GYDPTKGPGYDAQK----GSNYDAQRG--PNYDIHR 265
A P + PS P+ G P G D PG++A + + A G PN +
Sbjct: 58 PGAFPGQPGQYPSV-PSGGWGAPIGLDNLPNPGFNASNIQGMANQFAADGGFAPNPSMFS 116
Query: 266 GPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSY--IPQRGP 323
G PQ G + Y M PG + PG GP Q+ PSY IP P
Sbjct: 117 GGYPGPQPGGPPAVSPNQPYGMYPQPGGGMPQNPGMGYPGGPP-PGQQMPSYPNIPAPTP 175
Query: 324 GYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAP 358
G Y RAPS +PS G+ G G AP
Sbjct: 176 S-----GPSYP--RAPSPNPSM-PGYGGGYGGGAP 202
>UNIPROTKB|F1NR03 [details] [associations]
symbol:COL5A1 "Uncharacterized protein" species:9031
"Gallus gallus" [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
SMART:SM00038 Gene3D:2.60.120.200 InterPro:IPR008985
InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0005581 InterPro:IPR008160
Pfam:PF01391 InterPro:IPR001791 SMART:SM00282 Pfam:PF02210
GO:GO:0005201 SMART:SM00210 GeneTree:ENSGT00700000104155
EMBL:AADN02026433 EMBL:AADN02026434 EMBL:AADN02026427
EMBL:AADN02026428 EMBL:AADN02026429 EMBL:AADN02026430
EMBL:AADN02026431 EMBL:AADN02026432 IPI:IPI00818113
Ensembl:ENSGALT00000039034 ArrayExpress:F1NR03 Uniprot:F1NR03
Length = 1804
Score = 128 (50.1 bits), Expect = 0.00022, P = 0.00022
Identities = 75/249 (30%), Positives = 96/249 (38%)
Query: 145 PVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRG- 202
P+G + G P GP S G G AGP Q G P A +G
Sbjct: 1032 PIGSPGLKGNEGPPGPPGPAGSPGERGPAGSAGP-IGLPGRPGPQ-GPPGPAG---EKGA 1086
Query: 203 PGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRGP 259
PG + +GP G D + P P GP+ P PG D KG G QKGS D ++GP
Sbjct: 1087 PGEKGPQGPAGRDGIQGPVGLP--GPA-GPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGP 1143
Query: 260 NYDIHRGPSYDPQRGLGYDMQRGP-NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYI 318
GP+ PQ +G Q GP D + GP + Q + G GP P +
Sbjct: 1144 P-----GPT-GPQGPIG---QPGPAGADGEPGPRGQ-QGLFGQKGDEGP-RGFPGPPGPV 1192
Query: 319 PQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPP 377
+G PG ++G+ D+ + P G G P P G N G P
Sbjct: 1193 GLQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGPPGADGPQGPAGGIGNPGAVGEKGEP 1252
Query: 378 ARSGS-GQP 385
SG G P
Sbjct: 1253 GESGEPGLP 1261
>UNIPROTKB|F1NR02 [details] [associations]
symbol:COL5A1 "Uncharacterized protein" species:9031
"Gallus gallus" [GO:0001568 "blood vessel development"
evidence=IEA] [GO:0003007 "heart morphogenesis" evidence=IEA]
[GO:0005201 "extracellular matrix structural constituent"
evidence=IEA] [GO:0005588 "collagen type V" evidence=IEA]
[GO:0005604 "basement membrane" evidence=IEA] [GO:0007155 "cell
adhesion" evidence=IEA] [GO:0008201 "heparin binding" evidence=IEA]
[GO:0030199 "collagen fibril organization" evidence=IEA]
[GO:0032964 "collagen biosynthetic process" evidence=IEA]
[GO:0035313 "wound healing, spreading of epidermal cells"
evidence=IEA] [GO:0043206 "extracellular fibril organization"
evidence=IEA] [GO:0043394 "proteoglycan binding" evidence=IEA]
[GO:0043588 "skin development" evidence=IEA] [GO:0045112 "integrin
biosynthetic process" evidence=IEA] [GO:0048407 "platelet-derived
growth factor binding" evidence=IEA] [GO:0048592 "eye
morphogenesis" evidence=IEA] [GO:0051128 "regulation of cellular
component organization" evidence=IEA] InterPro:IPR000885
Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461 SMART:SM00038
GO:GO:0030199 GO:GO:0008201 GO:GO:0007155 Gene3D:2.60.120.200
InterPro:IPR008985 InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0035313
InterPro:IPR008160 Pfam:PF01391 InterPro:IPR001791 SMART:SM00282
GO:GO:0005604 GO:GO:0043206 Pfam:PF02210 GO:GO:0005201 OMA:TIYEGIG
GO:GO:0005588 GO:GO:0045112 GO:GO:0051128 SMART:SM00210
GeneTree:ENSGT00700000104155 EMBL:AADN02026433 EMBL:AADN02026434
EMBL:AADN02026427 EMBL:AADN02026428 EMBL:AADN02026429
EMBL:AADN02026430 EMBL:AADN02026431 EMBL:AADN02026432
IPI:IPI00821684 Ensembl:ENSGALT00000039035 ArrayExpress:F1NR02
Uniprot:F1NR02
Length = 1815
Score = 128 (50.1 bits), Expect = 0.00022, P = 0.00022
Identities = 75/249 (30%), Positives = 96/249 (38%)
Query: 145 PVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSGTPMRAAYDIPRG- 202
P+G + G P GP S G G AGP Q G P A +G
Sbjct: 1043 PIGSPGLKGNEGPPGPPGPAGSPGERGPAGSAGP-IGLPGRPGPQ-GPPGPAG---EKGA 1097
Query: 203 PGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYD-AQRGP 259
PG + +GP G D + P P GP+ P PG D KG G QKGS D ++GP
Sbjct: 1098 PGEKGPQGPAGRDGIQGPVGLP--GPA-GPVGPPGEDGDKGEIGEPGQKGSKGDKGEQGP 1154
Query: 260 NYDIHRGPSYDPQRGLGYDMQRGP-NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYI 318
GP+ PQ +G Q GP D + GP + Q + G GP P +
Sbjct: 1155 P-----GPT-GPQGPIG---QPGPAGADGEPGPRGQ-QGLFGQKGDEGP-RGFPGPPGPV 1203
Query: 319 PQRG-PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPP 377
+G PG ++G+ D+ + P G G P P G N G P
Sbjct: 1204 GLQGLPGPPGEKGETGDVGQMGPPGPPGPRGPSGPPGADGPQGPAGGIGNPGAVGEKGEP 1263
Query: 378 ARSGS-GQP 385
SG G P
Sbjct: 1264 GESGEPGLP 1272
>UNIPROTKB|E9PQW6 [details] [associations]
symbol:ARID1A "AT-rich interactive domain-containing
protein 1A" species:9606 "Homo sapiens" [GO:0006325 "chromatin
organization" evidence=IEA] [GO:0016514 "SWI/SNF complex"
evidence=IEA] [GO:0071564 "npBAF complex" evidence=IEA] [GO:0071565
"nBAF complex" evidence=IEA] EMBL:AL034380 GO:GO:0016514
EMBL:AL512408 HGNC:HGNC:11110 ChiTaRS:ARID1A GO:GO:0006325
IPI:IPI00979164 Ensembl:ENST00000524572 ArrayExpress:E9PQW6
Bgee:E9PQW6 Uniprot:E9PQW6
Length = 123
Score = 98 (39.6 bits), Expect = 0.00024, P = 0.00024
Identities = 36/108 (33%), Positives = 47/108 (43%)
Query: 229 YDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ 288
Y +GP P +G GY Q + QR P +G + GL Y Q P Y Q
Sbjct: 18 YSQQQGPPSGPQQGHGYPGQPYGSQTPQRYPM--TMQGRAQSAMGGLSYTQQIPP-YG-Q 73
Query: 289 RGP-GYETQ-RVPGYDVQ------RGPVYEAQRAPSYIPQRGPGYDLQ 328
+GP GY Q + P Y+ Q + P Y +Q+ PS P P Y Q
Sbjct: 74 QGPSGYGQQGQTPYYNQQSPHPQQQQPPY-SQQPPSQTPHAQPSYQQQ 120
>ZFIN|ZDB-GENE-030707-5 [details] [associations]
symbol:anxa11b "annexin A11b" species:7955 "Danio
rerio" [GO:0005509 "calcium ion binding" evidence=IEA] [GO:0005544
"calcium-dependent phospholipid binding" evidence=IEA]
InterPro:IPR001464 InterPro:IPR008157 InterPro:IPR018252
InterPro:IPR018502 Pfam:PF00191 PRINTS:PR00196 PROSITE:PS00223
SMART:SM00335 ZFIN:ZDB-GENE-030707-5 GO:GO:0005509 eggNOG:NOG267770
GO:GO:0005544 Gene3D:1.10.220.10 PANTHER:PTHR10502 SUPFAM:SSF47874
HOGENOM:HOG000158803 HOVERGEN:HBG061815 PANTHER:PTHR10502:SF29
OrthoDB:EOG4Z0B60 InterPro:IPR013286 PRINTS:PR01871 HSSP:P79134
EMBL:BC068366 EMBL:AY178802 IPI:IPI00484212 RefSeq:NP_861431.1
UniGene:Dr.76267 SMR:Q804G3 STRING:Q804G3 GeneID:353365
KEGG:dre:353365 CTD:353365 NextBio:20812741 Uniprot:Q804G3
Length = 485
Score = 121 (47.7 bits), Expect = 0.00024, P = 0.00024
Identities = 59/175 (33%), Positives = 71/175 (40%)
Query: 219 PSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYD 278
P Y P G SY PA GP P G Y Q G+ Y Q G Y G ++ PQ G +
Sbjct: 4 PGYPPAGG-SYPPASGPYQQPAAG--YPPQPGA-YPPQAG-YYPPQPG-AFPPQPG-AFP 56
Query: 279 MQRG--P---NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSY--IPQRG-----PGYD 326
Q G P Y Q G GY G+ Q G Y A + +Y +P G PG+
Sbjct: 57 PQPGAFPPGAGYPPQAG-GYPAAPGGGFPPQAGG-YPAAQPGAYPNMPAAGGWGGHPGFG 114
Query: 327 LQRG---QGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPA 378
G QGY AP P + GAP P+ +P P G TPPA
Sbjct: 115 APAGGMPQGYPGVPAPGQQPM--PAYPGAP---VPNPGMPGYGGGAPTGP-TPPA 163
>UNIPROTKB|P02812 [details] [associations]
symbol:PRB2 "Basic salivary proline-rich protein 2"
species:9606 "Homo sapiens" [GO:0005576 "extracellular region"
evidence=IEA] [GO:0003674 "molecular_function" evidence=ND]
[GO:0005575 "cellular_component" evidence=ND] [GO:0008150
"biological_process" evidence=ND] GO:GO:0005576 EMBL:AC078950
EMBL:BX484538 EMBL:S80905 EMBL:K03208 IPI:IPI00552432 PIR:B40750
PIR:E25372 UniGene:Hs.654486 STRING:P02812 DMDM:160409933
PaxDb:P02812 PRIDE:P02812 Ensembl:ENST00000389362 UCSC:uc010shk.1
GeneCards:GC12M011544 HGNC:HGNC:9338 MIM:168810 neXtProt:NX_P02812
ArrayExpress:P02812 Bgee:P02812 CleanEx:HS_PRB2
Genevestigator:P02812 GermOnline:ENSG00000173342 InterPro:IPR026086
PANTHER:PTHR23203 Uniprot:P02812
Length = 416
Score = 120 (47.3 bits), Expect = 0.00025, P = 0.00025
Identities = 69/257 (26%), Positives = 88/257 (34%)
Query: 142 SGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYA--ATQSGTPMRAAYDI 199
+G P Q A G PQG P P G G N G P + +
Sbjct: 33 AGNP--QGAPPQGGNKPQGP-PSPPGKPQGPPPQGGNQPQGPPPPPGKPQGPPPQGG-NK 88
Query: 200 PRGP---GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQ 256
P+GP G P D S++P P K P P +G G P +GP K Q
Sbjct: 89 PQGPPPPGKPQGPPPQGDKSRSPRSPPGK-PQGPPPQG-GNQP-QGPPPPPGKPQGPPPQ 145
Query: 257 RGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPS 316
G +GP P + G Q R P + Q P Q G + P
Sbjct: 146 GGNK---PQGPP-PPGKPQGPPPQGDNKSRSSRSPPGKPQGPPP---QGGNQPQGPPPPP 198
Query: 317 YIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLN-NVPYGSAT 375
PQ P + QG P P +G + R Q PPP N P G
Sbjct: 199 GKPQGPPPQGGNKPQGPPPPGKPQGPPPQGDNKSQSARSPPGKPQGPPPQGGNQPQGPPP 258
Query: 376 PPARSGSGQPRGGNPAR 392
PP + P+GGN ++
Sbjct: 259 PPGKPQGPPPQGGNKSQ 275
Score = 118 (46.6 bits), Expect = 0.00041, P = 0.00041
Identities = 76/272 (27%), Positives = 99/272 (36%)
Query: 135 GNSENETSGRPVG--QNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTP 192
G++++ +S P G Q G PQG PPP G P G P
Sbjct: 166 GDNKSRSSRSPPGKPQGPPPQGGNQPQGPPPPPGKPQ----GPPPQGGNKPQGPPPPGKP 221
Query: 193 MRAAYDIPRGPGY-EASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGS 251
P+G ++++ P K P P +G + P +GP P K G Q G+
Sbjct: 222 QGPP---PQGDNKSQSARSP---PGK-PQGPPPQGGN-QP-QGPPPPPGKPQGPPPQGGN 272
Query: 252 NYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP---GYETQRVPGYDVQ-RGP 307
+Q P +GP PQ G R P Q P G + Q P + +GP
Sbjct: 273 K--SQGPPPPGKPQGPP--PQGGSKSRSSRSPPGKPQGPPPQGGNQPQGPPPPPGKPQGP 328
Query: 308 VYEAQRAPSYIPQRG-P-GYDLQRGQGYDMRRAPSYDPSRGTGFDGAPR--GAAPHGQVP 363
+ P P G P G Q G R+P P G P+ G P G P
Sbjct: 329 PPQGGNKPQGPPPPGKPQGPPPQGGSKSRSARSPPGKPQ------GPPQQEGNNPQGP-P 381
Query: 364 PPLNNVPYGSATPPARSGSGQPR---GGNPAR 392
PP P PPA G PR GG P+R
Sbjct: 382 PPAGGNPQQPQAPPAGQPQGPPRPPQGGRPSR 413
>DICTYBASE|DDB_G0279193 [details] [associations]
symbol:rpb1 "RNA polymerase II core subunit"
species:44689 "Dictyostelium discoideum" [GO:0005634 "nucleus"
evidence=IEA;IDA] [GO:0006366 "transcription from RNA polymerase II
promoter" evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
evidence=IEA] [GO:0005665 "DNA-directed RNA polymerase II, core
complex" evidence=IEA;ISS] [GO:0003899 "DNA-directed RNA polymerase
activity" evidence=IEA;ISS] [GO:0003677 "DNA binding" evidence=IEA]
[GO:0006355 "regulation of transcription, DNA-dependent"
evidence=ISS] [GO:0046872 "metal ion binding" evidence=IEA]
[GO:0016779 "nucleotidyltransferase activity" evidence=IEA]
[GO:0016740 "transferase activity" evidence=IEA] [GO:0044351
"macropinocytosis" evidence=RCA] InterPro:IPR000684
InterPro:IPR000722 InterPro:IPR006592 InterPro:IPR007066
InterPro:IPR007073 InterPro:IPR007075 InterPro:IPR007080
InterPro:IPR007081 InterPro:IPR007083 Pfam:PF00623 Pfam:PF04983
Pfam:PF04990 Pfam:PF04992 Pfam:PF04997 Pfam:PF04998 Pfam:PF05000
Pfam:PF05001 PROSITE:PS00115 SMART:SM00663 dictyBase:DDB_G0279193
GO:GO:0006355 GenomeReviews:CM000152_GR GO:GO:0046872 GO:GO:0003677
GO:GO:0006366 Gene3D:2.40.40.20 InterPro:IPR009010
EMBL:AAFI02000030 GO:GO:0003899 eggNOG:COG0086 GO:GO:0005665
OMA:KVLPWST EMBL:S52651 PIR:A56823 RefSeq:XP_641735.1 STRING:P35084
PRIDE:P35084 EnsemblProtists:DDB0215406 GeneID:8621932
KEGG:ddi:DDB_G0279193 KO:K03006 ProtClustDB:CLSZ2428993
Uniprot:P35084
Length = 1727
Score = 135 (52.6 bits), Expect = 0.00025, Sum P(2) = 0.00025
Identities = 65/219 (29%), Positives = 85/219 (38%)
Query: 177 PNTSTSAYA-ATQSGTPMRAAYDIPRGPGYEASKG---------PGYDASKA--PSYDP- 223
P + T +Y+ S TP YD P P E +G PGY+A+K+ SY
Sbjct: 1488 PGSQTPSYSYGDGSTTPFHNPYDAPLSPFNETFRGDFSPSAMNSPGYNANKSYGSSYQYF 1547
Query: 224 TKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGP 283
+ P+Y P P Y PT P Y S Y + P+Y PSY P Y P
Sbjct: 1548 PQSPTYSPTS-PSYSPTS-PSYSPTSPS-Y-SPTSPSYS-PTSPSYSPTSP-SYS-PTSP 1600
Query: 284 NYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDP 343
Y P Y P Y P Y + +PSY P P Y Y +PSY P
Sbjct: 1601 FYS-PTSPSYSPTS-PSYS-PTSPSY-SPTSPSYSPT-SPSYS-PTSPSYSPT-SPSYSP 1653
Query: 344 SRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS 382
+ + +P +P P + P S T P+ S S
Sbjct: 1654 TSPSYSPTSP-SYSPTSPSYSPTS--PSYSPTSPSYSPS 1689
Score = 40 (19.1 bits), Expect = 0.00025, Sum P(2) = 0.00025
Identities = 12/43 (27%), Positives = 20/43 (46%)
Query: 85 KKFYNDHLESLQVMEKNYITMATEVEKLRAELMNAPNVDRRAD 127
+K +N ++ +V + N + E+EKL A L D D
Sbjct: 978 QKLFN--IDIRRVSDLNPAVVVLEIEKLVARLKIIATADTTED 1018
>UNIPROTKB|F1Q0F7 [details] [associations]
symbol:COL4A5 "Collagen alpha-5(IV) chain" species:9615
"Canis lupus familiaris" [GO:0005581 "collagen" evidence=IEA]
[GO:0005201 "extracellular matrix structural constituent"
evidence=IEA] InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403
SMART:SM00111 InterPro:IPR016187 SUPFAM:SSF56436 GO:GO:0005581
InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 Gene3D:2.170.240.10
GeneTree:ENSGT00690000101772 EMBL:AAEX03026757 EMBL:AAEX03026761
EMBL:AAEX03026758 EMBL:AAEX03026759 EMBL:AAEX03026760
Ensembl:ENSCAFT00000018078 Uniprot:F1Q0F7
Length = 1678
Score = 127 (49.8 bits), Expect = 0.00026, P = 0.00026
Identities = 59/197 (29%), Positives = 72/197 (36%)
Query: 200 PRGPGYEASKGP--GYDASKAPSYDPTK-G-PSYDPAKG-PGYDPTKG-PGYDAQKGSNY 253
P PG GP G K +P K G P D G PG G PGY + G
Sbjct: 269 PGPPGIRGPPGPPGGMKGEKGEQGEPGKRGKPGKDGENGQPGIPGLPGDPGYPGEPGR-- 326
Query: 254 DAQRGPNYDI-HRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQR-VPGYDVQRG-PVYE 310
D ++G DI GP + G G + N + PG + +R PG G P
Sbjct: 327 DGEKGQKGDIGSTGPPGLSKPGTGVTVGEKGNMGLPGLPGEKGERGFPGIQGPPGLPGPP 386
Query: 311 AQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVP 370
P PG+ +RGQ D P G DG P G PP P
Sbjct: 387 VLGTAVMGPPGPPGFPGERGQKGD-EGPPGISIPGFPGLDGQPGAPGLRGPPGPP---GP 442
Query: 371 YGSATPPARSGSGQPRG 387
+ S +PP GS RG
Sbjct: 443 HISPSPPGPPGSPGDRG 459
Score = 122 (48.0 bits), Expect = 0.00090, P = 0.00090
Identities = 80/270 (29%), Positives = 98/270 (36%)
Query: 132 GATG-NSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTSAYAATQS 189
G G N G P G+ G G P GPP G G GP A Q
Sbjct: 1131 GPKGINGPPGNPGLP-GEPGPVGGGGRPGPPGPPGEKGNPGQDGIPGP--------AGQK 1181
Query: 190 GTPMRAAYDIPRGPGYEASKGPGYDASKA--PSYDPTKGPSYDPAKGPGYDPTKGPGYDA 247
G P + + IP PG G D P GP +P G+ +GP
Sbjct: 1182 GEPGQPGFGIPGPPGLPGLSGQKGDGGLPGIPGNPGLPGPKGEPGF-QGFPGVQGP--PG 1238
Query: 248 QKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRG-PG---YETQRV-PGY 301
GS A GP + GP P R + Y QRG P + RG PG + +R PG
Sbjct: 1239 PPGSPGPALEGPKGN--PGPQGPPGRPV-YTFQRGLPGPEGPRGLPGNGGIKGERGNPGQ 1295
Query: 302 DVQRG-PVYEAQRAPSYIPQRGPGYDLQRGQGYD--MRRAPSYDPSRGTGFDGAPRGAAP 358
Q G P + + P I Q PG G D + P + +G G P A P
Sbjct: 1296 PGQPGLPGLKGDQGPPGI-QGNPGRPGLNGMKGDPGLPGVPGFPGMKGPS--GVPGSAGP 1352
Query: 359 HGQ---VPPPLNNVPYGSATPPARSG-SGQ 384
G V PP+ + PP G SGQ
Sbjct: 1353 EGDPGLVGPPV--CMFCILGPPGLPGPSGQ 1380
>UNIPROTKB|F1PHY1 [details] [associations]
symbol:COL1A2 "Collagen alpha-2(I) chain" species:9615
"Canis lupus familiaris" [GO:0071230 "cellular response to amino
acid stimulus" evidence=IEA] [GO:0070208 "protein
heterotrimerization" evidence=IEA] [GO:0048407 "platelet-derived
growth factor binding" evidence=IEA] [GO:0046332 "SMAD binding"
evidence=IEA] [GO:0043589 "skin morphogenesis" evidence=IEA]
[GO:0042802 "identical protein binding" evidence=IEA] [GO:0030674
"protein binding, bridging" evidence=IEA] [GO:0030199 "collagen
fibril organization" evidence=IEA] [GO:0008217 "regulation of blood
pressure" evidence=IEA] [GO:0007266 "Rho protein signal
transduction" evidence=IEA] [GO:0007179 "transforming growth factor
beta receptor signaling pathway" evidence=IEA] [GO:0005615
"extracellular space" evidence=IEA] [GO:0005584 "collagen type I"
evidence=IEA] [GO:0001568 "blood vessel development" evidence=IEA]
[GO:0001501 "skeletal system development" evidence=IEA] [GO:0005201
"extracellular matrix structural constituent" evidence=IEA]
InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
SMART:SM00038 GO:GO:0005615 GO:GO:0030199 GO:GO:0001501
GO:GO:0008217 GO:GO:0007179 GO:GO:0007266 GO:GO:0070208
InterPro:IPR008160 Pfam:PF01391 GO:GO:0001568 GO:GO:0071230
GO:GO:0005201 GO:GO:0043589 GeneTree:ENSGT00660000095287
GO:GO:0005584 OMA:TGPIGSA EMBL:AAEX03009315
Ensembl:ENSCAFT00000031580 Uniprot:F1PHY1
Length = 1366
Score = 126 (49.4 bits), Expect = 0.00026, P = 0.00026
Identities = 83/261 (31%), Positives = 99/261 (37%)
Query: 156 GVPQGHGPPPSATTAGVVGA----G-PNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASK 209
G+P G P AG GA G P + S + G P A P GP G E +
Sbjct: 322 GLPGPRGIPGPVGAAGATGARGLVGEPGPAGSKGESGNKGEPGSAGAQGPPGPSGEEGKR 381
Query: 210 GPGYDASKA-PSYDPTKGPSYDP-AKG-PGYDPTKGP-GYDAQKGSNYDAQ-RGPNYDIH 264
GP +A A PS P G P ++G PG D G G +G+ A RGPN D
Sbjct: 382 GPNGEAGSAGPSGPP--GLRGSPGSRGLPGADGRAGVMGPPGPRGATGPAGVRGPNGDSG 439
Query: 265 RGPSYDPQRGLGYDMQRG-PNYDMQRGP-GYE-TQRVPGYDVQRGPVYEA--QRAPSYIP 319
R P +P G RG P GP G E +PG D + GP+ A + P I
Sbjct: 440 R-PG-EP----GLMGPRGFPGAPGNVGPAGKEGPMGLPGIDGRPGPIGPAGARGEPGNIG 493
Query: 320 QRGP-GYDLQRGQGYDMRRAPSYDPSRGT-GFDGAPRGAAPHGQVPPPLNNVPYGSATPP 377
GP G G+ D A +RG G DG P G G A PP
Sbjct: 494 FPGPKGPTGDPGKNGDKGHA-GLAGARGAPGPDGNNGAQGPPGPQGVQGGKGEQGPAGPP 552
Query: 378 ARSGSGQPRG-----GNPARR 393
G P G G P R
Sbjct: 553 GFQGLPGPAGTAGEVGKPGER 573
>RGD|61817 [details] [associations]
symbol:Col1a1 "collagen, type I, alpha 1" species:10116 "Rattus
norvegicus" [GO:0001501 "skeletal system development" evidence=ISO]
[GO:0001503 "ossification" evidence=IEP] [GO:0001568 "blood vessel
development" evidence=IEA;ISO] [GO:0001649 "osteoblast
differentiation" evidence=IEA] [GO:0001957 "intramembranous
ossification" evidence=IEA;ISO] [GO:0001958 "endochondral
ossification" evidence=IEA;ISO] [GO:0003674 "molecular_function"
evidence=ND] [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA;ISO] [GO:0005578 "proteinaceous
extracellular matrix" evidence=ISO] [GO:0005581 "collagen"
evidence=ISO] [GO:0005584 "collagen type I" evidence=IEA;ISO]
[GO:0005615 "extracellular space" evidence=ISO;IDA] [GO:0005737
"cytoplasm" evidence=IEA;ISO] [GO:0007584 "response to nutrient"
evidence=IEP] [GO:0007601 "visual perception" evidence=IEA;ISO]
[GO:0007605 "sensory perception of sound" evidence=IEA;ISO]
[GO:0009612 "response to mechanical stimulus" evidence=IEP]
[GO:0010035 "response to inorganic substance" evidence=IEP]
[GO:0010718 "positive regulation of epithelial to mesenchymal
transition" evidence=IEA;ISO] [GO:0010812 "negative regulation of
cell-substrate adhesion" evidence=IEA;ISO] [GO:0015031 "protein
transport" evidence=IEA;ISO] [GO:0030199 "collagen fibril
organization" evidence=IEA;ISO] [GO:0030335 "positive regulation of
cell migration" evidence=IEA;ISO] [GO:0031012 "extracellular matrix"
evidence=ISO] [GO:0031960 "response to corticosteroid stimulus"
evidence=IEP] [GO:0032964 "collagen biosynthetic process"
evidence=IEA;ISO] [GO:0034504 "protein localization to nucleus"
evidence=IEA;ISO] [GO:0034505 "tooth mineralization"
evidence=IEA;ISO] [GO:0042060 "wound healing" evidence=IMP]
[GO:0042542 "response to hydrogen peroxide" evidence=IEP]
[GO:0042802 "identical protein binding" evidence=IEA;ISO]
[GO:0043434 "response to peptide hormone stimulus" evidence=IEP]
[GO:0043588 "skin development" evidence=ISO] [GO:0043589 "skin
morphogenesis" evidence=IEA;ISO] [GO:0045893 "positive regulation of
transcription, DNA-dependent" evidence=IEA;ISO] [GO:0046872 "metal
ion binding" evidence=IEA] [GO:0048407 "platelet-derived growth
factor binding" evidence=IEA;ISO] [GO:0048705 "skeletal system
morphogenesis" evidence=ISO] [GO:0048706 "embryonic skeletal system
development" evidence=IEA;ISO] [GO:0051591 "response to cAMP"
evidence=IEP] [GO:0060325 "face morphogenesis" evidence=IEA;ISO]
[GO:0060346 "bone trabecula formation" evidence=IEA;ISO] [GO:0060351
"cartilage development involved in endochondral bone morphogenesis"
evidence=IEA;ISO] [GO:0070208 "protein heterotrimerization"
evidence=IEA;ISO] [GO:0071230 "cellular response to amino acid
stimulus" evidence=IEA;ISO] [GO:0071260 "cellular response to
mechanical stimulus" evidence=IEA] [GO:0071300 "cellular response to
retinoic acid" evidence=IEP] [GO:0071363 "cellular response to
growth factor stimulus" evidence=IEP] [GO:0071560 "cellular response
to transforming growth factor beta stimulus" evidence=IEP]
[GO:0090263 "positive regulation of canonical Wnt receptor signaling
pathway" evidence=IEA;ISO] InterPro:IPR000885 InterPro:IPR001007
Pfam:PF00093 Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208
PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038 SMART:SM00214
RGD:61817 GO:GO:0005737 GO:GO:0045893 GO:GO:0005615 GO:GO:0009612
GO:GO:0071560 GO:GO:0046872 GO:GO:0015031 GO:GO:0007601
GO:GO:0071300 GO:GO:0043434 GO:GO:0030199 GO:GO:0007584
GO:GO:0010035 GO:GO:0007605 GO:GO:0010718 GO:GO:0030335
GO:GO:0042542 GO:GO:0070208 InterPro:IPR008160 Pfam:PF01391
eggNOG:NOG12793 GO:GO:0042060 GO:GO:0071260 GO:GO:0001568
GO:GO:0001649 GO:GO:0051591 GO:GO:0034505 GO:GO:0090263
GO:GO:0001503 GO:GO:0010812 GO:GO:0060325 EMBL:CH473948
GO:GO:0032964 GO:GO:0071230 GO:GO:0048706 GO:GO:0001957
GO:GO:0034504 GO:GO:0001958 GO:GO:0060351 GO:GO:0005201
GO:GO:0043589 CTD:1277 GeneTree:ENSGT00660000095287
HOGENOM:HOG000085654 HOVERGEN:HBG004933 KO:K06236 OMA:VAYMDQQ
GO:GO:0005584 GO:GO:0060346 GO:GO:0031960 EMBL:Z78279 EMBL:BC133728
EMBL:M11432 IPI:IPI00188909 PIR:A90559 RefSeq:NP_445756.1
UniGene:Rn.2953 PDB:3HQV PDB:3HR2 PDBsum:3HQV PDBsum:3HR2
ProteinModelPortal:P02454 IntAct:P02454 STRING:P02454 PRIDE:P02454
Ensembl:ENSRNOT00000005311 GeneID:29393 KEGG:rno:29393
UCSC:RGD:61817 InParanoid:A3KNA1 Reactome:REACT_150387
EvolutionaryTrace:P02454 NextBio:609017 ArrayExpress:P02454
Genevestigator:P02454 GermOnline:ENSRNOG00000003897 Uniprot:P02454
Length = 1453
Score = 126 (49.4 bits), Expect = 0.00028, P = 0.00028
Identities = 88/285 (30%), Positives = 108/285 (37%)
Query: 126 ADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNTSTS 182
ADG G G G++ + P G A G P G+ G P + G G P +
Sbjct: 808 ADGQPGAKGEPGDTGVKGDAGPPGP-AGPAGPPGPIGNVGAPGPKGSRGAAGP-PGATGF 865
Query: 183 AYAATQSGTPMRAAYDIPRGP----GYEASKGPGYDASKA--PSYDPTKGPSYDPA--KG 234
AA + G P + P GP G E KGP + A P GP PA KG
Sbjct: 866 PGAAGRVGPPGPSGNAGPPGPPGPVGKEGGKGPRGETGPAGRPGEVGPPGPP-GPAGEKG 924
Query: 235 -PGYD-P-----TKGP-GYDAQKGS-NYDAQRGPN-YDIHRGPSYDP-QRG-LGYDMQRG 282
PG D P T GP G Q+G QRG + GPS +P ++G G +RG
Sbjct: 925 SPGADGPAGSPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQGPSGASGERG 984
Query: 283 PNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYD 342
P M GP PG GP E+ R S + PG D G D
Sbjct: 985 PPGPM--GP-------PGL---AGPPGESGREGSPGAEGSPGRDGAPGAKGDRGETGPAG 1032
Query: 343 PSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 387
P G GAP P G+ P G A P +G+ P G
Sbjct: 1033 PPGAPGAPGAPGPVGPAGKNGDRGETGPAGPAGPIGPAGARGPAG 1077
>UNIPROTKB|F1LQ00 [details] [associations]
symbol:Col5a2 "Protein Col5a2" species:10116 "Rattus
norvegicus" [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
SMART:SM00038 SMART:SM00214 RGD:70921 GO:GO:0043588 GO:GO:0030199
GO:GO:0001501 InterPro:IPR008160 Pfam:PF01391 GO:GO:0071230
GO:GO:0005201 GO:GO:0048592 GeneTree:ENSGT00660000095287
GO:GO:0005588 IPI:IPI00366945 Ensembl:ENSRNOT00000005073
Uniprot:F1LQ00
Length = 1467
Score = 126 (49.4 bits), Expect = 0.00028, P = 0.00028
Identities = 87/290 (30%), Positives = 109/290 (37%)
Query: 123 DRRADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNT 179
++ A+G+ G GA G + P G E G P+G GPP S G G T
Sbjct: 752 EKGAEGTAGNDGARGLPGSLGPPGPAGPTG-EKGEPGPRGLVGPPGSRGNPGSRGENGPT 810
Query: 180 STSAYAATQS--GTP-MRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAKGP 235
+A Q G P ++ P G S GP G S P + P P +G
Sbjct: 811 GAVGFAGPQGPDGQPGVKGEPGEPGQKGDAGSPGPQGLAGSPGP-HGPHGVPGLKGGRGT 869
Query: 236 GYDP--TKGPGYDAQKGSNYDAQR-GPNYDIHRGPSYDPQRGLGYDM-QRGPNYDM-QRG 290
P T PG + G A GP I P + GL D G D G
Sbjct: 870 QGPPGATGFPGSAGRVGPPGPAGAPGPAGPIGE-PGKEGPPGLRGDPGSHGRVGDRGPAG 928
Query: 291 P-GYETQRV-PGYDVQRGPVYEAQRAPSYIP-QRG-PGYDLQRGQ-GYDMRRAPSYDPSR 345
P G + PG D Q GP + P+ QRG G QRG+ G P+ P +
Sbjct: 929 PPGSPGDKGDPGEDGQPGP--DGPPGPAGTTGQRGIVGMPGQRGERGMPGLPGPAGTPGK 986
Query: 346 GTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP-RGGNPARR 393
G GA P G V PP +N P G P +G+ G P R G R
Sbjct: 987 -VGPTGATGDKGPPGPVGPPGSNGPVGEPGPEGPAGNDGTPGRDGAVGER 1035
>ZFIN|ZDB-GENE-980526-192 [details] [associations]
symbol:col2a1a "collagen type II, alpha-1a"
species:7955 "Danio rerio" [GO:0005581 "collagen" evidence=IEA;ISS]
[GO:0005201 "extracellular matrix structural constituent"
evidence=IEA] [GO:0030903 "notochord development" evidence=IGI]
InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
SMART:SM00038 SMART:SM00214 ZFIN:ZDB-GENE-980526-192 GO:GO:0005581
InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201
GeneTree:ENSGT00660000095287 HOVERGEN:HBG004933 KO:K06236
GO:GO:0030903 EMBL:BX927144 EMBL:DQ335127 IPI:IPI00505438
RefSeq:NP_571367.1 UniGene:Dr.75057 SMR:Q2LDA1 STRING:Q2LDA1
Ensembl:ENSDART00000100234 GeneID:562496 KEGG:dre:562496 CTD:562496
InParanoid:Q2LDA1 NextBio:20884441 Uniprot:Q2LDA1
Length = 1491
Score = 126 (49.4 bits), Expect = 0.00029, P = 0.00029
Identities = 83/270 (30%), Positives = 96/270 (35%)
Query: 132 GATGNSENETSGRPVGQNAYEDGYGVPQG-HGPPPSATTAGVVGA-GPNTSTSAYAATQ- 188
GA G N+ GQ + G PQG G P GV G G + A AT
Sbjct: 844 GADGQPGNKGEQGESGQKG-DSGAPGPQGPSGAPGPVGPTGVTGPKGARGAQGAPGATGF 902
Query: 189 SGTPMRAAYDIPRG-PGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGP-GYDPTKGP 243
G R P G PG GP G D K D G + D +GP G KG
Sbjct: 903 PGAAGRVGPPGPNGNPGAAGPAGPSGKDGPKGVRGDAGPPGRAGDAGLRGPPGAPGEKGE 962
Query: 244 -GYDAQKGSNYDAQRGP-NYDIHRGPSYDP-QRG-LGYDMQRGPNYD--MQRGPGYETQR 297
G D G D GP RG P QRG G+ GP+ + Q PG R
Sbjct: 963 AGEDGPPGP--DGPSGPAGLAGQRGIVGLPGQRGERGFPGLPGPSGEPGKQGAPGGSGDR 1020
Query: 298 VP----GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG-QGYDMRRAPSYDPSRGTGFDGA 352
P G GP E R + PG D G +G P P G GA
Sbjct: 1021 GPPGPVGPPGLTGPAGETGREGNPGSDGPPGRDGAAGVKGERGNTGPIGAPG-APGAPGA 1079
Query: 353 PRGAAPHGQVPPPLNNVPYGSATPPARSGS 382
P P G+ N P G A PP +G+
Sbjct: 1080 PGSVGPIGKQGDRGENGPQGPAGPPGPAGA 1109
>WB|WBGene00001076 [details] [associations]
symbol:dpy-17 species:6239 "Caenorhabditis elegans"
[GO:0042302 "structural constituent of cuticle" evidence=IEA]
[GO:0016021 "integral to membrane" evidence=IEA] [GO:0010171 "body
morphogenesis" evidence=IMP] [GO:0040002 "collagen and
cuticulin-based cuticle development" evidence=IMP] [GO:0040035
"hermaphrodite genitalia development" evidence=IMP] [GO:0040007
"growth" evidence=IMP] [GO:0002119 "nematode larval development"
evidence=IMP] InterPro:IPR002486 Pfam:PF01484 SMART:SM01088
GO:GO:0040007 GO:GO:0002119 GO:GO:0010171 GO:GO:0040035
InterPro:IPR008160 Pfam:PF01391 GO:GO:0040002 EMBL:FO080874
GO:GO:0042302 HOGENOM:HOG000085656 GeneTree:ENSGT00390000012316
RefSeq:NP_498086.1 ProteinModelPortal:Q20778 SMR:Q20778
DIP:DIP-26150N MINT:MINT-1080630 STRING:Q20778 PaxDb:Q20778
EnsemblMetazoa:F54D8.1.1 EnsemblMetazoa:F54D8.1.2 GeneID:175696
KEGG:cel:CELE_F54D8.1 UCSC:F54D8.1.1 CTD:175696 WormBase:F54D8.1
eggNOG:NOG253878 InParanoid:Q20778 OMA:TEMEAWR NextBio:889252
Uniprot:Q20778
Length = 352
Score = 118 (46.6 bits), Expect = 0.00031, P = 0.00031
Identities = 74/296 (25%), Positives = 104/296 (35%)
Query: 108 EVEKLRAELMNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGY-GVPQGHGPPPS 166
E +++ ++ V R+A G YGG G SG P G + G+ G PQGH P +
Sbjct: 48 ESDQIYMDMQKFGRVRRQA-GGYGGYGGYGSGP-SG-PSGPSGPHGGFPGGPQGHFPGNT 104
Query: 167 ATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKG 226
++ G + G+P+ GPG + + P+ P
Sbjct: 105 GSSNTPTLPGVIGVPPSVTGHPGGSPINPDGSPSAGPGDKCNCNTENSCPAGPA-GPKGT 163
Query: 227 PSYDPAKG-PGYDPTKGPGYDAQKGSNYDAQRGPNYD----IHRGPSYDP-QRGL-GYDM 279
P +D G PG PG D + + AQ YD GP P +G G
Sbjct: 164 PGHDGPDGIPGV-----PGVDGEDADDAKAQT-QQYDGCFTCPAGPQGPPGSQGKPGARG 217
Query: 280 QRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYD-MRRA 338
RG PG + PG GP+ A P PG D++ G +
Sbjct: 218 MRGARGQAAM-PGRDGS--PGMPGSLGPIGPPGAAGEEGPTGEPGADVEHQIGLPGAKGT 274
Query: 339 PSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGS-GQP-RGGNPAR 392
P G + RGA G PP P G +G+ G P G P +
Sbjct: 275 PGAPGESGDQGEQGDRGAT--GIAGPPGERGPQGEKGDDGPNGAAGSPGEEGEPGQ 328
>UNIPROTKB|G4MYW7 [details] [associations]
symbol:MGG_10829 "Uncharacterized protein" species:242507
"Magnaporthe oryzae 70-15" [GO:0005575 "cellular_component"
evidence=ND] [GO:0008150 "biological_process" evidence=ND]
InterPro:IPR000571 PROSITE:PS50103 GO:GO:0008270 GO:GO:0003676
EMBL:CM001232 InterPro:IPR019496 Pfam:PF10453 RefSeq:XP_003713435.1
EnsemblFungi:MGG_10829T0 GeneID:2676344 KEGG:mgr:MGG_10829
Uniprot:G4MYW7
Length = 600
Score = 121 (47.7 bits), Expect = 0.00033, P = 0.00033
Identities = 61/238 (25%), Positives = 82/238 (34%)
Query: 160 GHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYD--IPRGPGYEASKGPGYDASK 217
G+GPPP GA P Y Q + PRG G A G G
Sbjct: 5 GYGPPPPPPA----GAPPQAYQQQYGQYQQPPATGHVHGGHAPRG-GRGAHSGRGDFHGS 59
Query: 218 APSYDPTKGPSYDPA-KGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLG 276
PSY P P+ GP + P P + NY P + ++ P Y Q+
Sbjct: 60 PPSYPYNNQPQPPPSYTGPHHAPP--PPHTPLAPQNYHPNYAPQH--YQQPQYAHQQQYP 115
Query: 277 YDMQRGPNYDMQRGPGYETQRVPGY-DVQRGPVYEAQRAPSYIPQR--GPG-YDLQRGQG 332
+ + P Q+ P Y P Y P ++ P+ + GP Y RG+G
Sbjct: 116 HQQPQQPPQPPQQAP-Y-AHHYPSYPQAPNAPPHQPWGGPATAGHQPAGPAHYGSGRGRG 173
Query: 333 YDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNP 390
+ P+ G G G PP L V + PP G P+GG P
Sbjct: 174 GHQGDRGGHKPAAAMG-PPLRMGFDNRGPEPPAL--VSSATVYPP--QPFGPPQGGAP 226
>ZFIN|ZDB-GENE-041221-2 [details] [associations]
symbol:prnpb "prion protein b" species:7955 "Danio
rerio" [GO:0051260 "protein homooligomerization" evidence=IEA]
[GO:0016020 "membrane" evidence=IEA] [GO:0003674
"molecular_function" evidence=ND] [GO:0016338 "calcium-independent
cell-cell adhesion" evidence=IMP] [GO:0007156 "homophilic cell
adhesion" evidence=IDA] [GO:0055113 "epiboly involved in
gastrulation with mouth forming second" evidence=IGI;IMP]
[GO:2000047 "regulation of cell-cell adhesion mediated by cadherin"
evidence=IMP] [GO:0005886 "plasma membrane" evidence=IDA]
[GO:0007417 "central nervous system development" evidence=IGI]
[GO:0009986 "cell surface" evidence=IDA] InterPro:IPR022416
ZFIN:ZDB-GENE-041221-2 GO:GO:0005886 GO:GO:0009986 GO:GO:0051260
GO:GO:0007156 GO:GO:0055113 GO:GO:0016338 Gene3D:1.10.790.10
SUPFAM:SSF54098 EMBL:AJ850286 IPI:IPI00485089 UniGene:Dr.90045
ProteinModelPortal:Q5K0E1 PRIDE:Q5K0E1 HOVERGEN:HBG056090
InParanoid:Q5K0E1 Bgee:Q5K0E1 GO:GO:2000047 Uniprot:Q5K0E1
Length = 606
Score = 121 (47.7 bits), Expect = 0.00034, P = 0.00034
Identities = 89/287 (31%), Positives = 108/287 (37%)
Query: 126 ADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAG--PNTST 181
A GSY G G+S + G +Y G P G P G G PN +
Sbjct: 94 AGGSYPYPGRGGSSPGGYPNQNPGAGSYPSGGSYPSAGGNPNQYPGRGGYNPGGYPNQNP 153
Query: 182 SAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK 241
A + G+ A + + PG + GY P+ +P G SY PA G Y
Sbjct: 154 GAGSYPAGGSYPSAGGNPNQYPGRGGTSPAGY-----PNQNPGAG-SY-PAGG-SYPSAG 205
Query: 242 G-PG-YDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRG---PGYET 295
G P Y + GSN PN + G SY P G Y G PN RG PG
Sbjct: 206 GNPNQYPGRGGSNPGGY--PNQNPGAG-SY-PAGG-SYPSAGGNPNQYPGRGGSSPGGNP 260
Query: 296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQR-GQ-GYDMRRAP---SYDPSRGTGFD 350
+ PG G Y P+ P G GY Q G+ GY P SY P R G
Sbjct: 261 NQNPGAGTYAGGGY-----PNQYPGGG-GYSNQNPGRSGYSPGGYPGAGSY-PVRNAGQP 313
Query: 351 GAPRGAAPH--GQVPP--PLNNV--P-YGSATPPARSGSGQPRGGNP 390
G GA P G P P N + P YG + G G GG+P
Sbjct: 314 GVYPGAHPSAGGGYPNWNPNNQILSPRYGGSF----GGGGFGTGGSP 356
>WB|WBGene00001263 [details] [associations]
symbol:emb-9 species:6239 "Caenorhabditis elegans"
[GO:0005201 "extracellular matrix structural constituent"
evidence=IEA;TAS] [GO:0005581 "collagen" evidence=IEA] [GO:0040010
"positive regulation of growth rate" evidence=IMP] [GO:0008340
"determination of adult lifespan" evidence=IMP] [GO:0000003
"reproduction" evidence=IMP] [GO:0009792 "embryo development ending
in birth or egg hatching" evidence=IMP] [GO:0040007 "growth"
evidence=IMP] [GO:0002119 "nematode larval development"
evidence=IMP] [GO:0040011 "locomotion" evidence=IMP] [GO:0006898
"receptor-mediated endocytosis" evidence=IMP] [GO:0040039
"inductive cell migration" evidence=IMP] [GO:0030198 "extracellular
matrix organization" evidence=IMP] [GO:0009790 "embryo development"
evidence=IMP] [GO:0050714 "positive regulation of protein
secretion" evidence=IMP] [GO:0007517 "muscle organ development"
evidence=IMP] [GO:0005604 "basement membrane" evidence=IDA]
InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
GO:GO:0008340 GO:GO:0009792 GO:GO:0006898 GO:GO:0040007
GO:GO:0040010 GO:GO:0002119 InterPro:IPR016187 SUPFAM:SSF56436
GO:GO:0030198 GO:GO:0000003 GO:GO:0050714 GO:GO:0007517
GO:GO:0040039 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
eggNOG:NOG12793 GO:GO:0005604 GO:GO:0005201 HOGENOM:HOG000085652
Gene3D:2.170.240.10 EMBL:X56979 EMBL:Z27078 EMBL:J05067 PIR:S40991
RefSeq:NP_001022662.1 RefSeq:NP_001022663.1
ProteinModelPortal:P17139 SMR:P17139 IntAct:P17139
MINT:MINT-1091171 STRING:P17139 PaxDb:P17139 PRIDE:P17139
EnsemblMetazoa:K04H4.1a GeneID:176314 KEGG:cel:CELE_K04H4.1
UCSC:K04H4.1b CTD:176314 WormBase:K04H4.1a WormBase:K04H4.1b
GeneTree:ENSGT00690000101772 InParanoid:P17139 OMA:EEGIPGC
NextBio:892048 Uniprot:P17139
Length = 1759
Score = 126 (49.4 bits), Expect = 0.00035, P = 0.00035
Identities = 79/282 (28%), Positives = 100/282 (35%)
Query: 128 GSYGGATGNSENETSGRP----VGQNAYEDGY-GVP--QGHGPPPSATTAGVVGAGPNTS 180
G+YG E G P A E GY G P +G P G AGP+
Sbjct: 315 GNYGEKGSQGEQGLGGTPGYPGTKGGAGEPGYPGRPGFEGDCGPEGPLGEGTGEAGPH-G 373
Query: 181 TSAYAATQSGTPMRAAYDIPRGP-GYEASKG-PGYDASKAPSYDPTKGPSYDPAKGP-GY 237
+ Q G + +P GP G G PG A P D G + +G GY
Sbjct: 374 AQGFDGVQGGKGLPGHDGLP-GPVGPRGPVGAPG--APGQPGIDGMPGYTEKGDRGEDGY 430
Query: 238 DPTKG-PGYDAQKGS-NYDAQRG-PNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGPGY 293
G PG + G Y + G P YDI P D Q G G+ G D PGY
Sbjct: 431 PGFAGEPGLPGEPGDCGYPGEDGLPGYDIQGPPGLDGQSGRDGFPGIPGDIGD----PGY 486
Query: 294 ETQR-VPGYDVQR-GP--VYEAQRAPSYIPQR-G-PGYDLQRGQGYDMRRAPSYDPSRGT 347
++ PG V + GP + P +P R G GY G + Y P
Sbjct: 487 SGEKGFPGTGVNKVGPPGMTGLPGEPG-MPGRIGVDGYPGPPGNNGERGEDCGYCPDGVP 545
Query: 348 GFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGN 389
G G P +G PP N +G P G + G +
Sbjct: 546 GNAGDPGFPGMNGYPGPPGPNGDHGDCGMPGAPGKPRSAGSD 587
>UNIPROTKB|F1LRM7 [details] [associations]
symbol:Col2a1 "Collagen alpha-1(II) chain" species:10116
"Rattus norvegicus" [GO:0001502 "cartilage condensation"
evidence=IEA] [GO:0001894 "tissue homeostasis" evidence=IEA]
[GO:0001958 "endochondral ossification" evidence=IEA] [GO:0002062
"chondrocyte differentiation" evidence=IEA] [GO:0003007 "heart
morphogenesis" evidence=IEA] [GO:0005201 "extracellular matrix
structural constituent" evidence=IEA] [GO:0005585 "collagen type
II" evidence=IEA] [GO:0005604 "basement membrane" evidence=IEA]
[GO:0005615 "extracellular space" evidence=IEA] [GO:0005737
"cytoplasm" evidence=IEA] [GO:0006029 "proteoglycan metabolic
process" evidence=IEA] [GO:0007417 "central nervous system
development" evidence=IEA] [GO:0007601 "visual perception"
evidence=IEA] [GO:0007605 "sensory perception of sound"
evidence=IEA] [GO:0010468 "regulation of gene expression"
evidence=IEA] [GO:0030199 "collagen fibril organization"
evidence=IEA] [GO:0030903 "notochord development" evidence=IEA]
[GO:0042472 "inner ear morphogenesis" evidence=IEA] [GO:0042802
"identical protein binding" evidence=IEA] [GO:0043066 "negative
regulation of apoptotic process" evidence=IEA] [GO:0048407
"platelet-derived growth factor binding" evidence=IEA] [GO:0060021
"palate development" evidence=IEA] [GO:0060174 "limb bud formation"
evidence=IEA] [GO:0060272 "embryonic skeletal joint morphogenesis"
evidence=IEA] [GO:0060351 "cartilage development involved in
endochondral bone morphogenesis" evidence=IEA] [GO:0071599 "otic
vesicle development" evidence=IEA] [GO:0071773 "cellular response
to BMP stimulus" evidence=IEA] InterPro:IPR000885 Pfam:PF01410
ProDom:PD002078 PROSITE:PS51461 SMART:SM00038 RGD:2375
GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201
GeneTree:ENSGT00660000095287 IPI:IPI00394380
Ensembl:ENSRNOT00000016044 ArrayExpress:F1LRM7 Uniprot:F1LRM7
Length = 1419
Score = 125 (49.1 bits), Expect = 0.00035, P = 0.00035
Identities = 87/281 (30%), Positives = 99/281 (35%)
Query: 126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 183
ADG G E G G + G P GP G GA GP +T
Sbjct: 773 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 831
Query: 184 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 239
AA + G P P GP GP G D K D G + DP +GP P
Sbjct: 832 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 886
Query: 240 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 295
KG PG D GS D GP +G + QRG+ G QRG + GP E
Sbjct: 887 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 937
Query: 296 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 351
Q PG RGP P PG + G R A RG TG G
Sbjct: 938 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 997
Query: 352 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 391
AP P G P P P G +G+ P G PA
Sbjct: 998 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1035
Score = 125 (49.1 bits), Expect = 0.00035, P = 0.00035
Identities = 89/296 (30%), Positives = 110/296 (37%)
Query: 120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSAT---TAGVVGAG 176
P DR D GA G + G P G G P GPP A + G
Sbjct: 64 PRGDR-GDKGERGAPGPRGRD--GEP-GTPGNPGPPGPPGPPGPPGLGGGNFAAQMAGGF 119
Query: 177 PNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAK 233
+ A G PM PRGP G + GP G+ + +P GP P
Sbjct: 120 DEKAGGAQMGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GPRG 175
Query: 234 GPGYDPTKGPGYDAQKGSNYDA-QRG-PNYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQR 289
PG P PG D + G A +RG P RG P GL G RG P D +
Sbjct: 176 PPG--PAGKPGDDGEAGKPGKAGERGLPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAK 231
Query: 290 G----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS---- 340
G PG + + PG + GP+ + P + GP +G D + P+
Sbjct: 232 GEAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPG 289
Query: 341 -YDPSRGTGFDGAP--RG-AAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNP 390
P+ G GF GAP +G A P G P GS P GS P G GNP
Sbjct: 290 PVGPAGGPGFPGAPGAKGEAGPTGARGP---EGAQGSRGEPGNPGSPGPAGASGNP 342
>RGD|2375 [details] [associations]
symbol:Col2a1 "collagen, type II, alpha 1" species:10116 "Rattus
norvegicus" [GO:0001501 "skeletal system development" evidence=ISO]
[GO:0001502 "cartilage condensation" evidence=ISO] [GO:0001894
"tissue homeostasis" evidence=ISO] [GO:0001958 "endochondral
ossification" evidence=ISO] [GO:0002062 "chondrocyte differentiation"
evidence=ISO] [GO:0003007 "heart morphogenesis" evidence=ISO]
[GO:0005201 "extracellular matrix structural constituent"
evidence=TAS] [GO:0005581 "collagen" evidence=ISO] [GO:0005585
"collagen type II" evidence=ISO;TAS] [GO:0005604 "basement membrane"
evidence=ISO] [GO:0005615 "extracellular space" evidence=ISO]
[GO:0005737 "cytoplasm" evidence=ISO] [GO:0006029 "proteoglycan
metabolic process" evidence=ISO] [GO:0007601 "visual perception"
evidence=ISO] [GO:0007605 "sensory perception of sound" evidence=ISO]
[GO:0010468 "regulation of gene expression" evidence=ISO] [GO:0030199
"collagen fibril organization" evidence=ISO] [GO:0031012
"extracellular matrix" evidence=ISO] [GO:0035108 "limb morphogenesis"
evidence=ISO] [GO:0042472 "inner ear morphogenesis" evidence=ISO]
[GO:0042802 "identical protein binding" evidence=ISO] [GO:0043066
"negative regulation of apoptotic process" evidence=ISO] [GO:0046872
"metal ion binding" evidence=IEA] [GO:0048407 "platelet-derived
growth factor binding" evidence=ISO] [GO:0048705 "skeletal system
morphogenesis" evidence=ISO] [GO:0048839 "inner ear development"
evidence=ISO] [GO:0051216 "cartilage development" evidence=IEP;ISO]
[GO:0060021 "palate development" evidence=ISO] [GO:0060272 "embryonic
skeletal joint morphogenesis" evidence=ISO] [GO:0060348 "bone
development" evidence=ISO] [GO:0060351 "cartilage development
involved in endochondral bone morphogenesis" evidence=ISO]
[GO:0071773 "cellular response to BMP stimulus" evidence=ISO]
InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
SMART:SM00038 RGD:2375 GO:GO:0046872 GO:GO:0051216 InterPro:IPR008160
Pfam:PF01391 eggNOG:NOG12793 GO:GO:0005201 HOGENOM:HOG000085654
HOVERGEN:HBG004933 KO:K06236 CTD:1280 Reactome:REACT_133391
GO:GO:0005585 EMBL:L48440 EMBL:K02804 EMBL:M10613 EMBL:X79816
IPI:IPI00394380 PIR:A05152 PIR:I60384 RefSeq:NP_037061.1
UniGene:Rn.10124 IntAct:P05539 STRING:P05539 PRIDE:P05539
GeneID:25412 KEGG:rno:25412 UCSC:RGD:2375 NextBio:606543
ArrayExpress:P05539 Genevestigator:P05539
GermOnline:ENSRNOG00000022282 Uniprot:P05539
Length = 1419
Score = 125 (49.1 bits), Expect = 0.00035, P = 0.00035
Identities = 87/281 (30%), Positives = 99/281 (35%)
Query: 126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 183
ADG G E G G + G P GP G GA GP +T
Sbjct: 773 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 831
Query: 184 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 239
AA + G P P GP GP G D K D G + DP +GP P
Sbjct: 832 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 886
Query: 240 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 295
KG PG D GS D GP +G + QRG+ G QRG + GP E
Sbjct: 887 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 937
Query: 296 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 351
Q PG RGP P PG + G R A RG TG G
Sbjct: 938 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 997
Query: 352 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 391
AP P G P P P G +G+ P G PA
Sbjct: 998 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1035
Score = 122 (48.0 bits), Expect = 0.00074, P = 0.00074
Identities = 89/296 (30%), Positives = 110/296 (37%)
Query: 120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSAT---TAGVVGAG 176
P DR D GA G + G P G G P GPP A + G
Sbjct: 64 PRGDR-GDKGERGAPGPRGRD--GEP-GTPGNPGPPGPPGPPGPPGLGGGNFAAQMAGGF 119
Query: 177 PNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAK 233
+ A G PM PRGP G + GP G+ + +P GP P
Sbjct: 120 DEKAGGAQMGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGP-IGPRG 175
Query: 234 GPGYDPTKGPGYDAQKGSNYDA-QRG-PNYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQR 289
PG P PG D + G A +RG P RG P GL G RG P D +
Sbjct: 176 PPG--PAGKPGDDGEAGKPGKAGERGLPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAK 231
Query: 290 G----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS---- 340
G PG + + PG + GP+ + P + GP +G D + P+
Sbjct: 232 GEAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPG 289
Query: 341 -YDPSRGTGFDGAP--RG-AAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNP 390
P+ G GF GAP +G A P G P GS P GS P G GNP
Sbjct: 290 PVGPAGGPGFLGAPGAKGEAGPTGARGP---EGAQGSRGEPGNPGSPGPAGASGNP 342
>UNIPROTKB|E1BT66 [details] [associations]
symbol:TAF15 "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0000166 "nucleotide binding" evidence=IEA] [GO:0003676
"nucleic acid binding" evidence=IEA] [GO:0008270 "zinc ion binding"
evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0005737
"cytoplasm" evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358
PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360 SMART:SM00547
GO:GO:0005634 GO:GO:0005737 GO:GO:0000166 GO:GO:0008270
Gene3D:3.30.70.330 GO:GO:0003676 GeneTree:ENSGT00530000063105
OMA:YGNQGSQ EMBL:AADN02025953 EMBL:AADN02025954 IPI:IPI00575015
ProteinModelPortal:E1BT66 Ensembl:ENSGALT00000003204 Uniprot:E1BT66
Length = 443
Score = 119 (46.9 bits), Expect = 0.00035, P = 0.00035
Identities = 70/232 (30%), Positives = 89/232 (38%)
Query: 137 SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAA 196
S++ + G+ GQ +Y YG G T G G G + S+Y QS
Sbjct: 3 SDSGSYGQSGGQQSYSS-YG---NQGNQSYGQTQGYSGYGQSGDNSSYG--QSYGNYHGN 56
Query: 197 YDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP--GYDAQKGSNYD 254
Y GY GYD SYD SY+ KG G S+YD
Sbjct: 57 YG-QNQTGY-GQDSHGYDDES--SYDNQNQSSYNQQSYSNQGQQKGSSRGGRGSYSSSYD 112
Query: 255 AQRGPNYDIHRGPSYDPQRGLG----YDMQRGPNYDMQRGPGYETQRVPGYDVQRGPV-Y 309
Q G Y H+G SYD Q G G YD + G N Q G+ Q Y Q+G +
Sbjct: 113 QQSG--YG-HQG-SYDQQSGYGHQSSYDQKSGYNQH-QSSYGHSQQ---SYQSQKGSYSH 164
Query: 310 EAQ---RAPSYIPQRGPGYDLQRGQGYDMRRAPSYD-PSRG--TGFDGAPRG 355
+Q R S + GY +G G R YD RG +G+ G RG
Sbjct: 165 NSQDDRREKSRYGEDNRGYGGSQGGG----RG-GYDMDGRGHMSGYSGGDRG 211
>UNIPROTKB|E7ENY8 [details] [associations]
symbol:COL3A1 "Collagen alpha-1(III) chain" species:9606
"Homo sapiens" [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
SMART:SM00038 SMART:SM00214 GO:GO:0005581 InterPro:IPR008160
Pfam:PF01391 GO:GO:0005201 EMBL:AC066694 HGNC:HGNC:2201
ChiTaRS:COL3A1 IPI:IPI00981037 PDB:4GYX PDBsum:4GYX
ProteinModelPortal:E7ENY8 SMR:E7ENY8 PRIDE:E7ENY8
Ensembl:ENST00000317840 ArrayExpress:E7ENY8 Bgee:E7ENY8
Uniprot:E7ENY8
Length = 1163
Score = 124 (48.7 bits), Expect = 0.00036, P = 0.00036
Identities = 81/280 (28%), Positives = 101/280 (36%)
Query: 126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP----NTS 180
A G G G + P G + + G P GPP AG G GP S
Sbjct: 165 AVGGLAGYPGPAGPPGPPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPSGPPGPPGAIGPS 224
Query: 181 TSAYAATQSGTPMRAAY-DIPRGPGYEASKG-PGYDASKAP-SYDPTKGPSYDPAKGPGY 237
A +SG P R +P PG + G PG+ K +D G + PG
Sbjct: 225 GPAGKDGESGRPGRPGERGLPGPPGIKGPAGIPGFPGMKGHRGFDGRNGEKGETG-APGL 283
Query: 238 DPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGP-GYE 294
G PG + G RG + R P G G D RG D Q GP G
Sbjct: 284 KGENGLPGENGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQPGPPGPP 338
Query: 295 -TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 353
T PG +G V A S PG QRG+ A + P G +G+P
Sbjct: 339 GTAGFPGSPGAKGEVGPAGSPGS---NGAPG---QRGEPGPQGHAGAQGPPGPPGINGSP 392
Query: 354 RGAAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RGG 388
G G P + P G+ PP +G+ G P RGG
Sbjct: 393 GGKGEMG--PAGIPGAPGLMGARGPPGPAGANGAPGLRGG 430
Score = 123 (48.4 bits), Expect = 0.00046, P = 0.00046
Identities = 85/284 (29%), Positives = 101/284 (35%)
Query: 126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-G-PNTSTSA 183
A G GGA +N G P G G+P G P + G G+ G P +
Sbjct: 424 APGLRGGAGEPGKNGAKGEP-GPRGERGEAGIP---GVPGAKGEDGKDGSPGEPGANGLP 479
Query: 184 YAATQSGTPMRAAYDIPRGP-GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG 242
AA + G P + P GP G KGP + AP P GP A PG D G
Sbjct: 480 GAAGERGAP---GFRGPAGPNGIPGEKGPAGERG-APG--PA-GPR-GAAGEPGRDGVPG 531
Query: 243 -PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDM-QRG----PGYE-T 295
PG GS GP D GP Q G GP+ Q G PG +
Sbjct: 532 GPGMRGMPGS----PGGPGSDGKPGPP-GSQGESGRPGPPGPSGPRGQPGVMGFPGPKGN 586
Query: 296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG-QGYDMRRAPSYD-----PSRGTGF 349
PG + +RG P PQ PG + + G QG P D P G
Sbjct: 587 DGAPGKNGERG----GPGGPG--PQGPPGKNGETGPQGPPGPTGPGGDKGDTGPPGPQGL 640
Query: 350 DGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPARR 393
G P P G+ P P G A P G G+ G P R
Sbjct: 641 QGLPGTGGPPGENGKPGEPGPKGDAGAPGAPG-GKGDAGAPGER 683
>UNIPROTKB|F1LP41 [details] [associations]
symbol:Col2a1 "Collagen alpha-1(II) chain" species:10116
"Rattus norvegicus" [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
SMART:SM00038 SMART:SM00214 RGD:2375 GO:GO:0005581
InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201 IPI:IPI00205809
Ensembl:ENSRNOT00000012441 ArrayExpress:F1LP41 Uniprot:F1LP41
Length = 1458
Score = 125 (49.1 bits), Expect = 0.00036, P = 0.00036
Identities = 87/281 (30%), Positives = 99/281 (35%)
Query: 126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 183
ADG G E G G + G P GP G GA GP +T
Sbjct: 812 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 870
Query: 184 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 239
AA + G P P GP GP G D K D G + DP +GP P
Sbjct: 871 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 925
Query: 240 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 295
KG PG D GS D GP +G + QRG+ G QRG + GP E
Sbjct: 926 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 976
Query: 296 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 351
Q PG RGP P PG + G R A RG TG G
Sbjct: 977 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 1036
Query: 352 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 391
AP P G P P P G +G+ P G PA
Sbjct: 1037 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1074
Score = 122 (48.0 bits), Expect = 0.00077, P = 0.00077
Identities = 91/308 (29%), Positives = 117/308 (37%)
Query: 105 MATEVEKLRAELMNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPP 164
+AT KL +++ P G+ G + E T G P G G P G G
Sbjct: 92 LATASGKLGPKIIG-PKGPPGPQGAPGPRGRDGEPGTPGNP-GPPGPPGPPG-PPGLGGG 148
Query: 165 PSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYD 222
A A + G + A G PM PRGP G + GP G+ + +
Sbjct: 149 NFA--AQMAGGFDEKAGGAQMGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGE 203
Query: 223 P-TKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDA-QRG-PNYDIHRGPSYDPQRGL-GYD 278
P GP P PG P PG D + G A +RG P RG P GL G
Sbjct: 204 PGVSGPM-GPRGPPG--PAGKPGDDGEAGKPGKAGERGLPGPQGARGFPGTP--GLPGVK 258
Query: 279 MQRG-PNYDMQRG----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQG 332
RG P D +G PG + + PG + GP+ + P + GP +G
Sbjct: 259 GHRGYPGLDGAKGEAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARG 316
Query: 333 YDMRRAPS-----YDPSRGTGFDGAP--RG-AAPHGQVPPPLNNVPYGSATPPARSGSGQ 384
D + P+ P+ G GF GAP +G A P G P GS P GS
Sbjct: 317 NDGQPGPAGPPGPVGPAGGPGFPGAPGAKGEAGPTGARGP---EGAQGSRGEPGNPGSPG 373
Query: 385 PRG--GNP 390
P G GNP
Sbjct: 374 PAGASGNP 381
>UNIPROTKB|P02453 [details] [associations]
symbol:COL1A1 "Collagen alpha-1(I) chain" species:9913 "Bos
taurus" [GO:0090263 "positive regulation of canonical Wnt receptor
signaling pathway" evidence=IEA] [GO:0071260 "cellular response to
mechanical stimulus" evidence=IEA] [GO:0071230 "cellular response
to amino acid stimulus" evidence=IEA] [GO:0070208 "protein
heterotrimerization" evidence=IEA] [GO:0060351 "cartilage
development involved in endochondral bone morphogenesis"
evidence=IEA] [GO:0060346 "bone trabecula formation" evidence=IEA]
[GO:0060325 "face morphogenesis" evidence=IEA] [GO:0048706
"embryonic skeletal system development" evidence=IEA] [GO:0048407
"platelet-derived growth factor binding" evidence=IEA] [GO:0045893
"positive regulation of transcription, DNA-dependent" evidence=IEA]
[GO:0043589 "skin morphogenesis" evidence=IEA] [GO:0042802
"identical protein binding" evidence=IEA] [GO:0034505 "tooth
mineralization" evidence=IEA] [GO:0034504 "protein localization to
nucleus" evidence=IEA] [GO:0032964 "collagen biosynthetic process"
evidence=IEA] [GO:0030335 "positive regulation of cell migration"
evidence=IEA] [GO:0030199 "collagen fibril organization"
evidence=IEA] [GO:0015031 "protein transport" evidence=IEA]
[GO:0010812 "negative regulation of cell-substrate adhesion"
evidence=IEA] [GO:0010718 "positive regulation of epithelial to
mesenchymal transition" evidence=IEA] [GO:0007605 "sensory
perception of sound" evidence=IEA] [GO:0007601 "visual perception"
evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005615
"extracellular space" evidence=IEA] [GO:0005584 "collagen type I"
evidence=IEA] [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] [GO:0001958 "endochondral ossification"
evidence=IEA] [GO:0001957 "intramembranous ossification"
evidence=IEA] [GO:0001649 "osteoblast differentiation"
evidence=IEA] [GO:0001568 "blood vessel development" evidence=IEA]
[GO:0046872 "metal ion binding" evidence=IEA] InterPro:IPR000885
InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410 ProDom:PD002078
PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461 SMART:SM00038
SMART:SM00214 GO:GO:0005737 GO:GO:0045893 GO:GO:0005615
GO:GO:0046872 GO:GO:0015031 GO:GO:0007601 GO:GO:0030199
GO:GO:0007605 GO:GO:0010718 GO:GO:0030335 GO:GO:0070208
InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793 GO:GO:0071260
GO:GO:0001568 GO:GO:0001649 GO:GO:0034505 GO:GO:0090263
GO:GO:0010812 GO:GO:0060325 GO:GO:0032964 GO:GO:0071230
GO:GO:0048706 GO:GO:0001957 GO:GO:0034504 GO:GO:0001958
GO:GO:0060351 GO:GO:0005201 GO:GO:0043589 EMBL:BC105184
IPI:IPI00707857 PIR:A91193 RefSeq:NP_001029211.1 UniGene:Bt.23316
IntAct:P02453 STRING:P02453 PRIDE:P02453 Ensembl:ENSBTAT00000017420
GeneID:282187 KEGG:bta:282187 CTD:1277 GeneTree:ENSGT00660000095287
HOGENOM:HOG000085654 HOVERGEN:HBG004933 InParanoid:P02453 KO:K06236
OMA:VAYMDQQ OrthoDB:EOG4S4PHP NextBio:20806015 PMAP-CutDB:P02453
ArrayExpress:P02453 GO:GO:0005584 GO:GO:0060346 Uniprot:P02453
Length = 1463
Score = 125 (49.1 bits), Expect = 0.00036, P = 0.00036
Identities = 90/286 (31%), Positives = 109/286 (38%)
Query: 126 ADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNTSTS 182
ADG G G G++ + P G A G P G+ G P G AGP +T
Sbjct: 818 ADGQPGAKGEPGDAGAKGDAGPPGP-AGPAGPPGPIGNVGAPGPKGARG--SAGPPGATG 874
Query: 183 -AYAATQSGTPMRAAYDIPRGP----GYEASKGPGYDASKA--PSYDPTKGPSYDPA--K 233
AA + G P + P GP G E SKGP + A P GP PA K
Sbjct: 875 FPGAAGRVGPPGPSGNAGPPGPPGPAGKEGSKGPRGETGPAGRPGEVGPPGPP-GPAGEK 933
Query: 234 G-PGYD-P-----TKGP-GYDAQKGS-NYDAQRGPN-YDIHRGPSYDP-QRG-LGYDMQR 281
G PG D P T GP G Q+G QRG + GPS +P ++G G +R
Sbjct: 934 GAPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQGPSGASGER 993
Query: 282 GPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSY 341
GP M GP PG GP E+ R + + PG D G D
Sbjct: 994 GPPGPM--GP-------PGL---AGPPGESGREGAPGAEGSPGRDGSPGAKGDRGETGPA 1041
Query: 342 DPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 387
P G GAP P G+ P G A P G+ P G
Sbjct: 1042 GPPGAPGAPGAPGPVGPAGKSGDRGETGPAGPAGPIGPVGARGPAG 1087
Score = 124 (48.7 bits), Expect = 0.00046, P = 0.00046
Identities = 82/275 (29%), Positives = 108/275 (39%)
Query: 130 YGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQS 189
+ GA G ++ E G P G E GV GPP A AG G P A +
Sbjct: 344 FPGAVG-AKGE--GGPQGPRGSEGPQGVRGEPGPPGPAGAAGPAG-NPGADGQPGAKGAN 399
Query: 190 GTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDP-TKGPSYDPAKGPGYDPTKGP-GYDA 247
G P I PG+ ++GP + PS P KG S +P PG +KG G
Sbjct: 400 GAP-----GIAGAPGFPGARGPS--GPQGPSGPPGPKGNSGEPG-APG---SKGDTGAKG 448
Query: 248 QKG-SNYDAQRGP-NYDIHRGPSYDP-QRGL-GYDMQRGPNYDMQRGPGYETQRVPGYDV 303
+ G + GP + RG +P GL G +RG GPG ++ PG D
Sbjct: 449 EPGPTGIQGPPGPAGEEGKRGARGEPGPAGLPGPPGERG-------GPG--SRGFPGADG 499
Query: 304 QRGPVYEA-QR-APSYI-PQRGPGYDLQRGQGYDMRRAPSYDPSRGT-GFDGAPRGAAPH 359
GP A +R AP P+ PG + G+ + A S G+ G DG P
Sbjct: 500 VAGPKGPAGERGAPGPAGPKGSPGEAGRPGEA-GLPGAKGLTGSPGSPGPDGKTGPPGPA 558
Query: 360 GQVPPPLNNVPYGSATPPARSGSGQPRG--GNPAR 392
GQ P P G+ G P+G G P +
Sbjct: 559 GQDGRPGPPGPPGARGQAGVMGFPGPKGAAGEPGK 593
Score = 121 (47.7 bits), Expect = 0.00099, P = 0.00099
Identities = 80/272 (29%), Positives = 99/272 (36%)
Query: 126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTS--A 183
A G G A E G P G ++ G+P GPP A G G + +
Sbjct: 617 AQGPPGPAGPAGERGEQG-PAGSPGFQ---GLPGPAGPPGEAGKPGEQGVPGDLGAPGPS 672
Query: 184 YAATQSGTPMRAAYDIPRGP-GYEASKG-PGYDASKAPSYDPTKGPSYDPAKG-PGYDPT 240
A + G P P GP G + G PG D +K + P P A G G
Sbjct: 673 GARGERGFPGERGVQGPPGPAGPRGANGAPGNDGAKGDAGAPG-APGSQGAPGLQGMPGE 731
Query: 241 KGP-GYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVP 299
+G G KG DA GP P D RGL + GP PG + + P
Sbjct: 732 RGAAGLPGPKGDRGDA--GPK-GADGAPGKDGVRGLTGPI--GPP-GPAGAPGDKGEAGP 785
Query: 300 GYDVQRGPVYEAQRAPSYIPQRGP-GYDLQRGQ-GYDMRRAPSYDPS-RGTGFDGAPRG- 355
GP A+ AP + GP G G G D + +P G D P G
Sbjct: 786 SGPA--GPT-GARGAPGDRGEPGPPGPAGFAGPPGADGQPGAKGEPGDAGAKGDAGPPGP 842
Query: 356 AAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 387
A P G P P+ NV G+ P GS P G
Sbjct: 843 AGPAGP-PGPIGNV--GAPGPKGARGSAGPPG 871
>UNIPROTKB|F1LN37 [details] [associations]
symbol:Col2a1 "Collagen alpha-1(II) chain" species:10116
"Rattus norvegicus" [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
SMART:SM00038 SMART:SM00214 RGD:2375 GO:GO:0005737 GO:GO:0043066
GO:GO:0005615 GO:GO:0003007 GO:GO:0007601 GO:GO:0030199
GO:GO:0007417 GO:GO:0042472 GO:GO:0001894 GO:GO:0007605
GO:GO:0071773 InterPro:IPR008160 Pfam:PF01391 GO:GO:0071599
GO:GO:0005604 GO:GO:0001502 GO:GO:0060021 GO:GO:0002062
GO:GO:0010468 GO:GO:0060272 GO:GO:0006029 GO:GO:0001958
GO:GO:0060351 GO:GO:0005201 GO:GO:0005585 GO:GO:0060174
GO:GO:0030903 IPI:IPI00388575 Ensembl:ENSRNOT00000037840
ArrayExpress:F1LN37 Uniprot:F1LN37
Length = 1487
Score = 125 (49.1 bits), Expect = 0.00037, P = 0.00037
Identities = 87/281 (30%), Positives = 99/281 (35%)
Query: 126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTS-A 183
ADG G E G G + G P GP G GA GP +T
Sbjct: 841 ADGQPGAKGDQGEAGQKG-DAGAPGPQGPSGAPGPQGPTGVTGPKGARGAQGPPGATGFP 899
Query: 184 YAATQSGTPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDP-TKGPSYDPA-KGPGYDP- 239
AA + G P P GP GP G D K D G + DP +GP P
Sbjct: 900 GAAGRVGPPGSNGNPGPAGP-----PGPAGKDGPKGARGDTGAPGRAGDPGLQGPAGAPG 954
Query: 240 TKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRGPGYET- 295
KG PG D GS D GP +G + QRG+ G QRG + GP E
Sbjct: 955 EKGEPGDDGPSGS--DGPPGP-----QGLA--GQRGIVGLPGQRGERGFPGLPGPSGEPG 1005
Query: 296 -QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRR--APSYDPSRG-TGFDG 351
Q PG RGP P PG + G R A RG TG G
Sbjct: 1006 KQGAPGASGDRGPPGPVGPPGLTGPAGEPGREGSPGADGPPGRDGAAGVKGDRGETGALG 1065
Query: 352 APRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG-GNPA 391
AP P G P P P G +G+ P G PA
Sbjct: 1066 APGAPGPPGS-PGPAG--PTGKQGDRGEAGAQGPMGPSGPA 1103
Score = 125 (49.1 bits), Expect = 0.00037, P = 0.00037
Identities = 89/296 (30%), Positives = 110/296 (37%)
Query: 120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSAT---TAGVVGAG 176
P DR D GA G + G P G G P GPP A + G
Sbjct: 132 PRGDR-GDKGERGAPGPRGRD--GEP-GTPGNPGPPGPPGPPGPPGLGGGNFAAQMAGGF 187
Query: 177 PNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDP-TKGPSYDPAK 233
+ A G PM PRGP G + GP G+ + +P GP P
Sbjct: 188 DEKAGGAQMGVMQG-PMGPMG--PRGPPGPAGAPGPQGFQGNPGEPGEPGVSGPM-GPRG 243
Query: 234 GPGYDPTKGPGYDAQKGSNYDA-QRG-PNYDIHRGPSYDPQRGL-GYDMQRG-PNYDMQR 289
PG P PG D + G A +RG P RG P GL G RG P D +
Sbjct: 244 PPG--PAGKPGDDGEAGKPGKAGERGLPGPQGARGFPGTP--GLPGVKGHRGYPGLDGAK 299
Query: 290 G----PGYETQR-VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS---- 340
G PG + + PG + GP+ + P + GP +G D + P+
Sbjct: 300 GEAGAPGVKGESGSPGENGSPGPM-GPRGLPGERGRTGPA-GAAGARGNDGQPGPAGPPG 357
Query: 341 -YDPSRGTGFDGAP--RG-AAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNP 390
P+ G GF GAP +G A P G P GS P GS P G GNP
Sbjct: 358 PVGPAGGPGFPGAPGAKGEAGPTGARGP---EGAQGSRGEPGNPGSPGPAGASGNP 410
>UNIPROTKB|E1BLD0 [details] [associations]
symbol:LOC100847165 "Uncharacterized protein" species:9913
"Bos taurus" [GO:0008270 "zinc ion binding" evidence=IEA]
[GO:0005622 "intracellular" evidence=IEA] [GO:0003676 "nucleic acid
binding" evidence=IEA] Pfam:PF00096 InterPro:IPR007087
InterPro:IPR013087 InterPro:IPR015880 PROSITE:PS00028
PROSITE:PS50157 SMART:SM00355 GO:GO:0008270 GO:GO:0003676
GO:GO:0005622 Gene3D:3.30.160.60 GeneTree:ENSGT00700000104520
OMA:SRYESQN EMBL:DAAA02057905 IPI:IPI00717370
Ensembl:ENSBTAT00000061583 Uniprot:E1BLD0
Length = 540
Score = 120 (47.3 bits), Expect = 0.00037, P = 0.00037
Identities = 40/160 (25%), Positives = 70/160 (43%)
Query: 117 MNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYED---GYGV-PQGHGPPPSATTAGV 172
M +P+ +GS G +E E + G YE +G+ PQ G P +
Sbjct: 15 MQSPDEMGSPEGSLKGNMSENEEEEISQQEGTGDYEVEEIAFGLEPQSPGFGPQSPEFEP 74
Query: 173 VGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPA 232
+ + + G + PR P + S+ P ++ ++P Y+P + P Y+P
Sbjct: 75 QSPRFEPESPGFESRSPGFVPPSPEFAPRSPESD-SQSPDFEP-QSPRYEP-QSPGYEP- 130
Query: 233 KGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQ 272
K PGY+P + PGY+ K Y+ Q P + + P ++ +
Sbjct: 131 KSPGYEP-RSPGYEP-KSPGYEPQN-PEFKT-QSPEFEAE 166
>UNIPROTKB|F1NI72 [details] [associations]
symbol:COL3A1 "Collagen alpha-1(III) chain" species:9031
"Gallus gallus" [GO:0001568 "blood vessel development"
evidence=IEA] [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] [GO:0005586 "collagen type III"
evidence=IEA] [GO:0005615 "extracellular space" evidence=IEA]
[GO:0007160 "cell-matrix adhesion" evidence=IEA] [GO:0007179
"transforming growth factor beta receptor signaling pathway"
evidence=IEA] [GO:0007229 "integrin-mediated signaling pathway"
evidence=IEA] [GO:0007507 "heart development" evidence=IEA]
[GO:0009314 "response to radiation" evidence=IEA] [GO:0018149
"peptide cross-linking" evidence=IEA] [GO:0030199 "collagen fibril
organization" evidence=IEA] [GO:0032964 "collagen biosynthetic
process" evidence=IEA] [GO:0034097 "response to cytokine stimulus"
evidence=IEA] [GO:0042060 "wound healing" evidence=IEA] [GO:0043206
"extracellular fibril organization" evidence=IEA] [GO:0043588 "skin
development" evidence=IEA] [GO:0046332 "SMAD binding" evidence=IEA]
[GO:0048407 "platelet-derived growth factor binding" evidence=IEA]
[GO:0048565 "digestive tract development" evidence=IEA] [GO:0050777
"negative regulation of immune response" evidence=IEA] [GO:0071230
"cellular response to amino acid stimulus" evidence=IEA]
InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
SMART:SM00038 SMART:SM00214 GO:GO:0005615 GO:GO:0034097
GO:GO:0030199 GO:GO:0007179 GO:GO:0007229 GO:GO:0007160
InterPro:IPR008160 Pfam:PF01391 GO:GO:0042060 GO:GO:0050777
GO:GO:0009314 GO:GO:0018149 GO:GO:0071230 GO:GO:0043206
GO:GO:0005201 GeneTree:ENSGT00660000095287 GO:GO:0005586
EMBL:AADN02034558 EMBL:AADN02034559 IPI:IPI00589264
Ensembl:ENSGALT00000004033 OMA:ETCLSAN ArrayExpress:F1NI72
Uniprot:F1NI72
Length = 1498
Score = 125 (49.1 bits), Expect = 0.00037, P = 0.00037
Identities = 78/276 (28%), Positives = 97/276 (35%)
Query: 132 GATGN-SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSG 190
G G +N G P G G P GPP G G P + + G
Sbjct: 464 GTPGEPGKNGAKGDP-GPKGERGENGTPGAPGPPGEEGKRGANGE-PGQNGVPGTPGERG 521
Query: 191 TPMRAAYDIPRGPGYEASKGP-GYDASKAPSYDPTKGPSYDPAK--GPGYDPTKG-PGYD 246
+P +P G KGP G S P P+ GP+ D + GPG +G PG
Sbjct: 522 SP--GFRGLPGSNGLPGEKGPAGERGSPGPP-GPS-GPAGDRGQDGGPGLPGMRGLPGIP 577
Query: 247 AQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYD--MQRGPGYE-TQRVPGYDV 303
GS D + GP G +P R G GP + PG + + PG +
Sbjct: 578 GSPGS--DGKPGPP-----GNQGEPGRS-GPPGPAGPRGQPGVMGFPGPKGNEGAPGKNG 629
Query: 304 QRGPVYEAQRAPSYIPQRG----PGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPH 359
+RGP P + G PG G D R P PS G G P G P
Sbjct: 630 ERGPG-GPPGTPGPAGKNGDVGLPGPPGPAGPAGD-RGEPG--PSGSPGLQGLPGGPGPA 685
Query: 360 GQVPPPLNNVPYGSATPPARSGSGQPRGGN--PARR 393
G+ P P G P G P+G N P R
Sbjct: 686 GENGKPGEPGPKGDIGGPGFPG---PKGENGIPGER 718
Score = 123 (48.4 bits), Expect = 0.00062, P = 0.00062
Identities = 84/275 (30%), Positives = 104/275 (37%)
Query: 142 SGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPR 201
+G P G G+P G P G+ G P TS + A G P +
Sbjct: 424 AGSP-GNKGEMGPSGIPGAPGLPGGR---GLPGP-PGTSGNPGAKGTPGEPGKNGAKGDP 478
Query: 202 GP-GYEASKG-PGYDASKAPSYDPTKGPSYDPAKG--PGYDPTKG-PGYDAQKGSN-YDA 255
GP G G PG A P + +G + +P + PG +G PG+ GSN
Sbjct: 479 GPKGERGENGTPG--APGPPGEEGKRGANGEPGQNGVPGTPGERGSPGFRGLPGSNGLPG 536
Query: 256 QRGPNYDIHR----GPSYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVPGYDVQRGPVYE 310
++GP + GPS P G D GP RG PG PG D + GP
Sbjct: 537 EKGPAGERGSPGPPGPS-GPAGDRGQD--GGPGLPGMRGLPGIPGS--PGSDGKPGPPGN 591
Query: 311 AQRAPSYIPQRGP-GYDLQRG-QGYDMRR----APSYDPSRGTGFD-GAPRGAAPHGQV- 362
Q P GP G Q G G+ + AP + RG G G P A +G V
Sbjct: 592 -QGEPGRSGPPGPAGPRGQPGVMGFPGPKGNEGAPGKNGERGPGGPPGTPGPAGKNGDVG 650
Query: 363 -P-PPLNNVPYGSATPPARSGS----GQPRGGNPA 391
P PP P G P SGS G P G PA
Sbjct: 651 LPGPPGPAGPAGDRGEPGPSGSPGLQGLPGGPGPA 685
>UNIPROTKB|O43186 [details] [associations]
symbol:CRX "Cone-rod homeobox protein" species:9606 "Homo
sapiens" [GO:0043565 "sequence-specific DNA binding" evidence=IEA]
[GO:0006351 "transcription, DNA-dependent" evidence=IEA]
[GO:0007601 "visual perception" evidence=IEA] [GO:0050896 "response
to stimulus" evidence=IEA] [GO:0003682 "chromatin binding"
evidence=IEA] [GO:0003700 "sequence-specific DNA binding
transcription factor activity" evidence=IEA] [GO:0005667
"transcription factor complex" evidence=IEA] [GO:0045944 "positive
regulation of transcription from RNA polymerase II promoter"
evidence=IEA] [GO:0060041 "retina development in camera-type eye"
evidence=IEA] [GO:0043522 "leucine zipper domain binding"
evidence=IPI] [GO:0009887 "organ morphogenesis" evidence=TAS]
InterPro:IPR001356 InterPro:IPR009057 InterPro:IPR013851
InterPro:IPR017970 Pfam:PF00046 Pfam:PF03529 PROSITE:PS00027
PROSITE:PS50071 SMART:SM00389 GO:GO:0007601 GO:GO:0043565
GO:GO:0045944 GO:GO:0003700 GO:GO:0006351 GO:GO:0003682
Orphanet:1872 Orphanet:791 GO:GO:0050896 Gene3D:1.10.10.60
SUPFAM:SSF46689 GO:GO:0005667 GO:GO:0009887 GO:GO:0060041
Orphanet:65 MIM:268000 CTD:1406 eggNOG:NOG324074
HOGENOM:HOG000082677 HOVERGEN:HBG004028 KO:K09337 OMA:QTKARPA
OrthoDB:EOG4NKBWG EMBL:AF024711 EMBL:BT007364 EMBL:AC008745
EMBL:BC016664 EMBL:BC053672 IPI:IPI00011226 RefSeq:NP_000545.1
UniGene:Hs.617342 UniGene:Hs.633434 UniGene:Hs.639114
ProteinModelPortal:O43186 SMR:O43186 IntAct:O43186
MINT:MINT-1442706 STRING:O43186 PhosphoSite:O43186 PRIDE:O43186
DNASU:1406 Ensembl:ENST00000221996 Ensembl:ENST00000539067
Ensembl:ENST00000556900 Ensembl:ENST00000557738 GeneID:1406
KEGG:hsa:1406 UCSC:uc002phq.4 GeneCards:GC19P048327 HGNC:HGNC:2383
HPA:HPA036762 HPA:HPA036763 MIM:120970 MIM:602225 MIM:613829
neXtProt:NX_O43186 PharmGKB:PA26903 InParanoid:O43186
PhylomeDB:O43186 ChiTaRS:CRX GenomeRNAi:1406 NextBio:5749
ArrayExpress:O43186 Bgee:O43186 CleanEx:HS_CRX
Genevestigator:O43186 GermOnline:ENSG00000105392 Uniprot:O43186
Length = 299
Score = 116 (45.9 bits), Expect = 0.00037, P = 0.00037
Identities = 29/98 (29%), Positives = 42/98 (42%)
Query: 158 PQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASK 217
P P P A AG+V +GP+ +++ YA T + P A P G +S G D
Sbjct: 165 PASESPLPEAQRAGLVASGPSLTSAPYAMTYA--PASAFCSSPSAYGSPSSYFSGLDPYL 222
Query: 218 APSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDA 255
+P GP+ P GP P+ + G +Y A
Sbjct: 223 SPMVPQLGGPALSPLSGPSVGPSLAQSPTSLSGQSYGA 260
>ZFIN|ZDB-GENE-030131-4487 [details] [associations]
symbol:sec24c "SEC24 family, member C (S.
cerevisiae)" species:7955 "Danio rerio" [GO:0030127 "COPII vesicle
coat" evidence=IEA] [GO:0006886 "intracellular protein transport"
evidence=IEA] [GO:0006888 "ER to Golgi vesicle-mediated transport"
evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
[GO:0006810 "transport" evidence=IEA] [GO:0015031 "protein
transport" evidence=IEA] InterPro:IPR006895 InterPro:IPR006896
InterPro:IPR006900 Pfam:PF04810 Pfam:PF04811 Pfam:PF04815
ZFIN:ZDB-GENE-030131-4487 GO:GO:0006886 GO:GO:0008270
InterPro:IPR007123 Pfam:PF00626 GO:GO:0006888 GO:GO:0030127
SUPFAM:SSF82919 InterPro:IPR012990 Pfam:PF08033 SUPFAM:SSF81811
GeneTree:ENSGT00590000082962 EMBL:CU469520 EMBL:CU694198
IPI:IPI00972073 Ensembl:ENSDART00000085476 ArrayExpress:F1R9P2
Bgee:F1R9P2 Uniprot:F1R9P2
Length = 1241
Score = 124 (48.7 bits), Expect = 0.00038, P = 0.00038
Identities = 82/291 (28%), Positives = 110/291 (37%)
Query: 131 GGATGNSENETSGRPV--GQNAYED-GYGVPQGHGPPPS-ATTAGVVGAGPNTSTSAYAA 186
G G E TSG P G +Y G G Q +GPPP A G + + P+T + +
Sbjct: 70 GPPQGMREPPTSGTPPVSGAQSYSQFGQGETQ-NGPPPMVAPPQGTLVSQPHTPNAVSLS 128
Query: 187 TQSGTPMRAAYDIPR-GPGYEASKGPGYDA-SKAPSYDPTKGPSYDP---AKGP---GYD 238
+ P + P G ++ S APS P GP Y P A+ P Y
Sbjct: 129 GPTQPPYGQQFGSPPIGMQQMTNQMASMQVGSTAPS--PA-GPGYAPPSTAQAPISAAYT 185
Query: 239 PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDM---QRGPGYE 294
P+ P + S+ +Q P + + P P G Q+ PN GP +
Sbjct: 186 PSAPPTFPPT--SSAPSQPPPTEAVAQAPP-QPYYGAPPPAQQPFPNAVSTFSSAGPT-Q 241
Query: 295 TQRVPGYDVQRGPVYEAQRAPSY--IPQRGP----GYDLQRGQGYDMRRAPSYDPSRGTG 348
Q P Q P A P + P GP G L Q +RAP G
Sbjct: 242 PQAPPSVSQQSFPQAPAVSQPPFSTAPPPGPSQSYGGPLPPTQP-SFQRAPLPTSQPGV- 299
Query: 349 FDGAPRGAAPHGQVP------PPLNNV-PYGSATPPARSGSGQPRGGNPAR 392
F G P + H Q+P PP++ PY S PP + S P+ G P R
Sbjct: 300 FPGGPPPTSTHSQLPGPMPPQPPVSQPSPYYSEPPPT-TASFPPQVGAPPR 349
>UNIPROTKB|P15941 [details] [associations]
symbol:MUC1 "Mucin-1" species:9606 "Homo sapiens"
[GO:0005576 "extracellular region" evidence=IEA] [GO:0005737
"cytoplasm" evidence=IBA] [GO:0009986 "cell surface" evidence=IBA]
[GO:0016324 "apical plasma membrane" evidence=IBA] [GO:0005887
"integral to plasma membrane" evidence=TAS] [GO:0005796 "Golgi
lumen" evidence=TAS] [GO:0016266 "O-glycan processing"
evidence=TAS] [GO:0043687 "post-translational protein modification"
evidence=TAS] [GO:0044267 "cellular protein metabolic process"
evidence=TAS] [GO:0005515 "protein binding" evidence=IPI]
[GO:0002039 "p53 binding" evidence=IPI] [GO:0006977 "DNA damage
response, signal transduction by p53 class mediator resulting in
cell cycle arrest" evidence=IDA] [GO:0000790 "nuclear chromatin"
evidence=IDA] [GO:0090240 "positive regulation of histone H4
acetylation" evidence=IDA] [GO:0000978 "RNA polymerase II core
promoter proximal region sequence-specific DNA binding"
evidence=IDA] [GO:0043618 "regulation of transcription from RNA
polymerase II promoter in response to stress" evidence=IDA]
[GO:0006978 "DNA damage response, signal transduction by p53 class
mediator resulting in transcription of p21 class mediator"
evidence=IDA] [GO:0010944 "negative regulation of transcription by
competitive promoter binding" evidence=IDA] [GO:0003712
"transcription cofactor activity" evidence=IDA] [GO:0036003
"positive regulation of transcription from RNA polymerase II
promoter in response to stress" evidence=IDA] [GO:0043066 "negative
regulation of apoptotic process" evidence=IDA] Reactome:REACT_17015
PANTHER:PTHR10006 GO:GO:0043066 GO:GO:0005576 GO:GO:0009986
GO:GO:0005887 GO:GO:0006977 GO:GO:0016324 GO:GO:0000978
GO:GO:0000790 GO:GO:0003712 GO:GO:0043687 InterPro:IPR000082
Pfam:PF01390 SMART:SM00200 PROSITE:PS50024 GO:GO:0005796
EMBL:CH471121 GO:GO:0010944 GO:GO:0090240 PDB:2FO4 PDBsum:2FO4
GO:GO:0016266 GO:GO:0006978 EMBL:AL713999 GO:GO:0036003
MEROPS:S71.001 CTD:4582 eggNOG:NOG77744 KO:K06568
InterPro:IPR023217 EMBL:J05582 EMBL:M32738 EMBL:M32739 EMBL:M34089
EMBL:M34088 EMBL:J05581 EMBL:M61170 EMBL:X52229 EMBL:X52228
EMBL:M35093 EMBL:X80761 EMBL:U60259 EMBL:U60260 EMBL:U60261
EMBL:AF125525 EMBL:AF348143 EMBL:AY327582 EMBL:AY463543
EMBL:BC120974 EMBL:Z17324 EMBL:Z17325 EMBL:M31823 EMBL:S81781
EMBL:S81736 EMBL:M21868 IPI:IPI00013955 IPI:IPI00218163
IPI:IPI00218164 IPI:IPI00218165 IPI:IPI00218166 IPI:IPI00218168
IPI:IPI00218169 IPI:IPI00607673 IPI:IPI00902840 IPI:IPI00978078
PIR:A35175 RefSeq:NP_001018016.1 RefSeq:NP_001018017.1
RefSeq:NP_001037855.1 RefSeq:NP_001037856.1 RefSeq:NP_001037857.1
RefSeq:NP_001037858.1 RefSeq:NP_001191214.1 RefSeq:NP_001191215.1
RefSeq:NP_001191216.1 RefSeq:NP_001191217.1 RefSeq:NP_001191218.1
RefSeq:NP_001191219.1 RefSeq:NP_001191220.1 RefSeq:NP_001191221.1
RefSeq:NP_001191222.1 RefSeq:NP_001191223.1 RefSeq:NP_001191224.1
RefSeq:NP_001191225.1 RefSeq:NP_001191226.1 RefSeq:NP_002447.4
UniGene:Hs.89603 PDB:2ACM PDBsum:2ACM ProteinModelPortal:P15941
SMR:P15941 IntAct:P15941 MINT:MINT-156679 STRING:P15941
GlycoSuiteDB:P15941 PhosphoSite:P15941 DMDM:296439295 PaxDb:P15941
PRIDE:P15941 DNASU:4582 Ensembl:ENST00000337604
Ensembl:ENST00000343256 Ensembl:ENST00000368389
Ensembl:ENST00000368390 Ensembl:ENST00000368398 GeneID:4582
KEGG:hsa:4582 UCSC:uc001fib.3 GeneCards:GC01M155158 HGNC:HGNC:7508
HPA:CAB000036 HPA:CAB001986 HPA:HPA004179 HPA:HPA007235
HPA:HPA008855 MIM:113720 MIM:158340 neXtProt:NX_P15941
PharmGKB:PA31309 ChiTaRS:MUC1 EvolutionaryTrace:P15941
GenomeRNAi:4582 NextBio:17597 Bgee:P15941 Genevestigator:P15941
GermOnline:ENSG00000185499 Uniprot:P15941
Length = 1255
Score = 124 (48.7 bits), Expect = 0.00039, P = 0.00039
Identities = 65/275 (23%), Positives = 91/275 (33%)
Query: 126 ADGSYGGATGNSENETSGRPVG--QNAYEDGYGVPQGHGPPP-SATTAGV-VGAGPNTST 181
A + GG S + S P +NA V H P S+TT G V P T
Sbjct: 27 ASSTPGGEKETSATQRSSVPSSTEKNAVSMTSSVLSSHSPGSGSSTTQGQDVTLAPATEP 86
Query: 182 SAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK 241
++ +A G + + + R P ++ P +D + AP P G + PA G P
Sbjct: 87 ASGSAATWGQDVTSV-PVTR-PALGSTTPPAHDVTSAPDNKPAPGSTAPPAHGVTSAPDT 144
Query: 242 GPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL--GYDMQRGPNYDMQRGPGY----ET 295
P + + P+ G + P G+ D + P G +T
Sbjct: 145 RPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAPDT 204
Query: 296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG--QGYDMRRAPSYDPSRGTGFDGAP 353
+ PG P + AP P G G D R AP G AP
Sbjct: 205 RPAPGSTAP--PAHGVTSAPDTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHGVTSAP 262
Query: 354 RGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 388
G PP + V T PA + P G
Sbjct: 263 DTRPAPGSTAPPAHGVTSAPDTRPAPGSTAPPAHG 297
>RGD|1308535 [details] [associations]
symbol:Pygo2 "pygopus 2" species:10116 "Rattus norvegicus"
[GO:0001701 "in utero embryonic development" evidence=IEA;ISO]
[GO:0001822 "kidney development" evidence=IEA;ISO] [GO:0002088
"lens development in camera-type eye" evidence=IEA;ISO] [GO:0005634
"nucleus" evidence=IEA;ISO] [GO:0007420 "brain development"
evidence=IEA;ISO] [GO:0008270 "zinc ion binding" evidence=IEA]
[GO:0009791 "post-embryonic development" evidence=IEA;ISO]
[GO:0016055 "Wnt receptor signaling pathway" evidence=ISO]
[GO:0030879 "mammary gland development" evidence=IEA;ISO]
[GO:0033599 "regulation of mammary gland epithelial cell
proliferation" evidence=IEA;ISO] [GO:0042393 "histone binding"
evidence=IEA;ISO] [GO:0048589 "developmental growth"
evidence=IEA;ISO] [GO:0051569 "regulation of histone H3-K4
methylation" evidence=IEA;ISO] [GO:0060021 "palate development"
evidence=IEA;ISO] [GO:0060070 "canonical Wnt receptor signaling
pathway" evidence=IEA;ISO] InterPro:IPR001965 InterPro:IPR019787
Pfam:PF00628 PROSITE:PS50016 SMART:SM00249 RGD:1308535
GO:GO:0005634 GO:GO:0007420 GO:GO:0046872 GO:GO:0008270
GO:GO:0001701 GO:GO:0009791 GO:GO:0001822 Gene3D:3.30.40.10
InterPro:IPR011011 InterPro:IPR013083 SUPFAM:SSF57903 GO:GO:0048589
InterPro:IPR019786 PROSITE:PS01359 GO:GO:0060021 GO:GO:0060070
GO:GO:0030879 GO:GO:0033599 GO:GO:0051569 GO:GO:0002088
EMBL:CH473976 eggNOG:NOG72798 HOGENOM:HOG000001580
HOVERGEN:HBG053774 GeneTree:ENSGT00530000063948 CTD:90780
OMA:PGLVYPC OrthoDB:EOG4QZ7MB EMBL:BC169054 IPI:IPI00368626
RefSeq:NP_001099917.1 UniGene:Rn.24988 STRING:B5DFG8
Ensembl:ENSRNOT00000028052 GeneID:295251 KEGG:rno:295251
UCSC:RGD:1308535 NextBio:639221 Genevestigator:B5DFG8
Uniprot:B5DFG8
Length = 405
Score = 118 (46.6 bits), Expect = 0.00040, P = 0.00040
Identities = 79/294 (26%), Positives = 110/294 (37%)
Query: 117 MNAPNVDRRADGSYGGATGN-SENETSGRP-----VGQNAYEDGYGVPQ--GHGPPPSAT 168
M +P RR + G A + +E P V N +ED +G P+ G GPP
Sbjct: 38 MKSPEKKRRKSNTQGPAYSHLTEFAPPPTPMVDHLVASNPFEDDFGAPKMGGAGPP---- 93
Query: 169 TAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTK-GP 227
+G+ P + Q G A +P G G GP + P + P GP
Sbjct: 94 ---FLGS-P-VPFGGFRV-QGGM----AGQVPPGYGTGGGGGPQPLRRQPPPFPPNPMGP 143
Query: 228 SYD-PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYD-PQRGLGY----DMQR 281
+++ P +GPGY P + +Q ++ G N+ G P G G M +
Sbjct: 144 AFNMPPQGPGYPPPGNMNFPSQP---FNQSLGQNFSPPGGQMIPGPVGGFGPMISPTMGQ 200
Query: 282 GPNYDMQRGPGYETQRVPGYDVQRGPVYE--AQRAPSYIPQRGP--GYDLQ-RGQGYDMR 336
P ++ GP QR GP + Q PS P P G D G G +
Sbjct: 201 PPRGEL--GPPPLPQRFTQPGAPFGPSLQRPGQGLPSLPPNTSPFPGPDPGFPGPGGEDG 258
Query: 337 RAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNP 390
P +P T F P +P V N P + PP+ SG G GG P
Sbjct: 259 GKP-LNPPAPTAFPQEPHSGSPAAAVN---GNQP---SFPPSSSGRG---GGTP 302
>UNIPROTKB|F1LNH3 [details] [associations]
symbol:Col6a2 "Protein Col6a2" species:10116 "Rattus
norvegicus" [GO:0005615 "extracellular space" evidence=IEA]
[GO:0031012 "extracellular matrix" evidence=IEA] [GO:0042383
"sarcolemma" evidence=IEA] [GO:0043234 "protein complex"
evidence=IEA] [GO:0070208 "protein heterotrimerization"
evidence=IEA] InterPro:IPR002035 Pfam:PF00092 PROSITE:PS50234
SMART:SM00327 RGD:1305585 GO:GO:0005615 GO:GO:0043234 GO:GO:0042383
GO:GO:0070208 InterPro:IPR008160 Pfam:PF01391 GO:GO:0031012
GeneTree:ENSGT00530000063022 OMA:RALCNHD IPI:IPI00372839
Ensembl:ENSRNOT00000001695 ArrayExpress:F1LNH3 Uniprot:F1LNH3
Length = 1025
Score = 123 (48.4 bits), Expect = 0.00040, P = 0.00040
Identities = 88/284 (30%), Positives = 99/284 (34%)
Query: 126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AG-PNTSTSA 183
+DG G +N T G Q G P G P S G G AG P
Sbjct: 320 SDGRKGAPGLAGKNGTDG----QKGKLGRIGPPGCKGDPGSRGPDGYPGEAGSPGEQGDQ 375
Query: 184 YAATQSGTPMRAAYDIPRGPGYEASKGPGYDA-SKAPSYDPTKGPSYDPA----KG-PGY 237
A SG P R P PG + SKG Y S AP KG P KG PG
Sbjct: 376 GAKGDSGRPGRRGP--PGNPGDKGSKG--YRGNSGAPGSPGVKGGKGGPGPRGPKGEPGR 431
Query: 238 --DP-TKG-PGYDAQKGSNYD-AQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPG 292
DP TKG PG D KG D GP S + G RGP + PG
Sbjct: 432 RGDPGTKGGPGSDGPKGEKGDPGPEGPRGLAGEIGSKGAKGDRGLPGPRGPQGALGE-PG 490
Query: 293 YETQRV-PGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRA----PSYDPSRGT 347
+ R PG RG Q P P R PG+ +G + P + RG
Sbjct: 491 KQGSRGDPGDAGPRGD--SGQPGPKGDPGR-PGFSYPGPRGTPGEKGEPGPPGPEGGRGD 547
Query: 348 -GFDGAPRGAAPHGQV--P-PPLNNVPYGSATPPARSGSGQPRG 387
G GAP G+ P PP P G P G P G
Sbjct: 548 FGLKGAPGRKGEKGEPADPGPPGEPGPRGPRGIPGPEGEPGPPG 591
>FB|FBgn0003980 [details] [associations]
symbol:Vm26Ab "Vitelline membrane 26Ab" species:7227
"Drosophila melanogaster" [GO:0007304 "chorion-containing eggshell
formation" evidence=IMP] [GO:0007305 "vitelline membrane formation
involved in chorion-containing eggshell formation" evidence=NAS]
[GO:0008316 "structural constituent of vitelline membrane"
evidence=NAS] [GO:0007343 "egg activation" evidence=IMP]
[GO:0060388 "vitelline envelope" evidence=IDA] GO:GO:0005576
EMBL:AE014134 GO:GO:0007304 GO:GO:0007343 eggNOG:NOG295326
PROSITE:PS51137 GeneTree:ENSGT00540000073505 GO:GO:0060388
InterPro:IPR013135 Pfam:PF10542 EMBL:M20936 EMBL:EF441676
PIR:A45943 RefSeq:NP_476784.1 UniGene:Dm.26740 DIP:DIP-19185N
IntAct:P13238 MINT:MINT-1563965 STRING:P13238
EnsemblMetazoa:FBtr0079171 GeneID:33827 KEGG:dme:Dmel_CG9046
CTD:33827 FlyBase:FBgn0003980 InParanoid:P13238 OMA:RAAYGGY
PhylomeDB:P13238 GenomeRNAi:33827 NextBio:785460 Bgee:P13238
GermOnline:CG9046 Uniprot:P13238
Length = 168
Score = 108 (43.1 bits), Expect = 0.00041, P = 0.00041
Identities = 28/92 (30%), Positives = 35/92 (38%)
Query: 166 SATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTK 225
S G GA P + +Y+A + P AY P P Y A P Y A AP+Y
Sbjct: 45 SRAAYGGYGAAP--AAPSYSAPAA--PAAQAYSAPAAPAYSAPAAPAYSAPAAPAYSAPA 100
Query: 226 GPSYDPAKGPGYD-PTKGPGYDAQKGSNYDAQ 256
P+Y P Y P P K + Q
Sbjct: 101 APAYSAPAAPAYSAPASIPSPPCPKNYLFSCQ 132
>UNIPROTKB|I3L781 [details] [associations]
symbol:I3L781 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0005581 "collagen" evidence=IEA] [GO:0005201
"extracellular matrix structural constituent" evidence=IEA]
InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
SMART:SM00038 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
GO:GO:0005201 GeneTree:ENSGT00660000095287
Ensembl:ENSSSCT00000024528 OMA:EVSMPEI Uniprot:I3L781
Length = 1087
Score = 123 (48.4 bits), Expect = 0.00043, P = 0.00043
Identities = 83/271 (30%), Positives = 99/271 (36%)
Query: 132 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPP-----PS-ATTAGVVGAGPNTSTSAYA 185
GA G N + P G + G G GPP P A TAG VG
Sbjct: 518 GAPGPDGNNGAQGPPGPQGVQGGKGEQGPAGPPGFQGLPGPAGTAGEVGKPGERGIPG-- 575
Query: 186 ATQSGTPMRAAYDIPRGP-GYEASKGP-GYDASKAPSYDPTKGPSYDPAKG-PGYDPTKG 242
+ G P A RGP G + GP G S+ PS P GP D KG PG
Sbjct: 576 --EFGLPGPAGPRGERGPPGESGAAGPAGPIGSRGPSGPP--GP--DGNKGEPGV--LGA 627
Query: 243 PGYDAQKG-SNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG-PGYETQRVP- 299
PG G S +RG I G + GL D+ P D RG PG P
Sbjct: 628 PGTAGPSGPSGLPGERGAA-GIPGGKGEKGETGLRGDVG-SPGRDGARGAPGAVGAPGPA 685
Query: 300 GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP--RGA- 356
G + RG A A P+ PG +RG+ P+ G GA RG
Sbjct: 686 GANGDRGEAGPAGPAGPAGPRGSPG---ERGEVGPAGPNGFAGPAGAAGQPGAKGERGTK 742
Query: 357 APHGQVPPPLNNVPYGSATPPARSGSGQPRG 387
P G+ P P G+A P +G P G
Sbjct: 743 GPKGENGPVGPTGPVGAAGPAGPNGPPGPAG 773
>UNIPROTKB|P08123 [details] [associations]
symbol:COL1A2 "Collagen alpha-2(I) chain" species:9606
"Homo sapiens" [GO:0046872 "metal ion binding" evidence=IEA]
[GO:0046332 "SMAD binding" evidence=IEA] [GO:0070208 "protein
heterotrimerization" evidence=IEA] [GO:0071230 "cellular response
to amino acid stimulus" evidence=IEA] [GO:0005584 "collagen type I"
evidence=IDA;IMP;TAS] [GO:0005515 "protein binding" evidence=IPI]
[GO:0001501 "skeletal system development" evidence=IMP] [GO:0042476
"odontogenesis" evidence=NAS] [GO:0008217 "regulation of blood
pressure" evidence=IMP] [GO:0007179 "transforming growth factor
beta receptor signaling pathway" evidence=IDA] [GO:0005201
"extracellular matrix structural constituent" evidence=NAS]
[GO:0042802 "identical protein binding" evidence=IDA] [GO:0005615
"extracellular space" evidence=IDA] [GO:0030674 "protein binding,
bridging" evidence=IMP] [GO:0030199 "collagen fibril organization"
evidence=IMP] [GO:0007266 "Rho protein signal transduction"
evidence=IDA] [GO:0043589 "skin morphogenesis" evidence=IMP]
[GO:0001568 "blood vessel development" evidence=IMP] [GO:0070062
"extracellular vesicular exosome" evidence=IDA] [GO:0048407
"platelet-derived growth factor binding" evidence=IDA] [GO:0005576
"extracellular region" evidence=TAS] [GO:0005788 "endoplasmic
reticulum lumen" evidence=TAS] [GO:0007411 "axon guidance"
evidence=TAS] [GO:0007596 "blood coagulation" evidence=TAS]
[GO:0030168 "platelet activation" evidence=TAS] [GO:0030198
"extracellular matrix organization" evidence=TAS] [GO:0050900
"leukocyte migration" evidence=TAS] [GO:0031012 "extracellular
matrix" evidence=IDA] Reactome:REACT_604 InterPro:IPR000885
Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461 SMART:SM00038
Reactome:REACT_118779 Reactome:REACT_111045 Reactome:REACT_111102
GO:GO:0007411 GO:GO:0005615 GO:GO:0030168 GO:GO:0046872
GO:GO:0050900 GO:GO:0070062 GO:GO:0030199 GO:GO:0030674
GO:GO:0005788 GO:GO:0042802 GO:GO:0001501 GO:GO:0008217
GO:GO:0007179 GO:GO:0007266
Pathway_Interaction_DB:endothelinpathway GO:GO:0070208
InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793 GO:GO:0001568
Pathway_Interaction_DB:il4_2pathway
Pathway_Interaction_DB:smad2_3nuclearpathway
Pathway_Interaction_DB:lymphangiogenesis_pathway GO:GO:0042476
GO:GO:0071230 Orphanet:216812 EMBL:AC002528 GO:GO:0005201
GO:GO:0043589 HOVERGEN:HBG004933 KO:K06236 GO:GO:0005584 MIM:130060
MIM:166200 MIM:166210 MIM:166220 MIM:259420 Orphanet:230857
Orphanet:216796 Orphanet:216804 Orphanet:216820 DrugBank:DB00048
GO:GO:0048407 CTD:1278 OrthoDB:EOG412M65 EMBL:J03464 EMBL:Z74616
EMBL:AF004877 EMBL:BC042586 EMBL:BC054498 EMBL:Y00724 EMBL:X02488
EMBL:AB004317 EMBL:M35391 EMBL:S98904 EMBL:M21671 EMBL:S41099
EMBL:M21353 EMBL:M28985 EMBL:V00503 EMBL:S96821 EMBL:L47668
EMBL:X55525 EMBL:J00114 EMBL:M22816 EMBL:M22817 EMBL:K01078
EMBL:K02568 IPI:IPI00304962 PIR:A28500 RefSeq:NP_000080.2
UniGene:Hs.489142 ProteinModelPortal:P08123 SMR:P08123
DIP:DIP-36079N IntAct:P08123 MINT:MINT-4791958 STRING:P08123
PhosphoSite:P08123 DMDM:296439507 PaxDb:P08123 PRIDE:P08123
Ensembl:ENST00000297268 GeneID:1278 KEGG:hsa:1278 UCSC:uc003ung.1
GeneCards:GC07P094023 H-InvDB:HIX0006854 HGNC:HGNC:2198
HPA:CAB032650 MIM:120160 MIM:225320 neXtProt:NX_P08123
Orphanet:99876 Orphanet:230851 PharmGKB:PA35042 ChEMBL:CHEMBL2685
ChiTaRS:COL1A2 GenomeRNAi:1278 NextBio:5165 ArrayExpress:P08123
Bgee:P08123 Genevestigator:P08123 GermOnline:ENSG00000164692
Uniprot:P08123
Length = 1366
Score = 124 (48.7 bits), Expect = 0.00043, P = 0.00043
Identities = 79/261 (30%), Positives = 99/261 (37%)
Query: 156 GVPQGHGPPPSATTAGVVGA----G-PNTSTSAYAATQSGTPMRAAYDIPRGP-GYEASK 209
G+P G P AG GA G P + S + G P A P GP G E +
Sbjct: 322 GLPGPRGIPGPVGAAGATGARGLVGEPGPAGSKGESGNKGEPGSAGPQGPPGPSGEEGKR 381
Query: 210 GPGYDASKAPSYDPTKGPSYDP-AKG-PGYDPTKGP-GYDAQKGSNYDAQ-RGPNYDIHR 265
GP +A A P G P ++G PG D G G +G++ A RGPN D R
Sbjct: 382 GPNGEAGSAGPPGPP-GLRGSPGSRGLPGADGRAGVMGPPGSRGASGPAGVRGPNGDAGR 440
Query: 266 -G-PSYDPQRGL-GYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEA--QRAPSYI-- 318
G P RGL G GP + GP +PG D + GP+ A + P I
Sbjct: 441 PGEPGLMGPRGLPGSPGNIGPAG--KEGP----VGLPGIDGRPGPIGPAGARGEPGNIGF 494
Query: 319 -----PQRGPGYDLQRGQG--YDMRRAPSYDPSRGT----GFDGAPRGAAPHGQVPPP-L 366
P PG + +G R AP D + G G G G G PP
Sbjct: 495 PGPKGPTGDPGKNGDKGHAGLAGARGAPGPDGNNGAQGPPGPQGVQGGKGEQGPPGPPGF 554
Query: 367 NNVPYGSATPPARSGSGQPRG 387
+P G + P G RG
Sbjct: 555 QGLP-GPSGPAGEVGKPGERG 574
>UNIPROTKB|Q51MB1 [details] [associations]
symbol:RIM9 "pH-response regulator protein palI/RIM9"
species:242507 "Magnaporthe oryzae 70-15" [GO:0003674
"molecular_function" evidence=ND] [GO:0008150 "biological_process"
evidence=ND] Pfam:PF06687 GO:GO:0016021 GO:GO:0005886
eggNOG:NOG12793 EMBL:CM000230 EMBL:CM001237 OrthoDB:EOG4DBXQ8
InterPro:IPR009571 RefSeq:XP_003721159.1 EnsemblFungi:MGG_02630T0
GeneID:2682829 KEGG:mgr:MGG_02630 Uniprot:Q51MB1
Length = 736
Score = 121 (47.7 bits), Expect = 0.00043, P = 0.00043
Identities = 56/176 (31%), Positives = 69/176 (39%)
Query: 116 LMNAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQG----HGPPPSATTAG 171
+ AP+ +R G+ GG G P G+ Y GYG P G +GPP G
Sbjct: 303 VQRAPSAERMNPGARGGYRGRGYG-----PPGRGGY--GYGPPPGSRGGYGPPGR----G 351
Query: 172 VVGAGPNTSTSAYAATQSGTPMRAAYDIP-RG----PGYEASK-GPGYDASKAPSYDPTK 225
G GPN Y G P R Y P RG PGY+ + G +A P P +
Sbjct: 352 GYGPGPN-GRGGY-----GPPPRGGYGPPMRGRAPPPGYQYDRRGSPAEAYGPP---PGQ 402
Query: 226 GPSYDPAKGPGYDPTKGPGYDAQKGSN-------YDAQRGPNYDIHRGPSYDPQRG 274
GP + PG P PGY GS Y Q P+ D+ R S P G
Sbjct: 403 GPYGQRQQSPG--PPSAPGY-GMNGSTPTVSSAAYGHQHTPSDDLPRAESPPPLPG 455
>UNIPROTKB|B0QYK0 [details] [associations]
symbol:EWSR1 "RNA-binding protein EWS" species:9606 "Homo
sapiens" [GO:0000166 "nucleotide binding" evidence=IEA] [GO:0003676
"nucleic acid binding" evidence=IEA] [GO:0005622 "intracellular"
evidence=IEA] [GO:0008270 "zinc ion binding" evidence=IEA]
InterPro:IPR000504 InterPro:IPR001876 InterPro:IPR012677
Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102 PROSITE:PS50199
SMART:SM00360 SMART:SM00547 GO:GO:0000166 GO:GO:0008270
Gene3D:3.30.70.330 GO:GO:0003676 GO:GO:0005622 EMBL:AC002059
EMBL:AL031186 EMBL:AC000026 UniGene:Hs.374477 HGNC:HGNC:3508
HOGENOM:HOG000038010 HOVERGEN:HBG000970 ChiTaRS:EWSR1
IPI:IPI00879242 SMR:B0QYK0 STRING:B0QYK0 Ensembl:ENST00000331029
Uniprot:B0QYK0
Length = 618
Score = 120 (47.3 bits), Expect = 0.00045, P = 0.00045
Identities = 75/279 (26%), Positives = 102/279 (36%)
Query: 128 GSYGGATGNSENETSGRPV-GQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 180
G+YG T S + GQ AY YG P G+ P P A + V G G +T+
Sbjct: 42 GTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTT 101
Query: 181 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPS--YDPTKGPSYDPAKGPGY 237
T+ TQ+ ++AY P P Y + P A P PT+ + G GY
Sbjct: 102 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPTRPQDGNKPTETSQPQSSTG-GY 158
Query: 238 D-PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE 294
+ P+ G G Q +Y G P + PSY P Y + +YD
Sbjct: 159 NQPSLGYG---QSNYSYPQVPGSYPMQPVTAPPSYPPT---SYSSTQPTSYDQSSYSQQN 212
Query: 295 TQRVPG-YDVQRG----PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGF 349
T P Y Q Y Q SY PQ G Y Q Y +++ SY
Sbjct: 213 TYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTG-SYSQAPSQ-YS-QQSSSYGQQSSFRQ 269
Query: 350 DGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 388
D P +GQ + P + + G+ RGG
Sbjct: 270 DH-PSSMGVYGQESGGFSG-PGENRSMSGPDNRGRGRGG 306
>UNIPROTKB|D4A458 [details] [associations]
symbol:Ewsr1 "Protein Ewsr1" species:10116 "Rattus
norvegicus" [GO:0000166 "nucleotide binding" evidence=IEA]
[GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005622
"intracellular" evidence=IEA] [GO:0008270 "zinc ion binding"
evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
InterPro:IPR012677 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102
PROSITE:PS50199 SMART:SM00360 SMART:SM00547 RGD:1307258
GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
GO:GO:0005622 IPI:IPI00767290 Ensembl:ENSRNOT00000057377
ArrayExpress:D4A458 Uniprot:D4A458
Length = 618
Score = 120 (47.3 bits), Expect = 0.00045, P = 0.00045
Identities = 74/278 (26%), Positives = 100/278 (35%)
Query: 128 GSYGGATGNSENET-SGRPVGQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 180
G+YG T S + S GQ AY YG P G+ P P A + V G G +T+
Sbjct: 42 GTYGQPTDVSYTQAQSTATXGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTT 101
Query: 181 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG-YD 238
T+ TQ+ ++AY P P Y + P A P + P G Y+
Sbjct: 102 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPTRPQDGNKPAETSQPQSSTGGYN 159
Query: 239 -PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
P+ G G Q +Y G P + PSY P Y + +YD T
Sbjct: 160 QPSLGYG---QSNYSYPQVPGSYPMQPVTAPPSYPPT---SYSSSQPTSYDQSSYSQQNT 213
Query: 296 QRVPG-YDVQRG----PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFD 350
P Y Q Y Q SY PQ G Y Q Y +++ SY D
Sbjct: 214 YGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTG-SYSQAPSQ-YS-QQSSSYGQQSSFRQD 270
Query: 351 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 388
P +GQ + P + + G+ RGG
Sbjct: 271 H-PSSMGVYGQESGGFSG-PGENRSLSGPDNRGRGRGG 306
>UNIPROTKB|P02461 [details] [associations]
symbol:COL3A1 "Collagen alpha-1(III) chain" species:9606
"Homo sapiens" [GO:0046872 "metal ion binding" evidence=IEA]
[GO:0001501 "skeletal system development" evidence=IEA] [GO:0001568
"blood vessel development" evidence=IEA] [GO:0046332 "SMAD binding"
evidence=IEA] [GO:0048565 "digestive tract development"
evidence=IEA] [GO:0071230 "cellular response to amino acid
stimulus" evidence=IEA] [GO:0031012 "extracellular matrix"
evidence=IMP;TAS] [GO:0005515 "protein binding" evidence=IPI]
[GO:0007160 "cell-matrix adhesion" evidence=IDA] [GO:0018149
"peptide cross-linking" evidence=IDA] [GO:0050777 "negative
regulation of immune response" evidence=IMP] [GO:0005178 "integrin
binding" evidence=NAS;IMP] [GO:0030168 "platelet activation"
evidence=NAS] [GO:0007179 "transforming growth factor beta receptor
signaling pathway" evidence=IDA] [GO:0034097 "response to cytokine
stimulus" evidence=IDA] [GO:0009314 "response to radiation"
evidence=IDA] [GO:0042060 "wound healing" evidence=IDA;NAS]
[GO:0043206 "extracellular fibril organization" evidence=IMP]
[GO:0030199 "collagen fibril organization" evidence=NAS;IMP]
[GO:0007507 "heart development" evidence=IMP] [GO:0032964 "collagen
biosynthetic process" evidence=IMP;TAS] [GO:0005615 "extracellular
space" evidence=IDA;NAS] [GO:0043588 "skin development"
evidence=IMP] [GO:0005201 "extracellular matrix structural
constituent" evidence=IMP] [GO:0007229 "integrin-mediated signaling
pathway" evidence=IMP] [GO:0005586 "collagen type III"
evidence=NAS;IMP] [GO:0048407 "platelet-derived growth factor
binding" evidence=IDA] [GO:0005576 "extracellular region"
evidence=TAS] [GO:0005788 "endoplasmic reticulum lumen"
evidence=TAS] [GO:0007411 "axon guidance" evidence=TAS] [GO:0030198
"extracellular matrix organization" evidence=TAS]
InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
SMART:SM00038 SMART:SM00214 Reactome:REACT_118779
Reactome:REACT_111045 Reactome:REACT_111102 GO:GO:0007411
GO:GO:0043588 GO:GO:0005615 GO:GO:0030168 GO:GO:0007507
GO:GO:0046872 GO:GO:0034097 GO:GO:0030199 GO:GO:0005788
GO:GO:0001501 EMBL:CH471058 GO:GO:0005178 GO:GO:0007179
GO:GO:0007229 GO:GO:0007160
Pathway_Interaction_DB:endothelinpathway InterPro:IPR008160
Pfam:PF01391 eggNOG:NOG12793 GO:GO:0001568 GO:GO:0048565
GO:GO:0050777 GO:GO:0009314 GO:GO:0018149 GO:GO:0032964
GO:GO:0071230 GO:GO:0043206 GO:GO:0005201 HOVERGEN:HBG004933
KO:K06236 DrugBank:DB00048 DrugBank:DB00039 GO:GO:0048407
OrthoDB:EOG4FTW1C EMBL:X14420 EMBL:AY054301 EMBL:AY016295
EMBL:AC066694 EMBL:BC028178 EMBL:M26939 EMBL:X07240 EMBL:X15332
EMBL:S62925 EMBL:S79877 EMBL:M59312 EMBL:M59227 EMBL:M55603
EMBL:X06700 EMBL:X01655 EMBL:X01742 EMBL:M13146 EMBL:M11134
IPI:IPI00021033 IPI:IPI00167087 PIR:S05272 RefSeq:NP_000081.1
UniGene:Hs.443625 PDB:2V53 PDB:3DMW PDB:4AE2 PDB:4AEJ PDB:4AK3
PDBsum:2V53 PDBsum:3DMW PDBsum:4AE2 PDBsum:4AEJ PDBsum:4AK3
ProteinModelPortal:P02461 SMR:P02461 DIP:DIP-57177N IntAct:P02461
STRING:P02461 PhosphoSite:P02461 DMDM:124056490 PaxDb:P02461
PRIDE:P02461 Ensembl:ENST00000304636 GeneID:1281 KEGG:hsa:1281
UCSC:uc002uqj.1 CTD:1281 GeneCards:GC02P189803 HGNC:HGNC:2201
HPA:CAB016766 HPA:HPA007583 MIM:100070 MIM:120180 MIM:130020
MIM:130050 neXtProt:NX_P02461 Orphanet:2500 Orphanet:285
Orphanet:286 Orphanet:86 PharmGKB:PA26716 InParanoid:P02461
OMA:EGSPGHP PhylomeDB:P02461 ChiTaRS:COL3A1
EvolutionaryTrace:P02461 GenomeRNAi:1281 NextBio:5177
ArrayExpress:P02461 Bgee:P02461 Genevestigator:P02461
GermOnline:ENSG00000168542 GO:GO:0005586 Uniprot:P02461
Length = 1466
Score = 124 (48.7 bits), Expect = 0.00047, P = 0.00047
Identities = 81/280 (28%), Positives = 101/280 (36%)
Query: 126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GP----NTS 180
A G G G + P G + + G P GPP AG G GP S
Sbjct: 165 AVGGLAGYPGPAGPPGPPGPPGTSGHPGSPGSPGYQGPPGEPGQAGPSGPPGPPGAIGPS 224
Query: 181 TSAYAATQSGTPMRAAY-DIPRGPGYEASKG-PGYDASKAP-SYDPTKGPSYDPAKGPGY 237
A +SG P R +P PG + G PG+ K +D G + PG
Sbjct: 225 GPAGKDGESGRPGRPGERGLPGPPGIKGPAGIPGFPGMKGHRGFDGRNGEKGETG-APGL 283
Query: 238 DPTKG-PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGP-GYE 294
G PG + G RG + R P G G D RG D Q GP G
Sbjct: 284 KGENGLPGENGAPGPM--GPRGAPGERGR-PGLPGAAGARGNDGARGS--DGQPGPPGPP 338
Query: 295 -TQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAP 353
T PG +G V A S PG QRG+ A + P G +G+P
Sbjct: 339 GTAGFPGSPGAKGEVGPAGSPGS---NGAPG---QRGEPGPQGHAGAQGPPGPPGINGSP 392
Query: 354 RGAAPHGQVPPPLNNVP--YGSATPPARSGS-GQP--RGG 388
G G P + P G+ PP +G+ G P RGG
Sbjct: 393 GGKGEMG--PAGIPGAPGLMGARGPPGPAGANGAPGLRGG 430
Score = 123 (48.4 bits), Expect = 0.00060, P = 0.00060
Identities = 85/284 (29%), Positives = 101/284 (35%)
Query: 126 ADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-G-PNTSTSA 183
A G GGA +N G P G G+P G P + G G+ G P +
Sbjct: 424 APGLRGGAGEPGKNGAKGEP-GPRGERGEAGIP---GVPGAKGEDGKDGSPGEPGANGLP 479
Query: 184 YAATQSGTPMRAAYDIPRGP-GYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG 242
AA + G P + P GP G KGP + AP P GP A PG D G
Sbjct: 480 GAAGERGAP---GFRGPAGPNGIPGEKGPAGERG-APG--PA-GPR-GAAGEPGRDGVPG 531
Query: 243 -PGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDM-QRG----PGYE-T 295
PG GS GP D GP Q G GP+ Q G PG +
Sbjct: 532 GPGMRGMPGS----PGGPGSDGKPGPP-GSQGESGRPGPPGPSGPRGQPGVMGFPGPKGN 586
Query: 296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG-QGYDMRRAPSYD-----PSRGTGF 349
PG + +RG P PQ PG + + G QG P D P G
Sbjct: 587 DGAPGKNGERG----GPGGPG--PQGPPGKNGETGPQGPPGPTGPGGDKGDTGPPGPQGL 640
Query: 350 DGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPARR 393
G P P G+ P P G A P G G+ G P R
Sbjct: 641 QGLPGTGGPPGENGKPGEPGPKGDAGAPGAPG-GKGDAGAPGER 683
>MGI|MGI:88462 [details] [associations]
symbol:Col7a1 "collagen, type VII, alpha 1" species:10090 "Mus
musculus" [GO:0004867 "serine-type endopeptidase inhibitor
activity" evidence=IEA] [GO:0005576 "extracellular region"
evidence=IEA] [GO:0005578 "proteinaceous extracellular matrix"
evidence=IEA] [GO:0005581 "collagen" evidence=IEA] [GO:0005604
"basement membrane" evidence=IDA] [GO:0007155 "cell adhesion"
evidence=IEA] [GO:0010466 "negative regulation of peptidase
activity" evidence=IEA] [GO:0030414 "peptidase inhibitor activity"
evidence=IEA] InterPro:IPR002035 InterPro:IPR002223
InterPro:IPR003961 Pfam:PF00014 Pfam:PF00041 Pfam:PF00092
PRINTS:PR00759 PROSITE:PS50234 PROSITE:PS50279 PROSITE:PS50853
SMART:SM00060 SMART:SM00327 MGI:MGI:88462 Gene3D:2.60.40.10
InterPro:IPR013783 GO:GO:0004867 SUPFAM:SSF49265 GO:GO:0007155
Gene3D:4.10.410.10 InterPro:IPR020901 SUPFAM:SSF57362
PROSITE:PS00280 GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391
eggNOG:NOG12793 GO:GO:0005604 EMBL:AC174646 MEROPS:I02.967 CTD:1294
HOGENOM:HOG000111866 HOVERGEN:HBG051053 KO:K16628 OMA:RRVCTTA
OrthoDB:EOG4J117P EMBL:U32107 EMBL:S63654 IPI:IPI00134652
PIR:A45748 RefSeq:NP_031764.2 UniGene:Mm.6200 HSSP:P12111
ProteinModelPortal:Q63870 SMR:Q63870 STRING:Q63870
PhosphoSite:Q63870 PaxDb:Q63870 PRIDE:Q63870
Ensembl:ENSMUST00000026740 Ensembl:ENSMUST00000112070 GeneID:12836
KEGG:mmu:12836 UCSC:uc009rrh.1 GeneTree:ENSGT00700000104250
InParanoid:Q63870 NextBio:282356 Bgee:Q63870 CleanEx:MM_COL7A1
Genevestigator:Q63870 Uniprot:Q63870
Length = 2944
Score = 127 (49.8 bits), Expect = 0.00047, P = 0.00047
Identities = 86/270 (31%), Positives = 103/270 (38%)
Query: 145 PVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG-- 202
P G + G P GPP S GV G+ P S G P+G
Sbjct: 1289 PPGSTQAKGERGFPGPEGPPGSPGLPGVPGS-PGIKGSTGRPGPRGEQGERGPQGPKGEP 1347
Query: 203 --PGY-EASKGPGYDASKAPSYDPTKGPSYDP-AKGPGYDP-TKGP-GYD--AQKGSNYD 254
PG GPG+ K DP GPS P ++GP DP +GP G + KG D
Sbjct: 1348 GEPGQITGGGGPGFPGKKG---DP--GPSGPPGSRGPVGDPGPRGPPGLPGISVKGDKGD 1402
Query: 255 -AQRGP-NYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEA 311
+RGP I DP GL G GP R PG + ++ G GP
Sbjct: 1403 RGERGPPGPGIGASEQGDP--GLPGLPGSPGPQGPAGR-PGEKGEK--GDCEDGGPGLPG 1457
Query: 312 QRAPSYIPQ-RG-PGYDLQRG-QGYDMRRA-PSYDPSRG----TGFDGAPRGAAPHGQVP 363
Q P P RG PG +G +G P RG G G P GAA H
Sbjct: 1458 QPGPPGEPGLRGAPGMTGPKGDRGLTGTPGEPGVKGERGHPGPVGPQGLP-GAAGH---- 1512
Query: 364 PPLNNVPYGSATPPARSGS-GQP-RGGNPA 391
P + P G P R G G+P R G+PA
Sbjct: 1513 PGVEG-PEGPPGPTGRRGEKGEPGRPGDPA 1541
>UNIPROTKB|B4DR34 [details] [associations]
symbol:SS18 "Protein SSXT" species:9606 "Homo sapiens"
[GO:0000226 "microtubule cytoskeleton organization" evidence=IEA]
[GO:0000902 "cell morphogenesis" evidence=IEA] [GO:0005881
"cytoplasmic microtubule" evidence=IEA] [GO:0007243 "intracellular
protein kinase cascade" evidence=IEA] [GO:0042493 "response to
drug" evidence=IEA] [GO:0048013 "ephrin receptor signaling pathway"
evidence=IEA] GO:GO:0000226 GO:GO:0042493 GO:GO:0007243
GO:GO:0000902 GO:GO:0048013 GO:GO:0005881 HOVERGEN:HBG003892
InterPro:IPR007726 PANTHER:PTHR23107 UniGene:Hs.129261
EMBL:AC091021 HGNC:HGNC:11340 ChiTaRS:SS18 EMBL:AK299082
IPI:IPI01015658 STRING:B4DR34 Ensembl:ENST00000539849
Uniprot:B4DR34
Length = 336
Score = 116 (45.9 bits), Expect = 0.00047, P = 0.00047
Identities = 66/236 (27%), Positives = 88/236 (37%)
Query: 128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQG---H--GPPPSATTAGVVGAGPNT--- 179
G+YG S G + Q Y +PQG H G P G V G +
Sbjct: 106 GNYGPRPNMSMQPNQGPMMHQQPPSQQYNMPQGGGQHYQGQQPPMGMMGQVNQGNHMMGQ 165
Query: 180 -STSAYAATQSGTPMR-AAYDIPRGPGYE-ASKGP--GYDASKAPSYDPTKG---PSYDP 231
Y Q G P + + + G Y +GP G + P G PSY P
Sbjct: 166 RQIPPYRPPQQGPPQQYSGQEDYYGDQYSHGGQGPPEGMNQQYYPDGHNDYGYQQPSY-P 224
Query: 232 AKGPGYD-PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 290
+G YD P + +G N +Q G D ++GP PQ+G Q+ P G
Sbjct: 225 EQG--YDRPYEDSSQHYYEGGN--SQYGQQQDAYQGPP--PQQGYPPQQQQYPGQQGYPG 278
Query: 291 P--GY-ETQRVPGYDVQRGPVYEAQRAPSYIP-QRGPGYD-LQRGQGYDMRRAPSY 341
GY +Q PG P + Q+ Y P Q GP QR GYD + +Y
Sbjct: 279 QQQGYGPSQGGPGPQYPNYPQGQGQQYGGYRPTQPGPPQPPQQRPYGYDQGQYGNY 334
>UNIPROTKB|A8E651 [details] [associations]
symbol:EWSR1 "Uncharacterized protein" species:9913 "Bos
taurus" [GO:0005634 "nucleus" evidence=IEA] [GO:0008270 "zinc ion
binding" evidence=IEA] [GO:0003676 "nucleic acid binding"
evidence=IEA] [GO:0000166 "nucleotide binding" evidence=IEA]
InterPro:IPR000504 InterPro:IPR001876 InterPro:IPR012677
Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102 PROSITE:PS50199
SMART:SM00360 SMART:SM00547 GO:GO:0005634 GO:GO:0000166
GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676 eggNOG:NOG240581
GeneTree:ENSGT00530000063105 CTD:2130 HOGENOM:HOG000038010
HOVERGEN:HBG000970 KO:K13209 OMA:EGTSTGY OrthoDB:EOG42NJ15
EMBL:DAAA02045602 EMBL:BC153844 IPI:IPI00871084
RefSeq:NP_001103270.1 UniGene:Bt.33949 SMR:A8E651 STRING:A8E651
Ensembl:ENSBTAT00000023612 GeneID:534073 KEGG:bta:534073
InParanoid:A8E651 NextBio:20876260 Uniprot:A8E651
Length = 655
Score = 120 (47.3 bits), Expect = 0.00048, P = 0.00048
Identities = 73/278 (26%), Positives = 99/278 (35%)
Query: 128 GSYGGATGNSENETSGRPV-GQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 180
G+YG T S + GQ AY YG P G+ P P A + V G G +T+
Sbjct: 42 GTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYSTPAAPQAYSQPVQGYGTGAYDTT 101
Query: 181 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG-YD 238
T+ TQ+ ++AY P P Y + P A P + P G Y+
Sbjct: 102 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPARPQDGNKPAETSQPQSSTGGYN 159
Query: 239 -PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
P+ G G Q +Y G P + PSY P Y + +YD T
Sbjct: 160 QPSLGYG---QSNYSYPQVPGSYPMQPVSAPPSYPPT---SYSSTQPTSYDQSSYSQQNT 213
Query: 296 QRVPG-YDVQRG----PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFD 350
P Y Q Y Q SY PQ G Y Q Y +++ SY D
Sbjct: 214 YGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTG-SYSQAPSQ-YS-QQSSSYGQQSSFRQD 270
Query: 351 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 388
P +GQ + P + + G+ RGG
Sbjct: 271 H-PSSMGVYGQESGGFSG-PGENRSMSGPDNRGRGRGG 306
>UNIPROTKB|Q01844 [details] [associations]
symbol:EWSR1 "RNA-binding protein EWS" species:9606 "Homo
sapiens" [GO:0000166 "nucleotide binding" evidence=IEA] [GO:0008270
"zinc ion binding" evidence=IEA] [GO:0003723 "RNA binding"
evidence=IEA] [GO:0005516 "calmodulin binding" evidence=IEA]
[GO:0006351 "transcription, DNA-dependent" evidence=IEA]
[GO:0006355 "regulation of transcription, DNA-dependent"
evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0005737
"cytoplasm" evidence=IEA] [GO:0005886 "plasma membrane"
evidence=IEA] [GO:0005515 "protein binding" evidence=IPI]
InterPro:IPR000504 InterPro:IPR001876 InterPro:IPR012677
Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50096 PROSITE:PS50102
PROSITE:PS50199 SMART:SM00360 SMART:SM00547 GO:GO:0005886
GO:GO:0005634 GO:GO:0005737 GO:GO:0006355 GO:GO:0000166
GO:GO:0046872 EMBL:CH471095 GO:GO:0008270 Gene3D:3.30.70.330
GO:GO:0006351 GO:GO:0003723 EMBL:AC002059 MIM:612160 Orphanet:97338
Pathway_Interaction_DB:bard1pathway eggNOG:NOG240581 EMBL:AL031186
MIM:612219 Orphanet:319 EMBL:X66899 EMBL:X72990 EMBL:X72991
EMBL:X72992 EMBL:X72993 EMBL:X72994 EMBL:X72995 EMBL:X72996
EMBL:X72997 EMBL:X72998 EMBL:X72999 EMBL:X73000 EMBL:X73001
EMBL:X73002 EMBL:X73003 EMBL:X73004 EMBL:Y07848 EMBL:CR456490
EMBL:AK056309 EMBL:AK056681 EMBL:AC000026 EMBL:BC000527
EMBL:BC004817 EMBL:BC011048 EMBL:BC072442 EMBL:Y08806 EMBL:AB016435
IPI:IPI00065554 IPI:IPI00293254 IPI:IPI00335961 IPI:IPI00872855
IPI:IPI00879259 PIR:A49358 RefSeq:NP_001156757.1
RefSeq:NP_001156759.1 RefSeq:NP_005234.1 RefSeq:NP_053733.2
UniGene:Hs.374477 PDB:2CPE PDBsum:2CPE ProteinModelPortal:Q01844
SMR:Q01844 IntAct:Q01844 MINT:MINT-2858561 STRING:Q01844
PhosphoSite:Q01844 DMDM:544261 PaxDb:Q01844 PRIDE:Q01844 DNASU:2130
Ensembl:ENST00000332035 Ensembl:ENST00000333395
Ensembl:ENST00000397938 Ensembl:ENST00000406548
Ensembl:ENST00000414183 GeneID:2130 KEGG:hsa:2130 UCSC:uc003aet.3
CTD:2130 GeneCards:GC22P029663 HGNC:HGNC:3508 HPA:CAB004230
MIM:133450 neXtProt:NX_Q01844 Orphanet:83469 PharmGKB:PA27921
HOGENOM:HOG000038010 HOVERGEN:HBG000970 KO:K13209 OMA:EGTSTGY
OrthoDB:EOG42NJ15 PhylomeDB:Q01844 ChiTaRS:EWSR1
EvolutionaryTrace:Q01844 GenomeRNAi:2130 NextBio:8605
ArrayExpress:Q01844 Bgee:Q01844 CleanEx:HS_EWSR1
Genevestigator:Q01844 GermOnline:ENSG00000182944 Uniprot:Q01844
Length = 656
Score = 120 (47.3 bits), Expect = 0.00048, P = 0.00048
Identities = 75/279 (26%), Positives = 102/279 (36%)
Query: 128 GSYGGATGNSENETSGRPV-GQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 180
G+YG T S + GQ AY YG P G+ P P A + V G G +T+
Sbjct: 42 GTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTT 101
Query: 181 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPS--YDPTKGPSYDPAKGPGY 237
T+ TQ+ ++AY P P Y + P A P PT+ + G GY
Sbjct: 102 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPTRPQDGNKPTETSQPQSSTG-GY 158
Query: 238 D-PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYE 294
+ P+ G G Q +Y G P + PSY P Y + +YD
Sbjct: 159 NQPSLGYG---QSNYSYPQVPGSYPMQPVTAPPSYPPT---SYSSTQPTSYDQSSYSQQN 212
Query: 295 TQRVPG-YDVQRG----PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGF 349
T P Y Q Y Q SY PQ G Y Q Y +++ SY
Sbjct: 213 TYGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTG-SYSQAPSQ-YS-QQSSSYGQQSSFRQ 269
Query: 350 DGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 388
D P +GQ + P + + G+ RGG
Sbjct: 270 DH-PSSMGVYGQESGGFSG-PGENRSMSGPDNRGRGRGG 306
>UNIPROTKB|F1LN98 [details] [associations]
symbol:Ewsr1 "Protein Ewsr1" species:10116 "Rattus
norvegicus" [GO:0000166 "nucleotide binding" evidence=IEA]
[GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0005622
"intracellular" evidence=IEA] [GO:0008270 "zinc ion binding"
evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
InterPro:IPR012677 Pfam:PF00641 PROSITE:PS01358 PROSITE:PS50102
PROSITE:PS50199 SMART:SM00360 SMART:SM00547 RGD:1307258
GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
GO:GO:0005622 GeneTree:ENSGT00530000063105 IPI:IPI00364603
Ensembl:ENSRNOT00000012634 ArrayExpress:F1LN98 Uniprot:F1LN98
Length = 656
Score = 120 (47.3 bits), Expect = 0.00048, P = 0.00048
Identities = 74/278 (26%), Positives = 100/278 (35%)
Query: 128 GSYGGATGNSENET-SGRPVGQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 180
G+YG T S + S GQ AY YG P G+ P P A + V G G +T+
Sbjct: 42 GTYGQPTDVSYTQAQSTATXGQTAYATSYGQPPTGYTTPTAPQAYSQPVQGYGTGAYDTT 101
Query: 181 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG-YD 238
T+ TQ+ ++AY P P Y + P A P + P G Y+
Sbjct: 102 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPTRPQDGNKPAETSQPQSSTGGYN 159
Query: 239 -PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
P+ G G Q +Y G P + PSY P Y + +YD T
Sbjct: 160 QPSLGYG---QSNYSYPQVPGSYPMQPVTAPPSYPPT---SYSSSQPTSYDQSSYSQQNT 213
Query: 296 QRVPG-YDVQRG----PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFD 350
P Y Q Y Q SY PQ G Y Q Y +++ SY D
Sbjct: 214 YGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTG-SYSQAPSQ-YS-QQSSSYGQQSSFRQD 270
Query: 351 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 388
P +GQ + P + + G+ RGG
Sbjct: 271 H-PSSMGVYGQESGGFSG-PGENRSLSGPDNRGRGRGG 306
>UNIPROTKB|I3LNI2 [details] [associations]
symbol:TFG "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0043123 "positive regulation of I-kappaB
kinase/NF-kappaB cascade" evidence=IEA] [GO:0042802 "identical
protein binding" evidence=IEA] [GO:0004871 "signal transducer
activity" evidence=IEA] GO:GO:0043123 GO:GO:0004871 OMA:YTTQTSQ
GeneTree:ENSGT00510000047809 EMBL:CU928320 EMBL:AEMK01189642
Ensembl:ENSSSCT00000026186 Uniprot:I3LNI2
Length = 340
Score = 116 (45.9 bits), Expect = 0.00048, P = 0.00048
Identities = 76/301 (25%), Positives = 114/301 (37%)
Query: 106 ATEVEKLRAELMNAPN-VDRRAD-----GSYGGATGNSENET-SGRPVGQNAYEDGYGVP 158
+++V+ LR EL+ N V+R D G G +T +EN+T GR + A D G
Sbjct: 38 SSQVKYLRRELIELRNKVNRLLDSLEPPGEPGPSTNITENDTVDGREE-KPAASDSSGKQ 96
Query: 159 QGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKA 218
S + + + + +A G P P + S P AS +
Sbjct: 97 STQVMAASMSAFDPLKNQDEINKNVMSAF--GLTDDQVSGPPSAPAEDRSGTPDSIASSS 154
Query: 219 PSYDPTKGPSYDPAKGPGYDPTKGPGY-DAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGY 277
+ P P P + P G + Q Y Q G + P PQ+ G
Sbjct: 155 SAAHP---PGVQPQQPPYTGALTQAGQSEGQMYQQYPQQAGYGTQQPQAPPQPPQQS-GS 210
Query: 278 DMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYI--PQRGPGYDLQRGQGYDM 335
+ +G Y Q GP + Q+ GY Q P +A AP++ PQ+ P Q+ Q
Sbjct: 211 SLSKG--YSQQTGP-QQPQQFQGYGQQ--PTSQAP-APAFSGQPQQMPAQPPQQYQASSY 264
Query: 336 R-RAPSYDPSRGTGFDGAPRGAAPHGQVP--PPLNNVPYGSATPPARSGSGQPRGGNPAR 392
+ + S+ T + AP A+ G P P G PP + + P G NP
Sbjct: 265 PPQTYTTQTSQPTNYTVAP--ASQPGMAPSQPGAYQPRPGFTPPPGSTMTPLPSGSNPYA 322
Query: 393 R 393
R
Sbjct: 323 R 323
>UNIPROTKB|F1RY40 [details] [associations]
symbol:RBM12B "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0003676 "nucleic acid binding" evidence=IEA]
[GO:0000166 "nucleotide binding" evidence=IEA] InterPro:IPR000504
InterPro:IPR012677 PROSITE:PS50102 SMART:SM00360 GO:GO:0000166
Gene3D:3.30.70.330 GO:GO:0003676 GeneTree:ENSGT00620000087828
OMA:EHFRRPP CTD:389677 EMBL:CU633952 RefSeq:XP_003125614.1
UniGene:Ssc.32661 Ensembl:ENSSSCT00000006702 GeneID:100514101
KEGG:ssc:100514101 Uniprot:F1RY40
Length = 986
Score = 122 (48.0 bits), Expect = 0.00049, P = 0.00049
Identities = 42/150 (28%), Positives = 65/150 (43%)
Query: 217 KAPSYDPTKGPSYDPAKGPGYDPTKGPGY-DAQKGSNYDAQRGPNYDIHRGPSYDPQRGL 275
+ P D + P + + P + + P D ++ D +R P D R P D +R
Sbjct: 581 RRPPEDDFRRPWEEDFRYPREEDFRYPREEDWRRPPEEDFRRPPKDDFRRPPEEDWRRPP 640
Query: 276 GYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDM 335
D +R P D +R P + +R P + +R P + +R P +R P D +R D
Sbjct: 641 EGDFRRPPEEDWRRPPEEDFRRPPPGEWRRPPEEDFRRPPEEDFRRLPEEDFRRPHEEDF 700
Query: 336 RRAPSYDPSRGTGFDGAPRGAAPHGQVPPP 365
RR+P D R + D R H + PPP
Sbjct: 701 RRSPEED-FRHSPEDDFRRPPPEHFRRPPP 729
>ZFIN|ZDB-GENE-041221-3 [details] [associations]
symbol:prnprs3 "prion protein, related sequence 3"
species:7955 "Danio rerio" [GO:0005509 "calcium ion binding"
evidence=IEA] [GO:0005544 "calcium-dependent phospholipid binding"
evidence=IEA] [GO:0051260 "protein homooligomerization"
evidence=IEA] [GO:0016020 "membrane" evidence=IEA] [GO:0048854
"brain morphogenesis" evidence=IMP] [GO:0005886 "plasma membrane"
evidence=IDA] [GO:0007156 "homophilic cell adhesion" evidence=IDA]
[GO:0021731 "trigeminal motor nucleus development" evidence=IMP]
[GO:0042981 "regulation of apoptotic process" evidence=IMP]
InterPro:IPR001464 InterPro:IPR022416 ZFIN:ZDB-GENE-041221-3
GO:GO:0005886 GO:GO:0042981 GO:GO:0051260 GO:GO:0005509
GO:GO:0007156 GO:GO:0005544 PANTHER:PTHR10502 GO:GO:0048854
Gene3D:1.10.790.10 SUPFAM:SSF54098 HOVERGEN:HBG056090 EMBL:AJ620614
IPI:IPI00679275 RefSeq:NP_001013316.1 UniGene:Dr.162496
UniGene:Dr.84038 ProteinModelPortal:Q5K4F8 GeneID:503702
KEGG:dre:503702 CTD:503702 InParanoid:Q5K4F8 NextBio:20866258
ArrayExpress:Q5K4F8 GO:GO:0021731 Uniprot:Q5K4F8
Length = 567
Score = 119 (46.9 bits), Expect = 0.00051, P = 0.00051
Identities = 70/224 (31%), Positives = 94/224 (41%)
Query: 118 NAPNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYG----VPQ--GHGPPPSATTAG 171
++ N + G+ GG++ +S + +S + + G PQ PPP G
Sbjct: 36 SSSNKGGSSSGNKGGSSSSSSSSSSSKGTSSHGTHTSPGNYPRQPQVPNQNPPPYP---G 92
Query: 172 VVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDP 231
G P A + G P + +Y P GY ++G GY A Y P +G Y P
Sbjct: 93 AGGGYPGQGRYPPAGSNPGYPNQGSY--PGRAGYP-NQG-GYPAQGG--Y-PAQG-GY-P 143
Query: 232 AKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRG 290
A+G GY P +G GY AQ G Y AQ G + G S P +G GY Q G P G
Sbjct: 144 AQG-GY-PAQG-GYPAQGG--YPAQGGYPQGNYPGRSGYPGQG-GYPAQGGYPGGASYPG 197
Query: 291 PGYET--QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQG 332
G + R PG + PV + P Y P RG Q G G
Sbjct: 198 AGAGSYPNRYPGGNPY--PVGGSY--PGY-PVRGGSSPNQFGGG 236
>RGD|1565398 [details] [associations]
symbol:Col6a1 "collagen, type VI, alpha 1" species:10116 "Rattus
norvegicus" [GO:0003674 "molecular_function" evidence=ND]
[GO:0005575 "cellular_component" evidence=ND] [GO:0005576
"extracellular region" evidence=IEA;ISO] [GO:0008150
"biological_process" evidence=ND] [GO:0031012 "extracellular
matrix" evidence=IEA;ISO] [GO:0042383 "sarcolemma"
evidence=IEA;ISO] [GO:0043234 "protein complex" evidence=IEA;ISO]
[GO:0048407 "platelet-derived growth factor binding"
evidence=IEA;ISO] [GO:0070208 "protein heterotrimerization"
evidence=IEA;ISO] [GO:0071230 "cellular response to amino acid
stimulus" evidence=IEA;ISO] InterPro:IPR002035 Pfam:PF00092
PROSITE:PS50234 SMART:SM00327 RGD:1565398 GO:GO:0005576
GO:GO:0043234 GO:GO:0042383 GO:GO:0070208 InterPro:IPR008160
Pfam:PF01391 GO:GO:0031012 GO:GO:0071230 OrthoDB:EOG4WWRHX
OMA:VKENYAE GeneTree:ENSGT00530000063022 IPI:IPI00371853
PRIDE:D3ZUL3 Ensembl:ENSRNOT00000001679 Uniprot:D3ZUL3
Length = 1025
Score = 122 (48.0 bits), Expect = 0.00051, P = 0.00051
Identities = 85/262 (32%), Positives = 103/262 (39%)
Query: 120 PNVDRRADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AG 176
P D A G G G G +E E +GRP G + G P GPP AG G AG
Sbjct: 359 PKGDAGAFGLKGEKGEAG-AEGE-AGRP-GNSGPPGDEGEPGEPGPPGEKGEAGDEGNAG 415
Query: 177 PNTSTSAYAATQSGTPMRAAYDIPRG-PGYEASKGPGYDASKA-PSYDPTK-GPSYDPAK 233
P+ A + G P PRG PG +GP D +A P D + GP P
Sbjct: 416 PDG-----APGERGGPGERG---PRGTPGV---RGPRGDPGEAGPQGDQGREGPVGIPGD 464
Query: 234 GPGYDPTKGP-GYDAQKGS-NYDAQRGPNYDIHRGPSYDPQRGL-GYDMQRGPNYDMQRG 290
PG GP GY +G + RG + GP DP GL G + GP + G
Sbjct: 465 -PGESGPIGPKGYRGDEGPPGPEGLRGAPGPV--GPPGDP--GLMGERGEDGPPGNGTEG 519
Query: 291 -PGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPSRGT- 347
PG+ PGY RGP P +G PG G+ D + RG
Sbjct: 520 FPGF-----PGYPGNRGP-------PGINGTKGYPGLKGDEGEAGDPGEDNNDVSPRGVK 567
Query: 348 ---GFDGAPRGA-APHGQVPPP 365
G+ G P G P G V PP
Sbjct: 568 GAKGYRG-PEGPQGPPGHVGPP 588
>MGI|MGI:1932491 [details] [associations]
symbol:Prp2 "proline rich protein 2" species:10090 "Mus
musculus" [GO:0003674 "molecular_function" evidence=ND] [GO:0005575
"cellular_component" evidence=ND] [GO:0005576 "extracellular
region" evidence=IEA] [GO:0008150 "biological_process" evidence=ND]
MGI:MGI:1932491 GO:GO:0005576 InterPro:IPR026086 PANTHER:PTHR23203
UniGene:Mm.425348 UniGene:Mm.484054 CleanEx:MM_PRH1 EMBL:M23236
EMBL:M12100 EMBL:M19419 IPI:IPI00474263 IPI:IPI00855123 PIR:A28996
PIR:D29149 UniGene:Mm.333439 Genevestigator:P05143
GermOnline:ENSMUSG00000058295 Uniprot:P05143
Length = 317
Score = 115 (45.5 bits), Expect = 0.00055, P = 0.00055
Identities = 67/242 (27%), Positives = 77/242 (31%)
Query: 156 GVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGP-GYD 214
G P GP P GP G R P PG + P G
Sbjct: 79 GPPPPGGPQPRPPQGPPPPGGPQPRPPQGPPPPGGPQPRPPQG-PPPPGGPQPRPPQGPP 137
Query: 215 ASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRG 274
P P +GP P GP P +GP A G +GP GP P +G
Sbjct: 138 PPGGPQQRPPQGPP--PPGGPQPRPPQGPPPPA--GPQPRPPQGPPPPA--GPHLRPTQG 191
Query: 275 ---LGYDMQRGPNYDMQRGPGYETQRVP-GYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG 330
G QR P PG R P G GP + P P GP +
Sbjct: 192 PPPTGGPQQRYPQSPPP--PGGPQPRPPQGPPPPGGPHPRPTQGP---PPTGP--QPRPT 244
Query: 331 QGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQ--PRGG 388
QG P P +G G P+ P G PPP P + P G Q P G
Sbjct: 245 QGPPPTGGPQQRPPQGPPPPGGPQPRPPQGP-PPPTGPQPRPTQGPHPTGGPQQTPPLAG 303
Query: 389 NP 390
NP
Sbjct: 304 NP 305
>MGI|MGI:88455 [details] [associations]
symbol:Col4a2 "collagen, type IV, alpha 2" species:10090 "Mus
musculus" [GO:0001525 "angiogenesis" evidence=IEA] [GO:0005201
"extracellular matrix structural constituent" evidence=IEA]
[GO:0005576 "extracellular region" evidence=IEA] [GO:0005578
"proteinaceous extracellular matrix" evidence=IEA] [GO:0005581
"collagen" evidence=IEA] [GO:0005587 "collagen type IV"
evidence=IDA] [GO:0005604 "basement membrane" evidence=IDA]
[GO:0006351 "transcription, DNA-dependent" evidence=IDA]
[GO:0016525 "negative regulation of angiogenesis" evidence=ISO]
InterPro:IPR001442 Pfam:PF01413 PROSITE:PS51403 SMART:SM00111
MGI:MGI:88455 GO:GO:0071560 InterPro:IPR016187 SUPFAM:SSF56436
GO:GO:0006351 GO:GO:0001525 InterPro:IPR008160 Pfam:PF01391
eggNOG:NOG12793 GO:GO:0016525 GO:GO:0005201 HOVERGEN:HBG004933
GO:GO:0005587 Gene3D:2.170.240.10 GeneTree:ENSGT00690000101772
KO:K06237 EMBL:J04448 EMBL:M23333 OrthoDB:EOG4XGZZF CTD:1284
OMA:TTIPEQN ChiTaRS:COL4A2 EMBL:J04695 EMBL:AK053858 EMBL:AK075619
EMBL:AK164096 EMBL:BC013560 EMBL:BC080789 EMBL:BC107685 EMBL:M23334
EMBL:X02896 EMBL:X02897 EMBL:X02898 EMBL:X02899 EMBL:X04410
EMBL:X04647 EMBL:M15833 EMBL:AY375463 EMBL:AY502946 EMBL:AY502947
IPI:IPI00338452 PIR:A33526 RefSeq:NP_034062.3 UniGene:Mm.181021
ProteinModelPortal:P08122 SMR:P08122 STRING:P08122
PhosphoSite:P08122 PaxDb:P08122 PRIDE:P08122
Ensembl:ENSMUST00000033899 GeneID:12827 KEGG:mmu:12827
InParanoid:P08122 NextBio:282318 Bgee:P08122 CleanEx:MM_COL4A2
Genevestigator:P08122 GermOnline:ENSMUSG00000031503 Uniprot:P08122
Length = 1707
Score = 124 (48.7 bits), Expect = 0.00055, P = 0.00055
Identities = 91/301 (30%), Positives = 110/301 (36%)
Query: 119 APNVDRRADGSYGGATGN----SENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG 174
+P VD D + G TG+ E T PVG + G P GP S G G
Sbjct: 1205 SPGVDAHGDPGFPGPTGDRGDRGEANTLPGPVGVPGQKGERGTPGERGPAGSPGLQGFPG 1264
Query: 175 AGP--NTSTSAYAATQSGTPMRAAYDIPRGP-GYEASKGPGYD--ASKAPSYDPTKGPSY 229
P N S S G Y P GP G A G D +S A + KG
Sbjct: 1265 ISPPSNISGSPGDVGAPGIFGLQGYQGPPGPPGPNALPGIKGDEGSSGAAGFPGQKGWVG 1324
Query: 230 DPAKGPGYDP-TKG-PGYDAQKGSN-YDAQRGPNYDI-HRGPSYDPQRGLGYDMQRGPNY 285
DP GP P G PG KG + GP+ + RGP P+ G+ G
Sbjct: 1325 DP--GPQGQPGVLGLPGEKGPKGEQGFMGNTGPSGAVGDRGPK-GPKGDQGFPGAPGS-- 1379
Query: 286 DMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPS 344
PG +PG Q+ V P +RG PG + G + P DP
Sbjct: 1380 --MGSPG-----IPGIP-QKIAVQPGTLGPQ--GRRGLPGALGEIGP-----QGPPGDP- 1423
Query: 345 RGTGFDGAPRGAAPHGQ-----VPP------PLNNV-PYGSATPPARSGS-GQPRGGNPA 391
GF GAP A P G+ VP P+ + P G P R GS G P G P
Sbjct: 1424 ---GFRGAPGKAGPQGRGGVSAVPGFRGDQGPMGHQGPVGQEGEPGRPGSPGLP--GMPG 1478
Query: 392 R 392
R
Sbjct: 1479 R 1479
>UNIPROTKB|Q96P44 [details] [associations]
symbol:COL21A1 "Collagen alpha-1(XXI) chain" species:9606
"Homo sapiens" [GO:0005581 "collagen" evidence=IEA] [GO:0005576
"extracellular region" evidence=TAS] [GO:0005788 "endoplasmic
reticulum lumen" evidence=TAS] [GO:0030198 "extracellular matrix
organization" evidence=TAS] [GO:0031012 "extracellular matrix"
evidence=IDA] InterPro:IPR002035 Pfam:PF00092 PROSITE:PS50234
SMART:SM00327 Reactome:REACT_118779 GO:GO:0005576 GO:GO:0030198
Gene3D:2.60.120.200 InterPro:IPR008985 InterPro:IPR013320
SUPFAM:SSF49899 GO:GO:0005788 GO:GO:0005581 InterPro:IPR008160
Pfam:PF01391 eggNOG:NOG12793 InterPro:IPR001791 PROSITE:PS50025
SMART:SM00210 EMBL:AF414088 EMBL:AF330693 EMBL:AF438327
EMBL:AL136624 EMBL:AF370383 EMBL:AK313398 EMBL:AL513530
EMBL:AL031782 EMBL:AL034452 EMBL:BC045597 EMBL:BC126108
IPI:IPI00102435 IPI:IPI00435960 IPI:IPI00644733 RefSeq:NP_110447.2
UniGene:Hs.47629 HSSP:P18614 ProteinModelPortal:Q96P44 SMR:Q96P44
STRING:Q96P44 DMDM:74752071 PaxDb:Q96P44 PRIDE:Q96P44
Ensembl:ENST00000244728 Ensembl:ENST00000370808
Ensembl:ENST00000370819 GeneID:81578 KEGG:hsa:81578 UCSC:uc003pcs.3
UCSC:uc003pcu.1 UCSC:uc010jzz.3 CTD:81578 GeneCards:GC06M055968
HGNC:HGNC:17025 HPA:HPA031210 HPA:HPA031212 HPA:HPA031213
MIM:610002 neXtProt:NX_Q96P44 PharmGKB:PA26714 HOVERGEN:HBG106599
InParanoid:Q96P44 KO:K16629 OMA:NGRQGIP OrthoDB:EOG4KH2TF
GenomeRNAi:81578 NextBio:71896 ArrayExpress:Q96P44 Bgee:Q96P44
CleanEx:HS_COL21A1 Genevestigator:Q96P44 Uniprot:Q96P44
Length = 957
Score = 121 (47.7 bits), Expect = 0.00060, P = 0.00060
Identities = 58/205 (28%), Positives = 78/205 (38%)
Query: 199 IPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYD-AQ 256
+P PGY PG D P Y G P PG +G PGY + G + D
Sbjct: 462 LPGNPGYPGQ--PGQDGK--PGYQGIAGTPGVPGS-PGIQGARGLPGYKGEPGRDGDKGD 516
Query: 257 RG-PNYD-IHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRA 314
RG P + +H P + G D + P + ++G E G+ GP E R
Sbjct: 517 RGLPGFPGLHGMPGSKGEMGAKGD-KGSPGFYGKKGAKGEKGNA-GFPGLPGPAGEPGRH 574
Query: 315 PSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGT-GFDGAP--RGA-APHGQVPPPLNNVP 370
PG+ + G AP D +RG G G P RG G++ PP
Sbjct: 575 GKDGLMGSPGFKGEAGSP----GAPGQDGTRGEPGIPGFPGNRGLMGQKGEIGPPGQQGK 630
Query: 371 YGSATPPARSGS-GQP-RGGNPARR 393
G+ P GS G P + G P +
Sbjct: 631 KGAPGMPGLMGSNGSPGQPGTPGSK 655
>UNIPROTKB|B4DLD3 [details] [associations]
symbol:SS18 "cDNA FLJ58120, highly similar to SSXT protein"
species:9606 "Homo sapiens" [GO:0000226 "microtubule cytoskeleton
organization" evidence=IEA] [GO:0000902 "cell morphogenesis"
evidence=IEA] [GO:0005881 "cytoplasmic microtubule" evidence=IEA]
[GO:0007243 "intracellular protein kinase cascade" evidence=IEA]
[GO:0042493 "response to drug" evidence=IEA] [GO:0048013 "ephrin
receptor signaling pathway" evidence=IEA] GO:GO:0000226
GO:GO:0042493 GO:GO:0007243 GO:GO:0000902 GO:GO:0048013
GO:GO:0005881 HOVERGEN:HBG003892 InterPro:IPR007726
PANTHER:PTHR23107 Pfam:PF05030 UniGene:Hs.129261 EMBL:AC091021
HGNC:HGNC:11340 ChiTaRS:SS18 EMBL:AK296949 IPI:IPI01011245
STRING:B4DLD3 Ensembl:ENST00000542420 Uniprot:B4DLD3
Length = 395
Score = 116 (45.9 bits), Expect = 0.00063, P = 0.00063
Identities = 66/236 (27%), Positives = 88/236 (37%)
Query: 128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQG---H--GPPPSATTAGVVGAGPNT--- 179
G+YG S G + Q Y +PQG H G P G V G +
Sbjct: 165 GNYGPRPNMSMQPNQGPMMHQQPPSQQYNMPQGGGQHYQGQQPPMGMMGQVNQGNHMMGQ 224
Query: 180 -STSAYAATQSGTPMR-AAYDIPRGPGYE-ASKGP--GYDASKAPSYDPTKG---PSYDP 231
Y Q G P + + + G Y +GP G + P G PSY P
Sbjct: 225 RQIPPYRPPQQGPPQQYSGQEDYYGDQYSHGGQGPPEGMNQQYYPDGHNDYGYQQPSY-P 283
Query: 232 AKGPGYD-PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 290
+G YD P + +G N +Q G D ++GP PQ+G Q+ P G
Sbjct: 284 EQG--YDRPYEDSSQHYYEGGN--SQYGQQQDAYQGPP--PQQGYPPQQQQYPGQQGYPG 337
Query: 291 P--GY-ETQRVPGYDVQRGPVYEAQRAPSYIP-QRGPGYD-LQRGQGYDMRRAPSY 341
GY +Q PG P + Q+ Y P Q GP QR GYD + +Y
Sbjct: 338 QQQGYGPSQGGPGPQYPNYPQGQGQQYGGYRPTQPGPPQPPQQRPYGYDQGQYGNY 393
>UNIPROTKB|F1NNB3 [details] [associations]
symbol:PRNP "Major prion protein" species:9031 "Gallus
gallus" [GO:0051260 "protein homooligomerization" evidence=IEA]
[GO:0001933 "negative regulation of protein phosphorylation"
evidence=IEA] [GO:0005730 "nucleolus" evidence=IEA] [GO:0005783
"endoplasmic reticulum" evidence=IEA] [GO:0005794 "Golgi apparatus"
evidence=IEA] [GO:0005886 "plasma membrane" evidence=IEA]
[GO:0006979 "response to oxidative stress" evidence=IEA]
[GO:0008017 "microtubule binding" evidence=IEA] [GO:0032689
"negative regulation of interferon-gamma production" evidence=IEA]
[GO:0032700 "negative regulation of interleukin-17 production"
evidence=IEA] [GO:0032703 "negative regulation of interleukin-2
production" evidence=IEA] [GO:0042802 "identical protein binding"
evidence=IEA] [GO:0043066 "negative regulation of apoptotic
process" evidence=IEA] [GO:0043433 "negative regulation of
sequence-specific DNA binding transcription factor activity"
evidence=IEA] [GO:0046007 "negative regulation of activated T cell
proliferation" evidence=IEA] [GO:0050860 "negative regulation of T
cell receptor signaling pathway" evidence=IEA] [GO:0070885
"negative regulation of calcineurin-NFAT signaling cascade"
evidence=IEA] InterPro:IPR000817 InterPro:IPR022416 PROSITE:PS00291
GO:GO:0005783 GO:GO:0005886 GO:GO:0005794 GO:GO:0043066
GO:GO:0006979 GO:GO:0005730 GO:GO:0032689 GO:GO:0051260
GO:GO:0005507 GO:GO:0043433 GO:GO:0001933 GO:GO:0046007
GO:GO:0050860 GO:GO:0070885 GO:GO:0032703 GO:GO:0032700
Gene3D:1.10.790.10 PANTHER:PTHR11522 GeneTree:ENSGT00510000049083
EMBL:AADN02055483 IPI:IPI00819942 Ensembl:ENSGALT00000041079
ArrayExpress:F1NNB3 Uniprot:F1NNB3
Length = 125
Score = 94 (38.1 bits), Expect = 0.00064, P = 0.00064
Identities = 37/103 (35%), Positives = 46/103 (44%)
Query: 159 QGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKG----PGYD 214
+G G P + G GAG + S Y Q G P Y P PGY + G PGY
Sbjct: 3 KGKGKP----SGGGWGAGSHRQPS-YPR-QPGYPHNPGY--PHNPGYPHNPGYPHNPGYP 54
Query: 215 ASKAPSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQR 257
+ P Y P + P Y P PGY P G GY+ G +Y Q+
Sbjct: 55 HN--PGY-P-QNPGY-P-HNPGY-PGWGQGYNPSSGGSYHNQK 90
>UNIPROTKB|Q03692 [details] [associations]
symbol:COL10A1 "Collagen alpha-1(X) chain" species:9606
"Homo sapiens" [GO:0046872 "metal ion binding" evidence=IEA]
[GO:0005581 "collagen" evidence=IEA] [GO:0005938 "cell cortex"
evidence=IEA] [GO:0001501 "skeletal system development"
evidence=TAS] [GO:0005576 "extracellular region" evidence=TAS]
[GO:0005788 "endoplasmic reticulum lumen" evidence=TAS] [GO:0030198
"extracellular matrix organization" evidence=TAS]
InterPro:IPR008983 Reactome:REACT_118779 GO:GO:0005938
GO:GO:0046872 EMBL:CH471051 GO:GO:0030198 GO:GO:0005788
GO:GO:0001501 HOGENOM:HOG000085653 HOVERGEN:HBG108220 GO:GO:0005581
Gene3D:2.60.120.40 InterPro:IPR001073 InterPro:IPR008160
Pfam:PF00386 Pfam:PF01391 PRINTS:PR00007 SMART:SM00110
SUPFAM:SSF49842 PROSITE:PS50871 eggNOG:NOG114228 CTD:1300
OrthoDB:EOG4FFD29 EMBL:X60382 EMBL:X72579 EMBL:X72580 EMBL:X98568
EMBL:AL121963 EMBL:BC130621 EMBL:BC130623 EMBL:X65120 EMBL:X58879
EMBL:M74050 EMBL:S68531 IPI:IPI00011685 PIR:S26396
RefSeq:NP_000484.2 UniGene:Hs.520339 PDB:1GR3 PDBsum:1GR3
ProteinModelPortal:Q03692 SMR:Q03692 MINT:MINT-101719 STRING:Q03692
DMDM:2506306 PaxDb:Q03692 PRIDE:Q03692 Ensembl:ENST00000243222
Ensembl:ENST00000327673 GeneID:1300 KEGG:hsa:1300 UCSC:uc003pwm.3
GeneCards:GC06M116440 HGNC:HGNC:2185 MIM:120110 MIM:156500
neXtProt:NX_Q03692 Orphanet:174 PharmGKB:PA26701 InParanoid:Q03692
OMA:IKGPPPN PhylomeDB:Q03692 EvolutionaryTrace:Q03692
GenomeRNAi:1300 NextBio:5279 ArrayExpress:Q03692 Bgee:Q03692
CleanEx:HS_COL10A1 Genevestigator:Q03692 GermOnline:ENSG00000123500
Uniprot:Q03692
Length = 680
Score = 119 (46.9 bits), Expect = 0.00065, P = 0.00065
Identities = 76/279 (27%), Positives = 99/279 (35%)
Query: 120 PNVDRRADGSYGGATG-NSENETSGR--PVGQNAYEDGYGV--PQGHGPPPSATTAGVVG 174
P V +R + G G + G P+G + G P+G G P +A G G
Sbjct: 217 PGVGKRGENGVPGQPGIKGDRGFPGEMGPIGPPGPQGPPGERGPEGIGKPGAAGAPGQPG 276
Query: 175 AGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKG 234
P T A +G P + P PG + +GP P +GP+ P K
Sbjct: 277 I-PGTKGLPGAPGIAGPPGPPGFGKPGLPGLKGERGPA-GLPGGPGAKGEQGPAGLPGK- 333
Query: 235 PGYD-PTKGPGYDAQKG-SNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRGP 291
PG P G KG GP + GP+ P G +RG P D + P
Sbjct: 334 PGLTGPPGNMGPQGPKGIPGSHGLPGPKGET--GPA-GPAGYPGAKGERGSPGSDGK--P 388
Query: 292 GYETQRVPGYDVQRGPVYEAQRAPSYIPQRG--PGYDLQRGQGYDMRRAPSYDPSRGT-G 348
GY + PG D +G P P G PG G + P ++ G G
Sbjct: 389 GYPGK--PGLDGPKGN--PGLPGPKGDPGVGGPPGLPGPVGPA-GAKGMPGHNGEAGPRG 443
Query: 349 FDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 387
G P P G PP + P GS P G P G
Sbjct: 444 APGIPGTRGPIG--PPGIPGFP-GSKGDPGSPGPPGPAG 479
>WB|WBGene00000653 [details] [associations]
symbol:col-77 species:6239 "Caenorhabditis elegans"
[GO:0042302 "structural constituent of cuticle" evidence=IEA]
[GO:0016021 "integral to membrane" evidence=IEA] InterPro:IPR002486
Pfam:PF01484 SMART:SM01088 InterPro:IPR008160 Pfam:PF01391
EMBL:Z66498 GO:GO:0042302 HOGENOM:HOG000085656
GeneTree:ENSGT00610000086159 PIR:T23801 RefSeq:NP_495759.1
ProteinModelPortal:Q21562 DIP:DIP-26119N MINT:MINT-1050309
STRING:Q21562 EnsemblMetazoa:M195.1 GeneID:174336
KEGG:cel:CELE_M195.1 UCSC:M195.1 CTD:174336 WormBase:M195.1
eggNOG:NOG315089 InParanoid:Q21562 OMA:IAFFGIC NextBio:883606
Uniprot:Q21562
Length = 304
Score = 114 (45.2 bits), Expect = 0.00065, P = 0.00065
Identities = 71/238 (29%), Positives = 87/238 (36%)
Query: 154 GYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRG-PGYEASKGPG 212
GYG P + + + G G S + A GTP D G PG + G
Sbjct: 85 GYGAPAEYSTDAAVSAGGSEAGGQCCSCGSGPAGPPGTPGEDGRDGNDGQPGPDGQPGSD 144
Query: 213 YDASKAPSYDPTKGPSYD-PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDP 271
A P+ D +D PA PG GP KG+ +A P D G P
Sbjct: 145 APAEAIPTADDF---CFDCPAGPPGPAGNAGP-----KGAPGNAG-APGNDGQAGAPGAP 195
Query: 272 QRGLGYDMQRGP-NYDMQRG-PGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQR 329
G D +GP D G PG + Q PG V+ V P PQ PG D Q
Sbjct: 196 ----GNDGPQGPPGQDGAAGQPGPDGQ--PGV-VEEVAVPAGPPGPPG-PQGAPGTDGQP 247
Query: 330 GQ-GYDMRRAPSYDPSRGTGFDGAP--RGAA-PHGQVPPPLNNVPYGSATPPARSGSG 383
G G + P P+ G DGAP GAA G+ P PP R+ G
Sbjct: 248 GSAGQPGQDGPQ-GPAGDAGTDGAPGQAGAAGEQGEAGQPGEGGGCDHCPPP-RTAPG 303
>UNIPROTKB|E2RQK9 [details] [associations]
symbol:PYGO2 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0060070 "canonical Wnt receptor signaling
pathway" evidence=IEA] [GO:0060021 "palate development"
evidence=IEA] [GO:0051569 "regulation of histone H3-K4 methylation"
evidence=IEA] [GO:0048589 "developmental growth" evidence=IEA]
[GO:0042393 "histone binding" evidence=IEA] [GO:0033599 "regulation
of mammary gland epithelial cell proliferation" evidence=IEA]
[GO:0030879 "mammary gland development" evidence=IEA] [GO:0009791
"post-embryonic development" evidence=IEA] [GO:0007420 "brain
development" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
[GO:0002088 "lens development in camera-type eye" evidence=IEA]
[GO:0001822 "kidney development" evidence=IEA] [GO:0001701 "in
utero embryonic development" evidence=IEA] [GO:0008270 "zinc ion
binding" evidence=IEA] InterPro:IPR001965 InterPro:IPR019787
Pfam:PF00628 PROSITE:PS50016 SMART:SM00249 GO:GO:0005634
GO:GO:0007420 GO:GO:0046872 GO:GO:0008270 GO:GO:0001701
GO:GO:0009791 GO:GO:0001822 Gene3D:3.30.40.10 InterPro:IPR011011
InterPro:IPR013083 SUPFAM:SSF57903 GO:GO:0048589 InterPro:IPR019786
PROSITE:PS01359 GO:GO:0060021 GO:GO:0060070 GO:GO:0030879
GO:GO:0033599 GO:GO:0051569 GO:GO:0002088
GeneTree:ENSGT00530000063948 CTD:90780 OMA:PGLVYPC
EMBL:AAEX03005346 RefSeq:XP_547562.2 Ensembl:ENSCAFT00000027172
GeneID:490440 KEGG:cfa:490440 NextBio:20863469 Uniprot:E2RQK9
Length = 405
Score = 116 (45.9 bits), Expect = 0.00066, P = 0.00065
Identities = 80/294 (27%), Positives = 106/294 (36%)
Query: 117 MNAPNVDRRADGSYGGATGN-SENETSGRP-----VGQNAYEDGYGVPQGHGPPPSATTA 170
M +P RR + G A + +E P V N +ED +G P+ G P +
Sbjct: 38 MKSPEKKRRKSNTQGPAYSHLTEFAPPPTPMVDHLVASNPFEDDFGAPKVGGAAPPFLGS 97
Query: 171 GVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTK-GPSY 229
V G Q G A +P G G GP + P + P GP++
Sbjct: 98 PVPFGG--------FRVQGGM----AGQVPPGYGTGGGGGPQPLRRQPPPFPPNPMGPAF 145
Query: 230 D-PAKGPGYDPTKGPGYDAQK-----GSNYDAQRG---PNYDIHRGPSYDPQRGLGYDMQ 280
+ P +GPGY P + +Q G N+ G P GP P G +
Sbjct: 146 NMPPQGPGYPPPGNMNFPSQPFNQPLGQNFSPPGGQMMPGPVGGFGPMISPTMGQPPRGE 205
Query: 281 RGPNYDMQRGPGYETQRVP-GYDVQRGPVYEAQRAPSYIPQRGP--GYDLQ-RGQGYDMR 336
GP+ QR + P G +QR P Q PS P P G D G G +
Sbjct: 206 LGPHSLPQR---FAQPGAPFGPSLQR-P---GQGLPSLPPNTSPFPGPDPGFPGPGGEDG 258
Query: 337 RAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNP 390
P +P T F P +P V N P + PP SG G GG P
Sbjct: 259 GKP-LNPPAPTAFPQEPHSGSPAAAVN---GNQP---SFPPNSSGRG---GGTP 302
>UNIPROTKB|Q2KFJ6 [details] [associations]
symbol:MGCH7_ch7g689 "Putative uncharacterized protein"
species:242507 "Magnaporthe oryzae 70-15" [GO:0003674
"molecular_function" evidence=ND] [GO:0005575 "cellular_component"
evidence=ND] [GO:0008150 "biological_process" evidence=ND]
EMBL:CM000230 Uniprot:Q2KFJ6
Length = 356
Score = 115 (45.5 bits), Expect = 0.00068, P = 0.00068
Identities = 72/273 (26%), Positives = 97/273 (35%)
Query: 69 KNAITFHLCRGTYEYEKKFYNDHLE-SLQVMEKNYITM-----ATEVEKLRAELMNAPNV 122
+ IT +CR + HLE +V++ YIT + +E L+ +
Sbjct: 38 REVITADICRYLGN-DALVRPGHLERDGRVVQGYYITAYRNLTSAMIESLKEDSQKWVEE 96
Query: 123 DRRADGSYGGAT--GNSEN---ETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGP 177
RRA G+ GG G S N S P Q Y D + + T GV P
Sbjct: 97 KRRAQGAQGGTKYPGGSANCSARKSNSPTAQMRYMDS-SLRNPNAVSQHMT--GVARDYP 153
Query: 178 NTSTSAYAATQSGTPMRAAYDIP-RGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG 236
+ S +AY+ + P R GY A+ PG + P Y + P A+
Sbjct: 154 D-SQAAYSESYGAGGQGGFGQYPSRDQGY-AAPPPGSFPPREPVYADRQDPYGAQARATS 211
Query: 237 YDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYD-MQRGPGYET 295
GY Q Y A G N P+ PQ+ G MQ P+Y +G Y
Sbjct: 212 QQYVSA-GYGQQADGPYHAT-GMNRQYAAPPA--PQQAYGDPMQITPSYPPTSQGGAYSP 267
Query: 296 QRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQ 328
Q Y P AP Y PQ P Q
Sbjct: 268 QAQQPYYAGAAP---PPGAPRYDPQGVPATSAQ 297
>MGI|MGI:3040693 [details] [associations]
symbol:Zmiz1 "zinc finger, MIZ-type containing 1"
species:10090 "Mus musculus" [GO:0001570 "vasculogenesis"
evidence=IMP] [GO:0001701 "in utero embryonic development"
evidence=IMP] [GO:0003007 "heart morphogenesis" evidence=IMP]
[GO:0005634 "nucleus" evidence=IEA] [GO:0005737 "cytoplasm"
evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
evidence=IEA] [GO:0006355 "regulation of transcription,
DNA-dependent" evidence=IEA] [GO:0007296 "vitellogenesis"
evidence=IMP] [GO:0007569 "cell aging" evidence=IDA] [GO:0008270
"zinc ion binding" evidence=IEA] [GO:0045944 "positive regulation
of transcription from RNA polymerase II promoter" evidence=IMP]
[GO:0046872 "metal ion binding" evidence=IEA] [GO:0048146 "positive
regulation of fibroblast proliferation" evidence=IMP] [GO:0048589
"developmental growth" evidence=IMP] [GO:0048844 "artery
morphogenesis" evidence=IMP] InterPro:IPR004181 Pfam:PF02891
PROSITE:PS51044 MGI:MGI:3040693 GO:GO:0005737 GO:GO:0046872
GO:GO:0016607 GO:GO:0003007 GO:GO:0008270 GO:GO:0001701
GO:GO:0045944 GO:GO:0006351 Gene3D:3.30.40.10 InterPro:IPR013083
GO:GO:0048589 GO:GO:0001570 GO:GO:0048146 GO:GO:0048844
GO:GO:0007569 GO:GO:0007296 GeneTree:ENSGT00550000074410 CTD:57178
eggNOG:NOG237400 HOGENOM:HOG000253014 HOVERGEN:HBG056252
OMA:MNQYGPM OrthoDB:EOG45MN70 ChiTaRS:ZMIZ1 EMBL:BC057691
EMBL:BC058646 EMBL:BC065120 EMBL:AK054366 IPI:IPI00226072
IPI:IPI00480418 RefSeq:NP_899031.2 UniGene:Mm.227484
UniGene:Mm.486339 UniGene:Mm.489608 ProteinModelPortal:Q6P1E1
SMR:Q6P1E1 IntAct:Q6P1E1 STRING:Q6P1E1 PhosphoSite:Q6P1E1
PaxDb:Q6P1E1 PRIDE:Q6P1E1 Ensembl:ENSMUST00000007961
Ensembl:ENSMUST00000162645 GeneID:328365 KEGG:mmu:328365
UCSC:uc007srn.1 UCSC:uc007sro.1 InParanoid:Q6P1E1 NextBio:398259
Bgee:Q6P1E1 CleanEx:MM_ZMIZ1 Genevestigator:Q6P1E1
GermOnline:ENSMUSG00000007817 Uniprot:Q6P1E1
Length = 1072
Score = 121 (47.7 bits), Expect = 0.00069, P = 0.00069
Identities = 65/232 (28%), Positives = 84/232 (36%)
Query: 176 GPNTSTSAYAATQSGTPMRAAYDIPRGPG-YEASKGP-GYDASKAPSYDPTKGP--SYDP 231
GP S+ TQ+ PRGP S P G A PS GP +
Sbjct: 318 GPVCSSFQMGPTQAYNSQFMNQPGPRGPASMGGSLNPAGMAAGMTPS--GMSGPPMGMNQ 375
Query: 232 AKGPGYDP--TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQR 289
+ PG P T G Q Q P I R +P G + Q GPN
Sbjct: 376 PRPPGISPFGTHGQRMPQQTYPGPRPQSLPIQSIKRPYPGEPNYG---NQQYGPNSQFPT 432
Query: 290 GPG-YETQRVPGYDVQRGPVYEAQRAPSYIPQRG--PGYDLQRGQGYDMRRAPSYDP--S 344
PG Y T P P Y QR PS P G P + GQ Y + + S
Sbjct: 433 QPGQYPTPNPPR--PLTSPNYPGQRMPSQ-PSTGQYPPPTVNMGQYYKPEQFNGQNNTFS 489
Query: 345 RGTGFDGAPRGAAPHGQVPPPLNNVPY----GSATPPARSGSGQPRGGNPAR 392
G+ + +G+ P P+ N P+ G+ TPP GS P +P++
Sbjct: 490 SGSSYSSYSQGSVNRPPRPVPVANYPHSPVPGNPTPPMTPGSSIPPYLSPSQ 541
>UNIPROTKB|Q15532 [details] [associations]
symbol:SS18 "Protein SSXT" species:9606 "Homo sapiens"
[GO:0006351 "transcription, DNA-dependent" evidence=IEA]
[GO:0000226 "microtubule cytoskeleton organization" evidence=IEA]
[GO:0000902 "cell morphogenesis" evidence=IEA] [GO:0005881
"cytoplasmic microtubule" evidence=IEA] [GO:0007243 "intracellular
protein kinase cascade" evidence=IEA] [GO:0042493 "response to
drug" evidence=IEA] [GO:0048013 "ephrin receptor signaling pathway"
evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0030374
"ligand-dependent nuclear receptor transcription coactivator
activity" evidence=IDA] [GO:0005515 "protein binding" evidence=IPI]
[GO:0045944 "positive regulation of transcription from RNA
polymerase II promoter" evidence=IDA] GO:GO:0005634 GO:GO:0000226
GO:GO:0042493 GO:GO:0045944 GO:GO:0007243 GO:GO:0006351
EMBL:CH471088 GO:GO:0000902 Orphanet:3273 GO:GO:0048013
GO:GO:0005881 GO:GO:0030374 HOVERGEN:HBG003892 InterPro:IPR007726
PANTHER:PTHR23107 Pfam:PF05030 UniGene:Hs.129261 EMBL:X79200
EMBL:S79894 EMBL:X79201 EMBL:AF343880 EMBL:EF445031 EMBL:BC096223
IPI:IPI00452919 IPI:IPI00940186 PIR:S46269 RefSeq:NP_001007560.1
RefSeq:NP_005628.2 ProteinModelPortal:Q15532 IntAct:Q15532
STRING:Q15532 PhosphoSite:Q15532 DMDM:20141795 PaxDb:Q15532
PRIDE:Q15532 DNASU:6760 Ensembl:ENST00000269137
Ensembl:ENST00000415083 GeneID:6760 KEGG:hsa:6760 UCSC:uc002kvm.3
CTD:6760 GeneCards:GC18M023596 HGNC:HGNC:11340 MIM:600192
neXtProt:NX_Q15532 PharmGKB:PA36164 eggNOG:NOG274014
InParanoid:Q15532 KO:K15623 OrthoDB:EOG4RFKTH PhylomeDB:Q15532
ChiTaRS:SS18 GenomeRNAi:6760 NextBio:26388 ArrayExpress:Q15532
Bgee:Q15532 CleanEx:HS_SS18 Genevestigator:Q15532
GermOnline:ENSG00000141380 Uniprot:Q15532
Length = 418
Score = 116 (45.9 bits), Expect = 0.00069, P = 0.00069
Identities = 66/236 (27%), Positives = 88/236 (37%)
Query: 128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQG---H--GPPPSATTAGVVGAGPNT--- 179
G+YG S G + Q Y +PQG H G P G V G +
Sbjct: 188 GNYGPRPNMSMQPNQGPMMHQQPPSQQYNMPQGGGQHYQGQQPPMGMMGQVNQGNHMMGQ 247
Query: 180 -STSAYAATQSGTPMR-AAYDIPRGPGYE-ASKGP--GYDASKAPSYDPTKG---PSYDP 231
Y Q G P + + + G Y +GP G + P G PSY P
Sbjct: 248 RQIPPYRPPQQGPPQQYSGQEDYYGDQYSHGGQGPPEGMNQQYYPDGHNDYGYQQPSY-P 306
Query: 232 AKGPGYD-PTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRG 290
+G YD P + +G N +Q G D ++GP PQ+G Q+ P G
Sbjct: 307 EQG--YDRPYEDSSQHYYEGGN--SQYGQQQDAYQGPP--PQQGYPPQQQQYPGQQGYPG 360
Query: 291 P--GY-ETQRVPGYDVQRGPVYEAQRAPSYIP-QRGPGYD-LQRGQGYDMRRAPSY 341
GY +Q PG P + Q+ Y P Q GP QR GYD + +Y
Sbjct: 361 QQQGYGPSQGGPGPQYPNYPQGQGQQYGGYRPTQPGPPQPPQQRPYGYDQGQYGNY 416
>WB|WBGene00000627 [details] [associations]
symbol:col-50 species:6239 "Caenorhabditis elegans"
[GO:0042302 "structural constituent of cuticle" evidence=IEA]
[GO:0016021 "integral to membrane" evidence=IEA] InterPro:IPR002486
Pfam:PF01484 SMART:SM01088 InterPro:IPR008160 Pfam:PF01391
GO:GO:0042302 HOGENOM:HOG000085656 GeneTree:ENSGT00530000064217
EMBL:FO080999 PIR:T15142 RefSeq:NP_491194.1 UniGene:Cel.16665
ProteinModelPortal:O01662 EnsemblMetazoa:T28F2.6 GeneID:189050
KEGG:cel:CELE_T28F2.6 UCSC:T28F2.6 CTD:189050 WormBase:T28F2.6
eggNOG:NOG279371 InParanoid:O01662 OMA:AGNCITC NextBio:941028
Uniprot:O01662
Length = 418
Score = 116 (45.9 bits), Expect = 0.00069, P = 0.00069
Identities = 79/285 (27%), Positives = 95/285 (33%)
Query: 120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNT 179
P + A+G+ GG + SG P G G G P A G G +
Sbjct: 96 PAKEGYAEGAGGGGGCQCAAQASGCPAGPPGPPGEAGAD---GEPGEAGQDGAAGEAGSA 152
Query: 180 STSAYAATQSGTPMRAAYDIPRGP-GYEASKGP----GYDASKAPSYDPTKGPSYDPAKG 234
T A AA T A P GP G GP G D A +P GP+ PA
Sbjct: 153 DTYAGAAGNCIT-CPAGPPGPPGPDGNAGPAGPAGAAGPDGEGAGYAEP--GPA-GPAGP 208
Query: 235 PGYDPTKG-PGYDAQKGSNYDAQRG-PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGP- 291
PG D G PG D Q G+ P GP P G D P+ GP
Sbjct: 209 PGPDGQPGAPGPDGQPGAGGTTSTNQPGPPGPAGPP-GPAGPAGEDAYAQPSPAGTPGPP 267
Query: 292 ---GYETQR-------VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSY 341
G + + PG D GP +A P G G + G A Y
Sbjct: 268 GPPGKDGEAGPDGPAGAPGTDGAPGP--DAAYCPCPPRTLGAGAYPEGGDAAAAAPAGGY 325
Query: 342 DPSRGTGFDGAPRGAAPHGQVPPPLNNV-PYGSATPPARSGSGQP 385
D G + AP AA P P P G A +G+ P
Sbjct: 326 DGGAGAAPEAAPAAAAAPQPAPAPAAAAAPAGGYQGGAAAGAAAP 370
>ZFIN|ZDB-GENE-040407-1 [details] [associations]
symbol:cherp "calcium homeostasis endoplasmic
reticulum protein" species:7955 "Danio rerio" [GO:0003723 "RNA
binding" evidence=IEA] [GO:0006396 "RNA processing" evidence=IEA]
[GO:0003676 "nucleic acid binding" evidence=IEA] InterPro:IPR000061
InterPro:IPR000467 Pfam:PF01585 Pfam:PF01805 PROSITE:PS50128
PROSITE:PS50174 SMART:SM00443 SMART:SM00648 ZFIN:ZDB-GENE-040407-1
GO:GO:0003723 GO:GO:0006396 Gene3D:1.25.40.90 InterPro:IPR008942
SUPFAM:SSF48464 HOGENOM:HOG000010294 HOVERGEN:HBG052716
InterPro:IPR006903 InterPro:IPR006569 Pfam:PF04818 SUPFAM:SSF109905
PROSITE:PS51391 EMBL:BC171627 IPI:IPI00490676 UniGene:Dr.75231
ArrayExpress:B7ZVL5 Bgee:B7ZVL5 Uniprot:B7ZVL5
Length = 910
Score = 120 (47.3 bits), Expect = 0.00073, P = 0.00073
Identities = 61/221 (27%), Positives = 93/221 (42%)
Query: 190 GTPMRAAYDIPR-GPGYEASKGPG-YDAS-KAPSYDPTKGPSYDPAKGPGYDPTKGPGYD 246
G+ R++ P P +SK PG +D P+++P + P +DP++ P P P ++
Sbjct: 383 GSQNRSSDSNPALSPEMSSSK-PGWFDPQHNMPAWNPQQPPPFDPSQAP--PPC--PPWN 437
Query: 247 AQKGSNYDAQRGPNYDIHR--GP---SYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGY 301
+ +G ++ QR PN+ R GP DP ++ P Q P QR P +
Sbjct: 438 SHEGL-WNEQRDPNWSDPRDGGPWSGQNDPPPSWSGQYEQPPWSSQQDQPPPWGQREPPF 496
Query: 302 DVQRGPVYEAQRAPSYIP----QRGPGYDLQRGQGYDMR---------RAPSYDPSRGTG 348
+QR P + P P Q P ++ R M+ P Y P
Sbjct: 497 RMQRPPHFRGPFPPHQQPPPFNQPPPPHNFGRFPPRFMQDDFPPRHHFERPPYPPHH--- 553
Query: 349 FDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGN 389
FD P+G P G++ PP ++ P PP S P GGN
Sbjct: 554 FD-YPQGDFP-GEIGPPPHHHPNQRIPPPGLSDP-PPWGGN 591
>UNIPROTKB|Q767K9 [details] [associations]
symbol:PPP1R10 "Serine/threonine-protein phosphatase 1
regulatory subunit 10" species:9823 "Sus scrofa" [GO:0000785
"chromatin" evidence=ISS] [GO:0072357 "PTW/PP1 phosphatase complex"
evidence=ISS] [GO:0005634 "nucleus" evidence=IEA] [GO:0004864
"protein phosphatase inhibitor activity" evidence=IEA] [GO:0003723
"RNA binding" evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA]
[GO:0008270 "zinc ion binding" evidence=IEA] [GO:0006351
"transcription, DNA-dependent" evidence=IEA] InterPro:IPR000571
InterPro:IPR003617 InterPro:IPR017923 Pfam:PF00642 Pfam:PF08711
PROSITE:PS50103 PROSITE:PS51319 SMART:SM00356 SMART:SM00509
GO:GO:0005634 GO:GO:0046872 GO:GO:0003677 GO:GO:0008270
GO:GO:0000785 GO:GO:0006351 GO:GO:0003723 EMBL:AB113357
GO:GO:0004864 Gene3D:1.20.930.10 SUPFAM:SSF47676 GO:GO:0072357
CTD:5514 eggNOG:NOG69306 HOGENOM:HOG000049285 HOVERGEN:HBG053646
OMA:PPPHEHR OrthoDB:EOG451DQK GeneTree:ENSGT00530000063820
RefSeq:NP_001116637.1 UniGene:Ssc.39454 ProteinModelPortal:Q767K9
Ensembl:ENSSSCT00000001463 Ensembl:ENSSSCT00000034462
GeneID:100144450 KEGG:ssc:100144450 ArrayExpress:Q767K9
Uniprot:Q767K9
Length = 925
Score = 120 (47.3 bits), Expect = 0.00075, P = 0.00075
Identities = 71/271 (26%), Positives = 86/271 (31%)
Query: 128 GSYGGATGNSENETS-GRPV-GQNAYEDGYGVPQGH---GPPPSATTAGVVGAGPNTSTS 182
G GG G G P+ G + G G P G GPPP GP
Sbjct: 632 GGPGGPKGMQHFPPGPGGPMPGPHGGPGGPGGPVGPRLLGPPPPPRGGDPFWDGPGDP-- 689
Query: 183 AYAATQSGTPMRAAYDIPRGPG-YEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTK 241
G PMR P GPG Y +G P P +G + G +
Sbjct: 690 -----MRGGPMRGGPG-P-GPGPYHRGRGGRGGNEPPPPPPPFRGARGGRSGGGPPNGRG 742
Query: 242 GPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGY 301
GPG G + GP + G + P G G M G + GPG G+
Sbjct: 743 GPGGGMVGGGGHRPHEGPGGGMSSGSGHRPHEGPGGGM--GGGHRPHEGPGGGMGG--GH 798
Query: 302 DVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQ 361
GP + P GPG + G G+ P G G G P G PH
Sbjct: 799 RPHEGPGGGMGGGSGHRPHEGPGGGMGAGGGHR--------PHEGPGH-GGPHGHRPH-D 848
Query: 362 VPPPLNNVPYGSATPPARSGSGQPRGGNPAR 392
VP + G R G GG R
Sbjct: 849 VPGHRGHDHRGPPPHEHRGHDGPGHGGGGHR 879
>UNIPROTKB|P12107 [details] [associations]
symbol:COL11A1 "Collagen alpha-1(XI) chain" species:9606
"Homo sapiens" [GO:0046872 "metal ion binding" evidence=IEA]
[GO:0001502 "cartilage condensation" evidence=IEA] [GO:0002063
"chondrocyte development" evidence=IEA] [GO:0006029 "proteoglycan
metabolic process" evidence=IEA] [GO:0042472 "inner ear
morphogenesis" evidence=IEA] [GO:0048704 "embryonic skeletal system
morphogenesis" evidence=IEA] [GO:0055010 "ventricular cardiac
muscle tissue morphogenesis" evidence=IEA] [GO:0005201
"extracellular matrix structural constituent" evidence=NAS]
[GO:0005592 "collagen type XI" evidence=IDA;NAS] [GO:0030198
"extracellular matrix organization" evidence=NAS;TAS] [GO:0030674
"protein binding, bridging" evidence=NAS] [GO:0007601 "visual
perception" evidence=IMP] [GO:0007605 "sensory perception of sound"
evidence=IMP] [GO:0050910 "detection of mechanical stimulus
involved in sensory perception of sound" evidence=IMP] [GO:0030199
"collagen fibril organization" evidence=NAS] [GO:0050840
"extracellular matrix binding" evidence=NAS] [GO:0005576
"extracellular region" evidence=TAS] [GO:0005788 "endoplasmic
reticulum lumen" evidence=TAS] InterPro:IPR000885 Pfam:PF01410
ProDom:PD002078 PROSITE:PS51461 SMART:SM00038 Reactome:REACT_118779
GO:GO:0046872 GO:GO:0007601 GO:GO:0030199 GO:GO:0030674
EMBL:CH471097 Gene3D:2.60.120.200 InterPro:IPR008985
InterPro:IPR013320 SUPFAM:SSF49899 GO:GO:0005788 GO:GO:0042472
GO:GO:0050910 InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793
InterPro:IPR001791 SMART:SM00282 PROSITE:PS50025 GO:GO:0001502
GO:GO:0048704 GO:GO:0001503 GO:GO:0050840 GO:GO:0006029
GO:GO:0055010 Pfam:PF02210 GO:GO:0005201 GO:GO:0002063
HOVERGEN:HBG004933 KO:K06236 OrthoDB:EOG49GKHM SMART:SM00210
EMBL:J04177 EMBL:AF101112 EMBL:AF101079 EMBL:AF101080 EMBL:AF101081
EMBL:AF101082 EMBL:AF101083 EMBL:AF101084 EMBL:AF101085
EMBL:AF101086 EMBL:AF101087 EMBL:AF101088 EMBL:AF101089
EMBL:AF101090 EMBL:AF101091 EMBL:AF101092 EMBL:AF101093
EMBL:AF101094 EMBL:AF101095 EMBL:AF101096 EMBL:AF101097
EMBL:AF101098 EMBL:AF101099 EMBL:AF101100 EMBL:AF101101
EMBL:AF101102 EMBL:AF101103 EMBL:AF101104 EMBL:AF101105
EMBL:AF101106 EMBL:AF101107 EMBL:AF101108 EMBL:AF101109
EMBL:AF101110 EMBL:AF101111 EMBL:AL627203 EMBL:AC093150
EMBL:AC099567 EMBL:L38956 IPI:IPI00218539 IPI:IPI00218540
IPI:IPI00295575 PIR:A35239 RefSeq:NP_001177638.1 RefSeq:NP_001845.3
RefSeq:NP_542196.2 UniGene:Hs.523446 ProteinModelPortal:P12107
SMR:P12107 STRING:P12107 PhosphoSite:P12107 DMDM:215274245
PaxDb:P12107 PRIDE:P12107 Ensembl:ENST00000353414
Ensembl:ENST00000358392 Ensembl:ENST00000370096 GeneID:1301
KEGG:hsa:1301 UCSC:uc001dul.3 UCSC:uc001dum.3 UCSC:uc001dun.3
CTD:1301 GeneCards:GC01M103342 H-InvDB:HIX0028847 HGNC:HGNC:2186
MIM:120280 MIM:154780 MIM:228520 MIM:604841 neXtProt:NX_P12107
Orphanet:2021 Orphanet:560 Orphanet:90654 PharmGKB:PA26702
OMA:HPGKEGQ GenomeRNAi:1301 NextBio:5283 PMAP-CutDB:B1ASK7
ArrayExpress:P12107 Bgee:P12107 CleanEx:HS_COL11A1
Genevestigator:P12107 GermOnline:ENSG00000060718 GO:GO:0005592
Uniprot:P12107
Length = 1806
Score = 123 (48.4 bits), Expect = 0.00076, P = 0.00076
Identities = 74/250 (29%), Positives = 96/250 (38%)
Query: 156 GVPQGHGPPPSATTAGVVGA-GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYD 214
G P GPP G G GP T P R D +GP A +
Sbjct: 466 GDPGDRGPPGRPGLPGADGLPGP-------PGTMLMLPFRYGGDGSKGPTISAQEAQA-Q 517
Query: 215 ASKAPSYDPTKGPSYDPA-KG-PGYDPTKGPGYDAQKGSNYD-AQRGPN-YDIHRGPSYD 270
A + +GP G PG P GPG KG + D +GP GP+
Sbjct: 518 AILQQARIALRGPPGPMGLTGRPG--PVGGPGSSGAKGESGDPGPQGPRGVQGPPGPTGK 575
Query: 271 P-QRGL-GYDMQRG-PNYDMQRGP-GYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYD 326
P +RG G D RG P +G G++ +PG +G + +R P P PG D
Sbjct: 576 PGKRGRPGADGGRGMPGEPGAKGDRGFDG--LPGLPGDKG--HRGERGPQG-PPGPPGDD 630
Query: 327 LQRGQGYDM--RRAPSYDPSRGT----GFDGAPR--GAAPHGQVPPPLNNV-PYGSATPP 377
RG+ ++ R P RG G GAP G A P P N+ P G PP
Sbjct: 631 GMRGEDGEIGPRGLPGEAGPRGLLGPRGTPGAPGQPGMAGVDGPPGPKGNMGPQGEPGPP 690
Query: 378 ARSGSGQPRG 387
+ G+ P+G
Sbjct: 691 GQQGNPGPQG 700
>WB|WBGene00001734 [details] [associations]
symbol:grl-25 species:6239 "Caenorhabditis elegans"
[GO:0016021 "integral to membrane" evidence=IEA] [GO:0040010
"positive regulation of growth rate" evidence=IMP] [GO:0009792
"embryo development ending in birth or egg hatching" evidence=IMP]
[GO:0000003 "reproduction" evidence=IMP] GO:GO:0009792
GO:GO:0040010 GO:GO:0000003 EMBL:Z11126
GeneTree:ENSGT00570000079107 EMBL:Z12018 RefSeq:NP_001023025.1
ProteinModelPortal:G5EDQ6 EnsemblMetazoa:ZK643.8 GeneID:176265
KEGG:cel:CELE_ZK643.8 CTD:176265 WormBase:ZK643.8 OMA:QYLGAYA
NextBio:891834 Uniprot:G5EDQ6
Length = 774
Score = 119 (46.9 bits), Expect = 0.00077, P = 0.00077
Identities = 70/278 (25%), Positives = 101/278 (36%)
Query: 126 ADGSYGGATGNSENETSGRPV----GQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTST 181
+ S GG +G E+ +SG G ++ G G G S++++G G ++S+
Sbjct: 343 SSSSGGGYSGGGESSSSGGSSYSSGGDSSSSSGGGYSSGGDSSSSSSSSGGYSGGSDSSS 402
Query: 182 SAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGP----SYDPAKGPGY 237
S+ ++ SG D G E+S GY S + + + G S +PA P
Sbjct: 403 SS--SSSSGGYSSGGGDAGASSGGESSSAGGYSGSSSSGGEASSGGYSGGSSEPAPAPEA 460
Query: 238 DPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQR 297
P GY GS + P PS G + P G E
Sbjct: 461 APASSGGYSG--GSEAAPEAAP-----AAPS-GGYSGSEAAPEAAPAAPSGGYSGSEAAP 512
Query: 298 VPGYDVQRGPVYEAQRAPSYIPQR-GPGYDLQRGQGYDMRRAPSYDPSRG-TGFDGAPRG 355
G ++ AP P GY G AP+ PS G +G + AP
Sbjct: 513 EAAPAAPSGGYSGSEAAPEAAPAAPSGGYS---GSEAAPEAAPAA-PSGGYSGSEAAPEA 568
Query: 356 A--APHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPA 391
A AP G ++ P +A PA S G GG A
Sbjct: 569 APAAPSGGYSGSESSAP--AAPEPAPSSGGYSGGGGDA 604
>WB|WBGene00000639 [details] [associations]
symbol:col-63 species:6239 "Caenorhabditis elegans"
[GO:0042302 "structural constituent of cuticle" evidence=IEA]
[GO:0016021 "integral to membrane" evidence=IEA] InterPro:IPR002486
Pfam:PF01484 SMART:SM01088 GO:GO:0042302 HOGENOM:HOG000085656
GeneTree:ENSGT00530000064217 EMBL:Z81143 PIR:T27806
RefSeq:NP_492245.1 ProteinModelPortal:Q94399 STRING:Q94399
EnsemblMetazoa:ZK265.2 GeneID:172607 KEGG:cel:CELE_ZK265.2
UCSC:ZK265.2 CTD:172607 WormBase:ZK265.2 eggNOG:NOG289407
InParanoid:Q94399 OMA:ENGQDGQ NextBio:876231 Uniprot:Q94399
Length = 381
Score = 115 (45.5 bits), Expect = 0.00077, P = 0.00077
Identities = 82/282 (29%), Positives = 103/282 (36%)
Query: 132 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVG-AGPNTSTSAYAATQSG 190
G G + N S G + V GPP G G GP+ G
Sbjct: 122 GPAGRAGNPGSDSTEGDRMADFNKDVKCPAGPPGPPGPNGFPGHPGPDGDFGV-----DG 176
Query: 191 TPMRAAYDIPRGP-GYEASKG-PGYDASKAP-SYDPTKGPSYD-PAKGPGYDPTKGPGYD 246
T + P GP G E + G PG P + T+G P PG P GPG D
Sbjct: 177 TNGKDGEPGPDGPEGDEGTPGLPGPPGEDGPVGQNGTRGQGQPGPVGAPGA-PG-GPGRD 234
Query: 247 AQKGSN-YDAQRGPNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQRGPGYETQRV--PGYD 302
+ G N D Q+GP GP+ G D Q G P D GP + V PG D
Sbjct: 235 GEPGENGQDGQQGPE-----GPA-------GADGQPGHPGPD---GPSGDVGEVGAPGAD 279
Query: 303 VQRGPV--YEAQRAPSYIPQRGPG-YDLQRG---QGYDMRRAPSYDPSRGTGFDGAPRGA 356
P A+ A + P Y+ +GYD +P+ P+ G+D AP
Sbjct: 280 AAYCPCPPRSAEMAATGSSDSQPASYEAPAPAATKGYD---SPA--PAAPKGYD-APAPT 333
Query: 357 APHGQVPPPLNNVP-----YGSATPPARSGSGQPRGGNPARR 393
APH PPP P Y S P A + P P +R
Sbjct: 334 APHP--PPPAPVAPPKLHDYESPAPVADAHDAAP-AAQPYKR 372
>ZFIN|ZDB-GENE-070501-8 [details] [associations]
symbol:col6a3 "collagen, type VI, alpha 3"
species:7955 "Danio rerio" [GO:0004867 "serine-type endopeptidase
inhibitor activity" evidence=IEA] InterPro:IPR002035
InterPro:IPR002223 InterPro:IPR003961 Pfam:PF00014 Pfam:PF00092
PRINTS:PR00759 PROSITE:PS50234 PROSITE:PS50279 PROSITE:PS50853
SMART:SM00131 SMART:SM00327 ZFIN:ZDB-GENE-070501-8 GO:GO:0004867
Gene3D:4.10.410.10 InterPro:IPR020901 SUPFAM:SSF57362
PROSITE:PS00280 InterPro:IPR008160 Pfam:PF01391
GeneTree:ENSGT00530000063022 EMBL:CR545476 IPI:IPI01023461
Ensembl:ENSDART00000138754 ArrayExpress:F1QKE8 Uniprot:F1QKE8
Length = 3733
Score = 126 (49.4 bits), Expect = 0.00078, P = 0.00078
Identities = 68/203 (33%), Positives = 82/203 (40%)
Query: 204 GYEASKG-PGYD-ASKAPSYDPTKGPSYDPAKGPGYDPTKG-PGYDAQKGSNYDAQRGPN 260
G G PG D + P DP G + PA PG D KG PG ++GS D +RGP
Sbjct: 2701 GIRGDPGTPGRDNTQRGPKGDP--GDA-GPAGEPGVDGNKGGPGEPGRRGS--DGRRGPP 2755
Query: 261 YDIHRG--PSYDPQRGL-GYDMQRGPNYDMQ----RGP-GYETQRVPGYDVQRGPVYEAQ 312
P D G G RGP + RG G R PG Q GP E
Sbjct: 2756 GQAGAAGRPGSDGLAGEPGIGGSRGPAGPIGAPGVRGEDGNPGPRGPGG--QPGPAGEKG 2813
Query: 313 RAPSYIPQRG-PGYDLQRG-QG-YDMRRAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNV 369
R + + ++G PG +G G + R P D G GF G P+G G P
Sbjct: 2814 RRGA-VGRKGEPGEPGPKGVTGPFGPRGEPGEDGRDGFGFPG-PKGRK--GDEGFPGFPG 2869
Query: 370 PYGSATPPARSGSGQPRGGNPAR 392
P G A P +G PRG N R
Sbjct: 2870 PKGEAGDPGTNGGPGPRGNNGQR 2892
>MGI|MGI:3645678 [details] [associations]
symbol:Flg2 "filaggrin family member 2" species:10090 "Mus
musculus" [GO:0005509 "calcium ion binding" evidence=IEA]
[GO:0005575 "cellular_component" evidence=ND] [GO:0008150
"biological_process" evidence=ND] [GO:0046872 "metal ion binding"
evidence=IEA] InterPro:IPR001751 InterPro:IPR002048
InterPro:IPR011992 PROSITE:PS00303 PROSITE:PS50222 Prosite:PS00018
MGI:MGI:3645678 GO:GO:0005509 Gene3D:1.10.238.10 InterPro:IPR018247
eggNOG:NOG12793 InterPro:IPR013787 Pfam:PF01023 CTD:388698
HOGENOM:HOG000112590 KO:K10384 OrthoDB:EOG4RJG10 EMBL:DQ118292
EMBL:AK036878 IPI:IPI00406870 RefSeq:NP_001013826.1
UniGene:Mm.10755 HSSP:P24480 ProteinModelPortal:Q2VIS4 SMR:Q2VIS4
STRING:Q2VIS4 PhosphoSite:Q2VIS4 PaxDb:Q2VIS4 PRIDE:Q2VIS4
GeneID:229574 KEGG:mmu:229574 UCSC:uc008qfe.1 InParanoid:Q2VIS4
NextBio:379521 Genevestigator:Q2VIS4 Uniprot:Q2VIS4
Length = 2362
Score = 124 (48.7 bits), Expect = 0.00079, P = 0.00079
Identities = 64/247 (25%), Positives = 95/247 (38%)
Query: 128 GSYGGATGNSENET----SGRPVGQNAYEDGYGVPQ-GHGPPPSATTAGVVGAGPNTSTS 182
G G G+ + E+ GRP G + +D PQ G G P + + G P S
Sbjct: 1773 GQGQGQAGHQQRESVHGQRGRPQGPS--QDSSRQPQAGQGQPSQSGSGRSPGRSPVHPES 1830
Query: 183 AYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKG 242
+ S P R + G G+ +G + S +G P++ P G
Sbjct: 1831 SEGEEHSVVPQRHSES---GHGHGQGQGQAGHQQRE-SVHGQRGRPQGPSQDSSRQPQAG 1886
Query: 243 PGYDAQKGSNYDAQRGPNY-DIHRGPSYD--PQRGLGYDMQRGPNYDMQRGPGYETQR-- 297
G +Q GS +R P + + G + PQR G G + +G QR
Sbjct: 1887 QGQPSQSGSGRSPRRSPVHPESSEGEEHSVVPQRHSG----SGHGHGQGQGQAGHQQRES 1942
Query: 298 VPGYDVQ-RGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSY-DPSRGTGFDGAPR- 354
V G V+ +GP +Q + S PQ G Q G G RR+P + + S G P+
Sbjct: 1943 VHGQPVRPQGP---SQDSSSQ-PQASQGQPSQSGSGRSPRRSPVHPESSEGEEHSVVPQR 1998
Query: 355 -GAAPHG 360
+ HG
Sbjct: 1999 HSGSGHG 2005
Score = 124 (48.7 bits), Expect = 0.00079, P = 0.00079
Identities = 61/245 (24%), Positives = 89/245 (36%)
Query: 128 GSYGGATGNSENET----SGRPVGQNAYEDGYGVPQ-GHGPPPSATTAGVVGAGPNTSTS 182
G G G+ + E+ GRP G +D PQ G G P + + P S
Sbjct: 1459 GQGQGQAGHQQRESVHGQRGRPQGPT--QDSSRQPQAGQGQPSQSGSGRSPRRSPVHPES 1516
Query: 183 AYAATQSGTPMRAAYDIPRGPGYEASKGPGYDA-SKAPSYDPTKGPSYDPAKGPGYDPTK 241
+ S P R + G G+ +G G + S +G P++ P
Sbjct: 1517 SEGEEHSVVPQRHSGS---GHGHGHGQGQGQAGHQQRESVHGQRGRPQGPSQDSSRQPQA 1573
Query: 242 GPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGY 301
G G +Q GS +R P +H S + + G + +G QR +
Sbjct: 1574 GQGQPSQSGSGRSPRRSP---VHPESSEGEEHSVVPQRYSGSGHGHGQGQAGHQQRESVH 1630
Query: 302 DVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSY-DPSRGTGFDGAPR---GAA 357
QRG + S PQ G G Q G G RR+P + + S G P+ G+
Sbjct: 1631 G-QRGRPQGPSQDSSRQPQAGQGQPSQSGSGRSPRRSPVHPESSEGEEHSVIPQRHSGSG 1689
Query: 358 -PHGQ 361
HGQ
Sbjct: 1690 HSHGQ 1694
>UNIPROTKB|F1P7J0 [details] [associations]
symbol:SFPQ "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0003676 "nucleic acid binding" evidence=IEA]
[GO:0000166 "nucleotide binding" evidence=IEA] InterPro:IPR000504
InterPro:IPR012677 Pfam:PF00076 PROSITE:PS50102 SMART:SM00360
GO:GO:0000166 Gene3D:3.30.70.330 GO:GO:0003676 InterPro:IPR012975
Pfam:PF08075 GeneTree:ENSGT00390000005004 OMA:APGGHPK
EMBL:AAEX03009659 EMBL:AAEX03009658 Ensembl:ENSCAFT00000005784
Uniprot:F1P7J0
Length = 659
Score = 118 (46.6 bits), Expect = 0.00081, P = 0.00081
Identities = 55/163 (33%), Positives = 57/163 (34%)
Query: 158 PQGHGPPPSATTAGVVGAGPNTSTSAYAATQ--SGTPMRAAYDIPRGPGYEASKGPGYDA 215
PQG GP P VG+ P S SA AT SG P P P S PG
Sbjct: 68 PQGPGPAPG------VGSAPPASGSAPPATPPTSGAPAGPG-PTPTPPPAVTSAPPGAPP 120
Query: 216 SKAPSYD-PTK-----GPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQR----GPNYDIHR 265
PS PT GP PA GPG P +GPG KG GP
Sbjct: 121 PAPPSSGVPTTPPQAGGPPPPPAGGPGPGPKQGPGPGGPKGGKMPGGPKPGGGPGLSTPG 180
Query: 266 GPSYDPQRGLGYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPV 308
G P RG G P Q P Y Q G GPV
Sbjct: 181 GHPKPPHRGGGE-----PRGGRQHHPPYHQQHHQG-PPPGGPV 217
>UNIPROTKB|I3LJD1 [details] [associations]
symbol:ZMIZ1 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0008270 "zinc ion binding" evidence=IEA]
InterPro:IPR004181 Pfam:PF02891 PROSITE:PS51044 GO:GO:0008270
Gene3D:3.30.40.10 InterPro:IPR013083 GeneTree:ENSGT00550000074410
EMBL:CT827949 EMBL:CT827837 Ensembl:ENSSSCT00000025452
Uniprot:I3LJD1
Length = 1021
Score = 120 (47.3 bits), Expect = 0.00084, P = 0.00084
Identities = 65/232 (28%), Positives = 86/232 (37%)
Query: 176 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYD-ASKAPSYDPT--KGP--SYD 230
GP S+ TQ+ PRGP AS G + AS A P+ GP +
Sbjct: 265 GPVCSSFQMGPTQAYNSQFMNQPGPRGP---ASMGGSMNPASMAAGMTPSGMSGPPMGMN 321
Query: 231 PAKGPGYDP--TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ 288
+ PG P T G Q Q P I R +P G + Q GPN
Sbjct: 322 QPRPPGISPFGTHGQRMPQQTYPGPRPQSLPIQSIKRPYPGEPNYG---NQQYGPNSQFP 378
Query: 289 RGPG-YETQRVPGYDVQRGPVYEAQRAPSYIPQRG--PGYDLQRGQGYDMRRAPSYDPS- 344
PG Y T P P Y QR PS P G P + GQ Y + + +
Sbjct: 379 TQPGQYPTPNPPR--PLTSPNYPGQRMPSQ-PSTGQYPPPTVNMGQYYKPEQFNGQNNTF 435
Query: 345 RGTGFDGAPRGAAPHGQVPPPLNNVPY----GSATPPARSGSGQPRGGNPAR 392
G+ + +G P P+ N P+ G+ TPP GS P +P++
Sbjct: 436 SGSSYSNYSQGNVNRPPRPVPVANYPHSPVPGNPTPPMTPGSSIPPYLSPSQ 487
>UNIPROTKB|E1BI98 [details] [associations]
symbol:COL6A1 "Uncharacterized protein" species:9913 "Bos
taurus" [GO:0071230 "cellular response to amino acid stimulus"
evidence=IEA] [GO:0070208 "protein heterotrimerization"
evidence=IEA] [GO:0048407 "platelet-derived growth factor binding"
evidence=IEA] [GO:0043234 "protein complex" evidence=IEA]
[GO:0042383 "sarcolemma" evidence=IEA] [GO:0031012 "extracellular
matrix" evidence=IEA] [GO:0005576 "extracellular region"
evidence=IEA] InterPro:IPR002035 Pfam:PF00092 PROSITE:PS50234
SMART:SM00327 GO:GO:0005576 GO:GO:0043234 GO:GO:0042383
GO:GO:0070208 InterPro:IPR008160 Pfam:PF01391 GO:GO:0031012
GO:GO:0071230 CTD:1291 KO:K06238 OMA:VKENYAE
GeneTree:ENSGT00530000063022 EMBL:DAAA02003502 IPI:IPI00713573
RefSeq:NP_001137337.1 UniGene:Bt.23508 PRIDE:E1BI98
Ensembl:ENSBTAT00000015668 GeneID:511422 KEGG:bta:511422
NextBio:20869920 Uniprot:E1BI98
Length = 1027
Score = 120 (47.3 bits), Expect = 0.00085, P = 0.00084
Identities = 67/206 (32%), Positives = 85/206 (41%)
Query: 200 PRG-PGYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGP-GYDAQKGSNYDAQ 256
PRG PGYE +G PG K + DP + P GY KG G +KGS
Sbjct: 261 PRGDPGYEGERGKPGLPGEKGEAGDPGRPGDLGPV---GYQGMKGEKGSRGEKGS----- 312
Query: 257 RGPN-YDIHRGPSYDPQRGL-GYDMQRGPNYDMQRGPGYETQRVPGYDVQRGPVYEAQRA 314
RGP Y +G +RG+ G D +G PG + PG+D +GP
Sbjct: 313 RGPKGYKGEKG-----KRGMDGVDGMKGET-GFPGLPGCKGS--PGFDGIQGP------- 357
Query: 315 PSYIPQRGPG-YDL--QRGQ-GYDMR--RAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNN 368
P P+ PG + L Q+G+ G D R S P G G P G+ N
Sbjct: 358 PG--PKGDPGAFGLKGQKGEPGADGEPGRPGSTGPPGDEGEPGEPGPPGEKGEAGDEGNA 415
Query: 369 VPYGSATPPARSGSGQ--PRGGNPAR 392
P G+ P R G G+ PRG AR
Sbjct: 416 GPDGA--PGERGGPGERGPRGTPGAR 439
>DICTYBASE|DDB_G0292870 [details] [associations]
symbol:pex13 "peroxin 13" species:44689
"Dictyostelium discoideum" [GO:0016560 "protein import into
peroxisome matrix, docking" evidence=IEA;ISS] [GO:0016021 "integral
to membrane" evidence=IEA] [GO:0005777 "peroxisome" evidence=IEA]
[GO:0003674 "molecular_function" evidence=ND] [GO:0005779 "integral
to peroxisomal membrane" evidence=ISS] [GO:0016020 "membrane"
evidence=IEA] [GO:0015031 "protein transport" evidence=IEA]
[GO:0006810 "transport" evidence=IEA] [GO:0005778 "peroxisomal
membrane" evidence=IEA] Pfam:PF00018 InterPro:IPR001452
InterPro:IPR007223 Pfam:PF04088 PROSITE:PS50002 SMART:SM00326
dictyBase:DDB_G0292870 GenomeReviews:CM000155_GR EMBL:AAFI02000197
GO:GO:0005779 SUPFAM:SSF50044 HSSP:Q64010 GO:GO:0016560
RefSeq:XP_629403.1 ProteinModelPortal:Q54CL3
EnsemblProtists:DDB0238077 GeneID:8628922 KEGG:ddi:DDB_G0292870
eggNOG:NOG312130 OMA:SWMEALH Uniprot:Q54CL3
Length = 570
Score = 117 (46.2 bits), Expect = 0.00085, P = 0.00085
Identities = 44/135 (32%), Positives = 55/135 (40%)
Query: 129 SYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQ 188
S+GG G S S G Y D Y G+G ++ G G+G +S Y
Sbjct: 90 SFGGGVGGSSGYRSSYGGG---YRDSYS-SGGYGSSGYGSSYGSGGSG-GYGSSLYGG-- 142
Query: 189 SGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTKGPS-YDPAKGPGYDPTKGPGYDA 247
G Y G GY +S G GY +S Y + G S Y + G GY + G GY
Sbjct: 143 -GGYSSGGYG---GSGYGSSYGGGYGSSYGSGYGSSYGGSGYGSSYGGGYGSSYGGGYGG 198
Query: 248 QKGSNYDAQRGPNYD 262
G Y QRG YD
Sbjct: 199 GYGGGY-GQRG--YD 210
>UNIPROTKB|F1RGP4 [details] [associations]
symbol:PYGO2 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0060070 "canonical Wnt receptor signaling pathway"
evidence=IEA] [GO:0060021 "palate development" evidence=IEA]
[GO:0051569 "regulation of histone H3-K4 methylation" evidence=IEA]
[GO:0048589 "developmental growth" evidence=IEA] [GO:0042393
"histone binding" evidence=IEA] [GO:0033599 "regulation of mammary
gland epithelial cell proliferation" evidence=IEA] [GO:0030879
"mammary gland development" evidence=IEA] [GO:0009791
"post-embryonic development" evidence=IEA] [GO:0007420 "brain
development" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
[GO:0002088 "lens development in camera-type eye" evidence=IEA]
[GO:0001822 "kidney development" evidence=IEA] [GO:0001701 "in
utero embryonic development" evidence=IEA] [GO:0008270 "zinc ion
binding" evidence=IEA] InterPro:IPR001965 InterPro:IPR019787
Pfam:PF00628 PROSITE:PS50016 SMART:SM00249 GO:GO:0005634
GO:GO:0007420 GO:GO:0046872 GO:GO:0008270 GO:GO:0001701
GO:GO:0009791 GO:GO:0001822 Gene3D:3.30.40.10 InterPro:IPR011011
InterPro:IPR013083 SUPFAM:SSF57903 GO:GO:0048589 InterPro:IPR019786
PROSITE:PS01359 GO:GO:0060021 GO:GO:0060070 GO:GO:0030879
GO:GO:0033599 GO:GO:0051569 GO:GO:0002088
GeneTree:ENSGT00530000063948 CTD:90780 OMA:PGLVYPC EMBL:CU207227
RefSeq:NP_001172104.1 UniGene:Ssc.4680 Ensembl:ENSSSCT00000007162
GeneID:100157530 KEGG:ssc:100157530 Uniprot:F1RGP4
Length = 406
Score = 115 (45.5 bits), Expect = 0.00085, P = 0.00085
Identities = 77/294 (26%), Positives = 107/294 (36%)
Query: 117 MNAPNVDRRADGSYGGATGN-SENETSGRP-----VGQNAYEDGYGVPQGHGPPPSATTA 170
M +P RR + G A + +E P V N +ED +G P+ G P +
Sbjct: 38 MKSPEKKRRKSNTQGPAYSHLTEFAPPPTPMVDHLVASNPFEDDFGAPKVGGAAPPFLGS 97
Query: 171 GVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDPTK-GPSY 229
+ G Q G A +P G G GP + P + P GP++
Sbjct: 98 PIPFGG--------FRVQGGM----AGQVPPGYGTAGGGGPQPLRRQPPPFPPNPMGPAF 145
Query: 230 D-PAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYD-PQRGLGY----DMQRGP 283
+ P +GPGY P + +Q ++ G N+ G P G G M + P
Sbjct: 146 NMPPQGPGYPPPGNMNFPSQP---FNQPLGQNFSPPGGQMMPGPVGGFGPMISPTMGQPP 202
Query: 284 NYDMQRGPGYETQRV--PGYDVQRGPVYE-AQRAPSYIPQRGP--GYD--LQRGQGYDMR 336
++ GP QR PG P+ Q PS P P G D G D
Sbjct: 203 RGEL--GPPSLPQRFAQPGAPFGPSPLQRPGQGLPSLPPNTSPFPGPDPGFPAPGGEDGG 260
Query: 337 RAPSYDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNP 390
+ P +P T F P +P V N P + PP+ SG G GG P
Sbjct: 261 K-P-LNPPAPTAFPQEPHSGSPAAAVN---GNQP---SFPPSSSGRG---GGTP 303
>TAIR|locus:2036224 [details] [associations]
symbol:AT1G15830 "AT1G15830" species:3702 "Arabidopsis
thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
"biological_process" evidence=ND] EMBL:CP002684
GenomeReviews:CT485782_GR EMBL:AC034256 IPI:IPI00522201
RefSeq:NP_173035.1 UniGene:At.41914 PaxDb:Q3EDB7 PRIDE:Q3EDB7
EnsemblPlants:AT1G15830.1 GeneID:838153 KEGG:ath:AT1G15830
TAIR:At1g15830 eggNOG:NOG303006 HOGENOM:HOG000131777
InParanoid:Q3EDB7 OMA:VMQGCGG ProtClustDB:CLSN2912688
Genevestigator:Q3EDB7 Uniprot:Q3EDB7
Length = 483
Score = 116 (45.9 bits), Expect = 0.00086, P = 0.00086
Identities = 72/251 (28%), Positives = 89/251 (35%)
Query: 124 RRADGSYGG---ATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAG---VVGAGP 177
RR G GG EN SG G +A G G P G PP G + GA P
Sbjct: 65 RRKTGDGGGDPVVISGGENHASGGMGGTSATRGGGGEPVIPGAPPPNRGGGETVIPGAPP 124
Query: 178 NTSTSAYAATQSGTP--MRAAYDIPRGPGYEASKGPGYDASKAPSYDPTK-GPSYDPAKG 234
G P R P PG K G P P K G +P
Sbjct: 125 PIRGGGGEPAIPGAPPPKRGGGGEPVIPGAPPPKRGGGGEPVIPGAPPPKRGGGGEPVI- 183
Query: 235 PGYDPTK--GPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDM-QRGP 291
PG P K G G G+ + G + G P+RG G + P + +RG
Sbjct: 184 PGAPPPKRGGGGEPVIPGAPPPKRGGGGEPVIPGAP-PPKRGGGGEPVI-PGAPLPKRGG 241
Query: 292 GYETQRVPGYDVQR--GPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGF 349
G E+ VPG + G V + P RG G D G+G + R D G G
Sbjct: 242 GGESV-VPGAPPPKRGGGVIVNGGCETVPPGRGGGGDKTNGRGGEGREE---DNGGGRGA 297
Query: 350 DGAPRGAAPHG 360
+G RG+ G
Sbjct: 298 EGGGRGSTGEG 308
>UNIPROTKB|F1NDF5 [details] [associations]
symbol:COL4A5 "Uncharacterized protein" species:9031
"Gallus gallus" [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] [GO:0005587 "collagen type IV"
evidence=IEA] [GO:0005605 "basal lamina" evidence=IEA] [GO:0007528
"neuromuscular junction development" evidence=IEA] [GO:0031594
"neuromuscular junction" evidence=IEA] InterPro:IPR001442
Pfam:PF01413 PROSITE:PS51403 SMART:SM00111 InterPro:IPR016187
SUPFAM:SSF56436 GO:GO:0031594 GO:GO:0005605 InterPro:IPR008160
Pfam:PF01391 GO:GO:0005201 GO:GO:0005587 Gene3D:2.170.240.10
GeneTree:ENSGT00690000101772 OMA:MPMNMEP EMBL:AADN02013568
EMBL:AADN02013569 EMBL:AADN02013570 EMBL:AADN02013571
EMBL:AADN02013572 IPI:IPI00583230 Ensembl:ENSGALT00000013221
ArrayExpress:F1NDF5 Uniprot:F1NDF5
Length = 1658
Score = 122 (48.0 bits), Expect = 0.00088, P = 0.00088
Identities = 80/270 (29%), Positives = 101/270 (37%)
Query: 132 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGT 191
G G+ + G P G + G+P G P S AG G P S A + G
Sbjct: 1030 GEKGDPGLSSIGIP-GLPGPKGDLGLPGYPGSPGSKGIAGNPGL-PGLPGSPGAKGEPGL 1087
Query: 192 P-MRAAYDIPRGPGYEASKG-PGYDASKAPSYDPTK-GPSYDPA-KG-PGYDPTKGP-GY 245
P IP G E G PG P D + GP P KG PG D GP G
Sbjct: 1088 PGFPGTPGIPGPKGIEGPPGNPGLPGPPGPVGDTGRPGPPGPPGEKGQPGRDGIPGPAGQ 1147
Query: 246 DAQKGSNYDAQRGPNYDIHRG-PSYDPQRG-LGYDMQRGP-NYDMQRG-PGYE----TQR 297
+ G + GP G P Q+G LG GP +G PG++ Q
Sbjct: 1148 KGEPGLPGFGRPGPP-----GLPGLSGQKGELGLPGPPGPPGLPGLKGEPGFQGFPGLQG 1202
Query: 298 VPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAA 357
PG GP E + S P PG + G+G + +P + RG G +G
Sbjct: 1203 PPGPPGLPGPPLEGPKG-SPGPPGVPG---RPGKGM-IHGSPGPEGPRGPPGSGGLKGEK 1257
Query: 358 PH-GQVPPPLNNVPYGSATPPARSGS-GQP 385
+ GQ PP G PP R G G+P
Sbjct: 1258 GNPGQPGPPGLTGQKGDQGPPGRQGDPGRP 1287
>UNIPROTKB|E2RGZ0 [details] [associations]
symbol:ZMIZ1 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0048844 "artery morphogenesis" evidence=IEA]
[GO:0048589 "developmental growth" evidence=IEA] [GO:0048146
"positive regulation of fibroblast proliferation" evidence=IEA]
[GO:0045944 "positive regulation of transcription from RNA
polymerase II promoter" evidence=IEA] [GO:0007569 "cell aging"
evidence=IEA] [GO:0007296 "vitellogenesis" evidence=IEA]
[GO:0003007 "heart morphogenesis" evidence=IEA] [GO:0001701 "in
utero embryonic development" evidence=IEA] [GO:0001570
"vasculogenesis" evidence=IEA] [GO:0008270 "zinc ion binding"
evidence=IEA] InterPro:IPR004181 Pfam:PF02891 PROSITE:PS51044
GO:GO:0008270 Gene3D:3.30.40.10 InterPro:IPR013083
GeneTree:ENSGT00550000074410 OMA:MNQYGPM EMBL:AAEX03002865
EMBL:AAEX03002866 EMBL:AAEX03002867 EMBL:AAEX03002868
EMBL:AAEX03002869 EMBL:AAEX03002870 EMBL:AAEX03002871
EMBL:AAEX03002872 Ensembl:ENSCAFT00000024855 NextBio:20862292
Uniprot:E2RGZ0
Length = 1072
Score = 120 (47.3 bits), Expect = 0.00089, P = 0.00089
Identities = 65/232 (28%), Positives = 86/232 (37%)
Query: 176 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYD-ASKAPSYDPT--KGP--SYD 230
GP S+ TQ+ PRGP AS G + AS A P+ GP +
Sbjct: 318 GPVCSSFQMGPTQAYNSQFMNQPGPRGP---ASMGGSMNPASMAAGMTPSGMSGPPMGMN 374
Query: 231 PAKGPGYDP--TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ 288
+ PG P T G Q Q P I R +P G + Q GPN
Sbjct: 375 QPRPPGISPFGTHGQRMPQQTYPGPRPQSLPIQSIKRPYPGEPNYG---NQQYGPNSQFP 431
Query: 289 RGPG-YETQRVPGYDVQRGPVYEAQRAPSYIPQRG--PGYDLQRGQGYDMRRAPSYDPS- 344
PG Y T P P Y QR PS P G P + GQ Y + + +
Sbjct: 432 TQPGQYPTPNPPR--PLTSPNYPGQRMPSQ-PSTGQYPPPTVNMGQYYKPEQFNGQNNTF 488
Query: 345 RGTGFDGAPRGAAPHGQVPPPLNNVPY----GSATPPARSGSGQPRGGNPAR 392
G+ + +G P P+ N P+ G+ TPP GS P +P++
Sbjct: 489 SGSSYSNYSQGNVNRPPRPVPVANYPHSPVPGNPTPPMTPGSSIPPYLSPSQ 540
>UNIPROTKB|F1S2E4 [details] [associations]
symbol:ZMIZ1 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0008270 "zinc ion binding" evidence=IEA]
InterPro:IPR004181 Pfam:PF02891 PROSITE:PS51044 GO:GO:0008270
Gene3D:3.30.40.10 InterPro:IPR013083 GeneTree:ENSGT00550000074410
OMA:MNQYGPM EMBL:CT827949 EMBL:CT827837 Ensembl:ENSSSCT00000011307
Uniprot:F1S2E4
Length = 1072
Score = 120 (47.3 bits), Expect = 0.00089, P = 0.00089
Identities = 65/232 (28%), Positives = 86/232 (37%)
Query: 176 GPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYD-ASKAPSYDPT--KGP--SYD 230
GP S+ TQ+ PRGP AS G + AS A P+ GP +
Sbjct: 318 GPVCSSFQMGPTQAYNSQFMNQPGPRGP---ASMGGSMNPASMAAGMTPSGMSGPPMGMN 374
Query: 231 PAKGPGYDP--TKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQ 288
+ PG P T G Q Q P I R +P G + Q GPN
Sbjct: 375 QPRPPGISPFGTHGQRMPQQTYPGPRPQSLPIQSIKRPYPGEPNYG---NQQYGPNSQFP 431
Query: 289 RGPG-YETQRVPGYDVQRGPVYEAQRAPSYIPQRG--PGYDLQRGQGYDMRRAPSYDPS- 344
PG Y T P P Y QR PS P G P + GQ Y + + +
Sbjct: 432 TQPGQYPTPNPPR--PLTSPNYPGQRMPSQ-PSTGQYPPPTVNMGQYYKPEQFNGQNNTF 488
Query: 345 RGTGFDGAPRGAAPHGQVPPPLNNVPY----GSATPPARSGSGQPRGGNPAR 392
G+ + +G P P+ N P+ G+ TPP GS P +P++
Sbjct: 489 SGSSYSNYSQGNVNRPPRPVPVANYPHSPVPGNPTPPMTPGSSIPPYLSPSQ 540
>ZFIN|ZDB-GENE-030131-8415 [details] [associations]
symbol:col1a2 "collagen, type I, alpha 2"
species:7955 "Danio rerio" [GO:0005201 "extracellular matrix
structural constituent" evidence=IEA] [GO:0005581 "collagen"
evidence=IEA] InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078
PROSITE:PS51461 SMART:SM00038 ZFIN:ZDB-GENE-030131-8415
GO:GO:0005581 InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201
HOVERGEN:HBG004933 EMBL:AJ318213 IPI:IPI00502653 UniGene:Dr.75575
STRING:Q90YJ0 PRIDE:Q90YJ0 InParanoid:Q90YJ0 ArrayExpress:Q90YJ0
Bgee:Q90YJ0 Uniprot:Q90YJ0
Length = 1352
Score = 121 (47.7 bits), Expect = 0.00090, P = 0.00090
Identities = 86/282 (30%), Positives = 107/282 (37%)
Query: 128 GSYGGATGNSENETSGR--PVGQNAYEDGYGVPQGHGPPPSATTAG---VVGA-G-PNTS 180
G G A +GR P+G G G P GPP A AG +VGA G P +
Sbjct: 387 GPRGAAGTRGLPGLAGRSGPMGMPGPRGGVGAPGARGPPGDAGRAGEAGLVGARGLPGSP 446
Query: 181 TSAYAATQSGTPMRAAYDIPRGP-GYEASKG-PGYDASKAPSYDPTKGPSYDPAKGPGYD 238
S+ + G A D GP G +G PG P KGPS + K PG
Sbjct: 447 GSSGPPGKEGPSGAAGQDGRTGPPGPTGPRGQPGNIGFPGP-----KGPSGEAGK-PG-- 498
Query: 239 PTKGP-GYDAQKGS-NYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYETQ 296
KGP G +GS D GP + G + P G ++GP+ PG+ Q
Sbjct: 499 -EKGPVGPTGLRGSPGPDGNNGPAGPV--GLAGAP----GEKGEQGPS----GAPGF--Q 545
Query: 297 RVPGYDVQRGPVYEAQRAPSY-IPQ----RGP-GYDLQRGQGYDMRRAPSYDPSRGTGFD 350
+PG GPV EA + IP GP G +RG A + P G
Sbjct: 546 GLPG---PAGPVGEAGKPGDRGIPGDQGVSGPAGVKGERGNPGPAGAAGAQGPIGARGPS 602
Query: 351 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG--GNP 390
G P P G P P G+ P +G RG G P
Sbjct: 603 GTP---GPDGNKGEPGAVGPAGAPGPQGAAGMPGERGAAGTP 641
>WB|WBGene00000656 [details] [associations]
symbol:col-80 species:6239 "Caenorhabditis elegans"
[GO:0042302 "structural constituent of cuticle" evidence=IEA]
[GO:0016021 "integral to membrane" evidence=IEA] [GO:0040002
"collagen and cuticulin-based cuticle development" evidence=IMP]
InterPro:IPR002486 Pfam:PF01484 SMART:SM01088 GO:GO:0005581
InterPro:IPR008160 Pfam:PF01391 GO:GO:0040002 EMBL:Z46791
GO:GO:0042302 HOGENOM:HOG000085656 GeneTree:ENSGT00530000064217
PIR:T19143 RefSeq:NP_496310.1 ProteinModelPortal:Q09456
DIP:DIP-27389N MINT:MINT-1079432 STRING:Q09456
EnsemblMetazoa:C09G5.5 GeneID:174652 KEGG:cel:CELE_C09G5.5
UCSC:C09G5.5 CTD:174652 WormBase:C09G5.5 eggNOG:NOG285871
InParanoid:Q09456 OMA:VEIHTHH NextBio:884922 Uniprot:Q09456
Length = 317
Score = 113 (44.8 bits), Expect = 0.00091, P = 0.00091
Identities = 41/125 (32%), Positives = 54/125 (43%)
Query: 132 GATGNSENE-TSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAATQS- 189
GA GN + +G P G A+ G G P GP A + G GA N + +S
Sbjct: 143 GAPGNPGPQGPNGNP-GAPAHGGGQGPPGPPGPAGDAGSPGQAGAPGNPGRPGQSGQRSR 201
Query: 190 GTPMRAAYDIPRGPGYEASKGPGYDASKAPSYDP-TKGPSYDPAKGPGYDPTKG-PGYDA 247
G P + P+GP A PG ++ P+ P GP+ P PG D G PG D
Sbjct: 202 GLPGPSGRPGPQGPP-GAPGQPGSGSTPGPAGPPGPPGPNGQPGH-PGQDGQPGAPGNDG 259
Query: 248 QKGSN 252
GS+
Sbjct: 260 APGSD 264
>RGD|621351 [details] [associations]
symbol:Col1a2 "collagen, type I, alpha 2" species:10116 "Rattus
norvegicus" [GO:0001501 "skeletal system development" evidence=ISO]
[GO:0001568 "blood vessel development" evidence=ISO] [GO:0005201
"extracellular matrix structural constituent" evidence=IEA]
[GO:0005581 "collagen" evidence=IEA;ISO] [GO:0005584 "collagen type
I" evidence=ISO] [GO:0005615 "extracellular space" evidence=ISO]
[GO:0007179 "transforming growth factor beta receptor signaling
pathway" evidence=ISO] [GO:0007266 "Rho protein signal
transduction" evidence=ISO] [GO:0008217 "regulation of blood
pressure" evidence=ISO] [GO:0030199 "collagen fibril organization"
evidence=ISO] [GO:0030674 "protein binding, bridging" evidence=ISO]
[GO:0042802 "identical protein binding" evidence=ISO] [GO:0043589
"skin morphogenesis" evidence=ISO] [GO:0046332 "SMAD binding"
evidence=ISO] [GO:0046872 "metal ion binding" evidence=IEA]
[GO:0048407 "platelet-derived growth factor binding" evidence=ISO]
[GO:0070062 "extracellular vesicular exosome" evidence=ISO]
[GO:0070208 "protein heterotrimerization" evidence=ISO] [GO:0071230
"cellular response to amino acid stimulus" evidence=ISO]
[GO:0031012 "extracellular matrix" evidence=ISO] InterPro:IPR000885
Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461 SMART:SM00038
RGD:621351 GO:GO:0005615 GO:GO:0046872 GO:GO:0030199 GO:GO:0001501
GO:GO:0008217 GO:GO:0007179 GO:GO:0007266 GO:GO:0070208
InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793 GO:GO:0001568
GO:GO:0071230 GO:GO:0005201 GO:GO:0043589 HOGENOM:HOG000085654
HOVERGEN:HBG004933 KO:K06236 GO:GO:0005584 PDB:3HQV PDB:3HR2
PDBsum:3HQV PDBsum:3HR2 Reactome:REACT_150387 CTD:1278
OrthoDB:EOG412M65 EMBL:AF121217 IPI:IPI00188921 RefSeq:NP_445808.1
UniGene:Rn.107239 IntAct:P02466 STRING:P02466 PRIDE:P02466
GeneID:84352 KEGG:rno:84352 UCSC:RGD:621351 InParanoid:P02466
EvolutionaryTrace:P02466 NextBio:616663 PMAP-CutDB:P02466
ArrayExpress:P02466 Genevestigator:P02466
GermOnline:ENSRNOG00000011292 Uniprot:P02466
Length = 1372
Score = 121 (47.7 bits), Expect = 0.00092, P = 0.00092
Identities = 85/284 (29%), Positives = 105/284 (36%)
Query: 132 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTSAYAATQSG 190
G TG+ P G DG P G PP A G G GP T +AA S
Sbjct: 33 GPTGDRGPRGQRGPAGPRG-RDGVDGPVGPPGPPGAP--GPPGPPGPPGLTGNFAAQYSD 89
Query: 191 TPMRAAYDI-----PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPA--KGPGYDPTKGP 243
+ A PRGP A PG + P+ +P + PA +GP P K
Sbjct: 90 KGVSAGPGPMGLMGPRGPP-GAVGAPGPQGFQGPAGEPGEPGQTGPAGSRGPAGPPGKA- 147
Query: 244 GYDAQKGS-NYDAQRGP-NYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRG-PGYE-TQR 297
G D G +RG RG P GL G+ RG N D +G PG + +
Sbjct: 148 GEDGHPGKPGRPGERGVVGPQGARGFPGTP--GLPGFKGIRGHNGLDGLKGQPGAQGVKG 205
Query: 298 VPGYDVQRGPVYEAQRAPSYIP-QRG----PGYDLQRGQGYDMRRAPSYDPSRGTGFDGA 352
PG + G Q +P +RG PG RG + P G G
Sbjct: 206 EPGAPGENGT--PGQAGARGLPGERGRVGAPGPAGARGSDGSVGPVGPAGPIGSAGPPGF 263
Query: 353 PRGAAPHGQVPPPLNNVPYGSATPPARSG----SGQPRG--GNP 390
P P G++ P N P G A P +G SG P G GNP
Sbjct: 264 PGAPGPKGELGPVGNPGPAGPAGPRGEAGLPGLSG-PVGPPGNP 306
>UNIPROTKB|F1LS40 [details] [associations]
symbol:Col1a2 "Collagen alpha-2(I) chain" species:10116
"Rattus norvegicus" [GO:0005201 "extracellular matrix structural
constituent" evidence=IEA] [GO:0005581 "collagen" evidence=IEA]
InterPro:IPR000885 Pfam:PF01410 ProDom:PD002078 PROSITE:PS51461
SMART:SM00038 RGD:621351 GO:GO:0005615 GO:GO:0030199 GO:GO:0001501
GO:GO:0008217 GO:GO:0007179 GO:GO:0007266 GO:GO:0070208
InterPro:IPR008160 Pfam:PF01391 GO:GO:0001568 GO:GO:0071230
GO:GO:0005201 GO:GO:0043589 GeneTree:ENSGT00660000095287 KO:K06236
GO:GO:0005584 IPI:IPI00188921 EMBL:AC107447 RefSeq:XP_003749738.1
Ensembl:ENSRNOT00000016423 GeneID:100911218 KEGG:rno:100911218
ArrayExpress:F1LS40 Uniprot:F1LS40
Length = 1372
Score = 121 (47.7 bits), Expect = 0.00092, P = 0.00092
Identities = 85/284 (29%), Positives = 105/284 (36%)
Query: 132 GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGA-GPNTSTSAYAATQSG 190
G TG+ P G DG P G PP A G G GP T +AA S
Sbjct: 33 GPTGDRGPRGQRGPAGPRG-RDGVDGPVGPPGPPGAP--GPPGPPGPPGLTGNFAAQYSD 89
Query: 191 TPMRAAYDI-----PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPA--KGPGYDPTKGP 243
+ A PRGP A PG + P+ +P + PA +GP P K
Sbjct: 90 KGVSAGPGPMGLMGPRGPP-GAVGAPGPQGFQGPAGEPGEPGQTGPAGSRGPAGPPGKA- 147
Query: 244 GYDAQKGS-NYDAQRGP-NYDIHRGPSYDPQRGL-GYDMQRGPN-YDMQRG-PGYE-TQR 297
G D G +RG RG P GL G+ RG N D +G PG + +
Sbjct: 148 GEDGHPGKPGRPGERGVVGPQGARGFPGTP--GLPGFKGIRGHNGLDGLKGQPGAQGVKG 205
Query: 298 VPGYDVQRGPVYEAQRAPSYIP-QRG----PGYDLQRGQGYDMRRAPSYDPSRGTGFDGA 352
PG + G Q +P +RG PG RG + P G G
Sbjct: 206 EPGAPGENGT--PGQAGARGLPGERGRVGAPGPAGARGSDGSVGPVGPAGPIGSAGPPGF 263
Query: 353 PRGAAPHGQVPPPLNNVPYGSATPPARSG----SGQPRG--GNP 390
P P G++ P N P G A P +G SG P G GNP
Sbjct: 264 PGAPGPKGELGPVGNPGPAGPAGPRGEAGLPGLSG-PVGPPGNP 306
>UNIPROTKB|G4MTN4 [details] [associations]
symbol:MGG_07193 "Uncharacterized protein" species:242507
"Magnaporthe oryzae 70-15" [GO:0003674 "molecular_function"
evidence=ND] [GO:0008150 "biological_process" evidence=ND]
InterPro:IPR001202 Pfam:PF00397 PROSITE:PS01159 PROSITE:PS50020
SMART:SM00456 Gene3D:2.20.70.10 SUPFAM:SSF51045 EMBL:CM001232
RefSeq:XP_003715399.1 ProteinModelPortal:G4MTN4
EnsemblFungi:MGG_07193T0 GeneID:2683176 KEGG:mgr:MGG_07193
Uniprot:G4MTN4
Length = 366
Score = 114 (45.2 bits), Expect = 0.00092, P = 0.00092
Identities = 70/248 (28%), Positives = 92/248 (37%)
Query: 163 PPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASKAPSYD 222
PPP T G GP S A A++ + TP D+ + P YE P + A
Sbjct: 53 PPPPPTGDGAPD-GPPPSYQASASSATATPT----DVKKNP-YETE--PAASPNPAGVGG 104
Query: 223 PTKGPSYDPAKGP--GYDPTKGPG-YDAQKGSNYDAQ-RGPNYDIHRGPSYDPQRGLGYD 278
+ GP+ P P G P AQ + DA+ RG + + G P G G+
Sbjct: 105 SSSGPAPPPVNSPRPGDPPVSDDAKLAAQMQAEEDARARGSGGNPNYGGG-SPAPGQGFP 163
Query: 279 MQRGPNYDMQR--------GPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRG 330
Q + + G G ++Q Y GP+ + Q PQ GY G
Sbjct: 164 NQLPARQERSKSGILGKLFGKGKQSQTAH-YGA--GPLPQQQYQQ---PQHQQGYP---G 214
Query: 331 QGYDMRRAPSYDPSRGTGFD-GAPRGAAP-HG---QVP---PPLNNVPYGSATPPARSGS 382
GY S P G G+ GAP P +G Q P PP PYG + G
Sbjct: 215 AGYQQGAPYSPQPGYGGGYQQGAPYSPQPGYGGGYQQPGYGPPPG--PYGQPGYGPQPGY 272
Query: 383 GQPRGGNP 390
G P G P
Sbjct: 273 GHPPYGQP 280
>WB|WBGene00000618 [details] [associations]
symbol:col-41 species:6239 "Caenorhabditis elegans"
[GO:0042302 "structural constituent of cuticle" evidence=IEA]
[GO:0016021 "integral to membrane" evidence=IEA] InterPro:IPR002486
Pfam:PF01484 SMART:SM01088 InterPro:IPR008160 Pfam:PF01391
eggNOG:NOG12793 GO:GO:0042302 HOGENOM:HOG000085656
GeneTree:ENSGT00530000064674 EMBL:Z72514 PIR:T24769
RefSeq:NP_510522.1 ProteinModelPortal:Q22369 IntAct:Q22369
MINT:MINT-213826 STRING:Q22369 PaxDb:Q22369 EnsemblMetazoa:T10B10.1
GeneID:181610 KEGG:cel:CELE_T10B10.1 UCSC:T10B10.1 CTD:181610
WormBase:T10B10.1 InParanoid:Q22369 OMA:CSIGHIV NextBio:914648
Uniprot:Q22369
Length = 428
Score = 115 (45.5 bits), Expect = 0.00093, P = 0.00093
Identities = 80/292 (27%), Positives = 100/292 (34%)
Query: 120 PNVDRRADGSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNT 179
P++ R Y G N ++ + G P G G G P G G T
Sbjct: 70 PSLLRNKRFVYPGMC-NCDSNSQGCPAGAPGPPGNPGKRGDEGHPGDEGRRGASGISLAT 128
Query: 180 STSAYAAT------QSGTPMRAAYDIPRG-PGYEASKGP-GYDASKAPSYDPTKGPSYDP 231
+ +G P P G PG + GP G D AP + G +
Sbjct: 129 THDIPGGCIKCPEGPAGPPGPDGDSGPEGFPGLQGQSGPSGEDG--APGQEGAPGDQGE- 185
Query: 232 AKGP-GYDPTKGPGYDAQKGSNY-DAQRG-PNYDIHRG-PSYDPQRGL-GYDMQRGPNYD 286
+GP GYD T GP D Q G+ Y Q G P G P Q G G D + GP
Sbjct: 186 -QGPKGYDGTDGP--DGQPGTTYFPGQAGQPGEPGWLGEPGLPGQHGEPGKDGEEGP--- 239
Query: 287 MQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPS------ 340
Q PG T G+D G +A + P P + Y Q D R PS
Sbjct: 240 -QGAPG--TPGNAGHDAFPGTPGQAGK-PG-APGKDANY-CPCPQRQDDRTPPSSGTSAP 293
Query: 341 YDPSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGGNPAR 392
P RG+ AP AP PP P + P + P P R
Sbjct: 294 QPPPRGS--TAAPGTRAPPATRAPPATRAPPATTRAPPATTRPAPASQPPVR 343
>TAIR|locus:2154679 [details] [associations]
symbol:ENODL1 "early nodulin-like protein 1" species:3702
"Arabidopsis thaliana" [GO:0005507 "copper ion binding"
evidence=IEA;ISS] [GO:0005886 "plasma membrane" evidence=ISM;IDA]
[GO:0009055 "electron carrier activity" evidence=IEA] [GO:0031225
"anchored to membrane" evidence=TAS] InterPro:IPR003245
Pfam:PF02298 ProDom:PD003122 PROSITE:PS51485 GO:GO:0005886
EMBL:CP002688 GO:GO:0009055 GO:GO:0031225 GO:GO:0005507
EMBL:AB007644 Gene3D:2.60.40.420 InterPro:IPR008972 SUPFAM:SSF49503
ProtClustDB:CLSN2915882 HSSP:P29602 EMBL:BT026028 IPI:IPI00530005
RefSeq:NP_200198.1 UniGene:At.49170 ProteinModelPortal:Q9FN39
SMR:Q9FN39 PRIDE:Q9FN39 EnsemblPlants:AT5G53870.1 GeneID:835468
KEGG:ath:AT5G53870 TAIR:At5g53870 InParanoid:Q9FN39 OMA:AHAPSHS
Genevestigator:Q9FN39 Uniprot:Q9FN39
Length = 370
Score = 114 (45.2 bits), Expect = 0.00094, P = 0.00094
Identities = 30/114 (26%), Positives = 47/114 (41%)
Query: 158 PQGHGPPPSATTAGVVGAGPNTSTSAYAATQSGTPMRAAYDIPRGPGYEASKGPGYDASK 217
P PP S+ + A P S+S + T + +P A P P + K P S
Sbjct: 166 PSKSQPPRSSVSP----AQPPKSSSPISHTPALSPSHATSHSPATPS-PSPKSPS-PVSH 219
Query: 218 APSYDPTKGPSYDPAKGPGYDPTKGPGYDAQKGSNYDAQRGPNYDIHRGPSYDP 271
+PS+ P PS+ PA P + P P + ++ P++ PS+ P
Sbjct: 220 SPSHSPAHTPSHSPAHTPSHSPAHAPSHSPAHAPSHSPAHAPSHSPAHSPSHSP 273
>UNIPROTKB|E2RA46 [details] [associations]
symbol:EWSR1 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0008270 "zinc ion binding" evidence=IEA]
[GO:0005622 "intracellular" evidence=IEA] [GO:0003676 "nucleic acid
binding" evidence=IEA] [GO:0000166 "nucleotide binding"
evidence=IEA] InterPro:IPR000504 InterPro:IPR001876
InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641 PROSITE:PS01358
PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360 SMART:SM00547
GO:GO:0000166 GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
GO:GO:0005622 GeneTree:ENSGT00530000063105 EMBL:AAEX03014786
EMBL:AAEX03014787 Ensembl:ENSCAFT00000019364 Uniprot:E2RA46
Length = 619
Score = 117 (46.2 bits), Expect = 0.00095, P = 0.00095
Identities = 73/278 (26%), Positives = 99/278 (35%)
Query: 128 GSYGGATGNSENETSGRPV-GQNAYEDGYGVPQ-GHGPP--PSATTAGVVGAGP---NTS 180
G+YG T S + GQ AY YG P G+ P P A + V G G +T+
Sbjct: 43 GTYGQPTDVSYTQAQTTATYGQTAYATSYGQPPAGYTTPTAPQAYSQPVQGYGTGAYDTT 102
Query: 181 TSAYAATQSGTPMRAAYDI-PRGPGYEASKGPGYDASKAPSYDPTKGPSYDPAKGPG-YD 238
T+ TQ+ ++AY P P Y + P A P + P G Y+
Sbjct: 103 TATVTTTQASYAAQSAYGTQPAYPAY--GQQPAATAPARPQDGNKPAETSQPQSSTGGYN 160
Query: 239 -PTKGPGYDAQKGSNYDAQRG--PNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPGYET 295
P+ G G Q +Y G P + PSY P Y + +YD T
Sbjct: 161 QPSLGYG---QSNYSYPQVPGSYPMQPVTAPPSYPPT---SYSSTQPTSYDQSSYSQQNT 214
Query: 296 QRVPG-YDVQRG----PVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFD 350
P Y Q Y Q SY PQ G Y Q Y +++ SY D
Sbjct: 215 YGQPSSYGQQSSYGQQSSYGQQPPTSYPPQTG-SYSQAPSQ-YS-QQSSSYGQQSSFRQD 271
Query: 351 GAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRGG 388
P +GQ + P + + G+ RGG
Sbjct: 272 H-PSSMGVYGQESGGFSG-PGENRSMSGPDNRGRGRGG 307
>UNIPROTKB|F1PBJ4 [details] [associations]
symbol:FUS "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0005737 "cytoplasm" evidence=IEA] [GO:0005634
"nucleus" evidence=IEA] [GO:0008270 "zinc ion binding"
evidence=IEA] [GO:0003676 "nucleic acid binding" evidence=IEA]
[GO:0000166 "nucleotide binding" evidence=IEA] InterPro:IPR000504
InterPro:IPR001876 InterPro:IPR012677 Pfam:PF00076 Pfam:PF00641
PROSITE:PS01358 PROSITE:PS50102 PROSITE:PS50199 SMART:SM00360
SMART:SM00547 GO:GO:0005634 GO:GO:0005737 GO:GO:0000166
GO:GO:0008270 Gene3D:3.30.70.330 GO:GO:0003676
GeneTree:ENSGT00530000063105 OMA:YGTQSTP EMBL:AAEX03004378
EMBL:AAEX03004379 Ensembl:ENSCAFT00000026694 Uniprot:F1PBJ4
Length = 517
Score = 116 (45.9 bits), Expect = 0.00095, P = 0.00095
Identities = 46/168 (27%), Positives = 64/168 (38%)
Query: 128 GSYGGATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAGPNTSTSAYAAT 187
G+Y G ++ S +P GQ +Y GYG ++ G NT +
Sbjct: 15 GAYPTQPGQGYSQQSNQPYGQQSYS-GYGQSADTSGYGQSSYGSSYGQTQNTGYGTQSTP 73
Query: 188 QSGTPMRAAYDIPRGP--GY-EASKGPGYDASKAPSYDPTKGPSYDPAKGPGYDPTKGPG 244
Q G Y +G Y + S PGY APS T G ++ GY + G
Sbjct: 74 Q-GYGSTGGYGSSQGSQSSYGQQSSYPGYGQQPAPS--STSGSYGSGSQSSGYGQPQSGG 130
Query: 245 YDAQKGSNYDAQRGPNYDIHRGPSYDPQRGLGYDMQRGPNYDMQRGPG 292
Y Q G Y Q+ Y + SY+P +G G Q Y+ G G
Sbjct: 131 YGQQSG--YSGQQ-QGYGQQQS-SYNPPQGYGQQNQ----YNSSSGGG 170
>ZFIN|ZDB-GENE-040426-2801 [details] [associations]
symbol:ssbp3b "single stranded DNA binding protein
3b" species:7955 "Danio rerio" [GO:0003677 "DNA binding"
evidence=IEA] [GO:0003697 "single-stranded DNA binding"
evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
InterPro:IPR006594 InterPro:IPR007591 InterPro:IPR008116
Pfam:PF04503 PRINTS:PR01743 PROSITE:PS50896 SMART:SM00667
ZFIN:ZDB-GENE-040426-2801 GO:GO:0005634 GO:GO:0003697
eggNOG:NOG245801 HOGENOM:HOG000037785 PANTHER:PTHR12610
GeneTree:ENSGT00390000009187 EMBL:CR847832 EMBL:GQ903695
IPI:IPI00920092 UniGene:Dr.77852 STRING:D0EWT5
Ensembl:ENSDART00000121984 Uniprot:D0EWT5
Length = 373
Score = 114 (45.2 bits), Expect = 0.00095, P = 0.00095
Identities = 60/197 (30%), Positives = 81/197 (41%)
Query: 206 EASKGPGYDASKAPSYDPTKGPSYDPAKG-PGYDPTKGPGY-DAQKGSNYDAQRGPNYDI 263
EA Y A+ APS P G + P G PG P PG+ GS P +
Sbjct: 85 EAKAFHDYSAAAAPS--PVLG-NMPPGDGMPG-GPMP-PGFFQGPPGSQASPHAPPPPNS 139
Query: 264 HRGPSYDPQRGLGYDMQRGPNYDMQRG---PG---YETQRVPGYDVQ-RGPVYEAQRAPS 316
GP P + GP ++ G PG +P D + +GP+ + P
Sbjct: 140 MMGPHGQPFMSPRFG--GGPRPPIRMGNQPPGGVPAAQPMLPNMDPRLQGPM-QRMNVPR 196
Query: 317 YIPQRGPGYDLQRGQGYDMRRAPSYDPSRGTGFDGAPRGAAPHGQVP---PPLNNVPYGS 373
+ GPG +G G MR P ++ S G G G G +G+ P P NN+PY S
Sbjct: 197 GMGPMGPG---PQGFGGGMR--PPHN-SMGPGMPGVNMGPG-NGRPPWPNPNANNMPYSS 249
Query: 374 ATPPARSGSGQPRGGNP 390
+P A G P+GG P
Sbjct: 250 PSPGAYGG---PQGGGP 263
>UNIPROTKB|Q9XSJ7 [details] [associations]
symbol:COL1A1 "Collagen alpha-1(I) chain" species:9615
"Canis lupus familiaris" [GO:0046872 "metal ion binding"
evidence=IEA] [GO:0005581 "collagen" evidence=IEA] [GO:0005201
"extracellular matrix structural constituent" evidence=IEA]
InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093 Pfam:PF01410
ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184 PROSITE:PS51461
SMART:SM00038 SMART:SM00214 GO:GO:0046872 GO:GO:0005581
InterPro:IPR008160 Pfam:PF01391 eggNOG:NOG12793 GO:GO:0005201
CTD:1277 HOGENOM:HOG000085654 HOVERGEN:HBG004933 KO:K06236
OrthoDB:EOG4S4PHP EMBL:AF153062 RefSeq:NP_001003090.1
UniGene:Cfa.100 STRING:Q9XSJ7 GeneID:403651 KEGG:cfa:403651
InParanoid:Q9XSJ7 NextBio:20817156 Uniprot:Q9XSJ7
Length = 1460
Score = 121 (47.7 bits), Expect = 0.00098, P = 0.00098
Identities = 88/285 (30%), Positives = 107/285 (37%)
Query: 126 ADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNTSTS 182
ADG G G G++ + P G A G P G+ G P G AGP +T
Sbjct: 815 ADGQPGAKGEPGDAGAKGDAGPPGP-AGPTGPPGPIGNVGAPGPKGARG--SAGPPGATG 871
Query: 183 -AYAATQSGTPMRAAYDIPRGPGYEASK--GPGYDASKAPSYDPTK-GPSYDPA----KG 234
AA + G P + P GP A K G G P+ P + GP P KG
Sbjct: 872 FPGAAGRVGPPGPSGNAGPPGPPGPAGKEGGKGARGETGPAGRPGEVGPPGPPGPAGEKG 931
Query: 235 -PGYD-P-----TKGP-GYDAQKGS-NYDAQRGPN-YDIHRGPSYDP-QRG-LGYDMQRG 282
PG D P T GP G Q+G QRG + GPS +P ++G G +RG
Sbjct: 932 SPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQGPSGTSGERG 991
Query: 283 PNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYD 342
P M GP PG GP E+ R S + PG D G D
Sbjct: 992 PPGPM--GP-------PGL---AGPPGESGREGSPGAEGSPGRDGSPGPKGDRGETGPAG 1039
Query: 343 PSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 387
P G GAP P G+ P G A P G+ P G
Sbjct: 1040 PPGAPGAPGAPGPVGPAGKNGDRGETGPAGPAGPIGPVGARGPAG 1084
Score = 121 (47.7 bits), Expect = 0.00098, P = 0.00098
Identities = 81/290 (27%), Positives = 101/290 (34%)
Query: 119 APNVDRRADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAG 176
A D+ G G G TG P G++ G+P GPP G G G
Sbjct: 96 ASPTDQETTGVEGPKGDTGPRGPRGPAGPPGRDGIPGQPGLP---GPPGPPGPPGPPGLG 152
Query: 177 PNTSTS-AYAATQSGTPMRAAYDIPRGP-GYEASKG-PGYDASKAPSYDPTKGPSYDPAK 233
N + +Y + T +P GP G +G PG AP +GP +P +
Sbjct: 153 GNFAPQMSYGYDEKST---GGISVP-GPMGPSGPRGLPGPPG--APGPQGFQGPPGEPGE 206
Query: 234 GPGYDPTKGP-GYDAQKGSNYD-AQRG-PNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQR 289
PG GP G G N D + G P RGP PQ G G P R
Sbjct: 207 -PGASGPMGPRGPPGPPGKNGDDGEAGKPGRPGERGPP-GPQGARGLPGTAGLPGMKGHR 264
Query: 290 GPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPSRGTG 348
G + G GP + P + G PG RG + R + P+ G
Sbjct: 265 G----FSGLDGAKGDAGPA-GPKGEPGSPGENGAPGQMGPRGLPGERGRPGAPGPAGARG 319
Query: 349 FDGAPRGAAPHGQV----PPPLNNV--PYGSATPPARSGSGQPRG--GNP 390
DGA A P G PP G A P GS P+G G P
Sbjct: 320 NDGATGAAGPPGPTGPAGPPGFPGAVGAKGEAGPQGARGSEGPQGVRGEP 369
>UNIPROTKB|F1Q3I5 [details] [associations]
symbol:COL1A1 "Collagen alpha-1(I) chain" species:9615
"Canis lupus familiaris" [GO:0005581 "collagen" evidence=IEA]
[GO:0005201 "extracellular matrix structural constituent"
evidence=IEA] InterPro:IPR000885 InterPro:IPR001007 Pfam:PF00093
Pfam:PF01410 ProDom:PD002078 PROSITE:PS01208 PROSITE:PS50184
PROSITE:PS51461 SMART:SM00038 SMART:SM00214 GO:GO:0005581
InterPro:IPR008160 Pfam:PF01391 GO:GO:0005201
GeneTree:ENSGT00660000095287 OMA:VAYMDQQ EMBL:AAEX03006535
Ensembl:ENSCAFT00000026953 Uniprot:F1Q3I5
Length = 1464
Score = 121 (47.7 bits), Expect = 0.00099, P = 0.00099
Identities = 88/285 (30%), Positives = 107/285 (37%)
Query: 126 ADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGH-GPPPSATTAGVVGAGPNTSTS 182
ADG G G G++ + P G A G P G+ G P G AGP +T
Sbjct: 819 ADGQPGAKGEPGDAGAKGDAGPPGP-AGPTGPPGPIGNVGAPGPKGARG--SAGPPGATG 875
Query: 183 -AYAATQSGTPMRAAYDIPRGPGYEASK--GPGYDASKAPSYDPTK-GPSYDPA----KG 234
AA + G P + P GP A K G G P+ P + GP P KG
Sbjct: 876 FPGAAGRVGPPGPSGNAGPPGPPGPAGKEGGKGARGETGPAGRPGEVGPPGPPGPAGEKG 935
Query: 235 -PGYD-P-----TKGP-GYDAQKGS-NYDAQRGPN-YDIHRGPSYDP-QRG-LGYDMQRG 282
PG D P T GP G Q+G QRG + GPS +P ++G G +RG
Sbjct: 936 SPGADGPAGAPGTPGPQGIAGQRGVVGLPGQRGERGFPGLPGPSGEPGKQGPSGASGERG 995
Query: 283 PNYDMQRGPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRGPGYDLQRGQGYDMRRAPSYD 342
P M GP PG GP E+ R S + PG D G D
Sbjct: 996 PPGPM--GP-------PGL---AGPPGESGREGSPGAEGSPGRDGSPGPKGDRGETGPAG 1043
Query: 343 PSRGTGFDGAPRGAAPHGQVPPPLNNVPYGSATPPARSGSGQPRG 387
P G GAP P G+ P G A P G+ P G
Sbjct: 1044 PPGAPGAPGAPGPVGPAGKNGDRGETGPAGPAGPIGPVGARGPAG 1088
Score = 121 (47.7 bits), Expect = 0.00099, P = 0.00099
Identities = 81/290 (27%), Positives = 101/290 (34%)
Query: 119 APNVDRRADGSYG--GATGNSENETSGRPVGQNAYEDGYGVPQGHGPPPSATTAGVVGAG 176
A D+ G G G TG P G++ G+P GPP G G G
Sbjct: 100 ASPTDQETTGVEGPKGDTGPRGPRGPAGPPGRDGIPGQPGLP---GPPGPPGPPGPPGLG 156
Query: 177 PNTSTS-AYAATQSGTPMRAAYDIPRGP-GYEASKG-PGYDASKAPSYDPTKGPSYDPAK 233
N + +Y + T +P GP G +G PG AP +GP +P +
Sbjct: 157 GNFAPQMSYGYDEKST---GGISVP-GPMGPSGPRGLPGPPG--APGPQGFQGPPGEPGE 210
Query: 234 GPGYDPTKGP-GYDAQKGSNYD-AQRG-PNYDIHRGPSYDPQRGLGYDMQRG-PNYDMQR 289
PG GP G G N D + G P RGP PQ G G P R
Sbjct: 211 -PGASGPMGPRGPPGPPGKNGDDGEAGKPGRPGERGPP-GPQGARGLPGTAGLPGMKGHR 268
Query: 290 GPGYETQRVPGYDVQRGPVYEAQRAPSYIPQRG-PGYDLQRGQGYDMRRAPSYDPSRGTG 348
G + G GP + P + G PG RG + R + P+ G
Sbjct: 269 G----FSGLDGAKGDAGPA-GPKGEPGSPGENGAPGQMGPRGLPGERGRPGAPGPAGARG 323
Query: 349 FDGAPRGAAPHGQV----PPPLNNV--PYGSATPPARSGSGQPRG--GNP 390
DGA A P G PP G A P GS P+G G P
Sbjct: 324 NDGATGAAGPPGPTGPAGPPGFPGAVGAKGEAGPQGARGSEGPQGVRGEP 373
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.315 0.136 0.431 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 393 393 0.00095 117 3 11 23 0.47 34
34 0.45 37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 239
No. of states in DFA: 603 (64 KB)
Total size of DFA: 266 KB (2137 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 45.18u 0.11s 45.29t Elapsed: 00:00:02
Total cpu time: 45.25u 0.11s 45.36t Elapsed: 00:00:02
Start: Tue May 21 05:03:37 2013 End: Tue May 21 05:03:39 2013
WARNINGS ISSUED: 1