Your job contains 1 sequence.
>003292
MGNTLRELDEGRLQDNVLDGGEEMYDSDWEDGSIPVACSKENHPESDIKGVTIEFDAADS
VTKKPVRRASAEDKELAELVHKVHLLCLLARGRLIDSVCDDPLIQASLLSLLPSYLLKIS
EVSKLTANALSPIVSWFHDNFHVRSSVSTRRSFHSDLAHALESREGTPEEIAALSVALFR
ALKLTTRFVSILDVASLKPEADKNVSSNQDSSRVGGGIFNAPTLMVAKPEEVLASPVKSF
SCDKKENVCETSSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLDPSSSMACSDISEAC
HPKEKSQALKRKGDLEFEMQLEMALSATNVATSKSNICSDVKDLNSNSSTVLPVKRLKKI
ESGESSTSCLGISTAVGSRKVGAPLYWAEVYCSGENLTGKWVHVDAANAIIDGEQKVEAA
AAACKTSLRYIVAFAGCGAKDVTRRYCMKWYRIASKRVNSAWWDAVLAPLRELESGATGD
LNVESSAKDSFVADRNSLEDMELETRALTEPLPTNQQAYKNHQLYVIERWLNKYQILYPK
GPILGFCSGHAVYPRSCVQTLKTKERWLREALQVKANEVPVKVIKNSSKSKKGQDFEPED
YDEVDARGNIELYGKWQLEPLRLPSAVNGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVY
SVAKRLEIDSAPAMVGFEFRNGRSTPVFDGIVVCAEFKDTILEAYAEEEEKREAEEKKRR
EAQATSRWYQLLSSIVTRQRLNNCYGNNSTSQSSSNFQNVKKTNSNVGVDSSQNDWQSPN
QVDRGDTKLHAPSPFQSEEHEHVYLIEDQSFDEENSVTTKRCHCGFTIQVEEL
The BLAST search returned 1 gene product which did not match your query constraints. Please see the full BLAST report below for the details.
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 003292
(833 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N
TAIR|locus:2174160 - symbol:RAD4 species:3702 "Arabidopsi... 1484 2.8e-211 2
MGI|MGI:103557 - symbol:Xpc "xeroderma pigmentosum, compl... 657 2.1e-64 1
UNIPROTKB|F1N806 - symbol:Gga.54220 "Uncharacterized prot... 628 4.8e-64 2
UNIPROTKB|E1BUG1 - symbol:Gga.54220 "Uncharacterized prot... 628 3.0e-63 2
UNIPROTKB|E1BDJ1 - symbol:XPC "Uncharacterized protein" s... 447 3.1e-60 3
RGD|1305760 - symbol:Xpc "xeroderma pigmentosum, compleme... 587 1.9e-56 1
UNIPROTKB|E9PH69 - symbol:XPC "DNA repair protein-complem... 581 6.9e-56 1
UNIPROTKB|Q01831 - symbol:XPC "DNA repair protein complem... 578 1.9e-55 1
UNIPROTKB|F1SPI2 - symbol:XPC "Uncharacterized protein" s... 428 1.1e-50 2
UNIPROTKB|E2RCR3 - symbol:XPC "Uncharacterized protein" s... 448 4.9e-50 2
ZFIN|ZDB-GENE-030131-8461 - symbol:xpc "xeroderma pigment... 414 4.2e-46 2
FB|FBgn0004698 - symbol:mus210 "mutagen-sensitive 210" sp... 405 9.4e-40 2
ASPGD|ASPL0000010029 - symbol:AN3890 species:162425 "Emer... 328 4.6e-29 2
WB|WBGene00022296 - symbol:xpc-1 species:6239 "Caenorhabd... 283 4.6e-26 3
DICTYBASE|DDB_G0292296 - symbol:xpc "DNA repair protein R... 304 1.8e-23 2
POMBASE|SPAC12B10.12c - symbol:rhp41 "DNA repair protein ... 286 2.3e-23 2
ASPGD|ASPL0000008254 - symbol:AN6186 species:162425 "Emer... 198 1.2e-19 4
POMBASE|SPCC4G3.10c - symbol:rhp42 "DNA repair protein Rh... 251 1.9e-19 2
CGD|CAL0004788 - symbol:orf19.6722 species:5476 "Candida ... 240 1.3e-18 2
SGD|S000000964 - symbol:RAD4 "Protein that recognizes and... 237 8.9e-17 3
UNIPROTKB|G4MUV6 - symbol:MGG_01699 "Uncharacterized prot... 200 2.8e-14 3
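The summary rows above all end with the same three fields (high score, smallest sum probability P(N), and N), so they can be split mechanically. The following is a minimal sketch, not part of the BLAST output, assuming each row has the shape `DB|ACCESSION - description... SCORE P(N) N`; the function name `parse_summary_row` and the dictionary keys are hypothetical names chosen for illustration.

```python
import re

# Assumption: every summary row ends with "<score> <P(N)> <N>".
# The description before those fields is truncated by BLAST ("...")
# and is kept as-is.
ROW = re.compile(
    r"^(?P<desc>.+?)\s+(?P<score>\d+)\s+"
    r"(?P<p>[\d.]+(?:e[-+]?\d+)?)\s+(?P<n>\d+)$"
)


def parse_summary_row(line):
    """Split one hit-summary row into database, accession, score, P(N), N.

    Returns None when the line does not look like a summary row.
    """
    m = ROW.match(line.strip())
    if m is None:
        return None
    # "TAIR|locus:2174160 - symbol:RAD4 ..." -> db "TAIR", rest after "|"
    db, _, rest = m.group("desc").partition("|")
    # The accession ends at the first " - " separator.
    accession = rest.split(" - ", 1)[0]
    return {
        "db": db,
        "accession": accession,
        "score": int(m.group("score")),
        "p": float(m.group("p")),
        "n": int(m.group("n")),
    }


row = 'TAIR|locus:2174160 - symbol:RAD4 species:3702 "Arabidopsi... 1484 2.8e-211 2'
hit = parse_summary_row(row)
print(hit["db"], hit["accession"], hit["score"], hit["p"], hit["n"])
```

Anchoring the pattern at the end of the line keeps digits inside the description (for example `species:3702`) from being mistaken for the score.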
>TAIR|locus:2174160
symbol:RAD4 species:3702 "Arabidopsis thaliana"
[GO:0003677 "DNA binding" evidence=IEA] [GO:0003684 "damaged DNA
binding" evidence=IEA;ISS] [GO:0005634 "nucleus"
evidence=ISM;IEA;ISS] [GO:0006289 "nucleotide-excision repair"
evidence=IEA;ISS] [GO:0009507 "chloroplast" evidence=IDA]
InterPro:IPR004583 InterPro:IPR018325 InterPro:IPR018326
InterPro:IPR018327 InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403
Pfam:PF10404 Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
Pfam:PF01841 GO:GO:0005634 EMBL:CP002688 GenomeReviews:BA000015_GR
GO:GO:0009507 GO:GO:0003684 GO:GO:0006289 InterPro:IPR002931
KO:K10838 eggNOG:COG5535 PANTHER:PTHR12135 EMBL:AY062755
EMBL:BT010359 IPI:IPI00534100 RefSeq:NP_001031894.1
RefSeq:NP_197166.2 UniGene:At.27241 ProteinModelPortal:Q8W489
STRING:Q8W489 PaxDb:Q8W489 PRIDE:Q8W489 EnsemblPlants:AT5G16630.1
EnsemblPlants:AT5G16630.2 GeneID:831525 KEGG:ath:AT5G16630
TAIR:At5g16630 HOGENOM:HOG000144515 InParanoid:Q8W489 OMA:QVDVWSE
PhylomeDB:Q8W489 ProtClustDB:CLSN2690169 Genevestigator:Q8W489
Uniprot:Q8W489
Length = 865
Score = 1484 (527.5 bits), Expect = 2.8e-211, Sum P(2) = 2.8e-211
Identities = 322/610 (52%), Positives = 395/610 (64%)
Query: 245 KENVCETSSKGSPECKYSS--PKSNNTQSK-KSPVSCELSSGNLDPSSSMACSDISEACH 301
K + TS+ P+ + S PK +++ K KSP + GN S + + ++ +C
Sbjct: 275 KHGIFRTSTLMVPKQQAISSYPKKSSSHVKNKSPFE-KPQLGNPLGSDQVQDNAVNSSCE 333
Query: 302 P--KEKSQALKRKGDLEFEMQLEMALSATNVATSKSNICSDVKDLNSNSSTVLPVKRLKK 359
KS +RKGD+EFE Q+ MALSAT +D N SS V K++++
Sbjct: 334 AGMSIKSDGTRRKGDVEFERQIAMALSAT----------AD----NQQSSQVNNTKKVRE 379
Query: 360 IE--SGESSTSCLGISTAVGSRKVGAPLYWAEVYCSGENLTGKWVHVDAANAIIDGEQKV 417
I S SS S ISTA GS+KV +PL W EVYC+GEN+ GKWVHVDA N +ID EQ +
Sbjct: 380 ITKISNSSSVSDQVISTAFGSKKVDSPLCWLEVYCNGENMDGKWVHVDAVNGMIDAEQNI 439
Query: 418 EAAAAACKTSLRYIVAFAGCGAKDVTRRYCMKWYRIASKRVNSAWWDAVLAPLRELESGA 477
EAAAAACKT LRY+VAFA GAKDVTRRYC KW+ I+SKRV+S WWD VLAPL LESGA
Sbjct: 440 EAAAAACKTVLRYVVAFAAGGAKDVTRRYCTKWHTISSKRVSSVWWDMVLAPLVHLESGA 499
Query: 478 TGD----------LN-VES--SAKDSFVADRNSLEDMELETRALTEPLPTNQQAYKNHQL 524
T D LN V S S+ S R++LEDMEL TRALTE LPTNQQAYK+H++
Sbjct: 500 THDEDIALRNFNGLNPVSSRASSSSSSFGIRSALEDMELATRALTESLPTNQQAYKSHEI 559
Query: 525 YVIERWLNKYQILYPKGPILGFCSGHAVYPRSCVQTLKTKERWLREALQVKANEXXXXXX 584
Y IE+WL+K QIL+PKGP+LGFCSGH VYPR+CVQTLKTKERWLR+ LQ+KANE
Sbjct: 560 YAIEKWLHKNQILHPKGPVLGFCSGHPVYPRTCVQTLKTKERWLRDGLQLKANEVPSKIL 619
Query: 585 XXXXXXXXGQDFEPEDYDEVDARGNIELYGKWQLEPLRLPSAVNGIVPRNERGQVDVWSE 644
+DFE D + +ELYGKWQ+EPL LP AVNGIVP+NERGQVDVWSE
Sbjct: 620 KRNSKFKKVKDFEDGDNNIKGGSSCMELYGKWQMEPLCLPPAVNGIVPKNERGQVDVWSE 679
Query: 645 KCLPPGTVHLRLPRVYSVAKRLEIDSAPAMVGFEFRNGRSTPVFDGIVVCAEFKDTILEA 704
KCLPPGTVHLR PR+++VAKR ID APAMVGFE+R+G +TP+F+GIVVC EFKDTILEA
Sbjct: 680 KCLPPGTVHLRFPRIFAVAKRFGIDYAPAMVGFEYRSGGATPIFEGIVVCTEFKDTILEA 739
Query: 705 YXXXXXXXXXXXXXXXXXQATSRWYQLLSSIVTRQRLNNCYGXXXXXXXXXXXXXVKKTN 764
Y QA SRWYQLLSSI+TR+RL N Y + N
Sbjct: 740 YAEEQEKKEEEERRRNEAQAASRWYQLLSSILTRERLKNRYANNSNDVEAKSL----EVN 795
Query: 765 SNVGVDSSQNDWQSPNQV-DRGDTKLHAPSPFQSEEHEHVYLIEDQSFDEENSVTTKRCH 823
S V + +V RG+ S + E HEHV+L E+++FDEE SV TKRC
Sbjct: 796 SETVVKAKNVKAPEKQRVAKRGEKSRVRKSRNEDESHEHVFLDEEETFDEETSVKTKRCK 855
Query: 824 CGFTIQVEEL 833
CGF+++VE++
Sbjct: 856 CGFSVEVEQM 865
Score = 581 (209.6 bits), Expect = 2.8e-211, Sum P(2) = 2.8e-211
Identities = 134/266 (50%), Positives = 172/266 (64%)
Query: 2 GNTLRELDEGRLQDNVL-DGG------EEMYDSDWEDGSIPVACSK-ENHPESDIKGVTI 53
G + LD RL DNVL D G +EM DSDWED IP S +++ D + +TI
Sbjct: 53 GKGKQALD-ARLIDNVLEDRGCGNVDDDEMNDSDWEDCPIPSLDSTVDDNNVDDTRELTI 111
Query: 54 EFD--AADSVTKKPVRRASAEDKELAELVHKVHLLCLLARGRLIDSVCDDPLIQAXXXXX 111
EFD D+ +K RA+AEDK AELVHKVHLLCLLARGR++DS C+DPLIQA
Sbjct: 112 EFDDDVPDAKKQKNAYRATAEDKVRAELVHKVHLLCLLARGRIVDSACNDPLIQAALLSL 171
Query: 112 XXXXXXKISEVSKLTANALSPIVSWFHDNFHVRSSVSTRRSFHSDLAHALESREGTPEEI 171
K+S + K+T ++P++ W +NF V S S+ +SF + LA ALESR+GT EE+
Sbjct: 172 LPSYLTKVSNLEKVTVKDIAPLLRWVRENFSVSCSPSSEKSFRTSLAFALESRKGTAEEL 231
Query: 172 AALSVALFRALKLTTRFVSILDVASLKPEADKNVSSNQDSSRVGGGIFNAPTLMVAKPEE 231
AAL+VAL RALKLTTRFVSILDVASLKP AD+N SS Q+ +++ GIF TLMV K +
Sbjct: 232 AALAVALLRALKLTTRFVSILDVASLKPGADRNESSGQNRAKMKHGIFRTSTLMVPKQQA 291
Query: 232 VLASPVKSFSCDKKENVCETSSKGSP 257
+ + P KS S K ++ E G+P
Sbjct: 292 ISSYPKKSSSHVKNKSPFEKPQLGNP 317
>MGI|MGI:103557
symbol:Xpc "xeroderma pigmentosum, complementation group C"
species:10090 "Mus musculus" [GO:0000075 "cell cycle checkpoint"
evidence=ISO] [GO:0000715 "nucleotide-excision repair, DNA damage
recognition" evidence=ISO] [GO:0003677 "DNA binding" evidence=IEA]
[GO:0003684 "damaged DNA binding" evidence=ISO] [GO:0003697
"single-stranded DNA binding" evidence=ISO] [GO:0005634 "nucleus"
evidence=ISO;IDA] [GO:0005737 "cytoplasm" evidence=ISO] [GO:0006281
"DNA repair" evidence=IMP] [GO:0006289 "nucleotide-excision repair"
evidence=ISO;IDA;IMP] [GO:0006974 "response to DNA damage stimulus"
evidence=IMP] [GO:0010224 "response to UV-B" evidence=IMP]
[GO:0031573 "intra-S DNA damage checkpoint" evidence=IGI]
[GO:0071942 "XPC complex" evidence=ISO] InterPro:IPR004583
InterPro:IPR018325 InterPro:IPR018326 InterPro:IPR018327
InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403 Pfam:PF10404
Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
MGI:MGI:103557 GO:GO:0005737 GO:GO:0042493 GO:GO:0003684
GO:GO:0003697 GO:GO:0010224 GO:GO:0006289 GO:GO:0031573
GO:GO:0071942 GO:GO:0000715 KO:K10838 eggNOG:COG5535
PANTHER:PTHR12135 GeneTree:ENSGT00390000005194 InterPro:IPR018026
TIGRFAMs:TIGR00605 CTD:7508 HOGENOM:HOG000124671 HOVERGEN:HBG000407
OMA:MKRFNKE OrthoDB:EOG40CHGQ ChiTaRS:XPC EMBL:U27398 EMBL:AB071144
EMBL:AK004713 EMBL:AK028595 EMBL:AK166981 EMBL:U40005
IPI:IPI00124885 PIR:S70630 RefSeq:NP_033557.2 UniGene:Mm.2806
ProteinModelPortal:P51612 SMR:P51612 IntAct:P51612 STRING:P51612
PhosphoSite:P51612 PaxDb:P51612 PRIDE:P51612
Ensembl:ENSMUST00000032182 GeneID:22591 KEGG:mmu:22591
UCSC:uc009cyd.1 InParanoid:P51612 NextBio:302933 Bgee:P51612
CleanEx:MM_XPC Genevestigator:P51612 GermOnline:ENSMUSG00000030094
Uniprot:P51612
Length = 930
Score = 657 (236.3 bits), Expect = 2.1e-64, P = 2.1e-64
Identities = 218/768 (28%), Positives = 337/768 (43%)
Query: 7 ELDEGRLQDNVLDGGEEMYDS--DWEDGSIPVACSKENHPESDIKGVTIEFDAADSVTKK 64
E++E L + VLD GE S D ++ + + + I+ + ++ ++
Sbjct: 132 EVEE--LTEPVLDMGENSATSPSDMPVKAVEIEIETPQQAKERERSEKIKMEF-ETYLRR 188
Query: 65 PVRRASAEDKELAELVHKVHLLCLLARGRLIDSVCDDPLIQAXXXXXXXXXXXKISEVSK 124
++R + KE+ E +HKVHLLCLLA G +S+C P + A K+ +
Sbjct: 189 MMKRFN---KEVQENMHKVHLLCLLASGFYRNSICRQPDLLAIGLSIIPIRFTKVP-LQD 244
Query: 125 LTANALSPIVSWFHDNFHVRSSVSTRRSFHSDLAHALESREGT-----PEEIAALSVALF 179
A LS +V WF F V + +S S DL LE R EE+ + + +
Sbjct: 245 RDAYYLSNLVKWFIGTFTVNADLSA--SEQDDLQTTLERRIAIYSARDNEELVHIFLLIL 302
Query: 180 RALKLTTRFVSILDVASLKPEADKNVSSNQDSSRVGGGIFNAPTLMVAKPEEVLASPVKS 239
RAL+L TR V L LK K S++++S G G + L PE P S
Sbjct: 303 RALQLLTRLVLSLQPIPLKSAVTKGRKSSKETSVEGPG--GSSELSSNSPESH-NKPTTS 359
Query: 240 FSCDKKENVCETSSKGSPECKYSSPKSNNTQSKKSPVS-CELSSGNLDPSSSMACSDISE 298
++E + E K + K + + + Q +K S E + + ++
Sbjct: 360 RRIKEEETLSEGRGKATARGKRGTGTAGSRQRRKPSCSEGEEAEQKVQGRPHARKRRVAA 419
Query: 299 ACHPKEKSQALKRKGDLEFEMQLEMALSATNV----ATSKSNICSDVKDLNSNSSTVLPV 354
KE+S++ +FE +++ K S + + S +
Sbjct: 420 KVSYKEESESDGAGSGSDFEPSSGEGQHSSDEDCEPGPRKQKRASAPQRTKAGSKSASKT 479
Query: 355 KRLKKIESG---ESSTSCLG------ISTA---VGSRKVGAPLYWAEVYCSGENLTGKWV 402
+R + E E+S+S G +S+ + RK W EVYC + KWV
Sbjct: 480 QRGSQCEPSSFPEASSSSSGCKRGKKVSSGAEEMADRKPAGVDQWLEVYCEPQ---AKWV 536
Query: 403 HVDAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCG-AKDVTRRYCMKWYRIASK-RVNS 460
VD + ++ Q V A K + Y+V G +DVT+RY W K RV++
Sbjct: 537 CVDCVHGVVG--QPVACYKYATKP-MTYVVGIDSDGWVRDVTQRYDPAWMTATRKCRVDA 593
Query: 461 AWWDAVLAPLRELESGATGDLNVESSAKDSFVADRNSLEDMELETRALTEPLPTNQQAYK 520
WW L P R L + +R ED E + + L +PLPT+ YK
Sbjct: 594 EWWAETLRPYRSL------------------LTEREKKEDQEFQAKHLDQPLPTSISTYK 635
Query: 521 NHQLYVIERWLNKYQILYPK-GPILGFCSGHAVYPRSCVQTLKTKERWLREALQVKANEX 579
NH LY ++R L K+Q +YP+ +LG+C G AVY R CV TL +++ WL++A V+ E
Sbjct: 636 NHPLYALKRHLLKFQAIYPETAAVLGYCRGEAVYSRDCVHTLHSRDTWLKQARVVRLGEV 695
Query: 580 -XXXXXXXXXXXXXGQDFEPEDYDEVDARGNIELYGKWQLEPLRLPSAVNGIVPRNERGQ 638
+ EP+ +D D + LYG WQ E + P AV+G VPRNE G
Sbjct: 696 PYKMVKGFSNRARKARLSEPQLHDHND----LGLYGHWQTEEYQPPIAVDGKVPRNEFGN 751
Query: 639 VDVWSEKCLPPGTVHLRLPRVYSVAKRLEIDSAPAMVGFEFRNGRSTPVFDGIVVCAEFK 698
V ++ +P G V + LP + VA++L ID A+ GF+F G PV DG +VC EF+
Sbjct: 752 VYLFLPSMMPVGCVQMTLPNLNRVARKLGIDCVQAITGFDFHGGYCHPVTDGYIVCEEFR 811
Query: 699 DTILEAYXXXXXXXXXXXXXXXXXQATSRWYQLLSSIVTRQRLNNCYG 746
D +L A+ +A W L+ ++ R+RL YG
Sbjct: 812 DVLLAAWENEQAIIEKKEKEKKEKRALGNWKLLVRGLLIRERLKLRYG 859
>UNIPROTKB|F1N806
symbol:Gga.54220 "Uncharacterized protein" species:9031
"Gallus gallus" [GO:0000715 "nucleotide-excision repair, DNA damage
recognition" evidence=IEA] [GO:0003684 "damaged DNA binding"
evidence=IEA] [GO:0003697 "single-stranded DNA binding"
evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0010224
"response to UV-B" evidence=IEA] [GO:0031573 "intra-S DNA damage
checkpoint" evidence=IEA] [GO:0071942 "XPC complex" evidence=IEA]
InterPro:IPR004583 InterPro:IPR018325 InterPro:IPR018326
InterPro:IPR018327 InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403
Pfam:PF10404 Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
GO:GO:0003684 GO:GO:0003697 GO:GO:0010224 GO:GO:0031573
GO:GO:0071942 GO:GO:0000715 PANTHER:PTHR12135
GeneTree:ENSGT00390000005194 InterPro:IPR018026 TIGRFAMs:TIGR00605
EMBL:AADN02014130 IPI:IPI00818722 Ensembl:ENSGALT00000036242
ArrayExpress:F1N806 Uniprot:F1N806
Length = 826
Score = 628 (226.1 bits), Expect = 4.8e-64, Sum P(2) = 4.8e-64
Identities = 214/705 (30%), Positives = 311/705 (44%)
Query: 74 KELAELVHKVHLLCLLARGRLIDSVCDDPLIQAXXXXXXXXXXXKISEVSKLTANALSPI 133
KE+ E HKVHLLCLLA G + +C P + A K+ ++ +S +
Sbjct: 94 KEVREDTHKVHLLCLLANGFYRNRICSQPDLHAIGLSIIPIHFTKVP-AGQVDLLYISNL 152
Query: 134 VSWFHDNFHVRSSVSTRRSFHSDLAHALESREGT-----PEEIAALSVALFRALKLTTRF 188
V WF F V +ST + L LE R EE+ + + + RAL+L R
Sbjct: 153 VKWFVGTFTVNDELSTEKG--EPLQSTLERRFAIYAARDDEELVHIFLIILRALQLLCRL 210
Query: 189 VSILDVASLKPEADKNVSS--NQDSSRVGGGIFNAPTLMVAKPEEVLASPVKSFSCDKKE 246
V L LK E VS Q + + ++ E S + K+
Sbjct: 211 VLSLQPIPLK-ETKAKVSCFLKQKLTTPCSEKSTSKKQSLSSTSEGQESSGTTPKAVAKK 269
Query: 247 NVCETSSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLDPSSSMACSDISEACHPKEKS 306
C+ + + K S + +N +SKK+ S + + P +S S+ C+ +E
Sbjct: 270 CPCKKAKRDE---KSSGSEEDNEESKKTK-SAQTERTH-KPKNSRWRRVASKVCYKEESG 324
Query: 307 QALKRKGDLEFEMQLEMALSATNVAT-SKSNICSDVKDLNSNSSTVLPVKRLKKIESGES 365
D E + E +S + T SK S + S V+ VK K E+ ES
Sbjct: 325 SDEGSVSDFEISGE-ESDISDEDFETVSKKRRSSQ----GAQKSKVMTVKSPKS-ETSES 378
Query: 366 --STSCLGI----------STAVGS---------RKVGAPLYWAEVYCSGENLTGKWVHV 404
S + LG+ + + S RKV W EV+ E+ +WV V
Sbjct: 379 RLSRNSLGVEPRPHAQRKRNKIISSDEDDGQQMVRKVVGTDQWLEVFLERED---RWVCV 435
Query: 405 DAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCGA-KDVTRRYCMKWYRIA-SKRVNSAW 462
D + I+ Q + A K L YIV F G+ KDVT+RY W + KRV+ W
Sbjct: 436 DCVHGIVGQPQ--QCFTYATKP-LSYIVGFDNDGSVKDVTQRYDPVWMTMTRKKRVDPEW 492
Query: 463 WDAVLAPLRELESGATGDLNVESSAKDSFVADRNSLEDMELETRALTEPLPTNQQAYKNH 522
W+ L P K FV DR+ E+ E + + +PLPT YKNH
Sbjct: 493 WEDTLQPY-----------------KSPFV-DRDKKEETEFQVKLQDQPLPTAIGEYKNH 534
Query: 523 QLYVIERWLNKYQILYPKGP-ILGFCSGHAVYPRSCVQTLKTKERWLREALQVKANEX-X 580
LY ++R L KYQ +YP+ ILG+C G AVY R CV TL +K+ WL++A V+ E
Sbjct: 535 PLYALKRHLLKYQAIYPESAAILGYCRGEAVYSRDCVHTLHSKDTWLKQARVVRIGEVPY 594
Query: 581 XXXXXXXXXXXXGQDFEPEDYDEVDARGNIELYGKWQLEPLRLPSAVNGIVPRNERGQVD 640
+ EP + D+ D + L+G+WQ E + P AV+G VPRNE G V
Sbjct: 595 KMVKGYSNQARKARLAEPANRDKAD----LALFGRWQTEEYQPPIAVDGKVPRNEYGNVY 650
Query: 641 VWSEKCLPPGTVHLRLPRVYSVAKRLEIDSAPAMVGFEFRNGRSTPVFDGIVVCAEFKDT 700
++ LP G V LRLP + +A++L+ID A A+ GF+F G S V DG VVC E+K+
Sbjct: 651 LFLPSMLPIGCVQLRLPNLNRLARKLDIDCAQAVTGFDFHGGYSHAVTDGYVVCEEYKEV 710
Query: 701 ILEAYXXXXXXXXXXXXXXXXXQATSRWYQLLSSIVTRQRLNNCY 745
++ A+ +A W L ++ R+RL Y
Sbjct: 711 LIAAWENEQAEIEKKEKEKREKRALGNWKLLTKGLLIRERLKQRY 755
Score = 43 (20.2 bits), Expect = 4.8e-64, Sum P(2) = 4.8e-64
Identities = 9/26 (34%), Positives = 15/26 (57%)
Query: 6 RELDEGRLQDNVLDGGEEMYDSDWED 31
+E+DE DN D ++ + +WED
Sbjct: 9 KEMDE----DNTDDDDDDESEDEWED 30
>UNIPROTKB|E1BUG1
symbol:Gga.54220 "Uncharacterized protein" species:9031
"Gallus gallus" [GO:0000715 "nucleotide-excision repair, DNA damage
recognition" evidence=IEA] [GO:0003684 "damaged DNA binding"
evidence=IEA] [GO:0003697 "single-stranded DNA binding"
evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0010224
"response to UV-B" evidence=IEA] [GO:0031573 "intra-S DNA damage
checkpoint" evidence=IEA] [GO:0071942 "XPC complex" evidence=IEA]
InterPro:IPR004583 InterPro:IPR018325 InterPro:IPR018326
InterPro:IPR018327 InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403
Pfam:PF10404 Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
GO:GO:0003684 GO:GO:0003697 GO:GO:0010224 GO:GO:0031573
GO:GO:0071942 GO:GO:0000715 PANTHER:PTHR12135
GeneTree:ENSGT00390000005194 InterPro:IPR018026 TIGRFAMs:TIGR00605
OMA:MKRFNKE EMBL:AADN02014130 IPI:IPI00603077
Ensembl:ENSGALT00000010275 ArrayExpress:E1BUG1 Uniprot:E1BUG1
Length = 936
Score = 628 (226.1 bits), Expect = 3.0e-63, Sum P(2) = 3.0e-63
Identities = 214/705 (30%), Positives = 311/705 (44%)
Query: 74 KELAELVHKVHLLCLLARGRLIDSVCDDPLIQAXXXXXXXXXXXKISEVSKLTANALSPI 133
KE+ E HKVHLLCLLA G + +C P + A K+ ++ +S +
Sbjct: 204 KEVREDTHKVHLLCLLANGFYRNRICSQPDLHAIGLSIIPIHFTKVP-AGQVDLLYISNL 262
Query: 134 VSWFHDNFHVRSSVSTRRSFHSDLAHALESREGT-----PEEIAALSVALFRALKLTTRF 188
V WF F V +ST + L LE R EE+ + + + RAL+L R
Sbjct: 263 VKWFVGTFTVNDELSTEKG--EPLQSTLERRFAIYAARDDEELVHIFLIILRALQLLCRL 320
Query: 189 VSILDVASLKPEADKNVSS--NQDSSRVGGGIFNAPTLMVAKPEEVLASPVKSFSCDKKE 246
V L LK E VS Q + + ++ E S + K+
Sbjct: 321 VLSLQPIPLK-ETKAKVSCFLKQKLTTPCSEKSTSKKQSLSSTSEGQESSGTTPKAVAKK 379
Query: 247 NVCETSSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLDPSSSMACSDISEACHPKEKS 306
C+ + + K S + +N +SKK+ S + + P +S S+ C+ +E
Sbjct: 380 CPCKKAKRDE---KSSGSEEDNEESKKTK-SAQTERTH-KPKNSRWRRVASKVCYKEESG 434
Query: 307 QALKRKGDLEFEMQLEMALSATNVAT-SKSNICSDVKDLNSNSSTVLPVKRLKKIESGES 365
D E + E +S + T SK S + S V+ VK K E+ ES
Sbjct: 435 SDEGSVSDFEISGE-ESDISDEDFETVSKKRRSSQ----GAQKSKVMTVKSPKS-ETSES 488
Query: 366 --STSCLGI----------STAVGS---------RKVGAPLYWAEVYCSGENLTGKWVHV 404
S + LG+ + + S RKV W EV+ E+ +WV V
Sbjct: 489 RLSRNSLGVEPRPHAQRKRNKIISSDEDDGQQMVRKVVGTDQWLEVFLERED---RWVCV 545
Query: 405 DAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCGA-KDVTRRYCMKWYRIA-SKRVNSAW 462
D + I+ Q + A K L YIV F G+ KDVT+RY W + KRV+ W
Sbjct: 546 DCVHGIVGQPQ--QCFTYATKP-LSYIVGFDNDGSVKDVTQRYDPVWMTMTRKKRVDPEW 602
Query: 463 WDAVLAPLRELESGATGDLNVESSAKDSFVADRNSLEDMELETRALTEPLPTNQQAYKNH 522
W+ L P K FV DR+ E+ E + + +PLPT YKNH
Sbjct: 603 WEDTLQPY-----------------KSPFV-DRDKKEETEFQVKLQDQPLPTAIGEYKNH 644
Query: 523 QLYVIERWLNKYQILYPKGP-ILGFCSGHAVYPRSCVQTLKTKERWLREALQVKANEX-X 580
LY ++R L KYQ +YP+ ILG+C G AVY R CV TL +K+ WL++A V+ E
Sbjct: 645 PLYALKRHLLKYQAIYPESAAILGYCRGEAVYSRDCVHTLHSKDTWLKQARVVRIGEVPY 704
Query: 581 XXXXXXXXXXXXGQDFEPEDYDEVDARGNIELYGKWQLEPLRLPSAVNGIVPRNERGQVD 640
+ EP + D+ D + L+G+WQ E + P AV+G VPRNE G V
Sbjct: 705 KMVKGYSNQARKARLAEPANRDKAD----LALFGRWQTEEYQPPIAVDGKVPRNEYGNVY 760
Query: 641 VWSEKCLPPGTVHLRLPRVYSVAKRLEIDSAPAMVGFEFRNGRSTPVFDGIVVCAEFKDT 700
++ LP G V LRLP + +A++L+ID A A+ GF+F G S V DG VVC E+K+
Sbjct: 761 LFLPSMLPIGCVQLRLPNLNRLARKLDIDCAQAVTGFDFHGGYSHAVTDGYVVCEEYKEV 820
Query: 701 ILEAYXXXXXXXXXXXXXXXXXQATSRWYQLLSSIVTRQRLNNCY 745
++ A+ +A W L ++ R+RL Y
Sbjct: 821 LIAAWENEQAEIEKKEKEKREKRALGNWKLLTKGLLIRERLKQRY 865
Score = 43 (20.2 bits), Expect = 3.0e-63, Sum P(2) = 3.0e-63
Identities = 9/26 (34%), Positives = 15/26 (57%)
Query: 6 RELDEGRLQDNVLDGGEEMYDSDWED 31
+E+DE DN D ++ + +WED
Sbjct: 119 KEMDE----DNTDDDDDDESEDEWED 140
>UNIPROTKB|E1BDJ1
symbol:XPC "Uncharacterized protein" species:9913 "Bos
taurus" [GO:0071942 "XPC complex" evidence=IEA] [GO:0031573
"intra-S DNA damage checkpoint" evidence=IEA] [GO:0010224 "response
to UV-B" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA]
[GO:0003697 "single-stranded DNA binding" evidence=IEA] [GO:0003684
"damaged DNA binding" evidence=IEA] [GO:0000715
"nucleotide-excision repair, DNA damage recognition" evidence=IEA]
InterPro:IPR004583 InterPro:IPR018325 InterPro:IPR018326
InterPro:IPR018327 InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403
Pfam:PF10404 Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
GO:GO:0003684 GO:GO:0003697 GO:GO:0010224 GO:GO:0031573
GO:GO:0071942 GO:GO:0000715 KO:K10838 PANTHER:PTHR12135
GeneTree:ENSGT00390000005194 InterPro:IPR018026 TIGRFAMs:TIGR00605
CTD:7508 OMA:MKRFNKE EMBL:DAAA02054616 IPI:IPI00702830
RefSeq:NP_001192837.1 UniGene:Bt.45276 Ensembl:ENSBTAT00000009683
GeneID:524274 KEGG:bta:524274 NextBio:20873931 Uniprot:E1BDJ1
Length = 932
Score = 447 (162.4 bits), Expect = 3.1e-60, Sum P(3) = 3.1e-60
Identities = 94/257 (36%), Positives = 138/257 (53%)
Query: 492 VADRNSLEDMELETRALTEPLPTNQQAYKNHQLYVIERWLNKYQILYPK-GPILGFCSGH 550
+ DR ED E + + L +PLPT YKNH LY ++R L KY+ +YP+ +LG+C G
Sbjct: 610 LVDREQREDQEFQAKHLDQPLPTVIGTYKNHPLYALKRHLLKYEAIYPETAAVLGYCRGE 669
Query: 551 AVYPRSCVQTLKTKERWLREALQVKANEX-XXXXXXXXXXXXXGQDFEPEDYDEVDARGN 609
AVY R CV TL +++ WL++A V+ E + EP+ +D D
Sbjct: 670 AVYSRDCVHTLHSRDTWLKQARVVRLGEVPYKMVKGYSNRARRARQAEPQLHDYND---- 725
Query: 610 IELYGKWQLEPLRLPSAVNGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKRLEID 669
+ L+G+WQ E + P AV+G VPRNE G V ++ +P G V L LP ++ VA++L ID
Sbjct: 726 LGLFGRWQTEEYQPPVAVDGKVPRNEFGNVYLFLPSMMPVGCVQLNLPNLHRVARKLNID 785
Query: 670 SAPAMVGFEFRNGRSTPVFDGIVVCAEFKDTILEAYXXXXXXXXXXXXXXXXXQATSRWY 729
A A+ GF+F G P+ DG VVC E++D +L A+ +A W
Sbjct: 786 CAQAVTGFDFHKGYCHPITDGYVVCEEYRDVLLTAWENEQALIEKKEKEKREKRALGNWK 845
Query: 730 QLLSSIVTRQRLNNCYG 746
L+ ++ R+RL YG
Sbjct: 846 LLVKGLLIRERLKLRYG 862
Score = 171 (65.3 bits), Expect = 3.1e-60, Sum P(3) = 3.1e-60
Identities = 58/185 (31%), Positives = 85/185 (45%)
Query: 51 VTIEFDAADSVTKKPVRRASAEDKELAELVHKVHLLCLLARGRLIDSVCDDPLIQAXXXX 110
+ +EF+ + ++ ++R S KE+ E HKVHLLCLLA G +S+C+ P +QA
Sbjct: 179 IKMEFE---TYLRRMMKRFS---KEVHEDTHKVHLLCLLANGFYRNSICNQPDLQAIGLS 232
Query: 111 XXXXXXXKISEVSKLTANALSPIVSWFHDNFHVRSSVSTRRSFHSDLAHALESREGT--- 167
K+ + + LS +V WF F V + +ST L LE R
Sbjct: 233 IIPTRFTKVPP-RDVDVSYLSNLVKWFIGTFTVNAELSTNEQ--DGLQTTLERRFAIYSA 289
Query: 168 --PEEIAALSVALFRALKLTTRFVSILDVASLKPEADKNVSSNQD-SSRVGGGIFNAPTL 224
EE+ + + L RAL L TR V L LK A+K ++ S+ GG A +
Sbjct: 290 RDDEELVHIFLLLLRALHLPTRLVLSLQPVPLKLSAEKGKKPCKERSTEAPGGSSEAASH 349
Query: 225 MVAKP 229
KP
Sbjct: 350 APGKP 354
Score = 120 (47.3 bits), Expect = 3.1e-60, Sum P(3) = 3.1e-60
Identities = 31/87 (35%), Positives = 42/87 (48%)
Query: 387 WAEVYCSGENLTGKWVHVDAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCGA-KDVTRR 445
W EV+ E KWV VD + ++ Q + A K + Y+V G G +DVT+R
Sbjct: 527 WLEVFLEREE---KWVCVDCVHGVVG--QPLTCYQYATKP-VTYVVGIDGAGCVRDVTQR 580
Query: 446 YCMKWYRIASK-RVNSAWWDAVLAPLR 471
Y W K RV++AWW L P R
Sbjct: 581 YDPAWLTATRKSRVDAAWWAETLRPYR 607
Score = 44 (20.5 bits), Expect = 5.8e-47, Sum P(3) = 5.8e-47
Identities = 17/64 (26%), Positives = 31/64 (48%)
Query: 302 PKEKSQALKRKGDLEFEMQLEMALSATNVATSKSNICSDVKDLNSNSSTVLPVKRLK-KI 360
P E+ +A KG E + + + V + DV + + S++ LPVK ++ +I
Sbjct: 106 PPER-EAAADKGSCEGDDEEDSEEDWEEVEEVSEPVPGDVGESGAFSASALPVKPVEIEI 164
Query: 361 ESGE 364
E+ E
Sbjct: 165 ETPE 168
Score = 37 (18.1 bits), Expect = 1.4e-07, Sum P(2) = 1.4e-07
Identities = 19/66 (28%), Positives = 24/66 (36%)
Query: 539 PKG--PILGFCSGHAVYPRSCVQTLKTKERWLREALQVKANEXXXXXXXXXXXXXXGQDF 596
P+G P G +G A R Q + + R R A +V E G DF
Sbjct: 386 PRGESPSSGEDAGQARGQRRGTQR-RAQARRRRVAAKVSYKEESGSDAASS-----GSDF 439
Query: 597 EPEDYD 602
EP D
Sbjct: 440 EPSSED 445
>RGD|1305760
symbol:Xpc "xeroderma pigmentosum, complementation group C"
species:10116 "Rattus norvegicus" [GO:0000075 "cell cycle
checkpoint" evidence=ISO] [GO:0000715 "nucleotide-excision repair,
DNA damage recognition" evidence=IEA;ISO] [GO:0003674
"molecular_function" evidence=ND] [GO:0003684 "damaged DNA binding"
evidence=IEA;ISO] [GO:0003697 "single-stranded DNA binding"
evidence=IEA;ISO] [GO:0005634 "nucleus" evidence=ISO;IDA]
[GO:0005737 "cytoplasm" evidence=ISO;IDA] [GO:0006281 "DNA repair"
evidence=ISO] [GO:0006289 "nucleotide-excision repair"
evidence=ISO] [GO:0006974 "response to DNA damage stimulus"
evidence=ISO] [GO:0010224 "response to UV-B" evidence=IEA;ISO]
[GO:0031573 "intra-S DNA damage checkpoint" evidence=IEA;ISO]
[GO:0042493 "response to drug" evidence=IEP] [GO:0071942 "XPC
complex" evidence=IEA;ISO] InterPro:IPR004583 InterPro:IPR018325
InterPro:IPR018326 InterPro:IPR018327 InterPro:IPR018328
Pfam:PF03835 Pfam:PF10403 Pfam:PF10404 Pfam:PF10405 SMART:SM01030
SMART:SM01031 SMART:SM01032 RGD:1305760 GO:GO:0005634 GO:GO:0005737
GO:GO:0042493 GO:GO:0003684 GO:GO:0003697 GO:GO:0010224
EMBL:CH473957 GO:GO:0031573 GO:GO:0071942 GO:GO:0000715 KO:K10838
PANTHER:PTHR12135 GeneTree:ENSGT00390000005194 InterPro:IPR018026
TIGRFAMs:TIGR00605 CTD:7508 OMA:MKRFNKE OrthoDB:EOG40CHGQ
IPI:IPI00365175 RefSeq:NP_001101344.1 UniGene:Rn.22820
Ensembl:ENSRNOT00000011490 GeneID:312560 KEGG:rno:312560
UCSC:RGD:1305760 NextBio:664995 Uniprot:D4A3D8
Length = 933
Score = 587 (211.7 bits), Expect = 1.9e-56, P = 1.9e-56
Identities = 220/779 (28%), Positives = 339/779 (43%)
Query: 17 VLDGGEEMYDS--DWEDG---SIPVACSKENHP--ESD--IKGVTIEFDAADSVTKKP-- 65
V+D G + DS DWE+ + PV EN SD +K V IE + + +
Sbjct: 117 VVDQGTDEDDSEDDWEEVEELTEPVLDMGENSATSRSDLPVKAVEIEIETPEQAKARERS 176
Query: 66 ----------VRRASAE-DKELAELVHKVHLLCLLARGRLIDSVCDDPLIQAXXXXXXXX 114
+RR +KE+ E +HKVHLLCLLA G +S+C P + A
Sbjct: 177 EKIKMEFETYLRRMMKRFNKEVQENMHKVHLLCLLASGFYRNSICQQPDLLAIGLSIIPI 236
Query: 115 XXXKISEVSKLTANALSPIVSWFHDNFHVRS--SVSTRRSFHSDLAH--ALESREGTPEE 170
K+ + LS +V WF F V + S S + S + L A+ S EE
Sbjct: 237 RFTKVP-LQDRDVYYLSNLVKWFIGTFTVNADLSASEQDSLQTTLERRIAIYSARDN-EE 294
Query: 171 IAALSVALFRALKLTTRFVSILDVASLKPEADKNVSSNQDSSRVGGGIFNAPTLMVAKPE 230
+ + + + RAL+L TR V L LK K S++++S G G + P+ + PE
Sbjct: 295 LVHIFLLILRALQLLTRLVLSLQPIPLKSAVAKGKKSSKETSLEGPGDSSEPSSNI--PE 352
Query: 231 EVLASPVKSFSCDKKENVCETSSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLDPSSS 290
P S ++E + E S K + K + + + Q +K P SC S G
Sbjct: 353 SH-NKPKTSKRIKQEETLSEGSGKANARGKRGTATAGSRQQRK-P-SC--SEGE------ 401
Query: 291 MACSDISEACHPKEKSQALKRKGDLEFEMQLEMALSATN--VATSKSNICSDVKDLNSNS 348
A +I HP+ + + + K + E + + A S ++ +++ + SD +D
Sbjct: 402 EAKQEIQS--HPQAQKRRVAAKVSYKEESESDGAGSGSDFELSSGEGQHSSD-EDCKPGP 458
Query: 349 STVLPVKRLKKIESGESSTSCL--GI-----STAVGSRKVGAPLYWAEVYCSGENLTGK- 400
++ ++G S S G S +V S A ++ C GE +
Sbjct: 459 RKQKRASAPQRSKAGSKSASKTQSGSQWEPPSFSVASSSSSACKRGKKISCGGEETDDRK 518
Query: 401 ------WVHV----DAANAIIDGEQKVEAAAAAC-KTSLRYIVAFAGCGAKDVTRRYCMK 449
W+ V A +D V AC K + + + G +
Sbjct: 519 AAGVDQWLEVFCEPQAKWVCVDCVHGVVGQPVACYKYATKPMTYVVGIDSDG-------- 570
Query: 450 WYRIASKRVNSAWWDAVLAPLRELESGATGDLNVESSAKDSFVADRNSLEDMELETRALT 509
W R ++R + AW A + E A L S + +R ED E + + L
Sbjct: 571 WVRDVTQRYDPAWMTATRKCRVDAEWWAE-TLRPYRSP----LTEREKKEDQEFQAKHLD 625
Query: 510 EPLPTNQQAYKNHQLYVIERWLNKYQILYPKGP-ILGFCSGHAVYPRSCVQTLKTKERWL 568
+PLPT+ YKNH LY ++R L K+Q +YP+ +LG+C G AVY R CV TL +++ WL
Sbjct: 626 QPLPTSISTYKNHPLYALKRHLLKFQAIYPESAAVLGYCRGEAVYSRDCVHTLHSRDTWL 685
Query: 569 REALQVKANEX-XXXXXXXXXXXXXGQDFEPEDYDEVDARGNIELYGKWQLEPLRLPSAV 627
++A V+ E + EP+ +D D + L+G WQ E + P AV
Sbjct: 686 KQARVVRLGEVPYKMVKGFSNRARKARLSEPQLHDHND----LGLFGHWQTEEYQPPVAV 741
Query: 628 NGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKRLEIDSAPAMVGFEFRNGRSTPV 687
+G VPRNE G V ++ +P G V + LP ++ VA++L ID A+ GF+F G PV
Sbjct: 742 DGKVPRNEFGNVYLFLPSMMPIGCVQMNLPNLHRVARKLGIDCVQAITGFDFHGGYCHPV 801
Query: 688 FDGIVVCAEFKDTILEAYXXXXXXXXXXXXXXXXXQATSRWYQLLSSIVTRQRLNNCYG 746
DG VVC EF+D +L A+ +A W L+ ++ R+RL YG
Sbjct: 802 TDGYVVCEEFRDVLLAAWENEQALIEKKEKEKKEKRALGNWKLLVRGLLIRERLKLRYG 860
>UNIPROTKB|E9PH69
symbol:XPC "DNA repair protein-complementing XP-C cells"
species:9606 "Homo sapiens" [GO:0003684 "damaged DNA binding"
evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0006289
"nucleotide-excision repair" evidence=IEA] InterPro:IPR004583
InterPro:IPR018325 InterPro:IPR018326 InterPro:IPR018327
InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403 Pfam:PF10404
Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
GO:GO:0005634 GO:GO:0003684 GO:GO:0006289 KO:K10838
PANTHER:PTHR12135 InterPro:IPR018026 TIGRFAMs:TIGR00605
EMBL:AC093495 EMBL:FJ695191 EMBL:FJ695192 RefSeq:NP_001139241.1
UniGene:Hs.475538 UniGene:Hs.739296 GeneID:7508 KEGG:hsa:7508
CTD:7508 HGNC:HGNC:12816 ChiTaRS:XPC GenomeRNAi:7508 NextBio:29391
IPI:IPI00924991 ProteinModelPortal:E9PH69 SMR:E9PH69 PRIDE:E9PH69
Ensembl:ENST00000449060 UCSC:uc011avg.2 ArrayExpress:E9PH69
Bgee:E9PH69 Uniprot:E9PH69
Length = 903
Score = 581 (209.6 bits), Expect = 6.9e-56, P = 6.9e-56
Identities = 217/764 (28%), Positives = 327/764 (42%)
Query: 22 EEMYDSDWEDGSIPVACSKENHPESDIKGVTIEFDAADSVTKKPVRRASAEDKELAELVH 81
EE ++DWE+ +K IK +EF+ + ++ ++R + K + E H
Sbjct: 126 EEESENDWEE-------AKTRERSEKIK---LEFE---TYLRRAMKRFN---KGVHEDTH 169
Query: 82 KVHLLCLLARGRLIDSVCDDPLIQAXXXXXXXXXXXKISEVSKLTANALSPIVSWFHDNF 141
KVHLLCLLA G +++C P + A ++ + LS +V WF F
Sbjct: 170 KVHLLCLLANGFYRNNICSQPDLHAIGLSIIPARFTRVLP-RDVDTYYLSNLVKWFIGTF 228
Query: 142 HVRSSVSTRRSFHSDLAHALESREGT-----PEEIAALSVALFRALKLTTRFVSILDVAS 196
V + +S S +L LE R EE+ + + + RAL+L TR V L
Sbjct: 229 TVNAELSA--SEQDNLQTTLERRFAIYSARDDEELVHIFLLILRALQLLTRLVLSLQPIP 286
Query: 197 LKPEADKNVSSNQDSSRVGGGIFNAPTLMVAKPEEVLASPVKSFSCDKKENVCETSSKGS 256
LK K +++ G + + V + P K+ K+E ET +KG+
Sbjct: 287 LKSATAKGKKPSKERLTADPGGSSETSSQVLENH---TKP-KTSKGTKQE---ETFAKGT 339
Query: 257 PECKYSSPKSNNTQSKKS---PVSCELSSGNLDPSSSMACSDISEACHPKEKSQALKRKG 313
C+ S+ N +K P S E G D H +E+ A R
Sbjct: 340 --CRPSAKGKRNKGGRKKRSKPSSSEEDEGPGDKQEKAT----QRRPHGRERRVA-SRVS 392
Query: 314 DLEFEMQLEMALSATNVATSKSNICSDVKDLNSN----SSTVLPVKRLKKIESGESSTSC 369
E E + A S ++ S S SD D +S P + K S +S +
Sbjct: 393 YKE-ESGSDEAGSGSDFELS-SGEASDPSDEDSEPGPPKQRKAPAPQRTKAGSKSASRTH 450
Query: 370 LG----------ISTAVGSRKVGAPLYWAEVYCSGENLTG-------------KWVHVDA 406
G S++ S K G + ++ G KWV VD
Sbjct: 451 RGSHRKDPSLPAASSSSSSSKRGKKMCSDGEKAEKRSIAGIDQWLEVFCEQEEKWVCVDC 510
Query: 407 ANAIIDGEQKVEAAAAACKTSLRYIVAFAGCG-AKDVTRRYCMKWYRIASK-RVNSAWWD 464
+ ++ Q + A K + Y+V G +DVT+RY W + K RV++ WW
Sbjct: 511 VHGVVG--QPLTCYKYATKP-MTYVVGIDSDGWVRDVTQRYDPVWMTVTRKCRVDAEWWA 567
Query: 465 AVLAPLRELESGATGDLNVESSAKDSFVADRNSLEDMELETRALTEPLPTNQQAYKNHQL 524
L P + F+ DR ED+E + + + +PLPT YKNH L
Sbjct: 568 ETLRPYQS-----------------PFM-DREKKEDLEFQAKHMDQPLPTAIGLYKNHPL 609
Query: 525 YVIERWLNKYQILYPK-GPILGFCSGHAVYPRSCVQTLKTKERWLREALQVKANEX-XXX 582
Y ++R L KY+ +YP+ ILG+C G AVY R CV TL +++ WL++A V+ E
Sbjct: 610 YALKRHLLKYEAIYPETAAILGYCRGEAVYSRDCVHTLHSRDTWLKKARVVRLGEVPYKM 669
Query: 583 XXXXXXXXXXGQDFEPEDYDEVDARGNIELYGKWQLEPLRLPSAVNGIVPRNERGQVDVW 642
+ EP+ +E D + L+G WQ E + P AV+G VPRNE G V ++
Sbjct: 670 VKGFSNRARKARLAEPQLREEND----LGLFGYWQTEEYQPPVAVDGKVPRNEFGNVYLF 725
Query: 643 SEKCLPPGTVHLRLPRVYSVAKRLEIDSAPAMVGFEFRNGRSTPVFDGIVVCAEFKDTIL 702
+P G V L LP ++ VA++L+ID A+ GF+F G S PV DG +VC EFKD +L
Sbjct: 726 LPSMMPIGCVQLNLPNLHRVARKLDIDCVQAITGFDFHGGYSHPVTDGYIVCEEFKDVLL 785
Query: 703 EAYXXXXXXXXXXXXXXXXXQATSRWYQLLSSIVTRQRLNNCYG 746
A+ +A W L ++ R+RL YG
Sbjct: 786 TAWENEQAVIERKEKEKKEKRALGNWKLLAKGLLIRERLKRRYG 829
>UNIPROTKB|Q01831
symbol:XPC "DNA repair protein complementing XP-C cells"
species:9606 "Homo sapiens" [GO:0010224 "response to UV-B"
evidence=IEA] [GO:0031573 "intra-S DNA damage checkpoint"
evidence=IEA] [GO:0042493 "response to drug" evidence=IEA]
[GO:0000075 "cell cycle checkpoint" evidence=IMP] [GO:0000405
"bubble DNA binding" evidence=TAS] [GO:0003684 "damaged DNA
binding" evidence=IDA] [GO:0000715 "nucleotide-excision repair, DNA
damage recognition" evidence=IDA;TAS] [GO:0000404 "loop DNA
binding" evidence=TAS] [GO:0071942 "XPC complex" evidence=IDA]
[GO:0006289 "nucleotide-excision repair" evidence=IDA;TAS]
[GO:0003697 "single-stranded DNA binding" evidence=IDA] [GO:0005634
"nucleus" evidence=IDA] [GO:0005737 "cytoplasm" evidence=IDA]
[GO:0000718 "nucleotide-excision repair, DNA damage removal"
evidence=TAS] [GO:0005654 "nucleoplasm" evidence=TAS] [GO:0006281
"DNA repair" evidence=TAS] [GO:0005515 "protein binding"
evidence=IPI] Reactome:REACT_216 InterPro:IPR004583
InterPro:IPR018325 InterPro:IPR018326 InterPro:IPR018327
InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403 Pfam:PF10404
Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
GO:GO:0005737 GO:GO:0005654 GO:GO:0042493 GO:GO:0003684
GO:GO:0003697 GO:GO:0010224 GO:GO:0000075 GO:GO:0000405
GO:GO:0031573 GO:GO:0000718 GO:GO:0071942 PDB:2A4J PDB:2GGM
PDB:2OBH PDBsum:2A4J PDBsum:2GGM PDBsum:2OBH GO:GO:0000715
GO:GO:0000404 KO:K10838 eggNOG:COG5535 PANTHER:PTHR12135
InterPro:IPR018026 TIGRFAMs:TIGR00605 EMBL:D21089 EMBL:AF261901
EMBL:AF261892 EMBL:AF261893 EMBL:AF261894 EMBL:AF261895
EMBL:AF261896 EMBL:AF261897 EMBL:AF261898 EMBL:AF261899
EMBL:AF261900 EMBL:AY131066 EMBL:AC093495 EMBL:FJ695191
EMBL:FJ695192 EMBL:BC016620 EMBL:AK222844 EMBL:X65024
IPI:IPI00156793 PIR:S44345 RefSeq:NP_001139241.1 RefSeq:NP_004619.3
UniGene:Hs.475538 UniGene:Hs.739296 ProteinModelPortal:Q01831
SMR:Q01831 DIP:DIP-31225N IntAct:Q01831 MINT:MINT-105410
STRING:Q01831 PhosphoSite:Q01831 DMDM:296453081 PaxDb:Q01831
PeptideAtlas:Q01831 PRIDE:Q01831 Ensembl:ENST00000285021
GeneID:7508 KEGG:hsa:7508 UCSC:uc011ave.2 CTD:7508
GeneCards:GC03M014161 HGNC:HGNC:12816 HPA:CAB009932 MIM:278720
MIM:613208 neXtProt:NX_Q01831 Orphanet:276255 PharmGKB:PA37413
HOGENOM:HOG000124671 HOVERGEN:HBG000407 InParanoid:Q01831
OMA:MKRFNKE OrthoDB:EOG40CHGQ ChiTaRS:XPC EvolutionaryTrace:Q01831
GenomeRNAi:7508 NextBio:29391 ArrayExpress:Q01831 Bgee:Q01831
CleanEx:HS_XPC Genevestigator:Q01831 GermOnline:ENSG00000154767
Uniprot:Q01831
Length = 940
Score = 578 (208.5 bits), Expect = 1.9e-55, P = 1.9e-55
Identities = 209/721 (28%), Positives = 309/721 (42%)
Query: 66 VRRASAE-DKELAELVHKVHLLCLLARGRLIDSVCDDPLIQAXXXXXXXXXXXKISEVSK 124
+RRA +K + E HKVHLLCLLA G +++C P + A ++
Sbjct: 190 LRRAMKRFNKGVHEDTHKVHLLCLLANGFYRNNICSQPDLHAIGLSIIPARFTRVLP-RD 248
Query: 125 LTANALSPIVSWFHDNFHVRSSVSTRRSFHSDLAHALESREGT-----PEEIAALSVALF 179
+ LS +V WF F V + +S S +L LE R EE+ + + +
Sbjct: 249 VDTYYLSNLVKWFIGTFTVNAELSA--SEQDNLQTTLERRFAIYSARDDEELVHIFLLIL 306
Query: 180 RALKLTTRFVSILDVASLKPEADKNVSSNQDSSRVGGGIFNAPTLMVAKPEEVLASPVKS 239
RAL+L TR V L LK K +++ G + + V + P K+
Sbjct: 307 RALQLLTRLVLSLQPIPLKSATAKGKKPSKERLTADPGGSSETSSQVLENH---TKP-KT 362
Query: 240 FSCDKKENVCETSSKGSPECKYSSPKSNNTQSKKS---PVSCELSSGNLDPSSSMACSDI 296
K+E ET +KG+ C+ S+ N +K P S E G D
Sbjct: 363 SKGTKQE---ETFAKGT--CRPSAKGKRNKGGRKKRSKPSSSEEDEGPGDKQEKAT---- 413
Query: 297 SEACHPKEKSQALKRKGDLEFEMQLEMALSATNVATSKSNICSDVKDLNSN----SSTVL 352
H +E+ A R E E + A S ++ S S SD D +S
Sbjct: 414 QRRPHGRERRVA-SRVSYKE-ESGSDEAGSGSDFELS-SGEASDPSDEDSEPGPPKQRKA 470
Query: 353 PVKRLKKIESGESSTSCLG----------ISTAVGSRKVGAPLYWAEVYCSGENLTG--- 399
P + K S +S + G S++ S K G + ++ G
Sbjct: 471 PAPQRTKAGSKSASRTHRGSHRKDPSLPAASSSSSSSKRGKKMCSDGEKAEKRSIAGIDQ 530
Query: 400 ----------KWVHVDAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCG-AKDVTRRYCM 448
KWV VD + ++ Q + A K + Y+V G +DVT+RY
Sbjct: 531 WLEVFCEQEEKWVCVDCVHGVVG--QPLTCYKYATKP-MTYVVGIDSDGWVRDVTQRYDP 587
Query: 449 KWYRIASK-RVNSAWWDAVLAPLRELESGATGDLNVESSAKDSFVADRNSLEDMELETRA 507
W + K RV++ WW L P + F+ DR ED+E + +
Sbjct: 588 VWMTVTRKCRVDAEWWAETLRPYQS-----------------PFM-DREKKEDLEFQAKH 629
Query: 508 LTEPLPTNQQAYKNHQLYVIERWLNKYQILYPK-GPILGFCSGHAVYPRSCVQTLKTKER 566
+ +PLPT YKNH LY ++R L KY+ +YP+ ILG+C G AVY R CV TL +++
Sbjct: 630 MDQPLPTAIGLYKNHPLYALKRHLLKYEAIYPETAAILGYCRGEAVYSRDCVHTLHSRDT 689
Query: 567 WLREALQVKANEX-XXXXXXXXXXXXXGQDFEPEDYDEVDARGNIELYGKWQLEPLRLPS 625
WL++A V+ E + EP+ +E D + L+G WQ E + P
Sbjct: 690 WLKKARVVRLGEVPYKMVKGFSNRARKARLAEPQLREEND----LGLFGYWQTEEYQPPV 745
Query: 626 AVNGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKRLEIDSAPAMVGFEFRNGRST 685
AV+G VPRNE G V ++ +P G V L LP ++ VA++L+ID A+ GF+F G S
Sbjct: 746 AVDGKVPRNEFGNVYLFLPSMMPIGCVQLNLPNLHRVARKLDIDCVQAITGFDFHGGYSH 805
Query: 686 PVFDGIVVCAEFKDTILEAYXXXXXXXXXXXXXXXXXQATSRWYQLLSSIVTRQRLNNCY 745
PV DG +VC EFKD +L A+ +A W L ++ R+RL Y
Sbjct: 806 PVTDGYIVCEEFKDVLLTAWENEQAVIERKEKEKKEKRALGNWKLLAKGLLIRERLKRRY 865
Query: 746 G 746
G
Sbjct: 866 G 866
>UNIPROTKB|F1SPI2
symbol:XPC "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0071942 "XPC complex" evidence=IEA] [GO:0031573
"intra-S DNA damage checkpoint" evidence=IEA] [GO:0010224 "response
to UV-B" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA]
[GO:0003697 "single-stranded DNA binding" evidence=IEA] [GO:0003684
"damaged DNA binding" evidence=IEA] [GO:0000715
"nucleotide-excision repair, DNA damage recognition" evidence=IEA]
InterPro:IPR004583 InterPro:IPR018325 InterPro:IPR018326
InterPro:IPR018327 InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403
Pfam:PF10404 Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
GO:GO:0003684 GO:GO:0003697 GO:GO:0010224 GO:GO:0031573
GO:GO:0071942 GO:GO:0000715 KO:K10838 PANTHER:PTHR12135
GeneTree:ENSGT00390000005194 InterPro:IPR018026 TIGRFAMs:TIGR00605
CTD:7508 OMA:MKRFNKE EMBL:CU633560 RefSeq:XP_003132441.1
Ensembl:ENSSSCT00000012699 GeneID:100514251 KEGG:ssc:100514251
ArrayExpress:F1SPI2 Uniprot:F1SPI2
Length = 944
Score = 428 (155.7 bits), Expect = 1.1e-50, Sum P(2) = 1.1e-50
Identities = 98/299 (32%), Positives = 147/299 (49%)
Query: 450 WYRIASKRVNSAWWDAVLAPLRELESGATGDLNVESSAKDSFVADRNSLEDMELETRALT 509
W R ++R + AW A R+ A + + +R ED E + + L
Sbjct: 583 WVRDVTQRYDPAWMTAT----RKCRVDAVWWAETLRPYRSPLL-EREQREDQEFQAKHLD 637
Query: 510 EPLPTNQQAYKNHQLYVIERWLNKYQILYPK-GPILGFCSGHAVYPRSCVQTLKTKERWL 568
+P+PT YKNH LY ++R L KY+ +YP+ ILG+C G AVY R CV TL +++ WL
Sbjct: 638 QPMPTVIGTYKNHPLYALKRHLLKYEAIYPETAAILGYCRGEAVYSRDCVHTLHSRDTWL 697
Query: 569 REALQVKANEX-XXXXXXXXXXXXXGQDFEPEDYDEVDARGNIELYGKWQLEPLRLPSAV 627
++ V+ E + EP+ D D + L+G+WQ E + P AV
Sbjct: 698 KQGRVVRLGEVPYKMVKGYSNRARKARLAEPQLRDHND----LPLFGQWQTEEYQPPVAV 753
Query: 628 NGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKRLEIDSAPAMVGFEFRNGRSTPV 687
+G VPRNE G V ++ +P G V L LP + VA++L ID A+ GF+F G S P+
Sbjct: 754 DGKVPRNEFGNVYLFLPSMMPVGCVQLNLPNLQRVARKLNIDCVQAITGFDFHKGYSHPI 813
Query: 688 FDGIVVCAEFKDTILEAYXXXXXXXXXXXXXXXXXQATSRWYQLLSSIVTRQRLNNCYG 746
DG +VC E++D +L A+ + W L+ ++ R+RL YG
Sbjct: 814 TDGYIVCEEYRDILLAAWENEQALIEKKEKEKKEKRTLGNWKLLVKGLLIRERLRLRYG 872
Score = 179 (68.1 bits), Expect = 1.1e-50, Sum P(2) = 1.1e-50
Identities = 81/304 (26%), Positives = 123/304 (40%)
Query: 73 DKELAELVHKVHLLCLLARGRLIDSVCDDPLIQAXXXXXXXXXXXKISEVSKLTANALSP 132
+KE+ E HKVHLLCLLA G +S+C P ++A K+ + LS
Sbjct: 200 NKEVHEDTHKVHLLCLLANGFYRNSICSQPDLRAIGLSIIPTRFTKVPP-QDVDVCYLSN 258
Query: 133 IVSWFHDNFHVRSSVSTRRSFHSDLAHALESREGT-----PEEIAALSVALFRALKLTTR 187
+V WF F V + +ST L LE R EE+ + + + RAL L+ R
Sbjct: 259 LVKWFIGTFTVNADLSTNEQ--DGLQTTLERRFAIYSARDDEELVHIFLLIIRALHLSAR 316
Query: 188 FVSILDVASLKPEADKNVSSNQDSSRVGGGIFNAPTLMVAKPEEVLASPVKSFSCDKKEN 247
V L LK A K ++++ S G G ++ T + P + +KS S +++E+
Sbjct: 317 LVLSLQPIPLKSSAAKGKKASKERSTEGPGC-SSET---SSPGPAKQTKLKSSSGNRRED 372
Query: 248 VCETSSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLDPSSSMACSDISEACHPKE--K 305
+ G P K K+ K+ S SSG A EA P +
Sbjct: 373 PSSEGTSG-PRAKGKGSKAAAATKKQREPS---SSGE---EEGKAAGQQGEARRPARGRR 425
Query: 306 SQALKRKGDLEFEMQLEMALSATNVATSKSNI-CSDVKDLNSNSSTVLPVKRLKKIESGE 364
QA R E E + A S+++ S + C +D L + ++G
Sbjct: 426 RQAATRVSYKE-ESGSDKASSSSDFELSSGDSHCPSDEDSEPGLRRQRRAPGLPRTKAGA 484
Query: 365 SSTS 368
S S
Sbjct: 485 KSDS 488
Score = 138 (53.6 bits), Expect = 2.2e-46, Sum P(2) = 2.2e-46
Identities = 68/247 (27%), Positives = 104/247 (42%)
Query: 238 KSFSCDKKENVCETSSKGSPECKYSS-------PKSNNTQSKKSPVSCELSSGNLDPSSS 290
K+ + KK+ E SS G E K + P + + VS + SG+ D +SS
Sbjct: 389 KAAAATKKQR--EPSSSGEEEGKAAGQQGEARRPARGRRRQAATRVSYKEESGS-DKASS 445
Query: 291 MACSDISEA---CHPKEKSQ-ALKRKGDLEFEMQLEMALSATNVATSKSNICSDVKDLNS 346
+ ++S C E S+ L+R+ L + + S+S S K
Sbjct: 446 SSDFELSSGDSHCPSDEDSEPGLRRQRRAP---GLPRTKAGAK-SDSRSQRGSHPKPPGF 501
Query: 347 NSSTVLPVKRLKKIESGESSTSCLGISTAVGSRKVGAPLYWAEVYCSGENLTGKWVHVDA 406
+++ P +K G TS G A G + G +W EV+C E+ KWV VD
Sbjct: 502 LAASAGPPGSKRK---GGKKTSVRG-EEADGGKVAGVD-HWLEVFCERED---KWVCVDC 553
Query: 407 ANAIIDGEQKVEAAAAACKTSLRYIVAFAGCG-AKDVTRRYCMKWYRIASK-RVNSAWWD 464
+ ++ Q + A K + Y+V G G +DVT+RY W K RV++ WW
Sbjct: 554 VHGVVG--QPLTCYQYATKP-MTYVVGIDGDGWVRDVTQRYDPAWMTATRKCRVDAVWWA 610
Query: 465 AVLAPLR 471
L P R
Sbjct: 611 ETLRPYR 617
>UNIPROTKB|E2RCR3
symbol:XPC "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0006289 "nucleotide-excision repair"
evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0003684
"damaged DNA binding" evidence=IEA] InterPro:IPR004583
InterPro:IPR018325 InterPro:IPR018326 InterPro:IPR018327
InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403 Pfam:PF10404
Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
GO:GO:0005634 GO:GO:0003684 GO:GO:0006289 PANTHER:PTHR12135
GeneTree:ENSGT00390000005194 InterPro:IPR018026 TIGRFAMs:TIGR00605
OMA:MKRFNKE EMBL:AAEX03012049 Ensembl:ENSCAFT00000007204
Uniprot:E2RCR3
Length = 949
Score = 448 (162.8 bits), Expect = 4.9e-50, Sum P(2) = 4.9e-50
Identities = 96/259 (37%), Positives = 139/259 (53%)
Query: 490 SFVADRNSLEDMELETRALTEPLPTNQQAYKNHQLYVIERWLNKYQILYPK-GPILGFCS 548
S + +R ED E + + L +PLPT YKNH LY ++R L KY+ +YP+ ILG+C
Sbjct: 622 SLLVEREKKEDSEFQAKHLGQPLPTVIGTYKNHPLYALKRHLLKYEAIYPETAAILGYCR 681
Query: 549 GHAVYPRSCVQTLKTKERWLREALQVKANEX-XXXXXXXXXXXXXGQDFEPEDYDEVDAR 607
G AVY R CV TL +++ WL++A V+ E + EP+ D+ D
Sbjct: 682 GEAVYSRDCVHTLHSRDTWLKQARVVRLGEVPYKMVKGYSNRARKARLAEPQLQDQND-- 739
Query: 608 GNIELYGKWQLEPLRLPSAVNGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKRLE 667
+ L+GKWQ E + P AV+G VPRNE G V ++ +P G V L LP ++ VA++L+
Sbjct: 740 --LGLFGKWQTEEYQPPVAVDGKVPRNEFGNVYLFLPSMMPVGCVQLNLPNLHRVARKLD 797
Query: 668 IDSAPAMVGFEFRNGRSTPVFDGIVVCAEFKDTILEAYXXXXXXXXXXXXXXXXXQATSR 727
ID A+ GF+F G S P+ DG +VC E+KD +L A+ +A
Sbjct: 798 IDCVQAITGFDFHKGYSHPITDGYIVCEEYKDVLLAAWENEQALIEKREKEKREKRALGN 857
Query: 728 WYQLLSSIVTRQRLNNCYG 746
W L ++ R+RL YG
Sbjct: 858 WKLLARGLLIRERLKLRYG 876
Score = 151 (58.2 bits), Expect = 4.9e-50, Sum P(2) = 4.9e-50
Identities = 59/211 (27%), Positives = 95/211 (45%)
Query: 51 VTIEFDAADSVTKKPVRRASAEDKELAELVHKVHLLCLLARGRLIDSVCDDPLIQAXXXX 110
+ +EF+ + ++ ++R S KE+ E HKVHLLCLLA G ++C+ P + A
Sbjct: 188 IKVEFE---TYLRRMMKRFS---KEVREDTHKVHLLCLLANGFYRSNICNQPDLLAIGLS 241
Query: 111 XXXXXXXKISEVSKLTANALSPIVSWFHDNFHVRSSVSTRRSFHSDLAHALESREGT--- 167
++ + + LS +V WF F V + +ST L LE R
Sbjct: 242 IVPTRFTRVPP-QDVDSGYLSNLVKWFVGTFTVNADLSTNEQ--DGLQTTLERRFAIYSA 298
Query: 168 --PEEIAALSVALFRALKLTTRFVSILDVASLK-PEADKNVSSNQDSSRVGGGIFNAPTL 224
EE+ + + + RAL+L TR V L LK P A ++ + S+ G +L
Sbjct: 299 RDDEELVHIFLLILRALQLPTRLVLSLQPLPLKLPTAKGKKATTEKSAEDPGS-----SL 353
Query: 225 MVAKPEEVLASPVKSFSCDKKENVCETSSKG 255
+ P + K+ ++E +TSSKG
Sbjct: 354 ETSSPVAEGQTKPKTSKGTRQE---DTSSKG 381
Score = 128 (50.1 bits), Expect = 1.3e-47, Sum P(2) = 1.3e-47
Identities = 82/302 (27%), Positives = 120/302 (39%)
Query: 195 ASLKPEADKNVSSNQDSSRVGGGIFNAPTLMVAKPEEVLASPVKSFSCDKKENVC----- 249
A+ + A+ SS + SS V G T + E+ + + S S K+
Sbjct: 340 ATTEKSAEDPGSSLETSSPVAEGQTKPKTSKGTRQEDTSSKGLGSTSAKGKKGKAAAVGK 399
Query: 250 ---ETSSKGSPECKYSSPKSNNTQSKK--------SPVSCELSSGNLDPSSSMACSDIS- 297
E SS G E K + + TQ ++ S VS + S + D SS + ++S
Sbjct: 400 RRREPSSSGEEERK-AGGQEEETQRRRYGRERQVASRVSYKEESAS-DKGSSGSDFELSS 457
Query: 298 -EACHPK-EKSQALKRKGDLEFEMQLEMALSATNVATSKSNICSDVK-DLNSNSSTVLPV 354
EA H E S+ + + Q A S T+ T S SS+
Sbjct: 458 GEAHHSSDEDSEPVLPRQRRAPGPQRTKAGSRTDSRTQSGRPSKHPGFPAASTSSSSSKS 517
Query: 355 KRLKKIES-GESSTSCLGISTAVGSRKVGAPLYWAEVYCSGENLTGKWVHVDAANAIIDG 413
K+ KKI S GE + RK W EV+C E KWV VD + ++
Sbjct: 518 KQGKKISSDGEGAER----------RKAAGVDQWLEVFCEQEE---KWVCVDCVHGVVG- 563
Query: 414 EQKVEAAAAACKTSLRYIVAFAGCGA-KDVTRRYCMKWYRIASK-RVNSAWWDAVLAPLR 471
Q + A K + Y+V G G+ +DVT+RY W K RV++ WW L P +
Sbjct: 564 -QALACYKYATKP-MTYVVGIDGDGSVRDVTQRYDPAWMTATRKCRVDAKWWAETLRPYQ 621
Query: 472 EL 473
L
Sbjct: 622 SL 623
Score = 52 (23.4 bits), Expect = 1.2e-39, Sum P(2) = 1.2e-39
Identities = 18/74 (24%), Positives = 36/74 (48%)
Query: 294 SDISEACHPKEKSQALKRKGDLEFEMQLEMALSATNVATSKSNICSDVKDLNSNSSTVLP 353
+ + + HP ++ A+ KG E + + E V + DV + + S +VLP
Sbjct: 107 ASVRKKAHPSQREAAVD-KGSCEEDDEEESEDEWEEVEELGEPVPGDVGENAAFSKSVLP 165
Query: 354 VKRLK-KIESGESS 366
VK ++ +IE+ + +
Sbjct: 166 VKPVEIEIETPQQA 179
>ZFIN|ZDB-GENE-030131-8461
symbol:xpc "xeroderma pigmentosum, complementation
group C" species:7955 "Danio rerio" [GO:0003677 "DNA binding"
evidence=IEA] [GO:0003684 "damaged DNA binding" evidence=IEA]
[GO:0006289 "nucleotide-excision repair" evidence=IEA] [GO:0005634
"nucleus" evidence=IEA] InterPro:IPR004583 InterPro:IPR018325
InterPro:IPR018326 InterPro:IPR018327 InterPro:IPR018328
Pfam:PF03835 Pfam:PF10403 Pfam:PF10404 Pfam:PF10405 SMART:SM01030
SMART:SM01031 SMART:SM01032 ZFIN:ZDB-GENE-030131-8461 GO:GO:0005634
GO:GO:0003684 GO:GO:0006289 KO:K10838 PANTHER:PTHR12135
GeneTree:ENSGT00390000005194 CTD:7508 HOVERGEN:HBG000407
OMA:MKRFNKE EMBL:BX784025 IPI:IPI00610110 RefSeq:NP_001038675.1
UniGene:Dr.76635 Ensembl:ENSDART00000058100 GeneID:541386
KEGG:dre:541386 InParanoid:Q1LVE4 NextBio:20879198 Uniprot:Q1LVE4
Length = 879
Score = 414 (150.8 bits), Expect = 4.2e-46, Sum P(2) = 4.2e-46
Identities = 89/254 (35%), Positives = 133/254 (52%)
Query: 494 DRNSLEDMELETRALTEPLPTNQQAYKNHQLYVIERWLNKYQILYPK-GPILGFCSGHAV 552
+R ED E++ + L +PLPT+ YKNH LYV++R L KY+ LYP +LG+C G V
Sbjct: 552 ERGQKEDQEMQAKLLDKPLPTSVSEYKNHPLYVLKRHLLKYEALYPATAAVLGYCRGEPV 611
Query: 553 YPRSCVQTLKTKERWLREALQVKANEXXXXXXXXXXXXXXGQDFEPEDYDEVDARGNIEL 612
Y R CV TL +++ WL+EA V+ E E + D + L
Sbjct: 612 YSRDCVHTLHSRDTWLKEARTVRLGEEPYKMVLGFSNRSRKARMMSEQKNVKD----LAL 667
Query: 613 YGKWQLEPLRLPSAVNGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKRLEIDSAP 672
+G WQ E + P AV+G VPRNE G V ++ LP G VH+ LP ++ VA++L ID A
Sbjct: 668 FGTWQTEEYQPPIAVDGKVPRNEFGNVYMFKSCMLPIGCVHVHLPNLHRVARKLNIDCAL 727
Query: 673 AMVGFEFRNGRSTPVFDGIVVCAEFKDTILEAYXXXXXXXXXXXXXXXXXQATSRWYQLL 732
A+ GF++ G + V DG +VC E ++ + A+ +A + W L+
Sbjct: 728 AVTGFDYHCGFAHAVNDGYIVCEEHEEILKAAWENEQEIQQKKEQEKREKRAVTNWTLLV 787
Query: 733 SSIVTRQRLNNCYG 746
++ ++RL YG
Sbjct: 788 KGLLIKERLKRRYG 801
Score = 149 (57.5 bits), Expect = 4.2e-46, Sum P(2) = 4.2e-46
Identities = 74/282 (26%), Positives = 120/282 (42%)
Query: 26 DSDWED-----GSIPVACSKENHPESDIKGVTIEFDAADSVTK---KPVRRASAE----- 72
+ DWE+ G + S E ES K V IE + D + K K R+A E
Sbjct: 109 EDDWEEVEEMAGPLGPVDSSELALES--KPVEIEIETPDMIRKRQKKEKRKAEFETYLRR 166
Query: 73 -----DKELAELVHKVHLLCLLARGRLIDSVCDDPLIQAXXXXXXXXXXXKISEVSKLTA 127
+K+L HKVHLLCL+A G + + +P + A +S + ++
Sbjct: 167 MMNRFNKDLLVDTHKVHLLCLMASGLFRNRLLCEPDLLAVALSLLPSHFTTVS-LKRINN 225
Query: 128 NALSPIVSWFHDNFHVRSSVSTRRSFHSDLAHALESREG-----TPEEIAALSVALFRAL 182
L ++ WF F + ++ + DL LE R G EE+ L + + R+L
Sbjct: 226 GFLEGLLKWFQATFTLNPALPEEKEV--DLRTVLEKRMGCLSARNHEEMTYLFLLVLRSL 283
Query: 183 KLTTRFVSILDVASLKPE-ADKNVSS-NQDSSRVGGGIFNAPTLMVA----KPEEVLASP 236
+L R V L LKP A K+ ++ ++ SS ++P L V+ +P A+
Sbjct: 284 RLFCRLVLSLQPLPLKPPPATKSKTTPSKSSSEKAQSEKSSPELKVSPGSKRPSSATAAA 343
Query: 237 VKSFSCDKKENVCETSSKGSPECKYSS-PKSNNTQSKKSPVS 277
+ +K+ +T G E + PK++ +S S VS
Sbjct: 344 KEDRGGKRKK---KTGGGGDKEAAGAQKPKNSRRRSVASKVS 382
Score = 103 (41.3 bits), Expect = 2.8e-41, Sum P(2) = 2.8e-41
Identities = 60/256 (23%), Positives = 96/256 (37%)
Query: 252 SSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLDPSSSMACSDISEACHPKEKSQ---- 307
S K SPE K S P S S + + + + + A PK +
Sbjct: 320 SEKSSPELKVS-PGSKRPSSATAAAKEDRGGKRKKKTGGGGDKEAAGAQKPKNSRRRSVA 378
Query: 308 ---ALKRKGDLEFEMQLEMALSATNVATSKSN-----ICSDVKDLNSNSSTVLPVKRLKK 359
+ K G E E Q E +N S+ + IC K + SS V +R ++
Sbjct: 379 SKVSYKEVGSEEEEEQSEEEFQPSNEDDSEDSDGAVKICRKSKVKSRRSSKVKQEERSEE 438
Query: 360 IESGESSTSC-LGISTAVGSRKVGAPL-YWAEVYCSGENLTGKWVHVDAANAIIDGEQKV 417
E E + +K G W EVY +G+WV VD + G+ ++
Sbjct: 439 EEEEEEEEEEEKEVKKQRRKKKQGKGADEWLEVYLES---SGRWVCVDVDQGV--GQPQL 493
Query: 418 EAAAAACKTSLRYIVAFAGCG-AKDVTRRYCMKWYRIASKR-VNSAWWDAVLAPLR--EL 473
+ A + Y+V G KD+ RY W + +R V+S WW+ + + +
Sbjct: 494 CSDQATLP--ITYVVGLDDEGFMKDLGSRYDPTWLTSSRRRRVDSEWWEETMELYKSPDT 551
Query: 474 ESGATGDLNVESSAKD 489
E G D +++ D
Sbjct: 552 ERGQKEDQEMQAKLLD 567
Score = 59 (25.8 bits), Expect = 1.2e-36, Sum P(2) = 1.2e-36
Identities = 20/56 (35%), Positives = 30/56 (53%)
Query: 251 TSSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLDPSSSMACSDISEACHPKEKS 306
T SK +P K SS K+ QS+KS ++S G+ PSS+ A + K+K+
Sbjct: 304 TKSKTTPS-KSSSEKA---QSEKSSPELKVSPGSKRPSSATAAAKEDRGGKRKKKT 355
Score = 37 (18.1 bits), Expect = 2.7e-05, Sum P(2) = 2.7e-05
Identities = 13/49 (26%), Positives = 17/49 (34%)
Query: 486 SAKDSFVADRNSLEDMELETRALTEPLPTNQQAYKNHQLYVIERWLNKY 534
S + S V E+ E E E +Q K Q + WL Y
Sbjct: 424 SRRSSKVKQEERSEEEEEEEEEEEEEKEVKKQRRKKKQGKGADEWLEVY 472
>FB|FBgn0004698
symbol:mus210 "mutagen-sensitive 210" species:7227
"Drosophila melanogaster" [GO:0006289 "nucleotide-excision repair"
evidence=ISS] [GO:0003684 "damaged DNA binding" evidence=ISS]
[GO:0005634 "nucleus" evidence=IEA;NAS] InterPro:IPR004583
InterPro:IPR018325 InterPro:IPR018326 InterPro:IPR018327
InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403 Pfam:PF10404
Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
EMBL:AE013599 GO:GO:0005634 GO:GO:0003684 GO:GO:0006289 KO:K10838
eggNOG:COG5535 PANTHER:PTHR12135 GeneTree:ENSGT00390000005194
InterPro:IPR018026 TIGRFAMs:TIGR00605 EMBL:Z28622 EMBL:AF209743
EMBL:AY070566 PIR:S42402 RefSeq:NP_476861.1 RefSeq:NP_725451.1
UniGene:Dm.637 ProteinModelPortal:Q24595 SMR:Q24595 IntAct:Q24595
STRING:Q24595 PaxDb:Q24595 PRIDE:Q24595 EnsemblMetazoa:FBtr0087374
GeneID:36697 KEGG:dme:Dmel_CG8153 CTD:36697 FlyBase:FBgn0004698
InParanoid:Q24595 OMA:KYLQSFV OrthoDB:EOG4547F1 GenomeRNAi:36697
NextBio:799920 Bgee:Q24595 GermOnline:CG8153 Uniprot:Q24595
Length = 1293
Score = 405 (147.6 bits), Expect = 9.4e-40, Sum P(2) = 9.4e-40
Identities = 107/320 (33%), Positives = 149/320 (46%)
Query: 428 LRYIVAFAGCGA-KDVTRRYCMKWYRIASKRVNSAWWDAVLAPLRELESGATGDLNVESS 486
L Y+ AF + KDVT RYC W K W L E + G
Sbjct: 996 LAYVFAFQDDQSLKDVTARYCASWSTTVRKARVEKAW------LDETIAPYLG-----RR 1044
Query: 487 AKDSFVADRNSLEDMELETRALTEPLPTNQQAYKNHQLYVIERWLNKYQILYPK-GPILG 545
K R+ ED +L +PLP + +K+H LYV+ER L K+Q LYP P LG
Sbjct: 1045 TK------RDITEDDQLRRIHSDKPLPKSISEFKDHPLYVLERHLLKFQGLYPPDAPTLG 1098
Query: 546 FCSGHAVYPRSCVQTLKTKERWLREALQVKANEXXXXXXXXXXXXXXGQDFEPEDYDEVD 605
F G AVY R CV L ++E WL+ A VK E +D
Sbjct: 1099 FIRGEAVYSRDCVHLLHSREIWLKSARVVKLGEQPYKVVKARPKWDRLTRTVIKDQP--- 1155
Query: 606 ARGNIELYGKWQLEPLRLPSAVNGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKR 665
+E++G WQ + P+A NGIVPRN G V+++ + LP TVHLRLP + + K+
Sbjct: 1156 ----LEIFGYWQTQEYEPPTAENGIVPRNAYGNVELFKDCMLPKKTVHLRLPGLMRICKK 1211
Query: 666 LEIDSAPAMVGFEFRNGRSTPVFDGIVVCAEFKDTILEAYXXXXXXXXXXXXXXXXXQAT 725
L ID A A+VGF+F G P++DG +VC EF++ + A+ +
Sbjct: 1212 LNIDCANAVVGFDFHQGACHPMYDGFIVCEEFREVVTAAWEEDQQVQVLKEQEKYETRVY 1271
Query: 726 SRWYQLLSSIVTRQRLNNCY 745
W +L+ ++ R+RL Y
Sbjct: 1272 GNWKKLIKGLLIRERLKKKY 1291
Score = 105 (42.0 bits), Expect = 9.4e-40, Sum P(2) = 9.4e-40
Identities = 82/366 (22%), Positives = 141/366 (38%)
Query: 27 SDWEDGSIPVACSKENHPESDIKGV--TIEFDAADSVTKKPVRRASAEDKELAELVHKVH 84
SD +DG P S + ++G+ T E + RR + + K+ L+HKV
Sbjct: 329 SDQDDGETP-NISGDLEIRVGLEGLRPTKEQKTQHELEMALKRRLNRDIKDRQILLHKVS 387
Query: 85 LLCLLARG----RLIDSVCDDPLIQAXXXXXXXXXXXKISEVSKLTANALSPIVSWFHD- 139
L+C +AR RL+ D L+QA ++L L V+WF
Sbjct: 388 LMCQIARSLKYNRLLSE--SDSLMQATLKLLPSRNAYPTERGTEL--KYLQSFVTWFKTS 443
Query: 140 ------NFHVRSSVSTRRSFHSDLAHALESREGT-PEEIAALSVALFRALKLTTRFVSIL 192
N + S +T+ + L ++ +E +++ + +AL R + + R + L
Sbjct: 444 IKLLSPNLYSAQSPATKEAILEALLEQVKRKEARCKQDMIFIFIALARGMGMHCRLIVNL 503
Query: 193 DVASLKPEADKNVSSNQDSSRVGGGIFNAPTLMVAKPEEVLASPVKSFSCDKKENV-CET 251
L+P A S+ ++ N + ++ E P K DKK E
Sbjct: 504 QPMPLRPAA-----SDLIPIKLRPDDKNKSQTVESERESEDEKPKK----DKKAGKPAEK 554
Query: 252 SSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLDPSSSMACS---DISEACHPKEKSQA 308
S S K + K+N +++ P+S + G+ S ++S + EKS+
Sbjct: 555 ESSKSTISKEAEKKNNAKKAEAKPLSKSTTKGSETTKSGTVPKVKKELSLSSKLVEKSKH 614
Query: 309 LKR----KGDLEFEMQLEMALSATNVATSKSNICSDVKDLNS--NSSTVLPVKRLKKIES 362
K K D F+ + + S+ + S + K L +S VL K S
Sbjct: 615 QKAYTSSKSDTSFDEKPSTSSSSKCLKEEYSELGLSKKLLKPTLSSKLVLKSKNQSSFSS 674
Query: 363 GESSTS 368
+S TS
Sbjct: 675 NKSDTS 680
Score = 38 (18.4 bits), Expect = 1.0e-32, Sum P(2) = 1.0e-32
Identities = 13/50 (26%), Positives = 23/50 (46%)
Query: 239 SFSCDKKENVCETSSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLDPS 288
S S KE + SS + K +SP T+ + S + +++ N+ S
Sbjct: 689 SSSKSLKEETAKLSSSKLEDKKVASPAETKTKVQSSLLK-RVTTQNISES 737
Score = 37 (18.1 bits), Expect = 1.3e-32, Sum P(2) = 1.3e-32
Identities = 15/68 (22%), Positives = 28/68 (41%)
Query: 244 KKENVCETSSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLDPSSSMACSDISEACHPK 303
K +N SS S +P ++++ + +LSS L+ + ++ K
Sbjct: 665 KSKNQSSFSSNKSDTSFEENPSTSSSSKSLKEETAKLSSSKLEDKKVASPAETKT----K 720
Query: 304 EKSQALKR 311
+S LKR
Sbjct: 721 VQSSLLKR 728
>ASPGD|ASPL0000010029
symbol:AN3890 species:162425 "Emericella nidulans"
[GO:0003684 "damaged DNA binding" evidence=IEA] [GO:0005634
"nucleus" evidence=IEA] [GO:0005819 "spindle" evidence=IEA]
[GO:0006298 "mismatch repair" evidence=IEA] [GO:0006289
"nucleotide-excision repair" evidence=IEA] InterPro:IPR004583
InterPro:IPR018325 InterPro:IPR018326 InterPro:IPR018327
InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403 Pfam:PF10404
Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
GO:GO:0005634 GO:GO:0003684 EMBL:BN001302 GO:GO:0006289
EMBL:AACD01000062 KO:K10838 eggNOG:COG5535 PANTHER:PTHR12135
OMA:FKGRHGT OrthoDB:EOG4Z0FG0 RefSeq:XP_661494.1
ProteinModelPortal:Q5B6E0 STRING:Q5B6E0
EnsemblFungi:CADANIAT00004811 GeneID:2873313 KEGG:ani:AN3890.2
HOGENOM:HOG000182868 Uniprot:Q5B6E0
Length = 951
Score = 328 (120.5 bits), Expect = 4.6e-29, Sum P(2) = 4.6e-29
Identities = 115/424 (27%), Positives = 178/424 (41%)
Query: 337 ICSDVKDLNSNSSTVLPVKRLKKIESGESSTSCLGI--STAVGSRKVGA----PLYWAEV 390
I SD D ++ ST K G G+ +T + SR + P++W E
Sbjct: 314 ISSDDPDSLTDGSTKSEAKPAPIRRIGRPGFKPTGVQNTTVLSSRPTRSESSYPVFWVEA 373
Query: 391 YCSGENLTGKWVHVDA-ANAIIDGEQKVEAAAAACKTSLRYIVAFA-GCGAKDVTRRYCM 448
+ KWV +D + K+E A L Y+VAF A+DVTRRY
Sbjct: 374 F---NEAFQKWVVIDPMVTKTLAKPHKLEPPATDPYNLLSYVVAFEEDASARDVTRRYT- 429
Query: 449 KWYRIASKRVNSAWWDAVLAPLRELESGATGDL---NVESSAKDSFVADRNSLEDMELET 505
RV ++A LR +ES G+ V + F+ DR+ LE EL
Sbjct: 430 --------RV----FNAKTRKLR-VESTKNGEAWWKRVLEHFEKPFLEDRDELEIAELTA 476
Query: 506 RALTEPLPTNQQAYKNHQLYVIERWLNKYQILYPK---GPI-LGFCSGHA----VYPRSC 557
+ +EP+P N Q +K+H +Y +ER L + ++++PK G + LG G +Y RS
Sbjct: 477 KTASEPMPRNVQDFKDHPIYALERHLRRNEVIFPKRVTGHVSLGKSGGKGQTEPIYRRSD 536
Query: 558 VQTLKTKERWLREALQVKANEXXXXXXXXXXXXXXGQDFEPEDYDEVDARGNIELYGKWQ 617
V L++ +W R +K E G + E+ E A LY +Q
Sbjct: 537 VHILRSANKWYRLGRDIKVGEQPLKRIPVRNR---GMAVDDEEEGEETA-----LYAFFQ 588
Query: 618 LEPLRLPSAVNGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKRLEIDSAPAMVGF 677
E + P V G +P+N G +DV+ +P G +H+ A+ L ID A A+ GF
Sbjct: 589 TELYKPPPVVQGRIPKNAFGNLDVYVPSMVPAGGIHITHLDAARAARILGIDYADAVTGF 648
Query: 678 EFRNGRSTPVFDGIVVCAEFKDTILEAYXXXXXXXXXXXXXXXXXQATSRWYQLLSSIVT 737
F+ T + G+VV +E+K+ + E + W LL +
Sbjct: 649 SFKGRHGTAIIKGVVVASEYKEAVEEVLKALEEEKLQNEQEERAVEVLRAWKNLLMKLRI 708
Query: 738 RQRL 741
+R+
Sbjct: 709 AERV 712
Score = 80 (33.2 bits), Expect = 4.6e-29, Sum P(2) = 4.6e-29
Identities = 24/83 (28%), Positives = 41/83 (49%)
Query: 26 DSDWEDGSIPV-ACSKENHPESDIKGVTIEFDAADSVTKKPVRR--ASAEDKELAELVHK 82
D +WE+ I S +D + I + + ++ VRR +A +K+L VHK
Sbjct: 105 DMEWEEVDIQQPTISGPTSSVTDEAPLQITLEQDHNRKRRVVRRKPVTAAEKKLRLDVHK 164
Query: 83 VHLLCLLARGRLIDSVCDDPLIQ 105
+HLLCL+ + + C+D +Q
Sbjct: 165 MHLLCLMCHVQRRNLWCNDEEVQ 187
>WB|WBGene00022296
symbol:xpc-1 species:6239 "Caenorhabditis elegans"
[GO:0003684 "damaged DNA binding" evidence=IEA] [GO:0005634
"nucleus" evidence=IEA] [GO:0006289 "nucleotide-excision repair"
evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA]
InterPro:IPR004583 InterPro:IPR018325 InterPro:IPR018326
InterPro:IPR018327 InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403
Pfam:PF10404 Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
GO:GO:0005634 GO:GO:0003684 GO:GO:0006289 EMBL:FO081666 KO:K10838
eggNOG:COG5535 PANTHER:PTHR12135 GeneTree:ENSGT00390000005194
RefSeq:NP_500156.2 ProteinModelPortal:Q9N4C3 IntAct:Q9N4C3
MINT:MINT-228757 STRING:Q9N4C3 PaxDb:Q9N4C3
EnsemblMetazoa:Y76B12C.2 GeneID:177002 KEGG:cel:CELE_Y76B12C.2
UCSC:Y76B12C.2 CTD:177002 WormBase:Y76B12C.2 InParanoid:Q9N4C3
OMA:YLRQEIN NextBio:894928 Uniprot:Q9N4C3
Length = 1119
Score = 283 (104.7 bits), Expect = 4.6e-26, Sum P(3) = 4.6e-26
Identities = 72/214 (33%), Positives = 106/214 (49%)
Query: 493 ADRNSLEDMELETRALTEPLPTNQQAYKNHQLYVIERWLNKYQILYPKGPI---LGFCSG 549
++R E M++ + PLPT YKNH LY +E+ L K++ +YP LG G
Sbjct: 812 SERKKWEMMQMREDLVKRPLPTVMSEYKNHPLYALEKDLLKFEAIYPPPATQKPLGQIRG 871
Query: 550 HAVYPRSCVQTLKTKERWLREALQVKANEXXXXXXXXXXXXXXGQDFEPEDYDEVDARGN 609
H VYPRS V TL+ + WL+ A VK E P+ V+ R +
Sbjct: 872 HNVYPRSTVFTLQGENNWLKLARSVKIGEKPYKIVKA----------RPDPRIPVEDRED 921
Query: 610 --IELYGKWQLEPLRLPSAVNGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKRLE 667
+++YG WQ E R P NG +P NE G V +++E P +L+L + ++++L
Sbjct: 922 KFLDVYGYWQTEKYRRPPLKNGKIPHNEYGNVYMFNENMCPLDCTYLKLSGLVQISRKLG 981
Query: 668 IDSAPAMVGFEFRNGRSTPVFDGIVVCAEFKDTI 701
PA+VG+ F G + PV DG +V KD I
Sbjct: 982 KQCIPAVVGWAFDGGFTHPVIDGAIVLE--KDAI 1013
Score = 89 (36.4 bits), Expect = 4.6e-26, Sum P(3) = 4.6e-26
Identities = 26/103 (25%), Positives = 43/103 (41%)
Query: 74 KELAELVHKVHLLCLLARGRLIDSVC-DDPLIQAXXXXXXXXXXXKISEVSKLTANALSP 132
+E+ E HKVHLLC +A + + + D+ L+ + K + + +
Sbjct: 517 REMWENTHKVHLLCFMAHLKFVVKIALDESLVPSLMMSQLPNGYLKFIGEPVVPIDIMKN 576
Query: 133 IVSWFHDNFHVRSSVSTRRSFHSD-LAHALESREGTPEEIAAL 174
+V WF D F + V + S D L E+R + AL
Sbjct: 577 LVKWFADAFRPLNGVVSVASIEQDSLLEGHEARYPETRRLTAL 619
Score = 61 (26.5 bits), Expect = 6.5e-22, Sum P(2) = 6.5e-22
Identities = 31/141 (21%), Positives = 60/141 (42%)
Query: 228 KPEEVLASPVKSFSCDKKENVCETSSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLDP 287
K E ++ S KS + K + E PE + N +S KS + S+ N
Sbjct: 150 KSENLVQSVPKSTTNGSKVAIIEDD----PEIR----AENGVKSSKSDEKPDFSAQN--- 198
Query: 288 SSSMACSDISEACHPKEKSQALKRKGDLEFEMQLEMALSATNVATSKSNICSDVKDLNSN 347
S +A + + P+ K+ + + QLE++ S++ + +S + D ++
Sbjct: 199 GSKLAQNAPNRISRPRRSVTTAKKVSYVPSDDQLELSSSSSELESSSED--EDT-EIRPK 255
Query: 348 SSTVLPVKRLKKIESGESSTS 368
+ + + KR K + ES +S
Sbjct: 256 TGSKIAKKREKSFKISESESS 276
Score = 58 (25.5 bits), Expect = 4.6e-26, Sum P(3) = 4.6e-26
Identities = 12/40 (30%), Positives = 20/40 (50%)
Query: 234 ASPVKS-FSCDKKENVCETSSKGSPECKYSSPKSNNTQSK 272
ASP+ F+ D K+ +CE S + + +C + T K
Sbjct: 758 ASPISYVFAIDNKQGICEVSQRYAMDCVKQDFRRRRTNPK 797
Score = 50 (22.7 bits), Expect = 9.2e-21, Sum P(2) = 9.2e-21
Identities = 32/117 (27%), Positives = 52/117 (44%)
Query: 194 VASLKPEADKNVSSNQDSSRVGGGIFNAPTLMVAKPEEVLASPVK-SF--SCDKKE---N 247
V S K + + S+ Q+ S++ NAP +++P + + K S+ S D+ E +
Sbjct: 183 VKSSKSDEKPDFSA-QNGSKLAQ---NAPN-RISRPRRSVTTAKKVSYVPSDDQLELSSS 237
Query: 248 VCETSSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLDPSSSMACSDISEACHPKE 304
E S E PK+ + +KK S ++S SSS + D SEA E
Sbjct: 238 SSELESSSEDEDTEIRPKTGSKIAKKREKSFKISESE---SSSESPDDESEASEASE 291
>DICTYBASE|DDB_G0292296
symbol:xpc "DNA repair protein Rad4 family protein"
species:44689 "Dictyostelium discoideum" [GO:0006289
"nucleotide-excision repair" evidence=IEA] [GO:0005634 "nucleus"
evidence=IEA] [GO:0003684 "damaged DNA binding" evidence=IEA]
[GO:0003677 "DNA binding" evidence=IEA] [GO:0044351
"macropinocytosis" evidence=RCA] InterPro:IPR004583
InterPro:IPR018325 InterPro:IPR018326 InterPro:IPR018327
InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403 Pfam:PF10404
Pfam:PF10405 SMART:SM01031 SMART:SM01032 dictyBase:DDB_G0292296
GO:GO:0005634 GenomeReviews:CM000155_GR GO:GO:0003684
EMBL:AAFI02000189 GO:GO:0006289 KO:K10838 eggNOG:COG5535
PANTHER:PTHR12135 RefSeq:XP_001134493.1 ProteinModelPortal:Q1ZXA6
EnsemblProtists:DDB0232368 GeneID:8628599 KEGG:ddi:DDB_G0292296
InParanoid:Q1ZXA6 OMA:VELFYMV Uniprot:Q1ZXA6
Length = 967
Score = 304 (112.1 bits), Expect = 1.8e-23, Sum P(2) = 1.8e-23
Identities = 127/546 (23%), Positives = 233/546 (42%)
Query: 230 EEVLASPVKSFSCDKKENVCETSSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLDPSS 289
E +++ P+ S +++++ K + S K+ T SKK + LSS N ++
Sbjct: 459 ELIISKPITS----RQKSIQANQFKNTVLNSKISKKTETTMSKKRKTNSSLSSKNKKKNN 514
Query: 290 SMACSDISEACHPKEKSQALKRKGDLEFEMQLEMALSATNVATSKSNICSDVKDLNSNSS 349
S + +D + K + + + + + + S ++ SK K L +SS
Sbjct: 515 SDSENDTDNERDSGSDNDDAGDKNNNKSDQEKDNSSSDSDYKDSK-------KKLKRSSS 567
Query: 350 TVLPVKRLKKIESGESSTSCLGISTAVGSRKVGAPLYWAEVYCSGENLTGKWVHVDAANA 409
+ RL ++ ES T+ + + + + W EV+ ++ KW+ +D N
Sbjct: 568 EPIKRSRLSNLDDKESKTTTTTTTNTLSNNEKVEIESWIEVF---DHEKKKWISIDLINK 624
Query: 410 IIDGEQKVEAAAAACKTSLRYIVAFAGCGAKDVTRRYCMKWYRIASKRVNSA---WW--- 463
ID E Y+VA + KDVT RY + + KR+ A WW
Sbjct: 625 EIDKPLNFEKIL----DPFSYVVAISKYQIKDVTSRYTNNYIGSSLKRLPIAQIKWWLQL 680
Query: 464 --DAVLAPLRE-----------LESGATGDLNVESSAKDSFVADRNSLEDMEL-ETRALT 509
DA+ P L+S +N++ S + +R S+E++++ E + L
Sbjct: 681 VGDAINNPTEVENDNEPVSKFILDSKKIISVNIDLLNNLS-IDERKSIEEIDVYEKQELI 739
Query: 510 --E---PLPTNQQAYKNHQLYVIERWLNKYQILYPKGPILG-FCSGHAVYPRSCVQTLKT 563
E P P++ +K+H ++V+E+ + KY P LG F H +Y + ++ L T
Sbjct: 740 IKESKLPFPSSFAQFKSHPIFVLEKDIAKYCSPDPSSKPLGLFNETHKIYHKDQIKVLHT 799
Query: 564 KERWLREALQVKANEXXXXXXXXXXXXXXGQDFEPEDYDEVDARGNIE--LYGKWQLEPL 621
++W++ V GQ +P + ++ N L+G+WQ + L
Sbjct: 800 SDKWVQNGRMV----------------IEGQ--QPLKIVKGRSKSNPTSMLFGEWQTK-L 840
Query: 622 RLPSAV--NGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKRLEIDSAPAMVGFEF 679
P+ + +GIVP N G V +++ P VHLR + VAK+L I+ APA+ G+E
Sbjct: 841 FEPAVIGKDGIVPTNSFGNVYLFNSSMCPINGVHLRGKGLIRVAKKLGINFAPALTGWEN 900
Query: 680 RNGRSTPVFDGIVVCAEFKDTILEAYXXXXXXXXXXXXXXXXXQATSRWYQLLSSIVTRQ 739
S P+ DG+VV +F +L+ + + +RW + + ++ +
Sbjct: 901 GPKSSYPIIDGVVVAKKFSKKLLDTWLSESSSRAEAELQKKNDEIKARWKRFMKKLLIKN 960
Query: 740 RLNNCY 745
+ Y
Sbjct: 961 YIEKTY 966
Score = 52 (23.4 bits), Expect = 1.8e-23, Sum P(2) = 1.8e-23
Identities = 23/83 (27%), Positives = 41/83 (49%)
Query: 9 DEGRLQDNVLDGGEEMYDSDWEDGSIPVACSKENHPESDIKGVTIEFDAADSVTKKP-VR 67
+EG + +N LD EE+ ++ + G E+ E +I T EF + ++ KK V+
Sbjct: 46 EEGDI-NNSLDTDEEIGENQDDAGDA------EDAIEFEID--TNEFKSKENGKKKRIVK 96
Query: 68 RASAEDKELAELVHKVHLLCLLA 90
+ ++K +H+ L C LA
Sbjct: 97 KVDLKEKHNCLYLHRTVLTCYLA 119
Score = 37 (18.1 bits), Expect = 6.6e-22, Sum P(2) = 6.6e-22
Identities = 14/59 (23%), Positives = 23/59 (38%)
Query: 26 DSDWE----DGSIPVACSKEN--HPESDIKGVTIEFDAADSVTKKPVRRASAEDKELAE 78
D +WE D S + P D + + EFD D + + + D+E+ E
Sbjct: 5 DIEWEESNNDNSTTTTTTTTTTASPRFD-ESINNEFDDEDKEEEGDINNSLDTDEEIGE 62
>POMBASE|SPAC12B10.12c
symbol:rhp41 "DNA repair protein Rhp41" species:4896
"Schizosaccharomyces pombe" [GO:0000109 "nucleotide-excision repair
complex" evidence=ISO] [GO:0003684 "damaged DNA binding"
evidence=IEA] [GO:0005634 "nucleus" evidence=IDA] [GO:0005819
"spindle" evidence=IDA] [GO:0006289 "nucleotide-excision repair"
evidence=IGI] [GO:0006298 "mismatch repair" evidence=IGI]
InterPro:IPR004583 InterPro:IPR018325 InterPro:IPR018326
InterPro:IPR018327 InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403
Pfam:PF10404 Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
PomBase:SPAC12B10.12c EMBL:CU329670 GenomeReviews:CU329670_GR
GO:GO:0005819 GO:GO:0003684 GO:GO:0006298 GO:GO:0006289
GO:GO:0000109 KO:K10838 eggNOG:COG5535 PANTHER:PTHR12135
OrthoDB:EOG4Z0FG0 PIR:T37579 RefSeq:NP_594644.1
ProteinModelPortal:Q10445 STRING:Q10445
EnsemblFungi:SPAC12B10.12c.1 GeneID:2542967 KEGG:spo:SPAC12B10.12c
OMA:NEASSHE NextBio:20804002 InterPro:IPR018026 TIGRFAMs:TIGR00605
Uniprot:Q10445
Length = 638
Score = 286 (105.7 bits), Expect = 2.3e-23, Sum P(2) = 2.3e-23
Identities = 118/410 (28%), Positives = 175/410 (42%)
Query: 355 KRLKKIESGESSTSCLGISTAVGSR---KV---GAPLYWAEVYCSGENLTGKWVHVDA-A 407
KR K I+ S+ S L S V KV P++W E + KWV VD
Sbjct: 267 KRRKIIQPSFSNLSHLDASDIVTEDTKLKVIDSPKPVFWVEAF---NKAMQKWVCVDPFG 323
Query: 408 NAIIDGE-QKVEAAAAACKTSLRYIVAFAGCG-AKDVTRRYCMKWYRIASKRVN-----S 460
+A + G+ ++ E A++ + Y+ A G KDVTR+YC+ +Y+I RV
Sbjct: 324 DASVIGKYRRFEPASSDHLNQMTYVFAIEANGYVKDVTRKYCLHYYKILKNRVEIFPFGK 383
Query: 461 AWWDAVLAPLRELESGATGDLNVESSAKDSFVADRNSLEDMELETRALTEPLPTNQQAYK 520
AW + + + + G D F D +++ED EL +E +P N Q K
Sbjct: 384 AWMNRIFSKI-----GKPRD----------FYNDMDAIEDAELLRLEQSEGIPRNIQDLK 428
Query: 521 NHQLYVIERWLNKYQILYPKGPILGFCS---G-HAVYPRSCVQTLKTKERWLREALQVKA 576
+H L+V+ER L K Q + G G + G VYPR V + E W R+ +K
Sbjct: 429 DHPLFVLERHLKKNQAI-KTGKSCGRINTKNGVELVYPRKYVSNGFSAEHWYRKGRIIKP 487
Query: 577 NEXXXXXXXXXXXXXXGQDFEPEDYDEVDARGNIELYGKWQLEPLRLPSAVNGIVPRNER 636
G P YDE +A +LY +P+ V IVP+N
Sbjct: 488 G------AQPLKHVKNGDKVLPL-YDE-EAT---QLYTP---KPV-----VANIVPKNAY 528
Query: 637 GQVDVWSEKCLPPGTVHLRLPRVYSVAKRLEIDSAPAMVGFEFRNGRSTPVFDGIVVCAE 696
G +D++ LP G H R + AK LEID A A+VGF+F+ S P +G+VV
Sbjct: 529 GNIDLYVPSMLPYGAYHCRKRCALAAAKFLEIDYAKAVVGFDFQRKYSKPKLEGVVVSKR 588
Query: 697 FKDTILEAYXXXXXXXXXXXXXXXXXQATSRWYQLLSSIVTRQRLNNCYG 746
+++ I W +L++ + RQR+ YG
Sbjct: 589 YEEAIDLIAEEIDQEEKEAEARNVRKTCLLLWKRLITGLRIRQRVFEEYG 638
Score = 63 (27.2 bits), Expect = 2.3e-23, Sum P(2) = 2.3e-23
Identities = 16/61 (26%), Positives = 30/61 (49%)
Query: 41 ENHPESDIKGVTIEFDAADSVTKKPVRRASAEDKELAELVHKVHLLCLLARGRLIDSVCD 100
+ P D V D +V K+ + ++ D+++ +H++HLLCL ++ CD
Sbjct: 49 QERPTHDFGDVEATVDR--TVEKRSRLKITSVDRKIRLQIHQLHLLCLTYHLCTRNTWCD 106
Query: 101 D 101
D
Sbjct: 107 D 107
>ASPGD|ASPL0000008254 [details] [associations]
symbol:AN6186 species:162425 "Emericella nidulans"
[GO:0003684 "damaged DNA binding" evidence=IEA] [GO:0006298
"mismatch repair" evidence=IEA] [GO:0006289 "nucleotide-excision
repair" evidence=IEA] [GO:0005730 "nucleolus" evidence=IEA]
InterPro:IPR004583 InterPro:IPR018325 InterPro:IPR018326
InterPro:IPR018327 InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403
Pfam:PF10404 Pfam:PF10405 SMART:SM01031 SMART:SM01032 GO:GO:0005634
GO:GO:0003684 EMBL:BN001301 GO:GO:0006289 EMBL:AACD01000105
eggNOG:COG5535 PANTHER:PTHR12135 OrthoDB:EOG4DJP4K
RefSeq:XP_663790.1 EnsemblFungi:CADANIAT00006823 GeneID:2871078
KEGG:ani:AN6186.2 HOGENOM:HOG000164138 OMA:IPKNEYG Uniprot:Q5AZU4
Length = 941
Score = 198 (74.8 bits), Expect = 1.2e-19, Sum P(4) = 1.2e-19
Identities = 51/194 (26%), Positives = 84/194 (43%)
Query: 552 VYPRSCVQTLKTKERWLREALQVKANEXXXXXXXXXXXXXXGQDFEPEDYDEVDARGNIE 611
VY RS V +T E W +E + + + E+ +
Sbjct: 582 VYRRSDVVKCQTAESWHKEGREPLPSAKPLKHVPIRAVTLLRKREVDEEARRTGQKPLQG 641
Query: 612 LYGKWQLEPLRLPSAVNGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKRLEIDSA 671
LY Q + + P V+GI+P+NE G +D + + +P G VH+ + K+L ID A
Sbjct: 642 LYSFEQTQEIIPPPIVDGIIPKNEYGNIDCFVPRMVPKGAVHIPFSGTARICKKLGIDYA 701
Query: 672 PAMVGFEFRNGRSTPVFDGIVVCAEFKDTILEAYXXXXXXXXXXXXXXXXXQATSRWYQL 731
A+ GFEF + + PV +G+VV AE KD +++A+ + + W +
Sbjct: 702 EAVTGFEFGSQMAVPVIEGVVVAAENKDLVVDAWRADNEEKRRKEARKAEAKILATWRKF 761
Query: 732 LSSIVTRQRLNNCY 745
L + QR+ Y
Sbjct: 762 LFGLRIAQRVQEEY 775
Score = 95 (38.5 bits), Expect = 1.2e-19, Sum P(4) = 1.2e-19
Identities = 54/192 (28%), Positives = 82/192 (42%)
Query: 384 PLYWAEVYCSGENLTGKWVHVDA---ANAIIDGEQKVEAA-------AAACKTSLRYIVA 433
P+YW EV +T + + VD +NA+ Q+++AA A K + Y++A
Sbjct: 384 PIYWTEVVSP---ITHQVISVDPLVLSNAVA-ATQELQAAFEPRGAKAEKAKQVICYVIA 439
Query: 434 F-AGCGAKDVTRRYCMK--W------YRIASKRVNSAWWDAVLAPLRELESGATGDLNVE 484
F A AKDVT RY + W +R+ K + D LR N E
Sbjct: 440 FSADKTAKDVTTRYLRRRTWPGKTKGFRLGKKGPDDDLLDWFRVLLR----------NYE 489
Query: 485 SSAKDSFVADRNSLEDM-ELETRALTEPLPTNQ-----QAYKNHQLYVIERWLNKYQILY 538
KD D +ED +L T+ PTN+ Q+ + +V+ER+L + + L
Sbjct: 490 RPYKDRTAVD--DIEDAKDLVPNRPTKSKPTNETVDTLQSLRTSSEFVLERFLRREEALR 547
Query: 539 PKG-PILGFCSG 549
P P+ F G
Sbjct: 548 PGALPVRTFTPG 559
Score = 67 (28.6 bits), Expect = 1.2e-19, Sum P(4) = 1.2e-19
Identities = 22/97 (22%), Positives = 50/97 (51%)
Query: 9 DEGRLQDNVLDGGEEMYDSDWEDGSIPVACSKENH--PESDIKGVTIEFDAADSVTKKPV 66
D+ + D+ + EE+ DWED +I A + P +++ +T++ +
Sbjct: 58 DKKVVSDSDVTDSEEV---DWED-AIHTAAPATSFVSPHENLE-LTLDRNEVHLEDILQG 112
Query: 67 RRASAE-DKELAELVHKVHLLCLLARGRLIDSVCDDP 102
++A + ++++ L+H++H+ CLLA + + +DP
Sbjct: 113 QKAPTKIERQIRILIHRLHVQCLLAHNAIRNDWINDP 149
Score = 52 (23.4 bits), Expect = 1.2e-19, Sum P(4) = 1.2e-19
Identities = 16/59 (27%), Positives = 26/59 (44%)
Query: 134 VSWFHDNFHVRSSVSTRRSFHSDLAHALESREGTPEEIAALSVALFRALKLTTRFVSIL 192
++ FH + H + + A E EG+ + A L AL RA+ + R V+ L
Sbjct: 263 IASFHKDKHDPELYGEKIPSVEEFRQAAERMEGSRDLGAQLFTALLRAIAIEARLVASL 321
Score = 49 (22.3 bits), Expect = 5.3e-15, Sum P(4) = 5.3e-15
Identities = 15/53 (28%), Positives = 24/53 (45%)
Query: 230 EEVL---ASPVKSFSCDKKENVCETSSKGSPECKYSSPKSNNTQSKKSPVSCE 279
EE L A PV++F+ K+ + +P SPK+ N + V C+
Sbjct: 543 EEALRPGALPVRTFTPGGKKKNANGNGASTPT---ESPKAENVYRRSDVVKCQ 592
>POMBASE|SPCC4G3.10c [details] [associations]
symbol:rhp42 "DNA repair protein Rhp42" species:4896
"Schizosaccharomyces pombe" [GO:0000109 "nucleotide-excision repair
complex" evidence=ISO] [GO:0003684 "damaged DNA binding"
evidence=ISO] [GO:0005730 "nucleolus" evidence=IDA] [GO:0006289
"nucleotide-excision repair" evidence=IGI] [GO:0006298 "mismatch
repair" evidence=IGI] InterPro:IPR004583 InterPro:IPR018325
InterPro:IPR018326 InterPro:IPR018327 InterPro:IPR018328
Pfam:PF03835 Pfam:PF10403 Pfam:PF10404 Pfam:PF10405 SMART:SM01030
SMART:SM01031 SMART:SM01032 PomBase:SPCC4G3.10c GO:GO:0005730
EMBL:CU329672 GenomeReviews:CU329672_GR GO:GO:0003684 GO:GO:0006298
GO:GO:0006289 GO:GO:0000109 KO:K10838 eggNOG:COG5535
PANTHER:PTHR12135 InterPro:IPR018026 TIGRFAMs:TIGR00605 PIR:T41366
RefSeq:NP_587828.1 ProteinModelPortal:P87235 STRING:P87235
EnsemblFungi:SPCC4G3.10c.1 GeneID:2539465 KEGG:spo:SPCC4G3.10c
OMA:YPESETE OrthoDB:EOG4DJP4K NextBio:20800627 Uniprot:P87235
Length = 686
Score = 251 (93.4 bits), Expect = 1.9e-19, Sum P(2) = 1.9e-19
Identities = 101/380 (26%), Positives = 157/380 (41%)
Query: 384 PLYWAEVYCSGENLTGKWVHVDAA--NAIIDGEQK-VEAAAAACKTS-LRY--IVAFAG- 436
P++W E+Y E KW+ VDA N + + E A ++ LR + A+
Sbjct: 323 PIFWTEIYDQSEK---KWIAVDAVVLNGVYTNDMTWFEPKGAYAESKHLRMGIVAAYDND 379
Query: 437 CGAKDVTRRYCMKWYRIASKRVNSAWWDAVLAPLRELESGATGDLNVESSAKDSFVADRN 496
AKDVT RY Y+ S R+ + + G L + KD+ +
Sbjct: 380 LYAKDVTLRYTD--YQ--SSRLKKIRHVSFADKYFDFYKAIFGQLAKRN--KDA----ED 429
Query: 497 SLEDMELETRALTEPLPTNQQAYKNHQLYVIERWLNKYQILYPKG-PI--LGFCSG---- 549
E+ ELE++ P + +KNH +V+ R L + + L P P+ F +G
Sbjct: 430 IYEEKELESKVPIRE-PKSFADFKNHPEFVLIRHLRREEALLPNAKPVKTATFGNGKKAT 488
Query: 550 -HAVYPRSCVQTLKTKERWLREALQVKANEXXXXXXXXXXXXXXGQ---DFEPEDYDEVD 605
VY R V KT E + +E +K E + +F + +E
Sbjct: 489 SEEVYLRKDVVICKTPENYHKEGRVIKEGEQPRKMVKARAVTISRKREHEFRVAETNEPV 548
Query: 606 ARGNIELYGKWQLEPLRLPSAVNGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKR 665
+G LY Q E P +GI+P+N G +D + E +P G HL + +AK+
Sbjct: 549 LQG---LYSSDQTELYVPPPIKDGIIPKNGYGNMDCFVESMIPKGAAHLPYRGIAKIAKK 605
Query: 666 LEIDSAPAMVGFEFRNGRSTPVFDGIVVCAEFKDTILEAYXXXXXXXXXXXXXXXXXQAT 725
L ID A A+ GFEFR R+ PV GI+V E + E +
Sbjct: 606 LNIDYADAVTGFEFRKHRAIPVTTGIIVPEESAQMVYEEFLECEKIRIEKQQMKERKIIY 665
Query: 726 SRWYQLLSSIVTRQRLNNCY 745
+W LL+++ R+R+ Y
Sbjct: 666 GQWKHLLNALRIRKRIEEQY 685
Score = 64 (27.6 bits), Expect = 1.9e-19, Sum P(2) = 1.9e-19
Identities = 25/90 (27%), Positives = 43/90 (47%)
Query: 9 DEGRLQDNVLDGGEE--MYDSD---WEDGSIPVACSKENHPESDIKGVTIEFDAADSVTK 63
++G +DN G E +D D WE + ++ +K+ + D+ VT +TK
Sbjct: 81 EKGSDEDNEKLGSSEDDEFDDDFDTWEQ--VDLSPNKQED-KKDLHIVTQHI--TPQLTK 135
Query: 64 KPVR-RASAEDKELAELVHKVHLLCLLARG 92
+ + +SA DK + +H +H CLL G
Sbjct: 136 ESKKGSSSAMDKSIRLSIHIMHFTCLLYHG 165
>CGD|CAL0004788 [details] [associations]
symbol:orf19.6722 species:5476 "Candida albicans" [GO:0000111
"nucleotide-excision repair factor 2 complex" evidence=IEA]
[GO:0005829 "cytosol" evidence=IEA] [GO:0005819 "spindle"
evidence=IEA] [GO:0003684 "damaged DNA binding" evidence=IEA]
[GO:0043161 "proteasomal ubiquitin-dependent protein catabolic
process" evidence=IEA] [GO:0006298 "mismatch repair" evidence=IEA]
[GO:0006289 "nucleotide-excision repair" evidence=IEA]
InterPro:IPR004583 InterPro:IPR018325 InterPro:IPR018326
InterPro:IPR018327 InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403
Pfam:PF10404 Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
CGD:CAL0004788 GO:GO:0005634 GO:GO:0003684 GO:GO:0006289
EMBL:AACQ01000029 EMBL:AACQ01000028 KO:K10838 eggNOG:COG5535
PANTHER:PTHR12135 RefSeq:XP_719704.1 RefSeq:XP_719821.1
ProteinModelPortal:Q5ADX0 STRING:Q5ADX0 GeneID:3638462
GeneID:3638600 KEGG:cal:CaO19.14014 KEGG:cal:CaO19.6722
Uniprot:Q5ADX0
Length = 709
Score = 240 (89.5 bits), Expect = 1.3e-18, Sum P(2) = 1.3e-18
Identities = 103/388 (26%), Positives = 158/388 (40%)
Query: 384 PLYWAEVYCSGENLTGKWVHVDA-ANAIID--GEQK---VEAAAAACKTSLRYIVAFAGC 437
P++W EV+ T +WV +D +I+ ++K E + L Y+VAF
Sbjct: 281 PVFWVEVW---NKYTRQWVSIDPIVMKLIEVCPKRKKSPFEPPPTDERNQLTYVVAFDKF 337
Query: 438 G-AKDVTRRYCMKWYRIASKRVNSAWWDAVLAPLRELESGATGDLNVESSAKDSFVADRN 496
G +DVTRRY Y +K + + E +S L K VAD
Sbjct: 338 GRVRDVTRRYS---YNYNAKTIRKR----IEFRSSEDKSWYLKVLRCCDFKKTQNVAD-- 388
Query: 497 SLEDMELETRALTEPLPTNQQAYKNHQLYVIERWLNKYQILYPKGPI--LG-FCSGHA-- 551
E E R L E +P N QA+KNH LY +E L + +I++PK G F S ++
Sbjct: 389 IYEQKEFYDRDLAEGMPNNIQAFKNHPLYALESQLRQDEIIFPKDDTSKCGTFRSKNSSK 448
Query: 552 ---VYPRSCVQTLKTKERWLREALQVKANEXXXXXXXXXXXXXXGQDFEPEDYDEVDARG 608
VY RSCV L++ + W Q+K P E D R
Sbjct: 449 VFQVYKRSCVHRLRSAKAWYMRGRQLKVGAI------------------PLKSKEEDVR- 489
Query: 609 NIELYGKWQLEPLRLPSAVNGIVPRNERGQVDVWSEKCLPPGTVHLR------LPRVYSV 662
LY ++Q + P +GIVP+N+ G +DV+++ LP ++ + + + +
Sbjct: 490 ---LYAEFQTQLYIPPPVTDGIVPKNQYGNIDVYTKTMLPENSILIECDENCSMKMLQNA 546
Query: 663 AKRLEIDSAPAMVGFEFRNGRS----TPVFDGIVVCAEFKDTILEAYXXXXXXXXXXXXX 718
A L ID A A+V F F+ + T GIV+ E+++ +
Sbjct: 547 ANLLAIDYAKAIVSFSFKGKKKKHNITAREGGIVIAKEYEEAMQLTIDNLIEQEEEDQRA 606
Query: 719 XXXXQATSRWYQLLSSIVTRQRLNNCYG 746
A W L + RLN +G
Sbjct: 607 LSEANALRNWKYFLLKLRLEDRLNKSHG 634
Score = 68 (29.0 bits), Expect = 1.3e-18, Sum P(2) = 1.3e-18
Identities = 23/86 (26%), Positives = 41/86 (47%)
Query: 16 NVLDGGEEMYDSDWEDGSIPVACSKENHPESDIKGVTIEFDAADSVTKKPVRRASAEDKE 75
N+LD +E D E+ IP KE+ ++ + I D K P S E++
Sbjct: 54 NILDDSDEFETIDLEN--IP----KESGNDT----LVIRIDNNKKEEKTPKNLISREERH 103
Query: 76 LAELVHKVHLLCLLARGRLIDSVCDD 101
L+HK++L+ +L G + + C++
Sbjct: 104 RRVLLHKMYLVMMLVHGSIRNLWCNN 129
>SGD|S000000964 [details] [associations]
symbol:RAD4 "Protein that recognizes and binds damaged DNA
during NER" species:4932 "Saccharomyces cerevisiae" [GO:0000111
"nucleotide-excision repair factor 2 complex" evidence=IDA]
[GO:0003684 "damaged DNA binding" evidence=IEA;IDA] [GO:0005634
"nucleus" evidence=IEA;IDA] [GO:0005737 "cytoplasm" evidence=IEA]
[GO:0006281 "DNA repair" evidence=IEA] [GO:0006974 "response to DNA
damage stimulus" evidence=IEA] [GO:0003677 "DNA binding"
evidence=IEA] [GO:0043161 "proteasomal ubiquitin-dependent protein
catabolic process" evidence=IMP] [GO:0005829 "cytosol"
evidence=IDA] [GO:0006289 "nucleotide-excision repair"
evidence=IEA;IMP] InterPro:IPR004583 InterPro:IPR018325
InterPro:IPR018326 InterPro:IPR018327 InterPro:IPR018328
Pfam:PF03835 Pfam:PF10403 Pfam:PF10404 Pfam:PF10405 SMART:SM01030
SMART:SM01031 SMART:SM01032 SGD:S000000964 GO:GO:0005829
GO:GO:0043161 GO:GO:0003684 EMBL:BK006939 KO:K01530
RefSeq:NP_011093.3 GeneID:856913 KEGG:sce:YER166W GO:GO:0006289
EMBL:U18917 RefSeq:NP_011089.4 GeneID:856909 KEGG:sce:YER162C
KO:K10838 PDB:2QSF PDB:2QSG PDB:2QSH PDBsum:2QSF PDBsum:2QSG
PDBsum:2QSH GO:GO:0000111 eggNOG:COG5535 PANTHER:PTHR12135
EMBL:M26050 EMBL:M24928 PIR:S30814 ProteinModelPortal:P14736
SMR:P14736 DIP:DIP-1547N IntAct:P14736 MINT:MINT-396392
STRING:P14736 PaxDb:P14736 PeptideAtlas:P14736 EnsemblFungi:YER162C
GeneTree:ENSGT00390000005194 HOGENOM:HOG000074544 OMA:FKGRHGT
OrthoDB:EOG4Z0FG0 EvolutionaryTrace:P14736 NextBio:983347
Genevestigator:P14736 GermOnline:YER162C Uniprot:P14736
Length = 754
Score = 237 (88.5 bits), Expect = 8.9e-17, Sum P(3) = 8.9e-17
Identities = 94/380 (24%), Positives = 159/380 (41%)
Query: 384 PLYWAEVYCSGENLTGKWVHVDAANA-IIDG---EQKVEAAAAAC--KTSLRYIVAF-AG 436
P++W EV+ + + KW+ VD N I+ K+ AC + LRY++A+
Sbjct: 313 PIFWCEVW---DKFSKKWITVDPVNLKTIEQVRLHSKLAPKGVACCERNMLRYVIAYDRK 369
Query: 437 CGAKDVTRRYCMKWY--RIASKRVNSAWWDAVLAPLRELESGATGDLNVESSAKDSFVAD 494
G +DVTRRY +W ++ +R+ D R++ + L+ K + D
Sbjct: 370 YGCRDVTRRYA-QWMNSKVRKRRITKD--DFGEKWFRKVITA----LHHRKRTK---IDD 419
Query: 495 RNSLEDMELETRALTEPLPTNQQAYKNHQLYVIERWLNKYQILYPKGPILGFCSGHA--- 551
ED R +E +P + Q KNH YV+E+ + + QI+ P G+ H
Sbjct: 420 ---YEDQYFFQRDESEGIPDSVQDLKNHPYYVLEQDIKQTQIVKPGCKECGYLKVHGKVG 476
Query: 552 ----VYPRSCVQTLKTKERWLREALQVKANEXXXXXXXXXXXXXXGQDFEPEDYDEVDAR 607
VY + + LK+ +W +K G+ E ED + + +
Sbjct: 477 KVLKVYAKRDIADLKSARQWYMNGRILKTGSRCKKVIKRTVGRPKGEA-EEED-ERLYSF 534
Query: 608 GNIELYGKWQLEPLRLPSAVNGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKRLE 667
+ ELY + PL ++ +G + +N G ++V++ +P + P A+ L
Sbjct: 535 EDTELY----IPPL---ASASGEITKNTFGNIEVFAPTMIPGNCCLVENPVAIKAARFLG 587
Query: 668 IDSAPAMVGFEFRNGRST-PVFDGIVVCAEFKDTILEAYXXXXXXXXXXXXXXXXXQATS 726
++ APA+ F+F G + PV GIVV ++ I A A
Sbjct: 588 VEFAPAVTSFKFERGSTVKPVLSGIVVAKWLREAIETAIDGIEFIQEDDNRKEHLLGALE 647
Query: 727 RWYQLLSSIVTRQRLNNCYG 746
W LL + R +LN+ YG
Sbjct: 648 SWNTLLLKLRIRSKLNSTYG 667
Score = 52 (23.4 bits), Expect = 8.9e-17, Sum P(3) = 8.9e-17
Identities = 17/80 (21%), Positives = 41/80 (51%)
Query: 18 LDGGEEMYDSD-WEDGSIPVACSKENHPESDIKGVTIEFDAA---DSVTKKPVRRA-SAE 72
+ EE YDS+ +ED + + + + ++ +++E + +S ++ R S E
Sbjct: 83 IQSSEEDYDSEEFEDVT-------DGNEVAGVEDISVEIKPSSKRNSDARRTSRNVCSNE 135
Query: 73 DKELAELVHKVHLLCLLARG 92
+++ + H ++L+CL+ G
Sbjct: 136 ERKRRKYFHMLYLVCLMVHG 155
Score = 46 (21.3 bits), Expect = 8.9e-17, Sum P(3) = 8.9e-17
Identities = 13/51 (25%), Positives = 21/51 (41%)
Query: 143 VRSSVSTRRSF----HSDLAHALESREGTPEEIAALSVALFRALKLTTRFV 189
+ S + +R F SD A+ G P+ VA+ RA + R +
Sbjct: 233 IEMSANNKRKFKTLKRSDFLRAVSKGHGDPDISVQGFVAMLRACNVNARLI 283
>UNIPROTKB|G4MUV6 [details] [associations]
symbol:MGG_01699 "Uncharacterized protein" species:242507
"Magnaporthe oryzae 70-15" [GO:0043581 "mycelium development"
evidence=IEP] InterPro:IPR004583 InterPro:IPR018325
InterPro:IPR018326 InterPro:IPR018327 InterPro:IPR018328
Pfam:PF03835 Pfam:PF10403 Pfam:PF10404 Pfam:PF10405 SMART:SM01031
SMART:SM01032 GO:GO:0005634 GO:GO:0003684 GO:GO:0043581
EMBL:CM001232 GO:GO:0006289 PANTHER:PTHR12135 RefSeq:XP_003714693.1
ProteinModelPortal:G4MUV6 EnsemblFungi:MGG_01699T0 GeneID:2679173
KEGG:mgr:MGG_01699 Uniprot:G4MUV6
Length = 1045
Score = 200 (75.5 bits), Expect = 2.8e-14, Sum P(3) = 2.8e-14
Identities = 67/266 (25%), Positives = 110/266 (41%)
Query: 494 DRNSLEDMELETRALTEPLPTNQQAYKNHQLYVIERWLNKYQ-ILYPKGPILGF---CSG 549
D L + E + + E T Q YK + YV+ER L + + +L P+ F G
Sbjct: 599 DSTDLRPAKHEKKEVKEGDETLQY-YKQSKEYVLERHLKREEALLQDATPVKVFKVKAKG 657
Query: 550 -----HAVYPRSCVQTLKTKERWLREALQVKANEXXXXXXXXXXXXXXGQDFEPEDYDEV 604
VY R V +K+ E W ++ K E + D
Sbjct: 658 GEFTEENVYLRRDVVQVKSAETWHKQGRAPKEGEKPLKMVPYRAATMNRK----RDIAAA 713
Query: 605 DAR-GNIELYGKWQLEPLR--LPSAV-NGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVY 660
+A G L G + ++ +P + +GI+P+NE G +D+++E P G VH+
Sbjct: 714 EAATGKKVLQGLYSMDQTDWIIPPPIKDGIIPKNEYGNIDLFAEHMCPQGAVHVPFRGAV 773
Query: 661 SVAKRLEIDSAPAMVGFEFRNGRSTPVFDGIVVCAEFKDTILEAYXXXXXXXXXXXXXXX 720
V +RL +D A A++ FEF + + PV G+V+ E D ++E
Sbjct: 774 KVCRRLGVDYAEAVIDFEFGHRMAVPVIQGVVIAEEHHDRVMEELAKDEAERARKEDAKR 833
Query: 721 XXQATSRWYQLLSSIVTRQRLNNCYG 746
A + W ++L ++ RL YG
Sbjct: 834 TAAALAMWRKMLMAMRITNRLREEYG 859
Score = 72 (30.4 bits), Expect = 2.8e-14, Sum P(3) = 2.8e-14
Identities = 25/99 (25%), Positives = 49/99 (49%)
Query: 6 RELDEGRLQDNVLDGGEEMYDSDWEDGSIPVA-CSKENHPESDIKGVTIEFDAADSVTKK 64
R LD D+ D ++ D ++ED +A ++E P D++ +T++ D S+T +
Sbjct: 92 RSLDMADEDDDGSDDDDD--DIEFEDVQASLAPFAEEAAPSGDLE-LTLDLDGRISLTNE 148
Query: 65 PVRRASAEDKE--LAELVHKVHLLCLLARGRLIDS-VCD 100
+ +E VH+VH++ L+ + +S +CD
Sbjct: 149 YGNKKGPSKRERITRNAVHRVHVMFLMWHNAVRNSWLCD 187
Score = 47 (21.6 bits), Expect = 2.8e-14, Sum P(3) = 2.8e-14
Identities = 20/91 (21%), Positives = 34/91 (37%)
Query: 227 AKPEEVLASPVKSFSCDKKENVCETSSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLD 286
A PEE +S + + ++K P ++ S +S QSK + +
Sbjct: 378 ADPEEERSSQPSPEKPTQTTQTPQKNTKNEPRRQHVSSRSRGKQSKAIEEEDSNYVDDFE 437
Query: 287 PSSSMACSDISEACHPKEKSQALKRKGDLEF 317
P + ++ K Q+ K DLEF
Sbjct: 438 PQEVNSDDEMVVEVPKKMAPQSKKFDQDLEF 468
Score = 47 (21.6 bits), Expect = 9.8e-11, Sum P(3) = 9.8e-11
Identities = 14/51 (27%), Positives = 22/51 (43%)
Query: 279 ELSSGNLDPSSSMACSDISEACHPKEKSQALKRKGDLEFEMQLEMALSATN 329
E G+ D + D+ + P + A GDLE + L+ +S TN
Sbjct: 99 EDDDGSDDDDDDIEFEDVQASLAPFAEEAA--PSGDLELTLDLDGRISLTN 147
Score = 37 (18.1 bits), Expect = 2.9e-13, Sum P(3) = 2.9e-13
Identities = 8/15 (53%), Positives = 10/15 (66%)
Query: 374 TAVGSRKVGAPLYWA 388
+ GSR VGA L+ A
Sbjct: 336 SCTGSRDVGAQLFTA 350
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.314 0.129 0.380 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 833 778 0.00094 121 3 11 23 0.45 34
37 0.45 37
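
Note on bit scores: the bit values printed next to each raw score in this report appear consistent with the gapped parameters listed above (Q=9,R=2: Lambda 0.244, K 0.0300), using the usual Karlin-Altschul conversion bits = (Lambda * S - ln K) / ln 2. The sketch below is an illustration only. The function name and its defaults are assumptions made for this example, not part of the BLAST output or of any BLAST library.

```python
import math

# Gapped Karlin-Altschul parameters copied from the "Q=9,R=2" row above.
# Using them as defaults is an assumption that happens to reproduce the
# bit scores printed in this report.
GAPPED_LAMBDA = 0.244
GAPPED_K = 0.0300


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score S to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    # Raw scores taken from HSP header lines in this report.
    for raw in (286, 63, 198):
        print(raw, round(bit_score(raw), 1))
```

Applied to the raw scores 286, 63 and 198, the conversion gives values that round to the 105.7, 27.2 and 74.8 bits shown in the corresponding HSP headers.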
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 21
No. of states in DFA: 632 (67 KB)
Total size of DFA: 422 KB (2202 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 73.11u 0.13s 73.24t Elapsed: 00:00:04
Total cpu time: 73.12u 0.13s 73.25t Elapsed: 00:00:04
Start: Mon May 20 15:47:01 2013 End: Mon May 20 15:47:05 2013
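
Note on reading the HSP headers: each alignment block above opens with a "Score = ..." line followed by an "Identities = ..." line. The following is a minimal sketch of pulling the numbers out of that pair of lines with regular expressions. The names SCORE_RE, IDENT_RE and parse_hsp are hypothetical, and the patterns cover only the "Sum P(N)" form that occurs in this section; other header variants are not handled.

```python
import re

# Patterns written against the header lines as they appear in this report.
SCORE_RE = re.compile(
    r"Score = (\d+) \(([\d.]+) bits\), Expect = ([\d.eE+-]+), "
    r"Sum P\((\d+)\) = ([\d.eE+-]+)"
)
IDENT_RE = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\), "
    r"Positives = (\d+)/(\d+) \((\d+)%\)"
)


def parse_hsp(score_line, ident_line):
    """Return a dict of HSP statistics, or None if either line does not match."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(ident_line)
    if s is None or i is None:
        return None
    identical = int(i.group(1))
    align_len = int(i.group(2))
    return {
        "score": int(s.group(1)),
        "bits": float(s.group(2)),
        "expect": float(s.group(3)),
        "n_hsps": int(s.group(4)),
        "sum_p": float(s.group(5)),
        "identical": identical,
        "positives": int(i.group(4)),
        "align_len": align_len,
        # Unrounded percentage, recomputed from the counts.
        "pct_identity": 100.0 * identical / align_len,
    }


if __name__ == "__main__":
    hsp = parse_hsp(
        "Score = 286 (105.7 bits), Expect = 2.3e-23, Sum P(2) = 2.3e-23",
        "Identities = 118/410 (28%), Positives = 175/410 (42%)",
    )
    print(hsp)
```

The percentages printed in the report look truncated rather than rounded (118/410 is about 28.8% and is shown as 28%), so the sketch recomputes the percentage from the counts instead of relying on the printed value.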