BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>002340
MRTRQDSKTQKDQASGKESTVRGALRDSESSHNETGTLAETSREGVGKFLRHVNARSSSR
SKKQDCAVGLTTSVLKVSGKQEVDKRVTWSDVDAHGCSRDAMGNTLRELDEGRLQDNVLD
GGEEMYDSDWEDGSIPVACSKENHPESDIKGVTIEFDAADSVTKKPVRRASAEDKELAEL
VHKVHLLCLLARGRLIDSVCDDPLIQASLLSLLPSYLLKISEVSKLTANALSPIVSWFHD
NFHVRSSVSTRRSFHSDLAHALESREGTPEEIAALSVALFRALKLTTRFVSILDVASLKP
EADKNVSSNQDSSRVGGGIFNAPTLMVAKPEEVLASPVKSFSCDKKENVCETSSKGSPEC
KYSSPKSNNTQSKKSPVSCELSSGNLDPSSSMACSDISEACHPKEKSQALKRKGDLEFEM
QLEMALSATNVATSKSNICSDVKDLNSNSSTVLPVKRLKKIESGESSTSCLGISTAVGSR
KVGAPLYWAEVYCSGENLTGKWVHVDAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCGA
KDVTRRYCMKWYRIASKRVNSAWWDAVLAPLRELESGATGDLNVESSAKDSFVADRNSLE
DMELETRALTEPLPTNQQAYKNHQLYVIERWLNKYQILYPKGPILGFCSGHAVYPRSCVQ
TLKTKERWLREALQVKANEVPVKVIKNSSKSKKGQDFEPEDYDEVDARGNIELYGKWQLE
PLRLPSAVNGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKRLEIDSAPAMVGFEF
RNGRSTPVFDGIVVCAEFKDTILEAYAEEEEKREAEEKKRREAQATSRWYQLLSSIVTRQ
RLNNCYGNNSTSQSSSNFQNVKKTNSNVGVDSSQNDWQSPNQVDRGDTKLHAPSPFQSEE
HEHVYLIEDQSFDEENSVTTKRCHCGFTIQVEEL

High Scoring Gene Products

Symbol      Full name                                                 Information                            P value
RAD4        AT5G16630                                                 protein from Arabidopsis thaliana      9.2e-213
Xpc         xeroderma pigmentosum, complementation group C            protein from Mus musculus              4.5e-66
Gga.54220   Uncharacterized protein                                   protein from Gallus gallus             4.8e-64
Gga.54220   Uncharacterized protein                                   protein from Gallus gallus             2.1e-60
XPC         Uncharacterized protein                                   protein from Bos taurus                1.2e-59
Xpc         xeroderma pigmentosum, complementation group C            gene from Rattus norvegicus            3.1e-55
XPC         DNA repair protein complementing XP-C cells               protein from Homo sapiens              1.7e-54
XPC         Uncharacterized protein                                   protein from Sus scrofa                2.5e-50
XPC         Uncharacterized protein                                   protein from Canis lupus familiaris    1.2e-49
xpc         xeroderma pigmentosum, complementation group C            gene_product from Danio rerio          2.1e-46
mus210      mutagen-sensitive 210                                     protein from Drosophila melanogaster   1.9e-39
xpc-1                                                                 gene from Caenorhabditis elegans       4.0e-25
xpc         DNA repair protein Rad4 family protein                    gene from Dictyostelium discoideum     2.9e-23
orf19.6722                                                            gene_product from Candida albicans     2.0e-18
RAD4        Protein that recognizes and binds damaged DNA during NER  gene from Saccharomyces cerevisiae     1.6e-16
MGG_01699   Uncharacterized protein                                   protein from Magnaporthe oryzae 70-15  2.3e-14

The BLAST search returned 1 gene product which did not match your query constraints. Please see the full BLAST report below for the details.
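
The one-line hit summaries in the raw report (identifier, a possibly truncated description, score, P(N), and N) have a regular shape, so they can be pulled into records for filtering or sorting. The following is a minimal sketch, not part of the BLAST output: the names `Hit`, `parse_summary_line`, and `hits_below_threshold` are hypothetical, and treating the "Threshold: 0.001" parameter as an inclusive upper bound on P(N) is an assumption.

```python
import re
from typing import Iterable, List, NamedTuple, Optional


class Hit(NamedTuple):
    """One row of the 'Sequences producing High-scoring Segment Pairs' table."""
    identifier: str   # e.g. "TAIR|locus:2174160"
    description: str  # free text, often truncated with "..."
    score: int        # High Score column
    p_value: float    # Smallest Sum Probability P(N)
    n: int            # N column


# Layout assumed from the report: "<id> - <description>  <score>  <P(N)>  <N>"
_SUMMARY_RE = re.compile(
    r"^(?P<id>\S+) - (?P<desc>.*?)\s+(?P<score>\d+)\s+"
    r"(?P<p>\d+(?:\.\d+)?(?:[eE][-+]?\d+)?)\s+(?P<n>\d+)\s*$"
)


def parse_summary_line(line: str) -> Optional[Hit]:
    """Return a Hit for a summary row, or None for headers and other lines."""
    m = _SUMMARY_RE.match(line)
    if m is None:
        return None
    return Hit(m["id"], m["desc"], int(m["score"]), float(m["p"]), int(m["n"]))


def hits_below_threshold(lines: Iterable[str], threshold: float = 0.001) -> List[Hit]:
    """Keep rows whose P(N) is at or below the threshold (inclusive bound is assumed)."""
    parsed = (parse_summary_line(line) for line in lines)
    return [h for h in parsed if h is not None and h.p_value <= threshold]
```

Passing the lines of the report to `hits_below_threshold` skips the column header and alignment text, because only lines of the form `<id> - <description>` followed by three numeric fields match the pattern.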

Raw BLAST Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  002340
        (934 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2174160 - symbol:RAD4 species:3702 "Arabidopsi...  1484  9.2e-213  2
MGI|MGI:103557 - symbol:Xpc "xeroderma pigmentosum, compl...   672  4.5e-66   1
UNIPROTKB|F1N806 - symbol:Gga.54220 "Uncharacterized prot...   628  4.8e-64   2
UNIPROTKB|E1BUG1 - symbol:Gga.54220 "Uncharacterized prot...   628  2.1e-60   2
UNIPROTKB|E1BDJ1 - symbol:XPC "Uncharacterized protein" s...   447  1.2e-59   3
RGD|1305760 - symbol:Xpc "xeroderma pigmentosum, compleme...   593  3.1e-55   1
UNIPROTKB|Q01831 - symbol:XPC "DNA repair protein complem...   587  1.7e-54   1
UNIPROTKB|E9PH69 - symbol:XPC "DNA repair protein-complem...   581  6.3e-54   1
UNIPROTKB|F1SPI2 - symbol:XPC "Uncharacterized protein" s...   428  2.5e-50   2
UNIPROTKB|E2RCR3 - symbol:XPC "Uncharacterized protein" s...   448  1.2e-49   2
ZFIN|ZDB-GENE-030131-8461 - symbol:xpc "xeroderma pigment...   414  2.1e-46   2
FB|FBgn0004698 - symbol:mus210 "mutagen-sensitive 210" sp...   405  1.9e-39   2
ASPGD|ASPL0000010029 - symbol:AN3890 species:162425 "Emer...   328  4.8e-29   2
WB|WBGene00022296 - symbol:xpc-1 species:6239 "Caenorhabd...   283  4.0e-25   4
DICTYBASE|DDB_G0292296 - symbol:xpc "DNA repair protein R...   304  2.9e-23   2
POMBASE|SPAC12B10.12c - symbol:rhp41 "DNA repair protein ...   286  3.6e-23   2
ASPGD|ASPL0000008254 - symbol:AN6186 species:162425 "Emer...   198  2.5e-19   4
POMBASE|SPCC4G3.10c - symbol:rhp42 "DNA repair protein Rh...   251  2.9e-19   2
CGD|CAL0004788 - symbol:orf19.6722 species:5476 "Candida ...   240  2.0e-18   2
SGD|S000000964 - symbol:RAD4 "Protein that recognizes and...   237  1.6e-16   3
UNIPROTKB|G4MUV6 - symbol:MGG_01699 "Uncharacterized prot...   200  2.3e-14   3


>TAIR|locus:2174160
            symbol:RAD4 species:3702 "Arabidopsis thaliana"
            [GO:0003677 "DNA binding" evidence=IEA] [GO:0003684 "damaged DNA
            binding" evidence=IEA;ISS] [GO:0005634 "nucleus"
            evidence=ISM;IEA;ISS] [GO:0006289 "nucleotide-excision repair"
            evidence=IEA;ISS] [GO:0009507 "chloroplast" evidence=IDA]
            InterPro:IPR004583 InterPro:IPR018325 InterPro:IPR018326
            InterPro:IPR018327 InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403
            Pfam:PF10404 Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
            Pfam:PF01841 GO:GO:0005634 EMBL:CP002688 GenomeReviews:BA000015_GR
            GO:GO:0009507 GO:GO:0003684 GO:GO:0006289 InterPro:IPR002931
            KO:K10838 eggNOG:COG5535 PANTHER:PTHR12135 EMBL:AY062755
            EMBL:BT010359 IPI:IPI00534100 RefSeq:NP_001031894.1
            RefSeq:NP_197166.2 UniGene:At.27241 ProteinModelPortal:Q8W489
            STRING:Q8W489 PaxDb:Q8W489 PRIDE:Q8W489 EnsemblPlants:AT5G16630.1
            EnsemblPlants:AT5G16630.2 GeneID:831525 KEGG:ath:AT5G16630
            TAIR:At5g16630 HOGENOM:HOG000144515 InParanoid:Q8W489 OMA:QVDVWSE
            PhylomeDB:Q8W489 ProtClustDB:CLSN2690169 Genevestigator:Q8W489
            Uniprot:Q8W489
        Length = 865

 Score = 1484 (527.5 bits), Expect = 9.2e-213, Sum P(2) = 9.2e-213
 Identities = 322/610 (52%), Positives = 395/610 (64%)

Query:   346 KENVCETSSKGSPECKYSS--PKSNNTQSK-KSPVSCELSSGNLDPSSSMACSDISEACH 402
             K  +  TS+   P+ +  S  PK +++  K KSP   +   GN   S  +  + ++ +C 
Sbjct:   275 KHGIFRTSTLMVPKQQAISSYPKKSSSHVKNKSPFE-KPQLGNPLGSDQVQDNAVNSSCE 333

Query:   403 P--KEKSQALKRKGDLEFEMQLEMALSATNVATSKSNICSDVKDLNSNSSTVLPVKRLKK 460
                  KS   +RKGD+EFE Q+ MALSAT          +D    N  SS V   K++++
Sbjct:   334 AGMSIKSDGTRRKGDVEFERQIAMALSAT----------AD----NQQSSQVNNTKKVRE 379

Query:   461 IE--SGESSTSCLGISTAVGSRKVGAPLYWAEVYCSGENLTGKWVHVDAANAIIDGEQKV 518
             I   S  SS S   ISTA GS+KV +PL W EVYC+GEN+ GKWVHVDA N +ID EQ +
Sbjct:   380 ITKISNSSSVSDQVISTAFGSKKVDSPLCWLEVYCNGENMDGKWVHVDAVNGMIDAEQNI 439

Query:   519 EAAAAACKTSLRYIVAFAGCGAKDVTRRYCMKWYRIASKRVNSAWWDAVLAPLRELESGA 578
             EAAAAACKT LRY+VAFA  GAKDVTRRYC KW+ I+SKRV+S WWD VLAPL  LESGA
Sbjct:   440 EAAAAACKTVLRYVVAFAAGGAKDVTRRYCTKWHTISSKRVSSVWWDMVLAPLVHLESGA 499

Query:   579 TGD----------LN-VES--SAKDSFVADRNSLEDMELETRALTEPLPTNQQAYKNHQL 625
             T D          LN V S  S+  S    R++LEDMEL TRALTE LPTNQQAYK+H++
Sbjct:   500 THDEDIALRNFNGLNPVSSRASSSSSSFGIRSALEDMELATRALTESLPTNQQAYKSHEI 559

Query:   626 YVIERWLNKYQILYPKGPILGFCSGHAVYPRSCVQTLKTKERWLREALQVKANEXXXXXX 685
             Y IE+WL+K QIL+PKGP+LGFCSGH VYPR+CVQTLKTKERWLR+ LQ+KANE      
Sbjct:   560 YAIEKWLHKNQILHPKGPVLGFCSGHPVYPRTCVQTLKTKERWLRDGLQLKANEVPSKIL 619

Query:   686 XXXXXXXXGQDFEPEDYDEVDARGNIELYGKWQLEPLRLPSAVNGIVPRNERGQVDVWSE 745
                      +DFE  D +       +ELYGKWQ+EPL LP AVNGIVP+NERGQVDVWSE
Sbjct:   620 KRNSKFKKVKDFEDGDNNIKGGSSCMELYGKWQMEPLCLPPAVNGIVPKNERGQVDVWSE 679

Query:   746 KCLPPGTVHLRLPRVYSVAKRLEIDSAPAMVGFEFRNGRSTPVFDGIVVCAEFKDTILEA 805
             KCLPPGTVHLR PR+++VAKR  ID APAMVGFE+R+G +TP+F+GIVVC EFKDTILEA
Sbjct:   680 KCLPPGTVHLRFPRIFAVAKRFGIDYAPAMVGFEYRSGGATPIFEGIVVCTEFKDTILEA 739

Query:   806 YXXXXXXXXXXXXXXXXXQATSRWYQLLSSIVTRQRLNNCYGXXXXXXXXXXXXXVKKTN 865
             Y                 QA SRWYQLLSSI+TR+RL N Y                + N
Sbjct:   740 YAEEQEKKEEEERRRNEAQAASRWYQLLSSILTRERLKNRYANNSNDVEAKSL----EVN 795

Query:   866 SNVGVDSSQNDWQSPNQV-DRGDTKLHAPSPFQSEEHEHVYLIEDQSFDEENSVTTKRCH 924
             S   V +         +V  RG+      S  + E HEHV+L E+++FDEE SV TKRC 
Sbjct:   796 SETVVKAKNVKAPEKQRVAKRGEKSRVRKSRNEDESHEHVFLDEEETFDEETSVKTKRCK 855

Query:   925 CGFTIQVEEL 934
             CGF+++VE++
Sbjct:   856 CGFSVEVEQM 865

 Score = 595 (214.5 bits), Expect = 9.2e-213, Sum P(2) = 9.2e-213
 Identities = 148/331 (44%), Positives = 195/331 (58%)

Query:    31 SHNETGTLAETSREGVGKFLRHVNARSSSRSKKQDCAVGLTTSVLKVSGKQEVDKRVTWS 90
             S ++   LA+ SR  V K L   +AR S   KKQD           V+GK    K+    
Sbjct:     5 SESKNCRLAQASRVAVNKVLDKSSARGSRGKKKQDDNCDSAKRDKGVNGK---GKQA--- 58

Query:    91 DVDAHGCSRDAMGNTLRELDEGRLQDNVLDGGEEMYDSDWEDGSIPVACSK-ENHPESDI 149
              +DA       + N L +   G + D      +EM DSDWED  IP   S  +++   D 
Sbjct:    59 -LDAR-----LIDNVLEDRGCGNVDD------DEMNDSDWEDCPIPSLDSTVDDNNVDDT 106

Query:   150 KGVTIEFD--AADSVTKKPVRRASAEDKELAELVHKVHLLCLLARGRLIDSVCDDPLIQA 207
             + +TIEFD    D+  +K   RA+AEDK  AELVHKVHLLCLLARGR++DS C+DPLIQA
Sbjct:   107 RELTIEFDDDVPDAKKQKNAYRATAEDKVRAELVHKVHLLCLLARGRIVDSACNDPLIQA 166

Query:   208 XXXXXXXXXXXKISEVSKLTANALSPIVSWFHDNFHVRSSVSTRRSFHSDLAHALESREG 267
                        K+S + K+T   ++P++ W  +NF V  S S+ +SF + LA ALESR+G
Sbjct:   167 ALLSLLPSYLTKVSNLEKVTVKDIAPLLRWVRENFSVSCSPSSEKSFRTSLAFALESRKG 226

Query:   268 TPEEIAALSVALFRALKLTTRFVSILDVASLKPEADKNVSSNQDSSRVGGGIFNAPTLMV 327
             T EE+AAL+VAL RALKLTTRFVSILDVASLKP AD+N SS Q+ +++  GIF   TLMV
Sbjct:   227 TAEELAALAVALLRALKLTTRFVSILDVASLKPGADRNESSGQNRAKMKHGIFRTSTLMV 286

Query:   328 AKPEEVLASPVKSFSCDKKENVCETSSKGSP 358
              K + + + P KS S  K ++  E    G+P
Sbjct:   287 PKQQAISSYPKKSSSHVKNKSPFEKPQLGNP 317


>MGI|MGI:103557
            symbol:Xpc "xeroderma pigmentosum, complementation group C"
            species:10090 "Mus musculus" [GO:0000075 "cell cycle checkpoint"
            evidence=ISO] [GO:0000715 "nucleotide-excision repair, DNA damage
            recognition" evidence=ISO] [GO:0003677 "DNA binding" evidence=IEA]
            [GO:0003684 "damaged DNA binding" evidence=ISO] [GO:0003697
            "single-stranded DNA binding" evidence=ISO] [GO:0005634 "nucleus"
            evidence=ISO;IDA] [GO:0005737 "cytoplasm" evidence=ISO] [GO:0006281
            "DNA repair" evidence=IMP] [GO:0006289 "nucleotide-excision repair"
            evidence=ISO;IDA;IMP] [GO:0006974 "response to DNA damage stimulus"
            evidence=IMP] [GO:0010224 "response to UV-B" evidence=IMP]
            [GO:0031573 "intra-S DNA damage checkpoint" evidence=IGI]
            [GO:0071942 "XPC complex" evidence=ISO] InterPro:IPR004583
            InterPro:IPR018325 InterPro:IPR018326 InterPro:IPR018327
            InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403 Pfam:PF10404
            Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
            MGI:MGI:103557 GO:GO:0005737 GO:GO:0042493 GO:GO:0003684
            GO:GO:0003697 GO:GO:0010224 GO:GO:0006289 GO:GO:0031573
            GO:GO:0071942 GO:GO:0000715 KO:K10838 eggNOG:COG5535
            PANTHER:PTHR12135 GeneTree:ENSGT00390000005194 InterPro:IPR018026
            TIGRFAMs:TIGR00605 CTD:7508 HOGENOM:HOG000124671 HOVERGEN:HBG000407
            OMA:MKRFNKE OrthoDB:EOG40CHGQ ChiTaRS:XPC EMBL:U27398 EMBL:AB071144
            EMBL:AK004713 EMBL:AK028595 EMBL:AK166981 EMBL:U40005
            IPI:IPI00124885 PIR:S70630 RefSeq:NP_033557.2 UniGene:Mm.2806
            ProteinModelPortal:P51612 SMR:P51612 IntAct:P51612 STRING:P51612
            PhosphoSite:P51612 PaxDb:P51612 PRIDE:P51612
            Ensembl:ENSMUST00000032182 GeneID:22591 KEGG:mmu:22591
            UCSC:uc009cyd.1 InParanoid:P51612 NextBio:302933 Bgee:P51612
            CleanEx:MM_XPC Genevestigator:P51612 GermOnline:ENSMUSG00000030094
            Uniprot:P51612
        Length = 930

 Score = 672 (241.6 bits), Expect = 4.5e-66, P = 4.5e-66
 Identities = 234/886 (26%), Positives = 376/886 (42%)

Query:     2 RTRQDSKTQKDQASGKESTVRGALRDSESSHNETGTLAETSREGVGKFLRHVNARSSSRS 61
             R ++  KT+ ++ +  E +V     D +       +  + S+    +        ++  +
Sbjct:    11 RRKRGQKTEDNKVARHEESVADDFEDEKQKPRRKSSFPKVSQGKRKRGCSDPGDPTNGAA 70

Query:    62 KKQDCAVGLTTSVLKVSGKQEVDKRVTWSDVDAHGCSRDAMGNTLRELDEGRLQDNVLDG 121
             KK+       +  LKV  ++ +     + D  A  C +       + +D+G  +D+  D 
Sbjct:    71 KKKVAKATAKSKNLKVLKEEALSDGDDFRDSPAD-CKKAKKHPKSKVVDQGTDEDDSEDD 129

Query:   122 GEEMYDSDWEDGSIPVACSKENHP-ESDIKGVTIEFDAADSVTKKP------------VR 168
              EE+   +  +  + +  +    P +  +K V IE +      ++             +R
Sbjct:   130 WEEV--EELTEPVLDMGENSATSPSDMPVKAVEIEIETPQQAKERERSEKIKMEFETYLR 187

Query:   169 RASAE-DKELAELVHKVHLLCLLARGRLIDSVCDDPLIQAXXXXXXXXXXXKISEVSKLT 227
             R     +KE+ E +HKVHLLCLLA G   +S+C  P + A           K+  +    
Sbjct:   188 RMMKRFNKEVQENMHKVHLLCLLASGFYRNSICRQPDLLAIGLSIIPIRFTKVP-LQDRD 246

Query:   228 ANALSPIVSWFHDNFHVRSSVSTRRSFHSDLAHALESREGT-----PEEIAALSVALFRA 282
             A  LS +V WF   F V + +S   S   DL   LE R         EE+  + + + RA
Sbjct:   247 AYYLSNLVKWFIGTFTVNADLSA--SEQDDLQTTLERRIAIYSARDNEELVHIFLLILRA 304

Query:   283 LKLTTRFVSILDVASLKPEADKNVSSNQDSSRVGGGIFNAPTLMVAKPEEVLASPVKSFS 342
             L+L TR V  L    LK    K   S++++S  G G   +  L    PE     P  S  
Sbjct:   305 LQLLTRLVLSLQPIPLKSAVTKGRKSSKETSVEGPG--GSSELSSNSPESH-NKPTTSRR 361

Query:   343 CDKKENVCETSSKGSPECKYSSPKSNNTQSKKSPVS-CELSSGNLDPSSSMACSDISEAC 401
               ++E + E   K +   K  +  + + Q +K   S  E +   +          ++   
Sbjct:   362 IKEEETLSEGRGKATARGKRGTGTAGSRQRRKPSCSEGEEAEQKVQGRPHARKRRVAAKV 421

Query:   402 HPKEKSQALKRKGDLEFEMQLEMALSATNV----ATSKSNICSDVKDLNSNSSTVLPVKR 457
               KE+S++       +FE        +++        K    S  +   + S +    +R
Sbjct:   422 SYKEESESDGAGSGSDFEPSSGEGQHSSDEDCEPGPRKQKRASAPQRTKAGSKSASKTQR 481

Query:   458 LKKIESG---ESSTSCLG------ISTA---VGSRKVGAPLYWAEVYCSGENLTGKWVHV 505
               + E     E+S+S  G      +S+    +  RK      W EVYC  +    KWV V
Sbjct:   482 GSQCEPSSFPEASSSSSGCKRGKKVSSGAEEMADRKPAGVDQWLEVYCEPQ---AKWVCV 538

Query:   506 DAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCG-AKDVTRRYCMKWYRIASK-RVNSAW 563
             D  + ++   Q V     A K  + Y+V     G  +DVT+RY   W     K RV++ W
Sbjct:   539 DCVHGVVG--QPVACYKYATKP-MTYVVGIDSDGWVRDVTQRYDPAWMTATRKCRVDAEW 595

Query:   564 WDAVLAPLRELESGATGDLNVESSAKDSFVADRNSLEDMELETRALTEPLPTNQQAYKNH 623
             W   L P R L                  + +R   ED E + + L +PLPT+   YKNH
Sbjct:   596 WAETLRPYRSL------------------LTEREKKEDQEFQAKHLDQPLPTSISTYKNH 637

Query:   624 QLYVIERWLNKYQILYPK-GPILGFCSGHAVYPRSCVQTLKTKERWLREALQVKANEX-X 681
              LY ++R L K+Q +YP+   +LG+C G AVY R CV TL +++ WL++A  V+  E   
Sbjct:   638 PLYALKRHLLKFQAIYPETAAVLGYCRGEAVYSRDCVHTLHSRDTWLKQARVVRLGEVPY 697

Query:   682 XXXXXXXXXXXXGQDFEPEDYDEVDARGNIELYGKWQLEPLRLPSAVNGIVPRNERGQVD 741
                          +  EP+ +D  D    + LYG WQ E  + P AV+G VPRNE G V 
Sbjct:   698 KMVKGFSNRARKARLSEPQLHDHND----LGLYGHWQTEEYQPPIAVDGKVPRNEFGNVY 753

Query:   742 VWSEKCLPPGTVHLRLPRVYSVAKRLEIDSAPAMVGFEFRNGRSTPVFDGIVVCAEFKDT 801
             ++    +P G V + LP +  VA++L ID   A+ GF+F  G   PV DG +VC EF+D 
Sbjct:   754 LFLPSMMPVGCVQMTLPNLNRVARKLGIDCVQAITGFDFHGGYCHPVTDGYIVCEEFRDV 813

Query:   802 ILEAYXXXXXXXXXXXXXXXXXQATSRWYQLLSSIVTRQRLNNCYG 847
             +L A+                 +A   W  L+  ++ R+RL   YG
Sbjct:   814 LLAAWENEQAIIEKKEKEKKEKRALGNWKLLVRGLLIRERLKLRYG 859


>UNIPROTKB|F1N806
            symbol:Gga.54220 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0000715 "nucleotide-excision repair, DNA damage
            recognition" evidence=IEA] [GO:0003684 "damaged DNA binding"
            evidence=IEA] [GO:0003697 "single-stranded DNA binding"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0010224
            "response to UV-B" evidence=IEA] [GO:0031573 "intra-S DNA damage
            checkpoint" evidence=IEA] [GO:0071942 "XPC complex" evidence=IEA]
            InterPro:IPR004583 InterPro:IPR018325 InterPro:IPR018326
            InterPro:IPR018327 InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403
            Pfam:PF10404 Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
            GO:GO:0003684 GO:GO:0003697 GO:GO:0010224 GO:GO:0031573
            GO:GO:0071942 GO:GO:0000715 PANTHER:PTHR12135
            GeneTree:ENSGT00390000005194 InterPro:IPR018026 TIGRFAMs:TIGR00605
            EMBL:AADN02014130 IPI:IPI00818722 Ensembl:ENSGALT00000036242
            ArrayExpress:F1N806 Uniprot:F1N806
        Length = 826

 Score = 628 (226.1 bits), Expect = 4.8e-64, Sum P(2) = 4.8e-64
 Identities = 214/705 (30%), Positives = 311/705 (44%)

Query:   175 KELAELVHKVHLLCLLARGRLIDSVCDDPLIQAXXXXXXXXXXXKISEVSKLTANALSPI 234
             KE+ E  HKVHLLCLLA G   + +C  P + A           K+    ++    +S +
Sbjct:    94 KEVREDTHKVHLLCLLANGFYRNRICSQPDLHAIGLSIIPIHFTKVP-AGQVDLLYISNL 152

Query:   235 VSWFHDNFHVRSSVSTRRSFHSDLAHALESREGT-----PEEIAALSVALFRALKLTTRF 289
             V WF   F V   +ST +     L   LE R         EE+  + + + RAL+L  R 
Sbjct:   153 VKWFVGTFTVNDELSTEKG--EPLQSTLERRFAIYAARDDEELVHIFLIILRALQLLCRL 210

Query:   290 VSILDVASLKPEADKNVSS--NQDSSRVGGGIFNAPTLMVAKPEEVLASPVKSFSCDKKE 347
             V  L    LK E    VS    Q  +        +    ++   E   S   +     K+
Sbjct:   211 VLSLQPIPLK-ETKAKVSCFLKQKLTTPCSEKSTSKKQSLSSTSEGQESSGTTPKAVAKK 269

Query:   348 NVCETSSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLDPSSSMACSDISEACHPKEKS 407
               C+ + +     K S  + +N +SKK+  S +    +  P +S      S+ C+ +E  
Sbjct:   270 CPCKKAKRDE---KSSGSEEDNEESKKTK-SAQTERTH-KPKNSRWRRVASKVCYKEESG 324

Query:   408 QALKRKGDLEFEMQLEMALSATNVAT-SKSNICSDVKDLNSNSSTVLPVKRLKKIESGES 466
                    D E   + E  +S  +  T SK    S      +  S V+ VK  K  E+ ES
Sbjct:   325 SDEGSVSDFEISGE-ESDISDEDFETVSKKRRSSQ----GAQKSKVMTVKSPKS-ETSES 378

Query:   467 --STSCLGI----------STAVGS---------RKVGAPLYWAEVYCSGENLTGKWVHV 505
               S + LG+          +  + S         RKV     W EV+   E+   +WV V
Sbjct:   379 RLSRNSLGVEPRPHAQRKRNKIISSDEDDGQQMVRKVVGTDQWLEVFLERED---RWVCV 435

Query:   506 DAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCGA-KDVTRRYCMKWYRIA-SKRVNSAW 563
             D  + I+   Q  +    A K  L YIV F   G+ KDVT+RY   W  +   KRV+  W
Sbjct:   436 DCVHGIVGQPQ--QCFTYATKP-LSYIVGFDNDGSVKDVTQRYDPVWMTMTRKKRVDPEW 492

Query:   564 WDAVLAPLRELESGATGDLNVESSAKDSFVADRNSLEDMELETRALTEPLPTNQQAYKNH 623
             W+  L P                  K  FV DR+  E+ E + +   +PLPT    YKNH
Sbjct:   493 WEDTLQPY-----------------KSPFV-DRDKKEETEFQVKLQDQPLPTAIGEYKNH 534

Query:   624 QLYVIERWLNKYQILYPKGP-ILGFCSGHAVYPRSCVQTLKTKERWLREALQVKANEX-X 681
              LY ++R L KYQ +YP+   ILG+C G AVY R CV TL +K+ WL++A  V+  E   
Sbjct:   535 PLYALKRHLLKYQAIYPESAAILGYCRGEAVYSRDCVHTLHSKDTWLKQARVVRIGEVPY 594

Query:   682 XXXXXXXXXXXXGQDFEPEDYDEVDARGNIELYGKWQLEPLRLPSAVNGIVPRNERGQVD 741
                          +  EP + D+ D    + L+G+WQ E  + P AV+G VPRNE G V 
Sbjct:   595 KMVKGYSNQARKARLAEPANRDKAD----LALFGRWQTEEYQPPIAVDGKVPRNEYGNVY 650

Query:   742 VWSEKCLPPGTVHLRLPRVYSVAKRLEIDSAPAMVGFEFRNGRSTPVFDGIVVCAEFKDT 801
             ++    LP G V LRLP +  +A++L+ID A A+ GF+F  G S  V DG VVC E+K+ 
Sbjct:   651 LFLPSMLPIGCVQLRLPNLNRLARKLDIDCAQAVTGFDFHGGYSHAVTDGYVVCEEYKEV 710

Query:   802 ILEAYXXXXXXXXXXXXXXXXXQATSRWYQLLSSIVTRQRLNNCY 846
             ++ A+                 +A   W  L   ++ R+RL   Y
Sbjct:   711 LIAAWENEQAEIEKKEKEKREKRALGNWKLLTKGLLIRERLKQRY 755

 Score = 43 (20.2 bits), Expect = 4.8e-64, Sum P(2) = 4.8e-64
 Identities = 9/26 (34%), Positives = 15/26 (57%)

Query:   107 RELDEGRLQDNVLDGGEEMYDSDWED 132
             +E+DE    DN  D  ++  + +WED
Sbjct:     9 KEMDE----DNTDDDDDDESEDEWED 30


>UNIPROTKB|E1BUG1
            symbol:Gga.54220 "Uncharacterized protein" species:9031
            "Gallus gallus" [GO:0000715 "nucleotide-excision repair, DNA damage
            recognition" evidence=IEA] [GO:0003684 "damaged DNA binding"
            evidence=IEA] [GO:0003697 "single-stranded DNA binding"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0010224
            "response to UV-B" evidence=IEA] [GO:0031573 "intra-S DNA damage
            checkpoint" evidence=IEA] [GO:0071942 "XPC complex" evidence=IEA]
            InterPro:IPR004583 InterPro:IPR018325 InterPro:IPR018326
            InterPro:IPR018327 InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403
            Pfam:PF10404 Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
            GO:GO:0003684 GO:GO:0003697 GO:GO:0010224 GO:GO:0031573
            GO:GO:0071942 GO:GO:0000715 PANTHER:PTHR12135
            GeneTree:ENSGT00390000005194 InterPro:IPR018026 TIGRFAMs:TIGR00605
            OMA:MKRFNKE EMBL:AADN02014130 IPI:IPI00603077
            Ensembl:ENSGALT00000010275 ArrayExpress:E1BUG1 Uniprot:E1BUG1
        Length = 936

 Score = 628 (226.1 bits), Expect = 2.1e-60, Sum P(2) = 2.1e-60
 Identities = 214/705 (30%), Positives = 311/705 (44%)

Query:   175 KELAELVHKVHLLCLLARGRLIDSVCDDPLIQAXXXXXXXXXXXKISEVSKLTANALSPI 234
             KE+ E  HKVHLLCLLA G   + +C  P + A           K+    ++    +S +
Sbjct:   204 KEVREDTHKVHLLCLLANGFYRNRICSQPDLHAIGLSIIPIHFTKVP-AGQVDLLYISNL 262

Query:   235 VSWFHDNFHVRSSVSTRRSFHSDLAHALESREGT-----PEEIAALSVALFRALKLTTRF 289
             V WF   F V   +ST +     L   LE R         EE+  + + + RAL+L  R 
Sbjct:   263 VKWFVGTFTVNDELSTEKG--EPLQSTLERRFAIYAARDDEELVHIFLIILRALQLLCRL 320

Query:   290 VSILDVASLKPEADKNVSS--NQDSSRVGGGIFNAPTLMVAKPEEVLASPVKSFSCDKKE 347
             V  L    LK E    VS    Q  +        +    ++   E   S   +     K+
Sbjct:   321 VLSLQPIPLK-ETKAKVSCFLKQKLTTPCSEKSTSKKQSLSSTSEGQESSGTTPKAVAKK 379

Query:   348 NVCETSSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLDPSSSMACSDISEACHPKEKS 407
               C+ + +     K S  + +N +SKK+  S +    +  P +S      S+ C+ +E  
Sbjct:   380 CPCKKAKRDE---KSSGSEEDNEESKKTK-SAQTERTH-KPKNSRWRRVASKVCYKEESG 434

Query:   408 QALKRKGDLEFEMQLEMALSATNVAT-SKSNICSDVKDLNSNSSTVLPVKRLKKIESGES 466
                    D E   + E  +S  +  T SK    S      +  S V+ VK  K  E+ ES
Sbjct:   435 SDEGSVSDFEISGE-ESDISDEDFETVSKKRRSSQ----GAQKSKVMTVKSPKS-ETSES 488

Query:   467 --STSCLGI----------STAVGS---------RKVGAPLYWAEVYCSGENLTGKWVHV 505
               S + LG+          +  + S         RKV     W EV+   E+   +WV V
Sbjct:   489 RLSRNSLGVEPRPHAQRKRNKIISSDEDDGQQMVRKVVGTDQWLEVFLERED---RWVCV 545

Query:   506 DAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCGA-KDVTRRYCMKWYRIA-SKRVNSAW 563
             D  + I+   Q  +    A K  L YIV F   G+ KDVT+RY   W  +   KRV+  W
Sbjct:   546 DCVHGIVGQPQ--QCFTYATKP-LSYIVGFDNDGSVKDVTQRYDPVWMTMTRKKRVDPEW 602

Query:   564 WDAVLAPLRELESGATGDLNVESSAKDSFVADRNSLEDMELETRALTEPLPTNQQAYKNH 623
             W+  L P                  K  FV DR+  E+ E + +   +PLPT    YKNH
Sbjct:   603 WEDTLQPY-----------------KSPFV-DRDKKEETEFQVKLQDQPLPTAIGEYKNH 644

Query:   624 QLYVIERWLNKYQILYPKGP-ILGFCSGHAVYPRSCVQTLKTKERWLREALQVKANEX-X 681
              LY ++R L KYQ +YP+   ILG+C G AVY R CV TL +K+ WL++A  V+  E   
Sbjct:   645 PLYALKRHLLKYQAIYPESAAILGYCRGEAVYSRDCVHTLHSKDTWLKQARVVRIGEVPY 704

Query:   682 XXXXXXXXXXXXGQDFEPEDYDEVDARGNIELYGKWQLEPLRLPSAVNGIVPRNERGQVD 741
                          +  EP + D+ D    + L+G+WQ E  + P AV+G VPRNE G V 
Sbjct:   705 KMVKGYSNQARKARLAEPANRDKAD----LALFGRWQTEEYQPPIAVDGKVPRNEYGNVY 760

Query:   742 VWSEKCLPPGTVHLRLPRVYSVAKRLEIDSAPAMVGFEFRNGRSTPVFDGIVVCAEFKDT 801
             ++    LP G V LRLP +  +A++L+ID A A+ GF+F  G S  V DG VVC E+K+ 
Sbjct:   761 LFLPSMLPIGCVQLRLPNLNRLARKLDIDCAQAVTGFDFHGGYSHAVTDGYVVCEEYKEV 820

Query:   802 ILEAYXXXXXXXXXXXXXXXXXQATSRWYQLLSSIVTRQRLNNCY 846
             ++ A+                 +A   W  L   ++ R+RL   Y
Sbjct:   821 LIAAWENEQAEIEKKEKEKREKRALGNWKLLTKGLLIRERLKQRY 865

 Score = 43 (20.2 bits), Expect = 2.1e-60, Sum P(2) = 2.1e-60
 Identities = 9/26 (34%), Positives = 15/26 (57%)

Query:   107 RELDEGRLQDNVLDGGEEMYDSDWED 132
             +E+DE    DN  D  ++  + +WED
Sbjct:   119 KEMDE----DNTDDDDDDESEDEWED 140


>UNIPROTKB|E1BDJ1
            symbol:XPC "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0071942 "XPC complex" evidence=IEA] [GO:0031573
            "intra-S DNA damage checkpoint" evidence=IEA] [GO:0010224 "response
            to UV-B" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA]
            [GO:0003697 "single-stranded DNA binding" evidence=IEA] [GO:0003684
            "damaged DNA binding" evidence=IEA] [GO:0000715
            "nucleotide-excision repair, DNA damage recognition" evidence=IEA]
            InterPro:IPR004583 InterPro:IPR018325 InterPro:IPR018326
            InterPro:IPR018327 InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403
            Pfam:PF10404 Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
            GO:GO:0003684 GO:GO:0003697 GO:GO:0010224 GO:GO:0031573
            GO:GO:0071942 GO:GO:0000715 KO:K10838 PANTHER:PTHR12135
            GeneTree:ENSGT00390000005194 InterPro:IPR018026 TIGRFAMs:TIGR00605
            CTD:7508 OMA:MKRFNKE EMBL:DAAA02054616 IPI:IPI00702830
            RefSeq:NP_001192837.1 UniGene:Bt.45276 Ensembl:ENSBTAT00000009683
            GeneID:524274 KEGG:bta:524274 NextBio:20873931 Uniprot:E1BDJ1
        Length = 932

 Score = 447 (162.4 bits), Expect = 1.2e-59, Sum P(3) = 1.2e-59
 Identities = 94/257 (36%), Positives = 138/257 (53%)

Query:   593 VADRNSLEDMELETRALTEPLPTNQQAYKNHQLYVIERWLNKYQILYPK-GPILGFCSGH 651
             + DR   ED E + + L +PLPT    YKNH LY ++R L KY+ +YP+   +LG+C G 
Sbjct:   610 LVDREQREDQEFQAKHLDQPLPTVIGTYKNHPLYALKRHLLKYEAIYPETAAVLGYCRGE 669

Query:   652 AVYPRSCVQTLKTKERWLREALQVKANEX-XXXXXXXXXXXXXGQDFEPEDYDEVDARGN 710
             AVY R CV TL +++ WL++A  V+  E                +  EP+ +D  D    
Sbjct:   670 AVYSRDCVHTLHSRDTWLKQARVVRLGEVPYKMVKGYSNRARRARQAEPQLHDYND---- 725

Query:   711 IELYGKWQLEPLRLPSAVNGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKRLEID 770
             + L+G+WQ E  + P AV+G VPRNE G V ++    +P G V L LP ++ VA++L ID
Sbjct:   726 LGLFGRWQTEEYQPPVAVDGKVPRNEFGNVYLFLPSMMPVGCVQLNLPNLHRVARKLNID 785

Query:   771 SAPAMVGFEFRNGRSTPVFDGIVVCAEFKDTILEAYXXXXXXXXXXXXXXXXXQATSRWY 830
              A A+ GF+F  G   P+ DG VVC E++D +L A+                 +A   W 
Sbjct:   786 CAQAVTGFDFHKGYCHPITDGYVVCEEYRDVLLTAWENEQALIEKKEKEKREKRALGNWK 845

Query:   831 QLLSSIVTRQRLNNCYG 847
              L+  ++ R+RL   YG
Sbjct:   846 LLVKGLLIRERLKLRYG 862

 Score = 171 (65.3 bits), Expect = 1.2e-59, Sum P(3) = 1.2e-59
 Identities = 58/185 (31%), Positives = 85/185 (45%)

Query:   152 VTIEFDAADSVTKKPVRRASAEDKELAELVHKVHLLCLLARGRLIDSVCDDPLIQAXXXX 211
             + +EF+   +  ++ ++R S   KE+ E  HKVHLLCLLA G   +S+C+ P +QA    
Sbjct:   179 IKMEFE---TYLRRMMKRFS---KEVHEDTHKVHLLCLLANGFYRNSICNQPDLQAIGLS 232

Query:   212 XXXXXXXKISEVSKLTANALSPIVSWFHDNFHVRSSVSTRRSFHSDLAHALESREGT--- 268
                    K+     +  + LS +V WF   F V + +ST       L   LE R      
Sbjct:   233 IIPTRFTKVPP-RDVDVSYLSNLVKWFIGTFTVNAELSTNEQ--DGLQTTLERRFAIYSA 289

Query:   269 --PEEIAALSVALFRALKLTTRFVSILDVASLKPEADKNVSSNQD-SSRVGGGIFNAPTL 325
                EE+  + + L RAL L TR V  L    LK  A+K     ++ S+   GG   A + 
Sbjct:   290 RDDEELVHIFLLLLRALHLPTRLVLSLQPVPLKLSAEKGKKPCKERSTEAPGGSSEAASH 349

Query:   326 MVAKP 330
                KP
Sbjct:   350 APGKP 354

 Score = 120 (47.3 bits), Expect = 1.2e-59, Sum P(3) = 1.2e-59
 Identities = 31/87 (35%), Positives = 42/87 (48%)

Query:   488 WAEVYCSGENLTGKWVHVDAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCGA-KDVTRR 546
             W EV+   E    KWV VD  + ++   Q +     A K  + Y+V   G G  +DVT+R
Sbjct:   527 WLEVFLEREE---KWVCVDCVHGVVG--QPLTCYQYATKP-VTYVVGIDGAGCVRDVTQR 580

Query:   547 YCMKWYRIASK-RVNSAWWDAVLAPLR 572
             Y   W     K RV++AWW   L P R
Sbjct:   581 YDPAWLTATRKSRVDAAWWAETLRPYR 607

 Score = 54 (24.1 bits), Expect = 2.0e-47, Sum P(3) = 2.0e-47
 Identities = 26/91 (28%), Positives = 37/91 (40%)

Query:     2 RTRQDSKTQKDQASGKESTVRGALRDSE-SSHNETGTLAETSREGVGKFLR-----HVNA 55
             R R  +K    + SG ++   G+  D E SS +      E S  G+ +  R        A
Sbjct:   415 RRRVAAKVSYKEESGSDAASSGS--DFEPSSEDSCRPSDEDSEPGLPRPRRAPAPQRTKA 472

Query:    56 RSSSRSKKQDCAVGLTTSVLKVSGKQEVDKR 86
              S SRSK Q  + GL    ++ S      KR
Sbjct:   473 GSKSRSKSQQGSRGLRPGFVEASASAAGSKR 503

 Score = 44 (20.5 bits), Expect = 2.3e-46, Sum P(3) = 2.3e-46
 Identities = 17/64 (26%), Positives = 31/64 (48%)

Query:   403 PKEKSQALKRKGDLEFEMQLEMALSATNVATSKSNICSDVKDLNSNSSTVLPVKRLK-KI 461
             P E+ +A   KG  E + + +       V      +  DV +  + S++ LPVK ++ +I
Sbjct:   106 PPER-EAAADKGSCEGDDEEDSEEDWEEVEEVSEPVPGDVGESGAFSASALPVKPVEIEI 164

Query:   462 ESGE 465
             E+ E
Sbjct:   165 ETPE 168

 Score = 37 (18.1 bits), Expect = 2.0e-07, Sum P(2) = 2.0e-07
 Identities = 19/66 (28%), Positives = 24/66 (36%)

Query:   640 PKG--PILGFCSGHAVYPRSCVQTLKTKERWLREALQVKANEXXXXXXXXXXXXXXGQDF 697
             P+G  P  G  +G A   R   Q  + + R  R A +V   E              G DF
Sbjct:   386 PRGESPSSGEDAGQARGQRRGTQR-RAQARRRRVAAKVSYKEESGSDAASS-----GSDF 439

Query:   698 EPEDYD 703
             EP   D
Sbjct:   440 EPSSED 445


>RGD|1305760
            symbol:Xpc "xeroderma pigmentosum, complementation group C"
            species:10116 "Rattus norvegicus" [GO:0000075 "cell cycle
            checkpoint" evidence=ISO] [GO:0000715 "nucleotide-excision repair,
            DNA damage recognition" evidence=IEA;ISO] [GO:0003674
            "molecular_function" evidence=ND] [GO:0003684 "damaged DNA binding"
            evidence=IEA;ISO] [GO:0003697 "single-stranded DNA binding"
            evidence=IEA;ISO] [GO:0005634 "nucleus" evidence=ISO;IDA]
            [GO:0005737 "cytoplasm" evidence=ISO;IDA] [GO:0006281 "DNA repair"
            evidence=ISO] [GO:0006289 "nucleotide-excision repair"
            evidence=ISO] [GO:0006974 "response to DNA damage stimulus"
            evidence=ISO] [GO:0010224 "response to UV-B" evidence=IEA;ISO]
            [GO:0031573 "intra-S DNA damage checkpoint" evidence=IEA;ISO]
            [GO:0042493 "response to drug" evidence=IEP] [GO:0071942 "XPC
            complex" evidence=IEA;ISO] InterPro:IPR004583 InterPro:IPR018325
            InterPro:IPR018326 InterPro:IPR018327 InterPro:IPR018328
            Pfam:PF03835 Pfam:PF10403 Pfam:PF10404 Pfam:PF10405 SMART:SM01030
            SMART:SM01031 SMART:SM01032 RGD:1305760 GO:GO:0005634 GO:GO:0005737
            GO:GO:0042493 GO:GO:0003684 GO:GO:0003697 GO:GO:0010224
            EMBL:CH473957 GO:GO:0031573 GO:GO:0071942 GO:GO:0000715 KO:K10838
            PANTHER:PTHR12135 GeneTree:ENSGT00390000005194 InterPro:IPR018026
            TIGRFAMs:TIGR00605 CTD:7508 OMA:MKRFNKE OrthoDB:EOG40CHGQ
            IPI:IPI00365175 RefSeq:NP_001101344.1 UniGene:Rn.22820
            Ensembl:ENSRNOT00000011490 GeneID:312560 KEGG:rno:312560
            UCSC:RGD:1305760 NextBio:664995 Uniprot:D4A3D8
        Length = 933

 Score = 593 (213.8 bits), Expect = 3.1e-55, P = 3.1e-55
 Identities = 244/890 (27%), Positives = 378/890 (42%)

Query:    11 KDQASGKESTVRGALRDSESSHNETGTLAETSREGVGKFLRHVNARSSSR--SKKQDCAV 68
             K +  G+E+    A R  ES  ++     E  +     FL  V+     R  S   D   
Sbjct:    10 KRRKRGQEAEDNKATRPEESDADDFED--EKQKPPRKCFLPKVSQGKRKRDCSDPGDPTN 67

Query:    69 GLTTS-VLKVSGKQEVDKRVTWSDVDAHGCS-RDAMGNTLRELDEGRLQDNVLDGGEEMY 126
             G     V K + K +  K V    +   G   RD++ N  +     + +  V+D G +  
Sbjct:    68 GAAKKKVAKATTKSKNLKAVKEEALSDDGDDFRDSLSNCRKAKKHPKRE--VVDQGTDED 125

Query:   127 DS--DWEDG---SIPVACSKENHP--ESD--IKGVTIEFDAADSVTKKP----------- 166
             DS  DWE+    + PV    EN     SD  +K V IE +  +    +            
Sbjct:   126 DSEDDWEEVEELTEPVLDMGENSATSRSDLPVKAVEIEIETPEQAKARERSEKIKMEFET 185

Query:   167 -VRRASAE-DKELAELVHKVHLLCLLARGRLIDSVCDDPLIQAXXXXXXXXXXXKISEVS 224
              +RR     +KE+ E +HKVHLLCLLA G   +S+C  P + A           K+  + 
Sbjct:   186 YLRRMMKRFNKEVQENMHKVHLLCLLASGFYRNSICQQPDLLAIGLSIIPIRFTKVP-LQ 244

Query:   225 KLTANALSPIVSWFHDNFHVRS--SVSTRRSFHSDLAH--ALESREGTPEEIAALSVALF 280
                   LS +V WF   F V +  S S + S  + L    A+ S     EE+  + + + 
Sbjct:   245 DRDVYYLSNLVKWFIGTFTVNADLSASEQDSLQTTLERRIAIYSARDN-EELVHIFLLIL 303

Query:   281 RALKLTTRFVSILDVASLKPEADKNVSSNQDSSRVGGGIFNAPTLMVAKPEEVLASPVKS 340
             RAL+L TR V  L    LK    K   S++++S  G G  + P+  +  PE     P  S
Sbjct:   304 RALQLLTRLVLSLQPIPLKSAVAKGKKSSKETSLEGPGDSSEPSSNI--PESH-NKPKTS 360

Query:   341 FSCDKKENVCETSSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLDPSSSMACSDISEA 400
                 ++E + E S K +   K  +  + + Q +K P SC  S G        A  +I   
Sbjct:   361 KRIKQEETLSEGSGKANARGKRGTATAGSRQQRK-P-SC--SEGE------EAKQEIQS- 409

Query:   401 CHPKEKSQALKRKGDLEFEMQLEMALSATN--VATSKSNICSDVKDLNSNSSTVLPVKRL 458
              HP+ + + +  K   + E + + A S ++  +++ +    SD +D              
Sbjct:   410 -HPQAQKRRVAAKVSYKEESESDGAGSGSDFELSSGEGQHSSD-EDCKPGPRKQKRASAP 467

Query:   459 KKIESGESSTSCL--GI-----STAVGSRKVGAPLYWAEVYCSGENLTGK-------WVH 504
             ++ ++G  S S    G      S +V S    A     ++ C GE    +       W+ 
Sbjct:   468 QRSKAGSKSASKTQSGSQWEPPSFSVASSSSSACKRGKKISCGGEETDDRKAAGVDQWLE 527

Query:   505 V----DAANAIIDGEQKVEAAAAAC-KTSLRYIVAFAGCGAKDVTRRYCMKWYRIASKRV 559
             V     A    +D    V     AC K + + +    G  +          W R  ++R 
Sbjct:   528 VFCEPQAKWVCVDCVHGVVGQPVACYKYATKPMTYVVGIDSDG--------WVRDVTQRY 579

Query:   560 NSAWWDAVLAPLRELESGATGDLNVESSAKDSFVADRNSLEDMELETRALTEPLPTNQQA 619
             + AW  A      + E  A   L    S     + +R   ED E + + L +PLPT+   
Sbjct:   580 DPAWMTATRKCRVDAEWWAE-TLRPYRSP----LTEREKKEDQEFQAKHLDQPLPTSIST 634

Query:   620 YKNHQLYVIERWLNKYQILYPKGP-ILGFCSGHAVYPRSCVQTLKTKERWLREALQVKAN 678
             YKNH LY ++R L K+Q +YP+   +LG+C G AVY R CV TL +++ WL++A  V+  
Sbjct:   635 YKNHPLYALKRHLLKFQAIYPESAAVLGYCRGEAVYSRDCVHTLHSRDTWLKQARVVRLG 694

Query:   679 EX-XXXXXXXXXXXXXGQDFEPEDYDEVDARGNIELYGKWQLEPLRLPSAVNGIVPRNER 737
             E                +  EP+ +D  D    + L+G WQ E  + P AV+G VPRNE 
Sbjct:   695 EVPYKMVKGFSNRARKARLSEPQLHDHND----LGLFGHWQTEEYQPPVAVDGKVPRNEF 750

Query:   738 GQVDVWSEKCLPPGTVHLRLPRVYSVAKRLEIDSAPAMVGFEFRNGRSTPVFDGIVVCAE 797
             G V ++    +P G V + LP ++ VA++L ID   A+ GF+F  G   PV DG VVC E
Sbjct:   751 GNVYLFLPSMMPIGCVQMNLPNLHRVARKLGIDCVQAITGFDFHGGYCHPVTDGYVVCEE 810

Query:   798 FKDTILEAYXXXXXXXXXXXXXXXXXQATSRWYQLLSSIVTRQRLNNCYG 847
             F+D +L A+                 +A   W  L+  ++ R+RL   YG
Sbjct:   811 FRDVLLAAWENEQALIEKKEKEKKEKRALGNWKLLVRGLLIRERLKLRYG 860


>UNIPROTKB|Q01831
            symbol:XPC "DNA repair protein complementing XP-C cells"
            species:9606 "Homo sapiens" [GO:0010224 "response to UV-B"
            evidence=IEA] [GO:0031573 "intra-S DNA damage checkpoint"
            evidence=IEA] [GO:0042493 "response to drug" evidence=IEA]
            [GO:0000075 "cell cycle checkpoint" evidence=IMP] [GO:0000405
            "bubble DNA binding" evidence=TAS] [GO:0003684 "damaged DNA
            binding" evidence=IDA] [GO:0000715 "nucleotide-excision repair, DNA
            damage recognition" evidence=IDA;TAS] [GO:0000404 "loop DNA
            binding" evidence=TAS] [GO:0071942 "XPC complex" evidence=IDA]
            [GO:0006289 "nucleotide-excision repair" evidence=IDA;TAS]
            [GO:0003697 "single-stranded DNA binding" evidence=IDA] [GO:0005634
            "nucleus" evidence=IDA] [GO:0005737 "cytoplasm" evidence=IDA]
            [GO:0000718 "nucleotide-excision repair, DNA damage removal"
            evidence=TAS] [GO:0005654 "nucleoplasm" evidence=TAS] [GO:0006281
            "DNA repair" evidence=TAS] [GO:0005515 "protein binding"
            evidence=IPI] Reactome:REACT_216 InterPro:IPR004583
            InterPro:IPR018325 InterPro:IPR018326 InterPro:IPR018327
            InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403 Pfam:PF10404
            Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
            GO:GO:0005737 GO:GO:0005654 GO:GO:0042493 GO:GO:0003684
            GO:GO:0003697 GO:GO:0010224 GO:GO:0000075 GO:GO:0000405
            GO:GO:0031573 GO:GO:0000718 GO:GO:0071942 PDB:2A4J PDB:2GGM
            PDB:2OBH PDBsum:2A4J PDBsum:2GGM PDBsum:2OBH GO:GO:0000715
            GO:GO:0000404 KO:K10838 eggNOG:COG5535 PANTHER:PTHR12135
            InterPro:IPR018026 TIGRFAMs:TIGR00605 EMBL:D21089 EMBL:AF261901
            EMBL:AF261892 EMBL:AF261893 EMBL:AF261894 EMBL:AF261895
            EMBL:AF261896 EMBL:AF261897 EMBL:AF261898 EMBL:AF261899
            EMBL:AF261900 EMBL:AY131066 EMBL:AC093495 EMBL:FJ695191
            EMBL:FJ695192 EMBL:BC016620 EMBL:AK222844 EMBL:X65024
            IPI:IPI00156793 PIR:S44345 RefSeq:NP_001139241.1 RefSeq:NP_004619.3
            UniGene:Hs.475538 UniGene:Hs.739296 ProteinModelPortal:Q01831
            SMR:Q01831 DIP:DIP-31225N IntAct:Q01831 MINT:MINT-105410
            STRING:Q01831 PhosphoSite:Q01831 DMDM:296453081 PaxDb:Q01831
            PeptideAtlas:Q01831 PRIDE:Q01831 Ensembl:ENST00000285021
            GeneID:7508 KEGG:hsa:7508 UCSC:uc011ave.2 CTD:7508
            GeneCards:GC03M014161 HGNC:HGNC:12816 HPA:CAB009932 MIM:278720
            MIM:613208 neXtProt:NX_Q01831 Orphanet:276255 PharmGKB:PA37413
            HOGENOM:HOG000124671 HOVERGEN:HBG000407 InParanoid:Q01831
            OMA:MKRFNKE OrthoDB:EOG40CHGQ ChiTaRS:XPC EvolutionaryTrace:Q01831
            GenomeRNAi:7508 NextBio:29391 ArrayExpress:Q01831 Bgee:Q01831
            CleanEx:HS_XPC Genevestigator:Q01831 GermOnline:ENSG00000154767
            Uniprot:Q01831
        Length = 940

 Score = 587 (211.7 bits), Expect = 1.7e-54, P = 1.7e-54
 Identities = 246/903 (27%), Positives = 375/903 (41%)

Query:     5 QDSKTQKDQASGK---ESTVRGALRDSESSHNETGTLAETSREGVGKFLRHVNARSSSRS 61
             ++ ++QK +A  K   E     A  D +    +   L++ S+    +   H    +   +
Sbjct:    14 RELRSQKSKAKSKARREEEEEDAFEDEKPP--KKSLLSKVSQGKRKRGCSHPGGSADGPA 71

Query:    62 KKQDCAVGLTTSVLKVSGKQEV----DKRVTWSDVD-AHGCSRDAMGNTLRELDEGRLQD 116
             KK+   V + +  LKV   + +    D R   SD+  AH   R A  N     +E    +
Sbjct:    72 KKKVAKVTVKSENLKVIKDEALSDGDDLRDFPSDLKKAHHLKRGATMNEDSNEEEEE-SE 130

Query:   117 NVLDGGEEMYDSDWEDGSIPVACSKENHPESDIKGVTIEFDAADSVTKKP---------- 166
             N  +  EE+ +    D     A S+   P   +K V IE +  +    +           
Sbjct:   131 NDWEEVEELSEPVLGDVRESTAFSRSLLP---VKPVEIEIETPEQAKTRERSEKIKLEFE 187

Query:   167 --VRRASAE-DKELAELVHKVHLLCLLARGRLIDSVCDDPLIQAXXXXXXXXXXXKISEV 223
               +RRA    +K + E  HKVHLLCLLA G   +++C  P + A           ++   
Sbjct:   188 TYLRRAMKRFNKGVHEDTHKVHLLCLLANGFYRNNICSQPDLHAIGLSIIPARFTRVLP- 246

Query:   224 SKLTANALSPIVSWFHDNFHVRSSVSTRRSFHSDLAHALESREGT-----PEEIAALSVA 278
               +    LS +V WF   F V + +S   S   +L   LE R         EE+  + + 
Sbjct:   247 RDVDTYYLSNLVKWFIGTFTVNAELSA--SEQDNLQTTLERRFAIYSARDDEELVHIFLL 304

Query:   279 LFRALKLTTRFVSILDVASLKPEADKNVSSNQDSSRVGGGIFNAPTLMVAKPEEVLASPV 338
             + RAL+L TR V  L    LK    K    +++      G  +  +  V +       P 
Sbjct:   305 ILRALQLLTRLVLSLQPIPLKSATAKGKKPSKERLTADPGGSSETSSQVLENH---TKP- 360

Query:   339 KSFSCDKKENVCETSSKGSPECKYSSPKSNNTQSKKS---PVSCELSSGNLDPSSSMACS 395
             K+    K+E   ET +KG+  C+ S+    N   +K    P S E   G  D        
Sbjct:   361 KTSKGTKQE---ETFAKGT--CRPSAKGKRNKGGRKKRSKPSSSEEDEGPGDKQEKAT-- 413

Query:   396 DISEACHPKEKSQALKRKGDLEFEMQLEMALSATNVATSKSNICSDVKDLNSN----SST 451
                   H +E+  A  R    E E   + A S ++   S S   SD  D +S        
Sbjct:   414 --QRRPHGRERRVA-SRVSYKE-ESGSDEAGSGSDFELS-SGEASDPSDEDSEPGPPKQR 468

Query:   452 VLPVKRLKKIESGESSTSCLG----------ISTAVGSRKVGAPLYWAEVYCSGENLTG- 500
               P  +  K  S  +S +  G           S++  S K G  +          ++ G 
Sbjct:   469 KAPAPQRTKAGSKSASRTHRGSHRKDPSLPAASSSSSSSKRGKKMCSDGEKAEKRSIAGI 528

Query:   501 ------------KWVHVDAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCG-AKDVTRRY 547
                         KWV VD  + ++   Q +     A K  + Y+V     G  +DVT+RY
Sbjct:   529 DQWLEVFCEQEEKWVCVDCVHGVVG--QPLTCYKYATKP-MTYVVGIDSDGWVRDVTQRY 585

Query:   548 CMKWYRIASK-RVNSAWWDAVLAPLRELESGATGDLNVESSAKDSFVADRNSLEDMELET 606
                W  +  K RV++ WW   L P +                   F+ DR   ED+E + 
Sbjct:   586 DPVWMTVTRKCRVDAEWWAETLRPYQS-----------------PFM-DREKKEDLEFQA 627

Query:   607 RALTEPLPTNQQAYKNHQLYVIERWLNKYQILYPK-GPILGFCSGHAVYPRSCVQTLKTK 665
             + + +PLPT    YKNH LY ++R L KY+ +YP+   ILG+C G AVY R CV TL ++
Sbjct:   628 KHMDQPLPTAIGLYKNHPLYALKRHLLKYEAIYPETAAILGYCRGEAVYSRDCVHTLHSR 687

Query:   666 ERWLREALQVKANEX-XXXXXXXXXXXXXGQDFEPEDYDEVDARGNIELYGKWQLEPLRL 724
             + WL++A  V+  E                +  EP+  +E D    + L+G WQ E  + 
Sbjct:   688 DTWLKKARVVRLGEVPYKMVKGFSNRARKARLAEPQLREEND----LGLFGYWQTEEYQP 743

Query:   725 PSAVNGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKRLEIDSAPAMVGFEFRNGR 784
             P AV+G VPRNE G V ++    +P G V L LP ++ VA++L+ID   A+ GF+F  G 
Sbjct:   744 PVAVDGKVPRNEFGNVYLFLPSMMPIGCVQLNLPNLHRVARKLDIDCVQAITGFDFHGGY 803

Query:   785 STPVFDGIVVCAEFKDTILEAYXXXXXXXXXXXXXXXXXQATSRWYQLLSSIVTRQRLNN 844
             S PV DG +VC EFKD +L A+                 +A   W  L   ++ R+RL  
Sbjct:   804 SHPVTDGYIVCEEFKDVLLTAWENEQAVIERKEKEKKEKRALGNWKLLAKGLLIRERLKR 863

Query:   845 CYG 847
              YG
Sbjct:   864 RYG 866


>UNIPROTKB|E9PH69
            symbol:XPC "DNA repair protein-complementing XP-C cells"
            species:9606 "Homo sapiens" [GO:0003684 "damaged DNA binding"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0006289
            "nucleotide-excision repair" evidence=IEA] InterPro:IPR004583
            InterPro:IPR018325 InterPro:IPR018326 InterPro:IPR018327
            InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403 Pfam:PF10404
            Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
            GO:GO:0005634 GO:GO:0003684 GO:GO:0006289 KO:K10838
            PANTHER:PTHR12135 InterPro:IPR018026 TIGRFAMs:TIGR00605
            EMBL:AC093495 EMBL:FJ695191 EMBL:FJ695192 RefSeq:NP_001139241.1
            UniGene:Hs.475538 UniGene:Hs.739296 GeneID:7508 KEGG:hsa:7508
            CTD:7508 HGNC:HGNC:12816 ChiTaRS:XPC GenomeRNAi:7508 NextBio:29391
            IPI:IPI00924991 ProteinModelPortal:E9PH69 SMR:E9PH69 PRIDE:E9PH69
            Ensembl:ENST00000449060 UCSC:uc011avg.2 ArrayExpress:E9PH69
            Bgee:E9PH69 Uniprot:E9PH69
        Length = 903

 Score = 581 (209.6 bits), Expect = 6.3e-54, P = 6.3e-54
 Identities = 217/764 (28%), Positives = 327/764 (42%)

Query:   123 EEMYDSDWEDGSIPVACSKENHPESDIKGVTIEFDAADSVTKKPVRRASAEDKELAELVH 182
             EE  ++DWE+       +K       IK   +EF+   +  ++ ++R +   K + E  H
Sbjct:   126 EEESENDWEE-------AKTRERSEKIK---LEFE---TYLRRAMKRFN---KGVHEDTH 169

Query:   183 KVHLLCLLARGRLIDSVCDDPLIQAXXXXXXXXXXXKISEVSKLTANALSPIVSWFHDNF 242
             KVHLLCLLA G   +++C  P + A           ++     +    LS +V WF   F
Sbjct:   170 KVHLLCLLANGFYRNNICSQPDLHAIGLSIIPARFTRVLP-RDVDTYYLSNLVKWFIGTF 228

Query:   243 HVRSSVSTRRSFHSDLAHALESREGT-----PEEIAALSVALFRALKLTTRFVSILDVAS 297
              V + +S   S   +L   LE R         EE+  + + + RAL+L TR V  L    
Sbjct:   229 TVNAELSA--SEQDNLQTTLERRFAIYSARDDEELVHIFLLILRALQLLTRLVLSLQPIP 286

Query:   298 LKPEADKNVSSNQDSSRVGGGIFNAPTLMVAKPEEVLASPVKSFSCDKKENVCETSSKGS 357
             LK    K    +++      G  +  +  V +       P K+    K+E   ET +KG+
Sbjct:   287 LKSATAKGKKPSKERLTADPGGSSETSSQVLENH---TKP-KTSKGTKQE---ETFAKGT 339

Query:   358 PECKYSSPKSNNTQSKKS---PVSCELSSGNLDPSSSMACSDISEACHPKEKSQALKRKG 414
               C+ S+    N   +K    P S E   G  D              H +E+  A  R  
Sbjct:   340 --CRPSAKGKRNKGGRKKRSKPSSSEEDEGPGDKQEKAT----QRRPHGRERRVA-SRVS 392

Query:   415 DLEFEMQLEMALSATNVATSKSNICSDVKDLNSN----SSTVLPVKRLKKIESGESSTSC 470
               E E   + A S ++   S S   SD  D +S          P  +  K  S  +S + 
Sbjct:   393 YKE-ESGSDEAGSGSDFELS-SGEASDPSDEDSEPGPPKQRKAPAPQRTKAGSKSASRTH 450

Query:   471 LG----------ISTAVGSRKVGAPLYWAEVYCSGENLTG-------------KWVHVDA 507
              G           S++  S K G  +          ++ G             KWV VD 
Sbjct:   451 RGSHRKDPSLPAASSSSSSSKRGKKMCSDGEKAEKRSIAGIDQWLEVFCEQEEKWVCVDC 510

Query:   508 ANAIIDGEQKVEAAAAACKTSLRYIVAFAGCG-AKDVTRRYCMKWYRIASK-RVNSAWWD 565
              + ++   Q +     A K  + Y+V     G  +DVT+RY   W  +  K RV++ WW 
Sbjct:   511 VHGVVG--QPLTCYKYATKP-MTYVVGIDSDGWVRDVTQRYDPVWMTVTRKCRVDAEWWA 567

Query:   566 AVLAPLRELESGATGDLNVESSAKDSFVADRNSLEDMELETRALTEPLPTNQQAYKNHQL 625
               L P +                   F+ DR   ED+E + + + +PLPT    YKNH L
Sbjct:   568 ETLRPYQS-----------------PFM-DREKKEDLEFQAKHMDQPLPTAIGLYKNHPL 609

Query:   626 YVIERWLNKYQILYPK-GPILGFCSGHAVYPRSCVQTLKTKERWLREALQVKANEX-XXX 683
             Y ++R L KY+ +YP+   ILG+C G AVY R CV TL +++ WL++A  V+  E     
Sbjct:   610 YALKRHLLKYEAIYPETAAILGYCRGEAVYSRDCVHTLHSRDTWLKKARVVRLGEVPYKM 669

Query:   684 XXXXXXXXXXGQDFEPEDYDEVDARGNIELYGKWQLEPLRLPSAVNGIVPRNERGQVDVW 743
                        +  EP+  +E D    + L+G WQ E  + P AV+G VPRNE G V ++
Sbjct:   670 VKGFSNRARKARLAEPQLREEND----LGLFGYWQTEEYQPPVAVDGKVPRNEFGNVYLF 725

Query:   744 SEKCLPPGTVHLRLPRVYSVAKRLEIDSAPAMVGFEFRNGRSTPVFDGIVVCAEFKDTIL 803
                 +P G V L LP ++ VA++L+ID   A+ GF+F  G S PV DG +VC EFKD +L
Sbjct:   726 LPSMMPIGCVQLNLPNLHRVARKLDIDCVQAITGFDFHGGYSHPVTDGYIVCEEFKDVLL 785

Query:   804 EAYXXXXXXXXXXXXXXXXXQATSRWYQLLSSIVTRQRLNNCYG 847
              A+                 +A   W  L   ++ R+RL   YG
Sbjct:   786 TAWENEQAVIERKEKEKKEKRALGNWKLLAKGLLIRERLKRRYG 829


>UNIPROTKB|F1SPI2
            symbol:XPC "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0071942 "XPC complex" evidence=IEA] [GO:0031573
            "intra-S DNA damage checkpoint" evidence=IEA] [GO:0010224 "response
            to UV-B" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA]
            [GO:0003697 "single-stranded DNA binding" evidence=IEA] [GO:0003684
            "damaged DNA binding" evidence=IEA] [GO:0000715
            "nucleotide-excision repair, DNA damage recognition" evidence=IEA]
            InterPro:IPR004583 InterPro:IPR018325 InterPro:IPR018326
            InterPro:IPR018327 InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403
            Pfam:PF10404 Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
            GO:GO:0003684 GO:GO:0003697 GO:GO:0010224 GO:GO:0031573
            GO:GO:0071942 GO:GO:0000715 KO:K10838 PANTHER:PTHR12135
            GeneTree:ENSGT00390000005194 InterPro:IPR018026 TIGRFAMs:TIGR00605
            CTD:7508 OMA:MKRFNKE EMBL:CU633560 RefSeq:XP_003132441.1
            Ensembl:ENSSSCT00000012699 GeneID:100514251 KEGG:ssc:100514251
            ArrayExpress:F1SPI2 Uniprot:F1SPI2
        Length = 944

 Score = 428 (155.7 bits), Expect = 2.5e-50, Sum P(2) = 2.5e-50
 Identities = 98/299 (32%), Positives = 147/299 (49%)

Query:   551 WYRIASKRVNSAWWDAVLAPLRELESGATGDLNVESSAKDSFVADRNSLEDMELETRALT 610
             W R  ++R + AW  A     R+    A          +   + +R   ED E + + L 
Sbjct:   583 WVRDVTQRYDPAWMTAT----RKCRVDAVWWAETLRPYRSPLL-EREQREDQEFQAKHLD 637

Query:   611 EPLPTNQQAYKNHQLYVIERWLNKYQILYPK-GPILGFCSGHAVYPRSCVQTLKTKERWL 669
             +P+PT    YKNH LY ++R L KY+ +YP+   ILG+C G AVY R CV TL +++ WL
Sbjct:   638 QPMPTVIGTYKNHPLYALKRHLLKYEAIYPETAAILGYCRGEAVYSRDCVHTLHSRDTWL 697

Query:   670 REALQVKANEX-XXXXXXXXXXXXXGQDFEPEDYDEVDARGNIELYGKWQLEPLRLPSAV 728
             ++   V+  E                +  EP+  D  D    + L+G+WQ E  + P AV
Sbjct:   698 KQGRVVRLGEVPYKMVKGYSNRARKARLAEPQLRDHND----LPLFGQWQTEEYQPPVAV 753

Query:   729 NGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKRLEIDSAPAMVGFEFRNGRSTPV 788
             +G VPRNE G V ++    +P G V L LP +  VA++L ID   A+ GF+F  G S P+
Sbjct:   754 DGKVPRNEFGNVYLFLPSMMPVGCVQLNLPNLQRVARKLNIDCVQAITGFDFHKGYSHPI 813

Query:   789 FDGIVVCAEFKDTILEAYXXXXXXXXXXXXXXXXXQATSRWYQLLSSIVTRQRLNNCYG 847
              DG +VC E++D +L A+                 +    W  L+  ++ R+RL   YG
Sbjct:   814 TDGYIVCEEYRDILLAAWENEQALIEKKEKEKKEKRTLGNWKLLVKGLLIRERLRLRYG 872

 Score = 179 (68.1 bits), Expect = 2.5e-50, Sum P(2) = 2.5e-50
 Identities = 81/304 (26%), Positives = 123/304 (40%)

Query:   174 DKELAELVHKVHLLCLLARGRLIDSVCDDPLIQAXXXXXXXXXXXKISEVSKLTANALSP 233
             +KE+ E  HKVHLLCLLA G   +S+C  P ++A           K+     +    LS 
Sbjct:   200 NKEVHEDTHKVHLLCLLANGFYRNSICSQPDLRAIGLSIIPTRFTKVPP-QDVDVCYLSN 258

Query:   234 IVSWFHDNFHVRSSVSTRRSFHSDLAHALESREGT-----PEEIAALSVALFRALKLTTR 288
             +V WF   F V + +ST       L   LE R         EE+  + + + RAL L+ R
Sbjct:   259 LVKWFIGTFTVNADLSTNEQ--DGLQTTLERRFAIYSARDDEELVHIFLLIIRALHLSAR 316

Query:   289 FVSILDVASLKPEADKNVSSNQDSSRVGGGIFNAPTLMVAKPEEVLASPVKSFSCDKKEN 348
              V  L    LK  A K   ++++ S  G G  ++ T   + P     + +KS S +++E+
Sbjct:   317 LVLSLQPIPLKSSAAKGKKASKERSTEGPGC-SSET---SSPGPAKQTKLKSSSGNRRED 372

Query:   349 VCETSSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLDPSSSMACSDISEACHPKE--K 406
                  + G P  K    K+     K+   S   SSG        A     EA  P    +
Sbjct:   373 PSSEGTSG-PRAKGKGSKAAAATKKQREPS---SSGE---EEGKAAGQQGEARRPARGRR 425

Query:   407 SQALKRKGDLEFEMQLEMALSATNVATSKSNI-CSDVKDLNSNSSTVLPVKRLKKIESGE 465
              QA  R    E E   + A S+++   S  +  C   +D             L + ++G 
Sbjct:   426 RQAATRVSYKE-ESGSDKASSSSDFELSSGDSHCPSDEDSEPGLRRQRRAPGLPRTKAGA 484

Query:   466 SSTS 469
              S S
Sbjct:   485 KSDS 488

 Score = 138 (53.6 bits), Expect = 7.5e-47, Sum P(3) = 7.5e-47
 Identities = 68/247 (27%), Positives = 104/247 (42%)

Query:   339 KSFSCDKKENVCETSSKGSPECKYSS-------PKSNNTQSKKSPVSCELSSGNLDPSSS 391
             K+ +  KK+   E SS G  E K +        P     +   + VS +  SG+ D +SS
Sbjct:   389 KAAAATKKQR--EPSSSGEEEGKAAGQQGEARRPARGRRRQAATRVSYKEESGS-DKASS 445

Query:   392 MACSDISEA---CHPKEKSQ-ALKRKGDLEFEMQLEMALSATNVATSKSNICSDVKDLNS 447
              +  ++S     C   E S+  L+R+        L    +    + S+S   S  K    
Sbjct:   446 SSDFELSSGDSHCPSDEDSEPGLRRQRRAP---GLPRTKAGAK-SDSRSQRGSHPKPPGF 501

Query:   448 NSSTVLPVKRLKKIESGESSTSCLGISTAVGSRKVGAPLYWAEVYCSGENLTGKWVHVDA 507
              +++  P    +K   G   TS  G   A G +  G   +W EV+C  E+   KWV VD 
Sbjct:   502 LAASAGPPGSKRK---GGKKTSVRG-EEADGGKVAGVD-HWLEVFCERED---KWVCVDC 553

Query:   508 ANAIIDGEQKVEAAAAACKTSLRYIVAFAGCG-AKDVTRRYCMKWYRIASK-RVNSAWWD 565
              + ++   Q +     A K  + Y+V   G G  +DVT+RY   W     K RV++ WW 
Sbjct:   554 VHGVVG--QPLTCYQYATKP-MTYVVGIDGDGWVRDVTQRYDPAWMTATRKCRVDAVWWA 610

Query:   566 AVLAPLR 572
               L P R
Sbjct:   611 ETLRPYR 617

 Score = 52 (23.4 bits), Expect = 7.5e-47, Sum P(3) = 7.5e-47
 Identities = 15/49 (30%), Positives = 22/49 (44%)

Query:    17 KESTVRGALRDSESSHNETGTLAETSREGVGKFLRHVNARSSSRSKKQD 65
             K S  +G     E S    G  +ETS  G  K       +SSS ++++D
Sbjct:   327 KSSAAKGKKASKERSTEGPGCSSETSSPGPAK---QTKLKSSSGNRRED 372


>UNIPROTKB|E2RCR3
            symbol:XPC "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0006289 "nucleotide-excision repair"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IEA] [GO:0003684
            "damaged DNA binding" evidence=IEA] InterPro:IPR004583
            InterPro:IPR018325 InterPro:IPR018326 InterPro:IPR018327
            InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403 Pfam:PF10404
            Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
            GO:GO:0005634 GO:GO:0003684 GO:GO:0006289 PANTHER:PTHR12135
            GeneTree:ENSGT00390000005194 InterPro:IPR018026 TIGRFAMs:TIGR00605
            OMA:MKRFNKE EMBL:AAEX03012049 Ensembl:ENSCAFT00000007204
            Uniprot:E2RCR3
        Length = 949

 Score = 448 (162.8 bits), Expect = 1.2e-49, Sum P(2) = 1.2e-49
 Identities = 96/259 (37%), Positives = 139/259 (53%)

Query:   591 SFVADRNSLEDMELETRALTEPLPTNQQAYKNHQLYVIERWLNKYQILYPK-GPILGFCS 649
             S + +R   ED E + + L +PLPT    YKNH LY ++R L KY+ +YP+   ILG+C 
Sbjct:   622 SLLVEREKKEDSEFQAKHLGQPLPTVIGTYKNHPLYALKRHLLKYEAIYPETAAILGYCR 681

Query:   650 GHAVYPRSCVQTLKTKERWLREALQVKANEX-XXXXXXXXXXXXXGQDFEPEDYDEVDAR 708
             G AVY R CV TL +++ WL++A  V+  E                +  EP+  D+ D  
Sbjct:   682 GEAVYSRDCVHTLHSRDTWLKQARVVRLGEVPYKMVKGYSNRARKARLAEPQLQDQND-- 739

Query:   709 GNIELYGKWQLEPLRLPSAVNGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKRLE 768
               + L+GKWQ E  + P AV+G VPRNE G V ++    +P G V L LP ++ VA++L+
Sbjct:   740 --LGLFGKWQTEEYQPPVAVDGKVPRNEFGNVYLFLPSMMPVGCVQLNLPNLHRVARKLD 797

Query:   769 IDSAPAMVGFEFRNGRSTPVFDGIVVCAEFKDTILEAYXXXXXXXXXXXXXXXXXQATSR 828
             ID   A+ GF+F  G S P+ DG +VC E+KD +L A+                 +A   
Sbjct:   798 IDCVQAITGFDFHKGYSHPITDGYIVCEEYKDVLLAAWENEQALIEKREKEKREKRALGN 857

Query:   829 WYQLLSSIVTRQRLNNCYG 847
             W  L   ++ R+RL   YG
Sbjct:   858 WKLLARGLLIRERLKLRYG 876

 Score = 151 (58.2 bits), Expect = 1.2e-49, Sum P(2) = 1.2e-49
 Identities = 59/211 (27%), Positives = 95/211 (45%)

Query:   152 VTIEFDAADSVTKKPVRRASAEDKELAELVHKVHLLCLLARGRLIDSVCDDPLIQAXXXX 211
             + +EF+   +  ++ ++R S   KE+ E  HKVHLLCLLA G    ++C+ P + A    
Sbjct:   188 IKVEFE---TYLRRMMKRFS---KEVREDTHKVHLLCLLANGFYRSNICNQPDLLAIGLS 241

Query:   212 XXXXXXXKISEVSKLTANALSPIVSWFHDNFHVRSSVSTRRSFHSDLAHALESREGT--- 268
                    ++     + +  LS +V WF   F V + +ST       L   LE R      
Sbjct:   242 IVPTRFTRVPP-QDVDSGYLSNLVKWFVGTFTVNADLSTNEQ--DGLQTTLERRFAIYSA 298

Query:   269 --PEEIAALSVALFRALKLTTRFVSILDVASLK-PEADKNVSSNQDSSRVGGGIFNAPTL 325
                EE+  + + + RAL+L TR V  L    LK P A    ++ + S+   G      +L
Sbjct:   299 RDDEELVHIFLLILRALQLPTRLVLSLQPLPLKLPTAKGKKATTEKSAEDPGS-----SL 353

Query:   326 MVAKPEEVLASPVKSFSCDKKENVCETSSKG 356
               + P     +  K+    ++E   +TSSKG
Sbjct:   354 ETSSPVAEGQTKPKTSKGTRQE---DTSSKG 381

 Score = 128 (50.1 bits), Expect = 3.3e-47, Sum P(2) = 3.3e-47
 Identities = 82/302 (27%), Positives = 120/302 (39%)

Query:   296 ASLKPEADKNVSSNQDSSRVGGGIFNAPTLMVAKPEEVLASPVKSFSCDKKENVC----- 350
             A+ +  A+   SS + SS V  G     T    + E+  +  + S S   K+        
Sbjct:   340 ATTEKSAEDPGSSLETSSPVAEGQTKPKTSKGTRQEDTSSKGLGSTSAKGKKGKAAAVGK 399

Query:   351 ---ETSSKGSPECKYSSPKSNNTQSKK--------SPVSCELSSGNLDPSSSMACSDIS- 398
                E SS G  E K +  +   TQ ++        S VS +  S + D  SS +  ++S 
Sbjct:   400 RRREPSSSGEEERK-AGGQEEETQRRRYGRERQVASRVSYKEESAS-DKGSSGSDFELSS 457

Query:   399 -EACHPK-EKSQALKRKGDLEFEMQLEMALSATNVATSKSNICSDVK-DLNSNSSTVLPV 455
              EA H   E S+ +  +       Q   A S T+  T              S SS+    
Sbjct:   458 GEAHHSSDEDSEPVLPRQRRAPGPQRTKAGSRTDSRTQSGRPSKHPGFPAASTSSSSSKS 517

Query:   456 KRLKKIES-GESSTSCLGISTAVGSRKVGAPLYWAEVYCSGENLTGKWVHVDAANAIIDG 514
             K+ KKI S GE +            RK      W EV+C  E    KWV VD  + ++  
Sbjct:   518 KQGKKISSDGEGAER----------RKAAGVDQWLEVFCEQEE---KWVCVDCVHGVVG- 563

Query:   515 EQKVEAAAAACKTSLRYIVAFAGCGA-KDVTRRYCMKWYRIASK-RVNSAWWDAVLAPLR 572
              Q +     A K  + Y+V   G G+ +DVT+RY   W     K RV++ WW   L P +
Sbjct:   564 -QALACYKYATKP-MTYVVGIDGDGSVRDVTQRYDPAWMTATRKCRVDAKWWAETLRPYQ 621

Query:   573 EL 574
              L
Sbjct:   622 SL 623

 Score = 52 (23.4 bits), Expect = 3.1e-39, Sum P(2) = 3.1e-39
 Identities = 18/74 (24%), Positives = 36/74 (48%)

Query:   395 SDISEACHPKEKSQALKRKGDLEFEMQLEMALSATNVATSKSNICSDVKDLNSNSSTVLP 454
             + + +  HP ++  A+  KG  E + + E       V      +  DV +  + S +VLP
Sbjct:   107 ASVRKKAHPSQREAAVD-KGSCEEDDEEESEDEWEEVEELGEPVPGDVGENAAFSKSVLP 165

Query:   455 VKRLK-KIESGESS 467
             VK ++ +IE+ + +
Sbjct:   166 VKPVEIEIETPQQA 179

 Score = 45 (20.9 bits), Expect = 1.7e-38, Sum P(2) = 1.7e-38
 Identities = 14/43 (32%), Positives = 20/43 (46%)

Query:     3 TRQDSKTQKDQASGKESTVRGALRDSESSHNETGTLAETSREG 45
             +R DS+TQ  + S K      A   S SS ++ G    +  EG
Sbjct:   488 SRTDSRTQSGRPS-KHPGFPAASTSSSSSKSKQGKKISSDGEG 529

 Score = 43 (20.2 bits), Expect = 2.8e-38, Sum P(2) = 2.8e-38
 Identities = 21/77 (27%), Positives = 34/77 (44%)

Query:    14 ASGKESTVRGALRDSESSHNETGTLAE-TSREGVGKFLRHVNARS------SSRSKK-QD 65
             A GK++T   +  D  SS   +  +AE  ++    K  R  +  S      S++ KK + 
Sbjct:   335 AKGKKATTEKSAEDPGSSLETSSPVAEGQTKPKTSKGTRQEDTSSKGLGSTSAKGKKGKA 394

Query:    66 CAVGLTTSVLKVSGKQE 82
              AVG        SG++E
Sbjct:   395 AAVGKRRREPSSSGEEE 411

 Score = 41 (19.5 bits), Expect = 4.5e-38, Sum P(2) = 4.5e-38
 Identities = 21/73 (28%), Positives = 28/73 (38%)

Query:    17 KESTVRGALRDSESSHNETGTLAETSR---EGVGKFLRHVNARSSSRSKKQDCAVGLTTS 73
             K  T +G    +E S  + G+  ETS    EG  K       R    S K    +G T++
Sbjct:   331 KLPTAKGKKATTEKSAEDPGSSLETSSPVAEGQTKPKTSKGTRQEDTSSK---GLGSTSA 387

Query:    74 VLKVSGKQEVDKR 86
               K      V KR
Sbjct:   388 KGKKGKAAAVGKR 400


>ZFIN|ZDB-GENE-030131-8461
            symbol:xpc "xeroderma pigmentosum, complementation
            group C" species:7955 "Danio rerio" [GO:0003677 "DNA binding"
            evidence=IEA] [GO:0003684 "damaged DNA binding" evidence=IEA]
            [GO:0006289 "nucleotide-excision repair" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] InterPro:IPR004583 InterPro:IPR018325
            InterPro:IPR018326 InterPro:IPR018327 InterPro:IPR018328
            Pfam:PF03835 Pfam:PF10403 Pfam:PF10404 Pfam:PF10405 SMART:SM01030
            SMART:SM01031 SMART:SM01032 ZFIN:ZDB-GENE-030131-8461 GO:GO:0005634
            GO:GO:0003684 GO:GO:0006289 KO:K10838 PANTHER:PTHR12135
            GeneTree:ENSGT00390000005194 CTD:7508 HOVERGEN:HBG000407
            OMA:MKRFNKE EMBL:BX784025 IPI:IPI00610110 RefSeq:NP_001038675.1
            UniGene:Dr.76635 Ensembl:ENSDART00000058100 GeneID:541386
            KEGG:dre:541386 InParanoid:Q1LVE4 NextBio:20879198 Uniprot:Q1LVE4
        Length = 879

 Score = 414 (150.8 bits), Expect = 2.1e-46, Sum P(2) = 2.1e-46
 Identities = 89/254 (35%), Positives = 133/254 (52%)

Query:   595 DRNSLEDMELETRALTEPLPTNQQAYKNHQLYVIERWLNKYQILYPK-GPILGFCSGHAV 653
             +R   ED E++ + L +PLPT+   YKNH LYV++R L KY+ LYP    +LG+C G  V
Sbjct:   552 ERGQKEDQEMQAKLLDKPLPTSVSEYKNHPLYVLKRHLLKYEALYPATAAVLGYCRGEPV 611

Query:   654 YPRSCVQTLKTKERWLREALQVKANEXXXXXXXXXXXXXXGQDFEPEDYDEVDARGNIEL 713
             Y R CV TL +++ WL+EA  V+  E                    E  +  D    + L
Sbjct:   612 YSRDCVHTLHSRDTWLKEARTVRLGEEPYKMVLGFSNRSRKARMMSEQKNVKD----LAL 667

Query:   714 YGKWQLEPLRLPSAVNGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKRLEIDSAP 773
             +G WQ E  + P AV+G VPRNE G V ++    LP G VH+ LP ++ VA++L ID A 
Sbjct:   668 FGTWQTEEYQPPIAVDGKVPRNEFGNVYMFKSCMLPIGCVHVHLPNLHRVARKLNIDCAL 727

Query:   774 AMVGFEFRNGRSTPVFDGIVVCAEFKDTILEAYXXXXXXXXXXXXXXXXXQATSRWYQLL 833
             A+ GF++  G +  V DG +VC E ++ +  A+                 +A + W  L+
Sbjct:   728 AVTGFDYHCGFAHAVNDGYIVCEEHEEILKAAWENEQEIQQKKEQEKREKRAVTNWTLLV 787

Query:   834 SSIVTRQRLNNCYG 847
               ++ ++RL   YG
Sbjct:   788 KGLLIKERLKRRYG 801

 Score = 155 (59.6 bits), Expect = 2.1e-46, Sum P(2) = 2.1e-46
 Identities = 94/385 (24%), Positives = 167/385 (43%)

Query:    26 RDSESSHNETGTLAETSREGVGKFLRHV-NAR-SSSRSKKQDCAVGLTTSVLKVSGKQEV 83
             +  + ++ ++G+  + ++E   +  +++ N++ +S RS+K    +   TS  K     EV
Sbjct:    15 KPKQIANTKSGSKTQKAKENGMETKKNLKNSKVASRRSRKVKDVLDEVTS--KYFQDSEV 72

Query:    84 DKRVTWSDVDAHGCSRDAMGNTLRELDEGRLQDNVLDGGEEMYDSDWED-----GSIPVA 138
              K     D+  H   R  + +T   L + ++++   D  +E    DWE+     G +   
Sbjct:    73 -KTEEPEDLSDHSEERMIIEDT--SLSK-QVKEEEEDSEDE---DDWEEVEEMAGPLGPV 125

Query:   139 CSKENHPESDIKGVTIEFDAADSVTK---KPVRRASAE----------DKELAELVHKVH 185
              S E   ES  K V IE +  D + K   K  R+A  E          +K+L    HKVH
Sbjct:   126 DSSELALES--KPVEIEIETPDMIRKRQKKEKRKAEFETYLRRMMNRFNKDLLVDTHKVH 183

Query:   186 LLCLLARGRLIDSVCDDPLIQAXXXXXXXXXXXKISEVSKLTANALSPIVSWFHDNFHVR 245
             LLCL+A G   + +  +P + A            +S + ++    L  ++ WF   F + 
Sbjct:   184 LLCLMASGLFRNRLLCEPDLLAVALSLLPSHFTTVS-LKRINNGFLEGLLKWFQATFTLN 242

Query:   246 SSVSTRRSFHSDLAHALESREG-----TPEEIAALSVALFRALKLTTRFVSILDVASLKP 300
              ++   +    DL   LE R G       EE+  L + + R+L+L  R V  L    LKP
Sbjct:   243 PALPEEKEV--DLRTVLEKRMGCLSARNHEEMTYLFLLVLRSLRLFCRLVLSLQPLPLKP 300

Query:   301 E-ADKNVSS-NQDSSRVGGGIFNAPTLMVA----KPEEVLASPVKSFSCDKKENVCETSS 354
               A K+ ++ ++ SS       ++P L V+    +P    A+  +     +K+   +T  
Sbjct:   301 PPATKSKTTPSKSSSEKAQSEKSSPELKVSPGSKRPSSATAAAKEDRGGKRKK---KTGG 357

Query:   355 KGSPECKYSS-PKSNNTQSKKSPVS 378
              G  E   +  PK++  +S  S VS
Sbjct:   358 GGDKEAAGAQKPKNSRRRSVASKVS 382

 Score = 103 (41.3 bits), Expect = 3.3e-41, Sum P(3) = 3.3e-41
 Identities = 60/256 (23%), Positives = 96/256 (37%)

Query:   353 SSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLDPSSSMACSDISEACHPKEKSQ---- 408
             S K SPE K S P S    S  +    +        +      + + A  PK   +    
Sbjct:   320 SEKSSPELKVS-PGSKRPSSATAAAKEDRGGKRKKKTGGGGDKEAAGAQKPKNSRRRSVA 378

Query:   409 ---ALKRKGDLEFEMQLEMALSATNVATSKSN-----ICSDVKDLNSNSSTVLPVKRLKK 460
                + K  G  E E Q E     +N   S+ +     IC   K  +  SS V   +R ++
Sbjct:   379 SKVSYKEVGSEEEEEQSEEEFQPSNEDDSEDSDGAVKICRKSKVKSRRSSKVKQEERSEE 438

Query:   461 IESGESSTSC-LGISTAVGSRKVGAPL-YWAEVYCSGENLTGKWVHVDAANAIIDGEQKV 518
              E  E        +      +K G     W EVY      +G+WV VD    +  G+ ++
Sbjct:   439 EEEEEEEEEEEKEVKKQRRKKKQGKGADEWLEVYLES---SGRWVCVDVDQGV--GQPQL 493

Query:   519 EAAAAACKTSLRYIVAFAGCG-AKDVTRRYCMKWYRIASKR-VNSAWWDAVLAPLR--EL 574
              +  A     + Y+V     G  KD+  RY   W   + +R V+S WW+  +   +  + 
Sbjct:   494 CSDQATLP--ITYVVGLDDEGFMKDLGSRYDPTWLTSSRRRRVDSEWWEETMELYKSPDT 551

Query:   575 ESGATGDLNVESSAKD 590
             E G   D  +++   D
Sbjct:   552 ERGQKEDQEMQAKLLD 567

 Score = 59 (25.8 bits), Expect = 1.2e-36, Sum P(3) = 1.2e-36
 Identities = 20/56 (35%), Positives = 30/56 (53%)

Query:   352 TSSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLDPSSSMACSDISEACHPKEKS 407
             T SK +P  K SS K+   QS+KS    ++S G+  PSS+ A +        K+K+
Sbjct:   304 TKSKTTPS-KSSSEKA---QSEKSSPELKVSPGSKRPSSATAAAKEDRGGKRKKKT 355

 Score = 46 (21.3 bits), Expect = 3.3e-41, Sum P(3) = 3.3e-41
 Identities = 20/97 (20%), Positives = 43/97 (44%)

Query:     3 TRQDSKTQKDQASGKESTVRGALRDSESSHNETGTLAETSREGVGKFLRHVNARSSSRSK 62
             T+  SKTQK + +G E+  +  L++S+ +   +  + +   E   K+ +    ++     
Sbjct:    22 TKSGSKTQKAKENGMET--KKNLKNSKVASRRSRKVKDVLDEVTSKYFQDSEVKTEEPED 79

Query:    63 KQDCA----VGLTTSVLKVSGKQEVDKRVT--WSDVD 93
               D +    +   TS+ K   ++E D      W +V+
Sbjct:    80 LSDHSEERMIIEDTSLSKQVKEEEEDSEDEDDWEEVE 116

 Score = 37 (18.1 bits), Expect = 8.5e-06, Sum P(2) = 8.5e-06
 Identities = 13/49 (26%), Positives = 17/49 (34%)

Query:   587 SAKDSFVADRNSLEDMELETRALTEPLPTNQQAYKNHQLYVIERWLNKY 635
             S + S V      E+ E E     E     +Q  K  Q    + WL  Y
Sbjct:   424 SRRSSKVKQEERSEEEEEEEEEEEEEKEVKKQRRKKKQGKGADEWLEVY 472


>FB|FBgn0004698
            symbol:mus210 "mutagen-sensitive 210" species:7227
            "Drosophila melanogaster" [GO:0006289 "nucleotide-excision repair"
            evidence=ISS] [GO:0003684 "damaged DNA binding" evidence=ISS]
            [GO:0005634 "nucleus" evidence=IEA;NAS] InterPro:IPR004583
            InterPro:IPR018325 InterPro:IPR018326 InterPro:IPR018327
            InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403 Pfam:PF10404
            Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
            EMBL:AE013599 GO:GO:0005634 GO:GO:0003684 GO:GO:0006289 KO:K10838
            eggNOG:COG5535 PANTHER:PTHR12135 GeneTree:ENSGT00390000005194
            InterPro:IPR018026 TIGRFAMs:TIGR00605 EMBL:Z28622 EMBL:AF209743
            EMBL:AY070566 PIR:S42402 RefSeq:NP_476861.1 RefSeq:NP_725451.1
            UniGene:Dm.637 ProteinModelPortal:Q24595 SMR:Q24595 IntAct:Q24595
            STRING:Q24595 PaxDb:Q24595 PRIDE:Q24595 EnsemblMetazoa:FBtr0087374
            GeneID:36697 KEGG:dme:Dmel_CG8153 CTD:36697 FlyBase:FBgn0004698
            InParanoid:Q24595 OMA:KYLQSFV OrthoDB:EOG4547F1 GenomeRNAi:36697
            NextBio:799920 Bgee:Q24595 GermOnline:CG8153 Uniprot:Q24595
        Length = 1293

 Score = 405 (147.6 bits), Expect = 1.9e-39, Sum P(2) = 1.9e-39
 Identities = 107/320 (33%), Positives = 149/320 (46%)

Query:   529 LRYIVAFAGCGA-KDVTRRYCMKWYRIASKRVNSAWWDAVLAPLRELESGATGDLNVESS 587
             L Y+ AF    + KDVT RYC  W     K      W      L E  +   G       
Sbjct:   996 LAYVFAFQDDQSLKDVTARYCASWSTTVRKARVEKAW------LDETIAPYLG-----RR 1044

Query:   588 AKDSFVADRNSLEDMELETRALTEPLPTNQQAYKNHQLYVIERWLNKYQILYPK-GPILG 646
              K      R+  ED +L      +PLP +   +K+H LYV+ER L K+Q LYP   P LG
Sbjct:  1045 TK------RDITEDDQLRRIHSDKPLPKSISEFKDHPLYVLERHLLKFQGLYPPDAPTLG 1098

Query:   647 FCSGHAVYPRSCVQTLKTKERWLREALQVKANEXXXXXXXXXXXXXXGQDFEPEDYDEVD 706
             F  G AVY R CV  L ++E WL+ A  VK  E                    +D     
Sbjct:  1099 FIRGEAVYSRDCVHLLHSREIWLKSARVVKLGEQPYKVVKARPKWDRLTRTVIKDQP--- 1155

Query:   707 ARGNIELYGKWQLEPLRLPSAVNGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKR 766
                 +E++G WQ +    P+A NGIVPRN  G V+++ +  LP  TVHLRLP +  + K+
Sbjct:  1156 ----LEIFGYWQTQEYEPPTAENGIVPRNAYGNVELFKDCMLPKKTVHLRLPGLMRICKK 1211

Query:   767 LEIDSAPAMVGFEFRNGRSTPVFDGIVVCAEFKDTILEAYXXXXXXXXXXXXXXXXXQAT 826
             L ID A A+VGF+F  G   P++DG +VC EF++ +  A+                 +  
Sbjct:  1212 LNIDCANAVVGFDFHQGACHPMYDGFIVCEEFREVVTAAWEEDQQVQVLKEQEKYETRVY 1271

Query:   827 SRWYQLLSSIVTRQRLNNCY 846
               W +L+  ++ R+RL   Y
Sbjct:  1272 GNWKKLIKGLLIRERLKKKY 1291

 Score = 105 (42.0 bits), Expect = 1.9e-39, Sum P(2) = 1.9e-39
 Identities = 82/366 (22%), Positives = 141/366 (38%)

Query:   128 SDWEDGSIPVACSKENHPESDIKGV--TIEFDAADSVTKKPVRRASAEDKELAELVHKVH 185
             SD +DG  P   S +      ++G+  T E      +     RR + + K+   L+HKV 
Sbjct:   329 SDQDDGETP-NISGDLEIRVGLEGLRPTKEQKTQHELEMALKRRLNRDIKDRQILLHKVS 387

Query:   186 LLCLLARG----RLIDSVCDDPLIQAXXXXXXXXXXXKISEVSKLTANALSPIVSWFHD- 240
             L+C +AR     RL+     D L+QA                ++L    L   V+WF   
Sbjct:   388 LMCQIARSLKYNRLLSE--SDSLMQATLKLLPSRNAYPTERGTEL--KYLQSFVTWFKTS 443

Query:   241 ------NFHVRSSVSTRRSFHSDLAHALESREGT-PEEIAALSVALFRALKLTTRFVSIL 293
                   N +   S +T+ +    L   ++ +E    +++  + +AL R + +  R +  L
Sbjct:   444 IKLLSPNLYSAQSPATKEAILEALLEQVKRKEARCKQDMIFIFIALARGMGMHCRLIVNL 503

Query:   294 DVASLKPEADKNVSSNQDSSRVGGGIFNAPTLMVAKPEEVLASPVKSFSCDKKENV-CET 352
                 L+P A     S+    ++     N    + ++ E     P K    DKK     E 
Sbjct:   504 QPMPLRPAA-----SDLIPIKLRPDDKNKSQTVESERESEDEKPKK----DKKAGKPAEK 554

Query:   353 SSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLDPSSSMACS---DISEACHPKEKSQA 409
              S  S   K +  K+N  +++  P+S   + G+    S        ++S +    EKS+ 
Sbjct:   555 ESSKSTISKEAEKKNNAKKAEAKPLSKSTTKGSETTKSGTVPKVKKELSLSSKLVEKSKH 614

Query:   410 LKR----KGDLEFEMQLEMALSATNVATSKSNICSDVKDLNS--NSSTVLPVKRLKKIES 463
              K     K D  F+ +   + S+  +    S +    K L    +S  VL  K      S
Sbjct:   615 QKAYTSSKSDTSFDEKPSTSSSSKCLKEEYSELGLSKKLLKPTLSSKLVLKSKNQSSFSS 674

Query:   464 GESSTS 469
              +S TS
Sbjct:   675 NKSDTS 680

 Score = 45 (20.9 bits), Expect = 2.8e-32, Sum P(3) = 2.8e-32
 Identities = 18/85 (21%), Positives = 40/85 (47%)

Query:     2 RTRQDSKTQKDQASGK----ESTVRGALRDSESSHNETGTLAETSREGVGKFLRHVNARS 57
             R  +D K +KD+ +GK    ES+     +++E  +N     A+   +   K      + +
Sbjct:   535 RESEDEKPKKDKKAGKPAEKESSKSTISKEAEKKNNAKKAEAKPLSKSTTKGSETTKSGT 594

Query:    58 SSRSKKQDCAVGLTTSVLKVSGKQE 82
               + KK+   + L++ +++ S  Q+
Sbjct:   595 VPKVKKE---LSLSSKLVEKSKHQK 616

 Score = 38 (18.4 bits), Expect = 2.8e-32, Sum P(3) = 2.8e-32
 Identities = 13/50 (26%), Positives = 23/50 (46%)

Query:   340 SFSCDKKENVCETSSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLDPS 389
             S S   KE   + SS    + K +SP    T+ + S +   +++ N+  S
Sbjct:   689 SSSKSLKEETAKLSSSKLEDKKVASPAETKTKVQSSLLK-RVTTQNISES 737

 Score = 37 (18.1 bits), Expect = 3.6e-32, Sum P(3) = 3.6e-32
 Identities = 15/68 (22%), Positives = 28/68 (41%)

Query:   345 KKENVCETSSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLDPSSSMACSDISEACHPK 404
             K +N    SS  S      +P ++++       + +LSS  L+     + ++       K
Sbjct:   665 KSKNQSSFSSNKSDTSFEENPSTSSSSKSLKEETAKLSSSKLEDKKVASPAETKT----K 720

Query:   405 EKSQALKR 412
              +S  LKR
Sbjct:   721 VQSSLLKR 728


>ASPGD|ASPL0000010029
            symbol:AN3890 species:162425 "Emericella nidulans"
            [GO:0003684 "damaged DNA binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0005819 "spindle" evidence=IEA]
            [GO:0006298 "mismatch repair" evidence=IEA] [GO:0006289
            "nucleotide-excision repair" evidence=IEA] InterPro:IPR004583
            InterPro:IPR018325 InterPro:IPR018326 InterPro:IPR018327
            InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403 Pfam:PF10404
            Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
            GO:GO:0005634 GO:GO:0003684 EMBL:BN001302 GO:GO:0006289
            EMBL:AACD01000062 KO:K10838 eggNOG:COG5535 PANTHER:PTHR12135
            OMA:FKGRHGT OrthoDB:EOG4Z0FG0 RefSeq:XP_661494.1
            ProteinModelPortal:Q5B6E0 STRING:Q5B6E0
            EnsemblFungi:CADANIAT00004811 GeneID:2873313 KEGG:ani:AN3890.2
            HOGENOM:HOG000182868 Uniprot:Q5B6E0
        Length = 951

 Score = 328 (120.5 bits), Expect = 4.8e-29, Sum P(2) = 4.8e-29
 Identities = 115/424 (27%), Positives = 178/424 (41%)

Query:   438 ICSDVKDLNSNSSTVLPVKRLKKIESGESSTSCLGI--STAVGSRKVGA----PLYWAEV 491
             I SD  D  ++ ST    K       G       G+  +T + SR   +    P++W E 
Sbjct:   314 ISSDDPDSLTDGSTKSEAKPAPIRRIGRPGFKPTGVQNTTVLSSRPTRSESSYPVFWVEA 373

Query:   492 YCSGENLTGKWVHVDA-ANAIIDGEQKVEAAAAACKTSLRYIVAFA-GCGAKDVTRRYCM 549
             +        KWV +D      +    K+E  A      L Y+VAF     A+DVTRRY  
Sbjct:   374 F---NEAFQKWVVIDPMVTKTLAKPHKLEPPATDPYNLLSYVVAFEEDASARDVTRRYT- 429

Query:   550 KWYRIASKRVNSAWWDAVLAPLRELESGATGDL---NVESSAKDSFVADRNSLEDMELET 606
                     RV    ++A    LR +ES   G+     V    +  F+ DR+ LE  EL  
Sbjct:   430 --------RV----FNAKTRKLR-VESTKNGEAWWKRVLEHFEKPFLEDRDELEIAELTA 476

Query:   607 RALTEPLPTNQQAYKNHQLYVIERWLNKYQILYPK---GPI-LGFCSGHA----VYPRSC 658
             +  +EP+P N Q +K+H +Y +ER L + ++++PK   G + LG   G      +Y RS 
Sbjct:   477 KTASEPMPRNVQDFKDHPIYALERHLRRNEVIFPKRVTGHVSLGKSGGKGQTEPIYRRSD 536

Query:   659 VQTLKTKERWLREALQVKANEXXXXXXXXXXXXXXGQDFEPEDYDEVDARGNIELYGKWQ 718
             V  L++  +W R    +K  E              G   + E+  E  A     LY  +Q
Sbjct:   537 VHILRSANKWYRLGRDIKVGEQPLKRIPVRNR---GMAVDDEEEGEETA-----LYAFFQ 588

Query:   719 LEPLRLPSAVNGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKRLEIDSAPAMVGF 778
              E  + P  V G +P+N  G +DV+    +P G +H+        A+ L ID A A+ GF
Sbjct:   589 TELYKPPPVVQGRIPKNAFGNLDVYVPSMVPAGGIHITHLDAARAARILGIDYADAVTGF 648

Query:   779 EFRNGRSTPVFDGIVVCAEFKDTILEAYXXXXXXXXXXXXXXXXXQATSRWYQLLSSIVT 838
              F+    T +  G+VV +E+K+ + E                   +    W  LL  +  
Sbjct:   649 SFKGRHGTAIIKGVVVASEYKEAVEEVLKALEEEKLQNEQEERAVEVLRAWKNLLMKLRI 708

Query:   839 RQRL 842
              +R+
Sbjct:   709 AERV 712

 Score = 82 (33.9 bits), Expect = 4.8e-29, Sum P(2) = 4.8e-29
 Identities = 44/188 (23%), Positives = 75/188 (39%)

Query:    22 RGALRDSESSHNETGTLAETSREGVGKFLRHVNARSSSRSKKQDCAVGLTTSVLKVSGKQ 81
             RG  R   S   E   + E  RE     L    A+  S+S+ +  A     +  +    Q
Sbjct:    13 RGTPRSRRSKQAED-EIPEVYRE----MLAEAEAQEISQSENERPAKRFKPAGYRARTAQ 67

Query:    82 EVDKRVTWSDVDAHGCSRDAMGNTLRELDEGRLQDNVLDGGEEMYDSDWEDGSIPV-ACS 140
                 +V   D +      DA+        + ++  N     +E  D +WE+  I     S
Sbjct:    68 AFKAQVLQQDTNPMDAEEDAV-------KQPQIVYNSPSESDES-DMEWEEVDIQQPTIS 119

Query:   141 KENHPESDIKGVTIEFDAADSVTKKPVRR--ASAEDKELAELVHKVHLLCLLARGRLIDS 198
                   +D   + I  +   +  ++ VRR   +A +K+L   VHK+HLLCL+   +  + 
Sbjct:   120 GPTSSVTDEAPLQITLEQDHNRKRRVVRRKPVTAAEKKLRLDVHKMHLLCLMCHVQRRNL 179

Query:   199 VCDDPLIQ 206
              C+D  +Q
Sbjct:   180 WCNDEEVQ 187


>WB|WBGene00022296
            symbol:xpc-1 species:6239 "Caenorhabditis elegans"
            [GO:0003684 "damaged DNA binding" evidence=IEA] [GO:0005634
            "nucleus" evidence=IEA] [GO:0006289 "nucleotide-excision repair"
            evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA]
            InterPro:IPR004583 InterPro:IPR018325 InterPro:IPR018326
            InterPro:IPR018327 InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403
            Pfam:PF10404 Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
            GO:GO:0005634 GO:GO:0003684 GO:GO:0006289 EMBL:FO081666 KO:K10838
            eggNOG:COG5535 PANTHER:PTHR12135 GeneTree:ENSGT00390000005194
            RefSeq:NP_500156.2 ProteinModelPortal:Q9N4C3 IntAct:Q9N4C3
            MINT:MINT-228757 STRING:Q9N4C3 PaxDb:Q9N4C3
            EnsemblMetazoa:Y76B12C.2 GeneID:177002 KEGG:cel:CELE_Y76B12C.2
            UCSC:Y76B12C.2 CTD:177002 WormBase:Y76B12C.2 InParanoid:Q9N4C3
            OMA:YLRQEIN NextBio:894928 Uniprot:Q9N4C3
        Length = 1119

 Score = 283 (104.7 bits), Expect = 4.0e-25, Sum P(4) = 4.0e-25
 Identities = 72/214 (33%), Positives = 106/214 (49%)

Query:   594 ADRNSLEDMELETRALTEPLPTNQQAYKNHQLYVIERWLNKYQILYPKGPI---LGFCSG 650
             ++R   E M++    +  PLPT    YKNH LY +E+ L K++ +YP       LG   G
Sbjct:   812 SERKKWEMMQMREDLVKRPLPTVMSEYKNHPLYALEKDLLKFEAIYPPPATQKPLGQIRG 871

Query:   651 HAVYPRSCVQTLKTKERWLREALQVKANEXXXXXXXXXXXXXXGQDFEPEDYDEVDARGN 710
             H VYPRS V TL+ +  WL+ A  VK  E                   P+    V+ R +
Sbjct:   872 HNVYPRSTVFTLQGENNWLKLARSVKIGEKPYKIVKA----------RPDPRIPVEDRED 921

Query:   711 --IELYGKWQLEPLRLPSAVNGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKRLE 768
               +++YG WQ E  R P   NG +P NE G V +++E   P    +L+L  +  ++++L 
Sbjct:   922 KFLDVYGYWQTEKYRRPPLKNGKIPHNEYGNVYMFNENMCPLDCTYLKLSGLVQISRKLG 981

Query:   769 IDSAPAMVGFEFRNGRSTPVFDGIVVCAEFKDTI 802
                 PA+VG+ F  G + PV DG +V    KD I
Sbjct:   982 KQCIPAVVGWAFDGGFTHPVIDGAIVLE--KDAI 1013

 Score = 89 (36.4 bits), Expect = 4.0e-25, Sum P(4) = 4.0e-25
 Identities = 26/103 (25%), Positives = 43/103 (41%)

Query:   175 KELAELVHKVHLLCLLARGRLIDSVC-DDPLIQAXXXXXXXXXXXKISEVSKLTANALSP 233
             +E+ E  HKVHLLC +A  + +  +  D+ L+ +           K      +  + +  
Sbjct:   517 REMWENTHKVHLLCFMAHLKFVVKIALDESLVPSLMMSQLPNGYLKFIGEPVVPIDIMKN 576

Query:   234 IVSWFHDNFHVRSSVSTRRSFHSD-LAHALESREGTPEEIAAL 275
             +V WF D F   + V +  S   D L    E+R      + AL
Sbjct:   577 LVKWFADAFRPLNGVVSVASIEQDSLLEGHEARYPETRRLTAL 619

 Score = 61 (26.5 bits), Expect = 1.0e-21, Sum P(2) = 1.0e-21
 Identities = 31/141 (21%), Positives = 60/141 (42%)

Query:   329 KPEEVLASPVKSFSCDKKENVCETSSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLDP 388
             K E ++ S  KS +   K  + E      PE +      N  +S KS    + S+ N   
Sbjct:   150 KSENLVQSVPKSTTNGSKVAIIEDD----PEIR----AENGVKSSKSDEKPDFSAQN--- 198

Query:   389 SSSMACSDISEACHPKEKSQALKRKGDLEFEMQLEMALSATNVATSKSNICSDVKDLNSN 448
              S +A +  +    P+      K+   +  + QLE++ S++ + +S  +   D  ++   
Sbjct:   199 GSKLAQNAPNRISRPRRSVTTAKKVSYVPSDDQLELSSSSSELESSSED--EDT-EIRPK 255

Query:   449 SSTVLPVKRLKKIESGESSTS 469
             + + +  KR K  +  ES +S
Sbjct:   256 TGSKIAKKREKSFKISESESS 276

 Score = 58 (25.5 bits), Expect = 4.0e-25, Sum P(4) = 4.0e-25
 Identities = 12/40 (30%), Positives = 20/40 (50%)

Query:   335 ASPVKS-FSCDKKENVCETSSKGSPECKYSSPKSNNTQSK 373
             ASP+   F+ D K+ +CE S + + +C     +   T  K
Sbjct:   758 ASPISYVFAIDNKQGICEVSQRYAMDCVKQDFRRRRTNPK 797

 Score = 50 (22.7 bits), Expect = 1.4e-20, Sum P(2) = 1.4e-20
 Identities = 32/117 (27%), Positives = 52/117 (44%)

Query:   295 VASLKPEADKNVSSNQDSSRVGGGIFNAPTLMVAKPEEVLASPVK-SF--SCDKKE---N 348
             V S K +   + S+ Q+ S++     NAP   +++P   + +  K S+  S D+ E   +
Sbjct:   183 VKSSKSDEKPDFSA-QNGSKLAQ---NAPN-RISRPRRSVTTAKKVSYVPSDDQLELSSS 237

Query:   349 VCETSSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLDPSSSMACSDISEACHPKE 405
               E  S    E     PK+ +  +KK   S ++S      SSS +  D SEA    E
Sbjct:   238 SSELESSSEDEDTEIRPKTGSKIAKKREKSFKISESE---SSSESPDDESEASEASE 291

 Score = 37 (18.1 bits), Expect = 4.0e-25, Sum P(4) = 4.0e-25
 Identities = 8/27 (29%), Positives = 15/27 (55%)

Query:     8 KTQKDQASGKESTVRGALRDSESSHNE 34
             K+QK+    +++  +    DS SS +E
Sbjct:   440 KSQKNVKKSEKNDEKNTAGDSSSSEDE 466


>DICTYBASE|DDB_G0292296
            symbol:xpc "DNA repair protein Rad4 family protein"
            species:44689 "Dictyostelium discoideum" [GO:0006289
            "nucleotide-excision repair" evidence=IEA] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0003684 "damaged DNA binding" evidence=IEA]
            [GO:0003677 "DNA binding" evidence=IEA] [GO:0044351
            "macropinocytosis" evidence=RCA] InterPro:IPR004583
            InterPro:IPR018325 InterPro:IPR018326 InterPro:IPR018327
            InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403 Pfam:PF10404
            Pfam:PF10405 SMART:SM01031 SMART:SM01032 dictyBase:DDB_G0292296
            GO:GO:0005634 GenomeReviews:CM000155_GR GO:GO:0003684
            EMBL:AAFI02000189 GO:GO:0006289 KO:K10838 eggNOG:COG5535
            PANTHER:PTHR12135 RefSeq:XP_001134493.1 ProteinModelPortal:Q1ZXA6
            EnsemblProtists:DDB0232368 GeneID:8628599 KEGG:ddi:DDB_G0292296
            InParanoid:Q1ZXA6 OMA:VELFYMV Uniprot:Q1ZXA6
        Length = 967

 Score = 304 (112.1 bits), Expect = 2.9e-23, Sum P(2) = 2.9e-23
 Identities = 127/546 (23%), Positives = 233/546 (42%)

Query:   331 EEVLASPVKSFSCDKKENVCETSSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLDPSS 390
             E +++ P+ S    +++++     K +      S K+  T SKK   +  LSS N   ++
Sbjct:   459 ELIISKPITS----RQKSIQANQFKNTVLNSKISKKTETTMSKKRKTNSSLSSKNKKKNN 514

Query:   391 SMACSDISEACHPKEKSQALKRKGDLEFEMQLEMALSATNVATSKSNICSDVKDLNSNSS 450
             S + +D          +     K + + + + + + S ++   SK       K L  +SS
Sbjct:   515 SDSENDTDNERDSGSDNDDAGDKNNNKSDQEKDNSSSDSDYKDSK-------KKLKRSSS 567

Query:   451 TVLPVKRLKKIESGESSTSCLGISTAVGSRKVGAPLYWAEVYCSGENLTGKWVHVDAANA 510
               +   RL  ++  ES T+    +  + + +      W EV+   ++   KW+ +D  N 
Sbjct:   568 EPIKRSRLSNLDDKESKTTTTTTTNTLSNNEKVEIESWIEVF---DHEKKKWISIDLINK 624

Query:   511 IIDGEQKVEAAAAACKTSLRYIVAFAGCGAKDVTRRYCMKWYRIASKRVNSA---WW--- 564
              ID     E           Y+VA +    KDVT RY   +   + KR+  A   WW   
Sbjct:   625 EIDKPLNFEKIL----DPFSYVVAISKYQIKDVTSRYTNNYIGSSLKRLPIAQIKWWLQL 680

Query:   565 --DAVLAPLRE-----------LESGATGDLNVESSAKDSFVADRNSLEDMEL-ETRALT 610
               DA+  P              L+S     +N++     S + +R S+E++++ E + L 
Sbjct:   681 VGDAINNPTEVENDNEPVSKFILDSKKIISVNIDLLNNLS-IDERKSIEEIDVYEKQELI 739

Query:   611 --E---PLPTNQQAYKNHQLYVIERWLNKYQILYPKGPILG-FCSGHAVYPRSCVQTLKT 664
               E   P P++   +K+H ++V+E+ + KY    P    LG F   H +Y +  ++ L T
Sbjct:   740 IKESKLPFPSSFAQFKSHPIFVLEKDIAKYCSPDPSSKPLGLFNETHKIYHKDQIKVLHT 799

Query:   665 KERWLREALQVKANEXXXXXXXXXXXXXXGQDFEPEDYDEVDARGNIE--LYGKWQLEPL 722
              ++W++    V                  GQ  +P    +  ++ N    L+G+WQ + L
Sbjct:   800 SDKWVQNGRMV----------------IEGQ--QPLKIVKGRSKSNPTSMLFGEWQTK-L 840

Query:   723 RLPSAV--NGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKRLEIDSAPAMVGFEF 780
               P+ +  +GIVP N  G V +++    P   VHLR   +  VAK+L I+ APA+ G+E 
Sbjct:   841 FEPAVIGKDGIVPTNSFGNVYLFNSSMCPINGVHLRGKGLIRVAKKLGINFAPALTGWEN 900

Query:   781 RNGRSTPVFDGIVVCAEFKDTILEAYXXXXXXXXXXXXXXXXXQATSRWYQLLSSIVTRQ 840
                 S P+ DG+VV  +F   +L+ +                 +  +RW + +  ++ + 
Sbjct:   901 GPKSSYPIIDGVVVAKKFSKKLLDTWLSESSSRAEAELQKKNDEIKARWKRFMKKLLIKN 960

Query:   841 RLNNCY 846
              +   Y
Sbjct:   961 YIEKTY 966

 Score = 52 (23.4 bits), Expect = 2.9e-23, Sum P(2) = 2.9e-23
 Identities = 23/83 (27%), Positives = 41/83 (49%)

Query:   110 DEGRLQDNVLDGGEEMYDSDWEDGSIPVACSKENHPESDIKGVTIEFDAADSVTKKP-VR 168
             +EG + +N LD  EE+ ++  + G        E+  E +I   T EF + ++  KK  V+
Sbjct:    46 EEGDI-NNSLDTDEEIGENQDDAGDA------EDAIEFEID--TNEFKSKENGKKKRIVK 96

Query:   169 RASAEDKELAELVHKVHLLCLLA 191
             +   ++K     +H+  L C LA
Sbjct:    97 KVDLKEKHNCLYLHRTVLTCYLA 119

 Score = 37 (18.1 bits), Expect = 1.1e-21, Sum P(2) = 1.1e-21
 Identities = 14/59 (23%), Positives = 23/59 (38%)

Query:   127 DSDWE----DGSIPVACSKEN--HPESDIKGVTIEFDAADSVTKKPVRRASAEDKELAE 179
             D +WE    D S     +      P  D + +  EFD  D   +  +  +   D+E+ E
Sbjct:     5 DIEWEESNNDNSTTTTTTTTTTASPRFD-ESINNEFDDEDKEEEGDINNSLDTDEEIGE 62


>POMBASE|SPAC12B10.12c
            symbol:rhp41 "DNA repair protein Rhp41" species:4896
            "Schizosaccharomyces pombe" [GO:0000109 "nucleotide-excision repair
            complex" evidence=ISO] [GO:0003684 "damaged DNA binding"
            evidence=IEA] [GO:0005634 "nucleus" evidence=IDA] [GO:0005819
            "spindle" evidence=IDA] [GO:0006289 "nucleotide-excision repair"
            evidence=IGI] [GO:0006298 "mismatch repair" evidence=IGI]
            InterPro:IPR004583 InterPro:IPR018325 InterPro:IPR018326
            InterPro:IPR018327 InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403
            Pfam:PF10404 Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
            PomBase:SPAC12B10.12c EMBL:CU329670 GenomeReviews:CU329670_GR
            GO:GO:0005819 GO:GO:0003684 GO:GO:0006298 GO:GO:0006289
            GO:GO:0000109 KO:K10838 eggNOG:COG5535 PANTHER:PTHR12135
            OrthoDB:EOG4Z0FG0 PIR:T37579 RefSeq:NP_594644.1
            ProteinModelPortal:Q10445 STRING:Q10445
            EnsemblFungi:SPAC12B10.12c.1 GeneID:2542967 KEGG:spo:SPAC12B10.12c
            OMA:NEASSHE NextBio:20804002 InterPro:IPR018026 TIGRFAMs:TIGR00605
            Uniprot:Q10445
        Length = 638

 Score = 286 (105.7 bits), Expect = 3.6e-23, Sum P(2) = 3.6e-23
 Identities = 118/410 (28%), Positives = 175/410 (42%)

Query:   456 KRLKKIESGESSTSCLGISTAVGSR---KV---GAPLYWAEVYCSGENLTGKWVHVDA-A 508
             KR K I+   S+ S L  S  V      KV     P++W E +        KWV VD   
Sbjct:   267 KRRKIIQPSFSNLSHLDASDIVTEDTKLKVIDSPKPVFWVEAF---NKAMQKWVCVDPFG 323

Query:   509 NAIIDGE-QKVEAAAAACKTSLRYIVAFAGCG-AKDVTRRYCMKWYRIASKRVN-----S 561
             +A + G+ ++ E A++     + Y+ A    G  KDVTR+YC+ +Y+I   RV       
Sbjct:   324 DASVIGKYRRFEPASSDHLNQMTYVFAIEANGYVKDVTRKYCLHYYKILKNRVEIFPFGK 383

Query:   562 AWWDAVLAPLRELESGATGDLNVESSAKDSFVADRNSLEDMELETRALTEPLPTNQQAYK 621
             AW + + + +     G   D          F  D +++ED EL     +E +P N Q  K
Sbjct:   384 AWMNRIFSKI-----GKPRD----------FYNDMDAIEDAELLRLEQSEGIPRNIQDLK 428

Query:   622 NHQLYVIERWLNKYQILYPKGPILGFCS---G-HAVYPRSCVQTLKTKERWLREALQVKA 677
             +H L+V+ER L K Q +   G   G  +   G   VYPR  V    + E W R+   +K 
Sbjct:   429 DHPLFVLERHLKKNQAI-KTGKSCGRINTKNGVELVYPRKYVSNGFSAEHWYRKGRIIKP 487

Query:   678 NEXXXXXXXXXXXXXXGQDFEPEDYDEVDARGNIELYGKWQLEPLRLPSAVNGIVPRNER 737
                             G    P  YDE +A    +LY     +P+     V  IVP+N  
Sbjct:   488 G------AQPLKHVKNGDKVLPL-YDE-EAT---QLYTP---KPV-----VANIVPKNAY 528

Query:   738 GQVDVWSEKCLPPGTVHLRLPRVYSVAKRLEIDSAPAMVGFEFRNGRSTPVFDGIVVCAE 797
             G +D++    LP G  H R     + AK LEID A A+VGF+F+   S P  +G+VV   
Sbjct:   529 GNIDLYVPSMLPYGAYHCRKRCALAAAKFLEIDYAKAVVGFDFQRKYSKPKLEGVVVSKR 588

Query:   798 FKDTILEAYXXXXXXXXXXXXXXXXXQATSRWYQLLSSIVTRQRLNNCYG 847
             +++ I                          W +L++ +  RQR+   YG
Sbjct:   589 YEEAIDLIAEEIDQEEKEAEARNVRKTCLLLWKRLITGLRIRQRVFEEYG 638

 Score = 63 (27.2 bits), Expect = 3.6e-23, Sum P(2) = 3.6e-23
 Identities = 16/61 (26%), Positives = 30/61 (49%)

Query:   142 ENHPESDIKGVTIEFDAADSVTKKPVRRASAEDKELAELVHKVHLLCLLARGRLIDSVCD 201
             +  P  D   V    D   +V K+   + ++ D+++   +H++HLLCL       ++ CD
Sbjct:    49 QERPTHDFGDVEATVDR--TVEKRSRLKITSVDRKIRLQIHQLHLLCLTYHLCTRNTWCD 106

Query:   202 D 202
             D
Sbjct:   107 D 107


>ASPGD|ASPL0000008254
            symbol:AN6186 species:162425 "Emericella nidulans"
            [GO:0003684 "damaged DNA binding" evidence=IEA] [GO:0006298
            "mismatch repair" evidence=IEA] [GO:0006289 "nucleotide-excision
            repair" evidence=IEA] [GO:0005730 "nucleolus" evidence=IEA]
            InterPro:IPR004583 InterPro:IPR018325 InterPro:IPR018326
            InterPro:IPR018327 InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403
            Pfam:PF10404 Pfam:PF10405 SMART:SM01031 SMART:SM01032 GO:GO:0005634
            GO:GO:0003684 EMBL:BN001301 GO:GO:0006289 EMBL:AACD01000105
            eggNOG:COG5535 PANTHER:PTHR12135 OrthoDB:EOG4DJP4K
            RefSeq:XP_663790.1 EnsemblFungi:CADANIAT00006823 GeneID:2871078
            KEGG:ani:AN6186.2 HOGENOM:HOG000164138 OMA:IPKNEYG Uniprot:Q5AZU4
        Length = 941

 Score = 198 (74.8 bits), Expect = 2.5e-19, Sum P(4) = 2.5e-19
 Identities = 51/194 (26%), Positives = 84/194 (43%)

Query:   653 VYPRSCVQTLKTKERWLREALQVKANEXXXXXXXXXXXXXXGQDFEPEDYDEVDARGNIE 712
             VY RS V   +T E W +E  +   +                +    E+      +    
Sbjct:   582 VYRRSDVVKCQTAESWHKEGREPLPSAKPLKHVPIRAVTLLRKREVDEEARRTGQKPLQG 641

Query:   713 LYGKWQLEPLRLPSAVNGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKRLEIDSA 772
             LY   Q + +  P  V+GI+P+NE G +D +  + +P G VH+       + K+L ID A
Sbjct:   642 LYSFEQTQEIIPPPIVDGIIPKNEYGNIDCFVPRMVPKGAVHIPFSGTARICKKLGIDYA 701

Query:   773 PAMVGFEFRNGRSTPVFDGIVVCAEFKDTILEAYXXXXXXXXXXXXXXXXXQATSRWYQL 832
              A+ GFEF +  + PV +G+VV AE KD +++A+                 +  + W + 
Sbjct:   702 EAVTGFEFGSQMAVPVIEGVVVAAENKDLVVDAWRADNEEKRRKEARKAEAKILATWRKF 761

Query:   833 LSSIVTRQRLNNCY 846
             L  +   QR+   Y
Sbjct:   762 LFGLRIAQRVQEEY 775

 Score = 95 (38.5 bits), Expect = 2.5e-19, Sum P(4) = 2.5e-19
 Identities = 54/192 (28%), Positives = 82/192 (42%)

Query:   485 PLYWAEVYCSGENLTGKWVHVDA---ANAIIDGEQKVEAA-------AAACKTSLRYIVA 534
             P+YW EV      +T + + VD    +NA+    Q+++AA       A   K  + Y++A
Sbjct:   384 PIYWTEVVSP---ITHQVISVDPLVLSNAVA-ATQELQAAFEPRGAKAEKAKQVICYVIA 439

Query:   535 F-AGCGAKDVTRRYCMK--W------YRIASKRVNSAWWDAVLAPLRELESGATGDLNVE 585
             F A   AKDVT RY  +  W      +R+  K  +    D     LR          N E
Sbjct:   440 FSADKTAKDVTTRYLRRRTWPGKTKGFRLGKKGPDDDLLDWFRVLLR----------NYE 489

Query:   586 SSAKDSFVADRNSLEDM-ELETRALTEPLPTNQ-----QAYKNHQLYVIERWLNKYQILY 639
                KD    D   +ED  +L     T+  PTN+     Q+ +    +V+ER+L + + L 
Sbjct:   490 RPYKDRTAVD--DIEDAKDLVPNRPTKSKPTNETVDTLQSLRTSSEFVLERFLRREEALR 547

Query:   640 PKG-PILGFCSG 650
             P   P+  F  G
Sbjct:   548 PGALPVRTFTPG 559

 Score = 67 (28.6 bits), Expect = 2.5e-19, Sum P(4) = 2.5e-19
 Identities = 22/97 (22%), Positives = 50/97 (51%)

Query:   110 DEGRLQDNVLDGGEEMYDSDWEDGSIPVACSKENH--PESDIKGVTIEFDAADSVTKKPV 167
             D+  + D+ +   EE+   DWED +I  A    +   P  +++ +T++ +          
Sbjct:    58 DKKVVSDSDVTDSEEV---DWED-AIHTAAPATSFVSPHENLE-LTLDRNEVHLEDILQG 112

Query:   168 RRASAE-DKELAELVHKVHLLCLLARGRLIDSVCDDP 203
             ++A  + ++++  L+H++H+ CLLA   + +   +DP
Sbjct:   113 QKAPTKIERQIRILIHRLHVQCLLAHNAIRNDWINDP 149

 Score = 52 (23.4 bits), Expect = 2.5e-19, Sum P(4) = 2.5e-19
 Identities = 16/59 (27%), Positives = 26/59 (44%)

Query:   235 VSWFHDNFHVRSSVSTRRSFHSDLAHALESREGTPEEIAALSVALFRALKLTTRFVSIL 293
             ++ FH + H       +     +   A E  EG+ +  A L  AL RA+ +  R V+ L
Sbjct:   263 IASFHKDKHDPELYGEKIPSVEEFRQAAERMEGSRDLGAQLFTALLRAIAIEARLVASL 321

 Score = 49 (22.3 bits), Expect = 1.1e-14, Sum P(4) = 1.1e-14
 Identities = 15/53 (28%), Positives = 24/53 (45%)

Query:   331 EEVL---ASPVKSFSCDKKENVCETSSKGSPECKYSSPKSNNTQSKKSPVSCE 380
             EE L   A PV++F+   K+     +   +P     SPK+ N   +   V C+
Sbjct:   543 EEALRPGALPVRTFTPGGKKKNANGNGASTPT---ESPKAENVYRRSDVVKCQ 592


>POMBASE|SPCC4G3.10c
            symbol:rhp42 "DNA repair protein Rhp42" species:4896
            "Schizosaccharomyces pombe" [GO:0000109 "nucleotide-excision repair
            complex" evidence=ISO] [GO:0003684 "damaged DNA binding"
            evidence=ISO] [GO:0005730 "nucleolus" evidence=IDA] [GO:0006289
            "nucleotide-excision repair" evidence=IGI] [GO:0006298 "mismatch
            repair" evidence=IGI] InterPro:IPR004583 InterPro:IPR018325
            InterPro:IPR018326 InterPro:IPR018327 InterPro:IPR018328
            Pfam:PF03835 Pfam:PF10403 Pfam:PF10404 Pfam:PF10405 SMART:SM01030
            SMART:SM01031 SMART:SM01032 PomBase:SPCC4G3.10c GO:GO:0005730
            EMBL:CU329672 GenomeReviews:CU329672_GR GO:GO:0003684 GO:GO:0006298
            GO:GO:0006289 GO:GO:0000109 KO:K10838 eggNOG:COG5535
            PANTHER:PTHR12135 InterPro:IPR018026 TIGRFAMs:TIGR00605 PIR:T41366
            RefSeq:NP_587828.1 ProteinModelPortal:P87235 STRING:P87235
            EnsemblFungi:SPCC4G3.10c.1 GeneID:2539465 KEGG:spo:SPCC4G3.10c
            OMA:YPESETE OrthoDB:EOG4DJP4K NextBio:20800627 Uniprot:P87235
        Length = 686

 Score = 251 (93.4 bits), Expect = 2.9e-19, Sum P(2) = 2.9e-19
 Identities = 101/380 (26%), Positives = 157/380 (41%)

Query:   485 PLYWAEVYCSGENLTGKWVHVDAA--NAIIDGEQK-VEAAAAACKTS-LRY--IVAFAG- 537
             P++W E+Y   E    KW+ VDA   N +   +    E   A  ++  LR   + A+   
Sbjct:   323 PIFWTEIYDQSEK---KWIAVDAVVLNGVYTNDMTWFEPKGAYAESKHLRMGIVAAYDND 379

Query:   538 CGAKDVTRRYCMKWYRIASKRVNSAWWDAVLAPLRELESGATGDLNVESSAKDSFVADRN 597
               AKDVT RY    Y+  S R+      +      +      G L   +  KD+     +
Sbjct:   380 LYAKDVTLRYTD--YQ--SSRLKKIRHVSFADKYFDFYKAIFGQLAKRN--KDA----ED 429

Query:   598 SLEDMELETRALTEPLPTNQQAYKNHQLYVIERWLNKYQILYPKG-PI--LGFCSG---- 650
               E+ ELE++      P +   +KNH  +V+ R L + + L P   P+    F +G    
Sbjct:   430 IYEEKELESKVPIRE-PKSFADFKNHPEFVLIRHLRREEALLPNAKPVKTATFGNGKKAT 488

Query:   651 -HAVYPRSCVQTLKTKERWLREALQVKANEXXXXXXXXXXXXXXGQ---DFEPEDYDEVD 706
                VY R  V   KT E + +E   +K  E               +   +F   + +E  
Sbjct:   489 SEEVYLRKDVVICKTPENYHKEGRVIKEGEQPRKMVKARAVTISRKREHEFRVAETNEPV 548

Query:   707 ARGNIELYGKWQLEPLRLPSAVNGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKR 766
              +G   LY   Q E    P   +GI+P+N  G +D + E  +P G  HL    +  +AK+
Sbjct:   549 LQG---LYSSDQTELYVPPPIKDGIIPKNGYGNMDCFVESMIPKGAAHLPYRGIAKIAKK 605

Query:   767 LEIDSAPAMVGFEFRNGRSTPVFDGIVVCAEFKDTILEAYXXXXXXXXXXXXXXXXXQAT 826
             L ID A A+ GFEFR  R+ PV  GI+V  E    + E +                    
Sbjct:   606 LNIDYADAVTGFEFRKHRAIPVTTGIIVPEESAQMVYEEFLECEKIRIEKQQMKERKIIY 665

Query:   827 SRWYQLLSSIVTRQRLNNCY 846
              +W  LL+++  R+R+   Y
Sbjct:   666 GQWKHLLNALRIRKRIEEQY 685

 Score = 64 (27.6 bits), Expect = 2.9e-19, Sum P(2) = 2.9e-19
 Identities = 25/90 (27%), Positives = 43/90 (47%)

Query:   110 DEGRLQDNVLDGGEE--MYDSD---WEDGSIPVACSKENHPESDIKGVTIEFDAADSVTK 164
             ++G  +DN   G  E   +D D   WE   + ++ +K+   + D+  VT        +TK
Sbjct:    81 EKGSDEDNEKLGSSEDDEFDDDFDTWEQ--VDLSPNKQED-KKDLHIVTQHI--TPQLTK 135

Query:   165 KPVR-RASAEDKELAELVHKVHLLCLLARG 193
             +  +  +SA DK +   +H +H  CLL  G
Sbjct:   136 ESKKGSSSAMDKSIRLSIHIMHFTCLLYHG 165


>CGD|CAL0004788
            symbol:orf19.6722 species:5476 "Candida albicans" [GO:0000111
            "nucleotide-excision repair factor 2 complex" evidence=IEA]
            [GO:0005829 "cytosol" evidence=IEA] [GO:0005819 "spindle"
            evidence=IEA] [GO:0003684 "damaged DNA binding" evidence=IEA]
            [GO:0043161 "proteasomal ubiquitin-dependent protein catabolic
            process" evidence=IEA] [GO:0006298 "mismatch repair" evidence=IEA]
            [GO:0006289 "nucleotide-excision repair" evidence=IEA]
            InterPro:IPR004583 InterPro:IPR018325 InterPro:IPR018326
            InterPro:IPR018327 InterPro:IPR018328 Pfam:PF03835 Pfam:PF10403
            Pfam:PF10404 Pfam:PF10405 SMART:SM01030 SMART:SM01031 SMART:SM01032
            CGD:CAL0004788 GO:GO:0005634 GO:GO:0003684 GO:GO:0006289
            EMBL:AACQ01000029 EMBL:AACQ01000028 KO:K10838 eggNOG:COG5535
            PANTHER:PTHR12135 RefSeq:XP_719704.1 RefSeq:XP_719821.1
            ProteinModelPortal:Q5ADX0 STRING:Q5ADX0 GeneID:3638462
            GeneID:3638600 KEGG:cal:CaO19.14014 KEGG:cal:CaO19.6722
            Uniprot:Q5ADX0
        Length = 709

 Score = 240 (89.5 bits), Expect = 2.0e-18, Sum P(2) = 2.0e-18
 Identities = 103/388 (26%), Positives = 158/388 (40%)

Query:   485 PLYWAEVYCSGENLTGKWVHVDA-ANAIID--GEQK---VEAAAAACKTSLRYIVAFAGC 538
             P++W EV+      T +WV +D     +I+   ++K    E      +  L Y+VAF   
Sbjct:   281 PVFWVEVW---NKYTRQWVSIDPIVMKLIEVCPKRKKSPFEPPPTDERNQLTYVVAFDKF 337

Query:   539 G-AKDVTRRYCMKWYRIASKRVNSAWWDAVLAPLRELESGATGDLNVESSAKDSFVADRN 597
             G  +DVTRRY    Y   +K +       +     E +S     L      K   VAD  
Sbjct:   338 GRVRDVTRRYS---YNYNAKTIRKR----IEFRSSEDKSWYLKVLRCCDFKKTQNVAD-- 388

Query:   598 SLEDMELETRALTEPLPTNQQAYKNHQLYVIERWLNKYQILYPKGPI--LG-FCSGHA-- 652
               E  E   R L E +P N QA+KNH LY +E  L + +I++PK      G F S ++  
Sbjct:   389 IYEQKEFYDRDLAEGMPNNIQAFKNHPLYALESQLRQDEIIFPKDDTSKCGTFRSKNSSK 448

Query:   653 ---VYPRSCVQTLKTKERWLREALQVKANEXXXXXXXXXXXXXXGQDFEPEDYDEVDARG 709
                VY RSCV  L++ + W     Q+K                      P    E D R 
Sbjct:   449 VFQVYKRSCVHRLRSAKAWYMRGRQLKVGAI------------------PLKSKEEDVR- 489

Query:   710 NIELYGKWQLEPLRLPSAVNGIVPRNERGQVDVWSEKCLPPGTVHLR------LPRVYSV 763
                LY ++Q +    P   +GIVP+N+ G +DV+++  LP  ++ +       +  + + 
Sbjct:   490 ---LYAEFQTQLYIPPPVTDGIVPKNQYGNIDVYTKTMLPENSILIECDENCSMKMLQNA 546

Query:   764 AKRLEIDSAPAMVGFEFRNGRS----TPVFDGIVVCAEFKDTILEAYXXXXXXXXXXXXX 819
             A  L ID A A+V F F+  +     T    GIV+  E+++ +                 
Sbjct:   547 ANLLAIDYAKAIVSFSFKGKKKKHNITAREGGIVIAKEYEEAMQLTIDNLIEQEEEDQRA 606

Query:   820 XXXXQATSRWYQLLSSIVTRQRLNNCYG 847
                  A   W   L  +    RLN  +G
Sbjct:   607 LSEANALRNWKYFLLKLRLEDRLNKSHG 634

 Score = 68 (29.0 bits), Expect = 2.0e-18, Sum P(2) = 2.0e-18
 Identities = 23/86 (26%), Positives = 41/86 (47%)

Query:   117 NVLDGGEEMYDSDWEDGSIPVACSKENHPESDIKGVTIEFDAADSVTKKPVRRASAEDKE 176
             N+LD  +E    D E+  IP    KE+  ++    + I  D      K P    S E++ 
Sbjct:    54 NILDDSDEFETIDLEN--IP----KESGNDT----LVIRIDNNKKEEKTPKNLISREERH 103

Query:   177 LAELVHKVHLLCLLARGRLIDSVCDD 202
                L+HK++L+ +L  G + +  C++
Sbjct:   104 RRVLLHKMYLVMMLVHGSIRNLWCNN 129


>SGD|S000000964
            symbol:RAD4 "Protein that recognizes and binds damaged DNA
            during NER" species:4932 "Saccharomyces cerevisiae" [GO:0000111
            "nucleotide-excision repair factor 2 complex" evidence=IDA]
            [GO:0003684 "damaged DNA binding" evidence=IEA;IDA] [GO:0005634
            "nucleus" evidence=IEA;IDA] [GO:0005737 "cytoplasm" evidence=IEA]
            [GO:0006281 "DNA repair" evidence=IEA] [GO:0006974 "response to DNA
            damage stimulus" evidence=IEA] [GO:0003677 "DNA binding"
            evidence=IEA] [GO:0043161 "proteasomal ubiquitin-dependent protein
            catabolic process" evidence=IMP] [GO:0005829 "cytosol"
            evidence=IDA] [GO:0006289 "nucleotide-excision repair"
            evidence=IEA;IMP] InterPro:IPR004583 InterPro:IPR018325
            InterPro:IPR018326 InterPro:IPR018327 InterPro:IPR018328
            Pfam:PF03835 Pfam:PF10403 Pfam:PF10404 Pfam:PF10405 SMART:SM01030
            SMART:SM01031 SMART:SM01032 SGD:S000000964 GO:GO:0005829
            GO:GO:0043161 GO:GO:0003684 EMBL:BK006939 KO:K01530
            RefSeq:NP_011093.3 GeneID:856913 KEGG:sce:YER166W GO:GO:0006289
            EMBL:U18917 RefSeq:NP_011089.4 GeneID:856909 KEGG:sce:YER162C
            KO:K10838 PDB:2QSF PDB:2QSG PDB:2QSH PDBsum:2QSF PDBsum:2QSG
            PDBsum:2QSH GO:GO:0000111 eggNOG:COG5535 PANTHER:PTHR12135
            EMBL:M26050 EMBL:M24928 PIR:S30814 ProteinModelPortal:P14736
            SMR:P14736 DIP:DIP-1547N IntAct:P14736 MINT:MINT-396392
            STRING:P14736 PaxDb:P14736 PeptideAtlas:P14736 EnsemblFungi:YER162C
            GeneTree:ENSGT00390000005194 HOGENOM:HOG000074544 OMA:FKGRHGT
            OrthoDB:EOG4Z0FG0 EvolutionaryTrace:P14736 NextBio:983347
            Genevestigator:P14736 GermOnline:YER162C Uniprot:P14736
        Length = 754

 Score = 237 (88.5 bits), Expect = 1.6e-16, Sum P(3) = 1.6e-16
 Identities = 94/380 (24%), Positives = 159/380 (41%)

Query:   485 PLYWAEVYCSGENLTGKWVHVDAANA-IIDG---EQKVEAAAAAC--KTSLRYIVAF-AG 537
             P++W EV+   +  + KW+ VD  N   I+      K+     AC  +  LRY++A+   
Sbjct:   313 PIFWCEVW---DKFSKKWITVDPVNLKTIEQVRLHSKLAPKGVACCERNMLRYVIAYDRK 369

Query:   538 CGAKDVTRRYCMKWY--RIASKRVNSAWWDAVLAPLRELESGATGDLNVESSAKDSFVAD 595
              G +DVTRRY  +W   ++  +R+     D      R++ +     L+     K   + D
Sbjct:   370 YGCRDVTRRYA-QWMNSKVRKRRITKD--DFGEKWFRKVITA----LHHRKRTK---IDD 419

Query:   596 RNSLEDMELETRALTEPLPTNQQAYKNHQLYVIERWLNKYQILYPKGPILGFCSGHA--- 652
                 ED     R  +E +P + Q  KNH  YV+E+ + + QI+ P     G+   H    
Sbjct:   420 ---YEDQYFFQRDESEGIPDSVQDLKNHPYYVLEQDIKQTQIVKPGCKECGYLKVHGKVG 476

Query:   653 ----VYPRSCVQTLKTKERWLREALQVKANEXXXXXXXXXXXXXXGQDFEPEDYDEVDAR 708
                 VY +  +  LK+  +W      +K                 G+  E ED + + + 
Sbjct:   477 KVLKVYAKRDIADLKSARQWYMNGRILKTGSRCKKVIKRTVGRPKGEA-EEED-ERLYSF 534

Query:   709 GNIELYGKWQLEPLRLPSAVNGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKRLE 768
              + ELY    + PL   ++ +G + +N  G ++V++   +P     +  P     A+ L 
Sbjct:   535 EDTELY----IPPL---ASASGEITKNTFGNIEVFAPTMIPGNCCLVENPVAIKAARFLG 587

Query:   769 IDSAPAMVGFEFRNGRST-PVFDGIVVCAEFKDTILEAYXXXXXXXXXXXXXXXXXQATS 827
             ++ APA+  F+F  G +  PV  GIVV    ++ I  A                   A  
Sbjct:   588 VEFAPAVTSFKFERGSTVKPVLSGIVVAKWLREAIETAIDGIEFIQEDDNRKEHLLGALE 647

Query:   828 RWYQLLSSIVTRQRLNNCYG 847
              W  LL  +  R +LN+ YG
Sbjct:   648 SWNTLLLKLRIRSKLNSTYG 667

 Score = 52 (23.4 bits), Expect = 1.6e-16, Sum P(3) = 1.6e-16
 Identities = 17/80 (21%), Positives = 41/80 (51%)

Query:   119 LDGGEEMYDSD-WEDGSIPVACSKENHPESDIKGVTIEFDAA---DSVTKKPVRRA-SAE 173
             +   EE YDS+ +ED +       + +  + ++ +++E   +   +S  ++  R   S E
Sbjct:    83 IQSSEEDYDSEEFEDVT-------DGNEVAGVEDISVEIKPSSKRNSDARRTSRNVCSNE 135

Query:   174 DKELAELVHKVHLLCLLARG 193
             +++  +  H ++L+CL+  G
Sbjct:   136 ERKRRKYFHMLYLVCLMVHG 155

 Score = 46 (21.3 bits), Expect = 1.6e-16, Sum P(3) = 1.6e-16
 Identities = 13/51 (25%), Positives = 21/51 (41%)

Query:   244 VRSSVSTRRSF----HSDLAHALESREGTPEEIAALSVALFRALKLTTRFV 290
             +  S + +R F     SD   A+    G P+      VA+ RA  +  R +
Sbjct:   233 IEMSANNKRKFKTLKRSDFLRAVSKGHGDPDISVQGFVAMLRACNVNARLI 283


>UNIPROTKB|G4MUV6
            symbol:MGG_01699 "Uncharacterized protein" species:242507
            "Magnaporthe oryzae 70-15" [GO:0043581 "mycelium development"
            evidence=IEP] InterPro:IPR004583 InterPro:IPR018325
            InterPro:IPR018326 InterPro:IPR018327 InterPro:IPR018328
            Pfam:PF03835 Pfam:PF10403 Pfam:PF10404 Pfam:PF10405 SMART:SM01031
            SMART:SM01032 GO:GO:0005634 GO:GO:0003684 GO:GO:0043581
            EMBL:CM001232 GO:GO:0006289 PANTHER:PTHR12135 RefSeq:XP_003714693.1
            ProteinModelPortal:G4MUV6 EnsemblFungi:MGG_01699T0 GeneID:2679173
            KEGG:mgr:MGG_01699 Uniprot:G4MUV6
        Length = 1045

 Score = 200 (75.5 bits), Expect = 2.3e-14, Sum P(3) = 2.3e-14
 Identities = 67/266 (25%), Positives = 110/266 (41%)

Query:   595 DRNSLEDMELETRALTEPLPTNQQAYKNHQLYVIERWLNKYQ-ILYPKGPILGF---CSG 650
             D   L   + E + + E   T Q  YK  + YV+ER L + + +L    P+  F     G
Sbjct:   599 DSTDLRPAKHEKKEVKEGDETLQY-YKQSKEYVLERHLKREEALLQDATPVKVFKVKAKG 657

Query:   651 -----HAVYPRSCVQTLKTKERWLREALQVKANEXXXXXXXXXXXXXXGQDFEPEDYDEV 705
                    VY R  V  +K+ E W ++    K  E               +     D    
Sbjct:   658 GEFTEENVYLRRDVVQVKSAETWHKQGRAPKEGEKPLKMVPYRAATMNRK----RDIAAA 713

Query:   706 DAR-GNIELYGKWQLEPLR--LPSAV-NGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVY 761
             +A  G   L G + ++     +P  + +GI+P+NE G +D+++E   P G VH+      
Sbjct:   714 EAATGKKVLQGLYSMDQTDWIIPPPIKDGIIPKNEYGNIDLFAEHMCPQGAVHVPFRGAV 773

Query:   762 SVAKRLEIDSAPAMVGFEFRNGRSTPVFDGIVVCAEFKDTILEAYXXXXXXXXXXXXXXX 821
              V +RL +D A A++ FEF +  + PV  G+V+  E  D ++E                 
Sbjct:   774 KVCRRLGVDYAEAVIDFEFGHRMAVPVIQGVVIAEEHHDRVMEELAKDEAERARKEDAKR 833

Query:   822 XXQATSRWYQLLSSIVTRQRLNNCYG 847
                A + W ++L ++    RL   YG
Sbjct:   834 TAAALAMWRKMLMAMRITNRLREEYG 859

 Score = 75 (31.5 bits), Expect = 2.3e-14, Sum P(3) = 2.3e-14
 Identities = 26/107 (24%), Positives = 52/107 (48%)

Query:    99 RDAMGNTLRELDEGRLQDNVLDGGEEMYDSDWEDGSIPVA-CSKENHPESDIKGVTIEFD 157
             +D +    R LD     D+  D  ++  D ++ED    +A  ++E  P  D++ +T++ D
Sbjct:    84 KDVVAAADRSLDMADEDDDGSDDDDD--DIEFEDVQASLAPFAEEAAPSGDLE-LTLDLD 140

Query:   158 AADSVTKKPVRRASAEDKE--LAELVHKVHLLCLLARGRLIDS-VCD 201
                S+T +   +     +E      VH+VH++ L+    + +S +CD
Sbjct:   141 GRISLTNEYGNKKGPSKRERITRNAVHRVHVMFLMWHNAVRNSWLCD 187

 Score = 47 (21.6 bits), Expect = 2.3e-14, Sum P(3) = 2.3e-14
 Identities = 20/91 (21%), Positives = 34/91 (37%)

Query:   328 AKPEEVLASPVKSFSCDKKENVCETSSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLD 387
             A PEE  +S        +     + ++K  P  ++ S +S   QSK           + +
Sbjct:   378 ADPEEERSSQPSPEKPTQTTQTPQKNTKNEPRRQHVSSRSRGKQSKAIEEEDSNYVDDFE 437

Query:   388 PSSSMACSDISEACHPKEKSQALKRKGDLEF 418
             P    +  ++      K   Q+ K   DLEF
Sbjct:   438 PQEVNSDDEMVVEVPKKMAPQSKKFDQDLEF 468

 Score = 47 (21.6 bits), Expect = 1.7e-10, Sum P(3) = 1.7e-10
 Identities = 14/51 (27%), Positives = 22/51 (43%)

Query:   380 ELSSGNLDPSSSMACSDISEACHPKEKSQALKRKGDLEFEMQLEMALSATN 430
             E   G+ D    +   D+  +  P  +  A    GDLE  + L+  +S TN
Sbjct:    99 EDDDGSDDDDDDIEFEDVQASLAPFAEEAA--PSGDLELTLDLDGRISLTN 147

 Score = 37 (18.1 bits), Expect = 2.4e-13, Sum P(3) = 2.4e-13
 Identities = 8/15 (53%), Positives = 10/15 (66%)

Query:   475 TAVGSRKVGAPLYWA 489
             +  GSR VGA L+ A
Sbjct:   336 SCTGSRDVGAQLFTA 350


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.313   0.128   0.373    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      934       879   0.00085  122 3  11 23  0.45    34
                                                     37  0.45    37
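
The bit scores shown in parentheses on each "Score =" line can be reproduced from the raw scores and the Lambda and K values in the table above. The sketch below uses the standard Karlin-Altschul normalization, bits = (Lambda * S - ln K) / ln 2. It assumes that the gapped "As Used" row (Q=9,R=2; Lambda 0.244, K 0.0300) is the one applied; this is inferred from the numbers in this report rather than stated in it, and the function name bit_score is only illustrative.

```python
import math

# Assumption: the gapped "As Used" parameters (row Q=9,R=2) drive the
# reported bit scores. Inferred from the report's numbers, not stated in it.
LAMBDA = 0.244
K = 0.0300


def bit_score(raw_score: int, lam: float = LAMBDA, k: float = K) -> float:
    """Convert a raw alignment score S to bits: (lam * S - ln k) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    # Raw scores taken from the "Score =" lines of the two hits above.
    for raw in (237, 52, 46, 200, 75, 47, 37):
        print(f"Score = {raw} ({bit_score(raw):.1f} bits)")
```

With these parameters the raw score 237 of the first RAD4 segment comes out at 88.5 bits and the raw score 200 of the first MGG_01699 segment at 75.5 bits, matching the values printed in the report.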


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  21
  No. of states in DFA:  633 (67 KB)
  Total size of DFA:  451 KB (2213 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  83.73u 0.11s 83.84t   Elapsed:  00:00:04
  Total cpu time:  83.75u 0.11s 83.86t   Elapsed:  00:00:04
  Start:  Tue May 21 19:27:32 2013   End:  Tue May 21 19:27:36 2013
