BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 003292
         (833 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|P51612|XPC_MOUSE DNA repair protein complementing XP-C cells homolog OS=Mus musculus
           GN=Xpc PE=1 SV=2
          Length = 930

 Score =  277 bits (709), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 230/716 (32%), Positives = 335/716 (46%), Gaps = 86/716 (12%)

Query: 73  DKELAELVHKVHLLCLLARGRLIDSVCDDPLIQASLLSLLPSYLLKISEVSKLTANALSP 132
           +KE+ E +HKVHLLCLLA G   +S+C  P + A  LS++P    K+  +    A  LS 
Sbjct: 194 NKEVQENMHKVHLLCLLASGFYRNSICRQPDLLAIGLSIIPIRFTKVP-LQDRDAYYLSN 252

Query: 133 IVSWFHDNFHVRSSVSTRRSFHSDLAHALESREGT-----PEEIAALSVALFRALKLTTR 187
           +V WF   F V + +S   S   DL   LE R         EE+  + + + RAL+L TR
Sbjct: 253 LVKWFIGTFTVNADLSA--SEQDDLQTTLERRIAIYSARDNEELVHIFLLILRALQLLTR 310

Query: 188 FVSILDVASLKPEADKNVSSNQDSSRVGGG-----------IFNAPTL-MVAKPEEVLAS 235
            V  L    LK    K   S++++S  G G             N PT     K EE L+ 
Sbjct: 311 LVLSLQPIPLKSAVTKGRKSSKETSVEGPGGSSELSSNSPESHNKPTTSRRIKEEETLSE 370

Query: 236 PVKSFSCDKKENVCETSSKG--SPECKYSSPKSNNTQ----SKKSPVSCELS-------- 281
                +   K       S+    P C          Q    ++K  V+ ++S        
Sbjct: 371 GRGKATARGKRGTGTAGSRQRRKPSCSEGEEAEQKVQGRPHARKRRVAAKVSYKEESESD 430

Query: 282 ----SGNLDPSSSMACSDISEACHPKEKSQALKRKGDLEFEMQLEMALSATNVATSKSNI 337
                 + +PSS        E C P  + Q  KR        Q   A S +   T + + 
Sbjct: 431 GAGSGSDFEPSSGEGQHSSDEDCEPGPRKQ--KRAS----APQRTKAGSKSASKTQRGSQ 484

Query: 338 CSDVKDLNSNSSTVLPVKRLKKIESGESSTSCLGISTAVGSRKVGAPLYWAEVYCSGENL 397
           C +       SS+    KR KK+ SG         +  +  RK      W EVYC  +  
Sbjct: 485 C-EPSSFPEASSSSSGCKRGKKVSSG---------AEEMADRKPAGVDQWLEVYCEPQ-- 532

Query: 398 TGKWVHVDAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCG-AKDVTRRYCMKWYRIASK 456
             KWV VD  + ++   Q V     A K  + Y+V     G  +DVT+RY   W     K
Sbjct: 533 -AKWVCVDCVHGVVG--QPVACYKYATK-PMTYVVGIDSDGWVRDVTQRYDPAWMTATRK 588

Query: 457 -RVNSAWWDAVLAPLRELESGATGDLNVESSAKDSFVADRNSLEDMELETRALTEPLPTN 515
            RV++ WW   L P R                  S + +R   ED E + + L +PLPT+
Sbjct: 589 CRVDAEWWAETLRPYR------------------SLLTEREKKEDQEFQAKHLDQPLPTS 630

Query: 516 QQAYKNHQLYVIERWLNKYQILYPK-GPILGFCSGHAVYPRSCVQTLKTKERWLREALQV 574
              YKNH LY ++R L K+Q +YP+   +LG+C G AVY R CV TL +++ WL++A  V
Sbjct: 631 ISTYKNHPLYALKRHLLKFQAIYPETAAVLGYCRGEAVYSRDCVHTLHSRDTWLKQARVV 690

Query: 575 KANEVPVKVIKN-SSKSKKGQDFEPEDYDEVDARGNIELYGKWQLEPLRLPSAVNGIVPR 633
           +  EVP K++K  S++++K +  EP+ +D  D    + LYG WQ E  + P AV+G VPR
Sbjct: 691 RLGEVPYKMVKGFSNRARKARLSEPQLHDHND----LGLYGHWQTEEYQPPIAVDGKVPR 746

Query: 634 NERGQVDVWSEKCLPPGTVHLRLPRVYSVAKRLEIDSAPAMVGFEFRNGRSTPVFDGIVV 693
           NE G V ++    +P G V + LP +  VA++L ID   A+ GF+F  G   PV DG +V
Sbjct: 747 NEFGNVYLFLPSMMPVGCVQMTLPNLNRVARKLGIDCVQAITGFDFHGGYCHPVTDGYIV 806

Query: 694 CAEFKDTILEAYAEEEEKREAEEKKRREAQATSRWYQLLSSIVTRQRLNNCYGNNS 749
           C EF+D +L A+  E+   E +EK+++E +A   W  L+  ++ R+RL   YG  S
Sbjct: 807 CEEFRDVLLAAWENEQAIIEKKEKEKKEKRALGNWKLLVRGLLIRERLKLRYGAKS 862


>sp|Q01831|XPC_HUMAN DNA repair protein complementing XP-C cells OS=Homo sapiens GN=XPC
           PE=1 SV=4
          Length = 940

 Score =  254 bits (650), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 230/738 (31%), Positives = 337/738 (45%), Gaps = 112/738 (15%)

Query: 66  VRRASAE-DKELAELVHKVHLLCLLARGRLIDSVCDDPLIQASLLSLLPSYLLKISEVSK 124
           +RRA    +K + E  HKVHLLCLLA G   +++C  P + A  LS++P+   ++     
Sbjct: 190 LRRAMKRFNKGVHEDTHKVHLLCLLANGFYRNNICSQPDLHAIGLSIIPARFTRVLP-RD 248

Query: 125 LTANALSPIVSWFHDNFHVRSSVSTRRSFHSDLAHALESREGT-----PEEIAALSVALF 179
           +    LS +V WF   F V + +S   S   +L   LE R         EE+  + + + 
Sbjct: 249 VDTYYLSNLVKWFIGTFTVNAELSA--SEQDNLQTTLERRFAIYSARDDEELVHIFLLIL 306

Query: 180 RALKLTTRFVSILDVASLKPEADKNV------------SSNQDSSRVGGGIFNAPTLMVA 227
           RAL+L TR V  L    LK    K               S++ SS+V        T    
Sbjct: 307 RALQLLTRLVLSLQPIPLKSATAKGKKPSKERLTADPGGSSETSSQVLENHTKPKTSKGT 366

Query: 228 KPEEVLASPVKSFSCDKKENVCETSSKGSPECKYSSPKSNNTQSKK-------------- 273
           K EE  A      S   K N      +  P          + Q K               
Sbjct: 367 KQEETFAKGTCRPSAKGKRNKGGRKKRSKPSSSEEDEGPGDKQEKATQRRPHGRERRVAS 426

Query: 274 --------------SPVSCELSSGNL-DPSSSMACSDISEACHPKEKSQALKRKGDLEFE 318
                         S    ELSSG   DPS      + SE   PK++     ++      
Sbjct: 427 RVSYKEESGSDEAGSGSDFELSSGEASDPSD-----EDSEPGPPKQRKAPAPQR------ 475

Query: 319 MQLEMALSATNVATSKSNICSDVKD--LNSNSSTVLPVKRLKKIES-GESSTSCLGISTA 375
                   A + + S+++  S  KD  L + SS+    KR KK+ S GE +         
Sbjct: 476 ------TKAGSKSASRTHRGSHRKDPSLPAASSSSSSSKRGKKMCSDGEKAEK------- 522

Query: 376 VGSRKVGAPLYWAEVYCSGENLTGKWVHVDAANAIIDGEQKVEAAAAACKTSLRYIVAFA 435
              R +     W EV+C  E    KWV VD  + ++   Q +     A K  + Y+V   
Sbjct: 523 ---RSIAGIDQWLEVFCEQEE---KWVCVDCVHGVVG--QPLTCYKYATK-PMTYVVGID 573

Query: 436 GCG-AKDVTRRYCMKWYRIASK-RVNSAWWDAVLAPLRELESGATGDLNVESSAKDSFVA 493
             G  +DVT+RY   W  +  K RV++ WW   L P +                  S   
Sbjct: 574 SDGWVRDVTQRYDPVWMTVTRKCRVDAEWWAETLRPYQ------------------SPFM 615

Query: 494 DRNSLEDMELETRALTEPLPTNQQAYKNHQLYVIERWLNKYQILYPK-GPILGFCSGHAV 552
           DR   ED+E + + + +PLPT    YKNH LY ++R L KY+ +YP+   ILG+C G AV
Sbjct: 616 DREKKEDLEFQAKHMDQPLPTAIGLYKNHPLYALKRHLLKYEAIYPETAAILGYCRGEAV 675

Query: 553 YPRSCVQTLKTKERWLREALQVKANEVPVKVIKN-SSKSKKGQDFEPEDYDEVDARGNIE 611
           Y R CV TL +++ WL++A  V+  EVP K++K  S++++K +  EP+  +E D    + 
Sbjct: 676 YSRDCVHTLHSRDTWLKKARVVRLGEVPYKMVKGFSNRARKARLAEPQLREEND----LG 731

Query: 612 LYGKWQLEPLRLPSAVNGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKRLEIDSA 671
           L+G WQ E  + P AV+G VPRNE G V ++    +P G V L LP ++ VA++L+ID  
Sbjct: 732 LFGYWQTEEYQPPVAVDGKVPRNEFGNVYLFLPSMMPIGCVQLNLPNLHRVARKLDIDCV 791

Query: 672 PAMVGFEFRNGRSTPVFDGIVVCAEFKDTILEAYAEEEEKREAEEKKRREAQATSRWYQL 731
            A+ GF+F  G S PV DG +VC EFKD +L A+  E+   E +EK+++E +A   W  L
Sbjct: 792 QAITGFDFHGGYSHPVTDGYIVCEEFKDVLLTAWENEQAVIERKEKEKKEKRALGNWKLL 851

Query: 732 LSSIVTRQRLNNCYGNNS 749
              ++ R+RL   YG  S
Sbjct: 852 AKGLLIRERLKRRYGPKS 869


>sp|Q24595|XPC_DROME DNA repair protein complementing XP-C cells homolog OS=Drosophila
            melanogaster GN=mus210 PE=1 SV=2
          Length = 1293

 Score =  212 bits (540), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 128/372 (34%), Positives = 189/372 (50%), Gaps = 36/372 (9%)

Query: 378  SRKVGAPLYWAEVYCSGENLTGKWVHVDAANAIIDGEQK-VEAAAAACKTSLRYIVAFAG 436
            +RK  A   W EV+   E    +W+ +D    +  G+   V+         L Y+ AF  
Sbjct: 952  NRKTDASDMWVEVWSDVEE---QWICID----LFKGKLHCVDTIRKNATPGLAYVFAFQD 1004

Query: 437  CGA-KDVTRRYCMKWYRIASK-RVNSAWWDAVLAPLRELESGATGDLNVESSAKDSFVAD 494
              + KDVT RYC  W     K RV  AW D  +AP     +                   
Sbjct: 1005 DQSLKDVTARYCASWSTTVRKARVEKAWLDETIAPYLGRRT------------------K 1046

Query: 495  RNSLEDMELETRALTEPLPTNQQAYKNHQLYVIERWLNKYQILYP-KGPILGFCSGHAVY 553
            R+  ED +L      +PLP +   +K+H LYV+ER L K+Q LYP   P LGF  G AVY
Sbjct: 1047 RDITEDDQLRRIHSDKPLPKSISEFKDHPLYVLERHLLKFQGLYPPDAPTLGFIRGEAVY 1106

Query: 554  PRSCVQTLKTKERWLREALQVKANEVPVKVIKNSSKSKKGQDFEPEDYDEVDARGNIELY 613
             R CV  L ++E WL+ A  VK  E P KV+K   K  +      +D         +E++
Sbjct: 1107 SRDCVHLLHSREIWLKSARVVKLGEQPYKVVKARPKWDRLTRTVIKDQP-------LEIF 1159

Query: 614  GKWQLEPLRLPSAVNGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKRLEIDSAPA 673
            G WQ +    P+A NGIVPRN  G V+++ +  LP  TVHLRLP +  + K+L ID A A
Sbjct: 1160 GYWQTQEYEPPTAENGIVPRNAYGNVELFKDCMLPKKTVHLRLPGLMRICKKLNIDCANA 1219

Query: 674  MVGFEFRNGRSTPVFDGIVVCAEFKDTILEAYAEEEEKREAEEKKRREAQATSRWYQLLS 733
            +VGF+F  G   P++DG +VC EF++ +  A+ E+++ +  +E+++ E +    W +L+ 
Sbjct: 1220 VVGFDFHQGACHPMYDGFIVCEEFREVVTAAWEEDQQVQVLKEQEKYETRVYGNWKKLIK 1279

Query: 734  SIVTRQRLNNCY 745
             ++ R+RL   Y
Sbjct: 1280 GLLIRERLKKKY 1291



 Score = 41.6 bits (96), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 46/170 (27%), Positives = 74/170 (43%), Gaps = 27/170 (15%)

Query: 67  RRASAEDKELAELVHKVHLLCLLAR----GRLIDSVCDDPLIQASLLSLLPSYLLKISEV 122
           RR + + K+   L+HKV L+C +AR     RL+     D L+QA+ L LLPS     +E 
Sbjct: 370 RRLNRDIKDRQILLHKVSLMCQIARSLKYNRLLSE--SDSLMQAT-LKLLPSRNAYPTER 426

Query: 123 SKLTANALSPIVSWFHD-------NFHVRSSVSTRRSFHSDLAHALESREGT-PEEIAAL 174
                  L   V+WF         N +   S +T+ +    L   ++ +E    +++  +
Sbjct: 427 G-TELKYLQSFVTWFKTSIKLLSPNLYSAQSPATKEAILEALLEQVKRKEARCKQDMIFI 485

Query: 175 SVALFRALKLTTRFVSILDVASLKPEA-----------DKNVSSNQDSSR 213
            +AL R + +  R +  L    L+P A           DKN S   +S R
Sbjct: 486 FIALARGMGMHCRLIVNLQPMPLRPAASDLIPIKLRPDDKNKSQTVESER 535


>sp|Q10445|RHP41_SCHPO DNA repair protein rhp41 OS=Schizosaccharomyces pombe (strain 972 /
           ATCC 24843) GN=rhp41 PE=3 SV=1
          Length = 638

 Score =  143 bits (360), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 111/376 (29%), Positives = 171/376 (45%), Gaps = 52/376 (13%)

Query: 384 PLYWAEVYCSGENLTGKWVHVDAAN--AIIDGEQKVEAAAAACKTSLRYIVAFAGCG-AK 440
           P++W E +        KWV VD     ++I   ++ E A++     + Y+ A    G  K
Sbjct: 302 PVFWVEAFNKA---MQKWVCVDPFGDASVIGKYRRFEPASSDHLNQMTYVFAIEANGYVK 358

Query: 441 DVTRRYCMKWYRIASKRV-----NSAWWDAVLAPLRELESGATGDLNVESSAKDSFVADR 495
           DVTR+YC+ +Y+I   RV       AW + + + +     G   D          F  D 
Sbjct: 359 DVTRKYCLHYYKILKNRVEIFPFGKAWMNRIFSKI-----GKPRD----------FYNDM 403

Query: 496 NSLEDMELETRALTEPLPTNQQAYKNHQLYVIERWLNKYQILYPK---GPILGFCSGHAV 552
           +++ED EL     +E +P N Q  K+H L+V+ER L K Q +      G I        V
Sbjct: 404 DAIEDAELLRLEQSEGIPRNIQDLKDHPLFVLERHLKKNQAIKTGKSCGRINTKNGVELV 463

Query: 553 YPRSCVQTLKTKERWLREALQVKANEVPVKVIKNSSKSKKGQDFEPEDYDEVDARGNIEL 612
           YPR  V    + E W R+   +K    P+K +KN  K                    + L
Sbjct: 464 YPRKYVSNGFSAEHWYRKGRIIKPGAQPLKHVKNGDKV-------------------LPL 504

Query: 613 YGKWQLEPLRLPSAVNGIVPRNERGQVDVWSEKCLPPGTVHLRLPRVYSVAKRLEIDSAP 672
           Y +   +       V  IVP+N  G +D++    LP G  H R     + AK LEID A 
Sbjct: 505 YDEEATQLYTPKPVVANIVPKNAYGNIDLYVPSMLPYGAYHCRKRCALAAAKFLEIDYAK 564

Query: 673 AMVGFEFRNGRSTPVFDGIVVCAEFKDTILEAYAEE--EEKREAEEKKRREAQATSRWYQ 730
           A+VGF+F+   S P  +G+VV   +++ I +  AEE  +E++EAE +  R+      W +
Sbjct: 565 AVVGFDFQRKYSKPKLEGVVVSKRYEEAI-DLIAEEIDQEEKEAEARNVRKT-CLLLWKR 622

Query: 731 LLSSIVTRQRLNNCYG 746
           L++ +  RQR+   YG
Sbjct: 623 LITGLRIRQRVFEEYG 638


>sp|P87235|RHP42_SCHPO DNA repair protein rhp42 OS=Schizosaccharomyces pombe (strain 972 /
           ATCC 24843) GN=rhp42 PE=3 SV=1
          Length = 686

 Score =  117 bits (292), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 108/387 (27%), Positives = 172/387 (44%), Gaps = 47/387 (12%)

Query: 384 PLYWAEVYCSGENLTGKWVHVDAA--NAIIDGEQ---KVEAAAAACKTSLRYIVAFAG-- 436
           P++W E+Y   E    KW+ VDA   N +   +    + + A A  K     IVA     
Sbjct: 323 PIFWTEIYDQSEK---KWIAVDAVVLNGVYTNDMTWFEPKGAYAESKHLRMGIVAAYDND 379

Query: 437 CGAKDVTRRYC----MKWYRIASKRVNSAWWDAVLAPLRELESGATGDLNVESSAKDSFV 492
             AKDVT RY      +  +I        ++D   A   +L             A+D + 
Sbjct: 380 LYAKDVTLRYTDYQSSRLKKIRHVSFADKYFDFYKAIFGQLAKR-------NKDAEDIY- 431

Query: 493 ADRNSLEDMELETRALTEPL--PTNQQAYKNHQLYVIERWLNKYQILYPKG-PILGFCSG 549
                 E+ ELE++    P+  P +   +KNH  +V+ R L + + L P   P+     G
Sbjct: 432 ------EEKELESKV---PIREPKSFADFKNHPEFVLIRHLRREEALLPNAKPVKTATFG 482

Query: 550 HA-------VYPRSCVQTLKTKERWLREALQVKANEVPVKVIKNSSKS---KKGQDFEPE 599
           +        VY R  V   KT E + +E   +K  E P K++K  + +   K+  +F   
Sbjct: 483 NGKKATSEEVYLRKDVVICKTPENYHKEGRVIKEGEQPRKMVKARAVTISRKREHEFRVA 542

Query: 600 DYDEVDARGNIELYGKWQLEPLRLPSAVNGIVPRNERGQVDVWSEKCLPPGTVHLRLPRV 659
           + +E   +G   LY   Q E    P   +GI+P+N  G +D + E  +P G  HL    +
Sbjct: 543 ETNEPVLQG---LYSSDQTELYVPPPIKDGIIPKNGYGNMDCFVESMIPKGAAHLPYRGI 599

Query: 660 YSVAKRLEIDSAPAMVGFEFRNGRSTPVFDGIVVCAEFKDTILEAYAEEEEKREAEEKKR 719
             +AK+L ID A A+ GFEFR  R+ PV  GI+V  E    + E + E E+ R  +++ +
Sbjct: 600 AKIAKKLNIDYADAVTGFEFRKHRAIPVTTGIIVPEESAQMVYEEFLECEKIRIEKQQMK 659

Query: 720 REAQATSRWYQLLSSIVTRQRLNNCYG 746
                  +W  LL+++  R+R+   Y 
Sbjct: 660 ERKIIYGQWKHLLNALRIRKRIEEQYA 686


>sp|P14736|RAD4_YEAST DNA repair protein RAD4 OS=Saccharomyces cerevisiae (strain ATCC
           204508 / S288c) GN=RAD4 PE=1 SV=3
          Length = 754

 Score =  115 bits (287), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 106/410 (25%), Positives = 174/410 (42%), Gaps = 55/410 (13%)

Query: 381 VGAPLYWAEVYCSGENLTGKWVHVDAANAI----IDGEQKVEAAAAAC--KTSLRYIVAF 434
           V  P++W EV+   +  + KW+ VD  N      +    K+     AC  +  LRY++A+
Sbjct: 310 VKYPIFWCEVW---DKFSKKWITVDPVNLKTIEQVRLHSKLAPKGVACCERNMLRYVIAY 366

Query: 435 -AGCGAKDVTRRYCMKWY-------RIASKRVNSAWWDAVLAPLRELESGATGDLNVESS 486
               G +DVTRRY  +W        RI        W+  V+  L   +     D      
Sbjct: 367 DRKYGCRDVTRRYA-QWMNSKVRKRRITKDDFGEKWFRKVITALHHRKRTKIDDY----- 420

Query: 487 AKDSFVADRNSLEDMELETRALTEPLPTNQQAYKNHQLYVIERWLNKYQILYPKGPILGF 546
                       ED     R  +E +P + Q  KNH  YV+E+ + + QI+ P     G+
Sbjct: 421 ------------EDQYFFQRDESEGIPDSVQDLKNHPYYVLEQDIKQTQIVKPGCKECGY 468

Query: 547 CSGHA-------VYPRSCVQTLKTKERWLREALQVKANEVPVKVIKNSSKSKKGQDFEPE 599
              H        VY +  +  LK+  +W      +K      KVIK +    KG   E E
Sbjct: 469 LKVHGKVGKVLKVYAKRDIADLKSARQWYMNGRILKTGSRCKKVIKRTVGRPKG---EAE 525

Query: 600 DYDE-VDARGNIELYGKWQLEPLRLPSAVNGIVPRNERGQVDVWSEKCLPPGTVHLRLPR 658
           + DE + +  + ELY    + PL   ++ +G + +N  G ++V++   +P     +  P 
Sbjct: 526 EEDERLYSFEDTELY----IPPL---ASASGEITKNTFGNIEVFAPTMIPGNCCLVENPV 578

Query: 659 VYSVAKRLEIDSAPAMVGFEFRNGRST-PVFDGIVVCAEFKDTILEAYAEEEEKREAEEK 717
               A+ L ++ APA+  F+F  G +  PV  GIVV    ++ I  A    E  +E + +
Sbjct: 579 AIKAARFLGVEFAPAVTSFKFERGSTVKPVLSGIVVAKWLREAIETAIDGIEFIQEDDNR 638

Query: 718 KRREAQATSRWYQLLSSIVTRQRLNNCYGNNSTSQSS-SNFQNVKKTNSN 766
           K     A   W  LL  +  R +LN+ YG  +  + + +  QN+   + N
Sbjct: 639 KEHLLGALESWNTLLLKLRIRSKLNSTYGKIAEEEPNVTKEQNIADNHDN 688


>sp|Q7F0R1|PNG1_ORYSJ Peptide-N(4)-(N-acetyl-beta-glucosaminyl)asparagine amidase
           OS=Oryza sativa subsp. japonica GN=PNG1 PE=2 SV=1
          Length = 447

 Score = 50.4 bits (119), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 37/72 (51%), Gaps = 6/72 (8%)

Query: 387 WAEVYCSGENLTGKWVHVDAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCGAKDVTRRY 446
           W E +    NL G+W+H+D    + D     E         L Y++A +  G +DVT+RY
Sbjct: 296 WTECF---SNLYGRWMHLDPCEGVYDNPLLYEKG---WNKKLDYVIAISKDGVRDVTKRY 349

Query: 447 CMKWYRIASKRV 458
             KW+ + S+R+
Sbjct: 350 TRKWHEVLSRRI 361


>sp|Q9FGY9|PNG1_ARATH Peptide-N(4)-(N-acetyl-beta-glucosaminyl)asparagine amidase
           OS=Arabidopsis thaliana GN=PNG1 PE=2 SV=1
          Length = 721

 Score = 43.9 bits (102), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 21/72 (29%), Positives = 32/72 (44%), Gaps = 6/72 (8%)

Query: 387 WAEVYCSGENLTGKWVHVDAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCGAKDVTRRY 446
           W E Y        +W+H+D    + D     E         L Y++A +  G  DVT+RY
Sbjct: 280 WTECYSHS---LKRWIHLDPCEGVYDKPMLYEKG---WNKKLNYVIAISKDGVCDVTKRY 333

Query: 447 CMKWYRIASKRV 458
             KW+ + S+R 
Sbjct: 334 TKKWHEVLSRRT 345


>sp|Q8J2R3|PNG1_GIBMO Protein PNG1 OS=Gibberella moniliformis GN=PNG1 PE=3 SV=1
          Length = 450

 Score = 41.6 bits (96), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 24/63 (38%), Positives = 31/63 (49%), Gaps = 6/63 (9%)

Query: 387 WAEVYCSGENLTGKWVHVDAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCGAKDVTRRY 446
           W EVY   +    +WVHVDA     D  + +   A      + Y +AF+  GA DVTRRY
Sbjct: 272 WTEVYSDHQK---RWVHVDACEEAWDNPRLL---AEGWGKKMSYCIAFSIDGATDVTRRY 325

Query: 447 CMK 449
             K
Sbjct: 326 VRK 328


>sp|Q4IR87|PNG1_GIBZE Protein PNG1 OS=Gibberella zeae (strain PH-1 / ATCC MYA-4620 / FGSC
           9075 / NRRL 31084) GN=PNG1 PE=3 SV=1
          Length = 451

 Score = 41.2 bits (95), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 24/63 (38%), Positives = 30/63 (47%), Gaps = 6/63 (9%)

Query: 387 WAEVYCSGENLTGKWVHVDAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCGAKDVTRRY 446
           W EVY   +    +WVHVDA     D  +     A      + Y +AF+  GA DVTRRY
Sbjct: 272 WTEVYSDHQK---RWVHVDACEEAWDNPRLY---AEGWGKKMSYCIAFSIDGATDVTRRY 325

Query: 447 CMK 449
             K
Sbjct: 326 VRK 328


>sp|Q2UPS5|PNG1_ASPOR Protein png1 OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40)
           GN=png1 PE=3 SV=1
          Length = 457

 Score = 40.8 bits (94), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 24/60 (40%), Positives = 32/60 (53%), Gaps = 6/60 (10%)

Query: 387 WAEVYCSGENLTGKWVHVDAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCGAKDVTRRY 446
           W E+Y   +    +WVHVDA     D + ++ A     K S  Y +AF+  GA DVTRRY
Sbjct: 290 WTEIYSEHQR---RWVHVDACEGAWD-QPRLYAEGWGRKMS--YCIAFSIDGATDVTRRY 343


>sp|Q4WHW1|PNG1_ASPFU Protein png1 OS=Neosartorya fumigata (strain ATCC MYA-4609 / Af293
           / CBS 101355 / FGSC A1100) GN=png1 PE=3 SV=1
          Length = 455

 Score = 40.4 bits (93), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 23/60 (38%), Positives = 28/60 (46%), Gaps = 6/60 (10%)

Query: 387 WAEVYCSGENLTGKWVHVDAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCGAKDVTRRY 446
           W EVY   +    +WVHVDA     D   +           L Y +AF+  GA DVTRRY
Sbjct: 288 WTEVYSEHQR---RWVHVDACEGAWD---QPRLYTEGWGRKLSYCIAFSIDGATDVTRRY 341


>sp|Q5B6P3|PNG1_EMENI Protein PNG1 OS=Emericella nidulans (strain FGSC A4 / ATCC 38163 /
           CBS 112.46 / NRRL 194 / M139) GN=png1 PE=3 SV=1
          Length = 441

 Score = 40.0 bits (92), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 24/60 (40%), Positives = 32/60 (53%), Gaps = 6/60 (10%)

Query: 387 WAEVYCSGENLTGKWVHVDAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCGAKDVTRRY 446
           W EVY   +    +W+HVDA     D + ++ A     K S  Y +AF+  GA DVTRRY
Sbjct: 275 WTEVYSEHQK---RWIHVDACEETWD-QPRLYAEGWGRKIS--YCIAFSIDGATDVTRRY 328


>sp|Q75D29|PNG1_ASHGO Peptide-N(4)-(N-acetyl-beta-glucosaminyl)asparagine amidase
           OS=Ashbya gossypii (strain ATCC 10895 / CBS 109.51 /
           FGSC 9923 / NRRL Y-1056) GN=PNG1 PE=3 SV=1
          Length = 350

 Score = 37.7 bits (86), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 20/63 (31%), Positives = 32/63 (50%), Gaps = 6/63 (9%)

Query: 387 WAEVYCSGENLTGKWVHVDAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCGAKDVTRRY 446
           W EVY    N   +WVHVD+     D   +    +     ++ Y++AF+    KDV+RRY
Sbjct: 212 WCEVY---SNALKRWVHVDSCEKSFD---EPHIYSVNWNKAMSYVIAFSNRSVKDVSRRY 265

Query: 447 CMK 449
            ++
Sbjct: 266 IVR 268


>sp|Q7SI01|PNG1_NEUCR Protein png-1 OS=Neurospora crassa (strain ATCC 24698 / 74-OR23-1A
           / CBS 708.71 / DSM 1257 / FGSC 987) GN=png-1 PE=3 SV=1
          Length = 382

 Score = 37.0 bits (84), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 20/60 (33%), Positives = 28/60 (46%), Gaps = 6/60 (10%)

Query: 387 WAEVYCSGENLTGKWVHVDAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCGAKDVTRRY 446
           W E+Y   +    +WVHVD+     D               + Y++AF+  GA DVTRRY
Sbjct: 207 WTEIYSEHQQ---RWVHVDSCEEAWDMPHMYYKNWG---KKMSYVIAFSREGAVDVTRRY 260


>sp|Q5XI55|NGLY1_RAT Peptide-N(4)-(N-acetyl-beta-glucosaminyl)asparagine amidase
           OS=Rattus norvegicus GN=Ngly1 PE=2 SV=2
          Length = 651

 Score = 36.6 bits (83), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 31/72 (43%), Gaps = 6/72 (8%)

Query: 387 WAEVYCSGENLTGKWVHVDAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCGAKDVTRRY 446
           W EVY   +    +W+H DA   + D     E         L YI+AF+     DVT RY
Sbjct: 335 WTEVYSPSQQ---RWLHCDACEDVCDKPLLYEIGWGK---KLSYIIAFSKDEVVDVTWRY 388

Query: 447 CMKWYRIASKRV 458
             K   + S+R 
Sbjct: 389 SCKHEEVMSRRT 400


>sp|Q6CAX5|PNG1_YARLI Peptide-N(4)-(N-acetyl-beta-glucosaminyl)asparagine amidase
           OS=Yarrowia lipolytica (strain CLIB 122 / E 150) GN=PNG1
           PE=3 SV=1
          Length = 356

 Score = 36.2 bits (82), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 19/63 (30%), Positives = 28/63 (44%), Gaps = 6/63 (9%)

Query: 387 WAEVYCSGENLTGKWVHVDAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCGAKDVTRRY 446
           W+EVY        +W+H D+     +        +      + Y+V F+G G  DVTRRY
Sbjct: 218 WSEVYSERRK---EWIHTDSCEEAWNSPT---IYSQGWGKKMSYVVGFSGDGVTDVTRRY 271

Query: 447 CMK 449
             K
Sbjct: 272 VRK 274


>sp|Q0VDD8|DYH14_HUMAN Dynein heavy chain 14, axonemal OS=Homo sapiens GN=DNAH14 PE=2 SV=3
          Length = 3507

 Score = 36.2 bits (82), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 46/181 (25%), Positives = 77/181 (42%), Gaps = 41/181 (22%)

Query: 430  YIVAFAGCGAKDVTRRYCM----KWYRI-ASKRVNSAWWDAVLAPLRELESGATG----- 479
            + +   GCG K      C     K YR+  S +     +  V   +  + +G  G     
Sbjct: 2010 FQIGIDGCGKKTCATLACYLTDNKLYRVPISHKCAYIEFKEVFKKVF-IHAGLKGKPTVL 2068

Query: 480  ---DLNVESSAKDSFVADRNSL----------EDMELETRALTEPLPTNQQAYKNHQLYV 526
               +LN+E   +DSF+ D N +          E++EL++ A+     T Q  + +++  +
Sbjct: 2069 MVPNLNIE---QDSFLEDLNYIISSGRIPDLFENVELDSIAMKIRYLTEQSGHMDNRQSL 2125

Query: 527  IERW-------LNKYQILYPKGPILGFCSGHAVYPR---SCVQTLKTKERWLREALQVKA 576
            +  +       L+ + I+ P+GP   F     VYP    SC  T+   ERW  EAL + A
Sbjct: 2126 LSFFQKRIYKNLHIFVIMSPEGP--SFRQNCRVYPSMISSC--TIDWYERWPEEALLIVA 2181

Query: 577  N 577
            N
Sbjct: 2182 N 2182


>sp|Q9JI78|NGLY1_MOUSE Peptide-N(4)-(N-acetyl-beta-glucosaminyl)asparagine amidase OS=Mus
           musculus GN=Ngly1 PE=1 SV=2
          Length = 651

 Score = 35.8 bits (81), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 31/72 (43%), Gaps = 6/72 (8%)

Query: 387 WAEVYCSGENLTGKWVHVDAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCGAKDVTRRY 446
           W EVY   +    +W+H DA   + D     E         L YI+AF+     DVT RY
Sbjct: 335 WTEVYSPSQQ---RWLHCDACEDVCDKPLLYEIGWGK---KLSYIIAFSKDEVVDVTWRY 388

Query: 447 CMKWYRIASKRV 458
             K   + S+R 
Sbjct: 389 SCKHDEVMSRRT 400


>sp|Q4R6F3|NGLY1_MACFA Peptide-N(4)-(N-acetyl-beta-glucosaminyl)asparagine amidase
           OS=Macaca fascicularis GN=NGLY1 PE=2 SV=1
          Length = 654

 Score = 34.7 bits (78), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 21/72 (29%), Positives = 31/72 (43%), Gaps = 6/72 (8%)

Query: 387 WAEVYCSGENLTGKWVHVDAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCGAKDVTRRY 446
           W EVY   +    +W+H DA   + D     E         L Y++AF+     DVT RY
Sbjct: 338 WTEVYSPSQQ---RWLHCDACEDVCDKPLLYEIGWGK---KLSYVIAFSKDEVVDVTWRY 391

Query: 447 CMKWYRIASKRV 458
             K   + ++R 
Sbjct: 392 SCKHEEVIARRT 403


>sp|Q7KRR5|NGLY1_DROME Peptide-N(4)-(N-acetyl-beta-glucosaminyl)asparagine amidase
           OS=Drosophila melanogaster GN=Pngl PE=1 SV=1
          Length = 631

 Score = 34.7 bits (78), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 19/60 (31%), Positives = 30/60 (50%), Gaps = 6/60 (10%)

Query: 387 WAEVYCSGENLTGKWVHVDAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCGAKDVTRRY 446
           W EVY   +    +W+HVD +  +ID     +      K  + YI+A++    +DVT RY
Sbjct: 325 WTEVYSEAQ---MRWLHVDPSENVIDSPLMYQ---HGWKRHIDYILAYSRDDIQDVTWRY 378


>sp|Q96IV0|NGLY1_HUMAN Peptide-N(4)-(N-acetyl-beta-glucosaminyl)asparagine amidase OS=Homo
           sapiens GN=NGLY1 PE=1 SV=1
          Length = 654

 Score = 34.7 bits (78), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 21/72 (29%), Positives = 31/72 (43%), Gaps = 6/72 (8%)

Query: 387 WAEVYCSGENLTGKWVHVDAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCGAKDVTRRY 446
           W EVY   +    +W+H DA   + D     E         L Y++AF+     DVT RY
Sbjct: 338 WTEVYSPSQQ---RWLHCDACEDVCDKPLLYEIGWGK---KLSYVIAFSKDEVVDVTWRY 391

Query: 447 CMKWYRIASKRV 458
             K   + ++R 
Sbjct: 392 SCKHEEVIARRT 403


>sp|Q503I8|NGLY1_DANRE Peptide-N(4)-(N-acetyl-beta-glucosaminyl)asparagine amidase
           OS=Danio rerio GN=ngly1 PE=2 SV=1
          Length = 644

 Score = 33.9 bits (76), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 31/78 (39%), Gaps = 8/78 (10%)

Query: 387 WAEVYCSGENLTGKWVHVDAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCGAKDVTRRY 446
           W EVY   +    +W+H D      D     E         L YI+AF+     DVT RY
Sbjct: 323 WTEVYSQSQR---RWIHCDPCENACDKPLLYEVGWGK---KLSYILAFSKDQVADVTWRY 376

Query: 447 CMKWYRIASKR--VNSAW 462
             K   + S+R  V   W
Sbjct: 377 SCKHPEVLSRRTQVQETW 394


>sp|Q30NS7|PYRG_SULDN CTP synthase OS=Sulfurimonas denitrificans (strain ATCC 33889 / DSM
           1251) GN=pyrG PE=3 SV=1
          Length = 542

 Score = 33.1 bits (74), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 32/124 (25%), Positives = 52/124 (41%), Gaps = 7/124 (5%)

Query: 186 TRFVSILDVASLKPEADKNVSSNQDSSRVGGGIFNAPTLMVAKPEEVLASPVK---SFSC 242
           T FV +  +  +K   +      Q S +    I   P +++A+ E  L    K   + SC
Sbjct: 168 TFFVHVTLIPFIKAAGELKSKPTQHSVQELRRIGITPQMIIARSENALPKTFKKKLAMSC 227

Query: 243 D-KKENVCETSSKGSPECKYSSPKSNNTQSKKSPVSCELSSGNLDPSSSMACSDISEACH 301
           D   ++V E     S    Y  P S   Q+   P+S ELS G L+P+     S + +   
Sbjct: 228 DVSSDSVIEALDAAS---IYDVPISFLRQNILKPISKELSLGELNPNMEKWDSLVKKIVQ 284

Query: 302 PKEK 305
           P+ +
Sbjct: 285 PQNR 288


>sp|Q5ZJM3|NGLY1_CHICK Peptide-N(4)-(N-acetyl-beta-glucosaminyl)asparagine amidase
           OS=Gallus gallus GN=NGLY1 PE=2 SV=1
          Length = 651

 Score = 33.1 bits (74), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 20/72 (27%), Positives = 31/72 (43%), Gaps = 6/72 (8%)

Query: 387 WAEVYCSGENLTGKWVHVDAANAIIDGEQKVEAAAAACKTSLRYIVAFAGCGAKDVTRRY 446
           W EVY + +    +W+H D    + D     E         L Y++AF+     DVT RY
Sbjct: 334 WTEVYSASQK---RWLHCDPCENVCDKPLLYEIGWGK---KLSYVIAFSKDEVVDVTWRY 387

Query: 447 CMKWYRIASKRV 458
             K   + ++R 
Sbjct: 388 SCKHQEVLTRRT 399


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.313    0.128    0.372 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 304,204,466
Number of Sequences: 539616
Number of extensions: 12853612
Number of successful extensions: 49013
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 18
Number of HSP's successfully gapped in prelim test: 99
Number of HSP's that attempted gapping in prelim test: 47962
Number of HSP's gapped (non-prelim): 1006
length of query: 833
length of database: 191,569,459
effective HSP length: 126
effective length of query: 707
effective length of database: 123,577,843
effective search space: 87369535001
effective search space used: 87369535001
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 66 (30.0 bits)