BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 038235
         (514 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|O70279|DGC14_MOUSE Protein DGCR14 OS=Mus musculus GN=Dgcr14 PE=2 SV=2
          Length = 479

 Score =  140 bits (352), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 139/460 (30%), Positives = 214/460 (46%), Gaps = 74/460 (16%)

Query: 45  KKRAKVLDEDTYVSAIESIIERDYFPDLSKLRDRYEWLQATRTGDPVQIRDAQLKIMERR 104
           + R +VLDE+ Y+  ++++I+RD+FPD+ KL+ + E+L+A   GD  ++R   +K     
Sbjct: 35  RSRQRVLDEEEYIEGLQTVIQRDFFPDVEKLQAQKEYLEAEENGDLERMRQIAIKFG--- 91

Query: 105 GKKVISLNSDGKIRTQTETPGSTFARNFTPFDEFDVNITTPKVGVELSGEGVCRDGTDEV 164
                  ++ GKI  +   P  T A   TP    +V+  +  +G +   +G  RD  D  
Sbjct: 92  -------SALGKISREPPPPYVTPATFETP----EVHPGSAVLGNKPRPQG--RDLDDGE 138

Query: 165 DVSMR-------LDEFLRKHTSEDNDSFSKILEKVNRKRKERYQFLLEG----EKRDNNN 213
                       LD FL ++TSEDN SF +I+E    K   R+ +L +     EKR  +N
Sbjct: 139 AGEEEEKEPLPSLDVFLSQYTSEDNASFQEIMEVAKEKSHARHAWLYQAEEEFEKRQKDN 198

Query: 214 L-IEDVKKDRITDGYGTSYQPPSTLEGWKYTAKNLLMYHPADCGEAPLTEEEQAVRLKGL 272
           L +   +   I           + +E WKY AKN LMY+P       + +EEQ  +    
Sbjct: 199 LELPSAEHQAIESSQ-------AGVETWKYKAKNSLMYYPEG-----VPDEEQLFK---K 243

Query: 273 TKEINKTNTRFHGKIMDSRPNDDGTVEVLYTPVAGTTPGPVFDRDGDKMKKYDLEDMRKT 332
            ++I   NTRF   + D  P             A         + G   K+   ++  + 
Sbjct: 244 PRQIVHKNTRF---LRD--PFSQALSRSQLQQAAALNAQHKQGKVGPDGKELIPQESPRV 298

Query: 333 PNRFYVESGKKAENGYSFVKTPSPAPGADESPFITWGEIEGTPLRLETEDTP-IDIGGSG 391
                         G+ FV TPSPAPG +ESP +TWGE+E TPLR+E  ++P +D     
Sbjct: 299 -------------GGFGFVATPSPAPGVNESPLMTWGEVENTPLRVEGSESPYVD---RT 342

Query: 392 DGPHFKIPSAPTRDVMAHELSREASRKLKERSKMFQKPPLPSPYRAGSASPSGKLLSPAA 451
            GP FKI     R+ +  +++ EA+   K R+K  Q+          S +P G  LSPA 
Sbjct: 343 PGPTFKILEPGRRERLGLKMANEAA--AKNRAKK-QEALRRVTENLASLTPKG--LSPAM 397

Query: 452 QKFMRNAMAKSSSS-VDDALRASYRGS---SPVTGTPKGG 487
              ++  +++++S   D ALRASY  S   S    TP GG
Sbjct: 398 SPALQRLVSRTASKYTDRALRASYTPSPARSSHLKTPAGG 437


>sp|Q96DF8|DGC14_HUMAN Protein DGCR14 OS=Homo sapiens GN=DGCR14 PE=1 SV=1
          Length = 476

 Score =  138 bits (348), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 134/454 (29%), Positives = 212/454 (46%), Gaps = 76/454 (16%)

Query: 43  PPKKR-----------AKVLDEDTYVSAIESIIERDYFPDLSKLRDRYEWLQATRTGDPV 91
           PP+KR            +VLDE+ Y+  ++++I+RD+FPD+ KL+ + E+L+A   GD  
Sbjct: 19  PPRKREAGEAGAATSKQRVLDEEEYIEGLQTVIQRDFFPDVEKLQAQKEYLEAEENGDLE 78

Query: 92  QIRDAQLKIMERRGKKVISLNSDGKIRTQTETPGSTFARNFTPFDEFDVNITTPKVGVEL 151
           ++R   +K            ++ GK+  +   P  T A   TP    +V+  T  VG + 
Sbjct: 79  RMRQIAIKFG----------SALGKMSREPPPPYVTPATFETP----EVHAGTGVVGNKP 124

Query: 152 SGEGVCRDGTDEVDVSMR-----LDEFLRKHTSEDNDSFSKILEKVNRKRKERYQFLLEG 206
              G   +  +  +   +     LD FL ++TSEDN SF +I+E    + + R+ +L + 
Sbjct: 125 RPRGRGLEDGEAGEEEEKEPLPSLDVFLSRYTSEDNASFQEIMEVAKERSRARHAWLYQA 184

Query: 207 ----EKRDNNNLIEDVKKDRITDGYGTSYQPPSTLEGWKYTAKNLLMYHPADCGEAPLTE 262
               EKR  +NL     + +  +    S      +E WKY AKN LMY+P       + +
Sbjct: 185 EEEFEKRQKDNLELPSAEHQAIESSQAS------VETWKYKAKNSLMYYPEG-----VPD 233

Query: 263 EEQAVRLKGLTKEINKTNTRFHGKIMDSRPNDDGTVEVLYTPVAGTTPGPVFDRDGDKMK 322
           EEQ   L    +++   NTRF   + D  P             A         + G   K
Sbjct: 234 EEQ---LFKKPRQVVHKNTRF---LRD--PFSQALSRCQLQQAAALNAQHKQGKVGPDGK 285

Query: 323 KYDLEDMRKTPNRFYVESGKKAENGYSFVKTPSPAPGADESPFITWGEIEGTPLRLETED 382
           +   ++  +               G+ FV TPSPAPG +ESP +TWGE+E TPLR+E  +
Sbjct: 286 ELIPQESPRV-------------GGFGFVATPSPAPGVNESPMMTWGEVENTPLRVEGSE 332

Query: 383 TP-IDIGGSGDGPHFKIPSAPTRDVMAHELSREASRKLKERSKMFQKPPLPSPYRAGSAS 441
           TP +D      GP FKI     R+ +  +++ EA+   K R+K  Q+          S +
Sbjct: 333 TPYVD---RTPGPAFKILEPGRRERLGLKMANEAA--AKNRAKK-QEALRRVTENLASLT 386

Query: 442 PSGKLLSPAAQKFMRNAMAKSSSS-VDDALRASY 474
           P G  LSPA    ++  +++++S   D ALRASY
Sbjct: 387 PKG--LSPAMSPALQRLVSRTASKYTDRALRASY 418


>sp|O44424|DGC14_DROME Protein DGCR14 homolog OS=Drosophila melanogaster GN=Es2 PE=1 SV=2
          Length = 501

 Score =  135 bits (339), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 133/448 (29%), Positives = 199/448 (44%), Gaps = 81/448 (18%)

Query: 29  LTNVNNSASATQKVPP-----KKRAKVLDEDTYVSAIESIIERDYFPDLSKLRDRYEWLQ 83
           +  V NSA A  K P      K + K+L E+ Y+  +  II+RD+FPDL +LR + ++L 
Sbjct: 24  VARVQNSALAEFKKPTAMVRHKNKPKILTEEKYIEEMSKIIQRDFFPDLERLRAQNDYLD 83

Query: 84  ATRTGDPVQIRDAQLKIMERRGKKVISLNSDGKIRTQTETPGSTFARNFTPFDEFDVNIT 143
           A    D VQ+ +    I ER     IS       R       +TF    TP  +   +  
Sbjct: 84  AESRRDFVQMAE----IRERYSLGRISGTGRSTSRRNNAMSPATFE---TPVSQAKCS-N 135

Query: 144 TPKVGVELSGEGVCRDGTDEVDVSMR-------LDEFLRKHTSEDNDSFSKILEKVNRKR 196
           TP      +      DG+++ D   R       LD FL+K+TSEDN SF +I+E    K 
Sbjct: 136 TPLPNSRATDTPFSTDGSEKSDAEGRDTTAKLSLDAFLQKYTSEDNQSFQEIIETAEAKL 195

Query: 197 KERYQFLLEGEKRDNNNLIEDVKKDRITDGYGTSYQPPSTL---EGWKYTAKNLLMYHPA 253
           +++Y  L   EK       E +++  +     T ++ P  L   E W YT  N +MY P 
Sbjct: 196 RQKYAVLYNHEKLS----AEQLQRALMLPSIETQFEEPDPLRKIETWNYTNMNSIMYVP- 250

Query: 254 DCGEAPLTEEEQAVRLKGLTKEINKTNTRF----HGKIMDSRPNDDGTVEVLYTPVAGTT 309
           D  E   TEEE+ V+L    + I    TR       + MD++  +D   EV      G T
Sbjct: 251 DGVE--YTEEER-VQLAERKQSIQHNATRLPDEAKHREMDTKKLND---EVPQNGAGGAT 304

Query: 310 PGPVFDRDGDKMKKYDLEDMRKTPNRFYVESGKKAENGYSFVKTPSPAPGADESPFITWG 369
             P       K++ +DL                        +++PSP PG   SP +TWG
Sbjct: 305 ATP-------KVRGFDL------------------------LRSPSPRPGEAFSPIMTWG 333

Query: 370 EIEGTPLRLETEDTPIDIGGSGDGPHFKIPSAPTRDVMAHELSREASRKLKERSKMF--- 426
           EI+GTP RL+  DTP+       GP F+I     R+ +A  L+   S +++ + +M    
Sbjct: 334 EIDGTPFRLDGGDTPLR---PTQGPSFRINENSRRENIAIALAERVSERMRNQKQMALDT 390

Query: 427 QKPPLPSPY------RAGSASPSGKLLS 448
            +  + SP       R  S SP+ +LL+
Sbjct: 391 ARRNIGSPLIRTNMERLASMSPAAQLLA 418


>sp|P34420|DGC14_CAEEL ES2 similar protein 2 OS=Caenorhabditis elegans GN=ess-2 PE=3 SV=2
          Length = 531

 Score =  128 bits (321), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 119/435 (27%), Positives = 192/435 (44%), Gaps = 66/435 (15%)

Query: 49  KVLDEDTYVSAIESIIERDYFPDLSKLRDRYEWLQATRTGDPVQIRDAQLKIMERRGKKV 108
           +V+ E+ Y++ ++ IIE+DYFP L K++ + E+L+A    D  +I++ Q+K         
Sbjct: 40  QVVPEEKYIAGLDKIIEKDYFPHLKKMQAQKEYLEAVANKDINKIKELQMKFCS-----T 94

Query: 109 ISLNSDGKIRTQTETPGSTFARNFTPFD------------EFDVNITTPKVGVELSGEGV 156
            S+ +D   RT   T  +T A + + FD                 + +P       G+  
Sbjct: 95  GSVRTDRSFRTPITTRSTTEAPDVSSFDADTPGPSSASTSSAHDWMQSPMPFANEEGDNE 154

Query: 157 CRDGTDEVDVSMRLDEFLRKHTSEDNDSFSKILEKVNRKRKE-RYQFLLEGEKRDNNNLI 215
             +   +      L  +L K+TSEDN SF + L KV R+R++ R  ++ + E+  N NL+
Sbjct: 155 ALNRKRKKKKEETLTSYLNKYTSEDNASFEE-LAKVMREREDARRPWVYKAEEEHNKNLV 213

Query: 216 --------EDVKKDRITDGYGTSYQPPSTLEGWKYTAKNLLMYHPADCGEAPLTEEEQAV 267
                    DV+   +          P  ++ W Y A N ++++P     A LT  E A 
Sbjct: 214 TRQAIAAEADVQL-ALKHAVDADDNRPLNVDNWAYKAWNTVLFNP---DGAALTPAEIAD 269

Query: 268 RLKGLTKEINKTNTRF--HGKIMDSRPNDDGTVEVLYTPVAGTTPGPVFDRDGDKMKKYD 325
             +    EINK  TRF   GK+   +P+D+       +                   K D
Sbjct: 270 AARKQQTEINKRGTRFPDSGKL---KPSDEAMTRAAVSHALANA------------GKVD 314

Query: 326 LEDMRKTPNRFYVESGKKAENGYSFVKTPSPAPGADESPFITWGEIEGTPLRLETEDTPI 385
                 TP            N +  ++TP+P P   +SP +TWGEI+GTP RL+  D   
Sbjct: 315 FLGNEVTP-----------ANSFKLLETPNPNPDDMDSPLMTWGEIDGTPFRLDAPDV-T 362

Query: 386 DIGGSGDGPHFKIPSAPTRDVMAHELSREASRKLKERSKMFQKPPLPSPYRA----GSAS 441
           +    G  P FKIP  P R+ +A  ++   + K +++ K+  +    + +      GS  
Sbjct: 363 EHSLPGAAPVFKIPEVPYREKIAQSMNDSIAAKYRDKRKVAMRAAEGAHFSRTPGFGSKR 422

Query: 442 PSGKL--LSPAAQKF 454
            S KL  LSPAAQK 
Sbjct: 423 VSDKLAQLSPAAQKL 437


>sp|O59793|BIS1_SCHPO Stress response protein bis1 OS=Schizosaccharomyces pombe (strain
           972 / ATCC 24843) GN=bis1 PE=1 SV=1
          Length = 384

 Score = 55.5 bits (132), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 51/220 (23%), Positives = 92/220 (41%), Gaps = 47/220 (21%)

Query: 51  LDEDTYVSAIESIIERDYFPDLSKLRDRYEWLQATRTGDPVQIRDAQLKIMERRGKKVIS 110
           L+ED Y+  +  II++ YFPDL KL+      +     + V   DAQ +  + + K +I+
Sbjct: 30  LEEDDYIEGLSYIIQQQYFPDLPKLK-----AEVVLESEEVGSFDAQNESRDEKLKYLIA 84

Query: 111 LNSDGKIRTQTETPGSTFARNFTPFDEFDVNITTPKVGVELSGEGVCRDGTDEVDVSMRL 170
            NS+  +R +  +                           L+   + +   D  +  + +
Sbjct: 85  KNSEDPLRKRLPS---------------------------LAIHEITKAQLDGENKPISV 117

Query: 171 DEFLRKHTSEDNDSFSKILEKVNRKRKERYQFLLEGEKRDNNNLIEDVKKDRITDGYGTS 230
             +  K TSEDN SF +++E  +R R E+++       +  +N I+ +       GY  S
Sbjct: 118 ASYQNKFTSEDNASFGELMEDESRLRAEQHKRRFGVHSQQPSNSIQTI-------GYSNS 170

Query: 231 YQPPS--------TLEGWKYTAKNLLMYHPADCGEAPLTE 262
               S        +++ W Y  KN LMY P     + L++
Sbjct: 171 DAIKSIAWKEKDKSIKTWNYQPKNALMYTPETNHSSSLSQ 210


>sp|Q92LF6|P4H_RHIME L-proline cis-4-hydroxylase OS=Rhizobium meliloti (strain 1021)
           GN=R03107 PE=1 SV=1
          Length = 280

 Score = 35.8 bits (81), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 39/86 (45%), Gaps = 15/86 (17%)

Query: 22  SSFPESYLTNVNNSASATQKVPPKKR------------AKVLDEDTYVSAIESIIERDYF 69
           ++FPES+L NV    +    V P+K             + ++ E  Y   I SI+ + +F
Sbjct: 174 TAFPESFLRNVEQPVTTRDMVDPRKELTDEVIEGILGFSIIISEANY-REIVSILAKLHF 232

Query: 70  PDLSKLRDRYEWLQ--ATRTGDPVQI 93
              +  R  Y+WL+    R GDP  I
Sbjct: 233 FYKADCRSMYDWLKEICKRRGDPALI 258


>sp|P04809|HSP83_DROPS Heat shock protein 83 OS=Drosophila pseudoobscura pseudoobscura
           GN=Hsp83 PE=3 SV=2
          Length = 717

 Score = 35.8 bits (81), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 55/117 (47%), Gaps = 12/117 (10%)

Query: 169 RLDEFLRKHTSEDNDSFSKILEKVNR-KRKERYQFLLEGEKRD---NNNLIEDVKKDRIT 224
           +L +FLR HTS   D F  + + V+R K  +++ + + GE +D   N+  +E VK     
Sbjct: 443 KLADFLRFHTSASGDDFCSLSDYVSRMKENQKHVYFITGESKDQVSNSAFVERVKARGFE 502

Query: 225 DGYGTSYQPPSTLEGWK-YTAKNLL------MYHPADCGEAPLTEEEQAVRLKGLTK 274
             Y T       ++  K Y  K L+      +  P D  E    EE++A + +GL K
Sbjct: 503 VVYMTEPIDEYVIQHLKEYKGKQLVSVTKEGLELPEDEAEKKKREEDKA-KFEGLCK 558


>sp|Q3MHI7|RF1M_BOVIN Peptide chain release factor 1, mitochondrial OS=Bos taurus
           GN=MTRF1 PE=2 SV=1
          Length = 447

 Score = 35.8 bits (81), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 37/78 (47%), Gaps = 12/78 (15%)

Query: 188 ILEKVNRKRKERYQFLLEGEKRDNNNLIEDVKKDRITDG---------YGTSYQPPSTLE 238
           I +K+N    E +Q LL  EK D N++I +V   R T G             YQ  S+ +
Sbjct: 161 IAQKINMLYSELFQSLLPKEKYDKNDVILEVTSGRTTGGDICQQFTREIFDMYQNYSSYK 220

Query: 239 GWKYTAKNLLMYHPADCG 256
            W++    LL Y PAD G
Sbjct: 221 HWRF---ELLNYTPADYG 235


>sp|O02192|HSP83_DROAV Heat shock protein 83 OS=Drosophila auraria GN=Hsp83 PE=3 SV=1
          Length = 716

 Score = 35.0 bits (79), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 34/120 (28%), Positives = 56/120 (46%), Gaps = 12/120 (10%)

Query: 169 RLDEFLRKHTSEDNDSFSKILEKVNR-KRKERYQFLLEGEKRD---NNNLIEDVKKDRIT 224
           +L +FLR HTS   D F  + + V+R K  +++ + + GE +D   N+  +E VK     
Sbjct: 442 KLADFLRFHTSASGDDFCSLSDYVSRMKENQKHVYFITGESKDQVSNSAFVERVKARGFE 501

Query: 225 DGYGTSYQPPSTLEGWK-YTAKNLL------MYHPADCGEAPLTEEEQAVRLKGLTKEIN 277
             Y T       ++  K Y  K L+      +  P D  E    EE++A + + L K +N
Sbjct: 502 VVYMTEPIDEYVIQHLKEYKGKQLVSVTKEGLELPEDDAEKKKREEDKA-KFESLCKLMN 560


>sp|P53797|PSY_NARPS Phytoene synthase, chloroplastic OS=Narcissus pseudonarcissus
           GN=PSY PE=2 SV=1
          Length = 423

 Score = 33.5 bits (75), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 22/101 (21%), Positives = 44/101 (43%), Gaps = 9/101 (8%)

Query: 219 KKDRITDGYGTSYQPPSTLEGWKYTAKNLLMYHPADCGEAPLTEEEQAV---------RL 269
           + D + DG+  S+  PS L+ W+   ++L    P D  +A L++               +
Sbjct: 167 RTDELVDGHNASHITPSALDRWEARLEDLFAGRPYDMFDAALSDTVSRFPVDIQPFMDMV 226

Query: 270 KGLTKEINKTNTRFHGKIMDSRPNDDGTVEVLYTPVAGTTP 310
           +G+  ++ K+  +   ++        GTV ++  PV G  P
Sbjct: 227 EGMRMDLKKSRYKNFDELYLYCYYVAGTVGLMSVPVMGIAP 267


>sp|P02828|HSP83_DROME Heat shock protein 83 OS=Drosophila melanogaster GN=Hsp83 PE=1 SV=1
          Length = 717

 Score = 33.1 bits (74), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 32/55 (58%), Gaps = 4/55 (7%)

Query: 169 RLDEFLRKHTSEDNDSFSKILEKVNR-KRKERYQFLLEGEKRD---NNNLIEDVK 219
           +L +FLR HTS   D F  + + V+R K  +++ + + GE +D   N+  +E VK
Sbjct: 443 KLADFLRFHTSASGDDFCSLADYVSRMKDNQKHVYFITGESKDQVSNSAFVERVK 497


>sp|O75570|RF1M_HUMAN Peptide chain release factor 1, mitochondrial OS=Homo sapiens
           GN=MTRF1 PE=1 SV=2
          Length = 445

 Score = 33.1 bits (74), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 35/76 (46%), Gaps = 12/76 (15%)

Query: 190 EKVNRKRKERYQFLLEGEKRDNNNLIEDVKKDRITDG---------YGTSYQPPSTLEGW 240
           +K+N    E +Q L+  EK D N++I +V   R T G             YQ  S  + W
Sbjct: 161 QKINMLYNELFQSLVPKEKYDKNDVILEVTAGRTTGGDICQQFTREIFDMYQNYSCYKHW 220

Query: 241 KYTAKNLLMYHPADCG 256
           ++    LL Y PAD G
Sbjct: 221 QF---ELLNYTPADYG 233


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.311    0.130    0.376 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 211,899,464
Number of Sequences: 539616
Number of extensions: 10044405
Number of successful extensions: 25440
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 7
Number of HSP's successfully gapped in prelim test: 66
Number of HSP's that attempted gapping in prelim test: 25356
Number of HSP's gapped (non-prelim): 136
length of query: 514
length of database: 191,569,459
effective HSP length: 122
effective length of query: 392
effective length of database: 125,736,307
effective search space: 49288632344
effective search space used: 49288632344
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 64 (29.3 bits)