BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 016531
         (388 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|F4HVY0|CER1_ARATH Protein ECERIFERUM 1 OS=Arabidopsis thaliana GN=CER1 PE=1 SV=1
          Length = 625

 Score =  340 bits (871), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 184/333 (55%), Positives = 225/333 (67%), Gaps = 34/333 (10%)

Query: 58  GTQFPIWRLDGVILMALLHAGPVEFVYYWLHRALHHHYLYSRSHHHSS--IVPEPITSVT 115
             Q P WR DGV++ AL+H GPVEF+YYWLH+ALHHH+LYSR H H    IV EPITSV 
Sbjct: 116 AKQLPWWRTDGVLMAALIHTGPVEFLYYWLHKALHHHFLYSRYHSHHHSSIVTEPITSVI 175

Query: 116 RPFAEHITYFVLFATPLITTALTGAGSIVPAFGYITYIDLMNNMGHCNFGLIPKWLFTIF 175
            PFAEHI YF+LFA PL+TT LT   SI+   GYI YID MNNMGHCNF LIPK LF +F
Sbjct: 176 HPFAEHIAYFILFAIPLLTTLLTKTASIISFAGYIIYIDFMNNMGHCNFELIPKRLFHLF 235

Query: 176 PPLKYLMYTPS-----------------PL------------TASYAAPGELLDDSLDVV 206
           PPLK+L YTPS                 PL               Y    E  DD +DVV
Sbjct: 236 PPLKFLCYTPSYHSLHHTQFRTNYSLFMPLYDYIYGTMDESTDTLYEKTLERGDDIVDVV 295

Query: 207 YLTHLTTPESIYHMRLGLASLASKPHQHASSEWYKWLLWPVTLFSMMITWIYGRTFVVER 266
           +LTHLTTPESIYH+R+GLAS AS P  +    W+  LLWP T  SM+ T  Y R FV ER
Sbjct: 296 HLTHLTTPESIYHLRIGLASFASYPFAY---RWFMRLLWPFTSLSMIFTLFYARLFVAER 352

Query: 267 NRLNKLKLQTWAKSKYNMQYFSQQPNESINRLIEEAILEAEEKGARVISLGLLNQGEELN 326
           N  NKL LQ+W   +YN+QY  +   E+IN +IE+AILEA++KG +V+SLGL+NQGEELN
Sbjct: 353 NSFNKLNLQSWVIPRYNLQYLLKWRKEAINNMIEKAILEADKKGVKVLSLGLMNQGEELN 412

Query: 327 RYGGLFVHKNPELKIKVVDGSSLAVAVLTNSIP 359
           R G +++H +P++K+++VDGS LA AV+ NS+P
Sbjct: 413 RNGEVYIHNHPDMKVRLVDGSRLAAAVVINSVP 445


>sp|F4HVX7|CERL1_ARATH Protein CER1-like 1 OS=Arabidopsis thaliana GN=At1g02190 PE=2 SV=1
          Length = 627

 Score =  314 bits (805), Expect = 7e-85,   Method: Compositional matrix adjust.
 Identities = 179/338 (52%), Positives = 217/338 (64%), Gaps = 33/338 (9%)

Query: 57  GGTQFPIWRLDGVILMALLHAGPVEFVYYWLHRALHHHYLYSRSHHHSS--IVPEPITSV 114
           G +  P WRLDG ILMALLHAGPVEF+YYW HRALHHH+LYSR H H    IV EPITSV
Sbjct: 115 GASHLPPWRLDGAILMALLHAGPVEFLYYWFHRALHHHFLYSRYHSHHHSSIVTEPITSV 174

Query: 115 TRPFAEHITYFVLFATPLITTALTGAGSIVPAFGYITYIDLMNNMGHCNFGLIPKWLFTI 174
             PFAEHI Y +LFA P++T +L G  SIV   GYITYID MNNMGHCNF L PK LF +
Sbjct: 175 VHPFAEHIAYTLLFAIPMVTASLCGILSIVSIMGYITYIDFMNNMGHCNFELFPKRLFHL 234

Query: 175 FPPLKYLMYTPS----------------------------PLTAS-YAAPGELLDDSLDV 205
           FPPLK+L YTPS                             LT S Y    E+ ++S DV
Sbjct: 235 FPPLKFLCYTPSFHSLHHTQFRTNYSLFMPIYDFIYGTTDNLTDSLYERSLEIEEESPDV 294

Query: 206 VYLTHLTTPESIYHMRLGLASLASKPHQHASSEWYKWLLWPVTLFS--MMITWIYGRTFV 263
           ++LTHLTT  SIY MRLG  SL+S P       +    +WP TL     + + I  RTFV
Sbjct: 295 IHLTHLTTHNSIYQMRLGFPSLSSCPLWSRPPWYLTCFMWPFTLLCSFALTSAIPLRTFV 354

Query: 264 VERNRLNKLKLQTWAKSKYNMQYFSQQPNESINRLIEEAILEAEEKGARVISLGLLNQGE 323
            ERNRL  L + +    K++  Y SQ+ +ESIN +IEEAILEA+EKG +V+SLGL+N  E
Sbjct: 355 FERNRLRDLTVHSHLLPKFSFHYKSQRHHESINTIIEEAILEADEKGVKVMSLGLMNNRE 414

Query: 324 ELNRYGGLFVHKNPELKIKVVDGSSLAVAVLTNSIPAE 361
           ELN  G ++V K P+LKI++VDGSS+A  V+ N+IP E
Sbjct: 415 ELNGSGEMYVQKYPKLKIRLVDGSSMAATVVINNIPKE 452


>sp|F4IR05|CERL2_ARATH Protein CER1-like 2 OS=Arabidopsis thaliana GN=At2g37700 PE=2 SV=1
          Length = 613

 Score =  308 bits (789), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 175/343 (51%), Positives = 220/343 (64%), Gaps = 36/343 (10%)

Query: 51  LTEREIGGTQ-FPIWRLDGVILMALLHAGPVEFVYYWLHRALHHHYLYSRSHHHSS--IV 107
           LT+  + GT   P WR DGVIL+ALLHAGPVEF+YYW HRALHHH+LYSR H H    IV
Sbjct: 104 LTKVYVSGTSTIPFWRTDGVILVALLHAGPVEFIYYWFHRALHHHFLYSRYHSHHHSSIV 163

Query: 108 PEPITSVTRPFAEHITYFVLFATPLITTALTGAGSIVPAFGYITYIDLMNNMGHCNFGLI 167
            EPITSV  PFAEHI Y ++   PLITT + G  S+V    Y+TYID MNNMGHCNF LI
Sbjct: 164 TEPITSVVHPFAEHIGYTLILGLPLITTFMCGTVSVVSIALYLTYIDFMNNMGHCNFELI 223

Query: 168 PKWLFTIFPPLKYLMYTPS-----------------PLTASYAAPGELLDDSL------- 203
           PK+LF++ PPLK+L YTPS                 P+        +   DSL       
Sbjct: 224 PKFLFSLLPPLKFLCYTPSFHSLHHTQFRTNYSLFMPMYDYIYGTTDECSDSLYETSLEK 283

Query: 204 -----DVVYLTHLTTPESIYHMRLGLASLASKPHQHASSEWYKWLLWPVTL-FSMMITWI 257
                D ++LTHLT+ +SIYH+RLG ASL+S P    SS  Y +L+ P  L  S ++   
Sbjct: 284 EEEKPDAIHLTHLTSLDSIYHLRLGFASLSSHP---LSSRCYLFLMKPFALILSFILRSF 340

Query: 258 YGRTFVVERNRLNKLKLQTWAKSKYNMQYFSQQPNESINRLIEEAILEAEEKGARVISLG 317
             +TFVVERNR   L L +    K++  Y S Q  E IN++IE AILEA++KG +V+SLG
Sbjct: 341 SFQTFVVERNRFRDLTLHSHLLPKFSSHYMSHQQKECINKMIEAAILEADKKGVKVMSLG 400

Query: 318 LLNQGEELNRYGGLFVHKNPELKIKVVDGSSLAVAVLTNSIPA 360
           LLNQGEELN YG ++V ++P+LKI++VDG SLA  V+ +SIP 
Sbjct: 401 LLNQGEELNGYGEMYVRRHPKLKIRIVDGGSLAAEVVLHSIPV 443


>sp|Q8H1Z0|CER3_ARATH Protein ECERIFERUM 3 OS=Arabidopsis thaliana GN=CER3 PE=1 SV=1
          Length = 632

 Score =  178 bits (451), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 115/365 (31%), Positives = 175/365 (47%), Gaps = 38/365 (10%)

Query: 59  TQFPIWRLDGVILMALLHAGPVEFVYYWLHRALH-HHYLYSRSH--HHSSIVPEPITSVT 115
              P+W   G+I + +LH    E +YY+LHR+ H ++Y ++  H  HHSS VP P+T+  
Sbjct: 118 NSLPLWNTKGLIALIVLHVTFSEPLYYFLHRSFHRNNYFFTHYHSFHHSSPVPHPMTAGN 177

Query: 116 RPFAEHITYFVLFATPLITTALTGAGSIVPAFGYITYIDLMNNMGHCNFGLIPKWLFTIF 175
               E+I   V+   PLI   L G GS+   +GY    D M  +GHCN  +    LF I 
Sbjct: 178 ATLLENIILCVVAGVPLIGCCLFGVGSLSAIYGYAVMFDFMRCLGHCNVEIFSHKLFEIL 237

Query: 176 PPLKYLMYTPSPLTASYAAPG----------ELLDDSLD--------------------- 204
           P L+YL+YTP+  +  +   G          ++L D+ +                     
Sbjct: 238 PVLRYLIYTPTYHSLHHQEMGTNFCLFMPLFDVLGDTQNPNSWELQKKIRLSAGERKRVP 297

Query: 205 -VVYLTHLTTPESIYHMRLGLASLASKPHQHASSEWYKWLLWPVTLFSMMITWIYGRTFV 263
             V+L H     S  H      S AS P+   ++  +   +WP T   M+  W + +TF+
Sbjct: 298 EFVFLAHGVDVMSAMHAPFVFRSFASMPY---TTRIFLLPMWPFTFCVMLGMWAWSKTFL 354

Query: 264 VERNRLNKLKLQTWAKSKYNMQYFSQQPNESINRLIEEAILEAEEKGARVISLGLLNQGE 323
                L     QTW   ++  QYF     + IN  IE AIL A++ G +VISL  LN+ E
Sbjct: 355 FSFYTLRNNLCQTWGVPRFGFQYFLPFATKGINDQIEAAILRADKIGVKVISLAALNKNE 414

Query: 324 ELNRYGGLFVHKNPELKIKVVDGSSLAVAVLTNSIPAEQPKWSLEAFSLRLLMPLPLPYA 383
            LN  G LFV+K+P+L+++VV G++L  AV+   IP +  +  L   + +L   + L   
Sbjct: 415 ALNGGGTLFVNKHPDLRVRVVHGNTLTAAVILYEIPKDVNEVFLTGATSKLGRAIALYLC 474

Query: 384 KREFR 388
           +R  R
Sbjct: 475 RRGVR 479


>sp|O13666|ERG32_SCHPO C-5 sterol desaturase erg32 OS=Schizosaccharomyces pombe (strain
           972 / ATCC 24843) GN=erg32 PE=3 SV=1
          Length = 329

 Score = 42.0 bits (97), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 19/35 (54%), Positives = 25/35 (71%), Gaps = 2/35 (5%)

Query: 81  EFVYYWLHRALHHHYLYSRSH--HHSSIVPEPITS 113
           +F+ YW+HRALHH +LY+  H  HH  IVP P +S
Sbjct: 163 DFLIYWIHRALHHRWLYAPLHKLHHKWIVPTPYSS 197


>sp|Q7SBB6|ERG3_NEUCR Probable C-5 sterol desaturase OS=Neurospora crassa (strain ATCC
           24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987)
           GN=NCU06207 PE=3 SV=1
          Length = 344

 Score = 37.4 bits (85), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 25/66 (37%), Positives = 31/66 (46%), Gaps = 7/66 (10%)

Query: 81  EFVYYWLHRALHHHYLYSRSH--HHSSIVPEPITSVT----RPFAEHITYFVL-FATPLI 133
           +F  YW+HR LHH  +Y   H  HH  I+P P  S        FA+ I Y +  F  PL 
Sbjct: 174 DFGIYWIHRGLHHPLVYKHLHKPHHKWIMPTPYASHAFHPIDGFAQSIPYHIFPFIFPLQ 233

Query: 134 TTALTG 139
             A  G
Sbjct: 234 KMAYVG 239


>sp|Q55D54|MSMOB_DICDI Putative methylsterol monooxygenase DDB_G0269788 OS=Dictyostelium
           discoideum GN=DDB_G0269788 PE=3 SV=1
          Length = 270

 Score = 36.2 bits (82), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 22/69 (31%), Positives = 33/69 (47%), Gaps = 16/69 (23%)

Query: 81  EFVYYWLHRALHHHYLYSRSH--HHSSIVPEPITSVTRPFAEHITYFVLFATPLITTALT 138
           +F +YW+HRALHH + Y   H  HH    P  +T+              +A PL  T + 
Sbjct: 125 DFYFYWVHRALHHGFWYKHIHKVHHDHAAPFGMTAE-------------YAHPL-ETVIL 170

Query: 139 GAGSIVPAF 147
           G G+++  F
Sbjct: 171 GVGTVIGPF 179


>sp|Q8VWZ8|SMO22_ARATH Methylsterol monooxygenase 2-2 OS=Arabidopsis thaliana GN=SMO2-2
           PE=2 SV=1
          Length = 266

 Score = 35.4 bits (80), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 26/61 (42%), Positives = 34/61 (55%), Gaps = 5/61 (8%)

Query: 81  EFVYYWLHRALHHHYLYSRSH--HHSSIVPEPITSVTRPFAEHITYFVLFATPLITTALT 138
           +FV+YW HR LH  +LY   H  HH    P  +TS     AE +  F+ FAT ++  ALT
Sbjct: 120 DFVFYWGHRILHSKWLYKNVHSVHHEYATPFGLTSEYAHPAEIL--FLGFAT-IVGPALT 176

Query: 139 G 139
           G
Sbjct: 177 G 177


>sp|Q9ZW22|SMO21_ARATH Methylsterol monooxygenase 2-1 OS=Arabidopsis thaliana GN=SMO2-1
           PE=2 SV=2
          Length = 272

 Score = 35.0 bits (79), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 26/61 (42%), Positives = 34/61 (55%), Gaps = 5/61 (8%)

Query: 81  EFVYYWLHRALHHHYLYSRSH--HHSSIVPEPITSVTRPFAEHITYFVLFATPLITTALT 138
           +FV+YW HR LH  +LY   H  HH    P  +TS     AE +  F+ FAT ++  ALT
Sbjct: 120 DFVFYWGHRILHTKWLYKNVHSVHHEYATPFGLTSEYAHPAEIL--FLGFAT-IVGPALT 176

Query: 139 G 139
           G
Sbjct: 177 G 177


>sp|O94457|ERG31_SCHPO C-5 sterol desaturase erg31 OS=Schizosaccharomyces pombe (strain
           972 / ATCC 24843) GN=erg31 PE=3 SV=1
          Length = 300

 Score = 34.7 bits (78), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 20/35 (57%), Gaps = 2/35 (5%)

Query: 81  EFVYYWLHRALHHHYLYSRSH--HHSSIVPEPITS 113
           +F  YW HR LHH Y+Y R H  HH  I+  P  S
Sbjct: 130 DFGIYWAHRFLHHRYVYPRLHKLHHKWIICTPYAS 164


>sp|Q92BZ5|NUSB_LISIN N utilization substance protein B homolog OS=Listeria innocua
           serovar 6a (strain CLIP 11262) GN=nusB PE=3 SV=1
          Length = 128

 Score = 34.3 bits (77), Expect = 1.8,   Method: Composition-based stats.
 Identities = 21/59 (35%), Positives = 32/59 (54%), Gaps = 1/59 (1%)

Query: 267 NRLNKLKLQTWAKSKYNMQYFSQQPNE-SINRLIEEAILEAEEKGARVISLGLLNQGEE 324
           +RLNK+ L     S Y ++Y    PN  S+N  IE A + ++EK ++ I+  L N   E
Sbjct: 68  DRLNKVDLSLLRLSVYEIKYLDDVPNRVSLNESIEIAKIYSDEKSSKFINGVLANIAPE 126


>sp|Q8J207|ERG3_LEPMC C-5 sterol desaturase OS=Leptosphaeria maculans GN=ERG3 PE=3 SV=1
          Length = 356

 Score = 33.9 bits (76), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 15/35 (42%), Positives = 19/35 (54%), Gaps = 2/35 (5%)

Query: 81  EFVYYWLHRALHHHYLYSRSH--HHSSIVPEPITS 113
           +   YW+HR LHH  +Y   H  HH  I+P P  S
Sbjct: 185 DLCIYWIHRGLHHPMVYKHIHKPHHKWIMPTPFAS 219


>sp|O75845|SC5D_HUMAN Lathosterol oxidase OS=Homo sapiens GN=SC5DL PE=1 SV=2
          Length = 299

 Score = 33.5 bits (75), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 15/31 (48%), Positives = 18/31 (58%), Gaps = 2/31 (6%)

Query: 85  YWLHRALHHHYLYSRSH--HHSSIVPEPITS 113
           YW+HR LHH  +Y R H  HH   +P P  S
Sbjct: 135 YWIHRGLHHRLVYKRLHKPHHIWKIPTPFAS 165


>sp|Q28944|CATL1_PIG Cathepsin L1 OS=Sus scrofa GN=CTSL1 PE=2 SV=1
          Length = 334

 Score = 33.1 bits (74), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 27/95 (28%), Positives = 45/95 (47%), Gaps = 9/95 (9%)

Query: 222 LGLASLASKPHQHASSEWYKWLLWPVTLFSMMIT-WIYGRTFVVERN-RLNKLKLQTWAK 279
           LG+AS A K  Q+  ++WYKW      L+ M    W   R  V E+N ++ +L  Q +++
Sbjct: 12  LGIASAAPKLDQNLDADWYKWKATHGRLYGMNEEGW---RRAVWEKNMKMIELHNQEYSQ 68

Query: 280 SKYN----MQYFSQQPNESINRLIEEAILEAEEKG 310
            K+     M  F    NE   +++     +  +KG
Sbjct: 69  GKHGFSMAMNAFGDMTNEEFRQVMNGFQNQKHKKG 103


>sp|Q3L8U1|CHD9_HUMAN Chromodomain-helicase-DNA-binding protein 9 OS=Homo sapiens GN=CHD9
           PE=1 SV=2
          Length = 2897

 Score = 33.1 bits (74), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 15/47 (31%), Positives = 27/47 (57%)

Query: 268 RLNKLKLQTWAKSKYNMQYFSQQPNESINRLIEEAILEAEEKGARVI 314
           +L   +++  ++ K   +  S+   E  NR+I EAI +A+E+G R I
Sbjct: 499 KLQNTQVRVMSEKKQRKKVESESKQEKANRIISEAIAKAKERGERNI 545


>sp|Q55D52|MSMOA_DICDI Putative methylsterol monooxygenase DDB_G0270946 OS=Dictyostelium
           discoideum GN=DDB_G0270946 PE=3 SV=2
          Length = 267

 Score = 33.1 bits (74), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 39/81 (48%), Gaps = 17/81 (20%)

Query: 70  ILMALLHAGPVE-FVYYWLHRALHHHYLYSRSH--HHSSIVPEPITSVTRPFAEHITYFV 126
           +L+ L+ +  +E F +YW HRALHH   Y   H  HH    P  IT      AE      
Sbjct: 113 LLLTLVSSFIIEDFYFYWAHRALHHGIWYKYIHKVHHDYASPFGIT------AE------ 160

Query: 127 LFATPLITTALTGAGSIVPAF 147
            +A PL T  L G G+++  F
Sbjct: 161 -YAHPLETIIL-GVGTVIGPF 179


>sp|A1K6P9|Y1887_AZOSB UPF0246 protein azo1887 OS=Azoarcus sp. (strain BH72) GN=azo1887
           PE=3 SV=1
          Length = 261

 Score = 32.7 bits (73), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 19/57 (33%), Positives = 29/57 (50%), Gaps = 5/57 (8%)

Query: 174 IFPPLKYLMYTPSPLTASYAAPGELLDDSLDVVYLTHLTTPESIYHMRLGLASLASK 230
           +  P K L Y   P+TA +  P + LDD+ +++ +    TP  I     GL SL+ K
Sbjct: 4   VISPAKALDYETPPVTARFTTP-DFLDDAAELIDILRTCTPADI----AGLMSLSDK 55


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.322    0.136    0.424 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 146,019,196
Number of Sequences: 539616
Number of extensions: 5982113
Number of successful extensions: 15184
Number of sequences better than 100.0: 24
Number of HSP's better than 100.0 without gapping: 10
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 15158
Number of HSP's gapped (non-prelim): 30
length of query: 388
length of database: 191,569,459
effective HSP length: 119
effective length of query: 269
effective length of database: 127,355,155
effective search space: 34258536695
effective search space used: 34258536695
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 62 (28.5 bits)