BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 016531
(388 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|F4HVY0|CER1_ARATH Protein ECERIFERUM 1 OS=Arabidopsis thaliana GN=CER1 PE=1 SV=1
Length = 625
Score = 340 bits (871), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 184/333 (55%), Positives = 225/333 (67%), Gaps = 34/333 (10%)
Query: 58 GTQFPIWRLDGVILMALLHAGPVEFVYYWLHRALHHHYLYSRSHHHSS--IVPEPITSVT 115
Q P WR DGV++ AL+H GPVEF+YYWLH+ALHHH+LYSR H H IV EPITSV
Sbjct: 116 AKQLPWWRTDGVLMAALIHTGPVEFLYYWLHKALHHHFLYSRYHSHHHSSIVTEPITSVI 175
Query: 116 RPFAEHITYFVLFATPLITTALTGAGSIVPAFGYITYIDLMNNMGHCNFGLIPKWLFTIF 175
PFAEHI YF+LFA PL+TT LT SI+ GYI YID MNNMGHCNF LIPK LF +F
Sbjct: 176 HPFAEHIAYFILFAIPLLTTLLTKTASIISFAGYIIYIDFMNNMGHCNFELIPKRLFHLF 235
Query: 176 PPLKYLMYTPS-----------------PL------------TASYAAPGELLDDSLDVV 206
PPLK+L YTPS PL Y E DD +DVV
Sbjct: 236 PPLKFLCYTPSYHSLHHTQFRTNYSLFMPLYDYIYGTMDESTDTLYEKTLERGDDIVDVV 295
Query: 207 YLTHLTTPESIYHMRLGLASLASKPHQHASSEWYKWLLWPVTLFSMMITWIYGRTFVVER 266
+LTHLTTPESIYH+R+GLAS AS P + W+ LLWP T SM+ T Y R FV ER
Sbjct: 296 HLTHLTTPESIYHLRIGLASFASYPFAY---RWFMRLLWPFTSLSMIFTLFYARLFVAER 352
Query: 267 NRLNKLKLQTWAKSKYNMQYFSQQPNESINRLIEEAILEAEEKGARVISLGLLNQGEELN 326
N NKL LQ+W +YN+QY + E+IN +IE+AILEA++KG +V+SLGL+NQGEELN
Sbjct: 353 NSFNKLNLQSWVIPRYNLQYLLKWRKEAINNMIEKAILEADKKGVKVLSLGLMNQGEELN 412
Query: 327 RYGGLFVHKNPELKIKVVDGSSLAVAVLTNSIP 359
R G +++H +P++K+++VDGS LA AV+ NS+P
Sbjct: 413 RNGEVYIHNHPDMKVRLVDGSRLAAAVVINSVP 445
>sp|F4HVX7|CERL1_ARATH Protein CER1-like 1 OS=Arabidopsis thaliana GN=At1g02190 PE=2 SV=1
Length = 627
Score = 314 bits (805), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 179/338 (52%), Positives = 217/338 (64%), Gaps = 33/338 (9%)
Query: 57 GGTQFPIWRLDGVILMALLHAGPVEFVYYWLHRALHHHYLYSRSHHHSS--IVPEPITSV 114
G + P WRLDG ILMALLHAGPVEF+YYW HRALHHH+LYSR H H IV EPITSV
Sbjct: 115 GASHLPPWRLDGAILMALLHAGPVEFLYYWFHRALHHHFLYSRYHSHHHSSIVTEPITSV 174
Query: 115 TRPFAEHITYFVLFATPLITTALTGAGSIVPAFGYITYIDLMNNMGHCNFGLIPKWLFTI 174
PFAEHI Y +LFA P++T +L G SIV GYITYID MNNMGHCNF L PK LF +
Sbjct: 175 VHPFAEHIAYTLLFAIPMVTASLCGILSIVSIMGYITYIDFMNNMGHCNFELFPKRLFHL 234
Query: 175 FPPLKYLMYTPS----------------------------PLTAS-YAAPGELLDDSLDV 205
FPPLK+L YTPS LT S Y E+ ++S DV
Sbjct: 235 FPPLKFLCYTPSFHSLHHTQFRTNYSLFMPIYDFIYGTTDNLTDSLYERSLEIEEESPDV 294
Query: 206 VYLTHLTTPESIYHMRLGLASLASKPHQHASSEWYKWLLWPVTLFS--MMITWIYGRTFV 263
++LTHLTT SIY MRLG SL+S P + +WP TL + + I RTFV
Sbjct: 295 IHLTHLTTHNSIYQMRLGFPSLSSCPLWSRPPWYLTCFMWPFTLLCSFALTSAIPLRTFV 354
Query: 264 VERNRLNKLKLQTWAKSKYNMQYFSQQPNESINRLIEEAILEAEEKGARVISLGLLNQGE 323
ERNRL L + + K++ Y SQ+ +ESIN +IEEAILEA+EKG +V+SLGL+N E
Sbjct: 355 FERNRLRDLTVHSHLLPKFSFHYKSQRHHESINTIIEEAILEADEKGVKVMSLGLMNNRE 414
Query: 324 ELNRYGGLFVHKNPELKIKVVDGSSLAVAVLTNSIPAE 361
ELN G ++V K P+LKI++VDGSS+A V+ N+IP E
Sbjct: 415 ELNGSGEMYVQKYPKLKIRLVDGSSMAATVVINNIPKE 452
>sp|F4IR05|CERL2_ARATH Protein CER1-like 2 OS=Arabidopsis thaliana GN=At2g37700 PE=2 SV=1
Length = 613
Score = 308 bits (789), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 175/343 (51%), Positives = 220/343 (64%), Gaps = 36/343 (10%)
Query: 51 LTEREIGGTQ-FPIWRLDGVILMALLHAGPVEFVYYWLHRALHHHYLYSRSHHHSS--IV 107
LT+ + GT P WR DGVIL+ALLHAGPVEF+YYW HRALHHH+LYSR H H IV
Sbjct: 104 LTKVYVSGTSTIPFWRTDGVILVALLHAGPVEFIYYWFHRALHHHFLYSRYHSHHHSSIV 163
Query: 108 PEPITSVTRPFAEHITYFVLFATPLITTALTGAGSIVPAFGYITYIDLMNNMGHCNFGLI 167
EPITSV PFAEHI Y ++ PLITT + G S+V Y+TYID MNNMGHCNF LI
Sbjct: 164 TEPITSVVHPFAEHIGYTLILGLPLITTFMCGTVSVVSIALYLTYIDFMNNMGHCNFELI 223
Query: 168 PKWLFTIFPPLKYLMYTPS-----------------PLTASYAAPGELLDDSL------- 203
PK+LF++ PPLK+L YTPS P+ + DSL
Sbjct: 224 PKFLFSLLPPLKFLCYTPSFHSLHHTQFRTNYSLFMPMYDYIYGTTDECSDSLYETSLEK 283
Query: 204 -----DVVYLTHLTTPESIYHMRLGLASLASKPHQHASSEWYKWLLWPVTL-FSMMITWI 257
D ++LTHLT+ +SIYH+RLG ASL+S P SS Y +L+ P L S ++
Sbjct: 284 EEEKPDAIHLTHLTSLDSIYHLRLGFASLSSHP---LSSRCYLFLMKPFALILSFILRSF 340
Query: 258 YGRTFVVERNRLNKLKLQTWAKSKYNMQYFSQQPNESINRLIEEAILEAEEKGARVISLG 317
+TFVVERNR L L + K++ Y S Q E IN++IE AILEA++KG +V+SLG
Sbjct: 341 SFQTFVVERNRFRDLTLHSHLLPKFSSHYMSHQQKECINKMIEAAILEADKKGVKVMSLG 400
Query: 318 LLNQGEELNRYGGLFVHKNPELKIKVVDGSSLAVAVLTNSIPA 360
LLNQGEELN YG ++V ++P+LKI++VDG SLA V+ +SIP
Sbjct: 401 LLNQGEELNGYGEMYVRRHPKLKIRIVDGGSLAAEVVLHSIPV 443
>sp|Q8H1Z0|CER3_ARATH Protein ECERIFERUM 3 OS=Arabidopsis thaliana GN=CER3 PE=1 SV=1
Length = 632
Score = 178 bits (451), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 115/365 (31%), Positives = 175/365 (47%), Gaps = 38/365 (10%)
Query: 59 TQFPIWRLDGVILMALLHAGPVEFVYYWLHRALH-HHYLYSRSH--HHSSIVPEPITSVT 115
P+W G+I + +LH E +YY+LHR+ H ++Y ++ H HHSS VP P+T+
Sbjct: 118 NSLPLWNTKGLIALIVLHVTFSEPLYYFLHRSFHRNNYFFTHYHSFHHSSPVPHPMTAGN 177
Query: 116 RPFAEHITYFVLFATPLITTALTGAGSIVPAFGYITYIDLMNNMGHCNFGLIPKWLFTIF 175
E+I V+ PLI L G GS+ +GY D M +GHCN + LF I
Sbjct: 178 ATLLENIILCVVAGVPLIGCCLFGVGSLSAIYGYAVMFDFMRCLGHCNVEIFSHKLFEIL 237
Query: 176 PPLKYLMYTPSPLTASYAAPG----------ELLDDSLD--------------------- 204
P L+YL+YTP+ + + G ++L D+ +
Sbjct: 238 PVLRYLIYTPTYHSLHHQEMGTNFCLFMPLFDVLGDTQNPNSWELQKKIRLSAGERKRVP 297
Query: 205 -VVYLTHLTTPESIYHMRLGLASLASKPHQHASSEWYKWLLWPVTLFSMMITWIYGRTFV 263
V+L H S H S AS P+ ++ + +WP T M+ W + +TF+
Sbjct: 298 EFVFLAHGVDVMSAMHAPFVFRSFASMPY---TTRIFLLPMWPFTFCVMLGMWAWSKTFL 354
Query: 264 VERNRLNKLKLQTWAKSKYNMQYFSQQPNESINRLIEEAILEAEEKGARVISLGLLNQGE 323
L QTW ++ QYF + IN IE AIL A++ G +VISL LN+ E
Sbjct: 355 FSFYTLRNNLCQTWGVPRFGFQYFLPFATKGINDQIEAAILRADKIGVKVISLAALNKNE 414
Query: 324 ELNRYGGLFVHKNPELKIKVVDGSSLAVAVLTNSIPAEQPKWSLEAFSLRLLMPLPLPYA 383
LN G LFV+K+P+L+++VV G++L AV+ IP + + L + +L + L
Sbjct: 415 ALNGGGTLFVNKHPDLRVRVVHGNTLTAAVILYEIPKDVNEVFLTGATSKLGRAIALYLC 474
Query: 384 KREFR 388
+R R
Sbjct: 475 RRGVR 479
>sp|O13666|ERG32_SCHPO C-5 sterol desaturase erg32 OS=Schizosaccharomyces pombe (strain
972 / ATCC 24843) GN=erg32 PE=3 SV=1
Length = 329
Score = 42.0 bits (97), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 19/35 (54%), Positives = 25/35 (71%), Gaps = 2/35 (5%)
Query: 81 EFVYYWLHRALHHHYLYSRSH--HHSSIVPEPITS 113
+F+ YW+HRALHH +LY+ H HH IVP P +S
Sbjct: 163 DFLIYWIHRALHHRWLYAPLHKLHHKWIVPTPYSS 197
>sp|Q7SBB6|ERG3_NEUCR Probable C-5 sterol desaturase OS=Neurospora crassa (strain ATCC
24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987)
GN=NCU06207 PE=3 SV=1
Length = 344
Score = 37.4 bits (85), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 31/66 (46%), Gaps = 7/66 (10%)
Query: 81 EFVYYWLHRALHHHYLYSRSH--HHSSIVPEPITSVT----RPFAEHITYFVL-FATPLI 133
+F YW+HR LHH +Y H HH I+P P S FA+ I Y + F PL
Sbjct: 174 DFGIYWIHRGLHHPLVYKHLHKPHHKWIMPTPYASHAFHPIDGFAQSIPYHIFPFIFPLQ 233
Query: 134 TTALTG 139
A G
Sbjct: 234 KMAYVG 239
>sp|Q55D54|MSMOB_DICDI Putative methylsterol monooxygenase DDB_G0269788 OS=Dictyostelium
discoideum GN=DDB_G0269788 PE=3 SV=1
Length = 270
Score = 36.2 bits (82), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 33/69 (47%), Gaps = 16/69 (23%)
Query: 81 EFVYYWLHRALHHHYLYSRSH--HHSSIVPEPITSVTRPFAEHITYFVLFATPLITTALT 138
+F +YW+HRALHH + Y H HH P +T+ +A PL T +
Sbjct: 125 DFYFYWVHRALHHGFWYKHIHKVHHDHAAPFGMTAE-------------YAHPL-ETVIL 170
Query: 139 GAGSIVPAF 147
G G+++ F
Sbjct: 171 GVGTVIGPF 179
>sp|Q8VWZ8|SMO22_ARATH Methylsterol monooxygenase 2-2 OS=Arabidopsis thaliana GN=SMO2-2
PE=2 SV=1
Length = 266
Score = 35.4 bits (80), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 34/61 (55%), Gaps = 5/61 (8%)
Query: 81 EFVYYWLHRALHHHYLYSRSH--HHSSIVPEPITSVTRPFAEHITYFVLFATPLITTALT 138
+FV+YW HR LH +LY H HH P +TS AE + F+ FAT ++ ALT
Sbjct: 120 DFVFYWGHRILHSKWLYKNVHSVHHEYATPFGLTSEYAHPAEIL--FLGFAT-IVGPALT 176
Query: 139 G 139
G
Sbjct: 177 G 177
>sp|Q9ZW22|SMO21_ARATH Methylsterol monooxygenase 2-1 OS=Arabidopsis thaliana GN=SMO2-1
PE=2 SV=2
Length = 272
Score = 35.0 bits (79), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 34/61 (55%), Gaps = 5/61 (8%)
Query: 81 EFVYYWLHRALHHHYLYSRSH--HHSSIVPEPITSVTRPFAEHITYFVLFATPLITTALT 138
+FV+YW HR LH +LY H HH P +TS AE + F+ FAT ++ ALT
Sbjct: 120 DFVFYWGHRILHTKWLYKNVHSVHHEYATPFGLTSEYAHPAEIL--FLGFAT-IVGPALT 176
Query: 139 G 139
G
Sbjct: 177 G 177
>sp|O94457|ERG31_SCHPO C-5 sterol desaturase erg31 OS=Schizosaccharomyces pombe (strain
972 / ATCC 24843) GN=erg31 PE=3 SV=1
Length = 300
Score = 34.7 bits (78), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 20/35 (57%), Gaps = 2/35 (5%)
Query: 81 EFVYYWLHRALHHHYLYSRSH--HHSSIVPEPITS 113
+F YW HR LHH Y+Y R H HH I+ P S
Sbjct: 130 DFGIYWAHRFLHHRYVYPRLHKLHHKWIICTPYAS 164
>sp|Q92BZ5|NUSB_LISIN N utilization substance protein B homolog OS=Listeria innocua
serovar 6a (strain CLIP 11262) GN=nusB PE=3 SV=1
Length = 128
Score = 34.3 bits (77), Expect = 1.8, Method: Composition-based stats.
Identities = 21/59 (35%), Positives = 32/59 (54%), Gaps = 1/59 (1%)
Query: 267 NRLNKLKLQTWAKSKYNMQYFSQQPNE-SINRLIEEAILEAEEKGARVISLGLLNQGEE 324
+RLNK+ L S Y ++Y PN S+N IE A + ++EK ++ I+ L N E
Sbjct: 68 DRLNKVDLSLLRLSVYEIKYLDDVPNRVSLNESIEIAKIYSDEKSSKFINGVLANIAPE 126
>sp|Q8J207|ERG3_LEPMC C-5 sterol desaturase OS=Leptosphaeria maculans GN=ERG3 PE=3 SV=1
Length = 356
Score = 33.9 bits (76), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 15/35 (42%), Positives = 19/35 (54%), Gaps = 2/35 (5%)
Query: 81 EFVYYWLHRALHHHYLYSRSH--HHSSIVPEPITS 113
+ YW+HR LHH +Y H HH I+P P S
Sbjct: 185 DLCIYWIHRGLHHPMVYKHIHKPHHKWIMPTPFAS 219
>sp|O75845|SC5D_HUMAN Lathosterol oxidase OS=Homo sapiens GN=SC5DL PE=1 SV=2
Length = 299
Score = 33.5 bits (75), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 15/31 (48%), Positives = 18/31 (58%), Gaps = 2/31 (6%)
Query: 85 YWLHRALHHHYLYSRSH--HHSSIVPEPITS 113
YW+HR LHH +Y R H HH +P P S
Sbjct: 135 YWIHRGLHHRLVYKRLHKPHHIWKIPTPFAS 165
>sp|Q28944|CATL1_PIG Cathepsin L1 OS=Sus scrofa GN=CTSL1 PE=2 SV=1
Length = 334
Score = 33.1 bits (74), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 27/95 (28%), Positives = 45/95 (47%), Gaps = 9/95 (9%)
Query: 222 LGLASLASKPHQHASSEWYKWLLWPVTLFSMMIT-WIYGRTFVVERN-RLNKLKLQTWAK 279
LG+AS A K Q+ ++WYKW L+ M W R V E+N ++ +L Q +++
Sbjct: 12 LGIASAAPKLDQNLDADWYKWKATHGRLYGMNEEGW---RRAVWEKNMKMIELHNQEYSQ 68
Query: 280 SKYN----MQYFSQQPNESINRLIEEAILEAEEKG 310
K+ M F NE +++ + +KG
Sbjct: 69 GKHGFSMAMNAFGDMTNEEFRQVMNGFQNQKHKKG 103
>sp|Q3L8U1|CHD9_HUMAN Chromodomain-helicase-DNA-binding protein 9 OS=Homo sapiens GN=CHD9
PE=1 SV=2
Length = 2897
Score = 33.1 bits (74), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 15/47 (31%), Positives = 27/47 (57%)
Query: 268 RLNKLKLQTWAKSKYNMQYFSQQPNESINRLIEEAILEAEEKGARVI 314
+L +++ ++ K + S+ E NR+I EAI +A+E+G R I
Sbjct: 499 KLQNTQVRVMSEKKQRKKVESESKQEKANRIISEAIAKAKERGERNI 545
>sp|Q55D52|MSMOA_DICDI Putative methylsterol monooxygenase DDB_G0270946 OS=Dictyostelium
discoideum GN=DDB_G0270946 PE=3 SV=2
Length = 267
Score = 33.1 bits (74), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 39/81 (48%), Gaps = 17/81 (20%)
Query: 70 ILMALLHAGPVE-FVYYWLHRALHHHYLYSRSH--HHSSIVPEPITSVTRPFAEHITYFV 126
+L+ L+ + +E F +YW HRALHH Y H HH P IT AE
Sbjct: 113 LLLTLVSSFIIEDFYFYWAHRALHHGIWYKYIHKVHHDYASPFGIT------AE------ 160
Query: 127 LFATPLITTALTGAGSIVPAF 147
+A PL T L G G+++ F
Sbjct: 161 -YAHPLETIIL-GVGTVIGPF 179
>sp|A1K6P9|Y1887_AZOSB UPF0246 protein azo1887 OS=Azoarcus sp. (strain BH72) GN=azo1887
PE=3 SV=1
Length = 261
Score = 32.7 bits (73), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 29/57 (50%), Gaps = 5/57 (8%)
Query: 174 IFPPLKYLMYTPSPLTASYAAPGELLDDSLDVVYLTHLTTPESIYHMRLGLASLASK 230
+ P K L Y P+TA + P + LDD+ +++ + TP I GL SL+ K
Sbjct: 4 VISPAKALDYETPPVTARFTTP-DFLDDAAELIDILRTCTPADI----AGLMSLSDK 55
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.322 0.136 0.424
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 146,019,196
Number of Sequences: 539616
Number of extensions: 5982113
Number of successful extensions: 15184
Number of sequences better than 100.0: 24
Number of HSP's better than 100.0 without gapping: 10
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 15158
Number of HSP's gapped (non-prelim): 30
length of query: 388
length of database: 191,569,459
effective HSP length: 119
effective length of query: 269
effective length of database: 127,355,155
effective search space: 34258536695
effective search space used: 34258536695
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 62 (28.5 bits)