BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 019830
(335 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|F4HVY0|CER1_ARATH Protein ECERIFERUM 1 OS=Arabidopsis thaliana GN=CER1 PE=1 SV=1
Length = 625
Score = 342 bits (877), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 186/337 (55%), Positives = 228/337 (67%), Gaps = 34/337 (10%)
Query: 1 MLPGGTQFPIWRLDGVILMALLHAGPVEFVYYWLHRALHHHYLYSRSHHHSS--IVPEPI 58
+LP Q P WR DGV++ AL+H GPVEF+YYWLH+ALHHH+LYSR H H IV EPI
Sbjct: 112 LLPEAKQLPWWRTDGVLMAALIHTGPVEFLYYWLHKALHHHFLYSRYHSHHHSSIVTEPI 171
Query: 59 TSVTRPFAEHITYFVLFATPLITTALTGAGSIVPAFGYITYIDLMNNMGHCNFGLIPKWL 118
TSV PFAEHI YF+LFA PL+TT LT SI+ GYI YID MNNMGHCNF LIPK L
Sbjct: 172 TSVIHPFAEHIAYFILFAIPLLTTLLTKTASIISFAGYIIYIDFMNNMGHCNFELIPKRL 231
Query: 119 FTIFPPLKYLMYTPS-----------------PL------------TASYAAPGELLDDS 149
F +FPPLK+L YTPS PL Y E DD
Sbjct: 232 FHLFPPLKFLCYTPSYHSLHHTQFRTNYSLFMPLYDYIYGTMDESTDTLYEKTLERGDDI 291
Query: 150 LDVVYLTHLTTPESIYHMRLGLASLASKPHQHASSEWYKWLLWPVTLFSMMITWIYGRTF 209
+DVV+LTHLTTPESIYH+R+GLAS AS P + W+ LLWP T SM+ T Y R F
Sbjct: 292 VDVVHLTHLTTPESIYHLRIGLASFASYPFAY---RWFMRLLWPFTSLSMIFTLFYARLF 348
Query: 210 VVERNRLNKLKLQTWAKSKYNMQYFSQQPNESINRLIEEAILEAEEKGARVISLGLLNQG 269
V ERN NKL LQ+W +YN+QY + E+IN +IE+AILEA++KG +V+SLGL+NQG
Sbjct: 349 VAERNSFNKLNLQSWVIPRYNLQYLLKWRKEAINNMIEKAILEADKKGVKVLSLGLMNQG 408
Query: 270 EELNRYGGLFVHKNPELKIKVVDGSSLAVAVLTNSIP 306
EELNR G +++H +P++K+++VDGS LA AV+ NS+P
Sbjct: 409 EELNRNGEVYIHNHPDMKVRLVDGSRLAAAVVINSVP 445
>sp|F4HVX7|CERL1_ARATH Protein CER1-like 1 OS=Arabidopsis thaliana GN=At1g02190 PE=2 SV=1
Length = 627
Score = 317 bits (812), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 181/340 (53%), Positives = 219/340 (64%), Gaps = 33/340 (9%)
Query: 2 LPGGTQFPIWRLDGVILMALLHAGPVEFVYYWLHRALHHHYLYSRSHHHSS--IVPEPIT 59
LPG + P WRLDG ILMALLHAGPVEF+YYW HRALHHH+LYSR H H IV EPIT
Sbjct: 113 LPGASHLPPWRLDGAILMALLHAGPVEFLYYWFHRALHHHFLYSRYHSHHHSSIVTEPIT 172
Query: 60 SVTRPFAEHITYFVLFATPLITTALTGAGSIVPAFGYITYIDLMNNMGHCNFGLIPKWLF 119
SV PFAEHI Y +LFA P++T +L G SIV GYITYID MNNMGHCNF L PK LF
Sbjct: 173 SVVHPFAEHIAYTLLFAIPMVTASLCGILSIVSIMGYITYIDFMNNMGHCNFELFPKRLF 232
Query: 120 TIFPPLKYLMYTPS----------------------------PLTAS-YAAPGELLDDSL 150
+FPPLK+L YTPS LT S Y E+ ++S
Sbjct: 233 HLFPPLKFLCYTPSFHSLHHTQFRTNYSLFMPIYDFIYGTTDNLTDSLYERSLEIEEESP 292
Query: 151 DVVYLTHLTTPESIYHMRLGLASLASKPHQHASSEWYKWLLWPVTLFS--MMITWIYGRT 208
DV++LTHLTT SIY MRLG SL+S P + +WP TL + + I RT
Sbjct: 293 DVIHLTHLTTHNSIYQMRLGFPSLSSCPLWSRPPWYLTCFMWPFTLLCSFALTSAIPLRT 352
Query: 209 FVVERNRLNKLKLQTWAKSKYNMQYFSQQPNESINRLIEEAILEAEEKGARVISLGLLNQ 268
FV ERNRL L + + K++ Y SQ+ +ESIN +IEEAILEA+EKG +V+SLGL+N
Sbjct: 353 FVFERNRLRDLTVHSHLLPKFSFHYKSQRHHESINTIIEEAILEADEKGVKVMSLGLMNN 412
Query: 269 GEELNRYGGLFVHKNPELKIKVVDGSSLAVAVLTNSIPAE 308
EELN G ++V K P+LKI++VDGSS+A V+ N+IP E
Sbjct: 413 REELNGSGEMYVQKYPKLKIRLVDGSSMAATVVINNIPKE 452
>sp|F4IR05|CERL2_ARATH Protein CER1-like 2 OS=Arabidopsis thaliana GN=At2g37700 PE=2 SV=1
Length = 613
Score = 307 bits (787), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 172/338 (50%), Positives = 217/338 (64%), Gaps = 35/338 (10%)
Query: 2 LPGGTQFPIWRLDGVILMALLHAGPVEFVYYWLHRALHHHYLYSRSHHHSS--IVPEPIT 59
+ G + P WR DGVIL+ALLHAGPVEF+YYW HRALHHH+LYSR H H IV EPIT
Sbjct: 109 VSGTSTIPFWRTDGVILVALLHAGPVEFIYYWFHRALHHHFLYSRYHSHHHSSIVTEPIT 168
Query: 60 SVTRPFAEHITYFVLFATPLITTALTGAGSIVPAFGYITYIDLMNNMGHCNFGLIPKWLF 119
SV PFAEHI Y ++ PLITT + G S+V Y+TYID MNNMGHCNF LIPK+LF
Sbjct: 169 SVVHPFAEHIGYTLILGLPLITTFMCGTVSVVSIALYLTYIDFMNNMGHCNFELIPKFLF 228
Query: 120 TIFPPLKYLMYTPS-----------------PLTASYAAPGELLDDSL------------ 150
++ PPLK+L YTPS P+ + DSL
Sbjct: 229 SLLPPLKFLCYTPSFHSLHHTQFRTNYSLFMPMYDYIYGTTDECSDSLYETSLEKEEEKP 288
Query: 151 DVVYLTHLTTPESIYHMRLGLASLASKPHQHASSEWYKWLLWPVTL-FSMMITWIYGRTF 209
D ++LTHLT+ +SIYH+RLG ASL+S P SS Y +L+ P L S ++ +TF
Sbjct: 289 DAIHLTHLTSLDSIYHLRLGFASLSSHP---LSSRCYLFLMKPFALILSFILRSFSFQTF 345
Query: 210 VVERNRLNKLKLQTWAKSKYNMQYFSQQPNESINRLIEEAILEAEEKGARVISLGLLNQG 269
VVERNR L L + K++ Y S Q E IN++IE AILEA++KG +V+SLGLLNQG
Sbjct: 346 VVERNRFRDLTLHSHLLPKFSSHYMSHQQKECINKMIEAAILEADKKGVKVMSLGLLNQG 405
Query: 270 EELNRYGGLFVHKNPELKIKVVDGSSLAVAVLTNSIPA 307
EELN YG ++V ++P+LKI++VDG SLA V+ +SIP
Sbjct: 406 EELNGYGEMYVRRHPKLKIRIVDGGSLAAEVVLHSIPV 443
>sp|Q8H1Z0|CER3_ARATH Protein ECERIFERUM 3 OS=Arabidopsis thaliana GN=CER3 PE=1 SV=1
Length = 632
Score = 178 bits (452), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 115/365 (31%), Positives = 175/365 (47%), Gaps = 38/365 (10%)
Query: 6 TQFPIWRLDGVILMALLHAGPVEFVYYWLHRALH-HHYLYSRSH--HHSSIVPEPITSVT 62
P+W G+I + +LH E +YY+LHR+ H ++Y ++ H HHSS VP P+T+
Sbjct: 118 NSLPLWNTKGLIALIVLHVTFSEPLYYFLHRSFHRNNYFFTHYHSFHHSSPVPHPMTAGN 177
Query: 63 RPFAEHITYFVLFATPLITTALTGAGSIVPAFGYITYIDLMNNMGHCNFGLIPKWLFTIF 122
E+I V+ PLI L G GS+ +GY D M +GHCN + LF I
Sbjct: 178 ATLLENIILCVVAGVPLIGCCLFGVGSLSAIYGYAVMFDFMRCLGHCNVEIFSHKLFEIL 237
Query: 123 PPLKYLMYTPSPLTASYAAPG----------ELLDDSL---------------------- 150
P L+YL+YTP+ + + G ++L D+
Sbjct: 238 PVLRYLIYTPTYHSLHHQEMGTNFCLFMPLFDVLGDTQNPNSWELQKKIRLSAGERKRVP 297
Query: 151 DVVYLTHLTTPESIYHMRLGLASLASKPHQHASSEWYKWLLWPVTLFSMMITWIYGRTFV 210
+ V+L H S H S AS P+ ++ + +WP T M+ W + +TF+
Sbjct: 298 EFVFLAHGVDVMSAMHAPFVFRSFASMPY---TTRIFLLPMWPFTFCVMLGMWAWSKTFL 354
Query: 211 VERNRLNKLKLQTWAKSKYNMQYFSQQPNESINRLIEEAILEAEEKGARVISLGLLNQGE 270
L QTW ++ QYF + IN IE AIL A++ G +VISL LN+ E
Sbjct: 355 FSFYTLRNNLCQTWGVPRFGFQYFLPFATKGINDQIEAAILRADKIGVKVISLAALNKNE 414
Query: 271 ELNRYGGLFVHKNPELKIKVVDGSSLAVAVLTNSIPAEQPKWSLEAFSLRLLMPLPLPYA 330
LN G LFV+K+P+L+++VV G++L AV+ IP + + L + +L + L
Sbjct: 415 ALNGGGTLFVNKHPDLRVRVVHGNTLTAAVILYEIPKDVNEVFLTGATSKLGRAIALYLC 474
Query: 331 KREFR 335
+R R
Sbjct: 475 RRGVR 479
>sp|O13666|ERG32_SCHPO C-5 sterol desaturase erg32 OS=Schizosaccharomyces pombe (strain
972 / ATCC 24843) GN=erg32 PE=3 SV=1
Length = 329
Score = 40.8 bits (94), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 19/35 (54%), Positives = 25/35 (71%), Gaps = 2/35 (5%)
Query: 28 EFVYYWLHRALHHHYLYSRSH--HHSSIVPEPITS 60
+F+ YW+HRALHH +LY+ H HH IVP P +S
Sbjct: 163 DFLIYWIHRALHHRWLYAPLHKLHHKWIVPTPYSS 197
>sp|Q7SBB6|ERG3_NEUCR Probable C-5 sterol desaturase OS=Neurospora crassa (strain ATCC
24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987)
GN=NCU06207 PE=3 SV=1
Length = 344
Score = 36.6 bits (83), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 31/67 (46%), Gaps = 7/67 (10%)
Query: 27 VEFVYYWLHRALHHHYLYSRSH--HHSSIVPEPITSVT----RPFAEHITYFVL-FATPL 79
+F YW+HR LHH +Y H HH I+P P S FA+ I Y + F PL
Sbjct: 173 TDFGIYWIHRGLHHPLVYKHLHKPHHKWIMPTPYASHAFHPIDGFAQSIPYHIFPFIFPL 232
Query: 80 ITTALTG 86
A G
Sbjct: 233 QKMAYVG 239
>sp|Q55D54|MSMOB_DICDI Putative methylsterol monooxygenase DDB_G0269788 OS=Dictyostelium
discoideum GN=DDB_G0269788 PE=3 SV=1
Length = 270
Score = 35.4 bits (80), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 33/69 (47%), Gaps = 16/69 (23%)
Query: 28 EFVYYWLHRALHHHYLYSRSH--HHSSIVPEPITSVTRPFAEHITYFVLFATPLITTALT 85
+F +YW+HRALHH + Y H HH P +T+ +A PL T +
Sbjct: 125 DFYFYWVHRALHHGFWYKHIHKVHHDHAAPFGMTAE-------------YAHPL-ETVIL 170
Query: 86 GAGSIVPAF 94
G G+++ F
Sbjct: 171 GVGTVIGPF 179
>sp|Q8VWZ8|SMO22_ARATH Methylsterol monooxygenase 2-2 OS=Arabidopsis thaliana GN=SMO2-2
PE=2 SV=1
Length = 266
Score = 35.4 bits (80), Expect = 0.65, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 34/61 (55%), Gaps = 5/61 (8%)
Query: 28 EFVYYWLHRALHHHYLYSRSH--HHSSIVPEPITSVTRPFAEHITYFVLFATPLITTALT 85
+FV+YW HR LH +LY H HH P +TS AE + F+ FAT ++ ALT
Sbjct: 120 DFVFYWGHRILHSKWLYKNVHSVHHEYATPFGLTSEYAHPAEIL--FLGFAT-IVGPALT 176
Query: 86 G 86
G
Sbjct: 177 G 177
>sp|Q9ZW22|SMO21_ARATH Methylsterol monooxygenase 2-1 OS=Arabidopsis thaliana GN=SMO2-1
PE=2 SV=2
Length = 272
Score = 35.0 bits (79), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 34/61 (55%), Gaps = 5/61 (8%)
Query: 28 EFVYYWLHRALHHHYLYSRSH--HHSSIVPEPITSVTRPFAEHITYFVLFATPLITTALT 85
+FV+YW HR LH +LY H HH P +TS AE + F+ FAT ++ ALT
Sbjct: 120 DFVFYWGHRILHTKWLYKNVHSVHHEYATPFGLTSEYAHPAEIL--FLGFAT-IVGPALT 176
Query: 86 G 86
G
Sbjct: 177 G 177
>sp|O94457|ERG31_SCHPO C-5 sterol desaturase erg31 OS=Schizosaccharomyces pombe (strain
972 / ATCC 24843) GN=erg31 PE=3 SV=1
Length = 300
Score = 34.7 bits (78), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 17/35 (48%), Positives = 20/35 (57%), Gaps = 2/35 (5%)
Query: 28 EFVYYWLHRALHHHYLYSRSH--HHSSIVPEPITS 60
+F YW HR LHH Y+Y R H HH I+ P S
Sbjct: 130 DFGIYWAHRFLHHRYVYPRLHKLHHKWIICTPYAS 164
>sp|Q8J207|ERG3_LEPMC C-5 sterol desaturase OS=Leptosphaeria maculans GN=ERG3 PE=3 SV=1
Length = 356
Score = 33.5 bits (75), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 15/36 (41%), Positives = 19/36 (52%), Gaps = 2/36 (5%)
Query: 27 VEFVYYWLHRALHHHYLYSRSH--HHSSIVPEPITS 60
+ YW+HR LHH +Y H HH I+P P S
Sbjct: 184 TDLCIYWIHRGLHHPMVYKHIHKPHHKWIMPTPFAS 219
>sp|O75845|SC5D_HUMAN Lathosterol oxidase OS=Homo sapiens GN=SC5DL PE=1 SV=2
Length = 299
Score = 33.1 bits (74), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 15/31 (48%), Positives = 18/31 (58%), Gaps = 2/31 (6%)
Query: 32 YWLHRALHHHYLYSRSH--HHSSIVPEPITS 60
YW+HR LHH +Y R H HH +P P S
Sbjct: 135 YWIHRGLHHRLVYKRLHKPHHIWKIPTPFAS 165
>sp|Q28944|CATL1_PIG Cathepsin L1 OS=Sus scrofa GN=CTSL1 PE=2 SV=1
Length = 334
Score = 32.7 bits (73), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 27/95 (28%), Positives = 45/95 (47%), Gaps = 9/95 (9%)
Query: 169 LGLASLASKPHQHASSEWYKWLLWPVTLFSMMIT-WIYGRTFVVERN-RLNKLKLQTWAK 226
LG+AS A K Q+ ++WYKW L+ M W R V E+N ++ +L Q +++
Sbjct: 12 LGIASAAPKLDQNLDADWYKWKATHGRLYGMNEEGW---RRAVWEKNMKMIELHNQEYSQ 68
Query: 227 SKYN----MQYFSQQPNESINRLIEEAILEAEEKG 257
K+ M F NE +++ + +KG
Sbjct: 69 GKHGFSMAMNAFGDMTNEEFRQVMNGFQNQKHKKG 103
>sp|A1K6P9|Y1887_AZOSB UPF0246 protein azo1887 OS=Azoarcus sp. (strain BH72) GN=azo1887
PE=3 SV=1
Length = 261
Score = 32.7 bits (73), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 19/57 (33%), Positives = 29/57 (50%), Gaps = 5/57 (8%)
Query: 121 IFPPLKYLMYTPSPLTASYAAPGELLDDSLDVVYLTHLTTPESIYHMRLGLASLASK 177
+ P K L Y P+TA + P + LDD+ +++ + TP I GL SL+ K
Sbjct: 4 VISPAKALDYETPPVTARFTTP-DFLDDAAELIDILRTCTPADI----AGLMSLSDK 55
>sp|Q55D52|MSMOA_DICDI Putative methylsterol monooxygenase DDB_G0270946 OS=Dictyostelium
discoideum GN=DDB_G0270946 PE=3 SV=2
Length = 267
Score = 32.3 bits (72), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 38/81 (46%), Gaps = 17/81 (20%)
Query: 17 ILMALLHAGPVE-FVYYWLHRALHHHYLYSRSH--HHSSIVPEPITSVTRPFAEHITYFV 73
+L+ L+ + +E F +YW HRALHH Y H HH P IT+
Sbjct: 113 LLLTLVSSFIIEDFYFYWAHRALHHGIWYKYIHKVHHDYASPFGITAE------------ 160
Query: 74 LFATPLITTALTGAGSIVPAF 94
+A PL T L G G+++ F
Sbjct: 161 -YAHPLETIIL-GVGTVIGPF 179
>sp|Q54215|DNAK_STRGR Chaperone protein DnaK OS=Streptomyces griseus GN=dnaK PE=3 SV=1
Length = 618
Score = 32.3 bits (72), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 27/52 (51%)
Query: 221 LQTWAKSKYNMQYFSQQPNESINRLIEEAILEAEEKGARVISLGLLNQGEEL 272
L T + K + S P + +NR+ EEA AEE AR + NQGE+L
Sbjct: 462 LGTGKEQKMTVTGGSSLPKDEVNRMREEAEKYAEEDHARREAAESRNQGEQL 513
>sp|B1VMF3|DNAK_STRGG Chaperone protein DnaK OS=Streptomyces griseus subsp. griseus
(strain JCM 4626 / NBRC 13350) GN=dnaK PE=3 SV=1
Length = 617
Score = 32.3 bits (72), Expect = 5.8, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 27/52 (51%)
Query: 221 LQTWAKSKYNMQYFSQQPNESINRLIEEAILEAEEKGARVISLGLLNQGEEL 272
L T + K + S P + +NR+ EEA AEE AR + NQGE+L
Sbjct: 462 LGTGKEQKMTVTGGSSLPKDEVNRMREEAEKYAEEDHARREAAESRNQGEQL 513
>sp|Q3L8U1|CHD9_HUMAN Chromodomain-helicase-DNA-binding protein 9 OS=Homo sapiens GN=CHD9
PE=1 SV=2
Length = 2897
Score = 32.0 bits (71), Expect = 6.4, Method: Composition-based stats.
Identities = 15/47 (31%), Positives = 27/47 (57%)
Query: 215 RLNKLKLQTWAKSKYNMQYFSQQPNESINRLIEEAILEAEEKGARVI 261
+L +++ ++ K + S+ E NR+I EAI +A+E+G R I
Sbjct: 499 KLQNTQVRVMSEKKQRKKVESESKQEKANRIISEAIAKAKERGERNI 545
>sp|Q03PY6|ECFA1_LACBA Energy-coupling factor transporter ATP-binding protein EcfA 1
OS=Lactobacillus brevis (strain ATCC 367 / JCM 1170)
GN=ecfA1 PE=3 SV=1
Length = 290
Score = 31.6 bits (70), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 18/46 (39%), Positives = 21/46 (45%), Gaps = 6/46 (13%)
Query: 67 EHITYFVLFATPLITTALTGAGSIVPAFGYITYIDLMNNMGHCNFG 112
EH+TY TP+ TALT VP GY+ I GH G
Sbjct: 6 EHVTYTYQAGTPMAHTALTDVSLTVPDRGYLAII------GHTGSG 45
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.323 0.138 0.435
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 129,601,482
Number of Sequences: 539616
Number of extensions: 5351399
Number of successful extensions: 13750
Number of sequences better than 100.0: 27
Number of HSP's better than 100.0 without gapping: 14
Number of HSP's successfully gapped in prelim test: 13
Number of HSP's that attempted gapping in prelim test: 13720
Number of HSP's gapped (non-prelim): 33
length of query: 335
length of database: 191,569,459
effective HSP length: 118
effective length of query: 217
effective length of database: 127,894,771
effective search space: 27753165307
effective search space used: 27753165307
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 61 (28.1 bits)