Taking a further step from my previous post, in this one, we are going to see the application of the Golden-section Search to a density function. If you are not quite familiar with the method yet, you might want to refer first here.

Motivational Problem

Suppose we have a function

KaTeX can only parse string typed expression

f : [L, R] \to [0, \infty)

satisfying

\int_{L}^{R} f (x) d x = 1

, which shows the spread of local minimum/maximum of a function

KaTeX can only parse string typed expression

g : [L, R] \to R

that is unimodal. Denote

m

as the minimum/maximum peak point of

g

, we aim to find a point

p

such that

\int_{p - δ}^{p + δ} f (x) d x \leq ϵ s.t. ∣ m - p ∣ \leq δ

, where

KaTeX can only parse string typed expression

ϵ

represents an acceptable error and

ϵ < 1

. We achieve this by only querying the value of the function at specific points, i.e., evaluating

f (x)

for chosen values of

x

in the interval

[L, R]

, as efficiently as possible.

Golden-section Search Method

Performing search based on the unimodal function

In the previous post, we have shown that in the Golden-section Search method, we usually want to have our pivot points to be located in both

KaTeX can only parse string typed expression

L + (1 - \frac{1}{ϕ}) (R - L)

and

R - (1 - \frac{1}{ϕ}) (R - L)

. However, we notice that the number of queries depends on the value of

L

and

R

; consequently, when the range between

L

and

R

is big enough (say they are approaching

- \infty

and

\infty

), the number of queries used will also become bigger. That being said, performing a search based on function

g

will not be reliable.

Performing search based on the density function

To make up for this, we need to make an adjustment for which interval we are going to query as our base. By using prior knowledge about the statistical method, we can focus our search on where the minimum / maximum is more likely to be found. Hence, we want to perform our search based on function

KaTeX can only parse string typed expression

f

. For the pivots that we needed, we want to set

p_{1}

and

p_{2}

as values that satisfy

\int_{L}^{p_{1}} f (x) d x = \int_{L}^{L} f (x) d x + (1 - \frac{1}{ϕ}) \int_{L}^{R} f (x) d x = (1 - \frac{1}{ϕ}) \int_{L}^{R} f (x) d x

and

\int_{L}^{p_{2}} f (x) d x = \int_{L}^{R} f (x) d x - (1 - \frac{1}{ϕ}) \int_{L}^{R} f (x) d x = \frac{1}{ϕ} \int_{L}^{R} f (x) d x

Figure 1

Figure 2

This works well as after each transition; we manage to remove either

KaTeX can only parse string typed expression

[L, p_{1}]

[p_{2}, R]

from our search space. Although, in a sense, it is quite hard to calculate the number of queries needed as

p_{1}

and

p_{2}

do not entirely depends on both

L

and

R

, but also

f

. However, notice that we can always choose

p = \frac{L + R}{2}

and

δ = \frac{R - L}{2}

in any valid interval that subject to

∣ m - p ∣ \leq δ

. Denote

k

\int_{p - δ}^{p + δ} f (x) d x

, let's analyze the two cases:

Case
KaTeX can only parse string typed expression
$[L, p_{1}]$ is removed from the interval $[L, R]$ :
$p δ k := \frac{p _{1} + R}{2} := \frac{R - p _{1}}{2} := \int_{p_{1}}^{R} f (x) d x = \int_{L}^{R} f (x) d x - \int_{L}^{p_{1}} f (x) d x = \frac{1}{ϕ} \int_{L}^{R} f (x) d x$
Case
KaTeX can only parse string typed expression
$[p_{2}, R]$ is removed from the interval $[L, R]$ :
$p δ k := \frac{L + p _{2}}{2} := \frac{p _{2} - L}{2} := \int_{L}^{p_{2}} f (x) d x = \frac{1}{ϕ} \int_{L}^{R} f (x) d x$

Each query allows us to reduce

KaTeX can only parse string typed expression

k

\frac{1}{ϕ} k

; denote

Q

as the number of queries needed, then:

Q = 3 + ⌈ lo g_{ϕ} (\frac{1}{ϵ})⌉

Additional Variation: Density Function

When encountering a similar function, say

KaTeX can only parse string typed expression

f

, that does not satisfy the

\int_{- \infty}^{\infty} f (x) d x = 1

condition, but still represents the spread of the peak point in a unimodal function, we can make a transformation to accommodate this scenario. We will transform the function

f (x)

into

f^{'} (x)

and

ϵ

into

ϵ^{'}

, where:

f^{'} (x) = \frac{1}{\int _{- \infty}^{\infty} f ( x ) d x} f (x)

and

ϵ^{'} = \frac{1}{\int _{- \infty}^{\infty} f ( x ) d x} ϵ, ϵ < \int_{- \infty}^{\infty} f (x) d x

. And there we have it! If prior knowledge about the function's behavior or specific features of the local minima/maxima is known, we can always customize the search strategy accordingly. This could involve using different weighting schemes or adapting the pivots.

← [Tutorial] Golden-section Search [Tutorial] DP Optimization: Convex Hull Trick →

1

10

11

12

13

14

15

16

17

18

19

2

20

2020-Fall

2021-Fall

2021-Spring

2021-Summer

2022-Fall

2022-Spring

2023-Fall

2023-Spring

2024-Spring

20K-smooth

20K-smooth (no image)

21

22

23

24

25

26

27

28

29

3

30

31

32

33

34

35

36

37

4

5

6

7

8

9

appendix

appendix

appendix

archive

BIO1001-General-Biology

bonus

bonus

bucketsort

CHM1001-General-Chemistry

collections

Competitive-Programming

components

cpu

CSC1001-Introduction-to-Computer-Science-Programming-Methodology

CSC1002-Computational-Laboratory

CSC3001-Discrete-Mathematics

CSC3002-Assignment-1-src

CSC3002-Assignment-2-src

CSC3002-Assignment-3-src

CSC3002-Assignment-4-src

CSC3002-Assignment-5-src

CSC3002-Assignment-6-src

CSC3002-Introduction-to-Computer-Science-Programming-Paradigms

CSC3050-Computer-Architecture

CSC3050-Project-1-assembler

CSC3050-Project-2-simulator

CSC3050-Project-3-verilog

CSC3050-Project-4-CPU

CSC3100-Data-Structures

CSC3150-Assignment-1

CSC3150-Assignment-2

CSC3150-Assignment-3