yohandi

A recent graduate with a strong interest in algorithms and data structures.

[Tutorial] Golden-section Search

August 09, 2023 by Yohandi

You might have heard about the Golden Ratio. It is this special number that often shows up in mathematics. The Golden Section Search uses this very ratio to solve a particular kind of problem, such as finding the minimum or maximum of a unimodal function.

Motivational Problem

Suppose we have a unimodal function

KaTeX can only parse string typed expression

f : [L, R] \to R

, where the function increases on

[L, m]

and decreases on

[m, R]

, with

m

being the point where the function reaches its only local minimum/maximum (or it can be considered as global minimum/maximum in that particular function). We aim to find

f (m \pm δ)

, where

δ

represents an acceptable error ranging in the interval

[0, 0.5]

, serving as the satisfaction threshold. We achieve this by only querying the value of the function at specific points, i.e., evaluating

f (x)

for chosen values of

x

in the interval

[L, R]

, as efficiently as possible.

Ternary Search Method

One of the well-known algorithms, Ternary Search, uses a strategy of picking two pivot points

KaTeX can only parse string typed expression

p_{1}

and

p_{2}

such that

L < p_{1} < p_{2} < R

, and then evaluating

f (p_{1})

and

f (p_{2})

. This evaluation leads to one of three possible cases, which are listed below. For the sake of simplicity, let's assume that the unimodal function has a local maximum that we are seeking to find (if we were looking for a local minimum, we would simply invert each comparison).

Case
KaTeX can only parse string typed expression
f(p1)<f(p2):

Considering that the function is increasing before the maximum point, we are faced with one of the following scenarios:
- $m \leq p_{1} < p_{2}$ , which is a contradiction under the current assumption that $f (p_{1}) < f (p_{2})$ , since the function would be increasing from $L$ until $p_{1}$ ; hence, it is impossible to have a maximum point on $p_{1}$ and before $p_{1}$ .
- $p_{1} < m \leq p_{2}$ , which implies that the maximum must lie in $(p_{1}, p_{2}]$ .
- $p_{1} < p_{2} < m$ , which implies that the function is still increasing until $p_{2}$ , meaning that the maximum must lie in $(p_{2}, R]$ .
Figure 1 & 2

This case describes the possibilities and illustrates why, in a case where
KaTeX can only parse string typed expression
f(p1)<f(p2), we confidently narrow down our search space into $(p_{1}, p_{2}] \cup (p_{2}, R] = (p_{1}, R]$ .
Case
KaTeX can only parse string typed expression
f(p1)>f(p2):

We now deal with a similar situation to the previous case. We have either:
- $m < p_{1} < p_{2}$ , which implies that the function is now in the state of decreasing after $p_{1}$ , meaning that the maximum must lie in $[L, p_{1})$ .
- $p_{1} \leq m < p_{2}$ , which implies that the maximum must lie in $[p_{1}, p_{2})$ .
- $p_{1} < p_{2} \leq m$ , which is a contradiction under the current assumption that $f (p_{1}) > f (p_{2})$ , since the function would be decreasing from $p_{2}$ onwards; hence, it is impossible to have a maximum point on $p_{2}$ and after $p_{2}$ .
Figure 3 & 4

This case describes the possibilities and illustrates why, in a case where
KaTeX can only parse string typed expression
f(p1)>f(p2), we confidently narrow down our search space into $[L, p_{1}) \cup [p_{1}, p_{2}) = [L, p_{2})$ .
Case
KaTeX can only parse string typed expression
f(p1)=f(p2):

This case is quite simple as out of the possibilities below, only one is deemed valid:
- $m \leq p_{1} < p_{2}$ , which is not a valid possibility, as this case implies that the function is now decreasing from $m$ onwards. Consequently, we have $f (p_{1}) > f (p_{2})$ , which is a contradiction under the current assumption that $f (p_{1}) = f (p_{2})$ .
- $p_{1} < m < p_{2}$ , which is valid as the maximum lies in $(p_{1}, p_{2})$ .
- $p_{1} < p_{2} \leq m$ , which is not a valid possibility, as this case implies that the function must be increasing before $m$ . Consequently, we have $f (p_{1}) < f (p_{2})$ , which is a contradiction under the current assumption that $f (p_{1}) = f (p_{2})$ .
Figure 5

This allows us to narrow down the search space into
KaTeX can only parse string typed expression
(p1,p2).

Ternary Search partitions the current search space into three equal-sized intervals. Denote

KaTeX can only parse string typed expression

N

as the length of the search space, i.e.,

N = R - L

, then the size of the partitioned interval is

\frac{N}{3}

. This means that we want to choose

p_{1} = L + \frac{N}{3}

and

p_{2} = R - \frac{N}{3}

. Denote

Q

as the number of queries used to solve the motivational problem using the Ternary Search method.

Q = 2 \times ⌈ lo g_{\frac{R - L}{m a x ( R - p _{1} , p _{2} - L , p _{2} - p _{1} )}} (R - L)⌉ = 2 \times ⌈ lo g_{\frac{N}{m a x ( R - L - \frac{N}{3} , R - \frac{N}{3} - L , R - \frac{N}{3} - L - \frac{N}{3} )}} (N)⌉ = 2 \times ⌈ lo g_{\frac{N}{m a x ( \frac{2 N}{3} , \frac{2 N}{3} , \frac{N}{3} )}} (N)⌉ = 2 \times ⌈ lo g_{\frac{3}{2}} (N)⌉

Adapting Ternary Search Method Towards Binary Search

Instead of dividing the interval into three equal parts, which typically lead to reducing the search space to either the first two-thirds, the last two-thirds, or the middle one-third, we could consider a variation where the pivots

KaTeX can only parse string typed expression

p_{1}

and

p_{2}

are chosen closer to the center of the interval. This resembles the idea of Binary Search.

By choosing both

KaTeX can only parse string typed expression

p_{1}

and

p_{2}

as close as possible to the middle of the interval, we want to set the limits as

k

approaches just above

2

p_{1} = L + \frac{N}{k}

and

p_{2} = R - \frac{N}{k}

. Then, the number of queries (

Q

) used can be calculated as follows:

Q = 2 \times ⌈ lo g_{\frac{R - L}{m a x ( R - p _{1} , p _{2} - L , p _{2} - p _{1} )}} (R - L)⌉ = 2 \times ⌈ k \to 2^{+} lim lo g_{\frac{N}{m a x ( R - L - \frac{N}{k} , R - \frac{N}{k} - L , R - \frac{N}{k} - L - \frac{N}{k} )}} (N)⌉ = 2 \times ⌈ k \to 2^{+} lim lo g_{\frac{N}{m a x ( N - \frac{N}{k} , N - \frac{N}{k} , N - 2 \frac{N}{k} )}} (N)⌉ = 2 \times ⌈ k \to 2^{-} lim lo g_{k} (N)⌉ = 2 \times ⌈ lo g_{2} (N)⌉

Golden-section Search Method

The approach of the Golden-section Search is quite similar to the Ternary Search, as the idea of maintaining two pivots is inherited. For the sake of simplicity, we will once again assume that the unimodal function has a local maximum that we are seeking to find. Let's denote the two pivots as

KaTeX can only parse string typed expression

p_{1}

and

p_{2}

, which satisfy

L < p_{1} < p_{2} < R

, and be certain that the point

m

is located in the interval

[L, R]

. Assume that by using the previous queries, we already have the value of

f (L)

f (R)

, and one of the

f (p_{1})

and

f (p_{2})

such that

max (f (L), f (R)) \leq f (p_{1})

f (p_{2})

. Consider that we have

f (p_{1})

(in a case where we have

f (p_{2})

, we only need to mirror the domain toward

\frac{L + R}{2}

, i.e., the middle point), our next step is to query the value of

f (p_{2})

, and we will handle it based on the following similar cases in Ternary Search Method:

Case
KaTeX can only parse string typed expression
$f (p_{1}) = f (p_{2})$ :

As we agreed that in this case the point
KaTeX can only parse string typed expression
$m$ lies in $(p_{1}, p_{2})$ , we may include this case to either $f (p_{1}) < f (p_{2})$ or $f (p_{1}) > f (p_{2})$ as $(p_{1}, p_{2}) \subset (p_{1}, R]$ and $(p_{1}, p_{2}) \subset [L, p_{2})$ .
Case
KaTeX can only parse string typed expression
$f (p_{1}) < f (p_{2})$ :

As explained, this case indicates that the point
KaTeX can only parse string typed expression
$m$ lies in $(p_{1}, R]$ . Then, we can set $L := p_{1}$ , $p_{1} := p_{2}$ , and $R := R$ , utilizing the value of $f (p_{1})$ as the new value of $f (L)$ and the value of $f (p_{2})$ as the new value of $f (p_{1})$ . The choice of pivot is made accordingly; the reason is shown later.

Figure 6
Case
KaTeX can only parse string typed expression
$f (p_{1}) > f (p_{2})$ :

Similarly, as explained, this case indicates that the point
KaTeX can only parse string typed expression
$m$ lies in $[L, p_{2})$ . Then, we can set $R := p_{2}$ , $p_{2} := p_{1}$ , and $L := L$ , utilizing the value of $f (p_{2})$ as the new value of $f (R)$ and the value of $f (p_{1})$ as the new value of $f (p_{2})$ in the new interval. Again, the choice of pivot is made accordingly; the reason is shown later.

Figure 7

Notice that after the transition, the assumptions we made are all still satisfied. With that, we manage to remove either

KaTeX can only parse string typed expression

[L, p_{1}]

[p_{2}, R]

from our search space with only

1

additional query, whereas the Ternary Search uses

2

additional queries. However, it is still quite tricky to choose the value of

p_{1}

and

p_{2}

as we want the values to be optimal when we inherit them for the next interval query.

Suppose

KaTeX can only parse string typed expression

p_{1}

is located in

L + k (R - L)

where

k \in (0, 0.5)

and a constant, then,

p_{2}

should also be located in

R - k (R - L)

. Assuming that we manage to remove the interval

[L, p_{1}]

from our search space, then, we want

p_{2}

to be located either in

p_{1} + k (R - p_{1})

as the new

p_{1}

or in

R - k (R - p_{1})

as the new

p_{2}

. Let's consider these two cases:

Case $p_{2}$ is assigned as the new $p_{1}$ :

\frac{R - p _{1}}{R - L} (R - p_{1})^{2} (R - L - k (R - L))^{2} (R - L)^{2} (1 - k)^{2} k^{2} - 3 k + 1 \Rightarrow k = \frac{R - p _{2}}{R - p _{1}} = (R - L) (R - p_{2}) = (R - L) (R - R + k (R - L)) = k (R - L)^{2} = 0 = {\frac{3 - 5}{2}, \frac{3 + 5}{2}}

Since

KaTeX can only parse string typed expression

k \in (0, 0.5)

, we only consider

k = \frac{3 - 5}{2}

Case $p_{2}$ is assigned as the new $p_{2}$ :

\frac{R - p _{2}}{R - L} R - L L \Rightarrow k = \frac{R - p _{2}}{R - p _{1}} = R - p_{1} = p_{1} = 0

Since

KaTeX can only parse string typed expression

k \in / (0, 0.5)

, this case is not considered. With this, we have shown the reason for the choice of the new pivot.

The only

KaTeX can only parse string typed expression

k

that satisfies our needs is in the case where

p_{2}

is assigned as the new

p_{1}

. With that, we can calculate the number of queries used to solve the motivational problem using the Golden-section Search Method. Denote

Q

as the number of queries and

N

as the length of the search space, i.e.,

N = R - L

Q = 3 + ⌈ lo g_{\frac{R - L}{m a x ( R - p _{1} , p _{2} - L )}} (R - L)⌉ = 3 + ⌈ lo g_{\frac{R - L}{m a x ( R - L - k ( R - L ) , R - k ( R - L ) - L )}} (R - L)⌉ = 3 + ⌈ lo g_{\frac{N}{N ( 1 - k )}} (N)⌉ = 3 + ⌈ lo g_{\frac{2}{2 - 3 + 5}} (N)⌉ = 3 + ⌈ lo g_{\frac{5 - 1}{2}} (N)⌉ = 3 + ⌈ lo g_{ϕ} (N)⌉

Notice that the additional

KaTeX can only parse string typed expression

3

queries in this method are used for the initial

L

R

, and one of

p_{1}

and

p_{2}

. It is clear that

lo g_{ϕ} (N) < 2 lo g_{2} (N)

ϕ > 2

; hence, the Golden-section Search Method provides a better and more efficient way to solve the motivational problem.

And there we have it! By strategically positioning our pivots and leveraging the mathematics of the golden ratio, we have managed to achieve an even more efficient solution.

You may compare the number of queries used for all the explained methods side-by-side in the below neat little table.

Method	Number of Queries Used
Ternary Search	$2 \times ⌈ lo g_{\frac{3}{2}} (N)⌉$
Adapted Ternary Search	$2 \times ⌈ lo g_{2} (N)⌉$
Golden-section Search	$3 + ⌈ lo g_{ϕ} (N)⌉$

In the upcoming post, I am planning to write about the Variation of the Golden-section Search on a Density Function. Stay tuned!

← My First Blog [Tutorial] Variations of Golden-section Search on a Density Function →