Optimization and Interpolation

This HTML version of the book is provided as a convenience, but some math equations are not translated correctly. The PDF version is more reliable.

Chapter 12 Optimization and Interpolation

12.1 Optimization

“Optimal” is a fancy way of saying “best;” what that means depends on the problem. In Exercise 6, you were asked to approximate the launch angle which can produce the longest horizontal distance for a ball batted at a particular speed and from a particular height. This angle can be considered as the optimal angle.

The first step in any optimization problem is to define what “optimal” means. The second step is to define a range of values where you want to search. In this case the range of feasible values is between 0 degrees (parallel to the ground) and 90 degrees (straight up). We might guess that the optimal angle be near to 45 degrees, but we might not be sure how far from 45 degrees to look. To play it safe, we could start with the widest feasible range.

In many cases, the simplest way to get close to an optimal value is to run the simulation with a wide range of values and choose the one that yields the best result. This method does not combine efficiency and accuracy, and that combination is especially important in a case like this where computing the distances is expensive.

12.2 Golden section search

To present the main ideas behind the Golden Section Search, I will present a simplified search algorithm I’ll call a Silver Section Search. The basic idea is similar to the methods for zero-finding we saw in Section 7.5. In the case of zero-finding, we had a picture like this:

We are given a function, f, that we can evaluate, and we want to find a root of f; that is, a value of x that makes f(x)=0. If we can find a value, x₁, that makes f(x₁) positive and another value, x₂, that makes f(x₂) negative, then there has to be a root in between (as long as f is continuous). In this case we say that x₁ and x₂ “bracket” the root.

The algorithm proceeds by choosing a third value, x₃, between x₁ and x₂ and then evaluating y = f(x₃). If y is positive, we can form a new pair, (x₃, x₂), that brackets the root. If y is negative then the pair (x₁, x₃) brackets the root. Either way the size of the bracket gets smaller, so our estimate of the location of the root gets better.

So that was root-finding. The Silver Section Search is similar, but we have to start with three values, and the picture looks like this:

This diagram shows that we have evaluated f in three places, x₁, x₂, and x₃, and found that x₂ yields the highest value. If f is continuous, then there has to be at least one local maximum between x₁ and x₃, so we would say that the triple (x₁, x₂, x₃) brackets a maximum.

The next step is to choose a fourth point, x₄, and evaluate f(x₄). There are two possible outcomes, depending on whether f(x₄) is greater than f(x₂):

If f(x₄) is less than than f(x₂) (shown on the left), then the new triple (x₁, x₂, x₄) brackets the maximum. If f(x₄) is greater than f(x₂) (shown on the right), then (x₂, x₄, x₃) brackets the maximum. Either way the range gets smaller and our estimate of the optimal value of x gets better.

This method works for almost any value of x₄, but some choices are better than others. In the example, I chose to bisect the bigger of the ranges (x₁, x₂), and (x₂, x₃).

The input variable is a vector that contains three values that bracket a maximum; in this case they are angles in degrees. optimize starts by evaluating distance_func for each of the three values. We assume that distance_func returns the result we want to maximize; for the baseball best-angle-for-longest-distance problem it is the horizontal distance travelled by the ball when it hits the ground.

Each time through the for loop the function chooses a value of x4, evaluates height_func, and then updates the triplet x1, x2, and x3 according to the results.

After the update, it computes the range of the bracket, x3-x1, and checks whether it is small enough. If so, it breaks out of the loop and returns the current triplet as a result. In the worst case the loop executes 200 times.

Exercise 1 I call this algorithm a Silver Section Search because it is almost as good as a Golden Section Search. Read the Wikipedia page about the Golden Section Search (https://en.wikipedia.org/wiki/Golden_section_search) and then modify this code to implement it.

Exercise 2 You can write functions that take function handles as input variables, just as fzero and ode45 do. For example, handle_func takes a function handle called func and calls it, passing pi as an argument.

function res = handle_func(func)
    func(pi)
end

You can call handle_func from the Command Window and pass different function handles as arguments:

>> handle_func(@sin)

ans = 0

>> handle_func(@cos)

ans = -1

Modify optimize so that it takes a function handle as an input variable and uses it as the function to be maximized.

Exercise 3 The MATLAB function fminsearch takes a function handle and searches for a local minimum. Read the documentation for fminsearch and use it to find the optimal launch angle of a baseball with a given velocity.

Write a function that takes the initial velocity of the baseball as an input variable, computes the launch angle that maximizes the range, and returns the optimal angle and the corresponding range as output variables. How does the optimal launch angle vary with initial velocity?
When the Red Sox won the World Series in 2007, they played the Colorado Rockies at their home field in Denver, Colorado. Find an estimate of the density of air in the Mile High City. What effect does this have on drag? Make a prediction about what effect this will have on the launch angle that produces the longest horizontal flying distance, and then use your simulation to test your prediction.
The actual drag on a baseball is more complicated than what is captured by our simple model. In particular, the drag coefficient varies with velocity. You can get some of the details from The Physics of Baseball¹; you also might find information on the web. Either way, specify a more realistic model of drag and modify your program to implement it. How big is the effect on your computed ranges? How big is the effect on the launch angles that produce the maximum horizontal flying distance?

Exercise 4 Very often, in an optimization problem, the meaning of “optimal” is not obvious.

For example, the Green Monster in Fenway Park is about 12 m high and about 97 m from home plate along the left field line. What is the the optimal angle that the ball should leave the bat so that the ball clears the Green Monster with the minimum speed.

We may be tempted to choose for the optimal angle the angle that yields the longest range (distance from home plate when it lands). But in this case we are trying to clear a 12m wall, so maybe we want the angle that yields the longest range when the ball falls through 12m.

Although either definition would be good enough for some purposes, neither is quite right. In this case the “optimal” angle is the one that yields the greatest height at the point where the ball reaches the wall, which is 97m from home plate. Remember, we are ultimately trying to minimize the speed that the ball needs to leave the bat. At this minimum speed, the ball will just clear the wall if hit at the optimal angle.

12.3 Discrete and continuous maps

When you solve an ODE analytically, the result is a function, which you can think of as a continuous map. When you use an ODE solver, you get two vectors (or a vector and a matrix), which you can think of as a discrete map.

For example, in Section 9.4, we used the following rate function to estimate the population of rats as a function of time:

T contains the time values where ode45 estimated the population; Y contains the population estimates.

Now suppose we would like to know the population on the 180th day of the year. We could search T for the value 180:

But there is no guarantee that any particular value appears in T. We can find the index where T crosses 180:

I gets the indices of all elements of T greater than 180, so I(1) is the index of the first one.

That gives us a coarse estimate of the population on Day 180. If we wanted to do a little better, we could also find the last value before Day 180:

But where in this range is the best estimate? A simple option is to choose whichever time value is closer to 180 and use the corresponding population estimate. In the example, that’s not a great choice because the time value we want is right in the middle.

12.4 Interpolation

A better option is to draw a straight line between the two points that bracket Day 180 and use the line to estimate the value in between. This process is called linear interpolation, and MATLAB provides a function named interp1 that does it:

The first two arguments specify a discrete map from the values in T to the values in Y. The third argument is the time value where we want to interpolate. The result is what we expected, about halfway between the values that bracket it.

interp1 can also take a fourth argument that specifies what kind of interpolation you want. The default is ’linear’, which does linear interpolation. Other choices include ’spline’ which uses a spline curve to fit two points on either side, and ’pchip’, which uses piecewise cubic Hermite interpolation.

In this case we expect the spline and cubic interpolations to be better than linear, because they use more of the data, and we know the function isn’t linear. But we have no reason to expect the spline to be more accurate than the cubic, or the other way around. Fortunately, they are not very different.

We can also use interp1 to project the rat population out beyond the values in T:

This process is called extrapolation. For time values near 365, extrapolation may be reasonable, but as we go farther into the “future,” we expect them to be less accurate. For example, here is the estimate we get by extrapolating for a whole year:

12.5 Interpolating the inverse function

We have used interp1 to find population as a function of time; by reversing the roles of T and Y, we can also interpolate time as a function of population. For example, we might want to know how long it takes the population to reach 20.

This use of interp1 might be confusing if you think of the arguments as x and y. You might find it helpful to think of them as the range and domain of a map (where the third argument is an element of the range).

The following plot shows f (Y plotted as a function of T) and the inverse of f (T plotted as a function of Y).

In this case we can use interp1 either way because f is a single-valued mapping, which means that for each value in the domain, there is only one value in the range that maps to it.

If we reduce the food supply so that the rat population decreases during the winter, we might see something like this:

So on Day 260, the population is about 15, but if we ask on what day the population was 15, there are two possible answers, 172.44 and 260.44. If we try to use interp1, we get the wrong answer:

On Day 196, the population is actually 16.8, so interp1 isn’t even close! The problem is that T as a function of Y is a multivalued mapping; for some values in the range there are more than one values in the domain. This causes interp1 to fail.

12.6 Field mice

As we’ve seen, one use of interpolation is to interpret the results of a numerical computation; another is to fill in the gaps between discrete measurements.

For example², suppose that the population of field mice is governed by this rate equation:

where t is time in months, y is population, a is a parameter that characterizes population growth in the absence of limitations, and b is a function of time that characterizes the effect of the food supply on the death rate.

Although b appears in the equation as a continuous function, we might not know b(t) for all t. Instead, we might only have discrete measurements:

If we use ode45 to solve the differential equation, then we don’t get to choose the values of t where the rate function (and therefore b) gets evaluated. We need to provide a function that can evaluate b everywhere:

Exercise 5 Write a rate function that uses interpolate_b to evaluate g and then use ode45 to compute the population of field mice from t=0 to t=8 with an initial population of 100 and a=0.9.

Then modify interpolate_b to use spline interpolation and run ode45 again to see how much effect the interpolation has on the results.

12.7 Glossary

12.8 Exercises

Exercise 6

A golf ball hit with backspin generates lift, which might increase the range, but the energy that goes into generating spin probably comes at the cost of lower initial velocity.³ Write a simulation of the flight of a golf ball and use it to find the launch angle and allocation of spin and initial velocity (for a fixed energy budget) that maximizes the horizontal range of the ball in the air.

The lift of a spinning ball is due to the Magnus force⁴, which is perpendicular to the axis of spin and the path of flight. The coefficient of lift is proportional to the spin rate; for a ball spinning at 3000 rpm it is about 0.1. The coefficient of drag of a golf ball is about 0.2 as long as the ball is moving faster than 20 m/s.

Robert K. Adair, Harper Paperbacks, 3rd Edition, 2002.

This example is adapted from Gerald and Wheatley, Applied Numerical Analysis, Fourth Edition, Addison-Wesley, 1989.