Chapter 13

Curves in 3D

I didn't discover curves; I only uncovered them.

— Mae West (1892–1980)

This chapter talks about how to represent curves mathematically in 3D. Recreating a curve from its mathematical definition is relatively easy; the tricky part is obtaining a curve with desired properties, or alternatively, making a tool that designers can use to draw such curves. Our goal in this chapter is to provide a graceful and intuitive introduction to the mathematics of curves. In comparison with most of the other books on the subject, our aim is to hit the most important points, without stopping every other paragraph to prove that what we are saying is true. (We will, however, stop periodically to discuss correct pronunciation, which is probably appropriate considering that most of the people who developed the math we'll be using in this chapter were French.) Curves and splines are very useful for all sorts of reasons. There are obvious applications such as moving objects around on curved trajectories. But then the coordinates of our curve need not have a spatial interpretation; essentially, any time we wish to fit a function for a color, intensity, or other property to given data points, we have a potential application for curves and splines.

The chapter is divided roughly into two parts. The first part is about simple, “short” curves that can be described by one equation.

Section 13.1 introduces the specific type of curve we focus on almost exclusively: the parametric polynomial curve. (It pays special attention to cubic polynomials.)
Section 13.2 describes polynomial interpolation, whereby a curve is threaded through specified control points.
Section 13.3 discusses Hermite form, which describes a curve in terms of its endpoints and the derivatives at those endpoints.
Section 13.4 shows how the Bézier form specifies the curve endpoints, plus interior control points that influence the shape of the curve but are not interpolated.
Section 13.5 shows how to subdivide a curve into smaller pieces.

The second half of the chapter covers splines, which are longer curves created by joining together multiple curves in succession.

Section 13.6 introduces some basic notation, terminology, and concepts.
Section 13.7 discusses how to join together Hermite or Bézier curves into a spline.
Section 13.8 considers continuity (smoothness) conditions for splines.
Section 13.9 ends the discussion on splines by considering various methods for automatically determining the tangents of a spline at the control points.

13.1Parametric Polynomial Curves

We focus here almost exclusively on one particular type of curve, the parametric polynomial curve. It's important to understand what the two adjectives parametric and polynomial mean, so Section 13.1.1 and Section 13.1.2. discuss them in detail. Section 13.1.3 reviews some useful alternate notation. Section 13.1.4 examines the straight line, which is a particularly instructive example of a parametric polynomial curve. Section 13.1.5 considers the relationship between the endpoints of the curve and polynomial coefficients. Section 13.1.6 discusses derivatives, such as velocity and acceleration, and shows how they are related to tangent vectors and local curvature.

13.1.1Parametric Curves

The word parametric in the phrase “parametric polynomial curve” means (not altogether surprisingly) that the curve can be described by a function of an independent parameter, which is often assigned the symbol $t$ . This curve function is of the form $p (t)$ , taking a scalar input (the parameter $t$ ) and returning the point on the curve corresponding to that parameter value as a vector output. The function $p (t)$ traces out the shape of the curve as $t$ varies. For example, consider the classic parametric description of a unit circle,

\begin{matrix} (13.1) & \begin{aligned} x (t) & = \cos (2 π t), \\ y (t) & = \sin (2 π t) . \end{aligned} \end{matrix}

Parametric description of a circle

We briefly introduced parametric representation of geometric primitives in Section 9.1. Let's take a moment to review some of the alternative forms from that section so we can understand ways of describing a curve that are not parametric. An implicit representation is a relation that is true for all points in the shape being described; for example, the unit circle can be described implicitly as the set of points satisfying $x^{2} + y^{2} = 1$ . Another alternative to parametric form is the functional form, in which one coordinate is expressed as a function of the other coordinate or coordinates; for example, the top half of a unit circle can be described in functional form as $y = \sqrt{1 - x^{2}}$ .

The curve $p (t)$ could be infinite, particularly if we place no limits on the range of $t$ . Often it's useful to select a finite segment by restricting $t$ to a particular bounded domain, most commonly the domain $[0, 1]$ . It's natural to designate the “forward” direction as the direction of increasing $t$ , so the curve “starts” at $t = 0$ , “ends” at $t = 1$ , and consists of all of the points between.

Sometimes we think of the position function $p (t)$ as a single function that yields a vector result; other times it will be helpful to extract the function for a specific coordinate. For example, the scalar function $x (t)$ specifies the $x$ -coordinate of $p (t)$ , so in two dimensions $p (t) = (x (t), y (t))$ . Notice that each coordinate is specified by a function that depends only on the parameter value so that each coordinate is independent of the others. We work in the plane for the majority of this chapter because almost every important aspect of parametric curves can be demonstrated in 2D and, in general, extension into three dimensions is straightforward.

13.1.2Polynomial Curves

Now that we know what the adjective parametric means, let's turn our attention to the second important word, polynomial. A polynomial parametric curve is a parametric curve function $p (t)$ that can be written as a polynomial in $t$ :

Polynomial parametric form of arbitrary degree

n

p (t) = c_{0} + c_{1} t + c_{2} t^{2} + \dots + c_{n - 1} t^{n - 1} + c_{n} t^{n} .

The number $n$ is called the degree of the polynomial. Higher degree polynomials are more flexible in the sense that they can describe curves with more “wiggles.” However, sometimes extra “wiggles” come in that we don't want;1 more on this in Section 13.6.

We've already seen an example of a curve function that is parametric but not polynomial—the parametric circle given by Equation (13.1). The expressions for $x (t)$ and $y (t)$ are not polynomials because they use trig functions. A complete circle can't be described in parametric polynomial form, although a circular arc can be described by a rational curve. A rational curve is essentially the result of dividing one curve by another, sort of like the projective geometry of homogeneous coordinates (see Section 6.4.1). The curve in the denominator is a 1D curve. Rational curves are not as common in video games as simple polynomial curves and are not discussed in this book.

Of most interest to us are the parametric polynomial curves of degree 3, known as cubic curves. Cubic curves are those that can be expressed in the form shown in Equation (13.2).

Cubic Curve in Monomial Form

\begin{matrix} (13.2) & p (t) = c_{0} + c_{1} t + c_{2} t^{2} + c_{3} t^{3} . \end{matrix}

This method of describing curves is often called the monomial form or the power form, to emphasize the fact that the curve is specified by listing the coefficients of the powers of $t$ . Sections 13.2–13.4 discuss other methods of describing a curve with more direct geometric data, such as a list of control points that the curve is to pass through or nearby. These other forms are still polynomial curves in the sense that they can be converted to monomial form.

Once we have the coefficients, it's easy to reconstruct the curve by evaluating the function $p (t)$ for different values of $t$ . For example, let's say we wish to move a platform along a path in a video game. Our platform actor would have a state variable to remember its parametric position $t$ along the path, and at each simulation time step, we would update $t$ and set the position of the platform to $p (t)$ .

Suppose we need to render a curve. One simple way to do this is to approximate it with, say, 10 line segments, sampling the curve at $t = 0, \frac{1}{10}, \frac{2}{10}, \dots, \frac{9}{10}, 1$ and drawing line segments between consecutive sample points. We can reduce the error in the approximation to any desiredthreshold simply by using more sample points. We can do much better than this naïve approach by adaptively subdividing the curve, using more segments in the “curvier” parts and fewer in the “straighter” parts.

But where do the coefficients $c_{0}, c_{1}, c_{2}, c_{3}$ come from? How can we set them to design a particular curve? In general, the monomial form is particularly ill-suited to this task, so we use other forms and convert to monomial form when appropriate. (In many cases, we don't need the monomial form at all!) Before we discuss these other forms, however, we need to introduce some more notation and concepts about curves.

13.1.3Matrix Notation

We can rewrite the monomial form (Equation (13.2)) in several different ways. It's useful to be able to refer to a coefficient for a particular coordinate. For example, in 2D let's use the notation $c_{i} = [\begin{matrix} c_{1, i} & c_{2, i} \end{matrix}]$ so we have one polynomial per coordinate:

2D cubic curve in expanded monomial form

\begin{aligned} x (t) & = c_{1, 0} + c_{1, 1} t + c_{1, 2} t^{2} + c_{1, 3} t^{3}, \\ y (t) & = c_{2, 0} + c_{2, 1} t + c_{2, 2} t^{2} + c_{2, 3} t^{3} . \end{aligned}

Some books are fond of writing this more compactly by using matrix notation. Let's put the coefficients into a matrix $C$ and create a column vector $t$ from the powers of $t$ , such that $t_{i} = t^{i - 1}$ :

\begin{aligned} C & = [\begin{array}{c} c_{1, 0} & c_{1, 1} & c_{1, 2} & c_{1, 3} \\ c_{2, 0} & c_{2, 1} & c_{2, 2} & c_{2, 3} \end{array}], & t & = [\begin{array}{c} t^{0} \\ t^{1} \\ t^{2} \\ t^{3} \end{array}] = [\begin{array}{c} 1 \\ t \\ t^{2} \\ t^{3} \end{array}] . \end{aligned}

Now we can express our curve function $p (t)$ as a single matrix product:

2D cubic curve in monomial form, expressed as a matrix product

\begin{matrix} (13.1.3) & p (t) = C t = [\begin{matrix} c_{1, 0} & c_{1, 1} & c_{1, 2} & c_{1, 3} \\ c_{2, 0} & c_{2, 1} & c_{2, 2} & c_{2, 3} \end{matrix}] [\begin{matrix} 1 \\ t \\ t^{2} \\ t^{3} \end{matrix}] . \end{matrix}

Don't try to apply any geometric interpretations just yet. The vector

t

is not to be interpreted as a point in space, and the matrix

C

is not a transformation matrix. Although we're about to learn how to extract geometric meaning from

C

, the techniques are very different from those learned in previous chapters. For now, let's just be happy to use matrix notation purely for sake of compactness.

The matrix $C$ must be as “tall” as the number of dimensions the data have; for example, three if we have 3D data. However, we don't need to refer to specific $x$ , $y$ , or $z$ coordinates much in this chapter because most of the ideas work the same in 3D or 2D (or 1D!). We can just leave each coefficient $c_{i}$ in vector form and assume that it is a vector of the appropriate dimension, so that each $c_{i}$ corresponds to a single column of $C$ :

Coefficients as column vectors

\begin{aligned} C & = [\begin{array}{c} | & | & | & | \\ c_{0} & c_{1} & c_{2} & c_{3} \\ | & | & | & | \end{array}], & p (t) & = C t = [\begin{array}{c} | & | & | & | \\ c_{0} & c_{1} & c_{2} & c_{3} \\ | & | & | & | \end{array}] [\begin{array}{c} 1 \\ t \\ t^{2} \\ t^{3} \end{array}] . \end{aligned}

When dealing with a higher degree polynomial, the matrix $C$ is wider and the power vector $t$ is taller, since we have more coefficients and more powers of $t$ . This not only makes sense, it's the law: for the product $C t$ to be legal according to linear algebra rules, the number of columns in $C$ must match the number of rows in $t$ .

13.1.4Two Trivial Types of Curves

Although you're reading this section because you want to learn how to draw a curve, allow a brief digression to mention two trivial types of “curves”: a straight line segment and a point.

We showed how to represent a line segment parametrically in Section 9.2 when we discussed rays. Consider a ray from the point $p_{0}$ to the point $p_{1}$ . If we let $d$ be the delta vector $p_{1} - p_{0}$ , then the ray is expressed parametrically as

Parametric line segment

\begin{matrix} (13.3) & p (t) = p_{0} + d t . \end{matrix}

Observe that this is a polynomial of the type we've been considering, where $c_{0} = p_{0}$ , $c_{1} = d$ , and the other coefficients are zero. In other words, this linear curve is a polynomial curve of degree 1.

As boring as lines are, there's an even less interesting shape that can be represented in parametric polynomial form: the point. Lowering the degree of the polynomial from 1 to 0 results in a so-called constant curve. In this case, the function $p (t) = c_{0}$ always returns the same value, resulting in a “curve” that is a single stationary point.

13.1.5Endpoints in Monomial Form

Clearly, one of the most basic properties of a curve that we want to control are the locations of its start and end, $p (0)$ and $p (1)$ , respectively. Let's see what $p (t)$ looks like at the endpoints. We'll use the cubic case as our example. At $t = 0$ , we have

c_{0}

specifies the start point

p (0) = c_{0} + c_{1} (0) + c_{2} (0)^{2} + c_{3} (0)^{3} = c_{0} .

In other words, $c_{0}$ specifies the start point of the curve. Now let's see what happens at the end of the curve at $t = 1$ :

The endpoint is the sum of the coefficients

p (1) = c_{0} + c_{1} (1) + c_{2} (1)^{2} + c_{3} (1)^{3} = c_{0} + c_{1} + c_{2} + c_{3} .

So the endpoint of the curve is given by the sum of the coefficients.

13.1.6Velocities and Tangents

We can think of curves as being either static or dynamic. In the static sense, a curve defines a shape. We operate in this mode of thinking when we use a curve to describe the cross section of an airplane wing or a portion of the letter “S” in the Times Roman font. In the dynamic sense, a curve can be a trajectory or path of an object over time, with the parameter $t$ as “time” and the position function $p (t)$ describing the position of a particle at time $t$ as it moves along the path.

If we consider only the static shape of the curve, then the timing of the curve doesn't matter and our task is a bit easier. For example, when defining a shape, it doesn't matter which endpoint is considered the “start” and which is the ”end”; but if we are using the curve to define a path traversed over time, then it matters very much where the path starts and where it ends.

Using the dynamic mental framework and thinking about curves as paths and not just shapes, some natural questions to ask are, “In what direction is the particle moving at a given point in time?” “How fast is it moving?” These questions can be answered if we create another function $v (t)$ that describes the instantaneous velocity of the particle at time $t$ .

The phrase “instantaneous velocity” implies that the velocity changes over time. So the next logical step is to ask, “How fast is the velocity changing?” Thus it is also helpful to define an instantaneous acceleration function $a (t)$ that describes the rate at which the velocity of the particle is changing at time $t$ .

If you've had at least a semester of calculus, or if you read Chapter 11, you should recognize that the velocity function $v (t)$ is the first derivative of the position function $p (t)$ because velocity measures the rate of change in position over time. Likewise, the acceleration function $a (t)$ is the derivative of the velocity function $v (t)$ because acceleration measures the rate of change of velocity over time.

We're considering curves where $p (t)$ is a polynomial of $t$ here, so the derivatives are trivially obtained. The position, velocity, and acceleration functions for polynomials of arbitrary degree $n$ are

Velocity and acceleration are the first and second derivatives, remember?

\begin{aligned} p (t) & = c_{0} + c_{1} t + c_{2} t^{2} + \dots + c_{n - 1} t^{n - 1} + c_{n} t^{n}, \\ v (t) = \dot{p} (t) & = c_{1} + 2 c_{2} t + \dots + (n - 1) c_{n - 1} t^{n - 2} + n c_{n} t^{n - 1}, \\ a (t) = \dot{v} (t) = \ddot{p} (t) & = 2 c_{2} + \dots + (n - 1) (n - 2) c_{n - 1} t^{n - 3} + n (n - 1) c_{n} t^{n - 2} . \end{aligned}

The derivatives of cubic curves are especially notable and appear several times in this chapter.

Velocity and Acceleration of Cubic Monomial Curve

\begin{aligned} p (t) & = c_{0} + c_{1} t + c_{2} t^{2} + c_{3} t^{3}, \\ (13.5) & v (t) = \dot{p} (t) & = c_{1} + 2 c_{2} t + 3 c_{3} t^{2}, \\ (13.6) & a (t) = \dot{v} (t) = \ddot{p} (t) & = 2 c_{2} + 6 c_{3} t . \end{aligned}

Now let's examine velocity and acceleration in the special case of a parametric ray. Applying the velocity and acceleration functions of Equations (13.5) and (13.6) to the original parameterization of a ray from Equation (13.3) yields

Velocity and acceleration of a ray

\begin{aligned} p (t) & = p_{0} + d t, \\ v (t) & = c_{1} + 2 c_{2} t + 3 c_{3} t^{2} = d, \\ a (t) & = 2 c_{2} + 6 c_{3} t = 0 . \end{aligned}

As we'd expect, the velocity is constant; there is no acceleration.

Sometimes two curves define the same shape but different paths (see Figure 13.1). We've already mentioned one example of this: if we traverse the path backwards it still traces out the same shape. A more general way to generate alternate paths that trace out the same shape is to reparameterize the curve. For example, let's reparameterize our line segment $p (t) = p_{0} + d t$ . We'll make a new function $s (t) = t^{2}$ and see what $p (s (t))$ looks like:

p (s (t)) = p (t^{2}) = p_{0} + d t^{2} .

Figure 13.1 Two curves that define the same “shape,” but not the same “path”

Notice that both curves in Figure 13.1 define the same static shape, but different paths. On the left, the particle moves with constant velocity, but on the right it starts out slowly and accelerates to the finish.

If we are using a curve as a shape and not a path, then this reparameterization doesn't have a visible effect. But that doesn't mean that the derivatives of the curve are irrelevant in the context of shape design. Imagine that we are creating a font using a curve to define a segment of the letter S. In this instance, we might not care about the velocity at any point, but we would care very much about the tangent of the line at any given point. The tangent at a point is the direction the curve is moving at that point, the line that just touches the curve. The tangent is basically the normalized velocity of the curve. Let's formally define the tangent of a curve to be the unit vector pointing in the same direction as the velocity:

The tangent vector

t (t) = \hat{v} (t) = \frac{v (t)}{∥ v (t) ∥} .

Higher derivatives also have geometric meaning. The second derivative is related to curvature, which is sometimes denoted $κ$ , the lowercase Greek letter kappa. We can define a measure of curvature by considering a circle of a given radius. A circle with radius $r$ has curvature equal to $κ = 1 / r$ everywhere on the circle. A straight portion of a curve has zero curvature, which can be interpreted as the curvature of a circle with infinite radius. The curvature is computed by the formula

Curvature

κ (t) = \frac{∥ v (t) \times a (t) ∥}{{∥ v (t) ∥}^{3}} .

13.2Polynomial Interpolation

You are probably already familiar with linear interpolation. Given two “endpoint” values, create a function that transitions at a constant rate (spatially, in a straight line) from one to the other. We say that the function interpolates the two control points, meaning that it passes through the control points and can be used to compute intermediate values.

Polynomial interpolation is similar. Given a series of control points, our goal is to construct a polynomial that interpolates them. The degree of the polynomial depends on the number of control points. A polynomial of degree $n$ can be made to interpolate $n + 1$ control points. For example, linear interpolation is simply first-degree polynomial interpolation. We're primarily interested in cubic (third-degree) curves in this chapter, so we are creating polynomials that interpolate four control points.

In the context of curve design, to say that a curve interpolates control points is to place specific emphasis on the fact that the curve passes through the control points. This is to be contrasted with a curve that merely approximates the control points, meaning it doesn't pass through thepoints but is attracted to them in some way. We use the word“knot” to refer to control points that are interpolated, invoking the meta-phor of a rope with knots in it. It would seem at first glance that the availability of an interpolation scheme would make any approximation schemeobsolete, but we'll see that approximation techniques do have their advantages.

Polynomial interpolation is a classic problem with several well-studied solutions. Since this is a book on 3D math we cast the discussion primarily in geometric terms, but be aware that most of the literature on polynomial interpolation adopts a more general view, because the task of fitting a function to a set of data points has broad applicability.

To facilitate the discussion we use a particular example curve, shown in Figure 13.2. It's somewhat like an S turned on its side. We've marked the four control points on the curve that we are attempting to interpolate. We've chosen to place the $y$ coordinates on the interval $[2, 3]$ for reasons that will be useful later.

\begin{array}{ccc} t & x (t) & y (t) \\ 0 & 0 & 2 \\ 1 / 3 & 1 / 3 & 3 \\ 2 / 3 & 2 / 3 & 2 \\ 1 & 1 & 3 \end{array}

Figure 13.2An example curve and four control points. Can we draw this shape?

Notice that we must specify not only the position of each control point (the $x$ and $y$ coordinates), but the time when we want the curve to reach that control point (the $t$ value). We use the notation that the independent value (the “time values”) of the control points are named $t_{1}, t_{2}, \dots, t_{n}$ and the dependent variables (the spatial coordinate values at those times) are $y_{1}, y_{2}, \dots, y_{n} .$ The symbol $P$ stands for the polynomial function that we seek: $y_{i} = P (t_{i})$ .

The array of time values $t_{1} \dots t_{n}$ is known in other contexts as the knot vector or knot sequence. The word “vector” indicates that the sequence of $t$ values is an array of numbers, not that these numbers form a vector in the geometric sense of the word. If the $t$ s are spaced evenly like they are in our example, then we have a uniform knot vector; otherwise, we say that the knot vector is nonuniform. (Because it might be confusing, let us clarify that the knot vector is the sequence of $t$ values, not the sequence of control points.)

What about the $x$ -coordinate? Because the $x$ and $y$ coordinates are independent of one another, a general 2D curve-fitting application involves two separate one-dimensional problems. Aside from the fact that the two problems use the same knot vector, the coordinates are otherwise unrelated. Even though Figure 13.2 may look like a 2D curve, it is more properly interpreted as a graph of one coordinate (the $y$ -coordinate) as a function of time. We chose as the example curve an S turned on its side, rather than an S in its regular orientation, since the latter is not the graph of a function (technically it's called a relation because it associates more than one value of $y$ with each value of $x$ ).

With that said, there are two ways of interpreting Figure 13.2. We can interpret it either as a 1D function of $y (t)$ , or as a 2D curve, where one of the coordinates has a trivial form $x = t$ . This is a common source of confusion when looking at diagrams of curves in this book and elsewhere. Make sure you pay special attention to the horizontal axis to make sure you know whether it is a graph of one coordinate over time or a plot of the 2D curve that includes the behavior of both spatial coordinates. The traditional literature on polynomial interpolation is mostly in abstract terms of any function of the form $y = f (x)$ . In this context, $x$ would be the independent variable rather than a dependent value as it is for us. The notation we have chosen avoids the symbol $x$ and its associated baggage.

Now we are ready to answer a question some readers might be thinking: “I don't care what time the curve reaches the points, I just want a smooth shape that goes through the points.” Unfortunately, this doesn't unambiguously define a curve—we need to provide some other criteria to nail down the shape, and one way to do this is to associate time values with each control point. In typical applications of polynomial interpolation, we want to be able to specify the values of the dependent variable, because we are trying to fit a function to some known data points. There are some reasonable ways we can synthesize this information if we don't have it—for example, by making the difference between adjacent $t$ values proportional to the Euclidian distance between the corresponding control points. However, the general fact that polynomial interpolation needs us to provide the $t$ values when we often don't have a good way to decide what they should be is a harbinger of later discoveries.

Now that we've set the ground rules, let's try to create this curve. We first take a geometric approach in Section 13.2.1. Then, in Section 13.2.2, we look at the problem from a slightly more abstract mathematical perspective.

13.2.1Aitken's Algorithm

Our first approach to polynomial interpolation is a recursive technique due to Alexander Aitken (1895–1967). Like many recursive algorithms, it works on the principle of divide and conquer. To solve a difficult problem, we first divide it into two (or more) easier problems, solve the easier problems independently, and then combine the results to get the solution to the harder problem. In this case, the “hard” problem is to create a curve that interpolates $n$ control points. We split this curve into two “easier” curves: (1) one that interpolates only the first $n - 1$ points, disregarding the last point; and (2) another that interpolates the last $n - 1$ points without worrying about the first point. Then, we blend these two curves together.

Let's take the important cubic (third-degree) case as an example. A cubic curve has four control points $y_{1} \dots y_{4}$ that we wish to interpolate at the corresponding times $t_{1} \dots t_{4}$ . Applying the “divide-and-conquer” approach, we split this up into two smaller problems: one curve to interpolate $y_{1} \dots y_{3}$ , and another curve to interpolate $y_{2} \dots y_{4}$ . Since each of these curves has three control points, they are quadratic (second-degree) curves. Of course, quadratic curve-fitting is still a “hard” problem for us, and so each curve must be further subdivided.

Consider the first quadratic curve, between $y_{1}$ , $y_{2}$ , and $y_{3}$ . We further divide this curve into two parts, the first part between $y_{1}$ and $y_{2}$ and the other part between $y_{2}$ and $y_{3}$ . These two curves have only two control points each; they are straight line segments. Finally, a problem that is truly “easy”!

Since we have lots of curves at this point, we should invent some notation for them. We let $y_{i}^{1} (t)$ denote the linear curve between $y_{i}$ and $y_{i + 1}$ , the notation $y_{i}^{2} (t)$ denote the quadratic curve between $y_{i}$ and $y_{i + 2}$ , and so on. In other words, the superscript indicates the recursion level in the divide-and-conquer algorithm (and also the degree of the polynomial), and the subscript indexes along the length of the curve.

Take a look at the first quadratic curve $y_{1}^{2} (t)$ that interpolates $y_{1}$ , $y_{2}$ , and $y_{3}$ . It is formed by blending together the two lines containing the first two linear segments. An example of such blending is shown in Figure 13.3. (This figure doesn't use the data from our S example; it's a less symmetric case that better illustrates the blending process.) Notice that each curve segment is an interval from an infinite curve that is defined for any value of $t$ .

Figure 13.3 Creating a quadratic curve as a blend of two linear segments according to Aitken's algorithm

Now let's look at the math behind this. It's all linear interpolation. The easiest are the linear segments, which are defined by linear interpolation between the adjacent control points:

Linear interpolation between two control points

\begin{aligned} y_{1}^{1} (t) & = \frac{(t_{2} - t) y_{1} + (t - t_{1}) y_{2}}{t_{2} - t_{1}}, & y_{2}^{1} (t) & = \frac{(t_{3} - t) y_{2} + (t - t_{2}) y_{3}}{t_{3} - t_{2}} . \end{aligned}

The quadratic curve is only slightly more complicated. We just linearly interpolate between the line segments:

Linear interpolation of lines yields a quadratic curve

y_{1}^{2} (t) = \frac{(t_{3} - t) [y_{1}^{1} (t)] + (t - t_{1}) [y_{2}^{1} (t)]}{t_{3} - t_{1}} .

Hopefully you can see the pattern—each curve is the result of linearly interpolating two curves of lesser degree. Aitken's algorithm can be summarized succinctly as a recurrence relation.

Aitken's Algorithm

\begin{aligned} y_{i}^{0} (t) & = y_{i}, \\ y_{i}^{j} (t) & = \frac{(t_{i + j} - t) [y_{i}^{j - 1} (t)] + (t - t_{i}) [y_{i + 1}^{j - 1} (t)]}{t_{i + j} - t_{i}} . \end{aligned}

Aitken's algorithm works because, at each level both curves being blended already touch the middle control points. The two outermost control points are touched by only one curve or the other, but for those values of $t$ , the blend weights reach their extreme values and all the weight is given to the curve that touches the control point.

Figure 13.4Two levels of Aitken's algorithm

Now that we have the basic idea, let's apply it to our sideways S. Figure 13.4 shows Aitken's algorithm at work with our four data points. On the left, the three linear segments are blended to form two quadratic segments. On the right, the two quadratic curves are blending, yielding the final result that we've been seeking: a cubic spline that interpolates all four control points.

So we've successfully interpolated the four control points, and accomplished the goal set out at the start of this section, right? Well, not exactly. Although our curve does pass through the control points, it isn't really the curve we wanted. If we compare the curve on the right side of Figure 13.4 with the curve we set out to create at the start of this section in Figure 13.2, we see that the curve produced by Aitken's algorithm overshoots the $y$ value of the two middle control points. We have discovered an inconvenient truth.2

Polynomial interpolation doesn't really give us the type of control we want for curve design in geometric settings.

But don't despair! We've learned several important ideas that will be helpful when we discuss Bézier curves in Section 13.4 and splines in Section 13.6. In fact, we're going to beg your patience to allow us to extend the discussion on polynomial interpolation just a bit further. It's sort of like watching the movie Titanic; even though you know that the journey will end tragically, you still might find something useful along the way. We promise that the other techniques in this chapter will have practical as well as educational value.

By the way, you might have noticed that we didn't actually compute the polynomial $P$ that produces the curve. Working through this math is straightforward, but a bit tedious and not all that enlightening. The important point is that Aitken's algorithm is a recursive process of blending curves together and works by repeated linear interpolation. Besides, why bother with the details when we have computers to solve algebra problems for us?3 However, you needn't feel short-changed by lazy authors. If you really want to know what the polynomial is (or just want to feel like you're getting your money's worth), keep reading. We'll discover it in the next section by using a different method that's less tedious mathematically.

13.2.2Lagrange Basis Polynomials

Section 13.2.1 applied geometric intuition to the problem of polynomial interpolation and came up with Aitken's algorithm. Now we approach the subject from a more abstract mathematical point of view.

One mathematical approach to the interpolation problem comes from linear algebra.4 Each control point gives us one equation, and each coefficient gives us one unknown. This system of equations can be put into an $n \times n$ matrix,5 which can be solved by standard techniques such as Gaussian elimination or LU decomposition. Such techniques are outside the scope of this book, but you can learn about them in practically any good book on linear algebra or numerical methods.

Solving a matrix is a relatively time-consuming computational process, requiring $O (n^{3})$ time for an $n \times n$ matrix in the worst case. Luckily there are more efficient approaches. As we did with Aitken's algorithm, we solve a large complicated problem by dividing it into a series of smaller, simpler problems, and then combining those results. Aitken's algorithm is a recursive procedure, but here we will make one “simple” problem per control point.

Let's ignore the $y$ 's for now and think only about the $t$ 's. What if we could create a polynomial for each knot $t_{i}$ such that the polynomial evaluates to unity at that knot, but for all the other knots it evaluates to zero? If we denote the $i$ th polynomial as $ℓ_{i}$ , then this idea can be expressed in mathspeak: $ℓ_{i} (t_{i}) = 1$ , and $ℓ_{i} (t_{j}) = 0$ for all $j \neq i$ . For example, let's assume $n = 4$ . Then our polynomials would have the following values at the knots:

\begin{aligned} ℓ_{1} (t_{1}) & = 1, & ℓ_{1} (t_{1}) & = 0, & ℓ_{3} (t_{1}) & = 0, & ℓ_{4} (t_{1}) & = 0, \\ ℓ_{1} (t_{2}) & = 0, & ℓ_{2} (t_{2}) & = 1, & ℓ_{3} (t_{2}) & = 0, & ℓ_{4} (t_{2}) & = 0, \\ ℓ_{1} (t_{3}) & = 0, & ℓ_{2} (t_{3}) & = 0, & ℓ_{3} (t_{3}) & = 1, & ℓ_{4} (t_{3}) & = 0, \\ ℓ_{1} (t_{4}) & = 0, & ℓ_{2} (t_{4}) & = 0, & ℓ_{3} (t_{4}) & = 0, & ℓ_{4} (t_{4}) & = 1. \end{aligned}

If we were able to create polynomials with the above properties, we would be able to use them as basis polynomials. We would scale each basis polynomial $ℓ_{i}$ by the corresponding coordinate value $y_{i}$ , and add all the scaled polynomials together:

Interpolating polynomial in Lagrange basis form

\begin{matrix} (13.7) & P (t) = \sum_{i = 1}^{n} y_{i} ℓ_{i} (t) = y_{1} ℓ_{1} (t) + y_{2} ℓ_{2} (t) + \dots + y_{n - 1} ℓ_{n - 1} (t) + y_{n} ℓ_{n} (t) . \end{matrix}

You might want to take a moment to convince yourself that this polynomial actually interpolates the control points, meaning $P (t_{i}) = y_{i}$ .

Notice that the basis polynomials depend only on the knot vector (the $t$ 's) and not on the coordinate values (the $y$ 's). Because of this, a set of basis polynomials can be used to quickly construct multiple curves with the same knot vector. This is precisely the situation we find ourselves in when dealing with a 3D curve, which is really three one-dimensional curves that share the same knot sequence.

Of course, all of this would work only if we knew the basis polynomials, and finding $ℓ_{i}$ is itself a problem of polynomial interpolation. However, the “data points” we wish $ℓ_{i}$ to interpolate are all either 0 or 1, so $ℓ_{i}$ can be expressed in a simple form. Such basis polynomials are called Lagrange basis polynomials.6 A Lagrange7 basis polynomial $ℓ_{i}$ for knot vector $t_{1} \dots t_{n}$ looks like Equation (13.8):

Lagrange Basis Polynomial

\begin{matrix} (13.8) & ℓ_{i} (t) = \prod_{\begin{matrix} 1 \leq j \leq n, \\ j \neq i \end{matrix}} \frac{t - t_{j}}{t_{i} - t_{j}} = \frac{t - t_{0}}{t_{i} - t_{0}} \dots \frac{t - t_{i - 1}}{t_{i} - t_{i - 1}} \frac{t - t_{i + 1}}{t_{i} - i_{i + 1}} \dots \frac{t - t_{n}}{t_{i} - t_{n}} . \end{matrix}

This trick works because at the knot $t_{i}$ , all the terms in the product equal 1, causing the entire expression to evaluate to 1, and at any other knot, one of the terms in the product is 0, which causes the entire expression to evaluate to 0.

Let's apply this to our example S curve. Recall that it used the uniform knot vector $(0, \frac{1}{3}, \frac{2}{3}, 1)$ . Here, we work through the first basis polynomial and just present the results for the others:

\begin{aligned} ℓ_{1} (t) & = (\frac{t - t_{2}}{t_{1} - t_{2}}) (\frac{t - t_{3}}{t_{1} - t_{3}}) (\frac{t - t_{4}}{t_{1} - t_{4}}) = (\frac{t - 1 / 3}{0 - 1 / 3}) (\frac{t - 2 / 3}{0 - 2 / 3}) (\frac{t - 1}{0 - 1}) \\ = (\frac{3 t - 1}{- 1}) (\frac{3 t - 2}{- 2}) (\frac{t - 1}{- 1}) = \frac{(3 t - 1) (3 t - 2) (t - 1)}{- 2} \\ = - (9 / 2) t^{3} + 9 t^{2} - (11 / 2) t + 1, \end{aligned}

\begin{aligned} ℓ_{2} (t) & = (27 / 2) t^{3} - (45 / 2) t^{2} + 9 t, \\ ℓ_{3} (t) & = - (27 / 2) t^{3} + 18 t^{2} - (9 / 2) t, \\ ℓ_{4} (t) & = (9 / 2) t^{3} - (9 / 2) t^{2} + t . \end{aligned}

Figure 13.5 shows what these basis polynomials look like.

Figure 13.5Cubic Lagrange basis polynomials for uniform knot vector

Now that we have the Lagrange basis polynomials for the knot vector, let's plug in the $y$ values from our example S curve (Figure 13.2) into Equation (13.7) to get the complete interpolating polynomial:

\begin{aligned} P (t) & = y_{1} ℓ_{1} (t) + y_{2} ℓ_{2} (t) + y_{3} ℓ_{3} (t) + y_{4} ℓ_{4} (t) \\ = 2 [- (9 / 2) t^{3} + 9 t^{2} - (11 / 2) t + 1] + 3 [(27 / 2) t^{3} - (45 / 2) t^{2} + 9 t] \\ + 2 [- (27 / 2) t^{3} + 18 t^{2} - (9 / 2) t] + 3 [(9 / 2) t^{3} - (9 / 2) t^{2} + t] \\ = - 9 t^{3} + 18 t^{2} - 11 t + 2 + (81 / 2) t^{3} - (135 / 2) t^{2} + 27 t \\ - 27 t^{3} + 36 t^{2} - 9 t + (27 / 2) t^{3} - (27 / 2) t^{2} + 3 t \\ = 18 t^{3} - 27 t^{2} + 10 t + 2. \end{aligned}

Let's show these results graphically. First, we scale each basis polynomial by the corresponding coordinate value, as shown in Figure 13.6.

Finally, adding the scaled basis vectors together yields the interpolating polynomial $P$ , the blue curve at the top of Figure 13.7.

Figure 13.6 Scaling each Lagrange basis polynomial by the corresponding coordinate value

Figure 13.7 The interpolating curve is the sum of the scaled basis polynomials

We use the word basis in basis polynomial to emphasize the fact that we can use these polynomials as building blocks to reconstruct absolutely any polynomial whatsoever, given the values of the polynomial at the knots. It's the same basic concept as a basis vector (see Section 3.3.3): any arbitrary vector can be described as a linear combination of the basis vectors. In our case, the space being spanned by the basis is not a geometric 3D space, but the vector space of all possible polynomials of a certain degree, and the scale values for each curve are the known values of the polynomial at the knots.

But there's an alternate way to understand the multiplication and summing that's going on. Instead of thinking about the polynomials as the building blocks and the control points as the scale factors, we can view each point on the curve as a result of taking a weighted average of the control points, where the basis polynomials provide the blending weights. So the control points are the building blocks and the basis polynomials provide the scale factors, although we prefer to be more specific and call these scale factors barycentric coordinates. We introduced barycentric coordinates in the context of triangles in Section 9.6.3, but the term refers to a general technique of describing some value as a weighted average of data points.

We can think of basis polynomials as functions yielding barycentric coordinates (blending weights).

Notice that some values are negative or greater than 1 on certain intervals, which explains why direct polynomial interpolation overshoots the control points. When all barycentric coordinates are inside the $[0, 1]$ range, the resulting point is guaranteed to lie inside the convex hull of the control points. (The convex hull is the smallest polygon that contains all the control points. It “shrink wraps” the control points, sort of like if you were to stretch a rubber band around the control points and then release it.) But when we have any one coordinate outside this interval, the resulting point could extend outside the convex hull. For purposes of geometric curve design, the convex hull guarantee is a very nice one to have. Section 13.4 shows that Bézier curves do provide this guarantee through the Bernstein basis.

13.2.3Polynomial Interpolation Summary

We've approached polynomial interpolation from two perspectives. Aitken's algorithm is a geometric approach based on repeated linear interpolation, and with it we can compute a point on the curve for a given $t$ without knowing the polynomial for the curve. Lagrange interpolation works by creating basis functions that depend only on the knot vector. We can view the use of the basis polynomials in two ways. Either we can think about scaling each basis polynomial by the corresponding coordinate value and then adding them all together, or we can think about the polynomials as functions that compute barycentric coordinates that are used as blending weights in a simple weighted average of the coordinate points.

Both methods yield the same curve when given the same data. Furthermore, this polynomial is unique—no other polynomial of the same degree interpolates the data points. An informal argument for why this is true goes like this: A polynomial of degree $n$ has $n + 1$ degrees of freedom, corresponding to the $n + 1$ coefficients in monomial form. Therefore, the degree $n$ polynomial that interpolates $n + 1$ control points must be unique. (Farin [1] gives a more rigorous argument.)

For purposes of curve design, polynomial interpolation is not ideal, primarily because of our inability to control the overshoot. The overshoot is guaranteed by the fact that the underlying Lagrange basis polynomials are not restricted to the unit interval $[0, 1]$ , and the curve escapes the convex hull of the control points.

Direct polynomial interpolation finds limited application in video games, but our study has introduced the themes of repeated linear interpolation and basis polynomials. We've also seen a bit of the beautiful duality between the two techniques.

13.3Hermite Curves

Polynomial interpolation tries to control the interior of the curve by threading the curve through specified knots. This doesn't work as well as we would like, because of the tendency to oscillate and overshoot, so let's try a different approach. We're still going to want to specify the endpoint positions, of course. But instead of specifying the interior positions to interpolate, let's control the shape of the curve through the tangents at the endpoints. A curve thus specified is said to be a Hermite curve or a curve in Hermite form, named in honor of Charles Hermite8 (1822–1901).

The Hermite form specifies a curve by listing its starting and ending positions and derivatives. A cubic curve has only four coefficients, which allows for the specification of just the first derivatives, the velocities at the endpoints. So describing a cubic curve in Hermite form boils down to the following four pieces of information:

The start position at $t = 0$ ,
The first derivative (initial velocity) at $t = 0$ ,
The end position at $t = 1$ ,
The first derivative (final velocity) at $t = 1$ .

Let's call the desired start and end positions $p_{0}$ and $p_{1}$ and the start and end velocities $v_{0}$ and $v_{1}$ . Figure 13.8 shows some examples of cubic Hermite curves. Please note that the velocity vectors $v_{0}$ and $v_{1}$ have been drawn at one-third their actual length. One reason for doing this is to save space, and another will make sense later once we learn about Bézier curves in Section 13.4.

Figure 13.8Some cubic Hermite curves

Determining the monomial coefficients from the Hermite values is a relatively straightforward algebraic process of combining equations previously discussed in this chapter. The four Hermite values can be translated into the following system of equations:

\begin{aligned} (13.9) & p (0) & = p_{0} & ⟹ & c_{0} & = p_{0}, \\ (13.10) & v (0) & = v_{0} & ⟹ & c_{1} & = v_{0}, \\ (13.11) & v (1) & = v_{1} & ⟹ & c_{1} + 2 c_{2} + 3 c_{3} & = v_{1}, \\ (13.12) & p (1) & = p_{1} & ⟹ & c_{0} + c_{1} + c_{2} + c_{3} & = p_{1} . \end{aligned}

System of equations for Hermite conditions

Equations (13.9) and (13.12), which specify the endpoints, just repeat what we said in Section 13.1.5. Equations (13.10) and (13.11), which specify velocities, follow directly from the velocity equations for a cubic polynomial (Equation (13.5). The order in which these equations are listed is a convention used in other literature on curves, and the utility of this convention will become apparent later in this chapter.

Solving this system of equations results in a method to compute the monomial coefficients from the Hermite positions and derivatives:

Converting Hermite form to monomial form

\begin{aligned} (13.13) & c_{0} & = p_{0}, \\ c_{1} & = v_{0}, \\ c_{2} & = - 3 p_{0} - 2 v_{0} - v_{1} + 3 p_{1}, \\ (13.16) & c_{3} & = 2 p_{0} + v_{0} + v_{1} - 2 p_{1} . \end{aligned}

We can also write these equations in the compact matrix notation introduced in Section 13.1.2. Remember that when we put the coefficients as columns in a matrix $C$ , and the powers of $t$ into the column vector $t$ , we can express a polynomial curve as the matrix product $C t$ ,

We can write monomial form using matrix notation, remember?

p (t) = C t = [\begin{matrix} | & | & | & | \\ c_{0} & c_{1} & c_{2} & c_{3} \\ | & | & | & | \end{matrix}] [\begin{matrix} 1 \\ t \\ t^{2} \\ t^{3} \end{matrix}],

where $p (t)$ and each of the coefficient vectors $c_{i}$ are column vectors whose height matches the number of geometric dimensions (1D, 2D, or 3D). The height of $t$ matches the number of $c$ 's, which depends on the degree of the curve.

The coefficient matrix $C$ may be expressed as a matrix product by putting the Hermite positions and velocities as columns in a matrix $P$ and multiplying by a conversion matrix $H$ :

Cubic Hermite curve using matrix notation

p (t) = C t = P H t = [\begin{matrix} | & | & | & | \\ p_{0} & v_{0} & v_{1} & p_{1} \\ | & | & | & | \end{matrix}] [\begin{matrix} 1 & 0 & - 3 & 2 \\ 0 & 1 & - 2 & 1 \\ 0 & 0 & - 1 & 1 \\ 0 & 0 & 3 & - 2 \end{matrix}] [\begin{matrix} 1 \\ t \\ t^{2} \\ t^{3} \end{matrix}] .

We can interpret the product $P H t$ in two ways. If we group it like $P (H t)$ , then the matrix product $H t$ can be interpreted as Hermite basis functions; we'll have more to say about this basis shortly. Or, we can think about $C = P H$ , in which case, multiplication by $H$ can be considered a conversion from the Hermite basis to the monomial basis, essentially a restatement of Equations (13.13)–(13.16).

We emphasize that the adjectives “monomial,” “Hermite,” and “Bézier” refer to different ways of describing the same set of polynomial curves; they are not different sets of curves. We convert a curve from Hermite form to monomial form by using Equations (13.13)–(13.16), and from monomial form to Hermite form with Equations (13.9)–(13.12).

Let's take a closer look at the Hermite basis and hopefully gain some geometric intuition as to why it works. Remember that we can interpret basis functions as functions of $t$ yielding barycentric coordinates. For cubic Hermite curves, four values are being blended: the two positions and the two velocity vectors.9 Thus, we have four basis functions that are the elements of the column result of the matrix product $H t$ . Expanding the product, we have

\begin{aligned} p (t) & = P (H t) \\ = [\begin{array}{c} | & | & | & | \\ p_{0} & v_{0} & v_{1} & p_{1} \\ | & | & | & | \end{array}] ([\begin{array}{c} 1 & 0 & - 3 & 2 \\ 0 & 1 & - 2 & 1 \\ 0 & 0 & - 1 & 1 \\ 0 & 0 & 3 & - 2 \end{array}] [\begin{array}{c} 1 \\ t \\ t^{2} \\ t^{3} \end{array}]) \\ = [\begin{array}{c} | & | & | & | \\ p_{0} & v_{0} & v_{1} & p_{1} \\ | & | & | & | \end{array}] [\begin{array}{c} 1 - 3 t^{2} + 2 t^{3} \\ t - 2 t^{2} + t^{3} \\ - t^{2} + t^{3} \\ 3 t^{2} - 2 t^{3} \end{array}] . \end{aligned}

Next, we name these basis functions (the rows of $H t$ ) as $H_{0} (t) \dots H_{3} (t)$ (you may see these same functions indexed with different subscripts in other sources):

The cubic Hermite basis functions

\begin{aligned} H_{0} (t) & = 1 - 3 t^{2} + 2 t^{3}, \\ H_{1} (t) & = t - 2 t^{2} + t^{3}, \\ H_{2} (t) & = - t^{2} + t^{3}, \\ H_{3} (t) & = 3 t^{2} - 2 t^{3} . \end{aligned}

Now, expanding the matrix multiplication makes it explicit that these functions serve as blending weights:

Interpreting the Hermite basis functions as blending weights

\begin{aligned} p (t) & = [\begin{array}{c} | & | & | & | \\ p_{0} & v_{0} & v_{1} & p_{1} \\ | & | & | & | \end{array}] [\begin{array}{c} H_{0} (t) \\ H_{1} (t) \\ H_{2} (t) \\ H_{3} (t) \end{array}] \\ = H_{0} (t) p_{0} + H_{1} (t) v_{0} + H_{2} (t) v_{1} + H_{3} (t) p_{1} . \end{aligned}

Figure 13.9 shows a graph of the Hermite basis functions.

Figure 13.9The Hermite basis functions

Now let's make a few observations. First, notice that $H_{0} (t) + H_{3} (t) = 1$ , so those who object to the idea of adding “points” together can breath a sigh of relief, as we can interpret the situation as a proper barycentric combination of the points.

The curve $H_{3} (t)$ deserves special attention. It is also is known as the smoothstep function and is truly a gem that every game programmer should know. This function is found in many places, including the Renderman shading language and HLSL. To remove the rigid, robotic feeling from any linear interpolation (especially camera transitions), simply compute the normalized interpolation fraction $t$ as usual (in the range $0 \leq t \leq 1$ ), and then replace $t$ with $3 t^{2} - 2 t^{3}$ . Voila! Everything will suddenly feel more polished. The reason for this is that the smoothstep function eliminates the sudden jump in velocity at the endpoints: $H_{3}^{'} (0) = H_{3}^{'} (1) = 0$ .

Smoothstep is Your Friend

The Hermite basis function

H_{3} (t)

is also known as the smoothstep function. Almost any transition based on linear interpolation, especially a camera transition, feels better when replaced with the smoothstep function.

One final word about Hermite curves. Like the other forms for polynomial curves, it's possible to design a scheme for Hermite curves of higher degree, although the cubic polynomial is the most commonly used in computer graphics and animation. With the cubic spline, we specified the position (the “0th” derivative) and velocities (first derivatives) at the end points. A quintic (fifth-degree) Hermite curve happens when we also specify the accelerations (second derivatives).

13.4Bézier Curves

This chapter has so far discussed a number of ideas about curves that were enlightening, but it has yet to describe a fully practical way to design a curve. All of that will change in this section.10 Bézier curves were invented by Pierre Bézier (1910–1999), a French11 engineer, while he was working for the automaker Renault. Bézier curves have many desirable properties that make them well suited for curve design. Importantly, Bézier curves approximate rather than interpolate: although they do pass through the first and last control points, they only pass near the interior points. For this reason, the Bézier control points are called “control points” rather than “knots.” Some example cubic Bézier curves are shown in Figure 13.10.

Figure 13.10Some cubic Bézier curves

Recall from Section 13.2 that the problem of polynomial interpolation had two solutions that produced the same result. Aitken's algorithm was a recursive construction technique that appealed to our geometric sensibilities, and a more abstract approach yielded the Lagrange basis polynomials. Bézier curves exhibit a similar duality. The counterpart of Aitken's algorithm for Bézier curves is the de Casteljau algorithm, a recursive geometric technique for constructing Bézier curves through repeated linear interpolation; this is the subject of Section 13.4.1. The analog to the Lagrange basis is the Bernstein basis, which is discussed in Section 13.4.2. After considering both sides of this coin, Section 13.4.3 investigates the derivatives12 of Bézier curves and reveals the relationship to Hermite form.

We've seen some beautiful cooperation between math and geometry in this book, but the convergence is particularly elegant for Bézier curves. It seems as if almost every important property of Bézier curves was independently discovered multiple times by researchers in different fields. Rogers' book [4] includes an interesting look at this story.

$t = .25$	$t = .50$	$t = .75$
$Unknown environment 'picture'$	$Unknown environment 'picture'$	$Unknown environment 'picture'$
$Unknown environment 'picture'$	$Unknown environment 'picture'$	$Unknown environment 'picture'$
$Unknown environment 'picture'$	$Unknown environment 'picture'$	$Unknown environment 'picture'$
$Unknown environment 'picture'$	$Unknown environment 'picture'$	$Unknown environment 'picture'$

Figure 13.11The de Casteljau algorithm applied to a cubic curve

13.4.1The de Casteljau Algorithm

The de Casteljau algorithm defines a method for constructing Bézier curves through repeated linear interpolation. It was created in 1959 by physicist and mathematician Paul de Casteljau (1910–1999).13 We show how the algorithm works for the important cubic case as our example. First, a bit of notation is necessary. A cubic curve is defined by four control points, $b_{0} \dots b_{3}$ . Notice that Bézier control points traditionally are indexed starting at zero (which will appeal to the C programmers amongst us). Also, as with Aitken's algorithm, we add a superscript to indicate the level of recursion. The original control points are assigned level 0, thus $b_{i}^{0} = b_{i}$ .

With that out of the way, let's consider a specific parameter value $t$ from 0 to 1. The de Casteljau algorithm geometrically constructs the corresponding point on the curve $p (t)$ as follows. Between each pair of consecutive control points, we interpolate according to the fraction $t$ to obtain a new point. So, starting with the original four control points $b_{0}^{0} \dots b_{3}^{0}$ , we derive three new points $b_{0}^{1}$ , $b_{1}^{1}$ , and $b_{2}^{1}$ . Another round of interpolation between each pair of these three points gives us two points $b_{0}^{2}$ and $b_{1}^{2}$ , and a final interpolation yields the point $b_{0}^{3} = p (t)$ we're looking for. Figure 13.11 shows the de Casteljau algorithm applied to the same curve at $t = .25$ , $t = .50$ , and $t = .75$ .

It's helpful to write out all the $b$ s in a triangular fashion, as shown in Figure 13.12. Each intermediate point is the result of linearly interpolating between two points on the row above.

\begin{array}{ccccccccccccc} b_{0}^{0} & b_{1}^{0} & b_{2}^{0} & b_{3}^{0} \\ ↘ & ↙ & ↘ & ↙ & ↘ & ↙ \\ b_{0}^{1} & b_{1}^{1} & b_{2}^{1} \\ ↘ & ↙ & ↘ & ↙ \\ b_{0}^{2} & b_{1}^{2} \\ ↘ & ↙ \\ b_{0}^{3} \end{array}

Figure 13.12 Hierarchical relationships in the de Casteljau algorithm for a cubic curve

If we combine these recursive relationships with the basic linear interpolation formula, we obtain the de Casteljau recurrence relation.

De Casteljau Recurrence Relation

\begin{aligned} b_{i}^{0} (t) & = b_{i}, \\ b_{i}^{n} (t) & = (1 - t) [b_{i}^{n - 1} (t)] + t [b_{i + 1}^{n - 1} (t)] . \end{aligned}

Listing 13.1 illustrates how the de Casteljau algorithm could be implemented in C++ to evaluate a Bézier curve for a specific value of $t$ . The caller passes in the original control points in an array, which is also used as a temporary working space as the operation is performed in place. Each iteration of the outer loop performs one round of interpolation, replacing the points at one level with the points at the next higher numbered superscript. This process is continued until there is one point remaining, the desired result $p (t)$ . This example is intended to illustrate how the algorithm works, not how to do anything particularly fast or provide a clean interface.

Vector3 deCasteljau(
    int n,            // order of the curve, the number of points
    Vector3 points[], // array of points.  Overwritten, as
                      // the algorithm works in place
    float t           // parameter value we wish to evaluate
) {

    // Perform the conversion in place
    while (n > 1) {
        --n;

        // Perform the next round of interpolation, reducing the
        // degree of the curve by one.
        for (int i = 0 ; i < n ; ++i) {
            points[i] = points[i]*(1.0f-t) + points[i+1]*t;
        }
    }

    // Result is now in the first slot.
    return points[0];
}

This gives us a method for locating a point at any given $t$ through repeated interpolation, but it doesn't directly give us a closed form expression to calculate the point in terms of the control points. We emphasize that such a closed form expression is often not needed, but let's derive it in monomial form anyway. We're looking for a polynomial grouped by powers of $t$ . We'll work our way up from the linear and quadratic cases to the cubic. Section 13.4.2 presents a general pattern leading us to the expression for arbitrary degree curves.

The linear case comes straight from the recurrence relation without any real work:

\begin{aligned} b_{i}^{0} (t) & = b_{i}, \\ b_{i}^{1} (t) & = (1 - t) [b_{i}^{0} (t)] + t [b_{i + 1}^{0} (t)] \\ = (1 - t) b_{i} + t b_{i + 1} \\ = b_{i} + t (b_{i + 1} - b_{i}) . \end{aligned}

Applying one more level gives us a quadratic polynomial:

\begin{aligned} b_{i}^{2} (t) & = (1 - t) [b_{i}^{1} (t)] + t [b_{i + 1}^{1} (t)] \\ = (1 - t) [b_{i} + t (b_{i + 1} - b_{i})] + t [b_{i + 1} + t (b_{i + 2} - b_{i + 1})] \\ = [b_{i} + t (b_{i + 1} - b_{i})] - t [b_{i} + t (b_{i + 1} - b_{i})] \\ + t [b_{i + 1} + t (b_{i + 2} - b_{i + 1})] \\ = b_{i} + t (b_{i + 1} - b_{i}) - t b_{i} - t^{2} (b_{i + 1} - b_{i}) \\ + t b_{i + 1} + t^{2} (b_{i + 2} - b_{i + 1}) \\ = b_{i} + t (2 b_{i + 1} - 2 b_{i}) + t^{2} (b_{i} - 2 b_{i + 1} + b_{i + 2}) . \end{aligned}

In other words, quadratic Bézier curves, which have three control points, can be expressed in monomial form as

Quadratic Bézier curve in monomial form

\begin{matrix} (13.17) & p (t) = b_{0}^{2} (t) = b_{0} + t (2 b_{1} - 2 b_{0}) + t^{2} (b_{0} - 2 b_{1} + b_{2}) . \end{matrix}

Before we do the last round of interpolation to get the cubic curve, let's take a closer look at the quadratic expression in Equation (13.17). This conversion from Bézier form to monomial basis can be written with fewer letters by using the matrix form introduced earlier in this chapter. After putting the control points $b_{0}$ , $b_{1}$ , $b_{2}$ as columns into a matrix $B$ , we can write

Quadratic Bézier curve using matrix notation

\begin{matrix} (13.18) & p (t) = C t = B M t = [\begin{matrix} | & | & | \\ b_{0} & b_{1} & b_{2} \\ | & | & | \end{matrix}] [\begin{matrix} 1 & - 2 & 1 \\ 0 & 2 & - 2 \\ 0 & 0 & 1 \end{matrix}] [\begin{matrix} 1 \\ t \\ t^{2} \end{matrix}] . \end{matrix}

As we saw in Section 13.3 with Hermite curves, the two different ways to group the product $B M t$ lead to two different interpretations. If we perform the multiplication $B M$ first, we get the matrix of monomial coefficients $C$ , meaning $M$ is a conversion matrix from Bézier form to monomial form. Direct evaluation of the monomial form is faster than implementing the de Casteljau algorithm, and so this form might be preferable in situations where we need to evaluate the same curve for many different values of $t$ , for example, when moving an object over time along a path described by a Bézier curve. (However, one must be aware of issues related to precision. For example, we can ensure that performing de Casteljau using $t = 1.0$ produces a result that matches the last control point exactly. However, substituting $t = 1.0$ into the polynomial in monomial form, the coefficients might not sum exactly to this value due to floating point representation.)

The other way to group the product $B M t$ is to perform the right-hand multiplication first: $B (M t)$ . When we plug in a specific value of $t$ , the product $M t$ yields a column vector of barycentric coordinates. If we perform this multiplication leaving $t$ as a variable, we get a column vector of polynomials that can be interpreted as a basis. The basis polynomials for Bézier curves are the Bernstein basis, discussed in Section 13.4.2.

Back to repeated interpolation. One last round gives us the cubic polynomial:

One last iteration of de Casteljau iteration yields the cubic polynomial.
\
Whew, expanding it all out like this is pretty exhausting!

\begin{aligned} b_{i}^{3} (t) & = (1 - t) [b_{i}^{2} (t)] + t [b_{i + 1}^{2} (t)] \\ = (1 - t) [b_{i} + t (2 b_{i + 1} - 2 b_{i}) + t^{2} (b_{i} - 2 b_{i + 1} + b_{i + 2})] \\ + t [b_{i + 1} + t (2 b_{i + 2} - 2 b_{i + 1}) + t^{2} (b_{i + 1} - 2 b_{i + 2} + b_{i + 3})] \\ = [b_{i} + t (2 b_{i + 1} - 2 b_{i}) + t^{2} (b_{i} - 2 b_{i + 1} + b_{i + 2})] \\ - t [b_{i} + t (2 b_{i + 1} - 2 b_{i}) + t^{2} (b_{i} - 2 b_{i + 1} + b_{i + 2})] \\ + t [b_{i + 1} + t (2 b_{i + 2} - 2 b_{i + 1}) + t^{2} (b_{i + 1} - 2 b_{i + 2} + b_{i + 3})] \\ = b_{i} + t (2 b_{i + 1} - 2 b_{i}) + t^{2} (b_{i} - 2 b_{i + 1} + b_{i + 2}) \\ - t b_{i} - t^{2} (2 b_{i + 1} - 2 b_{i}) - t^{3} (b_{i} - 2 b_{i + 1} + b_{i + 2}) \\ + t b_{i + 1} + t^{2} (2 b_{i + 2} - 2 b_{i + 1}) + t^{3} (b_{i + 1} - 2 b_{i + 2} + b_{i + 3}) \end{aligned}

\begin{aligned} = b_{i} + t (3 b_{i + 1} - 3 b_{i}) + t^{2} (3 b_{i} - 6 b_{i + 1} + 3 b_{i + 2}) \\ + t^{3} (- b_{i} + 3 b_{i + 1} - 3 b_{i + 2} + b_{i + 3}) . \end{aligned}

Writing the last line again, but this time assuming the cubic level is the final level of recursion, we have

Cubic Bézier curve in monomial form

\begin{aligned} (13.19) & \begin{aligned} p (t) = b_{0}^{3} (t) & = b_{0} + t (3 b_{1} - 3 b_{0}) + t^{2} (3 b_{0} - 6 b_{1} + 3 b_{2}) \\ + t^{3} (- b_{0} + 3 b_{1} - 3 b_{2} + b_{3}) . \end{aligned} \end{aligned}

Just to make sure you didn't miss it, Equation (13.19) tells us how to convert a cubic Bézier curve to monomial form. Since this is important, let's write it a bit more deliberately as

Cubic monomial coefficients from Bézier control points

\begin{aligned} c_{0} & = b_{0}, \\ c_{1} & = - 3 b_{0} + 3 b_{1}, \\ c_{2} & = 3 b_{0} - 6 b_{1} + 3 b_{2}, \\ c_{3} & = - b_{0} + 3 b_{1} - 3 b_{2} + b_{3} . \end{aligned}

We can now put this conversion into a matrix like we did with the quadratic case in Equation (13.18). The cubic equation for a specific point on the curve $p (t)$ is written in matrix notation as

Cubic Bézier curve using matrix notation

p (t) = C t = B M t = [\begin{matrix} | & | & | & | \\ b_{0} & b_{1} & b_{2} & b_{3} \\ | & | & | & | \end{matrix}] [\begin{matrix} 1 & - 3 & 3 & - 1 \\ 0 & 3 & - 6 & 3 \\ 0 & 0 & 3 & - 3 \\ 0 & 0 & 0 & 1 \end{matrix}] [\begin{matrix} 1 \\ t \\ t^{2} \\ t^{3} \end{matrix}] .

We can also invert this process, meaning we can convert any polynomial curve from monomial form to Bézier form. Given any polynomial curve, the Bézier control points that describe the curve are uniquely determined:

Computing Bézier control points from monomial coefficients

\begin{aligned} (13.20) & b_{0} & = c_{0}, \\ b_{1} & = c_{0} + (1 / 3) c_{1}, \\ b_{2} & = c_{0} + (2 / 3) c_{1} + (1 / 3) c_{2}, \\ (13.23) & b_{3} & = c_{0} + c_{1} + c_{2} + c_{3} . \end{aligned}

And, of course, we can write this in matrix form:

Converting from monomial to Bézier form, in matrix notation

[\begin{matrix} | & | & | & | \\ b_{0} & b_{1} & b_{2} & b_{3} \\ | & | & | & | \end{matrix}] = [\begin{matrix} | & | & | & | \\ c_{0} & c_{1} & c_{2} & c_{3} \\ | & | & | & | \end{matrix}] [\begin{matrix} 1 & 1 & 1 & 1 \\ 0 & 1 / 3 & 2 / 3 & 1 \\ 0 & 0 & 1 / 3 & 1 \\ 0 & 0 & 0 & 1 \end{matrix}] .

13.4.2The Bernstein Basis

Section 13.4.1 ended with a bit of algebra to calculate the polynomial for a curve from the Bézier control points. This polynomial was expressed in monomial form, meaning the coefficients were for the powers of $t$ . We can also write the polynomial in Bézier form by collecting the terms on the control points rather than the powers of $t$ . When written this way, each control point has a coefficient that represents the barycentric weight as a function of $t$ that the control point contributes to the curve.

Let's repeat the algebra exercise from Section 13.4.1, only this time we'll be writing things in a slightly different way that will lead us to some observations. As we did before, we start with the linear case (remember, $b_{i}^{0} = b_{i}$ ):

\begin{aligned} b_{i}^{1} (t) & = (1 - t) [b_{i}^{0} (t)] + t [b_{i + 1}^{0} (t)] \\ = (1 - t) b_{i} + t b_{i + 1} . \end{aligned}

Next comes the quadratic:

\begin{aligned} b_{i}^{2} (t) & = (1 - t) b_{i}^{1} (t) + t b_{i + 1}^{1} (t) \\ = (1 - t) [(1 - t) b_{i} + t b_{i + 1}] + t [(1 - t) b_{i + 1} + t b_{i + 2}] \\ = (1 - t)^{2} b_{i} + t (1 - t) b_{i + 1} + t (1 - t) b_{i + 1} + t^{2} b_{i + 2} \\ = (1 - t)^{2} b_{i} + 2 t (1 - t) b_{i + 1} + t^{2} b_{i + 2} . \end{aligned}

And finally, we have the cubic case:

\begin{aligned} b_{i}^{3} (t) & = (1 - t) [b_{i}^{2} (t)] + t [b_{i + 1}^{2} (t)] \\ = (1 - t) [(1 - t)^{2} b_{i} + 2 t (1 - t) b_{i + 1} + t^{2} b_{i + 2}] \\ + t [(1 - t)^{2} b_{i + 1} + 2 t (1 - t) b_{i + 2} + t^{2} b_{i + 3}] \\ = (1 - t)^{3} b_{i} + 2 t (1 - t)^{2} b_{i + 1} + t^{2} (1 - t) b_{i + 2} \\ + t (1 - t)^{2} b_{i + 1} + 2 t^{2} (1 - t) b_{i + 2} + t^{3} b_{i + 3} \\ = (1 - t)^{3} b_{i} + 3 t (1 - t)^{2} b_{i + 1} + 3 t^{2} (1 - t) b_{i + 2} + t^{3} b_{i + 3} . \end{aligned}

You might see a pattern emerging, but just to make it even more clear, let's show the curves up to degree 5 (we'll skip over the algebra; it's similar to what we did above):

Bézier curves of degree 1–5

\begin{aligned} b_{0}^{1} (t) & = (1 - t) b_{0} + t b_{1}, \\ b_{0}^{2} (t) & = (1 - t)^{2} b_{0} + 2 t (1 - t) b_{1} + t^{2} b_{2}, \\ (13.26) & b_{0}^{3} (t) & = (1 - t)^{3} b_{0} + 3 t (1 - t)^{2} b_{1} + 3 t^{2} (1 - t) b_{2} + t^{3} b_{3}, \end{aligned}

\begin{aligned} \begin{aligned} b_{0}^{4} (t) & = (1 - t)^{4} b_{0} + 4 t (1 - t)^{3} b_{1} + 6 t^{2} (1 - t)^{2} b_{2} \\ + 4 t^{3} (t - 1) b_{3} + t^{4} b_{4}, \end{aligned} \\ \begin{aligned} b_{0}^{5} (t) & = (1 - t)^{5} b_{0} + 5 t (1 - t)^{4} b_{1} + 10 t^{2} (1 - t)^{3} b_{2} \\ + 10 t^{3} (1 - t)^{2} b_{3} + 5 t^{4} (1 - t) b_{4} + t^{5} b_{5} . \end{aligned} \end{aligned}

Now the pattern is more clear. Each term has a constant coefficient, a power of $(1 - t)$ , and a power of $t$ . The powers of $t$ are numbered in increasing order, so $b_{i}$ has a coefficient $t^{i}$ . The powers of $(1 - t)$ follow the opposite pattern and are numbered in decreasing order.

The pattern for the constant coefficients is a bit more complicated. Please permit a brief, but hopefully interesting, detour into combinatorics. Let's write out the first eight levels in a triangular form to make the pattern a bit easier to see:

Pascal's triangle

\begin{array}{cccccccccccccccccc} 0 & 1 \\ 1 & 1 & 1 \\ 2 & 1 & 2 & 1 \\ 3 & 1 & 3 & 3 & 1 \\ 4 & 1 & 4 & 6 & 4 & 1 \\ 5 & 1 & 5 & 10 & 10 & 5 & 1 \\ 6 & 1 & 6 & 15 & 20 & 15 & 6 & 1 \\ 7 & 1 & 7 & 21 & 35 & 35 & 21 & 7 & 1 \end{array}

With the exception of the 1s on the outer edge of the triangle, all other numbers are the sum of the two numbers above it. You are looking at a very famous number pattern that has been studied for centuries, known as the binomial coefficients because the $n$ th row gives the coefficients when expanding the binomial $(a + b)^{n}$ . The compulsion to organize these numbers in a triangular manner like this has struck many people, including the mathematician and physicist Blaise Pascal (1623–1662).14 This triangular arrangement of the binomial coefficients is known as Pascal's triangle.15

Binomial coefficients have a special notation. We can refer to the $k$ th number on row $n$ in Pascal's triangle (where the indexing starts at 0 for both $n$ and $k$ ) using binomial coefficient notation as

Binomial coefficient notation

(\binom{n}{k}) .

For example, $(\binom{6}{2}) = 15$ . We read $(\binom{n}{k})$ as “ $n$ choose $k$ ,” because the value of $(\binom{n}{k})$ also happens to be the number of subsets of $k$ objects that can be chosen from a set of $n$ objects, disregarding the order.

Now let's look at the general formula for computing binomial coefficients. (We emphasize that this formula is primarily for entertainment purposes, since our use of binomial coefficients in this chapter on curves will be restricted to the first few lines of Pascal's triangle.) Remember from Section 11.4.6 the factorial operator, denoted $n!$ , which is the product of all the whole numbers up to and including $n$ :

Factorial operator

n! = \prod_{i = 1}^{n} i = 1 \times 2 \times 3 \times \dots \times n .

Using factorials, and defining $0! \equiv 1$ , we compute a binomial coefficient as

Binomial coefficient

(\binom{n}{k}) = \frac{n!}{k! (n - k)!} .

Binomial coefficients arise frequently in applications dealing with combinations and permutations, such as probability and analysis of algorithms. Because of their importance, and the amazingly large number of patterns that can be found in them, they have been the subject of quite a large amount of study. A very thorough discussion of binomial coefficients, especially regarding their use in computer algorithms, is presented by Knuth [2].

Back to curves. We've analyzed the pattern of the barycentric weights. Now let's rewrite a Bézier curve, replacing each control point weight with a function $B_{i}^{n} (t)$ , and using the cubic curve formula (Equation (13.26)) as our example:

\begin{aligned} b_{0}^{3} (t) & = (1 - t)^{3} b_{0} + 3 t (1 - t)^{2} b_{1} + 3 t^{2} (1 - t) b_{2} + t^{3} b_{3} \\ = [B_{0}^{3} (t)] b_{0} + [B_{1}^{3} (t)] b_{1} + [B_{2}^{3} (t)] b_{2} + [B_{3}^{3} (t)] b_{3} . \end{aligned}

More generally, we can write a Bézier curve of degree $n$ (having $n + 1$ control points) as

Bézier curve of arbitrary degree

b_{0}^{n} (t) = \sum_{i = 0}^{n} [B_{i}^{n} (t)] b_{i} .



$B_{0}^{1} (t) = 1 - t$	$B_{0}^{2} (t) = (1 - t)^{2}$
$B_{1}^{1} (t) = t$	$B_{1}^{2} (t) = 2 (1 - t) t$
	$B_{2}^{2} (t) = t^{2}$



$B_{0}^{3} (t) = (1 - t)^{3}$	$B_{0}^{4} (t) = (1 - t)^{4}$
$B_{1}^{3} (t) = 3 t (1 - t)^{2}$	$B_{1}^{4} (t) = 4 t (1 - t)^{3}$
$B_{2}^{3} (t) = 3 t^{2} (1 - t)$	$B_{2}^{4} (t) = 6 t^{2} (1 - t)^{2}$
$B_{3}^{3} (t) = t^{3}$	$B_{3}^{4} (t) = 4 t^{3} (1 - t)$
	$B_{4}^{4} (t) = t^{4}$

Figure 13.13Bernstein polynomials of degrees 1–4

The function $B_{i}^{n} (t)$ is a Bernstein polynomial, named after Sergei Bernstein (1880–1968).16 We've already figured out the pattern of these polynomials, but here's the precise formula:

Bernstein polynomial

B_{i}^{n} (t) = (\binom{n}{i}) t^{i} (1 - t)^{n - i}, 0 \leq i \leq n .

Figure 13.13 shows the graphs for the Bernstein polynomials up to the quartic case.

The properties of the Bernstein polynomials tell us a lot about how Bézier curves behave. Let's discuss a few properties in particular.

Sum to one. The Bernstein polynomials sum to unity for all values of $t$ , which is nice because if they didn't, then they wouldn't define proper barycentric coordinates. This fact is not immediately obvious, neither from visual inspection of Figure 13.13 nor from a cursory examination of the equations, but it can be proven. If you relish the idea of working through such a proof for the quadratic case, check out Exercise 4.

Convex hull property. The range of the Bernstein polynomials is $0 \dots 1$ for the entire length of the curve, $0 \leq t \leq 1$ . Combined with the previous property, this means that Bézier curves obey the convex hull property: the curve is bounded to stay within the convex hull of the control points. Compare this with the Lagrange basis polynomials, which do not stay within the $[0, 1]$ interval, causing polynomial interpolation to not obey the convex hull property. One manifestation of this is the undesirable “overshooting” witnessed in Figure 13.4.

Endpoints interpolated. The first and last polynomials attain unity when we need them to. Because $B_{0}^{n} (0) = 1$ and $B_{n}^{n} (1) = 1$ , the curve touches the endpoints. Notice that $t = 0$ and $t = 1$ are the only places where any of the basis polynomials reach 1, which is why the other control points are only approximated and not interpolated.

Global support. All the polynomials are nonzero on the open interval $(0, 1$ )—that is, the entire curve excluding the endpoints. The region where the blending weight for a control point is nonzero is called the support of the control point. Wherever the control point has support, it exerts some influence on the curve.

Bézier control points have global support because the Bernstein polynomials are nonzero everywhere other than the endpoints. The practical result is that when any one control point is moved, the entire curve is affected. This is not a desirable property for curve design. Once we have a section of the curve that looks how we want, we would prefer that editing of some other distant control point not disturb the section that was shaped the way we liked it. This envious situation, known as local support, occurs when we move a particular control point and only the part of the curve near that control point is affected, for some definition of “near.”

Local support means that the basis function is nonzero only in some interval, and outside this interval it is zero. Unfortunately, such a basis function cannot be described as a polynomial, and thus no polynomial curve can achieve local control. However, local support is possible by piecing together small curves that fit together just right to form a spline, as Section 13.6 discusses.

One local maximum. Although each control point exercises influence over the entire curve, each exerts the most influence at one particular point along the curve. Each Bernstein polynomial $B_{i}^{n} (t)$ , which serves as the blend weight for the control point $b_{i}$ , has one maximum at the auspicious time $t = i / n$ . Furthermore, at that time, $b_{i}$ exerts more weight than any other control point.

Thus, although every point on the interior of the curve is influenced to some degree by all the control points (because Bézier control points have global support), the nearest control point has the most influence.

13.4.3Bézier Derivatives and Their Relationship
to the Hermite Form

Let's take a look at the derivatives of a Bézier curve. Since we like to use the cubic curve as our example, we're talking about the velocity and acceleration of the curve. Remember that the velocity is related to the tangent (direction) of the curve, and the acceleration is related to its curvature.

Section 13.1.6 showed how to get the velocity function of a curve from the monomial coefficients:

Position and velocity of a cubic curve

\begin{aligned} p (t) & = c_{0} + c_{1} t + c_{2} t^{2} + c_{3} t^{3}, \\ (13.29) & v (t) = \dot{p} (t) & = c_{1} + 2 c_{2} t + 3 c_{3} t^{2} . \end{aligned}

And Section 13.4.1 showed how to extract the monomial coefficients from a cubic Bézier curve:

\begin{aligned} c_{0} & = b_{0}, \\ c_{1} & = 3 b_{1} - 3 b_{0}, \\ c_{2} & = 3 b_{0} - 6 b_{1} + 3 b_{2}, \\ c_{3} & = - b_{0} + 3 b_{1} - 3 b_{2} + b_{3} . \end{aligned}

Plugging these coefficients into the velocity formula (Equation (13.29)), we obtain a formula for the instantaneous velocity of a curve in terms of the Bézier control points:

First derivative (velocity) of a cubic Bézier curve

\begin{aligned} v (t) & = c_{1} + 2 c_{2} t + 3 c_{3} t^{2} \\ = (3 b_{1} - 3 b_{0}) + 2 (3 b_{0} - 6 b_{1} + 3 b_{2}) t + 3 (- b_{0} + 3 b_{1} - 3 b_{2} + b_{3}) t^{2} . \end{aligned}

Now consider the velocity at the endpoints $t = 0$ and $t = 1$ :

Velocity at the endpoints of a cubic Bézier curve

\begin{aligned} v (0) = & (3 b_{1} - 3 b_{0}) + 2 (3 b_{0} - 6 b_{1} + 3 b_{2}) (0) \\ + 3 (- b_{0} + 3 b_{1} - 3 b_{2} + b_{3}) (0)^{2} \\ (13.30) & = & 3 (b_{1} - b_{0}), \\ v (1) = & (3 b_{1} - 3 b_{0}) + 2 (3 b_{0} - 6 b_{1} + 3 b_{2}) (1) \\ + 3 (- b_{0} + 3 b_{1} - 3 b_{2} + b_{3}) (1)^{2} \\ = & 3 b_{1} - 3 b_{0} + 6 b_{0} - 12 b_{1} + 6 b_{2} - 3 b_{0} + 9 b_{1} - 9 b_{2} + 3 b_{3} \\ (13.31) & = & 3 (b_{3} - b_{2}) . \end{aligned}

This is interesting. Observe that $b_{1} - b_{0}$ gives us the vector from the first control point to the second control point, and $b_{3} - b_{2}$ is the vector from the third control point to the last control point. So the tangent at the start of the curve at $t = 0$ is “aimed towards” the first control point, and the tangent at the end of the curve at $t = 1$ is “aimed towards” the third control point. (Actually, the tangent at $t = 1$ points directly away from the third control point, if we think about moving along the curve in the direction of increasing $t$ ). This is a key point.

The first edge of the Bézier control polygon completely determines the tangent at the start of the curve, and the last edge determines the tangent at the end of the curve.

Another way to illustrate the role of the middle control points in a cubic Bézier curve is to examine the relationship between the Bézier and Hermite forms. Remember that the cubic Hermite form contains the initial position $p_{0}$ and velocity $p_{1}$ and the final position $p_{1}$ and velocity $v_{1}$ . Now that we know the relationship between the Bézier control points and the curve velocity, it's easy to convert from Bézier to Hermite form:

Converting cubic curve from Bézier form to Hermite form

\begin{aligned} (13.32) & p_{0} & = b_{0}, \\ v_{0} & = 3 (b_{1} - b_{0}), \\ v_{1} & = 3 (b_{3} - b_{2}), \\ (13.35) & p_{1} & = b_{3} . \end{aligned}

Or, we can convert from Hermite to Bézier:

Converting cubic curve from Hermite form to Bézier form

\begin{aligned} b_{0} & = p_{0}, \\ b_{1} & = p_{0} + (1 / 3) v_{0}, \\ b_{2} & = p_{1} - (1 / 3) v_{1}, \\ b_{3} & = p_{1} . \end{aligned}

Thus, Hermite and Bézier forms are very closely related, and it is very easy to convert between them. Their relationship is depicted graphically in Figure 13.14.

Figure 13.14 Relationship between Bézier and Hermite forms

We've said that the first derivative at either endpoint is completely determined by the nearest two Bézier control points. We can actually make a more general statement. The $n$ th derivative at either endpoint is completely determined by the nearest $n + 1$ control points. The “0th derivative” (the position of the curve) is completely determined by the interpolated control point. The first derivative has been discussed. The second derivative (acceleration) at the end of the curve is determined by the closest three control points. In fact, let's see exactly what the acceleration is in terms of the Bézier control points for a cubic curve. Converting the acceleration function (Equation (13.6)) from monomial to Bézier form, we get

Acceleration of a cubic Bézier curve

\begin{aligned} a (t) & = 2 c_{2} + 6 c_{3} t \\ = 2 (3 b_{0} - 6 b_{1} + 3 b_{2}) + 6 (- b_{0} + 3 b_{1} - 3 b_{2} + b_{3}) t \\ = (6 b_{0} - 12 b_{1} + 6 b_{2}) + (- 6 b_{0} + 18 b_{1} - 18 b_{2} + 6 b_{3}) t . \end{aligned}

At the endpoints, the acceleration is given by

Acceleration of a cubic Bézier curve at the endpoints

\begin{aligned} a (0) & = (6 b_{0} - 12 b_{1} + 6 b_{2}) + (- 6 b_{0} + 18 b_{1} - 18 b_{2} + 6 b_{3}) 0 \\ = 6 b_{0} - 12 b_{1} + 6 b_{2}, \\ a (1) & = (6 b_{0} - 12 b_{1} + 6 b_{2}) + (- 6 b_{0} + 18 b_{1} - 18 b_{2} + 6 b_{3}) 1 \\ = 6 b_{1} - 12 b_{2} + 6 b_{3} . \end{aligned}

As expected, the acceleration at the start is completely determined by the first three control points, and the acceleration at the end is determined by the last three control points.

Let's define $d_{i} = b_{i + 1} - b_{i}$ as shorthand for the delta between consecutive control points, the vector of the $i$ th edge of the Bézier control polygon. With this notation, the acceleration formulas bear a striking resemblance to the velocity formulas:

Acceleration of a cubic Bézier curve at the endpoints, in terms of the delta between consecutive control points

\begin{aligned} a (0) & = 6 b_{0} - 12 b_{1} + 6 b_{2} = 6 b_{0} - 6 b_{1} - 6 b_{1} + 6 b_{2} \\ = 6 ((b_{2} - b_{1}) - (b_{1} - b_{0})) \\ (13.36) & = 6 (d_{1} - d_{0}), \\ a (1) & = 6 b_{1} - 12 b_{2} + 6 b_{3} = 6 b_{1} - 6 b_{2} - 6 b_{2} + 6 b_{3} \\ = 6 ((b_{3} - b_{2}) - (b_{2} - b_{1})) \\ (13.37) & = 6 (d_{2} - d_{1}) . \end{aligned}

The above discussion applies to Bézier curves of any degree. In general, the pattern is this: if we move control point $b_{i}$ , we affect the $i$ th derivative and higher at the start of the curve, but not lower-numbered derivatives. (Similar statements apply at the end of the curve, regarding control point $b_{i}$ and the derivative $n - i$ and higher.) Of course, for a cubic spline that's just about the end of the story, since we cannot move any control point without potentially changing the third derivative at every point on the spline, since the third derivative is constant for a cubic, and all higher derivatives are zero. We come back to these ideas in Section 13.8.1 when we talk about the continuity conditions of two or more Bézier curve segments joined in a spline.

13.5Subdivision

Beginning with Section 13.6, this chapter addresses the topic of joining together curves into a spline, which we can make as long and as complex as we want. Before we do that, this section considers the opposite problem: how to take a curve and chop it up into smaller pieces.

Why would we ever want to do this? There are a couple of reasons.

Curve refinement. In the process of designing a curve interactively, we may find that we almost have the shape we want, but one curve can't quite give us the flexibility that we need. So we cut the curve into two pieces (forming a spline), which gives us greater flexibility.
Approximation techniques. Another reason to subdivide a curve is that a piece of a curve is generally simpler than the whole curve, where “simpler” means “more like a straight line.” So we can cut it into a sufficiently large number of pieces, and then do something with those pieces as if they were straight line segments, such as render them or raytrace them. In this way, we can approximate the result we would get if we were able to render or raytrace the curve analytically.
Strictly speaking, we don't need subdivision to do piecewise linear approximation—we already discussed one simple technique that evaluates the curve at fixed-size intervals and draws lines between those sample points. But subdivision allows us to choose the number of line segments adaptively by using fewer line segments on the straighter parts of the curve and more line segments on the curvier parts.

So that's the “why” of curve subdivision. Before we learn the “how,” let's be a bit more precise about the “what.” Consider a parametric polynomial curve $P$ defined by the function $p (t)$ , adopting the usual conventions that the curve starts at $t = 0$ and ends at $t = 1$ . Now consider a segment $Q$ that starts at an arbitrary time $t = a$ and ends at $t = b$ . This is illustrated in Figure 13.15.

Figure 13.15Extracting a segment of a curve by using subdivision

The goal of subdivision is a mathematical description for $Q$ in some form (monomial, Hermite, or Bézier). But don't we already have that? After all, we assume that have a mathematical description of $P$ in some form, and so it's perfectly valid to define $Q$ by saying, “Take the curve defined by $P$ , but instead of starting at 0 and ending at 1, start at $a$ and end at $b$ .” That's not really what we want. We want $Q$ to be a fully independent and “regular” curve that makes no reference to $P$ , not subordinate or qualified in some way. For example, if we are using Bézier form, then we want new Bézier control points that define $Q$ .

The following sections present two different methods for subdividing curves. Section 13.5.1 presents a straightforward algebraic approach in monomial form. Section 13.5.2 considers Bézier curve subdivision, which is geometrically based and lends itself towards rather elegant and efficient implementations.

Hermite form doesn't lend itself naturally to subdivision. If we wish to subdivide a Hermite form, we first convert the curve to another form (probably Bézier) and do the subdivision in that form.

13.5.1Subdividing Curves in Monomial Form

Extracting a segment from a curve in monomial form is a straightforward algebraic task. Remember that monomial form is just an explicit polynomial on $t$ . Although we are typically interested only in the part where $0 \leq t \leq 1$ , the polynomial is defined for all values of $t$ and so it actually defines an infinite curve. The smaller segment $Q$ that we wish to extract is just a different subsection of the same infinite curve.

With this in mind, we realize that the problem of subdivision can easily be viewed as a simple problem of reparameterization. Rather than trying to muck directly with the monomial coefficients, we perform some algebra on the parameter value. Let's introduce a local parameter $s$ that varies from 0 to 1 as $q (s)$ traces out the curve $Q$ . Given this, we can define the curve $q (s)$ in terms of $p (t)$ as

\begin{aligned} t & = F (s), & q (s) & = p (t) = p (F (s)), \end{aligned}

where the function $F (s)$ is our reparameterization function that returns the global parameter $t$ corresponding to the local parameter $s$ . It's not too hard to see what form $F$ should be, since we wish to satisfy the endpoint conditions $F (0) = a$ and $F (1) = b$ . Adopting a straightforward linear relation between $t$ and $s$ yields

t = F (s) = a + s (b - a) .

You might want to verify that this does behave correctly at the endpoints.

Of course, all we have really accomplished is to define $Q$ in terms of $P$ , which is precisely what we said was not sufficient at the start of this section. The difference is that if we continue working through the math, and substitute for $p (t)$ and eliminate $t$ , we can get a direct equation for $q (s)$ , which is a “regular” and independent curve satisfying the goals outlined at the start of this section.

However, the ensuing algebra gruntwork produces a messy result without revealing any insight. The main thing we wish to communicate here is that subdivision of a curve in monomial form is a simple matter of reparameterization, which can be accomplished algebraically. Furthermore, because we can convert between monomial forms and other forms, we now have a surefire method for subdividing any polynomial curve in any format.

But we need not be satisfied with this “brute force” approach; as it turns out, in Bézier\ form, we can do better.

13.5.2Subdividing Curves in Bézier Form

Subdivision of a Bézier curve can be done geometrically through a variant of the de Casteljau algorithm. The full algorithm of extracting any subsection for arbitrary endpoint parameters $a$ and $b$ is not immediately grasped, so we follow Farin [1] and start off with a simple case.

We begin by restricting ourselves to extracting only the “left side” of a curve. In other words, we fix $a = 0$ . Clearly, the first Bézier control point on the smaller curve (at $s = 0$ ) is the same as the first control point on the larger curve (at $t = 0$ ). Equally clear is that the endpoint at $t = b$ is obtainable by the basic de Casteljau algorithm from Section 13.4.1. An example situation with $b = 0.75$ is illustrated in Figure 13.16.

Figure 13.16 Locating the interior endpoint using the de Casteljau algorithm

We have the endpoints—now for those tricky interior points. Surprisingly, if you look closely at Figure 13.16, you'll notice that we already constructed them! As it turns out, each round of de Casteljau interpolation produces one of our Bézier control points. Figure 13.17 makes this clearer, showing the selected Bézier points and the control polygon.

Figure 13.17 The de Casteljau algorithm gives us all the Bézier control points of the extracted curve segment

Why does this work? Recall the relationship between the Bézier form and the Hermite form from Section 13.4.3. The first interior control point $b_{1}$ completely determines the first derivative (the velocity) at $t = 0$ . Now, the subcurve that we are extracting is part of the same infinite curve, and thus its position and derivatives match everywhere, in a geometric sense. However, the derivative is a rate of change relative to the rate of change of the parameter. By subdividing, we have made the parameter $t$ move “faster,” since it goes from 0 to 1 over a smaller spatial interval. Thus, the derivative of the subcurve is in the same direction, but it is shorter according to the fraction of the curve that we are extracting, in our case the value $b$ .

Let's summarize our findings. To extract the left half of a curve, $0 \leq t \leq b$ , we perform de Casteljau subdivision as if we were trying to locate the endpoint at $t = b$ . The first control point from each round of interpolation gives us another control point for our subdivided curve. Extracting the right half of a curve is analogous, so we won't go into detail here.

There is one important special case of Bézier subdivision that we can do armed only with what we know so far: subdividing a curve “in half” at $t = 1 / 2$ . This computation makes possible rather elegant recursive algorithms for adaptive subdivision. Let's use our standard notation $b_{i}$ for the original Bézier control points. For the two halves, we pick two letters at random and call the control points for the left and right halves of the curve $q_{i}$ and $r_{i}$ , respectively. The seven control points are given by

Subdividing a Bézier curve at

t = 1 / 2

\begin{aligned} q_{0} & = b_{0}, \\ q_{1} & = b_{0} / 2 + b_{1} / 2, \\ q_{2} & = b_{0} / 4 + b_{1} / 2 + b_{2} / 4, \\ q_{3} = r_{0} & = b_{0} / 8 + 3 b_{1} / 8 + 3 b_{2} / 8 + b_{3} / 8, \\ r_{1} & = b_{1} / 4 + b_{2} / 2 + b_{3} / 4, \end{aligned}

\begin{aligned} r_{2} & = b_{2} / 2 + b_{3} / 2, \\ r_{3} & = b_{3} . \end{aligned}

The general case is obtained through blossoming, which is a general term referring to a number of techniques involving repeated de Casteljau steps taken with different interpolation fractions. To determine each control point, we take three de Casteljau steps (for a cubic curve, at least). For each control point $b_{i}$ we take $i$ of those steps using $t = b$ , and the rest using $t = a$ . As it turns out, it doesn't matter which of the interpolation steps use $a$ and which use $b$ , but the number of steps using $a$ or $b$ is important. Let's consider each point on the cubic curve to make this clear. To compute $b_{0}$ , at each round we use $t = a$ as the interpolation fraction. For $b_{1}$ , we use $t = a$ for two of the rounds, and $t = b$ for one round. To calculate $b_{2}$ , we use $t = a$ for the interpolation fraction in only one round, and $t = b$ for the other two. And of course, for the last control point $b_{3}$ , we use $t = b$ for all three rounds, exactly as we described at the start of this section.

13.6Splines

So far we have been focusing on cubic curves, and for good reason; they are the most commonly used type of curves in 3D. Such curves inherently have four degrees of freedom, whether we are using Bézier curves with four control points, monomial curves with four coefficients, or Hermite curves with two ending points plus two derivatives. Because there are only four degrees of freedom, the set of curves that can be represented by using only the techniques of cubic curves is sharply limited.

Additional freedom is obtained by joining smaller curves together in a spline, which is the subject of the remainder of this chapter. Before we discuss splines, let's pause for a moment to discuss one potential alternative: using a higher degree polynomial. Obviously any degree $n$ curve can be converted to a degree $n + 1$ curve; such a conversion is known as degree elevation. In monomial form, of course, this is trivial, we just add a new leading coefficient of zero.

In Bézier form, degree elevation adds a new control point and, as you might have guessed, the positions of the new control points can be constructed geometrically by using linear interpolation. Given a curve of degree $n$ , which has $n + 1$ control points denoted $b_{i}$ , degree elevation produces a degree $n + 1$ curve with $n + 2$ control points, denoted $b_{j}^{'}$ . To determine these new control points, we linearly interpolate using an interpolation fraction proportional to the index of the control point:

Degree elevation in Bézier form

\begin{aligned} (13.38) & b_{j}^{'} = \frac{j}{n + 1} b_{j - 1} + (1 - \frac{j}{n + 1}) b_{j}, 0 \leq j \leq n + 1. \end{aligned}

(Note that the computation of $b_{j}^{'}$ will “blend” the nonexistent point $b_{- 1}^{'}$ with a weight of zero.)

For Hermite curves, we usually are interested only in odd values for $n$ , so that we have the same number of derivatives at each endpoint.

A higher degree polynomial has the ability to describe a curve with more “wiggles,” but, unfortunately, in general it suffers from several shortcomings:

The curve has global support. Each control point exerts some nonzero weight on every point along the curve, with the exception of the endpoints.
The curve has extraneous “wiggles” that sometimes show up in places we don't want, oscillating back and forth between the control points. This is known as the Runge17 phenomenon.
Somewhat related to the extra wiggles is the fact that higher degree curves are very sensitive. Due to the curve's global support, a change to any one of the control points will result in a change over the entire curve; due to the high sensitivity, this response can be very large.
Having ruled out polynomial interpolation as a viable curve design tool, we cannot directly specify a point that we want the curve to interpolate, other than the endpoints.

The basic problem is that we are asking too much from a single polynomial. Splines do not have these shortcomings.

Here's what's in store. First, to facilitate the discussion, we must expand our notation and introduce a level of indirection between the local and global parameterization, which we do in Sections 13.6.1 and 13.6.2. Then, in Section 13.7, we talk about Hermite and Bézier splines, which are used in many software packages, such as Adobe Photoshop and Autodesk 3DS Max. From there, our focus naturally gravitates towards deciding what to do at the “seams.” The first hurdle is to define the criteria that must be met so that the curve is smooth at these junction points. Such continuity conditions are the subject of Section 13.8. Once we understand these issues, we will have finally reached our goal set at the start of this chapter, a spline system that provides an intuitive means to define a curved shape.

Having developed a flexible design tool where the user can specify the position and tangent at each control point, Section 13.9 then investigates methods by which the designer need specify only the positions of the control points, and the tangents are computed automatically based on a set of intuitive user controls.

13.6.1Rules of the Game

Our spline is composed of $n$ segments, denoted $q_{0}$ , $q_{1}$ , …, $q_{n - 1}$ . The $i$ th segment $q_{i}$ is a function that accepts a local parameter, named $s_{i}$ , which is normalized to vary from 0 to 1 over the length of the segment. In other words, for each segment there is a curve function $q_{i} (s_{i})$ exactly like the ones we studied in the first part of this chapter; the only differences are the cosmetic renaming of the function from $p$ to $q_{i}$ and the argument from $t$ to $s_{i}$ .

We use two different notations to refer to the entire spline. One way is to just drop the subscripts from the notation above, so the function $q (s)$ refers to the entire spline, and the parameter $s$ (without subscript) is a global parameter. As $s$ varies from 0 to $n$ , the function $q (s)$ traces out the entire spline.

The composite function $q (s)$ is very simple. Basically we take the integer portion of $s$ to get the index $i$ , describing which segment we are on, and then the fractional portion is used as $s_{i}$ and plugged into the segment $q_{i}$ . So the first segment $q_{0} (s_{0})$ defines the spline on the interval between $q (0)$ and $q (1)$ , the second segment defines the spline from $q (1)$ to $q (2)$ , and so on. More formally,

A composite curve with a simple global parameterization

\begin{aligned} i & = ⌊ s ⌋, & (select segment by using the floor function) \\ s_{i} & = s - i, & (calculate local parameter) \\ q (s) & = q_{i} (s_{i}) . & (evaluate segment) \end{aligned}

Note that, given a particular value for $s$ , we can unambiguously identify the point $q (s)$ along the spline. However, a particular value of $s_{i}$ is meaningful only within the context of segment $i$ ; this is emphasized by the subscript.

If we are not concerned with the timing of our curve, then this notation may be all we need. However, when defining an animation path, we usually need a level of indirection. We introduce the notation $p (t)$ to refer to the final curve, a function that returns our position at a given “time” $t$ . It's just a different parameterization of the same curve; $p (t)$ and $q (s)$ trace out the same shape, but the $s$ and $t$ values for a particular point along the path are usually not be the same. We can parameterize the curve so that some sections are traversed quickly and others more slowly. The range of $s$ is fixed by the number of knots, but we are free to assign the range of $t$ , the total duration of the curve, to anything we wish.

In general, we can define $p (t)$ in terms of $q (s)$ by creating a function that maps a time value $t$ to a parameter value $s$ . When we want to be explicit that $s$ is a function of $t$ , we use the notation $s (t)$ , and this function is called the time-to-parameter function. If you're a computer programmer, you can think of $p (t)$ as the public interface, and $q (s)$ as an internal implementation detail. We are engaging in a fundamental practice of computer science: breaking down complexity by introducing a level of indirection.

With the above notation established, the basic game plan for evaluating $p (t)$ is as follows:

Map the time value $t$ into a value of $s$ by evaluating the time-to-parameter function $s (t)$ .
Extract the integer portion of $s$ as $i$ , and the fractional portion as $s_{i}$ .
Evaluate the curve segment $q_{i} (s_{i})$ .

Of course, if we don't care about the timing of the spline (perhaps we only care about its shape), then we have no need of the first step, and we can just use the trivial mapping of $s (t) = t$ . Unfortunately, due to space constraints, this is precisely what we're going to do in this book. We don't discuss the subtleties of dealing with the timing.

With the assumption for now that $s = t$ , the first step is trivial. The second step is also easy, and we devoted the first part of this chapter to the third step. So we really already know how to evaluate a spline; let's look at how we might create one.

13.6.2Knots

Think about the juncture between two segments. For the curve to be continuous, clearly the ending point of one segment must be coincident with the starting point of the next segment. (Section 13.8 addresses additional desirable criteria.) These shared control points that are interpolated by the spline are called the knots of the spline. The knot at index $i$ is denoted $k_{i}$ , and since there is one more knot than the number of segments, the knots are numbered $k_{0} \dots k_{n}$ .

We assume that the segments are connected at the knots. In other words, $q (s)$ passes through the knots at integer values of $s$ . With this assumption, there's no need for separate notation (or separate storage space in a computer program) for the beginning point and ending point of each segment. Instead, each interior knot $k_{i}$ serves a dual role as the starting point of segment $q_{i}$ and the ending point of segment $q_{i - 1}$ . Thus, we establish the following relations:

\begin{aligned} q (i) & = k_{i}, & q_{i} (0) & = k_{i}, & q_{i} (1) & = k_{i + 1} . \end{aligned}

Note that $k_{i}$ specifies a single point, whereas the notation $q_{i}$ refers to an entire segment, which is a function of a local parameter $s_{i}$ that yields a point. All of this notation is depicted in Figure 13.18.

Figure 13.18 A spline with

n

segments has

n + 1

knots, named

k_{0} \dots k_{n}

In animation contexts, the knots are sometimes called keys. This is a reference to the old-school animation methods where a master animator would create the key frames, or frames where the characters reached important poses. The in-between frames would be filled in by a less experienced (and less expensive) apprentice. In computer animation, a key can be any position, orientation, or other piece of data whose value at a particular time is specified by a human animator (or any other source). The role of the apprentice to “fill in the missing frames” is played by the animation program, using interpolation methods such as the ones being discussed in this chapter. As we've noted before, most of the early research on splines was aimed at defining static shapes, not animated trajectories, and so the term “knot” is more prevalent.

13.7Hermite and Bézier Splines

A spline is made by patching together curve segments so that they fit together smoothly. What sorts of curve segments? For reasons that will soon become apparent, it is most convenient for us to use the Hermite representation for the individual segments. When we say convenient “for us,” we mean the people writing the code for an animation system or carrying out the mathematical discussion in the following sections. When it comes to depicting or manipulating splines graphically, the Bézier form is typically preferred. Of course, the Hermite and Bézier forms are closely related, and it is easy to convert between the two forms. If you don't remember this relationship, we review it in just a moment.

Remember that a Hermite curve segment is defined by its starting and ending positions and velocities. When we were focused on a single segment, we denoted the positions by $p_{0}$ and $p_{1}$ , and the velocities by $v_{0}$ and $v_{1}$ . In the context of a spline, we use a notation organized around a knot rather than a segment. For positions, we don't use the $p$ s because, as we've said earlier, the knot $k_{i}$ , which is the starting position of the segment $q_{i} (0)$ , also serves as the ending position of the previous segment at $q_{i - 1} (1)$ . For velocities, the notation $v_{i}^{o u t}$ refers to the outgoing velocity at knot $i$ and defines the starting velocity for the segment $q_{i}$ . Likewise, the incoming velocity from the left side of $k_{i}$ is denoted $v_{i}^{i n}$ and defines the ending velocity of the previous segment $q_{i - 1}$ . We also refer to these velocity vectors as tangents.

Figure 13.19 shows a spline with five Hermite segments. All of the knots, segments, and tangents are labeled according to the notation just described.

Figure 13.19Our notation for splines with segments in Hermite form

Be warned that the tangents in Figure 13.19—and all the figures of Hermite curves in this chapter—are drawn at one-third scale. Officially we'd like to tell you that this was done so that the diagrams would be smaller and this book would consume less of the Earth's natural resources. A more accurate reason is that we draw the tangents at one-third length so the tangents will be the same as the edges of the Bézier control polygon. Matching the Bézier control polygon has some educational benefits, but, more importantly, it facilitates laziness on the part of the authors: the tools we used to create the curves in the diagrams are based on Bézier splines.

The splines in the diagrams in this book were created in Adobe Photoshop by making a path and then “stroking” the path. The arrows for the tangent vectors were drawn by putting one end at a knot and the other end at the “handle” used to control the shape of the curve, which is essentially the same as the Bézier control point. (Photoshop calls the knots the “anchor points” and refers to the interior Bézier control points that are not interpolated as “control points.”)

For example, Figure 13.20 is a screen capture taken while one author was hard at work creating Figure 13.19. (The opacity of the actual figure has been decreased to make it easier to see the Photoshop controls.)

Figure 13.20Creating Figure 13.19 with Adobe Photoshop.

While we're on the subject of Bézier curves, let's take this opportunity to introduce the notation we use for Bézier splines. When we were dealing with only a single Bézier segment, we referred to the $i$ th control point on that segment as $b_{i}$ . Here we use the notation $f_{i}$ to refer to the control point “in front” of the $i$ th knot, and $a_{i}$ for the control point “after” it.18 This notation is illustrated in Figure 13.21.

Figure 13.21 A spline with its Bézier control polygon, and the notation we use for Bézier splines

The important relationship between Hermite and Bézier forms was introduced in Section 13.4.3. Let's restate it here in the newly-introduced spline notation:

Converting between Bézier and Hermite forms

\begin{aligned} v_{i}^{i n} & = 3 (k_{i} - f_{i}), & f_{i} & = k_{i} - v_{i}^{i n} / 3, \\ v_{i}^{o u t} & = 3 (a_{i} - k_{i}), & a_{i} & = k_{i} + v_{i}^{o u t} / 3. \end{aligned}

13.8Continuity

For a few sections now we've been promising to tell you how you can piece together segments into a spline such that they fit together smoothly. All this lead-up may have given the impression that it's a mysterious secret. But if you take a closer look at Figure 13.19, you'll see that the criterion is relatively obvious: if the incoming and outgoing velocity vectors are equal at a knot, as they are at $k_{1}$ and also $k_{2}$ , then the curve will be smooth. Notice that at $k_{3}$ , the tangents are not equal, and the curve has a kink in it. Pretty obvious, eh? Actually, as it turns out, there's quite a bit more to say on this subject.

Consider the curve near $k_{4}$ in Figure 13.19. Notice that the curve is “smooth,” yet the incoming velocity vector $v_{4}^{i n}$ is much longer than $v_{4}^{o u t}$ . Now, you might be thinking, “That curve isn't smooth there! If you were traveling along the curve, you would slam on the brakes just as you crossed the key.” But take the tangent vectors out of the diagram and just look at the shape of the curve. It's a smooth shape, right? We're back to a recurring theme: animation paths are more “demanding” than static shapes. (Notice that in the objection you just raised, you used animation-oriented terminology when you said “key” instead of “knot.” You're really catching on fast!)

Speaking of smooth animations, we just said that the curve is smooth at $k_{1}$ and $k_{2}$ . But is it? We can see that the shape is smooth, but we've just pointed out how there is a difference between a smooth shape and a smooth animation. In general, we cannot tell if the animation is smooth without knowing more about the time-to-parameter function $s (t)$ . If the shape is not smooth, the animation will not be smooth (with one exception to be discussed momentarily). But even if the shape is smooth, discontinuities in $s (t)$ can result in discontinuities in the animation. When $s (t) = t$ , no discontinuities are introduced by this trivial mapping, so if the tangents are equal, the motion will be smooth.

Finally, consider a knot for which the incoming and outgoing velocities are both zero. In this case, even though the tangents are continuous, most people would agree that the shape is not smooth at this knot. What about the motion? Is the motion smooth when we come to a complete stop and then accelerate away in a potentially different direction? That will depend on your needs.

It looks like the answer to the question “Is it smooth?” is a bit fuzzy. This is a mathematics book, and it's really bad form to be putting quotation marks around vague words such as “smooth.” We really need some more precise terminology. In the context of curves, the most important smoothness criteria are parametric continuity and the closely related geometric continuity. Let's look at each of these in turn, starting with parametric continuity, which is easier to define mathematically.

13.8.1Parametric Continuity

A curve is said to have $C^{n}$ continuity if its first $n$ derivatives are continuous. A $C^{0}$ curve is one in which the position (the “0th derivative”) is continuous. $C^{0}$ continuity means that we can draw a shape on a piece of paper in one stroke without lifting our pencil, or we can move along an animation path without “teleporting.”19 A $C^{1}$ curve has a continuous first derivative, which means the velocity doesn't jump instantaneously. This doesn't mean the velocity cannot change rapidly, but it never jumps from a velocity at one instant to a different velocity at the next instant without passing through velocities in between. For example, the curve in Figure 13.19 forms one connected line, so it is $C^{0}$ continuous everywhere. It is $C^{1}$ continuous everywhere except at $k_{3}$ and $k_{4}$ , where the velocity jumps suddenly.

Higher numbers for $n$ just mean the curve's higher-order derivatives are continuous. A curve is $C^{2}$ if its second derivative (acceleration) is continuous. Continuity conditions beyond $C^{1}$ are not that important for our purposes in this book. The lack of $C^{1}$ continuity (a sudden change in velocity) corresponds to an infinite acceleration, and this can create many problems. If the path is used to control a physical object, such as a robot or cutting tool, then we are asking for the motors driving the object to do something that is physically impossible. Even if the animation is taking place entirely inside of the virtual world of a computer, when such paths are observed by humans, they are usually perceived as “jerky.” Thus it's usually desirable to avoid (or at least control) velocity discontinuities. In contrast, a sudden change in acceleration does not create such a jarring sensation and for most purposes is perfectly acceptable.

Any individual polynomial curve segment by itself has $C^{\infty}$ continuity, since we can take the derivative of a polynomial as many times as we want and we always get a real-valued, continuous function. (Eventually, the derivatives become the constant zero function.) This is why the question of continuity didn't arise earlier in the chapter—the only places we have to worry about continuity are at the knots.

One last comment regarding higher derivatives. When we say that a curve is $C^{n}$ continuous, this implies continuity for all lower derivatives as well. For example, if the acceleration is continuous, then the velocity and position must also be continuous. A discontinuity in a function means that the function's derivative is undefined where the discontinuity occurs.

Now that we've discussed parametric continuity informally, let's define the criteria mathematically for Hermite and Bézier curves. To do so, we make use of some observations concerning the derivatives of Bézier curves from Section 13.4.3; our findings from that section are summarized here.

The $n$ th derivative at an endpoint of a Bézier curve segment is completely determined by the endpoint and the nearest $n$ control points.
The velocity at an endpoint is proportional to the vector between the endpoint and the adjacent control point (Equations (13.30) and (13.31)).
The acceleration at an endpoint is proportional to the difference of the delta vectors along the nearest two segments of the control polygon (Equations (13.36) and (13.37)).

Let's start with $C^{0}$ , which is a no-brainer due to our choice of notation. In our scheme, the ending point of one segment is the same as the starting point of the next segment by definition. Moving on to $C^{1}$ continuity, we've said that it occurs when the tangents are equal at a key. This translates directly to Hermite form as

C^{1}

continuity condition for Hermite splines

v_{i}^{i n} = v_{i}^{o u t},

and with just a little effort we can also express it in Bézier form as

C^{1}

continuity condition for cubic Bézier splines

\begin{matrix} (13.8.1) & k_{i} - f_{i} = a_{i} - k_{i} . \end{matrix}

With a quick application of algebra, we see that geometrically this means that the knot is at the midpoint of the line between $f_{i}$ and $a_{i}$ :

\begin{aligned} k_{i} - f_{i} & = a_{i} - k_{i}, \\ 2 k_{i} & = f_{i} + a_{i}, \\ k_{i} & = (f_{i} + a_{i}) / 2. \end{aligned}

Most curve design tools will automatically enforce this rule for you. For example, when you move a control point in Photoshop, it automatically moves the opposing control point like a seesaw, and if you pull the control point away from the anchor point (the knot), the opposing control point will mirror your movements to maintain the $C^{1}$ continuity relationship. (If you want to force a corner in the curve, you can hold a modifier key to tell Photoshop not to do this).

Now let's look at $C^{2}$ continuity. It's is easier to visualize in Bézier form than Hermite. We just need to apply what we learned in Section 13.4.3 to make the ending acceleration of one segment (the left side of the equations below) match the starting acceleration of the next segment (on the right side):

C^{2}

continuity condition for cubic Bézier splines

\begin{aligned} 6 a_{i - 1} - 12 f_{i} + 6 k_{i} & = 6 k_{i} - 12 a_{i} + 6 f_{i + 1}, \\ a_{i - 1} - 2 f_{i} + k_{i} & = k_{i} - 2 a_{i} + f_{i + 1} \\ 2 f_{i} - a_{i - 1}, & = 2 a_{i} - f_{i + 1}, \\ f_{i} + (f_{i} - a_{i - 1}) & = a_{i} + (a_{i} - f_{i + 1}) . \end{aligned}

The geometric interpretation of this is as follows: Take the two Bézier control polygon segments that are not direct neighbors of the knot, but one segment away, and “double” them. If they meet at a common point, the curve is $C^{2}$ continuous. To visualize this, compare the two Bézier curves in Figure 13.22. Both have $C^{1}$ continuity, since the knot $k_{i}$ is on the midpoint of the line between $f_{i}$ and $a_{i}$ for both curves. However, the top curve is $C^{2}$ continuous because the extensions of the neighboring control polygon lines meet at the common point; the curve on the bottom is not $C^{2}$ continuous.

Figure 13.22Continuity conditions for cubic Bézier splines.

13.8.2Geometric Continuity

Geometric continuity is a broader criterion of continuity. Different authors use different definitions for geometric continuity, but a very general one is that a curve has $G^{n}$ continuity if there exists some way to parameterize the curve such that the curve has $C^{n}$ continuity. Let's look at an example.

In Figure 13.19 the curve is not $C^{1}$ continuous at $k_{4}$ because the tangents are not equal. However, the curve is $G^{1}$ continuous at this location. The hint, of course, is that the tangents are parallel at the knot. If the tangents at a knot are not parallel, then there's no way to move along the curve in a smooth way. However, if the tangents are parallel, then the discontinuity is purely a change in speed, not a change in direction. We could remove this discontinuity by carefully introducing an offsetting discontinuity in the time-to-parameter function $s (t)$ that exactly “undoes” the jump in speed.

Higher-order geometric continuity extends this idea, although it is a bit more difficult to visualize. We say that a curve is $G^{2}$ continuous if its curvature changes continuously.

13.8.3How Smooth Can a Curve Be?

We end our discussion on continuity by asking an important question: what's the highest level of continuity we can expect from a polynomial spline? We said earlier that any particular curve segment has $C^{\infty}$ continuity, because we can differentiate it as many times as we want and the result is always a continuous function. Can we achieve this same level of smoothness with a spline?

Consider two adjacent cubic Bézier segments. Let's fix the first segment and consider what happens to the second segment as we demand higher and higher levels of continuity at the knot. When we demand $C^{0}$ continuity, we lock in the first Bézier control point. Clearly, the first endpoint must match the last endpoint of the first segment for the spline to be $C^{0}$ continuous.

What about $C^{1}$ continuity? Remember that the velocity at an endpoint is completely determined by the endpoint and the adjacent control point. This means if we want to match the velocity, we are locking in the position of the second control point as well.

Continuing this pattern, we see that for a Bézier segment to match $n$ derivatives requires us to “lock in” $n + 1$ control points. For a cubic curve, if we ask for $C^{4}$ continuity or higher, we can get it, but only by making every segment be a piece of the same infinite polynomial. We have gained continuity, but we have lost the flexibility that was the very reason we used splines in the first place!

The bottom line is that, practically speaking, a polynomial curve of degree $n$ (a Bézier curve with $n + 1$ control points) can really achieve only $C^{n - 1}$ continuity. For example, a piecewise linear (degree 1) polynomial can only achieve $C^{0}$ continuity. We can make a curve which is connected, but with straight lines, we cannot match the tangents. A quadratic (degree 2) polynomial can match tangents ( $C^{1}$ ), but not accelerations. A cubic curve, the type of curve we have been focusing on in this book, can achieve $C^{2}$ continuity by reducing the number of degrees of freedom per segment to one. Continuity beyond $C^{2}$ can be achieved only by eliminating all degrees of freedom (other than the curve timing), and setting each segment to be a section of the same polynomial.

13.9Automatic Tangent Control

At the start of this chapter, we began our investigation into curves with the plan of defining a curve just by listing points that we wanted the curve to pass through. We tried basic polynomial interpolation in Section 13.2, but found that it didn't give us what we wanted. We then developed the Bézier forms, which require the user to specify two endpoints, which are interpolated, and two (in the case of a cubic Bézier) interior control points, which are not interpolated but instead define the derivatives at the endpoints. So far in this chapter, we've learned how to piece together those Bézier segments in a smooth spline.

This section investigates various methods whereby a spline can be determined by just the knots, without the need for the user to specify any additional criteria. This is useful to generate a curve that looks “natural” and passes through some points, or any other time we wish to smoothly interpolate some data points.

For the moment, let's ignore the first and last knots and focus our attention on the interior knots. The problem at hand is to compute an appropriate $v_{i}^{i n}$ and $v_{i}^{o u t}$ using only the positions of the knots. Notice that we are posing the problem in Hermite form, which turns out to be the easiest form to use for this problem. The situation is depicted in Figure 13.23, which shows three control points and three different choices we could use for the tangents.

Figure 13.23 Three different choices of tangents for the middle knot, leading to three different interpolating splines

The following sections discuss a family of techniques that can be used to pick tangents that result in “good” interpolating splines. First, Section 13.9.1 discuss the Catmull-Rom spline, which is a simple and straightforward technique. Then Section 13.9.2 considers TCB splines, a generalization of the Catmull-Rom form and a hybrid that exposes additional “sliders” to the user to adjust the shape of the curve in a (hopefully) more intuitive manner without resorting to direct geometric specification of the tangents. Finally, Section 13.9.3 lists a few options for dealing with the endpoints.

When reading the following sections, keep in mind that all of these splines are still Hermite splines. We are just introducing various techniques for autocalculating the tangents. Once the tangents have been determined, the spline is no different than any other Hermite spline.

13.9.1Catmull-Rom Splines

Looking at Figure 13.23, it seems obvious which of the three choices of tangents is the most natural: the one in the middle. Why is this? The vector from the previous knot $k_{i - 1}$ to the next knot $k_{i + 1}$ is a horizontal line, and therefore it makes sense that our tangents should be horizontal. So it looks like one heuristic we could use to pick good tangents would be to make the tangents at a knot be parallel to the line between the previous and next knot. (Note that our example is slightly contrived in that the middle knot happens to be halfway between its neighbors, which is a special case. However, the fact that the neighbors lie on a horizontal line is not a special case, since we can always rotate our perspective to view the points in this configuration.)

But how long should the tangents be? Perhaps we should again use the vector between the previous and next knots as our guide. It seems as though the farther apart our neighbors are, the larger the curve, and so making our tangents be a constant multiple of this vector would be a good idea. In other words, we would set $v_{i}^{i n} = v_{i}^{o u t} = a (k_{i + 1} - k_{i - 1})$ . But what should we use for the value of $a$ ?

One way would be just to experiment and find a nice round number that seems to give results that are aesthetically pleasing. The constant $a = 1 / 2$ is a nice round number and works moderately well, so let's go with that. Figure 13.24 shows a spline loop generated by this technique.

Figure 13.24A Catmull-Rom spline

Although $a = 1 / 2$ gives “medium” results, there's definitely an argument to be made that it is a matter of preference. Sometimes we want a “tighter” curve, which would correspond to smaller values of $a$ , and sometimes we want a “looser” curve. This is a good idea, but let's put it on ice for a moment to say two more quick things about the method we've stumbled upon.

First, let's give a formal definition and name to this technique. A spline with the tangents derived according to the relation

Tangent computation for the Catmull-Rom spline and its Bézier control polygon

\begin{matrix} (13.39) & v_{i}^{i n} = v_{i}^{o u t} = \frac{k_{i + 1} - k_{i - 1}}{2} \end{matrix}

is known as a Catmull-Rom spline. The name comes from the two people who invented it, one of whom is Edwin Catmull (1945–). He later went on to become the president of Walt Disney Animation Studios and Pixar Animation Studios.

The other thing we'd like to discuss is an alternative way to derive Equation (13.39). Just a bit of algebraic manipulation yields

Catmull-Rom spline as average of adjacent delta vectors

\begin{aligned} v_{i}^{i n} = v_{i}^{o u t} & = \frac{k_{i + 1} - k_{i - 1}}{2} \\ = \frac{k_{i + 1} - k_{i} + k_{i} - k_{i - 1}}{2} \\ = \frac{(k_{i + 1} - k_{i}) + (k_{i} - k_{i - 1})}{2} . \end{aligned}

The geometric interpretation of the last line states that to compute a tangent at a knot, we take the two neighboring difference vectors of the control polygon and average them.

13.9.2TCB Splines

Section 13.9.1 showed that the tangent at a knot can be computed by multiplying the vectors of the adjacent edges of the control polygon by an appropriate constant, which we called $a$ , and adding the result. By varying $a$ , we had an intuitive “dial” we could turn to adjust the shape of the curve. We can generalize this idea further by having not just one scaling factor, but two. In other words, we can take an arbitrary linear combination of the adjacent edge vectors. Taking the straightforward approach of assigning one “dial” for each of the two scale factors doesn't quite work out as an intuitive system. Instead, a standard technique is to provide three intuitive dials, known as tension, continuity, and bias, and derive the two scale factors from these dials. A spline with the tangents thus derived is known as Kochanek-Bartels spline, often called a TCB spline for obvious reasons.20

Kochanek and Bartels [3] designed the equations so that if we turn all three dials to zero, we get the standard Catmull-Rom curve. The typical useful range for all of the parameters is $[- 1, + 1]$ , although there's no problem in going outside this range. Thus, you can think of each setting as a way to start with a Catmull-Rom curve and tweak it in a particular direction. First, let's show how each of these settings could be implemented by itself, and then let's present the full formulas that combine all three settings together.

Figure 13.25A TCB spline with different values for continuity.

The tension setting is related to the $a$ value we discovered in the previous section. We the symbol $t$ to refer to tension, and luckily there won't be any situations where this will be confused with the other meaning of $t$ , the time parameter. Like all the TCB settings, a value of $t = 0$ corresponds to the regular Catmull-Rom curve. As we increase the tension, the curve “tightens”—essentially the same effect we got by decreasing the value of $a$ in the previous section. Figure 13.25 shows the effect of the tension parameter. In each curve, the continuity and bias values are zero. Compare this with the standard Catmull-Rom curve in Figure 13.24, corresponding to $t = 0$ .

Note that $t = 1$ results in $v_{i}^{i n} = v_{i}^{o u t} = 0$ , causing the velocity to come to a stop at the knot, creating a cusp in the shape. If we increase $t$ further, the velocities point in the “wrong direction,” which creates a loop at the knots. At the other extreme, the value $t = - 1$ results in a curve that is “twice as loose” as a Catmull-Rom curve. There's nothing special about this particular value; you can easily make the curve even looser by making $t$ more negative.

We incorporate tension into the Catmul-Rom tangent formula as follows:

Catmull-Rom formula extended to allow tension adjustments

\begin{aligned} v_{i}^{i n} = v_{i}^{o u t} & = \frac{(1 - t) (k_{i + 1} - k_{i - 1})}{2} \\ = \frac{(1 - t)}{2} (k_{i} - k_{i - 1}) + \frac{(1 - t)}{2} (k_{i + 1} - k_{i}) . \end{aligned}

Next let's turn to the continuity setting, which can be used to break the smoothness of the curve and force a corner at the knot. The value of zero will result in equal tangent (no matter what values for tension and bias are used), thus ensuring $C^{2}$ parametric continuity, as discussed in Section 13.8.1. As we decrease the continuity value, each tangent begins to turn towards its adjacent knot. At $c = - 1$ , each tangent will point directly to the neighboring knot, causing the “spline” to be composed of linear segments. Figure 13.26 illustrates the effect that different continuity values have on the spline.

One important observation to note is that setting $c = - 1$ appears to have an effect on the shape of the curve similar to that of $t = 1$ ; both result in segments that are shaped like straight line segments. However, they are very different when viewed from an animation perspective. A spline with 100%tension comes to a stop at each key, and reaches a maximum value in the middle of the segment. (This is the Hermite smoothstep velocity profile, observable in the nonuniform spacing of the dots in each segment.) Notice that Bézier control points for the $t = 1$ spline in Figure 13.25 are not visible as they are coincident with the knots. Compare this to the $c = - 1$ spline in Figure 13.26, where the Bézier control points are spaced equally along each linear segment. We observed earlier that this produces a curve with constant velocity, as evidenced by the equal spacing of the smaller black dots used to draw the curve.

Figure 13.26A TCB spline with different values for continuity.

Figure 13.27A TCB spline with different values for bias.

The math behind TCB continuity is written as

Catmull-Rom formula extended to allow continuity adjustments

\begin{aligned} v_{i}^{i n} & = \frac{(1 - c)}{2} (k_{i} - k_{i - 1}) + \frac{(1 + c)}{2} (k_{i + 1} - k_{i}), \\ v_{i}^{o u t} & = \frac{(1 + c)}{2} (k_{i} - k_{i - 1}) + \frac{(1 - c)}{2} (k_{i + 1} - k_{i}) . \end{aligned}

Finally, the bias argument can be used to turn the tangents towards one or the other adjacent knots, rather than being parallel to the line between the adjacent knots, as the Catmull-Rom curve does. Consider a sequence of three knots. A negative bias causes the curve to “anticipate” the third knot, turning the curve in the direction of the third knot a bit before the middle knot is reached. In contrast, a positive bias value causes the curve to wait to make the turn towards the third knot, causing some “overshoot” through the middle knot. Figure 13.27 shows our example spline with several different bias values.

The bias value works by scaling the relative weights that the two control polygon edges have on the resultant tangent:

Catmull-Rom formula extended to allow bias adjustments

\begin{aligned} v_{i}^{i n} = v_{i}^{o u t} & = \frac{(1 + b)}{2} (k_{i} - k_{i - 1}) + \frac{(1 - b)}{2} (k_{i + 1} - k_{i}) . \end{aligned}

The equations presented thus far have isolated each setting to make it easier to understand the math behind each one. Now let's put all three settings together:

Computing Tangents for TCB Splines

\begin{aligned} v_{i}^{i n} & = \frac{(1 - t) (1 + b) (1 - c)}{2} (k_{i} - k_{i - 1}) + \frac{(1 - t) (1 - b) (1 + c)}{2} (k_{i + 1} - k_{i}), \\ v_{i}^{o u t} & = \frac{(1 - t) (1 + b) (1 + c)}{2} (k_{i} - k_{i - 1}) + \frac{(1 - t) (1 - b) (1 - c)}{2} (k_{i + 1} - k_{i}) . \end{aligned}

One last note. The examples in this section used the same values at each knot in the spline, but that need not be the case. The TCB values are often adjusted on a per-knot basis.

13.9.3Endpoint Conditions

The Catmull-Rom methods rely on the previous and next knots to compute the tangent at a given knot. What should we do at an endpoint when there is no “previous” or “next” knot? Several solutions to this problem have been proposed.

One obvious answer would be to just throw our hands in the air and set the tangent to zero at an endpoint. While this seems like surrendering before the first shot is fired, it actually can be a good choice if the spline is to be used for animation, since it's often natural to want the object being animated to start and end “at rest.”

Another idea is to create extra knots $k_{- 1}$ and $k_{n + 1}$ , which are used for tangent computations but are not interpolated. Where should we place these so-called phantom points? One idea is to duplicate the neighboring endpoint, which produces zero tangents and is equivalent to the “surrender” spline of the previous paragraph. Another idea is simply to ask the user to place the phantom point. When this method is used, the spline is known as a Cardinal spline.

One final method is to fit the first (or last) three knots to a quadratic, and use the endpoint tangent of this curve. The curve fitting is an example of polynomial interpolation and can thus be done by using the techniques from earlier in this chapter, such as Aitken's algorithm.

Exercises

Compute the Lagrange basis polynomials for the knot sequence $t_{1} = 0$ , $t_{2} = 1$ , $t_{3} = 2$ .
The motion of a projectile (see Section 11.6) can be described by the quadratic function
$p (t) = p_{0} + t v_{0} + t^{2} (a / 2),$
where $p_{0}$ is the initial position, $v_{0}$ is the initial velocity, and $a$ is the constant acceleration (typically due to gravity).
Imagine you want to animate the path of a projectile—say, a herring sandwich. Assume you are working in our standard 3D coordinate space (see Section 1.3.4) and the object is launched from the origin, reaches a maximum at $t = 1$ when its position is $p (1) = (0, h, d / 2)$ , and finally lands at $t = 2$ at the position $p (2) = (0, 0, d)$ . Derive an expression for $p (t)$ in monomial form, in terms of the variables $h$ and $d$ .
Consider the Bézier curve in the figure below.
1. (a)Use de Casteljau to determine the position on the curve at $t = 0.40$ .
2. (b)Convert the curve to Hermite form.
3. (c)Convert the curve to monomial form.
4. (d)Check your work on part (a) by substituting $t = 0.40$ into the polynomial computed in part (c).
5. (e)What is the velocity polynomial function $v (t)$ ?
6. (f)What is the velocity at $t = 0.40$ , $t = 0.00$ , and $t = 1.00$ ?
Prove that the quadratic Bernstein basis polynomials sum to 1 for any value of $t$ .
Where should we put the Bézier control points to get a “constant curve” where $p (t)$ always returns the same point?
Where should we put the Bézier control points to get a linear “curve,” which is a straight line segment with constant velocity?
Where should we put the Bézier control points to get a straight line shape, but this time the velocity of the curve follows the smoothstep pattern: it starts at zero, accelerates to a maximum velocity at the middle, and then decelerates to end with zero velocity?
Describe the motion of a particle that moves along the Bézier curve where $b_{0} = b_{2}$ and $b_{1} = b_{3}$ .
Consider the projectile herring sandwich from Exercise 2. Assume you need to animate this sandwich, and the only tools available to you are cubic Bézier\ curves. Where should you put the four Bézier control points to get physically realistic motion, which is quadratic? Don't worry about the total duration the sandwich is airborne; consider only the shape of the trajectory.
To plot the shape of the parabola in Figure 12.8 , the authors tabulated a list of $x, y$ image-space coordinates of the center of the mass of the board, and then did a least-squares fit to arrive at the equation for the parabola $y = - 0.364 x^{2} + 1.145 x + 2.110$ . The pen tool in Adobe Illustrator, which was used to draw the parabola, is based on cubic Bézier curves. The starting and ending $x$ -coordinates for our curve were $- 0.9683$ and 4.2253, respectively. What were the $(x, y)$ coordinates for all four control points?
Returning to the curve in Exercise 3:
1. (a)Compute the Bézier control points for the segment of the curve from 0.2 to 0.5.
2. (b)Split this curve into two halves at $t = 1 / 2$ . What are the Bézier control points of the curve on each side?
3. (c)Perform degree elevation on this curve to the quartic case. What are the five control points?

My curves are not crazy.

— Henri Matisse (1869–1954)

References

[1] Gerald Farin. Curves and Surfaces for Computer Aided Geometric Design: A Practical Guide, Second edition. Boston: Academic Press, 1990.

[2] Donald E. Knuth. The Art of Computer Programming, Volume 1: Fundamental Algorithms, Third edition. Reading, MA: Addison-Wesley Longman, 1997.

[3] Doris H. U. Kochanek and Richard H. Bartels. “Interpolating Splines with Local Tension, Continuity, and Bias Control.” SIGGRAPH Comput. Graph. 18:3 (1984), 33–41.

[4] David F. Rogers. An Introduction to NURBS: With Historical Perspective. New York: Academic Press, 2001.

This is not intended as a comment on a certain Australian children's musical group, but may be misinterpreted as such.
Aitken's Al Gore rhythm, if you will.
Don't try this excuse with your professor, but it's been known to work in job interviews.
We're talking about real linear algebra, not the geometry-focused subset of it we study in this book.
This type of matrix, in which each row or column is a geometric series of the powers of some term, is known as a Vandermonde matrix, after the French mathematician Alexandre-Théophile Vandermonde (1735–1796).
Although they are named for Joseph Louis Lagrange (1736–1813), Lagrange basis polynomials were discovered in 1779 by Edward Waring (1736–1798). It may be interesting to some readers that Lagrange is Ian Parberry's PhD adviser's PhD adviser's,…, PhD adviser back 10 iterations.
It's important to pronounce the name of this French mathematician “luh-GRAWNGE”. Otherwise, people might think you are talking about the small Texas town of La Grange (pronounced “luh-GRAYNGE”). To the authors' knowledge, La Grange, Texas is not the namesake of any basis polynomials, although ZZ Top did name a song after the town in honor of a nearby brothel.
He's another French guy, and his mother probably pronounced his name “air-MEET.” But many English speakers, even some we know with PhDs, pronounce it “HUR-mite,” so you can probably do the same.
If you're one of those purists who objects to the idea of “blending” points with vectors (see Section 2.4), don't worry. It's possible to interpret the equations such that the offensive comingling does not occur.
Well, just some of that is going to change—we hope your reading will still be enlightening. You know what we mean.
See, we told you a lot of these guys were French! By the way, it's pronounced “BEZ-ee-ay.”
“Rate of exchange,” if you will pardon the pun.
Yep, he's French, too, and that means you'd better pronounce his name correctly: “duh CAS-tul-jho.” He worked for Renault's rival, Citroen.
Yes, he was French, too. He appears in Ian Parberry's PhD adviser tree somewhat off to the left back 16 generations.
In addition to his triangle, Pascal has an SI unit of pressure, a law, a programming language, and a wager named after him, although the latter two are no longer in serious use.
Russian, not French.
Pronounced “RUN-guh.”
Note that by using knot-centric notation and assigning different letters to the control points (based on handy mnemonic memory aids!), we are locking in the degree of the segments to cubic. In other sources you'll find notation such as $b_{i}^{j}$ to refer to the $i$ th point on segment $j$ , or just refer to all the points on the polygon as $b_{i}$ , where the knots are $b_{0}$ , $b_{3}$ , $b_{7}$ . This notation has the advantage of being more general, but to read it requires more mental effort—something we definitely want to minimize.
Oops, there are the quotation marks that we just said were bad form in a math book!
The most important for us is that TCB is easier to pronounce than koh-CHAN-ick.