GraphPad Curve Fitting Guide

Centered polynomial equations

Centered polynomial equations

Previous topic Next topic No expanding text in this topic  

Centered polynomial equations

Previous topic Next topic JavaScript is required for expanding text JavaScript is required for the print function Mail us feedback on this topic!  

Included in Prism, are both a set of ordinary polynomial equations and also a set of centered polynomial equations. For example, when you look in the list of polynomials you'll see both 'Second order polynomial' and 'Centered second order polynomial'. We recommend always choosing one of the centered equations instead of an ordinary polynomial equation. This page explains why.

What's wrong with ordinary polynomial models?

The standard polynomial models look like this:

Y= B0 + B1*X +B2*X^2

More terms are included with the higher order equations.

There are two problems with polynomial fits:

When the X values are large, and start well above zero (for example, when  X is a calendar year), taking the very large X values to large powers can lead to math overflow. Even if the program doesn't report any math error, the results can be inaccurate. Some coefficients will be positive and some negative, so the value of Y depends on subtracting huge numbers from other huge numbers, leading to imprecise results.

Even when the X values are not large, the parameters of the model are intertwined, so have high covariance and dependency. This results in large standard errors, wide confidence intervals, and very wide confidence or prediction bands. In many cases, this problem is severe enough that Prism reports that the results are 'ambiguous' and so doesn't report confidence intervals for all the parameters and can't graph confidence bands.

What are centered polynomial models?

Both problems go away when the X values are centered. The idea of centering is to subtract the mean X from all X values before fitting the model. This can be done as part of nonlinear regression,  using this model:

XC = X - Xmean

Y= B0 + B1*XC +B2*XC^2

Here XC is the centered X value, equal to the X value minus Xmean, which  is the mean of all X values. In other words, XC is the distance of any X value from the mean of all X values.  Xmean is constant, and not a parameter that Prism tries to fit.  Of course, you can include more terms in the definition of Y to create higher order polynomial equations.

The advantages of centered models

Fitting the centered model leads to exactly the same curve (unless the regular  approach led to math errors). Accordingly, the sum-of-squares is the same, as are results of model comparisons.

However, the centered equation has reparameterized the model. The parameters have different meanings, so have different best-fit values (except the first parameter which is the same), different standard errors and confidence intervals, smaller covariances and dependencies, and tighter confidence/prediction bands.

How centered models are implemented in Prism

You can fit data to a built-in centered polynomial equation without knowing how Prism implements the model. If you are curious, read on.

The built-in set of centered polynomial equations, written as shown above, constrain the parameter XMean to equal the mean of X value by constraining it to equal a  "Data set constant (= Mean X)".

If you open a file using centered polynomial regression in an  version of Prism prior to 5.02 or 5.0b, that constraint will be lost, and centered polynomial regression won't work.