Orthogonal Polynomials
The Orthogonal Polynomials tab generates orthogonal polynomial columns from a numeric column. With the raw polynomial basis $x, x^2, \ldots, x^d$, correlations between columns grow extreme as the degree increases, causing coefficients and fitted values to lose significant digits. Using orthogonal polynomial columns as predictors in Linear Regression reduces the condition number of the design matrix to approximately 1, improving numerical precision.
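As a rough illustration of why the raw basis is troublesome, the following sketch computes the correlation between adjacent raw power columns for a small made-up sample. The data and the `correlation` helper are assumptions for illustration only and are not part of the application.

```python
import math

def correlation(a, b):
    """Pearson correlation of two equal-length sequences."""
    mean_a = math.fsum(a) / len(a)
    mean_b = math.fsum(b) / len(b)
    dev_a = [v - mean_a for v in a]
    dev_b = [v - mean_b for v in b]
    cov = math.fsum(p * q for p, q in zip(dev_a, dev_b))
    var_a = math.fsum(p * p for p in dev_a)
    var_b = math.fsum(q * q for q in dev_b)
    return cov / math.sqrt(var_a * var_b)

# Hypothetical source column: 20 evenly spaced values from 1.0 to 10.5.
x = [1.0 + 0.5 * i for i in range(20)]

# Correlation between the raw columns x^k and x^(k+1) for k = 1..7.
correlations = []
for k in range(1, 8):
    r = correlation([v ** k for v in x], [v ** (k + 1) for v in x])
    correlations.append(r)
    print(f"corr(x^{k}, x^{k + 1}) = {r:.6f}")
```

On data like this the printed correlations move closer to 1 as the power increases, which is the near-collinearity that orthogonal polynomial columns are meant to remove.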
Basic Usage
Opening Orthogonal Polynomials
Select Data > Orthogonal Polynomials... from the menu bar to open a new Orthogonal Polynomials tab.
Generating Columns
- Select the target dataset from the Dataset dropdown
- Select the numeric column to transform in Source column
- Set the maximum polynomial degree in Degree (1 to 30). The degree must be less than the number of valid data points in the source column (rows after excluding null, NaN, and Infinity)
- Click Preview to inspect the result
- Enter a name for the output dataset in Output Name
- Click Save as Dataset
The original dataset is not modified. A new derived dataset is created. Rows with null, NaN, or Infinity in the source column are excluded from the derived dataset. For the remaining rows, all original columns are retained and poly_1, poly_2, ..., poly_{degree} columns are appended. The output dataset may have fewer rows than the original.
Each orthogonal polynomial column is normalized to a fixed scale determined by $n$, the number of valid data points.
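The row exclusion and the degree rule described above can be sketched as follows. The function names `valid_rows` and `check_degree` and the list-of-dicts dataset layout are hypothetical and only mirror the documented behavior; the application's internal implementation may differ.

```python
import math

def valid_rows(rows, source):
    """Keep only rows whose source value is a finite number (no null, NaN, Infinity)."""
    kept = []
    for row in rows:
        value = row.get(source)
        is_number = isinstance(value, (int, float)) and not isinstance(value, bool)
        if is_number and math.isfinite(value):
            kept.append(row)
    return kept

def check_degree(degree, n_valid):
    """Apply the documented limits: 1 to 30, and less than the number of valid points."""
    if not 1 <= degree <= 30:
        raise ValueError("degree must be between 1 and 30")
    if degree >= n_valid:
        raise ValueError("degree must be less than the number of valid data points")

# Made-up dataset with three unusable values in the source column "x".
rows = [
    {"x": 1.0, "y": 2.1},
    {"x": None, "y": 3.0},
    {"x": float("nan"), "y": 4.2},
    {"x": 2.5, "y": 5.9},
    {"x": float("inf"), "y": 7.7},
    {"x": 4.0, "y": 9.8},
]

kept = valid_rows(rows, "x")
check_degree(2, len(kept))          # passes: 2 < 3 valid rows
print(len(rows), "->", len(kept))   # 6 -> 3
```

With only three valid rows, a requested degree of 3 would be rejected, which is why the maximum usable degree depends on the data and not only on the 1 to 30 range.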

Polynomial Regression Workflow
To use orthogonal polynomials instead of the raw polynomial basis:
- In the Orthogonal Polynomials tab, generate degree-$d$ polynomial columns from the x column and save the dataset
- Open a Linear Regression tab and select the saved derived dataset
- Set y as the response variable
- Set poly_1, poly_2, ..., poly_d as explanatory variables
R-squared, residual SD, fitted values, and prediction intervals are identical to those from raw polynomial regression. The coefficients are expressed in the orthogonal polynomial basis and differ in both value and interpretation from the raw polynomial basis coefficients. Each poly_j coefficient represents how much the $j$-th orthogonal polynomial component contributes to the response variable. Because the basis is orthogonal, the $t$-test for each coefficient is independent of the others. If the $p$-value of the highest-degree poly_d is large, there is little evidence that the degree-$d$ component contributes to the model.
To choose the polynomial degree, run regressions with different degrees and compare AIC or Adj. R-squared in Linear Regression. Orthogonal polynomials improve numerical precision but do not prevent overfitting.
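The sketch below illustrates both points on made-up data: with an orthogonal basis each coefficient can be computed on its own, so lower-degree coefficients do not change when a higher degree is added, and adjusted R-squared can be compared across degrees. It builds the columns with a three-term recurrence and leaves them unscaled, which is an assumption; the tab's own algorithm and normalization are not reproduced here, so coefficient values will differ from the application's by a scale factor. All function names are hypothetical.

```python
import math

def dot(a, b):
    return math.fsum(p * q for p, q in zip(a, b))

def orthogonal_columns(x, degree):
    """Orthogonal polynomial columns of degree 1..degree evaluated at x.

    Three-term recurrence; columns are NOT rescaled (assumption).
    """
    n = len(x)
    prev, cur, prev_norm = [0.0] * n, [1.0] * n, None
    columns = []
    for _ in range(degree):
        norm = dot(cur, cur)
        alpha = dot([xi * c for xi, c in zip(x, cur)], cur) / norm
        beta = 0.0 if prev_norm is None else norm / prev_norm
        nxt = [(xi - alpha) * c - beta * p for xi, c, p in zip(x, cur, prev)]
        prev, cur, prev_norm = cur, nxt, norm
        columns.append(nxt)
    return columns

def fit(columns, y):
    """Least-squares fit with intercept on mutually orthogonal, zero-mean columns."""
    n = len(y)
    intercept = math.fsum(y) / n
    # Orthogonality lets each coefficient be computed without the others.
    coefs = [dot(col, y) / dot(col, col) for col in columns]
    fitted = [intercept + math.fsum(c * col[i] for c, col in zip(coefs, columns))
              for i in range(n)]
    rss = math.fsum((yi - fi) ** 2 for yi, fi in zip(y, fitted))
    return coefs, rss

def adjusted_r2(rss, tss, n, k):
    """Adjusted R-squared for a model with k predictors plus an intercept."""
    return 1.0 - (rss / (n - k - 1)) / (tss / (n - 1))

# Made-up data: a quadratic trend plus a small deterministic wiggle.
x = [float(i) for i in range(15)]
y = [2.0 + 0.5 * xi - 0.1 * xi ** 2 + 0.3 * math.sin(3.0 * xi) for xi in x]
mean_y = math.fsum(y) / len(y)
tss = math.fsum((yi - mean_y) ** 2 for yi in y)

results = {}
for degree in range(1, 6):
    coefs, rss = fit(orthogonal_columns(x, degree), y)
    results[degree] = (coefs, rss, adjusted_r2(rss, tss, len(x), degree))
    print(f"degree {degree}: adj. R-squared = {results[degree][2]:.4f}")
```

In this sketch the first two coefficients of the degree-5 fit match those of the degree-2 fit, while the residual sum of squares can only shrink as the degree grows. That is why a penalized criterion such as AIC or adjusted R-squared, rather than raw R-squared, is the sensible basis for choosing the degree.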
Next steps
- Linear Regression - Regression analysis with orthogonal polynomial columns
See also
- Numerical Computing Fundamentals - How condition numbers affect accuracy
- Numerical Accuracy - NIST StRD benchmark accuracy verification
- Dummy Coding - Encoding categorical variables as dummy variables