plotResiduals

Class: GeneralizedLinearMixedModel

Plot residuals of generalized linear mixed-effects model

expand all in page

Syntax

plotResiduals(glme,plottype)

plotResiduals(glme,plottype,ResidualType=residualtype)

plotResiduals(ax,___)

h = plotResiduals(___)

Description

plotResiduals(glme,plottype) plots the raw conditional residuals of the generalized linear mixed-effects model glme in a plot of the type specified by plottype.

plotResiduals(glme,plottype,ResidualType=residualtype) specifies the type of residuals to plot.

example

plotResiduals(ax,___) plots into the axes specified by ax instead of the current axes (gca) using any of the input argument combinations in the previous syntaxes. (since R2024a)

h = plotResiduals(___) returns a handle, h, to the lines or patches in the plot of residuals.

Input Arguments

expand all

`glme` — Generalized linear mixed-effects model
`GeneralizedLinearMixedModel` object

Generalized linear mixed-effects model, specified as a GeneralizedLinearMixedModel object. For properties and methods of this object, see GeneralizedLinearMixedModel.

`plottype` — Type of residual plot
`"histogram"` (default) | `"caseorder"` | `"fitted"` | `"lagged"` | `"probability"` | `"observed"` | `"symmetry"`

Type of residual plot, specified as one of the following.

Value	Description
`"histogram"`	Histogram of residuals
`"caseorder"`	Residuals versus case order. Case order is the same as the row order used in the input data `tbl` when fitting the model using `fitglme`.
`"fitted"`	Residuals versus fitted values
`"lagged"`	Residuals versus lagged residual (r(t) versus r(t – 1))
`"probability"`	Normal probability plot
`"observed"`	Observed vs. fitted values. This plot includes a dotted reference line of y = x. Each residual is represented by the vertical distance from the corresponding observed value to the reference line.
`"symmetry"`	Symmetry plot

Example: plotResiduals(glme,"lagged")

`residualtype` — Residual type
`"raw"` (default) | `"Pearson"`

Residual type, specified as one of the following.

Residual Type Formula

Residual Type	Formula
`"raw"`	$r_{c i} = y_{i} - g^{- 1} (x_{i}^{T} \hat{β} + z_{i}^{T} \hat{b} + δ_{i})$
`"Pearson"`	$r_{c i}^{p e a r s o n} = \frac{r_{c i}}{\sqrt{\frac{\hat{σ^{2}}}{w_{i}} v_{i} (μ_{i} (\hat{β}, \hat{b}))}}$

"raw"

$r_{c i} = y_{i} - g^{- 1} (x_{i}^{T} \hat{β} + z_{i}^{T} \hat{b} + δ_{i})$

"Pearson"

$r_{c i}^{p e a r s o n} = \frac{r_{c i}}{\sqrt{\frac{\hat{σ^{2}}}{w_{i}} v_{i} (μ_{i} (\hat{β}, \hat{b}))}}$

In each of these equations:

y_i is the ith element of the n-by-1 response vector y, where i = 1, ..., n.
g^-1 is the inverse link function for the model.
x_i^T is the ith row of the fixed-effects design matrix X.
z_i^T is the ith row of the random-effects design matrix Z.
δ_i is the ith offset value.
σ² is the dispersion parameter.
w_i is the ith observation weight.
v_i is the variance term for the ith observation.
μ_i is the mean of the response for the ith observation.
$\hat{β}$ and $\hat{b}$ are estimated values of β and b.

Raw residuals from a generalized linear mixed-effects model have nonconstant variance. Pearson residuals are expected to have an approximately constant variance, and are generally used for analysis.

Data Types: char | string

`ax` — Target axes
`Axes` object

Since R2024a

Target axes, specified as an Axes object. If you do not specify the axes, then plotResiduals uses the current axes (gca).

Output Arguments

expand all

`h` — Handle to residual plot
graphics object

Handle to the residual plot, returned as a graphics object. You can use dot notation to change certain property values of the object, including face color for a histogram, and marker style and color for a scatterplot. For more information, see Access Property Values.

Examples

expand all

Create Plots of Residuals

Open Live Script

Load the sample data.

load mfr

This simulated data is from a manufacturing company that operates 50 factories across the world, with each factory running a batch process to create a finished product. The company wants to decrease the number of defects in each batch, so it developed a new manufacturing process. To test the effectiveness of the new process, the company selected 20 of its factories at random to participate in an experiment: Ten factories implemented the new process, while the other ten continued to run the old process. In each of the 20 factories, the company ran five batches (for a total of 100 batches) and recorded the following data:

Flag to indicate whether the batch used the new process (newprocess)
Processing time for each batch, in hours (time)
Temperature of the batch, in degrees Celsius (temp)
Categorical variable indicating the supplier (A, B, or C) of the chemical used in the batch (supplier)
Number of defects in the batch (defects)

The data also includes time_dev and temp_dev, which represent the absolute deviation of time and temperature, respectively, from the process standard of 3 hours at 20 degrees Celsius.

Fit a generalized linear mixed-effects model using newprocess, time_dev, temp_dev, and supplier as fixed-effects predictors. Include a random-effects term for intercept grouped by factory, to account for quality differences that might exist due to factory-specific variations. The response variable defects has a Poisson distribution, and the appropriate link function for this model is log. Use the Laplace fit method to estimate the coefficients. Specify the dummy variable encoding as "effects", so the dummy variable coefficients sum to 0.

The number of defects can be modeled using a Poisson distribution:

${defects}_{i j} \sim Poisson (μ_{i j})$

This corresponds to the generalized linear mixed-effects model

$\log (μ_{i j}) = β_{0} + β_{1} {newprocess}_{i j} + β_{2} {time_dev}_{i j} + β_{3} {temp_dev}_{i j} + β_{4} {supplier_C}_{i j} + β_{5} {supplier_B}_{i j} + b_{i},$

where

${defects}_{i j}$ is the number of defects observed in the batch produced by factory $i$ during batch $j$ .
$μ_{i j}$ is the mean number of defects corresponding to factory $i$ (where $i = 1, 2, . . ., 20$ ) during batch $j$ (where $j = 1, 2, . . ., 5$ ).
${newprocess}_{i j}$ , ${time_dev}_{i j}$ , and ${temp_dev}_{i j}$ are the measurements for each variable that correspond to factory $i$ during batch $j$ . For example, ${newprocess}_{i j}$ indicates whether the batch produced by factory $i$ during batch $j$ used the new process.
${supplier_C}_{i j}$ and ${supplier_B}_{i j}$ are dummy variables that use effects (sum-to-zero) coding to indicate whether company C or B, respectively, supplied the process chemicals for the batch produced by factory $i$ during batch $j$ .
$b_{i} \sim N (0, σ_{b}^{2})$ is a random-effects intercept for each factory $i$ that accounts for factory-specific variation in quality.

glme = fitglme(mfr,"defects ~ 1 + newprocess + time_dev + temp_dev + supplier + (1|factory)",Distribution="Poisson",Link="log",FitMethod="Laplace",DummyVarCoding="effects");

Create diagnostic plots using Pearson residuals to test the model assumptions.

Plot a histogram to visually confirm that the mean of the Pearson residuals is equal to 0. If the model is correct, we expect the Pearson residuals to be centered at 0.

plotResiduals(glme,"histogram",ResidualType="Pearson")

Figure contains an axes object. The axes object with title Histogram of residuals, xlabel Residuals, ylabel Probability density contains an object of type patch.

The histogram shows that the Pearson residuals are centered at 0.

Plot the Pearson residuals versus the fitted values, to check for signs of nonconstant variance among the residuals (heteroscedasticity). We expect the conditional Pearson residuals to have a constant variance. Therefore, a plot of conditional Pearson residuals versus conditional fitted values should not reveal any systematic dependence on the conditional fitted values.

plotResiduals(glme,"fitted",ResidualType="Pearson")

Figure contains an axes object. The axes object with title Plot of residuals vs. fitted values, xlabel Fitted values, ylabel Residuals contains 2 objects of type line. One or more of the lines displays its values using only markers

The plot does not show a systematic dependence on the fitted values, so there are no signs of nonconstant variance among the residuals.

Plot the Pearson residuals versus lagged residuals, to check for correlation among the residuals. The conditional independence assumption in GLME implies that the conditional Pearson residuals are approximately uncorrelated.

plotResiduals(glme,"lagged",ResidualType="Pearson")

Figure contains an axes object. The axes object with title Plot of residuals vs. lagged residuals, xlabel Residual(t-1), ylabel Residual(t) contains 3 objects of type line. One or more of the lines displays its values using only markers

There is no pattern to the plot, so there are no signs of correlation among the residuals.

Version History

expand all

R2024b: Plot observed versus fitted values

You can now plot observed versus fitted values by specifying the plottype input argument as "observed".

R2024a: Specify target axes

Specify the target axes for the plot by using the ax input argument.

plotResiduals

Syntax

Description

Input Arguments

glme — Generalized linear mixed-effects model GeneralizedLinearMixedModel object

plottype — Type of residual plot "histogram" (default) | "caseorder" | "fitted" | "lagged" | "probability" | "observed" | "symmetry"

residualtype — Residual type "raw" (default) | "Pearson"

ax — Target axes Axes object

Output Arguments

h — Handle to residual plot graphics object

Examples

Create Plots of Residuals

Version History

R2024b: Plot observed versus fitted values

R2024a: Specify target axes

See Also

`glme` — Generalized linear mixed-effects model
`GeneralizedLinearMixedModel` object

`plottype` — Type of residual plot
`"histogram"` (default) | `"caseorder"` | `"fitted"` | `"lagged"` | `"probability"` | `"observed"` | `"symmetry"`

`residualtype` — Residual type
`"raw"` (default) | `"Pearson"`

`ax` — Target axes
`Axes` object

`h` — Handle to residual plot
graphics object