Obtaining posterior covariance of GPR using fitrgp and predictExactWithCov

Question

Heein Park on 22 May 2024

0
Link

Direct link to this question

https://uk.mathworks.com/matlabcentral/answers/2121296-obtaining-posterior-covariance-of-gpr-using-fitrgp-and-predictexactwithcov

Edited: Walter Roberson on 30 Jan 2025

winequality-red.csv

Hello, thank you for reading this question. I have a question about using matlab's (maybe undocumented) function predictExactWithCov.

I have found in this forum that if I have a RegressionGP gprmdl, set of test points xTest, then by using

[postMeanGPR,postCovGPR,confInt] = predictExactWithCov(gprmdl.Impl,xTest,0.05);
Unable to resolve the name 'gprmdl.Impl'.

I can obtain an posterior mean of gprmdl at xTest (postMeanGPR), posterior covariance matrix at xTest (postCovGPR), and 95% confidence interval based on covariance matrix (confInt).

On the other hand, I can calculate posterior mean and variance using

(*)

where zero prior is assumed, kis a kernel, and so on.

However, using the kernel information from gprmdl like

sig_f = gprmdl.KernelInformation.KernelParameters(12);
sig_l = gprmdl.KernelInformation.KernelParameters(1:11)';
lambda = gprmdl.Sigma;

and calculating posterior mean and covariance using equation (*), I can obtain a same mean as postMeanGPR but posterior variance was quite different with postCovGPR's diagonal elements.

Can anyone explain why does such a discrepency occur?

Thanks.

PS: the overall code is attached below

clear;
clc;
wineTable = readtable('winequality-red.csv'); % wine data from cortez et al 2009., file attached
% 1st-11th columns are inputs and 12th column is output
wineData = table2array(wineTable);
len_tot = length(wineData);
len_train = 1000;
len_test = len_tot - len_train;
DataTrain = zeros(len_train,12);
DataTest = zeros(len_test,12);
NN = 1;
rng(NN);
%% STEP 1. Load data and separate it into training and test sets
p = sort(randperm(len_tot,len_test));
jj = 1; kk = 1;
for ii = 1:len_tot
    if ismember(ii,p','rows')
        DataTest(jj,:) = wineData(ii,:);
        jj = jj + 1;
    else
    DataTrain(kk,:) = wineData(ii,:);
    kk = kk + 1;
    end
end
%% STEP 2. Calculate GP posterior mean and covariance using predictExactWithCov
xTrain = DataTrain(:,1:11);
yTrain = DataTrain(:,12);
xTest = DataTest(:,1:11);
yTest = DataTest(:,12);
gprmdl = fitrgp(xTrain,yTrain,'KernelFunction','ardsquaredexponential','BasisFunction','none');
[postMeanGPR,postCovGPR,confInt] = predictExactWithCov(gprmdl.Impl,xTest,0.05);
%% STEP 3. Calculate GP posterior mean and variance using equation (*)
sig_f = gprmdl.KernelInformation.KernelParameters(12);
sig_l = gprmdl.KernelInformation.KernelParameters(1:11)';
lambda = gprmdl.Sigma;
Gram = Kmat(xTrain,sig_f,sig_l);
KinvY = (Gram + lambda^2*eye(len_train))\yTrain;
postMeanGPR2 = zeros(len_test,1);
postVarGPR2 = postMeanGPR2;
for ii = 1:len_test
    postMeanGPR2(ii) = kCol(xTest(ii,:),xTrain,sig_f,sig_l)'*KinvY;
    postVarGPR2(ii) = SE(xTest(ii,:),xTest(ii,:),sig_f,sig_l) - kCol(xTest(ii,:),xTrain,sig_f,sig_l)'*((Gram + lambda^2*eye(len_train))\kCol(xTest(ii,:),xTrain,sig_f,sig_l));
end
function val = SE(x1, x2, sig_f, sig_l)
% x1, x2 in R^d, row vectors
% hyperparameter sig_l: length scale, length(sig_l) = d, sig_f > 0
Lambda = diag(sig_l);
val = sig_f^2 * exp(-0.5*( (x1 - x2) /Lambda^2 * (x1 - x2)' ) );
end
function k = kCol(xTest, xTrain, sig_f, sig_l)
% xTest: Test point, row vector
% xTrain: Training points, each row corresponds to one training point
N = length(xTrain);
k = zeros(N,1);
for ii = 1:N
    k(ii) = SE(xTest, xTrain(ii,:), sig_f, sig_l);
end    
end
function K = Kmat(X, sig_f, sig_l)
% each row of X corresponds to one data point of Gram matrix K
n = length(X);
K = zeros(n,n);
for ii = 1:n
    for jj = 1:n
        K(ii,jj) = SE(X(ii,:),X(jj,:), sig_f, sig_l);
    end
end
end

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Sign in to answer this question.

Answer 1

UDAYA PEDDIRAJU on 26 Jul 2024

0
Link

Direct link to this answer

https://uk.mathworks.com/matlabcentral/answers/2121296-obtaining-posterior-covariance-of-gpr-using-fitrgp-and-predictexactwithcov#answer_1491031

Hi Heein,

Dcoumentation for "predictExactWithCov" can be found by right clicking on it and selecting help, you can also see the implementational details of the function by opening it. The discrepancy in posterior covariance calculations between "predictExactWithCov" and your manual implementation likely arises from differences in implementation.

To resolve this:

Check Kernel Function Implementation: Accurately replicate the kernel function used in "fitrgp". Minor differences can significantly impact results.
Inspect Hyperparameters: Verify that hyperparameters extracted from "gprmdl.KernelInformation.KernelParameters" match your manual calculations.
Debug Matrix Operations: Carefully review matrix operations in your manual code for errors.
Simplify and Compare: Start with a smaller dataset and simpler kernel to isolate the issue. Compare intermediate results step-by-step.

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Answer 2

Giacomo on 30 Jan 2025

0
Link

Direct link to this answer

https://uk.mathworks.com/matlabcentral/answers/2121296-obtaining-posterior-covariance-of-gpr-using-fitrgp-and-predictexactwithcov#answer_1558650

Edited: Walter Roberson on 30 Jan 2025

Open in MATLAB Online

Hi Heen,

you just need to add the output noise variance to this line:

postVarGPR2(ii) = SE(xTest(ii,:),xTest(ii,:),sig_f,sig_l) - kCol(xTest(ii,:),xTrain,sig_f,sig_l)'*((Gram + lambda^2*eye(len_train))\kCol(xTest(ii,:),xTrain,sig_f,sig_l)) + lambda^2;

If you now compare the diagonal of postVarGPR against postVarGPR2, you will see that they are exactly equal. You are computing correctly the covariance of the latent function f, while the predict function in MATLAB is computing the covariance in the output sample space (so the output noise has to be added).

0 Comments
Show -2 older commentsHide -2 older comments

Sign in to comment.

Obtaining posterior covariance of GPR using fitrgp and predictExactWithCov

0 Comments
Show -2 older commentsHide -2 older comments

Answers (2)

0 Comments
Show -2 older commentsHide -2 older comments

0 Comments
Show -2 older commentsHide -2 older comments

See Also

Categories

Tags

Community Treasure Hunt

Obtaining posterior covariance of GPR using fitrgp and predictExactWithCov

0 Comments Show -2 older commentsHide -2 older comments

Answers (2)

0 Comments Show -2 older commentsHide -2 older comments

0 Comments Show -2 older commentsHide -2 older comments

See Also

Categories

Tags

Community Treasure Hunt

0 Comments
Show -2 older commentsHide -2 older comments

0 Comments
Show -2 older commentsHide -2 older comments

0 Comments
Show -2 older commentsHide -2 older comments